BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (497 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_P31447 Uncharacterized sulfatase yidJ n=52 Tax=Enteroba... 1046 0.0
UniRef50_C9L4Q0 Putative sulfatase YidJ n=2 Tax=Blautia hansenii... 416 e-115
UniRef50_B0N997 Putative uncharacterized protein n=1 Tax=Clostri... 410 e-113
UniRef50_C4G6V3 Putative uncharacterized protein n=1 Tax=Abiotro... 384 e-105
UniRef50_UPI0001911724 sulfatase/phosphatase n=1 Tax=Salmonella ... 199 1e-49
UniRef50_C6D1Q0 Sulfatase n=2 Tax=Paenibacillus sp. JDR-2 RepID=... 187 6e-46
UniRef50_C5EHR5 Putative uncharacterized protein n=1 Tax=Clostri... 180 1e-43
UniRef50_Q01ZJ7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 179 3e-43
UniRef50_Q7MBV5 Arylsulfatase A n=31 Tax=Bacteria RepID=Q7MBV5_V... 178 4e-43
UniRef50_UPI00019126F6 sulfatase/phosphatase n=1 Tax=Salmonella ... 
176 2e-42 UniRef50_D2MLH4 Sulfatase family protein n=1 Tax=Candidatus Pori... 175 3e-42 UniRef50_A6DKC5 Putative sulfatase yidj n=1 Tax=Lentisphaera ara... 172 2e-41 UniRef50_C5BVK2 Sulfatase n=11 Tax=Actinomycetales RepID=C5BVK2_... 171 4e-41 UniRef50_Q029P1 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 171 6e-41 UniRef50_C0G116 Sulfatase n=1 Tax=Natrialba magadii ATCC 43099 R... 171 8e-41 UniRef50_C6J2Z0 Sulfatase n=4 Tax=Firmicutes RepID=C6J2Z0_9BACL 170 1e-40 UniRef50_C9L4R7 Putative sulfatase YidJ n=1 Tax=Blautia hansenii... 167 8e-40 UniRef50_Q15XH4 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 166 1e-39 UniRef50_Q482B9 Sulfatase family protein n=1 Tax=Colwellia psych... 166 2e-39 UniRef50_C6J3H9 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 7... 166 2e-39 UniRef50_A6DLX7 Putative sulfatase n=1 Tax=Lentisphaera araneosa... 165 4e-39 UniRef50_A7LY81 Putative uncharacterized protein n=5 Tax=Bactero... 165 5e-39 UniRef50_C5BXT8 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 163 2e-38 UniRef50_D2RQH7 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5... 162 2e-38 UniRef50_C6J5I7 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 7... 161 7e-38 UniRef50_A4AP83 Putative sulfatase n=1 Tax=Flavobacteriales bact... 160 1e-37 UniRef50_C5HLB2 Putative sulfatase n=1 Tax=uncultured bacterium ... 159 3e-37 UniRef50_C3WAQ9 Sulfatase n=1 Tax=Fusobacterium mortiferum ATCC ... 159 3e-37 UniRef50_Q7W424 Putative sulfatase n=2 Tax=Bordetella RepID=Q7W4... 157 7e-37 UniRef50_C3WCE8 Arylsulfatase n=2 Tax=Fusobacterium RepID=C3WCE8... 157 1e-36 UniRef50_D2QL61 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 156 2e-36 UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase... 155 4e-36 UniRef50_UPI0001968556 hypothetical protein BACCELL_00122 n=1 Ta... 154 6e-36 UniRef50_C0QY53 Sulfatase n=2 Tax=Brachyspira RepID=C0QY53_BRAHW 154 1e-35 UniRef50_Q1GMK9 Choline sulfatase n=8 Tax=Alphaproteobacteria Re... 
153 1e-35 UniRef50_A6DFB2 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Lent... 153 1e-35 UniRef50_A3P379 Choline-sulfatase n=63 Tax=cellular organisms Re... 153 1e-35 UniRef50_UPI0001968553 hypothetical protein BACCELL_00119 n=1 Ta... 153 1e-35 UniRef50_B6AU86 Putative sulfatase YidJ n=1 Tax=Rhodobacterales ... 152 2e-35 UniRef50_UPI00016C0A06 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. ... 152 3e-35 UniRef50_A0JVP0 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0... 152 3e-35 UniRef50_Q5LRB5 Choline sulfatase n=1 Tax=Ruegeria pomeroyi RepI... 152 3e-35 UniRef50_Q46P27 Sulfatase n=3 Tax=Proteobacteria RepID=Q46P27_RALEJ 152 4e-35 UniRef50_A6DM50 Choline sulfatase n=6 Tax=Bacteria RepID=A6DM50_... 151 5e-35 UniRef50_Q7UH28 Mucin-desulfating sulfatase (N-acetylglucosamine... 151 5e-35 UniRef50_Q5LH37 Putative sulfatase n=16 Tax=Bacteroides RepID=Q5... 151 6e-35 UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W9... 150 1e-34 UniRef50_C3QDX1 Sulfatase n=2 Tax=Bacteroides RepID=C3QDX1_9BACE 150 1e-34 UniRef50_B5JYP8 Choline-sulfatase n=1 Tax=Octadecabacter antarct... 150 1e-34 UniRef50_UPI00017453D4 choline sulfatase n=1 Tax=Verrucomicrobiu... 150 1e-34 UniRef50_C2KTX6 Arylsulfatase n=2 Tax=Mobiluncus mulieris RepID=... 150 1e-34 UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE 150 1e-34 UniRef50_A4AWR8 Iduronate-2-sulfatase n=5 Tax=Bacteria RepID=A4A... 150 1e-34 UniRef50_Q89YS5 N-acetylglucosamine-6-sulfatase n=12 Tax=Bactero... 149 2e-34 UniRef50_B9XND0 Sulfatase n=3 Tax=Bacteria RepID=B9XND0_9BACT 149 2e-34 UniRef50_B6B0A5 Putative sulfatase YidJ n=1 Tax=Rhodobacterales ... 149 3e-34 UniRef50_A4CMA4 Mucin-desulfating sulfatase (N-acetylglucosamine... 149 3e-34 UniRef50_Q0K3Z4 Arylsulfatase A n=4 Tax=Burkholderiales RepID=Q0... 148 4e-34 UniRef50_A9MER1 Putative uncharacterized protein n=2 Tax=Enterob... 148 4e-34 UniRef50_B8FL44 Sulfatase n=1 Tax=Desulfatibacillum alkenivorans... 
148 4e-34 UniRef50_UPI0001C3580F sulfatase n=2 Tax=Clostridium hathewayi D... 148 5e-34 UniRef50_Q127E2 Sulfatase n=1 Tax=Polaromonas sp. JS666 RepID=Q1... 148 5e-34 UniRef50_UPI000051016C choline-sulfatase n=1 Tax=Brevibacterium ... 148 6e-34 UniRef50_C7MEQ7 Choline-sulfatase n=1 Tax=Brachybacterium faeciu... 148 6e-34 UniRef50_C6MEX8 Sulfatase n=1 Tax=Nitrosomonas sp. AL212 RepID=C... 147 7e-34 UniRef50_Q7UW58 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 147 8e-34 UniRef50_A4A280 Iduronate-2-sulfatase n=1 Tax=Blastopirellula ma... 146 1e-33 UniRef50_A6C9F6 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris... 146 1e-33 UniRef50_UPI0001C3604A sulfatase n=1 Tax=Clostridium hathewayi D... 146 2e-33 UniRef50_Q02B50 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 145 3e-33 UniRef50_A0JVN2 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0... 145 4e-33 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 144 6e-33 UniRef50_Q5UEW6 Probable phosphonate monoester hydrolase n=1 Tax... 144 6e-33 UniRef50_B9XGI2 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XGI... 144 6e-33 UniRef50_C6D448 Sulfatase n=2 Tax=Bacteria RepID=C6D448_PAESJ 144 6e-33 UniRef50_UPI0001C36159 sulfatase n=2 Tax=Clostridium hathewayi D... 144 9e-33 UniRef50_B1KD82 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 143 1e-32 UniRef50_A3I0S5 Putative sulfatase yidJ n=1 Tax=Algoriphagus sp.... 143 2e-32 UniRef50_B5GLL7 Sulfatase n=1 Tax=Streptomyces clavuligerus ATCC... 143 2e-32 UniRef50_C0S8M2 Choline sulfatase n=8 Tax=Eurotiomycetidae RepID... 142 3e-32 UniRef50_D1AWE3 Sulfatase n=3 Tax=Fusobacteriaceae RepID=D1AWE3_... 141 6e-32 UniRef50_Q5UEY3 Probable sulfatase n=1 Tax=uncultured alpha prot... 141 6e-32 UniRef50_UPI00016BFE17 putative sulfatase n=1 Tax=Epulopiscium s... 141 6e-32 UniRef50_Q01PN7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 141 6e-32 UniRef50_Q7UJ67 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 
141 6e-32 UniRef50_A6DKS7 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisph... 141 6e-32 UniRef50_Q7UFA5 Putative sulfatase yidj n=1 Tax=Rhodopirellula b... 141 7e-32 UniRef50_B6A548 Choline-sulfatase n=1 Tax=Rhizobium leguminosaru... 141 7e-32 UniRef50_UPI00016BFAFE putative sulfatase n=1 Tax=Epulopiscium s... 140 1e-31 UniRef50_UPI00016C0B77 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. ... 140 1e-31 UniRef50_UPI0001C369FC arylsulfatase n=1 Tax=Clostridium hathewa... 140 1e-31 UniRef50_D2R1A1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 140 1e-31 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 139 2e-31 UniRef50_C9L4R5 Mucin-desulfating sulfatase n=1 Tax=Blautia hans... 139 2e-31 UniRef50_C6J5I8 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 7... 139 3e-31 UniRef50_A6DFZ4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 139 3e-31 UniRef50_Q7UGD6 Mucin-desulfating sulfatase (N-acetylglucosamine... 139 3e-31 UniRef50_C5BAV0 Sulfatase, putative n=2 Tax=Edwardsiella RepID=C... 139 3e-31 UniRef50_A0JVM4 Sulfatase n=2 Tax=Actinomycetales RepID=A0JVM4_A... 139 4e-31 UniRef50_A0LYA0 Sulfatase n=8 Tax=Bacteria RepID=A0LYA0_GRAFK 138 4e-31 UniRef50_A6CG48 Sulfatase family protein n=1 Tax=Planctomyces ma... 138 5e-31 UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Ta... 138 6e-31 UniRef50_Q1IH24 Choline sulfatase n=29 Tax=cellular organisms Re... 137 8e-31 UniRef50_C6CRB3 Sulfatase n=2 Tax=Bacilli RepID=C6CRB3_PAESJ 137 8e-31 UniRef50_D2MKV1 Choline-sulfatase n=1 Tax=Candidatus Poribacteri... 137 9e-31 UniRef50_C5BYA8 Sulfatase n=2 Tax=Micrococcineae RepID=C5BYA8_BEUC1 137 9e-31 UniRef50_UPI0001746164 choline-sulfatase n=1 Tax=Verrucomicrobiu... 137 1e-30 UniRef50_A6LF65 Choline-sulfatase n=26 Tax=Bacteroidales RepID=A... 137 1e-30 UniRef50_D2R203 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 136 2e-30 UniRef50_UPI0000E11039 sulfatase 1 precursor n=1 Tax=Rhodobacter... 
136 2e-30 UniRef50_UPI00016C09FC sulfatase n=2 Tax=Epulopiscium sp. 'N.t. ... 135 3e-30 UniRef50_UPI0001745B0B sulfatase n=1 Tax=Verrucomicrobium spinos... 135 3e-30 UniRef50_UPI0001C36C38 arylsulfatase n=1 Tax=Clostridium hathewa... 135 3e-30 UniRef50_A6DSH1 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 135 4e-30 UniRef50_D0PR12 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Flam... 135 5e-30 UniRef50_C9L086 Mucin-desulfating sulfatase n=54 Tax=Bacteria Re... 135 5e-30 UniRef50_A6DPD0 Sulfatase family protein n=1 Tax=Lentisphaera ar... 135 5e-30 UniRef50_C0W1U3 Sulfatase n=1 Tax=Actinomyces coleocanis DSM 154... 134 6e-30 UniRef50_D2QWC7 Sulfatase n=5 Tax=Bacteria RepID=D2QWC7_9PLAN 134 6e-30 UniRef50_A6DNI8 Putative N-acetylglucosamine-6-sulfatase n=1 Tax... 133 1e-29 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 133 1e-29 UniRef50_B6GZC3 Pc12g01800 protein n=14 Tax=cellular organisms R... 133 2e-29 UniRef50_A6DGT7 Sulfatase family protein n=1 Tax=Lentisphaera ar... 133 2e-29 UniRef50_B3C5J5 Putative uncharacterized protein n=7 Tax=Bactero... 133 2e-29 UniRef50_A6C8U0 Choline sulfatase n=1 Tax=Planctomyces maris DSM... 133 2e-29 UniRef50_B4D6H3 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 132 2e-29 UniRef50_UPI00016C0ED5 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. ... 132 2e-29 UniRef50_B9XEU8 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XEU... 132 2e-29 UniRef50_C6VXD1 Sulfatase n=4 Tax=Bacteria RepID=C6VXD1_DYAFD 132 2e-29 UniRef50_B8KY63 N-sulphoglucosamine sulphohydrolase n=1 Tax=gamm... 132 3e-29 UniRef50_A4U8Q3 Sulfatase n=2 Tax=Bacteria RepID=A4U8Q3_9BACT 132 3e-29 UniRef50_A3SJ21 Sulfatase n=1 Tax=Roseovarius nubinhibens ISM Re... 132 4e-29 UniRef50_A4AMS2 Choline sulfatase n=1 Tax=Flavobacteriales bacte... 131 5e-29 UniRef50_C9L4R3 N-acetylglucosamine-6-sulfatase n=2 Tax=Blautia ... 131 5e-29 UniRef50_D1AX15 Sulfatase n=2 Tax=Fusobacteriaceae RepID=D1AX15_... 131 5e-29 UniRef50_C6J2X5 Sulfatase n=1 Tax=Paenibacillus sp. 
oral taxon 7... 131 6e-29 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 131 6e-29 UniRef50_A6DME6 Sulfatase family protein n=1 Tax=Lentisphaera ar... 130 8e-29 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 130 8e-29 UniRef50_Q7UMT6 Mucin-desulfating sulfatase (N-acetylglucosamine... 130 8e-29 UniRef50_UPI0001C35525 putative sulfatase yidJ n=1 Tax=Clostridi... 130 9e-29 UniRef50_A7V656 Putative uncharacterized protein n=6 Tax=Bactero... 130 1e-28 UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 130 1e-28 UniRef50_A6DJ72 Mucin-desulfating sulfatase (N-acetylglucosamine... 130 2e-28 UniRef50_Q7WC54 Putative sulfatase n=3 Tax=Proteobacteria RepID=... 130 2e-28 UniRef50_D2S234 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5... 129 2e-28 UniRef50_UPI0001BC85B0 choline sulfatase n=1 Tax=Bacteroides sp.... 129 2e-28 UniRef50_A5FX90 Sulfatase n=4 Tax=Alphaproteobacteria RepID=A5FX... 129 2e-28 UniRef50_Q7UWE8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 129 2e-28 UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TK... 129 2e-28 UniRef50_D0DCV9 Choline-sulfatase n=2 Tax=Citreicella sp. SE45 R... 129 2e-28 UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepI... 129 3e-28 UniRef50_D2MLH3 Mucin-desulfating sulfatase (N-acetylglucosamine... 129 3e-28 UniRef50_A6DNH0 Choline sulfatase n=1 Tax=Lentisphaera araneosa ... 129 3e-28 UniRef50_Q15XR5 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 129 3e-28 UniRef50_A6DLY1 Putative sulfatase n=1 Tax=Lentisphaera araneosa... 129 3e-28 UniRef50_C7MHR6 Arylsulfatase A family protein n=3 Tax=Bacteria ... 129 3e-28 UniRef50_Q7UZ92 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 128 4e-28 UniRef50_A6DPE5 Iduronate-2-sulfatase n=2 Tax=Lentisphaera arane... 128 4e-28 UniRef50_C5BWB0 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 128 5e-28 UniRef50_Q482E2 Sulfatase family protein n=1 Tax=Colwellia psych... 
127 7e-28 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 127 8e-28 UniRef50_A6DIH4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 127 9e-28 UniRef50_Q7UER3 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae ... 127 1e-27 UniRef50_UPI0000E0F7B6 iduronate 2-sulfatase precursor n=1 Tax=G... 127 1e-27 UniRef50_C6LAI4 Arylsulfatase n=6 Tax=Bacteria RepID=C6LAI4_9FIRM 127 1e-27 UniRef50_A3HTC7 Putative uncharacterized protein n=1 Tax=Algorip... 127 1e-27 UniRef50_D0TVM5 Choline sulfatase n=2 Tax=Bacteroides RepID=D0TV... 127 1e-27 UniRef50_C5EPJ8 Sulfatase n=8 Tax=Bacteria RepID=C5EPJ8_9FIRM 126 2e-27 UniRef50_A7HWE6 Sulfatase n=2 Tax=Bacteria RepID=A7HWE6_PARL1 126 2e-27 UniRef50_O69787 Choline-sulfatase n=53 Tax=Alphaproteobacteria R... 126 2e-27 UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 126 2e-27 UniRef50_B4D780 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 126 2e-27 UniRef50_A0JVM5 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0... 125 3e-27 UniRef50_UPI0001C35789 arylsulfatase n=1 Tax=Clostridium hathewa... 125 3e-27 UniRef50_C6IGG0 Iduronate 2-sulfatase n=2 Tax=Bacteroides RepID=... 125 3e-27 UniRef50_A6DG71 Mucin-desulfating sulfatase (N-acetylglucosamine... 125 3e-27 UniRef50_A6E5R0 Putative sulfatase n=1 Tax=Roseovarius sp. TM103... 125 3e-27 UniRef50_Q0TUK6 Arylsulfatase n=9 Tax=Bacteria RepID=SULF_CLOP1 125 4e-27 UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 125 4e-27 UniRef50_C9L4S2 Arylsulfatase n=1 Tax=Blautia hansenii DSM 20583... 125 4e-27 UniRef50_B5CWC2 Putative uncharacterized protein n=1 Tax=Bactero... 125 4e-27 UniRef50_A6DG34 Choline sulfatase n=1 Tax=Lentisphaera araneosa ... 125 4e-27 UniRef50_Q7UHJ4 Mucin-desulfating sulfatase n=2 Tax=Planctomycet... 125 4e-27 UniRef50_B4D026 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 125 5e-27 UniRef50_Q15NY5 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 
125 5e-27 UniRef50_C5BXG2 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 125 5e-27 UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 124 6e-27 UniRef50_A6DJJ1 Sulfatase family protein n=1 Tax=Lentisphaera ar... 124 7e-27 UniRef50_A9ECS8 Sulfatase n=3 Tax=Bacteria RepID=A9ECS8_9FLAO 124 8e-27 UniRef50_C7MI43 Arylsulfatase A family protein n=5 Tax=Bacteria ... 124 8e-27 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 124 8e-27 UniRef50_UPI0001746432 sulfatase n=1 Tax=Verrucomicrobium spinos... 124 1e-26 UniRef50_B6HPN7 Pc22g01020 protein n=15 Tax=Eukaryota RepID=B6HP... 124 1e-26 UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 123 1e-26 UniRef50_A4A047 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A4A... 123 2e-26 UniRef50_B2AAG4 Predicted CDS Pa_1_3920 n=1 Tax=Podospora anseri... 122 2e-26 UniRef50_Q7NMX5 Gll0640 protein n=1 Tax=Gloeobacter violaceus Re... 122 3e-26 UniRef50_UPI00016C001E mucin-desulfating sulfatase n=1 Tax=Epulo... 122 3e-26 UniRef50_UPI0000E11054 iduronate-2-sulfatase n=1 Tax=Rhodobacter... 122 3e-26 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 122 3e-26 UniRef50_A6DHY0 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 122 3e-26 UniRef50_Q7UGI8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 122 4e-26 UniRef50_C8VYX4 Sulfatase n=2 Tax=Firmicutes RepID=C8VYX4_DESAS 122 4e-26 UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 122 4e-26 UniRef50_C7MHD7 Arylsulfatase A family protein n=1 Tax=Brachybac... 121 5e-26 UniRef50_UPI0000E1104B N-acetylgalactosamine 6-sulfate sulfatase... 121 5e-26 UniRef50_Q7MBV3 Arylsulfatase A n=6 Tax=Vibrio RepID=Q7MBV3_VIBVY 121 8e-26 UniRef50_A6CFT9 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae ... 120 8e-26 UniRef50_A6DSH0 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 120 1e-25 UniRef50_A6DG72 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 
120 1e-25 UniRef50_Q7UVD9 N-acetylgalactosamine 6-sulfate sulfatase n=1 Ta... 120 1e-25 UniRef50_D0Z4S7 Iduronate sulfatase n=1 Tax=Photobacterium damse... 120 1e-25 UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D... 120 1e-25 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 120 1e-25 UniRef50_A3JPC9 Mucin-desulfating sulfatase (N-acetylglucosamine... 120 1e-25 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 120 2e-25 UniRef50_A6DFR6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Lentis... 120 2e-25 UniRef50_D2R925 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 119 2e-25 UniRef50_B7AMH4 Putative uncharacterized protein n=1 Tax=Bactero... 119 2e-25 UniRef50_A6L183 Iduronate 2-sulfatase n=11 Tax=Bacteroides RepID... 119 2e-25 UniRef50_Q7UVD4 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 119 2e-25 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 119 2e-25 UniRef50_A6UE90 Sulfatase n=1 Tax=Sinorhizobium medicae WSM419 R... 119 3e-25 UniRef50_A3HWG3 Choline sulfatase n=1 Tax=Algoriphagus sp. PR1 R... 119 3e-25 UniRef50_A0LK86 Sulfatase n=1 Tax=Syntrophobacter fumaroxidans M... 119 4e-25 UniRef50_A6DNH1 Choline sulfatase n=2 Tax=Lentisphaera araneosa ... 118 4e-25 UniRef50_C1ZIM5 Arylsulfatase A family protein n=2 Tax=Planctomy... 118 4e-25 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 118 5e-25 UniRef50_A9V5D4 Predicted protein n=1 Tax=Monosiga brevicollis R... 118 5e-25 UniRef50_B4X2F4 Sulfatase, putative n=1 Tax=Alcanivorax sp. DG88... 118 6e-25 UniRef50_C6Y1N2 Sulfatase n=2 Tax=Pedobacter heparinus DSM 2366 ... 117 7e-25 UniRef50_A6DM53 Arylsulfatase (Aryl-sulfate sulphohydrolase) n=1... 117 7e-25 UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepI... 117 8e-25 UniRef50_UPI0001C35757 sulfatase n=1 Tax=Clostridium hathewayi D... 117 1e-24 UniRef50_Q1ARG1 Sulfatase n=2 Tax=Rubrobacter xylanophilus DSM 9... 
117 1e-24 UniRef50_A6DJ24 Iduronate-2-sulfatase n=3 Tax=Lentisphaera arane... 117 1e-24 UniRef50_B1I7R7 Sulfatase n=16 Tax=Lactobacillales RepID=B1I7R7_... 116 2e-24 UniRef50_A6DHY1 Mucin-desulfating sulfatase n=1 Tax=Lentisphaera... 116 2e-24 UniRef50_A6DG79 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 116 2e-24 UniRef50_UPI00016C500A sulfatase n=1 Tax=Gemmata obscuriglobus U... 116 2e-24 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 116 2e-24 UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=... 116 2e-24 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 115 3e-24 UniRef50_Q7UYD2 Sulfatase 1 n=1 Tax=Rhodopirellula baltica RepID... 115 3e-24 UniRef50_C9L4I6 Arylsulfatase n=1 Tax=Blautia hansenii DSM 20583... 115 4e-24 UniRef50_C5C4L8 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 115 4e-24 UniRef50_D2R201 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 115 4e-24 UniRef50_A4GIB1 Arylsulfatase n=2 Tax=Bacteria RepID=A4GIB1_9BACT 115 4e-24 UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhod... 115 5e-24 >UniRef50_P31447 Uncharacterized sulfatase yidJ n=52 Tax=Enterobacteriaceae RepID=YIDJ_ECOLI Length = 497 Score = 1046 bits (2705), Expect = 0.0, Method: Compositional matrix adjust. 
Identities = 497/497 (100%), Positives = 497/497 (100%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF Sbjct: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA Sbjct: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA Sbjct: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV Sbjct: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA 300 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA Sbjct: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA 300 Query: 301 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG Sbjct: 301 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360 Query: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF Sbjct: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420 Query: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD 480 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD Sbjct: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD 480 Query: 481 TGLPTQGVKVEEKKQKF 497 TGLPTQGVKVEEKKQKF Sbjct: 481 TGLPTQGVKVEEKKQKF 497 >UniRef50_C9L4Q0 Putative sulfatase YidJ n=2 Tax=Blautia hansenii DSM 20583 RepID=C9L4Q0_RUMHA Length = 505 Score = 416 bits (1070), Expect = e-115, Method: 
Compositional matrix adjust. Identities = 218/506 (43%), Positives = 306/506 (60%), Gaps = 22/506 (4%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ +F+MTDTQ T+M+GCY + T N+D LAAEGIR++ AYT PVC PAR+ +F Sbjct: 1 MKKRQVIFIMTDTQRTDMLGCYGNSAMVTPNLDRLAAEGIRYDKAYTTQPVCQPARSAIF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG Y + W+N + N+ ++G+ DAG HT Y+GKWHLDG DYFG G CP WD Sbjct: 61 TGSYPHSCAGWSNCMGLSDNVQSIGQRLSDAGIHTAYVGKWHLDGGDYFGLGRCPKGWDE 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 DYW+D +L ELT +E R + +E ++ +I E T+ HR ++RAVDF+++ Sbjct: 121 DYWYDMKCFLDELTPEE----RYRIRQIESIEKYNITEDMTYGHRCADRAVDFIEK--HK 174 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA-QAMPSP 239 DE + +V+S DEPH P CP +Y++ Y D+ + E +D L +KPEHHR+WA Sbjct: 175 DEDYFLVMSLDEPHGPHICPKKYVDLYKDYEIPVKENMKDTLEDKPEHHRIWAGDEYLKA 234 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGA 299 +D + CN F D +IGRV++A + E +IYTSDHG+MM H L KG Sbjct: 235 CREDFKLSPKEFLGCNTFADYEIGRVLDAAAQYEDEPI-IIYTSDHGDMMYGHSLTGKGP 293 Query: 300 AMYDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMALADIEKPEILPGENIL-AVKE 357 A+Y++IT IPL+I+ G + VD PVSHI+L PT+ + + P++ G +I VK Sbjct: 294 ALYEEITHIPLMIK---GFGKGVDKNPVSHINLAPTIFDMFGVPIPKMFEGRSIFEEVKN 350 Query: 358 PRG-----VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 P V +EF RYE++HD FGG+ P+R +K+V+NL TSDELYD + DP EM Sbjct: 351 PEVRCNDYVFMEFGRYEVDHDGFGGYQPLRGAFDGRYKMVINLMTSDELYDLQEDPQEMK 410 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR-WMGAF--RPRPQ 469 NLI++ + ++R ++H+A+LD M + RDPFR Y W RPW + + W R R Sbjct: 411 NLINEPGYDEIRKRLHEAILDNMYETRDPFRGYYWEDRPWNRITEYKTWDSRLMTRQREN 470 Query: 470 DGYSPVVRDYDTGLPTQGVKVEEKKQ 495 + Y P DY TGLP V +K Q Sbjct: 471 EEYEPRQLDYGTGLPMTSA-VRKKGQ 495 >UniRef50_B0N997 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0N997_EUBSP Length = 495 Score = 410 bits (1055), Expect = e-113, Method: Compositional 
matrix adjust. Identities = 218/499 (43%), Positives = 299/499 (59%), Gaps = 27/499 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + +F+MTDT +MVGCY + T N+D LA EGIR+ +AYTC PVC PAR+ +FTG Sbjct: 2 KKQVIFLMTDTTRKDMVGCYGNPKMKTPNLDRLAEEGIRYENAYTCQPVCGPARSAIFTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + + +G TN++A G N+ T+G+ + G YIGKWHLDG DYFG G CP WD +Y Sbjct: 62 TFPHTNGMVTNSIAMGDNVKTIGQRLHNHGISCGYIGKWHLDGSDYFGNGRCPEGWDPEY 121 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 W+D YL ELT++E R+ +D E FT+AHR S+RA+ +L+ DE Sbjct: 122 WYDMKTYLDELTDEEKVRSRDPKECYKD----GFSEEFTYAHRCSDRAIKYLEN--HQDE 175 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGD 242 F + VSYDEPH P CP + + F +E QDDL+ KP RLW+ D Sbjct: 176 DFFLSVSYDEPHGPSLCPEPFNHMFDGFKFESCPNFQDDLSKKPFMQRLWSGKNLHATED 235 Query: 243 ------DGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLIS 296 DGL L+ CN F D +IGRV++ + E + VI+TSDHG+M+GAH+L S Sbjct: 236 EINQPSDGL---SLFLGCNSFADYEIGRVLDKIR-EVAPDALVIFTSDHGDMLGAHRLFS 291 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGENIL-A 354 K AA Y ++ IPLII+ GER V D SHID+ PT++ + P++L G+++L Sbjct: 292 KNAAAYKEVANIPLIIKG--GERGYVEDAMASHIDIAPTILDYFGLPIPKLLEGKSMLPQ 349 Query: 355 VKEPRG-----VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +K P V EF RYEI+HD FGG +R ++ +KLV++L +DE YD NDP Sbjct: 350 IKNPEKEINDVVFTEFTRYEIDHDGFGGLQIMRAVMSKRYKLVIHLLDTDEFYDLENDPY 409 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRW--MGAFRPR 467 EM+NLI+D ++ + R+ +HD L+ +M+ RD +R YQWS+RPWR D P W G R R Sbjct: 410 EMNNLIEDKKYIEERNALHDKLIQHMNDTRDLYRGYQWSMRPWRTDFIPDWENEGYTRQR 469 Query: 468 PQDGYSPVVRDYDTGLPTQ 486 + Y P DYDTGLP + Sbjct: 470 ENEEYEPRQLDYDTGLPME 488 >UniRef50_C4G6V3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G6V3_ABIDE Length = 502 Score = 384 bits (986), Expect = e-105, Method: Compositional matrix adjust. 
Identities = 214/502 (42%), Positives = 289/502 (57%), Gaps = 26/502 (5%) Query: 1 MKRPNFLFVMTDTQATNMVG-CYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 M + F+ +MTD+Q +M+ C G+ ++T +D L +G+ F SAYT PVC PARAG Sbjct: 1 MAKKQFIVIMTDSQRRDMISRCNERGENMHTPCLDRLCDQGLAFQSAYTTQPVCGPARAG 60 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 LFTG Y + +G N +A + T+G+ AG H YIGKWHLDG DYFG G CP W Sbjct: 61 LFTGTYPHTNGMLGNCMALSQQSLTIGQRLSKAGIHAAYIGKWHLDGGDYFGDGICPEGW 120 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D +YW+D NYL EL E L++ L+ I E FT+ +R + RA+DF+++ Sbjct: 121 DENYWYDMRNYLDELESDEDRARSRTLDTA--LEGEGIGEEFTYGYRCTKRALDFMEK-- 176 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD--DLANKPEHHRLWAQAM 236 DE + +VVSYDEPHHPF P + Y FY +K D + PEH ++W + Sbjct: 177 YKDEDYFLVVSYDEPHHPFLSPKSF---YKPFYQPYLQKPNQHMDFSKLPEHIQVWHEKF 233 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLIS 296 G G N F+D QIGRV+ A + E+ V+YTSDHG+ G+H + + Sbjct: 234 SEIQGGKGDGFAVGLLGSNSFIDSQIGRVLEA-AEKNAEDALVLYTSDHGDSQGSHGIHA 292 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQVDT--PVSHIDLLPTMMALADIEKPEILPGENIL- 353 KG AMY++IT IPLI R + T PVSHID++PT++ + +P+ L GE++L Sbjct: 293 KGPAMYEEITNIPLIARWKNKIEAGITTQMPVSHIDIVPTILDFYGLPQPKSLEGESLLN 352 Query: 354 --------AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 KE R V VEFNRYE++HD +GGF PVRC V +KL +NL T DELY+ Sbjct: 353 SLTDKEITGQKEGRPVFVEFNRYEVDHDGWGGFQPVRCVVKGKWKLTINLMTQDELYNLE 412 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD-ARPRW--MG 462 D NEMHNLIDD +R+++HD LLD+ ++ RDP R Y W RPWRKD + W G Sbjct: 413 EDYNEMHNLIDDPNCESIRNQLHDLLLDWQNETRDPLRGYYWEKRPWRKDRQKVSWDCGG 472 Query: 463 AFRPRPQDGYSPVVRDYDTGLP 484 R R ++ Y TGLP Sbjct: 473 YSRSRHREDGEVGEYGYSTGLP 494 >UniRef50_UPI0001911724 sulfatase/phosphatase n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. M223 RepID=UPI0001911724 Length = 91 Score = 199 bits (507), Expect = 1e-49, Method: Compositional matrix adjust. 
Identities = 91/91 (100%), Positives = 91/91 (100%) Query: 37 AEGIRFNSAYTCSPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTC 96 AEGIRFNSAYTCSPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTC Sbjct: 1 AEGIRFNSAYTCSPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTC 60 Query: 97 YIGKWHLDGHDYFGTGECPPEWDADYWFDGA 127 YIGKWHLDGHDYFGTGECPPEWDADYWFDGA Sbjct: 61 YIGKWHLDGHDYFGTGECPPEWDADYWFDGA 91 >UniRef50_C6D1Q0 Sulfatase n=2 Tax=Paenibacillus sp. JDR-2 RepID=C6D1Q0_PAESJ Length = 480 Score = 187 bits (476), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 150/484 (30%), Positives = 225/484 (46%), Gaps = 53/484 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L++ TD Q + +G Y + +NT +ID LAAEG+ F A+ SPVCTP+RA Sbjct: 1 MKKPNILWICTDQQRQDTLGAYGNQWVNTPHIDRLAAEGVLFEQAFCQSPVCTPSRASFL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----------------D 104 TG Y + N + + + + GY GK HL D Sbjct: 61 TGRYPRTTRCRANGQDIPADEKLISKLLSEEGYICGLAGKLHLSACHPSVNKGTERRIDD 120 Query: 105 GHDYFGTGECP-PEWDAD---YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 G D F P EW + W G E S + N ED Q Sbjct: 121 GFDQFFWSHHPNAEWPTNEYTQWLKGKGKTFSPRPFENSPYVNCGPDAEDHQT------- 173 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE--LGEKA 218 TW + +AV F++ + + P+ +V+ +PHHPF P EYL++Y D E L Sbjct: 174 TWC---AEKAVQFIETNSDYERPWFFLVNLFDPHHPFDPPKEYLDRYLDRLDEIPLPNYE 230 Query: 219 QDDLANKPEHHRL---WAQAMPSPVGDDGLYH--HPL----YFACNDFVDDQIGRVINAL 269 + +L NKP + R+ A M + + H L Y+A D +DDQ+GR++++L Sbjct: 231 EGELENKPVYQRIDRDGAYGMRGHLAASDMSERDHRLIRAAYWAMCDLIDDQVGRMLDSL 290 Query: 270 TPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPV 326 Q +NT V++ SDHGE++G H + KG YD R+PLI+R P G RR + + V Sbjct: 291 ERSGQLDNTIVVFMSDHGELLGDHGMYLKGPHFYDCSVRVPLIVRGPGIHGGRR-IASLV 349 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP---VRC 383 DL PT++ + I + G+++ + +G +R ++ +S+ P +R Sbjct: 350 ELADLAPTLLEASQIPTYTGMQGKSLWPILLNKGEDAPNHREDVYCESYDANFPHGDLRA 409 Query: 384 
WV----TDDFKLVL-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 W TD KLVL + S ELYD DP E N +D + V+ ++ L + M Sbjct: 410 WATMVRTDSHKLVLYHNDNSGELYDLLADPKENRNAWNDHAYTSVKFELMQRLCNRMALT 469 Query: 439 RDPF 442 DP Sbjct: 470 VDPL 473 >UniRef50_C5EHR5 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EHR5_9FIRM Length = 490 Score = 180 bits (456), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 137/487 (28%), Positives = 224/487 (45%), Gaps = 53/487 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + M D + +GCY ++T +IDSLA G F++ + +PVC+P+R + T Sbjct: 4 RRPNIILFMCDQLRFDALGCYGNNQIHTPHIDSLALNGSTFDNHFVQNPVCSPSRCTVLT 63 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y G N + + T+ +D GY T IGK H+ + + +W D Sbjct: 64 GRYPKNHGTRDNGIPLRDSEITLAETLRDNGYRTAAIGKMHITTQ-FVPKEDEQEDWPED 122 Query: 122 -YWFDGANYLSELTEKEISLWRNGLNSVEDLQA--------------------------- 153 Y FD + + E W S ED + Sbjct: 123 NYGFDIIHTTCDCKTGEYLDWLKAA-SPEDYEEVKMQGERKAKEDRASAADKDTGGPPQV 181 Query: 154 --NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY 211 + I+ ++ +H I++R +D +++ + D+PF S+ +PHHPF P Y + Y Sbjct: 182 YPSGINPSYHQSHWIADRMIDLIEE-SGPDQPFFAYCSFVDPHHPFDPPKPYGDMYDPDA 240 Query: 212 YELGEKAQDDLANKPEHHR--LWAQAMPSPVGD-DGLYHH------PLYFACNDFVDDQI 262 E+ + + +L +KP H R L A+ + D L H Y+ +DD I Sbjct: 241 LEVPVRMEGELLDKPPHFRKALTARGFSNEKYDYRKLTDHQWGQVKAAYYGMITLIDDNI 300 Query: 263 GRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR----SPQG 317 GR++NAL E +T +++T+DHGE++G H L+ KG YD I + P+II+ PQG Sbjct: 301 GRILNALRENGLEKDTLILFTNDHGELLGDHGLLFKGPFHYDCIIKAPMIIKWPGVVPQG 360 Query: 318 ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV-KEPRGVMVEFNRYEIEHDSFG 376 R T H+D++PT++ A + P + G ++ + + +G E+ E +G Sbjct: 361 SRYSQVT--EHVDIMPTLLEYAGVRPPYGVQGCSMAPILRGDKGAGKEYAMTEFNCYDWG 418 Query: 377 GFIPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 + V+ ++KL ELYDR DP E NL DD + V++ M L+D + Sbjct: 419 
--LSVKTLTGRNYKLTYYAGEEYGELYDRNLDPEEFKNLWDDEAYGAVKAYMMKKLMDRI 476 Query: 436 DKIRDPF 442 + DP Sbjct: 477 IETEDPL 483 >UniRef50_Q01ZJ7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01ZJ7_SOLUE Length = 516 Score = 179 bits (453), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 140/468 (29%), Positives = 216/468 (46%), Gaps = 35/468 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +MTD Q + SG T NID LA++G+ F +YT S VC PARA L +G Sbjct: 30 RPNILHIMTDQQQWATIAGRSG--CRTPNIDRLASQGMLFERSYTPSAVCCPARAMLLSG 87 Query: 63 IYANQSGPWTNNVAP-------GKNISTMGRYFKDAGYHTCYIGKWH---------LDGH 106 Y +G + +P ++ + ++AGY Y GKWH H Sbjct: 88 AYHWHNGVYNQVHSPPSVHRDMNADVVLYSQRLREAGYRLGYTGKWHASYLRTPLDFGFH 147 Query: 107 DYFGTGECPPEW--DADYWFDGANYLSE---LTEKEISLWRNGLNSVEDLQANHIDETFT 161 + G C PE D D ++E T++ + W G E T Sbjct: 148 EIAGVAGCDPELLKKIDLNPDRVPRITEPLRTTQQRMMRW-PGSEPFVMWGYREGPEEST 206 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 +RI+ A +++ A+ ++P+ + V + EPH P+ +YL++Y + + D Sbjct: 207 PEYRIAEMASRMMKRFAKGEQPWHLEVHFVEPHDPYMPLKQYLDRYDPRSIPVPKSFADT 266 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVI 280 A KP HR ++ DD Y+A + +D QIGRV+ AL Q + T V Sbjct: 267 FAGKPGLHRRESETWGKVTEDDVRQSRAHYYAYAEQLDAQIGRVLKALDETGQADRTLVA 326 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTMMA 337 +T+DHG+M+GAH++ KG Y++ R+P+I+R P Q + +H DL T +A Sbjct: 327 FTADHGDMVGAHRMWIKGWLPYEECYRVPMIVRWPGHVQAGSKSSKLVQTH-DLGHTYLA 385 Query: 338 LADIEKPEILPGENILAV-KEPRGVMVEFNRYEIEHDSFGG--FIPVRCWVTDDFKLVLN 394 A G ++ + +PR + R +I +GG R +TD FK V N Sbjct: 386 AAGARSLPFPDGASLAPLFADPR---RKDWRDDILCAYYGGEYLYTQRIAITDRFKYVFN 442 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 F DE+YD DP+EM N++ D +A M + + M + DP+ Sbjct: 443 GFDYDEMYDLERDPDEMRNVVADSEYARFTGDMQARMYELMARFHDPY 490 >UniRef50_Q7MBV5 Arylsulfatase A n=31 Tax=Bacteria RepID=Q7MBV5_VIBVY Length = 486 Score = 178 bits (451), Expect = 
4e-43, Method: Compositional matrix adjust. Identities = 131/482 (27%), Positives = 215/482 (44%), Gaps = 36/482 (7%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 NF ++ D +M+GCY + T N+DS+AA G RF A+ S VCTP+R LFTG Sbjct: 2 NFALLLMDQTRADMLGCYGHPVVQTPNMDSIAAAGERFEQAFCASSVCTPSRTSLFTGKM 61 Query: 65 ANQSGPWTNNVAPGKN----ISTMGRYFKDAGYHTCYIGKWHL----------------D 104 + G N+ G + + + YIGKWH+ D Sbjct: 62 PSHHGVMCNSDKEGDKCDVPLEDANLISELPNHQHIYIGKWHIGHQKLPQEYGFVGHNFD 121 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 G+ Y G+G +G Y L EK +L + + + I E + H Sbjct: 122 GYAYPGSGVYQNLAFDSVPLNGNRYQEWLQEKGFALPKVSDCTFGNNPNLKIQEFYGLLH 181 Query: 165 R---------ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 + + A+ +++ + ++ F + +++ PH P P Y Y Sbjct: 182 APVEASIPYFLVDEAISHIEKCLQQNQSFTLWMNFWGPHTPCIIPEPYFSMYQPEQVTFD 241 Query: 216 EKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-E 272 E L KPEH++ A+ D+ ++ + Y+ +DD IG++++ L + Sbjct: 242 ESFYHPLIGKPEHYQNIAKMWGVWSLDEEIWRDIVCKYWGYITLIDDAIGQLLDFLKQHD 301 Query: 273 QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHID 330 + ++ ++DHG+ MGAH++I KG M+D R+PLII+ P + D V D Sbjct: 302 LYDGLFLSISADHGDAMGAHRMIEKGEFMFDQTYRVPLIIKDPNASQIGAHYDDLVYLHD 361 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV--RCWVTDD 388 L T +A + PE G+++L + + R I G F P R W T + Sbjct: 362 LTATYADIASSKVPESFDGQSLLPILRQQAGQSVPAREGILAQQNGHFTPYPQRMWRTKE 421 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 +KLV N ELY R+DP EMHNLIDD + +++ M +A+ M + DP ++ + Sbjct: 422 YKLVFNASGRSELYHLRHDPQEMHNLIDDPNYGEIKQSMIEAMYAEMQRYHDPLCTWFYR 481 Query: 449 LR 450 ++ Sbjct: 482 MK 483 >UniRef50_UPI00019126F6 sulfatase/phosphatase n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. M223 RepID=UPI00019126F6 Length = 84 Score = 176 bits (445), Expect = 2e-42, Method: Compositional matrix adjust. 
Identities = 84/84 (100%), Positives = 84/84 (100%) Query: 386 TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY Sbjct: 1 TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 60 Query: 446 QWSLRPWRKDARPRWMGAFRPRPQ 469 QWSLRPWRKDARPRWMGAFRPRPQ Sbjct: 61 QWSLRPWRKDARPRWMGAFRPRPQ 84 >UniRef50_D2MLH4 Sulfatase family protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLH4_9BACT Length = 476 Score = 175 bits (444), Expect = 3e-42, Method: Compositional matrix adjust. Identities = 145/480 (30%), Positives = 221/480 (46%), Gaps = 55/480 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L++ TD Q + + + + T N+D L AEG+ F A+ S +CTP+R+ T Sbjct: 4 KRPNILWICTDQQRYDTIHALGNEHIQTPNLDRLCAEGVAFTHAHCQSAICTPSRSSFLT 63 Query: 62 GIYANQSGPWTNNVA---PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y + N A + + + + DAGY GK HL + G + + Sbjct: 64 GLYPSTVHGNRNGNAYFPANERVQLITKRLADAGYDCGLSGKLHL-ASAWNGEEQRVDDG 122 Query: 119 DADYWF---------DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS-- 167 +W+ +G Y LTE+ + L + N+ + H+ + Sbjct: 123 YRKFWYSHSHNQGIGNGNQYTDSLTEQGMDLGDVFQTKKDGTYGNYRPDMNPQYHQTTWC 182 Query: 168 -NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADF----YYELGEKAQDDL 222 +RA++F++ P D P+LM V+ +PH PF P + AD + E ++ Q L Sbjct: 183 ADRAIEFIESP--HDSPYLMSVNPFDPHGPFDAPDTHKYNPADLPPPIFRESDQQTQTRL 240 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALT-PEQRENTWVIY 281 R +A +P GD ++ Y+ +D+ +GR++NAL QRENT VI+ Sbjct: 241 K------RFFADKEGNPPGDREQHNKASYYGMIALIDENVGRMLNALERTGQRENTIVIF 294 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP----QGERRQVDTPVSHIDLLPTMMA 337 TSDHGEM+G H L KG Y+ + R+PLII P QG R D + +D+ PT+ Sbjct: 295 TSDHGEMLGDHGLTGKGCRFYEALVRVPLIISWPGTFLQGHR--ADGLTALLDIAPTLAD 352 Query: 338 LADI-----EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR------CWVT 386 LA I ++P IL + P +F R E +D F P CW T Sbjct: 353 LAGIPLEWTHGKSLIP---ILTGEHPGHAHHDFVRCEY-YDVVDKFAPHASEKHKPCWAT 408 Query: 387 
----DDFKLVL-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 D +KLV+ + ELYD DP+E HNL D AD++ ++ D+ DP Sbjct: 409 MLRNDRYKLVVYHDEDYGELYDLWEDPDEFHNLWKDPSRADLKYQLTKQNFDHTVICADP 468 >UniRef50_A6DKC5 Putative sulfatase yidj n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC5_9BACT Length = 511 Score = 172 bits (437), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 140/478 (29%), Positives = 209/478 (43%), Gaps = 75/478 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCY-------------SGKPLNTQNIDSLAAEGIRFNSAYTC 48 + PN L +MTD +GCY G + T +ID LA EG+ N+ Y Sbjct: 31 ENPNLLIIMTDEHNFRTLGCYRKLLSKDQAMIWGDGNIVETPHIDKLAEEGVLCNNFYAS 90 Query: 49 SPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 SPVC+PAR +G Y + NN ++ + G + GY T Y GKWHLD Sbjct: 91 SPVCSPARGSFISGQYPQNTPVIDNNTHMSDDVVSFGSILQSHGYTTGYSGKWHLD---- 146 Query: 109 FGTGECPPEW---------DADYWFDGANY---LSELTEKEISLWRNGLNSVEDLQANHI 156 G+ P+W D Y F+ ++ L + +I + G + + N Sbjct: 147 ---GDGKPQWGPERQFGFEDNRYMFNRGHWKKILDTASGPKIGAEKRGTPTYD---VNGA 200 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE--- 213 DE ++N+ +DF+ Q PF +VSY +PH P T Y Y ++ Sbjct: 201 DENTYTTDWLTNKTIDFITQ--HKASPFCYMVSYPDPHGPDTVRAPYDTMYTHMNFQKPK 258 Query: 214 LGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 K QDDL + WA G + Y+ +DD I R++ L + Sbjct: 259 TASKKQDDLPS-------WATT------KRGAANQSQYYGMIKCIDDNIARIMTCLDEQG 305 Query: 274 -RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHID 330 ENT V++TSDHG+M G H +KG + + ++P I+R P+ + V+ +S +D Sbjct: 306 ILENTIVVFTSDHGDMRGEHGRQNKGIPL-EASAKVPFIVRYPKKISSGKIVNEALSGVD 364 Query: 331 LLPTMMALADIE---KPEILPGENILAVKEPRGVM-VEFNRYEIEHDSFGGFIPVRCWV- 385 LPT++ L D E K E G +L K P G V F R E WV Sbjct: 365 FLPTILGLMDKETAGKEEGRDGSQLLHGKVPTGWSDVTFIRGTKEK-----------WVA 413 Query: 386 --TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 TD +KLV+ + L D++N+P+E N I+D ++ V + + Y K DP Sbjct: 414 AITDQYKLVMAPWDEPWLIDKKNNPDETINYINDPQYRSVIRSLAKEMQRYGTKYNDP 471 >UniRef50_C5BVK2 Sulfatase n=11 
Tax=Actinomycetales RepID=C5BVK2_BEUC1 Length = 505 Score = 171 bits (434), Expect = 4e-41, Method: Compositional matrix adjust. Identities = 145/486 (29%), Positives = 225/486 (46%), Gaps = 52/486 (10%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF +TD + +G Y + T N+D+LAA+G F+ YT + +CTPARA L TG Sbjct: 14 NILFFLTDQHRKDTLGAYGNATVRTPNLDALAADGTTFDRFYTPTAICTPARASLLTGAA 73 Query: 65 ANQSGPWTN---NVAPGKNIS----TMGRYFKDAGYHTCYIGKWHLDGHDYFG----TGE 113 + N NV + +S T +AGYH +GKWH+ H G G Sbjct: 74 PFRHKLLANYERNVGYQEELSEGQFTFSEDLAEAGYHLGLVGKWHVGTHRTAGDLGFDGP 133 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWR----------NGLNSVEDLQANHIDETF--T 161 P W D A+YL+ L E ++ +R NG + +L A + + T Sbjct: 134 HLPGWHNP--VDHADYLAYLEENDLPPYRISDEVRGTFPNG--APGNLLAARLHQPLEAT 189 Query: 162 WAHRISNRAVDFLQQPAR----ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 + + ++ RA+D L+ AR + PF + + PH P+ P EYL+ Y EL Sbjct: 190 FEYFLAERAIDLLRTYARDHRTSGRPFFLATHFFGPHLPYILPSEYLDMYDADDVELPLS 249 Query: 218 AQDDLANKPE-HHRLWAQAMPSPVGDDGLYHH-PLYFACNDFVDDQIGRVINALTP-EQR 274 + A KP A +GD+ Y+ VD Q+GR+++A Sbjct: 250 VAETFAGKPPVQGNYSAHWTFDTLGDETSRKLIAAYWGYVTLVDSQVGRILDAARELGVY 309 Query: 275 ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH-IDLLP 333 ++ V +++DHGE GAH+L KG AMY+DI IP I++ P G Q ++H IDL Sbjct: 310 DDAAVFFSADHGEFTGAHRLHDKGPAMYEDIYTIPGIVKLPGGVPGQRSDRLAHLIDLTA 369 Query: 334 TMMALADIEKPEILPGENILAVKEPRG--------VMVEFNRYEIEHDSFGGFIPVRCWV 385 T++ +A + + G + + RG ++ EF+ + H P R V Sbjct: 370 TILDVAGRDPARAVDGVPVTPLV--RGEETPWREDLVAEFHGHHFPH-------PQRMLV 420 Query: 386 TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 T+ +KLV+N + +ELYD DP+E+ N A VR+++ L + + D F + Sbjct: 421 TERWKLVVNPESVNELYDLVRDPDELQNRYTHPETAAVRAELLGRLYRQLRERGDNFYHW 480 Query: 446 QWSLRP 451 S+ P Sbjct: 481 MTSMYP 486 >UniRef50_Q029P1 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q029P1_SOLUE Length = 467 Score = 171 bits (433), Expect = 6e-41, Method: Compositional matrix adjust. 
Identities = 129/450 (28%), Positives = 210/450 (46%), Gaps = 35/450 (7%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D + +GCY + T N D LA EG+RF +A+ +P C P+R L TG Y Sbjct: 27 NLLVITNDQHRADCLGCYGNPVIRTPNTDRLAGEGVRFGNAFVHAPQCVPSRVSLHTGRY 86 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF 124 + TN+ ++ T+ + GY T +G+ Y G + + +Y Sbjct: 87 PHVHRVPTNSYDLPESEQTLAKVLNANGYRTACVGEMPFAPRAYTGGFQQVLASNREYDQ 146 Query: 125 DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPF 184 A + + + + + +DL DET +A A DFL+ A D PF Sbjct: 147 FLAGHGLKFPKSDGPFQAAPVPWTDDL-----DETAFFA----GHARDFLK--ANRDRPF 195 Query: 185 LMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDG 244 + +++ PHHPF P + + Y + ++ANKP + A+ + VG D Sbjct: 196 FLDINFRRPHHPFNPPAPFDKMYLGAAFPPSHARPGEMANKPPQQK---AALENSVGFDL 252 Query: 245 LYHHPL--------YFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHKLI 295 P Y+ D IG V++ L + E+ T V++ +DHGEM+G H L+ Sbjct: 253 RSMTPADLDRVKAYYYGMISENDKYIGTVLDELKSQGLEDRTVVVFNADHGEMLGDHGLL 312 Query: 296 SKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 KG+ MYD +T++PLI+R+P R VD V +D++PT++ L I+ P + G++++ Sbjct: 313 FKGSYMYDGVTQVPLILRAPGKLPARTVVDGLVEEVDVMPTLLELLGIDVPAGVQGKSLV 372 Query: 354 AVKEPRGVMVEFNRYEIEHDS-FGGFIPVRCWVTDDFKLV-LNLFTSDELYDRRNDPNEM 411 + + N D+ F F ++ T ++KLV N ELY DP+E+ Sbjct: 373 PLAD--------NPKARHKDAVFAEFPTIKMARTREWKLVHYNKAKYGELYHLTEDPHEL 424 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 NL DD ++A + M L D++ DP Sbjct: 425 TNLYDDPKYAPASADMQGLLADWLATSTDP 454 >UniRef50_C0G116 Sulfatase n=1 Tax=Natrialba magadii ATCC 43099 RepID=C0G116_NATMA Length = 499 Score = 171 bits (432), Expect = 8e-41, Method: Compositional matrix adjust. 
Identities = 146/494 (29%), Positives = 215/494 (43%), Gaps = 61/494 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYS--GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 RPN L V+TD + + + + T+ ID L+A G F A+T +C+ ARA L Sbjct: 7 RPNVLLVLTDQERYDCSALDGPVAETVETETIDHLSATGTHFERAFTPISICSSARASLL 66 Query: 61 TGIYANQSGPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD------- 104 TG + + G N N+ PG + T DAGYH Y GKWH+ Sbjct: 67 TGQFPHGHGMLNNCHEDDALQPNLPPG--VPTFSEKLDDAGYHLTYTGKWHVGRDQTPED 124 Query: 105 -GHDYFGTGECPPEWDADYWFDG--ANYLSELTEKEI-SLWRNGLNSVEDLQAN------ 154 G Y G G D D F A + + E ++ + G N +D Sbjct: 125 FGFSYLG-GSDKHHDDIDDAFREYRAERGTPVGEADLDDVIYTGTNPRDDSNGTFVAATT 183 Query: 155 --HIDETFTWAHRISNRAVDFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADF 210 ++ET W ++ R +D +++ A D PF + PHHP+ P Y Y Sbjct: 184 SVEVEETRAWF--LAERTIDAIEEHASRDRDAPFFHRADFYGPHHPYVVPEPYASMYDPE 241 Query: 211 YYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINA 268 +L E + A KP H + D ++ + Y+ +DDQ GR+++A Sbjct: 242 NIDLPESYAETDAGKPRVHANYRSYRGVEQFDRDVWKEAIAKYWGFVTLIDDQFGRILDA 301 Query: 269 L-TPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ----GERRQVD 323 L + + T V++ SDHG+ G H+ +KG MYDD IPL +R P G R+ Sbjct: 302 LESTGLTDETVVVHASDHGDFAGGHRQFNKGPLMYDDTYHIPLQVRWPGVTEPGSVRE-- 359 Query: 324 TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVE----------FNRYEIEHD 373 PV DL T + + + PE +++ + + G E F +Y H Sbjct: 360 EPVHLHDLAATFLEMGGVAIPESFDSRSLVPLLDADGPEQESAPSAWPDSVFAQY---HG 416 Query: 374 SFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 G R TD +K V N DELYD DP E+ NLID +ADVR ++ L+D Sbjct: 417 DEFGLYTQRMVRTDRYKYVYNAPDVDELYDLEADPAELQNLIDHPDYADVRRELRTRLID 476 Query: 434 YMDKIRDPFRSYQW 447 +M++ DP R QW Sbjct: 477 WMEETDDPNR--QW 488 >UniRef50_C6J2Z0 Sulfatase n=4 Tax=Firmicutes RepID=C6J2Z0_9BACL Length = 502 Score = 170 bits (430), Expect = 1e-40, Method: Compositional matrix adjust. 
Identities = 142/491 (28%), Positives = 224/491 (45%), Gaps = 62/491 (12%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + +D Q N +G ++ + L+T N+D L EG F AY +P CTP RA + Sbjct: 1 MKKPNILLITSDQQHWNTIGAFNPE-LSTPNLDRLVQEGTTFTRAYCPNPTCTPTRASII 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH---LDGHDYFGTGECPPE 117 TG+Y +Q G WT ++ +G FK+AGY T +GK H L G++ + + E P Sbjct: 60 TGLYPSQHGAWTLGTKLLEDRPVVGTNFKEAGYRTALVGKAHFQPLMGNEEYPSLESYPL 119 Query: 118 W-DADYW---------FD--------------GANYLSELTEKEISLWRN------GLNS 147 D DYW FD G +Y + EK + WR+ G S Sbjct: 120 LQDLDYWRQFSDSFYGFDHVELARNHTNEAHVGQHYAIWMEEKGCTNWRDYFLPPTGTMS 179 Query: 148 VEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 + L + E + + I+ R L+Q +E F + S+ +PH P+ + Y Sbjct: 180 PKQLHRWDLPEEYHYNTWIAERTNALLEQYKNNNESFFLWASFFDPHPPYLVSEPWDTMY 239 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMP--SPVGDDGL----YHHPL----------- 250 + E + + N P H + Q P S + G YH L Sbjct: 240 DPESLTIPEVSPGEHDNNPPHFGMTQQKSPDFSAWKETGQAIHGYHSHLMPESERKQLVA 299 Query: 251 -YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRI 308 Y+ +D IGR+++ L E+T V++T+DHG G H L +KG MY+D+ ++ Sbjct: 300 TYYGMISMMDKYIGRILDRLDELGLAEDTIVVFTTDHGHFFGQHGLQAKGGFMYEDLIKL 359 Query: 309 PLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV-KEPRGVMVEF 365 P I+R P Q D VS +DL PT ++ A + P + G + AV + + Sbjct: 360 PFIVRYPGKVPANVQSDALVSLVDLAPTFLSFAGLPIPVWMTGVDQSAVWTGSKSSARDH 419 Query: 366 NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVR 424 E H+ I R +V +KL + EL+D + DP E++N +D +A+++ Sbjct: 420 IICEFRHEP--TTIHQRTYVDQRYKLTVYYNQPYGELFDLQEDPGELNNRWNDPSYANLK 477 Query: 425 SKMHDALLDYM 435 S++ LL Y+ Sbjct: 478 SEL---LLKYV 485 >UniRef50_C9L4R7 Putative sulfatase YidJ n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4R7_RUMHA Length = 458 Score = 167 bits (423), Expect = 8e-40, Method: Compositional matrix adjust. 
Identities = 124/455 (27%), Positives = 199/455 (43%), Gaps = 34/455 (7%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D +++ CY GK + T NID LA G+ + YT S VCTP+R FTG Y Sbjct: 3 NVLIIHVDQLRRDVLSCYGGKEVQTPNIDFLAENGVLLENFYTPSAVCTPSRGCFFTGNY 62 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG--TGECPPEWDADY 122 +++G + N + +++ F AGYHT Y+GKWHL H G GE P Sbjct: 63 PHENGAYRNGIPVKRDVHGFAEVFAKAGYHTGYLGKWHLADHKERGDDLGEYNP-----L 117 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 F+ +Y E + + NG + N T W +++ + FL ++ + Sbjct: 118 GFEDWDYKVEFGHCKSVAYENGKVRPKREVGNDKSYTTDW---LTDETIRFLNNQLKSTQ 174 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGD 242 PFL VS +PH PF Y + E+ E + W + P G Sbjct: 175 PFLFTVSIPDPHQPFEVRPPYDTMFDPLKVEIPESFWEKEIPDWAERDTWGRLHYYPYGL 234 Query: 243 DGLYHH-----PLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLIS 296 H Y +DD +GR+I L ENT V++T+DHGE MG H L+ Sbjct: 235 FEREGHLRRLKAQYLGAVKCIDDNVGRIIQCLKDTGLWENTMVVFTTDHGEYMGEHGLME 294 Query: 297 KGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN--- 351 K +Y+ + IP +I P + + R+ +T ++ +D PT+ + I P + G++ Sbjct: 295 KN-NLYESVYHIPCVISMPWKKIQERRCNTWINVVDFAPTLAGMLGIPYPFKVQGKDLST 353 Query: 352 -ILAVKEPRGVMV----EFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 +L +E ++ + R I F + W ++F L+D R Sbjct: 354 YLLENRETEQILYIHPSDVPRAGILTPEFELAYVGKGWCEEEFH-------DHILFDMRK 406 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 DP +M N+ +A V+ + + L + ++I P Sbjct: 407 DPLQMTNVFGKPEYAKVQKMLTEKLKRHFEEIGTP 441 >UniRef50_Q15XH4 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH4_PSEA6 Length = 517 Score = 166 bits (421), Expect = 1e-39, Method: Compositional matrix adjust. 
Identities = 132/480 (27%), Positives = 220/480 (45%), Gaps = 60/480 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN L +M D + Y G P+ T NID+LA +G RF++A T + +C+P+RA LFT Sbjct: 33 KQPNVLVLMFDDMRFDTF-SYRGGPVPTPNIDALANDGTRFDNAMTTTGLCSPSRAALFT 91 Query: 62 GIYANQSGPWTNNVAPGKNISTMG-------RYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G + +++G N ++ + R D GYH Y+GKWHL + Sbjct: 92 GRWGHKTGLDDNVGLYHSHVDELSEEEGGVIRRAADTGYHVGYVGKWHL-------GPQG 144 Query: 115 PPEWDADYWFDGANYLSELT------EKEISLWRNGLNSVEDLQANH-----IDETFTWA 163 P AD+ + + + + EK+ + + ++ H + T+ + Sbjct: 145 PALRGADFMWGKEHSQARHSRPYVPYEKQAKMAQYNRGERDENGEKHEYYQTLPGTYETS 204 Query: 164 HRISN--RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 H N L++ A+ DEPF V+S+++PH P+ P E YA + Sbjct: 205 HTAENVDMGQKMLREAAKMDEPFFGVISFEQPHPPYRVP----EPYASMF-------DPK 253 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHH--------------PLYFACNDFVDDQIGRVI- 266 P +H + Q P +D H Y+ +D +G +I Sbjct: 254 TVKLPANHAVKRQFKPMAQDEDWWPWHDVGHMTDMDWRKSRTFYYGAIAMIDHAVGDIIK 313 Query: 267 NALTPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPV 326 A ++ +I D G M+G H L KG YD++ R+PLIIR+P E R V+ V Sbjct: 314 TAKDVGMYDDLTIIVLGDQGSMLGEHNLYDKGPYAYDELMRMPLIIRAPNVEPRIVNKQV 373 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF----GGFIPVR 382 S +D+ PT+ + +E + G +++ + E +G + + R + ++ GG+ +R Sbjct: 374 SMLDIAPTISEMMSLEPDGDVDGRSLVNLME-QGDIADKGRVDQALYAYEWYNGGWFGIR 432 Query: 383 CWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 T + K V N + DELYD +NDP E++NLI D ++ M + D + +I+DP Sbjct: 433 ALRTPEMKFVWNPGDNRDELYDLKNDPIEVNNLIKDKKYTKQLRHMVQLMEDELVRIKDP 492 >UniRef50_Q482B9 Sulfatase family protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q482B9_COLP3 Length = 511 Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust. 
Identities = 137/486 (28%), Positives = 227/486 (46%), Gaps = 60/486 (12%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+ D N +G Y + + NID+LA +GIRF+ AY+ SP+CTP+R+ TG+Y Sbjct: 42 NVLFITID-DLNNDLGAYGHHLVKSPNIDALAKKGIRFDKAYSQSPMCTPSRSSFMTGLY 100 Query: 65 ANQSGPWTNNVAPGKN-------------ISTMGRYFKDAGYHTCYIGK-WHLDGHDYFG 110 +Q+G +A G + ++T+ + FK+ GY + +GK +H + G Sbjct: 101 PDQTGI----IAHGSHTQMTAHFREHIPKVTTLPQLFKNNGYFSGRVGKIYHQGVPNQIG 156 Query: 111 TGECPPEWDADYWFDGANYLS---ELTEK-----EISLWRNGLNSVEDLQANHIDETFTW 162 T DA W + N + ++ +K E +L R V A D+ Sbjct: 157 TSGAD---DAASWHETVNPIGLDKDVEDKIIAFNEKALVRQSFGGVLSFLAIGDDDKAHT 213 Query: 163 AHRISNRAVDFLQ--QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD 220 +++ ++ ++ P + +PF + + PH PF P +Y + Y EK + Sbjct: 214 DGKVATETINMIKDHHPDKTGKPFFIGAGFYRPHTPFVAPKKYFDLYPL------EKIKP 267 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHPL------YFACNDFVDDQIGRVINALTPEQ- 273 +A K + + A+ G GL + Y+A +VD Q+GRV++AL + Sbjct: 268 YIAPKNDRKDIPDIALQDREGQVGLTLNQRKQIIQGYYAAVSYVDAQVGRVLDALKQQDL 327 Query: 274 RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDL 331 +NT V++ SDHG +G H L KG+ +++ R PLII +P + R V +PV +D+ Sbjct: 328 SDNTIVVFLSDHGYELGQHGLWQKGS-LFEGSARAPLIIYAPNVKDNGRVVTSPVELVDI 386 Query: 332 LPTMMALADIEKPEILPGENILAVKEPRGVMVE-------FNRYEIEHDSFGGFIPVR-- 382 PT+ L + PE L G+++ V NR + +++ F F +R Sbjct: 387 YPTLAKLTGLVAPEYLAGKDLTPALNDVDFQVRKGAYSAILNRNKGDNNQFA-FTKIRGH 445 Query: 383 CWVTDDFKLVL--NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 T+ ++ + ELYD +NDP E+ NL D + VR KM L D MD + Sbjct: 446 SIRTNRYRYTEWGEGYFGAELYDHKNDPQELKNLADKVSLESVRIKMKWLLNDAMDDAQK 505 Query: 441 PFRSYQ 446 +S + Sbjct: 506 RIKSIE 511 >UniRef50_C6J3H9 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3H9_9BACL Length = 503 Score = 166 bits (419), Expect = 2e-39, Method: Compositional matrix adjust. 
Identities = 137/508 (26%), Positives = 226/508 (44%), Gaps = 71/508 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNFL + D + + C+ + T NID LA EG+ F AY +PVC P+RA L TG Sbjct: 2 KPNFLVFVVDQMQSRTLSCHGHPDVKTPNIDRLAREGVSFTRAYCNNPVCMPSRASLLTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH----------------LDGH 106 + A Q G TN +A ++ T+ + GY T +GK H +G Sbjct: 62 LTARQHGVLTNGIALSEHFPTLPGVLSEHGYRTHAVGKLHHQPIGSVSREEQMEFSWEGM 121 Query: 107 DYFGTGECPPEWDADYWFDGANYLS-------ELTEKEISLWRNGLNSVEDLQANHID-- 157 ++ +GE Y + +Y+ + ++ G + A + D Sbjct: 122 KFWESGEIRSIPSGYYGYQSVDYVGGHVTCFGDYLRWLEQVYPGGGKKLSKEGAYYADDK 181 Query: 158 ----------ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 E + + H I+ R++DFL+Q ++ D+PF + S+ +PHHPF Y E Y Sbjct: 182 IPMSWRIDLPEEYHYNHWIAERSIDFLEQMSQQDQPFFLWCSFPDPHHPFAACRPYSEMY 241 Query: 208 ADFYYELGEK--AQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRV 265 L E ++D + + R S D + VD IG + Sbjct: 242 DPASLTLPEHWDVEEDGISWLKERRNIHPDYTSFDEHDLREILAQTYGMISHVDKTIGEI 301 Query: 266 INALTP-EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-- 322 L E +NT +++ +DHGE +G+H LI+KG ++++ R+P I + P+ ++ Sbjct: 302 TKKLKELELDQNTVIVFLADHGEYLGSHHLITKGEWPWEELIRVPFIWKIPESMKKGYLN 361 Query: 323 DTPVSHIDLLPTMMALADIE------------KPEILPGENILAVKE------PRGVMVE 364 + VS +D +PT++ LA IE +P LPG ++ + E P +VE Sbjct: 362 EQVVSLLDFVPTILDLAGIEPAVMDVRGVQYTEPLGLPGRSLRPIIEQGDVLPPGPAIVE 421 Query: 365 FNRYEIEHDSF-GGFIPVRCWVTDDFKLVLNLFTSDE-LYDRRNDPNEMHNLIDDIRFAD 422 ++ D F +R VT+ +K+ + L T D LYD + DP E NL D FA Sbjct: 422 YDE-----DWFPPNVCRMRTIVTERYKMTVYLNTEDGLLYDLQEDPYEQKNLWFDPSFAR 476 Query: 423 VRSKMHDALLDYMDKIRDPFRSYQWSLR 450 V+ + + +L R+ R+ +W + Sbjct: 477 VKHILTEQML------RELVRTDRWDTK 498 >UniRef50_A6DLX7 Putative sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLX7_9BACT Length = 502 Score = 165 bits (417), Expect = 4e-39, Method: Compositional matrix adjust. 
Identities = 138/510 (27%), Positives = 230/510 (45%), Gaps = 40/510 (7%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 M +PN L++M+D N G Y+G P + T N+D LA EG+ F + +P+C+P+R Sbjct: 1 MSKPNVLWLMSDQHNANCTG-YAGNPNVKTPNLDDLANEGVEFEQGFCNNPICSPSRLSF 59 Query: 60 FTGIYANQSGPW--TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------DGHDYFGT 111 TG+Y N G NN N +T+ F+ GY T +GK H+ +G +Y Sbjct: 60 ITGLYTNNHGYLGNRNNDVTTPNPNTLSSLFRRFGYQTGLVGKSHMITGWDKEGFEYIRY 119 Query: 112 GECPPEWDAD----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 + D D ++FD E + G +++ Q + + H Sbjct: 120 TDMCDADDNDPHTCHYFDYLAQRGLADHYEEGSPKEGQQTLDGSQPASLPYKHSIEHYTG 179 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN--- 224 N++++FL+ + D PF + +S+ PH P T E + Y L E D N Sbjct: 180 NKSLEFLENRDQ-DRPFFLKMSFQRPHDPITPAPEDFDMYNPEDIVLPESISDLFENKFV 238 Query: 225 -KPEHHRLWAQA---MPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPE-QRENT 277 KP+ + + P V D+ L Y+A +D++IGRVI+ L + +NT Sbjct: 239 GKPQFMQDYVANPGDYPMCVADEAKLKRALASYYALITKIDEEIGRVIDHLKETGEYDNT 298 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP-VSHIDLLPTMM 336 + YT+DHG+ G H L K +Y+ I RIP +++ P G + V +D T+ Sbjct: 299 IIFYTADHGDFAGEHGLFLKNLGIYESIHRIPFLLKWPGGPTGVKNKELVESVDWYATLC 358 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 L +I+ P+ + G +++ V +G + EH + T ++LV Sbjct: 359 DLCNIQAPDNVDGRSLVPVA--KGEAKGSDAIICEHHTSTAI------RTKQYRLVYYRE 410 Query: 397 TSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY-MDKIRDPFRSYQWSLRPWRK 454 T + ELYDR NDP+E++NL + +R + +L+Y M R + + RK Sbjct: 411 TGEGELYDRGNDPDELNNLWSHADYQSIRMDLMQQVLNYHMSYQRKTYNELDQVINKKRK 470 Query: 455 DARPRWMGAFRPRPQDGYSPVVRDYDTGLP 484 + A + + YS +++ Y+T P Sbjct: 471 HS----FSALLQKEKAYYSDLIKVYETKKP 496 >UniRef50_A7LY81 Putative uncharacterized protein n=5 Tax=Bacteroides RepID=A7LY81_BACOV Length = 517 Score = 165 bits (417), Expect = 5e-39, Method: Compositional matrix adjust. 
Identities = 134/499 (26%), Positives = 227/499 (45%), Gaps = 62/499 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLN-TQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN + +MTD Q ++ G G PL T +D LA E + FN AYT P +PAR +FT Sbjct: 25 KPNIVVIMTDQQRADLCG-REGFPLEVTPFVDRLAQENVWFNKAYTVMPASSPARCSMFT 83 Query: 62 GIYANQSGPWTNNVAPGKNIST-MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G + + + TN+ P + + K+ GY T +GK H Y D Sbjct: 84 GRFPSATHVRTNHNIPDISYQQDLVGVLKENGYKTALVGK----NHAYLKPA------DL 133 Query: 121 DYWFD----GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 D+W + G + + EKE + + N + L+ + I +I N A+ +++Q Sbjct: 134 DFWSEYGHWGKHKKTTPAEKETARFLNQQARGQWLEPSPISLEEQHPTKIVNEALAWIKQ 193 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ-- 234 + + PF + VS+ EPH+P+ Y ++ + + ++ DLA K E +R+ AQ Sbjct: 194 --QKENPFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLKTSRKDLAKKGEKYRILAQLE 251 Query: 235 -AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMGAH 292 A + D Y +DDQI R+I +L Q ENT + SDHG+ G + Sbjct: 252 DASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASGQYENTLFVVLSDHGDYWGEY 311 Query: 293 KLISKGAAMYDDITRIPLIIRS--PQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 LI KGA + + + RIP++ + + +D+ VS DL PT + E P + G Sbjct: 312 GLIRKGAGLSESLARIPMVWAGYHIKNQPAPMDSHVSIADLFPTFCSAIGAEIPAGVQGR 371 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGG------------------------FIPVRCWV- 385 ++ + + E + FGG F + W Sbjct: 372 SLWPMLTGKAYPKEEFSSMVVQQGFGGADVGLDASLTFEQEGALTPGKIAHFDELNTWTQ 431 Query: 386 --------TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 DD+KLV+N + + ELY+ + DP+E+HNL + +++++++++ LL + + Sbjct: 432 SGTSRMIRKDDWKLVMNHYGNGELYNLKKDPSEVHNLFGEKKYSEIQTELLTRLLAWELR 491 Query: 438 IRDPF----RSYQWSLRPW 452 ++DP R Y + P+ Sbjct: 492 LQDPLPLPQRRYHFKQNPF 510 >UniRef50_C5BXT8 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5BXT8_BEUC1 Length = 497 Score = 163 bits (412), Expect = 2e-38, Method: Compositional matrix adjust. 
Identities = 139/482 (28%), Positives = 206/482 (42%), Gaps = 64/482 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L + TD + VG + G T N+ +LA G F Y + VC+P RA + T Sbjct: 14 SRPNILVICTDQHRFDAVGTHPGSAAITPNLVALAERGAVFEQCYAPNTVCSPTRASMLT 73 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL-------------DGHDY 108 G Y + G W N V + S + R DAGY T +GK+HL DG + Sbjct: 74 GEYPSSHGLWANGVTLPEGRSLVSRELADAGYRTGLVGKFHLASAFEGRTEERLDDGFET 133 Query: 109 FGTGECPPEWDADYWFDGA---NYLSELTEKEISLWRNGLNSV----------EDLQANH 155 F P F GA Y L E+ +LW + V E+ + + Sbjct: 134 FAWAHDP--------FHGAPENAYHRWLRERHPALWAEAMGDVVTPDVENFAHENTRFDE 185 Query: 156 IDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 + +++ ++ DFL+ D PF ++ +Y PHHPF P EYL+ Y Sbjct: 186 MPAHASYSTWVTEEVGDFLR--TEDDRPFFLLANYFAPHHPFAAPQEYLDLYPPGSVPPP 243 Query: 216 EKAQDDLANKPEHHRLWAQAM-----PS-----PVGDDGLYHHPLYFACNDFVDDQIGRV 265 D+LA KP ++A PS P G D + Y A +DD +GR+ Sbjct: 244 VGGPDELATKPTLQSEASRASYVGHGPSFADFTPEGIDEIRR--TYHAMVSQIDDGVGRI 301 Query: 266 INALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR----SPQGERR 320 + L + E +T V++ SDHGEM+G H L+ KG MYD R+PL++ P G R Sbjct: 302 LRTLREQGLERDTLVVFVSDHGEMLGDHALLLKGPMMYDPAVRVPLVVSWPDLVPAGHR- 360 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP 380 V V D+ T+ A +E G +++AV + E + P Sbjct: 361 -VTDFVGVHDVAHTIRCAAGLEPYARDQGLDLVAVAREEREARTYAWAEYRDSGYPYDPP 419 Query: 381 VRC--WVTDDFKLVL-------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + D K+V+ + ELYD +DP+E+ N DD +A R ++ A+ Sbjct: 420 AHTTMYRRHDSKVVVWHGDPDAGRPATGELYDLADDPDELVNRWDDPAYARRRLELCAAV 479 Query: 432 LD 433 D Sbjct: 480 SD 481 >UniRef50_D2RQH7 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RQH7_9EURY Length = 498 Score = 162 bits (411), Expect = 2e-38, Method: Compositional matrix adjust. 
Identities = 141/490 (28%), Positives = 210/490 (42%), Gaps = 58/490 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN LFV+TD + + G P+ T +D L++EG+RF+ A T +CT ARA L T Sbjct: 4 SRPNVLFVLTDQERYDCTAP-EGPPVETPAMDRLSSEGMRFSRACTPISICTSARASLMT 62 Query: 62 GIYANQSGPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD-------- 104 G++ + G N N+ P + T + GY Y GKWH+ Sbjct: 63 GLFPHGHGMLNNSHEADAIRPNLPP--ELPTFSELLAENGYDCSYTGKWHVGRDQTPEDF 120 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLW---------RNGLNSVEDLQANH 155 G Y G G D D F + E+ L R+ Sbjct: 121 GFAYLG-GSDKHHDDIDEAFREYREERGVPPGEVDLEEVLYTGDDPRDASEGTFVAATTP 179 Query: 156 IDETFTWAHRISNRAVDFLQQPARAD---------EPFLMVVSYDEPHHPFTCPVEYLEK 206 +D T A+ ++ R +D ++ A D +PF + PHHP+ P Y Sbjct: 180 VDVEETRAYFLAERTIDAIEAHADGDSGEGDGNGSDPFFHRADFYGPHHPYVVPEPYASM 239 Query: 207 YADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGL-YHH-----PLYFACNDFVDD 260 Y + E + KP+ H + G DGL + H Y+ +DD Sbjct: 240 YDPNEIDPPESYAETYDGKPQVHENFHYYR----GADGLEWDHWAEATAKYWGFVSLIDD 295 Query: 261 QIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER 319 Q+ R++ AL + T V++ SDHG+ +G H+ +KG MYDD RIPL +R P Sbjct: 296 QLERILEALEEHGLADETAVVHASDHGDFVGNHRQFNKGPLMYDDTYRIPLQVRWPGVAE 355 Query: 320 --RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKE----PRGVMVEF--NRYEIE 371 + PV DL T + + ++ PE +++ + E P V ++ + + Sbjct: 356 PGTTCEVPVHLHDLAATFLEMGGVDVPESFDSRSLVPLLETGDDPDAVPDDWPDSTFAQY 415 Query: 372 HDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 H G R T +K V N DELYD + DP E+ NLID +ADVR +M D L Sbjct: 416 HGDEFGLYTQRMVRTGRYKYVYNGPDIDELYDLKADPAELQNLIDHPGYADVREEMRDRL 475 Query: 432 LDYMDKIRDP 441 +D+M + DP Sbjct: 476 VDWMQETDDP 485 >UniRef50_C6J5I7 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I7_9BACL Length = 522 Score = 161 bits (407), Expect = 7e-38, Method: Compositional matrix adjust. 
Identities = 136/493 (27%), Positives = 215/493 (43%), Gaps = 80/493 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ TD Q + +GCY + T N+D LAA G F +A+ P+C P+RA L T Sbjct: 15 ERPNILFIHTDQQRADSLGCYGNTVIRTPNLDQLAASGTLFENAHCTHPLCMPSRATLLT 74 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD----------YFGT 111 G Y + + N + + T+ +GY T IGK H + Sbjct: 75 GRYMHAHRLYRNGIPLSQQEQTIAHLLSKSGYATGLIGKAHFTPYKGDPKVNPESVQINN 134 Query: 112 GECPPEWDADYW--FDGANYLSELTEKEISLWRNGLNSV------------------EDL 151 G P E A YW F+G Y + + + G+ +D+ Sbjct: 135 GVAPEECWA-YWRQFEGPYYGFDHVQMSMGHGDYGMKGGHYGLWVHEQHPDKVPLFDQDI 193 Query: 152 QANHIDETF-TWAHRI----------SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCP 200 D + +W + +N+A++F++Q D PF + Y EPH PF P Sbjct: 194 HGEPSDGVYRSWKSAVPLEIHSSTWTTNKAIEFIKQ--NKDRPFYAWIGYQEPHEPFNPP 251 Query: 201 VEYLEKY--ADFYYELGEKAQDDLANKPEHHRL------WAQAMPSPVGDDGLYHHPLYF 252 Y + Y + +G + + PEH + W V + + H Y+ Sbjct: 252 RPYCDMYDPQEILLPVGRDGEWG-SESPEHVQYYLNRGKWKDIREEKVRE--IIAH--YY 306 Query: 253 ACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLI 311 C +DD IGR++ L E +NT +I+TSDHGE +G H L KGA +TRIPL+ Sbjct: 307 GCVSMIDDCIGRLMKTLEEEGLADNTIIIFTSDHGEWLGDHGLWLKGAVHARGLTRIPLM 366 Query: 312 IRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV-------------- 355 I+ P R+V S ID++PT++ A E P + G ++ +V Sbjct: 367 IKWPGTAVSGRRVSNVASLIDVMPTLLDAAGAEIPYGVQGTSLRSVLAGEQDKVRDYALI 426 Query: 356 ---KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL-VLNLFTSDELYDRRNDPNEM 411 EP + ++ + E+ + ++ VTD ++L + ELYD + DP+E+ Sbjct: 427 EHRHEPYHLNIQLEKEELVINKGTEEWHMKTIVTDRYRLSYIPSAQYGELYDHQTDPDEL 486 Query: 412 HNLIDDIRFADVR 424 NL D +F ++R Sbjct: 487 INLWD--KFPELR 497 >UniRef50_A4AP83 Putative sulfatase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AP83_9FLAO Length = 467 Score = 160 bits (404), Expect = 1e-37, Method: Compositional matrix adjust. 
Identities = 123/475 (25%), Positives = 215/475 (45%), Gaps = 67/475 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN ++V+ D +G + T N+D LA+EGI F +A + SPVCTP R+ + T Sbjct: 22 KKPNIIYVLADQWRAEALGSNGNPNVITPNLDKLASEGISFTNAISTSPVCTPYRSMMLT 81 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y ++G + N+V+ + + G+ +K+ GY T YIGKWH+DG D Sbjct: 82 GRYPLKNGMFMNDVSLDPDSQSFGKLYKNEGYSTAYIGKWHVDGKGRSAFIPKERRQGFD 141 Query: 122 YW--------FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW----AHRISNR 169 YW ++ +NY W N DE +W A + Sbjct: 142 YWKVLECSHSYNNSNY-----------WGND------------DELHSWEGYDAAAQTKD 178 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 A+ F++ PF +++S+ PH P+ T P E+ + Y + D+ +P Sbjct: 179 AIAFIEAQTENKSPFCLILSWGPPHAPYKTAPKEFQKLYENM----------DIQLRPN- 227 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 +P + ++ Y+A +D I ++ +A+ E NT ++TSDHG+ Sbjct: 228 -------VPVELAENTKAMLKGYYAHCSALDSYIKQLQDAIKRNNLEDNTIFVFTSDHGD 280 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQ---GERRQVDTPVSHIDLLPTMMALADIEKP 344 ++ +H K +Y++ ++P II+ P + R+ D ++ +D+LPTM+ ++ I+ P Sbjct: 281 LINSHTE-RKKQRIYEESAKVPFIIKYPALLGKQGRKSDFLLNTLDILPTMLGMSSIKAP 339 Query: 345 EILPGEN----ILAVKE---PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 E L GE+ IL KE ++ + + GG R +T + +L Sbjct: 340 EGLDGEDISDVILGEKEDNRKAALVACIQPFGQWKRTLGG-KEFRGVITKRYTYAKDLSG 398 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 +D DP +++NL+ + F V + + L +D++ D F L W Sbjct: 399 EWLFFDNVEDPYQLNNLVGNPSFKSVAENLEELLDKELDRLDDDFLPGASYLETW 453 >UniRef50_C5HLB2 Putative sulfatase n=1 Tax=uncultured bacterium FLS12 RepID=C5HLB2_9BACT Length = 503 Score = 159 bits (402), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 130/467 (27%), Positives = 209/467 (44%), Gaps = 53/467 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + VM D + Y G L T + D LA+EG F+ A T SP+CTP+R +T Sbjct: 7 RTPNLVMVMVDQLQAQRMKLYGGTDLLTPHFDRLASEGALFSQAITTSPLCTPSRISFWT 66 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-LDGH---DYFGTGECPPE 117 G Y + G N P ++ + K AGYHT IGK H G D F Sbjct: 67 GQYPSAVGGMNNGPLPLTDVPHLPGMLKAAGYHTALIGKNHCFRGEVVADLFDA-----T 121 Query: 118 WDADYWF-----DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 WDA + D + L+ ++ R + DL +H+ T R + + Sbjct: 122 WDAGHGGAQGGKDDPDILAYERTAQLEFLRMCHGRIVDL-PDHVTTTA----RATKNGLA 176 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 +L++ + D+PF + +SY EPH PF + + Y L E + D+++KP H + Sbjct: 177 WLEE--QGDDPFFLWLSYPEPHSPFVTTRNWADLYDPAKLTLPESWRSDISDKPAHFQEL 234 Query: 233 AQAMPSP-VGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 + M +P V DD L +Y+ +DD +G+V++ L + ++T V++ SDHGE + Sbjct: 235 HELMGAPAVSDDELRELTQIYYGMASQIDDGLGQVLDCLERKGLADDTIVVFVSDHGEYI 294 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 G++ ++ K A + + + R+PL IR P D PV H D++PT+ L + P+ + Sbjct: 295 GSNYMLQKSAHLPEALIRVPLAIRWPGHVPSGAVYDDPVEHHDMMPTLCTLMGFDVPDSV 354 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDS--------------------------FGGFIPV 381 ++ + + + + EI H + FG + V Sbjct: 355 QAADLTPLFDGKPFARDAAYSEIGHHADREMTREKTYAPDLPWAEARAFYHFVFGHYAHV 414 Query: 382 -RCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 R T K V + ELYD NDP EM NL A++ + + Sbjct: 415 GRGIRTRTHKYVAYEYGEKELYDLANDPEEMVNLAGKPAAAEIEADL 461 >UniRef50_C3WAQ9 Sulfatase n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WAQ9_FUSMR Length = 523 Score = 159 bits (402), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 138/509 (27%), Positives = 219/509 (43%), Gaps = 80/509 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L + +D Q N +G ++ K + T N+D L EG F+ AY +P CTP R + T Sbjct: 3 KRPNILLITSDQQHFNTIGAFN-KEIITPNLDRLVREGTTFDRAYCPNPTCTPTRGSIIT 61 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH---LDGHDYFGTGEC-PPE 117 G Y +Q G WT V + +T+G F+ AGY + IGK H L F + E P Sbjct: 62 GKYPSQHGAWTLGVKLPETENTIGNEFRKAGYKSALIGKAHFQPLASTLEFPSLEAYPCL 121 Query: 118 WDADYW---------FD--------------GANYLSELTEKEISLWR------------ 142 D ++W FD G +Y L EK WR Sbjct: 122 QDLEFWRSYKGIFYGFDHVELARNHTNEAHVGQHYALWLEEKGCKNWRDYYLESTGNMSE 181 Query: 143 --------------NGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVV 188 N LNS + I E + + I+ R L++ DE F + Sbjct: 182 KEYPRLEILVEQEGNILNSRRNWGKWEIPEKYHYNTWIAERTNKMLEEYKNNDESFFLWA 241 Query: 189 SYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP-------SPVG 241 S+ +PH + P + Y + + P H + + P + G Sbjct: 242 SFFDPHPEYFVPEPWASMYDPEKLTINGLVPGEHLKNPPHFQKTQEENPNFDEYKETGFG 301 Query: 242 DDGLYHH-----------PLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMM 289 G++ H LY+ +D IG ++N L E+T V++T+DHG + Sbjct: 302 IHGMHSHLQKIEDIKKDLALYYGMVSMMDKYIGEILNKLDELGLAEDTIVVFTTDHGHFV 361 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 G H LI KG Y+D+ ++P I+R P E + + S +DL PT ++ D++ P + Sbjct: 362 GQHGLIRKGPFHYEDLIKVPFIVRYPNHVPENKVSSSIQSLVDLAPTFLSFCDLKIPYDM 421 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTSDELYDRRN 406 G + V E + V + E+ I ++ +V +K+ + T ELYD + Sbjct: 422 TGIDQKKVWENPDLEVR-DHAICENHHEPTTIHLKTYVDKRYKITVYYNKTYGELYDLQE 480 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYM 435 DPNE++NL D+ + +++S++ LL Y+ Sbjct: 481 DPNEINNLWDNEDYKELKSEL---LLKYI 506 >UniRef50_Q7W424 Putative sulfatase n=2 Tax=Bordetella RepID=Q7W424_BORPA Length = 485 Score = 157 bits (398), Expect = 7e-37, Method: Compositional matrix adjust. 
Identities = 130/454 (28%), Positives = 206/454 (45%), Gaps = 37/454 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + +M+D + +GCY + ++T N+D+LAA G RF SAY SPVC PARA TG Y Sbjct: 7 NMVVIMSDEHQSRALGCYGHEFVHTPNLDALAARGTRFASAYCTSPVCIPARASFATGKY 66 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF 124 NQ G W N A ++ + +D + IGK H DY G E + Sbjct: 67 INQIGFWDNADAYDGSVPSWHHMLRDRDHQVVSIGKLHF--RDYGGDHGFSEEIIPMHIV 124 Query: 125 DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL--QQPARADE 182 G L L ++ + + + TF + I +RA +L Q P AD+ Sbjct: 125 GGKGDLMGLVRSDLPVRKGAYKMAQMAGPGESQYTF-YDREIVSRAQIWLREQAPRHADK 183 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGD 242 P+++ VS+ PH P T P E+ +Y + +L D + +P+H + Q Sbjct: 184 PWVLFVSFVSPHFPLTAPPEHYYRY--YNRDLPLPKLYDRSQRPDHP--YQQDYRGSFNY 239 Query: 243 DGLYHHPL-------YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 D + L Y+ F+D+ IG+++ L + ++T V+YTSDHG+ +GA + Sbjct: 240 DDYFDPGLVKKAQAGYYGLCSFLDENIGKLLGTLDDLDILDSTRVVYTSDHGDNLGARGM 299 Query: 295 ISKGAAMYDDITRIPLIIRS---PQGERRQVDTPVSHIDLLPTMMALADIEKP---EILP 348 K + M+++ +PLII P G V+TPVSH+D+ P + P E LP Sbjct: 300 WGK-SNMFEEAAAVPLIIAGRDIPSGV--TVNTPVSHVDVAPFIYDAVGETSPGLKEGLP 356 Query: 349 GENILAVKE----PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDR 404 G ++ + R VM E++ S G +R +K + + +L+D Sbjct: 357 GVSLFQLARGETPQRNVMAEYHGM----GSATGAFMIR---EGRYKYIFYVHYPHQLFDL 409 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 DP E+ +L +ADV + L D + Sbjct: 410 EADPEELEDLAGRPDYADVVALCKQKLWQLCDPV 443 >UniRef50_C3WCE8 Arylsulfatase n=2 Tax=Fusobacterium RepID=C3WCE8_FUSMR Length = 476 Score = 157 bits (396), Expect = 1e-36, Method: Compositional matrix adjust. 
Identities = 129/487 (26%), Positives = 225/487 (46%), Gaps = 57/487 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + TD + +G + K + T N+D +A EG+ F +++ SPVCTP+RAG+FT Sbjct: 6 KKPNIVLITTDQMRADAIGYINSKVI-TPNLDMMAKEGVVFTNSFCSSPVCTPSRAGIFT 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL--------DGHDYFGTGE 113 G Y +G W +N T+ + K GY+ +GK H + ++ + Sbjct: 65 GRYPMNTGAWNIGTCLDENEITLADWLKGEGYYNIGVGKMHFRPQLKDFDNNYEDVEVRD 124 Query: 114 CPPEWDADYW-FD---------GANYLSELTEK----EISLWRNGLNSVEDLQANHIDET 159 E D Y+ FD YL L E E+ +G+N + E Sbjct: 125 RVRERDKTYYGFDETYITEDDKQGKYLDFLDENGYHLEVGKGNDGMNP--------LPEE 176 Query: 160 FTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 F + I ++ + +++ ++P M+ ++ +PHHPF P E + D ++ Sbjct: 177 FNQTYWIGMKSCEAIRK-YDFNKPLFMMTNFVDPHHPFD-PAEKFARMYDGVEIDSPISK 234 Query: 220 DDLAN-KPEHHRLWAQAMPSPVGDDGLYHHPL-----------YFACNDFVDDQIGRVIN 267 D N +PE+ + + P G + H L Y+A F+D +IG++ Sbjct: 235 DKFCNERPEYLKRQGERGYWPGGGE---QHKLSDEKVEEYTRYYYAMITFIDQEIGKIRK 291 Query: 268 ALTPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTP 325 L + + +NT +I+TSDHGE MG + L+ KG MYD++ ++PL+ E+ D Sbjct: 292 ELEKKGELDNTIIIFTSDHGEYMGDYGLLQKGPFMYDNLIKVPLLFWGKGVEKSVTSDEI 351 Query: 326 VSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 V +ID++PT++ L E P + GE++ + + + +D+ I V+ + Sbjct: 352 VENIDIVPTILELIGKEVPYGIQGESLKNILQKIDKERVKKSAIVTYDARDRGIMVKSYR 411 Query: 386 TDDFKLVLNLFTSD---ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 +K LNLF ++ E+YD DP E NL + +++++ M + DP Sbjct: 412 DKRYK--LNLFMNEEYGEMYDLEVDPQETTNLFFKEEYLQLKNELLLKACYRMMECSDPL 469 Query: 443 --RSYQW 447 R+ W Sbjct: 470 SKRTANW 476 >UniRef50_D2QL61 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QL61_9SPHI Length = 489 Score = 156 bits (394), Expect = 2e-36, Method: Compositional matrix adjust. 
Identities = 127/480 (26%), Positives = 218/480 (45%), Gaps = 69/480 (14%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN + VM D +G + + T N+D LA E + PVC+P+RA L Sbjct: 35 VRKPNIVIVMADQWRAQDLGYAGNREVITPNLDKLALESVNAPLCVAEVPVCSPSRASLL 94 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG--HDYFGTGECPPEW 118 TG +A G + N+ T+ + GY T +IGKWH++G F G P Sbjct: 95 TGQHATTHGVFYNDRPLRNEAVTLAEVCQQNGYKTGFIGKWHINGGLAKDFAAGRLAP-- 152 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQA----NHIDETFTW----AHRISNRA 170 + + WR GL D N +++ F W A ++ A Sbjct: 153 -----------IPVDRRQGFEYWR-GLECTHDYNNSPYYNEVNKRFVWQQYDAISQTDSA 200 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 + F+ Q + EPFL+V+++ PH P+ T P EY ++YAD L+ +P Sbjct: 201 ISFMTQSRK--EPFLLVLAWGPPHDPYQTAPKEYRQRYAD----------KTLSLRPN-- 246 Query: 230 RLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 P D + L Y+A + +DD IGR+ AL + ENT ++TSDHG Sbjct: 247 --------VPAKDTMEANRALKGYYAHINALDDCIGRLQAALKGAKLDENTIFVFTSDHG 298 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEK 343 +M+ +H I+K +D+ RIP +++ P G + R +D P++ D++PT+++L+ Sbjct: 299 DMLYSHDQINKQKP-WDESIRIPFLLKYPAGLSRKGRTLDVPITLTDVMPTVLSLSGQTI 357 Query: 344 PEILPGENILA-VKEPR----------GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV 392 P + G+N+ + +++PR +V F+++ G R T + V Sbjct: 358 PASVQGQNVASLIRQPRAPRPDDAALIACIVPFHQWNYGR----GGREYRGIRTARYTYV 413 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 +L LYD + DP ++ NL ++ + A + ++ L + D F++ + W Sbjct: 414 RDLKGPWLLYDNQQDPYQLTNLANEPKLAGTQKQLEGILAQKLRAANDNFQAGNVYMDKW 473 >UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36AAF Length = 468 Score = 155 bits (391), Expect = 4e-36, Method: Compositional matrix adjust. 
Identities = 135/465 (29%), Positives = 205/465 (44%), Gaps = 63/465 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++TD Q +GCY + T N+D LA +G+RF++ + SPVC+PARA L T Sbjct: 4 KKPNVLFILTDDQGIWSMGCYGNSEIQTPNLDKLAKQGVRFDNFFCTSPVCSPARASLLT 63 Query: 62 GIYANQSG-----PWTNNVAPGKNISTMGRY------FKDAGYHTCYIGKWHL--DGHDY 108 G +Q G N A I + + + GY GKWHL GH Sbjct: 64 GKIPSQHGILDYLSGGNGGASQAAIEFLKDHRGYTDILAEEGYTCGLSGKWHLGDGGH-- 121 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 P+ +W+ A+ ++RNG I+E I++ Sbjct: 122 -------PQKGFSFWY--AHQKGGGPYYNAPMFRNG---------QKIEEKGYITDVITD 163 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHP-FTC-PVEYLEKYADFYYELGEKAQDDLANKP 226 A+ F+ + ++PF + V Y PH P C P +Y + Y D +E + + K Sbjct: 164 EAISFIDREKNKEQPFYLSVHYTAPHSPWINCHPKKYTDLYEDCPFETCPQGEVHPWAKT 223 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 E + + S +G YFA +DD +GR++ L E E+T +I++SD+ Sbjct: 224 EVIAGYQKPRESLIG---------YFAAVTAMDDNVGRILKKLEEENLMEDTLIIFSSDN 274 Query: 286 GEMMGAHKLISKGAA-----MYDDITRIPLII--RSPQGERRQVDTPVSHIDLLPTMMAL 338 G G H + KG MYD ++PLI+ + E D S D +PT + Sbjct: 275 GFNCGHHGIWGKGNGTFPLNMYDSSVKVPLIMCHKGHIPENHVCDEMHSGYDFMPTPLDY 334 Query: 339 ADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL- 395 + E LPG++ L+ + E N + D +G PVR + +KLV Sbjct: 335 LGFKNDEADKLPGKSFLSALMGQEQKGEENSV-VVFDEYG---PVRMIRSRKYKLVHRYP 390 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADV----RSKMHDALLDYMD 436 F DE YD DP E +N I+D + DV + +M L Y+D Sbjct: 391 FGPDEFYDLEVDPGEAYNGIEDESYQDVIRDMKKQMELWFLQYVD 435 >UniRef50_UPI0001968556 hypothetical protein BACCELL_00122 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968556 Length = 510 Score = 154 bits (390), Expect = 6e-36, Method: Compositional matrix adjust. 
Identities = 133/488 (27%), Positives = 216/488 (44%), Gaps = 64/488 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LFVM+D +G P+ T N+D A E + F AY+ PV P RA LF+ Sbjct: 46 KKPNVLFVMSDQHRRQALGFMKEDPVITPNLDKFAKEAVTFTRAYSAHPVSGPNRACLFS 105 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE--WD 119 G Y + + N+ + + MG FK +GY T YIGKWHLDGH+ P E Sbjct: 106 GKYTQNNKVFGNDCRLEDDGNGMGALFKKSGYSTGYIGKWHLDGHEGGKYSFVPRERRLG 165 Query: 120 ADYW--------FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 +YW FDG Y + + W + D + + +A+ Sbjct: 166 FEYWLISQGHRHFDGRYYGDKDSLIVTGRW------MPDYE--------------TEKAL 205 Query: 172 DFLQ-QPARADE--PFLMVVSYDEPHH---PFTCPVEYLEKYADFYYELGEKAQDDLANK 225 +FL+ + DE PF +VVSY PH+ P + + +L + A Sbjct: 206 EFLKNRNGERDEGKPFCLVVSYAPPHNGMGPGFQNKHNIGHWNALLKDLPIRKGSGFAAP 265 Query: 226 PEHHRLWAQAMPSP-------VGDDGLY-HHPLYFACNDFVDDQIGRVINALTPEQR-EN 276 ++ A P V + Y P YF +D+ G++I L EN Sbjct: 266 KRFEEMYEPADKLPRRKNVEKVDNKESYPALPGYFGAITSIDENFGKLIQELKDSGEWEN 325 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTM 335 T V+YTSDHGE++G+H + K Y++ +PL+I P + ++ ++ ID+LPT+ Sbjct: 326 TIVVYTSDHGELLGSHGRMYKD-LWYEESIGVPLMISYPAKLKPKKAAQLINSIDILPTL 384 Query: 336 MALADIEKPEILPGE---NILAVKEPR---GVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 + LA IE PE++ G + + KE + +F+R + + + R T + Sbjct: 385 LELAGIEIPEVIDGNSYADYMNGKEKETTDKIFFQFDRGVLNDNGPDRY--YRAVRTKRY 442 Query: 390 KLVLNLF---------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 V + + LYD DP ++H + ++ + +++ A++D+ ++ D Sbjct: 443 TYVAAMSPYYDQFVGKNKEVLYDNEKDPYQLHPIFLGEKYDAIMNELRTAVMDWCEQTHD 502 Query: 441 PFRSYQWS 448 PF S W Sbjct: 503 PFFSKYWK 510 >UniRef50_C0QY53 Sulfatase n=2 Tax=Brachyspira RepID=C0QY53_BRAHW Length = 474 Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust. 
Identities = 124/480 (25%), Positives = 213/480 (44%), Gaps = 55/480 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + D + + Y + T ++ LA G F +++ SPVCTP+RA +FT Sbjct: 4 KKPNIILITADQMRADSIE-YINDEVKTPVLNELAENGSVFTNSFCTSPVCTPSRASIFT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y G W ++ T+ Y K+ Y GK H P + + Sbjct: 63 GRYPMNIGAWNIGTELNEDEVTLADYLKEDNYFNVASGKMHFR----------PQLKNLN 112 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSV--EDLQANHID---------------------- 157 + F+ + E++ + + + + +D Q ++D Sbjct: 113 WEFEDVPKRDRVRERDKTYYGFDITHITEDDKQGEYLDFANSHGCNLEIGKGIDGINPIP 172 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY-YELGE 216 E + + +A+D + D+P M VS+ +PHHPF +Y + Y D EL Sbjct: 173 EELHQTYWTAQKAIDEIDN-FNFDKPLFMWVSFVDPHHPFDPIKKYYDIYKDIKPKELNS 231 Query: 217 KAQDDLANKPEHHRLWAQAMPSPVGDDGLYHH----------PLYFACNDFVDDQIGRVI 266 K + D +PEH P G G HH LY+ F+D QIGR+I Sbjct: 232 KLKLD-KKRPEHLTKQGDRGYWPGG--GEEHHYSQEEIKEIKKLYYGMISFIDSQIGRII 288 Query: 267 NALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP 325 + L + +NT +I+TSDHGE +G + L+ KG MYD + ++PL+ + + D Sbjct: 289 DKLKEKNEFDNTIIIFTSDHGEYLGDYGLLKKGPFMYDCLIKVPLLFYGKGIVKNRSDEI 348 Query: 326 VSHIDLLPTMMALADIEKPEILPGENI--LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRC 383 + +ID+LPT++ + E P + G +I + + E + + I +D+ I ++ Sbjct: 349 IENIDILPTILDMLGKEIPYGIQGHSIKNILIGEDKNKTYKKGAV-ITYDAHDRGIFIKT 407 Query: 384 WVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 + T +KL + L E YD DPNE +NL + + ++++K+ + M + DP Sbjct: 408 YRTKQYKLSIFLDEEYGEFYDLEKDPNEENNLFFNKEYDEIKNKLLLEMCHKMIECSDPL 467 >UniRef50_Q1GMK9 Choline sulfatase n=8 Tax=Alphaproteobacteria RepID=Q1GMK9_SILST Length = 504 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. 
Identities = 120/438 (27%), Positives = 193/438 (44%), Gaps = 36/438 (8%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M PN L M D + + L+ N+ LAA RF + YT SP+C P RA Sbjct: 1 MTLPNILIFMVDQLNGTLFPDGPAEWLHAPNMKKLAARSTRFRNCYTASPLCAPGRASFM 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY-------FGTGE 113 +G + +G + N +I T + + AGY+TC GK H G D T Sbjct: 61 SGQLPSATGVYDNAAEFASSIPTYAHHLRRAGYYTCLSGKMHFVGPDQLHGFEERLTTDI 120 Query: 114 CPPE--WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 PP+ W DY G + I W + + SV I + ++ A Sbjct: 121 YPPDFGWTPDYRKPG---------ERIDWWYHNMGSVTGAGVAEISNQMEFDDEVAFHAT 171 Query: 172 DFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 + AR + P+ + VS+ PH P+ +Y + Y D + + E A N+ H Sbjct: 172 QKIYDLARGKDARPWCLTVSFTHPHDPYVTRKKYWDLYEDCPHLMPEVADLGYENQDPHS 231 Query: 230 RLWAQAMP----SPVGDDGLYHHPLYFACNDFVDDQIGRVINALT-PEQRENTWVIYTSD 284 + A +D YF ++DD+IG V+ AL Q ++T +++ SD Sbjct: 232 KRIFDANDWRNFDITEEDIRRSRRAYFGNISYLDDKIGEVMEALEGTRQDKDTIILFVSD 291 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP 344 HG+M+G L K + Y+ +R+P++I +P V PVS+ID+ PT+ LA + Sbjct: 292 HGDMLGERGLWFK-MSFYEGSSRVPMMISAPNMTPGLVCDPVSNIDVCPTLCDLAGVSMS 350 Query: 345 EILP---GENILAVKE--PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 E++P GE+++ + + R V +E+ + + P+ + +KL L D Sbjct: 351 EVMPWTAGESLVPLGQGGTRSTPV-----AMEYAAEASYAPMVSLRSGRYKLNLCALDPD 405 Query: 400 ELYDRRNDPNEMHNLIDD 417 +L+D DP+E NL D Sbjct: 406 QLFDLDADPHERVNLAKD 423 >UniRef50_A6DFB2 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFB2_9BACT Length = 474 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. 
Identities = 132/461 (28%), Positives = 213/461 (46%), Gaps = 62/461 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + + +F++ D T +G G P T N+D+LA G+ F +A+ +P C P+R T Sbjct: 30 KKDIVFIIVDDLNT-WIGAMGGHPQTKTPNLDALATRGVLFTNAHCNAPQCGPSRKSFLT 88 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKD---AGYHTCYIGKWHLDGHDYF--------- 109 G+Y +G + N++ +FKD G + K LD H +F Sbjct: 89 GLYPKSTGKYF-------NVAKKMPFFKDQPLKGATSKNPPKKPLDFHTHFMKNNYRVVS 141 Query: 110 --GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 + D FD + T+K ++LW G +ID+T T ++ + Sbjct: 142 GGKVDHGSLKAKIDNKFDRPKEVKHFTDKRVNLWGEG-------GPQNIDDTMTGDYKTA 194 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-DDLANKP 226 A+ Q ++D+P LM V + PH PF P EY +K+ +L + + DDLA+ P Sbjct: 195 QWAIK--QWNTKSDKPLLMSVGFYRPHRPFNVPKEYFDKFPLESIQLPKVPEFDDLADLP 252 Query: 227 EHHRLWAQA-------MPSPV---------GDDGLYHHPLYFACNDFVDDQIGRVINALT 270 E+ + A++ P V D+ Y Y AC ++VD QIG + L Sbjct: 253 EYGKALARSNAHKNLFKPRTVHEHILHLGGEDEWKYMVQSYLACINYVDTQIGLFLETLK 312 Query: 271 PEQREN-TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVS 327 R N T +I TSDHG +G + K AA++ TR+P I+ +P + P+S Sbjct: 313 NNPRGNDTVIILTSDHGWDLGEKEHWCK-AALWRTTTRVPYIVVAPGLTQAGTVNQQPIS 371 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV-T 386 H+D+ PT+ A I KP+ L G++IL + V + E + S+G P V T Sbjct: 372 HVDIYPTLCDFAGIAKPKHLEGQSILPL-----VKDSSAKREAAYLSYG---PRNTAVQT 423 Query: 387 DDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 + ++ + S ELYD + DP E NL + +A++++KM Sbjct: 424 ERYRYISYEDGSGELYDHQKDPREWTNLSSNPEYAELKAKM 464 >UniRef50_A3P379 Choline-sulfatase n=63 Tax=cellular organisms RepID=A3P379_BURP0 Length = 517 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. 
Identities = 130/435 (29%), Positives = 195/435 (44%), Gaps = 32/435 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D + Y + T ID LAAEG+ F++AY SP+C P+R L G Sbjct: 11 QPNILVLMADQLTPFALRAYGHRATRTPTIDRLAAEGVVFDAAYCASPLCAPSRFALMAG 70 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + G + N T Y + AGY T GK H G D E D Sbjct: 71 KLPSALGAYDNAAELPAQTLTFAHYLRAAGYRTMLSGKMHFCGPDQLHGFE--ERLTTDI 128 Query: 123 WFDGANYLSELTE-KEISLWRNGLNSVED----LQANHI----DETFTWAHRISNRAVDF 173 + ++ + T E W + ++SV D ++ N + D TF +I + A + Sbjct: 129 YPADFGWVPDWTRPAERPSWYHNMSSVLDAGPCVRTNQLDFDDDATFAARQKIFDVARE- 187 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 + R PF MVVS PH P+ EY + Y D +L D A+ P RL A Sbjct: 188 -RAAGRDTRPFCMVVSLTHPHDPYAITREYWDLYRDEDIDLPAVQMDFDASDPHSRRLRA 246 Query: 234 --QAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQ---RENTWVIYTSDHGE 287 + +P D + Y+ +VD Q G ++ L EQ ++T VI T+DHG+ Sbjct: 247 VCEVDRTPPEDLQIRRARRAYYGATSYVDAQFGALLATL--EQCGLADDTIVIVTADHGD 304 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEK--- 343 M+G L K ++ R+PLI+ +P+ +V VSH+DLLPT++ LA E+ Sbjct: 305 MLGERGLWYK-MTFFEGACRVPLIVHAPRRFPAARVPAAVSHVDLLPTLVELATGERRAD 363 Query: 344 -PEILPGENILAVKEPRGVMVE-FNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDEL 401 P+ + G +++ G E F Y E G P+ K + + D+L Sbjct: 364 WPDAVDGRSLVPHLRGEGGHDEAFGEYLAE----GAIAPIVMMRRGSHKYIHSPADPDQL 419 Query: 402 YDRRNDPNEMHNLID 416 +D RNDP E+ NL + Sbjct: 420 FDLRNDPRELDNLAN 434 >UniRef50_UPI0001968553 hypothetical protein BACCELL_00119 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968553 Length = 514 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. 
Identities = 127/495 (25%), Positives = 221/495 (44%), Gaps = 83/495 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +FV+TD +G P+ T ++D A+ + F++A +C PV P RA LFT Sbjct: 13 EKPNVIFVLTDQWRKQALGFKGEDPVQTPHLDEFASWAVSFDNATSCRPVSGPGRACLFT 72 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y+ +G + N V + +MGR FK AGY T YIGKWHL+G + T + D Sbjct: 73 GKYSINNGVFANKVPLATDEESMGRVFKAAGYATAYIGKWHLNGMNDHVTDSVRRQ-GFD 131 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR-AVDFLQQPARA 180 Y+ + + G +D + +I WA + ++F+++ + Sbjct: 132 YFVQSMGHQP---------FFQGYYVQDDKERTYIK---GWAPTYETQLGIEFIEKQKTS 179 Query: 181 DEPFLMVVSYDEPH-------------------------HPFTCPVEYLEKYADFYYELG 215 ++PF +V+SY+ PH + + P EY Y D YE Sbjct: 180 EQPFCLVLSYNPPHTGGGPGFEDRYQPGKYGPDKKLKMGYGYAGPAEYEALYKDIDYEK- 238 Query: 216 EKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR- 274 + +L A+ +P YF +D+ G ++ L EQ Sbjct: 239 NPIRGNLKPIRRSSDTSARVIPG------------YFGAITAIDNDFGNLMTYL--EQND 284 Query: 275 --ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLII--RSPQGERRQVDTPVSHID 330 ENT +++TSDHGE MG+ L++KG +++ +P ++ + +R++ + ID Sbjct: 285 LLENTIIVFTSDHGESMGSQGLMTKG-TWFEESMGVPCLVGWKGVIKPKREI-VVFNSID 342 Query: 331 LLPTMMALADIEKPEILPGEN---ILAVKEPRGVMVEFNRYEIEHDSFGGFIPV------ 381 L+PT++ L+ + P+ + G + +L K+ + F ++ FGG + Sbjct: 343 LMPTLLGLSGLSIPQGVDGVDYSPLLLGKKFKAPEYAFTSFD-----FGGVEELKAPRYW 397 Query: 382 RCWVTDDFKLVL------NLFTSDE--LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 R T + VL FT D LYDR DP +++ + + + ++H L+ Sbjct: 398 RAVYTSRYTYVLCGMNQNRAFTKDGLVLYDREKDPLQLNPIYKGMGYDKTIDRLHAELVK 457 Query: 434 YMDKIRDPFRSYQWS 448 ++D+ DPF W+ Sbjct: 458 HLDETGDPFIKEYWN 472 >UniRef50_B6AU86 Putative sulfatase YidJ n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6AU86_9RHOB Length = 527 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. 
Identities = 138/510 (27%), Positives = 215/510 (42%), Gaps = 81/510 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RP+++ +TD Q + +GC L T NID++A EG+ + Y SPVC P RA L T Sbjct: 4 RPSYILFITDQQRYDHLGCNGHPVLRTPNIDAMATEGVSHDRFYVASPVCMPNRASLMTC 63 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 G + + ++ T AGY TC IGK HL + PPE Y Sbjct: 64 RMPASHGTRSLGIPLNQDNVTFVELLAAAGYDTCLIGKSHLQNVSDWDIQIDPPEHRDGY 123 Query: 123 W-------------FDGANYLSELTE-----------------KEISLWRNGLNS----- 147 D Y E E + +++ R+G N+ Sbjct: 124 AAPPEELAVATRSDLDSGTYQYERQEYWDQPDPKVHLPFYGFNEYVAVMRHGFNTGGEHL 183 Query: 148 -------VEDLQANHIDETFTWAHR------------------ISNRAVDFLQQPARADE 182 E L D+ FT + I+NRA +L+ D+ Sbjct: 184 EWIKKNAPETLALRGRDKQFTHDYTVPQAIRTKVPEEHYSTTYIANRAAAWLKARKGKDK 243 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKY------ADFYYELGEKAQDDLANKPEHHRLWAQAM 236 PFL+VVS+ +PHHPFT P +Y + Y YE+ + + E R ++ Sbjct: 244 PFLLVVSFPDPHHPFTPPGKYWDMYKPKDMVVPAAYEIDDWDPPEYVKSAERARAADPSL 303 Query: 237 PSPVG-------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEM 288 G + L L +DD +G+V A+T + T IYTSDHG+ Sbjct: 304 GQMEGYSLAVSKQEALEARALTCGMIAMIDDAVGQVRTAVTKAGVADKTVQIYTSDHGDH 363 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPEI 346 +G H+L+ KGA YD +T +P I P+G+ R D +H D+ T++ A +EK Sbjct: 364 LGEHRLLFKGAEQYDSLTHVPFIWADPKGDSGTRSSDLAQTH-DIGTTILEHAKVEKSLG 422 Query: 347 LPGENILAVKEPRGVMVEFNRYEIE--HDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYD 403 + G +L G +YE + ++FG V V ++++L + L ++EL+D Sbjct: 423 MQG-VVLPTVGGAGRDAAHIQYETQRTQEAFGPRPRVHTVVYENWRLSMYLGKCANELFD 481 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 DP EM NL +V++++ + L + Sbjct: 482 LAEDPGEMTNLWKSADHQEVKARLLERLAE 511 >UniRef50_UPI00016C0A06 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0A06 Length = 485 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. 
Identities = 135/469 (28%), Positives = 203/469 (43%), Gaps = 47/469 (10%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+ TD + +G + + T N+D LA + + F +++T +P+CTPAR L TG+Y Sbjct: 3 NVLFIFTDQWRADCMGYRNHPVVKTPNLDKLAQKSVDFKNSFTTTPLCTPARGSLLTGLY 62 Query: 65 ANQSGPWTNNVAPGK-------NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE---- 113 +QSG N G N T +G T Y GKWHL G D+ G+ Sbjct: 63 PHQSGIIDNCDVGGSSQEYLPGNAFTWLDAMAKSGRKTGYFGKWHL-GLDWDGSNSGVDF 121 Query: 114 --CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN------HIDETFT-WAH 164 C E + +G + +TE+ + G V + N ID + H Sbjct: 122 DICRKEGNRAKHMNGIPF---VTERGQLIKDRGERFVPEKNGNKLPFYGKIDSVENRYEH 178 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 +++ + +DFL+ A D+P+ + S+ PH P P Y Y L E D N Sbjct: 179 KVTTKVLDFLE--ANKDQPWCLTASFVGPHFPSILPEPYFSMYPPAEMALPENITDTFFN 236 Query: 225 KP-EHHRLWAQAMPSPVGDDGLYHH-----PLYFACNDFVDDQIGRVIN-ALTPEQRENT 277 KP H R W PS DD + Y+ C +D IG +++ A T Sbjct: 237 KPWFHSRNW---WPSVATDDFTAENWQKTISAYYGCITMMDALIGEILDKAAACSGGRPT 293 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLI----IRSPQGERRQVDTPVSHIDLLP 333 VI+TSDHGEM+G H K A Y+++ R PL+ + QG + D + +D+ Sbjct: 294 KVIFTSDHGEMLGGHARFDKDAYFYEEVLRTPLLYCANLNGDQGGFARDDYATT-LDIAQ 352 Query: 334 TMMALADIEKPEILPGENIL-AVKEPRGVMVEF-NRYEIEHDSFGGFIPVRCWVTDDFKL 391 T LA +L +P + F N Y+ SF +R T +K Sbjct: 353 TFFGLAGFGAENGSSLTPMLDKSYQPDAEKIMFGNYYKYNGHSF----EIRFARTPRYKY 408 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 DELYD NDP E+ NL D ++ D++ ++ +L +M D Sbjct: 409 SFIPQDIDELYDMENDPQELVNLSDRAQYQDIKEELKARVLQHMKDTND 457 >UniRef50_A0JVP0 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0JVP0_ARTS2 Length = 508 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. 
Identities = 139/488 (28%), Positives = 221/488 (45%), Gaps = 55/488 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N LF+MTD Q + +GCY + +T +D LAA G ++ AYT + +CTPARA L TG Sbjct: 8 RTNILFLMTDQQRIDTMGCYGNRSRHTPYLDGLAARGTVYDRAYTPTAICTPARASLLTG 67 Query: 63 IYANQSGPWTN--------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD---GHDYFG- 110 ++ + G +N + P T + GY ++GKWH+ G D++G Sbjct: 68 LHPFEHGLLSNFEWNSGHRDELP-DGTPTFADELRKQGYRLGHVGKWHVGRERGPDFYGF 126 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWR--NGLNSVE-DLQANHI---------DE 158 GE P A FD Y S L EK +R + + +V+ D H+ + Sbjct: 127 EGEHLP--GALNTFDNPAYTSWLAEKGFPSFRIVDPVYTVQKDGSQGHLIAGITDQPTEA 184 Query: 159 TF-TW-AHRISNRAVDFLQ-QPA-------RADEPFLMVVSYDEPHHPFTCPVEYLEKYA 208 TF W A + + +F Q PA A PF + PH P+ P ++ + Sbjct: 185 TFEAWLADQTIAKLREFAQTHPAGGAPGTETAVAPFYLSCHIFGPHLPYLIPRQWYDLVD 244 Query: 209 DFYYELGEKAQDDLANKPEHHRLWAQ--AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 +L + + KP + +A+ + S ++ +Y+ +D +IGR++ Sbjct: 245 PATVQLPKSFAETFNGKPLVQQTYAEYWSTDSFTVEEWKKLTAVYWGYVSMIDHEIGRIL 304 Query: 267 NALTP-EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP 325 + ++T +++T+DHGE GAH+L KG AMY+DI RIP I+ +P E R+ Sbjct: 305 QTVEELGLNDSTVIMFTADHGEFTGAHRLNDKGPAMYEDIYRIPAIVAAPGQEPRRESKF 364 Query: 326 VSHIDLLPTMMALAD-----IEKPEILPGENILAVKEPRGVMV-EFNRYEIEHDSFGGFI 379 VS D T + +AD I ++P + R MV EF+ + + Sbjct: 365 VSLQDFTATFIDIADGYAGNIRGSSLMPSTTAPLPADWRTEMVCEFHGHHFPYAQ----- 419 Query: 380 PVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 R + +K + N DE YD +DP+E+HN++ +A M +L + Sbjct: 420 --RMIRNERYKYIANPEGIDEFYDLVSDPDELHNVVTVPAYATQLKTMRLSLYKELVSRG 477 Query: 440 DPFRSYQW 447 D F YQW Sbjct: 478 DKF--YQW 483 >UniRef50_Q5LRB5 Choline sulfatase n=1 Tax=Ruegeria pomeroyi RepID=Q5LRB5_SILPO Length = 498 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. 
Identities = 126/451 (27%), Positives = 194/451 (43%), Gaps = 48/451 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D M+ G T+++ LA ++F +AYT SP+C PAR+ TG Sbjct: 16 RPNILLIMADQMTPFMLEACGGTGARTRHLTRLAGRAVQFTNAYTPSPICVPARSCFMTG 75 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD--------------- 107 +Y + +G + N + T Y +AGY T GK H G D Sbjct: 76 LYTSTTGCYDNGDPYHSFLPTFAHYLTNAGYETVLSGKMHFIGADQLHGFQRRLNPDIYP 135 Query: 108 --YFGTGECPPEWDADYWFDGANYLSELTEKEISL-WRNGLNSVEDLQANHIDETFTWAH 164 + + PP+ DA F ++ + + I W L E+ Q Sbjct: 136 SGFLWSYPLPPDGDAS--FQAFDFTPQYLAENIGPGWSKELQYDEETQF----------- 182 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 RA+++L+ D P+++ VS+ PH P+ P Y E Y D L + D A Sbjct: 183 ----RALEYLRH--APDTPWMLTVSFTNPHPPYVVPRPYWEMYKDADIPLPDYPADMDAR 236 Query: 225 KPEH-------HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QREN 276 E H L + + + + A +VDD+IG ++ L QR+ Sbjct: 237 YSEFDHALRRWHGLHQRGHEVRDPRNLIAMRRGFAALAHYVDDKIGALLEVLDETGQRDE 296 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMM 336 T +I TSDHGEM+G LI K ++Y+ RIPLII P +VDTPVS +DL T++ Sbjct: 297 TVIIVTSDHGEMLGEKGLIQK-RSLYEWSARIPLIIDLPGAAPGRVDTPVSLLDLPATLI 355 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 L+ L G ++L RG ++ E+ G P D+K Sbjct: 356 ELSGQTPVAPLEGRSLLGAV--RGQELDTVPIVSEYHGEGIMRPSFMVRLGDWKYHYCHG 413 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 ++ +LY+ DP E HN + A+ +++ Sbjct: 414 SAPQLYNLARDPGEWHNRAGEPDLAETEARL 444 >UniRef50_Q46P27 Sulfatase n=3 Tax=Proteobacteria RepID=Q46P27_RALEJ Length = 482 Score = 152 bits (383), Expect = 4e-35, Method: Compositional matrix adjust. 
Identities = 131/458 (28%), Positives = 210/458 (45%), Gaps = 40/458 (8%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 M N + +M+D M+GC SG P + T N+D+LAA G+RF+SAYT SP+C PARA Sbjct: 1 MASKNVVVIMSDEHDPRMMGC-SGHPFVKTPNLDALAARGVRFSSAYTPSPICVPARAAF 59 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG +Q W N + G +D G IGK H + E P +D Sbjct: 60 ATGRRVHQVRLWDNAMPYTGEQRGWGHVLQDRGIRVESIGKLH------YRNEEDPAGFD 113 Query: 120 ADYWFDGANYLSELTEKEISLW---RNGLNSVED---LQANHI---DETFTWAHR-ISNR 169 A++ + +W RN E+ + HI + ++T R ++ R Sbjct: 114 AEH------LPMHVVGGHGMVWASIRNPFRPRENGPRMLGEHIGPGESSYTQYDRAVTQR 167 Query: 170 AVDFLQQPA-RADEPFLMVVSYDEPHHPFTCPVEYLEKY-ADFYYELGEKAQDDLANKP- 226 AV +LQ+ A R + F++ V PH PF P E+ Y D E + P Sbjct: 168 AVQWLQEAAQRQEAGFVLYVGLVAPHFPFVVPEEFYSLYPTDGLPEPKLHPRTGYEQHPW 227 Query: 227 --EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTW-VIYTS 283 E+ A D+ L Y+ ++D +G+++ AL E+T ++YTS Sbjct: 228 VREYCDFMASERQFADADERLRAFAAYYGLCTWLDHNVGQILGALRDNGLEDTTHIVYTS 287 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEK 343 DHG+ +GA + K + +Y++ ++P+++ P +TPV +DL PT++ A ++ Sbjct: 288 DHGDNLGARGVWGK-STLYEESVKVPMLLAGPIVTPGVCNTPVDLLDLFPTILQGAGVDP 346 Query: 344 PEIL---PGENI--LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 + PG ++ LA P V + Y + GGF+ + +K + Sbjct: 347 ATEIDERPGRSLFELARSAPEPDRVILSEYHAAGSNAGGFMLRK----GRWKYHHYVGFR 402 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 EL+D +DP E+ +L D +A V + MH+ALL D Sbjct: 403 PELFDLESDPEELTDLAGDPAYAPVLASMHEALLAICD 440 >UniRef50_A6DM50 Choline sulfatase n=6 Tax=Bacteria RepID=A6DM50_9BACT Length = 647 Score = 151 bits (382), Expect = 5e-35, Method: Compositional matrix adjust. 
Identities = 120/471 (25%), Positives = 214/471 (45%), Gaps = 66/471 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTC----SPVCTPARA 57 K+PNF+F+ D Q+ +G Y + T N+D L GI F Y VC +RA Sbjct: 26 KKPNFMFIFADDQSYESIGAYGQLNIKTPNLDRLVKRGISFTHTYNMGAWGGAVCVASRA 85 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT-----G 112 L +G + N++ K + AGY T GKWH+ G+ F G Sbjct: 86 MLNSGRFVNRAEKGV------KQYPHWSQIMNSAGYTTYMTGKWHVHGNPRFDVMKDVRG 139 Query: 113 ECPPEWDADY--WFDGANYLSELT---EKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P + A Y F Y SE +++ WR G + W ++ Sbjct: 140 GMPNQTPARYKRTFKPELYESEWLPWDKRQQGFWRGGTH---------------WTQVVA 184 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 + + F ++ ++PF M ++++ PH P P EY++ Y ++ E N Sbjct: 185 DNTLTFFEKVKNDNKPFFMYLAFNAPHDPRQAPKEYVDMYPLDSIKIPE-------NYMP 237 Query: 228 HHRLWAQAMPSPVGDDGLYHHPL-----------YFACNDFVDDQIGRVINALTPEQR-E 275 + A+ + D+ L +P Y+A ++D IGR+++AL + E Sbjct: 238 EYPYAAEICGKKLRDEVLAPYPRTTYAVKRNRQEYYASITYMDHHIGRMLDALEASGKAE 297 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPT 334 NT++I+T+DHG G H L+ K +MY+ R P I+ P + ++DTP+ D + T Sbjct: 298 NTYIIFTADHGLAAGHHGLMGK-QSMYEHSMRPPFIVVGPGIKQNSKIDTPIYLQDAMAT 356 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV-RCWVTDDFKLVL 393 + LA +EKP + ++++ + + V+++R +G ++ R + DD+KL+ Sbjct: 357 AIELAGVEKPAHVEFKSLMPLIKGEKT-VQYDRI------YGKYMNTQRMILKDDWKLIF 409 Query: 394 NLFTSDE--LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 + + L++ +NDP EM++LID+ +A ++ ++ ++ DP Sbjct: 410 YPHAAKKMRLFNIKNDPAEMNDLIDNPEYATKIQELKREFVELQKEMGDPL 460 >UniRef50_Q7UH28 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Rhodopirellula baltica RepID=Q7UH28_RHOBA Length = 534 Score = 151 bits (382), Expect = 5e-35, Method: Compositional matrix adjust. 
Identities = 130/456 (28%), Positives = 212/456 (46%), Gaps = 60/456 (13%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N +F++TD + +GC L T N+DS+AA G +A+ + +C+P+RA + TG+Y Sbjct: 58 NVVFILTDDHRFDAMGCAGHPFLETPNLDSIAANGTHIKNAFVTTSLCSPSRASILTGLY 117 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW- 123 ++ NN +Y + AGY T ++GKWH+ GH P D+W Sbjct: 118 THKHRVIDNNRLVPDGTLFFPQYLQRAGYDTAFVGKWHMGGH------HDDPRPGFDHWV 171 Query: 124 -FDG-ANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 F G NYL + ++ +N Q +I + T + AVD+L++ D Sbjct: 172 SFRGQGNYLPPGPKYTLN-----VNGERVKQKGYITDELT------DYAVDWLKE-RDDD 219 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYAD---FYYELGEKAQDDLANKP----EHHRLWAQ 234 EPF + +S+ H FT + +YAD + G++ D N P + W Sbjct: 220 EPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSAD-KNTPRWVRDQKNSWHG 278 Query: 235 AMPSPVGDDGL-YHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 S D GL Y + Y VDD +GRV+ L ++T +IY D+G M G H Sbjct: 279 IDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDNGFMWGEH 338 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 LI K + Y+ R+P++++ P + ++ V +ID+ PT++ A ++ PE + G+ Sbjct: 339 GLIDKRVS-YEASIRVPMLMQCPNLFDGGQPIENVVGNIDVGPTILHAAGLQTPEYMDGQ 397 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT-------------DDFKLVL--NL 395 + L + P ++ +Y F+ V W D FK + L Sbjct: 398 SFLDL--PNNRDADWRKY---------FLYVYYWEKNFPQTPTQFALRGDRFKYITYYGL 446 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + +DELYD + DP+E++NLI D + V +M D L Sbjct: 447 WDTDELYDLQTDPDELNNLIHDPDYKSVAKEMEDQL 482 >UniRef50_Q5LH37 Putative sulfatase n=16 Tax=Bacteroides RepID=Q5LH37_BACFN Length = 483 Score = 151 bits (381), Expect = 6e-35, Method: Compositional matrix adjust. 
Identities = 135/484 (27%), Positives = 215/484 (44%), Gaps = 74/484 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +F+M D + +GC +P+ T ++D LA+EGI F +A + PV +PAR L TG+ Sbjct: 28 PNLVFIMADQYRGDAIGCIGKEPVKTPHLDKLASEGINFTNAISSYPVSSPARGMLMTGM 87 Query: 64 Y---ANQSGPWTNNVAP-GKNISTMGR----YFKDAGYHTCYIGKWHLDG------HDYF 109 Y + +G + AP G +S R KD GY+ YIGKWHLD Y Sbjct: 88 YPIGSKVTGNCNSETAPYGVELSQNARCWSDVLKDQGYNMGYIGKWHLDAPYKPYVDTYN 147 Query: 110 GTGE------CPPE--WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 G+ CPPE D+W Y L + W N + Sbjct: 148 NRGKVAWNEWCPPERRHGFDHWIAYGTYDYHL---KPMYWNTTAPRDSFYYVNQWGPEYE 204 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQD 220 +++A++++ +PF +VVS + PH + P Y E Y D E K + Sbjct: 205 -----ASKAIEYINGQKDQKQPFALVVSMNPPHTGYELVPDRYKEIYKDLDVEALCKGRP 259 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWV 279 D+ A + +GD + Y+AC VD+ +GR+I AL +NT V Sbjct: 260 DIP-----------AKGTEMGDYFRNNIRNYYACITGVDENVGRIIEALKQNNLFDNTIV 308 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPTMMA 337 ++TSDHG MGAH+ K Y++ RIP+I+ P + + P ++ DL PT+++ Sbjct: 309 VFTSDHGICMGAHENAGKD-IFYEESMRIPMILSWPDQIKPRKSDPLMIAFADLYPTLLS 367 Query: 338 LADIEKP------------EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 + K E+L G+N + +P V+F+ + + R Sbjct: 368 MMGFSKEIPETVQTFDLSNEVLTGKNKKDLVQPY-YFVKFDNHATGY---------RGLR 417 Query: 386 TDDFKLVLNL----FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 TD + ++ + L+DR NDP+EM+N+ + + + L +++K DP Sbjct: 418 TDRYTYAVHATDGKIDNVILFDRTNDPHEMNNIAS--QQLKLTHTFNRQLKTWLEKTNDP 475 Query: 442 FRSY 445 F Y Sbjct: 476 FAQY 479 >UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W906_ENT38 Length = 501 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 120/396 (30%), Positives = 186/396 (46%), Gaps = 66/396 (16%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + +PN + ++ D +G Y + T NID LA EG+RF+ Y +P+C+P+RAGL Sbjct: 33 LNKPNVVIILADDLGYGDLGIYGHPIVKTPNIDKLAQEGVRFSQYYAPAPLCSPSRAGLL 92 Query: 61 TGIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGT 111 TG ++G P N+A G+N T+ Y KD GY T +GKWHL D HD Sbjct: 93 TGRTPFRTGIRSWIPTNKNIALGRNEKTIASYLKDQGYDTAMMGKWHLNAGVDRHD---- 148 Query: 112 GECPPEWDADY---WFDGANYLSELTEKEISLWRNGL--------NSVEDLQANHIDETF 160 P DA + + A +++ +K RNG+ N N I F Sbjct: 149 --QPQAEDAGFDYTLVNAAGFVTSDLDKAKERPRNGVVYPNGFYRNGKALGTVNQISGEF 206 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD 220 +S A+++L + ++PF M V++ E H P P +YLE Y ++ E EK Sbjct: 207 -----VSQEAINWLND-KKDNKPFFMYVAFTEVHTPLASPKKYLEIYKNYMSEY-EKQHP 259 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHP-LYFACNDFVDDQIGRVINAL-TPEQRENTW 278 D+ +A + P Y P Y+A ++D+Q+G+V+ + + Q +NT Sbjct: 260 DM--------FYADWVDKP------YRGPGEYYANISYMDEQVGKVLAKIKSMGQEDNTI 305 Query: 279 VIYTSDHG------------EMMG-AHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVD 323 +I+TSD+G M G L + +++ R+P II+ Q D Sbjct: 306 IIFTSDNGPVTREARKWYELNMAGETDGLRGRKDNLWEGGIRVPAIIKYGQHLHAGTVTD 365 Query: 324 TPVSHIDLLPTMMALADIEKP--EILPGENILAVKE 357 TPVS +D+LPT+ L P I+ GE+I+ V E Sbjct: 366 TPVSGLDILPTLAELTHFNLPTDRIIDGESIVPVLE 401 >UniRef50_C3QDX1 Sulfatase n=2 Tax=Bacteroides RepID=C3QDX1_9BACE Length = 485 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 118/453 (26%), Positives = 209/453 (46%), Gaps = 28/453 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 N LF+ D + G +SGK L T NID LA+EG+ F ++Y+C P PAR L +G Sbjct: 30 NILFIQADQHRYDCTG-FSGKGLVKTPNIDKLASEGVIFTNSYSCIPTSCPARQSLISGK 88 Query: 64 YANQ-SGPWTNNV---APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY---FGTGECPP 116 + Q G W ++ N T + Y+GKWH+ FG + P Sbjct: 89 WPEQHKGLWNYDITLPVTPFNGPTWTEKLSEKDIKMGYVGKWHVSDRKSPKDFGFDDYVP 148 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 EW + W N L + ++ G + V+ +Q+ H ++ R ++ +++ Sbjct: 149 EWSYNNWRKKNN-LPDYVWQDSRWVMGGYDPVDKMQSR--------THWLAQRVIEMIKK 199 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP--EHHRLWAQ 234 + + + +PH P E+L Y + +DDL+NKP + +++ Sbjct: 200 YQSEGKKWHVRFDTSDPHLPCYPVREFLAMYDKEKIQEWPNYRDDLSNKPYIQRQQIYNW 259 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK 293 + + + YFA +DD +G VI AL +NT+++YT+DHG+ G+H Sbjct: 260 ELEDSNWEMWQGYLQRYFANITQLDDAVGMVIEALKEMGVYDNTFIVYTTDHGDAAGSHN 319 Query: 294 LISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH-IDLLPTMMALADIEKPEILPGENI 352 ++ K MY++ +PL+++ P R +D V++ +D+ T + ++ GE++ Sbjct: 320 MVDKHYVMYEEEVHVPLVMKIPGVSHRIIDRFVNNQLDMAATFCDMYQLDYK--TQGESL 377 Query: 353 LAVKEPRGVMVEFNRYEIEH---DSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 L + E + ++ Y + FG F+ R K V NL +DELYD +DP Sbjct: 378 LPLIEEKKEASDWREYAFSNYNGQQFGLFVQ-RMIRDKRMKYVWNLTDTDELYDLESDPW 436 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 E++NL+ + ++ AL + + + +DP Sbjct: 437 ELNNLVYSKEYKAELVRLRKALYEDLKQRKDPL 469 >UniRef50_B5JYP8 Choline-sulfatase n=1 Tax=Octadecabacter antarcticus 238 RepID=B5JYP8_9RHOB Length = 531 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 128/456 (28%), Positives = 207/456 (45%), Gaps = 54/456 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + +M D A + + Y T N++ LAA+G F + Y+ +P+C P+RA + + Sbjct: 5 SRPNIILIMADQMAAHALSLYGNTVCKTPNLERLAAQGTVFENGYSNNPLCVPSRASMLS 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD------YFGTGECP 115 G+ + + N ++ TM Y + AGY T GK H G D + Sbjct: 65 GMLSPDVNVFDNANELPSSVPTMAHYLRHAGYWTELCGKMHFIGPDQEHGFNQRSVTDVY 124 Query: 116 P---EWDADYWFDGANYLSELTEKEISLWRNGL----NSVEDLQANHIDETFTWAHRISN 168 P +W AD W G ++ T NG+ V +Q ++ DE H Sbjct: 125 PASFQWIAD-WQAGPAFVPSGTA------LNGVVESGPCVRTMQEDYDDEV---EHCAIQ 174 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE---LGEKAQDDL--- 222 D ++P R +PF +VS+ PH PFT EY ++Y + +G +DL Sbjct: 175 SLYDRAREPDR--QPFFQIVSFTNPHTPFTVSQEYWDRYESSEIDAPAVGALPFEDLDYH 232 Query: 223 ------ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRE 275 A+ H++ + + + Y+ +VDD++GR+++ L QR+ Sbjct: 233 SKALFFAHGRHRHKVTQKHL--------IAARHAYYGMISYVDDKVGRILDTLEKTGQRD 284 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLP 333 NT V + SDHGEM+G + K ++ +P I P G R + VS +DLLP Sbjct: 285 NTAVFFVSDHGEMLGERGMWFK-QTFWEWSAHVPFIASVPGITGGGRS-EKVVSLVDLLP 342 Query: 334 TMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEI-EHDSFGGFIPVRCWVTDDFKLV 392 T + LA + PE L G ++L + E G + I ++ + G +P R FK + Sbjct: 343 TFLDLAGADSPE-LAGSSVLPLME--GDADAWPDIAISDYLAIGPCVPCRMVRKGRFKFI 399 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 LYD ++DP E++NL D+ FADV +++ Sbjct: 400 YTHGHPALLYDLQDDPLELNNLADNAAFADVLAELQ 435 >UniRef50_UPI00017453D4 choline sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017453D4 Length = 485 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 124/438 (28%), Positives = 201/438 (45%), Gaps = 32/438 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N LF++ D + +G G + T N+D LA G F +A+ +P C+P+R TG Sbjct: 26 RLNVLFILVD-DLNDQIGWLGGAGI-TPNMDRLAQRGTLFANAHAQAPWCSPSRTSFLTG 83 Query: 63 IYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G PW NV + + T+ ++F GY T IGK + +G CPP Sbjct: 84 KRPSTTGIYALTPWFRNVPALRELVTLPQHFAAHGYETFGIGKVYHEG--------CPPA 135 Query: 118 WDADYWFDGANYLSELTEKEISL-WRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 F Y + + S + N +D + T ++++ A++ L + Sbjct: 136 NQPTPEFSVMGYQGNWRKPQPSKPFVNTPGMRQDFGQFPDRDDQTDDFKVASSAIECLGR 195 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD---FYYELGEKAQDDLANKPEHHRLW- 232 P +PF + V PH+P P ++ Y + E+ +DDL RL Sbjct: 196 PH--TKPFFIAVGLRRPHYPLYAPQQWFSLYDPQNVWLPEVPATDRDDLPRFARALRLGN 253 Query: 233 AQAMPSPVGDDGLY--HHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMM 289 + P+ + GL+ H+ Y AC FVD+QIGR+++AL + T ++ SDHG + Sbjct: 254 TEPTLGPIVNAGLWRSHNHAYLACVSFVDNQIGRILDALEQSGEAHRTVIVLASDHGFHL 313 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 G +L +K +++ T +PLI P R PV +D+ PT+ + + P L G Sbjct: 314 GEKELFAK-RTLWERATHVPLIFAGPGVGRGTSKRPVELLDIYPTLTEICGLPTPPGLEG 372 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 E++ A+ R R I G F VR T+D++ + S+ELYD R DP Sbjct: 373 ESLGALL--RDPSAARTRPAITGQMQGSFA-VR---TEDWRYIRYADGSEELYDHREDPQ 426 Query: 410 EMHNLIDDIRFADVRSKM 427 E NL D R+ V++++ Sbjct: 427 EFLNLAADQRWTSVKTEL 444 >UniRef50_C2KTX6 Arylsulfatase n=2 Tax=Mobiluncus mulieris RepID=C2KTX6_9ACTO Length = 505 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 128/456 (28%), Positives = 220/456 (48%), Gaps = 40/456 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + MTD + +GC + T NIDSLAA+G F +T + +CTPAR+ + TG Sbjct: 14 NIINFMTDQHRIDTLGCLGNENAQTPNIDSLAADGCIFEKGFTPTAICTPARSSMLTGKL 73 Query: 65 ANQSGPWTN---NVAPGKNIS----TMGRYFKDAGYHTCYIGKWHLDGH--DYFGTGECP 115 + N N+A I T + +D GY+ +GK+H + D FG + Sbjct: 74 PFKHLTLANPEWNIAYSTAIPEDDWTYTQQLRDDGYNVGMVGKYHCGTNLPDKFGCDD-D 132 Query: 116 PEWDADYWFDGANYLSELTEKEI------SLWRNGLNSVED--LQANHID--ETFTWAHR 165 W A+ + Y + L E + +WR L D + A +D E T+ Sbjct: 133 TYWGAENPVNNEKYTAWLEENHLPPVKAHDIWRGKLPGNRDGHIIAARLDQPEEATFERF 192 Query: 166 ISNRAVDFLQQPAR----ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 I++ ++ L+Q A+ +D+PF + V + PH P+ P E+ + L + D Sbjct: 193 IADVSIAKLRQYAKDYRESDKPFSLDVHFFGPHLPYFLPDEWFDLIDPESIVLPKNFGDT 252 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHH--PLYFACNDFVDDQIGRVINALTP-EQRENTW 278 L KP + +A + ++ + +Y+ +D +IGR++ + + ++T Sbjct: 253 LVGKPPIQQNYATYWSTSSFNNDQWRKLIAVYWGYVAMIDFEIGRILEVVRELDLMDDTA 312 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTM 335 + + +DHGE G+H+L KG AMYD+I RIP I+R P G R + V+ +DL T+ Sbjct: 313 MFFCADHGEFTGSHRLNDKGPAMYDEIYRIPFIVRIPGLTHGNRCR--EYVNLLDLTATI 370 Query: 336 MALADIEKPEILPGENI--LAVKEPRGVMVEFNRYEIEHDSFGGFIPV--RCWVTDDFKL 391 + +A + + G+++ LA +P R +I + G PV R DD+KL Sbjct: 371 IDIAGGDTSRVEDGKSLVRLAAGKPEADW----RQDIVCEFHGLHFPVQQRMLRNDDYKL 426 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 +++ + +ELYD + DP+EM+N+ + ++R +M Sbjct: 427 IVSHESINELYDLKRDPDEMNNVYAAPAYDEIRRQM 462 >UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE Length = 499 Score = 150 bits (378), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 126/455 (27%), Positives = 212/455 (46%), Gaps = 39/455 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +R N +F+++D + +G +P L T ++D+LA +G +A+ C+ +C+P+RA + Sbjct: 27 RRRNVIFILSDDHRYDALGFMHPQPWLRTPHLDTLARDGAHLKNAFVCTALCSPSRASIL 86 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG+YA++ NN A + + + AGY T ++GKWH+ G P+ Sbjct: 87 TGVYAHRHHIVDNNTAIPRGTRFFPQLLQRAGYKTGFVGKWHM------GREGDDPQPGF 140 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D W S L E RNGLN + H+ + +++ A+D+L+ + Sbjct: 141 DKWVSFRGQGSYLPE------RNGLN----VDGKHVPQKGYITDELTDYALDWLRTVPK- 189 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP- 239 ++P+ + +S+ H F + YA + D+ +H +W Q + Sbjct: 190 EQPYFLYLSHKAVHADFIPADRHKGAYAKETFR-PPTTMDESGPNAQHRPMWVQNQRNSW 248 Query: 240 VGDDGLYHHPL--------YFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMG 290 G D YH L Y VDD + R+++AL Q ++T VIY D+G G Sbjct: 249 HGVDFPYHSDLDVGEYYKRYAETLLGVDDSVDRMLDALRERGQLDSTLVIYMGDNGFQFG 308 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILP 348 H LI K A Y++ R+PL+ R P+ R VD V+ +D++PT++ A P+ L Sbjct: 309 EHGLIDKRTA-YEESMRVPLLARCPEMFSGGRVVDRMVAGLDIMPTVLDAAGAAIPQGLD 367 Query: 349 GENILAV----KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV--LNLFTSDELY 402 G ++L + +P+ Y E + F + TD +K V ++ SDELY Sbjct: 368 GRSMLPLLRGENDPQWRTQLLYEYYWERN-FPQTPTMHALRTDRYKYVRYYGIWDSDELY 426 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 D + DPNE NLI + + + L D M++ Sbjct: 427 DLQEDPNETTNLIYNPERKATIEEFNKRLFDEMER 461 >UniRef50_A4AWR8 Iduronate-2-sulfatase n=5 Tax=Bacteria RepID=A4AWR8_9FLAO Length = 498 Score = 150 bits (378), Expect = 1e-34, Method: Compositional matrix adjust. 
Identities = 120/474 (25%), Positives = 218/474 (45%), Gaps = 40/474 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++ D T V Y +NT +ID LA+EG+ F Y+ PVC P+RA + Sbjct: 39 KKPNVLFIIADDLTTTAVSSYGNSEVNTPHIDKLASEGVLFTRTYSQYPVCGPSRASFMS 98 Query: 62 GIYANQSGPWTNNVAPGKNIS----TMGRYFKDAGYHTCYIGK-WHL----------DGH 106 G Y + + + V+ KNI T + FKD GY+T + K +H+ +G Sbjct: 99 GYYPSATTTY-GYVSGRKNIGSERKTWSQVFKDNGYYTARVSKIFHMGVPIDIEKGSNGQ 157 Query: 107 D--------YFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 D + G PEW A GA L + + +L G N + ++A+ D+ Sbjct: 158 DDEQSWTERFNSQG---PEWKAP----GAGELVQ-GNPDGTLPIKGGNVMTIVKADG-DD 208 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 + + +A + +++ D+PF + + + PH PF P Y E Y +L +K Sbjct: 209 LVHSDGKTAEKASELIRK--HKDKPFFLAIGFVRPHVPFVAPKSYFEPYPHNQTKLPKKV 266 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHH-PLYFACNDFVDDQIGRVINALTPEQRE-N 276 ++D + P+ + ++ + + Y+A ++D Q+G+V+ L E E N Sbjct: 267 ENDWDDIPKRGINYVTSVNGKMNTEQEKKAIAAYYASVSYMDAQVGKVLKTLKEEGLEDN 326 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMM 336 T V++TSDHG +G H+ K +++++ ++PLII+ P + + +DL PT+ Sbjct: 327 TIVVFTSDHGFHLGEHEFWMK-VSLHEESVKVPLIIKVPGKKPAVCHSFTELLDLYPTIT 385 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 ALA ++ + L GE+++ + + V + + + W + + Sbjct: 386 ALAGLKYSDQLQGESLVNILDEPTYEVRDMAFSVSQGGKSFLLRNEDWAYIQYD--EDAA 443 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLR 450 + EL+D + DP + NL +A + + L + +R + +SL+ Sbjct: 444 SGIELFDMKKDPKQFTNLAQLPEYASIVDSFKEKLKTKLKAVRSNDLNIDYSLK 497 >UniRef50_Q89YS5 N-acetylglucosamine-6-sulfatase n=12 Tax=Bacteroidales RepID=Q89YS5_BACTN Length = 558 Score = 149 bits (377), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 139/520 (26%), Positives = 220/520 (42%), Gaps = 114/520 (21%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN +F+MTD T + CY G + T N+D +A EGIRF++ Y + + P+RA + T Sbjct: 51 KRPNIIFMMTDDHTTQAMSCYGGNLIQTPNMDRIANEGIRFDNCYAVNALSGPSRACILT 110 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYF----GTGE 113 G +++++G N + T + + AGY T IGKWHL G D++ G E Sbjct: 111 GKFSHENGFTDNASTFNGDQQTFPKLLQQAGYQTAMIGKWHLISEPQGFDHWSILSGQHE 170 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 +D D+W DG HI E I+++A++F Sbjct: 171 QGDYYDPDFWEDG---------------------------KHIVEKGYATDIITDKAINF 203 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--------ADFY--YELGEKAQDDLA 223 L+ + ++PF M+ PH + +L + A+ + YE KA + Sbjct: 204 LENRDK-NKPFCMMYHQKAPHRNWMPAPRHLGIFNNTIFPEPANLFDDYEGRGKAAREQD 262 Query: 224 NKPEH---------------------HRLWA--QAMPSPVGD--DGLYHHPL-------- 250 EH +RL++ + MPS V D D Y + Sbjct: 263 MSIEHTLTNDWDLKLLTREEMLKDTTNRLYSVYKRMPSEVQDKWDSAYAQRIAEYRKGDL 322 Query: 251 ----------------YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHK 293 Y A VD+ IGR++N L + +NT ++YTSD G +G H Sbjct: 323 KGKALISWKYQQYMRDYLATVLAVDENIGRLLNYLEKIGELDNTIIVYTSDQGFFLGEHG 382 Query: 294 LISKGAAMYDDITRIPLIIRSPQGERR-QVDTPVS-HIDLLPTMMALADIEKPEILPGEN 351 K MY++ R+PLIIR P+ + + +S ++D PT + A +E P + G + Sbjct: 383 WFDK-RFMYEECQRMPLIIRYPKAIKAGSTSSAISMNVDFAPTFLDFAGVEVPSDIQGAS 441 Query: 352 ILAVKEPRG---------VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV--LNLFTSDE 400 + V E G + Y EH S +R T DFKL+ N E Sbjct: 442 LKPVLENEGKTPADWRKAAYYHYYEYPAEH-SVKRHYGIR---TQDFKLIHFYNDIDEWE 497 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +YD + DP EM+N+ +A + ++ L + + +D Sbjct: 498 MYDMKADPREMNNIFGKAEYAKKQKELMQLLEETQKQYKD 537 >UniRef50_B9XND0 Sulfatase n=3 Tax=Bacteria RepID=B9XND0_9BACT Length = 492 Score = 149 bits (377), Expect = 2e-34, Method: Compositional matrix adjust. 
Identities = 135/479 (28%), Positives = 204/479 (42%), Gaps = 79/479 (16%) Query: 4 PNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN LF++ D +G Y+G P + T ++D L +E + F +A + PVC+P RA L TG Sbjct: 49 PNVLFIIADQWRAEAMG-YNGNPDVKTPHLDHLQSESVDFVNAVSSVPVCSPTRASLMTG 107 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH--------------DY 108 A G + N+V T+ + AGY T IGKWHLDGH DY Sbjct: 108 QRALTHGVFVNDVPLSPKAITLSKVLHQAGYDTACIGKWHLDGHGRSQFIPRERRQNFDY 167 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 + EC +++ ++F + L +G + F H S Sbjct: 168 WKVLECTHQYNNSFYF---------ADLPFKLKWDGY------------DVFAQTHDASQ 206 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +L+ + A +PF + +S+ PH P+ T P Y +Y K + L N P Sbjct: 207 ----YLRNHSHAKKPFFLYLSWGPPHDPYQTAPATYRSQYQ------AAKIKTRL-NVPP 255 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG 286 R AQ + G Y H C +D +G ++ L E NT VI+TSDHG Sbjct: 256 GMRASAQTNLA-----GYYSH-----CTA-IDSCVGTLLQTLKDTGLETNTLVIFTSDHG 304 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEK 343 +M+ +H L+ K +D+ R+PL++R P G + R++D P + D +PT++ L Sbjct: 305 DMLHSHGLVKKQHP-FDESIRVPLLMRWPAGLGTQPRKLDAPFNSPDFMPTILGLCGAPV 363 Query: 344 PEILPGENILA-----VKEPRGVM-----VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL 393 P + G + A V G V F Y +H G R T + V Sbjct: 364 PNTVEGIDYSAYLQGDVNPSDGATLISCPVPFGEYSRQH----GGREYRGIRTTRYTYVR 419 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 +L L+D DP +M NL+ A + + LL + + D F Q L W Sbjct: 420 DLNGPWLLFDNLEDPAQMDNLVGQPECAQLEEDLEKILLQKLAEANDQFLPGQAYLDRW 478 >UniRef50_B6B0A5 Putative sulfatase YidJ n=1 Tax=Rhodobacterales bacterium HTCC2083 RepID=B6B0A5_9RHOB Length = 520 Score = 149 bits (375), Expect = 3e-34, Method: Compositional matrix adjust. 
Identities = 141/508 (27%), Positives = 220/508 (43%), Gaps = 78/508 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PNFL +TD + +GC + T NID++AA+G RF + PVC P RA L T Sbjct: 3 QKPNFLLFITDQHRADWLGCAGHPVVRTPNIDAIAAKGTRFEDFHVALPVCMPNRASLMT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGK--------------------- 100 G + G N + +T GY T GK Sbjct: 63 GRMPSVHGLRHNGCLLSERANTFVDVLAAGGYATASFGKSHLQPFTDSPPLRKEDHVERL 122 Query: 101 ----WHLDGHDYFGTGECPPEWD--ADYWFDGANYLSELTEKEIS-----------LWRN 143 W D Y T E P +D +D F+ Y + E S +R Sbjct: 123 VPEAWKSDTGKY--TKEEPASFDKPSDAPFETPYYGFQHVEMATSHGDQCGGQYGQWFRE 180 Query: 144 GLNSVEDLQ--ANHI--DETFTWAHR------------ISNRAVDFLQQPARADEPFLMV 187 + + ++L AN + D T A+R ++++A+ +L++ +PF Sbjct: 181 TVPNWQELHDPANELAHDYTCPQAYRTPIPEDKYPTAWVADKAISYLEERKAGGDPFFAF 240 Query: 188 VSYDEPHHPFTCPVEYLEKY--ADFYYELGEKAQDDLANKPEHH-RLWAQ----AMPSPV 240 VS+ +PHHPF P +Y + Y DF +L A + +H ++W A+P Sbjct: 241 VSFPDPHHPFNPPGKYWDMYDPNDFDVDLPYDAHQNPTPPMQHETKMWETGEEPAIPQFA 300 Query: 241 GDDGLYH----HPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMGAHKLI 295 H L +DD IGRVI+AL + +NT +I+TSDHG+ +G L+ Sbjct: 301 FRASDQHVREAKALTAGMITMIDDHIGRVIDALKASGEYDNTVIIFTSDHGDYLGDFNLM 360 Query: 296 SKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 KGA ITR+P+I P + Q +T S ID+ +++ A + + G + + Sbjct: 361 LKGAIPLPSITRVPMIWSDPATRKGAQTETLASTIDISASILDRAGLAPYNGIQGSSFIE 420 Query: 355 VKEPRGVMVEFNRYEIEHDSFG---GF---IPVRCWVTDDFKL-VLNLFTSDELYDRRND 407 + G V + IEH+ G GF + R T D+++ + T ELYD D Sbjct: 421 ALD--GTQVHRDEVMIEHNDGGPRMGFTKAVRARTLRTKDWRMSIFAGETWGELYDLNTD 478 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYM 435 P E +NL DD DVR+++ L+D++ Sbjct: 479 PRECNNLWDDPSAKDVRAELTLRLVDHL 506 >UniRef50_A4CMA4 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Flavobacteriales RepID=A4CMA4_9FLAO Length = 490 Score = 149 bits (375), Expect = 3e-34, Method: Compositional matrix adjust. 
Identities = 139/488 (28%), Positives = 228/488 (46%), Gaps = 54/488 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+ N +F++TD + +G P L T N+D LAAEG +A+ + +C+P+RA + Sbjct: 28 KQRNVIFILTDDHRFDYMGFTGKVPWLETPNMDRLAAEGAYLPNAFVTTSLCSPSRASIL 87 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG++++ N N++ +Y ++AGY T ++GKWH+ H T E P +D Sbjct: 88 TGMFSHTHTIVDNQAPNPGNLTYFPQYLQEAGYQTAFLGKWHMSSH----TDEPRPGFDH 143 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 F G ++ N ++ + + D T+ ++ AVD+L+ + Sbjct: 144 WESFFGQ-----------GVYYNPTLNINGERIEYKDSTYI-TDLLTEHAVDWLESRDK- 190 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG---EKAQDDLANKPEHHRLWAQAMP 237 D+PF + +S+ H F + +YA EL E+ + + A Sbjct: 191 DKPFFLYLSHKAVHAEFQPARRHKGRYAGKKIELPPTYEQTKTGAWRDLKWPEWVADQRV 250 Query: 238 SPVGDDGLYHHPL--------YFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM 288 S G D +YH + Y VDD +G V+ L E E T VIY D+G Sbjct: 251 SWHGVDYMYHSNIDMQELVQAYCETLLGVDDSVGAVLEYLEEEGLDEETLVIYMGDNGFS 310 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQV-DTPVSHIDLLPTMMALADIEKPEI 346 G H LI K Y++ ++PL++R P+ E QV V +ID+ PT++A A + +P+ Sbjct: 311 WGEHGLIDK-RHFYEESVKVPLLVRCPELFEGGQVPQDMVQNIDIGPTVLAEAGVAQPDD 369 Query: 347 LPGENILAV----KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL--NLFTSDE 400 +PG + + + K+ F Y E+D F V TD +K + ++ +E Sbjct: 370 MPGVSFIPILTGDKDATKRDKIFYEYYWEND-FPMTPTVFGMRTDKYKYIRYHGIWDRNE 428 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM---DKIRDPFRSYQWSLRPWRKDAR 457 LYD NDP+EM+NLI D +V M D+L +++ D ++ P +S R Sbjct: 429 LYDLENDPHEMYNLIGDPEKQEVIQTMLDSLYNWLETTDGMKIPLKSTD----------R 478 Query: 458 PRWMGAFR 465 PRW G +R Sbjct: 479 PRW-GDYR 485 >UniRef50_Q0K3Z4 Arylsulfatase A n=4 Tax=Burkholderiales RepID=Q0K3Z4_RALEH Length = 537 Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust. 
Identities = 140/516 (27%), Positives = 194/516 (37%), Gaps = 86/516 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPNFL +TD + +G Y + T NID L + G Y SP+C P RA L TG Sbjct: 11 RPNFLLFITDQHRADHLGIYGNAVVQTPNIDRLGSHGWVAERCYVASPICMPNRASLMTG 70 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG------------ 110 + G N + + +T +DAGY T +GK HL Sbjct: 71 RVPSVHGARHNGIPLPLSQTTFVERLRDAGYRTGLVGKSHLQNMTGLAPVWPRPGDPRTE 130 Query: 111 -------TGECPPEW------DADYWFD-------------------GANYLSELTEKEI 138 +G EW D D+ G +Y L ++ Sbjct: 131 GEARRPESGRFDQEWGPAWRDDPDFAMSLPYYGFADVQLVTDHGDTAGGHYRRWLEQRHP 190 Query: 139 SLWRN----------GLNSVEDLQA--NHIDETFTWAHRISNRAVDFLQQPARADEPFLM 186 + R G E QA + ET + I +R +D L Q A+ D PF++ Sbjct: 191 EVARACGQEHAIPTPGYVLAEARQAWRTRVPETLSTTAWIGDRTIDLLGQYAQEDAPFMI 250 Query: 187 VVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLY 246 S+ +PHHPFT P + + Y L D A P H R W A + Sbjct: 251 QCSFPDPHHPFTPPGRFWDMYRPEDMTLPPSFDADPALAPPHLR-WMHAQRD--AGRAVK 307 Query: 247 HHPLYFACN---------------DFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMG 290 H P FAC +DD IGRV+ L + NT V++TSDHG+ G Sbjct: 308 HTPAMFACTAREAREALALNYGSIAHIDDTIGRVMAHLAALGLDRNTVVLFTSDHGDFFG 367 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGERRQV------DTPVSHIDLLPTMMALADIEKP 344 H+L+ KG Y + R PLI P S ID+ P+++A A Sbjct: 368 DHQLLWKGPLHYQGLIRTPLIWSEPAARTATDARAARSQALCSSIDIAPSILARAGCAPY 427 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLVLNLFTS---D 399 + G ++L + E E E+ FG + R + L+++ Sbjct: 428 NGMQGLSLLPAIAGEALPREALLIEEENQRTMFGFPMRTRMRTLQTARYRLSVYEGAPWG 487 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 ELYD DP E HNL D A R + LL M Sbjct: 488 ELYDLETDPTESHNLWDAPALASTRQALLHQLLVTM 523 >UniRef50_A9MER1 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A9MER1_SALAR Length = 430 Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust. 
Identities = 129/452 (28%), Positives = 197/452 (43%), Gaps = 50/452 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L TD Q + VGCY+ T +D LA EG++F +A+T PVC PAR+ L TG Sbjct: 2 QPNILVFFTDQQRWDTVGCYNPVVSTTPVLDQLAREGVKFENAFTVQPVCGPARSCLQTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC-PPEWDAD 121 Y Q+G + NN+A ++ T+ + F AGY T YIGKWHL D E W Sbjct: 62 RYPTQNGCYRNNIAMRQDEVTLAKLFNQAGYDTAYIGKWHLADLDEKPVLEALRGGW--Q 119 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 YW A+ L + + + N +D+ T+ A+D+L+ R D Sbjct: 120 YWL-AADALEHTSHPYGGHFFDNDNQPVHFDGYRVDDQTTF-------ALDYLKNRQR-D 170 Query: 182 EPFLMVVSYDEPHHP-----FTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 PFL+ +SY EPH F P Y E++ DL N+P W Q + Sbjct: 171 NPFLLFLSYLEPHFQNDMARFVAPDGYAERFQT------ASVPPDLINRPGD---WPQNL 221 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGAHKLI 295 P Y+ +D+ +GR+++ L + + +NT +++ SDHG Sbjct: 222 PD------------YYGMCQNLDENLGRIVDYLKSSGEYDNTIILFFSDHGCHFRTRNDE 269 Query: 296 SKGAAMYDDITRIPLIIR-SPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 K + I RIP + R P R V+ V+ +D+ TM++ A I P+ + G ++ Sbjct: 270 YKRSCHESSI-RIPCVARGGPFSGGRTVEHLVTLLDIPVTMLSAAGITVPDAMVGRDLQT 328 Query: 355 VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF--------KLVLNLFTSDELYDRRN 406 + G E +I G + W + + ++ +LYD N Sbjct: 329 ALDA-GHWDEEVLIQISESEVGRALRTTRWKYEIVAPGSDPWNESAATIYVESQLYDLLN 387 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 DP E NLI A +R K+ + M I Sbjct: 388 DPWERQNLIASPEHARIRDKLRQDIGRKMTAI 419 >UniRef50_B8FL44 Sulfatase n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FL44_DESAA Length = 468 Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust. 
Identities = 129/437 (29%), Positives = 204/437 (46%), Gaps = 49/437 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++TD + +GC + T N+D LA++G+ FN+++ S +C+P+RA T Sbjct: 50 KKPNVLFILTDDHRYDHMGCAGHPFIKTPNLDRLASQGVYFNNSFVTSSLCSPSRASFLT 109 Query: 62 GIYANQSGPWTNNVAP--GKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G YA+ G NN+ P N++ + R FK GY T ++GKWH+ GE P Sbjct: 110 GQYAHTHGV-QNNLTPWDNGNVTFLER-FKQEGYDTAFLGKWHM-------PGELPKLRG 160 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN--HIDETFTWAHRISNRAVDFLQQP 177 D + + + L NG ED + N +I E T +RA++F+ + Sbjct: 161 VDEFVTFTVRGGQGQYWDCPLIVNG----EDAKPNKRYITEELT------DRAINFIDR- 209 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 +D PF + +S+ HH + P + + Y+D L E+A W Sbjct: 210 -ESDNPFCLYLSHKAAHHDWKPPTDLKDLYSDEELPLAEEAD-----------TWVTMTN 257 Query: 238 SPV--GDDGL--YHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 V G G YH+ Y VD Q+GR++ L + +NT V+Y D+G G H Sbjct: 258 GAVFCGTTGTLQYHYRNYCRVVASVDRQVGRLLKFLEDKGLADNTIVVYAGDNGYFWGEH 317 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ---GERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 + I K A Y++ RIP +IR+P R+ D +IDL PT+ LA IE + G Sbjct: 318 RKIDKRWA-YEESIRIPFMIRAPGVVPDPGRKADQMALNIDLAPTLFDLAGIEPHAGMEG 376 Query: 350 ENILAV-KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD--ELYDRRN 406 +++ + + R E YE D + +P + + + +S E YD + Sbjct: 377 QSLAPILRNGRTPGREAWLYEYFKD-YPYNVPAIQAIRTQNNIYIEYESSRKPEYYDLQA 435 Query: 407 DPNEMHNLIDDIRFADV 423 DP E N+ D + AD+ Sbjct: 436 DPKEKQNIYDQLEAADI 452 >UniRef50_UPI0001C3580F sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C3580F Length = 471 Score = 148 bits (373), Expect = 5e-34, Method: Compositional matrix adjust. 
Identities = 137/506 (27%), Positives = 207/506 (40%), Gaps = 113/506 (22%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N L+V D VG + T NID + E + +AY+ P+C+P RA L TG Sbjct: 2 KTNLLYVFADQWRAEAVGSLGSDQVVTPNIDRFSEESVCCTNAYSTFPLCSPHRASLMTG 61 Query: 63 IYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD----------- 104 Y + G WTN + + + KDAG+ T YIGKWHLD Sbjct: 62 KYPFRLGMWTNCKIGLEEKIMLKPQETCIANVLKDAGFATGYIGKWHLDASELNFSPHPK 121 Query: 105 -----------------GHDYFGT-GECPPEWDADYWFDGANYLSELTEKEISLWRNGLN 146 G DYF + G C D YW D + T+ + W Sbjct: 122 SGAGEWDAYTPPGERRQGFDYFLSYGACDDHLDPHYWLD------DETQIKPGKWS---- 171 Query: 147 SVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLE 205 A +++A++++ Q +EPF + VSY+ PH P+ P Y E Sbjct: 172 ----------------AEFETDKAIEYMNQKKDGEEPFALFVSYNPPHLPYELVPERYYE 215 Query: 206 KYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRV 265 K+ + N PE R + + YFA +D+Q GR+ Sbjct: 216 KFKNLKVHY-------RPNVPESMREEGGLLETQTRQ--------YFAAVHGIDEQFGRI 260 Query: 266 INALTPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QV 322 + L E T V+ ++DHGEM+G+H L+SK YD+ IPLI R +G + + Sbjct: 261 LAWLKENGMEEKTLVVLSADHGEMLGSHGLMSKN-IWYDEALHIPLIFRQ-KGRLKPGKN 318 Query: 323 DTPVSHIDLLPTMMALADIEKPEILPG-----------------ENILAVKEPRG--VMV 363 D + D +PT++ L D+ PE G E++L P G ++ Sbjct: 319 DVIFASPDHMPTLLELLDLAVPETCEGYSHADSLIRGSAVPGEPEDMLICSYPGGADMVA 378 Query: 364 EFNRYEIEHDSFG-GFIPVRCWVTDDFKLVLNLFTSDE-----LYDRRNDPNEMHNLIDD 417 F++ + H ++G I R + ++ N + DE LYDR DP EM+ + Sbjct: 379 AFSKRGLTHKAYGWRGIRNRRYTY----VITNGYAPDEPQREFLYDRELDPYEMNPAAIE 434 Query: 418 IRFADVRS-KMHDALLDYMDKIRDPF 442 D R + L DY++ DPF Sbjct: 435 KDCTDERILAFRERLKDYLELTEDPF 460 >UniRef50_Q127E2 Sulfatase n=1 Tax=Polaromonas sp. JS666 RepID=Q127E2_POLSJ Length = 511 Score = 148 bits (373), Expect = 5e-34, Method: Compositional matrix adjust. 
Identities = 134/499 (26%), Positives = 210/499 (42%), Gaps = 74/499 (14%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN L + TD + +G ++G+ + T +ID +A G F S T + VC P+RA + Sbjct: 1 MKRPNILLITTDQHRGDCLG-FAGRKVKTPHIDEMARTGTHFTSCITPNIVCQPSRASIL 59 Query: 61 TGIYANQSGPWTNNVA--PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF---GTGECP 115 TG+ G N + + + +GY T +IGK H H F G EC Sbjct: 60 TGLLPLTHGVCDNGIDLDEARGEAGFAGTLASSGYSTGFIGKAHFSTHHTFAKTGRPECQ 119 Query: 116 -------PEW---------------DADYWF-----DGANYLSELTEKEISLWRNGLNSV 148 P W +YW G ++ + RN L Sbjct: 120 FSEADYGPAWYGPYMGFEHVELAVEGHNYWLPTPLPGGLHHSRWYYGDGLGEMRNRLYQQ 179 Query: 149 EDLQANHIDETF-----------TWAHRISNRAVDFLQQPA-RADEPFLMVVSYDEPHHP 196 + + +TF TW I +R ++F+++ A A + F + S+ +PHHP Sbjct: 180 DMGPPSGAPQTFNSALPSAWHNSTW---IGDRTIEFMRKHAGEAAKRFCLWASFPDPHHP 236 Query: 197 FTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLY-------HHP 249 F CP + + +L D +P H+ A PVGD + P Sbjct: 237 FDCPEPWSRLHHPDEVDLPAHRTTDFERRPWWHK--ASMDSKPVGDAAVQALRQNFSRMP 294 Query: 250 L------------YFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLIS 296 Y+ VD Q+GR+ AL + NT VI+TSDHGE +G H L+ Sbjct: 295 TPAEQQLRNITANYYGMISLVDHQVGRIQTALQQLGLDGNTLVIFTSDHGEWLGDHGLML 354 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 KG Y+ + R+ +++ PQ + QV PVS +DL T A L G+++ + Sbjct: 355 KGPIPYEGVLRVGMVVNGPQVQAGQVRHEPVSTLDLAATFADYATATALAPLHGQSLRPL 414 Query: 356 KEPRGVMVEF--NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-SDELYDRRNDPNEMH 412 E +F + + + G + +R T+++KL L + + E+Y DPNEM Sbjct: 415 LEGGQQTRDFALSEWNVAASRCGLELQLRTVRTENWKLTLEQNSGAGEMYCLSEDPNEMD 474 Query: 413 NLIDDIRFADVRSKMHDAL 431 NL DD + R ++ D + Sbjct: 475 NLFDDPGYTAKRKELSDMI 493 >UniRef50_UPI000051016C choline-sulfatase n=1 Tax=Brevibacterium linens BL2 RepID=UPI000051016C Length = 509 Score = 148 bits (373), Expect = 6e-34, Method: Compositional matrix adjust. 
Identities = 139/506 (27%), Positives = 224/506 (44%), Gaps = 50/506 (9%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M+ PN + + D A +G Y T N+D+LAA+G F+ AY +P+C+P+RA + Sbjct: 1 MQPPNIVVIQADQMAAQALGAYGDTAALTPNMDALAADGAVFDRAYCNTPLCSPSRASMM 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG + N ++ T + GYHT IG+ H G D E D Sbjct: 61 TGRMPSDIDCLDNGDDFAASVPTFAHRLRKLGYHTALIGRMHFIGPDQHHGFEERLTTDV 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH-IDETFTWAHRISNRAVDF------ 173 Y ++L + W+ L+ + LQ H D FT +N DF Sbjct: 121 --------YPADL--DMVPDWQRPLD--QKLQWYHEADPVFTAGAAKANVQQDFDDEVIF 168 Query: 174 -----------LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADF-YYELGEKAQDD 221 Q A D+PFLMV S+ PH P+ P E+ +++A+ + D Sbjct: 169 RTLRHLNGRVRANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAEVDIPDPAHPEVPD 228 Query: 222 LANKPEHHRLWAQA---MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENT 277 +A P HRL + P +D Y+A ++DD IG++ L E +NT Sbjct: 229 IAEDPHSHRLRTMSGLDKKEPGTEDIRRARRAYYAAVSYIDDHIGKIRQRLRELELEDNT 288 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTM 335 +I TSDHG+M+G L K + Y+ +R+P+II P + PVS +DL+PT+ Sbjct: 289 VIIVTSDHGDMLGEKGLWYK-MSPYEQSSRVPIIINGPAEAVTPGRYANPVSLVDLMPTL 347 Query: 336 MALADIEKPEILPGENIL--AVKEPRGVMVEFNR-YEIEHDSFGGFIPVRCWVTDDFKLV 392 + LA P+ G ++ A +E G +R IE+ + G + P + +KL Sbjct: 348 LELAGTSDPDAT-GVSLFESARQEAAGETGPADRDVIIEYFAEGTYRPQVTLIRGQYKLT 406 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL-----LDYMDK-IRDPFRSYQ 446 + + L+D +DP+E+ N D +A++ + M L L+++++ + S Q Sbjct: 407 ICPGDPELLFDLESDPDELVNRAGDAAYAELVATMRAELDSRYDLEHLEEHVLGSQSSRQ 466 Query: 447 WSLRPWRKDARPRWMGAFRPRPQDGY 472 + W F P P++GY Sbjct: 467 LVADALKIGTVRHW--DFDPEPENGY 490 >UniRef50_C7MEQ7 Choline-sulfatase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MEQ7_BRAFD Length = 520 Score = 148 bits (373), Expect = 6e-34, Method: Compositional matrix adjust. 
Identities = 127/468 (27%), Positives = 207/468 (44%), Gaps = 54/468 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + + D A +G Y T ++D+LAAE F+ AY +P+C P+RA + TG Sbjct: 4 RPNIVVIQADQMAAQALGAYGDTAARTPHMDALAAEAAVFDRAYCNTPLCAPSRASMMTG 63 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY-------FGTGECP 115 + + N +I T + + AGYHT +G+ H G D T P Sbjct: 64 RMPSDIDCFDNGSDFAASIPTFAHHLRAAGYHTALVGRMHFIGPDQHHGFEQRLTTDVYP 123 Query: 116 PEWDA--DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 + D D+ D + L W + ++V + + + RA+ Sbjct: 124 ADMDMVPDWQRDLGDRLQ---------WYHDADAVHTAGVSQATVQLDFDDEVGFRALRH 174 Query: 174 LQQPARADE------PFLMVVSYDEPHHPFTCPVEYLEKYADF------YYELGEKAQDD 221 L RAD+ PFLMV S+ PH P+ P E+ +++AD + E+ + AQD Sbjct: 175 LNDRVRADQAAGERVPFLMVASFIHPHDPYEPPQEHWDRFADVDIPAPRHPEVPDPAQD- 233 Query: 222 LANKPEHHRLWAQA---MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENT 277 P HRL A + ++ Y+A ++DD +GR+ L E+T Sbjct: 234 ----PHSHRLRAMSGFDQRETTEEEVRRARRSYYAAVSYIDDHVGRIRERLESLGLWEDT 289 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTM 335 V+ TSDHG+M+G L K + Y++ +R+PLI+ P+ + PVS +DL+PT+ Sbjct: 290 VVVVTSDHGDMLGEKGLWFK-MSPYEESSRVPLILHGPEHLVPAGRYANPVSLLDLMPTL 348 Query: 336 MALADIEKPEIL---------PGENIL--AVKEPRGVMVEFNR-YEIEHDSFGGFIPVRC 383 + L + G ++L A +E G +R IE+ + G P Sbjct: 349 LELGGADGATSAAAEATTPARQGLSLLESARRERSGTAGPADRDVIIEYLAEGTLRPQLT 408 Query: 384 WVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 V K V+ D+L+D DP+E N+ D A++ +++ A+ Sbjct: 409 LVRGQHKFVVCPGDPDQLFDLHTDPHERTNIAADPAQAELVAELRAAV 456 >UniRef50_C6MEX8 Sulfatase n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MEX8_9PROT Length = 463 Score = 147 bits (372), Expect = 7e-34, Method: Compositional matrix adjust. 
Identities = 132/476 (27%), Positives = 207/476 (43%), Gaps = 83/476 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + RPN + ++TD N + C ++T IDS+AA G+RF +Y P+CTP+RA L Sbjct: 29 VDRPNIILILTDQHYANAMSCAGNTNVSTPAIDSIAANGVRFVKSYCAFPLCTPSRAALI 88 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL-------DGHDYFGTGE 113 + G N+ ++I T+G F++AGY T + GKWH+ +D G Sbjct: 89 ASRMPYELGVSGNDQGIPEDIETLGETFQNAGYQTFWSGKWHVPIGIPEPGSNDVRGFDV 148 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 P +D Y + E +I NSV+ + + F Sbjct: 149 LP--------YDDPFYNGNIAELDI-------NSVKSV-------------------IKF 174 Query: 174 LQQPARADEPFLMVVSYDEPH----------HPF---TCPVEYLEKYADFYYELGEKAQD 220 L++ + PFL+ +S++ PH + F T PV ++FY E Q Sbjct: 175 LRE--KPINPFLLSISFNNPHDITHVNEKTINEFGIPTNPVLLPHLASNFYAPDLEFGQA 232 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGL---YHHPLYFACNDFVDDQIGRVINALTPEQRE-N 276 + P A+ S +G L H Y+ + VD QIG ++ + E N Sbjct: 233 SGSTTPP-------AINSNIGWSNLNFRRHSYQYYRFIETVDSQIGEILEGIKLAGLESN 285 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPT 334 T++I+T DHGEM G+H+ I K +MY++ +P II+ P V+ +S +DL PT Sbjct: 286 TYLIFTGDHGEMNGSHRRIRK-FSMYNEALSVPFIIKGPGIPEGVVNRKHLISALDLYPT 344 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + A I P L G++ A+K R +Y G F R V+ +K L Sbjct: 345 LCDAAGITTPLGLRGKS--ALKVLRNFNTPLRKYAFAQ--IGNFFS-RVAVSTRYKYALF 399 Query: 395 LFT--------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 L T + +D DP E NLI+++ + +AL ++M DPF Sbjct: 400 LKTHTAYDIPIREAFFDMEADPGETKNLINNLALKPIIDNHRNALNNWMIHTNDPF 455 >UniRef50_Q7UW58 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UW58_RHOBA Length = 541 Score = 147 bits (372), Expect = 8e-34, Method: Compositional matrix adjust. 
Identities = 125/435 (28%), Positives = 207/435 (47%), Gaps = 32/435 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +R N LF+++D T +GCY + T NID LAA G+ F +A P+C P+R + Sbjct: 65 QRKNVLFLISDDLNTR-IGCYGDPIVQTPNIDRLAARGVLFENAACQYPLCGPSRNSMLC 123 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG--ECP 115 G+Y + +G N +I ++ + F+ GY +GK +H + GT + P Sbjct: 124 GLYPDTTGIHGNAQIFRDSIPERWSLPQAFRLDGYFAGRVGKLYHYNVPKSVGTNGHDDP 183 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGL--NSVEDLQANHIDETFTWAHRISNRAVDF 173 W+ + G + L E E +I R G ++ + DE T + + Sbjct: 184 ASWELELNPAGCDRLIE--EPDIFTLRKGAFGGTLSWYASPRPDEAHTDGMLADDASWVL 241 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 + R D PF + V + PH P+ P EY E Y L + ++D A+ P L + Sbjct: 242 ERCAKRNDRPFFLAVGFYRPHTPYVAPKEYFEPYKLEDMPLFDNVEEDNADVPAAA-LLS 300 Query: 234 QAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMG 290 + + +D L + Y+A F+D Q+G+V++ L + NT V++TSDHG +G Sbjct: 301 KKKEQDLLNDELRRQAIQAYYASTTFMDAQVGKVLDTLKRTGLDKNTIVVFTSDHGYFLG 360 Query: 291 AHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 L K A++D + +PLII P + E +PV +DL PT+ L D+ +++ G Sbjct: 361 EKGLWQK-QALFDKVAGVPLIIAEPGRTEGAIAKSPVGLVDLYPTLAELCDVPTQKLMQG 419 Query: 350 ENIL-AVKEP----RG---VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL--NLFTSD 399 ++++ +++P RG MV N + + +G I T+ ++L L + Sbjct: 420 QSLVPMLRDPSQTGRGYSMSMVARNDRQTKQRYYGYSI-----RTERYRLTLWDDGKRGT 474 Query: 400 ELYDRRNDPNEMHNL 414 ELYD +NDP E NL Sbjct: 475 ELYDHQNDPEEFTNL 489 >UniRef50_A4A280 Iduronate-2-sulfatase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A280_9PLAN Length = 475 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. 
Identities = 123/465 (26%), Positives = 208/465 (44%), Gaps = 49/465 (10%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+++D + + CY + T NID LA G++F AY PVC P+RA L +G++ Sbjct: 27 NVLFIISDDLSAESLSCYGHRECQTPNIDRLAQRGVKFTHAYCQYPVCGPSRAALMSGLH 86 Query: 65 A--------NQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WHL----------DG 105 A QS +T N+ + ++M ++F+D GY+ + K +H+ +G Sbjct: 87 AATIGVMGNGQSTRFTQNLG---DRASMSQHFRDQGYYAARVSKIYHMRIPGDITAGTNG 143 Query: 106 HDYFGTGE----C-PPEWDADYWFDGANYLSELTEKEI-SLWRNGLNSVEDLQANHIDET 159 D+ + + C PEW + D A Y +E K+ + G + D Sbjct: 144 DDHAASWDERFNCQAPEWMS--AGDAATYSNEKLNKDPDKHYGLGFGTAFYAVKASTDGA 201 Query: 160 FTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 H+ +++A++ L++ +E F + V PH P P ++ E YAD EL K Sbjct: 202 EQADHKAADKAIELLRK--HKEERFFLAVGMVRPHVPLVAPAKFFEPYADGQMELPLKVA 259 Query: 220 ---DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE- 275 DD+ A M + L Y+A ++D Q+GRV++ L + Sbjct: 260 GDWDDIPKAGISRNSKATGMTLEGQRNTL---SAYYAAVAYMDYQVGRVLDELHQLGLDK 316 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTM 335 NT V++T+DHG +G H K +++++ T IPLI+ P + + V+ + ID+ PT+ Sbjct: 317 NTVVVFTADHGYHLGEHDFWQK-MSLHEESTHIPLIVAIPGEQPKVVNGLAAQIDIYPTL 375 Query: 336 MALADIEKPEILPG-ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 L ++ P L G + A+ P + D + TD + + Sbjct: 376 AQLCELPVPTYLQGVSQVAAIASPDAA--------VRDDVLCMTSKGKLLRTDRYAYISY 427 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ++ELYD ++DP + NL D V K+ L + D R Sbjct: 428 SGGTEELYDMQSDPQQYTNLAKDPASQPVLGKLRAQLKERADLPR 472 >UniRef50_A6C9F6 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C9F6_9PLAN Length = 506 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. 
Identities = 124/456 (27%), Positives = 205/456 (44%), Gaps = 38/456 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF++ D ++ GCY + + NID LA +G+RF AY P+C P+RA TG Sbjct: 45 KPNVLFLICDDLNCDL-GCYGHPQVQSPNIDQLAKQGVRFEHAYCQFPLCGPSRASFMTG 103 Query: 63 IYANQS-----GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG--EC 114 +Y +Q+ G + P N+ TM + F+D GY +GK +H + + GT + Sbjct: 104 MYPDQTLVHRNGIYIREHVP--NVKTMSQMFRDHGYFATRVGKIYHYNVPKHIGTSGHDD 161 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P W+ + G + E ++ SL A + ++ A+ L Sbjct: 162 PYSWNQTFNPRGRDVDDE--DQIFSLVPGSYGGTLSWLAAEGTDAEQTDGIAADIAIQQL 219 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 ++ A + EPF + V PH P+ P Y EKY ++ + L P R Sbjct: 220 KKFAESKEPFFLAVGLYRPHTPYVAPKSYFEKYPVEQIKVPQIPDGYLKTIPASARKSVT 279 Query: 235 AMPSPVG-DDGLYHHPL--YFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG 290 + D L + Y+A F D Q+G +++AL ENT V++TSDHG MG Sbjct: 280 RKKDQIDLPDKLARQAIQAYYASITFADAQLGHILSALKETGLDENTIVVFTSDHGYHMG 339 Query: 291 AHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPEILP 348 H K ++++ T +P+II P + + P +D PT+ L ++ P + Sbjct: 340 EHGHWQK-TTLFENATHVPMIIAGPGVTAKGQAAAAPAEMVDFYPTLAELCGLKAPASVS 398 Query: 349 G-ENILAVKE----PRGVMVE--FNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDEL 401 G + A+K+ PR + N Y + +F W T+ + V EL Sbjct: 399 GISQVPALKDATATPRKTALTQYLNGYSLRTPTFR----YTEWGTNGSEGV-------EL 447 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 YD +DP EMHNL + + +R ++ + L + +++ Sbjct: 448 YDHSSDPAEMHNLANQAKTQKLRDELAEILHERIEQ 483 >UniRef50_UPI0001C3604A sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C3604A Length = 497 Score = 146 bits (368), Expect = 2e-33, Method: Compositional matrix adjust. 
Identities = 138/486 (28%), Positives = 210/486 (43%), Gaps = 58/486 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ NFL MTD Q SG N++ L G+ F AY SP C P+RA F+ Sbjct: 6 RKSNFLIFMTDQQLGTTQ--QSGGSAYMPNLERLKRHGVTFQEAYCPSPHCCPSRASFFS 63 Query: 62 GIYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD-----GHDYF 109 G+Y ++ G W N + + I K+AGY + GKWH+ G F Sbjct: 64 GLYPSEHGIWNNIDLADAFSHGLNEGIRLFSEDLKEAGYQMYFSGKWHVSAEEGPGDRGF 123 Query: 110 GTGECPPE---------------WDADYWFDGANYLSELTEK---EISLWRNGLNSVEDL 151 G PP+ WD W +Y++E+ + EIS R G Sbjct: 124 GPWIYPPQEGRYKQWKKTPFMGDWD---WLLEDDYITEVRGRGDGEIS--RVGFPPYTQY 178 Query: 152 QANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY 211 + E + + A D+L++ + +EPF M V PH P+ EYL+ Y D Sbjct: 179 G---VKENPFGDGDVVSCAEDWLEKVSE-EEPFCMYVGTLGPHDPYFVQEEYLDLYPDES 234 Query: 212 YELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP 271 EL D + +KP +R + + Y A + D GR+++ L Sbjct: 235 MELPVSWTDRMLDKPNLYRRTRERFDQLDETEQKRSLKHYLAFCSYEDALFGRLLDVL-- 292 Query: 272 EQR---ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR--SPQGERRQVDTPV 326 E+R ENT V+Y SDHG+ GAH L +KG + + I ++ + E + V Sbjct: 293 ERRNLLENTVVLYVSDHGDYAGAHGLWTKGLPCFKEAYHICSVMGYGGIKAEGAVIGHRV 352 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPR----GVMVEFNRYEIEHDSFGG--FIP 380 S +D PT + LA I K +++ G + + G + E R E S G + Sbjct: 353 SLLDYAPTFLDLAGILKADVIAGRQRFSGYSLKPFLEGRIPENWREETYTQSNGNECYGI 412 Query: 381 VRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM----D 436 R TD++ LV N F DELYD R DP+ M N+I++ ++ V ++ L + D Sbjct: 413 QRSIFTDEYHLVFNGFDYDELYDLRKDPDCMKNVIEEQQYETVIYDLYRKLWRFAYLHRD 472 Query: 437 KIRDPF 442 + DP+ Sbjct: 473 ALGDPY 478 >UniRef50_Q02B50 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02B50_SOLUE Length = 478 Score = 145 bits (367), Expect = 3e-33, Method: Compositional matrix adjust. 
Identities = 122/369 (33%), Positives = 173/369 (46%), Gaps = 48/369 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLN-TQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L +++D + +G P+N T N+D +A+ G+ F SA + PVC PARA +FT Sbjct: 34 RPNVLLIISDQFRWDCIGAMGLNPMNLTPNLDGMASRGVLFRSAISNQPVCAPARASIFT 93 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD--GHDYFGT-GECPPEW 118 G Y ++ G W N + N T+G K AGY T YIGKWHL D T G PE Sbjct: 94 GQYPSRHGVWRNGLGLAANAVTLGSAMKQAGYSTNYIGKWHLSPGAADTPETRGPVKPEN 153 Query: 119 DA---DYWFDGANYLSELTEK--EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 D W + AN L ELT E L+ D + H + A +++RA F Sbjct: 154 RGGFQDLW-EAANVL-ELTSHAYEGDLFDG------DGKPLHFSNRYR-ADFMTDRAQLF 204 Query: 174 LQQPARADEPFLMVVSYDEPHH-----PFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 L+ A A PFL+ +SY E HH F P E+ +Y + + P+ Sbjct: 205 LRSRA-ARSPFLLTLSYLEVHHQNDKDTFDPPKEFAGRYPNPFV-------------PQD 250 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE 287 R PS + D YFAC +D+ +G + L +NT V++TSDHG Sbjct: 251 LRPLPGTWPSQLAD--------YFACVAKMDEIVGTLRKTLVETGLDKNTIVMFTSDHGN 302 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEI 346 K + I IPL++ P R +V+ VSH+D+ PT++A A +E P Sbjct: 303 HFRTRNAEYKRSPHESSI-HIPLVMEGPGFNRGMEVNQLVSHVDMAPTLLAAAGLEVPAS 361 Query: 347 LPGENILAV 355 + G N L + Sbjct: 362 MQGHNFLPL 370 >UniRef50_A0JVN2 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0JVN2_ARTS2 Length = 449 Score = 145 bits (365), Expect = 4e-33, Method: Compositional matrix adjust. 
Identities = 133/463 (28%), Positives = 201/463 (43%), Gaps = 84/463 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D A + +GC +NT N+D+LAA G RF+ AYT P+C PAR+ L +G Sbjct: 8 QPNILIIMADQWAAHAMGCAGSTVVNTPNLDNLAAAGTRFDRAYTTFPLCVPARSSLVSG 67 Query: 63 IYANQSGPWTNNV----APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 Y ++ G N V PG+ ++G +FK AGY Y GKWH PE Sbjct: 68 RYPHELGIDGNAVPAGSGPGRTPGSLGHWFKAAGYDCAYAGKWHA------------PEA 115 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 A +G + + DE T A+D+L Sbjct: 116 SAQP-------------------EDGFDVIHPFG----DEGLT------ASAIDWLGARH 146 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ--------DDLANKP---- 226 PFL++VS+D PH C EY Y ++ A + A P Sbjct: 147 DTGTPFLLLVSFDNPH--TIC--EYARGQHLPYGDVQRPADIRDAPPLPSNFATTPYSPQ 202 Query: 227 --EHHRLWAQAMPSPV----GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWV 279 H R A+ D LY H Y + D+QIG ++ L + RE T V Sbjct: 203 ALTHERAQAEQAYGTADFSHDDWRLYRH-AYAQLIERTDEQIGVILGELDRQGLRETTVV 261 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH--IDLLPTMMA 337 ++TSDHG+ AH K ++ ++ R+PL++R P QV + + +DL+PT+ + Sbjct: 262 LFTSDHGDGDAAHGWNQK-TSLQEEAIRVPLLMRGPGVGYSQVGSQLISLGLDLIPTLCS 320 Query: 338 LADIEKPEILPGENILAVKEPR----GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL 393 LA I+ P G + + EPR G+ VE + + G R +T +K + Sbjct: 321 LAGIDAPATATGVDW--ITEPRAPGEGITVETAFSAGQRATTLG----RALITGRYKYTV 374 Query: 394 NLFTS--DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 + ++L D DP E+ NL ++ F +V + LLD+ Sbjct: 375 YSWGKHREQLVDLTADPGELRNLAEESAFDEVLEEFRRRLLDW 417 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust. 
Identities = 111/388 (28%), Positives = 171/388 (44%), Gaps = 64/388 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+ D + CY + + NID A EG++ S Y P C+P+RAGL TG Sbjct: 34 RPNIMVVLCDDLGYGDLACYGHPVIQSPNIDRFAKEGLKLTSCYAAHPNCSPSRAGLMTG 93 Query: 63 IYANQSG--PWTNNVAP---GKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + G W ++P K T+ + AGY TC++GKWHL+G P + Sbjct: 94 RTPFRVGIYNWIPMLSPMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNMVGQPQPSD 153 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D+WF N E + RN V LQ + +++ A ++L Q Sbjct: 154 HGFDHWFSTQNNALPTHENPFNFVRNA-RPVGPLQG-------FASQLVADEAEEWLTQL 205 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF M V + EPH P + E++ Y + + P HH Q Sbjct: 206 RDKEKPFFMFVCFHEPHEP----IASAERFRKLY------TAPEGSTLPAHHGNVTQ--- 252 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM------- 289 +DD GR++ L ++ RENT +I+TSD+G + Sbjct: 253 --------------------MDDAFGRILKTLDDQKLRENTLIIFTSDNGPAITRRHPHG 292 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKP--E 345 + L K A Y+ R+P I++ P+ + D PV +D+LPT+ A+ADI P Sbjct: 293 SSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGVDILPTLCAVADIPAPTDR 352 Query: 346 ILPGENILAVKEPRGVMV------EFNR 367 +L G NIL + E + ++ +FNR Sbjct: 353 VLDGTNILPLLEGKPILRKKPLYWQFNR 380 >UniRef50_Q5UEW6 Probable phosphonate monoester hydrolase n=1 Tax=uncultured alpha proteobacterium EBAC2C11 RepID=Q5UEW6_9PROT Length = 512 Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust. 
Identities = 130/481 (27%), Positives = 218/481 (45%), Gaps = 61/481 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD Q + +G + T N+D L EG F + + SPVC +RA +F G Sbjct: 22 KPNIVLIMTDQQRADTIGALGSPWMQTPNLDRLVNEGTSFTNCFVTSPVCVSSRASIFLG 81 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD--------YFGTGEC 114 Y + + +TN N ++ D+GYH IGK H++ +D +F + Sbjct: 82 GYPHTTNVYTNFETWEPNWV---KWLSDSGYHCVNIGKMHINPYDAKGGFHQRFFVENKD 138 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWR----NGLNSVEDLQANHIDE--TFTWAHRISN 168 P + D+ + + +K + + R + V D + + FTW Sbjct: 139 RPLFLEDH----ERAIYDEWDKALKVRRLEKPSRYTRVRDNRDAFLKNLGCFTWEIDDDM 194 Query: 169 RAVDFLQQPA-------RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 +F+ A +A+ PF + + + PH P+ ++L Y D + +Q + Sbjct: 195 HPDNFVGNTASWWLNDRKAESPFFLQIGFPGPHPPYDPTGDFLSIYKDTKFPHRAASQRE 254 Query: 222 LANKPEHHRLWAQAM-----------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALT 270 L +PE H+ Q+M + DD H Y A +D Q+G++++ L Sbjct: 255 LEKQPEMHKQLRQSMIDFNIDSVAWRENLTDDDIQLLHRYYSANVSMIDCQVGQILSTL- 313 Query: 271 PEQR---ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP-- 325 EQR +NT VI+ SDH + +G H I K MYD +TR+PLI +P+ + Q Sbjct: 314 -EQRGYLDNTIVIFCSDHADALGEHGHIQKW-TMYDCVTRVPLIFWAPKTVKMQHQCADL 371 Query: 326 VSHIDLLPTMMALADIEKP---EILPGENILAV-----KEPRGVMVEFNRYEIEHD---S 374 V +D+ PT++ A+IE P E L +L + P + E+ E+ D S Sbjct: 372 VQLMDIAPTILNFANIEPPHNWEALALNKMLKTGCWDDQRPDHKLREYVYAELGRDHIQS 431 Query: 375 FGGFIPVRCWVTDDFKLVLNLFT-SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 ++ +R + +K V+ + ELYD RNDP+E NL +D ++ D R + +L Sbjct: 432 GAEYVIMR--RDEHWKYVIYPGNDTGELYDIRNDPHETVNLWNDPQYLDQRKEATIEILS 489 Query: 434 Y 434 + Sbjct: 490 W 490 >UniRef50_B9XGI2 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XGI2_9BACT Length = 515 Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust. 
Identities = 124/454 (27%), Positives = 200/454 (44%), Gaps = 69/454 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN LF+M D A + +G Y K T NIDS+A G+RF++ + + +CTP+RA + TG Sbjct: 61 RPNILFIMADDHAAHAIGAYGSKINQTPNIDSIAKAGMRFDNCFVVNSICTPSRAAILTG 120 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD----GHDYFGTGECPPEW 118 Y++ +G N G N T+ + + AGY+T +GKWHL+ G DY+ ++ Sbjct: 121 KYSHINGVTVFNRFDG-NQPTVAKMLQAAGYYTGMVGKWHLESDPTGFDYWNVLPGQGKY 179 Query: 119 -DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D D+ G E EI I++ +++FL+ Sbjct: 180 HDPDFIEMGNRKKIEGYATEI---------------------------ITDLSINFLKNR 212 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH-------- 229 + D+PF ++ + PH P+ ++ + Y D E DD + Sbjct: 213 PQ-DKPFFLMCHHKAPHRPWEPDEKHAKMYEDVTIPEPETFNDDYKTRSSAATEATMRID 271 Query: 230 -----RLWAQAMPSPVGDDGL----YHHPL--YFACNDFVDDQIGRVINALTPEQ-RENT 277 + QA P + + L Y + Y C VDD +GR++ L ENT Sbjct: 272 RDLTPKDLKQAPPPGLAGEALKKWKYQRYIKDYLRCIASVDDNVGRLLKFLDDSGLAENT 331 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPTM 335 VIYTSD G +G H K MY++ R+P I+R P + + ++D PT Sbjct: 332 IVIYTSDQGFFLGDHNWFDK-RFMYEESLRMPFIVRYPNHIKPATVNKDMILNVDFAPTF 390 Query: 336 MALADIEKPEILPGENILAV---KEPR----GVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 + A +E P+ + G +IL + K P+ + + Y +H P ++ Sbjct: 391 LQCAGLEVPKEIQGRSILPLLQGKAPKDWRTSMYYRYYHYPADHR----VQPHYGVRSER 446 Query: 389 FKLV-LNLFTSDELYDRRNDPNEMHNLIDDIRFA 421 +KL+ N E YD + DP+E+ N+ D +A Sbjct: 447 YKLIYFNKINEWEFYDLKRDPHELKNVYADPAYA 480 >UniRef50_C6D448 Sulfatase n=2 Tax=Bacteria RepID=C6D448_PAESJ Length = 511 Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust. 
Identities = 134/500 (26%), Positives = 213/500 (42%), Gaps = 71/500 (14%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + +D Q N +G ++ + L+T N+D L G F AY +P CTP+RA + Sbjct: 1 MKKPNILLITSDQQHWNTLGYFNNE-LSTPNLDRLIKAGTTFTRAYCPNPTCTPSRASII 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGTGECPP 116 TG Y +Q G WT ++ +G F AGY T +GK H +Y P Sbjct: 60 TGQYPSQHGAWTLGTKLLEDRHFVGEDFNSAGYKTALVGKAHFQPLSSTEEYPSLEAYPV 119 Query: 117 EWDADYW--FDGANYLSELTE------------KEISLWRNGLNSVE------------D 150 D + W F+G Y E E + +LW V D Sbjct: 120 LQDLEMWKQFNGPFYGFEHVELTRNHTNEAHVGQHYALWMEEKGCVNWRDYFLPPTGNMD 179 Query: 151 LQANH---IDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 + I E + + I+ R ++Q A D+PF + S+ +PH + P + Y Sbjct: 180 PAITYKWPIPEKYHYNTWIAERTNALMEQYAEEDKPFFLWSSFFDPHPEYLVPEPWDTMY 239 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMP--SPVGDDG------LYHH----------- 248 + + + P H L + P SP + G HH Sbjct: 240 DPDSLTIPDIVPGEHDKNPPHFGLTQEDNPDFSPWAETGNGIHGYRSHHYYEYGEKKKLT 299 Query: 249 --------PLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGA 299 +Y+ +D IG +++ L +NT V++T+DHG G H L +KG Sbjct: 300 DYDKKKLVAVYYGMISMMDKYIGTILDKLEELGIADNTVVVFTTDHGHFFGQHGLQAKGG 359 Query: 300 AMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 Y+D+ R+P I+R P V D S +DL PT ++L+ I P + G + V Sbjct: 360 FHYEDLIRLPFIVRYPGQVPAGVTSDAIQSLVDLAPTFLSLSGIPVPHAITGVDQSEVW- 418 Query: 358 PRGVMVEFNRYEI-EHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLI 415 RG + I E I + +V +K+ + T E++D ++DP+E++NL Sbjct: 419 -RGTASAARDHAICEFRHEPTTIHQKTYVDQRYKITVYYNQTYGEIFDLQDDPSELNNLW 477 Query: 416 DDIRFADVRSKMHDALLDYM 435 DD +A ++S++ LL Y+ Sbjct: 478 DDPAYAALKSEL---LLKYI 494 >UniRef50_UPI0001C36159 sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36159 Length = 497 Score = 144 bits (363), Expect = 9e-33, Method: Compositional matrix adjust. 
Identities = 126/471 (26%), Positives = 199/471 (42%), Gaps = 49/471 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N ++ TD Q + + ++T NID L EG+ F AYT +P+CTP+RA T Sbjct: 8 KKKNIVWFCTDQQRWDTIHSLGNPYIHTPNIDRLVKEGVAFTRAYTQAPICTPSRACFLT 67 Query: 62 GIYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHL------------DGHDY 108 G Y + N N K+ + + D GY +GK HL DG+ Y Sbjct: 68 GRYPRTTKTIFNGNEKFSKDEKLVTKLLSDEGYTCGLVGKLHLTSAEGRVEKRCDDGYSY 127 Query: 109 FGTGECPPEWDADYWFDGAN-YLSELTEKEI-------------SLWRNGLNSVEDLQAN 154 F P + W DG N Y + L EK + + W N + Sbjct: 128 FQYSHHP----HNDWKDGGNDYQNWLNEKGVHWEEIYGGKFMTMATWPPQANPSFSGKQV 183 Query: 155 HIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL 214 + + + +DF++ + EP+L+ V+ +PH P P EY ++ L Sbjct: 184 GVPAQYHQTTWCVEKTIDFIETRRNSGEPWLISVNPFDPHPPLDPPQEYKDRLNVEEMPL 243 Query: 215 GEKAQDDLANKPEHHRL---------WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRV 265 ++ KP H + A+ + S ++ Y+A + +DDQ GR+ Sbjct: 244 PLWEDGEMEGKPPHQQKDVIQGGQDGQAEPIGSLTEEEKRERFRDYYAEIELIDDQFGRL 303 Query: 266 INALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--V 322 ++ L RE+T +I+ SDHGEM G H L KGA Y+ + +PLII P ++ Sbjct: 304 LSYLDQTGLREDTIIIFMSDHGEMSGDHGLYWKGAYFYEGLVHVPLIISCPSIFKQGFLC 363 Query: 323 DTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFN---RYEIEHDSFGGF- 378 D V +D+ PT+M A +E P + G + + F E H G Sbjct: 364 DALVELVDIAPTLMEAAGLEVPYFMQGRSFYDILTGEADPHHFKDAVYSEFYHCLRGTHE 423 Query: 379 -IPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKM 427 I + +KL++ ELYD D NE HNL D + +++++ Sbjct: 424 DIDATMYYNGRYKLIVYHGKEFGELYDHETDQNEFHNLWDKPEYEALKTEL 474 >UniRef50_B1KD82 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD82_SHEWM Length = 526 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. 
Identities = 132/500 (26%), Positives = 214/500 (42%), Gaps = 100/500 (20%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+MTD N++G + T N+D LA+EG + +AYT +P+C+P+R FT Y Sbjct: 24 NLLFIMTDEMKWNVMGVAGHPVVKTPNLDRLASEGTYYKTAYTVAPICSPSRRSFFTSRY 83 Query: 65 ANQSGPWTNNVAPGKNIST--MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + G N+ N + K GY T GK H PEW D+ Sbjct: 84 THVHGVIDNSKQALANDGEVDLQTILKHQGYRTAISGKLHFY-----------PEWH-DW 131 Query: 123 WFD--------GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI-------- 166 FD G N L T ++ + ++G ++ + ++ + H + Sbjct: 132 GFDEFWARSSEGPNRLE--TYRQYMVAKHGDDAFKPIKGSVTYPKDPLGHDLGRYRFGKE 189 Query: 167 -------SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY-----------A 208 +++A+D+L + + +PF + +SY+EPH P+ Y Y A Sbjct: 190 DFETYWLTDKALDYLAR--KEKKPFFLFLSYNEPHSPYMVTEPYASMYDPKTLPVPVIPA 247 Query: 209 DFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINA 268 E + + K H Q M Y VDD +GRV++ Sbjct: 248 SAKAERKVALEKKIKGKSRHLIDDEQMMRDLTAQ--------YLGHVSNVDDNVGRVLSY 299 Query: 269 L-TPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-------- 319 L + +NT V++T+DHG M+G H KG M++ +RIPLIIR+ + R Sbjct: 300 LDSSGLADNTIVVFTADHGNMLGDHGKWFKGV-MHEGSSRIPLIIRAGKHTRYAKVMNRG 358 Query: 320 RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFI 379 R V+ V ID++PT++ + DI+ P + GE++L++ + NR + F Sbjct: 359 RVVEQVVESIDVMPTLLEMLDIKAPRGMQGESLLSLTAGEAKNWK-NRAFSQRSDF---- 413 Query: 380 PVRCWVTDDFKLVLNLFTSD----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 ++ DFKL++ ELY+ NDP E HNL + +Y Sbjct: 414 ---MFIEGDFKLIMPAKAGKKGKLELYNLANDPLENHNLA--------------GMTEYQ 456 Query: 436 DKIRDPFRSYQWSLRPWRKD 455 K+ +S Q S++ W+ D Sbjct: 457 AKV----KSMQQSIQVWQAD 472 >UniRef50_A3I0S5 Putative sulfatase yidJ n=1 Tax=Algoriphagus sp. PR1 RepID=A3I0S5_9SPHI Length = 491 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 123/460 (26%), Positives = 216/460 (46%), Gaps = 57/460 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +FV+ D VG + T N++ LA E + F +A T VC P RA TG Sbjct: 38 PNIVFVLADQWRAQEVGYAGNDQIITPNLNKLATESLIFENAVTTMAVCAPWRASFLTGQ 97 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH----DYFGTGECPPEWD 119 Y G + N+ T +K+AGY T YIGKWHL+GH D F + P D Sbjct: 98 YPLTHGVFYNDKPLPNEAYTFAEIYKEAGYQTGYIGKWHLNGHARGADPFSARDQPVPKD 157 Query: 120 A----DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 DYW +E++ N ++ H+ E + ++ A+ ++ Sbjct: 158 RRQGFDYW----------KVREVTHNYNNSFYFDEEDKKHVWEGYD-VFPQTDSAISYIS 206 Query: 176 QPARADEPFLMVVSYDEPHHP-FTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 + ++PF++++SY PH P F+ P EY + Y +L N PE ++ A+ Sbjct: 207 K--NKEKPFVLMLSYGPPHDPYFSAPKEYQDLYDAGTLKLR-------PNVPEEYQDSAR 257 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK 293 + + Y+A +D IG ++ + +NT ++TS+HG+M+ + Sbjct: 258 RVLAG-----------YYAHATAIDKAIGDLLEGIEKAGVADNTIFVFTSEHGDMLMSRG 306 Query: 294 LISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADIEKPEILPG--- 349 ++ K +D+ ++P++IR P + E R+V P+ D+LPT++ L+DI P+ + G Sbjct: 307 VVKKQRP-WDEAIKVPMLIRYPGKLESRRVLDPIGTPDILPTLLGLSDIPIPKSIEGKDF 365 Query: 350 -ENILAVKEPRG------VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELY 402 +N+L+ K+ + V F+ ++ ++ G R T + V +L LY Sbjct: 366 SKNLLSGKDLENDATLIMLPVPFHEWQFKN----GGREYRGIRTRRYTYVKDLLGPWLLY 421 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 D NDP +++NL++ + ++ K+ +L + +I D F Sbjct: 422 DNENDPFQLNNLVNQSEYNSLQKKLEKSLSKKLKEINDKF 461 >UniRef50_B5GLL7 Sulfatase n=1 Tax=Streptomyces clavuligerus ATCC 27064 RepID=B5GLL7_STRCL Length = 599 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. 
Identities = 139/511 (27%), Positives = 214/511 (41%), Gaps = 112/511 (21%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N LF+MTD T+ +G Y +T ++D+LAA G RF+ YT + +CTPARA L TG Sbjct: 19 RRNILFLMTDQHRTDTLGAYGAPGGHTPHLDALAASGTRFDRFYTPTSICTPARASLLTG 78 Query: 63 -------IYANQSGPWTNNVAPGKNIS----TMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 + AN + NV ++++ T DAGY IGKWH G Sbjct: 79 YAPFRHRLLAN----YERNVGYREDLADDQFTFTAPLADAGYRLGLIGKWHT------GV 128 Query: 112 GECPPEWDADYWFDGA------------NYLSELTEKEISLW--------RNGLNSVEDL 151 P AD+ F+G +YL+ L ++ + R L Sbjct: 129 RRVP----ADFGFEGPYLPGWHNPVTHPDYLAYLDAHDLPPYAVTDPVRTRLPGGGPGPL 184 Query: 152 QANHIDETF--TWAHRISNRAVDFL----QQPA-----------------RAD------- 181 A + + T+ H ++ RA++ L ++P RAD Sbjct: 185 LAARLRQPVEATFEHYLATRAIELLHRWAEEPGTDGEHRTNGEPGTDEGHRADGEPGANG 244 Query: 182 --------EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR--- 230 PF + V + PH P+ P Y + EL + KP R Sbjct: 245 EHRGDTSPRPFFLAVHFFGPHLPYLLPDAYYDLVDPATVELPASLAETFEGKPPVQRNYS 304 Query: 231 -LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEM 288 LW + GL Y +D QIGR+ A+ ++T V++T+DHGE Sbjct: 305 ALWGHDSLTETEYRGLI--AAYHGYVALIDLQIGRIRRAVDELGLTDSTAVVFTADHGEF 362 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQG----ERRQVDTPVSHIDLLPTMMALADIEKP 344 G+H++ KG AMY+D+ RIP +++ P RR+ T D+ T++ LA + Sbjct: 363 TGSHRMHDKGPAMYEDVYRIPGLLQVPGAPAGVARREFAT---LTDVTATLLDLAGCDPA 419 Query: 345 EILPGENILAVKE--------PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 + G ++L + PR V+ EF+ + + P R + +K+V+N Sbjct: 420 PAVDGRSLLPLTGGGPAPEDWPREVVGEFHGHHFPY-------PQRMLRDERYKIVINPE 472 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 + ELYD DP+E+ N DD DVR + Sbjct: 473 SVCELYDLDRDPHELRNAWDDPALRDVRDHL 503 >UniRef50_C0S8M2 Choline sulfatase n=8 Tax=Eurotiomycetidae RepID=C0S8M2_PARBP Length = 619 Score = 142 bits (358), Expect = 3e-32, Method: Compositional matrix adjust. 
Identities = 118/439 (26%), Positives = 195/439 (44%), Gaps = 30/439 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++P+ L++M D A ++ Y P+ T N++ LA EG+ F SAY SP+C P+R + Sbjct: 5 EKPSILYIMADQMAAPLLSLYDENSPIKTPNLERLAREGVCFESAYCNSPLCAPSRFSMV 64 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG +++G + N + T Y + GYHT GK H G D E + Sbjct: 65 TGQLPSKTGGYDNASDLPADTPTYAHYLRKEGYHTALAGKMHFVGPDQLHGYEQ--RLTS 122 Query: 121 DYWFDGANYLSELTEKEISL-WRNGLNSVED----LQANHIDETFTWAHRISNRAVDFLQ 175 D + + E E+ W + ++SV + ++ N +D HR + D + Sbjct: 123 DIYPGDYGWTVNWDEPEVRPDWYHDMSSVLEAGPCVRTNQLDYDDEVIHRATQYLYDHTR 182 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR----- 230 RA +PF + VS PH P+ EY + Y D L + + H + Sbjct: 183 H--RAGQPFCLTVSMTHPHDPYAMTKEYWDLYEDIDIPLPKTPVIPHDEQDPHSQRVLKS 240 Query: 231 --LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE 287 L+ + +P L YFA +VD Q+GR++ L + +NT V++T DHG+ Sbjct: 241 IDLFGKEIPEQC---ILAARRAYFAACSYVDSQVGRLMATLKACDLADNTIVVFTGDHGD 297 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADIEKPEI 346 M+G L K Y+ R+P+ + +P + + ++V VS +DLLPT A+A E Sbjct: 298 MLGERGLWYK-MVWYEHAARVPMFVHAPGRYKPKRVKENVSTMDLLPTFAAMAGGEINNH 356 Query: 347 LPGENI-----LAVKEPRGVMVEFNRYEI--EHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 LP + + L + R + + E+ + G PV +K + + Sbjct: 357 LPIDGVSLMPYLLDSDSREAVSGLKTDTVIGEYMAEGTLAPVVMIRRGPWKFIYSPIDPP 416 Query: 400 ELYDRRNDPNEMHNLIDDI 418 L++ + DP E NL I Sbjct: 417 MLFNVKRDPTEAVNLASGI 435 >UniRef50_D1AWE3 Sulfatase n=3 Tax=Fusobacteriaceae RepID=D1AWE3_STRM9 Length = 467 Score = 141 bits (356), Expect = 6e-32, Method: Compositional matrix adjust. 
Identities = 111/383 (28%), Positives = 190/383 (49%), Gaps = 54/383 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N +F+ D N G + + T NID + E + F +A + P+C+P+RA + T Sbjct: 3 KKYNLIFLFADQWRRNAAGFVGTEDVITPNIDEFSKESLVFTNAVSTGPLCSPSRASILT 62 Query: 62 GIYANQSGPWTN------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G Y G WTN +V + T+ K+ Y+ YIGKWHLD + E P Sbjct: 63 GTYPATHGVWTNCKTGLYDVWLKEESITITDVLKENDYYIGYIGKWHLDNPEE-NVEEKP 121 Query: 116 P----EWDA-----------DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 +WDA DYW+ Y + L W N N +E ID+ + Sbjct: 122 KSGARDWDAYTPPGKKRHGIDYWYSYGAYDNHLKP---HYWENSHNMIE------IDK-W 171 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD 220 + H +++A++FL + D PF + +S++ PH P EKY D Y + + D Sbjct: 172 SVEHE-TDKAIEFLDK--NKDNPFALFLSWNPPHTPLDL---VPEKYIDLYKDKKLRVSD 225 Query: 221 D--LANKPEHHRLWAQAMPSPVG--DDGLYHHPL--YFACNDFVDDQIGRVINALTPEQ- 273 + L N +H ++MP + +DG + L Y+A +D+ GR+I+ L Sbjct: 226 NVILNNVIDH----TESMPEALNFTEDG-FQDALRKYYAAISGIDEHFGRLIDYLKENNI 280 Query: 274 RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLL 332 EN+ ++ T+DHGEM+ +H L SK Y++ +P +I+ G+ R + ++ +S +D++ Sbjct: 281 YENSIIVLTADHGEMLCSHGLWSK-HVWYEESIGVPFMIKF--GDNRGITESVLSGVDIM 337 Query: 333 PTMMALADIEKPEILPGENILAV 355 PT+++L D++ P+ + G+++ V Sbjct: 338 PTLLSLLDLKIPKTVEGKDLKEV 360 >UniRef50_Q5UEY3 Probable sulfatase n=1 Tax=uncultured alpha proteobacterium EBAC2C11 RepID=Q5UEY3_9PROT Length = 512 Score = 141 bits (356), Expect = 6e-32, Method: Compositional matrix adjust. 
Identities = 127/500 (25%), Positives = 212/500 (42%), Gaps = 75/500 (15%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPNF+F+ +D Q + G + G+ L T ++D L EG+ F + T SPVC PARA + Sbjct: 1 MKRPNFVFITSDQQRGDCYG-FMGRKLKTPHLDQLRREGMHFRNCITPSPVCQPARAAIL 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMG--RYFKDAGYHTCYIGKWHLDGHDYF--------- 109 TG +G N + +G + Y T IGK H F Sbjct: 60 TGKLPKTNGVKDNGIDLRAERGELGFAAALTNVDYETALIGKAHFATTQTFSPQTSVECK 119 Query: 110 -GTGECPPEWDADYWFDGANYLSELT-----------------------------EKEIS 139 G+ + PP W+ Y G ++ LT E Sbjct: 120 TGSADYPPNWNGPYM--GFQHVELLTQGHWHKIRPPVIPPSGQHYEDWFFNVVGKESAFE 177 Query: 140 LW----RNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHH 195 LW R G+ + + A+ + + + +++R++ +L R PF + +S+ +PHH Sbjct: 178 LWKSETRKGVGAAQTW-ASALPVAWHSSTWVADRSIHWLSN-RRESNPFCLWISFPDPHH 235 Query: 196 PFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHH------- 248 PF CP + + +L + + DL ++P HR ++ P + D L Sbjct: 236 PFDCPEPWNLLHNPEDVDLPKFLEKDLNDRPWWHRRSLESEPD-LSDPVLKRFRKQGSRM 294 Query: 249 ------------PLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLI 295 Y+ +D +GRVI L + + T +IYTSDHG+ MG L Sbjct: 295 PDQSEAQLREMTANYYGMISLIDHNVGRVIACLREKGILDETIIIYTSDHGDHMGERGLY 354 Query: 296 SKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 KG +YD + + +I+R P R + P++ +D+ T A P+ ++ + Sbjct: 355 LKGPMLYDSLINVGMIVRGPGVAAGRSENAPITTLDVGATFCDYAGTSLPKEAQSVSLKS 414 Query: 355 VKEPRGVM--VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEM 411 G ++ +++ G + +R T KL + L + D ELYD ++D NEM Sbjct: 415 CFGGAGSPHDAVYSEWDVAPSRCGVRLDLRTVHTGKAKLTIELQSGDGELYDLQSDGNEM 474 Query: 412 HNLIDDIRFADVRSKMHDAL 431 NL ++ A++++ M L Sbjct: 475 INLWNEPLAAELQNHMTKLL 494 >UniRef50_UPI00016BFE17 putative sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFE17 Length = 454 Score = 141 bits (355), Expect = 6e-32, Method: Compositional matrix adjust. 
Identities = 122/462 (26%), Positives = 201/462 (43%), Gaps = 45/462 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN ++V+TD + +G + T N+D++A EG+ F +A +P C P R L T Sbjct: 3 NKPNVIWVLTDQMRASAMGFMGDANVRTPNLDNMAREGVAFVNAVAGTPWCCPFRGALLT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+Y +Q+G A +T+ F+ AGYHT Y+GKWHLDG + T PPE Sbjct: 63 GLYPHQNGVTQTPQALDPATATITAPFRTAGYHTAYVGKWHLDGSNSV-THYIPPERRGG 121 Query: 122 Y-WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + +F G Y + + E ++ N L D ++N + LQQ Sbjct: 122 FDYFMG--YENNNNQNESYVFGNDCERPTRLDGYETDA-------LTNIFIKHLQQHTSQ 172 Query: 181 D--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D +PF V+S PH P+ P+ E YY + P +L P Sbjct: 173 DSYQPFFAVLSVQPPHDPYVPPLYTGE--GKIYY-----------HNPADIKLRPNVPPG 219 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISK 297 D+ Y++ + +D +G++ AL + T++I+ SDHG+ + +H SK Sbjct: 220 RWSDEARMDLAGYYSMIENIDTNVGKLXMALKQMNIDRETYIIFMSDHGDCLNSHGQWSK 279 Query: 298 GAAMYDDITRIPLII-RSPQGERRQV---DTPVSHIDLLPTMMALADIEKPEILPGENI- 352 + +++ RIP I+ R Q D ++HID+ PT + L I P+ + G + Sbjct: 280 SSP-WEEAIRIPFIVCRVGTNYHMQSGIRDAVINHIDIAPTTLGLCGITPPQNMVGFDYS 338 Query: 353 -LAVK-------EPRGVMVEFNRYEIEHDSFGGFIPVRCW----VTDDFKLVLNLFTSDE 400 L ++ E +G + + + + + W D +K V Sbjct: 339 PLCIQKSQPEYHELQGAIPDSAYLQQIPRKYHAHTANKAWRAVVTRDGYKYVCYPHQDVM 398 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 L++ +DP EM NL D ++ K+H L Y+ D F Sbjct: 399 LFNLNDDPYEMANLCHDSTSQVIKEKLHSLLQKYIVDTGDTF 440 >UniRef50_Q01PN7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01PN7_SOLUE Length = 496 Score = 141 bits (355), Expect = 6e-32, Method: Compositional matrix adjust. 
Identities = 134/453 (29%), Positives = 206/453 (45%), Gaps = 65/453 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D + +G + ++T N+D LAA G+RF +AY+ +P CTPARAGL TG Sbjct: 24 RPNILLLMADQWRADCLGAAGNRAIHTPNLDQLAASGVRFTNAYSATPTCTPARAGLLTG 83 Query: 63 IYANQSGPWTNNVAPGKNIST-----MGRYFKDAGYHTCYIGKWHL----DGHDYF---- 109 + PW + + + M R +DAGY+T IGK H + H Y Sbjct: 84 L-----APWNHGMLRYAEVGARYPVEMPRALRDAGYYTAAIGKLHYHPQRNVHGYHQALL 138 Query: 110 ---GTGECPPEWDADY--WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 G E P++ +DY WF + L L N ++ + T TW Sbjct: 139 DESGRIES-PDFRSDYRSWF--WSQAPNLDPDATGLGWNDFDARPYTLPERLHPT-TWTG 194 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + + ++ Q+ EPF + VS+ PH P+ P +Y D L A A+ Sbjct: 195 QTAASWIETYQR----SEPFFLKVSFARPHSPYDPPDRLWRRYQDA--PLPPAAVAGWAS 248 Query: 225 KPEHHRLWAQAMPSP---VGDDGLYH----HPLYFACNDFVDDQIGRVINALTPEQ-REN 276 R A++ P P GD G Y+ FVD+QIGR++ +LT + Sbjct: 249 -----RYAARSGPQPDAWHGDLGAEQVRRSRQGYYGSVTFVDEQIGRIMESLTRRGLLDQ 303 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG---ERR--QVDTPVSHIDL 331 T +++ SDHG+M+G H L K A Y +R+P ++R P+G RR +D V D+ Sbjct: 304 TLIVFFSDHGDMLGDHNLWRKSYA-YAGSSRVPFLVRWPEGMLTARRGGTIDQMVELRDV 362 Query: 332 LPTMMALADIEKPEILPGENIL---AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCW---V 385 LPT + A L G+++L A K P F ++EH + P W Sbjct: 363 LPTFLDAAAAAPARPLDGQSLLPLIAGKSP--AWRPF--LDLEHGVC--YSPDNHWNALA 416 Query: 386 TDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDD 417 +K + + ++L+D + D +E+H+L D Sbjct: 417 DQQYKYIFHARDGREQLFDVQRDAHELHDLSGD 449 >UniRef50_Q7UJ67 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UJ67_RHOBA Length = 505 Score = 141 bits (355), Expect = 6e-32, Method: Compositional matrix adjust. 
Identities = 129/457 (28%), Positives = 195/457 (42%), Gaps = 70/457 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN LF+ D A+ + GCY T NID LAA G+ F AY P+C P RA + T Sbjct: 44 SKPNVLFIAVDDLASAL-GCYGDVVAKTPNIDRLAATGVCFRRAYNQLPLCNPTRASVMT 102 Query: 62 GIYANQSGPWT-----NNVAPGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG--E 113 G+ +Q + + P N+ T+ + F+ AGY +GK +H + GT + Sbjct: 103 GLRPDQIKVYDLDRHFRDEVP--NVITLSQAFQQAGYFAARVGKIYHYNVPASIGTDGFD 160 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 PP W+ G + E R ++ L A+ DE T I+ A+ Sbjct: 161 DPPSWNQTVNPKGRDKDDEHLIFNAEPHRKISGALSWLAADGEDEEQTDG-MIATEAIRI 219 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 +++ + DEPF + V + PH P+ P +Y + Y L D + P A Sbjct: 220 MRE--KKDEPFFLGVGFFRPHTPYVAPKKYFDMYPLESLRLPFAPAGDREDIPTA----A 273 Query: 234 QAMPSPVGDDGLYHHPL------YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG 286 A PV + GL L Y+AC F+D Q+GR+++AL + +NT V++ SDHG Sbjct: 274 FAHNCPVPNYGLDETTLLKATQAYYACVSFIDAQVGRLLDALEEQGLADNTIVVFWSDHG 333 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADIEKPE 345 +G H + + ++++ + PLIIR P Q + V +D+ PT+ +A IE P Sbjct: 334 YHLGEHNGVWQKRTLFEEGAKAPLIIRDPSQLGLGSCNRIVEFVDIYPTLTDVAGIESPS 393 Query: 346 ILPGEN----------------ILAVKEPR---------GVMVEFNRYEIEHDSFGGFIP 380 L G + I V P G + +RY Sbjct: 394 GLAGRSLKPLLNDPVANWNGTAITQVLRPADDRLPEQVMGCSIRTHRYRYTE-------- 445 Query: 381 VRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDD 417 W + ELYD ++DPNE HNL D Sbjct: 446 ---WAEGRHGV--------ELYDHQSDPNEFHNLALD 471 >UniRef50_A6DKS7 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKS7_9BACT Length = 515 Score = 141 bits (355), Expect = 6e-32, Method: Compositional matrix adjust. 
Identities = 131/494 (26%), Positives = 213/494 (43%), Gaps = 82/494 (16%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF+ +D AT VG Y +T NID +A+EGIRF+ + +C P+RA + TG Sbjct: 22 PNILFIFSDDHATQAVGSYGSIINSTPNIDRIASEGIRFDRCLVTNAICGPSRATILTGK 81 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGTGECPPEWD 119 Y++ +G + N++ T + + AGY T IGKWHL G D+F E Sbjct: 82 YSHLNGFYKNDMYFDGRQITFPKLLRQAGYQTAVIGKWHLASLPTGFDHF-------EVI 134 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 Y G Y + RNG + + I+ +++L+ Sbjct: 135 TGYGGQGKYYHPVMN-------RNGEPTKHRGYTTEV---------ITKLNMEWLKNQRD 178 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKY--------ADFYYELGEKA-----QD------ 220 ++PF++++ + PH + +Y+ + A+ + + KA QD Sbjct: 179 PNKPFMLMMQHKAPHRAWLPSPKYMNAFKDKKFPKPANLHTDYQGKASHVKKQDMMIKDS 238 Query: 221 -----------------DLAN----KPEHHRLWAQAMPSPVGDDGLYHHPL---YFACND 256 DLAN E + +A+A S + Y C Sbjct: 239 MNPGDLKLTPPKYLDGADLANWHKAYDEENAAFAKAKLSGKALRSWNYQRYIRDYVRCVQ 298 Query: 257 FVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP 315 +DD IG V+N L ENT +IY+SD G +G H K MY++ R PL++R P Sbjct: 299 SIDDSIGEVLNYLDESGLAENTLLIYSSDQGFFLGEHGWFDK-RFMYEEALRTPLVMRWP 357 Query: 316 -QGERRQVDTPV-SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHD 373 + + VD+ + S++D T + +A ++ P + G ++L + +G E R + Sbjct: 358 GKIKAGSVDSHITSNLDFAQTFLEVAGVKVPAEMQGASLLPIM--KGQQPENWRESFYYH 415 Query: 374 SFG----GFIPVRCWVTDDFKLVLNLFTSDE--LYDRRNDPNEMHNLIDDIRFADVRSKM 427 +G + C VTD +++ +T+DE LYDR+NDP E N D F + M Sbjct: 416 YYGYPDWHLVQKHCGVTDGRYKLIHFYTTDEWELYDRKNDPEENINRASDPEFKSILQNM 475 Query: 428 HDALLDYMDKIRDP 441 L +++ P Sbjct: 476 RKKLSQQRIQLKVP 489 >UniRef50_Q7UFA5 Putative sulfatase yidj n=1 Tax=Rhodopirellula baltica RepID=Q7UFA5_RHOBA Length = 527 Score = 141 bits (355), Expect = 7e-32, Method: Compositional matrix adjust. 
Identities = 126/482 (26%), Positives = 201/482 (41%), Gaps = 70/482 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCY-------------SGKPLNTQNIDSLAAEGIRFNSAYTC 48 ++PN L + TD +GCY G + T +IDS+AA G S Y Sbjct: 56 EQPNVLIIQTDEHNFRTLGCYRDTLPIEEAEIWGKGAVVETPSIDSIAARGAICTSFYAT 115 Query: 49 SPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 SPVCTP+RA FTG Y +G + N+ ++ T + GY T Y GKWHLDG Sbjct: 116 SPVCTPSRAAFFTGRYPQNTGAYQNDRPLRGDMVTFAEVLRRDGYATGYAGKWHLDGPG- 174 Query: 109 FGTGECPPEW--DADYWFDGANYLSELTEKEISLWRNGLNSVE--------DLQANHIDE 158 P+W D + F Y+ + + NG SV + N DE Sbjct: 175 ------KPQWGPDRQFGFSDNRYMFNRGHWKKFDFENGQPSVAATNKKGQPNYDLNGADE 228 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD--------F 210 + +RA DF+++ + EPF +S +PH P T Y + + F Sbjct: 229 KTFSTDWLCDRAADFIRE--HSQEPFCYHLSLPDPHGPNTVRQPYDTMFENMPVRPPMTF 286 Query: 211 YYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIG-RVINAL 269 + + N+ R A+ M YF +DD +G + Sbjct: 287 QLDGDQPGWLPATNRNSQQRFNARLMTQ------------YFGMVRCIDDNVGMLLSLLD 334 Query: 270 TPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVS 327 + T V++TSDHG++ H ++KG Y+ ++P+II +P ++D + Sbjct: 335 ELSLTKRTVVVFTSDHGDLCYEHGRLNKGNP-YEGSAKVPMIIAAPGLISAGLRIDQAMG 393 Query: 328 HIDLLPTMMALADIEKPEILPGENI--------LAVKEPRGVMVEFNRYEIEHDSFGGFI 379 +D PT+++L E P G ++ + +E V F R S +I Sbjct: 394 TVDFAPTLLSLLRKEVPAGTQGRDLSEWFRNIETSDEESHRNSVTFLR---AASSKAAWI 450 Query: 380 PVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 VTD +KL+++ L+D + DP+E N I + +++ +LL Y D ++ Sbjct: 451 AA---VTDRYKLIVSADDQPWLFDLKEDPHETTNHIGKPENQTIAARLARSLLRYGDLMK 507 Query: 440 DP 441 DP Sbjct: 508 DP 509 >UniRef50_B6A548 Choline-sulfatase n=1 Tax=Rhizobium leguminosarum bv. trifolii WSM2304 RepID=B6A548_RHILW Length = 503 Score = 141 bits (355), Expect = 7e-32, Method: Compositional matrix adjust. 
Identities = 123/458 (26%), Positives = 207/458 (45%), Gaps = 38/458 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN LF+ D + Y N++ +A G+ F +AY P+C P+R + Sbjct: 5 ENPNILFIQVDQLTAASLSAYGDTVCRAPNLERIADTGVVFETAYCNFPLCAPSRFSMAA 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE---CPPEW 118 G + G + N +I T Y + AGY T GK H G D F E P + Sbjct: 65 GQLCSTIGAYDNAAEMPASIPTYAHYLRAAGYQTALSGKMHFIGPDQFHGFEKRLTPDLY 124 Query: 119 DADY-WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 AD+ W N+ +E +++ + R L S ++ ID + ++ +A+ L Sbjct: 125 PADFSWV--PNWGNE-GKRDTNDTRAVLISGICERSVQID----FDENVTFQAIQHLYNI 177 Query: 178 ARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK---PEHHRLW 232 AR+D+ PF + VSY PH P+ C E+ + Y ++ A D L+ + P RL Sbjct: 178 ARSDDKRPFFLQVSYTHPHEPYLCRKEFWDLYEGV--DVPMPAVDALSEQEHDPHSVRLL 235 Query: 233 AQ-AMPSPVGDDGLYHHP--LYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEM 288 AM DG Y+ ++D IG++++ L RENT +++ SDHGEM Sbjct: 236 KDFAMLDVRFADGDIQRARRAYYGSISYIDSMIGQILDTLEAIGARENTAIVFASDHGEM 295 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALAD----IEKP 344 +G + K ++ R+PL++ +P + ++V VS +DLLPT+M LA + Sbjct: 296 LGERGMWFK-KHFFEAALRVPLLLNAPWIKPQRVSETVSLVDLLPTLMGLATGRVWRSET 354 Query: 345 EILPGENILAV-----KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 E L G+++ EP + F Y E +P+ +KL+ + + Sbjct: 355 EELEGQDLTGFLDREDHEPSRAV--FAEYLAEATP----VPIFMVRKGRYKLISSSHDGN 408 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 L+D DP E+ NL +A++ +++ + D D+ Sbjct: 409 LLFDLMADPKELQNLAGHTDYAEIEARLLKIVADKWDE 446 >UniRef50_UPI00016BFAFE putative sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFAFE Length = 462 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. 
Identities = 126/493 (25%), Positives = 217/493 (44%), Gaps = 84/493 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN +FV D + +GC K +T N+D+LAA G F+ AYT P+C P RA + Sbjct: 1 MKKPNVIFVYPDQLRYDALGCNGNKVASTPNLDALAAAGANFDEAYTSYPLCCPFRASIM 60 Query: 61 TGIYANQSGPWTNNVAPGKNI-STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG+Y +++G ++N+ K++ + + AGY T ++GKWHL+G F E P+ Sbjct: 61 TGLYPHKNGMYSNHYPLRKDLPHYLPKXMNKAGYKTAWVGKWHLNGGRKF---ERVPK-- 115 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI-----SNRAVDFL 174 +YW ++ + G + ++ L + D+T + + + + +DF+ Sbjct: 116 -EYWCGFEEFIG---------YGRGHHYIDSLYYKNEDDTPYKSKKYEPIYQTEQLIDFI 165 Query: 175 QQPARADEPFLMVVSYDEPHHPFTC-PVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 + ++PF+ ++ Y PH P P E KY+ EL + +D K + + A Sbjct: 166 DRAVSEEKPFMGMICYGLPHPPVEMQPEESKNKYSASEIELPDTVPEDTKEKAK--KYIA 223 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 Q Y+ + VD +IG+++ L + ++T I SDHG+M G + Sbjct: 224 Q----------------YYGMVEVVDTEIGKLVQHLKAKGIFDDTIFIVASDHGDMCGEY 267 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH-----IDLLPTMMALADIEKPEIL 347 + K + Y +PLII P + +T + H ID+ PT++ L +E PE + Sbjct: 268 GMFEK-SIFYSSAAHVPLIISYPN--MIKPETKIDHLVDPLIDITPTILDLCGLEVPETM 324 Query: 348 PGENI--LAVKEPRGVMVEFNRYEIEHDSFGGFIPV------------------RCWVTD 387 G ++ LA +F Y+I IP+ R + T Sbjct: 325 DGYSMKTLAQTGEDKTFRDFVYYQI--------IPLPEALCDQMDKPDRKPYAERGFRTK 376 Query: 388 DFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQW 447 D V ++D + + E N +D+ ++ + L + M I D W Sbjct: 377 DVLYVEKENVPFAVFDYQREKKEFMNCVDNFKYYPQVKEYRTRLANIMGDIGD-----SW 431 Query: 448 SLRPWRKDARPRW 460 LR R+D P + Sbjct: 432 RLR--REDLPPEF 442 >UniRef50_UPI00016C0B77 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0B77 Length = 541 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. 
Identities = 136/472 (28%), Positives = 203/472 (43%), Gaps = 76/472 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L V+ D Q + +G + T N D LA EG F A++ +PVC PAR L TG Sbjct: 2 KPNILIVLADQQRYDTIGELGFNHMITPNFDRLAKEGCSFTYAHSHNPVCMPARHDLLTG 61 Query: 63 IYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWH---LDGHDYFG---T 111 + G + N A GK I T+ R GY T IGK H + H +G T Sbjct: 62 MTGRAHGYFQN--AGGKPMKDYGIPTLPRILSQXGYRTASIGKMHFFPVKEHHGYGDRLT 119 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQ-----ANHI-------DET 159 E P+ L E + + L NG V+++ A H DE Sbjct: 120 MEEIPK------------LLEDDDYAMFLKANGDGHVQNIHGVRPLAYHTPQQSLVKDEN 167 Query: 160 FTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 + A + N+ +++L+Q D PF + V Y +PH P+ P +Y Y Sbjct: 168 YETA-WVENQTINWLEQ--NGDNPFFLFVGYIKPHPPWNIPAKYQGIYQXAAIPEAIPKA 224 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDG-----LYHHPLYFACNDFVDDQIGRVINALTPEQR 274 P H+ + GDD H Y+ VD+ G++I L + Sbjct: 225 RYYPEDPNHNSWY--------GDDDSNXQLRKHQEAYYTAVSMVDESFGKIIEHLRXTGK 276 Query: 275 -ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDL 331 +NT VIYTSDHGEM+G SK + Y+ RIPL++R P+ V +D+ Sbjct: 277 LDNTLVIYTSDHGEMLGDRGYYSK-SVPYESAVRIPLLVRYPEVFEPGTTNEDFVDLLDI 335 Query: 332 LPTMMALADIEKPEI---LPGENI--LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV- 385 LPT + +A ++ P+ L G +I A + R ++ N G+I WV Sbjct: 336 LPTCLDVAGVKYPQCDYKLYGSSIADTAAGKNRQIITASN----------GYINKNRWVM 385 Query: 386 --TDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 D+K + N +E+Y DP+E +N+ID +R D + + A + Y Sbjct: 386 ARNKDYKYIYNFNQGYEEMYYLALDPHETNNIIDSLRGGDAYNMLKTAAIAY 437 >UniRef50_UPI0001C369FC arylsulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C369FC Length = 497 Score = 140 bits (352), Expect = 1e-31, Method: Compositional matrix adjust. 
Identities = 123/460 (26%), Positives = 198/460 (43%), Gaps = 47/460 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N L + TD Q + + + + + T +I+ +AA G F S Y P+C P+R + TG Sbjct: 12 RDNILLITTDQQRFDTINAWGNQSIFTPHINYMAAMGTSFTSCYASCPICVPSRTTIMTG 71 Query: 63 IYANQSGPWTNN------VAPGKNISTMGRYFKDAGYHTCYIGKWHLD-GHDYFGTGECP 115 I +SG +N A T+ DAGY TC GK H + +G + Sbjct: 72 IDGYESGVVSNADHRAFMEAQTAGRLTLPAVLTDAGYQTCAKGKMHFEPARACYGFEQMS 131 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 D + + + ++ L + D TW I + A+DF++ Sbjct: 132 LPLD---YMRSCDKRGDRIRPKVHGVGECLMEPVISTVDVRDSMTTW---IGDEAIDFIE 185 Query: 176 Q--PARADEPFLMVVSYDEPHHPFTCPVEYLE----------KYADFYYELGEKAQDDLA 223 P R PF + S+ +PH PF ++ E Y D+ L Q LA Sbjct: 186 TRDPLR---PFFLWTSFTKPHPPFDPCRDFWELYRQIPMPEPVYGDWSRTLEGTPQGFLA 242 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYT 282 E+ + P + Y+AC VD Q+GR+ AL ENTW+++T Sbjct: 243 GSYENTDMHLFG-PEQIAAS----RRAYYACITQVDYQLGRLFGALRENGLLENTWILFT 297 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE----RRQVDTPVSHIDLLPTMMAL 338 SDHGEM+G H + S+ ++ +P II P G ++D ++ D+ PT++A+ Sbjct: 298 SDHGEMLGDHYM-SQKNLFFEGSAHVPFIIVPPAGRGIVHNNRIDRVITLADVFPTVLAM 356 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN-LFT 397 A + P+ GEN+L + + H +F C + + KL+ + + Sbjct: 357 AGLPSPKEKRGENLLKWIGGKRQDERIFYGDSLHTNF-------CVMENRKKLIYTRIGS 409 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 S L+D DP E HNL DD +A R ++ L+ ++ K Sbjct: 410 SLLLFDLETDPMERHNLADDPEYAKCRERLWTLLISHVKK 449 >UniRef50_D2R1A1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1A1_9PLAN Length = 486 Score = 140 bits (352), Expect = 1e-31, Method: Compositional matrix adjust. 
Identities = 114/452 (25%), Positives = 205/452 (45%), Gaps = 24/452 (5%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF++ D +GCY + T +ID LA+EG+RF A+ + C P+RA L +G Y Sbjct: 31 NVLFIIADDLTATALGCYGNQICQTPHIDRLASEGMRFTHAFCNATYCGPSRASLMSGYY 90 Query: 65 ANQSG--PWTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGEC----PPE 117 + +G +T+ +T +F+++GY+ + K +H+ TG Sbjct: 91 PHATGILGYTSPRPAIGQRATWSEHFRNSGYYAARVSKIYHMGVPGDIETGSNGADDAAS 150 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGL------NSVEDLQANHIDETFTWAHRISNRAV 171 WD + +G + + T + + +G N+ ++A+ D+ R + + Sbjct: 151 WDERFNIEGPEWKAAGTGETLEGNPDGKKPVMGGNTFVVVEADG-DDLVHSDGRAALKTA 209 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADF-YYELGEKAQDDLANKPEHHR 230 + ++Q +PF + + PH PF P +Y E Y + L K DD + P Sbjct: 210 ELIRQ--HTQKPFFIACGFVRPHVPFVAPRQYFEPYLPYDKLPLPTKVADDWKDIPLAGI 267 Query: 231 LWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGE 287 + ++ + D+ + Y+A ++D Q+G+V++AL ++T VI+TSDHG Sbjct: 268 NYKTSVNMKM-DERRQKKAIGGYYAAVSYMDAQVGKVLDALEQSGAADHTIVIFTSDHGY 326 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G H +K ++ D +++PLIIR P + + V IDL PT+ +L +E PE L Sbjct: 327 HLGEHDFWAK-VSLLDQSSKVPLIIRVPGKKPAVCHSLVELIDLYPTIASLCGLEVPERL 385 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G+NI + + V + + + G + W + EL+D + D Sbjct: 386 QGKNIATLWDDPHKQVRDTAFSVAPMTQGFLLRDHQWSFIQYG--EEGAKGLELFDVKAD 443 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 P + NL + DV L +++ +R Sbjct: 444 PQQHTNLAQSPEYEDVVRGFQSKLKEHLQTLR 475 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 134/500 (26%), Positives = 215/500 (43%), Gaps = 126/500 (25%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PNF+ + D +GCY + T ++D LA EGIRF + Y+ SPVC+P+RA L Sbjct: 14 MKQPNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLL 73 Query: 61 TGIYANQSGPWTNNVAPGKNIS--------TMGRYFKDAGYHTCYIGKWHL--------D 104 TG Y ++G + K S T+ + K AGY T GKWHL + Sbjct: 74 TGKYPARAG--VGEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEETSPN 131 Query: 105 GH---DYFG-TGECPPEWDADYWFD---GANYLSELTEKEISLWRNGLNSVEDLQANHID 157 H ++FG C + +++ G N L +L E E +W NG E Sbjct: 132 AHGFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTE-------- 183 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 I+ R+VDF+Q+ + PF + SY+ PH+P P +Y++++A Sbjct: 184 -------LITERSVDFIQRSREQEAPFFLFASYNAPHYPMHAPQKYMDRFA--------- 227 Query: 218 AQDDLANKPEHHRLW-AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RE 275 H W Q M + + VDD +G+++ AL E Sbjct: 228 -----------HLPWDRQVMAAMIAA---------------VDDGVGKIVKALKEAGCYE 261 Query: 276 NTWVIYTSDHGE--------------MMGAHKLISKG--AAMYDDITRIPLIIRSPQG-E 318 +T + ++SD+G G I +G A++++ R P I+ P G E Sbjct: 262 DTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEGGIREPAILSWPNGWE 321 Query: 319 RRQV-DTPVSHIDLLPTMMALADIEKPEILPGENI----------LAVKEPRGVMVEFNR 367 QV D + +DL PT + LA ++ P P + + L ++EP F Sbjct: 322 GGQVRDEVAAMMDLAPTFLDLAGVD-PAAGPLQGVALDGSSLKEMLQMREPSPHQQLFWE 380 Query: 368 YEIEHDSFGGFIPVRCWVTDDFKLVLN------LFTSDELY--DRRNDPNEMHNLIDDIR 419 Y+ G + VR D+KLVLN D+++ D DP E NL D R Sbjct: 381 YQ-------GQLAVR---EGDWKLVLNGKLDFDRVVPDQIHLSDLSRDPGERSNLAD--R 428 Query: 420 FADVRSKMHDALLDYMDKIR 439 + ++ ++ + D+ ++++ Sbjct: 429 YPEIVERLSRDVRDWYEEVQ 448 >UniRef50_C9L4R5 Mucin-desulfating sulfatase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4R5_RUMHA Length = 484 Score = 139 bits (350), Expect = 2e-31, Method: Compositional matrix adjust. 
Identities = 129/490 (26%), Positives = 207/490 (42%), Gaps = 94/490 (19%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+M D Q + + C K L T N++ +A G++F + Y SPVC+PARA + TG Sbjct: 2 NILFIMADDQGSWAMNCGGTKELCTPNLNRIAESGMQFQNFYCVSPVCSPARASVLTGDI 61 Query: 65 ANQSGP-----------------------WTNNVAPGKNIS------TMGRYFKDAGYHT 95 + G W K IS T + GY Sbjct: 62 PSSHGVHDWIRSGNIDKDKFEEAGRENPYWNGYSCEDKPISYLEGKTTYTDVLNENGYRC 121 Query: 96 CYIGKWHLDGHDYFGTGECPPEWDADYW---FDGANYLSELTEKEISLWRNGLNSVEDLQ 152 GKWHL G CP + ++ G +Y + NG +++ L Sbjct: 122 ALAGKWHL------GDSVCPQHGFSKWYTIGLGGCDYFHP------DIVENG--NIKVLH 167 Query: 153 ANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY 212 ++ E I+N+A+++L + +EPF + V + PH P+ ++ +K+ D+Y Sbjct: 168 EQYVTEV------IANKAIEYLNEFQHQEEPFYLSVHFTAPHSPWG-EEQHPKKWMDYYE 220 Query: 213 ELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALT 270 ++ D A+ P+ PV L YFA +D+QIGR+++ L Sbjct: 221 NCDFQSIPDEADHPD-------LTTGPVFGTEKRKENLRGYFAAISAMDEQIGRILDTLE 273 Query: 271 PEQ-RENTWVIYTSDHGEMMGAHKLISKGAA-----MYDDITRIPLIIR----SPQGERR 320 RENT V+YT+D+G MG H + KG MY+ ++P ++ PQG+R Sbjct: 274 ANGLRENTLVVYTADNGMSMGHHGVWGKGNGTFPFNMYETSVKVPFLMSLPGVIPQGKRE 333 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDS---- 374 + T +S D+ PT++ L +++ E LPG + + R+E EH Sbjct: 334 E--TILSAYDIFPTLLELCKLDRKECEKLPGRSFAYLL----------RWEKEHKKRDEE 381 Query: 375 ---FGGFIPVRCWVTDDFKLVLNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 F + PVR D+K + + ELY DP E NL + + +M Sbjct: 382 IVVFDEYGPVRMIRNQDWKYIHRYPYGPHELYYLTEDPEEKENLYGQPEYEKMVVEMRTR 441 Query: 431 LLDYMDKIRD 440 L ++ +K D Sbjct: 442 LNEWFNKYAD 451 >UniRef50_C6J5I8 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I8_9BACL Length = 522 Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 128/477 (26%), Positives = 203/477 (42%), Gaps = 64/477 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRP+ F+M D + +G + T ++D+L+ + + F +AYT P+C PARA + T Sbjct: 8 KRPHVFFLMCDELRADSLGYMGNSIVKTPHLDNLSKDAVIFENAYTNCPMCVPARASMMT 67 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD------GHDYFGTGECP 115 G +G N + + + + GY T GK H+ G + F +G Sbjct: 68 GRNPISNGVLDNAMLMIDDEKPLPDLLRQNGYTTTLFGKLHVHRSAEEIGFEEFQSGYGD 127 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNS---------VEDLQANHIDETFTWAHRI 166 P Y S L K+ + + + H D+T R+ Sbjct: 128 P------------YTSFLGIKDPEMRKKSSYKKNEGDIPLVIHGESPTHPDQT--PCSRL 173 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 + + + + +D+P +S +PH P+ Y E Y L A L +KP Sbjct: 174 TEDYIRRISEIPGSDKPIFHHLSLHDPHTPYMPTKPYSEMYDPAQMPLPPNAGRSLDDKP 233 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTWVIYTS 283 HR + + + Y L Y+ VDD+IG+VI L E +++ +I+TS Sbjct: 234 ITHRYFHKVRGFDKLTEEDYRKSLASYYGLVTHVDDRIGKVIARLKELELYDDSLIIFTS 293 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADI 341 DHG MMG H + K MY+ + RIPL+++ PQ ++DT ID+LPT++ A I Sbjct: 294 DHGSMMGEHGFVEKWGHMYEPVVRIPLLVKLPQNVNGGMRLDTFAEIIDILPTILDAAGI 353 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFI---PVRCWVTDDFKLVL----- 393 PE + G+++L V RG E +R E F G + P +KL + Sbjct: 354 AVPEEVQGKSLLPVC--RGESKE-HRTEAHSQYFCGSLHREPALMIRDHQWKLTIYPEQE 410 Query: 394 ----NLF---------------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 L+ ELYD +DP E HNL D+ ++A + +M L Sbjct: 411 SIHEKLYGDHYLKYSPFFDLPLVEGELYDLLSDPYEQHNLFDNPKYAAQKEEMLSKL 467 >UniRef50_A6DFZ4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFZ4_9BACT Length = 519 Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 129/479 (26%), Positives = 208/479 (43%), Gaps = 47/479 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N L + D + CY K + NIDSLA G F S Y VC P+R +FT Sbjct: 19 EKANVLIITIDDLKPTL-ACYGDKYAVSPNIDSLADNGTLFRSNYCQQAVCAPSRISMFT 77 Query: 62 GIYANQSGPW---TNNVAPGKNISTMGRYFKDAGYHTCYIGK-------------WHLDG 105 G+ + +G T+ NI TM +YFK+ GY + GK W G Sbjct: 78 GLRPDTTGILDLHTHMRDINPNILTMPQYFKENGYLSIGYGKLMHGAKNDDKELSWSELG 137 Query: 106 HDY-FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNS-VEDLQANHIDETFTW- 162 D + P D +L + + L + L +++ A + E + Sbjct: 138 DDLPYNKNHPKPVLDKFQNPKAHQVFKKLNKTQKRLKTSLLQKEMKNKGAYLVSEAYDLP 197 Query: 163 --AHR---ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE- 216 A+R ++ + L + A E F MV+ +++PH PF P +Y + Y L E Sbjct: 198 DDAYRDGAVAKAGIQRLNELAETKEKFFMVLGFNKPHLPFNAPKKYWDMYDPNKLPLAEH 257 Query: 217 KAQDDLANKPEHHRLWAQA-----MPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINAL 269 + QD K +H A D+ H + Y+AC +VD Q+GRV++ L Sbjct: 258 QKQDQQRPKYAYHSFGELAAYKDYQIGKAVDEKRQRHLIHAYYACVSYVDAQVGRVMDEL 317 Query: 270 TP-EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVD-TPVS 327 +NT V+ DHG +G H L K + ++ TR PLII +P ++ QV +P Sbjct: 318 KRLNLDKNTIVVLWGDHGWHLGDHGLWCKHSN-FEQATRAPLIISAPNQKKGQVSQSPTE 376 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKE-PRGVMVEFNRYEIEHDSFGGFI------P 380 ID+ P++ L +E PE L GE++ + E P+ + +++ + + G+ Sbjct: 377 FIDIFPSLCKLTGLEIPEQLEGEDLSPILEDPKAKVKDYSISQYLRWANHGYTMRSGKYR 436 Query: 381 VRCWVTDDF----KLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 + W+ ++ K N ELYD + DPNE N ++ +A+V K+ Y Sbjct: 437 LTLWMPKNYYGFMKFDENDIVEVELYDYQKDPNETTNFANNPEYAEVLRKLKKQFASYF 495 >UniRef50_Q7UGD6 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD6_RHOBA Length = 578 Score = 139 bits (349), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 137/499 (27%), Positives = 211/499 (42%), Gaps = 98/499 (19%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPNFLFV+TD Q+ M+GC + T NID LA EGI F+ AY S +CTP+R +F Sbjct: 51 RPNFLFVLTDDQSYGMMGCDGNELTRTPNIDQLAREGIFFDRAYVTSAICTPSRISIFLS 110 Query: 63 IYANQSGPWTN---NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD-GHDYFGTGECPPEW 118 Y + G N +VAP + +D GY+T Y+GK H G D + +G E Sbjct: 111 QYERKHGVNFNSGTSVAPEAWAKSYPVVMRDNGYYTGYVGKNHAPIGKDGYNSGLM--EE 168 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF-TWAHRISNRAVDFLQQP 177 DY++ G ++ + ++ + N E F ++ HR+ AV FL++ Sbjct: 169 SFDYFYAGHGHIRFYPKAVHKIFEGAEYDTQVEIVNEGAEDFLSYEHRLDG-AVRFLEER 227 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH-------- 229 AD+PF + + + PH T ++ E D Y L + L P+H+ Sbjct: 228 P-ADKPFCLSICLNLPHSAGTGSMQQRESDDDIYKSLYRDIEIPL---PKHYVAKDDIKT 283 Query: 230 -RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-------------- 274 RL A + + G + + + ++I R + +LT R Sbjct: 284 PRLPADVLRASDRQTGYN----FVDTPELLKERIIRQMQSLTGIDRLIGNLRTKLETEGV 339 Query: 275 -ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ------GERRQVDTPVS 327 +NT +I+ SDHG MG H L K A Y+ T +PLI+ P+ G R + V Sbjct: 340 DDNTIIIFCSDHGLFMGQHGLGGK-ALCYEQTTHVPLIVYDPELPTVLKGAR--CNELVQ 396 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFN--------------RYEIEHD 373 ID+ TM+ LADIE P G+++ + G + + R E D Sbjct: 397 TIDIAATMLDLADIETPATFQGKSMRPLLSGDGGAIRDHVFTENLWVTHFGNPRIEAVQD 456 Query: 374 SFGGFI----------PVRCWVTDDFKLVLNLF-------------------------TS 398 +I V+ V +D + +L Sbjct: 457 KRWKYIRYYRNDRVSASVKIQVAEDLGMKSSLMLYGVHDNEIAVYRNHAEASLRGEEPIH 516 Query: 399 DELYDRRNDPNEMHNLIDD 417 +EL+D +DP+E++NLIDD Sbjct: 517 EELFDLESDPDELNNLIDD 535 >UniRef50_C5BAV0 Sulfatase, putative n=2 Tax=Edwardsiella RepID=C5BAV0_EDWI9 Length = 544 Score = 139 bits (349), Expect = 3e-31, Method: Compositional matrix adjust. 
Identities = 132/476 (27%), Positives = 209/476 (43%), Gaps = 58/476 (12%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + + D A VG Y +NT IDSL A G RF AY P+C P+RA +TG Sbjct: 45 NIVIITADQLARRGVGGYGNPHVNTPAIDSLIARGTRFEQAYCPYPLCAPSRACYWTGRL 104 Query: 65 ANQSGPWTNNVAPG--KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +Q+G N+ +P +++ T+G F AGY + GK H ++ A Sbjct: 105 PHQTGVIAND-SPNLPQDMVTLGELFSRAGYECRHFGKRH--------------DYGALK 149 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 F A+ + EL + + ++ ED+ + E+ + + R AD Sbjct: 150 GFTCADQI-ELPYDSPAAYPVDYDTREDVYC--LQESLKYIDTLKGRG---------ADA 197 Query: 183 PFLMVVSYDEPHH------PFTCPVEYLEKYADFYYELGE-KAQDDLANKP--------E 227 PF++ + ++ PH+ F P ++ L DL N+P Sbjct: 198 PFMLAIEFNNPHNINGWTGAFAGPHGDIDGLGTLPPLLDNFDTSADLPNRPLAIQYACCT 257 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHG 286 H+R+ A SP+ + + Y+ + D IG+V++AL ++T V++ +DHG Sbjct: 258 HNRVMQAANWSPL--NFRQYLKAYYHFTELADGFIGQVLSALRASGHADDTLVVFFADHG 315 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPTMMALADIEKP 344 + MGAH+L++K Y++ T +PLI P G R + S DLLPT+ A + P Sbjct: 316 DAMGAHRLVAKMNWFYEESTNVPLIFAGP-GIRPHSSSRHLTSLCDLLPTLCDYAGLTPP 374 Query: 345 EILPGENILAVKEPRGVMVEFNRYEI----EHDSFGGFIPVRCWVTDDFK-LVLNLFTSD 399 L G ++L + RG + R E+ D P R TD +K ++ + Sbjct: 375 PGLYGRSLLPIL--RGEQPDTWRDEVITQWNTDRNVDVQPARMLRTDRYKYIIYKENEEE 432 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL-RPWRK 454 ELYD + DP E HNL + R + DY+ DPF S + + R WR Sbjct: 433 ELYDLQQDPGETHNLAHSAEHQEQRQALRARFDDYVRNQIDPFYSQEAIIDRRWRS 488 >UniRef50_A0JVM4 Sulfatase n=2 Tax=Actinomycetales RepID=A0JVM4_ARTS2 Length = 479 Score = 139 bits (349), Expect = 4e-31, Method: Compositional matrix adjust. 
Identities = 140/507 (27%), Positives = 222/507 (43%), Gaps = 76/507 (14%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG- 62 PN L +++D Q +GC + T ++D+LA+ G R ++ + SPVC+PARA L TG Sbjct: 7 PNILLILSDDQGAWALGCSGNTEIQTPHLDNLASGGTRLDNFFCVSPVCSPARASLMTGT 66 Query: 63 ----------IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 ++ ++GP + G+ + T AGY+ GKWHL +D G Sbjct: 67 IPSKHGVHDYLHGVETGPEAPDYLQGQRLFT--DDLAAAGYYMGLSGKWHLGANDRAREG 124 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 +WF A S +++RNG V++ ++ + I+ + Sbjct: 125 -------FSHWFSLAGGGSPY--DAATMYRNG---VKETVYGYLTDA------ITADSTG 166 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFT--CPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 F+++ A D PF + ++Y PH P+ P E+ Y D +E +P H Sbjct: 167 FMERAAGQDSPFFLALNYTAPHKPWKDQHPAEFTALYDDCAFE-------SCPQEPTHP- 218 Query: 231 LWAQAMPS-PVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHG 286 W + P+G + L YFA +D IG+V+ L RE+T VI++SD+G Sbjct: 219 -WTPTVDGVPIGGEADVRAALVGYFAAVSAMDAGIGQVLQKLDELGLREDTLVIFSSDNG 277 Query: 287 EMMGAHKLISKGAA-----MYDDITRIPLIIRSP----QGERRQVDTPVSHIDLLPTMMA 337 G H + KG ++D ++P I P +G+ R+ +S DL T++ Sbjct: 278 FNCGQHGVWGKGNGTFPLNVFDSSIKVPAIFSFPGRIARGKVRE--ELLSAYDLPATILE 335 Query: 338 LADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 LA ++ E PG++ V + + R + D +G PVR +D +K V Sbjct: 336 LAGLDPLEFEQGPGKSFADVLRGKPLAPARPRPVVVFDEYG---PVRMIRSDSWKYVHRY 392 Query: 396 FTS-DELYDRRNDPNEMHNLIDDI----RFADVRSKMHDALLDYMDKIRD----PFRSYQ 446 ELYD DP E HNL+ ++ R A +R M Y ++ D P Sbjct: 393 PQGPHELYDLATDPGERHNLVREVRHEERVAGMRRDMQLWFEQYQEEEADGRKFPVVGAG 452 Query: 447 WSLRPWRKDARPRWMGAFRPRPQDGYS 473 +L P R D +GAF P DG S Sbjct: 453 QTL-PVRADP----LGAFTPPSWDGIS 474 >UniRef50_A0LYA0 Sulfatase n=8 Tax=Bacteria RepID=A0LYA0_GRAFK Length = 566 Score = 138 bits (348), Expect = 4e-31, Method: Compositional matrix adjust. 
Identities = 125/509 (24%), Positives = 219/509 (43%), Gaps = 111/509 (21%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLN----TQNIDSLAAEGIRFNSAYTCSPVCTPARA 57 KRPN +F+MTD A + Y G P++ T NID +A G +F + + + +C P+RA Sbjct: 41 KRPNIVFIMTDDHAAQAISAY-GHPVSQKAPTPNIDRIANNGAKFLNNFCTNSICGPSRA 99 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + TG +++ +G N + T+ +Y K AGY T +GKWHL G P+ Sbjct: 100 VILTGKFSHINGFRMNGETFDGSQPTLPKYLKKAGYQTAIVGKWHLHG---------KPQ 150 Query: 118 WDADYW---FDGANYLS-ELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 DYW D NY + E K + NG + D+ I++ +++ Sbjct: 151 -GFDYWNILKDQGNYYNPEFIHKNDTSIVNGYAT--DI--------------ITDMGIEY 193 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE----KAQDDLANKPEHH 229 L++ + DEPF ++V + PH + P+ ++ Y + L + K ++ +A + + Sbjct: 194 LEKKRKKDEPFFLMVHHKAPHRNWMPPLRHINTYDSITFTLPDTYFSKHENQVAAQEQLQ 253 Query: 230 RLWAQ-------AMPSPVGDDGLYHHPL-------------------------------- 250 ++ M G D L H+P Sbjct: 254 TIYEDMYEGHDLKMTISKGSDSLRHNPWKTDFNRMSKEQRIAWNDAYRPKNDAFHDANLT 313 Query: 251 ---------------YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKL 294 Y VD+ +G++++ L + ENT ++YT+D G +G + Sbjct: 314 GKDLAEWKGQRYLRDYMGTVAAVDEGVGKILDYLEEQGLTENTIIVYTTDQGFYLGEKGM 373 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGEN- 351 K MY++ +PL+I+ P+G ++ +D ++D PT + A E PE + G++ Sbjct: 374 FDK-RFMYEESLAMPLLIQYPKGIKKGTTIDALTQNLDFAPTFLDFAGAEIPESMQGKSL 432 Query: 352 --ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV---TDDFKLVLNLFTSD----ELY 402 +L+ P G + Y + F F V+ T+ +KL+ F D ELY Sbjct: 433 RPLLSGNNPDGNFRDAVYY--HYYDFPAFHMVKRHYGVRTERYKLI--HFYDDIDTWELY 488 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 D + DP E NL + + +++ +H+ L Sbjct: 489 DLKEDPKEEINLYGSVEYEEIQKNLHEKL 517 >UniRef50_A6CG48 Sulfatase family protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CG48_9PLAN Length = 472 Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust. 
Identities = 127/437 (29%), Positives = 200/437 (45%), Gaps = 41/437 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ D + CY + +++ NID LA + F A+ P C +RA L T Sbjct: 21 ERPNVLFIAVDDLRPEL-ACYGKQHIHSPNIDKLAESSVLFERAFCMVPTCGASRASLMT 79 Query: 62 GIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECP 115 GI ++ W AP N +TM FK GY+T +GK +H + G E Sbjct: 80 GIRPARNRFVNFLAWAERDAP--NATTMNTQFKQNGYYTASLGKIFHHPADNRQGWSE-- 135 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVED---LQANHIDETFTWAHRISNRAVD 172 P W G + +E R L + + ++ + + ++ +A++ Sbjct: 136 PPWRP----KGVQWYQRPENQEKHAARQKLGNKKKGPAWESADVPDNAYMDGVLAEKAIE 191 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL--GEKAQDDLANKPEHHR 230 LQQ + ++PF + V + +PH PF P +Y + Y +L K D A K HR Sbjct: 192 KLQQLEKQEQPFFLAVGFFKPHLPFIAPQKYWDLYDHDKIQLPANHKVPQD-APKESIHR 250 Query: 231 ---LWAQA-MPS--PVGDD---GLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 L A A +P+ PV ++ L H Y+AC + D QIG+++ L Q +NT V+ Sbjct: 251 FGELRAYADIPAKGPVSEETARNLIHG--YYACVSYTDAQIGKLLAELDRLQLSDNTIVV 308 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTMMA 337 DHG +G H L K + Y+ IPLI+R+P GERR + + ID+ PT+ Sbjct: 309 LWGDHGWNLGDHTLWCKHSC-YESSLHIPLIVRAPGIKGGERR--SSLMESIDVYPTLCD 365 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 LADI +P+ L G++ +++ + E+ + + G I ++ L Sbjct: 366 LADIPQPKHLKGQSFVSLM--KDSTAEWKQAAVSRYRNGDTIRTDTLRYTEYTLPKGKLV 423 Query: 398 SDELYDRRNDPNEMHNL 414 S LYD DP E N+ Sbjct: 424 SQMLYDHSTDPLENVNV 440 >UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Tax=Firmicutes RepID=A0Q2E3_CLONN Length = 483 Score = 138 bits (347), Expect = 6e-31, Method: Compositional matrix adjust. 
Identities = 137/457 (29%), Positives = 200/457 (43%), Gaps = 70/457 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N + ++TD Q +GCY T +DSLA GIRF + + SPVC+PARA ++TG Sbjct: 5 KINVISIITDDQGYWSMGCYGNHDAITPTLDSLANNGIRFENFFCVSPVCSPARASIYTG 64 Query: 63 IYANQSG------PWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 +Q G W N K ST GY GKWHL D Sbjct: 65 RIPSQHGIHDWLDEWNNGYTTEEYLKGQSTFVDILAKNGYECAMSGKWHLGVAD------ 118 Query: 114 CPPEWDADYWFD----GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 P+ YW+ G Y K+ +L I E +++ Sbjct: 119 -KPQNGFKYWYSHQKGGGPYYGAPMYKDGTL---------------IHEERYVTDVMTDY 162 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFT---CPVEYLEKYADFYYELGEK-AQDDLANK 225 ++F+++ +D PF + ++Y PH P++ P E L+ Y D ++ K ++D Sbjct: 163 GLEFIEKQRDSDNPFYLSLNYTAPHAPWSPENHPKELLDLYKDCEFKSCPKDGKND---- 218 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTS 283 W+ P +D YFA VD+ I RVI+ L ENT +I+TS Sbjct: 219 ------WSIDYIFPKTEDERREVLRGYFAALTSVDNNIKRVIDKLKEMGVLENTLIIFTS 272 Query: 284 DHGEMMGAHKLISKGAA-----MYDDITRIPLIIRSPQGERRQVDTP-VSHIDLLPTMMA 337 D+G MG H + KG M+D +IP I + QV T +SH D+ PT+M Sbjct: 273 DNGMNMGHHGIFGKGNGTSPVNMFDTSVKIPCFITKIGDIKPQVSTDLLSHYDIRPTLME 332 Query: 338 LADIEKPEI-----LPGENILAVKEPRGVMVEFNRYEIE-HDSFGGFIPVRCWVTDDFKL 391 IE EI LPG + ++ RG +E + E+ +D +G P R T ++K Sbjct: 333 YLGIED-EIDEGVKLPGRSFASLL--RGEKLERDDNEVVIYDEYG---PARMIRTKEWKY 386 Query: 392 VLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKM 427 V ELYD NDP+E NLIDD D+ ++ Sbjct: 387 VHRYPAGPHELYDLVNDPDEKINLIDDEDKKDIVKEL 423 >UniRef50_Q1IH24 Choline sulfatase n=29 Tax=cellular organisms RepID=Q1IH24_PSEE4 Length = 505 Score = 137 bits (346), Expect = 8e-31, Method: Compositional matrix adjust. 
Identities = 120/431 (27%), Positives = 191/431 (44%), Gaps = 30/431 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN LF+M D A ++ Y+ P+ N+ LA + + F+SAY SP+C P+R L Sbjct: 1 MKQPNILFIMADQMAAPLLPIYTPSPIKMPNLARLAEQAVVFDSAYCNSPLCAPSRFTLV 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE---CPPE 117 +G ++ G + N +I T Y + GY T GK H G D E Sbjct: 61 SGQLPSRIGAYDNAADFPADIPTYAHYLRRLGYRTALSGKMHFCGPDQLHGYEERLTSDI 120 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDET--FTWAHRISNRAVDFLQ 175 + ADY + N+ + ++ S W + ++SV LQA T + + +A +L Sbjct: 121 YPADYGW-AVNW--DAPDQRPS-WYHNMSSV--LQAGPCVRTNQLDFDEEVVFKARQYLY 174 Query: 176 QPARAD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR--- 230 R D PF + VS PH P+T P Y + Y L P R Sbjct: 175 DHVREDHGRPFCLTVSMTHPHDPYTIPKRYWDLYEAVDIPLPRDVIAQSQQDPHSQRLLK 234 Query: 231 ---LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHG 286 LW + +P D YF ++DD IG ++ L ++T ++++ DHG Sbjct: 235 VYDLWDKPLPVDKIRDA---RRAYFGACSYIDDNIGLLVQTLEDCGLADDTLIVFSGDHG 291 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALAD--IEK 343 +M+G L K ++ R+PL+I +P+ ++ VS DLLPT++ LA ++K Sbjct: 292 DMLGERGLWYK-MHWFEMSARVPLLIHAPKRFAPARISASVSTCDLLPTLVELAGGAVDK 350 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 L G ++L + +G E + + G + +R +K V + LYD Sbjct: 351 DLHLDGRSLLGHLQGQGGHDEVIGEYMAEGTVGPLMMIR---RGAYKFVYSEDDPCLLYD 407 Query: 404 RRNDPNEMHNL 414 DP+E NL Sbjct: 408 LSRDPHERENL 418 >UniRef50_C6CRB3 Sulfatase n=2 Tax=Bacilli RepID=C6CRB3_PAESJ Length = 473 Score = 137 bits (346), Expect = 8e-31, Method: Compositional matrix adjust. 
Identities = 108/379 (28%), Positives = 179/379 (47%), Gaps = 52/379 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +++ D G + T NID A E + FN A +C+P+C+P RA L TG Sbjct: 2 KPNIVYIFADQWRKQAAGFMGEDQVLTPNIDRFARESLVFNHALSCTPLCSPHRAALMTG 61 Query: 63 IYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP- 116 Y +G +TN ++ + +G K AGY T YIGKWHLD + CP Sbjct: 62 RYPQSNGVYTNCKNGADIMLSPDEICIGDVLKGAGYQTGYIGKWHLDLPE---QNHCPEP 118 Query: 117 -----EWDA-----------DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 EWDA D+W+ Y L WR+ V+ Q + ET Sbjct: 119 ESGAREWDAYTPPGPKRHGFDFWYSYGAYDHHLRPH---YWRDSPEMVQIEQWSLEHET- 174 Query: 161 TWAHRISNRAVDFLQQPARAD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 + A+D++++ EPF + +S++ PH PF E + Y E+ Sbjct: 175 -------DVALDYIREKGSGGNMEPFALFLSWNPPHSPF-------ELVPELYKEIYRGR 220 Query: 219 QDDLANKPEHHRLWAQAMPS-PVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQ-RE 275 + +L RL A + G++ L + YFA +D+Q GR+ + E Sbjct: 221 EINLRPNVRWERLEAHTGETFESGEEALLRYTRDYFAAVSGMDEQFGRIWREIQELGISE 280 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPV--SHIDLLP 333 +T ++ +SDHGE+MG+H L++K + +++ IP ++ +G+ +Q DT + + ID++P Sbjct: 281 DTLIVLSSDHGELMGSHGLMAK-HSWHEESIGIPCVMNW-EGKIKQGDTDMLFNSIDIMP 338 Query: 334 TMMALADIEKPEILPGENI 352 T++ LA + P + G ++ Sbjct: 339 TLLGLAGLPVPASVEGLDL 357 >UniRef50_D2MKV1 Choline-sulfatase n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MKV1_9BACT Length = 404 Score = 137 bits (345), Expect = 9e-31, Method: Compositional matrix adjust. 
Identities = 110/405 (27%), Positives = 184/405 (45%), Gaps = 46/405 (11%) Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH---DYFGTGEC 114 G F G++ANQ + + +AGY IGKWHL +++G + Sbjct: 2 GNFNGVFANQI----------LDKPAYPKLLSEAGYRVSCIGKWHLAKEGDTEFWGYDKW 51 Query: 115 PPEWDADYWF--DGANYLSELTEKEISLW--RNGLNSVEDLQANHIDETFTWAHRISNRA 170 P + W DG ++ + + W L A E +T +++ Sbjct: 52 HPYREWHQWLRDDGIDFRIDRDAVQPYEWGGEAPFYGRHPLPAERTMEAWT-----ADKT 106 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP---- 226 ++ + + +++PF++ ++ PH P+ P Y Y E + NKP Sbjct: 107 IELINGYSDSEQPFMIAANFFGPHFPYAVPAPYDTMYDPDTAERWGNFDEQFINKPLIQQ 166 Query: 227 -EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSD 284 E R A + P + Y+ F+DDQ+ RV++ L ENT VI+++D Sbjct: 167 KEMLRWNASHLTWPDWQKAI---AAYWGFCTFIDDQVRRVLDCLEANGLAENTVVIFSTD 223 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIE 342 HG+M+G+H+L +KG MY++ IPL+IR P D V+ +DL+PT + L Sbjct: 224 HGDMIGSHRLFNKGFHMYEETHHIPLVIRHPGATSSGTTCDEFVNLVDLMPTFLELGGAA 283 Query: 343 KPEILPGENILAVKE-------PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 P+ + G +I+ + P V EF+ YE S VR TD +K V N Sbjct: 284 VPDEIDGRSIMPLLRGETVEDWPDDVFAEFHGYEPTLAS------VRMVRTDSWKYVYNP 337 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 ++ DELYD ++DP+E+HNL + + + V +M ++ + + RD Sbjct: 338 YSEDELYDMKSDPHELHNLANHLGYKHVLRRMKARMVARLRETRD 382 >UniRef50_C5BYA8 Sulfatase n=2 Tax=Micrococcineae RepID=C5BYA8_BEUC1 Length = 478 Score = 137 bits (345), Expect = 9e-31, Method: Compositional matrix adjust. 
Identities = 133/482 (27%), Positives = 205/482 (42%), Gaps = 79/482 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN ++TD A + VG Y +T ID +A G R ++ + + +CTP+RA + T Sbjct: 3 RRPNICLILTDDHAAHAVGTYGSVVNSTPRIDEIAQRGWRLDNLFCTNSICTPSRASILT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G +++ +G T + + + T KDAGY T +GKWHL G GE D Sbjct: 63 GQHSHTNGVRTLSTPMDRELPTFVSQLKDAGYRTAIVGKWHL------GEGEEHRPRAFD 116 Query: 122 YWFDGANYLSELTEKEISLWR--NGLNSVEDLQANHI-DETFTWAHRISNRAVDFLQQPA 178 +W L + E +R +GL +V + I D W D P Sbjct: 117 HWM----ILRDQGEYHDPTFRTPDGLRTVTGYATDVITDLALQWLD-------DLDYGPD 165 Query: 179 RADEPFLMVVSYDEPH---------------HPFTCPVEYLEKYADFYYELGEKAQDDLA 223 D P+ +++ + PH P P + + YA +A +A Sbjct: 166 GTDSPWCLLIHHKAPHRSWEPDEAHRAQFAGRPIPVPATFTDDYAT-RSGAAHRAAMRVA 224 Query: 224 NKPEHHRLWAQAMPSPVG----DDGLYHHPL----YFACNDFVDDQIGRVINALTPE-QR 274 ++ L A P G D+ L+ + Y AC VDD +GRVI+ L + Sbjct: 225 DQLTRRDLKAD---PPAGLSYEDEALWKYQRYMEDYLACVASVDDNVGRVIDRLAERGEL 281 Query: 275 ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLL 332 ++T ++YTSD G +G H K MYD+ R+P ++ P R D V+++DL Sbjct: 282 DDTLLMYTSDQGFFLGDHGWFDK-RFMYDESIRMPFVVSCPTALDGGRSTDQIVTNVDLA 340 Query: 333 PTMMALADIEKPEILPGENI---LAVKEPRGVMVEFNRYEIEHDS----FGGFIPVRCWV 385 T++ AD+E + GE+ LA E F EHD G +R Sbjct: 341 RTILEAADVEPHPGMQGESFWGTLARGETPPADQSFYYRYWEHDDGAHHAAGHYGIR--- 397 Query: 386 TDDFKLVLNLFTSD----------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHD 429 T+ +KL+ F +D ELYD DP+E+ N+ DD +A VR + + Sbjct: 398 TERYKLI--YFYNDGLGLPGTGWATYAPEWELYDLEADPDELVNVADDPTYAVVRRDLTE 455 Query: 430 AL 431 L Sbjct: 456 RL 457 >UniRef50_UPI0001746164 choline-sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746164 Length = 469 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. 
Identities = 129/457 (28%), Positives = 201/457 (43%), Gaps = 45/457 (9%) Query: 10 MTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQSG 69 MTD +++GC K T ++D+LA G+ F++ Y SP+CTP+R TG Y + Sbjct: 1 MTDEHNASVMGCAGDKVARTPHLDALAERGVLFDAHYCASPICTPSRQSFTTGKYVSGHR 60 Query: 70 PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG-------TGECPP--EWDA 120 W+N + ++ R AGY + GK H G G G P E A Sbjct: 61 VWSNTPGVPEGTPSLARILNAAGYDSYLNGKMHYKGGMTHGYQIISEKDGRITPGKEPGA 120 Query: 121 DYWFDGANYLS---ELTEKEISLWRNGLNSVEDLQANH--IDETFTWAHRISNRAVDFLQ 175 + G+N + L + L + H +DE R + A+ FL+ Sbjct: 121 ERAARGSNAIKPRQRLAAGRFEDRGDELGEEFEHAGEHADMDEFVDVVRR--DHAIKFLK 178 Query: 176 Q-PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADF--YYELGEKAQDDLANKPEHHR-- 230 + A ++PF + + + PH+P P EYLE + D + E+ D L H R Sbjct: 179 ERGADNNKPFFLTIGFIAPHYPLVAPPEYLEHFRDKVPFPEVPPGYVDTLPLNYRHLRND 238 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 + +P + L Y+A +++DDQIG V+ AL + ENT VIYTSDHGE + Sbjct: 239 RKFERVPPALAKRALEG---YYARVEWIDDQIGMVLEALKNSRFAENTVVIYTSDHGENL 295 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQ----GERRQVDTPVSHIDLLPTMMALADIEKPE 345 G H L K M+D R+PLI+ P G+ R V +DL+ T+ A+ + P Sbjct: 296 GEHGLWWKN-CMFDSGARVPLIVSWPSRWKGGQHRTGACGV--LDLVQTIAAIGGAQVPS 352 Query: 346 ILPGENILAVKEPRGVM---VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD--- 399 G +++ + + + Y + + GF +R D+K V + + Sbjct: 353 DWKGVSMIPWLDDHSAPWRDLAVSEYYASYVA-SGFAMIR---QGDWKYVYHTRADELHG 408 Query: 400 ---ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 ELY+ R DP E+HNL MH A+++ Sbjct: 409 PERELYNLREDPRELHNLAGKEENMPRMEAMHKAMVE 445 >UniRef50_A6LF65 Choline-sulfatase n=26 Tax=Bacteroidales RepID=A6LF65_PARD8 Length = 520 Score = 137 bits (344), Expect = 1e-30, Method: Compositional matrix adjust. 
Identities = 129/481 (26%), Positives = 215/481 (44%), Gaps = 67/481 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +P+ + +MTD Q + +GC K + + NID LA EG F S Y+ +P TP RAGL TG Sbjct: 45 KPHIILIMTDQQRGDALGCMGNKAVISPNIDRLAQEGSLFVSGYSSAPSSTPGRAGLLTG 104 Query: 63 IYANQSGPWTNNVAPGKNISTMGRY-----FKDAGYHTCYIGKWH------LDGH----- 106 + PW + + ++ RY ++ GY+T IGK H L G Sbjct: 105 M-----SPWHHGMLGYGRMALKYRYEMPQMMRNLGYYTFGIGKMHWFPQKALHGFHATLI 159 Query: 107 DYFGTGECPPEWDADY--WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 D G E P++ +DY WF + K+ L G N +DE Sbjct: 160 DESGRVES-PDFISDYREWFQ-----LQAPGKDPDLTGIGWND-HAAGVYKLDERLHPTA 212 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 A + ++ D+P + VS+ PH P+ P YL+ Y D ++ + D Sbjct: 213 WTGQTACELIRN-YDNDKPLFLKVSFARPHSPYDPPQRYLDMYKDA--DIPKPHIGDWCG 269 Query: 225 K-PEHHRLWAQAMPSPVGDDG----LYHHPLYFACNDFVDDQIGRVINALTPE-QRENTW 278 + E A +P G+ G + Y+A F+DDQ+G++I L + +N Sbjct: 270 QYAEPKDPLQGASDAPFGNFGDAYAINSRRHYYANITFIDDQVGQIIQTLKDKGMYDNAL 329 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG------ERRQVDTPVSHIDLL 332 + +T+DHG+M+G H K Y+ IP I++ P G + ++ PV D L Sbjct: 330 ICFTADHGDMLGDHYHWRK-TYPYEGSAHIPYIVKWPAGISKSIPDGSSIEQPVELRDFL 388 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRCWVTDDF-- 389 PT + +A P + G ++L + + G ++ Y ++EH C+ D++ Sbjct: 389 PTFIDIAGGSVPPDMDGRSLLKLIQ--GQQEQWRPYIDMEH--------ATCYSDDNYWA 438 Query: 390 -------KLVLNLFT-SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 K + N S++L+D R DP E HNL +D + + S++ +++++ + D Sbjct: 439 ALTDGKIKYIWNFHNGSEQLFDLREDPGETHNLSEDAAYQNKLSELRKMMVEHLSERGDS 498 Query: 442 F 442 F Sbjct: 499 F 499 >UniRef50_D2R203 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R203_9PLAN Length = 490 Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 128/468 (27%), Positives = 207/468 (44%), Gaps = 54/468 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAY---TCSP-VCTPARAG 58 +PN +F+ D + + + T N+D LA +G F+ AY + SP VC +R+ Sbjct: 37 KPNVVFLFADDLSYEALAYAGNGQVKTPNLDRLAKQGTSFSHAYNMGSFSPAVCIASRSM 96 Query: 59 LFTG--IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 L TG ++ Q+ P KN+ R AGY T GKWH+ Sbjct: 97 LVTGRSVWKAQTLHAAGGKEP-KNVVLWPRQMHGAGYQTFITGKWHV------------- 142 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNS--------VEDLQANHIDETFTWAHRISN 168 W+ FD ++ K++ + + +S + W+ ++ Sbjct: 143 PWNPMLAFDVTAHVRGGMPKDVPSFYDRPHSDKPDTFDPANPGNGGYWQGGKHWSEVTAD 202 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 AV+F P M V+++ PH P P YL++Y E E +D PE Sbjct: 203 DAVEFFSASRDKSRPCFMYVAFNAPHDPRQAPQTYLDRYPT---ETIEVPKDFQPLYPER 259 Query: 229 HRLWA-------QAMPSPVGDDGL-YHHPLYFACNDFVDDQIGRVINALTPEQREN-TWV 279 + A + P P + + H Y+A +DDQIGR+++A+ + + T V Sbjct: 260 ASIGADEKLRDEKLAPFPRTEFAVRTHRREYYALITHLDDQIGRILDAIEQTKSDRPTMV 319 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRS---PQGERRQVDTPVSHIDLLPTMM 336 ++T+DHG G H L+ K MYD R+PLII PQG + +D PV D++PT + Sbjct: 320 MFTADHGLACGHHGLMGK-QNMYDHSIRVPLIIAGENIPQG--KTIDVPVYLQDVMPTAL 376 Query: 337 ALADIEK-PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-N 394 LA + PE+ + V+ + V + Y + S+ RC V D FKLV+ Sbjct: 377 ELAGVAPGPEVHFHSLLPIVRGEQKV----SNYPAIYSSYLNL--QRCVVKDGFKLVVYP 430 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 + +L+D ++DP E+ +L D A + ++ DAL+ + I DP Sbjct: 431 ALPAAKLFDLQHDPLELSDLSADPNHATRKEQLFDALVAEAESISDPL 478 >UniRef50_UPI0000E11039 sulfatase 1 precursor n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E11039 Length = 469 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. 
Identities = 130/443 (29%), Positives = 209/443 (47%), Gaps = 51/443 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF++TD N VG YS + T N+D +AA G+RF AY+ + VC+P+R LFTG Sbjct: 30 KPNVLFILTDDLGFNQVGAYSSTKIKTPNLDEMAANGVRFEQAYSGNTVCSPSRVSLFTG 89 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +N V + TM K A Y T GK+ + G + P D Sbjct: 90 RDGRYMSNNSNTVQLEEIDITMAHVLKHADYDTALFGKYSIGSK--MGVTD-PLAMGFDT 146 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI-SNRAVDFLQQPARAD 181 W+ + L + LWR+G+ E ++AN +A I +N +DF++Q + Sbjct: 147 WYGMYSILEGHRQYPQILWRDGVK--ERIKANEGGRQGAYASEIFTNETIDFIKQ--DRE 202 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN-KPEHHRLWAQAMPSPV 240 PF ++++Y PH P ++Y+D Y+ E + N KP W P P+ Sbjct: 203 NPFFVMLAYTSPHADLAVP----KQYSDM-YDFPETPYLGMQNGKPTDKYAW--YYPEPI 255 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHG--EMMGAHKLISK 297 + A +D+V G++I+ L Q +NT +I+TSD+G + GA Sbjct: 256 ERPNAVLAGMVTALDDYV----GQLIDTLKKTGQLDNTIIIFTSDNGPHDEGGADPEFFD 311 Query: 298 GAA--------MYDDITRIPLIIRSP-QGERRQVD-TPVSHIDLLPTMMALADIEKPEIL 347 AA +YD +P+I+ P Q ++ +VD TP + D+LPT+ +A + +++ Sbjct: 312 AAAPYKGQKRDLYDGGIHVPMILHWPDQIKQGRVDQTPWTFADVLPTLADIAGVNL-DLV 370 Query: 348 P--GENILAVKE----------PRGVMVEFNRYEIEHDSFGGFI--PVRCWVTDDFKLVL 393 P N +++K R + EF++ + +S G I + D+K V Sbjct: 371 PRVRTNGVSIKSILNDSPIEMPERTLYWEFSKQVGDPNS--GIIGDTFQAARRGDWKAVR 428 Query: 394 NLFTSD-ELYDRRNDPNEMHNLI 415 F +D ELY+ NDPNE +NL+ Sbjct: 429 YGFDADLELYNISNDPNESNNLV 451 >UniRef50_UPI00016C09FC sulfatase n=2 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C09FC Length = 463 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. 
Identities = 134/479 (27%), Positives = 211/479 (44%), Gaps = 51/479 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M+RPN LF+MTD Q + G + L T N+D L + + FN+AY +P C P+RA + Sbjct: 1 MRRPNILFLMTDEQKFDTFG-FVNSVLKTPNLDKLISNSVFFNNAYCSNPSCVPSRAAIV 59 Query: 61 TGIYANQ-SGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD------GHDYF---- 109 TG Y P + P ++ M + ++AGY+T +GK H D G+DY Sbjct: 60 TGKYPTACQCPTYISTLPKDEVTFMAK-LQEAGYYTAVVGKQHFDASEIYKGYDYECIVD 118 Query: 110 GTGECPPEWD----ADYWFDGANY-LSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 G + P + A+Y + Y + ++ EK I G D++ NHID + Sbjct: 119 GHSQTAPYENIVAFAEYLKEKGVYKVKQVGEKLIC----GSEWTGDIE-NHID--YFIGE 171 Query: 165 RISNRAVDFLQQPARAD-EPFLMVVSYDEPHHPFTCP-VEYLEKYADFYYELGEKAQDDL 222 + + + AD +P+ M +S+ PH P+ C Y Y L E DL Sbjct: 172 KGREWLANHIDTTKAADAKPWFMTLSFPGPHQPYDCEGTAYAANYNLADMTLEESKVADL 231 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALT-PEQRENTWV 279 KP H++ D+ LY Y+A +D++IG VI L ++ +NT + Sbjct: 232 ETKPPHYKHLNPRAYIDQYDEELYRRTKRSYYANMSLIDEKIGEVIAMLKEKDEYDNTLI 291 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP----QGERRQVDTPVSHIDLLPTM 335 I+++DHG+ MG L++K + + + +PL ++ P QG R + V++ID+ T Sbjct: 292 IFSTDHGDFMGDFGLVTKAQYLSEGLMHVPLFVKPPIKDFQGFR--TNDLVTNIDIASTC 349 Query: 336 MALADIEKPEILPGEN--ILAVKEPRGVMVEFNRYEIEHDSFGGF---IPVRCWVTDDFK 390 + A + EN E V Y HD G I +V D+ Sbjct: 350 LTAAKAPEKITEHMENHPYNDYWEKDTVSARDYVYMEAHDIKGVIRDGIKTLYYVERDY- 408 Query: 391 LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 ELYD DP E NL DD ++ + ++D M + P S QW++ Sbjct: 409 --------GELYDLNTDPAERINLWDDPKYQTAKMAGMAKIIDKMFWLS-PKSSMQWNV 458 >UniRef50_UPI0001745B0B sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745B0B Length = 676 Score = 135 bits (340), Expect = 3e-30, Method: Compositional matrix adjust. 
Identities = 121/456 (26%), Positives = 199/456 (43%), Gaps = 49/456 (10%) Query: 4 PNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN LF++ D + +G G P T N+D LA G+RF +A+ +C P+R + TG Sbjct: 37 PNVLFIIAD-DLNDWIGWMGGHPQARTPNMDRLARMGMRFMNAHCSYALCNPSRTSMLTG 95 Query: 63 IYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 I SG N N P + T+ YF+ GY T GK F PE Sbjct: 96 IQPWNSGVAGNEQDWRNAEPLQGKPTLPEYFRQQGYTTAAGGK-------VFHASHGGPE 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT--------WA------ 163 W G + + ++ NG+ + DL H + F W Sbjct: 149 GRLTGWHGGRRGFEQDSAWDVRFPGNGVQ-IPDLPV-HTGQNFNGLDIWHWDWGTVDVKP 206 Query: 164 -----HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 ++ N A +LQ+ + PF + V PH P+ P +Y + L E Sbjct: 207 EATDDGQVVNWAAQYLQR--KQPRPFFLTVGLYRPHAPWYVPRQYFAERPLSEVRLPEVK 264 Query: 219 QDDLANKPEHHRLWAQ-AMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINAL-TPEQR 274 +DDLA+ P + + + + D L+ + Y A F D +GRV++AL + + Sbjct: 265 EDDLADVPAAAKAYLNGGLHRKMLDRQLWGSAVRAYLASISFCDAMVGRVLDALESSPNK 324 Query: 275 ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLL 332 NT +++TSDHG +G + KG +++ +T +PL++ +P + Q VS +DL Sbjct: 325 TNTVIVFTSDHGLYLGEKQRWHKG-GLWERVTHVPLVVVAPGVTQPDTQSSQAVSLVDLY 383 Query: 333 PTMMALADIEKPEILPGENILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 PT+ L + KP+ L G +++ +++P + + VR TD ++ Sbjct: 384 PTLCELTGLPKPQSLDGISLVPLLRDPNASRTTPAVTAMGEGDKASYA-VR---TDRWRY 439 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 + S+ELYD ++DP+E NL A V+ + Sbjct: 440 IRYANGSEELYDHQSDPHEWTNLAGRTNLAAVQKDL 475 >UniRef50_UPI0001C36C38 arylsulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36C38 Length = 522 Score = 135 bits (340), Expect = 3e-30, Method: Compositional matrix adjust. 
Identities = 133/441 (30%), Positives = 202/441 (45%), Gaps = 47/441 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + TD Q + + + T N+D LAAEG + +AY+ +PVC AR + Sbjct: 1 MKKPNILLITTDQQRYDTICAMGYDFMETPNLDRLAAEGCCYPNAYSSNPVCMAARHQII 60 Query: 61 TGIYAN---------QSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD---GHDY 108 TG+ A +S P T ++ T+ + DAGY T IGK H H+ Sbjct: 61 TGLTARYHRFDDNYFESDPKTIPF----DLPTLPQLLSDAGYDTIAIGKMHFQPCRRHNG 116 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRN--GLNSVEDL--QANHIDETFTWAH 164 F E E + + Y L E ++ G+ + + Q + I E + Sbjct: 117 FTKMELMEEIPR--YLEDDEYAKYLKENGYGHLQSPHGVRHLLYMVPQRSLIPEEHHGST 174 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY--YELG--EKAQD 220 ++ R+V L++ PF + S+ PH PF P EK+AD Y EL ++++ Sbjct: 175 WVAKRSVYHLKENG-GKRPFFLWSSFIAPHPPFDVP----EKWADLYKGKELPPLKESKT 229 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPE-QRENTW 278 ++ E W + + + L LY+A FVD IG ++ L + +NT Sbjct: 230 PISGIAE----WKKYIADYPNESYLRRARELYYASISFVDYNIGTILQQLKDMGEYDNTL 285 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHI-DLLPTMM 336 +++TSDHGEM+G H K YD RIP I+R P Q + DT I D+LPT++ Sbjct: 286 ILFTSDHGEMLGDHGTFQKMLP-YDGSVRIPFIMRYPDQLKAGSEDTRFIDINDILPTVL 344 Query: 337 ALADI--EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 +A + PE LPGE+I AV + VE+ +EH S G + Sbjct: 345 DVAGVPCPNPERLPGESIFAVDGRKDRTVEY----VEH-SHGKLRWISLITKSYKYNYYY 399 Query: 395 LFTSDELYDRRNDPNEMHNLI 415 +EL+D NDP+E NL+ Sbjct: 400 GGGKEELFDLENDPDETTNLL 420 >UniRef50_A6DSH1 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH1_9BACT Length = 462 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. 
Identities = 118/454 (25%), Positives = 210/454 (46%), Gaps = 50/454 (11%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+M+D + + Y + T N+D L ++ + F+ AY+ P+C P+R + +G+Y Sbjct: 24 NVLFIMSDDLNVD-IASYGHPIVKTPNLDKLRSKSVLFSQAYSQYPLCNPSRNSILSGMY 82 Query: 65 ANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKW--HLDGHDYFG-----TG 112 SG +N AP +I+T+ FK GY GK H D + G TG Sbjct: 83 PGTSGCLSNADQLRKTAP--DITTLPEAFKKQGYEVISTGKIFHHEDPQSWTGITNLRTG 140 Query: 113 ECPPEW-DADYW---FDGANYLSE---LTEKEISL--WRNGLNSVEDLQANHIDETFTWA 163 + P+ D +++ FD + E LTE E+ WR+ + ED+ + +T Sbjct: 141 KLHPQGKDYNFYRPAFDERKTIGEGRNLTEGELGFMTWRS-VTEKEDILFDSRTARWTMQ 199 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 H L++ A ++PF + V + PH PF P + + Y +L E Q+ A Sbjct: 200 H---------LEKLAEDEKPFFLGVGFSRPHDPFFAPKRFFDMYPMESIKLPETPQN--A 248 Query: 224 NKP---EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWV 279 +K ++ ++ +A L Y+A ++D+Q+G V++ L NT V Sbjct: 249 SKVPMMAYYDVFKRAFDKMDTQKRLEFVRSYYASISYMDEQLGLVLDKLEALNLSNNTLV 308 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMA 337 ++ SDHG +G +K +++ R PL+I +P + +VD V ID+LPT+ Sbjct: 309 VFISDHGYQVGEKGYFNK-TLLFERSCRAPLMISNPKLKSSVNKVDKIVEFIDVLPTITE 367 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 + + P+ G +++ + +G VE+ I + + R T+ ++L+ Sbjct: 368 ITSVPTPKTAEGRSLIPLM--KGKKVEWKEEAISYVNAD-----RSIRTERYRLINWRGQ 420 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + LYD + DP E N +D+ + +V ++ L Sbjct: 421 KEALYDHQRDPGEHFNQVDNPEYKEVLKRLRSKL 454 >UniRef50_D0PR12 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR12_9SPHI Length = 519 Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust. 
Identities = 130/506 (25%), Positives = 220/506 (43%), Gaps = 90/506 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L +M D N VG G P T +D LAAE + F +A++ +P+C P+RA F Sbjct: 23 QKPNVLLIMVD-DLNNYVGFMGGHPQTKTPGMDKLAAESVIFTNAFSNNPICAPSRASFF 81 Query: 61 TGIYANQSGPW------TNNVAPGKNISTMGRYFKDAGYHTCYIGKW--HLDGHDYFGTG 112 +G+Y + S + N + G TM F + GY GK H D Y G Sbjct: 82 SGLYPHTSKNFGFKKFQKNEILMGS--KTMMELFMENGYKVAGTGKLMHHFDRKLYHQFG 139 Query: 113 ECPPEWDADYW---FDGANYLSELTEKE----ISLWRNGLNSVEDLQANHIDETFTWAHR 165 + +D+ F+G + + E + + D+ ++ W + Sbjct: 140 D-----KSDFGPFPFNGTKPVGHPSVPEPFRSVGTLDGSFAPLSDVPT--VNGNVGWFDK 192 Query: 166 ISN---------------------------RAVDFLQQPAR-ADEPFLMVVSYDEPHHPF 197 + N A+ FL+ + D+PF + V ++ PH P Sbjct: 193 VKNWPKYEVIPFKYNSDDDRDLMPDEVKAQDAIKFLKSFEKNTDQPFFLSVGFNRPHTPL 252 Query: 198 TCPVEYLEKY--------------ADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDD 243 P +Y ++ + Y++ +A+ + K H +A+ + S D Sbjct: 253 YVPEKYFNQFPLDQVKLPDTSSDVSKIYFD--TQAEGNSGQKGYTH--YAKLVESFGNDK 308 Query: 244 GL---YHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGAHKLISKGA 299 L + Y AC FVDDQI V+NAL ENT VI SDH +MG + + K A Sbjct: 309 DLALRKYIQAYLACVAFVDDQISEVLNALDNSNLAENTIVILVSDHAYVMGDKQYLYKNA 368 Query: 300 AMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPEI-------LPGEN 351 ++D+ TRIP++I+ PQ +++ +VD PVS ID+ PT++ + +E I L G + Sbjct: 369 -LWDEATRIPMLIKYPQQQKKHKVDHPVSLIDIYPTLIEMCGLEGTTIKNEKGKPLDGHS 427 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFG-GFIPVR---CWVTDDFKLVLNLFTSDELYDRRND 407 + + Y + + G G + R T +++ + +ELYD + D Sbjct: 428 LYPFVINKKNDYTGKEYALSTVTGGMGDVVKRNHFSIRTKEYRYIRYANNKEELYDHKKD 487 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLD 433 P E +N+ D ++ +++++ LLD Sbjct: 488 PFEKNNVAKDKKYKAIKAEL-SMLLD 512 >UniRef50_C9L086 Mucin-desulfating sulfatase n=54 Tax=Bacteria RepID=C9L086_9BACE Length = 527 Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust. 
Identities = 129/520 (24%), Positives = 221/520 (42%), Gaps = 118/520 (22%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N +++MTD M+ CY + + T N+D +AA+G+RF ++ + + P+RA + TG + Sbjct: 37 NIVYIMTDDHTAQMMSCYDTRYMETPNLDRIAADGVRFTQSFVANSLSGPSRACMITGKH 96 Query: 65 ANQSGPWTNNVAP-GKNISTMGRYFKDAGYHTCYIGKWHLD----GHDYF----GTGECP 115 + + + N + T + + AGY T +GKWHL+ G DY+ G G+ Sbjct: 97 SCANKFYDNTTCVFDSSQQTFPKLLQKAGYQTALVGKWHLESLPSGFDYWEIVPGQGDY- 155 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 ++ D+ +T+K ++ ++G + + I++ A+D+++ Sbjct: 156 --YNPDF----------ITQKNDTIQKHGYIT----------------NLITDNAIDWME 187 Query: 176 QPARADEPFLMVVSYDEPHH---------------PFTCPVEYLEKY------------- 207 ++PF +++ + H F P + + Y Sbjct: 188 HKRNPEKPFCLLIHHKAIHRNWMADTCNLALYEDKTFPLPDNFFDDYEGRPAAAAQEMSV 247 Query: 208 ---ADFYYELGEKAQDD-----------LANKPEHHRLWAQAMPSPVGDDGLYHHPL--- 250 D Y+L D L E R +P+ +D Y L Sbjct: 248 VKDMDMIYDLKMLRSDKNSRLKSLYEKFLGRMDEGQRAAWDKFYAPIIED-FYKQNLQGK 306 Query: 251 -------------YFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLIS 296 Y +DD +GRV++ L + +NT V+YTSD G MG H Sbjct: 307 ELANWKFQRYMRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWFD 366 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPTMMALADIEKPEILPGENILA 354 K MY++ R PLI+R P+G R+ D V +ID PT + LA +E P + G +++ Sbjct: 367 K-RFMYEESMRTPLIMRLPKGFDRRGDITEMVQNIDYAPTFLELAGVEIPSDIQGVSLVP 425 Query: 355 V---KEP----RGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD----ELYD 403 + K+P + + F Y EH VR T+ +KL+ F +D ELYD Sbjct: 426 LLKGKQPENWRKALYYHFYEYPAEH-MVKRHYGVR---TERYKLIH--FYNDINWWELYD 479 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 + DP+EMHNL + V +++ + L ++ DP R Sbjct: 480 MKTDPSEMHNLYGQPEYESVVNELKEELQKLQEQYNDPVR 519 >UniRef50_A6DPD0 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPD0_9BACT Length = 471 Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust. 
Identities = 127/463 (27%), Positives = 214/463 (46%), Gaps = 47/463 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N LF++ D + GCY K + + NID LA+EG F+ AY PVC +RA + T Sbjct: 24 EKNNVLFIIVDDLRPEL-GCYGNKQVLSPNIDRLASEGTLFSKAYCNVPVCGASRASVMT 82 Query: 62 GIYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + N K + + F+ GY T IGK + + +DY + W Sbjct: 83 GLRPTKDRFISYNAKAYKESGGVLDLAGIFQKNGYTTISIGKVYHERNDYRSS------W 136 Query: 119 DADYWFDGANYLSELTEKEISLWRN----GLNSVEDL-----QANHIDETFTWAHRISNR 169 D F + ++ + ++ L N G S E L A+ DE + + +++++ Sbjct: 137 D----FKDSPLITSPSMRDYHLPENQAGRGKYSFEALGTACEAADEPDEKY-FTYQLADA 191 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEY--LEKYADFYYELGEKAQDDLANKPE 227 A+D++ + + ++P+ + V + +PH PF P +Y L K +DF + + Sbjct: 192 AIDYIDKTEKKNKPWFLAVGFTKPHLPFVAPKKYWDLYKRSDFKLASNPNMPKNAPTQAS 251 Query: 228 HH----RLWAQAMP--SPVGDD-GLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWV 279 H R +P PV DD L Y+AC F D IGR+++ L T R+NT V Sbjct: 252 HQWHELRKMYNDIPQTGPVPDDKALELKHGYYACVSFTDAMIGRILDYLDTNNLRKNTTV 311 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPTMMA 337 I DHG +G H L K A ++ PLI+ S G+ Q + V +D+ P++ Sbjct: 312 ILWGDHGWQLGEHGLWCKHAN-FETSLNTPLIV-SAAGQNAQGPSKALVEFVDIYPSLCD 369 Query: 338 LADIEKPEILPGEN---ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 LA KP L G++ +L + F+RY G I ++ +++ N Sbjct: 370 LAGFTKPPHLQGKSFAPLLKKPNTKWKSAVFSRYHA-----GDSIHTNRFLYTEWRNKSN 424 Query: 395 L-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 T+ LYD + DP+E N+ + +A++ K+ L ++D Sbjct: 425 GNITARMLYDHQRDPDENFNIAANPEYAELVKKLSKRLQAHID 467 >UniRef50_C0W1U3 Sulfatase n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W1U3_9ACTO Length = 482 Score = 134 bits (338), Expect = 6e-30, Method: Compositional matrix adjust. 
Identities = 132/459 (28%), Positives = 201/459 (43%), Gaps = 73/459 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PNF+ +TD Q + + L T N+D L+ E F++ Y SPVC+PAR L T Sbjct: 3 KQPNFVIFVTDDQGPWATSEHWPE-LQTPNLDQLSKESSTFSNYYCASPVCSPARGTLLT 61 Query: 62 GIYANQSG--PWTNNVAPGKN-----------ISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 G + G W + G++ I T+ D GY+ +GKWH+ Sbjct: 62 GRMPSAHGIHDW---LVGGRHPDALEEPFLDGIITLPEVLDDNGYYCGMVGKWHV----- 113 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 GT + P YW+ A+ +W D N E + I+ Sbjct: 114 -GTSQTPAP-GFSYWY--AHRYGGGPYYNAPIW--------DENGNEATEPKYFTDAIAE 161 Query: 169 RAVDFLQQPARADE--PFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQ----- 219 A DF+Q A +E PF ++V++ PH P+ P E ++ YAD + + + Sbjct: 162 NACDFIQSAASVNEEKPFFLMVNFTAPHSPWINNHPQELMDLYADTDFPSIPREEPHPWT 221 Query: 220 ---DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RE 275 DD A+ +A +PS G Y A VD+ +G ++ AL + Sbjct: 222 KYYDDFADA------FADPVPSLRG---------YAASLTGVDNAVGDILKALEENAYAD 266 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMY-----DDITRIPLIIRSP-QGERRQVDTPVSHI 329 NT V+Y SD+G G H + KG + ++ R+P II P Q E R+VD VS Sbjct: 267 NTVVMYMSDNGFSCGQHGIWGKGNGTFPLNFWENSVRVPFIIHLPGQHEYRKVDDHVSAC 326 Query: 330 DLLPTMMALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 T+ LA+I PE L G +A RG + + + + D +GG +R D Sbjct: 327 SFFETVCELAEITPPEDPLRGARSIA-DLARGEIRDSDEPVMVFDEYGGGRMIR---YGD 382 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 K + ELYD +NDP E++NL+ D + VR ++ Sbjct: 383 LKFIDRFDGPQELYDLKNDPAELNNLVHDESYEKVRDEL 421 >UniRef50_D2QWC7 Sulfatase n=5 Tax=Bacteria RepID=D2QWC7_9PLAN Length = 490 Score = 134 bits (338), Expect = 6e-30, Method: Compositional matrix adjust. 
Identities = 125/477 (26%), Positives = 207/477 (43%), Gaps = 57/477 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+P + ++ N +GCY + + ID LAA G RF+ AY P+C P+R+ T Sbjct: 32 KKPYNVLLIASDDLNNSLGCYGHATVKSPRIDELAARGTRFDRAYCQFPLCNPSRSSFLT 91 Query: 62 GIYANQSGPWTNNV---APGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGEC--P 115 G+ +Q+ N + +I T+ + F +AGY+ +GK +H GT Sbjct: 92 GLRPDQTTVHDNARKFRSERPDIVTLPQMFMNAGYYVARVGKLYHYGVPLQIGTSGLDDE 151 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVED-LQANHIDETFTWAHRISNRAVDFL 174 P W G + E K SL L A D+ T A + A+ L Sbjct: 152 PSWQQVVNPRGRDRDDE--PKIFSLVPGQFGGTPSWLAAEGTDDEQTDAIGAAE-AIKLL 208 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY-ADFYYELGEKAQDDLANKPEHHRLWA 233 + A ++PF + V + PH P+ P Y EKY AD + + PE R Sbjct: 209 E--ANKEKPFFLAVGFYRPHTPYVAPKSYFEKYPAD---------KIPIVTTPEGDR--- 254 Query: 234 QAMPSPVGDDGLYHHPL-----------YFACNDFVDDQIGRVINAL-TPEQRENTWVIY 281 + +P P H + YFA F+D Q+G++++AL + R+NT V++ Sbjct: 255 RDIPEPAVSQHSARHNMNEKLQREATQAYFASITFMDQQVGKLLDALDRLKLRDNTIVVF 314 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSH-IDLLPTMMALA 339 SDHG +G H + + +++++ R+PLII P Q + V+ ID+ PT+ L Sbjct: 315 LSDHGYHLGEHGGLWQKQSLFEESARVPLIISVPGQKHAGEGTAAVAELIDIYPTLADLC 374 Query: 340 DIEKPEILPGENIL-AVKEPRGVMVEFNRYEIEHDS-------------FGGFIPVRCWV 385 ++ P LPG+++ +++P+ F ++ GGF Sbjct: 375 GLKAPANLPGQSLRPQIEDPQAPGKGFAITQVRRGGNPGGAKAGKKNPPAGGFAGY-SLR 433 Query: 386 TDDFKLVL---NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 TD ++L + ELYD + DP E NL D A+ +++ L ++ + Sbjct: 434 TDKYRLTIWGEEGAKGLELYDHQTDPQEYTNLASDPSKAETITELKALLAKHLSAAK 490 >UniRef50_A6DNI8 Putative N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNI8_9BACT Length = 705 Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust. 
Identities = 124/466 (26%), Positives = 201/466 (43%), Gaps = 63/466 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K PN +F++TD Q + +G P L T NID + EG+ F +++ +C PARAG Sbjct: 21 KGPNIIFILTDDQKYDAMGFMGHYPFLKTPNIDRIRNEGVHFKNSFVTLSMCAPARAGFL 80 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRY---FKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 TG Y +G TN N + + + AGY T + GKWHLD + P Sbjct: 81 TGTYPQVNGVCTNVEGREFNQNKTPSFPLLLQRAGYETGFFGKWHLDHSN-------KPR 133 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH--IDETFTWAHRISNRAVDFLQ 175 D W +S G + DL + + +++ A+DF+ Sbjct: 134 LGFDRW--------------VSFSGQGKYNGNDLNIDGKLVHNPGYITDELTDYALDFID 179 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE-------- 227 + +D+PF + +S+ H PFT + Y E D+L +KP+ Sbjct: 180 K--NSDKPFCVYLSHKAVHQPFTPAKRHSSLYKGETVPKKESFFDNLKDKPKWQRVNLPP 237 Query: 228 ----------HHRLWAQAMPSP-VGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQ-R 274 H A P P ++G + H Y VD+ IG++ L ++ Sbjct: 238 EKLYRLRYNNTHETPAVKTPRPYTKENGSHPHTKDYLRAIAAVDEGIGKIYALLENKKIL 297 Query: 275 ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLL 332 +NT +I+ D+G ++G H+ K Y++ RIPLI+R P +D V +ID+ Sbjct: 298 DNTVIIFAGDNGYLLGEHQRGDK-RVHYNESMRIPLIMRYPAKIPADSTLDQMVLNIDVA 356 Query: 333 PTMMALADIEKPEILPGENILAV-----KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 PT++ +A ++ PEI+ GE+ + + K P F + + + VR TD Sbjct: 357 PTILDIAGVKAPEIMQGESCMPLFDKSKKTPWRDAYLFTYWRDLIPTLPRIVAVR---TD 413 Query: 388 DFKLVL--NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + ++ +ELYD NDP+EM NL A++ M + Sbjct: 414 RYVYTTYPDIDDVNELYDLENDPHEMRNLATSPEHAEIVKAMEQKI 459 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust. 
Identities = 117/378 (30%), Positives = 166/378 (43%), Gaps = 88/378 (23%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L + TD Q T V CY K L+T NID LA EG+ F+ Y +PVC+P+RA L T Sbjct: 26 KRPNVLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLT 85 Query: 62 GIYANQSGPWTNNV--------APGKNISTMGRYFKDAGYHTCYIGKWHL---------- 103 G Y Q NN PG TM FKD GY T +IGKWH+ Sbjct: 86 GRYP-QRAQLDNNAPSEEGHAGMPGSQY-TMAEMFKDGGYTTAHIGKWHIGYSPETMPNQ 143 Query: 104 DGHDY-FG-TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 G DY FG G C + +++ G N LWRNG ED + Sbjct: 144 QGFDYSFGFMGGCIDNYSHYFYWAGPN--------RHDLWRNGQEIWEDGK--------F 187 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 +A FL++ RAD+PF + + + PH+ P++ EK+ +Y +L Sbjct: 188 FADLTVQEVNGFLEKNKRADKPFFLYWAINMPHY----PLQGQEKWRQYYKDL------- 236 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVI 280 P R++A A+ + +D++IG+V+ L ENT V+ Sbjct: 237 ----PAPRRMYAAAVST-------------------MDEKIGQVLQQLDRLGLAENTIVV 273 Query: 281 YTSDHGEMM-------GAHKLISKGA--AMYDDITRIPLIIR----SPQGERRQVDTPVS 327 + SD G G +GA ++++ R+P IIR P+ E R D Sbjct: 274 FQSDQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNEVR--DQLCV 331 Query: 328 HIDLLPTMMALADIEKPE 345 +ID PT+ L + P+ Sbjct: 332 NIDWYPTLAGLCKVALPQ 349 >UniRef50_B6GZC3 Pc12g01800 protein n=14 Tax=cellular organisms RepID=B6GZC3_PENCW Length = 516 Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 139/535 (25%), Positives = 214/535 (40%), Gaps = 122/535 (22%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F+M D A+ + CY +T N+D LA EG+RFN Y + +CTP+RA + TG Sbjct: 6 RPNIIFIMADDHASKSMSCYGAGINHTPNLDRLATEGMRFNHCYVTNSICTPSRAAILTG 65 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL-DGHDYFGTGECPPEWDAD 121 + + +G T + K++ + ++ + GY T +GKWHL +G D+ TG D Sbjct: 66 THNHVNGVMTLDSKINKHVPNVAKHLRSGGYQTAMVGKWHLGEGKDHEPTG-------FD 118 Query: 122 YWF----DGANYLSELTEKEISLWRNGLNSVEDLQANHI-DETFTWAHRISNRAVDFLQQ 176 YW G + + E G++ + I D++ W I NR V Sbjct: 119 YWSVVPGQGEYHDPQFIGPE------GISRESGYATDIITDKSLNW---IRNRDV----- 164 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE---LGEKAQDDLANKPEHHRLWA 233 D PF ++ + PH + EY K+ D Y E L + DD N+ + ++ Sbjct: 165 ----DRPFFLMCHHKAPHRSW----EYDPKHHDLYTEPVRLPDTFTDDYKNRAKAAKVAK 216 Query: 234 Q-----------AMPSPVG----------DDGLYHHPLYFACND-----FVDDQIGRVIN 267 + P G D +H A D +D Q G V Sbjct: 217 MRVAEDLTYMDLGLAQPDGGSDVVGPKMIDAWWWHDRKVPAPEDVTKLRLIDKQDGTVFT 276 Query: 268 ALTPEQ------------------------------------RENTWVIYTSDHGEMMGA 291 TPEQ ENT VIYTSD G +G Sbjct: 277 FKTPEQLAEFKFQRYMQRYLRTIQSIDDNVGRTLDFLDAEGLAENTIVIYTSDQGFFLGE 336 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 H K MY++ ++P +IR P+ D + ++D PT + A + P + G Sbjct: 337 HGWFDK-RFMYEESFQMPFLIRYPREIAAGTVCDDIICNVDFAPTFLDFAQLRIPTYMQG 395 Query: 350 ---ENILAVKEPRG-VMVEFNRY----EIEHDSFGGF------IPVRCWVTDDFKLVLNL 395 +L K P V ++RY ++ H+++ + + W +DF L Sbjct: 396 VTFRELLRHKTPSDWQQVAYHRYWMHRDVIHEAYAHYGVRDQRYKLIYWYNEDFGLEGTR 455 Query: 396 FTSD----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD-PFRSY 445 + EL+D DP E+ N + + DV KM L D M +I D P +Y Sbjct: 456 PGGEEKEWELFDCDKDPLELFNCWHEPEYRDVVEKMTKLLEDKMQEIGDEPVHTY 510 >UniRef50_A6DGT7 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGT7_9BACT Length = 504 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 127/478 (26%), Positives = 208/478 (43%), Gaps = 58/478 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L + D M+G Y + + ID LA + AY VC +RA + TG Sbjct: 19 RPNILIISVD-DLKPMLGTYGDPLVQSPTIDKLAEASALYEKAYCQQAVCGASRASIMTG 77 Query: 63 IYANQSGPWT-NNVAPGKN--ISTMGRYFKDAGYHTCYIGKWH-----LDGHDY----FG 110 + + S W V +N T+ YFK GY TC+ GK DG + Sbjct: 78 LRPDNSRVWEFRQVMRERNPQAITIPEYFKSQGYMTCFAGKIFDYRCVADGKKQDLKSWS 137 Query: 111 TGECPPEWDA--DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI------------ 156 E P +A + F + +L KEI L +NG + D I Sbjct: 138 RPEQPRNSEAMKNLGFADPAFREKLRLKEIELKKNGQKASYDAIKKAIGGSPCYEDSIDG 197 Query: 157 -DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 DE + I+ V +++ + +PF + V + +PH PF P +Y + Y + + L Sbjct: 198 PDEIYEDGM-IAREGVRLIKELGQKKKPFFIAVGFKKPHLPFNAPKKYWDLYKETDFAL- 255 Query: 216 EKAQDDLANKPE--HHRLWAQA---MPSPVGD------DGLYHHPLYFACNDFVDDQIGR 264 EK Q + P + W + +P G+ L H Y AC +VD QI + Sbjct: 256 EKYQKPVQGAPHYAYQNSWEFSGYNVPRINGEVLESFQRKLKHA--YAACISYVDAQIAK 313 Query: 265 VINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQ 321 ++ L + E NT +++ SDHG +G H + K + Y+ TR+P + P+ ++ + Sbjct: 314 LLKTLKDQGLEKNTVIVFWSDHGFHLGDHGMWCKHSN-YEQATRVPFFVYDPRQNLKKGR 372 Query: 322 VDTPVSHIDLLPTMMALADIEKPEILPGENIL--AVKEPRGVMVEFNRYEIEHDSFGG-- 377 PV ID+ PT+ L+ + PEIL G+++L A + + + +F R + ++ G Sbjct: 373 YTQPVELIDMFPTLCQLSGLAIPEILDGKSLLSEAAENAKFALSQFPRNQGKNKKIMGYG 432 Query: 378 --FIPVRC--WVTDDFK---LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 F R WV ++++ L + ELYD DP E NL ++ + + ++ Sbjct: 433 FRFERYRYIEWVDNNYQQDNTQLGPLKAVELYDYEKDPLEQVNLANNPEYKSILRRLQ 490 >UniRef50_B3C5J5 Putative uncharacterized protein n=7 Tax=Bacteroidales RepID=B3C5J5_9BACE Length = 472 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 124/463 (26%), Positives = 196/463 (42%), Gaps = 78/463 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN +++ TD Q + + C L+T N+D LAA GI FN+AY +P+ P+R +FT Sbjct: 30 ERPNIIYIFTDQQTASAMSCAGNPDLHTPNLDRLAAAGIMFNNAYCTAPLSGPSRGAMFT 89 Query: 62 GIYANQSGPWTNN--VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G Y + G N + T+G K+AGY Y GKWHL D P D Sbjct: 90 GHYPDAVGLSVNGSPMPDSLRAQTLGTLIKNAGYDCAYGGKWHLPLLD------VP---D 140 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 +Y FD N +D A E + H+ Sbjct: 141 KEYGFD-----------------NIYKHSDDGLAEACAEYLSRKHK-------------- 169 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP------------- 226 +PF +V SYD PH+ C EY Y + D P Sbjct: 170 --KPFFLVASYDNPHN--IC--EYARSQNFPYGNIDTPDIRDCPGVPANFAKNPYDADVI 223 Query: 227 -----EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVI 280 ++ ++ A +P +D + Y+ + VD +IG++I+A+ +NT VI Sbjct: 224 ESERANNYNVYPTAGFTP--EDWRMYRYTYYRLVEKVDKEIGKIIDAIDKNDLWKNTVVI 281 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP---VSHIDLLPTMMA 337 ++SDHG+ +GAH K +A+Y+++ IPLI+ P + P + ID ++ Sbjct: 282 FSSDHGDGIGAHHWNQK-SALYEEVINIPLIVTLPGKKNAGKVLPQLISNGIDFFASVCD 340 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV--TDDFKLVL-- 393 A + PE G++ + E Y I F G R WV ++ +K VL Sbjct: 341 WAGAKMPEGAAGKSFRKIVEEGNPQALHQEYIITETRFDG-SKTRGWVVRSERYKYVLYD 399 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 ++L+D +ND EM NL+ + + ++ D L +M+ Sbjct: 400 KGRHREQLFDMQNDRGEMRNLVMENAYNQELQRLRDVLEKWMN 442 >UniRef50_A6C8U0 Choline sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8U0_9PLAN Length = 479 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 124/475 (26%), Positives = 202/475 (42%), Gaps = 73/475 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F+++D Q + + + T ++D L G F A +P+CTP+RA + +G Sbjct: 33 QPNIVFLLSDDQRPDTIAALGNPIIKTPHLDQLVKAGTSFTRAVCANPICTPSRAEILSG 92 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF-GTGE-------- 113 + +G K + T + AGY+T Y+GKWH DG G E Sbjct: 93 VSGFHNGSMDFGKPIKKELPTWSQTLSKAGYNTWYVGKWHNDGKPVLRGYDETLGLFTGG 152 Query: 114 ----CPPEWDADY--------WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 P +D + W + EK + L N I E F Sbjct: 153 GGRWAVPSYDGNGVLVTGYRGWIFQDDERHFFPEKGVGLTSN------------ISEHF- 199 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--------ADFY-- 211 ++ A++F+++ + +PF + V + PH P P+ Y + Y A+F Sbjct: 200 -----ADAAIEFVER--KHQKPFFLHVCFTAPHDPLLMPIGYEQNYDPDQMPVPANFLPQ 252 Query: 212 --YELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 ++ G D A P W + D LY ++ +D Q+GR++ AL Sbjct: 253 HPFDHGNFDGRDEALLP-----WPRTKEIVKNDLSLY-----YSVISHLDAQVGRIVKAL 302 Query: 270 TPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH 328 + ENT +I++SDHG MG+H L K MY+ +PLI+ P + + Sbjct: 303 KKTGEWENTILIFSSDHGLAMGSHGLRGK-QNMYEHTVNVPLIMVGPGIPADTLSNAQCY 361 Query: 329 I-DLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 + DL PT LA + P+ + G+++ V + V Y+ + F F R TD Sbjct: 362 LRDLYPTSCDLAGVPIPKTVEGKSLKPVLSGQLDAV----YDEVYCYFRNF--QRMIRTD 415 Query: 388 DFKLVLN-LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +KL+ +L+D +NDP E H+L + VR K+ D L D+ + DP Sbjct: 416 RWKLIYYPHLDRVQLFDLKNDPLEQHDLSGEAALQQVRGKLLDQLNDWRKQQNDP 470 >UniRef50_B4D6H3 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D6H3_9BACT Length = 525 Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 119/480 (24%), Positives = 199/480 (41%), Gaps = 54/480 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+MTD Q + VG + T N+D LAA G F++ Y SPVC P+R FT Sbjct: 30 RRPNILFIMTDQQRWDCVGANGNTIIKTPNMDRLAARGANFSNVYVASPVCVPSRISFFT 89 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----------DGHDYFGT 111 G YA+ N + + K+AGY T +GK H G D Sbjct: 90 GRYAHSHRNRVNYTPLDASEVLLQARLKEAGYRTASVGKLHYFPPTVEHAKSTGFDIVEL 149 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVE---DLQANHIDETFTWAHRISN 168 + P D W D + K+ +R ++E + ID +T Sbjct: 150 HDGVPF--TDKWSDYVKWRQANDPKKDIYYRATAKNIEPGKNPNRAAIDTQYTDTTWTGE 207 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-DDLANKP- 226 R +L + A+ +PF + VS+ +PH P+ Y Y D + E +DLA+ P Sbjct: 208 RGRYWLTELAKGQQPFFLYVSFWKPHSPYEIGPPYDSMYDDANIPIPETVTANDLASMPL 267 Query: 227 --------EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENT 277 E+ +W Q + + + Y+ VD +IG ++ AL Q +NT Sbjct: 268 PLQKLSLRENPNVWKQTQ-----ERVEWMYRSYYGAISHVDHEIGLLLEALEASGQAQNT 322 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMM 336 ++++SDHG+ + H++ K ++ ++PL++ P + + D + +DL+PT++ Sbjct: 323 LIVFSSDHGDQLMEHRIYGKN-CFFEPSVKVPLMVSLPGRIKPAHYDQLMETVDLVPTLL 381 Query: 337 ALADIEKPEILPGENILAVKEPRGV----------------MVEFNRYEIEHDSFGGFIP 380 + +P + G + + G ++ + ++ + G Sbjct: 382 DFIGLPEPREVQGRSFAPLIADLGRPYTPHDAVFSENIIPEVITSGKMDLPFEKGKGVDG 441 Query: 381 VR-----CWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 VR TD +K ELYD + DP E NL V +M LL+++ Sbjct: 442 VRHPDAKMVRTDRWKYCYYPEGYAELYDLQKDPGERTNLAGRPENHAVEEEMRTRLLNWL 501 >UniRef50_UPI00016C0ED5 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0ED5 Length = 490 Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 124/510 (24%), Positives = 215/510 (42%), Gaps = 110/510 (21%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN +F+MTD A++ + CY K T N+D +A EG+RF++ + + +C P+RA + Sbjct: 1 MNQPNIVFIMTDDHASHSMSCYGSKINVTPNMDRIANEGMRFDNCFCTNSICAPSRAVIL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL--------DGHDYFGTG 112 TG +++ +G T N T + + AGY T IGKWHL G DY+ Sbjct: 61 TGKHSHLNGVITLNDEFDGRQQTFPKLLQKAGYQTGIIGKWHLGEGGYADPTGFDYY--- 117 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 C +Y+ + + G + + A I I++ +++ Sbjct: 118 -CVLHGQGEYF-------------DPKMREQGEDKIFKGYATDI---------ITDMSLN 154 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK------- 225 F++ ++ +PF+++ + PH P+T +Y + Y D E DD A + Sbjct: 155 FIKDRDKS-KPFMLMCHHKAPHRPWTVSAKYADLYKDEEILQPETFDDDYATRCDAAREA 213 Query: 226 ----------------PEHHRLWAQAMPSPVGDDGLYHHPL------------------- 250 P + +P P +G P Sbjct: 214 EMXXDNDFMYRDLKLVPPPTKRPMDKIPPPDSLEGYTLTPEETGVPVSFSSYAELKNFKY 273 Query: 251 ------YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYD 303 Y C VDD IG++++ L E E+T V+YTSD G +G H K MY+ Sbjct: 274 QRYIKDYLRCVASVDDGIGQLLDCLNEEGIAEDTIVVYTSDQGFFLGDHGWYDK-RFMYE 332 Query: 304 DITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKE---- 357 + R+P +I+ P+ + R D + ++D T + A + P+ + G++ + E Sbjct: 333 ESLRMPFVIKYPRAIKAGRVSDKMILNLDFAETFLDFAGVPIPDDMQGKSFRRILEDENA 392 Query: 358 PRGVMVEFNRY--EIEHDSFGGFIPVRCWVTDDFKLVL-------NLFTSDE-------L 401 P + RY + H + +R T D+KL+ T DE L Sbjct: 393 PAIQTAMYYRYWMHLAHHNIWSHYGIR---TLDYKLIYYYAQALGKSGTIDEYREAAWEL 449 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 +D + DP+E++N+ D+ +AD+ K+ D + Sbjct: 450 FDLKKDPHELNNVYDNPEYADLIVKLKDEM 479 >UniRef50_B9XEU8 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XEU8_9BACT Length = 456 Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 125/445 (28%), Positives = 199/445 (44%), Gaps = 58/445 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D + GC + T NID +A EG F + + P+C+P+R TG Sbjct: 17 QPNIVFILVDDIRWDAFGCMGHPFVKTPNIDRIAKEGALFKNFFVTLPLCSPSRGSFLTG 76 Query: 63 IYANQSGPWTN--NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 YA+ +G N + + T R DAGY T ++GKWH+ D P Sbjct: 77 QYAHVNGVTNNGEHSTLSHQLVTFPRLLHDAGYETSFVGKWHMGTDD-------TPRPGF 129 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W L+ K ++ N N D + + ++ T +++RAV+F++Q + Sbjct: 130 DHW---------LSFKGQGVYENP-NLNIDGKVSRVEGYIT--DILNSRAVEFVKQEHK- 176 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA----- 235 +PF + V + H PFT + E Y DDLA KP R Q Sbjct: 177 -KPFCLYVGHKAVHGPFTPAERHKELYTKEQIPHPPSIDDDLAGKPVLTRKEQQGPKDGQ 235 Query: 236 MPSPVGDDGLYHHPLYFACNDFV----------DDQIGRVINAL-TPEQRENTWVIYTSD 284 P VG D P+ V D+ +G+++ AL Q ENT +I+TSD Sbjct: 236 KPQKVGFDDEAERPMGKVPERLVRQQLRTLMAIDEGVGQLLRALEESRQLENTVIIFTSD 295 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ----GERRQVDTPVSHIDLLPTMMALA- 339 +G G H L K A Y++ R PL+IR P+ G R D V +ID+ PT++ LA Sbjct: 296 NGYFWGEHHLGDKRWA-YEESIRDPLLIRYPKLIKPGTVR--DQMVLNIDIAPTLLELAH 352 Query: 340 -----DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-- 392 ++ ++P N +V+ + + E+ + ++ + T+ +K + Sbjct: 353 APVSRSMQGRSLVPLFNKDSVEWRKSALFEY----FQEKAYPRTPTWQAIRTEQWKYIHY 408 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDD 417 L DELY+ + D EM NLI + Sbjct: 409 TELEGMDELYNLKADSYEMKNLIKE 433 >UniRef50_C6VXD1 Sulfatase n=4 Tax=Bacteria RepID=C6VXD1_DYAFD Length = 474 Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 119/434 (27%), Positives = 197/434 (45%), Gaps = 26/434 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N LF+ D N +G Y + + NID LA G+RF+ AYT P+C+P+R+ L T Sbjct: 30 KKFNVLFIAVD-DLNNDLGTYGNTFVKSPNIDRLAKRGVRFDKAYTQFPLCSPSRSSLLT 88 Query: 62 GIYANQSGPWTNNVAPGKN---ISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGEC--P 115 G + + + KN I T+ + FK+ Y++ +GK +H GT P Sbjct: 89 GQRPDMTKIYELQTHFRKNLPDIVTLPQLFKNNNYYSARVGKIFHYGVPSQIGTDGLDDP 148 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W G + E K ++ R GL S +A + I++ A+ + Sbjct: 149 ESWSYRVNPKGRDKTEEPLIKNLTPDR-GLGSALAWRATEGTDDEQTDGLIASEAIKIMT 207 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + +EPF + V + PH P+ P +Y + Y L ++ +DL + PE Sbjct: 208 E--KKNEPFFLAVGFFRPHTPYVAPQKYFDMYPVDKVPLPKEIPNDLDDVPEAALF---T 262 Query: 236 MPSPVG-DDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 P G D+ L Y+A F+D Q+G++I+AL + ENT ++ SDHG +G Sbjct: 263 KPPHWGLDEAKRREALRAYYATITFMDAQVGKLIDALDKLKLAENTIIVLWSDHGYNVGQ 322 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMALADIEKPEILPGE 350 H K +++++ R+PLII P G + + V +D+ PT+ L ++ + L G+ Sbjct: 323 HGQWMK-QSLFENSARVPLIISVPGGTKGKASGRTVELVDIFPTLAELCGLDPKQNLQGK 381 Query: 351 NIL-AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL--NLFTSDELYDRRND 407 ++ +K P + + ++ G R T+ F+ ELYD + D Sbjct: 382 SLTPLLKNPAAIWDKPAYTQVRRGQIFG----RSVRTERFRYTEWDGGNAGVELYDHQKD 437 Query: 408 PNEMHNLIDDIRFA 421 P E NL D F Sbjct: 438 PGEFTNLAKDNSFV 451 >UniRef50_B8KY63 N-sulphoglucosamine sulphohydrolase n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KY63_9GAMM Length = 537 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. 
Identities = 132/506 (26%), Positives = 226/506 (44%), Gaps = 95/506 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q+ ++G Y + + T NID+LA++G RF A+ S VC+P RA L TG Sbjct: 22 RPNIILIVTDNQSDKLLGVYGNEDVRTPNIDALASQGTRFTRAFAASGVCSPTRASLLTG 81 Query: 63 IYANQSGPWTNNVAPG----------KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 + +Q G +N P + +M + +AGY T IGK+HL D Sbjct: 82 LMPSQHG--VHNGLPSVFDLEDYSAIEEFRSMPQTLSEAGYRTAMIGKYHLGVPD----- 134 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLN-SVEDLQANHIDETFTWAHRISNRAV 171 P+ DYW + + T ++ + NG V+D H+ + +T RAV Sbjct: 135 --SPQIGFDYWVVLPSGHTT-TFYDLEVIDNGQRYRVKD---EHLTDFWT------RRAV 182 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLE---KYADFY----------------- 211 DF+++ ++ PF + ++Y+ P+ PV + ++AD+Y Sbjct: 183 DFIEE-QDSNAPFFLYLAYNGPYG--LAPVVTRKPDNRHADYYRRHVPSFPQEPVHPFLR 239 Query: 212 -YELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGL-----YHHPLYFACND--------- 256 Y + + D + + +R W P +GD G Y A N+ Sbjct: 240 NYAIEASSGDHILVEQAANRDWLIEDPMELGDLGAKIEFSYAWQTIHALNNNEAMINLAS 299 Query: 257 ---FVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAA-----MYDDITR 307 +DD + V AL + ENT +++T+D G G H L +A + + Sbjct: 300 QVTMIDDGVAAVAEALQKKGLNENTVIVFTADQGAAFGQHGLWGNSSAASPFTAQREHLQ 359 Query: 308 IPLIIRSP---QGERRQVDTPVSHIDLLPTMMALADIEKPEIL--PGENILAVKEPRGVM 362 +PLI+ P +G + ++ +D+ PT++ LA + EI PG + + RG Sbjct: 360 VPLIVNDPRVAEGPETSANI-INQVDIFPTVLELAGLGDIEIANSPGRSFAPLL--RGES 416 Query: 363 VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFA 421 E+ + ++ +I R VT ++K V +F ELY +DP E HN+I+D A Sbjct: 417 PEW-----DDAAYFEYITTRAIVTSEWKYVKRIFGQPSELYHLASDPGERHNIINDPDKA 471 Query: 422 D----VRSKMHDALLDYMDKIRDPFR 443 + ++ D ++ ++ DP+R Sbjct: 472 ATLVWLDGRLTDFFSEFSNEKYDPWR 497 >UniRef50_A4U8Q3 Sulfatase n=2 Tax=Bacteria RepID=A4U8Q3_9BACT Length = 556 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. 
Identities = 106/369 (28%), Positives = 162/369 (43%), Gaps = 42/369 (11%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L VM D A ++ Y G T N++ LA EG+ F +AY P+C PAR L +G Sbjct: 58 PNILLVMMDQLAPQVLKPYGGTVCRTPNLERLAGEGVVFENAYCNYPICAPARFSLMSGR 117 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG------TGECPP- 116 ++ G + N + T Y + GYHTC GK H G D T + P Sbjct: 118 MPSRIGAFDNATEFPSEVPTFAHYLRAMGYHTCLSGKMHFVGADQLHGFEDRVTTDVYPA 177 Query: 117 --EWDADY---------WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 W +D+ WF + + + R +N+ D +A E W H Sbjct: 178 DFSWTSDWSLGPTFWEPWFHSVRIVRDAGPR-----RRSVNTSYDEEATV--EACRWLHD 230 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 ++RA PF + S+ PH P+ P + + Y D + L + Sbjct: 231 HADRA---------DGRPFFLAASFISPHDPYLAPPSHWDLYTDDGIDDPRVGDIPLEER 281 Query: 226 -PEHHRLW---AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 P RL+ + + + D Y+A ++DD+IGR++ L + +NT V+ Sbjct: 282 DPHSRRLYYTIGRHIETIGPADVRRARRAYYAVMSWLDDRIGRILETLKAIDADDNTIVV 341 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALA 339 T+DHG+M+G L K ++ R+PLI+ +P R R+V VS +DL PT + A Sbjct: 342 LTADHGDMLGERGLWLK-MNFFEWSVRVPLIVHAPTLYRARRVRENVSLLDLFPTFLEWA 400 Query: 340 -DIEKPEIL 347 D E PE+ Sbjct: 401 GDGELPELF 409 >UniRef50_A3SJ21 Sulfatase n=1 Tax=Roseovarius nubinhibens ISM RepID=A3SJ21_9RHOB Length = 518 Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust. 
Identities = 125/453 (27%), Positives = 202/453 (44%), Gaps = 47/453 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +M D A + G Y + T ++D+LAA G+RF++AY +P+C P+R + Sbjct: 10 RRPNIVVIMADQLAPHFTGAYGHQVAKTPHMDALAARGMRFDAAYCNAPLCAPSRFAFMS 69 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG-------HDYFGTGEC 114 G ++ + N + T Y GY TC GK H G D T Sbjct: 70 GQLISRIAAYDNASEFRATVPTFAHYLSALGYRTCLSGKMHFVGPDQKHGFQDRVTTDIY 129 Query: 115 PPE--WDADYWFDGANYLSELTEKEISLWRNGLNSVED-------LQANHIDETFTWAHR 165 P + W D+ E ++ I W + + +V++ Q ++ DE A R Sbjct: 130 PSDFAWTPDW---------EAPDERIDKWYHNMQTVKESGCAIATFQTDYDDEVEFAARR 180 Query: 166 -ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + +RA D + A + P MV S+ PH P+ E+ + Y+D EL E LA+ Sbjct: 181 WLIDRARD---RAAGQEAPLCMVASFIHPHDPYVARPEWWDLYSDDEIELPEVLP--LAD 235 Query: 225 K-PEHHRLW--AQAMPSPVG-DDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWV 279 P RL +A P+ D+ + Y A + D +IG ++ L + +NT V Sbjct: 236 HDPFSRRLMDGIEASYVPLSRDEVIRARRAYLANVSYFDSKIGALVKTLDETGELDNTVV 295 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALA 339 I T+DHG+M+G L K ++ R+PLI+ P + S IDLLP+ + +A Sbjct: 296 IVTADHGDMLGERGLWYK-MNFFEHSARVPLIMAGPGVVQGAAANACSLIDLLPSFLEIA 354 Query: 340 DIEKP---EILPGENI--LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 ++ E + G ++ LA E + Y E ++ F+ R D K + Sbjct: 355 GADESVLGEPVDGRSLMPLARGEADPQDEAISEYCAEMTAWPVFMIRR----GDLKYIHC 410 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 +LYD DP E N ++D +A R++M Sbjct: 411 DGDPPQLYDLSVDPGERVNRVEDPDYA-CRARM 442 >UniRef50_A4AMS2 Choline sulfatase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AMS2_9FLAO Length = 503 Score = 131 bits (330), Expect = 5e-29, Method: Compositional matrix adjust. 
Identities = 126/478 (26%), Positives = 213/478 (44%), Gaps = 52/478 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSP----VCTPARA 57 K+PN + + D + K + T N+D L G F +AY VC +RA Sbjct: 26 KKPNIVLIFADDMTYTAINALGNKEIQTPNLDRLVKGGTTFKNAYNMGAWNGAVCVASRA 85 Query: 58 GLFTGIYANQSGPWTNNVAPGKNI-STMGRYFKDAGYHTCYIGKWHLDG------HDYFG 110 + +G + + N GK T G+ + AGY T GKWH+D + Sbjct: 86 MMISGRSVWNANNFRQNWLEGKEFDKTWGKLMESAGYDTYMTGKWHVDAPADSVFQNVTH 145 Query: 111 TGECPPEWDADYWFDGANY--LSEL-----TEKEI-SLWRNGLNSVEDLQANHIDETFT- 161 P WD+ W G ++E+ ++KEI ++ N + D N +D+ F Sbjct: 146 VRRGMP-WDS--WGHGGKIPAINEMIKEGKSKKEIRAIGYNRPLNENDTTWNPVDKKFGG 202 Query: 162 -------WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL 214 W+ + + AV F+ Q D PF M ++++ PH P P EY++ Y+ L Sbjct: 203 FWVGGKHWSEVLKDDAVGFIDQAKVKDNPFFMYLAFNAPHDPRQAPQEYVDMYSLDKISL 262 Query: 215 GEK------AQDDLANKPEHHRLWAQAM-PSPVGDDGLYHH-PLYFACNDFVDDQIGRVI 266 + +D + N P L +A+ P P + H Y+A +D+QIG ++ Sbjct: 263 PKSWMPMYPYKDSIGNGP---GLRDEALAPFPRTEYATKKHIQEYYALISHMDNQIGEIL 319 Query: 267 NALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDT 324 +AL + ENT+VI+T+DHG +G H L+ K + +D R P +I P + +D Sbjct: 320 DALENSGKMENTYVIFTADHGLAIGKHGLLGK-QSQFDHSIRPPFMIVGPDIPKDASIDK 378 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV-RC 383 + D + T + LA IEKP+ + +I + +G E + EI +GG+ R Sbjct: 379 DIYLQDAMATSLDLAGIEKPDYVFFNSIKDL--AKGERKESHYKEI----YGGYTTTQRM 432 Query: 384 WVTDDFKLVLN-LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 D +KL++ L++ DP E+++L ++ + + + LL D++ D Sbjct: 433 IRKDGYKLIVYPKLKKVLLFNMETDPEEINDLSENPEYQGKINTLFKELLVLQDELND 490 >UniRef50_C9L4R3 N-acetylglucosamine-6-sulfatase n=2 Tax=Blautia hansenii DSM 20583 RepID=C9L4R3_RUMHA Length = 469 Score = 131 bits (330), Expect = 5e-29, Method: Compositional matrix adjust. 
Identities = 109/371 (29%), Positives = 165/371 (44%), Gaps = 47/371 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L+V D +G + T ++DS A E + A + P+C+P RA LFT Sbjct: 13 KRPNLLYVFLDQWRYQAMGYAKSDEVCTPHMDSFARESLDCTEAVSTFPLCSPHRASLFT 72 Query: 62 GIYANQSGPWTNNVAPGKNI-------STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G Y G WTN + + +G KD GYHT YIGKWHLD + E Sbjct: 73 GKYPFSVGMWTNCKIGLSEVLMLKPQETCIGNVLKDTGYHTGYIGKWHLDASE--QNFEK 130 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDET------FTWAHRI-S 167 P A +W D E + W + E L ++ +T W+ + Sbjct: 131 NPISGASHW-DAYTPPGE-RRQGFDYWLSYGACDEHLDPHYWADTPEQIKPGCWSPEFET 188 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQDDLANKP 226 ++A++++++ EPF + VSY+ PH P P +Y E+Y D E ++ P Sbjct: 189 DKALEYMEEKKNQAEPFALFVSYNPPHLPHELVPDKYYEQYKDMPVYFRENVPEE-KRTP 247 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDH 285 + + Q YFA VD+Q GR++ L E+T V+ ++DH Sbjct: 248 DMETITRQ----------------YFAAVTGVDEQFGRLLEFLKENGMEEDTIVVLSADH 291 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRS-----PQGERRQVDTPVSHIDLLPTMMALAD 340 GEM+G+H + K Y++ IPL IR P + V +P D +PT++ L + Sbjct: 292 GEMLGSHGHMGKN-VWYEESIHIPLYIRQKGRLVPVKYKELVASP----DHMPTILGLLN 346 Query: 341 IEKPEILPGEN 351 I P+ G N Sbjct: 347 IPVPDTCQGFN 357 >UniRef50_D1AX15 Sulfatase n=2 Tax=Fusobacteriaceae RepID=D1AX15_STRM9 Length = 491 Score = 131 bits (330), Expect = 5e-29, Method: Compositional matrix adjust. 
Identities = 120/464 (25%), Positives = 203/464 (43%), Gaps = 69/464 (14%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ N LF++ D +GCY K T NID LA +G F + + SPVC+PARA +F Sbjct: 1 MKKNNILFIIADDLGAWALGCYGNKDAITPNIDMLAEKGKIFENFFCVSPVCSPARASIF 60 Query: 61 TGIYANQSG------PWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 TG +Q G W N K ST Y C GKWH+ D Sbjct: 61 TGRIPSQHGIHDWLDEWENGTTTEDYLKGQSTFVDVLSKNNYICCMSGKWHMGLADV--- 117 Query: 112 GECPPEWDADYWFD-----GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 P+ YW+ G Y++ + +D + H +E T +I Sbjct: 118 ----PQKGFHYWYSHQKGGGPYYMAPM--------------YKDGKLIHEEEYIT--DKI 157 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 + A+DFL + D+PF + V+Y PH P+ + + + +L E + + Sbjct: 158 TEYAIDFLDDVYKEDKPFFLNVNYTAPHSPWD-----KKNHKEEILKLYEGCKFKSCPRD 212 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDH 285 +H ++ + YFA +D IG +I L + +NT +I+TSD+ Sbjct: 213 PYHPWKISETFEGNEEERIQILKGYFAALTSMDFGIGEIIKKLEEKDMLKNTLIIFTSDN 272 Query: 286 GEMMGAHKLISKGAA-----MYDDITRIPLII-RSPQGERRQVDTPVSHIDLLPTM---M 336 G MG H + KG MYD ++P II + + E +V+ +SH D+ T+ + Sbjct: 273 GMNMGHHGIFGKGNGTSPLNMYDSSVKVPFIIYKKDETEAEKVNNLLSHYDVRSTLLEYL 332 Query: 337 ALADIEKPEI-LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 L D++ I PG + + + ++ ++ + +D +G P R + +K V Sbjct: 333 GLDDVKDENIDYPGNSFSEILNNK--KIDDDKNVVIYDEYG---PTRMIRNEKYKYV--- 384 Query: 396 FTSDELYDRRNDPNEMHNLIDDI--RFADVRSKMHDALLDYMDK 437 + + P+E +NLI+D+ + ++ ++ + ++D M K Sbjct: 385 ------HRYPDGPHEFYNLIEDVEEKVNEINNEKYSKIIDQMRK 422 >UniRef50_C6J2X5 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J2X5_9BACL Length = 466 Score = 131 bits (329), Expect = 6e-29, Method: Compositional matrix adjust. 
Identities = 124/479 (25%), Positives = 204/479 (42%), Gaps = 54/479 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF+MTD Q + C + + + T ++D L A+ + F++A +P C P+RA + TG Sbjct: 6 KPNILFLMTDQQRNDTFSCINPEVV-TPHMDQLIADSVFFSNARCANPSCVPSRAAIMTG 64 Query: 63 IYANQS-GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 + ++ P P + M R ++AGYHT IGK H F +D + Sbjct: 65 KFPSECECPTFITHLPAHETTFMSR-LQEAGYHTAVIGKQH------FAGSPIKRGYDEE 117 Query: 122 YWFDGAN----------YLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 DG + YL LTE I ++ +D + I Sbjct: 118 MIIDGHSALYPDDTIQPYLDYLTENGIDRKHVMSKTLISGGTWEVDTKYHLDDYIGELGK 177 Query: 172 DFLQQPARA----DEPFLMVVSYDEPHHPFTCP-VEYLEKYADFYYELGEKAQDDLANKP 226 ++++ A D+P+ + +S+ PHHP+ C E+ E Y + + + DL KP Sbjct: 178 AWMKKKGAAKDECDKPWFLTLSFSGPHHPYDCEGTEFAELYDYEKLSVPKTSYADLDEKP 237 Query: 227 EHHRLWAQAMPSPVGDDGLYHHP---------LYFACNDFVDDQIGRVINALTPEQR-EN 276 H + D L H Y+A +D ++G +I + +N Sbjct: 238 PHFKQMGS-----YADIYLKHFSEETFKKTKRSYYANISLIDQKVGELIRIMKENHLYDN 292 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-QG-ERRQVDTPVSHIDLLPT 334 T +IYTSDHG+ MG ++ K + D + R+PL I+ P QG +V V +ID+ T Sbjct: 293 TLIIYTSDHGDFMGDFGMVEKLQCLSDSLMRVPLFIKPPIQGFSGFEVKDDVLNIDIAAT 352 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + +A E P L + + +E S +R + K V Sbjct: 353 CLEVAGAEVPASLSNYPYTCYWDDSKQKKVRDAIYMEAGS------IRGCIHQGIKTVHY 406 Query: 395 LFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 L S ELYD DP E HNL +D + + + + + ++++M +R+ S PW Sbjct: 407 LDRSYGELYDLNKDPQESHNLWNDPEYTEHKLEGYRIIVNHM------YRAIPKSDIPW 459 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 131 bits (329), Expect = 6e-29, Method: Compositional matrix adjust. 
Identities = 121/446 (27%), Positives = 185/446 (41%), Gaps = 91/446 (20%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+TD Q G K + T NID+LAAE Y +P C+P R+ L TG Sbjct: 31 RPNVVMVITDDQGYGDCGFTGNKVVQTPNIDALAAESSVLTD-YHVAPTCSPTRSALMTG 89 Query: 63 IYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDG------------ 105 + N++G W + G+ N T G F DAGY T GKWHL Sbjct: 90 HWTNRTGVW--HTISGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHLGDNYPYRAEDNGFT 147 Query: 106 ----HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 H G G+ P WD Y FDG S + NG + D F Sbjct: 148 EVYRHGGGGVGQTPDFWDNAY-FDG------------SYFHNG--KAVKAEGFCTDVFFK 192 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 +R F+++ ADEPF ++ + PH P P +KY D Y E+ D+ Sbjct: 193 EGNR-------FIRECVEADEPFFAYIATNAPHGPLHAP----QKYIDMYPEM----NDN 237 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 +A +F VDD +G+ L +NT I Sbjct: 238 VAT--------------------------FFGMITNVDDNVGQTRKLLRELGVHDNTIFI 271 Query: 281 YTSDHGEMMGAH----KLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLP 333 +T+D+G GA + K + Y+ R+P ++ P+G + R +T +D++P Sbjct: 272 FTTDNGTAGGASVYNAGMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVP 331 Query: 334 TMMALADIEKPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 T++ + +E PE + G +I+++ + V FN + DS P++ + + Sbjct: 332 TLLDMCGVEAPESVKFDGTSIVSLLKDE-VDSSFNDRMLITDSQRVIDPIKWRQSSVMQD 390 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDD 417 L ELY+ NDP + +N+ D Sbjct: 391 KWRLINGKELYNIANDPGQENNIAGD 416 >UniRef50_A6DME6 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DME6_9BACT Length = 461 Score = 130 bits (328), Expect = 8e-29, Method: Compositional matrix adjust. 
Identities = 123/457 (26%), Positives = 206/457 (45%), Gaps = 55/457 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D + G Y + + NID LA+ F +A+ VC P+RA L T Sbjct: 19 EKPNVLFIAVDDLKPEL-GAYGNTQVKSPNIDKLASRSSVFTNAHCQWAVCGPSRASLMT 77 Query: 62 GIYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWH----LDGHDYFGTG 112 G+Y +G +V P ++ T+ ++FK++GY T GK + +DG T Sbjct: 78 GLYPESTGVMDLKTPMRSVNP--DVLTLPQHFKNSGYFTAATGKIYDPRCVDGR----TK 131 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 + P W Y L ++ L ++G + + + N DE T + N +D Sbjct: 132 DDAPSWSTPY--------KTLNYGKVKL-KDGKHFAKAPELN--DEDLTDGQILLN-GLD 179 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG---EKAQDDL------A 223 L+Q D+PF + V + +PH PF P +Y + Y L +KAQ + Sbjct: 180 LLEQAQNQDKPFFVAVGFKKPHLPFVAPKKYWDLYDRERLTLPSFLDKAQGASDYGWHDS 239 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR---ENTWVI 280 N+ + + P + +H Y AC ++D +GR+I L E+R +NT ++ Sbjct: 240 NELRSYDGIPKKGPIAIELQKEAYHG-YLACVSYIDALVGRLIQDL--EKRNLADNTIIV 296 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALAD 340 DHG +G H + K + + TR PLII P+ + ++ TP ID+ PT+ A Sbjct: 297 LWGDHGFHLGDHNMWGKHTNL-EQATRSPLIISLPKQKAQKSHTPAGLIDIFPTLCEAAG 355 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFKLVL----N 394 +E PE++ G ++ V + E ++++ SF + + T ++ + N Sbjct: 356 LEVPEVVQGTSLFPV-----INGEKDQHKNGAISFFKSKGAKGYSYRTKRYRYIEWSKGN 410 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + ELYD NDP E NL ++ + AL Sbjct: 411 KVEAIELYDYENDPQEKINLATQQESKELIRTLSQAL 447 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 130 bits (328), Expect = 8e-29, Method: Compositional matrix adjust. 
Identities = 115/461 (24%), Positives = 204/461 (44%), Gaps = 82/461 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L ++ D VG + + + T NID +A EG++F++ Y+ +C P RA L +G Sbjct: 19 KPNILIILADDLGYADVGYHGLEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSG 78 Query: 63 IYANQSG--------PWTNNVAPG--KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 +Y + G +V G + + T+ +YF++AGY T GKWHL G F Sbjct: 79 VYQQRIGCEGICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKT 138 Query: 113 ECPPEWDADYWF---DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 P D +F +GA+ + +E R +D ++ E FT A I Sbjct: 139 LMPTSRGFDEFFGILEGASLYDDTVNRERKYIR------QDTVIDYEGEYFTDA--IGRE 190 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 AV F+ + + D+PF + + + H P +Y++++A + Sbjct: 191 AVSFITR--KGDKPFFLYLPFTAVHAPMQASEKYMQRFAHI--------------ADPNR 234 Query: 230 RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG-- 286 R++A + + +DD IGRV +AL + +NT +++ SD+G Sbjct: 235 RVFAAMLSA-------------------MDDNIGRVFDALEHQGILDNTLIVFWSDNGGK 275 Query: 287 ---EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE---RRQVDTPVSHIDLLPTMMALAD 340 H L + Y+ R+P +R P+G+ + +D PV +D+ P+ + A Sbjct: 276 PDNNYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAAQ 335 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF----GGFIPVRCWVTDDFKLVLNLF 396 I P+ + + IL + + + + H + G + VR D+KL N Sbjct: 336 ITVPKDIEAKTILPLMQGK-------TNQTPHPAMFWKRAGKMAVR---MGDWKLS-NAG 384 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 EL++ + D +E N+ID + D+ +KM+ L++ K Sbjct: 385 GPSELFNLKQDISESRNIID--QHPDIANKMNRLWLNWDKK 423 >UniRef50_Q7UMT6 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Bacteria RepID=Q7UMT6_RHOBA Length = 524 Score = 130 bits (328), Expect = 8e-29, Method: Compositional matrix adjust. 
Identities = 126/463 (27%), Positives = 201/463 (43%), Gaps = 59/463 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF++ D + +G L T +ID++A +G AY + +C+P+RA + TG Sbjct: 43 PNILFILCDDHRFDCLGVAGHPFLETPHIDTMARDGAMLRRAYVTTSLCSPSRASILTGQ 102 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW----- 118 YA+ N A N+ +DAGY T +IGKWH+ G D W Sbjct: 103 YAHNHRVVDNYHAVDPNLVFFPESLQDAGYQTAFIGKWHMGG-DIDDPQRGFDHWVSFRG 161 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 YW DG E+ + +G N + + + ++ ++D+L+ Sbjct: 162 QGTYWPDGHGTTREVPQTTY----DGFN----VNGKRVPQRGYITDELTEYSLDWLKG-R 212 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYAD--FYYELGEKAQDDLANKPEHHRLWAQAM 236 ++PF + VS+ H F + +Y + E+ D NKP +W + Sbjct: 213 DPNKPFFLYVSHKAVHADFVPADRHRGRYDNEALPIEIPTVEAMDAGNKP----MWVRNQ 268 Query: 237 -PSPVGDDGLYHHP-----LYFA--CNDF--VDDQIGRVINALTPEQR-ENTWVIYTSDH 285 S G D Y+ P +Y+ C VDD +G++ L ++ +NT V+Y D+ Sbjct: 269 RNSRHGVDFGYNLPGFSPEVYYRRYCESLLAVDDSVGQLREFLKQQELDQNTIVVYMGDN 328 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEK 343 G G H LI K A Y+ ++PL++ +P V D V +ID+ PT++ A+ Sbjct: 329 GFQFGDHGLIDKRTA-YEASAKVPLLVVAPGKIPAGVPFDGLVGNIDIAPTLLEAANASA 387 Query: 344 PEILPGENI---LAVKEPRGVMVEFNRYE-----------IEHDSFGG-FIPVRCWVTDD 388 P+ + G+++ L + + YE H GG F +RC Sbjct: 388 PKNINGQSVWQALCSSDASSLNDRTLLYEYYWERNYPHTPTLHAVIGGRFKYIRCH---- 443 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 L+ DELYD +DP EM NLIDD R+AD ++ L Sbjct: 444 -----GLWDRDELYDLESDPGEMQNLIDDSRYADRVESLNQRL 481 >UniRef50_UPI0001C35525 putative sulfatase yidJ n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35525 Length = 491 Score = 130 bits (328), Expect = 9e-29, Method: Compositional matrix adjust. 
Identities = 130/485 (26%), Positives = 196/485 (40%), Gaps = 83/485 (17%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L ++ D + + Y +T N+D LAA AYT P+C PARA +T Y Sbjct: 6 NILIIICDQLSATALSAYGNTYSDTPNLDRLAAGSAVMEYAYTSCPLCQPARASFWTSRY 65 Query: 65 ANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGKWH----LDGHDYFGTGEC- 114 +Q+G +N G I T+G F AGY + GK H L G + E Sbjct: 66 PHQTGVLSNLPDQGFPAVSDGIPTLGELFSRAGYDCVHFGKTHDYGALRGFQVIESEEIH 125 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P + FD +L ID T ++V +L Sbjct: 126 VPRTNPAIKFDYETFLD------------------------IDTT--------EKSVQYL 153 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE-----LGEKAQ-DDLANKPEH 228 +R + PFLMV PH+ E+ E Y DF E L E DD+AN+PE Sbjct: 154 S--SRPEGPFLMVSDLQNPHNICAYIGEHSEGYGDFPLERELPPLPENYDFDDIANRPEF 211 Query: 229 HRLWA-----QAMPSPVGDDGLYHHPLYFACND-FVDDQIGRVINALTPE-QRENTWVIY 281 R Q S +D H+ + VD QIG+++ AL + T V++ Sbjct: 212 IRYLCCAHRRQRQASGWKEDDFRHYLYAYYYYLAMVDKQIGQILEALAESGATDQTMVVF 271 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR-------------------QV 322 +DHGE M +H L++K A Y++ R+P P G+ + ++ Sbjct: 272 LADHGEGMASHHLVTKYGAFYEETNRVPFFFSLPAGDNKSGTPGSCNKNAVPLYKKQNRI 331 Query: 323 DTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGF---- 378 S +DL+PT++ A I P + E I + + G +R + + F Sbjct: 332 GGITSLLDLVPTLLDYAGIPCPAGV--EGISLMPQITGAKTRSDRTAAVAEWYDEFRDYT 389 Query: 379 IPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 +P R +++K + L S+ELYD + D E NL +A V + L ++ K Sbjct: 390 VPGRMICDEEYKYICYLEPDSEELYDMKRDRYEKTNLAGKQEYAPVLERYRSLLKQHLKK 449 Query: 438 IRDPF 442 DPF Sbjct: 450 SDDPF 454 >UniRef50_A7V656 Putative uncharacterized protein n=6 Tax=Bacteroides RepID=A7V656_BACUN Length = 521 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 125/490 (25%), Positives = 210/490 (42%), Gaps = 66/490 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNI-DSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K PN + +M D +++ G PLNT D+LA G F+ AYT +P PAR + Sbjct: 30 KEPNIIIIMADQLRVDLL-QREGYPLNTMPFADNLAKNGTWFDCAYTSAPASGPARVSML 88 Query: 61 TGIYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG + + + +N N+ + K+ GY T +GK H + Sbjct: 89 TGRFPSATHVKSNHNIKDAYYTKDLFDVAKEKGYTTAMVGK----NHSHLTADRV----- 139 Query: 120 ADYWF---DGANYLSELTEKEISLWR--NGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 DYW G +EK + R L+ ++ + +R+ + A ++ Sbjct: 140 -DYWSPYNHGGQESRNKSEKGKAFDRYLGTLDMYASMEPSPYGVEAQLPYRMVDDACHWI 198 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYA-DFYYELGEKAQDDLANKPEHHRLWA 233 + D+PFLM S EPH+P+ Y + + E+G A+D L K E ++L A Sbjct: 199 D--SHKDKPFLMWFSIAEPHNPYQVCEPYYSMFPPESLPEMGSSAKD-LNTKGEEYQLLA 255 Query: 234 QAMPSPVGDDGLYHHPL------YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG 286 + M G G Y L Y +DDQ+ R + L +NT +I+ +DHG Sbjct: 256 EMMAQ--GHVG-YRENLQRLRSNYHGMLRMIDDQLSRFVGELKKNGVYDNTIIIFVADHG 312 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKP 344 + +G + L+ KG + D +TRIP+ P + + + VS ID+ PT+ + E P Sbjct: 313 DYVGEYGLMKKGVGLDDVLTRIPMQWTGPGIKASAIPHNAHVSIIDIFPTICEIIGAEIP 372 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD----------------- 387 + G ++ + + + + R + D FGG + TD Sbjct: 373 MGVQGRSLWPLLQGKEYPEQEFRSVMAQDGFGGMYYTKVDATDYREEGAVGKKGLFFDEL 432 Query: 388 ---------------DFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 D+KLV ++ + +LY+ + DP+E++NL D +F V+++M + LL Sbjct: 433 NTWTQSGTMRMLRKGDWKLVYDMNANGQLYNLKADPSELNNLFSDKKFNKVKNEMIEELL 492 Query: 433 DYMDKIRDPF 442 + DP Sbjct: 493 RWDISTHDPL 502 >UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA8_RHOBA Length = 745 Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 122/462 (26%), Positives = 202/462 (43%), Gaps = 57/462 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN LF+ D + VGC G P T N+D A + + FN+A+ +C +RA T Sbjct: 308 RPNVLFITVD-DLNDWVGCLGGNPDAQTPNLDRFAQQSVLFNNAHCQVALCYASRASFMT 366 Query: 62 GIYANQSGPWTNNVAPGKNI----STMGRYFKDAGYHTCYIGKWHLDGH------DYFGT 111 G+YA+++G + N+ ++ M +F ++GY T +GK + + H D G Sbjct: 367 GMYASKTGIYNNSSKSARDAYHRAKQMPVWFGESGYRTMCMGKIYHNDHGKKAYWDEIGP 426 Query: 112 -----GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 G PP G + L + + + G+ DE +I Sbjct: 427 KTLRWGPEPPNGRQFTKRFGKDAQDSLAWAALDIEKGGMP----------DE------QI 470 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 + ++ L Q D+PF + + + +PH P T P Y E++ L ++DL + P Sbjct: 471 AAWGIEKLDQ--EYDQPFFLSLGFYKPHTPMTAPKRYFEQFDRDSLTLPNVLENDLDDVP 528 Query: 227 EHHRLWAQAMPSPVGDDGLYHHP---------LYFACNDFVDDQIGRVINAL-TPEQREN 276 E R W + ++ + + Y AC +DD IG+V+ L N Sbjct: 529 EIGRRWVLDRSKLIAEEAVKQYSPTYRRELVHAYHACVALIDDCIGQVLRKLDNSPYANN 588 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPT 334 T V+ SDHG +G K +++ TR LI+R+P G + V ID+ PT Sbjct: 589 TIVVLCSDHGWHLGEKNHWRKWMP-WEESTRSLLIVRTPDAAGSGQVCQRTVGLIDIYPT 647 Query: 335 MMALADIEKPEILPGENILAVKE-PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL 393 + L ++ P+ L G + + + P G ++R + G VR +D ++ + Sbjct: 648 LAELCELSPPDGLQGLSFRKLLDNPDG---PWDRPALTSTKAGNHT-VR---SDRWRYIR 700 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 + S+ELYD DPNE HNL +D ++ K H +D + Sbjct: 701 YIDGSEELYDHDVDPNEWHNLANDPSMNSIK-KQHAEWIDRL 741 >UniRef50_A6DJ72 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=3 Tax=Bacteria RepID=A6DJ72_9BACT Length = 495 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 122/463 (26%), Positives = 214/463 (46%), Gaps = 63/463 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPL---NTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 +RPN +F++TD Q + VG Y KPL +T +I+ +AAEG++F + Y + +C+P+RA Sbjct: 25 QRPNVVFILTDDQRGDAVG-YHKKPLLGIDTPSINKIAAEGVQFENMYCTTSLCSPSRAA 83 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 +G Y + + N ++ + + GY T +IGKWH+ G + Sbjct: 84 FLSGTYTHTHKVYDNFTDYPHDLKSFPLLLQQEGYTTGWIGKWHM------GEEDDSKRP 137 Query: 119 DADYWFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 DYW +T K + W N + +AH++++ A+DFL + Sbjct: 138 GFDYW---------VTHKGQGKYWDTTFN----VNGERKKVPGYYAHKVTDMAIDFLNKV 184 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLE-------KYADFYYELGEKAQDDLANKPEHHR 230 ++ +PF + + + PH PF +Y Y D ++LG+K + + P H Sbjct: 185 DKS-KPFALCLGHKAPHGPFIPEAKYDSIYNDTPVPYPDSSWKLGDKPKWIVDRLPTWHG 243 Query: 231 LWA------QAMPSPVGDDGL-YHHPL--YFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 ++ + P+ + + H + Y A + VDD +GR+ + L +NT +I Sbjct: 244 IYGPLYGFRKDFPNDKASAIVDFEHFVRSYTATINSVDDSVGRIYDHLEEMGILDNTILI 303 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMAL 338 +TSD+G ++G H +I K M++ IPL +R P+ + + V ID+ PT+M L Sbjct: 304 FTSDNGFLLGEHGMIDK-RTMHEASVSIPLTVRFPKKIKGGTVIKEQVLSIDMAPTIMEL 362 Query: 339 ADIEKPEILPGENILAVKEP-------RGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 +K G + + + + + E+N YE++ F VR +K Sbjct: 363 TVGKKMPSAQGLSWATLLDDTKDAEWRKTWLYEYN-YEVQ---FPYTPNVRGIRHGKWKY 418 Query: 392 VL-------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 V L +ELY+ DP+E NL +D +AD++S + Sbjct: 419 VAYPHGDGGKLRHMEELYNMERDPSESSNLAEDPAYADIKSML 461 >UniRef50_Q7WC54 Putative sulfatase n=3 Tax=Proteobacteria RepID=Q7WC54_BORPA Length = 529 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 119/441 (26%), Positives = 191/441 (43%), Gaps = 20/441 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PNFLF+M D + Y T N+D LAA RF + Y P+C P+R + T Sbjct: 5 KQPNFLFLMADQLTAFALRMYGNGVCRTPNLDRLAARSTRFANMYCNFPLCAPSRVAMLT 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG-HDYFGTGECPPEWDA 120 G + G + N + T + AGY T GK H G + G E Sbjct: 65 GRLPSSVGVYDNASEFSAEVPTFLHHLALAGYSTILSGKMHFVGPEQHHGFQE---RLTT 121 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT--WAHRISNRAVDFLQQPA 178 D + + + E EI + G+N ++A + + + R V + Sbjct: 122 DIYPSDFGWTPDWRE-EIPIAPTGMNMRSVIEAGEYRRSMQIDYDDDVVYRGVQKIYDLG 180 Query: 179 RA--DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK-PEHHRLWAQA 235 R D PF + VS PH+P+ E+L+ Y ++ A + P RLW Sbjct: 181 RLHRDRPFFLAVSMTHPHNPYVSTREFLDLYRPEDIDMPAVPPIPFAQQDPHSQRLWYMF 240 Query: 236 MPSP--VGDDGL-YHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 V D + Y+A +VD Q+GR+++AL + E+T V++T+DHG+M+G Sbjct: 241 RQDEYDVSDAHVRAARHAYYAMVSYVDAQVGRMLDALQAMDLDESTVVVFTADHGDMLGE 300 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPV-SHIDLLPTMMALADIEKPE--ILP 348 L K +D RIPL+I +P R V + S +D+ PTM+ LA + P+ P Sbjct: 301 RGLWYKW-VHFDPAVRIPLLISAPGRTRPAVRHELASLVDIFPTMLELAGVSVPDDGASP 359 Query: 349 GENILAVKEPRGVMVEFNRYEI--EHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 + ++ E GV + + E + G P +KLV+ L++ ++ Sbjct: 360 PPDGRSLAEGLGVSQDEPTGVVYGEMNGEGAHAPCLAVRQGWWKLVVAEGDPPLLFNLQD 419 Query: 407 DPNEMHNLIDDIRFADVRSKM 427 DP+E+ NL D+ ++ Sbjct: 420 DPHELRNLAGQPAARDIERQL 440 >UniRef50_D2S234 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2S234_9EURY Length = 461 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 126/470 (26%), Positives = 200/470 (42%), Gaps = 76/470 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLN-TQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + V+TD Q + VG Y G PL+ T +D+LAA+G A T P+C P RA + Sbjct: 21 RPNVIAVVTDQQRWDTVGVY-GCPLDLTPTLDTLAAQGSVLTQAITPQPLCGPFRAAFQS 79 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G YA++ W + V + + R FKDAGY Y+G WH+ G E D Sbjct: 80 GKYASEVDVWRDAVRMPSDELHLSRQFKDAGYDVGYVGNWHIAGTFDNPVPEQSRGGYED 139 Query: 122 YWF--DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 +W D + ++ TE L+ N V+ + +D +A A++ L Sbjct: 140 FWIAADVPEFTTQPTEGH--LFDADGNPVK-FERYRVDAFTAFA----CEAIESLS---- 188 Query: 180 ADEPFLMVVSYDEPHH-----PFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 EPF +VV+Y EPH+ + P Y E Y Y +DL ++P W + Sbjct: 189 --EPFFLVVAYVEPHNQNDMWSYVAPDGYAEPYQKRPY-----VPEDLQDRPGD---WYE 238 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK 293 A+P Y+ + +D+ + ++ L+ R+ T + YTSDH G H Sbjct: 239 ALPD------------YYGMVERIDECVDNLLEVLSDRGIRDRTIIAYTSDH----GCHF 282 Query: 294 LISKGAAMYD---DITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPEILPG 349 G D R+P I+ P ++ V P S I+L PT++ A I+ P + G Sbjct: 283 RTRPGEYKRDPHESAIRVPAILVGPGFDKGVDVTQPTSMINLPPTLLDAAGIDVPNEMHG 342 Query: 350 ENILAV-------------------KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 E++L + + R + + +Y + S G W + Sbjct: 343 ESLLPIIRRDVPDVNGEAFIQISESQVGRALRTDRWKYAVAASSLTG------WRGGSAE 396 Query: 391 LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +++ LYD DP+E NL+ F + + D +L Y+ +I D Sbjct: 397 KSSDVYVERYLYDLERDPHEQVNLVGHPDFRSIADDLRDRILAYIQEIED 446 >UniRef50_UPI0001BC85B0 choline sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC85B0 Length = 497 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 124/449 (27%), Positives = 205/449 (45%), Gaps = 63/449 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 ++PN LF++TD + + + + T IDSL AEG+ F + YT C + P+RA Sbjct: 34 QQPNVLFILTDDLQASSIHALGNEDVYTPAIDSLIAEGVTFTNTYTNGALCGALSMPSRA 93 Query: 58 GLFTG--IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY---FGTG 112 L TG +Y QS + + K +T + F+ GY T GKWH D + F G Sbjct: 94 MLMTGRGLYNIQS----DGMKIPKAHTTFPQQFRRHGYRTFATGKWHSDKAAFNRSFQEG 149 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 + + +F G + + L V + E F+ + ++ A+ Sbjct: 150 D-------NIYFGGMHPYEQNGHCSPHLNHYDSTGVYGPKTKFTGEEFS-SKMYADAAIR 201 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYA--------DFY----YELGE-KAQ 219 FLQ+ +PFL V++ PH P Y KY+ +F + GE + + Sbjct: 202 FLQKQKGDKQPFLAYVAFTSPHDPRNQLPNYGRKYSPDTLDVPRNFLPKHPFNNGEMRVR 261 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPE-QRENT 277 D+L +P+P + + Y+ VD QIGR++ L Q ENT Sbjct: 262 DELL------------LPAPRTEQQVQKELSDYYGMISEVDVQIGRIMEVLRATGQAENT 309 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHI---DLLPT 334 V++ SD+G +G H L+ K +YD ++PL I +P + R+ + S D+ PT Sbjct: 310 IVVFASDNGLAVGRHGLLGK-QNLYDHSVKVPLTIIAPSYKNRKGEKNQSLCYLHDIAPT 368 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV-RCWVTDDFKLVL 393 + LA+I PE + +++ V E G +R E+ F + + R +V D +K ++ Sbjct: 369 LCELANIPLPESMNAQSLYPVLEDSGTT---HRKEL----FLAYSNIQRAFVNDSYKYII 421 Query: 394 ---NLFTSDELYDRRNDPNEMHNLIDDIR 419 N +++L+D + DP EMHNL+ + R Sbjct: 422 YHVNGKITEQLFDLQKDPLEMHNLLTEKR 450 >UniRef50_A5FX90 Sulfatase n=4 Tax=Alphaproteobacteria RepID=A5FX90_ACICJ Length = 518 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 126/432 (29%), Positives = 182/432 (42%), Gaps = 36/432 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L VM D + Y + T NID+LAA G+ F++AY SP+C P+R +G Sbjct: 17 RPNILIVMADQLGARALPAYGNQVALTPNIDALAAGGVVFDNAYCNSPLCGPSRYVFMSG 76 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD--- 119 + G + N + + + + AGY T GK H G D E D Sbjct: 77 QLPSAIGAFDNAAEFPAMLPSFAHHMRAAGYRTILSGKMHFCGPDQMHGFEERLTTDIYP 136 Query: 120 ADY-----WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 AD+ W D A S W + ++SV + + + A L Sbjct: 137 ADFGWTPDWTDFATRPS---------WYHDMSSVREAGLCVRTNQMDYDDEVVFAARQKL 187 Query: 175 QQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL- 231 AR D+ PF MVVS PH PF EY Y ++ + P RL Sbjct: 188 FDLARDDDGRPFCMVVSLTHPHDPFAMTEEYWNLYDHDAIDMPRVRTAPASMDPHSLRLR 247 Query: 232 -WAQAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRV---INALTPEQRENTWVIYTSDHG 286 + PV + + + Y+A FVD Q+GR+ + A R T + T+DHG Sbjct: 248 HVSNMDNEPVTEAQVRNARHAYYAAISFVDRQLGRLRETVEACGLAAR--TVTVMTADHG 305 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADIEKPE 345 E++G H L K + ++D RIPLI+ +P + +V VS +D+LPT++ L P Sbjct: 306 ELLGEHGLWYK-MSFFEDACRIPLIVHAPGRFAPARVGAAVSSVDMLPTLVGLGGGRIPA 364 Query: 346 ILP--GENILAVKEPRGVM-VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELY 402 L G ++L E RG F Y E G P+ K + D+L+ Sbjct: 365 GLACDGTSLLGHLEGRGGHDGAFGEYLAE----GAIAPIVMIRRGRHKFIHCPADPDQLF 420 Query: 403 DRRNDPNEMHNL 414 D DP+E NL Sbjct: 421 DLEADPDERANL 432 >UniRef50_Q7UWE8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWE8_RHOBA Length = 488 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 123/452 (27%), Positives = 191/452 (42%), Gaps = 37/452 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D + GCY +++ NID LAA G+RF+ AY VC +RA L +G Sbjct: 36 NVLMIAVDDLRPEL-GCYGKSYMHSPNIDRLAASGMRFDRAYCQVAVCGASRASLMSGCR 94 Query: 65 ANQSGPWTNNV---APGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPEWDA 120 + W + ++ T+ ++ GY T ++GK +H D EW Sbjct: 95 PETTQCWNFKTLLRSQMPDVLTLPQHLSRNGYETGFLGKVYHSASDDAAAWTVDANEWAP 154 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVE------DLQANHIDETFTWAHRISNRAVDFL 174 G +Y+ EL K RN NS E + + D +T H ++RAV L Sbjct: 155 RDRSKGKSYVQELPRK-----RNPANSSEKNGPSIENGGDVPDSAYTDGHN-ADRAVALL 208 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 ++ + D+PF + V + +PH PF P +Y + Y ++ + +D + P W + Sbjct: 209 ERFSTQDKPFFLAVGFLKPHLPFNAPAKYWDLYDRDDIKIPSR-EDVVDGLPYARSSWGE 267 Query: 235 A---MPSPVGDDGLYHHPL------YFACNDFVDDQIGRVINALTPE-QRENTWVIYTSD 284 P D L Y A ++D Q+G+V+NAL QRENT V+ D Sbjct: 268 LKNYTDIPAKTDMLDDEKTRELIHGYRAAVSYMDAQVGKVLNALEANGQRENTIVVLWGD 327 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP 344 HG +G K Y+ TR+PLI+ +P + + V +DL PT+ L ++ P Sbjct: 328 HGWYVGDFGDWCK-HTNYEIATRVPLIVSAPGVPAGETKSLVELVDLFPTLCELTELPVP 386 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV--RCWVTDDFKLVLNLFTSDE-- 400 E G++I V G+ V + S G PV TD F+ + T Sbjct: 387 EHCQGKSIAGVVHDPGLSVRPAAFSQYKKSKLGVGPVLGTSIRTDRFRYTEYVSTKTGKL 446 Query: 401 ----LYDRRNDPNEMHNLIDDIRFADVRSKMH 428 L D DP N+ D + ++H Sbjct: 447 EDIVLIDFDKDPGATRNVASDPAYQPFLPQLH 478 >UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TKJ5_SHEHH Length = 492 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 119/466 (25%), Positives = 204/466 (43%), Gaps = 87/466 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + D + Y + T NID LAAEGI+F Y+ +P+C+P+RAG+ TG Sbjct: 29 KPNVVIFYVDDLGYGDLATYGHNIVKTPNIDKLAAEGIKFTQYYSPAPLCSPSRAGMLTG 88 Query: 63 IYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG------------ 105 ++G P NV GK T+ KD GY T GK HL+G Sbjct: 89 RTPYRTGIRSWIPDGQNVHIGKEEITLAHMLKDEGYDTAITGKLHLNGGAHMKDHPQASD 148 Query: 106 ----HDYFGTGECPPEWDADYWFDGANYLSELTEKEI---SLWRNGLNSVEDLQANHIDE 158 H + P W + + N L +I + WRNG+ E Q + Sbjct: 149 LGFEHSFI----IPGGWAKNAKTEAKNADGSLRHGKIHVDNFWRNGVPVGETDQFS---- 200 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 A ++N A+ +L D+PF + V + E H P P +YL+ Y D+ + ++ Sbjct: 201 ----ADLVANEAIGWLDDQG-GDKPFFLYVPFSEVHTPIASPQKYLDMYGDYLTDFAKEN 255 Query: 219 QD----DLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQ 273 D D N+P +R + YFA ++D Q+GRVI+ L + Sbjct: 256 PDLFHWDWVNQP--YRGQGE----------------YFANITYMDAQLGRVIDKLKAMGE 297 Query: 274 RENTWVIYTSDHG------------EMMG-AHKLISKGAAMYDDITRIPLIIRSPQGERR 320 +NT ++++SD+G M G L + +++ R+P+I++ + Sbjct: 298 YDNTIILFSSDNGPVTREARKPYELNMAGETGGLRGRKDNLFEGGIRVPMIMKYHGHVKA 357 Query: 321 QVDT--PVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEI-------E 371 + D+ P+ +D++PT+ L + P + + V G+ VE + I + Sbjct: 358 ETDSDEPIYGLDIVPTLSELIGFDTPSDRTIDGVSFVSTFNGLSVERTKPMIWTIDMPYQ 417 Query: 372 HDSFGGFIPVRCWVTDDFKLVLNLFTSDE-LYDRRNDPNEMHNLID 416 D+ + VR DFKL+++ +++ L++ D E++NL++ Sbjct: 418 DDAINEY-AVRI---GDFKLIIDRQGNNKYLFNIGQDKYEVYNLLN 459 >UniRef50_D0DCV9 Choline-sulfatase n=2 Tax=Citreicella sp. SE45 RepID=D0DCV9_9RHOB Length = 474 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 122/471 (25%), Positives = 212/471 (45%), Gaps = 60/471 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN +F++TD Q + +G ++T N+D L EG F Y SP C+P+RA LF+ Sbjct: 3 KHPNIVFIITDQQRIDTIGALGCPWMDTPNLDRLVNEGTAFEQMYVTSPSCSPSRASLFS 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGK---WHLDGHDYFGTGE----- 113 G Y + +G + N+ + + + K +GY T +GK W ++G FG E Sbjct: 63 GTYPHTNGVFRND---ERWVYSWVGLLKQSGYRTVNVGKMHTWPVEG--AFGYDERHVTE 117 Query: 114 ----CPP-------EWDADYWFDGANYLSELTEKEISLWRNGL-----NSVEDLQANHID 157 P WD +W G + +T++E+ + L ++ EDL A++ Sbjct: 118 NKDRAHPNLPFYLDNWDKAFWARGVEKPTRVTQREMPDYAERLGCYVWDAPEDLHADNF- 176 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 + A +L + + DEPF + + PH P+ EYL KY + +L E Sbjct: 177 --------VPEMACMWLDR-YKGDEPFFLQIGIPGPHPPYDPTAEYLAKY-EGRDDLPEP 226 Query: 218 AQDDLANKPEHHRLWAQA-----------MPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 + D +P R + +P P + Y+A +D Q+G ++ Sbjct: 227 IRYDFDTQPGPLRELRRQHLDNDHDAVVHLPDPTAEQMRLQRAHYYANVSMIDTQVGNIL 286 Query: 267 NALTPEQR---ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVD 323 AL E+R ++T +++TSDHG+ + H S+ M++ R+P I+ + D Sbjct: 287 AAL--ERRGVLDDTIIVFTSDHGDCLNDHGH-SQKWNMFEATVRVPAIVWGRGIPAMRRD 343 Query: 324 TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS-FGGFIPVR 382 V+ D PT++ A + P + +++ + + + E +D+ G + Sbjct: 344 ELVALFDWGPTILEWAGVTPPAWMEAQSLNPLMAGEEQLRDRVFAEHANDAILTGTSYMT 403 Query: 383 CWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVR-SKMHDAL 431 D+KLV + +S+ +L+D +DP E NL DD A+ + S +HD L Sbjct: 404 MIRRGDWKLVHFVDSSEGQLFDLASDPGERSNLWDDPAQAERKLSLIHDIL 454 >UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepID=B8KHZ9_9GAMM Length = 483 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 126/451 (27%), Positives = 217/451 (48%), Gaps = 70/451 (15%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D G Y + + T +ID LAAEG+RF Y S +C+P+RAGL TG Sbjct: 30 NVLLIYVDDLGYGDTGAYGHRVVKTPHIDRLAAEGMRFTQFYAPSALCSPSRAGLLTGRT 89 Query: 65 ANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 ++G P + VA G N +T+ K GY T IGKWHL+G + P ++ Sbjct: 90 PYRTGVESWIPDDSQVALGHNETTLADLAKARGYRTAVIGKWHLNGGLHMQGTPQPRDFG 149 Query: 120 ADYWFDGANYLSELTEKEIS-LWRNGLNSVEDLQANH--IDETFTW-AHRISNRAVDFLQ 175 D+ + A ++ + +E L R G +++ N+ + T + A +S+ A+D+L Sbjct: 150 FDHQYGLAAWVKNASVRESKELPRRGAMFPDNMYRNNEAVGPTKKYSAELVSDEAIDWL- 208 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-------DLANKPEH 228 + A +PF ++++Y E H P P EYL +Y D+ L ++A+D D N+P Sbjct: 209 --SGAKDPFFLLLTYSEVHTPIASPPEYLAQYQDY---LTQEARDNPLLFYFDWRNRPWR 263 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 R G+ Y+A ++D Q+GRVI L + ++T +I++SD+G Sbjct: 264 GR----------GE--------YYANVSYMDAQLGRVIEYLRGKGVLDDTLIIFSSDNGP 305 Query: 288 MMGA-------------HKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDT-PVSHIDLL 332 + A L K +++ R+P IIR P+ E +V++ P + +D+ Sbjct: 306 VTDAALTPWELGMAGETAGLRGKKRFLFEGGLRVPGIIRYPERIEAGRVESRPATALDVF 365 Query: 333 PTMMALADIEKPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD-DF 389 PT+ + +P GE++ + + +F R + + S + V D ++ Sbjct: 366 PTLAQWLGVAVDSSVPLDGESLWPLIDGG----DFQRQQAFYWSIPTPDGMEFAVRDGNW 421 Query: 390 KLVLNLFTSDE----LYDRRNDPNEMHNLID 416 KL+L+ +DE L+D +D E++NL++ Sbjct: 422 KLILD---ADERPQYLFDLASDWYEVNNLLE 449 >UniRef50_D2MLH3 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLH3_9BACT Length = 454 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 119/464 (25%), Positives = 203/464 (43%), Gaps = 59/464 (12%) Query: 29 TQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYF 88 T NID L++EGI F +A + PVC+P RA LFTG Y +G N + ++ T+ Sbjct: 8 TPNIDRLSSEGIDFVNATSVYPVCSPHRASLFTGCYPTTNGYVMNELGARTDLPTLAGTL 67 Query: 89 KDAGYHTCYIGKWHL---DGHDYFGTGE---------CPP--------EWDADYWFDGAN 128 G + YIGKWH+ +G Y G+ PP + A Y F+ + Sbjct: 68 TQNGVNCAYIGKWHIYATEGKVYKQAGDFHKNPANQFVPPGPHRLGFDHYWAAYNFNHSY 127 Query: 129 YLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVV 188 Y E + D+ A D +++ A+ +L+ + PF + V Sbjct: 128 YKGFYYEDKFER--------IDIPAYEPDA-------MTDLAISYLENASENPNPFALFV 172 Query: 189 SYDEPHHPFT---CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA---MPSPVGD 242 SY PH P+ P E+ + D ++L +D E+ W M S V Sbjct: 173 SYGTPHQPWNWDNVPEEWGMHFKDMAFDLPPNYRD---GSGEYWHAWFDRDWWMKS-VKP 228 Query: 243 DGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISKGAAM 301 + + +Y A +D +GR+++A+ +T V++TSDHGEM GAH + K Sbjct: 229 NLVEWQRIYAAMTANLDWNVGRILDAIDRFNLAHDTLVVFTSDHGEMFGAHGRVQKN-IY 287 Query: 302 YDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV---- 355 Y++ R+P ++R P + + D ++ D++PT+++L D++ P+ + G ++ Sbjct: 288 YEEAARVPFLMRWPNRIAPKSESDACLNTPDIMPTLLSLIDVDIPDGIDGFDLSHCVFGQ 347 Query: 356 --KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHN 413 KEP ++ ++ D GF R + + + LY+ R DP ++HN Sbjct: 348 HGKEPEAAFLQGMGPSVDWDD--GF-EWRALRDKQYTYAIEKH-RESLYNHREDPLQLHN 403 Query: 414 LIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDAR 457 L ++ A D + MD + D F + W ++ R Sbjct: 404 LANNPDQARTIQHYRDQIRTRMDALNDTFEPITFYRDHWIENGR 447 >UniRef50_A6DNH0 Choline sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNH0_9BACT Length = 466 Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 121/443 (27%), Positives = 208/443 (46%), Gaps = 38/443 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L + D + G G P + T ++D LA G F +A+ PVC+ +R + Sbjct: 18 EKPNVLMISID-DLNDWTGFLGGHPQVKTPHMDKLANSGRIFANAHCAVPVCSSSRVSVM 76 Query: 61 TGIYAN-----QSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 +G+ A + GP ++ K++ T+ R+FK+ GY+T GK + H + G+ Sbjct: 77 SGLAATTHGSYEIGPSYQSIPALKDVLTIQRHFKNQGYYTLAGGK--VLHHGFKGSVAND 134 Query: 116 PEWDADYWFDGANYLSELTEKE--ISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 + G L E W G + D QA+ + ++++ A Sbjct: 135 NDRSLIKGHSGPKPKQPLNLPEGWSRAWDWGQHPGTDAQAHDM--------KLAHNAAQA 186 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 LQ+ D+PF M V + PH P P ++ Y + L + DL + P++ Sbjct: 187 LQE--DFDKPFFMSVGFFRPHVPLLVPPKWFNLYDEESIVLAPSPKSDLDDVPKNFLSIN 244 Query: 234 QAMPSPVGDDGLY---HHPL---YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG 286 +P + L H L Y A FVD +GRVI+AL + +NT VI SDHG Sbjct: 245 DYAVAPTHKEVLATDSHRKLTHAYLASISFVDACVGRVIDALKNSKYADNTIVILWSDHG 304 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADIEKPE 345 +G + +K ++++ T++PL++ P E + P S ID+ PT++ L ++ P+ Sbjct: 305 FHLGEKEHWAK-RTLWEESTKVPLLVYGPGIESGEACLEPASLIDIYPTLVDLCGVKAPK 363 Query: 346 ILPGENIL-AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDR 404 L G +++ +K P + E + I +G VR T D++ + ++ELYD Sbjct: 364 KLDGISLMPQLKNP---LSERKQPAIISSYYGNHA-VR---TRDWRFISYEDGAEELYDH 416 Query: 405 RNDPNEMHNLIDDIRFADVRSKM 427 +NDP+E NLI+D + +R ++ Sbjct: 417 KNDPDEYKNLINDPNYKSIRDEL 439 >UniRef50_Q15XR5 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XR5_PSEA6 Length = 549 Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 131/508 (25%), Positives = 212/508 (41%), Gaps = 112/508 (22%) Query: 5 NFLFVMTDTQATNMVGCYSGK--PLN-TQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 N L++MTD A + VG Y G+ LN T N+D+LA EG+ F + + + +CTP+RA + T Sbjct: 46 NILYIMTDDHAAHAVGAYQGRLAELNPTPNLDALANEGMTFTNVFVTNSICTPSRATILT 105 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH----DYFGTGECPPE 117 G Y+ +G + + R K+AGY T IGKWHL DY+ E Sbjct: 106 GQYSQTNGVLDLRGKIATSQQHLPRLMKEAGYETAIIGKWHLKAEPGAFDYYQVLES--- 162 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 G + E + W N T + +++ ++++L+ Sbjct: 163 -------QGTYFDPEFRTRGPKPW----------PENETQYTGHSSDVVTDLSIEWLENR 205 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL----------ANKPE 227 A++PF ++ + PH F +Y +Y DF DDL A + + Sbjct: 206 V-ANKPFFLMHQFKAPHDMF----KYAPRYEDFLAAETIPEPDDLYAVAKTFGSIATRGK 260 Query: 228 HHRLWA------------QAMPSPVGDD----------GLYHHPL--YFACNDFVDDQIG 263 + L A ++M +G D Y L Y C VDD + Sbjct: 261 NDTLRADIGTSVSRRNNRRSMGIDLGVDPNLSEEEFTRQAYQKYLKAYLRCVKGVDDNVA 320 Query: 264 RVINALTPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR-- 320 R+I L Q +NT +IYTSD G M+G H L K ++D+ R+PLI++ P Sbjct: 321 RLIQTLRDTGQYKNTIIIYTSDQGMMLGEHDLQDK-RWIFDESIRMPLIVKHPDASETGI 379 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEILPGEN----ILAVKEPRGVMVEFNRY---EIEHD 373 Q D +++ D P ++ LA+I P+ + G++ + + + + RY HD Sbjct: 380 QSDLLINNTDFAPFILDLANISTPKYMHGKSFKTALFSQPPEQWRTASYYRYWTHRAYHD 439 Query: 374 SFGGFIPVRCWV-TDDFKLVLNLFTSD-----------------------------ELYD 403 +P V + ++KLV T+ ELYD Sbjct: 440 -----VPAHFGVRSKEYKLVFYYGTNHMDSPYPFYDKAWLNKTGRSNNTIDTPVAWELYD 494 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDAL 431 +NDP+E +N+ D R++ V + + L Sbjct: 495 LKNDPSEQNNVYYDPRYSSVIASLKSEL 522 >UniRef50_A6DLY1 Putative sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLY1_9BACT Length = 441 Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 119/468 (25%), Positives = 197/468 (42%), Gaps = 62/468 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KR N ++ + D +G + T N+D+++ G F +A + P+C P RA + T Sbjct: 4 KRSNIIWFIADQMRGQAMGVNGDPNIFTPNLDNMSICGTNFPNAISGYPLCCPFRASMLT 63 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------DGHDYFGTGECP 115 G YAN + + +T+ F D Y T Y+GKWHL G T Sbjct: 64 GKYANNHSVQIHEDRLDPSTTTITDVFNDNQYDTIYLGKWHLAGIKEEKGRSALKTVPVI 123 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 D W N S+ W +G +++ +H +++ A++ + Sbjct: 124 DRGRFDTWIGYDNNNSQW-----DCWVHGHEDGKEI--DHYRLPGYETDCLTDMAIERIS 176 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + PF M+VS PH P P PEH R AQ Sbjct: 177 KYKDSSNPFFMIVSVQPPHLPTLAP-------------------------PEHRRYQAQQ 211 Query: 236 M------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEM 288 + PS + + Y+A + VD +GR+++ L E E+T +++ SDHG+ Sbjct: 212 LELRPNIPSDTEQEARFQSSGYYAQIENVDANVGRIVDYLRENEMIEDTHIVFFSDHGDQ 271 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQ----GERRQVDTP--VSHIDLLPTMMALADIE 342 MG+ K Y++ RIP II + RR +TP V+H+D+ PT + L ++ Sbjct: 272 MGSQGRFGK-CVPYEESLRIPFIIGGGKPMAYDGRRCGNTPGLVNHVDIAPTSLGLCGVD 330 Query: 343 KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFI-------PVRCWVT-DDFKLVLN 394 P+ + G + + GV +E YE ++ I RC VT D +K Sbjct: 331 IPDWMEGFDYSHRRT--GVNLEKRVYEEPDSAYSQLIGDRESAYAWRCVVTRDGWKYACI 388 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 L++ +DP E +NL + F + RS++++ + + KI D F Sbjct: 389 RGGEWLLFNLNDDPYEQNNLAFNTAFHEKRSELNNLIRTWAQKINDDF 436 >UniRef50_C7MHR6 Arylsulfatase A family protein n=3 Tax=Bacteria RepID=C7MHR6_BRAFD Length = 480 Score = 129 bits (323), Expect = 3e-28, Method: Compositional matrix adjust. 
Identities = 128/475 (26%), Positives = 213/475 (44%), Gaps = 64/475 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +MTD Q + + ++T N+D L EG F + Y SP C P+RA LFT Sbjct: 6 ERPNIVLIMTDQQRFDSIAALGHDHVDTPNLDRLVREGAAFTNTYVPSPSCAPSRASLFT 65 Query: 62 GIYANQSG------PWTNN-----VAPGKNISTMGR-----YFKDAGYHTCYIGKWHLDG 105 G+Y + SG PW+++ A G +++G+ Y G+H ++ + Sbjct: 66 GLYPHSSGVLRNDDPWSHSWVEHLSAAGYRCTSVGKMHTYPYEAPVGFHERHVIENKDRA 125 Query: 106 H---DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSV-----EDLQANHID 157 H YF +WD W G S +T +E + L + EDL A++ Sbjct: 126 HPDLPYFLD-----QWDKAIWIRGHQKPSRVTYRERDDYAERLGAFEWELPEDLHADNF- 179 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 + N A +L+ D+PF + + + PH P+ P +LE Y D ++ Sbjct: 180 --------VGNLARHWLETYPEHDDPFFLQIGFPGPHPPYDPPARHLEPYRDRPMPEAKR 231 Query: 218 AQDDLANKP---EHHRLWAQA--------MPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 Q DL ++P + R QA + +P + YFA +D+Q+G ++ Sbjct: 232 TQADLDSQPAPLKELRTHHQANDHDAIVQLENPTAEQLDRQRRHYFANVSLIDEQVGGIL 291 Query: 267 NALTPEQR---ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQ 321 +AL E+R +NT V++TSDHG+ + H S+ MY+ +P II P ++ Sbjct: 292 DAL--EERGVLDNTVVVFTSDHGDALNDHGH-SQKWTMYEPSVHVPGIIWGPGRVEPDQR 348 Query: 322 VDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV 381 D VS +D+ PT++ LA + PE + ++L + G E +Y + + Sbjct: 349 FDGLVSLMDIAPTVLELAGLTPPEWMEARSLLPALQ--GQEWEGRQYVFSEHARDAILTG 406 Query: 382 RCWVT----DDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 +T +KLV + D +L+D DP E NL A+ R ++ A+ Sbjct: 407 TALMTMARDARYKLVEFIDHEDGQLFDLAKDPYEETNLWFCEEHAETRRRLERAI 461 >UniRef50_Q7UZ92 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ92_RHOBA Length = 582 Score = 128 bits (322), Expect = 4e-28, Method: Compositional matrix adjust. 
Identities = 126/463 (27%), Positives = 207/463 (44%), Gaps = 66/463 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ D + +GCY T NID LA+ ++FN AY VC P+RA L T Sbjct: 26 QRPNVLFIAVDDLRPS-IGCYGDPQAITPNIDRLASRSVQFNRAYCQVAVCNPSRASLMT 84 Query: 62 GIYANQSGPWTNNV---APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + WT + T+ ++F+ GY GK Y P W Sbjct: 85 GLRPDNLAVWTLPIHFREAMPEAVTIPQWFRRYGYTAVSHGKI------YHNPTPDPQSW 138 Query: 119 DA---------DYWFDGA-----NYLSELTEKEISLWR-NGLNSVEDLQANHIDETFTWA 163 ++ DG + +EL +++ WR N L D+ Sbjct: 139 SEPIRDLPRLPAFYPDGTREQMKKFDNELPDRD---WRKNNLRGPSTAAPELADDQLLDG 195 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--ADFYYELGEKAQDD 221 R +N A++ L++ ++D PF + + Y PH + P +Y + + + GE+ + Sbjct: 196 AR-TNMAIEDLRRLGKSDAPFFLAMGYIRPHLAWVAPKKYWDMHDPSKLPVRTGEQIPKN 254 Query: 222 -----LANKPEH-HRLWAQAMPSPVGDDG--------LYHHPLYFACNDFVDDQIGRVIN 267 + N E H + +P P DD L H Y+AC ++D QIGR+++ Sbjct: 255 SPPYAMHNNSEMTHYVDRMNLPKPWDDDTVPTEDARHLMH--AYYACVSYIDAQIGRLLS 312 Query: 268 ALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDT 324 AL E +NT V+ SDHG +G H+ K Y+ +PL+I P + +Q D Sbjct: 313 ALKEEGLADNTIVVLWSDHGWKLGEHRGWGK-MTNYEIDAHVPLLITGPGVKCLGQQTDQ 371 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVE---FNRYEIEHD--SFGGFI 379 +DL PT+ +A I+ P+ + G +++ + V N+Y H+ + G+ Sbjct: 372 LAELLDLFPTLCEMAGIDVPDFVDGSSLVPILNDVDAKVHDGAVNQYYRRHEGRQYMGY- 430 Query: 380 PVRCWVTDDFKLV--LNLFTSD----ELYDRRNDPNEMHNLID 416 +R T D++LV + F+ + ELYD RND +E +++D Sbjct: 431 SIR---TSDYRLVEWRDFFSGEVAAKELYDHRNDDSENESIVD 470 >UniRef50_A6DPE5 Iduronate-2-sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPE5_9BACT Length = 487 Score = 128 bits (322), Expect = 4e-28, Method: Compositional matrix adjust. 
Identities = 117/458 (25%), Positives = 197/458 (43%), Gaps = 40/458 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+ D + +G Y + T N+D LA G F+ AY P+C P+RA + +G+ Sbjct: 22 NVLFISADDLNCD-IGPYGNTQVKTPNLDRLARMGTVFDRAYCQQPLCGPSRASIMSGLR 80 Query: 65 ANQSGPWT-NNVAPGK--NISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPEWDA 120 N G WT N+ G+ N+ TMG +F+ GY++ +GK +H Y GT D Sbjct: 81 PNTLGVWTLNSKLRGRIPNLVTMGEFFQKQGYYSGRVGKIYHYGNPTYIGTNGND---DE 137 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS-----------NR 169 W + N +E ++ R I + W +S +R Sbjct: 138 QTWTERFNPKGIDRTQEENIIRYPGGKTGKKGGLGI--SMAWWDPVSKDNEHTDGLVADR 195 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--ADFYYELGEKAQDDLANKP- 226 A+ ++ A D+PF + + PH P+ P +Y + Y D + E+A+ +LA+ P Sbjct: 196 AIKMIE--ANKDKPFFIAAGFFNPHCPYVAPKKYFDMYDINDIELQELEEAKQELADVPA 253 Query: 227 -----EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 + + W D+ Y+A F+D Q+GR+ AL + T ++ Sbjct: 254 MAIQRDAGQRWPYFYKGLTRDEAKQCKLAYYATVSFIDAQVGRIFEALEKNNLMDKTIIV 313 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALA 339 + SDHG +G L K A ++ R PL+I +P + QV +PV +D+ PT++ Sbjct: 314 FWSDHGYFLGEKGLWFKRKA-FERSARAPLLIAAPGLSKGQVCKSPVELLDIYPTLVEAT 372 Query: 340 DIEKPEILPGENIL-AVKEPRGVMVEFNRYEIEH--DSFGGFIPVRCWVTDDFKLVLNLF 396 + P L G ++ +K + + +I H D G I + W ++ Sbjct: 373 GFQIPSELEGVSLSPLLKNAQTKWTKPAITQIHHGADKQGYSIRTKKWRYTEWN---KGQ 429 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 ELY+ DP E NL + + +++ L + Sbjct: 430 AGKELYNHETDPEETINLATNPEHTQIVAQLSTELQKF 467 >UniRef50_C5BWB0 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5BWB0_BEUC1 Length = 497 Score = 128 bits (321), Expect = 5e-28, Method: Compositional matrix adjust. 
Identities = 114/394 (28%), Positives = 175/394 (44%), Gaps = 63/394 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L VMTD Q + +G +G P+ T N+D LAA+G F AY+ +P CTPARA L TG Sbjct: 4 RPNVLLVMTDQQRWDTLGS-AGGPVETANLDHLAAQGTTFTHAYSATPSCTPARASLLTG 62 Query: 63 IYANQSGPWTNNV-------APGKNI-STMGRYFKDAGYHTCYIGKWHLDG----HDYFG 110 PW + P + +T+ DAGYHT +GK H H + Sbjct: 63 -----QDPWHTGILGMGAGQPPMAGLENTLPEALADAGYHTQGVGKMHFSPQRALHGFHA 117 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR----- 165 T D + + S+ T+ W + QA+H + +W R Sbjct: 118 T-----TIDESLRVEEPGFTSDYTQ-----WFERHAPADVRQADHGLDFNSWLARPFHTG 167 Query: 166 --------ISNRAVDFLQQ--PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY--E 213 ++ FL++ P R PF ++ S+ PH P+ P Y E Y ++ + Sbjct: 168 EHLHPSTWTVTESIRFLERRDPTR---PFFLMTSFARPHSPYDPPAFYYEHYLRRHHTGD 224 Query: 214 LGEKAQDDLANKPEHHRLWAQAM-PSP-----VGDDGLYHHPLYFACNDFVDDQIGRVIN 267 L D A+ H A+ M P+ D+ Y+ +D QIGR++ Sbjct: 225 LPPAVVGDWASV--HDVGGAEGMDPNAWRGRRTADEIGRARAGYYGSIHHIDHQIGRLMR 282 Query: 268 ALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ----- 321 L + + T V++T+DHG+M+G H L K A Y+ +PL++R P G R Sbjct: 283 YLRDRRLDAETLVVFTADHGDMLGDHHLWRKTYA-YEGSAHVPLVVRLPAGMRSAGDAEV 341 Query: 322 VDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 VD PV D++PT++ ++ P + G + L + Sbjct: 342 VDDPVCLQDVMPTILDACGVDVPASVDGASTLPL 375 >UniRef50_Q482E2 Sulfatase family protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q482E2_COLP3 Length = 499 Score = 127 bits (320), Expect = 7e-28, Method: Compositional matrix adjust. 
Identities = 122/457 (26%), Positives = 197/457 (43%), Gaps = 78/457 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF+ D ++ Y + T NID LA++ F AY+ PVC P+R + TG Sbjct: 52 KPNILFIAVD-DLKPLIRDYGTAKVQTPNIDKLASQSTVFTRAYSQYPVCGPSRMSILTG 110 Query: 63 IYANQSGPWT-----NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G +V P ++ T+ ++FK+ GY T GK Sbjct: 111 LRPESNGIMNLKDKIRDVNP--SVITLPQFFKNNGYETAATGK----------------- 151 Query: 118 WDADYWFDGANYLSELTEKEISLW-------RNGLNSVEDLQANHI---DETFTWAHRIS 167 FD N S +E+E+ W ++GL L I DE F I Sbjct: 152 -----IFDPRNTTSR-SEEEVLSWSIPYQRPKHGLKGKTRLAVESIDEPDEKFVDGG-IL 204 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG--EKAQDD---- 221 R L+Q A ++PF + V + +PH PF P +Y + Y+ ++L + A +D Sbjct: 205 KRGKKLLKQMANKNKPFFLAVGFKKPHLPFVAPKKYYDLYSRESFDLASYQSAPEDADTT 264 Query: 222 -LANKPEHHRLW-------AQAMPSPVGDDGLYHHPL----YFACNDFVDDQIGRVINAL 269 L +K + R + + P P G H YFA F+D +G ++ L Sbjct: 265 YLFHKNQELRGYKPTPIKGGEIKPYPKGKLSSAHQKELLHGYFASVSFIDSLVGELLEEL 324 Query: 270 TPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH 328 Q ENT +++ DHG +G H L K M + +PLII+ P + + PV Sbjct: 325 EKTGQAENTVIVFWGDHGFHLGDHGLWGKHTTM-EQANHVPLIIKIPGSKANRYAKPVEL 383 Query: 329 IDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEI-EHDSFGGF-IPVRC--- 383 +D+ P++ A + P L G +++++ G + ++ I ++ G + +R Sbjct: 384 LDVFPSLTEAAGLSIPNNLQGTSLVSLVT--GKLKSIDKVAISQYKRKGAYGYSMRTEQY 441 Query: 384 ----WVTDDFKLVLNLFTSDELYDRRNDPNEMHNLID 416 WVT K+V +LYD NDP E N+I+ Sbjct: 442 RYTQWVTPSGKVVYR-----DLYDLINDPLETKNIIN 473 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 127 bits (320), Expect = 8e-28, Method: Compositional matrix adjust. 
Identities = 108/380 (28%), Positives = 162/380 (42%), Gaps = 86/380 (22%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN + D +GCY + T ++D LA+EGIRF + Y+ SPVC+P+RA L Sbjct: 1 MKRPNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLL 60 Query: 61 TGIYANQSGPWTNNVAPGK--------NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 TG Y ++G ++ GK +T+ K+ GYHT GKWHL +G Sbjct: 61 TGKYPAKAG--VTSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAEYGPN 118 Query: 113 ECPPEWDADYWFDGA--NYLSELTEKEISLW--RNGLNSVEDLQANHIDETFTW------ 162 +D Y F +Y S I W G+N V DL N ET W Sbjct: 119 --AHGFDQFYGFRAGCIDYYS-----HIFYWGQGGGVNPVHDLWRN---ETEVWENGEYM 168 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 I+ A ++ A DEP+ M V+Y+ PH+P P YL+++ D Sbjct: 169 TEAITREATSYI-DAAPDDEPYFMYVAYNAPHYPMHAPKAYLDRFPDL------------ 215 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIY 281 P R+ A + + VDD +G ++ AL + E+T + + Sbjct: 216 ---PPDRRIMAAMIAA-------------------VDDGVGEIVKALKQKGAYEDTIIFF 253 Query: 282 TSDHGE-------MMGAHKLISKG---------AAMYDDITRIPLIIRSPQGERRQ---- 321 +SD+G + G L G A++++ R P I+ P G Q Sbjct: 254 SSDNGPSTESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQQGQI 313 Query: 322 VDTPVSHIDLLPTMMALADI 341 D + +D+ PTM+ L+ I Sbjct: 314 SDEMFAMMDIFPTMLELSGI 333 >UniRef50_A6DIH4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIH4_9BACT Length = 621 Score = 127 bits (319), Expect = 9e-28, Method: Compositional matrix adjust. 
Identities = 119/461 (25%), Positives = 198/461 (42%), Gaps = 42/461 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N L +++D + + Y T N+D A +FN AY PVC P+RA + Sbjct: 154 KKLNVLMIVSD-DLNHYIKSYGDPQAITPNLDKFMAMSTQFNKAYCQYPVCGPSRASFLS 212 Query: 62 GIYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 G+Y S TN +V P + M +F++ GY T GK H +G E Sbjct: 213 GLYPESSLVITNTQYLRDVNPSAD--NMLEHFRNNGYWTGAAGKIF---HSTYGMMEKGT 267 Query: 117 EWDADYWFDGANYLSELTEK------------EISLWRNGLNSVEDLQANHIDETFTWAH 164 D F A L K + +N + DL + E H Sbjct: 268 SLDEYEKFSNAENPQLLLLKKRWIKEGKPGDFKAYFNKNKVKDQADLVLGYGTELRDNQH 327 Query: 165 ---RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 R + R +++ + ++PF M +PH PF P +YL+ Y + ++D Sbjct: 328 GDGRNARRVAQWIKNNSAGEKPFFMACGIVKPHTPFYAPKKYLDLYPKDKLIFDDVPEND 387 Query: 222 LANKPEHHRLWA-QAMPSPVG----DDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRE 275 NKP+ + QA +G ++ Y+ Y C F+D Q+ +++AL Q + Sbjct: 388 WDNKPKVAGVKRYQAFRGELGVNDRENRKYYLQSYLGCISFMDAQVKVLMDALKESGQMD 447 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLP 333 NT +++ SDHG +G H + K ++++ R+P I P G +Q D+ ID+ P Sbjct: 448 NTVIVFMSDHGFQIGEHFMYGK-VTLFEECARVPFGIIYPGNPGAGKQSDSLAELIDVYP 506 Query: 334 TMMALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEI--EHDSFGGFIPVRCWVTDDFK 390 T++ L + +P L G++++ V + + V Y + G I WV + Sbjct: 507 TLLDLCKLPQPSHKLQGKSLVPVTKDTSLQVRNEAYTVVTRGKLMGRAIRKGSWVYAHWG 566 Query: 391 LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 ++ ELY+ DP + +NL+ D +A V +M AL Sbjct: 567 SDRDV----ELYNMDKDPKQYNNLVKDPEYAKVLKQMDKAL 603 >UniRef50_Q7UER3 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae RepID=Q7UER3_RHOBA Length = 492 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 126/461 (27%), Positives = 192/461 (41%), Gaps = 77/461 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K N L + D + GCY +++ NIDSLAA+GI+FN + +P C +R + T Sbjct: 42 KPKNVLLICVDDLRPEL-GCYGADYVSSPNIDSLAAKGIQFNRHFVQAPTCGASRFAMLT 100 Query: 62 GIYANQSGPWTNNVA----------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G Y GP N+ P +M R+F+D GY + +GK H G Sbjct: 101 GCY----GPSGNHALFQRAKKIAKDPTSVTPSMPRWFRDHGYTSVSVGKV---SHHPGGR 153 Query: 112 G----------ECPPEWDADY-----WFDGANYLSELTEKEISLWRNGLNSVEDL--QAN 154 G E P WD W + L + EI + ++ + +A Sbjct: 154 GGADWNEEAEIEMPGAWDRHLMPTGPWQHPRGAMHGLADGEIRKDASQMDVFQSAGGEAK 213 Query: 155 HIDETF--TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY 212 + D+ T + ++ A D A++PF + V + PH PF P EY++ Y Sbjct: 214 YPDDLILETSLNELTTLAED------SANKPFFLAVGFIRPHLPFGAPAEYMKPYRQSVL 267 Query: 213 ELGEKAQDDLANKPEHHRLWAQA---------MPSPVGD----DGLYHHPLYFACNDFVD 259 + E NKP W ++ P D D + H Y AC + D Sbjct: 268 PMIEH-----PNKPFGQTTWHRSGEFMRYNRWGKDPNQDAEFADAVRRH--YAACVSYAD 320 Query: 260 DQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-G 317 +G V+ L RE+T V+ DHG +G H + K A++++ PLII P Sbjct: 321 ANVGEVLKQLDELGLRESTVVVVWGDHGWHLGEHAIWGK-HALFEESLHSPLIIHDPSMS 379 Query: 318 ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV-KEPRGVMVEFNRYEIEHDSFG 376 Q D V ID+ PT+ LA++ P + GE+++ V K+P E E S+ Sbjct: 380 NASQTDAIVETIDVFPTLCELANLPSPPKVDGESLVPVLKDPAS-------SEGEAVSYA 432 Query: 377 GFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDD 417 +R T +L+ + ELYD DP E N+ D+ Sbjct: 433 KATTIR---TSTHRLISHPKGFHELYDHETDPGETKNIADE 470 >UniRef50_UPI0000E0F7B6 iduronate 2-sulfatase precursor n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F7B6 Length = 499 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 116/446 (26%), Positives = 194/446 (43%), Gaps = 58/446 (13%) Query: 18 MVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQS-----GPWT 72 ++G Y K + NID+LAA+GI F AY PVC +RA + TGI N++ Sbjct: 68 VLGVYGDKNAYSPNIDALAAQGITFTQAYANVPVCGASRASMLTGIRPNKTRFIDYKAKA 127 Query: 73 NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF------DG 126 APG ++ + +++GYHT IGK + D +A D Sbjct: 128 QKDAPGAK--SLPQVLRESGYHTMGIGKIFHNSKDLAKVSWSEKLQNAGMGHATRLNPDS 185 Query: 127 ANYL--SELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPF 184 NYL ++ ++ W ++ DE + ++ +A+ L + A+ ++PF Sbjct: 186 ENYLKTTKFNKRGNGPWYETMDVA--------DEAYPDG-KVKEKALKALTRLAKQEQPF 236 Query: 185 LMVVSYDEPHHPFTCPVEYL-----EKYADFY-YELGEKAQDDLANKPEHHRLWAQAMPS 238 + V + PH PF P +Y EK++ F+ A L E H + Sbjct: 237 FLSVGFIRPHLPFYAPKKYYDLHPREKFSPFFDRNKPRNAPKSLNGSGEIHTYHFKDY-- 294 Query: 239 PVGDDGLYHHPL--YFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGAHKLI 295 D + L Y+A ++D +G VI + + R+NT ++ TSDHG +G H Sbjct: 295 TYNSDAFHMSSLQGYYASVSYIDALVGDVIAQIDSLGLRDNTTIMLTSDHGFNLGEHNFW 354 Query: 296 SKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 +K M + RIP+I+ P + + D V +D+ PT+ + + P + G++ Sbjct: 355 TKH-TMLETSLRIPMIVAGPNIAKDEKTDALVELVDVFPTITEITKVNPPATVQGQSF-- 411 Query: 355 VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS---------DELYDRR 405 VK + V + + F V DDF +FTS + LYD + Sbjct: 412 VKSLQNASVNHKK-----QIYSRFKKGDSVVNDDF-----IFTSYATAENTIEEMLYDHK 461 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDAL 431 DP+E +N++++ R+ V +KM L Sbjct: 462 VDPHETNNVVNEPRYQAVATKMRAQL 487 >UniRef50_C6LAI4 Arylsulfatase n=6 Tax=Bacteria RepID=C6LAI4_9FIRM Length = 481 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 119/454 (26%), Positives = 201/454 (44%), Gaps = 58/454 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF+MTD + +G + T +D+LAA+G+ F++AY+ P C PARA L T Sbjct: 4 KKPNILFIMTDQLRGDCLGIAGHPDVKTPYLDTLAAKGVLFSNAYSACPSCIPARAALHT 63 Query: 62 GIYA--NQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL---------------D 104 G+ ++ + + +A + T+ AGY+T +GK H+ D Sbjct: 64 GMLPEHHRRVGYQDGIA-WRYEHTLAGELSRAGYYTQCVGKMHVHPLRNYLGFHNVELHD 122 Query: 105 GHDYFGTGECPPEWDADYWFDGANY-LSELTEKEISLWRNGLNSVEDLQANH-IDETFTW 162 G+ ++ P ++ + D Y L E +GL+ + +E + Sbjct: 123 GYLHYARYGSVPYRESQHVADDYYYWLKEQKGISADPMESGLDCNSWVARPFPYEEKYHP 182 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE--------- 213 + +++R++DFL++ D+PF ++ SY PH PF P Y + Y D Sbjct: 183 TNWVTDRSIDFLRR-RDPDQPFFLMASYLRPHPPFDAPAYYFDLYKDKKLTPPYVGDWED 241 Query: 214 ---LGEKAQD-DLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 L E+ + D PE L QA +G Y+AC +D QIGR++ AL Sbjct: 242 TKLLKERGRIFDSLTGPEDEELIRQAQ---IG---------YYACITHLDHQIGRLLMAL 289 Query: 270 TPEQREN-TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR-SPQ----GERRQVD 323 T + +N T + +T+DHGE + H K Y IPLII +P+ D Sbjct: 290 TEHELQNDTMIFFTADHGEELCDHHHFRKSLP-YQGSIHIPLIISGNPELTGFAPHSVCD 348 Query: 324 TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRC 383 D++PT++ +A + P+ + G+++LA + G Y S+G Sbjct: 349 EVTELCDIMPTLLDIAGADIPDRVDGKSLLAFADGEG-----REYLHGEHSYGELSNHYI 403 Query: 384 WVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDD 417 D + ++ + DP+E+H+ I+D Sbjct: 404 VTKKDKFCWFSTSGTEHYFVLEEDPHELHDRIED 437 >UniRef50_A3HTC7 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HTC7_9SPHI Length = 1174 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 123/470 (26%), Positives = 207/470 (44%), Gaps = 58/470 (12%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + RPN +F++TD Q + +G + + T +D LA G F +A +P+C +RA LF Sbjct: 29 LNRPNIIFILTDDQRFDALGYAGNQFVQTPEMDRLAESGTYFETAIVTTPICAASRASLF 88 Query: 61 TGIY--ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 TG+Y A+ T N+ + K++GY+T + GK+ + Sbjct: 89 TGLYERAHNFNFQTGNIRAEYMEESYPTILKNSGYYTAFFGKYGVR-------------- 134 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID----ETFTWAHRISNRAVDFL 174 +D N ++ E E S RN N D + + +T +A+DF+ Sbjct: 135 -----YDNLN--NQFDEYE-SYDRN--NQYPDKRGYYFKTIAGDTVHLTRYTGQKALDFI 184 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYL-EKYADFYYELGEKAQDDLANKPEHHRLWA 233 + A D+PF + +S+ PH P +Y + D + DL + Sbjct: 185 DK-APEDKPFSLSLSFSAPHAHDGAPDQYFWQTTTDPLLQNTTIPGPDLGEDE-----FF 238 Query: 234 QAMPSPVGD-------------DGLYHHPL--YFACNDFVDDQIGRVINALTPEQRE-NT 277 QA P V D + Y H L Y+ +D +I ++ L + + NT Sbjct: 239 QAQPQFVRDGFNRLRWTWRYDTEEKYQHSLKGYYRMISGIDLEIAKIREKLKEKGLDKNT 298 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMM 336 +I D+G +G +L K MYD+ R+PLII P+ G + + V +ID+ T+ Sbjct: 299 VIIVMGDNGYFLGERQLAGKW-LMYDNSIRVPLIIYDPRSGNHQDIKDMVLNIDVPATIA 357 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHD-SFGGFIPVRCWVTDDFKLV--L 393 LA +E PE G++++ + E + + + IEH F P T+++K + Sbjct: 358 DLAGVETPESWQGKSLMPIVEGKSQKIGRDTILIEHIWEFENIPPSEGVRTEEWKYFRYV 417 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 N +ELY +DP E++NLIDD + D+ K+ + + K + FR Sbjct: 418 NDKKVEELYHLVDDPKEINNLIDDPEYKDIAIKLRSKTDELISKFGNKFR 467 >UniRef50_D0TVM5 Choline sulfatase n=2 Tax=Bacteroides RepID=D0TVM5_9BACE Length = 491 Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust. 
Identities = 126/498 (25%), Positives = 203/498 (40%), Gaps = 93/498 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRP+ + + TD Q N + L T N+D+LA +GIRF +AY SPV P+RA + T Sbjct: 25 KRPHIILIFTDQQNVNAMSAAGNPFLYTPNMDALANDGIRFTNAYCTSPVSGPSRASIVT 84 Query: 62 GIYANQSG-PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL---------DGHDYFGT 111 G+ A ++G W +N + I T+G + GY T + GKWH+ + Y Sbjct: 85 GLMAREAGVEWNDNSKLSEGIHTVGDLLGENGYRTVWAGKWHIPEIYPQRSKNEIKYLHG 144 Query: 112 GECPPEWDA--DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 E P WDA +W GA LT+ +S Sbjct: 145 FELLPFWDAPNKHWLLGAETDPPLTDAVVS------------------------------ 174 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEY-LEKYADFY--------YELGEKAQ- 219 FL ++P + +SY PH P + E +D Y+L E Sbjct: 175 ---FLDGYDEREKPLFLAISYHNPHDICMYPRKVGWETMSDSLLNIRPFGKYKLPEPMGV 231 Query: 220 --DDLANKP------------EHHRLWAQAMPSPVGDDGLYHHPL-----------YFAC 254 D L+ P + + P+P GD+ Y+ Sbjct: 232 HPDSLSYLPPLPGNFSKNVDEPEFIIDKRVKPNPYGDEVQLSSRFSGREWQAYLNSYYRL 291 Query: 255 NDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR 313 + VD +IG VI AL EN+ +I+TSDHG+ M AH+ +K + Y++ ++PLI+ Sbjct: 292 TELVDKEIGEVIEALKRNGMYENSLIIFTSDHGDGMAAHEWAAK-LSFYEESVKVPLIMV 350 Query: 314 SPQG-ERRQVDTP-VSHIDLLPTMMALADIEKPEILPGENILAVKEPRG------VMVEF 365 P+ +R V++ VS +DL+PT A + G ++ P V+ E Sbjct: 351 LPEKWQRGAVNSGLVSLVDLVPTFCDYAGVSPKTNFAGMSLRRAIYPTEERWRDFVVAEL 410 Query: 366 NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRS 425 + + G I + + + + +++L+D DP E NL + ++ Sbjct: 411 ADHLKDRTRKGRMIRTGRY---KYAIYSSGERNEQLFDLMTDPGETINLAYSKEYWEILK 467 Query: 426 KMHDALLDYMDKIRDPFR 443 K L+ +M + D F+ Sbjct: 468 KHRSLLVKWMKERGDNFK 485 >UniRef50_C5EPJ8 Sulfatase n=8 Tax=Bacteria RepID=C5EPJ8_9FIRM Length = 448 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 133/484 (27%), Positives = 213/484 (44%), Gaps = 88/484 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLN-TQNIDSLAAE-GIRFNSAYTCSPVCTPARAGL 59 ++ N +F +D Q + +GC +G+PL T +D A E + F++A+T PVC PARA L Sbjct: 7 QKQNIVFFFSDQQRADTLGC-NGQPLPVTPCLDRFACEDAVNFSNAFTPQPVCGPARAML 65 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL--DGHD-YFGTGECPP 116 TG+Y Q+G + N V+ T+ Y ++AGY Y+GKWHL D H+ ++ T P Sbjct: 66 QTGLYPTQTGCYRNAVSLPAEQKTLAGYLREAGYRVAYVGKWHLASDEHENHYETIPVPL 125 Query: 117 EWDA---DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 E DYW A + E T + V D N ++ T +++ AV + Sbjct: 126 ERRGGYDDYWM--AADVLEFTSHGYGGY------VFDKDGNKLEFTGYRTDCLTDHAVRY 177 Query: 174 LQQPARADEPFLMVVSYDEPHHP-----FTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +++ + EPF ++VS+ EPHH + P E++ DF Sbjct: 178 IEE-YDSREPFFLMVSHIEPHHQNDRGDYEGPEGSRERFGDFV----------------- 219 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 A P D +P Y C +D GRV+ AL + T V+Y SDHG Sbjct: 220 ----PPADLEPGKGDWEKFYPDYLGCCHALDTNFGRVVEALKEKGIYGQTMVVYASDHGC 275 Query: 288 MMGAH--KLISKGAAMYDDITR--------IPLIIRSPQGERRQVDTP--VSHIDLLPTM 335 +++ G YDD R +PL+I+ G R+ V+ VS +DL T+ Sbjct: 276 HFRTRSDEVVEHG---YDDYKRNSFEGTIHVPLLIKG-DGFRKGVNEERVVSLLDLPRTI 331 Query: 336 MALADIEKPEI-LPGENILAVKEP---RGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 + A IE ++ L G + V P V ++ + +SF G R T +K Sbjct: 332 LTAAGIETGKLDLQGRPLQEVDAPDWEEEVYIQIS------ESFVG----RALRTRRYKY 381 Query: 392 VLN-------------LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 V+ ++ L+D DP E +NLI + + V+ ++ + ++ K Sbjct: 382 VVYAPEANPWTESGSPVYQEKYLFDLEQDPLERNNLIAEPGYGAVKERLRERIMKLGVKA 441 Query: 439 RDPF 442 + F Sbjct: 442 GEDF 445 >UniRef50_A7HWE6 Sulfatase n=2 Tax=Bacteria RepID=A7HWE6_PARL1 Length = 512 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 138/513 (26%), Positives = 213/513 (41%), Gaps = 80/513 (15%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 LF+ TD + +G K T ID+LA GI + A+ + VC PAR+ + TG Y Sbjct: 4 KILFITTDQMRFDAIGANGQKVARTPAIDALAKAGINYTRAHNQNVVCMPARSTMITGQY 63 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKD-AGYHTCYIGKWH----LDGHDYF------GTGE 113 + G W N V + ++ +Y + GY T IGK H LD H F GE Sbjct: 64 VSTHGVWMNGVPLPVDAPSVAQYLNEKGGYKTALIGKAHFEPFLDLHQQFYESQMARRGE 123 Query: 114 CPPEWDADY---------------WF-----DGANYLSE-LTEK-EISLWRNGLNSVEDL 151 P DY W + NY + L +K +++ G L Sbjct: 124 NGPHRGFDYMELATHSPLILHYNEWMKKNEPEALNYFYQNLNDKFQVNAAGGGETGGCQL 183 Query: 152 QANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY---- 207 N I +++R +D+L D+ F +S+ +PHHP+ P L ++ Sbjct: 184 HFNKIAREHYHTDWVADRTIDWLASVGAGDDWFCW-MSFPDPHHPWDPPQSELHRHPWRD 242 Query: 208 ---ADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGR 264 +FY EK + LA+KP H W V + + P F D DQ+ + Sbjct: 243 TPLPEFYPGSKEKIEAVLADKPRHWMEWYTG--ERVTN---FEAPPEFRAQDMTADQV-Q 296 Query: 265 VINALTPEQRE---------------NTW-----VIYTSDHGEMMGAHKLISKGAAMYDD 304 INA T + E W V++T+DHGE G L+ KG D Sbjct: 297 EINAFTHVENELIDEAIAKVMAYVEKRGWGDDVDVVFTTDHGEFQGEFGLLFKGPYHVDA 356 Query: 305 ITRIPLIIRSPQGER---RQVDTPVSHIDLLPTMMALADIEKPEILPGENIL---AVKEP 358 + R+P+I R + + V+ PV +DL PT +A + PE + G+ + A + Sbjct: 357 LMRLPMIWRPAKSAKVAPAAVEKPVGQVDLAPTFCEIAGLPVPEWMQGKPMPKTDAEGDA 416 Query: 359 RGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL------FTSDELYDRRNDPNEMH 412 +G F ++ +H G + +R D + + L + ELYD NDP + Sbjct: 417 QGRERVFTEWDCKHVD-GTTVGLRTIYRDGYTITAYLPGTIYDGSEGELYDHANDPRQFR 475 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 NL +D +A ++S + L D + +RDP Y Sbjct: 476 NLWNDPAYAKLKSDLLADLKDNLPPVRDPQLEY 508 >UniRef50_O69787 Choline-sulfatase n=53 Tax=Alphaproteobacteria RepID=BETC_RHIME Length = 512 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 107/436 (24%), Positives = 188/436 (43%), Gaps = 29/436 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D + L+ N+ +LA RF++ YT SP+C PARA G Sbjct: 5 KPNILIIMVDQLNGKLFPDGPADFLHAPNLKALAKRSARFHNNYTSSPLCAPARASFMAG 64 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG------TGECPP 116 +++ + N +I T + + AGY+T GK H G D T + P Sbjct: 65 QLPSRTRVYDNAAEYQSSIPTYAHHLRRAGYYTALSGKMHFVGPDQLHGFEERLTTDIYP 124 Query: 117 E---WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 W DY G + I W + L SV I + ++ A Sbjct: 125 ADFGWTPDYRKPG---------ERIDWWYHNLGSVTGAGVAEITNQMEYDDEVAFLANQK 175 Query: 174 LQQPARADE-----PFLMVVSYDEPHHPFTCPVEYLEKYADFYY---ELGEKAQDDLANK 225 L Q +R ++ P+ + VS+ PH P+ ++ + Y D + E+G D+ Sbjct: 176 LYQLSRENDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDCEHLTPEVGAIPLDEQDPH 235 Query: 226 PEHHRLWAQAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 + L V ++ + YFA ++D+++G +I+ LT + ++T +++ S Sbjct: 236 SQRIMLSCDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLDDTLILFCS 295 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEK 343 DHG+M+G L K ++ R+PL+I P TP S++D+ PT+ LA I Sbjct: 296 DHGDMLGERGLWFK-MNFFEGSARVPLMIAGPGIAPGLHLTPTSNLDVTPTLADLAGISL 354 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 E+ P + +++ + +E+ + + P+ +K V ++L+D Sbjct: 355 EEVRPWTDGVSLVPMVNGVERTEPVLMEYAAEASYAPLVAIREGKWKYVYCALDPEQLFD 414 Query: 404 RRNDPNEMHNLIDDIR 419 DP E+ NL ++ R Sbjct: 415 LEADPLELTNLAENPR 430 >UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM29_9BACT Length = 481 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 99/373 (26%), Positives = 163/373 (43%), Gaps = 69/373 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++ D + CY K + T N+D +A EGIRFN Y+ +PVC+ +R GL T Sbjct: 34 ERPNIVLILCDDLGYGDLACYGHKQIKTPNLDQMAKEGIRFNHFYSAAPVCSASRVGLLT 93 Query: 62 GIYANQSGPW------TNNVAPG--KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 G N++G + + + +P KN T + + AGY TC GKWH +G Sbjct: 94 GRSPNRAGVYDWIPHSSESSSPHMRKNEITFPQLLQKAGYATCLSGKWHCNGALINTNQA 153 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNG--LNSVEDLQANHIDETFTWAHRISNRAV 171 P + DYWF N + + ++ RNG L +E ++N A+ Sbjct: 154 QPQDAGFDYWFATQNNAAPSHKNPVNFIRNGVELGPIEGFS----------CQIVTNEAI 203 Query: 172 DFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 ++++ + +E PF + +S+ EPH P P +K D Y + E N+ E Sbjct: 204 NWMEDHVKQNEKQPFFIYLSFHEPHEPIASP----QKIVDTYKGIAEN-----TNQAE-- 252 Query: 230 RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG-E 287 YFA + +D +G ++N L + +NT VI+TSD+G E Sbjct: 253 ---------------------YFANVENLDKAVGSLMNQLKKLKINDNTLVIFTSDNGPE 291 Query: 288 MMGAHKLISKGAAMYDDIT-----------RIPLIIRSPQ--GERRQVDTPVSHIDLLPT 334 + ++ S+ ++ R+P I+ P+ + D +S +D PT Sbjct: 292 TLNRYEAASRSYGSPGELKGMKLWTAEAGFRVPAIMHWPEKIATGQISDQVISALDFFPT 351 Query: 335 MMALADIEKPEIL 347 LA + L Sbjct: 352 FCDLAQASNSKSL 364 >UniRef50_B4D780 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D780_9BACT Length = 496 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. 
Identities = 117/473 (24%), Positives = 214/473 (45%), Gaps = 70/473 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN LF++ D N + C L T NID +A EG+RF + + + +C+P+RA + + Sbjct: 27 KRPNVLFILCDDIRWNAMSCAGHPALKTPNIDRIANEGVRFANMFCTTSLCSPSRASILS 86 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD--------GHDYFGTGE 113 G+YA+ G N + + ++GY T Y+GKWH+ G D+F T + Sbjct: 87 GVYAHTHGVTNNFTEFPEKLVHWPMRLHESGYETAYMGKWHMGEDNDAPRPGFDFFATHK 146 Query: 114 CPPE-WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 + WD + +GA K I + + +++ A+D Sbjct: 147 GQGKYWDTAWNINGAG------SKVIPGYYTTI--------------------VTDMALD 180 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP------ 226 +L++ +P+ + + + PH +T +Y + + E A L +KP Sbjct: 181 WLKK-DHGGKPWALCIGHKAPHSFYTPEEKYAHVFDNVRVPYPESAF-HLEDKPTWMKQR 238 Query: 227 ------------EHHRLWAQAMPSPVGD-DGLYHHPLYFACNDFVDDQIGRVINALT-PE 272 E + + P V D + + H Y+ VDD +GR++ L + Sbjct: 239 LYTWHGIYGPLFEWRKKFPDDRPEAVKDFENMVHG--YWGTILSVDDSVGRLLKYLEDTK 296 Query: 273 QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVS-HIDL 331 Q +NT +++ D+G + G H ++ K A ++ RIP+++R P + +V+ + +D+ Sbjct: 297 QLDNTIIVFMGDNGLLEGEHGMVDKRTA-HEPSMRIPMLVRYPGLAKGKVEEGQALTLDV 355 Query: 332 LPTMMALADIEKPEILPGEN-ILAVKEPRGVMVEFNRYEIEHDSFGGFIP-VRCWVTDDF 389 P+++ L + + + G++ + V+E + YE ++ + P VR TD++ Sbjct: 356 APSLLELCGAKPLDNIQGKSWVKLVREGDPTWRKSWFYEYNYEKQFPYTPNVRAIRTDEW 415 Query: 390 KLV---LNLFTSD----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 K V T D ELY+ + DPNE HNL+ D + A ++ L++ M Sbjct: 416 KYVHYPHGDGTPDRYIGELYNEKTDPNEDHNLVKDPQQAGRIEELKKLLVEKM 468 >UniRef50_A0JVM5 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0JVM5_ARTS2 Length = 475 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 135/489 (27%), Positives = 208/489 (42%), Gaps = 79/489 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF++ D + +G P T +++S AAE A + PVC+P RA L T Sbjct: 4 ERPNILFIIADQFRNSALGFRGQDPTYTPSLNSFAAESKDILHAVSNYPVCSPHRAMLMT 63 Query: 62 GIYANQSG-PWTNNVAPGKN----ISTMGRYFKDAGYHTCYIGKWHLDG--------HDY 108 G + +++G P N G I T + +DAGY T YIGKWHL+ + Sbjct: 64 GQHPHRNGVPLNINSNTGAGLEPGIGTWSQVLRDAGYGTGYIGKWHLEAVTEEDAIWGEG 123 Query: 109 FGTGECPPEWDA----------DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 F G WDA +W+ +Y + W E + + Sbjct: 124 FREGAV---WDAYSPVDRRHGFSFWY---SYGAAHDHMHPHYWVGDAPREEKIVVDQWS- 176 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF-TCPVEYLEKYADFYY-ELGE 216 A ++ A+ FL++ A E F +VVSY+ PH PF P Y +YA EL Sbjct: 177 ----AEHETDIAIGFLRETTDAAESFALVVSYNPPHQPFELAPATYRPRYAQLSARELLT 232 Query: 217 KAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRE 275 + D+ A LYFA +D QIGR++ AL + Sbjct: 233 RPNVDVTGPAGAEAAQAAP--------------LYFAAISAIDHQIGRLLVALEASGHHK 278 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPV--SHIDLLP 333 NT VI+TSDHG +G+H L+ K +++ +P +IR P D V S +D+ P Sbjct: 279 NTIVIFTSDHGMQLGSHGLMFKNVP-WEESMSLPFLIRWPGRIASGPDDKVLISSVDVGP 337 Query: 334 TMMALADI--EKPEILPGEN-----ILAVKEPR-GVMVEFNRYEIEHDSFGGFIPVRCWV 385 T++ LA + +P + G + I A P G + + + G +R Sbjct: 338 TLLGLAGLSTSRPPAMQGADLSSRLIGATTTPVPGPAIYYGP-----PARDGGPGMRGLR 392 Query: 386 TDDFKLVLN----------LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 T KL+ + S +LYD ++DP EM + R A+V+ M L+ + Sbjct: 393 TLSHKLLFSCIPDPAQRSGFLLSAQLYDLKSDPYEMSDQAAS-RPAEVQ-LMGRELVRQL 450 Query: 436 DKIRDPFRS 444 + + DP+ + Sbjct: 451 EIVEDPWAA 459 >UniRef50_UPI0001C35789 arylsulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35789 Length = 520 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 123/477 (25%), Positives = 210/477 (44%), Gaps = 44/477 (9%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +MTD + +G + T +DS+AA+GI F+ AY+ P C PARA L TG+ Sbjct: 35 PNIVLIMTDQMRGDCLGIAGHPDVKTPYLDSIAAKGILFDHAYSACPSCVPARAALHTGM 94 Query: 64 YANQSGPWTNNVAPGKNI-STMGRYFKDAGYHTCYIGKWH------------LDGHDYFG 110 G N TM AGY+T +GK H ++ HD + Sbjct: 95 RQEHHGRVGYQDMVNWNYPHTMAGELAAAGYYTQCVGKMHVHPLRNLMGFHNIELHDGYL 154 Query: 111 TGECPPE--WDA------DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI-DETFT 161 P W+ DY++ +L + + + G+ + I +E + Sbjct: 155 HAYRDPAAAWEESQKQADDYFY----WLKQELGADADVTDTGMECNSWVSRPWIYEEKYH 210 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY---ELGEKA 218 + +S R++DFL++ +PF ++ SY PH PF P Y + Y D +G+ Sbjct: 211 PTNWVSTRSIDFLRR-RDTSKPFFLMASYLRPHPPFDAPQYYFDLYRDKQLTPPAVGDWE 269 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQ-REN 276 +D + + PV + + + Y+AC +D QIGR+I AL + +N Sbjct: 270 DEDFTGDYQRLGRIYDSATGPVDPELIRQAQIGYYACITHLDHQIGRLIQALVEYKLMDN 329 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHI-----DL 331 T +++TSDHGE + H L K Y+ RIP+++ P+ V H D+ Sbjct: 330 TIILFTSDHGEELCDHHLFRKSRP-YEGSCRIPMLLSGPERLIHAAPGTVCHSVAELRDV 388 Query: 332 LPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRCWVTDDFK 390 +PT++ A PE + G+++ + +P G + ++ EH++ G VT+ K Sbjct: 389 MPTLLDAAGAPIPETVDGKSM--IPDPDGTLPVIRQWLHGEHEA--GVNSNHFIVTEHDK 444 Query: 391 LVLNLFTSDELY-DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 V T E Y + D E+HN I D ++ + + L++ + + + + Q Sbjct: 445 YVWYSQTGREQYFNLDEDRRELHNGIADTQYQERIGLLRGLLIEELKEREEGYSDGQ 501 >UniRef50_C6IGG0 Iduronate 2-sulfatase n=2 Tax=Bacteroides RepID=C6IGG0_9BACE Length = 482 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 118/446 (26%), Positives = 193/446 (43%), Gaps = 41/446 (9%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + R N LF+M D + GCY + + T N+D LA+ G+ F +AY PV +RA L Sbjct: 32 VSRMNVLFLMADDMRPEL-GCYGVEAVKTPNMDRLASSGVLFQNAYCNVPVSGASRASLL 90 Query: 61 TGIYANQSGPWTNNVAPGKN----ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 TG+Y + + N A + +F GYHT GK D+ + PP Sbjct: 91 TGVYPHYPDRFVNFSAYASKDCPEAIPLSGWFTKNGYHTVSDGKVFHHMSDHAASWSEPP 150 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVED---------LQANHIDETFTWAHRIS 167 + +D Y +E + E LW N + ++ + +T +++ Sbjct: 151 YRNHPDGYDV--YWAEYNKWE--LWMNSESGKTINPKTMRGPFCESADVPDTAYDDGKLA 206 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG------EKAQDD 221 RA+ L++ ++PF + + +PH PF P +Y + Y L E + Sbjct: 207 ERAIRDLRRMKEMNKPFFLACGFWKPHLPFNAPKKYWDLYKREEIPLAPNRFRPEGLPEQ 266 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTW 278 + N E ++A A S D Y+AC +VD QIG+V++AL ENT Sbjct: 267 VRNSSE---IYAYARVSDTSDADFQREVKHGYYACLSYVDAQIGKVLDALDELGLAENTI 323 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMAL 338 V+ DHG +G H + K M D T +PLIIR P ++ + + V +DL PT+ L Sbjct: 324 VVLLGDHGWNLGEHDFVGKHNLM-DRSTHVPLIIRVPGRKKGKTRSMVEFVDLYPTLCEL 382 Query: 339 ADIEKP-EILPGENILAVKEPRGVMVE---FNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 I +P E L G++ V + + ++E ++ W+ D K Sbjct: 383 CQIPQPAEQLDGQSFAKVFSNLKAKTKDEVYIQWEGGDNAVDQRFSYAEWMKGDVK---- 438 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRF 420 + L+D R D E N +++ ++ Sbjct: 439 --KASMLFDHRIDKEENKNRVNEKKY 462 >UniRef50_A6DG71 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG71_9BACT Length = 515 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 117/494 (23%), Positives = 205/494 (41%), Gaps = 74/494 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGK--PLN-TQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +PN LF+M D +GCY + LN T ID LA++GI+F++ + + +CTP+RA + Sbjct: 29 KPNILFIMADDHTKQAIGCYGSRLSKLNPTPTIDRLASQGIQFDNVFCSNAICTPSRASI 88 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG Y+ +G N + G + + + K AGY T IGKWHL P +D Sbjct: 89 ITGQYSQTNGVLDLNGSIGPDKQFLPKEMKKAGYETAMIGKWHLKKE--------PATFD 140 Query: 120 ADYWFDGANYLSE--LTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 G + W + +D + + I++ ++ +L+ Sbjct: 141 YYCVLPGQGLYHNPIFNIRGSKPWPKNTITKKDQHS---------SDAITDISLHWLKNE 191 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL-------------GEKAQDDLAN 224 +PF ++ + PH F EY ++Y + ++ G DL + Sbjct: 192 RDKSKPFFLMHHFKAPHDMF----EYAKRYESYLEDVHIPEPESLFSVPAGSAGSKDLGS 247 Query: 225 K-PEHHRLWAQAMPSPVGDD--------GLYHHPL--YFACNDFVDDQIGRVINALT-PE 272 ++H W V DD Y L Y C +DD I R+++ L Sbjct: 248 GLSKNHNPWQLPQKLGVSDDIPEPEYTRLSYQKYLKAYLRCVKGIDDNIARLLSYLKDSN 307 Query: 273 QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHID 330 Q +NT +IYTSD G +G H LI K MY++ +P I+ +P + + +++ D Sbjct: 308 QLDNTIIIYTSDQGFFLGEHNLIDK-RWMYEEAMGMPFIVYAPGMIKNNFKNNCLINNTD 366 Query: 331 LLPTMMALADIEK-PEILPGENILAV-----KEPRGVMVEFNRYEIEHDSFGGFIPVRCW 384 PT++ +A ++K P + G++ K V + RY + H + +P Sbjct: 367 FAPTLLEIAGLKKTPNYMQGKSFYKALSNQQKPDEWRTVTYYRYWM-HMAHKLAVPAHFG 425 Query: 385 VTDDFKLVLNLF-------------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + + ++ + S E YD DP EM N + + ++ ++ L Sbjct: 426 IRSESHKLIFFYGRKYGRRGGKPTPISWEFYDLDKDPKEMKNEYKNPEYKEIIKRLKTQL 485 Query: 432 LDYMDKIRDPFRSY 445 L+ + + + Y Sbjct: 486 LEIRKDLNEEDKKY 499 >UniRef50_A6E5R0 Putative sulfatase n=1 Tax=Roseovarius sp. TM1035 RepID=A6E5R0_9RHOB Length = 453 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. 
Identities = 114/458 (24%), Positives = 186/458 (40%), Gaps = 46/458 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + + D + C + T+++D LAA G RF+ A+ SP+C P+R TG Sbjct: 2 QPNIVIINPDQMRWDYASCQGHPFIATRHLDRLAAMGTRFSHAFAASPMCGPSRTSFLTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y + G + R DAGY W D+ Sbjct: 62 KYPIEHGIRQYGGTYDQAQPNALRVLGDAGYVRGI--------------------WGKDH 101 Query: 123 WFDGANYLSELTEKE---ISLWRNGLNSVEDLQANHIDETFTW--AHRISNRAVDFLQQP 177 F G S E E I + + + + +D W R+++ + F+ + Sbjct: 102 TFKGNVIGSLYDEGEDICIGIMGGHPDYINAWDSTSLDVGSKWNLTKRLTDEGLAFIHRQ 161 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG----EKAQDDLANKPEHHRLWA 233 AR +PF + ++Y +PH F CP Y + +EL + D + + R+ + Sbjct: 162 ARTSQPFFLTLNYQDPHPFFACPEPYSSLFHPDQFELSPNYRKAPVDGEITRLTNWRIHS 221 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAH 292 + PV + +Y +VDDQ+GR++N L + ENT V++ SDHGE +G Sbjct: 222 NEINMPVAELK-QAMAIYAGQIRYVDDQVGRILNELEALDLLENTIVLFWSDHGEFIGDF 280 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 + K A Y+ + R P++I P G R V +D + T++ L +++PE Sbjct: 281 GVTHKIPAFYECLIRAPMVIWDPTGRVPRGVCSGLVELMDGMATVLDLCGLKQPEGSHAR 340 Query: 351 NILAVKEPR-------GVMVEFNRYEIEHDSFGG------FIPVRCWVTDDFKLVLNLFT 397 ++ R G++V I G F P T+D+KL L Sbjct: 341 SMAGTVVGRRDVYADSGMLVRQPLEPISGHVIKGAMPPTAFGPGSMLRTEDWKLCLYGED 400 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 EL+D R D E NL R A ++ ++ L M Sbjct: 401 KGELFDLRRDRYETTNLFGAPRHAQIQDELMLRLTQRM 438 >UniRef50_Q0TUK6 Arylsulfatase n=9 Tax=Bacteria RepID=SULF_CLOP1 Length = 481 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 122/464 (26%), Positives = 197/464 (42%), Gaps = 43/464 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +M D + +G + + T N+D +A EG F +AYT P C +RA + TG Sbjct: 2 KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61 Query: 63 IYANQSGPWTNNVAPGKNI-STMGRYFKDAGYHTCYIGKWHL---------------DGH 106 + G N +T+ F AGYHT IGK H+ DG+ Sbjct: 62 MSQKSHGRVGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHVYPERNLCGFHNIMLHDGY 121 Query: 107 DYFG-TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN-HIDETFTWAH 164 +F E + D + E + L GL+ + +E + Sbjct: 122 LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTN 181 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + N ++DFL++ + +PF + +S+ PH P P Y + Y D +L E D AN Sbjct: 182 WVVNESIDFLRRKDPS-KPFFLKMSFVRPHSPLDPPKFYFDMYKD--EDLPEPLMGDWAN 238 Query: 225 KPEHHRLWAQAMPSPVG----DDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWV 279 K + + + G Y+ +D QIGR + AL+ + NT Sbjct: 239 KEDEENR-GKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIF 297 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP----QGERRQV-DTPVSHIDLLPT 334 ++ SDHG+MMG H KG Y+ +R+P I P +G++ +V D + D++PT Sbjct: 298 LFVSDHGDMMGDHNWFRKGIP-YEGSSRVPFFIYDPGNLLKGKKGKVFDEVLELRDIMPT 356 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 ++ A I P+ + G ++ + E R + Y SFG D L + Sbjct: 357 LLDFAHISIPDSVEGLSLKNLIEERNST--WRDYIHGEHSFGEDSNHYIVTKRDKFLWFS 414 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 ++ +D NDP E+ NLID S+ + +DY+ KI Sbjct: 415 QRGEEQYFDLENDPKELTNLID--------SEEYKERIDYLRKI 450 >UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2URC2_AKKM8 Length = 465 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 128/476 (26%), Positives = 202/476 (42%), Gaps = 75/476 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + ++ D +GC K + T ++D LA EG+ + AY +P+C+P+R GL TG Sbjct: 29 PNMIVILADDLGYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLLTGR 88 Query: 64 YANQSGPWTN-----------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 + + G TN + + + Y GY + GKWHL GH G Sbjct: 89 FPKRYGITTNPNIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHL-GHT---KG 144 Query: 113 ECPPEWDADYW--FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDET-FTW-AHRISN 168 PPE +W F G + +KE GLN + +N D+T T+ I++ Sbjct: 145 YTPPERGFTHWWGFLGGSRHYFPVKKEA----EGLNP-SMIVSNFTDKTDITYLTDDITD 199 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 RAV+FLQ+ + +PF M VSY+ PH P E + K+ + + GE+ Sbjct: 200 RAVEFLQEAGKDKKPFFMFVSYNAPHWPNEAKPEDIAKFRNV--QNGERR---------- 247 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 +Y A +D IGR+++AL + E +T V++ SD+G Sbjct: 248 ---------------------VYCAMVYAMDRGIGRILDALKADGLEKDTIVVFLSDNGG 286 Query: 288 MMGAHKLIS--KGAAM--YDDITRIPLIIRSPQGER----RQVDTPVSHIDLLPTMMALA 339 A + +GA ++ R+P IIR P +R PVS +DLLP ++ Sbjct: 287 APEASSCNAPFRGAKRQHFEGGVRVPFIIRYPADKRLVPGSVCRQPVSSVDLLPALLKAN 346 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 P L G +IL + +G V + + +T D K +L + Sbjct: 347 GRHIPRKLDGMDILELVGNKGAPVPRTFFWCTDYT-------SAVLTGDMKYLLVPDRAP 399 Query: 400 ELYDRRNDPNEMHNL-IDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRK 454 + Y+ +DP E +L + AD+ +K L R P S WS + R+ Sbjct: 400 QFYNVADDPQEQRDLYFSRHQDADLLAKKLGTYLTTTPACRFP-DSISWSAKLMRE 454 >UniRef50_C9L4S2 Arylsulfatase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4S2_RUMHA Length = 475 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 117/442 (26%), Positives = 201/442 (45%), Gaps = 34/442 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KR + LF+ D + + T I LA +G+ +++ Y+ PVC PAR L T Sbjct: 4 KRYHVLFIQVDQWGEKFLDFTGNNTIMTPTIHQLARDGVMYSNCYSTCPVCIPARRSLMT 63 Query: 62 GIY-ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL-DGHDYFGTGECPPEWD 119 G++ ++ ++T+ F AGYHT +GK H+ + G + + + Sbjct: 64 GLFPKTHKDRVYSDRMKMPAVTTLAEAFYQAGYHTMAVGKLHVYPQRNRIGFQDVVLQEE 123 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQ--ANHIDETFTW-----AHRISNRAVD 172 Y F G + + +I L NG E L N+ T TW AH + Sbjct: 124 GRYEFGGPD------DYQIWLGENGYIGQEFLHGMGNNTYYTRTWPLSETAHPTTWATGQ 177 Query: 173 FLQQPARAD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 ++Q R D +P +SY PH P EY + Y++ ++ E D ++ + Sbjct: 178 MIKQIKRRDPEKPAFFYLSYTFPHPPLVPLSEYWDMYSE--QDIQEPEYGDWEDESFIFK 235 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 +A + + Y+A +D+QI VI AL E ++T +++TSDHGEM+ Sbjct: 236 ELTEAARYYSHKEMIRAKRAYYAQCTHIDNQIRLVIGALKEEGILDDTILVFTSDHGEML 295 Query: 290 GAHKLISKGAAMYDDITRIPLIIR-SPQGERR-QVDTPVSHI-DLLPTMMALADIEKPEI 346 H ++ K Y++ IPLI +P E R +VD ++ + D++PT++ L IE P Sbjct: 296 FDHGMVGK-RTFYENSAHIPLIFSGNPVSELRGKVDDRIACLEDIMPTLLELCKIEIPSS 354 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRR 405 + G+++ KE R +F EI G+ R ++KL+ + + +++D Sbjct: 355 VEGQSLFE-KERR----DFLYGEISE----GYRATRMIRMGNYKLIYYPYGNKVQIFDIL 405 Query: 406 NDPNEMHNLIDDIRFADVRSKM 427 D NE+H+L F ++ +M Sbjct: 406 QDKNELHDLSKKEEFKSIKEEM 427 >UniRef50_B5CWC2 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC2_9BACE Length = 515 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 125/466 (26%), Positives = 206/466 (44%), Gaps = 72/466 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K N LF+ D + VG G P T N+D LAA G+ F SAY +PV +RA L Sbjct: 24 KPKNVLFIAVD-DLNDWVGFLKGHPNTRTPNMDRLAAMGMVFESAYCAAPVSNASRAALL 82 Query: 61 TGIYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 +G + +G + N K+ T+ +YF + GY++ GK H G P Sbjct: 83 SGFRTSTTGVYGNAEFMRESPVLKDAVTLPKYFSNHGYYSMARGKIF---HQPMGPWGDP 139 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNS-------VEDLQANHIDETFTWAHRISN 168 WD+ G LS ++ NGL V D +DET T + + Sbjct: 140 QSWDSQENLGG---LSLNPPRQKGKQANGLEKQTTGGAVVLDWAGVDVDETKTNDYLNAQ 196 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK----AQDDLA- 223 A L + + D+PF M PH P+ P +Y +++ +L ++ + L+ Sbjct: 197 WAAQELMK--KHDKPFFMACGIFRPHLPWYVPQKYFDRFKLEDIQLPKQDPMETMEKLSP 254 Query: 224 --------NKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINAL--TP 271 NKPEH + + G+ + Y AC + DD IG++++AL +P Sbjct: 255 RALSMTGYNKPEHEF-------NILKKYGMEKEAVRAYLACISYADDCIGQIVDALEKSP 307 Query: 272 EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHI 329 E R+NT V++ DHG +G K+ + +++D +P+II +P + PVS + Sbjct: 308 E-RDNTIVVFWGDHGWHLG-EKMRYRKFSLWDRSCHVPMIIVAPGVTKPGSVCKQPVSLL 365 Query: 330 DLLPTMMALADIEKPEILPGENILAV--------KEPRGVMVEFNRYEIEHDSFGGFIPV 381 DL PT+++LA + + G +I + +P + N + I Sbjct: 366 DLYPTLVSLAGLPANPLNEGNDITPLLQNPNAHWTKPAITTLAQNEHSI----------- 414 Query: 382 RCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 ++ ++ S+ELYD ++DP E NL D ++ADV++ + Sbjct: 415 ---CDGRYRYIIYRDGSEELYDHKHDPLEWKNLAADKKYADVKAHL 457 >UniRef50_A6DG34 Choline sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG34_9BACT Length = 476 Score = 125 bits (313), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 121/439 (27%), Positives = 200/439 (45%), Gaps = 64/439 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTC----SPVCTPARA 57 ++PNF+F+ D Q + + + ++T N+D LA G F + Y VC +RA Sbjct: 15 QKPNFVFLFADDQRADTIRAHGNDFIHTPNLDRLAESGFSFKNNYCAGSYSGAVCVASRA 74 Query: 58 GLFTGIYANQSGPWTNNVAPGKN----ISTMGRYFKD-AGYHTCYIGKWHLDGHDYFGTG 112 L TG Y W N KN + + Y K+ AGY T IGKWH H T Sbjct: 75 MLMTGRY------WNNIPNVKKNGWASLDLLPTYLKEKAGYETYIIGKWHNGLH----TL 124 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 + A + G +++ T+ E+ + G LQA + F+ + +N A+ Sbjct: 125 RAAFQNGASVYMGG---MADHTDFEVQDFVAG-----QLQAKRRAKEFS-STEFANSAIK 175 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY-------ADFYYELGEKAQDDLANK 225 ++++ A +D+PF + V++ PH P P EY ++Y A Y L + Sbjct: 176 YIEE-APSDKPFFLYVAFMAPHDPRNPPDEYRQRYYKNRPPLAKNYKALHPFRNVKFTTQ 234 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSD 284 L + V D L Y+ +D+Q+GR+I+A+ + +NT +IYT+D Sbjct: 235 GRDEGLASWPREKSVISDQLCE---YYGLVTHLDEQVGRIIDAIDQSKHADNTIIIYTAD 291 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRS---PQGERRQVDTPVSHI-DLLPTMMALAD 340 HG MG+H L+ K +Y+ + PLII P GE ++I DL T+ A Sbjct: 292 HGLAMGSHGLLGK-QNVYEHSMKAPLIISGKTVPNGE----SAAFNYIHDLYATLCDYAR 346 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFKLVLNLFTS 398 I KPE + +++ + E EI+ F+P + + +D + L+++ Sbjct: 347 IAKPEAVDAKSLRPLIEG----------EIKQIHEAMFLPFQDVQFAINDGRWKLHIYPQ 396 Query: 399 DE---LYDRRNDPNEMHNL 414 + L+D NDP+E+H+L Sbjct: 397 IDHYLLFDLENDPDEIHSL 415 >UniRef50_Q7UHJ4 Mucin-desulfating sulfatase n=2 Tax=Planctomycetaceae RepID=Q7UHJ4_RHOBA Length = 514 Score = 125 bits (313), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 114/449 (25%), Positives = 203/449 (45%), Gaps = 52/449 (11%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +F+ D Q+T VGCY + + T ++D LA +G+ F+ Y + +C +RA +FTG+ Sbjct: 43 PNIVFLFADDQSTYSVGCYGNQDVLTPSMDQLARDGVLFDKHYNTTAICMASRANVFTGM 102 Query: 64 YANQSGPWTNNVAPGKNI--STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 Y ++G + + + + ++AGY T + GK+ L G G C E D D Sbjct: 103 YEYKTGCNFEHGNMRQEVWAKSYPVLLREAGYLTAFAGKFGLVVD---GKGLC--EDDFD 157 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 +W G + T K S+ + T ++A D +++ + D Sbjct: 158 FWGGGPGQTNYATAKN--------ESMRKYAKQYPHSTLSYAA----FGKDVIREATKQD 205 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYY----ELGEKAQDDLANKPEHHRLWAQAMP 237 +PF + +S+ PH P T + YA + G + LA + + R + + Sbjct: 206 QPFCLSISFKAPHKPATPDPRFDHVYAGKQFTKPLNFGREYSKHLAPQSKLGRQYPRFSE 265 Query: 238 SPVGDD-----GLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 D YH +Y +D +G + + L + +NT VIYTSD+G + G+ Sbjct: 266 WKYDTDYDDEMAKYHQQVY-----AIDVALGMIRDELKAQGISDNTVVIYTSDNGYICGS 320 Query: 292 HKLISKGAAMYDDITRIPLII---RSP-QGERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 H SK M ++ +R+PLII RSP G++ + + +ID PT++ A + P + Sbjct: 321 HGYGSKVLPM-EESSRVPLIIYDPRSPLNGQQHRCNELTGNIDFAPTILEFAGLPAPSNM 379 Query: 348 PGENILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRCW--VTDDFKLVL------NLFTS 398 G++++ ++ P + ++ + G +P +T +K + + Sbjct: 380 DGKSLIKLLRSPE----QDGHEQMSFINVFGPLPTHSLTCLTKRYKYTYWWYGDDQMQPT 435 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKM 427 +EL+D +NDP E+ NL D + V M Sbjct: 436 EELFDTQNDPLELVNLASDSESSAVLETM 464 >UniRef50_B4D026 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D026_9BACT Length = 489 Score = 125 bits (313), Expect = 5e-27, Method: Compositional matrix adjust. 
Identities = 125/464 (26%), Positives = 199/464 (42%), Gaps = 57/464 (12%) Query: 5 NFLFVMTDTQATNMVGCYSGKP--LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 N LF+++D + + P L T N+D LA EG +A+ + +C+P+RA + TG Sbjct: 29 NILFILSDDHRWDFMSFMPEAPKFLETPNLDRLAKEGAHLRNAFCSTSLCSPSRASILTG 88 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y + G N I Y + AGY T ++GKWH+ G P DY Sbjct: 89 QYMHHHGVVDNQRPEPAAIRYFPEYLRAAGYETAFLGKWHM------GEDSDNPRKGFDY 142 Query: 123 W--FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 W F G + + T NG + D ++ + +++ A+D+L+ R Sbjct: 143 WAGFRGQGHYFDDTYN-----INGEHKKIDGYSSDV---------LTDLALDWLKH--RG 186 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADF---YYELGEKAQDDLANKP---EHHRLWAQ 234 D+PF + Y PH+PF +Y Y E +++ +P R Sbjct: 187 DKPFFCELCYKAPHYPFEPAPRNKGRYEKAPIPYPETMANTEENYLTQPRWVRERRFGIH 246 Query: 235 AM-----------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYT 282 + P P +D LYH Y +D+ IGR++ L R++T V+Y Sbjct: 247 GVDHMETGRFDHDPVPSFED-LYHR--YSETVFSMDENIGRLLKYLDNTGLRDSTIVVYM 303 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALAD 340 +D+G +G H K A ++ R+P+++R+P + V V +ID+ PT++ A Sbjct: 304 ADNGFELGEHGFYDKRDA-FETSMRVPMLLRAPGAVKPGTVVTKMVQNIDIAPTLLEAAG 362 Query: 341 IEKPEILP---GENILAVKEPRGV-MVEFNRYEIEHDSFGGFIPVRCWV-TDDFKLVLN- 394 + P P G + + + R V + YE + P + TD +K V Sbjct: 363 VTVPADAPKMDGYSFWPLVQGRDVPWRDHILYEYYWERNFPATPTTFAIRTDRWKYVYTH 422 Query: 395 -LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 L+ D LYD DP E HNLID F + K+ L D +DK Sbjct: 423 GLWDRDGLYDLETDPVERHNLIDVPAFREQGGKLRGQLFDELDK 466 >UniRef50_Q15NY5 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15NY5_PSEA6 Length = 486 Score = 125 bits (313), Expect = 5e-27, Method: Compositional matrix adjust. 
Identities = 113/407 (27%), Positives = 175/407 (42%), Gaps = 53/407 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD Q +G Y K + T NID LA +G+ FN+A T +PVC+ ARA TG Sbjct: 27 KPNIIVIMTDDQGQWTLGAYE-KHMKTPNIDYLADQGVLFNNAMTSAPVCSAARASFHTG 85 Query: 63 IYANQSGPWTNNVAPGKNI--------STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 +Q G + + ++ G + +G + +GY T GKWH+ Sbjct: 86 KMPSQHGVY-DFLSEGNGFDDKWLQGETFLGERMQQSGYRTGLFGKWHVK------EPSL 138 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF----TWAHRISNRA 170 P D W + + WRN + + E F A ++ +A Sbjct: 139 EPAGGFDRWISHDAFKAG--------WRNQYQHRGKVAFSKDGEAFEHTGVQARFLTEKA 190 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 ++F+ + D+PF + ++Y EPH PF E L Y + K D N Sbjct: 191 IEFIDES--TDKPFFININYVEPHFPFEGLPERL---VSQYRPVARKLLRDGGNSS---- 241 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR---ENTWVIYTSDHGE 287 L + + V D Y A +DDQ+G++++AL E R +NT + + SDHG Sbjct: 242 LALASKDTAVPKDHEEKLSQYLAAISLIDDQVGQIMDAL--EGRGLLDNTIIAFVSDHGM 299 Query: 288 MMGAHKLISKGAA-----MYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALA 339 +MG + L K A Y++ RIP II P+ R+ D V +DL T++ A Sbjct: 300 LMGQYGLYGKTNASFPYNFYEETVRIPFIIYGPKSLVQGRQSRDEFVDLLDLHNTILDFA 359 Query: 340 DIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCW 384 + + PG I + V ++ RY+I G I W Sbjct: 360 GDKTFTEQDGPGRTIRPLLNAERVQ-DWKRYQIAERGNGRMITDGKW 405 >UniRef50_C5BXG2 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5BXG2_BEUC1 Length = 447 Score = 125 bits (313), Expect = 5e-27, Method: Compositional matrix adjust. 
Identities = 118/437 (27%), Positives = 177/437 (40%), Gaps = 59/437 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + + +D Q + G T N D +A G A+T P+C P+RA + T Sbjct: 4 QRPNVVVLFSDQQRADTTGMAHNPADVTWNFDRMATHGTWSPYAFTVQPLCAPSRAAVLT 63 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y +G N + +++ TMGR F+DAGY T Y+GKWHL D + + + Sbjct: 64 GTYPTTNGVHRNGLPLTEDVPTMGRLFRDAGYETFYVGKWHLADADPVPRHQ---QGGFE 120 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 +W GAN L E +E V D D + + + + F+ + D Sbjct: 121 HWL-GANLL-EFSEDAFH------TRVFDADGAPHDLPGYRSDALIDAVIRFVTKDRDPD 172 Query: 182 EPFLMVVSYDEPHH-----PFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 PF + S EPHH + P Y E+Y +W A Sbjct: 173 RPFFVFCSVLEPHHQNEVDAYLAPDAYAERY---------------------QGVWQPAD 211 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGAHKLI 295 G H Y VD+ +GR+++AL + + ++T V+ TSDHG Sbjct: 212 LVGAGSTAPRHLAGYLGQVKRVDEGLGRLLDALRSVGEYDDTVVVQTSDHGCHFKTRNNE 271 Query: 296 SKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 K + + T +PL R P G R V+ + +D+ PT++ A I++P L G +IL Sbjct: 272 YKRSG-HAASTHVPLAFRGPCFDGGGR-VEELCTLLDVAPTLLDAAGIDRPAHLQGRSIL 329 Query: 354 AVKEPRGVMV-----EFNRYEIEHDSFGGFIPVRCW----------VTDDFKLVLNLFTS 398 V RG E ++ G + W DD + + Sbjct: 330 DVLRTRGTRADPGWREDVYIQVSESQVGRTLRTARWSYGIEAPGAHARDDAG--SDTYVE 387 Query: 399 DELYDRRNDPNEMHNLI 415 LYD NDP E NLI Sbjct: 388 SYLYDVANDPAEQVNLI 404 >UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C586_BEUC1 Length = 478 Score = 124 bits (312), Expect = 6e-27, Method: Compositional matrix adjust. 
Identities = 129/469 (27%), Positives = 203/469 (43%), Gaps = 57/469 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+ D +GC+ T +ID+LAA G RF +Y +PVC+P RA L TG Sbjct: 15 RPNIVLVVVDDLGWRDLGCFGSTFYETPHIDALAASGTRFTHSYAAAPVCSPTRASLLTG 74 Query: 63 IYANQSGP--WTNNVAPG------------KNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 Y + G W A G ++ + R + GY T ++GKWHL G Sbjct: 75 KYPARVGVTNWIGGHAIGALRDVPYFHGLPQDEYALARALRAGGYRTWHVGKWHLGG--- 131 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRN-GLNSVEDLQANHIDETFTWAHRIS 167 G PE + FD N + +S + G+ ++ED D F R++ Sbjct: 132 ---GRHLPE---HHGFD-LNVGGSASGSPVSYYAPYGIGALEDAP----DGEFL-TDRLT 179 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 + AVD ++ + D PFL+ + + H P P +EKY LG A + Sbjct: 180 DVAVDLVR--SSDDAPFLLNLWHYAVHTPIEAPAHLVEKYRHKAETLGLPTHGPDAVEAG 237 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 H V + P Y A + +D +GR++ AL + ++T +++TSD+G Sbjct: 238 EHMPARHLRSERVRRRRIQSDPTYAAMLETLDGAVGRLVTALRDVGKLDDTLIVFTSDNG 297 Query: 287 EMMGA------HKLISKGAA-MYDDITRIPLII----RSPQGERRQVDTPVSHIDLLPTM 335 + A + +S+G M D TR+P I+ R P G R D P + D PT+ Sbjct: 298 GLSTAEGSPTCNAPLSEGKGWMADGGTRVPTIVSWPGRVPAGARS--DLPFTSPDFYPTL 355 Query: 336 MALADIEKPEILPGENILAVKE-PRGVMVEFNRYEI----EHDSFGGFIPVRCWVTDDFK 390 +A A + + LP +++ V P +R I H S G P +K Sbjct: 356 LAAAGLTQ---LPEQHVDGVNLWPAWQGAPLDRGPIFWHYPHYSNQGGAPSAAVRDGRWK 412 Query: 391 LVLNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 LV + DEL+D D +E H++ R DV +++ L ++ + Sbjct: 413 LVRHFGIEHDELFDVVADVSESHDVSG--RRRDVVARLSVTLDSWLADV 459 >UniRef50_A6DJJ1 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJJ1_9BACT Length = 510 Score = 124 bits (311), Expect = 7e-27, Method: Compositional matrix adjust. 
Identities = 115/474 (24%), Positives = 201/474 (42%), Gaps = 61/474 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N LF+ D M+GCY + + T NID +A G F +A +C P+RA L T Sbjct: 29 KKMNVLFIPID-DLKPMLGCYGDQAIITPNIDRIAERGTVFLNASCQQAICGPSRASLMT 87 Query: 62 GIYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGK---------------- 100 G+Y + + W ++ P +I ++ +YFK GY T +GK Sbjct: 88 GMYPDHTKVWDLATKMRDINP--DILSIPQYFKQQGYETTGVGKTFDPRCVDGGKFQDKP 145 Query: 101 -WHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVED------LQA 153 W + H G G PE A W A + T K + + D + Sbjct: 146 SWSIPYHKAGGKGYANPEV-AKAWKKAAELVKGRTFKMGYQRNKAMARLGDPICRPATEC 204 Query: 154 NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 + + ++ L++ ++AD+PF + V + +PH PF P +Y + Y + Sbjct: 205 MDVPDHVYKDGAVARVGAKLLEELSKADKPFFLSVGFAKPHLPFVAPKKYWDMYNSHDIQ 264 Query: 214 LGE---KAQDDLANKPEHHRLWAQAMPSPVGDDG---------LYHHPLYFACNDFVDDQ 261 + E A++D K + L A S + + G L H Y A ++D Q Sbjct: 265 VAEYQKSAKND--TKIAYKSLGEIAAYSDMPEKGPIDQETQKHLIHG--YMATTSYMDAQ 320 Query: 262 IGRVINALTPEQRENTWVIYT-SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER- 319 +G +++ L N +I DHG +G H + +K ++ R PL+I +P+G + Sbjct: 321 LGLLLDKLEELGIANNTIICLWGDHGFHLGDHGMWTKHTN-FEQAVRSPLLIAAPKGFKP 379 Query: 320 RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEF---NRYEIEHDSFG 376 + PV +D+ PT+ LA ++ P LPG+++ V + V + +Y + + G Sbjct: 380 NSTNAPVELVDIFPTLCDLAGLDIPTHLPGKSLAPVMKDTSTSVRYAALGQYPRGNKTMG 439 Query: 377 GFIPVR-----CWVTDDFK--LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADV 423 + W+ D++ + + +L+D DP E NL + + + Sbjct: 440 YTLRSERYRYVKWLNLDYRKSVAKGKLVATQLFDYEKDPLETVNLAANPEYKKI 493 >UniRef50_A9ECS8 Sulfatase n=3 Tax=Bacteria RepID=A9ECS8_9FLAO Length = 574 Score = 124 bits (311), Expect = 8e-27, Method: Compositional matrix adjust. 
Identities = 123/512 (24%), Positives = 214/512 (41%), Gaps = 92/512 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYS---GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 KRPN +++M D A + Y GK T NID LA G F + + + +C P+RA Sbjct: 34 KRPNIIYIMADDHAAQAISAYGHPIGKLAPTPNIDRLAKNGAIFKNNFCTNSICGPSRAV 93 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG +++ +G N + T+ + K AGY+T IGKWHL G+ PE Sbjct: 94 VLTGKHSHINGFRMNGERFDGSQQTLPKLLKKAGYNTAIIGKWHLHGY---------PE- 143 Query: 119 DADYWF---DGANYLSEL---TEKEISLWRNGLNSVEDLQANHIDETFTWAHR---ISNR 169 DYW D NY + + I + ++S AN D T + I++ Sbjct: 144 GFDYWNILNDQGNYYNPQFIKIQDTIHFNKKHIDSTAHWTANLPDTTTVKGYATDLITDY 203 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHH---------------PFTCPVEYL---------- 204 A+D++ + +D+PF +++ + PH F P Y Sbjct: 204 AIDYIDKKKNSDQPFFIMMHHKAPHRNWMPALRHLNKYDSVQFPLPETYFTNHENSTASK 263 Query: 205 EKYADFYYELGEKAQDDLANK-------------------PEHHRLWAQAMPSPVGD--- 242 E+ Y ++ E + K PE W +A P D Sbjct: 264 EQLQTIYRDMYEGHDLKMTKKKGSPELAWNPWKTDFERMTPEQRAAWDKAY-QPKNDAFH 322 Query: 243 ------------DGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 G + Y + VD+ +G++++ L ENT V+YTSD G + Sbjct: 323 DANLTGKALAEWKGQRYLQEYLSTIASVDEGVGKILDYLEANGLAENTIVVYTSDQGFYL 382 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADIEKPEIL 347 G K MY++ ++PL+I+ P+ + V+ ++D T + A+++ PE + Sbjct: 383 GEKGWFDK-RFMYEESLKMPLLIQYPEKIKSGTVVEGLTQNLDFAETFLDFANVDIPEDM 441 Query: 348 PGENILAVKEPRGVMVEFN----RYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE--L 401 G++ + + + +F + ++ +F + T +KL+ DE L Sbjct: 442 QGKSFVGLLDGSESDEDFRDAVYYHYYDYPAFHMVKKMYGIRTKRYKLIHVYDDIDEWEL 501 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 YD + DP E+ NLI+D + ++ +K+ L++ Sbjct: 502 YDLQTDPQELTNLINDENYDEIETKLRKRLVE 533 >UniRef50_C7MI43 Arylsulfatase A family protein n=5 Tax=Bacteria RepID=C7MI43_BRAFD Length = 496 Score = 124 bits (311), Expect = 8e-27, Method: Compositional matrix adjust. 
Identities = 121/458 (26%), Positives = 193/458 (42%), Gaps = 72/458 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RP+ + V D + +G ++T N+D LA G F AY+ +P C PAR +FTG Sbjct: 12 RPHIILVCVDEMRADAMGAAGNPHIDTPNLDDLARGGYHFTRAYSATPTCVPARVAMFTG 71 Query: 63 IYANQSGPWTNNVA---PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G + P T+ ++AGY T +GK H+ + C +D Sbjct: 72 KSPELHGRYGYREGISFPEAYPVTLQSTLREAGYQTFGVGKMHV----FPDRARC--GFD 125 Query: 120 ADYWFDGANYLSELTEKEIS---------LWRNGLNSVEDLQANHI------------DE 158 DG + S + S L R + D Q I +E Sbjct: 126 EVLLHDGFLHTSRRLSRGPSAAIDDYVEFLRRETGDPRADYQETGIGCNAMTARPWEREE 185 Query: 159 TFTWAHRISNRAVDFLQQPARAD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 +++ ++ FL AR D PF + +S+ PH PF P EKY E Sbjct: 186 RLHPTRWVADESLRFL---ARRDPTRPFFLYMSFHRPHAPFDPPAWLWEKYRG--REFPR 240 Query: 217 KAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL---YFACNDFVDDQIGRVINALTPEQ 273 + + ++ + HR + H + Y+ +F+D Q+ R+ L+ + Sbjct: 241 RPLGEWVSRFDEHRQDFGSEAEFGAQKETTHQQVRAGYYGSIEFIDLQLNRLKETLSDQG 300 Query: 274 R-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ------GERRQVDTPV 326 E+T V++ SDHG+MMG H + K A Y+ +R+PL++ P G R++DT Sbjct: 301 LLEDTVVVFVSDHGDMMGDHDMYRKSVA-YEGSSRVPLVVHVPPRWREGWGAPREIDTLA 359 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPR------GVMVEFNRYEIEHDSFGGFIP 380 DLLPTM+ LA +E P + G ++ E R V+ R+ ++ Sbjct: 360 ELRDLLPTMLDLAGLEVPGGVDGISLRPDGEEREHLHGEHVIGSLGRHSMQ--------- 410 Query: 381 VRCWV-TDDFKLVLNLFTSD---ELYDRRNDPNEMHNL 414 W+ +D FK V F+ D +L+D DP E+H+L Sbjct: 411 ---WIRSDRFKYV--WFSGDGHEQLFDLVADPQELHDL 443 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 124 bits (311), Expect = 8e-27, Method: Compositional matrix adjust. 
Identities = 123/445 (27%), Positives = 192/445 (43%), Gaps = 87/445 (19%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D + VG K + T ++D LAA G+ F + Y P C+P+RAGL TG Sbjct: 44 RPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTG 103 Query: 63 IYANQSG---------PWTNNVAPGKNIS--TMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 + + G W PG +S T+ K+AGY T IGKWHL Sbjct: 104 RHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHL-------- 155 Query: 112 GECPPEW----DADYWF----DGANYLSELTEKEISLW-RNGLNSVEDLQANHIDETFTW 162 G+ P W D WF G +Y +L K+ L G V+ H+ + F Sbjct: 156 GDAKPFWPNRRGFDEWFGFSGGGFSYWGDLGMKDPLLGVHRGDEPVDPKTLTHLTDDF-- 213 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 S AV F+Q+ EPF + ++Y+ PH P +L+K A + E G +A Sbjct: 214 ----STEAVKFIQR--HETEPFFLYLAYNAPHAPDHATRAHLQKTA--HIEYGGRA---- 261 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 +Y A +D+ IGRV++ + ENT +I+ Sbjct: 262 ---------------------------VYGAMVAGMDEGIGRVVDQIRESGLGENTMIIF 294 Query: 282 TSDHG---EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMM 336 SD+G E +++ R+P ++ P R + ++P++ +DL PT + Sbjct: 295 YSDNGGRREHAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTAL 354 Query: 337 ALA--DIEKPEILPGENILAV----KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 A A D + + L G+N+L V K+ F RY + DS+G + W K Sbjct: 355 AAAGMDPSQNDKLDGQNLLPVLTDDKQRLPERPLFWRYSMGDDSYGYAVRDGNW-----K 409 Query: 391 LVLNLFTSDE-LYDRRNDPNEMHNL 414 L+ + + + L+D NDP E +L Sbjct: 410 LIDSRYKDRKLLFDLANDPWEREDL 434 >UniRef50_UPI0001746432 sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746432 Length = 517 Score = 124 bits (310), Expect = 1e-26, Method: Compositional matrix adjust. 
Identities = 135/496 (27%), Positives = 201/496 (40%), Gaps = 94/496 (18%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 + RPN LFV D + +GC G P +T N+D LAA G+ F +A+ +PVC +R + Sbjct: 39 VARPNVLFVAVD-DLNDWIGCMKGHPQAHTPNMDRLAARGVLFTNAHCAAPVCLASRTAV 97 Query: 60 FTGIYANQSGPWTN----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH-----DYFG 110 TG+ QSG ++N P + +GY T GK + DYFG Sbjct: 98 LTGLKPEQSGVYSNWGKTRGGPLVKEQQLPVRLAASGYETLGTGKLYHSTQSQWFDDYFG 157 Query: 111 TGE-CPPEWDADYWFDGANYLSELTE------------KEISLWRNGLNSVEDLQANHID 157 T + P + +D ++ +E ++ L NGL S + Q Sbjct: 158 TEQRWSPFTETQSKYDEEELPTKGSEAPRHVIKAGPGGRDWVLPLNGLPSERNAQGRE-G 216 Query: 158 ETFTWAH-----------RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCP------ 200 E+F W RI++ A+ L AR +PF + V Y PH P P Sbjct: 217 ESFDWGAAPVADEDMGDVRITDWALQKLA--ARHQKPFFLGVGYYRPHIPLFAPEVDFKH 274 Query: 201 ------VEYLEKYADFYYELG----EKAQDDLANKPE----HHRLWAQAMPSPVGDDGLY 246 ++ + + +LG E A D + H W +A+ + Sbjct: 275 LPPVQDIQVPQVLGNDLNDLGPVGRETALDPITAGTHDLVVRHGQWKEAVRA-------- 326 Query: 247 HHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDI 305 Y AC VD Q+GR+I+ L +NTW+I SDHG +G K K A + Sbjct: 327 ----YLACITHVDRQVGRLIDRLDASPHADNTWIILWSDHGWHLGEKKHWGKWTA-WRQA 381 Query: 306 TRIPLIIRSPQGERRQVDT----PVSHIDLLPTMMAL------ADIEKPEILPGENILAV 355 TR+PLII P+ + V PVS IDL PT+M + ++++ +LP N Sbjct: 382 TRVPLIIVPPRQQGGPVGKTCAEPVSLIDLYPTIMKICHAPVRSEVKGISLLPLVNDPDS 441 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLI 415 R V+ D + R W + + +ELYD DP+E NL Sbjct: 442 HTGRAVLTTV-------DPGNHALSTRGW-----RYIRYRTGEEELYDTMRDPHEWQNLA 489 Query: 416 DDIRFADVRSKMHDAL 431 D + S+M L Sbjct: 490 GDGASQEKLSEMRQRL 505 >UniRef50_B6HPN7 Pc22g01020 protein n=15 Tax=Eukaryota RepID=B6HPN7_PENCW Length = 589 Score = 124 bits (310), Expect = 1e-26, Method: Compositional matrix adjust. 
Identities = 111/431 (25%), Positives = 192/431 (44%), Gaps = 28/431 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+PN L++M D A ++ + P+ T N+D LA G+ F+SAY SP+C P+R + Sbjct: 4 KKPNILYIMADQMAAPLLSLHDKNSPIKTPNLDRLAEGGVVFDSAYCNSPLCAPSRFVMV 63 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G ++ G + N + T Y + GYHT GK H G D E + Sbjct: 64 SGQLPSKIGAYDNAADLPADTPTYAHYLRREGYHTALAGKMHFCGPDQLHGYEQ--RLTS 121 Query: 121 DYWFDGANYLSELTEKEISL-WRNGLNSVED----LQANHIDETFTWAHRISNRAVDFLQ 175 D + + E +I W + ++SV + ++ N +D ++ + D ++ Sbjct: 122 DIYPGDYGWSVNWDEPDIRADWYHNMSSVMEAGPVVRTNQLDFDEEVIYKSTQYLYDHVR 181 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR----- 230 Q R ++PF + VS PH P+ E+ + Y D L + + H + Sbjct: 182 Q--RNEQPFCLTVSMTHPHDPYAMTKEFWDLYNDVEIPLPKNGAIPHDQQDAHSQRVLKC 239 Query: 231 --LWAQAMPSPVGDDGL--YHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDH 285 L+ + MP D+ + Y AC +VD +G+++ L ++T +++T DH Sbjct: 240 IDLFNKEMP----DERIRAARRAYYAACT-YVDTNVGKLLRVLENTGMADDTIIVFTGDH 294 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKP 344 G+M+G L K +++ R+P ++ +P+ ++V VS +DLLPT LA + Sbjct: 295 GDMLGERGLWYK-MTWFENSARVPFLVHAPKHFAPKRVSENVSTMDLLPTFAELAGAKLI 353 Query: 345 EILPGENI-LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 LP + + L G + + E+ G P+ +K + + LYD Sbjct: 354 SELPLDGVSLVPYLTGGEGLRTDTVYGEYMGEGTQAPLMMIRRGRWKFIYSTIDPPMLYD 413 Query: 404 RRNDPNEMHNL 414 NDP E NL Sbjct: 414 LVNDPEERTNL 424 >UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UH46_RHOBA Length = 490 Score = 123 bits (309), Expect = 1e-26, Method: Compositional matrix adjust. 
Identities = 124/482 (25%), Positives = 192/482 (39%), Gaps = 104/482 (21%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +M D G + T +D+LA EG + Y+ PVC+P RA TG Sbjct: 32 PNIVLMMCDDLGWGDTGFNGNTIIQTPELDALANEGTVLDHFYSVGPVCSPTRASFLTGR 91 Query: 64 YANQSGPWTNNVA--PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF----GTGECP-- 115 + + G WT N P + T+ R K GY T + GKWHL G G P Sbjct: 92 HYFRMGIWTANKGHLPSQEF-TLARMLKTRGYATGHFGKWHLGTLSRTVSAKGKGRRPDL 150 Query: 116 ---PEWDADYWFDGANYLSELTEKEISLWRNGLNS-------VEDLQANHIDETFTWAHR 165 P W+ DY S +TE + W G+ E+ A + + Sbjct: 151 HYAPPWERDY------DASFVTESAVCTWDPGIGKRARNNPYYENGVATDENVLGCDSRV 204 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 + +RA+ F++ A D+PFL V+ + PH EYL KY GE A Sbjct: 205 LMDRALPFIEAAAERDQPFLSVIWFHAPHEDIQAGPEYLAKYEGH----GEAAH------ 254 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSD 284 Y+ C VDDQ+GR+ L +NT + + SD Sbjct: 255 -------------------------YYGCITAVDDQVGRLRKKLASLGVADNTLLFFCSD 289 Query: 285 HGEMMG-------------AHKLISKGAAMYDDITRIPLII----RSPQGERRQVDTPVS 327 +G G A + + ++ D R+P + + P G R ++ P+S Sbjct: 290 NGPEGGEPSNRMKTRRAGSAGEFSGRKRSVLDGGVRVPAFVHWPGQIPAGVR--LNAPLS 347 Query: 328 HIDLLPTMMALADIEK--PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--- 382 +DLLPT+ A+ E +L GEN+L + ++ E IP R Sbjct: 348 VMDLLPTVAAITGAETLPNRLLDGENVLPI------------WKGEQAQREKSIPFRYGQ 395 Query: 383 --CWVTDDFKLVL---NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 C V KL++ N + D L+D D +E +NL + + ++ + M LL +++ Sbjct: 396 FACLVRGKHKLIIESPNDDSKDRLFDLSKDVSESNNLAN--QKPELTASMRTELLGFLES 453 Query: 438 IR 439 + Sbjct: 454 AK 455 >UniRef50_A4A047 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A4A047_9PLAN Length = 481 Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust. 
Identities = 123/467 (26%), Positives = 201/467 (43%), Gaps = 50/467 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D T + GCY +++ NID LAA G F AY VC+P+R L T Sbjct: 18 RQPNVLFIAVDDLRTEL-GCYGASQIHSPNIDRLAAAGTVFTRAYCQQAVCSPSRTSLMT 76 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP--- 115 G+ + + + KN+ T+G++FK GY++ +GK + G+D T P Sbjct: 77 GLRPDSTKVYDLVTHFRKNVPDVVTLGQHFKQNGYYSVSMGKIYHGGYDDPPTWSEPARK 136 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH--------IDETFTWAHRIS 167 P+ A Y A L +T+K + GL V+ +A + + ++ Sbjct: 137 PQGGAGYVL--AENLQTITDKRNAARAKGLRGVQLSRAARGPATEMADVADNAYADGAVA 194 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 + AV L++ ++ DEPF + V + +PH PF P +Y + Y EL P Sbjct: 195 DLAVKSLRELSQRDEPFFLAVGFVKPHLPFNAPKKYWDMYDPAKIELAANPYPPKNVTPY 254 Query: 228 HHRLWAQ-----AMP-----SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-EN 276 W + +P SP L H Y+AC + D +G++++ L + + Sbjct: 255 SLTSWGEMRVYDGIPKQGDLSPEKARELKHG--YYACISYTDANVGKLLDELDKLKLTDE 312 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR---QVDTPVSHIDLLP 333 T V+ DHG +G H K DD PLIIR+P G++ + V +D+ P Sbjct: 313 TIVVLWGDHGWKLGEHNSWCKHTNFEDD-ANAPLIIRAP-GQKSPGAKSTALVEFVDIYP 370 Query: 334 TMMALADIEKPEILPGEN---ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 T+ LA + P+ L G + +L + F++Y I TD ++ Sbjct: 371 TLCELAALPLPQHLEGTSAAPLLDQPDAAWKTAAFSQYPRRQ------IMGYTMKTDRYR 424 Query: 391 LVL------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + ELYD + DP E N+ A++ ++ L Sbjct: 425 FTAWKNKKSGKVVATELYDHQVDPAENVNVAGLTENAELIVQLQKQL 471 >UniRef50_B2AAG4 Predicted CDS Pa_1_3920 n=1 Tax=Podospora anserina RepID=B2AAG4_PODAN Length = 611 Score = 122 bits (307), Expect = 2e-26, Method: Compositional matrix adjust. 
Identities = 114/461 (24%), Positives = 188/461 (40%), Gaps = 49/461 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRPN L+VM D A ++ Y+ + T N+D+LAA+ ++F+SAY SP+C P+R + Sbjct: 48 KRPNILYVMADQLAAPLLKMYNPTSQILTPNLDALAAKSVQFDSAYCPSPLCGPSRMSMI 107 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG + G + N +I T Y + GYHT GK H G G + Sbjct: 108 TGQLPMKIGAFDNAAQISSDIPTYAHYLRLKGYHTVLAGKMHFVGDQLHGY---ETRLTS 164 Query: 121 DYWFDGANYLSELTEKEISL-WRNGLNSVED----LQANHIDETFTWAHRISNRAVDFLQ 175 D + ++ E E L W + +SV +++N +D +R DF++ Sbjct: 165 DIYPGDFGWVPNWEEPETRLEWYHNASSVLQAGSCVRSNQLDYDEEVMYRSRQFLYDFVR 224 Query: 176 QPARADEPFLMV----------------------------------VSYDEPHHPFTCPV 201 + PF + VS PH P+T Sbjct: 225 EGEGGRRPFALTVSLTLRHVRVFVSGRGAGCMSDPHGTREVVIMWYVSLTHPHDPYTIEQ 284 Query: 202 EYLEKYADFYYELGEKAQDDLANKPEHHRLW--AQAMPSPVGDDGLYH-HPLYFACNDFV 258 +Y + Y + +L P RL +P D+ + Y+ +V Sbjct: 285 KYWDLYENVDIDLPRVTIPQEDQDPHSKRLLKVCDLWDNPFSDEQIKRARRAYYGAVSYV 344 Query: 259 DD-QIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG 317 DD + + E+T VI++ DHG+M+G L K + ++ R+PL++ P+ Sbjct: 345 DDCLGQLLTLLKQLKLDEDTIVIFSGDHGDMLGERGLWYK-MSYFESSVRVPLLVSYPKR 403 Query: 318 -ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFG 376 E R+V VS +D+LPTM L + +LP + + G + E+ G Sbjct: 404 FEPRRVSQNVSTLDILPTMCDLVGTKPWALLPMDGRSLLPHLEGREGGHDEVFAEYTGEG 463 Query: 377 GFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDD 417 P+ +K V +L+D R DP E+ +L+ + Sbjct: 464 TVRPLMMIRRGRWKYVTCPADGSQLFDLRADPLELRDLVKE 504 >UniRef50_Q7NMX5 Gll0640 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NMX5_GLOVI Length = 834 Score = 122 bits (307), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 125/459 (27%), Positives = 209/459 (45%), Gaps = 55/459 (11%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++TD QA N + Y K L +Q LA++G+ F +A+ +C P+RA + TG Y Sbjct: 37 NVVLIVTDDQAWNTL-AYMPK-LQSQ----LASQGVTFTNAFAGQSLCCPSRATILTGRY 90 Query: 65 ANQSGPWTNNVAPGKNI-----STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 + G N+ G + ST+ + +++GY T GK + +G+ Y PP WD Sbjct: 91 PHNHGVLGNDAPFGGALAFYDASTLPVWLQESGYRTGLFGK-YFNGYSY-SAFYTPPGWD 148 Query: 120 ADYWFDGANYLSELTEKEISLWR-NGLNSVEDL---QANHIDETFTWAHRISNRAVDFLQ 175 F A Y + +R N ++ED ++N+ + T +AV F+ Sbjct: 149 EWQTFQLAGYYN---------YRINANGTIEDYGRSESNYSTDVLT------QKAVAFIT 193 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADF-YYELGEKAQDDLANKPEHHRLWAQ 234 A +D+PF + ++ PH P+T + +YAD + + D+ +KP W Q Sbjct: 194 NSAASDKPFFLFLAPFAPHAPYTPAPRHAGRYADIPPWRPPNYNEQDVLDKPT----WVQ 249 Query: 235 AM--PSP-VGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMG 290 + SP D Y VDD + ++ AL + QRENT VI+TSD+G G Sbjct: 250 KLRPASPQTQTDYDKERQAYLEMLLAVDDGVESILQALESTGQRENTLVIFTSDNGLTWG 309 Query: 291 AHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPEILP 348 H+ KG + Y++ R+P+++ P RQ + V ++DL T+ A I P + Sbjct: 310 EHRWWEKGCS-YEESLRVPMVVSFPGVSTAARQEELLVLNMDLTATIAEAAGIPIPATVD 368 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFGG--FIPVRCWV-TDDFKLVLNLFTSDELYDRR 405 G ++L + + + V E F G P V + +K + NL ELY+ Sbjct: 369 GRSLLPILKGQAVSWR------EQFLFEGWQLTPTHAGVRSTAWKYMENLAGEQELYNLI 422 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDAL--LDYMDKIRDPF 442 +DP E+ N + + +++ L + Y K P+ Sbjct: 423 DDPYELDNAVGVADYGAQVAELQATLAQMRYSAKGMAPY 461 >UniRef50_UPI00016C001E mucin-desulfating sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C001E Length = 491 Score = 122 bits (307), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 127/490 (25%), Positives = 217/490 (44%), Gaps = 73/490 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F +TD Q + G + T NID LA+E +FN+A+ +P+C P+RA + G Sbjct: 2 KPNIIFFLTDDQRYDTFGFMGHSQVFTPNIDKLASESAKFNNAFHVAPICMPSRASMQLG 61 Query: 63 IYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKW-----HLDGHDYFGTG 112 Y Q P + + + + GY T +IGK+ ++ H+ GT Sbjct: 62 KYIAQHNCGFDLPTDYTITTEEYQHSYPVLLRQNGYFTGFIGKFGFPIANVKAHN--GTR 119 Query: 113 EC-PPEWDADYWFDGANYLSE--LTEKEISLW-------------RNGLNSVEDLQANHI 156 E P D ++ SE L + +W N N E+ Q N Sbjct: 120 EIHPNSLPEDPYYKPTRNFSEEALAPQYFDVWNGSPSNLKYFPDKENKFNGYEN-QWNDD 178 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD------- 209 T AH +A FL++ A + +PF++ VS+ PH P T ++++ Y D Sbjct: 179 HLTMFNAH----QADAFLEKAADSGKPFMLSVSFKAPHRPHTASPKWVQFYKDMTIKRMD 234 Query: 210 -----FYYELGEKAQDDLANKPEH---HRLWAQAMPSPVGDDGLYHHPL--YFACNDFVD 259 ++ L E + N E+ HR +A DD + H Y+ VD Sbjct: 235 NDKPEYFAVLPEVVRTHSRNADEYWGGHRYTRKAW----NDDATFQHDFRNYYGLISGVD 290 Query: 260 DQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE 318 + +G + + L ENT +IYTSD+G G+ +L K +Y++ + PLII P+ + Sbjct: 291 EAVGSIRHKLDELGLAENTIIIYTSDNGYFCGSKQLGGK-ELLYEESIKAPLIIYDPRSK 349 Query: 319 -RRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV-----KEPRGVMVEFNRYEIEH 372 + VD VS +D+ PT++A A + K + + G++++++ +E + N + + Sbjct: 350 VGKWVDGLVSTVDICPTILAYAGLSKTDEMFGDSVISLIDGQKEEIHDAVYGENDFNDNY 409 Query: 373 DSFGGF--------IPVRCWVTDDFKLV---LNLFTSDELYDRRNDPNEMHNLIDDIRFA 421 G I + T +FK V L +E++D NDP E NL+++ + Sbjct: 410 LDIGHHPNPENYQSIHSKYVRTKNFKYVRYHLCHPIVEEMWDIHNDPLETTNLVNNPEYR 469 Query: 422 DVRSKMHDAL 431 ++M + L Sbjct: 470 TTLNEMRNLL 479 >UniRef50_UPI0000E11054 iduronate-2-sulfatase n=1 Tax=Rhodobacterales bacterium HTCC2255 RepID=UPI0000E11054 Length = 1028 Score = 122 bits (306), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 115/450 (25%), Positives = 203/450 (45%), Gaps = 42/450 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L M D ++ G Y T NID LA +G+ FN AY +C P+R + TG Sbjct: 45 KPNILVFMIDDLRPDL-GSYGHAHAITPNIDKLANQGVSFNRAYAQQAICGPSRVSIMTG 103 Query: 63 IYANQSGPWT----NNVAPGK-NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G +T + P + N+ ++ + FK GY T IGK + D Sbjct: 104 LRPETTGLYTIRRDGRLRPNQPNVVSLPQLFKANGYKTISIGKVYHSTTD---------- 153 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D + W + ++ +L + + + +A ++++ F +++ A L++ Sbjct: 154 -DQENW---STHIKKLPNFYVDPEKQAVRYA--YEAGNVEDDFYKDGKVARDADIALRE- 206 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYA-DFYYELGEKAQDDL----ANKPEHHRLW 232 ++PFLM V + +PH PF P +Y + Y D + K D++ K R++ Sbjct: 207 -HQNDPFLMFVGFSKPHLPFNAPKKYWDMYQRDQFTVPSRKTPDNMFRLALTKWNELRMY 265 Query: 233 AQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMM 289 DD L + Y+A ++D Q+G+V+N L RENT VI+ SDHG + Sbjct: 266 GGIPKEGYTDDELTKTLIHAYYATVSYMDAQVGKVLNTLDELGLRENTTVIFMSDHGYKL 325 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-----VDTPVSHIDLLPTMMALADIEKP 344 G + +K M D TR+PLII E ++ D V ++D+ PT+ A + P Sbjct: 326 GEYGAWNKHTNMELD-TRVPLIISQALEEPKRKSGVTSDALVEYVDIFPTIAETAGLPLP 384 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLV-LNLFTSDEL 401 +L G ++ + E + ++ + I + G + W +++ + EL Sbjct: 385 -VLDGVSLKPLLEQPQIQLKQAAFSIFNRGPLMGVSVTDGLWRYTEWRDANSHAIKFKEL 443 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 YD R+ N+ ++A+V +++ D L Sbjct: 444 YDHRDSQVATENVAKQPQYAEVETRLRDML 473 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 122 bits (306), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 118/463 (25%), Positives = 195/463 (42%), Gaps = 96/463 (20%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D + CY + T ++D LA+EG+R Y +PVC+P+RAGL TG Sbjct: 32 KPNIVIILCDDLGYGDLACYGHPVIKTPHLDQLASEGMRLTDCYASAPVCSPSRAGLLTG 91 Query: 63 IYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 N+ G P + + ++ T+ + + AGY T ++GKWH +G F + E P Sbjct: 92 RTPNRLGVYDWIPEGHPMHLKRDEVTVAQLLQQAGYDTAHVGKWHCNG--MFNSKEQPQP 149 Query: 118 WDADY--WFDGANYLSELTEKEISLWRNG--LNSVEDLQANHIDETFTWAHRISNRAVDF 173 D + WF N E + RNG L +E +++ + + Sbjct: 150 GDHGFRHWFSTQNNALPTHENPNNFVRNGKPLGEIEGFS----------CQIVADEGIRW 199 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD-FYYELGEKAQDDLANKPEHHRLW 232 L ++PF + V + EPH P +E Y D YE ++AQ Sbjct: 200 LSDWREKEKPFFLHVCFHEPHERVASPPALVETYLDKSLYE--DQAQ------------- 244 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHG-EMMG 290 YFA +D +G+++ L + +NT V +TSD+G E + Sbjct: 245 ------------------YFANVANMDRAVGKLLIKLDELKVADNTLVFFTSDNGPETLN 286 Query: 291 AHKLISKGA------------AMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMM 336 + S+ + +Y+ R+P I+R P + +++ TPV +DLLPT Sbjct: 287 RYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEIATPVCSVDLLPTFC 346 Query: 337 ALADIEKPEILP--GENIL--------------------AVKEPRGVMVEFNRYEIEHDS 374 +A + P+ P G ++L A PR M E + + H S Sbjct: 347 EIAGVAVPDQRPLDGASLLPLFAGNKIERTTPLFWNYYRAYSTPRVAMREGDWKVVAHWS 406 Query: 375 F-GGFIPVRCWVTDDFKLVLN--LFTSDELYDRRNDPNEMHNL 414 G IP+ V + ++ T ELY+ ++D +E HNL Sbjct: 407 GPEGIIPLGGNVNSVSQEIIKNAKLTKFELYNLKDDISEQHNL 449 >UniRef50_A6DHY0 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHY0_9BACT Length = 507 Score = 122 bits (306), Expect = 3e-26, Method: Compositional matrix adjust. 
Identities = 122/485 (25%), Positives = 199/485 (41%), Gaps = 90/485 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N++F+MTD Q G K + T ++D +A EG + Y PVC+P R T Sbjct: 18 EKLNYVFMMTDDQGYGDTGFNGHKIIKTPHLDQMAKEGAKLTQFYAGGPVCSPTRGTYLT 77 Query: 62 GIYANQSGPWTNNVA--PGKNISTMGRYFKDAGYHTCYIGKWHLD--GHDYFGTGEC--- 114 G + + G W NV P + I T+ K GY T + GKWHL DY GE Sbjct: 78 GRHYYRYGIWGANVGHLPKEEI-TLASVLKQQGYVTGHFGKWHLGTLNKDYSTKGESRKP 136 Query: 115 ----PPEWDADYWFDGANYLSELTEKEISLWRNGLNS----VEDLQANHIDETF--TWAH 164 P W+ DY S + E +S W + + +E+ A Sbjct: 137 TENFAPPWERDY------DESFVVESSVSTWDPASEKNPFYINGVPMKGTEESLYGGAAR 190 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + ++A+ F+++ PFL VV ++ PH P +YLE Y E GE A Sbjct: 191 VVVDKAIPFMERAVSEGNPFLAVVWFNAPHEPIKAGPKYLE----MYKEHGEAAH----- 241 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTS 283 Y+ C +D+Q+GR+ L E NT + + S Sbjct: 242 --------------------------YYGCLTEMDEQVGRIRAKLREMGVEKNTVLFFCS 275 Query: 284 DHG---------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLL 332 D+G + L + ++YD R+P + P + +D +S +D L Sbjct: 276 DNGPEGKKAKGAKAGTTSGLRGRKRSLYDGGVRVPALAEWPGKIQAGSVIDAAMSTLDYL 335 Query: 333 PTMMALADIEKPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD--D 388 PT++AL + + P+ P GENILA+ E + FI V + D Sbjct: 336 PTVIALQNHQMPDERPLDGENILAL---------LTGEESQRKRGIPFIHRGKAVLNRGD 386 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 +KLV ELY ND +E +N+ ++ ++ ++M L ++ +++ + Sbjct: 387 YKLVY----PKELYALSNDWSEENNIAS--QYPEIVAEMSKELEAFVLSMKESHAGADYG 440 Query: 449 LRPWR 453 L ++ Sbjct: 441 LAKYK 445 >UniRef50_Q7UGI8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGI8_RHOBA Length = 505 Score = 122 bits (305), Expect = 4e-26, Method: Compositional matrix adjust. 
Identities = 124/468 (26%), Positives = 194/468 (41%), Gaps = 85/468 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + +M D + G Y+G P L T +D +AA G+R + Y P C+P R + T Sbjct: 38 RPNVILLMGDDHGWDETG-YNGHPHLQTPILDQMAATGLRLDRFYAAHPTCSPTRGSVIT 96 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G + N+ G +T N + + K A YHT + GKWHL G + P + D Sbjct: 97 GRHPNRYGTFTPNYSIRPEEIGVASLLKQADYHTAHFGKWHL-GPVKASSPTNPGAFGFD 155 Query: 122 YWFDGANYLSELTEKEISLW--RNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 W N+ E++ W RNG + Q + + + A++FLQ+ Sbjct: 156 EWLSHDNFF------ELNPWLSRNG-GPPQQFQGES-------SELLIDHAIEFLQK-VP 200 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 +DEPF +V+ Y PH P++ E L Y + + ++ +N+ + P Sbjct: 201 SDEPFFLVIWYGSPHEPYSGLPEDLALYDELPSQFADQTTRLTSNET------GLPVQRP 254 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDH---GEMMGAHKLI 295 +GD +A +D IG + + L E R+NT + Y D+ G+ + L Sbjct: 255 LGDVLRER----YAEITAMDRSIGTLRHWLAANELRDNTLLWYCGDNGTSGDGIVTSPLR 310 Query: 296 SKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPE-------- 345 + + +YD R+P +I P G+ P D+LPT+ LA+++ P Sbjct: 311 GQKSDLYDGGIRVPGLIEWPAKIGQPTTSQIPSVTTDILPTLCELANVDVPRRPLDGVSL 370 Query: 346 ------------------------------ILPGE----NILAVKEPRG----VMVEFNR 367 LP E VK RG V ++ Sbjct: 371 VPLVEGRMKQRNQPICFWSFDHRRESNRDPYLPEEAQQGTTPLVKLMRGNATRNFVNYHH 430 Query: 368 YEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLI 415 EIE FGG R + D +KLV++ S ELY + DPNE N++ Sbjct: 431 REIEPQDFGG---TRVIMDDRYKLVVSADGSKELYAMQADPNETKNVL 475 >UniRef50_C8VYX4 Sulfatase n=2 Tax=Firmicutes RepID=C8VYX4_DESAS Length = 601 Score = 122 bits (305), Expect = 4e-26, Method: Compositional matrix adjust. 
Identities = 115/404 (28%), Positives = 171/404 (42%), Gaps = 72/404 (17%) Query: 3 RPNFLFVMTDTQATNM------VGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPAR 56 +PNFL ++ D Q + + + L Q L + G F + Y S C P+R Sbjct: 18 KPNFLVILVDQQRYAVSYENEEIKVWRKTRLKAQEF--LKSRGFEFKNHYAGSAACCPSR 75 Query: 57 AGLFTGIYANQSG-PWTNNVAPG-----------KNISTMGRYFKDAGYHTCYIGKWHLD 104 A L+TG Y + G T+ A G + TMG YF+ AGY T + GKWH Sbjct: 76 ATLYTGQYPSLHGVSQTDGAAKGAYDPDMFWLNPNTVPTMGDYFRTAGYQTYWKGKWHAS 135 Query: 105 GHDY-----------FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQA 153 D + G P D + + AN L+ S NG E Sbjct: 136 AADILVPGTHKPFLSYNQGNGVPIPDNEKLYINANVLA-------SFGFNGWIGPEPHGV 188 Query: 154 NHIDETFTWAHRISNRAVDFLQQPA-----------RADE----PFLMVVSYDEPH--HP 196 N + + A +S R V + Q +DE P+L++ S+ PH Sbjct: 189 NPRNTGSSAAAGLSGRDVVYSQDTVELIRVLEKEYNESDECRPRPWLIMCSFVNPHDIAL 248 Query: 197 FTCPVEYLEKYADF-------YYELGEKAQDDLANKP----EHHRLWAQAMPSPVGDDGL 245 F L ++ +F Y A + L KP + R++A A + D L Sbjct: 249 FGAISGSLPQF-NFKVNLSVPYISPAPTASESLLTKPSAQSSYRRIYAYAFQPLL--DTL 305 Query: 246 YHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDD 304 ++ LY++ D QI RVINAL NT +I+TSDHGE++GAH L K Y++ Sbjct: 306 FYRQLYYSLEMEADTQICRVINALRETSFYNNTIIIFTSDHGELLGAHGLFQKWYQAYEE 365 Query: 305 ITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEI 346 +PLII +P + D SH+D+LPTM+ ++ ++ I Sbjct: 366 SIHVPLIIHNPTLFDKPESTDMLTSHVDILPTMLGISGLDTGAI 409 >UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XI1_PSEA6 Length = 510 Score = 122 bits (305), Expect = 4e-26, Method: Compositional matrix adjust. 
Identities = 120/508 (23%), Positives = 216/508 (42%), Gaps = 91/508 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 + +PN L ++ D + + Y+ +T NID LA++ + F + Y +PVC+P+R L Sbjct: 36 VTKPNVLLILVDDLGYSDIKAYNENSFYDTPNIDKLASQSVMFTNGYAANPVCSPSRFAL 95 Query: 60 FTGIYAN--QSGPW-----------------TNNVAPGKNISTMGRYFKDAGYHTCYIGK 100 TG + ++ W N+ P I T+ FK GY+T ++GK Sbjct: 96 LTGKHPTRGKATDWFPANDKPARAGRFLPAEFNDALPLSEI-TLAEAFKQNGYNTAFLGK 154 Query: 101 WHLDGHDYFGTGECPPEWDADYWFD-----------GANYLSELTEKEISLWRNGLNSVE 149 WHL G+ W + FD A Y S ++ G Sbjct: 155 WHL--------GKTEDLWPENQGFDVNIAGTKNGHPAAGYFSPYKNARLTDGPKG----- 201 Query: 150 DLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD 209 E T R++N A+ + + ++ PF M++S+ H P P + +++Y Sbjct: 202 --------EYLT--QRLTNEAISLVDKYSKQTVPFFMMLSFYTVHTPLAAPNKDVQEYQA 251 Query: 210 FYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 ++ + A +D + E ++W A V +HP Y A +D Q+GR++ L Sbjct: 252 ---KIRQYAHNDEFQREE--QVWPTAEKREVRVK--QNHPTYAAMVKQMDTQVGRLLAKL 304 Query: 270 TPE-QRENTWVIYTSDHGEMMGAH-----KLISKGAA--MYDDITRIPLIIRSPQGERR- 320 E+T V++TSD+G + A L +G +Y+ R+PL+++ PQ + + Sbjct: 305 KQAGMEESTLVVFTSDNGGLSSAEGSPTSNLPLRGGKGWLYEGGIRVPLLVKLPQKKHKH 364 Query: 321 -QVDTPVSHIDLLPTMMALADIEKPEILPGENILAV---------KEPRGVMVEFNRYEI 370 Q++ PV+ DL PT+++ + ++LP +++ V + +M + Sbjct: 365 LQINEPVTSTDLYPTLLSAGHL---DLLPQQHLDGVDLNQYFSPGAKRDALMRRPLYFHY 421 Query: 371 EHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDI--RFADVRSKM 427 H S G P ++KL+ LY+ ND E +L + R A +R K+ Sbjct: 422 PHYSNQGGFPGAAIRQGNWKLIERFEDGKVHLYNLANDIGEQIDLANQAPERVASLRKKL 481 Query: 428 HDALLDYMDKIRDPFRSYQWSLRPWRKD 455 H ++ + F + + PW+ D Sbjct: 482 H----EWYQQTSARFLKAKGNKTPWQPD 505 >UniRef50_C7MHD7 Arylsulfatase A family protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MHD7_BRAFD Length = 478 Score = 121 bits (304), Expect = 5e-26, Method: Compositional matrix adjust. 
Identities = 130/489 (26%), Positives = 204/489 (41%), Gaps = 64/489 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPAR 56 M RPN LF++ D +++G +G P+ T +D+LAA G R + +C P+R Sbjct: 1 MPRPNILFLIADDHRHDVLGS-AGSPVRTPQLDALAARGTRLARHHCQGGMTGAICAPSR 59 Query: 57 AGLFTG--IYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------ 103 A + +G + A +G + +AP + T+ + ++ GY T +GKWH Sbjct: 60 ASILSGREVLAATAGLGIGTSEAHELAP--DAPTLPQVLRENGYRTYGVGKWHNGTESFH 117 Query: 104 ----DGHDYFGTG----ECPPEWDAD---YWFDGANYLSELTEKEISLWRNGLNSVEDLQ 152 DG F G P D D + D A +L+E ++ + +V DL Sbjct: 118 RSFDDGAQIFFGGMSEHTAVPVHDFDPTGAYPDSARHLAEGFSTDVFV-----QAVTDLL 172 Query: 153 ANH---------IDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEY 203 H A EPF + ++ PH P T P E+ Sbjct: 173 EAHQHRAGGTAGGGAAPDAGAGAGGGPGADDHTGADGPEPFFLWAAFTAPHDPRTPPEEF 232 Query: 204 LEKYAD---FYYELGEKAQDD---LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDF 257 Y L E + D N E A A P G H Y+ Sbjct: 233 ARLYDRTDPAAVPLPENFRTDPVEATNFGERDENLAAAPRDPEEVRG--HLADYYGMISH 290 Query: 258 VDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ 316 +DD IGR++ L ENT V+YT+DHG +G H ++ K ++Y+ R+PL++ P Sbjct: 291 LDDGIGRILAHLERSGLAENTLVVYTADHGLSLGQHGMMGK-QSLYEHSLRVPLLLAGPG 349 Query: 317 GERRQVDTPVS-HIDLLPTMMALADIEKPEILPGENILA-VKEPRGVMVEFNRYEIEHDS 374 E +V P+S H DLLPT++ LA P + G+++ + + P G E+ H + Sbjct: 350 IEAGRVLDPLSLHADLLPTLLGLAGAPVPPGVQGKDLGSLLTAPEGTP---GPREVVHAA 406 Query: 375 FGGFIPVRCWVTDDFKLVLNL--FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 + R + KL+ +L F DELY DP E ++ D A R ++ +L+ Sbjct: 407 Y--VDRARMASDGEHKLIRHLRPFRRDELYALATDPGETEDVASDAARARTRDRLAASLV 464 Query: 433 DYMDKIRDP 441 + DP Sbjct: 465 AWQHASGDP 473 >UniRef50_UPI0000E1104B N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E1104B Length = 485 Score = 121 bits (304), Expect = 5e-26, Method: Compositional matrix adjust. 
Identities = 133/496 (26%), Positives = 207/496 (41%), Gaps = 92/496 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN LF+ TD QA +G + T N+D LA +G+ ++YT +PVC+PARAGL T Sbjct: 23 QTPNILFIYTDDQAPWALGYSGNTQIYTPNLDDLAEQGLYLPNSYTTTPVCSPARAGLLT 82 Query: 62 GIYANQSG--PWTNNVAPG-----------KNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 Y + G W N A ++ T + GY T IGKWHL Sbjct: 83 SQYGFELGIDDWINVKAKTLTAHQPLLGIEQSYETWPEILQKVGYKTGLIGKWHL----- 137 Query: 109 FGTGECPPEWDADYWFDG-ANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 G P + +D +L+ T E R +N VE + E T Sbjct: 138 ---GYQPEHHPTQHGYDEFIGFLAGGTTPEDP--RLEVNGVETNELGLTVEVLT------ 186 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPV--EYLEKYADFYYELGEKAQDDLANK 225 N A+ FL + D+ F + + Y PH+ F PV E Y D L L N Sbjct: 187 NHAIAFLNR--HKDDKFALSLHYRAPHYRF-LPVAPEDAAPYEDVEIALPHPDYPGL-NT 242 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ---RENTWVIYT 282 +L + M S G +D +G ++ L EQ +NT VI+T Sbjct: 243 ERARKLMREYMSSVTG----------------IDRNVGLLMQTL--EQLGLSQNTVVIFT 284 Query: 283 SDHGEMMGAHKLISKGAA---------------------MYDDITRIPLIIRSPQGERRQ 321 SDHG + + + KG MYD+ ++P I+R P + Sbjct: 285 SDHGYNIAHNGMWHKGNGYWLLYEPPLGTPNVPRGQRPNMYDNSLKVPTIVRWPGVIPKA 344 Query: 322 V--DTPVSHIDLLPTMMALA--DIEKPEILPGENILAV-KEPRGVMVE--FNRYEIEHDS 374 D+ +S++D PT++A+A + K I+ G++ L + +P ++ + Y H S Sbjct: 345 SINDSTMSNLDWFPTLVAIARGKVSKDNIVRGQSYLPLFLDPEQILSSDYYAAYSSLHQS 404 Query: 375 FGGFIPVRCWVTDDFKLVLNLFTS--DELYDRRNDPNEMHNLIDDIR--FADVRSKMHDA 430 +R + FKL+ + S DE+YD +NDP E N+I+ ++ Sbjct: 405 ---VTQMRSYSDGRFKLIKDFNNSQRDEMYDLKNDPEEKFNIINSTEADIQKIKVTFDKV 461 Query: 431 LLDYMDKIRDPFRSYQ 446 +++ M++ DP Y Sbjct: 462 IIEKMNETNDPALIYH 477 >UniRef50_Q7MBV3 Arylsulfatase A n=6 Tax=Vibrio RepID=Q7MBV3_VIBVY Length = 497 Score = 121 bits (303), Expect = 8e-26, Method: Compositional matrix adjust. 
Identities = 124/498 (24%), Positives = 213/498 (42%), Gaps = 74/498 (14%) Query: 1 MKRPNFLFVMTDTQATNMVGCYS-----------GKPLNTQNIDSLAAEGIRFNSAYTCS 49 MK+PN L+V D +G + G P+ T N+D A + ++A + Sbjct: 1 MKKPNLLYVFPDQFRLMSLGIWQDPHYQSLLPGKGDPVLTPNLDHFAEQATLLSNAVSNC 60 Query: 50 PVCTPARAGLFTGIYANQSGPWTNNVAP--GKNISTMGRYF----KDAGYHTCYIGKWHL 103 PVC+P R LFTG + ++SG N + + R F DAGY+ YIGKWHL Sbjct: 61 PVCSPHRGSLFTGQFPSKSGVPLNCHSDRVASQLPEKARCFTDVLSDAGYYLGYIGKWHL 120 Query: 104 D---GHDYFGTGECP----PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI 156 D +D G+ P WD+ Y I W + ++ Sbjct: 121 DWPTENDPANPGQYVDSKRPAWDS--------YTEPNRRHGIDEWYGYGTFDQHCNPHYY 172 Query: 157 D------ETFTW-AHRISNRAVDFL--QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 D E W A +++A++FL Q R D+PF + VS + PH P++ + E Sbjct: 173 DTNGQRHEPRKWSAEHETDKAIEFLTRHQQQRPDQPFALFVSMNPPHSPYSSLHDCRE-- 230 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVIN 267 AD+ + L H++ + + P YFA VD + GR+++ Sbjct: 231 ADWQRYCDQPLTSLLTRDNADHQM-----------EKAHSAPFYFANVTGVDQEFGRLVD 279 Query: 268 ALTPE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR-SPQGERRQVDTP 325 L + + ENT V++TSDHGE + +H + ++Y++ +P +++ + Q + +Q Sbjct: 280 TLKAQGEWENTIVVFTSDHGETLCSHGVTDAKNSIYNESLCVPFLLKDAMQNQAQQHPAF 339 Query: 326 VSHIDLLPTMMALADIEK--PEILPGENILAV-KEPR---GVMVEFNRYEIEHDSFGG-- 377 +S D++PT++ L + P+ + G N+ V + P G I+ Sbjct: 340 LSSADIMPTVLGLMGLSDLCPDDIHGRNLAEVFRSPSVAAGPTCALYLKNIDSPPCADGK 399 Query: 378 ----FIPVRCWVTDDFKLVLNL-----FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 F R T+ + L L++ + + +D + DP + +N D VR +H Sbjct: 400 IRDYFAVSRGIKTERYSLALHIDAFGQLSESQFFDNQTDPYQTNNRHFDPNDPVVRRLLH 459 Query: 429 DALLDYMDKIRDPFRSYQ 446 A+ + ++ DP+ Q Sbjct: 460 -AMAQELVRVDDPWAEEQ 476 >UniRef50_A6CFT9 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae RepID=A6CFT9_9PLAN Length = 489 Score = 120 bits (302), Expect = 8e-26, Method: Compositional matrix adjust. 
Identities = 117/442 (26%), Positives = 195/442 (44%), Gaps = 33/442 (7%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN LF+ TD ++ CY + T N+D LA G+ F AY +C P+RA L Sbjct: 30 VEKPNVLFIGTDDLRCDL-ACYGHPLVKTPNLDKLATRGVLFKRAYCQQALCNPSRASLM 88 Query: 61 TGIYANQSGPW---TNNVAPGKNISTMGRYFKDAGYHTCYIGK----WHLDGHDYFGTGE 113 TG + W T+ NI T+ + FK GY T IGK W + Sbjct: 89 TGRRPDTLEIWDLPTHFREADPNIVTLPQLFKQQGYFTQNIGKIFHNWRQKIQGDPASWS 148 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 P D L++ E ++L + + D + ++ + RI + AV Sbjct: 149 VPAVMHFARHDDDQPMLNDNRELPVNLAKAPRSESRD-----VPDSAYFDGRIGDLAVKA 203 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----HH 229 LQ + +PF + V + +PH PF P +Y + Y D + + Q N P+ Sbjct: 204 LQDLKQKQQPFFLAVGFWKPHLPFNPPKKYWDLYDDSPITVPDNPQPP-KNVPDVALHDS 262 Query: 230 RLWAQAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGE 287 R +A+ + D + Y A ++D Q+G+V+ L RE T +++ SDHG Sbjct: 263 REILRAVKGKLTDAQIIELRTGYLAGISYLDAQLGKVLAELDRLGLREKTIIVFWSDHGF 322 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPE 345 +G H L K + +D R+PL+I P + + D V +D+ PT++ L ++ P Sbjct: 323 HLGEHGLWCKTSNFEND-ARVPLMISVPHMKTAGKTSDALVELLDMYPTLVELCGLDSPG 381 Query: 346 ILPGENILAV-KEPRGVM--VEFNR------YEIEHDSFGGFIPV-RCWVTDDFKLVLNL 395 L G +++ V K+P + F + Y + ++ G + R T+ Sbjct: 382 KLEGTSLVPVLKDPTQSVKPAAFTQHPRPAYYRKQPENMGVSVRTPRYRYTEWRNFKTGK 441 Query: 396 FTSDELYDRRNDPNEMHNLIDD 417 + ELYD +DP E N+I++ Sbjct: 442 VIARELYDHTSDPEENTNIINE 463 >UniRef50_A6DSH0 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH0_9BACT Length = 462 Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 127/472 (26%), Positives = 213/472 (45%), Gaps = 74/472 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F+++D +G Y K + + NID LAA G F +A PVC P+R+ +G Sbjct: 30 RPNVIFMVSD-DLNCYLGAYGNKDVISPNIDKLAARGTVFTNAACQFPVCGPSRSSFMSG 88 Query: 63 IYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKW--HLDGHDYFGTGECP 115 + N +G +N PG + T+ YFK+ GY T GK H+D ++ Sbjct: 89 LRPNTTGIISNGPSLYKTQPG--VKTIPSYFKNHGYVTARAGKVFNHIDNNE-------- 138 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW----------AHR 165 + D D+ DG T E N N V + A HI W A Sbjct: 139 -KTDWDFILDGG------TSPEARKRANTGNEVL-VDAGHIHWNAMWRDPECRDEDLADG 190 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--ADFYYE--------LG 215 + +V + + D+PF + + + +PH P P +Y E Y Y+ + Sbjct: 191 ANTLSVSKWIK-KKKDKPFFLAMGFLKPHRPLIVPKKYYELYDPKKLYHSWSRYANEVIP 249 Query: 216 EKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQR 274 A++ +K E Q M GL H Y+A +VD Q+G+++ AL + Sbjct: 250 ATAKNTAGSKLEEAITAEQRM-------GLNHA--YYATVSYVDAQVGKLMQALDDAGLK 300 Query: 275 ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLP 333 ENT V+ D+G +G H K +++ R+PLII P + + D V IDL P Sbjct: 301 ENTIVVLFGDNGTHLGEHLCWGKN-MLFEASARVPLIIADPANKTVKSYDKVVELIDLFP 359 Query: 334 TMMALADIEKPEILPGENILAVKEPRG---VMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 T++ + ++ + + L G ++ A V F++ ++ D + VR TD ++ Sbjct: 360 TLIEMCELPQLDELEGTSLQAAMNGTADSVKAVSFSQVQVR-DGWS----VR---TDRWR 411 Query: 391 LVLNLFTSDE---LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 L+ N +++ LYD +NDP+E++N I++ + SK+ + + K++ Sbjct: 412 LI-NADPANQTLLLYDLKNDPHELNNQINNPERIALVSKLKNLAISKGFKVK 462 >UniRef50_A6DG72 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG72_9BACT Length = 468 Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 107/462 (23%), Positives = 189/462 (40%), Gaps = 64/462 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K N LF++ D +++ CY K T N+D LA++ I F+ AY C P+R L Sbjct: 27 KIKNVLFIIADDLKASVLACYGDKICQTPNLDKLASQSIVFDRAYCQGLSCGPSRTSLMH 86 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-----------LDGHDYFG 110 Y G + + K+ G++T +GK + +DG D Sbjct: 87 SRYLGSEG------------INLPEHLKNNGWYTVRVGKIYHMRVPYDIIHGIDGQD--- 131 Query: 111 TGECPPEWDADYWFDGAN--------------YLSELTEKEISLWRNGLNSVEDLQANHI 156 P W + GA + L +E S +N + + + Sbjct: 132 ---IPSSWTEKFNSKGAESHTPGDYACLNKNIFTKSLKNRESSGMKNRMFVSVISEGDGS 188 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 D+ + + + ++ L Q R +EPF + PH+P P E+ + Y +L E Sbjct: 189 DQPDV---KSAEKTIELLNQ--RKNEPFFIATGLVRPHYPNVAPKEFFQNYPWEKIDLPE 243 Query: 217 KAQDDLANKPEHHRLWAQAMPSPVG---DDGLYHHPLYFACNDFVDDQIGRVINAL-TPE 272 P + +G D+ Y+A +F+D QIGR+++ + Sbjct: 244 LRNPTSLGIPAAGHPRITNSNNSIGKYPDNQKRMWSAYYATVEFMDRQIGRILDEVDRLG 303 Query: 273 QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLL 332 + NT +I+ SDHG +G H K +++++TR+PLI P R+++ +D+ Sbjct: 304 LKSNTAIIFLSDHGYHLGEHGFWQKN-NLHEEVTRVPLIAYIPGLAPRRINEVTELVDIY 362 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR---CWVTDDF 389 P++ L + KP+ + G++ L + N+ E +S +P + T+DF Sbjct: 363 PSLTELLGVYKPKTVQGKSFLPFLK--------NKTEDFRNSALSLMPGKKGYSIRTEDF 414 Query: 390 KLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + + ELY+ DP ++ NLI + SK+ L Sbjct: 415 SYIRYQNGAAELYNMNKDPKQLVNLIQNPEHKQTISKLDREL 456 >UniRef50_Q7UVD9 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UVD9_RHOBA Length = 564 Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 136/493 (27%), Positives = 207/493 (41%), Gaps = 93/493 (18%) Query: 3 RPNFLFVMTDTQA------TNMVGCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPA 55 +PN + V+TD QA G +S P+ +T N+D LAAEG F + + +PVC+PA Sbjct: 101 KPNVVLVLTDDQAPWAFAEAVRSGQFSDVPIPSTPNMDRLAAEGAVFRNFFCTTPVCSPA 160 Query: 56 RAGLFTGIYANQSGPWTNNVAPGK--------------NISTMGRYFKDAGYHTCYIGKW 101 RA L TG YA++ G PG N T + GY T +GKW Sbjct: 161 RATLMTGRYASELGIKDFIPQPGHKLYDPDSPIHLDPDNTVTFAEVMQQQGYTTGLVGKW 220 Query: 102 HLDGHDYFG-TGECPPE--WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 HL G +G+ P +D+ G + E E+ NG V+ Q D Sbjct: 221 HLGDWTANGDSGKHPTRHGFDSFMGLTGGGTTPDNPELEL----NG--KVQQFQGLTTDI 274 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 +++ A+DF++Q AD PF + +S PH + PV A ++ E+ Sbjct: 275 -------LTDHAIDFVEQ--NADRPFFLCLSTRAPHGRW-LPV------APEDWQPYEEM 318 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENT 277 + P+ W + Y A VD +GR++ L E NT Sbjct: 319 DPTIPQYPDLDTDWVRKKMKE-----------YLASTSGVDRNLGRLLKTLDAQELTSNT 367 Query: 278 WVIYTSDHGEMMGAHKLISKGAA--------------------------MYDDITRIPLI 311 VI+TSDHG MG H + KG +YD R+P I Sbjct: 368 IVIFTSDHGFNMGHHGIYHKGNGIWATRQKPPGKFHQGTRVISDKYRPNLYDHSLRVPAI 427 Query: 312 IRSPQGERRQ--VDTPVSHIDLLPTMMALA-DIEKPEILPGENI--LAVKEPRGVMVEFN 366 +R P + ++ SH+D PT+ A+A D + LPG ++ L E + + Sbjct: 428 VRWPGVVKPSAVIEATASHLDWFPTLCAIAGDGSSAKDLPGRDLSPLLKGELQDDWDQAQ 487 Query: 367 RYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--SDELYDRRNDPNEMHNLIDDIRFADVR 424 +E + ++ +R + T ++KL+ + DE YD DP+E NLI + V Sbjct: 488 YFEYDMINY-AVASLRGYRTPEYKLIRDRHNEGCDEFYDLTTDPDETVNLIRNPGSQAVI 546 Query: 425 SKMHDALLDYMDK 437 ++ DA L M+K Sbjct: 547 KRL-DAKLRAMEK 558 >UniRef50_D0Z4S7 Iduronate sulfatase n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z4S7_LISDA Length = 539 Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 126/484 (26%), Positives = 203/484 (41%), Gaps = 77/484 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + PN LF+ D + +G P + T N+D L F +A+ PVC P+R L Sbjct: 33 QHPNVLFLAVD-DLNDWIGALGAHPQVKTPNLDRLYKRSTAFRNAHCQVPVCGPSRTALL 91 Query: 61 TGIYANQSGPWTN---NVAPGKNIS--------TMGRYFKDAGYHTCYIGKWHLDGHDYF 109 TG+ +G +TN + P ++ + ++FK+ GY+T GK G + Sbjct: 92 TGMAPTTTGLYTNKELGIKPFDPVAEQVLGSTPVLPQHFKNNGYYTMASGKISHHGTADY 151 Query: 110 GTGECPPEWDADYWF------------DGANYLSELTE--KEISLWRNGLNSVEDLQANH 155 E +WD + +G Y S + K G ++ + Sbjct: 152 RHKE---QWDEEIPLYVIGPRDEHLKANGYGYGSYGVDDHKYYPFPVGGGQIIQSQEYGP 208 Query: 156 IDETFTWA------HRISNRAV-----------DFLQQPARADEPFLMVVSYDEPHHPFT 198 F+ H I N V + LQ+ ++PF + + PH P+T Sbjct: 209 GTRGFSLCSGALDRHDIPNGGVMPDEYFADWTVERLQR--HYEKPFFLACGFIRPHVPYT 266 Query: 199 CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHH--------PL 250 P EY + + + E + ++ + P + A + P GD + Sbjct: 267 APREYFDMFPLESIIVPETIEKEMTDIPLMGKALALGI-IPGGDAAAVNKLGIRKELVQA 325 Query: 251 YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIP 309 Y AC F+D Q+G+V++AL NT V++ DHG+ G H K ++ + TR+P Sbjct: 326 YLACIAFMDAQVGKVLDALEKSPYANNTIVMFWGDHGQNFGEHMNYRK-QTLWQESTRVP 384 Query: 310 LIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGENILA---VKEPRGVMVEF 365 L+IR PQ E+ QV D VS +DL PT++ L + P++ E I + PR F Sbjct: 385 LMIRLPQQEKGQVCDEAVSLLDLYPTLIELCHL--PKVATNEGISLKPLLNNPR-----F 437 Query: 366 NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF--TSDELYDRRNDPNEMHNLIDDIRFADV 423 +R ++G +C D + + S+ELYDR DPNE HNL D + + Sbjct: 438 DRKIPAVTTYG----YQCHAIRDEQYTYIRYRDGSEELYDRNLDPNEHHNLASDPNYQVI 493 Query: 424 RSKM 427 + M Sbjct: 494 KQAM 497 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 124/461 (26%), Positives = 194/461 (42%), Gaps = 53/461 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNFLF+ D + C T NID L +G+ F ++Y PVC+P+RA TG Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTFYETPNIDRLCRQGMVFANSYASCPVCSPSRASCLTG 63 Query: 63 IYANQSG--PW-----TNNVAPGKNIS------------TMGRYFKDAGYHTCYIGKWHL 103 Y + G W T++ GK I T+ + KDAGY T ++GKWHL Sbjct: 64 KYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHLPEGEYTIAQALKDAGYDTWHVGKWHL 123 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 G +++ P + D G ++ + S + +E L E T Sbjct: 124 GGREFY-----PEHFGFDVNIGGCSW-GHPHDGYFSPY-----GIETLSEGPEGEYLT-- 170 Query: 164 HRISNRAVDFL--QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 RI++ AV L +Q + +PF M + + H P E ++ ELG + Sbjct: 171 DRITDEAVRLLRKRQACGSRKPFYMNLCHYAVHTPIQVKDEDRARFEKKARELGLDKETA 230 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 L HH V + P Y +D IGR++ AL + ENT V+ Sbjct: 231 LVEGEFHHT--EDKKGRRVVRRVIQSDPSYAGMIWNLDQNIGRLLEALRECGEEENTVVV 288 Query: 281 YTSDHGEMMGAHKL------ISKGAA-MYDDITRIPLIIRSPQ--GERRQVDTPVSHIDL 331 +TSD+G + + S+G +Y+ TR+PLI++ P + D PV+ D Sbjct: 289 FTSDNGGLATSEGSPTCNLPASEGKGWVYEGGTRVPLIVKYPGRVAPGSRCDVPVTTPDF 348 Query: 332 LPTMMALADIEKPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 PT + LA + + +P G +I+ + + + H G P V D+ Sbjct: 349 YPTFLELAGVPQKAGIPIDGRSIVPLLSGNPMPERPIFWHYPHYGNQGGTPASSVVMGDY 408 Query: 390 KLVLNLFTSD---ELYDRRNDPNEMHNLIDDIRFADVRSKM 427 K + F D ELYD + D +E +NL + + R +M Sbjct: 409 KYI--EFFEDGRGELYDLKADFSETNNLCEKMPETAARLRM 447 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 114/452 (25%), Positives = 186/452 (41%), Gaps = 97/452 (21%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + V+TD Q + C+ LNT N+D LA + +R + + P CTP RA L T Sbjct: 55 SRPNVILVVTDDQGYGDMSCHGNPWLNTPNLDRLATQSVRLEN-FHVDPFCTPTRAALMT 113 Query: 62 GIYANQSGPWTNNVAPGKNI-----STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 G Y + G W V G+ + +TM F+++GY T GKWHL F E Sbjct: 114 GRYCTRVGAWA--VTEGRQLLDPDETTMAETFRESGYRTGMFGKWHLGDPPPFAPRERGL 171 Query: 117 EWDADYWFDGANYLSELTEKEI---SLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 E + GA+ + T + + +RNG + E D F A+DF Sbjct: 172 ETVVRHMAGGADEIGNPTGNDYFDDTYYRNG--TPESFDGYCTDIWF-------EEAIDF 222 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 +Q+ +++PF + + H P+ ++Y+D + G + Q Sbjct: 223 IQK--ESEQPFFAYIPTNAMHSPYLV----ADRYSDPFKRQGIEPQ-------------- 262 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 Y +F D+ +GR++ L + R+NT +I+ SD+G GA Sbjct: 263 -------------RAAFYGMIQNF-DENLGRLLKRLDQDNLRDNTMLIFMSDNGTAQGAS 308 Query: 293 K----------LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALAD 340 + + K ++Y+ R+P P R VD H D LPT++ L D Sbjct: 309 EQNRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCD 368 Query: 341 IEKP------------------EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR 382 +++P + P ++ ++P V+ + G P Sbjct: 369 LKRPADVTFDGRSMAGLLSHSSQQWPERTLVIERQPDNVVSATK-------TQGRAQPPF 421 Query: 383 CWVTDDFKLVLNLFTSDELYDRRNDPNEMHNL 414 +TD ++LV DELYD +NDP ++ N+ Sbjct: 422 VVLTDRWRLV-----RDELYDIQNDPGQIKNI 448 >UniRef50_A3JPC9 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Rhodobacterales bacterium HTCC2150 RepID=A3JPC9_9RHOB Length = 492 Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust. 
Identities = 125/520 (24%), Positives = 217/520 (41%), Gaps = 108/520 (20%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + TD Q +GCY ++T N+D L+ G+ F++++ + C+P RA + T Sbjct: 3 KQPNIILIFTDNQQAATLGCYGNDEIHTPNLDLLSDTGVTFDNSFCANGFCSPCRASVLT 62 Query: 62 GIYANQSG-----------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 G ++ G W + ++T+ + K GY T IGK+HL G Sbjct: 63 GKLPSEHGVHSWLDDRKMADWPKDWHALDGLNTLPKALKSQGYSTALIGKYHL------G 116 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEI-SLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 P E G + L + I S +RN + D +H+ + + +N+ Sbjct: 117 QPTSPAE--------GFDKWVTLQDGHIRSFYRNKIFDNGDAY-DHVGHSVDF---FTNK 164 Query: 170 AVDFLQQPARADEPFLMVVSYDEP--HHPFTCPVE---YLEKYADFYYE------LGEKA 218 ++F++Q + + PF + + Y P H P T + + +YAD L + A Sbjct: 165 GIEFIEQETQNENPFFLYLPYPAPYGHWPATKETDENRHTARYADCPMNSIPREPLSKAA 224 Query: 219 QDDL---ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-R 274 D A H ++ M +P + L +++ +DD +G+++ L Sbjct: 225 VDGYMLRAADNSTHMDFSMLMRAP---NDLATLRNFYSQISMIDDGVGKIMETLDRLNIA 281 Query: 275 ENTWVIYTSDHGEMMGAHKLISKGAA-----MYDDITRIPLIIRSPQ----GERRQVDTP 325 E+T +I+T+DHG G H GAA ++ +IP+I+R P G R ++ Sbjct: 282 EDTLLIFTTDHGLSTGEHGFWGHGAATVPSNLHRAAHKIPMIMRQPNVTKPGLRNKL--M 339 Query: 326 VSHIDLLPTMMALA-----DIEKPE---------ILP---GENIL--------AVKEPRG 360 VS+ID+ T++ A D P + P GE+I+ + P+ Sbjct: 340 VSNIDVFATILDHANAPFDDASGPSRSLKPVMKGLSPDDWGEDIVFSEQEETRVARTPKW 399 Query: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420 + F R+ E FG DELYD NDP E NL+ D F Sbjct: 400 AL--FKRFSSEKQPFG----------------------DELYDVENDPAERKNLVGDPAF 435 Query: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRW 460 A ++ ++ + + D+ P RP + +R ++ Sbjct: 436 AKIKQELSAKIDAFFDEHAQPNADLWKGGRPIQNSSRTKY 475 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 120 bits (300), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 124/475 (26%), Positives = 194/475 (40%), Gaps = 83/475 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN +F++ D + Y T N+D LA +G+RF +AY VCTP R T Sbjct: 40 SRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLT 99 Query: 62 GIY------------ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF 109 G Y +S P +NN+ N T+ K GY T +GKWH F Sbjct: 100 GRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAGYPPNF 159 Query: 110 GTGECPPEWDADYWF----DGANYLSEL-TEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 G P + D +F G Y + T++ + L+ N V ++ ++ + FT Sbjct: 160 G----PLQKGFDEYFGHLSGGIEYFTHTGTDRILDLYE---NDVPVQRSGYVTDLFT--- 209 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 +RAV+F+Q+P PF + + Y+ PH P+ P + + FY G A Sbjct: 210 ---DRAVEFIQRP--HSRPFYLSLHYNAPHWPWQGPND--QASTAFYLTNGYTVGGSQAT 262 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTS 283 Y A +DD +GRV++AL Q +NT VI+TS Sbjct: 263 --------------------------YAAMVKSLDDGVGRVLDALEASGQADNTLVIFTS 296 Query: 284 DHG--EMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPV-SHIDLLPTMMALA 339 D+G + A++Y+ R+P IIR P + QV V DL T++A Sbjct: 297 DNGGERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAAT 356 Query: 340 DIE-KPEILP-GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 P P G+N+L + RG EF+R +G + R + Sbjct: 357 GTSFHPNYPPDGQNLLPLL--RGDRSEFSRTLFWR--YGAALTTRQRAVR---------S 403 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 D Y RR + + NL D + + D+ ++R+ F+ ++ + P+ Sbjct: 404 GDWKYWRRGNQEALFNLATD---PGETTDLKDSNAQVFTRLRNQFQHWELQMLPY 455 >UniRef50_A6DFR6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFR6_9BACT Length = 573 Score = 120 bits (300), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 111/450 (24%), Positives = 192/450 (42%), Gaps = 105/450 (23%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q V + K + T +D L EG+R ++ Y + +C+P+RA L TG Sbjct: 20 RPNVVLILTDDQGYGEVAAHGNKIIQTPEMDKLYREGVRLDN-YHVNSICSPSRAALVTG 78 Query: 63 IYANQSGPWTNNVAPGKNI-----STMGRYFKDAGYHTCYIGKWHLDGH----------- 106 YA++ G W + G+NI T+ +F AGY T +GKWHL + Sbjct: 79 RYASRVGVW--HTLGGRNIIRKDEKTIADHFVAAGYKTGMVGKWHLGDNAPYRPEDRGFQ 136 Query: 107 DYF-----GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 D F G+ P W D W DG +Y ++ W D+Q ++ Sbjct: 137 DVFRIGGGSIGQLPDYWKNDLW-DG-HYWNK------GQWVKTKGFCTDVQFDY------ 182 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 A+DF+++ ++ PF + +S PH P +YLE Y + G A Sbjct: 183 --------ALDFVEENKKS--PFFLFISTTAPHSPTGADKKYLEPYEKLGLDKGICA--- 229 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 ++ +DD IGR+ N L + ENT +I Sbjct: 230 -----------------------------FYGMVTNIDDNIGRLRNKLRELKLEENTILI 260 Query: 281 YTSDHGEMMGAH------KLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDL 331 ++SD+G + K ++Y+ R+P + P+G +Q+D +HID+ Sbjct: 261 FSSDNGSACDKKGDSFNGGMQGKKGSLYEGGHRVPCFLYWPKGGWIGGKQLDQVTAHIDI 320 Query: 332 LPTMMALADIEKP-----EILPGENILA--VKEPRGVMVEFNRYEIEHDSFGGFIPVRCW 384 LPT++ IE P + + I+A ++ +++ N+ F + Sbjct: 321 LPTLLKACAIENPLNTAFDGIELNGIIAKPAQKLSRLLITENKANKRDQEFQNSVV---- 376 Query: 385 VTDDFKLVLNLFTSDELYDRRNDPNEMHNL 414 +TD+++L+ +LYD +ND + +++ Sbjct: 377 LTDEWRLI----DGQKLYDVKNDFTQKNDI 402 >UniRef50_D2R925 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R925_9PLAN Length = 468 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 131/477 (27%), Positives = 203/477 (42%), Gaps = 74/477 (15%) Query: 3 RPNFLFVMTDTQATNMV---GCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +PN L ++ D T ++ YSGK + N+ L +GI F AY SPVC P+R L Sbjct: 29 KPNVLLLIVDDLNTWLMEDPTRYSGK-VVAPNLQELGKQGIVFRRAYAASPVCCPSRTAL 87 Query: 60 FTGIYANQSGPWTNNV-----APGKNISTMGRYFKDAGYHTCYIGK----WHLDGHDYFG 110 +G+ QSG + N + A K ++M FK AGY+T GK W L Sbjct: 88 LSGVRPWQSGMYENGLDSSASAALKQATSMPAVFKQAGYYTASYGKVGHGWRL------- 140 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 D W D + + + L + D H+ E +I++ A Sbjct: 141 ---------GDVWDDSLAHRRDPAPPQAPLLPFTRGEL-DWGLTHLKEAEMSDTKIADAA 190 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 V LQ+ + D PF + PH P+ P +Y + + + E DL + P R Sbjct: 191 VTQLQR--KHDRPFFIACGLFHPHMPWYIPQKYFDMFPVDEVKTPEILDTDLDDLPPLGR 248 Query: 231 LWAQA--------MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 Q M + V +G+ Y A + D Q+GRV+ AL +NT V++ Sbjct: 249 AVTQGKAKFVDQVMENKVHKEGVR---AYLAATAYADFQMGRVVEALKQSPYADNTIVVF 305 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP-------VSHIDLLPT 334 SDHG +G K ++ + T L R P + TP VS DL PT Sbjct: 306 LSDHGFHLGEKHHWQKN-TLWQEATHCNLAFRVP-----GITTPSSVCQRCVSLQDLYPT 359 Query: 335 MMALADIEKPEILPGENIL-AVKEPRGVMVEFNRYEIEHDSFGG--FIPVRCWVTDDFKL 391 +M L + P + G +++ +KEP+ +E + G ++ +R T+ ++ Sbjct: 360 LMDLTGLTPPSQVEGRSLVPLLKEPKAA------WESTAITSWGDRYVAIR---TEHYRY 410 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDDIRFAD----VRSKMHDALLDYMDKIRDPFRS 444 + +ELYD NDP+E N I + +A +RSK+ AL D K++ RS Sbjct: 411 IRYRDDQEELYDLLNDPHEWTNQIKNQDYAADVEILRSKI-PALKDMAPKLKSGRRS 466 >UniRef50_B7AMH4 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AMH4_9BACE Length = 520 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 119/505 (23%), Positives = 221/505 (43%), Gaps = 87/505 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +K N +F+++D + + P L T +D +A EG +A+ + + +P+RA + Sbjct: 39 VKPRNVVFILSDDHRYDYMVFLGTIPWLETPCMDRMAREGAYIQNAFVTTSLSSPSRASI 98 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG+Y++ NN ++ Y + AGY T + GKWH+ G+D TGE P + Sbjct: 99 LTGLYSHTHKVVDNNAPLPDGLTFFPEYLQAAGYETAFFGKWHM-GND---TGEPQPGFT 154 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 +W E + W +N + D T+ +++ A+DF+++ + Sbjct: 155 --HW--------EGIRGQGVYWNPEIN-INGKWKEFKDSTYL-GDLLTDHAIDFIREQKK 202 Query: 180 ADEPFLMVV----------------------------SYDEPHHPFT-CPVEYLE----- 205 AD+PF + + S++ PH+ T P + ++ Sbjct: 203 ADKPFFVYLSHKGVHDPFQAPKRYEGCYANKKVPLPTSFENPHYGITPTPNKSVQTGKPL 262 Query: 206 KYADFYYELGEKAQDDLANKPEH-----------HRLWAQAMPSPVGDDGLYHHPLYFAC 254 D+Y GE+ + D R W + + Y Sbjct: 263 SGVDYY---GEQMKPDWVKMQRESWHGVDFCYNGRRNWEEEVRK------------YCET 307 Query: 255 NDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR 313 VD+ IGRVI++L ENT VIY D+G G H LI K Y+ R+P++IR Sbjct: 308 LRAVDESIGRVIDSLQEMGLDENTVVIYMGDNGFCWGEHGLIDK-RQFYEASVRVPMLIR 366 Query: 314 SPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVE---FNRY 368 +P + + + V ++D+ PT+++ A ++KP + GE+ + + + + + F Y Sbjct: 367 APGLFPAGQVLKSMVQNVDIAPTILSCAGLDKPAQMVGESYIPLLQGKEIPWRNRIFYEY 426 Query: 369 EIEHDSFGGFIPVRCWVTDDFKLVL--NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSK 426 EH+ + + TD++K + ++ ++E YD DP+E+ N I D + D+ + Sbjct: 427 YWEHE-YPQTPTMHGVRTDNYKYIRYHGIWDTNEFYDLNEDPSELQNRIADPEYQDIIKQ 485 Query: 427 MHDALLDYMDKIRDPFRSYQWSLRP 451 + L D+++ F + ++RP Sbjct: 486 LDADLYDWLETTNGMFIPLKRTVRP 510 >UniRef50_A6L183 Iduronate 2-sulfatase n=11 Tax=Bacteroides RepID=A6L183_BACV8 Length = 477 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 105/364 (28%), Positives = 164/364 (45%), Gaps = 27/364 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N LF+M D + GCY K + T NID AA G+ F +AY PV +RA L T Sbjct: 26 EKMNVLFLMADDMRPEL-GCYGVKEVKTPNIDRFAASGLLFQNAYCNIPVSGASRASLLT 84 Query: 62 GIYANQSGPWTNNVA-PGKNIST---MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G+Y + + N A K+ T + R+F GY+T GK D+ + PP Sbjct: 85 GVYPHYPDRFVNYSAYASKDCPTAIPISRWFTSHGYYTISNGKVFHHLSDHANSWSEPPY 144 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT-WAH---------RIS 167 +D Y +E + E LW N S + + F WA +++ Sbjct: 145 RKHPDGYDV--YWAEYNKWE--LWMNEA-SARTINPKTMRGPFCEWAEVPDTAYDDGKLA 199 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK--AQDDLANK 225 +A+ L++ +PF M + +PH PF P +Y + Y + DL N+ Sbjct: 200 LKAIADLKRLKEQGKPFFMACGFWKPHLPFNAPKKYWDLYDREKIPVANNRFRPKDLPNE 259 Query: 226 PEHH-RLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTWVIY 281 ++ ++A A + D Y+AC +VD QIG+V++AL NT V+ Sbjct: 260 VKNSTEIYAYARTTTADDISFQKEAKHGYYACLSYVDAQIGKVLDALDELGLANNTIVVL 319 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADI 341 DHG +G H + K M D T +PLI+R P ++ + + V +DL PT+ L + Sbjct: 320 LGDHGWHLGEHNFLGKHNLM-DRSTHVPLIVRVPGLKKGKTKSMVEFVDLYPTLCELCHL 378 Query: 342 EKPE 345 P+ Sbjct: 379 PIPK 382 >UniRef50_Q7UVD4 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UVD4_RHOBA Length = 510 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 123/475 (25%), Positives = 199/475 (41%), Gaps = 58/475 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L ++ D +G Y T N+D+LA G+ F+ AY VC P+R+ TG Sbjct: 41 RPNVLLIVAD-DLNCAIGPYGDPNAITPNLDALANRGLVFDRAYCQQAVCNPSRSSFLTG 99 Query: 63 IYANQSG------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + G + G ++ T+ ++FK+ GY+ IGK + G + Sbjct: 100 LRPTTVGVDDLRKSFRETAPNGASLVTLPQHFKNHGYYCQDIGKIFHN----MGDTQDRQ 155 Query: 117 EWDADYWFDGANYLSELTEKE--ISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 W D + ++ ++L L + + +T +I+ A + Sbjct: 156 SWSMDEVLHAGTHAADTVHSNTPVALRARKLKKAPATETLDVPDTAYRDGQIARLAASVI 215 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP-------- 226 + PF + V + PH PF P +KY D Y + E + L P Sbjct: 216 RDYPDDAAPFFLGVGFWRPHLPFVAP----KKYWDLY-DPDEISSPQLETSPVDVPDIAM 270 Query: 227 ----EHHR---LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TW 278 E H + +A SP L H Y+A F+D Q+G ++NAL +N T Sbjct: 271 HISRELHGYDGIPKEAELSPELKRHLRHG--YYASISFLDAQVGLILNALEASGHDNDTI 328 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMM 336 V + SDHG +G L K + D R+PLII P+ +R Q D +DL PT+ Sbjct: 329 VAFVSDHGFHIGEKTLWGKTSNFELD-ARVPLIIADPRVDRTQPRTDCLTELVDLYPTLT 387 Query: 337 ALADIEK--PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV-------TD 387 +LA I PE L G+++ ++ ++ + F + P WV T Sbjct: 388 SLAGIANDLPENLEGDDLSSLLINPNQTLKTAAFTQHQHPF--YAPREKWVALGYSVRTA 445 Query: 388 DFKLVL------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 D++ + ++ELYD RNDPNE N+ ++F DV + +L+ + + Sbjct: 446 DWRYTQWRSIQDHHVIAEELYDHRNDPNESQNVA--VQFPDVVQQHSQSLIKHFN 498 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust. 
Identities = 117/449 (26%), Positives = 176/449 (39%), Gaps = 83/449 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D VGCY G P+ T ID LAA G RF Y+ VC+P+RA L TG Sbjct: 46 RPNVVMLLADDLGYRDVGCYGG-PVETPTIDQLAAGGTRFQQFYSGCAVCSPSRATLMTG 104 Query: 63 IYANQSG--PWTNNVAPGKNIS----TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + ++G W + + ++ T+ +DAGY T ++GKWHL P Sbjct: 105 RHHIRAGVYSWIQDESQNSHLRLREVTLAEVLRDAGYATAHVGKWHLGLPTEERDKPTPD 164 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNG--LNSVEDLQANHI-DETFTWAHRISNRAVDF 173 + D+WF N + RNG + +E + DE W R Sbjct: 165 QHGFDHWFATWNNAQPSHRNPDNFIRNGEPVGQLEGYSCQLVADEAIRWMDR-------- 216 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 + + D+PF + V + EPH P P E +KY +L +K Sbjct: 217 -HRESDPDQPFFLNVWFHEPHAPIAAPDEVTQKYG----KLSDKGA-------------- 257 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMG-- 290 +Y D D I R++ L RENT ++Y SD+G Sbjct: 258 ----------------VYSGTIDNTDQAIKRLLAKLDALGVRENTLIVYASDNGSYRTDR 301 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEI-L 347 KL + A ++ R+P I P V + P +D+LPT+ L I P++ L Sbjct: 302 VGKLRGRKGANWEGGIRVPGIFHWPGHIPAGVVSNEPAGLVDVLPTICGLLKISPPQVHL 361 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV---------LNLFTS 398 G ++ + G F R++ P+ D+ LV NLF Sbjct: 362 DGSDLTPLLT--GHADSFERHQPLFWHLQRSQPIVAMRDGDYSLVGFRDYEMSNKNLFEE 419 Query: 399 D-------------ELYDRRNDPNEMHNL 414 ELY+ ++DP + NL Sbjct: 420 KWIPAIKNGTYHNFELYNLKDDPGQTKNL 448 >UniRef50_A6UE90 Sulfatase n=1 Tax=Sinorhizobium medicae WSM419 RepID=A6UE90_SINMW Length = 489 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. 
Identities = 113/452 (25%), Positives = 190/452 (42%), Gaps = 34/452 (7%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN LF+ +D A + GCY + T NID LA EG+RF++AY SP+CTP+R + Sbjct: 1 MNKPNVLFIFSDQHAQKVAGCYGDDVVRTPNIDRLAQEGVRFDNAYCPSPICTPSRMSML 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGEC 114 T + ++ WTN+ ++ T +AGY IG+ H L G+ G G+ Sbjct: 61 TARWPHRQECWTNDDMLRSDVPTWLHRAGEAGYRPALIGRMHSIGPDQLHGYAERGIGDH 120 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSV--EDLQANHIDETFTWAHRISNRAVD 172 P + F +SL ++G + + +D W D Sbjct: 121 TPNFAGIARFPMGVLEGTNEPDSVSLTQSGAGMAIYQRKDQDVVDAAAAWLR-------D 173 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 A + F + V PH P+ E + Y + ++ D ++ + HR W Sbjct: 174 KGAARNAAGQQFCLTVGLMTPHAPYVVDREAFDHY---HGQVPPPRLDVPQDEHDWHRWW 230 Query: 233 AQAMPSPVGDDGL--YHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMM 289 D + Y+ D+ IG+V++AL ++T ++Y SDHG+ + Sbjct: 231 RHDRGIGEVSDAVRDRARAAYWGLVQRTDEMIGQVLDALKEIGAMDDTLIVYASDHGDHV 290 Query: 290 GAHKLISKGAAMYDDITRIPLIIR----SPQGERRQVDTPVSHIDLLPTMMALADIEKPE 345 G L K +++ + PL++R P GE R D V+ +DL TM+ + + Sbjct: 291 GERGLWWK-HTFFEESVKFPLVMRLPGAIPAGESR--DQVVNLVDLSQTMIEVMGAQPLP 347 Query: 346 ILPGENILAVKEPRGVMVE---FNRY---EIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 G++ AV R E F+ Y + + G + R + +KL + Sbjct: 348 YADGKSFWAVACDREAPWENETFSEYCTDPVPSWTGGRAVQQRMIRSGSWKLSVYDGEPP 407 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 L+D DP+E N +D A++ ++ L Sbjct: 408 LLFDLSTDPDERINRAEDPDCAEMFQRLSARL 439 >UniRef50_A3HWG3 Choline sulfatase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HWG3_9SPHI Length = 505 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. 
Identities = 126/489 (25%), Positives = 210/489 (42%), Gaps = 85/489 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPV----CTPARA 57 ++PN LF+ D Q + +G + T ID L EG RF++AY V C +RA Sbjct: 41 QKPNVLFLFADDQRADALGINGNPYIQTPTIDQLGREGSRFSNAYVMGGVHGAICMSSRA 100 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT------ 111 LF+G ++ + G++ TM F AGY T GKWH + + + Sbjct: 101 MLFSG----KNLYKVTDKLSGEHTMTMS--FAAAGYRTFGTGKWHNEKEAFEASFQEAKN 154 Query: 112 ---GECPPEWDA---DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 G +D DY DG L E T K S E F A Sbjct: 155 VYLGGMADHYDLPLRDYGADGK--LGEPTRKGFST-----------------EQFAQA-- 193 Query: 166 ISNRAVDFLQQPAR--ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ---- 219 A+DF++ + D+PF V++ PH P++ Y+ Y D L Sbjct: 194 ----AIDFIKDHGQRNTDQPFFCYVAFTAPHDPYSPEANYINHYPDGTLPLPGNYMPYHP 249 Query: 220 ---DDLANKPEHHRLWA---QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE- 272 D L + E+ W + + + D Y+A +D QI +++N L Sbjct: 250 FEFDHLTVRDENLTGWPRKPEVIQMILSD--------YYALVTHLDTQIAKILNTLKETG 301 Query: 273 QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHI-DL 331 Q +NT ++Y +D+G G+H L+ K ++Y+ +++PLII+ P + Q ++I DL Sbjct: 302 QYDNTIIVYAADNGLAAGSHGLLGK-QSLYEHSSKVPLIIKGPGVPQDQELDAFAYIHDL 360 Query: 332 LPTMMALADIEKPEILPGENILAV--KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 PT+ LA I P + G +++ V E GV + S+ G VR + Sbjct: 361 YPTLAELAGIPDPSDIDGVSLVPVITGEQDGVR------DALFTSYRG--TVRAVRNKKY 412 Query: 390 KLVL---NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 KL+ +T +L+D DP E++NL ++ + +S+M + + + + +D + Sbjct: 413 KLIRYPERDYT--QLFDLDADPLEINNLAENTEYQSKKSEMFELMEKWQNSFQDTVKLTA 470 Query: 447 WSLRPWRKD 455 ++P + D Sbjct: 471 DKIKPMKYD 479 >UniRef50_A0LK86 Sulfatase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LK86_SYNFM Length = 487 Score = 119 bits (297), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 123/440 (27%), Positives = 193/440 (43%), Gaps = 58/440 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN L + D + +GC G P + T NID LA G+ F +A SP+C+P+RA FT Sbjct: 47 KPNVLMFVLD-DMNDWIGCLGGHPDVKTPNIDRLAQRGVLFRNAQCSSPICSPSRASFFT 105 Query: 62 GIYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE 117 GI + SG + N+ A K N T+ ++F GY + GK +H D E P Sbjct: 106 GIRPSTSGIYGNSQAFRKIMPNAVTLPQHFIAHGYRSMGCGKLFHFIKTDSRSWHEFFPS 165 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID--ETFTWAHRISNRAVDFLQ 175 + FD + L+ GL V ID + +++ A D L+ Sbjct: 166 RSMERPFDPVPPNAPLS---------GLPDVNQFDWGPIDIVDEELGDGKLARWAADALR 216 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN---------KP 226 + R D PF + V PH P P +Y + Y L +DL + KP Sbjct: 217 R--RYDRPFFLGVGLLRPHVPLYVPRKYFDMYPPESITLPTVKANDLDDVPPTGVSWAKP 274 Query: 227 EHHRL------WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWV 279 E H+L W +A+ Y A FVD Q+G V++AL NT V Sbjct: 275 ERHQLIVEHDQWRKAVAG------------YLASVSFVDAQVGWVLDALDESPYVNNTVV 322 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMA 337 + D+G +G KL ++++ R+PLII P R+ PVS +D+ PT+ Sbjct: 323 VLWGDNGWHLG-EKLHWTKLTLWEESCRVPLIIALPGLTPPGRKCAKPVSTMDVYPTLNE 381 Query: 338 LADIE-KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD-FKLVLNL 395 L D+ KPE+ + ++ P+ + + ++P + D+ ++ + Sbjct: 382 LCDLTPKPELECRSILELLRNPQSDTWD------GPPALSTYMPGNHSLRDERYRYIRYN 435 Query: 396 FTSDELYDRRNDPNEMHNLI 415 ++ELYD + DP E +NL+ Sbjct: 436 DGTEELYDLKADPMEWNNLL 455 >UniRef50_A6DNH1 Choline sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNH1_9BACT Length = 470 Score = 118 bits (296), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 107/448 (23%), Positives = 200/448 (44%), Gaps = 53/448 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L + D + +G G P T N+D LA G+ F + SPVC P+R + Sbjct: 23 EKPNVLLIAVD-DLNDWIGVLGGHPQAKTPNMDRLANRGVLFTNTQCQSPVCNPSRGSMM 81 Query: 61 TGIYANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGKW--HLDGHDYFGTGE 113 T +Y + +G + N + G K M + F+ GYH GK + + YF Sbjct: 82 TSLYPSTTGIYFLNPSVGTSPKAKGHLVMPKRFEAEGYHVSAAGKLFHNQENKKYFK--- 138 Query: 114 CPPEWDADYWFDGANYLSELTEKEIS------LWRNGLNSVEDLQANHIDETFTWAHRIS 167 E+ F G + +K+I+ LW G+ D Q + R+ Sbjct: 139 ---EYGGS--FGG---FGPIPKKKITSFPGHPLWDWGVYPERDEQMPDVKIAAWGKERL- 189 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 A D+ D+PF M + + PH P P ++ + Y ++ + ++D+ P+ Sbjct: 190 --ARDY-------DQPFFMGIGFYRPHVPQFAPQKWFDMYPLESVQMPKMRKNDIEGIPQ 240 Query: 228 HH-RLWAQAMPSPVGDDGLYHH------PLYFACNDFVDDQIGRVINAL-TPEQRENTWV 279 + L + +P + + + Y AC FVD Q+G++++AL ++NT+V Sbjct: 241 YGVDLTREKHVAPTYEWVIENKEEKKLVQSYLACVSFVDAQVGKILDALDASPHKDNTYV 300 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALA 339 + SDHG +G + +K ++++D R+P++I P + P +D+ PT++ L Sbjct: 301 VLYSDHGFHLGEKERYAK-RSLWEDGARVPMMISGPGIKPGVTHKPTQLLDIYPTLLELT 359 Query: 340 DIEKPEILPGENILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 ++ L G +++ ++ P+ + R ++ V++ ++ + S Sbjct: 360 GLKSDPKLEGNSLVPLLRNPQSDWPHYARTSFGPGNY-------AIVSERYRYIHYNDGS 412 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSK 426 +E YDR D +E HN I + +A + +K Sbjct: 413 EEFYDRSKDTHEWHNQIKNPEYASIIAK 440 >UniRef50_C1ZIM5 Arylsulfatase A family protein n=2 Tax=Planctomycetaceae RepID=C1ZIM5_PLALI Length = 523 Score = 118 bits (296), Expect = 4e-25, Method: Compositional matrix adjust. 
Identities = 132/463 (28%), Positives = 197/463 (42%), Gaps = 58/463 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRPN L + D Q + + G PL T + SLA G F +A+ +P+C P+R L Sbjct: 46 KRPNVLMIAIDDQ-NDWIEPLGGHPLVKTPQLKSLAERGTVFLNAHCQAPLCNPSRTSLL 104 Query: 61 TGIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G+ + +G PW +V T+ + F AGY T GK + G G P Sbjct: 105 LGLRSTTTGIYGLSPWFRDVPALSGRLTLPQAFGKAGYTTLSTGK------IFHGGGGKP 158 Query: 116 PEW--DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQA-NHID------ETFTWA-HR 165 + + D W A + + EK + N + D A H+D + WA + Sbjct: 159 KDRLKEFDEW-GPAGGVGKRPEKRLIQPPPHSNPLVDWGAFPHLDSEKGDTQITDWAIEK 217 Query: 166 ISNRAVDFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 + R V QQ + E PFLM V Y PH P E+L Y D L +DD Sbjct: 218 LKQRQV---QQSSSTGESKPFLMCVGYFLPHVPCYVTPEWLAMYPDDDSILPFIEKDDRK 274 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPL------YFACNDFVDDQIGRVINALTPE-QREN 276 + P +P P H Y A +VD QIGR++ AL + N Sbjct: 275 DTPRFSWYLHWRLPEPRLKWLQQHEHWRSLVRSYLASTSYVDAQIGRLLAALEATGEANN 334 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTM 335 T ++ SDHG +G K I+ +++ TR+PL+ P + PV +D+ PT+ Sbjct: 335 TLIVLWSDHGWHLG-EKGITGKNTLWERSTRVPLLFAGPGVLAGGKCVEPVELLDIYPTL 393 Query: 336 MALADIEKPEILPG-------ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 L +E P L G N LAV++ R + N+ G +R T D Sbjct: 394 AQLCQLEAPTDLEGVSLVPQLTNPLAVRQ-RPAITSHNQ---------GNHAIR---TRD 440 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + + S+ELYD DP+E+ NL DD + ++ +++ L Sbjct: 441 HRYIRYADGSEELYDHLVDPHELKNLADDPAHSGLKKQLNSWL 483 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 118 bits (296), Expect = 5e-25, Method: Compositional matrix adjust. 
Identities = 125/483 (25%), Positives = 198/483 (40%), Gaps = 79/483 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNF+ ++TD Q + C+ K ++T ID +AAEG R S Y +PVCTP+RAGL TG Sbjct: 570 KPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRAGLMTG 629 Query: 63 IYANQSGPWTNNVAPGKNIS---------------TMGRYFKDAGYHTCYIGKWHLDGHD 107 Y P ++A G N T+ K AGY T GKWHL Sbjct: 630 CY-----PKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 684 Query: 108 YFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA---- 163 F P + D +F G Y ++ + LQ + + E A Sbjct: 685 EF----LPTKQGFDEFF-GIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDADFLT 739 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R++ +AV F+++ D+PF + + + PH P ++E AD EK +D Sbjct: 740 KRLTEQAVSFIER--NKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEK-EDGNI 796 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYT 282 + L+ QA+ +D +G++++AL E T V++T Sbjct: 797 DYATRANLFRQAIAE-------------------IDWSVGQILDALRSNGLDEKTMVLFT 837 Query: 283 SDHGE-----MMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTM 335 SD+G +L ++ R P ++R P Q D ++ +DLLPT Sbjct: 838 SDNGPPKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTF 897 Query: 336 MALADIEKP--EILPGENILAVKEPRGVMVEFNRYEIEHDSF-----GGFIPVRCWVTDD 388 LA P ++ G++I + + HD+F VR + Sbjct: 898 AKLAGAAIPTDRVIDGKDIWPTLK--------GETQTPHDAFFYHRGNQLAAVR---SGK 946 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 +KL +N + +LYD ND E N+I+ +V K+ L D+ I R ++ Sbjct: 947 WKLHVNNGVAKQLYDLENDLGEKVNVIE--TNPEVVKKLQHQLKDFAADIASNSRPAAFN 1004 Query: 449 LRP 451 P Sbjct: 1005 ANP 1007 Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 129/554 (23%), Positives = 211/554 (38%), Gaps = 172/554 (31%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + + D +GCY L+T NID LAAEG RF A++ S VCTP+R GL TG Sbjct: 40 PNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYGLLTGQ 99 Query: 64 YANQS----GPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 Y ++ G W T+ + N T+G+ FK+ GY T +GKWHL G E Sbjct: 100 YPVRAMGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHL------GFKEE 153 Query: 115 PPEWDA-----------DYWF------DGANYL-------------------------SE 132 P +W D++F G+ Y+ + Sbjct: 154 PCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTP 213 Query: 133 LTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDE 192 + +E S+ S L+A+ I + ++ RAV ++ + + +EPF + + Sbjct: 214 MFPEEASVKSPNRFSGA-LKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFLYFATPN 270 Query: 193 PHHPFTCPVEYLEK-----YADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYH 247 HHPFT + Y DF +EL Sbjct: 271 IHHPFTPAPRFKGTSQCGLYGDFVHEL--------------------------------- 297 Query: 248 HPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM---------GAHK---- 293 D +G ++ +L +NT V++TSD+G M+ H+ Sbjct: 298 -----------DWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGE 346 Query: 294 LISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKP------- 344 L+ +++ R+PLI + P + Q D +S +DL T AL + E P Sbjct: 347 LLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALTEQEMPSSEQKDS 406 Query: 345 ------------EILPGENILAVKEPRGVMVE--------------FNRYEIEHDSFGGF 378 E L E +LA ++PR + + FN + +H ++GG Sbjct: 407 INMLPALLDDPNEPLRTELVLAPRQPRNLAIRKGKWLYIGARGSGGFNGSKPQHHAWGGP 466 Query: 379 IPVRCWVTDDFKLVLNLFTSD----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 V+ + +V + +LYD ND ++ N+ + H +++ Sbjct: 467 AAVQFSGQKNSDIVNGRIKKNAPPAQLYDLENDRSQTTNVF----------REHPEVVEE 516 Query: 435 MDKIRDPFRSYQWS 448 M + + +R Q S Sbjct: 517 MKAMLESYRPKQGS 530 >UniRef50_A9V5D4 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9V5D4_MONBE Length = 619 Score = 118 bits (296), Expect = 5e-25, Method: Compositional matrix adjust. 
Identities = 126/483 (26%), Positives = 198/483 (40%), Gaps = 57/483 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGK-PLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K PNF+FV DT G Y P + N+D A EG+RFN A+ C+P+R + Sbjct: 102 KVPNFIFVFPDTLRAESFGAYGNPFPNVSPNLDKFAEEGVRFNQAHVMHTQCSPSRCTMV 161 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG Y + G T + R K+ GYH Y GK + F W Sbjct: 162 TGRYMHLQGHRTQTHLVQDYEANYFRILKEHGYHVQYFGKNDMFSAMAFNLSVS--AWAG 219 Query: 121 DYWF-DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 + + G+N S R G +V ++ D + A++FL ++ Sbjct: 220 NIGYASGSNPFPFGETGYYSFLRTGAGAVNASDLSNGDTLGV------HNAINFLN--SK 271 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKY-ADFYYELGEKAQDDLANKPEHHRLWAQAMP- 237 EPFL+ + H P+ P + + AD E + ++A+KP + + + +P Sbjct: 272 PPEPFLLFLPSRGAHPPYGAPAPFHNLFTADKVKEKIKLRPRNIASKPTYMQ-NSNGIPH 330 Query: 238 ----SPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 + + DD Y Y + D G ++ L R NT + ++SDHG+ G Sbjct: 331 FRNLTYLNDDFFYQIQANYLNMIAYTDWLFGNLLEGLDQSGLRNNTAIFFSSDHGDFGGD 390 Query: 292 HKLISK-GAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALADIEKPEILPG 349 L+ K +M D +TR+PL+ + P G + V+ PV D+L TM+ALA+I+ + G Sbjct: 391 FGLVEKWPGSMSDVLTRVPLLAQVPGGAKNHVVEAPVQTADILETMLALAEIDVDFVRFG 450 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSF---------------GG----FIPV--------- 381 +N+ + G + NR F GG + P Sbjct: 451 QNL--APQLAGSEGQLNRTVYSEGGFYFSNEQMIEANECLSGGPKADYYPRGLEEAQPNG 508 Query: 382 --RCWVTDDF--KLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 R + + KLV ELY+ DP E+ N+ +D A +R+ M L D+M Sbjct: 509 SPRAVMLRNLTAKLVYRPTGISELYNLTADPLELSNVFEDAAHASLRAAMMQQLTDWMVL 568 Query: 438 IRD 440 D Sbjct: 569 TSD 571 >UniRef50_B4X2F4 Sulfatase, putative n=1 Tax=Alcanivorax sp. DG881 RepID=B4X2F4_9GAMM Length = 565 Score = 118 bits (295), Expect = 6e-25, Method: Compositional matrix adjust. 
Identities = 112/391 (28%), Positives = 176/391 (45%), Gaps = 45/391 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L +++D + + G L+ + L +G FN + + C+P+R+ ++T Sbjct: 36 RRPNVLLLVSDQERS---GLDLPGSLDLPGHERLRRQGTSFNHYHVNTSPCSPSRSVMYT 92 Query: 62 GIYANQSGPWTNNVAP-----GKNISTMGRYFKDAGYHTCYIGKWHL-DGHDYFGT--GE 113 G + + N AP + T+G +F+D GY+T Y GKWHL D D G G Sbjct: 93 GQHTMHTHMTANLHAPPFPALNDKLKTLGHHFRDQGYYTAYKGKWHLSDIEDGPGLLYGN 152 Query: 114 CPPEWDA-------DYWFDG-------ANYLSE-LTEKEISLWRNGLNSVEDLQANHIDE 158 P A DY G Y+++ + E W G E+ + + Sbjct: 153 YPSRNRALEKHGFSDYNLTGDVHGSVWQGYIADRMVTAEACRWLMGKGQTEE-KPWFLAV 211 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 F H I + Q +R + P M PH P Y + ++ + L Sbjct: 212 NFVNPHDIMFFSTGEKQSRSRTN-PQFMAPLRPAPHDPV-----YAKDWS--HISLPASF 263 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDG-------LYHHPLYFACNDFVDDQIGRVINALTP 271 + L NKP + +A+ + S G L + YF C V Q+ +V+ AL Sbjct: 264 RASLDNKPWCQQAYAKLIDSVYGHIDKDNEAAWLANQSYYFNCLRDVSRQVDQVLQALEE 323 Query: 272 E-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHI 329 Q +NT ++YT+DHGEM GAH L KG Y + +R+PLII P ++R VD S + Sbjct: 324 SGQADNTIIVYTADHGEMAGAHGLRQKGPFAYKENSRVPLIISHPDARQQRDVDNIGSSV 383 Query: 330 DLLPTMMALADIEKPEI-LPGENILAVKEPR 359 DL+PT+++LA K + PG ++ A + R Sbjct: 384 DLVPTLLSLATEGKADTQTPGTDLSAALDGR 414 >UniRef50_C6Y1N2 Sulfatase n=2 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1N2_PEDHD Length = 464 Score = 117 bits (294), Expect = 7e-25, Method: Compositional matrix adjust. 
Identities = 123/482 (25%), Positives = 208/482 (43%), Gaps = 89/482 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +MTD Q + + K L+T +D LAA G RF AY P+CTP+R+ +F+G Sbjct: 33 RPNIIIIMTDQQTADAMSNAGNKDLHTPAMDVLAANGTRFTRAYCAQPLCTPSRSAIFSG 92 Query: 63 IYANQSGPWTNNVAPGKN------ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 +++G +T N P K+ + MG+ FK GY T Y+GKWH Sbjct: 93 KMPHETG-FTGNT-PEKDGQWPDSVLMMGKIFKAGGYKTGYVGKWH-------------- 136 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 L + ++G ++E+ + T ++ +F+++ Sbjct: 137 ----------------LPVPVTKVAQHGFETIENTGMGDYTDAVT-----PSQCANFIKK 175 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL------ANKP---- 226 D PFL+V S+ PH + + + ++ + A D AN P Sbjct: 176 --NKDNPFLLVASFLNPHD-----ICEWARGDNLKMDVLDAAPDTAFCPKLPANWPIPAF 228 Query: 227 ------EHHRLWAQAMPSPVGDDGLYHHPLYFACNDF---VDDQIGRVINALTPEQRE-N 276 E ++ + PS VG + +A N VD+ + V+ +L E N Sbjct: 229 EPAIVREQQKVNPRTYPS-VGWNESQWRKYRWAYNRLVEKVDNYMAMVLGSLKKYGIEDN 287 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLII-RSPQGERRQVDTPVSH-IDLLPT 334 T +I+TSDHG+ AH+ K +Y++ RIP II + Q + R D V + ID++PT Sbjct: 288 TIIIFTSDHGDGYAAHEWNQK-QILYEEAARIPFIISKIGQWKARTDDQLVCNGIDIIPT 346 Query: 335 MMALADIEKPEILPGENI--------LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 + A I KP L G ++ + +++ + +F E+ G R +T Sbjct: 347 ICGFAGIAKPVGLKGLDLSKRIANPSVKLRDTLVIETDFADNELLLGIKG-----RAVIT 401 Query: 387 DDFKLVL--NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 DFK ++ ++L+D D EM NL + ++M L + + +D F + Sbjct: 402 KDFKYIVYDKGEIREQLFDLEKDAGEMDNLAVKPAYKKKLNEMRAYLKLWCKQHQDSFYA 461 Query: 445 YQ 446 + Sbjct: 462 LK 463 >UniRef50_A6DM53 Arylsulfatase (Aryl-sulfate sulphohydrolase) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM53_9BACT Length = 540 Score = 117 bits (294), Expect = 7e-25, Method: Compositional matrix adjust. 
Identities = 91/299 (30%), Positives = 143/299 (47%), Gaps = 32/299 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D + +GC+ G+ + T N+DSLAA+G+RF Y S C P+RA L TG Sbjct: 51 RPNIVIILADDAGFSDLGCFGGE-IETPNLDSLAAKGLRFTEFYN-SARCWPSRAALMTG 108 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y N KN T+ + K GY T +GKWHL G + G P Sbjct: 109 SYDNYLN---------KNRITIPQVLKTTGYKTAMVGKWHLGGKSFDPNGPNAPMNRGFD 159 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 F G + + ++L RN + ++ +H E+F + +I AV ++ A+A++ Sbjct: 160 DFYGTLHGAGSYYDPMTLTRN----RKSMEPDH--ESFYYTDKIGEEAVRQIKALAKAEQ 213 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW-AQAMPSPVG 241 PF +++ PH P P + ++KY Y EK + D + + + P P Sbjct: 214 PFFQYIAFTAPHWPIHAPEKTIQKYIKRYEGGWEKLRKDRYTRMLKMGIIDEKRFPLPPM 273 Query: 242 D------DGLYHHP-------LYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHG 286 + D + H P +Y A D +D IGRVI+AL + EQ +NT++ Y D+G Sbjct: 274 EPNVKPWDKVDHKPWRIRNMAVYAAMVDHMDQAIGRVIDALKSSEQFDNTFIFYCHDNG 332 >UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CBC Length = 496 Score = 117 bits (294), Expect = 8e-25, Method: Compositional matrix adjust. 
Identities = 130/498 (26%), Positives = 206/498 (41%), Gaps = 84/498 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN + ++ D Q V CY K + T NID LAA G++ YT + +P RAGL T Sbjct: 36 KPNIVIILADDQGYGGVNCYPHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMT 95 Query: 62 GIYANQSG------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G Y G P + + + + Y + GY+T IGKWHL DY + Sbjct: 96 GKYQQSFGFYGLSTPHVGGIPQDQKL--LSEYLVENGYNTACIGKWHLG--DYIRSHPNN 151 Query: 116 PEWDADYWFDGA--NYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 + + F +Y L NGL D + E + + RAVDF Sbjct: 152 RGFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLD-NMEPVTEMEYSTYEYTKRAVDF 210 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 +Q+ AD PF + + Y+ H P P E + + A E+G +DD+A Sbjct: 211 IQK--NADHPFFLYLPYNAIHSPLQAPEELIGELAINPQEIG---KDDIA---------- 255 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ---RENTWVIYTSDHGEMMG 290 + FA +D +G+V+ L EQ R+NT + Y SD+G + Sbjct: 256 --------------RAMTFA----LDQGVGKVVETL--EQLGLRDNTIIFYLSDNGAVEY 295 Query: 291 AHKLISKG--AAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEI 346 + K +G + Y+ R+P I+ P + + PV ID+ PT+M LA + + Sbjct: 296 SDKWEFRGRKGSYYEGGIRVPFIVSYPAKLAKGTIYNKPVMSIDIAPTVMELAGLSHAD- 354 Query: 347 LPGENILAV------KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN-LFTSD 399 + G N+L EP V+ + + F +R +KLV + F D Sbjct: 355 MHGVNLLPYLSGKDRTEPHDVLYWSTEKKSNNQVFKNEFAIR---QGKWKLVSDPHFEKD 411 Query: 400 -ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF--------------RS 444 +LYD DP E H L D ++ + ++ L++++++ + R Sbjct: 412 YDLYDIEADPQEKHGLKD--QYPEKYKELFGMYLNWINQMPEELANGENARLKGMELMRK 469 Query: 445 YQWSLRPWRKDARPRWMG 462 YQ +L+ K P G Sbjct: 470 YQRNLKKSGKKVVPLSFG 487 >UniRef50_UPI0001C35757 sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35757 Length = 428 Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 120/452 (26%), Positives = 191/452 (42%), Gaps = 68/452 (15%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN L V D G + + + T N+D A E + A + P+C+P R+ L Sbjct: 1 MKRPNLLVVFADQWRNTARGIHDPQ-IVTPNMDQFAEEAFSTDQAVSGCPLCSPYRSELL 59 Query: 61 TGIYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG--- 112 TG A +G + N ++ + + + K+ GY T YIGKWHLD + Sbjct: 60 TGRRAVHTGVFGNCMTGYDMCLSPDELCISQVLKEYGYRTGYIGKWHLDSPELNSVSHPV 119 Query: 113 ECPPEWDA-----------DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 WDA +YW + L ++ W + + + + ET Sbjct: 120 SGAEGWDAYTPPGKMRHGFEYWHAYNAWNDHL---QMHYWEDSSEKIYADAWSPVHET-- 174 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQD 220 ++A++F+ + ++PF + +S++ PH PF P EY +Y + + Sbjct: 175 ------DKALEFMG--SVKEQPFALFLSWNPPHPPFERVPKEYYNRYRNL--------EP 218 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENT 277 DL E R Q G + Y+A +D+Q GR+++ L E + T Sbjct: 219 DLPPNVEGERFDNQTGEPGFGSREELAEAVRCYYAAITGLDEQFGRIVSWLKEMELYDQT 278 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMM 336 V+ T+DHGE +GAH + K Y++ IP ++R P+ + D V +D++PT++ Sbjct: 279 IVLLTADHGEHLGAHGYVGK-HTWYEESINIPFLMRYPEKLPAGRNDISVETVDIVPTLL 337 Query: 337 ALADIEKPEILPGEN----ILAVKEPRGVMVEFNRYEIEHDSF------GGFIPV----R 382 L DI P G I+ K+P V + Y I D F G P R Sbjct: 338 GLLDIAIPPSAEGRCLADWIMCGKKPENEAVYSSAY-ISRDIFLEAYKEKGLDPKRSGWR 396 Query: 383 CWVTDDFKLVLNLFTSDE------LYDRRNDP 408 C T ++K V+ E L+DR DP Sbjct: 397 CIRTPEYKYVIEKGYMPEQIPRFLLFDRIADP 428 >UniRef50_Q1ARG1 Sulfatase n=2 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1ARG1_RUBXD Length = 492 Score = 117 bits (292), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 119/456 (26%), Positives = 199/456 (43%), Gaps = 63/456 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAE-GIRFNSAYTCSPVCTPARAGLF 60 +RPN + ++TD Q VG G + +L + G F +A+ VC P+RA + Sbjct: 45 ERPNLILILTDDQTPGDVGYMPG-------VRALLRDRGTTFRNAFVTDSVCCPSRATIL 97 Query: 61 TGIYANQ---------SGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G YA+ +G + G ST+ + K GY T ++GK+ L+G Y T Sbjct: 98 RGQYAHNHEIAGAKPPAGGFEKFRRLGLERSTVATWLKARGYATGFVGKY-LNG--YLRT 154 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 PP WD Y F+G Y + +L NG N +++ + + +A+ Sbjct: 155 THVPPGWDRWYGFNGGGY------HDFTLNENGRNVSYRGPSSYQTDV------LGRKAL 202 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA-----QDDLANKP 226 F++ AR D PF + +S PH P E ++A + + D+++KP Sbjct: 203 GFVRWAARRDRPFFLHLSPWAPH----GPAEPAPRHARLFARTPLPRPPSFDERDVSDKP 258 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLY---FACNDFVDDQIGRVINALTPE-QRENTWVIYT 282 W + P ++ LY VD+ +GR++ AL Q ENT++ +T Sbjct: 259 R----WVRDNPRLGREEVREMGRLYRNRLRTLRAVDELVGRLVAALRESGQLENTYIFFT 314 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADI 341 SD+G MG H+L Y++ R+PL++R P E R + V + DL PT L Sbjct: 315 SDNGFHMGHHRLPEGKWTAYEEDIRVPLLVRGPGVPEGRVLPHLVLNNDLAPTFGRLGGA 374 Query: 342 EKPEILPGENILAV--KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-- 397 P + G +++ + ++P + + +E G R + V +L+ Sbjct: 375 RVPGYVDGRSLVLLLRRDPPSRRSWRSAFLVEAKRDGAN---RRPAYRALRSVGHLYVEY 431 Query: 398 ---SDELYDRRNDPNEMHNL---IDDIRFADVRSKM 427 ELYD R DP+++ NL +D +RS++ Sbjct: 432 ESGERELYDLRRDPHQLRNLAPRLDGESARKLRSRL 467 >UniRef50_A6DJ24 Iduronate-2-sulfatase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ24_9BACT Length = 497 Score = 117 bits (292), Expect = 1e-24, Method: Compositional matrix adjust. 
Identities = 119/466 (25%), Positives = 195/466 (41%), Gaps = 69/466 (14%) Query: 5 NFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEG-IRFNSAYTCSPVCTPARAGLFTG 62 N LF+ D + +G G P + T N D A G + A++ S VC PAR+ + TG Sbjct: 26 NVLFIAID-DLNDWIGPMGGNPAVKTPNFDKFFANGGMSMYKAHSPSTVCGPARSAIMTG 84 Query: 63 IYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G + N N K++ T+ +F GYH+ GK F E Sbjct: 85 KHCYNTGVYGNDTNLKNAPKAKDLLTIPEWFSKHGYHSLSAGK-------IFHKHPTEKE 137 Query: 118 WDADYW-FDGANYL-SELTEKEISLWRNGLNSVEDLQANHIDETFTWA------------ 163 D W FD + + L K + NGL + Q F W Sbjct: 138 IDHGQWAFDEHHVIKGGLGAKSKAKPANGLLDINGKQMKGKGLEFDWGPTVKNDTTQMKD 197 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL---GEKAQD 220 ++I++ AV+ Q+ + D+PF M V + +PH P+ P +Y + Y EL E + Sbjct: 198 YKIADWAVNQFQKRS-FDKPFFMAVGFSKPHLPWFVPQKYFDMYPLDKIELPEIKENPHE 256 Query: 221 DLANKP----------EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALT 270 + N+ E + W +A V + L Y A FVDD +G +++ L Sbjct: 257 KIVNEKGEFIYGKAFREDSKRWGRAEKYGVTKNALQ---AYMANVTFVDDCLGHLLDGLN 313 Query: 271 PE-QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVS 327 +NT V+ DHG +G K K ++ + TR+PL+++ P ++ D V+ Sbjct: 314 NSPYADNTIVVLWGDHGWHLGEKKRFGK-CLLWQESTRVPLMLKVPGVTPNNKRCDGVVN 372 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 IDL PT+ L +I P+ F + + D + W+ Sbjct: 373 LIDLYPTLSELCNIPV-------------NPKNDGRSFAKLANQPDMKWNKPTLTSWLEG 419 Query: 388 DFKLV------LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 + ++ +N DELYD +NDP E +NL ++ +A + +K+ Sbjct: 420 NHRIYDGRYSYINWRGGDELYDHKNDPLEHNNLANNPEYAKIMAKL 465 >UniRef50_B1I7R7 Sulfatase n=16 Tax=Lactobacillales RepID=B1I7R7_STRPI Length = 491 Score = 116 bits (291), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 122/470 (25%), Positives = 209/470 (44%), Gaps = 52/470 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+PN + ++ D + + S K ++T +D +A+ G F +AY+ P C PARA L Sbjct: 5 KQPNIILIVVDQMRADALSLNSKDKLVSTPTLDMMASVGYNFENAYSPVPSCVPARAALL 64 Query: 61 TGIYANQSGPWT-NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------DGHDYF---- 109 TG+ ++SG + P +T+ + FKD GY T IGK H+ G D+ Sbjct: 65 TGLDQDKSGRVGYQDEVPWNFTNTLPKVFKDMGYQTECIGKMHVFPSRQRLGFDHVLLHD 124 Query: 110 GTGECPPEWDADY--WFD-GANYLSELTEK---EISLWRNGL--NSVEDLQANHIDETFT 161 G ++D Y FD ++YL+ L K ++ L +G+ NS E + DE Sbjct: 125 GYLHVDRKYDKAYGSQFDYASDYLAFLKGKVGYDVDLIDDGMDCNSWEARPWDK-DEKLH 183 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 + + + ++ FLQ+ PF + +S+++PH P P Y + Y E+ Sbjct: 184 PTNWVVSESISFLQR-RDPTVPFFLKMSFEKPHAPLNPPKYYFDMYM-------ERLPQF 235 Query: 222 LANKPEHHRLWAQAMPSPVG-------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR 274 L + + + +PS DD Y +D QI R + AL + Sbjct: 236 LDLHIGNWEVLERQIPSIYALRGKLKEDDQRRMVAAYLGLITHIDHQISRFLTALKEFRH 295 Query: 275 ENTWVI-YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG----ERRQVDTPVSHI 329 + +I + SDHG+ +G H L KG Y IP I P G R + V Sbjct: 296 DKDTIIWFVSDHGDQLGEHYLFRKGYP-YQGSIHIPSFIYDPAGLIAGNRGTIKQLVKIQ 354 Query: 330 DLLPTMMALADIEKPEILPGENI--LAVKEPRGVMVEFN-RYEIEHDSFGGFIPVRCWVT 386 D+ P+++ LA + L G ++ L + G EF+ + + DS + +T Sbjct: 355 DIFPSLVDLAGGTTTDELDGRSVKNLLFGQYEGWRTEFHGEHALGKDS------SQYILT 408 Query: 387 DDFKLV-LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 D +K + + +L+D + DP+EM++L ++ + +M L+D++ Sbjct: 409 DQWKFIWFPVLNHYQLFDMKKDPHEMNDLYPSEKYQPIVRQMKKKLVDFL 458 >UniRef50_A6DHY1 Mucin-desulfating sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHY1_9BACT Length = 545 Score = 116 bits (291), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 113/462 (24%), Positives = 202/462 (43%), Gaps = 46/462 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q + +GC + T +ID L+ G+ F+S YT +P+C +RA TG Sbjct: 21 RPNIIMLLTDDQRYDTLGCMGNDQVKTPHIDKLSERGVTFDSHYTNTPICLGSRASTMTG 80 Query: 63 IYANQSGPWTNNVAPGKNISTMGRY---FKDAGYHTCYIGKWH--LDGHDYFGTGECPPE 117 +Y +G ++ + + Y ++ GY T +IGK+ ++ +Y P + Sbjct: 81 MYEYTNGCNFSHGFLSQELWDEMSYPVILRNNGYFTGFIGKFGFPVNAKNYHEYENLPID 140 Query: 118 -WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 +D Y + G Y K + + V T A + A +F+ + Sbjct: 141 SFDRWYGWTGQGYFDTSKNKYMVKFAKEYPHV------------TLA--TAEAACEFIDE 186 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK-PEHHRLWAQA 235 + D+PF + +S+ H PF+ Y + Y D ++ + A K P +L Q Sbjct: 187 AQKQDKPFCLSLSFKASHKPFSPDPAYDDVYKDTVWKKRANYDEGGARKLPPQAKLGRQY 246 Query: 236 M------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEM 288 + P + ++ L + +D +G+++ L +NT +IY +D+G Sbjct: 247 LTIDDFAPEKYQESMRKYNQLIYG----IDQAVGKIVEKLDQTGLSKNTVIIYATDNGYS 302 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPEI 346 G+H K Y+ R P+II P+ ++ ++ ++D+ PT+ LA I P Sbjct: 303 CGSHGFGGK-VLPYEGPARGPMIIMDPRSDQTGKRSKGVSGNVDIHPTICDLAGIAIPAK 361 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFKLVLNLFTSD----- 399 + G+++L V + + V R + +F G VT+D+K + F D Sbjct: 362 VDGKSLLPVLKDSEIRV---RKAMPVFNFWGSAATHEMTMVTEDYKYIYWYFEGDGMVAA 418 Query: 400 -ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 ELY R D EM+NL+++ A +M + IRD Sbjct: 419 EELYHRHKDSAEMNNLVNNPEMALKLEEMRQLFDAQVQHIRD 460 >UniRef50_A6DG79 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG79_9BACT Length = 486 Score = 116 bits (291), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 116/458 (25%), Positives = 184/458 (40%), Gaps = 88/458 (19%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPNF+F+M D G K + T ++D++A EG RF Y+ PVC P R T Sbjct: 30 ERPNFIFLMADDLGYGDTGFNGNKIIKTPHLDNMAKEGARFTHFYSIGPVCAPTRGSALT 89 Query: 62 GIYANQSGPWTNNVA--PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT-------G 112 G + + G NV P + I T+ R K GY T + GKWHL + Sbjct: 90 GRHYMRYGMMDVNVGKLPHQEI-TIARLCKQQGYTTGHFGKWHLGTLSKIESPRHKNPAK 148 Query: 113 ECPPEWDADYWFDGANYLSELT---------EKEISLWRNGLNSVEDLQANHIDETFTWA 163 + P W+ DY A +S T + + W NG ++L + + Sbjct: 149 DFAPPWERDYDDAFATEISVPTWDPAAGRYPKHDSPYWHNGQKVTDNLLGDD-------S 201 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 I +RA+ F+++ + F+ + + PH P EYL+ Y Y+ GE+ Sbjct: 202 RVIMDRAIPFIRKAVTDKKSFMTTIWFHTPHSPVVAGPEYLKMYEG--YKEGEQH----- 254 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALT------------- 270 Y+ C +D+QIGR+ L Sbjct: 255 ---------------------------YYGCITAMDEQIGRLREELRKLNVDQNTIIWFC 287 Query: 271 ----PEQRENTWVIYTSDHGEMMG-AHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVD 323 PE R N Y + HG G A KL + ++Y+ +P ++ P + ++ Sbjct: 288 SDNGPEGRGNPKKKYDAYHGAFYGTAGKLRGRKRSLYNGGVCVPALVSWPGKIDAGKVIN 347 Query: 324 TPVSHIDLLPTMMALADIEKPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV 381 TP S +D L + +A +++ P+ P GENI+ + G + N+ I P Sbjct: 348 TPCSTLDYLESTLAQMNVKYPDSRPLDGENIIPIL--LGKTNKRNKPIIFASKISS--PK 403 Query: 382 RCWVTDDFKLVLNL--FTSDELYDRRNDPNEMHNLIDD 417 + D+K NL DELY +D +E N+I + Sbjct: 404 TSIIQGDYKFCSNLDGKNKDELYHLHDDFSESKNIIKN 441 >UniRef50_UPI00016C500A sulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C500A Length = 472 Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 117/458 (25%), Positives = 196/458 (42%), Gaps = 49/458 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +MTD Q + + C L T N+D +A EG R+ +A+ + +C P+RA L TG Sbjct: 22 RPNIVVMMTDDQRHDYMSCAGHPFLKTPNMDRIAKEGFRYTNAFVTNALCAPSRATLMTG 81 Query: 63 IYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 Y++ +G N N + + GY + GK H+ GH T WD Sbjct: 82 QYSHLNGVRDNMGTTLNPNAPWLPDELRKLGYEVAFCGKSHVPGHFRDKT------WDYY 135 Query: 122 YWFDG-ANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + F G NYL L + + G + D ID+ ++++A+ ++++P RA Sbjct: 136 FGFQGQGNYLKPLIAESGPDGKIGPDKPYD---GWIDDV------VTDKALAWVKKP-RA 185 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM---P 237 +PF + + + PH + + + YA + D KP A + P Sbjct: 186 -KPFALFLFFKSPHRAWQPAARHKDLYAGAAVKKPALWDDPGQGKPRAFLQAANMIGQYP 244 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLIS 296 DG+ Y C VDD +G+V+N L ++ + T V+YTSD+G +G + Sbjct: 245 DTKDYDGMIRD--YARCITGVDDNVGKVLNTLDEQKIADTTAVMYTSDNGFFLGEWQRFD 302 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQVDTP-------VSHIDLLPTMMALADIEKPEILPG 349 K M++ R+PL+++ P+ + P V + D+ PT++ LA P+ + G Sbjct: 303 K-RFMHEPSVRVPLLLKVPKALAKDCVPPGSQPGAMVINPDIAPTVLELAGGAPPKAMQG 361 Query: 350 ENIL--AVKEPRGVM--------VEFNRYEIEHDSFGGFIPVRCWVTDDFKLV------L 393 ++L A P G + + Y D R T +KL+ Sbjct: 362 RSVLPFARLPPAGPLPPEMAPREAWYYEYFEFPDPSHNVEKQRGVRTTKWKLIHYYDPPF 421 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + ELYD DP E NL + F ++ + + Sbjct: 422 KFKDAYELYDLEKDPEERVNLANRPAFQGTVKELQEKM 459 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 116/435 (26%), Positives = 178/435 (40%), Gaps = 65/435 (14%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + ++ D VGCY + + T N+D+LA +G RF ++ P+CTP RA L TG Sbjct: 19 PNVILIVADDMGYGDVGCYGNQHIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLTGC 78 Query: 64 YANQSG----PWTNNVAPGKNIS----TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 Y + G P A K +S T K GY T +GKWHL F P Sbjct: 79 YQQRVGLHIIPKDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLGDRPAF----LP 134 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT----WAHRISNRAV 171 P D +F G Y ++ WR + ++ I E + AV Sbjct: 135 PRQGFDEYF-GIPY-----SHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAV 188 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 F+ + D PFL+ + + PH PV E++A R Sbjct: 189 KFISK--NKDRPFLLYMPHPMPHQ----PVHVSERFAK--------------------RF 222 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG 290 + + + G+D LY A + +D +G +I A+ E+T+V +TSD+G +G Sbjct: 223 SKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVRALGIEESTFVAFTSDNGPAIG 282 Query: 291 -AHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKP-EI 346 A L K +++ R+P I + R V D +DL PTM A+ P + Sbjct: 283 SAGPLRGKKRELWEGGHRVPFIAYWQEKIRPGVVIDEIAMSMDLFPTMAAMGRAPLPRKK 342 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-----TSDEL 401 + G N+L ++ E ++ E F + +KL++ TS L Sbjct: 343 IDGVNLLP------LLCEGDKLS-ERTVFWRSKGKKAARKGPWKLLMQPTKKKRPTSIGL 395 Query: 402 YDRRNDPNEMHNLID 416 Y ND +E HNL + Sbjct: 396 YHLNNDLSEQHNLAE 410 >UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=Q482D6_COLP3 Length = 492 Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 121/451 (26%), Positives = 206/451 (45%), Gaps = 68/451 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D + Y T NID LAA+G++F++AY P C P+R +F+G Sbjct: 30 KPNVVMLLVDDFGRQDLSTYGSNFYETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFSG 89 Query: 63 IYANQSG-PWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHL--DGHDYFGTGEC 114 Y + G P V GK + T G + K+AGY T YIGKWHL +G D G Sbjct: 90 SYPTRYGVPQGERV--GKHHLPLSAVTFGEHLKEAGYQTGYIGKWHLGKEGGDPTKQG-F 146 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 A +W +Y T+ S G VE + ++ + R+++ A+ F+ Sbjct: 147 DSSIMAGHWGAPPSYYFPYTKMSKSGKNKGFAKVEGSEEEYLTD------RLTDEALTFI 200 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG------EKAQDDLANKPEH 228 +Q + D+PFL+V+++ H P ++KY +LG + D + + + Sbjct: 201 EQ--KKDQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGY 258 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 H+ + ++P Y A + VD +GR+ L E NT +I TSDHG Sbjct: 259 HKT-------------IQNNPDYAAMVESVDISVGRIEQQLKRLGLEDNTIIILTSDHGG 305 Query: 288 M----MGAHKLISKG--------AAMYDDITRIPLIIRSPQ----GERRQVDTPVSHIDL 331 + + ++++++ +YD TR+PLI++ P+ G QV V+ D Sbjct: 306 LSSRGLKSNRVLATSNNPYRHGKGWIYDGGTRVPLIVKWPEKVKAGSISQVQ--VTGTDH 363 Query: 332 LPTMMALA--DIEKPEILPGENILAV----KEPRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 PT++ +A + + G + LA + PR M F S G + Sbjct: 364 YPTILQMAGLSLSPKDHQDGVSYLAALNSDETPRKAM--FWHSPAARPSKTGDTNSSAII 421 Query: 386 TDDFKLVLNLFTSD--ELYDRRNDPNEMHNL 414 ++KL L+ +++ ELY+ ++D +E +NL Sbjct: 422 EGEWKL-LDFWSTGKVELYNLKDDKSEANNL 451 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 115 bits (289), Expect = 3e-24, Method: Compositional matrix adjust. 
Identities = 119/443 (26%), Positives = 184/443 (41%), Gaps = 54/443 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L + D +G T ID+LA G RF Y+ PVC+P RA L TG Sbjct: 28 KPNVLLIFIDDLGKTDIGIEGSSFYETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTG 87 Query: 63 IYANQSG--PWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + G W ++VA ++ T+G+ F++AGYHT Y+GKWHL G P + Sbjct: 88 KMPQRLGITDWIRPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHL--------GHKPQQ 139 Query: 118 WDADYWFD---GANYLSELTEKEISLWRN-----GLNSVEDLQANHIDETFTWAHRISNR 169 A FD G N+ + + ++N N+V D + ++ T +++ Sbjct: 140 HPAARGFDWTKGVNHGGQPSSYYFP-YKNPQKPDAPNNVPDFEKCQPEDYLT--DVLTSS 196 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 A++ LQQ R PF + +++ H P P +EKY Q LA + Sbjct: 197 AIEHLQQRDRT-RPFFLCLAHYAVHTPIQPPKNLVEKY-----------QVKLATQKNPK 244 Query: 230 RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEM 288 HP Y A + +D Q+GR+++ L T + T V++TSD+G + Sbjct: 245 SPGEGIQEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQGILDQTIVVFTSDNGGL 304 Query: 289 MGAH----------KLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMA 337 + L + Y+ RIP I P QV D P D+ PT+++ Sbjct: 305 CTLNGKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPGKISPQVLDIPAYTCDIYPTLLS 364 Query: 338 LADI--EKPEILPGENILAVKEPRGVMVEFNRYEI---EHDSFGGFIPVRCWVTDDFKLV 392 L I + + G ++ + + E R + H G P +KL+ Sbjct: 365 LCQIPPRPTQHVDGISLAGLLTKSSSLPESERTLVWYYPHTHGSGHKPSAAIRQGPWKLI 424 Query: 393 LNLFTSD-ELYDRRNDPNEMHNL 414 L T ELY +DP E NL Sbjct: 425 HFLETDRIELYHLEDDPGESRNL 447 >UniRef50_Q7UYD2 Sulfatase 1 n=1 Tax=Rhodopirellula baltica RepID=Q7UYD2_RHOBA Length = 478 Score = 115 bits (289), Expect = 3e-24, Method: Compositional matrix adjust. 
Identities = 94/359 (26%), Positives = 157/359 (43%), Gaps = 30/359 (8%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + ++ D N +G Y P+ T ++D LAA GIRF AY+ + VC+P+R LFTG Sbjct: 60 PNIVLILADDLGFNQIGAYGDTPIQTPHLDQLAANGIRFTQAYSGNTVCSPSRVSLFTGR 119 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 +N V T+ K AGY T GK+ + G + P D W Sbjct: 120 DGRLMDNNSNTVQLKDIDVTIAHVLKHAGYDTALFGKYSIGSQ--MGVTD-PLAMGFDTW 176 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI-SNRAVDFLQQPARADE 182 + + L + LWR+G ++ N +A + ++ A+ +++Q D Sbjct: 177 YGMYSILEGHRQYPTILWRDGKKL--RIEENEAGRKGAYAQALFTHEAIQYIKQ--DHDN 232 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGD 242 PF ++++Y PH P E++E+Y D + E + P W P PV Sbjct: 233 PFFVLLAYSSPHAELAAPPEFVERYKDAFPETRYGGMSN--GTPSDKYAW--YYPEPVER 288 Query: 243 DGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG--EMMGAHKLISKGA 299 H + +D +G++ +L + +NT +++TSD+G + G + + Sbjct: 289 P----HAVLAGMVTALDAYVGQIYQSLESKGIADNTLILFTSDNGPHDEGGGDPTFFRAS 344 Query: 300 A--------MYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPEILP 348 +YD +P+I P R R DTP + D+LPT +A + +I+P Sbjct: 345 EPYKGMKRDLYDGGIHVPMIAHWPAAIRSPRVDDTPWAFADVLPTFADIAGVSL-DIVP 402 >UniRef50_C9L4I6 Arylsulfatase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4I6_RUMHA Length = 514 Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust. 
Identities = 120/473 (25%), Positives = 212/473 (44%), Gaps = 54/473 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 + +PN L +MTD + +G ++G P + T +D+LA+ G+ F++AY+ P C ARA L Sbjct: 26 LNKPNILLIMTDQLRGDCLG-FAGHPDVKTPYLDTLASRGVSFDNAYSSCPSCIAARAAL 84 Query: 60 FTGI---YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG-HDYFGTGECP 115 TG+ + ++G + +N+ P + +TM AGY+ +GK H+ +Y G Sbjct: 85 HTGMAQEHHRRTG-YEDNI-PWEYPNTMAGELSKAGYYCQCVGKMHVHPLRNYLGFHNVE 142 Query: 116 PEWDADYWFDGANYLS---ELTEKEISLWRNGLNSVEDLQANHID---ETFTWAHR---- 165 D + A Y + + ++K + + L + + + D E +W R Sbjct: 143 LH---DGYLHSARYTNVPWQESQKNADDYFHWLKQEKGIDTDVTDTGLECNSWVARPWIY 199 Query: 166 ---------ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 +++R++DFL++ +PF ++ SY PH PF P Y + Y EL Sbjct: 200 EEKYHPTNWVTDRSIDFLRR-KDPQKPFFLMASYLRPHPPFDAPSYYFDLYNK--KELTP 256 Query: 217 KAQDDLANKPEHHRLWAQAMPSPVG--DDGLYHHPL--YFACNDFVDDQIGRVINALTPE 272 A D E + + S G D L Y+AC +D QIGR+I AL Sbjct: 257 PAVGDWETTEELQAM-GRVFDSKCGPSDAVLIREAQIGYYACITHLDHQIGRLIQALVEY 315 Query: 273 Q-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR------SPQGERRQVDTP 325 +NT +++TSDHGE + H + K Y RIP+I+ S + + Sbjct: 316 GVYDNTLILFTSDHGEELCDHHMFRKSRP-YQGSIRIPMIVSGNDKFLSGMKQGTVSHSV 374 Query: 326 VSHIDLLPTMMALADIEKPEILPGENIL-AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCW 384 V D++PT++ + + P+ + G+++L V P + + E + + W Sbjct: 375 VELRDVMPTLLDFVNADIPDSVDGKSMLPLVTNPDEKLRDVLHGEHSYGPYSNH-----W 429 Query: 385 VTDDF-KLVLNLFTSDELYDR-RNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 + + K + T E Y R DP E+H+ I + + ++ L++ + Sbjct: 430 LVSSYDKFIWYSETGTEQYFRISEDPKELHDEISNPAYQKRIEQLRRTLIETL 482 >UniRef50_C5C4L8 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C4L8_BEUC1 Length = 489 Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust. 
Identities = 124/465 (26%), Positives = 193/465 (41%), Gaps = 57/465 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQN-IDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN L +MTD + G PL+T +DSL G F +AYT +P+C PAR + TG Sbjct: 7 PNVLLIMTDQHRADFTRG-RGFPLDTMPFLDSLGRAGTVFGNAYTSAPLCVPARVSMLTG 65 Query: 63 IYANQSGPWTNNVAP----GKNISTMGRYFKDAGYHTCYIGKWHLD-GHDYFGTGECPPE 117 + + N+ G ++ + + AGY + GK H+ G D F P Sbjct: 66 RFPSAHRVRQNSTGQHALYGDDLLDV---LRAAGYRLGFAGKPHIHRGLDDFDAHRGP-- 120 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 Y DG + ++ W + L+ + +RI++ A+D + + Sbjct: 121 ----YMHDGGTARTA-DDQAFDAWLHDLDHAVHPEPTPFPLERQLPYRITSDAMDVIDEL 175 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 D+PF VS+ EPH+P+ P Y + + + LA K +R + Sbjct: 176 GE-DDPFFCWVSFPEPHNPYQVPEPYFSLFDEDEIPERLAGPEALAGKGRTYRWLGDLIG 234 Query: 238 S--PVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKL 294 + P D+ + Y +DDQ+ R++ L R +T + + SDHG+ +G + L Sbjct: 235 TKRPGYDEAWRRYAANYCGMLRLIDDQLRRLVGHLGDRAR-DTVIAFVSDHGDFVGEYGL 293 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADIEKPEILPGENI 352 KGA M + + RIP + P G R V PVS +DL PT+ LA + P + G ++ Sbjct: 294 QRKGAGMSEFLMRIPFQLSGP-GVRPGGVVAEPVSMVDLFPTLCELAGQDIPLGVQGRSL 352 Query: 353 --LAVKEPRGVMVEFNRYEIEHDSFGGF-----------IPVRCWVTDDF---------- 389 L EP EF E FGG P D+ Sbjct: 353 EPLLRGEP-APAEEFTSVYGEL-GFGGVPYDDDERPPLHFPYDGATFDELNTVTMSGGSR 410 Query: 390 -------KLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKM 427 KLV+++ + ELYD DP E+ + D A VR ++ Sbjct: 411 MVRAGRHKLVVDVDGNVELYDVEADPAELVDRAADPALATVRDEL 455 >UniRef50_D2R201 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R201_9PLAN Length = 511 Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust. 
Identities = 120/455 (26%), Positives = 201/455 (44%), Gaps = 36/455 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRP+ LF+ D Q + +G G P T ++D+LAA G +A+ +P+C P+R L Sbjct: 39 KRPHILFIAIDDQ-NDWIGHLGGHPYAKTPHLDALAARGTTLANAHCQAPLCNPSRTSLM 97 Query: 61 TGIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G+ +G PW + + ++ +Y + GY T GK + H G + Sbjct: 98 FGLRPTSTGIYGLAPWIRTLPEFEKRVSLPQYLQQHGYRTLTTGKIY---HGGLGPKKRL 154 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDL-QANHIDETFTWAHRISNRAVDFL 174 E+D W ++ +K I G + + D + +H DE ++I++ A++ L Sbjct: 155 EEFDV--WGPAGGIGAKPEKKLIPPTPMGNHPLMDWGKFDHRDEDKG-DYQITSWAIEQL 211 Query: 175 --QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 Q A+ P + V Y PH P ++ ++ L A DD ++ P Sbjct: 212 DDQVQHHAETPMFLSVGYFLPHVPCFISPKWYDEVPQGDKLLPLVAADDRSDIPRFAWYL 271 Query: 233 AQAMPSP----VGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQR---ENTWVIYTS 283 ++P P V D + + + Y A FVD QIGR++ AL E+R ++T V+ Sbjct: 272 HWSLPEPRLKWVEDHRQWENLVRSYLASTTFVDAQIGRLLTAL--EERKLADDTIVVVWG 329 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIE 342 DHG +G K I+ +++ TR+PLI P +QV P +D+ PT++ LA + Sbjct: 330 DHGWHLG-EKGITGKNTLWERSTRVPLIFAGPGITPKQVCGEPAELLDIFPTLVELAGLP 388 Query: 343 KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELY 402 L G ++ E+ + R ++ + S+ELY Sbjct: 389 PRNDLEGHSLAPQLRDASRQREWPAITSHNQGNHAIRSAR------YRYITYADGSEELY 442 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 D ++DP E+ NL D +A V + H L MD+ Sbjct: 443 DMQSDPRELTNLASDSTWASVIAD-HRRWLPKMDR 476 >UniRef50_A4GIB1 Arylsulfatase n=2 Tax=Bacteria RepID=A4GIB1_9BACT Length = 608 Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust. 
Identities = 120/477 (25%), Positives = 198/477 (41%), Gaps = 88/477 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +MTD Q V + L T N+D L + +R S Y +P+CTP R L TG Sbjct: 19 KPNVLIIMTDDQGYPEVSAHGNPVLQTPNLDRLHGQSLRL-SDYHVAPMCTPTRGQLLTG 77 Query: 63 IYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYF-------- 109 + A ++G NV+ G+ +ST+ Y+++AGY T GKWHL + F Sbjct: 78 LDAARNG--AVNVSSGRALLRPEVSTIANYYEEAGYSTGVFGKWHLGANYPFRPQDRGFQ 135 Query: 110 --------GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 P W DY+ D Y+ EK + + F Sbjct: 136 ESVWYPSSSIPSVPAYWGNDYFDD--VYIHNGKEKRFE--------------GYCADVFF 179 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 N A+ F+ + A++ +PF+ ++ + PH PF E ++ A+ L + D+ Sbjct: 180 ------NEAMRFMSESAKSKKPFMCYLATNTPHGPFWPKEEDRKEIAEV---LAQSKFDN 230 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 L N + RL LY +D +G ++ L E E+T +I Sbjct: 231 LDNNLK-KRL-----------------ALYLGMIRNIDWNMGNLLKFLKEENLAEDTILI 272 Query: 281 YTSDHGEMMGAH----KLISKGAAMYDDITRIPLIIRSPQ---GERRQVDTPVSHIDLLP 333 + +D+G ++G + K +++ R+P IR P G+ R + D+LP Sbjct: 273 FKTDNGSLLGPQYFNAGMRGKKTEIWEGGHRVPCFIRWPNGGFGKARDIGGLTQVQDILP 332 Query: 334 TMMALADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV-----RCWVT 386 T++ L I+ K G ++ +V + + E I + GF + + Sbjct: 333 TVLDLCGIKPRKNTKFDGISLASVLRGKKKVSEDRTIIINYSRMPGFSNYPSPHSQTQMR 392 Query: 387 DDFKLVL----NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 D VL L ELYD +DP + N+ID + +V +KM L + D ++ Sbjct: 393 ADQAAVLWKRWRLLEDRELYDLASDPLQQKNVID--QHPEVVAKMRQQLYSWWDGVK 447 >UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhodopirellula baltica RepID=Q7ULE7_RHOBA Length = 1049 Score = 115 bits (287), Expect = 5e-24, Method: Compositional matrix adjust. 
Identities = 124/439 (28%), Positives = 194/439 (44%), Gaps = 45/439 (10%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLN-TQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN LF+ D + +GC G P T N+D LAA GI F +A+ +P C P R+ +FTG Sbjct: 31 PNVLFIAMD-DLNDWIGCLGGHPQTITPNLDRLAASGILFTNAHCPAPACNPCRSAVFTG 89 Query: 63 IYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 NQSG + N V P I + +Y ++ GYH GK D E P+ Sbjct: 90 RAPNQSGLYDNRQQMREVMPDDVI--LPQYMRNHGYHASGSGKLLHYFIDAASWDEYFPK 147 Query: 118 WDADYWFDGANYLSELTEKEISLWRNG-LNSVEDLQA--NHIDETFTWAHRISNRAVDFL 174 +++ F Y S ++ ++L R G VE A + DE F +S + L Sbjct: 148 AESENPFPQTFYPS---QRPVNLKRGGPWQYVETDWAALDVTDEEFGGDWAVSQWIGEQL 204 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG----EKAQDD---LANKPE 227 QQ + D+PF + PH P+ P +Y E + +L E DD + + Sbjct: 205 QQ--KHDQPFFLGCGIYRPHEPWFVPKKYFEPFPLDSIQLPPGYLENDLDDVPPIGQRAA 262 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHG 286 +R +A G+ Y A F D +GR+++AL + +NT V+ SDHG Sbjct: 263 RNRYFAHIQKQDQWKQGIQG---YLASIHFADAMLGRLLDALESGPNADNTIVVLWSDHG 319 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGER----------RQVDTPVSHIDLLPTMM 336 +G + K + +TR+PL+IR P+ + D PV+ + L PT++ Sbjct: 320 WQLGEKEHWQKYTP-WRGVTRVPLMIRVPKTSSPSLPNGTPIGARCDAPVNLLSLFPTVL 378 Query: 337 ALADIEKPEILPGENILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 L + + G ++L +KEP+ + N + + S G V + + Sbjct: 379 DLCQLPSNPVNDGPSLLPLLKEPKTDTWKHN--SVTYLSHPGAYAVS---GRTHRYIHYQ 433 Query: 396 FTSDELYDRRNDPNEMHNL 414 S+ELY+ DP E +NL Sbjct: 434 DGSEELYNIEADPYEWNNL 452 Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust. 
Identities = 103/400 (25%), Positives = 169/400 (42%), Gaps = 107/400 (26%) Query: 3 RPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN + ++TD Q + C + + T +ID LAA G+R +AY +P C+P+RAGL T Sbjct: 581 KPNVVVILTDDQGWADLSCQNEVDDIQTPHIDGLAARGVRCTNAYVTAPQCSPSRAGLIT 640 Query: 62 GIYANQSGPWTNNVAP-GKNISTMGRYFKDAGYHTCYIGKWHLDGH----DYFGTGECPP 116 G Y + G T P N T+ + + GY T ++GKWHL+ + D+ E P Sbjct: 641 GRYQQRLGIDTIPDMPLPTNAVTIAEHLQPKGYKTGFVGKWHLEPNVTCIDWM-RRELPA 699 Query: 117 ------------------------EWDADYWFDGANYLS--ELTEKEISLWRNGLNSVED 150 +D YW + NY + +LT E+ L ++ Sbjct: 700 MAGKPRRKVRIPWNKIEPYSPSQQGFDEYYWGERTNYRTNFDLTSGEL------LAEMKP 753 Query: 151 LQANHIDETFTWAHRI---SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 ++ DE F RI +N AV F+Q+ D+PF + ++Y PH P +YL+++ Sbjct: 754 IR----DERF----RIDVQTNAAVKFIQR--NHDQPFYLQLNYYGPHTPLEATQKYLDRF 803 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVIN 267 PE R +A AM S +DD +G++++ Sbjct: 804 P--------------GPMPERRR-YALAMIS------------------AIDDGVGQIVD 830 Query: 268 ALTPEQ-RENTWVIYTSDHGEMMGAHKL-------------------ISKGAAMYDDITR 307 L E +NT ++ TSD+G + K + + + + R Sbjct: 831 QLKAEGVLDNTLIVMTSDNGAPLKMTKTDSPINGDAGGWDGSLNDPWVGEKGMLSEGGIR 890 Query: 308 IPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPE 345 +P+I P + D PVS +D+ P+++ LA E P Sbjct: 891 VPMIWSLPTQLPSGITYDWPVSALDIAPSVLKLAGGELPS 930 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P31447 Uncharacterized sulfatase yidJ n=52 Tax=Enteroba... 743 0.0 UniRef50_C9L4Q0 Putative sulfatase YidJ n=2 Tax=Blautia hansenii... 504 e-141 UniRef50_B0N997 Putative uncharacterized protein n=1 Tax=Clostri... 503 e-141 UniRef50_C4G6V3 Putative uncharacterized protein n=1 Tax=Abiotro... 460 e-128 UniRef50_C6D1Q0 Sulfatase n=2 Tax=Paenibacillus sp. JDR-2 RepID=... 415 e-114 UniRef50_C5EHR5 Putative uncharacterized protein n=1 Tax=Clostri... 
406 e-111 UniRef50_A6DLX7 Putative sulfatase n=1 Tax=Lentisphaera araneosa... 399 e-109 UniRef50_C6J3H9 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 7... 396 e-109 UniRef50_A4AWR8 Iduronate-2-sulfatase n=5 Tax=Bacteria RepID=A4A... 395 e-108 UniRef50_C6J2Z0 Sulfatase n=4 Tax=Firmicutes RepID=C6J2Z0_9BACL 395 e-108 UniRef50_B9XEU8 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XEU... 395 e-108 UniRef50_B9XGI2 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XGI... 394 e-108 UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase... 389 e-106 UniRef50_C6J5I7 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 7... 386 e-106 UniRef50_B7AMH4 Putative uncharacterized protein n=1 Tax=Bactero... 386 e-106 UniRef50_C0QY53 Sulfatase n=2 Tax=Brachyspira RepID=C0QY53_BRAHW 386 e-105 UniRef50_UPI0001C36159 sulfatase n=2 Tax=Clostridium hathewayi D... 386 e-105 UniRef50_D2MLH4 Sulfatase family protein n=1 Tax=Candidatus Pori... 384 e-105 UniRef50_D2QWC7 Sulfatase n=5 Tax=Bacteria RepID=D2QWC7_9PLAN 384 e-105 UniRef50_Q7UH28 Mucin-desulfating sulfatase (N-acetylglucosamine... 384 e-105 UniRef50_B4D6H3 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 382 e-104 UniRef50_B5JYP8 Choline-sulfatase n=1 Tax=Octadecabacter antarct... 382 e-104 UniRef50_B9XND0 Sulfatase n=3 Tax=Bacteria RepID=B9XND0_9BACT 381 e-104 UniRef50_A6DPE5 Iduronate-2-sulfatase n=2 Tax=Lentisphaera arane... 381 e-104 UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase... 381 e-104 UniRef50_Q029P1 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 381 e-104 UniRef50_C7MEQ7 Choline-sulfatase n=1 Tax=Brachybacterium faeciu... 380 e-104 UniRef50_C3WCE8 Arylsulfatase n=2 Tax=Fusobacterium RepID=C3WCE8... 379 e-103 UniRef50_UPI000051016C choline-sulfatase n=1 Tax=Brevibacterium ... 379 e-103 UniRef50_A6C9F6 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris... 378 e-103 UniRef50_A6C8U0 Choline sulfatase n=1 Tax=Planctomyces maris DSM... 
377 e-103 UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE 377 e-103 UniRef50_Q482B9 Sulfatase family protein n=1 Tax=Colwellia psych... 377 e-103 UniRef50_A9ECS8 Sulfatase n=3 Tax=Bacteria RepID=A9ECS8_9FLAO 376 e-102 UniRef50_A4A280 Iduronate-2-sulfatase n=1 Tax=Blastopirellula ma... 376 e-102 UniRef50_C9L086 Mucin-desulfating sulfatase n=54 Tax=Bacteria Re... 375 e-102 UniRef50_Q7UJ67 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 375 e-102 UniRef50_C5BVK2 Sulfatase n=11 Tax=Actinomycetales RepID=C5BVK2_... 375 e-102 UniRef50_C6J5I8 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 7... 375 e-102 UniRef50_Q7UMT6 Mucin-desulfating sulfatase (N-acetylglucosamine... 374 e-102 UniRef50_C6VXD1 Sulfatase n=4 Tax=Bacteria RepID=C6VXD1_DYAFD 374 e-102 UniRef50_A4CMA4 Mucin-desulfating sulfatase (N-acetylglucosamine... 374 e-102 UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D... 374 e-102 UniRef50_B8FL44 Sulfatase n=1 Tax=Desulfatibacillum alkenivorans... 373 e-102 UniRef50_A6DNI8 Putative N-acetylglucosamine-6-sulfatase n=1 Tax... 373 e-101 UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT 372 e-101 UniRef50_A0LYA0 Sulfatase n=8 Tax=Bacteria RepID=A0LYA0_GRAFK 372 e-101 UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep... 372 e-101 UniRef50_D2RQH7 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5... 372 e-101 UniRef50_A4AP83 Putative sulfatase n=1 Tax=Flavobacteriales bact... 371 e-101 UniRef50_B4D780 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 371 e-101 UniRef50_Q7NMX5 Gll0640 protein n=1 Tax=Gloeobacter violaceus Re... 371 e-101 UniRef50_Q7UW58 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 368 e-100 UniRef50_D2R1A1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 367 e-100 UniRef50_C0G116 Sulfatase n=1 Tax=Natrialba magadii ATCC 43099 R... 367 e-100 UniRef50_A4U8Q3 Sulfatase n=2 Tax=Bacteria RepID=A4U8Q3_9BACT 367 e-100 UniRef50_A6LF65 Choline-sulfatase n=26 Tax=Bacteroidales RepID=A... 
365 2e-99 UniRef50_A0JVM4 Sulfatase n=2 Tax=Actinomycetales RepID=A0JVM4_A... 365 2e-99 UniRef50_UPI0001745B0B sulfatase n=1 Tax=Verrucomicrobium spinos... 365 2e-99 UniRef50_A7LY81 Putative uncharacterized protein n=5 Tax=Bactero... 365 2e-99 UniRef50_A6DKS7 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisph... 365 2e-99 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 365 3e-99 UniRef50_Q01ZJ7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 365 3e-99 UniRef50_C5HLB2 Putative sulfatase n=1 Tax=uncultured bacterium ... 364 3e-99 UniRef50_Q7MBV5 Arylsulfatase A n=31 Tax=Bacteria RepID=Q7MBV5_V... 364 4e-99 UniRef50_C6D448 Sulfatase n=2 Tax=Bacteria RepID=C6D448_PAESJ 364 4e-99 UniRef50_A6DME6 Sulfatase family protein n=1 Tax=Lentisphaera ar... 364 6e-99 UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 363 7e-99 UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD 363 9e-99 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 363 1e-98 UniRef50_C3WAQ9 Sulfatase n=1 Tax=Fusobacterium mortiferum ATCC ... 363 1e-98 UniRef50_A5FX90 Sulfatase n=4 Tax=Alphaproteobacteria RepID=A5FX... 363 1e-98 UniRef50_D2QL61 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 363 1e-98 UniRef50_A6DM50 Choline sulfatase n=6 Tax=Bacteria RepID=A6DM50_... 362 2e-98 UniRef50_A6CFT9 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae ... 362 2e-98 UniRef50_A3P379 Choline-sulfatase n=63 Tax=cellular organisms Re... 361 3e-98 UniRef50_Q127E2 Sulfatase n=1 Tax=Polaromonas sp. JS666 RepID=Q1... 361 5e-98 UniRef50_D0DCV9 Choline-sulfatase n=2 Tax=Citreicella sp. SE45 R... 360 6e-98 UniRef50_B4D026 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 360 6e-98 UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 359 1e-97 UniRef50_C9L4R7 Putative sulfatase YidJ n=1 Tax=Blautia hansenii... 359 1e-97 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 
359 1e-97 UniRef50_A3SJ21 Sulfatase n=1 Tax=Roseovarius nubinhibens ISM Re... 359 2e-97 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 359 2e-97 UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Ta... 358 2e-97 UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob... 358 3e-97 UniRef50_C0S8M2 Choline sulfatase n=8 Tax=Eurotiomycetidae RepID... 358 3e-97 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 358 3e-97 UniRef50_Q1IH24 Choline sulfatase n=29 Tax=cellular organisms Re... 358 4e-97 UniRef50_UPI00017453D4 choline sulfatase n=1 Tax=Verrucomicrobiu... 357 6e-97 UniRef50_Q7WC54 Putative sulfatase n=3 Tax=Proteobacteria RepID=... 357 6e-97 UniRef50_A6DNH0 Choline sulfatase n=1 Tax=Lentisphaera araneosa ... 357 7e-97 UniRef50_Q46P27 Sulfatase n=3 Tax=Proteobacteria RepID=Q46P27_RALEJ 357 7e-97 UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W9... 356 1e-96 UniRef50_UPI0001744DD5 choline sulfatase n=1 Tax=Verrucomicrobiu... 356 1e-96 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 356 1e-96 UniRef50_C3QDX1 Sulfatase n=2 Tax=Bacteroides RepID=C3QDX1_9BACE 356 1e-96 UniRef50_Q0TUK6 Arylsulfatase n=9 Tax=Bacteria RepID=SULF_CLOP1 355 2e-96 UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 355 2e-96 UniRef50_A0JVP0 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0... 355 3e-96 UniRef50_C9L4R5 Mucin-desulfating sulfatase n=1 Tax=Blautia hans... 354 3e-96 UniRef50_Q01PN7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 354 4e-96 UniRef50_C7MHR6 Arylsulfatase A family protein n=3 Tax=Bacteria ... 354 4e-96 UniRef50_A6DMZ1 Sulfatase n=3 Tax=Lentisphaera araneosa HTCC2155... 354 4e-96 UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera... 354 4e-96 UniRef50_A6DPD0 Sulfatase family protein n=1 Tax=Lentisphaera ar... 354 5e-96 UniRef50_Q1GMK9 Choline sulfatase n=8 Tax=Alphaproteobacteria Re... 
354 5e-96 UniRef50_B6HPN7 Pc22g01020 protein n=15 Tax=Eukaryota RepID=B6HP... 354 6e-96 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 353 7e-96 UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 353 9e-96 UniRef50_B1KD82 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 352 2e-95 UniRef50_A3HWG3 Choline sulfatase n=1 Tax=Algoriphagus sp. PR1 R... 352 2e-95 UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 351 3e-95 UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 351 3e-95 UniRef50_C1ZIM5 Arylsulfatase A family protein n=2 Tax=Planctomy... 351 3e-95 UniRef50_A6CG48 Sulfatase family protein n=1 Tax=Planctomyces ma... 351 4e-95 UniRef50_C2KTX6 Arylsulfatase n=2 Tax=Mobiluncus mulieris RepID=... 351 4e-95 UniRef50_A6DSH1 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 351 4e-95 UniRef50_A3HTC7 Putative uncharacterized protein n=1 Tax=Algorip... 351 5e-95 UniRef50_B5CWC2 Putative uncharacterized protein n=1 Tax=Bactero... 350 6e-95 UniRef50_Q5LRB5 Choline sulfatase n=1 Tax=Ruegeria pomeroyi RepI... 350 7e-95 UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 350 7e-95 UniRef50_UPI0001C35789 arylsulfatase n=1 Tax=Clostridium hathewa... 350 8e-95 UniRef50_Q7UWE8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 350 9e-95 UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 350 9e-95 UniRef50_Q5UEY3 Probable sulfatase n=1 Tax=uncultured alpha prot... 349 1e-94 UniRef50_A6DJJ1 Sulfatase family protein n=1 Tax=Lentisphaera ar... 349 1e-94 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 349 1e-94 UniRef50_A6DJ72 Mucin-desulfating sulfatase (N-acetylglucosamine... 349 1e-94 UniRef50_UPI00016C500A sulfatase n=1 Tax=Gemmata obscuriglobus U... 349 2e-94 UniRef50_Q7UZ92 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 349 2e-94 UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 
348 2e-94 UniRef50_C5BYA8 Sulfatase n=2 Tax=Micrococcineae RepID=C5BYA8_BEUC1 348 3e-94 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 347 4e-94 UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepI... 347 4e-94 UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN... 347 4e-94 UniRef50_Q7UFA5 Putative sulfatase yidj n=1 Tax=Rhodopirellula b... 347 4e-94 UniRef50_C5BWB0 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 347 4e-94 UniRef50_A6DKC5 Putative sulfatase yidj n=1 Tax=Lentisphaera ara... 347 5e-94 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 347 7e-94 UniRef50_C5BXT8 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 347 7e-94 UniRef50_A6DIH4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 346 8e-94 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 346 9e-94 UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 346 9e-94 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 346 1e-93 UniRef50_A6DGT7 Sulfatase family protein n=1 Tax=Lentisphaera ar... 346 1e-93 UniRef50_A6DHY1 Mucin-desulfating sulfatase n=1 Tax=Lentisphaera... 346 1e-93 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 346 1e-93 UniRef50_Q5UEW6 Probable phosphonate monoester hydrolase n=1 Tax... 345 2e-93 UniRef50_Q15XH4 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 345 2e-93 UniRef50_O69787 Choline-sulfatase n=53 Tax=Alphaproteobacteria R... 345 2e-93 UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 345 2e-93 UniRef50_A6DFZ4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 345 3e-93 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 345 3e-93 UniRef50_A6DG71 Mucin-desulfating sulfatase (N-acetylglucosamine... 344 4e-93 UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 344 5e-93 UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TK... 
344 6e-93 UniRef50_A0LK86 Sulfatase n=1 Tax=Syntrophobacter fumaroxidans M... 343 7e-93 UniRef50_C6LAI4 Arylsulfatase n=6 Tax=Bacteria RepID=C6LAI4_9FIRM 343 7e-93 UniRef50_C0W1U3 Sulfatase n=1 Tax=Actinomyces coleocanis DSM 154... 343 8e-93 UniRef50_C6IGG0 Iduronate 2-sulfatase n=2 Tax=Bacteroides RepID=... 343 8e-93 UniRef50_B6A548 Choline-sulfatase n=1 Tax=Rhizobium leguminosaru... 343 9e-93 UniRef50_A6DNH1 Choline sulfatase n=2 Tax=Lentisphaera araneosa ... 343 1e-92 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 343 1e-92 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 342 1e-92 UniRef50_D2R201 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 342 1e-92 UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7... 342 1e-92 UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisp... 342 1e-92 UniRef50_A4A047 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A4A... 342 2e-92 UniRef50_A3I0S5 Putative sulfatase yidJ n=1 Tax=Algoriphagus sp.... 342 2e-92 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 342 2e-92 UniRef50_A6DFB2 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Lent... 341 3e-92 UniRef50_A4AMS2 Choline sulfatase n=1 Tax=Flavobacteriales bacte... 341 3e-92 UniRef50_UPI00016BFE17 putative sulfatase n=1 Tax=Epulopiscium s... 341 3e-92 UniRef50_D2R203 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 341 4e-92 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 340 9e-92 UniRef50_A6L183 Iduronate 2-sulfatase n=11 Tax=Bacteroides RepID... 339 1e-91 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 339 1e-91 UniRef50_B9XJI6 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XJI... 339 2e-91 UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7... 339 2e-91 UniRef50_UPI0001746164 choline-sulfatase n=1 Tax=Verrucomicrobiu... 338 2e-91 UniRef50_A6UE90 Sulfatase n=1 Tax=Sinorhizobium medicae WSM419 R... 
338 3e-91 UniRef50_A6E5R0 Putative sulfatase n=1 Tax=Roseovarius sp. TM103... 338 3e-91 UniRef50_D1AX15 Sulfatase n=2 Tax=Fusobacteriaceae RepID=D1AX15_... 338 3e-91 UniRef50_D0Z4S7 Iduronate sulfatase n=1 Tax=Photobacterium damse... 338 3e-91 UniRef50_A6DR18 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC... 337 4e-91 UniRef50_A9MER1 Putative uncharacterized protein n=2 Tax=Enterob... 337 5e-91 UniRef50_UPI0001968556 hypothetical protein BACCELL_00122 n=1 Ta... 337 6e-91 UniRef50_UPI0001BC85B0 choline sulfatase n=1 Tax=Bacteroides sp.... 337 6e-91 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 337 8e-91 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 336 1e-90 UniRef50_A6DG72 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 336 1e-90 UniRef50_Q02B50 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 336 1e-90 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 336 1e-90 UniRef50_Q7UVD9 N-acetylgalactosamine 6-sulfate sulfatase n=1 Ta... 336 1e-90 UniRef50_Q7W424 Putative sulfatase n=2 Tax=Bordetella RepID=Q7W4... 336 2e-90 UniRef50_A6DR15 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC... 335 3e-90 UniRef50_UPI0001968553 hypothetical protein BACCELL_00119 n=1 Ta... 335 3e-90 UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW... 335 3e-90 UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 334 3e-90 UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3... 334 4e-90 UniRef50_Q1ARG1 Sulfatase n=2 Tax=Rubrobacter xylanophilus DSM 9... 334 5e-90 UniRef50_A7V656 Putative uncharacterized protein n=6 Tax=Bactero... 334 5e-90 UniRef50_Q7UVD4 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 334 6e-90 UniRef50_Q7UGD6 Mucin-desulfating sulfatase (N-acetylglucosamine... 333 7e-90 UniRef50_C5BAV0 Sulfatase, putative n=2 Tax=Edwardsiella RepID=C... 333 8e-90 UniRef50_Q7UJQ7 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 
333 8e-90 UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 333 8e-90 UniRef50_UPI0000E0F7B6 iduronate 2-sulfatase precursor n=1 Tax=G... 333 8e-90 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 333 1e-89 UniRef50_D2R575 Sulfatase n=4 Tax=Bacteria RepID=D2R575_9PLAN 333 1e-89 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 332 1e-89 UniRef50_A6C2T4 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 332 1e-89 UniRef50_Q5LH37 Putative sulfatase n=16 Tax=Bacteroides RepID=Q5... 332 2e-89 UniRef50_B5JCS9 Sulfatase, putative n=2 Tax=Bacteria RepID=B5JCS... 331 3e-89 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 331 3e-89 Sequences not found previously or not previously below threshold: UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 382 e-104 UniRef50_C6CXF5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 378 e-103 UniRef50_A6DG38 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisph... 374 e-102 UniRef50_C6DK82 Sulfatase n=3 Tax=Pectobacterium RepID=C6DK82_PECCP 371 e-101 UniRef50_B5JJG3 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 369 e-100 UniRef50_A6CBG2 Mucin-desulfating sulfatase (N-acetylglucosamine... 366 2e-99 UniRef50_A3HTC6 Choline sulfatase n=5 Tax=Bacteria RepID=A3HTC6_... 358 2e-97 UniRef50_A7A9X1 Putative uncharacterized protein n=2 Tax=Parabac... 354 3e-96 UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 351 3e-95 UniRef50_B7ACM6 Putative uncharacterized protein n=1 Tax=Bactero... 351 3e-95 UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto... 351 3e-95 UniRef50_A6C1R0 Choline sulfatase n=1 Tax=Planctomyces maris DSM... 350 7e-95 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 350 8e-95 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 346 1e-93 UniRef50_B4D0V9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 
345 2e-93 UniRef50_A4GIB2 Putative secreted sulfatase n=1 Tax=uncultured m... 345 3e-93 UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 344 5e-93 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 344 6e-93 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 342 2e-92 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 341 3e-92 UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8... 341 4e-92 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 340 7e-92 UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 339 1e-91 UniRef50_C7MF96 Arylsulfatase A family protein n=1 Tax=Brachybac... 338 3e-91 UniRef50_A6C1Q0 N-acetylgalactosamine 6-sulfate sulfatase n=1 Ta... 337 6e-91 UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 337 7e-91 UniRef50_UPI00016C0B39 choline sulfatase n=1 Tax=Epulopiscium sp... 336 1e-90 UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisp... 336 2e-90 UniRef50_UPI0001C35931 N-acetylgalactosamine 6-sulfate sulfatase... 335 3e-90 UniRef50_C6XTA2 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 334 5e-90 UniRef50_A9G4Y6 Sulfatase n=1 Tax=Phaeobacter gallaeciensis BS10... 333 7e-90 UniRef50_B2T943 Sulfatase n=1 Tax=Burkholderia phytofirmans PsJN... 333 8e-90 UniRef50_B5CYA4 Putative uncharacterized protein n=1 Tax=Bactero... 333 9e-90 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 332 2e-89 >UniRef50_P31447 Uncharacterized sulfatase yidJ n=52 Tax=Enterobacteriaceae RepID=YIDJ_ECOLI Length = 497 Score = 743 bits (1918), Expect = 0.0, Method: Composition-based stats. 
 Identities = 497/497 (100%), Positives = 497/497 (100%)

Query: 1   MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60
           MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF
Sbjct: 1   MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60

Query: 61  TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120
           TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA
Sbjct: 61  TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120

Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180
           DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA
Sbjct: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180

Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240
           DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV
Sbjct: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240

Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA 300
           GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA
Sbjct: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA 300

Query: 301 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360
           MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG
Sbjct: 301 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360

Query: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420
           VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF
Sbjct: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420

Query: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD 480
           ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD
Sbjct: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD 480

Query: 481 TGLPTQGVKVEEKKQKF 497
           TGLPTQGVKVEEKKQKF
Sbjct: 481 TGLPTQGVKVEEKKQKF 497

>UniRef50_C9L4Q0 Putative sulfatase YidJ n=2 Tax=Blautia hansenii DSM 20583 RepID=C9L4Q0_RUMHA
          Length = 505

 Score = 504 bits (1297), Expect = e-141, Method:
Composition-based stats. Identities = 211/505 (41%), Positives = 301/505 (59%), Gaps = 20/505 (3%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ +F+MTDTQ T+M+GCY + T N+D LAAEGIR++ AYT PVC PAR+ +F Sbjct: 1 MKKRQVIFIMTDTQRTDMLGCYGNSAMVTPNLDRLAAEGIRYDKAYTTQPVCQPARSAIF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG Y + W+N + N+ ++G+ DAG HT Y+GKWHLDG DYFG G CP WD Sbjct: 61 TGSYPHSCAGWSNCMGLSDNVQSIGQRLSDAGIHTAYVGKWHLDGGDYFGLGRCPKGWDE 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 DYW+D +L ELT +E R + +E ++ +I E T+ HR ++RAVDF+++ Sbjct: 121 DYWYDMKCFLDELTPEE----RYRIRQIESIEKYNITEDMTYGHRCADRAVDFIEKHKDE 176 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA-QAMPSP 239 D + +V+S DEPH P CP +Y++ Y D+ + E +D L +KPEHHR+WA Sbjct: 177 D--YFLVMSLDEPHGPHICPKKYVDLYKDYEIPVKENMKDTLEDKPEHHRIWAGDEYLKA 234 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGA 299 +D + CN F D +IGRV++A + + +IYTSDHG+MM H L KG Sbjct: 235 CREDFKLSPKEFLGCNTFADYEIGRVLDAAAQYE-DEPIIIYTSDHGDMMYGHSLTGKGP 293 Query: 300 AMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL------ 353 A+Y++IT IPL+I+ + PVSHI+L PT+ + + P++ G +I Sbjct: 294 ALYEEITHIPLMIK--GFGKGVDKNPVSHINLAPTIFDMFGVPIPKMFEGRSIFEEVKNP 351 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHN 413 V+ V +EF RYE++HD FGG+ P+R +K+V+NL TSDELYD + DP EM N Sbjct: 352 EVRCNDYVFMEFGRYEVDHDGFGGYQPLRGAFDGRYKMVINLMTSDELYDLQEDPQEMKN 411 Query: 414 LIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR-WMGAF--RPRPQD 470 LI++ + ++R ++H+A+LD M + RDPFR Y W RPW + + W R R + Sbjct: 412 LINEPGYDEIRKRLHEAILDNMYETRDPFRGYYWEDRPWNRITEYKTWDSRLMTRQRENE 471 Query: 471 GYSPVVRDYDTGLPTQGVKVEEKKQ 495 Y P DY TGLP V +K Q Sbjct: 472 EYEPRQLDYGTGLPMTSA-VRKKGQ 495 >UniRef50_B0N997 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0N997_EUBSP Length = 495 Score = 503 bits (1296), Expect = e-141, Method: Composition-based stats. 
Identities = 216/505 (42%), Positives = 299/505 (59%), Gaps = 23/505 (4%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ +F+MTDT +MVGCY + T N+D LA EGIR+ +AYTC PVC PAR+ +F Sbjct: 1 MKKQ-VIFLMTDTTRKDMVGCYGNPKMKTPNLDRLAEEGIRYENAYTCQPVCGPARSAIF 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG + + +G TN++A G N+ T+G+ + G YIGKWHLDG DYFG G CP WD Sbjct: 60 TGTFPHTNGMVTNSIAMGDNVKTIGQRLHNHGISCGYIGKWHLDGSDYFGNGRCPEGWDP 119 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 +YW+D YL ELT++E R+ +D E FT+AHR S+RA+ +L+ Sbjct: 120 EYWYDMKTYLDELTDEEKVRSRDPKECYKDG----FSEEFTYAHRCSDRAIKYLENHQDE 175 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 D F + VSYDEPH P CP + + F +E QDDL+ KP RLW+ Sbjct: 176 D--FFLSVSYDEPHGPSLCPEPFNHMFDGFKFESCPNFQDDLSKKPFMQRLWSGKNLHAT 233 Query: 241 GDDGLYHH---PLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISK 297 D+ L+ CN F D +IGRV++ + + VI+TSDHG+M+GAH+L SK Sbjct: 234 EDEINQPSDGLSLFLGCNSFADYEIGRVLDKIREV-APDALVIFTSDHGDMLGAHRLFSK 292 Query: 298 GAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGENILA-V 355 AA Y ++ IPLII+ GER V D SHID+ PT++ + P++L G+++L + Sbjct: 293 NAAAYKEVANIPLIIK--GGERGYVEDAMASHIDIAPTILDYFGLPIPKLLEGKSMLPQI 350 Query: 356 KEPRGV-----MVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 K P EF RYEI+HD FGG +R ++ +KLV++L +DE YD NDP E Sbjct: 351 KNPEKEINDVVFTEFTRYEIDHDGFGGLQIMRAVMSKRYKLVIHLLDTDEFYDLENDPYE 410 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRW--MGAFRPRP 468 M+NLI+D ++ + R+ +HD L+ +M+ RD +R YQWS+RPWR D P W G R R Sbjct: 411 MNNLIEDKKYIEERNALHDKLIQHMNDTRDLYRGYQWSMRPWRTDFIPDWENEGYTRQRE 470 Query: 469 QDGYSPVVRDYDTGLPTQGVKVEEK 493 + Y P DYDTGLP + V +K Sbjct: 471 NEEYEPRQLDYDTGLPMEEA-VRKK 494 >UniRef50_C4G6V3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G6V3_ABIDE Length = 502 Score = 460 bits (1184), Expect = e-128, Method: Composition-based stats. 
Identities = 210/501 (41%), Positives = 287/501 (57%), Gaps = 22/501 (4%) Query: 1 MKRPNFLFVMTDTQATNMVG-CY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 M + F+ +MTD+Q +M+ C G+ ++T +D L +G+ F SAYT PVC PARAG Sbjct: 1 MAKKQFIVIMTDSQRRDMISRCNERGENMHTPCLDRLCDQGLAFQSAYTTQPVCGPARAG 60 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 LFTG Y + +G N +A + T+G+ AG H YIGKWHLDG DYFG G CP W Sbjct: 61 LFTGTYPHTNGMLGNCMALSQQSLTIGQRLSKAGIHAAYIGKWHLDGGDYFGDGICPEGW 120 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D +YW+D NYL EL E L++ L+ I E FT+ +R + RA+DF+++ Sbjct: 121 DENYWYDMRNYLDELESDEDRARSRTLDTA--LEGEGIGEEFTYGYRCTKRALDFMEKYK 178 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D + +VVSYDEPHHPF P + + + Y + D + PEH ++W + Sbjct: 179 DED--YFLVVSYDEPHHPFLSPKSFYKPFYQPYLQKP-NQHMDFSKLPEHIQVWHEKFSE 235 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKG 298 G G N F+D QIGRV+ A + E+ V+YTSDHG+ G+H + +KG Sbjct: 236 IQGGKGDGFAVGLLGSNSFIDSQIGRVLEA-AEKNAEDALVLYTSDHGDSQGSHGIHAKG 294 Query: 299 AAMYDDITRIPLIIRSPQGERRQVDT--PVSHIDLLPTMMALADIEKPEILPGENIL--- 353 AMY++IT IPLI R + T PVSHID++PT++ + +P+ L GE++L Sbjct: 295 PAMYEEITNIPLIARWKNKIEAGITTQMPVSHIDIVPTILDFYGLPQPKSLEGESLLNSL 354 Query: 354 ------AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 KE R V VEFNRYE++HD +GGF PVRC V +KL +NL T DELY+ D Sbjct: 355 TDKEITGQKEGRPVFVEFNRYEVDHDGWGGFQPVRCVVKGKWKLTINLMTQDELYNLEED 414 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD-ARPRWM--GAF 464 NEMHNLIDD +R+++HD LLD+ ++ RDP R Y W RPWRKD + W G Sbjct: 415 YNEMHNLIDDPNCESIRNQLHDLLLDWQNETRDPLRGYYWEKRPWRKDRQKVSWDCGGYS 474 Query: 465 RPRPQDGYSPVVRDYDTGLPT 485 R R ++ Y TGLP Sbjct: 475 RSRHREDGEVGEYGYSTGLPI 495 >UniRef50_C6D1Q0 Sulfatase n=2 Tax=Paenibacillus sp. JDR-2 RepID=C6D1Q0_PAESJ Length = 480 Score = 415 bits (1066), Expect = e-114, Method: Composition-based stats. 
Identities = 133/473 (28%), Positives = 209/473 (44%), Gaps = 31/473 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L++ TD Q + +G Y + +NT +ID LAAEG+ F A+ SPVCTP+RA Sbjct: 1 MKKPNILWICTDQQRQDTLGAYGNQWVNTPHIDRLAAEGVLFEQAFCQSPVCTPSRASFL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG--HDYFGTGECPPEW 118 TG Y + N + + + + GY GK HL E + Sbjct: 61 TGRYPRTTRCRANGQDIPADEKLISKLLSEEGYICGLAGKLHLSACHPSVNKGTERRIDD 120 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH--------IDETFTWAHRISNRA 170 D +F + +E E + W G + D + +A Sbjct: 121 GFDQFFWSHHPNAEWPTNEYTQWLKGKGKTFSPRPFENSPYVNCGPDAEDHQTTWCAEKA 180 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD--FYYELGEKAQDDLANKPEH 228 V F++ + + P+ +V+ +PHHPF P EYL++Y D L + +L NKP + Sbjct: 181 VQFIETNSDYERPWFFLVNLFDPHHPFDPPKEYLDRYLDRLDEIPLPNYEEGELENKPVY 240 Query: 229 HRLWAQAMPSPVGD---------DGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTW 278 R+ G D Y+A D +DDQ+GR++++L +NT Sbjct: 241 QRIDRDGAYGMRGHLAASDMSERDHRLIRAAYWAMCDLIDDQVGRMLDSLERSGQLDNTI 300 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMA 337 V++ SDHGE++G H + KG YD R+PLI+R P R++ + V DL PT++ Sbjct: 301 VVFMSDHGELLGDHGMYLKGPHFYDCSVRVPLIVRGPGIHGGRRIASLVELADLAPTLLE 360 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP-------VRCWVTDDFK 390 + I + G+++ + +G +R ++ +S+ P TD K Sbjct: 361 ASQIPTYTGMQGKSLWPILLNKGEDAPNHREDVYCESYDANFPHGDLRAWATMVRTDSHK 420 Query: 391 LVLNLFT-SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 LVL S ELYD DP E N +D + V+ ++ L + M DP Sbjct: 421 LVLYHNDNSGELYDLLADPKENRNAWNDHAYTSVKFELMQRLCNRMALTVDPL 473 >UniRef50_C5EHR5 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EHR5_9FIRM Length = 490 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. 
Identities = 127/483 (26%), Positives = 213/483 (44%), Gaps = 45/483 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + M D + +GCY ++T +IDSLA G F++ + +PVC+P+R + T Sbjct: 4 RRPNIILFMCDQLRFDALGCYGNNQIHTPHIDSLALNGSTFDNHFVQNPVCSPSRCTVLT 63 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y G N + + T+ +D GY T IGK H+ E + + Sbjct: 64 GRYPKNHGTRDNGIPLRDSEITLAETLRDNGYRTAAIGKMHITTQFVPKEDEQEDWPEDN 123 Query: 122 YWFDGANYLSELTEKEISLW----------------------------RNGLNSVEDLQA 153 Y FD + + E W + Sbjct: 124 YGFDIIHTTCDCKTGEYLDWLKAASPEDYEEVKMQGERKAKEDRASAADKDTGGPPQVYP 183 Query: 154 NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 + I+ ++ +H I++R +D +++ D+PF S+ +PHHPF P Y + Y E Sbjct: 184 SGINPSYHQSHWIADRMIDLIEESG-PDQPFFAYCSFVDPHHPFDPPKPYGDMYDPDALE 242 Query: 214 LGEKAQDDLANKPEHHR--LWAQAMPSPVGD-------DGLYHHPLYFACNDFVDDQIGR 264 + + + +L +KP H R L A+ + D Y+ +DD IGR Sbjct: 243 VPVRMEGELLDKPPHFRKALTARGFSNEKYDYRKLTDHQWGQVKAAYYGMITLIDDNIGR 302 Query: 265 VINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQ 321 ++NAL E +T +++T+DHGE++G H L+ KG YD I + P+II+ P + + Sbjct: 303 ILNALRENGLEKDTLILFTNDHGELLGDHGLLFKGPFHYDCIIKAPMIIKWPGVVPQGSR 362 Query: 322 VDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEP-RGVMVEFNRYEIEHDSFGGFIP 380 H+D++PT++ A + P + G ++ + +G E+ E +G + Sbjct: 363 YSQVTEHVDIMPTLLEYAGVRPPYGVQGCSMAPILRGDKGAGKEYAMTEFNCYDWG--LS 420 Query: 381 VRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 V+ ++KL ELYDR DP E NL DD + V++ M L+D + + Sbjct: 421 VKTLTGRNYKLTYYAGEEYGELYDRNLDPEEFKNLWDDEAYGAVKAYMMKKLMDRIIETE 480 Query: 440 DPF 442 DP Sbjct: 481 DPL 483 >UniRef50_A6DLX7 Putative sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLX7_9BACT Length = 502 Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats. 
Identities = 133/509 (26%), Positives = 222/509 (43%), Gaps = 38/509 (7%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN L++M+D N G + T N+D LA EG+ F + +P+C+P+R Sbjct: 1 MSKPNVLWLMSDQHNANCTGYAGNPNVKTPNLDDLANEGVEFEQGFCNNPICSPSRLSFI 60 Query: 61 TGIYANQSGPWT--NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------DGHDYFGTG 112 TG+Y N G NN N +T+ F+ GY T +GK H+ +G +Y Sbjct: 61 TGLYTNNHGYLGNRNNDVTTPNPNTLSSLFRRFGYQTGLVGKSHMITGWDKEGFEYIRYT 120 Query: 113 ECPPEWDAD----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 + D D ++FD E + G +++ Q + + H N Sbjct: 121 DMCDADDNDPHTCHYFDYLAQRGLADHYEEGSPKEGQQTLDGSQPASLPYKHSIEHYTGN 180 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN---- 224 ++++FL+ D PF + +S+ PH P T E + Y L E D N Sbjct: 181 KSLEFLENR-DQDRPFFLKMSFQRPHDPITPAPEDFDMYNPEDIVLPESISDLFENKFVG 239 Query: 225 KPEHHRLW---AQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQ-RENTW 278 KP+ + + P V D+ L Y+A +D++IGRVI+ L +NT Sbjct: 240 KPQFMQDYVANPGDYPMCVADEAKLKRALASYYALITKIDEEIGRVIDHLKETGEYDNTI 299 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMA 337 + YT+DHG+ G H L K +Y+ I RIP +++ P G + V +D T+ Sbjct: 300 IFYTADHGDFAGEHGLFLKNLGIYESIHRIPFLLKWPGGPTGVKNKELVESVDWYATLCD 359 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 L +I+ P+ + G +++ V + + EH T ++LV T Sbjct: 360 LCNIQAPDNVDGRSLVPVAKGEAKG--SDAIICEHH------TSTAIRTKQYRLVYYRET 411 Query: 398 SD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD-YMDKIRDPFRSYQWSLRPWRKD 455 + ELYDR NDP+E++NL + +R + +L+ +M R + + RK Sbjct: 412 GEGELYDRGNDPDELNNLWSHADYQSIRMDLMQQVLNYHMSYQRKTYNELDQVINKKRKH 471 Query: 456 ARPRWMGAFRPRPQDGYSPVVRDYDTGLP 484 + A + + YS +++ Y+T P Sbjct: 472 S----FSALLQKEKAYYSDLIKVYETKKP 496 >UniRef50_C6J3H9 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3H9_9BACL Length = 503 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. 
Identities = 132/491 (26%), Positives = 218/491 (44%), Gaps = 55/491 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNFL + D + + C+ + T NID LA EG+ F AY +PVC P+RA L TG Sbjct: 2 KPNFLVFVVDQMQSRTLSCHGHPDVKTPNIDRLAREGVSFTRAYCNNPVCMPSRASLLTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----------------DGH 106 + A Q G TN +A ++ T+ + GY T +GK H +G Sbjct: 62 LTARQHGVLTNGIALSEHFPTLPGVLSEHGYRTHAVGKLHHQPIGSVSREEQMEFSWEGM 121 Query: 107 DYFGTGECPPEWDADYWFDGANY-------LSELTEKEISLWRNGLNSVEDLQANHID-- 157 ++ +GE Y + +Y + ++ G + A + D Sbjct: 122 KFWESGEIRSIPSGYYGYQSVDYVGGHVTCFGDYLRWLEQVYPGGGKKLSKEGAYYADDK 181 Query: 158 ----------ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 E + + H I+ R++DFL+Q ++ D+PF + S+ +PHHPF Y E Y Sbjct: 182 IPMSWRIDLPEEYHYNHWIAERSIDFLEQMSQQDQPFFLWCSFPDPHHPFAACRPYSEMY 241 Query: 208 ADFYYELGEK--AQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRV 265 L E ++D + + R S D + VD IG + Sbjct: 242 DPASLTLPEHWDVEEDGISWLKERRNIHPDYTSFDEHDLREILAQTYGMISHVDKTIGEI 301 Query: 266 INALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-- 322 L + + NT +++ +DHGE +G+H LI+KG ++++ R+P I + P+ ++ Sbjct: 302 TKKLKELELDQNTVIVFLADHGEYLGSHHLITKGEWPWEELIRVPFIWKIPESMKKGYLN 361 Query: 323 DTPVSHIDLLPTMMALADIEK------------PEILPGENILAVKEPRGVMVEFNRY-E 369 + VS +D +PT++ LA IE P LPG ++ + E V+ E Sbjct: 362 EQVVSLLDFVPTILDLAGIEPAVMDVRGVQYTEPLGLPGRSLRPIIEQGDVLPPGPAIVE 421 Query: 370 IEHDSFGGFI-PVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKM 427 + D F + +R VT+ +K+ + L T D LYD + DP E NL D FA V+ + Sbjct: 422 YDEDWFPPNVCRMRTIVTERYKMTVYLNTEDGLLYDLQEDPYEQKNLWFDPSFARVKHIL 481 Query: 428 HDALLDYMDKI 438 + +L + + Sbjct: 482 TEQMLRELVRT 492 >UniRef50_A4AWR8 Iduronate-2-sulfatase n=5 Tax=Bacteria RepID=A4AWR8_9FLAO Length = 498 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. 
Identities = 106/468 (22%), Positives = 201/468 (42%), Gaps = 26/468 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++ D T V Y +NT +ID LA+EG+ F Y+ PVC P+RA + Sbjct: 39 KKPNVLFIIADDLTTTAVSSYGNSEVNTPHIDKLASEGVLFTRTYSQYPVCGPSRASFMS 98 Query: 62 GIYANQS---GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG-----HDYFGTGE 113 G Y + + G + G T + FKD GY+T + K G + Sbjct: 99 GYYPSATTTYGYVSGRKNIGSERKTWSQVFKDNGYYTARVSKIFHMGVPIDIEKGSNGQD 158 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQ-----ANHIDETFTWAHRISN 168 W + G + + + + +G ++ D+ + + Sbjct: 159 DEQSWTERFNSQGPEWKAPGAGELVQGNPDGTLPIKGGNVMTIVKADGDDLVHSDGKTAE 218 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +A + +++ D+PF + + + PH PF P Y E Y +L +K ++D + P+ Sbjct: 219 KASELIRK--HKDKPFFLAIGFVRPHVPFVAPKSYFEPYPHNQTKLPKKVENDWDDIPKR 276 Query: 229 HRLWAQAMPSPV-GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 + ++ + + Y+A ++D Q+G+V+ L E +NT V++TSDHG Sbjct: 277 GINYVTSVNGKMNTEQEKKAIAAYYASVSYMDAQVGKVLKTLKEEGLEDNTIVVFTSDHG 336 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEI 346 +G H+ K +++++ ++PLII+ P + + +DL PT+ ALA ++ + Sbjct: 337 FHLGEHEFWMK-VSLHEESVKVPLIIKVPGKKPAVCHSFTELLDLYPTITALAGLKYSDQ 395 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT---SDELYD 403 L GE+++ + + V + + +D+ + EL+D Sbjct: 396 LQGESLVNILDEPTYEVRDMAFSVSQGGKSFL-----LRNEDWAYIQYDEDAASGIELFD 450 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 + DP + NL +A + + L + +R + +SL+ Sbjct: 451 MKKDPKQFTNLAQLPEYASIVDSFKEKLKTKLKAVRSNDLNIDYSLKK 498 >UniRef50_C6J2Z0 Sulfatase n=4 Tax=Firmicutes RepID=C6J2Z0_9BACL Length = 502 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 134/482 (27%), Positives = 208/482 (43%), Gaps = 57/482 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + +D Q N +G + L+T N+D L EG F AY +P CTP RA + Sbjct: 1 MKKPNILLITSDQQHWNTIGAF-NPELSTPNLDRLVQEGTTFTRAYCPNPTCTPTRASII 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGTGECPP 116 TG+Y +Q G WT ++ +G FK+AGY T +GK H +Y P Sbjct: 60 TGLYPSQHGAWTLGTKLLEDRPVVGTNFKEAGYRTALVGKAHFQPLMGNEEYPSLESYPL 119 Query: 117 EWDADYW---------FD--------------GANYLSELTEKEISLWRN------GLNS 147 D DYW FD G +Y + EK + WR+ G S Sbjct: 120 LQDLDYWRQFSDSFYGFDHVELARNHTNEAHVGQHYAIWMEEKGCTNWRDYFLPPTGTMS 179 Query: 148 VEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 + L + E + + I+ R L+Q +E F + S+ +PH P+ + Y Sbjct: 180 PKQLHRWDLPEEYHYNTWIAERTNALLEQYKNNNESFFLWASFFDPHPPYLVSEPWDTMY 239 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMP--SPVGDDGLYHH----------------P 249 + E + + N P H + Q P S + G H Sbjct: 240 DPESLTIPEVSPGEHDNNPPHFGMTQQKSPDFSAWKETGQAIHGYHSHLMPESERKQLVA 299 Query: 250 LYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRI 308 Y+ +D IGR+++ L E+T V++T+DHG G H L +KG MY+D+ ++ Sbjct: 300 TYYGMISMMDKYIGRILDRLDELGLAEDTIVVFTTDHGHFFGQHGLQAKGGFMYEDLIKL 359 Query: 309 PLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFN 366 P I+R P Q D VS +DL PT ++ A + P + G + AV + Sbjct: 360 PFIVRYPGKVPANVQSDALVSLVDLAPTFLSFAGLPIPVWMTGVDQSAVWTGSKSSAR-D 418 Query: 367 RYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRS 425 E I R +V +KL + EL+D + DP E++N +D +A+++S Sbjct: 419 HIICEFRHEPTTIHQRTYVDQRYKLTVYYNQPYGELFDLQEDPGELNNRWNDPSYANLKS 478 Query: 426 KM 427 ++ Sbjct: 479 EL 480 >UniRef50_B9XEU8 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XEU8_9BACT Length = 456 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 115/463 (24%), Positives = 191/463 (41%), Gaps = 46/463 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN +F++ D + GC + T NID +A EG F + + P+C+P+R T Sbjct: 16 SQPNIVFILVDDIRWDAFGCMGHPFVKTPNIDRIAKEGALFKNFFVTLPLCSPSRGSFLT 75 Query: 62 GIYANQSGPWTN--NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G YA+ +G N + + T R DAGY T ++GKWH+ D P Sbjct: 76 GQYAHVNGVTNNGEHSTLSHQLVTFPRLLHDAGYETSFVGKWHMGTDDT-------PRPG 128 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D+W L+ K ++ N ++ +++RAV+F++Q + Sbjct: 129 FDHW---------LSFKGQGVYEN---PNLNIDGKVSRVEGYITDILNSRAVEFVKQEHK 176 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP-----EHHRLWAQ 234 +PF + V + H PFT + E Y DDLA KP E Sbjct: 177 --KPFCLYVGHKAVHGPFTPAERHKELYTKEQIPHPPSIDDDLAGKPVLTRKEQQGPKDG 234 Query: 235 AMPSPVGDDGLYHHPL----------YFACNDFVDDQIGRVINALTPE-QRENTWVIYTS 283 P VG D P+ +D+ +G+++ AL Q ENT +I+TS Sbjct: 235 QKPQKVGFDDEAERPMGKVPERLVRQQLRTLMAIDEGVGQLLRALEESRQLENTVIIFTS 294 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADI 341 D+G G H L Y++ R PL+IR P+ + D V +ID+ PT++ LA Sbjct: 295 DNGYFWGEHHL-GDKRWAYEESIRDPLLIRYPKLIKPGTVRDQMVLNIDIAPTLLELAHA 353 Query: 342 EKPEILPGENILAVKEPRGVMVEFN--RYEIEHDSFGGFIPVRCWVTDDFKLVLN--LFT 397 + G +++ + V + + ++ + T+ +K + L Sbjct: 354 PVSRSMQGRSLVPLFNKDSVEWRKSALFEYFQEKAYPRTPTWQAIRTEQWKYIHYTELEG 413 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 DELY+ + D EM NLI + ++ L + + + Sbjct: 414 MDELYNLKADSYEMKNLIKEQSARSSLQELKSELGKLLKQTSN 456 >UniRef50_B9XGI2 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XGI2_9BACT Length = 515 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. 
Identities = 115/464 (24%), Positives = 193/464 (41%), Gaps = 51/464 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN LF+M D A + +G Y K T NIDS+A G+RF++ + + +CTP+RA + TG Sbjct: 61 RPNILFIMADDHAAHAIGAYGSKINQTPNIDSIAKAGMRFDNCFVVNSICTPSRAAILTG 120 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y++ +G N N T+ + + AGY+T +GKWHL+ DY Sbjct: 121 KYSHINGVTVFN-RFDGNQPTVAKMLQAAGYYTGMVGKWHLES----------DPTGFDY 169 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 W I + N I++ +++FL+ + D+ Sbjct: 170 WNVLPGQGKYHDPDFIEM------------GNRKKIEGYATEIITDLSINFLKNRPQ-DK 216 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA-------------NKPEHH 229 PF ++ + PH P+ ++ + Y D E DD ++ Sbjct: 217 PFFLMCHHKAPHRPWEPDEKHAKMYEDVTIPEPETFNDDYKTRSSAATEATMRIDRDLTP 276 Query: 230 RLWAQAMPSPVGDDGL------YHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYT 282 + QA P + + L + Y C VDD +GR++ L ENT VIYT Sbjct: 277 KDLKQAPPPGLAGEALKKWKYQRYIKDYLRCIASVDDNVGRLLKFLDDSGLAENTIVIYT 336 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALAD 340 SD G +G H K MY++ R+P I+R P + + ++D PT + A Sbjct: 337 SDQGFFLGDHNWFDK-RFMYEESLRMPFIVRYPNHIKPATVNKDMILNVDFAPTFLQCAG 395 Query: 341 IEKPEILPGENILAVKEPRGVMVEF---NRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLF 396 +E P+ + G +IL + + + + + P ++ +KL+ N Sbjct: 396 LEVPKEIQGRSILPLLQGKAPKDWRTSMYYRYYHYPADHRVQPHYGVRSERYKLIYFNKI 455 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 E YD + DP+E+ N+ D +A + + ++ D Sbjct: 456 NEWEFYDLKRDPHELKNVYADPAYAKEVQRAKAEMERLRKELND 499 >UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36AAF Length = 468 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. 
Identities = 130/489 (26%), Positives = 201/489 (41%), Gaps = 57/489 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++TD Q +GCY + T N+D LA +G+RF++ + SPVC+PARA L T Sbjct: 4 KKPNVLFILTDDQGIWSMGCYGNSEIQTPNLDKLAKQGVRFDNFFCTSPVCSPARASLLT 63 Query: 62 GIYANQSGPWTNNVAPGKNISTMG-----------RYFKDAGYHTCYIGKWHLDGHDYFG 110 G +Q G S + GY GKWHL + Sbjct: 64 GKIPSQHGILDYLSGGNGGASQAAIEFLKDHRGYTDILAEEGYTCGLSGKWHLGDGGH-- 121 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 P+ +W+ A+ ++RNG I+E I++ A Sbjct: 122 -----PQKGFSFWY--AHQKGGGPYYNAPMFRNGQK---------IEEKGYITDVITDEA 165 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFT--CPVEYLEKYADFYYELGEKAQDDLANKPEH 228 + F+ + ++PF + V Y PH P+ P +Y + Y D +E + + K E Sbjct: 166 ISFIDREKNKEQPFYLSVHYTAPHSPWINCHPKKYTDLYEDCPFETCPQGEVHPWAKTEV 225 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE 287 + + S +G YFA +DD +GR++ L E E+T +I++SD+G Sbjct: 226 IAGYQKPRESLIG---------YFAAVTAMDDNVGRILKKLEEENLMEDTLIIFSSDNGF 276 Query: 288 MMGAHKLISKGA-----AMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALAD 340 G H + KG MYD ++PLI+ D S D +PT + Sbjct: 277 NCGHHGIWGKGNGTFPLNMYDSSVKVPLIMCHKGHIPENHVCDEMHSGYDFMPTPLDYLG 336 Query: 341 IEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFT 397 + E LPG++ L+ + E N + F + PVR + +KLV F Sbjct: 337 FKNDEADKLPGKSFLSALMGQEQKGEENSVVV----FDEYGPVRMIRSRKYKLVHRYPFG 392 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDAR 457 DE YD DP E +N I+D + DV M + + + DP + P + Sbjct: 393 PDEFYDLEVDPGEAYNGIEDESYQDVIRDMKKQMELWFLQYVDP--RIDGAKEPVMGGGQ 450 Query: 458 PRWMGAFRP 466 G P Sbjct: 451 KDLAGVLGP 459 >UniRef50_C6J5I7 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I7_9BACL Length = 522 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. 
Identities = 132/487 (27%), Positives = 205/487 (42%), Gaps = 68/487 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ TD Q + +GCY + T N+D LAA G F +A+ P+C P+RA L T Sbjct: 15 ERPNILFIHTDQQRADSLGCYGNTVIRTPNLDQLAASGTLFENAHCTHPLCMPSRATLLT 74 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD----------YFGT 111 G Y + + N + + T+ +GY T IGK H + Sbjct: 75 GRYMHAHRLYRNGIPLSQQEQTIAHLLSKSGYATGLIGKAHFTPYKGDPKVNPESVQINN 134 Query: 112 GECPPE-WDADYWFDGANYLSELTE------------KEISLWRNGLNS------VEDLQ 152 G P E W F+G Y + + LW + + +D+ Sbjct: 135 GVAPEECWAYWRQFEGPYYGFDHVQMSMGHGDYGMKGGHYGLWVHEQHPDKVPLFDQDIH 194 Query: 153 ANHIDETF-----------TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPV 201 D + + +N+A++F++Q D PF + Y EPH PF P Sbjct: 195 GEPSDGVYRSWKSAVPLEIHSSTWTTNKAIEFIKQ--NKDRPFYAWIGYQEPHEPFNPPR 252 Query: 202 EYLEKYADFYYELGEKAQDDL-ANKPEHHRLW--AQAMPSPVGDDGLYHHPLYFACNDFV 258 Y + Y L + + PEH + + + Y+ C + Sbjct: 253 PYCDMYDPQEILLPVGRDGEWGSESPEHVQYYLNRGKWKDIREEKVREIIAHYYGCVSMI 312 Query: 259 DDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ- 316 DD IGR++ L E +NT +I+TSDHGE +G H L KGA +TRIPL+I+ P Sbjct: 313 DDCIGRLMKTLEEEGLADNTIIIFTSDHGEWLGDHGLWLKGAVHARGLTRIPLMIKWPGT 372 Query: 317 -GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVE-FNRYEIEHDS 374 R+V S ID++PT++ A E P + G ++ +V V + E H+ Sbjct: 373 AVSGRRVSNVASLIDVMPTLLDAAGAEIPYGVQGTSLRSVLAGEQDKVRDYALIEHRHEP 432 Query: 375 FGGFI----------------PVRCWVTDDFKL-VLNLFTSDELYDRRNDPNEMHNLIDD 417 + I ++ VTD ++L + ELYD + DP+E+ NL D Sbjct: 433 YHLNIQLEKEELVINKGTEEWHMKTIVTDRYRLSYIPSAQYGELYDHQTDPDELINLWD- 491 Query: 418 IRFADVR 424 +F ++R Sbjct: 492 -KFPELR 497 >UniRef50_B7AMH4 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AMH4_9BACE Length = 520 Score = 386 bits (992), Expect = e-106, Method: Composition-based stats. 
Identities = 110/490 (22%), Positives = 208/490 (42%), Gaps = 57/490 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +K N +F+++D + + L T +D +A EG +A+ + + +P+RA + Sbjct: 39 VKPRNVVFILSDDHRYDYMVFLGTIPWLETPCMDRMAREGAYIQNAFVTTSLSSPSRASI 98 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG+Y++ NN ++ Y + AGY T + GKWH+ TGE P + Sbjct: 99 LTGLYSHTHKVVDNNAPLPDGLTFFPEYLQAAGYETAFFGKWHMG----NDTGEPQPGFT 154 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 G W+ ++ +++ A+DF+++ + Sbjct: 155 HWEGIRGQGVYWNPEININGKWK------------EFKDSTYLGDLLTDHAIDFIREQKK 202 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN--------------- 224 AD+PF + +S+ H PF P Y YA+ L ++ Sbjct: 203 ADKPFFVYLSHKGVHDPFQAPKRYEGCYANKKVPLPTSFENPHYGITPTPNKSVQTGKPL 262 Query: 225 ----------KPEHHRLWAQAMPSPV-----GDDGLYHHPLYFACNDFVDDQIGRVINAL 269 KP+ ++ ++ + Y VD+ IGRVI++L Sbjct: 263 SGVDYYGEQMKPDWVKMQRESWHGVDFCYNGRRNWEEEVRKYCETLRAVDESIGRVIDSL 322 Query: 270 TPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPV 326 + NT VIY D+G G H LI K Y+ R+P++IR+P + + + V Sbjct: 323 QEMGLDENTVVIYMGDNGFCWGEHGLIDK-RQFYEASVRVPMLIRAPGLFPAGQVLKSMV 381 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVE---FNRYEIEHDSFGGFIPVRC 383 ++D+ PT+++ A ++KP + GE+ + + + + + F Y EH+ + + Sbjct: 382 QNVDIAPTILSCAGLDKPAQMVGESYIPLLQGKEIPWRNRIFYEYYWEHE-YPQTPTMHG 440 Query: 384 WVTDDFKLVLNL--FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 TD++K + + ++E YD DP+E+ N I D + D+ ++ L D+++ Sbjct: 441 VRTDNYKYIRYHGIWDTNEFYDLNEDPSELQNRIADPEYQDIIKQLDADLYDWLETTNGM 500 Query: 442 FRSYQWSLRP 451 F + ++RP Sbjct: 501 FIPLKRTVRP 510 >UniRef50_C0QY53 Sulfatase n=2 Tax=Brachyspira RepID=C0QY53_BRAHW Length = 474 Score = 386 bits (992), Expect = e-105, Method: Composition-based stats. 
Identities = 120/473 (25%), Positives = 202/473 (42%), Gaps = 27/473 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + D + + Y + T ++ LA G F +++ SPVCTP+RA +FT Sbjct: 4 KKPNIILITADQMRADSIE-YINDEVKTPVLNELAENGSVFTNSFCTSPVCTPSRASIFT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y G W ++ T+ Y K+ Y GK H E D Sbjct: 63 GRYPMNIGAWNIGTELNEDEVTLADYLKEDNYFNVASGKMHFRPQLKNLNWEFEDVPKRD 122 Query: 122 ---------YWFDGANYLSELTEKEISLWRNGLNSVEDLQA-----NHIDETFTWAHRIS 167 Y FD + + + E + N ++ N I E + + Sbjct: 123 RVRERDKTYYGFDITHITEDDKQGEYLDFANSHGCNLEIGKGIDGINPIPEELHQTYWTA 182 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +A+D + D+P M VS+ +PHHPF +Y + Y D + +PE Sbjct: 183 QKAIDEIDNF-NFDKPLFMWVSFVDPHHPFDPIKKYYDIYKDIKPKELNSKLKLDKKRPE 241 Query: 228 HHRLWAQAMPSPVG--------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 H P G ++ LY+ F+D QIGR+I+ L + +NT Sbjct: 242 HLTKQGDRGYWPGGGEEHHYSQEEIKEIKKLYYGMISFIDSQIGRIIDKLKEKNEFDNTI 301 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMAL 338 +I+TSDHGE +G + L+ KG MYD + ++PL+ + + D + +ID+LPT++ + Sbjct: 302 IIFTSDHGEYLGDYGLLKKGPFMYDCLIKVPLLFYGKGIVKNRSDEIIENIDILPTILDM 361 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNRYE-IEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 E P + G +I + + + I +D+ I ++ + T +KL + L Sbjct: 362 LGKEIPYGIQGHSIKNILIGEDKNKTYKKGAVITYDAHDRGIFIKTYRTKQYKLSIFLDE 421 Query: 398 S-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 E YD DPNE +NL + + ++++K+ + M + DP S Sbjct: 422 EYGEFYDLEKDPNEENNLFFNKEYDEIKNKLLLEMCHKMIECSDPLNRRYASW 474 >UniRef50_UPI0001C36159 sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36159 Length = 497 Score = 386 bits (991), Expect = e-105, Method: Composition-based stats. 
Identities = 124/477 (25%), Positives = 199/477 (41%), Gaps = 47/477 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N ++ TD Q + + ++T NID L EG+ F AYT +P+CTP+RA T Sbjct: 8 KKKNIVWFCTDQQRWDTIHSLGNPYIHTPNIDRLVKEGVAFTRAYTQAPICTPSRACFLT 67 Query: 62 GIYANQSG-PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------------DGHDY 108 G Y + + N K+ + + D GY +GK HL DG+ Y Sbjct: 68 GRYPRTTKTIFNGNEKFSKDEKLVTKLLSDEGYTCGLVGKLHLTSAEGRVEKRCDDGYSY 127 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISL-------------WRNGLNSVEDLQANH 155 F P D+ G +Y + L EK + W N + Sbjct: 128 FQYSHHPHN---DWKDGGNDYQNWLNEKGVHWEEIYGGKFMTMATWPPQANPSFSGKQVG 184 Query: 156 IDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 + + + +DF++ + EP+L+ V+ +PH P P EY ++ L Sbjct: 185 VPAQYHQTTWCVEKTIDFIETRRNSGEPWLISVNPFDPHPPLDPPQEYKDRLNVEEMPLP 244 Query: 216 EKAQDDLANKPEHHRL---------WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 ++ KP H + A+ + S ++ Y+A + +DDQ GR++ Sbjct: 245 LWEDGEMEGKPPHQQKDVIQGGQDGQAEPIGSLTEEEKRERFRDYYAEIELIDDQFGRLL 304 Query: 267 NALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VD 323 + L RE+T +I+ SDHGEM G H L KGA Y+ + +PLII P ++ D Sbjct: 305 SYLDQTGLREDTIIIFMSDHGEMSGDHGLYWKGAYFYEGLVHVPLIISCPSIFKQGFLCD 364 Query: 324 TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFN---RYEIEHDSFGGF-- 378 V +D+ PT+M A +E P + G + + F E H G Sbjct: 365 ALVELVDIAPTLMEAAGLEVPYFMQGRSFYDILTGEADPHHFKDAVYSEFYHCLRGTHED 424 Query: 379 IPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 I + +KL++ ELYD D NE HNL D + +++++ D+ Sbjct: 425 IDATMYYNGRYKLIVYHGKEFGELYDHETDQNEFHNLWDKPEYEALKTELIRKSFDH 481 >UniRef50_D2MLH4 Sulfatase family protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLH4_9BACT Length = 476 Score = 384 bits (988), Expect = e-105, Method: Composition-based stats. 
Identities = 132/471 (28%), Positives = 210/471 (44%), Gaps = 37/471 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L++ TD Q + + + + T N+D L AEG+ F A+ S +CTP+R+ T Sbjct: 4 KRPNILWICTDQQRYDTIHALGNEHIQTPNLDRLCAEGVAFTHAHCQSAICTPSRSSFLT 63 Query: 62 GIYANQ-SGPWTNNVAPGKN--ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y + G N N + + + DAGY GK HL + G + + Sbjct: 64 GLYPSTVHGNRNGNAYFPANERVQLITKRLADAGYDCGLSGKLHL-ASAWNGEEQRVDDG 122 Query: 119 DADYWF---------DGANYLSELTEKEISLWRNGLNSVEDLQANHIDE---TFTWAHRI 166 +W+ +G Y LTE+ + L + N+ + + Sbjct: 123 YRKFWYSHSHNQGIGNGNQYTDSLTEQGMDLGDVFQTKKDGTYGNYRPDMNPQYHQTTWC 182 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 ++RA++F++ P D P+LM V+ +PH PF P + KY + D + Sbjct: 183 ADRAIEFIESPH--DSPYLMSVNPFDPHGPFDAPDTH--KYNPADLPPPIFRESDQQTQT 238 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDH 285 R +A +P GD ++ Y+ +D+ +GR++NAL QRENT VI+TSDH Sbjct: 239 RLKRFFADKEGNPPGDREQHNKASYYGMIALIDENVGRMLNALERTGQRENTIVIFTSDH 298 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEK 343 GEM+G H L KG Y+ + R+PLII P + + D + +D+ PT+ LA I Sbjct: 299 GEMLGDHGLTGKGCRFYEALVRVPLIISWPGTFLQGHRADGLTALLDIAPTLADLAGIPL 358 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEI--EHDSFGGFIP----------VRCWVTDDFKL 391 E G++++ + + + +D F P D +KL Sbjct: 359 -EWTHGKSLIPILTGEHPGHAHHDFVRCEYYDVVDKFAPHASEKHKPCWATMLRNDRYKL 417 Query: 392 VLNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 V+ ELYD DP+E HNL D AD++ ++ D+ DP Sbjct: 418 VVYHDEDYGELYDLWEDPDEFHNLWKDPSRADLKYQLTKQNFDHTVICADP 468 >UniRef50_D2QWC7 Sulfatase n=5 Tax=Bacteria RepID=D2QWC7_9PLAN Length = 490 Score = 384 bits (988), Expect = e-105, Method: Composition-based stats. 
Identities = 113/462 (24%), Positives = 191/462 (41%), Gaps = 27/462 (5%) Query: 2 KRP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+P N L + +D N +GCY + + ID LAA G RF+ AY P+C P+R+ Sbjct: 32 KKPYNVLLIASDDL-NNSLGCYGHATVKSPRIDELAARGTRFDRAYCQFPLCNPSRSSFL 90 Query: 61 TGIYANQSGPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPP 116 TG+ +Q+ N +I T+ + F +AGY+ +GK +H GT Sbjct: 91 TGLRPDQTTVHDNARKFRSERPDIVTLPQMFMNAGYYVARVGKLYHYGVPLQIGTSGLDD 150 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 E + + K SL A + + A+ L+ Sbjct: 151 EPSWQQVVNPRGRDRDDEPKIFSLVPGQFGGTPSWLAAEGTDDEQTDAIGAAEAIKLLE- 209 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 A ++PF + V + PH P+ P Y EKY + + D + PE A Sbjct: 210 -ANKEKPFFLAVGFYRPHTPYVAPKSYFEKYPADKIPIVTTPEGDRRDIPEPAVSQHSAR 268 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLI 295 + YFA F+D Q+G++++AL + R+NT V++ SDHG +G H + Sbjct: 269 HNMNEKLQREATQAYFASITFMDQQVGKLLDALDRLKLRDNTIVVFLSDHGYHLGEHGGL 328 Query: 296 SKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 + +++++ R+PLII P ID+ PT+ L ++ P LPG+++ Sbjct: 329 WQKQSLFEESARVPLIISVPGQKHAGEGTAAVAELIDIYPTLADLCGLKAPANLPGQSLR 388 Query: 354 A-VKEPRGVMVEFNRYEIEHDSFG------------GFIPVRCWVTDDFKLVLNLFTSD- 399 +++P+ F ++ G TD ++L + Sbjct: 389 PQIEDPQAPGKGFAITQVRRGGNPGGAKAGKKNPPAGGFAGYSLRTDKYRLTIWGEEGAK 448 Query: 400 --ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELYD + DP E NL D A+ +++ L ++ + Sbjct: 449 GLELYDHQTDPQEYTNLASDPSKAETITELKALLAKHLSAAK 490 >UniRef50_Q7UH28 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Rhodopirellula baltica RepID=Q7UH28_RHOBA Length = 534 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. 
Identities = 120/449 (26%), Positives = 202/449 (44%), Gaps = 30/449 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++ N +F++TD + +GC L T N+DS+AA G +A+ + +C+P+RA + Sbjct: 54 VEPRNVVFILTDDHRFDAMGCAGHPFLETPNLDSIAANGTHIKNAFVTTSLCSPSRASIL 113 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG+Y ++ NN +Y + AGY T ++GKWH+ GH P Sbjct: 114 TGLYTHKHRVIDNNRLVPDGTLFFPQYLQRAGYDTAFVGKWHMGGH------HDDPRPGF 167 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W + L G ++ + + +++ AVD+L++ Sbjct: 168 DHWVSFRGQGNYLPP--------GPKYTLNVNGERVKQKGYITDELTDYAVDWLKER-DD 218 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK--PEHHRL----WAQ 234 DEPF + +S+ H FT + +YAD ++ A+K P R W Sbjct: 219 DEPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTPRWVRDQKNSWHG 278 Query: 235 AMPSPVGDDGL-YHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 S D GL Y + Y VDD +GRV+ L ++T +IY D+G M G H Sbjct: 279 IDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDNGFMWGEH 338 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 LI K + Y+ R+P++++ P + ++ V +ID+ PT++ A ++ PE + G+ Sbjct: 339 GLIDKRVS-YEASIRVPMLMQCPNLFDGGQPIENVVGNIDVGPTILHAAGLQTPEYMDGQ 397 Query: 351 NILAVKEPRGVMVE--FNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL--FTSDELYDRRN 406 + L + R F +F D FK + + +DELYD + Sbjct: 398 SFLDLPNNRDADWRKYFLYVYYWEKNFPQTPTQFALRGDRFKYITYYGLWDTDELYDLQT 457 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYM 435 DP+E++NLI D + V +M D L + Sbjct: 458 DPDELNNLIHDPDYKSVAKEMEDQLYAML 486 >UniRef50_B4D6H3 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D6H3_9BACT Length = 525 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. 
Identities = 114/475 (24%), Positives = 191/475 (40%), Gaps = 44/475 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+MTD Q + VG + T N+D LAA G F++ Y SPVC P+R FT Sbjct: 30 RRPNILFIMTDQQRWDCVGANGNTIIKTPNMDRLAARGANFSNVYVASPVCVPSRISFFT 89 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----------DGHDYFGT 111 G YA+ N + + K+AGY T +GK H G D Sbjct: 90 GRYAHSHRNRVNYTPLDASEVLLQARLKEAGYRTASVGKLHYFPPTVEHAKSTGFDIVEL 149 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWR---NGLNSVEDLQANHIDETFTWAHRISN 168 + P D W D + K+ +R + ++ ID +T Sbjct: 150 HDGVP--FTDKWSDYVKWRQANDPKKDIYYRATAKNIEPGKNPNRAAIDTQYTDTTWTGE 207 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA-QDDLANKP- 226 R +L + A+ +PF + VS+ +PH P+ Y Y D + E +DLA+ P Sbjct: 208 RGRYWLTELAKGQQPFFLYVSFWKPHSPYEIGPPYDSMYDDANIPIPETVTANDLASMPL 267 Query: 227 EHHRLWAQAMPS---PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 +L + P+ + + + Y+ VD +IG ++ AL +NT ++++ Sbjct: 268 PLQKLSLRENPNVWKQTQERVEWMYRSYYGAISHVDHEIGLLLEALEASGQAQNTLIVFS 327 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADI 341 SDHG+ + H++ K ++ ++PL++ P + D + +DL+PT++ + Sbjct: 328 SDHGDQLMEHRIYGKN-CFFEPSVKVPLMVSLPGRIKPAHYDQLMETVDLVPTLLDFIGL 386 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP--------------------- 380 +P + G + + G + + I Sbjct: 387 PEPREVQGRSFAPLIADLGRPYTPHDAVFSENIIPEVITSGKMDLPFEKGKGVDGVRHPD 446 Query: 381 VRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 + TD +K ELYD + DP E NL V +M LL+++ Sbjct: 447 AKMVRTDRWKYCYYPEGYAELYDLQKDPGERTNLAGRPENHAVEEEMRTRLLNWL 501 >UniRef50_B5JYP8 Choline-sulfatase n=1 Tax=Octadecabacter antarcticus 238 RepID=B5JYP8_9RHOB Length = 531 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. 
Identities = 113/443 (25%), Positives = 191/443 (43%), Gaps = 15/443 (3%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + +M D A + + Y T N++ LAA+G F + Y+ +P+C P+RA + + Sbjct: 5 SRPNIILIMADQMAAHALSLYGNTVCKTPNLERLAAQGTVFENGYSNNPLCVPSRASMLS 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+ + + N ++ TM Y + AGY T GK H G D D Sbjct: 65 GMLSPDVNVFDNANELPSSVPTMAHYLRHAGYWTELCGKMHFIGPDQEHGFNQRSV--TD 122 Query: 122 YWFDGANYLSELTEKEISLWRNG-LNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + ++++ + LN V + + + + A+ L AR Sbjct: 123 VYPASFQWIADWQAGPAFVPSGTALNGVVESGPCVRTMQEDYDDEVEHCAIQSLYDRARE 182 Query: 181 D--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW----AQ 234 +PF +VS+ PH PFT EY ++Y + + H + + Sbjct: 183 PDRQPFFQIVSFTNPHTPFTVSQEYWDRYESSEIDAPAVGALPFEDLDYHSKALFFAHGR 242 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHK 293 + Y+ +VDD++GR+++ L QR+NT V + SDHGEM+G Sbjct: 243 HRHKVTQKHLIAARHAYYGMISYVDDKVGRILDTLEKTGQRDNTAVFFVSDHGEMLGERG 302 Query: 294 LISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 + K ++ +P I P + + VS +DLLPT + LA + PE L G ++ Sbjct: 303 MWFK-QTFWEWSAHVPFIASVPGITGGGRSEKVVSLVDLLPTFLDLAGADSPE-LAGSSV 360 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 L + E ++ + G +P R FK + LYD ++DP E++ Sbjct: 361 LPLMEGDADAWPDIAIS-DYLAIGPCVPCRMVRKGRFKFIYTHGHPALLYDLQDDPLELN 419 Query: 413 NLIDDIRFADVRSKMHD-ALLDY 434 NL D+ FADV +++ +L D+ Sbjct: 420 NLADNAAFADVLAELQAFSLTDW 442 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. 
Identities = 113/492 (22%), Positives = 183/492 (37%), Gaps = 65/492 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F + D +GCY T NID LAA+G RF AY PVC+P RA + T Sbjct: 35 QQPNIVFFLVDDLGQRDLGCYGSTFYETPNIDKLAADGARFTQAYAACPVCSPTRASILT 94 Query: 62 GIYANQSGP-----WTNNVAPGK------------------NISTMGRYFKDAGYHTCYI 98 G++ ++G N+ P K + T+ + K AGY T + Sbjct: 95 GLWPQRTGITDYIATDNSNGPAKWNRNTMTLPAAYRDRLALDSPTLAKSLKSAGYATFFA 154 Query: 99 GKWHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 GKWHL ++ P D G K+ + H+ Sbjct: 155 GKWHLGPEGFY-----PENQGFDINRGGIERGGPYGGKQYFSPYGNPRLTDGPAGEHLP- 208 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 R++ F++ A +PF S+ H P + +KY Sbjct: 209 -----DRLATETCQFIE--AHQKQPFFAYFSFYSVHTPLQAREDLRQKY--------VAK 253 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENT 277 ++ L KP R + + + H +Y A D +D +G+V+ L RENT Sbjct: 254 REKLGLKPTWGREHMRDV------RQVQEHAVYAAMVDAMDQAVGKVLAKLDELGLRENT 307 Query: 278 WVIYTSDHGEMMGAHKL-------ISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSH 328 VI+TSD+G + + MY+ R PL++R P +DTPVS Sbjct: 308 LVIFTSDNGGLSTSEGWPTSNLPLRGGKGWMYEGGIREPLVMRWPAKVKAGSTIDTPVSS 367 Query: 329 IDLLPTMMALADIEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 D + T++A + E + G ++L + + + H G P Sbjct: 368 PDFMATLLAATATKPAEQQQIDGVSLLPLLAGEKLKERSLFWHYPHYGNQGGAPAAAIRR 427 Query: 387 DDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 +KL+ L EL++ D +E NL + +M L + ++ Sbjct: 428 GSWKLIEWLEDGQVELFNLATDESETTNLASKE--PALVREMLAELHAWQKEVGAILPEK 485 Query: 446 QWSLRPWRKDAR 457 + P + R Sbjct: 486 NPNYDPAKPSGR 497 >UniRef50_B9XND0 Sulfatase n=3 Tax=Bacteria RepID=B9XND0_9BACT Length = 492 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. 
Identities = 124/480 (25%), Positives = 193/480 (40%), Gaps = 49/480 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN LF++ D +G + T ++D L +E + F +A + PVC+P RA L T Sbjct: 47 QPPNVLFIIADQWRAEAMGYNGNPDVKTPHLDHLQSESVDFVNAVSSVPVCSPTRASLMT 106 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G A G + N+V T+ + AGY T IGKWHLDGH + D Sbjct: 107 GQRALTHGVFVNDVPLSPKAITLSKVLHQAGYDTACIGKWHLDGHGRSQFIPRERRQNFD 166 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 YW L + S + L + F H A +L+ + A Sbjct: 167 YW----KVLECTHQYNNSFYFADLPFKLKWDGY---DVFAQTHD----ASQYLRNHSHAK 215 Query: 182 EPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 +PF + +S+ PH P+ P Y +Y + N P R AQ Sbjct: 216 KPFFLYLSWGPPHDPYQTAPATYRSQYQAAKIK-------TRLNVPPGMRASAQT----- 263 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGA 299 + Y++ +D +G ++ L E NT VI+TSDHG+M+ +H L+ K Sbjct: 264 ------NLAGYYSHCTAIDSCVGTLLQTLKDTGLETNTLVIFTSDHGDMLHSHGLV-KKQ 316 Query: 300 AMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 +D+ R+PL++R P G + R++D P + D +PT++ L P + G + A Sbjct: 317 HPFDESIRVPLLMRWPAGLGTQPRKLDAPFNSPDFMPTILGLCGAPVPNTVEGIDYSAYL 376 Query: 357 EPR----------GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 + V F Y +H G R T + V +L L+D Sbjct: 377 QGDVNPSDGATLISCPVPFGEYSRQH----GGREYRGIRTTRYTYVRDLNGPWLLFDNLE 432 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRP 466 DP +M NL+ A + + LL + + D F Q L W + + P Sbjct: 433 DPAQMDNLVGQPECAQLEEDLEKILLQKLAEANDQFLPGQAYLDRWGYKLNANGIIPYTP 492 >UniRef50_A6DPE5 Iduronate-2-sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPE5_9BACT Length = 487 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. 
Identities = 108/454 (23%), Positives = 189/454 (41%), Gaps = 28/454 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N LF+ D + +G Y + T N+D LA G F+ AY P+C P+RA + +G Sbjct: 20 KMNVLFISADDLNCD-IGPYGNTQVKTPNLDRLARMGTVFDRAYCQQPLCGPSRASIMSG 78 Query: 63 IYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE- 117 + N G WT N N+ TMG +F+ GY++ +GK +H Y GT E Sbjct: 79 LRPNTLGVWTLNSKLRGRIPNLVTMGEFFQKQGYYSGRVGKIYHYGNPTYIGTNGNDDEQ 138 Query: 118 -WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID----ETFTWAHRISNRAVD 172 W + G + E + G + D + +++RA+ Sbjct: 139 TWTERFNPKGIDRTQEENIIRYPGGKTGKKGGLGISMAWWDPVSKDNEHTDGLVADRAIK 198 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYA--DFYYELGEKAQDDLANKP---- 226 ++ A D+PF + + PH P+ P +Y + Y D + E+A+ +LA+ P Sbjct: 199 MIE--ANKDKPFFIAAGFFNPHCPYVAPKKYFDMYDINDIELQELEEAKQELADVPAMAI 256 Query: 227 --EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 + + W D+ Y+A F+D Q+GR+ AL + T +++ S Sbjct: 257 QRDAGQRWPYFYKGLTRDEAKQCKLAYYATVSFIDAQVGRIFEALEKNNLMDKTIIVFWS 316 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIE 342 DHG +G L K A ++ R PL+I +P + + +PV +D+ PT++ + Sbjct: 317 DHGYFLGEKGLWFKRKA-FERSARAPLLIAAPGLSKGQVCKSPVELLDIYPTLVEATGFQ 375 Query: 343 KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--SDE 400 P L G ++ + + ++ + I G T ++ E Sbjct: 376 IPSELEGVSLSPLL--KNAQTKWTKPAITQIHHGADKQGYSIRTKKWRYTEWNKGQAGKE 433 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 LY+ DP E NL + + +++ L + Sbjct: 434 LYNHETDPEETINLATNPEHTQIVAQLSTELQKF 467 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 381 bits (978), Expect = e-104, Method: Composition-based stats. 
Identities = 116/516 (22%), Positives = 179/516 (34%), Gaps = 96/516 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D +GCY T NID +A +G+RF Y PVC+P RA + TG Sbjct: 22 KPNVVLIVIDDLGQRDLGCYGSTFYKTPNIDRMAKDGVRFTDFYAACPVCSPTRASIMTG 81 Query: 63 IYANQSGPWT----NNVAPGK-------------NISTMGRYFKDAGYHTCYIGKWHLDG 105 Y + G PG+ T+ K GY T +IGKWHL G Sbjct: 82 KYPQRVGITDWLPGRKDLPGQRLKRPELKNELALEEVTVAETLKGHGYVTAHIGKWHLGG 141 Query: 106 HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 G P + D G + + L+ + G ++ L+ DE R Sbjct: 142 K-----GFEPEKQGFDVNVAGDHTGTPLSYFAPFANKAGA-TMPGLEKAAPDE--YLTDR 193 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 ++ A F+ A D+PF + + + H P P ++KY Sbjct: 194 LAAEAETFI--TANKDKPFFLYLPHYGVHTPLRAPQPLVDKYKTQAV------------- 238 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSD 284 G +P+Y A + +D +GRV+ L + +NT V++TSD Sbjct: 239 -----------------HGRQSNPVYAAMVESMDAAVGRVLKRLDDLKLSDNTLVLFTSD 281 Query: 285 HGEMMGAHK----------LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLL 332 +G + L +Y+ R+PLI + P +D ID Sbjct: 282 NGGLATLEGMPFAPTINAPLREGKGYLYEGGVRVPLIAKWPGKVKPGTVMDQVACSIDFF 341 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV 392 T++ G +++ + + H + G P ++KLV Sbjct: 342 DTILEATGATSAARRDGVSLVPAFGGEKLKPRALYWHYPHYANQGSRPGGAVRAGNYKLV 401 Query: 393 LNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 EL+D D +E NL D DV + L Sbjct: 402 EYYEDGRRELFDVAKDLSESRNLAADK--PDVVKDLAAKL------------------DA 441 Query: 452 WRKDARPRWMGAFRPRPQDGYSPVVRDYDTGLPTQG 487 WR D +GA P P Y P D D + Sbjct: 442 WRTD-----VGAKMPTPNPDYRPNPPDKDGAITLHA 472 >UniRef50_Q029P1 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q029P1_SOLUE Length = 467 Score = 381 bits (978), Expect = e-104, Method: Composition-based stats. 
Identities = 120/446 (26%), Positives = 201/446 (45%), Gaps = 27/446 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D + +GCY + T N D LA EG+RF +A+ +P C P+R L TG Y Sbjct: 27 NLLVITNDQHRADCLGCYGNPVIRTPNTDRLAGEGVRFGNAFVHAPQCVPSRVSLHTGRY 86 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF 124 + TN+ ++ T+ + GY T +G+ Y G + + +Y Sbjct: 87 PHVHRVPTNSYDLPESEQTLAKVLNANGYRTACVGEMPFAPRAYTGGFQQVLASNREYDQ 146 Query: 125 DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPF 184 A + + + + + +DL + A DFL+ A D PF Sbjct: 147 FLAGHGLKFPKSDGPFQAAPVPWTDDLDE---------TAFFAGHARDFLK--ANRDRPF 195 Query: 185 LMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ-----AMPSP 239 + +++ PHHPF P + + Y + ++ANKP + + + S Sbjct: 196 FLDINFRRPHHPFNPPAPFDKMYLGAAFPPSHARPGEMANKPPQQKAALENSVGFDLRSM 255 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHKLISKG 298 D Y+ D IG V++ L + E+ T V++ +DHGEM+G H L+ KG Sbjct: 256 TPADLDRVKAYYYGMISENDKYIGTVLDELKSQGLEDRTVVVFNADHGEMLGDHGLLFKG 315 Query: 299 AAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 + MYD +T++PLI+R+P R VD V +D++PT++ L I+ P + G++++ + Sbjct: 316 SYMYDGVTQVPLILRAPGKLPARTVVDGLVEEVDVMPTLLELLGIDVPAGVQGKSLVPLA 375 Query: 357 EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-LNLFTSDELYDRRNDPNEMHNLI 415 + + F F ++ T ++KLV N ELY DP+E+ NL Sbjct: 376 DNPKARHKDA-------VFAEFPTIKMARTREWKLVHYNKAKYGELYHLTEDPHELTNLY 428 Query: 416 DDIRFADVRSKMHDALLDYMDKIRDP 441 DD ++A + M L D++ DP Sbjct: 429 DDPKYAPASADMQGLLADWLATSTDP 454 >UniRef50_C7MEQ7 Choline-sulfatase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MEQ7_BRAFD Length = 520 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. 
Identities = 129/502 (25%), Positives = 214/502 (42%), Gaps = 34/502 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + + D A +G Y T ++D+LAAE F+ AY +P+C P+RA + TG Sbjct: 4 RPNIVVIQADQMAAQALGAYGDTAARTPHMDALAAEAAVFDRAYCNTPLCAPSRASMMTG 63 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + + N +I T + + AGYHT +G+ H G D E D Sbjct: 64 RMPSDIDCFDNGSDFAASIPTFAHHLRAAGYHTALVGRMHFIGPDQHHGFEQRLTTDVYP 123 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 + + W + ++V + + + RA+ L RAD+ Sbjct: 124 ADMDMVPDWQRDLGDRLQWYHDADAVHTAGVSQATVQLDFDDEVGFRALRHLNDRVRADQ 183 Query: 183 ------PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-DDLANKPEHHRLWAQA 235 PFLMV S+ PH P+ P E+ +++AD + D A P HRL A + Sbjct: 184 AAGERVPFLMVASFIHPHDPYEPPQEHWDRFADVDIPAPRHPEVPDPAQDPHSHRLRAMS 243 Query: 236 ---MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGA 291 ++ Y+A ++DD +GR+ L E+T V+ TSDHG+M+G Sbjct: 244 GFDQRETTEEEVRRARRSYYAAVSYIDDHVGRIRERLESLGLWEDTVVVVTSDHGDMLGE 303 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIE------- 342 L K + Y++ +R+PLI+ P+ + PVS +DL+PT++ L + Sbjct: 304 KGLWFK-MSPYEESSRVPLILHGPEHLVPAGRYANPVSLLDLMPTLLELGGADGATSAAA 362 Query: 343 --KPEILPGENIL--AVKEPRGVMVEFNR-YEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 G ++L A +E G +R IE+ + G P V K V+ Sbjct: 363 EATTPARQGLSLLESARRERSGTAGPADRDVIIEYLAEGTLRPQLTLVRGQHKFVVCPGD 422 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY--MDKIRDPFRSYQWSLRPWRKD 455 D+L+D DP+E N+ D A++ +++ A+ + + + + Q R+ Sbjct: 423 PDQLFDLHTDPHERTNIAADPAQAELVAELRAAVAAQYDLAALEEKVLASQA-----RRR 477 Query: 456 ARPRWMGAFRPRPQDGYSPVVR 477 + + + R RP D Y P Sbjct: 478 LVAQALQSGRSRPWD-YEPDPE 498 >UniRef50_C3WCE8 Arylsulfatase n=2 Tax=Fusobacterium RepID=C3WCE8_FUSMR Length = 476 Score = 379 bits (974), Expect = e-103, Method: Composition-based stats. 
Identities = 118/473 (24%), Positives = 215/473 (45%), Gaps = 29/473 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + TD + +G Y + T N+D +A EG+ F +++ SPVCTP+RAG+FT Sbjct: 6 KKPNIVLITTDQMRADAIG-YINSKVITPNLDMMAKEGVVFTNSFCSSPVCTPSRAGIFT 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL--------DGHDYFGTGE 113 G Y +G W +N T+ + K GY+ +GK H + ++ + Sbjct: 65 GRYPMNTGAWNIGTCLDENEITLADWLKGEGYYNIGVGKMHFRPQLKDFDNNYEDVEVRD 124 Query: 114 CPPEWDADYWFDGANYLSELTEKEIS---LWRNGLN---SVEDLQANHIDETFTWAHRIS 167 E D Y+ Y++E ++ L NG + + N + E F + I Sbjct: 125 RVRERDKTYYGFDETYITEDDKQGKYLDFLDENGYHLEVGKGNDGMNPLPEEFNQTYWIG 184 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 ++ + +++ ++P M+ ++ +PHHPF ++ Y + +PE Sbjct: 185 MKSCEAIRKY-DFNKPLFMMTNFVDPHHPFDPAEKFARMYDGVEIDSPISKDKFCNERPE 243 Query: 228 HHRLWAQAMPSPVG--------DDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTW 278 + + + P G + + Y+A F+D +IG++ L + +NT Sbjct: 244 YLKRQGERGYWPGGGEQHKLSDEKVEEYTRYYYAMITFIDQEIGKIRKELEKKGELDNTI 303 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMA 337 +I+TSDHGE MG + L+ KG MYD++ ++PL+ E+ D V +ID++PT++ Sbjct: 304 IIFTSDHGEYMGDYGLLQKGPFMYDNLIKVPLLFWGKGVEKSVTSDEIVENIDIVPTILE 363 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 L E P + GE++ + + + +D+ I V+ + +KL L + Sbjct: 364 LIGKEVPYGIQGESLKNILQKIDKERVKKSAIVTYDARDRGIMVKSYRDKRYKLNLFMNE 423 Query: 398 S-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF--RSYQW 447 E+YD DP E NL + +++++ M + DP R+ W Sbjct: 424 EYGEMYDLEVDPQETTNLFFKEEYLQLKNELLLKACYRMMECSDPLSKRTANW 476 >UniRef50_UPI000051016C choline-sulfatase n=1 Tax=Brevibacterium linens BL2 RepID=UPI000051016C Length = 509 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. 
Identities = 131/496 (26%), Positives = 219/496 (44%), Gaps = 30/496 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M+ PN + + D A +G Y T N+D+LAA+G F+ AY +P+C+P+RA + Sbjct: 1 MQPPNIVVIQADQMAAQALGAYGDTAALTPNMDALAADGAVFDRAYCNTPLCSPSRASMM 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG + N ++ T + GYHT IG+ H G D E Sbjct: 61 TGRMPSDIDCLDNGDDFAASVPTFAHRLRKLGYHTALIGRMHFIGPDQHHGFE--ERLTT 118 Query: 121 DYWFDGANYLSELTEK--EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL---- 174 D + + + + + W + + V A + + + R + L Sbjct: 119 DVYPADLDMVPDWQRPLDQKLQWYHEADPVFTAGAAKANVQQDFDDEVIFRTLRHLNGRV 178 Query: 175 --QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-DDLANKPEHHRL 231 Q A D+PFLMV S+ PH P+ P E+ +++A+ + D+A P HRL Sbjct: 179 RANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAEVDIPDPAHPEVPDIAEDPHSHRL 238 Query: 232 ---WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE 287 P +D Y+A ++DD IG++ L E +NT +I TSDHG+ Sbjct: 239 RTMSGLDKKEPGTEDIRRARRAYYAAVSYIDDHIGKIRQRLRELELEDNTVIIVTSDHGD 298 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPE 345 M+G L K + Y+ +R+P+II P + PVS +DL+PT++ LA P+ Sbjct: 299 MLGEKGLWYK-MSPYEQSSRVPIIINGPAEAVTPGRYANPVSLVDLMPTLLELAGTSDPD 357 Query: 346 ILPGENIL--AVKEPRGVMVEFNR-YEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELY 402 G ++ A +E G +R IE+ + G + P + +KL + + L+ Sbjct: 358 AT-GVSLFESARQEAAGETGPADRDVIIEYFAEGTYRPQVTLIRGQYKLTICPGDPELLF 416 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDAL-----LDYMDK-IRDPFRSYQWSLRPWRKDA 456 D +DP+E+ N D +A++ + M L L+++++ + S Q + Sbjct: 417 DLESDPDELVNRAGDAAYAELVATMRAELDSRYDLEHLEEHVLGSQSSRQLVADALKIGT 476 Query: 457 RPRWMGAFRPRPQDGY 472 W F P P++GY Sbjct: 477 VRHW--DFDPEPENGY 490 >UniRef50_C6CXF5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CXF5_PAESJ Length = 509 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. 
Identities = 117/490 (23%), Positives = 194/490 (39%), Gaps = 50/490 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN L + TD Q + Y L+T +++LA G+ F A+ P C P+R+ +F Sbjct: 1 MSKPNILLIQTDQQTAETLSLYGNTALHTPALEALAERGVVFEQAFCNYPACAPSRSSMF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD----YFGTGECPP 116 TG Y + N++ + T+ + K+ GY T IGK H Y G P Sbjct: 61 TGRYCSTLNLHANHMLINPSEVTLPQVLKNHGYQTAIIGKNHAFTERPSSIYPGGVPENP 120 Query: 117 E---------WDADYWFDGANYLSELTEKEISLW--RNGLNSVEDLQANHIDETFTWAHR 165 AD+ Y + + W + +S N + Sbjct: 121 SLLHEVFDYVRLADHGHMVDGYRDDPGAQAAHAWAVEHCWSSPLGHGTNPAPVEKCGTYL 180 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 + +D+L + ++PF +S+ +PH P+ P Y + D L K Sbjct: 181 LGETMLDYLAHLRQENQPFFTWLSFPDPHTPYQVPEPYASMIRPEDVPMPPV--DSLEGK 238 Query: 226 PEHHRLWAQAMPSPVGDDGLYH--HPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYT 282 PE ++ D+ L +++ F+DD + ++ + ENT +I+T Sbjct: 239 PERVKVAHLMDAMDTADEQLIRQVRAIHYGMIRFIDDTLAKIFERMDALSLLENTVIIFT 298 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADI 341 SDHG+ MGAH +I K YD T +P I+ P + ++ + ID++PT++ LA I Sbjct: 299 SDHGDSMGAHGIIQKHNFFYDSFTHVPFIMSLPGYKGTKRTSNLLELIDIMPTLLELAGI 358 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV-------------------- 381 P G++ A E + +E G + V Sbjct: 359 PVPPGCQGKSHAAFLEGDLSVTPREYVVMESGEHGDPVKVSDITLRPEHPLDERYFVWCA 418 Query: 382 ---------RCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 + T ++KL + ELYD + DP+E+HN D FA VR ++ LL Sbjct: 419 YSDAWIGKGKAIRTKEWKLCIYANGEGELYDLKADPHELHNRFPDPAFASVRIELERKLL 478 Query: 433 DYMDKIRDPF 442 + + D Sbjct: 479 QWSMEKEDRL 488 >UniRef50_A6C9F6 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C9F6_9PLAN Length = 506 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. 
Identities = 110/450 (24%), Positives = 191/450 (42%), Gaps = 22/450 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF++ D + +GCY + + NID LA +G+RF AY P+C P+RA TG Sbjct: 45 KPNVLFLICDDLNCD-LGCYGHPQVQSPNIDQLAKQGVRFEHAYCQFPLCGPSRASFMTG 103 Query: 63 IYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPEW 118 +Y +Q+ N + + N+ TM + F+D GY +GK +H + + GT + Sbjct: 104 MYPDQTLVHRNGIYIREHVPNVKTMSQMFRDHGYFATRVGKIYHYNVPKHIGTSGHDDPY 163 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 + F+ + ++ SL A + ++ A+ L++ A Sbjct: 164 SWNQTFNPRGRDVDDEDQIFSLVPGSYGGTLSWLAAEGTDAEQTDGIAADIAIQQLKKFA 223 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 + EPF + V PH P+ P Y EKY ++ + L P R Sbjct: 224 ESKEPFFLAVGLYRPHTPYVAPKSYFEKYPVEQIKVPQIPDGYLKTIPASARKSVTRKKD 283 Query: 239 PV---GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKL 294 + Y+A F D Q+G +++AL + NT V++TSDHG MG H Sbjct: 284 QIDLPDKLARQAIQAYYASITFADAQLGHILSALKETGLDENTIVVFTSDHGYHMGEHGH 343 Query: 295 ISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 K ++++ T +P+II P + + P +D PT+ L ++ P + G + Sbjct: 344 WQK-TTLFENATHVPMIIAGPGVTAKGQAAAAPAEMVDFYPTLAELCGLKAPASVSGISQ 402 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL---NLFTSDELYDRRNDPN 409 + + + ++ T F+ N ELYD +DP Sbjct: 403 VPALKDATATPR-------KTALTQYLNGYSLRTPTFRYTEWGTNGSEGVELYDHSSDPA 455 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 EMHNL + + +R ++ + L + +++ Sbjct: 456 EMHNLANQAKTQKLRDELAEILHERIEQAN 485 >UniRef50_A6C8U0 Choline sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8U0_9PLAN Length = 479 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. 
Identities = 109/447 (24%), Positives = 189/447 (42%), Gaps = 17/447 (3%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F+++D Q + + + T ++D L G F A +P+CTP+RA + +G Sbjct: 33 QPNIVFLLSDDQRPDTIAALGNPIIKTPHLDQLVKAGTSFTRAVCANPICTPSRAEILSG 92 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD---YFGTGECPPEWD 119 + +G K + T + AGY+T Y+GKWH DG + Sbjct: 93 VSGFHNGSMDFGKPIKKELPTWSQTLSKAGYNTWYVGKWHNDGKPVLRGYDETLGLFTGG 152 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 W + + + W + + T + ++ A++F+++ + Sbjct: 153 GGRWAVPSYDGNGVLVTGYRGWIFQDDERHFFPEKGVGLTSNISEHFADAAIEFVER--K 210 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA-QDDLANKPEHHRLWAQAMPS 238 +PF + V + PH P P+ Y + Y + + +P Sbjct: 211 HQKPFFLHVCFTAPHDPLLMPIGYEQNYDPDQMPVPANFLPQHPFDHGNFDGRDEALLPW 270 Query: 239 PVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLIS 296 P + + + LY++ +D Q+GR++ AL ENT +I++SDHG MG+H L Sbjct: 271 PRTKEIVKNDLSLYYSVISHLDAQVGRIVKALKKTGEWENTILIFSSDHGLAMGSHGLRG 330 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 K MY+ +PLI+ P + DL PT LA + P+ + G+++ V Sbjct: 331 K-QNMYEHTVNVPLIMVGPGIPADTLSNAQCYLRDLYPTSCDLAGVPIPKTVEGKSLKPV 389 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNL 414 + V Y+ + F F R TD +KL+ +L+D +NDP E H+L Sbjct: 390 LSGQLDAV----YDEVYCYFRNF--QRMIRTDRWKLIYYPHLDRVQLFDLKNDPLEQHDL 443 Query: 415 IDDIRFADVRSKMHDALLDYMDKIRDP 441 + VR K+ D L D+ + DP Sbjct: 444 SGEAALQQVRGKLLDQLNDWRKQQNDP 470 >UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE Length = 499 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. 
Identities = 119/454 (26%), Positives = 200/454 (44%), Gaps = 35/454 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +R N +F+++D + +G L T ++D+LA +G +A+ C+ +C+P+RA + Sbjct: 27 RRRNVIFILSDDHRYDALGFMHPQPWLRTPHLDTLARDGAHLKNAFVCTALCSPSRASIL 86 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG+YA++ NN A + + + AGY T ++GKWH+ G P+ Sbjct: 87 TGVYAHRHHIVDNNTAIPRGTRFFPQLLQRAGYKTGFVGKWHM------GREGDDPQPGF 140 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D W S L E RNGLN + H+ + +++ A+D+L+ + Sbjct: 141 DKWVSFRGQGSYLPE------RNGLN----VDGKHVPQKGYITDELTDYALDWLRTVPK- 189 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 ++P+ + +S+ H F + YA + + N S Sbjct: 190 EQPYFLYLSHKAVHADFIPADRHKGAYAKETFRPPTTMDESGPNAQHRPMWVQNQRNSWH 249 Query: 241 GDDGLYH--------HPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 G D YH + Y VDD + R+++AL Q ++T VIY D+G G Sbjct: 250 GVDFPYHSDLDVGEYYKRYAETLLGVDDSVDRMLDALRERGQLDSTLVIYMGDNGFQFGE 309 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 H LI K Y++ R+PL+ R P+ R VD V+ +D++PT++ A P+ L G Sbjct: 310 HGLIDK-RTAYEESMRVPLLARCPEMFSGGRVVDRMVAGLDIMPTVLDAAGAAIPQGLDG 368 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDS---FGGFIPVRCWVTDDFKLVLNL--FTSDELYDR 404 ++L + + E+ F + TD +K V + SDELYD Sbjct: 369 RSMLPLLRGENDPQWRTQLLYEYYWERNFPQTPTMHALRTDRYKYVRYYGIWDSDELYDL 428 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 + DPNE NLI + + + L D M++ Sbjct: 429 QEDPNETTNLIYNPERKATIEEFNKRLFDEMERT 462 >UniRef50_Q482B9 Sulfatase family protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q482B9_COLP3 Length = 511 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. 
Identities = 122/472 (25%), Positives = 206/472 (43%), Gaps = 32/472 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+ D N +G Y + + NID+LA +GIRF+ AY+ SP+CTP+R+ TG+Y Sbjct: 42 NVLFITIDDL-NNDLGAYGHHLVKSPNIDALAKKGIRFDKAYSQSPMCTPSRSSFMTGLY 100 Query: 65 ANQSGPWTNNVAPG---------KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGEC 114 +Q+G + ++T+ + FK+ GY + +GK +H + GT Sbjct: 101 PDQTGIIAHGSHTQMTAHFREHIPKVTTLPQLFKNNGYFSGRVGKIYHQGVPNQIGTSGA 160 Query: 115 PPEWDADYWFDGANYLSELTEK-----EISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 + ++ +K E +L R V A D+ +++ Sbjct: 161 DDAASWHETVNPIGLDKDVEDKIIAFNEKALVRQSFGGVLSFLAIGDDDKAHTDGKVATE 220 Query: 170 AVDFLQQPA--RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 ++ ++ + +PF + + PH PF P +Y + Y + ++D + P+ Sbjct: 221 TINMIKDHHPDKTGKPFFIGAGFYRPHTPFVAPKKYFDLYPLEKIKPYIAPKNDRKDIPD 280 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 + + Y+A +VD Q+GRV++AL + +NT V++ SDHG Sbjct: 281 IALQDREGQVGLTLNQRKQIIQGYYAAVSYVDAQVGRVLDALKQQDLSDNTIVVFLSDHG 340 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKP 344 +G H L KG ++++ R PLII +P + R V +PV +D+ PT+ L + P Sbjct: 341 YELGQHGLWQKG-SLFEGSARAPLIIYAPNVKDNGRVVTSPVELVDIYPTLAKLTGLVAP 399 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGG--------FIPVRCWVTDDFKLVLNLF 396 E L G+++ V Y + G I T+ ++ Sbjct: 400 EYLAGKDLTPALNDVDFQVRKGAYSAILNRNKGDNNQFAFTKIRGHSIRTNRYRYTEWGE 459 Query: 397 T--SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 ELYD +NDP E+ NL D + VR KM L D MD + +S + Sbjct: 460 GYFGAELYDHKNDPQELKNLADKVSLESVRIKMKWLLNDAMDDAQKRIKSIE 511 >UniRef50_A9ECS8 Sulfatase n=3 Tax=Bacteria RepID=A9ECS8_9FLAO Length = 574 Score = 376 bits (966), Expect = e-102, Method: Composition-based stats. 
Identities = 109/511 (21%), Positives = 203/511 (39%), Gaps = 78/511 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP---LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 KRPN +++M D A + Y T NID LA G F + + + +C P+RA Sbjct: 34 KRPNIIYIMADDHAAQAISAYGHPIGKLAPTPNIDRLAKNGAIFKNNFCTNSICGPSRAV 93 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH-DYFGTGECPPE 117 + TG +++ +G N + T+ + K AGY+T IGKWHL G+ + F + Sbjct: 94 VLTGKHSHINGFRMNGERFDGSQQTLPKLLKKAGYNTAIIGKWHLHGYPEGFDYWNILND 153 Query: 118 WDADYW--FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 Y F +K I + ++ D I++ A+D++ Sbjct: 154 QGNYYNPQFIKIQDTIHFNKKHIDSTAHWTANLPDT----TTVKGYATDLITDYAIDYID 209 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK---------- 225 + +D+PF +++ + PH + + +L KY + L E + N Sbjct: 210 KKKNSDQPFFIMMHHKAPHRNWMPALRHLNKYDSVQFPLPETYFTNHENSTASKEQLQTI 269 Query: 226 ----------------------------------PEHHRLWAQAMPSP------------ 239 PE W +A Sbjct: 270 YRDMYEGHDLKMTKKKGSPELAWNPWKTDFERMTPEQRAAWDKAYQPKNDAFHDANLTGK 329 Query: 240 --VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLIS 296 G + Y + VD+ +G++++ L ENT V+YTSD G +G Sbjct: 330 ALAEWKGQRYLQEYLSTIASVDEGVGKILDYLEANGLAENTIVVYTSDQGFYLGEKGWFD 389 Query: 297 KGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 K MY++ ++PL+I+ P+ + V+ ++D T + A+++ PE + G++ + Sbjct: 390 K-RFMYEESLKMPLLIQYPEKIKSGTVVEGLTQNLDFAETFLDFANVDIPEDMQGKSFVG 448 Query: 355 VKEPRGVMVEF----NRYEIEHDSFGGFIPVRCWVTDDFKLV--LNLFTSDELYDRRNDP 408 + + +F + ++ +F + T +KL+ + ELYD + DP Sbjct: 449 LLDGSESDEDFRDAVYYHYYDYPAFHMVKKMYGIRTKRYKLIHVYDDIDEWELYDLQTDP 508 Query: 409 NEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 E+ NLI+D + ++ +K+ L++ + Sbjct: 509 QELTNLINDENYDEIETKLRKRLVELQQQYN 539 >UniRef50_A4A280 Iduronate-2-sulfatase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A280_9PLAN Length = 475 Score = 376 bits (965), Expect = e-102, Method: Composition-based stats. 
Identities = 115/455 (25%), Positives = 199/455 (43%), Gaps = 31/455 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N LF+++D + + CY + T NID LA G++F AY PVC P+RA L +G Sbjct: 25 KYNVLFIISDDLSAESLSCYGHRECQTPNIDRLAQRGVKFTHAYCQYPVCGPSRAALMSG 84 Query: 63 IYANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGK-WHL----------DGH 106 ++A G N + + ++M ++F+D GY+ + K +H+ +G Sbjct: 85 LHAATIGVMGNGQSTRFTQNLGDRASMSQHFRDQGYYAARVSKIYHMRIPGDITAGTNGD 144 Query: 107 DYFGTGECPPEWDADYWF---DGANYLSELTEKEIS-LWRNGLNSVEDLQANHIDETFTW 162 D+ + + A W D A Y +E K+ + G + D Sbjct: 145 DHAASWDERFNCQAPEWMSAGDAATYSNEKLNKDPDKHYGLGFGTAFYAVKASTDGAEQA 204 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 H+ +++A++ L++ +E F + V PH P P ++ E YAD EL K D Sbjct: 205 DHKAADKAIELLRK--HKEERFFLAVGMVRPHVPLVAPAKFFEPYADGQMELPLKVAGDW 262 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIY 281 + P+ + Y+A ++D Q+GRV++ L + NT V++ Sbjct: 263 DDIPKAGISRNSKATGMTLEGQRNTLSAYYAAVAYMDYQVGRVLDELHQLGLDKNTVVVF 322 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADI 341 T+DHG +G H K +++++ T IPLI+ P + + V+ + ID+ PT+ L ++ Sbjct: 323 TADHGYHLGEHDFWQK-MSLHEESTHIPLIVAIPGEQPKVVNGLAAQIDIYPTLAQLCEL 381 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDEL 401 P L G + +A V D + TD + + ++EL Sbjct: 382 PVPTYLQGVSQVAAIASPDAAVRD-------DVLCMTSKGKLLRTDRYAYISYSGGTEEL 434 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 YD ++DP + NL D V K+ L + D Sbjct: 435 YDMQSDPQQYTNLAKDPASQPVLGKLRAQLKERAD 469 >UniRef50_C9L086 Mucin-desulfating sulfatase n=54 Tax=Bacteria RepID=C9L086_9BACE Length = 527 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. 
Identities = 115/511 (22%), Positives = 202/511 (39%), Gaps = 92/511 (18%) Query: 2 KRP--NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 ++P N +++MTD M+ CY + + T N+D +AA+G+RF ++ + + P+RA + Sbjct: 32 EQPPLNIVYIMTDDHTAQMMSCYDTRYMETPNLDRIAADGVRFTQSFVANSLSGPSRACM 91 Query: 60 FTGIYANQSGPWTNNV-APGKNISTMGRYFKDAGYHTCYIGKWHLDG-HDYFGTGECPPE 117 TG ++ + + N + T + + AGY T +GKWHL+ F E P Sbjct: 92 ITGKHSCANKFYDNTTCVFDSSQQTFPKLLQKAGYQTALVGKWHLESLPSGFDYWEIVPG 151 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 Y D ++ +K + I++ A+D+++ Sbjct: 152 QGDYYNPDFITQKNDTIQKH----------------------GYITNLITDNAIDWMEHK 189 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP----------- 226 ++PF +++ + H + L Y D + L + DD +P Sbjct: 190 RNPEKPFCLLIHHKAIHRNWMADTCNLALYEDKTFPLPDNFFDDYEGRPAAAAQEMSVVK 249 Query: 227 -------------------------------EHHRLWAQAMPSPVGDDG----------- 244 E R +P+ +D Sbjct: 250 DMDMIYDLKMLRSDKNSRLKSLYEKFLGRMDEGQRAAWDKFYAPIIEDFYKQNLQGKELA 309 Query: 245 ----LYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGA 299 + Y +DD +GRV++ L + +NT V+YTSD G MG H K Sbjct: 310 NWKFQRYMRDYMKTVKSLDDNVGRVLDYLKEKGLLDNTLVVYTSDQGFYMGEHGWFDK-R 368 Query: 300 AMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK- 356 MY++ R PLI+R P+G R + V +ID PT + LA +E P + G +++ + Sbjct: 369 FMYEESMRTPLIMRLPKGFDRRGDITEMVQNIDYAPTFLELAGVEIPSDIQGVSLVPLLK 428 Query: 357 --EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS--DELYDRRNDPNEMH 412 +P + E+ + T+ +KL+ ELYD + DP+EMH Sbjct: 429 GKQPENWRKALYYHFYEYPAEHMVKRHYGVRTERYKLIHFYNDINWWELYDMKTDPSEMH 488 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 NL + V +++ + L ++ DP R Sbjct: 489 NLYGQPEYESVVNELKEELQKLQEQYNDPVR 519 >UniRef50_Q7UJ67 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UJ67_RHOBA Length = 505 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. 
Identities = 118/451 (26%), Positives = 193/451 (42%), Gaps = 22/451 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN LF+ D + +GCY T NID LAA G+ F AY P+C P RA + T Sbjct: 44 SKPNVLFIAVDDL-ASALGCYGDVVAKTPNIDRLAATGVCFRRAYNQLPLCNPTRASVMT 102 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG--ECP 115 G+ +Q + + + T+ + F+ AGY +GK +H + GT + P Sbjct: 103 GLRPDQIKVYDLDRHFRDEVPNVITLSQAFQQAGYFAARVGKIYHYNVPASIGTDGFDDP 162 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 P W+ G + E R ++ L A+ DE T I+ A+ ++ Sbjct: 163 PSWNQTVNPKGRDKDDEHLIFNAEPHRKISGALSWLAADGEDEEQT-DGMIATEAIRIMR 221 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + DEPF + V + PH P+ P +Y + Y L D + P Sbjct: 222 E--KKDEPFFLGVGFFRPHTPYVAPKKYFDMYPLESLRLPFAPAGDREDIPTAAFAHNCP 279 Query: 236 MPSPVGDDG--LYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH 292 +P+ D+ L Y+AC F+D Q+GR+++AL + +NT V++ SDHG +G H Sbjct: 280 VPNYGLDETTLLKATQAYYACVSFIDAQVGRLLDALEEQGLADNTIVVFWSDHGYHLGEH 339 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 + + ++++ + PLIIR P + V +D+ PT+ +A IE P L G + Sbjct: 340 NGVWQKRTLFEEGAKAPLIIRDPSQLGLGSCNRIVEFVDIYPTLTDVAGIESPSGLAGRS 399 Query: 352 ILAVK-EPRGVMVEFNRYEIEHDSFGGFIPVRC---WVTDDFKLVLNLFTSD--ELYDRR 405 + + +P ++ + T ++ ELYD + Sbjct: 400 LKPLLNDPVANWNGTAITQVLRPADDRLPEQVMGCSIRTHRYRYTEWAEGRHGVELYDHQ 459 Query: 406 NDPNEMHNLIDDIRFA--DVRSKMHDALLDY 434 +DPNE HNL D V ++ L Sbjct: 460 SDPNEFHNLALDPDERAVAVIRRLRPLLRAK 490 >UniRef50_C5BVK2 Sulfatase n=11 Tax=Actinomycetales RepID=C5BVK2_BEUC1 Length = 505 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. 
Identities = 139/500 (27%), Positives = 216/500 (43%), Gaps = 36/500 (7%) Query: 2 KRP----NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARA 57 +RP N LF +TD + +G Y + T N+D+LAA+G F+ YT + +CTPARA Sbjct: 7 ERPVALTNILFFLTDQHRKDTLGAYGNATVRTPNLDALAADGTTFDRFYTPTAICTPARA 66 Query: 58 GLFTGIYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 L TG + N + T +AGYH +GKWH+ H G Sbjct: 67 SLLTGAAPFRHKLLANYERNVGYQEELSEGQFTFSEDLAEAGYHLGLVGKWHVGTHRTAG 126 Query: 111 T-GECPPEWDADYWF-DGANYLSELTEKEISLWR----------NGLNSVEDLQANHIDE 158 G P + D A+YL+ L E ++ +R NG H Sbjct: 127 DLGFDGPHLPGWHNPVDHADYLAYLEENDLPPYRISDEVRGTFPNGAPGNLLAARLHQPL 186 Query: 159 TFTWAHRISNRAVDFLQQPAR----ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL 214 T+ + ++ RA+D L+ AR + PF + + PH P+ P EYL+ Y EL Sbjct: 187 EATFEYFLAERAIDLLRTYARDHRTSGRPFFLATHFFGPHLPYILPSEYLDMYDADDVEL 246 Query: 215 GEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHH--PLYFACNDFVDDQIGRVINALTPE 272 + A KP ++ D Y+ VD Q+GR+++A Sbjct: 247 PLSVAETFAGKPPVQGNYSAHWTFDTLGDETSRKLIAAYWGYVTLVDSQVGRILDAAREL 306 Query: 273 Q-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHID 330 ++ V +++DHGE GAH+L KG AMY+DI IP I++ P G ++ D ID Sbjct: 307 GVYDDAAVFFSADHGEFTGAHRLHDKGPAMYEDIYTIPGIVKLPGGVPGQRSDRLAHLID 366 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 L T++ +A + + G + + + H P R VT+ +K Sbjct: 367 LTATILDVAGRDPARAVDGVPVTPLVRGEETPWREDLVAEFHGHHFPH-PQRMLVTERWK 425 Query: 391 LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLR 450 LV+N + +ELYD DP+E+ N A VR+++ L + + D F + S+ Sbjct: 426 LVVNPESVNELYDLVRDPDELQNRYTHPETAAVRAELLGRLYRQLRERGDNFYHWMTSMY 485 Query: 451 PWRKD----ARPRWMGAFRP 466 P + + + GA RP Sbjct: 486 PVGEKDYDTSLSMFEGAHRP 505 >UniRef50_C6J5I8 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I8_9BACL Length = 522 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. 
Identities = 115/468 (24%), Positives = 187/468 (39%), Gaps = 30/468 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRP+ F+M D + +G + T ++D+L+ + + F +AYT P+C PARA + T Sbjct: 8 KRPHVFFLMCDELRADSLGYMGNSIVKTPHLDNLSKDAVIFENAYTNCPMCVPARASMMT 67 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-LDGHDYFGTGECPPEWDA 120 G +G N + + + + GY T GK H + G E + Sbjct: 68 GRNPISNGVLDNAMLMIDDEKPLPDLLRQNGYTTTLFGKLHVHRSAEEIGFEEFQSGYGD 127 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Y E+ +K G + + T R++ + + + + Sbjct: 128 PYTSFLGIKDPEMRKKSSYKKNEGDIPLVIHGESPTHPDQTPCSRLTEDYIRRISEIPGS 187 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP- 239 D+P +S +PH P+ Y E Y L A L +KP HR + + Sbjct: 188 DKPIFHHLSLHDPHTPYMPTKPYSEMYDPAQMPLPPNAGRSLDDKPITHRYFHKVRGFDK 247 Query: 240 -VGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISK 297 +D Y+ VDD+IG+VI L E +++ +I+TSDHG MMG H + K Sbjct: 248 LTEEDYRKSLASYYGLVTHVDDRIGKVIARLKELELYDDSLIIFTSDHGSMMGEHGFVEK 307 Query: 298 GAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 MY+ + RIPL+++ PQ + DT ID+LPT++ A I PE + G+++L V Sbjct: 308 WGHMYEPVVRIPLLVKLPQNVNGGMRLDTFAEIIDILPTILDAAGIAVPEEVQGKSLLPV 367 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF------------------- 396 + P +KL + Sbjct: 368 CRGESKEHRTEAHSQYFCGSLHREPALMIRDHQWKLTIYPEQESIHEKLYGDHYLKYSPF 427 Query: 397 -----TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELYD +DP E HNL D+ ++A + +M L + + Sbjct: 428 FDLPLVEGELYDLLSDPYEQHNLFDNPKYAAQKEEMLSKLESWKQSLG 475 >UniRef50_Q7UMT6 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Bacteria RepID=Q7UMT6_RHOBA Length = 524 Score = 374 bits (962), Expect = e-102, Method: Composition-based stats. 
Identities = 111/452 (24%), Positives = 190/452 (42%), Gaps = 27/452 (5%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF++ D + +G L T +ID++A +G AY + +C+P+RA + TG Sbjct: 43 PNILFILCDDHRFDCLGVAGHPFLETPHIDTMARDGAMLRRAYVTTSLCSPSRASILTGQ 102 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 YA+ N A N+ +DAGY T +IGKWH+ G P+ D+W Sbjct: 103 YAHNHRVVDNYHAVDPNLVFFPESLQDAGYQTAFIGKWHMGG------DIDDPQRGFDHW 156 Query: 124 --FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 F G ++ + + ++ ++D+L+ + Sbjct: 157 VSFRGQGTYWPDGHGTTREVPQTTYDGFNVNGKRVPQRGYITDELTEYSLDWLK-GRDPN 215 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADF--YYELGEKAQDDLANKPEHHRLWAQAM--- 236 +PF + VS+ H F + +Y + E+ D NKP R + Sbjct: 216 KPFFLYVSHKAVHADFVPADRHRGRYDNEALPIEIPTVEAMDAGNKPMWVRNQRNSRHGV 275 Query: 237 ---PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAH 292 + G ++ Y VDD +G++ L ++ + NT V+Y D+G G H Sbjct: 276 DFGYNLPGFSPEVYYRRYCESLLAVDDSVGQLREFLKQQELDQNTIVVYMGDNGFQFGDH 335 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEILPGE 350 LI K Y+ ++PL++ +P V D V +ID+ PT++ A+ P+ + G+ Sbjct: 336 GLIDK-RTAYEASAKVPLLVVAPGKIPAGVPFDGLVGNIDIAPTLLEAANASAPKNINGQ 394 Query: 351 NILAVK---EPRGVMVEFNRYEIEHDS-FGGFIPVRCWVTDDFKLVLN--LFTSDELYDR 404 ++ + + YE + + + + FK + L+ DELYD Sbjct: 395 SVWQALCSSDASSLNDRTLLYEYYWERNYPHTPTLHAVIGGRFKYIRCHGLWDRDELYDL 454 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 +DP EM NLIDD R+AD ++ L + Sbjct: 455 ESDPGEMQNLIDDSRYADRVESLNQRLWQLLK 486 >UniRef50_A6DG38 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG38_9BACT Length = 498 Score = 374 bits (962), Expect = e-102, Method: Composition-based stats. 
Identities = 108/475 (22%), Positives = 190/475 (40%), Gaps = 47/475 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + + +D A + CY + T +D LA G+RFN A + CTP+RA T Sbjct: 23 QRPNIILIFSDDHAKKALSCYGNTGIKTPALDRLADGGMRFNHALVTNSFCTPSRATALT 82 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGTGECPPE 117 G Y++++G N + + T + + AGY T GKWHL G DY+ + Sbjct: 83 GKYSHKNGVTRLNQSFDGSQQTFPKLLQKAGYETSLFGKWHLLSQPTGFDYYCVQKMQGM 142 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 F+ + + ++ + G ++ I+ A+++++ Sbjct: 143 PFNPRVFEPQHGWVPWSPQDRKSYMKGGRVIK----------GYNNDVITTEAINWIKNR 192 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH--------- 228 ++PF +++ PH P+T + D DD + H Sbjct: 193 ENKNKPFCLLLHPKPPHAPYTPATRDEDYLKDVTIPEPANLHDDYKGRTPHAIAGKMTAN 252 Query: 229 ----------HRLWAQAMPSPVGDDGL------YHHPLYFACNDFVDDQIGRVINALTPE 272 R + + + L + Y+ VDD +GRV++ L Sbjct: 253 RIILNPAFKSMRARIEKENPNISERELTSKMYQEYIKGYYRLVKSVDDNVGRVLDYLKES 312 Query: 273 QRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHI 329 E NT VIYTSD G +G H MY++ P +++ P + ++ SH+ Sbjct: 313 GLEKNTIVIYTSDQGFSLGEHG-FYNKQWMYEEPLHAPFLVKFPGTVKAGQVHNSMTSHV 371 Query: 330 DLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 D+ PT++ A + PE + G ++ + + V Y +D + TD + Sbjct: 372 DIAPTILDFAGVTIPEGMQGFSLKPILLGKKEKVRDASYYHFYDHGVRLPEMIGIRTDRY 431 Query: 390 KLVLNLFTS----DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 KL+ EL+D +ND EM+NL + + D+ + + L + K D Sbjct: 432 KLIFYPGMKGHYRWELFDLKNDSQEMNNLHYNPEYRDLAQDLKNQLRELTIKYDD 486 >UniRef50_C6VXD1 Sulfatase n=4 Tax=Bacteria RepID=C6VXD1_DYAFD Length = 474 Score = 374 bits (962), Expect = e-102, Method: Composition-based stats. 
Identities = 115/442 (26%), Positives = 196/442 (44%), Gaps = 18/442 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N LF+ D N +G Y + + NID LA G+RF+ AYT P+C+P+R+ L T Sbjct: 30 KKFNVLFIAVDDL-NNDLGTYGNTFVKSPNIDRLAKRGVRFDKAYTQFPLCSPSRSSLLT 88 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG--ECP 115 G + + + KN+ T+ + FK+ Y++ +GK +H GT + P Sbjct: 89 GQRPDMTKIYELQTHFRKNLPDIVTLPQLFKNNNYYSARVGKIFHYGVPSQIGTDGLDDP 148 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W G + E K ++ R GL S +A + I++ A+ + Sbjct: 149 ESWSYRVNPKGRDKTEEPLIKNLTPDR-GLGSALAWRATEGTDDEQTDGLIASEAIKIMT 207 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + +EPF + V + PH P+ P +Y + Y L ++ +DL + PE Sbjct: 208 E--KKNEPFFLAVGFFRPHTPYVAPQKYFDMYPVDKVPLPKEIPNDLDDVPEAALFTKPP 265 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 Y+A F+D Q+G++I+AL + ENT ++ SDHG +G H Sbjct: 266 HWGLDEAKRREALRAYYATITFMDAQVGKLIDALDKLKLAENTIIVLWSDHGYNVGQHGQ 325 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADIEKPEILPGENIL 353 K +++++ R+PLII P G + + V +D+ PT+ L ++ + L G+++ Sbjct: 326 WMK-QSLFENSARVPLIISVPGGTKGKASGRTVELVDIFPTLAELCGLDPKQNLQGKSLT 384 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--SDELYDRRNDPNEM 411 + + + + Y G I R T+ F+ ELYD + DP E Sbjct: 385 PLLKNPAAIWDKPAY---TQVRRGQIFGRSVRTERFRYTEWDGGNAGVELYDHQKDPGEF 441 Query: 412 HNLIDDIRFADVRSKMHDALLD 433 NL D F +++ L Sbjct: 442 TNLAKDNSFVITVNELALLLKK 463 >UniRef50_A4CMA4 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Flavobacteriales RepID=A4CMA4_9FLAO Length = 490 Score = 374 bits (962), Expect = e-102, Method: Composition-based stats. 
Identities = 120/488 (24%), Positives = 211/488 (43%), Gaps = 46/488 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+ N +F++TD + +G L T N+D LAAEG +A+ + +C+P+RA + Sbjct: 28 KQRNVIFILTDDHRFDYMGFTGKVPWLETPNMDRLAAEGAYLPNAFVTTSLCSPSRASIL 87 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG++++ N N++ +Y ++AGY T ++GKWH+ H P Sbjct: 88 TGMFSHTHTIVDNQAPNPGNLTYFPQYLQEAGYQTAFLGKWHMSSHT------DEPRPGF 141 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W ++ + +L NG + ++ ++ AVD+L+ Sbjct: 142 DHW---ESFFGQGVYYNPTLNING-------ERIEYKDSTYITDLLTEHAVDWLE-SRDK 190 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 D+PF + +S+ H F + +YA EL + + V Sbjct: 191 DKPFFLYLSHKAVHAEFQPARRHKGRYAGKKIELPPTYEQTKTGAWRDLKWPEWVADQRV 250 Query: 241 GDDGLYHH-----------PLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEM 288 G+ + Y VDD +G V+ L E + T VIY D+G Sbjct: 251 SWHGVDYMYHSNIDMQELVQAYCETLLGVDDSVGAVLEYLEEEGLDEETLVIYMGDNGFS 310 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEI 346 G H LI K Y++ ++PL++R P+ + V +ID+ PT++A A + +P+ Sbjct: 311 WGEHGLIDK-RHFYEESVKVPLLVRCPELFEGGQVPQDMVQNIDIGPTVLAEAGVAQPDD 369 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDS---FGGFIPVRCWVTDDFKLVLNL--FTSDEL 401 +PG + + + + ++ E+ F V TD +K + + +EL Sbjct: 370 MPGVSFIPILTGDKDATKRDKIFYEYYWENDFPMTPTVFGMRTDKYKYIRYHGIWDRNEL 429 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWM 461 YD NDP+EM+NLI D +V M D+L ++++ P + RPRW Sbjct: 430 YDLENDPHEMYNLIGDPEKQEVIQTMLDSLYNWLETT-------DGMKIPLKSTDRPRW- 481 Query: 462 GAFRPRPQ 469 G +R + + Sbjct: 482 GDYRHKGE 489 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. 
Identities = 109/470 (23%), Positives = 184/470 (39%), Gaps = 51/470 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNFLF+ D + C T NID L +G+ F ++Y PVC+P+RA TG Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTFYETPNIDRLCRQGMVFANSYASCPVCSPSRASCLTG 63 Query: 63 IYANQSGPWT-------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL 103 Y + G + T+ + KDAGY T ++GKWHL Sbjct: 64 KYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHLPEGEYTIAQALKDAGYDTWHVGKWHL 123 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 G +++ P + D G ++ L+ E Sbjct: 124 GGREFY-----PEHFGFDVNIGGCSWGHPHDGYFSPYGIETLS--------EGPEGEYLT 170 Query: 164 HRISNRAVDFLQQPAR--ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 RI++ AV L++ + +PF M + + H P E ++ ELG + Sbjct: 171 DRITDEAVRLLRKRQACGSRKPFYMNLCHYAVHTPIQVKDEDRARFEKKARELGLDKETA 230 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 L HH + V + P Y +D IGR++ AL + ENT V+ Sbjct: 231 LVEGEFHHTEDKKGRR--VVRRVIQSDPSYAGMIWNLDQNIGRLLEALRECGEEENTVVV 288 Query: 281 YTSDHGEMMGAHKLIS-------KGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDL 331 +TSD+G + + + +Y+ TR+PLI++ P + D PV+ D Sbjct: 289 FTSDNGGLATSEGSPTCNLPASEGKGWVYEGGTRVPLIVKYPGRVAPGSRCDVPVTTPDF 348 Query: 332 LPTMMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 PT + LA + + + G +I+ + + + H G P V D+ Sbjct: 349 YPTFLELAGVPQKAGIPIDGRSIVPLLSGNPMPERPIFWHYPHYGNQGGTPASSVVMGDY 408 Query: 390 KLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 K + ELYD + D +E +NL + + + +++ L + ++ Sbjct: 409 KYIEFFEDGRGELYDLKADFSETNNLCEKM--PETAARLRMLLHGWQREV 456 >UniRef50_B8FL44 Sulfatase n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FL44_DESAA Length = 468 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. 
Identities = 116/442 (26%), Positives = 197/442 (44%), Gaps = 32/442 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++TD + +GC + T N+D LA++G+ FN+++ S +C+P+RA T Sbjct: 50 KKPNVLFILTDDHRYDHMGCAGHPFIKTPNLDRLASQGVYFNNSFVTSSLCSPSRASFLT 109 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G YA+ G N T FK GY T ++GKWH+ GE P D Sbjct: 110 GQYAHTHGVQNNLTPWDNGNVTFLERFKQEGYDTAFLGKWHM-------PGELPKLRGVD 162 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + + L NG ++ + + +++RA++F+ + +D Sbjct: 163 EFVTFTVRGGQGQYWDCPLIVNGEDAKPNKR--------YITEELTDRAINFIDR--ESD 212 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVG 241 PF + +S+ HH + P + + Y+D L E+A + + A+ Sbjct: 213 NPFCLYLSHKAAHHDWKPPTDLKDLYSDEELPLAEEADTWVT-------MTNGAVFCGTT 265 Query: 242 DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAA 300 YH+ Y VD Q+GR++ L + +NT V+Y D+G G H+ I K Sbjct: 266 GTLQYHYRNYCRVVASVDRQVGRLLKFLEDKGLADNTIVVYAGDNGYFWGEHRKIDK-RW 324 Query: 301 MYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA-VK 356 Y++ RIP +IR+P R+ D +IDL PT+ LA IE + G+++ ++ Sbjct: 325 AYEESIRIPFMIRAPGVVPDPGRKADQMALNIDLAPTLFDLAGIEPHAGMEGQSLAPILR 384 Query: 357 EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-SDELYDRRNDPNEMHNLI 415 R E YE D ++ T + + + E YD + DP E N+ Sbjct: 385 NGRTPGREAWLYEYFKDYPYNVPAIQAIRTQNNIYIEYESSRKPEYYDLQADPKEKQNIY 444 Query: 416 DDIRFADVRSKMHDALLDYMDK 437 D + AD+ S+ + + + Sbjct: 445 DQLEAADI-SRYQQMIAAFAKE 465 >UniRef50_A6DNI8 Putative N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNI8_9BACT Length = 705 Score = 373 bits (957), Expect = e-101, Method: Composition-based stats. 
Identities = 120/488 (24%), Positives = 195/488 (39%), Gaps = 60/488 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN +F++TD Q + +G L T NID + EG+ F +++ +C PARAG TG Sbjct: 23 PNIIFILTDDQKYDAMGFMGHYPFLKTPNIDRIRNEGVHFKNSFVTLSMCAPARAGFLTG 82 Query: 63 IYANQSGPWTNNVAPGKNI---STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 Y +G TN N + + AGY T + GKWHLD P Sbjct: 83 TYPQVNGVCTNVEGREFNQNKTPSFPLLLQRAGYETGFFGKWHLD-------HSNKPRLG 135 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D W + + N LN + + +++ A+DF+ + Sbjct: 136 FDRWVSFSG--------QGKYNGNDLN----IDGKLVHNPGYITDELTDYALDFIDK--N 181 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL-------- 231 +D+PF + +S+ H PFT + Y E D+L +KP+ R+ Sbjct: 182 SDKPFCVYLSHKAVHQPFTPAKRHSSLYKGETVPKKESFFDNLKDKPKWQRVNLPPEKLY 241 Query: 232 ----------WAQAMPSPVGDDGLY--HHPLYFACNDFVDDQIGRVINALT-PEQRENTW 278 A P P + H Y VD+ IG++ L + +NT Sbjct: 242 RLRYNNTHETPAVKTPRPYTKENGSHPHTKDYLRAIAAVDEGIGKIYALLENKKILDNTV 301 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMM 336 +I+ D+G ++G H Y++ RIPLI+R P +D V +ID+ PT++ Sbjct: 302 IIFAGDNGYLLGEH-QRGDKRVHYNESMRIPLIMRYPAKIPADSTLDQMVLNIDVAPTIL 360 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLVLN 394 +A ++ PEI+ GE+ + + + + Y + + TD + Sbjct: 361 DIAGVKAPEIMQGESCMPLFDKSKKTPWRDAYLFTYWRDLIPTLPRIVAVRTDRYVYTTY 420 Query: 395 LFTSD--ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 D ELYD NDP+EM NL A++ M + + + + + P Sbjct: 421 PDIDDVNELYDLENDPHEMRNLATSPEHAEIVKAMEQKIEELKKETK------YKKIVP- 473 Query: 453 RKDARPRW 460 R P+W Sbjct: 474 RPRPEPQW 481 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. 
Identities = 107/479 (22%), Positives = 177/479 (36%), Gaps = 60/479 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPNF+F++ D VG T N+D LA EG+RF AY VC+P RA + TG Sbjct: 38 RPNFVFILADDLGWKDVGFNGSTFYETPNLDRLAREGMRFTDAYAACSVCSPTRASIMTG 97 Query: 63 IYANQSGPWT-----------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG 105 Y + T+ + ++ GY T +IGKWHL G Sbjct: 98 KYPARLHLTDWLPGRPDKPDQILKHPKIITELPAAEITLAKALQEGGYKTAFIGKWHLGG 157 Query: 106 HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 G P + D G S ++N L+ + E A R Sbjct: 158 L-----GHWPEQAGFDINIGGCG--MGHPSSYFSPYKNPT-----LKDGPVGE--YLADR 203 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 +++ AV F++ PFL+ +S+ H P +EKY +L + + Sbjct: 204 LTDEAVKFIENT--KGTPFLLYLSHYSVHTPLQAKKGLIEKYQKKVMQLPPTKGPEFVTE 261 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSD 284 + + P+Y A +D+ +GRV++ L + NT +I+TSD Sbjct: 262 GN------------TNARQVQNQPIYAAMMQSLDESVGRVLDKLKELGLDKNTVIIFTSD 309 Query: 285 HGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTM 335 +G + A L + Y+ R PL+++ P + D V D PT+ Sbjct: 310 NGGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPTL 369 Query: 336 MALADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV- 392 + +A + + L G + + + + H S G P D+KL+ Sbjct: 370 LEIAGLPARPEQHLDGISFTPALRGKEMGERPLFWHYPHYSNQGGAPSSSIRKGDWKLIE 429 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 EL++ R D E ++L R ++ L + ++ + P Sbjct: 430 WYEENRIELFNLRLDVGEKNDLASTSALK--REELKSELQAWRASVKADMPLPNPNFDP 486 >UniRef50_A0LYA0 Sulfatase n=8 Tax=Bacteria RepID=A0LYA0_GRAFK Length = 566 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. 
Identities = 111/512 (21%), Positives = 196/512 (38%), Gaps = 101/512 (19%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLN---TQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 KRPN +F+MTD A + Y T NID +A G +F + + + +C P+RA Sbjct: 41 KRPNIVFIMTDDHAAQAISAYGHPVSQKAPTPNIDRIANNGAKFLNNFCTNSICGPSRAV 100 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG +++ +G N + T+ +Y K AGY T +GKWHL G Sbjct: 101 ILTGKFSHINGFRMNGETFDGSQPTLPKYLKKAGYQTAIVGKWHLHGKP----------Q 150 Query: 119 DADYW----FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 DYW G Y E K + NG I++ +++L Sbjct: 151 GFDYWNILKDQGNYYNPEFIHKNDTSIVNG----------------YATDIITDMGIEYL 194 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE------------------ 216 ++ + DEPF ++V + PH + P+ ++ Y + L + Sbjct: 195 EKKRKKDEPFFLMVHHKAPHRNWMPPLRHINTYDSITFTLPDTYFSKHENQVAAQEQLQT 254 Query: 217 -------------------------KAQDDLANKPEHHRLWAQAMPSPVGD--------- 242 + D + R+ P D Sbjct: 255 IYEDMYEGHDLKMTISKGSDSLRHNPWKTDFNRMSKEQRIAWNDAYRPKNDAFHDANLTG 314 Query: 243 ------DGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLI 295 G + Y VD+ +G++++ L + ENT ++YT+D G +G + Sbjct: 315 KDLAEWKGQRYLRDYMGTVAAVDEGVGKILDYLEEQGLTENTIIVYTTDQGFYLGEKGMF 374 Query: 296 SKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 K MY++ +PL+I+ P+G ++ +D ++D PT + A E PE + G+++ Sbjct: 375 DK-RFMYEESLAMPLLIQYPKGIKKGTTIDALTQNLDFAPTFLDFAGAEIPESMQGKSLR 433 Query: 354 AVKEPRGVMVEF----NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF--TSDELYDRRND 407 + F + + +F T+ +KL+ + ELYD + D Sbjct: 434 PLLSGNNPDGNFRDAVYYHYYDFPAFHMVKRHYGVRTERYKLIHFYDDIDTWELYDLKED 493 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 P E NL + + +++ +H+ L K Sbjct: 494 PKEEINLYGSVEYEEIQKNLHEKLKSLQKKYE 525 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. 
Identities = 106/487 (21%), Positives = 191/487 (39%), Gaps = 52/487 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRP+ + + D N + CY + T ++D +A EG RF AY +PVC+P RA + Sbjct: 238 KRPHIILFLIDDLGWNDIACYGSQFYETPHLDKMAKEGFRFTDAYAANPVCSPTRASILL 297 Query: 62 GIYANQSGPWTNNVAPGK------------------NISTMGRYFKDAGYHTCYIGKWHL 103 G Y ++ G ++ + G T+ K+ GY T +IGKWHL Sbjct: 298 GKYPSRVGLSNHSGSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHL 357 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 H P + D G + + + S E Sbjct: 358 QAHHDTSRNHFPEKHGFDLNIAG-HRMGQPGSFYFPYKSKQHPSTNVPDMADGQEGDYLT 416 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 +++++A+ ++++ D PF + Y H P + +KY ELG Sbjct: 417 DKLTDKAIHYIKE--NKDTPFFLNFWYYTVHTPIIPRQDLKKKYEAKANELGINKNQPGI 474 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYT 282 + +Q PS Y A + +D+ IGR+ L Q ++ T +I+ Sbjct: 475 PVLKSFARSSQNNPS------------YAAMVEAMDENIGRIFKTLKELQIDDETIIIFC 522 Query: 283 SDHGEMMGAHK---------LISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLL 332 SD+G + + L + A +Y+ RIP II+ P + +++ PV D+ Sbjct: 523 SDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFIIKWPGKKGGKELQAPVCTTDIY 582 Query: 333 PTMMALADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGG---FIPVRCWVTD 387 PT++ + + + L G ++ ++ + ++ I + + P Sbjct: 583 PTLLDMLKLPAKPEQHLDGVSLTSLMNGQAKELQREALFIHYPHYHHINSMGPAGAVRMG 642 Query: 388 DFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 D+KLV T + ELY+ + D EM+NL+ + + ++M L + + P Sbjct: 643 DYKLVEYYETGEFELYNLKEDIGEMNNLVKEQ--PERAAQMLKKLEQWRQQSNSPKPERN 700 Query: 447 WSLRPWR 453 P + Sbjct: 701 PHYDPQK 707 >UniRef50_D2RQH7 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RQH7_9EURY Length = 498 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. 
Identities = 131/487 (26%), Positives = 200/487 (41%), Gaps = 44/487 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN LFV+TD + + P+ T +D L++EG+RF+ A T +CT ARA L T Sbjct: 4 SRPNVLFVLTDQERYDCTAPEG-PPVETPAMDRLSSEGMRFSRACTPISICTSARASLMT 62 Query: 62 GIYANQSGPWTNNVA-------PGKNISTMGRYFKDAGYHTCYIGKWH------------ 102 G++ + G N+ + T + GY Y GKWH Sbjct: 63 GLFPHGHGMLNNSHEADAIRPNLPPELPTFSELLAENGYDCSYTGKWHVGRDQTPEDFGF 122 Query: 103 --LDGHDYFGTG--ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 L G D E E+ + E R+ +D Sbjct: 123 AYLGGSDKHHDDIDEAFREYREERGVPPGEVDLEEVLYTGDDPRDASEGTFVAATTPVDV 182 Query: 159 TFTWAHRISNRAVDFLQQPARAD---------EPFLMVVSYDEPHHPFTCPVEYLEKYAD 209 T A+ ++ R +D ++ A D +PF + PHHP+ P Y Y Sbjct: 183 EETRAYFLAERTIDAIEAHADGDSGEGDGNGSDPFFHRADFYGPHHPYVVPEPYASMYDP 242 Query: 210 FYYELGEKAQDDLANKPEHHR--LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVIN 267 + E + KP+ H + + D Y+ +DDQ+ R++ Sbjct: 243 NEIDPPESYAETYDGKPQVHENFHYYRGADGLEWDHWAEATAKYWGFVSLIDDQLERILE 302 Query: 268 ALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDT 324 AL + T V++ SDHG+ +G H+ +KG MYDD RIPL +R P + Sbjct: 303 ALEEHGLADETAVVHASDHGDFVGNHRQFNKGPLMYDDTYRIPLQVRWPGVAEPGTTCEV 362 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENILAVKE----PRGVMVEFNR--YEIEHDSFGGF 378 PV DL T + + ++ PE +++ + E P V ++ + H G Sbjct: 363 PVHLHDLAATFLEMGGVDVPESFDSRSLVPLLETGDDPDAVPDDWPDSTFAQYHGDEFGL 422 Query: 379 IPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 R T +K V N DELYD + DP E+ NLID +ADVR +M D L+D+M + Sbjct: 423 YTQRMVRTGRYKYVYNGPDIDELYDLKADPAELQNLIDHPGYADVREEMRDRLVDWMQET 482 Query: 439 RDPFRSY 445 DP + + Sbjct: 483 DDPNQGW 489 >UniRef50_A4AP83 Putative sulfatase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AP83_9FLAO Length = 467 Score = 371 bits (954), Expect = e-101, Method: Composition-based stats. 
Identities = 115/462 (24%), Positives = 205/462 (44%), Gaps = 41/462 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN ++V+ D +G + T N+D LA+EGI F +A + SPVCTP R+ + T Sbjct: 22 KKPNIIYVLADQWRAEALGSNGNPNVITPNLDKLASEGISFTNAISTSPVCTPYRSMMLT 81 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y ++G + N+V+ + + G+ +K+ GY T YIGKWH+DG D Sbjct: 82 GRYPLKNGMFMNDVSLDPDSQSFGKLYKNEGYSTAYIGKWHVDGKGRSAFIPKERRQGFD 141 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 YW + + W N ++L + A + A+ F++ Sbjct: 142 YW---KVLECSHSYNNSNYWGND----DELHSWE----GYDAAAQTKDAIAFIEAQTENK 190 Query: 182 EPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 PF +++S+ PH P+ T P E+ + Y + +L +P + Sbjct: 191 SPFCLILSWGPPHAPYKTAPKEFQKLYENMDIQLRPN------------------VPVEL 232 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGA 299 ++ Y+A +D I ++ +A+ +NT ++TSDHG+++ +H K Sbjct: 233 AENTKAMLKGYYAHCSALDSYIKQLQDAIKRNNLEDNTIFVFTSDHGDLINSHTER-KKQ 291 Query: 300 AMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 +Y++ ++P II+ P + R+ D ++ +D+LPTM+ ++ I+ PE L GE+I V Sbjct: 292 RIYEESAKVPFIIKYPALLGKQGRKSDFLLNTLDILPTMLGMSSIKAPEGLDGEDISDVI 351 Query: 357 EPRGVMVEFNRYEIEHDSFGGFIPV------RCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 FG + R +T + +L +D DP + Sbjct: 352 LGEKEDNRKAALVACIQPFGQWKRTLGGKEFRGVITKRYTYAKDLSGEWLFFDNVEDPYQ 411 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 ++NL+ + F V + + L +D++ D F L W Sbjct: 412 LNNLVGNPSFKSVAENLEELLDKELDRLDDDFLPGASYLETW 453 >UniRef50_C6DK82 Sulfatase n=3 Tax=Pectobacterium RepID=C6DK82_PECCP Length = 564 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. 
Identities = 123/517 (23%), Positives = 212/517 (41%), Gaps = 35/517 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN ++++ D Q + G K + T ++D +A G F +A+ + +C+P+RA + T Sbjct: 39 QRPNIVYILLDDQRYDAFGFI-NKNIQTPHMDEIAKNGTWFKNAFVTTSLCSPSRASILT 97 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+Y + G NN ++ K+ GY T + GKWH G DY D Sbjct: 98 GMYVHNHGVSDNNPTDLSKLNYFPEKLKERGYQTGFFGKWHFGGADYTAKAGF---AGFD 154 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 W L + I+++ G + ++ + + +++ AV++L Sbjct: 155 RWVG---LLGQGDYYPINMF--GEQAKLNIDGKMVPQKGYITDELTDYAVNWLD-GIDKK 208 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD---DLANKP---EHHRLWAQA 235 +PF+M +S+ H F + + + L E D + KP ++ R Sbjct: 209 KPFMMYLSHKGVHSDFYPAIRHKGSMDKVTFPLPETYADTPENYEGKPMWVKNQRNSWHG 268 Query: 236 MPSPVGD--DGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAH 292 + P D Y+ VDD +GRV L + NT V+ D+G G H Sbjct: 269 VDYPYNKKMDMQQFQRDYYETLRSVDDSVGRVQEWLKKNGLDKNTIVMVMGDNGFTFGEH 328 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 LI K + Y+ R+PLI P G+ V+ V++ID+ PT + A EKP+ G + Sbjct: 329 GLIDK-RSAYETSMRVPLIASGPGFGKGDVVEDLVANIDIAPTFLEAAGAEKPKNYDGNS 387 Query: 352 ILAVKEPRGVMVE----FNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL--FTSDELYDRR 405 L +K + + F F T ++K + + +ELYD + Sbjct: 388 FLNIKSDKEKQAKRKDYFAYEYFWEYDFPYTPTTFAIRTPEYKYIQYYGIWDKEELYDMK 447 Query: 406 NDPNEMHNLID--DIRFADVRSKMHDALLDYMDKIRD----PFRSYQWSLRPWRKDARPR 459 NDP+E NLID D + + + + L + D P+ + +R + Sbjct: 448 NDPDEKQNLIDSKDKKLIETKIALRKQLYMELKDHDDRNVIPYNQRTKEGQVFRYQETGK 507 Query: 460 WMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEEKKQK 496 M F P RD +G+ + V + + +K Sbjct: 508 KMADF-PDEWLRG-DNPRDKYSGIIPETVNKDAEGEK 542 >UniRef50_B4D780 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D780_9BACT Length = 496 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. 
Identities = 108/466 (23%), Positives = 205/466 (43%), Gaps = 48/466 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN LF++ D N + C L T NID +A EG+RF + + + +C+P+RA + + Sbjct: 27 KRPNVLFILCDDIRWNAMSCAGHPALKTPNIDRIANEGVRFANMFCTTSLCSPSRASILS 86 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+YA+ G N + + ++GY T Y+GKWH+ G P D Sbjct: 87 GVYAHTHGVTNNFTEFPEKLVHWPMRLHESGYETAYMGKWHM------GEDNDAPRPGFD 140 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 ++ A + + + + NG S + +++ A+D+L++ Sbjct: 141 FF---ATHKGQGKYWDTAWNINGAGSKVI--------PGYYTTIVTDMALDWLKK-DHGG 188 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL---------- 231 +P+ + + + PH +T +Y + + E A L +KP + Sbjct: 189 KPWALCIGHKAPHSFYTPEEKYAHVFDNVRVPYPESAF-HLEDKPTWMKQRLYTWHGIYG 247 Query: 232 ----WAQAMPSPVGD---DGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTS 283 W + P + D Y+ VDD +GR++ L +Q +NT +++ Sbjct: 248 PLFEWRKKFPDDRPEAVKDFENMVHGYWGTILSVDDSVGRLLKYLEDTKQLDNTIIVFMG 307 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADIE 342 D+G + G H ++ K ++ RIP+++R P + +V+ +D+ P+++ L + Sbjct: 308 DNGLLEGEHGMVDK-RTAHEPSMRIPMLVRYPGLAKGKVEEGQALTLDVAPSLLELCGAK 366 Query: 343 KPEILPGENILAV-KEPRGVMVEFNRYEIEHDSFGGFIP-VRCWVTDDFKLVLNLFTS-- 398 + + G++ + + +E + YE ++ + P VR TD++K V Sbjct: 367 PLDNIQGKSWVKLVREGDPTWRKSWFYEYNYEKQFPYTPNVRAIRTDEWKYVHYPHGDGT 426 Query: 399 -----DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELY+ + DPNE HNL+ D + A ++ L++ M + Sbjct: 427 PDRYIGELYNEKTDPNEDHNLVKDPQQAGRIEELKKLLVEKMKETG 472 >UniRef50_Q7NMX5 Gll0640 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NMX5_GLOVI Length = 834 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. 
Identities = 109/438 (24%), Positives = 190/438 (43%), Gaps = 33/438 (7%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++TD QA N + L + LA++G+ F +A+ +C P+RA + TG Y Sbjct: 37 NVVLIVTDDQAWNTLAYM--PKLQS----QLASQGVTFTNAFAGQSLCCPSRATILTGRY 90 Query: 65 ANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 + G N+ G + ST+ + +++GY T GK+ + PP WD Sbjct: 91 PHNHGVLGNDAPFGGALAFYDASTLPVWLQESGYRTGLFGKYFNGY--SYSAFYTPPGWD 148 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 F A Y + N ++ED + E+ ++ +AV F+ A Sbjct: 149 EWQTFQLAGYYN--------YRINANGTIEDYGRS---ESNYSTDVLTQKAVAFITNSAA 197 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLANKPEHHRLWAQAMPS 238 +D+PF + ++ PH P+T + +YAD + D+ +KP + A P Sbjct: 198 SDKPFFLFLAPFAPHAPYTPAPRHAGRYADIPPWRPPNYNEQDVLDKPTWVQKLRPASPQ 257 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISK 297 D Y VDD + ++ AL QRENT VI+TSD+G G H+ K Sbjct: 258 TQTDYDKERQA-YLEMLLAVDDGVESILQALESTGQRENTLVIFTSDNGLTWGEHRWWEK 316 Query: 298 GAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 G + Y++ R+P+++ P RQ + V ++DL T+ A I P + G ++L + Sbjct: 317 GCS-YEESLRVPMVVSFPGVSTAARQEELLVLNMDLTATIAEAAGIPIPATVDGRSLLPI 375 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLI 415 + + V R + + + + +K + NL ELY+ +DP E+ N + Sbjct: 376 LKGQAVS---WREQFLFEGWQLTPTHAGVRSTAWKYMENLAGEQELYNLIDDPYELDNAV 432 Query: 416 DDIRFADVRSKMHDALLD 433 + +++ L Sbjct: 433 GVADYGAQVAELQATLAQ 450 >UniRef50_B5JJG3 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG3_9BACT Length = 499 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. 
Identities = 120/492 (24%), Positives = 194/492 (39%), Gaps = 55/492 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN LF+M+D A V Y + T NID LA EG+ F+ A+ + +C PARA TG Sbjct: 25 RPNILFIMSDDHANAAVSAYDDTLIQTPNIDRLANEGMLFSRAFCTNSICGPARAVTLTG 84 Query: 63 IYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 Y++ +G N + + T + + AGY T IGKWHL P +D Sbjct: 85 KYSHLNGFIVNESTSFDGGQQTYPKLLQAAGYETAVIGKWHLGS--------DPTGFDFW 136 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 G + + ++ I++ A+D+L + Sbjct: 137 KILIGQGQYYD--------------APFLTAEGQVETEGYVTDVITDLAIDWL-NTREDE 181 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK------------PEHH 229 +PF+++ + PH F +YL D E DD A + P Sbjct: 182 KPFMLMYQHKAPHANFQPGPDYLNWREDETIPEPETLFDDYATRSPAAWDNEMRIDPTLE 241 Query: 230 RLWAQAMPSPVGDDGLYHHP----------LYFACNDFVDDQIGRVINALTP-EQRENTW 278 + + V D H Y C VDD +GRV L + +NT Sbjct: 242 LQYQGELNLKVPDGLRGHERSRWLYQFYIKNYLRCVKSVDDGVGRVFEQLEAMGELDNTI 301 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMM 336 VIYTSD G +G H K MY++ +IPL++R P+ D V+++D TM+ Sbjct: 302 VIYTSDQGFFLGEHGYYDK-RFMYEESLQIPLLVRYPKMIEAGSVRDEIVTNLDFAETML 360 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFN---RYEIEHDSFGGFIPVRCWVTDDFKLVL 393 LA ++ P + GE+++ + + + + + E+ + T+ +KL+ Sbjct: 361 DLAGVKVPSGMQGESLVPLLKGKKRKGWRDAMYYHFYEYPGYHYVKRHYGIRTERYKLIR 420 Query: 394 NLFT--SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 + ELYD DP E++NL + + ++ L D + Sbjct: 421 FYHDIEAWELYDLDEDPQELNNLYGSDGYEKLTKRLKKRLDKIQSNFGDSPELADELVER 480 Query: 452 WRKDARPRWMGA 463 + + PRW Sbjct: 481 YPHGSMPRWGRY 492 >UniRef50_Q7UW58 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UW58_RHOBA Length = 541 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. 
Identities = 120/477 (25%), Positives = 213/477 (44%), Gaps = 24/477 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +R N LF+++D T +GCY + T NID LAA G+ F +A P+C P+R + Sbjct: 65 QRKNVLFLISDDLNTR-IGCYGDPIVQTPNIDRLAARGVLFENAACQYPLCGPSRNSMLC 123 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGK-WHLDGHDYFGT--GECP 115 G+Y + +G N +I ++ + F+ GY +GK +H + GT + P Sbjct: 124 GLYPDTTGIHGNAQIFRDSIPERWSLPQAFRLDGYFAGRVGKLYHYNVPKSVGTNGHDDP 183 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W+ + G + L E + +L + A+ + +++ A L+ Sbjct: 184 ASWELELNPAGCDRLIEEPD-IFTLRKGAFGGTLSWYASPRPDEAHTDGMLADDASWVLE 242 Query: 176 Q-PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 + R D PF + V + PH P+ P EY E Y L + ++D A+ P L + Sbjct: 243 RCAKRNDRPFFLAVGFYRPHTPYVAPKEYFEPYKLEDMPLFDNVEEDNADVPAAALLSKK 302 Query: 235 AMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAH 292 + D+ Y+A F+D Q+G+V++ L + NT V++TSDHG +G Sbjct: 303 KEQDLLNDELRRQAIQAYYASTTFMDAQVGKVLDTLKRTGLDKNTIVVFTSDHGYFLGEK 362 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGEN 351 L K A++D + +PLII P + +PV +DL PT+ L D+ +++ G++ Sbjct: 363 GLWQK-QALFDKVAGVPLIIAEPGRTEGAIAKSPVGLVDLYPTLAELCDVPTQKLMQGQS 421 Query: 352 ILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRC--WVTDDFKLVLNLFTSD--ELYDRRN 406 ++ +++P ++ + + T+ ++L L ELYD +N Sbjct: 422 LVPMLRDPSQTGRGYSMSMVARNDRQTKQRYYGYSIRTERYRLTLWDDGKRGTELYDHQN 481 Query: 407 DPNEMHNLI-----DDIRFADVRSKMHDALLDYM-DKIRDPFRSYQWSLRPWRKDAR 457 DP E NL +D A V ++ + L M + + + ++ + W R Sbjct: 482 DPEEFTNLAHGERKNDPNNAKVIRELTEKLKAEMANGMPASGKRTEYKVGNWNPMLR 538 >UniRef50_D2R1A1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1A1_9PLAN Length = 486 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. 
Identities = 110/453 (24%), Positives = 195/453 (43%), Gaps = 26/453 (5%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF++ D +GCY + T +ID LA+EG+RF A+ + C P+RA L +G Y Sbjct: 31 NVLFIIADDLTATALGCYGNQICQTPHIDRLASEGMRFTHAFCNATYCGPSRASLMSGYY 90 Query: 65 ANQSGPWTNNVAPGK--NISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG----ECPPE 117 + +G +T +F+++GY+ + K +H+ TG + Sbjct: 91 PHATGILGYTSPRPAIGQRATWSEHFRNSGYYAARVSKIYHMGVPGDIETGSNGADDAAS 150 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQA-----NHIDETFTWAHRISNRAVD 172 WD + +G + + T + + +G V D+ R + + + Sbjct: 151 WDERFNIEGPEWKAAGTGETLEGNPDGKKPVMGGNTFVVVEADGDDLVHSDGRAALKTAE 210 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADF-YYELGEKAQDDLANKPEHHRL 231 ++Q +PF + + PH PF P +Y E Y + L K DD + P Sbjct: 211 LIRQ--HTQKPFFIACGFVRPHVPFVAPRQYFEPYLPYDKLPLPTKVADDWKDIPLAGIN 268 Query: 232 WAQAMPSPVGD-DGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 + ++ + + Y+A ++D Q+G+V++AL ++T VI+TSDHG + Sbjct: 269 YKTSVNMKMDERRQKKAIGGYYAAVSYMDAQVGKVLDALEQSGAADHTIVIFTSDHGYHL 328 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 G H +K ++ D +++PLIIR P + + V IDL PT+ +L +E PE L G Sbjct: 329 GEHDFWAK-VSLLDQSSKVPLIIRVPGKKPAVCHSLVELIDLYPTIASLCGLEVPERLQG 387 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD---ELYDRRN 406 +NI + + V + + + G + + EL+D + Sbjct: 388 KNIATLWDDPHKQVRDTAFSVAPMTQGFL-----LRDHQWSFIQYGEEGAKGLELFDVKA 442 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 DP + NL + DV L +++ +R Sbjct: 443 DPQQHTNLAQSPEYEDVVRGFQSKLKEHLQTLR 475 >UniRef50_C0G116 Sulfatase n=1 Tax=Natrialba magadii ATCC 43099 RepID=C0G116_NATMA Length = 499 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. 
Identities = 131/482 (27%), Positives = 198/482 (41%), Gaps = 39/482 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYS--GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 RPN L V+TD + + + + T+ ID L+A G F A+T +C+ ARA L Sbjct: 7 RPNVLLVLTDQERYDCSALDGPVAETVETETIDHLSATGTHFERAFTPISICSSARASLL 66 Query: 61 TGIYANQSGPWTNNVA-------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 TG + + G N + T DAGYH Y GKWH+ Sbjct: 67 TGQFPHGHGMLNNCHEDDALQPNLPPGVPTFSEKLDDAGYHLTYTGKWHVGRDQTPEDFG 126 Query: 114 CPPEWDADYWFDGAN--YLSELTEKEISLWRNGLNSVEDLQANHIDE------------- 158 +D D + + E+ + L+ V N D+ Sbjct: 127 FSYLGGSDKHHDDIDDAFREYRAERGTPVGEADLDDVIYTGTNPRDDSNGTFVAATTSVE 186 Query: 159 -TFTWAHRISNRAVDFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 T A ++ R +D +++ A D PF + PHHP+ P Y Y +L Sbjct: 187 VEETRAWFLAERTIDAIEEHASRDRDAPFFHRADFYGPHHPYVVPEPYASMYDPENIDLP 246 Query: 216 EKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQ 273 E + A KP H + D ++ + Y+ +DDQ GR+++AL Sbjct: 247 ESYAETDAGKPRVHANYRSYRGVEQFDRDVWKEAIAKYWGFVTLIDDQFGRILDALESTG 306 Query: 274 R-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHID 330 + T V++ SDHG+ G H+ +KG MYDD IPL +R P + PV D Sbjct: 307 LTDETVVVHASDHGDFAGGHRQFNKGPLMYDDTYHIPLQVRWPGVTEPGSVREEPVHLHD 366 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNR-------YEIEHDSFGGFIPVRC 383 L T + + + PE +++ + + G E + H G R Sbjct: 367 LAATFLEMGGVAIPESFDSRSLVPLLDADGPEQESAPSAWPDSVFAQYHGDEFGLYTQRM 426 Query: 384 WVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 TD +K V N DELYD DP E+ NLID +ADVR ++ L+D+M++ DP R Sbjct: 427 VRTDRYKYVYNAPDVDELYDLEADPAELQNLIDHPDYADVRRELRTRLIDWMEETDDPNR 486 Query: 444 SY 445 + Sbjct: 487 QW 488 >UniRef50_A4U8Q3 Sulfatase n=2 Tax=Bacteria RepID=A4U8Q3_9BACT Length = 556 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. 
Identities = 116/463 (25%), Positives = 186/463 (40%), Gaps = 20/463 (4%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L VM D A ++ Y G T N++ LA EG+ F +AY P+C PAR L +G Sbjct: 58 PNILLVMMDQLAPQVLKPYGGTVCRTPNLERLAGEGVVFENAYCNYPICAPARFSLMSGR 117 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 ++ G + N + T Y + GYHTC GK H G D E D + Sbjct: 118 MPSRIGAFDNATEFPSEVPTFAHYLRAMGYHTCLSGKMHFVGADQLHGFE--DRVTTDVY 175 Query: 124 FDGANYLSELTEKEI--SLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA- 180 ++ S+ + W + + V D ++ + A +L A Sbjct: 176 PADFSWTSDWSLGPTFWEPWFHSVRIVRDAGPRRRSVNTSYDEEATVEACRWLHDHADRA 235 Query: 181 -DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 PF + S+ PH P+ P + + Y D + L + H R + Sbjct: 236 DGRPFFLAASFISPHDPYLAPPSHWDLYTDDGIDDPRVGDIPLEERDPHSRRLYYTIGRH 295 Query: 240 VGD----DGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 + D Y+A ++DD+IGR++ L + +NT V+ T+DHG+M+G L Sbjct: 296 IETIGPADVRRARRAYYAVMSWLDDRIGRILETLKAIDADDNTIVVLTADHGDMLGERGL 355 Query: 295 ISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALAD-IEKPE---ILPG 349 K ++ R+PLI+ +P R R+V VS +DL PT + A E PE + G Sbjct: 356 WLK-MNFFEWSVRVPLIVHAPTLYRARRVRENVSLLDLFPTFLEWAGDGELPELFAPIDG 414 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +I + + E+ G PV +K + L+D DP+ Sbjct: 415 ASIAGLAAGHSDGWP-DVVGSEYCGEGASSPVLMIRRGRWKYIHCEDDPPLLFDIEQDPD 473 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMD--KIRDPFRSYQWSLR 450 E+ NL V + + + + D ++D Q + Sbjct: 474 ELVNLAGTPEVGGVETDLAGEVCRWWDTAALKDRVIESQRRRK 516 >UniRef50_A6CBG2 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Planctomyces maris DSM 8797 RepID=A6CBG2_9PLAN Length = 633 Score = 366 bits (939), Expect = 2e-99, Method: Composition-based stats. 
Identities = 100/467 (21%), Positives = 183/467 (39%), Gaps = 56/467 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +P+ + V+ D + +GC + T +ID ++ EG RF +A+ +P+C+P RA L TG Sbjct: 191 QPDMVVVLVDDLRWDELGCMGHPFVRTPHIDRISREGARFRNAFCSTPLCSPVRACLLTG 250 Query: 63 IYANQSGPWT--NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 Y + G + N + T + + AGY T Y+GKWH+ D G Sbjct: 251 RYTHNHGIFDNINRSEHSHTLKTFPQELQKAGYATAYVGKWHMGNDDTARPG-------F 303 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W + + ++ I ++ + +F++ A+ Sbjct: 304 DHWVSMKGQGTS------------FDPTLNINGERIQFKGHTTDVLNQKVNEFVK--AQG 349 Query: 181 DEPFLMVVSYDEPHHP----------------FTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++PF + +++ H F + + Y+D D L Sbjct: 350 EKPFCLYIAHKALHPELTQRDDGSITDPSAAKFMPAKRHEKLYSDDAIPRRLNVVDTLEG 409 Query: 225 KPEHHRLWAQAMP----SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWV 279 K R P + D+ + +D+ +G + L + ++T Sbjct: 410 KRALKRTVPGLPPLSQKTGTSDEVIRDR---LRMLAGIDEGVGSLCELLESQGKLDDTVF 466 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMA 337 ++TSDHG G H L + Y++ R+PL++R P +D +DL PTM+ Sbjct: 467 VFTSDHGYWYGEHGLSVERRLPYEEGIRVPLLVRYPPVIKAGTVIDEFAVSVDLAPTMLD 526 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP-----VRCWVTDDFKLV 392 LA ++ + G +++ + + + +E++S F T +K + Sbjct: 527 LAHVKTDQKYDGRSLVPLLKGEHPADWRQSFLVEYNSDTVFPRLVKMGYTAVRTPRWKYI 586 Query: 393 L--NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 L +ELYD DP EM NLI+D + ++ L M Sbjct: 587 QFNELTGMNELYDMLRDPYEMQNLINDPAAKETVKQLQAELKQLMKD 633 >UniRef50_A6LF65 Choline-sulfatase n=26 Tax=Bacteroidales RepID=A6LF65_PARD8 Length = 520 Score = 365 bits (938), Expect = 2e-99, Method: Composition-based stats. 
Identities = 117/459 (25%), Positives = 189/459 (41%), Gaps = 23/459 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +P+ + +MTD Q + +GC K + + NID LA EG F S Y+ +P TP RAGL TG Sbjct: 45 KPHIILIMTDQQRGDALGCMGNKAVISPNIDRLAQEGSLFVSGYSSAPSSTPGRAGLLTG 104 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGECPP 116 + G K M + ++ GY+T IGK H L G E Sbjct: 105 MSPWHHGMLGYGRMALKYRYEMPQMMRNLGYYTFGIGKMHWFPQKALHGFHATLIDESGR 164 Query: 117 EWDADYWFDGANYLS-ELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 D+ D + + K+ L G N +DE A + ++ Sbjct: 165 VESPDFISDYREWFQLQAPGKDPDLTGIGWND-HAAGVYKLDERLHPTAWTGQTACELIR 223 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 D+P + VS+ PH P+ P YL+ Y D D E A Sbjct: 224 NYDN-DKPLFLKVSFARPHSPYDPPQRYLDMYKDADIPKP-HIGDWCGQYAEPKDPLQGA 281 Query: 236 MPSPVGDDGLYH----HPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG 290 +P G+ G + Y+A F+DDQ+G++I L + +N + +T+DHG+M+G Sbjct: 282 SDAPFGNFGDAYAINSRRHYYANITFIDDQVGQIIQTLKDKGMYDNALICFTADHGDMLG 341 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGE------RRQVDTPVSHIDLLPTMMALADIEKP 344 H K Y+ IP I++ P G ++ PV D LPT + +A P Sbjct: 342 DHYHWRK-TYPYEGSAHIPYIVKWPAGISKSIPDGSSIEQPVELRDFLPTFIDIAGGSVP 400 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN-LFTSDELYD 403 + G ++L + + + + K + N S++L+D Sbjct: 401 PDMDGRSLLKLIQGQQEQWRPYIDMEHATCYSDDNYWAALTDGKIKYIWNFHNGSEQLFD 460 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 R DP E HNL +D + + S++ +++++ + D F Sbjct: 461 LREDPGETHNLSEDAAYQNKLSELRKMMVEHLSERGDSF 499 >UniRef50_A0JVM4 Sulfatase n=2 Tax=Actinomycetales RepID=A0JVM4_ARTS2 Length = 479 Score = 365 bits (937), Expect = 2e-99, Method: Composition-based stats. 
Identities = 128/500 (25%), Positives = 204/500 (40%), Gaps = 62/500 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L +++D Q +GC + T ++D+LA+ G R ++ + SPVC+PARA L TG Sbjct: 7 PNILLILSDDQGAWALGCSGNTEIQTPHLDNLASGGTRLDNFFCVSPVCSPARASLMTGT 66 Query: 64 YANQSGPWT--NNVAPGKNIST-------MGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 ++ G + V G AGY+ GKWHL +D G Sbjct: 67 IPSKHGVHDYLHGVETGPEAPDYLQGQRLFTDDLAAAGYYMGLSGKWHLGANDRAREG-- 124 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 +WF A S +++RNG+ I+ + F+ Sbjct: 125 -----FSHWFSLAGGGSPYDA--ATMYRNGVKE---------TVYGYLTDAITADSTGFM 168 Query: 175 QQPARADEPFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 ++ A D PF + ++Y PH P+ P E+ Y D +E + Sbjct: 169 ERAAGQDSPFFLALNYTAPHKPWKDQHPAEFTALYDDCAFESCPQEPTHPWTPT------ 222 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 +P D YFA +D IG+V+ L RE+T VI++SD+G G Sbjct: 223 VDGVPIGGEADVRAALVGYFAAVSAMDAGIGQVLQKLDELGLREDTLVIFSSDNGFNCGQ 282 Query: 292 HKLISKGA-----AMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKP 344 H + KG ++D ++P I P + + +S DL T++ LA ++ Sbjct: 283 HGVWGKGNGTFPLNVFDSSIKVPAIFSFPGRIARGKVREELLSAYDLPATILELAGLDPL 342 Query: 345 --EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFTSDEL 401 E PG++ V + + R + D +G PVR +D +K V EL Sbjct: 343 EFEQGPGKSFADVLRGKPLAPARPRPVVVFDEYG---PVRMIRSDSWKYVHRYPQGPHEL 399 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD--------PFRSYQWSLRPWR 453 YD DP E HNL+ ++R + + M + + ++ ++ P +L P R Sbjct: 400 YDLATDPGERHNLVREVRHEERVAGMRRDMQLWFEQYQEEEADGRKFPVVGAGQTL-PVR 458 Query: 454 KDARPRWMGAFRPRPQDGYS 473 D +GAF P DG S Sbjct: 459 ADP----LGAFTPPSWDGIS 474 >UniRef50_UPI0001745B0B sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745B0B Length = 676 Score = 365 bits (937), Expect = 2e-99, Method: Composition-based stats. 
Identities = 119/485 (24%), Positives = 198/485 (40%), Gaps = 50/485 (10%) Query: 4 PNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN LF++ D + +G G P T N+D LA G+RF +A+ +C P+R + TG Sbjct: 37 PNVLFIIADDL-NDWIGWMGGHPQARTPNMDRLARMGMRFMNAHCSYALCNPSRTSMLTG 95 Query: 63 IYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 I SG N N P + T+ YF+ GY T GK F PE Sbjct: 96 IQPWNSGVAGNEQDWRNAEPLQGKPTLPEYFRQQGYTTAAGGK-------VFHASHGGPE 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVE-----------------DLQANHIDETF 160 W G + + ++ NG+ + D + Sbjct: 149 GRLTGWHGGRRGFEQDSAWDVRFPGNGVQIPDLPVHTGQNFNGLDIWHWDWGTVDVKPEA 208 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD 220 T ++ N A +LQ+ + PF + V PH P+ P +Y + L E +D Sbjct: 209 TDDGQVVNWAAQYLQR--KQPRPFFLTVGLYRPHAPWYVPRQYFAERPLSEVRLPEVKED 266 Query: 221 DLANKPEHHRLWAQ-AMPSPVGDDGLY--HHPLYFACNDFVDDQIGRVINALTPE-QREN 276 DLA+ P + + + + D L+ Y A F D +GRV++AL + N Sbjct: 267 DLADVPAAAKAYLNGGLHRKMLDRQLWGSAVRAYLASISFCDAMVGRVLDALESSPNKTN 326 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPT 334 T +++TSDHG +G + KG +++ +T +PL++ +P + Q VS +DL PT Sbjct: 327 TVIVFTSDHGLYLGEKQRWHKGG-LWERVTHVPLVVVAPGVTQPDTQSSQAVSLVDLYPT 385 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + L + KP+ L G +++ + + G TD ++ + Sbjct: 386 LCELTGLPKPQSLDGISLVPLLRDPNASRTTPAVTAMGE---GDKASYAVRTDRWRYIRY 442 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRK 454 S+ELYD ++DP+E NL A V+ + + + S ++ + Sbjct: 443 ANGSEELYDHQSDPHEWTNLAGRTNLAAVQKDLAAQIPQK-------WVSAFRTVDQMKV 495 Query: 455 DARPR 459 D+ P Sbjct: 496 DSSPD 500 >UniRef50_A7LY81 Putative uncharacterized protein n=5 Tax=Bacteroides RepID=A7LY81_BACOV Length = 517 Score = 365 bits (937), Expect = 2e-99, Method: Composition-based stats. 
Identities = 125/494 (25%), Positives = 219/494 (44%), Gaps = 52/494 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD Q ++ G T +D LA E + FN AYT P +PAR +FTG Sbjct: 25 KPNIVVIMTDQQRADLCGREGFPLEVTPFVDRLAQENVWFNKAYTVMPASSPARCSMFTG 84 Query: 63 IYANQSGPWTNNVAPG-KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 + + + TN+ P + K+ GY T +GK H ++ ++ Sbjct: 85 RFPSATHVRTNHNIPDISYQQDLVGVLKENGYKTALVGKNHAY------LKPADLDFWSE 138 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 Y G + + EKE + + N + L+ + I +I N A+ +++Q + + Sbjct: 139 YGHWGKHKKTTPAEKETARFLNQQARGQWLEPSPISLEEQHPTKIVNEALAWIKQ--QKE 196 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP-- 239 PF + VS+ EPH+P+ Y ++ + + ++ DLA K E +R+ AQ + Sbjct: 197 NPFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLKTSRKDLAKKGEKYRILAQLEDASCP 256 Query: 240 -VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISK 297 + D Y +DDQI R+I +L ENT + SDHG+ G + LI K Sbjct: 257 NLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASGQYENTLFVVLSDHGDYWGEYGLIRK 316 Query: 298 GAAMYDDITRIPLIIRS--PQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 GA + + + RIP++ + + +D+ VS DL PT + E P + G ++ + Sbjct: 317 GAGLSESLARIPMVWAGYHIKNQPAPMDSHVSIADLFPTFCSAIGAEIPAGVQGRSLWPM 376 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFI---------------------------------PVR 382 + E + FGG R Sbjct: 377 LTGKAYPKEEFSSMVVQQGFGGADVGLDASLTFEQEGALTPGKIAHFDELNTWTQSGTSR 436 Query: 383 CWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 DD+KLV+N + + ELY+ + DP+E+HNL + +++++++++ LL + +++DP Sbjct: 437 MIRKDDWKLVMNHYGNGELYNLKKDPSEVHNLFGEKKYSEIQTELLTRLLAWELRLQDPL 496 Query: 443 ----RSYQWSLRPW 452 R Y + P+ Sbjct: 497 PLPQRRYHFKQNPF 510 >UniRef50_A6DKS7 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKS7_9BACT Length = 515 Score = 365 bits (937), Expect = 2e-99, Method: Composition-based stats. 
Identities = 115/488 (23%), Positives = 189/488 (38%), Gaps = 70/488 (14%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF+ +D AT VG Y +T NID +A+EGIRF+ + +C P+RA + TG Sbjct: 22 PNILFIFSDDHATQAVGSYGSIINSTPNIDRIASEGIRFDRCLVTNAICGPSRATILTGK 81 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 Y++ +G + N++ T + + AGY T IGKWHL D++ Sbjct: 82 YSHLNGFYKNDMYFDGRQITFPKLLRQAGYQTAVIGKWHLASLPT----------GFDHF 131 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEP 183 Y + + RNG + I+ +++L+ ++P Sbjct: 132 EVITGYGGQGKYYHPVMNRNGEPTKHR---------GYTTEVITKLNMEWLKNQRDPNKP 182 Query: 184 FLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR------------- 230 F++++ + PH + +Y+ + D + D K H + Sbjct: 183 FMLMMQHKAPHRAWLPSPKYMNAFKDKKFPKPANLHTDYQGKASHVKKQDMMIKDSMNPG 242 Query: 231 ----------------LWAQAMPSPVG--------------DDGLYHHPLYFACNDFVDD 260 W +A + + Y C +DD Sbjct: 243 DLKLTPPKYLDGADLANWHKAYDEENAAFAKAKLSGKALRSWNYQRYIRDYVRCVQSIDD 302 Query: 261 QIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER 319 IG V+N L ENT +IY+SD G +G H K MY++ R PL++R P + Sbjct: 303 SIGEVLNYLDESGLAENTLLIYSSDQGFFLGEHGWFDK-RFMYEEALRTPLVMRWPGKIK 361 Query: 320 RQV--DTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGV--MVEFNRYEIEHDSF 375 S++D T + +A ++ P + G ++L + + + E Y Sbjct: 362 AGSVDSHITSNLDFAQTFLEVAGVKVPAEMQGASLLPIMKGQQPENWRESFYYHYYGYPD 421 Query: 376 GGFIPVRCWVTD-DFKLVL-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + C VTD +KL+ ELYDR+NDP E N D F + M L Sbjct: 422 WHLVQKHCGVTDGRYKLIHFYTTDEWELYDRKNDPEENINRASDPEFKSILQNMRKKLSQ 481 Query: 434 YMDKIRDP 441 +++ P Sbjct: 482 QRIQLKVP 489 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 365 bits (937), Expect = 3e-99, Method: Composition-based stats. 
Identities = 110/480 (22%), Positives = 192/480 (40%), Gaps = 86/480 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PNF+ + D +GCY + T ++D LA EGIRF + Y+ SPVC+P+RA L Sbjct: 14 MKQPNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLL 73 Query: 61 TGIYANQSGPW------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 TG Y ++G + + T+ + K AGY T GKWHL + T Sbjct: 74 TGKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEE--TSPN 131 Query: 115 PPEWDADYWFDGA--NYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 +D + F ++ S + + N L+ + + + + I+ R+VD Sbjct: 132 AHGFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVD 191 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 F+Q+ + PF + SY+ PH+P P +Y++++A Sbjct: 192 FIQRSREQEAPFFLFASYNAPHYPMHAPQKYMDRFAHLP--------------------- 230 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG----- 286 + + A VDD +G+++ AL E+T + ++SD+G Sbjct: 231 -------------WDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDTVIFFSSDNGPSSES 277 Query: 287 ----------EMMGAHK-LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLP 333 G+ A++++ R P I+ P G + D + +DL P Sbjct: 278 RNWLDGTEDVYYGGSAGIFRGHKASLFEGGIREPAILSWPNGWEGGQVRDEVAAMMDLAP 337 Query: 334 TMMALADIEKPEI------LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 T + LA ++ L G ++ + + R F + Sbjct: 338 TFLDLAGVDPAAGPLQGVALDGSSLKEMLQMREPSPHQQL-------FWEYQGQLAVREG 390 Query: 388 DFKLVLNLF--------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 D+KLVLN L D DP E NL D R+ ++ ++ + D+ ++++ Sbjct: 391 DWKLVLNGKLDFDRVVPDQIHLSDLSRDPGERSNLAD--RYPEIVERLSRDVRDWYEEVQ 448 >UniRef50_Q01ZJ7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01ZJ7_SOLUE Length = 516 Score = 365 bits (937), Expect = 3e-99, Method: Composition-based stats. 
Identities = 133/490 (27%), Positives = 208/490 (42%), Gaps = 30/490 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +MTD Q + T NID LA++G+ F +YT S VC PARA L +G Sbjct: 30 RPNILHIMTDQQQWATIA--GRSGCRTPNIDRLASQGMLFERSYTPSAVCCPARAMLLSG 87 Query: 63 IYANQSGPWTNNVAPGK-------NISTMGRYFKDAGYHTCYIGKWH---------LDGH 106 Y +G + +P ++ + ++AGY Y GKWH H Sbjct: 88 AYHWHNGVYNQVHSPPSVHRDMNADVVLYSQRLREAGYRLGYTGKWHASYLRTPLDFGFH 147 Query: 107 DYFGTGECPPE--WDADYWFDGANYLSEL--TEKEISLWRNGLNSVEDLQANHIDETFTW 162 + G C PE D D ++E T ++ + G E T Sbjct: 148 EIAGVAGCDPELLKKIDLNPDRVPRITEPLRTTQQRMMRWPGSEPFVMWGYREGPEESTP 207 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 +RI+ A +++ A+ ++P+ + V + EPH P+ +YL++Y + + D Sbjct: 208 EYRIAEMASRMMKRFAKGEQPWHLEVHFVEPHDPYMPLKQYLDRYDPRSIPVPKSFADTF 267 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIY 281 A KP HR ++ DD Y+A + +D QIGRV+ AL Q + T V + Sbjct: 268 AGKPGLHRRESETWGKVTEDDVRQSRAHYYAYAEQLDAQIGRVLKALDETGQADRTLVAF 327 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALA 339 T+DHG+M+GAH++ KG Y++ R+P+I+R P + V DL T +A A Sbjct: 328 TADHGDMVGAHRMWIKGWLPYEECYRVPMIVRWPGHVQAGSKSSKLVQTHDLGHTYLAAA 387 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 G ++ + + + R +TD FK V N F D Sbjct: 388 GARSLPFPDGASLAPLFADPRRKDWRDDILCAYYGGEYLYTQRIAITDRFKYVFNGFDYD 447 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR 459 E+YD DP+EM N++ D +A M + + M + DP + P Sbjct: 448 EMYDLERDPDEMRNVVADSEYARFTGDMQARMYELMARFHDP-----YGDSPEGTKGDRY 502 Query: 460 WMGAFRPRPQ 469 + PR + Sbjct: 503 CAARYLPRGK 512 >UniRef50_C5HLB2 Putative sulfatase n=1 Tax=uncultured bacterium FLS12 RepID=C5HLB2_9BACT Length = 503 Score = 364 bits (936), Expect = 3e-99, Method: Composition-based stats. 
Identities = 126/469 (26%), Positives = 204/469 (43%), Gaps = 43/469 (9%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + VM D + Y G L T + D LA+EG F+ A T SP+CTP+R +TG Sbjct: 9 PNLVMVMVDQLQAQRMKLYGGTDLLTPHFDRLASEGALFSQAITTSPLCTPSRISFWTGQ 68 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-LDGH---DYFGTGECPPEWD 119 Y + G N P ++ + K AGYHT IGK H G D F Sbjct: 69 YPSAVGGMNNGPLPLTDVPHLPGMLKAAGYHTALIGKNHCFRGEVVADLFDATWDAGHGG 128 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 A D + L+ ++ R + DL + T R + + +L++ + Sbjct: 129 AQGGKDDPDILAYERTAQLEFLRMCHGRIVDL-----PDHVTTTARATKNGLAWLEE--Q 181 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 D+PF + +SY EPH PF + + Y L E + D+++KP H + + M +P Sbjct: 182 GDDPFFLWLSYPEPHSPFVTTRNWADLYDPAKLTLPESWRSDISDKPAHFQELHELMGAP 241 Query: 240 V--GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLIS 296 D+ +Y+ +DD +G+V++ L + ++T V++ SDHGE +G++ ++ Sbjct: 242 AVSDDELRELTQIYYGMASQIDDGLGQVLDCLERKGLADDTIVVFVSDHGEYIGSNYMLQ 301 Query: 297 KGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 K A + + + R+PL IR P D PV H D++PT+ L + P+ + ++ Sbjct: 302 KSAHLPEALIRVPLAIRWPGHVPSGAVYDDPVEHHDMMPTLCTLMGFDVPDSVQAADLTP 361 Query: 355 VKEPRGVMVEFNRYEIEHDS--------------------------FGGFIPV-RCWVTD 387 + + + + EI H + FG + V R T Sbjct: 362 LFDGKPFARDAAYSEIGHHADREMTREKTYAPDLPWAEARAFYHFVFGHYAHVGRGIRTR 421 Query: 388 DFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 K V + ELYD NDP EM NL A++ + + L + D Sbjct: 422 THKYVAYEYGEKELYDLANDPEEMVNLAGKPAAAEIEADLAARLEAWSD 470 >UniRef50_Q7MBV5 Arylsulfatase A n=31 Tax=Bacteria RepID=Q7MBV5_VIBVY Length = 486 Score = 364 bits (936), Expect = 4e-99, Method: Composition-based stats. 
Identities = 131/485 (27%), Positives = 212/485 (43%), Gaps = 42/485 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 NF ++ D +M+GCY + T N+DS+AA G RF A+ S VCTP+R LFTG Sbjct: 2 NFALLLMDQTRADMLGCYGHPVVQTPNMDSIAAAGERFEQAFCASSVCTPSRTSLFTGKM 61 Query: 65 ANQSGPWTNNVA-------PGKNISTMGRYFKDAGYHTCYIGKWH--------------- 102 + G N+ P ++ + + + YIGKWH Sbjct: 62 PSHHGVMCNSDKEGDKCDVPLEDANLISEL---PNHQHIYIGKWHIGHQKLPQEYGFVGH 118 Query: 103 -LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWR---------NGLNSVEDLQ 152 DG+ Y G+G +G Y L EK +L + L E Sbjct: 119 NFDGYAYPGSGVYQNLAFDSVPLNGNRYQEWLQEKGFALPKVSDCTFGNNPNLKIQEFYG 178 Query: 153 ANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY 212 H + + + + A+ +++ + ++ F + +++ PH P P Y Y Sbjct: 179 LLHAPVEASIPYFLVDEAISHIEKCLQQNQSFTLWMNFWGPHTPCIIPEPYFSMYQPEQV 238 Query: 213 ELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALT 270 E L KPEH++ A+ D+ ++ + Y+ +DD IG++++ L Sbjct: 239 TFDESFYHPLIGKPEHYQNIAKMWGVWSLDEEIWRDIVCKYWGYITLIDDAIGQLLDFLK 298 Query: 271 PEQRENTWVI-YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVS 327 + + ++DHG+ MGAH++I KG M+D R+PLII+ P D V Sbjct: 299 QHDLYDGLFLSISADHGDAMGAHRMIEKGEFMFDQTYRVPLIIKDPNASQIGAHYDDLVY 358 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGF--IPVRCWV 385 DL T +A + PE G+++L + + R I G F P R W Sbjct: 359 LHDLTATYADIASSKVPESFDGQSLLPILRQQAGQSVPAREGILAQQNGHFTPYPQRMWR 418 Query: 386 TDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 T ++KLV N ELY R+DP EMHNLIDD + +++ M +A+ M + DP ++ Sbjct: 419 TKEYKLVFNASGRSELYHLRHDPQEMHNLIDDPNYGEIKQSMIEAMYAEMQRYHDPLCTW 478 Query: 446 QWSLR 450 + ++ Sbjct: 479 FYRMK 483 >UniRef50_C6D448 Sulfatase n=2 Tax=Bacteria RepID=C6D448_PAESJ Length = 511 Score = 364 bits (935), Expect = 4e-99, Method: Composition-based stats. 
Identities = 126/491 (25%), Positives = 202/491 (41%), Gaps = 66/491 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + +D Q N +G ++ + L+T N+D L G F AY +P CTP+RA + Sbjct: 1 MKKPNILLITSDQQHWNTLGYFNNE-LSTPNLDRLIKAGTTFTRAYCPNPTCTPSRASII 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGTGECPP 116 TG Y +Q G WT ++ +G F AGY T +GK H +Y P Sbjct: 60 TGQYPSQHGAWTLGTKLLEDRHFVGEDFNSAGYKTALVGKAHFQPLSSTEEYPSLEAYPV 119 Query: 117 EWDADYW-----------------------FDGANYLSELTEKEISLWRN------GLNS 147 D + W G +Y + EK WR+ G Sbjct: 120 LQDLEMWKQFNGPFYGFEHVELTRNHTNEAHVGQHYALWMEEKGCVNWRDYFLPPTGNMD 179 Query: 148 VEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 I E + + I+ R ++Q A D+PF + S+ +PH + P + Y Sbjct: 180 PAITYKWPIPEKYHYNTWIAERTNALMEQYAEEDKPFFLWSSFFDPHPEYLVPEPWDTMY 239 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMP--SPVGDDGLYHH----------------- 248 + + + P H L + P SP + G H Sbjct: 240 DPDSLTIPDIVPGEHDKNPPHFGLTQEDNPDFSPWAETGNGIHGYRSHHYYEYGEKKKLT 299 Query: 249 --------PLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGA 299 +Y+ +D IG +++ L +NT V++T+DHG G H L +KG Sbjct: 300 DYDKKKLVAVYYGMISMMDKYIGTILDKLEELGIADNTVVVFTTDHGHFFGQHGLQAKGG 359 Query: 300 AMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 Y+D+ R+P I+R P V D S +DL PT ++L+ I P + G + V Sbjct: 360 FHYEDLIRLPFIVRYPGQVPAGVTSDAIQSLVDLAPTFLSLSGIPVPHAITGVDQSEVWR 419 Query: 358 PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL-VLNLFTSDELYDRRNDPNEMHNLID 416 + E I + +V +K+ V T E++D ++DP+E++NL D Sbjct: 420 GTASAARDHAI-CEFRHEPTTIHQKTYVDQRYKITVYYNQTYGEIFDLQDDPSELNNLWD 478 Query: 417 DIRFADVRSKM 427 D +A ++S++ Sbjct: 479 DPAYAALKSEL 489 >UniRef50_A6DME6 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DME6_9BACT Length = 461 Score = 364 bits (934), Expect = 6e-99, Method: Composition-based stats. 
Identities = 107/448 (23%), Positives = 179/448 (39%), Gaps = 33/448 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D +G Y + + NID LA+ F +A+ VC P+RA L T Sbjct: 19 EKPNVLFIAVDDLKPE-LGAYGNTQVKSPNIDKLASRSSVFTNAHCQWAVCGPSRASLMT 77 Query: 62 GIYANQSGPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y +G ++ T+ ++FK++GY T GK + T + P W Sbjct: 78 GLYPESTGVMDLKTPMRSVNPDVLTLPQHFKNSGYFTAATGKIYDPRCVDGRTKDDAPSW 137 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 Y + ++G + + + N DE T + N +D L+Q Sbjct: 138 STPY---------KTLNYGKVKLKDGKHFAKAPELN--DEDLTDGQILLN-GLDLLEQAQ 185 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-----DLANKPEHHRLWA 233 D+PF + V + +PH PF P +Y + Y L D + Sbjct: 186 NQDKPFFVAVGFKKPHLPFVAPKKYWDLYDRERLTLPSFLDKAQGASDYGWHDSNELRSY 245 Query: 234 QAMPSPVG---DDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMM 289 +P + + Y AC ++D +GR+I L +NT ++ DHG + Sbjct: 246 DGIPKKGPIAIELQKEAYHGYLACVSYIDALVGRLIQDLEKRNLADNTIIVLWGDHGFHL 305 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 G H + K + + TR PLII P+ + ++ TP ID+ PT+ A +E PE++ G Sbjct: 306 GDHNMWGKHTNL-EQATRSPLIISLPKQKAQKSHTPAGLIDIFPTLCEAAGLEVPEVVQG 364 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD----ELYDRR 405 ++ V + + T ++ + + ELYD Sbjct: 365 TSLFPVINGEKDQHKNGAISFFKSK---GAKGYSYRTKRYRYIEWSKGNKVEAIELYDYE 421 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLD 433 NDP E NL ++ + AL + Sbjct: 422 NDPQEKINLATQQESKELIRTLSQALRE 449 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 363 bits (933), Expect = 7e-99, Method: Composition-based stats. 
Identities = 109/485 (22%), Positives = 185/485 (38%), Gaps = 50/485 (10%) Query: 3 RP-NFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +P NF+F++ D VGC + T +I+ LA G+RF + Y +PVC+P R + Sbjct: 33 KPLNFVFILVDDLGYMDVGCNNPQTFYETPHINQLAKTGMRFTNGYAANPVCSPTRYSIM 92 Query: 61 TGIYANQ-------SGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 TG Y + SG N + +T+ K+ GY T + GKWHL Sbjct: 93 TGKYPTRVDATNFFSGKRAGKFLPAPLNDKMPLSETTIAEALKEHGYSTFFAGKWHLGPT 152 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 F P + D G + + + L+ H+ R+ Sbjct: 153 QEF----WPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRLTDGLKGEHLP------DRL 202 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 ++ F+ A DEPF +++ H P P + KY + LG +++ A++ Sbjct: 203 ASETAQFID--AHRDEPFFAYLAFYSVHTPLMGPGPLVTKYKEKAKRLGLTGKEEFADEE 260 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 + + + L +H +Y A + +D +G+V+ L ENT V+ T+D+ Sbjct: 261 QVFPVDEKRR-----VRILQNHAVYAAMVESMDKAVGKVLQQLEESGVAENTVVMLTADN 315 Query: 286 GEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMM 336 G + + L +Y+ R +IR P G D PV D PT++ Sbjct: 316 GGLSTSEGSPTSNLPLRGGKGWLYEGGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTIL 375 Query: 337 ALADIE--KPEILPGENILAVKEPRGVMVEFN-RYEIEHDSFGGFIPVRCWVTDDFKLVL 393 LA + + L G ++ + + H S G IP D+KL+ Sbjct: 376 DLAGLPLKPQQHLDGVSLKPFLQGEAPFKRDALYWHYPHYSNQGGIPGGAIRVGDWKLIE 435 Query: 394 NLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL-RP 451 LY + D E +L + ++ + + M L + + F + P Sbjct: 436 RFEDGQVHLYHLKEDLGEKQDLAE--KYPERVAAMRKQLHKWYQETDAKFLQAKPGGPEP 493 Query: 452 WRKDA 456 WR Sbjct: 494 WRPGT 498 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 363 bits (932), Expect = 9e-99, Method: Composition-based stats. 
Identities = 104/455 (22%), Positives = 170/455 (37%), Gaps = 44/455 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN +F++ D VG K + T NID LA EG+ FN Y + VC P+R+ L T Sbjct: 28 QRPNIVFILADDLGYGDVGFNGQKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLT 87 Query: 62 GIYANQSGPWTNNVAPGK-------NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G + + N + +++T+ K +GY T GKW L G+ Sbjct: 88 GQHTGHTYIRGNKGVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLG---PVGSEGD 144 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P + D ++ LW N + + I I +A+ F+ Sbjct: 145 PNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGLIHNKEYAPDLIQKKALSFV 204 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 +PF + + Y PH P + L +Y +E D +Q Sbjct: 205 -NAQDGKQPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQ 263 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMM---- 289 P H + A +D +G+V+NAL + + NT VI+TSD+G + Sbjct: 264 DFP----------HATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGA 313 Query: 290 ------GAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADI 341 +Y+ R P R P + + D + D+LPT LA+ Sbjct: 314 DPRFFNSGAGFRGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELANA 373 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-----NLF 396 P + G + + + + + + E GG + ++K V N Sbjct: 374 PAPRNIDGISFTDALKGKAIQKKHDYLYWEFHEQGG---RQAVRQGNWKAVRLKAAGNPD 430 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 ELYD DP E +NL +F + ++ + Sbjct: 431 ALVELYDLSKDPQEKNNL--TPQFPEKAKELGQIM 463 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 363 bits (932), Expect = 1e-98, Method: Composition-based stats. 
Identities = 114/474 (24%), Positives = 179/474 (37%), Gaps = 51/474 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F+M D +GCY K + T +ID LAAEG++F AY S VC P+R+ L T Sbjct: 15 QKPNIIFIMADDLGYAELGCYGQKKIKTPHIDKLAAEGMKFTQAYAGSMVCQPSRSVLMT 74 Query: 62 GIYANQSGPWTN--NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G + + N N + +T+ K AGY T GKW L Y GT P + Sbjct: 75 GQHTGHTAVRANDLNQLLYEEDTTVAEVLKIAGYATGAFGKWGLG---YEGTPGRPGQQG 131 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D + + +W N + + + + I A F+Q+ Sbjct: 132 FDDFTGQLLQVHAHFYYPFWIW-NNEHRLMLPENENNQRGRYIHDLIHEDAKAFIQK--N 188 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 +PF + Y PH P E + Y + + L +P + Sbjct: 189 KAQPFFAYLPYIIPHVELVVPEESEKPYRGQF-----PKKQILDPRPGYIG--------- 234 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG-----------E 287 +DGL + +DD +G ++ L R+NT +I+TSD+G Sbjct: 235 -SEDGL---TTFAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDF 290 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPE 345 G L +MY+ R+P I P + D ++ D+LPT+ +A P Sbjct: 291 FNGNAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAGTTVPS 350 Query: 346 IL--PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-SDELY 402 + G + L +G E E+ G I R ++K V N ELY Sbjct: 351 GVDIDGISFLPTLLGKGKQPEHEYLYWEYTR--GKIRSRAIRQGNWKAVQNRMNQPIELY 408 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA 456 D D E NL + + + M + R + +L+P Sbjct: 409 DLGTDIGETKNLAK--QHPEKIKDLQ----QIMQQAHSEPRDFPQTLKPVGIKG 456 >UniRef50_C3WAQ9 Sulfatase n=1 Tax=Fusobacterium mortiferum ATCC 9817 RepID=C3WAQ9_FUSMR Length = 523 Score = 363 bits (931), Expect = 1e-98, Method: Composition-based stats. 
Identities = 132/501 (26%), Positives = 207/501 (41%), Gaps = 77/501 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L + +D Q N +G + K + T N+D L EG F+ AY +P CTP R + T Sbjct: 3 KRPNILLITSDQQHFNTIGAF-NKEIITPNLDRLVREGTTFDRAYCPNPTCTPTRGSIIT 61 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD---YFGTGECPP-- 116 G Y +Q G WT V + +T+G F+ AGY + IGK H F + E P Sbjct: 62 GKYPSQHGAWTLGVKLPETENTIGNEFRKAGYKSALIGKAHFQPLASTLEFPSLEAYPCL 121 Query: 117 --------EWDADYWFD--------------GANYLSELTEKEISLWR------------ 142 Y FD G +Y L EK WR Sbjct: 122 QDLEFWRSYKGIFYGFDHVELARNHTNEAHVGQHYALWLEEKGCKNWRDYYLESTGNMSE 181 Query: 143 --------------NGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVV 188 N LNS + I E + + I+ R L++ DE F + Sbjct: 182 KEYPRLEILVEQEGNILNSRRNWGKWEIPEKYHYNTWIAERTNKMLEEYKNNDESFFLWA 241 Query: 189 SYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVG------- 241 S+ +PH + P + Y + + P H + + P+ Sbjct: 242 SFFDPHPEYFVPEPWASMYDPEKLTINGLVPGEHLKNPPHFQKTQEENPNFDEYKETGFG 301 Query: 242 -----------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM 289 +D LY+ +D IG ++N L E+T V++T+DHG + Sbjct: 302 IHGMHSHLQKIEDIKKDLALYYGMVSMMDKYIGEILNKLDELGLAEDTIVVFTTDHGHFV 361 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 G H LI KG Y+D+ ++P I+R P E + + S +DL PT ++ D++ P + Sbjct: 362 GQHGLIRKGPFHYEDLIKVPFIVRYPNHVPENKVSSSIQSLVDLAPTFLSFCDLKIPYDM 421 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL-VLNLFTSDELYDRRN 406 G + V E + V + E+ I ++ +V +K+ V T ELYD + Sbjct: 422 TGIDQKKVWENPDLEVRDHAI-CENHHEPTTIHLKTYVDKRYKITVYYNKTYGELYDLQE 480 Query: 407 DPNEMHNLIDDIRFADVRSKM 427 DPNE++NL D+ + +++S++ Sbjct: 481 DPNEINNLWDNEDYKELKSEL 501 >UniRef50_A5FX90 Sulfatase n=4 Tax=Alphaproteobacteria RepID=A5FX90_ACICJ Length = 518 Score = 363 bits (931), Expect = 1e-98, Method: Composition-based stats. 
Identities = 119/444 (26%), Positives = 181/444 (40%), Gaps = 16/444 (3%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L VM D + Y + T NID+LAA G+ F++AY SP+C P+R +G Sbjct: 17 RPNILIVMADQLGARALPAYGNQVALTPNIDALAAGGVVFDNAYCNSPLCGPSRYVFMSG 76 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + G + N + + + + AGY T GK H G D E D Sbjct: 77 QLPSAIGAFDNAAEFPAMLPSFAHHMRAAGYRTILSGKMHFCGPDQMHGFE--ERLTTDI 134 Query: 123 WFDGANYLSELTEKEISL-WRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + + T+ W + ++SV + + + A L AR D Sbjct: 135 YPADFGWTPDWTDFATRPSWYHDMSSVREAGLCVRTNQMDYDDEVVFAARQKLFDLARDD 194 Query: 182 --EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA--QAMP 237 PF MVVS PH PF EY Y ++ + P RL Sbjct: 195 DGRPFCMVVSLTHPHDPFAMTEEYWNLYDHDAIDMPRVRTAPASMDPHSLRLRHVSNMDN 254 Query: 238 SPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLI 295 PV + + + Y+A FVD Q+GR+ + T + T+DHGE++G H L Sbjct: 255 EPVTEAQVRNARHAYYAAISFVDRQLGRLRETVEACGLAARTVTVMTADHGELLGEHGLW 314 Query: 296 SKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEIL--PGENI 352 K + ++D RIPLI+ +P +V VS +D+LPT++ L P L G ++ Sbjct: 315 YK-MSFFEDACRIPLIVHAPGRFAPARVGAAVSSVDMLPTLVGLGGGRIPAGLACDGTSL 373 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 L E RG + E+ + G P+ K + D+L+D DP+E Sbjct: 374 LGHLEGRGG---HDGAFGEYLAEGAIAPIVMIRRGRHKFIHCPADPDQLFDLEADPDERA 430 Query: 413 NLIDDIRFADVRSKMHDALLDYMD 436 NL A + + + D Sbjct: 431 NLAAAPEHAALVAAFRAEVAARWD 454 >UniRef50_D2QL61 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QL61_9SPHI Length = 489 Score = 363 bits (931), Expect = 1e-98, Method: Composition-based stats. 
Identities = 120/467 (25%), Positives = 210/467 (44%), Gaps = 43/467 (9%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN + VM D +G + + T N+D LA E + PVC+P+RA L Sbjct: 35 VRKPNIVIVMADQWRAQDLGYAGNREVITPNLDKLALESVNAPLCVAEVPVCSPSRASLL 94 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG---HDYFGTGECPPE 117 TG +A G + N+ T+ + GY T +IGKWH++G D+ P Sbjct: 95 TGQHATTHGVFYNDRPLRNEAVTLAEVCQQNGYKTGFIGKWHINGGLAKDFAAGRLAPIP 154 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D F+ L + S + N +N Q A ++ A+ F+ Q Sbjct: 155 VDRRQGFEYWRGLECTHDYNNSPYYNEVNKRFVWQQ-------YDAISQTDSAISFMTQS 207 Query: 178 ARADEPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 + EPFL+V+++ PH P+ P EY ++YAD L + Sbjct: 208 RK--EPFLLVLAWGPPHDPYQTAPKEYRQRYADKTLSLRPN------------------V 247 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALT-PEQRENTWVIYTSDHGEMMGAHKLI 295 P+ + Y+A + +DD IGR+ AL + ENT ++TSDHG+M+ +H I Sbjct: 248 PAKDTMEANRALKGYYAHINALDDCIGRLQAALKGAKLDENTIFVFTSDHGDMLYSHDQI 307 Query: 296 SKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 +K +D+ RIP +++ P G + R +D P++ D++PT+++L+ P + G+N+ Sbjct: 308 NKQKP-WDESIRIPFLLKYPAGLSRKGRTLDVPITLTDVMPTVLSLSGQTIPASVQGQNV 366 Query: 353 LAV-KEPRGVMVEFNR-----YEIEHDSFG-GFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 ++ ++PR + ++G G R T + V +L LYD + Sbjct: 367 ASLIRQPRAPRPDDAALIACIVPFHQWNYGRGGREYRGIRTARYTYVRDLKGPWLLYDNQ 426 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 DP ++ NL ++ + A + ++ L + D F++ + W Sbjct: 427 QDPYQLTNLANEPKLAGTQKQLEGILAQKLRAANDNFQAGNVYMDKW 473 >UniRef50_A6DM50 Choline sulfatase n=6 Tax=Bacteria RepID=A6DM50_9BACT Length = 647 Score = 362 bits (929), Expect = 2e-98, Method: Composition-based stats. 
Identities = 108/453 (23%), Positives = 194/453 (42%), Gaps = 30/453 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 K+PNF+F+ D Q+ +G Y + T N+D L GI F Y VC +RA Sbjct: 26 KKPNFMFIFADDQSYESIGAYGQLNIKTPNLDRLVKRGISFTHTYNMGAWGGAVCVASRA 85 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L +G + N+ K + AGY T GKWH+ G+ F + Sbjct: 86 MLNSGRFVNR------AEKGVKQYPHWSQIMNSAGYTTYMTGKWHVHGNPRFDVMKDVRG 139 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + L+ + + Q W +++ + F ++ Sbjct: 140 -----GMPNQTPARYKRTFKPELYESEWLPWDKRQQGFWRGGTHWTQVVADNTLTFFEKV 194 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----HHRLWA 233 ++PF M ++++ PH P P EY++ Y ++ E + E R Sbjct: 195 KNDNKPFFMYLAFNAPHDPRQAPKEYVDMYPLDSIKIPENYMPEYPYAAEICGKKLRDEV 254 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 A + Y+A ++D IGR+++AL ENT++I+T+DHG G H Sbjct: 255 LAPYPRTTYAVKRNRQEYYASITYMDHHIGRMLDALEASGKAENTYIIFTADHGLAAGHH 314 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L+ K +MY+ R P I+ P + ++DTP+ D + T + LA +EKP + ++ Sbjct: 315 GLMGK-QSMYEHSMRPPFIVVGPGIKQNSKIDTPIYLQDAMATAIELAGVEKPAHVEFKS 373 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF--TSDELYDRRNDPN 409 ++ + + V+++R ++ R + DD+KL+ L++ +NDP Sbjct: 374 LMPLIKGE-KTVQYDRIYGKY-----MNTQRMILKDDWKLIFYPHAAKKMRLFNIKNDPA 427 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 EM++LID+ +A ++ ++ ++ DP Sbjct: 428 EMNDLIDNPEYATKIQELKREFVELQKEMGDPL 460 >UniRef50_A6CFT9 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae RepID=A6CFT9_9PLAN Length = 489 Score = 362 bits (929), Expect = 2e-98, Method: Composition-based stats. 
Identities = 109/439 (24%), Positives = 186/439 (42%), Gaps = 25/439 (5%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN LF+ TD + + CY + T N+D LA G+ F AY +C P+RA L Sbjct: 30 VEKPNVLFIGTDDLRCD-LACYGHPLVKTPNLDKLATRGVLFKRAYCQQALCNPSRASLM 88 Query: 61 TGIYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 TG + W + NI T+ + FK GY T IGK + Sbjct: 89 TGRRPDTLEIWDLPTHFREADPNIVTLPQLFKQQGYFTQNIGKIFHNWRQKIQGDPASWS 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 A F + + L N L ++ + ++ + RI + AV LQ Sbjct: 149 VPAVMHFARHDDDQPMLNDNRELPVN-LAKAPRSESRDVPDSAYFDGRIGDLAVKALQDL 207 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----HHRLWA 233 + +PF + V + +PH PF P +Y + Y D + + Q N P+ R Sbjct: 208 KQKQQPFFLAVGFWKPHLPFNPPKKYWDLYDDSPITVPDNPQPP-KNVPDVALHDSREIL 266 Query: 234 QAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 +A+ + D + Y A ++D Q+G+V+ L RE T +++ SDHG +G Sbjct: 267 RAVKGKLTDAQIIELRTGYLAGISYLDAQLGKVLAELDRLGLREKTIIVFWSDHGFHLGE 326 Query: 292 HKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 H L K + +++ R+PL+I P + + D V +D+ PT++ L ++ P L G Sbjct: 327 HGLWCK-TSNFENDARVPLMISVPHMKTAGKTSDALVELLDMYPTLVELCGLDSPGKLEG 385 Query: 350 ENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRC---WVTDDFKLVLNLFTS------D 399 +++ V + V+ + + ++ P T ++ Sbjct: 386 TSLVPVLKDPTQSVKPAAFTQHPRPAYYRKQPENMGVSVRTPRYRYTEWRNFKTGKVIAR 445 Query: 400 ELYDRRNDPNEMHNLIDDI 418 ELYD +DP E N+I++ Sbjct: 446 ELYDHTSDPEENTNIINEP 464 >UniRef50_A3P379 Choline-sulfatase n=63 Tax=cellular organisms RepID=A3P379_BURP0 Length = 517 Score = 361 bits (928), Expect = 3e-98, Method: Composition-based stats. 
Identities = 122/466 (26%), Positives = 191/466 (40%), Gaps = 24/466 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D + Y + T ID LAAEG+ F++AY SP+C P+R L G Sbjct: 11 QPNILVLMADQLTPFALRAYGHRATRTPTIDRLAAEGVVFDAAYCASPLCAPSRFALMAG 70 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + G + N T Y + AGY T GK H G D E D Sbjct: 71 KLPSALGAYDNAAELPAQTLTFAHYLRAAGYRTMLSGKMHFCGPDQLHGFE--ERLTTDI 128 Query: 123 WFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL------Q 175 + ++ + T E W + ++SV D + + A + + Sbjct: 129 YPADFGWVPDWTRPAERPSWYHNMSSVLDAGPCVRTNQLDFDDDATFAARQKIFDVARER 188 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA-- 233 R PF MVVS PH P+ EY + Y D +L D A+ P RL A Sbjct: 189 AAGRDTRPFCMVVSLTHPHDPYAITREYWDLYRDEDIDLPAVQMDFDASDPHSRRLRAVC 248 Query: 234 -QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 P Y+ +VD Q G ++ L ++T VI T+DHG+M+G Sbjct: 249 EVDRTPPEDLQIRRARRAYYGATSYVDAQFGALLATLEQCGLADDTIVIVTADHGDMLGE 308 Query: 292 HKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALA----DIEKPEI 346 L K ++ R+PLI+ +P + +V VSH+DLLPT++ LA + P+ Sbjct: 309 RGLWYK-MTFFEGACRVPLIVHAPRRFPAARVPAAVSHVDLLPTLVELATGERRADWPDA 367 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 + G +++ G + E+ + G P+ K + + D+L+D RN Sbjct: 368 VDGRSLVPHLRGEGG---HDEAFGEYLAEGAIAPIVMMRRGSHKYIHSPADPDQLFDLRN 424 Query: 407 DPNEMHNLIDDIRFADVRSKMH-DALLDY-MDKIRDPFRSYQWSLR 450 DP E+ NL + A + + + + +D + + Q R Sbjct: 425 DPRELDNLANTPAAAKHVAAFRMERVARWDLDALHQQVLASQRRRR 470 >UniRef50_Q127E2 Sulfatase n=1 Tax=Polaromonas sp. JS666 RepID=Q127E2_POLSJ Length = 511 Score = 361 bits (926), Expect = 5e-98, Method: Composition-based stats. 
Identities = 126/497 (25%), Positives = 200/497 (40%), Gaps = 64/497 (12%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN L + TD + +G + + T +ID +A G F S T + VC P+RA + Sbjct: 1 MKRPNILLITTDQHRGDCLGFAG-RKVKTPHIDEMARTGTHFTSCITPNIVCQPSRASIL 59 Query: 61 TGIYANQSGPWTNNVAPGK--NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP--- 115 TG+ G N + + + +GY T +IGK H H F P Sbjct: 60 TGLLPLTHGVCDNGIDLDEARGEAGFAGTLASSGYSTGFIGKAHFSTHHTFAKTGRPECQ 119 Query: 116 -------PEW---------------DADYW-----FDGANYLSELTEKEISLWRNGLNSV 148 P W +YW G ++ + RN L Sbjct: 120 FSEADYGPAWYGPYMGFEHVELAVEGHNYWLPTPLPGGLHHSRWYYGDGLGEMRNRLYQQ 179 Query: 149 EDLQANHIDETF--------TWAHRISNRAVDFLQQPA-RADEPFLMVVSYDEPHHPFTC 199 + + +TF + I +R ++F+++ A A + F + S+ +PHHPF C Sbjct: 180 DMGPPSGAPQTFNSALPSAWHNSTWIGDRTIEFMRKHAGEAAKRFCLWASFPDPHHPFDC 239 Query: 200 PVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ-----------------AMPSPVGD 242 P + + +L D +P H+ MP+P Sbjct: 240 PEPWSRLHHPDEVDLPAHRTTDFERRPWWHKASMDSKPVGDAAVQALRQNFSRMPTPAEQ 299 Query: 243 DGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAM 301 Y+ VD Q+GR+ AL + NT VI+TSDHGE +G H L+ KG Sbjct: 300 QLRNITANYYGMISLVDHQVGRIQTALQQLGLDGNTLVIFTSDHGEWLGDHGLMLKGPIP 359 Query: 302 YDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360 Y+ + R+ +++ PQ + QV PVS +DL T A L G+++ + E Sbjct: 360 YEGVLRVGMVVNGPQVQAGQVRHEPVSTLDLAATFADYATATALAPLHGQSLRPLLEGGQ 419 Query: 361 VMVEF--NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLIDD 417 +F + + + G + +R T+++KL L + E+Y DPNEM NL DD Sbjct: 420 QTRDFALSEWNVAASRCGLELQLRTVRTENWKLTLEQNSGAGEMYCLSEDPNEMDNLFDD 479 Query: 418 IRFADVRSKMHDALLDY 434 + R ++ D + Sbjct: 480 PGYTAKRKELSDMIASR 496 >UniRef50_D0DCV9 Choline-sulfatase n=2 Tax=Citreicella sp. SE45 RepID=D0DCV9_9RHOB Length = 474 Score = 360 bits (925), Expect = 6e-98, Method: Composition-based stats. 
Identities = 112/467 (23%), Positives = 198/467 (42%), Gaps = 41/467 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN +F++TD Q + +G ++T N+D L EG F Y SP C+P+RA LF+ Sbjct: 3 KHPNIVFIITDQQRIDTIGALGCPWMDTPNLDRLVNEGTAFEQMYVTSPSCSPSRASLFS 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGECP 115 G Y + +G + N+ + + + K +GY T +GK H G+D E Sbjct: 63 GTYPHTNGVFRND---ERWVYSWVGLLKQSGYRTVNVGKMHTWPVEGAFGYDERHVTENK 119 Query: 116 PE-----------WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 WD +W G + +T++E+ + L E + Sbjct: 120 DRAHPNLPFYLDNWDKAFWARGVEKPTRVTQREMPDYAERLGC----YVWDAPEDLHADN 175 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + A +L + + DEPF + + PH P+ EYL KY +L E + D Sbjct: 176 FVPEMACMWLDRY-KGDEPFFLQIGIPGPHPPYDPTAEYLAKYEGRD-DLPEPIRYDFDT 233 Query: 225 KPEHHRLWAQA-----------MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 +P R + +P P + Y+A +D Q+G ++ AL Sbjct: 234 QPGPLRELRRQHLDNDHDAVVHLPDPTAEQMRLQRAHYYANVSMIDTQVGNILAALERRG 293 Query: 274 -RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLL 332 ++T +++TSDHG+ + H K M++ R+P I+ + D V+ D Sbjct: 294 VLDDTIIVFTSDHGDCLNDHGHSQKW-NMFEATVRVPAIVWGRGIPAMRRDELVALFDWG 352 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF-GGFIPVRCWVTDDFKL 391 PT++ A + P + +++ + + + E +D+ G + D+KL Sbjct: 353 PTILEWAGVTPPAWMEAQSLNPLMAGEEQLRDRVFAEHANDAILTGTSYMTMIRRGDWKL 412 Query: 392 VLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 V + +S+ +L+D +DP E NL DD A+ + + +L + + Sbjct: 413 VHFVDSSEGQLFDLASDPGERSNLWDDPAQAERKLSLIHDILRWRIE 459 >UniRef50_B4D026 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D026_9BACT Length = 489 Score = 360 bits (925), Expect = 6e-98, Method: Composition-based stats. 
Identities = 115/466 (24%), Positives = 185/466 (39%), Gaps = 52/466 (11%) Query: 3 RP-NFLFVMTDTQATNMVGCYS--GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +P N LF+++D + + K L T N+D LA EG +A+ + +C+P+RA + Sbjct: 26 KPRNILFILSDDHRWDFMSFMPEAPKFLETPNLDRLAKEGAHLRNAFCSTSLCSPSRASI 85 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG Y + G N I Y + AGY T ++GKWH+ G P Sbjct: 86 LTGQYMHHHGVVDNQRPEPAAIRYFPEYLRAAGYETAFLGKWHM------GEDSDNPRKG 139 Query: 120 ADYW--FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 DYW F G + + T ++ H + +++ A+D+L+ Sbjct: 140 FDYWAGFRGQGHYFDDTY--------------NINGEHKKIDGYSSDVLTDLALDWLKH- 184 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 R D+PF + Y PH+PF +Y E + N R + Sbjct: 185 -RGDKPFFCELCYKAPHYPFEPAPRNKGRYEKAPIPYPETMANTEENYLTQPRWVRERRF 243 Query: 238 SPVGDDGLYHHP--------------LYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 G D + Y +D+ IGR++ L R++T V+Y Sbjct: 244 GIHGVDHMETGRFDHDPVPSFEDLYHRYSETVFSMDENIGRLLKYLDNTGLRDSTIVVYM 303 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALAD 340 +D+G +G H K ++ R+P+++R+P V V +ID+ PT++ A Sbjct: 304 ADNGFELGEHGFYDK-RDAFETSMRVPMLLRAPGAVKPGTVVTKMVQNIDIAPTLLEAAG 362 Query: 341 IEKPEI---LPGENILAVKEPRGVMVEFN--RYEIEHDSFGGFIPVRCWVTDDFKLVL-- 393 + P + G + + + R V + +F TD +K V Sbjct: 363 VTVPADAPKMDGYSFWPLVQGRDVPWRDHILYEYYWERNFPATPTTFAIRTDRWKYVYTH 422 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 L+ D LYD DP E HNLID F + K+ L D +DK Sbjct: 423 GLWDRDGLYDLETDPVERHNLIDVPAFREQGGKLRGQLFDELDKSG 468 >UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XI1_PSEA6 Length = 510 Score = 359 bits (923), Expect = 1e-97, Method: Composition-based stats. 
Identities = 104/493 (21%), Positives = 200/493 (40%), Gaps = 57/493 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 + +PN L ++ D + + Y+ +T NID LA++ + F + Y +PVC+P+R L Sbjct: 36 VTKPNVLLILVDDLGYSDIKAYNENSFYDTPNIDKLASQSVMFTNGYAANPVCSPSRFAL 95 Query: 60 FTGIYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKW 101 TG + + N A + T+ FK GY+T ++GKW Sbjct: 96 LTGKHPTRGKATDWFPANDKPARAGRFLPAEFNDALPLSEITLAEAFKQNGYNTAFLGKW 155 Query: 102 HLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 HL + P D G + ++ + + Sbjct: 156 HLGKTEDL----WPENQGFDVNIAGTKNGHPAAGY--------FSPYKNARLTDGPKGEY 203 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 R++N A+ + + ++ PF M++S+ H P P + +++Y ++ + A +D Sbjct: 204 LTQRLTNEAISLVDKYSKQTVPFFMMLSFYTVHTPLAAPNKDVQEYQ---AKIRQYAHND 260 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 + E A+ V +HP Y A +D Q+GR++ L E+T V+ Sbjct: 261 EFQREEQVWPTAEKREVRV----KQNHPTYAAMVKQMDTQVGRLLAKLKQAGMEESTLVV 316 Query: 281 YTSDHGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDL 331 +TSD+G + A L +Y+ R+PL+++ PQ + + Q++ PV+ DL Sbjct: 317 FTSDNGGLSSAEGSPTSNLPLRGGKGWLYEGGIRVPLLVKLPQKKHKHLQINEPVTSTDL 376 Query: 332 LPTMMALADIE--KPEILPGENI----LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 PT+++ ++ + L G ++ + +M + H S G P Sbjct: 377 YPTLLSAGHLDLLPQQHLDGVDLNQYFSPGAKRDALMRRPLYFHYPHYSNQGGFPGAAIR 436 Query: 386 TDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 ++KL+ LY+ ND E +L + + + + L ++ + F Sbjct: 437 QGNWKLIERFEDGKVHLYNLANDIGEQIDLANQA--PERVASLRKKLHEWYQQTSARFLK 494 Query: 445 YQWSLRPWRKDAR 457 + + PW+ D + Sbjct: 495 AKGNKTPWQPDFK 507 >UniRef50_C9L4R7 Putative sulfatase YidJ n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4R7_RUMHA Length = 458 Score = 359 bits (923), Expect = 1e-97, Method: Composition-based stats. 
Identities = 116/452 (25%), Positives = 192/452 (42%), Gaps = 28/452 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D +++ CY GK + T NID LA G+ + YT S VCTP+R FTG Y Sbjct: 3 NVLIIHVDQLRRDVLSCYGGKEVQTPNIDFLAENGVLLENFYTPSAVCTPSRGCFFTGNY 62 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF 124 +++G + N + +++ F AGYHT Y+GKWHL H G F Sbjct: 63 PHENGAYRNGIPVKRDVHGFAEVFAKAGYHTGYLGKWHLADHKERGDDLGEYN---PLGF 119 Query: 125 DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPF 184 + +Y E + + NG + N + +++ + FL ++ +PF Sbjct: 120 EDWDYKVEFGHCKSVAYENGKVRPKREVGN---DKSYTTDWLTDETIRFLNNQLKSTQPF 176 Query: 185 LMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDG 244 L VS +PH PF Y + E+ E + W + P G Sbjct: 177 LFTVSIPDPHQPFEVRPPYDTMFDPLKVEIPESFWEKEIPDWAERDTWGRLHYYPYGLFE 236 Query: 245 LYHH-----PLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKG 298 H Y +DD +GR+I L ENT V++T+DHGE MG H L+ K Sbjct: 237 REGHLRRLKAQYLGAVKCIDDNVGRIIQCLKDTGLWENTMVVFTTDHGEYMGEHGLMEKN 296 Query: 299 AAMYDDITRIPLIIR--SPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 +Y+ + IP +I + + R+ +T ++ +D PT+ + I P + G+++ Sbjct: 297 -NLYESVYHIPCVISMPWKKIQERRCNTWINVVDFAPTLAGMLGIPYPFKVQGKDLSTYL 355 Query: 357 EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK--LVLNLFTSDE-----LYDRRNDPN 409 + +P +T +F+ V + +E L+D R DP Sbjct: 356 LENRETEQILYIH------PSDVPRAGILTPEFELAYVGKGWCEEEFHDHILFDMRKDPL 409 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +M N+ +A V+ + + L + ++I P Sbjct: 410 QMTNVFGKPEYAKVQKMLTEKLKRHFEEIGTP 441 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 359 bits (923), Expect = 1e-97, Method: Composition-based stats. 
Identities = 112/459 (24%), Positives = 179/459 (38%), Gaps = 44/459 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L + D +G T ID+LA G RF Y+ PVC+P RA L TG Sbjct: 28 KPNVLLIFIDDLGKTDIGIEGSSFYETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTG 87 Query: 63 IYANQSGPWT-----NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + G ++VA ++ T+G+ F++AGYHT Y+GKWHL + Sbjct: 88 KMPQRLGITDWIRPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLGHKPQQHPAARGFD 147 Query: 118 WDADYWFDG--ANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W G ++Y + N + E Q +++ A++ LQ Sbjct: 148 WTKGVNHGGQPSSYYFPYKNPQKPDAPNNVPDFEKCQPED-----YLTDVLTSSAIEHLQ 202 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 Q R PF + +++ H P P +EKY Q LA + Sbjct: 203 QRDRT-RPFFLCLAHYAVHTPIQPPKNLVEKY-----------QVKLATQKNPKSPGEGI 250 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK- 293 HP Y A + +D Q+GR+++ L + + T V++TSD+G + + Sbjct: 251 QEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQGILDQTIVVFTSDNGGLCTLNGK 310 Query: 294 ---------LISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIE- 342 L + Y+ RIP I P QV D P D+ PT+++L I Sbjct: 311 SPGPTCNLPLRAGKGWTYEGGIRIPTYISWPGKISPQVLDIPAYTCDIYPTLLSLCQIPP 370 Query: 343 -KPEILPGENILAVKEPRGVMVEFNR---YEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 + + G ++ + + E R + H G P +KL+ L T Sbjct: 371 RPTQHVDGISLAGLLTKSSSLPESERTLVWYYPHTHGSGHKPSAAIRQGPWKLIHFLETD 430 Query: 399 D-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 ELY +DP E NL + + ++ L ++ Sbjct: 431 RIELYHLEDDPGESRNLAS--KHPERALQLQKELQKIIE 467 >UniRef50_A3SJ21 Sulfatase n=1 Tax=Roseovarius nubinhibens ISM RepID=A3SJ21_9RHOB Length = 518 Score = 359 bits (922), Expect = 2e-97, Method: Composition-based stats. 
Identities = 110/442 (24%), Positives = 185/442 (41%), Gaps = 19/442 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +M D A + G Y + T ++D+LAA G+RF++AY +P+C P+R + Sbjct: 10 RRPNIVVIMADQLAPHFTGAYGHQVAKTPHMDALAARGMRFDAAYCNAPLCAPSRFAFMS 69 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G ++ + N + T Y GY TC GK H G D + D Sbjct: 70 GQLISRIAAYDNASEFRATVPTFAHYLSALGYRTCLSGKMHFVGPDQKHGFQDRVTTDIY 129 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL-----QQ 176 E ++ I W + + +V++ + + A +L + Sbjct: 130 PSDFAWTPDWEAPDERIDKWYHNMQTVKESGCAIATFQTDYDDEVEFAARRWLIDRARDR 189 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 A + P MV S+ PH P+ E+ + Y+D EL E LA+ R + Sbjct: 190 AAGQEAPLCMVASFIHPHDPYVARPEWWDLYSDDEIELPEVL--PLADHDPFSRRLMDGI 247 Query: 237 PSPV----GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 + D+ + Y A + D +IG ++ L +NT VI T+DHG+M+G Sbjct: 248 EASYVPLSRDEVIRARRAYLANVSYFDSKIGALVKTLDETGELDNTVVIVTADHGDMLGE 307 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP---EILP 348 L K ++ R+PLI+ P + S IDLLP+ + +A ++ E + Sbjct: 308 RGLWYK-MNFFEHSARVPLIMAGPGVVQGAAANACSLIDLLPSFLEIAGADESVLGEPVD 366 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDP 408 G +++ + RG + E+ + PV D K + +LYD DP Sbjct: 367 GRSLMPLA--RGEADPQDEAISEYCAEMTAWPVFMIRRGDLKYIHCDGDPPQLYDLSVDP 424 Query: 409 NEMHNLIDDIRFADVRSKMHDA 430 E N ++D +A R++M Sbjct: 425 GERVNRVEDPDYA-CRARMFAE 445 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 359 bits (921), Expect = 2e-97, Method: Composition-based stats. 
Identities = 105/454 (23%), Positives = 188/454 (41%), Gaps = 68/454 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L ++ D VG + + + T NID +A EG++F++ Y+ +C P RA L +G Sbjct: 19 KPNILIILADDLGYADVGYHGLEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSG 78 Query: 63 IYANQSGPWTNN----------VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 +Y + G V + + T+ +YF++AGY T GKWHL G F Sbjct: 79 VYQQRIGCEGICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKT 138 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 P D +F L + + ++ R +D ++ E + I AV Sbjct: 139 LMPTSRGFDEFFG---ILEGASLYDDTVNRERKYIRQDTVIDY--EGEYFTDAIGREAVS 193 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 F+ + + D+PF + + + H P +Y++++A Sbjct: 194 FITR--KGDKPFFLYLPFTAVHAPMQASEKYMQRFAHIADP------------------- 232 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE---- 287 + ++ A +DD IGRV +AL + +NT +++ SD+G Sbjct: 233 --------------NRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKPDN 278 Query: 288 -MMGAHKLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEK 343 H L + Y+ R+P +R P+G + +D PV +D+ P+ + A I Sbjct: 279 NYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAAQITV 338 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 P+ + + IL + + + F D+KL N EL++ Sbjct: 339 PKDIEAKTILPLMQGKTNQTPHPAM------FWKRAGKMAVRMGDWKL-SNAGGPSELFN 391 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 + D +E N+ID + D+ +KM+ L++ K Sbjct: 392 LKQDISESRNIID--QHPDIANKMNRLWLNWDKK 423 >UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Tax=Firmicutes RepID=A0Q2E3_CLONN Length = 483 Score = 358 bits (920), Expect = 2e-97, Method: Composition-based stats. 
Identities = 130/462 (28%), Positives = 195/462 (42%), Gaps = 56/462 (12%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++TD Q +GCY T +DSLA GIRF + + SPVC+PARA ++TG Sbjct: 7 NVISIITDDQGYWSMGCYGNHDAITPTLDSLANNGIRFENFFCVSPVCSPARASIYTGRI 66 Query: 65 ANQSGP------WTNNV---APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 +Q G W N K ST GY GKWHL D Sbjct: 67 PSQHGIHDWLDEWNNGYTTEEYLKGQSTFVDILAKNGYECAMSGKWHLGVADK------- 119 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 P+ YW+ ++++G I E +++ ++F++ Sbjct: 120 PQNGFKYWYSHQKGGGPY--YGAPMYKDG---------TLIHEERYVTDVMTDYGLEFIE 168 Query: 176 QPARADEPFLMVVSYDEPHHPFTC---PVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 + +D PF + ++Y PH P++ P E L+ Y D ++ K + W Sbjct: 169 KQRDSDNPFYLSLNYTAPHAPWSPENHPKELLDLYKDCEFKSCPKDGKN---------DW 219 Query: 233 AQAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG 290 + P +D YFA VD+ I RVI+ L ENT +I+TSD+G MG Sbjct: 220 SIDYIFPKTEDERREVLRGYFAALTSVDNNIKRVIDKLKEMGVLENTLIIFTSDNGMNMG 279 Query: 291 AHKLISKGA-----AMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADI--E 342 H + KG M+D +IP I + QV T +SH D+ PT+M I E Sbjct: 280 HHGIFGKGNGTSPVNMFDTSVKIPCFITKIGDIKPQVSTDLLSHYDIRPTLMEYLGIEDE 339 Query: 343 KPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFTSD 399 E LPG + ++ + + N I + + P R T ++K V Sbjct: 340 IDEGVKLPGRSFASLLRGEKLERDDNEVVI----YDEYGPARMIRTKEWKYVHRYPAGPH 395 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ELYD NDP+E NLIDD D+ ++ L + + +P Sbjct: 396 ELYDLVNDPDEKINLIDDEDKKDIVKELRYRLKRWFIQYVNP 437 >UniRef50_A3HTC6 Choline sulfatase n=5 Tax=Bacteria RepID=A3HTC6_9SPHI Length = 499 Score = 358 bits (920), Expect = 2e-97, Method: Composition-based stats. 
Identities = 100/475 (21%), Positives = 187/475 (39%), Gaps = 31/475 (6%) Query: 4 PNFLFVMTDTQATNMVG-CYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN +F+ +D + +G + T NID LA G F +A+ +P+C P+R + TG Sbjct: 30 PNIVFIASDDL-NDWIGVLNGHPQVKTPNIDRLANRGTLFTNAHAQAPLCNPSRVSILTG 88 Query: 63 IYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G + + + T+ +YF+ GY T GK G + Sbjct: 89 LRPTTTGIYGLAPRHREVERTKEVVTLPQYFEKRGYRTLSTGKIFHGGITPTERAIEFQD 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 W D S++ + + + + L H ++ +++++ AV+ + + Sbjct: 149 WGPDGGHRPFP-PSKIVKAPLDMIDHPLIDWGIYPVEH--DSIMDDYKVASWAVEQINEI 205 Query: 178 ARAD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + PF + V +++PH P ++ + Y L D + P+ Sbjct: 206 GKGGDSNPFFLAVGFNKPHVPLYTSQKWFDLYPKDEIILPLAPFGDRNDIPDFAWNLHWY 265 Query: 236 MPSP------VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM 288 +P P + Y A F+D Q+GRV++AL ENT +++ SDHG Sbjct: 266 LPEPRLSWLIANQEWENKVRAYLATISFMDAQVGRVLDALEENNLTENTIIVFWSDHGYH 325 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEIL 347 +G + K ++++ T +PLI P + + PV +D+ PT++ +A + K + + Sbjct: 326 LGEKDITGKN-SLWERSTHVPLIFAGPGVSKGAISSQPVELLDIYPTLVEMALLSKNDAV 384 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G ++ E + T+ ++ + S+ELYD D Sbjct: 385 EGISLKPQLEDANAKRTKPAITTHNPGN------NAVRTERWRYIKYADGSEELYDHYRD 438 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMG 462 E NL + D ++ L ++ K P + KD W G Sbjct: 439 DEEWSNLA----YLDEYRELKAELSKWLPKTSAPLAEGSKHRILYEKDGVWYWEG 489 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 358 bits (920), Expect = 3e-97, Method: Composition-based stats. 
Identities = 105/458 (22%), Positives = 177/458 (38%), Gaps = 57/458 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D +GCY + T +ID LA G +F Y+ SPVC P+R L TG Sbjct: 25 KPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGGAKFTRFYSGSPVCAPSRCVLMTG 84 Query: 63 IYANQSGPWTNNVAPGK-------NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 ++ + N A + T+ K GY T +GKW L D G+ P Sbjct: 85 KHSGHATVRNNVEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMFDTAGS---P 141 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D +F + ++RN + FT A+ F++ Sbjct: 142 LKHGFDLFFGYNCQRHAHSHYPTYIYRNDKRVELKGNDGKTGKQFTQ-DLFEEEALGFIE 200 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH-RLWAQ 234 A +PF + + + PH P + L +Y L + P + + Q Sbjct: 201 --ANKAKPFFLYLPFTVPHVAVQVPEDSLNEYKGQ-----------LGDDPAYDGKKGYQ 247 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG------- 286 P+P H Y A +D +GRV+ L E NT V++TSD+G Sbjct: 248 PHPAP--------HAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNGPTHNVGG 299 Query: 287 ----EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALAD 340 A KL ++Y+ R+P I P + D P+ D+LPT+ A A Sbjct: 300 ADSSFFNSAGKLRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAG 359 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--- 397 + P + G + L + + ++ + F G+ + + ++K V Sbjct: 360 TKAPSAIDGISFLPLLKGEKQPT----HDFLYWEFSGYGGQQAVIEGEWKAVRQALGMGG 415 Query: 398 -SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 ELY+ DP+E ++ + V +++ L + Sbjct: 416 VKTELYNLAKDPSEKEDVAA--KNPAVLARLEKRLKNE 451 >UniRef50_C0S8M2 Choline sulfatase n=8 Tax=Eurotiomycetidae RepID=C0S8M2_PARBP Length = 619 Score = 358 bits (919), Expect = 3e-97, Method: Composition-based stats. 
Identities = 111/432 (25%), Positives = 188/432 (43%), Gaps = 20/432 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++P+ L++M D A ++ Y P+ T N++ LA EG+ F SAY SP+C P+R + Sbjct: 5 EKPSILYIMADQMAAPLLSLYDENSPIKTPNLERLAREGVCFESAYCNSPLCAPSRFSMV 64 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG +++G + N + T Y + GYHT GK H G D E + Sbjct: 65 TGQLPSKTGGYDNASDLPADTPTYAHYLRKEGYHTALAGKMHFVGPDQLHGYEQ--RLTS 122 Query: 121 DYWFDGANYLSELTEKEISL-WRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP-- 177 D + + E E+ W + ++SV + + + +RA +L Sbjct: 123 DIYPGDYGWTVNWDEPEVRPDWYHDMSSVLEAGPCVRTNQLDYDDEVIHRATQYLYDHTR 182 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 RA +PF + VS PH P+ EY + Y D L + + H + +++ Sbjct: 183 HRAGQPFCLTVSMTHPHDPYAMTKEYWDLYEDIDIPLPKTPVIPHDEQDPHSQRVLKSID 242 Query: 238 SPVGDD----GLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAH 292 + L YFA +VD Q+GR++ L + +NT V++T DHG+M+G Sbjct: 243 LFGKEIPEQCILAARRAYFAACSYVDSQVGRLMATLKACDLADNTIVVFTGDHGDMLGER 302 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEIL--PG 349 L K Y+ R+P+ + +P + ++V VS +DLLPT A+A E L G Sbjct: 303 GLWYK-MVWYEHAARVPMFVHAPGRYKPKRVKENVSTMDLLPTFAAMAGGEINNHLPIDG 361 Query: 350 ENILAVKEPRGVM-----VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDR 404 +++ ++ + E+ + G PV +K + + L++ Sbjct: 362 VSLMPYLLDSDSREAVSGLKTDTVIGEYMAEGTLAPVVMIRRGPWKFIYSPIDPPMLFNV 421 Query: 405 RNDPNEMHNLID 416 + DP E NL Sbjct: 422 KRDPTEAVNLAS 433 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 358 bits (919), Expect = 3e-97, Method: Composition-based stats. 
Identities = 113/498 (22%), Positives = 182/498 (36%), Gaps = 80/498 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + ++ D VG K + T +ID LA G+ F AY +P C P RA L + Sbjct: 41 RPPNIILILMDDMGWRDVGFMGNKFVETPHIDRLAKTGLVFTQAYASAPNCAPTRACLMS 100 Query: 62 GIYANQSGPWT-------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWH 102 G YA + G +T + N+ T+ +D GY T + G W+ Sbjct: 101 GQYAPRHGIYTVVDPRQPPGSPWHKWQAAESKSELDTNVVTIAEALRDGGYATAFFGMWN 160 Query: 103 LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 L G G P F + L + + +G + Sbjct: 161 L------GRGRTGPVTPGGQGFQKVVFPENLGFGKDEYFDDGKH--------------YL 200 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 R+++ + F+ + ++PF + + H PF E L KY Sbjct: 201 TDRLTDEVLKFVDE--HREQPFFVYLPDHAIHAPFNPKPELLAKYE-------------- 244 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIY 281 + P A + VD +GR+++ L + +NT VI+ Sbjct: 245 --------------RKAAASNDRRDDPACAATIEAVDHNVGRIMDHLKRLKLSDNTVVIF 290 Query: 282 TSDHGEM-MGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 TSD+G L +Y+ R+PL++ P + + D PVS IDL PT++ L Sbjct: 291 TSDNGGTQQYTPPLRGGKGELYEGGIRVPLVVAGPGVKSLGSRCDVPVSSIDLYPTLLEL 350 Query: 339 ADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A I+ PE L G ++ + + + + G P DFKL+ Sbjct: 351 AGIKPPEGQVLDGVSLAPLLQGDATLDRERLFWHFPCYVGKATPSSAMREGDFKLIEFFE 410 Query: 397 TSD--ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY-QWSLRPWR 453 EL++ +NDPNE NL + D + + L + K S P Sbjct: 411 EGGRVELFNLKNDPNEEKNLASVM--PDKAAALAKTLRAWQKKTNASIPPGPNPSYDPQA 468 Query: 454 KDARPRWMGAFRPRPQDG 471 + R G +P+ G Sbjct: 469 ERPRGNQGGGRPDKPKRG 486 >UniRef50_Q1IH24 Choline sulfatase n=29 Tax=cellular organisms RepID=Q1IH24_PSEE4 Length = 505 Score = 358 bits (918), Expect = 4e-97, Method: Composition-based stats. 
Identities = 115/445 (25%), Positives = 184/445 (41%), Gaps = 14/445 (3%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN LF+M D A ++ Y+ P+ N+ LA + + F+SAY SP+C P+R L Sbjct: 1 MKQPNILFIMADQMAAPLLPIYTPSPIKMPNLARLAEQAVVFDSAYCNSPLCAPSRFTLV 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G ++ G + N +I T Y + GY T GK H G D E D Sbjct: 61 SGQLPSRIGAYDNAADFPADIPTYAHYLRRLGYRTALSGKMHFCGPDQLHGYEERLTSDI 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Y D ++ + W + ++SV + + +A +L R Sbjct: 121 -YPADYGWAVNWDAPDQRPSWYHNMSSVLQAGPCVRTNQLDFDEEVVFKARQYLYDHVRE 179 Query: 181 D--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA--QAM 236 D PF + VS PH P+T P Y + Y L P RL Sbjct: 180 DHGRPFCLTVSMTHPHDPYTIPKRYWDLYEAVDIPLPRDVIAQSQQDPHSQRLLKVYDLW 239 Query: 237 PSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 P+ D + YF ++DD IG ++ L ++T ++++ DHG+M+G L Sbjct: 240 DKPLPVDKIRDARRAYFGACSYIDDNIGLLVQTLEDCGLADDTLIVFSGDHGDMLGERGL 299 Query: 295 ISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALAD--IEKPEILPGEN 351 K ++ R+PL+I +P+ ++ VS DLLPT++ LA ++K L G + Sbjct: 300 WYK-MHWFEMSARVPLLIHAPKRFAPARISASVSTCDLLPTLVELAGGAVDKDLHLDGRS 358 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 +L + +G + E+ + G P+ +K V + LYD DP+E Sbjct: 359 LLGHLQGQGG---HDEVIGEYMAEGTVGPLMMIRRGAYKFVYSEDDPCLLYDLSRDPHER 415 Query: 412 HNLIDDIRFADVRSKMHDALLDYMD 436 NL + D D Sbjct: 416 ENLTGSPDHQVLLQAFVDEAKQRWD 440 >UniRef50_UPI00017453D4 choline sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017453D4 Length = 485 Score = 357 bits (917), Expect = 6e-97, Method: Composition-based stats. 
Identities = 119/445 (26%), Positives = 191/445 (42%), Gaps = 32/445 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N LF++ D + +G G + T N+D LA G F +A+ +P C+P+R TG Sbjct: 26 RLNVLFILVDDL-NDQIGWLGGAGI-TPNMDRLAQRGTLFANAHAQAPWCSPSRTSFLTG 83 Query: 63 IYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G PW NV + + T+ ++F GY T IGK + +G CPP Sbjct: 84 KRPSTTGIYALTPWFRNVPALRELVTLPQHFAAHGYETFGIGKVYHEG--------CPPA 135 Query: 118 WDADYWFDGANYLSELTEKEIS-LWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 F Y + + S + N +D + T ++++ A++ L + Sbjct: 136 NQPTPEFSVMGYQGNWRKPQPSKPFVNTPGMRQDFGQFPDRDDQTDDFKVASSAIECLGR 195 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 P +PF + V PH+P P ++ Y L E D + P R Sbjct: 196 PHT--KPFFIAVGLRRPHYPLYAPQQWFSLYDPQNVWLPEVPATDRDDLPRFARALRLGN 253 Query: 237 PSPV------GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 P H+ Y AC FVD+QIGR+++AL T ++ SDHG + Sbjct: 254 TEPTLGPIVNAGLWRSHNHAYLACVSFVDNQIGRILDALEQSGEAHRTVIVLASDHGFHL 313 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 G +L +K +++ T +PLI P R PV +D+ PT+ + + P L G Sbjct: 314 GEKELFAK-RTLWERATHVPLIFAGPGVGRGTSKRPVELLDIYPTLTEICGLPTPPGLEG 372 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 E++ A+ R R I G F T+D++ + S+ELYD R DP Sbjct: 373 ESLGALL--RDPSAARTRPAITGQMQGSF----AVRTEDWRYIRYADGSEELYDHREDPQ 426 Query: 410 EMHNLIDDIRFADVRSKMHDALLDY 434 E NL D R+ V++++ + + Sbjct: 427 EFLNLAADQRWTSVKTELGSWIPKH 451 >UniRef50_Q7WC54 Putative sulfatase n=3 Tax=Proteobacteria RepID=Q7WC54_BORPA Length = 529 Score = 357 bits (916), Expect = 6e-97, Method: Composition-based stats. 
Identities = 112/452 (24%), Positives = 179/452 (39%), Gaps = 18/452 (3%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PNFLF+M D + Y T N+D LAA RF + Y P+C P+R + T Sbjct: 5 KQPNFLFLMADQLTAFALRMYGNGVCRTPNLDRLAARSTRFANMYCNFPLCAPSRVAMLT 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G + G + N + T + AGY T GK H G + + D Sbjct: 65 GRLPSSVGVYDNASEFSAEVPTFLHHLALAGYSTILSGKMHFVGPEQHHGFQ--ERLTTD 122 Query: 122 YWFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR- 179 + + + E+ I+ + SV + + + R V + R Sbjct: 123 IYPSDFGWTPDWREEIPIAPTGMNMRSVIEAGEYRRSMQIDYDDDVVYRGVQKIYDLGRL 182 Query: 180 -ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK-PEHHRLW---AQ 234 D PF + VS PH+P+ E+L+ Y ++ A + P RLW Q Sbjct: 183 HRDRPFFLAVSMTHPHNPYVSTREFLDLYRPEDIDMPAVPPIPFAQQDPHSQRLWYMFRQ 242 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHK 293 Y+A +VD Q+GR+++AL + T V++T+DHG+M+G Sbjct: 243 DEYDVSDAHVRAARHAYYAMVSYVDAQVGRMLDALQAMDLDESTVVVFTADHGDMLGERG 302 Query: 294 LISKGAAMYDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMALADIEKPEI-----L 347 L K +D RIPL+I +P R V S +D+ PTM+ LA + P+ Sbjct: 303 LWYKW-VHFDPAVRIPLLISAPGRTRPAVRHELASLVDIFPTMLELAGVSVPDDGASPPP 361 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G ++ E + G P +KLV+ L++ ++D Sbjct: 362 DGRSLAEGL-GVSQDEPTGVVYGEMNGEGAHAPCLAVRQGWWKLVVAEGDPPLLFNLQDD 420 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 P+E+ NL D+ ++ + D R Sbjct: 421 PHELRNLAGQPAARDIERQLTALVQARWDARR 452 >UniRef50_A6DNH0 Choline sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNH0_9BACT Length = 466 Score = 357 bits (916), Expect = 7e-97, Method: Composition-based stats. 
Identities = 113/447 (25%), Positives = 197/447 (44%), Gaps = 32/447 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L + D + G G P + T ++D LA G F +A+ PVC+ +R + Sbjct: 18 EKPNVLMISIDDL-NDWTGFLGGHPQVKTPHMDKLANSGRIFANAHCAVPVCSSSRVSVM 76 Query: 61 TGIYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 +G+ A G + ++ K++ T+ R+FK+ GY+T GK G + Sbjct: 77 SGLAATTHGSYEIGPSYQSIPALKDVLTIQRHFKNQGYYTLAGGKVLHHGFKGSVANDND 136 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + L E W G + D QA+ + ++++ A LQ Sbjct: 137 RSLIKGHSGPKPKQPLNLPEGWSRAWDWGQHPGTDAQAHDM--------KLAHNAAQALQ 188 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + D+PF M V + PH P P ++ Y + L + DL + P++ Sbjct: 189 E--DFDKPFFMSVGFFRPHVPLLVPPKWFNLYDEESIVLAPSPKSDLDDVPKNFLSINDY 246 Query: 236 MPSPV------GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM 288 +P D Y A FVD +GRVI+AL + +NT VI SDHG Sbjct: 247 AVAPTHKEVLATDSHRKLTHAYLASISFVDACVGRVIDALKNSKYADNTIVILWSDHGFH 306 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALADIEKPEIL 347 +G + +K ++++ T++PL++ P E + P S ID+ PT++ L ++ P+ L Sbjct: 307 LGEKEHWAK-RTLWEESTKVPLLVYGPGIESGEACLEPASLIDIYPTLVDLCGVKAPKKL 365 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G +++ + + E + I +G T D++ + ++ELYD +ND Sbjct: 366 DGISLMPQL--KNPLSERKQPAIISSYYGN----HAVRTRDWRFISYEDGAEELYDHKND 419 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDY 434 P+E NLI+D + +R ++ L Sbjct: 420 PDEYKNLINDPNYKSIRDELAQWLPKK 446 >UniRef50_Q46P27 Sulfatase n=3 Tax=Proteobacteria RepID=Q46P27_RALEJ Length = 482 Score = 357 bits (916), Expect = 7e-97, Method: Composition-based stats. 
Identities = 120/451 (26%), Positives = 195/451 (43%), Gaps = 26/451 (5%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M N + +M+D M+GC + T N+D+LAA G+RF+SAYT SP+C PARA Sbjct: 1 MASKNVVVIMSDEHDPRMMGCSGHPFVKTPNLDALAARGVRFSSAYTPSPICVPARAAFA 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG +Q W N + G +D G IGK H + E P +DA Sbjct: 61 TGRRVHQVRLWDNAMPYTGEQRGWGHVLQDRGIRVESIGKLH------YRNEEDPAGFDA 114 Query: 121 DYWF----DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 ++ G + NG + + + ++ RAV +LQ+ Sbjct: 115 EHLPMHVVGGHGMVWASIRNPFRPRENGPRMLGEHIGPGESSYTQYDRAVTQRAVQWLQE 174 Query: 177 PA-RADEPFLMVVSYDEPHHPFTCPVEYLEKYA-DFYYELGEKAQDDLANKP---EHHRL 231 A R + F++ V PH PF P E+ Y D E + P E+ Sbjct: 175 AAQRQEAGFVLYVGLVAPHFPFVVPEEFYSLYPTDGLPEPKLHPRTGYEQHPWVREYCDF 234 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTW-VIYTSDHGEMMG 290 A D+ L Y+ ++D +G+++ AL E+T ++YTSDHG+ +G Sbjct: 235 MASERQFADADERLRAFAAYYGLCTWLDHNVGQILGALRDNGLEDTTHIVYTSDHGDNLG 294 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILP-- 348 A + K + +Y++ ++P+++ P +TPV +DL PT++ A ++ + Sbjct: 295 ARGVWGK-STLYEESVKVPMLLAGPIVTPGVCNTPVDLLDLFPTILQGAGVDPATEIDER 353 Query: 349 -GENILAVKE--PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 G ++ + P V + Y + GGF +K + EL+D Sbjct: 354 PGRSLFELARSAPEPDRVILSEYHAAGSNAGGF----MLRKGRWKYHHYVGFRPELFDLE 409 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 +DP E+ +L D +A V + MH+ALL D Sbjct: 410 SDPEELTDLAGDPAYAPVLASMHEALLAICD 440 >UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W906_ENT38 Length = 501 Score = 356 bits (915), Expect = 1e-96, Method: Composition-based stats. 
Identities = 124/487 (25%), Positives = 205/487 (42%), Gaps = 57/487 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + +PN + ++ D +G Y + T NID LA EG+RF+ Y +P+C+P+RAGL Sbjct: 33 LNKPNVVIILADDLGYGDLGIYGHPIVKTPNIDKLAQEGVRFSQYYAPAPLCSPSRAGLL 92 Query: 61 TGIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 TG ++G P N+A G+N T+ Y KD GY T +GKWHL+ Sbjct: 93 TGRTPFRTGIRSWIPTNKNIALGRNEKTIASYLKDQGYDTAMMGKWHLNAGVDRHDQPQA 152 Query: 116 PEWDADYWF-DGANYLSELTEKEISLWRNGLNSVEDLQANH---IDETFTWAHRISNRAV 171 + DY + A +++ +K RNG+ N +S A+ Sbjct: 153 EDAGFDYTLVNAAGFVTSDLDKAKERPRNGVVYPNGFYRNGKALGTVNQISGEFVSQEAI 212 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 ++L + ++PF M V++ E H P P +YLE Y ++ E+ + Sbjct: 213 NWLND-KKDNKPFFMYVAFTEVHTPLASPKKYLEIYKNY--------------MSEYEKQ 257 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE--- 287 + D Y+A ++D+Q+G+V+ + Q +NT +I+TSD+G Sbjct: 258 HPDMFYADWVDKPYRGPGEYYANISYMDEQVGKVLAKIKSMGQEDNTIIIFTSDNGPVTR 317 Query: 288 ---------MMGA-HKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTM 335 M G L + +++ R+P II+ Q DTPVS +D+LPT+ Sbjct: 318 EARKWYELNMAGETDGLRGRKDNLWEGGIRVPAIIKYGQHLHAGTVTDTPVSGLDILPTL 377 Query: 336 MALA--DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCW--VTDDFKL 391 L ++ I+ GE+I+ V E + + + F P W D+K+ Sbjct: 378 AELTHFNLPTDRIIDGESIVPVLEGQTMNRQQPLLFAIDMPFQD-DPTDMWALRDGDWKM 436 Query: 392 VLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY---------MDKIRDP 441 + + + LY+ + D E N + + KM AL Y M D Sbjct: 437 IFDRNSKPKYLYNLKLDRGETMNQLGKQ--PVLEQKMIAALARYQSSIENDSLMKARGDK 494 Query: 442 FRSYQWS 448 W+ Sbjct: 495 PTPVDWN 501 >UniRef50_UPI0001744DD5 choline sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744DD5 Length = 469 Score = 356 bits (914), Expect = 1e-96, Method: Composition-based stats. 
Identities = 106/449 (23%), Positives = 180/449 (40%), Gaps = 29/449 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 +RPN LF+ +D Q + + L T N+D L +G F AY VC P+RA Sbjct: 13 ERPNVLFLFSDDQRADTIAALGNTHLQTPNLDRLVRDGTTFTQAYCMGSNQGAVCVPSRA 72 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L +G + K +T F AGY T GKWH + Sbjct: 73 MLMSGRTLYRV------QEQLKGQATWPESFASAGYRTFMTGKWHNGAPSALRAFQEA-- 124 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + G + +++S G N N +++AV+F++ Sbjct: 125 --KAVFLGGMGDPDAIPVQDMSSAGQGGNRQF---VNRRTVEKHCVELFADKAVEFVRAQ 179 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLANKPEHHRLWAQAM 236 ++ +P+L V+++ PH P P + E+ + E N E + Sbjct: 180 KQSSQPWLCYVAFNAPHDPRKAPPAWHEQTNANKPPIPENFLPVHPFNNGEMTVRDEKLA 239 Query: 237 PSPVGDDGLYHH-PLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 P P + + Y+A F+D QIGR++ +L Q E T ++++SDHG +G+H L Sbjct: 240 PWPRTEPVIRQELADYYAAIMFMDSQIGRILESLRATGQDEKTIIVFSSDHGLAIGSHGL 299 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMALADIEKPEILPGENIL 353 + K ++YD PLI+ P + + +D+ PT+ LA + PE G +++ Sbjct: 300 MGK-QSLYDHSMHSPLILAGPGVPKGEKRAALCYLLDVYPTLGDLAGVNAPEGSEGLSLV 358 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEMH 412 V + + G R D +KL++ + +LYD ++DP E Sbjct: 359 PVLKGEEITRRQAIMT------GYRKVQRAVRDDQWKLIVYPQVNKMQLYDLKSDPAETR 412 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +L + A+ +M L + D Sbjct: 413 DLAREPGHAEEIDRMRTLLEKLQKENGDT 441 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 356 bits (913), Expect = 1e-96, Method: Composition-based stats. 
Identities = 113/473 (23%), Positives = 187/473 (39%), Gaps = 59/473 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNF+ ++TD Q + C+ K ++T ID +AAEG R S Y +PVCTP+RAGL TG Sbjct: 570 KPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRAGLMTG 629 Query: 63 IYA--------NQSGPWTNNVAPG--KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 Y + G G + T+ K AGY T GKWHL F Sbjct: 630 CYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQPEF--- 686 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI----DETFTWAHRISN 168 P + D +F G Y ++ + LQ + + + R++ Sbjct: 687 -LPTKQGFDEFF-GIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDADFLTKRLTE 744 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +AV F+++ D+PF + + + PH P ++E AD EK ++ Sbjct: 745 QAVSFIER--NKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNI------ 796 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 D L+ +D +G++++AL + T V++TSD+G Sbjct: 797 --------------DYATRANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGP 842 Query: 288 -----MMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALAD 340 +L ++ R P ++R P Q D ++ +DLLPT LA Sbjct: 843 PKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAG 902 Query: 341 IEKP--EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 P ++ G++I + + + + +KL +N + Sbjct: 903 AAIPTDRVIDGKDIWPTLKGETQTPHDAFFYHRGNQLA------AVRSGKWKLHVNNGVA 956 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 +LYD ND E N+I+ +V K+ L D+ I R ++ P Sbjct: 957 KQLYDLENDLGEKVNVIE--TNPEVVKKLQHQLKDFAADIASNSRPAAFNANP 1007 Score = 280 bits (716), Expect = 9e-74, Method: Composition-based stats. 
Identities = 118/538 (21%), Positives = 191/538 (35%), Gaps = 146/538 (27%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + + D +GCY L+T NID LAAEG RF A++ S VCTP+R GL T Sbjct: 38 RPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYGLLT 97 Query: 62 GIYANQS----GPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y ++ G W T+ + N T+G+ FK+ GY T +GKWHL + Sbjct: 98 GQYPVRAMGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKEEPCDW 157 Query: 113 EC-----PPEWDADYWFD-----------GANYLSELTEKEISLWRNGLNSVED------ 150 + P + D++F N S G V Sbjct: 158 QVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTPMFPE 217 Query: 151 -------------LQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF 197 L+A+ I + ++ RAV ++ + + +EPF + + HHPF Sbjct: 218 EASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFLYFATPNIHHPF 275 Query: 198 TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDF 257 T + G LY Sbjct: 276 TPAPRF---------------------------------------KGTSQCGLYGDFVHE 296 Query: 258 VDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHK-------------LISKGAAMYD 303 +D +G ++ +L +NT V++TSD+G M+ L+ +++ Sbjct: 297 LDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWE 356 Query: 304 DITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPEILPGENI--------- 352 R+PLI + P + Q D +S +DL T AL + E P ++I Sbjct: 357 GGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALTEQEMPSSEQKDSINMLPALLDD 416 Query: 353 ----------LAVKEPRGVMVEFNRYEI--------------EHDSFGGFIPVRC----- 383 LA ++PR + + ++ +H ++GG V+ Sbjct: 417 PNEPLRTELVLAPRQPRNLAIRKGKWLYIGARGSGGFNGSKPQHHAWGGPAAVQFSGQKN 476 Query: 384 --WVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 V + +LYD ND ++ N+ +V +M L Y K Sbjct: 477 SDIVNGR---IKKNAPPAQLYDLENDRSQTTNVFR--EHPEVVEEMKAMLESYRPKQG 529 >UniRef50_C3QDX1 Sulfatase n=2 Tax=Bacteroides RepID=C3QDX1_9BACE Length = 485 Score = 356 bits (913), Expect = 1e-96, Method: Composition-based stats. 
Identities = 112/451 (24%), Positives = 202/451 (44%), Gaps = 24/451 (5%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+ D + G + T NID LA+EG+ F ++Y+C P PAR L +G + Sbjct: 30 NILFIQADQHRYDCTGFSGKGLVKTPNIDKLASEGVIFTNSYSCIPTSCPARQSLISGKW 89 Query: 65 ANQS-GPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGKWHLDG---HDYFGTGECPPE 117 Q G W ++ N T + Y+GKWH+ FG + PE Sbjct: 90 PEQHKGLWNYDITLPVTPFNGPTWTEKLSEKDIKMGYVGKWHVSDRKSPKDFGFDDYVPE 149 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 W + W N L + ++ G + V+ +Q + H ++ R ++ +++ Sbjct: 150 WSYNNW-RKKNNLPDYVWQDSRWVMGGYDPVDKMQ--------SRTHWLAQRVIEMIKKY 200 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP--EHHRLWAQA 235 + + + +PH P E+L Y + +DDL+NKP + +++ Sbjct: 201 QSEGKKWHVRFDTSDPHLPCYPVREFLAMYDKEKIQEWPNYRDDLSNKPYIQRQQIYNWE 260 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKL 294 + + + YFA +DD +G VI AL +NT+++YT+DHG+ G+H + Sbjct: 261 LEDSNWEMWQGYLQRYFANITQLDDAVGMVIEALKEMGVYDNTFIVYTTDHGDAAGSHNM 320 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH-IDLLPTMMALADIEKPEILPGENIL 353 + K MY++ +PL+++ P R +D V++ +D+ T + ++ GE++L Sbjct: 321 VDKHYVMYEEEVHVPLVMKIPGVSHRIIDRFVNNQLDMAATFCDMYQLDY--KTQGESLL 378 Query: 354 AVKEPRGVMVEFNRYEI--EHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 + E + ++ Y + G R K V NL +DELYD +DP E+ Sbjct: 379 PLIEEKKEASDWREYAFSNYNGQQFGLFVQRMIRDKRMKYVWNLTDTDELYDLESDPWEL 438 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 +NL+ + ++ AL + + + +DP Sbjct: 439 NNLVYSKEYKAELVRLRKALYEDLKQRKDPL 469 >UniRef50_Q0TUK6 Arylsulfatase n=9 Tax=Bacteria RepID=SULF_CLOP1 Length = 481 Score = 355 bits (912), Expect = 2e-96, Method: Composition-based stats. 
Identities = 113/467 (24%), Positives = 183/467 (39%), Gaps = 33/467 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +M D + +G + + T N+D +A EG F +AYT P C +RA + TG Sbjct: 2 KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61 Query: 63 IYANQSGPWTNNVAPGKN-ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 + G N +T+ F AGYHT IGK H+ D Sbjct: 62 MSQKSHGRVGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHVYPERNLCGFHNIMLHDGY 121 Query: 122 YWF----------------DGANYLSELTEKEISLWRNGLNSVEDL-QANHIDETFTWAH 164 F D + E + L GL+ + + +E + Sbjct: 122 LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTN 181 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + N ++DFL++ +PF + +S+ PH P P Y + Y D +L E D AN Sbjct: 182 WVVNESIDFLRR-KDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDE--DLPEPLMGDWAN 238 Query: 225 KPEHHRLWAQ---AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 K + Y+ +D QIGR + AL+ NT + Sbjct: 239 KEDEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFL 298 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-----QGERRQVDTPVSHIDLLPTM 335 + SDHG+MMG H KG Y+ +R+P I P + + D + D++PT+ Sbjct: 299 FVSDHGDMMGDHNWFRKG-IPYEGSSRVPFFIYDPGNLLKGKKGKVFDEVLELRDIMPTL 357 Query: 336 MALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 + A I P+ + G ++ + E R Y SFG D L + Sbjct: 358 LDFAHISIPDSVEGLSLKNLIEERNSTWRD--YIHGEHSFGEDSNHYIVTKRDKFLWFSQ 415 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 ++ +D NDP E+ NLID + + + L+ ++ + + Sbjct: 416 RGEEQYFDLENDPKELTNLIDSEEYKERIDYLRKILIKELEGREEGY 462 >UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA8_RHOBA Length = 745 Score = 355 bits (912), Expect = 2e-96, Method: Composition-based stats. 
Identities = 110/456 (24%), Positives = 195/456 (42%), Gaps = 37/456 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN LF+ D + VGC T N+D A + + FN+A+ +C +RA T Sbjct: 308 RPNVLFITVDDL-NDWVGCLGGNPDAQTPNLDRFAQQSVLFNNAHCQVALCYASRASFMT 366 Query: 62 GIYANQSGPWTNNVAPGKN----ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G+YA+++G + N+ ++ M +F ++GY T +GK + + H + Sbjct: 367 GMYASKTGIYNNSSKSARDAYHRAKQMPVWFGESGYRTMCMGKIYHNDHGKKAYWD---- 422 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID--ETFTWAHRISNRAVDFLQ 175 + + E R G ++ + L +D + +I+ ++ L Sbjct: 423 ---EIGPKTLRWGPEPPNGRQFTKRFGKDAQDSLAWAALDIEKGGMPDEQIAAWGIEKLD 479 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 Q D+PF + + + +PH P T P Y E++ L ++DL + PE R W Sbjct: 480 Q--EYDQPFFLSLGFYKPHTPMTAPKRYFEQFDRDSLTLPNVLENDLDDVPEIGRRWVLD 537 Query: 236 MPSPVGDDGLYHHP---------LYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 + ++ + + Y AC +DD IG+V+ L NT V+ SDH Sbjct: 538 RSKLIAEEAVKQYSPTYRRELVHAYHACVALIDDCIGQVLRKLDNSPYANNTIVVLCSDH 597 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEK 343 G +G K +++ TR LI+R+P G + V ID+ PT+ L ++ Sbjct: 598 GWHLGEKNHWRKWMP-WEESTRSLLIVRTPDAAGSGQVCQRTVGLIDIYPTLAELCELSP 656 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 P+ L G + + + ++R + G +D ++ + + S+ELYD Sbjct: 657 PDGLQGLSFRKLLD--NPDGPWDRPALTSTKAGN----HTVRSDRWRYIRYIDGSEELYD 710 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 DPNE HNL +D ++ + H +D + + Sbjct: 711 HDVDPNEWHNLANDPSMNSIKKQ-HAEWIDRLTESN 745 >UniRef50_A0JVP0 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0JVP0_ARTS2 Length = 508 Score = 355 bits (911), Expect = 3e-96, Method: Composition-based stats. 
Identities = 120/478 (25%), Positives = 200/478 (41%), Gaps = 37/478 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N LF+MTD Q + +GCY + +T +D LAA G ++ AYT + +CTPARA L TG Sbjct: 8 RTNILFLMTDQQRIDTMGCYGNRSRHTPYLDGLAARGTVYDRAYTPTAICTPARASLLTG 67 Query: 63 IYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD---GHDYFGTG 112 ++ + G +N T + GY ++GKWH+ G D++G Sbjct: 68 LHPFEHGLLSNFEWNSGHRDELPDGTPTFADELRKQGYRLGHVGKWHVGRERGPDFYGF- 126 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWR----------NGLNSVEDLQANHIDETFTW 162 E A FD Y S L EK +R +G T+ Sbjct: 127 EGEHLPGALNTFDNPAYTSWLAEKGFPSFRIVDPVYTVQKDGSQGHLIAGITDQPTEATF 186 Query: 163 AHRISNRAVDFLQQPARAD------------EPFLMVVSYDEPHHPFTCPVEYLEKYADF 210 ++++ + L++ A+ PF + PH P+ P ++ + Sbjct: 187 EAWLADQTIAKLREFAQTHPAGGAPGTETAVAPFYLSCHIFGPHLPYLIPRQWYDLVDPA 246 Query: 211 YYELGEKAQDDLANKPEHHRLWAQAM--PSPVGDDGLYHHPLYFACNDFVDDQIGRVINA 268 +L + + KP + +A+ S ++ +Y+ +D +IGR++ Sbjct: 247 TVQLPKSFAETFNGKPLVQQTYAEYWSTDSFTVEEWKKLTAVYWGYVSMIDHEIGRILQT 306 Query: 269 LTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVS 327 + ++T +++T+DHGE GAH+L KG AMY+DI RIP I+ +P E R+ VS Sbjct: 307 VEELGLNDSTVIMFTADHGEFTGAHRLNDKGPAMYEDIYRIPAIVAAPGQEPRRESKFVS 366 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 D T + +AD + G +++ E R + Sbjct: 367 LQDFTATFIDIADG-YAGNIRGSSLMPSTTAPLPADWRTEMVCEFHGHHFPYAQRMIRNE 425 Query: 388 DFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 +K + N DE YD +DP+E+HN++ +A M +L + D F + Sbjct: 426 RYKYIANPEGIDEFYDLVSDPDELHNVVTVPAYATQLKTMRLSLYKELVSRGDKFYQW 483 >UniRef50_C9L4R5 Mucin-desulfating sulfatase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4R5_RUMHA Length = 484 Score = 354 bits (910), Expect = 3e-96, Method: Composition-based stats. 
Identities = 116/480 (24%), Positives = 197/480 (41%), Gaps = 74/480 (15%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+M D Q + + C K L T N++ +A G++F + Y SPVC+PARA + TG Sbjct: 2 NILFIMADDQGSWAMNCGGTKELCTPNLNRIAESGMQFQNFYCVSPVCSPARASVLTGDI 61 Query: 65 ANQSGPW-----------------------------TNNVAPGKNISTMGRYFKDAGYHT 95 + G ++ + +T + GY Sbjct: 62 PSSHGVHDWIRSGNIDKDKFEEAGRENPYWNGYSCEDKPISYLEGKTTYTDVLNENGYRC 121 Query: 96 CYIGKWHLDGHDYFGTGECPPEWDADYWF----DGANYLSELTEKEISLWRNGLNSVEDL 151 GKWHL P+ W+ G +Y + NG V Sbjct: 122 ALAGKWHLG-------DSVCPQHGFSKWYTIGLGGCDYF------HPDIVENGNIKVLHE 168 Query: 152 QANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY 211 Q I+N+A+++L + +EPF + V + PH P+ ++ +K+ D+Y Sbjct: 169 Q--------YVTEVIANKAIEYLNEFQHQEEPFYLSVHFTAPHSPW-GEEQHPKKWMDYY 219 Query: 212 YELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP 271 ++ D A+ P+ P + + YFA +D+QIGR+++ L Sbjct: 220 ENCDFQSIPDEADHPD-----LTTGPVFGTEKRKENLRGYFAAISAMDEQIGRILDTLEA 274 Query: 272 EQ-RENTWVIYTSDHGEMMGAHKLISKGA-----AMYDDITRIPLIIRSPQGE--RRQVD 323 RENT V+YT+D+G MG H + KG MY+ ++P ++ P ++ + Sbjct: 275 NGLRENTLVVYTADNGMSMGHHGVWGKGNGTFPFNMYETSVKVPFLMSLPGVIPQGKREE 334 Query: 324 TPVSHIDLLPTMMALADIEK--PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV 381 T +S D+ PT++ L +++ E LPG + + + + + D +G PV Sbjct: 335 TILSAYDIFPTLLELCKLDRKECEKLPGRSFAYLLRWEKEHKKRDEEIVVFDEYG---PV 391 Query: 382 RCWVTDDFKLVL-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 R D+K + + ELY DP E NL + + +M L ++ +K D Sbjct: 392 RMIRNQDWKYIHRYPYGPHELYYLTEDPEEKENLYGQPEYEKMVVEMRTRLNEWFNKYAD 451 >UniRef50_A7A9X1 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7A9X1_9PORP Length = 480 Score = 354 bits (910), Expect = 3e-96, Method: Composition-based stats. 
Identities = 96/459 (20%), Positives = 186/459 (40%), Gaps = 29/459 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAY----TCSPVCTPARA 57 K+PN + ++ D + + + + T N+D LA E F +A+ T V P+RA Sbjct: 23 KKPNIILILADDMRASGMNFLGKEQVQTPNLDKLAGESTVFTNAHIMGGTSGAVSMPSRA 82 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD----YFGTGE 113 L TG Y + + +G + AGY+T + GKWH + Sbjct: 83 MLMTGKYLYN--LEKQGATIPNSHTMIGETLQKAGYNTFHTGKWHSSYEALNRCFKEGKA 140 Query: 114 CPPEWDADYW----FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 D+W +D ++ + + + N VE ++ ++ Sbjct: 141 IFFGGMWDHWNVPLYDYHADMNYGKRRPVIHNQAKSNKVEYEIGEYMYSGKHSVDIFTHE 200 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD--DLANKPE 227 AV+++QQ ++PF + V+Y PH P + P EY++ Y +L + N Sbjct: 201 AVEYIQQQKDKNQPFFLSVAYMSPHDPRSMPDEYMQLYDQSQIQLPPNFMEKHPFDNGEL 260 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 R A D+ H Y+A VD ++G +I L ENT +I+ D+G Sbjct: 261 EIRDEILAAIPRRPDEIKKHIREYYAMISHVDKRVGNIIQTLKDNGLYENTIIIFAGDNG 320 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPE 345 +G H L+ K +Y+ +PL+I++ ++ ID+ PT+ + + P+ Sbjct: 321 LAVGQHGLMGK-QNVYEHSVGVPLMIKAAAQHTGKKTADLCYLIDVFPTLCDMLQLPVPQ 379 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD---ELY 402 + G ++L+ + + + ++ Y R +KL+ +L+ Sbjct: 380 SVDGISLLSSLDGKEPVRDYLYYSY-------MDNQRGISDGTWKLIEYHVNGKRTTQLF 432 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 + +NDP E ++L ++ ++ + + + + D Sbjct: 433 NLKNDPWERNDLSGQKKYEKTIQRLREKMAEEQKRTNDT 471 >UniRef50_Q01PN7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01PN7_SOLUE Length = 496 Score = 354 bits (910), Expect = 4e-96, Method: Composition-based stats. 
Identities = 109/455 (23%), Positives = 178/455 (39%), Gaps = 23/455 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D + +G + ++T N+D LAA G+RF +AY+ +P CTPARAGL TG Sbjct: 24 RPNILLLMADQWRADCLGAAGNRAIHTPNLDQLAASGVRFTNAYSATPTCTPARAGLLTG 83 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC-------- 114 + G + M R +DAGY+T IGK H Sbjct: 84 LAPWNHGMLRYAEVGARYPVEMPRALRDAGYYTAAIGKLHYHPQRNVHGYHQALLDESGR 143 Query: 115 --PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 P++ +DY + L L N D + + E A Sbjct: 144 IESPDFRSDYRSWFWSQAPNLDPDATGLGWNDF----DARPYTLPERLHPTTWTGQTAAS 199 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 +++ R+ EPF + VS+ PH P+ P +Y D A Sbjct: 200 WIETYQRS-EPFFLKVSFARPHSPYDPPDRLWRRYQDAPLPPAAVAGWASRYAARSGPQP 258 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGA 291 + Y+ FVD+QIGR++ +L + T +++ SDHG+M+G Sbjct: 259 DAWHGDLGAEQVRRSRQGYYGSVTFVDEQIGRIMESLTRRGLLDQTLIVFFSDHGDMLGD 318 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG-----ERRQVDTPVSHIDLLPTMMALADIEKPEI 346 H L K + Y +R+P ++R P+G +D V D+LPT + A Sbjct: 319 HNLWRK-SYAYAGSSRVPFLVRWPEGMLTARRGGTIDQMVELRDVLPTFLDAAAAAPARP 377 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE-LYDRR 405 L G+++L + + + +K + + E L+D + Sbjct: 378 LDGQSLLPLIAGKSPAWRPFLDLEHGVCYSPDNHWNALADQQYKYIFHARDGREQLFDVQ 437 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 D +E+H+L D A + ++ ++ D Sbjct: 438 RDAHELHDLSGDPAAAAKLREWRQRMIAHLSPRGD 472 >UniRef50_C7MHR6 Arylsulfatase A family protein n=3 Tax=Bacteria RepID=C7MHR6_BRAFD Length = 480 Score = 354 bits (909), Expect = 4e-96, Method: Composition-based stats. 
Identities = 121/485 (24%), Positives = 202/485 (41%), Gaps = 46/485 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +MTD Q + + ++T N+D L EG F + Y SP C P+RA LFT Sbjct: 6 ERPNIVLIMTDQQRFDSIAALGHDHVDTPNLDRLVREGAAFTNTYVPSPSCAPSRASLFT 65 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-------LDGHDYF----- 109 G+Y + SG N+ + + AGY +GK H + H+ Sbjct: 66 GLYPHSSGVLRNDDPWSH---SWVEHLSAAGYRCTSVGKMHTYPYEAPVGFHERHVIENK 122 Query: 110 -----GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 +WD W G S +T +E + L + E + E + Sbjct: 123 DRAHPDLPYFLDQWDKAIWIRGHQKPSRVTYRERDDYAERLGAFE----WELPEDLHADN 178 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + N A +L+ D+PF + + + PH P+ P +LE Y D ++ Q DL + Sbjct: 179 FVGNLARHWLETYPEHDDPFFLQIGFPGPHPPYDPPARHLEPYRDRPMPEAKRTQADLDS 238 Query: 225 KP---EHHRLWAQA--------MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 +P + R QA + +P + YFA +D+Q+G +++AL Sbjct: 239 QPAPLKELRTHHQANDHDAIVQLENPTAEQLDRQRRHYFANVSLIDEQVGGILDALEERG 298 Query: 274 -RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHID 330 +NT V++TSDHG+ + H K MY+ +P II P ++ D VS +D Sbjct: 299 VLDNTVVVFTSDHGDALNDHGHSQKW-TMYEPSVHVPGIIWGPGRVEPDQRFDGLVSLMD 357 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFG----GFIPVRCWVT 386 + PT++ LA + PE + ++L + + E +Y + G + Sbjct: 358 IAPTVLELAGLTPPEWMEARSLLPALQGQE--WEGRQYVFSEHARDAILTGTALMTMARD 415 Query: 387 DDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 +KLV + D +L+D DP E NL A+ R ++ A+ + ++ Sbjct: 416 ARYKLVEFIDHEDGQLFDLAKDPYEETNLWFCEEHAETRRRLERAISTWRASSSMQTATW 475 Query: 446 QWSLR 450 R Sbjct: 476 AKDKR 480 >UniRef50_A6DMZ1 Sulfatase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMZ1_9BACT Length = 514 Score = 354 bits (909), Expect = 4e-96, Method: Composition-based stats. 
Identities = 113/494 (22%), Positives = 197/494 (39%), Gaps = 66/494 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSG---KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 RPN +++ +D AT +G Y G T NID LA EG+ F AY + +C P+RA L Sbjct: 23 RPNIVWMFSDDHATQAIGAYGGLLESYNLTPNIDRLAKEGMIFKRAYVGNSICAPSRATL 82 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG +++ G N N + + GY T IGK HL G Sbjct: 83 LTGKHSHLHGKVDNAKGFDHNQQQFQKLLQKGGYQTAMIGKIHLPGK----------MQG 132 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 DYW E+ + W + E + + E + I+ RA++++ Sbjct: 133 FDYW--------EVLPGQGKYW-DPEFVTETGKTIYPGE--HSSDVITRRALNWMNNERD 181 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK---PEHHRLWAQAM 236 +PF+++V + PH + + +K++ + + DD + ++ + + Sbjct: 182 KSKPFMLMVHFKAPHRSWQPTTRWKKKFSTMTFPEPDTLFDDYQGRGTAAKYQDMNIEHS 241 Query: 237 PSPVGD-------------------------DGLYHHPLYFACNDFVDDQIGRVINALTP 271 + VGD + Y AC VD+ IG++++ L Sbjct: 242 MNMVGDLKSNQSPRKEFLKKNALTGKALVKWKYQMYMRDYLACIAGVDENIGKILDQLAE 301 Query: 272 EQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSH 328 + NT V+Y+SD G +G H K MY++ R PL+ R P + + + V + Sbjct: 302 SGLDKNTIVMYSSDQGFYLGEHGWFDK-RFMYEESYRTPLLARWPGVIKAKTRNEDLVQN 360 Query: 329 IDLLPTMMALADIEKPEILPGENILAV---KEPRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 ID T + LA + P + GE+++ + K P + E+ + Sbjct: 361 IDFAETFLDLAGLPIPADMQGESLVPLMKGKTPDDWRTHLYYHYYEYPGWHSVHRHEGVS 420 Query: 386 TDDFKLVLNLFT------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +KL+ E YD + DP+EM + + +A +K+ L + +K Sbjct: 421 DKRYKLMRFYGKDVPNGEEWEFYDLKTDPSEMKSEYANPEYASTIAKLKKELANLREKYE 480 Query: 440 DPFRSYQWSLRPWR 453 Q+ + PW+ Sbjct: 481 VKDIP-QYDINPWK 493 >UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW2_9BACT Length = 479 Score = 354 bits (909), Expect = 4e-96, Method: Composition-based stats. 
Identities = 110/505 (21%), Positives = 188/505 (37%), Gaps = 91/505 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF++ D +G Y T N++ LA++ +RF+ AY S VC+P R + T Sbjct: 26 QRPNILFIVADDMGIMDLGVYGSDYYLTPNLNKLASQSMRFDRAYAASHVCSPTRGAILT 85 Query: 62 GIYANQSG-----PWTNNVAPGK------------NISTMGRYFKDAGYHTCYIGKWHLD 104 G Y + PW K + T R + Y T GKWHL Sbjct: 86 GRYPQRIHLTDALPWDRLYKNPKMIPPNHVKELSLKLPTFARVLQKNDYRTAMFGKWHLG 145 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 + F TG+ + D F + + + G+N Sbjct: 146 NEERFFTGKEHKAYGFDEAFGVSG--------KAKAYDKGVNE----------------- 180 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++ R + FL++ +PF++ + + PH P CP Y D Sbjct: 181 -LTERTLRFLKE--NKKKPFMLCLMHHVPHVPVACPPYAKALY-------------DSVP 224 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 K +H + + Y D+ I +V++AL +NT VI TS Sbjct: 225 KGKHQK-----------------NSKYAGMISHFDNSIKKVLDALRALGLDDNTVVIVTS 267 Query: 284 DHGE---MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 D+G + ++Y+ TR+PL+IR P + V D PT + L Sbjct: 268 DNGGLSNLSSNKPYNGGKGSLYEGGTRVPLLIRWPGKITPGSVNKSVVISNDFFPTFLEL 327 Query: 339 ADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A + L G++++ + + + + + H P + D KL+ + Sbjct: 328 AGLPLMPEAHLDGKSMMPLLKGKTLGKRTLYWHFPH----RGTPGSSIIDGDLKLIHKIE 383 Query: 397 -TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD 455 + E++D +DP E +NL + + S++ L ++ ++ S P R Sbjct: 384 SDTYEMFDLNSDPYEANNLFEKQ--PEQASRLQKMLARHLKEVAAQEMSPNPQWDPKRPK 441 Query: 456 ARPRWMGAFRPRPQ-DGYSPVVRDY 479 +P G P + G+ V Y Sbjct: 442 GKPTNFGIHYPAGRKKGFRLTVEAY 466 >UniRef50_A6DPD0 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPD0_9BACT Length = 471 Score = 354 bits (908), Expect = 5e-96, Method: Composition-based stats. 
Identities = 112/461 (24%), Positives = 202/461 (43%), Gaps = 37/461 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N LF++ D +GCY K + + NID LA+EG F+ AY PVC +RA + T Sbjct: 24 EKNNVLFIIVDDLRPE-LGCYGNKQVLSPNIDRLASEGTLFSKAYCNVPVCGASRASVMT 82 Query: 62 GIYANQSGPWTNNVAPGKN---ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + N K + + F+ GY T IGK + + +DY + + Sbjct: 83 GLRPTKDRFISYNAKAYKESGGVLDLAGIFQKNGYTTISIGKVYHERNDYRSSWD----- 137 Query: 119 DADYWFDGANYLSELTEKEISLWRN----GLNSVEDL----QANHIDETFTWAHRISNRA 170 F + ++ + ++ L N G S E L +A + + +++++ A Sbjct: 138 -----FKDSPLITSPSMRDYHLPENQAGRGKYSFEALGTACEAADEPDEKYFTYQLADAA 192 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 +D++ + + ++P+ + V + +PH PF P +Y + Y ++L + Sbjct: 193 IDYIDKTEKKNKPWFLAVGFTKPHLPFVAPKKYWDLYKRSDFKLASNPNMPKNAPTQASH 252 Query: 231 LWAQAM--------PSPVGDD-GLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 W + PV DD L Y+AC F D IGR+++ L R+NT VI Sbjct: 253 QWHELRKMYNDIPQTGPVPDDKALELKHGYYACVSFTDAMIGRILDYLDTNNLRKNTTVI 312 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALA 339 DHG +G H L K A ++ PLI+ + + V +D+ P++ LA Sbjct: 313 LWGDHGWQLGEHGLWCKHAN-FETSLNTPLIVSAAGQNAQGPSKALVEFVDIYPSLCDLA 371 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TS 398 KP L G++ + + + + H G I ++ +++ N T+ Sbjct: 372 GFTKPPHLQGKSFAPLLKKPNTKWKSAVFSRYHA--GDSIHTNRFLYTEWRNKSNGNITA 429 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 LYD + DP+E N+ + +A++ K+ L ++D Sbjct: 430 RMLYDHQRDPDENFNIAANPEYAELVKKLSKRLQAHIDSWN 470 >UniRef50_Q1GMK9 Choline sulfatase n=8 Tax=Alphaproteobacteria RepID=Q1GMK9_SILST Length = 504 Score = 354 bits (908), Expect = 5e-96, Method: Composition-based stats. 
Identities = 126/492 (25%), Positives = 207/492 (42%), Gaps = 22/492 (4%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M PN L M D + + L+ N+ LAA RF + YT SP+C P RA Sbjct: 1 MTLPNILIFMVDQLNGTLFPDGPAEWLHAPNMKKLAARSTRFRNCYTASPLCAPGRASFM 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G + +G + N +I T + + AGY+TC GK H G D E D Sbjct: 61 SGQLPSATGVYDNAAEFASSIPTYAHHLRRAGYYTCLSGKMHFVGPDQLHGFEERLTTDI 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 G + I W + + SV I + ++ A + AR Sbjct: 121 YPPDFGWTPDYRKPGERIDWWYHNMGSVTGAGVAEISNQMEFDDEVAFHATQKIYDLARG 180 Query: 181 D--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 P+ + VS+ PH P+ +Y + Y D + + E A N+ H + A Sbjct: 181 KDARPWCLTVSFTHPHDPYVTRKKYWDLYEDCPHLMPEVADLGYENQDPHSKRIFDANDW 240 Query: 239 PVGD----DGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHK 293 D D YF ++DD+IG V+ AL +++ +T +++ SDHG+M+G Sbjct: 241 RNFDITEEDIRRSRRAYFGNISYLDDKIGEVMEALEGTRQDKDTIILFVSDHGDMLGERG 300 Query: 294 LISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILP---GE 350 L K + Y+ +R+P++I +P V PVS+ID+ PT+ LA + E++P GE Sbjct: 301 LWFK-MSFYEGSSRVPMMISAPNMTPGLVCDPVSNIDVCPTLCDLAGVSMSEVMPWTAGE 359 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 +++ + + +E+ + + P+ + +KL L D+L+D DP+E Sbjct: 360 SLVPLGQG---GTRSTPVAMEYAAEASYAPMVSLRSGRYKLNLCALDPDQLFDLDADPHE 416 Query: 411 MHNLIDDIRFADVRSKMHDALLDY--MDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRP 468 NL D + + + +D+ R+ Q R W R G F P Sbjct: 417 RVNLAKDPTHHEAYQALKAIAAERWDLDRFDADVRASQ--ARRWVVYEALRQGGYF---P 471 Query: 469 QDGYSPVVRDYD 480 D Y P+ + + Sbjct: 472 WD-YQPLQKASE 482 >UniRef50_B6HPN7 Pc22g01020 protein n=15 Tax=Eukaryota RepID=B6HPN7_PENCW Length = 589 Score = 354 bits (908), Expect = 6e-96, Method: Composition-based stats. 
Identities = 103/427 (24%), Positives = 182/427 (42%), Gaps = 16/427 (3%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+PN L++M D A ++ + P+ T N+D LA G+ F+SAY SP+C P+R + Sbjct: 4 KKPNILYIMADQMAAPLLSLHDKNSPIKTPNLDRLAEGGVVFDSAYCNSPLCAPSRFVMV 63 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G ++ G + N + T Y + GYHT GK H G D E + Sbjct: 64 SGQLPSKIGAYDNAADLPADTPTYAHYLRREGYHTALAGKMHFCGPDQLHGYEQ--RLTS 121 Query: 121 DYWFDGANYLSELTEKEIS-LWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D + + E +I W + ++SV + + + ++ +L R Sbjct: 122 DIYPGDYGWSVNWDEPDIRADWYHNMSSVMEAGPVVRTNQLDFDEEVIYKSTQYLYDHVR 181 Query: 180 A--DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW---AQ 234 ++PF + VS PH P+ E+ + Y D L + + H + Sbjct: 182 QRNEQPFCLTVSMTHPHDPYAMTKEFWDLYNDVEIPLPKNGAIPHDQQDAHSQRVLKCID 241 Query: 235 AMPSPVGDDGL-YHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 + D+ + Y+A +VD +G+++ L ++T +++T DHG+M+G Sbjct: 242 LFNKEMPDERIRAARRAYYAACTYVDTNVGKLLRVLENTGMADDTIIVFTGDHGDMLGER 301 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEI--LPG 349 L K +++ R+P ++ +P+ ++V VS +DLLPT LA + L G Sbjct: 302 GLWYK-MTWFENSARVPFLVHAPKHFAPKRVSENVSTMDLLPTFAELAGAKLISELPLDG 360 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +++ G + + E+ G P+ +K + + LYD NDP Sbjct: 361 VSLVPYLTG-GEGLRTDTVYGEYMGEGTQAPLMMIRRGRWKFIYSTIDPPMLYDLVNDPE 419 Query: 410 EMHNLID 416 E NL Sbjct: 420 ERTNLAA 426 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 353 bits (907), Expect = 7e-96, Method: Composition-based stats. 
Identities = 114/509 (22%), Positives = 190/509 (37%), Gaps = 86/509 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+ D + CY + + NID A EG++ S Y P C+P+RAGL TG Sbjct: 34 RPNIMVVLCDDLGYGDLACYGHPVIQSPNIDRFAKEGLKLTSCYAAHPNCSPSRAGLMTG 93 Query: 63 IYANQSGPWT-----NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + G + + + K T+ + AGY TC++GKWHL+G P + Sbjct: 94 RTPFRVGIYNWIPMLSPMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNMVGQPQPSD 153 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D+WF N E + RN + +++ A ++L Q Sbjct: 154 HGFDHWFSTQNNALPTHENPFNFVRNARPV--------GPLQGFASQLVADEAEEWLTQL 205 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF M V + EPH P + + Y + + P HH Sbjct: 206 RDKEKPFFMFVCFHEPHEPIASAERFRKLY----------TAPEGSTLPAHH-------- 247 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHG-------EMM 289 +DD GR++ L + RENT +I+TSD+G Sbjct: 248 ---------------GNVTQMDDAFGRILKTLDDQKLRENTLIIFTSDNGPAITRRHPHG 292 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKP--E 345 + L K A Y+ R+P I++ P+ D PV +D+LPT+ A+ADI P Sbjct: 293 SSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGVDILPTLCAVADIPAPTDR 352 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD------ 399 +L G NIL + E + ++ + Y + ++KL+ L Sbjct: 353 VLDGTNILPLLEGKPILRKKPLYW--QFNRAKNDAKVALRDGEWKLLAKLNVPSPKPSGG 410 Query: 400 -----------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 ELY ++D E + + + ++ KM + D+++ Sbjct: 411 ITTEEIDAVKNAKLEGFELYHIQSDIAETTDRAESEQ--EILKKMKQQMQAIFDEVQAEA 468 Query: 443 RSYQ-WSLRPWRKDARPRWMGAFRPRPQD 470 + W + + + Sbjct: 469 PRWPAWEFARYEGKILSEYYRKQEEAEKQ 497 >UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R014_9PLAN Length = 475 Score = 353 bits (906), Expect = 9e-96, Method: Composition-based stats. 
Identities = 104/468 (22%), Positives = 171/468 (36%), Gaps = 67/468 (14%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + ++ D + + C T +I+ LAA G+RF AY+ VC+P RA + TG Sbjct: 41 PNIVVILIDDMGFSDLSCMGSTYYETPSINKLAASGMRFTHAYSACTVCSPTRAAVLTGK 100 Query: 64 YANQS-------GPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD 107 Y + G +N N T+ GY T IGKWHL Sbjct: 101 YPARLHLTDWIPGQMSNKTKLKLPDWNKQLNLEEITLAELLGAHGYTTASIGKWHL---- 156 Query: 108 YFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 G EC P G N + RNG+ + R++ Sbjct: 157 --GPPECEPTRQGFSLNIGGNSKGQPPSYFFPYERNGVLLPGLAEGKP---NEYLTDRLT 211 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 + F+++ +PF + + + H P E + KY + Q Sbjct: 212 DACEAFIEE--NQSKPFFLYLPHYCVHTPLQAKPELIAKYEAKNAQFPGNPQ-------- 261 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHG 286 H Y A + +D +GR++ L + + T VI+TSD+G Sbjct: 262 -------------------HEAKYAAMVESLDQSVGRIMAKLDALDLTKKTIVIFTSDNG 302 Query: 287 EMM-----GAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALA 339 ++ + + Y+ R+PLI+ P D P +DL PT+ L+ Sbjct: 303 GLVLREITSNLPARAGKGSAYEGGVRVPLIVSYPPMIKPGTTCDVPAISMDLFPTLAELS 362 Query: 340 DIEKPEILPGENILAVKE--PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 + + G++I+ + E P + H GG P +++LV Sbjct: 363 GAKYSHDIDGKSIVPLLEEKPDAFAARPLYWHYPHYHGGGATPYSAMRVGNYRLVEFFED 422 Query: 398 SD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 ELYD +D EM NL + D+ K+H L+ + + + + Sbjct: 423 GRLELYDLAHDIGEMKNLAQEK--PDLTEKLHRQLIAWRKSVDAQYAT 468 >UniRef50_B1KD82 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD82_SHEWM Length = 526 Score = 352 bits (904), Expect = 2e-95, Method: Composition-based stats. 
Identities = 122/466 (26%), Positives = 199/466 (42%), Gaps = 38/466 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+MTD N++G + T N+D LA+EG + +AYT +P+C+P+R FT Y Sbjct: 24 NLLFIMTDEMKWNVMGVAGHPVVKTPNLDRLASEGTYYKTAYTVAPICSPSRRSFFTSRY 83 Query: 65 ANQSGPWTNN--VAPGKNISTMGRYFKDAGYHTCYIGK------WHLDGHDYF--GTGEC 114 + G N+ + K GY T GK WH G D F + E Sbjct: 84 THVHGVIDNSKQALANDGEVDLQTILKHQGYRTAISGKLHFYPEWHDWGFDEFWARSSEG 143 Query: 115 PPEWDADYWFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 P + + A + + + + S+ DL + + ++++A+D+ Sbjct: 144 PNRLETYRQYMVAKHGDDAFKPIKGSVTYPKDPLGHDLGRYRFGKEDFETYWLTDKALDY 203 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 L + +PF + +SY+EPH P+ Y Y + A + Sbjct: 204 L--ARKEKKPFFLFLSYNEPHSPYMVTEPYASMYDPKTLPVPVIPASAKAERKVALEKKI 261 Query: 234 QAMPSPVGDDGLYHH---PLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM 289 + + DD Y VDD +GRV++ L +NT V++T+DHG M+ Sbjct: 262 KGKSRHLIDDEQMMRDLTAQYLGHVSNVDDNVGRVLSYLDSSGLADNTIVVFTADHGNML 321 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGER--------RQVDTPVSHIDLLPTMMALADI 341 G H KG M++ +RIPLIIR+ + R R V+ V ID++PT++ + DI Sbjct: 322 GDHGKWFKG-VMHEGSSRIPLIIRAGKHTRYAKVMNRGRVVEQVVESIDVMPTLLEMLDI 380 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-- 399 + P + GE++L++ + NR + F ++ DFKL++ Sbjct: 381 KAPRGMQGESLLSLTAGEAKNWK-NRAFSQRSDF-------MFIEGDFKLIMPAKAGKKG 432 Query: 400 --ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 ELY+ NDP E HNL + M ++ + P R Sbjct: 433 KLELYNLANDPLENHNLAGMTEYQAKVKSMQQSIQVWQADKPAPIR 478 >UniRef50_A3HWG3 Choline sulfatase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HWG3_9SPHI Length = 505 Score = 352 bits (903), Expect = 2e-95, Method: Composition-based stats. 
Identities = 106/466 (22%), Positives = 190/466 (40%), Gaps = 37/466 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAY----TCSPVCTPARA 57 ++PN LF+ D Q + +G + T ID L EG RF++AY +C +RA Sbjct: 41 QKPNVLFLFADDQRADALGINGNPYIQTPTIDQLGREGSRFSNAYVMGGVHGAICMSSRA 100 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 LF+G TM F AGY T GKWH Sbjct: 101 MLFSGK------NLYKVTDKLSGEHTMTMSFAAAGYRTFGTGKWH----------NEKEA 144 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 ++A + YL + + R+ D + + + A+DF++ Sbjct: 145 FEASFQEAKNVYLGGMADHYDLPLRD---YGADGKLGEPTRKGFSTEQFAQAAIDFIKDH 201 Query: 178 --ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 D+PF V++ PH P++ Y+ Y D L + +H + + Sbjct: 202 GQRNTDQPFFCYVAFTAPHDPYSPEANYINHYPDGTLPLPGNYMPYHPFEFDHLTVRDEN 261 Query: 236 MPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAH 292 + + L Y+A +D QI +++N L Q +NT ++Y +D+G G+H Sbjct: 262 LTGWPRKPEVIQMILSDYYALVTHLDTQIAKILNTLKETGQYDNTIIVYAADNGLAAGSH 321 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L+ K ++Y+ +++PLII+ P + +++D DL PT+ LA I P + G + Sbjct: 322 GLLGK-QSLYEHSSKVPLIIKGPGVPQDQELDAFAYIHDLYPTLAELAGIPDPSDIDGVS 380 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNE 410 ++ V V + VR +KL+ +L+D DP E Sbjct: 381 LVPVITGEQDGVRDALFTSYRG------TVRAVRNKKYKLIRYPERDYTQLFDLDADPLE 434 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA 456 ++NL ++ + +S+M + + + + +D + ++P + D Sbjct: 435 INNLAENTEYQSKKSEMFELMEKWQNSFQDTVKLTADKIKPMKYDP 480 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 351 bits (902), Expect = 3e-95, Method: Composition-based stats. 
Identities = 106/445 (23%), Positives = 178/445 (40%), Gaps = 58/445 (13%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + + D Q + L+T N+DSLA EG++FN Y + VC P RA TG Sbjct: 43 PNIVIFLADDQGWGDLSHNGNTNLHTPNVDSLAKEGVKFNRFYVGA-VCAPTRAAFLTGR 101 Query: 64 YANQSG---PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 Y ++G T + T+ + FK AGY T GKWH + +D Sbjct: 102 YHARTGTIGVSTGQERFNSDEYTIAQAFKAAGYATGAFGKWHNG--TQYPNHPNAKGFDE 159 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Y F + W + + + D + ++++A+ F++Q + Sbjct: 160 YYGFTSGH------------WGHYFSPMLDHNGTFVKGNGYITDDLTDKAMAFIEQQVQN 207 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 +PF + Y PH P P +Y +++ D +L + D +P+H R Sbjct: 208 HKPFFAYLPYCTPHSPMQVPDQYWDRFKDKQLKLHNREPDR--EQPDHLRAA-------- 257 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM--GAHKLISK 297 A + VD +GRV+ L + ++T VIY SD+G + K Sbjct: 258 -----------LAMCENVDWNVGRVLKKLNSLRITDDTIVIYFSDNGPNGVRWNGDMKGK 306 Query: 298 GAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADI--EKPEILPGENIL 353 ++ + R P +IR P ++V+ IDLLPT+ LA I +P+ + G ++ Sbjct: 307 KGSLDEGGVRSPFVIRWPGHLPAGQEVNQIAGAIDLLPTLTDLAGIKRPEPKPIDGVSLK 366 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHN 413 + E F TD ++L ELYD DP + +N Sbjct: 367 PLMLNSKADWP------ERMIFSSLRNRVSVRTDQYRLSR----KGELYDMHADPGQRNN 416 Query: 414 LIDDIRFADVRSKMHDALLDYMDKI 438 + ++ +K+ A+ D+ + Sbjct: 417 IAKQK--PEITAKLQQAVTDWRQSV 439 >UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UL93_RHOBA Length = 470 Score = 351 bits (902), Expect = 3e-95, Method: Composition-based stats. 
Identities = 110/467 (23%), Positives = 179/467 (38%), Gaps = 77/467 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++P+ LF+M D + C L T NID+LA G+RF++AY S VCTP RA L T Sbjct: 45 EQPHILFIMADDMGWKDLHCQGNDVLRTPNIDALAEAGVRFDNAYAGSTVCTPTRASLMT 104 Query: 62 GIYANQSGPWTN-------------------NVAPGKNISTMGRYFKDAGYHTCYIGKWH 102 G+ + + N +TM K AGY T + GKWH Sbjct: 105 GLAPARLHITQHGADSKSFWPDDRLIQPPPTNHELPHETTTMAERLKAAGYTTGFFGKWH 164 Query: 103 LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 L G + P E D G T + E Sbjct: 165 LGGDKKY----WPTEHGFDVNVGGCGLGGPPTY---------FDPYRIPALPPRKEGEYL 211 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 R+++ + F+++ D+P + + PH+PF P + +E Y Sbjct: 212 TDRLADETIAFMRR--EKDKPMFVCLWTYNPHYPFEAPEDLIEHYKGK------------ 257 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 + +P+Y + D +GRV+ L + T V++ Sbjct: 258 -------------------EGTGLKNPIYGGQIEATDRGVGRVLRELDSLGIADETLVVF 298 Query: 282 TSDHGEMMGA---HKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMM 336 TSD+G GA L +++ R+PLI+R P +TPV +DL T++ Sbjct: 299 TSDNGGWSGATDNRPLREGKGFLFEGGLRVPLIVRWPGVTEAATVNETPVVSMDLTATIL 358 Query: 337 ALADIEKP--EILPGENILAVKEPRGVMVEFNRYEIEHDSFGG-FIPVRCWVTDDFKLVL 393 A + E L GE++ + + + + H +F P + +KL+L Sbjct: 359 DAAGVSLANGESLDGESLRPLFSGGKLERDALYFHYPHFAFHKDNRPGSVIRSGQYKLIL 418 Query: 394 NL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 S ELYD +ND +E +L DV ++ L+++++ Sbjct: 419 RHDDDSVELYDLQNDLSETSDLAA--VHPDVAQELKGRLMEWLEATG 463 >UniRef50_B7ACM6 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7ACM6_9BACE Length = 534 Score = 351 bits (902), Expect = 3e-95, Method: Composition-based stats. 
Identities = 110/447 (24%), Positives = 199/447 (44%), Gaps = 20/447 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +++D N++GC ++T N+D+LA GI F S Y SP+ P+R L TG Sbjct: 62 RPNIVLIISDEHNGNIMGCMGDPYIHTPNLDALAENGILFKSHYCASPISGPSRQSLTTG 121 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY-FGTGECPPEWDAD 121 Y + W N V +I+++ R + GY T G +G +Y + + + + Sbjct: 122 KYVSHHNVWGNTVGCPNDITSLPRIMQQQGYETVLTGGMKYNGLNYGWNSYKANDGYKIA 181 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN-----RAVDFLQQ 176 Y ++E +N+ ED+ T + ++ A+ +++ Sbjct: 182 YDKKKKAGTDIAQKRERIKAGVFVNNKEDIGKEFTPMGATDMNLFTDIQRSQDAIAYIKN 241 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA--Q 234 A +PF ++V PH+P E ++KY D + + + + N P +++ + Sbjct: 242 RAHVKQPFFLLVGLMAPHYPLQATQELVDKYKD-KIPMPKIPKGYIENLPLNYKHLRNTR 300 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMGAHK 293 + + + Y+A ++ D QIG++I + +NT +IYTSDHGE +G H Sbjct: 301 KLENVPKEIVKKARECYYARVEWADSQIGKIIKTINESPMADNTIIIYTSDHGENLGEHG 360 Query: 294 LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L K +YD ++PLII +P+ ++ +DL+ T+ L + P GE+ Sbjct: 361 LWWKN-CLYDCSAKVPLIISNPKRWKGKQTRSKNTESVDLVQTIADLGGTKVPNDWDGES 419 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL------NLFTSDELYDRR 405 +L + E + + E+ + + + +K V N ELYD Sbjct: 420 MLPLLEDSTYNWK-DFAICEYYAGYIASGITMYRQGKWKYVYHARMDENHGPEIELYDMD 478 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALL 432 NDP E+ NL D ++ + +H L+ Sbjct: 479 NDPEELTNLARDNQYKMLIQDLHQELI 505 >UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UQ05_RHOBA Length = 525 Score = 351 bits (902), Expect = 3e-95, Method: Composition-based stats. 
Identities = 104/486 (21%), Positives = 178/486 (36%), Gaps = 74/486 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L + D +GCY T ID+LA GIRF +AY PVC+P RA + T Sbjct: 52 SRPNVLLFLVDDLGWADLGCYGSTYHETPQIDALAESGIRFTNAYAACPVCSPTRASIMT 111 Query: 62 GIYANQSGPW-------------------TNNVAPGKNISTMGRYFKD-AGYHTCYIGKW 101 G + + + + T+ + +D A Y T ++GKW Sbjct: 112 GRHPVRVDITDWIPGMSTDRAQNPRFQHVDDRDNLALDEVTIAEHLRDAADYQTFFLGKW 171 Query: 102 HLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 HL G P + G + S W+N + + Sbjct: 172 HLG-----DVGHLPTDQGFQINIGG-GHKGSPPGGYYSPWKNPYLKAKQ-------DGEY 218 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 R+++ AV + +R D+PF M++SY H P T ++ + ++ Sbjct: 219 LTTRLTDEAVSLVDTASREDKPFFMMMSYYNVHSPITPDKRTIDHF-----------EEK 267 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 +N PE G +P Y + VD +GR++ AL +NT VI Sbjct: 268 QSNSPELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDTSVGRIMKALKEHGVDDNTLVI 327 Query: 281 YTSDHGEM--------MGAHKLISKGAAMYDDITRIPLIIRSPQ------------GERR 320 + SD+G + L + +Y+ R PL++R P+ + + Sbjct: 328 FFSDNGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLLVRLPKTMPGGATNETVSHQPK 387 Query: 321 QVDTPVSHIDLLPTMMALADIE--KPEILPGENILAVKEPRGVMVEFN----RYEIEHDS 374 VD+ DL PT++ + + G ++L + + + H Sbjct: 388 TVDSVACSTDLFPTILDVVGLPLQPESHADGISLLPAIAGEAAETDSSPRDLHWHYPHYH 447 Query: 375 FGGFIPVRCWVTDDFKLV-LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + P ++KL+ + ELYD D E +L + +++ DAL Sbjct: 448 GSLWRPGAAIRRGNYKLIEFYETDTAELYDLSVDMGETKDLSK--TEPERFAELRDALRQ 505 Query: 434 YMDKIR 439 + ++ Sbjct: 506 WQTEMN 511 >UniRef50_C1ZIM5 Arylsulfatase A family protein n=2 Tax=Planctomycetaceae RepID=C1ZIM5_PLALI Length = 523 Score = 351 bits (902), Expect = 3e-95, Method: Composition-based stats. 
Identities = 117/451 (25%), Positives = 183/451 (40%), Gaps = 34/451 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRPN L + D Q + + G P + T + SLA G F +A+ +P+C P+R L Sbjct: 46 KRPNVLMIAIDDQ-NDWIEPLGGHPLVKTPQLKSLAERGTVFLNAHCQAPLCNPSRTSLL 104 Query: 61 TGIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G+ + +G PW +V T+ + F AGY T GK G G Sbjct: 105 LGLRSTTTGIYGLSPWFRDVPALSGRLTLPQAFGKAGYTTLSTGKIFHGG----GGKPKD 160 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D W ++ I + V+ H+D +I++ A++ L+ Sbjct: 161 RLKEFDEWGPAGGVGKRPEKRLIQPPPHSNPLVDWGAFPHLDSEKGDT-QITDWAIEKLK 219 Query: 176 QPA-------RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 Q +PFLM V Y PH P E+L Y D L +DD + P Sbjct: 220 QRQVQQSSSTGESKPFLMCVGYFLPHVPCYVTPEWLAMYPDDDSILPFIEKDDRKDTPRF 279 Query: 229 HRLWAQAMPSPVGDDGLYHH------PLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 +P P H Y A +VD QIGR++ AL NT ++ Sbjct: 280 SWYLHWRLPEPRLKWLQQHEHWRSLVRSYLASTSYVDAQIGRLLAALEATGEANNTLIVL 339 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALAD 340 SDHG +G + K +++ TR+PL+ P + PV +D+ PT+ L Sbjct: 340 WSDHGWHLGEKGITGKN-TLWERSTRVPLLFAGPGVLAGGKCVEPVELLDIYPTLAQLCQ 398 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE 400 +E P L G +++ + R I + G T D + + S+E Sbjct: 399 LEAPTDLEGVSLVPQLT--NPLAVRQRPAITSHNQGN----HAIRTRDHRYIRYADGSEE 452 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 LYD DP+E+ NL DD + ++ +++ L Sbjct: 453 LYDHLVDPHELKNLADDPAHSGLKKQLNSWL 483 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 351 bits (901), Expect = 3e-95, Method: Composition-based stats. 
Identities = 106/452 (23%), Positives = 181/452 (40%), Gaps = 67/452 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L +MTD Q V + + T D LA++G RF Y SPVC P R+ L T Sbjct: 29 ERPNVLLIMTDDQGWGDVRSHDNPLIETPQQDLLASQGARFERFYV-SPVCAPTRSSLLT 87 Query: 62 GIYANQ---SGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y+ + G +T+ FK AGY T GKWH H + + Sbjct: 88 GRYSLRTGVHGVTRGFENMRAEETTIAEMFKAAGYKTGAFGKWHNGRH--YPMHPNGQGF 145 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D + F G + W ++ + + +++RA+DF++Q Sbjct: 146 DEFFGFCGGH------------WNRYFDTNLEHNKQPVKTEGYITDVLTDRAIDFIKQ-- 191 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D+PF V Y+ PH P+ P +Y +KYA+ + + Sbjct: 192 NKDQPFFCYVPYNAPHSPWIVPEKYWDKYANKGLDDKARCA------------------- 232 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG--AHKLI 295 +A + VDD +GR++ L + +NT V++ +D+G + Sbjct: 233 -------------YAMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNSNRYNGNMR 279 Query: 296 SKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIE--KPEILPGEN 351 + ++++ R+PL +R P + V +HID+LPT++ L +E + L G++ Sbjct: 280 GRKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTADQPLDGKS 339 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFI-----PVRCWVTDDFKLVLNLFTSDELYDRRN 406 ++ + + R F I P TD ++ LYD + Sbjct: 340 LVPLLTNKSNKDWPQRMLFSDRLFRNSIPDDELPNGSVRTDRWR-AAYERGKWSLYDMQA 398 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 DP++ N+I+ V + A D+ + Sbjct: 399 DPSQKQNVIE--AHPAVIKDLSAAYRDWFKDV 428 >UniRef50_A6CG48 Sulfatase family protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CG48_9PLAN Length = 472 Score = 351 bits (901), Expect = 4e-95, Method: Composition-based stats. 
Identities = 110/448 (24%), Positives = 188/448 (41%), Gaps = 27/448 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ D + CY + +++ NID LA + F A+ P C +RA L T Sbjct: 21 ERPNVLFIAVDDLRPE-LACYGKQHIHSPNIDKLAESSVLFERAFCMVPTCGASRASLMT 79 Query: 62 GIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 GI ++ W AP N +TM FK GY+T +GK D PP Sbjct: 80 GIRPARNRFVNFLAWAERDAP--NATTMNTQFKQNGYYTASLGKIFHHPADNRQGWSEPP 137 Query: 117 EWDAD-YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W+ + ++ ++ + + ++ +A++ LQ Sbjct: 138 WRPKGVQWYQRPENQEKHAARQK---LGNKKKGPAWESADVPDNAYMDGVLAEKAIEKLQ 194 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 Q + ++PF + V + +PH PF P +Y + Y +L + E + + Sbjct: 195 QLEKQEQPFFLAVGFFKPHLPFIAPQKYWDLYDHDKIQLPANHKVPQDAPKESIHRFGEL 254 Query: 236 M-------PSPVGDDGLYHH-PLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 PV ++ + Y+AC + D QIG+++ L Q +NT V+ DHG Sbjct: 255 RAYADIPAKGPVSEETARNLIHGYYACVSYTDAQIGKLLAELDRLQLSDNTIVVLWGDHG 314 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPE 345 +G H L K + Y+ IPLI+R+P + + + + ID+ PT+ LADI +P+ Sbjct: 315 WNLGDHTLWCKH-SCYESSLHIPLIVRAPGIKGGERRSSLMESIDVYPTLCDLADIPQPK 373 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 L G++ +++ + E+ + + G I ++ L S LYD Sbjct: 374 HLKGQSFVSLM--KDSTAEWKQAAVSRYRNGDTIRTDTLRYTEYTLPKGKLVSQMLYDHS 431 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLD 433 DP E N+ + AD ++ L Sbjct: 432 TDPLENVNVSA--QQADAVKELSAQLKQ 457 >UniRef50_C2KTX6 Arylsulfatase n=2 Tax=Mobiluncus mulieris RepID=C2KTX6_9ACTO Length = 505 Score = 351 bits (901), Expect = 4e-95, Method: Composition-based stats. 
Identities = 115/462 (24%), Positives = 210/462 (45%), Gaps = 26/462 (5%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + MTD + +GC + T NIDSLAA+G F +T + +CTPAR+ + TG Sbjct: 14 NIINFMTDQHRIDTLGCLGNENAQTPNIDSLAADGCIFEKGFTPTAICTPARSSMLTGKL 73 Query: 65 ANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG-ECPP 116 + N + A ++ T + +D GY+ +GK+H + G + Sbjct: 74 PFKHLTLANPEWNIAYSTAIPEDDWTYTQQLRDDGYNVGMVGKYHCGTNLPDKFGCDDDT 133 Query: 117 EWDADYWFDGANYLSELTEKEI------SLWRNGLNSVEDLQ----ANHIDETFTWAHRI 166 W A+ + Y + L E + +WR L D E T+ I Sbjct: 134 YWGAENPVNNEKYTAWLEENHLPPVKAHDIWRGKLPGNRDGHIIAARLDQPEEATFERFI 193 Query: 167 SNRAVDFLQQPAR----ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 ++ ++ L+Q A+ +D+PF + V + PH P+ P E+ + L + D L Sbjct: 194 ADVSIAKLRQYAKDYRESDKPFSLDVHFFGPHLPYFLPDEWFDLIDPESIVLPKNFGDTL 253 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHH--PLYFACNDFVDDQIGRVINALTP-EQRENTWV 279 KP + +A + ++ + +Y+ +D +IGR++ + + ++T + Sbjct: 254 VGKPPIQQNYATYWSTSSFNNDQWRKLIAVYWGYVAMIDFEIGRILEVVRELDLMDDTAM 313 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMAL 338 + +DHGE G+H+L KG AMYD+I RIP I+R P + V+ +DL T++ + Sbjct: 314 FFCADHGEFTGSHRLNDKGPAMYDEIYRIPFIVRIPGLTHGNRCREYVNLLDLTATIIDI 373 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 A + + G++++ + + E + R DD+KL+++ + Sbjct: 374 AGGDTSRVEDGKSLVRLAAGKPEADWRQDIVCEFHGLHFPVQQRMLRNDDYKLIVSHESI 433 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +ELYD + DP+EM+N+ + ++R +M L + + D Sbjct: 434 NELYDLKRDPDEMNNVYAAPAYDEIRRQMACELYIQLKERGD 475 >UniRef50_A6DSH1 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH1_9BACT Length = 462 Score = 351 bits (901), Expect = 4e-95, Method: Composition-based stats. 
Identities = 105/445 (23%), Positives = 195/445 (43%), Gaps = 24/445 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N LF+M+D + + Y + T N+D L ++ + F+ AY+ P+C P+R + +G Sbjct: 22 KMNVLFIMSDDLNVD-IASYGHPIVKTPNLDKLRSKSVLFSQAYSQYPLCNPSRNSILSG 80 Query: 63 IYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGKWHLD-------GHDYFGTG 112 +Y SG +N K +I+T+ FK GY GK G TG Sbjct: 81 MYPGTSGCLSNADQLRKTAPDITTLPEAFKKQGYEVISTGKIFHHEDPQSWTGITNLRTG 140 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 + P+ DY F + T E G ++ E + R + + Sbjct: 141 KLHPQ-GKDYNFYRPAFDERKTIGEGRNLTEGELGFMTWRSVTEKEDILFDSRTARWTMQ 199 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLANKPEHHRL 231 L++ A ++PF + V + PH PF P + + Y +L E Q+ ++ + Sbjct: 200 HLEKLAEDEKPFFLGVGFSRPHDPFFAPKRFFDMYPMESIKLPETPQNASKVPMMAYYDV 259 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG 290 + +A L Y+A ++D+Q+G V++ L NT V++ SDHG +G Sbjct: 260 FKRAFDKMDTQKRLEFVRSYYASISYMDEQLGLVLDKLEALNLSNNTLVVFISDHGYQVG 319 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILP 348 +K +++ R PL+I +P+ + +VD V ID+LPT+ + + P+ Sbjct: 320 EKGYFNK-TLLFERSCRAPLMISNPKLKSSVNKVDKIVEFIDVLPTITEITSVPTPKTAE 378 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDP 408 G +++ + + + V E + ++ R T+ ++L+ + LYD + DP Sbjct: 379 GRSLIPLMKGKKV-------EWKEEAISYVNADRSIRTERYRLINWRGQKEALYDHQRDP 431 Query: 409 NEMHNLIDDIRFADVRSKMHDALLD 433 E N +D+ + +V ++ L + Sbjct: 432 GEHFNQVDNPEYKEVLKRLRSKLKE 456 >UniRef50_A3HTC7 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HTC7_9SPHI Length = 1174 Score = 351 bits (900), Expect = 5e-95, Method: Composition-based stats. 
Identities = 116/525 (22%), Positives = 207/525 (39%), Gaps = 52/525 (9%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + RPN +F++TD Q + +G + + T +D LA G F +A +P+C +RA LF Sbjct: 29 LNRPNIIFILTDDQRFDALGYAGNQFVQTPEMDRLAESGTYFETAIVTTPICAASRASLF 88 Query: 61 TGIY--ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 TG+Y A+ T N+ + K++GY+T + GK+ + + ++ Sbjct: 89 TGLYERAHNFNFQTGNIRAEYMEESYPTILKNSGYYTAFFGKYGVRYDNLNN------QF 142 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D +D N + +T +A+DF+ + Sbjct: 143 DEYESYDRNNQYPDKRGYYFKTI--------------AGDTVHLTRYTGQKALDFIDKAP 188 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYL------EKYADFYYELGEKAQDDLANKPE---HH 229 D+PF + +S+ PH P +Y + + +D+ Sbjct: 189 E-DKPFSLSLSFSAPHAHDGAPDQYFWQTTTDPLLQNTTIPGPDLGEDEFFQAQPQFVRD 247 Query: 230 RLWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG 286 + Y H L Y+ +D +I ++ L + + NT +I D+G Sbjct: 248 GFNRLRWTWRYDTEEKYQHSLKGYYRMISGIDLEIAKIREKLKEKGLDKNTVIIVMGDNG 307 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPE 345 +G +L K MYD+ R+PLII P+ G + + V +ID+ T+ LA +E PE Sbjct: 308 YFLGERQLAGKW-LMYDNSIRVPLIIYDPRSGNHQDIKDMVLNIDVPATIADLAGVETPE 366 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEH-DSFGGFIPVRCWVTDDFKLVLNLFTS--DELY 402 G++++ + E + + + IEH F P T+++K + +ELY Sbjct: 367 SWQGKSLMPIVEGKSQKIGRDTILIEHIWEFENIPPSEGVRTEEWKYFRYVNDKKVEELY 426 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL------------R 450 +DP E++NLIDD + D+ K+ + + K + FR +L Sbjct: 427 HLVDDPKEINNLIDDPEYKDIAIKLRSKTDELISKFGNKFREAPSNLTVELIRKPQNAVE 486 Query: 451 PWRKDARPRWMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEEKKQ 495 W + + Q Y +V + + T V Q Sbjct: 487 VLDTQPEFGWKVSDYSKSQSAYQILVSSSEKLIETNTGDVWNSGQ 531 >UniRef50_B5CWC2 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC2_9BACE Length = 515 Score = 350 bits (899), Expect = 6e-95, Method: Composition-based stats. 
Identities = 110/450 (24%), Positives = 188/450 (41%), Gaps = 35/450 (7%) Query: 2 KRP-NFLFVMTDTQATNMVGCY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 ++P N LF+ D + VG T N+D LAA G+ F SAY +PV +RA L Sbjct: 23 EKPKNVLFIAVDDL-NDWVGFLKGHPNTRTPNMDRLAAMGMVFESAYCAAPVSNASRAAL 81 Query: 60 FTGIYANQSGPWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 +G + +G + N K+ T+ +YF + GY++ GK H G Sbjct: 82 LSGFRTSTTGVYGNAEFMRESPVLKDAVTLPKYFSNHGYYSMARGKIF---HQPMGPWGD 138 Query: 115 PPEWDADYWFDGANYLSELTE----KEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 P WD+ G + + + G V D +DET T + + A Sbjct: 139 PQSWDSQENLGGLSLNPPRQKGKQANGLEKQTTGGAVVLDWAGVDVDETKTNDYLNAQWA 198 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 L + D+PF M PH P+ P +Y +++ +L ++ + K Sbjct: 199 AQEL--MKKHDKPFFMACGIFRPHLPWYVPQKYFDRFKLEDIQLPKQDPMETMEKLSPRA 256 Query: 231 LWAQAMPSPVGDDGLYHH--------PLYFACNDFVDDQIGRVINALTPE-QRENTWVIY 281 L P + + Y AC + DD IG++++AL +R+NT V++ Sbjct: 257 LSMTGYNKPEHEFNILKKYGMEKEAVRAYLACISYADDCIGQIVDALEKSPERDNTIVVF 316 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALA 339 DHG +G K+ + +++D +P+II +P PVS +DL PT+++LA Sbjct: 317 WGDHGWHLGE-KMRYRKFSLWDRSCHVPMIIVAPGVTKPGSVCKQPVSLLDLYPTLVSLA 375 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 + + G +I + + + ++ ++ S+ Sbjct: 376 GLPANPLNEGNDITPLLQNPNAHWTKPAITTLAQNE------HSICDGRYRYIIYRDGSE 429 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHD 429 ELYD ++DP E NL D ++ADV++ + Sbjct: 430 ELYDHKHDPLEWKNLAADKKYADVKAHLRT 459 >UniRef50_Q5LRB5 Choline sulfatase n=1 Tax=Ruegeria pomeroyi RepID=Q5LRB5_SILPO Length = 498 Score = 350 bits (899), Expect = 7e-95, Method: Composition-based stats. 
Identities = 121/436 (27%), Positives = 192/436 (44%), Gaps = 18/436 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D M+ G T+++ LA ++F +AYT SP+C PAR+ TG Sbjct: 16 RPNILLIMADQMTPFMLEACGGTGARTRHLTRLAGRAVQFTNAYTPSPICVPARSCFMTG 75 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +Y + +G + N + T Y +AGY T GK H G D + + D Sbjct: 76 LYTSTTGCYDNGDPYHSFLPTFAHYLTNAGYETVLSGKMHFIGADQLHGFQ--RRLNPDI 133 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQAN---HIDETFTWAHRISNRAVDFLQQPAR 179 + G + L + ++ + + L N + + RA+++L+ Sbjct: 134 YPSGFLWSYPLPPDGDASFQAFDFTPQYLAENIGPGWSKELQYDEETQFRALEYLRHA-- 191 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE-------HHRLW 232 D P+++ VS+ PH P+ P Y E Y D L + D A E H L Sbjct: 192 PDTPWMLTVSFTNPHPPYVVPRPYWEMYKDADIPLPDYPADMDARYSEFDHALRRWHGLH 251 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGA 291 + + + + A +VDD+IG ++ L QR+ T +I TSDHGEM+G Sbjct: 252 QRGHEVRDPRNLIAMRRGFAALAHYVDDKIGALLEVLDETGQRDETVIIVTSDHGEMLGE 311 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 LI K ++Y+ RIPLII P +VDTPVS +DL T++ L+ L G + Sbjct: 312 KGLIQK-RSLYEWSARIPLIIDLPGAAPGRVDTPVSLLDLPATLIELSGQTPVAPLEGRS 370 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 +L RG ++ E+ G P D+K ++ +LY+ DP E Sbjct: 371 LLGAV--RGQELDTVPIVSEYHGEGIMRPSFMVRLGDWKYHYCHGSAPQLYNLARDPGEW 428 Query: 412 HNLIDDIRFADVRSKM 427 HN + A+ +++ Sbjct: 429 HNRAGEPDLAETEARL 444 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 350 bits (899), Expect = 7e-95, Method: Composition-based stats. 
Identities = 110/500 (22%), Positives = 186/500 (37%), Gaps = 94/500 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F++ D Y + +T +ID LA++G+RF Y PVC+P RA + G Sbjct: 34 RPNIVFILIDDMGWPDPVSYGNQFHDTPHIDQLASDGVRFTDFYAACPVCSPTRASIQAG 93 Query: 63 IYANQS-------GPWT---------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 Y + G W N I T G + A Y+T Y GKWHL Sbjct: 94 QYQARLHLTDFIPGHWRPFEKLIVPENAPHLPLEIVTPGELLQSANYNTAYFGKWHLG-- 151 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 + D Y + L R+ + I A + Sbjct: 152 ------------PESHNPDQQGYQTSLVTGG----RHFAPRFRTTPSTRIPNKAYLADFL 195 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 +++ ++F++Q +PF + +S+ H P + + KY KP Sbjct: 196 TDKTIEFIRQ--NKSKPFFVQLSHYAVHIPLEAKQQMIRKYQQK-------------PKP 240 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 + ++P+Y A VDD +GR++ AL + ENT VI+TSD+ Sbjct: 241 AYG----------------INNPVYAAMVAHVDDSVGRIVAALEELKLTENTVVIFTSDN 284 Query: 286 GEMMGAH----------KLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLP 333 G + + L + ++Y+ R+PLII+ P + P ID P Sbjct: 285 GGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVPLIIKWPGVAAAGKTCAEPTISIDFWP 344 Query: 334 TMMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 T +A E + G ++L + + + + + P D+KL Sbjct: 345 TFAEIAHTTLQEHQTIDGLSLLPLLKDPSSHLNREEIYFHYPHYHHSTPASAIRAGDWKL 404 Query: 392 VLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR----------D 440 + + ELY+ + D +E NL + + ++ L D+ + D Sbjct: 405 IEFFADGNLELYNLQQDLSETTNLAA--KNPEKAVELQQKLADWRTRTGAALPVKNPKYD 462 Query: 441 PFRSYQ-WSLRPWRKDARPR 459 P R+ + W+ R + R Sbjct: 463 PARASEFWNRRTNQPVPERR 482 >UniRef50_A6C1R0 Choline sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1R0_9PLAN Length = 492 Score = 350 bits (899), Expect = 7e-95, Method: Composition-based stats. 
Identities = 106/448 (23%), Positives = 176/448 (39%), Gaps = 36/448 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 +RPN LF+ +D Q + V Y + T N+D L G F +AY VC P+RA Sbjct: 33 ERPNILFLFSDDQRADAVAAYDNPHIQTPNLDQLVKAGFNFRNAYCMGSIHGAVCQPSRA 92 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L +G + K + T+ + K AGY T GKWH Sbjct: 93 MLNSGR------SLYHVPMDLKGVITLPQLLKQAGYETFGTGKWH-------------NH 133 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D+ + + L ++ E N + + AVDFL+Q Sbjct: 134 RDSFQKSFTTGTAAFIGGMSNHLKVPVVDLKEGKFENKRTGKKFSSELFVDAAVDFLKQQ 193 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK--AQDDLANKPEHHRLWAQA 235 A++PF V++ PH P P ++ Y + L + Q N R A Sbjct: 194 P-AEKPFYAYVAFTAPHDPRMPPETAMKVYENSPPPLPKNFMPQHPFNNGWLTGRDEALT 252 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKL 294 + Y+ +D QIGR++ L + + NT VI++SDHG +G+H L Sbjct: 253 GWPRQPEIVREQLAEYYGMITHMDTQIGRILQTLKDKDLDKNTIVIFSSDHGLALGSHGL 312 Query: 295 ISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 + K +Y+ + PLI + P + D V D+ PT+ L I+ P + G ++ Sbjct: 313 LGK-QNLYEHSMKSPLIFKGPGIPMNKSSDALVYLYDIFPTVCELTQIQVPSGVEGSDLA 371 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTSDELYDRRNDPNEMH 412 + + V + D +R + +KL+ +L+D + D +E+ Sbjct: 372 PIWRGKSERVRDTLFTTYEDL------MRAVRDERWKLIRYPQIDKTQLFDLKEDRHELK 425 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRD 440 +L + + KM L ++ + D Sbjct: 426 DLSEHPEQQERIKKMLAELKEWQKRTDD 453 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 350 bits (898), Expect = 8e-95, Method: Composition-based stats. 
Identities = 101/476 (21%), Positives = 179/476 (37%), Gaps = 46/476 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F+M D VG + T NID LA+EG+ F Y + + +P+R L T Sbjct: 29 RQPNVVFIMVDDYGWADVGYNGSRFYETPNIDRLASEGMIFTDGYAAASISSPSRVSLMT 88 Query: 62 GIYANQSGP------WTNNVAPGK-----------------NISTMGRYFKDAGYHTCYI 98 G Y ++G + + P + TM FK+ GY T ++ Sbjct: 89 GKYPARTGITDWIPGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHV 148 Query: 99 GKWHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGL-NSVEDLQANHID 157 GKWH + P D G S + + + + Sbjct: 149 GKWHCAEDSLY----YPQYQGFDVNIGGWLKGSPNGIRRSQGGKGAYCSPYRNPYLPDGP 204 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 E R+ + ++ ++ + AD+PF + +++ H P EY++ + +G Sbjct: 205 EGEFLTDRLGDESIKLIKNSS-ADKPFFLYLAFYAVHTPIEAKPEYVKYFKWKAQRMGLD 263 Query: 218 AQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-N 276 E ++ A+ + + Y A +D+ +GRV+ AL + N Sbjct: 264 TIVPFTRNLEWYKN-AEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQALKDNGLDKN 322 Query: 277 TWVIYTSDHGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVS 327 T V SD+G + A L + +Y+ R P II+ PQ TPV Sbjct: 323 TIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVEAGSVCHTPVV 382 Query: 328 HIDLLPTMMALADIEKP--EILPGENILAVKEPRGVMVEFN-RYEIEHDSFGGFIPVRCW 384 +D PT++ +A + + + G+++L + + + H G P Sbjct: 383 AVDFYPTLLDMAGLPLKSHQHVDGKSLLPLLKGDQAYDRGPIFFHYPHYGGKGDTPAGAV 442 Query: 385 VTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 D+KL+ ELY+ +ND +E +L D ++M L + Sbjct: 443 RMGDYKLIEFYEDGHVELYNLKNDISETRDLSK--TEKDKAAEMQKMLHRWRTDCN 496 >UniRef50_UPI0001C35789 arylsulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35789 Length = 520 Score = 350 bits (898), Expect = 8e-95, Method: Composition-based stats. 
Identities = 113/476 (23%), Positives = 197/476 (41%), Gaps = 42/476 (8%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +MTD + +G + T +DS+AA+GI F+ AY+ P C PARA L TG+ Sbjct: 35 PNIVLIMTDQMRGDCLGIAGHPDVKTPYLDSIAAKGILFDHAYSACPSCVPARAALHTGM 94 Query: 64 YANQSGPWTNNVAPGKNIS-TMGRYFKDAGYHTCYIGKWHLD------------------ 104 G N TM AGY+T +GK H+ Sbjct: 95 RQEHHGRVGYQDMVNWNYPHTMAGELAAAGYYTQCVGKMHVHPLRNLMGFHNIELHDGYL 154 Query: 105 --GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI-DETFT 161 D E + DY++ +L + + + G+ + I +E + Sbjct: 155 HAYRDPAAAWEESQKQADDYFY----WLKQELGADADVTDTGMECNSWVSRPWIYEEKYH 210 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA--- 218 + +S R++DFL++ +PF ++ SY PH PF P Y + Y D Sbjct: 211 PTNWVSTRSIDFLRRR-DTSKPFFLMASYLRPHPPFDAPQYYFDLYRDKQLTPPAVGDWE 269 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQR-EN 276 +D + + PV + + + Y+AC +D QIGR+I AL + +N Sbjct: 270 DEDFTGDYQRLGRIYDSATGPVDPELIRQAQIGYYACITHLDHQIGRLIQALVEYKLMDN 329 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-----GERRQVDTPVSHIDL 331 T +++TSDHGE + H L K Y+ RIP+++ P+ + D+ Sbjct: 330 TIILFTSDHGEELCDHHLFRKSRP-YEGSCRIPMLLSGPERLIHAAPGTVCHSVAELRDV 388 Query: 332 LPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 +PT++ A PE + G++++ + ++ + + G VT+ K Sbjct: 389 MPTLLDAAGAPIPETVDGKSMIPDPDGTLPVIRQW---LHGEHEAGVNSNHFIVTEHDKY 445 Query: 392 VLNLFTSDE-LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 V T E ++ D E+HN I D ++ + + L++ + + + + Q Sbjct: 446 VWYSQTGREQYFNLDEDRRELHNGIADTQYQERIGLLRGLLIEELKEREEGYSDGQ 501 >UniRef50_Q7UWE8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWE8_RHOBA Length = 488 Score = 350 bits (898), Expect = 9e-95, Method: Composition-based stats. 
Identities = 115/451 (25%), Positives = 185/451 (41%), Gaps = 26/451 (5%) Query: 3 RP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +P N L + D +GCY +++ NID LAA G+RF+ AY VC +RA L + Sbjct: 33 KPLNVLMIAVDDLRPE-LGCYGKSYMHSPNIDRLAASGMRFDRAYCQVAVCGASRASLMS 91 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE 117 G + W ++ T+ ++ GY T ++GK +H D E Sbjct: 92 GCRPETTQCWNFKTLLRSQMPDVLTLPQHLSRNGYETGFLGKVYHSASDDAAAWTVDANE 151 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 W G +Y+ EL K + N + ++ ++RAV L++ Sbjct: 152 WAPRDRSKGKSYVQELPRKRNPANSSEKNGPSIENGGDVPDSAYTDGHNADRAVALLERF 211 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ--- 234 + D+PF + V + +PH PF P +Y + Y ++ +D + P W + Sbjct: 212 STQDKPFFLAVGFLKPHLPFNAPAKYWDLYDRDDIKIP-SREDVVDGLPYARSSWGELKN 270 Query: 235 -----AMPSPVGDDGLYHH-PLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 A + D+ Y A ++D Q+G+V+NAL RENT V+ DHG Sbjct: 271 YTDIPAKTDMLDDEKTRELIHGYRAAVSYMDAQVGKVLNALEANGQRENTIVVLWGDHGW 330 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G K Y+ TR+PLI+ +P + + V +DL PT+ L ++ PE Sbjct: 331 YVGDFGDWCKHTN-YEIATRVPLIVSAPGVPAGETKSLVELVDLFPTLCELTELPVPEHC 389 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV--RCWVTDDFKLVLNLFTSDE----- 400 G++I V G+ V + S G PV TD F+ + T Sbjct: 390 QGKSIAGVVHDPGLSVRPAAFSQYKKSKLGVGPVLGTSIRTDRFRYTEYVSTKTGKLEDI 449 Query: 401 -LYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 L D DP N+ D + ++H Sbjct: 450 VLIDFDKDPGATRNVASDPAYQPFLPQLHAW 480 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 350 bits (898), Expect = 9e-95, Method: Composition-based stats. 
Identities = 107/443 (24%), Positives = 183/443 (41%), Gaps = 62/443 (13%) Query: 10 MTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQSG 69 M D + CY + NT ++D LAA GIRF ++ VC+P RAGL TG Y ++G Sbjct: 1 MADDLGYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAG 60 Query: 70 ----PWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 + N + KN T+ + +DAGY T GKWHL + + Sbjct: 61 IDGVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQRQYNPTFRGFQQFV 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Y +Y + L + W + + E H I++ A++F++Q + Sbjct: 121 GYVSGNVDYFAHLDGTGVFDWWHNAELNRE-------EQGYVTHLINDHALEFIRQ--QQ 171 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 ++PF + ++++ H P+ P + + ++ + D+AN Sbjct: 172 EKPFFVYIAHEAVHSPYQGPHD-QPMRKEGGGDIKSAKRKDIAN---------------- 214 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE--MMGAHKLISK 297 Y N +D IG++++ L E T++ + SD+G KL Sbjct: 215 ---------AYREMNTEMDKGIGQIVDVLKEVNLTEKTFIFFLSDNGANKNGSNGKLRGF 265 Query: 298 GAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEI--LPGENIL 353 ++++ R+P I P E D PV IDL+PT++ LA+ + P L G +++ Sbjct: 266 KGSLWEGGHRVPAIACWPGRIPEGTVCDEPVISIDLMPTILELANAKIPAGHKLDGVSLV 325 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD--ELYDRRNDPNEM 411 ++ + R + + F + +KLVLN + ELYD D +E Sbjct: 326 SLLKDRKSL-------VPRQIFWEYNGKSAMRQGHWKLVLNQTRKEPIELYDLTRDMSES 378 Query: 412 HNLIDDIRFADVRSKMHDALLDY 434 NL D+ +M AL + Sbjct: 379 KNLADNQ--PQRVQQMQSALAAW 399 >UniRef50_Q5UEY3 Probable sulfatase n=1 Tax=uncultured alpha proteobacterium EBAC2C11 RepID=Q5UEY3_9PROT Length = 512 Score = 349 bits (897), Expect = 1e-94, Method: Composition-based stats. 
Identities = 121/497 (24%), Positives = 206/497 (41%), Gaps = 69/497 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPNF+F+ +D Q + G + L T ++D L EG+ F + T SPVC PARA + Sbjct: 1 MKRPNFVFITSDQQRGDCYGFMG-RKLKTPHLDQLRREGMHFRNCITPSPVCQPARAAIL 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTM--GRYFKDAGYHTCYIGKWHLDGHDYF--------- 109 TG +G N + + + Y T IGK H F Sbjct: 60 TGKLPKTNGVKDNGIDLRAERGELGFAAALTNVDYETALIGKAHFATTQTFSPQTSVECK 119 Query: 110 -GTGECPPEWDADY-------------WF---------DGANYLSEL-----TEKEISLW 141 G+ + PP W+ Y W G +Y E LW Sbjct: 120 TGSADYPPNWNGPYMGFQHVELLTQGHWHKIRPPVIPPSGQHYEDWFFNVVGKESAFELW 179 Query: 142 ----RNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF 197 R G+ + + + + + + +++R++ +L ++ PF + +S+ +PHHPF Sbjct: 180 KSETRKGVGAAQTWASA-LPVAWHSSTWVADRSIHWLSNRRESN-PFCLWISFPDPHHPF 237 Query: 198 TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ------------------AMPSP 239 CP + + +L + + DL ++P HR + MP Sbjct: 238 DCPEPWNLLHNPEDVDLPKFLEKDLNDRPWWHRRSLESEPDLSDPVLKRFRKQGSRMPDQ 297 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKG 298 Y+ +D +GRVI L + + T +IYTSDHG+ MG L KG Sbjct: 298 SEAQLREMTANYYGMISLIDHNVGRVIACLREKGILDETIIIYTSDHGDHMGERGLYLKG 357 Query: 299 AAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 +YD + + +I+R P + + P++ +D+ T A P+ ++ + Sbjct: 358 PMLYDSLINVGMIVRGPGVAAGRSENAPITTLDVGATFCDYAGTSLPKEAQSVSLKSCFG 417 Query: 358 PRGVMVE--FNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNL 414 G + ++ +++ G + +R T KL + L + D ELYD ++D NEM NL Sbjct: 418 GAGSPHDAVYSEWDVAPSRCGVRLDLRTVHTGKAKLTIELQSGDGELYDLQSDGNEMINL 477 Query: 415 IDDIRFADVRSKMHDAL 431 ++ A++++ M L Sbjct: 478 WNEPLAAELQNHMTKLL 494 >UniRef50_A6DJJ1 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJJ1_9BACT Length = 510 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. 
Identities = 106/480 (22%), Positives = 185/480 (38%), Gaps = 51/480 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N LF+ D M+GCY + + T NID +A G F +A +C P+RA L T Sbjct: 29 KKMNVLFIPIDDLKP-MLGCYGDQAIITPNIDRIAERGTVFLNASCQQAICGPSRASLMT 87 Query: 62 GIYANQSGPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y + + W +I ++ +YFK GY T +GK G + P W Sbjct: 88 GMYPDHTKVWDLATKMRDINPDILSIPQYFKQQGYETTGVGKTFDPRCVDGGKFQDKPSW 147 Query: 119 DADYWFDGANYLSELT----EKEISLWRNGLNSVEDLQAN------------------HI 156 Y G + K+ + G Q N + Sbjct: 148 SIPYHKAGGKGYANPEVAKAWKKAAELVKGRTFKMGYQRNKAMARLGDPICRPATECMDV 207 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 + ++ L++ ++AD+PF + V + +PH PF P +Y + Y ++ E Sbjct: 208 PDHVYKDGAVARVGAKLLEELSKADKPFFLSVGFAKPHLPFVAPKKYWDMYNSHDIQVAE 267 Query: 217 -KAQDDLANKPEHHRLWAQAMPSPVGDDGLYHH-------PLYFACNDFVDDQIGRVINA 268 + K + L A S + + G Y A ++D Q+G +++ Sbjct: 268 YQKSAKNDTKIAYKSLGEIAAYSDMPEKGPIDQETQKHLIHGYMATTSYMDAQLGLLLDK 327 Query: 269 LTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPV 326 L NT + DHG +G H + +K ++ R PL+I +P+G + + PV Sbjct: 328 LEELGIANNTIICLWGDHGFHLGDHGMWTKHTN-FEQAVRSPLLIAAPKGFKPNSTNAPV 386 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 +D+ PT+ LA ++ P LPG+++ V + V + + G + Sbjct: 387 ELVDIFPTLCDLAGLDIPTHLPGKSLAPVMKDTSTSVRYA--ALGQYPRGNKTMGYTLRS 444 Query: 387 DDFKLVLNLF------------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 + ++ V L + +L+D DP E NL + + + Sbjct: 445 ERYRYVKWLNLDYRKSVAKGKLVATQLFDYEKDPLETVNLAANPEYKKIIDSFEAEFARR 504 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. 
Identities = 108/461 (23%), Positives = 179/461 (38%), Gaps = 69/461 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+TD Q G K + T NID+LAAE Y +P C+P R+ L TG Sbjct: 31 RPNVVMVITDDQGYGDCGFTGNKVVQTPNIDALAAESSVLTD-YHVAPTCSPTRSALMTG 89 Query: 63 IYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 + N++G W N T G F DAGY T GKWHL G+ P Sbjct: 90 HWTNRTGVWHTISGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHL--------GDNYPYRA 141 Query: 120 ADYWF-DGANYLSELTEKEISLWRNGLNSVEDLQANH-IDETFTWAHRISNRAVDFLQQP 177 D F + + + W N + F+++ Sbjct: 142 EDNGFTEVYRHGGGGVGQTPDFWDNAYFDGSYFHNGKAVKAEGFCTDVFFKEGNRFIREC 201 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ADEPF ++ + PH P P +Y++ Y + + Sbjct: 202 VEADEPFFAYIATNAPHGPLHAPQKYIDMYPEMNDNV----------------------- 238 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH---- 292 +F VDD +G+ L +NT I+T+D+G GA Sbjct: 239 -----------ATFFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTDNGTAGGASVYNA 287 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEKPEIL-- 347 + K + Y+ R+P ++ P+G + R +T +D++PT++ + +E PE + Sbjct: 288 GMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVPTLLDMCGVEAPESVKF 347 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR----CWVTDDFKLVLNLFTSDELYD 403 G +I+++ + V FN + DS P++ + D ++L+ ELY+ Sbjct: 348 DGTSIVSLLKDE-VDSSFNDRMLITDSQRVIDPIKWRQSSVMQDKWRLI----NGKELYN 402 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 NDP + +N+ D + + M + ++ F Sbjct: 403 IANDPGQENNIAGD--HPEQVASMRAFYEAWWAELEPTFSQ 441 >UniRef50_A6DJ72 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=3 Tax=Bacteria RepID=A6DJ72_9BACT Length = 495 Score = 349 bits (896), Expect = 1e-94, Method: Composition-based stats. 
Identities = 114/469 (24%), Positives = 204/469 (43%), Gaps = 51/469 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP--LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +RPN +F++TD Q + VG + ++T +I+ +AAEG++F + Y + +C+P+RA Sbjct: 25 QRPNVVFILTDDQRGDAVGYHKKPLLGIDTPSINKIAAEGVQFENMYCTTSLCSPSRAAF 84 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 +G Y + + N ++ + + GY T +IGKWH+ D Sbjct: 85 LSGTYTHTHKVYDNFTDYPHDLKSFPLLLQQEGYTTGWIGKWHMGEEDDSKRP------G 138 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 DYW + + + + NG +AH++++ A+DFL + Sbjct: 139 FDYW---VTHKGQGKYWDTTFNVNGERK---------KVPGYYAHKVTDMAIDFLNK-VD 185 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLE-------KYADFYYELGEKAQDDLANKPEHHRLW 232 +PF + + + PH PF +Y Y D ++LG+K + + P H ++ Sbjct: 186 KSKPFALCLGHKAPHGPFIPEAKYDSIYNDTPVPYPDSSWKLGDKPKWIVDRLPTWHGIY 245 Query: 233 ------AQAMPSPVGD---DGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 + P+ D + Y A + VDD +GR+ + L +NT +I+T Sbjct: 246 GPLYGFRKDFPNDKASAIVDFEHFVRSYTATINSVDDSVGRIYDHLEEMGILDNTILIFT 305 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALAD 340 SD+G ++G H +I K M++ IPL +R P+ + + V ID+ PT+M L Sbjct: 306 SDNGFLLGEHGMIDK-RTMHEASVSIPLTVRFPKKIKGGTVIKEQVLSIDMAPTIMELTV 364 Query: 341 IEKPEILPGENILAVKEP--RGVMVEFNRYEIEHDSFGGFIP-VRCWVTDDFKLVLNLFT 397 +K G + + + + YE ++ + P VR +K V Sbjct: 365 GKKMPSAQGLSWATLLDDTKDAEWRKTWLYEYNYEVQFPYTPNVRGIRHGKWKYVAYPHG 424 Query: 398 S-------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +ELY+ DP+E NL +D +AD++S + L + Sbjct: 425 DGGKLRHMEELYNMERDPSESSNLAEDPAYADIKSMLAMELAKTLKSTG 473 >UniRef50_UPI00016C500A sulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C500A Length = 472 Score = 349 bits (895), Expect = 2e-94, Method: Composition-based stats. 
Identities = 108/463 (23%), Positives = 185/463 (39%), Gaps = 43/463 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +MTD Q + + C L T N+D +A EG R+ +A+ + +C P+RA L TG Sbjct: 22 RPNIVVMMTDDQRHDYMSCAGHPFLKTPNMDRIAKEGFRYTNAFVTNALCAPSRATLMTG 81 Query: 63 IYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 Y++ +G N N + + GY + GK H+ GH T D Sbjct: 82 QYSHLNGVRDNMGTTLNPNAPWLPDELRKLGYEVAFCGKSHVPGHFRDKTW--------D 133 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 Y+F + L +G + ID ++++A+ ++++P Sbjct: 134 YYFGFQGQGNYLKPLIAESGPDGKIGPDKPYDGWID------DVVTDKALAWVKKPRA-- 185 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS-PV 240 +PF + + + PH + + + YA + D KP A + P Sbjct: 186 KPFALFLFFKSPHRAWQPAARHKDLYAGAAVKKPALWDDPGQGKPRAFLQAANMIGQYPD 245 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGA 299 D Y C VDD +G+V+N L ++ + T V+YTSD+G +G + K Sbjct: 246 TKDYDGMIRDYARCITGVDDNVGKVLNTLDEQKIADTTAVMYTSDNGFFLGEWQRFDK-R 304 Query: 300 AMYDDITRIPLIIRSPQGERRQV-------DTPVSHIDLLPTMMALADIEKPEILPGENI 352 M++ R+PL+++ P+ + V + D+ PT++ LA P+ + G ++ Sbjct: 305 FMHEPSVRVPLLLKVPKALAKDCVPPGSQPGAMVINPDIAPTVLELAGGAPPKAMQGRSV 364 Query: 353 LAV--------KEPRGVMVEFNRYEI--EHDSFGGFIPVRCWVTDDFKLVLNLF------ 396 L P E YE D R T +KL+ Sbjct: 365 LPFARLPPAGPLPPEMAPREAWYYEYFEFPDPSHNVEKQRGVRTTKWKLIHYYDPPFKFK 424 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 + ELYD DP E NL + F ++ + + ++ Sbjct: 425 DAYELYDLEKDPEERVNLANRPAFQGTVKELQEKMAALRKELG 467 >UniRef50_Q7UZ92 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ92_RHOBA Length = 582 Score = 349 bits (895), Expect = 2e-94, Method: Composition-based stats. 
Identities = 116/464 (25%), Positives = 195/464 (42%), Gaps = 35/464 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ D + +GCY T NID LA+ ++FN AY VC P+RA L T Sbjct: 26 QRPNVLFIAVDDLRPS-IGCYGDPQAITPNIDRLASRSVQFNRAYCQVAVCNPSRASLMT 84 Query: 62 GIYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP--- 115 G+ + WT + + T+ ++F+ GY GK + + + P Sbjct: 85 GLRPDNLAVWTLPIHFREAMPEAVTIPQWFRRYGYTAVSHGKIYHNPTPDPQSWSEPIRD 144 Query: 116 -PEWDADYWFDGANYLSEL-TEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 P A Y + + E WR A + + +N A++ Sbjct: 145 LPRLPAFYPDGTREQMKKFDNELPDRDWRKNNLRGPSTAAPELADDQLLDGARTNMAIED 204 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH----- 228 L++ ++D PF + + Y PH + P +Y + + + Q + P Sbjct: 205 LRRLGKSDAPFFLAMGYIRPHLAWVAPKKYWDMHDPSKLPVRTGEQIPKNSPPYAMHNNS 264 Query: 229 ---HRLWAQAMPSPVGD------DGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 H + +P P D D + Y+AC ++D QIGR+++AL E +NT Sbjct: 265 EMTHYVDRMNLPKPWDDDTVPTEDARHLMHAYYACVSYIDAQIGRLLSALKEEGLADNTI 324 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMM 336 V+ SDHG +G H+ K Y+ +PL+I P + +Q D +DL PT+ Sbjct: 325 VVLWSDHGWKLGEHRGWGK-MTNYEIDAHVPLLITGPGVKCLGQQTDQLAELLDLFPTLC 383 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNR-YEIEHDSFGGFIPVRCWVTDDFKLV--L 393 +A I+ P+ + G +++ + V + G T D++LV Sbjct: 384 EMAGIDVPDFVDGSSLVPILNDVDAKVHDGAVNQYYRRHEGRQYMGYSIRTSDYRLVEWR 443 Query: 394 NLFTSD----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + F+ + ELYD RND +E +++D V ++ LL+ Sbjct: 444 DFFSGEVAAKELYDHRNDDSENESIVDSTE-PKVIDELTSLLLE 486 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 348 bits (894), Expect = 2e-94, Method: Composition-based stats. 
Identities = 106/462 (22%), Positives = 183/462 (39%), Gaps = 52/462 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN ++++ D +GCY K + T ++D +AA G++F Y+ S VC P+R+ L G Sbjct: 19 KPNIIYILADDLGYGDLGCYGQKVIQTPHLDKMAANGMKFTQHYSGSTVCGPSRSCLLEG 78 Query: 63 IYANQSGPWTNNVAPGKNIST---MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 ++ + N + + + + AGYHT IGK + + P + Sbjct: 79 KHSGNTYVRGNGMLQMRQDPHDLIFPKALQKAGYHTAMIGKSGMGCNT--DDAALPYQKG 136 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI--DETFTWAHRISNRAVDFLQQP 177 DY+F ++ LW+N + N+ + + + N A+D++++ Sbjct: 137 FDYFFGFTSHTQAHWFFPTHLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYVER- 195 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 + D PF + +++ PH E+ KY E L K +H + P Sbjct: 196 -QKDGPFFLHLAFQIPHASLRAKEEWKAKYRPILKE------KLLPKKDKHPHYSYEREP 248 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM-GAHK-- 293 + A ++D +G + L ENT +++ SD+G M G HK Sbjct: 249 ----------KTTFAAMVSYMDHNVGLLNKKLEDLGLAENTLIMFASDNGAMQEGGHKRD 298 Query: 294 -------LISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKP 344 L MY+ R P+I P + + D + D+ PT+ LA + Sbjct: 299 SFDSNGVLRGGKRDMYEGGVRTPMIAYWPGKIKAGQTSDHISAFWDISPTVRELAGAKVQ 358 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD----- 399 E G + + +G + + E GG R +KL+L +D Sbjct: 359 EDTDGISFVPTLLGKGSQTKHDYLYWEFFEQGG---KRAIRMGKWKLILYKTNTDLNPKM 415 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 EL+D D +E +L + + S ALL MDK P Sbjct: 416 ELFDLEADISEQKDLSK--QLPEKVS----ALLKLMDKAHTP 451 >UniRef50_C5BYA8 Sulfatase n=2 Tax=Micrococcineae RepID=C5BYA8_BEUC1 Length = 478 Score = 348 bits (893), Expect = 3e-94, Method: Composition-based stats. 
Identities = 118/483 (24%), Positives = 190/483 (39%), Gaps = 63/483 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN ++TD A + VG Y +T ID +A G R ++ + + +CTP+RA + T Sbjct: 3 RRPNICLILTDDHAAHAVGTYGSVVNSTPRIDEIAQRGWRLDNLFCTNSICTPSRASILT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G +++ +G T + + + T KDAGY T +GKWHL G GE D Sbjct: 63 GQHSHTNGVRTLSTPMDRELPTFVSQLKDAGYRTAIVGKWHL------GEGEEHRPRAFD 116 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ----P 177 +W L + E +R T I++ A+ +L P Sbjct: 117 HWM----ILRDQGEYHDPTFR--------TPDGLRTVTGYATDVITDLALQWLDDLDYGP 164 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 D P+ +++ + PH + + ++A + DD A + A + Sbjct: 165 DGTDSPWCLLIHHKAPHRSWEPDEAHRAQFAGRPIPVPATFTDDYATRSGAAHRAAMRVA 224 Query: 238 SPVGDDGL-------------------YHHPLYFACNDFVDDQIGRVINALTP-EQRENT 277 + L + Y AC VDD +GRVI+ L + ++T Sbjct: 225 DQLTRRDLKADPPAGLSYEDEALWKYQRYMEDYLACVASVDDNVGRVIDRLAERGELDDT 284 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTM 335 ++YTSD G +G H K MYD+ R+P ++ P R D V+++DL T+ Sbjct: 285 LLMYTSDQGFFLGDHGWFDK-RFMYDESIRMPFVVSCPTALDGGRSTDQIVTNVDLARTI 343 Query: 336 MALADIEKPEILPGENILAVK----EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 + AD+E + GE+ P + RY D T+ +KL Sbjct: 344 LEAADVEPHPGMQGESFWGTLARGETPPADQSFYYRYWEHDDGAHHAAGHYGIRTERYKL 403 Query: 392 VLNLFT--------------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 + ELYD DP+E+ N+ DD +A VR + + L Sbjct: 404 IYFYNDGLGLPGTGWATYAPEWELYDLEADPDELVNVADDPTYAVVRRDLTERLAREQAA 463 Query: 438 IRD 440 D Sbjct: 464 AGD 466 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 347 bits (892), Expect = 4e-94, Method: Composition-based stats. 
Identities = 108/468 (23%), Positives = 178/468 (38%), Gaps = 81/468 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + V+TD Q + C+ LNT N+D LA + +R + + P CTP RA L T Sbjct: 55 SRPNVILVVTDDQGYGDMSCHGNPWLNTPNLDRLATQSVRLENFHV-DPFCTPTRAALMT 113 Query: 62 GIYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y + G W + +TM F+++GY T GKWHL F E E Sbjct: 114 GRYCTRVGAWAVTEGRQLLDPDETTMAETFRESGYRTGMFGKWHLGDPPPFAPRERGLET 173 Query: 119 DADYWFDGANYLSELTEKEI---SLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + GA+ + T + + +RNG D A+DF+Q Sbjct: 174 VVRHMAGGADEIGNPTGNDYFDDTYYRNGTPESFD---------GYCTDIWFEEAIDFIQ 224 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + +++PF + + H P+ Y + + E Sbjct: 225 K--ESEQPFFAYIPTNAMHSPYLVADRYSDPFKRQGIEP--------------------- 261 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALT-PEQRENTWVIYTSDHGEMMGAH-- 292 ++ D+ +GR++ L R+NT +I+ SD+G GA Sbjct: 262 -----------QRAAFYGMIQNFDENLGRLLKRLDQDNLRDNTMLIFMSDNGTAQGASEQ 310 Query: 293 --------KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIE 342 + K ++Y+ R+P P R VD H D LPT++ L D++ Sbjct: 311 NRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCDLK 370 Query: 343 KPEIL--PGENILAVKEPRGVMVEFNRYEIEHD---------SFGGFIPVRCWVTDDFKL 391 +P + G ++ + IE + G P +TD ++L Sbjct: 371 RPADVTFDGRSMAGLLSHSSQQWPERTLVIERQPDNVVSATKTQGRAQPPFVVLTDRWRL 430 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 V DELYD +NDP ++ N+ + +V ++ Y + + Sbjct: 431 VR-----DELYDIQNDPGQIKNIAA--EYPEVVRELRAEYDAYFEDVH 471 >UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepID=B8KHZ9_9GAMM Length = 483 Score = 347 bits (892), Expect = 4e-94, Method: Composition-based stats. 
Identities = 120/471 (25%), Positives = 201/471 (42%), Gaps = 59/471 (12%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D G Y + + T +ID LAAEG+RF Y S +C+P+RAGL TG Sbjct: 30 NVLLIYVDDLGYGDTGAYGHRVVKTPHIDRLAAEGMRFTQFYAPSALCSPSRAGLLTGRT 89 Query: 65 ANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 ++G P + VA G N +T+ K GY T IGKWHL+G + P ++ Sbjct: 90 PYRTGVESWIPDDSQVALGHNETTLADLAKARGYRTAVIGKWHLNGGLHMQGTPQPRDFG 149 Query: 120 ADYWFDGANYLSELTEKE-ISLWRNGLNSVEDLQANH---IDETFTWAHRISNRAVDFLQ 175 D+ + A ++ + +E L R G +++ N+ A +S+ A+D+L Sbjct: 150 FDHQYGLAAWVKNASVRESKELPRRGAMFPDNMYRNNEAVGPTKKYSAELVSDEAIDWL- 208 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY----YELGEKAQDDLANKPEHHRL 231 + A +PF ++++Y E H P P EYL +Y D+ + D N+P R Sbjct: 209 --SGAKDPFFLLLTYSEVHTPIASPPEYLAQYQDYLTQEARDNPLLFYFDWRNRPWRGR- 265 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG---- 286 Y+A ++D Q+GRVI L + ++T +I++SD+G Sbjct: 266 -----------------GEYYANVSYMDAQLGRVIEYLRGKGVLDDTLIIFSSDNGPVTD 308 Query: 287 --------EMMGA-HKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTM 335 M G L K +++ R+P IIR P+ R P + +D+ PT+ Sbjct: 309 AALTPWELGMAGETAGLRGKKRFLFEGGLRVPGIIRYPERIEAGRVESRPATALDVFPTL 368 Query: 336 MALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL 393 + L GE++ + + + Y G ++KL+L Sbjct: 369 AQWLGVAVDSSVPLDGESLWPLIDGGDFQRQQAFYWSIPTPDGMEF---AVRDGNWKLIL 425 Query: 394 NLFTSDE-LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR-DPF 442 + + L+D +D E++NL++ VR ++ + DP Sbjct: 426 DADERPQYLFDLASDWYEVNNLLE--TEPAVRERLLQIYAARRAAVESDPL 474 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 347 bits (892), Expect = 4e-94, Method: Composition-based stats. 
Identities = 115/503 (22%), Positives = 190/503 (37%), Gaps = 82/503 (16%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +++ D +GCY + + T N+D LA EGIRF YT +PVC PARA L TG Sbjct: 27 PNIVYIYADDLGYGELGCYGQQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGK 86 Query: 64 YANQSGPWTN-------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 +A S N + N T+ K GY T GKW + ++ G Sbjct: 87 HAGHSAIRGNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNTEG 146 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNG-----LNSVEDLQANHIDETFTWAH- 164 T P DY++ + LW N +D+ T A Sbjct: 147 T---PTRQGFDYYYGYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDADF 203 Query: 165 -----------RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 +++ +A+ F+ + PF + + Y PH P EY++KY + E Sbjct: 204 ESFKGKEYAPAKMTEKALAFIDRSKAG--PFFLYMPYTLPHVSLQAPDEYVKKYIGQFDE 261 Query: 214 LGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 + + A+ Y Y + F+DDQ+G +++ L Sbjct: 262 KPYYGEKNYAS-------------------TKYPLSTYASMITFLDDQVGIILDKLKALG 302 Query: 274 R-ENTWVIYTSDHGEMMGAH----------KLISKGAAMYDDITRIPLIIRSPQGER--R 320 +NT V+++SD+G L +Y+ R P I+R P + R Sbjct: 303 LDDNTIVMFSSDNGATFNGGVNPQFFNSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGR 362 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFI 379 D + DL+PT+ L P G + L + + + + E+ GG I Sbjct: 363 VSDHVSAQFDLMPTLAELTGQASPPT-DGISFLPELLGQTNRQKKHEFLYFEYPEKGGQI 421 Query: 380 PVRCWVTDDFKLV-----LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 VR D+K V N +L++ + D +E ++ D+ K+ D ++ Sbjct: 422 AVRM---GDWKGVKTDLRKNPGNPWQLFNLKTDRSESTDVAA--SHPDILKKL-DQIVKR 475 Query: 435 MDKIRDPFRSYQWSLRPWRKDAR 457 + +P + + P +R Sbjct: 476 --EHEEPANAAWQFVMPVIAASR 496 >UniRef50_Q7UFA5 Putative sulfatase yidj n=1 Tax=Rhodopirellula baltica RepID=Q7UFA5_RHOBA Length = 527 Score = 347 bits (892), Expect = 4e-94, Method: Composition-based stats. 
Identities = 119/468 (25%), Positives = 196/468 (41%), Gaps = 42/468 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGK-------------PLNTQNIDSLAAEGIRFNSAYTC 48 ++PN L + TD +GCY + T +IDS+AA G S Y Sbjct: 56 EQPNVLIIQTDEHNFRTLGCYRDTLPIEEAEIWGKGAVVETPSIDSIAARGAICTSFYAT 115 Query: 49 SPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 SPVCTP+RA FTG Y +G + N+ ++ T + GY T Y GKWHLDG Sbjct: 116 SPVCTPSRAAFFTGRYPQNTGAYQNDRPLRGDMVTFAEVLRRDGYATGYAGKWHLDGP-- 173 Query: 109 FGTGECPPEWDAD--YWFDGANYLSELTEKEISLWRNGLNSV--------EDLQANHIDE 158 P+W D + F Y+ + + NG SV + N DE Sbjct: 174 -----GKPQWGPDRQFGFSDNRYMFNRGHWKKFDFENGQPSVAATNKKGQPNYDLNGADE 228 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 + +RA DF+++ ++ EPF +S +PH P T Y + + Sbjct: 229 KTFSTDWLCDRAADFIREHSQ--EPFCYHLSLPDPHGPNTVRQPYDTMFENMPVRPPMTF 286 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENT 277 Q D + + YF +DD +G +++ L + T Sbjct: 287 QLD----GDQPGWLPATNRNSQQRFNARLMTQYFGMVRCIDDNVGMLLSLLDELSLTKRT 342 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTM 335 V++TSDHG++ H ++KG Y+ ++P+II +P ++D + +D PT+ Sbjct: 343 VVVFTSDHGDLCYEHGRLNKG-NPYEGSAKVPMIIAAPGLISAGLRIDQAMGTVDFAPTL 401 Query: 336 MALADIEKPEILPGENILA-VKEPRGVMVEFNRYEIEH-DSFGGFIPVRCWVTDDFKLVL 393 ++L E P G ++ + E +R + + VTD +KL++ Sbjct: 402 LSLLRKEVPAGTQGRDLSEWFRNIETSDEESHRNSVTFLRAASSKAAWIAAVTDRYKLIV 461 Query: 394 NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 + L+D + DP+E N I + +++ +LL Y D ++DP Sbjct: 462 SADDQPWLFDLKEDPHETTNHIGKPENQTIAARLARSLLRYGDLMKDP 509 >UniRef50_C5BWB0 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5BWB0_BEUC1 Length = 497 Score = 347 bits (892), Expect = 4e-94, Method: Composition-based stats. 
Identities = 117/462 (25%), Positives = 186/462 (40%), Gaps = 31/462 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L VMTD Q + +G G P+ T N+D LAA+G F AY+ +P CTPARA L TG Sbjct: 4 RPNVLLVMTDQQRWDTLGSAGG-PVETANLDHLAAQGTTFTHAYSATPSCTPARASLLTG 62 Query: 63 IYANQSGPWTNNVAPGKN---ISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGE 113 +G +T+ DAGYHT +GK H L G E Sbjct: 63 QDPWHTGILGMGAGQPPMAGLENTLPEALADAGYHTQGVGKMHFSPQRALHGFHATTIDE 122 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDL-QANHIDETFTWAHRISNRAVD 172 + + D + ++ +GL+ L + H E + ++ Sbjct: 123 SLRVEEPGFTSDYTQWFERHAPADVRQADHGLDFNSWLARPFHTGEHLHPSTWTVTESIR 182 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--ADFYYELGEKAQDDLANKPEHHR 230 FL++ PF ++ S+ PH P+ P Y E Y +L D A+ + Sbjct: 183 FLERR-DPTRPFFLMTSFARPHSPYDPPAFYYEHYLRRHHTGDLPPAVVGDWASVHDVGG 241 Query: 231 LWAQA----MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDH 285 D+ Y+ +D QIGR++ L + + T V++T+DH Sbjct: 242 AEGMDPNAWRGRRTADEIGRARAGYYGSIHHIDHQIGRLMRYLRDRRLDAETLVVFTADH 301 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR-----QVDTPVSHIDLLPTMMALAD 340 G+M+G H L K Y+ +PL++R P G R VD PV D++PT++ Sbjct: 302 GDMLGDHHLWRK-TYAYEGSAHVPLVVRLPAGMRSAGDAEVVDDPVCLQDVMPTILDACG 360 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--- 397 ++ P + G + L + V + + ++ +K V Sbjct: 361 VDVPASVDGASTLPLVTGERVPWREFVHGEHSTCYHPSQEMQYLTDGAWKYVWFPRGDGP 420 Query: 398 ---SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 ++L+D R+DP E +L A V + L+D + Sbjct: 421 GSPREQLFDLRSDPYEERDLAPRSDHAAVLRRWRARLVDVLA 462 >UniRef50_A6DKC5 Putative sulfatase yidj n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC5_9BACT Length = 511 Score = 347 bits (891), Expect = 5e-94, Method: Composition-based stats. 
Identities = 122/456 (26%), Positives = 190/456 (41%), Gaps = 35/456 (7%) Query: 4 PNFLFVMTDTQATNMVGCYS-------------GKPLNTQNIDSLAAEGIRFNSAYTCSP 50 PN L +MTD +GCY G + T +ID LA EG+ N+ Y SP Sbjct: 33 PNLLIIMTDEHNFRTLGCYRKLLSKDQAMIWGDGNIVETPHIDKLAEEGVLCNNFYASSP 92 Query: 51 VCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL--DGHDY 108 VC+PAR +G Y + NN ++ + G + GY T Y GKWHL DG Sbjct: 93 VCSPARGSFISGQYPQNTPVIDNNTHMSDDVVSFGSILQSHGYTTGYSGKWHLDGDGKPQ 152 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 +G D Y F+ ++ L N DE ++N Sbjct: 153 WGPERQFGFEDNRYMFNRGHWKKILDTASGPKIGAEKRGTPTYDVNGADENTYTTDWLTN 212 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 + +DF+ Q + PF +VSY +PH P T Y Y ++ + A + P Sbjct: 213 KTIDFITQHKAS--PFCYMVSYPDPHGPDTVRAPYDTMYTHMNFQKPKTASKKQDDLPSW 270 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 G + Y+ +DD I R++ L + ENT V++TSDHG+ Sbjct: 271 ----------ATTKRGAANQSQYYGMIKCIDDNIARIMTCLDEQGILENTIVVFTSDHGD 320 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPE 345 M G H +KG + + ++P I+R P+ + V+ +S +D LPT++ L D E Sbjct: 321 MRGEHGRQNKGIPL-EASAKVPFIVRYPKKISSGKIVNEALSGVDFLPTILGLMDKETAG 379 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 G + + + + I G +TD +KLV+ + L D++ Sbjct: 380 KEEGRDGSQLLHGKVPTGWSDVTFIR----GTKEKWVAAITDQYKLVMAPWDEPWLIDKK 435 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 N+P+E N I+D ++ V + + Y K DP Sbjct: 436 NNPDETINYINDPQYRSVIRSLAKEMQRYGTKYNDP 471 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 347 bits (890), Expect = 7e-94, Method: Composition-based stats. 
Identities = 111/454 (24%), Positives = 174/454 (38%), Gaps = 66/454 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN +F++ D + Y T N+D LA +G+RF +AY VCTP R T Sbjct: 40 SRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLT 99 Query: 62 GIYANQ------------SGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF 109 G Y + S P +NN+ N T+ K GY T +GKWH F Sbjct: 100 GRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAGYPPNF 159 Query: 110 GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID--ETFTWAHRIS 167 G P + D +F L+ G + + DL N + + + Sbjct: 160 G----PLQKGFDEYFG------HLSGGIEYFTHTGTDRILDLYENDVPVQRSGYVTDLFT 209 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +RAV+F+Q+P PF + + Y+ PH P+ P Sbjct: 210 DRAVEFIQRPH--SRPFYLSLHYNAPHWPWQGP--------------------------- 240 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG 286 + + A G Y A +DD +GRV++AL +NT VI+TSD+G Sbjct: 241 -NDQASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNG 299 Query: 287 E--MMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIE 342 + A++Y+ R+P IIR P + + + DL T++A Sbjct: 300 GERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAATGTS 359 Query: 343 KPEIL--PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE 400 G+N+L + RG EF+R R + D+K + Sbjct: 360 FHPNYPPDGQNLLPLL--RGDRSEFSRTLFWRYGAALTTRQRAVRSGDWKY-WRRGNQEA 416 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 L++ DP E +L D A V +++ + + Sbjct: 417 LFNLATDPGETTDLKD--SNAQVFTRLRNQFQHW 448 >UniRef50_C5BXT8 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5BXT8_BEUC1 Length = 497 Score = 347 bits (890), Expect = 7e-94, Method: Composition-based stats. 
Identities = 130/470 (27%), Positives = 198/470 (42%), Gaps = 40/470 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L + TD + VG + G T N+ +LA G F Y + VC+P RA + T Sbjct: 14 SRPNILVICTDQHRFDAVGTHPGSAAITPNLVALAERGAVFEQCYAPNTVCSPTRASMLT 73 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------DGHDYFGTGECP 115 G Y + G W N V + S + R DAGY T +GK+HL + G Sbjct: 74 GEYPSSHGLWANGVTLPEGRSLVSRELADAGYRTGLVGKFHLASAFEGRTEERLDDGFET 133 Query: 116 PEWDADYWFDGAN--YLSELTEKEISLWRNGLNSV----------EDLQANHIDETFTWA 163 W D + Y L E+ +LW + V E+ + + + +++ Sbjct: 134 FAWAHDPFHGAPENAYHRWLRERHPALWAEAMGDVVTPDVENFAHENTRFDEMPAHASYS 193 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 ++ DFL+ D PF ++ +Y PHHPF P EYL+ Y D+LA Sbjct: 194 TWVTEEVGDFLR--TEDDRPFFLLANYFAPHHPFAAPQEYLDLYPPGSVPPPVGGPDELA 251 Query: 224 NKPEHHRLWAQAMPSPVG--------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE 275 KP ++A G + Y A +DD +GR++ L + E Sbjct: 252 TKPTLQSEASRASYVGHGPSFADFTPEGIDEIRRTYHAMVSQIDDGVGRILRTLREQGLE 311 Query: 276 -NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLL 332 +T V++ SDHGEM+G H L+ KG MYD R+PL++ P +V V D+ Sbjct: 312 RDTLVVFVSDHGEMLGDHALLLKGPMMYDPAVRVPLVVSWPDLVPAGHRVTDFVGVHDVA 371 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFK 390 T+ A +E G +++AV + E + P + D K Sbjct: 372 HTIRCAAGLEPYARDQGLDLVAVAREEREARTYAWAEYRDSGYPYDPPAHTTMYRRHDSK 431 Query: 391 LVLNLFTSD-------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 +V+ D ELYD +DP+E+ N DD +A R ++ A+ D Sbjct: 432 VVVWHGDPDAGRPATGELYDLADDPDELVNRWDDPAYARRRLELCAAVSD 481 >UniRef50_A6DIH4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIH4_9BACT Length = 621 Score = 346 bits (889), Expect = 8e-94, Method: Composition-based stats. 
Identities = 109/464 (23%), Positives = 191/464 (41%), Gaps = 36/464 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N L +++D + + Y T N+D A +FN AY PVC P+RA + Sbjct: 154 KKLNVLMIVSDDL-NHYIKSYGDPQAITPNLDKFMAMSTQFNKAYCQYPVCGPSRASFLS 212 Query: 62 GIYANQSGPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y S TN + M +F++ GY T GK H +G E Sbjct: 213 GLYPESSLVITNTQYLRDVNPSADNMLEHFRNNGYWTGAAGKIF---HSTYGMMEKGTSL 269 Query: 119 DADYWFDGANYL------------SELTEKEISLWRNGLNSVEDLQANHIDE---TFTWA 163 D F A + + + +N + DL + E Sbjct: 270 DEYEKFSNAENPQLLLLKKRWIKEGKPGDFKAYFNKNKVKDQADLVLGYGTELRDNQHGD 329 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R + R +++ + ++PF M +PH PF P +YL+ Y + ++D Sbjct: 330 GRNARRVAQWIKNNSAGEKPFFMACGIVKPHTPFYAPKKYLDLYPKDKLIFDDVPENDWD 389 Query: 224 NKP-----EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENT 277 NKP + ++ + + ++ Y+ Y C F+D Q+ +++AL +NT Sbjct: 390 NKPKVAGVKRYQAFRGELGVNDRENRKYYLQSYLGCISFMDAQVKVLMDALKESGQMDNT 449 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTM 335 +++ SDHG +G H + K ++++ R+P I P G +Q D+ ID+ PT+ Sbjct: 450 VIVFMSDHGFQIGEHFMYGK-VTLFEECARVPFGIIYPGNPGAGKQSDSLAELIDVYPTL 508 Query: 336 MALADIEKPEI-LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + L + +P L G++++ V + + V Y + G + R + Sbjct: 509 LDLCKLPQPSHKLQGKSLVPVTKDTSLQVRNEAYTVVTR---GKLMGRAIRKGSWVYAHW 565 Query: 395 LFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 D ELY+ DP + +NL+ D +A V +M AL + Sbjct: 566 GSDRDVELYNMDKDPKQYNNLVKDPEYAKVLKQMDKALKQKASE 609 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 346 bits (889), Expect = 9e-94, Method: Composition-based stats. 
Identities = 113/467 (24%), Positives = 175/467 (37%), Gaps = 83/467 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D VGCY G P+ T ID LAA G RF Y+ VC+P+RA L TG Sbjct: 46 RPNVVMLLADDLGYRDVGCYGG-PVETPTIDQLAAGGTRFQQFYSGCAVCSPSRATLMTG 104 Query: 63 IYANQSGPW------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + ++G + + N T+ +DAGY T ++GKWHL P Sbjct: 105 RHHIRAGVYSWIQDESQNSHLRLREVTLAEVLRDAGYATAHVGKWHLGLPTEERDKPTPD 164 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 + D+WF N + RNG + +++ A+ ++ + Sbjct: 165 QHGFDHWFATWNNAQPSHRNPDNFIRNGEPVGQ--------LEGYSCQLVADEAIRWMDR 216 Query: 177 P--ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 + D+PF + V + EPH P P E +KY + Sbjct: 217 HRESDPDQPFFLNVWFHEPHAPIAAPDEVTQKYGKLSDKG-------------------- 256 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA-- 291 +Y D D I R++ L RENT ++Y SD+G Sbjct: 257 --------------AVYSGTIDNTDQAIKRLLAKLDALGVRENTLIVYASDNGSYRTDRV 302 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPE-ILP 348 KL + A ++ R+P I P V + P +D+LPT+ L I P+ L Sbjct: 303 GKLRGRKGANWEGGIRVPGIFHWPGHIPAGVVSNEPAGLVDVLPTICGLLKISPPQVHLD 362 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV---------LNLFTSD 399 G ++ + G F R++ P+ D+ LV NLF Sbjct: 363 GSDLTPLLT--GHADSFERHQPLFWHLQRSQPIVAMRDGDYSLVGFRDYEMSNKNLFEEK 420 Query: 400 -------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 ELY+ ++DP + NL + + M +L Sbjct: 421 WIPAIKNGTYHNFELYNLKDDPGQTKNLAAEQ--PERVEAMKQRMLQ 465 >UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Planctomycetaceae RepID=Q7UJQ8_RHOBA Length = 491 Score = 346 bits (889), Expect = 9e-94, Method: Composition-based stats. 
Identities = 107/491 (21%), Positives = 179/491 (36%), Gaps = 71/491 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN +F++ D +GCY + + T +D +AAEG+RF Y + VC P+R+ L T Sbjct: 34 KRPNIVFILADDLGYGDLGCYGQELIQTPRLDQMAAEGMRFTDFYAGNTVCAPSRSVLMT 93 Query: 62 GIYANQSGPWTNNVAPG-------KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G++ + N P T+ + AGY T GKW L G Sbjct: 94 GMHMGHTHVRGNAGGPDMSKQSLRDENVTVAEVLQSAGYATALCGKWGLGDDALGGRDGL 153 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF-------------T 161 P + D+++ N + LWRN + D ++ Sbjct: 154 PRKQGFDHFYGYLNQVHAHNYYPEFLWRNETKVALRNEVQRRDRSYGGFTGGWATKRVDY 213 Query: 162 WAHRISNRAVDFLQQPA--RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 I+N A+ F+++ A A +PF + +S PH G Sbjct: 214 SHDLIANEAMGFIREKATDAATKPFFLYLSLTIPHA----------------NNEGTGMS 257 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTW 278 + P++ D A +D +GR+++ L Q + T Sbjct: 258 GNGQEVPDYGIY--------ADKDWSDQDKGQAAMITRMDSDVGRILDLLKELQIDEQTV 309 Query: 279 VIYTSDHGEM-MGAH---------KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPV 326 V+++SD+G G H L A+ + R+PLI+R P D Sbjct: 310 VMFSSDNGPHNEGGHNPKKFDPAGPLRGMKRALTEGGIRVPLIVRWPGTTPPGAVSDHIG 369 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRCWV 385 DL+ T LA + PE + R + + Y E GG VR Sbjct: 370 YFGDLMATAAELAGTDFPEDADSISFAPTIVGRPEAQQTHEYLYWEFYEQGGRQAVRRV- 428 Query: 386 TDDFKLVLNLF--TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 ++K + + +LYD + D E NL D ++ ++ M++ P Sbjct: 429 --NWKAIREPWMTGPTQLYDLKADIGETTNLASD--HPEIVKQLETL----MEEAHTPHP 480 Query: 444 SYQWSLRPWRK 454 ++Q + ++ Sbjct: 481 NWQVRVPASKR 491 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 346 bits (889), Expect = 1e-93, Method: Composition-based stats. 
Identities = 107/488 (21%), Positives = 186/488 (38%), Gaps = 64/488 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LFV+ D N VG + T N+D LA G+ F++AY P C P+RA + T Sbjct: 38 EKPNILFVLADDLGYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMT 97 Query: 62 GIYANQSGPWTN------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G Y ++ G N NV + + + K AGY T +GKWHL + + Sbjct: 98 GRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGEASEYHPNK-- 155 Query: 116 PEWDADYWFDGANYLSELTEKEISLWR---------NGLNSVEDLQANHIDETFTWAHRI 166 +D Y F G + + E + + N + + + ET + Sbjct: 156 HGFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVRETEYITDGL 215 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 S AV+F+ + A +PF + ++Y+ PH P E + ++ + Sbjct: 216 SREAVNFVDKAAAKKKPFFLYLAYNAPHVPLQAKEEDMAMFSQIKDK------------- 262 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 Y VD +GR++ L +NT +++TSD+ Sbjct: 263 --------------------KRRTYAGMVYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDN 302 Query: 286 GEMMGA----HKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALA 339 G +G + L ++ + R P+++ P+ + PV +DL PT L Sbjct: 303 GGKLGQGANNYPLKEGKGSVQEGGFRTPMLVHWPKHMKAGSRFSHPVLALDLYPTFAGLG 362 Query: 340 DIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 PE L G++I A + + + G + FK V N Sbjct: 363 GAVLPEDKKLDGKDIWADIQANTAPHKDEFIYVLRHRNGYSDA--AARRNQFKAVKNHND 420 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY-MDKIRDPFRSYQWSLRPWRKDA 456 +LY+ D +E +++ + D+ M ++ + + + + WR A Sbjct: 421 DWKLYNIAQDISEDNDISA--QHPDILRDMVSSMESWSWNNQQPKWFHQSAEGAQWRLKA 478 Query: 457 RPRWMGAF 464 PR+ F Sbjct: 479 MPRFDQTF 486 >UniRef50_A6DGT7 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGT7_9BACT Length = 504 Score = 346 bits (889), Expect = 1e-93, Method: Composition-based stats. 
Identities = 113/484 (23%), Positives = 195/484 (40%), Gaps = 60/484 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L + D M+G Y + + ID LA + AY VC +RA + TG Sbjct: 19 RPNILIISVDDLKP-MLGTYGDPLVQSPTIDKLAEASALYEKAYCQQAVCGASRASIMTG 77 Query: 63 IYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGKWH-----LDGHDYFGTGEC 114 + + S W + T+ YFK GY TC+ GK DG Sbjct: 78 LRPDNSRVWEFRQVMRERNPQAITIPEYFKSQGYMTCFAGKIFDYRCVADGKKQDLKSWS 137 Query: 115 PPEWD------ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI------------ 156 PE + F + +L KEI L +NG + D I Sbjct: 138 RPEQPRNSEAMKNLGFADPAFREKLRLKEIELKKNGQKASYDAIKKAIGGSPCYEDSIDG 197 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 + I+ V +++ + +PF + V + +PH PF P +Y + Y + + L E Sbjct: 198 PDEIYEDGMIAREGVRLIKELGQKKKPFFIAVGFKKPHLPFNAPKKYWDLYKETDFAL-E 256 Query: 217 KAQDDLANKPE--HHRLW-------AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVIN 267 K Q + P + W + + Y AC +VD QI +++ Sbjct: 257 KYQKPVQGAPHYAYQNSWEFSGYNVPRINGEVLESFQRKLKHAYAACISYVDAQIAKLLK 316 Query: 268 ALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDT 324 L + E NT +++ SDHG +G H + K + Y+ TR+P + P+ ++ + Sbjct: 317 TLKDQGLEKNTVIVFWSDHGFHLGDHGMWCKHSN-YEQATRVPFFVYDPRQNLKKGRYTQ 375 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENIL--AVKEPRGVMVEFNRYEIEHDSFGGFIPVR 382 PV ID+ PT+ L+ + PEIL G+++L A + + + +F R + ++ G+ Sbjct: 376 PVELIDMFPTLCQLSGLAIPEILDGKSLLSEAAENAKFALSQFPRNQGKNKKIMGY---- 431 Query: 383 CWVTDDFKLVLNLFTSD-------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHD 429 + + ++ + + + ELYD DP E NL ++ + + ++ Sbjct: 432 GFRFERYRYIEWVDNNYQQDNTQLGPLKAVELYDYEKDPLEQVNLANNPEYKSILRRLQQ 491 Query: 430 ALLD 433 + Sbjct: 492 EAKE 495 >UniRef50_A6DHY1 Mucin-desulfating sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHY1_9BACT Length = 545 Score = 346 bits (888), Expect = 1e-93, Method: Composition-based stats. 
Identities = 108/458 (23%), Positives = 191/458 (41%), Gaps = 38/458 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q + +GC + T +ID L+ G+ F+S YT +P+C +RA TG Sbjct: 21 RPNIIMLLTDDQRYDTLGCMGNDQVKTPHIDKLSERGVTFDSHYTNTPICLGSRASTMTG 80 Query: 63 IYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGKWHL--DGHDYFGTGECP-P 116 +Y +G ++ + + + ++ GY T +IGK+ + +Y P Sbjct: 81 MYEYTNGCNFSHGFLSQELWDEMSYPVILRNNGYFTGFIGKFGFPVNAKNYHEYENLPID 140 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 +D Y + G Y K + + E + A +F+ + Sbjct: 141 SFDRWYGWTGQGYFDTSKNKYMVKFAK--------------EYPHVTLATAEAACEFIDE 186 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK-PEHHRLWAQA 235 + D+PF + +S+ H PF+ Y + Y D ++ + A K P +L Q Sbjct: 187 AQKQDKPFCLSLSFKASHKPFSPDPAYDDVYKDTVWKKRANYDEGGARKLPPQAKLGRQY 246 Query: 236 MP--SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH 292 + + Y +D +G+++ L +NT +IY +D+G G+H Sbjct: 247 LTIDDFAPEKYQESMRKYNQLIYGIDQAVGKIVEKLDQTGLSKNTVIIYATDNGYSCGSH 306 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 Y+ R P+II P+ ++ ++D+ PT+ LA I P + G+ Sbjct: 307 G-FGGKVLPYEGPARGPMIIMDPRSDQTGKRSKGVSGNVDIHPTICDLAGIAIPAKVDGK 365 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFKLVLNLFTSD------ELY 402 ++L V + + V R + +F G VT+D+K + F D ELY Sbjct: 366 SLLPVLKDSEIRV---RKAMPVFNFWGSAATHEMTMVTEDYKYIYWYFEGDGMVAAEELY 422 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 R D EM+NL+++ A +M + IRD Sbjct: 423 HRHKDSAEMNNLVNNPEMALKLEEMRQLFDAQVQHIRD 460 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 346 bits (888), Expect = 1e-93, Method: Composition-based stats. 
Identities = 107/447 (23%), Positives = 174/447 (38%), Gaps = 57/447 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L ++ D + C L + NID+LAA G+RF + Y S VC+P+RA L TG Sbjct: 42 PNILCILVDDLGYGDLSCQGATDLQSPNIDALAANGMRFTNFYANSTVCSPSRAALLTGR 101 Query: 64 YANQSG--------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 Y + G P N + + AGYHT IGKWHL + + P Sbjct: 102 YPDLVGVPGVIRQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLGLEE----PDTP 157 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + Y+ +L ++ + R G+N + L ID ++ +DFL+ Sbjct: 158 NDRGFTYF---KGFLGDMMDDYWDHRRGGINWMR-LNREEIDPKGHATDLFTDWTIDFLK 213 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + ++PF + ++Y+ PH P P E+L+K + L EK ++ Sbjct: 214 ERQGEEQPFFLYLAYNAPHFPIQPPREWLDKVREREPNLTEKRAKNV------------- 260 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM----G 290 A + +D +GRV+ AL ENT V++ SD+G + Sbjct: 261 -----------------AFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGGALWYAQS 303 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEILP 348 L MY+ R+P I D +DL PT LA + PE + Sbjct: 304 NGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPPENVD 363 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFG--GFIPVRCWVTDDFKLVLN-LFTSDELYDRR 405 G +++ + G G DFK++ N F + ++ Sbjct: 364 GISLVPTLTGQAQDTANRYLYWVRREGGDYGGQAYYAARFGDFKILQNTPFEPIQFFNIG 423 Query: 406 NDPNEMHNL-IDDIRFADVRSKMHDAL 431 D E L D + +R+++ + + Sbjct: 424 QDELETTPLETDSEAYRALRAQLMEHI 450 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 346 bits (887), Expect = 1e-93, Method: Composition-based stats. 
Identities = 100/463 (21%), Positives = 166/463 (35%), Gaps = 69/463 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D GCY +T +ID LA +G+RF AY +PVC+P RA L TG Sbjct: 31 KPNIVFILIDDMGCKDAGCYGATNFSTPHIDRLANQGMRFTDAY-AAPVCSPTRASLMTG 89 Query: 63 IYANQSG------------------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD 104 + + P N + T+ + GY IGKWHL Sbjct: 90 KHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHL- 148 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 G P + FD + Sbjct: 149 -----GEEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPFVDQQKWPYAGPLPGNPGDYLPD 203 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 R+++ A+DF+++ + PF + +S+ H + P + KY + E Sbjct: 204 RLTDEAIDFVRE--NRERPFFLYLSHWSVHGRYFAPESLIAKYRERGLE----------- 250 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTS 283 +Y A + VD+ +GR++ L +NT ++ S Sbjct: 251 ---------------------ERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMS 289 Query: 284 DHGEMM--GAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALA 339 D+G L ++Y+ R+PLI+R P PV DL PT + A Sbjct: 290 DNGGERITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPTFLDFA 349 Query: 340 DIEKPE-ILPGENILAVKEPR-GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 + + L G +I + + Y +G P +KLV +L T Sbjct: 350 ERSYRDNKLDGHSIAGLLTGEQSELDRDALYWHFPHYWGSTRPCSAMRQGRWKLVEHLET 409 Query: 398 SD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +LYD +DP E +L +++ +++ L + K+ Sbjct: 410 GRAQLYDLSSDPGEQRDLANEM--PQQATELRKMLAQWRTKVG 450 >UniRef50_Q5UEW6 Probable phosphonate monoester hydrolase n=1 Tax=uncultured alpha proteobacterium EBAC2C11 RepID=Q5UEW6_9PROT Length = 512 Score = 345 bits (886), Expect = 2e-93, Method: Composition-based stats. 
Identities = 124/476 (26%), Positives = 202/476 (42%), Gaps = 51/476 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD Q + +G + T N+D L EG F + + SPVC +RA +F G Sbjct: 22 KPNIVLIMTDQQRADTIGALGSPWMQTPNLDRLVNEGTSFTNCFVTSPVCVSSRASIFLG 81 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC-------- 114 Y + + +TN N ++ D+GYH IGK H++ +D G Sbjct: 82 GYPHTTNVYTNFETWEPN---WVKWLSDSGYHCVNIGKMHINPYDAKGGFHQRFFVENKD 138 Query: 115 ------------PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 EWD S T + R+ ID+ Sbjct: 139 RPLFLEDHERAIYDEWDKALKVRRLEKPSRYTR--VRDNRDAFLKNLGCFTWEIDDDMHP 196 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 + + N A +L +A+ PF + + + PH P+ ++L Y D + +Q +L Sbjct: 197 DNFVGNTASWWLNDR-KAESPFFLQIGFPGPHPPYDPTGDFLSIYKDTKFPHRAASQREL 255 Query: 223 ANKPEHHRLWAQAM-----------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP 271 +PE H+ Q+M + DD H Y A +D Q+G++++ L Sbjct: 256 EKQPEMHKQLRQSMIDFNIDSVAWRENLTDDDIQLLHRYYSANVSMIDCQVGQILSTLEQ 315 Query: 272 EQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSH 328 +NT VI+ SDH + +G H I K MYD +TR+PLI +P+ + Q V Sbjct: 316 RGYLDNTIVIFCSDHADALGEHGHIQKW-TMYDCVTRVPLIFWAPKTVKMQHQCADLVQL 374 Query: 329 IDLLPTMMALADIEKPEILPGENILAVKE--------PRGVMVEFNRYEIEHDSF-GGFI 379 +D+ PT++ A+IE P + + + P + E+ E+ D G Sbjct: 375 MDIAPTILNFANIEPPHNWEALALNKMLKTGCWDDQRPDHKLREYVYAELGRDHIQSGAE 434 Query: 380 PVRCWVTDDFKLVLNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 V + +K V+ + ELYD RNDP+E NL +D ++ D R + +L + Sbjct: 435 YVIMRRDEHWKYVIYPGNDTGELYDIRNDPHETVNLWNDPQYLDQRKEATIEILSW 490 >UniRef50_Q15XH4 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH4_PSEA6 Length = 517 Score = 345 bits (886), Expect = 2e-93, Method: Composition-based stats. 
Identities = 123/462 (26%), Positives = 201/462 (43%), Gaps = 24/462 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN L +M D + G P+ T NID+LA +G RF++A T + +C+P+RA LFT Sbjct: 33 KQPNVLVLMFDDMRFDTFSYRGG-PVPTPNIDALANDGTRFDNAMTTTGLCSPSRAALFT 91 Query: 62 GIYANQSGPWTNNVAPGKNISTMGR-------YFKDAGYHTCYIGKWHLDGHD------Y 108 G + +++G N ++ + D GYH Y+GKWHL Sbjct: 92 GRWGHKTGLDDNVGLYHSHVDELSEEEGGVIRRAADTGYHVGYVGKWHLGPQGPALRGAD 151 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 F G+ + + +++ + E Q + + Sbjct: 152 FMWGKEHSQARHSRPYVPYEKQAKMAQYNRGERDENGEKHEYYQTLPGTYETSHTAENVD 211 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP-- 226 L++ A+ DEPF V+S+++PH P+ P Y + +L KP Sbjct: 212 MGQKMLREAAKMDEPFFGVISFEQPHPPYRVPEPYASMFDPKTVKLPANHAVKRQFKPMA 271 Query: 227 EHHRLWAQAMPSPVGD-DGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSD 284 + W + D D Y+ +D +G +I ++ +I D Sbjct: 272 QDEDWWPWHDVGHMTDMDWRKSRTFYYGAIAMIDHAVGDIIKTAKDVGMYDDLTIIVLGD 331 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP 344 G M+G H L KG YD++ R+PLIIR+P E R V+ VS +D+ PT+ + +E Sbjct: 332 QGSMLGEHNLYDKGPYAYDELMRMPLIIRAPNVEPRIVNKQVSMLDIAPTISEMMSLEPD 391 Query: 345 EILPGENILAVKEP----RGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS-D 399 + G +++ + E V+ Y E GG+ +R T + K V N + D Sbjct: 392 GDVDGRSLVNLMEQGDIADKGRVDQALYAYE-WYNGGWFGIRALRTPEMKFVWNPGDNRD 450 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ELYD +NDP E++NLI D ++ M + D + +I+DP Sbjct: 451 ELYDLKNDPIEVNNLIKDKKYTKQLRHMVQLMEDELVRIKDP 492 >UniRef50_O69787 Choline-sulfatase n=53 Tax=Alphaproteobacteria RepID=BETC_RHIME Length = 512 Score = 345 bits (886), Expect = 2e-93, Method: Composition-based stats. 
Identities = 110/489 (22%), Positives = 197/489 (40%), Gaps = 34/489 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D + L+ N+ +LA RF++ YT SP+C PARA G Sbjct: 5 KPNILIIMVDQLNGKLFPDGPADFLHAPNLKALAKRSARFHNNYTSSPLCAPARASFMAG 64 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +++ + N +I T + + AGY+T GK H G D E D Sbjct: 65 QLPSRTRVYDNAAEYQSSIPTYAHHLRRAGYYTALSGKMHFVGPDQLHGFE--ERLTTDI 122 Query: 123 WFDGANYLSELTEKE--ISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + + + + I W + L SV I + ++ A L Q +R Sbjct: 123 YPADFGWTPDYRKPGERIDWWYHNLGSVTGAGVAEITNQMEYDDEVAFLANQKLYQLSRE 182 Query: 181 D-----EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + P+ + VS+ PH P+ ++ + Y D + E L + H + + Sbjct: 183 NDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDCEHLTPEVGAIPLDEQDPHSQRIMLS 242 Query: 236 MP----SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMG 290 ++ YFA ++D+++G +I+ LT ++T +++ SDHG+M+G Sbjct: 243 CDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLDDTLILFCSDHGDMLG 302 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPE---IL 347 L K ++ R+PL+I P TP S++D+ PT+ LA I E Sbjct: 303 ERGLWFK-MNFFEGSARVPLMIAGPGIAPGLHLTPTSNLDVTPTLADLAGISLEEVRPWT 361 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G +++ + + +E+ + + P+ +K V ++L+D D Sbjct: 362 DGVSLVPMVNG---VERTEPVLMEYAAEASYAPLVAIREGKWKYVYCALDPEQLFDLEAD 418 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW----RKDARPRWMGA 463 P E+ NL ++ R ++ + RD R+ W + + R+ RW+ Sbjct: 419 PLELTNLAENPRGPVDQATLTA--------FRD-MRAAHWDMEAFDAAVRESQARRWVVY 469 Query: 464 FRPRPQDGY 472 R Y Sbjct: 470 EALRNGAYY 478 >UniRef50_B4D0V9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D0V9_9BACT Length = 497 Score = 345 bits (886), Expect = 2e-93, Method: Composition-based stats. 
Identities = 108/457 (23%), Positives = 184/457 (40%), Gaps = 31/457 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++ PN LF+++D Q + + + T N+D L G F A P+C +RA + Sbjct: 41 VRHPNILFIISDDQRPDTIAALGNPIIQTPNLDRLVHGGTAFTRAVAAYPICYVSRAEIL 100 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKW---------HLDGHDYFGT 111 T + A ++G ++T + AGYHT ++GKW + T Sbjct: 101 TSVCAFRNGVGYTGNKLDPKLATWSGTLRSAGYHTWFVGKWDNGATPKAYGYEETRGLYT 160 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 G P + + D A + +G E D + + ++ A+ Sbjct: 161 GGGAPLQNTPSYVDHAGRPATGYRGYTFKTDDGKPLPELGVGLTPDISRHF----ADAAI 216 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD----DLANKPE 227 DF+++ + EPF + V++ PH P P + KY L + + D N Sbjct: 217 DFIER--KPAEPFFLHVAFTAPHDPRLLPPGWETKYDPKTMPLPKNFRSVHPFDHGNMGG 274 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHG 286 + + P D+ Y+A +D+QIGR++ AL Q +NT +I+TSD G Sbjct: 275 RDEVLLASPRRP--DEVRAELAAYYAAISGMDEQIGRIVEALKSTGQLDNTLIIFTSDQG 332 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADIEKPE 345 +G+H LI K +Y+ +PLI+ P + + DL PT + I P Sbjct: 333 LAVGSHGLIGK-QNLYEHTLGVPLIMSGPGIPKGETREAQCDLRDLFPTTCEVTGIATPP 391 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDR 404 + G +++ V V D R +KL+L +L+ Sbjct: 392 AVQGRSLVPVLRDAQKTVYPFVVGYYTD------AQRAIREGTWKLILYPKAKRTQLFYL 445 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +DP+EMH+L A + + LL ++ + DP Sbjct: 446 ASDPDEMHDLSAQPEQARRLADLRIKLLGWLKENGDP 482 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 345 bits (886), Expect = 2e-93, Method: Composition-based stats. 
Identities = 104/485 (21%), Positives = 180/485 (37%), Gaps = 69/485 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D +GCY + + T NID LAA+G+RF YT VC P+R L TG Sbjct: 26 KPNVIFILADDLGIGDLGCYGQQKIRTPNIDHLAADGMRFLQHYTGCSVCAPSRCALMTG 85 Query: 63 IYANQSGPWTNNV---------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 + + N ++ T+ R ++AGY+T IGKW L + + Sbjct: 86 RHMGHAAIRDNAQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLGMPEDHSS-- 143 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH---------IDETFTWAH 164 P + +Y F T LWRN ++ Sbjct: 144 -PRDMGFNYSFGYLCQSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEPKGEIYSHD 202 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 +++ A+ F++ D+PF + +++ PH P + + +Y + E + AN Sbjct: 203 VMASDALKFVRD--HHDKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPFRNTKHYAN 260 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTS 283 Y +D +GR++ L +NT V ++S Sbjct: 261 N-------------------ETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVFFSS 301 Query: 284 DHG-----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHID 330 D+G +Y+ R PLI R P V D D Sbjct: 302 DNGAVFPLAGTDPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASVFYD 361 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRCWVTDDF 389 LPTM L + P G + L + + + + E+ S GG + VR D+ Sbjct: 362 FLPTMAELNGVPPPADTDGLSYLPTLLGKPAQQKQHDFLYWEYQSAGGAVAVRM---GDW 418 Query: 390 KLVLNLFTSD-----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 K + N + E+Y+ +D E H++ ++ +K + + + P + Sbjct: 419 KAIANKIKKNPNANFEVYNLASDRTESHDVAA--EHPEIVAKAREIIAR--EHTPSPIKE 474 Query: 445 YQWSL 449 + ++L Sbjct: 475 WNFTL 479 >UniRef50_A4GIB2 Putative secreted sulfatase n=1 Tax=uncultured marine bacterium HF10_49E08 RepID=A4GIB2_9BACT Length = 667 Score = 345 bits (885), Expect = 3e-93, Method: Composition-based stats. 
Identities = 107/499 (21%), Positives = 189/499 (37%), Gaps = 86/499 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F + D + VGCY K T ID LA EGIRF++AY+ VC+P+RA + T Sbjct: 22 RKPNIVFFLVDDLGWSDVGCYGSKFHETPAIDQLAKEGIRFDNAYSTCHVCSPSRASILT 81 Query: 62 GIYANQSG--------------PWTNN---VAPGKNISTMGRYFKDAGYHTCYIGKWHLD 104 G Y ++ P + A T+ K GY T GK HL Sbjct: 82 GKYPARTNLTEWLGGRPERDYEPLHHGEKLTALPDEEVTLAETLKSHGYATANYGKAHLR 141 Query: 105 -GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 + +G E W Y + G E L A D + Sbjct: 142 VDPNAYGFDEEITGWVRSYHYPF-----------------GGAYNEKLPAKKGD---YYT 181 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD-- 221 ++++ A+DF+++ D PF + + + H P + +EKY + ++ D Sbjct: 182 DKLTDAALDFIER--NKDRPFFVHLEHFAVHDPIQGRPDLVEKYRKKLAAMPKQDGPDFI 239 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHP--------------LYFACNDFVDDQIGRVIN 267 L + P+ L + + + +D L H + + D+ +GR+ Sbjct: 240 LESNPDGPELTTEELKALAENDELQDHQDARVWWVKQKQDNVEFAGMLEATDESLGRIRK 299 Query: 268 ALTPEQR-ENTWVIYTSDHGEMMGAH---------------------KLISKGAAMYDDI 305 L +NT VI+T+D+G M ++ L Y+ Sbjct: 300 KLKDLGLADNTIVIFTADNGGMSASNQYRGINHPIESLDSRFASSNLPLRGAKGWNYEGG 359 Query: 306 TRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIE--KPEILPGENILAVKEPRGV 361 R+PL++ P + + V+ D PT++ + + + + G + L + Sbjct: 360 IRVPLVVYWPGRIKPDSTSNALVTGTDFYPTLLEMIGMPTLPNQHIDGVSFLPALRGKAH 419 Query: 362 MVEFNRYEIEHDSFGGFI-PVRCWVTDDFKLV-LNLFTSDELYDRRNDPNEMHNLIDDIR 419 + H S G+ P +KL+ S +L+D D E ++L Sbjct: 420 DRGAIYWHFPHYSNHGYQSPGGAIRLGKYKLLEYYENGSVQLFDLEKDIGEQNDLSK--T 477 Query: 420 FADVRSKMHDALLDYMDKI 438 DV++K+ L ++ ++ Sbjct: 478 KPDVKAKLLKMLHEWRREV 496 >UniRef50_A6DFZ4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFZ4_9BACT Length = 519 Score = 345 bits (885), Expect = 3e-93, Method: Composition-based stats. 
Identities = 121/489 (24%), Positives = 197/489 (40%), Gaps = 57/489 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N L + D + CY K + NIDSLA G F S Y VC P+R +FT Sbjct: 19 EKANVLIITIDDLKP-TLACYGDKYAVSPNIDSLADNGTLFRSNYCQQAVCAPSRISMFT 77 Query: 62 GIYANQSGPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGK-WH-------------LD 104 G+ + +G + NI TM +YFK+ GY + GK H L Sbjct: 78 GLRPDTTGILDLHTHMRDINPNILTMPQYFKENGYLSIGYGKLMHGAKNDDKELSWSELG 137 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLW-------RNGLNSVEDLQANHID 157 + P D +L + + L + +A + Sbjct: 138 DDLPYNKNHPKPVLDKFQNPKAHQVFKKLNKTQKRLKTSLLQKEMKNKGAYLVSEAYDLP 197 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 + ++ + L + A E F MV+ +++PH PF P +Y + Y L E Sbjct: 198 DDAYRDGAVAKAGIQRLNELAETKEKFFMVLGFNKPHLPFNAPKKYWDMYDPNKLPLAEH 257 Query: 218 AQDDLANKPEHHRL-------WAQAMPSPVGDDGLYHH--PLYFACNDFVDDQIGRVINA 268 + D +P++ + D+ H Y+AC +VD Q+GRV++ Sbjct: 258 QKQD-QQRPKYAYHSFGELAAYKDYQIGKAVDEKRQRHLIHAYYACVSYVDAQVGRVMDE 316 Query: 269 LTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT-PV 326 L + NT V+ DHG +G H L K + ++ TR PLII +P ++ QV P Sbjct: 317 LKRLNLDKNTIVVLWGDHGWHLGDHGLWCKHSN-FEQATRAPLIISAPNQKKGQVSQSPT 375 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKE-PRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 ID+ P++ L +E PE L GE++ + E P+ + +++ + + G+ Sbjct: 376 EFIDIFPSLCKLTGLEIPEQLEGEDLSPILEDPKAKVKDYSISQYLRWANHGY----TMR 431 Query: 386 TDDFKLVL--------------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + ++L L N ELYD + DPNE N ++ +A+V K+ Sbjct: 432 SGKYRLTLWMPKNYYGFMKFDENDIVEVELYDYQKDPNETTNFANNPEYAEVLRKLKKQF 491 Query: 432 LDYMDKIRD 440 Y D Sbjct: 492 ASYFASQYD 500 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 345 bits (885), Expect = 3e-93, Method: Composition-based stats. 
Identities = 105/478 (21%), Positives = 184/478 (38%), Gaps = 84/478 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN + D +GCY + T ++D LA+EGIRF + Y+ SPVC+P+RA L Sbjct: 1 MKRPNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLL 60 Query: 61 TGIYANQSGPWT------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 TG Y ++G + +T+ K+ GYHT GKWHL +G Sbjct: 61 TGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAEYGPN-- 118 Query: 115 PPEWDADYWFDGA--NYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 +D Y F +Y S + N ++ + + + I+ A Sbjct: 119 AHGFDQFYGFRAGCIDYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREATS 178 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 ++ A DEP+ M V+Y+ PH+P P YL+++ D + Sbjct: 179 YID-AAPDDEPYFMYVAYNAPHYPMHAPKAYLDRFPDLPPD------------------- 218 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG----- 286 + A VDD +G ++ AL + E+T + ++SD+G Sbjct: 219 ---------------RRIMAAMIAAVDDGVGEIVKALKQKGAYEDTIIFFSSDNGPSTES 263 Query: 287 -----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQG----ERRQVDTPVSHIDL 331 A + A++++ R P I+ P G + + D + +D+ Sbjct: 264 RNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQQGQISDEMFAMMDI 323 Query: 332 LPTMMALADI-EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 PTM+ L+ I + L G ++ + F + +K Sbjct: 324 FPTMLELSGIGTEGYSLDGHSVFDALSGNALSPR-------KQLFWEYEGQLAVREGKWK 376 Query: 391 LVLNLF--------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 LVLN + L D D +E NL+ ++ ++ ++ + + +++ Sbjct: 377 LVLNGKLDFSRTEADAVHLSDLEQDSSERINLVK--QYPEIAQRLERDVRQWYQSLQE 432 >UniRef50_A6DG71 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG71_9BACT Length = 515 Score = 344 bits (883), Expect = 4e-93, Method: Composition-based stats. 
Identities = 109/492 (22%), Positives = 197/492 (40%), Gaps = 68/492 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLN---TQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 +PN LF+M D +GCY + T ID LA++GI+F++ + + +CTP+RA Sbjct: 28 SKPNILFIMADDHTKQAIGCYGSRLSKLNPTPTIDRLASQGIQFDNVFCSNAICTPSRAS 87 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG Y+ +G N + G + + + K AGY T IGKWHL C Sbjct: 88 IITGQYSQTNGVLDLNGSIGPDKQFLPKEMKKAGYETAMIGKWHLKKEPATFDYYCVLPG 147 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 G + + W + +D + + I++ ++ +L+ Sbjct: 148 ------QGLYHNPIFNIRGSKPWPKNTITKKDQHS---------SDAITDISLHWLKNER 192 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP------------ 226 +PF ++ + PH F EY ++Y + ++ + L + P Sbjct: 193 DKSKPFFLMHHFKAPHDMF----EYAKRYESYLEDVHIPEPESLFSVPAGSAGSKDLGSG 248 Query: 227 --EHHRLWAQAMPSPVGDDG----------LYHHPLYFACNDFVDDQIGRVINALTP-EQ 273 ++H W V DD + Y C +DD I R+++ L Q Sbjct: 249 LSKNHNPWQLPQKLGVSDDIPEPEYTRLSYQKYLKAYLRCVKGIDDNIARLLSYLKDSNQ 308 Query: 274 RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDL 331 +NT +IYTSD G +G H LI K MY++ +P I+ +P + + + +++ D Sbjct: 309 LDNTIIIYTSDQGFFLGEHNLIDK-RWMYEEAMGMPFIVYAPGMIKNNFKNNCLINNTDF 367 Query: 332 LPTMMALADIEK-PEILPGENILAVKEPRGVMVE-----FNRYEIEHDSFGGFIPVRCWV 385 PT++ +A ++K P + G++ + E + RY + Sbjct: 368 APTLLEIAGLKKTPNYMQGKSFYKALSNQQKPDEWRTVTYYRYWMHMAHKLAVPAHFGIR 427 Query: 386 TDDFKLVLNLFTS------------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 ++ KL+ E YD DP EM N + + ++ ++ LL+ Sbjct: 428 SESHKLIFFYGRKYGRRGGKPTPISWEFYDLDKDPKEMKNEYKNPEYKEIIKRLKTQLLE 487 Query: 434 YMDKIRDPFRSY 445 + + + Y Sbjct: 488 IRKDLNEEDKKY 499 >UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C586_BEUC1 Length = 478 Score = 344 bits (883), Expect = 5e-93, Method: Composition-based stats. 
Identities = 115/464 (24%), Positives = 188/464 (40%), Gaps = 45/464 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+ D +GC+ T +ID+LAA G RF +Y +PVC+P RA L TG Sbjct: 15 RPNIVLVVVDDLGWRDLGCFGSTFYETPHIDALAASGTRFTHSYAAAPVCSPTRASLLTG 74 Query: 63 IYANQSGP--WTNNVA------------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 Y + G W A ++ + R + GY T ++GKWHL G Sbjct: 75 KYPARVGVTNWIGGHAIGALRDVPYFHGLPQDEYALARALRAGGYRTWHVGKWHLGGGR- 133 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 P D G+ S ++ G+ ++ED + R+++ Sbjct: 134 ----HLPEHHGFDLNVGGSASGSPVSYYAPY----GIGALEDA-----PDGEFLTDRLTD 180 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 AVD ++ + D PFL+ + + H P P +EKY LG A + Sbjct: 181 VAVDLVR--SSDDAPFLLNLWHYAVHTPIEAPAHLVEKYRHKAETLGLPTHGPDAVEAGE 238 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE 287 H V + P Y A + +D +GR++ AL + ++T +++TSD+G Sbjct: 239 HMPARHLRSERVRRRRIQSDPTYAAMLETLDGAVGRLVTALRDVGKLDDTLIVFTSDNGG 298 Query: 288 MMGAHK-------LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMAL 338 + A L M D TR+P I+ P + D P + D PT++A Sbjct: 299 LSTAEGSPTCNAPLSEGKGWMADGGTRVPTIVSWPGRVPAGARSDLPFTSPDFYPTLLAA 358 Query: 339 ADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A + + + G N+ + + + H S G P +KLV + Sbjct: 359 AGLTQLPEQHVDGVNLWPAWQGAPLDRGPIFWHYPHYSNQGGAPSAAVRDGRWKLVRHFG 418 Query: 397 TS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 DEL+D D +E H++ R DV +++ L ++ + Sbjct: 419 IEHDELFDVVADVSESHDVSG--RRRDVVARLSVTLDSWLADVG 460 >UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7US96_RHOBA Length = 498 Score = 344 bits (883), Expect = 5e-93, Method: Composition-based stats. 
Identities = 108/515 (20%), Positives = 186/515 (36%), Gaps = 94/515 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L + D +GCY + T ID LAAEG+RF + Y VC+P R L +G Sbjct: 31 QPNILLIFIDDLGWKDIGCYGNDFVETPRIDQLAAEGLRFTNFYASGAVCSPTRCALQSG 90 Query: 63 IYANQSGPWTN----------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 + G + +A + T+ K +GY T Y+GKWHL Sbjct: 91 QNQARIGITAHIPGHWRPFERVITPQTTMALPLDTVTIAESLKASGYTTGYVGKWHLGNG 150 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 F +D G + + S + N Sbjct: 151 PEFQPDRQ--GYDFSAVIGGPHLPGRYRVQGRSDLKPKPNQ-------------YRTDFE 195 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 ++ +DF++Q D+PF +++S H P E ++KY + G Sbjct: 196 ADLCIDFMRQ--NKDQPFFLMLSPFAVHIPLAAMSEKVQKYEAMAKQTGNSLP------- 246 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDH 285 HP+Y A + DD +GR++++L + ++T +++TSD+ Sbjct: 247 ---------------------HPVYAAMIEHCDDMVGRLVDSLEQLDIADDTMIVFTSDN 285 Query: 286 GEMM--------------GAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHI 329 G + L + ++++ R+PLIIR P D P Sbjct: 286 GGLYKRYDYRESADDLVSSQAPLKGEKGSLHEGGIRVPLIIRHPATVKSAGVCDEPTISH 345 Query: 330 DLLPTMMALADIEKP--EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 D PT + +A E P + + G ++L + ++ + + + P Sbjct: 346 DFYPTFVEMAGGELPINQTIDGHSLLPLMTAPTQTLDRDALHWHYPHYHHDRPASAIRER 405 Query: 388 DFKLVLNLFTSD--ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI------- 438 D+KL+ L + ELY+ +D E NL + + + L + + Sbjct: 406 DWKLIEYLDGTGDVELYNLADDLGETKNLASEKQ--GRAGDLKRKLTTWRSSVLARTPIP 463 Query: 439 ---RDPFRSYQWSLRPWRKDARPRWMGAFRPRPQD 470 DP R+++W K F P +D Sbjct: 464 NPSYDPERAHEWWNLKSGKPVPSEQRKRFPPTEKD 498 >UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TKJ5_SHEHH Length = 492 Score = 344 bits (882), Expect = 6e-93, Method: Composition-based stats. 
Identities = 110/470 (23%), Positives = 200/470 (42%), Gaps = 47/470 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + D + Y + T NID LAAEGI+F Y+ +P+C+P+RAG+ TG Sbjct: 29 KPNVVIFYVDDLGYGDLATYGHNIVKTPNIDKLAAEGIKFTQYYSPAPLCSPSRAGMLTG 88 Query: 63 IYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 ++G NV GK T+ KD GY T GK HL+G + + Sbjct: 89 RTPYRTGIRSWIPDGQNVHIGKEEITLAHMLKDEGYDTAITGKLHLNGGAHMKDHPQASD 148 Query: 118 WDADYWFDGANYLSELTEKEI----SLWRNGLNSVEDLQANHIDETFT---WAHRISNRA 170 ++ F ++ + E R+G V++ N + T A ++N A Sbjct: 149 LGFEHSFIIPGGWAKNAKTEAKNADGSLRHGKIHVDNFWRNGVPVGETDQFSADLVANEA 208 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 + +L D+PF + V + E H P P +YL+ Y D+ + ++ D H Sbjct: 209 IGWLDDQG-GDKPFFLYVPFSEVHTPIASPQKYLDMYGDYLTDFAKENPDLF------HW 261 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE-- 287 W G+ YFA ++D Q+GRVI+ L + +NT ++++SD+G Sbjct: 262 DWVNQPYRGQGE--------YFANITYMDAQLGRVIDKLKAMGEYDNTIILFSSDNGPVT 313 Query: 288 ----------MMGA-HKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPT 334 M G L + +++ R+P+I++ + + D P+ +D++PT Sbjct: 314 REARKPYELNMAGETGGLRGRKDNLFEGGIRVPMIMKYHGHVKAETDSDEPIYGLDIVPT 373 Query: 335 MMALADIEKPE--ILPGENILAVKEPRG-VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 + L + P + G + ++ + + I+ I DFKL Sbjct: 374 LSELIGFDTPSDRTIDGVSFVSTFNGLSVERTKPMIWTIDMPYQDDAINEYAVRIGDFKL 433 Query: 392 VLNLFTSDE-LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +++ +++ L++ D E++NL++ + ++ A Y I + Sbjct: 434 IIDRQGNNKYLFNIGQDKYEVYNLLNKPEYKAKVEELTTAYQAYRKDIEN 483 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 344 bits (882), Expect = 6e-93, Method: Composition-based stats. 
Identities = 93/471 (19%), Positives = 175/471 (37%), Gaps = 74/471 (15%) Query: 2 KRP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++P N +F + D +GCY + NID LAAEG++F Y+ C+P R L Sbjct: 31 EKPWNVVFFLVDDLGWTDLGCYGSDFYQSPNIDQLAAEGMKFTQNYSACNACSPTRGALL 90 Query: 61 TGIYANQSGP------WTNNV------------APGKNISTMGRYFKDAGYHTCYIGKWH 102 TG+Y ++ W + + +T+ + AGY T ++GKWH Sbjct: 91 TGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEALRTAGYQTFHVGKWH 150 Query: 103 LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 L G G P + D G N L + + + Sbjct: 151 LGGR-----GNLPQDHGFDVNISGTN--RGLPRSYHFPYGGDAMKWDSSLTEAERQDRYL 203 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 R+++ AV ++Q + D+PF + S+ H P + ++KY Sbjct: 204 TDRMADEAVALIRQ--QQDKPFFLYCSFYSVHSPIQGRPDLVKKYKGLPAG--------- 252 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 + +P Y A VD+ IGRV L + T +++ Sbjct: 253 ---------------------KRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVF 291 Query: 282 TSDHGE----MMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTM 335 TSD+G L + ++ TR+P I+ P P+ +D PT+ Sbjct: 292 TSDNGGVRRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTI 351 Query: 336 MALADI----EKPEILPGENILAVKEPRGVM--VEFNRYEIEHDSFGGFIPVRCWVTDDF 389 + + + E + + G +++ + + E + H + +P ++ Sbjct: 352 LNITGVAGNTEHNQSVDGLSLVPLLKDPAATLNREALYWHYPHYNVFIGVPYSAIRVGEY 411 Query: 390 KLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 KL+ DELY+ D +E ++ + ++ +++ L ++ ++ Sbjct: 412 KLIHYYEDGNDELYNLAEDLSETSDVSK--TYPELTARLERRLQQHLKQVG 460 >UniRef50_A0LK86 Sulfatase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LK86_SYNFM Length = 487 Score = 343 bits (881), Expect = 7e-93, Method: Composition-based stats. 
Identities = 122/447 (27%), Positives = 187/447 (41%), Gaps = 32/447 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN L + D + +GC G P + T NID LA G+ F +A SP+C+P+RA FT Sbjct: 47 KPNVLMFVLDDM-NDWIGCLGGHPDVKTPNIDRLAQRGVLFRNAQCSSPICSPSRASFFT 105 Query: 62 GIYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE 117 GI + SG + N+ A K N T+ ++F GY + GK +H D E P Sbjct: 106 GIRPSTSGIYGNSQAFRKIMPNAVTLPQHFIAHGYRSMGCGKLFHFIKTDSRSWHEFFPS 165 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID--ETFTWAHRISNRAVDFLQ 175 + FD + L+ GL V ID + +++ A D L+ Sbjct: 166 RSMERPFDPVPPNAPLS---------GLPDVNQFDWGPIDIVDEELGDGKLARWAADALR 216 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + R D PF + V PH P P +Y + Y L +DL + P WA+ Sbjct: 217 R--RYDRPFFLGVGLLRPHVPLYVPRKYFDMYPPESITLPTVKANDLDDVPPTGVSWAKP 274 Query: 236 MPSPV---GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGA 291 + D Y A FVD Q+G V++AL NT V+ D+G +G Sbjct: 275 ERHQLIVEHDQWRKAVAGYLASVSFVDAQVGWVLDALDESPYVNNTVVVLWGDNGWHLGE 334 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 KL ++++ R+PLII P R+ PVS +D+ PT+ L D+ L Sbjct: 335 -KLHWTKLTLWEESCRVPLIIALPGLTPPGRKCAKPVSTMDVYPTLNELCDLTPKPELEC 393 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +IL + + G + ++ + ++ELYD + DP Sbjct: 394 RSILELLRNPQSDTWDGPPALSTYMPGN----HSLRDERYRYIRYNDGTEELYDLKADPM 449 Query: 410 EMHNLI--DDIRFADVRSKMHDALLDY 434 E +NL+ A VR ++ L + Sbjct: 450 EWNNLLAGGGTGPAGVRDRLSAFLPKF 476 >UniRef50_C6LAI4 Arylsulfatase n=6 Tax=Bacteria RepID=C6LAI4_9FIRM Length = 481 Score = 343 bits (881), Expect = 7e-93, Method: Composition-based stats. 
Identities = 110/459 (23%), Positives = 200/459 (43%), Gaps = 32/459 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF+MTD + +G + T +D+LAA+G+ F++AY+ P C PARA L T Sbjct: 4 KKPNILFIMTDQLRGDCLGIAGHPDVKTPYLDTLAAKGVLFSNAYSACPSCIPARAALHT 63 Query: 62 GIYANQS-GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH---------------LDG 105 G+ + + T+ AGY+T +GK H DG Sbjct: 64 GMLPEHHRRVGYQDGIAWRYEHTLAGELSRAGYYTQCVGKMHVHPLRNYLGFHNVELHDG 123 Query: 106 HDYFGTGECPPEWDADY-WFDGANYLSELTEKEISLWRNGLNSVEDL-QANHIDETFTWA 163 + ++ P ++ + D +L E +GL+ + + +E + Sbjct: 124 YLHYARYGSVPYRESQHVADDYYYWLKEQKGISADPMESGLDCNSWVARPFPYEEKYHPT 183 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 + +++R++DFL++ D+PF ++ SY PH PF P Y + Y D + Sbjct: 184 NWVTDRSIDFLRRR-DPDQPFFLMASYLRPHPPFDAPAYYFDLYKDKKLTPPYVGDWEDT 242 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPL-YFACNDFVDDQIGRVINALTPEQREN-TWVIY 281 + ++ P ++ + + Y+AC +D QIGR++ ALT + +N T + + Sbjct: 243 KLLKERGRIFDSLTGPEDEELIRQAQIGYYACITHLDHQIGRLLMALTEHELQNDTMIFF 302 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRS-PQ----GERRQVDTPVSHIDLLPTMM 336 T+DHGE + H K + Y IPLII P+ D D++PT++ Sbjct: 303 TADHGEELCDHHHFRK-SLPYQGSIHIPLIISGNPELTGFAPHSVCDEVTELCDIMPTLL 361 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 +A + P+ + G+++LA + G Y S+G D + Sbjct: 362 DIAGADIPDRVDGKSLLAFADGEGR-----EYLHGEHSYGELSNHYIVTKKDKFCWFSTS 416 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 ++ + DP+E+H+ I+D + + + + L+ + Sbjct: 417 GTEHYFVLEEDPHELHDRIEDPACRERIAYLRNCLIREL 455 >UniRef50_C0W1U3 Sulfatase n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W1U3_9ACTO Length = 482 Score = 343 bits (881), Expect = 8e-93, Method: Composition-based stats. 
Identities = 129/510 (25%), Positives = 200/510 (39%), Gaps = 56/510 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PNF+ +TD Q + L T N+D L+ E F++ Y SPVC+PAR L T Sbjct: 3 KQPNFVIFVTDDQGPWATSEHW-PELQTPNLDQLSKESSTFSNYYCASPVCSPARGTLLT 61 Query: 62 GIYANQSGPWT----------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G + G I T+ D GY+ +GKWH+ Sbjct: 62 GRMPSAHGIHDWLVGGRHPDALEEPFLDGIITLPEVLDDNGYYCGMVGKWHVG------- 114 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P YW+ A+ +W D N E + I+ A Sbjct: 115 TSQTPAPGFSYWY--AHRYGGGPYYNAPIW--------DENGNEATEPKYFTDAIAENAC 164 Query: 172 DFLQQPA--RADEPFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQDDLANKPE 227 DF+Q A ++PF ++V++ PH P+ P E ++ YAD + +++ + Sbjct: 165 DFIQSAASVNEEKPFFLMVNFTAPHSPWINNHPQELMDLYADTDF--PSIPREEPHPWTK 222 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 ++ +A A PV Y A VD+ +G ++ AL +NT V+Y SD+G Sbjct: 223 YYDDFADAFADPVP-----SLRGYAASLTGVDNAVGDILKALEENAYADNTVVMYMSDNG 277 Query: 287 EMMGAHKLISKGA-----AMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALAD 340 G H + KG +++ R+P II P E R+VD VS T+ LA+ Sbjct: 278 FSCGQHGIWGKGNGTFPLNFWENSVRVPFIIHLPGQHEYRKVDDHVSACSFFETVCELAE 337 Query: 341 IEKPEI-LPG-ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 I PE L G +I + RG + + + + D +GG R D K + Sbjct: 338 ITPPEDPLRGARSIADLA--RGEIRDSDEPVMVFDEYGGG---RMIRYGDLKFIDRFDGP 392 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARP 458 ELYD +NDP E++NL+ D + VR ++ + + R P Sbjct: 393 QELYDLKNDPAELNNLVHDESYEKVRDELASLMGQWFAAHETEVYRAFHRDIRGRGQVHP 452 Query: 459 RWMGAFRPR---PQDGYSPVVRDYDTGLPT 485 G R + + LP Sbjct: 453 PHAGYNDTRTYVTDNENRDGDNTHKKALPV 482 >UniRef50_C6IGG0 Iduronate 2-sulfatase n=2 Tax=Bacteroides RepID=C6IGG0_9BACE Length = 482 Score = 343 bits (881), Expect = 8e-93, Method: Composition-based stats. 
Identities = 111/449 (24%), Positives = 187/449 (41%), Gaps = 29/449 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + R N LF+M D +GCY + + T N+D LA+ G+ F +AY PV +RA L Sbjct: 32 VSRMNVLFLMADDMRPE-LGCYGVEAVKTPNMDRLASSGVLFQNAYCNVPVSGASRASLL 90 Query: 61 TGIYANQSGPWTNNVAPG----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 TG+Y + + N A + +F GYHT GK D+ + PP Sbjct: 91 TGVYPHYPDRFVNFSAYASKDCPEAIPLSGWFTKNGYHTVSDGKVFHHMSDHAASWSEPP 150 Query: 117 EWDAD-----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 + YW + + + + ++ + +T +++ RA+ Sbjct: 151 YRNHPDGYDVYWAEYNKWELWMNSESGKTINPKTMRGPFCESADVPDTAYDDGKLAERAI 210 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 L++ ++PF + + +PH PF P +Y + Y L PE R Sbjct: 211 RDLRRMKEMNKPFFLACGFWKPHLPFNAPKKYWDLYKREEIPLAPNRFRP-EGLPEQVRN 269 Query: 232 ------WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSD 284 +A+ + D Y+AC +VD QIG+V++AL ENT V+ D Sbjct: 270 SSEIYAYARVSDTSDADFQREVKHGYYACLSYVDAQIGKVLDALDELGLAENTIVVLLGD 329 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP 344 HG +G H + K M D T +PLIIR P ++ + + V +DL PT+ L I +P Sbjct: 330 HGWNLGEHDFVGKHNLM-DRSTHVPLIIRVPGRKKGKTRSMVEFVDLYPTLCELCQIPQP 388 Query: 345 -EILPGENILAV---KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE 400 E L G++ V + + + ++E ++ W+ D K + Sbjct: 389 AEQLDGQSFAKVFSNLKAKTKDEVYIQWEGGDNAVDQRFSYAEWMKGDVK------KASM 442 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHD 429 L+D R D E N +++ ++ + Sbjct: 443 LFDHRIDKEENKNRVNEKKYKSKVESLSS 471 >UniRef50_B6A548 Choline-sulfatase n=1 Tax=Rhizobium leguminosarum bv. trifolii WSM2304 RepID=B6A548_RHILW Length = 503 Score = 343 bits (880), Expect = 9e-93, Method: Composition-based stats. 
Identities = 104/445 (23%), Positives = 184/445 (41%), Gaps = 16/445 (3%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF+ D + Y N++ +A G+ F +AY P+C P+R + G Sbjct: 7 PNILFIQVDQLTAASLSAYGDTVCRAPNLERIADTGVVFETAYCNFPLCAPSRFSMAAGQ 66 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 + G + N +I T Y + AGY T GK H G D F E D + Sbjct: 67 LCSTIGAYDNAAEMPASIPTYAHYLRAAGYQTALSGKMHFIGPDQFHGFE--KRLTPDLY 124 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD-- 181 +++ + N +V + ++ +A+ L AR+D Sbjct: 125 PADFSWVPNWGNEGKRDT-NDTRAVLISGICERSVQIDFDENVTFQAIQHLYNIARSDDK 183 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH----HRLWAQAMP 237 PF + VSY PH P+ C E+ + Y + H + +A Sbjct: 184 RPFFLQVSYTHPHEPYLCRKEFWDLYEGVDVPMPAVDALSEQEHDPHSVRLLKDFAMLDV 243 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLIS 296 D Y+ ++D IG++++ L RENT +++ SDHGEM+G + Sbjct: 244 RFADGDIQRARRAYYGSISYIDSMIGQILDTLEAIGARENTAIVFASDHGEMLGERGMWF 303 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADI----EKPEILPGENI 352 K ++ R+PL++ +P + ++V VS +DLLPT+M LA + E L G+++ Sbjct: 304 KKH-FFEAALRVPLLLNAPWIKPQRVSETVSLVDLLPTLMGLATGRVWRSETEELEGQDL 362 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 + + E+ + +P+ +KL+ + + L+D DP E+ Sbjct: 363 TGFLDREDHEPSRAVFA-EYLAEATPVPIFMVRKGRYKLISSSHDGNLLFDLMADPKELQ 421 Query: 413 NLIDDIRFADVRSKMHDALLDYMDK 437 NL +A++ +++ + D D+ Sbjct: 422 NLAGHTDYAEIEARLLKIVADKWDE 446 >UniRef50_A6DNH1 Choline sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNH1_9BACT Length = 470 Score = 343 bits (880), Expect = 1e-92, Method: Composition-based stats. 
Identities = 101/452 (22%), Positives = 190/452 (42%), Gaps = 37/452 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L + D + +G G P T N+D LA G+ F + SPVC P+R + Sbjct: 23 EKPNVLLIAVDDL-NDWIGVLGGHPQAKTPNMDRLANRGVLFTNTQCQSPVCNPSRGSMM 81 Query: 61 TGIYANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 T +Y + +G + N + G K M + F+ GYH GK H+ Sbjct: 82 TSLYPSTTGIYFLNPSVGTSPKAKGHLVMPKRFEAEGYHVSAAGKLF---HNQENKKYFK 138 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + F + LW G+ D Q + R++ Sbjct: 139 EYGGSFGGFGPIPKKKITSFPGHPLWDWGVYPERDEQMPDVKIAAWGKERLAR------- 191 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH-RLWAQ 234 D+PF M + + PH P P ++ + Y ++ + ++D+ P++ L + Sbjct: 192 ---DYDQPFFMGIGFYRPHVPQFAPQKWFDMYPLESVQMPKMRKNDIEGIPQYGVDLTRE 248 Query: 235 AMPSPVGDDGLYHH------PLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGE 287 +P + + + Y AC FVD Q+G++++AL ++NT+V+ SDHG Sbjct: 249 KHVAPTYEWVIENKEEKKLVQSYLACVSFVDAQVGKILDALDASPHKDNTYVVLYSDHGF 308 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G + +K ++++D R+P++I P + P +D+ PT++ L ++ L Sbjct: 309 HLGEKERYAK-RSLWEDGARVPMMISGPGIKPGVTHKPTQLLDIYPTLLELTGLKSDPKL 367 Query: 348 PGENILAVK-EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 G +++ + P+ + R ++ V++ ++ + S+E YDR Sbjct: 368 EGNSLVPLLRNPQSDWPHYARTSFGPGNY-------AIVSERYRYIHYNDGSEEFYDRSK 420 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 D +E HN I + +A + +K + I Sbjct: 421 DTHEWHNQIKNPEYASIIAKHRKQVPQERAPI 452 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 343 bits (880), Expect = 1e-92, Method: Composition-based stats. 
Identities = 114/473 (24%), Positives = 185/473 (39%), Gaps = 85/473 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L + TD Q T V CY K L+T NID LA EG+ F+ Y +PVC+P+RA L T Sbjct: 26 KRPNVLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLT 85 Query: 62 GIYANQSGPWTN------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G Y ++ N + + TM FKD GY T +IGKWH+ + P Sbjct: 86 GRYPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG----YSPETMP 141 Query: 116 PEWDADYWFD--------GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 + DY F ++Y LWRNG ED + +A Sbjct: 142 NQQGFDYSFGFMGGCIDNYSHYFYWAGPNRHDLWRNGQEIWEDGK--------FFADLTV 193 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 FL++ RAD+PF + + + PH+P ++ + Y D Sbjct: 194 QEVNGFLEKNKRADKPFFLYWAINMPHYPLQGQEKWRQYYKDLPAP-------------- 239 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 +Y A +D++IG+V+ L ENT V++ SD G Sbjct: 240 --------------------RRMYAAAVSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQG 279 Query: 287 -------EMMGAH--KLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTM 335 G ++++ R+P IIR + D +ID PT+ Sbjct: 280 HSTEDRSFGGGGFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNEVRDQLCVNIDWYPTL 339 Query: 336 MALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 L + P+ + G++I V + + + ++KL+ N Sbjct: 340 AGLCKVALPQRKIDGKDIQQVITSSKTSSPHDIFFWQSQGTKENPQW-AVRQGNWKLLHN 398 Query: 395 L-------FTSDELY--DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 D+L+ + + D +E NL + ++ S + + L +++++ Sbjct: 399 PSSAKKAETGPDDLFLVNLQQDTSEAKNLAA--QHPEIVSSLKEQYLKWINEV 449 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 342 bits (879), Expect = 1e-92, Method: Composition-based stats. 
Identities = 115/509 (22%), Positives = 189/509 (37%), Gaps = 91/509 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF++ D + +GCY T NID A+ +RF SAY S VC+P+R+ L TG Sbjct: 25 KPNILFILADDMGWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMS-VCSPSRSTLMTG 83 Query: 63 IYANQSG--PWTNNVA-----------------PGKNISTMGRYFKDAGYHTCYIGKWHL 103 +A + W + T+ Y K AGY T IGKWHL Sbjct: 84 KHAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWHL 143 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSE--LTEKEISLWRNGLNSVEDLQANHIDETFT 161 +++ P D G N+ + +G + Sbjct: 144 GDWEHY-----PEAHGFDINIGGTNWGAPQTFWWPYSGSGTHGPEFRYIPHLEYGHPGEY 198 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 R+++ A+ + D+PF + +++ H P + ++ + Y Sbjct: 199 LTDRLTDEAIKVIDHA--GDQPFFVYLAHHAVHTPIEAKADDIQHFDAKY---------- 246 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVI 280 D + H +Y A N +D+ +GRV+ L + NT VI Sbjct: 247 -------------------RDGMNHRHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVI 287 Query: 281 YTSDHGEMMG-------------AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTP 325 + SD+G +G L S A+Y+ R+PLIIR P D P Sbjct: 288 FASDNGGYIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEP 347 Query: 326 VSHIDLLPTMMALADIEK-PEILPGENILAVK-EPRGVMVEFNRYEIEHDSFGGFIPVRC 383 V D+L T + + + G +I + +P + + + PV Sbjct: 348 VILTDMLQTFLHITGQPPATDATDGMDISPLLKDPSAKLNRDALFFHYPHYYHTTTPVSA 407 Query: 384 WVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 D+KL+ + ELY+ RND +E H+L ++ D + + D L + D + Sbjct: 408 IRARDWKLLEFYEDNHLELYNLRNDLSEKHDLAKEM--PDKAAALRDQLNAWRDSVG--- 462 Query: 443 RSYQWSLRPWRKDARPRWMGAFRPRPQDG 471 P + G +P+PQ+ Sbjct: 463 --------AVLPQPNPDFKGG-KPKPQNA 482 >UniRef50_D2R201 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R201_9PLAN Length = 511 Score = 342 bits (879), Expect = 1e-92, Method: Composition-based stats. 
Identities = 110/452 (24%), Positives = 187/452 (41%), Gaps = 30/452 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRP+ LF+ D Q + +G G P T ++D+LAA G +A+ +P+C P+R L Sbjct: 39 KRPHILFIAIDDQ-NDWIGHLGGHPYAKTPHLDALAARGTTLANAHCQAPLCNPSRTSLM 97 Query: 61 TGIYANQSG-----PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G+ +G PW + + ++ +Y + GY T GK + G G Sbjct: 98 FGLRPTSTGIYGLAPWIRTLPEFEKRVSLPQYLQQHGYRTLTTGKIYHGGL-----GPKK 152 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D W ++ +K I G + + D + ++I++ A++ L Sbjct: 153 RLEEFDVWGPAGGIGAKPEKKLIPPTPMGNHPLMDWGKFDHRDEDKGDYQITSWAIEQLD 212 Query: 176 Q--PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 A+ P + V Y PH P ++ ++ L A DD ++ P Sbjct: 213 DQVQHHAETPMFLSVGYFLPHVPCFISPKWYDEVPQGDKLLPLVAADDRSDIPRFAWYLH 272 Query: 234 QAMPSPVGDDGLYHHPL------YFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHG 286 ++P P H Y A FVD QIGR++ AL + ++T V+ DHG Sbjct: 273 WSLPEPRLKWVEDHRQWENLVRSYLASTTFVDAQIGRLLTALEERKLADDTIVVVWGDHG 332 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPE 345 +G + K +++ TR+PLI P +QV P +D+ PT++ LA + Sbjct: 333 WHLGEKGITGKN-TLWERSTRVPLIFAGPGITPKQVCGEPAELLDIFPTLVELAGLPPRN 391 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 L G ++ E+ + + ++ + S+ELYD + Sbjct: 392 DLEGHSLAPQLRDASRQREWPAITSHNQGN------HAIRSARYRYITYADGSEELYDMQ 445 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 +DP E+ NL D +A V + H L MD+ Sbjct: 446 SDPRELTNLASDSTWASVIAD-HRRWLPKMDR 476 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 342 bits (879), Expect = 1e-92, Method: Composition-based stats. 
Identities = 101/474 (21%), Positives = 166/474 (35%), Gaps = 75/474 (15%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + RPN + ++ D +GCY + T +D LAAEGI+ + Y+ + VC P+R L Sbjct: 71 VSRPNIVLIVADDLGYGELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCCLM 130 Query: 61 TGIYANQSGPWTNNV-------------------APGKNISTMGRYFKDAGYHTCYIGKW 101 TG + + N + T+ Y K GY T GKW Sbjct: 131 TGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFGKW 190 Query: 102 HLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 L +FGT P E D ++ LWRN + V+ + Sbjct: 191 GLG---HFGTTGDPNEQGFDLFYGFNCQRHAHNHYPNFLWRNRVKEVQPGNDRTLHGETY 247 Query: 162 WAHRISNRAVDFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 + N A +F++Q D+ PF + + PH P E ++ Y E Sbjct: 248 SQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLSIQVPEEEVDAYDGVIEEADY--- 304 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 EHH P Y A +D+ +G+V++ + ENT Sbjct: 305 -------EHHGYLKHPRP----------RAGYAAMVTRMDEGVGQVVDLVDSLGLGENTL 347 Query: 279 VIYTSDHG------------EMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDT 324 +++TSD+G A + + + R+P+I R R D Sbjct: 348 IMFTSDNGPTYDRLGGSDSDYFNSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDW 407 Query: 325 PVSHIDLLPTMMALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRC 383 + D LPT+ A +E G + L + + + E + F G+ + Sbjct: 408 IGAWWDFLPTITDAAGVEVDASTTDGISFLPLLHGDDAAQQSH--EFLYWEFPGYSGQQA 465 Query: 384 WVTDDFKLVLNLFTSDE-----------LYDRRNDPNEMHNLIDDIRFADVRSK 426 ++K + + LYD D E +++ DV +K Sbjct: 466 IRMGNWKAIRKDLSKRLKKGQTEPPAFALYDLSKDLAESNDVSA--SHPDVMAK 517 >UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF72_9BACT Length = 481 Score = 342 bits (879), Expect = 1e-92, Method: Composition-based stats. 
Identities = 101/471 (21%), Positives = 178/471 (37%), Gaps = 73/471 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D CY T N+D L+ G+RF AY+ VC+P R+ + TG Sbjct: 23 KPNVIMILVDDLGWTDTTCYGSDLYQTPNVDELSRTGMRFTDAYSACTVCSPTRSSIMTG 82 Query: 63 IYANQSG----------PWTNNVAPG------KNISTMGRYFKDAGYHTCYIGKWHLDGH 106 + P+ +P T+ FK GY T +IGKWHL Sbjct: 83 KNPANNNLTDWITGHVKPYAKLKSPNWKMHLTAEEITLAEAFKATGYKTVHIGKWHLGEE 142 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 + P D G S + + + + + R+ Sbjct: 143 ----SVSWPENQGFDENIAGFRAGSPSAHGGGGYF----SPYNNPRLKDGPKGEYLTERL 194 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 + A ++Q A+ +PF M + H P E ++KY + + Sbjct: 195 AQEASQYIQSTAKLKKPFFMNLWLYNVHTPLQARQEKIDKYTRLIQKGYQHT-------- 246 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 +P+Y A + +DD +G V+ A+ +NT +I+ SD+ Sbjct: 247 ---------------------NPVYAAMVEHMDDAVGTVMQAVKDAGIEDNTIIIFNSDN 285 Query: 286 GEMMG-----------AHKLISKGAAMYDDITRIPLIIRS--PQGERRQVDTPVSHIDLL 332 G + G + L S MY+ R+P+II+ + +PV D+ Sbjct: 286 GGLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKAGQTSSSPVISHDIY 345 Query: 333 PTMMALADIEKPE--ILPGENILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 PT++ L I+ + + G +++ + E + + + + H G P D+ Sbjct: 346 PTLLDLCKIDVSKKQDIDGISLVPELLEGKTIQRDALYWHYPHYHLEGAKPYSAIRKGDW 405 Query: 390 KLV-LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 KL+ L + ELY+ RND +E +NL + +++ L + KI Sbjct: 406 KLIFLYEESHAELYNLRNDISERNNLA--MTEKRKLAELMGDLRTWKKKIG 454 >UniRef50_A4A047 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A4A047_9PLAN Length = 481 Score = 342 bits (878), Expect = 2e-92, Method: Composition-based stats. 
Identities = 109/459 (23%), Positives = 186/459 (40%), Gaps = 34/459 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D T +GCY +++ NID LAA G F AY VC+P+R L T Sbjct: 18 RQPNVLFIAVDDLRTE-LGCYGASQIHSPNIDRLAAAGTVFTRAYCQQAVCSPSRTSLMT 76 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + + KN+ T+G++FK GY++ +GK + G+D T P Sbjct: 77 GLRPDSTKVYDLVTHFRKNVPDVVTLGQHFKQNGYYSVSMGKIYHGGYDDPPTWSEPARK 136 Query: 119 D-ADYWFDGANYLSELTEKEISLWRNGLNSVE--------DLQANHIDETFTWAHRISNR 169 + A L +T+K + GL V+ + + + +++ Sbjct: 137 PQGGAGYVLAENLQTITDKRNAARAKGLRGVQLSRAARGPATEMADVADNAYADGAVADL 196 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 AV L++ ++ DEPF + V + +PH PF P +Y + Y EL P Sbjct: 197 AVKSLRELSQRDEPFFLAVGFVKPHLPFNAPKKYWDMYDPAKIELAANPYPPKNVTPYSL 256 Query: 230 RLWAQAM--------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVI 280 W + + Y+AC + D +G++++ L + + T V+ Sbjct: 257 TSWGEMRVYDGIPKQGDLSPEKARELKHGYYACISYTDANVGKLLDELDKLKLTDETIVV 316 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMAL 338 DHG +G H K ++D PLIIR+P + V +D+ PT+ L Sbjct: 317 LWGDHGWKLGEHNSWCKHTN-FEDDANAPLIIRAPGQKSPGAKSTALVEFVDIYPTLCEL 375 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 A + P+ L G + + + + + I TD ++ Sbjct: 376 AALPLPQHLEGTSAAPLLDQPDAAWKTAAFSQYPRRQ---IMGYTMKTDRYRFTAWKNKK 432 Query: 399 ------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 ELYD + DP E N+ A++ ++ L Sbjct: 433 SGKVVATELYDHQVDPAENVNVAGLTENAELIVQLQKQL 471 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 342 bits (878), Expect = 2e-92, Method: Composition-based stats. 
Identities = 107/475 (22%), Positives = 177/475 (37%), Gaps = 87/475 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + D VG T +ID LA E ++F +AY+ +P C P+RA L + Sbjct: 28 QRPNIVLFFIDDLGWRDVGFMGSDFFETPHIDRLADESMKFTAAYSAAPNCAPSRACLMS 87 Query: 62 GIYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL 103 G+Y + G +T NN +T+ AGY +GKWHL Sbjct: 88 GLYTPRHGVYTVGDPARGNDRYRKLIPAENNRVLDDRFTTIADRLSQAGYRCASVGKWHL 147 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 G+ P G N S ++N Q + ++ Sbjct: 148 --------GQSPLSQGFQVNIAG-NQTGSPRGGYFSPYQNP-------QLSDGEQGEFLT 191 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R++ A F++ PF + +++ H P E + Sbjct: 192 DRLTTAACQFIKD--NQGSPFFLYLTHYAVHTPLQAKKEDIA------------------ 231 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYT 282 + Q+ P+ L+ H Y A +D IGRV+ L +Q + NT V++T Sbjct: 232 --------YFQSKPAG----KLHQHATYAAMIRSMDQSIGRVLQTLREQQLDQNTIVVFT 279 Query: 283 SDHGEMMGAH---KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMA 337 SD+G A L +Y+ R+PL+I+ P V ++DL PT + Sbjct: 280 SDNGGYGPATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTGEAVINVDLYPTFLE 339 Query: 338 LADIEK--PEILPGENILAV-------KEPRGVMVEFNRYEIEHDSF---GGFIPVRCWV 385 + +I E+L GE+++ + E R + F Y ++ PV Sbjct: 340 MTNIPVLESELLDGESLVPLLKDPQTRLESRSLFWHFPAYLQKYQGMQQRFRTTPVSVIR 399 Query: 386 TDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 D+KL+ ELY+ R D E L + ++ AL + +++ Sbjct: 400 QGDWKLLEFFEDGHQELYNTRLDIGESKELSG--SHPEKTQELSQALHRWQKQVK 452 >UniRef50_A3I0S5 Putative sulfatase yidJ n=1 Tax=Algoriphagus sp. PR1 RepID=A3I0S5_9SPHI Length = 491 Score = 342 bits (877), Expect = 2e-92, Method: Composition-based stats. 
Identities = 117/462 (25%), Positives = 200/462 (43%), Gaps = 41/462 (8%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +FV+ D VG + T N++ LA E + F +A T VC P RA TG Sbjct: 38 PNIVFVLADQWRAQEVGYAGNDQIITPNLNKLATESLIFENAVTTMAVCAPWRASFLTGQ 97 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH----DYFGTGECPPEWD 119 Y G + N+ T +K+AGY T YIGKWHL+GH D F + P D Sbjct: 98 YPLTHGVFYNDKPLPNEAYTFAEIYKEAGYQTGYIGKWHLNGHARGADPFSARDQPVPKD 157 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 FD +E++ N ++ H+ E + + ++ A+ ++ Sbjct: 158 RRQGFDY------WKVREVTHNYNNSFYFDEEDKKHVWEGYDVFPQ-TDSAISYI--SKN 208 Query: 180 ADEPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 ++PF++++SY PH P+ P EY + Y +L +P Sbjct: 209 KEKPFVLMLSYGPPHDPYFSAPKEYQDLYDAGTLKLRPN------------------VPE 250 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISK 297 D Y+A +D IG ++ + +NT ++TS+HG+M+ + ++ K Sbjct: 251 EYQDSARRVLAGYYAHATAIDKAIGDLLEGIEKAGVADNTIFVFTSEHGDMLMSRGVV-K 309 Query: 298 GAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL--- 353 +D+ ++P++IR P E R+V P+ D+LPT++ L+DI P+ + G++ Sbjct: 310 KQRPWDEAIKVPMLIRYPGKLESRRVLDPIGTPDILPTLLGLSDIPIPKSIEGKDFSKNL 369 Query: 354 ---AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 E ++ E G R T + V +L LYD NDP + Sbjct: 370 LSGKDLENDATLIMLPVPFHEWQFKNGGREYRGIRTRRYTYVKDLLGPWLLYDNENDPFQ 429 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 ++NL++ + ++ K+ +L + +I D F + W Sbjct: 430 LNNLVNQSEYNSLQKKLEKSLSKKLKEINDKFWPADEYMNQW 471 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 342 bits (877), Expect = 2e-92, Method: Composition-based stats. 
Identities = 111/460 (24%), Positives = 183/460 (39%), Gaps = 79/460 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D + VG K + T ++D LAA G+ F + Y P C+P+RAGL TG Sbjct: 44 RPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTG 103 Query: 63 IYANQSGPWTNNVA-----------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 + + G +N + +T+ K+AGY T IGKWHL F Sbjct: 104 RHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGDAKPF-- 161 Query: 112 GECPPEWDADYWF----DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P D WF G +Y +L K+ L + + D + S Sbjct: 162 --WPNRRGFDEWFGFSGGGFSYWGDLGMKDPLLGVHRGDEPVDPKT-----LTHLTDDFS 214 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 AV F+Q+ EPF + ++Y+ PH P +L+K A Y Sbjct: 215 TEAVKFIQR--HETEPFFLYLAYNAPHAPDHATRAHLQKTAHIEYGG------------- 259 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 +Y A +D+ IGRV++ + ENT +I+ SD+G Sbjct: 260 --------------------RAVYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNG 299 Query: 287 ---EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT--PVSHIDLLPTMMALADI 341 E +++ R+P ++ P R + P++ +DL PT +A A + Sbjct: 300 GRREHAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGM 359 Query: 342 EKP--EILPGENILAVKEPRGVMVE----FNRYEIEHDSFGGFIPVRCWVTDDFKLV-LN 394 + + L G+N+L V + F RY + DS+G ++KL+ Sbjct: 360 DPSQNDKLDGQNLLPVLTDDKQRLPERPLFWRYSMGDDSYG-----YAVRDGNWKLIDSR 414 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 L+D NDP E +L + + +++ + + Sbjct: 415 YKDRKLLFDLANDPWEREDLAA--QHPEQVARLSRMMEAW 452 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 341 bits (876), Expect = 3e-92, Method: Composition-based stats. 
Identities = 119/497 (23%), Positives = 181/497 (36%), Gaps = 66/497 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + VMTD Q VG + + L T N+D AAEG + Y SP+CTP R+ L TG Sbjct: 46 RPNVILVMTDDQGWAEVGFHGNEVLKTPNLDRFAAEGTELTNFYV-SPMCTPTRSSLMTG 104 Query: 63 IYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 Y ++G +T+ F AGY T GKWHL GE P Sbjct: 105 RYHFRTGAHDTYIGRSNMNPEETTIAEVFAGAGYRTGIFGKWHL--------GENFPMRA 156 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI--DETFTWAHRISNRAVDFLQQP 177 D F + + + LQ N + ++ F++ Sbjct: 157 EDQGFQKVVVHGGGGIGQFADYPGNTYWDPTLQYNDSFKKAKGYCTDVFIDESIQFMKDS 216 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF + + PH PF E+ Y + Sbjct: 217 --GEQPFFCYLPLNVPHSPFDVADEFRADYDNQNLADP---------------------- 252 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAH--KL 294 DG + D GR++ A+ QRENT +++ SD+G L Sbjct: 253 -----DGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMSDNGPNSTYFTAGL 307 Query: 295 ISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEIL--PGE 350 +K ++Y++ R P +I+ P+ R+ DTP HIDLLPT+ I P L G+ Sbjct: 308 RAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHIDLLPTLADACGIGLPADLQVDGK 367 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFKLVLNLF--TSDELYDRRN 406 +IL + + ++H+ +K+V + T ELY+ Sbjct: 368 SILGLLHGETQGFQQRYLFMQHNRANVPPKYENCMARRGPWKVVGDGGEPTGFELYNIEQ 427 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKI-----RDPFRSYQWSLRPWRKD----AR 457 DP E +L D + ++ + D + RD Y L P +K Sbjct: 428 DPGETRDLAD--KHPEIVKAFVREYEAWFDDVTTQLRRDNGVPYPTELNPEQKRDFRFTW 485 Query: 458 PRWMGAFRP-RPQDGYS 473 W G RP + Sbjct: 486 QDWWGDKTGWRPNNYGR 502 >UniRef50_A6DFB2 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFB2_9BACT Length = 474 Score = 341 bits (876), Expect = 3e-92, Method: Composition-based stats. 
Identities = 123/470 (26%), Positives = 202/470 (42%), Gaps = 66/470 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP-LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + + +F++ D +G G P T N+D+LA G+ F +A+ +P C P+R T Sbjct: 30 KKDIVFIIVDDL-NTWIGAMGGHPQTKTPNLDALATRGVLFTNAHCNAPQCGPSRKSFLT 88 Query: 62 GIYANQSGPWTN-----------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD 104 G+Y +G + N + P K +F Y GK Sbjct: 89 GLYPKSTGKYFNVAKKMPFFKDQPLKGATSKNPPKKPLDFHTHFMKNNYRVVSGGK---- 144 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 + D FD + T+K ++LW G +ID+T T + Sbjct: 145 ------VDHGSLKAKIDNKFDRPKEVKHFTDKRVNLWGEG-------GPQNIDDTMTGDY 191 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-DDLA 223 + + A+ Q ++D+P LM V + PH PF P EY +K+ +L + + DDLA Sbjct: 192 KTAQWAIK--QWNTKSDKPLLMSVGFYRPHRPFNVPKEYFDKFPLESIQLPKVPEFDDLA 249 Query: 224 NKPEHHRLWAQAMPSPV----------------GDDGLYHHPLYFACNDFVDDQIGRVIN 267 + PE+ + A++ D+ Y Y AC ++VD QIG + Sbjct: 250 DLPEYGKALARSNAHKNLFKPRTVHEHILHLGGEDEWKYMVQSYLACINYVDTQIGLFLE 309 Query: 268 ALTPEQREN-TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDT 324 L R N T +I TSDHG +G + K AA++ TR+P I+ +P Sbjct: 310 TLKNNPRGNDTVIILTSDHGWDLGEKEHWCK-AALWRTTTRVPYIVVAPGLTQAGTVNQQ 368 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCW 384 P+SH+D+ PT+ A I KP+ L G++IL + + E + S+G Sbjct: 369 PISHVDIYPTLCDFAGIAKPKHLEGQSILPLVKDSSAKRE-----AAYLSYG--PRNTAV 421 Query: 385 VTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 T+ ++ + S ELYD + DP E NL + +A++++KM + + Sbjct: 422 QTERYRYISYEDGSGELYDHQKDPREWTNLSSNPEYAELKAKMIQKVKKF 471 >UniRef50_A4AMS2 Choline sulfatase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AMS2_9FLAO Length = 503 Score = 341 bits (876), Expect = 3e-92, Method: Composition-based stats. 
Identities = 114/471 (24%), Positives = 191/471 (40%), Gaps = 38/471 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 K+PN + + D + K + T N+D L G F +AY VC +RA Sbjct: 26 KKPNIVLIFADDMTYTAINALGNKEIQTPNLDRLVKGGTTFKNAYNMGAWNGAVCVASRA 85 Query: 58 GLFTGIYANQSGPWTNNVAPGKNI-STMGRYFKDAGYHTCYIGKWHLDGHD---YFGTGE 113 + +G + + N GK T G+ + AGY T GKWH+D + Sbjct: 86 MMISGRSVWNANNFRQNWLEGKEFDKTWGKLMESAGYDTYMTGKWHVDAPADSVFQNVTH 145 Query: 114 CPPEWDADYWFDGANY--LSELTEKEIS------LWRNGLNSVEDLQANHIDETF----- 160 D W G ++E+ ++ S + N + D N +D+ F Sbjct: 146 VRRGMPWDSWGHGGKIPAINEMIKEGKSKKEIRAIGYNRPLNENDTTWNPVDKKFGGFWV 205 Query: 161 ---TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 W+ + + AV F+ Q D PF M ++++ PH P P EY++ Y+ L + Sbjct: 206 GGKHWSEVLKDDAVGFIDQAKVKDNPFFMYLAFNAPHDPRQAPQEYVDMYSLDKISLPKS 265 Query: 218 AQDDLANKP-----EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE 272 K R A A H Y+A +D+QIG +++AL Sbjct: 266 WMPMYPYKDSIGNGPGLRDEALAPFPRTEYATKKHIQEYYALISHMDNQIGEILDALENS 325 Query: 273 Q-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHID 330 ENT+VI+T+DHG +G H L+ K + +D R P +I P + +D + D Sbjct: 326 GKMENTYVIFTADHGLAIGKHGLLGKQSQ-FDHSIRPPFMIVGPDIPKDASIDKDIYLQD 384 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 + T + LA IEKP+ + +I + + + G R D +K Sbjct: 385 AMATSLDLAGIEKPDYVFFNSIKDLAKGERKESHYKEIY-----GGYTTTQRMIRKDGYK 439 Query: 391 LVLNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 L++ L++ DP E+++L ++ + + + LL D++ D Sbjct: 440 LIVYPKLKKVLLFNMETDPEEINDLSENPEYQGKINTLFKELLVLQDELND 490 >UniRef50_UPI00016BFE17 putative sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016BFE17 Length = 454 Score = 341 bits (876), Expect = 3e-92, Method: Composition-based stats. 
Identities = 113/461 (24%), Positives = 191/461 (41%), Gaps = 45/461 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN ++V+TD + +G + T N+D++A EG+ F +A +P C P R L TG Sbjct: 4 KPNVIWVLTDQMRASAMGFMGDANVRTPNLDNMAREGVAFVNAVAGTPWCCPFRGALLTG 63 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +Y +Q+G A +T+ F+ AGYHT Y+GKWHLDG + P Sbjct: 64 LYPHQNGVTQTPQALDPATATITAPFRTAGYHTAYVGKWHLDGSNSVTHYIPPERRGGFD 123 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD- 181 +F G Y + + E ++ N L ++N + LQQ D Sbjct: 124 YFMG--YENNNNQNESYVFGNDCERPTRLDGYE-------TDALTNIFIKHLQQHTSQDS 174 Query: 182 -EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 +PF V+S PH P+ P L + P +L P Sbjct: 175 YQPFFAVLSVQPPHDPYVPP-------------LYTGEGKIYYHNPADIKLRPNVPPGRW 221 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHKLISKGA 299 D+ Y++ + +D +G++ AL + T++I+ SDHG+ + +H SK + Sbjct: 222 SDEARMDLAGYYSMIENIDTNVGKLXMALKQMNIDRETYIIFMSDHGDCLNSHGQWSK-S 280 Query: 300 AMYDDITRIPLII-----RSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 + +++ RIP I+ + D ++HID+ PT + L I P+ + G + Sbjct: 281 SPWEEAIRIPFIVCRVGTNY-HMQSGIRDAVINHIDIAPTTLGLCGITPPQNMVGFDYSP 339 Query: 355 VK---------EPRGVMVEFNRYEIEHDSFGGFIPVRCWV----TDDFKLVLNLFTSDEL 401 + E +G + + + + + W D +K V L Sbjct: 340 LCIQKSQPEYHELQGAIPDSAYLQQIPRKYHAHTANKAWRAVVTRDGYKYVCYPHQDVML 399 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 ++ +DP EM NL D ++ K+H L Y+ D F Sbjct: 400 FNLNDDPYEMANLCHDSTSQVIKEKLHSLLQKYIVDTGDTF 440 >UniRef50_D2R203 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R203_9PLAN Length = 490 Score = 341 bits (875), Expect = 4e-92, Method: Composition-based stats. 
Identities = 110/453 (24%), Positives = 189/453 (41%), Gaps = 24/453 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCS----PVCTPARAG 58 +PN +F+ D + + + T N+D LA +G F+ AY VC +R+ Sbjct: 37 KPNVVFLFADDLSYEALAYAGNGQVKTPNLDRLAKQGTSFSHAYNMGSFSPAVCIASRSM 96 Query: 59 LFTGIYANQSG-PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L TG ++ KN+ R AGY T GKWH+ + Sbjct: 97 LVTGRSVWKAQTLHAAGGKEPKNVVLWPRQMHGAGYQTFITGKWHVPWNPMLAFDVTAHV 156 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + ++ S + + + W+ ++ AV+F Sbjct: 157 RG-----GMPKDVPSFYDRPHSDKPDTFDPANPGNGGYWQGGKHWSEVTADDAVEFFSAS 211 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK----PEHHRLWA 233 P M V+++ PH P P YL++Y E+ + Q + + Sbjct: 212 RDKSRPCFMYVAFNAPHDPRQAPQTYLDRYPTETIEVPKDFQPLYPERASIGADEKLRDE 271 Query: 234 QAMPSPVGDDGLY-HHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGA 291 + P P + + H Y+A +DDQIGR+++A+ + + T V++T+DHG G Sbjct: 272 KLAPFPRTEFAVRTHRREYYALITHLDDQIGRILDAIEQTKSDRPTMVMFTADHGLACGH 331 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 H L+ K MYD R+PLII + + +D PV D++PT + LA + + Sbjct: 332 HGLMGK-QNMYDHSIRVPLIIAGENIPQGKTIDVPVYLQDVMPTALELAGVAPGPEVHFH 390 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTSDELYDRRNDPN 409 ++L + + + Y + S+ RC V D FKLV+ + +L+D ++DP Sbjct: 391 SLLPIVRGEQKV---SNYPAIYSSYLNL--QRCVVKDGFKLVVYPALPAAKLFDLQHDPL 445 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 E+ +L D A + ++ DAL+ + I DP Sbjct: 446 ELSDLSADPNHATRKEQLFDALVAEAESISDPL 478 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 341 bits (875), Expect = 4e-92, Method: Composition-based stats. 
Identities = 102/463 (22%), Positives = 169/463 (36%), Gaps = 57/463 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++ D +GCY + T NID LA +G++F S Y S VC P+RA L T Sbjct: 41 RRPNIILILADDLGYGDLGCYGQTQIKTPNIDKLAEDGMKFTSFYAGSTVCAPSRATLMT 100 Query: 62 GIYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G N +++ T+ + K AGY T IGKW L G+ P Sbjct: 101 GKNTGHVNIRGNADLSLNGEELTIAKILKLAGYATGCIGKWGLGNE---GSPGLPGRQGF 157 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDL----QANHIDETFTWAHRISNRAVDFLQ- 175 D + + + L+R+ E + + + + A+++L+ Sbjct: 158 DEYLGYLDQVQAHDYYPTHLFRSDSKGEESKIALTENDADHKGLYSNDFFTQSALNYLRI 217 Query: 176 ---QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 F + + Y PH + + + +P + W Sbjct: 218 NKPSKLNKHRSFFLYLPYTLPHA------------NNELGNRTGNGMEVPSTEPYTNEQW 265 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG----- 286 Q + A +D +G +++ L + + NT VI+ SD+G Sbjct: 266 PQVEKNK------------AAMITRLDHYVGEIMDYLKKSKLDENTVVIFASDNGPHKEG 313 Query: 287 -----EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALA 339 A L +Y+ R+P I+R P D P++ D LPT +A Sbjct: 314 GVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLAFWDFLPTAAEIA 373 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTS 398 P + G + L + E G VR D+K V + Sbjct: 374 RTSSPTNIDGISFLPTLLGKAQTNRHQYLYWEFHEQGFDQAVRM---GDWKAVRHGINGP 430 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ELY+ + D +E N+ D + +V +K+ D L + DP Sbjct: 431 IELYNLKTDVSEKDNVAD--KNPEVMAKIADYLKK--ARTDDP 469 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 340 bits (873), Expect = 7e-92, Method: Composition-based stats. 
Identities = 100/488 (20%), Positives = 182/488 (37%), Gaps = 93/488 (19%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN + ++ D +GC + T +IDSLA+ GIRF AY +P C+P+RAGL T Sbjct: 24 EQPNIIVLLADDLGYGELGCQGNPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLT 83 Query: 62 GIYANQSGPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G + G N + T+ D GY TC IGKWHL G + Sbjct: 84 GRIPTRFGYEFNPIGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGGTADYHPF 143 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWR---------------------NGLNSVEDL 151 + + +G ++ ++ R + D Sbjct: 144 RHGFDEFFGFMHEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHMGYDEPDY 203 Query: 152 QAN--------HIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEY 203 AN ++ET + AV F+ + D+PF + ++Y+ H P + Sbjct: 204 DANNPIIRGGQPVNETEYLTDAFTREAVSFINR--HQDKPFFLYLAYNAVHSPLQGKKKD 261 Query: 204 LEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIG 263 ++ + H ++ A +D IG Sbjct: 262 IQHF---------------------------------TQIEDIHRQIFAAMLSSMDQSIG 288 Query: 264 RVINALTPEQRE-NTWVIYTSDHGE-----MMGAHKLISKGAAMYDDITRIPLIIRSPQ- 316 +++ + + T +++ SD+G L + +MY+ R+P ++R Sbjct: 289 KILKQVQQSGLDEKTLIVFLSDNGGPTRELTSSNLPLRGEKGSMYEGGLRVPFLMRWTGT 348 Query: 317 -GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF 375 ++ +D PVS +D+ PT +ALA P+ L G N+L + + + + F Sbjct: 349 LAPKQTIDVPVSSLDIFPTSVALAGASLPQNLDGRNLLPLLLQQKTELPVADF------F 402 Query: 376 GGFIPVRCWVTDDFKLVLNLFTS----DELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + D+K+V T ELY+ ND +E +L + + R ++ Sbjct: 403 WRQGRKAALRSGDWKIVQMRGTREKPVWELYNLANDKSETIDLATEQS--EKRMELQTRW 460 Query: 432 LDYMDKIR 439 + +++ Sbjct: 461 NELNAQMK 468 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 340 bits (872), Expect = 9e-92, Method: Composition-based stats. 
Identities = 108/462 (23%), Positives = 190/462 (41%), Gaps = 51/462 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F++ D + CY + T NID LA G+RF Y+ + V P+R+ L TG Sbjct: 36 RPNVVFILADDLGYGDLSCYGQEKFETPNIDRLAQNGMRFTQCYSGTTVSAPSRSCLITG 95 Query: 63 IYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 ++ + N +N T+ F++AGY T GKW L Y G+ P Sbjct: 96 THSGHTAIRGNKELAPEGQFPLPENSQTIFNDFRNAGYRTGAFGKWGLG---YIGSAGDP 152 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF----TWAHRISNRAV 171 + D ++ L + LW N + DL N+++ + I ++A+ Sbjct: 153 YKQGIDQFYGYNCQLLAHSYYPDHLWDN--DKRVDLPDNNLNVQYGKGTYSQDLIHSKAL 210 Query: 172 DFLQQPAR-ADEPFLMVVSYDEPHHPFTCPVEY-LEKYADFYYELGEKAQDDLANKPEHH 229 FL + A+ D+PF M PH P + ++K+ Y E + + + Sbjct: 211 AFLDEAAKEKDQPFFMWYPTIIPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKG 270 Query: 230 RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEM 288 Q P H + A +D +G+++ L +NT +I++SD+G Sbjct: 271 GYCTQFYP----------HATFAAMVYRLDVYVGQIVQKLKDMGVYDNTIIIFSSDNGPH 320 Query: 289 M--GAHK--------LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMM 336 M GA +Y+ R+P+II P + D S DL+PT Sbjct: 321 MEGGADPDFFNSNGIWRGYKRDVYEGGIRVPMIISWPGHVQPSTETDFMCSFWDLMPTFR 380 Query: 337 ALADIEKP-EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-N 394 + + + + G +IL + + R E E G + D+KLV N Sbjct: 381 EVLNPKADTRNMDGVSILPLLQNRKGQKEHEYLYFEFLEMNG---RQAVRKGDWKLVHMN 437 Query: 395 LFTSD---ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + + ELY+ +DP+E +N+++ ++ + ++ + + Sbjct: 438 IRGNKPYYELYNLASDPSEKYNVLN--QYPEKADELKAIMKE 477 >UniRef50_A6L183 Iduronate 2-sulfatase n=11 Tax=Bacteroides RepID=A6L183_BACV8 Length = 477 Score = 339 bits (870), Expect = 1e-91, Method: Composition-based stats. 
Identities = 112/449 (24%), Positives = 190/449 (42%), Gaps = 21/449 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N LF+M D +GCY K + T NID AA G+ F +AY PV +RA L T Sbjct: 26 EKMNVLFLMADDMRPE-LGCYGVKEVKTPNIDRFAASGLLFQNAYCNIPVSGASRASLLT 84 Query: 62 GIYANQSGPWTNNVA-PGKNIST---MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G+Y + + N A K+ T + R+F GY+T GK D+ + PP Sbjct: 85 GVYPHYPDRFVNYSAYASKDCPTAIPISRWFTSHGYYTISNGKVFHHLSDHANSWSEPPY 144 Query: 118 WDAD-----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 YW + + + E + + +T +++ +A+ Sbjct: 145 RKHPDGYDVYWAEYNKWELWMNEASARTINPKTMRGPFCEWAEVPDTAYDDGKLALKAIA 204 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK--AQDDLANKPEHHR 230 L++ +PF M + +PH PF P +Y + Y + DL N+ ++ Sbjct: 205 DLKRLKEQGKPFFMACGFWKPHLPFNAPKKYWDLYDREKIPVANNRFRPKDLPNEVKNST 264 Query: 231 LWAQAMPSPVGDDGLYHHPL---YFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 + DD + Y+AC +VD QIG+V++AL NT V+ DHG Sbjct: 265 EIYAYARTTTADDISFQKEAKHGYYACLSYVDAQIGKVLDALDELGLANNTIVVLLGDHG 324 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPE- 345 +G H + K M D T +PLI+R P ++ + + V +DL PT+ L + P+ Sbjct: 325 WHLGEHNFLGKHNLM-DRSTHVPLIVRVPGLKKGKTKSMVEFVDLYPTLCELCHLPIPKN 383 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 L G + + + ++ Y I+ + + R + ++K S L+D Sbjct: 384 QLDGTSFVPILTNLKAKIKDQVY-IQWEGGDNTVSNR-YNYAEWK-QKEKIHSRMLFDHH 440 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDY 434 DP E N +++ ++ +K+ L Sbjct: 441 IDPEENKNRVNERKYRSEINKLSSFLKAK 469 >UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVD2_9BACT Length = 631 Score = 339 bits (870), Expect = 1e-91, Method: Composition-based stats. 
Identities = 106/472 (22%), Positives = 173/472 (36%), Gaps = 103/472 (21%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D N + CY K T N+D LA EG+RF AY SP+C+ +RA + TG Sbjct: 33 KPNIVFILCDDLGVNDLSCYGRKDQQTPNLDRLAGEGMRFTCAYCASPICSASRAAIMTG 92 Query: 63 IYANQSGPWTNNVAPGKNIS------------------TMGRYFKDAGYHTCYIGKWHLD 104 + TN + + T+ + AGY + IGKWHL Sbjct: 93 KAPGRVHI-TNFLPGRADAPSQKFIQPEIEGQLPLEENTIAKALHGAGYVSACIGKWHLG 151 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 G G P DY F G E + Sbjct: 152 GK-----GFLPTNQGFDYAFAGHANTKP----------------------SATEGGKGEY 184 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++ A +L++ D PF + ++++ PH P E +EK+ D Sbjct: 185 ELTAEAERWLEK--NKDHPFFLYLAHNSPHVPLAAKPELIEKHKD--------------- 227 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 +P+Y A + +DD +GR++ + E T I+TS Sbjct: 228 ---------------------AWNPIYAAMIESLDDCVGRIMKKVDELGLTEKTIFIFTS 266 Query: 284 DHGEMM----------GAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDL 331 D+G + + + + R PLI+R P + +TPV D Sbjct: 267 DNGGLHVYELPNTPSTYNAPFRAGKGYLEEGGLREPLIVRWPGKIKAGATNETPVVLYDF 326 Query: 332 LPTMMALADIEKPEI---LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 +PT+M A ++ L G NIL + + + + + G P + Sbjct: 327 MPTLMTAAGLDVAHTVGPLDGVNILPLLTGGTIPPRTLYWHFPNYTNQGSKPAGAIRDGE 386 Query: 389 FKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +KL+ + T + ELY+ DP E ++L S++ L + I Sbjct: 387 WKLIQDDETGNLELYNIAADPGEKNDLAKSQS--ARVSELQGKLAAWRKSIG 436 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 339 bits (870), Expect = 1e-91, Method: Composition-based stats. 
Identities = 107/457 (23%), Positives = 178/457 (38%), Gaps = 61/457 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN + VMTD Q + T I LAAEG+ F Y VC P RAGL T Sbjct: 33 RQPNIILVMTDDQGYWDTEISGNPKIKTPTIKKLAAEGVTFTRFYAN-MVCAPTRAGLMT 91 Query: 62 GIYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G + ++G + G N +T+ + + AGY T GKWHL + + P Sbjct: 92 GRHYLRTGLYNTRFGGDTLGPNETTIAQVLQKAGYKTGLFGKWHLGRYAQY----QPQRR 147 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D++F + E + NG ++ ++ A+DF+Q+ Sbjct: 148 GFDHFFGHYHGHIERYTNPDQVVVNG---------TPVETRGYVTDLFTDAAIDFIQR-- 196 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 +PF ++Y+ PH PF + +PE +L + + Sbjct: 197 NQQQPFFCYLAYNAPHSPFLLDTSHF-------------------GQPEGDKLIEKYLAK 237 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAH--KLI 295 + +A + +D + R++ + + + T VI+TSD+G + L Sbjct: 238 GLP----LREARIYAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSDNGGVSRGFKAGLK 293 Query: 296 SKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPE--ILPGEN 351 A+ Y+ TR+P ++R + D V+ DL PT LA + P L GE+ Sbjct: 294 GSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLFPTFCQLAGVPVPSNVKLDGES 353 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD-DFKLV---------LNLFTSDEL 401 IL++ E G D + R + FKLV +L Sbjct: 354 ILSLMEQGGGKSPHQYLYHTWDRYTPNPYHRWAIHGPRFKLVGHDPQGKKKKEGEPQGQL 413 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 YD + DP E N+ D ++ + S++ L + + Sbjct: 414 YDLQEDPGEKKNVAD--QYPEKVSELRGEFLRWFQDV 448 >UniRef50_B9XJI6 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XJI6_9BACT Length = 490 Score = 339 bits (869), Expect = 2e-91, Method: Composition-based stats. 
Identities = 100/442 (22%), Positives = 186/442 (42%), Gaps = 29/442 (6%) Query: 3 RPNFLFVMTDTQATNMVGCY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN L ++ D N +G + T N+D LAA G+ F +A +P+C P+RA + Sbjct: 30 KPNILLIIADDL-NNWIGPNKGNPQVKTPNLDKLAARGVTFQNAQASAPLCNPSRASFMS 88 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE 117 G + +G + N ++ + Y + GY + GK +H + + Sbjct: 89 GQRPSTTGIYDNQQPAMPHLPRGVCLNDYVRKFGYTSLGAGKIYHYHQYRE-------ED 141 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF-LQQ 176 WD ++ + ++ + + ED +E + + R+V + + Q Sbjct: 142 WDKVVFYADDTLPNHPAKRRPGPFGYRM-FTEDKPDAEFNEQRAESELVDARSVSWCIDQ 200 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ-- 234 + F M PH P+ P +Y + Y +L +DLA+ P +A+ Sbjct: 201 LGQQQGAFFMACGVHRPHVPWDVPKKYFDMYPLESVKLPPILTNDLADVPPAGIAFAKPN 260 Query: 235 AMPSPVGDDGLY--HHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 + + G++ Y A F D QIGR+++AL + R+NT +I+ D+G +G Sbjct: 261 GVHQAILKAGVWQDRVRAYLAAISFADAQIGRLLDALDKSKYRDNTIIIFVGDNGWHLGE 320 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADIEKPEILPG 349 + +K +A++ T +PLI +P + D V + PT+ LA I P+ G Sbjct: 321 KEHWAK-SALWRRATNVPLIWVAPGVAKPGTECDRAVDLTSIFPTVCELAGIPTPKHAEG 379 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +I + + + + TD+++ + S+ELYD++ DPN Sbjct: 380 ISIKPLLKNPSAKWKQPAVTTFLQNN------HAICTDEWRYIRYADGSEELYDQKADPN 433 Query: 410 EMHNLIDDIRFADVRSKMHDAL 431 E N A ++ ++ +L Sbjct: 434 EWTNQAAKPELAKIKMELAKSL 455 >UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7_9SPHI Length = 589 Score = 339 bits (869), Expect = 2e-91, Method: Composition-based stats. 
Identities = 101/454 (22%), Positives = 189/454 (41%), Gaps = 36/454 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN + ++TD Q G K ++T ID LA F + Y SPVC P RA L T Sbjct: 30 KPPNIILIITDDQGYGDFGFTGNKHVSTPTIDQLAENSFEFTNFYV-SPVCAPTRASLMT 88 Query: 62 GIYANQSGP---WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y+ ++G + + T+ + + Y + GKWHL + + + Sbjct: 89 GRYSLRTGIRDTYNGGAMMSPDEITIAELLQKSDYTSGIFGKWHLGDNYPMRPSDQGFDE 148 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 + G + + T R+ + V + ++ A++F+++ Sbjct: 149 SLIHLSGGMGQVGDFTTY-FQKDRSYFDPVLWHNNRQESYQGYCSDIFASAAIEFIEK-- 205 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D+PF +S++ PH P P EY +KY + G ++ + +P + Sbjct: 206 NKDQPFFTYLSFNAPHTPLQVPEEYYQKYKNIDTSTGYESDE----RPFY---------- 251 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAH--KLI 295 P+ D +A + +DD + + L + E+ T +I+ +D+G + L Sbjct: 252 PMSDSQKEEARKVYAMVENIDDNLKNLFAKLKELEIEDETIIIFLTDNGPQQQRYLAGLR 311 Query: 296 SKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKP--EILPGEN 351 +Y R PL+I P+ E R+++T +HID+LPT+ L I+ P + G++ Sbjct: 312 GLKGNVYQGGIRTPLLIHIPEKLSENRKINTLSAHIDILPTIADLVGIQLPLDRKIDGKS 371 Query: 352 ILAVKEPRGVMVE-FNRYEIEHDSFGGFIPVRCWVTDDFKLV----LNLFTSD-ELYDRR 405 +L + E + + + F ++KLV + D +LY+ + Sbjct: 372 LLPLLIGEVDSFENRSLFSYWNRKFPEKYSNISIQNSEWKLVGKTDYDASIEDFQLYNLK 431 Query: 406 NDPNEMHNLIDDIRFA--DVRSKMHDALLDYMDK 437 DP E NLI ++++++ L+ + + Sbjct: 432 EDPYEQSNLITSKISKGLELKNELDQLYLELISE 465 >UniRef50_UPI0001746164 choline-sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746164 Length = 469 Score = 338 bits (868), Expect = 2e-91, Method: Composition-based stats. 
Identities = 121/474 (25%), Positives = 188/474 (39%), Gaps = 33/474 (6%) Query: 10 MTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQSG 69 MTD +++GC K T ++D+LA G+ F++ Y SP+CTP+R TG Y + Sbjct: 1 MTDEHNASVMGCAGDKVARTPHLDALAERGVLFDAHYCASPICTPSRQSFTTGKYVSGHR 60 Query: 70 PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG------------TGECPPE 117 W+N + ++ R AGY + GK H G G G+ P Sbjct: 61 VWSNTPGVPEGTPSLARILNAAGYDSYLNGKMHYKGGMTHGYQIISEKDGRITPGKEPGA 120 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 A + L + L + H D + A+ FL++ Sbjct: 121 ERAARGSNAIKPRQRLAAGRFEDRGDELGEEFEHAGEHADMDEFVDVVRRDHAIKFLKER 180 Query: 178 -ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 A ++PF + + + PH+P P EYLE + D E + P ++R Sbjct: 181 GADNNKPFFLTIGFIAPHYPLVAPPEYLEHFRD-KVPFPEVPPGYVDTLPLNYRHLRNDR 239 Query: 237 PSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK 293 L L Y+A +++DDQIG V+ AL + ENT VIYTSDHGE +G H Sbjct: 240 KFERVPPALAKRALEGYYARVEWIDDQIGMVLEALKNSRFAENTVVIYTSDHGENLGEHG 299 Query: 294 LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L K M+D R+PLI+ P + +DL+ T+ A+ + P G + Sbjct: 300 LWWKN-CMFDSGARVPLIVSWPSRWKGGQHRTGACGVLDLVQTIAAIGGAQVPSDWKGVS 358 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN------LFTSDELYDRR 405 ++ + + S+ D+K V + ELY+ R Sbjct: 359 MIPWLDDHSAPWRDLAVSEYYASY-VASGFAMIRQGDWKYVYHTRADELHGPERELYNLR 417 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR 459 DP E+HNL MH A+++ + DP + + WR P Sbjct: 418 EDPRELHNLAGKEENMPRMEAMHKAMVEETGE--DP----EVTEARWRAGETPE 465 >UniRef50_A6UE90 Sulfatase n=1 Tax=Sinorhizobium medicae WSM419 RepID=A6UE90_SINMW Length = 489 Score = 338 bits (868), Expect = 3e-91, Method: Composition-based stats. 
Identities = 111/465 (23%), Positives = 191/465 (41%), Gaps = 40/465 (8%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN LF+ +D A + GCY + T NID LA EG+RF++AY SP+CTP+R + Sbjct: 1 MNKPNVLFIFSDQHAQKVAGCYGDDVVRTPNIDRLAQEGVRFDNAYCPSPICTPSRMSML 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGEC 114 T + ++ WTN+ ++ T +AGY IG+ H L G+ G G+ Sbjct: 61 TARWPHRQECWTNDDMLRSDVPTWLHRAGEAGYRPALIGRMHSIGPDQLHGYAERGIGDH 120 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P + F +SL ++G + + + + A +L Sbjct: 121 TPNFAGIARFPMGVLEGTNEPDSVSLTQSGAGMAIYQRKDQ---------DVVDAAAAWL 171 Query: 175 QQ----PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 + A + F + V PH P+ E + Y ++ D ++ + HR Sbjct: 172 RDKGAARNAAGQQFCLTVGLMTPHAPYVVDREAFDHYHG---QVPPPRLDVPQDEHDWHR 228 Query: 231 LWAQAMPSPVGDDGLYH--HPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE 287 W D + Y+ D+ IG+V++AL ++T ++Y SDHG+ Sbjct: 229 WWRHDRGIGEVSDAVRDRARAAYWGLVQRTDEMIGQVLDALKEIGAMDDTLIVYASDHGD 288 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADIEKPE 345 +G L K +++ + PL++R P D V+ +DL TM+ + + Sbjct: 289 HVGERGLWWKH-TFFEESVKFPLVMRLPGAIPAGESRDQVVNLVDLSQTMIEVMGAQPLP 347 Query: 346 ILPGENILAVKEPRGVMVE---FNRY---EIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 G++ AV R E F+ Y + + G + R + +KL + Sbjct: 348 YADGKSFWAVACDREAPWENETFSEYCTDPVPSWTGGRAVQQRMIRSGSWKLSVYDGEPP 407 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 L+D DP+E N +D A++ ++ L D +R Sbjct: 408 LLFDLSTDPDERINRAEDPDCAEMFQRLSARLA------HDGWRP 446 >UniRef50_A6E5R0 Putative sulfatase n=1 Tax=Roseovarius sp. TM1035 RepID=A6E5R0_9RHOB Length = 453 Score = 338 bits (868), Expect = 3e-91, Method: Composition-based stats. 
Identities = 114/458 (24%), Positives = 186/458 (40%), Gaps = 46/458 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + + D + C + T+++D LAA G RF+ A+ SP+C P+R TG Sbjct: 2 QPNIVIINPDQMRWDYASCQGHPFIATRHLDRLAAMGTRFSHAFAASPMCGPSRTSFLTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y + G + R DAGY W D+ Sbjct: 62 KYPIEHGIRQYGGTYDQAQPNALRVLGDAGYVRGI--------------------WGKDH 101 Query: 123 WFDGANYLSELTEKE---ISLWRNGLNSVEDLQANHIDETFTW--AHRISNRAVDFLQQP 177 F G S E E I + + + + +D W R+++ + F+ + Sbjct: 102 TFKGNVIGSLYDEGEDICIGIMGGHPDYINAWDSTSLDVGSKWNLTKRLTDEGLAFIHRQ 161 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ----DDLANKPEHHRLWA 233 AR +PF + ++Y +PH F CP Y + +EL + D + + R+ + Sbjct: 162 ARTSQPFFLTLNYQDPHPFFACPEPYSSLFHPDQFELSPNYRKAPVDGEITRLTNWRIHS 221 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAH 292 + PV + +Y +VDDQ+GR++N L + ENT V++ SDHGE +G Sbjct: 222 NEINMPVAE-LKQAMAIYAGQIRYVDDQVGRILNELEALDLLENTIVLFWSDHGEFIGDF 280 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 + K A Y+ + R P++I P G R V +D + T++ L +++PE Sbjct: 281 GVTHKIPAFYECLIRAPMVIWDPTGRVPRGVCSGLVELMDGMATVLDLCGLKQPEGSHAR 340 Query: 351 NILAVKEPR-------GVMVEFNRYEIEHDSFGG------FIPVRCWVTDDFKLVLNLFT 397 ++ R G++V I G F P T+D+KL L Sbjct: 341 SMAGTVVGRRDVYADSGMLVRQPLEPISGHVIKGAMPPTAFGPGSMLRTEDWKLCLYGED 400 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 EL+D R D E NL R A ++ ++ L M Sbjct: 401 KGELFDLRRDRYETTNLFGAPRHAQIQDELMLRLTQRM 438 >UniRef50_D1AX15 Sulfatase n=2 Tax=Fusobacteriaceae RepID=D1AX15_STRM9 Length = 491 Score = 338 bits (868), Expect = 3e-91, Method: Composition-based stats. 
Identities = 113/456 (24%), Positives = 186/456 (40%), Gaps = 49/456 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ N LF++ D +GCY K T NID LA +G F + + SPVC+PARA +F Sbjct: 1 MKKNNILFIIADDLGAWALGCYGNKDAITPNIDMLAEKGKIFENFFCVSPVCSPARASIF 60 Query: 61 TGIYANQSGP------WTNNVA---PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 TG +Q G W N K ST Y C GKWH+ D Sbjct: 61 TGRIPSQHGIHDWLDEWENGTTTEDYLKGQSTFVDVLSKNNYICCMSGKWHMGLAD---- 116 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P+ YW+ + I E +I+ A+ Sbjct: 117 ---VPQKGFHYWYSHQKGGGPYYMAPMY-----------KDGKLIHEEEYITDKITEYAI 162 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 DFL + D+PF + V+Y PH P+ + + + +L E + + +H Sbjct: 163 DFLDDVYKEDKPFFLNVNYTAPHSPWD-----KKNHKEEILKLYEGCKFKSCPRDPYHPW 217 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMG 290 ++ + YFA +D IG +I L + +NT +I+TSD+G MG Sbjct: 218 KISETFEGNEEERIQILKGYFAALTSMDFGIGEIIKKLEEKDMLKNTLIIFTSDNGMNMG 277 Query: 291 AHKLISKGA-----AMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADIE-- 342 H + KG MYD ++P II + E +V+ +SH D+ T++ ++ Sbjct: 278 HHGIFGKGNGTSPLNMYDSSVKVPFIIYKKDETEAEKVNNLLSHYDVRSTLLEYLGLDDV 337 Query: 343 KPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFTSD 399 K E + G + + + ++ ++ + +D +G P R + +K V Sbjct: 338 KDENIDYPGNSFSEILNNKK--IDDDKNVVIYDEYG---PTRMIRNEKYKYVHRYPDGPH 392 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 E Y+ D E N I++ +++ + +M L + Sbjct: 393 EFYNLIEDVEEKVNEINNEKYSKIIDQMRKDLEIWF 428 >UniRef50_C7MF96 Arylsulfatase A family protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MF96_BRAFD Length = 483 Score = 338 bits (868), Expect = 3e-91, Method: Composition-based stats. 
Identities = 110/487 (22%), Positives = 178/487 (36%), Gaps = 67/487 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M RPN +F+M+D A + + Y + T ++D LA EG R ++ Y + +CTP+RA + Sbjct: 1 MTRPNIVFIMSDDHAAHSISAYGSRVNTTPHMDRLADEGARMDATYCTNAICTPSRASIL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G Y++ + + + T ++ GY T GKWHL + P +D Sbjct: 61 SGTYSHINRAPSIYSEFDYRVRTFPEVLQECGYQTALYGKWHLGRSER----SLPRGFDD 116 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + ++ V A ++ +++DF+ + Sbjct: 117 FRIYPDQGDY--------------IDPVMIGPAGEEQIPGYATDIVTRQSLDFIDRR-DP 161 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA------- 233 ++PF ++V + PH P+ Y Y E DD A + E R A Sbjct: 162 EQPFCLLVHHKAPHRPWIPHPRYEHLYEAGTIPEPETMWDDHATRSEVVREVAMNLDDLR 221 Query: 234 -----------------QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-E 275 +A + + Y C VDD IG +++ L E E Sbjct: 222 PTDYKDELPAELEGETEEARRARASWKYQRYMRDYLRCVQAVDDSIGEILDHLDQEGLGE 281 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLP 333 NT V+YTSD G +G H K M D+ +P+++R P +V VS++D Sbjct: 282 NTLVVYTSDQGFFLGDHGWFDK-RLMLDESLTMPMLLRWPAQIPAGSRVSDIVSNVDFAA 340 Query: 334 TMMALADIEKPEILP--GENILAVKE----PRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 T++ A ++ G + L P + RY D T Sbjct: 341 TLLEAAGRSASDLPDQQGRSFLPQLRGEEVPDWRQAVYYRYWEHDDPEHHAPAHYGVRTP 400 Query: 388 DFKLVLNLFT--------------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 K + DELYD DP EM N+ D +A V +M Sbjct: 401 THKYIHYYNDGLGSPGSSTRIMPAEDELYDLATDPQEMRNVAHDPAYAGVLEEMKALTAQ 460 Query: 434 YMDKIRD 440 + D Sbjct: 461 LQAEYGD 467 >UniRef50_D0Z4S7 Iduronate sulfatase n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z4S7_LISDA Length = 539 Score = 338 bits (867), Expect = 3e-91, Method: Composition-based stats. 
Identities = 115/481 (23%), Positives = 193/481 (40%), Gaps = 63/481 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + PN LF+ D + +G + T N+D L F +A+ PVC P+R L Sbjct: 33 QHPNVLFLAVDDL-NDWIGALGAHPQVKTPNLDRLYKRSTAFRNAHCQVPVCGPSRTALL 91 Query: 61 TGIYANQSGPWTNNVAPGK-----------NISTMGRYFKDAGYHTCYIGKW-HLDGHDY 108 TG+ +G +TN K + + ++FK+ GY+T GK H DY Sbjct: 92 TGMAPTTTGLYTNKELGIKPFDPVAEQVLGSTPVLPQHFKNNGYYTMASGKISHHGTADY 151 Query: 109 FGTGECPPEWD-----------ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID 157 + E + +Y + K G ++ + Sbjct: 152 RHKEQWDEEIPLYVIGPRDEHLKANGYGYGSYGVD-DHKYYPFPVGGGQIIQSQEYGPGT 210 Query: 158 ETFTW-----------------AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCP 200 F+ ++ V+ LQ+ ++PF + + PH P+T P Sbjct: 211 RGFSLCSGALDRHDIPNGGVMPDEYFADWTVERLQR--HYEKPFFLACGFIRPHVPYTAP 268 Query: 201 VEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHH--------PLYF 252 EY + + + E + ++ + P + A + P GD + Y Sbjct: 269 REYFDMFPLESIIVPETIEKEMTDIPLMGKALALGI-IPGGDAAAVNKLGIRKELVQAYL 327 Query: 253 ACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLI 311 AC F+D Q+G+V++AL NT V++ DHG+ G H + + ++ + TR+PL+ Sbjct: 328 ACIAFMDAQVGKVLDALEKSPYANNTIVMFWGDHGQNFGEH-MNYRKQTLWQESTRVPLM 386 Query: 312 IRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEI 370 IR PQ E+ QV D VS +DL PT++ L + K G ++ + F+R Sbjct: 387 IRLPQQEKGQVCDEAVSLLDLYPTLIELCHLPKVATNEGISLKPLLN----NPRFDRKIP 442 Query: 371 EHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 ++G + + + S+ELYDR DPNE HNL D + ++ M Sbjct: 443 AVTTYG--YQCHAIRDEQYTYIRYRDGSEELYDRNLDPNEHHNLASDPNYQVIKQAMKQW 500 Query: 431 L 431 L Sbjct: 501 L 501 >UniRef50_A6DR18 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DR18_9BACT Length = 543 Score = 337 bits (866), Expect = 4e-91, Method: Composition-based stats. 
Identities = 101/510 (19%), Positives = 184/510 (36%), Gaps = 89/510 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD + +GCY G+ + T N+D LA +G+RF Y C P RA L TG Sbjct: 41 KPNIIIIMTDDMGFSDLGCYGGE-IETPNLDMLANKGVRFTQFYNAG-RCCPTRASLLTG 98 Query: 63 IYANQSGPWT-----------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 +Y +Q+G + T K AGY+T GKWH+ Sbjct: 99 LYQHQAGIGGMMGDRGAEWPGFRGHLTERCVTFAEVLKTAGYNTYQTGKWHVGDKK---K 155 Query: 112 GECPPEWDADYWF----DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P D+ + G + + KE + D Q N + + Sbjct: 156 EWWPLARGFDHSYSCPQGGGFFFKPSSFKEKRQVVRDTEVLYD-QKNDPPADWYATDAWT 214 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD------- 220 + + F++ A+ + PF+ ++++ PH P + + KY + + +K ++ Sbjct: 215 DEGLKFIESEAKENRPFIWYLAHNAPHFPLQAKPQDIAKYRGKFMQGWDKLREQRHKRLI 274 Query: 221 DLANKPEHHRLWAQAMPSPVGD--------DGLYHHPLYFACNDFVDDQIGRVINALTP- 271 DL + +L + P D Y A D VD +G++I L Sbjct: 275 DLGIIDKQWKLSPREKGIPAWDSLSGKEKYQQDLRMASYAAMIDCVDQNVGKIITKLKEL 334 Query: 272 EQRENTWVIYTSDH-----GEMMGAH-------------------------KLISKGAAM 301 Q +NT +++ D+ G MG + + Sbjct: 335 NQYDNTLILFLHDNGGCDAGGAMGENTGKGTCGTAKSFAYYGACWANVSNTPFRKYKKYI 394 Query: 302 YDDITRIPLIIRSPQGERRQ-----VDTPVSHIDLLPTMMALADIEKPEI--------LP 348 ++ PLI P+G ++ + P ID++ + + L+ P + Sbjct: 395 HEGGISTPLIAHWPEGIAKKLQGKLITEPAHVIDIMASCVDLSGATYPTSFKGHAIIPME 454 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDP 408 G ++ + E + + E F R +KLV ELYD +D Sbjct: 455 GTSLRPLFEGKSL-------ERNDGLFFEHYGHRGVRRGSWKLVATRQGKWELYDMVSDR 507 Query: 409 NEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 E+++L + + ++ + ++ Sbjct: 508 TELNDLSSKM--PEKVKELSRLYNKWTERC 535 >UniRef50_A9MER1 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A9MER1_SALAR Length = 430 Score = 337 bits (865), Expect = 5e-91, Method: Composition-based stats. 
Identities = 129/458 (28%), Positives = 191/458 (41%), Gaps = 58/458 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L TD Q + VGCY+ T +D LA EG++F +A+T PVC PAR+ L TG Sbjct: 2 QPNILVFFTDQQRWDTVGCYNPVVSTTPVLDQLAREGVKFENAFTVQPVCGPARSCLQTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y Q+G + NN+A ++ T+ + F AGY T YIGKWHL D E Y Sbjct: 62 RYPTQNGCYRNNIAMRQDEVTLAKLFNQAGYDTAYIGKWHLADLDEKPVLEALRG-GWQY 120 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 W + N V +D+ T+ A+D+L+ R D Sbjct: 121 WLAADALEHTSHPYGGHFFDNDNQPVH-FDGYRVDDQTTF-------ALDYLKNRQR-DN 171 Query: 183 PFLMVVSYDEPHHP-----FTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 PFL+ +SY EPH F P Y E++ DL N+P W Q +P Sbjct: 172 PFLLFLSYLEPHFQNDMARFVAPDGYAERFQ------TASVPPDLINRP---GDWPQNLP 222 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLIS 296 Y+ +D+ +GR+++ L +NT +++ SDHG Sbjct: 223 D------------YYGMCQNLDENLGRIVDYLKSSGEYDNTIILFFSDHGCHFRTRNDEY 270 Query: 297 KGAAMYDDITRIPLIIR-SPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 K + ++ RIP + R P R V+ V+ +D+ TM++ A I P+ + G ++ Sbjct: 271 K-RSCHESSIRIPCVARGGPFSGGRTVEHLVTLLDIPVTMLSAAGITVPDAMVGRDLQTA 329 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL--VLNLFTSD-----------ELY 402 + G E +I G R T +K V +LY Sbjct: 330 LD-AGHWDEEVLIQISESEVG-----RALRTTRWKYEIVAPGSDPWNESAATIYVESQLY 383 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 D NDP E NLI A +R K+ + M I + Sbjct: 384 DLLNDPWERQNLIASPEHARIRDKLRQDIGRKMTAIGE 421 >UniRef50_UPI0001968556 hypothetical protein BACCELL_00122 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968556 Length = 510 Score = 337 bits (865), Expect = 6e-91, Method: Composition-based stats. 
Identities = 121/478 (25%), Positives = 199/478 (41%), Gaps = 44/478 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LFVM+D +G P+ T N+D A E + F AY+ PV P RA LF+ Sbjct: 46 KKPNVLFVMSDQHRRQALGFMKEDPVITPNLDKFAKEAVTFTRAYSAHPVSGPNRACLFS 105 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW--D 119 G Y + + N+ + + MG FK +GY T YIGKWHLDGH+ P E Sbjct: 106 GKYTQNNKVFGNDCRLEDDGNGMGALFKKSGYSTGYIGKWHLDGHEGGKYSFVPRERRLG 165 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP-- 177 +YW + + G + + + + +A++FL+ Sbjct: 166 FEYWLISQGH------RHFDGRYYGDKDSLIVTGRWMPDYE------TEKALEFLKNRNG 213 Query: 178 -ARADEPFLMVVSYDEPH---HPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 +PF +VVSY PH P + + +L + A ++ Sbjct: 214 ERDEGKPFCLVVSYAPPHNGMGPGFQNKHNIGHWNALLKDLPIRKGSGFAAPKRFEEMYE 273 Query: 234 QAMPSPVGDDGLY--------HHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSD 284 A P + P YF +D+ G++I L ENT V+YTSD Sbjct: 274 PADKLPRRKNVEKVDNKESYPALPGYFGAITSIDENFGKLIQELKDSGEWENTIVVYTSD 333 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEK 343 HGE++G+H + K Y++ +PL+I P + ++ ++ ID+LPT++ LA IE Sbjct: 334 HGELLGSHGRMYK-DLWYEESIGVPLMISYPAKLKPKKAAQLINSIDILPTLLELAGIEI 392 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEH----DSFGGFIPVRCWVTDDFKLV------- 392 PE++ G + + + + G R T + V Sbjct: 393 PEVIDGNSYADYMNGKEKETTDKIFFQFDRGVLNDNGPDRYYRAVRTKRYTYVAAMSPYY 452 Query: 393 --LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 + LYD DP ++H + ++ + +++ A++D+ ++ DPF S W Sbjct: 453 DQFVGKNKEVLYDNEKDPYQLHPIFLGEKYDAIMNELRTAVMDWCEQTHDPFFSKYWK 510 >UniRef50_A6C1Q0 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1Q0_9PLAN Length = 469 Score = 337 bits (865), Expect = 6e-91, Method: Composition-based stats. 
Identities = 106/477 (22%), Positives = 179/477 (37%), Gaps = 67/477 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q +G Y + ++T ++D + +G F +A+ +PVC+P+RA +G Sbjct: 29 RPNLISIVTDDQGRWAMGLYGNRQIHTPHMDQIGKQGAVFTNAFVATPVCSPSRATFLSG 88 Query: 63 IYANQSGPWT------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + + G T + GY T IGKWHL + F E Sbjct: 89 RFPTELKITDWISSEEAQEGAGLTAMTWPEVLQQHGYQTALIGKWHLGELNQFHPHE--K 146 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 + F +N +++ + + + A++F++ Sbjct: 147 GFGHFMGFLAGGTRP-------------MNPTLEIKGETQKRKGSLPDLLVDDAINFIRT 193 Query: 177 PARADEPFLMVVSYDEPHHPFTC-PVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 D+PF + + + PH P+ P + Y ++ Sbjct: 194 S--KDKPFALCLHFRAPHTPYGPVPEQDSAHYEGMKIDVPIT------------------ 233 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 P + + + Y+A VD IGR++ L ENT VI+TSDHG G H + Sbjct: 234 -PGVIPEQIRQKNKEYYASVSSVDRNIGRLLKELDQLRLAENTLVIFTSDHGYNNGRHGV 292 Query: 295 ISKG--------------AAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMAL 338 +KG M+D R+PL++R P + D VS+ID+ ++ Sbjct: 293 STKGNGHWIAGGVTGPKRPNMWDTSIRVPLVMRWPAVIKPGTQFDEIVSNIDMFKFVLGA 352 Query: 339 ADIEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV--LN 394 I +P L G + + + V + G +R T K V Sbjct: 353 LKIPQPANLKLHGIDYSPLLFGQPAPVRKALFGQYDLHNNGLAYLRMIRTPKLKYVKHYR 412 Query: 395 LFTSDELYDRRNDPNEMHNLID---DIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 DELYD DP E NL+ + D + D L+++ I+DP + Sbjct: 413 ARNMDELYDLETDPGENTNLLQRRTRKNWQDTADLLEDQLIEWQKSIQDPILEPAYE 469 >UniRef50_UPI0001BC85B0 choline sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC85B0 Length = 497 Score = 337 bits (865), Expect = 6e-91, Method: Composition-based stats. 
Identities = 114/452 (25%), Positives = 195/452 (43%), Gaps = 29/452 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVC----TPARA 57 ++PN LF++TD + + + + T IDSL AEG+ F + YT +C P+RA Sbjct: 34 QQPNVLFILTDDLQASSIHALGNEDVYTPAIDSLIAEGVTFTNTYTNGALCGALSMPSRA 93 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L TG ++ + K +T + F+ GY T GKWH D + + + Sbjct: 94 MLMTGR--GLYNIQSDGMKIPKAHTTFPQQFRRHGYRTFATGKWHSDKAAFNRSFQE--- 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + +F G + + L V + E F+ + ++ A+ FLQ+ Sbjct: 149 -GDNIYFGGMHPYEQNGHCSPHLNHYDSTGVYGPKTKFTGEEFS-SKMYADAAIRFLQKQ 206 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA-NKPEHHRLWAQAM 236 +PFL V++ PH P Y KY+ ++ N E + Sbjct: 207 KGDKQPFLAYVAFTSPHDPRNQLPNYGRKYSPDTLDVPRNFLPKHPFNNGEMRVRDELLL 266 Query: 237 PSPVGDDGLYHH-PLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKL 294 P+P + + Y+ VD QIGR++ L Q ENT V++ SD+G +G H L Sbjct: 267 PAPRTEQQVQKELSDYYGMISEVDVQIGRIMEVLRATGQAENTIVVFASDNGLAVGRHGL 326 Query: 295 ISKGAAMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 + K +YD ++PL I +P + + + D+ PT+ LA+I PE + ++ Sbjct: 327 LGK-QNLYDHSVKVPLTIIAPSYKNRKGEKNQSLCYLHDIAPTLCELANIPLPESMNAQS 385 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS---DELYDRRNDP 408 + V E G + + R +V D +K ++ ++L+D + DP Sbjct: 386 LYPVLEDSGTTHRKELFLAYSNI------QRAFVNDSYKYIIYHVNGKITEQLFDLQKDP 439 Query: 409 NEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 EMHNL+ + R + +K+ L M++ D Sbjct: 440 LEMHNLLTEKR--EEANKLKKQLAFRMEEEGD 469 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 337 bits (864), Expect = 7e-91, Method: Composition-based stats. 
Identities = 100/477 (20%), Positives = 172/477 (36%), Gaps = 77/477 (16%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN ++++ D +GCY + + T +ID+LA EG+RF Y+ +PVC P+R L +G Sbjct: 20 PNVIYILADDLGYGELGCYGQEKIKTPHIDALAKEGMRFTRHYSGAPVCAPSRGVLLSGQ 79 Query: 64 YANQSGPWTNNVAPGKNIS-------TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 +++ N + T+ + FKD GY T GKW L Y G+ P Sbjct: 80 QLSKAYIRNNREHKPEGQEPIPEPGMTLAQIFKDKGYATGAFGKWGLG---YPGSSSDPK 136 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSV---------------EDLQANHIDETFT 161 D ++ + +W N N D + Sbjct: 137 ALGFDTFYGYNCQRVAHSFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENY 196 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 I + A+ F++ D+PF + + EPH P +++ Y + E + Sbjct: 197 APDLILDEALKFIKD--NKDKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAA 254 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 L Y A +D+ +G V+ L + ENT VI Sbjct: 255 YLPH-------------------LRPRAGYAAMISDLDEHVGSVMQLLKELDLVENTLVI 295 Query: 281 YTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSH 328 +TSD+G L ++Y+ R+P+I P ++ D Sbjct: 296 FTSDNGASHCIEVDHEFFNSTKDLRGLKGSVYEGGLRVPMIAHWPGKIKKAQVSDHVSGF 355 Query: 329 IDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV-TD 387 +D++ T L E P+ G + L + + + F G+ + + Sbjct: 356 VDVMATFCDLLQTEAPQTSDGVSFLPTLKGEKQEPQ----PVLAWEFQGYSGQQAIILDG 411 Query: 388 DFKLVL-----------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 +K V ELYD DPNE +L + ++ ++H A++ Sbjct: 412 RWKGVRQNLSPRGKKKAKSTPKWELYDLNKDPNEKTDLAT--QMPEIVDRIHKAMMK 466 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 337 bits (864), Expect = 8e-91, Method: Composition-based stats. 
Identities = 106/485 (21%), Positives = 175/485 (36%), Gaps = 72/485 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + +M D N VG + T IDS+A G++F + YT VC P+RAG T Sbjct: 19 SRPNLIVIMADDLGYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFIT 78 Query: 62 GIYANQSGPWTN--------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 G Y + G N N A K+ T+ GYH IGKWHL Sbjct: 79 GRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGAEPSL---- 134 Query: 114 CPPEWDADYWFDGANYLS-----ELTEKEISLWRNGLNSVEDL---QANHIDETFTWAHR 165 P + D +F +L + +N L+S + T Sbjct: 135 RPNKRGFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLTEE 194 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 S+ AV F+++ +PF + +SY+ PH P +YL ++ Sbjct: 195 FSDEAVSFIKR--NHQKPFFLFLSYNAPHLPLQATEKYLARFPHIKDP------------ 240 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSD 284 Y A VDD + +V+ +L +NT V + SD Sbjct: 241 ---------------------KRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSD 279 Query: 285 HGEM-----MGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMA 337 +G L + + +++ R+P ++ P ++ D PVS +D+ T+ + Sbjct: 280 NGGPSHKNKSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIAS 339 Query: 338 LADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-N 394 LA + L G N++ + I ++ DFKLV+ Sbjct: 340 LAQSPTHADKPLDGVNLIPFITGEKTQAPHAQIFIRKFDQSRYV----VRQGDFKLVIPY 395 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRK 454 +LY+ D E +N+ + ++ + ++ DP W+K Sbjct: 396 KDAPPQLYNLSKDIGEENNIAA--VHPERVKELEKVRKQWDSELMDPIFLGLLHTEAWQK 453 Query: 455 DARPR 459 A + Sbjct: 454 KAARK 458 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 336 bits (863), Expect = 1e-90, Method: Composition-based stats. 
Identities = 104/456 (22%), Positives = 173/456 (37%), Gaps = 66/456 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN + +++D Q K + T N+D+LA G+R Y +PVC+P+RAGL T Sbjct: 37 KRPNIIVIVSDDQGYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMT 96 Query: 62 GIYANQSGPWTNNVA--------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 G Y + G N VA N + + + AGY+T +GKWHL D G Sbjct: 97 GRYQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGLQD----GC 152 Query: 114 CPPEWDADYWF----DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 P E D +F G +Y E+ ++ +E Sbjct: 153 RPYERGFDEFFGIITGGHDYFVNHPEERAVGDQSYKARIERNGPVGEAVPGYLTDAFGAD 212 Query: 170 AVDFLQQP--ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 AV +++ R D+P + ++++ PH P P + ++ Sbjct: 213 AVRIIRESHTKRPDQPLFLYLAFNAPHTPTQAPKDLVD---------------------- 250 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG 286 MP+ + Y A +D +G+V AL E +T++++ SD+G Sbjct: 251 -------TMPATL---ESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFIVFFSDNG 300 Query: 287 E----MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALAD 340 L ++Y+ R+P P + PV+ +D+ T ALA Sbjct: 301 GANHPYYDNTPLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSLDVFATACALAG 360 Query: 341 IEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 + L ++L V E E FG + R D KLV+ S Sbjct: 361 TKPETSHPLDSVDMLPVLEGNARQPTHATLFWEFPGFGAAVADR-----DLKLVVPKKGS 415 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 +L+D D E +L + + +++ L ++ Sbjct: 416 PQLFDLAVDIGEKSDLAA--QNPEKVARLSTLLSEW 449 >UniRef50_A6DG72 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG72_9BACT Length = 468 Score = 336 bits (863), Expect = 1e-90, Method: Composition-based stats. 
Identities = 101/456 (22%), Positives = 185/456 (40%), Gaps = 46/456 (10%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF++ D +++ CY K T N+D LA++ I F+ AY C P+R L Y Sbjct: 30 NVLFIIADDLKASVLACYGDKICQTPNLDKLASQSIVFDRAYCQGLSCGPSRTSLMHSRY 89 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WHLDGH----DYFGTGECPPEWD 119 G + + K+ G++T +GK +H+ + P W Sbjct: 90 ------------LGSEGINLPEHLKNNGWYTVRVGKIYHMRVPYDIIHGIDGQDIPSSWT 137 Query: 120 ADYWFDGAN--------------YLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 + GA + L +E S +N + + + D+ + Sbjct: 138 EKFNSKGAESHTPGDYACLNKNIFTKSLKNRESSGMKNRMFVSVISEGDGSDQPDVKS-- 195 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 + + ++ L Q R +EPF + PH+P P E+ + Y +L E Sbjct: 196 -AEKTIELLNQ--RKNEPFFIATGLVRPHYPNVAPKEFFQNYPWEKIDLPELRNPTSLGI 252 Query: 226 PEHHRLWAQAMPSPVG---DDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIY 281 P + +G D+ Y+A +F+D QIGR+++ + + NT +I+ Sbjct: 253 PAAGHPRITNSNNSIGKYPDNQKRMWSAYYATVEFMDRQIGRILDEVDRLGLKSNTAIIF 312 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADI 341 SDHG +G H K +++++TR+PLI P R+++ +D+ P++ L + Sbjct: 313 LSDHGYHLGEHGFWQKN-NLHEEVTRVPLIAYIPGLAPRRINEVTELVDIYPSLTELLGV 371 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDEL 401 KP+ + G++ L + + + + G T+DF + + EL Sbjct: 372 YKPKTVQGKSFLPFLKNKTEDFRNSALSLMPGKKG-----YSIRTEDFSYIRYQNGAAEL 426 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 Y+ DP ++ NLI + SK+ L + + Sbjct: 427 YNMNKDPKQLVNLIQNPEHKQTISKLDRELNTRLKE 462 >UniRef50_Q02B50 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02B50_SOLUE Length = 478 Score = 336 bits (863), Expect = 1e-90, Method: Composition-based stats. 
Identities = 124/472 (26%), Positives = 194/472 (41%), Gaps = 50/472 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L +++D + +G T N+D +A+ G+ F SA + PVC PARA +FT Sbjct: 34 RPNVLLIISDQFRWDCIGAMGLNPMNLTPNLDGMASRGVLFRSAISNQPVCAPARASIFT 93 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y ++ G W N + N T+G K AGY T YIGKWHL P + Sbjct: 94 GQYPSRHGVWRNGLGLAANAVTLGSAMKQAGYSTNYIGKWHLSPGAADTPETRGPVKPEN 153 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + + S G D + H A +++RA FL+ A Sbjct: 154 RGGFQDLWEAANVLELTSHAYEGDLFDGDGKPLHFS-NRYRADFMTDRAQLFLRSRAAR- 211 Query: 182 EPFLMVVSYDEPHHP-----FTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 PFL+ +SY E HH F P E+ +Y + + P+ R Sbjct: 212 SPFLLTLSYLEVHHQNDKDTFDPPKEFAGRYPNPFV-------------PQDLRPLPGTW 258 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLI 295 PS + D YFAC +D+ +G + L + NT V++TSDHG Sbjct: 259 PSQLAD--------YFACVAKMDEIVGTLRKTLVETGLDKNTIVMFTSDHGNHFRTRNAE 310 Query: 296 SKGAAMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 K + ++ IPL++ P R +V+ VSH+D+ PT++A A +E P + G N L Sbjct: 311 YK-RSPHESSIHIPLVMEGPGFNRGMEVNQLVSHVDMAPTLLAAAGLEVPASMQGHNFLP 369 Query: 355 VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL--------------VLNLFTSDE 400 + + E R E+ + F+ R T + ++ F Sbjct: 370 LLD---RHTEGWRDEV-YFEMSEFVTGRGLRTPQYTYAAAAPKVPGWKATPTVDKFVEYM 425 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 LYD +DP + NL + V +++ L M + + + P+ Sbjct: 426 LYDLYSDPYQQVNLAGRTPYQKVAAELRQRLQARMREASGERSTIDPAWFPY 477 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 336 bits (862), Expect = 1e-90, Method: Composition-based stats. 
Identities = 105/460 (22%), Positives = 164/460 (35%), Gaps = 73/460 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + +M D + CY K + T +ID LAA ++F ++ +CTP RA + T Sbjct: 33 QPPNIVLIMADDLGYGDLACYGNKQVKTPHIDRLAASALKFTDFHSAGAMCTPTRAAMLT 92 Query: 62 GIYANQSGPW---------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y + G +++ TM K GY T GKWHL + Sbjct: 93 GQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWHLG----YQPP 148 Query: 113 ECPPEWDADYW-----FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P D + DG ++ W N S+ E A +S Sbjct: 149 WLPTNQGFDLFRGLTSGDGDHHTHVDRSGNEDWWHNNEISM---------EKGYTADLLS 199 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +V F++ A PF + V + H P+ P + + A + Sbjct: 200 KYSVAFME--ANRTRPFFLYVPHLAIHFPWQGPQDPPHR---------------KAGQDY 242 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG 286 H W +P P P A + +D +G++++AL E NT VI+TSD+G Sbjct: 243 HAGKWG-IIPDPGNV-----SPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIFTSDNG 296 Query: 287 EMM----------GAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTM 335 + L + A +Y+ R+P +I P D +DLLPT+ Sbjct: 297 GYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAGVTDQTAHSVDLLPTL 356 Query: 336 MALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 A I G ++ + + R + D F R +KL L Sbjct: 357 AQAAGISATNFQTDGLDLAPL-------WQTGRPLADRDLFWRMGNNRAVRRGQWKLCL- 408 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 ELY D E N ++ M AL ++ Sbjct: 409 KNNRSELYHLETDLGEQQNRAA--EHPEIVKSMSQALKEW 446 >UniRef50_UPI00016C0B39 choline sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0B39 Length = 459 Score = 336 bits (862), Expect = 1e-90, Method: Composition-based stats. 
Identities = 108/464 (23%), Positives = 181/464 (39%), Gaps = 40/464 (8%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAY----TCSPVCTPAR 56 MK PN L + D Q N + + + T N+D L A G F A+ TC +C +R Sbjct: 1 MKAPNVLILFADDQRFNTINALNNDEIITPNLDRLVASGTAFTHAHIQGGTCGAICMASR 60 Query: 57 AGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 A L TG + + + MG +FK GY T GKWH + + + Sbjct: 61 AMLNTGR--SLFKLDDLGQQIPDDHTLMGEFFKARGYDTFGTGKWHNGKKSFNRSXDQGD 118 Query: 117 EWDA----DYWFDGANYLSELTEKEISLWR----NGLNSVEDLQANHIDETFTWAHRISN 168 D+W + E + + N N V+ ++ I+ Sbjct: 119 SIFFGGMSDHWAVPFYHYDSSAEYNKVIRKCVDQNHSNEVKKTAGEYMRAGEHSTDVIAE 178 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA-------QDD 221 + FL Q + D+PF S+ PH P T P E+L Y +L + Sbjct: 179 SVIKFLDQ--KHDKPFFAYTSFLAPHDPRTMPEEFLNMYNPEDIKLPPNFMSYHFIEYAN 236 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 + E + + + + H Y+A +D QIGR+++ L +++NT ++ Sbjct: 237 WECRDETLAPYPRTLA-----NTQKHIAEYYAMITHLDYQIGRILDKLEEIGEKDNTIIV 291 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALA 339 Y D+G +G H L K ++YD R+PL+I + D V D+ PT+ L Sbjct: 292 YAGDNGLALGQHGLFGK-QSLYDHSMRVPLLISGAGIKAGMKTDALVYLFDIFPTLCDLL 350 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS- 398 + E P + G++ + Y D +R D FK + + + Sbjct: 351 EQEIPASVTGQSFAECIKGTKDAARDQIYLAYTDK------IRAITKDGFKYIEHRYNGI 404 Query: 399 --DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +L+D +DP EM NL+ + + + + AL + + Sbjct: 405 ITKQLFDLNSDPFEMSNLVLNPNYQEKLVALQKALQAESQQSNE 448 >UniRef50_Q7UVD9 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UVD9_RHOBA Length = 564 Score = 336 bits (862), Expect = 1e-90, Method: Composition-based stats. 
Identities = 124/493 (25%), Positives = 192/493 (38%), Gaps = 88/493 (17%) Query: 3 RPNFLFVMTDTQATNMV------GCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPA 55 +PN + V+TD QA G +S P+ +T N+D LAAEG F + + +PVC+PA Sbjct: 101 KPNVVLVLTDDQAPWAFAEAVRSGQFSDVPIPSTPNMDRLAAEGAVFRNFFCTTPVCSPA 160 Query: 56 RAGLFTGIYANQSGPWTNNVAPGK--------------NISTMGRYFKDAGYHTCYIGKW 101 RA L TG YA++ G PG N T + GY T +GKW Sbjct: 161 RATLMTGRYASELGIKDFIPQPGHKLYDPDSPIHLDPDNTVTFAEVMQQQGYTTGLVGKW 220 Query: 102 HLDGHDYFG-TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 HL G +G+ P D + + E+ L V+ Q Sbjct: 221 HLGDWTANGDSGKHPTRHGFDSFMGLTGGGTTPDNPELELN----GKVQQFQG------- 269 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTC-PVEYLEKYADFYYELGEKAQ 219 +++ A+DF++Q AD PF + +S PH + E + Y + + + Sbjct: 270 LTTDILTDHAIDFVEQ--NADRPFFLCLSTRAPHGRWLPVAPEDWQPYEEMDPTIPQ--- 324 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 P D Y A VD +GR++ L ++ NT Sbjct: 325 ----------------YPDLDTDWVRKKMKEYLASTSGVDRNLGRLLKTLDAQELTSNTI 368 Query: 279 VIYTSDHGEMMGAHKLISKG--------------------------AAMYDDITRIPLII 312 VI+TSDHG MG H + KG +YD R+P I+ Sbjct: 369 VIFTSDHGFNMGHHGIYHKGNGIWATRQKPPGKFHQGTRVISDKYRPNLYDHSLRVPAIV 428 Query: 313 RSPQGER--RQVDTPVSHIDLLPTMMALAD-IEKPEILPGENILAVKEPRGVMVEFNRYE 369 R P + ++ SH+D PT+ A+A + LPG ++ + + Sbjct: 429 RWPGVVKPSAVIEATASHLDWFPTLCAIAGDGSSAKDLPGRDLSPLLKGELQDDWDQAQY 488 Query: 370 IEHDSFGGFIP-VRCWVTDDFKLVLNLFTS--DELYDRRNDPNEMHNLIDDIRFADVRSK 426 E+D + +R + T ++KL+ + DE YD DP+E NLI + V + Sbjct: 489 FEYDMINYAVASLRGYRTPEYKLIRDRHNEGCDEFYDLTTDPDETVNLIRNPGSQAVIKR 548 Query: 427 MHDALLDYMDKIR 439 + L K+ Sbjct: 549 LDAKLRAMEKKLE 561 >UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY9_9BACT Length = 590 Score = 336 bits (861), Expect = 2e-90, Method: Composition-based stats. 
Identities = 94/460 (20%), Positives = 179/460 (38%), Gaps = 71/460 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++TD Q + + + ++T ++D LA +G RF + + + VC P RA L TG Sbjct: 25 KPNIVLILTDDQGYGDISSHGNRMIDTPHLDQLAEDGTRFENFFVSN-VCAPTRASLLTG 83 Query: 63 IYANQSGPW---TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 Y ++G +T+ FK GY T GKWH H PP Sbjct: 84 RYHIRTGVVQVSRGLEIMRSEEATIAEVFKAQGYETGLFGKWHNGEH----YPNNPPGQG 139 Query: 120 ADYWFDG-ANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D +F A ++ + ++ D + +++RA+D++++ Sbjct: 140 FDEYFGFCAGHIGDF-----------FDATLDHNKTFVKTKGFITDVLTDRAIDWIEK-- 186 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 + D+PF + Y+ PH P+ +Y +++A Y Sbjct: 187 QQDKPFFAYIPYNAPHAPYQVEDKYYDEFAAKGYS------------------------- 221 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM---MGAHKL 294 H + + +DD IGR++ L +NT VI+ +D+G + Sbjct: 222 -------AAHSAAYGMIENLDDNIGRLLKILDDLNLTDNTIVIFLTDNGPNSPTRFNGGM 274 Query: 295 ISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALAD--IEKPEILPGE 350 ++ + R+P IR P R + +HID+LPT+M LA ++ P L G Sbjct: 275 KGSKGSVDEGGVRVPFFIRWPGKIAKGRTIHDLAAHIDVLPTLMELAGVNVDLPNKLDGR 334 Query: 351 NILAVK--EPRGVMVEFNRYEIEHDSFGGFIP----VRCWVTDDFKLVLNLFTSDELYDR 404 ++ ++ + I G + ++ ++ VL+ + LYD Sbjct: 335 SLTSLISSSKTPKAPAWPERLIFTQGPGTNMTPGSGAGAARSNQYRYVLSR-GEEGLYDM 393 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 NDP + + ++ A ++++ + + Sbjct: 394 INDPGQEK--DLKKSKKKIFDELKAAYIEWLKDVSAGWEP 431 >UniRef50_Q7W424 Putative sulfatase n=2 Tax=Bordetella RepID=Q7W424_BORPA Length = 485 Score = 336 bits (861), Expect = 2e-90, Method: Composition-based stats. 
Identities = 122/447 (27%), Positives = 193/447 (43%), Gaps = 21/447 (4%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + +M+D + +GCY + ++T N+D+LAA G RF SAY SPVC PARA TG Y Sbjct: 7 NMVVIMSDEHQSRALGCYGHEFVHTPNLDALAARGTRFASAYCTSPVCIPARASFATGKY 66 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF 124 NQ G W N A ++ + +D + IGK H DY G E + Sbjct: 67 INQIGFWDNADAYDGSVPSWHHMLRDRDHQVVSIGKLHFR--DYGGDHGFSEEIIPMHIV 124 Query: 125 DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA--RADE 182 G L L ++ + + + TF I +RA +L++ A AD+ Sbjct: 125 GGKGDLMGLVRSDLPVRKGAYKMAQMAGPGESQYTFY-DREIVSRAQIWLREQAPRHADK 183 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD-LANKPEHHRLWAQAMPSPVG 241 P+++ VS+ PH P T P E+ +Y + L + + P Sbjct: 184 PWVLFVSFVSPHFPLTAPPEHYYRYYNRDLPLPKLYDRSQRPDHPYQQDYRGSFNYDDYF 243 Query: 242 DDGL--YHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHKLISKG 298 D GL Y+ F+D+ IG+++ L + ++T V+YTSDHG+ +GA + K Sbjct: 244 DPGLVKKAQAGYYGLCSFLDENIGKLLGTLDDLDILDSTRVVYTSDHGDNLGARGMWGK- 302 Query: 299 AAMYDDITRIPLIIRS---PQGERRQVDTPVSHIDLLPTMMALADIEKP---EILPGENI 352 + M+++ +PLII P G V+TPVSH+D+ P + P E LPG ++ Sbjct: 303 SNMFEEAAAVPLIIAGRDIPSGV--TVNTPVSHVDVAPFIYDAVGETSPGLKEGLPGVSL 360 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 + E+ G +K + + +L+D DP E+ Sbjct: 361 FQLARGETPQR---NVMAEYHGMGSATGAFMIREGRYKYIFYVHYPHQLFDLEADPEELE 417 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIR 439 +L +ADV + L D + Sbjct: 418 DLAGRPDYADVVALCKQKLWQLCDPVE 444 >UniRef50_A6DR15 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DR15_9BACT Length = 526 Score = 335 bits (859), Expect = 3e-90, Method: Composition-based stats. 
Identities = 109/486 (22%), Positives = 190/486 (39%), Gaps = 65/486 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + ++ D + +GCY G+ + T NID+LA EG+RF + + CTP+RA L T Sbjct: 40 SRPNIIVILADDMGYSDLGCYGGE-IQTPNIDALAREGVRFT-GFKNTARCTPSRASLLT 97 Query: 62 GIYANQSGPW---------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y++ G + T+ K GY T +GKWH G Sbjct: 98 GRYSHSVGVGAMQQDQHLPGYRGQLSADAPTIAEILKPHGYATGVVGKWH---QAVTGKS 154 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 + P + D FD + S NS + F H +S+ A++ Sbjct: 155 KQKPLFPLDRGFDFFYGTWWGAKDYFSPKFMMKNSEHIPDSTTYPADFYLTHALSDSAIE 214 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCP----VEYLEKYADFYYELGE---KAQDDLANK 225 F+ PF + +++ PH P P + +++Y + +L + + Q +L Sbjct: 215 FVDAQVGQQNPFFLYLAHYAPHAPIQAPADRIQKCIDRYKAGFVKLQQERFERQQELGVA 274 Query: 226 PEHHRLWAQA-----MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWV 279 P++ Q+ + ++ + Y A + +D+ IG++I+ L +NT + Sbjct: 275 PDNAATIDQSSKWNTLSEADKNEWVTTMATYGAMIEIMDEGIGQLIDVLKKNGQYDNTLI 334 Query: 280 IYTSDHGE---MMGAHKLI------------SKGAAMYDDITRIPLIIRSPQG----ERR 320 + SD+G G L A + PLI+ P + Sbjct: 335 LVLSDNGSTPNHKGTRNLANLCATLSNTPFSGVKAHALEGGISSPLIVSWPDKLKEYAGQ 394 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEIL--------PGENILAVKEPRGVMVEFNRYEIEH 372 + ID+LPT + A + P+ G N++A +G +E + EH Sbjct: 395 IRNGRCHIIDILPTCLDAAGAKFPDAFKGIKPVQADGINLMAAV--KGAELESRPFFWEH 452 Query: 373 DSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 + R D +KLV + T +LYD DP E +L + + S + Sbjct: 453 ------LQSRAVYRDAWKLVADK-TLWKLYDLSKDPAEQCDLSS--KHPERASALKALWT 503 Query: 433 DYMDKI 438 ++ ++ Sbjct: 504 EWAEEY 509 >UniRef50_UPI0001968553 hypothetical protein BACCELL_00119 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968553 Length = 514 Score = 335 bits (859), Expect = 3e-90, Method: Composition-based stats. 
Identities = 114/477 (23%), Positives = 208/477 (43%), Gaps = 47/477 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +FV+TD +G P+ T ++D A+ + F++A +C PV P RA LFT Sbjct: 13 EKPNVIFVLTDQWRKQALGFKGEDPVQTPHLDEFASWAVSFDNATSCRPVSGPGRACLFT 72 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y+ +G + N V + +MGR FK AGY T YIGKWHL+G + + D Sbjct: 73 GKYSINNGVFANKVPLATDEESMGRVFKAAGYATAYIGKWHLNGMND-HVTDSVRRQGFD 131 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 Y+ + G +D + +I + ++F+++ ++ Sbjct: 132 YFVQSMGHQPFF---------QGYYVQDDKERTYI--KGWAPTYETQLGIEFIEKQKTSE 180 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-----------DDLANKPEHHR 230 +PF +V+SY+ PH + ++Y Y +K + + L ++ + Sbjct: 181 QPFCLVLSYNPPHT--GGGPGFEDRYQPGKYGPDKKLKMGYGYAGPAEYEALYKDIDYEK 238 Query: 231 LWAQAMPSPVG---DDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHG 286 + P+ D P YF +D+ G ++ L ENT +++TSDHG Sbjct: 239 NPIRGNLKPIRRSSDTSARVIPGYFGAITAIDNDFGNLMTYLEQNDLLENTIIVFTSDHG 298 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPV-SHIDLLPTMMALADIEKPE 345 E MG+ L++KG +++ +P ++ + + + V + IDL+PT++ L+ + P+ Sbjct: 299 ESMGSQGLMTKG-TWFEESMGVPCLVGWKGVIKPKREIVVFNSIDLMPTLLGLSGLSIPQ 357 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGF------IPVRCWVTDDFKLVLNLFTSD 399 + G + + + Y FGG R T + VL + Sbjct: 358 GVDGVDYSPLLLGKKFKA--PEYAFTSFDFGGVEELKAPRYWRAVYTSRYTYVLCGMNQN 415 Query: 400 E--------LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 LYDR DP +++ + + + ++H L+ ++D+ DPF W+ Sbjct: 416 RAFTKDGLVLYDREKDPLQLNPIYKGMGYDKTIDRLHAELVKHLDETGDPFIKEYWN 472 >UniRef50_UPI0001C35931 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35931 Length = 472 Score = 335 bits (859), Expect = 3e-90, Method: Composition-based stats. 
Identities = 118/514 (22%), Positives = 199/514 (38%), Gaps = 68/514 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M+RPN +F++ D +G + T N+D +A EG + + SPVC+PARA L Sbjct: 1 MRRPNIVFILADDMGFWTLGSAGNRDAVTPNLDEMAREGCIAENFFCSSPVCSPARATLL 60 Query: 61 TGIYANQSGP-----WTNNVAPGK-------NISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 TG + G N G+ + Y +AGY GKWHL Sbjct: 61 TGRMPSMHGILDWILRGNIKNEGEIPIEYLNDFKGYTDYLSEAGYICGLSGKWHLG---- 116 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 + +W+ + + + + R G + E I+ Sbjct: 117 ---DSQKQQKGFSHWY--VHQSGGGSYYDAPMIREGKR---------VCEQGYITELITR 162 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQDDLANKP 226 AV FL + A + PF + V++ PH P+ P +YL+ Y D ++ + P Sbjct: 163 DAVRFLNEHAGKNAPFYLGVNFTAPHTPWIHNHPQKYLDLYRDCAFD----------SCP 212 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDH 285 R Q + D YFA +D +G + L + + +T ++++SD+ Sbjct: 213 VEQRHPWQIDFAEFNYDRTEMLKGYFAATSALDVGVGEIREELKRLKLDQDTLILFSSDN 272 Query: 286 GEMMGAHKLISKGA-----AMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 G G H + KG MYD ++P + P ++ S D PT+M + Sbjct: 273 GFNCGHHGIWGKGNGTAPFNMYDTSVKVPFLACMPGKIQPGTRLRGLYSAYDFFPTIMEI 332 Query: 339 ADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-LNLF 396 A ++ E LPG++ G + N + + +G VR ++K + Sbjct: 333 AGVQYKEKGLPGKSFAKAV-FSGEERDINDCVVVYSEYG---AVRMIRQKEWKYIRRYPE 388 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA 456 DELY+ + DP+EM N+ID ++ ++ L + + P + A Sbjct: 389 GPDELYNLKTDPDEMRNMIDKAA-PELIELLNKRLDSWFSEHTRPETDG--------RQA 439 Query: 457 RPRWMGAFRPRPQDGYSPV--VRDYDTGLPTQGV 488 G R G+ P + Y+T LP + Sbjct: 440 NVTGAGQNRKYTNHGFEPGSFEKGYET-LPIRQA 472 >UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW5_9SPHI Length = 523 Score = 335 bits (859), Expect = 3e-90, Method: Composition-based stats. 
Identities = 113/472 (23%), Positives = 179/472 (37%), Gaps = 72/472 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +++ D +GCY + + T N+D LA EGIRF YT PVC PAR L TG Sbjct: 47 PNIIYIYADDLGYAELGCYGQQKIRTPNLDKLAREGIRFTQHYTSMPVCAPARCMLLTGK 106 Query: 64 YANQSGPWTNNVAPG-------------KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 ++ S N G T+GR + GY T +GKW + + Sbjct: 107 HSGHSYIRGNYEMGGFPDSLEGGQMPLYPGAFTIGRLLQQQGYKTACVGKWGMGMAN--- 163 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGL-----NSVEDLQANHIDET------ 159 T P E DY++ + LW NG N V D+ ET Sbjct: 164 TTGNPNEQGFDYFYGYLDQKQAHNYYPTHLWENGKPDKLNNPVIDVHRRLTPETATPEAF 223 Query: 160 ------FTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 +++ +A F++Q PF + + + PH P +++Y + Sbjct: 224 AYFRGNDYAIDKLAQKAQAFIRQ--NKSGPFFLYLPFTAPHVSLQAPEAAVKEYIGKF-- 279 Query: 214 LGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 D + E L Q S Y Y A +D QIG+++ L + Sbjct: 280 ------GDGEQRTERPYLGEQGYAS-----TPYPRATYAAMITHMDAQIGQLMQLLKDLK 328 Query: 274 RE-NTWVIYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERR 320 + NT V+++SD+G KL +Y+ R P++ R P + Sbjct: 329 IDENTLVMFSSDNGATFNGGVEAAYFNSVGKLRGLKMDVYEGGIREPMLARWPGRIKPNQ 388 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFI 379 D DLL T+ L ++P G + L + + + + E+ GG + Sbjct: 389 TTDHVSVQYDLLATLAELVGYKRPFATDGISFLPTLLGQSSSQKQHPFLYWEYPEKGGQL 448 Query: 380 PVRCWVTDDFKLV-----LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSK 426 +R ++K V + T ELYD D +E N+ D + D+ + Sbjct: 449 AIRM---GNWKAVKTNVRKDRTTPWELYDLNKDVSETTNIAD--KHPDIIRQ 495 >UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1I8_9PLAN Length = 427 Score = 334 bits (858), Expect = 3e-90, Method: Composition-based stats. 
Identities = 104/470 (22%), Positives = 188/470 (40%), Gaps = 63/470 (13%) Query: 8 FVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQ 67 ++ D V Y + T ID LAAEG+ S VC+P+RA L TG YA++ Sbjct: 2 LILADDLGYGDVSTYHPSDVRTPQIDQLAAEGMLLTSMRANCTVCSPSRAALLTGRYADR 61 Query: 68 SGPWTNNVAPGKN--------ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G ++ + T+ K GYHT +GKWHL + P E Sbjct: 62 VGVPGVIRTKPEDSWGWFDPTVPTLADELKRVGYHTAIVGKWHLG----LESPNTPNERG 117 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL-QQPA 178 D++ +L ++ + + R G N + I+ ++ A ++L ++ Sbjct: 118 FDFF---QGFLGDMMDSYTTHLRYGNNYMRR-NREVIEPQGHATELFTDWASEYLVERAK 173 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 + ++PF + ++Y+ PH P P E+L K + +L +K ++ Sbjct: 174 QKEQPFFLYLAYNAPHFPIEPPAEWLAKVKERAPQLDQKRAKNV---------------- 217 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMM----GAHK 293 A + +D IGRV+ L + NT V++TSD+G + Sbjct: 218 --------------AFVEHLDHSIGRVLKTLKETGLDQNTVVVFTSDNGGSLPHAQNNDP 263 Query: 294 LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 + YD R+P ++R P + D + DL PT + LA + L + Sbjct: 264 WRDGKQSHYDGGLRVPFMVRWPGQIKAGSRSDYVGLNFDLFPTFLELAGATPSKELDAVS 323 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSF--GGFIPVRCWVTDDFKLVLN-LFTSDELYDRRNDP 408 ++ V + + + Y + + G + ++KL+ N +++ ELY+ +NDP Sbjct: 324 LVPVLKGGKITTSRDLYFVRREGGVTYGGKSYEAIIRGEWKLLQNDPYSALELYNIQNDP 383 Query: 409 NEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARP 458 E +L V +++ AL ++ + + W P + P Sbjct: 384 GETKDLAA--SNKKVVNELAAALRLHIQRGG----ATPWQAPPRKPALAP 427 >UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3ZMT9_9PLAN Length = 542 Score = 334 bits (857), Expect = 4e-90, Method: Composition-based stats. 
Identities = 104/521 (19%), Positives = 184/521 (35%), Gaps = 98/521 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +M D + +G + G+ + T NID+LA G+RF+ Y C P RA L TG Sbjct: 28 RPNIILIMVDDMGFSDLGYHGGE-IATPNIDALAHSGVRFSQFYNNG-RCCPTRATLMTG 85 Query: 63 IYANQSGPWTNNVAPGK-----------------NISTMGRYFKDAGYHTCYIGKWHLDG 105 +Y +Q+G +PG+ N T+ + GY T GKWHL Sbjct: 86 LYPHQTGIGHMTESPGEANYGSGKPPTYQGYLNRNCVTIAEALQQQGYATLMSGKWHLGE 145 Query: 106 HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 +D P + + +F + + + N + D+ F Sbjct: 146 ND---KSRWPLQRGFEKYFGCLSGATLYFFPDGDRKMTLGNQQIAEPESTTDQPFYTTDA 202 Query: 166 ISNRAVDFLQQPARADE-PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++ A+ FL++ + P + ++Y PH P E + KY Y +K ++ Sbjct: 203 FTDYAIRFLKEEQAGQQRPMFLYLAYTAPHWPLQAFEEDIAKYRGKYKIGWDKLREQRLE 262 Query: 225 KPEHHRLWA---------------QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 + ++ L A + + D+ +Y A D VD IGR++ L Sbjct: 263 RQKNLGLIAADRQLSPRTPKIPAWDELDAAQQDEMDLKMAVYAAMIDRVDQNIGRLMKHL 322 Query: 270 TPEQ-RENTWVIYTSDHGE------MMGAH-------------------------KLISK 297 ++T +++ SD+G + GAH Sbjct: 323 KESGIEDDTLILFLSDNGGCQEGGVLGGAHFLDPEQRNRQYFHGYGEAWANASNTPFRLY 382 Query: 298 GAAMYDDITRIPLIIRSPQGERRQ---VDTPVSHIDLLPTMMALADIEKPEI-------- 346 ++ T P +R P + P ID++PT++ +A P Sbjct: 383 KHFNHEGGTATPFFMRWPGKIAARDAWCAEPAQLIDVMPTILDVAGATYPAKYAENAIPP 442 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL--------FTS 398 L G ++ + + IEH++ D+KLV Sbjct: 443 LDGVSLRPTMQGE-PLDRQQPICIEHENNA------SIRAGDWKLVGRGVAAPRGVQPAK 495 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELY+ +D E NL + + ++ + ++ Sbjct: 496 WELYNIADDRTETQNLA--VEHPEKVRELSQQWNAWAKRVG 534 >UniRef50_C6XTA2 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XTA2_PEDHD Length = 535 Score = 334 bits (857), Expect = 5e-90, Method: Composition-based stats. 
Identities = 108/514 (21%), Positives = 189/514 (36%), Gaps = 81/514 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D ++GCY + + T NID LA G F S Y VC P RA + T Sbjct: 29 EKPNVLFIAVDDLKP-ILGCYGDRLIKTPNIDRLAKMGTVFKSNYCQQAVCGPTRASIMT 87 Query: 62 GIYANQSGPWTNNVAP---GKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + W +I T+ +YF GY T IGK + P W Sbjct: 88 GMRPDITKVWDLKTKMRDMNPDILTIPQYFASQGYSTQAIGK--IYDPRCVDEDLDKPSW 145 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSV-------------------------EDLQA 153 ++ Y + T + + + G ++ Sbjct: 146 TVPHYRTDKKYYAASTGQPVLNYYQGKEIKSLVEKRRAEAKGKIITDQELLATIKPSVEC 205 Query: 154 NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 + + +A D L + +PF V + +PH PF P +Y + Y Sbjct: 206 VDVPDQAYIDGANILQAKDILTTLQKKSQPFFFAVGFAKPHLPFNAPKKYWDLYQREDMP 265 Query: 214 LGEKAQDD----------------LANKPE-----HHRLWAQAMPSPVGDDGLYHHPLYF 252 + + ++ P+ + + +P + + Y+ Sbjct: 266 VAAFQEKSKNAVDVAYHNSGELRAYSDIPDLLSFTDQKSYGLTLPIAKQKELI---HGYY 322 Query: 253 ACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLI 311 A +VD Q+G ++NAL +NT ++ DHG +G H L K + ++ TR PLI Sbjct: 323 AAVSYVDAQVGILLNALDSLGLSKNTVIVLWGDHGWHLGDHNLWCKHSD-FEQATRSPLI 381 Query: 312 IRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV-KEPRGVMVEFNRYEI 370 +P + + +D+ PT+ LA I P+ L G +++ + + P + EF + Sbjct: 382 FSAPGIKSSATTSLSEFVDVFPTLCNLAGIPVPQHLEGTSLVPLMRNPASSIKEFAISQY 441 Query: 371 EHDSF----------GGFIPVRCWVTDDFKLVLNLFT-------------SDELYDRRND 407 S + T ++ + + DELYD + D Sbjct: 442 PRSSNAVETQRMTDASAKVMGYSLRTKRYRYTIWMENFRSNQAFKATAVVGDELYDYQKD 501 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 P E N++ D +A + + D ++ Y P Sbjct: 502 PLEKINVVKDRNYALIAKSLKDKMIRYFHSKEKP 535 >UniRef50_Q1ARG1 Sulfatase n=2 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1ARG1_RUBXD Length = 492 Score = 334 bits (857), Expect = 5e-90, Method: Composition-based stats. 
Identities = 117/470 (24%), Positives = 186/470 (39%), Gaps = 45/470 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++TD Q VG G L G F +A+ VC P+RA + Sbjct: 45 ERPNLILILTDDQTPGDVGYMPGVRAL------LRDRGTTFRNAFVTDSVCCPSRATILR 98 Query: 62 GIYANQS---------GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G YA+ G + G ST+ + K GY T ++GK+ L+G Y T Sbjct: 99 GQYAHNHEIAGAKPPAGGFEKFRRLGLERSTVATWLKARGYATGFVGKY-LNG--YLRTT 155 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 PP WD Y F+G Y + +L NG N ++ + + +A+ Sbjct: 156 HVPPGWDRWYGFNGGGY------HDFTLNENGRN------VSYRGPSSYQTDVLGRKALG 203 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLANKPEHHRL 231 F++ AR D PF + +S PH P + +A + D+++KP Sbjct: 204 FVRWAARRDRPFFLHLSPWAPHGPAEPAPRHARLFARTPLPRPPSFDERDVSDKPR---- 259 Query: 232 WAQAMPSPVGDDGLYHHPLY---FACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 W + P ++ LY VD+ +GR++ AL ENT++ +TSD+G Sbjct: 260 WVRDNPRLGREEVREMGRLYRNRLRTLRAVDELVGRLVAALRESGQLENTYIFFTSDNGF 319 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEI 346 MG H+L Y++ R+PL++R P +V V + DL PT L P Sbjct: 320 HMGHHRLPEGKWTAYEEDIRVPLLVRGPGVPEGRVLPHLVLNNDLAPTFGRLGGARVPGY 379 Query: 347 LPGENILAVKEPRGVMVEFNR----YEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELY 402 + G +++ + R E + D R + V ELY Sbjct: 380 VDGRSLVLLLRRDPPSRRSWRSAFLVEAKRDGANRRPAYRALRSVGHLYVEYESGERELY 439 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 D R DP+++ NL + R K+ L + R+ + W Sbjct: 440 DLRRDPHQLRNLAPRLDGESAR-KLRSRLAKLSGCAEEECRTLENRKPVW 488 >UniRef50_A7V656 Putative uncharacterized protein n=6 Tax=Bacteroides RepID=A7V656_BACUN Length = 521 Score = 334 bits (857), Expect = 5e-90, Method: Composition-based stats. 
Identities = 108/480 (22%), Positives = 193/480 (40%), Gaps = 46/480 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN + +M D +++ D+LA G F+ AYT +P PAR + T Sbjct: 30 KEPNIIIIMADQLRVDLLQREGYPLNTMPFADNLAKNGTWFDCAYTSAPASGPARVSMLT 89 Query: 62 GIYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G + + + +N N+ + K+ GY T +GK H T + W Sbjct: 90 GRFPSATHVKSNHNIKDAYYTKDLFDVAKEKGYTTAMVGKNH-----SHLTADRVDYWSP 144 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + K + L+ ++ + +R+ + A ++ + Sbjct: 145 YNHGGQESRNKSEKGKAFDRYLGTLDMYASMEPSPYGVEAQLPYRMVDDACHWID--SHK 202 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS-- 238 D+PFLM S EPH+P+ Y + + DL K E ++L A+ M Sbjct: 203 DKPFLMWFSIAEPHNPYQVCEPYYSMFPPESLPEMGSSAKDLNTKGEEYQLLAEMMAQGH 262 Query: 239 -PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLIS 296 ++ Y +DDQ+ R + L +NT +I+ +DHG+ +G + L+ Sbjct: 263 VGYRENLQRLRSNYHGMLRMIDDQLSRFVGELKKNGVYDNTIIIFVADHGDYVGEYGLMK 322 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEILPGENILA 354 KG + D +TRIP+ P + + + VS ID+ PT+ + E P + G ++ Sbjct: 323 KGVGLDDVLTRIPMQWTGPGIKASAIPHNAHVSIIDIFPTICEIIGAEIPMGVQGRSLWP 382 Query: 355 VKEPRGVMVEFNRYEIEHDSFGGFI--------------------------------PVR 382 + + + + R + D FGG +R Sbjct: 383 LLQGKEYPEQEFRSVMAQDGFGGMYYTKVDATDYREEGAVGKKGLFFDELNTWTQSGTMR 442 Query: 383 CWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 D+KLV ++ + +LY+ + DP+E++NL D +F V+++M + LL + DP Sbjct: 443 MLRKGDWKLVYDMNANGQLYNLKADPSELNNLFSDKKFNKVKNEMIEELLRWDISTHDPL 502 >UniRef50_Q7UVD4 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UVD4_RHOBA Length = 510 Score = 334 bits (856), Expect = 6e-90, Method: Composition-based stats. 
Identities = 111/466 (23%), Positives = 194/466 (41%), Gaps = 40/466 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L ++ D +G Y T N+D+LA G+ F+ AY VC P+R+ TG Sbjct: 41 RPNVLLIVADDLNC-AIGPYGDPNAITPNLDALANRGLVFDRAYCQQAVCNPSRSSFLTG 99 Query: 63 IYANQSGP------WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + G + G ++ T+ ++FK+ GY+ IGK + G + Sbjct: 100 LRPTTVGVDDLRKSFRETAPNGASLVTLPQHFKNHGYYCQDIGKIFHN----MGDTQDRQ 155 Query: 117 EWDADYWFDGANYLSE--LTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 W D + ++ + ++L L + + +T +I+ A + Sbjct: 156 SWSMDEVLHAGTHAADTVHSNTPVALRARKLKKAPATETLDVPDTAYRDGQIARLAASVI 215 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG--EKAQDDLANKPEH---- 228 + PF + V + PH PF P +Y + Y E + D+ + H Sbjct: 216 RDYPDDAAPFFLGVGFWRPHLPFVAPKKYWDLYDPDEISSPQLETSPVDVPDIAMHISRE 275 Query: 229 -HRLWAQAMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDH 285 H + + + H Y+A F+D Q+G ++NAL +N T V + SDH Sbjct: 276 LHGYDGIPKEAELSPELKRHLRHGYYASISFLDAQVGLILNALEASGHDNDTIVAFVSDH 335 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADI-- 341 G +G L K + ++ R+PLII P+ +R Q D +DL PT+ +LA I Sbjct: 336 GFHIGEKTLWGK-TSNFELDARVPLIIADPRVDRTQPRTDCLTELVDLYPTLTSLAGIAN 394 Query: 342 EKPEILPGENILAVK-----EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL--- 393 + PE L G+++ ++ + +++ T D++ Sbjct: 395 DLPENLEGDDLSSLLINPNQTLKTAAFTQHQHPFYAPREKWVALGYSVRTADWRYTQWRS 454 Query: 394 ---NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 + ++ELYD RNDPNE N+ ++F DV + +L+ + + Sbjct: 455 IQDHHVIAEELYDHRNDPNESQNVA--VQFPDVVQQHSQSLIKHFN 498 >UniRef50_Q7UGD6 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD6_RHOBA Length = 578 Score = 333 bits (855), Expect = 7e-90, Method: Composition-based stats. 
Identities = 124/512 (24%), Positives = 192/512 (37%), Gaps = 78/512 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPNFLFV+TD Q+ M+GC + T NID LA EGI F+ AY S +CTP+R +F Sbjct: 50 SRPNFLFVLTDDQSYGMMGCDGNELTRTPNIDQLAREGIFFDRAYVTSAICTPSRISIFL 109 Query: 62 GIYANQSGPWTN---NVAPGKNISTMGRYFKDAGYHTCYIGKWHLD-GHDYFGTGECPPE 117 Y + G N +VAP + +D GY+T Y+GK H G D + +G Sbjct: 110 SQYERKHGVNFNSGTSVAPEAWAKSYPVVMRDNGYYTGYVGKNHAPIGKDGYNSGLMEES 169 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 DY++ G ++ + ++ + N E F + AV FL++ Sbjct: 170 --FDYFYAGHGHIRFYPKAVHKIFEGAEYDTQVEIVNEGAEDFLSYEHRLDGAVRFLEER 227 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVE--------YLEKYADFYYELGEKAQDDLANKPEH- 228 AD+PF + + + PH T ++ Y Y D L + K Sbjct: 228 P-ADKPFCLSICLNLPHSAGTGSMQQRESDDDIYKSLYRDIEIPLPKHYVAKDDIKTPRL 286 Query: 229 -----HRLWAQAMPSPVGDDGLYHHPLYFACNDF--VDDQIGRVINALTPEQ-RENTWVI 280 Q + V L + +D IG + L E +NT +I Sbjct: 287 PADVLRASDRQTGYNFVDTPELLKERIIRQMQSLTGIDRLIGNLRTKLETEGVDDNTIII 346 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE----RRQVDTPVSHIDLLPTMM 336 + SDHG MG H L A Y+ T +PLI+ P+ + + V ID+ TM+ Sbjct: 347 FCSDHGLFMGQHGL-GGKALCYEQTTHVPLIVYDPELPTVLKGARCNELVQTIDIAATML 405 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRCWVTDDFKLVLNL 395 LADIE P G+++ + G + + + E + G + +K + Sbjct: 406 DLADIETPATFQGKSMRPLLSGDGGAIRDHVFTENLWVTHFGNPRIEAVQDKRWKYIRYY 465 Query: 396 FTS------------------------------------------------DELYDRRND 407 +EL+D +D Sbjct: 466 RNDRVSASVKIQVAEDLGMKSSLMLYGVHDNEIAVYRNHAEASLRGEEPIHEELFDLESD 525 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 P+E++NLIDD + + R Sbjct: 526 PDELNNLIDDPEVKTQLETLRSVWKQQLTAAR 557 >UniRef50_A9G4Y6 Sulfatase n=1 Tax=Phaeobacter gallaeciensis BS107 RepID=A9G4Y6_9RHOB Length = 508 Score = 333 bits (855), Expect = 7e-90, Method: Composition-based stats. 
Identities = 118/494 (23%), Positives = 201/494 (40%), Gaps = 64/494 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +MTD Q + +GC T +ID+LAA G F + +T +C+P+R+ LF+G+ Sbjct: 3 PNVILIMTDQQRADSLGCTGNPVARTPHIDALAARGAVFRNHFTPHQICSPSRSTLFSGL 62 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL---------DGHDYFGTGEC 114 +A G N VA +++ + KDAGY T GK+H + D E Sbjct: 63 FARHHGLTRNGVALPEHLPLITHDLKDAGYRTHGAGKFHFQPILAGPEHEMPDSNAFWEL 122 Query: 115 P--PEW-DADYWFD--------------GANYLSELTEKEIS---------LWRNGLNSV 148 P W Y FD G +Y + L E G + Sbjct: 123 PQSEGWAGPFYGFDKVDILIGESVSATEGGHYANWLRETAPDAAALYLPENALEPGPEDL 182 Query: 149 EDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYA 208 +++ + I + + I++RA FL++ ++PF + VSY +PHHPF+ P + + Y Sbjct: 183 DEVWKSAIPSELHYNNWITDRACGFLEER-DGEQPFFLFVSYPDPHHPFSPPAPWCDMYD 241 Query: 209 DFYYELGEKAQDDLANKP---------EHHRLWAQ--AMPSPVGDDG------------- 244 D+LA P E + + P P + G Sbjct: 242 PQEVPAPALTADELAAMPSYILDGDREEAGKSYVDFLRNPGPPREQGFMQTTQRFSEASL 301 Query: 245 LYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYD 303 + +D+ IGR++ L + E+T +I+TSDHGE++G H LI KG + Y Sbjct: 302 RQAIAHTYGMVSMIDNCIGRLLAQLEAQGLAEDTLIIFTSDHGELLGDHGLIRKGPSPYR 361 Query: 304 DITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMV 363 + +PL+I P + SH+DL T+ A +E G++ A+ Sbjct: 362 PLLHVPLVIAGPGVAPGTREGVTSHLDLRATLQAHLGLESRRQ-DGQSFQALLTAADACG 420 Query: 364 EFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD--ELYDRRNDPNEMHNLIDDIRFA 421 + Y H + +T+++++ + + E++ DP E NL + Sbjct: 421 RSHLYAEYHPRTRMETYNQTLLTEEWRVTIYPENPEWGEMFHLTEDPGEHVNLFFHPDYT 480 Query: 422 DVRSKMHDALLDYM 435 + + + Sbjct: 481 VQKQAFIEQMDREF 494 >UniRef50_C5BAV0 Sulfatase, putative n=2 Tax=Edwardsiella RepID=C5BAV0_EDWI9 Length = 544 Score = 333 bits (855), Expect = 8e-90, Method: Composition-based stats. 
Identities = 126/504 (25%), Positives = 206/504 (40%), Gaps = 59/504 (11%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + + D A VG Y +NT IDSL A G RF AY P+C P+RA +TG Sbjct: 45 NIVIITADQLARRGVGGYGNPHVNTPAIDSLIARGTRFEQAYCPYPLCAPSRACYWTGRL 104 Query: 65 ANQSGPWTNNVA-PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 +Q+G N+ +++ T+G F AGY + GK H ++ A Sbjct: 105 PHQTGVIANDSPNLPQDMVTLGELFSRAGYECRHFGKRH--------------DYGALKG 150 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ--QPARAD 181 F A+ + EL + + ++ ED+ ++ ++ + AD Sbjct: 151 FTCADQI-ELPYDSPAAYPVDYDTREDVY-------------CLQESLKYIDTLKGRGAD 196 Query: 182 EPFLMVVSYDEPH------HPFTCPVEYLEKYADFYYELGE-KAQDDLANKP-------- 226 PF++ + ++ PH F P ++ L DL N+P Sbjct: 197 APFMLAIEFNNPHNINGWTGAFAGPHGDIDGLGTLPPLLDNFDTSADLPNRPLAIQYACC 256 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 H+R+ A SP+ + + Y+ + D IG+V++AL ++T V++ +DH Sbjct: 257 THNRVMQAANWSPL--NFRQYLKAYYHFTELADGFIGQVLSALRASGHADDTLVVFFADH 314 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKP 344 G+ MGAH+L++K Y++ T +PLI P S DLLPT+ A + P Sbjct: 315 GDAMGAHRLVAKMNWFYEESTNVPLIFAGPGIRPHSSSRHLTSLCDLLPTLCDYAGLTPP 374 Query: 345 EILPGENILAVKEPRGVMVEFNR--YEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-SDEL 401 L G ++L + + + D P R TD +K ++ +EL Sbjct: 375 PGLYGRSLLPILRGEQPDTWRDEVITQWNTDRNVDVQPARMLRTDRYKYIIYKENEEEEL 434 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL-RPWRKDARPRW 460 YD + DP E HNL + R + DY+ DPF S + + R WR P + Sbjct: 435 YDLQQDPGETHNLAHSAEHQEQRQALRARFDDYVRNQIDPFYSQEAIIDRRWRSHL-PGY 493 Query: 461 MGAFRPRP----QDGYSPVVRDYD 480 Q P++ + + Sbjct: 494 HNHQGQTSIQVYQKEIRPLIMNKE 517 >UniRef50_B2T943 Sulfatase n=1 Tax=Burkholderia phytofirmans PsJN RepID=B2T943_BURPP Length = 476 Score = 333 bits (855), Expect = 8e-90, Method: Composition-based stats. 
Identities = 113/444 (25%), Positives = 194/444 (43%), Gaps = 15/444 (3%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK N LF+++D N++GC + T ++D+LA G RF +AYT SP+C PARA L Sbjct: 1 MKPTNVLFILSDEHQHNLMGCAGHPVIKTPSLDALAQRGTRFENAYTPSPICVPARASLA 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG Y + W N +A + ++ +G T IGK H + A Sbjct: 61 TGRYVHDIRCWDNAIAYDGSTPGWAQHLSASGVLTESIGKLHY--KSDASPVGFRRQQHA 118 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + DG + + G + + D + + R+++ A +L + A Sbjct: 119 VHILDGIGQVWGSVRNPMPETM-GRSPLYDKIGPGTSDYNRFDMRVADTACGWLGEHAAD 177 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM--PS 238 D+P+++ V PH P P ++L+ Y +L + A+ M + Sbjct: 178 DKPWVLFVGLVAPHFPLVVPQDFLDLYDPREIDLPLLHPSTGYVRHPWVERQARHMDHDA 237 Query: 239 PVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHKLI 295 +G D + Y+A F+D Q+G+V+ AL ++ T +IY+SDHG+ +G + Sbjct: 238 AIGSDERRRLAVACYYALVSFLDAQVGKVLAALRASGLDDSTTIIYSSDHGDNLGKRGMW 297 Query: 296 SKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADIEKPEILP--GENI 352 +K MY + T +P+I+ P +V PVS ID+ T++ E ++ G+++ Sbjct: 298 NK-CLMYRESTGVPMIVAGPGIPASKVSETPVSLIDIQNTLLECTGCEA-ALIDGPGKSL 355 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 + + + E+ + G +K L EL+D +NDP EM Sbjct: 356 VELACAEDDVGRLAFS--EYHAVGSESAAYMLADSHYKYHHYLGMKPELFDVKNDPEEMR 413 Query: 413 NLIDDIRFADVRSKMHDALLDYMD 436 +L +ADV + L +D Sbjct: 414 DLASLPEYADVLAHFERQLRALLD 437 >UniRef50_Q7UJQ7 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UJQ7_RHOBA Length = 571 Score = 333 bits (855), Expect = 8e-90, Method: Composition-based stats. 
Identities = 118/474 (24%), Positives = 189/474 (39%), Gaps = 49/474 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L ++ D +GCY T NIDSLA G+RF AY VC P+R L Sbjct: 98 ERPNVLLILVDDLKP-ALGCYGDSIAKTPNIDSLANRGMRFEMAYCNQAVCAPSRFTLML 156 Query: 62 GIYANQSGPWTNNVAPGK---NISTMGRYF-KDAGYHTCYIGKWHLDGHDYFGTGECP-- 115 G ++ +G + + + TM ++F K GY T +GK GH G E Sbjct: 157 GSHSTSTGLYGLGSQLRQIIPDAVTMPQHFAKQGGYRTESLGKTFHIGHGNHGDPESFSV 216 Query: 116 PEWDA---DYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN------HIDETFTWAHRI 166 P + +Y + +LT +E L ++ L + R+ Sbjct: 217 PHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKTLPRGAAYESPDAKDEDYADGRV 276 Query: 167 SNRAVDFLQQPARADE----PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 + + LQ + + PF + + PH PF+ P +Y + Y + + Sbjct: 277 AAETIQRLQAAKQRQKTEGTPFFIASGFARPHLPFSAPQKYWDLYDPASLPMPTHETLPV 336 Query: 223 ANKPEHHRLWAQAM--------PSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTP- 271 + + P+ DD L + + Y+A +VD QIG+VI L Sbjct: 337 DAPKVAGKRGGEISNYKPVPTEPNADFDDELKRNLIHGYYASVSYVDAQIGKVIKELDRL 396 Query: 272 EQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHI 329 E +NT V+ DHG +G + +K Y+ RIP++I +P + Sbjct: 397 ELLDNTIVVLWGDHGFHLGDLGIWTKHTN-YEQANRIPILITAPGVTQPGSSTKQLAESV 455 Query: 330 DLLPTMMALADIEK---PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 D+ PT+ LA + P+ + G +++ V + V + Y G R T Sbjct: 456 DIFPTLSELAGLPAPSGPQPIDGVSLVPVLKDSSARVRDHAYHAYPKRQLG----RSIRT 511 Query: 387 DDFKLVLN------LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 + ++LV T+ ELYD + DPNE NL D +V ++ L Y Sbjct: 512 ERYRLVEWKAFDGKGDTAYELYDYQTDPNETKNLASD--RPEVVQRLTKILAKY 563 >UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UH46_RHOBA Length = 490 Score = 333 bits (855), Expect = 8e-90, Method: Composition-based stats. 
Identities = 116/505 (22%), Positives = 187/505 (37%), Gaps = 88/505 (17%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +M D G + T +D+LA EG + Y+ PVC+P RA TG Sbjct: 32 PNIVLMMCDDLGWGDTGFNGNTIIQTPELDALANEGTVLDHFYSVGPVCSPTRASFLTGR 91 Query: 64 YANQSGPWT-NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD---------GHDYFGTGE 113 + + G WT N T+ R K GY T + GKWHL G Sbjct: 92 HYFRMGIWTANKGHLPSQEFTLARMLKTRGYATGHFGKWHLGTLSRTVSAKGKGRRPDLH 151 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW-------AHRI 166 P W+ DY S +TE + W G+ + + T + + Sbjct: 152 YAPPWERDY------DASFVTESAVCTWDPGIGKRARNNPYYENGVATDENVLGCDSRVL 205 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 +RA+ F++ A D+PFL V+ + PH EYL KY Sbjct: 206 MDRALPFIEAAAERDQPFLSVIWFHAPHEDIQAGPEYLAKY------------------- 246 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 +G Y+ C VDDQ+GR+ L +NT + + SD+ Sbjct: 247 ----------------EGHGEAAHYYGCITAVDDQVGRLRKKLASLGVADNTLLFFCSDN 290 Query: 286 GEMMGA-------------HKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHID 330 G G + + ++ D R+P + P V + P+S +D Sbjct: 291 GPEGGEPSNRMKTRRAGSAGEFSGRKRSVLDGGVRVPAFVHWPGQIPAGVRLNAPLSVMD 350 Query: 331 LLPTMMALADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 LLPT+ A+ E +L GEN+L + + E + + F C V Sbjct: 351 LLPTVAAITGAETLPNRLLDGENVLPIWKGEQAQREKS-IPFRYGQFA------CLVRGK 403 Query: 389 FKLVL---NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 KL++ N + D L+D D +E +NL + ++ + M LL +++ + Sbjct: 404 HKLIIESPNDDSKDRLFDLSKDVSESNNLANQK--PELTASMRTELLGFLESAKASHAGE 461 Query: 446 QWSLRPWRKDARPRWMGAFRPRPQD 470 ++ + + +G R Sbjct: 462 EYEGNDTKPVEKWHPLGKAGQRGNA 486 >UniRef50_UPI0000E0F7B6 iduronate 2-sulfatase precursor n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F7B6 Length = 499 Score = 333 bits (855), Expect = 8e-90, Method: Composition-based stats. 
Identities = 103/445 (23%), Positives = 185/445 (41%), Gaps = 27/445 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++ D ++G Y K + NID+LAA+GI F AY PVC +RA + TGI Sbjct: 56 NIVMIIVDDLRP-VLGVYGDKNAYSPNIDALAAQGITFTQAYANVPVCGASRASMLTGIR 114 Query: 65 ANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 N++ K+ ++ + +++GYHT IGK + D +A Sbjct: 115 PNKTRFIDYKAKAQKDAPGAKSLPQVLRESGYHTMGIGKIFHNSKDLAKVSWSEKLQNA- 173 Query: 122 YWFDGANYLSELTEKEISLWR-NGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 A L+ +E + + N + + + + ++ +A+ L + A+ Sbjct: 174 -GMGHATRLNPDSENYLKTTKFNKRGNGPWYETMDVADEAYPDGKVKEKALKALTRLAKQ 232 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY------ELGEKAQDDLANKPEHHRLWAQ 234 ++PF + V + PH PF P +Y + + + A L E H + Sbjct: 233 EQPFFLSVGFIRPHLPFYAPKKYYDLHPREKFSPFFDRNKPRNAPKSLNGSGEIHTYHFK 292 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAHK 293 + Y+A ++D +G VI + R+NT ++ TSDHG +G H Sbjct: 293 DYTYNSDAFHMSSLQGYYASVSYIDALVGDVIAQIDSLGLRDNTTIMLTSDHGFNLGEHN 352 Query: 294 LISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 +K M + RIP+I+ P + + D V +D+ PT+ + + P + G++ Sbjct: 353 FWTKH-TMLETSLRIPMIVAGPNIAKDEKTDALVELVDVFPTITEITKVNPPATVQGQSF 411 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL----FTSDELYDRRNDP 408 + + V + + F V DDF + LYD + DP Sbjct: 412 VKSLQNASVNHK-------KQIYSRFKKGDSVVNDDFIFTSYATAENTIEEMLYDHKVDP 464 Query: 409 NEMHNLIDDIRFADVRSKMHDALLD 433 +E +N++++ R+ V +KM L Sbjct: 465 HETNNVVNEPRYQAVATKMRAQLTA 489 >UniRef50_B5CYA4 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CYA4_9BACE Length = 536 Score = 333 bits (855), Expect = 9e-90, Method: Composition-based stats. 
Identities = 112/507 (22%), Positives = 198/507 (39%), Gaps = 75/507 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPL---NTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 K+ N +++M+D + +G Y + T ID LA +G+ F + + + + TP+RA Sbjct: 30 KQMNVIYIMSDDHTSQAIGAYGSRLAVLNPTPTIDELARDGMLFENCFCTNSISTPSRAC 89 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG Y++++ T + + + F + GY T IGKWHL Sbjct: 90 IMTGQYSHRNKVLTLDEVLQPDQEYLVDEFHNMGYQTAMIGKWHLGCEPSH--------- 140 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 DY+ + + + + + + + N I + + ++N A+D+L+ Sbjct: 141 -FDYYSVFNGHGGQGEYFDPTFLTSDVTD-KKWPNNQIKKMGYSSDIVTNLAIDWLKNRR 198 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----------- 227 +PF M+ Y PH F Y D + D E Sbjct: 199 DKSKPFFMMHHYKAPHDMFEYAPRYEYYLDDVEVPVPLSLFDTDKWGSEGTRGKNDSLRH 258 Query: 228 ----------HHRLWAQAMPSPVGDDG-------LYHHPLYFACNDFVDDQIGRVINALT 270 R + GD+ ++ Y C VDD + R+ + L Sbjct: 259 FIGTSVSSRHEIRNYVMEYKCNTGDEMENTYLAYQHYLKSYLRCVKGVDDNLKRLFDYLK 318 Query: 271 PEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVS 327 E ENT ++YT D G M+G H L K MY++ R+P I+R P+ + D ++ Sbjct: 319 KEGLWENTIIVYTGDQGMMLGEHDLQDK-RWMYEESQRMPFIVRDPRCPYKGAKSDLMIN 377 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGV--MVEFNRYEIEHDSFGGFIPVR-CW 384 +ID PT++ + ++P + G++ +V E + + Y +P Sbjct: 378 NIDFAPTLIEMVGGKEPSYMDGKSFASVFEGKKPENWKDAVYYRYWMHMIHHDVPAHIGI 437 Query: 385 VTDDFKLV----LNLFTSD----------------------ELYDRRNDPNEMHNLIDDI 418 T+++KL+ + ELYD +NDP EM NL D+ Sbjct: 438 RTENYKLILFYGRHYDDKRYGQKSMSWLKNSHKIVPTLVSFELYDVKNDPYEMVNLADNP 497 Query: 419 RFADVRSKMHDALLDYMDKIRDPFRSY 445 ++A V M L + ++ D +Y Sbjct: 498 KYAKVLKDMKKKLRELRKQVGDTDEAY 524 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 333 bits (854), Expect = 1e-89, Method: Composition-based stats. 
Identities = 108/472 (22%), Positives = 199/472 (42%), Gaps = 76/472 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++ D A + G P T N+D A+E I+F+ AY+ S VC PARA L T Sbjct: 54 RRPNVIVILADDLAVGDLAGGDGSPTRTPNLDRFASESIQFSQAYSGSCVCAPARAALLT 113 Query: 62 GIYANQSGPWTNNV-------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G Y +++G T N+ ++ +T+ KDAGY T +GKWH D F + Sbjct: 114 GRYPHRTGVVTLNMNRYPEMTRLRRDETTIADVLKDAGYATGLVGKWHTGRGDGFHPLD- 172 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 +D F G++ ++ +R + E Q + +DE++ ++ RA++F+ Sbjct: 173 -RGFDEFEGFFGSD--------DVGYFRYPFS--EQRQISDVDESY-LTDDLNRRAIEFV 220 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 ++ + PF + +++ PH P P E + +Y + ++ Sbjct: 221 RR--HHEHPFFLHLAHYAPHRPLEAPPEVIARYREQGFD--------------------- 257 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE--MMGA 291 +A + +D IG ++ + E+T V++ SD+G + G Sbjct: 258 -----------ESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFASDNGPDPLTGE 306 Query: 292 H---KLISKGAAMYDDITRIPLIIRS-PQGERRQVDTPVSHIDLLPTMMALADIEKP--E 345 +L + + R+PL +R + Q D V+ +DL+PT++ L ++ Sbjct: 307 RFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDLMPTILDLCRVDVSMLN 366 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-------FTS 398 L GE+ + V E + R+ + + + +KLV S Sbjct: 367 RLDGESFVPVLEDASIAHSTMRFWQWNRASPNYTHNAAVRHGRYKLVRPYVTRGAKLKDS 426 Query: 399 DE---LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR-DPFRSYQ 446 E L+D +NDP E ++ ++ D+ +M L + + D R + Sbjct: 427 TEPSVLFDLQNDPTESRDVSK--QYPDIAERMSRELDRWSASVETDRIRPVK 476 >UniRef50_D2R575 Sulfatase n=4 Tax=Bacteria RepID=D2R575_9PLAN Length = 511 Score = 333 bits (854), Expect = 1e-89, Method: Composition-based stats. 
Identities = 115/492 (23%), Positives = 186/492 (37%), Gaps = 49/492 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L ++ D +GCY T ++D LA G+RF AY VC P+R L G Sbjct: 28 RPNVLMILVDDLKP-ALGCYGDPVAQTPSLDKLAERGMRFERAYCNQAVCAPSRFTLMLG 86 Query: 63 IYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE---CPP 116 ++ +G + + + T+ +YF GY T +GK GH G + P Sbjct: 87 SHSTSTGLYGLGSQLRQFIPDAVTLPQYFAKHGYRTESLGKVFHIGHGNQGDPDSFSVPH 146 Query: 117 EWDA--DYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN------HIDETFTWAHRISN 168 D +Y + +LT +E L + L +D+ R+++ Sbjct: 147 FHDKVIEYLDPASTDGGKLTREEAFFTNQRLGEIGSLPRGAAFEAPDVDDLQYADGRVAS 206 Query: 169 RAVDFLQQPA----RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + L+ + PF + + + PH PF P +Y + Y + E Q + Sbjct: 207 ETIKRLRAAKQLRDQEGTPFFIAIGFARPHLPFCAPKKYWDLYDRAKLPMPEFEQLPMNA 266 Query: 225 KPEHHRLWAQ-AMPSPVGDDGLYHHP---------LYFACNDFVDDQIGRVINALTPEQR 274 P + + + PV ++G Y+A FVD QIG+V+ L Sbjct: 267 PPVAGKRGGEISNYKPVPENGKAEFSDELKRNLIHGYYASMSFVDAQIGKVLEELNASGL 326 Query: 275 -ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDL 331 NT V+ DHG +G + +K Y+ RIP++I +P +D+ Sbjct: 327 AGNTIVVLWGDHGFHLGDLGIWTKHTN-YEQANRIPIVIVAPGVTQPGTATKQLAESVDI 385 Query: 332 LPTMMALADIEK---PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 PT+ LA + P+ + G +++ V + V + Y + G R TD Sbjct: 386 FPTLAELAGLPAPSGPQPIDGVSLVPVLKDSSARVRDHAYHAYPKAKLG----RAIRTDR 441 Query: 389 FKLVLNLFTS-------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ++LV ELYD DP E NL +V + L Y + Sbjct: 442 YRLVEWRAIGAAPESAVYELYDYDADPLERENLAAKQ--PEVVESLKMTLAKYPQPVLGT 499 Query: 442 FRSYQWSLRPWR 453 R P + Sbjct: 500 PRGAAKDRSPEK 511 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 332 bits (853), Expect = 1e-89, Method: Composition-based stats. 
Identities = 104/455 (22%), Positives = 171/455 (37%), Gaps = 67/455 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + ++ D VGCY + + T N+D+LA +G RF ++ P+CTP RA L T Sbjct: 17 QPPNVILIVADDMGYGDVGCYGNQHIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLT 76 Query: 62 GIYANQSG--------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 G Y + G + A T K GY T +GKWHL F Sbjct: 77 GCYQQRVGLHIIPKDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLGDRPAF---- 132 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI----DETFTWAHRISNR 169 PP D +F ++ WR + ++ I + + Sbjct: 133 LPPRQGFDEYFGIP------YSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEE 186 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 AV F+ D PFL+ + + PH P + ++++ Sbjct: 187 AVKFI--SKNKDRPFLLYMPHPMPHQPVHVSERFAKRFSK-------------------- 224 Query: 230 RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEM 288 + + + G+D LY A + +D +G +I A+ E+T+V +TSD+G Sbjct: 225 ----EQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVRALGIEESTFVAFTSDNGPA 280 Query: 289 MGAHK-LISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKP- 344 +G+ L K +++ R+P I + R V D +DL PTM A+ P Sbjct: 281 IGSAGPLRGKKRELWEGGHRVPFIAYWQEKIRPGVVIDEIAMSMDLFPTMAAMGRAPLPR 340 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL-----VLNLFTSD 399 + + G N+L + + E F + +KL TS Sbjct: 341 KKIDGVNLLPLLCEGDKLSE-------RTVFWRSKGKKAARKGPWKLLMQPTKKKRPTSI 393 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 LY ND +E HNL + + + + + Sbjct: 394 GLYHLNNDLSEQHNLAE--IYPEKLKSLQLEFAAW 426 >UniRef50_A6C2T4 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C2T4_9PLAN Length = 493 Score = 332 bits (853), Expect = 1e-89, Method: Composition-based stats. 
Identities = 112/476 (23%), Positives = 183/476 (38%), Gaps = 63/476 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +MTD +GCY + + T +ID LA EG F A+ + VC+P RA T Sbjct: 31 QRPNVVIIMTDNHGEWTLGCYGNQDIKTPHIDQLAKEGTLFTRAFANNAVCSPTRASFLT 90 Query: 62 GIYANQSGPWT----------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G+ Q G ++ + ++ + DAGY GKWHL + Y Sbjct: 91 GLMPCQHGVHCFLRTRIQTGPDSFNTLEEFQSIPQVLHDAGYVCGLSGKWHLGDNLY--- 147 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P+ YW + S + + E ++ E + + Sbjct: 148 ----PQEGFSYWITKPHGGS------AGFYDQNVIENEKIRK----EPTYLTDLWTQHGI 193 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYL-EKYADFYYELGEKAQDDLANKPEHHR 230 F++Q ++PF + ++Y+ P+ + E + ++ Y ++ + +P Sbjct: 194 RFIKQ--NQEKPFFLFLAYNGPYGLGSAMKEPIRNRFKAEYEKMTFPSFPREKAQP---- 247 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMM 289 W +GD G+ Y A VDD +G+++ L RENT VI+T+D G Sbjct: 248 -WNFNYGDWIGDLGIIRK--YAAEVSAVDDGVGQIMQTLKDLGLRENTLVIFTADQGLSG 304 Query: 290 GAHKLISKG-----AAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALAD-- 340 G G +D IPLI P + D V++ D+ PT++ Sbjct: 305 GHSGYWGMGDHTRPLTAFDWTMTIPLIFSQPGKIVSGARQDMMVANYDVYPTLLNYLGLQ 364 Query: 341 --IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-LNLFT 397 I PG N V + + + + F VR T D+K + + Sbjct: 365 DKIPAKPATPGRNFAPVLKGEQIPWDEVVFY-------EFENVRAIRTKDWKYIERYRES 417 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWR 453 +ELY D E N ID R ++ L + K DP QW + W+ Sbjct: 418 PNELYHLVTDSREHRNRIDQPASKQTRKELKQRLDQFFSKYADP----QWDI--WK 467 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 332 bits (851), Expect = 2e-89, Method: Composition-based stats. 
Identities = 96/464 (20%), Positives = 163/464 (35%), Gaps = 66/464 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + + D +GCY K + T N+D LAA+G+RF Y+ VC P+R + TG Sbjct: 17 RPNVIVFLADDLGYGELGCYGQKKIKTPNLDQLAADGMRFTDFYSGHAVCAPSRCVMLTG 76 Query: 63 IYANQSGPWTNN----------------------VAPGKNISTMGRYFKDAGYHTCYIGK 100 + S N+ +A + +T + +GY T +GK Sbjct: 77 KHTGHSFVRENSEGRAAQAKERNRIKAADGYLPQIALPASEATYASALQKSGYRTACVGK 136 Query: 101 WHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 W L G+ P + D ++ + LWRN + + + + Sbjct: 137 WGLGHPSNEGS---PNKHGFDLFYGYISQWQAHYYYPTYLWRNDVKEPLEGNDGKVGRQY 193 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD 220 A + A+ F++ PF + + PH P + L E Q Sbjct: 194 -AADLMEQEALKFMETT--GGGPFFLYYATPVPHVSLQVPPD--------EPSLAEYKQA 242 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWV 279 P + + +Y A +D +G+ + L ++ NT + Sbjct: 243 FAGQDPPYDGRKSYLPTEDP-------RAIYAAMVTRMDRTLGKFRDLLKRTGQDQNTLI 295 Query: 280 IYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPV-S 327 I+TSD+G G L ++D R P I P + QV V + Sbjct: 296 IFTSDNGATFNGGYDREFFGGNQPLRGMKTQLWDGGIRTPFIAAWPGSIQPGQVSRFVGA 355 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 DL PT + P L G +IL + + + + GG VR Sbjct: 356 SWDLFPTFAEIVGFPVPAGLDGVSILPTLKGEVATQKQHDHLYWETVAGGHQAVRM---G 412 Query: 388 DFKLVL-----NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSK 426 +K + N +L++ D +E ++ + D+ +K Sbjct: 413 PWKGIRLGVIKNPSAPVQLFNLETDVSETTDVAA--QHPDIVAK 454 >UniRef50_Q5LH37 Putative sulfatase n=16 Tax=Bacteroides RepID=Q5LH37_BACFN Length = 483 Score = 332 bits (851), Expect = 2e-89, Method: Composition-based stats. 
Identities = 130/475 (27%), Positives = 202/475 (42%), Gaps = 56/475 (11%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +F+M D + +GC +P+ T ++D LA+EGI F +A + PV +PAR L TG+ Sbjct: 28 PNLVFIMADQYRGDAIGCIGKEPVKTPHLDKLASEGINFTNAISSYPVSSPARGMLMTGM 87 Query: 64 YANQSGPWTN--------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH------DYF 109 Y S N V +N KD GY+ YIGKWHLD Y Sbjct: 88 YPIGSKVTGNCNSETAPYGVELSQNARCWSDVLKDQGYNMGYIGKWHLDAPYKPYVDTYN 147 Query: 110 GTGE------CPPE--WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 G+ CPPE D+W Y L + W N + Sbjct: 148 NRGKVAWNEWCPPERRHGFDHWIAYGTYDYHL---KPMYWNTTAPRDSFYYVNQWGPEYE 204 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQD 220 +++A++++ +PF +VVS + PH + P Y E Y D E K + Sbjct: 205 -----ASKAIEYINGQKDQKQPFALVVSMNPPHTGYELVPDRYKEIYKDLDVEALCKGRP 259 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWV 279 D+ K + +GD + Y+AC VD+ +GR+I AL +NT V Sbjct: 260 DIPAK-----------GTEMGDYFRNNIRNYYACITGVDENVGRIIEALKQNNLFDNTIV 308 Query: 280 IYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTP--VSHIDLLPTMMA 337 ++TSDHG MGAH+ K Y++ RIP+I+ P + + P ++ DL PT+++ Sbjct: 309 VFTSDHGICMGAHENAGK-DIFYEESMRIPMILSWPDQIKPRKSDPLMIAFADLYPTLLS 367 Query: 338 LADI--EKPEILPGENIL-AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + E PE + ++ V + Y R TD + ++ Sbjct: 368 MMGFSKEIPETVQTFDLSNEVLTGKNKKDLVQPYYFVKFDN-HATGYRGLRTDRYTYAVH 426 Query: 395 LFTSD----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 L+DR NDP+EM+N+ + + + L +++K DPF Y Sbjct: 427 ATDGKIDNVILFDRTNDPHEMNNIAS--QQLKLTHTFNRQLKTWLEKTNDPFAQY 479 >UniRef50_B5JCS9 Sulfatase, putative n=2 Tax=Bacteria RepID=B5JCS9_9BACT Length = 495 Score = 331 bits (850), Expect = 3e-89, Method: Composition-based stats. 
Identities = 105/457 (22%), Positives = 181/457 (39%), Gaps = 24/457 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCS----PVCTPARA 57 ++ + LF+ D + +G + T NID LA EG F +AY VC +R Sbjct: 35 QKSDILFIFADDLSFEAIGASGNAVVQTPNIDRLADEGTYFTNAYNMGSWTPAVCLASRT 94 Query: 58 GLFTGIYANQSGPWTNNVAPGKNIST----MGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 + TG + + + G++ + +AGY T GKWH+ Sbjct: 95 MINTGR--SVWHAQDLHKSFGQSHESRPILWSERMANAGYETYLTGKWHVSVAPEDVFDY 152 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 + E + I + + + + W+ +++ A DF Sbjct: 153 VRHVRPGMPLDSFPDQRPEGYHRPIEGKPDPWSPYDRSKGGFWSGGTHWSEVVASDAEDF 212 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----HH 229 + Q + PF M ++++ H P P E++++Y+ + E Q + E Sbjct: 213 MLQASGRTGPFFMYLAFNAAHDPRQSPKEFVDRYSAESIPIPENYQPIYPYREEMGSGEG 272 Query: 230 RLWAQAMPSPVGDDGLY-HHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 P P + + H Y+A +D QIGR++ AL R+ T++ +T+DHG Sbjct: 273 LRDEILAPFPRTEYAVRVHRSEYYAVISHLDAQIGRILKALEASGRRDRTYIFFTADHGL 332 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVD-TPVSHIDLLPTMMALADIEKPEI 346 G H L+ K MY+ + PLI+ P Q PV D++ T + LA EKPE Sbjct: 333 ACGHHGLLGK-QNMYEHSMKAPLIVLGPGLPGGQRRKAPVYIQDIMATTLELAGAEKPEG 391 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRR 405 L E++L + + + + R KL+L S L+D Sbjct: 392 LEFESLLPLIGDPALNGKHGNIYGAYTDK-----QRMIRVGSLKLILYPEASRLRLFDLA 446 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 +DP+E+ +L D R+ + LL +++ D Sbjct: 447 SDPDEIKDLAGDPRYWSSIRSLFAELLQKQEQLSDTL 483 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 331 bits (850), Expect = 3e-89, Method: Composition-based stats. 
Identities = 105/464 (22%), Positives = 177/464 (38%), Gaps = 58/464 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L + TD Q N VGCY + + T +ID LA EG+ F Y+ S +CTP+R G+ T Sbjct: 26 ERPNILIIFTDDQGINDVGCYGSE-IPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILT 84 Query: 62 GIYANQSGPW-----------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 G +S N +T+ + GY T +GKWHL GH Sbjct: 85 GRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHL-GHGTES 143 Query: 111 TGECPPEWDADYWFDGA--NYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 +D G +Y + +T I W + H+ E I+ Sbjct: 144 FLPTAHGFDLFRGHTGGCIDYFT-MTYGNIPDWYHNQR--------HVSENGYATDLITE 194 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 A FL+ D+PF + +SY+ PH + + + + + + + Sbjct: 195 EAEHFLKDQQTTDKPFFLFLSYNAPH------------FGKGWSPGDQSPVNIMQARGDD 242 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG- 286 + VG + A +DD IGRV+++L + NT VI+ +DHG Sbjct: 243 LKR--------VGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGG 294 Query: 287 ---EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADI 341 A +++ R+P IIR P + + +DL PT+ A++ Sbjct: 295 DYVYGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFANV 354 Query: 342 EKPE-ILPGENILAVKEPRGVM-VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 + L G++I + + + +++ + D+K + + + Sbjct: 355 DTDGLTLDGKDISGLLTRQTPVGTRELYWQLGPHAELKRGRWSALRQGDWKYIQDAGGEE 414 Query: 400 ELYDRRNDPNEMHNLIDDI-----RFADVRSKMHDALLDYMDKI 438 L+D + DP E NL + R + L + I Sbjct: 415 FLFDLKADPYEKQNLTQSQSTKLTELQERRDTLVKTLTPQVKSI 458 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P31447 Uncharacterized sulfatase yidJ n=52 Tax=Enteroba... 604 e-171 UniRef50_B0N997 Putative uncharacterized protein n=1 Tax=Clostri... 491 e-137 UniRef50_C9L4Q0 Putative sulfatase YidJ n=2 Tax=Blautia hansenii... 
484 e-135
UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep...  475   e-132
UniRef50_C6D1Q0 Sulfatase n=2 Tax=Paenibacillus sp. JDR-2 RepID=...  475   e-132
UniRef50_B9XGI2 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XGI...  472   e-131
UniRef50_B9XEU8 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XEU...  470   e-131
UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase...  468   e-130
UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT        467   e-130
UniRef50_A4AWR8 Iduronate-2-sulfatase n=5 Tax=Bacteria RepID=A4A...  462   e-128
UniRef50_C5EHR5 Putative uncharacterized protein n=1 Tax=Clostri...  460   e-128
UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep...  458   e-127
UniRef50_A6DPE5 Iduronate-2-sulfatase n=2 Tax=Lentisphaera arane...  456   e-126
UniRef50_B7AMH4 Putative uncharacterized protein n=1 Tax=Bactero...  452   e-126
UniRef50_D2QWC7 Sulfatase n=5 Tax=Bacteria RepID=D2QWC7_9PLAN        451   e-125
UniRef50_C6DK82 Sulfatase n=3 Tax=Pectobacterium RepID=C6DK82_PECCP  450   e-125
UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D...  450   e-125
UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase...  450   e-125
UniRef50_C6VXD1 Sulfatase n=4 Tax=Bacteria RepID=C6VXD1_DYAFD        449   e-125
UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD  449   e-125
UniRef50_C6J5I7 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 7...  449   e-124
UniRef50_A6DG38 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisph...  449   e-124
UniRef50_Q7UJ67 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal...  447   e-124
UniRef50_C4G6V3 Putative uncharacterized protein n=1 Tax=Abiotro...  446   e-124
UniRef50_B5JJG3 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac...  446   e-124
UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS...  446   e-123
UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero...  445   e-123
UniRef50_B5JYP8 Choline-sulfatase n=1 Tax=Octadecabacter antarct...  445   e-123
UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto...  445   e-123
UniRef50_Q7UH28 Mucin-desulfating sulfatase (N-acetylglucosamine...  445   e-123
UniRef50_B9XND0 Sulfatase n=3 Tax=Bacteria RepID=B9XND0_9BACT        443   e-123
UniRef50_A6DNI8 Putative N-acetylglucosamine-6-sulfatase n=1 Tax...  442   e-122
UniRef50_A6C9F6 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris...  442   e-122
UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari...  442   e-122
UniRef50_B4D6H3 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428...  441   e-122
UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE        440   e-122
UniRef50_Q7UMT6 Mucin-desulfating sulfatase (N-acetylglucosamine...  440   e-122
UniRef50_C6J2Z0 Sulfatase n=4 Tax=Firmicutes RepID=C6J2Z0_9BACL      440   e-122
UniRef50_C6J5I8 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 7...  439   e-121
UniRef50_A6CFT9 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae ...  439   e-121
UniRef50_A6DLX7 Putative sulfatase n=1 Tax=Lentisphaera araneosa...  439   e-121
UniRef50_A6C8U0 Choline sulfatase n=1 Tax=Planctomyces maris DSM...  439   e-121
UniRef50_A6CBG2 Mucin-desulfating sulfatase (N-acetylglucosamine...  438   e-121
UniRef50_B8FL44 Sulfatase n=1 Tax=Desulfatibacillum alkenivorans...  438   e-121
UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN...  438   e-121
UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428...  436   e-121
UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob...  436   e-121
UniRef50_A6DME6 Sulfatase family protein n=1 Tax=Lentisphaera ar...  436   e-121
UniRef50_C6CXF5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=...  435   e-120
UniRef50_A6DKS7 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisph...  435   e-120
UniRef50_C7MEQ7 Choline-sulfatase n=1 Tax=Brachybacterium faeciu...  435   e-120
UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep...  435   e-120
UniRef50_C6J3H9 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 7...  434   e-120
UniRef50_A4CMA4 Mucin-desulfating sulfatase (N-acetylglucosamine...  434   e-120
UniRef50_A3HTC6 Choline sulfatase n=5 Tax=Bacteria RepID=A3HTC6_...  434   e-120
UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ...  434   e-120
UniRef50_UPI0001C36159 sulfatase n=2 Tax=Clostridium hathewayi D...  433   e-120
UniRef50_Q7UW58 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal...  432   e-119
UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6...  432   e-119
UniRef50_B4D780 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428...  432   e-119
UniRef50_C0QY53 Sulfatase n=2 Tax=Brachyspira RepID=C0QY53_BRAHW     432   e-119
UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy...  431   e-119
UniRef50_A6LF65 Choline-sulfatase n=26 Tax=Bacteroidales RepID=A...  430   e-119
UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ...  430   e-119
UniRef50_Q482B9 Sulfatase family protein n=1 Tax=Colwellia psych...  430   e-119
UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R...  430   e-119
UniRef50_A6DM50 Choline sulfatase n=6 Tax=Bacteria RepID=A6DM50_...  430   e-119
UniRef50_D2MLH4 Sulfatase family protein n=1 Tax=Candidatus Pori...  429   e-119
UniRef50_A7A9X1 Putative uncharacterized protein n=2 Tax=Parabac...  429   e-119
UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ...  429   e-118
UniRef50_D2R1A1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep...  429   e-118
UniRef50_Q029P1 Sulfatase n=1 Tax=Candidatus Solibacter usitatus...  429   e-118
UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta...  428   e-118
UniRef50_A6C1R0 Choline sulfatase n=1 Tax=Planctomyces maris DSM...  428   e-118
UniRef50_A4U8Q3 Sulfatase n=2 Tax=Bacteria RepID=A4U8Q3_9BACT        427   e-118
UniRef50_UPI0001744DD5 choline sulfatase n=1 Tax=Verrucomicrobiu...  427   e-118
UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS...  427   e-118
UniRef50_C3WCE8 Arylsulfatase n=2 Tax=Fusobacterium RepID=C3WCE8...  427   e-118
UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS...  427   e-118
UniRef50_C0S8M2 Choline sulfatase n=8 Tax=Eurotiomycetidae RepID...  425   e-117
UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Ta...  425   e-117
UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdema...  425   e-117
UniRef50_B4D026 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428...  425   e-117
UniRef50_A4GIB2 Putative secreted sulfatase n=1 Tax=uncultured m...  424   e-117
UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7...  424   e-117
UniRef50_D2RQH7 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5...  424   e-117
UniRef50_Q7NMX5 Gll0640 protein n=1 Tax=Gloeobacter violaceus Re...  424   e-117
UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8...  424   e-117
UniRef50_A4A280 Iduronate-2-sulfatase n=1 Tax=Blastopirellula ma...  424   e-117
UniRef50_UPI0001745B0B sulfatase n=1 Tax=Verrucomicrobium spinos...  423   e-117
UniRef50_A3P379 Choline-sulfatase n=63 Tax=cellular organisms Re...  423   e-116
UniRef50_A6DSH1 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane...  423   e-116
UniRef50_A6CG48 Sulfatase family protein n=1 Tax=Planctomyces ma...  422   e-116
UniRef50_C5BVK2 Sulfatase n=11 Tax=Actinomycetales RepID=C5BVK2_...  421   e-116
UniRef50_A3HTC7 Putative uncharacterized protein n=1 Tax=Algorip...  421   e-116
UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T...  421   e-116
UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto...  420   e-116
UniRef50_UPI000051016C choline-sulfatase n=1 Tax=Brevibacterium ...  420   e-116
UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC...  420   e-116
UniRef50_Q7UZ92 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal...  420   e-116
UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID...  420   e-116
UniRef50_A3SJ21 Sulfatase n=1 Tax=Roseovarius nubinhibens ISM Re...  419   e-116
UniRef50_A5FX90 Sulfatase n=4 Tax=Alphaproteobacteria RepID=A5FX...
419 e-115 UniRef50_A0JVM4 Sulfatase n=2 Tax=Actinomycetales RepID=A0JVM4_A... 419 e-115 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 419 e-115 UniRef50_A6DPD0 Sulfatase family protein n=1 Tax=Lentisphaera ar... 418 e-115 UniRef50_A4AP83 Putative sulfatase n=1 Tax=Flavobacteriales bact... 418 e-115 UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera... 418 e-115 UniRef50_Q7UWE8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 417 e-115 UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 417 e-115 UniRef50_Q01PN7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 417 e-115 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 417 e-115 UniRef50_Q7WC54 Putative sulfatase n=3 Tax=Proteobacteria RepID=... 416 e-114 UniRef50_A6DJJ1 Sulfatase family protein n=1 Tax=Lentisphaera ar... 416 e-114 UniRef50_A3HWG3 Choline sulfatase n=1 Tax=Algoriphagus sp. PR1 R... 415 e-114 UniRef50_A6DNH0 Choline sulfatase n=1 Tax=Lentisphaera araneosa ... 415 e-114 UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW... 415 e-114 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 415 e-114 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 415 e-114 UniRef50_C0G116 Sulfatase n=1 Tax=Natrialba magadii ATCC 43099 R... 415 e-114 UniRef50_C9L4R5 Mucin-desulfating sulfatase n=1 Tax=Blautia hans... 415 e-114 UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 415 e-114 UniRef50_B1KD82 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 414 e-114 UniRef50_B5CWC2 Putative uncharacterized protein n=1 Tax=Bactero... 414 e-114 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 414 e-114 UniRef50_Q0TUK6 Arylsulfatase n=9 Tax=Bacteria RepID=SULF_CLOP1 414 e-114 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 414 e-114 UniRef50_A6DMZ1 Sulfatase n=3 Tax=Lentisphaera araneosa HTCC2155... 
414 e-114 UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W9... 414 e-114 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 414 e-114 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 414 e-114 UniRef50_C7MF96 Arylsulfatase A family protein n=1 Tax=Brachybac... 413 e-114 UniRef50_D0DCV9 Choline-sulfatase n=2 Tax=Citreicella sp. SE45 R... 413 e-114 UniRef50_A6DJ72 Mucin-desulfating sulfatase (N-acetylglucosamine... 412 e-113 UniRef50_Q1IH24 Choline sulfatase n=29 Tax=cellular organisms Re... 412 e-113 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 412 e-113 UniRef50_B6HPN7 Pc22g01020 protein n=15 Tax=Eukaryota RepID=B6HP... 412 e-113 UniRef50_Q46P27 Sulfatase n=3 Tax=Proteobacteria RepID=Q46P27_RALEJ 412 e-113 UniRef50_C6IGG0 Iduronate 2-sulfatase n=2 Tax=Bacteroides RepID=... 412 e-113 UniRef50_Q01ZJ7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 412 e-113 UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepI... 412 e-113 UniRef50_C6XTA2 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 411 e-113 UniRef50_C6D448 Sulfatase n=2 Tax=Bacteria RepID=C6D448_PAESJ 411 e-113 UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 411 e-113 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 411 e-113 UniRef50_UPI00017453D4 choline sulfatase n=1 Tax=Verrucomicrobiu... 411 e-113 UniRef50_B6A548 Choline-sulfatase n=1 Tax=Rhizobium leguminosaru... 411 e-113 UniRef50_A6DG71 Mucin-desulfating sulfatase (N-acetylglucosamine... 411 e-113 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 410 e-113 UniRef50_A6DGT7 Sulfatase family protein n=1 Tax=Lentisphaera ar... 410 e-113 UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomy... 410 e-113 UniRef50_A0LK86 Sulfatase n=1 Tax=Syntrophobacter fumaroxidans M... 410 e-113 UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 
410 e-113 UniRef50_O69787 Choline-sulfatase n=53 Tax=Alphaproteobacteria R... 410 e-113 UniRef50_C1ZIM5 Arylsulfatase A family protein n=2 Tax=Planctomy... 410 e-113 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 409 e-113 UniRef50_UPI0001C35931 N-acetylgalactosamine 6-sulfate sulfatase... 409 e-112 UniRef50_B4D0V9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 409 e-112 UniRef50_Q1GMK9 Choline sulfatase n=8 Tax=Alphaproteobacteria Re... 409 e-112 UniRef50_A6L183 Iduronate 2-sulfatase n=11 Tax=Bacteroides RepID... 409 e-112 UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 409 e-112 UniRef50_C5BWB0 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 409 e-112 UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR4... 409 e-112 UniRef50_A6DFZ4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 409 e-112 UniRef50_B5CYA4 Putative uncharacterized protein n=1 Tax=Bactero... 409 e-112 UniRef50_A4A047 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A4A... 409 e-112 UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_B... 408 e-112 UniRef50_D0Z4S7 Iduronate sulfatase n=1 Tax=Photobacterium damse... 407 e-112 UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TK... 407 e-112 UniRef50_UPI00016C500A sulfatase n=1 Tax=Gemmata obscuriglobus U... 407 e-112 UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 407 e-112 UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera aran... 407 e-112 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 407 e-112 UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7... 407 e-112 UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomy... 407 e-112 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 407 e-112 UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 407 e-112 UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 
405 e-111 UniRef50_A0JVP0 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0... 405 e-111 UniRef50_UPI0001C35789 arylsulfatase n=1 Tax=Clostridium hathewa... 405 e-111 UniRef50_D2R575 Sulfatase n=4 Tax=Bacteria RepID=D2R575_9PLAN 405 e-111 UniRef50_A6DNH1 Choline sulfatase n=2 Tax=Lentisphaera araneosa ... 404 e-111 UniRef50_C1ZIS7 Arylsulfatase A family protein n=1 Tax=Planctomy... 404 e-111 UniRef50_A6DIH4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera arane... 404 e-111 UniRef50_B7ACM6 Putative uncharacterized protein n=1 Tax=Bactero... 404 e-111 UniRef50_C7MHR6 Arylsulfatase A family protein n=3 Tax=Bacteria ... 404 e-111 UniRef50_UPI00016C0B39 choline sulfatase n=1 Tax=Epulopiscium sp... 404 e-111 UniRef50_D2QL61 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 403 e-111 UniRef50_D2R201 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 403 e-111 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 403 e-111 UniRef50_Q7UJQ7 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula bal... 403 e-111 UniRef50_B2UNV9 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 403 e-111 UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT 403 e-110 UniRef50_Q7UER7 Sulfatase 1 n=8 Tax=Bacteria RepID=Q7UER7_RHOBA 402 e-110 UniRef50_C0W1U3 Sulfatase n=1 Tax=Actinomyces coleocanis DSM 154... 402 e-110 UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 402 e-110 UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirel... 402 e-110 UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=... 402 e-110 UniRef50_C6XY33 Sulfatase n=5 Tax=Bacteroidetes RepID=C6XY33_PEDHD 402 e-110 UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE 402 e-110 UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3... 402 e-110 UniRef50_Q5LRB5 Choline sulfatase n=1 Tax=Ruegeria pomeroyi RepI... 401 e-110 UniRef50_B4D433 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 
401 e-110 UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD 401 e-110 UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 401 e-110 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 401 e-110 UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomy... 400 e-110 UniRef50_A6C1Q0 N-acetylgalactosamine 6-sulfate sulfatase n=1 Ta... 400 e-110 UniRef50_C1ZIW1 Arylsulfatase A family protein n=6 Tax=Bacteria ... 400 e-110 UniRef50_Q127E2 Sulfatase n=1 Tax=Polaromonas sp. JS666 RepID=Q1... 400 e-110 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 400 e-110 UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 399 e-110 UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 399 e-109 UniRef50_B9XJI6 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XJI... 398 e-109 UniRef50_C6VXD2 Sulfatase n=7 Tax=Bacteria RepID=C6VXD2_DYAFD 398 e-109 UniRef50_C5BYA8 Sulfatase n=2 Tax=Micrococcineae RepID=C5BYA8_BEUC1 398 e-109 UniRef50_C9L4R7 Putative sulfatase YidJ n=1 Tax=Blautia hansenii... 398 e-109 UniRef50_C6LAI4 Arylsulfatase n=6 Tax=Bacteria RepID=C6LAI4_9FIRM 397 e-109 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 397 e-109 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 397 e-109 UniRef50_C6VYV1 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 180... 397 e-109 UniRef50_C7PRW9 Sulfatase n=1 Tax=Chitinophaga pinensis DSM 2588... 397 e-109 UniRef50_A6DR18 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC... 397 e-109 UniRef50_A6C2T4 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 397 e-109 UniRef50_C5HLB2 Putative sulfatase n=1 Tax=uncultured bacterium ... 397 e-109 UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bac... 397 e-109 UniRef50_D2R203 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 397 e-109 UniRef50_Q7UVD9 N-acetylgalactosamine 6-sulfate sulfatase n=1 Ta... 
397 e-109 UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9... 396 e-109 UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacte... 396 e-109 UniRef50_A6DHY1 Mucin-desulfating sulfatase n=1 Tax=Lentisphaera... 396 e-109 UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 396 e-108 UniRef50_D1AX15 Sulfatase n=2 Tax=Fusobacteriaceae RepID=D1AX15_... 396 e-108 UniRef50_C9L4I6 Arylsulfatase n=1 Tax=Blautia hansenii DSM 20583... 395 e-108 UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 395 e-108 UniRef50_B1KD77 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 395 e-108 UniRef50_A6DI98 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 395 e-108 UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 395 e-108 UniRef50_UPI0000E0F7B6 iduronate 2-sulfatase precursor n=1 Tax=G... 395 e-108 UniRef50_A7LY81 Putative uncharacterized protein n=5 Tax=Bactero... 395 e-108 UniRef50_A6DKC5 Putative sulfatase yidj n=1 Tax=Lentisphaera ara... 395 e-108 UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI... 394 e-108 Sequences not found previously or not previously below threshold: UniRef50_UPI00016C0ED5 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. ... 440 e-122 UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT 438 e-121 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 417 e-115 UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 409 e-112 UniRef50_C6VRQ8 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 180... 408 e-112 UniRef50_A6DFU7 Mucin-desulfating sulfatase (N-acetylglucosamine... 407 e-112 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 396 e-108 UniRef50_D2QCX4 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 396 e-108 >UniRef50_P31447 Uncharacterized sulfatase yidJ n=52 Tax=Enterobacteriaceae RepID=YIDJ_ECOLI Length = 497 Score = 604 bits (1559), Expect = e-171, Method: Composition-based stats. 
Identities = 497/497 (100%), Positives = 497/497 (100%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF Sbjct: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA Sbjct: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA Sbjct: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV Sbjct: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA 300 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA Sbjct: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAA 300 Query: 301 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG Sbjct: 301 MYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360 Query: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF Sbjct: 361 VMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRF 420 Query: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD 480 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD Sbjct: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYD 480 Query: 481 TGLPTQGVKVEEKKQKF 497 TGLPTQGVKVEEKKQKF Sbjct: 481 TGLPTQGVKVEEKKQKF 497 >UniRef50_B0N997 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0N997_EUBSP Length = 495 Score = 491 bits (1265), Expect = e-137, 
Method: Composition-based stats. Identities = 210/504 (41%), Positives = 293/504 (58%), Gaps = 21/504 (4%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ +F+MTDT +MVGCY + T N+D LA EGIR+ +AYTC PVC PAR+ +F Sbjct: 1 MKKQ-VIFLMTDTTRKDMVGCYGNPKMKTPNLDRLAEEGIRYENAYTCQPVCGPARSAIF 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG + + +G TN++A G N+ T+G+ + G YIGKWHLDG DYFG G CP WD Sbjct: 60 TGTFPHTNGMVTNSIAMGDNVKTIGQRLHNHGISCGYIGKWHLDGSDYFGNGRCPEGWDP 119 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 +YW+D YL ELT++E R+ +D E FT+AHR S+RA+ +L+ Sbjct: 120 EYWYDMKTYLDELTDEEKVRSRDPKECYKDG----FSEEFTYAHRCSDRAIKYLENHQDE 175 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 D F + VSYDEPH P CP + + F +E QDDL+ KP RLW+ Sbjct: 176 D--FFLSVSYDEPHGPSLCPEPFNHMFDGFKFESCPNFQDDLSKKPFMQRLWSGKNLHAT 233 Query: 241 GDDGLYH---HPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISK 297 D+ L+ CN F D +IGRV++ + + VI+TSDHG+M+GAH+L SK Sbjct: 234 EDEINQPSDGLSLFLGCNSFADYEIGRVLDKIREV-APDALVIFTSDHGDMLGAHRLFSK 292 Query: 298 GAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 AA Y ++ IPLII+ D SHID+ PT++ + P++L G+++L + Sbjct: 293 NAAAYKEVANIPLIIKG-GERGYVEDAMASHIDIAPTILDYFGLPIPKLLEGKSMLPQIK 351 Query: 358 PRGVM------VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 EF RYEI+HD FGG +R ++ +KLV++L +DE YD NDP EM Sbjct: 352 NPEKEINDVVFTEFTRYEIDHDGFGGLQIMRAVMSKRYKLVIHLLDTDEFYDLENDPYEM 411 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRW--MGAFRPRPQ 469 +NLI+D ++ + R+ +HD L+ +M+ RD +R YQWS+RPWR D P W G R R Sbjct: 412 NNLIEDKKYIEERNALHDKLIQHMNDTRDLYRGYQWSMRPWRTDFIPDWENEGYTRQREN 471 Query: 470 DGYSPVVRDYDTGLPTQGVKVEEK 493 + Y P DYDTGLP + V +K Sbjct: 472 EEYEPRQLDYDTGLPMEEA-VRKK 494 >UniRef50_C9L4Q0 Putative sulfatase YidJ n=2 Tax=Blautia hansenii DSM 20583 RepID=C9L4Q0_RUMHA Length = 505 Score = 484 bits (1246), Expect = e-135, Method: Composition-based stats. 
Identities = 212/506 (41%), Positives = 301/506 (59%), Gaps = 22/506 (4%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ +F+MTDTQ T+M+GCY + T N+D LAAEGIR++ AYT PVC PAR+ +F Sbjct: 1 MKKRQVIFIMTDTQRTDMLGCYGNSAMVTPNLDRLAAEGIRYDKAYTTQPVCQPARSAIF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG Y + W+N + N+ ++G+ DAG HT Y+GKWHLDG DYFG G CP WD Sbjct: 61 TGSYPHSCAGWSNCMGLSDNVQSIGQRLSDAGIHTAYVGKWHLDGGDYFGLGRCPKGWDE 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 DYW+D +L ELT +E R + +E ++ +I E T+ HR ++RAVDF+++ Sbjct: 121 DYWYDMKCFLDELTPEE----RYRIRQIESIEKYNITEDMTYGHRCADRAVDFIEKHKDE 176 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 D + +V+S DEPH P CP +Y++ Y D+ + E +D L +KPEHHR+WA Sbjct: 177 D--YFLVMSLDEPHGPHICPKKYVDLYKDYEIPVKENMKDTLEDKPEHHRIWAGDEYLKA 234 Query: 241 -GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHKLISKG 298 +D + CN F D +IGRV++A Q E+ +IYTSDHG+MM H L KG Sbjct: 235 CREDFKLSPKEFLGCNTFADYEIGRVLDA--AAQYEDEPIIIYTSDHGDMMYGHSLTGKG 292 Query: 299 AAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEP 358 A+Y++IT IPL+I+ + PVSHI+L PT+ + + P++ G +I + Sbjct: 293 PALYEEITHIPLMIK--GFGKGVDKNPVSHINLAPTIFDMFGVPIPKMFEGRSIFEEVKN 350 Query: 359 RGVM------VEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 V +EF RYE++HD FGG+ P+R +K+V+NL TSDELYD + DP EM Sbjct: 351 PEVRCNDYVFMEFGRYEVDHDGFGGYQPLRGAFDGRYKMVINLMTSDELYDLQEDPQEMK 410 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR-WMGAF--RPRPQ 469 NLI++ + ++R ++H+A+LD M + RDPFR Y W RPW + + W R R Sbjct: 411 NLINEPGYDEIRKRLHEAILDNMYETRDPFRGYYWEDRPWNRITEYKTWDSRLMTRQREN 470 Query: 470 DGYSPVVRDYDTGLPTQGVKVEEKKQ 495 + Y P DY TGLP V +K Q Sbjct: 471 EEYEPRQLDYGTGLPMTSA-VRKKGQ 495 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 475 bits (1224), Expect = e-132, Method: Composition-based stats. 
Identities = 110/492 (22%), Positives = 181/492 (36%), Gaps = 65/492 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F + D +GCY T NID LAA+G RF AY PVC+P RA + T Sbjct: 35 QQPNIVFFLVDDLGQRDLGCYGSTFYETPNIDKLAADGARFTQAYAACPVCSPTRASILT 94 Query: 62 GIYANQSGPWT---------------NNVAPGK--------NISTMGRYFKDAGYHTCYI 98 G++ ++G N + + T+ + K AGY T + Sbjct: 95 GLWPQRTGITDYIATDNSNGPAKWNRNTMTLPAAYRDRLALDSPTLAKSLKSAGYATFFA 154 Query: 99 GKWHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 GKWHL ++ P D G K+ + + + Sbjct: 155 GKWHLGPEGFY-----PENQGFDINRGGIERGGPYGGKQYF------SPYGNPRLTDGPA 203 Query: 159 TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA 218 R++ F++ A +PF S+ H P + +KY Sbjct: 204 GEHLPDRLATETCQFIE--AHQKQPFFAYFSFYSVHTPLQAREDLRQKY--------VAK 253 Query: 219 QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENT 277 ++ L KP R + + + H +Y A D +D +G+V+ L ENT Sbjct: 254 REKLGLKPTWGREHMRDV------RQVQEHAVYAAMVDAMDQAVGKVLAKLDELGLRENT 307 Query: 278 WVIYTSDHGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSH 328 VI+TSD+G + + L MY+ R PL++R P +DTPVS Sbjct: 308 LVIFTSDNGGLSTSEGWPTSNLPLRGGKGWMYEGGIREPLVMRWPAKVKAGSTIDTPVSS 367 Query: 329 IDLLPTMMALADIEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 D + T++A + E + G ++L + + + H G P Sbjct: 368 PDFMATLLAATATKPAEQQQIDGVSLLPLLAGEKLKERSLFWHYPHYGNQGGAPAAAIRR 427 Query: 387 DDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 +KL+ L EL++ D +E NL + +M L + ++ Sbjct: 428 GSWKLIEWLEDGQVELFNLATDESETTNLASKE--PALVREMLAELHAWQKEVGAILPEK 485 Query: 446 QWSLRPWRKDAR 457 + P + R Sbjct: 486 NPNYDPAKPSGR 497 >UniRef50_C6D1Q0 Sulfatase n=2 Tax=Paenibacillus sp. JDR-2 RepID=C6D1Q0_PAESJ Length = 480 Score = 475 bits (1223), Expect = e-132, Method: Composition-based stats. 
Identities = 131/480 (27%), Positives = 211/480 (43%), Gaps = 31/480 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L++ TD Q + +G Y + +NT +ID LAAEG+ F A+ SPVCTP+RA Sbjct: 1 MKKPNILWICTDQQRQDTLGAYGNQWVNTPHIDRLAAEGVLFEQAFCQSPVCTPSRASFL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG--HDYFGTGECPPEW 118 TG Y + N + + + + GY GK HL E + Sbjct: 61 TGRYPRTTRCRANGQDIPADEKLISKLLSEEGYICGLAGKLHLSACHPSVNKGTERRIDD 120 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH--------IDETFTWAHRISNRA 170 D +F + +E E + W G + D + +A Sbjct: 121 GFDQFFWSHHPNAEWPTNEYTQWLKGKGKTFSPRPFENSPYVNCGPDAEDHQTTWCAEKA 180 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD--FYYELGEKAQDDLANKPEH 228 V F++ + + P+ +V+ +PHHPF P EYL++Y D L + +L NKP + Sbjct: 181 VQFIETNSDYERPWFFLVNLFDPHHPFDPPKEYLDRYLDRLDEIPLPNYEEGELENKPVY 240 Query: 229 HRLWAQAMP---------SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTW 278 R+ D Y+A D +DDQ+GR++++L +NT Sbjct: 241 QRIDRDGAYGMRGHLAASDMSERDHRLIRAAYWAMCDLIDDQVGRMLDSLERSGQLDNTI 300 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMA 337 V++ SDHGE++G H + KG YD R+PLI+R P R++ + V DL PT++ Sbjct: 301 VVFMSDHGELLGDHGMYLKGPHFYDCSVRVPLIVRGPGIHGGRRIASLVELADLAPTLLE 360 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV-------RCWVTDDFK 390 + I + G+++ + +G +R ++ +S+ P TD K Sbjct: 361 ASQIPTYTGMQGKSLWPILLNKGEDAPNHREDVYCESYDANFPHGDLRAWATMVRTDSHK 420 Query: 391 LVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 LVL + ELYD DP E N +D + V+ ++ L + M DP + + + Sbjct: 421 LVLYHNDNSGELYDLLADPKENRNAWNDHAYTSVKFELMQRLCNRMALTVDPLPARKAAW 480 >UniRef50_B9XGI2 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XGI2_9BACT Length = 515 Score = 472 bits (1216), Expect = e-131, Method: Composition-based stats. 
Identities = 114/464 (24%), Positives = 190/464 (40%), Gaps = 51/464 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN LF+M D A + +G Y K T NIDS+A G+RF++ + + +CTP+RA + TG Sbjct: 61 RPNILFIMADDHAAHAIGAYGSKINQTPNIDSIAKAGMRFDNCFVVNSICTPSRAAILTG 120 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y++ +G N N T+ + + AGY+T +GKWHL+ DY Sbjct: 121 KYSHINGVTVFN-RFDGNQPTVAKMLQAAGYYTGMVGKWHLESDPT----------GFDY 169 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 W I + N I++ +++FL+ D+ Sbjct: 170 WNVLPGQGKYHDPDFIEM------------GNRKKIEGYATEIITDLSINFLKNRP-QDK 216 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA-------------NKPEHH 229 PF ++ + PH P+ ++ + Y D E DD ++ Sbjct: 217 PFFLMCHHKAPHRPWEPDEKHAKMYEDVTIPEPETFNDDYKTRSSAATEATMRIDRDLTP 276 Query: 230 RLWAQAMPSPVG------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYT 282 + QA P + + Y C VDD +GR++ L ENT VIYT Sbjct: 277 KDLKQAPPPGLAGEALKKWKYQRYIKDYLRCIASVDDNVGRLLKFLDDSGLAENTIVIYT 336 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALAD 340 SD G +G H K MY++ R+P I+R P + + ++D PT + A Sbjct: 337 SDQGFFLGDHNWFDK-RFMYEESLRMPFIVRYPNHIKPATVNKDMILNVDFAPTFLQCAG 395 Query: 341 IEKPEILPGENILAVKEPRGVMVEF---NRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLF 396 +E P+ + G +IL + + + + + P ++ +KL+ N Sbjct: 396 LEVPKEIQGRSILPLLQGKAPKDWRTSMYYRYYHYPADHRVQPHYGVRSERYKLIYFNKI 455 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 E YD + DP+E+ N+ D +A + + ++ D Sbjct: 456 NEWEFYDLKRDPHELKNVYADPAYAKEVQRAKAEMERLRKELND 499 >UniRef50_B9XEU8 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XEU8_9BACT Length = 456 Score = 470 bits (1211), Expect = e-131, Method: Composition-based stats. 
Identities = 111/461 (24%), Positives = 178/461 (38%), Gaps = 46/461 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN +F++ D + GC + T NID +A EG F + + P+C+P+R T Sbjct: 16 SQPNIVFILVDDIRWDAFGCMGHPFVKTPNIDRIAKEGALFKNFFVTLPLCSPSRGSFLT 75 Query: 62 GIYANQSGPWTNNVA--PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G YA+ +G N + T R DAGY T ++GKWH+ D P Sbjct: 76 GQYAHVNGVTNNGEHSTLSHQLVTFPRLLHDAGYETSFVGKWHMGTDDT-------PRPG 128 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D+W N ++ +++RAV+F++Q Sbjct: 129 FDHWLSFKGQGVY------------ENPNLNIDGKVSRVEGYITDILNSRAVEFVKQ--E 174 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS- 238 +PF + V + H PFT + E Y DDLA KP R Q Sbjct: 175 HKKPFCLYVGHKAVHGPFTPAERHKELYTKEQIPHPPSIDDDLAGKPVLTRKEQQGPKDG 234 Query: 239 --------------PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTS 283 P+G +D+ +G+++ AL Q ENT +I+TS Sbjct: 235 QKPQKVGFDDEAERPMGKVPERLVRQQLRTLMAIDEGVGQLLRALEESRQLENTVIIFTS 294 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADI 341 D+G G H L Y++ R PL+IR P+ D V +ID+ PT++ LA Sbjct: 295 DNGYFWGEHHL-GDKRWAYEESIRDPLLIRYPKLIKPGTVRDQMVLNIDIAPTLLELAHA 353 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLVLN--LFT 397 + G +++ + V + + + T+ +K + L Sbjct: 354 PVSRSMQGRSLVPLFNKDSVEWRKSALFEYFQEKAYPRTPTWQAIRTEQWKYIHYTELEG 413 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 DELY+ + D EM NLI + ++ L + + Sbjct: 414 MDELYNLKADSYEMKNLIKEQSARSSLQELKSELGKLLKQT 454 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 468 bits (1206), Expect = e-130, Method: Composition-based stats. 
Identities = 110/528 (20%), Positives = 171/528 (32%), Gaps = 98/528 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D +GCY T NID +A +G+RF Y PVC+P RA + TG Sbjct: 22 KPNVVLIVIDDLGQRDLGCYGSTFYKTPNIDRMAKDGVRFTDFYAACPVCSPTRASIMTG 81 Query: 63 IYANQSGPWT-----------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG 105 Y + G T+ K GY T +IGKWHL G Sbjct: 82 KYPQRVGITDWLPGRKDLPGQRLKRPELKNELALEEVTVAETLKGHGYVTAHIGKWHLGG 141 Query: 106 HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 G P + D G + + L+ + G R Sbjct: 142 K-----GFEPEKQGFDVNVAGDHTGTPLSYFAPFANKAGATMP---GLEKAAPDEYLTDR 193 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 ++ A F+ A D+PF + + + H P P ++KY Sbjct: 194 LAAEAETFI--TANKDKPFFLYLPHYGVHTPLRAPQPLVDKYKTQAV------------- 238 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSD 284 G +P+Y A + +D +GRV+ L + +NT V++TSD Sbjct: 239 -----------------HGRQSNPVYAAMVESMDAAVGRVLKRLDDLKLSDNTLVLFTSD 281 Query: 285 HGEMMG----------AHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLL 332 +G + L +Y+ R+PLI + P +D ID Sbjct: 282 NGGLATLEGMPFAPTINAPLREGKGYLYEGGVRVPLIAKWPGKVKPGTVMDQVACSIDFF 341 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV 392 T++ G +++ + + H + G P ++KLV Sbjct: 342 DTILEATGATSAARRDGVSLVPAFGGEKLKPRALYWHYPHYANQGSRPGGAVRAGNYKLV 401 Query: 393 LNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 EL+D D +E NL D DV + L + + Sbjct: 402 EYYEDGRRELFDVAKDLSESRNLAADK--PDVVKDLAAKLDAWRTDV------------- 446 Query: 452 WRKDARPRWMGAFRPRPQDGYSPVVRDYDTGLPTQ--GVKVEEKKQKF 497 GA P P Y P D D + V +F Sbjct: 447 ----------GAKMPTPNPDYRPNPPDKDGAITLHARTALVTGTMLRF 484 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 467 bits (1204), Expect = e-130, Method: Composition-based stats. 
Identities = 103/479 (21%), Positives = 172/479 (35%), Gaps = 60/479 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPNF+F++ D VG T N+D LA EG+RF AY VC+P RA + TG Sbjct: 38 RPNFVFILADDLGWKDVGFNGSTFYETPNLDRLAREGMRFTDAYAACSVCSPTRASIMTG 97 Query: 63 IYANQSGPWT-----------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG 105 Y + T+ + ++ GY T +IGKWHL G Sbjct: 98 KYPARLHLTDWLPGRPDKPDQILKHPKIITELPAAEITLAKALQEGGYKTAFIGKWHLGG 157 Query: 106 HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 G P + D G + + ++ A R Sbjct: 158 L-----GHWPEQAGFDINIGGCGMGHPSSY---------FSPYKNPTLKDGPVGEYLADR 203 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 +++ AV F++ PFL+ +S+ H P +EKY +L + + Sbjct: 204 LTDEAVKFIENT--KGTPFLLYLSHYSVHTPLQAKKGLIEKYQKKVMQLPPTKGPEFVTE 261 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSD 284 + + P+Y A +D+ +GRV++ L + NT +I+TSD Sbjct: 262 GN------------TNARQVQNQPIYAAMMQSLDESVGRVLDKLKELGLDKNTVIIFTSD 309 Query: 285 HGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTM 335 +G + A L + Y+ R PL+++ P D V D PT+ Sbjct: 310 NGGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPTL 369 Query: 336 MALADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV- 392 + +A + + L G + + + + H S G P D+KL+ Sbjct: 370 LEIAGLPARPEQHLDGISFTPALRGKEMGERPLFWHYPHYSNQGGAPSSSIRKGDWKLIE 429 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 EL++ R D E ++L R ++ L + ++ + P Sbjct: 430 WYEENRIELFNLRLDVGEKNDLASTSAL--KREELKSELQAWRASVKADMPLPNPNFDP 486 >UniRef50_A4AWR8 Iduronate-2-sulfatase n=5 Tax=Bacteria RepID=A4AWR8_9FLAO Length = 498 Score = 462 bits (1189), Expect = e-128, Method: Composition-based stats. 
Identities = 105/468 (22%), Positives = 199/468 (42%), Gaps = 26/468 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++ D T V Y +NT +ID LA+EG+ F Y+ PVC P+RA + Sbjct: 39 KKPNVLFIIADDLTTTAVSSYGNSEVNTPHIDKLASEGVLFTRTYSQYPVCGPSRASFMS 98 Query: 62 GIYANQS---GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKW-HLDGH----DYFGTGE 113 G Y + + G + G T + FKD GY+T + K H+ + Sbjct: 99 GYYPSATTTYGYVSGRKNIGSERKTWSQVFKDNGYYTARVSKIFHMGVPIDIEKGSNGQD 158 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQ-----ANHIDETFTWAHRISN 168 W + G + + + + +G ++ D+ + + Sbjct: 159 DEQSWTERFNSQGPEWKAPGAGELVQGNPDGTLPIKGGNVMTIVKADGDDLVHSDGKTAE 218 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +A + +++ D+PF + + + PH PF P Y E Y +L +K ++D + P+ Sbjct: 219 KASELIRK--HKDKPFFLAIGFVRPHVPFVAPKSYFEPYPHNQTKLPKKVENDWDDIPKR 276 Query: 229 HRLWAQA-MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 + + + Y+A ++D Q+G+V+ L E +NT V++TSDHG Sbjct: 277 GINYVTSVNGKMNTEQEKKAIAAYYASVSYMDAQVGKVLKTLKEEGLEDNTIVVFTSDHG 336 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEI 346 +G H+ +++++ ++PLII+ P + + +DL PT+ ALA ++ + Sbjct: 337 FHLGEHEFW-MKVSLHEESVKVPLIIKVPGKKPAVCHSFTELLDLYPTITALAGLKYSDQ 395 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT---SDELYD 403 L GE+++ + + V + + +D+ + EL+D Sbjct: 396 LQGESLVNILDEPTYEVRDMAFSVSQGGKSFL-----LRNEDWAYIQYDEDAASGIELFD 450 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 + DP + NL +A + + L + +R + +SL+ Sbjct: 451 MKKDPKQFTNLAQLPEYASIVDSFKEKLKTKLKAVRSNDLNIDYSLKK 498 >UniRef50_C5EHR5 Putative uncharacterized protein n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EHR5_9FIRM Length = 490 Score = 460 bits (1185), Expect = e-128, Method: Composition-based stats. 
Identities = 121/483 (25%), Positives = 205/483 (42%), Gaps = 43/483 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + M D + +GCY ++T +IDSLA G F++ + +PVC+P+R + T Sbjct: 4 RRPNIILFMCDQLRFDALGCYGNNQIHTPHIDSLALNGSTFDNHFVQNPVCSPSRCTVLT 63 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y G N + + T+ +D GY T IGK H+ E + + Sbjct: 64 GRYPKNHGTRDNGIPLRDSEITLAETLRDNGYRTAAIGKMHITTQFVPKEDEQEDWPEDN 123 Query: 122 YWFDGANYLSELTEKEISLW----------------------------RNGLNSVEDLQA 153 Y FD + + E W + Sbjct: 124 YGFDIIHTTCDCKTGEYLDWLKAASPEDYEEVKMQGERKAKEDRASAADKDTGGPPQVYP 183 Query: 154 NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 + I+ ++ +H I++R +D +++ D+PF S+ +PHHPF P Y + Y E Sbjct: 184 SGINPSYHQSHWIADRMIDLIEESG-PDQPFFAYCSFVDPHHPFDPPKPYGDMYDPDALE 242 Query: 214 LGEKAQDDLANKPEHHR---------LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGR 264 + + + +L +KP H R Y+ +DD IGR Sbjct: 243 VPVRMEGELLDKPPHFRKALTARGFSNEKYDYRKLTDHQWGQVKAAYYGMITLIDDNIGR 302 Query: 265 VINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQ 321 ++NAL E +T +++T+DHGE++G H L+ KG YD I + P+II+ P + + Sbjct: 303 ILNALRENGLEKDTLILFTNDHGELLGDHGLLFKGPFHYDCIIKAPMIIKWPGVVPQGSR 362 Query: 322 VDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV 381 H+D++PT++ A + P + G ++ + + E + + + V Sbjct: 363 YSQVTEHVDIMPTLLEYAGVRPPYGVQGCSMAPILRGDKGAGKEYA-MTEFNCYDWGLSV 421 Query: 382 RCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 + ++KL ELYDR DP E NL DD + V++ M L+D + + D Sbjct: 422 KTLTGRNYKLTYYAGEEYGELYDRNLDPEEFKNLWDDEAYGAVKAYMMKKLMDRIIETED 481 Query: 441 PFR 443 P Sbjct: 482 PLP 484 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. 
Identities = 105/487 (21%), Positives = 188/487 (38%), Gaps = 52/487 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRP+ + + D N + CY + T ++D +A EG RF AY +PVC+P RA + Sbjct: 238 KRPHIILFLIDDLGWNDIACYGSQFYETPHLDKMAKEGFRFTDAYAANPVCSPTRASILL 297 Query: 62 GIYANQSGPWTNNVA------------------PGKNISTMGRYFKDAGYHTCYIGKWHL 103 G Y ++ G ++ + T+ K+ GY T +IGKWHL Sbjct: 298 GKYPSRVGLSNHSGSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHL 357 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 H P + D G + + + S E Sbjct: 358 QAHHDTSRNHFPEKHGFDLNIAG-HRMGQPGSFYFPYKSKQHPSTNVPDMADGQEGDYLT 416 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 +++++A+ ++++ D PF + Y H P + +KY ELG Sbjct: 417 DKLTDKAIHYIKE--NKDTPFFLNFWYYTVHTPIIPRQDLKKKYEAKANELGINKNQPGI 474 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYT 282 + +Q PS Y A + +D+ IGR+ L Q ++ T +I+ Sbjct: 475 PVLKSFARSSQNNPS------------YAAMVEAMDENIGRIFKTLKELQIDDETIIIFC 522 Query: 283 SDHGEMMGAHK---------LISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLL 332 SD+G + + L + A +Y+ RIP II+ P + + + PV D+ Sbjct: 523 SDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFIIKWPGKKGGKELQAPVCTTDIY 582 Query: 333 PTMMALADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGG---FIPVRCWVTD 387 PT++ + + + L G ++ ++ + ++ I + + P Sbjct: 583 PTLLDMLKLPAKPEQHLDGVSLTSLMNGQAKELQREALFIHYPHYHHINSMGPAGAVRMG 642 Query: 388 DFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 D+KLV T + ELY+ + D EM+NL+ + ++M L + + P Sbjct: 643 DYKLVEYYETGEFELYNLKEDIGEMNNLVK--EQPERAAQMLKKLEQWRQQSNSPKPERN 700 Query: 447 WSLRPWR 453 P + Sbjct: 701 PHYDPQK 707 >UniRef50_A6DPE5 Iduronate-2-sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPE5_9BACT Length = 487 Score = 456 bits (1174), Expect = e-126, Method: Composition-based stats. 
Identities = 107/454 (23%), Positives = 184/454 (40%), Gaps = 28/454 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N LF+ D +G Y + T N+D LA G F+ AY P+C P+RA + +G Sbjct: 20 KMNVLFISADDLNC-DIGPYGNTQVKTPNLDRLARMGTVFDRAYCQQPLCGPSRASIMSG 78 Query: 63 IYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE- 117 + N G WT N N+ TMG +F+ GY++ +GK +H Y GT E Sbjct: 79 LRPNTLGVWTLNSKLRGRIPNLVTMGEFFQKQGYYSGRVGKIYHYGNPTYIGTNGNDDEQ 138 Query: 118 -WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID----ETFTWAHRISNRAVD 172 W + G + E + G + D + +++RA+ Sbjct: 139 TWTERFNPKGIDRTQEENIIRYPGGKTGKKGGLGISMAWWDPVSKDNEHTDGLVADRAIK 198 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYA--DFYYELGEKAQDDLANKP---- 226 ++ A D+PF + + PH P+ P +Y + Y D + E+A+ +LA+ P Sbjct: 199 MIE--ANKDKPFFIAAGFFNPHCPYVAPKKYFDMYDINDIELQELEEAKQELADVPAMAI 256 Query: 227 --EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 + + W D+ Y+A F+D Q+GR+ AL + T +++ S Sbjct: 257 QRDAGQRWPYFYKGLTRDEAKQCKLAYYATVSFIDAQVGRIFEALEKNNLMDKTIIVFWS 316 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIE 342 DHG +G L K ++ R PL+I +P + + +PV +D+ PT++ + Sbjct: 317 DHGYFLGEKGLWFK-RKAFERSARAPLLIAAPGLSKGQVCKSPVELLDIYPTLVEATGFQ 375 Query: 343 KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--SDE 400 P L G ++ + + H G T ++ E Sbjct: 376 IPSELEGVSLSPLLKNAQTKWTKPAITQIHH--GADKQGYSIRTKKWRYTEWNKGQAGKE 433 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 LY+ DP E NL + + +++ L + Sbjct: 434 LYNHETDPEETINLATNPEHTQIVAQLSTELQKF 467 >UniRef50_B7AMH4 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AMH4_9BACE Length = 520 Score = 452 bits (1165), Expect = e-126, Method: Composition-based stats. 
Identities = 102/489 (20%), Positives = 203/489 (41%), Gaps = 55/489 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +K N +F+++D + + L T +D +A EG +A+ + + +P+RA + Sbjct: 39 VKPRNVVFILSDDHRYDYMVFLGTIPWLETPCMDRMAREGAYIQNAFVTTSLSSPSRASI 98 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG+Y++ NN ++ Y + AGY T + GKWH+ P+ Sbjct: 99 LTGLYSHTHKVVDNNAPLPDGLTFFPEYLQAAGYETAFFGKWHMGN------DTGEPQPG 152 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 +W + + ++ +++ A+DF+++ + Sbjct: 153 FTHWEGIRGQGVYWNP----------EININGKWKEFKDSTYLGDLLTDHAIDFIREQKK 202 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN--------------- 224 AD+PF + +S+ H PF P Y YA+ L ++ Sbjct: 203 ADKPFFVYLSHKGVHDPFQAPKRYEGCYANKKVPLPTSFENPHYGITPTPNKSVQTGKPL 262 Query: 225 ----------KPEHHRLWAQAMPSPV-----GDDGLYHHPLYFACNDFVDDQIGRVINAL 269 KP+ ++ ++ + Y VD+ IGRVI++L Sbjct: 263 SGVDYYGEQMKPDWVKMQRESWHGVDFCYNGRRNWEEEVRKYCETLRAVDESIGRVIDSL 322 Query: 270 TPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPV 326 + NT VIY D+G G H LI K Y+ R+P++IR+P + + + V Sbjct: 323 QEMGLDENTVVIYMGDNGFCWGEHGLIDK-RQFYEASVRVPMLIRAPGLFPAGQVLKSMV 381 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHD--SFGGFIPVRCW 384 ++D+ PT+++ A ++KP + GE+ + + + + + + + + + Sbjct: 382 QNVDIAPTILSCAGLDKPAQMVGESYIPLLQGKEIPWRNRIFYEYYWEHEYPQTPTMHGV 441 Query: 385 VTDDFKLVLNL--FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 TD++K + + ++E YD DP+E+ N I D + D+ ++ L D+++ F Sbjct: 442 RTDNYKYIRYHGIWDTNEFYDLNEDPSELQNRIADPEYQDIIKQLDADLYDWLETTNGMF 501 Query: 443 RSYQWSLRP 451 + ++RP Sbjct: 502 IPLKRTVRP 510 >UniRef50_D2QWC7 Sulfatase n=5 Tax=Bacteria RepID=D2QWC7_9PLAN Length = 490 Score = 451 bits (1161), Expect = e-125, Method: Composition-based stats. 
Identities = 111/458 (24%), Positives = 183/458 (39%), Gaps = 27/458 (5%) Query: 2 KRP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+P N L + +D N +GCY + + ID LAA G RF+ AY P+C P+R+ Sbjct: 32 KKPYNVLLIASDDL-NNSLGCYGHATVKSPRIDELAARGTRFDRAYCQFPLCNPSRSSFL 90 Query: 61 TGIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPP 116 TG+ +Q+ N +I T+ + F +AGY+ +GK +H GT Sbjct: 91 TGLRPDQTTVHDNARKFRSERPDIVTLPQMFMNAGYYVARVGKLYHYGVPLQIGTSGLDD 150 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 E + + K SL A + + A+ L+ Sbjct: 151 EPSWQQVVNPRGRDRDDEPKIFSLVPGQFGGTPSWLAAEGTDDEQTDAIGAAEAIKLLE- 209 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 A ++PF + V + PH P+ P Y EKY + + D + PE A Sbjct: 210 -ANKEKPFFLAVGFYRPHTPYVAPKSYFEKYPADKIPIVTTPEGDRRDIPEPAVSQHSAR 268 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLI 295 + YFA F+D Q+G++++AL + +NT V++ SDHG +G H + Sbjct: 269 HNMNEKLQREATQAYFASITFMDQQVGKLLDALDRLKLRDNTIVVFLSDHGYHLGEHGGL 328 Query: 296 SKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 + +++++ R+PLII P ID+ PT+ L ++ P LPG+++ Sbjct: 329 WQKQSLFEESARVPLIISVPGQKHAGEGTAAVAELIDIYPTLADLCGLKAPANLPGQSLR 388 Query: 354 AVKEPRGVMVEFNRYEIEHDSFG-------------GFIPVRCWVTDDFKLVLNLFTSD- 399 E + G TD ++L + Sbjct: 389 PQIEDPQAPGKGFAITQVRRGGNPGGAKAGKKNPPAGGFAGYSLRTDKYRLTIWGEEGAK 448 Query: 400 --ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 ELYD + DP E NL D A+ +++ L ++ Sbjct: 449 GLELYDHQTDPQEYTNLASDPSKAETITELKALLAKHL 486 >UniRef50_C6DK82 Sulfatase n=3 Tax=Pectobacterium RepID=C6DK82_PECCP Length = 564 Score = 450 bits (1159), Expect = e-125, Method: Composition-based stats. 
Identities = 119/517 (23%), Positives = 202/517 (39%), Gaps = 35/517 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN ++++ D Q + G K + T ++D +A G F +A+ + +C+P+RA + T Sbjct: 39 QRPNIVYILLDDQRYDAFGFI-NKNIQTPHMDEIAKNGTWFKNAFVTTSLCSPSRASILT 97 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+Y + G NN ++ K+ GY T + GKWH G DY D Sbjct: 98 GMYVHNHGVSDNNPTDLSKLNYFPEKLKERGYQTGFFGKWHFGGADYTAKAGFA---GFD 154 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 W G + ++ + + +++ AV++L Sbjct: 155 RWVGLLGQGDYYPINMF-----GEQAKLNIDGKMVPQKGYITDELTDYAVNWLD-GIDKK 208 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD---DLANKPEHHRLWAQAM-- 236 +PF+M +S+ H F + + + L E D + KP + + Sbjct: 209 KPFMMYLSHKGVHSDFYPAIRHKGSMDKVTFPLPETYADTPENYEGKPMWVKNQRNSWHG 268 Query: 237 ---PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAH 292 P D Y+ VDD +GRV L + NT V+ D+G G H Sbjct: 269 VDYPYNKKMDMQQFQRDYYETLRSVDDSVGRVQEWLKKNGLDKNTIVMVMGDNGFTFGEH 328 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 LI K + Y+ R+PLI P G+ V+ V++ID+ PT + A EKP+ G + Sbjct: 329 GLIDK-RSAYETSMRVPLIASGPGFGKGDVVEDLVANIDIAPTFLEAAGAEKPKNYDGNS 387 Query: 352 ILAVKEPRGVMVEFNRY----EIEHDSFGGFIPVRCWVTDDFKLVLNL--FTSDELYDRR 405 L +K + + Y F T ++K + + +ELYD + Sbjct: 388 FLNIKSDKEKQAKRKDYFAYEYFWEYDFPYTPTTFAIRTPEYKYIQYYGIWDKEELYDMK 447 Query: 406 NDPNEMHNLIDDIRFA--DVRSKMHDALLDYMDKIRD----PFRSYQWSLRPWRKDARPR 459 NDP+E NLID + + + L + D P+ + +R + Sbjct: 448 NDPDEKQNLIDSKDKKLIETKIALRKQLYMELKDHDDRNVIPYNQRTKEGQVFRYQETGK 507 Query: 460 WMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEEKKQK 496 M F P RD +G+ + V + + +K Sbjct: 508 KMADF-PDEWLRG-DNPRDKYSGIIPETVNKDAEGEK 542 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. 
Identities = 109/481 (22%), Positives = 182/481 (37%), Gaps = 51/481 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNFLF+ D + C T NID L +G+ F ++Y PVC+P+RA TG Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTFYETPNIDRLCRQGMVFANSYASCPVCSPSRASCLTG 63 Query: 63 IYANQSGPWT-------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL 103 Y + G + T+ + KDAGY T ++GKWHL Sbjct: 64 KYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHLPEGEYTIAQALKDAGYDTWHVGKWHL 123 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 G +++ P + D G ++ + + E Sbjct: 124 GGREFY-----PEHFGFDVNIGGCSWGHPHDGY--------FSPYGIETLSEGPEGEYLT 170 Query: 164 HRISNRAVDFLQQPAR--ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 RI++ AV L++ + +PF M + + H P E ++ ELG + Sbjct: 171 DRITDEAVRLLRKRQACGSRKPFYMNLCHYAVHTPIQVKDEDRARFEKKARELGLDKETA 230 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVI 280 L HH V + P Y +D IGR++ AL + ENT V+ Sbjct: 231 LVEGEFHH--TEDKKGRRVVRRVIQSDPSYAGMIWNLDQNIGRLLEALRECGEEENTVVV 288 Query: 281 YTSDHGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDL 331 +TSD+G + + +Y+ TR+PLI++ P + D PV+ D Sbjct: 289 FTSDNGGLATSEGSPTCNLPASEGKGWVYEGGTRVPLIVKYPGRVAPGSRCDVPVTTPDF 348 Query: 332 LPTMMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 PT + LA + + + G +I+ + + + H G P V D+ Sbjct: 349 YPTFLELAGVPQKAGIPIDGRSIVPLLSGNPMPERPIFWHYPHYGNQGGTPASSVVMGDY 408 Query: 390 KLVLNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 K + ELYD + D +E +NL + + +++ L + ++ F Sbjct: 409 KYIEFFEDGRGELYDLKADFSETNNL--CEKMPETAARLRMLLHGWQREVCARFPEENAE 466 Query: 449 L 449 Sbjct: 467 Y 467 >UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36AAF Length = 468 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. 
Identities = 127/496 (25%), Positives = 198/496 (39%), Gaps = 57/496 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++TD Q +GCY + T N+D LA +G+RF++ + SPVC+PARA L T Sbjct: 4 KKPNVLFILTDDQGIWSMGCYGNSEIQTPNLDKLAKQGVRFDNFFCTSPVCSPARASLLT 63 Query: 62 GIYANQSGPWTN-----------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 G +Q G + K+ + GY GKWHL + Sbjct: 64 GKIPSQHGILDYLSGGNGGASQAAIEFLKDHRGYTDILAEEGYTCGLSGKWHLGDGGH-- 121 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 P+ +W+ + ++RNG I+E I++ A Sbjct: 122 -----PQKGFSFWYA--HQKGGGPYYNAPMFRNGQK---------IEEKGYITDVITDEA 165 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFT--CPVEYLEKYADFYYELGEKAQDDLANKPEH 228 + F+ + ++PF + V Y PH P+ P +Y + Y D +E + + Sbjct: 166 ISFIDREKNKEQPFYLSVHYTAPHSPWINCHPKKYTDLYEDCPFETCPQGE--------- 216 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE 287 WA+ YFA +DD +GR++ L E E+T +I++SD+G Sbjct: 217 VHPWAKTEVIAGYQKPRESLIGYFAAVTAMDDNVGRILKKLEEENLMEDTLIIFSSDNGF 276 Query: 288 MMGAHKLISKGA-----AMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALAD 340 G H + KG MYD ++PLI+ D S D +PT + Sbjct: 277 NCGHHGIWGKGNGTFPLNMYDSSVKVPLIMCHKGHIPENHVCDEMHSGYDFMPTPLDYLG 336 Query: 341 IEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFT 397 + E LPG++ L+ + E N + F + PVR + +KLV F Sbjct: 337 FKNDEADKLPGKSFLSALMGQEQKGEENSVVV----FDEYGPVRMIRSRKYKLVHRYPFG 392 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDAR 457 DE YD DP E +N I+D + DV M + + + DP + P + Sbjct: 393 PDEFYDLEVDPGEAYNGIEDESYQDVIRDMKKQMELWFLQYVDP--RIDGAKEPVMGGGQ 450 Query: 458 PRWMGAFRPRPQDGYS 473 G P Sbjct: 451 KDLAGVLGPGINVYGE 466 >UniRef50_C6VXD1 Sulfatase n=4 Tax=Bacteria RepID=C6VXD1_DYAFD Length = 474 Score = 449 bits (1157), Expect = e-125, Method: Composition-based stats. 
Identities = 113/442 (25%), Positives = 193/442 (43%), Gaps = 18/442 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N LF+ D N +G Y + + NID LA G+RF+ AYT P+C+P+R+ L T Sbjct: 30 KKFNVLFIAVDDL-NNDLGTYGNTFVKSPNIDRLAKRGVRFDKAYTQFPLCSPSRSSLLT 88 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKW-HLDGHDYFGTG--ECP 115 G + + + +I T+ + FK+ Y++ +GK H GT + P Sbjct: 89 GQRPDMTKIYELQTHFRKNLPDIVTLPQLFKNNNYYSARVGKIFHYGVPSQIGTDGLDDP 148 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W G + E K ++ R GL S +A + I++ A+ + Sbjct: 149 ESWSYRVNPKGRDKTEEPLIKNLTPDR-GLGSALAWRATEGTDDEQTDGLIASEAIKIMT 207 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + +EPF + V + PH P+ P +Y + Y L ++ +DL + PE Sbjct: 208 E--KKNEPFFLAVGFFRPHTPYVAPQKYFDMYPVDKVPLPKEIPNDLDDVPEAALFTKPP 265 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKL 294 Y+A F+D Q+G++I+AL + ENT ++ SDHG +G H Sbjct: 266 HWGLDEAKRREALRAYYATITFMDAQVGKLIDALDKLKLAENTIIVLWSDHGYNVGQHGQ 325 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 +++++ R+PLII P G + + V +D+ PT+ L ++ + L G+++ Sbjct: 326 W-MKQSLFENSARVPLIISVPGGTKGKASGRTVELVDIFPTLAELCGLDPKQNLQGKSLT 384 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--SDELYDRRNDPNEM 411 + + + + Y G I R T+ F+ ELYD + DP E Sbjct: 385 PLLKNPAAIWDKPAYTQVRR---GQIFGRSVRTERFRYTEWDGGNAGVELYDHQKDPGEF 441 Query: 412 HNLIDDIRFADVRSKMHDALLD 433 NL D F +++ L Sbjct: 442 TNLAKDNSFVITVNELALLLKK 463 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 449 bits (1157), Expect = e-125, Method: Composition-based stats. 
Identities = 104/457 (22%), Positives = 168/457 (36%), Gaps = 44/457 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN +F++ D VG K + T NID LA EG+ FN Y + VC P+R+ L T Sbjct: 28 QRPNIVFILADDLGYGDVGFNGQKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLT 87 Query: 62 GIYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G + + N +++T+ K +GY T GKW L G+ Sbjct: 88 GQHTGHTYIRGNKGVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLG---PVGSEGD 144 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P + D ++ LW N + + I I +A+ F+ Sbjct: 145 PNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGLIHNKEYAPDLIQKKALSFV 204 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 +PF + + Y PH P + L +Y +E D +Q Sbjct: 205 -NAQDGKQPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQ 263 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMM---- 289 P H + A +D +G+V+NAL + + NT VI+TSD+G + Sbjct: 264 DFP----------HATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGA 313 Query: 290 ------GAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADI 341 +Y+ R P R P + D + D+LPT LA+ Sbjct: 314 DPRFFNSGAGFRGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELANA 373 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-----NLF 396 P + G + + + + + + E GG + ++K V N Sbjct: 374 PAPRNIDGISFTDALKGKAIQKKHDYLYWEFHEQGGR---QAVRQGNWKAVRLKAAGNPD 430 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 ELYD DP E +NL +F + ++ + Sbjct: 431 ALVELYDLSKDPQEKNNL--TPQFPEKAKELGQIMNR 465 >UniRef50_C6J5I7 Sulfatase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I7_9BACL Length = 522 Score = 449 bits (1157), Expect = e-124, Method: Composition-based stats. 
Identities = 127/487 (26%), Positives = 193/487 (39%), Gaps = 68/487 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ TD Q + +GCY + T N+D LAA G F +A+ P+C P+RA L T Sbjct: 15 ERPNILFIHTDQQRADSLGCYGNTVIRTPNLDQLAASGTLFENAHCTHPLCMPSRATLLT 74 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH--------------- 106 G Y + + N + + T+ +GY T IGK H + Sbjct: 75 GRYMHAHRLYRNGIPLSQQEQTIAHLLSKSGYATGLIGKAHFTPYKGDPKVNPESVQINN 134 Query: 107 ----------------DYFGTGECPPEWDA-DYWFDGANYLSELTEKEISLWR------- 142 Y+G DY G +Y + E+ Sbjct: 135 GVAPEECWAYWRQFEGPYYGFDHVQMSMGHGDYGMKGGHYGLWVHEQHPDKVPLFDQDIH 194 Query: 143 -NGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPV 201 + V + + + +N+A++F++Q D PF + Y EPH PF P Sbjct: 195 GEPSDGVYRSWKSAVPLEIHSSTWTTNKAIEFIKQ--NKDRPFYAWIGYQEPHEPFNPPR 252 Query: 202 EYLEKYADFYYELGEKAQDDL-ANKPEHHRLW--AQAMPSPVGDDGLYHHPLYFACNDFV 258 Y + Y L + + PEH + + + Y+ C + Sbjct: 253 PYCDMYDPQEILLPVGRDGEWGSESPEHVQYYLNRGKWKDIREEKVREIIAHYYGCVSMI 312 Query: 259 DDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG 317 DD IGR++ L E +NT +I+TSDHGE +G H L KGA +TRIPL+I+ P Sbjct: 313 DDCIGRLMKTLEEEGLADNTIIIFTSDHGEWLGDHGLWLKGAVHARGLTRIPLMIKWPGT 372 Query: 318 E--RRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS- 374 R+V S ID++PT++ A E P + G ++ +V V Sbjct: 373 AVSGRRVSNVASLIDVMPTLLDAAGAEIPYGVQGTSLRSVLAGEQDKVRDYALIEHRHEP 432 Query: 375 --------------FGGFIPVRC--WVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLIDD 417 G VTD ++L ELYD + DP+E+ NL D Sbjct: 433 YHLNIQLEKEELVINKGTEEWHMKTIVTDRYRLSYIPSAQYGELYDHQTDPDELINLWD- 491 Query: 418 IRFADVR 424 +F ++R Sbjct: 492 -KFPELR 497 >UniRef50_A6DG38 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG38_9BACT Length = 498 Score = 449 bits (1156), Expect = e-124, Method: Composition-based stats. 
Identities = 108/475 (22%), Positives = 190/475 (40%), Gaps = 47/475 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + + +D A + CY + T +D LA G+RFN A + CTP+RA T Sbjct: 23 QRPNIILIFSDDHAKKALSCYGNTGIKTPALDRLADGGMRFNHALVTNSFCTPSRATALT 82 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----DGHDYFGTGECPPE 117 G Y++++G N + + T + + AGY T GKWHL G DY+ + Sbjct: 83 GKYSHKNGVTRLNQSFDGSQQTFPKLLQKAGYETSLFGKWHLLSQPTGFDYYCVQKMQGM 142 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 F+ + + ++ + G + I+ A+++++ Sbjct: 143 PFNPRVFEPQHGWVPWSPQDRKSYMKGGRVI----------KGYNNDVITTEAINWIKNR 192 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH--------- 228 ++PF +++ PH P+T + D DD + H Sbjct: 193 ENKNKPFCLLLHPKPPHAPYTPATRDEDYLKDVTIPEPANLHDDYKGRTPHAIAGKMTAN 252 Query: 229 ----------HRLWAQAMPSPVGDD------GLYHHPLYFACNDFVDDQIGRVINALTPE 272 R + + + + Y+ VDD +GRV++ L Sbjct: 253 RIILNPAFKSMRARIEKENPNISERELTSKMYQEYIKGYYRLVKSVDDNVGRVLDYLKES 312 Query: 273 QRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHI 329 E NT VIYTSD G +G H +K MY++ P +++ P + ++ SH+ Sbjct: 313 GLEKNTIVIYTSDQGFSLGEHGFYNK-QWMYEEPLHAPFLVKFPGTVKAGQVHNSMTSHV 371 Query: 330 DLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF 389 D+ PT++ A + PE + G ++ + + V Y +D + TD + Sbjct: 372 DIAPTILDFAGVTIPEGMQGFSLKPILLGKKEKVRDASYYHFYDHGVRLPEMIGIRTDRY 431 Query: 390 KLVLNLFTS----DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 KL+ EL+D +ND EM+NL + + D+ + + L + K D Sbjct: 432 KLIFYPGMKGHYRWELFDLKNDSQEMNNLHYNPEYRDLAQDLKNQLRELTIKYDD 486 >UniRef50_Q7UJ67 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UJ67_RHOBA Length = 505 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. 
Identities = 118/450 (26%), Positives = 192/450 (42%), Gaps = 22/450 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN LF+ D + +GCY T NID LAA G+ F AY P+C P RA + T Sbjct: 44 SKPNVLFIAVDDL-ASALGCYGDVVAKTPNIDRLAATGVCFRRAYNQLPLCNPTRASVMT 102 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG--ECP 115 G+ +Q + + N+ T+ + F+ AGY +GK +H + GT + P Sbjct: 103 GLRPDQIKVYDLDRHFRDEVPNVITLSQAFQQAGYFAARVGKIYHYNVPASIGTDGFDDP 162 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 P W+ G + E R ++ L A+ DE T I+ A+ ++ Sbjct: 163 PSWNQTVNPKGRDKDDEHLIFNAEPHRKISGALSWLAADGEDEEQT-DGMIATEAIRIMR 221 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + DEPF + V + PH P+ P +Y + Y L D + P Sbjct: 222 E--KKDEPFFLGVGFFRPHTPYVAPKKYFDMYPLESLRLPFAPAGDREDIPTAAFAHNCP 279 Query: 236 MPSPVGDD--GLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH 292 +P+ D+ L Y+AC F+D Q+GR+++AL + +NT V++ SDHG +G H Sbjct: 280 VPNYGLDETTLLKATQAYYACVSFIDAQVGRLLDALEEQGLADNTIVVFWSDHGYHLGEH 339 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 + + ++++ + PLIIR P + V +D+ PT+ +A IE P L G + Sbjct: 340 NGVWQKRTLFEEGAKAPLIIRDPSQLGLGSCNRIVEFVDIYPTLTDVAGIESPSGLAGRS 399 Query: 352 ILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRC---WVTDDFKLVLNLFTSD--ELYDRR 405 + + ++ + T ++ ELYD + Sbjct: 400 LKPLLNDPVANWNGTAITQVLRPADDRLPEQVMGCSIRTHRYRYTEWAEGRHGVELYDHQ 459 Query: 406 NDPNEMHNLIDDIRFA--DVRSKMHDALLD 433 +DPNE HNL D V ++ L Sbjct: 460 SDPNEFHNLALDPDERAVAVIRRLRPLLRA 489 >UniRef50_C4G6V3 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G6V3_ABIDE Length = 502 Score = 446 bits (1149), Expect = e-124, Method: Composition-based stats. 
Identities = 207/501 (41%), Positives = 284/501 (56%), Gaps = 22/501 (4%) Query: 1 MKRPNFLFVMTDTQATNMVG-CY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 M + F+ +MTD+Q +M+ C G+ ++T +D L +G+ F SAYT PVC PARAG Sbjct: 1 MAKKQFIVIMTDSQRRDMISRCNERGENMHTPCLDRLCDQGLAFQSAYTTQPVCGPARAG 60 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 LFTG Y + +G N +A + T+G+ AG H YIGKWHLDG DYFG G CP W Sbjct: 61 LFTGTYPHTNGMLGNCMALSQQSLTIGQRLSKAGIHAAYIGKWHLDGGDYFGDGICPEGW 120 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D +YW+D NYL EL E L++ + + I E FT+ +R + RA+DF+++ Sbjct: 121 DENYWYDMRNYLDELESDEDRARSRTLDTALEGEG--IGEEFTYGYRCTKRALDFMEKYK 178 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D + +VVSYDEPHHPF P + + + Y + D + PEH ++W + Sbjct: 179 DED--YFLVVSYDEPHHPFLSPKSFYKPFYQPYLQKP-NQHMDFSKLPEHIQVWHEKFSE 235 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKG 298 G G N F+D QIGRV+ A + E+ V+YTSDHG+ G+H + +KG Sbjct: 236 IQGGKGDGFAVGLLGSNSFIDSQIGRVLEA-AEKNAEDALVLYTSDHGDSQGSHGIHAKG 294 Query: 299 AAMYDDITRIPLIIRSPQGERRQVDT--PVSHIDLLPTMMALADIEKPEILPGENILAVK 356 AMY++IT IPLI R + T PVSHID++PT++ + +P+ L GE++L Sbjct: 295 PAMYEEITNIPLIARWKNKIEAGITTQMPVSHIDIVPTILDFYGLPQPKSLEGESLLNSL 354 Query: 357 ---------EPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 E R V VEFNRYE++HD +GGF PVRC V +KL +NL T DELY+ D Sbjct: 355 TDKEITGQKEGRPVFVEFNRYEVDHDGWGGFQPVRCVVKGKWKLTINLMTQDELYNLEED 414 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD---ARPRWMGAF 464 NEMHNLIDD +R+++HD LLD+ ++ RDP R Y W RPWRKD G Sbjct: 415 YNEMHNLIDDPNCESIRNQLHDLLLDWQNETRDPLRGYYWEKRPWRKDRQKVSWDCGGYS 474 Query: 465 RPRPQDGYSPVVRDYDTGLPT 485 R R ++ Y TGLP Sbjct: 475 RSRHREDGEVGEYGYSTGLPI 495 >UniRef50_B5JJG3 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG3_9BACT Length = 499 Score = 446 bits (1148), Expect = e-124, Method: Composition-based stats. 
Identities = 115/494 (23%), Positives = 189/494 (38%), Gaps = 55/494 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN LF+M+D A V Y + T NID LA EG+ F+ A+ + +C PARA TG Sbjct: 25 RPNILFIMSDDHANAAVSAYDDTLIQTPNIDRLANEGMLFSRAFCTNSICGPARAVTLTG 84 Query: 63 IYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 Y++ +G N + + T + + AGY T IGKWHL D Sbjct: 85 KYSHLNGFIVNESTSFDGGQQTYPKLLQAAGYETAVIGKWHLGSDPT----------GFD 134 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 +W ++ ++ I++ A+D+L + Sbjct: 135 FWKILIGQGQYY------------DAPFLTAEGQVETEGYVTDVITDLAIDWL-NTREDE 181 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVG 241 +PF+++ + PH F +YL D E DD A + + + Sbjct: 182 KPFMLMYQHKAPHANFQPGPDYLNWREDETIPEPETLFDDYATRSPAAWDNEMRIDPTLE 241 Query: 242 DD----------------------GLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTW 278 ++ Y C VDD +GRV L +NT Sbjct: 242 LQYQGELNLKVPDGLRGHERSRWLYQFYIKNYLRCVKSVDDGVGRVFEQLEAMGELDNTI 301 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMM 336 VIYTSD G +G H K MY++ +IPL++R P+ D V+++D TM+ Sbjct: 302 VIYTSDQGFFLGEHGYYDK-RFMYEESLQIPLLVRYPKMIEAGSVRDEIVTNLDFAETML 360 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGF---IPVRCWVTDDFKLVL 393 LA ++ P + GE+++ + + + + + G+ T+ +KL+ Sbjct: 361 DLAGVKVPSGMQGESLVPLLKGKKRKGWRDAMYYHFYEYPGYHYVKRHYGIRTERYKLIR 420 Query: 394 NLFT--SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 + ELYD DP E++NL + + ++ L D + Sbjct: 421 FYHDIEAWELYDLDEDPQELNNLYGSDGYEKLTKRLKKRLDKIQSNFGDSPELADELVER 480 Query: 452 WRKDARPRWMGAFR 465 + + PRW Sbjct: 481 YPHGSMPRWGRYQD 494 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 446 bits (1148), Expect = e-123, Method: Composition-based stats. 
Identities = 107/485 (22%), Positives = 182/485 (37%), Gaps = 50/485 (10%) Query: 3 RP-NFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +P NF+F++ D VGC + T +I+ LA G+RF + Y +PVC+P R + Sbjct: 33 KPLNFVFILVDDLGYMDVGCNNPQTFYETPHINQLAKTGMRFTNGYAANPVCSPTRYSIM 92 Query: 61 TGIYANQSGPWTN--------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 TG Y + N + +T+ K+ GY T + GKWHL Sbjct: 93 TGKYPTRVDATNFFSGKRAGKFLPAPLNDKMPLSETTIAEALKEHGYSTFFAGKWHLGPT 152 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 F P + D G + + + L+ H+ R+ Sbjct: 153 QEF----WPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRLTDGLKGEHLP------DRL 202 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 ++ F+ A DEPF +++ H P P + KY + LG +++ A++ Sbjct: 203 ASETAQFID--AHRDEPFFAYLAFYSVHTPLMGPGPLVTKYKEKAKRLGLTGKEEFADEE 260 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 + + L +H +Y A + +D +G+V+ L ENT V+ T+D+ Sbjct: 261 QVF-----PVDEKRRVRILQNHAVYAAMVESMDKAVGKVLQQLEESGVAENTVVMLTADN 315 Query: 286 GEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMM 336 G + + L +Y+ R +IR P G D PV D PT++ Sbjct: 316 GGLSTSEGSPTSNLPLRGGKGWLYEGGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTIL 375 Query: 337 ALADIE--KPEILPGENILAVKEPRGVMVEFN-RYEIEHDSFGGFIPVRCWVTDDFKLV- 392 LA + + L G ++ + + H S G IP D+KL+ Sbjct: 376 DLAGLPLKPQQHLDGVSLKPFLQGEAPFKRDALYWHYPHYSNQGGIPGGAIRVGDWKLIE 435 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL-RP 451 LY + D E +L + ++ + + M L + + F + P Sbjct: 436 RFEDGQVHLYHLKEDLGEKQDLAE--KYPERVAAMRKQLHKWYQETDAKFLQAKPGGPEP 493 Query: 452 WRKDA 456 WR Sbjct: 494 WRPGT 498 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 445 bits (1147), Expect = e-123, Method: Composition-based stats. 
Identities = 100/489 (20%), Positives = 178/489 (36%), Gaps = 46/489 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F+M D VG + T NID LA+EG+ F Y + + +P+R L T Sbjct: 29 RQPNVVFIMVDDYGWADVGYNGSRFYETPNIDRLASEGMIFTDGYAAASISSPSRVSLMT 88 Query: 62 GIYANQSGPWT-----------------------NNVAPGKNISTMGRYFKDAGYHTCYI 98 G Y ++G + TM FK+ GY T ++ Sbjct: 89 GKYPARTGITDWIPGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHV 148 Query: 99 GKWHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGL-NSVEDLQANHID 157 GKWH + P D G S + + + + Sbjct: 149 GKWHC----AEDSLYYPQYQGFDVNIGGWLKGSPNGIRRSQGGKGAYCSPYRNPYLPDGP 204 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 E R+ + ++ ++ + AD+PF + +++ H P EY++ + +G Sbjct: 205 EGEFLTDRLGDESIKLIKNSS-ADKPFFLYLAFYAVHTPIEAKPEYVKYFKWKAQRMGLD 263 Query: 218 AQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-N 276 E ++ A+ + + Y A +D+ +GRV+ AL + N Sbjct: 264 TIVPFTRNLEWYKN-AEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQALKDNGLDKN 322 Query: 277 TWVIYTSDHGEM-------MGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVS 327 T V SD+G + L + +Y+ R P II+ PQ TPV Sbjct: 323 TIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVEAGSVCHTPVV 382 Query: 328 HIDLLPTMMALADIEKP--EILPGENILAVKEPRGVMVEFN-RYEIEHDSFGGFIPVRCW 384 +D PT++ +A + + + G+++L + + + H G P Sbjct: 383 AVDFYPTLLDMAGLPLKSHQHVDGKSLLPLLKGDQAYDRGPIFFHYPHYGGKGDTPAGAV 442 Query: 385 VTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 D+KL+ ELY+ +ND +E +L + D ++M L + Sbjct: 443 RMGDYKLIEFYEDGHVELYNLKNDISETRDLSKTEK--DKAAEMQKMLHRWRTDCNAKMP 500 Query: 444 SYQWSLRPW 452 + P Sbjct: 501 TRNPHYVPV 509 >UniRef50_B5JYP8 Choline-sulfatase n=1 Tax=Octadecabacter antarcticus 238 RepID=B5JYP8_9RHOB Length = 531 Score = 445 bits (1146), Expect = e-123, Method: Composition-based stats. 
Identities = 111/443 (25%), Positives = 190/443 (42%), Gaps = 15/443 (3%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + +M D A + + Y T N++ LAA+G F + Y+ +P+C P+RA + + Sbjct: 5 SRPNIILIMADQMAAHALSLYGNTVCKTPNLERLAAQGTVFENGYSNNPLCVPSRASMLS 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+ + + N ++ TM Y + AGY T GK H G D D Sbjct: 65 GMLSPDVNVFDNANELPSSVPTMAHYLRHAGYWTELCGKMHFIGPDQEHGFNQRSV--TD 122 Query: 122 YWFDGANYLSELTEKEISLWRN-GLNSVEDLQANHIDETFTWAHRISNRAVDFL-QQPAR 179 + ++++ + LN V + + + + A+ L + Sbjct: 123 VYPASFQWIADWQAGPAFVPSGTALNGVVESGPCVRTMQEDYDDEVEHCAIQSLYDRARE 182 Query: 180 AD-EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR----LWAQ 234 D +PF +VS+ PH PFT EY ++Y + + H + + Sbjct: 183 PDRQPFFQIVSFTNPHTPFTVSQEYWDRYESSEIDAPAVGALPFEDLDYHSKALFFAHGR 242 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK 293 + Y+ +VDD++GR+++ L R+NT V + SDHGEM+G Sbjct: 243 HRHKVTQKHLIAARHAYYGMISYVDDKVGRILDTLEKTGQRDNTAVFFVSDHGEMLGERG 302 Query: 294 LISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 + K ++ +P I P + + VS +DLLPT + LA + PE L G ++ Sbjct: 303 MWFK-QTFWEWSAHVPFIASVPGITGGGRSEKVVSLVDLLPTFLDLAGADSPE-LAGSSV 360 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 L + E ++ + G +P R FK + LYD ++DP E++ Sbjct: 361 LPLMEGDADAWPDIAIS-DYLAIGPCVPCRMVRKGRFKFIYTHGHPALLYDLQDDPLELN 419 Query: 413 NLIDDIRFADVRSKMHD-ALLDY 434 NL D+ FADV +++ +L D+ Sbjct: 420 NLADNAAFADVLAELQAFSLTDW 442 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. 
Identities = 107/452 (23%), Positives = 176/452 (38%), Gaps = 67/452 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L +MTD Q V + + T D LA++G RF Y SPVC P R+ L T Sbjct: 29 ERPNVLLIMTDDQGWGDVRSHDNPLIETPQQDLLASQGARFERFYV-SPVCAPTRSSLLT 87 Query: 62 GIYANQ---SGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y+ + G +T+ FK AGY T GKWH H P Sbjct: 88 GRYSLRTGVHGVTRGFENMRAEETTIAEMFKAAGYKTGAFGKWHNGRHYPMH----PNGQ 143 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D +F W ++ + + +++RA+DF++Q Sbjct: 144 GFDEFFGFCGG----------HWNRYFDTNLEHNKQPVKTEGYITDVLTDRAIDFIKQ-- 191 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D+PF V Y+ PH P+ P +Y +KYA+ + Sbjct: 192 NKDQPFFCYVPYNAPHSPWIVPEKYWDKYANKGLD------------------------- 226 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG--AHKLI 295 Y A + VDD +GR++ L + +NT V++ +D+G + Sbjct: 227 ------DKARCAY-AMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNSNRYNGNMR 279 Query: 296 SKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIE--KPEILPGEN 351 + ++++ R+PL +R P V +HID+LPT++ L +E + L G++ Sbjct: 280 GRKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTADQPLDGKS 339 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFI-----PVRCWVTDDFKLVLNLFTSDELYDRRN 406 ++ + + R F I P TD ++ LYD + Sbjct: 340 LVPLLTNKSNKDWPQRMLFSDRLFRNSIPDDELPNGSVRTDRWR-AAYERGKWSLYDMQA 398 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 DP++ N+I+ V + A D+ + Sbjct: 399 DPSQKQNVIE--AHPAVIKDLSAAYRDWFKDV 428 >UniRef50_Q7UH28 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Rhodopirellula baltica RepID=Q7UH28_RHOBA Length = 534 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. 
Identities = 114/449 (25%), Positives = 198/449 (44%), Gaps = 30/449 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++ N +F++TD + +GC L T N+DS+AA G +A+ + +C+P+RA + Sbjct: 54 VEPRNVVFILTDDHRFDAMGCAGHPFLETPNLDSIAANGTHIKNAFVTTSLCSPSRASIL 113 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG+Y ++ NN +Y + AGY T ++GKWH+ GH P Sbjct: 114 TGLYTHKHRVIDNNRLVPDGTLFFPQYLQRAGYDTAFVGKWHMGGH------HDDPRPGF 167 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W + L G ++ + + +++ AVD+L++ Sbjct: 168 DHWVSFRGQGNYLPP--------GPKYTLNVNGERVKQKGYITDELTDYAVDWLKER-DD 218 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK--PEHHRLWAQAMPS 238 DEPF + +S+ H FT + +YAD ++ A+K P R + Sbjct: 219 DEPFFLYLSHKAVHSNFTPAERHQGRYADEDLSFLPTGKELSADKNTPRWVRDQKNSWHG 278 Query: 239 -----PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH 292 Y + Y VDD +GRV+ L ++T +IY D+G M G H Sbjct: 279 IDFSYHSDKGLDYLYRRYCESVLAVDDSVGRVLQQLKDMGIHDDTLIIYMGDNGFMWGEH 338 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 LI K + Y+ R+P++++ P + ++ V +ID+ PT++ A ++ PE + G+ Sbjct: 339 GLIDKRVS-YEASIRVPMLMQCPNLFDGGQPIENVVGNIDVGPTILHAAGLQTPEYMDGQ 397 Query: 351 NILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLVLNL--FTSDELYDRRN 406 + L + R + + F D FK + + +DELYD + Sbjct: 398 SFLDLPNNRDADWRKYFLYVYYWEKNFPQTPTQFALRGDRFKYITYYGLWDTDELYDLQT 457 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYM 435 DP+E++NLI D + V +M D L + Sbjct: 458 DPDELNNLIHDPDYKSVAKEMEDQLYAML 486 >UniRef50_B9XND0 Sulfatase n=3 Tax=Bacteria RepID=B9XND0_9BACT Length = 492 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. 
Identities = 118/476 (24%), Positives = 188/476 (39%), Gaps = 41/476 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN LF++ D +G + T ++D L +E + F +A + PVC+P RA L T Sbjct: 47 QPPNVLFIIADQWRAEAMGYNGNPDVKTPHLDHLQSESVDFVNAVSSVPVCSPTRASLMT 106 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G A G + N+V T+ + AGY T IGKWHLDGH + D Sbjct: 107 GQRALTHGVFVNDVPLSPKAITLSKVLHQAGYDTACIGKWHLDGHGRSQFIPRERRQNFD 166 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 YW L + S + L + A +L+ + A Sbjct: 167 YW----KVLECTHQYNNSFYFADLPFKLKWDGYDVFAQTH-------DASQYLRNHSHAK 215 Query: 182 EPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 +PF + +S+ PH P+ P Y +Y + N P R AQ Sbjct: 216 KPFFLYLSWGPPHDPYQTAPATYRSQYQAAKIK-------TRLNVPPGMRASAQT----- 263 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGA 299 + Y++ +D +G ++ L E NT VI+TSDHG+M+ +H L+ K Sbjct: 264 ------NLAGYYSHCTAIDSCVGTLLQTLKDTGLETNTLVIFTSDHGDMLHSHGLV-KKQ 316 Query: 300 AMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 +D+ R+PL++R P + R++D P + D +PT++ L P + G + A Sbjct: 317 HPFDESIRVPLLMRWPAGLGTQPRKLDAPFNSPDFMPTILGLCGAPVPNTVEGIDYSAYL 376 Query: 357 EPRGVMVEF------NRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 + + E+ G R T + V +L L+D DP + Sbjct: 377 QGDVNPSDGATLISCPVPFGEYSRQHGGREYRGIRTTRYTYVRDLNGPWLLFDNLEDPAQ 436 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRP 466 M NL+ A + + LL + + D F Q L W + + P Sbjct: 437 MDNLVGQPECAQLEEDLEKILLQKLAEANDQFLPGQAYLDRWGYKLNANGIIPYTP 492 >UniRef50_A6DNI8 Putative N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNI8_9BACT Length = 705 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. 
Identities = 122/520 (23%), Positives = 203/520 (39%), Gaps = 60/520 (11%) Query: 4 PNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN +F++TD Q + +G L T NID + EG+ F +++ +C PARAG TG Sbjct: 23 PNIIFILTDDQKYDAMGFMGHYPFLKTPNIDRIRNEGVHFKNSFVTLSMCAPARAGFLTG 82 Query: 63 IYANQSGPWTN--NVAPGKN-ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 Y +G TN +N + + AGY T + GKWHLD P Sbjct: 83 TYPQVNGVCTNVEGREFNQNKTPSFPLLLQRAGYETGFFGKWHLD-------HSNKPRLG 135 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D W + NG + D + H +++ A+DF+ + Sbjct: 136 FDRWVSFSGQG----------KYNGNDLNIDGKLVHNP--GYITDELTDYALDFIDK--N 181 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR--------- 230 +D+PF + +S+ H PFT + Y E D+L +KP+ R Sbjct: 182 SDKPFCVYLSHKAVHQPFTPAKRHSSLYKGETVPKKESFFDNLKDKPKWQRVNLPPEKLY 241 Query: 231 ---------LWAQAMPSPVGDDGLYHHPL--YFACNDFVDDQIGRVINALTPEQ-RENTW 278 A P P + H Y VD+ IG++ L ++ +NT Sbjct: 242 RLRYNNTHETPAVKTPRPYTKENGSHPHTKDYLRAIAAVDEGIGKIYALLENKKILDNTV 301 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMM 336 +I+ D+G ++G H+ Y++ RIPLI+R P +D V +ID+ PT++ Sbjct: 302 IIFAGDNGYLLGEHQ-RGDKRVHYNESMRIPLIMRYPAKIPADSTLDQMVLNIDVAPTIL 360 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLVLN 394 +A ++ PEI+ GE+ + + + + Y + + TD + Sbjct: 361 DIAGVKAPEIMQGESCMPLFDKSKKTPWRDAYLFTYWRDLIPTLPRIVAVRTDRYVYTTY 420 Query: 395 L--FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPW 452 +ELYD NDP+EM NL A++ M + + + ++ Sbjct: 421 PDIDDVNELYDLENDPHEMRNLATSPEHAEIVKAMEQKIEELKKET-------KYKKIVP 473 Query: 453 RKDARPRWMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEE 492 R P+W + + D + VK+ Sbjct: 474 RPRPEPQWGVQEGLICDLEFKQGLNSKDQSVIANKVKINN 513 >UniRef50_A6C9F6 Iduronate-2-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C9F6_9PLAN Length = 506 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. 
Identities = 109/451 (24%), Positives = 187/451 (41%), Gaps = 22/451 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN LF++ D +GCY + + NID LA +G+RF AY P+C P+RA TG Sbjct: 45 KPNVLFLICDDLNC-DLGCYGHPQVQSPNIDQLAKQGVRFEHAYCQFPLCGPSRASFMTG 103 Query: 63 IYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPEW 118 +Y +Q+ N + N+ TM + F+D GY +GK +H + + GT + Sbjct: 104 MYPDQTLVHRNGIYIREHVPNVKTMSQMFRDHGYFATRVGKIYHYNVPKHIGTSGHDDPY 163 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 + F+ + ++ SL A + ++ A+ L++ A Sbjct: 164 SWNQTFNPRGRDVDDEDQIFSLVPGSYGGTLSWLAAEGTDAEQTDGIAADIAIQQLKKFA 223 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 + EPF + V PH P+ P Y EKY ++ + L P R Sbjct: 224 ESKEPFFLAVGLYRPHTPYVAPKSYFEKYPVEQIKVPQIPDGYLKTIPASARKSVTRKKD 283 Query: 239 PV---GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKL 294 + Y+A F D Q+G +++AL + NT V++TSDHG MG H Sbjct: 284 QIDLPDKLARQAIQAYYASITFADAQLGHILSALKETGLDENTIVVFTSDHGYHMGEHGH 343 Query: 295 ISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 K ++++ T +P+II P + + P +D PT+ L ++ P + G + Sbjct: 344 WQK-TTLFENATHVPMIIAGPGVTAKGQAAAAPAEMVDFYPTLAELCGLKAPASVSGISQ 402 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD---ELYDRRNDPN 409 + + + + T F+ ELYD +DP Sbjct: 403 VPALKDATATPRKTALTQYLNGY-------SLRTPTFRYTEWGTNGSEGVELYDHSSDPA 455 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 EMHNL + + +R ++ + L + +++ Sbjct: 456 EMHNLANQAKTQKLRDELAEILHERIEQANA 486 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. 
Identities = 107/451 (23%), Positives = 169/451 (37%), Gaps = 47/451 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F+M D +GCY K + T +ID LAAEG++F AY S VC P+R+ L T Sbjct: 15 QKPNIIFIMADDLGYAELGCYGQKKIKTPHIDKLAAEGMKFTQAYAGSMVCQPSRSVLMT 74 Query: 62 GIYANQSGPWTN--NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G + + N N + +T+ K AGY T GKW L Y GT P + Sbjct: 75 GQHTGHTAVRANDLNQLLYEEDTTVAEVLKIAGYATGAFGKWGLG---YEGTPGRPGQQG 131 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D + + +W N + + + + I A F+Q+ Sbjct: 132 FDDFTGQLLQVHAHFYYPFWIW-NNEHRLMLPENENNQRGRYIHDLIHEDAKAFIQK--N 188 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 +PF + Y PH P E + Y + + + L +P + Sbjct: 189 KAQPFFAYLPYIIPHVELVVPEESEKPYRGQFPK-----KQILDPRPGYIGSEDGLTT-- 241 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG-----------E 287 + +DD +G ++ L R+NT +I+TSD+G Sbjct: 242 -----------FAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDF 290 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPE 345 G L +MY+ R+P I P + D ++ D+LPT+ +A P Sbjct: 291 FNGNAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAGTTVPS 350 Query: 346 I--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTSDELY 402 + G + L +G E E+ G I R ++K V N ELY Sbjct: 351 GVDIDGISFLPTLLGKGKQPEHEYLYWEYTR--GKIRSRAIRQGNWKAVQNRMNQPIELY 408 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 D D E NL + + + + Sbjct: 409 DLGTDIGETKNLAK--QHPEKIKDLQQIMQQ 437 >UniRef50_B4D6H3 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D6H3_9BACT Length = 525 Score = 441 bits (1135), Expect = e-122, Method: Composition-based stats. 
Identities = 113/478 (23%), Positives = 189/478 (39%), Gaps = 40/478 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+MTD Q + VG + T N+D LAA G F++ Y SPVC P+R FT Sbjct: 30 RRPNILFIMTDQQRWDCVGANGNTIIKTPNMDRLAARGANFSNVYVASPVCVPSRISFFT 89 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH-------DYFGTGEC 114 G YA+ N + + K+AGY T +GK H F E Sbjct: 90 GRYAHSHRNRVNYTPLDASEVLLQARLKEAGYRTASVGKLHYFPPTVEHAKSTGFDIVEL 149 Query: 115 PPEWDA-DYWFDGANYLSELTEKEISLWR---NGLNSVEDLQANHIDETFTWAHRISNRA 170 D W D + K+ +R + ++ ID +T R Sbjct: 150 HDGVPFTDKWSDYVKWRQANDPKKDIYYRATAKNIEPGKNPNRAAIDTQYTDTTWTGERG 209 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA-QDDLANKPEHH 229 +L + A+ +PF + VS+ +PH P+ Y Y D + E +DLA+ P Sbjct: 210 RYWLTELAKGQQPFFLYVSFWKPHSPYEIGPPYDSMYDDANIPIPETVTANDLASMPLPL 269 Query: 230 RLWAQA----MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSD 284 + + + + + + Y+ VD +IG ++ AL +NT ++++SD Sbjct: 270 QKLSLRENPNVWKQTQERVEWMYRSYYGAISHVDHEIGLLLEALEASGQAQNTLIVFSSD 329 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEK 343 HG+ + H++ K ++ ++PL++ P D + +DL+PT++ + + Sbjct: 330 HGDQLMEHRIYGK-NCFFEPSVKVPLMVSLPGRIKPAHYDQLMETVDLVPTLLDFIGLPE 388 Query: 344 PEILPGENILAVK--EPRGVMVEFNRYE--------------IEHDSFGGFIPVR----- 382 P + G + + R + + + G VR Sbjct: 389 PREVQGRSFAPLIADLGRPYTPHDAVFSENIIPEVITSGKMDLPFEKGKGVDGVRHPDAK 448 Query: 383 CWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 TD +K ELYD + DP E NL V +M LL+++ + Sbjct: 449 MVRTDRWKYCYYPEGYAELYDLQKDPGERTNLAGRPENHAVEEEMRTRLLNWLIDSAE 506 >UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE Length = 499 Score = 440 bits (1134), Expect = e-122, Method: Composition-based stats. 
Identities = 115/455 (25%), Positives = 200/455 (43%), Gaps = 35/455 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +R N +F+++D + +G L T ++D+LA +G +A+ C+ +C+P+RA + Sbjct: 27 RRRNVIFILSDDHRYDALGFMHPQPWLRTPHLDTLARDGAHLKNAFVCTALCSPSRASIL 86 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG+YA++ NN A + + + AGY T ++GKWH+ P+ Sbjct: 87 TGVYAHRHHIVDNNTAIPRGTRFFPQLLQRAGYKTGFVGKWHMG------REGDDPQPGF 140 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D W S L E RNGLN + H+ + +++ A+D+L+ Sbjct: 141 DKWVSFRGQGSYLPE------RNGLN----VDGKHVPQKGYITDELTDYALDWLRTVP-K 189 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN---KPEHHRLWAQAM- 236 ++P+ + +S+ H F + YA + + N +P + + Sbjct: 190 EQPYFLYLSHKAVHADFIPADRHKGAYAKETFRPPTTMDESGPNAQHRPMWVQNQRNSWH 249 Query: 237 ----PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 P D ++ Y VDD + R+++AL ++T VIY D+G G Sbjct: 250 GVDFPYHSDLDVGEYYKRYAETLLGVDDSVDRMLDALRERGQLDSTLVIYMGDNGFQFGE 309 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 H LI K Y++ R+PL+ R P+ R VD V+ +D++PT++ A P+ L G Sbjct: 310 HGLIDK-RTAYEESMRVPLLARCPEMFSGGRVVDRMVAGLDIMPTVLDAAGAAIPQGLDG 368 Query: 350 ENILAVKEPRGVMVEFNRYEIEHD---SFGGFIPVRCWVTDDFKLVLNL--FTSDELYDR 404 ++L + + E+ +F + TD +K V + SDELYD Sbjct: 369 RSMLPLLRGENDPQWRTQLLYEYYWERNFPQTPTMHALRTDRYKYVRYYGIWDSDELYDL 428 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 + DPNE NLI + + + L D M++ Sbjct: 429 QEDPNETTNLIYNPERKATIEEFNKRLFDEMERTD 463 >UniRef50_UPI00016C0ED5 sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0ED5 Length = 490 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. 
Identities = 112/508 (22%), Positives = 198/508 (38%), Gaps = 88/508 (17%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN +F+MTD A++ + CY K T N+D +A EG+RF++ + + +C P+RA + Sbjct: 1 MNQPNIVFIMTDDHASHSMSCYGSKINVTPNMDRIANEGMRFDNCFCTNSICAPSRAVIL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG +++ +G T N T + + AGY T IGKWHL Y Sbjct: 61 TGKHSHLNGVITLNDEFDGRQQTFPKLLQKAGYQTGIIGKWHLGEGGYADPT------GF 114 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 DY+ + + G + + I++ +++F++ Sbjct: 115 DYYCVLHGQG---EYFDPKMREQGEDKIF---------KGYATDIITDMSLNFIKDR-DK 161 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK--------------- 225 +PF+++ + PH P+T +Y + Y D E DD A + Sbjct: 162 SKPFMLMCHHKAPHRPWTVSAKYADLYKDEEILQPETFDDDYATRCDAAREAEMXXDNDF 221 Query: 226 ---------PEHHRLWAQAMPSPVGD------------------------DGLYHHPLYF 252 P R + P + + Y Sbjct: 222 MYRDLKLVPPPTKRPMDKIPPPDSLEGYTLTPEETGVPVSFSSYAELKNFKYQRYIKDYL 281 Query: 253 ACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLI 311 C VDD IG++++ L E E+T V+YTSD G +G H K MY++ R+P + Sbjct: 282 RCVASVDDGIGQLLDCLNEEGIAEDTIVVYTSDQGFFLGDHGWYDK-RFMYEESLRMPFV 340 Query: 312 IRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYE 369 I+ P+ R D + ++D T + A + P+ + G++ + E Sbjct: 341 IKYPRAIKAGRVSDKMILNLDFAETFLDFAGVPIPDDMQGKSFRRILEDENAPAIQTAMY 400 Query: 370 IEHD---SFGGFIPVRCWVTDDFKLVLNL--------------FTSDELYDRRNDPNEMH 412 + + T D+KL+ + EL+D + DP+E++ Sbjct: 401 YRYWMHLAHHNIWSHYGIRTLDYKLIYYYAQALGKSGTIDEYREAAWELFDLKKDPHELN 460 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRD 440 N+ D+ +AD+ K+ D + + D Sbjct: 461 NVYDNPEYADLIVKLKDEMYKLKAEAED 488 >UniRef50_Q7UMT6 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Bacteria RepID=Q7UMT6_RHOBA Length = 524 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. 
Identities = 107/455 (23%), Positives = 188/455 (41%), Gaps = 29/455 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 PN LF++ D + +G L T +ID++A +G AY + +C+P+RA + T Sbjct: 41 SPPNILFILCDDHRFDCLGVAGHPFLETPHIDTMARDGAMLRRAYVTTSLCSPSRASILT 100 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G YA+ N A N+ +DAGY T +IGKWH+ G P+ D Sbjct: 101 GQYAHNHRVVDNYHAVDPNLVFFPESLQDAGYQTAFIGKWHMGG------DIDDPQRGFD 154 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVE--DLQANHIDETFTWAHRISNRAVDFLQQPAR 179 +W + + + + + ++ + + ++ ++D+L+ Sbjct: 155 HWVSFRGQGTYWPDGHGTTREVPQTTYDGFNVNGKRVPQRGYITDELTEYSLDWLK-GRD 213 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADF--YYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF + VS+ H F + +Y + E+ D NKP R + Sbjct: 214 PNKPFFLYVSHKAVHADFVPADRHRGRYDNEALPIEIPTVEAMDAGNKPMWVRNQRNSRH 273 Query: 238 S-------PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMM 289 P +Y+ Y VDD +G++ L ++ + NT V+Y D+G Sbjct: 274 GVDFGYNLPGFSPEVYYRR-YCESLLAVDDSVGQLREFLKQQELDQNTIVVYMGDNGFQF 332 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALADIEKPEIL 347 G H LI K Y+ ++PL++ +P V D V +ID+ PT++ A+ P+ + Sbjct: 333 GDHGLIDK-RTAYEASAKVPLLVVAPGKIPAGVPFDGLVGNIDIAPTLLEAANASAPKNI 391 Query: 348 PGENILAVKEPRGV----MVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL--FTSDEL 401 G+++ ++ + + FK + + DEL Sbjct: 392 NGQSVWQALCSSDASSLNDRTLLYEYYWERNYPHTPTLHAVIGGRFKYIRCHGLWDRDEL 451 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 YD +DP EM NLIDD R+AD ++ L + Sbjct: 452 YDLESDPGEMQNLIDDSRYADRVESLNQRLWQLLK 486 >UniRef50_C6J2Z0 Sulfatase n=4 Tax=Firmicutes RepID=C6J2Z0_9BACL Length = 502 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. 
Identities = 130/497 (26%), Positives = 210/497 (42%), Gaps = 58/497 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + +D Q N +G + L+T N+D L EG F AY +P CTP RA + Sbjct: 1 MKKPNILLITSDQQHWNTIGAF-NPELSTPNLDRLVQEGTTFTRAYCPNPTCTPTRASII 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH----LDGHDYFGTGECPP 116 TG+Y +Q G WT ++ +G FK+AGY T +GK H + +Y P Sbjct: 60 TGLYPSQHGAWTLGTKLLEDRPVVGTNFKEAGYRTALVGKAHFQPLMGNEEYPSLESYPL 119 Query: 117 EWDADYW-----------------------FDGANYLSELTEKEISLWRN------GLNS 147 D DYW G +Y + EK + WR+ G S Sbjct: 120 LQDLDYWRQFSDSFYGFDHVELARNHTNEAHVGQHYAIWMEEKGCTNWRDYFLPPTGTMS 179 Query: 148 VEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 + L + E + + I+ R L+Q +E F + S+ +PH P+ + Y Sbjct: 180 PKQLHRWDLPEEYHYNTWIAERTNALLEQYKNNNESFFLWASFFDPHPPYLVSEPWDTMY 239 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGD------------------DGLYHHP 249 + E + + N P H + Q P + Sbjct: 240 DPESLTIPEVSPGEHDNNPPHFGMTQQKSPDFSAWKETGQAIHGYHSHLMPESERKQLVA 299 Query: 250 LYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRI 308 Y+ +D IGR+++ L E+T V++T+DHG G H L +KG MY+D+ ++ Sbjct: 300 TYYGMISMMDKYIGRILDRLDELGLAEDTIVVFTTDHGHFFGQHGLQAKGGFMYEDLIKL 359 Query: 309 PLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFN 366 P I+R P Q D VS +DL PT ++ A + P + G + AV + Sbjct: 360 PFIVRYPGKVPANVQSDALVSLVDLAPTFLSFAGLPIPVWMTGVDQSAVWTGSKSSARDH 419 Query: 367 RYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRS 425 E I R +V +KL + EL+D + DP E++N +D +A+++S Sbjct: 420 -IICEFRHEPTTIHQRTYVDQRYKLTVYYNQPYGELFDLQEDPGELNNRWNDPSYANLKS 478 Query: 426 KMHDALLDYMDKIRDPF 442 ++ + + + ++P Sbjct: 479 ELLLKYV-WAELGKEPL 494 >UniRef50_C6J5I8 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5I8_9BACL Length = 522 Score = 439 bits (1131), Expect = e-121, Method: Composition-based stats. 
Identities = 114/469 (24%), Positives = 188/469 (40%), Gaps = 30/469 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRP+ F+M D + +G + T ++D+L+ + + F +AYT P+C PARA + T Sbjct: 8 KRPHVFFLMCDELRADSLGYMGNSIVKTPHLDNLSKDAVIFENAYTNCPMCVPARASMMT 67 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-LDGHDYFGTGECPPEWDA 120 G +G N + + + + GY T GK H + G E + Sbjct: 68 GRNPISNGVLDNAMLMIDDEKPLPDLLRQNGYTTTLFGKLHVHRSAEEIGFEEFQSGYGD 127 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Y E+ +K G + + T R++ + + + + Sbjct: 128 PYTSFLGIKDPEMRKKSSYKKNEGDIPLVIHGESPTHPDQTPCSRLTEDYIRRISEIPGS 187 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA--QAMPS 238 D+P +S +PH P+ Y E Y L A L +KP HR + + Sbjct: 188 DKPIFHHLSLHDPHTPYMPTKPYSEMYDPAQMPLPPNAGRSLDDKPITHRYFHKVRGFDK 247 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISK 297 +D Y+ VDD+IG+VI L + +++ +I+TSDHG MMG H + K Sbjct: 248 LTEEDYRKSLASYYGLVTHVDDRIGKVIARLKELELYDDSLIIFTSDHGSMMGEHGFVEK 307 Query: 298 GAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 MY+ + RIPL+++ PQ ++DT ID+LPT++ A I PE + G+++L V Sbjct: 308 WGHMYEPVVRIPLLVKLPQNVNGGMRLDTFAEIIDILPTILDAAGIAVPEEVQGKSLLPV 367 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT------------------ 397 + P +KL + Sbjct: 368 CRGESKEHRTEAHSQYFCGSLHREPALMIRDHQWKLTIYPEQESIHEKLYGDHYLKYSPF 427 Query: 398 ------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 ELYD +DP E HNL D+ ++A + +M L + + Sbjct: 428 FDLPLVEGELYDLLSDPYEQHNLFDNPKYAAQKEEMLSKLESWKQSLGA 476 >UniRef50_A6CFT9 Iduronate-2-sulfatase n=2 Tax=Planctomycetaceae RepID=A6CFT9_9PLAN Length = 489 Score = 439 bits (1129), Expect = e-121, Method: Composition-based stats. 
Identities = 104/457 (22%), Positives = 178/457 (38%), Gaps = 27/457 (5%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN LF+ TD + CY + T N+D LA G+ F AY +C P+RA L Sbjct: 30 VEKPNVLFIGTDDLRC-DLACYGHPLVKTPNLDKLATRGVLFKRAYCQQALCNPSRASLM 88 Query: 61 TGIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 TG + W NI T+ + FK GY T IGK + Sbjct: 89 TGRRPDTLEIWDLPTHFREADPNIVTLPQLFKQQGYFTQNIGKIFHNWRQKIQGDPASWS 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 A F + + L N L ++ + ++ + RI + AV LQ Sbjct: 149 VPAVMHFARHDDDQPMLNDNRELPVN-LAKAPRSESRDVPDSAYFDGRIGDLAVKALQDL 207 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM- 236 + +PF + V + +PH PF P +Y + Y D + + Q + + Sbjct: 208 KQKQQPFFLAVGFWKPHLPFNPPKKYWDLYDDSPITVPDNPQPPKNVPDVALHDSREILR 267 Query: 237 ---PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH 292 + Y A ++D Q+G+V+ L E T +++ SDHG +G H Sbjct: 268 AVKGKLTDAQIIELRTGYLAGISYLDAQLGKVLAELDRLGLREKTIIVFWSDHGFHLGEH 327 Query: 293 KLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 L K + +++ R+PL+I P + + D V +D+ PT++ L ++ P L G Sbjct: 328 GLWCK-TSNFENDARVPLMISVPHMKTAGKTSDALVELLDMYPTLVELCGLDSPGKLEGT 386 Query: 351 NILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRC---WVTDDFKLVLNLFTS------DE 400 +++ V + V+ + + ++ P T ++ E Sbjct: 387 SLVPVLKDPTQSVKPAAFTQHPRPAYYRKQPENMGVSVRTPRYRYTEWRNFKTGKVIARE 446 Query: 401 LYDRRNDPNEMHNLIDDI----RFADVRSKMHDALLD 433 LYD +DP E N+I++ F + Sbjct: 447 LYDHTSDPEENTNIINEPTDRADFQAAVKLLEAQFPR 483 >UniRef50_A6DLX7 Putative sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLX7_9BACT Length = 502 Score = 439 bits (1129), Expect = e-121, Method: Composition-based stats. 
Identities = 126/508 (24%), Positives = 212/508 (41%), Gaps = 36/508 (7%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN L++M+D N G + T N+D LA EG+ F + +P+C+P+R Sbjct: 1 MSKPNVLWLMSDQHNANCTGYAGNPNVKTPNLDDLANEGVEFEQGFCNNPICSPSRLSFI 60 Query: 61 TGIYANQSGPWT--NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------DGHDYFGTG 112 TG+Y N G NN N +T+ F+ GY T +GK H+ +G +Y Sbjct: 61 TGLYTNNHGYLGNRNNDVTTPNPNTLSSLFRRFGYQTGLVGKSHMITGWDKEGFEYIRYT 120 Query: 113 ECPPEWDAD----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 + D D ++FD E + G +++ Q + + H N Sbjct: 121 DMCDADDNDPHTCHYFDYLAQRGLADHYEEGSPKEGQQTLDGSQPASLPYKHSIEHYTGN 180 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK----AQDDLAN 224 ++++FL+ D PF + +S+ PH P T E + Y L E ++ Sbjct: 181 KSLEFLENR-DQDRPFFLKMSFQRPHDPITPAPEDFDMYNPEDIVLPESISDLFENKFVG 239 Query: 225 KPEHHRLWAQAMPS-----PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTW 278 KP+ + + Y+A +D++IGRVI+ L +NT Sbjct: 240 KPQFMQDYVANPGDYPMCVADEAKLKRALASYYALITKIDEEIGRVIDHLKETGEYDNTI 299 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMA 337 + YT+DHG+ G H L K +Y+ I RIP +++ P G + V +D T+ Sbjct: 300 IFYTADHGDFAGEHGLFLKNLGIYESIHRIPFLLKWPGGPTGVKNKELVESVDWYATLCD 359 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF- 396 L +I+ P+ + G +++ V + + EH T ++LV Sbjct: 360 LCNIQAPDNVDGRSLVPVAKGEAKG--SDAIICEHH------TSTAIRTKQYRLVYYRET 411 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA 456 ELYDR NDP+E++NL + +R + +L+Y + R L Sbjct: 412 GEGELYDRGNDPDELNNLWSHADYQSIRMDLMQQVLNYHMSYQ---RKTYNELDQVINKK 468 Query: 457 RPRWMGAFRPRPQDGYSPVVRDYDTGLP 484 R A + + YS +++ Y+T P Sbjct: 469 RKHSFSALLQKEKAYYSDLIKVYETKKP 496 >UniRef50_A6C8U0 Choline sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8U0_9PLAN Length = 479 Score = 439 bits (1129), Expect = e-121, Method: Composition-based stats. 
Identities = 107/447 (23%), Positives = 188/447 (42%), Gaps = 17/447 (3%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F+++D Q + + + T ++D L G F A +P+CTP+RA + +G Sbjct: 33 QPNIVFLLSDDQRPDTIAALGNPIIKTPHLDQLVKAGTSFTRAVCANPICTPSRAEILSG 92 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD---YFGTGECPPEWD 119 + +G K + T + AGY+T Y+GKWH DG + Sbjct: 93 VSGFHNGSMDFGKPIKKELPTWSQTLSKAGYNTWYVGKWHNDGKPVLRGYDETLGLFTGG 152 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 W + + + W + + T + ++ A++F+++ + Sbjct: 153 GGRWAVPSYDGNGVLVTGYRGWIFQDDERHFFPEKGVGLTSNISEHFADAAIEFVER--K 210 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA-NKPEHHRLWAQAMPS 238 +PF + V + PH P P+ Y + Y + + +P Sbjct: 211 HQKPFFLHVCFTAPHDPLLMPIGYEQNYDPDQMPVPANFLPQHPFDHGNFDGRDEALLPW 270 Query: 239 PVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLIS 296 P + + + LY++ +D Q+GR++ AL ENT +I++SDHG MG+H L Sbjct: 271 PRTKEIVKNDLSLYYSVISHLDAQVGRIVKALKKTGEWENTILIFSSDHGLAMGSHGLRG 330 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 K MY+ +PLI+ P + DL PT LA + P+ + G+++ V Sbjct: 331 K-QNMYEHTVNVPLIMVGPGIPADTLSNAQCYLRDLYPTSCDLAGVPIPKTVEGKSLKPV 389 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNL 414 + V ++ +F R TD +KL+ +L+D +NDP E H+L Sbjct: 390 LSGQLDAV-YDEVYCYFRNF-----QRMIRTDRWKLIYYPHLDRVQLFDLKNDPLEQHDL 443 Query: 415 IDDIRFADVRSKMHDALLDYMDKIRDP 441 + VR K+ D L D+ + DP Sbjct: 444 SGEAALQQVRGKLLDQLNDWRKQQNDP 470 >UniRef50_A6CBG2 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Planctomyces maris DSM 8797 RepID=A6CBG2_9PLAN Length = 633 Score = 438 bits (1128), Expect = e-121, Method: Composition-based stats. 
Identities = 99/464 (21%), Positives = 179/464 (38%), Gaps = 50/464 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +P+ + V+ D + +GC + T +ID ++ EG RF +A+ +P+C+P RA L TG Sbjct: 191 QPDMVVVLVDDLRWDELGCMGHPFVRTPHIDRISREGARFRNAFCSTPLCSPVRACLLTG 250 Query: 63 IYANQSGPWT--NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 Y + G + N + T + + AGY T Y+GKWH+ D Sbjct: 251 RYTHNHGIFDNINRSEHSHTLKTFPQELQKAGYATAYVGKWHMGNDDTARP-------GF 303 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W + + ++ I ++ + +F++ A+ Sbjct: 304 DHWVSMKGQGTS------------FDPTLNINGERIQFKGHTTDVLNQKVNEFVK--AQG 349 Query: 181 DEPFLMVVSYDEPH------------HP----FTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++PF + +++ H P F + + Y+D D L Sbjct: 350 EKPFCLYIAHKALHPELTQRDDGSITDPSAAKFMPAKRHEKLYSDDAIPRRLNVVDTLEG 409 Query: 225 KPEHHRLWAQAMP-SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 K R P S +D+ +G + L + ++T ++T Sbjct: 410 KRALKRTVPGLPPLSQKTGTSDEVIRDRLRMLAGIDEGVGSLCELLESQGKLDDTVFVFT 469 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALAD 340 SDHG G H L + Y++ R+PL++R P +D +DL PTM+ LA Sbjct: 470 SDHGYWYGEHGLSVERRLPYEEGIRVPLLVRYPPVIKAGTVIDEFAVSVDLAPTMLDLAH 529 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFI-----PVRCWVTDDFKLVLNL 395 ++ + G +++ + + + +E++S F T +K + Sbjct: 530 VKTDQKYDGRSLVPLLKGEHPADWRQSFLVEYNSDTVFPRLVKMGYTAVRTPRWKYIQFN 589 Query: 396 F--TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 +ELYD DP EM NLI+D + ++ L M Sbjct: 590 ELTGMNELYDMLRDPYEMQNLINDPAAKETVKQLQAELKQLMKD 633 >UniRef50_B8FL44 Sulfatase n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FL44_DESAA Length = 468 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. 
Identities = 116/442 (26%), Positives = 193/442 (43%), Gaps = 32/442 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF++TD + +GC + T N+D LA++G+ FN+++ S +C+P+RA T Sbjct: 50 KKPNVLFILTDDHRYDHMGCAGHPFIKTPNLDRLASQGVYFNNSFVTSSLCSPSRASFLT 109 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G YA+ G N T FK GY T ++GKWH+ GE P D Sbjct: 110 GQYAHTHGVQNNLTPWDNGNVTFLERFKQEGYDTAFLGKWHM-------PGELPKLRGVD 162 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + + L NG ++ +++RA++F+ + +D Sbjct: 163 EFVTFTVRGGQGQYWDCPLIVNGEDAK--------PNKRYITEELTDRAINFIDR--ESD 212 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVG 241 PF + +S+ HH + P + + Y+D L E+A + A+ Sbjct: 213 NPFCLYLSHKAAHHDWKPPTDLKDLYSDEELPLAEEADT-------WVTMTNGAVFCGTT 265 Query: 242 DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAA 300 YH+ Y VD Q+GR++ L + +NT V+Y D+G G H+ I K Sbjct: 266 GTLQYHYRNYCRVVASVDRQVGRLLKFLEDKGLADNTIVVYAGDNGYFWGEHRKIDK-RW 324 Query: 301 MYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 Y++ RIP +IR+P R+ D +IDL PT+ LA IE + G+++ + Sbjct: 325 AYEESIRIPFMIRAPGVVPDPGRKADQMALNIDLAPTLFDLAGIEPHAGMEGQSLAPILR 384 Query: 358 -PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-SDELYDRRNDPNEMHNLI 415 R E YE D ++ T + + + E YD + DP E N+ Sbjct: 385 NGRTPGREAWLYEYFKDYPYNVPAIQAIRTQNNIYIEYESSRKPEYYDLQADPKEKQNIY 444 Query: 416 DDIRFADVRSKMHDALLDYMDK 437 D + AD+ S+ + + + Sbjct: 445 DQLEAADI-SRYQQMIAAFAKE 465 >UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT Length = 477 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. 
Identities = 110/489 (22%), Positives = 183/489 (37%), Gaps = 99/489 (20%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D V CY K T NID LA +GI+F +TC P C P RA L +G Sbjct: 22 KPNIVFILADDLGYTDVACYGSKYYETPNIDKLAKDGIKFTDGHTCGPNCQPTRASLMSG 81 Query: 63 IYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD 104 Y ++G +T N + T+ + K AGY T GKWHL Sbjct: 82 QYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQSLKKAGYATGMFGKWHLG 141 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 P + D + + + A Sbjct: 142 EDKEHH----PAQRGFDEALVSMGVHFDFVTNPKVDY---------------PKDEYLAD 182 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++++A+DF+++ DEPF + + + H P E ++K++ Sbjct: 183 FLTDKALDFIKR--HKDEPFFLYLPHYAVHKPLQAKKELIQKFSAKQ------------- 227 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 +H+P Y A VD+ +GRV+ L + +NT VI++S Sbjct: 228 -----------------GVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTLVIFSS 270 Query: 284 DHGEMMG--------------AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVS 327 D+G + G + L +Y+ R+P I R P + D P+ Sbjct: 271 DNGGVGGYQREGIKKAGDVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVCDQPII 330 Query: 328 HIDLLPTMMALADIEKPE--ILPGENILAVKE-PRGVMVEFNRYEIEHDSF-------GG 377 IDL PT++ LA + PE L G + L V + + + + Sbjct: 331 SIDLYPTLLELAGAKAPEKYPLDGTSYLKVLKSGGMKKLNRDAIYWHFPGYLGAGADTWR 390 Query: 378 FIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 +PV D+KL+ ELY+ R D E +NL + + ++ L+ + Sbjct: 391 TLPVGVVRCGDWKLMEFFEDHRLELYNLREDLGETNNLAAKM--PEKAQELEKKLVAWQK 448 Query: 437 KIRDPFRSY 445 +++ P + Sbjct: 449 EVQAPMPTA 457 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. 
Identities = 107/502 (21%), Positives = 178/502 (35%), Gaps = 80/502 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +++ D +GCY + + T N+D LA EGIRF YT +PVC PARA L TG Sbjct: 27 PNIVYIYADDLGYGELGCYGQQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGK 86 Query: 64 YANQSGPWTN-------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 +A S N + N T+ K GY T GKW + ++ G Sbjct: 87 HAGHSAIRGNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNTEG 146 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRN-----------------GLNSVEDLQA 153 T P DY++ + LW N D Sbjct: 147 T---PTRQGFDYYYGYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDADF 203 Query: 154 NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 +++ +A+ F+ + PF + + Y PH P EY++KY + E Sbjct: 204 ESFKGKEYAPAKMTEKALAFIDRSKAG--PFFLYMPYTLPHVSLQAPDEYVKKYIGQFDE 261 Query: 214 LGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 + + A+ Y Y + F+DDQ+G +++ L Sbjct: 262 KPYYGEKNYAS-------------------TKYPLSTYASMITFLDDQVGIILDKLKALG 302 Query: 274 R-ENTWVIYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RR 320 +NT V+++SD+G L +Y+ R P I+R P R Sbjct: 303 LDDNTIVMFSSDNGATFNGGVNPQFFNSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGR 362 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP 380 D + DL+PT+ L P G + L + ++E + + Sbjct: 363 VSDHVSAQFDLMPTLAELTGQASPPT-DGISFLPELLGQTN--RQKKHEFLYFEYPEKGG 419 Query: 381 VRCWVTDDFKLV-----LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 D+K V N +L++ + D +E ++ D+ K+ + Sbjct: 420 QIAVRMGDWKGVKTDLRKNPGNPWQLFNLKTDRSESTDVAAS--HPDILKKLDQIVKR-- 475 Query: 436 DKIRDPFRSYQWSLRPWRKDAR 457 + +P + + P +R Sbjct: 476 -EHEEPANAAWQFVMPVIAASR 496 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. 
Identities = 97/484 (20%), Positives = 173/484 (35%), Gaps = 67/484 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D +GCY + + T NID LAA+G+RF YT VC P+R L TG Sbjct: 26 KPNVIFILADDLGIGDLGCYGQQKIRTPNIDHLAADGMRFLQHYTGCSVCAPSRCALMTG 85 Query: 63 IYANQSGPWTNNV---------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 + + N ++ T+ R ++AGY+T IGKW L + + Sbjct: 86 RHMGHAAIRDNAQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLGMPEDHSS-- 143 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH---------IDETFTWAH 164 P + +Y F T LWRN ++ Sbjct: 144 -PRDMGFNYSFGYLCQSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEPKGEIYSHD 202 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 +++ A+ F++ D+PF + +++ PH P + + +Y + E + AN Sbjct: 203 VMASDALKFVRD--HHDKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPFRNTKHYAN 260 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 Y +D +GR++ L +NT V ++S Sbjct: 261 NETP-------------------RAAYAGMITRMDRDVGRLMALLKELGIDDNTLVFFSS 301 Query: 284 DHG-----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHID 330 D+G +Y+ R PLI R P D D Sbjct: 302 DNGAVFPLAGTDPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASVFYD 361 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 LPTM L + P G + L + + + + + + D+K Sbjct: 362 FLPTMAELNGVPPPADTDGLSYLPTLLGKPAQQKQHDFL--YWEYQSAGGAVAVRMGDWK 419 Query: 391 LV-----LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 + N + E+Y+ +D E H++ ++ +K + + + P + + Sbjct: 420 AIANKIKKNPNANFEVYNLASDRTESHDVAA--EHPEIVAKAREIIAR--EHTPSPIKEW 475 Query: 446 QWSL 449 ++L Sbjct: 476 NFTL 479 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. 
Identities = 102/457 (22%), Positives = 170/457 (37%), Gaps = 55/457 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D +GCY + T +ID LA G +F Y+ SPVC P+R L TG Sbjct: 25 KPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGGAKFTRFYSGSPVCAPSRCVLMTG 84 Query: 63 IYANQSGPWTNNV-------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 ++ + N T+ K GY T +GKW L D G+ P Sbjct: 85 KHSGHATVRNNVEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMFDTAGS---P 141 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D +F + ++RN + FT A+ F++ Sbjct: 142 LKHGFDLFFGYNCQRHAHSHYPTYIYRNDKRVELKGNDGKTGKQFTQ-DLFEEEALGFIE 200 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 A +PF + + + PH P + L +Y + + Q Sbjct: 201 --ANKAKPFFLYLPFTVPHVAVQVPEDSLNEYKGQLGDDPAY----------DGKKGYQP 248 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG-------- 286 P+P H Y A +D +GRV+ L E NT V++TSD+G Sbjct: 249 HPAP--------HAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNGPTHNVGGA 300 Query: 287 ---EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADI 341 A KL ++Y+ R+P I P + D P+ D+LPT+ A A Sbjct: 301 DSSFFNSAGKLRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGT 360 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT---- 397 + P + G + L + + + F G+ + + ++K V Sbjct: 361 KAPSAIDGISFLPLLKGEKQPTHD----FLYWEFSGYGGQQAVIEGEWKAVRQALGMGGV 416 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 ELY+ DP+E ++ + V +++ L + Sbjct: 417 KTELYNLAKDPSEKEDVAA--KNPAVLARLEKRLKNE 451 >UniRef50_A6DME6 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DME6_9BACT Length = 461 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. 
Identities = 106/448 (23%), Positives = 178/448 (39%), Gaps = 33/448 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D +G Y + + NID LA+ F +A+ VC P+RA L T Sbjct: 19 EKPNVLFIAVDDLKPE-LGAYGNTQVKSPNIDKLASRSSVFTNAHCQWAVCGPSRASLMT 77 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y +G ++ T+ ++FK++GY T GK + T + P W Sbjct: 78 GLYPESTGVMDLKTPMRSVNPDVLTLPQHFKNSGYFTAATGKIYDPRCVDGRTKDDAPSW 137 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 Y + ++G + + + N D T +I +D L+Q Sbjct: 138 STPY---------KTLNYGKVKLKDGKHFAKAPELNDEDLT---DGQILLNGLDLLEQAQ 185 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-----DLANKPEHHRLWA 233 D+PF + V + +PH PF P +Y + Y L D + Sbjct: 186 NQDKPFFVAVGFKKPHLPFVAPKKYWDLYDRERLTLPSFLDKAQGASDYGWHDSNELRSY 245 Query: 234 QAMPSPVG---DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM 289 +P + + Y AC ++D +GR+I L +NT ++ DHG + Sbjct: 246 DGIPKKGPIAIELQKEAYHGYLACVSYIDALVGRLIQDLEKRNLADNTIIVLWGDHGFHL 305 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 G H + K + + TR PLII P+ + ++ TP ID+ PT+ A +E PE++ G Sbjct: 306 GDHNMWGKHTNL-EQATRSPLIISLPKQKAQKSHTPAGLIDIFPTLCEAAGLEVPEVVQG 364 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD----ELYDRR 405 ++ V + + T ++ + + ELYD Sbjct: 365 TSLFPVINGEKDQHKNGAISFF---KSKGAKGYSYRTKRYRYIEWSKGNKVEAIELYDYE 421 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLD 433 NDP E NL ++ + AL + Sbjct: 422 NDPQEKINLATQQESKELIRTLSQALRE 449 >UniRef50_C6CXF5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CXF5_PAESJ Length = 509 Score = 435 bits (1121), Expect = e-120, Method: Composition-based stats. 
Identities = 115/493 (23%), Positives = 193/493 (39%), Gaps = 54/493 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M +PN L + TD Q + Y L+T +++LA G+ F A+ P C P+R+ +F Sbjct: 1 MSKPNILLIQTDQQTAETLSLYGNTALHTPALEALAERGVVFEQAFCNYPACAPSRSSMF 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH---------LDGHDYFGT 111 TG Y + N++ + T+ + K+ GY T IGK H G Sbjct: 61 TGRYCSTLNLHANHMLINPSEVTLPQVLKNHGYQTAIIGKNHAFTERPSSIYPGGVPENP 120 Query: 112 GECPPEWDA----DYWFDGANYLSELTEKEISLW--RNGLNSVEDLQANHIDETFTWAHR 165 +D D+ Y + + W + +S N + Sbjct: 121 SLLHEVFDYVRLADHGHMVDGYRDDPGAQAAHAWAVEHCWSSPLGHGTNPAPVEKCGTYL 180 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 + +D+L + ++PF +S+ +PH P+ P Y + D L K Sbjct: 181 LGETMLDYLAHLRQENQPFFTWLSFPDPHTPYQVPEPYASMIRPEDVPMPPV--DSLEGK 238 Query: 226 PEHHRLWAQAMPSPVGDDG--LYHHPLYFACNDFVDDQIGRV---INALTPEQRENTWVI 280 PE ++ D+ +++ F+DD + ++ ++AL ENT +I Sbjct: 239 PERVKVAHLMDAMDTADEQLIRQVRAIHYGMIRFIDDTLAKIFERMDAL--SLLENTVII 296 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALA 339 +TSDHG+ MGAH +I K YD T +P I+ P + ++ + ID++PT++ LA Sbjct: 297 FTSDHGDSMGAHGIIQKHNFFYDSFTHVPFIMSLPGYKGTKRTSNLLELIDIMPTLLELA 356 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFG----------------------- 376 I P G++ A E + +E G Sbjct: 357 GIPVPPGCQGKSHAAFLEGDLSVTPREYVVMESGEHGDPVKVSDITLRPEHPLDERYFVW 416 Query: 377 ------GFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 + T ++KL + ELYD + DP+E+HN D FA VR ++ Sbjct: 417 CAYSDAWIGKGKAIRTKEWKLCIYANGEGELYDLKADPHELHNRFPDPAFASVRIELERK 476 Query: 431 LLDYMDKIRDPFR 443 LL + + D Sbjct: 477 LLQWSMEKEDRLP 489 >UniRef50_A6DKS7 N-acetylglucosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKS7_9BACT Length = 515 Score = 435 bits (1121), Expect = e-120, Method: Composition-based stats. 
Identities = 109/488 (22%), Positives = 186/488 (38%), Gaps = 70/488 (14%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF+ +D AT VG Y +T NID +A+EGIRF+ + +C P+RA + TG Sbjct: 22 PNILFIFSDDHATQAVGSYGSIINSTPNIDRIASEGIRFDRCLVTNAICGPSRATILTGK 81 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 Y++ +G + N++ T + + AGY T IGKWHL D++ Sbjct: 82 YSHLNGFYKNDMYFDGRQITFPKLLRQAGYQTAVIGKWHLASLPT----------GFDHF 131 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEP 183 Y + + RNG + I+ +++L+ ++P Sbjct: 132 EVITGYGGQGKYYHPVMNRNGEPTKHR---------GYTTEVITKLNMEWLKNQRDPNKP 182 Query: 184 FLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR------------- 230 F++++ + PH + +Y+ + D + D K H + Sbjct: 183 FMLMMQHKAPHRAWLPSPKYMNAFKDKKFPKPANLHTDYQGKASHVKKQDMMIKDSMNPG 242 Query: 231 ----------------LWAQAMPSPVG--------------DDGLYHHPLYFACNDFVDD 260 W +A + + Y C +DD Sbjct: 243 DLKLTPPKYLDGADLANWHKAYDEENAAFAKAKLSGKALRSWNYQRYIRDYVRCVQSIDD 302 Query: 261 QIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER 319 IG V+N L ENT +IY+SD G +G H K MY++ R PL++R P + Sbjct: 303 SIGEVLNYLDESGLAENTLLIYSSDQGFFLGEHGWFDK-RFMYEEALRTPLVMRWPGKIK 361 Query: 320 RQV--DTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGG 377 S++D T + +A ++ P + G ++L + + + + + + Sbjct: 362 AGSVDSHITSNLDFAQTFLEVAGVKVPAEMQGASLLPIMKGQQPENWRESFYYHYYGYPD 421 Query: 378 F---IPVRCWVTDDFKLVL-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + +KL+ ELYDR+NDP E N D F + M L Sbjct: 422 WHLVQKHCGVTDGRYKLIHFYTTDEWELYDRKNDPEENINRASDPEFKSILQNMRKKLSQ 481 Query: 434 YMDKIRDP 441 +++ P Sbjct: 482 QRIQLKVP 489 >UniRef50_C7MEQ7 Choline-sulfatase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MEQ7_BRAFD Length = 520 Score = 435 bits (1120), Expect = e-120, Method: Composition-based stats. 
Identities = 118/507 (23%), Positives = 204/507 (40%), Gaps = 38/507 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + + D A +G Y T ++D+LAAE F+ AY +P+C P+RA + TG Sbjct: 4 RPNIVVIQADQMAAQALGAYGDTAARTPHMDALAAEAAVFDRAYCNTPLCAPSRASMMTG 63 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + + N +I T + + AGYHT +G+ H G D E D Sbjct: 64 RMPSDIDCFDNGSDFAASIPTFAHHLRAAGYHTALVGRMHFIGPDQHHGFEQ--RLTTDV 121 Query: 123 WFDGANYLSELTE--KEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + + + + + W + ++V + + + RA+ L RA Sbjct: 122 YPADMDMVPDWQRDLGDRLQWYHDADAVHTAGVSQATVQLDFDDEVGFRALRHLNDRVRA 181 Query: 181 DE------PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 D+ PFLMV S+ PH P+ P E+ +++AD + + H Sbjct: 182 DQAAGERVPFLMVASFIHPHDPYEPPQEHWDRFADVDIPAPRHPEVPDPAQDPHSHRLRA 241 Query: 235 ----AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM 289 ++ Y+A ++DD +GR+ L E+T V+ TSDHG+M+ Sbjct: 242 MSGFDQRETTEEEVRRARRSYYAAVSYIDDHVGRIRERLESLGLWEDTVVVVTSDHGDML 301 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIE----- 342 G L K + Y++ +R+PLI+ P+ + PVS +DL+PT++ L + Sbjct: 302 GEKGLWFK-MSPYEESSRVPLILHGPEHLVPAGRYANPVSLLDLMPTLLELGGADGATSA 360 Query: 343 ----KPEILPGENILAVKEPR---GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 G ++L IE+ + G P V K V+ Sbjct: 361 AAEATTPARQGLSLLESARRERSGTAGPADRDVIIEYLAEGTLRPQLTLVRGQHKFVVCP 420 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY--MDKIRDPFRSYQWSLRPWR 453 D+L+D DP+E N+ D A++ +++ A+ + + + + Q R Sbjct: 421 GDPDQLFDLHTDPHERTNIAADPAQAELVAELRAAVAAQYDLAALEEKVLASQA-----R 475 Query: 454 KDARPRWMGAFRPRPQDGYSPVVRDYD 480 + + + + R RP Y P Sbjct: 476 RRLVAQALQSGRSRPW-DYEPDPEQRY 501 >UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R014_9PLAN Length = 475 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. 
Identities = 97/469 (20%), Positives = 161/469 (34%), Gaps = 67/469 (14%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + ++ D + + C T +I+ LAA G+RF AY+ VC+P RA + TG Sbjct: 41 PNIVVILIDDMGFSDLSCMGSTYYETPSINKLAASGMRFTHAYSACTVCSPTRAAVLTGK 100 Query: 64 YANQSGPWT-------NNVA---------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHD 107 Y + N T+ GY T IGKWHL + Sbjct: 101 YPARLHLTDWIPGQMSNKTKLKLPDWNKQLNLEEITLAELLGAHGYTTASIGKWHLGPPE 160 Query: 108 YFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P G + + RNG V R++ Sbjct: 161 C-----EPTRQGFSLNIGGNSKGQPPSYF-FPYERNG---VLLPGLAEGKPNEYLTDRLT 211 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 + F+++ +PF + + + H P E + KY + Sbjct: 212 DACEAFIEE--NQSKPFFLYLPHYCVHTPLQAKPELIAKYEAKNAQFPGNP--------- 260 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 H Y A + +D +GR++ L + T VI+TSD+G Sbjct: 261 ------------------QHEAKYAAMVESLDQSVGRIMAKLDALDLTKKTIVIFTSDNG 302 Query: 287 EM-----MGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALA 339 + + + Y+ R+PLI+ P D P +DL PT+ L+ Sbjct: 303 GLVLREITSNLPARAGKGSAYEGGVRVPLIVSYPPMIKPGTTCDVPAISMDLFPTLAELS 362 Query: 340 DIEKPEILPGENILAVKEPRGV--MVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 + + G++I+ + E + + H GG P +++LV Sbjct: 363 GAKYSHDIDGKSIVPLLEEKPDAFAARPLYWHYPHYHGGGATPYSAMRVGNYRLVEFFED 422 Query: 398 SD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 ELYD +D EM NL D+ K+H L+ + + + + Sbjct: 423 GRLELYDLAHDIGEMKNLA--QEKPDLTEKLHRQLIAWRKSVDAQYATP 469 >UniRef50_C6J3H9 Sulfatase n=2 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J3H9_9BACL Length = 503 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. 
Identities = 129/492 (26%), Positives = 214/492 (43%), Gaps = 55/492 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNFL + D + + C+ + T NID LA EG+ F AY +PVC P+RA L TG Sbjct: 2 KPNFLVFVVDQMQSRTLSCHGHPDVKTPNIDRLAREGVSFTRAYCNNPVCMPSRASLLTG 61 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL----------------DGH 106 + A Q G TN +A ++ T+ + GY T +GK H +G Sbjct: 62 LTARQHGVLTNGIALSEHFPTLPGVLSEHGYRTHAVGKLHHQPIGSVSREEQMEFSWEGM 121 Query: 107 DYFGTGECPPEWDADYWFDGANYLSEL---------------TEKEISLWRNGLNSVEDL 151 ++ +GE Y + +Y+ L + G +D Sbjct: 122 KFWESGEIRSIPSGYYGYQSVDYVGGHVTCFGDYLRWLEQVYPGGGKKLSKEGAYYADDK 181 Query: 152 QA----NHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 + E + + H I+ R++DFL+Q ++ D+PF + S+ +PHHPF Y E Y Sbjct: 182 IPMSWRIDLPEEYHYNHWIAERSIDFLEQMSQQDQPFFLWCSFPDPHHPFAACRPYSEMY 241 Query: 208 ADFYYELGEKA--QDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRV 265 L E ++D + + R S D + VD IG + Sbjct: 242 DPASLTLPEHWDVEEDGISWLKERRNIHPDYTSFDEHDLREILAQTYGMISHVDKTIGEI 301 Query: 266 INALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV-- 322 L + + NT +++ +DHGE +G+H LI+KG ++++ R+P I + P+ ++ Sbjct: 302 TKKLKELELDQNTVIVFLADHGEYLGSHHLITKGEWPWEELIRVPFIWKIPESMKKGYLN 361 Query: 323 DTPVSHIDLLPTMMALADIEK------------PEILPGENILAVKEPRGVMVEFNRYEI 370 + VS +D +PT++ LA IE P LPG ++ + E V+ Sbjct: 362 EQVVSLLDFVPTILDLAGIEPAVMDVRGVQYTEPLGLPGRSLRPIIEQGDVLPPGPAIVE 421 Query: 371 EHDSF--GGFIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKM 427 + + +R VT+ +K+ + L T D LYD + DP E NL D FA V+ + Sbjct: 422 YDEDWFPPNVCRMRTIVTERYKMTVYLNTEDGLLYDLQEDPYEQKNLWFDPSFARVKHIL 481 Query: 428 HDALLDYMDKIR 439 + +L + + Sbjct: 482 TEQMLRELVRTD 493 >UniRef50_A4CMA4 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=2 Tax=Flavobacteriales RepID=A4CMA4_9FLAO Length = 490 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. 
Identities = 119/491 (24%), Positives = 207/491 (42%), Gaps = 48/491 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSG-KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+ N +F++TD + +G L T N+D LAAEG +A+ + +C+P+RA + Sbjct: 28 KQRNVIFILTDDHRFDYMGFTGKVPWLETPNMDRLAAEGAYLPNAFVTTSLCSPSRASIL 87 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG++++ N N++ +Y ++AGY T ++GKWH+ + P Sbjct: 88 TGMFSHTHTIVDNQAPNPGNLTYFPQYLQEAGYQTAFLGKWHM------SSHTDEPRPGF 141 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W ++ + +L NG + ++ ++ AVD+L+ Sbjct: 142 DHW---ESFFGQGVYYNPTLNING-------ERIEYKDSTYITDLLTEHAVDWLE-SRDK 190 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 D+PF + +S+ H F + +YA EL + + V Sbjct: 191 DKPFFLYLSHKAVHAEFQPARRHKGRYAGKKIELPPTYEQTKTGAWRDLKWPEWVADQRV 250 Query: 241 GD-----------DGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEM 288 D Y VDD +G V+ L E + T VIY D+G Sbjct: 251 SWHGVDYMYHSNIDMQELVQAYCETLLGVDDSVGAVLEYLEEEGLDEETLVIYMGDNGFS 310 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEI 346 G H LI K Y++ ++PL++R P+ + V +ID+ PT++A A + +P+ Sbjct: 311 WGEHGLIDK-RHFYEESVKVPLLVRCPELFEGGQVPQDMVQNIDIGPTVLAEAGVAQPDD 369 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHD---SFGGFIPVRCWVTDDFKLVLNL--FTSDEL 401 +PG + + + + ++ E+ F V TD +K + + +EL Sbjct: 370 MPGVSFIPILTGDKDATKRDKIFYEYYWENDFPMTPTVFGMRTDKYKYIRYHGIWDRNEL 429 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWM 461 YD NDP+EM+NLI D +V M D+L ++++ P + RPRW Sbjct: 430 YDLENDPHEMYNLIGDPEKQEVIQTMLDSLYNWLETT-------DGMKIPLKSTDRPRWG 482 Query: 462 GAFRPRPQDGY 472 R + Y Sbjct: 483 DY---RHKGEY 490 >UniRef50_A3HTC6 Choline sulfatase n=5 Tax=Bacteria RepID=A3HTC6_9SPHI Length = 499 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. 
Identities = 98/475 (20%), Positives = 181/475 (38%), Gaps = 31/475 (6%) Query: 4 PNFLFVMTDTQATNMVGC-YSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN +F+ +D + +G + T NID LA G F +A+ +P+C P+R + TG Sbjct: 30 PNIVFIASDDL-NDWIGVLNGHPQVKTPNIDRLANRGTLFTNAHAQAPLCNPSRVSILTG 88 Query: 63 IYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + +G + + + T+ +YF+ GY T GK G + Sbjct: 89 LRPTTTGIYGLAPRHREVERTKEVVTLPQYFEKRGYRTLSTGKIFHGGITPTERAIEFQD 148 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 W D G + L ++ ++ +++++ AV+ + + Sbjct: 149 WGPD---GGHRPFPPSKIVKAPLDMIDHPLIDWGIYPVEHDSIMDDYKVASWAVEQINEI 205 Query: 178 ARAD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + PF + V +++PH P ++ + Y L D + P+ Sbjct: 206 GKGGDSNPFFLAVGFNKPHVPLYTSQKWFDLYPKDEIILPLAPFGDRNDIPDFAWNLHWY 265 Query: 236 MPSPV------GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM 288 +P P + Y A F+D Q+GRV++AL ENT +++ SDHG Sbjct: 266 LPEPRLSWLIANQEWENKVRAYLATISFMDAQVGRVLDALEENNLTENTIIVFWSDHGYH 325 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G + K ++++ T +PLI P + PV +D+ PT++ +A + K + + Sbjct: 326 LGEKDITGK-NSLWERSTHVPLIFAGPGVSKGAISSQPVELLDIYPTLVEMALLSKNDAV 384 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G ++ E + T+ ++ + S+ELYD D Sbjct: 385 EGISLKPQLEDANAKRTKPAITTHNPGNN------AVRTERWRYIKYADGSEELYDHYRD 438 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMG 462 E NL + ++ L ++ K P + KD W G Sbjct: 439 DEEWSNLAYLDEY----RELKAELSKWLPKTSAPLAEGSKHRILYEKDGVWYWEG 489 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. 
Identities = 104/462 (22%), Positives = 175/462 (37%), Gaps = 48/462 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L + D +G T ID+LA G RF Y+ PVC+P RA L TG Sbjct: 28 KPNVLLIFIDDLGKTDIGIEGSSFYETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTG 87 Query: 63 IYANQSGPWT-----NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + G ++VA ++ T+G+ F++AGYHT Y+GKWHL P Sbjct: 88 KMPQRLGITDWIRPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLGHKPQQH----PAA 143 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID--ETFTWAHRISNRAVDFLQ 175 D W G N+ + + + + +++ A++ L Sbjct: 144 RGFD-WTKGVNHGGQPSSYYFPYKNPQKPDAPNNVPDFEKCQPEDYLTDVLTSSAIEHL- 201 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--ADFYYELGEKAQDDLANKPEHHRLWA 233 Q PF + +++ H P P +EKY + + + + R Sbjct: 202 QQRDRTRPFFLCLAHYAVHTPIQPPKNLVEKYQVKLATQKNPKSPGEGIQEGSAISRS-- 259 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM--- 289 HP Y A + +D Q+GR+++ L + + T V++TSD+G + Sbjct: 260 -----------QQDHPAYAAMVENLDTQVGRLLDELKTQGILDQTIVVFTSDNGGLCTLN 308 Query: 290 -------GAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADI 341 L + Y+ RIP I P + +D P D+ PT+++L I Sbjct: 309 GKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPGKISPQVLDIPAYTCDIYPTLLSLCQI 368 Query: 342 E--KPEILPGENILAVKEPRGVMVEFNR---YEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 + + G ++ + + E R + H G P +KL+ L Sbjct: 369 PPRPTQHVDGISLAGLLTKSSSLPESERTLVWYYPHTHGSGHKPSAAIRQGPWKLIHFLE 428 Query: 397 -TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 ELY +DP E NL + + ++ L ++ Sbjct: 429 TDRIELYHLEDDPGESRNLAS--KHPERALQLQKELQKIIES 468 >UniRef50_UPI0001C36159 sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36159 Length = 497 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. 
Identities = 119/473 (25%), Positives = 194/473 (41%), Gaps = 47/473 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N ++ TD Q + + ++T NID L EG+ F AYT +P+CTP+RA T Sbjct: 8 KKKNIVWFCTDQQRWDTIHSLGNPYIHTPNIDRLVKEGVAFTRAYTQAPICTPSRACFLT 67 Query: 62 GIYANQSG-PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL------------DGHDY 108 G Y + + N K+ + + D GY +GK HL DG+ Y Sbjct: 68 GRYPRTTKTIFNGNEKFSKDEKLVTKLLSDEGYTCGLVGKLHLTSAEGRVEKRCDDGYSY 127 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISL-------------WRNGLNSVEDLQANH 155 F P D+ G +Y + L EK + W N + Sbjct: 128 FQYSHHPHN---DWKDGGNDYQNWLNEKGVHWEEIYGGKFMTMATWPPQANPSFSGKQVG 184 Query: 156 IDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 + + + +DF++ + EP+L+ V+ +PH P P EY ++ L Sbjct: 185 VPAQYHQTTWCVEKTIDFIETRRNSGEPWLISVNPFDPHPPLDPPQEYKDRLNVEEMPLP 244 Query: 216 EKAQDDLANKPEHHRL---------WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 ++ KP H + A+ + S ++ Y+A + +DDQ GR++ Sbjct: 245 LWEDGEMEGKPPHQQKDVIQGGQDGQAEPIGSLTEEEKRERFRDYYAEIELIDDQFGRLL 304 Query: 267 NALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ--VD 323 + L E+T +I+ SDHGEM G H L KGA Y+ + +PLII P ++ D Sbjct: 305 SYLDQTGLREDTIIIFMSDHGEMSGDHGLYWKGAYFYEGLVHVPLIISCPSIFKQGFLCD 364 Query: 324 TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFN-----RYEIEHDSFGGF 378 V +D+ PT+M A +E P + G + + F + Sbjct: 365 ALVELVDIAPTLMEAAGLEVPYFMQGRSFYDILTGEADPHHFKDAVYSEFYHCLRGTHED 424 Query: 379 IPVRCWVTDDFKLVLNLFTS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 I + +KL++ ELYD D NE HNL D + +++++ Sbjct: 425 IDATMYYNGRYKLIVYHGKEFGELYDHETDQNEFHNLWDKPEYEALKTELIRK 477 >UniRef50_Q7UW58 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UW58_RHOBA Length = 541 Score = 432 bits (1113), Expect = e-119, Method: Composition-based stats. 
Identities = 120/479 (25%), Positives = 206/479 (43%), Gaps = 24/479 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +R N LF+++D T +GCY + T NID LAA G+ F +A P+C P+R + Sbjct: 65 QRKNVLFLISDDLNTR-IGCYGDPIVQTPNIDRLAARGVLFENAACQYPLCGPSRNSMLC 123 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGK-WHLDGHDYFGT--GECP 115 G+Y + +G N +I ++ + F+ GY +GK +H + GT + P Sbjct: 124 GLYPDTTGIHGNAQIFRDSIPERWSLPQAFRLDGYFAGRVGKLYHYNVPKSVGTNGHDDP 183 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 W+ + G + L E +L + A+ + +++ A L+ Sbjct: 184 ASWELELNPAGCDRLIE-EPDIFTLRKGAFGGTLSWYASPRPDEAHTDGMLADDASWVLE 242 Query: 176 Q-PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 + R D PF + V + PH P+ P EY E Y L + ++D A+ P L + Sbjct: 243 RCAKRNDRPFFLAVGFYRPHTPYVAPKEYFEPYKLEDMPLFDNVEEDNADVPAAALLSKK 302 Query: 235 AMPSPVGDDGLYH-HPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAH 292 + D+ Y+A F+D Q+G+V++ L + NT V++TSDHG +G Sbjct: 303 KEQDLLNDELRRQAIQAYYASTTFMDAQVGKVLDTLKRTGLDKNTIVVFTSDHGYFLGEK 362 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L K A++D + +PLII P E +PV +DL PT+ L D+ +++ G++ Sbjct: 363 GLWQK-QALFDKVAGVPLIIAEPGRTEGAIAKSPVGLVDLYPTLAELCDVPTQKLMQGQS 421 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGF---IPVRCWVTDDFKLVLNLFTSD--ELYDRRN 406 ++ + + + T+ ++L L ELYD +N Sbjct: 422 LVPMLRDPSQTGRGYSMSMVARNDRQTKQRYYGYSIRTERYRLTLWDDGKRGTELYDHQN 481 Query: 407 DPNEMHNLI-----DDIRFADVRSKMHDALLDYMDK-IRDPFRSYQWSLRPWRKDARPR 459 DP E NL +D A V ++ + L M + + ++ + W R Sbjct: 482 DPEEFTNLAHGERKNDPNNAKVIRELTEKLKAEMANGMPASGKRTEYKVGNWNPMLRID 540 >UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XI1_PSEA6 Length = 510 Score = 432 bits (1111), Expect = e-119, Method: Composition-based stats. 
Identities = 103/495 (20%), Positives = 199/495 (40%), Gaps = 57/495 (11%) Query: 1 MKRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 + +PN L ++ D + + Y+ +T NID LA++ + F + Y +PVC+P+R L Sbjct: 36 VTKPNVLLILVDDLGYSDIKAYNENSFYDTPNIDKLASQSVMFTNGYAANPVCSPSRFAL 95 Query: 60 FTGIYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKW 101 TG + + N A + T+ FK GY+T ++GKW Sbjct: 96 LTGKHPTRGKATDWFPANDKPARAGRFLPAEFNDALPLSEITLAEAFKQNGYNTAFLGKW 155 Query: 102 HLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 HL + P D G + ++ + + Sbjct: 156 HLGKTEDL----WPENQGFDVNIAGTKNGHPAAGY--------FSPYKNARLTDGPKGEY 203 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 R++N A+ + + ++ PF M++S+ H P P + +++Y ++ + A +D Sbjct: 204 LTQRLTNEAISLVDKYSKQTVPFFMMLSFYTVHTPLAAPNKDVQEY---QAKIRQYAHND 260 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVI 280 + E A+ +HP Y A +D Q+GR++ L E T V+ Sbjct: 261 EFQREEQVWPTAEKREV----RVKQNHPTYAAMVKQMDTQVGRLLAKLKQAGMEESTLVV 316 Query: 281 YTSDHGEMMGAHK-------LISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDL 331 +TSD+G + A L +Y+ R+PL+++ PQ + + Q++ PV+ DL Sbjct: 317 FTSDNGGLSSAEGSPTSNLPLRGGKGWLYEGGIRVPLLVKLPQKKHKHLQINEPVTSTDL 376 Query: 332 LPTMMALADIE--KPEILPGENI----LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWV 385 PT+++ ++ + L G ++ + +M + H S G P Sbjct: 377 YPTLLSAGHLDLLPQQHLDGVDLNQYFSPGAKRDALMRRPLYFHYPHYSNQGGFPGAAIR 436 Query: 386 TDDFKLV-LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 ++KL+ LY+ ND E +L + + + + + L ++ + F Sbjct: 437 QGNWKLIERFEDGKVHLYNLANDIGEQIDLAN--QAPERVASLRKKLHEWYQQTSARFLK 494 Query: 445 YQWSLRPWRKDARPR 459 + + PW+ D + Sbjct: 495 AKGNKTPWQPDFKAE 509 >UniRef50_B4D780 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D780_9BACT Length = 496 Score = 432 bits (1111), Expect = e-119, Method: Composition-based stats. 
Identities = 107/466 (22%), Positives = 202/466 (43%), Gaps = 48/466 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN LF++ D N + C L T NID +A EG+RF + + + +C+P+RA + + Sbjct: 27 KRPNVLFILCDDIRWNAMSCAGHPALKTPNIDRIANEGVRFANMFCTTSLCSPSRASILS 86 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G+YA+ G N + + ++GY T Y+GKWH+ P D Sbjct: 87 GVYAHTHGVTNNFTEFPEKLVHWPMRLHESGYETAYMGKWHMG------EDNDAPRPGFD 140 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 ++ A + + + + NG S + +++ A+D+L++ Sbjct: 141 FF---ATHKGQGKYWDTAWNINGAGSKVIP--------GYYTTIVTDMALDWLKK-DHGG 188 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL---------- 231 +P+ + + + PH +T +Y + + E A L +KP + Sbjct: 189 KPWALCIGHKAPHSFYTPEEKYAHVFDNVRVPYPESAF-HLEDKPTWMKQRLYTWHGIYG 247 Query: 232 ----WAQAMPSPVGD---DGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTS 283 W + P + D Y+ VDD +GR++ L +Q +NT +++ Sbjct: 248 PLFEWRKKFPDDRPEAVKDFENMVHGYWGTILSVDDSVGRLLKYLEDTKQLDNTIIVFMG 307 Query: 284 DHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMALADIE 342 D+G + G H ++ K ++ RIP+++R P + +V+ +D+ P+++ L + Sbjct: 308 DNGLLEGEHGMVDK-RTAHEPSMRIPMLVRYPGLAKGKVEEGQALTLDVAPSLLELCGAK 366 Query: 343 KPEILPGENILAVK-EPRGVMVEFNRYEIEHDS-FGGFIPVRCWVTDDFKLVLNLFTS-- 398 + + G++ + + E + YE ++ F VR TD++K V Sbjct: 367 PLDNIQGKSWVKLVREGDPTWRKSWFYEYNYEKQFPYTPNVRAIRTDEWKYVHYPHGDGT 426 Query: 399 -----DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELY+ + DPNE HNL+ D + A ++ L++ M + Sbjct: 427 PDRYIGELYNEKTDPNEDHNLVKDPQQAGRIEELKKLLVEKMKETG 472 >UniRef50_C0QY53 Sulfatase n=2 Tax=Brachyspira RepID=C0QY53_BRAHW Length = 474 Score = 432 bits (1111), Expect = e-119, Method: Composition-based stats. 
Identities = 119/473 (25%), Positives = 201/473 (42%), Gaps = 27/473 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + D + + Y + T ++ LA G F +++ SPVCTP+RA +FT Sbjct: 4 KKPNIILITADQMRADSIE-YINDEVKTPVLNELAENGSVFTNSFCTSPVCTPSRASIFT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y G W ++ T+ Y K+ Y GK H E D Sbjct: 63 GRYPMNIGAWNIGTELNEDEVTLADYLKEDNYFNVASGKMHFRPQLKNLNWEFEDVPKRD 122 Query: 122 ---------YWFDGANYLSELTEKEISLWRNGLNSVEDLQA-----NHIDETFTWAHRIS 167 Y FD + + + E + N ++ N I E + + Sbjct: 123 RVRERDKTYYGFDITHITEDDKQGEYLDFANSHGCNLEIGKGIDGINPIPEELHQTYWTA 182 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +A+D + D+P M VS+ +PHHPF +Y + Y D + +PE Sbjct: 183 QKAIDEIDNF-NFDKPLFMWVSFVDPHHPFDPIKKYYDIYKDIKPKELNSKLKLDKKRPE 241 Query: 228 HHRLWAQAMPSP--------VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 H P ++ LY+ F+D QIGR+I+ L + +NT Sbjct: 242 HLTKQGDRGYWPGGGEEHHYSQEEIKEIKKLYYGMISFIDSQIGRIIDKLKEKNEFDNTI 301 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMAL 338 +I+TSDHGE +G + L+ KG MYD + ++PL+ + + D + +ID+LPT++ + Sbjct: 302 IIFTSDHGEYLGDYGLLKKGPFMYDCLIKVPLLFYGKGIVKNRSDEIIENIDILPTILDM 361 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNR-YEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 E P + G +I + + + I +D+ I ++ + T +KL + L Sbjct: 362 LGKEIPYGIQGHSIKNILIGEDKNKTYKKGAVITYDAHDRGIFIKTYRTKQYKLSIFLDE 421 Query: 398 S-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 E YD DPNE +NL + + ++++K+ + M + DP S Sbjct: 422 EYGEFYDLEKDPNEENNLFFNKEYDEIKNKLLLEMCHKMIECSDPLNRRYASW 474 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. 
Identities = 112/500 (22%), Positives = 180/500 (36%), Gaps = 80/500 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + ++ D VG K + T +ID LA G+ F AY +P C P RA L + Sbjct: 41 RPPNIILILMDDMGWRDVGFMGNKFVETPHIDRLAKTGLVFTQAYASAPNCAPTRACLMS 100 Query: 62 GIYANQSGPWT-------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWH 102 G YA + G +T + N+ T+ +D GY T + G W+ Sbjct: 101 GQYAPRHGIYTVVDPRQPPGSPWHKWQAAESKSELDTNVVTIAEALRDGGYATAFFGMWN 160 Query: 103 LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 L G P F + L + + +G Sbjct: 161 LG------RGRTGPVTPGGQGFQKVVFPENLGFGKDEYFDDG--------------KHYL 200 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 R+++ + F+ + ++PF + + H PF E L KY Sbjct: 201 TDRLTDEVLKFVDEHR--EQPFFVYLPDHAIHAPFNPKPELLAKYERKA----------- 247 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIY 281 + P A + VD +GR+++ L + +NT VI+ Sbjct: 248 -----------------AASNDRRDDPACAATIEAVDHNVGRIMDHLKRLKLSDNTVVIF 290 Query: 282 TSDHGE-MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 TSD+G L +Y+ R+PL++ P + + D PVS IDL PT++ L Sbjct: 291 TSDNGGTQQYTPPLRGGKGELYEGGIRVPLVVAGPGVKSLGSRCDVPVSSIDLYPTLLEL 350 Query: 339 ADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A I+ PE L G ++ + + + + G P DFKL+ Sbjct: 351 AGIKPPEGQVLDGVSLAPLLQGDATLDRERLFWHFPCYVGKATPSSAMREGDFKLIEFFE 410 Query: 397 --TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS-YQWSLRPWR 453 EL++ +NDPNE NL + D + + L + K S P Sbjct: 411 EGGRVELFNLKNDPNEEKNLASVM--PDKAAALAKTLRAWQKKTNASIPPGPNPSYDPQA 468 Query: 454 KDARPRWMGAFRPRPQDGYS 473 + R G +P+ G Sbjct: 469 ERPRGNQGGGRPDKPKRGRQ 488 >UniRef50_A6LF65 Choline-sulfatase n=26 Tax=Bacteroidales RepID=A6LF65_PARD8 Length = 520 Score = 430 bits (1107), Expect = e-119, Method: Composition-based stats. 
Identities = 112/458 (24%), Positives = 183/458 (39%), Gaps = 21/458 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +P+ + +MTD Q + +GC K + + NID LA EG F S Y+ +P TP RAGL TG Sbjct: 45 KPHIILIMTDQQRGDALGCMGNKAVISPNIDRLAQEGSLFVSGYSSAPSSTPGRAGLLTG 104 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGECPP 116 + G K M + ++ GY+T IGK H L G E Sbjct: 105 MSPWHHGMLGYGRMALKYRYEMPQMMRNLGYYTFGIGKMHWFPQKALHGFHATLIDESGR 164 Query: 117 EWDADYWFDGANYLS-ELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 D+ D + + K+ L G N +DE A + ++ Sbjct: 165 VESPDFISDYREWFQLQAPGKDPDLTGIGWND-HAAGVYKLDERLHPTAWTGQTACELIR 223 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ---DDLANKPEHHRLW 232 D+P + VS+ PH P+ P YL+ Y D K Sbjct: 224 NYDN-DKPLFLKVSFARPHSPYDPPQRYLDMYKDADIPKPHIGDWCGQYAEPKDPLQGAS 282 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 + + Y+A F+DDQ+G++I L + +N + +T+DHG+M+G Sbjct: 283 DAPFGNFGDAYAINSRRHYYANITFIDDQVGQIIQTLKDKGMYDNALICFTADHGDMLGD 342 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGE------RRQVDTPVSHIDLLPTMMALADIEKPE 345 H K Y+ IP I++ P G ++ PV D LPT + +A P Sbjct: 343 HYHWRK-TYPYEGSAHIPYIVKWPAGISKSIPDGSSIEQPVELRDFLPTFIDIAGGSVPP 401 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN-LFTSDELYDR 404 + G ++L + + + + K + N S++L+D Sbjct: 402 DMDGRSLLKLIQGQQEQWRPYIDMEHATCYSDDNYWAALTDGKIKYIWNFHNGSEQLFDL 461 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 R DP E HNL +D + + S++ +++++ + D F Sbjct: 462 REDPGETHNLSEDAAYQNKLSELRKMMVEHLSERGDSF 499 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. 
Identities = 98/471 (20%), Positives = 166/471 (35%), Gaps = 75/471 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F++ D GCY +T +ID LA +G+RF AY +PVC+P RA L TG Sbjct: 31 KPNIVFILIDDMGCKDAGCYGATNFSTPHIDRLANQGMRFTDAYA-APVCSPTRASLMTG 89 Query: 63 IYANQSGPW------------------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLD 104 + + N + T+ + GY IGKWHL Sbjct: 90 KHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHLG 149 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 + G P D ++ + Sbjct: 150 --EEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPFV----DQQKWPYAGPLPGNPGDYLPD 203 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 R+++ A+DF+++ + PF + +S+ H + P + KY + E Sbjct: 204 RLTDEAIDFVRE--NRERPFFLYLSHWSVHGRYFAPESLIAKYRERGLEERP-------- 253 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 +Y A + VD+ +GR++ L +NT ++ S Sbjct: 254 ------------------------AIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMS 289 Query: 284 DHGEMMGAH-----KLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMM 336 D+G G L ++Y+ R+PLI+R P PV DL PT + Sbjct: 290 DNG---GERITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPTFL 346 Query: 337 ALADIEK-PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGG-FIPVRCWVTDDFKLVLN 394 A+ L G +I + ++ + + G P +KLV + Sbjct: 347 DFAERSYRDNKLDGHSIAGLLTGEQSELDRDALYWHFPHYWGSTRPCSAMRQGRWKLVEH 406 Query: 395 LFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 L T +LYD +DP E +L + +++ L + K+ + Sbjct: 407 LETGRAQLYDLSSDPGEQRDLAN--EMPQQATELRKMLAQWRTKVGAQMPT 455 >UniRef50_Q482B9 Sulfatase family protein n=1 Tax=Colwellia psychrerythraea 34H RepID=Q482B9_COLP3 Length = 511 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. 
Identities = 121/472 (25%), Positives = 205/472 (43%), Gaps = 32/472 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+ D N +G Y + + NID+LA +GIRF+ AY+ SP+CTP+R+ TG+Y Sbjct: 42 NVLFITIDDL-NNDLGAYGHHLVKSPNIDALAKKGIRFDKAYSQSPMCTPSRSSFMTGLY 100 Query: 65 ANQSGPWTNNVAPG---------KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGEC 114 +Q+G + ++T+ + FK+ GY + +GK +H + GT Sbjct: 101 PDQTGIIAHGSHTQMTAHFREHIPKVTTLPQLFKNNGYFSGRVGKIYHQGVPNQIGTSGA 160 Query: 115 PPEWDADYWFDGANYLSELTEK-----EISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 + ++ +K E +L R V A D+ +++ Sbjct: 161 DDAASWHETVNPIGLDKDVEDKIIAFNEKALVRQSFGGVLSFLAIGDDDKAHTDGKVATE 220 Query: 170 AVDFLQQPA--RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 ++ ++ + +PF + + PH PF P +Y + Y + ++D + P+ Sbjct: 221 TINMIKDHHPDKTGKPFFIGAGFYRPHTPFVAPKKYFDLYPLEKIKPYIAPKNDRKDIPD 280 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 + + Y+A +VD Q+GRV++AL + +NT V++ SDHG Sbjct: 281 IALQDREGQVGLTLNQRKQIIQGYYAAVSYVDAQVGRVLDALKQQDLSDNTIVVFLSDHG 340 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKP 344 +G H L K ++++ R PLII +P + R V +PV +D+ PT+ L + P Sbjct: 341 YELGQHGLWQK-GSLFEGSARAPLIIYAPNVKDNGRVVTSPVELVDIYPTLAKLTGLVAP 399 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGG--------FIPVRCWVTDDFKLVLNLF 396 E L G+++ V Y + G I T+ ++ Sbjct: 400 EYLAGKDLTPALNDVDFQVRKGAYSAILNRNKGDNNQFAFTKIRGHSIRTNRYRYTEWGE 459 Query: 397 T--SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 ELYD +NDP E+ NL D + VR KM L D MD + +S + Sbjct: 460 GYFGAELYDHKNDPQELKNLADKVSLESVRIKMKWLLNDAMDDAQKRIKSIE 511 >UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UQ05_RHOBA Length = 525 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. 
Identities = 105/496 (21%), Positives = 177/496 (35%), Gaps = 74/496 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L + D +GCY T ID+LA GIRF +AY PVC+P RA + T Sbjct: 52 SRPNVLLFLVDDLGWADLGCYGSTYHETPQIDALAESGIRFTNAYAACPVCSPTRASIMT 111 Query: 62 GIYANQSGPW-------------------TNNVAPGKNISTMGRYFKD-AGYHTCYIGKW 101 G + + + + T+ + +D A Y T ++GKW Sbjct: 112 GRHPVRVDITDWIPGMSTDRAQNPRFQHVDDRDNLALDEVTIAEHLRDAADYQTFFLGKW 171 Query: 102 HLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 HL G P + G + S S W+N + + Sbjct: 172 HLGD-----VGHLPTDQGFQINIGGGHKGSPPGGY-YSPWKNPYLKAKQ-------DGEY 218 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 R+++ AV + +R D+PF M++SY H P T ++ + + Sbjct: 219 LTTRLTDEAVSLVDTASREDKPFFMMMSYYNVHSPITPDKRTIDHFEEKQ---------- 268 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVI 280 +N PE G +P Y + VD +GR++ AL +NT VI Sbjct: 269 -SNSPELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDTSVGRIMKALKEHGVDDNTLVI 327 Query: 281 YTSDHGEM--------MGAHKLISKGAAMYDDITRIPLIIRSPQ------------GERR 320 + SD+G + L + +Y+ R PL++R P+ + + Sbjct: 328 FFSDNGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLLVRLPKTMPGGATNETVSHQPK 387 Query: 321 QVDTPVSHIDLLPTMMALADIE--KPEILPGENILAVKEPRGVM----VEFNRYEIEHDS 374 VD+ DL PT++ + + G ++L + H Sbjct: 388 TVDSVACSTDLFPTILDVVGLPLQPESHADGISLLPAIAGEAAETDSSPRDLHWHYPHYH 447 Query: 375 FGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + P ++KL+ + ELYD D E +L + +++ DAL Sbjct: 448 GSLWRPGAAIRRGNYKLIEFYETDTAELYDLSVDMGETKDLSKTE--PERFAELRDALRQ 505 Query: 434 YMDKIRDPFRSYQWSL 449 + ++ + Sbjct: 506 WQTEMNAKMPVPNPNF 521 >UniRef50_A6DM50 Choline sulfatase n=6 Tax=Bacteria RepID=A6DM50_9BACT Length = 647 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. 
Identities = 109/455 (23%), Positives = 196/455 (43%), Gaps = 34/455 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 K+PNF+F+ D Q+ +G Y + T N+D L GI F Y VC +RA Sbjct: 26 KKPNFMFIFADDQSYESIGAYGQLNIKTPNLDRLVKRGISFTHTYNMGAWGGAVCVASRA 85 Query: 58 GLFTGIYANQS--GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 L +G + N++ G K + AGY T GKWH+ G+ F + Sbjct: 86 MLNSGRFVNRAEKGV--------KQYPHWSQIMNSAGYTTYMTGKWHVHGNPRFDVMKDV 137 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + L+ + + Q W +++ + F + Sbjct: 138 RG-----GMPNQTPARYKRTFKPELYESEWLPWDKRQQGFWRGGTHWTQVVADNTLTFFE 192 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----HHRL 231 + ++PF M ++++ PH P P EY++ Y ++ E + E R Sbjct: 193 KVKNDNKPFFMYLAFNAPHDPRQAPKEYVDMYPLDSIKIPENYMPEYPYAAEICGKKLRD 252 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG 290 A + Y+A ++D IGR+++AL ENT++I+T+DHG G Sbjct: 253 EVLAPYPRTTYAVKRNRQEYYASITYMDHHIGRMLDALEASGKAENTYIIFTADHGLAAG 312 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 H L+ K +MY+ R P I+ P + ++DTP+ D + T + LA +EKP + Sbjct: 313 HHGLMGK-QSMYEHSMRPPFIVVGPGIKQNSKIDTPIYLQDAMATAIELAGVEKPAHVEF 371 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF--TSDELYDRRND 407 ++++ + + V+++R ++ R + DD+KL+ L++ +ND Sbjct: 372 KSLMPLIKGEK-TVQYDRIYGKY-----MNTQRMILKDDWKLIFYPHAAKKMRLFNIKND 425 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 P EM++LID+ +A ++ ++ ++ DP Sbjct: 426 PAEMNDLIDNPEYATKIQELKREFVELQKEMGDPL 460 >UniRef50_D2MLH4 Sulfatase family protein n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MLH4_9BACT Length = 476 Score = 429 bits (1105), Expect = e-119, Method: Composition-based stats. 
Identities = 127/470 (27%), Positives = 200/470 (42%), Gaps = 35/470 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L++ TD Q + + + + T N+D L AEG+ F A+ S +CTP+R+ T Sbjct: 4 KRPNILWICTDQQRYDTIHALGNEHIQTPNLDRLCAEGVAFTHAHCQSAICTPSRSSFLT 63 Query: 62 GIYANQ-SGPWTNNVAPGKNI--STMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE- 117 G+Y + G N N + + DAGY GK HL Sbjct: 64 GLYPSTVHGNRNGNAYFPANERVQLITKRLADAGYDCGLSGKLHLASAWNGEEQRVDDGY 123 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLN-------SVEDLQANHIDE---TFTWAHRIS 167 Y + + SL G++ + N+ + + + Sbjct: 124 RKFWYSHSHNQGIGNGNQYTDSLTEQGMDLGDVFQTKKDGTYGNYRPDMNPQYHQTTWCA 183 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +RA++F++ P D P+LM V+ +PH PF P + KY + D + Sbjct: 184 DRAIEFIESP--HDSPYLMSVNPFDPHGPFDAPDTH--KYNPADLPPPIFRESDQQTQTR 239 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG 286 R +A +P GD ++ Y+ +D+ +GR++NAL RENT VI+TSDHG Sbjct: 240 LKRFFADKEGNPPGDREQHNKASYYGMIALIDENVGRMLNALERTGQRENTIVIFTSDHG 299 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKP 344 EM+G H L KG Y+ + R+PLII P + + D + +D+ PT+ LA I Sbjct: 300 EMLGDHGLTGKGCRFYEALVRVPLIISWPGTFLQGHRADGLTALLDIAPTLADLAGIPL- 358 Query: 345 EILPGENILAVKEPRGVMVEFNRYE--IEHDSFGGFIPV----------RCWVTDDFKLV 392 E G++++ + + + +D F P D +KLV Sbjct: 359 EWTHGKSLIPILTGEHPGHAHHDFVRCEYYDVVDKFAPHASEKHKPCWATMLRNDRYKLV 418 Query: 393 LNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 + ELYD DP+E HNL D AD++ ++ D+ DP Sbjct: 419 VYHDEDYGELYDLWEDPDEFHNLWKDPSRADLKYQLTKQNFDHTVICADP 468 >UniRef50_A7A9X1 Putative uncharacterized protein n=2 Tax=Parabacteroides RepID=A7A9X1_9PORP Length = 480 Score = 429 bits (1105), Expect = e-119, Method: Composition-based stats. 
Identities = 96/464 (20%), Positives = 187/464 (40%), Gaps = 29/464 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAY----TCSPVCTPARA 57 K+PN + ++ D + + + + T N+D LA E F +A+ T V P+RA Sbjct: 23 KKPNIILILADDMRASGMNFLGKEQVQTPNLDKLAGESTVFTNAHIMGGTSGAVSMPSRA 82 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L TG Y + + +G + AGY+T + GKWH + Sbjct: 83 MLMTGKYLYN--LEKQGATIPNSHTMIGETLQKAGYNTFHTGKWHSSYEALNRCFKEGKA 140 Query: 118 WDA----DYW----FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 D+W +D ++ + + + N VE ++ ++ Sbjct: 141 IFFGGMWDHWNVPLYDYHADMNYGKRRPVIHNQAKSNKVEYEIGEYMYSGKHSVDIFTHE 200 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD--DLANKPE 227 AV+++QQ ++PF + V+Y PH P + P EY++ Y +L + N Sbjct: 201 AVEYIQQQKDKNQPFFLSVAYMSPHDPRSMPDEYMQLYDQSQIQLPPNFMEKHPFDNGEL 260 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 R A D+ H Y+A VD ++G +I L ENT +I+ D+G Sbjct: 261 EIRDEILAAIPRRPDEIKKHIREYYAMISHVDKRVGNIIQTLKDNGLYENTIIIFAGDNG 320 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPE 345 +G H L+ K +Y+ +PL+I++ ++ ID+ PT+ + + P+ Sbjct: 321 LAVGQHGLMGK-QNVYEHSVGVPLMIKAAAQHTGKKTADLCYLIDVFPTLCDMLQLPVPQ 379 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD---ELY 402 + G ++L+ + + + ++ Y R +KL+ +L+ Sbjct: 380 SVDGISLLSSLDGKEPVRDYLYYSY-------MDNQRGISDGTWKLIEYHVNGKRTTQLF 432 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 + +NDP E ++L ++ ++ + + + + D + Sbjct: 433 NLKNDPWERNDLSGQKKYEKTIQRLREKMAEEQKRTNDTSVFWN 476 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 429 bits (1105), Expect = e-118, Method: Composition-based stats. 
Identities = 103/508 (20%), Positives = 174/508 (34%), Gaps = 83/508 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F++ D Y + +T +ID LA++G+RF Y PVC+P RA + G Sbjct: 34 RPNIVFILIDDMGWPDPVSYGNQFHDTPHIDQLASDGVRFTDFYAACPVCSPTRASIQAG 93 Query: 63 IYANQSGPWT----------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 Y + N I T G + A Y+T Y GKWHL Sbjct: 94 QYQARLHLTDFIPGHWRPFEKLIVPENAPHLPLEIVTPGELLQSANYNTAYFGKWHLGPE 153 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 + D Y + L R+ + I A + Sbjct: 154 S--------------HNPDQQGYQTSLVTGG----RHFAPRFRTTPSTRIPNKAYLADFL 195 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 +++ ++F++Q +PF + +S+ H P + + KY Sbjct: 196 TDKTIEFIRQ--NKSKPFFVQLSHYAVHIPLEAKQQMIRKYQQKPKP------------- 240 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 ++P+Y A VDD +GR++ AL + ENT VI+TSD+ Sbjct: 241 ----------------AYGINNPVYAAMVAHVDDSVGRIVAALEELKLTENTVVIFTSDN 284 Query: 286 GE----MMG------AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLP 333 G G L + ++Y+ R+PLII+ P + P ID P Sbjct: 285 GGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVPLIIKWPGVAAAGKTCAEPTISIDFWP 344 Query: 334 TMMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL 391 T +A E + G ++L + + + + + P D+KL Sbjct: 345 TFAEIAHTTLQEHQTIDGLSLLPLLKDPSSHLNREEIYFHYPHYHHSTPASAIRAGDWKL 404 Query: 392 VLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLR 450 + + ELY+ + D +E NL + + ++ L D+ + Sbjct: 405 IEFFADGNLELYNLQQDLSETTNLAA--KNPEKAVELQQKLADWRTRTGAALPVKNPKYD 462 Query: 451 PWRKDARPRWMGAFRPRPQDGYSPVVRD 478 P R + + Sbjct: 463 PARASEFWNRRTNQPVPERRKTRIDQTN 490 >UniRef50_D2R1A1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1A1_9PLAN Length = 486 Score = 429 bits (1104), Expect = e-118, Method: Composition-based stats. 
Identities = 110/453 (24%), Positives = 195/453 (43%), Gaps = 26/453 (5%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF++ D +GCY + T +ID LA+EG+RF A+ + C P+RA L +G Y Sbjct: 31 NVLFIIADDLTATALGCYGNQICQTPHIDRLASEGMRFTHAFCNATYCGPSRASLMSGYY 90 Query: 65 ANQSGPWTNNVAPGK--NISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTG----ECPPE 117 + +G +T +F+++GY+ + K +H+ TG + Sbjct: 91 PHATGILGYTSPRPAIGQRATWSEHFRNSGYYAARVSKIYHMGVPGDIETGSNGADDAAS 150 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQA-----NHIDETFTWAHRISNRAVD 172 WD + +G + + T + + +G V D+ R + + + Sbjct: 151 WDERFNIEGPEWKAAGTGETLEGNPDGKKPVMGGNTFVVVEADGDDLVHSDGRAALKTAE 210 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD-FYYELGEKAQDDLANKPEHHRL 231 ++Q + +PF + + PH PF P +Y E Y L K DD + P Sbjct: 211 LIRQHTQ--KPFFIACGFVRPHVPFVAPRQYFEPYLPYDKLPLPTKVADDWKDIPLAGIN 268 Query: 232 WAQAMPSPVGD-DGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 + ++ + + Y+A ++D Q+G+V++AL ++T VI+TSDHG + Sbjct: 269 YKTSVNMKMDERRQKKAIGGYYAAVSYMDAQVGKVLDALEQSGAADHTIVIFTSDHGYHL 328 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPG 349 G H +K ++ D +++PLIIR P + + V IDL PT+ +L +E PE L G Sbjct: 329 GEHDFWAK-VSLLDQSSKVPLIIRVPGKKPAVCHSLVELIDLYPTIASLCGLEVPERLQG 387 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD---ELYDRRN 406 +NI + + V + + + G + + EL+D + Sbjct: 388 KNIATLWDDPHKQVRDTAFSVAPMTQGFL-----LRDHQWSFIQYGEEGAKGLELFDVKA 442 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 DP + NL + DV L +++ +R Sbjct: 443 DPQQHTNLAQSPEYEDVVRGFQSKLKEHLQTLR 475 >UniRef50_Q029P1 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q029P1_SOLUE Length = 467 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. 
Identities = 116/448 (25%), Positives = 195/448 (43%), Gaps = 31/448 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D + +GCY + T N D LA EG+RF +A+ +P C P+R L TG Y Sbjct: 27 NLLVITNDQHRADCLGCYGNPVIRTPNTDRLAGEGVRFGNAFVHAPQCVPSRVSLHTGRY 86 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP--EWDADY 122 + TN+ ++ T+ + GY T +G+ Y G + + D Sbjct: 87 PHVHRVPTNSYDLPESEQTLAKVLNANGYRTACVGEMPFAPRAYTGGFQQVLASNREYDQ 146 Query: 123 WFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADE 182 + G ++ + + A DFL+ A D Sbjct: 147 FLAGHGLKFPKSDGPF-----------QAAPVPWTDDLDETAFFAGHARDFLK--ANRDR 193 Query: 183 PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ-----AMP 237 PF + +++ PHHPF P + + Y + ++ANKP + + + Sbjct: 194 PFFLDINFRRPHHPFNPPAPFDKMYLGAAFPPSHARPGEMANKPPQQKAALENSVGFDLR 253 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHKLIS 296 S D Y+ D IG V++ L + E+ T V++ +DHGEM+G H L+ Sbjct: 254 SMTPADLDRVKAYYYGMISENDKYIGTVLDELKSQGLEDRTVVVFNADHGEMLGDHGLLF 313 Query: 297 KGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILA 354 KG+ MYD +T++PLI+R+P R VD V +D++PT++ L I+ P + G++++ Sbjct: 314 KGSYMYDGVTQVPLILRAPGKLPARTVVDGLVEEVDVMPTLLELLGIDVPAGVQGKSLVP 373 Query: 355 VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN-LFTSDELYDRRNDPNEMHN 413 + + + + F ++ T ++KLV ELY DP+E+ N Sbjct: 374 LADNPKARHKDAVFA-------EFPTIKMARTREWKLVHYNKAKYGELYHLTEDPHELTN 426 Query: 414 LIDDIRFADVRSKMHDALLDYMDKIRDP 441 L DD ++A + M L D++ DP Sbjct: 427 LYDDPKYAPASADMQGLLADWLATSTDP 454 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. 
Identities = 102/445 (22%), Positives = 173/445 (38%), Gaps = 58/445 (13%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + + D Q + L+T N+DSLA EG++FN Y + VC P RA TG Sbjct: 43 PNIVIFLADDQGWGDLSHNGNTNLHTPNVDSLAKEGVKFNRFYVGA-VCAPTRAAFLTGR 101 Query: 64 YANQSG---PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 Y ++G T + T+ + FK AGY T GKWH P Sbjct: 102 YHARTGTIGVSTGQERFNSDEYTIAQAFKAAGYATGAFGKWHNGTQYPNH----PNAKGF 157 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D ++ + W + + + D + ++++A+ F++Q + Sbjct: 158 DEYYGFTSG----------HWGHYFSPMLDHNGTFVKGNGYITDDLTDKAMAFIEQQVQN 207 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 +PF + Y PH P P +Y +++ D +L P Sbjct: 208 HKPFFAYLPYCTPHSPMQVPDQYWDRFKDKQLKL--------------------HNREPD 247 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM--GAHKLISK 297 + + A + VD +GRV+ L + ++T VIY SD+G + K Sbjct: 248 REQPDHLRAA-LAMCENVDWNVGRVLKKLNSLRITDDTIVIYFSDNGPNGVRWNGDMKGK 306 Query: 298 GAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADI--EKPEILPGENIL 353 ++ + R P +IR P ++V+ IDLLPT+ LA I +P+ + G ++ Sbjct: 307 KGSLDEGGVRSPFVIRWPGHLPAGQEVNQIAGAIDLLPTLTDLAGIKRPEPKPIDGVSLK 366 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHN 413 + E F TD ++L ELYD DP + +N Sbjct: 367 PLMLNSKADWP------ERMIFSSLRNRVSVRTDQYRLSR----KGELYDMHADPGQRNN 416 Query: 414 LIDDIRFADVRSKMHDALLDYMDKI 438 + ++ +K+ A+ D+ + Sbjct: 417 IAKQK--PEITAKLQQAVTDWRQSV 439 >UniRef50_A6C1R0 Choline sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1R0_9PLAN Length = 492 Score = 428 bits (1101), Expect = e-118, Method: Composition-based stats. 
Identities = 110/477 (23%), Positives = 182/477 (38%), Gaps = 40/477 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTC----SPVCTPARA 57 +RPN LF+ +D Q + V Y + T N+D L G F +AY VC P+RA Sbjct: 33 ERPNILFLFSDDQRADAVAAYDNPHIQTPNLDQLVKAGFNFRNAYCMGSIHGAVCQPSRA 92 Query: 58 GLFTGIYANQSGPWTNNVAPG-KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 L +G V K + T+ + K AGY T GKWH + + Sbjct: 93 MLNSGRSLYH-------VPMDLKGVITLPQLLKQAGYETFGTGKWHNHRDSFQKSFTTGT 145 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 +S + + + G + + + AVDFL+Q Sbjct: 146 A-------AFIGGMSNHLKVPVVDLKEGKFENKRTGKK------FSSELFVDAAVDFLKQ 192 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK--AQDDLANKPEHHRLWAQ 234 A++PF V++ PH P P ++ Y + L + Q N R A Sbjct: 193 QP-AEKPFYAYVAFTAPHDPRMPPETAMKVYENSPPPLPKNFMPQHPFNNGWLTGRDEAL 251 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHK 293 + Y+ +D QIGR++ L + + NT VI++SDHG +G+H Sbjct: 252 TGWPRQPEIVREQLAEYYGMITHMDTQIGRILQTLKDKDLDKNTIVIFSSDHGLALGSHG 311 Query: 294 LISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 L+ K +Y+ + PLI + P + D V D+ PT+ L I+ P + G ++ Sbjct: 312 LLGK-QNLYEHSMKSPLIFKGPGIPMNKSSDALVYLYDIFPTVCELTQIQVPSGVEGSDL 370 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEM 411 + + V + D +R + +KL+ +L+D + D +E+ Sbjct: 371 APIWRGKSERVRDTLFTTYEDL------MRAVRDERWKLIRYPQIDKTQLFDLKEDRHEL 424 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRD--PFRSYQWSLRPWRKDARPRWMGAFRP 466 +L + + KM L ++ + D P S R R +P Sbjct: 425 KDLSEHPEQQERIKKMLAELKEWQKRTDDKQPLTSEHPKPEAIDLTGRKRKPDQHQP 481 >UniRef50_A4U8Q3 Sulfatase n=2 Tax=Bacteria RepID=A4U8Q3_9BACT Length = 556 Score = 427 bits (1100), Expect = e-118, Method: Composition-based stats. 
Identities = 119/490 (24%), Positives = 189/490 (38%), Gaps = 19/490 (3%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L VM D A ++ Y G T N++ LA EG+ F +AY P+C PAR L +G Sbjct: 58 PNILLVMMDQLAPQVLKPYGGTVCRTPNLERLAGEGVVFENAYCNYPICAPARFSLMSGR 117 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 ++ G + N + T Y + GYHTC GK H G D E D + Sbjct: 118 MPSRIGAFDNATEFPSEVPTFAHYLRAMGYHTCLSGKMHFVGADQLHGFED--RVTTDVY 175 Query: 124 FDGANYLSELTEKEISL--WRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA- 180 ++ S+ + W + + V D ++ + A +L A Sbjct: 176 PADFSWTSDWSLGPTFWEPWFHSVRIVRDAGPRRRSVNTSYDEEATVEACRWLHDHADRA 235 Query: 181 -DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 PF + S+ PH P+ P + + Y D + L + H R + Sbjct: 236 DGRPFFLAASFISPHDPYLAPPSHWDLYTDDGIDDPRVGDIPLEERDPHSRRLYYTIGRH 295 Query: 240 VGD----DGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKL 294 + D Y+A ++DD+IGR++ L +NT V+ T+DHG+M+G L Sbjct: 296 IETIGPADVRRARRAYYAVMSWLDDRIGRILETLKAIDADDNTIVVLTADHGDMLGERGL 355 Query: 295 ISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALAD-IEKPE---ILPG 349 K ++ R+PLI+ +P R+V VS +DL PT + A E PE + G Sbjct: 356 WLK-MNFFEWSVRVPLIVHAPTLYRARRVRENVSLLDLFPTFLEWAGDGELPELFAPIDG 414 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +I + + E+ G PV +K + L+D DP+ Sbjct: 415 ASIAGLAAGHSDGWP-DVVGSEYCGEGASSPVLMIRRGRWKYIHCEDDPPLLFDIEQDPD 473 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQ 469 E+ NL V + + + + D R + R + +G P Sbjct: 474 ELVNLAGTPEVGGVETDLAGEVCRWWDTAALKDRVIESQRRR-KFLHAALSVGRRTPWDW 532 Query: 470 DGYSPVVRDY 479 R+Y Sbjct: 533 QPVRDASREY 542 >UniRef50_UPI0001744DD5 choline sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001744DD5 Length = 469 Score = 427 bits (1100), Expect = e-118, Method: Composition-based stats. 
Identities = 103/449 (22%), Positives = 175/449 (38%), Gaps = 29/449 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYT----CSPVCTPARA 57 +RPN LF+ +D Q + + L T N+D L +G F AY VC P+RA Sbjct: 13 ERPNVLFLFSDDQRADTIAALGNTHLQTPNLDRLVRDGTTFTQAYCMGSNQGAVCVPSRA 72 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L +G + K +T F AGY T GKWH + Sbjct: 73 MLMSGRTLYRV------QEQLKGQATWPESFASAGYRTFMTGKWHNGAPSALRAFQEAKA 126 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + G + +++S G N N +++AV+F++ Sbjct: 127 ----VFLGGMGDPDAIPVQDMSSAGQGGNRQF---VNRRTVEKHCVELFADKAVEFVRAQ 179 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD--DLANKPEHHRLWAQA 235 ++ +P+L V+++ PH P P + E+ + E N R A Sbjct: 180 KQSSQPWLCYVAFNAPHDPRKAPPAWHEQTNANKPPIPENFLPVHPFNNGEMTVRDEKLA 239 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKL 294 Y+A F+D QIGR++ +L ++ T ++++SDHG +G+H L Sbjct: 240 PWPRTEPVIRQELADYYAAIMFMDSQIGRILESLRATGQDEKTIIVFSSDHGLAIGSHGL 299 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 + K ++YD PLI+ P + + +D+ PT+ LA + PE G +++ Sbjct: 300 MGK-QSLYDHSMHSPLILAGPGVPKGEKRAALCYLLDVYPTLGDLAGVNAPEGSEGLSLV 358 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMH 412 V + + G R D +KL++ +LYD ++DP E Sbjct: 359 PVLKGEEITRRQAIMT------GYRKVQRAVRDDQWKLIVYPQVNKMQLYDLKSDPAETR 412 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +L + A+ +M L + D Sbjct: 413 DLAREPGHAEEIDRMRTLLEKLQKENGDT 441 >UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Planctomycetaceae RepID=Q7UJQ8_RHOBA Length = 491 Score = 427 bits (1099), Expect = e-118, Method: Composition-based stats. 
Identities = 100/490 (20%), Positives = 176/490 (35%), Gaps = 69/490 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN +F++ D +GCY + + T +D +AAEG+RF Y + VC P+R+ L T Sbjct: 34 KRPNIVFILADDLGYGDLGCYGQELIQTPRLDQMAAEGMRFTDFYAGNTVCAPSRSVLMT 93 Query: 62 GIYANQSGPWTNNVA-------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G++ + N T+ + AGY T GKW L G Sbjct: 94 GMHMGHTHVRGNAGGPDMSKQSLRDENVTVAEVLQSAGYATALCGKWGLGDDALGGRDGL 153 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF-------------T 161 P + D+++ N + LWRN + D ++ Sbjct: 154 PRKQGFDHFYGYLNQVHAHNYYPEFLWRNETKVALRNEVQRRDRSYGGFTGGWATKRVDY 213 Query: 162 WAHRISNRAVDFLQQPA--RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 I+N A+ F+++ A A +PF + +S PH G Sbjct: 214 SHDLIANEAMGFIREKATDAATKPFFLYLSLTIPHA----------------NNEGTGMS 257 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTW 278 + P++ + A +D +GR+++ L Q + T Sbjct: 258 GNGQEVPDYGIYADKDWSDQDKGQ--------AAMITRMDSDVGRILDLLKELQIDEQTV 309 Query: 279 VIYTSDHGEMM-GAH---------KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPV 326 V+++SD+G G H L A+ + R+PLI+R P D Sbjct: 310 VMFSSDNGPHNEGGHNPKKFDPAGPLRGMKRALTEGGIRVPLIVRWPGTTPPGAVSDHIG 369 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 DL+ T LA + PE + R + + Y + F + Sbjct: 370 YFGDLMATAAELAGTDFPEDADSISFAPTIVGRPEAQQTHEYL--YWEFYEQGGRQAVRR 427 Query: 387 DDFKLVLNLF--TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRS 444 ++K + + +LYD + D E NL D ++ ++ + ++ P + Sbjct: 428 VNWKAIREPWMTGPTQLYDLKADIGETTNLASD--HPEIVKQLETLM----EEAHTPHPN 481 Query: 445 YQWSLRPWRK 454 +Q + ++ Sbjct: 482 WQVRVPASKR 491 >UniRef50_C3WCE8 Arylsulfatase n=2 Tax=Fusobacterium RepID=C3WCE8_FUSMR Length = 476 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. 
Identities = 115/473 (24%), Positives = 212/473 (44%), Gaps = 27/473 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + + TD + +G Y + T N+D +A EG+ F +++ SPVCTP+RAG+FT Sbjct: 6 KKPNIVLITTDQMRADAIG-YINSKVITPNLDMMAKEGVVFTNSFCSSPVCTPSRAGIFT 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHL--------DGHDYFGTGE 113 G Y +G W +N T+ + K GY+ +GK H + ++ + Sbjct: 65 GRYPMNTGAWNIGTCLDENEITLADWLKGEGYYNIGVGKMHFRPQLKDFDNNYEDVEVRD 124 Query: 114 CPPEWDADYWFDGANYLSELTEKEIS---LWRNGLN---SVEDLQANHIDETFTWAHRIS 167 E D Y+ Y++E ++ L NG + + N + E F + I Sbjct: 125 RVRERDKTYYGFDETYITEDDKQGKYLDFLDENGYHLEVGKGNDGMNPLPEEFNQTYWIG 184 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 ++ + +++ ++P M+ ++ +PHHPF ++ Y + +PE Sbjct: 185 MKSCEAIRKY-DFNKPLFMMTNFVDPHHPFDPAEKFARMYDGVEIDSPISKDKFCNERPE 243 Query: 228 HHRLWAQAMPSP--------VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTW 278 + + + P + + Y+A F+D +IG++ L + +NT Sbjct: 244 YLKRQGERGYWPGGGEQHKLSDEKVEEYTRYYYAMITFIDQEIGKIRKELEKKGELDNTI 303 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMA 337 +I+TSDHGE MG + L+ KG MYD++ ++PL+ E+ D V +ID++PT++ Sbjct: 304 IIFTSDHGEYMGDYGLLQKGPFMYDNLIKVPLLFWGKGVEKSVTSDEIVENIDIVPTILE 363 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT 397 L E P + GE++ + + + +D+ I V+ + +KL L + Sbjct: 364 LIGKEVPYGIQGESLKNILQKIDKERVKKSAIVTYDARDRGIMVKSYRDKRYKLNLFMNE 423 Query: 398 S-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 E+YD DP E NL + +++++ M + DP + Sbjct: 424 EYGEMYDLEVDPQETTNLFFKEEYLQLKNELLLKACYRMMECSDPLSKRTANW 476 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. 
Identities = 97/454 (21%), Positives = 173/454 (38%), Gaps = 48/454 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN ++++ D +GCY K + T ++D +AA G++F Y+ S VC P+R+ L G Sbjct: 19 KPNIIYILADDLGYGDLGCYGQKVIQTPHLDKMAANGMKFTQHYSGSTVCGPSRSCLLEG 78 Query: 63 IYANQSGPWTNN---VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 ++ + N + + + + AGYHT IGK + + P + Sbjct: 79 KHSGNTYVRGNGMLQMRQDPHDLIFPKALQKAGYHTAMIGKSGMGCNTD--DAALPYQKG 136 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF--TWAHRISNRAVDFLQQP 177 DY+F ++ LW+N + N+ + + N A+D++++ Sbjct: 137 FDYFFGFTSHTQAHWFFPTHLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYVERQ 196 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 D PF + +++ PH E+ KY E +L + Sbjct: 197 --KDGPFFLHLAFQIPHASLRAKEEWKAKYRPILKE----------------KLLPKKDK 238 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM-GAHK-- 293 P + A ++D +G + L ENT +++ SD+G M G HK Sbjct: 239 HPHYSYEREPKTTFAAMVSYMDHNVGLLNKKLEDLGLAENTLIMFASDNGAMQEGGHKRD 298 Query: 294 -------LISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKP 344 L MY+ R P+I P + D + D+ PT+ LA + Sbjct: 299 SFDSNGVLRGGKRDMYEGGVRTPMIAYWPGKIKAGQTSDHISAFWDISPTVRELAGAKVQ 358 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD----- 399 E G + + +G + + E GG R +KL+L +D Sbjct: 359 EDTDGISFVPTLLGKGSQTKHDYLYWEFFEQGGK---RAIRMGKWKLILYKTNTDLNPKM 415 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 EL+D D +E +L + + S + + Sbjct: 416 ELFDLEADISEQKDLSK--QLPEKVSALLKLMDK 447 >UniRef50_C0S8M2 Choline sulfatase n=8 Tax=Eurotiomycetidae RepID=C0S8M2_PARBP Length = 619 Score = 425 bits (1095), Expect = e-117, Method: Composition-based stats. 
Identities = 109/432 (25%), Positives = 185/432 (42%), Gaps = 20/432 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++P+ L++M D A ++ Y P+ T N++ LA EG+ F SAY SP+C P+R + Sbjct: 5 EKPSILYIMADQMAAPLLSLYDENSPIKTPNLERLAREGVCFESAYCNSPLCAPSRFSMV 64 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG +++G + N + T Y + GYHT GK H G D E + Sbjct: 65 TGQLPSKTGGYDNASDLPADTPTYAHYLRKEGYHTALAGKMHFVGPDQLHGYEQ--RLTS 122 Query: 121 DYWFDGANYLSELTEKEI-SLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D + + E E+ W + ++SV + + + +RA +L R Sbjct: 123 DIYPGDYGWTVNWDEPEVRPDWYHDMSSVLEAGPCVRTNQLDYDDEVIHRATQYLYDHTR 182 Query: 180 A--DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 +PF + VS PH P+ EY + Y D L + + H + +++ Sbjct: 183 HRAGQPFCLTVSMTHPHDPYAMTKEYWDLYEDIDIPLPKTPVIPHDEQDPHSQRVLKSID 242 Query: 238 SPVGDDGLY----HHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMMGAH 292 + YFA +VD Q+GR++ L + +NT V++T DHG+M+G Sbjct: 243 LFGKEIPEQCILAARRAYFAACSYVDSQVGRLMATLKACDLADNTIVVFTGDHGDMLGER 302 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEIL--PG 349 L K Y+ R+P+ + +P + ++V VS +DLLPT A+A E L G Sbjct: 303 GLWYK-MVWYEHAARVPMFVHAPGRYKPKRVKENVSTMDLLPTFAAMAGGEINNHLPIDG 361 Query: 350 ENILAVKEPRGVMV-----EFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDR 404 +++ + + E+ + G PV +K + + L++ Sbjct: 362 VSLMPYLLDSDSREAVSGLKTDTVIGEYMAEGTLAPVVMIRRGPWKFIYSPIDPPMLFNV 421 Query: 405 RNDPNEMHNLID 416 + DP E NL Sbjct: 422 KRDPTEAVNLAS 433 >UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Tax=Firmicutes RepID=A0Q2E3_CLONN Length = 483 Score = 425 bits (1095), Expect = e-117, Method: Composition-based stats. 
Identities = 125/461 (27%), Positives = 193/461 (41%), Gaps = 54/461 (11%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++TD Q +GCY T +DSLA GIRF + + SPVC+PARA ++TG Sbjct: 7 NVISIITDDQGYWSMGCYGNHDAITPTLDSLANNGIRFENFFCVSPVCSPARASIYTGRI 66 Query: 65 ANQSGP------WTNN---VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 +Q G W N K ST GY GKWHL D Sbjct: 67 PSQHGIHDWLDEWNNGYTTEEYLKGQSTFVDILAKNGYECAMSGKWHLGVADK------- 119 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 P+ YW+ + ++++G I E +++ ++F++ Sbjct: 120 PQNGFKYWYS--HQKGGGPYYGAPMYKDG---------TLIHEERYVTDVMTDYGLEFIE 168 Query: 176 QPARADEPFLMVVSYDEPHHPFTC---PVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 + +D PF + ++Y PH P++ P E L+ Y D ++ K + Sbjct: 169 KQRDSDNPFYLSLNYTAPHAPWSPENHPKELLDLYKDCEFKSCP--------KDGKNDWS 220 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 + D+ YFA VD+ I RVI+ L ENT +I+TSD+G MG Sbjct: 221 IDYIFPKTEDERREVLRGYFAALTSVDNNIKRVIDKLKEMGVLENTLIIFTSDNGMNMGH 280 Query: 292 HKLISKGA-----AMYDDITRIPLII-RSPQGERRQVDTPVSHIDLLPTMMALADI--EK 343 H + KG M+D +IP I + + + +SH D+ PT+M I E Sbjct: 281 HGIFGKGNGTSPVNMFDTSVKIPCFITKIGDIKPQVSTDLLSHYDIRPTLMEYLGIEDEI 340 Query: 344 PEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFTSDE 400 E LPG + ++ + + N I + + P R T ++K V E Sbjct: 341 DEGVKLPGRSFASLLRGEKLERDDNEVVI----YDEYGPARMIRTKEWKYVHRYPAGPHE 396 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 LYD NDP+E NLIDD D+ ++ L + + +P Sbjct: 397 LYDLVNDPDEKINLIDDEDKKDIVKELRYRLKRWFIQYVNP 437 >UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAN4_9FIRM Length = 470 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. 
Identities = 93/482 (19%), Positives = 168/482 (34%), Gaps = 55/482 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D + C T +ID L EG+ F+ AY PVC+P+RA + +G Sbjct: 4 QPNVIMILIDDLGWMDLSCQGSSFYETPHIDQLRREGMAFDQAYAACPVCSPSRASILSG 63 Query: 63 IYANQSGPWT-----NNVA-------------PGKNISTMGRYFKDAGYHTCYIGKWHLD 104 Y + N + +M + F++AGY T ++GKWHL Sbjct: 64 KYPARLKVTDWIDHENYHPCRGKLIDAPYIKELSVSEFSMAKAFQEAGYQTWHVGKWHLG 123 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 + P D G+ + + + E Sbjct: 124 KEATY-----PEHHGFDVNLGGSWWGHPKKGY--------FSPYHMENLSDGPEGEYLTD 170 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 RI A ++ PF + + + H P E + + + +G QD Sbjct: 171 RIGAEAAALIR-SRDPQRPFFLNLWHYAVHTPLQAKAEDIAYFEEKAKRMGLDQQDPFEI 229 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTS 283 Q + + P+Y A +DD +G+++ L E + +T VI+TS Sbjct: 230 GDPF--PILQKKDKRITRRIVQSDPVYAAMIKALDDSVGQLMATLKAEGLDEDTIVIFTS 287 Query: 284 DHGEMM-------GAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPT 334 D+G + L MY+ R PL +R P + D PT Sbjct: 288 DNGGLATAEHSPTCNFPLSEGKGWMYEGAVREPLFVRWPGKIEAGSLSHALTTSPDFYPT 347 Query: 335 MMALADIE--KPEILPGENILAVKEPRGVMVEF--NRYEIEHDSFGGFIPVRCWVTDDFK 390 ++ L + + G ++ V + + H G P +K Sbjct: 348 LLELCGLPLRPQQHCDGVSLAPVLLNPQAKFDRGPIFWHYPHYGNQGGTPGSALRCGKWK 407 Query: 391 LVLNLFT-SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 + S L+D D +E HN+ + + D+ + H L ++++ + ++ + Sbjct: 408 YIEFYEDHSVRLFDLEQDVSEKHNVAEV--YPDLVRQFHSLLHEWLEAVD----AWYPEV 461 Query: 450 RP 451 P Sbjct: 462 NP 463 >UniRef50_B4D026 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D026_9BACT Length = 489 Score = 425 bits (1093), Expect = e-117, Method: Composition-based stats. 
Identities = 109/464 (23%), Positives = 180/464 (38%), Gaps = 48/464 (10%) Query: 3 RP-NFLFVMTDTQATNMVGCYSG--KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +P N LF+++D + + K L T N+D LA EG +A+ + +C+P+RA + Sbjct: 26 KPRNILFILSDDHRWDFMSFMPEAPKFLETPNLDRLAKEGAHLRNAFCSTSLCSPSRASI 85 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG Y + G N I Y + AGY T ++GKWH+ P Sbjct: 86 LTGQYMHHHGVVDNQRPEPAAIRYFPEYLRAAGYETAFLGKWHMGEDSDN------PRKG 139 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 DYW + + ++ H + +++ A+D+L+ R Sbjct: 140 FDYWAGFRGQG------------HYFDDTYNINGEHKKIDGYSSDVLTDLALDWLK--HR 185 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 D+PF + Y PH+PF +Y E + N R + Sbjct: 186 GDKPFFCELCYKAPHYPFEPAPRNKGRYEKAPIPYPETMANTEENYLTQPRWVRERRFGI 245 Query: 240 VGDDGLYH--------------HPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSD 284 G D + + Y +D+ IGR++ L ++T V+Y +D Sbjct: 246 HGVDHMETGRFDHDPVPSFEDLYHRYSETVFSMDENIGRLLKYLDNTGLRDSTIVVYMAD 305 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIE 342 +G +G H K ++ R+P+++R+P V V +ID+ PT++ A + Sbjct: 306 NGFELGEHGFYDK-RDAFETSMRVPMLLRAPGAVKPGTVVTKMVQNIDIAPTLLEAAGVT 364 Query: 343 KPEI---LPGENILAVKEPRGVMVEFNRYEIEHDS--FGGFIPVRCWVTDDFKLVLNL-- 395 P + G + + + R V + + F TD +K V Sbjct: 365 VPADAPKMDGYSFWPLVQGRDVPWRDHILYEYYWERNFPATPTTFAIRTDRWKYVYTHGL 424 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 + D LYD DP E HNLID F + K+ L D +DK Sbjct: 425 WDRDGLYDLETDPVERHNLIDVPAFREQGGKLRGQLFDELDKSG 468 >UniRef50_A4GIB2 Putative secreted sulfatase n=1 Tax=uncultured marine bacterium HF10_49E08 RepID=A4GIB2_9BACT Length = 667 Score = 424 bits (1092), Expect = e-117, Method: Composition-based stats. 
Identities = 103/517 (19%), Positives = 184/517 (35%), Gaps = 84/517 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN +F + D + VGCY K T ID LA EGIRF++AY+ VC+P+RA + T Sbjct: 22 RKPNIVFFLVDDLGWSDVGCYGSKFHETPAIDQLAKEGIRFDNAYSTCHVCSPSRASILT 81 Query: 62 GIYANQSGP---------WTNN--------VAPGKNISTMGRYFKDAGYHTCYIGKWHLD 104 G Y ++ A T+ K GY T GK HL Sbjct: 82 GKYPARTNLTEWLGGRPERDYEPLHHGEKLTALPDEEVTLAETLKSHGYATANYGKAHLR 141 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 P + D G + + + Sbjct: 142 V--------DPNAYGFDEEITGWVRSYHYPFGGAY-----------NEKLPAKKGDYYTD 182 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD--L 222 ++++ A+DF+++ D PF + + + H P + +EKY + ++ D L Sbjct: 183 KLTDAALDFIER--NKDRPFFVHLEHFAVHDPIQGRPDLVEKYRKKLAAMPKQDGPDFIL 240 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHP--------------LYFACNDFVDDQIGRVINA 268 + P+ L + + + +D L H + + D+ +GR+ Sbjct: 241 ESNPDGPELTTEELKALAENDELQDHQDARVWWVKQKQDNVEFAGMLEATDESLGRIRKK 300 Query: 269 LTPEQR-ENTWVIYTSDHGEM---------------------MGAHKLISKGAAMYDDIT 306 L +NT VI+T+D+G M L Y+ Sbjct: 301 LKDLGLADNTIVIFTADNGGMSASNQYRGINHPIESLDSRFASSNLPLRGAKGWNYEGGI 360 Query: 307 RIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIE--KPEILPGENILAVKEPRGVM 362 R+PL++ P + + V+ D PT++ + + + + G + L + Sbjct: 361 RVPLVVYWPGRIKPDSTSNALVTGTDFYPTLLEMIGMPTLPNQHIDGVSFLPALRGKAHD 420 Query: 363 VEFNRYEIEHDSFGGFI-PVRCWVTDDFKLVLNL-FTSDELYDRRNDPNEMHNLIDDIRF 420 + H S G+ P +KL+ S +L+D D E ++L Sbjct: 421 RGAIYWHFPHYSNHGYQSPGGAIRLGKYKLLEYYENGSVQLFDLEKDIGEQNDLSKTK-- 478 Query: 421 ADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDAR 457 DV++K+ L ++ ++ + S K +R Sbjct: 479 PDVKAKLLKMLHEWRREVDAKMPYPKTSNSKPAKGSR 515 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 424 bits (1092), Expect = e-117, Method: Composition-based stats. 
Identities = 97/474 (20%), Positives = 164/474 (34%), Gaps = 75/474 (15%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + RPN + ++ D +GCY + T +D LAAEGI+ + Y+ + VC P+R L Sbjct: 71 VSRPNIVLIVADDLGYGELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCCLM 130 Query: 61 TGIYANQSGPWTNNV-------------------APGKNISTMGRYFKDAGYHTCYIGKW 101 TG + + N + T+ Y K GY T GKW Sbjct: 131 TGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFGKW 190 Query: 102 HLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFT 161 L +FGT P E D ++ LWRN + V+ + Sbjct: 191 GLG---HFGTTGDPNEQGFDLFYGFNCQRHAHNHYPNFLWRNRVKEVQPGNDRTLHGETY 247 Query: 162 WAHRISNRAVDFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ 219 + N A +F++Q D+ PF + + PH P E ++ Y E + Sbjct: 248 SQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLSIQVPEEEVDAYDGV-IEEADYEH 306 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 P Y A +D+ +G+V++ + ENT Sbjct: 307 HGYLKHP-------------------RPRAGYAAMVTRMDEGVGQVVDLVDSLGLGENTL 347 Query: 279 VIYTSDHG------------EMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDT 324 +++TSD+G A + + + R+P+I R R D Sbjct: 348 IMFTSDNGPTYDRLGGSDSDYFNSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDW 407 Query: 325 PVSHIDLLPTMMALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRC 383 + D LPT+ A +E G + L + + + + + F G+ + Sbjct: 408 IGAWWDFLPTITDAAGVEVDASTTDGISFLPLLHGDDAAQQSHEFL--YWEFPGYSGQQA 465 Query: 384 WVTDDFKLVLNLFTSDE-----------LYDRRNDPNEMHNLIDDIRFADVRSK 426 ++K + + LYD D E +++ DV +K Sbjct: 466 IRMGNWKAIRKDLSKRLKKGQTEPPAFALYDLSKDLAESNDVSAS--HPDVMAK 517 >UniRef50_D2RQH7 Sulfatase n=1 Tax=Haloterrigena turkmenica DSM 5511 RepID=D2RQH7_9EURY Length = 498 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. 
Identities = 129/491 (26%), Positives = 198/491 (40%), Gaps = 44/491 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN LFV+TD + + P+ T +D L++EG+RF+ A T +CT ARA L T Sbjct: 4 SRPNVLFVLTDQERYDCTAPEG-PPVETPAMDRLSSEGMRFSRACTPISICTSARASLMT 62 Query: 62 GIYANQSGPWTNNVA-------PGKNISTMGRYFKDAGYHTCYIGKWH------------ 102 G++ + G N+ + T + GY Y GKWH Sbjct: 63 GLFPHGHGMLNNSHEADAIRPNLPPELPTFSELLAENGYDCSYTGKWHVGRDQTPEDFGF 122 Query: 103 --LDGHDYFGTG--ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE 158 L G D E E+ + E R+ +D Sbjct: 123 AYLGGSDKHHDDIDEAFREYREERGVPPGEVDLEEVLYTGDDPRDASEGTFVAATTPVDV 182 Query: 159 TFTWAHRISNRAVDFLQQPARAD---------EPFLMVVSYDEPHHPFTCPVEYLEKYAD 209 T A+ ++ R +D ++ A D +PF + PHHP+ P Y Y Sbjct: 183 EETRAYFLAERTIDAIEAHADGDSGEGDGNGSDPFFHRADFYGPHHPYVVPEPYASMYDP 242 Query: 210 FYYELGEKAQDDLANKPEHHRL--WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVIN 267 + E + KP+ H + + D Y+ +DDQ+ R++ Sbjct: 243 NEIDPPESYAETYDGKPQVHENFHYYRGADGLEWDHWAEATAKYWGFVSLIDDQLERILE 302 Query: 268 ALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDT 324 AL + T V++ SDHG+ +G H+ +KG MYDD RIPL +R P + Sbjct: 303 ALEEHGLADETAVVHASDHGDFVGNHRQFNKGPLMYDDTYRIPLQVRWPGVAEPGTTCEV 362 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENILAVKE-PRGVM-----VEFNRYEIEHDSFGGF 378 PV DL T + + ++ PE +++ + E + + H G Sbjct: 363 PVHLHDLAATFLEMGGVDVPESFDSRSLVPLLETGDDPDAVPDDWPDSTFAQYHGDEFGL 422 Query: 379 IPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 R T +K V N DELYD + DP E+ NLID +ADVR +M D L+D+M + Sbjct: 423 YTQRMVRTGRYKYVYNGPDIDELYDLKADPAELQNLIDHPGYADVREEMRDRLVDWMQET 482 Query: 439 RDPFRSYQWSL 449 DP + + + Sbjct: 483 DDPNQGWVPDV 493 >UniRef50_Q7NMX5 Gll0640 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NMX5_GLOVI Length = 834 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. 
Identities = 106/438 (24%), Positives = 185/438 (42%), Gaps = 33/438 (7%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++TD QA N + L + LA++G+ F +A+ +C P+RA + TG Y Sbjct: 37 NVVLIVTDDQAWNTLAYM--PKLQS----QLASQGVTFTNAFAGQSLCCPSRATILTGRY 90 Query: 65 ANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 + G N+ G + ST+ + +++GY T GK+ + PP WD Sbjct: 91 PHNHGVLGNDAPFGGALAFYDASTLPVWLQESGYRTGLFGKYFNGY--SYSAFYTPPGWD 148 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 F A Y + NG ++ E+ ++ +AV F+ A Sbjct: 149 EWQTFQLAGY------YNYRINANGT-----IEDYGRSESNYSTDVLTQKAVAFITNSAA 197 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLANKPEHHRLWAQAMPS 238 +D+PF + ++ PH P+T + +YAD + D+ +KP + A P Sbjct: 198 SDKPFFLFLAPFAPHAPYTPAPRHAGRYADIPPWRPPNYNEQDVLDKPTWVQKLRPASPQ 257 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISK 297 D Y VDD + ++ AL RENT VI+TSD+G G H+ K Sbjct: 258 TQTDYDKE-RQAYLEMLLAVDDGVESILQALESTGQRENTLVIFTSDNGLTWGEHRWWEK 316 Query: 298 GAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 G + Y++ R+P+++ P RQ + V ++DL T+ A I P + G ++L + Sbjct: 317 GCS-YEESLRVPMVVSFPGVSTAARQEELLVLNMDLTATIAEAAGIPIPATVDGRSLLPI 375 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLI 415 + + V + + + +K + NL ELY+ +DP E+ N + Sbjct: 376 LKGQAVSWREQFL---FEGWQLTPTHAGVRSTAWKYMENLAGEQELYNLIDDPYELDNAV 432 Query: 416 DDIRFADVRSKMHDALLD 433 + +++ L Sbjct: 433 GVADYGAQVAELQATLAQ 450 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. 
Identities = 100/468 (21%), Positives = 166/468 (35%), Gaps = 57/468 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++ D +GCY + T NID LA +G++F S Y S VC P+RA L T Sbjct: 41 RRPNIILILADDLGYGDLGCYGQTQIKTPNIDKLAEDGMKFTSFYAGSTVCAPSRATLMT 100 Query: 62 GIYANQSGPWTNNV-APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G N + T+ + K AGY T IGKW L G+ P Sbjct: 101 GKNTGHVNIRGNADLSLNGEELTIAKILKLAGYATGCIGKWGLGNE---GSPGLPGRQGF 157 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDL----QANHIDETFTWAHRISNRAVDFLQ- 175 D + + + L+R+ E + + + + A+++L+ Sbjct: 158 DEYLGYLDQVQAHDYYPTHLFRSDSKGEESKIALTENDADHKGLYSNDFFTQSALNYLRI 217 Query: 176 ---QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 F + + Y PH ++L N+ + Sbjct: 218 NKPSKLNKHRSFFLYLPYTLPHA-----------------------NNELGNRTGNGMEV 254 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG----- 286 P + A +D +G +++ L + + NT VI+ SD+G Sbjct: 255 PSTEPY-TNEQWPQVEKNKAAMITRLDHYVGEIMDYLKKSKLDENTVVIFASDNGPHKEG 313 Query: 287 -----EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALA 339 A L +Y+ R+P I+R P D P++ D LPT +A Sbjct: 314 GVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLAFWDFLPTAAEIA 373 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTS 398 P + G + L + E G + D+K V + Sbjct: 374 RTSSPTNIDGISFLPTLLGKAQTNRHQYLYWEFHEQGFD---QAVRMGDWKAVRHGINGP 430 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 ELY+ + D +E N+ D + +V +K+ D L + DP + Sbjct: 431 IELYNLKTDVSEKDNVAD--KNPEVMAKIADYLKK--ARTDDPRWPAK 474 >UniRef50_A4A280 Iduronate-2-sulfatase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A280_9PLAN Length = 475 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. 
Identities = 109/455 (23%), Positives = 193/455 (42%), Gaps = 31/455 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N LF+++D + + CY + T NID LA G++F AY PVC P+RA L +G Sbjct: 25 KYNVLFIISDDLSAESLSCYGHRECQTPNIDRLAQRGVKFTHAYCQYPVCGPSRAALMSG 84 Query: 63 IYANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYF----GTG 112 ++A G N + + ++M ++F+D GY+ + K +H+ Sbjct: 85 LHAATIGVMGNGQSTRFTQNLGDRASMSQHFRDQGYYAARVSKIYHMRIPGDITAGTNGD 144 Query: 113 ECPPEWDADYWFDGANYLSE----------LTEKEISLWRNGLNSVEDLQANHIDETFTW 162 + WD + ++S L + + G + D Sbjct: 145 DHAASWDERFNCQAPEWMSAGDAATYSNEKLNKDPDKHYGLGFGTAFYAVKASTDGAEQA 204 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 H+ +++A++ L++ +E F + V PH P P ++ E YAD EL K D Sbjct: 205 DHKAADKAIELLRK--HKEERFFLAVGMVRPHVPLVAPAKFFEPYADGQMELPLKVAGDW 262 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIY 281 + P+ + Y+A ++D Q+GRV++ L + NT V++ Sbjct: 263 DDIPKAGISRNSKATGMTLEGQRNTLSAYYAAVAYMDYQVGRVLDELHQLGLDKNTVVVF 322 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADI 341 T+DHG +G H K +++++ T IPLI+ P + + V+ + ID+ PT+ L ++ Sbjct: 323 TADHGYHLGEHDFWQK-MSLHEESTHIPLIVAIPGEQPKVVNGLAAQIDIYPTLAQLCEL 381 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDEL 401 P L G + +A V + + TD + + ++EL Sbjct: 382 PVPTYLQGVSQVAAIASPDAAVRDDVLCMTSKGKL-------LRTDRYAYISYSGGTEEL 434 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 YD ++DP + NL D V K+ L + D Sbjct: 435 YDMQSDPQQYTNLAKDPASQPVLGKLRAQLKERAD 469 >UniRef50_UPI0001745B0B sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745B0B Length = 676 Score = 423 bits (1089), Expect = e-117, Method: Composition-based stats. 
Identities = 114/455 (25%), Positives = 182/455 (40%), Gaps = 31/455 (6%) Query: 4 PNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 PN LF++ D + +G T N+D LA G+RF +A+ +C P+R + TG Sbjct: 37 PNVLFIIADDL-NDWIGWMGGHPQARTPNMDRLARMGMRFMNAHCSYALCNPSRTSMLTG 95 Query: 63 IYANQSGP------WTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WH--LDGHDYFGTGE 113 I SG W N + T+ YF+ GY T GK +H G + TG Sbjct: 96 IQPWNSGVAGNEQDWRNAEPL-QGKPTLPEYFRQQGYTTAAGGKVFHASHGGPEGRLTGW 154 Query: 114 CPPEWDA------DYWFDGANYLSELTEKEISLWRNGLNSVE-DLQANHIDETFTWAHRI 166 D F G NGL+ D + T ++ Sbjct: 155 HGGRRGFEQDSAWDVRFPGNGVQIPDLPVHTGQNFNGLDIWHWDWGTVDVKPEATDDGQV 214 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 N A +LQ+ + PF + V PH P+ P +Y + L E +DDLA+ P Sbjct: 215 VNWAAQYLQR--KQPRPFFLTVGLYRPHAPWYVPRQYFAERPLSEVRLPEVKEDDLADVP 272 Query: 227 EHHRLW---AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYT 282 + + Y A F D +GRV++AL + NT +++T Sbjct: 273 AAAKAYLNGGLHRKMLDRQLWGSAVRAYLASISFCDAMVGRVLDALESSPNKTNTVIVFT 332 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALAD 340 SDHG +G + KG +++ +T +PL++ +P + Q VS +DL PT+ L Sbjct: 333 SDHGLYLGEKQRWHKGG-LWERVTHVPLVVVAPGVTQPDTQSSQAVSLVDLYPTLCELTG 391 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE 400 + KP+ L G +++ + + G TD ++ + S+E Sbjct: 392 LPKPQSLDGISLVPLLRDPNASRTTPAVTAMGE---GDKASYAVRTDRWRYIRYANGSEE 448 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM 435 LYD ++DP+E NL A V+ + + Sbjct: 449 LYDHQSDPHEWTNLAGRTNLAAVQKDLAAQIPQKW 483 >UniRef50_A3P379 Choline-sulfatase n=63 Tax=cellular organisms RepID=A3P379_BURP0 Length = 517 Score = 423 bits (1088), Expect = e-116, Method: Composition-based stats. 
Identities = 123/466 (26%), Positives = 193/466 (41%), Gaps = 24/466 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D + Y + T ID LAAEG+ F++AY SP+C P+R L G Sbjct: 11 QPNILVLMADQLTPFALRAYGHRATRTPTIDRLAAEGVVFDAAYCASPLCAPSRFALMAG 70 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + G + N T Y + AGY T GK H G D E D Sbjct: 71 KLPSALGAYDNAAELPAQTLTFAHYLRAAGYRTMLSGKMHFCGPDQLHGFE--ERLTTDI 128 Query: 123 WFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL-----QQ 176 + ++ + T E W + ++SV D + + A + ++ Sbjct: 129 YPADFGWVPDWTRPAERPSWYHNMSSVLDAGPCVRTNQLDFDDDATFAARQKIFDVARER 188 Query: 177 PARAD-EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA-- 233 A D PF MVVS PH P+ EY + Y D +L D A+ P RL A Sbjct: 189 AAGRDTRPFCMVVSLTHPHDPYAITREYWDLYRDEDIDLPAVQMDFDASDPHSRRLRAVC 248 Query: 234 -QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGA 291 P Y+ +VD Q G ++ L ++T VI T+DHG+M+G Sbjct: 249 EVDRTPPEDLQIRRARRAYYGATSYVDAQFGALLATLEQCGLADDTIVIVTADHGDMLGE 308 Query: 292 HKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALA----DIEKPEI 346 L K ++ R+PLI+ +P + +V VSH+DLLPT++ LA + P+ Sbjct: 309 RGLWYK-MTFFEGACRVPLIVHAPRRFPAARVPAAVSHVDLLPTLVELATGERRADWPDA 367 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 + G +++ G + E+ + G P+ K + + D+L+D RN Sbjct: 368 VDGRSLVPHLRGEGG---HDEAFGEYLAEGAIAPIVMMRRGSHKYIHSPADPDQLFDLRN 424 Query: 407 DPNEMHNLIDDIRFADVRSKMH-DALLDY-MDKIRDPFRSYQWSLR 450 DP E+ NL + A + + + + +D + + Q R Sbjct: 425 DPRELDNLANTPAAAKHVAAFRMERVARWDLDALHQQVLASQRRRR 470 >UniRef50_A6DSH1 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH1_9BACT Length = 462 Score = 423 bits (1088), Expect = e-116, Method: Composition-based stats. 
Identities = 101/444 (22%), Positives = 185/444 (41%), Gaps = 22/444 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N LF+M+D + Y + T N+D L ++ + F+ AY+ P+C P+R + +G Sbjct: 22 KMNVLFIMSDDLNV-DIASYGHPIVKTPNLDKLRSKSVLFSQAYSQYPLCNPSRNSILSG 80 Query: 63 IYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE-- 117 +Y SG +N +I+T+ FK GY GK TG Sbjct: 81 MYPGTSGCLSNADQLRKTAPDITTLPEAFKKQGYEVISTGKIFHHEDPQSWTGITNLRTG 140 Query: 118 ----WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 DY F + T E G ++ E + R + + Sbjct: 141 KLHPQGKDYNFYRPAFDERKTIGEGRNLTEGELGFMTWRSVTEKEDILFDSRTARWTMQH 200 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLANKPEHHRLW 232 L++ A ++PF + V + PH PF P + + Y +L E Q+ ++ ++ Sbjct: 201 LEKLAEDEKPFFLGVGFSRPHDPFFAPKRFFDMYPMESIKLPETPQNASKVPMMAYYDVF 260 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGA 291 +A L Y+A ++D+Q+G V++ L NT V++ SDHG +G Sbjct: 261 KRAFDKMDTQKRLEFVRSYYASISYMDEQLGLVLDKLEALNLSNNTLVVFISDHGYQVGE 320 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPTMMALADIEKPEILPG 349 +K +++ R PL+I +P+ + VD V ID+LPT+ + + P+ G Sbjct: 321 KGYFNK-TLLFERSCRAPLMISNPKLKSSVNKVDKIVEFIDVLPTITEITSVPTPKTAEG 379 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +++ + + + V + R T+ ++L+ + LYD + DP Sbjct: 380 RSLIPLMKGKKVEWKEEAISY-------VNADRSIRTERYRLINWRGQKEALYDHQRDPG 432 Query: 410 EMHNLIDDIRFADVRSKMHDALLD 433 E N +D+ + +V ++ L + Sbjct: 433 EHFNQVDNPEYKEVLKRLRSKLKE 456 >UniRef50_A6CG48 Sulfatase family protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CG48_9PLAN Length = 472 Score = 422 bits (1086), Expect = e-116, Method: Composition-based stats. 
Identities = 104/446 (23%), Positives = 179/446 (40%), Gaps = 23/446 (5%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ D + CY + +++ NID LA + F A+ P C +RA L T Sbjct: 21 ERPNVLFIAVDDLRPE-LACYGKQHIHSPNIDKLAESSVLFERAFCMVPTCGASRASLMT 79 Query: 62 GIYANQSGPWTN---NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 GI ++ N +TM FK GY+T +GK D PP Sbjct: 80 GIRPARNRFVNFLAWAERDAPNATTMNTQFKQNGYYTASLGKIFHHPADNRQGWSEPPWR 139 Query: 119 DAD-YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 W+ + ++ ++ + + ++ +A++ LQQ Sbjct: 140 PKGVQWYQRPENQEKHAARQ---KLGNKKKGPAWESADVPDNAYMDGVLAEKAIEKLQQL 196 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM- 236 + ++PF + V + +PH PF P +Y + Y +L + E + + Sbjct: 197 EKQEQPFFLAVGFFKPHLPFIAPQKYWDLYDHDKIQLPANHKVPQDAPKESIHRFGELRA 256 Query: 237 -------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM 288 + Y+AC + D QIG+++ L Q +NT V+ DHG Sbjct: 257 YADIPAKGPVSEETARNLIHGYYACVSYTDAQIGKLLAELDRLQLSDNTIVVLWGDHGWN 316 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPEIL 347 +G H L K + Y+ IPLI+R+P + + + + ID+ PT+ LADI +P+ L Sbjct: 317 LGDHTLWCKH-SCYESSLHIPLIVRAPGIKGGERRSSLMESIDVYPTLCDLADIPQPKHL 375 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G++ +++ + + + G I ++ L S LYD D Sbjct: 376 KGQSFVSLMKDSTAEWKQAA--VSRYRNGDTIRTDTLRYTEYTLPKGKLVSQMLYDHSTD 433 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLD 433 P E N+ + AD ++ L Sbjct: 434 PLENVNVSA--QQADAVKELSAQLKQ 457 >UniRef50_C5BVK2 Sulfatase n=11 Tax=Actinomycetales RepID=C5BVK2_BEUC1 Length = 505 Score = 421 bits (1084), Expect = e-116, Method: Composition-based stats. 
Identities = 136/500 (27%), Positives = 214/500 (42%), Gaps = 36/500 (7%) Query: 2 KRP----NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARA 57 +RP N LF +TD + +G Y + T N+D+LAA+G F+ YT + +CTPARA Sbjct: 7 ERPVALTNILFFLTDQHRKDTLGAYGNATVRTPNLDALAADGTTFDRFYTPTAICTPARA 66 Query: 58 GLFTGIYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 L TG + N + T +AGYH +GKWH+ H G Sbjct: 67 SLLTGAAPFRHKLLANYERNVGYQEELSEGQFTFSEDLAEAGYHLGLVGKWHVGTHRTAG 126 Query: 111 T--GECPPEWDADYWFDGANYLSELTEKEISLWR----------NGLNSVEDLQANHIDE 158 + P D A+YL+ L E ++ +R NG H Sbjct: 127 DLGFDGPHLPGWHNPVDHADYLAYLEENDLPPYRISDEVRGTFPNGAPGNLLAARLHQPL 186 Query: 159 TFTWAHRISNRAVDFLQQPAR----ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL 214 T+ + ++ RA+D L+ AR + PF + + PH P+ P EYL+ Y EL Sbjct: 187 EATFEYFLAERAIDLLRTYARDHRTSGRPFFLATHFFGPHLPYILPSEYLDMYDADDVEL 246 Query: 215 GEKAQDDLANKPEHHRLWAQAMPSPV--GDDGLYHHPLYFACNDFVDDQIGRVINALTPE 272 + A KP ++ + Y+ VD Q+GR+++A Sbjct: 247 PLSVAETFAGKPPVQGNYSAHWTFDTLGDETSRKLIAAYWGYVTLVDSQVGRILDAAREL 306 Query: 273 Q-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPVSHID 330 ++ V +++DHGE GAH+L KG AMY+DI IP I++ P ++ D ID Sbjct: 307 GVYDDAAVFFSADHGEFTGAHRLHDKGPAMYEDIYTIPGIVKLPGGVPGQRSDRLAHLID 366 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK 390 L T++ +A + + G + + + E P R VT+ +K Sbjct: 367 LTATILDVAGRDPARAVDGVPVTPLVRGEETPWREDLVA-EFHGHHFPHPQRMLVTERWK 425 Query: 391 LVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLR 450 LV+N + +ELYD DP+E+ N A VR+++ L + + D F + S+ Sbjct: 426 LVVNPESVNELYDLVRDPDELQNRYTHPETAAVRAELLGRLYRQLRERGDNFYHWMTSMY 485 Query: 451 PWRKD----ARPRWMGAFRP 466 P + + + GA RP Sbjct: 486 PVGEKDYDTSLSMFEGAHRP 505 >UniRef50_A3HTC7 Putative uncharacterized protein n=1 Tax=Algoriphagus sp. PR1 RepID=A3HTC7_9SPHI Length = 1174 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. 
Identities = 115/523 (21%), Positives = 207/523 (39%), Gaps = 52/523 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F++TD Q + +G + + T +D LA G F +A +P+C +RA LFTG Sbjct: 31 RPNIIFILTDDQRFDALGYAGNQFVQTPEMDRLAESGTYFETAIVTTPICAASRASLFTG 90 Query: 63 IY--ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +Y A+ T N+ + K++GY+T + GK+ + ++D Sbjct: 91 LYERAHNFNFQTGNIRAEYMEESYPTILKNSGYYTAFFGKYGV------RYDNLNNQFDE 144 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 +D N + +T +A+DF+ + Sbjct: 145 YESYDRNNQYPDKRGYYFKTIAG--------------DTVHLTRYTGQKALDFIDKAPE- 189 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYL------EKYADFYYELGEKAQDD-LANKPEHHR--- 230 D+PF + +S+ PH P +Y + + +D+ +P+ R Sbjct: 190 DKPFSLSLSFSAPHAHDGAPDQYFWQTTTDPLLQNTTIPGPDLGEDEFFQAQPQFVRDGF 249 Query: 231 -LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEM 288 + + Y+ +D +I ++ L + + NT +I D+G Sbjct: 250 NRLRWTWRYDTEEKYQHSLKGYYRMISGIDLEIAKIREKLKEKGLDKNTVIIVMGDNGYF 309 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G +L K MYD+ R+PLII P+ G + + V +ID+ T+ LA +E PE Sbjct: 310 LGERQLAGKW-LMYDNSIRVPLIIYDPRSGNHQDIKDMVLNIDVPATIADLAGVETPESW 368 Query: 348 PGENILAVKEPRGVMVEFNRYEIEH-DSFGGFIPVRCWVTDDFKLVLNLFTS--DELYDR 404 G++++ + E + + + IEH F P T+++K + +ELY Sbjct: 369 QGKSLMPIVEGKSQKIGRDTILIEHIWEFENIPPSEGVRTEEWKYFRYVNDKKVEELYHL 428 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL------------RPW 452 +DP E++NLIDD + D+ K+ + + K + FR +L Sbjct: 429 VDDPKEINNLIDDPEYKDIAIKLRSKTDELISKFGNKFREAPSNLTVELIRKPQNAVEVL 488 Query: 453 RKDARPRWMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEEKKQ 495 W + + Q Y +V + + T V Q Sbjct: 489 DTQPEFGWKVSDYSKSQSAYQILVSSSEKLIETNTGDVWNSGQ 531 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. 
Identities = 107/454 (23%), Positives = 169/454 (37%), Gaps = 66/454 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN +F++ D + Y T N+D LA +G+RF +AY VCTP R T Sbjct: 40 SRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLT 99 Query: 62 GIYANQSGPW------------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF 109 G Y + +NN+ N T+ K GY T +GKWH F Sbjct: 100 GRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAGYPPNF 159 Query: 110 GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDET--FTWAHRIS 167 G P + D +F L+ G + + DL N + + Sbjct: 160 G----PLQKGFDEYFG------HLSGGIEYFTHTGTDRILDLYENDVPVQRSGYVTDLFT 209 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 +RAV+F+Q+P PF + + Y+ PH P+ P Sbjct: 210 DRAVEFIQRP--HSRPFYLSLHYNAPHWPWQGP--------------------------- 240 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG 286 + + A G Y A +DD +GRV++AL +NT VI+TSD+G Sbjct: 241 -NDQASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNG 299 Query: 287 E--MMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIE 342 + A++Y+ R+P IIR P + + + DL T++A Sbjct: 300 GERFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAATGTS 359 Query: 343 KPEIL--PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE 400 G+N+L + EF+R R + D+K + Sbjct: 360 FHPNYPPDGQNLLPLLRGD--RSEFSRTLFWRYGAALTTRQRAVRSGDWKY-WRRGNQEA 416 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 L++ DP E +L D A V +++ + + Sbjct: 417 LFNLATDPGETTDLKDS--NAQVFTRLRNQFQHW 448 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 420 bits (1082), Expect = e-116, Method: Composition-based stats. 
Identities = 92/480 (19%), Positives = 174/480 (36%), Gaps = 74/480 (15%) Query: 2 KRP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++P N +F + D +GCY + NID LAAEG++F Y+ C+P R L Sbjct: 31 EKPWNVVFFLVDDLGWTDLGCYGSDFYQSPNIDQLAAEGMKFTQNYSACNACSPTRGALL 90 Query: 61 TGIYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWH 102 TG+Y ++ + +T+ + AGY T ++GKWH Sbjct: 91 TGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEALRTAGYQTFHVGKWH 150 Query: 103 LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 L G G P + D G N + + + + Sbjct: 151 LGGR-----GNLPQDHGFDVNISGTNRGLPRSYH--FPYGGDAMKWDSSLTEAERQDRYL 203 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 R+++ AV ++Q D+PF + S+ H P + ++KY L Sbjct: 204 TDRMADEAVALIRQQ--QDKPFFLYCSFYSVHSPIQGRPDLVKKYKG----LPAGK---- 253 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 + +P Y A VD+ IGRV L + T +++ Sbjct: 254 ----------------------RHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVF 291 Query: 282 TSDHGE----MMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTM 335 TSD+G L + ++ TR+P I+ P P+ +D PT+ Sbjct: 292 TSDNGGVRRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTI 351 Query: 336 MALADI----EKPEILPGENILAVKEPRGVM--VEFNRYEIEHDSFGGFIPVRCWVTDDF 389 + + + E + + G +++ + + E + H + +P ++ Sbjct: 352 LNITGVAGNTEHNQSVDGLSLVPLLKDPAATLNREALYWHYPHYNVFIGVPYSAIRVGEY 411 Query: 390 KLVLNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 KL+ +DELY+ D +E ++ + ++ +++ L ++ ++ Sbjct: 412 KLIHYYEDGNDELYNLAEDLSETSDVSK--TYPELTARLERRLQQHLKQVGAQMPVSNPQ 469 >UniRef50_UPI000051016C choline-sulfatase n=1 Tax=Brevibacterium linens BL2 RepID=UPI000051016C Length = 509 Score = 420 bits (1082), Expect = e-116, Method: Composition-based stats. 
Identities = 125/497 (25%), Positives = 214/497 (43%), Gaps = 32/497 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M+ PN + + D A +G Y T N+D+LAA+G F+ AY +P+C+P+RA + Sbjct: 1 MQPPNIVVIQADQMAAQALGAYGDTAALTPNMDALAADGAVFDRAYCNTPLCSPSRASMM 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 TG + N ++ T + GYHT IG+ H G D E Sbjct: 61 TGRMPSDIDCLDNGDDFAASVPTFAHRLRKLGYHTALIGRMHFIGPDQHHGFE--ERLTT 118 Query: 121 DYWFDGANYLSELTE--KEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL---- 174 D + + + + + W + + V A + + + R + L Sbjct: 119 DVYPADLDMVPDWQRPLDQKLQWYHEADPVFTAGAAKANVQQDFDDEVIFRTLRHLNGRV 178 Query: 175 --QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ-DDLANKPEHHRL 231 Q A D+PFLMV S+ PH P+ P E+ +++A+ + D+A P HRL Sbjct: 179 RANQAAGEDQPFLMVTSFIHPHDPYEPPREHWDRFAEVDIPDPAHPEVPDIAEDPHSHRL 238 Query: 232 ---WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE 287 P +D Y+A ++DD IG++ L + +NT +I TSDHG+ Sbjct: 239 RTMSGLDKKEPGTEDIRRARRAYYAAVSYIDDHIGKIRQRLRELELEDNTVIIVTSDHGD 298 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEKPE 345 M+G L K + Y+ +R+P+II P + PVS +DL+PT++ LA P+ Sbjct: 299 MLGEKGLWYK-MSPYEQSSRVPIIINGPAEAVTPGRYANPVSLVDLMPTLLELAGTSDPD 357 Query: 346 ILPGENILAVKE----PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDEL 401 G ++ + + IE+ + G + P + +KL + + L Sbjct: 358 AT-GVSLFESARQEAAGETGPADRDVI-IEYFAEGTYRPQVTLIRGQYKLTICPGDPELL 415 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALL-----DYMDK-IRDPFRSYQWSLRPWRKD 455 +D +DP+E+ N D +A++ + M L +++++ + S Q + Sbjct: 416 FDLESDPDELVNRAGDAAYAELVATMRAELDSRYDLEHLEEHVLGSQSSRQLVADALKIG 475 Query: 456 ARPRWMGAFRPRPQDGY 472 W F P P++GY Sbjct: 476 TVRHW--DFDPEPENGY 490 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 420 bits (1081), Expect = e-116, Method: Composition-based stats. 
Identities = 100/476 (21%), Positives = 170/476 (35%), Gaps = 75/476 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN ++++ D +GCY + + T +ID+LA EG+RF Y+ +PVC P+R L +G Sbjct: 20 PNVIYILADDLGYGELGCYGQEKIKTPHIDALAKEGMRFTRHYSGAPVCAPSRGVLLSGQ 79 Query: 64 YANQSGPWTNNVAPGKNIS-------TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 +++ N + T+ + FKD GY T GKW L Y G+ P Sbjct: 80 QLSKAYIRNNREHKPEGQEPIPEPGMTLAQIFKDKGYATGAFGKWGLG---YPGSSSDPK 136 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSV---------------EDLQANHIDETFT 161 D ++ + +W N N D + Sbjct: 137 ALGFDTFYGYNCQRVAHSFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENY 196 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 I + A+ F++ D+PF + + EPH P +++ Y + E + Sbjct: 197 APDLILDEALKFIKD--NKDKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAA 254 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVI 280 L Y A +D+ +G V+ L ENT VI Sbjct: 255 YLPH-------------------LRPRAGYAAMISDLDEHVGSVMQLLKELDLVENTLVI 295 Query: 281 YTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSH 328 +TSD+G L ++Y+ R+P+I P + + D Sbjct: 296 FTSDNGASHCIEVDHEFFNSTKDLRGLKGSVYEGGLRVPMIAHWPGKIKKAQVSDHVSGF 355 Query: 329 IDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 +D++ T L E P+ G + L + + E + G + + Sbjct: 356 VDVMATFCDLLQTEAPQTSDGVSFLPTLKGEKQEPQ-PVLAWEFQGYSGQQAI--ILDGR 412 Query: 389 FKLVL-----------NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 +K V ELYD DPNE +L + ++ ++H A++ Sbjct: 413 WKGVRQNLSPRGKKKAKSTPKWELYDLNKDPNEKTDLA--TQMPEIVDRIHKAMMK 466 >UniRef50_Q7UZ92 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ92_RHOBA Length = 582 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. 
Identities = 110/464 (23%), Positives = 188/464 (40%), Gaps = 35/464 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ D + +GCY T NID LA+ ++FN AY VC P+RA L T Sbjct: 26 QRPNVLFIAVDDLRPS-IGCYGDPQAITPNIDRLASRSVQFNRAYCQVAVCNPSRASLMT 84 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP--- 115 G+ + WT + T+ ++F+ GY GK + + + P Sbjct: 85 GLRPDNLAVWTLPIHFREAMPEAVTIPQWFRRYGYTAVSHGKIYHNPTPDPQSWSEPIRD 144 Query: 116 -PEWDADYWFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 P A Y + + + WR A + + +N A++ Sbjct: 145 LPRLPAFYPDGTREQMKKFDNELPDRDWRKNNLRGPSTAAPELADDQLLDGARTNMAIED 204 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL----------GEKAQDDLA 223 L++ ++D PF + + Y PH + P +Y + + + A + + Sbjct: 205 LRRLGKSDAPFFLAMGYIRPHLAWVAPKKYWDMHDPSKLPVRTGEQIPKNSPPYAMHNNS 264 Query: 224 NKPEHHRLWAQAMPSPVG----DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 + P +D + Y+AC ++D QIGR+++AL E +NT Sbjct: 265 EMTHYVDRMNLPKPWDDDTVPTEDARHLMHAYYACVSYIDAQIGRLLSALKEEGLADNTI 324 Query: 279 VIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMM 336 V+ SDHG +G H+ K Y+ +PL+I P + +Q D +DL PT+ Sbjct: 325 VVLWSDHGWKLGEHRGWGK-MTNYEIDAHVPLLITGPGVKCLGQQTDQLAELLDLFPTLC 383 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRY-EIEHDSFGGFIPVRCWVTDDFKLVLNL 395 +A I+ P+ + G +++ + V + G T D++LV Sbjct: 384 EMAGIDVPDFVDGSSLVPILNDVDAKVHDGAVNQYYRRHEGRQYMGYSIRTSDYRLVEWR 443 Query: 396 F------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + ELYD RND +E +++D V ++ LL+ Sbjct: 444 DFFSGEVAAKELYDHRNDDSENESIVDSTE-PKVIDELTSLLLE 486 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. 
Identities = 113/515 (21%), Positives = 187/515 (36%), Gaps = 86/515 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+ D + CY + + NID A EG++ S Y P C+P+RAGL TG Sbjct: 34 RPNIMVVLCDDLGYGDLACYGHPVIQSPNIDRFAKEGLKLTSCYAAHPNCSPSRAGLMTG 93 Query: 63 IYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 + G + T+ + AGY TC++GKWHL+G P + Sbjct: 94 RTPFRVGIYNWIPMLSPMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNMVGQPQPSD 153 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D+WF N E + RN + +++ A ++L Q Sbjct: 154 HGFDHWFSTQNNALPTHENPFNFVRN--------ARPVGPLQGFASQLVADEAEEWLTQL 205 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF M V + EPH P + + Y + + P HH Sbjct: 206 RDKEKPFFMFVCFHEPHEPIASAERFRKLY----------TAPEGSTLPAHH-------- 247 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG-------EMM 289 +DD GR++ L ++ ENT +I+TSD+G Sbjct: 248 ---------------GNVTQMDDAFGRILKTLDDQKLRENTLIIFTSDNGPAITRRHPHG 292 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEI- 346 + L K A Y+ R+P I++ P+ D PV +D+LPT+ A+ADI P Sbjct: 293 SSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGVDILPTLCAVADIPAPTDR 352 Query: 347 -LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL-------------- 391 L G NIL + E + ++ + Y + + ++KL Sbjct: 353 VLDGTNILPLLEGKPILRKKPLYWQFNRAK--NDAKVALRDGEWKLLAKLNVPSPKPSGG 410 Query: 392 --------VLNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 V N ELY ++D E + + + ++ KM + D+++ Sbjct: 411 ITTEEIDAVKNAKLEGFELYHIQSDIAETTDRAESEQ--EILKKMKQQMQAIFDEVQAEA 468 Query: 443 RSYQ-WSLRPWRKDARPRWMGAFRPRPQDGYSPVV 476 + W + + + Sbjct: 469 PRWPAWEFARYEGKILSEYYRKQEEAEKQKKQKQP 503 >UniRef50_A3SJ21 Sulfatase n=1 Tax=Roseovarius nubinhibens ISM RepID=A3SJ21_9RHOB Length = 518 Score = 419 bits (1079), Expect = e-116, Method: Composition-based stats. 
Identities = 107/450 (23%), Positives = 184/450 (40%), Gaps = 22/450 (4%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +M D A + G Y + T ++D+LAA G+RF++AY +P+C P+R + Sbjct: 10 RRPNIVVIMADQLAPHFTGAYGHQVAKTPHMDALAARGMRFDAAYCNAPLCAPSRFAFMS 69 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G ++ + N + T Y GY TC GK H G D + D Sbjct: 70 GQLISRIAAYDNASEFRATVPTFAHYLSALGYRTCLSGKMHFVGPDQKHGFQD--RVTTD 127 Query: 122 YWFDGANYLSEL--TEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL----- 174 + + + ++ I W + + +V++ + + A +L Sbjct: 128 IYPSDFAWTPDWEAPDERIDKWYHNMQTVKESGCAIATFQTDYDDEVEFAARRWLIDRAR 187 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 + A + P MV S+ PH P+ E+ + Y+D EL E LA+ R Sbjct: 188 DRAAGQEAPLCMVASFIHPHDPYVARPEWWDLYSDDEIELPEVL--PLADHDPFSRRLMD 245 Query: 235 AMPSPV----GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM 289 + + D+ + Y A + D +IG ++ L +NT VI T+DHG+M+ Sbjct: 246 GIEASYVPLSRDEVIRARRAYLANVSYFDSKIGALVKTLDETGELDNTVVIVTADHGDML 305 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP---EI 346 G L K ++ R+PLI+ P + S IDLLP+ + +A ++ E Sbjct: 306 GERGLWYK-MNFFEHSARVPLIMAGPGVVQGAAANACSLIDLLPSFLEIAGADESVLGEP 364 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRN 406 + G +++ + + E+ + PV D K + +LYD Sbjct: 365 VDGRSLMPLARGEADPQDEA--ISEYCAEMTAWPVFMIRRGDLKYIHCDGDPPQLYDLSV 422 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 DP E N ++D +A + L D Sbjct: 423 DPGERVNRVEDPDYACRARMFAEELAGRWD 452 >UniRef50_A5FX90 Sulfatase n=4 Tax=Alphaproteobacteria RepID=A5FX90_ACICJ Length = 518 Score = 419 bits (1079), Expect = e-115, Method: Composition-based stats. 
Identities = 117/444 (26%), Positives = 176/444 (39%), Gaps = 16/444 (3%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L VM D + Y + T NID+LAA G+ F++AY SP+C P+R +G Sbjct: 17 RPNILIVMADQLGARALPAYGNQVALTPNIDALAAGGVVFDNAYCNSPLCGPSRYVFMSG 76 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 + G + N + + + + AGY T GK H G D E D Sbjct: 77 QLPSAIGAFDNAAEFPAMLPSFAHHMRAAGYRTILSGKMHFCGPDQMHGFE--ERLTTDI 134 Query: 123 WFDGANYLSELTEKEISL-WRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + + T+ W + ++SV + + + A L AR D Sbjct: 135 YPADFGWTPDWTDFATRPSWYHDMSSVREAGLCVRTNQMDYDDEVVFAARQKLFDLARDD 194 Query: 182 --EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA---QAM 236 PF MVVS PH PF EY Y ++ + P RL Sbjct: 195 DGRPFCMVVSLTHPHDPFAMTEEYWNLYDHDAIDMPRVRTAPASMDPHSLRLRHVSNMDN 254 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLI 295 Y+A FVD Q+GR+ + T + T+DHGE++G H L Sbjct: 255 EPVTEAQVRNARHAYYAAISFVDRQLGRLRETVEACGLAARTVTVMTADHGELLGEHGLW 314 Query: 296 SKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPEIL--PGENI 352 K + ++D RIPLI+ +P +V VS +D+LPT++ L P L G ++ Sbjct: 315 YK-MSFFEDACRIPLIVHAPGRFAPARVGAAVSSVDMLPTLVGLGGGRIPAGLACDGTSL 373 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 L E RG + E+ + G P+ K + D+L+D DP+E Sbjct: 374 LGHLEGRGG---HDGAFGEYLAEGAIAPIVMIRRGRHKFIHCPADPDQLFDLEADPDERA 430 Query: 413 NLIDDIRFADVRSKMHDALLDYMD 436 NL A + + + D Sbjct: 431 NLAAAPEHAALVAAFRAEVAARWD 454 >UniRef50_A0JVM4 Sulfatase n=2 Tax=Actinomycetales RepID=A0JVM4_ARTS2 Length = 479 Score = 419 bits (1078), Expect = e-115, Method: Composition-based stats. 
Identities = 120/497 (24%), Positives = 195/497 (39%), Gaps = 56/497 (11%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L +++D Q +GC + T ++D+LA+ G R ++ + SPVC+PARA L TG Sbjct: 7 PNILLILSDDQGAWALGCSGNTEIQTPHLDNLASGGTRLDNFFCVSPVCSPARASLMTGT 66 Query: 64 YANQSGPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 ++ G + AGY+ GKWHL +D Sbjct: 67 IPSKHGVHDYLHGVETGPEAPDYLQGQRLFTDDLAAAGYYMGLSGKWHLGANDRA----- 121 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 +WF A S +++RNG+ I+ + F+ Sbjct: 122 --REGFSHWFSLAGGGSPY--DAATMYRNGVKETVY---------GYLTDAITADSTGFM 168 Query: 175 QQPARADEPFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 ++ A D PF + ++Y PH P+ P E+ Y D +E + Sbjct: 169 ERAAGQDSPFFLALNYTAPHKPWKDQHPAEFTALYDDCAFESCPQEPT------HPWTPT 222 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGA 291 +P D YFA +D IG+V+ L E+T VI++SD+G G Sbjct: 223 VDGVPIGGEADVRAALVGYFAAVSAMDAGIGQVLQKLDELGLREDTLVIFSSDNGFNCGQ 282 Query: 292 HKLISKGA-----AMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKP 344 H + KG ++D ++P I P + + +S DL T++ LA ++ Sbjct: 283 HGVWGKGNGTFPLNVFDSSIKVPAIFSFPGRIARGKVREELLSAYDLPATILELAGLDPL 342 Query: 345 EILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFTSDEL 401 E G++ V + + R + D +G PVR +D +K V EL Sbjct: 343 EFEQGPGKSFADVLRGKPLAPARPRPVVVFDEYG---PVRMIRSDSWKYVHRYPQGPHEL 399 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA----- 456 YD DP E HNL+ ++R + + M + + ++ ++ P Sbjct: 400 YDLATDPGERHNLVREVRHEERVAGMRRDMQLWFEQYQE--EEADGRKFPVVGAGQTLPV 457 Query: 457 RPRWMGAFRPRPQDGYS 473 R +GAF P DG S Sbjct: 458 RADPLGAFTPPSWDGIS 474 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 419 bits (1078), Expect = e-115, Method: Composition-based stats. 
Identities = 106/485 (21%), Positives = 175/485 (36%), Gaps = 72/485 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN + +M D N VG + T IDS+A G++F + YT VC P+RAG T Sbjct: 19 SRPNLIVIMADDLGYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFIT 78 Query: 62 GIYANQSGPWTN--------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 G Y + G N N A K+ T+ GYH IGKWHL Sbjct: 79 GRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGAEPSL---- 134 Query: 114 CPPEWDADYWFDGANYLS-----ELTEKEISLWRNGLNSVEDL---QANHIDETFTWAHR 165 P + D +F +L + +N L+S + T Sbjct: 135 RPNKRGFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLTEE 194 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 S+ AV F+++ +PF + +SY+ PH P +YL ++ Sbjct: 195 FSDEAVSFIKR--NHQKPFFLFLSYNAPHLPLQATEKYLARFPHIKDP------------ 240 Query: 226 PEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSD 284 Y A VDD + +V+ +L +NT V + SD Sbjct: 241 ---------------------KRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSD 279 Query: 285 HGEM-----MGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMA 337 +G L + + +++ R+P ++ P ++ D PVS +D+ T+ + Sbjct: 280 NGGPSHKNKSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIAS 339 Query: 338 LADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-N 394 LA + L G N++ + I ++ DFKLV+ Sbjct: 340 LAQSPTHADKPLDGVNLIPFITGEKTQAPHAQIFIRKFDQSRYV----VRQGDFKLVIPY 395 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRK 454 +LY+ D E +N+ + ++ + ++ DP W+K Sbjct: 396 KDAPPQLYNLSKDIGEENNIAAV--HPERVKELEKVRKQWDSELMDPIFLGLLHTEAWQK 453 Query: 455 DARPR 459 A + Sbjct: 454 KAARK 458 >UniRef50_A6DPD0 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPD0_9BACT Length = 471 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. 
Identities = 103/466 (22%), Positives = 190/466 (40%), Gaps = 47/466 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N LF++ D +GCY K + + NID LA+EG F+ AY PVC +RA + T Sbjct: 24 EKNNVLFIIVDDLRPE-LGCYGNKQVLSPNIDRLASEGTLFSKAYCNVPVCGASRASVMT 82 Query: 62 GIYANQSGPWTNNVAPGKN---ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + N K + + F+ GY T IGK + + +DY + + Sbjct: 83 GLRPTKDRFISYNAKAYKESGGVLDLAGIFQKNGYTTISIGKVYHERNDYRSSWD----- 137 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDL--------QANHIDETFTWAHRISNRA 170 F + ++ + ++ L N + +A + + +++++ A Sbjct: 138 -----FKDSPLITSPSMRDYHLPENQAGRGKYSFEALGTACEAADEPDEKYFTYQLADAA 192 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 +D++ + + ++P+ + V + +PH PF P +Y + Y ++L + Sbjct: 193 IDYIDKTEKKNKPWFLAVGFTKPHLPFVAPKKYWDLYKRSDFKLASNPNMPKNAPTQASH 252 Query: 231 LWAQAM---------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVI 280 W + D L Y+AC F D IGR+++ L +NT VI Sbjct: 253 QWHELRKMYNDIPQTGPVPDDKALELKHGYYACVSFTDAMIGRILDYLDTNNLRKNTTVI 312 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQ-VDTPVSHIDLLPTMMALA 339 DHG +G H L K A ++ PLI+ + + V +D+ P++ LA Sbjct: 313 LWGDHGWQLGEHGLWCKHAN-FETSLNTPLIVSAAGQNAQGPSKALVEFVDIYPSLCDLA 371 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF--- 396 KP L G++ + + + + H T+ F Sbjct: 372 GFTKPPHLQGKSFAPLLKKPNTKWKSAVFSRYH-------AGDSIHTNRFLYTEWRNKSN 424 Query: 397 ---TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 T+ LYD + DP+E N+ + +A++ K+ L ++D Sbjct: 425 GNITARMLYDHQRDPDENFNIAANPEYAELVKKLSKRLQAHIDSWN 470 >UniRef50_A4AP83 Putative sulfatase n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AP83_9FLAO Length = 467 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. 
Identities = 115/465 (24%), Positives = 201/465 (43%), Gaps = 41/465 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN ++V+ D +G + T N+D LA+EGI F +A + SPVCTP R+ + T Sbjct: 22 KKPNIIYVLADQWRAEALGSNGNPNVITPNLDKLASEGISFTNAISTSPVCTPYRSMMLT 81 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G Y ++G + N+V+ + + G+ +K+ GY T YIGKWH+DG D Sbjct: 82 GRYPLKNGMFMNDVSLDPDSQSFGKLYKNEGYSTAYIGKWHVDGKGRSAFIPKERRQGFD 141 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 YW + + W N A + A+ F++ Sbjct: 142 YW---KVLECSHSYNNSNYWGNDDELHSW--------EGYDAAAQTKDAIAFIEAQTENK 190 Query: 182 EPFLMVVSYDEPHHPF-TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 PF +++S+ PH P+ T P E+ + Y + +L +LA Sbjct: 191 SPFCLILSWGPPHAPYKTAPKEFQKLYENMDIQLRPNVPVELA----------------- 233 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGA 299 ++ Y+A +D I ++ +A+ +NT ++TSDHG+++ +H K Sbjct: 234 -ENTKAMLKGYYAHCSALDSYIKQLQDAIKRNNLEDNTIFVFTSDHGDLINSHTER-KKQ 291 Query: 300 AMYDDITRIPLIIRSP---QGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 +Y++ ++P II+ P + R+ D ++ +D+LPTM+ ++ I+ PE L GE+I V Sbjct: 292 RIYEESAKVPFIIKYPALLGKQGRKSDFLLNTLDILPTMLGMSSIKAPEGLDGEDISDVI 351 Query: 357 EPRGVMVEFNRYEIEHDSFGGFIPV------RCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 FG + R +T + +L +D DP + Sbjct: 352 LGEKEDNRKAALVACIQPFGQWKRTLGGKEFRGVITKRYTYAKDLSGEWLFFDNVEDPYQ 411 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD 455 ++NL+ + F V + + L +D++ D F L W Sbjct: 412 LNNLVGNPSFKSVAENLEELLDKELDRLDDDFLPGASYLETWGHK 456 >UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW2_9BACT Length = 479 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. 
Identities = 106/505 (20%), Positives = 180/505 (35%), Gaps = 91/505 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF++ D +G Y T N++ LA++ +RF+ AY S VC+P R + T Sbjct: 26 QRPNILFIVADDMGIMDLGVYGSDYYLTPNLNKLASQSMRFDRAYAASHVCSPTRGAILT 85 Query: 62 GIYANQSGPWT---------NNVAPGKNI--------STMGRYFKDAGYHTCYIGKWHLD 104 G Y + N N T R + Y T GKWHL Sbjct: 86 GRYPQRIHLTDALPWDRLYKNPKMIPPNHVKELSLKLPTFARVLQKNDYRTAMFGKWHLG 145 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 + F TG+ + D F + + Sbjct: 146 NEERFFTGKEHKAYGFDEAFGVSGKAKAYDKGVNE------------------------- 180 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++ R + FL++ +PF++ + + PH P CP Y Sbjct: 181 -LTERTLRFLKE--NKKKPFMLCLMHHVPHVPVACPPYAKALYDSVP------------- 224 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTS 283 K +H + Y D+ I +V++AL +NT VI TS Sbjct: 225 KGKHQKNSK-----------------YAGMISHFDNSIKKVLDALRALGLDDNTVVIVTS 267 Query: 284 DHGE---MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 D+G + ++Y+ TR+PL+IR P + V D PT + L Sbjct: 268 DNGGLSNLSSNKPYNGGKGSLYEGGTRVPLLIRWPGKITPGSVNKSVVISNDFFPTFLEL 327 Query: 339 ADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A + L G++++ + + + + + H P + D KL+ + Sbjct: 328 AGLPLMPEAHLDGKSMMPLLKGKTLGKRTLYWHFPH----RGTPGSSIIDGDLKLIHKIE 383 Query: 397 -TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD 455 + E++D +DP E +NL + + S++ L ++ ++ S P R Sbjct: 384 SDTYEMFDLNSDPYEANNLFEKQ--PEQASRLQKMLARHLKEVAAQEMSPNPQWDPKRPK 441 Query: 456 ARPRWMGAFRPRP-QDGYSPVVRDY 479 +P G P + G+ V Y Sbjct: 442 GKPTNFGIHYPAGRKKGFRLTVEAY 466 >UniRef50_Q7UWE8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWE8_RHOBA Length = 488 Score = 417 bits (1074), Expect = e-115, Method: Composition-based stats. 
Identities = 113/451 (25%), Positives = 181/451 (40%), Gaps = 26/451 (5%) Query: 3 RP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +P N L + D +GCY +++ NID LAA G+RF+ AY VC +RA L + Sbjct: 33 KPLNVLMIAVDDLRPE-LGCYGKSYMHSPNIDRLAASGMRFDRAYCQVAVCGASRASLMS 91 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE 117 G + W ++ T+ ++ GY T ++GK +H D E Sbjct: 92 GCRPETTQCWNFKTLLRSQMPDVLTLPQHLSRNGYETGFLGKVYHSASDDAAAWTVDANE 151 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 W G +Y+ EL K + N + ++ ++RAV L++ Sbjct: 152 WAPRDRSKGKSYVQELPRKRNPANSSEKNGPSIENGGDVPDSAYTDGHNADRAVALLERF 211 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ--- 234 + D+PF + V + +PH PF P +Y + Y ++ +D + P W + Sbjct: 212 STQDKPFFLAVGFLKPHLPFNAPAKYWDLYDRDDIKIP-SREDVVDGLPYARSSWGELKN 270 Query: 235 ------AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE 287 + Y A ++D Q+G+V+NAL RENT V+ DHG Sbjct: 271 YTDIPAKTDMLDDEKTRELIHGYRAAVSYMDAQVGKVLNALEANGQRENTIVVLWGDHGW 330 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G K Y+ TR+PLI+ +P + + V +DL PT+ L ++ PE Sbjct: 331 YVGDFGDWCKHTN-YEIATRVPLIVSAPGVPAGETKSLVELVDLFPTLCELTELPVPEHC 389 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPV--RCWVTDDFKLVLN---LFTSDE-- 400 G++I V G+ V + S G PV TD F+ E Sbjct: 390 QGKSIAGVVHDPGLSVRPAAFSQYKKSKLGVGPVLGTSIRTDRFRYTEYVSTKTGKLEDI 449 Query: 401 -LYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 L D DP N+ D + ++H Sbjct: 450 VLIDFDKDPGATRNVASDPAYQPFLPQLHAW 480 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 417 bits (1074), Expect = e-115, Method: Composition-based stats. 
Identities = 104/503 (20%), Positives = 185/503 (36%), Gaps = 93/503 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D + CY + T ++D LA+EG+R Y +PVC+P+RAGL TG Sbjct: 32 KPNIVIILCDDLGYGDLACYGHPVIKTPHLDQLASEGMRLTDCYASAPVCSPSRAGLLTG 91 Query: 63 IYANQSGPWT-----NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 N+ G + + + ++ T+ + + AGY T ++GKWH +G P + Sbjct: 92 RTPNRLGVYDWIPEGHPMHLKRDEVTVAQLLQQAGYDTAHVGKWHCNGMFNSKEQPQPGD 151 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 +WF N E + RNG E +++ + +L Sbjct: 152 HGFRHWFSTQNNALPTHENPNNFVRNGKPLGEI--------EGFSCQIVADEGIRWLSDW 203 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF + V + EPH P +E Y D Sbjct: 204 REKEKPFFLHVCFHEPHERVASPPALVETYLDKSL------------------------- 238 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE--------- 287 YFA +D +G+++ L + +NT V +TSD+G Sbjct: 239 -------YEDQAQYFANVANMDRAVGKLLIKLDELKVADNTLVFFTSDNGPETLNRYGKG 291 Query: 288 ---MMGAHK-LISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADI 341 G+ L +Y+ R+P I+R P +++ TPV +DLLPT +A + Sbjct: 292 SRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEIATPVCSVDLLPTFCEIAGV 351 Query: 342 EKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 P+ L G ++L + + + + ++ P D+K+V + + Sbjct: 352 AVPDQRPLDGASLLPLFAGNKIERTTPLFWNYYRAYS--TPRVAMREGDWKVVAHWSGPE 409 Query: 400 -------------------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 ELY+ ++D +E HNL + + L+ Sbjct: 410 GIIPLGGNVNSVSQEIIKNAKLTKFELYNLKDDISEQHNLA--WQEQKRLDTLKKKLVQK 467 Query: 435 MDKIRDPFRSYQW-SLRPWRKDA 456 ++ + RK Sbjct: 468 YAAVQKEGPVWDTSEYDQSRKKT 490 >UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7US96_RHOBA Length = 498 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. 
Identities = 107/515 (20%), Positives = 182/515 (35%), Gaps = 86/515 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L + D +GCY + T ID LAAEG+RF + Y VC+P R L +G Sbjct: 31 QPNILLIFIDDLGWKDIGCYGNDFVETPRIDQLAAEGLRFTNFYASGAVCSPTRCALQSG 90 Query: 63 IYANQSGPWTN----------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 + G + +A + T+ K +GY T Y+GKWHL Sbjct: 91 QNQARIGITAHIPGHWRPFERVITPQTTMALPLDTVTIAESLKASGYTTGYVGKWHLGNG 150 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 F +D G + + S + Sbjct: 151 PEFQPDRQ--GYDFSAVIGGPHLPGRYRVQGRSDLKP-------------KPNQYRTDFE 195 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 ++ +DF++Q D+PF +++S H P E ++KY + G Sbjct: 196 ADLCIDFMRQ--NKDQPFFLMLSPFAVHIPLAAMSEKVQKYEAMAKQTGNSLP------- 246 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 HP+Y A + DD +GR++++L ++T +++TSD+ Sbjct: 247 ---------------------HPVYAAMIEHCDDMVGRLVDSLEQLDIADDTMIVFTSDN 285 Query: 286 GEM--------------MGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHI 329 G + L + ++++ R+PLIIR P D P Sbjct: 286 GGLYKRYDYRESADDLVSSQAPLKGEKGSLHEGGIRVPLIIRHPATVKSAGVCDEPTISH 345 Query: 330 DLLPTMMALADIEKP--EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 D PT + +A E P + + G ++L + ++ + + + P Sbjct: 346 DFYPTFVEMAGGELPINQTIDGHSLLPLMTAPTQTLDRDALHWHYPHYHHDRPASAIRER 405 Query: 388 DFKLVLNLFTSD--ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 D+KL+ L + ELY+ +D E NL + + + L + + Sbjct: 406 DWKLIEYLDGTGDVELYNLADDLGETKNLASEKQ--GRAGDLKRKLTTWRSSVLARTPIP 463 Query: 446 QWSLRPWRKDARPRWM-GAFRPRPQ-DGYSPVVRD 478 S P R G P Q + P +D Sbjct: 464 NPSYDPERAHEWWNLKSGKPVPSEQRKRFPPTEKD 498 >UniRef50_Q01PN7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01PN7_SOLUE Length = 496 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. 
Identities = 112/479 (23%), Positives = 187/479 (39%), Gaps = 24/479 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D + +G + ++T N+D LAA G+RF +AY+ +P CTPARAGL TG Sbjct: 24 RPNILLLMADQWRADCLGAAGNRAIHTPNLDQLAASGVRFTNAYSATPTCTPARAGLLTG 83 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP------- 115 + G + M R +DAGY+T IGK H Sbjct: 84 LAPWNHGMLRYAEVGARYPVEMPRALRDAGYYTAAIGKLHYHPQRNVHGYHQALLDESGR 143 Query: 116 ---PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 P++ +DY + L L N D + + E A Sbjct: 144 IESPDFRSDYRSWFWSQAPNLDPDATGLGWNDF----DARPYTLPERLHPTTWTGQTAAS 199 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 +++ R++ PF + VS+ PH P+ P +Y D A Sbjct: 200 WIETYQRSE-PFFLKVSFARPHSPYDPPDRLWRRYQDAPLPPAAVAGWASRYAARSGPQP 258 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMGA 291 + Y+ FVD+QIGR++ +L + T +++ SDHG+M+G Sbjct: 259 DAWHGDLGAEQVRRSRQGYYGSVTFVDEQIGRIMESLTRRGLLDQTLIVFFSDHGDMLGD 318 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG-----ERRQVDTPVSHIDLLPTMMALADIEKPEI 346 H L K + Y +R+P ++R P+G +D V D+LPT + A Sbjct: 319 HNLWRK-SYAYAGSSRVPFLVRWPEGMLTARRGGTIDQMVELRDVLPTFLDAAAAAPARP 377 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN-LFTSDELYDRR 405 L G+++L + + + +K + + ++L+D + Sbjct: 378 LDGQSLLPLIAGKSPAWRPFLDLEHGVCYSPDNHWNALADQQYKYIFHARDGREQLFDVQ 437 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF-RSYQWSLRPWRKDARPRWMGA 463 D +E+H+L D A + ++ ++ D + R + R P + GA Sbjct: 438 RDAHELHDLSGDPAAAAKLREWRQRMIAHLSPRGDHWVRGGNLATREDDPAYSPNYPGA 496 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. 
Identities = 104/448 (23%), Positives = 169/448 (37%), Gaps = 57/448 (12%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L ++ D + C L + NID+LAA G+RF + Y S VC+P+RA L TG Sbjct: 42 PNILCILVDDLGYGDLSCQGATDLQSPNIDALAANGMRFTNFYANSTVCSPSRAALLTGR 101 Query: 64 YANQSGPWT--------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 Y + G N + + AGYHT IGKWHL + + P Sbjct: 102 YPDLVGVPGVIRQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLGLEEP----DTP 157 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + Y+ + + + R G + L ID ++ +DFL+ Sbjct: 158 NDRGFTYFKGFLGDMMD----DYWDHRRGGINWMRLNREEIDPKGHATDLFTDWTIDFLK 213 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + ++PF + ++Y+ PH P P E+L+K + L EK ++ Sbjct: 214 ERQGEEQPFFLYLAYNAPHFPIQPPREWLDKVREREPNLTEKRAKNV------------- 260 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMM----G 290 A + +D +GRV+ AL E NT V++ SD+G + Sbjct: 261 -----------------AFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGGALWYAQS 303 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEILP 348 L MY+ R+P I D +DL PT LA + PE + Sbjct: 304 NGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPPENVD 363 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSFG--GFIPVRCWVTDDFKLVLN-LFTSDELYDRR 405 G +++ + G G DFK++ N F + ++ Sbjct: 364 GISLVPTLTGQAQDTANRYLYWVRREGGDYGGQAYYAARFGDFKILQNTPFEPIQFFNIG 423 Query: 406 NDPNEMHNL-IDDIRFADVRSKMHDALL 432 D E L D + +R+++ + + Sbjct: 424 QDELETTPLETDSEAYRALRAQLMEHIR 451 >UniRef50_Q7WC54 Putative sulfatase n=3 Tax=Proteobacteria RepID=Q7WC54_BORPA Length = 529 Score = 416 bits (1070), Expect = e-114, Method: Composition-based stats. 
Identities = 106/452 (23%), Positives = 175/452 (38%), Gaps = 18/452 (3%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PNFLF+M D + Y T N+D LAA RF + Y P+C P+R + T Sbjct: 5 KQPNFLFLMADQLTAFALRMYGNGVCRTPNLDRLAARSTRFANMYCNFPLCAPSRVAMLT 64 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G + G + N + T + AGY T GK H G + + D Sbjct: 65 GRLPSSVGVYDNASEFSAEVPTFLHHLALAGYSTILSGKMHFVGPEQHHGFQ--ERLTTD 122 Query: 122 YWFDGANYLSELTEK-EISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA-- 178 + + + E+ I+ + SV + + + R V + Sbjct: 123 IYPSDFGWTPDWREEIPIAPTGMNMRSVIEAGEYRRSMQIDYDDDVVYRGVQKIYDLGRL 182 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL----WAQ 234 D PF + VS PH+P+ E+L+ Y ++ A + H + + Q Sbjct: 183 HRDRPFFLAVSMTHPHNPYVSTREFLDLYRPEDIDMPAVPPIPFAQQDPHSQRLWYMFRQ 242 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAHK 293 Y+A +VD Q+GR+++AL + T V++T+DHG+M+G Sbjct: 243 DEYDVSDAHVRAARHAYYAMVSYVDAQVGRMLDALQAMDLDESTVVVFTADHGDMLGERG 302 Query: 294 LISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEI-----L 347 L K +D RIPL+I +P S +D+ PTM+ LA + P+ Sbjct: 303 LWYKW-VHFDPAVRIPLLISAPGRTRPAVRHELASLVDIFPTMLELAGVSVPDDGASPPP 361 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G ++ E + G P +KLV+ L++ ++D Sbjct: 362 DGRSLAEGL-GVSQDEPTGVVYGEMNGEGAHAPCLAVRQGWWKLVVAEGDPPLLFNLQDD 420 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 P+E+ NL D+ ++ + D R Sbjct: 421 PHELRNLAGQPAARDIERQLTALVQARWDARR 452 >UniRef50_A6DJJ1 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJJ1_9BACT Length = 510 Score = 416 bits (1070), Expect = e-114, Method: Composition-based stats. 
Identities = 101/480 (21%), Positives = 180/480 (37%), Gaps = 51/480 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N LF+ D M+GCY + + T NID +A G F +A +C P+RA L T Sbjct: 29 KKMNVLFIPIDDLKP-MLGCYGDQAIITPNIDRIAERGTVFLNASCQQAICGPSRASLMT 87 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y + + W +I ++ +YFK GY T +GK G + P W Sbjct: 88 GMYPDHTKVWDLATKMRDINPDILSIPQYFKQQGYETTGVGKTFDPRCVDGGKFQDKPSW 147 Query: 119 DADY----WFDGANYLSELTEKEISLWRNGLNSVEDLQAN------------------HI 156 Y AN K+ + G Q N + Sbjct: 148 SIPYHKAGGKGYANPEVAKAWKKAAELVKGRTFKMGYQRNKAMARLGDPICRPATECMDV 207 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 + ++ L++ ++AD+PF + V + +PH PF P +Y + Y ++ E Sbjct: 208 PDHVYKDGAVARVGAKLLEELSKADKPFFLSVGFAKPHLPFVAPKKYWDMYNSHDIQVAE 267 Query: 217 KAQDDLANKPEHHRLWAQ--------AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINA 268 + + ++ + + + Y A ++D Q+G +++ Sbjct: 268 YQKSAKNDTKIAYKSLGEIAAYSDMPEKGPIDQETQKHLIHGYMATTSYMDAQLGLLLDK 327 Query: 269 LTPEQ-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-QGERRQVDTPV 326 L NT + DHG +G H + +K ++ R PL+I +P + + PV Sbjct: 328 LEELGIANNTIICLWGDHGFHLGDHGMWTKHTN-FEQAVRSPLLIAAPKGFKPNSTNAPV 386 Query: 327 SHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 +D+ PT+ LA ++ P LPG+++ V + V + G + Sbjct: 387 ELVDIFPTLCDLAGLDIPTHLPGKSLAPVMKDTSTSVRYAALGQYP--RGNKTMGYTLRS 444 Query: 387 DDFKLVLNLF------------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 + ++ V L + +L+D DP E NL + + + Sbjct: 445 ERYRYVKWLNLDYRKSVAKGKLVATQLFDYEKDPLETVNLAANPEYKKIIDSFEAEFARR 504 >UniRef50_A3HWG3 Choline sulfatase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HWG3_9SPHI Length = 505 Score = 415 bits (1069), Expect = e-114, Method: Composition-based stats. 
Identities = 100/466 (21%), Positives = 189/466 (40%), Gaps = 37/466 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTC----SPVCTPARA 57 ++PN LF+ D Q + +G + T ID L EG RF++AY +C +RA Sbjct: 41 QKPNVLFLFADDQRADALGINGNPYIQTPTIDQLGREGSRFSNAYVMGGVHGAICMSSRA 100 Query: 58 GLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 LF+G + TM F AGY T GKWH + + + + Sbjct: 101 MLFSGKNLYK--VTDK----LSGEHTMTMSFAAAGYRTFGTGKWHNEKEAFEASFQEAKN 154 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 +++ + + + D + + + A+DF++ Sbjct: 155 -------VYLGGMADHYDLPLRDYG------ADGKLGEPTRKGFSTEQFAQAAIDFIKDH 201 Query: 178 --ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 D+PF V++ PH P++ Y+ Y D L + +H + + Sbjct: 202 GQRNTDQPFFCYVAFTAPHDPYSPEANYINHYPDGTLPLPGNYMPYHPFEFDHLTVRDEN 261 Query: 236 MPSP--VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 + + Y+A +D QI +++N L +NT ++Y +D+G G+H Sbjct: 262 LTGWPRKPEVIQMILSDYYALVTHLDTQIAKILNTLKETGQYDNTIIVYAADNGLAAGSH 321 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L+ K ++Y+ +++PLII+ P + +++D DL PT+ LA I P + G + Sbjct: 322 GLLGK-QSLYEHSSKVPLIIKGPGVPQDQELDAFAYIHDLYPTLAELAGIPDPSDIDGVS 380 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNE 410 ++ V V + VR +KL+ +L+D DP E Sbjct: 381 LVPVITGEQDGVRDALFTSYRG------TVRAVRNKKYKLIRYPERDYTQLFDLDADPLE 434 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA 456 ++NL ++ + +S+M + + + + +D + ++P + D Sbjct: 435 INNLAENTEYQSKKSEMFELMEKWQNSFQDTVKLTADKIKPMKYDP 480 >UniRef50_A6DNH0 Choline sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNH0_9BACT Length = 466 Score = 415 bits (1069), Expect = e-114, Method: Composition-based stats. 
Identities = 108/446 (24%), Positives = 189/446 (42%), Gaps = 32/446 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L + D + G + T ++D LA G F +A+ PVC+ +R + Sbjct: 18 EKPNVLMISIDDL-NDWTGFLGGHPQVKTPHMDKLANSGRIFANAHCAVPVCSSSRVSVM 76 Query: 61 TGIYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 +G+ A G + ++ K++ T+ R+FK+ GY+T GK G + Sbjct: 77 SGLAATTHGSYEIGPSYQSIPALKDVLTIQRHFKNQGYYTLAGGKVLHHGFKGSVANDND 136 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + L E W G + D QA+ + +++ A LQ Sbjct: 137 RSLIKGHSGPKPKQPLNLPEGWSRAWDWGQHPGTDAQAHDMK--------LAHNAAQALQ 188 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + D+PF M V + PH P P ++ Y + L + DL + P++ Sbjct: 189 E--DFDKPFFMSVGFFRPHVPLLVPPKWFNLYDEESIVLAPSPKSDLDDVPKNFLSINDY 246 Query: 236 MPSPV------GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM 288 +P D Y A FVD +GRVI+AL + +NT VI SDHG Sbjct: 247 AVAPTHKEVLATDSHRKLTHAYLASISFVDACVGRVIDALKNSKYADNTIVILWSDHGFH 306 Query: 289 MGAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G + +K ++++ T++PL++ P E P S ID+ PT++ L ++ P+ L Sbjct: 307 LGEKEHWAK-RTLWEESTKVPLLVYGPGIESGEACLEPASLIDIYPTLVDLCGVKAPKKL 365 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G +++ + + + T D++ + ++ELYD +ND Sbjct: 366 DGISLMPQLKNPLSERKQPAIISSYYGN------HAVRTRDWRFISYEDGAEELYDHKND 419 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLD 433 P+E NLI+D + +R ++ L Sbjct: 420 PDEYKNLINDPNYKSIRDELAQWLPK 445 >UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW5_9SPHI Length = 523 Score = 415 bits (1068), Expect = e-114, Method: Composition-based stats. 
Identities = 103/475 (21%), Positives = 170/475 (35%), Gaps = 78/475 (16%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +++ D +GCY + + T N+D LA EGIRF YT PVC PAR L TG Sbjct: 47 PNIIYIYADDLGYAELGCYGQQKIRTPNLDKLAREGIRFTQHYTSMPVCAPARCMLLTGK 106 Query: 64 YANQSGPWTN-------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 ++ S N + T+GR + GY T +GKW + + G Sbjct: 107 HSGHSYIRGNYEMGGFPDSLEGGQMPLYPGAFTIGRLLQQQGYKTACVGKWGMGMANTTG 166 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE------------ 158 P E DY++ + LW NG + + Sbjct: 167 ---NPNEQGFDYFYGYLDQKQAHNYYPTHLWENGKPDKLNNPVIDVHRRLTPETATPEAF 223 Query: 159 -----TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPV----EYLEKYAD 209 +++ +A F++Q PF + + + PH P EY+ K+ D Sbjct: 224 AYFRGNDYAIDKLAQKAQAFIRQ--NKSGPFFLYLPFTAPHVSLQAPEAAVKEYIGKFGD 281 Query: 210 FYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 + +P + P P Y A +D QIG+++ L Sbjct: 282 ---------GEQRTERPYLGEQGYASTPYP--------RATYAAMITHMDAQIGQLMQLL 324 Query: 270 TPEQRE-NTWVIYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQ-- 316 + + NT V+++SD+G KL +Y+ R P++ R P Sbjct: 325 KDLKIDENTLVMFSSDNGATFNGGVEAAYFNSVGKLRGLKMDVYEGGIREPMLARWPGRI 384 Query: 317 GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFG 376 + D DLL T+ L ++P G + L + + + + + + Sbjct: 385 KPNQTTDHVSVQYDLLATLAELVGYKRPFATDGISFLPTLLGQSSSQKQHPFL--YWEYP 442 Query: 377 GFIPVRCWVTDDFKLV-----LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSK 426 ++K V + T ELYD D +E N+ D + D+ + Sbjct: 443 EKGGQLAIRMGNWKAVKTNVRKDRTTPWELYDLNKDVSETTNIAD--KHPDIIRQ 495 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 415 bits (1068), Expect = e-114, Method: Composition-based stats. 
Identities = 103/461 (22%), Positives = 183/461 (39%), Gaps = 49/461 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F++ D + CY + T NID LA G+RF Y+ + V P+R+ L TG Sbjct: 36 RPNVVFILADDLGYGDLSCYGQEKFETPNIDRLAQNGMRFTQCYSGTTVSAPSRSCLITG 95 Query: 63 IYANQSGPWTNNV-------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 ++ + N +N T+ F++AGY T GKW L Y G+ P Sbjct: 96 THSGHTAIRGNKELAPEGQFPLPENSQTIFNDFRNAGYRTGAFGKWGLG---YIGSAGDP 152 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI--DETFTWAHRISNRAVDF 173 + D ++ L + LW N ++ + I ++A+ F Sbjct: 153 YKQGIDQFYGYNCQLLAHSYYPDHLWDNDKRVDLPDNNLNVQYGKGTYSQDLIHSKALAF 212 Query: 174 LQQPAR-ADEPFLMVVSYDEPHHPFTCPVEY-LEKYADFYYELGEKAQDDLANKPEHHRL 231 L + A+ D+PF M PH P + ++K+ Y E + + + Sbjct: 213 LDEAAKEKDQPFFMWYPTIIPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKGGY 272 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG 290 Q P H + A +D +G+++ L +NT +I++SD+G M Sbjct: 273 CTQFYP----------HATFAAMVYRLDVYVGQIVQKLKDMGVYDNTIIIFSSDNGPHM- 321 Query: 291 AHK-----------LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMA 337 +Y+ R+P+II P + D S DL+PT Sbjct: 322 EGGADPDFFNSNGIWRGYKRDVYEGGIRVPMIISWPGHVQPSTETDFMCSFWDLMPTFRE 381 Query: 338 LADIEKP-EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NL 395 + + + + G +IL + + R E E G + D+KLV N+ Sbjct: 382 VLNPKADTRNMDGVSILPLLQNRKGQKEHEYLYFEFLEMNGR---QAVRKGDWKLVHMNI 438 Query: 396 FTSD---ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + ELY+ +DP+E +N+++ ++ + ++ + + Sbjct: 439 RGNKPYYELYNLASDPSEKYNVLN--QYPEKADELKAIMKE 477 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 415 bits (1068), Expect = e-114, Method: Composition-based stats. 
Identities = 103/480 (21%), Positives = 182/480 (37%), Gaps = 88/480 (18%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN + D +GCY + T ++D LA+EGIRF + Y+ SPVC+P+RA L Sbjct: 1 MKRPNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLL 60 Query: 61 TGIYANQSGPW------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 TG Y ++G +T+ K+ GYHT GKWHL +G Sbjct: 61 TGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAEYG---- 116 Query: 115 PPEWDADYWFDGA----NYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 P D ++ +Y S + N ++ + + + I+ A Sbjct: 117 PNAHGFDQFYGFRAGCIDYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREA 176 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 ++ DEP+ M V+Y+ PH+P P YL+++ D + Sbjct: 177 TSYIDAAPD-DEPYFMYVAYNAPHYPMHAPKAYLDRFPDLPPD----------------- 218 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG--- 286 + A VDD +G ++ AL + E+T + ++SD+G Sbjct: 219 -----------------RRIMAAMIAAVDDGVGEIVKALKQKGAYEDTIIFFSSDNGPST 261 Query: 287 ------------EMMGAHK-LISKGAAMYDDITRIPLIIRSPQG----ERRQVDTPVSHI 329 G+ A++++ R P I+ P G + + D + + Sbjct: 262 ESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQQGQISDEMFAMM 321 Query: 330 DLLPTMMALADI-EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 D+ PTM+ L+ I + L G ++ + F + Sbjct: 322 DIFPTMLELSGIGTEGYSLDGHSVFDALSGNALSPRK-------QLFWEYEGQLAVREGK 374 Query: 389 FKLVLNLF--------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +KLVLN + L D D +E NL+ ++ ++ ++ + + +++ Sbjct: 375 WKLVLNGKLDFSRTEADAVHLSDLEQDSSERINLVK--QYPEIAQRLERDVRQWYQSLQE 432 >UniRef50_C0G116 Sulfatase n=1 Tax=Natrialba magadii ATCC 43099 RepID=C0G116_NATMA Length = 499 Score = 415 bits (1068), Expect = e-114, Method: Composition-based stats. 
Identities = 131/493 (26%), Positives = 201/493 (40%), Gaps = 53/493 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKP---LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 RPN L V+TD + + G + T+ ID L+A G F A+T +C+ ARA L Sbjct: 7 RPNVLLVLTDQERYDC-SALDGPVAETVETETIDHLSATGTHFERAFTPISICSSARASL 65 Query: 60 FTGIYANQSGPWTNNVA-------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 TG + + G N + T DAGYH Y GKWH+ Sbjct: 66 LTGQFPHGHGMLNNCHEDDALQPNLPPGVPTFSEKLDDAGYHLTYTGKWHVG------RD 119 Query: 113 ECPPEWDADYWFDGANYLSELT--------EKEISLWRNGLNSVEDLQANHIDE------ 158 + P ++ Y + ++ E+ + L+ V N D+ Sbjct: 120 QTPEDFGFSYLGGSDKHHDDIDDAFREYRAERGTPVGEADLDDVIYTGTNPRDDSNGTFV 179 Query: 159 --------TFTWAHRISNRAVDFLQQPARADE--PFLMVVSYDEPHHPFTCPVEYLEKYA 208 T A ++ R +D +++ A D PF + PHHP+ P Y Y Sbjct: 180 AATTSVEVEETRAWFLAERTIDAIEEHASRDRDAPFFHRADFYGPHHPYVVPEPYASMYD 239 Query: 209 DFYYELGEKAQDDLANKPEHHRLWA--QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 +L E + A KP H + + + D Y+ +DDQ GR++ Sbjct: 240 PENIDLPESYAETDAGKPRVHANYRSYRGVEQFDRDVWKEAIAKYWGFVTLIDDQFGRIL 299 Query: 267 NALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVD 323 +AL + T V++ SDHG+ G H+ +KG MYDD IPL +R P + Sbjct: 300 DALESTGLTDETVVVHASDHGDFAGGHRQFNKGPLMYDDTYHIPLQVRWPGVTEPGSVRE 359 Query: 324 TPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVM-------VEFNRYEIEHDSFG 376 PV DL T + + + PE +++ + + G + + H Sbjct: 360 EPVHLHDLAATFLEMGGVAIPESFDSRSLVPLLDADGPEQESAPSAWPDSVFAQYHGDEF 419 Query: 377 GFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMD 436 G R TD +K V N DELYD DP E+ NLID +ADVR ++ L+D+M+ Sbjct: 420 GLYTQRMVRTDRYKYVYNAPDVDELYDLEADPAELQNLIDHPDYADVRRELRTRLIDWME 479 Query: 437 KIRDPFRSYQWSL 449 + DP R + + Sbjct: 480 ETDDPNRQWVPDV 492 >UniRef50_C9L4R5 Mucin-desulfating sulfatase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4R5_RUMHA Length = 484 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. 
Identities = 113/479 (23%), Positives = 194/479 (40%), Gaps = 72/479 (15%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+M D Q + + C K L T N++ +A G++F + Y SPVC+PARA + TG Sbjct: 2 NILFIMADDQGSWAMNCGGTKELCTPNLNRIAESGMQFQNFYCVSPVCSPARASVLTGDI 61 Query: 65 ANQSGPWT----------------------NN-------VAPGKNISTMGRYFKDAGYHT 95 + G N ++ + +T + GY Sbjct: 62 PSSHGVHDWIRSGNIDKDKFEEAGRENPYWNGYSCEDKPISYLEGKTTYTDVLNENGYRC 121 Query: 96 CYIGKWHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANH 155 GKWHL P+ W+ L + NG V Sbjct: 122 ALAGKWHLGD-------SVCPQHGFSKWYTIG--LGGCDYFHPDIVENGNIKVLH----- 167 Query: 156 IDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF---TCPVEYLEKYADFYY 212 I+N+A+++L + +EPF + V + PH P+ P ++++ Y + + Sbjct: 168 ---EQYVTEVIANKAIEYLNEFQHQEEPFYLSVHFTAPHSPWGEEQHPKKWMDYYENCDF 224 Query: 213 ELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE 272 + + D A+ P+ P + + YFA +D+QIGR+++ L Sbjct: 225 Q----SIPDEADHPDL-----TTGPVFGTEKRKENLRGYFAAISAMDEQIGRILDTLEAN 275 Query: 273 QR-ENTWVIYTSDHGEMMGAHKLISKGA-----AMYDDITRIPLIIRSPQGE--RRQVDT 324 ENT V+YT+D+G MG H + KG MY+ ++P ++ P ++ +T Sbjct: 276 GLRENTLVVYTADNGMSMGHHGVWGKGNGTFPFNMYETSVKVPFLMSLPGVIPQGKREET 335 Query: 325 PVSHIDLLPTMMALADIEKP--EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR 382 +S D+ PT++ L +++ E LPG + + + + + D +G PVR Sbjct: 336 ILSAYDIFPTLLELCKLDRKECEKLPGRSFAYLLRWEKEHKKRDEEIVVFDEYG---PVR 392 Query: 383 CWVTDDFKLVL-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 D+K + + ELY DP E NL + + +M L ++ +K D Sbjct: 393 MIRNQDWKYIHRYPYGPHELYYLTEDPEEKENLYGQPEYEKMVVEMRTRLNEWFNKYAD 451 >UniRef50_Q7UYA8 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA8_RHOBA Length = 745 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. 
Identities = 108/456 (23%), Positives = 188/456 (41%), Gaps = 37/456 (8%) Query: 3 RPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN LF+ D + VGC T N+D A + + FN+A+ +C +RA T Sbjct: 308 RPNVLFITVDDL-NDWVGCLGGNPDAQTPNLDRFAQQSVLFNNAHCQVALCYASRASFMT 366 Query: 62 GIYANQSGPWTNNVAPGKNIST----MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G+YA+++G + N+ ++ M +F ++GY T +GK + + H + Sbjct: 367 GMYASKTGIYNNSSKSARDAYHRAKQMPVWFGESGYRTMCMGKIYHNDHGKKAYWD---- 422 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDET--FTWAHRISNRAVDFLQ 175 + + E R G ++ + L +D +I+ ++ L Sbjct: 423 ---EIGPKTLRWGPEPPNGRQFTKRFGKDAQDSLAWAALDIEKGGMPDEQIAAWGIEKLD 479 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 Q D+PF + + + +PH P T P Y E++ L ++DL + PE R W Sbjct: 480 Q--EYDQPFFLSLGFYKPHTPMTAPKRYFEQFDRDSLTLPNVLENDLDDVPEIGRRWVLD 537 Query: 236 MPSPVGDDG---------LYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 + ++ Y AC +DD IG+V+ L NT V+ SDH Sbjct: 538 RSKLIAEEAVKQYSPTYRRELVHAYHACVALIDDCIGQVLRKLDNSPYANNTIVVLCSDH 597 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSP--QGERRQVDTPVSHIDLLPTMMALADIEK 343 G +G K +++ TR LI+R+P G + V ID+ PT+ L ++ Sbjct: 598 GWHLGEKNHWRKWM-PWEESTRSLLIVRTPDAAGSGQVCQRTVGLIDIYPTLAELCELSP 656 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 P+ L G + + + + +D ++ + + S+ELYD Sbjct: 657 PDGLQGLSFRKLLDNPDGPWDRPALT------STKAGNHTVRSDRWRYIRYIDGSEELYD 710 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 DPNE HNL +D ++ + H +D + + Sbjct: 711 HDVDPNEWHNLANDPSMNSIKKQ-HAEWIDRLTESN 745 >UniRef50_B1KD82 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD82_SHEWM Length = 526 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. 
Identities = 112/468 (23%), Positives = 186/468 (39%), Gaps = 42/468 (8%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF+MTD N++G + T N+D LA+EG + +AYT +P+C+P+R FT Y Sbjct: 24 NLLFIMTDEMKWNVMGVAGHPVVKTPNLDRLASEGTYYKTAYTVAPICSPSRRSFFTSRY 83 Query: 65 ANQSGPWTN--NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW---- 118 + G N + K GY T GK H + + W Sbjct: 84 THVHGVIDNSKQALANDGEVDLQTILKHQGYRTAISGKLHF--YPEWHDWGFDEFWARSS 141 Query: 119 -------DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 + + + S+ DL + + ++++A+ Sbjct: 142 EGPNRLETYRQYMVAKHGDDAFKPIKGSVTYPKDPLGHDLGRYRFGKEDFETYWLTDKAL 201 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 D+L + +PF + +SY+EPH P+ Y Y + A + Sbjct: 202 DYL--ARKEKKPFFLFLSYNEPHSPYMVTEPYASMYDPKTLPVPVIPASAKAERKVALEK 259 Query: 232 WAQAMPSPVGDDGLYHH---PLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE 287 + + DD Y VDD +GRV++ L +NT V++T+DHG Sbjct: 260 KIKGKSRHLIDDEQMMRDLTAQYLGHVSNVDDNVGRVLSYLDSSGLADNTIVVFTADHGN 319 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGE--------RRQVDTPVSHIDLLPTMMALA 339 M+G H K M++ +RIPLIIR+ + R V+ V ID++PT++ + Sbjct: 320 MLGDHGKWFK-GVMHEGSSRIPLIIRAGKHTRYAKVMNRGRVVEQVVESIDVMPTLLEML 378 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN----L 395 DI+ P + GE++L++ + + D ++ DFKL++ Sbjct: 379 DIKAPRGMQGESLLSLTAGEAKNWKNRAFSQRSD--------FMFIEGDFKLIMPAKAGK 430 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFR 443 ELY+ NDP E HNL + M ++ + P R Sbjct: 431 KGKLELYNLANDPLENHNLAGMTEYQAKVKSMQQSIQVWQADKPAPIR 478 >UniRef50_B5CWC2 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC2_9BACE Length = 515 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. 
Identities = 107/449 (23%), Positives = 185/449 (41%), Gaps = 35/449 (7%) Query: 2 KRP-NFLFVMTDTQATNMVGCY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 ++P N LF+ D + VG T N+D LAA G+ F SAY +PV +RA L Sbjct: 23 EKPKNVLFIAVDDL-NDWVGFLKGHPNTRTPNMDRLAAMGMVFESAYCAAPVSNASRAAL 81 Query: 60 FTGIYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 +G + +G + N + + T+ +YF + GY++ GK G Sbjct: 82 LSGFRTSTTGVYGNAEFMRESPVLKDAVTLPKYFSNHGYYSMARGKIFH---QPMGPWGD 138 Query: 115 PPEWDADYWFDGANYLSELTEK----EISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 P WD+ G + + + G V D +DET T + + A Sbjct: 139 PQSWDSQENLGGLSLNPPRQKGKQANGLEKQTTGGAVVLDWAGVDVDETKTNDYLNAQWA 198 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 L + D+PF M PH P+ P +Y +++ +L ++ + K Sbjct: 199 AQEL--MKKHDKPFFMACGIFRPHLPWYVPQKYFDRFKLEDIQLPKQDPMETMEKLSPRA 256 Query: 231 LWAQAMPSPVGDD--------GLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIY 281 L P + Y AC + DD IG++++AL +R+NT V++ Sbjct: 257 LSMTGYNKPEHEFNILKKYGMEKEAVRAYLACISYADDCIGQIVDALEKSPERDNTIVVF 316 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALA 339 DHG +G + + +++D +P+II +P PVS +DL PT+++LA Sbjct: 317 WGDHGWHLGEK-MRYRKFSLWDRSCHVPMIIVAPGVTKPGSVCKQPVSLLDLYPTLVSLA 375 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 + + G +I + + + ++ ++ S+ Sbjct: 376 GLPANPLNEGNDITPLLQNPNAHWTKPAITTLAQNE------HSICDGRYRYIIYRDGSE 429 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 ELYD ++DP E NL D ++ADV++ + Sbjct: 430 ELYDHKHDPLEWKNLAADKKYADVKAHLR 458 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. 
Identities = 110/473 (23%), Positives = 184/473 (38%), Gaps = 59/473 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PNF+ ++TD Q + C+ K ++T ID +AAEG R S Y +PVCTP+RAGL TG Sbjct: 570 KPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRAGLMTG 629 Query: 63 IYA--------NQSGPWTNNVA--PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 Y + G + T+ K AGY T GKWHL F Sbjct: 630 CYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQPEF--- 686 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI----DETFTWAHRISN 168 P + D +F Y ++ + LQ + + + R++ Sbjct: 687 -LPTKQGFDEFFG-IPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDADFLTKRLTE 744 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +AV F+++ D+PF + + + PH P ++E AD EK ++ Sbjct: 745 QAVSFIER--NKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYATRA 802 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 + L+ +D +G++++AL + T V++TSD+G Sbjct: 803 N--------------------LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGP 842 Query: 288 -----MMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALAD 340 +L ++ R P ++R P Q D ++ +DLLPT LA Sbjct: 843 PKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAG 902 Query: 341 IEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 P + G++I + + + + +KL +N + Sbjct: 903 AAIPTDRVIDGKDIWPTLKGETQTPHDAFFY------HRGNQLAAVRSGKWKLHVNNGVA 956 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 +LYD ND E N+I+ +V K+ L D+ I R ++ P Sbjct: 957 KQLYDLENDLGEKVNVIE--TNPEVVKKLQHQLKDFAADIASNSRPAAFNANP 1007 Score = 326 bits (836), Expect = 1e-87, Method: Composition-based stats. 
Identities = 108/540 (20%), Positives = 173/540 (32%), Gaps = 150/540 (27%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + + D +GCY L+T NID LAAEG RF A++ S VCTP+R GL T Sbjct: 38 RPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYGLLT 97 Query: 62 GIYANQS----GPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y ++ G W T+ + N T+G+ FK+ GY T +GKWHL + Sbjct: 98 GQYPVRAMGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKEEPCDW 157 Query: 113 ECPPEW-----DADYWFD-----------GANYLSELTEKEISLWRNGLNSVEDLQANHI 156 + P D++F N S G V Sbjct: 158 QVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTPMFPE 217 Query: 157 DE---------------TFTWAH----RISNRAVDFLQQPARADEPFLMVVSYDEPHHPF 197 + ++ RAV ++ + + +EPF + + HHPF Sbjct: 218 EASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFLYFATPNIHHPF 275 Query: 198 TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDF 257 T + G LY Sbjct: 276 TPAPRF---------------------------------------KGTSQCGLYGDFVHE 296 Query: 258 VDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG-------------AHKLISKGAAMYD 303 +D +G ++ +L +NT V++TSD+G M+ +L+ +++ Sbjct: 297 LDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWE 356 Query: 304 DITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKP--EILPGENILA-VKEP 358 R+PLI + P Q D +S +DL T AL + E P E N+L + + Sbjct: 357 GGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALTEQEMPSSEQKDSINMLPALLDD 416 Query: 359 RGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV--------------LNLFTSD----- 399 + + + + + Sbjct: 417 PNEPLRTELVLAPRQ-----PRNLAIRKGKWLYIGARGSGGFNGSKPQHHAWGGPAAVQF 471 Query: 400 --------------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +LYD ND ++ N+ +V +M L Y K Sbjct: 472 SGQKNSDIVNGRIKKNAPPAQLYDLENDRSQTTNVF--REHPEVVEEMKAMLESYRPKQG 529 >UniRef50_Q0TUK6 Arylsulfatase n=9 Tax=Bacteria RepID=SULF_CLOP1 Length = 481 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. 
Identities = 104/470 (22%), Positives = 178/470 (37%), Gaps = 31/470 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +M D + +G + + T N+D +A EG F +AYT P C +RA + TG Sbjct: 2 KPNIVLIMVDQMRGDCLGVNGNEFIETPNLDMMATEGYNFENAYTAVPSCIASRASILTG 61 Query: 63 IYANQSGPWTNNVAPGKN-ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 + G N +T+ F AGYHT IGK H+ D Sbjct: 62 MSQKSHGRVGYEDGVSWNYENTIASEFSKAGYHTQCIGKMHVYPERNLCGFHNIMLHDGY 121 Query: 122 YWF----------------DGANYLSELTEKEISLWRNGLNSVEDL-QANHIDETFTWAH 164 F D + E + L GL+ + + +E + Sbjct: 122 LHFARNKEGKASTQIEQCDDYLKWFREKKGHNVDLIDIGLDCNSWVSRPWGYEENLHPTN 181 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-DLA 223 + N ++DFL++ +PF + +S+ PH P P Y + Y D + Sbjct: 182 WVVNESIDFLRR-KDPSKPFFLKMSFVRPHSPLDPPKFYFDMYKDEDLPEPLMGDWANKE 240 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 ++ + Y+ +D QIGR + AL+ NT ++ Sbjct: 241 DEENRGKDINCVKGIINKKALKRAKAAYYGSITHIDHQIGRFLIALSEYGELNNTIFLFV 300 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-----QGERRQVDTPVSHIDLLPTMMA 337 SDHG+MMG H + Y+ +R+P I P + + D + D++PT++ Sbjct: 301 SDHGDMMGDHN-WFRKGIPYEGSSRVPFFIYDPGNLLKGKKGKVFDEVLELRDIMPTLLD 359 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF- 396 A I P+ + G ++ + E R I + G VT K + Sbjct: 360 FAHISIPDSVEGLSLKNLIEERNSTWRD---YIHGEHSFGEDSNHYIVTKRDKFLWFSQR 416 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 ++ +D NDP E+ NLID + + + L+ ++ + + Sbjct: 417 GEEQYFDLENDPKELTNLIDSEEYKERIDYLRKILIKELEGREEGYTDGN 466 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. 
Identities = 100/464 (21%), Positives = 159/464 (34%), Gaps = 71/464 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + +M D + CY K + T +ID LAA ++F ++ +CTP RA + T Sbjct: 33 QPPNIVLIMADDLGYGDLACYGNKQVKTPHIDRLAASALKFTDFHSAGAMCTPTRAAMLT 92 Query: 62 GIYANQSGPW---------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y + G +++ TM K GY T GKWHL + Sbjct: 93 GQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWHLG----YQPP 148 Query: 113 ECPPEWDADYW----FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 P D + ++ + + W + E A +S Sbjct: 149 WLPTNQGFDLFRGLTSGDGDHHTHVDRSGNEDWWHNNEISM--------EKGYTADLLSK 200 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +V F++ A PF + V + H P+ P + QD A K Sbjct: 201 YSVAFME--ANRTRPFFLYVPHLAIHFPWQGPQ---------DPPHRKAGQDYHAGKWGI 249 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 P A + +D +G++++AL E NT VI+TSD+G Sbjct: 250 IPDPGNVSPHTT------------AMIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGG 297 Query: 288 MM----------GAHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMM 336 + L + A +Y+ R+P +I P D +DLLPT+ Sbjct: 298 YLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAGVTDQTAHSVDLLPTLA 357 Query: 337 ALADIEKPE-ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 A I G ++ + + R + D F R +KL L Sbjct: 358 QAAGISATNFQTDGLDLAPL-------WQTGRPLADRDLFWRMGNNRAVRRGQWKLCL-K 409 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELY D E N ++ M AL ++ + Sbjct: 410 NNRSELYHLETDLGEQQNRAA--EHPEIVKSMSQALKEWEADVD 451 >UniRef50_A6DMZ1 Sulfatase n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMZ1_9BACT Length = 514 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. 
Identities = 108/494 (21%), Positives = 190/494 (38%), Gaps = 66/494 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSG---KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 RPN +++ +D AT +G Y G T NID LA EG+ F AY + +C P+RA L Sbjct: 23 RPNIVWMFSDDHATQAIGAYGGLLESYNLTPNIDRLAKEGMIFKRAYVGNSICAPSRATL 82 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 TG +++ G N N + + GY T IGK HL G Sbjct: 83 LTGKHSHLHGKVDNAKGFDHNQQQFQKLLQKGGYQTAMIGKIHLPGK----------MQG 132 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 DYW + ++ + + I+ RA++++ Sbjct: 133 FDYWEVLPGQGKYWDPEFVTETGKTIYP-----------GEHSSDVITRRALNWMNNERD 181 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP------------- 226 +PF+++V + PH + + +K++ + + DD + Sbjct: 182 KSKPFMLMVHFKAPHRSWQPTTRWKKKFSTMTFPEPDTLFDDYQGRGTAAKYQDMNIEHS 241 Query: 227 ---------------EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP 271 E + A + V + Y AC VD+ IG++++ L Sbjct: 242 MNMVGDLKSNQSPRKEFLKKNALTGKALVKWKYQMYMRDYLACIAGVDENIGKILDQLAE 301 Query: 272 EQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSH 328 + NT V+Y+SD G +G H K MY++ R PL+ R P + + + V + Sbjct: 302 SGLDKNTIVMYSSDQGFYLGEHGWFDK-RFMYEESYRTPLLARWPGVIKAKTRNEDLVQN 360 Query: 329 IDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR---CWV 385 ID T + LA + P + GE+++ + + + + + G+ V Sbjct: 361 IDFAETFLDLAGLPIPADMQGESLVPLMKGKTPDDWRTHLYYHYYEYPGWHSVHRHEGVS 420 Query: 386 TDDFKLVLNL------FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +KL+ E YD + DP+EM + + +A +K+ L + +K Sbjct: 421 DKRYKLMRFYGKDVPNGEEWEFYDLKTDPSEMKSEYANPEYASTIAKLKKELANLREKYE 480 Query: 440 DPFRSYQWSLRPWR 453 Q+ + PW+ Sbjct: 481 VKDIP-QYDINPWK 493 >UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W906_ENT38 Length = 501 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. 
Identities = 118/466 (25%), Positives = 196/466 (42%), Gaps = 46/466 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + ++ D +G Y + T NID LA EG+RF+ Y +P+C+P+RAGL TG Sbjct: 35 KPNVVIILADDLGYGDLGIYGHPIVKTPNIDKLAQEGVRFSQYYAPAPLCSPSRAGLLTG 94 Query: 63 IYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 ++G N+A G+N T+ Y KD GY T +GKWHL+ + Sbjct: 95 RTPFRTGIRSWIPTNKNIALGRNEKTIASYLKDQGYDTAMMGKWHLNAGVDRHDQPQAED 154 Query: 118 WDADYW-FDGANYLSELTEKEISLWRNGLNSVEDLQANH---IDETFTWAHRISNRAVDF 173 DY + A +++ +K RNG+ N +S A+++ Sbjct: 155 AGFDYTLVNAAGFVTSDLDKAKERPRNGVVYPNGFYRNGKALGTVNQISGEFVSQEAINW 214 Query: 174 LQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 L + ++PF M V++ E H P P +YLE Y + E+ + Sbjct: 215 LND-KKDNKPFFMYVAFTEVHTPLASPKKYLEIYKN--------------YMSEYEKQHP 259 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE----- 287 + D Y+A ++D+Q+G+V+ + +NT +I+TSD+G Sbjct: 260 DMFYADWVDKPYRGPGEYYANISYMDEQVGKVLAKIKSMGQEDNTIIIFTSDNGPVTREA 319 Query: 288 -------MMGA-HKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMA 337 M G L + +++ R+P II+ Q DTPVS +D+LPT+ Sbjct: 320 RKWYELNMAGETDGLRGRKDNLWEGGIRVPAIIKYGQHLHAGTVTDTPVSGLDILPTLAE 379 Query: 338 LA--DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR-CWVTDDFKLVLN 394 L ++ I+ GE+I+ V E + + + F D+K++ + Sbjct: 380 LTHFNLPTDRIIDGESIVPVLEGQTMNRQQPLLFAIDMPFQDDPTDMWALRDGDWKMIFD 439 Query: 395 LFT-SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 + LY+ + D E N + + KM AL Y I Sbjct: 440 RNSKPKYLYNLKLDRGETMNQLGKQ--PVLEQKMIAALARYQSSIE 483 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. 
Identities = 104/491 (21%), Positives = 177/491 (36%), Gaps = 79/491 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN + +++D Q K + T N+D+LA G+R Y +PVC+P+RAGL T Sbjct: 37 KRPNIIVIVSDDQGYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMT 96 Query: 62 GIYANQSGPWTNNV--------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 G Y + G N V N + + + AGY+T +GKWHL D G Sbjct: 97 GRYQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGLQD----GC 152 Query: 114 CPPEWDADYWFD----GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 P E D +F G +Y E+ ++ +E Sbjct: 153 RPYERGFDEFFGIITGGHDYFVNHPEERAVGDQSYKARIERNGPVGEAVPGYLTDAFGAD 212 Query: 170 AVDFLQQP--ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 AV +++ R D+P + ++++ PH P P + ++ + + Sbjct: 213 AVRIIRESHTKRPDQPLFLYLAFNAPHTPTQAPKDLVD-------TMPATLES------- 258 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG 286 Y A +D +G+V AL E +T++++ SD+G Sbjct: 259 ------------------KDRRTYAAQITSMDASVGKVRAALKENGMEKDTFIVFFSDNG 300 Query: 287 EMMGAHK------LISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 H L ++Y+ R+P P + PV+ +D+ T AL Sbjct: 301 GA--NHPYYDNTPLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSLDVFATACAL 358 Query: 339 ADIEK--PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A + L ++L V E E FG D KLV+ Sbjct: 359 AGTKPETSHPLDSVDMLPVLEGNARQPTHATLFWEFPGFGA-----AVADRDLKLVVPKK 413 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDA 456 S +L+D D E +L + + +++ L ++ + P W + Sbjct: 414 GSPQLFDLAVDIGEKSDLAA--QNPEKVARLSTLLSEWHAQNARPL---------WGPGS 462 Query: 457 RPRWMGAFRPR 467 + + P+ Sbjct: 463 QTQLHAGQTPK 473 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. 
Identities = 103/493 (20%), Positives = 174/493 (35%), Gaps = 70/493 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+TD Q G K + T NID+LAAE Y +P C+P R+ L TG Sbjct: 31 RPNVVMVITDDQGYGDCGFTGNKVVQTPNIDALAAESSVLTD-YHVAPTCSPTRSALMTG 89 Query: 63 IYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 + N++G W N T G F DAGY T GKWHL + + + Sbjct: 90 HWTNRTGVWHTISGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHLGDNYPYRAEDNGFTEV 149 Query: 120 ADYWFDGANYLSELTEKEISLWRN-GLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 + G W N + + F+++ Sbjct: 150 YRHGGGGVGQT-------PDFWDNAYFDGSYFHNGKAVKAEGFCTDVFFKEGNRFIRECV 202 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 ADEPF ++ + PH P P +Y++ Y + + Sbjct: 203 EADEPFFAYIATNAPHGPLHAPQKYIDMYPEMNDNV------------------------ 238 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG----AHK 293 +F VDD +G+ L +NT I+T+D+G G Sbjct: 239 ----------ATFFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTDNGTAGGASVYNAG 288 Query: 294 LISKGAAMYDDITRIPLIIRSP--QGERRQV-DTPVSHIDLLPTMMALADIEKPEIL--P 348 + K + Y+ R+P ++ P + + +T +D++PT++ + +E PE + Sbjct: 289 MRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVPTLLDMCGVEAPESVKFD 348 Query: 349 GENILAVKEPRGVMVEFNRYEIEHDSF---GGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 G +I+++ + +R I + D ++L+ ELY+ Sbjct: 349 GTSIVSLLKDEVDSSFNDRMLITDSQRVIDPIKWRQSSVMQDKWRLI----NGKELYNIA 404 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS-----LRPWRKDARPRW 460 NDP + +N+ D + + M + ++ F P W Sbjct: 405 NDPGQENNIAGD--HPEQVASMRAFYEAWWAELEPTFSQTTEMTVGHPDHPVVTFTAHDW 462 Query: 461 MGAFRPRPQDGYS 473 +G P Q Sbjct: 463 IGQAPPWNQSAIR 475 >UniRef50_C7MF96 Arylsulfatase A family protein n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MF96_BRAFD Length = 483 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. 
Identities = 109/501 (21%), Positives = 177/501 (35%), Gaps = 70/501 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M RPN +F+M+D A + + Y + T ++D LA EG R ++ Y + +CTP+RA + Sbjct: 1 MTRPNIVFIMSDDHAAHSISAYGSRVNTTPHMDRLADEGARMDATYCTNAICTPSRASIL 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G Y++ + + + T ++ GY T GKWHL + Sbjct: 61 SGTYSHINRAPSIYSEFDYRVRTFPEVLQECGYQTALYGKWHLGRSERSLP------RGF 114 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D + + ++ V A ++ +++DF+ + Sbjct: 115 DDFRIYPDQGDY------------IDPVMIGPAGEEQIPGYATDIVTRQSLDFIDRR-DP 161 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 ++PF ++V + PH P+ Y Y E DD A + E R A + Sbjct: 162 EQPFCLLVHHKAPHRPWIPHPRYEHLYEAGTIPEPETMWDDHATRSEVVREVAMNLDDLR 221 Query: 241 GDDGL------------------------YHHPLYFACNDFVDDQIGRVINALTPEQR-E 275 D + Y C VDD IG +++ L E E Sbjct: 222 PTDYKDELPAELEGETEEARRARASWKYQRYMRDYLRCVQAVDDSIGEILDHLDQEGLGE 281 Query: 276 NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLP 333 NT V+YTSD G +G H K M D+ +P+++R P +V VS++D Sbjct: 282 NTLVVYTSDQGFFLGDHGWFDK-RLMLDESLTMPMLLRWPAQIPAGSRVSDIVSNVDFAA 340 Query: 334 TMMALAD---IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF----GGFIPVRCWVT 386 T++ A + P+ G + L V + T Sbjct: 341 TLLEAAGRSASDLPDQ-QGRSFLPQLRGEEVPDWRQAVYYRYWEHDDPEHHAPAHYGVRT 399 Query: 387 DDFKLVLNLFT--------------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 K + DELYD DP EM N+ D +A V +M Sbjct: 400 PTHKYIHYYNDGLGSPGSSTRIMPAEDELYDLATDPQEMRNVAHDPAYAGVLEEMKALTA 459 Query: 433 DYMDKIRD-PFRSYQWSLRPW 452 + D P+ W Sbjct: 460 QLQAEYGDAPYEGPDTPRLEW 480 >UniRef50_D0DCV9 Choline-sulfatase n=2 Tax=Citreicella sp. SE45 RepID=D0DCV9_9RHOB Length = 474 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. 
Identities = 110/467 (23%), Positives = 195/467 (41%), Gaps = 41/467 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN +F++TD Q + +G ++T N+D L EG F Y SP C+P+RA LF+ Sbjct: 3 KHPNIVFIITDQQRIDTIGALGCPWMDTPNLDRLVNEGTAFEQMYVTSPSCSPSRASLFS 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWH-------LDGHDYFGTGEC 114 G Y + +G + N+ + + K +GY T +GK H + T Sbjct: 63 GTYPHTNGVFRNDER---WVYSWVGLLKQSGYRTVNVGKMHTWPVEGAFGYDERHVTENK 119 Query: 115 PP----------EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 WD +W G + +T++E+ + L E + Sbjct: 120 DRAHPNLPFYLDNWDKAFWARGVEKPTRVTQREMPDYAERLGC----YVWDAPEDLHADN 175 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + A +L + + DEPF + + PH P+ EYL KY +L E + D Sbjct: 176 FVPEMACMWLDRY-KGDEPFFLQIGIPGPHPPYDPTAEYLAKYEGRD-DLPEPIRYDFDT 233 Query: 225 KPEHHRLWAQ-----------AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 +P R + +P P + Y+A +D Q+G ++ AL Sbjct: 234 QPGPLRELRRQHLDNDHDAVVHLPDPTAEQMRLQRAHYYANVSMIDTQVGNILAALERRG 293 Query: 274 -RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLL 332 ++T +++TSDHG+ + H K M++ R+P I+ + D V+ D Sbjct: 294 VLDDTIIVFTSDHGDCLNDHGHSQKW-NMFEATVRVPAIVWGRGIPAMRRDELVALFDWG 352 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF-GGFIPVRCWVTDDFKL 391 PT++ A + P + +++ + + + E +D+ G + D+KL Sbjct: 353 PTILEWAGVTPPAWMEAQSLNPLMAGEEQLRDRVFAEHANDAILTGTSYMTMIRRGDWKL 412 Query: 392 VLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 V + +S+ +L+D +DP E NL DD A+ + + +L + + Sbjct: 413 VHFVDSSEGQLFDLASDPGERSNLWDDPAQAERKLSLIHDILRWRIE 459 >UniRef50_A6DJ72 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=3 Tax=Bacteria RepID=A6DJ72_9BACT Length = 495 Score = 412 bits (1061), Expect = e-113, Method: Composition-based stats. 
Identities = 107/470 (22%), Positives = 191/470 (40%), Gaps = 51/470 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKP--LNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGL 59 +RPN +F++TD Q + VG + ++T +I+ +AAEG++F + Y + +C+P+RA Sbjct: 25 QRPNVVFILTDDQRGDAVGYHKKPLLGIDTPSINKIAAEGVQFENMYCTTSLCSPSRAAF 84 Query: 60 FTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 +G Y + + N ++ + + GY T +IGKWH+ D Sbjct: 85 LSGTYTHTHKVYDNFTDYPHDLKSFPLLLQQEGYTTGWIGKWHMGEEDDSK------RPG 138 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 DYW ++ ++ +AH++++ A+DFL + Sbjct: 139 FDYWVTHKGQGKYW------------DTTFNVNGERKKVPGYYAHKVTDMAIDFLNK-VD 185 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK-------AQDDLANKPEHHRLW 232 +PF + + + PH PF +Y Y D + + + P H ++ Sbjct: 186 KSKPFALCLGHKAPHGPFIPEAKYDSIYNDTPVPYPDSSWKLGDKPKWIVDRLPTWHGIY 245 Query: 233 AQAMPSPVG---------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 D + Y A + VDD +GR+ + L +NT +I+T Sbjct: 246 GPLYGFRKDFPNDKASAIVDFEHFVRSYTATINSVDDSVGRIYDHLEEMGILDNTILIFT 305 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALAD 340 SD+G ++G H +I K M++ IPL +R P+ + V ID+ PT+M L Sbjct: 306 SDNGFLLGEHGMIDK-RTMHEASVSIPLTVRFPKKIKGGTVIKEQVLSIDMAPTIMELTV 364 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS---FGGFIPVRCWVTDDFKLVLNLFT 397 +K G + + + + E++ F VR +K V Sbjct: 365 GKKMPSAQGLSWATLLDDTKDAEWRKTWLYEYNYEVQFPYTPNVRGIRHGKWKYVAYPHG 424 Query: 398 S-------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +ELY+ DP+E NL +D +AD++S + L + Sbjct: 425 DGGKLRHMEELYNMERDPSESSNLAEDPAYADIKSMLAMELAKTLKSTGA 474 >UniRef50_Q1IH24 Choline sulfatase n=29 Tax=cellular organisms RepID=Q1IH24_PSEE4 Length = 505 Score = 412 bits (1061), Expect = e-113, Method: Composition-based stats. 
Identities = 112/446 (25%), Positives = 180/446 (40%), Gaps = 16/446 (3%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN LF+M D A ++ Y+ P+ N+ LA + + F+SAY SP+C P+R L Sbjct: 1 MKQPNILFIMADQMAAPLLPIYTPSPIKMPNLARLAEQAVVFDSAYCNSPLCAPSRFTLV 60 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G ++ G + N +I T Y + GY T GK H G D E + Sbjct: 61 SGQLPSRIGAYDNAADFPADIPTYAHYLRRLGYRTALSGKMHFCGPDQLHGYE--ERLTS 118 Query: 121 DYWFDGANYLSELT-EKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D + + + W + ++SV + + +A +L R Sbjct: 119 DIYPADYGWAVNWDAPDQRPSWYHNMSSVLQAGPCVRTNQLDFDEEVVFKARQYLYDHVR 178 Query: 180 AD--EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW--AQA 235 D PF + VS PH P+T P Y + Y L P RL Sbjct: 179 EDHGRPFCLTVSMTHPHDPYTIPKRYWDLYEAVDIPLPRDVIAQSQQDPHSQRLLKVYDL 238 Query: 236 MPSPVG-DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHK 293 P+ D YF ++DD IG ++ L ++T ++++ DHG+M+G Sbjct: 239 WDKPLPVDKIRDARRAYFGACSYIDDNIGLLVQTLEDCGLADDTLIVFSGDHGDMLGERG 298 Query: 294 LISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKPE--ILPGE 350 L K ++ R+PL+I +P+ ++ VS DLLPT++ LA + L G Sbjct: 299 LWYK-MHWFEMSARVPLLIHAPKRFAPARISASVSTCDLLPTLVELAGGAVDKDLHLDGR 357 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 ++L + +G + E+ + G P+ +K V + LYD DP+E Sbjct: 358 SLLGHLQGQGG---HDEVIGEYMAEGTVGPLMMIRRGAYKFVYSEDDPCLLYDLSRDPHE 414 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMD 436 NL + D D Sbjct: 415 RENLTGSPDHQVLLQAFVDEAKQRWD 440 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 412 bits (1061), Expect = e-113, Method: Composition-based stats. 
Identities = 94/446 (21%), Positives = 161/446 (36%), Gaps = 55/446 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN + ++TD Q + L T + D + EG + Y SPVC P RA + T Sbjct: 38 RPPNVIVILTDDQGWGDFSHTGNEYLKTPHFDKMTEEGALLDQFYV-SPVCAPTRASVLT 96 Query: 62 GIYANQSG---PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y ++G T+ FK+AGY T GKWH E P Sbjct: 97 GRYHLRTGVSFVTRGRENMRSEEVTIAEVFKEAGYATGCFGKWHNGA----HYPENPQGQ 152 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D + + W N ++ + + + + F+ A Sbjct: 153 GFDTFLGFTSG----------HWSNYFDTELEYNGEMKSTKGFITDVLMDETIQFID--A 200 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 DEPFL V + PH P+ P +Y +KY D + Sbjct: 201 HKDEPFLAFVPLNAPHTPYQVPDKYFDKYKDIDF-------------------------- 234 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG--EMMGAHKLI 295 + + +DD +G+++ L ++ ENT V++ SD+G Sbjct: 235 GYDKKQNKKIATIYGMCENIDDNLGKLMKHLKDQELEENTIVVFLSDNGPQGARYNGPWR 294 Query: 296 SKGAAMYDDITRIPLIIRSPQGERRQV-DTPVSHIDLLPTMMALADIEKPEIL--PGENI 352 ++++ T +P I+ + +HIDL+PT+M LA IEKPE + G ++ Sbjct: 295 GGKTSVHEGGTLVPCAIQWKGHIPNSSKSSLTAHIDLMPTLMGLAGIEKPENIQFDGIDL 354 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 + +F D++ LY+ + DP+E + Sbjct: 355 SNYLMGTSDDLGERNLYTHMTNFEITADRGAVRQGDYRFTTEYGDVG-LYNLKEDPSEEN 413 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKI 438 NL D + + ++ A ++ + Sbjct: 414 NLKD--QLPEKTQELKTAFENWYKDV 437 >UniRef50_B6HPN7 Pc22g01020 protein n=15 Tax=Eukaryota RepID=B6HPN7_PENCW Length = 589 Score = 412 bits (1060), Expect = e-113, Method: Composition-based stats. 
Identities = 102/427 (23%), Positives = 180/427 (42%), Gaps = 16/427 (3%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 K+PN L++M D A ++ + P+ T N+D LA G+ F+SAY SP+C P+R + Sbjct: 4 KKPNILYIMADQMAAPLLSLHDKNSPIKTPNLDRLAEGGVVFDSAYCNSPLCAPSRFVMV 63 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 +G ++ G + N + T Y + GYHT GK H G D E + Sbjct: 64 SGQLPSKIGAYDNAADLPADTPTYAHYLRREGYHTALAGKMHFCGPDQLHGYEQ--RLTS 121 Query: 121 DYWFDGANYLSELTEKEIS-LWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D + + E +I W + ++SV + + + ++ +L R Sbjct: 122 DIYPGDYGWSVNWDEPDIRADWYHNMSSVMEAGPVVRTNQLDFDEEVIYKSTQYLYDHVR 181 Query: 180 A--DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ--- 234 ++PF + VS PH P+ E+ + Y D L + + H + + Sbjct: 182 QRNEQPFCLTVSMTHPHDPYAMTKEFWDLYNDVEIPLPKNGAIPHDQQDAHSQRVLKCID 241 Query: 235 -AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 + Y+A +VD +G+++ L ++T +++T DHG+M+G Sbjct: 242 LFNKEMPDERIRAARRAYYAACTYVDTNVGKLLRVLENTGMADDTIIVFTGDHGDMLGER 301 Query: 293 KLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPE--ILPG 349 L K +++ R+P ++ +P+ ++V VS +DLLPT LA + L G Sbjct: 302 GLWYK-MTWFENSARVPFLVHAPKHFAPKRVSENVSTMDLLPTFAELAGAKLISELPLDG 360 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +++ G + + E+ G P+ +K + + LYD NDP Sbjct: 361 VSLVPYLTG-GEGLRTDTVYGEYMGEGTQAPLMMIRRGRWKFIYSTIDPPMLYDLVNDPE 419 Query: 410 EMHNLID 416 E NL Sbjct: 420 ERTNLAA 426 >UniRef50_Q46P27 Sulfatase n=3 Tax=Proteobacteria RepID=Q46P27_RALEJ Length = 482 Score = 412 bits (1060), Expect = e-113, Method: Composition-based stats. 
Identities = 108/438 (24%), Positives = 183/438 (41%), Gaps = 14/438 (3%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + +M+D M+GC + T N+D+LAA G+RF+SAYT SP+C PARA TG Sbjct: 5 NVVVIMSDEHDPRMMGCSGHPFVKTPNLDALAARGVRFSSAYTPSPICVPARAAFATGRR 64 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWF 124 +Q W N + G +D G IGK H + + E + Sbjct: 65 VHQVRLWDNAMPYTGEQRGWGHVLQDRGIRVESIGKLHYRNEEDPAGFDA--EHLPMHVV 122 Query: 125 DGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA-RADEP 183 G + NG + + + ++ RAV +LQ+ A R + Sbjct: 123 GGHGMVWASIRNPFRPRENGPRMLGEHIGPGESSYTQYDRAVTQRAVQWLQEAAQRQEAG 182 Query: 184 FLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW----AQAMPSP 239 F++ V PH PF P E+ Y + + R + A Sbjct: 183 FVLYVGLVAPHFPFVVPEEFYSLYPTDGLPEPKLHPRTGYEQHPWVREYCDFMASERQFA 242 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRENTW-VIYTSDHGEMMGAHKLISKG 298 D+ L Y+ ++D +G+++ AL E+T ++YTSDHG+ +GA + K Sbjct: 243 DADERLRAFAAYYGLCTWLDHNVGQILGALRDNGLEDTTHIVYTSDHGDNLGARGVWGK- 301 Query: 299 AAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILP---GENILAV 355 + +Y++ ++P+++ P +TPV +DL PT++ A ++ + G ++ + Sbjct: 302 STLYEESVKVPMLLAGPIVTPGVCNTPVDLLDLFPTILQGAGVDPATEIDERPGRSLFEL 361 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLI 415 E+ + G +K + EL+D +DP E+ +L Sbjct: 362 AR--SAPEPDRVILSEYHAAGSNAGGFMLRKGRWKYHHYVGFRPELFDLESDPEELTDLA 419 Query: 416 DDIRFADVRSKMHDALLD 433 D +A V + MH+ALL Sbjct: 420 GDPAYAPVLASMHEALLA 437 >UniRef50_C6IGG0 Iduronate 2-sulfatase n=2 Tax=Bacteroides RepID=C6IGG0_9BACE Length = 482 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. 
Identities = 111/453 (24%), Positives = 180/453 (39%), Gaps = 31/453 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + R N LF+M D +GCY + + T N+D LA+ G+ F +AY PV +RA L Sbjct: 32 VSRMNVLFLMADDMRPE-LGCYGVEAVKTPNMDRLASSGVLFQNAYCNVPVSGASRASLL 90 Query: 61 TGIYANQSGPWTNNVAPG----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 TG+Y + + N A + +F GYHT GK D+ + PP Sbjct: 91 TGVYPHYPDRFVNFSAYASKDCPEAIPLSGWFTKNGYHTVSDGKVFHHMSDHAASWSEPP 150 Query: 117 EW----DAD-YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 D YW + + + + ++ + +T +++ RA+ Sbjct: 151 YRNHPDGYDVYWAEYNKWELWMNSESGKTINPKTMRGPFCESADVPDTAYDDGKLAERAI 210 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 L++ ++PF + + +PH PF P +Y + Y L PE R Sbjct: 211 RDLRRMKEMNKPFFLACGFWKPHLPFNAPKKYWDLYKREEIPLAPNRFRP-EGLPEQVRN 269 Query: 232 ------WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSD 284 +A+ + D Y+AC +VD QIG+V++AL ENT V+ D Sbjct: 270 SSEIYAYARVSDTSDADFQREVKHGYYACLSYVDAQIGKVLDALDELGLAENTIVVLLGD 329 Query: 285 HGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKP 344 HG +G H + K M D T +PLIIR P ++ + + V +DL PT+ L I +P Sbjct: 330 HGWNLGEHDFVGKHNLM-DRSTHVPLIIRVPGRKKGKTRSMVEFVDLYPTLCELCQIPQP 388 Query: 345 -EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS----D 399 E L G++ V + Y V F + Sbjct: 389 AEQLDGQSFAKVFSNLKAKTKDEVYIQWEGGDNA-------VDQRFSYAEWMKGDVKKAS 441 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 L+D R D E N +++ ++ + + Sbjct: 442 MLFDHRIDKEENKNRVNEKKYKSKVESLSSFIK 474 >UniRef50_Q01ZJ7 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01ZJ7_SOLUE Length = 516 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. 
Identities = 132/490 (26%), Positives = 207/490 (42%), Gaps = 30/490 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +MTD Q + T NID LA++G+ F +YT S VC PARA L +G Sbjct: 30 RPNILHIMTDQQQWATI--AGRSGCRTPNIDRLASQGMLFERSYTPSAVCCPARAMLLSG 87 Query: 63 IYANQSGPWTNNVAPG-------KNISTMGRYFKDAGYHTCYIGKWH---------LDGH 106 Y +G + +P ++ + ++AGY Y GKWH H Sbjct: 88 AYHWHNGVYNQVHSPPSVHRDMNADVVLYSQRLREAGYRLGYTGKWHASYLRTPLDFGFH 147 Query: 107 DYFGTGECPPE--WDADYWFDGANYLSEL--TEKEISLWRNGLNSVEDLQANHIDETFTW 162 + G C PE D D ++E T ++ + G E T Sbjct: 148 EIAGVAGCDPELLKKIDLNPDRVPRITEPLRTTQQRMMRWPGSEPFVMWGYREGPEESTP 207 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 +RI+ A +++ A+ ++P+ + V + EPH P+ +YL++Y + + D Sbjct: 208 EYRIAEMASRMMKRFAKGEQPWHLEVHFVEPHDPYMPLKQYLDRYDPRSIPVPKSFADTF 267 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 A KP HR ++ DD Y+A + +D QIGRV+ AL + T V + Sbjct: 268 AGKPGLHRRESETWGKVTEDDVRQSRAHYYAYAEQLDAQIGRVLKALDETGQADRTLVAF 327 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALA 339 T+DHG+M+GAH++ KG Y++ R+P+I+R P + V DL T +A A Sbjct: 328 TADHGDMVGAHRMWIKGWLPYEECYRVPMIVRWPGHVQAGSKSSKLVQTHDLGHTYLAAA 387 Query: 340 DIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD 399 G ++ + + + R +TD FK V N F D Sbjct: 388 GARSLPFPDGASLAPLFADPRRKDWRDDILCAYYGGEYLYTQRIAITDRFKYVFNGFDYD 447 Query: 400 ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR 459 E+YD DP+EM N++ D +A M + + M + DP + P Sbjct: 448 EMYDLERDPDEMRNVVADSEYARFTGDMQARMYELMARFHDP-----YGDSPEGTKGDRY 502 Query: 460 WMGAFRPRPQ 469 + PR + Sbjct: 503 CAARYLPRGK 512 >UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepID=B8KHZ9_9GAMM Length = 483 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. 
Identities = 118/480 (24%), Positives = 199/480 (41%), Gaps = 59/480 (12%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D G Y + + T +ID LAAEG+RF Y S +C+P+RAGL TG Sbjct: 30 NVLLIYVDDLGYGDTGAYGHRVVKTPHIDRLAAEGMRFTQFYAPSALCSPSRAGLLTGRT 89 Query: 65 ANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 ++G + VA G N +T+ K GY T IGKWHL+G + P ++ Sbjct: 90 PYRTGVESWIPDDSQVALGHNETTLADLAKARGYRTAVIGKWHLNGGLHMQGTPQPRDFG 149 Query: 120 ADYWFDGANYLSELTEKE-ISLWRNGLNSVEDLQANH---IDETFTWAHRISNRAVDFLQ 175 D+ + A ++ + +E L R G +++ N+ A +S+ A+D+L Sbjct: 150 FDHQYGLAAWVKNASVRESKELPRRGAMFPDNMYRNNEAVGPTKKYSAELVSDEAIDWLS 209 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD----FYYELGEKAQDDLANKPEHHRL 231 PF ++++Y E H P P EYL +Y D + D N+P R Sbjct: 210 GAKD---PFFLLLTYSEVHTPIASPPEYLAQYQDYLTQEARDNPLLFYFDWRNRPWRGRG 266 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG---- 286 Y+A ++D Q+GRVI L + ++T +I++SD+G Sbjct: 267 ------------------EYYANVSYMDAQLGRVIEYLRGKGVLDDTLIIFSSDNGPVTD 308 Query: 287 --------EMMGA-HKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTM 335 M G L K +++ R+P IIR P+ R P + +D+ PT+ Sbjct: 309 AALTPWELGMAGETAGLRGKKRFLFEGGLRVPGIIRYPERIEAGRVESRPATALDVFPTL 368 Query: 336 MALADIEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL 393 + L GE++ + + + Y G ++KL+L Sbjct: 369 AQWLGVAVDSSVPLDGESLWPLIDGGDFQRQQAFYWSIPTPDGMEF---AVRDGNWKLIL 425 Query: 394 NLFTSDE-LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR-DPFRSYQWSLRP 451 + + L+D +D E++NL++ VR ++ + DP + + + Sbjct: 426 DADERPQYLFDLASDWYEVNNLLETE--PAVRERLLQIYAARRAAVESDPLALARKTHKR 483 >UniRef50_C6XTA2 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XTA2_PEDHD Length = 535 Score = 411 bits (1058), Expect = e-113, Method: Composition-based stats. 
Identities = 105/511 (20%), Positives = 178/511 (34%), Gaps = 75/511 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D ++GCY + + T NID LA G F S Y VC P RA + T Sbjct: 29 EKPNVLFIAVDDLKP-ILGCYGDRLIKTPNIDRLAKMGTVFKSNYCQQAVCGPTRASIMT 87 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + W +I T+ +YF GY T IGK + P W Sbjct: 88 GMRPDITKVWDLKTKMRDMNPDILTIPQYFASQGYSTQAIGKIY--DPRCVDEDLDKPSW 145 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN------------------------ 154 ++ Y + T + + + G ++ Sbjct: 146 TVPHYRTDKKYYAASTGQPVLNYYQGKEIKSLVEKRRAEAKGKIITDQELLATIKPSVEC 205 Query: 155 -HIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 + + +A D L + +PF V + +PH PF P +Y + Y Sbjct: 206 VDVPDQAYIDGANILQAKDILTTLQKKSQPFFFAVGFAKPHLPFNAPKKYWDLYQREDMP 265 Query: 214 LGEKAQDDLANKPEHHRLWAQAMPSPVGDD------------------GLYHHPLYFACN 255 + + + + D Y+A Sbjct: 266 VAAFQEKSKNAVDVAYHNSGELRAYSDIPDLLSFTDQKSYGLTLPIAKQKELIHGYYAAV 325 Query: 256 DFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRS 314 +VD Q+G ++NAL +NT ++ DHG +G H L K + ++ TR PLI + Sbjct: 326 SYVDAQVGILLNALDSLGLSKNTVIVLWGDHGWHLGDHNLWCKHSD-FEQATRSPLIFSA 384 Query: 315 PQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS 374 P + + +D+ PT+ LA I P+ L G +++ + ++ S Sbjct: 385 PGIKSSATTSLSEFVDVFPTLCNLAGIPVPQHLEGTSLVPLMRNPASSIKEFAISQYPRS 444 Query: 375 FGGFIPVRC-----------WVTDDFKLVLNLFT-------------SDELYDRRNDPNE 410 R T ++ + + DELYD + DP E Sbjct: 445 SNAVETQRMTDASAKVMGYSLRTKRYRYTIWMENFRSNQAFKATAVVGDELYDYQKDPLE 504 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 N++ D +A + + D ++ Y P Sbjct: 505 KINVVKDRNYALIAKSLKDKMIRYFHSKEKP 535 >UniRef50_C6D448 Sulfatase n=2 Tax=Bacteria RepID=C6D448_PAESJ Length = 511 Score = 411 bits (1058), Expect = e-113, Method: Composition-based stats. 
Identities = 120/507 (23%), Positives = 201/507 (39%), Gaps = 67/507 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+PN L + +D Q N +G ++ + L+T N+D L G F AY +P CTP+RA + Sbjct: 1 MKKPNILLITSDQQHWNTLGYFNNE-LSTPNLDRLIKAGTTFTRAYCPNPTCTPSRASII 59 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE------- 113 TG Y +Q G WT ++ +G F AGY T +GK H Sbjct: 60 TGQYPSQHGAWTLGTKLLEDRHFVGEDFNSAGYKTALVGKAHFQPLSSTEEYPSLEAYPV 119 Query: 114 ----------CPPEWDADYW----------FDGANYLSELTEKEISLWRN------GLNS 147 P + ++ G +Y + EK WR+ G Sbjct: 120 LQDLEMWKQFNGPFYGFEHVELTRNHTNEAHVGQHYALWMEEKGCVNWRDYFLPPTGNMD 179 Query: 148 VEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY 207 I E + + I+ R ++Q A D+PF + S+ +PH + P + Y Sbjct: 180 PAITYKWPIPEKYHYNTWIAERTNALMEQYAEEDKPFFLWSSFFDPHPEYLVPEPWDTMY 239 Query: 208 ADFYYELGEKAQDDLANKPEHHRLWAQAMPSP---------------------------V 240 + + + P H L + P Sbjct: 240 DPDSLTIPDIVPGEHDKNPPHFGLTQEDNPDFSPWAETGNGIHGYRSHHYYEYGEKKKLT 299 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGA 299 D +Y+ +D IG +++ L +NT V++T+DHG G H L +KG Sbjct: 300 DYDKKKLVAVYYGMISMMDKYIGTILDKLEELGIADNTVVVFTTDHGHFFGQHGLQAKGG 359 Query: 300 AMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 Y+D+ R+P I+R P D S +DL PT ++L+ I P + G + V Sbjct: 360 FHYEDLIRLPFIVRYPGQVPAGVTSDAIQSLVDLAPTFLSLSGIPVPHAITGVDQSEVWR 419 Query: 358 PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKL-VLNLFTSDELYDRRNDPNEMHNLID 416 + E I + +V +K+ V T E++D ++DP+E++NL D Sbjct: 420 GTASAARDHAI-CEFRHEPTTIHQKTYVDQRYKITVYYNQTYGEIFDLQDDPSELNNLWD 478 Query: 417 DIRFADVRSKMHDALLDYMDKIRDPFR 443 D +A ++S++ + + + ++P Sbjct: 479 DPAYAALKSELLLKYI-WAELGKEPMP 504 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. 
Identities = 104/449 (23%), Positives = 176/449 (39%), Gaps = 64/449 (14%) Query: 10 MTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQSG 69 M D + CY + NT ++D LAA GIRF ++ VC+P RAGL TG Y ++G Sbjct: 1 MADDLGYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAG 60 Query: 70 ----PWTN-----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 + N + KN T+ + +DAGY T GKWHL + + Sbjct: 61 IDGVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQRQYNPTFRGFQQFV 120 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 Y +Y + L + W + + E H I++ A++F++Q Sbjct: 121 GYVSGNVDYFAHLDGTGVFDWWHNAELNRE-------EQGYVTHLINDHALEFIRQQ--Q 171 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 ++PF + ++++ H P+ P H + + + Sbjct: 172 EKPFFVYIAHEAVHSPYQGP---------------------------HDQPMRKEGGGDI 204 Query: 241 GDDGLYHHP-LYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE--MMGAHKLIS 296 Y N +D IG++++ L E T++ + SD+G KL Sbjct: 205 KSAKRKDIANAYREMNTEMDKGIGQIVDVLKEVNLTEKTFIFFLSDNGANKNGSNGKLRG 264 Query: 297 KGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEI--LPGENI 352 ++++ R+P I P D PV IDL+PT++ LA+ + P L G ++ Sbjct: 265 FKGSLWEGGHRVPAIACWPGRIPEGTVCDEPVISIDLMPTILELANAKIPAGHKLDGVSL 324 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN--LFTSDELYDRRNDPNE 410 +++ + R + F + +KLVLN ELYD D +E Sbjct: 325 VSLLKDRKS-------LVPRQIFWEYNGKSAMRQGHWKLVLNQTRKEPIELYDLTRDMSE 377 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIR 439 NL D+ +M AL + ++ Sbjct: 378 SKNLADNQ--PQRVQQMQSALAAWKSDVQ 404 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. 
Identities = 92/471 (19%), Positives = 160/471 (33%), Gaps = 66/471 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + + D +GCY K + T N+D LAA+G+RF Y+ VC P+R + TG Sbjct: 17 RPNVIVFLADDLGYGELGCYGQKKIKTPNLDQLAADGMRFTDFYSGHAVCAPSRCVMLTG 76 Query: 63 IYANQSGPWTNNVA----------------------PGKNISTMGRYFKDAGYHTCYIGK 100 + S N+ + +T + +GY T +GK Sbjct: 77 KHTGHSFVRENSEGRAAQAKERNRIKAADGYLPQIALPASEATYASALQKSGYRTACVGK 136 Query: 101 WHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 W L G+ P + D ++ + LWRN + + + Sbjct: 137 WGLGHPSNEGS---PNKHGFDLFYGYISQWQAHYYYPTYLWRNDVKEPLEGNDGKVGRQ- 192 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD 220 A + A+ F++ PF + + PH P + L E Q Sbjct: 193 YAADLMEQEALKFMETTGGG--PFFLYYATPVPHVSLQVPPD--------EPSLAEYKQA 242 Query: 221 DLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWV 279 P + + P D +Y A +D +G+ + L ++ NT + Sbjct: 243 FAGQDPPYDG---RKSYLPTEDP----RAIYAAMVTRMDRTLGKFRDLLKRTGQDQNTLI 295 Query: 280 IYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVS 327 I+TSD+G G L ++D R P I P + + Sbjct: 296 IFTSDNGATFNGGYDREFFGGNQPLRGMKTQLWDGGIRTPFIAAWPGSIQPGQVSRFVGA 355 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTD 387 DL PT + P L G +IL + + + + GG + Sbjct: 356 SWDLFPTFAEIVGFPVPAGLDGVSILPTLKGEVATQKQHDHLYWETVAGGH---QAVRMG 412 Query: 388 DFKLVL-----NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 +K + N +L++ D +E ++ + D+ +K+ + Sbjct: 413 PWKGIRLGVIKNPSAPVQLFNLETDVSETTDVAA--QHPDIVAKIATIMSA 461 >UniRef50_UPI00017453D4 choline sulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017453D4 Length = 485 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. 
Identities = 109/457 (23%), Positives = 182/457 (39%), Gaps = 35/457 (7%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N LF++ D + +G G + T N+D LA G F +A+ +P C+P+R TG Sbjct: 28 NVLFILVDDL-NDQIGWLGGAGI-TPNMDRLAQRGTLFANAHAQAPWCSPSRTSFLTGKR 85 Query: 65 ANQSGP-----WTNNVAPGKNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPEW 118 + +G W NV + + T+ ++F GY T IGK +H Sbjct: 86 PSTTGIYALTPWFRNVPALRELVTLPQHFAAHGYETFGIGKVYHEGCPPANQPTPEFSVM 145 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 + + + N +D + T ++++ A++ L +P Sbjct: 146 GY--------QGNWRKPQPSKPFVNTPGMRQDFGQFPDRDDQTDDFKVASSAIECLGRP- 196 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 +PF + V PH+P P ++ Y L E D + P R Sbjct: 197 -HTKPFFIAVGLRRPHYPLYAPQQWFSLYDPQNVWLPEVPATDRDDLPRFARALRLGNTE 255 Query: 239 PV------GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 P H+ Y AC FVD+QIGR+++AL T ++ SDHG +G Sbjct: 256 PTLGPIVNAGLWRSHNHAYLACVSFVDNQIGRILDALEQSGEAHRTVIVLASDHGFHLGE 315 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 +L +K +++ T +PLI P R PV +D+ PT+ + + P L GE+ Sbjct: 316 KELFAK-RTLWERATHVPLIFAGPGVGRGTSKRPVELLDIYPTLTEICGLPTPPGLEGES 374 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 + A+ G T+D++ + S+ELYD R DP E Sbjct: 375 LGALLRDPSAARTRPAIT------GQMQGSFAVRTEDWRYIRYADGSEELYDHREDPQEF 428 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 NL D R+ V++++ + + DP + + Sbjct: 429 LNLAADQRWTSVKTELGSWIPKHPA---DPVPGSEKT 462 >UniRef50_B6A548 Choline-sulfatase n=1 Tax=Rhizobium leguminosarum bv. trifolii WSM2304 RepID=B6A548_RHILW Length = 503 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. 
Identities = 110/483 (22%), Positives = 192/483 (39%), Gaps = 22/483 (4%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN LF+ D + Y N++ +A G+ F +AY P+C P+R + G Sbjct: 7 PNILFIQVDQLTAASLSAYGDTVCRAPNLERIADTGVVFETAYCNFPLCAPSRFSMAAGQ 66 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 + G + N +I T Y + AGY T GK H G D F E D + Sbjct: 67 LCSTIGAYDNAAEMPASIPTYAHYLRAAGYQTALSGKMHFIGPDQFHGFE--KRLTPDLY 124 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD-- 181 +++ + N +V + ++ +A+ L AR+D Sbjct: 125 PADFSWVPNWGNEGKRDT-NDTRAVLISGICERSVQIDFDENVTFQAIQHLYNIARSDDK 183 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH----HRLWAQAMP 237 PF + VSY PH P+ C E+ + Y + H + +A Sbjct: 184 RPFFLQVSYTHPHEPYLCRKEFWDLYEGVDVPMPAVDALSEQEHDPHSVRLLKDFAMLDV 243 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLIS 296 D Y+ ++D IG++++ L RENT +++ SDHGEM+G + Sbjct: 244 RFADGDIQRARRAYYGSISYIDSMIGQILDTLEAIGARENTAIVFASDHGEMLGERGMWF 303 Query: 297 KGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEK----PEILPGENI 352 K ++ R+PL++ +P + ++V VS +DLLPT+M LA E L G+++ Sbjct: 304 KK-HFFEAALRVPLLLNAPWIKPQRVSETVSLVDLLPTLMGLATGRVWRSETEELEGQDL 362 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 + + E+ + +P+ +KL+ + + L+D DP E+ Sbjct: 363 TGFLDREDHEPSRAVFA-EYLAEATPVPIFMVRKGRYKLISSSHDGNLLFDLMADPKELQ 421 Query: 413 NLIDDIRFADVRSKMHDALLDYMDK---IRDPFRSYQWSL---RPWRKDARPRWMGAFRP 466 NL +A++ +++ + D D+ D S L + RW +P Sbjct: 422 NLAGHTDYAEIEARLLKIVADKWDEGKLTEDILLSQARRLFVREAAKLGTPTRWNHDEQP 481 Query: 467 RPQ 469 + Sbjct: 482 GQE 484 >UniRef50_A6DG71 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG71_9BACT Length = 515 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. 
Identities = 108/489 (22%), Positives = 190/489 (38%), Gaps = 60/489 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLN---TQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 +PN LF+M D +GCY + T ID LA++GI+F++ + + +CTP+RA Sbjct: 28 SKPNILFIMADDHTKQAIGCYGSRLSKLNPTPTIDRLASQGIQFDNVFCSNAICTPSRAS 87 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG Y+ +G N + G + + + K AGY T IGKWHL Sbjct: 88 IITGQYSQTNGVLDLNGSIGPDKQFLPKEMKKAGYETAMIGKWHLKKEPA---------- 137 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 DY+ N S + + + I++ ++ +L+ Sbjct: 138 TFDYYCVLPGQGLYHN-----PIFNIRGSKPWPKNTITKKDQHSSDAITDISLHWLKNER 192 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA----------QDDLANKPEH 228 +PF ++ + PH F Y D + E +D + ++ Sbjct: 193 DKSKPFFLMHHFKAPHDMFEYAKRYESYLEDVHIPEPESLFSVPAGSAGSKDLGSGLSKN 252 Query: 229 HRLWAQAMPSPVGDD----------GLYHHPLYFACNDFVDDQIGRVINALTP-EQRENT 277 H W V DD + Y C +DD I R+++ L Q +NT Sbjct: 253 HNPWQLPQKLGVSDDIPEPEYTRLSYQKYLKAYLRCVKGIDDNIARLLSYLKDSNQLDNT 312 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTM 335 +IYTSD G +G H LI K MY++ +P I+ +P + + + +++ D PT+ Sbjct: 313 IIIYTSDQGFFLGEHNLIDK-RWMYEEAMGMPFIVYAPGMIKNNFKNNCLINNTDFAPTL 371 Query: 336 MALADIEK-PEILPGENILAVKEPRGVMVEF-----NRYEIEHDSFGGFIPVRCWVTDDF 389 + +A ++K P + G++ + E+ RY + ++ Sbjct: 372 LEIAGLKKTPNYMQGKSFYKALSNQQKPDEWRTVTYYRYWMHMAHKLAVPAHFGIRSESH 431 Query: 390 KLVLNLFTS------------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 KL+ E YD DP EM N + + ++ ++ LL+ Sbjct: 432 KLIFFYGRKYGRRGGKPTPISWEFYDLDKDPKEMKNEYKNPEYKEIIKRLKTQLLEIRKD 491 Query: 438 IRDPFRSYQ 446 + + + Y Sbjct: 492 LNEEDKKYP 500 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 410 bits (1056), Expect = e-113, Method: Composition-based stats. 
Identities = 113/497 (22%), Positives = 177/497 (35%), Gaps = 66/497 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + VMTD Q VG + + L T N+D AAEG + Y SP+CTP R+ L TG Sbjct: 46 RPNVILVMTDDQGWAEVGFHGNEVLKTPNLDRFAAEGTELTNFYV-SPMCTPTRSSLMTG 104 Query: 63 IYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 Y ++G +T+ F AGY T GKWHL E P Sbjct: 105 RYHFRTGAHDTYIGRSNMNPEETTIAEVFAGAGYRTGIFGKWHLG--------ENFPMRA 156 Query: 120 ADYWFDGANYLSELTEKEISLWRNG--LNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D F + + + + + + ++ F++ Sbjct: 157 EDQGFQKVVVHGGGGIGQFADYPGNTYWDPTLQYNDSFKKAKGYCTDVFIDESIQFMKDS 216 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++PF + + PH PF E+ Y + Sbjct: 217 --GEQPFFCYLPLNVPHSPFDVADEFRADYDNQNLADP---------------------- 252 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMG--AHKL 294 DG + D GR++ A+ RENT +++ SD+G L Sbjct: 253 -----DGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMSDNGPNSTYFTAGL 307 Query: 295 ISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEIL--PGE 350 +K ++Y++ R P +I+ P+ R+ DTP HIDLLPT+ I P L G+ Sbjct: 308 RAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHIDLLPTLADACGIGLPADLQVDGK 367 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR--CWVTDDFKLVLNLFTSD--ELYDRRN 406 +IL + + ++H+ +K+V + ELY+ Sbjct: 368 SILGLLHGETQGFQQRYLFMQHNRANVPPKYENCMARRGPWKVVGDGGEPTGFELYNIEQ 427 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKI-----RDPFRSYQWSLRPWRKD----AR 457 DP E +L D + ++ + D + RD Y L P +K Sbjct: 428 DPGETRDLAD--KHPEIVKAFVREYEAWFDDVTTQLRRDNGVPYPTELNPEQKRDFRFTW 485 Query: 458 PRWMGAFRP-RPQDGYS 473 W G RP + Sbjct: 486 QDWWGDKTGWRPNNYGR 502 >UniRef50_A6DGT7 Sulfatase family protein n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGT7_9BACT Length = 504 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. 
Identities = 105/483 (21%), Positives = 181/483 (37%), Gaps = 58/483 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L + D M+G Y + + ID LA + AY VC +RA + TG Sbjct: 19 RPNILIISVDDLKP-MLGTYGDPLVQSPTIDKLAEASALYEKAYCQQAVCGASRASIMTG 77 Query: 63 IYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWH------LDGHDYFGTGE 113 + + S W T+ YFK GY TC+ GK + Sbjct: 78 LRPDNSRVWEFRQVMRERNPQAITIPEYFKSQGYMTCFAGKIFDYRCVADGKKQDLKSWS 137 Query: 114 CPPEWDADY-----WFDGANYLSELTEKEISLWRNGLNSVEDLQAN------------HI 156 P + F + +L KEI L +NG + D Sbjct: 138 RPEQPRNSEAMKNLGFADPAFREKLRLKEIELKKNGQKASYDAIKKAIGGSPCYEDSIDG 197 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGE 216 + I+ V +++ + +PF + V + +PH PF P +Y + Y + + L E Sbjct: 198 PDEIYEDGMIAREGVRLIKELGQKKKPFFIAVGFKKPHLPFNAPKKYWDLYKETDFAL-E 256 Query: 217 KAQDDLANKPEH---------HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVIN 267 K Q + P + + + Y AC +VD QI +++ Sbjct: 257 KYQKPVQGAPHYAYQNSWEFSGYNVPRINGEVLESFQRKLKHAYAACISYVDAQIAKLLK 316 Query: 268 ALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDT 324 L + E NT +++ SDHG +G H + K + Y+ TR+P + P+ ++ + Sbjct: 317 TLKDQGLEKNTVIVFWSDHGFHLGDHGMWCKHSN-YEQATRVPFFVYDPRQNLKKGRYTQ 375 Query: 325 PVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEH-DSFGGFIPVRC 383 PV ID+ PT+ L+ + PEIL G+++ +F + I Sbjct: 376 PVELIDMFPTLCQLSGLAIPEILDGKSL---LSEAAENAKFALSQFPRNQGKNKKIMGYG 432 Query: 384 WVTDDFKLVLNLFTSD-------------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 + + ++ + + + ELYD DP E NL ++ + + ++ Sbjct: 433 FRFERYRYIEWVDNNYQQDNTQLGPLKAVELYDYEKDPLEQVNLANNPEYKSILRRLQQE 492 Query: 431 LLD 433 + Sbjct: 493 AKE 495 >UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA41_PLALI Length = 519 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. 
Identities = 97/505 (19%), Positives = 170/505 (33%), Gaps = 69/505 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +MTD Q + + + T ++D L + +RF + SP C P RA + T Sbjct: 45 RPNIILMMTDDQGYGDLSLHGNPVVKTPHLDQLGRQSVRFEQFHV-SPTCAPTRASIMTS 103 Query: 63 IYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 + SG + + ++ K AGY T GKWHL D + G+ + Sbjct: 104 RHEFSSGVTHTILERERLSLKATILPQFLKRAGYTTGIFGKWHLGDEDAYQPGKRGFDEV 163 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 + G + + L + N V + ++A+ ++ Sbjct: 164 FIHGGGGIGQSYPGSCGDAPLNKY-FNPVIRHNGKFVATNGYCTKVFVDQAITWISSQPD 222 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 ++PF ++ + PH P CP EY E Y Sbjct: 223 -NQPFFCYITPNAPHAPLDCPKEYYEPY-------------------------------- 249 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH----KL 294 + ++ DDQ+GR++ AL ++T VI+ +D+G GA + Sbjct: 250 -LEHVPEDVARFYGMITHWDDQLGRLLKALEDRDISKDTIVIFMTDNGSATGAKHFSAGM 308 Query: 295 ISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEK----PEILPG 349 + Y+ R+P + + H D+LPT+ LA++ + G Sbjct: 309 RANKGTPYEGGIRVPAFWSWAGHWQPQVRQEVTCHYDILPTLTELANVPVADDEKQSWQG 368 Query: 350 ENILAVKEPRGVMVEFN-------RYEIEHDSFGGFIPVR----CWVTDDFKLVLNLFTS 398 +++ + R R+ EHD + D+KL+ N+ Sbjct: 369 RSLVPLLAGRSPNWPPRPFITHVGRWPKEHDPKREPSTYQYAKCAIRLGDWKLISNVKQG 428 Query: 399 ---DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD 455 ELY DP E NL ++ D ++ + + + Sbjct: 429 EPQWELYQLAEDPAEKINLAK--KYPDRVEELKKIYDAWWLSVVPKMENESIEGPAVNPF 486 Query: 456 ARPRWMGAFRPRPQ----DGYSPVV 476 + W P P SP Sbjct: 487 HKAYWEQYQGPGPNHANPPDGSPRP 511 >UniRef50_A0LK86 Sulfatase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LK86_SYNFM Length = 487 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. 
Identities = 113/445 (25%), Positives = 178/445 (40%), Gaps = 28/445 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN L + D + +GC + T NID LA G+ F +A SP+C+P+RA FT Sbjct: 47 KPNVLMFVLDDM-NDWIGCLGGHPDVKTPNIDRLAQRGVLFRNAQCSSPICSPSRASFFT 105 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPE 117 GI + SG + N+ A N T+ ++F GY + GK +H D E P Sbjct: 106 GIRPSTSGIYGNSQAFRKIMPNAVTLPQHFIAHGYRSMGCGKLFHFIKTDSRSWHEFFPS 165 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + FD + L+ + D I + +++ A D L++ Sbjct: 166 RSMERPFDPVPPNAPLSGLPDV-------NQFDWGPIDIVDEELGDGKLARWAADALRR- 217 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 R D PF + V PH P P +Y + Y L +DL + P WA+ Sbjct: 218 -RYDRPFFLGVGLLRPHVPLYVPRKYFDMYPPESITLPTVKANDLDDVPPTGVSWAKPER 276 Query: 238 SPV---GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHK 293 + D Y A FVD Q+G V++AL NT V+ D+G +G Sbjct: 277 HQLIVEHDQWRKAVAGYLASVSFVDAQVGWVLDALDESPYVNNTVVVLWGDNGWHLGEK- 335 Query: 294 LISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 L ++++ R+PLII P R+ PVS +D+ PT+ L D+ L + Sbjct: 336 LHWTKLTLWEESCRVPLIIALPGLTPPGRKCAKPVSTMDVYPTLNELCDLTPKPELECRS 395 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 IL + + + ++ + ++ELYD + DP E Sbjct: 396 ILELLRNPQSDTWDGPPALSTY----MPGNHSLRDERYRYIRYNDGTEELYDLKADPMEW 451 Query: 412 HN-LIDDIRFAD-VRSKMHDALLDY 434 +N L VR ++ L + Sbjct: 452 NNLLAGGGTGPAGVRDRLSAFLPKF 476 >UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C586_BEUC1 Length = 478 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. 
Identities = 109/465 (23%), Positives = 179/465 (38%), Gaps = 45/465 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + V+ D +GC+ T +ID+LAA G RF +Y +PVC+P RA L TG Sbjct: 15 RPNIVLVVVDDLGWRDLGCFGSTFYETPHIDALAASGTRFTHSYAAAPVCSPTRASLLTG 74 Query: 63 IYANQSGPWT--------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 Y + G ++ + R + GY T ++GKWHL G Sbjct: 75 KYPARVGVTNWIGGHAIGALRDVPYFHGLPQDEYALARALRAGGYRTWHVGKWHLGGGR- 133 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 P D G+ S ++ + R+++ Sbjct: 134 ----HLPEHHGFDLNVGGSASGSPVSYYAPYGIG---------ALEDAPDGEFLTDRLTD 180 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 AVD ++ + D PFL+ + + H P P +EKY LG A + Sbjct: 181 VAVDLVR--SSDDAPFLLNLWHYAVHTPIEAPAHLVEKYRHKAETLGLPTHGPDAVEAGE 238 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGE 287 H V + P Y A + +D +GR++ AL + ++T +++TSD+G Sbjct: 239 HMPARHLRSERVRRRRIQSDPTYAAMLETLDGAVGRLVTALRDVGKLDDTLIVFTSDNGG 298 Query: 288 M-------MGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMAL 338 + L M D TR+P I+ P + D P + D PT++A Sbjct: 299 LSTAEGSPTCNAPLSEGKGWMADGGTRVPTIVSWPGRVPAGARSDLPFTSPDFYPTLLAA 358 Query: 339 ADIE--KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 A + + + G N+ + + + H S G P +KLV + Sbjct: 359 AGLTQLPEQHVDGVNLWPAWQGAPLDRGPIFWHYPHYSNQGGAPSAAVRDGRWKLVRHFG 418 Query: 397 TS-DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 DEL+D D +E H++ R DV +++ L ++ + Sbjct: 419 IEHDELFDVVADVSESHDVSGRRR--DVVARLSVTLDSWLADVGA 461 >UniRef50_O69787 Choline-sulfatase n=53 Tax=Alphaproteobacteria RepID=BETC_RHIME Length = 512 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. 
Identities = 108/517 (20%), Positives = 199/517 (38%), Gaps = 33/517 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +M D + L+ N+ +LA RF++ YT SP+C PARA G Sbjct: 5 KPNILIIMVDQLNGKLFPDGPADFLHAPNLKALAKRSARFHNNYTSSPLCAPARASFMAG 64 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +++ + N +I T + + AGY+T GK H G D E D Sbjct: 65 QLPSRTRVYDNAAEYQSSIPTYAHHLRRAGYYTALSGKMHFVGPDQLHGFE--ERLTTDI 122 Query: 123 WFDGANYLSELTEKE--ISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 + + + + I W + L SV I + ++ A L Q +R Sbjct: 123 YPADFGWTPDYRKPGERIDWWYHNLGSVTGAGVAEITNQMEYDDEVAFLANQKLYQLSRE 182 Query: 181 D-----EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA-- 233 + P+ + VS+ PH P+ ++ + Y D + E L + H + Sbjct: 183 NDDESRRPWCLTVSFTHPHDPYVARRKFWDLYEDCEHLTPEVGAIPLDEQDPHSQRIMLS 242 Query: 234 --QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL-TPEQRENTWVIYTSDHGEMMG 290 ++ YFA ++D+++G +I+ L ++T +++ SDHG+M+G Sbjct: 243 CDYQNFDVTEENVRRSRRAYFANISYLDEKVGELIDTLTRTRMLDDTLILFCSDHGDMLG 302 Query: 291 AHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADI---EKPEIL 347 L K ++ R+PL+I P TP S++D+ PT+ LA I E Sbjct: 303 ERGLWFK-MNFFEGSARVPLMIAGPGIAPGLHLTPTSNLDVTPTLADLAGISLEEVRPWT 361 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G +++ + + +E+ + + P+ +K V ++L+D D Sbjct: 362 DGVSLVPMVNG---VERTEPVLMEYAAEASYAPLVAIREGKWKYVYCALDPEQLFDLEAD 418 Query: 408 PNEMHNLIDDIRFADVRSKMHD---------ALLDYMDKIRDPFRSYQWSLRPWRKDARP 458 P E+ NL ++ R ++ + + + +R+ R A Sbjct: 419 PLELTNLAENPRGPVDQATLTAFRDMRAAHWDMEAFDAAVRESQARRWVVYEALRNGAYY 478 Query: 459 RWMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEEKKQ 495 W + + Y + DT + K + + Sbjct: 479 PWDHQPLQKASERYMRNHMNLDT---LEESKRYPRGE 512 >UniRef50_C1ZIM5 Arylsulfatase A family protein n=2 Tax=Planctomycetaceae RepID=C1ZIM5_PLALI Length = 523 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. 
Identities = 111/469 (23%), Positives = 178/469 (37%), Gaps = 38/469 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRPN L + D Q + + + T + SLA G F +A+ +P+C P+R L Sbjct: 46 KRPNVLMIAIDDQ-NDWIEPLGGHPLVKTPQLKSLAERGTVFLNAHCQAPLCNPSRTSLL 104 Query: 61 TGIYANQSGPWTNNVAPGK-----NISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G+ + +G + + T+ + F AGY T GK G G Sbjct: 105 LGLRSTTTGIYGLSPWFRDVPALSGRLTLPQAFGKAGYTTLSTGKIFHGG----GGKPKD 160 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D W ++ I + V+ H+D +I++ A++ L+ Sbjct: 161 RLKEFDEWGPAGGVGKRPEKRLIQPPPHSNPLVDWGAFPHLDSEK-GDTQITDWAIEKLK 219 Query: 176 QPARAD-------EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 Q +PFLM V Y PH P E+L Y D L +DD + P Sbjct: 220 QRQVQQSSSTGESKPFLMCVGYFLPHVPCYVTPEWLAMYPDDDSILPFIEKDDRKDTPRF 279 Query: 229 HRLWAQAMPSPV------GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 +P P + Y A +VD QIGR++ AL NT ++ Sbjct: 280 SWYLHWRLPEPRLKWLQQHEHWRSLVRSYLASTSYVDAQIGRLLAALEATGEANNTLIVL 339 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALAD 340 SDHG +G + K +++ TR+PL+ P + PV +D+ PT+ L Sbjct: 340 WSDHGWHLGEKGITGK-NTLWERSTRVPLLFAGPGVLAGGKCVEPVELLDIYPTLAQLCQ 398 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDE 400 +E P L G +++ + + + T D + + S+E Sbjct: 399 LEAPTDLEGVSLVPQLTNPLAVRQRPAITSHNQGN------HAIRTRDHRYIRYADGSEE 452 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 LYD DP+E+ NL DD S + L ++ I P + Sbjct: 453 LYDHLVDPHELKNLADDPAH----SGLKKQLNSWLPSIDQPPVTGSKDR 497 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 409 bits (1053), Expect = e-113, Method: Composition-based stats. 
Identities = 103/497 (20%), Positives = 179/497 (36%), Gaps = 63/497 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D + VG K + T ++D LAA G+ F + Y P C+P+RAGL TG Sbjct: 44 RPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTG 103 Query: 63 IYANQSGPWTNNVA-----------PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 + + G +N + +T+ K+AGY T IGKWHL F Sbjct: 104 RHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGDAKPF-- 161 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P D WF + ++ + L + S AV Sbjct: 162 --WPNRRGFDEWFGFSGGGFSYW-GDLGMKDPLLGVHRGDEPVDPKTLTHLTDDFSTEAV 218 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 F+Q+ EPF + ++Y+ PH P +L+K Sbjct: 219 KFIQR--HETEPFFLYLAYNAPHAPDHATRAHLQK------------------------- 251 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE--- 287 +Y A +D+ IGRV++ + ENT +I+ SD+G Sbjct: 252 --------TAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGGRRE 303 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT--PVSHIDLLPTMMALADIEK-- 343 +++ R+P ++ P R + P++ +DL PT +A A ++ Sbjct: 304 HAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-LNLFTSDELY 402 + L G+N+L V + R S G ++KL+ L+ Sbjct: 364 NDKLDGQNLLPVLTDDKQRLPE-RPLFWRYSMGDDSYGYAVRDGNWKLIDSRYKDRKLLF 422 Query: 403 DRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMG 462 D NDP E +L + + +++ + + + P S + +++ Sbjct: 423 DLANDPWEREDLAA--QHPEQVARLSRMMEAWDARNVPPKWSDAHGVNVRKEENTRNEAV 480 Query: 463 AFRPRPQDGYSPVVRDY 479 R + + Sbjct: 481 EKASRGERSRPDLDLHS 497 >UniRef50_UPI0001C35931 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35931 Length = 472 Score = 409 bits (1053), Expect = e-112, Method: Composition-based stats. 
Identities = 116/515 (22%), Positives = 192/515 (37%), Gaps = 70/515 (13%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 M+RPN +F++ D +G + T N+D +A EG + + SPVC+PARA L Sbjct: 1 MRRPNIVFILADDMGFWTLGSAGNRDAVTPNLDEMAREGCIAENFFCSSPVCSPARATLL 60 Query: 61 TGIYANQSGPWT--------NNVAPG----KNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 TG + G N + Y +AGY GKWHL Sbjct: 61 TGRMPSMHGILDWILRGNIKNEGEIPIEYLNDFKGYTDYLSEAGYICGLSGKWHLGD--- 117 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 + +W+ + + + + R G + E I+ Sbjct: 118 ----SQKQQKGFSHWY--VHQSGGGSYYDAPMIREGKR---------VCEQGYITELITR 162 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQDDLANKP 226 AV FL + A + PF + V++ PH P+ P +YL+ Y D + + P Sbjct: 163 DAVRFLNEHAGKNAPFYLGVNFTAPHTPWIHNHPQKYLDLYRDCAF----------DSCP 212 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDH 285 R Q + D YFA +D +G + L + + +T ++++SD+ Sbjct: 213 VEQRHPWQIDFAEFNYDRTEMLKGYFAATSALDVGVGEIREELKRLKLDQDTLILFSSDN 272 Query: 286 GEMMGAHKLISKGA-----AMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMAL 338 G G H + KG MYD ++P + P ++ S D PT+M + Sbjct: 273 GFNCGHHGIWGKGNGTAPFNMYDTSVKVPFLACMPGKIQPGTRLRGLYSAYDFFPTIMEI 332 Query: 339 ADIEKPE-ILPGENIL-AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-LNL 395 A ++ E LPG++ AV + VR ++K + Sbjct: 333 AGVQYKEKGLPGKSFAKAVFSGEERDINDCVVVYSEYG-----AVRMIRQKEWKYIRRYP 387 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKD 455 DELY+ + DP+EM N+ID ++ ++ L + + P + Sbjct: 388 EGPDELYNLKTDPDEMRNMIDKAA-PELIELLNKRLDSWFSEHTRPETDG--------RQ 438 Query: 456 ARPRWMGAFRPRPQDGYSPV--VRDYDTGLPTQGV 488 A G R G+ P + Y+T LP + Sbjct: 439 ANVTGAGQNRKYTNHGFEPGSFEKGYET-LPIRQA 472 >UniRef50_B4D0V9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D0V9_9BACT Length = 497 Score = 409 bits (1053), Expect = e-112, Method: Composition-based stats. 
Identities = 99/451 (21%), Positives = 173/451 (38%), Gaps = 19/451 (4%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++ PN LF+++D Q + + + T N+D L G F A P+C +RA + Sbjct: 41 VRHPNILFIISDDQRPDTIAALGNPIIQTPNLDRLVHGGTAFTRAVAAYPICYVSRAEIL 100 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW-- 118 T + A ++G ++T + AGYHT ++GKW E Sbjct: 101 TSVCAFRNGVGYTGNKLDPKLATWSGTLRSAGYHTWFVGKWDNGATPKAYGYEETRGLYT 160 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR---ISNRAVDFLQ 175 + + + +R +D + ++ A+DF++ Sbjct: 161 GGGAPLQNTPSYVDHAGRPATGYRGYTFKTDDGKPLPELGVGLTPDISRHFADAAIDFIE 220 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQ--DDLANKPEHHRLWA 233 + + EPF + V++ PH P P + KY L + + + R Sbjct: 221 R--KPAEPFFLHVAFTAPHDPRLLPPGWETKYDPKTMPLPKNFRSVHPFDHGNMGGRDEV 278 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAH 292 D+ Y+A +D+QIGR++ AL +NT +I+TSD G +G+H Sbjct: 279 LLASPRRPDEVRAELAAYYAAISGMDEQIGRIVEALKSTGQLDNTLIIFTSDQGLAVGSH 338 Query: 293 KLISKGAAMYDDITRIPLIIRSPQGERR-QVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 LI K +Y+ +PLI+ P + + DL PT + I P + G + Sbjct: 339 GLIGK-QNLYEHTLGVPLIMSGPGIPKGETREAQCDLRDLFPTTCEVTGIATPPAVQGRS 397 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-ELYDRRNDPNE 410 ++ V V R +KL+L +L+ +DP+E Sbjct: 398 LVPVLRDAQKTV------YPFVVGYYTDAQRAIREGTWKLILYPKAKRTQLFYLASDPDE 451 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 MH+L A + + LL ++ + DP Sbjct: 452 MHDLSAQPEQARRLADLRIKLLGWLKENGDP 482 >UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI0_9BACT Length = 456 Score = 409 bits (1053), Expect = e-112, Method: Composition-based stats. 
Identities = 97/471 (20%), Positives = 169/471 (35%), Gaps = 74/471 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F+M D +G Y K + T +D +A EG+R Y + VC P+R L TG Sbjct: 19 KPNIIFIMCDDMGYGQLGSYGQKMIKTPRLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTG 78 Query: 63 IYANQSGPWTN------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + + N T+ K+AGY T IGKW L Y G+ P Sbjct: 79 QHVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALIGKWGLG---YPGSEGEPN 135 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 + DY+F + L RN L+ N E + +++ A F+++ Sbjct: 136 KQGFDYFFGYNDQKHAHNHFPKFLLRN--EETLTLKNNSGKEIEYSQYMLTDEAKGFIKK 193 Query: 177 PARADEPFLMVVSYDEPHHPFTCP--VEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 D PF + ++Y PH P E +Y D + Sbjct: 194 --NKDNPFFLYLAYVIPHSRLQIPGDDECYLQYKDESWP--------------------- 230 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG------- 286 + H +D +G +++ L ENT V++TSD+G Sbjct: 231 -------EKQKKH----AGMISRLDKDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGA 279 Query: 287 ---EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADI 341 + L +MY+ R+P I P + + +H DL+PT L + Sbjct: 280 RPEFFNDSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNHIGAHWDLMPTACELGGV 339 Query: 342 EKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF-----K--LVLN 394 + PE + G + + + + E + Y + R D+ K + Sbjct: 340 QPPEGIDGISYVPLLKGNMEEQEKHDYLYFELHW---PTKRGVRKGDWVALQSKTSAIDP 396 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 + +L++ +ND + +L ++ + + L+ P + Sbjct: 397 NKDTIKLFNLKNDLGQKKDLA--TQYPEKVEEFKKIFLE--AHTPAPLFEF 443 >UniRef50_Q1GMK9 Choline sulfatase n=8 Tax=Alphaproteobacteria RepID=Q1GMK9_SILST Length = 504 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. 
Identities = 122/491 (24%), Positives = 205/491 (41%), Gaps = 26/491 (5%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN L M D + + L+ N+ LAA RF + YT SP+C P RA +G Sbjct: 4 PNILIFMVDQLNGTLFPDGPAEWLHAPNMKKLAARSTRFRNCYTASPLCAPGRASFMSGQ 63 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 + +G + N +I T + + AGY+TC GK H G D E D + Sbjct: 64 LPSATGVYDNAAEFASSIPTYAHHLRRAGYYTCLSGKMHFVGPDQLHGFE--ERLTTDIY 121 Query: 124 FDGANYLSELTEKE--ISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + + I W + + SV I + ++ A + AR Sbjct: 122 PPDFGWTPDYRKPGERIDWWYHNMGSVTGAGVAEISNQMEFDDEVAFHATQKIYDLARGK 181 Query: 182 --EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 P+ + VS+ PH P+ +Y + Y D + + E A N+ H + A Sbjct: 182 DARPWCLTVSFTHPHDPYVTRKKYWDLYEDCPHLMPEVADLGYENQDPHSKRIFDANDWR 241 Query: 240 ----VGDDGLYHHPLYFACNDFVDDQIGRVINALT-PEQRENTWVIYTSDHGEMMGAHKL 294 +D YF ++DD+IG V+ AL Q ++T +++ SDHG+M+G L Sbjct: 242 NFDITEEDIRRSRRAYFGNISYLDDKIGEVMEALEGTRQDKDTIILFVSDHGDMLGERGL 301 Query: 295 ISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILP---GEN 351 K + Y+ +R+P++I +P V PVS+ID+ PT+ LA + E++P GE+ Sbjct: 302 WFK-MSFYEGSSRVPMMISAPNMTPGLVCDPVSNIDVCPTLCDLAGVSMSEVMPWTAGES 360 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 ++ + + +E+ + + P+ + +KL L D+L+D DP+E Sbjct: 361 LVPLGQGGT---RSTPVAMEYAAEASYAPMVSLRSGRYKLNLCALDPDQLFDLDADPHER 417 Query: 412 HNLIDDIRFADVRSKMHDALLDY--MDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQ 469 NL D + + + +D+ R+ Q R W R G F Sbjct: 418 VNLAKDPTHHEAYQALKAIAAERWDLDRFDADVRASQ--ARRWVVYEALRQGGYFP---- 471 Query: 470 DGYSPVVRDYD 480 Y P+ + + Sbjct: 472 WDYQPLQKASE 482 >UniRef50_A6L183 Iduronate 2-sulfatase n=11 Tax=Bacteroides RepID=A6L183_BACV8 Length = 477 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. 
Identities = 110/452 (24%), Positives = 184/452 (40%), Gaps = 29/452 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N LF+M D +GCY K + T NID AA G+ F +AY PV +RA L T Sbjct: 26 EKMNVLFLMADDMRPE-LGCYGVKEVKTPNIDRFAASGLLFQNAYCNIPVSGASRASLLT 84 Query: 62 GIYANQSGPWTNNVAP-GKNIST---MGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G+Y + + N A K+ T + R+F GY+T GK D+ + PP Sbjct: 85 GVYPHYPDRFVNYSAYASKDCPTAIPISRWFTSHGYYTISNGKVFHHLSDHANSWSEPPY 144 Query: 118 W----DAD-YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 D YW + + + E + + +T +++ +A+ Sbjct: 145 RKHPDGYDVYWAEYNKWELWMNEASARTINPKTMRGPFCEWAEVPDTAYDDGKLALKAIA 204 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK--AQDDLANKPEHHR 230 L++ +PF M + +PH PF P +Y + Y + DL N+ ++ Sbjct: 205 DLKRLKEQGKPFFMACGFWKPHLPFNAPKKYWDLYDREKIPVANNRFRPKDLPNEVKNST 264 Query: 231 LWAQAMPSPVGDD---GLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 + DD Y+AC +VD QIG+V++AL NT V+ DHG Sbjct: 265 EIYAYARTTTADDISFQKEAKHGYYACLSYVDAQIGKVLDALDELGLANNTIVVLLGDHG 324 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEI 346 +G H + K M D T +PLI+R P ++ + + V +DL PT+ L + P+ Sbjct: 325 WHLGEHNFLGKHNLM-DRSTHVPLIVRVPGLKKGKTKSMVEFVDLYPTLCELCHLPIPKN 383 Query: 347 -LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS----DEL 401 L G + + + ++ Y V++ + L Sbjct: 384 QLDGTSFVPILTNLKAKIKDQVYIQWEGGDNT-------VSNRYNYAEWKQKEKIHSRML 436 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 +D DP E N +++ ++ +K+ L Sbjct: 437 FDHHIDPEENKNRVNERKYRSEINKLSSFLKA 468 >UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI1_9BACT Length = 472 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. 
Identities = 105/497 (21%), Positives = 184/497 (37%), Gaps = 78/497 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN ++++ D VG K + T +D LA++G+RF Y + VC P+RA L TG Sbjct: 20 KPNIIYILCDDLGYGEVGYNGQKMIQTPELDKLASKGMRFTDHYCGNAVCAPSRASLITG 79 Query: 63 IYANQS-------GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 + + G + T+G+ K AGY T IGKW L G F P Sbjct: 80 KHPGHAFIRANSPGYPDGQTPIPADSETLGKLMKRAGYATACIGKWGLGG---FHNAGNP 136 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D+++ + LWRNG E L + +E ++ A+ +++ Sbjct: 137 HKQGFDHFYGYTDQRKAHNYYPEYLWRNGEK--EMLNNKNGEENDYSHDLMTVDALKYIE 194 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 + + D+PF + ++Y PH + P L +Y D + Sbjct: 195 E--KKDQPFFLYLAYLIPHVKYQVPD--LAQYKDKDWP---------------------- 228 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG-------- 286 ++ A +D IG + L +NT +++ SD+G Sbjct: 229 ----------KEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIMFNSDNGAHGKSNSE 278 Query: 287 -EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEK 343 + L +MYD R P+I P D + D++PT L Sbjct: 279 KFFNTSGDLKGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFWDMMPTFSELTGEPF 338 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV---LNLFTSDE 400 G ++L + + ++Y + + P +K V + E Sbjct: 339 KGETDGISMLPTLLGKDSEQKQHKYLYW-ELYESNKPNCAIRFGKWKGVVLDRRKGLNIE 397 Query: 401 LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRW 460 LYD D +E NL ++ +V ++ +++ ++ P+ W KD +P + Sbjct: 398 LYDMSGDQSESKNLAA--QYPEVVDEIRKMMVE--AHVKSPY---------WDKDFKPLY 444 Query: 461 MGAFRPRPQDGYSPVVR 477 A G P+ R Sbjct: 445 -NAKAACEDTGVKPMPR 460 >UniRef50_C5BWB0 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5BWB0_BEUC1 Length = 497 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. 
Identities = 113/459 (24%), Positives = 181/459 (39%), Gaps = 31/459 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L VMTD Q + +G G + T N+D LAA+G F AY+ +P CTPARA L TG Sbjct: 4 RPNVLLVMTDQQRWDTLGSAGGP-VETANLDHLAAQGTTFTHAYSATPSCTPARASLLTG 62 Query: 63 IYANQSGPWTNNVAPGKN---ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP---- 115 +G +T+ DAGYHT +GK H Sbjct: 63 QDPWHTGILGMGAGQPPMAGLENTLPEALADAGYHTQGVGKMHFSPQRALHGFHATTIDE 122 Query: 116 --PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDL-QANHIDETFTWAHRISNRAVD 172 + + D + ++ +GL+ L + H E + ++ Sbjct: 123 SLRVEEPGFTSDYTQWFERHAPADVRQADHGLDFNSWLARPFHTGEHLHPSTWTVTESIR 182 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY--ADFYYELGEKAQDDLANKPEHHR 230 FL++ PF ++ S+ PH P+ P Y E Y +L D A+ + Sbjct: 183 FLERR-DPTRPFFLMTSFARPHSPYDPPAFYYEHYLRRHHTGDLPPAVVGDWASVHDVGG 241 Query: 231 LWAQA----MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDH 285 D+ Y+ +D QIGR++ L + + T V++T+DH Sbjct: 242 AEGMDPNAWRGRRTADEIGRARAGYYGSIHHIDHQIGRLMRYLRDRRLDAETLVVFTADH 301 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR-----QVDTPVSHIDLLPTMMALAD 340 G+M+G H L K Y+ +PL++R P G R VD PV D++PT++ Sbjct: 302 GDMLGDHHLWRK-TYAYEGSAHVPLVVRLPAGMRSAGDAEVVDDPVCLQDVMPTILDACG 360 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT--- 397 ++ P + G + L + V + + ++ +K V Sbjct: 361 VDVPASVDGASTLPLVTGERVPWREFVHGEHSTCYHPSQEMQYLTDGAWKYVWFPRGDGP 420 Query: 398 ---SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 ++L+D R+DP E +L A V + L+D Sbjct: 421 GSPREQLFDLRSDPYEERDLAPRSDHAAVLRRWRARLVD 459 >UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR48_9BACT Length = 508 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. 
Identities = 99/500 (19%), Positives = 168/500 (33%), Gaps = 83/500 (16%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN +F + D VGC+ K ++T NID +A EG++F Y+ SPVC P+R L Sbjct: 35 LRKPNVIFFIADDLGYADVGCFGQKKIHTPNIDRIATEGMKFTQHYSGSPVCAPSRCVLM 94 Query: 61 TGIYANQSGPWTNNV-------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGE 113 TG ++ S N N T+ R + GY T GKW L G + G Sbjct: 95 TGKHSGHSAVRDNRELKPEGQFPLPANTITVARLLQQNGYITGAFGKWGLGGPESSGK-- 152 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF------------- 160 P + +F LW + D D+ Sbjct: 153 -PLDQGFTRFFGYNCQRVAHNLFPTYLWDDNHRLALDNPPIGEDQKLPADADSNDPASYK 211 Query: 161 ------TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL 214 + +A+ F++ D PF + PH P + L++Y E Sbjct: 212 AFTGKSYAPDLYAEQALRFIRD--NKDHPFFLFFPTIVPHVALQVPEDSLKEYEGKLPET 269 Query: 215 GEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR 274 H Y A +D +GR++ + Sbjct: 270 PYTGGKGYLPN-------------------RTPHAAYAAMITRMDRDLGRMLALIKELNL 310 Query: 275 -ENTWVIYTSDHG------------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--R 319 ++T ++TSD+G + S ++Y+ RIPLI+R Sbjct: 311 DDDTIFVFTSDNGPAPQDMGGTDTKFFNSSGPFRSGKTSIYEGGMRIPLIVRWHGKIQPN 370 Query: 320 RQVDTPVSHIDLLPTMMALAD--IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGG 377 D D LPT++ L+ P + G + + + + F Sbjct: 371 STSDRVTGFEDWLPTLLELSGNKKSVPTGIDGLSFASTLLGEKLPER----PFLYREFPA 426 Query: 378 FIPVRCWVTDDFKLVLNLFTSD---------ELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 + + ++K V ELYD + D E H++ D D+ +K+ Sbjct: 427 YGGQQAIRVGNWKAVRQHLKPKGNAKPNLHIELYDLQTDIAESHDVSD--EHPDIVTKLD 484 Query: 429 DAL-LDYMDKIRDPFRSYQW 447 + + ++ PF + Sbjct: 485 NLMREQHIPSKAFPFPALDK 504 >UniRef50_A6DFZ4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFZ4_9BACT Length = 519 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. 
Identities = 117/488 (23%), Positives = 188/488 (38%), Gaps = 55/488 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++ N L + D + CY K + NIDSLA G F S Y VC P+R +FT Sbjct: 19 EKANVLIITIDDLKP-TLACYGDKYAVSPNIDSLADNGTLFRSNYCQQAVCAPSRISMFT 77 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WH-------------LD 104 G+ + +G + NI TM +YFK+ GY + GK H L Sbjct: 78 GLRPDTTGILDLHTHMRDINPNILTMPQYFKENGYLSIGYGKLMHGAKNDDKELSWSELG 137 Query: 105 GHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLW-------RNGLNSVEDLQANHID 157 + P D +L + + L + +A + Sbjct: 138 DDLPYNKNHPKPVLDKFQNPKAHQVFKKLNKTQKRLKTSLLQKEMKNKGAYLVSEAYDLP 197 Query: 158 ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 + ++ + L + A E F MV+ +++PH PF P +Y + Y L E Sbjct: 198 DDAYRDGAVAKAGIQRLNELAETKEKFFMVLGFNKPHLPFNAPKKYWDMYDPNKLPLAEH 257 Query: 218 AQDDLANKPEHHRLWAQAMPSP--------VGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 + D + + + + Y+AC +VD Q+GRV++ L Sbjct: 258 QKQDQQRPKYAYHSFGELAAYKDYQIGKAVDEKRQRHLIHAYYACVSYVDAQVGRVMDEL 317 Query: 270 TPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVS 327 + NT V+ DHG +G H L K + ++ TR PLII +P ++ QV P Sbjct: 318 KRLNLDKNTIVVLWGDHGWHLGDHGLWCKHSN-FEQATRAPLIISAPNQKKGQVSQSPTE 376 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIE-HDSFGGFIPVRCWVT 386 ID+ P++ L +E PE L GE++ + E V+ + G+ + Sbjct: 377 FIDIFPSLCKLTGLEIPEQLEGEDLSPILEDPKAKVKDYSISQYLRWANHGYT----MRS 432 Query: 387 DDFKLVLNLFT--------------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALL 432 ++L L + ELYD + DPNE N ++ +A+V K+ Sbjct: 433 GKYRLTLWMPKNYYGFMKFDENDIVEVELYDYQKDPNETTNFANNPEYAEVLRKLKKQFA 492 Query: 433 DYMDKIRD 440 Y D Sbjct: 493 SYFASQYD 500 >UniRef50_B5CYA4 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CYA4_9BACE Length = 536 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. 
Identities = 112/519 (21%), Positives = 199/519 (38%), Gaps = 75/519 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPL---NTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 K+ N +++M+D + +G Y + T ID LA +G+ F + + + + TP+RA Sbjct: 30 KQMNVIYIMSDDHTSQAIGAYGSRLAVLNPTPTIDELARDGMLFENCFCTNSISTPSRAC 89 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG Y++++ T + + + F + GY T IGKWHL Sbjct: 90 IMTGQYSHRNKVLTLDEVLQPDQEYLVDEFHNMGYQTAMIGKWHLGCEPSH--------- 140 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 DY+ + + + + + + + N I + + ++N A+D+L+ Sbjct: 141 -FDYYSVFNGHGGQGEYFDPTFLTSDVTD-KKWPNNQIKKMGYSSDIVTNLAIDWLKNRR 198 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE----------- 227 +PF M+ Y PH F Y D + D E Sbjct: 199 DKSKPFFMMHHYKAPHDMFEYAPRYEYYLDDVEVPVPLSLFDTDKWGSEGTRGKNDSLRH 258 Query: 228 ----------HHRLWAQAMPSPVGDD-------GLYHHPLYFACNDFVDDQIGRVINALT 270 R + GD+ ++ Y C VDD + R+ + L Sbjct: 259 FIGTSVSSRHEIRNYVMEYKCNTGDEMENTYLAYQHYLKSYLRCVKGVDDNLKRLFDYLK 318 Query: 271 PEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVS 327 E ENT ++YT D G M+G H L K MY++ R+P I+R P+ + D ++ Sbjct: 319 KEGLWENTIIVYTGDQGMMLGEHDLQDK-RWMYEESQRMPFIVRDPRCPYKGAKSDLMIN 377 Query: 328 HIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS---FGGFIPVRCW 384 +ID PT++ + ++P + G++ +V E + + + Sbjct: 378 NIDFAPTLIEMVGGKEPSYMDGKSFASVFEGKKPENWKDAVYYRYWMHMIHHDVPAHIGI 437 Query: 385 VTDDFKLV----LNLFTSD----------------------ELYDRRNDPNEMHNLIDDI 418 T+++KL+ + ELYD +NDP EM NL D+ Sbjct: 438 RTENYKLILFYGRHYDDKRYGQKSMSWLKNSHKIVPTLVSFELYDVKNDPYEMVNLADNP 497 Query: 419 RFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDAR 457 ++A V M L + ++ D +Y + K R Sbjct: 498 KYAKVLKDMKKKLRELRKQVGDTDEAYPELKKVIDKALR 536 >UniRef50_A4A047 Iduronate-2-sulfatase n=2 Tax=Bacteria RepID=A4A047_9PLAN Length = 481 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. 
Identities = 106/469 (22%), Positives = 183/469 (39%), Gaps = 43/469 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN LF+ D T +GCY +++ NID LAA G F AY VC+P+R L T Sbjct: 18 RQPNVLFIAVDDLRTE-LGCYGASQIHSPNIDRLAAAGTVFTRAYCQQAVCSPSRTSLMT 76 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+ + + + ++ T+G++FK GY++ +GK + G+D T P Sbjct: 77 GLRPDSTKVYDLVTHFRKNVPDVVTLGQHFKQNGYYSVSMGKIYHGGYDDPPTWSEPARK 136 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLN-------------SVEDLQANHIDETFTWAHR 165 GA Y+ + I+ RN + + + Sbjct: 137 PQ----GGAGYVLAENLQTITDKRNAARAKGLRGVQLSRAARGPATEMADVADNAYADGA 192 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 +++ AV L++ ++ DEPF + V + +PH PF P +Y + Y EL Sbjct: 193 VADLAVKSLRELSQRDEPFFLAVGFVKPHLPFNAPKKYWDMYDPAKIELAANPYPPKNVT 252 Query: 226 PEHHRLWAQAM--------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-EN 276 P W + + Y+AC + D +G++++ L + + Sbjct: 253 PYSLTSWGEMRVYDGIPKQGDLSPEKARELKHGYYACISYTDANVGKLLDELDKLKLTDE 312 Query: 277 TWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPT 334 T V+ DHG +G H K ++D PLIIR+P + V +D+ PT Sbjct: 313 TIVVLWGDHGWKLGEHNSWCKHTN-FEDDANAPLIIRAPGQKSPGAKSTALVEFVDIYPT 371 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + LA + P+ L G + + + + + I TD ++ Sbjct: 372 LCELAALPLPQHLEGTSAAPLLDQPDAAWKTAAFSQYPRRQ---IMGYTMKTDRYRFTAW 428 Query: 395 LFTS------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD-YMD 436 ELYD + DP E N+ A++ ++ L + Sbjct: 429 KNKKSGKVVATELYDHQVDPAENVNVAGLTENAELIVQLQKQLDAGWQA 477 >UniRef50_C6VRQ8 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VRQ8_DYAFD Length = 553 Score = 408 bits (1050), Expect = e-112, Method: Composition-based stats. 
Identities = 104/507 (20%), Positives = 177/507 (34%), Gaps = 81/507 (15%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +M D + +G Y + + T +ID LA EGIRF Y + +C P RA L T Sbjct: 32 QRPNIILIMVDDLGYSDIGAYGSE-IKTPHIDQLAGEGIRFREFY-NNSICAPTRASLIT 89 Query: 62 GIYANQSGP---------WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y +++G K T G ++AGY T GKWH+ + Sbjct: 90 GQYPHKAGVGYFNVNLGLPAYQGYLNKESLTFGEVLRNAGYSTLLSGKWHVG----NDST 145 Query: 113 ECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVD 172 P + D ++ N S + + VE+ + ++ I++ A+ Sbjct: 146 AWPNQRGFDRFYGFINGASNYFDIGKYGKGPAVELVENNKRINLPPDKYLTDEITDHALA 205 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 FL + ++ +PF + ++Y+ PH P P + KY Y + + + + + + Sbjct: 206 FLDEQSKTAKPFFLYLAYNAPHWPLQAPEADIAKYKGRYSIGWDSLRAERLQRQKALGIT 265 Query: 233 AQAMPSPVGDD---------------GLYHHPLYFACNDFVDDQIGRVINALTPEQR-EN 276 D +Y A D VD IG++ L + +N Sbjct: 266 DPKQSVAARDKDVTPWENVPYDEKLLWERKMEIYAAMVDRVDQNIGKLREKLKALNKDDN 325 Query: 277 TWVIYTSDHGEMMG---------------------------------AHKLISKGAAMYD 303 T +++ SD+G G + M++ Sbjct: 326 TLIVFISDNGAQGGYAGASPRRPQRNTGPAGTAGSYVYQDQPWAYVSNAPHAAYKNNMHE 385 Query: 304 DITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPE--------ILPGENIL 353 P I P+ + V IDL PT LA P LPG++++ Sbjct: 386 GGISAPFIAWFPRQIKGGQIVKGTGHLIDLAPTFYDLAKAAYPATANGVATNTLPGKSLV 445 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLFTSDELYDRRNDPNEMH 412 V + V+ I F R +K+V ELYD D E Sbjct: 446 PVLTGKSGEVDRGGEPI----FWERAGNRAVRKGKWKIVSTYPAYKWELYDLETDRGETS 501 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIR 439 ++ +V ++ + +K Sbjct: 502 DVAS--ANPNVVDQLAADYFRWAEKTG 526 >UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_BACV8 Length = 464 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. 
Identities = 100/460 (21%), Positives = 171/460 (37%), Gaps = 65/460 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN +++M D +GCY + + T NID +A G++F Y+ S V P+R L T Sbjct: 28 KSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQNGMKFMQHYSGSTVSAPSRCALIT 87 Query: 62 GIYANQSGPWTNN-----------VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 G + + N T+ FK Y T +GKW + G G Sbjct: 88 GKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVADIFKTKNYVTGCVGKWGMGGP---G 144 Query: 111 TGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 T P + DY++ + L N + +D + + +A Sbjct: 145 TEGMPGKHGFDYFYGYLGQRFAHSYYPEFLHENEQKIM-------LDGKYYSHDLMLEKA 197 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 ++F+ + A +PF + S PH E + +Y + E D Sbjct: 198 LNFIDE--NAQKPFFLYFSPTIPHADLDIMGEAMTEYEGEFCETPFGGSRDGY------- 248 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG--- 286 Y A ++D +G +I L + ++T +++TSD+G Sbjct: 249 -----------KSQQNPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHS 297 Query: 287 -------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERR--QVDTPVSHIDLLPTMMA 337 + +Y+ R P +I+ P + + + D LPT+ Sbjct: 298 EGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTIGE 357 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD-FKLVL--- 393 L + P+ + G + L +G E + E FGG + +T D +KLV Sbjct: 358 LVQADIPQNIDGISYLPTLTGKGTQKEHDCIYYEFFEFGGK---QSIMTPDGWKLVRLEV 414 Query: 394 --NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 T +ELY+ DP E N+I ++ DV K+ + + Sbjct: 415 SDPSKTYEELYNIYTDPAETSNVIK--QYPDVAKKLKNMI 452 >UniRef50_D0Z4S7 Iduronate sulfatase n=1 Tax=Photobacterium damselae subsp. damselae CIP 102761 RepID=D0Z4S7_LISDA Length = 539 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 103/479 (21%), Positives = 179/479 (37%), Gaps = 59/479 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 + PN LF+ D + +G + T N+D L F +A+ PVC P+R L Sbjct: 33 QHPNVLFLAVDDL-NDWIGALGAHPQVKTPNLDRLYKRSTAFRNAHCQVPVCGPSRTALL 91 Query: 61 TGIYANQSGPWTNNV----APGK-------NISTMGRYFKDAGYHTCYIGKW-HLDGHDY 108 TG+ +G +TN + + ++FK+ GY+T GK H DY Sbjct: 92 TGMAPTTTGLYTNKELGIKPFDPVAEQVLGSTPVLPQHFKNNGYYTMASGKISHHGTADY 151 Query: 109 FGTGECPPEWDADY-----------WFDGANYL-SELTEKEISLWRNGLNSVEDLQAN-- 154 + E + +Y + + + ++ Sbjct: 152 RHKEQWDEEIPLYVIGPRDEHLKANGYGYGSYGVDDHKYYPFPVGGGQIIQSQEYGPGTR 211 Query: 155 ------------HIDETF-TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPV 201 I ++ V+ LQ+ ++PF + + PH P+T P Sbjct: 212 GFSLCSGALDRHDIPNGGVMPDEYFADWTVERLQR--HYEKPFFLACGFIRPHVPYTAPR 269 Query: 202 EYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDG-------LYHHPLYFAC 254 EY + + + E + ++ + P + A + Y AC Sbjct: 270 EYFDMFPLESIIVPETIEKEMTDIPLMGKALALGIIPGGDAAAVNKLGIRKELVQAYLAC 329 Query: 255 NDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIR 313 F+D Q+G+V++AL NT V++ DHG+ G H + + ++ + TR+PL+IR Sbjct: 330 IAFMDAQVGKVLDALEKSPYANNTIVMFWGDHGQNFGEH-MNYRKQTLWQESTRVPLMIR 388 Query: 314 SPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEH 372 PQ + + D VS +DL PT++ L + K G ++ + + Sbjct: 389 LPQQEKGQVCDEAVSLLDLYPTLIELCHLPKVATNEGISLKPLLNNPRFDRKIPAVTTYG 448 Query: 373 DSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + + + S+ELYDR DPNE HNL D + ++ M L Sbjct: 449 Y------QCHAIRDEQYTYIRYRDGSEELYDRNLDPNEHHNLASDPNYQVIKQAMKQWL 501 >UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TKJ5_SHEHH Length = 492 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 110/469 (23%), Positives = 193/469 (41%), Gaps = 47/469 (10%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + D + Y + T NID LAAEGI+F Y+ +P+C+P+RAG+ TG Sbjct: 29 KPNVVIFYVDDLGYGDLATYGHNIVKTPNIDKLAAEGIKFTQYYSPAPLCSPSRAGMLTG 88 Query: 63 IYANQSGPW-----TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 ++G NV GK T+ KD GY T GK HL+G + + Sbjct: 89 RTPYRTGIRSWIPDGQNVHIGKEEITLAHMLKDEGYDTAITGKLHLNGGAHMKDHPQASD 148 Query: 118 WDADYWFDGANYLSELTEKEIS----LWRNGLNSVEDLQANHIDETF---TWAHRISNRA 170 ++ F ++ + E R+G V++ N + A ++N A Sbjct: 149 LGFEHSFIIPGGWAKNAKTEAKNADGSLRHGKIHVDNFWRNGVPVGETDQFSADLVANEA 208 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 + +L D+PF + V + E H P P +YL+ Y D+ + + N H Sbjct: 209 IGWLDDQG-GDKPFFLYVPFSEVHTPIASPQKYLDMYGDYLTDFAK------ENPDLFHW 261 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE-- 287 W G+ YFA ++D Q+GRVI+ L +NT ++++SD+G Sbjct: 262 DWVNQPYRGQGE--------YFANITYMDAQLGRVIDKLKAMGEYDNTIILFSSDNGPVT 313 Query: 288 ----------MMGA-HKLISKGAAMYDDITRIPLIIRSPQGERRQ--VDTPVSHIDLLPT 334 M G L + +++ R+P+I++ + + D P+ +D++PT Sbjct: 314 REARKPYELNMAGETGGLRGRKDNLFEGGIRVPMIMKYHGHVKAETDSDEPIYGLDIVPT 373 Query: 335 MMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYE-IEHDSFGGFIPVRCWVTDDFKL 391 + L + P + G + ++ V I+ I DFKL Sbjct: 374 LSELIGFDTPSDRTIDGVSFVSTFNGLSVERTKPMIWTIDMPYQDDAINEYAVRIGDFKL 433 Query: 392 VLNLFT-SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +++ + L++ D E++NL++ + ++ A Y I Sbjct: 434 IIDRQGNNKYLFNIGQDKYEVYNLLNKPEYKAKVEELTTAYQAYRKDIE 482 >UniRef50_UPI00016C500A sulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C500A Length = 472 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 107/463 (23%), Positives = 182/463 (39%), Gaps = 43/463 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +MTD Q + + C L T N+D +A EG R+ +A+ + +C P+RA L TG Sbjct: 22 RPNIVVMMTDDQRHDYMSCAGHPFLKTPNMDRIAKEGFRYTNAFVTNALCAPSRATLMTG 81 Query: 63 IYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 Y++ +G N N + + GY + GK H+ GH T D Sbjct: 82 QYSHLNGVRDNMGTTLNPNAPWLPDELRKLGYEVAFCGKSHVPGHFRDKTW--------D 133 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 Y+F + L +G ++++A+ ++++P Sbjct: 134 YYFGFQGQGNYLKPLIAESGPDGKI------GPDKPYDGWIDDVVTDKALAWVKKPRA-- 185 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS-PV 240 +PF + + + PH + + + YA + D KP A + P Sbjct: 186 KPFALFLFFKSPHRAWQPAARHKDLYAGAAVKKPALWDDPGQGKPRAFLQAANMIGQYPD 245 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHKLISKGA 299 D Y C VDD +G+V+N L ++ + T V+YTSD+G +G + K Sbjct: 246 TKDYDGMIRDYARCITGVDDNVGKVLNTLDEQKIADTTAVMYTSDNGFFLGEWQRFDK-R 304 Query: 300 AMYDDITRIPLIIRSPQG-------ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 M++ R+PL+++ P+ Q V + D+ PT++ LA P+ + G ++ Sbjct: 305 FMHEPSVRVPLLLKVPKALAKDCVPPGSQPGAMVINPDIAPTVLELAGGAPPKAMQGRSV 364 Query: 353 LAV--------KEPRGVMVEFNRYEI--EHDSFGGFIPVRCWVTDDFKLVLNLF------ 396 L P E YE D R T +KL+ Sbjct: 365 LPFARLPPAGPLPPEMAPREAWYYEYFEFPDPSHNVEKQRGVRTTKWKLIHYYDPPFKFK 424 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 + ELYD DP E NL + F ++ + + ++ Sbjct: 425 DAYELYDLEKDPEERVNLANRPAFQGTVKELQEKMAALRKELG 467 >UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W8_9PLAN Length = 459 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 95/445 (21%), Positives = 166/445 (37%), Gaps = 51/445 (11%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN +F+M D +GCY K + T +ID AA+G RF AY VCT +RA L T Sbjct: 27 ERPNIIFIMADDLGYGDLGCYGQKLMKTPHIDQFAAQGTRFTQAYAGGSVCTASRAVLLT 86 Query: 62 GIYANQSGPWTN----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G++ + N ++ T+ + +GY +GKW L GT Sbjct: 87 GLHNGHTPARDNIPHYATYLQESDVTIAEVLQKSGYRCGGVGKWSLGDA---GTVGRATN 143 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D WF N + + + + +L+ N + ++ RA+ F++ Sbjct: 144 QGFDMWFGYLNQ--DHAHYYFTEYLDDNEGRLELKGNTKNRQQYSHDLLTERALQFIRDS 201 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 A +PF + +Y PH F+ E + H L Sbjct: 202 AA--QPFFLYAAYTLPH--FSAKAE------------------------DPHGLAVPDTE 233 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG------ 290 D Y A +D +GR+++ + Q E T +I+TSD+G G Sbjct: 234 PYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGGHRGVPAQLH 293 Query: 291 -AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEIL 347 L + + R+P I P + D ++ D+LPT LA + L Sbjct: 294 TNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELAGAQVSANL 353 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FTSDELYDRRN 406 G ++L V+ ++ + +++K + + LY+ Sbjct: 354 DGISVLPALRGEPRKVKHEYLYWDY-GHCRARYDQAVRWNNWKGIRHGQQGEIALYNLDQ 412 Query: 407 DPNEMHNLIDDIRFADVRSKMHDAL 431 D +E ++ D + V ++ + + Sbjct: 413 DLSESRDVAD--KHPQVVQRIAEIM 435 >UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ15_9BACT Length = 469 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 85/462 (18%), Positives = 172/462 (37%), Gaps = 43/462 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN ++++ D + Y K +T NID + EG+ F Y+ S VC P+RA L T Sbjct: 19 EKPNIIYLLVDDLGYGDLSLYGQKKFSTPNIDRIGKEGMVFTDHYSGSTVCAPSRAALMT 78 Query: 62 GIYANQSGPWTNNV----------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G ++ N ++ K AGY T IGKW + GT Sbjct: 79 GKHSGHGLVRGNYEVGPHGFGGELPLRPEDVSLAEVMKSAGYATGLIGKWGMGMD---GT 135 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P + DY + N ++ NG + + + + + + Sbjct: 136 TGEPRKKGFDYSYGFLNQAHAHHYYPEYIYENGEKLMIPENKDDA-RGLYISDTFAEKGI 194 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRL 231 +F+++ D+PF + ++ PH P + L ++ + E + Sbjct: 195 EFVEE--NKDKPFFLFWAFVTPHAELLVPDDSLNEFKGKWPETPFVMGKQGGD------- 245 Query: 232 WAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEM-- 288 P V + + +D ++G + + L +NT ++++SD+G Sbjct: 246 -GTDNPFGVYASQDHPRAAFSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNGPHKE 304 Query: 289 MGAHK--------LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMAL 338 GA L + + R+P ++R P R + + D++PT+ + Sbjct: 305 GGADPDFFDSNAELTGYKRDLTEGGIRVPFMVRWPNVVKARSKSSHASAFWDVMPTIAEI 364 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-FT 397 A+ + PE + G + L + V + Y H+ + ++K + + + Sbjct: 365 ANTDSPEDIDGLSFLPALKGEKQQVHKHLYWEFHE---RGYTEQALRMGNWKAIRHGVNS 421 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 +LYD +D +E +++ ++ + + L Sbjct: 422 PIKLYDLISDESEQNDVSA--KYPATAKHITNILDTERTDSE 461 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 93/456 (20%), Positives = 158/456 (34%), Gaps = 68/456 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN + D G + T +IDS+A G+R + + C+P+RAGL T Sbjct: 39 RKPNVIIFYADDLGWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLT 98 Query: 62 GIYANQSGPWTNN----VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G Y + G N +T+ GY T +GKWHL G P + Sbjct: 99 GRYPTRFGHEFNRIANVSGLDLQETTLADRLHGLGYKTACVGKWHLGD----GPEYRPTK 154 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D +F + R V + A DE F + R+V+++ Q Sbjct: 155 RGFDEFFGTLANTPFFHPTKFVDSR-----VSNDVAEVSDENFYTTDEYAKRSVEWIGQQ 209 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 ++ P+ + + ++ H P P +YL+++ Sbjct: 210 QQS--PWFLYLPFNAQHAPLQAPQKYLDRFESIADP------------------------ 243 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGE-----MMGA 291 L+ A +DD IG+V+ + ENT V + SD+G Sbjct: 244 ---------KRKLFAAMMSAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQN 294 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALAD--IEKPEIL 347 L ++ TR+P +++ + D PV ++D+LPT++ A I+ L Sbjct: 295 GPLRGFKMTTFEGGTRVPFLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAGSKIDPAWKL 354 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS--DELYDRR 405 G +++ D+KLV+ S ELYD Sbjct: 355 DGVDLVPYFTSSIANKPHETLYWRFGEQW------AVRQGDWKLVVARGGSGQPELYDLA 408 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +D E NL ++ + + P Sbjct: 409 SDIAESKNLAS--ENPAKVKELQALWDQWSHEQAAP 442 >UniRef50_A6DFU7 Mucin-desulfating sulfatase (N-acetylglucosamine-6-sulfatase) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFU7_9BACT Length = 519 Score = 407 bits (1048), Expect = e-112, Method: Composition-based stats. 
Identities = 106/489 (21%), Positives = 183/489 (37%), Gaps = 87/489 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN LF+ +D +TN +G Y K T NID +A EG F ++ + +C P+RA + + Sbjct: 21 ERPNILFIFSDDHSTNAIGAYGSKINTTPNIDRIADEGAVFEKSFCTNSICQPSRASILS 80 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G++++ +G N N + R K AGY T IGKWH+ + + D Sbjct: 81 GVHSHINGVTYNGAHWNGNQTVFPRELKKAGYQTALIGKWHMHPNPTN---------EFD 131 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 YW + + H +++ ++ +L Q + Sbjct: 132 YWKVLVGSGGQGDFYNPDFM--------SIDKGHEQIMGYSTDVVTDESIKWLDQR-DQN 182 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK---------------- 225 +PFLM+V + PH P Y+ Y D+ N+ Sbjct: 183 KPFLMMVQFKSPHVPRIPHPRYMNMY-TEDVAEPATLYDNYQNRLKGASTAWMEINGQNE 241 Query: 226 ---------------------------PEHHRLWAQAMPSPVGDDGLYHHPL-------- 250 PE + AM + + Sbjct: 242 EVLAYFPPKNATEPVNKKQKKHLDRMTPEQRKALLDAMDNQNSEYYELKKAGAFKDPVKA 301 Query: 251 -----------YFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKG 298 Y C +DD +GR++ L + + NT VIY+SD +G H ++ Sbjct: 302 RKLKYQFFIKNYLRCVQAIDDNVGRLLKWLEDNELDENTIVIYSSDQSYFIGEHG-WAEK 360 Query: 299 AAMYDDITRIPLIIRSPQGERRQVD--TPVSHIDLLPTMMALADIEKPEILPGENILAVK 356 MY++ ++P +IR P + + +ID PT + A + P G+++L + Sbjct: 361 RWMYEEALKMPFVIRWPGKIKPGSKPQAMIQNIDYAPTFLDAAGAKIPTRFQGKSLLPIF 420 Query: 357 EPRGVMVEFN-RYEIEHDSFGGFIPVRCWVTDDFKLV-LNLFTSDELYDRRNDPNEMHNL 414 V Y H T+ +KL+ ELYD +NDPNE++++ Sbjct: 421 SEEKSEVRDAIYYHYYHHGAHNVPRHEGIRTERYKLINFYTNNEFELYDLKNDPNEVNSV 480 Query: 415 IDDIRFADV 423 ++ +A++ Sbjct: 481 ANNPEYAEI 489 >UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7_9SPHI Length = 589 Score = 407 bits (1047), Expect = e-112, Method: Composition-based stats. 
Identities = 101/458 (22%), Positives = 185/458 (40%), Gaps = 44/458 (9%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K PN + ++TD Q G K ++T ID LA F + Y SPVC P RA L T Sbjct: 30 KPPNIILIITDDQGYGDFGFTGNKHVSTPTIDQLAENSFEFTNFYV-SPVCAPTRASLMT 88 Query: 62 GIYANQSGP---WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y+ ++G + + T+ + + Y + GKWHL P + Sbjct: 89 GRYSLRTGIRDTYNGGAMMSPDEITIAELLQKSDYTSGIFGKWHLGD----NYPMRPSDQ 144 Query: 119 DAD----YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 D + G + + T R+ + V + ++ A++F+ Sbjct: 145 GFDESLIHLSGGMGQVGDFTTYFQKD-RSYFDPVLWHNNRQESYQGYCSDIFASAAIEFI 203 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 ++ D+PF +S++ PH P P EY +KY + + +P + Sbjct: 204 EK--NKDQPFFTYLSFNAPHTPLQVPEEYYQKYKNID----TSTGYESDERPFY------ 251 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAH- 292 P+ D +A + +DD + + L + E+ T +I+ +D+G + Sbjct: 252 ----PMSDSQKEEARKVYAMVENIDDNLKNLFAKLKELEIEDETIIIFLTDNGPQQQRYL 307 Query: 293 -KLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKP--EIL 347 L +Y R PL+I P+ E R+++T +HID+LPT+ L I+ P + Sbjct: 308 AGLRGLKGNVYQGGIRTPLLIHIPEKLSENRKINTLSAHIDILPTIADLVGIQLPLDRKI 367 Query: 348 PGENILAVKEPR-GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV----LNLFTSD-EL 401 G+++L + + + + F ++KLV + D +L Sbjct: 368 DGKSLLPLLIGEVDSFENRSLFSYWNRKFPEKYSNISIQNSEWKLVGKTDYDASIEDFQL 427 Query: 402 YDRRNDPNEMHNLIDDIRFA--DVRSKMHDALLDYMDK 437 Y+ + DP E NLI ++++++ L+ + + Sbjct: 428 YNLKEDPYEQSNLITSKISKGLELKNELDQLYLELISE 465 >UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKY2_PLALI Length = 483 Score = 407 bits (1047), Expect = e-112, Method: Composition-based stats. 
Identities = 97/493 (19%), Positives = 162/493 (32%), Gaps = 86/493 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L ++ D VG + K + T N+D+LA G++F S Y P C+P RAGL TG Sbjct: 32 RPNILLIVGDDMGYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTG 91 Query: 63 IYANQSGPWTN----NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 Y + G N N T+ K GY T +GKWHL P E Sbjct: 92 RYQQRFGHEFNPSGANTGLPLTEVTIADRLKQVGYTTGLVGKWHLGSQPAMH----PQER 147 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 + + + + L + + AV F+++ Sbjct: 148 GFEEFIGFLGGAHSFFDAQGILRGH----------EPVKTIDYTTDLFGREAVSFIEKHR 197 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 D+P+ + +S++ H P + + Sbjct: 198 --DKPWFLYLSFNAVHTPMHATED---------------------------------RMA 222 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMG------- 290 + Y A +D+ IG+V+ L ++ T V++ SD+G Sbjct: 223 KLASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTING 282 Query: 291 --AHKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEIL 347 L + R+P ++ P D+PV +DL T +A+A +EK Sbjct: 283 SINTPLRGSKRTTLEGGIRVPFVVSWPGKIAPAVFDSPVIQLDLTATALAVAGVEKDVKS 342 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSD-------- 399 G N+L + + V F F D+KLV +D Sbjct: 343 DGVNLLPYLQGKQSEVPHAAL------FWRFGEQMAVRAGDYKLVRYDSNADTLTGKGKQ 396 Query: 400 -----ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRK 454 LYD + D E +L + + +++ + + P + Sbjct: 397 PVTAARLYDLKEDLGETRDLAASM--PEKVAELQAQWDRWNQQNMPPLWGG-GNKVASDG 453 Query: 455 DARPRWMGAFRPR 467 + R G + Sbjct: 454 EPRSGQSGKASQK 466 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 407 bits (1047), Expect = e-112, Method: Composition-based stats. 
Identities = 101/484 (20%), Positives = 164/484 (33%), Gaps = 81/484 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D +G G + T ++D LAA GIR +AY +P C+P+RAG TG Sbjct: 37 RPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFLTG 96 Query: 63 IYANQSGPWTNNV-------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 Y + G N T+ + GY T IGKWH F P Sbjct: 97 KYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQG----FSKDHHP 152 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDL---QANHIDETFTWAHRISNRAVD 172 D +F + R G D+ + +N A+ Sbjct: 153 QSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYRGREVEPQEGYATDLFTNEALR 212 Query: 173 FLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 ++ P +P+ + +SY+ H P +L+K +L Sbjct: 213 WMSGPPN--KPWFLYLSYNAVHTPLEIAP-HLQKRIPESVKLP----------------- 252 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG----- 286 Y + +DD IGR+ L+ E T +I+ SD+G Sbjct: 253 --------------ARRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRA 298 Query: 287 ----EMMG-AHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALA 339 G H L + R+P + P + + P+ +DLLPT+ LA Sbjct: 299 PILAYNSGLNHPLRGDKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLA 358 Query: 340 D------IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL 393 P+ + G N++ + F F P + ++KLV Sbjct: 359 ANNPAKPQPLPQGIDGVNLMPYWLGQRSGAPHE------SLFWRFGPQKAVRAGNWKLVD 412 Query: 394 NLF------TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQW 447 + ELYD D +E +NL + ++ +++ + + +P Sbjct: 413 WRDFPASKNSGWELYDLSTDISEKNNLAE--THPEIVARLKTSWEKWNQSNIEPLWRGSK 470 Query: 448 SLRP 451 Sbjct: 471 MEDA 474 >UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAY0_9PLAN Length = 466 Score = 407 bits (1047), Expect = e-112, Method: Composition-based stats. 
Identities = 106/475 (22%), Positives = 172/475 (36%), Gaps = 69/475 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN L + D +GCY + T +D LA+EG+R YT SP CT +RA L T Sbjct: 33 KRPNILLITADNLGYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLT 92 Query: 62 GIYANQSGP-------WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G Y + G K+ + Y K GY T GKW++ F G Sbjct: 93 GRYPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVG----FSPGSR 148 Query: 115 PPEWDADYWFDGA----NYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRA 170 P E D +F A +Y LWR + ++ A Sbjct: 149 PTERGFDEFFGFAAGNIDYYHHYYAGRHDLWR---------GLKEVFVEGYSTDLFADAA 199 Query: 171 VDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHR 230 ++ A +D+PF + + ++ PH P + ++ + A + P+ Sbjct: 200 CQYI--SAESDQPFFIYLPFNAPHFP---SQRNKQPGQGNEWQAPDLAFEKYGYDPQTKN 254 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM 289 + Y A +D IGRV+ L + T VI+ SD+G M Sbjct: 255 PQER----------------YRAVVTALDSAIGRVLKQLDTSGLRDQTIVIWYSDNGAFM 298 Query: 290 ---------GAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMAL 338 L G +++ R+P IIR P +P+ +D+LPT++ L Sbjct: 299 LKERGLEVASNKPLRDGGVTLWEGGIRVPAIIRYPGHLKAGTVNQSPLISLDILPTLITL 358 Query: 339 ADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NL 395 A P L G+++L + + + +KLV Sbjct: 359 AGGPLPAERILDGQDMLPALAAQTAPEPRTFFFQYRN-------FSAVRRGKYKLVRIKP 411 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLR 450 L+D D +E +L + R V +++ A D+ ++ + + S Sbjct: 412 NQPFMLFDLEQDLSETTDLAE--RNPKVLNQLQQAYADWEREVAENEERRRKSDN 464 >UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD88_SHEWM Length = 500 Score = 405 bits (1043), Expect = e-111, Method: Composition-based stats. 
Identities = 103/498 (20%), Positives = 169/498 (33%), Gaps = 80/498 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN ++ + D +G Y + + T NID LAAEG+RF+ Y S VC P+RA L T Sbjct: 32 RQPNVIYFLADDLGVGDLGSYGQQHIRTPNIDKLAAEGMRFSRHYAGSSVCAPSRASLMT 91 Query: 62 GIYANQSGPWTN--------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD 107 G + N + T+ F+ AGY T GKW L Sbjct: 92 GRDMGHTDIRGNIQLMDQPDSPEYQGQYPLAQGTITLAHLFQLAGYQTGAFGKWGLGSLQ 151 Query: 108 YFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE--------- 158 G P D ++ + LW + D A ++ Sbjct: 152 SSG---NPKAMGFDQFYGYLDQRHAHNYFPQYLWDGDEVARLDNPAINVHPKLDRDKSDH 208 Query: 159 -----TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYE 213 ++I RA +F+ Q DE F + V + PH P + L+ Y Sbjct: 209 REYMGKDYAPYKILARAKEFISQ--NRDEAFFLYVPFVVPHAAIQIPDKELDGYQFDETA 266 Query: 214 LGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 P+ A +D +G ++ L Sbjct: 267 HRLGEPRAYTPHPKP-------------------RAARAAMISRMDRDVGDIMAMLKELG 307 Query: 274 R-ENTWVIYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RR 320 +NT V+++SD+G + A +Y+ R PLI R P Sbjct: 308 LDDNTLVLFSSDNGATAAGGSDINFFNSTAGARGEKATLYEGGIRAPLIARWPGNISAGS 367 Query: 321 QVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP 380 + D + D+LPT L D+ PE + G ++L + + E F P Sbjct: 368 ESDHLSAFWDMLPTFAQLLDLSVPEGIQGISMLPTLLGKPQNQQHESLYWE---FFSRNP 424 Query: 381 VRCWVTDDFKLVLNLFTSD-----E-----LYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 + V ++K + + E LY+ + DP+E NL + ++ K Sbjct: 425 SQAVVMGNWKAIRHYSKERGKGALELGATALYNLQEDPSESQNLAA--KHPELVKKAEMI 482 Query: 431 LLDYMDKIRDPFRSYQWS 448 + P+ ++ Sbjct: 483 MAQRQRSPHLPWNFQSYN 500 >UniRef50_A0JVP0 Sulfatase n=1 Tax=Arthrobacter sp. FB24 RepID=A0JVP0_ARTS2 Length = 508 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. 
Identities = 118/477 (24%), Positives = 196/477 (41%), Gaps = 35/477 (7%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 R N LF+MTD Q + +GCY + +T +D LAA G ++ AYT + +CTPARA L TG Sbjct: 8 RTNILFLMTDQQRIDTMGCYGNRSRHTPYLDGLAARGTVYDRAYTPTAICTPARASLLTG 67 Query: 63 IYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH--DYFGTGE 113 ++ + G +N T + GY ++GKWH+ F E Sbjct: 68 LHPFEHGLLSNFEWNSGHRDELPDGTPTFADELRKQGYRLGHVGKWHVGRERGPDFYGFE 127 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWR----------NGLNSVEDLQANHIDETFTWA 163 A FD Y S L EK +R +G T+ Sbjct: 128 GEHLPGALNTFDNPAYTSWLAEKGFPSFRIVDPVYTVQKDGSQGHLIAGITDQPTEATFE 187 Query: 164 HRISNRAVDFLQQPARAD------------EPFLMVVSYDEPHHPFTCPVEYLEKYADFY 211 ++++ + L++ A+ PF + PH P+ P ++ + Sbjct: 188 AWLADQTIAKLREFAQTHPAGGAPGTETAVAPFYLSCHIFGPHLPYLIPRQWYDLVDPAT 247 Query: 212 YELGEKAQDDLANKPEHHRLWAQAM--PSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 +L + + KP + +A+ S ++ +Y+ +D +IGR++ + Sbjct: 248 VQLPKSFAETFNGKPLVQQTYAEYWSTDSFTVEEWKKLTAVYWGYVSMIDHEIGRILQTV 307 Query: 270 TPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSH 328 ++T +++T+DHGE GAH+L KG AMY+DI RIP I+ +P E R+ VS Sbjct: 308 EELGLNDSTVIMFTADHGEFTGAHRLNDKGPAMYEDIYRIPAIVAAPGQEPRRESKFVSL 367 Query: 329 IDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 D T + +AD + G +++ E R + Sbjct: 368 QDFTATFIDIADG-YAGNIRGSSLMPSTTAPLPADWRTEMVCEFHGHHFPYAQRMIRNER 426 Query: 389 FKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 +K + N DE YD +DP+E+HN++ +A M +L + D F + Sbjct: 427 YKYIANPEGIDEFYDLVSDPDELHNVVTVPAYATQLKTMRLSLYKELVSRGDKFYQW 483 >UniRef50_UPI0001C35789 arylsulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C35789 Length = 520 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. 
Identities = 108/472 (22%), Positives = 189/472 (40%), Gaps = 34/472 (7%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +MTD + +G + T +DS+AA+GI F+ AY+ P C PARA L TG+ Sbjct: 35 PNIVLIMTDQMRGDCLGIAGHPDVKTPYLDSIAAKGILFDHAYSACPSCVPARAALHTGM 94 Query: 64 YANQSGPWTNNVAPGKNIS-TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD- 121 G N TM AGY+T +GK H+ D Sbjct: 95 RQEHHGRVGYQDMVNWNYPHTMAGELAAAGYYTQCVGKMHVHPLRNLMGFHNIELHDGYL 154 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID----------------ETFTWAHR 165 + + E ++K+ + L A+ D E + + Sbjct: 155 HAYRDPAAAWEESQKQADDYFYWLKQELGADADVTDTGMECNSWVSRPWIYEEKYHPTNW 214 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 +S R++DFL++ +PF ++ SY PH PF P Y + Y D + + Sbjct: 215 VSTRSIDFLRRR-DTSKPFFLMASYLRPHPPFDAPQYYFDLYRDKQLTPPAVGDWEDEDF 273 Query: 226 PEHHRLWAQAMPSPV----GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVI 280 ++ + S + Y+AC +D QIGR+I AL + +NT ++ Sbjct: 274 TGDYQRLGRIYDSATGPVDPELIRQAQIGYYACITHLDHQIGRLIQALVEYKLMDNTIIL 333 Query: 281 YTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSP-----QGERRQVDTPVSHIDLLPTM 335 +TSDHGE + H L K Y+ RIP+++ P + D++PT+ Sbjct: 334 FTSDHGEELCDHHLFRKSR-PYEGSCRIPMLLSGPERLIHAAPGTVCHSVAELRDVMPTL 392 Query: 336 MALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL 395 + A PE + G++++ + ++ + G VT+ K V Sbjct: 393 LDAAGAPIPETVDGKSMIPDPDGTLPVIRQWL---HGEHEAGVNSNHFIVTEHDKYVWYS 449 Query: 396 F-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 ++ ++ D E+HN I D ++ + + L++ + + + + Q Sbjct: 450 QTGREQYFNLDEDRRELHNGIADTQYQERIGLLRGLLIEELKEREEGYSDGQ 501 >UniRef50_D2R575 Sulfatase n=4 Tax=Bacteria RepID=D2R575_9PLAN Length = 511 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. 
Identities = 107/492 (21%), Positives = 180/492 (36%), Gaps = 49/492 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L ++ D +GCY T ++D LA G+RF AY VC P+R L G Sbjct: 28 RPNVLMILVDDLKP-ALGCYGDPVAQTPSLDKLAERGMRFERAYCNQAVCAPSRFTLMLG 86 Query: 63 IYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGK-WHLDGHDYFGTGECPPEW 118 ++ +G + + T+ +YF GY T +GK +H+ + Sbjct: 87 SHSTSTGLYGLGSQLRQFIPDAVTLPQYFAKHGYRTESLGKVFHIGHGNQGDPDSFSVPH 146 Query: 119 DADYWFDGAN----YLSELTEKEISLWRNGLNSVEDLQAN------HIDETFTWAHRISN 168 D + + +LT +E L + L +D+ R+++ Sbjct: 147 FHDKVIEYLDPASTDGGKLTREEAFFTNQRLGEIGSLPRGAAFEAPDVDDLQYADGRVAS 206 Query: 169 RAVDFLQQPAR-ADE---PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + L+ + D+ PF + + + PH PF P +Y + Y + E Q + Sbjct: 207 ETIKRLRAAKQLRDQEGTPFFIAIGFARPHLPFCAPKKYWDLYDRAKLPMPEFEQLPMNA 266 Query: 225 KPEHHRLWAQ----------AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR 274 P + + + Y+A FVD QIG+V+ L Sbjct: 267 PPVAGKRGGEISNYKPVPENGKAEFSDELKRNLIHGYYASMSFVDAQIGKVLEELNASGL 326 Query: 275 -ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDL 331 NT V+ DHG +G + +K Y+ RIP++I +P +D+ Sbjct: 327 AGNTIVVLWGDHGFHLGDLGIWTKHTN-YEQANRIPIVIVAPGVTQPGTATKQLAESVDI 385 Query: 332 LPTMMALADIEK---PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDD 388 PT+ LA + P+ + G +++ V + V + + ++ R TD Sbjct: 386 FPTLAELAGLPAPSGPQPIDGVSLVPVLKDSSARVRDHA----YHAYPKAKLGRAIRTDR 441 Query: 389 FKLVLNLFTS-------DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ++LV ELYD DP E NL +V + L Y + Sbjct: 442 YRLVEWRAIGAAPESAVYELYDYDADPLERENLAAKQ--PEVVESLKMTLAKYPQPVLGT 499 Query: 442 FRSYQWSLRPWR 453 R P + Sbjct: 500 PRGAAKDRSPEK 511 >UniRef50_A6DNH1 Choline sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNH1_9BACT Length = 470 Score = 404 bits (1040), Expect = e-111, Method: Composition-based stats. 
Identities = 95/449 (21%), Positives = 177/449 (39%), Gaps = 35/449 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 ++PN L + D + +G T N+D LA G+ F + SPVC P+R + Sbjct: 23 EKPNVLLIAVDDL-NDWIGVLGGHPQAKTPNMDRLANRGVLFTNTQCQSPVCNPSRGSMM 81 Query: 61 TGIYANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 T +Y + +G + N + G K M + F+ GYH GK + + E Sbjct: 82 TSLYPSTTGIYFLNPSVGTSPKAKGHLVMPKRFEAEGYHVSAAGKLFHNQENKKYFKEYG 141 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + F + LW D + +I+ + L Sbjct: 142 GSFG---GFGPIPKKKITSFPGHPLW--------DWGVYPERDEQMPDVKIAAWGKERL- 189 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 D+PF M + + PH P P ++ + Y ++ + ++D+ P++ + Sbjct: 190 -ARDYDQPFFMGIGFYRPHVPQFAPQKWFDMYPLESVQMPKMRKNDIEGIPQYGVDLTRE 248 Query: 236 MPSPVGDDG-------LYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGE 287 + Y AC FVD Q+G++++AL ++NT+V+ SDHG Sbjct: 249 KHVAPTYEWVIENKEEKKLVQSYLACVSFVDAQVGKILDALDASPHKDNTYVVLYSDHGF 308 Query: 288 MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 +G + ++++D R+P++I P + P +D+ PT++ L ++ L Sbjct: 309 HLGEKE-RYAKRSLWEDGARVPMMISGPGIKPGVTHKPTQLLDIYPTLLELTGLKSDPKL 367 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRND 407 G +++ + SFG V++ ++ + S+E YDR D Sbjct: 368 EGNSLVPLLRNPQSDWPH----YARTSFG--PGNYAIVSERYRYIHYNDGSEEFYDRSKD 421 Query: 408 PNEMHNLIDDIRFADVRSKMHDALLDYMD 436 +E HN I + +A + +K + Sbjct: 422 THEWHNQIKNPEYASIIAKHRKQVPQERA 450 >UniRef50_C1ZIS7 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZIS7_PLALI Length = 631 Score = 404 bits (1040), Expect = e-111, Method: Composition-based stats. 
Identities = 110/546 (20%), Positives = 184/546 (33%), Gaps = 112/546 (20%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D +GCY T ++D LA +GIR AY PVC+P RA L TG Sbjct: 36 RPNIVVILADDLGWADLGCYGNPFHKTPHLDQLARDGIRCTQAYAACPVCSPTRAALLTG 95 Query: 63 IYANQSGPWT--------NNVAP---------GKNISTMGRYFKDAGYHTCYIGKWHLDG 105 + N+ A + I T+ K GY TC IGKWHL G Sbjct: 96 QNPARLHLTDWLPGRGNRNDQALRVPEIRNSLPQGIMTLPGVLKSNGYQTCSIGKWHLGG 155 Query: 106 HDYFGTGECPPEW-DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 E D A + + + + I + Sbjct: 156 GASGPLQHGFHEQIAGDERGSPARWFAPFGPQAATNGEKDRQGKPIPGLEDIPDGKYLTD 215 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++++AV F+++ A++PF + + + H P P E ++K+ D Sbjct: 216 ALADKAVAFIEKQT-AEKPFFLYLPHFAVHTPMNAPEETIQKFRDNKPP----------- 263 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTS 283 G+ + +Y A +D +G+V+N+LT + +NT V++TS Sbjct: 264 -------------------GVVRNEIYAAMLYHLDAAVGKVMNSLTEKGFAKNTIVVFTS 304 Query: 284 DHGEMMG----------AHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLP 333 D+G + L +Y+ R+PLI+ P+ S D+ Sbjct: 305 DNGGLATIEGKNTPATINAPLREGKGWLYEGGIRVPLIVSFPKHIPDG-----STTDVPM 359 Query: 334 T-------MMALADI----EKPEILPGENILAVKEP-------RGVMVEFNRYEIEHDSF 375 T +++LA I + L G NI + + + H + Sbjct: 360 TTLDLLPSLLSLAGIQYQVDANSPLDGMNISDIWTGNATPELKKAAFERPLYWHYPHYAN 419 Query: 376 GGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 G P +K + N EL+ DP E N + ++ L + Sbjct: 420 QGGFPGGVIRQGPWKYIENYQTGRKELFLVDKDPGEGRNRA--PDEPEKITQFAAQLAAW 477 Query: 435 MDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYDTG---LPTQGVKVE 491 I P Y P TG +P + +V Sbjct: 478 KQSISA-----------------------QETVPNPDYIPNPPHAKTGVISIPAKSAQVY 514 Query: 492 EKKQKF 497 ++ ++ Sbjct: 515 GRQLRY 520 >UniRef50_A6DIH4 Iduronate-2-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIH4_9BACT Length = 621 Score = 404 bits (1040), Expect = e-111, Method: Composition-based stats. 
Identities = 105/464 (22%), Positives = 188/464 (40%), Gaps = 36/464 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+ N L +++D + + Y T N+D A +FN AY PVC P+RA + Sbjct: 154 KKLNVLMIVSDDL-NHYIKSYGDPQAITPNLDKFMAMSTQFNKAYCQYPVCGPSRASFLS 212 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G+Y S TN + M +F++ GY T GK +G E Sbjct: 213 GLYPESSLVITNTQYLRDVNPSADNMLEHFRNNGYWTGAAGKIFH---STYGMMEKGTSL 269 Query: 119 DADYWFDGAN----------YLSELTEKEISLWRNGLNSVEDLQ-----ANHIDETFTWA 163 D F A ++ E + + N + + + Sbjct: 270 DEYEKFSNAENPQLLLLKKRWIKEGKPGDFKAYFNKNKVKDQADLVLGYGTELRDNQHGD 329 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R + R +++ + ++PF M +PH PF P +YL+ Y + ++D Sbjct: 330 GRNARRVAQWIKNNSAGEKPFFMACGIVKPHTPFYAPKKYLDLYPKDKLIFDDVPENDWD 389 Query: 224 NKP-----EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENT 277 NKP + ++ + + ++ Y+ Y C F+D Q+ +++AL +NT Sbjct: 390 NKPKVAGVKRYQAFRGELGVNDRENRKYYLQSYLGCISFMDAQVKVLMDALKESGQMDNT 449 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTM 335 +++ SDHG +G H + ++++ R+P I P G +Q D+ ID+ PT+ Sbjct: 450 VIVFMSDHGFQIGEH-FMYGKVTLFEECARVPFGIIYPGNPGAGKQSDSLAELIDVYPTL 508 Query: 336 MALADIEKPEI-LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN 394 + L + +P L G++++ V + + V Y + G + R + Sbjct: 509 LDLCKLPQPSHKLQGKSLVPVTKDTSLQVRNEAYTVVTR---GKLMGRAIRKGSWVYAHW 565 Query: 395 LFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 D ELY+ DP + +NL+ D +A V +M AL + Sbjct: 566 GSDRDVELYNMDKDPKQYNNLVKDPEYAKVLKQMDKALKQKASE 609 >UniRef50_B7ACM6 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7ACM6_9BACE Length = 534 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. 
Identities = 110/448 (24%), Positives = 197/448 (43%), Gaps = 22/448 (4%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +++D N++GC ++T N+D+LA GI F S Y SP+ P+R L TG Sbjct: 62 RPNIVLIISDEHNGNIMGCMGDPYIHTPNLDALAENGILFKSHYCASPISGPSRQSLTTG 121 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 Y + W N V +I+++ R + GY T G +G + +G Sbjct: 122 KYVSHHNVWGNTVGCPNDITSLPRIMQQQGYETVLTGGMKYNGLN-YGWNSYKANDGYKI 180 Query: 123 WFDGANYLSELTEKEISLWRNG--LNSVEDLQANHIDETFTWAHRISN-----RAVDFLQ 175 +D ++ + G +N+ ED+ T + ++ A+ +++ Sbjct: 181 AYDKKKKAGTDIAQKRERIKAGVFVNNKEDIGKEFTPMGATDMNLFTDIQRSQDAIAYIK 240 Query: 176 QPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA-- 233 A +PF ++V PH+P E ++KY D + + + + N P +++ Sbjct: 241 NRAHVKQPFFLLVGLMAPHYPLQATQELVDKYKDK-IPMPKIPKGYIENLPLNYKHLRNT 299 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGEMMGAH 292 + + + + Y+A ++ D QIG++I + +NT +IYTSDHGE +G H Sbjct: 300 RKLENVPKEIVKKARECYYARVEWADSQIGKIIKTINESPMADNTIIIYTSDHGENLGEH 359 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 L K +YD ++PLII +P+ ++ +DL+ T+ L + P GE Sbjct: 360 GLWWK-NCLYDCSAKVPLIISNPKRWKGKQTRSKNTESVDLVQTIADLGGTKVPNDWDGE 418 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL------NLFTSDELYDR 404 ++L + E + E+ + + + +K V N ELYD Sbjct: 419 SMLPLLEDSTYNWKDFAI-CEYYAGYIASGITMYRQGKWKYVYHARMDENHGPEIELYDM 477 Query: 405 RNDPNEMHNLIDDIRFADVRSKMHDALL 432 NDP E+ NL D ++ + +H L+ Sbjct: 478 DNDPEELTNLARDNQYKMLIQDLHQELI 505 >UniRef50_C7MHR6 Arylsulfatase A family protein n=3 Tax=Bacteria RepID=C7MHR6_BRAFD Length = 480 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. 
Identities = 115/483 (23%), Positives = 195/483 (40%), Gaps = 42/483 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +MTD Q + + ++T N+D L EG F + Y SP C P+RA LFT Sbjct: 6 ERPNIVLIMTDQQRFDSIAALGHDHVDTPNLDRLVREGAAFTNTYVPSPSCAPSRASLFT 65 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP----- 116 G+Y + SG N+ + + AGY +GK H ++ Sbjct: 66 GLYPHSSGVLRNDDPWSH---SWVEHLSAAGYRCTSVGKMHTYPYEAPVGFHERHVIENK 122 Query: 117 ------------EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAH 164 +WD W G S +T +E + L + E + E + Sbjct: 123 DRAHPDLPYFLDQWDKAIWIRGHQKPSRVTYRERDDYAERLGAFE----WELPEDLHADN 178 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 + N A +L+ D+PF + + + PH P+ P +LE Y D ++ Q DL + Sbjct: 179 FVGNLARHWLETYPEHDDPFFLQIGFPGPHPPYDPPARHLEPYRDRPMPEAKRTQADLDS 238 Query: 225 KPEHHRLWAQ-----------AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 +P + + +P + YFA +D+Q+G +++AL Sbjct: 239 QPAPLKELRTHHQANDHDAIVQLENPTAEQLDRQRRHYFANVSLIDEQVGGILDALEERG 298 Query: 274 -RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHID 330 +NT V++TSDHG+ + H K MY+ +P II P ++ D VS +D Sbjct: 299 VLDNTVVVFTSDHGDALNDHGHSQKW-TMYEPSVHVPGIIWGPGRVEPDQRFDGLVSLMD 357 Query: 331 LLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSF--GGFIPVRCWVTDD 388 + PT++ LA + PE + ++L + + + G + Sbjct: 358 IAPTVLELAGLTPPEWMEARSLLPALQGQEWEGRQYVFSEHARDAILTGTALMTMARDAR 417 Query: 389 FKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQW 447 +KLV + D +L+D DP E NL A+ R ++ A+ + ++ Sbjct: 418 YKLVEFIDHEDGQLFDLAKDPYEETNLWFCEEHAETRRRLERAISTWRASSSMQTATWAK 477 Query: 448 SLR 450 R Sbjct: 478 DKR 480 >UniRef50_UPI00016C0B39 choline sulfatase n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0B39 Length = 459 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. 
Identities = 109/459 (23%), Positives = 176/459 (38%), Gaps = 30/459 (6%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAY----TCSPVCTPAR 56 MK PN L + D Q N + + + T N+D L A G F A+ TC +C +R Sbjct: 1 MKAPNVLILFADDQRFNTINALNNDEIITPNLDRLVASGTAFTHAHIQGGTCGAICMASR 60 Query: 57 AGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 A L TG + + + MG +FK GY T GKWH + + + Sbjct: 61 AMLNTGR--SLFKLDDLGQQIPDDHTLMGEFFKARGYDTFGTGKWHNGKKSFNRSXDQGD 118 Query: 117 EWDA----DYWFDGANYLSELTEKEISLWR----NGLNSVEDLQANHIDETFTWAHRISN 168 D+W + E + + N N V+ ++ I+ Sbjct: 119 SIFFGGMSDHWAVPFYHYDSSAEYNKVIRKCVDQNHSNEVKKTAGEYMRAGEHSTDVIAE 178 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA--QDDLANKP 226 + FL Q + D+PF S+ PH P T P E+L Y +L + Sbjct: 179 SVIKFLDQ--KHDKPFFAYTSFLAPHDPRTMPEEFLNMYNPEDIKLPPNFMSYHFIEYAN 236 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 R A + H Y+A +D QIGR+++ L +NT ++Y D+ Sbjct: 237 WECRDETLAPYPRTLANTQKHIAEYYAMITHLDYQIGRILDKLEEIGEKDNTIIVYAGDN 296 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALADIEKP 344 G +G H L K ++YD R+PL+I + D V D+ PT+ L + E P Sbjct: 297 GLALGQHGLFGK-QSLYDHSMRVPLLISGAGIKAGMKTDALVYLFDIFPTLCDLLEQEIP 355 Query: 345 EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS---DEL 401 + G++ + Y D +R D FK + + + +L Sbjct: 356 ASVTGQSFAECIKGTKDAARDQIYLAYTDK------IRAITKDGFKYIEHRYNGIITKQL 409 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 +D +DP EM NL+ + + + + AL + + Sbjct: 410 FDLNSDPFEMSNLVLNPNYQEKLVALQKALQAESQQSNE 448 >UniRef50_D2QL61 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QL61_9SPHI Length = 489 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. 
Identities = 113/473 (23%), Positives = 201/473 (42%), Gaps = 51/473 (10%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +++PN + VM D +G + + T N+D LA E + PVC+P+RA L Sbjct: 35 VRKPNIVIVMADQWRAQDLGYAGNREVITPNLDKLALESVNAPLCVAEVPVCSPSRASLL 94 Query: 61 TGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH-------DYFGTGE 113 TG +A G + N+ T+ + GY T +IGKWH++G Sbjct: 95 TGQHATTHGVFYNDRPLRNEAVTLAEVCQQNGYKTGFIGKWHINGGLAKDFAAGRLAPIP 154 Query: 114 CPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF 173 +YW + S + N +N Q ++ A+ F Sbjct: 155 VDRRQGFEYWRGLEC----THDYNNSPYYNEVNKRFVWQQYDAISQ-------TDSAISF 203 Query: 174 LQQPARADEPFLMVVSYDEPHHPFT-CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLW 232 + Q + EPFL+V+++ PH P+ P EY ++YAD L Sbjct: 204 MTQSRK--EPFLLVLAWGPPHDPYQTAPKEYRQRYADKTLSLRPN--------------- 246 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGA 291 +P+ + Y+A + +DD IGR+ AL + + NT ++TSDHG+M+ + Sbjct: 247 ---VPAKDTMEANRALKGYYAHINALDDCIGRLQAALKGAKLDENTIFVFTSDHGDMLYS 303 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQG---ERRQVDTPVSHIDLLPTMMALADIEKPEILP 348 H I+K +D+ RIP +++ P G + R +D P++ D++PT+++L+ P + Sbjct: 304 HDQINK-QKPWDESIRIPFLLKYPAGLSRKGRTLDVPITLTDVMPTVLSLSGQTIPASVQ 362 Query: 349 GENILAVKEPRGVMVEFNR------YEIEHDSFG-GFIPVRCWVTDDFKLVLNLFTSDEL 401 G+N+ ++ + ++G G R T + V +L L Sbjct: 363 GQNVASLIRQPRAPRPDDAALIACIVPFHQWNYGRGGREYRGIRTARYTYVRDLKGPWLL 422 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRK 454 YD + DP ++ NL ++ + A + ++ L + D F++ + W Sbjct: 423 YDNQQDPYQLTNLANEPKLAGTQKQLEGILAQKLRAANDNFQAGNVYMDKWNY 475 >UniRef50_D2R201 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R201_9PLAN Length = 511 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. 
Identities = 101/448 (22%), Positives = 178/448 (39%), Gaps = 29/448 (6%) Query: 2 KRPNFLFVMTDTQATNMVGCYS-GKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 KRP+ LF+ D Q + +G T ++D+LAA G +A+ +P+C P+R L Sbjct: 39 KRPHILFIAIDDQ-NDWIGHLGGHPYAKTPHLDALAARGTTLANAHCQAPLCNPSRTSLM 97 Query: 61 TGIYANQSGPWTNNVAPG-----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECP 115 G+ +G + + ++ +Y + GY T GK + G G Sbjct: 98 FGLRPTSTGIYGLAPWIRTLPEFEKRVSLPQYLQQHGYRTLTTGKIYHGGL-----GPKK 152 Query: 116 PEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQ 175 + D W ++ +K I G + + D + ++I++ A++ L Sbjct: 153 RLEEFDVWGPAGGIGAKPEKKLIPPTPMGNHPLMDWGKFDHRDEDKGDYQITSWAIEQLD 212 Query: 176 Q--PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWA 233 A+ P + V Y PH P ++ ++ L A DD ++ P Sbjct: 213 DQVQHHAETPMFLSVGYFLPHVPCFISPKWYDEVPQGDKLLPLVAADDRSDIPRFAWYLH 272 Query: 234 QAMPSPVGDDGLYHH------PLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 ++P P H Y A FVD QIGR++ AL + ++T V+ DHG Sbjct: 273 WSLPEPRLKWVEDHRQWENLVRSYLASTTFVDAQIGRLLTALEERKLADDTIVVVWGDHG 332 Query: 287 EMMGAHKLISKGAAMYDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPE 345 +G + K +++ TR+PLI P ++ P +D+ PT++ LA + Sbjct: 333 WHLGEKGITGK-NTLWERSTRVPLIFAGPGITPKQVCGEPAELLDIFPTLVELAGLPPRN 391 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRR 405 L G ++ E+ + + ++ + S+ELYD + Sbjct: 392 DLEGHSLAPQLRDASRQREWPAITSHNQGN------HAIRSARYRYITYADGSEELYDMQ 445 Query: 406 NDPNEMHNLIDDIRFADVRSKMHDALLD 433 +DP E+ NL D +A V + L Sbjct: 446 SDPRELTNLASDSTWASVIADHRRWLPK 473 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. 
Identities = 110/478 (23%), Positives = 183/478 (38%), Gaps = 62/478 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN + VMTD Q + T I LAAEG+ F Y VC P RAGL T Sbjct: 33 RQPNIILVMTDDQGYWDTEISGNPKIKTPTIKKLAAEGVTFTRFYANM-VCAPTRAGLMT 91 Query: 62 GIYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G + ++G + G N +T+ + + AGY T GKWHL + + P Sbjct: 92 GRHYLRTGLYNTRFGGDTLGPNETTIAQVLQKAGYKTGLFGKWHLGRYAQY----QPQRR 147 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D++F + E + NG ++ ++ A+DF+Q+ Sbjct: 148 GFDHFFGHYHGHIERYTNPDQVVVNG---------TPVETRGYVTDLFTDAAIDFIQR-- 196 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 +PF ++Y+ PH PF +PE +L + + Sbjct: 197 NQQQPFFCYLAYNAPHSPF-------------------LLDTSHFGQPEGDKLIEKYLAK 237 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGAH--KLI 295 + +A + +D + R++ + + + T VI+TSD+G + L Sbjct: 238 GLPLREARI----YAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSDNGGVSRGFKAGLK 293 Query: 296 SKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPE--ILPGEN 351 A+ Y+ TR+P ++R + D V+ DL PT LA + P L GE+ Sbjct: 294 GSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLFPTFCQLAGVPVPSNVKLDGES 353 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR-CWVTDDFKLV-LNLFTS--------DEL 401 IL++ E G D + R FKLV + +L Sbjct: 354 ILSLMEQGGGKSPHQYLYHTWDRYTPNPYHRWAIHGPRFKLVGHDPQGKKKKEGEPQGQL 413 Query: 402 YDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPR 459 YD + DP E N+ D ++ + S++ L + + Y+ + P + P Sbjct: 414 YDLQEDPGEKKNVAD--QYPEKVSELRGEFLRWFQDVTAGQV-YEPAAIPVGDEQEPE 468 >UniRef50_Q7UJQ7 Iduronate-2-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UJQ7_RHOBA Length = 571 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. 
Identities = 110/474 (23%), Positives = 180/474 (37%), Gaps = 49/474 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L ++ D +GCY T NIDSLA G+RF AY VC P+R L Sbjct: 98 ERPNVLLILVDDLKP-ALGCYGDSIAKTPNIDSLANRGMRFEMAYCNQAVCAPSRFTLML 156 Query: 62 GIYANQSGPWTNNVAPG---KNISTMGRYF-KDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G ++ +G + + TM ++F K GY T +GK GH G E Sbjct: 157 GSHSTSTGLYGLGSQLRQIIPDAVTMPQHFAKQGGYRTESLGKTFHIGHGNHGDPESFSV 216 Query: 118 WDA-----DYWFDGANYLSELTEKEISLWRNGLNSVEDLQAN------HIDETFTWAHRI 166 +Y + +LT +E L ++ L + R+ Sbjct: 217 PHFKEKVIEYLEPASTDGGQLTREEAYFTNQMLGRIKTLPRGAAYESPDAKDEDYADGRV 276 Query: 167 SNRAVDFLQQPARADE----PFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 + + LQ + + PF + + PH PF+ P +Y + Y + + Sbjct: 277 AAETIQRLQAAKQRQKTEGTPFFIASGFARPHLPFSAPQKYWDLYDPASLPMPTHETLPV 336 Query: 223 ANKPEHHRLWA----------QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE 272 + + + Y+A +VD QIG+VI L Sbjct: 337 DAPKVAGKRGGEISNYKPVPTEPNADFDDELKRNLIHGYYASVSYVDAQIGKVIKELDRL 396 Query: 273 -QRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHI 329 +NT V+ DHG +G + +K Y+ RIP++I +P + Sbjct: 397 ELLDNTIVVLWGDHGFHLGDLGIWTKHTN-YEQANRIPILITAPGVTQPGSSTKQLAESV 455 Query: 330 DLLPTMMALADIEK---PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVT 386 D+ PT+ LA + P+ + G +++ V + V + + ++ R T Sbjct: 456 DIFPTLSELAGLPAPSGPQPIDGVSLVPVLKDSSARVRDHA----YHAYPKRQLGRSIRT 511 Query: 387 DDFKLVLNL------FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 + ++LV T+ ELYD + DPNE NL D +V ++ L Y Sbjct: 512 ERYRLVEWKAFDGKGDTAYELYDYQTDPNETKNLASDR--PEVVQRLTKILAKY 563 >UniRef50_B2UNV9 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNV9_AKKM8 Length = 500 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. 
Identities = 102/485 (21%), Positives = 180/485 (37%), Gaps = 78/485 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + + D VGCY K + T NID LA+EG R+ Y+ +PVC+P+R L TG Sbjct: 25 RPNVVLINADDLGWAEVGCYGQKKIKTPNIDKLASEGQRWVYFYSGAPVCSPSRNVLMTG 84 Query: 63 IYANQSGPWT--------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 + + T+ K AGY T GKW + Sbjct: 85 KHTGNCDVQDLKRVDAGENWRDLKGDWPIRTETYTLPEAMKKAGYATAVFGKWGIGD--- 141 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLW-----------------RNGLNSVEDL 151 FG+ P + D ++ + + T LW +G ++ Sbjct: 142 FGSTGAPDKHGVDRFYGYTDQKACHTYYPPYLWNDGKKEVLNTSLTAATIGHGSQPKGEV 201 Query: 152 QANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFY 211 A+ + I+++ ++F+++ A +PF + + EPH E++++Y + Sbjct: 202 LADTYRAEQHSSDLIADKMLEFVKEKAHGKQPFFLYYAPLEPHVAMQPLQEWIDRYPREW 261 Query: 212 YELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP 271 + + P Y +D +GR+++ L Sbjct: 262 DKSPYRGNRGYLPHP-------------------RPRAAYAGMISQMDHNVGRLLDTLKA 302 Query: 272 EQRE-NTWVIYTSDHG---EMMG-AHK-------LISKGAAMYDDITRIPLIIRSPQGE- 318 + NT VI+TSD+G + G H+ L +Y+ R+P IIR P Sbjct: 303 CGLDKNTIVIFTSDNGTTHDAGGVDHRFFNSVADLKGLKGQLYEGGIRVPGIIRWPGKIA 362 Query: 319 -RRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGG 377 + + P H D++PT+ AL + L G ++ V + + + + + GG Sbjct: 363 PGKTITQPAFHADVMPTLCALTGADAGSPL-GTDLSPVLLGKKSALHDRKPLV--WAGGG 419 Query: 378 FIPVRCWVTDDFKLVLN------LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDAL 431 + D K++ + E+YD DP E +N+ D+ ++ L Sbjct: 420 YGGQVAVRFDSKKVIRRNLFPGKKPDNWEVYDIVKDPAEKNNIAA--ENRDLINRAIAIL 477 Query: 432 LDYMD 436 Sbjct: 478 DREYQ 482 >UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT Length = 493 Score = 403 bits (1036), Expect = e-110, Method: Composition-based stats. 
Identities = 97/472 (20%), Positives = 168/472 (35%), Gaps = 76/472 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN ++++ D +G Y K + T N+D LAA+G+RF YT +PVC P+R T Sbjct: 25 QPPNIIYILADDLGYGELGSYGQKKIKTPNLDRLAADGMRFTQHYTGAPVCAPSRYMFLT 84 Query: 62 GIYANQSGPWTN-------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 G +A + N + + T+ + K AGY T IGKW L ++ Sbjct: 85 GNHAGHAYIRGNYELGQFSDEMEGGQMPIPETTPTLAKMLKKAGYQTAMIGKWGLGMNET 144 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDL----------------Q 152 G+ P DY++ + LW N + Sbjct: 145 TGS---PLLHGFDYYYGYLDQKQAHNYYPTHLWENDKKDPLNNDYFLVHSPISSKANQSD 201 Query: 153 ANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYY 212 + R+ +A+ FL A +D+P+ + PH P +++Y D + Sbjct: 202 FDQFKGQEYAPDRMLEKAIQFLDTTA-SDKPYFLYYPSPIPHVSLQVPDSLVDQYRDVFE 260 Query: 213 ELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE 272 E + + Y A +D ++G++ +++ + Sbjct: 261 EEPYLGNKGYTAH-------------------QFPNAAYAAMITHLDSEVGKIWDSVKEK 301 Query: 273 Q-RENTWVIYTSDHG----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--R 319 ENT ++++SD+G A L +Y+ RIP I Sbjct: 302 GQEENTLILFSSDNGPTFAGGVDPDFFNSAAGLRGLKMDVYEGGIRIPFIAYWKGKIKAG 361 Query: 320 RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFI 379 D H D+ T LA ++ G +IL + E+ Sbjct: 362 SISDLISGHWDMFNTFAELAGQDQSAP-DGISILPELLGESQNETHDYIYFEY---PEKR 417 Query: 380 PVRCWVTDDFKLVL-----NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSK 426 +D+K V NL + ELY+ + D NE+ N+ ++ +K Sbjct: 418 GQIALRIEDWKGVKVEMKTNLDSKWELYNLKTDRNEVFNVAA--EHPEIVNK 467 >UniRef50_Q7UER7 Sulfatase 1 n=8 Tax=Bacteria RepID=Q7UER7_RHOBA Length = 553 Score = 402 bits (1035), Expect = e-110, Method: Composition-based stats. 
Identities = 98/500 (19%), Positives = 172/500 (34%), Gaps = 68/500 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++ D + +G K T +ID+LA G+RF + Y VC+P+RA + G Sbjct: 57 RPNVVLILVDDLGLHDIGIEGSKFHQTPHIDALAKRGMRFTAGYANCRVCSPSRASIQLG 116 Query: 63 IYANQSGPWT--------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWH 102 + + G A T+ +++GY T + GKWH Sbjct: 117 QFTARHGITDWIGAKTGMDFNRGDELLPAEYVHAMPAKDVTLPEALRESGYKTFFAGKWH 176 Query: 103 LDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTW 162 L G G P + D G + S ++ + + Sbjct: 177 LGGE-----GSMPTDHGFDINIGGHHRGSPPGGF--------FAPFKNPVMEDGPDGESL 223 Query: 163 AHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDL 222 R+ F++ + D+P+ ++S+ H P E +KY + Sbjct: 224 TRRLGKETASFIE--GQDDQPYFAMLSFYAVHGPIQTTQELWQKYRESAPAPPADGNRFK 281 Query: 223 ANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIY 281 ++ R + +P+Y + +D+ +G V+ A+ +NT VI+ Sbjct: 282 IDRTLPVR-------------QIQDNPVYAGMMETLDNAVGDVMAAIEASGKADNTLVIF 328 Query: 282 TSDHG-------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLL 332 T D+G ++ R P + P D PV DL Sbjct: 329 TGDNGGVSSGDAYSTSNLPHRGGKGRQWEGGLREPYYVSMPAIVPENSTSDVPVIGSDLY 388 Query: 333 PTMMALADIE--KPEILPGENILAVKEPRGV---MVEFNRYEIEHDSFGGFIPVRCWVTD 387 PT++ + ++ + + G ++ V + H G P T Sbjct: 389 PTILDVCNLPLRPQQHIDGRSLETVLAGGKDELLEQRSLIWHYPHYGNQGGEPSSVIRTG 448 Query: 388 DFKLVLNL-FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 D+KL+ + DELY D E ++L + + M LL Y+ + F Sbjct: 449 DYKLIHYHLDSHDELYHLPTDIGEQNDLAS--EQPERVAAMRKELLAYLKSVDAKFPQPD 506 Query: 447 WSLRPWRKDARPRWMGAFRP 466 P + A+ RW P Sbjct: 507 PRFDP--EKAKQRWARTHGP 524 >UniRef50_C0W1U3 Sulfatase n=1 Tax=Actinomyces coleocanis DSM 15436 RepID=C0W1U3_9ACTO Length = 482 Score = 402 bits (1035), Expect = e-110, Method: Composition-based stats. 
Identities = 124/510 (24%), Positives = 192/510 (37%), Gaps = 56/510 (10%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PNF+ +TD Q + L T N+D L+ E F++ Y SPVC+PAR L T Sbjct: 3 KQPNFVIFVTDDQGPWATSEHW-PELQTPNLDQLSKESSTFSNYYCASPVCSPARGTLLT 61 Query: 62 GIYANQSGPWT----------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G + G I T+ D GY+ +GKWH+ Sbjct: 62 GRMPSAHGIHDWLVGGRHPDALEEPFLDGIITLPEVLDDNGYYCGMVGKWHVGTSQT--- 118 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P YW+ + +W D N E + I+ A Sbjct: 119 ----PAPGFSYWYA--HRYGGGPYYNAPIW--------DENGNEATEPKYFTDAIAENAC 164 Query: 172 DFLQQPA--RADEPFLMVVSYDEPHHPF--TCPVEYLEKYADFYYELGEKAQDDLANKPE 227 DF+Q A ++PF ++V++ PH P+ P E ++ YAD + +++ + Sbjct: 165 DFIQSAASVNEEKPFFLMVNFTAPHSPWINNHPQELMDLYADTDF--PSIPREEPHPWTK 222 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 ++ +A A PV Y A VD+ +G ++ AL +NT V+Y SD+G Sbjct: 223 YYDDFADAFADPVP-----SLRGYAASLTGVDNAVGDILKALEENAYADNTVVMYMSDNG 277 Query: 287 EMMGAHKLISKGA-----AMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPTMMALAD 340 G H + KG +++ R+P II P E R+VD VS T+ LA+ Sbjct: 278 FSCGQHGIWGKGNGTFPLNFWENSVRVPFIIHLPGQHEYRKVDDHVSACSFFETVCELAE 337 Query: 341 IEKPEI-LPG-ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 I PE L G +I + + + G R D K + Sbjct: 338 ITPPEDPLRGARSIADLARGEIRDSDEPVMVFDEYGGG-----RMIRYGDLKFIDRFDGP 392 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARP 458 ELYD +NDP E++NL+ D + VR ++ + + R P Sbjct: 393 QELYDLKNDPAELNNLVHDESYEKVRDELASLMGQWFAAHETEVYRAFHRDIRGRGQVHP 452 Query: 459 RWMGAFRPR---PQDGYSPVVRDYDTGLPT 485 G R + + LP Sbjct: 453 PHAGYNDTRTYVTDNENRDGDNTHKKALPV 482 >UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKD8_9BACT Length = 455 Score = 402 bits (1034), Expect = e-110, Method: Composition-based stats. 
Identities = 100/486 (20%), Positives = 178/486 (36%), Gaps = 83/486 (17%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 ++PN + ++ D +G + T +ID+LA G+ F Y + VC P+RAGL T Sbjct: 20 QKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNFTQGYQSASVCGPSRAGLLT 79 Query: 62 GIYANQSGPWTN------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF 109 G Y G N + + + K A Y T IGKWH+ Sbjct: 80 GRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMG----L 135 Query: 110 GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 + P + DY++ N E ++ + + + + ++ Sbjct: 136 SHEQRPTQRSVDYYYGFLNGAHSYREAKMDMKGAPMTWPIFRNNEPVPFSGYTTEVFNDE 195 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 V+F+++ D+PF + +SY+ H P+ + L++ Sbjct: 196 GVNFIKR--NKDKPFFLYMSYNSVHGPWEAQPKDLQRSDHIK------------------ 235 Query: 230 RLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHG-- 286 +Y A +DD +GR+I L E ENT VI+ SD+G Sbjct: 236 ---------------KKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAP 280 Query: 287 -----------EMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVD--TPVSHIDLLP 333 + L + Y+ R+P I+ PQ +Q PVS +D++P Sbjct: 281 NNLHEAERAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVP 340 Query: 334 TMMALAD-IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV 392 T++ ++ + L G N++ D D+KL Sbjct: 341 TLIHISQAAPAKKELSGVNLMPYITGEKTSRPHKTLYWRRDD------DYAIRDKDWKLT 394 Query: 393 LNLFTS---DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSL 449 N + L++ ++DPNE +NLI + ++ K+ + K+ D +W Sbjct: 395 WNDYNGPRTPMLFNLKDDPNEKNNLIH--KHPEIAQKLQAKFDQWDSKLPDN----KWWG 448 Query: 450 RPWRKD 455 P ++ Sbjct: 449 GPSNRN 454 >UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirellula baltica RepID=Q7URY7_RHOBA Length = 490 Score = 402 bits (1034), Expect = e-110, Method: Composition-based stats. 
Identities = 99/491 (20%), Positives = 167/491 (34%), Gaps = 90/491 (18%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 + PN LF+ D T N+D+LA G+ F++AY+C+ C PARA L + Sbjct: 32 EHPNVLFIYLDDYGWRDATFMGSDFYETPNLDALAERGMVFSNAYSCAANCAPARASLLS 91 Query: 62 GIYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL 103 G Y+ + + +I T +DAGY T IGKWHL Sbjct: 92 GQYSPRHEIYNVGTERRGNPKHGTLQHIPGTETLSSDIQTWAHQVRDAGYRTGIIGKWHL 151 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 + P + D G + S + + Sbjct: 152 --------SDDPLPYGFDINVAGTHSGSPPKGYFPPH-------PKVPGLQDTSDDEYLT 196 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R+++ A+ F++ A + + + +S+ H P + + KY Sbjct: 197 DRLTDEAIGFIE--ANQEWSWFLYLSHFAVHTPLQAKPDLVAKYKAKQPGT--------- 245 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYT 282 L+ H + A + VD+ +GR++ L E NT +++T Sbjct: 246 ---------------------LHDHAVMAAMIESVDEGVGRMVETLRELGLEENTAIVFT 284 Query: 283 SDHGEMM---GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMA 337 SD+G L Y+ R P + P + D PV DL PT + Sbjct: 285 SDNGGFGPATSMKPLRGYKGTYYEGGIREPFFVTWPGVVDAGTKSDVPVIAADLYPTFIE 344 Query: 338 LADIEKP--EILPGENILAVKEPRGVMVEFNRYEI-------------EHDSFGGFIPVR 382 + + P + L G +++ + + G + + Y + D P Sbjct: 345 MTGAKLPADQPLDGVSLMPLLKQEGSLADRELYWHFPAYLQSYSVTDGQRDLLYRSRPCG 404 Query: 383 CWVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +KL ELYD DP E +NL D +H L+ + ++I Sbjct: 405 IIRDGRWKLHEYFEDGGLELYDLVTDPGESNNLAD--ANPIKTQALHSKLVAWRERIGAS 462 Query: 442 FRS-YQWSLRP 451 + + P Sbjct: 463 MPTEPNPNHDP 473 >UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=Q482D6_COLP3 Length = 492 Score = 402 bits (1034), Expect = e-110, Method: Composition-based stats. 
Identities = 104/463 (22%), Positives = 183/463 (39%), Gaps = 40/463 (8%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN + ++ D + Y T NID LAA+G++F++AY P C P+R +F+ Sbjct: 29 SKPNVVMLLVDDFGRQDLSTYGSNFYETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFS 88 Query: 62 GIYANQSGP----WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 G Y + G + T G + K+AGY T YIGKWHL G P + Sbjct: 89 GSYPTRYGVPQGERVGKHHLPLSAVTFGEHLKEAGYQTGYIGKWHLG-----KEGGDPTK 143 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 D ++ + + + + E R+++ A+ F++Q Sbjct: 144 QGFDSSIMAGHWGAPPSYYFPYTKMSKSGKNKGFAKVEGSEEEYLTDRLTDEALTFIEQ- 202 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 + D+PFL+V+++ H P ++KY +LG ++ Sbjct: 203 -KKDQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGIANAGPKSDAD-------LIKD 254 Query: 238 SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE--------- 287 S + ++P Y A + VD +GR+ L +NT +I TSDHG Sbjct: 255 STGYHKTIQNNPDYAAMVESVDISVGRIEQQLKRLGLEDNTIIILTSDHGGLSSRGLKSN 314 Query: 288 ---MMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALAD-- 340 + +YD TR+PLI++ P+ V+ D PT++ +A Sbjct: 315 RVLATSNNPYRHGKGWIYDGGTRVPLIVKWPEKVKAGSISQVQVTGTDHYPTILQMAGLS 374 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIE--HDSFGGFIPVRCWVTDDFKLV-LNLFT 397 + + G + LA + + S G + ++KL+ Sbjct: 375 LSPKDHQDGVSYLAALNSDETPRKAMFWHSPAARPSKTGDTNSSAIIEGEWKLLDFWSTG 434 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 ELY+ ++D +E +NL + + ++M L ++ D I Sbjct: 435 KVELYNLKDDKSEANNLAKLM--PEKTAEMLAKLTNWKDDIDA 475 >UniRef50_C6XY33 Sulfatase n=5 Tax=Bacteroidetes RepID=C6XY33_PEDHD Length = 493 Score = 402 bits (1034), Expect = e-110, Method: Composition-based stats. 
Identities = 95/464 (20%), Positives = 174/464 (37%), Gaps = 43/464 (9%) Query: 3 RP-NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +P N LF+ D +GCY + + + +ID+LAA+ + F + P C +RA + T Sbjct: 29 KPYNVLFIFVDDLRP-DLGCYGNRIIKSPHIDALAAQSVLFKQQFVTVPTCGASRASILT 87 Query: 62 GIYANQSGPWTNNV-APGKNISTMGR----YFKDAGYHTCYIGKWHLDGHDYFGTGECP- 115 G+ +N + + GY+T IGK Y P Sbjct: 88 GLRPRSVNDLSNEAFELKPKSQNIPESFIALLRQQGYYTVGIGKISHSPDGYVYKYLEPK 147 Query: 116 -------PEWDADYWFDGANYLSELTEKEISLWRNG---LNSVEDLQANHIDETFTWAHR 165 WD + G + N V+ + + ++ Sbjct: 148 SSQMELERSWDEMLFNAGKWKTGWNAFFGYADGNNRNELKGEVKPYEHAPVSDSNYPDGL 207 Query: 166 ISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 + AV L++ + ++PF + V +PH PFT P +Y + Y + L + Sbjct: 208 TAEMAVSKLKELSTKEKPFFLGVGLFKPHLPFTAPQKYWDLYQEADISLTPSPDIPVDVN 267 Query: 226 PEHHRLWAQAMPSPVGDD------------GLYHHPLYFACNDFVDDQIGRVINALTPEQ 273 P + + G++ Y+A + D QIG++++ L Sbjct: 268 PVSLQESGEFNGYKKGEERASLAKPVSDAYARKLRHAYYAAVSYSDAQIGKILDELKRSG 327 Query: 274 RE-NTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLL 332 ++ NT V+ DHG +G ++ K + + PLII+ P + + VS +D+ Sbjct: 328 KDKNTIVVLWGDHGWHLGDDRVWGKH-TLSEWALHSPLIIKVPGLPQAINNNVVSAVDVY 386 Query: 333 PTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV 392 PT+M L I+KP + G +++ + ++ F TD ++L Sbjct: 387 PTLMELCGIKKPAHIDGTSLVPALKNP------LASSAGGIAYSYFKKGISLRTDRYRLT 440 Query: 393 LNLF---TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLD 433 + ELYD + DP E N+ + ++ ++ L Sbjct: 441 KYFRAAMPAIELYDHQTDPYENKNIAA--QQPELVKQLMVLLEK 482 >UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE Length = 493 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. 
Identities = 101/483 (20%), Positives = 172/483 (35%), Gaps = 66/483 (13%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +F+ D + C + T +ID LA EG+ F +Y PV +P+RA L TG Sbjct: 27 PNVIFIYADDLGYTDLSCTGSRFYETPHIDKLAREGVCFTQSYAACPVSSPSRAALLTGK 86 Query: 64 YANQSGPWTN--------------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHL 103 Y + N+ K+ TM F+ GY T GKWHL Sbjct: 87 YPARINLTDYIPGDRAYGPHKNQRLASLPFNLHLSKDEITMAEAFRQNGYSTFMAGKWHL 146 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 + P + D G N + + Q E Sbjct: 147 AESAEY----YPEQNGFDINIGGNNTGHPSKGY--------FSPYGNPQLKDGPEGEYLT 194 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R+++ + ++ +P ++PF + +SY H P E + KY A Sbjct: 195 DRLTDEVIRYISEP--KEKPFFVYLSYYTVHLPLQAKAEKIAKYR-RKLSRAVPADSSFV 251 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYT 282 K E + Q +P Y A + +D+ IGR+++ L + T V++T Sbjct: 252 KKGETYHKLVQDIP------------AYAAMVESLDENIGRLLDTLHRSGLDERTIVVFT 299 Query: 283 SDHGEMM----------GAHKLISKGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHID 330 SD+G M L + +Y+ ++P IIR + + + DTP+ D Sbjct: 300 SDNGGMATSNTTRNIPTSNLPLRAGKGYLYEGGIKVPAIIRWSRHLKGRQVSDTPIIGTD 359 Query: 331 LLPT--MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFG-GFIPVRCWVTD 387 PT + + + + G ++ V + + + H S G G P Sbjct: 360 YYPTLLDLCGLPLLPGQHVDGVSMKPVLQGGRLSRPSLFWHYPHYSGGLGGRPSAAIREG 419 Query: 388 DFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQ 446 D+KL+ ELY+ D +E +L + ++ + L + ++ Sbjct: 420 DYKLIEFFEDHHVELYNVIQDESEEKDLS--QIYPEIADGLRKKLYLWYKEVGARMPVDN 477 Query: 447 WSL 449 Sbjct: 478 PHY 480 >UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3ZMT9_9PLAN Length = 542 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. 
Identities = 104/521 (19%), Positives = 182/521 (34%), Gaps = 98/521 (18%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + +M D + +G + G+ + T NID+LA G+RF+ Y C P RA L TG Sbjct: 28 RPNIILIMVDDMGFSDLGYHGGE-IATPNIDALAHSGVRFSQFYNNG-RCCPTRATLMTG 85 Query: 63 IYANQSGP-----------------WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDG 105 +Y +Q+G T +N T+ + GY T GKWHL Sbjct: 86 LYPHQTGIGHMTESPGEANYGSGKPPTYQGYLNRNCVTIAEALQQQGYATLMSGKWHLGE 145 Query: 106 HDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHR 165 +D P + + +F + + + N + D+ F Sbjct: 146 NDK---SRWPLQRGFEKYFGCLSGATLYFFPDGDRKMTLGNQQIAEPESTTDQPFYTTDA 202 Query: 166 ISNRAVDFL-QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 ++ A+ FL ++ A P + ++Y PH P E + KY Y +K ++ Sbjct: 203 FTDYAIRFLKEEQAGQQRPMFLYLAYTAPHWPLQAFEEDIAKYRGKYKIGWDKLREQRLE 262 Query: 225 KPEHHRLWA---------------QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINAL 269 + ++ L A + + D+ +Y A D VD IGR++ L Sbjct: 263 RQKNLGLIAADRQLSPRTPKIPAWDELDAAQQDEMDLKMAVYAAMIDRVDQNIGRLMKHL 322 Query: 270 TPEQR-ENTWVIYTSDHGE------MMGAH-------------------------KLISK 297 ++T +++ SD+G + GAH Sbjct: 323 KESGIEDDTLILFLSDNGGCQEGGVLGGAHFLDPEQRNRQYFHGYGEAWANASNTPFRLY 382 Query: 298 GAAMYDDITRIPLIIRSPQGERRQ---VDTPVSHIDLLPTMMALADIEKPE--------I 346 ++ T P +R P + P ID++PT++ +A P Sbjct: 383 KHFNHEGGTATPFFMRWPGKIAARDAWCAEPAQLIDVMPTILDVAGATYPAKYAENAIPP 442 Query: 347 LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL--------FTS 398 L G ++ + + IEH++ D+KLV Sbjct: 443 LDGVSLRPTMQGE-PLDRQQPICIEHENN------ASIRAGDWKLVGRGVAAPRGVQPAK 495 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 ELY+ +D E NL + + ++ + ++ Sbjct: 496 WELYNIADDRTETQNLA--VEHPEKVRELSQQWNAWAKRVG 534 >UniRef50_Q5LRB5 Choline sulfatase n=1 Tax=Ruegeria pomeroyi RepID=Q5LRB5_SILPO Length = 498 Score = 401 bits (1032), Expect = e-110, Method: Composition-based stats. 
Identities = 122/487 (25%), Positives = 198/487 (40%), Gaps = 26/487 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN L +M D M+ G T+++ LA ++F +AYT SP+C PAR+ TG Sbjct: 16 RPNILLIMADQMTPFMLEACGGTGARTRHLTRLAGRAVQFTNAYTPSPICVPARSCFMTG 75 Query: 63 IYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADY 122 +Y + +G + N + T Y +AGY T GK H G D + + D Sbjct: 76 LYTSTTGCYDNGDPYHSFLPTFAHYLTNAGYETVLSGKMHFIGADQLHGFQ--RRLNPDI 133 Query: 123 WFDGANYLSELTEKE---ISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 + G + L + + + + + RA+++L+ Sbjct: 134 YPSGFLWSYPLPPDGDASFQAFDFTPQYLAENIGPGWSKELQYDEETQFRALEYLRHA-- 191 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE-------HHRLW 232 D P+++ VS+ PH P+ P Y E Y D L + D A E H L Sbjct: 192 PDTPWMLTVSFTNPHPPYVVPRPYWEMYKDADIPLPDYPADMDARYSEFDHALRRWHGLH 251 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGA 291 + + + + A +VDD+IG ++ L R+ T +I TSDHGEM+G Sbjct: 252 QRGHEVRDPRNLIAMRRGFAALAHYVDDKIGALLEVLDETGQRDETVIIVTSDHGEMLGE 311 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 LI K ++Y+ RIPLII P +VDTPVS +DL T++ L+ L G + Sbjct: 312 KGLIQK-RSLYEWSARIPLIIDLPGAAPGRVDTPVSLLDLPATLIELSGQTPVAPLEGRS 370 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 +L + + + E+ G P D+K ++ +LY+ DP E Sbjct: 371 LLGAVRGQEL--DTVPIVSEYHGEGIMRPSFMVRLGDWKYHYCHGSAPQLYNLARDPGEW 428 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQ-- 469 HN + A+ +++ + F + + W + + + A R Sbjct: 429 HNRAGEPDLAETEARLDRVI------TGGSFDLERIAREVWERLPLKQVVNAAMQRNGTA 482 Query: 470 DGYSPVV 476 Y P Sbjct: 483 WDYRPDP 489 >UniRef50_B4D433 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D433_9BACT Length = 465 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. 
Identities = 96/489 (19%), Positives = 171/489 (34%), Gaps = 74/489 (15%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +F + D + C+ T NID LA +G++F AY+ VC+P RA + +G Sbjct: 26 KPNVIFFLVDDLGATDLSCFGSSFYQTPNIDRLAQDGLKFTHAYSACTVCSPTRASIISG 85 Query: 63 IYA----------------NQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 Y + ++ST+ + AGY TC IGKWHL Sbjct: 86 RYPAELHLTDWIAGHKRPKAKLRIPDWTQHLTHDVSTLPQAMHAAGYTTCAIGKWHLG-- 143 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 + P ++ D T + + + R+ Sbjct: 144 -----EDGPEKYGFDVAIADNGKGQPATYFSPYKNPH---------LSDGPPGEFLSDRL 189 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 + A F++Q + PF + ++ H P + KY Sbjct: 190 TTEAEKFIEQ--NKEHPFFLYFAHYAVHTPLMGKPAVIAKYK------------------ 229 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDH 285 V + H+P+Y + + VDD +G + L + + T +I+TSD+ Sbjct: 230 -----------EHVSPNDPQHNPVYASLIESVDDSLGHLRAKLDELKLSDKTIIIFTSDN 278 Query: 286 GEMM-----GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMAL 338 G ++ + + + Y+ R+P I P TPV +D TM+ L Sbjct: 279 GGLILNQVTSNLGMRAGKGSAYEGGVRVPAIAFVPGVTQAGTVATTPVISMDWTATMLDL 338 Query: 339 ADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 A + + G ++ V + + + H GG P + D+++LV + Sbjct: 339 AGAKPLDQQRGVSLAPVLHGGQISLRALFWHYPHYHPGGATPYCAMLEDNWRLVEFFEDN 398 Query: 399 D-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDAR 457 ELY +DP E H+L ++ L + + + + P + Sbjct: 399 HVELYHLSDDPEEKHDLAASQS--AKAEELKARLHAWRETMHAQLPTPNPDYDPAHANDG 456 Query: 458 PRWMGAFRP 466 P+ Sbjct: 457 PKKKAPPEQ 465 >UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD Length = 480 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. 
Identities = 94/476 (19%), Positives = 163/476 (34%), Gaps = 47/476 (9%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +F++ D + Y + + T + LA EG++F YT + VC P+RA TG Sbjct: 24 PNVIFILADDLGYGDIEPYGQQIIKTPQLSKLADEGMKFTQFYTGTSVCAPSRASFITGQ 83 Query: 64 YANQSGPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 ++ N N ++ + FK AGY+T GKW L + Sbjct: 84 TTGETHIRGNEEVREPVDGQAPLLANDPSVAQLFKKAGYNTGCFGKWGLG---IVPSEGN 140 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P + D +F + LW + + N+ + I + +D++ Sbjct: 141 PLKQGFDTFFGYNSQFRAHRRYPAFLWHDNEKVLIPENGNYERQEVYGEDLIQEKILDYI 200 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYAD-FYYELGEKAQDDLANKPEHHRLWA 233 + A++PF M ++Y PH P + YA Y D WA Sbjct: 201 GKQT-AEKPFFMWLTYTLPHAELVVP--HDSIYASYEYLPKKPYKGVDYDKITPKPFGWA 257 Query: 234 QAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG------ 286 M P + Y A +D +G + L + + +T +I+ SD+G Sbjct: 258 GYMSQPHT------YATYAAMVSRLDKYLGEIRKLLKVKGLDEDTIIIFASDNGAHREGG 311 Query: 287 ----EMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALAD 340 + L +Y+ R P I+ D + D++PT + Sbjct: 312 ADPKFFNSSAGLRGIKRDLYEGGIRTPYIVYWKGKIKAGSVSDHIGAFWDMMPTFAEITH 371 Query: 341 IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-----NL 395 + + L + + E GG + ++K V + Sbjct: 372 QKYVPNRHQVSFLPTLLGKKQQQQHKYLYWEFHEMGGR---QAVRYKNWKGVRLNVNKDK 428 Query: 396 FTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRP 451 ELYD DP E HNL + ++ + K+ + R + W + Sbjct: 429 KAPIELYDLTTDPAEQHNLAE--KYPKIVKKIERFMEQ--SHTRSELFPFDWEAKK 480 >UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP2_9BACT Length = 446 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. 
Identities = 95/466 (20%), Positives = 165/466 (35%), Gaps = 78/466 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + V D V + + T ID++A G+ F Y + VC P+RAG+ TG Sbjct: 19 KPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGILTG 78 Query: 63 IYANQSGPWTNNV---APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 Y G TN K+ + K AGY + GKWHL G+ P + Sbjct: 79 RYQQLFGVVTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKK----GQFPNDRG 134 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHI---DETFTWAHRISNRAVDFLQQ 176 D ++ + + L + I E +I++ AV+F+++ Sbjct: 135 FDTFYGFHFGAHDYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFIEE 194 Query: 177 PARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAM 236 D+PF M V+Y+ H P+ P EYL + Sbjct: 195 --NKDQPFFMYVAYNSVHSPWQVPDEYLAR------------------------------ 222 Query: 237 PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE-------- 287 + + + L+ A +DD +GR+ L + NT ++T+D+G Sbjct: 223 ---IPESVPAYRRLFLAMVLAMDDGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKP 279 Query: 288 ------MMGAHKLISKGAAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMALA 339 M + Y+ R+P + P+ + + PV DL PT ++ A Sbjct: 280 NEGQYRMSMSQGFRGYKGDTYEGGIRVPFCMSWPKKIKSGNKFEAPVIAYDLAPTFLSAA 339 Query: 340 DIEKP-EILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 +E + G+++L E S D+KL N Sbjct: 340 SLEYSTKQFSGKDLLPYLEDEQKGRPHETLFWHRHSGLDD---YAVRHGDWKLTYNDQEG 396 Query: 399 D----------ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 +L++ + DP E +L D + + ++ ++ Sbjct: 397 TSKDFLKKVHLKLFNLKQDPYEKKDLADSM--PEKLQQLKQLYFNW 440 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. 
Identities = 105/494 (21%), Positives = 167/494 (33%), Gaps = 100/494 (20%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 KRPN LF++ D GCY GK + T NID L A G+RF+S Y +P C +RA L T Sbjct: 26 KRPNILFIVADDLGYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMT 85 Query: 62 GIYANQSGPWTN---------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG 112 G Y + G N N T+ +D GY T +GKWHL G F Sbjct: 86 GRYQTRFGFEYNPIGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLGGTAPFH-- 143 Query: 113 ECPPEWDADYWFDGAN--------------------------YLSELTEKEISLWRNGLN 146 P D +F + + ++W L+ Sbjct: 144 --PQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWSTDLH 201 Query: 147 SVEDLQANHID---------ETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPF 197 E E + A F+ + +P+ + ++Y+ H P Sbjct: 202 ENEPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRHQA--QPWFLYLAYNAVHSPL 259 Query: 198 TCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDF 257 Y+EK++ G ++ A Sbjct: 260 QGEDTYMEKFSHI---------------------------------GDIQRRIFAAVLAH 286 Query: 258 VDDQIGRVINALTPEQR-ENTWVIYTSDHGE-----MMGAHKLISKGAAMYDDITRIPLI 311 +D+ IG+V L + ENT V++ SD+G L ++D RIP Sbjct: 287 LDEDIGKVRAQLRADGLEENTLVVFLSDNGGPTKELTSSNLPLRGGKGDLWDGGIRIPFA 346 Query: 312 IRSPQGERR--QVDTPVSHIDLLPTMMALADIEKPE-ILPGENILAVKEPRGVMVEFNRY 368 + +D P +DL T + LA E + L G ++L + + + Sbjct: 347 VSWKGQIPAGHTIDAPAISMDLTATALKLAGAETEQAKLDGVDLLPLLTGKTTAAPHDTL 406 Query: 369 EIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 F D+KL+ +LYD +D E +N+ + A +++ Sbjct: 407 ------FWRVGRKNALRHGDWKLLRQGSKEWQLYDLAHDVGETNNMAA--QNAARVTELS 458 Query: 429 DALLDYMDKIRDPF 442 + + DP Sbjct: 459 ALWDKWNSEQIDPL 472 >UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ89_PLALI Length = 536 Score = 400 bits (1030), Expect = e-110, Method: Composition-based stats. 
Identities = 103/484 (21%), Positives = 172/484 (35%), Gaps = 79/484 (16%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN +F++ D VGC+ + T NID LA+ G++ Y+ +P C P+R L TG Sbjct: 38 RPNVVFILADDLGWGEVGCFGQSKIPTPNIDRLASRGVKLTRHYSGAPTCAPSRCVLMTG 97 Query: 63 IYANQSGPWTN-------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF 109 + + N T+ R F+ AGY T GKW L Sbjct: 98 KHLGHAEIRGNQQAKVKLPQFTEGQHPLSDKALTIARQFQKAGYATGAFGKWGLG---PV 154 Query: 110 GTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE----------- 158 G+ P D +F + +LW+N + V + + + Sbjct: 155 GSTGEPNRQGFDEFFGYNCQALAHSYFPKALWKNAESIVNNEKPVPGHKKQPEGEVTMEA 214 Query: 159 ---TFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELG 215 I A+ F+ + +PF + + + EPH P + +E++ + E Sbjct: 215 YQGENYAPRLIMAEALSFIDR--HHQQPFFLYLPFTEPHVAMQPPPKIVEEFPVEWDERV 272 Query: 216 EKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-R 274 + P Y A +D+ +G VI +L Sbjct: 273 YRGDGGYLPHP-------------------RPRAAYAAMIRDLDNHVGDVITSLEKHGLL 313 Query: 275 ENTWVIYTSDHG------------------EMMGAHKLISKGAAMYDDITRIPLIIRSPQ 316 E T +++TSD+G +L ++Y+ R+P I+ P Sbjct: 314 EKTLIVFTSDNGATHASANPDFHVGGADPLFFNSTRELKGFKGSIYEGGLRVPAIVSWPG 373 Query: 317 GER--RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDS 374 ++TP D PT+ + PE L G N+L + + +F R + Sbjct: 374 QIPPATTINTPSYFPDWFPTLCNATQLPLPEGLDGVNLLPLLTGKTSPDQFIRPDPMVWV 433 Query: 375 FGGFIPVRCWVTDDFKLVL-----NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHD 429 + + C DFK++ N E+Y +DP E NL D D+ +K + Sbjct: 434 YAEYTGQVCVHLGDFKVLRRGLRTNRPGPWEVYQLVSDPGESTNLADSR--PDLVTKAIE 491 Query: 430 ALLD 433 L Sbjct: 492 VLKA 495 >UniRef50_A6C1Q0 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1Q0_9PLAN Length = 469 Score = 400 bits (1030), Expect = e-110, Method: Composition-based stats. 
Identities = 106/477 (22%), Positives = 182/477 (38%), Gaps = 67/477 (14%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q +G Y + ++T ++D + +G F +A+ +PVC+P+RA +G Sbjct: 29 RPNLISIVTDDQGRWAMGLYGNRQIHTPHMDQIGKQGAVFTNAFVATPVCSPSRATFLSG 88 Query: 63 IYANQSGPWTN------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 + + G T + GY T IGKWHL + F P Sbjct: 89 RFPTELKITDWISSEEAQEGAGLTAMTWPEVLQQHGYQTALIGKWHLGELNQFH----PH 144 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQ 176 E ++ + +N +++ + + + A++F++ Sbjct: 145 EKGFGHFMGFLAGGTRP-----------MNPTLEIKGETQKRKGSLPDLLVDDAINFIRT 193 Query: 177 PARADEPFLMVVSYDEPHHPFTC-PVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQA 235 D+PF + + + PH P+ P + Y ++ Sbjct: 194 S--KDKPFALCLHFRAPHTPYGPVPEQDSAHYEGMKIDVPIT------------------ 233 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKL 294 P + + + Y+A VD IGR++ L + ENT VI+TSDHG G H + Sbjct: 234 -PGVIPEQIRQKNKEYYASVSSVDRNIGRLLKELDQLRLAENTLVIFTSDHGYNNGRHGV 292 Query: 295 ISKG--------------AAMYDDITRIPLIIRSPQGERRQV--DTPVSHIDLLPTMMAL 338 +KG M+D R+PL++R P + D VS+ID+ ++ Sbjct: 293 STKGNGHWIAGGVTGPKRPNMWDTSIRVPLVMRWPAVIKPGTQFDEIVSNIDMFKFVLGA 352 Query: 339 ADIEKPE--ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV--LN 394 I +P L G + + + V + G +R T K V Sbjct: 353 LKIPQPANLKLHGIDYSPLLFGQPAPVRKALFGQYDLHNNGLAYLRMIRTPKLKYVKHYR 412 Query: 395 LFTSDELYDRRNDPNEMHNLID---DIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 DELYD DP E NL+ + D + D L+++ I+DP + Sbjct: 413 ARNMDELYDLETDPGENTNLLQRRTRKNWQDTADLLEDQLIEWQKSIQDPILEPAYE 469 >UniRef50_C1ZIW1 Arylsulfatase A family protein n=6 Tax=Bacteria RepID=C1ZIW1_PLALI Length = 526 Score = 400 bits (1030), Expect = e-110, Method: Composition-based stats. 
Identities = 113/501 (22%), Positives = 188/501 (37%), Gaps = 79/501 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSG--KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 PN LF+ +D A + Y K L T +ID +A EGIRF+ + +C P RA + T Sbjct: 36 PNILFIFSDDLAYQAISAYGDERKLLETPHIDRVAKEGIRFDRCVVTNSICGPCRATILT 95 Query: 62 GIYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G Y++++G + N N +T + K GY T IGKWHL Sbjct: 96 GKYSHKNGFYNNTNSRFDSTQTTFPKLLKSQGYSTALIGKWHLISEPT----------GF 145 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D+W N + I++R++D+L+ Sbjct: 146 DHWEILPGQGIYY------------NPPMIANGQKVQREGYVTDIITDRSIDWLKNR-DK 192 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE------------H 228 +PFL++ + PH ++ + +L D + D ++ + Sbjct: 193 SKPFLLMAQHKAPHREWSPALRHLGFNKDKPFAEPATLFDQHKDRAQAVVDHDMGIDRTF 252 Query: 229 HRLWAQAMPSP-------------------------------VGDDGLYHHPLYFACNDF 257 +L A+ +P P V + Y AC Sbjct: 253 TKLDAKLVPPPGINSTQLEEWNKYYLPRNNAFEAAHLQGQDLVRWRYQRYMHDYLACVKA 312 Query: 258 VDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ 316 VD+ +GR++ L E ENT V+ +SD G +G H K ++++ R PL+ R P Sbjct: 313 VDESVGRLLQTLDEEGLAENTLVVVSSDQGFYLGEHGWFDK-RWIFEESLRTPLLARWPA 371 Query: 317 GE--RRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRGVMVEF---NRYEIE 371 R VS +D+ T + +A I+ P + G ++L + + E Sbjct: 372 AIPAGRTNGQIVSLLDIAQTFLDVAKIDAPNDMQGASLLPLLKGDTPADWRKSLYYRYYE 431 Query: 372 HDSFGGFIPVRCWVTDDFKLVLN---LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMH 428 + S P VTD +KLV EL DR+ DP E+ + +D +A +++ Sbjct: 432 YPSPHRVKPHYGVVTDRYKLVHYEGTGEGEWELLDRQVDPQEVKSFHNDPAYAQTMTELK 491 Query: 429 DALLDYMDKIRDPFRSYQWSL 449 D + + D + Sbjct: 492 DEIRRLQKVVDDQTPPPAKAY 512 >UniRef50_Q127E2 Sulfatase n=1 Tax=Polaromonas sp. JS666 RepID=Q127E2_POLSJ Length = 511 Score = 400 bits (1030), Expect = e-110, Method: Composition-based stats. 
Identities = 121/497 (24%), Positives = 194/497 (39%), Gaps = 64/497 (12%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MKRPN L + TD + +G + + T +ID +A G F S T + VC P+RA + Sbjct: 1 MKRPNILLITTDQHRGDCLGFAG-RKVKTPHIDEMARTGTHFTSCITPNIVCQPSRASIL 59 Query: 61 TGIYANQSGPWTNNVAPGK--NISTMGRYFKDAGYHTCYIGKWHLDGH------------ 106 TG+ G N + + + +GY T +IGK H H Sbjct: 60 TGLLPLTHGVCDNGIDLDEARGEAGFAGTLASSGYSTGFIGKAHFSTHHTFAKTGRPECQ 119 Query: 107 ---DYFGTGECPPEWDADYW---------------FDGANYLSELTEKEISLWRNGLNSV 148 +G P ++ G ++ + RN L Sbjct: 120 FSEADYGPAWYGPYMGFEHVELAVEGHNYWLPTPLPGGLHHSRWYYGDGLGEMRNRLYQQ 179 Query: 149 EDLQANHIDETF--------TWAHRISNRAVDFLQQPA-RADEPFLMVVSYDEPHHPFTC 199 + + +TF + I +R ++F+++ A A + F + S+ +PHHPF C Sbjct: 180 DMGPPSGAPQTFNSALPSAWHNSTWIGDRTIEFMRKHAGEAAKRFCLWASFPDPHHPFDC 239 Query: 200 PVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ-----------------AMPSPVGD 242 P + + +L D +P H+ MP+P Sbjct: 240 PEPWSRLHHPDEVDLPAHRTTDFERRPWWHKASMDSKPVGDAAVQALRQNFSRMPTPAEQ 299 Query: 243 DGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMMGAHKLISKGAAM 301 Y+ VD Q+GR+ AL + NT VI+TSDHGE +G H L+ KG Sbjct: 300 QLRNITANYYGMISLVDHQVGRIQTALQQLGLDGNTLVIFTSDHGEWLGDHGLMLKGPIP 359 Query: 302 YDDITRIPLIIRSPQG-ERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPRG 360 Y+ + R+ +++ PQ + PVS +DL T A L G+++ + E Sbjct: 360 YEGVLRVGMVVNGPQVQAGQVRHEPVSTLDLAATFADYATATALAPLHGQSLRPLLEGGQ 419 Query: 361 VMVEFNRYEIE--HDSFGGFIPVRCWVTDDFKLVLNLF-TSDELYDRRNDPNEMHNLIDD 417 +F E G + +R T+++KL L + E+Y DPNEM NL DD Sbjct: 420 QTRDFALSEWNVAASRCGLELQLRTVRTENWKLTLEQNSGAGEMYCLSEDPNEMDNLFDD 479 Query: 418 IRFADVRSKMHDALLDY 434 + R ++ D + Sbjct: 480 PGYTAKRKELSDMIASR 496 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. 
Identities = 101/452 (22%), Positives = 172/452 (38%), Gaps = 57/452 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN L + TD Q N VGCY + + T +ID LA EG+ F Y+ S +CTP+R G+ T Sbjct: 26 ERPNILIIFTDDQGINDVGCYGSE-IPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILT 84 Query: 62 GIYANQS-----GPW------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 G +S G N +T+ + GY T +GKWHL Sbjct: 85 GRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTE-- 142 Query: 111 TGECPPEWDADYWFDGANYLSEL---TEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P D + + T I W + H+ E I+ Sbjct: 143 -SFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQR--------HVSENGYATDLIT 193 Query: 168 NRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPE 227 A FL+ D+PF + +SY+ PH + + + + + + + Sbjct: 194 EEAEHFLKDQQTTDKPFFLFLSYNAPH------------FGKGWSPGDQSPVNIMQARGD 241 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHG 286 + VG + A +DD IGRV+++L + NT VI+ +DHG Sbjct: 242 DLKR--------VGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHG 293 Query: 287 E---MMGAH-KLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALAD 340 G + A +++ R+P IIR P + + +DL PT+ A+ Sbjct: 294 GDYVYGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFAN 353 Query: 341 IEKPE-ILPGENILAVKEPRGV-MVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS 398 ++ L G++I + + +++ + D+K + + Sbjct: 354 VDTDGLTLDGKDISGLLTRQTPVGTRELYWQLGPHAELKRGRWSALRQGDWKYIQDAGGE 413 Query: 399 DELYDRRNDPNEMHNLIDDIRFADVRSKMHDA 430 + L+D + DP E NL +++ + Sbjct: 414 EFLFDLKADPYEKQNLTQSQS--TKLTELQER 443 >UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R457_9PLAN Length = 516 Score = 399 bits (1027), Expect = e-110, Method: Composition-based stats. 
Identities = 104/491 (21%), Positives = 169/491 (34%), Gaps = 92/491 (18%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN +F++ D VGC+ K T +ID+LA +G+R Y+ +PVC P+R L TG+ Sbjct: 33 PNIVFILCDDLGYGDVGCFGQKKTRTPHIDTLARDGMRLIQHYSGAPVCAPSRCVLLTGL 92 Query: 64 YANQSGPWTNNV-------APGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPP 116 ++ S N + T+ GY GKW L G + G P Sbjct: 93 HSGHSQVRDNREAQPEGQYPLAEGTVTLPGLL--EGYVCGAFGKWGLGGPESSGK---PL 147 Query: 117 EWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF---------------- 160 D +F LW N + + F Sbjct: 148 AQGFDRFFGYNCQRQAHNYYPQHLWSNDEKVLLKNPPFAAHQKFPADADPQNPAAFERYR 207 Query: 161 ---TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEK 217 A IS +A+ F+ + +PF + + PH P + L++YA + E Sbjct: 208 GPDYAADLISEQALKFIDE--HHQKPFFLYYASPVPHLALQVPEDSLKEYAGEFSETPYL 265 Query: 218 AQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-N 276 + P Y A +D +IGR++ L + Sbjct: 266 GERGYLPHPTP-------------------RAAYAAMITRMDREIGRILERLEKYGLQRR 306 Query: 277 TWVIYTSDHG------------EMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQV 322 T V+++SD+G A L + ++Y+ R+P I++ P Sbjct: 307 TIVVFSSDNGPLYDKLGGTDADFFQSALDLRGRKGSVYEGGIRVPTIVKFPGVVPAGTTS 366 Query: 323 DTPVSHIDLLPTMMALAD--IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIP 380 T D +PT+++LA + PE G ++ E + F G+ Sbjct: 367 STLGGFEDWMPTLLSLAGMSTKIPEQADGRDLSPSLRGDWQAPR----EFLYREFPGYGG 422 Query: 381 VRCWVTDDFK-----LVLNLFTSD------------ELYDRRNDPNEMHNLIDDIRFADV 423 + + +K LV + T ELYD DP E N+ V Sbjct: 423 QQFVRSGKWKAVRQNLVRPVPTGKKKLAEWKEPLAIELYDLEADPTESTNVAA--EHPKV 480 Query: 424 RSKMHDALLDY 434 +K+H +L Sbjct: 481 VAKLHAIMLRE 491 >UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1I8_9PLAN Length = 427 Score = 399 bits (1027), Expect = e-109, Method: Composition-based stats. 
Identities = 104/470 (22%), Positives = 186/470 (39%), Gaps = 63/470 (13%) Query: 8 FVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQ 67 ++ D V Y + T ID LAAEG+ S VC+P+RA L TG YA++ Sbjct: 2 LILADDLGYGDVSTYHPSDVRTPQIDQLAAEGMLLTSMRANCTVCSPSRAALLTGRYADR 61 Query: 68 SGPW--------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 G + + T+ K GYHT +GKWHL + P E Sbjct: 62 VGVPGVIRTKPEDSWGWFDPTVPTLADELKRVGYHTAIVGKWHLG----LESPNTPNERG 117 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL-QQPA 178 D++ +L ++ + + R G N + I+ ++ A ++L ++ Sbjct: 118 FDFF---QGFLGDMMDSYTTHLRYGNNYMRR-NREVIEPQGHATELFTDWASEYLVERAK 173 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 + ++PF + ++Y+ PH P P E+L K + +L +K ++ Sbjct: 174 QKEQPFFLYLAYNAPHFPIEPPAEWLAKVKERAPQLDQKRAKNV---------------- 217 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGEMM----GAHK 293 A + +D IGRV+ L + NT V++TSD+G + Sbjct: 218 --------------AFVEHLDHSIGRVLKTLKETGLDQNTVVVFTSDNGGSLPHAQNNDP 263 Query: 294 LISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 + YD R+P ++R P + D + DL PT + LA + L + Sbjct: 264 WRDGKQSHYDGGLRVPFMVRWPGQIKAGSRSDYVGLNFDLFPTFLELAGATPSKELDAVS 323 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSF--GGFIPVRCWVTDDFKLVLN-LFTSDELYDRRNDP 408 ++ V + + + Y + + G + ++KL+ N +++ ELY+ +NDP Sbjct: 324 LVPVLKGGKITTSRDLYFVRREGGVTYGGKSYEAIIRGEWKLLQNDPYSALELYNIQNDP 383 Query: 409 NEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARP 458 E +L V +++ AL ++ + W P + P Sbjct: 384 GETKDLAAS--NKKVVNELAAALRLHIQRGGAT----PWQAPPRKPALAP 427 >UniRef50_B9XJI6 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XJI6_9BACT Length = 490 Score = 398 bits (1025), Expect = e-109, Method: Composition-based stats. 
Identities = 102/480 (21%), Positives = 185/480 (38%), Gaps = 31/480 (6%) Query: 3 RPNFLFVMTDTQATNMVGCY-SGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN L ++ D N +G + T N+D LAA G+ F +A +P+C P+RA + Sbjct: 30 KPNILLIIADDL-NNWIGPNKGNPQVKTPNLDKLAARGVTFQNAQASAPLCNPSRASFMS 88 Query: 62 GIYANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G + +G + N ++ + Y + GY + GK + +W Sbjct: 89 GQRPSTTGIYDNQQPAMPHLPRGVCLNDYVRKFGYTSLGAGKIYHYHQ------YREEDW 142 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDF-LQQP 177 D ++ + ++ + + ED +E + + R+V + + Q Sbjct: 143 DKVVFYADDTLPNHPAKRRPGPFGYRM-FTEDKPDAEFNEQRAESELVDARSVSWCIDQL 201 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 + F M PH P+ P +Y + Y +L +DLA+ P +A+ Sbjct: 202 GQQQGAFFMACGVHRPHVPWDVPKKYFDMYPLESVKLPPILTNDLADVPPAGIAFAKPNG 261 Query: 238 SPV----GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAH 292 Y A F D QIGR+++AL + +NT +I+ D+G +G Sbjct: 262 VHQAILKAGVWQDRVRAYLAAISFADAQIGRLLDALDKSKYRDNTIIIFVGDNGWHLGEK 321 Query: 293 KLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 + +K +A++ T +PLI +P + D V + PT+ LA I P+ G Sbjct: 322 EHWAK-SALWRRATNVPLIWVAPGVAKPGTECDRAVDLTSIFPTVCELAGIPTPKHAEGI 380 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 +I + + + TD+++ + S+ELYD++ DPNE Sbjct: 381 SIKPLLKNPSAKWKQPAVTTF------LQNNHAICTDEWRYIRYADGSEELYDQKADPNE 434 Query: 411 MHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQD 470 N A ++ ++ +L + P K AR A ++ Sbjct: 435 WTNQAAKPELAKIKMELAKSLP----SVNAQPVPENPDPGPKGKKARKANRVAAADPEKE 490 >UniRef50_C6VXD2 Sulfatase n=7 Tax=Bacteria RepID=C6VXD2_DYAFD Length = 543 Score = 398 bits (1024), Expect = e-109, Method: Composition-based stats. 
Identities = 103/534 (19%), Positives = 186/534 (34%), Gaps = 89/534 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN + ++ D +GCY G+ + T N+D LA G+RF + Y + C P RA L T Sbjct: 25 KKPNVIVILADDLGYADLGCYGGE-IPTPNLDKLAQSGVRFTNFY-NTARCCPTRAALLT 82 Query: 62 GIYANQSGPWT-----------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFG 110 G+Y++Q+G N T+ K AGY T GKWH+ Sbjct: 83 GVYSHQAGIGHMMDDKGADHPAYRGQLNHNSVTIAEVMKGAGYFTAMSGKWHVG----HQ 138 Query: 111 TGECPPEWDADYW-FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNR 169 G P D A L+ NG D + + + + +N Sbjct: 139 HGVYPSNRGFDRSLHAPAGGFYYAGGNNAKLFLNGQEVTND--STALPKDWYSTDLWTNY 196 Query: 170 AVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHH 229 + F+ + +PF++ ++++ PH P P E + K+ Y + EK + + K Sbjct: 197 GLRFIDEALAEKKPFMLYLAHNAPHFPLQAPEEDIVKFRGKYLKGWEKLRQERYEKQIKL 256 Query: 230 RLWAQAMPSP------------VGDDGLYH---HPLYFACNDFVDDQIGRVINALTPEQR 274 L + P D+ + +Y A +D IG +++ L Sbjct: 257 GLIDPSWKLPPINPNVKRWDSLSDDEKKRYDDIMAIYAAVISRLDKSIGDLVDGLKKRGV 316 Query: 275 -ENTWVIYTSDHGE----------------------MMGAH-------KLISKGAAMYDD 304 +NT +++ SD+G +G + ++ Sbjct: 317 FDNTVILFVSDNGGNAEPGIEGRYQGDKPGNAKSTVFLGQGWAEAACTPFWAYKHHTHEG 376 Query: 305 ITRIPLIIRSPQGERRQVD-----TPVSHIDLLPTMMALADIEKP--------EILPGEN 351 P I+ P G + P ID++ T++ L + P + + G + Sbjct: 377 GISSPGIVSWPAGIPTSRNGKFERQPAHIIDIMATLVDLGNAGYPTTYAGQPIQPMEGAS 436 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEM 411 + + + + F R +KLV +LYD D E+ Sbjct: 437 LKPAFTGKPINRKNPI-------FWEHEGNRAIRDGKWKLVAEKTEKWQLYDVEQDRTEL 489 Query: 412 HNLIDDIRFADVRSKMHDALLDYMDKIRDP--FRSYQWSLRPWRKDARPRWMGA 463 ++ D DV K+ + ++ ++++W + P G Sbjct: 490 NDQFDKQ--PDVAKKLVAKYEAWYKRVGAEEYDKTFKWFYDYNKAKQEPGAAGN 541 >UniRef50_C5BYA8 Sulfatase n=2 Tax=Micrococcineae RepID=C5BYA8_BEUC1 Length = 478 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. 
Identities = 109/483 (22%), Positives = 182/483 (37%), Gaps = 63/483 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN ++TD A + VG Y +T ID +A G R ++ + + +CTP+RA + T Sbjct: 3 RRPNICLILTDDHAAHAVGTYGSVVNSTPRIDEIAQRGWRLDNLFCTNSICTPSRASILT 62 Query: 62 GIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 G +++ +G T + + + T KDAGY T +GKWHL + D Sbjct: 63 GQHSHTNGVRTLSTPMDRELPTFVSQLKDAGYRTAIVGKWHLGEGEEHRP------RAFD 116 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP---- 177 +W + + T I++ A+ +L Sbjct: 117 HWMILRDQGEYH------------DPTFRTPDGLRTVTGYATDVITDLALQWLDDLDYGP 164 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 D P+ +++ + PH + + ++A + DD A + A + Sbjct: 165 DGTDSPWCLLIHHKAPHRSWEPDEAHRAQFAGRPIPVPATFTDDYATRSGAAHRAAMRVA 224 Query: 238 SPVG-------------------DDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENT 277 + + Y AC VDD +GRVI+ L ++T Sbjct: 225 DQLTRRDLKADPPAGLSYEDEALWKYQRYMEDYLACVASVDDNVGRVIDRLAERGELDDT 284 Query: 278 WVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTM 335 ++YTSD G +G H K MYD+ R+P ++ P R D V+++DL T+ Sbjct: 285 LLMYTSDQGFFLGDHGWFDK-RFMYDESIRMPFVVSCPTALDGGRSTDQIVTNVDLARTI 343 Query: 336 MALADIEKPEILPGENILAVK-EPRGVMVE---FNRYEIEHDSFGGFIPVRCWVTDDFKL 391 + AD+E + GE+ + + RY D T+ +KL Sbjct: 344 LEAADVEPHPGMQGESFWGTLARGETPPADQSFYYRYWEHDDGAHHAAGHYGIRTERYKL 403 Query: 392 VLNLFT--------------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 + ELYD DP+E+ N+ DD +A VR + + L Sbjct: 404 IYFYNDGLGLPGTGWATYAPEWELYDLEADPDELVNVADDPTYAVVRRDLTERLAREQAA 463 Query: 438 IRD 440 D Sbjct: 464 AGD 466 >UniRef50_C9L4R7 Putative sulfatase YidJ n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4R7_RUMHA Length = 458 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. 
Identities = 119/453 (26%), Positives = 193/453 (42%), Gaps = 30/453 (6%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L + D +++ CY GK + T NID LA G+ + YT S VCTP+R FTG Y Sbjct: 3 NVLIIHVDQLRRDVLSCYGGKEVQTPNIDFLAENGVLLENFYTPSAVCTPSRGCFFTGNY 62 Query: 65 ANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTG-ECPPEWDADYW 123 +++G + N + +++ F AGYHT Y+GKWHL H G + W Sbjct: 63 PHENGAYRNGIPVKRDVHGFAEVFAKAGYHTGYLGKWHLADHKERGDDLGEYNPLGFEDW 122 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEP 183 +Y E + + NG + N + +++ + FL ++ +P Sbjct: 123 ----DYKVEFGHCKSVAYENGKVRPKREVGN---DKSYTTDWLTDETIRFLNNQLKSTQP 175 Query: 184 FLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKA----QDDLANKPEHHRLWAQAMPSP 239 FL VS +PH PF Y + E+ E D A + RL Sbjct: 176 FLFTVSIPDPHQPFEVRPPYDTMFDPLKVEIPESFWEKEIPDWAERDTWGRLHYYPYGLF 235 Query: 240 VGD-DGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISK 297 + Y +DD +GR+I L ENT V++T+DHGE MG H L+ K Sbjct: 236 EREGHLRRLKAQYLGAVKCIDDNVGRIIQCLKDTGLWENTMVVFTTDHGEYMGEHGLMEK 295 Query: 298 GAAMYDDITRIPLIIR--SPQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAV 355 +Y+ + IP +I + + R+ +T ++ +D PT+ + I P + G+++ Sbjct: 296 -NNLYESVYHIPCVISMPWKKIQERRCNTWINVVDFAPTLAGMLGIPYPFKVQGKDLSTY 354 Query: 356 KEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDF--KLVLNLFTSDE-----LYDRRNDP 408 E Y D +P +T +F V + +E L+D R DP Sbjct: 355 LL-ENRETEQILYIHPSD-----VPRAGILTPEFELAYVGKGWCEEEFHDHILFDMRKDP 408 Query: 409 NEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 +M N+ +A V+ + + L + ++I P Sbjct: 409 LQMTNVFGKPEYAKVQKMLTEKLKRHFEEIGTP 441 >UniRef50_C6LAI4 Arylsulfatase n=6 Tax=Bacteria RepID=C6LAI4_9FIRM Length = 481 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 111/467 (23%), Positives = 193/467 (41%), Gaps = 34/467 (7%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN LF+MTD + +G + T +D+LAA+G+ F++AY+ P C PARA L T Sbjct: 4 KKPNILFIMTDQLRGDCLGIAGHPDVKTPYLDTLAAKGVLFSNAYSACPSCIPARAALHT 63 Query: 62 GIYANQSGPWTNNVAPGKN-ISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G+ T+ AGY+T +GK H+ + D Sbjct: 64 GMLPEHHRRVGYQDGIAWRYEHTLAGELSRAGYYTQCVGKMHVHPLRNYLGFHNVELHDG 123 Query: 121 DYWF----------------DGANYLSELTEKEISLWRNGLNSVED-LQANHIDETFTWA 163 + D +L E +GL+ + +E + Sbjct: 124 YLHYARYGSVPYRESQHVADDYYYWLKEQKGISADPMESGLDCNSWVARPFPYEEKYHPT 183 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 + +++R++DFL++ D+PF ++ SY PH PF P Y + Y D + Sbjct: 184 NWVTDRSIDFLRRR-DPDQPFFLMASYLRPHPPFDAPAYYFDLYKDKKLTPPYVGDWEDT 242 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHP-LYFACNDFVDDQIGRVINALTPEQREN-TWVIY 281 + ++ P ++ + Y+AC +D QIGR++ ALT + +N T + + Sbjct: 243 KLLKERGRIFDSLTGPEDEELIRQAQIGYYACITHLDHQIGRLLMALTEHELQNDTMIFF 302 Query: 282 TSDHGEMMGAHKLISKGAAMYDDITRIPLIIRS-PQGER----RQVDTPVSHIDLLPTMM 336 T+DHGE + H K + Y IPLII P+ D D++PT++ Sbjct: 303 TADHGEELCDHHHFRK-SLPYQGSIHIPLIISGNPELTGFAPHSVCDEVTELCDIMPTLL 361 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 +A + P+ + G+++LA + G Y S+G D K Sbjct: 362 DIAGADIPDRVDGKSLLAFADGEGRE-----YLHGEHSYGELSNHYIVTKKD-KFCWFST 415 Query: 397 TSDE-LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 + E + DP+E+H+ I+D + + + + L+ + + F Sbjct: 416 SGTEHYFVLEEDPHELHDRIEDPACRERIAYLRNCLIRELTGRPEGF 462 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 107/472 (22%), Positives = 192/472 (40%), Gaps = 76/472 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++ D A + G P T N+D A+E I+F+ AY+ S VC PARA L T Sbjct: 54 RRPNVIVILADDLAVGDLAGGDGSPTRTPNLDRFASESIQFSQAYSGSCVCAPARAALLT 113 Query: 62 GIYANQSGPWTNN-------VAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G Y +++G T N ++ +T+ KDAGY T +GKWH D F + Sbjct: 114 GRYPHRTGVVTLNMNRYPEMTRLRRDETTIADVLKDAGYATGLVGKWHTGRGDGFHPLD- 172 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 +D F G++ + E Q + +DE+ ++ RA++F+ Sbjct: 173 -RGFDEFEGFFGSDDVGYFRY----------PFSEQRQISDVDES-YLTDDLNRRAIEFV 220 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 ++ PF + +++ PH P P E + +Y + ++ Sbjct: 221 RRHHEH--PFFLHLAHYAPHRPLEAPPEVIARYREQGFD--------------------- 257 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGE--MMGA 291 +A + +D IG ++ + E+T V++ SD+G + G Sbjct: 258 -----------ESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFASDNGPDPLTGE 306 Query: 292 H---KLISKGAAMYDDITRIPLIIRS-PQGERRQVDTPVSHIDLLPTMMALADIEKP--E 345 +L + + R+PL +R + Q D V+ +DL+PT++ L ++ Sbjct: 307 RFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDLMPTILDLCRVDVSMLN 366 Query: 346 ILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNL-------FTS 398 L GE+ + V E + R+ + + + +KLV S Sbjct: 367 RLDGESFVPVLEDASIAHSTMRFWQWNRASPNYTHNAAVRHGRYKLVRPYVTRGAKLKDS 426 Query: 399 DE---LYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR-DPFRSYQ 446 E L+D +NDP E ++ ++ D+ +M L + + D R + Sbjct: 427 TEPSVLFDLQNDPTESRDVSK--QYPDIAERMSRELDRWSASVETDRIRPVK 476 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 96/470 (20%), Positives = 161/470 (34%), Gaps = 77/470 (16%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN +F+ +D G + + T N+D LA+EG+RF Y C P+RAG+ T Sbjct: 25 KQPNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASEGVRFTQGYVSDSTCGPSRAGIMT 84 Query: 62 GIYANQSG---------------PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGH 106 G Y + G + + TMG Y K GY T + GKWHL G Sbjct: 85 GRYQQKFGYEEINVPGYMSEHSAIKGAEMGIPLDEVTMGDYMKSLGYRTAFYGKWHLGGT 144 Query: 107 DYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDE-----TFT 161 D P D ++ E++ D + H + Sbjct: 145 DELH----PMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEGY 200 Query: 162 WAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDD 221 ++ +A F+++ D+PF + +S++ H P E L K+ Sbjct: 201 LTDVLAEKANQFIEKA--PDKPFFIFLSFNAVHTPMEATPEDLAKFP------------- 245 Query: 222 LANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVI 280 A +D G V+N L ++T V+ Sbjct: 246 ---------------------QLKGKRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVV 284 Query: 281 YTSDHGE-----MMGAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLP 333 +++D+G + L + + R+P +++ P + D PVS +DLLP Sbjct: 285 FSNDNGGPTDKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLP 344 Query: 334 TMMALADIE-KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV 392 T E L G +++ + + D+KL+ Sbjct: 345 TFFKAGGGEEVMSELDGVDLMPYITGQNNKAPHE------SMYWKKETRAAIRQGDWKLL 398 Query: 393 LNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 ELY+ ND E HNL + + +M+ + + P Sbjct: 399 RFPDRPAELYNLANDIGEQHNLAA--QEPERVKQMYKDFFSWEMTLERPL 446 >UniRef50_C6VYV1 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VYV1_DYAFD Length = 506 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 107/490 (21%), Positives = 178/490 (36%), Gaps = 64/490 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSG---KPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAG 58 K+PN L++M+D + G Y G + +I+ LAA G +A+ + +C P+RA Sbjct: 22 KKPNILYIMSDDHTSQAWGIYGGILKDYVKNDHIEWLAANGATLGNAFCTNSICVPSRAA 81 Query: 59 LFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 + TG Y++++G + + + + + K AGY T IGKWHL Sbjct: 82 ILTGRYSHRNGVYDLSDSLSPDSLNYAKLLKTAGYQTALIGKWHLVKEPA---------- 131 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 DY+ + + ED + + I+++++ +L++ Sbjct: 132 GFDYYCVLPGQG----RYRNPIMMTKEDFREDQKGGKV-YEGYSTDVITDQSIAWLEKR- 185 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA-NKPEHHRLWAQAMP 237 +PF + + H PF P Y D D A H W + Sbjct: 186 DKSKPFYLSTHFKATHEPFDYPKRYENYLEDVEIPYPADFADRGATGSGRTHDGWPLDLL 245 Query: 238 SPVGDDG-------------------------LYHHPLYFACNDFVDDQIGRVINALTPE 272 + G Y C VDD IGR+I L Sbjct: 246 GTRYEKGTGKEYPGHSFSLQGLDSVAARKKIYQKFVKDYIRCGAAVDDNIGRLIQYLKDA 305 Query: 273 Q-RENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHI 329 +NT +IYTSD G +G H K +Y+ R+P +I P+ ++V+ + +I Sbjct: 306 GELDNTIIIYTSDQGYFLGEHGFFDK-RFIYEPSIRMPFVISYPKEIPKGKRVNDLILNI 364 Query: 330 DLLPTMMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVR-CWVTDD 388 D + A I P + G++ + + + + P TD Sbjct: 365 DFASLFLDYAGIAPPASMQGKSFRKNLQGKTPAHWRKDIYYRYWANEPNRPAHFGIRTDR 424 Query: 389 FKLVLNLFT--------------SDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDY 434 +KL E YD +NDP E N I D ++ D+ +K+ L D Sbjct: 425 YKLAFFYGQSRTKTARDNMKYPPGWEFYDLKNDPGEDRNAILDPQYKDIIAKLKARLKDI 484 Query: 435 MDKIRDPFRS 444 + D S Sbjct: 485 KKESGDGVES 494 >UniRef50_C7PRW9 Sulfatase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRW9_CHIPD Length = 460 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 92/465 (19%), Positives = 164/465 (35%), Gaps = 46/465 (9%) Query: 3 RPNFLFVMTDTQATNMVGCYSGK-PLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN +F++ D + Y+ K P+ T NID L EGI+F + Y+ + VC P+R L T Sbjct: 19 KPNIIFILADDLGYGNISAYNSKSPVKTPNIDRLGQEGIQFKNFYSGNTVCAPSRCALLT 78 Query: 62 GIYANQSGPWTN-NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 G + + N + ST+ + + GY T GKW L GT P Sbjct: 79 GKHMGHAYIRGNTRLPLRAEDSTLAQLLQGNGYRTGMFGKWGLG---ESGTTGSPEIKGF 135 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARA 180 D +F N L+ + + D I A+ F+ Sbjct: 136 DTFFGYLNQQHAHNYYTDYLFEVKEGQISRV---PRDTNVYSQDEILQHALSFIND--NK 190 Query: 181 DEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPV 240 D+PF + + + PH P ++ + + + K +R Sbjct: 191 DKPFFLFLPFTLPHAELAPPATDMQAFLNADGSSKLGPETPYERKNGTYRSQENP----- 245 Query: 241 GDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMM---------- 289 H + A +D +G + + +NT++ +TSD+G Sbjct: 246 -------HAAFAAMVTKLDRNVGEISALIKQLGLDDNTYIFFTSDNGPHREGGADPIYFD 298 Query: 290 GAHKLISKGAAMYDDITRIPLIIRSPQG--ERRQVDTPVSHIDLLPTMMALADIEKPEIL 347 L +Y+ R+PL++R+P + P + D+LPT+ + + Sbjct: 299 SNGPLKGIKRDLYEGGIRVPLLVRAPGKVSAGQVSTIPWAFWDVLPTLSDITHSPVLSGI 358 Query: 348 PGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF----TSDELYD 403 G + + + + + + G + DD+KL+ ELY Sbjct: 359 DGLSYTKALNGTKPARQHDHFYWQFNEGGL---QEALLKDDWKLIRFKKRGTPERFELYH 415 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWS 448 D E H+L ++ + +L K+ + WS Sbjct: 416 LSEDIGEEHDLA--TKYPQKVKALSGLMLQ--SKMPAENPEFDWS 456 >UniRef50_A6DR18 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DR18_9BACT Length = 543 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 97/509 (19%), Positives = 184/509 (36%), Gaps = 87/509 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD + +GCY G+ + T N+D LA +G+RF Y + C P RA L TG Sbjct: 41 KPNIIIIMTDDMGFSDLGCYGGE-IETPNLDMLANKGVRFTQFY-NAGRCCPTRASLLTG 98 Query: 63 IYANQSGP-----------WTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 +Y +Q+G + T K AGY+T GKWH+ Sbjct: 99 LYQHQAGIGGMMGDRGAEWPGFRGHLTERCVTFAEVLKTAGYNTYQTGKWHVGDKKK--- 155 Query: 112 GECPPEWDADYWFD---GANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 P D+ + G + + + + V Q N + ++ Sbjct: 156 EWWPLARGFDHSYSCPQGGGFFFKPSSFKEKRQVVRDTEVLYDQKNDPPADWYATDAWTD 215 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQD-------D 221 + F++ A+ + PF+ ++++ PH P + + KY + + +K ++ D Sbjct: 216 EGLKFIESEAKENRPFIWYLAHNAPHFPLQAKPQDIAKYRGKFMQGWDKLREQRHKRLID 275 Query: 222 LANKPEHHRLWAQAMPSPVGD--------DGLYHHPLYFACNDFVDDQIGRVINALTP-E 272 L + +L + P D Y A D VD +G++I L Sbjct: 276 LGIIDKQWKLSPREKGIPAWDSLSGKEKYQQDLRMASYAAMIDCVDQNVGKIITKLKELN 335 Query: 273 QRENTWVIYTSDH-----GEMMGAH-------------------------KLISKGAAMY 302 Q +NT +++ D+ G MG + ++ Sbjct: 336 QYDNTLILFLHDNGGCDAGGAMGENTGKGTCGTAKSFAYYGACWANVSNTPFRKYKKYIH 395 Query: 303 DDITRIPLIIRSPQGERRQ-----VDTPVSHIDLLPTMMALADIEKPE--------ILPG 349 + PLI P+G ++ + P ID++ + + L+ P + G Sbjct: 396 EGGISTPLIAHWPEGIAKKLQGKLITEPAHVIDIMASCVDLSGATYPTSFKGHAIIPMEG 455 Query: 350 ENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 ++ + E + + + + R +KLV ELYD +D Sbjct: 456 TSLRPLFEGKSLERNDGLFFEHY-------GHRGVRRGSWKLVATRQGKWELYDMVSDRT 508 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKI 438 E+++L + + ++ + ++ Sbjct: 509 ELNDLSSKM--PEKVKELSRLYNKWTERC 535 >UniRef50_A6C2T4 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C2T4_9PLAN Length = 493 Score = 397 bits (1022), Expect = e-109, Method: Composition-based stats. 
Identities = 111/499 (22%), Positives = 176/499 (35%), Gaps = 67/499 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + +MTD +GCY + + T +ID LA EG F A+ + VC+P RA T Sbjct: 31 QRPNVVIIMTDNHGEWTLGCYGNQDIKTPHIDQLAKEGTLFTRAFANNAVCSPTRASFLT 90 Query: 62 GIYANQSGPW-----TNNVAPG-----KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 G+ Q G P + ++ + DAGY GKWHL + Y Sbjct: 91 GLMPCQHGVHCFLRTRIQTGPDSFNTLEEFQSIPQVLHDAGYVCGLSGKWHLGDNLY--- 147 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P+ YW + S + E + + Sbjct: 148 ----PQEGFSYWITKPHGGSAGFY----------DQNVIENEKIRKEPTYLTDLWTQHGI 193 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPF----TCPVEYLEKYADFYYELGEKAQDDLANKPE 227 F++Q ++PF + ++Y+ P+ ++ Y ++ + +P Sbjct: 194 RFIKQ--NQEKPFFLFLAYN---GPYGLGSAMKEPIRNRFKAEYEKMTFPSFPREKAQP- 247 Query: 228 HHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG 286 W +GD G+ Y A VDD +G+++ L ENT VI+T+D G Sbjct: 248 ----WNFNYGDWIGDLGI--IRKYAAEVSAVDDGVGQIMQTLKDLGLRENTLVIFTADQG 301 Query: 287 EMMGAHKLISKG-----AAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALA 339 G G +D IPLI P + D V++ D+ PT++ Sbjct: 302 LSGGHSGYWGMGDHTRPLTAFDWTMTIPLIFSQPGKIVSGARQDMMVANYDVYPTLLNYL 361 Query: 340 D----IEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLV-LN 394 I PG N V + + + + F VR T D+K + Sbjct: 362 GLQDKIPAKPATPGRNFAPVLKGEQIPWDEVVFY-------EFENVRAIRTKDWKYIERY 414 Query: 395 LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP----FRSYQWSLR 450 + +ELY D E N ID R ++ L + K DP ++ + Sbjct: 415 RESPNELYHLVTDSREHRNRIDQPASKQTRKELKQRLDQFFSKYADPQWDIWKGGKSKTI 474 Query: 451 PWRKDARPRWMGAFRPRPQ 469 K P P Q Sbjct: 475 LMTKQLFPDSYLYLPPSKQ 493 >UniRef50_C5HLB2 Putative sulfatase n=1 Tax=uncultured bacterium FLS12 RepID=C5HLB2_9BACT Length = 503 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. 
Identities = 120/467 (25%), Positives = 198/467 (42%), Gaps = 35/467 (7%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + VM D + Y G L T + D LA+EG F+ A T SP+CTP+R +TG Sbjct: 9 PNLVMVMVDQLQAQRMKLYGGTDLLTPHFDRLASEGALFSQAITTSPLCTPSRISFWTGQ 68 Query: 64 YANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYW 123 Y + G N P ++ + K AGYHT IGK H + WDA + Sbjct: 69 YPSAVGGMNNGPLPLTDVPHLPGMLKAAGYHTALIGKNHCFRGEVVADLFDAT-WDAGHG 127 Query: 124 FDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEP 183 + + + + + T R + + +L++ D+P Sbjct: 128 GAQGGKDDPDILAYERTAQLEFLRMCHGRIVDLPDHVTTTARATKNGLAWLEEQ--GDDP 185 Query: 184 FLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP--VG 241 F + +SY EPH PF + + Y L E + D+++KP H + + M +P Sbjct: 186 FFLWLSYPEPHSPFVTTRNWADLYDPAKLTLPESWRSDISDKPAHFQELHELMGAPAVSD 245 Query: 242 DDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKLISKGAA 300 D+ +Y+ +DD +G+V++ L + ++T V++ SDHGE +G++ ++ K A Sbjct: 246 DELRELTQIYYGMASQIDDGLGQVLDCLERKGLADDTIVVFVSDHGEYIGSNYMLQKSAH 305 Query: 301 MYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEP 358 + + + R+PL IR P D PV H D++PT+ L + P+ + ++ + + Sbjct: 306 LPEALIRVPLAIRWPGHVPSGAVYDDPVEHHDMMPTLCTLMGFDVPDSVQAADLTPLFDG 365 Query: 359 RGVMVEFNRYEIEHDS--------------------------FGGFIPV-RCWVTDDFKL 391 + + EI H + FG + V R T K Sbjct: 366 KPFARDAAYSEIGHHADREMTREKTYAPDLPWAEARAFYHFVFGHYAHVGRGIRTRTHKY 425 Query: 392 VLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 V + ELYD NDP EM NL A++ + + L + D Sbjct: 426 VAYEYGEKELYDLANDPEEMVNLAGKPAAAEIEADLAARLEAWSDAH 472 >UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J5W3_9FLAO Length = 468 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. 
Identities = 96/452 (21%), Positives = 169/452 (37%), Gaps = 60/452 (13%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 K+PN +F++ D N +G Y GK + T NID LA EG++F++ Y S +C P+R L T Sbjct: 27 KKPNIVFILADDMGYNELGSYGGKIIETPNIDQLAKEGMKFSNHYCGSNICAPSRGTLMT 86 Query: 62 GIYANQSGPWTN-------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGEC 114 G + + N N + T+ K AGY T GKW L G+ Sbjct: 87 GKHTGHAYIRDNKPLPYEGNEPIPASEITVAEILKTAGYTTGAFGKWGLGYPASEGS--- 143 Query: 115 PPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFL 174 P D ++ + L +N L VE + A I +RA++F+ Sbjct: 144 PNNQGFDQFYGYNGQIHAHNYFTSYLRKNDL--VELNANIDAPYSVYSADIIKDRALEFV 201 Query: 175 QQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQ 234 + + PF + PH+P+ P +K + Sbjct: 202 E--VNKNNPFFLYFCPTLPHNPYHQP----------------------DDKTLEYYAKKT 237 Query: 235 AMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTP-EQRENTWVIYTSDHGEMM---- 289 P + P Y A + +D Q+G ++ L +NT +I+ SD+G + Sbjct: 238 GFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSALTKEE 297 Query: 290 -----GAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIE 342 L + + +Y+ + PLI + + D LPT + + Sbjct: 298 DSYLRTGGDLRGRKSEVYEGGIKSPLIAFWKGKIIPGSSSNHISAFWDFLPTCAEIVKAK 357 Query: 343 KPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFK--LVLNLFT--- 397 P+ + G + L + + + Y + D K V + + Sbjct: 358 TPDNIDGISYLPTLLGKTDNQKQHDYLY-----WERSQSQAIRKGDMKANFVYDKTSQKQ 412 Query: 398 SDELYDRRNDPNEMHNLIDDIRFADVRSKMHD 429 + E+Y+ DP E +NL + + +++++ Sbjct: 413 NIEIYNLAQDPFEKNNLAETM--PELKAEFIK 442 >UniRef50_D2R203 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R203_9PLAN Length = 490 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. 
Identities = 109/453 (24%), Positives = 182/453 (40%), Gaps = 24/453 (5%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCS----PVCTPARAG 58 +PN +F+ D + + + T N+D LA +G F+ AY VC +R+ Sbjct: 37 KPNVVFLFADDLSYEALAYAGNGQVKTPNLDRLAKQGTSFSHAYNMGSFSPAVCIASRSM 96 Query: 59 LFTGIYANQSG-PWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPE 117 L TG ++ KN+ R AGY T GKWH+ + Sbjct: 97 LVTGRSVWKAQTLHAAGGKEPKNVVLWPRQMHGAGYQTFITGKWHVPWNPMLAFDVTAHV 156 Query: 118 WDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 + ++ S + + + W+ ++ AV+F Sbjct: 157 RG-----GMPKDVPSFYDRPHSDKPDTFDPANPGNGGYWQGGKHWSEVTADDAVEFFSAS 211 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK-----PEHHRLW 232 P M V+++ PH P P YL++Y E+ + Q + E R Sbjct: 212 RDKSRPCFMYVAFNAPHDPRQAPQTYLDRYPTETIEVPKDFQPLYPERASIGADEKLRDE 271 Query: 233 AQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQREN-TWVIYTSDHGEMMGA 291 A H Y+A +DDQIGR+++A+ + + T V++T+DHG G Sbjct: 272 KLAPFPRTEFAVRTHRREYYALITHLDDQIGRILDAIEQTKSDRPTMVMFTADHGLACGH 331 Query: 292 HKLISKGAAMYDDITRIPLIIRSPQGE-RRQVDTPVSHIDLLPTMMALADIEKPEILPGE 350 H L+ K MYD R+PLII + +D PV D++PT + LA + + Sbjct: 332 HGLMGK-QNMYDHSIRVPLIIAGENIPQGKTIDVPVYLQDVMPTALELAGVAPGPEVHFH 390 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS-DELYDRRNDPN 409 ++L + + + + + RC V D FKLV+ +L+D ++DP Sbjct: 391 SLLPIVRGEQKVSNYPAIYSSYLNL-----QRCVVKDGFKLVVYPALPAAKLFDLQHDPL 445 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 E+ +L D A + ++ DAL+ + I DP Sbjct: 446 ELSDLSADPNHATRKEQLFDALVAEAESISDPL 478 >UniRef50_Q7UVD9 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UVD9_RHOBA Length = 564 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. 
Identities = 117/493 (23%), Positives = 180/493 (36%), Gaps = 88/493 (17%) Query: 3 RPNFLFVMTDTQATNMVG-------CYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPA 55 +PN + V+TD QA +T N+D LAAEG F + + +PVC+PA Sbjct: 101 KPNVVLVLTDDQAPWAFAEAVRSGQFSDVPIPSTPNMDRLAAEGAVFRNFFCTTPVCSPA 160 Query: 56 RAGLFTGIYANQSGP-----------WTNNVAP---GKNISTMGRYFKDAGYHTCYIGKW 101 RA L TG YA++ G + + N T + GY T +GKW Sbjct: 161 RATLMTGRYASELGIKDFIPQPGHKLYDPDSPIHLDPDNTVTFAEVMQQQGYTTGLVGKW 220 Query: 102 HLDGHDYFGT-GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF 160 HL G G+ P D + + N +L Sbjct: 221 HLGDWTANGDSGKHPTRHGFDSFMGLTGGGTTPD-----------NPELELNGKVQQFQG 269 Query: 161 TWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTC-PVEYLEKYADFYYELGEKAQ 219 +++ A+DF++Q AD PF + +S PH + E + Y + + Sbjct: 270 LTTDILTDHAIDFVEQ--NADRPFFLCLSTRAPHGRWLPVAPEDWQPYEEMDPTIP---- 323 Query: 220 DDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTW 278 P D Y A VD +GR++ L ++ NT Sbjct: 324 ---------------QYPDLDTDWVRKKMKEYLASTSGVDRNLGRLLKTLDAQELTSNTI 368 Query: 279 VIYTSDHGEMMGAHKLISKG--------------------------AAMYDDITRIPLII 312 VI+TSDHG MG H + KG +YD R+P I+ Sbjct: 369 VIFTSDHGFNMGHHGIYHKGNGIWATRQKPPGKFHQGTRVISDKYRPNLYDHSLRVPAIV 428 Query: 313 RSPQ--GERRQVDTPVSHIDLLPTMMALAD-IEKPEILPGENILAVKEPRGVMVEFNRYE 369 R P ++ SH+D PT+ A+A + LPG ++ + + Sbjct: 429 RWPGVVKPSAVIEATASHLDWFPTLCAIAGDGSSAKDLPGRDLSPLLKGELQDDWDQAQY 488 Query: 370 IEHDSFGGFIPV-RCWVTDDFKLVLNLFTS--DELYDRRNDPNEMHNLIDDIRFADVRSK 426 E+D + R + T ++KL+ + DE YD DP+E NLI + V + Sbjct: 489 FEYDMINYAVASLRGYRTPEYKLIRDRHNEGCDEFYDLTTDPDETVNLIRNPGSQAVIKR 548 Query: 427 MHDALLDYMDKIR 439 + L K+ Sbjct: 549 LDAKLRAMEKKLE 561 >UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9LGQ4_9BACT Length = 608 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. 
Identities = 100/466 (21%), Positives = 166/466 (35%), Gaps = 68/466 (14%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +RPN + ++D Q C + + T NIDSLA +G+ F + + PVC+P RA T Sbjct: 43 QRPNVIVFLSDDQGWGDFSCTGNQSVATPNIDSLATQGLLFENFFV-CPVCSPTRAEFLT 101 Query: 62 GIY-ANQS--GPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y + G + +T+ AGY T GKWH P Sbjct: 102 GRYHPQSNVKGVSQGQERIDLDETTIADCLSQAGYATAAFGKWHNG----MQYPYHPCGR 157 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D ++ + W N N + + +NRA+ F++ Sbjct: 158 GFDDFYGFCSG----------HWGNYFNPTLEHNGRIVKGEGYINDDFTNRALKFIED-- 205 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 +PF + + Y+ PH P P Y +++A+ Sbjct: 206 HKSQPFFLYLPYNTPHWPPQMPDAYWQRFAEKEI---------------------VQRGQ 244 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMM--GAHKLI 295 + L A + +D +GRV+ L + +NT VIY +D+G + Sbjct: 245 KGDKEDLAKTRSALAMVENIDWNVGRVLAKLDELKIADNTIVIYFNDNGPNSNRWNAGMK 304 Query: 296 SKGAAMYDDITRIPLIIRSPQ---GERRQVDTPVSHIDLLPTMMALADIEK--PEILPGE 350 K + + R PL +R P G R+V+ IDL PT++A +IL G+ Sbjct: 305 GKKGSTDEGGVRSPLFVRWPNGVKGAGRRVNQICGAIDLYPTLLAATGSANVGDKILDGK 364 Query: 351 NILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNE 410 N+L + + F + T F+L + L+D DP++ Sbjct: 365 NLLPIWDGSETN------LGFRMLFSYWRGKASVRTQQFRL----DNNGWLFDMLTDPHQ 414 Query: 411 MHNLIDDIRFADVRS-------KMHDALLDYMDKIRDPFRSYQWSL 449 ++ D V + + + MD + PF Sbjct: 415 TKDISSDQ--PAVAALLLGSLIRFKQEMEAEMDSTKRPFSVGHPDF 458 >UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacteroidetes RepID=C6I6Z4_9BACE Length = 504 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. 
Identities = 100/498 (20%), Positives = 175/498 (35%), Gaps = 95/498 (19%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN ++++ D +GCY + + T NID L +GI F YT SPV PAR L T Sbjct: 24 SRPNVIYIIMDDLGYGDIGCYGSEKIETPNIDRLYKDGISFTQHYTGSPVSAPARCVLMT 83 Query: 62 GIYANQSGPWTNNV-----------------------APGKNISTMGRYFKDAGYHTCYI 98 G+++ + N+ + T+GR + AGY T Sbjct: 84 GMHSGHAQIRANDEMAYRGAIMNYDSMYVHPGLEGQYPLKAHTMTLGRMMQQAGYVTGCF 143 Query: 99 GKWHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRN--------------- 143 GKW L GT P + D ++ + L++N Sbjct: 144 GKWGLGAP---GTEGTPNKQGFDSFYGYNCQRQAHSYYPAFLYKNEDRVYLANKVLDPHT 200 Query: 144 -----GLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFT 198 G + ++ + I + + F+ Q +PF ++ + PH Sbjct: 201 TKLDAGADPRDEAAYAKFSQKEYANDLIFDELISFVGQ--NRKKPFFLMWTTPLPHVSLQ 258 Query: 199 CPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFV 258 P ++++ Y + P + P H Y A + Sbjct: 259 APEKWVKYYVGKF----------GDEAPYIGKAGYMPCRYP--------HATYAAMISYF 300 Query: 259 DDQIGRVINAL-TPEQRENTWVIYTSDH----------------GEMMGAHKLISKGAAM 301 D+QIG++I L +NT +++TSD+ G + + Sbjct: 301 DEQIGKLIEKLKKERLYDNTVIMFTSDNGPTFNGGSDSPWFDSGGPFRSEYGW--GKCFV 358 Query: 302 YDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKEPR 359 ++ RIP I+ P + Q D D++PT+ +A+I PE G + L Sbjct: 359 HEGGIRIPAIVTWPGKIKPSTQSDHICGFQDVMPTLADIANIACPET-DGISFLPALLGE 417 Query: 360 GVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN----LFTSDELYDRRNDPNEMHNLI 415 + + Y I ++ +K ++N ++ ELYD +D E H++ Sbjct: 418 TERQKEHEYLYWEYPDP-TIGLKAIRMGKWKGIVNNIRKGNSTMELYDLESDLREEHDVA 476 Query: 416 DDIRFADVRSKMHDALLD 433 D+ K+ + Sbjct: 477 A--EHPDIVRKLTRLMEK 492 >UniRef50_A6DHY1 Mucin-desulfating sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHY1_9BACT Length = 545 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. 
Identities = 102/454 (22%), Positives = 184/454 (40%), Gaps = 30/454 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 RPN + ++TD Q + +GC + T +ID L+ G+ F+S YT +P+C +RA TG Sbjct: 21 RPNIIMLLTDDQRYDTLGCMGNDQVKTPHIDKLSERGVTFDSHYTNTPICLGSRASTMTG 80 Query: 63 IYANQSGPWTNNVAPGK---NISTMGRYFKDAGYHTCYIGKWHLDGHDY-FGTGECPPEW 118 +Y +G ++ + + + ++ GY T +IGK+ + + E P Sbjct: 81 MYEYTNGCNFSHGFLSQELWDEMSYPVILRNNGYFTGFIGKFGFPVNAKNYHEYENLPID 140 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D W+ T K + + E + A +F+ + Sbjct: 141 SFDRWYGWTGQGYFDTSKNKYMVK------------FAKEYPHVTLATAEAACEFIDEAQ 188 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK-PEHHRLWAQAMP 237 + D+PF + +S+ H PF+ Y + Y D ++ + A K P +L Q + Sbjct: 189 KQDKPFCLSLSFKASHKPFSPDPAYDDVYKDTVWKKRANYDEGGARKLPPQAKLGRQYLT 248 Query: 238 --SPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKL 294 + Y +D +G+++ L +NT +IY +D+G G+H Sbjct: 249 IDDFAPEKYQESMRKYNQLIYGIDQAVGKIVEKLDQTGLSKNTVIIYATDNGYSCGSHG- 307 Query: 295 ISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMALADIEKPEILPGENI 352 Y+ R P+II P+ ++ ++D+ PT+ LA I P + G+++ Sbjct: 308 FGGKVLPYEGPARGPMIIMDPRSDQTGKRSKGVSGNVDIHPTICDLAGIAIPAKVDGKSL 367 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTS------DELYDRRN 406 L V + + V + T+D+K + F +ELY R Sbjct: 368 LPVLKDSEIRVRKAMPVFNFWGSAATHEMTMV-TEDYKYIYWYFEGDGMVAAEELYHRHK 426 Query: 407 DPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRD 440 D EM+NL+++ A +M + IRD Sbjct: 427 DSAEMNNLVNNPEMALKLEEMRQLFDAQVQHIRD 460 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats. 
Identities = 108/482 (22%), Positives = 178/482 (36%), Gaps = 92/482 (19%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPL-NTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 +PN + ++ D + VG T NID+LA +G+ F+ AY + C P+RA L + Sbjct: 39 KPNIIMIVIDDLGWSDVGYNQTTDYFETPNIDALAQQGLVFDQAYAGAANCAPSRAVLMS 98 Query: 62 GIYANQSGPWT------------------NNVAPGKNISTMGRYFKDAGYHTCYIGKWHL 103 G Y + G +T N +I T+G K AGY T GKWHL Sbjct: 99 GQYGPRHGVYTVSPSDRGHAKTRKLIPIKNKRGLTTDIITIGESLKTAGYTTGTFGKWHL 158 Query: 104 DGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWA 163 P + D G++ N + Sbjct: 159 GA--------DPDKQGFDVNVAGSHQGMTFHYFSPYQLPN---------IEDGPKGEYLT 201 Query: 164 HRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLA 223 R++ +D+++ + D+PF V Y H P+ V+ + KY + + Sbjct: 202 ERLTTEVIDWVK--SSKDQPFFAYVPYYTVHTPYQAVVDKVNKYHEKGIK---------- 249 Query: 224 NKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYT 282 Y A + +DD +GR+ + L E ENT VI+T Sbjct: 250 ---------------------SKREATYAAMVEHMDDNVGRIFDMLDSEGLAENTVVIFT 288 Query: 283 SDHGEM-MGAHK--LISKGAAMYDDITRIPLIIRSPQGERRQVDT-PVSHIDLLPTMMAL 338 SD+G M + L + YD R+PLI+R P+ + +D PV + D PT++ L Sbjct: 289 SDNGGYRMSSFPTPLRGGKGSYYDGGLRVPLIVRWPEKVKPGLDHTPVINADFYPTLVNL 348 Query: 339 ADIEKPEI-LPGENILAVKEPRGVMVEFNRYEIEH--------------DSFGGFIPVRC 383 ++P L G ++ A + + E + + D P Sbjct: 349 TKSKQPNQVLDGVDLTAHLLGQQDIAERDLFWHFPVYLQAHHAPTDQGQDPLFRTRPGSA 408 Query: 384 WVTDDFKLVLNLFTSD-ELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPF 442 + D+KL+ ++ ELY+ ND E +NL ++ L + +I Sbjct: 409 IRSGDWKLLQYFENNEFELYNLANDLAEKNNLASV--HPSRVKELKTKLQAWQQQIGADI 466 Query: 443 RS 444 + Sbjct: 467 PT 468 >UniRef50_D2QCX4 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QCX4_9SPHI Length = 533 Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats. 
Identities = 109/503 (21%), Positives = 187/503 (37%), Gaps = 81/503 (16%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 +KRPN L+++ D + +GCY G+ +NT N+D LAA GI+ S Y + C P RA L Sbjct: 38 VKRPNILYILADDMGFSDIGCYGGE-VNTPNLDKLAAGGIKLRSFY-NNARCCPTRASLL 95 Query: 61 TGIYANQSGPW-------------TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHD 107 TG Y + G + T+ K+ GY T +GKWH+ Sbjct: 96 TGQYPHTVGMGLMVTMPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHVG--- 152 Query: 108 YFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRIS 167 P + +++F + S E + ++D + + F + Sbjct: 153 -ERPEHWPLKRGFEHYFGLISGASSYYEIIPAEKGKRFIVLDDKEFTPPADGFYMTDAFT 211 Query: 168 NRAVDFLQQPAR--ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANK 225 + AV +L Q + AD+PF M ++Y PH P + KY Y + + + K Sbjct: 212 DYAVQYLNQQKQEQADKPFFMYLAYTAPHFPLHAYESDIAKYEKLYAQGWDVTRTKRYQK 271 Query: 226 PEHHRLWAQAM-------------PSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE 272 + L + + + +Y A D +D IGR+I L Sbjct: 272 MQQLGLIDKRYQLTPRPANVPAWNSATDKAQWIRKMAVYAAMIDRMDQNIGRLIKTLKAN 331 Query: 273 Q-RENTWVIYTSDHGEM---------------MGAHK----------------LISKGAA 300 +NT +++ SD+G +G Sbjct: 332 GQYDNTLIVFMSDNGSSNENMESRKLNDPTKKIGERGSYVTYDTPWANVSVTPFRKYKRF 391 Query: 301 MYDDITRIPLIIRSPQGER---RQVDTPVSHIDLLPTMMALADIEKPEILPGENILAVKE 357 +++ P I++ P+ R VD +DLLPT + LA + LPG+++ + Sbjct: 392 LHEGGMITPCIMQWPRNIRPAAGYVDGIGHVMDLLPTSLELAGLSA-NDLPGKSLSYLWT 450 Query: 358 PRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT-SDELYDRRNDPNEMHNLID 416 P+ +E E + D+KLV + ELY+ + DP E ++L Sbjct: 451 PKKTEPRTYCWEHE--------GNKAIRKADWKLVKDTEDADWELYNIKTDPCETNDLAR 502 Query: 417 DIRFADVRSKMHDALLDYMDKIR 439 + + M + ++ Sbjct: 503 NQ--PQRVASMRTEFDTWAQRVG 523 >UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Planctomycetaceae RepID=A6CAR8_9PLAN Length = 501 Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats. 
Identities = 109/491 (22%), Positives = 179/491 (36%), Gaps = 76/491 (15%) Query: 4 PNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGI 63 PN + +++D Q +G + + + T ++D LA EG + S Y P CTP+R L TG Sbjct: 38 PNIIMIVSDDQGYRDLGSFGSEEIMTPHLDRLAKEGAKLTSFYVTWPACTPSRGSLLTGR 97 Query: 64 YANQSGPWT----------NNVAPGKNIST-------------MGRYFKDAGYHTCYIGK 100 Y ++G + + P + T + K AGY + GK Sbjct: 98 YPQRNGIYDMIRNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPAGYVSAIYGK 157 Query: 101 WHLDGHDYFGTGECPPEWDADYWFD----GANYLSELTEKEISLWRNGLNSVEDLQANHI 156 W L H F P D ++ G +Y + S++RN Q Sbjct: 158 WDLGIHKRF----LPLARGFDDFYGFTNTGIDYFTHERYGVPSMYRNN-------QPTEE 206 Query: 157 DETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPH-----HPF-----TCPVEYLEK 206 D+ + AV F+++ +PF + + ++ PH P P +Y Sbjct: 207 DKGTYCTYLFQREAVRFIKE--NHQKPFFLYLPFNAPHGASSLDPRIRGGAQAPEKYKNM 264 Query: 207 YADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVI 266 Y L K + R G Y A +DD IG V+ Sbjct: 265 YPHLKDTLVTKKKTGRYE----FRERPDGPVIHQGVSASKRRLEYVASITCMDDAIGEVL 320 Query: 267 NALTPEQ-RENTWVIYTSDHGEMMG--AHKLISKGAAMYDDITRIPLIIRSPQGE--RRQ 321 L Q +NT V++ SD+G G L K M++ R+P ++R P Sbjct: 321 GLLDEYQIADNTIVVFFSDNGGSGGADNSPLKGKKGMMFEGGIRVPCLVRYPAKIKPGTV 380 Query: 322 VDTPVSHIDLLPTMMALADIEKPEI--LPGENILAVKEPRGVMVEFNRYEIEHDSFGGFI 379 D ++ ++L+PT + A I PE + G ++L V + Y + Sbjct: 381 NDELLTSLELVPTFLKEAAIPLPENVVIDGYDMLPVLMGKTTSPRNEMYWQRRED----- 435 Query: 380 PVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIR 439 + +K V + S L+D D E H+L +M + ++ ++ Sbjct: 436 --KAARVGHWKWVESEKGSG-LFDLSQDIGEKHDLS--PTHPKKLEEMKNHFANWKKQMA 490 Query: 440 D-----PFRSY 445 D PFR Y Sbjct: 491 DAEPRGPFRDY 501 >UniRef50_D1AX15 Sulfatase n=2 Tax=Fusobacteriaceae RepID=D1AX15_STRM9 Length = 491 Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats. 
Identities = 120/512 (23%), Positives = 198/512 (38%), Gaps = 63/512 (12%) Query: 1 MKRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLF 60 MK+ N LF++ D +GCY K T NID LA +G F + + SPVC+PARA +F Sbjct: 1 MKKNNILFIIADDLGAWALGCYGNKDAITPNIDMLAEKGKIFENFFCVSPVCSPARASIF 60 Query: 61 TGIYANQSGPWT------NNVA---PGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGT 111 TG +Q G N K ST Y C GKWH+ D Sbjct: 61 TGRIPSQHGIHDWLDEWENGTTTEDYLKGQSTFVDVLSKNNYICCMSGKWHMGLAD---- 116 Query: 112 GECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAV 171 P+ YW+ + ++++G I E +I+ A+ Sbjct: 117 ---VPQKGFHYWYS--HQKGGGPYYMAPMYKDG---------KLIHEEEYITDKITEYAI 162 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFT---CPVEYLEKYADFYYELGEKAQDDLANKPEH 228 DFL + D+PF + V+Y PH P+ E L+ Y ++ + + Sbjct: 163 DFLDDVYKEDKPFFLNVNYTAPHSPWDKKNHKEEILKLYEGCKFKSCP--------RDPY 214 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPE-QRENTWVIYTSDHGE 287 H ++ + YFA +D IG +I L + +NT +I+TSD+G Sbjct: 215 HPWKISETFEGNEEERIQILKGYFAALTSMDFGIGEIIKKLEEKDMLKNTLIIFTSDNGM 274 Query: 288 MMGAHKLISKGA-----AMYDDITRIPLIIRSP-QGERRQVDTPVSHIDLLPTMMALADI 341 MG H + KG MYD ++P II + E +V+ +SH D+ T++ + Sbjct: 275 NMGHHGIFGKGNGTSPLNMYDSSVKVPFIIYKKDETEAEKVNNLLSHYDVRSTLLEYLGL 334 Query: 342 E--KPEILP--GENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVL-NLF 396 + K E + G + + + + + N + P R + +K V Sbjct: 335 DDVKDENIDYPGNSFSEILNNKKIDDDKNVVIYDEYG-----PTRMIRNEKYKYVHRYPD 389 Query: 397 TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYM-----DKIRDPFRSYQWSLRP 451 E Y+ D E N I++ +++ + +M L + +I + Sbjct: 390 GPHEFYNLIEDVEEKVNEINNEKYSKIIDQMRKDLEIWFLNYVNKEIDGATLPIYGAG-- 447 Query: 452 WRKDARPRWMGAFRPRPQDGYSPVVRDYDTGL 483 +K +W G + +S + D L Sbjct: 448 -QKKFAGKWGGYAKDTFGRYHSKFIFSSDAKL 478 >UniRef50_C9L4I6 Arylsulfatase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L4I6_RUMHA Length = 514 Score = 395 bits (1017), Expect = e-108, Method: Composition-based stats. 
Identities = 96/470 (20%), Positives = 184/470 (39%), Gaps = 32/470 (6%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN L +MTD + +G + T +D+LA+ G+ F++AY+ P C ARA L TG Sbjct: 28 KPNILLIMTDQLRGDCLGFAGHPDVKTPYLDTLASRGVSFDNAYSSCPSCIAARAALHTG 87 Query: 63 IYANQSGPWTNNVAPGKNIS-TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 + TM AGY+ +GK H+ + D Sbjct: 88 MAQEHHRRTGYEDNIPWEYPNTMAGELSKAGYYCQCVGKMHVHPLRNYLGFHNVELHDGY 147 Query: 122 -YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHID----------------ETFTWAH 164 + N + ++K + + L + + + D E + + Sbjct: 148 LHSARYTNVPWQESQKNADDYFHWLKQEKGIDTDVTDTGLECNSWVARPWIYEEKYHPTN 207 Query: 165 RISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLAN 224 +++R++DFL++ +PF ++ SY PH PF P Y + Y + Sbjct: 208 WVTDRSIDFLRR-KDPQKPFFLMASYLRPHPPFDAPSYYFDLYNKKELTPPAVGDWETTE 266 Query: 225 KPEHHRLWAQAMPSPVGDDGLYHHP-LYFACNDFVDDQIGRVINALTPEQ-RENTWVIYT 282 + + + P + Y+AC +D QIGR+I AL +NT +++T Sbjct: 267 ELQAMGRVFDSKCGPSDAVLIREAQIGYYACITHLDHQIGRLIQALVEYGVYDNTLILFT 326 Query: 283 SDHGEMMGAHKLISKGAAMYDDITRIPLIIRSPQ------GERRQVDTPVSHIDLLPTMM 336 SDHGE + H + K Y RIP+I+ + + V D++PT++ Sbjct: 327 SDHGEELCDHHMFRKSR-PYQGSIRIPMIVSGNDKFLSGMKQGTVSHSVVELRDVMPTLL 385 Query: 337 ALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLF 396 + + P+ + G+++L + + + + G V+ K + Sbjct: 386 DFVNADIPDSVDGKSMLPLVTNPDEKLRD---VLHGEHSYGPYSNHWLVSSYDKFIWYSE 442 Query: 397 -TSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSY 445 +++ + DP E+H+ I + + ++ L++ + + F Sbjct: 443 TGTEQYFRISEDPKELHDEISNPAYQKRIEQLRRTLIETLKDRPEGFSDG 492 >UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteroidetes RepID=A4AQQ7_9FLAO Length = 596 Score = 395 bits (1017), Expect = e-108, Method: Composition-based stats. 
Identities = 95/474 (20%), Positives = 177/474 (37%), Gaps = 64/474 (13%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD Q + L+T NID++A G F + Y PVC+P RA L TG Sbjct: 36 KPNVVLIMTDDQGWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQ-PVCSPTRAELLTG 94 Query: 63 IYANQSGPW---TNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWD 119 YA + G + T +T+ FK AGY T GKWH + P Sbjct: 95 KYAARLGVYSTSTGGERFNSKETTIAEIFKKAGYKTTAYGKWHSGMQPPYH----PNSRG 150 Query: 120 ADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPAR 179 D ++ + W N + + + + ++N+ +DF+ + Sbjct: 151 FDDYYGFTSG----------HWGNYFSPMLEHNGEIVKGEGFLVDDLTNKGLDFITE--N 198 Query: 180 ADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPSP 239 + PF + + Y+ PH P P EY E++ ++ Sbjct: 199 KNNPFFLYLPYNTPHSPMQVPNEYWERFEKKKLDM---------------------RYQG 237 Query: 240 VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG--EMMGAHKLIS 296 ++ A + +D +GR+ N L ENT ++Y SD+G + Sbjct: 238 NEEESENFTRAALAMVENIDFNMGRLTNKLKELGLEENTIIVYLSDNGPNGWRWNGGMRG 297 Query: 297 KGAAMYDDITRIPLIIRSPQGER--RQVDTPVSHIDLLPTMMALADIEKP--EILPGENI 352 + + + R P I+ +++ ID+LPT+ +LA I +P + + G+++ Sbjct: 298 RKGSTDEGGVRSPFFIQWKNTIPKNKKISQIAGAIDILPTLTSLAGINQPTIKSIDGKDL 357 Query: 353 LAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMH 412 + + E +R+ + H T ++L + LYD +ND + Sbjct: 358 KTLIADKNPTWE-SRHIVNHW-----RGKTSIRTQKYRL----DNENRLYDMQNDIGQRT 407 Query: 413 NLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRP 466 +L + + + ++ + + + P ++ P Sbjct: 408 DLSS--ELPQLTDSLVNIKNIWLKDA----VTVKPENKRPFTLGHPDFIYTQIP 455 >UniRef50_B1KD77 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD77_SHEWM Length = 482 Score = 395 bits (1017), Expect = e-108, Method: Composition-based stats. 
Identities = 102/466 (21%), Positives = 176/466 (37%), Gaps = 63/466 (13%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N L ++ D +G Y + + NID+LA++ + AY+ SPVC+P+RA L TG + Sbjct: 37 NVLVLLIDDLGWTDLGAYGSQYYESPNIDALASQSRLYTQAYSSSPVCSPSRAALMTGKH 96 Query: 65 ANQSGPWTN----------------NVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDY 108 ++ T+ T+ FK GY T + GKWH+ G Sbjct: 97 PSKLKITTHFPGYKAKSPKLKEPWKADHLALTELTLAEAFKSQGYETFFAGKWHMGGE-- 154 Query: 109 FGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISN 168 G P + D G + S + ++ + ++ R+++ Sbjct: 155 ---GYLPTDQGFDINIGGMHRGSPPGGYY--------DPYKNPNLPNRNKGEHLTKRLTD 203 Query: 169 RAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEH 228 +DFL Q + ++PF ++SY H P + L + + + Sbjct: 204 ETIDFLSQ--KHEKPFFALLSYYGVHTPLQAGPDKLAYFKEKTNTVA------------G 249 Query: 229 HRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQRE-NTWVIYTSDHGE 287 + + Y + VD +GR++ +L + + NT V+ TSD+G Sbjct: 250 EKAFLIDKGHQSRTQINQVDANYASMIWAVDKSVGRILESLEKQGLDKNTLVVLTSDNGG 309 Query: 288 MMGAH------------KLISKGAAMYDDITRIPLIIRSPQ-GERRQVDTPVSHIDLLPT 334 H L S +Y+ RIPL+I P + Q DT + DL PT Sbjct: 310 FSTRHQGDERVTSTANLPLRSGKGWVYEGGVRIPLLIHQPGQQIQSQHDTLTTSADLYPT 369 Query: 335 MMALADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFG--GFIPVRCWVTDDFKLV 392 + +A + PE + G +I + + + + H + G P D+KL+ Sbjct: 370 LANVAGAKIPEGIDGSDIF-LLDEEPELAKQRVIVWHHPHYHGSGNKPSAAIRVGDWKLL 428 Query: 393 L-NLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDK 437 ELY+ ND E NL + R+ + L ++ Sbjct: 429 HFYEQDRVELYNLSNDIAEQVNL--EQLEPKRRAHLLALLDEWYRD 472 >UniRef50_A6DI98 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI98_9BACT Length = 468 Score = 395 bits (1017), Expect = e-108, Method: Composition-based stats. 
Identities = 103/470 (21%), Positives = 168/470 (35%), Gaps = 54/470 (11%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 + N +F++ D +G Y + + T +D +AA GIRF + Y+ CT +R L TG Sbjct: 22 KTNIIFILADDLGYGELGSYGQEKIKTPELDKMAASGIRFTNHYSGYTTCTMSRKVLMTG 81 Query: 63 IYANQSGPWTNNVAPGKNIS--TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDA 120 + G I T+ K+AGY T IGKW + G G P + Sbjct: 82 KHIANL-------PMGDRIPSITIAGLLKNAGYKTAMIGKWGMKGRP--GHDNSPEKHGF 132 Query: 121 DYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETF---------TWAHRISNRAV 171 D+ F N LWRNG N + E + A+ Sbjct: 133 DHVFTYDNQGFAHFYYPEYLWRNGEKIHYPTNKNLLTEDGYIKEKHDGVYSHDEFTKDAL 192 Query: 172 DFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKY-ADFYYELGEKAQDDLANKPEHHR 230 F+++ D PF + + Y PH T P + +E Y + E + + P + Sbjct: 193 GFIEE--NKDRPFFLYLPYTIPHAEITVPHDSVEPYLKLNWPETPKIIGGGGSKDPGYGS 250 Query: 231 LWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHG--- 286 + + Y H Y +D +GR+++ L + ENT V++ SD+G Sbjct: 251 QYVKGYCG-----QKYPHAAYAGMISRMDRDVGRILDLLKKLKIEENTLVLFGSDNGASP 305 Query: 287 -------EMMGAHKLISKGAAMYDDITRIPLIIRSPQ--GERRQVDTPVSHIDLLPTMMA 337 + KL ++Y+ TR P I P+ ++ ++ D + T Sbjct: 306 EGGQTLEFFQSSGKLRGDKRSIYEAGTRTPFIAYWPKTIQAGQKTGHISAYCDFVATACD 365 Query: 338 LADIEKPEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLN--- 394 +A I+ PE G + L Y ++KL+ N Sbjct: 366 VAGIKTPEHSDGVSYLPSLLGNRHQQAQRPYIFNAW-----KSWSSVRVKEWKLIANRKK 420 Query: 395 ---LFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ELY+ D EM NL + + +H + + P Sbjct: 421 DLSPAERFELYNLEKDEGEMKNLA--TQHPEKVQSLHKLINKISKRHYTP 468 >UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UM38_RHOBA Length = 667 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. 
Identities = 103/490 (21%), Positives = 182/490 (37%), Gaps = 61/490 (12%) Query: 2 KRPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFT 61 RPN L V+TD Q + + L T +IDSLA +G++ + Y VC+P RA T Sbjct: 91 SRPNVLVVLTDDQGWGDLSLHGNPNLQTPHIDSLARDGVQIKNFYV-CAVCSPTRAEFLT 149 Query: 62 GIYANQSGPWT---NNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEW 118 G Y +SG ++ + T+G F+ AGY T GKWH + P Sbjct: 150 GRYHTRSGVFSTSAGGERFDLSERTIGDAFQAAGYRTAAFGKWHSGMQAPYH----PNAR 205 Query: 119 DADYWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPA 178 D ++ + W N + + +L + ++ A+DF+++ Sbjct: 206 GFDEFYGFCSG----------HWGNYFSPMLELNGEIVKGDGFIVDDLTQHAIDFMER-- 253 Query: 179 RADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMPS 238 + PF + + + PH P P E + + P Sbjct: 254 DRENPFFIYLPLNTPHSPMQVPDEDWQNFEGKEI-------------------VPDPRPE 294 Query: 239 PVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMG--AHKLI 295 + + H A + +DD +G++++AL ENT V++ D+G L Sbjct: 295 NAKKEDVQHTRAALALCENIDDNVGQLLDALERLSLSENTIVVFFCDNGPNGSRFNGGLR 354 Query: 296 SKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEKPEI---LPGE 350 + A+++ R P +IR P + V IDL PT+ L D+E L G Sbjct: 355 GRKGAVHEGGLRSPCLIRYPSKIPAGQTVGGIAGAIDLFPTLADLCDVEVGATAGPLDGI 414 Query: 351 NILA-VKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPN 409 +++ ++EP+ E + F ++ ++ N L+D DP Sbjct: 415 SLIDGLREPKSKPSERLIFTAWSGKF-------SVRSNRYRYHANGD----LFDIVADPG 463 Query: 410 EMHNLIDDIRFADVRSKMHDALLDYMDKIRDPFRSYQWSLRPWRKDARPRWMGAFRPRPQ 469 E ++ +D +++ AL D++ + + RS+ W Q Sbjct: 464 ETGSVAEDQ--PVATARLKKALEDWVKETKPRDRSHSEEQVFPVGHPDHPWTQLPARDAQ 521 Query: 470 DGYSPVVRDY 479 + Sbjct: 522 ATGQIRRSNR 531 >UniRef50_UPI0000E0F7B6 iduronate 2-sulfatase precursor n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0F7B6 Length = 499 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. 
Identities = 100/444 (22%), Positives = 179/444 (40%), Gaps = 25/444 (5%) Query: 5 NFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIY 64 N + ++ D ++G Y K + NID+LAA+GI F AY PVC +RA + TGI Sbjct: 56 NIVMIIVDDLRP-VLGVYGDKNAYSPNIDALAAQGITFTQAYANVPVCGASRASMLTGIR 114 Query: 65 ANQSGPWTNNVAPGKNIS---TMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 N++ K+ ++ + +++GYHT IGK + D +A Sbjct: 115 PNKTRFIDYKAKAQKDAPGAKSLPQVLRESGYHTMGIGKIFHNSKDLAKVSWSEKLQNAG 174 Query: 122 YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQPARAD 181 + + N + + + + ++ +A+ L + A+ + Sbjct: 175 MGHA-TRLNPDSENYLKTTKFNKRGNGPWYETMDVADEAYPDGKVKEKALKALTRLAKQE 233 Query: 182 EPFLMVVSYDEPHHPFTCPVEYLEKYADFYYEL------GEKAQDDLANKPEHHRLWAQA 235 +PF + V + PH PF P +Y + + + A L E H + Sbjct: 234 QPFFLSVGFIRPHLPFYAPKKYYDLHPREKFSPFFDRNKPRNAPKSLNGSGEIHTYHFKD 293 Query: 236 MPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQR-ENTWVIYTSDHGEMMGAHKL 294 + Y+A ++D +G VI + +NT ++ TSDHG +G H Sbjct: 294 YTYNSDAFHMSSLQGYYASVSYIDALVGDVIAQIDSLGLRDNTTIMLTSDHGFNLGEHNF 353 Query: 295 ISKGAAMYDDITRIPLIIRSPQGER-RQVDTPVSHIDLLPTMMALADIEKPEILPGENIL 353 +K M + RIP+I+ P + + D V +D+ PT+ + + P + G++ + Sbjct: 354 WTKH-TMLETSLRIPMIVAGPNIAKDEKTDALVELVDVFPTITEITKVNPPATVQGQSFV 412 Query: 354 AVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFT----SDELYDRRNDPN 409 + V + Y F V DDF + LYD + DP+ Sbjct: 413 KSLQNASVNHKKQIY-------SRFKKGDSVVNDDFIFTSYATAENTIEEMLYDHKVDPH 465 Query: 410 EMHNLIDDIRFADVRSKMHDALLD 433 E +N++++ R+ V +KM L Sbjct: 466 ETNNVVNEPRYQAVATKMRAQLTA 489 >UniRef50_A7LY81 Putative uncharacterized protein n=5 Tax=Bacteroides RepID=A7LY81_BACOV Length = 517 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 126/500 (25%), Positives = 217/500 (43%), Gaps = 60/500 (12%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN + +MTD Q ++ G T +D LA E + FN AYT P +PAR +FTG Sbjct: 25 KPNIVVIMTDQQRADLCGREGFPLEVTPFVDRLAQENVWFNKAYTVMPASSPARCSMFTG 84 Query: 63 IYANQSGPWTNNVAPG-KNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDAD 121 + + + TN+ P + K+ GY T +GK H D D Sbjct: 85 RFPSATHVRTNHNIPDISYQQDLVGVLKENGYKTALVGKNHAYLKPA----------DLD 134 Query: 122 YWFDGANYLSE----LTEKEISLWRNGLNSVEDLQANHIDETFTWAHRISNRAVDFLQQP 177 +W + ++ EKE + + N + L+ + I +I N A+ +++Q Sbjct: 135 FWSEYGHWGKHKKTTPAEKETARFLNQQARGQWLEPSPISLEEQHPTKIVNEALAWIKQQ 194 Query: 178 ARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKPEHHRLWAQAMP 237 + PF + VS+ EPH+P+ Y ++ + + ++ DLA K E +R+ AQ Sbjct: 195 --KENPFFVWVSFPEPHNPYQVCEPYYSMFSPDKLPVLKTSRKDLAKKGEKYRILAQLED 252 Query: 238 SP---VGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDHGEMMGAHK 293 + + D Y +DDQI R+I +L ENT + SDHG+ G + Sbjct: 253 ASCPNLEQDLPRIRANYIGMIRLIDDQIKRLIESLKASGQYENTLFVVLSDHGDYWGEYG 312 Query: 294 LISKGAAMYDDITRIPLIIRS--PQGERRQVDTPVSHIDLLPTMMALADIEKPEILPGEN 351 LI KGA + + + RIP++ + + +D+ VS DL PT + E P + G + Sbjct: 313 LIRKGAGLSESLARIPMVWAGYHIKNQPAPMDSHVSIADLFPTFCSAIGAEIPAGVQGRS 372 Query: 352 ILAVKEPRGVMVEFNRYEIEHDSFGGFI-------------------------------- 379 + + + E + FGG Sbjct: 373 LWPMLTGKAYPKEEFSSMVVQQGFGGADVGLDASLTFEQEGALTPGKIAHFDELNTWTQS 432 Query: 380 -PVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKI 438 R DD+KLV+N + + ELY+ + DP+E+HNL + +++++++++ LL + ++ Sbjct: 433 GTSRMIRKDDWKLVMNHYGNGELYNLKKDPSEVHNLFGEKKYSEIQTELLTRLLAWELRL 492 Query: 439 RDPF----RSYQWSLRPWRK 454 +DP R Y + P+ Sbjct: 493 QDPLPLPQRRYHFKQNPFNY 512 >UniRef50_A6DKC5 Putative sulfatase yidj n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC5_9BACT Length = 511 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. 
Identities = 118/458 (25%), Positives = 186/458 (40%), Gaps = 39/458 (8%) Query: 4 PNFLFVMTDTQATNMVGCY--------------SGKPLNTQNIDSLAAEGIRFNSAYTCS 49 PN L +MTD +GCY + T +ID LA EG+ N+ Y S Sbjct: 33 PNLLIIMTDEHNFRTLGCYRKLLSKDQAMIWGDGN-IVETPHIDKLAEEGVLCNNFYASS 91 Query: 50 PVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYF 109 PVC+PAR +G Y + NN ++ + G + GY T Y GKWHLDG D Sbjct: 92 PVCSPARGSFISGQYPQNTPVIDNNTHMSDDVVSFGSILQSHGYTTGYSGKWHLDG-DGK 150 Query: 110 GTGECPPEWDAD---YWFDGANYLSELTEKEISLWRNGLNSVEDLQANHIDETFTWAHRI 166 ++ + Y F+ ++ L N DE + Sbjct: 151 PQWGPERQFGFEDNRYMFNRGHWKKILDTASGPKIGAEKRGTPTYDVNGADENTYTTDWL 210 Query: 167 SNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLEKYADFYYELGEKAQDDLANKP 226 +N+ +DF+ Q + PF +VSY +PH P T Y Y ++ + A + P Sbjct: 211 TNKTIDFITQHKAS--PFCYMVSYPDPHGPDTVRAPYDTMYTHMNFQKPKTASKKQDDLP 268 Query: 227 EHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRVINALTPEQ-RENTWVIYTSDH 285 G + Y+ +DD I R++ L + ENT V++TSDH Sbjct: 269 SWA----------TTKRGAANQSQYYGMIKCIDDNIARIMTCLDEQGILENTIVVFTSDH 318 Query: 286 GEMMGAHKLISKGAAMYDDITRIPLIIRSPQGE--RRQVDTPVSHIDLLPTMMALADIEK 343 G+M G H + ++P I+R P+ + V+ +S +D LPT++ L D E Sbjct: 319 GDMRGEHG-RQNKGIPLEASAKVPFIVRYPKKISSGKIVNEALSGVDFLPTILGLMDKET 377 Query: 344 PEILPGENILAVKEPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYD 403 G + + + + I G +TD +KLV+ + L D Sbjct: 378 AGKEEGRDGSQLLHGKVPTGWSDVTFI----RGTKEKWVAAITDQYKLVMAPWDEPWLID 433 Query: 404 RRNDPNEMHNLIDDIRFADVRSKMHDALLDYMDKIRDP 441 ++N+P+E N I+D ++ V + + Y K DP Sbjct: 434 KKNNPDETINYINDPQYRSVIRSLAKEMQRYGTKYNDP 471 >UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI7_BACV8 Length = 508 Score = 394 bits (1014), Expect = e-108, Method: Composition-based stats. 
Identities = 100/491 (20%), Positives = 168/491 (34%), Gaps = 87/491 (17%) Query: 3 RPNFLFVMTDTQATNMVGCYSGKPLNTQNIDSLAAEGIRFNSAYTCSPVCTPARAGLFTG 62 +PN +++M D +GCY ++T NID++A EG+RF AY SPV P+RA TG Sbjct: 27 KPNIIYIMCDDMGYGDLGCYGQPYISTPNIDNMAKEGMRFTQAYAGSPVSAPSRASFMTG 86 Query: 63 IYANQSGPWTN----------------------NVAPGKNISTMGRYFKDAGYHTCYIGK 100 ++ N + KD GY T GK Sbjct: 87 QHSGHCEVRGNKEYWRDAPVVMYGNNKEYAVVGQHPYDPGHVIIPEIMKDNGYTTGMFGK 146 Query: 101 WHLDGHDYFGTGECPPEWDADYWFDGANYLSELTEKEISLWRNGLN-------------- 146 W Y G+ P + D ++ L R + Sbjct: 147 W---AGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNFLNRYSKSAGDTAVVRVVMDEN 203 Query: 147 -SVEDLQANHIDETFTWAHRISNRAVDFLQQPARADEPFLMVVSYDEPHHPFTCPVEYLE 205 + ++ A I A+ +L + +PF + +Y PH P + + Sbjct: 204 INYPMFGKDYFKRPQYSADMIHEEAMKWLDKQ-DGKQPFFGIFTYTLPHAELAQPEDSI- 261 Query: 206 KYADFYYELGEKAQDDLANKPEHHRLWAQAMPSPVGDDGLYHHPLYFACNDFVDDQIGRV 265 L + +K + ++ PS ++ H + +D +G V Sbjct: 262 --------LTGYQKKFFEDKTWGGQEGSRYNPS------VHTHAQFAGMITRLDYYVGEV 307 Query: 266 INALTPEQRE-NTWVIYTSDHGEM----------MGAHKLISKGAAMYDDITRIPLIIRS 314 +N L + + NT VI+TSD+G KL Y+ RIP I+R Sbjct: 308 LNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCYEGGIRIPFIVRW 367 Query: 315 PQG--ERRQVDTPVSHIDLLPTMMALADIEKP-----------EILPGENILAVKEPRGV 361 P E D ++ DL+PT LA ++ + G + + Sbjct: 368 PGKVPEGTVNDHQLAFYDLMPTFCDLAGVKNYVKKYTNKKKDVDYFDGISFAPTLLGQEG 427 Query: 362 MVEFNRYEIEHDSFGGFIPVRCWVTDDFKLVLNLFTSDELYDRRNDPNEMHNLIDDIRFA 421 + + E D D+K+V+ T LY+ D +E H++ Sbjct: 428 QKKHDFLYWEFDE----TDQIGVRMGDWKMVVKKGTPF-LYNLATDIHEDHDIAA--GHP 480 Query: 422 DVRSKMHDALL 432 D+ +M + + Sbjct: 481 DIVKQMKEIIR 491 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.183 0.643 Lambda K H 0.267 0.0562 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,883,955,036 Number of Sequences: 3077464 Number of extensions: 230613313 Number of successful extensions: 531544 
Number of sequences better than 1.0e-01: 250
Number of HSP's better than 0.1 without gapping: 4123
Number of HSP's successfully gapped in prelim test: 1999
Number of HSP's that attempted gapping in prelim test: 493650
Number of HSP's gapped (non-prelim): 11526
length of query: 497
length of database: 1,040,396,356
effective HSP length: 133
effective length of query: 364
effective length of database: 631,093,644
effective search space: 229718086416
effective search space used: 229718086416
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.0 bits)
S2: 96 (41.1 bits)
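
The statistics footer above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output; it assumes the usual Karlin-Altschul relations (effective lengths obtained by subtracting the effective HSP length, bit score = (Lambda * S - ln K) / ln 2) and assumes, from the way the numbers work out, that the first Lambda/K line is the ungapped pair and the second the gapped pair. The function names are hypothetical and chosen only for illustration.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 497
DB_LETTERS = 1_040_396_356
DB_SEQS = 3_077_464
HSP_LEN = 133                 # "effective HSP length"

# Assumption: first Lambda/K line is ungapped, second is gapped.
UNGAPPED = (0.313, 0.183)     # (Lambda, K)
GAPPED = (0.267, 0.0562)      # (Lambda, K)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (Lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits: Lambda * X / ln 2."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    print(round(bit_score(41, *UNGAPPED), 1))     # compare with S1 in the footer
    print(round(bit_score(96, *GAPPED), 1))       # compare with S2 in the footer
    print(round(dropoff_bits(16, UNGAPPED[0]), 1))  # compare with X1
    print(round(dropoff_bits(38, GAPPED[0]), 1))    # compare with X2
    print(round(dropoff_bits(64, GAPPED[0]), 1))    # compare with X3
```

Under these assumptions the computed values agree with the footer: 364 and 631,093,644 for the effective lengths, 229718086416 for the search space, and the bit values printed next to S1, S2, X1, X2 and X3. The per-hit bit scores in the alignments are not reproduced here, since composition-based statistics may rescale them.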