BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (551 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA... 1147 0.0 UniRef50_A5FF56 Sulfatase n=2 Tax=Bacteria RepID=A5FF56_FLAJ1 349 1e-94 UniRef50_Q7ULF9 Arylsulfatase n=4 Tax=Bacteria RepID=Q7ULF9_RHOBA 288 3e-76 UniRef50_B8KM61 Steryl-sulfatase n=2 Tax=gamma proteobacterium N... 226 2e-57 UniRef50_C1ZCM0 Arylsulfatase A family protein n=2 Tax=Bacteria ... 219 3e-55 UniRef50_A1WGP9 Sulfatase n=6 Tax=Proteobacteria RepID=A1WGP9_VEREI 212 3e-53 UniRef50_A8G0H1 Sulfatase family protein n=5 Tax=Gammaproteobact... 207 1e-51 UniRef50_C7ZGP1 Predicted protein n=3 Tax=Leotiomyceta RepID=C7Z... 203 1e-50 UniRef50_A0Z7U6 Arylsulfatase n=2 Tax=Gammaproteobacteria RepID=... 200 1e-49 UniRef50_B8KV72 Arylsulfatase A n=1 Tax=gamma proteobacterium NO... 199 2e-49 UniRef50_A6UG37 Sulfatase n=16 Tax=Bacteria RepID=A6UG37_SINMW 196 2e-48 UniRef50_Q1CY93 Sulfatase family protein n=4 Tax=Bacteria RepID=... 195 4e-48 UniRef50_Q46SG5 Arylsulfatase n=3 Tax=Proteobacteria RepID=Q46SG... 194 9e-48 UniRef50_A6QA55 Arylsulfatase n=5 Tax=Bacteria RepID=A6QA55_SULNB 192 4e-47 UniRef50_Q4RJR3 Chromosome 13 SCAF15035, whole genome shotgun se... 189 2e-46 UniRef50_Q1QJ61 Sulfatase n=3 Tax=Bacteria RepID=Q1QJ61_NITHX 189 2e-46 UniRef50_Q0KB87 Arylsulfatase A or related enzyme n=107 Tax=cell... 187 1e-45 UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS2... 184 6e-45 UniRef50_Q488V4 Sulfatase family protein n=30 Tax=Bacteria RepID... 184 1e-44 UniRef50_B8KM62 N-acetylgalactosamine-6-sulfatase n=1 Tax=gamma ... 183 2e-44 UniRef50_A0Z6R0 Putative arylsulfatase n=1 Tax=marine gamma prot... 183 2e-44 UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=... 181 8e-44 UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Ta... 180 1e-43 UniRef50_B9XCM3 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XCM... 179 2e-43 UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6... 179 3e-43 UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bactero... 178 5e-43 UniRef50_A9W035 Sulfatase n=6 Tax=Bacteria RepID=A9W035_METEP 178 5e-43 UniRef50_D2QZL2 Sulfatase n=8 Tax=cellular organisms RepID=D2QZL... 177 1e-42 UniRef50_B9R4R2 Sulfatase, putative n=2 Tax=Rhodobacteraceae Rep... 176 1e-42 UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 176 2e-42 UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 175 5e-42 UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 175 5e-42 UniRef50_B7S1F0 Sulfatase, putative n=1 Tax=marine gamma proteob... 175 5e-42 UniRef50_B8KTJ7 Arylsulfatase F n=1 Tax=gamma proteobacterium NO... 173 1e-41 UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus E... 172 2e-41 UniRef50_C7RSC1 Sulfatase n=2 Tax=Bacteria RepID=C7RSC1_9PROT 172 3e-41 UniRef50_Q01Z68 Sulfatase n=4 Tax=Bacteria RepID=Q01Z68_SOLUE 171 6e-41 UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=... 168 5e-40 UniRef50_B4AUP3 Sulfatase n=2 Tax=Bacteria RepID=B4AUP3_9CHRO 167 7e-40 UniRef50_Q1VDY3 Probable sulfatase n=2 Tax=Vibrio alginolyticus ... 167 8e-40 UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteri... 167 1e-39 UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN 166 3e-39 UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina D... 165 3e-39 UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO 164 6e-39 UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas nept... 164 6e-39 UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 164 6e-39 UniRef50_P15289 Arylsulfatase A component C n=34 Tax=Euteleostom... 163 2e-38 UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 162 3e-38 UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM 162 3e-38 UniRef50_Q15US6 Sulfatase n=3 Tax=Alteromonadales RepID=Q15US6_P... 162 4e-38 UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis... 161 5e-38 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 160 9e-38 UniRef50_B0UGK6 Sulfatase n=18 Tax=Bacteria RepID=B0UGK6_METS4 160 1e-37 UniRef50_A9UPM8 Predicted protein (Fragment) n=1 Tax=Monosiga br... 160 1e-37 UniRef50_UPI00005846A1 PREDICTED: similar to arylsulfatase n=1 T... 159 2e-37 UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 159 3e-37 UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 158 5e-37 UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 157 1e-36 UniRef50_A6DLD9 Sulfatase n=2 Tax=Chlamydiae/Verrucomicrobia gro... 156 2e-36 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 155 3e-36 UniRef50_A6DHI4 Arylsulfatase A (ASA) n=1 Tax=Lentisphaera arane... 155 4e-36 UniRef50_A7RLK6 Predicted protein (Fragment) n=11 Tax=Eumetazoa ... 155 4e-36 UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomy... 155 5e-36 UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bactero... 155 5e-36 UniRef50_UPI0001A444F6 arylsulfatase A n=1 Tax=Pectobacterium ca... 154 1e-35 UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 154 1e-35 UniRef50_Q96EG1 Arylsulfatase G n=22 Tax=Euteleostomi RepID=ARSG... 153 1e-35 UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacteriu... 153 2e-35 UniRef50_UPI00005887B4 PREDICTED: similar to galactosamine (N-ac... 153 2e-35 UniRef50_B7QJZ0 Arylsulfatase B, putative n=9 Tax=Ixodes scapula... 152 3e-35 UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y2... 152 3e-35 UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA 151 6e-35 UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LD... 151 6e-35 UniRef50_B2ULS2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 151 7e-35 UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deute... 151 8e-35 UniRef50_P50473 Arylsulfatase n=8 Tax=Deuterostomia RepID=ARS_STRPU 150 9e-35 UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 150 1e-34 UniRef50_B3CAE2 Putative uncharacterized protein n=3 Tax=Bactero... 150 1e-34 UniRef50_UPI000180BD6E PREDICTED: similar to arylsulfatase n=1 T... 150 2e-34 UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LC... 149 2e-34 UniRef50_Q7UG72 Arylsulfatase A [precursor] n=1 Tax=Rhodopirellu... 149 3e-34 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 149 3e-34 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 148 4e-34 UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 148 5e-34 UniRef50_C1BQY6 Arylsulfatase A n=2 Tax=Caligus RepID=C1BQY6_9MAXI 148 6e-34 UniRef50_C6VTS4 Sulfatase n=47 Tax=cellular organisms RepID=C6VT... 148 6e-34 UniRef50_B9NR18 Sulfatase family protein n=1 Tax=Rhodobacteracea... 147 8e-34 UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=... 147 1e-33 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 147 1e-33 UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 147 1e-33 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 146 2e-33 UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT 146 2e-33 UniRef50_UPI000179252A PREDICTED: similar to arylsulfatase b n=3... 146 2e-33 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 146 2e-33 UniRef50_B7RWW8 Sulfatase, putative n=1 Tax=marine gamma proteob... 145 3e-33 UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchi... 145 3e-33 UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 145 5e-33 UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 145 6e-33 UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bactero... 144 6e-33 UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia caver... 144 7e-33 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 144 7e-33 UniRef50_B8KKX3 Arylsulfatase B n=1 Tax=gamma proteobacterium NO... 144 7e-33 UniRef50_A0PKV5 Arylsulfatase, AslA n=5 Tax=Bacteria RepID=A0PKV... 144 1e-32 UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME 143 2e-32 UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 143 2e-32 UniRef50_C9KTC2 Arylsulphatase A n=5 Tax=Bacteroides RepID=C9KTC... 142 3e-32 UniRef50_UPI000180C68F PREDICTED: similar to arylsulfatase, part... 142 4e-32 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 142 4e-32 UniRef50_C1ZFQ0 Arylsulfatase A family protein n=1 Tax=Planctomy... 141 5e-32 UniRef50_A6LIX5 Arylsulfatase n=2 Tax=Bacteroidales RepID=A6LIX5... 141 7e-32 UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN 141 8e-32 UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 141 8e-32 UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 140 9e-32 UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis... 140 1e-31 UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomy... 139 2e-31 UniRef50_D2R2H5 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 139 2e-31 UniRef50_D2QCX4 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 139 2e-31 UniRef50_C3ZFE8 Putative uncharacterized protein n=1 Tax=Branchi... 139 2e-31 UniRef50_C3ZQB5 Putative uncharacterized protein n=1 Tax=Branchi... 139 3e-31 UniRef50_A6DG39 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 139 3e-31 UniRef50_UPI000180C5AE PREDICTED: similar to sulfatase 1 n=2 Tax... 139 3e-31 UniRef50_D2A3E0 Putative uncharacterized protein GLEAN_07966 n=4... 139 3e-31 UniRef50_D2QW96 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 139 4e-31 UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 139 4e-31 UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7... 138 5e-31 UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacte... 138 6e-31 UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 137 7e-31 UniRef50_A6DSM5 Arylsulfatase A (Precursor) n=1 Tax=Lentisphaera... 137 1e-30 UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 137 1e-30 UniRef50_Q02AN8 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 137 1e-30 UniRef50_B7S0F9 Sulfatase domain protein n=1 Tax=marine gamma pr... 137 1e-30 UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 137 1e-30 UniRef50_A6DP41 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 137 1e-30 UniRef50_A6DHS3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 137 1e-30 UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 T... 137 1e-30 UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacter... 136 2e-30 UniRef50_A9UP45 Predicted protein n=1 Tax=Monosiga brevicollis R... 136 2e-30 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 136 2e-30 UniRef50_C6Y1U6 Sulfatase n=2 Tax=Sphingobacteriales RepID=C6Y1U... 136 2e-30 UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 136 2e-30 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 136 2e-30 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 136 2e-30 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 135 3e-30 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 135 3e-30 UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphing... 135 3e-30 UniRef50_B7PTL2 Arylsulfatase B, putative (Fragment) n=1 Tax=Ixo... 135 4e-30 UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN 135 6e-30 UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 135 6e-30 UniRef50_B4QDF1 GD10911 n=2 Tax=melanogaster subgroup RepID=B4QD... 134 6e-30 UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 134 7e-30 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 134 7e-30 UniRef50_B7AM73 Putative uncharacterized protein n=1 Tax=Bactero... 134 8e-30 UniRef50_A4A218 Arylsulfatase A n=2 Tax=Bacteria RepID=A4A218_9PLAN 134 9e-30 UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD 134 9e-30 UniRef50_Q7UNN1 Arylsulphatase A n=3 Tax=Bacteria RepID=Q7UNN1_R... 134 1e-29 UniRef50_UPI00016C41FE sulfatase n=1 Tax=Gemmata obscuriglobus U... 134 1e-29 UniRef50_A6CEG5 Arylsulphatase A n=2 Tax=Bacteria RepID=A6CEG5_9... 134 1e-29 UniRef50_UPI000186F312 arylsulfatase B precursor, putative n=1 T... 133 1e-29 UniRef50_Q1YP24 Arylsulfatase A n=1 Tax=gamma proteobacterium HT... 133 2e-29 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 133 2e-29 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 132 3e-29 UniRef50_A6DF77 Arylsulphatase A n=2 Tax=Lentisphaera araneosa H... 132 3e-29 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 132 3e-29 UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR4... 132 3e-29 UniRef50_C9KTU9 Twin-arginine translocation pathway signal n=5 T... 132 3e-29 UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 T... 132 3e-29 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 132 3e-29 UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammapro... 132 3e-29 UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1... 132 3e-29 UniRef50_Q7UIN1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 132 3e-29 UniRef50_B7PV03 Arylsulfatase B, putative n=7 Tax=Ixodes scapula... 132 3e-29 UniRef50_C6Y1N8 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 132 3e-29 UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 132 4e-29 UniRef50_A7SPY2 Predicted protein (Fragment) n=10 Tax=Eumetazoa ... 132 4e-29 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 132 4e-29 UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW10... 132 4e-29 UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO 132 4e-29 UniRef50_A3HYT7 Arylsulphatase A n=1 Tax=Algoriphagus sp. PR1 Re... 132 5e-29 UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 132 5e-29 UniRef50_Q7US20 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 131 5e-29 UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellu... 131 6e-29 UniRef50_A6DMX8 Iduronate-sulfatase or arylsulfatase A n=1 Tax=L... 131 6e-29 UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB... 131 6e-29 UniRef50_B1KD86 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 131 8e-29 UniRef50_A6DRV5 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 131 8e-29 UniRef50_D2R783 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 130 1e-28 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 130 1e-28 UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY... 130 1e-28 UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomy... 130 1e-28 UniRef50_A7RFN2 Predicted protein n=7 Tax=Eumetazoa RepID=A7RFN2... 130 1e-28 UniRef50_B4D681 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 130 1e-28 UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 130 1e-28 UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivoran... 130 2e-28 UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI 129 2e-28 UniRef50_Q7UIU1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 129 2e-28 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 129 3e-28 UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 129 3e-28 UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA 129 3e-28 UniRef50_UPI00015B51A4 PREDICTED: similar to arylsulfatase b n=1... 129 4e-28 UniRef50_UPI0000588E05 PREDICTED: similar to steroid sulfatase n... 128 5e-28 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 128 5e-28 UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 128 5e-28 UniRef50_C3Q8V4 Arylsulfatase B n=6 Tax=Bacteroides RepID=C3Q8V4... 128 5e-28 UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3... 128 6e-28 UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flamme... 128 6e-28 UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bactero... 128 7e-28 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 128 7e-28 UniRef50_B5CWB1 Putative uncharacterized protein n=1 Tax=Bactero... 128 7e-28 UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisp... 127 8e-28 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 127 8e-28 UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 127 1e-27 UniRef50_UPI0000E46777 PREDICTED: similar to arylsulfatase J n=1... 127 1e-27 UniRef50_A4CJK0 Arylsulfatase A n=1 Tax=Robiginitalea biformata ... 127 1e-27 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 127 1e-27 UniRef50_Q9VVM4 CG7402 n=10 Tax=Drosophila RepID=Q9VVM4_DROME 127 2e-27 UniRef50_A4AM21 Arylsulfatase A n=1 Tax=Flavobacteriales bacteri... 127 2e-27 UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 126 2e-27 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 126 2e-27 UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM... 126 2e-27 UniRef50_Q024K7 Sulfatase n=28 Tax=Bacteria RepID=Q024K7_SOLUE 126 2e-27 UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN 126 2e-27 UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bactero... 126 2e-27 UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN 126 2e-27 UniRef50_A6C4B6 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 126 2e-27 UniRef50_UPI0000DB708B PREDICTED: similar to CG7402-PA isoform 2... 126 2e-27 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 126 2e-27 UniRef50_A7SK50 Predicted protein n=1 Tax=Nematostella vectensis... 126 3e-27 UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9... 126 3e-27 UniRef50_UPI000186D20A arylsulfatase B precursor, putative n=1 T... 125 3e-27 UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 125 3e-27 UniRef50_A6BYP9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 125 3e-27 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 125 3e-27 UniRef50_Q16DZ0 Sulfatase, putative n=8 Tax=Proteobacteria RepID... 125 4e-27 UniRef50_C2G0L0 Possible Cerebroside-sulfatase n=2 Tax=Sphingoba... 125 4e-27 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 125 4e-27 UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bactero... 125 4e-27 UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacte... 125 5e-27 UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 125 5e-27 UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica R... 124 7e-27 UniRef50_Q2LZ24 GA16747 n=5 Tax=Drosophila RepID=Q2LZ24_DROPS 124 7e-27 UniRef50_Q15SA2 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 124 9e-27 UniRef50_A6DI30 N-acetylgalactosamine-6-sulfatase n=1 Tax=Lentis... 124 1e-26 UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomy... 124 1e-26 UniRef50_C9MKK8 Arylsulphatase A n=4 Tax=Bacteroidales RepID=C9M... 124 1e-26 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 124 1e-26 UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium ... 123 2e-26 UniRef50_A6DIG7 Iduronate-sulfatase or arylsulfatase A n=1 Tax=L... 123 2e-26 UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 123 2e-26 UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 123 2e-26 UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris ... 123 2e-26 >UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA_ECOLI Length = 551 Score = 1147 bits (2967), Expect = 0.0, Method: Compositional matrix adjust. Identities = 551/551 (100%), Positives = 551/551 (100%) Query: 1 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP 60 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP Sbjct: 1 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP 60 Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV Sbjct: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 Query: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG Sbjct: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK Sbjct: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR Sbjct: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE Sbjct: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP Sbjct: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP Sbjct: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK Sbjct: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 Query: 541 KYPPRAQIKSD 551 KYPPRAQIKSD Sbjct: 541 KYPPRAQIKSD 551 >UniRef50_A5FF56 Sulfatase n=2 Tax=Bacteria RepID=A5FF56_FLAJ1 Length = 524 Score = 349 bits (896), Expect = 1e-94, Method: Compositional matrix adjust. Identities = 195/508 (38%), Positives = 293/508 (57%), Gaps = 24/508 (4%) Query: 45 QYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFN 104 Q P + D + P + HP QDKE + + K KKPN+++ L+DD+G+ D+G Sbjct: 20 QNYFNPTVKVKDYLEPAIPHPDQDKE----MKDKLSKLKKKPNILIILIDDMGYGDIGVY 75 Query: 105 GGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPG 164 GGGVA+G PTP++D +A +GL LTS Y+QP+ +P+RA I+TG+ G+ P + G+ Sbjct: 76 GGGVAIGAPTPNMDKLAHEGLQLTSTYAQPTCTPSRAAIMTGRIPARSGLTRPTLTGENP 135 Query: 165 GLQ---GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWR 221 + T ++L GY + GKWH+GE+K S P VG+D++ GF SV Y ++ Sbjct: 136 KVNPWASENTTAKILSQNGYKSAISGKWHLGESKGSLPNEVGYDEWLGFGSVQSEYAQFV 195 Query: 222 DVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP-KYMEDLDQRWMDYGV 280 + + P++ PDR +K++ ++ + V+GGE + + I+ + + +DQ + +Y Sbjct: 196 NEWIYPDLINKPDRLAAVKKM-VDQNILTGVKGGENKVVQPISNIEELSKVDQVFANYSE 254 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 F+ + K +KPF+L + H DNY + Y G SPA Y D +VE++D+ L K L Sbjct: 255 DFIKRSVKENKPFYLIHSFSKVHNDNYVSEGYKGKSPAAIPYKDAIVEVDDIVGRLMKLL 314 Query: 341 EKNGQLDNTLIVFTSDNGPEAEV-PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-R 398 + DNTL+ TSDNGP +V P G TPFRG KG+TWEGGVRVP YWKGMI P R Sbjct: 315 QDLKIDDNTLVFLTSDNGPNEDVWPDGGYTPFRGGKGTTWEGGVRVPGIAYWKGMIAPGR 374 Query: 399 KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 SDG+ D+ D+F T+L AG V + +P + +IDGVDQ SFFL G SNR A + Sbjct: 375 ISDGLFDICDMFNTSLSAAG-----VLDKIPSSNYIDGVDQLSFFLSDKGVSNRNAVFMY 429 Query: 459 LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS--VFNLYTDPQESDS 516 K A+R E+K H+ + + S + + +Q+ G S V+N+Y DP+E S Sbjct: 430 SETKFMAIRWQEYKVHMNV-----FNTSATRRNLDQSTIQSIGMSPWVYNIYADPKEQLS 484 Query: 517 IGVRHIPMGVP-LQTEMHAYMEILKKYP 543 G R+ G+P + + A++ +KYP Sbjct: 485 QGHRYFEWGIPGVMGLIAAHLATYQKYP 512 >UniRef50_Q7ULF9 Arylsulfatase n=4 Tax=Bacteria RepID=Q7ULF9_RHOBA Length = 538 Score = 288 bits (738), Expect = 3e-76, Method: Compositional matrix adjust. Identities = 172/485 (35%), Positives = 270/485 (55%), Gaps = 13/485 (2%) Query: 67 QDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLI 126 QD+ + KLAE++ K GK+PN++ ++DD+G+ D G GGG A+G TP+ID +AS+GL Sbjct: 57 QDQAAEDKLAEIKAKHGKRPNILWLVVDDMGYGDPGCYGGGAAIGAATPNIDRLASEGLR 116 Query: 127 LTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQP---GGLQGLTTLPQLLHDQGYVT 183 LTS YSQ + +PTR+ ILTG+ + G+ P + G + +LP+LL D GY T Sbjct: 117 LTSCYSQQTCTPTRSAILTGRLPVRTGLTRPILAGDKLTRNPWEDEVSLPKLLSDAGYYT 176 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 GKWH+GE +P ++GFD++ G+ ++ D P++ +P+R+ + + Sbjct: 177 LLTGKWHVGEPVGMRPHDIGFDEYYGYYPAQKEISQRFDERRFPDLVNNPERARAFEAIA 236 Query: 244 FSKDDVHAVRGGEQQAIADI-TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 H +GG + + I + + M ++ D+ ++ + ++AK D+PFFL + Sbjct: 237 PDNHLTHGFKGGRTEKLKQIQSTEDMGRAEKVLADFTIQRIKELAKEDQPFFLEHCFMKV 296 Query: 303 HFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 H DN+PN S A+ Y + + E++ + L++ L NT + FTSDNGP+ + Sbjct: 297 HCDNFPNPDLGPLSAAKYYYKEAVAEVDLHVGEIMAALKEADVLGNTFVFFTSDNGPQMD 356 Query: 363 -VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHP 420 P G TPFRGAKG+T+EGGVRVP YWKG++ R+SDG+ DL DLF +L LA P Sbjct: 357 GWPDAGYTPFRGAKGTTFEGGVRVPGIAYWKGVVSGGRQSDGLFDLLDLFGVSLKLAEIP 416 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + +P + D +DQTSF L +GQS R+A +++ +L + RM E+K HV P Sbjct: 417 TSD----LPVDRYYDYIDQTSFLLQDDGQSKREAVYFWWGKELMSCRMHEYKVHVKAVLP 472 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + ++ V +FNLY DP+E +G R + ++ A+ K Sbjct: 473 ---ESTHMHIDYSTLVDVGLAPWLFNLYIDPKEQLPVGHRRNAWLATVLGKLKAHATTFK 529 Query: 541 KYPPR 545 KYP + Sbjct: 530 KYPAK 534 >UniRef50_B8KM61 Steryl-sulfatase n=2 Tax=gamma proteobacterium NOR5-3 RepID=B8KM61_9GAMM Length = 500 Score = 226 bits (576), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 161/475 (33%), Positives = 241/475 (50%), Gaps = 39/475 (8%) Query: 85 KPNVVVFLLDDVGWMDVGFNG-GGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KPNVV+ L D++G+ D+G G GG G PTP ID +AS+G++LT + +P +PTRA + Sbjct: 37 KPNVVLMLSDNMGYGDLGVYGSGGELRGMPTPRIDQLASEGMMLTQFFVEPGCTPTRAAL 96 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 LTG+YS G+ + G P LQ TL +L QGY T GKWH+G K+S P N Sbjct: 97 LTGRYSQRAGLGSIIIAGTPSTLQDSEVTLAELFKSQGYATAMTGKWHLGGEKQSLPINQ 156 Query: 203 GFDDFR-GFNSVSD--MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD++ G +D +Y + E A++ ++ + P KD V VR + + Sbjct: 157 GFDEWHVGILQTTDGVLYPDGMRRSGFSEAAIAKSQTAIWESEP-GKDVVKKVRPYDLE- 214 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 Y ++ + VK++ + AK +PFFLY G H+ P+ + G S A Sbjct: 215 -------YRRHIEGDIAEASVKYIKEQAKEKEPFFLYVGWSHVHYPALPHPDFEGKSSAG 267 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA------EVPPHGRTPFRG 373 +GD ++E++ + +++ G DNT++++ SDNGP + PFRG Sbjct: 268 L-FGDAVMELDYRTGQVLDAIKEAGIEDNTIVIWLSDNGPATTQGSNNDFLGSSAGPFRG 326 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 G EG +RVP + W I+P KS+ +V + D +PT LA GAK VP Sbjct: 327 EVGDALEGSLRVPGMIKWPAKIKPAKSNEMVAIHDFYPT---LANIIGAK----VPTDRA 379 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 IDGVDQ FFLG N QS R++ F+ G++AAVR +++ + P + S Sbjct: 380 IDGVDQGDFFLGKNKQSARESLITFMEGEVAAVRWKQWRIY-----PKQFVASEGNPSLM 434 Query: 494 GTVMQTAGS----SVFNLYTDPQES-DSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 G A SV+N+ DP+E + V +G P + Y + L+KYP Sbjct: 435 GVGAYRAEGMGYPSVYNIARDPREQWNQTAVSAFVLG-PYMQIVGEYQKSLEKYP 488 >UniRef50_C1ZCM0 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCM0_PLALI Length = 509 Score = 219 bits (557), Expect = 3e-55, Method: Compositional matrix adjust. Identities = 150/417 (35%), Positives = 208/417 (49%), Gaps = 28/417 (6%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 L KPN++V + DDVGWM+V GG + +G TP+ID + +G+ TS Y+QPS + Sbjct: 17 LSASAADKPNILVIMADDVGWMNVSSYGGDI-MGIRTPNIDRIGQEGIRFTSFYAQPSCT 75 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKE 196 RA LTGQ + G+ G P GLQ TL ++L +GY T GK H+G+ +E Sbjct: 76 AGRAAFLTGQLPVRTGLTTVGTPGSPAGLQKEDITLAEILKTKGYSTAQFGKNHLGDLEE 135 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS-PDRS---EYIKQLPFSKDDVHAV 252 P GFD++ G ++Y H+N L PDR E+ K+ + V Sbjct: 136 HLPHRHGFDEYFG-----NLY------HLNGNEDLEDPDRPTDPEFRKKFD-PRGVVSGT 183 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-FDNYPNAK 311 G + +T K ME D + + FLD+ AK KPFFL++ + H F ++ Sbjct: 184 ADGPTKDEGPLTTKRMETFDDEIVAKSLDFLDRKAKDQKPFFLWHCSARLHVFFHFKEGV 243 Query: 312 YAGSSPARTS-YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT- 369 S R YGD + E + L LE G NT++V+ +DNG + P G T Sbjct: 244 RGKSRAGREDVYGDALAEHDGHIGQLLAKLEATGLDKNTIVVYVTDNGAYQYMWPEGGTS 303 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGA-----KV 424 PFRG KG+TWEGGVR P V W G + R S IVD+ DL PT AG A K Sbjct: 304 PFRGDKGTTWEGGVRAPCMVRWPGAVGGRVSSEIVDMTDLLPTLASAAGETDAVEKLKKG 363 Query: 425 ANLVPKT--TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 A+ K +DG DQT+ F G + +S RK Y+ L A+R + FK I++ Sbjct: 364 ADYGGKNYKVHLDGYDQTALFTGKSDKSARKFVFYYDETVLTAIRYESFKVTFSIKE 420 >UniRef50_A1WGP9 Sulfatase n=6 Tax=Proteobacteria RepID=A1WGP9_VEREI Length = 470 Score = 212 bits (540), Expect = 3e-53, Method: Compositional matrix adjust. Identities = 145/464 (31%), Positives = 228/464 (49%), Gaps = 41/464 (8%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 PN+V+ + D++GW + G GGG G PTP+IDA+A+QGL L + + PTR+ ++T Sbjct: 34 PNIVLIVADNLGWGEPGCYGGGALRGAPTPNIDALATQGLRLQNFNVESDCVPTRSALMT 93 Query: 146 GQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGF 204 G++ I G L G P GL + TL QLL QGY + GKWH+G+ P + GF Sbjct: 94 GRHPIRTGCLQSVPPGLPQGLTRREITLAQLLSAQGYASAHYGKWHLGDVPGRLPSDRGF 153 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG---EQQAIA 261 D++ G +D V +P V LP+ + R G E + Sbjct: 154 DEWYGIARTTDESQFTSTVGFDPAVV----------DLPW----IMRGRSGQPSENLKVY 199 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTS 321 D+ + +D ++ + F+ + A + +PFFLY HF P+ +AG + A Sbjct: 200 DLDSR--RQIDAELVEQSIAFMRRNASTGRPFFLYLPLIHLHFPTLPHPDFAGRTGA-GD 256 Query: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTWE 380 + D MVE++ + + L++ G +N++++F SDNGPE VP G P+ G + E Sbjct: 257 FADSMVELDHRVGQVVRALDELGAAENSVLIFCSDNGPEFRVPYRGTAGPWSGTYHTAME 316 Query: 381 GGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 G +RVP V W G I R S+ IV + DLF T LAG GA+ +P+ IDGVDQ Sbjct: 317 GSLRVPCIVRWPGHISAARVSNEIVHVTDLFTT---LAGVAGAR----IPQDRPIDGVDQ 369 Query: 440 TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT 499 FFLG S R+ +++ +L AV+ ++K H +P G + Sbjct: 370 LPFFLGRQSASAREGFPFYIKEELRAVKWRDWKLH-FYWEPVVNESKG----------KL 418 Query: 500 AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +FN+ DP+E + + + P+ + A+ + ++P Sbjct: 419 ESPYLFNITRDPKEQMDVMAYNTWVRAPMLKLVKAFQDSFVQHP 462 >UniRef50_A8G0H1 Sulfatase family protein n=5 Tax=Gammaproteobacteria RepID=A8G0H1_SHESH Length = 517 Score = 207 bits (526), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 160/494 (32%), Positives = 243/494 (49%), Gaps = 55/494 (11%) Query: 85 KPNVVVFLLDDVGWMDV-GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 +PNVV +LDDV MD+ ++ G AV TP+ID +A +G++++ Y+Q SS+ R+ Sbjct: 26 QPNVVAIMLDDVTTMDISAYHRGLGAVS--TPNIDRIAERGMMVSDYYAQGSSTAGRSAF 83 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +TGQY I G+ GQPG +GL TL ++L D+GY T +GK H+G+N + P Sbjct: 84 ITGQYPIRTGL---TSVGQPGSTRGLQKEDPTLAEMLKDKGYATVHVGKSHLGDNNDHLP 140 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS----EYIKQLPFSK-DDVHAVRG 254 GFD+F GF + ++H PE P+ I + K DD R Sbjct: 141 TVHGFDEFYGFL----YHLNVMEMHEQPEFPKDPNFKGRGRNMIHTVATDKFDDTVDPRF 196 Query: 255 GE--QQAIAD---ITPKYMEDLDQRWMDYGVKFLDK--MAKSDKPFFLYYGTRGCHFDNY 307 G +Q I+D + K M+ +D ++D+ + +L+K D+P+F++Y H + Sbjct: 197 GVIGKQTISDQGELGAKRMQTVDGEFLDFAINWLEKHEATNDDQPYFMWYNPTRMHQKTH 256 Query: 308 PNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-VPPH 366 +Y G+S T Y D +VE++D L LE G++DNT+I+FTSDNG + P Sbjct: 257 VRPEYQGASQHNTYY-DGLVELDDQIGVLLDKLEATGEIDNTIILFTSDNGVNLDHWPDS 315 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVA 425 G FRG KG+TW+GG RVP V W I Q +DG++ D PT + AG K Sbjct: 316 GAASFRGQKGTTWDGGFRVPMLVSWPAKIPQGEYTDGLMSAEDWVPTIMAAAGDADIKQD 375 Query: 426 NLVPK-------TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 L K IDG +Q L G+SNR ++ L A R+DE+K H+ + Sbjct: 376 LLTGKKINDETYKVHIDGYNQLD-MLTEGGKSNRHEFFFYNENSLNAFRVDEWKVHLKTK 434 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES--DSIG----VRHIPMGVP-LQTE 531 + + G + N+ DP E D+ G ++ +P L Sbjct: 435 TEWIAPADEWPLGM-----------ILNIKADPFERSPDTRGWFLWMKEKTWVLPKLLKA 483 Query: 532 MHAYMEILKKYPPR 545 + + + LK +PPR Sbjct: 484 VGKHQQSLKAFPPR 497 >UniRef50_C7ZGP1 Predicted protein n=3 Tax=Leotiomyceta RepID=C7ZGP1_NECH7 Length = 446 Score = 203 bits (517), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 138/437 (31%), Positives = 217/437 (49%), Gaps = 36/437 (8%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+V+ L D++GW ++G GGG+ G TP ID +A++GL+L + + PTR+ ++ Sbjct: 7 KPNIVLILADNLGWGELGCYGGGILRGAATPRIDKLATEGLLLHNFNVESDCVPTRSALM 66 Query: 145 TGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 TG++ I G G P GL + TLP+ L QGY T GKWH+G+ P + G Sbjct: 67 TGRHPIRTGCRQSVPAGFPQGLTRWERTLPECLKPQGYATAHHGKWHLGDIPGRYPSDRG 126 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 FD++ G +D + PEVA +LP+ + A + E I D+ Sbjct: 127 FDEWLGIPRTTDESQFTSALGYAPEVA----------ELPYIMKGI-AGQDSENICIYDL 175 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYG 323 + + +D+ +D +L + K++KPFFLY+ HF P+ + G + + + Sbjct: 176 EKRRL--IDEMLVDQSKDWLSRQVKAEKPFFLYHPLVHLHFPTLPHRDFEGKT-GQGEFA 232 Query: 324 DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTWEGG 382 D M EM+ L L+ G DNT+++F SDNGPE P G P+ G + EG Sbjct: 233 DSMAEMDYRVGELIDHLDSLGVSDNTVLIFASDNGPEFRPPYKGTAGPWSGTYHTAMEGS 292 Query: 383 VRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS 441 +RVP + W G + S+ V + D+F T L++AG + VP IDG+ Q S Sbjct: 293 LRVPFIIRWPGHVPTGVTSNETVHVTDIFTTILEIAG-------SEVPSDRPIDGISQVS 345 Query: 442 FFLG-TNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 FF + +S R+ +++ +L AV+ ++K H LI +P SG + Sbjct: 346 FFKDPSTVKSQREGFLFYIKEELRAVKWKDWKLH-LIWEPKVNQSSG----------KLE 394 Query: 501 GSSVFNLYTDPQESDSI 517 +FN+ DP+E I Sbjct: 395 SPYLFNVVRDPKEETDI 411 >UniRef50_A0Z7U6 Arylsulfatase n=2 Tax=Gammaproteobacteria RepID=A0Z7U6_9GAMM Length = 512 Score = 200 bits (508), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 149/494 (30%), Positives = 238/494 (48%), Gaps = 53/494 (10%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN ++ DDVG+ +V G+ +G TP+ID++A G++ T AY + S + RA + Sbjct: 28 KPNFLMLWGDDVGYWNVSAYNQGM-MGYETPNIDSIAKDGMLFTHAYGEQSCTAGRAAFV 86 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TGQ G+L G PG +G+ T+ + L +GY+T GK H+G+ E P Sbjct: 87 TGQSGFRTGLLK---VGLPGAKEGMDQRDPTIAEYLKSKGYMTGQFGKNHLGDRDEHLPT 143 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS-----KDDVHAVRGG 255 N GFD+F + ++Y H+N E P+ +Y K F + + + G Sbjct: 144 NHGFDEF-----IGNLY------HLNAEE--EPEHPDYPKDPAFREKFGPRGVIKSSSDG 190 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + +T K ME +D+ + FL++ K+D+PFFL+Y T H + G Sbjct: 191 RIEDTGPLTKKRMETIDEEVTAAALDFLERAVKADQPFFLWYNTTRMHVHTRLKPESEGV 250 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGA 374 + + D MVE + + + L++ G DNT++++T+DNG E P G T PFRG Sbjct: 251 T-GLGVFPDGMVEHDGMIGQMLDKLDELGITDNTVVMYTTDNGAEKFTWPDGGTAPFRGE 309 Query: 375 KGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 K + WEGG RVP V W G+I+P +S+GIV D FPT G K + + F Sbjct: 310 KNTNWEGGYRVPLLVKWPGLIEPGSRSNGIVSHMDWFPTIAAALGDTDLK-EQVSKGSAF 368 Query: 434 --------IDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAVRMDEFKYHVLIQQPYAYT 484 +DG + ++ G +S R YF +G L +R +K Q+ +++ Sbjct: 369 GEGNSKVHLDGYNMLPYWGGETDESPRAEFFYFSDDGNLVGMRYQRWKAVFAEQRAHSFD 428 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDP---QESDSIGVR-----HIPMGVPLQTEMHAYM 536 + +Q +F+LY+DP E +SI + H+ + VP QT + ++ Sbjct: 429 V------WADPFVQLRVPKIFDLYSDPFEEAEHESIHYKDWWFQHVFLLVPAQTYVGEFL 482 Query: 537 EILKKYPPRAQIKS 550 +YPPR + S Sbjct: 483 GTFVEYPPRQKPAS 496 >UniRef50_B8KV72 Arylsulfatase A n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KV72_9GAMM Length = 535 Score = 199 bits (507), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 147/508 (28%), Positives = 235/508 (46%), Gaps = 61/508 (12%) Query: 67 QDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLI 126 QD ++L L + ++PN++V L DD+GW ++G GGG G PTP +D +A +G+ Sbjct: 51 QDAAVDKQLRSLTARFERRPNILVILADDIGWGELGSYGGGKLKGAPTPALDQMADEGMR 110 Query: 127 LTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQA 185 S Y++PS +PTR ++TG++ + G+ GQ GL TL ++L + GY T Sbjct: 111 FLSHYTEPSCTPTRVALMTGRHPVRTGLDEVLFPGQVKGLVADEVTLAEVLSEAGYATGM 170 Query: 186 IGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRD--VHVNPEVALSPDRSEYIKQLP 243 GKWH+GE +E QPQ GF D+ +N + WR+ H + E Y +P Sbjct: 171 FGKWHLGELQEHQPQYQGF-DYAYYNLYNGAIWPWRENATHYDTENDTGITGPPYFIDIP 229 Query: 244 FSKDD---------VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFF 294 + ++ + A R + I ++ D D + F+ ++ PFF Sbjct: 230 EAYEETFDIPLHGIMRAKRNTPAEEIDPLSLSRFNTFDNELTDEVIAFMRDQHEAGIPFF 289 Query: 295 LYYGTRGCHFDNYP--NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 Y+ T + P ++ Y S + +V+ + A L+++L+ G +NTL++ Sbjct: 290 AYFATNTQQVFSCPDVDSPYLDKSNCQARQ---LVQHDKNMARLFESLDNMGIDENTLVL 346 Query: 353 FTSDNGPEAEV-PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLF 410 + SDNGP + P G + RG K +EGGVR P W G I P ++ IV ++D + Sbjct: 347 WISDNGPMNKFYPSTGFSWLRGYKSEVYEGGVRTPGIAKWPGSIAPGQTPIDIVHVSDWY 406 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL----------- 459 T +LA GAK A +P IDGVDQ S G S R ++ Sbjct: 407 TTIANLA---GAKAA--IPDDRVIDGVDQRSLLFNGEGYSRRDYVFFYRYIAYKNLSSTG 461 Query: 460 -NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 L+A+RM + K+H+ QSG ++NL DP ES Sbjct: 462 PASMLSAIRMGDIKFHL---------QSG---------------EIYNLLRDPVESHPGR 497 Query: 519 VRHIPMGVPLQTEMHAYMEILKKYPPRA 546 ++ P++ + + ++KKYP R Sbjct: 498 REYLWAMQPIRRMIWEHRAMMKKYPNRV 525 >UniRef50_A6UG37 Sulfatase n=16 Tax=Bacteria RepID=A6UG37_SINMW Length = 552 Score = 196 bits (497), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 144/489 (29%), Positives = 237/489 (48%), Gaps = 53/489 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 +GK PN++V DD+G + G+ +G TP+ID +A++G I T AY Q S + RA Sbjct: 55 SGKTPNILVIFGDDIGIPQISAYTMGL-MGYRTPNIDRIAAEGAIFTDAYGQQSCTAGRA 113 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 + + GQ G+L M G P G+Q + T+ ++ +GY T GK H+G+ E P Sbjct: 114 SFILGQEPFRTGLLTIGMPGDPHGIQDWMPTIADVMKSKGYATGQFGKNHLGDRDEHLPT 173 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVN----PEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 N GFD+F G ++Y H+N PE P E+ K + + + G+ Sbjct: 174 NHGFDEFFG-----NLY------HLNAEEEPEGYFYPKDEEFRKNFG-PRGVIKSSADGK 221 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + + K ME +D+ ++ F+D+ AK+DKPFF ++ + H + + G + Sbjct: 222 IEDTGALNTKRMETVDEEFLAAAKDFIDRQAKADKPFFCWFNSTRMHVFTHLKPESMGKT 281 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAK 375 + + D MVE + L + L+ G +NT++++T+DNG E + P G T F G K Sbjct: 282 -GKGIHADGMVEHDGHVGQLLQQLDDLGITENTIVLYTTDNGAELALWPDGAMTMFHGEK 340 Query: 376 GSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF- 433 G+TWEGG R+P V W G+++P + + V L D PT AG P K KT F Sbjct: 341 GTTWEGGFRIPMMVRWPGVVKPGTQINDPVTLMDWMPTFATAAGIPDVKEEM---KTGFK 397 Query: 434 ---------IDGVDQTSFFLGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAY 483 +DG D T+ G + R+A +YF G L A+R +++K + Sbjct: 398 SGDKTFKVHLDGYDLTALLKGEAEEPPREAVYYFDQGGNLNAIRWNDWKLSFAVNS---- 453 Query: 484 TQSGYQGGFTGTVMQT-AGSSVFNLYTDPQESDS--------IGVRHIPMGVPLQTEMHA 534 +G +T + +++ NL DP E + R++ + VP+Q+++ Sbjct: 454 -----EGNIATATRETPSWANIANLRMDPYERGTKEGGGAMEFIARNMWLLVPIQSKIKE 508 Query: 535 YMEILKKYP 543 + + +YP Sbjct: 509 FFQDFDQYP 517 >UniRef50_Q1CY93 Sulfatase family protein n=4 Tax=Bacteria RepID=Q1CY93_MYXXD Length = 553 Score = 195 bits (495), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 158/492 (32%), Positives = 229/492 (46%), Gaps = 37/492 (7%) Query: 81 KTGKKPNVVVFLLDDVG-WMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 K +KPN++V DD+G W +N G +G TP+ID +A +G ++T Y Q S + Sbjct: 16 KQSRKPNILVIWGDDIGIWNISAYNQG--MMGYFTPNIDRIAKEGAMMTDCYGQQSCTAG 73 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKESQ 198 RA +TG + G+ M G GLQ T+ ++L GY GK H+G++ Sbjct: 74 RAAFITGMNPLRTGLTTIGMPGAKYGLQDSDPTIAEMLKPLGYTCGHFGKNHVGDSNPYL 133 Query: 199 PQNVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD-VHAVRGG 255 P GFD+F G ++ ++ E D +P ++ +DD R G Sbjct: 134 PTVHGFDEFFGNLYHLNAEGEPECPDYPKDPTFKERFGPRGVLRSWATDRDDPTEDKRWG 193 Query: 256 --EQQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 +Q I D +T K ME +D ++ + F+++ K KPFFL++ T H Y Sbjct: 194 VVGKQRIEDTGALTRKRMETVDGEFLQGTLDFMERAVKDGKPFFLWHNTTRTHVWTYLQE 253 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-VPPHGRT 369 KY ++ Y D M E++D+ L L++ G DNTL+VF++DNG E P G + Sbjct: 254 KYRNAT-GYGLYADAMRELDDIVGVLLAKLDELGIADNTLVVFSTDNGVEKMGWPDGGNS 312 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 PFRG KGSTWEGGVRVP V W G+++P R + I D PT + AG P VA Sbjct: 313 PFRGEKGSTWEGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDLVAQCQ 372 Query: 429 ------PKT--TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 KT ++DG DQT G + + +G LAAVR D++K Q+ Sbjct: 373 RGYKAGDKTFRVYLDGYDQTGLLAGKEKGPRHEFIYVLDSGNLAAVRYDDWKLIFSYQE- 431 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES--------DSIGVRHIPMGVPLQTEM 532 G F+G A + NL +DP E G R VP Q + Sbjct: 432 ----GEGPDMWFSGKRFDPAWPYLINLRSDPFEYGPKAGLYLKWYGERMFTF-VPAQALV 486 Query: 533 HAYMEILKKYPP 544 + + L YPP Sbjct: 487 QKFAQSLLDYPP 498 >UniRef50_Q46SG5 Arylsulfatase n=3 Tax=Proteobacteria RepID=Q46SG5_RALEJ Length = 542 Score = 194 bits (492), Expect = 9e-48, Method: Compositional matrix adjust. Identities = 150/500 (30%), Positives = 239/500 (47%), Gaps = 49/500 (9%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PN++V DD+GW +V G GV +G TP+ID++ +G+ T Y+QPS + RA + Sbjct: 29 RPNILVIWGDDIGWENVSAYGMGV-MGYTTPNIDSIGMEGIRFTDQYAQPSCTAGRAAFI 87 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TGQY I G+ GQPG G +L +++ GY T GK HMG+ P Sbjct: 88 TGQYPIRSGMTT---VGQPGDKLGWQPASPSLGEVMKQAGYRTGFFGKSHMGDRNSHLPT 144 Query: 201 NVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE-- 256 GFD+F G ++ ++ E D D++ K P +A + Sbjct: 145 VHGFDEFFGNLYHLNTEELPENHDYQAYANGYPGGDKAFAQKFAPRGVLHTYATDNDDPT 204 Query: 257 ---------QQAIAD---ITPKYMEDLDQ-RWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 +Q I D +T K MED D + + F+ + DKPFF++ T H Sbjct: 205 DMPRFGPVGKQKIEDTGPLTKKRMEDFDAAEVIPKAIDFMQGAKQKDKPFFVWLNTSRMH 264 Query: 304 FDNYPNAKYAGSSPARTS----YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 + N K+ ++ T G M++ + + + L+++G NT++ +++DNGP Sbjct: 265 LYTHLNDKWRYAAAKYTHEDDMQGSGMLQHDHDIGLVLEYLKRSGLDKNTIVWYSTDNGP 324 Query: 360 EAEVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLA 417 E PHG T PFRG K +T+EGGVRV + + W G+I+P + +GI D+F T +A Sbjct: 325 EHVSWPHGSTTPFRGEKMTTYEGGVRVVSMLRWPGVIKPGQIKNGIQAHQDMFTTFAAIA 384 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 G P K +IDG++ ++ G S RK Y+ KL AVRM +K H + Sbjct: 385 GVPDVVGQMKREKHQYIDGINNLDYWTGKTADSARKDFLYYYENKLTAVRMGPWKLHFSL 444 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES----DSIG---VRHIPMGVPLQT 530 ++ Y GT+ + + +FNL +DP ES D+ G + + P+ Sbjct: 445 KEDY-----------YGTLQPRSVTMLFNLRSDPFESYDSKDAYGHLLQKAQWISGPMNE 493 Query: 531 EMHAYMEILKKYPPRAQIKS 550 + ++++ + YPP KS Sbjct: 494 LIASHLKTIADYPPVQPAKS 513 >UniRef50_A6QA55 Arylsulfatase n=5 Tax=Bacteria RepID=A6QA55_SULNB Length = 528 Score = 192 bits (487), Expect = 4e-47, Method: Compositional matrix adjust. Identities = 151/497 (30%), Positives = 240/497 (48%), Gaps = 53/497 (10%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KKPN++V DD+GW +V G G +G TP+ID++ +G+ T Y+QPS + RA+ Sbjct: 28 KKPNILVIWGDDIGWQNVSAYGMGT-MGYTTPNIDSIGMEGIRFTDHYAQPSCTAGRASF 86 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +TGQY I G+ GQPG GL L +++ +QGY T GK H+G+N P Sbjct: 87 ITGQYPIRSGMTT---VGQPGDPLGLKPESPCLAEVMKEQGYTTGQFGKNHLGDNNMHLP 143 Query: 200 QNVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV----- 252 GFD+F G ++ + E RD + A S Y K+ ++ +H+ Sbjct: 144 TVHGFDEFYGNLYHLNTQEEAEQRDYQRFAK-AYSGSVEAYEKKFG-TRGVIHSFATDKD 201 Query: 253 ------RGGE--QQAIAD---ITPKYMEDLDQR-WMDYGVKFLDKMAKSDKPFFLYYGTR 300 R G+ +Q I D +T + M++ D++ + F+ + K KPFF++ T Sbjct: 202 DPTVDPRFGKVGKQIIEDTGPLTQERMKEFDEKEVIPRAFDFMIRAKKEGKPFFVWLNTT 261 Query: 301 GCHFDNYPNAKYAGSSPARTS----YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H N K+ ++ TS +G M++ + + L+KN +T++ +++D Sbjct: 262 RMHLYTRLNDKWRYAAEKFTSEVDVHGSGMLQHDHDIGLVLDFLKKNDLEKDTIVWYSTD 321 Query: 357 NGPEAEVPPHG-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTAL 414 NGPE PHG TPF+ K +TWEGGVRV + + W G I+ + +GI D+F T Sbjct: 322 NGPEHSAWPHGATTPFKSEKMTTWEGGVRVISMIKWPGHIKKGQILNGIQSHMDMFTTLA 381 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 AG + K +IDG++ ++ G + +S R + Y+ KL+AVRM +K+ Sbjct: 382 AAAGVDNVAEKMMKEKKQYIDGLNNLDYWTGKSKKSARNSIFYYYESKLSAVRMGPWKFL 441 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES--DSIGVRHIPMGV-----P 527 ++ Y G ++ V NL DP ES D H+ V P Sbjct: 442 FSTKKDYY-----------GNLVPRTVPIVVNLRMDPFESYTDKESYGHLLQKVSWLMSP 490 Query: 528 LQTEMHAYMEILKKYPP 544 + M A+++ L YPP Sbjct: 491 MGEMMAAHLKTLADYPP 507 >UniRef50_Q4RJR3 Chromosome 13 SCAF15035, whole genome shotgun sequence. (Fragment) n=1 Tax=Tetraodon nigroviridis RepID=Q4RJR3_TETNG Length = 474 Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 145/477 (30%), Positives = 227/477 (47%), Gaps = 78/477 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 T PN V+ DD+G+ D+G G ++ TP++D +A+ GL T Y + P SP+R Sbjct: 18 TSLPPNFVLLFADDLGFGDLGCYGHPTSL---TPNLDGLAAGGLRFTDFYCTSPVCSPSR 74 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMG---E 193 A++LTG+Y G+ +Y PG GL TT+ ++L +GY T A+GKWH+G + Sbjct: 75 ASLLTGRYQTRSGVYPGVLY--PGSRGGLPLNETTIAEVLKPRGYATAAVGKWHLGGPCQ 132 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 N P +V +V+ V L D E IKQ P + Sbjct: 133 NLTCFPPDVKCFGLCDVGTVT--------------VPLMHD--EVIKQQPVN-------- 168 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 DL++ + D+ F+ AK +PFFLY+ + H+ Y A Sbjct: 169 --------------FLDLEKAYSDFAKDFITTSAKRKQPFFLYFPSHHTHYPQYAGPGAA 214 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PF 371 G S R +GD ++E + +L TLE+ G ++NTLI FTSDNGPE G P Sbjct: 215 GKS-LRGPFGDALLEFDQTIGSLLATLERTGVINNTLIFFTSDNGPELMRMSRGGNAGPL 273 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 R KG+T+EGG+R P YW+G+IQP + + D+ PT LA GAK+ ++ Sbjct: 274 RCGKGTTYEGGMREPAIAYWQGLIQPGVTHEMASTLDILPTFASLA---GAKLPQVM--- 327 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYF------LNGKLAAVRMDEFKYHVLIQ-QPYAYT 484 +DGVD T+ + G+S R+A ++ NG L A+R++++K H Q ++ T Sbjct: 328 --LDGVDMTNILF-SQGKSKREAMMFYPTDPSEKNG-LFAIRLEKYKAHFYTQGASHSST 383 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++ +F+L DP E H P+ + + ++ + ++K Sbjct: 384 TPDQDCSIFASLKAHDPPLLFDLEADPSE-------HYPLSLDDRPDLQEVLGRIRK 433 >UniRef50_Q1QJ61 Sulfatase n=3 Tax=Bacteria RepID=Q1QJ61_NITHX Length = 496 Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 127/441 (28%), Positives = 210/441 (47%), Gaps = 36/441 (8%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 PNVV FL+D++G+ ++G GGG+ G T IDA A +G+ L + + +P+R+ ++T Sbjct: 49 PNVVYFLVDNLGYGELGCYGGGILRGADTRRIDAFADEGIKLLNFAPEAQCTPSRSALMT 108 Query: 146 GQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGF 204 G+Y+I G + G+ GGL T+ +L +GY T +GKWH+GE+ P + GF Sbjct: 109 GRYAIRSGNHTVALPGEEGGLVAWERTMGDVLSARGYATACVGKWHVGESAGRWPTDHGF 168 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 D++ G W + + P R L K D + + + Sbjct: 169 DEWYGPPR------SWDESLWPTDPWYDPKRDPVSNMLESRKGDR------TPRTVKQLD 216 Query: 265 PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGD 324 D+D+ + G F+ + + + FFLY+ H P A++ G S + + D Sbjct: 217 LNVRRDVDRELLTRGKAFMKRSVDAKRSFFLYFNHSLMHMPTIPRAEFRGKS-GQGDWAD 275 Query: 325 CMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKGSTWEGG 382 C++E++ F + TL++ DNT++VF+ DNGPE E+ P TP F G+ + EG Sbjct: 276 CLLELDSDFGEILDTLKELKVDDNTIVVFSGDNGPE-ELEPWRGTPGFFDGSYFTGMEGS 334 Query: 383 VRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS 441 +R P V + G + P ++S+ IV + D+F L AG +P IDG+DQ + Sbjct: 335 LRTPCMVRYPGRVPPGKQSNDIVHITDMFTIILQWAGA-------AMPTDRVIDGIDQRA 387 Query: 442 FFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG 501 FF G S R Y++ L V+ FK +Q+ T ++ + Sbjct: 388 FFEGKQNNSARDGIPYWMADTLYGVKWRNFKMVFYLQKT-----------LTEPALKLST 436 Query: 502 SSVFNLYTDPQESDSIGVRHI 522 + NL DP+E + + +I Sbjct: 437 PHIINLTVDPKERKAFDLPYI 457 >UniRef50_Q0KB87 Arylsulfatase A or related enzyme n=107 Tax=cellular organisms RepID=Q0KB87_RALEH Length = 585 Score = 187 bits (474), Expect = 1e-45, Method: Compositional matrix adjust. Identities = 146/460 (31%), Positives = 217/460 (47%), Gaps = 48/460 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 +GKKPN++V DD+G ++ GV VG+ TP+ID +A +G+I T Y++ S + R+ Sbjct: 88 SGKKPNILVIFGDDIGQTNISAYSMGV-VGHRTPNIDRIAREGMIFTDYYAENSCTAGRS 146 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKES 197 + +TGQ + G+ G PG GL T+ + L GY T GK H+G+ E Sbjct: 147 SFITGQSPLRTGL---SKVGAPGATVGLQARDVTIAEALKPLGYATGQFGKNHLGDRDEY 203 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL--PFSKD-----DVH 250 P GFD+F G ++Y H+N E P R + K PF K+ +H Sbjct: 204 LPTKHGFDEFYG-----NLY------HLNAEE--EPQRPYWPKDKNDPFVKNFSPRGVLH 250 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 + G+ + +T K ME +D D KF+ K ++DKPFF++ T H + Sbjct: 251 STADGKIEDTGALTTKRMETIDDETTDAAQKFITKQVQADKPFFVWMNTTRMHAFTHVRP 310 Query: 311 KYAGSS--PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP-EAEVPPHG 367 G S P Y D M+E + L KTL+ DNT++++T+DNGP + P Sbjct: 311 SMQGQSGMPG-NDYADGMIEHDGDVGKLLKTLDDLKIADNTIVIYTTDNGPNQWSWPDAA 369 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVAN 426 TPFR K + WEG RVP + W G I+ S+ + D FPT L G K Sbjct: 370 STPFRSEKNTNWEGAFRVPAMIRWPGKIKAGTVSNEMFSGLDWFPTLLAAVGDGDIK-ER 428 Query: 427 LVPKTTF--------IDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAVRMDEFKYHVLI 477 L+ T+ +DG +Q ++ G + RK +YF +G L A+R D++K V Sbjct: 429 LLKGTSLGSKNAKVHLDGYNQLAYLTGQTNKGARKEFYYFNDDGVLVAMRYDDWKV-VFC 487 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 +Q +Q F + +FNL DP E I Sbjct: 488 EQTTPGGFQVWQDPFKCLRV----PKIFNLRMDPYERADI 523 >UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS23_9BACT Length = 635 Score = 184 bits (468), Expect = 6e-45, Method: Compositional matrix adjust. Identities = 146/454 (32%), Positives = 219/454 (48%), Gaps = 57/454 (12%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 T +KPN++ L DD+G+ D+G G + N TP++D +A +G+ LTS Y+ P +P+RA Sbjct: 21 TSQKPNIIFILADDMGYGDIGPFGSTL---NRTPNLDRMAKEGMKLTSFYAAPLCTPSRA 77 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLTT----LPQLLHDQGYVTQAIGKWHMGENKES 197 ILTG Y+ + P P GL T + +LL QGY T AIGKWH+G+ E+ Sbjct: 78 QILTGCYAKRVSL---PKVLSPRSEVGLNTNEQTVAKLLKRQGYATMAIGKWHVGDAPEN 134 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQ-----LPFSKDDVHAV 252 P GFD + G +DM E P + + K+ LP +D Sbjct: 135 LPTRHGFDHYLGLPYSNDMGGE------------EPGKDQPAKRGARPPLPLVRD----- 177 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 +Q I + P + L +R+ D VKF+ A +PFFLY H +P + Sbjct: 178 ----EQVIEVVKPADQDRLTERYTDEAVKFI--RANDKQPFFLYLAHTAVHAPIHPGHNF 231 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--P 370 G S YGD + E++ + TL + G +NTL++F+SDNGP +G T P Sbjct: 232 RGKS-RNGLYGDWVEEVDWSVGKVLDTLRELGLSENTLVLFSSDNGPWLAQKTNGGTAGP 290 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVP 429 RG KG T+EGG+R PT +W G + + D + DL PT + LAG +P Sbjct: 291 LRGGKGGTFEGGMREPTLAWWPGKVPAQSVCDTVAGNIDLLPTFVKLAG-------GTLP 343 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP--YAYTQSG 487 K IDG D ++ LG ++ R+A +YF L AVR +K ++ Q ++++ Sbjct: 344 KDKKIDGRDISNLLLGQTKEAQREAHYYFAGTALQAVRSGPWKLAIVPQYEGMGKFSENA 403 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 +GG + ++NL D E + H Sbjct: 404 VEGG------KPFAPRLYNLDEDIGEKTDVVAEH 431 >UniRef50_Q488V4 Sulfatase family protein n=30 Tax=Bacteria RepID=Q488V4_COLP3 Length = 525 Score = 184 bits (466), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 131/415 (31%), Positives = 202/415 (48%), Gaps = 36/415 (8%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PN++ DD+G ++ G+ +G T +ID +A +G++ T Y + S + RA Sbjct: 37 ERPNILAIWGDDIGQSNISAYTHGM-MGYKTTNIDRIAKEGVLFTDYYGENSCTAGRAAF 95 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +TGQY + G+ G PG +GL T+ +LL D+GYVT GK H+G+ E P Sbjct: 96 ITGQYPVRTGL---TKVGLPGSDKGLRAEDVTIAELLKDRGYVTGQFGKNHLGDKDEFLP 152 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD-----DVHAVRG 254 N GFD+F G ++Y H+N E P+ +Y K + K +H+ Sbjct: 153 TNHGFDEFLG-----NLY------HLNAEE--EPEHPDYPKDQAYKKRFGPRGVIHSFAD 199 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 G+ + +T K ME +D ++ KF+DK K++KPFF+++ H + + G Sbjct: 200 GKIEDSGPLTKKRMETIDDEFLAATTKFIDKAHKNNKPFFVWFNATRMHIWTHLKEESKG 259 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRG 373 S YGD M+E + L L++ DNT++++T+DNG E P G T PF+G Sbjct: 260 LSKRGGIYGDGMMEHDYQVGVLLDQLDRLAIADNTIVLYTTDNGAEVFSWPDGGTIPFKG 319 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAK-------VA 425 K +TWEGG RVP V W G I + +V D PT L AG K Sbjct: 320 EKNTTWEGGFRVPAMVRWPGKITAGDAKIEMVSHMDWAPTLLAAAGVTDIKEKLKQGTTV 379 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQ 479 N +DG + + G ++ R + YF + G L+AVR + K IQ+ Sbjct: 380 NGKKYKVHLDGYNLLPYLTGATDEAPRPSYLYFTDGGDLSAVRFGDMKLQYSIQE 434 >UniRef50_B8KM62 N-acetylgalactosamine-6-sulfatase n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KM62_9GAMM Length = 472 Score = 183 bits (464), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 132/441 (29%), Positives = 213/441 (48%), Gaps = 38/441 (8%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+V+ +D+ G+ ++G GGG+ G TP ID +AS+G+ LT+ + +P+RA ++ Sbjct: 9 KPNIVLINMDNFGYGELGVYGGGIVRGGATPRIDKLASEGIRLTNFNVEAQCTPSRAALM 68 Query: 145 TGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 TG+Y++ G P+ GL Q T+P++L D GY T GKW++G+ + P N G Sbjct: 69 TGRYAVRTGNGTVPLQTVDYGLTQWEYTMPEMLSDAGYATAHFGKWNLGQREGRYPTNQG 128 Query: 204 FDDFRGFNSVSD--------MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 FD++ G + +D M+ +W ++A ++ IK+ + +G Sbjct: 129 FDEWYGIPNSTDESEWPTNEMFLKW------AKIAKETGKTPMIKETHV----LSGRKGS 178 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + + ++D+ D G F+ + AK+ KPFFLY H P+A++ G Sbjct: 179 PTKEVKVFDSSVRPEIDREVTDLGKDFMTRQAKAGKPFFLYLPYTQTHAPVTPSAEFKGK 238 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGA 374 S +GD +++++ L +++ G DNT+ +FT+DNG E G P+ G+ Sbjct: 239 S-GNGKWGDILMQIDAYTGELLDKVDELGIADNTIFIFTADNGGEMTPTFQGWNGPWSGS 297 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 + EG +RVP V W G + K S+ IV DLF T ++AG VP Sbjct: 298 YFTGMEGSLRVPFIVRWPGKVPAGKVSNEIVHEFDLFSTFANIAG-------GKVPTDRI 350 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 ID D T FFLG QS R ++ + V+ +K ++ Q+ G G Sbjct: 351 IDSKDMTDFFLGKQEQSGRDGFVIYVGDDIFGVKWQNYK--MMFQE-----LDGGNGSNK 403 Query: 494 GTVMQTAGSSVFNLYTDPQES 514 V FNLY DP+E Sbjct: 404 LNVFPFV--RFFNLYEDPKEE 422 >UniRef50_A0Z6R0 Putative arylsulfatase n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z6R0_9GAMM Length = 466 Score = 183 bits (464), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 137/439 (31%), Positives = 209/439 (47%), Gaps = 43/439 (9%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 K NVV+ L+D+ G+ ++G GGGV G PTP ID++A +GL LT+ + +P+R+ + Sbjct: 25 KPANVVLVLMDNFGYGEIGVYGGGVMRGAPTPRIDSIAKEGLQLTNFNVEAECTPSRSAL 84 Query: 144 LTGQYSI--HHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +TG+Y I PP G + TL +LL D GY T GKWH+G+ + P + Sbjct: 85 MTGRYGIRTRQRANQPPRGVWYGITKWEVTLAELLSDAGYATGIFGKWHLGDTEGRYPTD 144 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG--GEQQ- 258 GFD++ G SD A PD + + S H + GEQ Sbjct: 145 QGFDEWIGLPRSSDR-------------AFWPDSNSFQPNSHPSAKFTHVMSASKGEQPV 191 Query: 259 --AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 A+ D + + +D+ D + F+ +M+ KPFF Y H P+ + GS+ Sbjct: 192 EGAVYDRAKRAI--IDREITDQAIDFMTRMSGKGKPFFAYLPYTQTHEPVDPHPDFYGST 249 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAK 375 S+ D + + + L T+E G ++T+ +FTSDNG E G T P+R Sbjct: 250 -GNGSFADVLAQTDVYVGELLDTVESLGIREDTIFIFTSDNGREGVPRSFGFTGPWRSGM 308 Query: 376 GSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 S +EG +RVP V W G I P R S+ IV D+F T G +P I Sbjct: 309 FSPYEGSLRVPFLVRWPGKIPPGRVSNEIVHQMDVFSTVASFTGVD-------IPTDRVI 361 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DGVDQ++FF G +S R + ++ L + +K +L+++ + GY Sbjct: 362 DGVDQSNFFRGKTEKSARDSLVIYIGNTLFGAKWRNWK--ILLRE---MDEDGYG----- 411 Query: 495 TVMQTAGSSVFNLYTDPQE 513 + + A SV+NL DP+E Sbjct: 412 -IKEMAYPSVYNLIVDPKE 429 >UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=Q0C069_HYPNA Length = 505 Score = 181 bits (458), Expect = 8e-44, Method: Compositional matrix adjust. Identities = 147/497 (29%), Positives = 232/497 (46%), Gaps = 68/497 (13%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS- 132 +AE E ++PN+V+ +DD+G+ D+G G +A TP++D +A +G TS Y+ Sbjct: 34 SVAEKEAAASEQPNIVLIFVDDMGYADIGSFGSPIAR---TPNLDRLAMEGQKWTSFYAP 90 Query: 133 QPSSSPTRATILTGQYSIHHG---------ILMPPMYGQPGGLQGLTTLPQLLHDQGYVT 183 P +P+RA ++TG+ ++ G +L P G G Q T+ +LL +GYV+ Sbjct: 91 APVCTPSRAGLMTGRLAVRSGMAGLVQARHVLFPTSTG--GLPQSEVTIAELLQQEGYVS 148 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDM------YTEWR-DVHVNPEVALSPDRS 236 A GKWHMG E P + GF + G +DM T W D+ P P+ Sbjct: 149 AAFGKWHMGHLPEFLPTSHGFQSYFGIPYSNDMNMPGGGETPWSIDLFFEP-----PNIQ 203 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + +P +D+ R +Q L QR+ + ++F++ +PFFLY Sbjct: 204 NW--DVPLMQDEEIIERPADQFT-----------LTQRYTERAIEFMETSHAEGQPFFLY 250 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H + + + G S A +YGD + E++ + L+ NTL++FTSD Sbjct: 251 LAHNMPHTPLFTSEGFTGVS-AGGAYGDVIEELDWSVGEIVDALKDMKIEKNTLVIFTSD 309 Query: 357 NGPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTAL 414 NGP + H + R KG+TWEGG+RVP +W G I PR + DL PT Sbjct: 310 NGPWLAMKTHSGSAGMLRDGKGTTWEGGMRVPAIFWWPGQIAPRTVTDLGSALDLMPT-- 367 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 A GA+ +P+ DG D S L + G S R+ +Y+ + AVR ++K H Sbjct: 368 -FAAISGAR----LPEDRVYDGFD-LSPALFSEGSSPRETLYYYRFTDVFAVRKGKYKAH 421 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH--IPMGVPLQTEM 532 ++ G GG T ++T ++++ DP E +I +H I M + + E Sbjct: 422 --------FSTYGAFGGSGRTELET--PELYDIEADPSEQFNIAAQHPEIVMELKVLAEK 471 Query: 533 HA-----YMEILKKYPP 544 A L++YPP Sbjct: 472 QAASVEPVENQLERYPP 488 >UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968C90 Length = 525 Score = 180 bits (456), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 146/467 (31%), Positives = 219/467 (46%), Gaps = 64/467 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 +KPN V +DD+G+ DV G TP+IDA+A++G+ T Y+ P SSP+RA Sbjct: 74 EKPNFVFIYMDDMGYSDVSCYG---ETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAG 130 Query: 143 ILTGQYSIHHGI---LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 LTG+Y GI P Y G T+ ++L QGY T IGKWH+G ++ P Sbjct: 131 FLTGRYPARMGIQGVFYPDSY--TGMAPEEVTMAEVLKVQGYATACIGKWHLGSREKYLP 188 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD++ G +DM + +V L + E F D Sbjct: 189 LQQGFDEYFGIPYSNDM---------SAQVYLRGNEVE-----EFHID------------ 222 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 I ++T KY E+ V ++ + K+D+PFFL+ H Y + ++AG S A Sbjct: 223 INNVTKKYTEE--------AVDYIRR--KADQPFFLFLAHSMMHVPIYVSDEFAGKSGAG 272 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA-EVPPHGRT-PFRGAKGS 377 YGD ++E++ + +TL + G DNTL+VFTSDNGP E P GR P R K + Sbjct: 273 I-YGDAVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNGPWLQEGPLGGRALPLREGKTT 331 Query: 378 TWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 +EGGVRVP YWKG I+P + +V L D FPT L+G ++P +DG Sbjct: 332 AFEGGVRVPCIAYWKGQIKPVVNTDVVSLLDWFPTVTALSG-------GILPDVR-LDGY 383 Query: 438 DQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVM 497 D T+ GT +++ ++ N + R ++K I P G +G F Sbjct: 384 DLTAVLNGTGKRASEDYAYFRNNRDITDYRSGDWK----ISLP----APGIKGNFWRAST 435 Query: 498 QTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 + +FNL D E ++ ++ + ++ Y + PP Sbjct: 436 AEHDTLLFNLREDIGERYNLYRKYPGKAKEMLQKLQEYTRNFGEIPP 482 >UniRef50_B9XCM3 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XCM3_9BACT Length = 565 Score = 179 bits (455), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 146/476 (30%), Positives = 220/476 (46%), Gaps = 64/476 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KKPN++ + DDVGW ++G G+ G TP++D +ASQG+ T Y++ S + RA Sbjct: 34 KKPNILFIMGDDVGWFNIGAYHQGIMSGK-TPNLDKLASQGMRFTDYYAEASCTAGRANF 92 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +TG+ + G+ GQ G G+ TL L QGY T GK H+G+ + P Sbjct: 93 ITGEIPLRTGLTT---VGQAGADVGIPDKACTLATALKAQGYATGQFGKNHLGDLNKYLP 149 Query: 200 QNVGFDDFRGF----NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV-RG 254 GFD+F G+ +++SD Y W V+ + +DD + R Sbjct: 150 TLHGFDEFFGYLYHLDALSDPY--WYSFPVDEAYYNKFGPRSVVHCWATDQDDTTEMPRW 207 Query: 255 GE--QQAIADITP---------------------KY-MEDLDQRWMDYGVKFLDKMAKSD 290 G+ +Q + D P KY M D+ + + F+DK K Sbjct: 208 GKVGKQKVVDEGPLPPFPDMSNVPNMHDLPFLKAKYDMTTFDEVLVKSSIDFMDKAKKDG 267 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYG---DCMVEMNDVFANLYKTLEKNGQLD 347 KPFF+++ + H + KY+ +++++G M +++D L K L+ G+ D Sbjct: 268 KPFFVWHNSTRMHVWTFLAKKYSAMQNSKSNFGLEEAGMAQLDDNVGALLKHLDDMGEAD 327 Query: 348 NTLIVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVD 405 NT++VFT+DNG E P G TPF+ KG+ EGG RVP W G I+P +GI Sbjct: 328 NTIVVFTTDNGAEVFTWPDGGMTPFKATKGTVGEGGFRVPCIARWPGHIKPGTVENGIFS 387 Query: 406 LADLFPTALDLAGH--------PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY 457 D FPT AG+ G K + K +DG +Q + L G S R Y Sbjct: 388 GLDWFPTLCAAAGNTDITDQLLKGVKFGDREYK-NHLDGYNQMA-LLEDKGPSARHELFY 445 Query: 458 FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 F L AVR+D+FK+ QQP+ G+ G + T ++ N+ DP E Sbjct: 446 FGGPHLGAVRLDDFKFQ-FYQQPW---------GWPGEKVTTDMPTLVNIRQDPFE 491 >UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R206_9PLAN Length = 504 Score = 179 bits (453), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 138/430 (32%), Positives = 200/430 (46%), Gaps = 64/430 (14%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PNV++ +DD+G+ D+G G + NPTP + +A++G+ LTS Y+ P SP+RA +L Sbjct: 32 RPNVIIINIDDLGYADIGPFG---SKKNPTPALTKMAAEGMKLTSHYAAPVCSPSRAALL 88 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TG Y +L P P GL T+ +L GY T +GKWH+G+ E P Sbjct: 89 TGCYPKR--VLSIPHVLFPSAGSGLHPDEVTIADMLKASGYKTACLGKWHVGDQAEFLPT 146 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS-PDRSEYIKQ------------------ 241 GFD + G +DM T N L P KQ Sbjct: 147 KQGFDSYYGIPYSNDMGTATDGSKSNFGAPLPMPGAKGKGKQPAQATGELPLGSPTGLTG 206 Query: 242 -----LPFSKDD--VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFF 294 LP ++D V VRG +Q +L + + V F+ + D+PFF Sbjct: 207 NMQPPLPLLENDKVVARVRGEDQV-----------NLTRDYTKRAVNFIRE--NKDQPFF 253 Query: 295 LYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFT 354 LY+ HF YP+ ++ S R + D + E++ + L + + TL++FT Sbjct: 254 LYFAHTAVHFPMYPSKEFRTSD--RGTLDDWVDEVDASVGEVLAALAEMKIDEKTLVIFT 311 Query: 355 SDNGPEAEVPPHG--RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFP 411 SDNG PHG TP +G+KG TWEGG+RVPT W G I+ S I + DL P Sbjct: 312 SDNGGSL---PHGSDNTPLKGSKGLTWEGGIRVPTIARWPGTIKGGTSTSAITGMIDLLP 368 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEF 471 T +A GAK+ +DG++Q GT +S R+ YF +L AVR D + Sbjct: 369 T---IAAATGAKLPE-----RKLDGLNQLPLLNGTAKESPRREFFYFRGLELDAVRRDNW 420 Query: 472 KYHVLIQQPY 481 K H+ + Y Sbjct: 421 KLHLAKGELY 430 >UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2X1_9BACE Length = 494 Score = 178 bits (452), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 149/475 (31%), Positives = 224/475 (47%), Gaps = 85/475 (17%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 ++ L +E + G PNVVV +DD+G+ D+G G A TP+IDA+A G++LT+ Y Sbjct: 57 KRYLEGVELENGDPPNVVVIYVDDMGYGDLGCTG---ATAISTPNIDALAEGGVLLTNYY 113 Query: 132 S-QPSSSPTRATILTGQYSI----------------HHGILMPPMYGQ-PGGLQGLTT-- 171 + P S +RA +LTG+Y I H L+ + G P GL T Sbjct: 114 APAPICSASRAGLLTGRYPIRTLTSGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDE 173 Query: 172 --LPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 LP++L GY T +GKWH+G +E +P N GFD F G +Y++ D H Sbjct: 174 ILLPEVLQQAGYETALVGKWHLGIREEERPYNRGFDLFYG-----ALYSDDNDPH----- 223 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 E + P+ + G + + + ++++D Sbjct: 224 -RIYHNDEVVHDEPYDQ-------SGMTKELTQVAKQFIDD-----------------NQ 258 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 D PFFLYY + H+ + + ++ G+S A YGDCM E++ + TLE+NG L+NT Sbjct: 259 DGPFFLYYASPFPHWPSNASEEWLGTSQAGI-YGDCMQEVDWSVGEIMDTLEENGLLENT 317 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLAD 408 L++FTSDNGP + G+ RG K + + GG VP Y G I + DG++ D Sbjct: 318 LVIFTSDNGPWYDGATGGQ---RGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVD 374 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 +FPT L+L G +P+ IDG+D F GQS+ FLN Sbjct: 375 VFPTILNLLGIE-------LPQDRVIDGMDMWPFL---TGQSDSPRTELFLNKD-----K 419 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 D F LI+ + Y + Y T ++Q G ++NL TDP+E+ + H P Sbjct: 420 DTF---ALIEDNFKYLERSYSENGTYWMLQ-QGPFLYNLDTDPEEAYDV-TTHFP 469 >UniRef50_A9W035 Sulfatase n=6 Tax=Bacteria RepID=A9W035_METEP Length = 564 Score = 178 bits (451), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 142/460 (30%), Positives = 216/460 (46%), Gaps = 49/460 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 G+KPN++V + DD+G ++G G+ G TP ID +A++G++ T Y++ S + RA Sbjct: 55 VGQKPNIIVIMGDDIGIWNIGAYHRGMMAGR-TPHIDQLAAEGMLFTDYYAEASCTAGRA 113 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKES 197 +TG+ I G+ GQ G G+ T+ L GY T GK H+G+ E Sbjct: 114 AFITGELPIRTGMTT---VGQAGAAIGIPAEAVTIATALKGMGYATGQFGKNHLGDKNEF 170 Query: 198 QPQNVGFDDFRGF----NSVSD----MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 P GFD+F G+ +++ D Y + V P + + + DD Sbjct: 171 LPTVHGFDEFFGYLYHLDAMEDPAHPAYPQELLNRVGPRNMVH----SWATNVDDPTDDP 226 Query: 250 HAVRGGEQQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 R G+Q+ I D + PK ME +D D + F+DK + KPFF++ H Sbjct: 227 RWGRVGKQR-IEDAGTLYPKRMETIDDEIRDLALGFIDKAKANGKPFFVWLNPTRMHVTT 285 Query: 307 YPNAKYAGSSPARTSYG---DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA-E 362 + + KY ++ + M +++DV + K L+ G DNT++VFT+DNG E Sbjct: 286 HLSPKYQAMRNSKNGWSIQEAGMAQIDDVVGAVMKKLKDLGVDDNTIVVFTTDNGTEVFT 345 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHP- 420 P G+TPF +KG+ EGG R P V W G + D G++ D FPT + AG+P Sbjct: 346 WPDGGQTPFAQSKGTVMEGGFRAPAMVRWPGKVPAGTVDNGVISGLDWFPTLVAAAGNPD 405 Query: 421 -------GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKY 473 G ++A+ K +DG +Q G G S R YF +L AVR+ ++KY Sbjct: 406 IGEELKKGKQIADQTYK-VHLDGYNQLDLITG-KGPSKRNEVWYFGESELGAVRIGDYKY 463 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 I QP GG+ G + + NL DP E Sbjct: 464 R-FIDQP---------GGWLGDKTKPDVPYITNLRLDPFE 493 >UniRef50_D2QZL2 Sulfatase n=8 Tax=cellular organisms RepID=D2QZL2_9PLAN Length = 529 Score = 177 bits (448), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 152/512 (29%), Positives = 232/512 (45%), Gaps = 68/512 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 K+PN+V+ DDVG ++ GV +G TP ID +A +G++ T Y++ S + RA+ Sbjct: 25 KRPNIVIIWGDDVGQSNISAYSHGV-MGYKTPHIDRLAREGMMFTDYYAEQSCTAGRASF 83 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +TGQ+ + G+ G PG GL T+ +LL GY T GK H+G+ E P Sbjct: 84 ITGQHGLRTGLT---KVGLPGAALGLRKEDPTIAELLKPLGYATGQFGKNHLGDRNEFLP 140 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF--------------- 244 GFD+F G ++Y H+N E P+ ++Y K F Sbjct: 141 TVHGFDEFYG-----NLY------HLNAEE--EPEHADYPKDPAFRAKYGPRGVLDCKAS 187 Query: 245 SKDD-VHAVRGGE--QQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 +DD R G+ +Q I D +T K ME +D V ++ + +K+DKPFF++ Sbjct: 188 DRDDPTVDARFGKVGKQIIKDTGPLTKKRMETIDDDVASRAVDYIQRQSKADKPFFIWVN 247 Query: 299 TRGCHFDNYPNAKYAGSSPARTS-YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 HF + + G S S Y D M++ + + K ++ G DNT +++++DN Sbjct: 248 FTHMHFRTHVKPESKGQSGRWMSEYADAMIDHDKNVGTVLKAIDDAGIADNTFVMYSTDN 307 Query: 358 GPEAEV-PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALD 415 GP P TPFR K S WEG RVP V W I+P S+ IV D PT L Sbjct: 308 GPHMNSWPDAAMTPFRNEKNSNWEGAYRVPCAVRWPNKIKPGSVSNQIVGHHDWLPTLLA 367 Query: 416 LAGH--------PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-KLAAV 466 +AG G K+ ++ K DG + G +S R++ Y + +L + Sbjct: 368 IAGDEQVTDKLLKGYKIGDMTYK-VHPDGYNLVPHLTGQEEKSPRESFLYCNDDQQLVGL 426 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG-------- 518 R D +K V ++Q T + FT + +FNL DP E I Sbjct: 427 RYDNWKL-VFMEQRATGTLRVWSEPFTTLRV----PKIFNLRLDPYERADITSNTYYDWL 481 Query: 519 VRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 + H + VP Q + ++ K+YP R + S Sbjct: 482 IDHAFLLVPAQDYVGKFLLTFKEYPQRQKAAS 513 >UniRef50_B9R4R2 Sulfatase, putative n=2 Tax=Rhodobacteraceae RepID=B9R4R2_9RHOB Length = 555 Score = 176 bits (447), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 144/514 (28%), Positives = 236/514 (45%), Gaps = 78/514 (15%) Query: 66 AQDKETQQKLAELEK-KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQG 124 +QD E ++KL+E+ GK PN++ L+DDV + +G G TP I+ +AS+G Sbjct: 51 SQDVEIEEKLSEIRAANNGKPPNILYILIDDVSFGQMGNRTLNYVTGIDTPSINNLASEG 110 Query: 125 LILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVT 183 L L Y++PS +PTR +LTG++ I G+ + GL T+ ++L ++GY T Sbjct: 111 LSLMRMYTEPSCTPTRTAMLTGRHPIRAGVKEVKVALVGEGLSSEEVTIAEILKEKGYNT 170 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 +GKWH G+ +++ P N GFD W ++ +V LS E ++ Sbjct: 171 AHVGKWHQGDIEQAYPHNQGFD--------------WAAFPLHQQVQLSLMTREAMQSNT 216 Query: 244 ------------FSKDD----------VHAVRGGEQQAIA-----DITPKYMEDLDQRWM 276 F+ D V A GGE + + + T E++++R+ Sbjct: 217 MLGFHASGQTNQFALDQRFKPYGLVTGVEAEAGGEAREVGIAPGEEWTQAKYEEMNERYQ 276 Query: 277 DYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP-NAKYAGSSPARTS--YGDCMVEMNDVF 333 ++ L+++A D PFFL Y + YP N Y + +R + D + ++ Sbjct: 277 RQILEQLERLAAEDAPFFLQY------WPLYPLNFVYPDQAISRNGGFHADKLQLLDTWI 330 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAE---VPPHGRTPFRGAKGSTWEGGVRVPTFVY 390 +L ++ G DNT+++ +DNG + +RG K EGGVR F+ Sbjct: 331 GDLLAKVDALGLRDNTIVMLMADNGLMYHYEGTSGLNQLIYRGGKTQHLEGGVRTDAFIR 390 Query: 391 WKGMIQPRKSDG-IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 W G+I+ + G IV ++DLF T +AG ++L+P+ IDGVDQT+ L G Sbjct: 391 WPGVIEAGSAAGDIVHVSDLFTTFARIAG-----ASDLIPRDRVIDGVDQTALLLNGEGN 445 Query: 450 SNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 S R + + L +V EFK H+ A Q G A + VFN+Y Sbjct: 446 SRRDYVYVYEGTVLRSVVKQEFKMHLP-----APGQPG------------AAAPVFNIYR 488 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 DP+E + + + G Q + + + K+P Sbjct: 489 DPREENPLVGYSLWSGASFQDMVKRHQMTIAKHP 522 >UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG5_9BACT Length = 462 Score = 176 bits (447), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 139/471 (29%), Positives = 219/471 (46%), Gaps = 68/471 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 K PN+V DD+G+ D+ G A TP ID++ QG+ T YS P SP+RA Sbjct: 33 KPPNIVFIFADDLGYNDLSSYG---ATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRAA 89 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +LTG+Y I GI P G+ TT+ +LL + GY T +GKWH+G +++ Sbjct: 90 LLTGRYPIRQGITG---VFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQKHL 146 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P GF + G +DM D V +RG + + Sbjct: 147 PLQNGFHSYFGIPYSNDM------------------------------DMVVYMRGNDVE 176 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 + ++ Y +R+ + V+F+++ D+PFFLY H Y + + G+S Sbjct: 177 SY-EVDQHYTT---RRYTEEAVQFIEQ--NKDQPFFLYLAHSMPHVPIYASENFVGTS-K 229 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFRGAKG 376 R YGD + E++ A + TL+K+ +NTL+VFTSDNGP + G + P R K Sbjct: 230 RGLYGDVIQELDWSVAQILDTLDKHQLSENTLVVFTSDNGPWTALKHLGGSAAPLREGKM 289 Query: 377 STWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANL-VPKTTFI 434 T++GG+RVP V W I + S + ++ D FPT +++ANL PK+ I Sbjct: 290 FTFDGGMRVPCLVRWPAQIPAGQTSHAMANMMDWFPTF--------SRIANLDTPKSRSI 341 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DG+D T G+ +++ + + +G L A R ++K ++ PY G Q Sbjct: 342 DGLDITDVLTGSGPRADNEFFFFHGDGDLRAYRDGDWK----LKLPY----EGNQAARWR 393 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPR 545 + +FNL DP E+ + +H +Q M ++ L + PP Sbjct: 394 QAVAAHPILLFNLAEDPGETTDLAAQHPERLAAMQARMTDFLASLGELPPE 444 >UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED1_PARD8 Length = 459 Score = 175 bits (443), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 136/453 (30%), Positives = 213/453 (47%), Gaps = 67/453 (14%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPT---PDIDAVASQGLILTSAYSQPS-SSPTR 140 KPN V+ DD+G+ D+ GNPT P+ID +A +G+ LT Y S+P+R Sbjct: 30 KPNFVIIFCDDMGYGDLS------CYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSR 83 Query: 141 ATILTGQYSIHHG-------ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 A ++TG+ + +G +L P + G Q T+ ++L GY T +GKWH+G Sbjct: 84 AALMTGRLPVRNGLYGDRVAVLFPN--SKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGA 141 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP--DRSEYIKQLPFSKDDVHA 251 P + GFD + G +DM SP ++ + + P + Sbjct: 142 FSPYLPTDHGFDTYFGIPYSNDM---------------SPVQNKGAHARNFPPTP---LI 183 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 V G + ++ D +L +R+ + V F+ +K +PFFLY+ H Y NA+ Sbjct: 184 VDGKQIESEPD-----QGELTRRYTEKAVSFIKNHSK--EPFFLYFAHTFPHIPLYTNAR 236 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-- 369 + G+S R YGD + E++ + K L +NG +NT ++FTSDNGP +G + Sbjct: 237 FEGTS-KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNGPWLTEHENGGSAG 295 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 P + KG+ WEGG RVP + G I P +D I+ DL+PT L +AG P Sbjct: 296 PLKDGKGTWWEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQ-------P 348 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV-LIQQPYAYTQSGY 488 K +DGV+QT S R +Y+ +L A+R E+KY+ I+ Y T Sbjct: 349 KDLVLDGVNQTGLLF-EEKHSARDEVYYWWGSELMAIRKGEWKYYFKTIKDQYLRTCK-- 405 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 + A ++N+ TD E ++ +H Sbjct: 406 -------IETPAEPLLYNVETDISERFNLADKH 431 >UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG4_9BACT Length = 489 Score = 175 bits (443), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 144/469 (30%), Positives = 221/469 (47%), Gaps = 66/469 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 +KPN++ +L DD+G+ D+G G A G TP ID +A +G +S Y SP+RA Sbjct: 28 QKPNILFYLTDDLGYGDIGCYG---AEGQYTPAIDQLAKEGTKFSSFYVHQRCSPSRAAF 84 Query: 144 LTGQYSIHHGILMPP-MYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +TG Y+ H + +P +Y G GL TLP+L+ GY T +GKWH+GE K Sbjct: 85 MTGSYA--HRVGLPQVIYKHREGPIGLNPSEITLPELMKTAGYNTALVGKWHLGEWKPFH 142 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P N G+D F GF V + + + +L +R E + ++ E Q Sbjct: 143 PLNHGYDYFYGFLKV---------IEGSEKPSLIENRKELASK----------IQKTEGQ 183 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 A + + F+ K K+ PFFL Y H +P+ ++ G+S Sbjct: 184 APGMVKA-------------AINFMTKHKKN--PFFLVYSDPMPHAPYFPSEQFKGTS-K 227 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT----PFRGA 374 R +YG+ + E++ F +L L++ G +NT++VFTSDNGP E P R Sbjct: 228 RGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFTSDNGPPVERQKKYDVGLSGPLRDG 287 Query: 375 KGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 K + +EGGVRVP + W G ++ SD ++ + D+ PT +LAG VP Sbjct: 288 KWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDMLPTFCELAGVD-------VPNDRV 340 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 IDGV+ LG + +S E + G A + + +KY+ Q PY + G Sbjct: 341 IDGVNILPQLLG-DQESKALRETQIVPG--ATIIHNGWKYYAKQQNPYNNKKPEDWNG-- 395 Query: 494 GTVMQTAGS-SVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +Q A ++FNL D E+ + +H + L+ M +M LKK Sbjct: 396 ---LQPAKEGALFNLKEDIGETTEVSAQHPEIAESLKKNMAKFMAELKK 441 >UniRef50_B7S1F0 Sulfatase, putative n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S1F0_9GAMM Length = 470 Score = 175 bits (443), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 133/440 (30%), Positives = 220/440 (50%), Gaps = 45/440 (10%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+V+ ++D+ G+ +VG GGG+ G PTP+ID++A++G LT+ + +P+R++++ Sbjct: 31 KPNIVMVVMDNFGYGEVGVYGGGMLRGAPTPNIDSIATEGFQLTNFNVEAECTPSRSSLM 90 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TG+Y I P G G+T TL ++L D GY T GKWH+G+ + P Sbjct: 91 TGRYGIR--TRQRPNDEPRGIWYGITPWEITLAEMLSDAGYATGMFGKWHLGDEEGRYPT 148 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GFD++ G + SD A PD + K V + G++ Sbjct: 149 DQGFDEWYGIPNSSDQ-------------AFWPDSDSFQKDAGVEFTHVMESKRGQKPKK 195 Query: 261 ADITPK-YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--FDNYPNAKYAGSSP 317 D+ + + +D+ D + F+ + AK+ KPFF Y H D +P+ K S Sbjct: 196 KDVYGREKRKTIDREITDRAIDFIKRKAKAGKPFFAYLPYTQTHEPVDAHPDFK---GST 252 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKG 376 S+ D + + + L KT++ G DNT+ +FTSDNG E G T P+RG Sbjct: 253 GNGSFADVLAQTDSYVGELLKTIDNLGFKDNTIFIFTSDNGREGIKRSFGFTGPWRGTMF 312 Query: 377 STWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 + +EG +RVP + + I +K S+ IV L D+FPT L+G +P+ +D Sbjct: 313 APYEGSLRVPFLIRYPDKIPAKKVSNDIVHLIDIFPTIAKLSG-------GEIPQDRILD 365 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 GVDQT F G + +S R++ ++ +L V+ +K +L+++ + Y Sbjct: 366 GVDQTDFLTGKSEKSARESVIIYIGNELFGVKWRNWK--MLLKE---IDEDSY------A 414 Query: 496 VMQTAGSSVFNLYTDPQESD 515 + A S++NL DP+E + Sbjct: 415 IQTMAYPSIYNLIVDPKEEE 434 >UniRef50_B8KTJ7 Arylsulfatase F n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KTJ7_9GAMM Length = 473 Score = 173 bits (439), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 123/438 (28%), Positives = 203/438 (46%), Gaps = 39/438 (8%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 + PN+V+ L+D+ GW +VG GGG G PTP I ++A +GL LT+ +P +P+R+++ Sbjct: 32 QHPNIVLVLMDNFGWGEVGAYGGGALRGAPTPHIYSLAEEGLRLTNFNVEPECTPSRSSL 91 Query: 144 LTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +TG+Y+ + Y G + T+ ++L + Y T GKWH+G+ + P Sbjct: 92 MTGRYAARTRLRTDGTYRSVWYGITKWEVTIAEMLTETEYATGWFGKWHLGDTEGRYPTG 151 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQ-LPFSKDD--VHAVRGGEQQ 258 GFD++ G SD A PD ++Y + P ++ + + + RG + + Sbjct: 152 QGFDEWYGIPRSSDR-------------AFWPDSTQYDGEGFPGARFNYVMESTRGEKPK 198 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 +A +D+ D + F+ + A + KPFF H P+ Y G + Sbjct: 199 ELAVYDRAKRRLIDREITDKTIDFIQRKAAAKKPFFTLVSYTQTHEPVEPHPDYRGRT-G 257 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGS 377 + D + + +D +L T++ ++TL +FT+DNG E G T P+RG S Sbjct: 258 HGDFADVLAQTDDYVGDLLDTIDALDIAEDTLFIFTADNGREGIPGSWGFTGPWRGGMFS 317 Query: 378 TWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 WEG +RVP + W G I S+ IV L DL PT A + +P +DG Sbjct: 318 PWEGSLRVPFLIRWPGKIPSGTVSNDIVHLVDLMPTF-------AAATHSELPDDRILDG 370 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 +DQ FFLG S R++ ++ +L + +K +Y + Sbjct: 371 LDQLPFFLGETENSPRESVMVYVGNELFGAKWRNWKILFKDMDTDSY-----------AI 419 Query: 497 MQTAGSSVFNLYTDPQES 514 A S++NL DP+E Sbjct: 420 RDLAYPSIYNLIVDPKEE 437 >UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D764_9BACT Length = 499 Score = 172 bits (437), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 138/451 (30%), Positives = 205/451 (45%), Gaps = 41/451 (9%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN ++ +DD+G+ D+ G + N TP++D +A +G LT Y P SP+R+ ++ Sbjct: 23 KPNFIIINIDDMGYADIAPFGSKL---NRTPNLDRMAQEGRKLTCFYGAPVCSPSRSALM 79 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TG Y +L P PG GL T+ +LL GY T IGKWH+G+ E P Sbjct: 80 TGCYPKR--VLPIPSVLFPGAAVGLNPAEHTVAELLKKSGYATGCIGKWHLGDQPEFLPP 137 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVN-----PEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 GFD + G +DM + P+ +P+ S I + + + Sbjct: 138 RRGFDYYLGLPYSNDMGPGEDGSKSSLGDPIPKPKATPNPSAPIPETGITGNQPPLPMLE 197 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 ++ IA + + L R+ VKF+ + DKPFFLY HF YP ++AG Sbjct: 198 NEKVIARVRQDEQQGLVDRYTKAAVKFITE--HKDKPFFLYLPHNAVHFPIYPGKEWAGK 255 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 SP Y D + +++ + TL + D+T ++FTSDNG P P RG K Sbjct: 256 SP-NGYYSDWVEQVDWSVGQVLNTLRELKLQDHTFVLFTSDNG---GTPRAVNAPLRGFK 311 Query: 376 GSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 +TWEGG+R PT +W G I SD I + D+ PT ++LAG VP I Sbjct: 312 TTTWEGGMREPTIAWWPGKIPGGTSSDEITGMFDILPTLVNLAG-------GEVPTDHKI 364 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNG-KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 DG + G G + Y+ NG +L VR +K + +G G Sbjct: 365 DGGNIWPVLAGEAGAKSPHEVFYYFNGLRLEGVRTGPWKLR--------FGSAGLAEG-K 415 Query: 494 GTVMQTAG---SSVFNLYTDPQESDSIGVRH 521 G V + A ++NL TD E+ ++ H Sbjct: 416 GPVKKPAAPIPDQLYNLQTDIGETTNVADAH 446 >UniRef50_C7RSC1 Sulfatase n=2 Tax=Bacteria RepID=C7RSC1_9PROT Length = 574 Score = 172 bits (435), Expect = 3e-41, Method: Compositional matrix adjust. Identities = 146/488 (29%), Positives = 227/488 (46%), Gaps = 56/488 (11%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A L + GK+PN++ + DD+GWM G+ VG TP+ID + +G + Y++ S Sbjct: 24 AALAQAPGKRPNILFIMGDDIGWMQPSIYHQGLMVGE-TPNIDRIGQEGAKFMTYYAEQS 82 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGEN 194 + R TG + G++ P + G P LQ G +L + L D GY T GK H+G++ Sbjct: 83 CTAGRTAFFTGMTPLRAGMIPPQLPGSPSFLQPGTPSLAKFLLDLGYTTGEFGKNHLGDH 142 Query: 195 KESQPQNVGFDDFRGFNSVSDMY--TEWRDVHVNPEV--ALSPDRSEYIK---QLPFSKD 247 + P GF +F G+ D + D++ +P V + P ++ ++ ++P + D Sbjct: 143 SAALPTAHGFQEFWGYLYHLDAMQGVSFPDINSSPTVQAIVPPCKNTPVRGLAEVPGAVD 202 Query: 248 --------------DVHAVRGGEQ-QAIAD---ITPKYMEDLDQRWMDYGVKFLDKM--A 287 + G E+ Q D +T K E +D+ + FLD+ Sbjct: 203 PKTTLCMTPPRPVLACTSSDGTEKNQTCKDEGPLTLKRSETVDEEISAKVIDFLDRNDPK 262 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAG--SSPARTSYGD---CMVEMNDVFANLYKTLEK 342 K++KPFF++Y H + KY + +G M +M+D + K LE Sbjct: 263 KTNKPFFVWYNPARMHITTMLSDKYMAMVGTKGGKDWGTNEAAMKQMDDNIGYVLKKLED 322 Query: 343 NGQLDNTLIVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS- 400 GQLDNT++VFT+DNG E P G TPF+G K +TWEGG+R P + W G I+P Sbjct: 323 MGQLDNTIVVFTTDNGAEVITYPDGGNTPFKGGKLTTWEGGMRAPAVIRWPGHIKPGTVL 382 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKT----------TFIDGVDQTSFFLGTNGQS 450 + I D PT +++AG GAK +L + T ++GV+Q + G + S Sbjct: 383 NDIFASYDWMPTFVEIAG--GAKGNDLNKQIMAGKYPGIVKTKLNGVNQLDYLTGKSATS 440 Query: 451 NRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD 510 R A Y+ +AVR +K + +A GG G V V N+ D Sbjct: 441 ARDAFFYYGGPVPSAVRYKNWKIY------FAMASEANTGGLMG-VHTFHWPLVANIRRD 493 Query: 511 PQESDSIG 518 P E S+G Sbjct: 494 PFEG-SVG 500 >UniRef50_Q01Z68 Sulfatase n=4 Tax=Bacteria RepID=Q01Z68_SOLUE Length = 560 Score = 171 bits (433), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 136/442 (30%), Positives = 204/442 (46%), Gaps = 48/442 (10%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 +KPN++ + DD+GWM G+ VG TP+ID + ++G I + S + R Sbjct: 19 QKPNILFIMGDDIGWMQPSIYHRGLMVGE-TPNIDRIGNEGAIFMDYVAMQSCTSGRNAF 77 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 TG Y + G++ P + G P L+ G L LHD GY T GK H+G++ E+ P Sbjct: 78 FTGMYPLRTGMIPPQLPGSPSYLRPGTPALAVFLHDLGYTTGEFGKNHLGDHTEALPTAH 137 Query: 203 GFDDFRGFNSVSDMY--TEWRDVHVNP--EVALSPDRSEYI---KQLPFSKDD------- 248 GF ++ G+ D + D++ P +V P ++ I ++P + D Sbjct: 138 GFQEYWGYLYHLDAMQGVSFPDINSTPTQQVIAPPCKNTPIPGLSEVPGAVDPKTTTCLT 197 Query: 249 -------VHAVRGGEQ-QAIADITPKYME---DLDQRWMDYGVKFLDKM--AKSDKPFFL 295 H+ G E+ Q D P ++ +D+ V FLD+ K++KPFF+ Sbjct: 198 PPRPVLWCHSSDGTEKNQTCKDQGPLTLDRSRTVDEEISAKVVDFLDRNDPRKTNKPFFV 257 Query: 296 YYGTRGCHFDNYPNAKYAGSSPAR--TSYG---DCMVEMNDVFANLYKTLEKNGQLDNTL 350 +Y H + KY R +G M +M+D + LE+ GQLDNT+ Sbjct: 258 WYNPARMHVTTMLSPKYEAMLGTRGGKDWGVNEAGMKQMDDNIGVVLAKLEQMGQLDNTI 317 Query: 351 IVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLAD 408 +VFT+DNG E P G TPF+G KG WEGG R P + W G I+P + D Sbjct: 318 VVFTTDNGAETISFPDGGITPFKGQKGEAWEGGYRAPCVIRWPGHIKPGTVYKELFAALD 377 Query: 409 LFPTALDLAGHP----------GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 PT ++ AG P K +V KTT +DGV+Q+ + G S R YF Sbjct: 378 WLPTFVEFAGGPKGDALKQQIEAGKYPGIV-KTT-LDGVNQSDYLQGKCDTSARDYFFYF 435 Query: 459 LNGKLAAVRMDEFKYHVLIQQP 480 +AVR +K + + QP Sbjct: 436 SGATPSAVRYKNWKMYYTMSQP 457 >UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000586CBD Length = 525 Score = 168 bits (425), Expect = 5e-40, Method: Compositional matrix adjust. Identities = 128/457 (28%), Positives = 212/457 (46%), Gaps = 43/457 (9%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRA 141 K+PN+++F DD+G+ D+ G + TP++ +A+ G++LT YS P SP+RA Sbjct: 22 AKRPNIIIFYADDLGYGDLEPYGHPTSS---TPNLGRLAAGGIVLTQFYSSSPVCSPSRA 78 Query: 142 TILTGQYSIHHGILMPPMYG--QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG--ENKE 196 +LTG+Y + G+ P ++ GGL T + ++L +GY + A+GKWH+G N Sbjct: 79 ALLTGRYQMRSGV-YPHVFNVEMSGGLPLNETLISKMLKPEGYRSAAVGKWHLGLGNNSV 137 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P N GFD+F G + + N +P EY F+ + E Sbjct: 138 YLPHNHGFDEFLGLPASPSQCRCSVCFYPNVTCHRAPCSPEYSPCALFNGTTII-----E 192 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 Q P + LD ++ +F+ ++ PFFLYY + H Y + +G+S Sbjct: 193 Q-------PADLLTLDDKYAMQSRRFIRTNVETGTPFFLYYASHHTHHPQYAGKETSGTS 245 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGA 374 R +GD + ++ +Y+ L++NG L++T F+SDNGP + G + Sbjct: 246 -IRGRFGDSLAALDWEVGQIYEELKENGILEDTFFFFSSDNGPSLSLENFGGNAGLMKCG 304 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 K +T+EGG+RVP V+W G I P +S + D+ PT +A AK+ N+ + Sbjct: 305 KATTYEGGIRVPAIVHWPGQITPGRSMELSSTLDVLPT---IASITNAKLPNVT-----L 356 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKL------AAVRMDEFKYHVLIQQPYAYTQSGY 488 DG D + F G + + ++ K+ AVR ++K + Sbjct: 357 DGYDMSPFLF--QGMPSLRESFFYYPSKVDTEHKSYAVRYKQYKAVFYTEGSALSNNKNK 414 Query: 489 QGGFTGTVMQTAGSS--VFNLYTDPQESDSIGVRHIP 523 GT ++T +F+L DP E +I + H P Sbjct: 415 DVDCRGTSLRTYHDPPMLFDLEQDPSEQYNISINHSP 451 >UniRef50_B4AUP3 Sulfatase n=2 Tax=Bacteria RepID=B4AUP3_9CHRO Length = 570 Score = 167 bits (424), Expect = 7e-40, Method: Compositional matrix adjust. Identities = 146/521 (28%), Positives = 238/521 (45%), Gaps = 75/521 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KKPN++V + DDVGW ++ G+ +G TP+ID +AS+G++ T Y++ S + RA Sbjct: 43 KKPNILVIMGDDVGWFNISAYNRGM-MGYKTPNIDRIASEGMLFTDVYAEQSCTAGRAAF 101 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 +TGQ G+ + G P GL G TL +LL GY T GK H+G+ E P Sbjct: 102 ITGQSPGRTGMTKVGLPGVPIGLSGEDPTLAELLKPLGYATGQFGKNHLGDLDEFLPTVH 161 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVA-LSPD--RSEYIKQLPFSKDDVHAVR------ 253 GFD+F G ++Y H+N E +PD ++E KQ + +H+ Sbjct: 162 GFDEFYG-----NLY------HLNAEEEPENPDYPKNEIFKQKLGPRGVLHSYSLDYVTQ 210 Query: 254 --------------------GGEQQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMAKSD 290 G Q I + +T + M+ +D ++D ++F++K + Sbjct: 211 ENPEITCPEENLSKYEDENIPGLGQVICNTGPLTIERMKTVDDEFLDASLEFINKTQQEG 270 Query: 291 KPFFLYYGTRGCH-FDNYPNAKYAGSSPARTS-YGDCMVEMNDVFANLYKTLEKNGQLDN 348 KPFF+++ T H F + + Y YG+ M E + L L++ G D+ Sbjct: 271 KPFFVWFNTTRMHVFTHLKDDSYNPDLEKYDDIYGEGMEEHDQDVGILLDYLDEQGLTDD 330 Query: 349 TLIVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDL 406 T++++T+DNG E P G TPF G K + WEGG RVP + W G I+ + S+ I+ Sbjct: 331 TIVIYTTDNGAEVFSWPDGGTTPFHGEKNTNWEGGFRVPAMIRWPGYIEAGQISNEIISH 390 Query: 407 ADLFPTALDLAGHPGAKVANLVP-----------KTTFIDGVDQTSFFLGTNGQSNRKAE 455 D PT L AG P L K +DG + + S R+ Sbjct: 391 QDWLPTLLAAAGAPDDIAEQLKSEDGYNAGIKTFKKIHLDGYNLLPYLTDQEYHSPRRWF 450 Query: 456 HYFLNGKL-AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD---- 510 Y + +A+R+D++K V+ + A G++ ++ + + NL D Sbjct: 451 VYLTDDAYPSAIRVDDWK--VIFSEQRA---EGFE-VWSEPYVNLRVPMILNLRRDPFEK 504 Query: 511 -PQESDSI---GVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 P+ES++ RH + P Q +++ ++YPPR + Sbjct: 505 APEESNNYIDWRFRHTFVIAPAQIVAQEFLDTFREYPPRQK 545 >UniRef50_Q1VDY3 Probable sulfatase n=2 Tax=Vibrio alginolyticus RepID=Q1VDY3_VIBAL Length = 483 Score = 167 bits (424), Expect = 8e-40, Method: Compositional matrix adjust. Identities = 122/449 (27%), Positives = 209/449 (46%), Gaps = 59/449 (13%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 LE + PNVVV L+D++GW ++ + G G T ++D +A +G+ LT+ +P + Sbjct: 20 LEVVAEETPNVVVMLVDNLGWGEL--SSYGSTRGVETKNLDQLAREGVRLTNFNVEPQCT 77 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKE 196 PTR++ +TG+ ++ G ++G P G+ T+ + +QGY T GKWH+G+ K Sbjct: 78 PTRSSFMTGRRALRSGT-DKVVWGVPYGMVNWEITIAEKFKEQGYNTSLYGKWHLGDQKG 136 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS----KDDVHAV 252 P + GFD++ G +A + D SEY Q + K + + Sbjct: 137 RFPTDQGFDEWYG-------------------IANTTDESEYSSQPGYKAILPKPQILSA 177 Query: 253 RGGEQ-QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 R G+ + + + +D +++ F+++ K +KPFF H P+ Sbjct: 178 RAGQDPKGVKEYNLDSRRTIDSELVEHATDFINRNVKENKPFFSVITFTQPHLPTLPHPD 237 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-P 370 + G + + +Y D + E++ + +EK G DNTL+++ SDNGPE +P G + P Sbjct: 238 FIGKT-GKGNYSDVLAEIDFRAGQVIGAIEKAGIKDNTLVIWFSDNGPEWHMPYQGSSGP 296 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 +RGA + EG +R P W I+P R SD I+ + DLF + + G+ +P Sbjct: 297 WRGAYFTALEGSLRTPFIASWPNHIKPGRVSDEIIHVVDLFASLSHVGGYK-------LP 349 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 ID +DQ +F G + +SNR + A + + +K H + Q Sbjct: 350 SDRTIDSIDQWAFLKGDSEKSNRDGFIVNNGSETYAYKWENYKMHFIDQD---------- 399 Query: 490 GGFTGTVMQTAGS-----SVFNLYTDPQE 513 +M G ++NL DP+E Sbjct: 400 ------IMPEKGRPLQIPEIYNLIDDPKE 422 >UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteria RepID=A4CGL5_9FLAO Length = 526 Score = 167 bits (423), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 147/471 (31%), Positives = 218/471 (46%), Gaps = 63/471 (13%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 +ET + + + PN+V+ DD G+ DVG G A PTP++DA+A+ GL+LT Sbjct: 57 RETVKSEFAAADRADRPPNIVIIFTDDQGYSDVGVYG---ARDIPTPNLDAMAADGLLLT 113 Query: 129 SAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAI 186 + Y+ QP S +RA +LTG Y GI M P GL TL +LL QGY T Sbjct: 114 NFYAAQPVCSASRAGLLTGCYPNRVGIHNALMPNSPVGLNPAEETLAELLRQQGYRTGIF 173 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVH--VNPEVALSPDRSEYIKQLPF 244 GKWH+G++ + P GFD+F G +DM+ +H P P LP Sbjct: 174 GKWHLGDHPDFLPTRHGFDEFFGIPYSNDMWP----LHPLQGPVFDFGP--------LPL 221 Query: 245 SKDDVHAVRGGEQQAIADITPKYMED---LDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 EQ+ + D +ED L ++ + V F+++ ++PFFLY Sbjct: 222 Y----------EQERVVDT----LEDQRLLTRQITERSVDFINR--HKEEPFFLYVPHPQ 265 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + + G S R YGD ++E++ + LE NG D+T ++FTSDNGP Sbjct: 266 PHVPLFVSDAFRGKS-GRGLYGDVIMEIDWSVGQVLGALEDNGLTDDTWVIFTSDNGPWL 324 Query: 362 EVPPH-GRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS--DGIVDLADLFPTALDLA 417 H GR P R KG+ WEGGVR P + + G + PR D + DL PT + Sbjct: 325 AYGNHSGRAEPLREGKGTNWEGGVREPCIMKFPGRL-PRGKVLDEPLMAIDLLPTIASVT 383 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH-YFLNGKLAAVRMDEFKYHVL 476 G P IDG + G + + A + Y+ +L AVR ++K Sbjct: 384 GSPQPGRE--------IDGKNAWGLLSGAEARGPQDAYYFYYRVNELQAVRDGDWK---- 431 Query: 477 IQQPYAY-TQSGYQGGFTGT-----VMQTAGSSVFNLYTDPQESDSIGVRH 521 + P+ Y T G + G G + ++NL DP E++++ RH Sbjct: 432 LVLPHNYRTMQGQEPGADGLPGAYDYVDVTAPELYNLREDPGETNNLAERH 482 >UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN Length = 499 Score = 166 bits (419), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 138/461 (29%), Positives = 208/461 (45%), Gaps = 52/461 (11%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS- 129 T++ A+ K ++PN+V+ DD+G+ D+G G A G TP+++ +AS+G+ T Sbjct: 39 TEESAADAASK--RRPNIVLIFCDDLGYADIGCFG---AKGYETPNLNKLASEGMKFTDF 93 Query: 130 AYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGK 188 + S +RA +LTG Y GIL G+ + + +LL + GY T GK Sbjct: 94 QVAAAVCSASRAALLTGCYPQRVGILSALGPSDSIGIAKNELLISELLQNLGYKTACFGK 153 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G +++ PQ GF + G +DM+ + H + A P LP Sbjct: 154 WHLGHHEQFLPQQNGFATYFGLPYSNDMWPK----HPTAKNAYPP--------LPLI--- 198 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 + ++ P + L + + VKF+ +KPFFLY H + Sbjct: 199 -------DGNKTIELNPDQTK-LTTWYTEKAVKFIHDCG--EKPFFLYVPHNMPHVPLFV 248 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 + K+AG + R +GD + E++ + K LE G +DNTL++FTSDNGP H Sbjct: 249 SEKFAGKT-KRGLFGDVIAEIDWSVGEITKALEATGNVDNTLVIFTSDNGPWLSYGDHAG 307 Query: 369 TP--FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVA 425 + FR KG+ WEGG RVP + G IQP D + DLFPT G A Sbjct: 308 STGGFREGKGTVWEGGHRVPMIAKYPGTIQPGTTCDKLASTIDLFPTIAHYCG------A 361 Query: 426 NLVPKTTFIDGVDQTSFFLGTNG-QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYT 484 + P IDGV G +S+ + +Y+ L AVR + FK H P+A+ Sbjct: 362 TIDPSRK-IDGVSIQPLLESVEGAKSSHEFFYYYWGNGLEAVRDERFKLHF----PHAFR 416 Query: 485 Q----SGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 G G G ++F+L DP E +I H Sbjct: 417 SLTGTPGTDGMPNGYTQAKTELALFDLDADPFEQTNIAADH 457 >UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A2W0_9PLAN Length = 477 Score = 165 bits (418), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 120/406 (29%), Positives = 188/406 (46%), Gaps = 31/406 (7%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 + ++ KPN V+ +DD+G+ D+ G V N TP+++A+A +G+ LT Y+ P Sbjct: 21 SSCAQEVATKPNFVIINIDDLGYADIEPFGSEV---NRTPNLNAMADEGMKLTCFYAAPV 77 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHM 191 SP+RA ++TG Y L P PG +G++ T+ +L+ +QGY T IGKWH+ Sbjct: 78 CSPSRAALMTGCYPKRA--LTIPHVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHL 135 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G+ + P GFD + G +DM V N + + + LP +++ Sbjct: 136 GDQPDFLPTRQGFDYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVL 195 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 R + K +L + + ++F+ +KPFFLY HF YP Sbjct: 196 QR---------VLAKDQTELVTNYTEEAIQFIRD--HQEKPFFLYLPHSAVHFPMYPGDA 244 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 + G + + Y D + E++ + + L+ G TL++FTSDNG + + P Sbjct: 245 FRGKN-SHGLYNDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFGAVNK-PL 302 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 R K +T+EGG+RVPT V W G + SD +V + D+ PT + LAG P Sbjct: 303 RAGKATTYEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAG-------GTTPT 355 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-KLAAVRMDEFKYHV 475 IDG D G + YF G L AVR +K + Sbjct: 356 DRKIDGADIGPILAGVKEAKSPHDVFYFYRGYDLEAVRSGPWKLRL 401 >UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO Length = 492 Score = 164 bits (416), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 140/478 (29%), Positives = 223/478 (46%), Gaps = 63/478 (13%) Query: 66 AQDKETQQKLAELEKKTG---KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPT---PDIDA 119 Q+ ET +E G +KPN ++ DD+G+ D+ + G+PT ++D Sbjct: 27 CQNTETSPGDSEGTAAAGGIPEKPNFIIVFADDLGYGDLS------SFGHPTIHTKNLDR 80 Query: 120 VASQGLILTSAYSQPS-SSPTRATILTGQYSIHHG-------ILMPPMY-GQPGGLQGLT 170 +A++G T+ Y S +P+RA +LTG+ + +G + P + G P Sbjct: 81 MAAEGQKWTNFYVAASVCTPSRAGLLTGRLPVRNGLTSNEIGVFFPDSHNGMPASE---I 137 Query: 171 TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDM-----YTEWRDVHV 225 TL + L GY T +GKWH+G +E P N GFDD+ G +DM +T ++D Sbjct: 138 TLAEQLKKAGYATGMVGKWHLGHKEEYLPPNHGFDDYFGIPYSNDMDFTGQFTSYQDY-- 195 Query: 226 NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 +R E +K + +V +RG E+ P + +R+ D VK++ + Sbjct: 196 ---FGRYTERYESLKTEEY---NVPLIRGTEEIE----RPVNQNTITKRYNDEAVKWIRE 245 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 D+PFF+Y H + + ++ G+S AR YGD + E++ + + LE G Sbjct: 246 --HKDEPFFMYLAHSLPHVPLFTSDEFRGTS-ARGLYGDVVEEIDHGVGQIMELLEAEGL 302 Query: 346 LDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI 403 +NT++VFTSDNGP G + R KG+TWEGG+R PT + GM+ + + Sbjct: 303 AENTIVVFTSDNGPWLPTGISGGSAGLLREGKGTTWEGGMREPTIFWAPGMLPAKVVMDM 362 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL 463 DLF T LAG P +P +DGVD + G + +S RK Y+ L Sbjct: 363 GSTLDLFNTFSSLAGVP-------MPDDREMDGVDLSPILFG-DAESPRKEMFYYQGADL 414 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 AVR+ +K H YT+ Y G ++ ++N+ DP E + +H Sbjct: 415 YAVRLGAYKAHF-------YTKEAYVMG--AERVEHNPPLLYNVEEDPSEKYDLSGKH 463 >UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BZE9_HYPNA Length = 459 Score = 164 bits (416), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 141/454 (31%), Positives = 209/454 (46%), Gaps = 71/454 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 +E K PN+++ + DD+GW D+ NG + TP+ID + +G+ LT Y+ + Sbjct: 29 SETAPAAAKPPNIIIIMADDLGWGDISLNGAALI---ETPNIDRIGQEGIQLTDFYAGSN 85 Query: 136 -SSPTRATILTGQYSIHHG---ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 SP+RA +LTG+Y I G ++ P + Q G T+ ++L + GY T +GKWH+ Sbjct: 86 VCSPSRAALLTGRYPIRSGMQHVIFP--HSQDGLPAEEITISEMLKNAGYRTGMVGKWHL 143 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP-DRSEYIKQLPFSKDDVH 250 G +E P N GFD F G +DM D++ E+ SP D+S+ L ++K Sbjct: 144 GHQEEYWPTNQGFDWFYGVPYSNDMAP--FDLYRGKEIIESPADQSQL--SLNYAK---- 195 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 +++ED SDKPFFLYY H + Sbjct: 196 ------------AAKEFIED-----------------SSDKPFFLYYAETFPHIPLFVPE 226 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP 370 +G+S A YGD + ++ + TL++ G D+TLI+FTSDNGP E Sbjct: 227 DRSGTSDAGL-YGDVVETVDAGIGIVLDTLDEAGVADDTLIIFTSDNGPWFE---GSAGE 282 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 FRG KG T EGG RVP W G I + S + DL PTA L+G +P Sbjct: 283 FRGRKGETHEGGFRVPFLARWPGHIPKGSVSHEMAMNIDLLPTAASLSG-------ATLP 335 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 IDG D TS T G +F +G V + ++ +++ Y ++ Sbjct: 336 ADRVIDGKDLTSLL--TAGAPTPHDILFFFDGN-EIVGARDARFRLVLNTFYRTMSVPFE 392 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 + GT + +F+L DPQES S +R P Sbjct: 393 --YFGTAL------LFDLEKDPQESFSF-MREYP 417 >UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ11_9BACT Length = 462 Score = 164 bits (416), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 144/457 (31%), Positives = 213/457 (46%), Gaps = 63/457 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 KPNV++ L DD G+ D+ G +P ID +A +GL LTS Y + P S +RA + Sbjct: 22 KPNVIIILTDDQGYNDLSCYGSKTI---KSPRIDQLAEEGLKLTSYYVASPVCSASRAAL 78 Query: 144 LTGQYSIHHGILMPPMYGQPG------GLQGL----TTLPQLLHDQGYVTQAIGKWHMGE 193 LTG+Y P + G PG G +GL T+ +LL GY T+A+GKWH+G+ Sbjct: 79 LTGRY--------PKLVGVPGVFFPNRGHKGLDPKHQTIAKLLKSVGYATKAVGKWHLGD 130 Query: 194 NKESQPQNVGFDDFRGFNSVSDM-------YTE---WRDVHVNPEVALSPDRSEYIKQLP 243 E P N GFD + G +DM Y+E +R+ V+ E + IK + Sbjct: 131 ELEFLPTNQGFDSYYGIPYSNDMTPAFSMKYSENCLYRE-GVDQEALKKAFEANKIKPVG 189 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 KD V +R E + P + +R+ D +KF+D+ S+KPFFLY H Sbjct: 190 M-KDKVPLMRNDECIEM----PADQSTITKRFTDESIKFIDESTASNKPFFLYLAHSMPH 244 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 Y + + G S A YGD + E++ + L + +NTL ++TSDNGP Sbjct: 245 TPLYVSKDFEGKS-AGGIYGDVIEEIDYNVGRIIDHLNEKNIAENTLFIYTSDNGPWLIK 303 Query: 364 PPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA-DLFPTALDLAGHP 420 HG + P K +++EGG RVP + W I + L+ D+FPT LA Sbjct: 304 KSHGGSALPLFEGKMTSFEGGQRVPAIIRWPAKIPKDSVSNEMTLSMDIFPT---LAKIT 360 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 GAK + I+G + + +N K +H + AVR +KYH QQ Sbjct: 361 GAKAQD----ADLINGKNALELY---EDPANFKTKHDYFFYSPRAVRHKNWKYH---QQE 410 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 +S T +T G S+++L D ES ++ Sbjct: 411 TFKLKS--------TARKTKGPSLYDLSKDIGESKNL 439 >UniRef50_P15289 Arylsulfatase A component C n=34 Tax=Euteleostomi RepID=ARSA_HUMAN Length = 507 Score = 163 bits (412), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 119/409 (29%), Positives = 186/409 (45%), Gaps = 40/409 (9%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 + PN+V+ DD+G+ D+G G + TP++D +A+ GL T Y S +P+RA Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSST---TPNLDQLAAGGLRFTDFYVPVSLCTPSRAA 75 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +LTG+ + G M P P GL T+ ++L +GY+T GKWH+G E Sbjct: 76 LLTGRLPVRMG--MYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGA 133 Query: 199 --PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P + GF F G D P + + +P + Sbjct: 134 FLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLAN--------- 184 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + P ++ L+ R+M + + + D+PFFLYY + H+ + +A S Sbjct: 185 --LSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAERS 242 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGA 374 R +GD ++E++ L + G L+ TL++FT+DNGPE G R Sbjct: 243 -GRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRCG 301 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 KG+T+EGGVR P +W G I P + + DL PT LAG P +P T + Sbjct: 302 KGTTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAP-------LPNVT-L 353 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLN-----GKLAAVRMDEFKYHVLIQ 478 DG D + LGT G+S R++ ++ + + AVR ++K H Q Sbjct: 354 DGFDLSPLLLGT-GKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQ 401 >UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPC8_9BACT Length = 598 Score = 162 bits (410), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 136/411 (33%), Positives = 201/411 (48%), Gaps = 37/411 (9%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYSQPS-SS 137 T KKPN +V DD G+ D+G G+P TP+ID +A +G T+ YS + S Sbjct: 20 TDKKPNFIVIFTDDQGYQDLG------CFGSPKIKTPEIDQMAKEGARYTNFYSANAICS 73 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGE 193 +RA +LTG+Y +G+ +Y PG QGL T+ ++L GY T IGKWH+G+ Sbjct: 74 ASRAALLTGRYPSRNGVFH--VY-YPGASQGLKPSEITIAEVLKTAGYRTSIIGKWHLGD 130 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS-EYIKQLPFSKDDVHAV 252 + P N GFD + G +DM+ +D+ + ++ L + E IK SK Sbjct: 131 RNQFLPTNQGFDSYFGIPFSNDMWMS-KDLALADDIKLFGGVTVEQIKSGEASKAVKGEK 189 Query: 253 RGGEQQAIADIT----PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 RGG+ + D P + QR+ D +K + + K +P+F+Y H Y Sbjct: 190 RGGKVPLMRDEEVVEYPVDQTYITQRYTDEALKIIKESEKKKQPYFIYLAYAMPHVPLYA 249 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 + K+AG S AR YGD + EM+ + K L+ +G NTL++FTSDNGP G Sbjct: 250 SPKFAGKS-ARGPYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFTSDNGPWNLGERGGS 308 Query: 369 T-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG--IVDLADLFPTALDLAGHPGAKVA 425 P RGAK ST+EGG RVP ++W G I P +D I D PT AK+A Sbjct: 309 ALPLRGAKFSTYEGGHRVPCVMWWPGTI-PAGTDSAEIATTLDFMPTF--------AKLA 359 Query: 426 NLVPKTTFIDGVDQTSFFL-GTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 N +DG + G G+S + +++ + A+R+ K + Sbjct: 360 NAQLPNRTLDGKNIAPMLRDGNKGKSPYEKFYFWSKNHIEALRIGNMKLRM 410 >UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM Length = 479 Score = 162 bits (410), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 133/451 (29%), Positives = 213/451 (47%), Gaps = 61/451 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPT---PDIDAVASQGLILTSAYSQPS-SSPT 139 + PNV++ DD+G+ D+G A G+PT P++D +A++G+ T+ Y+ S +P+ Sbjct: 36 QSPNVIIIFADDMGYGDIG------AYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPS 89 Query: 140 RATILTGQYSIHHG-------ILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHM 191 RA +LTG+ + G +L P GGL T+ + L ++ Y T +GKWH+ Sbjct: 90 RAGLLTGRLPVRSGMAHDQIRVLFPT---STGGLPTTEITIAKALKEKDYRTALVGKWHL 146 Query: 192 GENKESQPQNVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 G QP + GFD++ G +++ D+ E YI+ + +KD Sbjct: 147 GHLPGFQPLDHGFDEYFGIPYSNDHDLKKEL----------------SYIQTITHAKDGD 190 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 V + ++I + P + +R+ V F+ K S++PFFLY H + + Sbjct: 191 FNVPLMQNRSIIE-RPANQNTITKRYTQEAVSFIKK--NSNQPFFLYLAHSMPHVPLFAS 247 Query: 310 AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT 369 ++ GSS R YGD + E++ + TL + G +NTL+VFTSDNGP + HG + Sbjct: 248 DQFRGSS-DRGLYGDVIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNGPWLIMGAHGGS 306 Query: 370 P--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANL 427 + KG+++EGG+R P +W I+P + DLFPT + +AG Sbjct: 307 AGLLKSGKGTSYEGGMREPAIFWWPEKIKPAVAHNTASTLDLFPTIMSIAGID------- 359 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 +P DG D + + RK Y+ K+ AVR ++K H + A + Sbjct: 360 MPSDRSYDGYDLSPTMF-EQKSNERKNIFYYHGDKIFAVRQGDWKVHF---KTVANIYTK 415 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 Q T T Q VFNL DP E +G Sbjct: 416 EQKILTHTPPQ-----VFNLLVDPSERFDVG 441 >UniRef50_Q15US6 Sulfatase n=3 Tax=Alteromonadales RepID=Q15US6_PSEA6 Length = 526 Score = 162 bits (410), Expect = 4e-38, Method: Compositional matrix adjust. Identities = 130/448 (29%), Positives = 201/448 (44%), Gaps = 77/448 (17%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS- 136 L+++ +KPNVV+F +DD+G+ D+ NG A+G TP++DA+AS+G+ T A+S S+ Sbjct: 35 LKQQASQKPNVVIFYVDDLGYGDISPNG---AIGVDTPNLDALASKGVNFTDAHSTASTC 91 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG---- 192 +P+R ++LTG++ + P G TLP +L GY T IGKWH+G Sbjct: 92 TPSRYSLLTGEHGFRQNAAILPGDAPALIRPGKATLPSMLQKAGYTTGVIGKWHLGLGEG 151 Query: 193 -----ENKESQPQNVGFDDFRGFNSVSD----MYTEWRDV--------------HV---- 225 ++ + P +GFD + D +Y E +V H Sbjct: 152 SVDWNQDVKPGPLEIGFDYSFLLPATGDRVPTVYLEGHEVVNLESSDPIEVSYDHKVGDR 211 Query: 226 -----NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGV 280 NPE+ ++ + + + ++ GGE+ D E+ + V Sbjct: 212 PTGVDNPELLRMKADLQHSQTIVNGISRIGSMSGGEKALWVD------EEFPDVFSQKAV 265 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 +F+++ K PFFL++ + H PN ++ G S GD + +M+ V + + L Sbjct: 266 EFIERSKKD--PFFLFFSFQDIHVPRLPNERFKGKS-TMGPRGDAIAQMDWVVGRVMQAL 322 Query: 341 EKNGQLDNTLIVFTSDNGPE-------------AEVPPHGRTPFRGAKGSTWEGGVRVPT 387 G DNTL++FTSDNGP E P G PFRG K S +EGG RVP Sbjct: 323 TTQGVADNTLVIFTSDNGPVLDDGYDDMAAEMLGEHLPAG--PFRGGKYSVFEGGTRVPM 380 Query: 388 FVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 VYW G +S ++ D++ + L P A KT ID +D FLG Sbjct: 381 IVYWPGNTTHIRSSALISQVDIYASLAGLVKQPLA-------KTEAIDSLDVMHAFLG-- 431 Query: 448 GQSNRKAEHYFLNGKLA--AVRMDEFKY 473 A Y L + +R +KY Sbjct: 432 --KTNNARTYLLEEAVGTLGLRKHNWKY 457 >UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SRP2_NEMVE Length = 491 Score = 161 bits (408), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 118/357 (33%), Positives = 172/357 (48%), Gaps = 63/357 (17%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 ++ KP+++ L DD+GW DVGF+G + TP+ID +A+ G+IL + Y QP +PTR Sbjct: 20 QSSAKPHLLFVLADDLGWSDVGFHGSKIQ----TPNIDRLAANGVILDNYYVQPVCTPTR 75 Query: 141 ATILTGQYSIH----HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-EN 194 A+++TG+Y IH HGI+ G+P GL LT LPQ L GY T +GKWH+G N Sbjct: 76 ASLMTGKYPIHTGLQHGIIHN---GRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYN 132 Query: 195 KESQPQNVGFDDFRGFNS-VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 ES P GFD F GF S + YT +D +++ +D+ VR Sbjct: 133 WESTPTYRGFDTFYGFYSGAENHYTHVQDHYLD------------------LRDNEEIVR 174 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH---------F 304 A + K E + + P F+Y + H Sbjct: 175 DQNGTYSAHLFTKRAEQ------------IVRAHDPSTPLFMYMAFQNVHSPVQAPKEYI 222 Query: 305 DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 D Y K P R +Y + M+D NL + +K G +NT+++F++DNG VP Sbjct: 223 DRYSFIK----DPLRRTYAAMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNG---GVP 275 Query: 365 PHG--RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 +G P RG K + WEGGVR FV+ + Q K ++ + D +PT + LAG Sbjct: 276 KNGGYDYPLRGRKDTLWEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAG 332 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 160 bits (406), Expect = 9e-38, Method: Compositional matrix adjust. Identities = 143/470 (30%), Positives = 203/470 (43%), Gaps = 68/470 (14%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 +PN++V + DD+G+ D+G GG PTP +D +A+ G+ T+AY S P SP+RA Sbjct: 37 RPNILVIMADDLGYADLGVQGG---CEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGF 93 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 LTG+Y G P G+ L GL T+ LL +GY T IGKWH G +K+ P Sbjct: 94 LTGKYQTRFGHEFNPHVGEEAKL-GLPLEEVTIANLLQTEGYRTALIGKWHQGFSKDHHP 152 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 Q+ GFD+F GF Y ++V A S D RG E + Sbjct: 153 QSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDM---------------IYRGREVEP 197 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--FDNYPNAKY----A 313 + RWM +KP+FLY H + P+ + + Sbjct: 198 QEGYATDLFTNEALRWMS---------GPPNKPWFLYLSYNAVHTPLEIAPHLQKRIPES 248 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT---- 369 PAR Y + ++D + + L ++G + TLI+F SDNG P Sbjct: 249 VKLPARRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLN 308 Query: 370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTALDLAGHPGAKVAN 426 P RG KG T EGG+RVP FV W G + R I+ L DL PT LA + AK Sbjct: 309 HPLRGDKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISL-DLLPTVCQLAANNPAKPQ- 366 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQPYAYTQ 485 P IDGV+ ++LG +S E F G AVR +K P A Sbjct: 367 --PLPQGIDGVNLMPYWLGQ--RSGAPHESLFWRFGPQKAVRAGNWKLVDWRDFP-ASKN 421 Query: 486 SGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 SG++ +++L TD E +++ H + L+T + Sbjct: 422 SGWE--------------LYDLSTDISEKNNLAETHPEIVARLKTSWEKW 457 >UniRef50_B0UGK6 Sulfatase n=18 Tax=Bacteria RepID=B0UGK6_METS4 Length = 569 Score = 160 bits (406), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 150/505 (29%), Positives = 226/505 (44%), Gaps = 64/505 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 +KPN++ + DD G+ D+G GGG G PTP+ID +A G+ S Y+QPS +P RA + Sbjct: 47 QKPNILFIVSDDTGYGDLGPYGGGEGRGMPTPNIDRLAEDGMTFFSFYAQPSCTPGRAAM 106 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 TG+ G+ GQ GGL TL +L GY T GKWH+GE + P Sbjct: 107 QTGRIPNRSGMTTVAFQGQGGGLPAAEWTLGSVLKQGGYKTYFTGKWHLGEADYALPNAQ 166 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG-GEQQAIA 261 G+D + +Y + +P PD ++ + F + A+ G ++A+ Sbjct: 167 GYDVMQ----YCGLYHLNAYTYADP--TWFPDMDPELRAM-FQRVTRGALSGKAGEKAVE 219 Query: 262 D--ITPKYMED--------------LDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 D + +Y+ D + FLD AK+ PF++ H Sbjct: 220 DFKVNGQYVNTPVVDGKAGVVGIPFFDSYVEKAALGFLDDAAKAGSPFYINVNFMKVHQP 279 Query: 306 NYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV-P 364 N P ++ S +++ Y D +VE++ + L G NTL+ +T+DNG +V P Sbjct: 280 NMPAPEFEHKSLSKSKYADSVVELDARIGRIMDKLRSLGLDKNTLVFYTTDNGAWQDVYP 339 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAK 423 G TPFRG KG+ EGG RVP W G I+P K+ IV DL T +AG Sbjct: 340 DAGYTPFRGTKGTVREGGNRVPAMAVWPGKIKPGTKNHDIVGGLDLMATFASVAGLT-LP 398 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA--AVRMDEFKYHVLIQQPY 481 + + D D + LGT G+S RK+ YF +L+ AVR+ +K Sbjct: 399 DKDRDGQPMIFDSYDMSPVLLGT-GKSARKSWFYFTEDELSPGAVRVGNYK--------A 449 Query: 482 AYTQSGYQGGFTG-----TVMQTAGSS--------VFNLYTDPQESDSIGVRH------- 521 + G G TG T + GSS +F+L+ DPQE + + + Sbjct: 450 VFNLRGDDGAATGALAVDTNLGWKGSSKYVATVPQIFDLWQDPQERYDVFMNNYTERTWT 509 Query: 522 -IPMGVPLQTEMHAYMEILKKYPPR 545 + M ++ M Y++ YPPR Sbjct: 510 LVTMSAAVKNLMKTYVQ----YPPR 530 >UniRef50_A9UPM8 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UPM8_MONBE Length = 497 Score = 160 bits (405), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 130/432 (30%), Positives = 193/432 (44%), Gaps = 75/432 (17%) Query: 114 TPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPP--------MYGQP- 163 TP ++ +A+ G+ T YS SP+RA+++TG+YS+ GI + P +Y Sbjct: 1 TPHLEKLAASGMTFTQWYSTFHVCSPSRASMMTGRYSVRSGIGIAPGVRALSSSIYPAQA 60 Query: 164 -GGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDM-YTEW 220 GGL TT+ + L + GY T AIGKWH+G+ + P N GFD++ G DM + W Sbjct: 61 VGGLPLNETTMAEALKEAGYATAAIGKWHLGQREIFLPTNQGFDEYLGIPFSQDMGLSFW 120 Query: 221 RDVHVNPEVALSP------DRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQR 274 ++ P P D ++ I+Q P + +L R Sbjct: 121 FLNNLQPVEPYQPVPLPLLDGTDVIEQ-----------------------PVALSNLVHR 157 Query: 275 WMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFA 334 +++ F+ + +SD PFFLY H N + K+ GSS + + GD + EM+ Sbjct: 158 YIERATDFIKRSHESDTPFFLYLPFNHVHAPNSCSPKFCGSS-EQGAVGDAVQEMDWAIG 216 Query: 335 NLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 + LEK G ++TL FTSDNG G R K S WEGG +VP +W GM Sbjct: 217 RIMSYLEKLGLENDTLTFFTSDNGAPLLQDGAGNGVLRDGKASMWEGGFKVPALAHWPGM 276 Query: 395 IQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 I+ + S + AD++PT + AG P +P DG+D + LG G + Sbjct: 277 IKGNQVSHELTSTADIYPTLMHFAGVP-------LPSDRVYDGIDLSDVLLGKEGAKGHE 329 Query: 454 AEHYFLN-------GKLAAVRMDEFKYH----VLIQQPYAYTQSGYQGGFTGTVMQTAGS 502 ++ N G+L AVR + K + QP+A G Q Sbjct: 330 CIMFYHNAVAANASGELYAVRCGDMKVYWATASTTSQPWA---DGPQ----------EPP 376 Query: 503 SVFNLYTDPQES 514 VFNL DP E+ Sbjct: 377 LVFNLTADPGET 388 >UniRef50_UPI00005846A1 PREDICTED: similar to arylsulfatase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005846A1 Length = 552 Score = 159 bits (403), Expect = 2e-37, Method: Compositional matrix adjust. Identities = 135/490 (27%), Positives = 211/490 (43%), Gaps = 74/490 (15%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPT----PDIDAVASQGLILTSAYSQPS-SSPT 139 KPN V+F DD+G+ D+ + G+PT P D + G+ T Y + +P+ Sbjct: 57 KPNFVIFFADDMGYGDLA------SYGHPTQERGPIDDVMVENGIKFTQGYVPDTVCTPS 110 Query: 140 RATILTGQYSIHHGIL---------MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 R +LTG+Y + G+ +P + + G T+ + L ++GY T GKWH Sbjct: 111 RVALLTGRYPVRSGVFSGTGGSRVFLP--WTRSGLPSTELTIAEALKEEGYTTGMAGKWH 168 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV-ALSPDRSEYIKQLPFSKDDV 249 +G N E++ V GF+ V H+ P +++ D + P D Sbjct: 169 LGLNSETRDDGVHLPMHHGFDFVG---------HILPFTNSMACDDTGRFVDFP---DVT 216 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 Q +A P L Q +++ V F++ A PFF Y+ H Y + Sbjct: 217 KCFLYKRDQIVAQ--PFNHTYLTQTFVNDAVSFIEDNAHD--PFFFYFPFSHPHVPLYAS 272 Query: 310 AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT 369 ++AG S R YGD + EM+ + LE G NTL++F +D+GP+ E HG Sbjct: 273 PRFAGKS-QRGEYGDNINEMSWAVGEVIDALEAKGLSQNTLVLFLADHGPQPEYCAHGGD 331 Query: 370 P--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANL 427 P F+G K +TWEGG+RVP YW G I PR+SD +V D+ T +DLA Sbjct: 332 PSIFKGYKTNTWEGGIRVPFVAYWPGQITPRESDALVSTLDIMRTVVDLAN-------GT 384 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ-------- 479 +P T DG T L N S +++ +L AVR +K H + Sbjct: 385 LPDDTAYDGEVITDVLL-KNAPSPHDVLYHYCKDRLMAVRSGPYKVHYFTHRVQTQDYFA 443 Query: 480 --------PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 P A+ Y + V + ++N+ DP E+ P+ L + Sbjct: 444 GECQDGGLPLAHYFDCYH-CYDSCVTEQDPPLIYNVEHDPIEA-------YPLNTTLDSS 495 Query: 532 MHAYMEILKK 541 + +M L++ Sbjct: 496 LAEFMVDLQE 505 >UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QWC8_9PLAN Length = 468 Score = 159 bits (401), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 122/339 (35%), Positives = 161/339 (47%), Gaps = 45/339 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 +PN+VV + DD+G+ D+G +G PTP +DA+A+ G+ TS Y S P SPTRA + Sbjct: 29 RPNIVVIVGDDMGYHDLGVHG---CKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGL 85 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 LTG+Y G P P G GL TTL L GY T +GKWH+G +++ P Sbjct: 86 LTGRYQQRFGHEFNPG-PTPTGEIGLPLSETTLADRLKKVGYKTGMVGKWHLGNDEKRHP 144 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 + GFD+F GF + Y +P + +L +RG E Sbjct: 145 LSRGFDEFFGFLGGARTYFA------------TPGNASAGTKL---------LRGRE--- 180 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY-----AG 314 + D +Y+ D R V ++D+ S PFFLY H + KY A Sbjct: 181 VVD-EKEYLTDAFAR---EAVAYIDRSKAS--PFFLYLTFNAVHTPMEASQKYLDRFTAV 234 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGA 374 S P R Y M M+D + LE+ L+NTLI F SDNG TP RG Sbjct: 235 SDPKRQKYCAMMSAMDDAVGQVVAKLEREKLLENTLIFFVSDNGGPTAANTGDNTPLRGF 294 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPT 412 K +TWEGG+RVP FV WKG I K+ D V D PT Sbjct: 295 KATTWEGGIRVPYFVSWKGKIPAGKTYDQPVIQIDFVPT 333 >UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTW6_9SPHI Length = 486 Score = 158 bits (400), Expect = 5e-37, Method: Compositional matrix adjust. Identities = 142/459 (30%), Positives = 217/459 (47%), Gaps = 53/459 (11%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATIL 144 PNVV+F +DD+G+ D+ G A+ TP++D +A++G T+ + Q S +RA +L Sbjct: 38 PNVVLFFMDDLGYGDLSVTG---ALDYTTPNLDKMAAEGTRFTNFLAAQAVCSASRAALL 94 Query: 145 TGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 TG Y G+ P GL TL +LL ++GY T GKWH+G+NK+ P G Sbjct: 95 TGCYPNRLGLYGALGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGDNKQFLPMQQG 154 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD- 262 FD++ G DM+ L P +++ K P D + G E + + D Sbjct: 155 FDEYYGVPYSHDMW------------PLHPAQAQ-AKYPPLRWIDGNEP-GPEIKDLNDA 200 Query: 263 --ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART 320 IT E V F+ K KPFFLY H +A++ G S AR Sbjct: 201 GKITGTITEK--------AVSFIRNHKK--KPFFLYVPHPLPHVPLATSARFKGQS-ARG 249 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKGST 378 +GD + E++ + L++ G NTL++F SDNGP H + FR KG++ Sbjct: 250 IFGDVLTELDWSVGQIMNELKQQGLDKNTLVIFISDNGPWLNYGDHAGSSGGFREGKGTS 309 Query: 379 WEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 +EGG RVP V W G++ R S+ ++ D+ PT ++ G +PK IDGV Sbjct: 310 FEGGHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCG-------ARLPKQR-IDGV 361 Query: 438 DQTSFFLGTNGQSNR-KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY---QGGFT 493 D + G N + R K +Y+ L AVR ++K ++ P T G+ QGG Sbjct: 362 DWVALLKGDNSVTPRDKFYYYYRKNSLEAVRQGDWK--LVFAHP-GRTYEGFLPGQGGKP 418 Query: 494 GTVMQT--AGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 G +T + +++L DP E + +H + L+T Sbjct: 419 GPSTETHAIAAGLYDLRRDPGERYDVREQHPEVVARLET 457 >UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 Length = 491 Score = 157 bits (396), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 141/489 (28%), Positives = 207/489 (42%), Gaps = 112/489 (22%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +P++V L DD+G+ DVGF+G + TP++D +A+QG L Y+QP +PTRA L Sbjct: 48 RPHIVYILADDLGFADVGFHGSDIK----TPNLDHLAAQGARLGQFYTQPFCTPTRAAFL 103 Query: 145 TGQYSIHHGILMPPMYGQPGGLQ-GLTT----LPQLLHDQGYVTQAIGKWHMGE-NKESQ 198 TG+Y +H+G+ + + P G + GL T LPQ L D GY T +GKWH+G +++ Sbjct: 104 TGRYPLHYGLQVGAI---PSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFW 160 Query: 199 PQNVGFDDFRG--------FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 P+ GFD F G F + T+W H N +V + E F K+ V Sbjct: 161 PRQRGFDSFYGPLVGEIDHFKHEAHGVTDW--YHDNTQV-----KEEGYDTELFGKEAV- 212 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------- 303 + IA PK P FLY H Sbjct: 213 -------RLIAAHDPK------------------------TPLFLYLAFTAPHTPFQAPQ 241 Query: 304 --FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG--- 358 D Y + ++P R +Y + M+D ++ L G +NTLIVF SDNG Sbjct: 242 SYLDQYAHI----AAPQRRAYAAMITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTR 297 Query: 359 -----PEAEVP---PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLF 410 E V P P+R KGS +EGG RV W G I P ++G++ + D+ Sbjct: 298 SKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWPGRIAPGAAEGVMHVVDML 357 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470 PT LAG A+L K+ +DGVD GQ+ R Y + AVR Sbjct: 358 PTLAKLAG------ASLA-KSKPLDGVDVWPAL--AAGQAGRAGIVYNVEPTQGAVRDGR 408 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K V+ + +F++ DP E+ + +H LQ Sbjct: 409 WK-------------------LVWRVVLPPTAELFDVEADPSETTDVSAQHPEKVAELQG 449 Query: 531 EMHAYMEIL 539 ++ A + Sbjct: 450 KVVALARTM 458 >UniRef50_A6DLD9 Sulfatase n=2 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DLD9_9BACT Length = 517 Score = 156 bits (394), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 142/518 (27%), Positives = 225/518 (43%), Gaps = 96/518 (18%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 +KPN+++ DD+G+ D+ GG G TP ID +A+ G+ +S Y S + +P+R + Sbjct: 22 EKPNILIIYADDIGYGDLSCYGG---TGAQTPFIDRLANDGIRFSSGYASAATCTPSRYS 78 Query: 143 ILTGQYSIHH--GILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG----- 192 +LTG+Y+ + ++P P+ P + + D GY+T +GKWH+G Sbjct: 79 LLTGEYAFRNKSAKILPGNAPLIIDPAK----PNIASFMKDAGYITALVGKWHLGLGLSD 134 Query: 193 ------ENKESQPQNVGFDDFRGFNSVSD----MYTEWRDV-HVNPE----------VAL 231 N + P+ +GFD + D +Y E +V ++P V Sbjct: 135 GSFDWNSNIKPAPRELGFDYSFYMAATGDRVPSVYIENSEVVDLDPSDPIKVSYAKPVGT 194 Query: 232 SPDRSEYIKQLPFSKDDVHA------------VRGGEQQAIADITPKYMEDLDQRWMDYG 279 P + L D HA + GG D ED+ +++ Sbjct: 195 EPTGISHPHLLTVQADVQHAGTIVNGISRIGTMTGGHAARFKD------EDMADTYLNKA 248 Query: 280 VKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKT 339 + F++K D+PFF+Y+ H P+ ++ GSS + GD +V+ + L KT Sbjct: 249 IDFINK--SKDQPFFMYFAAHDNHVPRRPHPRFQGSS-SLGPRGDAIVQFDWTVGKLIKT 305 Query: 340 LEKNGQLDNTLIVFTSDNGP----------EAEVPPH-GRTPFRGAKGSTWEGGVRVPTF 388 L+ N NTLI+ +SDNGP EA H PFRG K S WEGG R+P Sbjct: 306 LKANKMYRNTLIILSSDNGPVLFDGYWEGSEARNGDHKAAGPFRGGKYSLWEGGTRMPFI 365 Query: 389 VYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 V W G IQ S ++ D+F + L G +PK+ DG + +G Sbjct: 366 VSWPGKIQSGTSSALISQVDIFASIATLIGKD-------LPKSASPDGQNMLPALMG--- 415 Query: 449 QSNRKAEHYFLNGKLA--AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 + Y + L+ A+RM ++KY P T+ G + T + G +FN Sbjct: 416 -KSPVGRDYLVEEALSQVALRMGDWKY----IPPGTVTERGGLDEWIKTPVHPPG-MLFN 469 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 L DP E++ + +H ++ A + ILKK P Sbjct: 470 LADDPGETNDLSKQH-------PKKVKAMLAILKKEAP 500 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 155 bits (393), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 117/354 (33%), Positives = 174/354 (49%), Gaps = 59/354 (16%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 +PN+V+ + DD+G+ DVGFNG PTP +D +A+ G++ T+ Y S P SP+RA + Sbjct: 44 RPNIVLIVADDLGYSDVGFNG---CKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100 Query: 144 LTGQYSIHHGILMPP-----MYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 LTG++ G P +G+ PG TTL L + GYVT AIGKWH+G+ K Sbjct: 101 LTGRHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGDAKP 160 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD++ GF+ ++ W D+ + KD + V G+ Sbjct: 161 FWPNRRGFDEWFGFSGGG--FSYWGDLGM--------------------KDPLLGVHRGD 198 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY----------YGTRGCHFDN 306 + + PK + L + VKF+ + +PFFLY + TR H Sbjct: 199 EP----VDPKTLTHLTDDFSTEAVKFIQR--HETEPFFLYLAYNAPHAPDHATR-AHLQK 251 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 + +Y G R YG + M++ + + ++G +NT+I+F SDNG E H Sbjct: 252 TAHIEYGG----RAVYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGGRRE---H 304 Query: 367 GRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 P+RG KG +EGG+RVP V W G ++ K + + DLFPTAL AG Sbjct: 305 AVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAG 358 >UniRef50_A6DHI4 Arylsulfatase A (ASA) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI4_9BACT Length = 511 Score = 155 bits (392), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 129/451 (28%), Positives = 196/451 (43%), Gaps = 91/451 (20%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 +KPN+V DDVG+ DVG G + PTP ID +A G+ T + S + SP+R Sbjct: 22 EKPNIVFIYGDDVGFGDVGVYG---SEKIPTPHIDKLAKGGIQFTDGHCSAATCSPSRFA 78 Query: 143 ILTGQYSIHHGI-LMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKES--- 197 +LTG ++ HG+ ++PP P + + TLP++L + GYVT +GKWH+G + Sbjct: 79 MLTGVHAFRHGVNILPP--NAPLSIPTDIPTLPKMLRENGYVTGVVGKWHLGIGAKGVET 136 Query: 198 --------QPQNVGFDDFRGFNSVSDMY-------------------------------- 217 P +GFD S +D Sbjct: 137 DWNGDVKPGPLEIGFDQMFLLPSTNDRVPCVYLDGHRVYNYDPNDPIYVGRTLESVNKPG 196 Query: 218 -TEWRDVHVNPEV-ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRW 275 T++ D NPE+ P + + + + GGE+ D T M D+ + Sbjct: 197 STQYGDARKNPELMTYYPSTHGHNNSVINGIGRIGFMSGGEKALWNDET---MADV---F 250 Query: 276 MDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFAN 335 ++ +F+ + AK DKPFFLY+ ++ H P+ ++ G++ GD MV+ + Sbjct: 251 VEKASEFIKEKAKGDKPFFLYFASQDIHVPRAPHPRFQGATKL-GKRGDAMVQFDWCTGA 309 Query: 336 LYKTLEKNGQLDNTLIVFTSDNGP-----------------EAEVPPHGRTPFRGAKGST 378 L K L++ G DNT++ F+SDNGP E + G +RG K Sbjct: 310 LMKALDEAGVADNTIVFFSSDNGPVYDDGYADGSVTKTSSKETDHGHDGSGIYRGGKYQI 369 Query: 379 WEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 +EGG RVP + W I+P SD +V+ DL+ + L GH + K ID D Sbjct: 370 YEGGTRVPFIISWPAKIKPAVSDAMVNQVDLYTSFAKLVGHD-------LRKEEAIDSRD 422 Query: 439 QTSFFLGTNGQ-------SNRKAEHYFLNGK 462 + FLG Q RK +H GK Sbjct: 423 TLAAFLGEESQGLDYMFNEARKTDHAVRQGK 453 >UniRef50_A7RLK6 Predicted protein (Fragment) n=11 Tax=Eumetazoa RepID=A7RLK6_NEMVE Length = 380 Score = 155 bits (392), Expect = 4e-36, Method: Compositional matrix adjust. Identities = 117/357 (32%), Positives = 170/357 (47%), Gaps = 56/357 (15%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KP+++V + DD+GW DV F+G PTP++D +A++G+IL + Y P +PTRA+++ Sbjct: 4 KPHIIVIVADDLGWDDVSFHGSPQI---PTPNLDYLATRGVILNNYYVSPICTPTRASLM 60 Query: 145 TGQYSIHHGILMPPMYG-QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQN 201 TG+Y IH G+ +Y QP GL G TLPQ L QGY T IGKWH+G KE P Sbjct: 61 TGKYPIHLGMQHFVIYAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTY 120 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD F G S +++Y F + + Sbjct: 121 RGFDSFYGMWSA---------------------KADYWNHTSFENGFWGTDMRNNMEPVT 159 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN---------AKY 312 KY ++ R +K ++ KS+ P FLY + H N + K+ Sbjct: 160 TDKDKYATEVFTR---EALKVIENHNKSE-PLFLYIAHQAPHSANPHDPLQAPEDKVKKF 215 Query: 313 AG--SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG--- 367 +G R Y + ++D +++ LEKN L+N++I+FT+DNG PHG Sbjct: 216 SGVIDKIERQQYAAMVTCVDDSIGEVFRALEKNRMLNNSVILFTTDNGGA----PHGFNR 271 Query: 368 ----RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLFPTALDLAG 418 P RG K WEGGVR F+Y +I+ R S ++D+ D PT LAG Sbjct: 272 NQGSNYPLRGGKDMMWEGGVRGTAFIY-SDLIKHKGRVSTDLIDVTDWVPTLYYLAG 327 >UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKY2_PLALI Length = 483 Score = 155 bits (391), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 126/398 (31%), Positives = 176/398 (44%), Gaps = 60/398 (15%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 +PN+++ + DD+G+ DVGF+G PTP++DA+A G+ TS Y + P SPTRA + Sbjct: 32 RPNILLIVGDDMGYADVGFHG---CKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGL 88 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 LTG+Y G P G T+ L GY T +GKWH+G PQ G Sbjct: 89 LTGRYQQRFGHEFNPSGANTGLPLTEVTIADRLKQVGYTTGLVGKWHLGSQPAMHPQERG 148 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 F++F GF + + + + + +RG E D Sbjct: 149 FEEFIGFLGGAHSFFDAQGI----------------------------LRGHEPVKTID- 179 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AKYAG-SSPA 318 Y DL R V F++K DKP+FLY H + AK A S Sbjct: 180 ---YTTDLFGR---EAVSFIEK--HRDKPWFLYLSFNAVHTPMHATEDRMAKLASISDQE 231 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG----PEAEVPPHGRTPFRGA 374 R +Y M+ M++ + LE GQ TL++F SDNG P + TP RG+ Sbjct: 232 RRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSINTPLRGS 291 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 K +T EGG+RVP V W G I P D V DL TAL +AG V K Sbjct: 292 KRTTLEGGIRVPFVVSWPGKIAPAVFDSPVIQLDLTATALAVAG---------VEKDVKS 342 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 DGV+ + G + A ++ G+ AVR ++K Sbjct: 343 DGVNLLPYLQGKQSEVPHAA-LFWRFGEQMAVRAGDYK 379 >UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TQQ7_9BACE Length = 853 Score = 155 bits (391), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 148/496 (29%), Positives = 224/496 (45%), Gaps = 78/496 (15%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT-RAT 142 +KPNVV+ DD G+ D+G G + TP ID +A +G+ LT Y S S RA Sbjct: 21 QKPNVVIIFTDDQGYQDLGCYGSPLI---QTPFIDRMAKEGIKLTDFYVSSSVSSASRAG 77 Query: 143 ILTGQYSIHHGI---LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +LTG+ + +G+ P G P TL + L +QGY T GKWH+G+ K P Sbjct: 78 LLTGRLNTRNGVKGVFFPESEGMP---SEEITLAEALKEQGYTTGCFGKWHLGDLKGHLP 134 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD---RSEYIKQLPFSKDDVHAVRGGE 256 + GFD + G +DMY + P + + R Y L +K+D VR Sbjct: 135 TDQGFDYYYGIPYSNDMY-------IGPSQQFASNVTFREGY--NLSKAKEDQEFVRTSS 185 Query: 257 QQAIA----DITPKYMED------LDQ-----RWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 + I + +P + D DQ R+ D+ + F++ ++PFF+Y Sbjct: 186 RADIKKRLNNASPLFEGDKIIEYPCDQSTTTRRYFDHAIDFIEN--NPEQPFFVYITPSM 243 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + ++ G S R YGD + E++ L L+K +NTL++F SDNGP Sbjct: 244 PHVPLFASEQFKGKS-KRGLYGDVVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNGPWL 302 Query: 362 EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 G + P RG K S +EGGVRVP + WKG I SD IV DLFPT + AG Sbjct: 303 SFKEDGGSAEPLRGGKFSYYEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 K IDG++ +S FL R Y G++ +R ++ Y Sbjct: 363 CQSFKQK--------IDGINISS-FLKNPSLRLRDEYVYVKGGEVHGIRKGDWVY----- 408 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++G G V + +FNL D ES+++ +++ ++ E+ Sbjct: 409 ----LPKTGNSKFKKGDVPE-----LFNLKQDIGESNNLHLQY-------PNKVKELQEV 452 Query: 539 LKKYP-----PRAQIK 549 +KKY P +QI+ Sbjct: 453 MKKYQSTSTMPYSQIR 468 >UniRef50_UPI0001A444F6 arylsulfatase A n=1 Tax=Pectobacterium carotovorum subsp. brasiliensis PBR1692 RepID=UPI0001A444F6 Length = 487 Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 124/422 (29%), Positives = 189/422 (44%), Gaps = 62/422 (14%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 KPNV++ DD+GW D+ G PTP +D +A+ G T+ Y S SSP+R + Sbjct: 30 KPNVIILFTDDMGWADMSVQGAKT----PTPHLDKLAATGQRWTNFYVSSAISSPSRGGL 85 Query: 144 LTGQYSIHHGILMPPMYG-----QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 +TG+ G+ + G P G ++ + L GY T GKWH+G + Sbjct: 86 MTGRIETKTGLYGTKIPGVFMDEDPDGFPDDEISMAESLQHNGYRTIMYGKWHLGTQSTA 145 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRD-VHVNPEVALSPDRSEYIKQL-------------- 242 P GFD++ G + +D ++ D V +N + P R E + ++ Sbjct: 146 FPTRHGFDEWYGIPTSNDRFSTVVDQVEMNRLASSDPKRRELLSKMEEINRAPRQEYWNV 205 Query: 243 PF-------SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 P K +AV G QQA + +D+ + + Y D+ FF+ Sbjct: 206 PLYHSYKDNGKQVDYAVPQGFQQA------SFTKDVTNKAVQYIAD------NKDQSFFM 253 Query: 296 YYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 Y H + + ++ G YGD M+E++ +Y+ LE N +NT+++FTS Sbjct: 254 YMAYPQTHVPLFTSPEFKGK--GHNPYGDVMLEIDWSVGQIYQALEANKLAENTIVIFTS 311 Query: 356 DNGPEAEVPPHGRT----PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFP 411 DNGP + G P R K + +EGG RVP V WK I P+ D I DL P Sbjct: 312 DNGPWLQYDKDGLAGSALPLRSGKSTVFEGGQRVPFIVNWKSHIAPKVVDDIGSTLDLLP 371 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ-SNRKAEHYFLNGKLAAVRMDE 470 T + + G A+ +DGVD ++ FL NG+ S R YF GK+ A R + Sbjct: 372 TLMKITGSQHAQRD--------LDGVDLSAAFL--NGKPSARTFMPYFYWGKMDAYRDGD 421 Query: 471 FK 472 +K Sbjct: 422 YK 423 >UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED2_PARD8 Length = 468 Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 124/420 (29%), Positives = 193/420 (45%), Gaps = 60/420 (14%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A + + ++PNV++ +DD G+ D+G G + + TP ID +A +G+ LT Y S Sbjct: 16 AAVATQAAERPNVIIVFIDDFGYGDLGCYG---STKHRTPHIDQMAKEGIRLTDFYVGSS 72 Query: 136 -SSPTRATILTGQY----SIHHGILMPPMYGQ------PGGLQGLT----TLPQLLHDQG 180 S+P+R+ +LTG Y S+H P+ + P +GL T+ +L+ +QG Sbjct: 73 VSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGEITIAELMKEQG 132 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Y T IGKWH+G+ P GFD + G +DM DR Sbjct: 133 YATACIGKWHLGDQLPFLPTRQGFDYYYGIPYSNDM-----------------DRP--YC 173 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 LP EQ+ + + P + L R+ + V+F+ +S PFF+Y Sbjct: 174 PLPLM----------EQEEVI-VAPVGHDSLTIRYTNKTVEFIKSHKES--PFFIYLCHN 220 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 H + + G S YGD E++ L +TL++ G NTLI+FTSDNG + Sbjct: 221 MTHNPLAASPAFKGKS-QNGLYGDATEELDWSMGVLLETLKEEGLDQNTLIIFTSDNGAD 279 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGH 419 R P RG KG+T+EGG RVP + W I +++D +V D PT + Sbjct: 280 EHFGGTNR-PLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCSY 338 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 VP IDG + + G + S + +Y+ +L AVR +KYH+ +++ Sbjct: 339 A-------VPSDRVIDGHNVSGILEGESMASPTETFYYYQKQQLQAVRWGNWKYHLPLKE 391 >UniRef50_Q96EG1 Arylsulfatase G n=22 Tax=Euteleostomi RepID=ARSG_HUMAN Length = 525 Score = 153 bits (387), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 135/480 (28%), Positives = 208/480 (43%), Gaps = 50/480 (10%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS-SP 138 K G+KPN V+ L DD+GW D+G N A T ++D +AS+G+ ++ S+ SP Sbjct: 30 KTRGQKPNFVIILADDMGWGDLGAN---WAETKDTANLDKMASEGMRFVDFHAAASTCSP 86 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 +RA++LTG+ + +G+ GGL TTL ++L GYVT IGKWH+G + Sbjct: 87 SRASLLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSY 146 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD + G DM + +P P + L A+ E Sbjct: 147 HPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYEN 206 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 I + P + L Q++ + +F+ + + S +PF LY H P + + Sbjct: 207 LNIVE-QPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVP-LPVTQLPAAPR 264 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFRG-- 373 R+ YG + EM+ + + ++ + +NT + FT DNGP A+ + PF G Sbjct: 265 GRSLYGAGLWEMDSLVGQIKDKVDHTVK-ENTFLWFTGDNGPWAQKCELAGSVGPFTGFW 323 Query: 374 --------AKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKV 424 AK +TWEGG RVP YW G + S ++ + D+FPT + LA Sbjct: 324 QTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALA------- 376 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-----GKLAAVRMDEFKYHVLIQQ 479 +P+ DGVD + G + +R H G L VR++ +K + Sbjct: 377 QASLPQGRRFDGVDVSEVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGG 436 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 A G TG +Q +FNL D E+ VPL+ Y +L Sbjct: 437 ARACD------GSTGPELQHKFPLIFNLEDDTAEA-----------VPLERGGAEYQAVL 479 >UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FU81_9SPHI Length = 461 Score = 153 bits (386), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 144/467 (30%), Positives = 220/467 (47%), Gaps = 93/467 (19%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTS-AYSQPSSSPT 139 +KPN++ L DD+G+ D+G GNP TP +D +A++G+ T + PS +P+ Sbjct: 22 QKPNIIFVLTDDLGYSDLG------CYGNPSISTPFLDKMAAKGVRATDYMVTSPSCTPS 75 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 RA++LTG+Y+ + + P G GL T+ ++L ++GY T IGKWH+G++ E Sbjct: 76 RASLLTGRYASRYNLPDPIGPGAKNGLPAQEVTIAEMLKEKGYHTALIGKWHLGDHGEYL 135 Query: 199 PQNVGFDDFRGFNSVSDMYT-EWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD F G +Y+ ++RD +V + + R+ Q P V Sbjct: 136 PNKQGFDYFYGM-----LYSHDYRDPYVKTDTTIKIFRN----QTP-------VVTRPAD 179 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 A++ I Y E++ Q ++ + K + PFFLYY H P A A S Sbjct: 180 SALSRI---YTEEVKQ--------YISQQKKGE-PFFLYYAHNMPHL---PVAFSAESGR 224 Query: 318 ARTSY-----GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP------- 365 + + G + +++ A ++ +LE+ G DNT+ +F+SDNGP E P Sbjct: 225 MKDLHFAGPLGAVLEDLDRQLAIMWASLEEQGLADNTIFMFSSDNGPWIEYPVRMSGDHK 284 Query: 366 ----HGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-----VDLADLFPTAL 414 H T FRG+K T+EGGVRVP YWKG +GI + D+ PT Sbjct: 285 TKNWHVGTAGVFRGSKAQTYEGGVRVPFITYWKG----HTPEGITLRNAISNVDILPT-- 338 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH---YFLN-GKLAAVRMDE 470 LA GA VP + +DG Q+ L T+ N A+H Y +N GK+ AVR Sbjct: 339 -LAEWTGAS----VPASRTLDG--QSIAALLTSKSENITADHRPIYLVNHGKVEAVRKGS 391 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 +KY L + Y+ A +FN+ DP E ++ Sbjct: 392 WKYRELPAGVNNNSGKPYE----------AAKELFNISYDPSERTNV 428 >UniRef50_UPI00005887B4 PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005887B4 Length = 465 Score = 153 bits (386), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 123/401 (30%), Positives = 187/401 (46%), Gaps = 74/401 (18%) Query: 91 FLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYS-QPSSSPTRATILTG 146 L+DD+GW D+G GNP TP++D +A++G++L Y+ P SP+RA +LTG Sbjct: 1 MLMDDMGWGDLGI------YGNPAKETPNLDQMAAEGILLPDFYAANPLGSPSRAALLTG 54 Query: 147 QYSIHHGILMPPMYGQPGGLQGLTT---------LPQLLHDQGYVTQAIGKWHMGENKES 197 + I +G + Q + LP+LL GY ++ +GKWH+G + Sbjct: 55 RLPIRNGFYTTNGHAHNAWSQQIVKGGIPDSEILLPKLLKLSGYKSKIVGKWHLGHLPQY 114 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD++ G + +IK LP ++ R E Sbjct: 115 LPLKHGFDEWFGAPNC------------------------HIKSLP----NIPVYRDSE- 145 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 + +Y E G+ F++K A++ +PFFLY+ H Y + + G S Sbjct: 146 -----MIGRYFEQ-------EGLNFIEKSAEAKQPFFLYWTPDATHEPVYASKPFLGRS- 192 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLD-NTLIVFTSDNGPEAEVPPHGRT--PFRGA 374 R YGD ++E+++ + L K Q+D NT +VFTSDNG +G T P+ Sbjct: 193 QRGLYGDAVIELDEGVGQILGKL-KELQIDTNTFVVFTSDNGAATYAKENGGTNGPYLCG 251 Query: 375 KGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 K +T+EGG+RVPT +W I+P R + I ++ DLF TAL+LA P F Sbjct: 252 KRTTYEGGMRVPTIAWWPTHIKPGRVTHQIGNIMDLFTTALNLA-------HIRPPSDRF 304 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 IDG L NR Y+ ++ AVR+ E+K H Sbjct: 305 IDGQSLLPALLNGEEDVNRTM-FYYRGNQMMAVRLGEYKAH 344 >UniRef50_B7QJZ0 Arylsulfatase B, putative n=9 Tax=Ixodes scapularis RepID=B7QJZ0_IXOSC Length = 529 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 114/376 (30%), Positives = 171/376 (45%), Gaps = 59/376 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A K + PN++ L DD+GW DV F G PTP++D +ASQG+IL + Y QP Sbjct: 10 AVFSKSSTVPPNIIFILADDLGWADVSFRGDPQI---PTPNLDVLASQGIILNNYYVQPL 66 Query: 136 SSPTRATILTGQYSIHHGIL-MPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +P+R +++G Y IH G+ + P G+P GL LT +P+ L + GY T IGKWH+G Sbjct: 67 CAPSRGALMSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGY 126 Query: 194 NKES-QPQNVGFDDFRGF-NSVSDMYTE---WRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 +KES P GFD F G+ N D Y W + L F ++ Sbjct: 127 HKESYTPTRRGFDSFYGYLNGGEDYYDHTILWSNA----------------SGLDFWENT 170 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 G + T K + L K KP FLY+ + H +Y Sbjct: 171 TPVRNEGNHYSTELFTKK-------------AQSLIKHHDPAKPMFLYFSHQAVHCGDY- 216 Query: 309 NAKYAGSSPA-------------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 K +PA R+ + + E++ + + L K G L N++++F++ Sbjct: 217 --KVELEAPALAIAHFPYIKELNRSIHAGAVYELDKSVGLVMEALNKRGMLSNSIVIFST 274 Query: 356 DNG--PEAEVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFP 411 DNG P P G P RG+K + WEGG R FV+ + + R S+ ++ + D P Sbjct: 275 DNGGLPWGVEPNSGYNWPLRGSKETNWEGGARGAAFVWSPLLFKSGRLSNQMMHITDWLP 334 Query: 412 TALDLAGHPGAKVANL 427 T AG + + N+ Sbjct: 335 TLYSAAGGNVSTLGNI 350 >UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y214_PEDHD Length = 472 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 132/424 (31%), Positives = 188/424 (44%), Gaps = 61/424 (14%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 ++ + KT KPNV+V + DD G++D G GG PTP+IDA+A QG T AY Sbjct: 18 ISAAQVKTAAKPNVIVIVSDDAGYVDFGCYGGKQI---PTPNIDAIAKQGTRFTDAYVSA 74 Query: 135 S-SSPTRATILTGQYSIHHG-------ILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQA 185 S +P+RA ILTG+Y G +L P G+ T+ + GY T A Sbjct: 75 SVCAPSRAGILTGRYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIA 134 Query: 186 IGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 IGKWH G+ + P N GF++F GF + ++ N Sbjct: 135 IGKWHQGDEPKHFPLNRGFNEFYGFTGGHRDFFAYKGKRTNE------------------ 176 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 HA+ ++ + + Y+ D+ + D F+ A DKPFF+Y H Sbjct: 177 ----HALYN-NKEIVPENEITYLTDM---FTDKATSFI--TANKDKPFFMYLSYNAVHTP 226 Query: 306 NYPNAK------YAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD-NTLIVFTSDN 357 NAK YA + R +Y M ++D + TL+ N QLD NTLI+F +DN Sbjct: 227 --MNAKKDLMERYASIADTGRRAYAAMMTSLDDGIGKVMATLKAN-QLDKNTLIIFINDN 283 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDL 416 G A V P RG KGS WEGG+RV + W G I K+D V D+ PTA+ Sbjct: 284 GG-ATVNSSDNGPLRGMKGSKWEGGIRVAMMMKWPGHIAANKTDSRPVSSLDILPTAIGA 342 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVL 476 L DGV+ + N ++ +A Y+ G AA+R +K + Sbjct: 343 GKGKQKGTKKL-------DGVNLLPYLSAGNKKTPHEA-LYWRRGVAAAMREGNWKLIRV 394 Query: 477 IQQP 480 + P Sbjct: 395 KESP 398 >UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA Length = 489 Score = 151 bits (382), Expect = 6e-35, Method: Compositional matrix adjust. Identities = 140/418 (33%), Positives = 197/418 (47%), Gaps = 68/418 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTR 140 T +KPNV+V DD G+ D+G G + TP++D +AS+G TS YS S SP+R Sbjct: 43 TTEKPNVIVIFTDDQGYNDLGCYG---SPNIKTPNLDRLASEGRRYTSFYSACSVCSPSR 99 Query: 141 ATILTGQY----SIHHGILMP-PMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 A +LTG Y +H +L P YG P + T+ L GY T +GKWH+G + Sbjct: 100 AALLTGCYPKRVGLHQHVLFPQSTYGLHPDEV----TIADHLKSAGYATACVGKWHLGHH 155 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 KE+ P + GFD + G +DM PD K S DD R Sbjct: 156 KETLPTSNGFDSYYGIPYSNDMN--------------HPDNKRLGK---MSSDD----RW 194 Query: 255 GEQQAIADI--TPKYMED------LDQ-----RWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 +Q + + TP ++ +DQ R+ D ++F++ A DKPFFLY Sbjct: 195 TDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEFVE--ANQDKPFFLYLPHSM 252 Query: 302 CHFDNY-PNAKYAGSSPARTSYGDCMVEMNDV-FANLYKTLEKNGQLDNTLIVFTSDNGP 359 H Y P Y P + C++E D L +T+ G + TLIV+TSDNGP Sbjct: 253 PHIPLYVPEDVY---DPDPQNAYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNGP 309 Query: 360 EAEVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDL 416 + HG + P R KG+T+EGG RVP ++ G I S+ DL PT + Sbjct: 310 WLQFKNHGGSAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPT---I 366 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR-KAEHYFLNGKLAAVRMDEFKY 473 A G + N IDG+D TS F T+ +S R + Y +G L +RM ++KY Sbjct: 367 ASFTGVALEN----DRKIDGIDLTSTF--TSDESARDEFVFYSAHGVLEGIRMGDWKY 418 >UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LDP6_PARD8 Length = 452 Score = 151 bits (382), Expect = 6e-35, Method: Compositional matrix adjust. Identities = 132/453 (29%), Positives = 203/453 (44%), Gaps = 69/453 (15%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPT---PDIDAVASQGLILTSAYSQPS-SSPTR 140 KPN++V DD+G+ D+ G+PT P+ID +A +G +S Y S SSP+R Sbjct: 25 KPNIIVINCDDMGYGDLS------CFGSPTIKTPNIDRMAIEGQKWSSFYVSASVSSPSR 78 Query: 141 ATILTGQYSIHHGILMPPMYGQ------PGGLQGL----TTLPQLLHDQGYVTQAIGKWH 190 A +LTG+ + G MYG P GL T+ +LL GY T IGKWH Sbjct: 79 AGLLTGRLGVRTG-----MYGDQRRVLFPDSKGGLPSEELTIAELLKQAGYHTACIGKWH 133 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G E P GFD F G+ +DM R E IK L +K Sbjct: 134 LGHLPEYMPLRHGFDYFYGYPYSNDM-----------------SRKEQIK-LGNTKYPYE 175 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 + +++ + +Y +L Q+ + ++++ + + PFFLY H Y + Sbjct: 176 YIIYEQEKELEREPQQY--NLTQQVTEAAIRYIK--SNENSPFFLYLAHPMPHMPVYAST 231 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT- 369 + G S AR YGD + E++ + +TL+ G NTL++FTSDNGP G + Sbjct: 232 DFQGKS-ARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNGPWLLCKQEGGSP 290 Query: 370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P + K S +EGG RVP + W M++P + DL PT ++AG P + Sbjct: 291 GPLKDGKASMFEGGFRVPC-IMWGAMVKPGYITDMASTLDLLPTFCEIAGIP-------L 342 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 P DG+ + L R +++ +L A+R ++K H + Y Sbjct: 343 PSDRHYDGISLLN-VLKDKSTCKRDVFYFYRGSELYAIRKGKYKAHFSYRPAY------- 394 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 G T ++ +++L TDP E +I H Sbjct: 395 --GTTDKIIYDK-PVLYDLGTDPGELYNIAEEH 424 >UniRef50_B2ULS2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULS2_AKKM8 Length = 526 Score = 151 bits (381), Expect = 7e-35, Method: Compositional matrix adjust. Identities = 145/520 (27%), Positives = 228/520 (43%), Gaps = 90/520 (17%) Query: 82 TGKKPNVVVFL-LDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPT 139 T K P +V + DD+G+ DVG G A G PTP ID +A QG T AYS S +P+ Sbjct: 25 TVKPPKAIVMIYADDLGYGDVGCYG---AKGIPTPAIDKLAEQGCRFTDAYSTTSVCTPS 81 Query: 140 RATILTGQYSIHH---GILMPPMYGQPGGLQGLT-----TLPQLLHDQGYVTQAIGKWHM 191 R + TG+Y GIL PG + TLP++L GY T IGKWH+ Sbjct: 82 RYALFTGEYPWRKEGTGIL-------PGDAALIIDTKKPTLPKMLQSHGYKTYMIGKWHL 134 Query: 192 GENKESQ-----------PQNVGFDDFRGFNSVSD----MYTEWRDV-HVNPEVALSPDR 235 G ++ + P +GFD+ F + D + E +V +++P + Sbjct: 135 GLGEKGKKIDWNKHISPSPNEIGFDESFIFAATGDRVPCVILENGNVRNLDPNDPIEVSY 194 Query: 236 SEYIKQLPFSKDDVHAVR----GGEQQAIADITPK--YM----------EDLDQRWMDYG 279 LP KD+ ++ G QAI + + +M E+ D Sbjct: 195 KHNFPGLPNGKDNKDQLKLMWSHGHNQAIINGIGRIGFMKGGRSALWKDEENADIITDKA 254 Query: 280 VKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKT 339 ++++ K AK+ +PFFL + T H P ++ G S GD VE++D + + Sbjct: 255 IEYIQKSAKAKEPFFLMFATHDIHVPRCPEKRFVGKS-RHGVRGDVTVELDDCVRRITEA 313 Query: 340 LEKNGQLDNTLIVFTSDNGP-------------EAEVPPHGRTPFRGAKGSTWEGGVRVP 386 L++ G + L++F+SDNGP A P G PFR K S EGG R+P Sbjct: 314 LQQAGLEKDALVIFSSDNGPVLDDGYRDFAVRDNATHSPAG--PFRAGKYSILEGGSRIP 371 Query: 387 TFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 V W G+I+P S +++ DL GA + L+ D + Sbjct: 372 FIVKWPGVIKPGTTSKALLNQMDL-----------GASLEQLLAPGKANSFRDSENVMPA 420 Query: 446 TNGQSNRKAEHYFLN--GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS 503 G+S + +++ +N GK A+R ++K+ I A + G G G S Sbjct: 421 LLGKSAKGRDYHVINSTGKALAIRHGKWKF---IPAGVA-IRDGINGASAKMSKSPEGGS 476 Query: 504 VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +F+L DP+E D++ +H + +M A +E +++ P Sbjct: 477 LFDLEKDPKELDNVASQH----PDICEQMKAKLEEIRQRP 512 >UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deuterostomia RepID=GALNS_HUMAN Length = 522 Score = 151 bits (381), Expect = 8e-35, Method: Compositional matrix adjust. Identities = 117/405 (28%), Positives = 188/405 (46%), Gaps = 46/405 (11%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 + PN+++ L+DD+GW D+G G TP++D +A++GL+ + YS P SP+RA Sbjct: 29 QPPNILLLLMDDMGWGDLGVYG---EPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAA 85 Query: 143 ILTGQYSIHHGILMPPMYGQP--------GGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +LTG+ I +G + + GG+ LP+LL GYV++ +GKWH+G Sbjct: 86 LLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH 145 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 + P GFD++ G +P P ++ +P +D R Sbjct: 146 RPQFHPLKHGFDEWFG----------------SPNCHFGPYDNKARPNIPVYRDWEMVGR 189 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 E+ I T + +L Q ++ + F+ + A+ PFFLY+ H Y + + Sbjct: 190 YYEEFPINLKTGE--ANLTQIYLQEALDFIKRQAR-HHPFFLYWAVDATHAPVYASKPFL 246 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE-AEVPPHGRT--P 370 G+S R YGD + E++D + + L+ DNT + FTSDNG P G + P Sbjct: 247 GTS-QRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGP 305 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 F K +T+EGG+R P +W G + + S + + DLF T+L LAG P Sbjct: 306 FLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGL-------TPP 358 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 IDG++ L G+ + Y+ L A + + K H Sbjct: 359 SDRAIDGLNLLPTLL--QGRLMDRPIFYYRGDTLMAATLGQHKAH 401 >UniRef50_P50473 Arylsulfatase n=8 Tax=Deuterostomia RepID=ARS_STRPU Length = 567 Score = 150 bits (380), Expect = 9e-35, Method: Compositional matrix adjust. Identities = 137/446 (30%), Positives = 195/446 (43%), Gaps = 64/446 (14%) Query: 73 QKLAELEKKTGK------KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD---IDAVASQ 123 + L L +TG+ KPNV++ L DD+G D+ G+PT + ID +A+Q Sbjct: 48 EDLLHLLGQTGQHRTAMTKPNVILLLADDMGVGDLS------VYGHPTQEPGFIDQMANQ 101 Query: 124 GLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHD 178 GL T YS S +P+R+ I+TG+ I G+ P GL T+ + + Sbjct: 102 GLRFTQGYSGDSVCTPSRSAIVTGRQPIRTGVYGEERIFLPWTTTGLPLYEVTIAEAMKG 161 Query: 179 QGYVTQAIGKWHMGENKESQ------PQNVGFDDFRGFNSVSDMYTEWR--DVHVNPEVA 230 GY T +GKWH+G N+ S P N GFD F G N WR D ++ + Sbjct: 162 AGYTTGMVGKWHLGINENSSSDGAHLPANRGFD-FVGHNL--PFGNSWRCDDTGLHQDFP 218 Query: 231 LSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD 290 + Y ++ H G Q + D T ++ED + Sbjct: 219 DTNACFLYYNSTSVAQPFQHK---GLTQLLRDDTVGFIED-----------------NVN 258 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 KPFF+Y H + + ++ +S R YGD + EM+ + TL N DNT+ Sbjct: 259 KPFFMYVSFAHMHTSLFSSDDFSCTS-RRGRYGDNLREMDQAIEQIVTTLVDNDIDDNTV 317 Query: 351 IVFTSDNGPEAEVPPHG--RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLAD 408 I FTSD+GP E G FRG KG +WEGG R+P VYW G I P S IV D Sbjct: 318 IFFTSDHGPHREYCGEGGDANVFRGGKGQSWEGGHRIPYIVYWPGTISPGVSHEIVTSMD 377 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 + TA++L G + +P DG S L S Y+ L AVR+ Sbjct: 378 IIATAVNLGG-------SQLPTDRIYDGKCLKSVLL-EGASSPHDDFFYYCKDTLMAVRV 429 Query: 469 DEFKYHVLIQQPYAYTQSGYQ--GGF 492 ++K H Q + + G + GGF Sbjct: 430 GKYKAHFKTQTDSSQMKLGERCDGGF 455 >UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1Z7_PEDHD Length = 480 Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 127/449 (28%), Positives = 202/449 (44%), Gaps = 55/449 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS-AYSQPSSSPTRAT 142 ++PNV++ +DD+G+ D G G PTP+ + A +G+ T +Q SP+RA Sbjct: 25 QRPNVIIINMDDMGYGDTEPYG---MTGIPTPNFNKAAKEGMRFTHFNAAQAICSPSRAA 81 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQ---------GLTTLPQLLHDQGYVTQAIGKWHMGE 193 +LTG Y P G G L T+ LL GY T +GKWH+G Sbjct: 82 LLTGCY--------PNRIGLRGALSPDSKIALDTAEETIASLLKKAGYKTAMLGKWHLGS 133 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA-- 251 + P + GFD F G +DM+ D P+ A++ +S +LP D A Sbjct: 134 KAPNLPLHYGFDSFYGLPYSNDMWPV--DYEGKPQAAVAGKKS--YPELPLLDGDKPADY 189 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 VR + QA+ L + V+F++ PFFLY H +A Sbjct: 190 VRTPDDQAM----------LTGTFTRKAVRFIEN--NKSAPFFLYLAHPMPHVPLAASAA 237 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP- 370 + G S +GD ++E++ + K+L++N NT+++ SDNGP H + Sbjct: 238 FRGKSELGL-FGDVIMELDWSVGEIMKSLDRNKIASNTILIIMSDNGPWLRFGNHAGSSG 296 Query: 371 -FRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLV 428 FRG K + W+GG RVP + W G ++ + ++ D+ PT L L ++ Sbjct: 297 GFRGGKMTIWDGGTRVPCIIRWPGKVEAGSVNSNLITNMDILPTLLQL--------SHAA 348 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQPYAYTQSG 487 P IDG+ LG + ++ R+ +Y+ N L AVR +K VL +YT Sbjct: 349 PPEKKIDGISFADLLLGRSDKAPRQVFYYYYNENSLKAVRYKNWKL-VLPHTSVSYTSDI 407 Query: 488 Y-QGGFTGTVMQT-AGSSVFNLYTDPQES 514 + + GF G + ++++L DP E+ Sbjct: 408 HGKDGFPGAATRAEVKMALYDLAHDPGEA 436 >UniRef50_B3CAE2 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CAE2_9BACE Length = 467 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 130/415 (31%), Positives = 188/415 (45%), Gaps = 53/415 (12%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT-RATI 143 KPNVV+ DD G+ D+G G + TP ID +A +GL LT Y S S RA + Sbjct: 25 KPNVVIIFTDDQGYQDLGCYGSPLI---QTPSIDGMAREGLKLTDFYVSASVSSASRAGL 81 Query: 144 LTGQYSIHHGI---LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 LTG+ + +G+ P G P TL + L +Q Y T GKWH+G+ K P Sbjct: 82 LTGRLNTRNGVKGVFFPESEGMP---SEEITLAEALKEQDYATGCFGKWHLGDLKGHLPT 138 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD---RSEYIKQLPFSKDDVHAVRGGEQ 257 + GFD + G +DMY + P + + R Y L +K D VR Sbjct: 139 DQGFDKYFGIPYSNDMY-------IGPSQKFASNAVFREGYT--LSEAKADQDFVRNAPN 189 Query: 258 QA-----IADITPKYMED------LDQ-----RWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 +A + ++P + D DQ R+ D ++F+ + +KPFF+Y Sbjct: 190 RATIKKRLNSVSPLFEGDEIIEYPCDQSTTTRRYFDKAIEFVGQ--NKEKPFFVYITPSM 247 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + ++ G S R YGD + E++ L++ G +NTL++F SDNGP Sbjct: 248 PHIPLFASEQFRGKS-KRGLYGDVVEEIDWNVGRFLDYLDQQGLAENTLVIFASDNGPWL 306 Query: 362 EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 + P RG K S +EGGVRVP + WKG I SD I+ DLFPT + G Sbjct: 307 GYKEDSGSADPLRGGKFSYYEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYVG 366 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKY 473 + IDGVD +S FL R Y G++ +R ++ Y Sbjct: 367 CKSFRQE--------IDGVDISS-FLKNPSLRLRDEYVYVRGGEVHGIRKGDWAY 412 >UniRef50_UPI000180BD6E PREDICTED: similar to arylsulfatase n=1 Tax=Ciona intestinalis RepID=UPI000180BD6E Length = 501 Score = 150 bits (378), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 134/465 (28%), Positives = 210/465 (45%), Gaps = 63/465 (13%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD---IDAVASQGLILTSAY 131 L L + +PN V+ DDVG+ D + G+PT + ID +A++G+ T Y Sbjct: 10 LIILANQVLSRPNFVLIFADDVGYGDFQ------SYGHPTQERGPIDDLAAEGMRFTQWY 63 Query: 132 SQPS-SSPTRATILTGQYSIHHGILMPPMY---GQPGGL-QGLTTLPQLLHDQGYVTQAI 186 S S +P+RA +LTG+ IH G++ P GGL + TTL + L + GY T + Sbjct: 64 SAASLCTPSRAALLTGRLPIHSGMVGPTRVLHQNDAGGLPKNETTLAEALKELGYKTGMV 123 Query: 187 GKWHMGENKESQ------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 GKWH+G N+ Q P++ GFD F G N + + SP SEY Sbjct: 124 GKWHLGINELKQNDGRHLPKHHGFD-FVGTNLPFTFH-----------LFCSP--SEY-- 167 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 P K + + + I P E L + ++ +F+ + K+ PFFLY Sbjct: 168 --PVDKMKIKCFLSNKDEIIEQ--PIIPEKLTDKIVEGAKQFITENQKN--PFFLYLSLP 221 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 H + ++ S R SYGD + EM+ + L+ NTL++F SD+GP Sbjct: 222 QTHVAMFCKEEFCNKS-MRGSYGDNVNEMSWAVGEVVNQLKDLNLDQNTLVMFLSDHGPA 280 Query: 361 AEVP--PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAG 418 E +G K S+W+GG++VP +W G IQP +V D+FPT L LAG Sbjct: 281 VEFCYTGGSTGGLKGGKASSWDGGIKVPAVAWWPGTIQPGVKTQVVSTMDIFPTFLQLAG 340 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + G NL DG+ + L + + ++ + +L AVR +K H Q Sbjct: 341 NEGNN-GNL-------DGMSISDLLLSNHDNEVHEILFHYCSDRLMAVRYGRYKIHFHTQ 392 Query: 479 QPYAYTQSGYQGGFTGTVMQ----TAGSS------VFNLYTDPQE 513 + + + G ++ A ++ +F++ TDP+E Sbjct: 393 HLHVFNSNCIDGKALENIVDYFDCYANTTTHNPPLIFDINTDPEE 437 >UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LCL3_PARD8 Length = 476 Score = 149 bits (377), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 137/444 (30%), Positives = 201/444 (45%), Gaps = 53/444 (11%) Query: 87 NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS-AYSQPSSSPTRATILT 145 N+V+ LDDVG+ D FNG A G TP+ID +A++G+ T QP S +RA +LT Sbjct: 25 NIVLINLDDVGYGDFSFNG---AYGYTTPNIDKMAAEGVRFTHFLVGQPISGASRAGLLT 81 Query: 146 GQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGF 204 G Y G P G+ T+ ++L +GY T GKWH+G KE P GF Sbjct: 82 GCYPNRIGFSGAPGPDSNYGVHPEEMTIAEVLKQKGYSTAIFGKWHLGSQKEFLPLQNGF 141 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 D++ G +DM W EV PD Y D + + G Sbjct: 142 DEYYGLPYSNDM---WPFHPQQGEVFNFPDLPTY---------DGNEIIG---------- 179 Query: 265 PKYMEDLDQRWMDYGVKFLDKMAKS-DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYG 323 Y D + DY + ++ + K+ +KPFFLY H + K+ G S + YG Sbjct: 180 --YNTDQTRLTTDYTTRSVNFIKKNKNKPFFLYLAHNMPHVPLAVSDKFKGKS-EQGLYG 236 Query: 324 DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKGSTWEG 381 D M+E++ ++K L + G DNTL++ TSDNGP H + R AK +T++G Sbjct: 237 DVMMEIDWSVGEIFKALRELGLEDNTLVILTSDNGPWTNYGNHAGSAGGLREAKATTFDG 296 Query: 382 GVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G RVP +YWKG P + + DL PT ++ P L P+ IDGV Sbjct: 297 GNRVPCIMYWKGKTLPGTTCNKLASNIDLLPTFAEITQAP------LPPRK--IDGVSIL 348 Query: 441 SFFLGTNGQSNRKA-EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG-YQGGFTG---- 494 G + R++ +Y+ L AV FK + P+ Y G Y+ G G Sbjct: 349 PLIEGKKDANPRESFVYYYRKNDLEAVTDGMFK----LVFPHKYVTYGAYEPGNDGQPGK 404 Query: 495 -TVMQTAGSSVFNLYTDPQESDSI 517 T ++ +++L DP E ++ Sbjct: 405 LTNLEIMKPEMYDLRRDPGERYNV 428 >UniRef50_Q7UG72 Arylsulfatase A [precursor] n=1 Tax=Rhodopirellula baltica RepID=Q7UG72_RHOBA Length = 503 Score = 149 bits (376), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 138/486 (28%), Positives = 211/486 (43%), Gaps = 60/486 (12%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS-AYSQPSSS 137 E G +PN+VV +DD+ + D+G G A G TP++D +A++G T + S S Sbjct: 26 EDIAGSRPNIVVIYMDDMAYADIGPFG---AKGYSTPNLDRMANEGRKFTDFSVSSAVCS 82 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGE 193 +R+ +LTG Y H + + G P GL TT ++ GY T GKWH+G Sbjct: 83 ASRSALLTGCY--HRRVGLSGALG-PQAKIGLAPAETTFAEVCKSAGYRTACHGKWHLGH 139 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD----- 248 + + P N GFD F G +DM+ L PD ++ P + Sbjct: 140 HPKFLPTNQGFDQFYGIPYSNDMW------------PLHPDTIRRQQKDPNDPGNWPPLP 187 Query: 249 -VHAVRGGEQQAIAD-ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 + ++ G + + D + P E + V+F+ K SDKPF LY H Sbjct: 188 IIESIAGQPPRIVNDNVQPADQEQMTVELTRRSVEFI-KNQSSDKPFLLYLPHPMVHVPL 246 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 Y + ++ G S A +GD M+E++ + +E Q NTL++FTSDNGP H Sbjct: 247 YVSERFRGKSGAGL-FGDVMMEVDWSVGEILSAIESIDQQKNTLVIFTSDNGPWLSYGNH 305 Query: 367 GRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAK 423 + P R KG+ WEGGVR PT ++W I + D+ PT ++L G + Sbjct: 306 AGSAAPLREGKGTQWEGGVREPTLMWWPETIPAGTTCETFCSTIDVLPTIVELTGGEAPE 365 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH-----YFLNGKLAAVRMDEFKYHVLIQ 478 IDG L G K+ H Y+ G+L +R + FK + Sbjct: 366 RK--------IDGHSIVDLMLDVPGA---KSPHESFVGYYGGGQLQTIRNERFK----LV 410 Query: 479 QPYAY-----TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 P+AY + G G G M +G +++L D E+ ++ H + LQ Sbjct: 411 FPHAYRTLGDREPGKDGMPDGYAMTKSGLELYDLDADVSETTNVIEAHPEVVKQLQAAAE 470 Query: 534 AYMEIL 539 Y + L Sbjct: 471 VYRQQL 476 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 149 bits (376), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 123/405 (30%), Positives = 182/405 (44%), Gaps = 56/405 (13%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPT 139 +T +KPNV++F DD+GW + G G PTP ID++A G+ T + + SP+ Sbjct: 36 QTSRKPNVIIFYADDLGWGETGIQGNPQI---PTPHIDSIAKNGVRCTQGFVAATYCSPS 92 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 RA +LTG+Y G + G TTL LH GY T +GKWH+G+ E +P Sbjct: 93 RAGLLTGRYPTRFGHEFNRIANVSGLDLQETTLADRLHGLGYKTACVGKWHLGDGPEYRP 152 Query: 200 QNVGFDDFRGFNSVSDMY--TEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 GFD+F G + + + T++ D V+ +VA D + Y ++K V + G+Q Sbjct: 153 TKRGFDEFFGTLANTPFFHPTKFVDSRVSNDVAEVSDENFYTTD-EYAKRSVEWI--GQQ 209 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY----- 312 Q P+FLY H KY Sbjct: 210 Q-------------------------------QSPWFLYLPFNAQHAPLQAPQKYLDRFE 238 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 + + P R + M M+D + + + GQ +NTL+ F SDNG + P R Sbjct: 239 SIADPKRKLFAAMMSAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQNGPLR 298 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K +T+EGG RVP V WKG + K+ D V D+ PT L AG + + Sbjct: 299 GFKMTTFEGGTRVPFLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAG-------SKIDPA 351 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEH-YFLNGKLAAVRMDEFKYHV 475 +DGVD +F T+ +N+ E Y+ G+ AVR ++K V Sbjct: 352 WKLDGVDLVPYF--TSSIANKPHETLYWRFGEQWAVRQGDWKLVV 394 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 125/420 (29%), Positives = 192/420 (45%), Gaps = 55/420 (13%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 ++PN++V L DD+G+ ++G G PTP ID++AS G+ T AY + P+ SP+RA Sbjct: 23 AEQPNIIVLLADDLGYGELGCQGNPQI---PTPHIDSLASHGIRFTQAYVTAPNCSPSRA 79 Query: 142 TILTGQYSIHHGILMPPM--------YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +LTG+ G P+ G P Q T+ + LHDQGY T IGKWH+G Sbjct: 80 GLLTGRIPTRFGYEFNPIGARNEDSGTGLPPDEQ---TIAERLHDQGYTTCLIGKWHLGG 136 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTE--WRDVHVNPEVALSPDRSE---------YIKQL 242 + P GFD+F GF + + V P R + Y + Sbjct: 137 TADYHPFRHGFDEFFGFMHEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHM 196 Query: 243 PFSKDDVHA----VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + + D A +RGG Q + + +Y+ D R V F+++ DKPFFLY Sbjct: 197 GYDEPDYDANNPIIRGG--QPVNET--EYLTDAFTR---EAVSFINR--HQDKPFFLYLA 247 Query: 299 TRGCHFDNYPNAK-----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 H K R + + M+ + K ++++G + TLIVF Sbjct: 248 YNAVHSPLQGKKKDIQHFTQIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVF 307 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPT 412 SDNG P RG KGS +EGG+RVP + W G + P+++ D V D+FPT Sbjct: 308 LSDNGGPTRELTSSNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPT 367 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 ++ LAG +P+ +DG + L + A+ ++ G+ AA+R ++K Sbjct: 368 SVALAGAS-------LPQN--LDGRNLLPLLLQQKTELP-VADFFWRQGRKAALRSGDWK 417 >UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG6_9BACT Length = 499 Score = 148 bits (374), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 128/474 (27%), Positives = 216/474 (45%), Gaps = 72/474 (15%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 PN + + DD G+ D+G G + TP+ID +A +G+ T Y++ SP RA+++T Sbjct: 23 PNFIFIMTDDQGYGDLGCYGHPII---KTPNIDKMADRGVRFTDFYARHKCSPARASLMT 79 Query: 146 GQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 G ++ G+ ++ P G ++ + T+P++L ++GY T IGKWH+G P++ G Sbjct: 80 GAFNFRVGVGSIVYPN-STTGLIKEVVTIPEMLKEKGYTTALIGKWHLGHTAGYLPRDQG 138 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 FD + G + H + + P + F+ +D A +G + I Sbjct: 139 FDYYFGVPGTN---------HGDAKTHKLPVAEGFKPSGEFTIEDYWADKGKGVHGNSTI 189 Query: 264 T---------PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 P + L +R+ V+++ + DKPFFLY+ H +A + G Sbjct: 190 LMKNDNVIEWPTDITQLTKRYTHDAVRYIKE--NKDKPFFLYFAHGTPHHPYTVDAAFRG 247 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG----PEAEVPPHGRT- 369 S YGD + E++ + K L++NG T+I FTSDNG P E G Sbjct: 248 KSD-HGLYGDMIEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKEHAEKGSNL 306 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P +G KGS+ EGGVRVP + W G + + +K++ I L D+FPT LAG + V Sbjct: 307 PLKGWKGSSEEGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAG-----IEPEV 361 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKA--EHYFLNG---KLAAVRMDEFKYHVLIQQPYAY 483 P+ +D + F + + K+ ++ F G K+ VR FKY Sbjct: 362 PQK-----IDGNNIFPIMMCEPDVKSPNKYIFYAGNTPKITGVRNHRFKY---------- 406 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 T S +++++ D E+ ++ ++ + LQ M A+ + Sbjct: 407 --------------STKTSGLYDMHADIGETTNVADKYPEVLQELQKAMEAFQK 446 >UniRef50_C1BQY6 Arylsulfatase A n=2 Tax=Caligus RepID=C1BQY6_9MAXI Length = 508 Score = 148 bits (373), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 118/417 (28%), Positives = 185/417 (44%), Gaps = 58/417 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYS-QPSSSPT 139 ++PNV++F+ DD+G+ D+ + G+P TP ID + ++ ++ ++ YS P SP+ Sbjct: 21 REPNVILFIADDLGYGDLS------SYGHPSSRTPHIDNLITKSMLFSNYYSASPLCSPS 74 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGL-----TTLPQLLHDQGYVTQAIGKWHMGEN 194 R+ + TG+Y + GI P P GL T + +L GY + IGKWH+G Sbjct: 75 RSALFTGEYPVRSGIY--PGVFFPNSTGGLDPNKKTIMKELRDRFGYKSALIGKWHLGVG 132 Query: 195 KESQ---PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK--DDV 249 K+ + + FD + G DM P + K SK D Sbjct: 133 KQGEYLPTASKNFDYYYGIPYAHDM---------------CPCHECFPKTECLSKCHDKF 177 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 E I + P + L QR+ D ++F+ ++ FFL Y H + Sbjct: 178 VPCPLFENTTIKE-QPANLVTLTQRYTDKAIQFIKN--NTENHFFLTYAFHQPHHPQFAG 234 Query: 310 AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT 369 ++ +S +R GD + EM+D + K ++K TLI+FTSDNGP G Sbjct: 235 LRFRNTS-SRGGIGDALSEMDDAVYQVIKAVKKLKLQSKTLIIFTSDNGPSLMREERGGC 293 Query: 370 P--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANL 427 FR KG+T+EGG+RVP F+ W G IQPR S ++ D++PT + + Sbjct: 294 AGLFRCGKGTTYEGGMRVPMFMSWGGFIQPRISHTLISALDIYPTLMSIIS--------- 344 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA------AVRMDEFKYHVLIQ 478 V DG D T L + + N + + G A+R ++K H Q Sbjct: 345 VQYKNSTDGFDFTRHILKMDDKYNPRETLMYFPGNAQRRLGPFAMRYHQYKAHFYTQ 401 >UniRef50_C6VTS4 Sulfatase n=47 Tax=cellular organisms RepID=C6VTS4_DYAFD Length = 520 Score = 148 bits (373), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 122/380 (32%), Positives = 182/380 (47%), Gaps = 53/380 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS-SPTRATI 143 KPN+V+ LDD+G+ DVG G A TP++D +A+ G+ T+ Y+ S+ +P+R + Sbjct: 36 KPNIVIVNLDDLGYGDVGAYG---ATALKTPNMDRIANGGIRFTNGYATSSTCTPSRFAL 92 Query: 144 LTGQYSIHH--GILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG------ 192 +TG Y + ++P P+ T+P++L GY T +GKWH+G Sbjct: 93 VTGVYPWRNKEAKILPGDAPLLID----TAQQTIPKVLKKAGYATAIVGKWHLGLGNGDT 148 Query: 193 -ENKESQP--QNVGFDDFRGFNSVSD----MYTE-WRDVHVNPEVALSPDRSEYIKQLPF 244 NKE +P +GFD + D +Y E R V ++P + + + P Sbjct: 149 DWNKEVKPGPNQLGFDYSYILAATQDRVPTVYIENTRVVGLDPNDPIRVSYKQNFEGEPT 208 Query: 245 SKDDVHAVR----GGEQQAIADITPK--YMEDLDQ-RWMDYGVK--FLDKMAK-----SD 290 KD+ ++ G Q+I + + YM+ + +W D + FL K + Sbjct: 209 GKDNPELLKMKWHHGHDQSIVNGISRIGYMKGGQKAKWNDEEMADLFLTKAQQFIKDHKS 268 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 KPFFLYY + H P+ ++ G + GD + E + L TLEK G L+NTL Sbjct: 269 KPFFLYYAMQQPHVPRTPHPRFKGVT-GMGPRGDAIAEADWCLGELLNTLEKEGILENTL 327 Query: 351 IVFTSDNGP-------EAEVPPHGR----TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK 399 I+FTSDNGP + V G+ P RG K S +E GVRVP YWKG I+P Sbjct: 328 IIFTSDNGPVVNDGYHDDAVEKLGKHKPAGPLRGGKYSLFEAGVRVPFITYWKGTIKPAV 387 Query: 400 SDGIVDLADLFPTALDLAGH 419 SD +V DL + L G Sbjct: 388 SDAVVCQLDLLSSLAHLTGQ 407 >UniRef50_B9NR18 Sulfatase family protein n=1 Tax=Rhodobacteraceae bacterium KLH11 RepID=B9NR18_9RHOB Length = 555 Score = 147 bits (372), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 130/482 (26%), Positives = 215/482 (44%), Gaps = 57/482 (11%) Query: 63 QHPAQDKETQQKLAEL-EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVA 121 Q D + LA + G+ PN++ L+DD+G+ D+G G TP+I+ + Sbjct: 41 QWAQDDAAVDEALAAFRDGNNGQPPNIISVLIDDMGFGDMGIPELNAVRGYDTPNINDFS 100 Query: 122 SQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQG 180 + L + Y++PS +PTR +TG+ + G+ + GL G TL ++L G Sbjct: 101 DEALRMVRMYTEPSCTPTRVAQMTGRLPVRMGMGDTTVDIAGFGLPGTEVTLAEVLKQAG 160 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS-------- 232 Y T +GKWHMG+ ES N GFD + T + D + +V+++ Sbjct: 161 YATSHVGKWHMGDIAESWAMNQGFDYAQHAVHQQGQLTIFNDDAIKEQVSVAIRDYDDKY 220 Query: 233 -------PDRSEYIKQLPFSKDD-VHAVR--GGEQQAIADITPKYMEDLDQRWMDYGVKF 282 PD S + + + +R GE+ A KY ++++Q + D ++ Sbjct: 221 TLDGWFRPDASAMMTVIEGETGGPIREIRMDAGERWNAA----KY-DEMNQAFQDKTLQE 275 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEK 342 L+++A D PFFL Y DN + S Y D M ++ +L++ L++ Sbjct: 276 LERLAGGDAPFFLQYWPM-IPLDNTRAGRDGPESANGGLYVDKMQLLDQWLGDLFERLDE 334 Query: 343 NGQLDNTLIVFTSDNGPEAEVPPH-GRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRK 399 G +NT++V DNG + P G TP + G KG T EGGVRV F+ W GMI+ Sbjct: 335 LGLSENTIVVVMGDNGHFTKYAPQSGFTPMIYAGGKGDTTEGGVRVDAFIRWPGMIE--- 391 Query: 400 SDG----IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 +DG I+ ++DL+ T AG +P+ +DGVDQ + L + +S + + Sbjct: 392 ADGLLNSIIHVSDLYTTLSRFAGADA-----FIPRDRVVDGVDQAAALLFAD-ESKARRD 445 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 H + + +P A + + G + F+LY D +E Sbjct: 446 HVIIYS---------------VAKPEAIVKDQLKLKLPGPGENAIVAKFFDLYRDTREEY 490 Query: 516 SI 517 S+ Sbjct: 491 SV 492 >UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=5 Tax=Hydra magnipapillata RepID=UPI0001927538 Length = 502 Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 119/394 (30%), Positives = 184/394 (46%), Gaps = 68/394 (17%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KP++++ + DD+GW D+ F+G PTP+ID +A+ G+IL + Y P +P+R+ I+ Sbjct: 19 KPHIIMIVADDLGWNDISFHGSNEI---PTPNIDRLANNGVILDNYYVLPICTPSRSAIM 75 Query: 145 TGQYSIHHGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMGE-NKESQPQ 200 TG+Y IH G+ ++G P G+ GL LPQ L QGY T +GKWH+G K+ P Sbjct: 76 TGRYPIHTGMQQDTIFGPNPYGV-GLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTPT 134 Query: 201 NVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD + G + D + H N E D + + FS+D ++ +A Sbjct: 135 YRGFDSYYGSYLGKGDYWN-----HSNTETYSGLDLHDNENGV-FSQDGNYSTEMYTAEA 188 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH---------------F 304 I+ I S +P FLY + H Sbjct: 189 ISCINNH---------------------NSSEPLFLYLAYQAVHSANTEEDPLQAPQEWI 227 Query: 305 DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA--- 361 D + K+ R Y + M+ ++ L + LDN++I+FT+DNG A Sbjct: 228 DKFSYIKHE----QRRKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGF 283 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPG 421 + P RG K + +EGGVR +FVY K + PR S ++ + D PT ++LA G Sbjct: 284 DYNWANNFPLRGVKATLFEGGVRGVSFVYSKLIESPRVSHELIHITDWLPTLVNLA---G 340 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 KV++ F+DG DQ + N QS+++ E Sbjct: 341 GKVSD-----GFLDGFDQWATL--QNKQSSQRNE 367 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 147 bits (370), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 142/462 (30%), Positives = 202/462 (43%), Gaps = 99/462 (21%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATI 143 KPN+++ L DD+G+ DVG++G PTP+ID +A++G+ ++ YS S PTRA + Sbjct: 19 KPNILIILADDLGYADVGYHG---LEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAAL 75 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQ-----------GLTTLPQLLHDQGYVTQAIGKWHMG 192 ++G Y G G GG + + TL Q + GY T GKWH+G Sbjct: 76 MSGVYQQRIGC-----EGICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLG 130 Query: 193 E----NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 +K P + GFD+F G + +Y D VN E +YI+Q Sbjct: 131 GERLFDKTLMPTSRGFDEFFGILEGASLY----DDTVNRE-------RKYIRQ------- 172 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 + D +Y D R V F+ + K DKPFFLY H Sbjct: 173 ---------DTVIDYEGEYFTDAIGR---EAVSFITR--KGDKPFFLYLPFTAVHAPMQA 218 Query: 309 NAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + KY + P R + + M+D ++ LE G LDNTLIVF SDNG + + Sbjct: 219 SEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKPDN 278 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYW-KGMIQPRKS-DGIVDLADLFPTALDLAGHPG 421 P +G K +EGG+RVP V W KG I K+ D V L D+FP+AL+ Sbjct: 279 NYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALE-----A 333 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 A++ VPK I+ G Q+ A + GK+ AVRM ++K Sbjct: 334 AQIT--VPKD--IEAKTILPLMQGKTNQTPHPAMFWKRAGKM-AVRMGDWK--------- 379 Query: 482 AYTQSGYQGGFTGTVMQTAG--SSVFNLYTDPQESDSIGVRH 521 + AG S +FNL D ES +I +H Sbjct: 380 ---------------LSNAGGPSELFNLKQDISESRNIIDQH 406 >UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01N83_SOLUE Length = 461 Score = 147 bits (370), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 125/465 (26%), Positives = 194/465 (41%), Gaps = 64/465 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 ++PN+VV L DD+G+ D+G G +A TP+ID +A +G TS YS P SP+RA Sbjct: 26 RQPNIVVILADDLGYGDLGCYGSPIA----TPNIDRLAEEGARFTSFYSASPVCSPSRAA 81 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 ++TG+Y + + G G T+ Q+L GY T IGKWH+G P N Sbjct: 82 LMTGRYPTRVEVPVVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGSTPGYLPTNR 141 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA--VRGGEQQAI 260 GFD+F G +P+S D +RG A Sbjct: 142 GFDEFFG--------------------------------VPYSADITPCPLMRGSSVVAP 169 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART 320 A Q +D+ + D PFFLY H + ++AG S Sbjct: 170 AVDCSTLTSSFTQEALDFMRR------AQDNPFFLYLAHTAPHLPLAASPRFAGQS-GLG 222 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWE 380 Y D + E++ + L+ G NTL++F+SDNGP + + RG KG T+E Sbjct: 223 MYADVVQELDWSTGQVMAALKATGLDSNTLVMFSSDNGPWYQ---GSQGKLRGRKGETYE 279 Query: 381 GGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 GG+R P + G+I G+ DL PT A++A + +DGVD Sbjct: 280 GGMREPFLARYPGVIPSGIGCAGLATTMDLLPTL--------ARLAGAQTPSNPLDGVDI 331 Query: 440 TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT 499 G + +R YF L R+ +K H+ A++ G + Sbjct: 332 WPVLTGERAEVDRDVFLYFDAVYLQCARLGRWKLHLSRYNTKAWSPLPPGGRVN---LPL 388 Query: 500 AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 ++++ +DPQES H + ++ + +++ +PP Sbjct: 389 PRPELYDVVSDPQESYDCAASHPAIVADIRARVE---RMVQTFPP 430 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 133/473 (28%), Positives = 204/473 (43%), Gaps = 70/473 (14%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPT 139 ++ +KPN++ L DD+G+ DVGFNG + TP++D +A G+ +AY + P P+ Sbjct: 35 ESNEKPNILFVLADDLGYNDVGFNG---STDIKTPNLDGLAKNGMTFDAAYVAHPFCGPS 91 Query: 140 RATILTGQY--SIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 RA I+TG+Y I +P G + Q + GY T A+GKWH+GE E Sbjct: 92 RAAIMTGRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGEASEY 151 Query: 198 QPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALS-PDRSEYIKQLPFSKDDVHAVRGG 255 P GFD+F GF + + E + N VA + + Y+ L + +V Sbjct: 152 HPNKHGFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVR----- 206 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------------ 303 E + I D + V F+DK A KPFFLY H Sbjct: 207 ETEYITDGLSR-----------EAVNFVDKAAAKKKPFFLYLAYNAPHVPLQAKEEDMAM 255 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 F + K R +Y + ++ + + L+KNGQ DNT+IVFTSDNG + Sbjct: 256 FSQIKDKK-------RRTYAGMVYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNGGKLGQ 308 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYW-KGMIQPRKSDGIVDLADLFPTALDLAGHPGA 422 + P + KGS EGG R P V+W K M + V DL+PT AG GA Sbjct: 309 GAN-NYPLKEGKGSVQEGGFRTPMLVHWPKHMKAGSRFSHPVLALDLYPT---FAGLGGA 364 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYA 482 ++P+ +DG D + + Q+N A DEF Y + + Y+ Sbjct: 365 ----VLPEDKKLDGKD-----IWADIQAN------------TAPHKDEFIYVLRHRNGYS 403 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 + + F ++N+ D E + I +H + + + M ++ Sbjct: 404 -DAAARRNQFKAVKNHNDDWKLYNIAQDISEDNDISAQHPDILRDMVSSMESW 455 >UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT Length = 472 Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 139/481 (28%), Positives = 224/481 (46%), Gaps = 63/481 (13%) Query: 85 KPNVVVFLLDDVGWMDVG-FNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 KPN ++ DD G+ D+ FN GV TP ID +A++G+ + Y S S +RA Sbjct: 21 KPNFIIIFTDDQGYGDLSCFNPQGVQ----TPHIDQMATEGMKFNNFYVSAAVCSASRAA 76 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +LTG Y+ GI PG QGL T+ +LL +Q Y T GKWH+G+ Sbjct: 77 LLTGTYNDRIGIKSAFF---PGTKQGLHPDEITIAELLKEQNYATACFGKWHLGDEPSLL 133 Query: 199 PQNVGFDDFRGFNSVSDMY-----TEWRDVHVNPEVAL--SPDRSEYI-----KQLPFSK 246 P GFD + G +DM+ T + N + L + + ++I K+ P K Sbjct: 134 PSAQGFDTYFGIPYSNDMFIAPHQTFAENAKFNGDWTLEKAKELQKFIAPHVNKRGPIWK 193 Query: 247 DDVHA---VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 + A + GEQ I + P L QR+ D +KF+DK +KPFF++ H Sbjct: 194 SEYKALVPILEGEQ--IVEF-PADQASLTQRYFDRTIKFIDK--NQNKPFFIFLTPAMPH 248 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + + ++ G S + YGD + E++ L K L++ NTL++FTSDNGP Sbjct: 249 VPLFASKEFRGKS-KKGLYGDVIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNGPWLSY 307 Query: 364 PPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHP 420 G + P R K +++EGGVR+PT + G+I+ + + DL PT L Sbjct: 308 GDEGGSSGPLRDGKFTSYEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTFAQL---- 363 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 V VP+ IDG D + N +R H F + AVR ++K +++++ Sbjct: 364 ---VNTQVPQDRKIDGKDISPLLKSQNHVIHR---HLFFRDE--AVRSGDWK--LVVKEH 413 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + + G +++NL D ES+++ H + LQ+++ +++ L Sbjct: 414 HMTMRKG------------PLPALYNLKNDVAESNNLIDTHPKVAQYLQSKLDEHLKDLN 461 Query: 541 K 541 + Sbjct: 462 E 462 >UniRef50_UPI000179252A PREDICTED: similar to arylsulfatase b n=3 Tax=Acyrthosiphon pisum RepID=UPI000179252A Length = 599 Score = 146 bits (368), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 114/380 (30%), Positives = 178/380 (46%), Gaps = 58/380 (15%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 K+P++++ + DD+GW DVGF+G ++ PTP+IDA+A G+IL Y QP+ +P+RA + Sbjct: 32 KQPHIILIVADDLGWNDVGFHG---SIQIPTPNIDALAYNGVILNRHYVQPTCTPSRAAL 88 Query: 144 LTGQYSIHHGIL-MPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENK-ESQPQ 200 LTG+Y I +G+ P + G P L LPQ L D GY T +GKWH+G NK + P Sbjct: 89 LTGKYPIRYGLQGFPIIAGVPLALPLNEKILPQYLKDLGYSTHLVGKWHLGANKNQHTPI 148 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD G+ W +S S + L KD R G ++A Sbjct: 149 KRGFDSHFGY---------WNGF-------ISYRNSTHSTGLMVGKD----ARRGFERAG 188 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL-------YYGTRGCHFDNYPNAKYA 313 ++ +Y D+ + D K + DKP FL + G G + N + Sbjct: 189 DEMVDRYATDI---FTDEANKVIKLCKNHDKPMFLMVSHLAVHTGVPGPNILEVSNKTHN 245 Query: 314 G------SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 + R Y + +++ ++ ++L+ NG L++++++F SDNG A+ P G Sbjct: 246 DIRFDYIENKERRLYAGMLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDPIWG 305 Query: 368 ------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLFPTALDLAGH 419 P RG KG+ +GGVR + W ++ R + + + D PT AG Sbjct: 306 YGNSGSNWPLRGEKGAVLDGGVRGVAAI-WSPWLKKKHRIFENLFHITDWLPTLYTAAGG 364 Query: 420 PGAKVANLVPKTTFIDGVDQ 439 + IDGVDQ Sbjct: 365 DFEDLGQ-------IDGVDQ 377 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 146 bits (368), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 130/480 (27%), Positives = 200/480 (41%), Gaps = 94/480 (19%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTR 140 T ++PNV+V L DD+ VG GG TP++D AS+ + + AYS +P R Sbjct: 52 TSRRPNVIVILADDLA---VGDLAGGDGSPTRTPNLDRFASESIQFSQAYSGSCVCAPAR 108 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQ---GLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 A +LTG+Y G++ M P + TT+ +L D GY T +GKWH G Sbjct: 109 AALLTGRYPHRTGVVTLNMNRYPEMTRLRRDETTIADVLKDAGYATGLVGKWHTGRGDGF 168 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR--GG 255 P + GFD+F GF F DDV R Sbjct: 169 HPLDRGFDEFEGF---------------------------------FGSDDVGYFRYPFS 195 Query: 256 EQQAIADITPKYM-EDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH----------- 303 EQ+ I+D+ Y+ +DL++R ++F+ + + PFFL+ H Sbjct: 196 EQRQISDVDESYLTDDLNRR----AIEFVRR--HHEHPFFLHLAHYAPHRPLEAPPEVIA 249 Query: 304 ------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 FD YA G+ + E++D+ G ++T+++F SDN Sbjct: 250 RYREQGFDESTATIYAMIEVMDRGIGELLAEIDDL-----------GLSEDTIVLFASDN 298 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLA 417 GP+ RG K EGG+RVP FV W + P + D +V DL PT LDL Sbjct: 299 GPDPLTGERFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDLMPTILDLC 358 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNG--QSNRKAEHYFLNGKLAAVRMDEFKYHV 475 + N + +F+ ++ S T Q NR + +Y N AAVR +K Sbjct: 359 -RVDVSMLNRLDGESFVPVLEDASIAHSTMRFWQWNRASPNYTHN---AAVRHGRYK--- 411 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 + +PY + + T S +F+L DP ES + ++ + + E+ + Sbjct: 412 -LVRPYVTRGAKLKDS-------TEPSVLFDLQNDPTESRDVSKQYPDIAERMSRELDRW 463 >UniRef50_B7RWW8 Sulfatase, putative n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RWW8_9GAMM Length = 486 Score = 145 bits (367), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 141/491 (28%), Positives = 210/491 (42%), Gaps = 97/491 (19%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP----TPDIDAVASQGLILT 128 + LA PN++ L DD+G+ D+G NP TP+IDA+A+ GL+L+ Sbjct: 48 EHLASSATSVNTAPNILFILYDDMGYGDIG-----AGETNPDVIATPNIDALAAAGLVLS 102 Query: 129 SAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLT----------------- 170 +S P +P+RA LTG+ + G +P + G + Sbjct: 103 DFHSPAPVCTPSRAGYLTGRLAPRAG--LPDVVFPSGSTKAFISSLLLKSGSPVRLPAEE 160 Query: 171 -TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 T+ ++L GY T +GKWH+G+++ S P ++GF+ + G +DM Sbjct: 161 ITVAEVLRAAGYRTGMVGKWHLGDSRPSLPNDLGFEHYYGALYSNDM----------EPF 210 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 AL +R + ++P D + L +R+ + + FL S Sbjct: 211 ALYRNR---VVEVPAPVDQSY--------------------LSERYTEEALAFL---RAS 244 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 D+ FFLY+ H + G+S YGD + E++D L + L +G+LDNT Sbjct: 245 DERFFLYFAHNFPHDPLHSRDGRLGTSDGGL-YGDVLEEIDDGVGILVEELRYSGKLDNT 303 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLAD 408 LI+ TSDNGP + RG KG+T+EGG+RVP +W I Q R + D Sbjct: 304 LIIITSDNGPWFLGNAGDQ---RGRKGNTFEGGMRVPFIAHWPAEIPQGRSEPAMAMGID 360 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 L PT LD+ P P +DG L S + HY+ L AVR Sbjct: 361 LLPTVLDILALPA-------PNDRILDGRSMLP-TLTKGAASPHQYLHYYDGETLFAVRD 412 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGT-VMQTAGSS-----VFNLYTDPQESDSIGVRHI 522 FKY G G F GT M +GS +F+L +DP ES + RH Sbjct: 413 QRFKYR------------GPAGVFYGTDQMLISGSIPQKEWLFDLQSDPGESYDVSARHP 460 Query: 523 PMGVPLQTEMH 533 L+ E + Sbjct: 461 QKLAELRAEFN 471 >UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZGR2_BRAFL Length = 598 Score = 145 bits (367), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 111/358 (31%), Positives = 177/358 (49%), Gaps = 47/358 (13%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 +++++ + KPN+V L DD GW D+G++G + TP++D +A++G+ L + Y QP Sbjct: 112 SDIQESSSGKPNIVFILADDYGWNDIGYHGSVIR----TPNLDRLAAEGVKLENYYVQPL 167 Query: 136 SSPTRATILTGQYSI----HHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWH 190 SP+R ++TG+Y I H ++ PP QP GL TLPQ L + GY T +GKWH Sbjct: 168 CSPSRCQLMTGRYQIRYGLQHSLIWPP---QPSGLPLDEVTLPQRLKEGGYSTHIVGKWH 224 Query: 191 MGENKES-QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 +G K+ P + GFD F G+ + ++ Y R + P + + L Sbjct: 225 LGFYKQDYTPTHRGFDTFYGYLTGAEDYWTHR------QKGGLPGQPQTWSGLDLRD--- 275 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDK--PFFLYYGTRGCH---- 303 + + + D Y L + K ++ +A+ DK P FL+ + H Sbjct: 276 ------QNRPVTDQNGTYSTHL------FANKAIEIIAQQDKNKPMFLFLSFQAVHDPLQ 323 Query: 304 FDNYPNAKYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 ++Y+ S R Y M+ N+ + L++ G DNT+++F++DNG Sbjct: 324 APEEDISRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNG--GR 381 Query: 363 VPPHG-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLFPTALDLA 417 V G P RG KGS WEGGVR FV +I+ R SD ++ ++D FPT + LA Sbjct: 382 VDRGGINWPLRGWKGSLWEGGVRGVGFVN-SPLIKAKGRTSDALIHISDWFPTLVGLA 438 >UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UHK0_RHOBA Length = 478 Score = 145 bits (365), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 126/451 (27%), Positives = 202/451 (44%), Gaps = 65/451 (14%) Query: 84 KKPNVVVFLLDDVGWMDVG-FNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS--SPTR 140 + PN V+ DD+G+ D+ ++ GV TP +D +A++G + + P++ SP+R Sbjct: 41 RPPNFVLIFADDLGYGDISCYDSSGVK----TPHLDQLAAEGF-RSKDFFVPANVCSPSR 95 Query: 141 ATILTGQYSIHHGILMP-----PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 A +LTG+Y + G+ + Y G T+P+LL GY + +GKWH+G Sbjct: 96 AALLTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMEL 155 Query: 196 E-SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E S P + GFD++ G S P R + + + ++ + Sbjct: 156 EGSHPLDAGFDEYLGIPS-----------------NYEPRRGK-------NHNTLYRGKQ 191 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 EQ+ +A E+L +R+ D + F+++ + D PFF+Y H P+ + G Sbjct: 192 VEQKNVA------CEELTKRYTDEVIDFIER--QKDDPFFIYVSHHIVHNPLKPSPDFVG 243 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGA 374 +S + YGD + E++ + +T+ G +NTL++FTSDNGP G Sbjct: 244 TS-EKGKYGDFIKELDHSTGRIMQTIRDAGLDENTLVIFTSDNGPTRN---GSSGELSGG 299 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 K T EGG RVP W I P + SD + DL P +LAG P +P Sbjct: 300 KYCTMEGGHRVPGMFRWTSKIAPNQVSDVTLTSMDLLPLFCELAGVP-------IPDDRQ 352 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV---LIQQPYAYTQSGYQG 490 IDG LG +S + +Y+ L AVR ++K H+ QP+ + Sbjct: 353 IDGKSILPVLLGQTSESPHQFLYYYNGTNLQAVREGKWKLHLPRTTDDQPFWSKKPDKTK 412 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 GF + +FNL D E ++ RH Sbjct: 413 GF----VTLNEMRLFNLDRDLGEKKNVADRH 439 >UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q6_9PLAN Length = 574 Score = 145 bits (365), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 123/449 (27%), Positives = 196/449 (43%), Gaps = 61/449 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PNV+V L DD G+ DVGF G + TP +D +A + + LT Y P +PTRA++L Sbjct: 34 RPNVIVILTDDQGYGDVGFRGN---LKINTPHLDRMAEKSIELTRFYCSPVCAPTRASLL 90 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 TG+ G++ G +QG T+ +LL GY T GKWH+G+N +PQ+ G Sbjct: 91 TGRNYYRTGVIHTSRGG--AKMQGEEVTVAELLQQAGYQTGIFGKWHLGDNYPMRPQDQG 148 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 F + +H + + SPD+ K+ +A Sbjct: 149 FAE--------------SLIHKSGGIGQSPDQPNSYFHPKLWKN-----------GVAFQ 183 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DNY--PNAKYAGSS 316 + Y D+ + D + F+D+ K++KPFF+Y T H ++Y P + Sbjct: 184 STGYCTDV---FFDAALDFIDRQTKTEKPFFVYLATNAPHTPLEIAESYWKPYQRQGLDE 240 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKG 376 YG + +++ L LE++ + T+++F DNGP+ + G RG K Sbjct: 241 TTARVYG-MITNLDENIGKLLSHLERSALAEKTVVLFLGDNGPQQKRYTGG---LRGRKS 296 Query: 377 STWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 T+EGG+RVP W G + K D I DL PT L L P++ +D Sbjct: 297 WTYEGGIRVPCLAQWPGHFREGEKIDQIAAHIDLMPTLLAL-------TETRCPESLKLD 349 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 GVD + G + ++ + ++ L R Y V+ ++ + +GY G F Sbjct: 350 GVDLSPLLTGRKEKLPARSLFFQVHRGLTPQRYQ--NYAVVTER---FKLAGYPGTFGTE 404 Query: 496 VMQTAGSSVFNLY---TDPQESDSIGVRH 521 + V Y TDP E ++ H Sbjct: 405 NLLLQAEPVLEFYDLSTDPGEQKNVLHSH 433 >UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V8P8_BACUN Length = 525 Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 119/373 (31%), Positives = 169/373 (45%), Gaps = 65/373 (17%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-S 136 +++ ++PN+V+ + DD+GW DVG+ G AV TP+IDA+A +G+ + Y S S Sbjct: 24 VQRDKSQRPNIVLVIADDMGWGDVGYQG---AVDVSTPNIDALARRGVQFSQGYVSCSIS 80 Query: 137 SPTRATILTGQYSIHHGIL--MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 P+RA ILTG Y G + P P +G +TL +++ D GY T +GKWHM ++ Sbjct: 81 GPSRAGILTGVYQQRFGFYNNLHPWAKIP---EGQSTLGEMVRDCGYATGFVGKWHMADS 137 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E P GFD F GF W D H P Y D R Sbjct: 138 PEQSPNRRGFDQFYGF---------WSDTHDYYRSTDKPGVELY--------DFCPLYRN 180 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DNYPN 309 GE Q + +Y+ D R V+F+DK A S PF L H ++Y N Sbjct: 181 GEIQPPLHESGEYITDCFTR---EAVEFIDKHASS--PFLLCLSYNAVHSPWQVPEHYVN 235 Query: 310 AKYAGSS---PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEA---- 361 + G R + ++ ++D + ++L KNG +NTL + SDNG P Sbjct: 236 -RLEGRRFHHEDRKVFAAMVLALDDGIGRVMESLRKNGLEENTLFILISDNGSPRGQGIE 294 Query: 362 -----EVPPHGRT------PFRGAKGSTWEGGVRVPTFVYW-----KGMIQPRKSDGIVD 405 E G T PFRG K T+EGG+RVP + W +GM+ D V Sbjct: 295 CSTGYEYKDRGNTTMSSPGPFRGYKADTYEGGIRVPYIMSWPSELPQGMVY----DNPVI 350 Query: 406 LADLFPTALDLAG 418 D+FPT + G Sbjct: 351 SLDIFPTVMQAVG 363 >UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C581_BEUC1 Length = 458 Score = 144 bits (364), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 121/403 (30%), Positives = 183/403 (45%), Gaps = 51/403 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 ++PN+V+ DD+G+ D+G G ++ N TP +D +A++G+ LT Y + P SP+R Sbjct: 3 QRPNIVLINADDLGYGDLGCYG---SMRNDTPHLDRLAAEGVRLTDFYMASPVCSPSRGG 59 Query: 143 ILTGQYSIHHG--------ILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE 193 +LTG Y G +L P G P GL T+ ++L D GY T AIGKWH G+ Sbjct: 60 MLTGCYPPRIGFGEFVGRPVLFP---GDPVGLDPAERTMARVLGDAGYATAAIGKWHCGD 116 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 E P GFD + G +DM + R+ P + L S +++ P + Sbjct: 117 QPEFLPTRHGFDSYFGIPFSNDMGRQ-REHEDWPPLPLMSGES-VVQEQPDQRS------ 168 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 L +R+ +F+++ A +PFFLY H + A + Sbjct: 169 -----------------LTERYTVAATRFIEENAH--QPFFLYLAHMYVHVPLFVPAPFL 209 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 +S YG + ++ + TL + G +NT++VFTSDNG A P RG Sbjct: 210 AAS-RNGGYGGAVAALDWSTGVVMDTLRRLGLEENTIVVFTSDNGSRARGEGGSNDPLRG 268 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K TWEGG RV V W I D + DL PT + A A+ Sbjct: 269 HKAQTWEGGQRVACVVRWPAAIPAGGVCDAVTRSIDLLPTFAAV-----AGAADWADPAR 323 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 +DGVD T+ G N +Y+++ L AVR+ ++K H+ Sbjct: 324 PVDGVDLTALLTGAGPAPNETFAYYYMD-DLEAVRVGDWKLHL 365 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 144 bits (364), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 116/414 (28%), Positives = 177/414 (42%), Gaps = 70/414 (16%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 ++++T + PNV+V L DD GW D G TP D + +G +L Y P + Sbjct: 32 VQEQTQRPPNVIVILTDDQGWGDFSHTGNEYL---KTPHFDKMTEEGALLDQFYVSPVCA 88 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 PTRA++LTG+Y + G+ G+ T+ ++ + GY T GKWH G + Sbjct: 89 PTRASVLTGRYHLRTGVSFVTR-GRENMRSEEVTIAEVFKEAGYATGCFGKWHNGAHYPE 147 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 PQ GFD F GF S W S Y D GE Sbjct: 148 NPQGQGFDTFLGFTS-----GHW---------------SNYF--------DTELEYNGEM 179 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY---------YGTRGCHFDNYP 308 ++ T ++ D+ MD ++F+D A D+PF + Y +FD Y Sbjct: 180 KS----TKGFITDV---LMDETIQFID--AHKDEPFLAFVPLNAPHTPYQVPDKYFDKYK 230 Query: 309 NAKYA----GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 + + + T YG C ++D L K L+ +NT++VF SDNGP+ Sbjct: 231 DIDFGYDKKQNKKIATIYGMCE-NIDDNLGKLMKHLKDQELEENTIVVFLSDNGPQG--- 286 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 P+RG K S EGG VP + WKG I + DL PT + LAG Sbjct: 287 ARYNGPWRGGKTSVHEGGTLVPCAIQWKGHIPNSSKSSLTAHIDLMPTLMGLAGIEK--- 343 Query: 425 ANLVPKTTFIDGVDQTSFFLGTN---GQSN--RKAEHYFLNGKLAAVRMDEFKY 473 P+ DG+D +++ +GT+ G+ N ++ + AVR ++++ Sbjct: 344 ----PENIQFDGIDLSNYLMGTSDDLGERNLYTHMTNFEITADRGAVRQGDYRF 393 >UniRef50_B8KKX3 Arylsulfatase B n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KKX3_9GAMM Length = 507 Score = 144 bits (364), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 133/425 (31%), Positives = 180/425 (42%), Gaps = 94/425 (22%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PNVV+ L DD+GW DVG++G + TP ID +A++GL L Y+Q + SPTRA +L Sbjct: 40 RPNVVIILADDMGWNDVGYHGSDIH----TPHIDQLAAEGLELDRFYAQTACSPTRAALL 95 Query: 145 TGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202 +GQ S GI P P GL +P D GY T +GKWH+G E +P Sbjct: 96 SGQSSQSLGIYSPLSKLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLAR 155 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS-EYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD F G +++ W VH L R+ + ++Q +S A Sbjct: 156 GFDHFYG--NLTGGVGYWNHVHGG---GLDWQRNGKTLRQEGYST----------HLQSA 200 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AKYAG-SS 316 +IT L + +KP FLY H N A+YA + Sbjct: 201 EITR-----------------LIQQRDPEKPLFLYAAFNAPHLPNEAPADTLARYAHIEN 243 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-------PEAEVPPHGR- 368 P R + + E++ L +TL G L+NTLI F SDNG P V R Sbjct: 244 PNRRIHAAMVTELDSAIGQLMETLSTEGMLENTLIWFMSDNGGLNRTAMPSGLVSMSQRL 303 Query: 369 ---------------------------TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD 401 +P R K S +EGG RVP+FVYWKG + P + Sbjct: 304 EDWFGKPLFPKTLEFIRTNALDGGSDNSPHRKGKQSIYEGGARVPSFVYWKGRLSPERIT 363 Query: 402 GIVDLADLFPTAL-----DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH 456 +V + D+ PT L D G P A TT GVDQ + G +A Sbjct: 364 QMVTVKDVLPTLLSATDIDANGLPTA-------TTTESAGVDQ---WPGLTRGEFIQAPD 413 Query: 457 YFLNG 461 Y +NG Sbjct: 414 YLING 418 >UniRef50_A0PKV5 Arylsulfatase, AslA n=5 Tax=Bacteria RepID=A0PKV5_MYCUA Length = 516 Score = 144 bits (362), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 133/504 (26%), Positives = 214/504 (42%), Gaps = 56/504 (11%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN++V DD+G ++ G+ +G TP ID +A++G+ T AY + S + RA + Sbjct: 5 KPNILVIWGDDIGITNLSCYSDGL-MGYRTPHIDRIANEGMRFTDAYGEQSCTAGRAAFI 63 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 +GQ G+ M G G T+ LL GY T GK H+G+ + P G Sbjct: 64 SGQSVYRTGMSKVGMPDSDIGWSGQDPTIADLLKPLGYATGQFGKNHLGDRNKHLPTVHG 123 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSP--DRSEYIKQLPFSKDDVHAVRGGE----- 256 FD+F F ++ + E PE+ P DR + +L + + E Sbjct: 124 FDEF--FGNLCHLNAE-----EEPELPDYPKSDRFPVLAELNRPRGVLRCWATEEVSDEP 176 Query: 257 ---------QQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 +Q I D +T + ME +D + V F+ + A +D PFF++ H Sbjct: 177 DDPKYGPVGKQRIVDTGPLTKQRMETIDDETTEACVDFIKRQAAADTPFFVWMNMTHMHL 236 Query: 305 DNYPNAKYAGSSPARTS-YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + + G + S Y D M++ + L L++ G +T++++++DNGP A Sbjct: 237 RTHTKPESVGQAGVWQSPYHDTMIDHDRNVGQLLDALDELGIAQDTIVIYSTDNGPHANT 296 Query: 364 PPHG-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHP- 420 P G TPFR K + WEG +R+P + W G I S+ I+ D PT L AG P Sbjct: 297 WPDGATTPFRSEKNTNWEGALRIPEMIRWPGKISAGVVSNEIIQHHDRLPTFLAAAGEPD 356 Query: 421 -------GAKVANLVPKTTF---IDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAVRMD 469 G K++ F +D + + G +S R+ YF + + +R Sbjct: 357 IIEKLKKGHKISVRGDDKEFKVHLDAFNLLPYLTGEVEESPRQGFIYFSDDCDVLGIRFH 416 Query: 470 EFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG--------VRH 521 +K V +Q T + F + +FNL TDP E I + H Sbjct: 417 NWKI-VFQEQRCQGTLQVWAEPFIPLRV----PKIFNLRTDPYERADITSNTYYDWFLDH 471 Query: 522 IPMGVPLQTEMHAYMEILKKYPPR 545 + ++E K++PPR Sbjct: 472 DFIAFYGTAICTQFLETFKEFPPR 495 >UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME Length = 562 Score = 143 bits (361), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 113/368 (30%), Positives = 168/368 (45%), Gaps = 61/368 (16%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 AE+EK K PN++ L DD+G+ DVGF+G PTP+IDA+A G+IL Y P Sbjct: 17 AEVEKSPAK-PNIIFILADDLGFNDVGFHGSAEI---PTPNIDALAYSGIILNRYYVAPI 72 Query: 136 SSPTRATILTGQYSIHHGILMPPMY-GQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +P+R+ ++TG+Y IH G+ +Y +P GL LPQ L++ GY + GKWH+G Sbjct: 73 CTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGH 132 Query: 194 NK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 K + P GF GF S Y + V N + L + Sbjct: 133 WKLKYTPLYRGFSSHVGFWSGHQDYNDHTAVE-NNQWGLD-------------------M 172 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN------ 306 R G Q A D+ Y D+ D+ VK + + P FLY CH N Sbjct: 173 RNGTQVAY-DLHGHYTTDVI---TDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLP 228 Query: 307 -----------YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 PN K R + + +M++ + L K+ L+N++I+F+S Sbjct: 229 VPDNDVIKMSHIPNYK-------RRKFAAMVSKMDNSVGQIVDQLRKSNMLENSIIIFSS 281 Query: 356 DNGPEAE---VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLF 410 DNG A+ + P +G K + WEGGVR + W +++ R S+ + + D Sbjct: 282 DNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLM-WSPLLKKSQRVSNQTMHIIDWL 340 Query: 411 PTALDLAG 418 PT L+ AG Sbjct: 341 PTLLEAAG 348 >UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSP6_9BACT Length = 512 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 139/490 (28%), Positives = 217/490 (44%), Gaps = 80/490 (16%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 K+PN+++ DD+G+ DVG++G + TP+ID++A QG+ + Y S P+RA Sbjct: 19 KQPNIILIFADDMGYDDVGYHGNKRII---TPNIDSIAEQGVQFSQGYVSASVCGPSRAG 75 Query: 143 ILTGQYSIHHGI------------LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +LTG Y G + PM G P Q + + + L GY IGKWH Sbjct: 76 LLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLP---QSQSMISEELKTLGYTNGMIGKWH 132 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 MG + +P G+D F GF + S YTEW + R+E ++ P +K Sbjct: 133 MGFDMSLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEME--PANK--AQ 188 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 + +++ + + Y+ DL + D V F+D+ A DKPFFLY H + Sbjct: 189 YIDVFKEKGVKVVDENYLTDL---FTDEAVNFIDRNA--DKPFFLYLAYNAVHHP-WQTT 242 Query: 311 KYAGSSPARTS-------YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAE 362 ++A A + + M++ + K L++ DNT+I+F SDNG P+ + Sbjct: 243 QHALDKTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQ 302 Query: 363 VPPHG-RTP--------------FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDL 406 H + P FRG KG T+EGG+RVP + W IQ K D + Sbjct: 303 GIEHSPKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISA 362 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTF-IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAA 465 DL PT + AG K P+ F DGVD +L + + R ++ A Sbjct: 363 LDLQPTLVKAAGGNDKK-----PQKGFAYDGVDILP-YLKEDKEIKRSL--FWRRDTDYA 414 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 +R ++K +Q A+ G T T +FN+ DP+E ++ +H + Sbjct: 415 IRKGDWK----LQWNDAH------GPLTIT--------LFNIKEDPEERSNLIKQHPELA 456 Query: 526 VPLQTEMHAY 535 LQ E + Sbjct: 457 QQLQNEFDTW 466 >UniRef50_C9KTC2 Arylsulphatase A n=5 Tax=Bacteroides RepID=C9KTC2_9BACE Length = 501 Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 113/379 (29%), Positives = 173/379 (45%), Gaps = 71/379 (18%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP-----TPDIDAVASQGLILTSAYSQPS-S 136 + PNV+ L DD+G+ G ++ NP TP+ID + G+ T A+S + S Sbjct: 19 AQSPNVIFILADDLGY-------GDISAFNPESKIHTPNIDNLTHSGISFTDAHSSSALS 71 Query: 137 SPTRATILTGQY----SIHHGIL--MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +P+R +I+TG+Y ++ G+L P P T+ Q+ + GY T IGKWH Sbjct: 72 TPSRYSIITGRYPWRTTMKSGVLNGFSPAMITPD----RRTIAQMFSENGYNTACIGKWH 127 Query: 191 MG---------ENKE---------SQPQNVGFDDFRG----FNSVSDMYTEWRDVHVNPE 228 +G +NK+ + P + GFD F G + +Y E V P Sbjct: 128 LGWDWAYPQNAKNKQDVDFSLPIKNGPTDRGFDYFYGIPASLGTAPHVYVENDKVTALPN 187 Query: 229 VALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK 288 + P + + +R G A AD P +D + +G+ +++K Sbjct: 188 RTIGPQKG------------IKLIRNG--VAGADFEP---QDCLPNIIRHGIDYINKQRD 230 Query: 289 SDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 S KPFFLY H P KY G + YGD +V ++D+ + KTL+KN QL+N Sbjct: 231 SKKPFFLYLPITAPHTPVLPAEKYQGQT-IIGDYGDFVVMIDDMVQQIVKTLKKNNQLEN 289 Query: 349 TLIVFTSDNGPE-----AEVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQPRKS 400 T+I+FTSDNG E+ G P +RG K +EGG R+P V W+G + Sbjct: 290 TIIIFTSDNGCAPYIGVEEMENKGHHPSYIYRGYKNDIYEGGHRIPLIVSWQGKYTNETN 349 Query: 401 DGIVDLADLFPTALDLAGH 419 +V L D + T + + Sbjct: 350 GSLVSLTDFYATFAQMVNY 368 >UniRef50_UPI000180C68F PREDICTED: similar to arylsulfatase, partial n=1 Tax=Ciona intestinalis RepID=UPI000180C68F Length = 532 Score = 142 bits (358), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 138/506 (27%), Positives = 214/506 (42%), Gaps = 93/506 (18%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD---IDAVASQGLILTS 129 Q+++ +K K PN +V + DD+G+ D + G+PT + +D + +G+ T Sbjct: 20 QQVSAYHRK--KSPNFIVIMADDIGYGDFQ------SFGHPTQEYGGVDRMVKEGMRFTQ 71 Query: 130 AYSQPS-SSPTRATILTGQYSIHHGIL--MPPMYGQPGGLQGL----TTLPQLLHDQGYV 182 S + SP+RA +LTG+Y+I G+ + P++ QP + GL T+ + L GY Sbjct: 72 WTSAATLCSPSRAALLTGRYAIRSGLRGDVAPVF-QPQSVGGLPRKEITIAESLKALGYR 130 Query: 183 TQAIGKWHMGENKESQ------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 T +GKWH+G N+ + P N GFD F G N L S Sbjct: 131 TGLVGKWHLGINRNTSTDGYHLPHNHGFD-FVGTN-------------------LPLSHS 170 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 E F+ +++ + + P + L R F+ + FFLY Sbjct: 171 EMCNPAEFTVEELSTMCFLYNGSTIVEQPVNLSTLTDRITSDAKNFISNNRLNS--FFLY 228 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 + H + ++ G S R YGD + EM+ +++ L + DNTL++F SD Sbjct: 229 FSPPQAHRALFCAERFCGRS-KRGPYGDTINEMSSAISDILDHLVQLEIDDNTLVIFLSD 287 Query: 357 NGPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTA 413 +GP ++ P G P F+ KG+TWEGG+RVP +W G+I S+ +V D+ PT Sbjct: 288 HGPNSDKCPDGGVPGLFKAGKGTTWEGGLRVPAVAWWPGVIPAGTVSNAVVSTLDVHPTL 347 Query: 414 LDLAGHPG------------------AKVANLVP-KTTFIDGV-----------DQTSFF 443 L +A AK N P + DG+ +TS Sbjct: 348 LKIAALRSLPCFKRCTSSATPSSNLYAKAENQKPIPSKLFDGIPIPDLICSMKHQRTSSC 407 Query: 444 LGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS 503 L T SNR HY + AVR + K+H P +S T +++T Sbjct: 408 LST--PSNRILFHY-CGEDILAVRYGDLKFH-FKSNPPLQRRSNCVRTVTADLIRTFSCG 463 Query: 504 --------VFNLYTDPQESDSIGVRH 521 VFNL DP E + + H Sbjct: 464 KRTHDPPLVFNLLIDPSEEIPLNISH 489 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 142 bits (357), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 117/357 (32%), Positives = 155/357 (43%), Gaps = 62/357 (17%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+V L+DD+G D G G A TP ID +A+QG+ T AY+ P SPTRA+++ Sbjct: 31 KPNIVFILIDDMGCKDAGCYG---ATNFSTPHIDRLANQGMRFTDAYAAPVCSPTRASLM 87 Query: 145 TGQY---------------SIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 TG++ + G L+PP + L T+ Q LH GY IGKW Sbjct: 88 TGKHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDE-KTIAQELHADGYQCAMIGKW 146 Query: 190 HMGENK--ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 H+GE E +PQN GFD V LS + PF Sbjct: 147 HLGEEHGPEYRPQNRGFD----------------------RVVLSEHHGIFNYFYPFVDQ 184 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 G D P R D + F+ + ++PFFLY H + Sbjct: 185 QKWPYAGPLPGNPGDYLP-------DRLTDEAIDFVRE--NRERPFFLYLSHWSVHGRYF 235 Query: 308 PN----AKY--AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 AKY G Y M +++ L TL++ DNTL VF SDNG E Sbjct: 236 APESLIAKYRERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSDNGGER 295 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLA 417 P RG+KGS +EGGVRVP V + G+++P + + V DLFPT LD A Sbjct: 296 IT---SMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPTFLDFA 349 >UniRef50_C1ZFQ0 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFQ0_PLALI Length = 522 Score = 141 bits (356), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 124/439 (28%), Positives = 198/439 (45%), Gaps = 71/439 (16%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS--SPTR 140 ++PN+++ L DD+G+ D+ V T ID +A +G+ T A+S PS+ +PTR Sbjct: 32 AEQPNILLILADDLGYGDLRCYNSQSKVS--TSHIDRLAREGMRFTDAHS-PSTVCTPTR 88 Query: 141 ATILTGQYSIH-------HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 ++TGQ + P PG L TLP +L ++GY T +GKWH+G Sbjct: 89 YGLMTGQMPFRAPSGGTVFTGVGGPSLIAPGRL----TLPMMLRERGYSTACVGKWHIGL 144 Query: 194 ---NKESQPQNV----------------------GFDDFRGFNSVSDMYTEWRDVHV-NP 227 ++E +P + GFD F F + T+W + N Sbjct: 145 TFFDREGRPIHSNALEAVRQVDFSRRIDGGPVDHGFDSF--FGTACCPTTDWLYAFIEND 202 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 V + P S LP H R G +D ME++D +++ +FL++ Sbjct: 203 RVPVPPTASLEKSALP-KHPYAHDCRPG--LIASDFA---MEEIDLIFLEKSRQFLNQHV 256 Query: 288 KSD--KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 + + KPFFL++ T+ H ++ ++ G S A +GD ++E++ + L K+LE+ Sbjct: 257 RQNPGKPFFLFHSTQAVHLPSFAAKQFQGKSEA-GPHGDFLLELDYIVGELMKSLEELHI 315 Query: 346 LDNTLIVFTSDNGPEAEVPPHGRT--------PFRGAKGSTWEGGVRVPTFVYWKGMIQP 397 +NTL++FTSDNGPE H R+ P+RG K WEGG RVP V W G ++P Sbjct: 316 AENTLVIFTSDNGPEVTSVIHMRSDHGHDGARPWRGMKRDAWEGGHRVPFIVRWPGKVRP 375 Query: 398 RKSDG-IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK--A 454 ++ + L D+ T A V +P D + +L + R Sbjct: 376 GTTNSQLTSLTDVMATV-------AAIVDTQLPDHAAEDSFNMLPAWLDESAPPIRPYLL 428 Query: 455 EHYFLNGKLAAVRMDEFKY 473 F + A+R E+KY Sbjct: 429 TQSFGGSRTLAIRQGEWKY 447 >UniRef50_A6LIX5 Arylsulfatase n=2 Tax=Bacteroidales RepID=A6LIX5_PARD8 Length = 514 Score = 141 bits (355), Expect = 7e-32, Method: Compositional matrix adjust. Identities = 145/497 (29%), Positives = 218/497 (43%), Gaps = 86/497 (17%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 K+PNVV+ L DD+G+ DVG N V TP ID +A G+ T A+S + S P+R Sbjct: 22 KQPNVVIILADDMGYGDVGCNNPYARV--RTPAIDQLARNGIRFTDAHSAGALSGPSRYG 79 Query: 143 ILTGQY-------SIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN- 194 ++TG+Y S + G L P Y +P L T+ L+ + GY T +GKWH+G + Sbjct: 80 LVTGRYFFRTPKKSEYWGYLSP--YIEPERL----TIGSLMRNAGYTTACVGKWHLGLDW 133 Query: 195 ---KESQPQ-----------------------NVGFDDFRGFNSVSDM--YTEWR-DVHV 225 +S+PQ +GFD + DM Y R D V Sbjct: 134 QLKDDSKPQILTPKKFGYTNTDFSAPVKRGPTELGFDYSFILPASLDMPPYAFVRNDRVV 193 Query: 226 NPEVALSPD-----RSEYI---KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMD 277 +P+V L+ D + E + + +++D++ RG + E+ +D Sbjct: 194 DPDVILTADAYPKKQDETVYAWDRKHTNENDIYWERGVWWRNGEMSRSFKFEECFPTIVD 253 Query: 278 YGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 G+ F+D+ + DKPFFLY G H P ++ GS+ T YGD M ++++V A + Sbjct: 254 EGIAFIDREGRKDKPFFLYMPLTGPHTPWLPTVQFKGSTELGT-YGDFMGDIDNVVARVN 312 Query: 338 KTLEKNGQLDNTLIVFTSDNG---PEAEVPPHGRT---PFRGAKGSTWEGGVRVPTFVYW 391 L++ G NT+++F SDNG E ++ +G RG KG W+GG VP V+W Sbjct: 313 AKLKELGLEKNTIVIFASDNGGAWEEEDIQQYGHQSNWSRRGQKGDAWDGGHHVPLIVHW 372 Query: 392 KGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS 450 I+ P V L D+ T DL G +PK D G S Sbjct: 373 PDHIKCPGVCSQTVGLVDILATLADLTGQS-------LPKGQAEDSFSFKKVLDGDMNAS 425 Query: 451 NRKAEHYFL-NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG-----TVMQTAGSSV 504 R Y +GKLA + D + Y GGFT V + Sbjct: 426 TRDQIMYLSGSGKLAIKKGD-----------WKYIDCLGSGGFTAPARLSPVKNGPKGQL 474 Query: 505 FNLYTDPQESDSIGVRH 521 +N+ TD ES+++ +R Sbjct: 475 YNMRTDSLESNNLFLRE 491 >UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN Length = 455 Score = 141 bits (355), Expect = 8e-32, Method: Compositional matrix adjust. Identities = 108/348 (31%), Positives = 168/348 (48%), Gaps = 55/348 (15%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PN+V L DD+G DV + G + TP +DA+A+ G L Y QP SPTR+ +L Sbjct: 28 RPNIVFLLADDLGGADVSWRGSPIK----TPQLDALANSGAKLEQFYVQPVCSPTRSALL 83 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQ-P 199 TG+Y + +G+ + + +P GL TL + L D GY T +GKWH+G + P Sbjct: 84 TGRYPMRYGLQVGVV--RPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAYLP 141 Query: 200 QNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD G +N D +T RD ++ K ++D+ +A Q+ Sbjct: 142 MARGFDHQYGHYNGALDYFTHDRD-----------GGHDWHKDDHVNRDEGYATHLIAQE 190 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DNYPNAKYA 313 A+ + ++D D++ KP FLY H ++Y A Y Sbjct: 191 AV-----RVIQDRDKK----------------KPLFLYVPFNAVHSPLQVPESY-AAPYG 228 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN-GPE-AEVPPHGRTPF 371 R +Y + +++ + +++ LDNTL +F+SDN GPE ++ +G P Sbjct: 229 DMKKRRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEPGKLTDNG--PL 286 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 RG K + +EGGVRV F WKG I P K + + + D +PT ++LAG Sbjct: 287 RGGKHTLYEGGVRVCAFASWKGRIAPGSKVEAPLHIVDWYPTLIELAG 334 >UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UH46_RHOBA Length = 490 Score = 141 bits (355), Expect = 8e-32, Method: Compositional matrix adjust. Identities = 133/479 (27%), Positives = 206/479 (43%), Gaps = 79/479 (16%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATIL 144 PN+V+ + DD+GW D GFNG + TP++DA+A++G +L YS P SPTRA+ L Sbjct: 32 PNIVLMMCDDLGWGDTGFNGNTII---QTPELDALANEGTVLDHFYSVGPVCSPTRASFL 88 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 TG++ GI G L TL ++L +GY T GKWH+G + Sbjct: 89 TGRHYFRMGIWT----ANKGHLPSQEFTLARMLKTRGYATGHFGKWHLGTLSRTVSA--- 141 Query: 204 FDDFRGFNSVSDMYTE--WR---DVHVNPEVALS---PDRSEYIKQLPFSKDDVHAVRGG 255 +G D++ W D E A+ P + + P+ ++ V Sbjct: 142 ----KGKGRRPDLHYAPPWERDYDASFVTESAVCTWDPGIGKRARNNPYYENGV------ 191 Query: 256 EQQAIADITPKYMEDLDQR-WMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----A 310 T + + D R MD + F++ A+ D+PF H D A Sbjct: 192 -------ATDENVLGCDSRVLMDRALPFIEAAAERDQPFLSVIWFHAPHEDIQAGPEYLA 244 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH---- 366 KY G A YG C+ ++D L K L G DNTL+ F SDNGPE P + Sbjct: 245 KYEGHGEAAHYYG-CITAVDDQVGRLRKKLASLGVADNTLLFFCSDNGPEGGEPSNRMKT 303 Query: 367 ----GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPG 421 F G K S +GGVRVP FV+W G I + + + + DL PT + G Sbjct: 304 RRAGSAGEFSGRKRSVLDGGVRVPAFVHWPGQIPAGVRLNAPLSVMDLLPTVAAITG--- 360 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 A +P +DG + + G Q+ R+ F G+ A + K+ ++I+ P Sbjct: 361 ---AETLP-NRLLDGENVLPIWKGE--QAQREKSIPFRYGQFAC--LVRGKHKLIIESPN 412 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 ++ +F+L D ES+++ + + ++TE+ ++E K Sbjct: 413 DDSK----------------DRLFDLSKDVSESNNLANQKPELTASMRTELLGFLESAK 455 >UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D3U0_9BACT Length = 467 Score = 140 bits (354), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 107/359 (29%), Positives = 153/359 (42%), Gaps = 61/359 (16%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PN + L DD+GW DVGF+ G V PTP++D +A +GL L Y P SPTR L Sbjct: 40 RPNFIFILADDLGWGDVGFHHGNV----PTPNLDHLAGEGLELMQHYVYPVCSPTRCAFL 95 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 +G+Y+ + P P + T TL + L GY T GKWH+G E PQ G Sbjct: 96 SGRYASRFSVTTP---QNPRAFRWDTVTLARALKSVGYDTALCGKWHLGSKPEWGPQKFG 152 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD- 262 FD + S++ W + E + R + + EQ + D Sbjct: 153 FD--HSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIE--------------EQGHVTDL 196 Query: 263 ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF------DNYPNAKYAGSS 316 IT + +E L+ R +DKPFFLY H + + + Sbjct: 197 ITKEAVEWLESR--------------TDKPFFLYVPFTAVHIPIREPDEILQRVPASITK 242 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG------------PEAEVP 364 P+ YG ++ ++D + LEK G+ NTL++F SDNG P P Sbjct: 243 PSLRHYGANVMHLDDSVGKILVALEKTGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYP 302 Query: 365 P----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGH 419 P P G KG +EGG+ W G ++P K G+ + D PT LAG+ Sbjct: 303 PGPAGGSNEPLHGMKGEVYEGGIHTAAVARWPGQLKPGKFLGLAHITDWMPTFCALAGY 361 >UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S8Q2_NEMVE Length = 540 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 136/468 (29%), Positives = 205/468 (43%), Gaps = 81/468 (17%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 P+++ L+DD+GW DVG++ AV TP+ID +ASQG+ L S YSQP +P+R ++T Sbjct: 35 PHIMFILMDDLGWSDVGYHNISHAV--KTPNIDKLASQGVKLMSYYSQPMCTPSRGALMT 92 Query: 146 GQYSIHHG-----ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQP 199 G+Y IH G I + +G P + T+PQ L GY T IGKWH+G + + P Sbjct: 93 GKYPIHLGMQHFVINITSPWGMP---RRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYTP 149 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F GF + WR + L F +D+ A G Q + Sbjct: 150 LRRGFDSFLGF--FAGEQDHWRHSKMG--------------FLDFRRDEEPANEYGGQHS 193 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--FDNYPN--AKYAG- 314 D+ + ++ R + +P FL H +PN K G Sbjct: 194 -TDVFTQEAINIAMR------------HNASQPLFLLLSYAAVHTPLQAHPNDVNKIGGV 240 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGA 374 S R +Y M + L ++NG +NTL+++ SDNG + P RG Sbjct: 241 SDKDRQNYLGMMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGYNWPLRGY 300 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDL---ADLFPTALDLAGHPGAKVANLVPKT 431 K S +EGGVRVP FV+ G + RK + DL D +PT + LAG + P Sbjct: 301 KSSLFEGGVRVPAFVH--GEMLQRKGGTVNDLFHVTDWYPTLVKLAG------GEVEPD- 351 Query: 432 TFIDGVDQT----------------SFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK--Y 473 IDGVDQ + + N + R A F AA+R K Y Sbjct: 352 --IDGVDQWPTLSEGKPSKREEILHNIDIPANQEEERMAPRGFNYYSGAALRRGHMKLVY 409 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQ----TAGSSVFNLYTDPQESDSI 517 + Y ++G++G +++ +++N+ DP+E + + Sbjct: 410 KMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITADPEERNDL 457 >UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI83_PLALI Length = 558 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 139/471 (29%), Positives = 209/471 (44%), Gaps = 61/471 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 +KPNVV+ DD+G+ DVG G + TP+ID +A +G+ TS Y +Q S +R Sbjct: 106 EKPNVVIINCDDLGYADVGAFGATIC---KTPEIDRMAREGVKATSFYVAQAVCSASRTA 162 Query: 143 ILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +LTG GIL + G+ TL +L QGY T GKWH+G + P + Sbjct: 163 LLTGCLPNRIGILGALSHVSKNGIADSEVTLGELFQSQGYSTAMYGKWHLGYQAQFLPGH 222 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF--SKDDVHAVRGGEQQA 259 GF + G +DM+++ NP P LP K D A G Sbjct: 223 HGFGEALGIPYSNDMWSK------NPYGKFPP--------LPLFRQKGDSPAEIIGHDTD 268 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 + T + V F+D+ A DKPFF+Y H + + + A+ Sbjct: 269 QSRFTTDFTM--------AAVSFIDRHA--DKPFFIYLAHPMPHTPIFVSEERNSGERAQ 318 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFRGAKGS 377 Y D + E++ + +TLEK+ TL++FTSDNGP H + P R KG+ Sbjct: 319 L-YRDVIGEIDWSVGTIRQTLEKHQLTRKTLVIFTSDNGPWLVFGNHAGSTGPLREGKGT 377 Query: 378 TWEGGVRVPTFVYWKGMIQPRKSDGIVDLA----DLFPTALDLAGHPGAKVANLVPKTTF 433 W+GG RVP W G+I P D VDL DLFPT A GAK+ + Sbjct: 378 MWDGGARVPFVACWPGVIPP---DTTVDLPMATYDLFPT---FAKMLGAKLPDHP----- 426 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG--G 491 IDGVD + +A ++ L AVR +K + P+ Y +G G Sbjct: 427 IDGVDIWPQLTSASKAQPHQALWFYYGRDLIAVRSGPWK----LVFPHTYVHPVERGNDG 482 Query: 492 FTGTVMQTAGS--SVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 G ++ + +++NL +D E+ ++ +H + ++ AY E+ + Sbjct: 483 QRGKLVNRKFTELALYNLDSDIGETTNLASQH----PEIVKQLEAYAEVAR 529 >UniRef50_D2R2H5 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2H5_9PLAN Length = 507 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 126/421 (29%), Positives = 185/421 (43%), Gaps = 68/421 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS-SPTR 140 + +KPN++V + DD+G+ D+G NG TP ID VA++GL TS Y S+ +PTR Sbjct: 21 SAEKPNIIVIIADDLGYGDLGCNGSQTIA---TPHIDRVAAEGLRFTSGYCSASTCTPTR 77 Query: 141 ATILTGQYSIH---HGILMP--PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG--- 192 ++LTG Y+ GI P P QP T+ LL QGY T IGKWH+G Sbjct: 78 YSLLTGTYAFRVKGTGIAAPNSPALIQPE----TVTVASLLKSQGYATACIGKWHLGLGV 133 Query: 193 ----ENKESQPQ--NVGFDDFRGFNSVSD----MYTEWRDV-HVNPEVAL---------- 231 N E +P +GFD + +D ++ E V +++P L Sbjct: 134 GKPDWNGELKPGPLEIGFDHCLLLPTTNDRVPQVFVENHRVRNLDPADPLWVGDEKPSDD 193 Query: 232 SPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI-------TPKYM-EDLDQRWMDYGVKFL 283 P + L D H G I+ I ++ +DL W+ +++ Sbjct: 194 HPTGISHRSTLAMDWDYGH--NGTIHNGISRIGFYTGGMKARFRDQDLADEWVKASAQWI 251 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 + A PFFLY+ H P+ ++ G S GD ++E + L K LE++ Sbjct: 252 E--ANKAGPFFLYFAAHDIHVPRTPHERFVGKS-GMGPRGDSILEFDWCVGELMKVLEQH 308 Query: 344 GQLDNTLIVFTSDNGP-------EAEVPPHGRTP----FRGAKGSTWEGGVRVPTFVYWK 392 +NTL+V SDNGP + V G+ FRG K S +EGG R P V WK Sbjct: 309 QLAENTLVVICSDNGPVLNDGYKDQAVELIGKHAAAGLFRGGKYSVFEGGTRTPFIVSWK 368 Query: 393 GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 G + SD +V D A A GAK +P+ +D ++ LG + R Sbjct: 369 GRVASGVSDKLVSTIDF---ASSFAALAGAK----IPEDACLDSLNLLDTLLGDKAAAGR 421 Query: 453 K 453 + Sbjct: 422 E 422 >UniRef50_D2QCX4 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QCX4_9SPHI Length = 533 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 135/449 (30%), Positives = 188/449 (41%), Gaps = 133/449 (29%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 K+PN++ L DD+G+ D+G GG V TP++D +A+ G+ L S Y+ PTRA++ Sbjct: 39 KRPNILYILADDMGFSDIGCYGGEVN----TPNLDKLAAGGIKLRSFYNNARCCPTRASL 94 Query: 144 LTGQY--SIHHGIL--MPPMYGQPGGLQGLT-----TLPQLLHDQGYVTQAIGKWHMGEN 194 LTGQY ++ G++ MP QPG QG T+ + L + GY T +GKWH+GE Sbjct: 95 LTGQYPHTVGMGLMVTMPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHVGER 154 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E P GF+ + G S + S Y + +P K V Sbjct: 155 PEHWPLKRGFEHYFGLISGA---------------------SSYYEIIPAEKGKRFIVLD 193 Query: 255 GEQQAIADITPK----YMEDLDQRWMDYGVKFLD--KMAKSDKPFFLYYGTRGCHFDNYP 308 ++ TP YM D + DY V++L+ K ++DKPFF+Y HF + Sbjct: 194 DKE-----FTPPADGFYMTDA---FTDYAVQYLNQQKQEQADKPFFMYLAYTAPHFPLHA 245 Query: 309 N----AKYA----------------------------------GSSPARTSYGD------ 324 AKY + PA S D Sbjct: 246 YESDIAKYEKLYAQGWDVTRTKRYQKMQQLGLIDKRYQLTPRPANVPAWNSATDKAQWIR 305 Query: 325 ------CMVE-MNDVFANLYKTLEKNGQLDNTLIVFTSDNG-------------PEAEVP 364 M++ M+ L KTL+ NGQ DNTLIVF SDNG P ++ Sbjct: 306 KMAVYAAMIDRMDQNIGRLIKTLKANGQYDNTLIVFMSDNGSSNENMESRKLNDPTKKIG 365 Query: 365 PHGR-------------TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS--DGIVDLADL 409 G TPFR K EGG+ P + W I+P DGI + DL Sbjct: 366 ERGSYVTYDTPWANVSVTPFRKYKRFLHEGGMITPCIMQWPRNIRPAAGYVDGIGHVMDL 425 Query: 410 FPTALDLAG-----HPGAKVANL-VPKTT 432 PT+L+LAG PG ++ L PK T Sbjct: 426 LPTSLELAGLSANDLPGKSLSYLWTPKKT 454 >UniRef50_C3ZFE8 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZFE8_BRAFL Length = 485 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 132/467 (28%), Positives = 208/467 (44%), Gaps = 69/467 (14%) Query: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156 GW DV F+G PTP++D++A G+IL + Y P +PTR+ I+TG++ IH G+ Sbjct: 3 GWNDVSFHGSDQI---PTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQH 59 Query: 157 PPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENK-ESQPQNVGFDDFRGFNSV 213 + G P GL T LPQ L GY T +GKWH+G + E P GFD + G+ + Sbjct: 60 GVISGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTG 119 Query: 214 SDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQ 273 D Y + D D S ++L + D +R G + P + E+ Sbjct: 120 KDNYYDHTD-----------DESNSPEELGYKGLD---LRNGTE-------PVWTENGTY 158 Query: 274 RWMDYGVKFLDKMAKSD--KPFFLYYGTRGCH--------------FDNYPNAKYAGSSP 317 + + + D KP FLY + H D +P+ ++ P Sbjct: 159 STELFATEAERIITSHDTSKPLFLYLPHQAVHSGNPDNPLQAPQKYIDKFPHIQH----P 214 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN-GPEAEVPPH--GRTPFRGA 374 R ++ + ++D N+ K L G L+N++I+FT+DN GP A + P RG Sbjct: 215 GRRTFAAMVSALDDAVGNVTKALSARGMLENSVIIFTTDNGGPAAGFDQNYASNWPLRGV 274 Query: 375 KGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT-- 431 K + WEGGV FV+ + QP R + ++ + DL PT +LAG ++ NL Sbjct: 275 KNTLWEGGVHGTGFVHSPLIKQPKRTTHELLHVCDLLPTIYELAGGDSTELKNLDGTNVW 334 Query: 432 -TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 T GV + N RK AA+R ++K +++ + Y + + G Sbjct: 335 ETISRGVQSPRVEVLHNIDPKRKT---------AALRYGDYK--IILGEAY---KGAWDG 380 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + + T S N + DP E ++I + P+ L + AY E Sbjct: 381 WYPPEGVTTNHSEEAN-HEDPCEFNNIADWNKPLVNFLMGRLEAYSE 426 >UniRef50_C3ZQB5 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZQB5_BRAFL Length = 560 Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 122/427 (28%), Positives = 188/427 (44%), Gaps = 82/427 (19%) Query: 67 QDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD---IDAVASQ 123 + K ++ + ++L+ KPN+++ L DD+GW D+ + G+PT + ID +A++ Sbjct: 84 KTKTSRDERSKLKTTVPVKPNIILMLADDMGWGDL------CSYGHPTQECGEIDKMAAE 137 Query: 124 GLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHD 178 G+ T YS S SP+RA ILTG+ + G+ P GL TT+ +LL + Sbjct: 138 GMRFTQWYSADSLCSPSRAAILTGRLPVRVGVWGGSRVFLPASTGGLPRDETTIAELLKE 197 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T +GKWH+G +V ++ E D P + + Sbjct: 198 AGYATGMVGKWHLG-------------------AVHFVHMESPD----PMLCFKYWNATL 234 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 ++Q PF D+ +T +++D V F+ D PFFLY Sbjct: 235 VQQ-PFRHDN--------------LTTSFLQD--------SVAFMHN--NKDTPFFLYLS 269 Query: 299 TRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 H D + ++ +S R YGD + E++ + KTL TL++F SD+G Sbjct: 270 FAHMHTDMFSAPRFRETS-RRGRYGDGLRELDWAVGEVLKTLVSLQIQHRTLVIFLSDHG 328 Query: 359 PEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALD 415 E+ G + +G K STW+GG+RVP +W G++ P + S +V D+F TA + Sbjct: 329 GHLEICTEGGSNGILKGGKASTWDGGLRVPGIAWWPGVVAPGQVSQHLVSSMDVFQTAAE 388 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ--------SNRKAEHYFLNGKLAAVR 467 LAG PK DG L S R HY N +L AVR Sbjct: 389 LAG-------VTPPKDRIYDGKSLVPILLEKTAAVPTSKSPPSPRTLFHYCSN-RLMAVR 440 Query: 468 MDEFKYH 474 E+K H Sbjct: 441 YGEYKAH 447 >UniRef50_A6DG39 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG39_9BACT Length = 473 Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 114/364 (31%), Positives = 176/364 (48%), Gaps = 51/364 (14%) Query: 84 KKPNVVVFLLDDVGWMDVG-FNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRA 141 +KPN+V+FL DD+G+ D G FN P ID +A +G+ T A+S + +P+R Sbjct: 22 EKPNIVIFLADDLGYGDCGAFNSQSKI---KMPHIDRLAEEGMRFTDAHSASATCTPSRY 78 Query: 142 TILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQ- 198 +LTG + G+ + G+P + TL LL + Y T +GKWH+G ENK Sbjct: 79 GLLTGINPVRTGVFNTLLKTGRPIIHKDEMTLADLLKVEDYETWMVGKWHLGFENKSKSL 138 Query: 199 ---------PQNVGFDDFRGFNSVSD----MYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 P + GFD F G S + + + R + EV+ SE++ Sbjct: 139 DLSQDLRGGPLDCGFDYFFGLASSASSSPLCFIKNRKIQ---EVS-----SEFV------ 184 Query: 246 KDDVHAVRGGEQQAIADI-TPK--YMEDLDQRWMDYGVKFLDKMAKSDK--PFFLYYGTR 300 +V +RG Q++ I PK +ED+ R + V + + AKS K PF LY+ + Sbjct: 185 --EVDKIRGSGQKSKYKIAVPKDLKLEDVSPRLSENAVGLIQEYAKSAKEQPFLLYFASI 242 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP- 359 H P+ + G S Y D +++M+D + + L+ G NT+++FTSDNG Sbjct: 243 APHQPWVPSENFKGKS-GLGVYADFVMQMDDELGQINQALKDTGLEKNTIVIFTSDNGTG 301 Query: 360 ------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 AE H P RGAK S++EGG R+P W G+I +S +++ D+F T Sbjct: 302 PGAHYLMAEQGHHSSGPMRGAKASSYEGGHRMPFIAKWPGIIPVNSQSKAVINATDIFAT 361 Query: 413 ALDL 416 +L Sbjct: 362 IAEL 365 >UniRef50_UPI000180C5AE PREDICTED: similar to sulfatase 1 n=2 Tax=Ciona intestinalis RepID=UPI000180C5AE Length = 562 Score = 139 bits (349), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 121/388 (31%), Positives = 177/388 (45%), Gaps = 46/388 (11%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSP 138 +K+ G++P+VV L DD G+ D+G++ TP +D++A++G+IL + Y QP SP Sbjct: 27 KKQRGQRPHVVFVLADDFGFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSP 86 Query: 139 TRATILTGQYSIH----HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 TR +LTG+Y IH HGI+ QP GL L Q L GY T +GKWH+G Sbjct: 87 TRGQLLTGRYQIHTGLAHGIIRA---AQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGF 143 Query: 194 NKESQ-PQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 +E P N GF +F GF N + +T + H P+ K F D+ Sbjct: 144 FREEYLPWNRGFQNFFGFLNGGVNHFTRY---HCEPK-----------KTRRFCGYDMID 189 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DN 306 R G A Y E ++ + +DK K KP FLY + H + Sbjct: 190 SRYGPTNAT------YGEYSTNLFIRKSKEMIDKHNKQ-KPMFLYLSLQAVHGPLQVPNQ 242 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 Y R Y + M+ L K L++ NT+ +F++DNG + + Sbjct: 243 YLKRFKHIRDKNRRIYAGMVYAMDRGIRQLVKHLKRARMWKNTIFIFSTDNGGQTTRGGN 302 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGM--IQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 P RG KG+ WEGG+R FV+ K + PR + ++ ++D +PT + P Sbjct: 303 N-WPLRGKKGTLWEGGIRGVGFVHGKPLQVTTPRVNKELLHVSDWYPTIMSATHCP---- 357 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 V T IDG DQ L +N S R Sbjct: 358 --YVVGTPPIDGYDQWE-TLRSNKTSKR 382 >UniRef50_D2A3E0 Putative uncharacterized protein GLEAN_07966 n=4 Tax=Arthropoda RepID=D2A3E0_TRICA Length = 558 Score = 139 bits (349), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 114/377 (30%), Positives = 169/377 (44%), Gaps = 54/377 (14%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PNVV + DD+GW DVGF+G PTP+IDA+A G+IL S YSQ +P+RA +L Sbjct: 29 QPNVVFIVADDLGWNDVGFHGSNQI---PTPNIDALAYNGIILNSHYSQSFGTPSRAALL 85 Query: 145 TGQYSIHHGILMPPMYGQPG-GLQGLTTLPQLLHDQGYVTQAIGKWHMGENK-ESQPQNV 202 TG+Y + G+ P + G L + + D GY T +GKWH+G ++ P Sbjct: 86 TGKYPMKLGLQGPSITPAEGRSLPEGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFR 145 Query: 203 GFDDF----RGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD F GF S D + W+ +N + EY +D V + Sbjct: 146 GFDHFFGFYNGFTSYYDYVSNWK---INDK--------EY-SGFDLRRDTVPSWNDAG-- 191 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----------YP 308 KY DL + ++ V + K + P F+ H N Sbjct: 192 -------KYATDL---FAEHAVDVIQKH-NVNTPLFMMIAHLAVHVGNEGKWLEAPQETV 240 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG- 367 N P R +Y + +++D +++ LE L NT++VF SDNG P H Sbjct: 241 NKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALEAKNMLQNTIVVFISDNGAPTVGPHHNW 300 Query: 368 --RTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKV 424 P RG K + +EGGVR ++ ++Q R S ++ + D PT L G + Sbjct: 301 GSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSSRVSTDLIHITDWLPT---LFTAVGGDL 357 Query: 425 ANLVPKTTFIDGVDQTS 441 + L P +DG+DQ S Sbjct: 358 SVLDPD---LDGIDQWS 371 >UniRef50_D2QW96 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QW96_9PLAN Length = 481 Score = 139 bits (349), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 106/380 (27%), Positives = 173/380 (45%), Gaps = 70/380 (18%) Query: 89 VVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS--SPTRATILTG 146 ++ L DD+G+ DV V PTP+ID +A +G+ T A+S PS+ +P+R ++TG Sbjct: 1 MLILADDLGYGDVRCYNPDAKV--PTPNIDRLAREGMRFTDAHS-PSTVCTPSRYGLMTG 57 Query: 147 QYSIH-------HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG------- 192 Q + P PG L TLP +L QGYVT A+GKWH+G Sbjct: 58 QMPFRVPGGGTVFTGVGGPSLITPGRL----TLPAMLQQQGYVTAAVGKWHVGLTFRDSS 113 Query: 193 ------------------ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD 234 E P + GFD F F + T+W ++ + +P Sbjct: 114 GEPIKTSGVDAVRRVDFSRRIEGGPIDHGFDHF--FGTACCPTTDWLYAFIDGDHIPTPP 171 Query: 235 RS----EYIKQLPFSKDDVHAVRGGEQQAIADITPKY-MEDLDQRWMDYGVKFLDKMAKS 289 S + P+S+D + I P + M+++D ++ ++F+ + A++ Sbjct: 172 TSLLKRSTLPSHPYSED----------CRLGLIAPDFEMQEVDTIFLQKSLEFIAEHART 221 Query: 290 D--KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 KP FL++ T+ H ++ + G + +GD + + + + L + L+K+G + Sbjct: 222 SPQKPLFLFHATQAVHLPSFAGKDFRGKTEV-GPHGDFLCQFDHIVGQLMQALDKHGLAE 280 Query: 348 NTLIVFTSDNGPEAEVPPH--------GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK 399 NTL++ TSDNGPE H G P+RG K WEGG R+P V W G ++P Sbjct: 281 NTLVILTSDNGPETTTVVHMRADHQHDGAKPWRGVKRDAWEGGHRLPLIVRWPGHVKPNT 340 Query: 400 SDG-IVDLADLFPTALDLAG 418 + + L D+ T + G Sbjct: 341 TSAELTSLTDIMATVAAITG 360 >UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI18_9BACT Length = 562 Score = 139 bits (349), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 141/503 (28%), Positives = 225/503 (44%), Gaps = 86/503 (17%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRA 141 +KPN++ L DD+G DV + PTP +D +A+ G++ T A++ S +PTR Sbjct: 29 AEKPNIIYLLADDMGVGDVKAYNADSKI--PTPALDNLAANGMMFTDAHTNSSVCTPTRY 86 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLT---------TLPQLLHDQGYVTQAIGKWHMG 192 ILTG+YS + G QGL+ T+ LL +GY T IGKWH+G Sbjct: 87 GILTGRYSWR-------TTKKSGVTQGLSPHLIDSNRETVASLLKKEGYATACIGKWHLG 139 Query: 193 EN---------------------KESQ--PQNVGFDDFRGFNSVSDM--YTEWRDVHVNP 227 + KE Q P GFD + G + ++ + D + Sbjct: 140 MDWSLKDGSIADSKSDQSQIDLSKEIQNGPNKNGFDYYFGMAASANHSPHCFIEDGYTVG 199 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 ++ + D+ K + +G +Q ++I P++ E + W+ V Sbjct: 200 KLQVLDDKQR--KAVGIDGKPGLVAKGFKQ---SEILPRFTEKTCE-WVRSQVN-----Q 248 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 K D+PFF+Y H P+AK+ G S +S+GD +E + + K L+ G D Sbjct: 249 KPDQPFFVYMPLNSPHSPIVPSAKFLGKS-GLSSHGDFCMETDWALGEVVKILKALGIED 307 Query: 348 NTLIVFTSDNG--PEAEVPP---HGRTP---FRGAKGSTWEGGVRVPTFVYW-KGMIQPR 398 NT+I+FT+DNG P A+ P G P +RG KG T+EGG RVP V W KG+ + Sbjct: 308 NTMIIFTADNGTSPMAKFEPMQEQGHFPSYIYRGLKGETYEGGHRVPFIVKWPKGLAPAK 367 Query: 399 KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ-----SNRK 453 SD ++ DL T ++ G +AN V G D SF Q +NR Sbjct: 368 TSDQLICTTDLMATVAEIN---GIALANNV-------GEDSISFLPALREQAIPELANRA 417 Query: 454 AEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 H+ G A + + K+ +L+ +S + V+ A +F++ DPQE Sbjct: 418 IVHHSDAGVFA---IRQGKWKLLLDNIGGSRRSNPK---DKPVIDDAEIQLFDMVNDPQE 471 Query: 514 SDSIGVRHIPMGVPLQTEMHAYM 536 S ++ ++ + L+ ++ Y+ Sbjct: 472 STNLSQKNPEIVEGLKKQLADYI 494 >UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7HQ00_PARL1 Length = 553 Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 135/484 (27%), Positives = 209/484 (43%), Gaps = 93/484 (19%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 E + PN+VV L DD+G+ D+ GGG+ PTP+ID++A G TSAYS + + Sbjct: 64 EPAGNRPPNIVVILADDLGFNDISHFGGGIV---PTPNIDSIARGGANFTSAYSGTAACA 120 Query: 138 PTRATILTGQYSIHHGILMPP----------MY-------------------GQPGGLQG 168 P+RA I+TG+Y G P M+ P QG Sbjct: 121 PSRAMIMTGRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQG 180 Query: 169 L----TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVH 224 L TL + L +GY IGKWH+G E P GFD+ S + + DV Sbjct: 181 LPGSEITLAEALKPKGYHNIHIGKWHLGNAPEFLPNAQGFDESVMLESGLFLPEDSPDV- 239 Query: 225 VNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK-YMEDLDQRWMDYGVKFL 283 VN ++ P I Q +++ G A PK Y+ D + D +K + Sbjct: 240 VNAKLPFDP-----IDQFLWARMQYATSYNGS----AWFEPKGYLTDF---YTDEAIKAI 287 Query: 284 DKMAKSDKPFFLYYGTRGCHF------DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 + A ++PFFLY G H +Y + R Y +V ++ + Sbjct: 288 E--ANRNRPFFLYLAHWGVHTPLQASKADYDALSHIEDERLRV-YAAMIVALDRSVGRVL 344 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVP-PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 ++L++NG +NTL++F+SDNG + P P+RG K + +EGG+RVP F W I Sbjct: 345 QSLKENGLEENTLVIFSSDNGAPGYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARI- 403 Query: 397 PRKSDGIVDLA--DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 P ++ +A D+FPT + AG +P IDG+D + G+ Sbjct: 404 PAGTERTTPVAHLDMFPTIVAAAG-------GELPADRVIDGIDLLPY--AARGEKPAPR 454 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 ++ +G AV+ D +K + ++P + +FNL TDP E Sbjct: 455 PIFWRDGHYQAVQADGWKLQ-MAERPNK-------------------TWLFNLKTDPTEQ 494 Query: 515 DSIG 518 +++ Sbjct: 495 NNVA 498 >UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z632_9GAMM Length = 545 Score = 138 bits (347), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 111/368 (30%), Positives = 173/368 (47%), Gaps = 52/368 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 +KPN+++ + DD+GW DVG++GG + TP +D +A QG+ L Y+ P SPTRA + Sbjct: 31 QKPNILIMVADDLGWADVGYHGGDID----TPSLDRLAQQGVRLNRFYTTPICSPTRAAL 86 Query: 144 LTGQYSIH----HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES-Q 198 +TG+ I +G++ P + G +P+ GY T IGKWH+G + + Sbjct: 87 MTGRDPIRLGVTYGVIFP--WDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYH 144 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY----IKQLPFSKDDVHAVRG 254 P N GF+ F G H++ EV P S ++ S DD +G Sbjct: 145 PNNRGFEHFYG--------------HLHTEVGFYPPFSNQGGKDFQRNGVSIDD----QG 186 Query: 255 GEQQAIADITPKYME--DLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 E +AD +Y+ D D+ ++ Y + F+ D P L + D P A+ Sbjct: 187 YETYLLADEVSRYIRERDRDRPFLVY-MPFIAPHTPLDAPVELQDKYKDIETD-LPMARS 244 Query: 313 AGSS------------PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 + AR Y + M+ + TL++ G DNT+++F SDNG Sbjct: 245 RQTDDTRLISRVMLQPSARPMYAAVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGGA 304 Query: 361 A-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAG 418 A P RG KG T+EGG+RV + + W M++P + + I+ + D+FPT +D A Sbjct: 305 AYSYGGANNAPLRGGKGETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVDAAD 364 Query: 419 -HPGAKVA 425 PG A Sbjct: 365 VRPGNNFA 372 >UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R917_9PLAN Length = 486 Score = 137 bits (346), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 131/468 (27%), Positives = 198/468 (42%), Gaps = 100/468 (21%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PN+V + DD+GW DVGFNG TP+IDA+A G + Y Q +PTRA + Sbjct: 27 RQPNIVHIVADDLGWKDVGFNG---CTEIKTPNIDALAKGGAKFSQFYVQNMCTPTRACL 83 Query: 144 LTGQYSIHHG---ILMPPMYGQPGGLQGLTT----LPQLLHDQGYVTQAIGKWHMGE-NK 195 +TG++ +G I++P G GL T +PQ L D GY T IGKWH+G ++ Sbjct: 84 MTGRFPYRYGLQTIVIPTAAG-----YGLDTSEYLMPQCLGDAGYKTAIIGKWHLGHADQ 138 Query: 196 ESQPQNVGFD-DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD--VHAV 252 + P+ GFD + D +T D H L + +D+ VH Sbjct: 139 KYWPKQRGFDYQYGAMIGELDYFT--HDEH---------------GVLDWFRDNKPVHE- 180 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DNY 307 +G I D KY+ D + KPF+LY H Y Sbjct: 181 QGYTTTLIGDDAVKYIHGQDGK----------------KPFYLYLTFNAPHTPYQAPKEY 224 Query: 308 PNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE----- 362 + P R +Y + +++ + L++ G +NTLI F SDNG + Sbjct: 225 ITKYLNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAG 284 Query: 363 --------VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTAL 414 V P P+R KGS +EGG RV W G I+ + DG++ DL+PT Sbjct: 285 QMADMSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTVDGMIHAVDLYPTFA 344 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-LNGKLAAVRMDEFKY 473 LA GA +A P +DG + G+ + + E ++ + A +R ++K Sbjct: 345 ALA---GASIAKCKP----LDGTNVWDTI--AEGKPSPRTEFFYSIEPFRAGLRQGDWK- 394 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 LI + M + ++NL DP E ++I H Sbjct: 395 --LIWR----------------TMLPSSVDLYNLAEDPYEKNNIAAAH 424 >UniRef50_A6DSM5 Arylsulfatase A (Precursor) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSM5_9BACT Length = 401 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 118/425 (27%), Positives = 188/425 (44%), Gaps = 38/425 (8%) Query: 120 VASQGLILTSAY-SQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHD 178 +AS+G Y Q S +RA +L+G Y + G T+ + L Sbjct: 1 MASEGSTFLQFYVPQAICSASRAALLSGSYPHRTNVFGAHGPNGRGPPTEFATIAEPLKK 60 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T GKWH G+ ++P GFD+ G +DM+ L P + ++ Sbjct: 61 SGYNTVHFGKWHCGDTNATRPLARGFDEHAGLMYSNDMWH------------LHPMQPKH 108 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + P + GE + I DI PK ++L + + V F+ + D+PFFLY Sbjct: 109 WGKFP-----LRFWNNGEIE-IEDIQPKDQKNLTKWATEKSVDFIKR--NKDQPFFLYTT 160 Query: 299 TRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 H Y + ++ G S + YGD + E++ + + L+ NG D T+I+F+SDNG Sbjct: 161 HSMPHVPLYVSKEFEGIS-GQGLYGDVLAELDWSVGQINQALKDNGIEDKTMIIFSSDNG 219 Query: 359 PEAEVPPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA-DLFPTALDL 416 P A H G+ P+R AK ++++GG R P V + MI P + V + DL PT LDL Sbjct: 220 PWAGYGDHAGKPPYREAKATSFDGGTRSPLIVKYPKMIPPNSASKKVFCSIDLMPTILDL 279 Query: 417 AG--HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK-LAAVRMDEFKY 473 AG HP K IDG + G N +YF G+ L A+ ++ Sbjct: 280 AGGPHPDNK----------IDGKNVLDLMTDKKGAKNPHHYYYFSTGRHLEAIMSANGRW 329 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGS-SVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + + Y + Q GF + +++++ DP ES +I + + L+ Sbjct: 330 KLHLPHSYRHVQVAGADGFDAKYSRPQQPLALYDMKNDPMESKNIISNYPELADELKQAA 389 Query: 533 HAYME 537 AY++ Sbjct: 390 QAYIK 394 >UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8S3_9PLAN Length = 481 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 123/405 (30%), Positives = 179/405 (44%), Gaps = 71/405 (17%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTS-AYSQPSSSPTR 140 KPN +V DD+G+ D+ G+P TP ++ +A++G LT P +P+R Sbjct: 39 KPNFIVIFADDLGYGDLE------CYGHPRFKTPHLNQMAAEGARLTQFNVPVPYCAPSR 92 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTL---------PQLLHDQGYVTQAIGKWHM 191 AT+LTG+Y HG+ P P G Q + + +LL + GY T IGKWH+ Sbjct: 93 ATLLTGRYPWRHGVWYNPA---PDGQQFRSGVGIAESELLLSELLKENGYATICIGKWHL 149 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G + E P GFDD+ G +DM R V++ + E + + P + Sbjct: 150 GHDPEYYPTRHGFDDYLGILYSNDM----RPVNLM--------QGEKLLEYPVIQ----- 192 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 +L +R+ + VKF+ + + PFFLY H + Sbjct: 193 -----------------ANLTKRYTERAVKFIQE--NQEGPFFLYLPHAMPHKPLAASEA 233 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 + S A YGD + E++ ++KTL + +NTL++F SDNGP G Sbjct: 234 FYKKSGAGL-YGDVIAELDWSVGEIFKTLRELNLDENTLVIFASDNGPWFGGNTAG---L 289 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPK 430 G K +TWEGG+RVP W G I PR+ D + D+FPT L AG P VP Sbjct: 290 SGMKSTTWEGGLRVPMIARWPGKIPPRQVIDTVCGSIDVFPTILKQAGIP-------VPA 342 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 IDG D L + +A + L VR +K HV Sbjct: 343 DRVIDGKDLFP-VLTKQAPTPHQALYSMKGNSLFTVRSGPWKLHV 386 >UniRef50_Q02AN8 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02AN8_SOLUE Length = 443 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 133/478 (27%), Positives = 204/478 (42%), Gaps = 81/478 (16%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATI 143 +PNV+V +LDD+G D+G+ G A TP IDA+A++GL + YS P +P R+ I Sbjct: 26 RPNVLVVVLDDLGCHDLGYLG---AADLKTPHIDALAARGLKFRNWYSNAPVCAPARSAI 82 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 LTG++ G+ P G P G+ TL +L GY T GKWH+G E+ P G Sbjct: 83 LTGRFPASAGV---PDNG-PALAHGIPTLASVLKGSGYQTGCFGKWHLGSTDETAPTGHG 138 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 FD F GF+S +Y + D+ H + + D Sbjct: 139 FDSFYGFHSGC---------------------VDYYSHRFYWGDNYHDLWHNRTEIFED- 176 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS----SPAR 319 +Y L +R D F+ + ++PF Y H+ + A+Y +P R Sbjct: 177 -GRY---LTERIADEAAGFIGR----NRPFLGYVAFNAPHYPMHAPAQYKARFPNLAPER 228 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE---------VPPHGRTP 370 +Y + ++D + + LE G +NTL+ F DNG E Sbjct: 229 QTYAAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGATTEKRAGLNGDFATAGDNGV 288 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA---DLFPTALDLAGHPGAKVANL 427 F+G K S ++GG+ VP FV W I RK +LA D+ PT G P L Sbjct: 289 FKGYKFSLFDGGMHVPGFVSWPAGI--RKGGWTDELAMSMDILPTICRATGAP------L 340 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 P+ +DG D + + +N S K+ ++ G+LA R P+ +G Sbjct: 341 PPR---VDGSDLLN-TIASNAPSPHKSLYWSQGGQLATRR-----------GPWKLVVNG 385 Query: 488 --YQGGFTGTVMQTAGSSVF--NLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 Y G T +V+ NL +P E+ ++ H + L T++H + + L K Sbjct: 386 RLYDRRADGNKPLTGEDAVWLSNLDDNPGETRNLRRTHANLVDELLTDLHRWHDALPK 443 >UniRef50_B7S0F9 Sulfatase domain protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S0F9_9GAMM Length = 602 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 103/344 (29%), Positives = 166/344 (48%), Gaps = 45/344 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KKPN+++ LDD G+ D+ N G +PTP +DA+A+QG+ T Y++ S + +R + Sbjct: 14 KKPNILLLALDDFGYNDLAINNGS---DSPTPRLDAIAAQGVRFTRHYAESSCTASRVAL 70 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQG----LTTLPQLLHDQGYVTQAIGKWHMGE-NKESQ 198 LTG+Y P G L G L TLP L +GY+ +GKWH G+ ++ES+ Sbjct: 71 LTGRY--------PARVGAHPYLNGIDHELMTLPDALGSEGYIRHMVGKWHTGDSHRESR 122 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P+ GFD + GF ++ +Y P + + R + P+ ++++ G QQ Sbjct: 123 PEYQGFDHWFGF--INQLYLR------GPHRSANYRRGKPTYINPWLENEL----GDLQQ 170 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA---GS 315 +T + + LD + + P+FLY H P A+++ Sbjct: 171 YEGHLTDILTD-----------RALDVIKREQNPWFLYLSYYAPHTPIEPAARFSERYAD 219 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 PA Y +++ + L ++G++DNT+I+ SDNG A+ P PF G+K Sbjct: 220 DPA-GRYQAMKDQLDSNIGRIIDWLTESGEIDNTMIIVVSDNGGTAKSWP-SNLPFYGSK 277 Query: 376 GSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 + EGGVR P + W G ++ D I + DL+PT L G Sbjct: 278 ATYTEGGVRTPLLLSWPGHWPVGQQDDQIAMIFDLYPTILAALG 321 >UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XP0_PSEA6 Length = 627 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 133/480 (27%), Positives = 207/480 (43%), Gaps = 84/480 (17%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 Q + A E T KPN+V+ + DD G+ D+G + + TP+ID +A+Q LT+ Sbjct: 31 VQNRSASAEPPT--KPNIVLIVTDDQGYGDIGRHNNPII---QTPNIDDIAAQSARLTNF 85 Query: 131 YSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 + P+ SPTR+ +LTG++S+ G+ + G + +T L + L + GY T GKWH Sbjct: 86 HVDPTCSPTRSALLTGKHSLRAGVWHTILGRYMLGPEHVT-LAESLQENGYRTGIFGKWH 144 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G+N +PQ+ GFDD +H V +PD Y F +D + Sbjct: 145 LGDNYPYRPQDQGFDDVL--------------IHGGGGVGQTPD---YWGNTQF--NDTY 185 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY---------YGTRG 301 R G + + K W D KF+DK + D P+F Y Y Sbjct: 186 -YRNGTPEKFSGYATKI-------WFDEAKKFIDK--QHDTPYFAYIALNAPHGPYRAPE 235 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG--- 358 H + Y + G + S+ + +++ L L QLDNT+ +F +DNG Sbjct: 236 THIEPY---EKRGLNRDMASFYGMISYIDEQVGELRAHLRAQDQLDNTIFIFMTDNGSSY 292 Query: 359 --------------PEAEVPPHGR--TPFRGAKGSTWEGGVRVPTFV-YWKGMIQPRKSD 401 P AE P+ + RG KG +EGG RVP F+ Y G I + Sbjct: 293 KPTDAKTHLTKRHLPLAEQYPNWQPNDNMRGYKGEVYEGGHRVPFFISYPNGNITTGDYE 352 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG 461 I D+ PT L+L AN+ P + +DG ++ G Q+NR E Sbjct: 353 AITAHFDVMPTLLEL--------ANIPPVNSTLDGTSLATYLKGE--QANRSLESKL--S 400 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 + A V ++ YH +++P A ++ + +FNL DP + + I H Sbjct: 401 ERAIVVTNQRVYHPSVKRPIAIAFHQWR-----YISANDSEKLFNLQQDPSQQNDIKNDH 455 >UniRef50_A6DP41 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DP41_9BACT Length = 534 Score = 137 bits (345), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 149/516 (28%), Positives = 222/516 (43%), Gaps = 88/516 (17%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNP-----TPDIDAVASQGLILTSAYSQPS-SS 137 +KP+++ L+DD+G G V+ NP TP IDA+A G++ T ++ S + Sbjct: 21 EKPHIIYVLMDDMG-------QGDVSCFNPSSKIHTPQIDALAKNGMMFTDTHTNSSVCT 73 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 PTR ILTG+Y+ + + G L G TL LL QGY T IGKWH+G + Sbjct: 74 PTRYGILTGRYAWRTHLKKSVIGGTSPSLIKPGRMTLASLLKGQGYHTGMIGKWHLGWDF 133 Query: 196 ESQPQNVGFD----------------------DFRGFNSVSDMYTEWRDVHVNPEVALSP 233 P +V D D GF+ Y+ + + P V + Sbjct: 134 SFHPDSVKIDPLYWGYTPGTKIDYAKGVENGPDVHGFDY---YYSIPSSLDIPPYVYVEN 190 Query: 234 DRSEYIKQLPFS----KDDVHAVRGGEQQA---IADITPKYMEDLDQRWMDYGVKFLDKM 286 R + L S ++ RGG A I D+TP + +Q F+ K Sbjct: 191 GR---VTNLDISERKGEEGKRLWRGGPMSADFDIEDVTPNFFRRANQ--------FIAKN 239 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 AKSDKPFFLY H P K+ G S Y D +++++ L KTL+ N Sbjct: 240 AKSDKPFFLYLPLPSPHTPILPIKKFQGKS-GVNEYADFILQIDSHMGELIKTLKDNNIF 298 Query: 347 DNTLIVFTSDNG--PEA---EVPPHGRTP---FRGAKGSTWEGGVRVPTFVYW-KGMIQP 397 DNTL+VFT+DNG P A E+ G P FRG K +EGG RVP V W G +Q Sbjct: 299 DNTLLVFTADNGISPRADIVEINNAGHFPSNGFRGRKADIFEGGHRVPYIVTWPNGGVQA 358 Query: 398 RK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE- 455 S+ + D+ T D+ + +P+ D + R A Sbjct: 359 GSVSEQTICTTDMLATLADI-------LEVKLPENAGEDSYSTLPLLINRPYDFKRPATV 411 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY-QGGFTGTVMQTAGSSV---FNLYTDP 511 H+ +NG A+R ++K LI + G+ + T + G V +NL +DP Sbjct: 412 HHSINGSY-AIRQGDWK---LI---FCAGSGGWPKSDLTPEMASAQGLPVIQLYNLKSDP 464 Query: 512 QESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 E+ ++ ++ + L +M Y++ + P AQ Sbjct: 465 AETVNLYAKYPHIVDRLTVQMQKYIDEGRSTPGEAQ 500 >UniRef50_A6DHS3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHS3_9BACT Length = 524 Score = 137 bits (344), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 114/369 (30%), Positives = 165/369 (44%), Gaps = 69/369 (18%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATI 143 KPN++ L DD+G+ D+ GG + PTP +D +A +G+ T A++ S +PTR I Sbjct: 22 KPNIIFILADDMGYGDMSNEGGLI----PTPHLDRMADEGMKFTDAHTSSSVCTPTRYGI 77 Query: 144 LTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMG--------- 192 LTG+Y+ + G L Q T+ L DQGY T +GKWH+G Sbjct: 78 LTGRYNWRSSKKKGVLSGTSAPLIPQDRVTIANFLKDQGYHTGMVGKWHLGIGWQMLDEA 137 Query: 193 ---------------ENKES------------QPQNVGFDDFRGFNSVSDMYTEWRDVHV 225 NK++ P + GFD F G + DM Sbjct: 138 KKPEKSFLKEGYKMKNNKQAASWKVDYSKPAITPIHNGFDYFYGIAASLDM--------- 188 Query: 226 NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL-D 284 +P V + D++ ++ + R G D T M + D K++ Sbjct: 189 SPYVYIENDKA--VEMATHERGFATPYRPGATGPSFDATYCLMT-----FADKSRKYIAQ 241 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 + A KPFFLY H P+ K+ G SP +T YGD ++E + V + L+K G Sbjct: 242 QAADKSKPFFLYLPLTSPHTPIMPSEKFLGKSPTKTIYGDFVMETDWVVGEVMAELDKQG 301 Query: 345 QLDNTLIVFTSDNG--PEAEVPPH---GRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQ 396 DNTLIVFT+DNG P +P H G +P +RG K +EGG RVP V W +Q Sbjct: 302 IADNTLIVFTADNGCSPTGSIPEHIKIGHSPNGQWRGHKADIFEGGHRVPFLVRWPAQVQ 361 Query: 397 PR-KSDGIV 404 + +SD + Sbjct: 362 TKTQSDSTI 370 >UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186ED10 Length = 570 Score = 137 bits (344), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 122/414 (29%), Positives = 186/414 (44%), Gaps = 61/414 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PN+++ L DD+GW DV F+G TP+IDA+A G+IL S Y +P+RA++ Sbjct: 45 ERPNIIIILADDLGWNDVSFHGSNQI---QTPNIDALAYNGIILNSHYVPALCTPSRASL 101 Query: 144 LTGQY----SIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE-NKES 197 +TG+Y + H +++ P +P GL T +P+ + GY T A+GKWH+G KE Sbjct: 102 MTGKYPTSLGMQHLVILSP---EPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEY 158 Query: 198 QPQNVGFDD-FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD F +N D Y +S Y + F D ++ +G Sbjct: 159 TPIYRGFDSHFGHWNGFQDYYDH---------TTMSDSLKGYDMRRNFEVD--YSYQG-- 205 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN------- 309 Y D+ + +K +D P FLY H N N Sbjct: 206 ---------MYTTDV---FTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPED 253 Query: 310 --AKYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 +K+ + P R Y + ++++ + LEKN L+N++I+F SDNG Sbjct: 254 EISKHECINDPGRKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHS 313 Query: 367 GR---TPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPG 421 R P RG K S WEGGVR T W + R S ++ ++D PT L AG Sbjct: 314 NRGSNYPLRGLKESPWEGGVR-GTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAG-LN 371 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA--EHYFLNGKLAAVRMDEFKY 473 L+ K IDG+D + L + S RK +Y +++ +D +KY Sbjct: 372 YSSTQLINK---IDGIDMWN-VLSNDLPSPRKEVFNNYDEIENYSSLMIDSWKY 421 >UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacteria RepID=A6DR29_9BACT Length = 510 Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 136/487 (27%), Positives = 201/487 (41%), Gaps = 75/487 (15%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATI 143 KPNV++ + DD+GW D GFNG V TP +D +A++GL L YS S SPTRA++ Sbjct: 25 KPNVILIMADDLGWGDTGFNGSKVI---KTPHLDQMAAEGLQLDRFYSASSVCSPTRASV 81 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGL-----TTLPQLLHDQGYVTQAIGKWHMG------ 192 LTG+ P G P QG TLP++L++QGY T GKWH+G Sbjct: 82 LTGR--------NPYRTGVPTANQGFLRPEEITLPEVLNEQGYATGHFGKWHLGTLTHTE 133 Query: 193 -ENKESQPQNVGFDDFRGFNSVSDMY-TEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 + +P N + + D + TE + +P + + K L + Sbjct: 134 KDANRGKPGNTKEFNPPKLHGYEDAFVTESKVPTYDPMILPAKFDQGESKHLGWE----- 188 Query: 251 AVRGGEQQA-----IADITPKYMEDL----DQR-WMDYGVKFLDKMAKSDKPFFLYYGTR 300 V+ GE+ DI K + D D R MD + F+D+ +KPF Sbjct: 189 YVKEGEESKPYGTFYWDIEGKKITDNLKGDDSRVIMDRVLPFIDQAVADEKPFLSVVWFH 248 Query: 301 GCHFDNYPNAK----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H + Y G +Y C+ M++ L K L G DNT+I F SD Sbjct: 249 TPHLPCVAGPRHQEMYKGHPIHLRNYAGCVTAMDEQIGRLRKHLADKGVADNTMIWFCSD 308 Query: 357 NGPEA-EVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTA 413 NGPE+ E P +G FRG K +EGGVRVP + W + + RK +D PT Sbjct: 309 NGPESKERPDNGSAGHFRGRKRDLYEGGVRVPAVMVWPAKVKEARKISAPCITSDYMPTI 368 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKY 473 LD P + + + + ++ F R E + +FK Sbjct: 369 LDALHIPHPQASYATDGRSLMPIINNEDF--------TRDKEIGIMFSSRIVWHKGDFKL 420 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 Y GG ++NL +DP E + ++ + L+ +M Sbjct: 421 ------------LSYNGG--------KKYELYNLKSDPSEKTDVAAQNPELVEKLKKDML 460 Query: 534 AYMEILK 540 A+ E +K Sbjct: 461 AWHESVK 467 >UniRef50_A9UP45 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9UP45_MONBE Length = 339 Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 103/340 (30%), Positives = 156/340 (45%), Gaps = 44/340 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 K+PN+V + DD+GW DV +G PTP IDA+A G+ LT+ + QP +PTR+T Sbjct: 31 KRPNIVFIVADDLGWNDVSLHGSPQI---PTPHIDAIAHSGVHLTNYHVQPVCTPTRSTF 87 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKE-SQPQN 201 L+G++ IH GI MP G L T LP L GY T A+GKWH+G+N E + P Sbjct: 88 LSGRHVIHTGIYMPFAQGTALRLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTG 147 Query: 202 VGFDDFRGFNS-VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD++ G+ S D YT D H + D +E ++ ++ ++A+ Sbjct: 148 RGFDEYLGYWSGAEDYYTH--DTHGGYDFQ---DGTE----CAIKYNNTYSTYIFAERAV 198 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS----- 315 I A ++P FLY + H+ A+Y Sbjct: 199 NTILE---------------------ADPEQPLFLYTAFQNVHWPLEAPAEYVARFSHIP 237 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG--PEAEVPPHGRTPFRG 373 + R ++D N+ L+++ DNT+++FTSDNG E P RG Sbjct: 238 NSERQYVAAMTSILDDAVGNITDALKRSRIADNTILIFTSDNGGPVHDENTESNNYPLRG 297 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPT 412 K + W GG +V + KG+ P G++ +D P+ Sbjct: 298 GKNTLWNGGTQVVGMIAGKGIENPGTDCHGMMHASDWLPS 337 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 110/361 (30%), Positives = 166/361 (45%), Gaps = 57/361 (15%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 K+PN +V DD+G+ D+G G TP +D +A +G+ T+ YS P SP+RA+ Sbjct: 15 KQPNFIVIYCDDLGYGDLGCYGSDTV---KTPHLDGLADEGIRFTNWYSNSPVCSPSRAS 71 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +LTG+Y G+ + G G GL TL + L GY T GKWH+G ++E+ Sbjct: 72 LLTGKYPARAGV--GEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEETS 129 Query: 199 PQNVGFDDFRGFNS-VSDMYTE---WRDVH-VNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 P GFD+F GF + D Y+ W H VNP L + +E + Sbjct: 130 PNAHGFDEFFGFKAGCVDFYSHIFYWGQAHGVNPLHDLWENETEVWE------------- 176 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY- 312 +YM +L + V F+ + + + PFFL+ H+ + KY Sbjct: 177 ----------NGRYMTELI---TERSVDFIQRSREQEAPFFLFASYNAPHYPMHAPQKYM 223 Query: 313 ---AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP---- 365 A R + ++D + K L++ G ++T+I F+SDNGP +E Sbjct: 224 DRFAHLPWDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDG 283 Query: 366 -----HGRTP--FRGAKGSTWEGGVRVPTFVYW-KGMIQPRKSDGIVDLADLFPTALDLA 417 +G + FRG K S +EGG+R P + W G + D + + DL PT LDLA Sbjct: 284 TEDVYYGGSAGIFRGHKASLFEGGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLA 343 Query: 418 G 418 G Sbjct: 344 G 344 >UniRef50_C6Y1U6 Sulfatase n=2 Tax=Sphingobacteriales RepID=C6Y1U6_PEDHD Length = 523 Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 149/524 (28%), Positives = 225/524 (42%), Gaps = 71/524 (13%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 ++ KL ++K K PN+V L DD+G+ D+ G V TP ID +A QG+ T A Sbjct: 26 SKTKLQAQQQK--KLPNIVYILADDLGYGDIKIYNAGAKVN--TPHIDKLAEQGMRFTDA 81 Query: 131 YSQPS-SSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIG 187 ++ S +P+R +ILTG+Y + + + G L +GL T+ LL Y T IG Sbjct: 82 HTTSSVCTPSRYSILTGRYPWRSRLPVGVLRGYSRTLIEEGLPTVAGLLKTSSYRTAVIG 141 Query: 188 KWHMGEN---KESQPQNV--GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 KWH+G + KE+ ++ F+ R + +M + D P +Y L Sbjct: 142 KWHLGLDWMPKEAFKDSINPAFNKDRLYGITDEMNPDQIDFGRAPVRGPRTQGFDYSYVL 201 Query: 243 PFSKD--------------DVHAVRGGEQQAIADITPKYMEDLDQRWMD-YGV------- 280 P S D + G + A P + L D YGV Sbjct: 202 PASLDMPPYAYLENDQLTEPLTGYTPGNKLASGYTGPFWRAGLKSPSFDFYGVLPAFTNK 261 Query: 281 --KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYK 338 F+ K A + PFFLY+ H P A+Y G S A YGD + E++ + + Sbjct: 262 ATDFIKKEAATKNPFFLYFPMPAPHTPWMPTAEYRGKSQA-GEYGDYLQEVDAAVGKILQ 320 Query: 339 TLEKNGQLDNTLIVFTSDNGPE------AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWK 392 L+ G NTL+VFTSDNGP + H PFRG KG +EGG RVP V + Sbjct: 321 VLDSLGLSKNTLVVFTSDNGPYWRDDFVQQYGHHAAGPFRGMKGDAYEGGHRVPFIVRYP 380 Query: 393 GMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 G ++ S+ LA+L T DL G+ + + D S G++ Sbjct: 381 GKVKAGTISNVTTTLANLMATCADLTGNHAVQ----------FETEDSYSILPVLLGKAA 430 Query: 452 RKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVM------QTAGSSVF 505 AE + A V + ++ + + P+ GGF+ + Q AG ++ Sbjct: 431 GIAE------QPAIVNISSKGFYDIRKGPWKLITGLGSGGFSVPSIVKAPEGQAAG-QLY 483 Query: 506 NLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 NL TD +E ++ R+ P V E+ A +E +K P + K Sbjct: 484 NLDTDIKEETNLYSRY-PEKV---KELSALLEKIKAAPKGKRAK 523 >UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteroidetes RepID=A4AQQ7_9FLAO Length = 596 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 118/368 (32%), Positives = 163/368 (44%), Gaps = 68/368 (18%) Query: 75 LAELEKKTGKK------PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 + EKKT +K PNVV+ + DD GW D+ FNG TP+IDA+A G Sbjct: 20 IVSCEKKTKEKNEIQTKPNVVLIMTDDQGWGDLSFNGN---TNLSTPNIDAIAKNGASFQ 76 Query: 129 SAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 + Y QP SPTRA +LTG+Y+ G+ G+ + TT+ ++ GY T A GK Sbjct: 77 NFYVQPVCSPTRAELLTGKYAARLGVYSTSTGGERFNSKE-TTIAEIFKKAGYKTTAYGK 135 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH G P + GFDD+ GF S W + + +P + E +K Sbjct: 136 WHSGMQPPYHPNSRGFDDYYGFTS-----GHWGN-YFSP---MLEHNGEIVK-------- 178 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH----- 303 GE + D+T K G+ F+ + + PFFLY H Sbjct: 179 ------GEGFLVDDLTNK------------GLDFITE--NKNNPFFLYLPYNTPHSPMQV 218 Query: 304 ----FDNYPNAK----YAGSSPARTSYGD---CMVEMNDV-FANLYKTLEKNGQLDNTLI 351 ++ + K Y G+ ++ MVE D L L++ G +NT+I Sbjct: 219 PNEYWERFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNMGRLTNKLKELGLEENTII 278 Query: 352 VFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLF 410 V+ SDNGP G RG KGST EGGVR P F+ WK I + +K I D+ Sbjct: 279 VYLSDNGPNGWRWNGG---MRGRKGSTDEGGVRSPFFIQWKNTIPKNKKISQIAGAIDIL 335 Query: 411 PTALDLAG 418 PT LAG Sbjct: 336 PTLTSLAG 343 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 129/427 (30%), Positives = 185/427 (43%), Gaps = 81/427 (18%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 KPN +V L DD G+ D+ G A TP ID +A++G LTS Y + P +P+RA + Sbjct: 570 KPNFIVILTDDQGYGDLSCFG---AKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRAGL 626 Query: 144 LTGQY--------SIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGEN 194 +TG Y + G+L+ G P GL T+ ++L GY T GKWH+G+ Sbjct: 627 MTGCYPKRIDMAMGSNFGVLL---AGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQ 683 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY-IKQLPFSKDDVHAVR 253 E P GFD+F G D++ P ++ Y LP ++D Sbjct: 684 PEFLPTKQGFDEFFGIPYSHDIH------------PFHPRQNHYHFPPLPLLQNDT---- 727 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 + ++ P + L +R + V F+++ D+PFFLY P+A Sbjct: 728 ------VIEMDPD-ADFLTKRLTEQAVSFIER--NKDQPFFLYLP------HPIPHAPLH 772 Query: 314 GSSPARTSYGD---CMVEMND------VFANLYK---------------TLEKNGQLDNT 349 S P D +E D ANL++ L NG + T Sbjct: 773 ASPPFMEGVADDVIAAIEKEDGNIDYATRANLFRQAIAEIDWSVGQILDALRSNGLDEKT 832 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLAD 408 +++FTSDNGP RG KG+T+EGG+R PT V W G I ++D ++ D Sbjct: 833 MVLFTSDNGPPKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMD 892 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 L PT LAG +P IDG D G Q+ A Y +LAAVR Sbjct: 893 LLPTFAKLAG-------AAIPTDRVIDGKDIWPTLKGET-QTPHDAFFYHRGNQLAAVRS 944 Query: 469 DEFKYHV 475 ++K HV Sbjct: 945 GKWKLHV 951 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 111/366 (30%), Positives = 176/366 (48%), Gaps = 51/366 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 + PNVV+ +DD+G+ D+G G A TP+ID +A++G T A+S + +P+R Sbjct: 38 RPPNVVLIFVDDLGYGDLGCYG---ATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYG 94 Query: 143 ILTGQYSIH----HGILMPPMYGQPGGL---QGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 +LTGQY + GI P GL T+ ++ ++GY T +GKWH+G + Sbjct: 95 LLTGQYPVRAMGGQGIWGP--LPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKE 152 Query: 196 ESQ---------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY-------I 239 E PQ+VGFD + G V+ V+VN + D S+ + Sbjct: 153 EPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPY---VYVNDDSIFGYDPSDPLVYGGKPV 209 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWM--DYGVKFLDKMAKSDKPFFLYY 297 P ++ +V+ + + A + +D + + VK++ + K ++PFFLY+ Sbjct: 210 SPTPMFPEEA-SVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFLYF 266 Query: 298 GTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 T H P ++ G+S YGD + E++ + + ++LE NG DNTL++FTSDN Sbjct: 267 ATPNIHHPFTPAPRFKGTSQCGL-YGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTSDN 325 Query: 358 GPEAEVPPHGRTPFR----------GAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDL 406 G A + GR + G K WEGG RVP W G I+ +SD ++ Sbjct: 326 G--AMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQ 383 Query: 407 ADLFPT 412 DLF T Sbjct: 384 VDLFAT 389 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 116/373 (31%), Positives = 169/373 (45%), Gaps = 68/373 (18%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATIL 144 PN++ L+DD+G+ D+ G A +P+IDA+A+ G+ T+ Y+ + SP+RA +L Sbjct: 42 PNILCILVDDLGYGDLSCQG---ATDLQSPNIDALAANGMRFTNFYANSTVCSPSRAALL 98 Query: 145 TGQYSIHHGILMPPMYGQPGGLQ------------GLTTLPQLLHDQGYVTQAIGKWHMG 192 TG+Y P + G PG ++ +P L+ GY T IGKWH+G Sbjct: 99 TGRY--------PDLVGVPGVIRQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLG 150 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + P + GF F+GF + DM ++ D R I + +++ Sbjct: 151 LEEPDTPNDRGFTYFKGF--LGDMMDDYWD-----------HRRGGINWMRLNRE----- 192 Query: 253 RGGEQQAIADITPK-YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 +I PK + DL + D+ + FL + ++PFFLY HF P + Sbjct: 193 ---------EIDPKGHATDL---FTDWTIDFLKERQGEEQPFFLYLAYNAPHFPIQPPRE 240 Query: 312 YAGSSPART-------SYGDCMVEMNDV-FANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + R + VE D + + L+ G +NTL+VF SDNG A Sbjct: 241 WLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNG-GALW 299 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGH-PG 421 P RG K +EGG+RVP YWKG I P SD L DLFPT +LAG P Sbjct: 300 YAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPP 359 Query: 422 AKV--ANLVPKTT 432 V +LVP T Sbjct: 360 ENVDGISLVPTLT 372 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 123/459 (26%), Positives = 188/459 (40%), Gaps = 61/459 (13%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 L L +PN++V + DD+G+ DVGFNG PTP ID++A G+ T+ Y+ Sbjct: 10 LISLNAICASRPNLIVIMADDLGYNDVGFNG---CTEIPTPGIDSIAQNGVKFTNGYTSY 66 Query: 135 S-SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKW 189 S P+RA +TG+Y G P + L T+ + L GY IGKW Sbjct: 67 SVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKW 126 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G +P GFD+F G H+ P+ I+ K+++ Sbjct: 127 HLGAEPSLRPNKRGFDEFFG--------------HLGGGHRFMPE-DLVIQHTEEVKNEL 171 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 + R + D K + L + + D V F+ + KPFFL+ H Sbjct: 172 DSYRSWITR--NDTPVKTTKYLTEEFSDEAVSFIKR--NHQKPFFLFLSYNAPHLPLQAT 227 Query: 310 AKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 KY P R +Y + ++D + + ++L++ DNT++ F SDNG + Sbjct: 228 EKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPSHKN 287 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K WEGG RVP + + IQ ++ D V D+F T LA P Sbjct: 288 KSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHA 347 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 L DGV+ F G Q A H ++ + D+ +Y V Sbjct: 348 DKPL-------DGVNLIPFITGEKTQ----APH----AQIFIRKFDQSRYVV-------- 384 Query: 484 TQSGYQGGFTGTV-MQTAGSSVFNLYTDPQESDSIGVRH 521 QG F + + A ++NL D E ++I H Sbjct: 385 ----RQGDFKLVIPYKDAPPQLYNLSKDIGEENNIAAVH 419 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 112/365 (30%), Positives = 171/365 (46%), Gaps = 64/365 (17%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 K+PN++VF DD+G+ D+G G TP +D +AS+G+ T+ YS P SP+RA+ Sbjct: 2 KRPNIIVFYCDDLGYGDLGCYGSDAM---KTPHLDQLASEGIRFTNWYSNSPVCSPSRAS 58 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +LTG+Y G+ + G G +GL TTL L + GY T GKWH+G + E Sbjct: 59 LLTGKYPAKAGV--TSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAEYG 116 Query: 199 PQNVGFDDFRGFNS-VSDMYTE---W-RDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 P GFD F GF + D Y+ W + VNP VH + Sbjct: 117 PNAHGFDQFYGFRAGCIDYYSHIFYWGQGGGVNP---------------------VHDLW 155 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY-PNA-- 310 E + + +YM + R ++D A D+P+F+Y H+ + P A Sbjct: 156 RNETEVWEN--GEYMTEAITR---EATSYIDA-APDDEPYFMYVAYNAPHYPMHAPKAYL 209 Query: 311 -KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP----- 364 ++ P R + ++D + K L++ G ++T+I F+SDNGP E Sbjct: 210 DRFPDLPPDRRIMAAMIAAVDDGVGEIVKALKQKGAYEDTIIFFSSDNGPSTESRNWLDG 269 Query: 365 --------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI---QPRKSDGIVDLADLFPTA 413 GR FRG K S +EGG+R P + + + Q + SD + + D+FPT Sbjct: 270 TEDLYYGGSAGR--FRGHKASLFEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTM 327 Query: 414 LDLAG 418 L+L+G Sbjct: 328 LELSG 332 >UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PU94_9SPHI Length = 443 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 140/469 (29%), Positives = 205/469 (43%), Gaps = 73/469 (15%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAY 131 LA +PN ++ +DD+G+ DVG N GNP TP++D +A +G+ ++ Y Sbjct: 15 LAVFNSSAQTQPNFIIIYVDDMGYGDVGIN------GNPNIETPNLDRMAMEGMRFSNYY 68 Query: 132 S-QPSSSPTRATILTGQYSIHHG---ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 S P+ + +R +LTG+Y G +L P Q G Q +T+ + L ++GY T G Sbjct: 69 SASPACTASRYALLTGKYPSRAGFRWVLNPT--DQIGIHQQESTIAERLKEKGYRTAIYG 126 Query: 188 KWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 KWH+G KE P GFD++ G +DM P++AL S Y L + Sbjct: 127 KWHLGSTRKEFLPLANGFDEYVGLPYSNDMIPP-----KYPDIAL---LSGY-DTLELNP 177 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 D R ++AIA F+ K AK +PFF+Y H Sbjct: 178 DQSKLTRLYTEKAIA--------------------FITKNAK--QPFFIYLPYAMPHTPL 215 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 + + + G S R YGD + E++ L L++N T +VFTSDNGP + Sbjct: 216 HASEDFLGKS-KRGLYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNGPWLIQNQN 274 Query: 367 GRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTALDLAGHPGA 422 G + FR KGSTWEGG+R P F+ W P+ + + D+ PT LAG Sbjct: 275 GGSAGLFRDGKGSTWEGGMREPFFL-WGHHTIPKGYVENEVFTALDMLPTITALAG---- 329 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPY 481 + IDG + + G R YF L+ +L AVR +K HV Sbjct: 330 ----ISAGPNKIDGTNLKPLWSGKKDTKGRDEFFYFGLDHQLMAVRKGPWKLHV-----K 380 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 Y+Q G +FNL DP E ++ ++ M L T Sbjct: 381 TYSQLGL------VYFDKQLPLLFNLDHDPSEKYNLASQYPEMVSDLTT 423 >UniRef50_B7PTL2 Arylsulfatase B, putative (Fragment) n=1 Tax=Ixodes scapularis RepID=B7PTL2_IXOSC Length = 406 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 120/387 (31%), Positives = 176/387 (45%), Gaps = 61/387 (15%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 E + + PN+V+ L DD+GW DV ++G PTP+ID +A G+IL Y QP S Sbjct: 20 EATSASSQPPNLVMILADDLGWHDVSYHGSDQI---PTPNIDVLAMDGIILFHNYVQPLS 76 Query: 137 SPTRATILTGQYSIHHGILMPPM-YGQPGGLQG-LTTLPQL---LHDQGYVTQAIGKWHM 191 +PTRA +LTG Y IH G + P GL T LPQL L D A WH+ Sbjct: 77 TPTRAALLTGLYPIHTGTQRLDIGSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHL 136 Query: 192 GENK-ESQPQNVGFDDFRG-FNSVSDMYTEW-RDVHVNPEVALSPDRSEYIKQLPFSKDD 248 G K E +P GFD F G +N SD +T + RD +++ Sbjct: 137 GFCKDEFKPTKRGFDTFYGIYNGDSDYWTHFARDNNIDVS-------------------- 176 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 HA++ E++A+ + +Y+ L + V+ + K +KPFFLY+ H Sbjct: 177 GHALK-DEKRALVEEAGRYLTSL---LANQAVQLIHNRPK-NKPFFLYFAPTAVHCGGSN 231 Query: 309 NAKYAGSSPA----------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 + A R + + E++ + + L +GQL+NT+I F++DNG Sbjct: 232 GSLQAPKEYISKFGYLADYDRQLFAGSLAELDKSVGLIVEALYVSGQLNNTVIAFSTDNG 291 Query: 359 P-----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM-IQPRKSDGIVDLADLFPT 412 A P+ P RG KG+ EGGVR P F++ + + R + + + D PT Sbjct: 292 GAPVGFSANTSPN--WPLRGTKGTVAEGGVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPT 349 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQ 439 AG + T IDGVDQ Sbjct: 350 FYTAAGGQAKDL-------TMIDGVDQ 369 >UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN Length = 527 Score = 135 bits (339), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 140/511 (27%), Positives = 222/511 (43%), Gaps = 87/511 (17%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATIL 144 PN++ L DD+G+ D+ + TP +D +A G+I T A+S S +PTR +L Sbjct: 25 PNIIYILADDMGYGDIRALNPECKIA--TPHLDQLAHGGMIFTDAHSSSSVCTPTRYGVL 82 Query: 145 TGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGE--------- 193 TG+Y+ + ++G L T+P +L + GY T +GKWH+G Sbjct: 83 TGRYNWRSRLKSGVLWGLSRRLIEPDRETVPSMLKEHGYYTACVGKWHLGMDWSLKQGGF 142 Query: 194 ------NKESQP--------------QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP 233 NK++ P +VGFD F G ++ DM P V + Sbjct: 143 ATEQSYNKKTNPGWDVDYSKPIQNGPNSVGFDYFFGISASLDM---------PPYVYIEN 193 Query: 234 DRSEYIKQLP---FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA--- 287 DRS+ I + F H + +AI D+ P R D V+ +D+ A Sbjct: 194 DRSQGIPTVTKAFFRDGPAHK----DFEAI-DVLP--------RITDKTVQIIDEHAAAS 240 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 K KPFF+Y+ H P ++ G S +Y D +++++D + + L+K G + Sbjct: 241 KEGKPFFIYFPLNAPHTPILPTPEWQGKS-GINAYCDFVMQVDDTVGQVMQALKKQGIHE 299 Query: 348 NTLIVFTSDNG--PEA---EVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQP-R 398 NTL++FT+DNG P A E+ P FRG K +EGG RVP W I+ Sbjct: 300 NTLVIFTADNGCSPAANFKEMTDKDHQPSYQFRGHKADIYEGGHRVPFIANWPARIKAGT 359 Query: 399 KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE-HY 457 SD + L DLF TA D+ GAK VP D V GT R+A H+ Sbjct: 360 HSDQLTCLTDLFATAADIV---GAK----VPDDAGEDSVSILPAMEGTAHTPLREAAVHH 412 Query: 458 FLNGKLAAVRMDEFKYHVLI-QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 + G ++R D +K + +++ + G + + +++L D E + Sbjct: 413 SIRGAF-SIRKDHWKLELCPGSGGWSFPKPG-----KDNLSELPAIQLYDLNHDAGEQKN 466 Query: 517 IGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 + H + L T + +Y + + P + Q Sbjct: 467 VQAEHPEVVKELTTLLQSYADRGRSTPGKPQ 497 >UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKM2_9BACT Length = 472 Score = 135 bits (339), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 114/404 (28%), Positives = 177/404 (43%), Gaps = 95/404 (23%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRA 141 +KPN+++ L DD+G +G G TP+IDA+A++ + +AYS + +P+RA Sbjct: 17 AQKPNIILILADDLGGAGLGCYGNEFF---GTPNIDALAAKSMRFDNAYSGSTVCAPSRA 73 Query: 142 TILTGQYSIHHGI-----------------------LMPPM--YGQPGGLQGLTTLPQLL 176 +++GQY H I L+ P+ Y P +G TL Q Sbjct: 74 CLMSGQYVGRHKITWVSQFQRDYIKKKRGPNLNGFRLLQPVHPYHMP---EGTITLGQAF 130 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 D GY T GKWH+G + QP +GFD++ F + H P Sbjct: 131 KDAGYATAMFGKWHLGHRPQDQPDKMGFDEYLTFQGMK---------HFAPYT------- 174 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 LP + V+ GE+ + D+T D + F+++ ++KPFFLY Sbjct: 175 -----LP------NKVQHGEKVYLTDLT-----------CDKAIDFMERKVAAEKPFFLY 212 Query: 297 YGTRGCH--------FDNYPNAKYAGSSPARTSYGDCMVE-MNDVFANLYKTLEKNGQLD 347 Y H Y K G ++ G M + ++D L K +++ G + Sbjct: 213 YPDFLVHAPMEAKQAMIQYFEKKTIGQH-HKSVIGAAMTKHLDDTVGRLVKKVDELGIAE 271 Query: 348 NTLIVFTSDNGPEAEVPPHG-------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK- 399 NT+I+FTSDNG G P+R AK S +EGG RVP +W G+ + Sbjct: 272 NTIIIFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSSHYEGGSRVPLIFHWPGVTEANSL 331 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFF 443 S +V D++PT L + A+VA P+ +DG+D +S Sbjct: 332 SHEVVSGIDIYPTLLKI-----AQVAK--PQEQILDGIDFSSIL 368 >UniRef50_B4QDF1 GD10911 n=2 Tax=melanogaster subgroup RepID=B4QDF1_DROSI Length = 633 Score = 134 bits (338), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 114/386 (29%), Positives = 169/386 (43%), Gaps = 85/386 (22%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 AE EK K PN++ L DD+G+ DVGF+G PTP+IDA+A G+IL Y P Sbjct: 17 AENEKPPAK-PNIIFILADDLGFNDVGFHGSAEI---PTPNIDALAYSGIILNRYYVAPI 72 Query: 136 SSPTRATILTGQYSIHHGILMPPMY-GQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +P+R+ ++TG+Y IH G+ +Y +P GL LPQ L++ GY + GKWH+G Sbjct: 73 CTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGH 132 Query: 194 NK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 K + P GF G + + Sbjct: 133 WKLKYTPLYRGFSSHWGLD----------------------------------------M 152 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN------ 306 R G Q A D+ Y D+ ++ VK + + P FLY CH N Sbjct: 153 RNGTQVAY-DLHGHYTTDVIT---EHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLP 208 Query: 307 -----------YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 PN K R + + +M+D + L K+ L+N++I+F+S Sbjct: 209 VPDNDVIKMSHIPNYK-------RRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSS 261 Query: 356 DNGPEAE---VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLF 410 DNG A+ + P +G K + WEGGVR + W +++ R S+ + + D Sbjct: 262 DNGGPAQGFNLNFASNYPLKGVKNTLWEGGVRAAGLM-WSPLLKKSQRVSNQTMHIVDWL 320 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDG 436 PT L+ AG A +ANL + IDG Sbjct: 321 PTLLEAAGGQPA-LANLSKQ---IDG 342 >UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKN7_9BACT Length = 465 Score = 134 bits (338), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 113/406 (27%), Positives = 188/406 (46%), Gaps = 53/406 (13%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 + +K N+++ DD+ + +G G V TP ID++ ++G+ + Y S + +P+R Sbjct: 16 SAEKTNIILIFADDMHYGALGVTGS-VLTKAKTPAIDSIFNEGVHFPNGYASHATCAPSR 74 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQ-------GLTT----LPQLLHDQGYVTQAIGKW 189 A +LTG+Y + PGG G+ T +P L+ GY T AIGKW Sbjct: 75 AGLLTGRYQARFDLET-----LPGGTADRKKTGYGVKTSEIMIPALMKKGGYQTCAIGKW 129 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD-D 248 H+G ++E QP GFD + G+ Y V S + + +K LP +D + Sbjct: 130 HLGSSEEFQPNARGFDHWFGYRGSCGFYQFKSQVQ-------SAKKGQELKPLPSGEDPN 182 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-DNY 307 + VR GE + + D W+ ++PFF+Y+ H D Sbjct: 183 LDVVRNGESVRLEGYLTDHFSDEAANWIK---------ENKERPFFMYFAPYNVHAPDTV 233 Query: 308 PNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 PN KY T++ + ++ + L++ G DNTL+VF++DNG + + + Sbjct: 234 PN-KYI--PKGGTAHDGVIAALDASVQTILDALKEAGIADNTLVVFSNDNGGKKD---YS 287 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYW-KGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 +T F+G K + +EGG+RVP + W KG+ K +G+V DL PT L AKV Sbjct: 288 KT-FKGNKATFYEGGIRVPFAMRWPKGIEAGSKYNGVVSTLDLLPTFAAL-----AKVD- 340 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 +P DG Q + + +++ H++ NG R+ ++K Sbjct: 341 -LPSDRVYDG--QNLLPVIKDSAKDQRQAHFWRNGAWRTARVGDWK 383 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 134 bits (338), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 115/386 (29%), Positives = 176/386 (45%), Gaps = 71/386 (18%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 K+ +PN++V L DD+G+ D+ G V +P+ID A +GL LTS Y+ P+ SP+ Sbjct: 30 KSPARPNIMVVLCDDLGYGDLACYGHPVIQ---SPNIDRFAKEGLKLTSCYAAHPNCSPS 86 Query: 140 RATILTGQYSIHHGI-----LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM--- 191 RA ++TG+ GI ++ PM+ + + T+ LL GY T +GKWH+ Sbjct: 87 RAGLMTGRTPFRVGIYNWIPMLSPMHVRKREI----TIATLLRQAGYATCHVGKWHLNGM 142 Query: 192 -GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 + QP + GFD + F++ ++ H NP + R V Sbjct: 143 FNMVGQPQPSDHGFDHW--FSTQNNALP----THENPFNFVRNARP------------VG 184 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------- 303 ++G Q +AD +++ L + +KPFF++ H Sbjct: 185 PLQGFASQLVADEAEEWLTQLRDK---------------EKPFFMFVCFHEPHEPIASAE 229 Query: 304 -FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE-A 361 F A + PA +G+ + +M+D F + KTL+ +NTLI+FTSDNGP Sbjct: 230 RFRKLYTAPEGSTLPAH--HGN-VTQMDDAFGRILKTLDDQKLRENTLIIFTSDNGPAIT 286 Query: 362 EVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGH 419 PHG + P R KG+T+EGG+RVP V W +QP SD V D+ PT +A Sbjct: 287 RRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGVDILPTLCAVADI 346 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLG 445 P P +DG + G Sbjct: 347 PA-------PTDRVLDGTNILPLLEG 365 >UniRef50_B7AM73 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AM73_9BACE Length = 523 Score = 134 bits (338), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 129/489 (26%), Positives = 219/489 (44%), Gaps = 78/489 (15%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS-PTRAT 142 +KPNV+ + DD+G D+ G A TP+ID +A QG+ T+AY+ S+S P+R Sbjct: 34 QKPNVIYLISDDLGIGDLSCYG---ATKVSTPNIDRLAGQGMQFTNAYATSSTSTPSRFG 90 Query: 143 ILTGQYSIHH---GILMPPMYGQPGGLQ-----GLTTLPQLLHDQGYVTQAIGKWHMGE- 193 +LTG Y GI PG + T+ + ++GY T A+GKWH+G Sbjct: 91 LLTGMYPWRQENTGI-------APGNSELIIDTACVTMADMFKEEGYCTGAVGKWHLGLG 143 Query: 194 -------NKESQP--QNVGFD-DF---RGFNSVSDMYTEWRDVHV---NPEVALSPDRSE 237 N E +P Q++GFD +F + V ++ E + HV +P+ ++ + + Sbjct: 144 PKGGTDFNHEIKPNTQDIGFDYEFIIPATVDRVPCVFVE--NAHVVGLDPKDPITVNYNH 201 Query: 238 YIKQLPFSKDDVHAVR----GGEQQAIADITPK--YMED-LDQRWMDYGVKFLDK----- 285 + P ++ +V+ G I + P+ +M W+D + + Sbjct: 202 KVGDWPTGLENPESVKMKPSQGHNNTIINGIPRIGWMTGGKSALWVDEDIADIITGKAKD 261 Query: 286 --MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 ++ ++PFFLY GT+ H P+ ++AG S GD +++++ + +TL+ Sbjct: 262 FIISHKNEPFFLYMGTQDVHVPRVPHPRFAGKS-GLGPRGDVILQLDWTVGEIMRTLDSL 320 Query: 344 GQLDNTLIVFTSDNGP--------EAEVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWK 392 DNT+ VF SDNGP +A +G TP +RG K S+++ G R+P V W Sbjct: 321 NIADNTIFVFCSDNGPVIDDGYQDQALELLNGHTPMKHYRGGKYSSFDAGTRIPFIVRWP 380 Query: 393 GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 I+P K + + D++ + L H +P D DQ FLGT+ Sbjct: 381 NGIKPGKQQALFSMIDVYASFAALLDHQ-------LPTGVAPDSRDQLDSFLGTDTAGCN 433 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 LN L+ ++ + +KY +P + + G ++NL DP Sbjct: 434 YIVQQNLNNTLSIIQHN-WKYIEPSNKPALEYWTKIEMG------NNPEPQLYNLSIDPS 486 Query: 513 ESDSIGVRH 521 E +++ H Sbjct: 487 EKNNVAKDH 495 >UniRef50_A4A218 Arylsulfatase A n=2 Tax=Bacteria RepID=A4A218_9PLAN Length = 491 Score = 134 bits (337), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 116/408 (28%), Positives = 193/408 (47%), Gaps = 54/408 (13%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 K PN+V+F +D++G D+G G + + TP ID +A++G TS Y + +P+RA Sbjct: 37 AKPPNIVLFFVDNLGTGDIGCYGSTL---HRTPHIDRLAAEGAKFTSFYVASGVCTPSRA 93 Query: 142 TILTGQYSIH---HGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGEN 194 ++TG Y + H +P +GL TT+ ++LH GY T GKWH+G+ Sbjct: 94 ALMTGCYPLRVDMHKSGEGVAVLRPLDTKGLNPKETTMAEVLHSVGYATGIFGKWHLGDQ 153 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E P GFD F G DM + R + PE+ L R E + + P +D + V+ Sbjct: 154 PEFLPTQQGFDTFFGIPYSDDMTKDLRP-QLWPELPLM--RDEQVIEAPVDRDLL--VKR 208 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY--YGTRGCHFDNYPNAKY 312 ++AIA F+++ ++PFF+Y + G + + + Sbjct: 209 CTEEAIA--------------------FIEQ--NQERPFFVYIPHTMPGSTKRPFSSPAF 246 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PF 371 G S YGD + E++ + +TL++ + TL+++TSDNG PP G P+ Sbjct: 247 QGKS-KNGPYGDSVEELDWSTGQVMETLKRLDLDEQTLVIWTSDNGAPHRNPPQGSNLPY 305 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 +G +T EG +R+P + W G I + +D + DL PT LAG +K Sbjct: 306 QGDGYNTSEGAMRMPCVMRWPGKISAGQINDALCTTMDLLPTFGKLAGATMSK------- 358 Query: 431 TTFIDGVDQTSFFLGTNGQS---NRKAEHYFLNGKLAAVRMDEFKYHV 475 T IDG + + LG + + + K ++ +L A+R +K ++ Sbjct: 359 -TEIDGHEISRILLGESDTASPWDDKGFAFYYMDQLQAIRAGRWKLYL 405 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 134 bits (337), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 126/459 (27%), Positives = 206/459 (44%), Gaps = 63/459 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 ++PN+V L DD+G+ DVGFNG + TP+ID +A +G+I Y+ S +P+R++ Sbjct: 28 QRPNIVFILADDLGYGDVGFNGQKLI---KTPNIDKLAKEGMIFNQFYAGTSVCAPSRSS 84 Query: 143 ILTGQYSIHHGIL----MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKES 197 +LTGQ++ H I + P QP +TTL ++L GYVT A GKW +G E Sbjct: 85 LLTGQHTGHTYIRGNKGVEPEGQQPIA-DSVTTLAEVLKKSGYVTAAFGKWGLGPVGSEG 143 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRD-VHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD F G+N S + + + + N + L I ++ D + + Sbjct: 144 DPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGLIHNKEYAPDLI------Q 197 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT------RGCHFDNYPNA 310 ++A++ + + + ++ Y + + + D F Y G +G + N Sbjct: 198 KKALSFVNAQDGKQPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGANG 257 Query: 311 K-YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR- 368 YA ++ + ++ + L+K G NTL++FTSDNGP E R Sbjct: 258 GGYASQDFPHATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGADPRF 317 Query: 369 ----TPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAK 423 FRG K +EGG+R P W I+P KSD I D+ PT +LA P Sbjct: 318 FNSGAGFRGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELANAP--- 374 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 P+ IDG+ SF G++ +K Y +++H Sbjct: 375 ----APRN--IDGI---SFTDALKGKAIQKKHDYLY-----------WEFH-----EQGG 409 Query: 484 TQSGYQGGFTGTVMQTAGS-----SVFNLYTDPQESDSI 517 Q+ QG + ++ AG+ +++L DPQE +++ Sbjct: 410 RQAVRQGNWKAVRLKAAGNPDALVELYDLSKDPQEKNNL 448 >UniRef50_Q7UNN1 Arylsulphatase A n=3 Tax=Bacteria RepID=Q7UNN1_RHOBA Length = 529 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 130/517 (25%), Positives = 221/517 (42%), Gaps = 80/517 (15%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 L+E +PNV+V + DD+G+ D+G G A G TP+ID +AS+G TS Y Sbjct: 34 LSETSAADNDRPNVIVVMADDLGYGDIGCYG---AKGLETPNIDQMASEGCRFTSGYCSA 90 Query: 135 SS-SPTRATILTGQYSIH--HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWH 190 S+ +PTR + LTG Y+ + + PP P + G TT ++L + GY T IGKWH Sbjct: 91 STCTPTRYSFLTGTYAFRFPNTGIAPP--NSPALIPAGTTTTARILKNAGYKTAVIGKWH 148 Query: 191 MGENKESQ-----------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVA--LSPDRSE 237 +G ++++ P +GFD + +D + V+VN L P Sbjct: 149 LGLGEKNEGPDWNGDLKPGPLEIGFDHCILLPTTNDRVPQ---VYVNDHNVENLDPADPL 205 Query: 238 YIKQLPFSKDD--------------VHAVRGGEQQAIADI-------TPKYM-EDLDQRW 275 ++ S+D H I+ I ++ EDL RW Sbjct: 206 WVGNKKPSEDHPTGITHRDTLKMDWSHGHNSTIHNGISRIGFYTGGHAARFRDEDLSDRW 265 Query: 276 MDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFAN 335 ++ +++ ++PFFL++ + H + ++ GS+ GD + E++ Sbjct: 266 VEESKRWI--AENREEPFFLFFASHDLHVPRVVHERFQGSTKL-GPRGDAIAELDWCVGE 322 Query: 336 LYKTLEKNGQLDNTLIVFTSDNGPEAE-------------VPPHGRTPFRGAKGSTWEGG 382 L K+LE+NG + T++VF SDNGP + P+G P++G K + +EGG Sbjct: 323 LMKSLEENGLTEKTMLVFCSDNGPVLDDGYKDDANEKLGNHDPNG--PYQGGKYTVYEGG 380 Query: 383 VRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSF 442 R P G I V ++D +D A A V +P +D + Sbjct: 381 TRTPFITRMPGTIP-------VGVSDEMVCTIDFAASLAAMVGQELPNDASLDSQNVLGA 433 Query: 443 FLGTNGQSNRKAEHYFLNGKLA--AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 + +G S R+ NGK+ R+ ++K L++ + Y + T Sbjct: 434 LMNQSGASGREHLVQQDNGKVGNYGYRVGDWK---LVRHD---QKKSYNFDLSMTRKPVP 487 Query: 501 GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 +++NL +DP E + + +Q E+ ++ Sbjct: 488 QFALYNLESDPAEQNDLSDSEPERAKQMQQELQKLLD 524 >UniRef50_UPI00016C41FE sulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C41FE Length = 499 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 124/405 (30%), Positives = 178/405 (43%), Gaps = 67/405 (16%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 K PN+V L DDVG+ D+G G + TP++D +A QG LT A+S + +PTR Sbjct: 23 KPPNIVFILADDVGYGDLGCYG---STKVRTPNLDTLAKQGTRLTDAHSPAAVCTPTRYA 79 Query: 143 ILTGQYSIHHG----IL--MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG---- 192 +LTGQY+ H IL + P+ +P L T+P L GY T A+GKWH+G Sbjct: 80 LLTGQYAWRHAPGSRILSGVAPLSIKPDTL----TVPAFLKQNGYTTAAVGKWHLGLGEK 135 Query: 193 ---ENKESQP--QNVGFD-----DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 N E +P + VGFD G + R V+ +P+ ++ ++ + Sbjct: 136 ETDYNGEIKPGAREVGFDYSFLIPATGDRTPCVFVENGRVVNYDPKDPITVSYTKKVGTE 195 Query: 243 PFSKDD-----VHAVRGGEQQAIADITPK--YM----------EDLDQRWMDYGVKFLDK 285 P K++ V G I + + +M ED+ V+F+ K Sbjct: 196 PTGKENPELLTVQKPSLGHDMTIVNGISRIGWMSGGKAARWKDEDIADDITKKAVEFIGK 255 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 DKPFFLY+ T H P+ ++ G S GDC+ E++ + L++ Sbjct: 256 A--KDKPFFLYFATHDAHVPRVPHPRFKGKS-GHGLRGDCIEELDWCVGEIVAALDRYKL 312 Query: 346 LDNTLIVFTSDN----------GPEAEVPPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 DNTL+VFTSDN G + H RG KG +EGG RVP W Sbjct: 313 TDNTLVVFTSDNGGVMDDGYIDGTATDTSGHKCNGALRGFKGGLYEGGHRVPFIAKWPVH 372 Query: 395 IQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 + K SDG+V DL T + G L+P D VD Sbjct: 373 VAAGKVSDGLVCHVDLLRTCAAILG-------KLLPSGAGPDSVD 410 >UniRef50_A6CEG5 Arylsulphatase A n=2 Tax=Bacteria RepID=A6CEG5_9PLAN Length = 476 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 111/363 (30%), Positives = 163/363 (44%), Gaps = 52/363 (14%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 +E+ + +KPN+++ + DDV W G G A TP IDA+A+QG+ T+ YS P Sbjct: 20 SEVIAQQARKPNIILIMADDVSWECFGSYG---ADDYQTPHIDALANQGIRFTNCYSTPL 76 Query: 136 SSPTRATILTGQYSI----HHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 +P+R ++TG+Y+ H G L P T Q+L GY T GKW + Sbjct: 77 CTPSRVKLMTGKYNFRNYTHFGYLNPKE----------KTFGQMLQSAGYKTAIAGKWQL 126 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 N D+ R F + D Y W+ V + E P ++ Sbjct: 127 --NGLYHGAEGHADNTRPFKAGFDEYCLWQ---VTTRTKIKEGGGERFWSPPLEQN---- 177 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 G+ IAD +Y D+ D+ F+ K D PFF+YY T H P Sbjct: 178 ---GKFLTIADNADQYGPDI---MSDFLCDFIKK--NQDVPFFVYYPTTLVHNPFVPTPD 229 Query: 312 YAGSSP-------------ARTSYGDCMVE-MNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 G +P AR + MV ++ + + + +E GQLDNTLI+FT+DN Sbjct: 230 TIGDAPRTQAANKQPKGKAARKANFVAMVNYLDKLVGKIVQQVEDVGQLDNTLILFTADN 289 Query: 358 GPEAEVPP--HGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTAL 414 G ++ +GRT +G KGST + G VP YWKG Q + ++D DL+PT Sbjct: 290 GTNVQITSQWNGRT-IQGGKGSTTDMGTHVPLVAYWKGHTPQGEVLNDLIDFTDLYPTFA 348 Query: 415 DLA 417 +A Sbjct: 349 AMA 351 >UniRef50_UPI000186F312 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186F312 Length = 514 Score = 133 bits (335), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 117/414 (28%), Positives = 192/414 (46%), Gaps = 58/414 (14%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 +K P++++ L DD+GW DVGF+G PTP+IDA+A G+IL + Y P +P+ Sbjct: 25 EKLVNNPHIIIILADDLGWNDVGFHGSNQI---PTPNIDALAFTGIILNNYYVAPVCTPS 81 Query: 140 RATILTGQYSIHHGILMPPMYGQ-PGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE-NKE 196 R+ +LTG+Y IH G+ ++G P GL LP+ L YVT+ +GKWH+G K+ Sbjct: 82 RSALLTGKYPIHTGLQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKD 141 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P+ GFD G+ +T +D + + + +P Y +R G Sbjct: 142 YTPEYRGFDSHYGY------WTGHQDYYDHTAIE-NPGFWGY------------DMRRGM 182 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN-YPNAKYAGS 315 +D Y DL + + VK + K S+KP FLY H N Y + Sbjct: 183 NVTRSDFG-YYTTDL---FTNEAVKVI-KGHDSNKPLFLYLAHLATHSGNKYSPLQAPAE 237 Query: 316 SPARTSY---------GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EV 363 + A+ +Y + ++++ + + L + ++N +I+F++DNG A + Sbjct: 238 TVAKFNYIKDKNRRLFAGMLSKLDESVGKVVEALADSNMINNCVILFSTDNGGPAGGFNL 297 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAK 423 P RG K + WEGGVR F++ + + S+ ++ + D PT L L Sbjct: 298 NAASNWPLRGVKDTLWEGGVRGVGFIWSPFLPSSKVSNAMIHITDWLPTLLSL------- 350 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN----GKLAAVRMDEFKY 473 N + IDG++ + G + + E LN K+AA+R +KY Sbjct: 351 -TNASNSISDIDGINVWPNL--SKGLPSVRKE-ILLNIDTERKIAALRYKNWKY 400 >UniRef50_Q1YP24 Arylsulfatase A n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YP24_9GAMM Length = 502 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 128/450 (28%), Positives = 196/450 (43%), Gaps = 70/450 (15%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILT 128 Q+ + K KPN ++ DD+G+ D G GNP TP ID +AS G T Sbjct: 21 QESKQQAPNKHKAKPNFILVYTDDMGYSDAG------PFGNPLIETPAIDRLASSGQTWT 74 Query: 129 SAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQ------PGGLQGL----TTLPQLLH 177 + Y+ P +P+R +LTG+ + G +YG PG +G+ TTL ++ Sbjct: 75 NFYAAAPVCTPSRGALLTGKLPVRTG-----LYGDNINVFFPGSKKGMPENETTLAEVFQ 129 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR-- 235 D Y T GKWH+G+ P GF+++ G +DM +W + P + Sbjct: 130 DNQYATGMFGKWHLGDATGFYPTRHGFNEWLGIPYSNDM--DWEVEGITSSNIFFPAQDI 187 Query: 236 -SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMED----------------LDQRWMDY 278 ++Y P + + + Q + I + + D + +R+ Sbjct: 188 MAKYGTVSPVLQRQIFQPEINDWQ-VPLIHSRKLADGRFVDHEIQRPADQTLITRRYTTE 246 Query: 279 GVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYK 338 ++F+ + + KPFF+Y H + +A++AG S A YGD + E++ + Sbjct: 247 SIRFMREAVTAQKPFFIYLAHSMPHVPLFRSAEFAGKSKAGI-YGDVIEEIDWSLQKIIA 305 Query: 339 TLEKNGQLDNTLIVFTSDNGPEAEVPPHG--RTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 + DNT IVFTSDNGP H TP R KG+T++GG+RV T + G Sbjct: 306 ATQALAIDDNTYIVFTSDNGPWLIYGTHAGTATPLRDGKGTTFDGGMRVMTV--FSG--- 360 Query: 397 PRKSDGIVD----LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ-SN 451 P GI+D DLF T LAG +TT D VD + NGQ S Sbjct: 361 PDIHQGIIDDLGSQTDLFATFTALAG--------FGSQTTAADSVDLSHTL--RNGQPSP 410 Query: 452 RKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 R + ++ +L A R + K H + Q Y Sbjct: 411 RTSIPFYSGSELRAFRYQDHKVHFVTQGAY 440 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 122/414 (29%), Positives = 178/414 (42%), Gaps = 62/414 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 K+PN++V + DD G+ D F G + TP++DA+A G+ T Y + P SP+RA Sbjct: 37 KRPNIIVIVSDDQGYADASFQGSKDIL---TPNLDALAKSGVRCTRGYVTAPVCSPSRAG 93 Query: 143 ILTGQYSI---HHGILMP----PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 ++TG+Y HH ++ P+ P T LPQ+L GY T +GKWH+G Sbjct: 94 LMTGRYQERFGHHNNIVAEAALPIAHLP---SNETLLPQVLAKAGYYTAMVGKWHLGLQD 150 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVN-PEVALSPDRSEYIKQLPFSKDDVHAVRG 254 +P GFD+F G + T D VN PE D+S + R Sbjct: 151 GCRPYERGFDEFFG------IITGGHDYFVNHPEERAVGDQS-------------YKARI 191 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDK--MAKSDKPFFLYYGTRGCHFDNYPNAKY 312 + + P Y+ D + V+ + + + D+P FLY H Sbjct: 192 ERNGPVGEAVPGYLTDA---FGADAVRIIRESHTKRPDQPLFLYLAFNAPHTPTQAPKDL 248 Query: 313 AGSSPA------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 + PA R +Y + M+ + L++NG +T IVF SDNG A P + Sbjct: 249 VDTMPATLESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFIVFFSDNG-GANHPYY 307 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDL----ADLFPTALDLAGHPGA 422 TP R KGS +EGG+RVP F + G I + + +L D+F TA LAG Sbjct: 308 DNTPLRDHKGSLYEGGIRVPFFAVYPGHI---PAGSVCELPVTSLDVFATACALAG---- 360 Query: 423 KVANLVPKTTF-IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 P+T+ +D VD G Q + G AAV + K V Sbjct: 361 ----TKPETSHPLDSVDMLPVLEGNARQPTHATLFWEFPGFGAAVADRDLKLVV 410 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 115/368 (31%), Positives = 164/368 (44%), Gaps = 77/368 (20%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSS 136 L ++ + PN+++ L+DD+GW DVGF G TP ID +A GL+ T AY S P+ Sbjct: 35 LAAESRRPPNIILILMDDMGWRDVGFMGNKFV---ETPHIDRLAKTGLVFTQAYASAPNC 91 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGL---------------QGLTTLPQLLHDQGY 181 +PTRA +++GQY+ HGI QP G + T+ + L D GY Sbjct: 92 APTRACLMSGQYAPRHGIYTVVDPRQPPGSPWHKWQAAESKSELDTNVVTIAEALRDGGY 151 Query: 182 VTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQ 241 T G W++G + G +GF V V PE Sbjct: 152 ATAFFGMWNLGRGRTGPVTPGG----QGFQKV-----------VFPE------------N 184 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 L F KD+ D Y+ D R D +KF+D+ ++PFF+Y Sbjct: 185 LGFGKDEYF-----------DDGKHYLTD---RLTDEVLKFVDE--HREQPFFVYLPDHA 228 Query: 302 CH--FDNYPN--AKYAGSSPARTSYGD---CMVEMNDVFANLYKTLEKNGQL---DNTLI 351 H F+ P AKY + A D C + V N+ + ++ +L DNT++ Sbjct: 229 IHAPFNPKPELLAKYERKAAASNDRRDDPACAATIEAVDHNVGRIMDHLKRLKLSDNTVV 288 Query: 352 VFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLF 410 +FTSDNG + P P RG KG +EGG+RVP V G+ + D V DL+ Sbjct: 289 IFTSDNGGTQQYTP----PLRGGKGELYEGGIRVPLVVAGPGVKSLGSRCDVPVSSIDLY 344 Query: 411 PTALDLAG 418 PT L+LAG Sbjct: 345 PTLLELAG 352 >UniRef50_A6DF77 Arylsulphatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF77_9BACT Length = 518 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 128/526 (24%), Positives = 230/526 (43%), Gaps = 90/526 (17%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS--SPTRA 141 KKPN++ L DD+G+ D+ V T ++D +A++G+ T A+S PS+ +P+R Sbjct: 18 KKPNILFILADDLGYGDLSCYNDEAKVK--TANLDQLANEGMRFTDAHS-PSTVCTPSRY 74 Query: 142 TILTGQ--YSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN----- 194 +I+TG+ + ++ + + G + TLPQ+L + GY T GKWH+G + Sbjct: 75 SIMTGRMAFRLNFKGVFTGVSGPCLITKDRLTLPQMLRNNGYETAMFGKWHIGMSFLDKN 134 Query: 195 ----------------KESQ------------------PQNVGFDDFRGFNSVSDMYTEW 220 K+ + P N GFD F F +V ++W Sbjct: 135 GDVIEVSEPPRKTPKLKKQEIALEAIKRVDYSKPIPDGPLNQGFDHF--FGTVCCPTSDW 192 Query: 221 RDVHVNPE-VALSPDRSEYIKQLP--FSKDDVHAVRGGEQQAIADITPKYM-EDLDQRWM 276 +++ + + + P + LP F D A + P + E++D ++ Sbjct: 193 LYAYIDGDRIPVPPTKIVDKALLPKHFWSFDCRA---------GLLAPNFKHENVDMVFL 243 Query: 277 DYGVKFLDKMAK--SDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFA 334 + + FLD K S KPFFL++ + H ++P ++ G + A +GD + + + + Sbjct: 244 EKSLSFLDSHHKKQSAKPFFLFHSLQAVHLPSFPAKEFQGKTQA-GPHGDFIYQFDYIVG 302 Query: 335 NLYKTLEKNGQLDNTLIVFTSDNGPEA--------EVPPHGRTPFRGAKGSTWEGGVRVP 386 L + L+ G +NTL++ +SDNGPE +G P+RG K WEGG RVP Sbjct: 303 KLVEKLKTLGMAENTLVIISSDNGPEVGTTINMRERYKHNGARPWRGVKRDNWEGGHRVP 362 Query: 387 TFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 +W G I+ S V L D+ T + V +P D + L Sbjct: 363 MIAWWPGKIRSSSVSQQTVCLTDIMATCASI-------VNTSLPNNAAEDSFNILPILL- 414 Query: 446 TNGQSNRKAEHYFLNGKLA---AVRMDEFKY--HVLIQQPYAYTQSGYQG-GFTGTVMQT 499 GQ+ + + L+ ++ ++R ++KY H + G T + + Sbjct: 415 --GQTTKAIREFTLHQTISLDLSIRHGDWKYLDHSGSGGNNYSGGRIKKALGLTNSKIN- 471 Query: 500 AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPR 545 A + ++NL DP+E +++ +H + L+ ++ + + P R Sbjct: 472 APAQLYNLKADPKEVNNLYYQHPEIAQQLKAKLEEFKTSGRSAPKR 517 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 134/509 (26%), Positives = 207/509 (40%), Gaps = 115/509 (22%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 K+ KPNV++ +DD+G D+G G TP IDA+A G T YS P SP Sbjct: 23 KEMADKPNVLLIFIDDLGKTDIGIEGSSFYE---TPRIDALAKSGARFTQFYSAHPVCSP 79 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGG----LQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 TRA ++TG+ GI + +P Q T+ Q + GY T +GKWH+G Sbjct: 80 TRAALMTGKMPQRLGITD---WIRPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLGHK 136 Query: 195 KESQPQNVGFDDFRGFN---SVSDMYTEWRDVHVNPEVALSPDR-SEYIKQLPFSKDDVH 250 + P GFD +G N S Y ++ NP+ +P+ ++ K P Sbjct: 137 PQQHPAARGFDWTKGVNHGGQPSSYYFPYK----NPQKPDAPNNVPDFEKCQP------- 185 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP-- 308 E +T +E L QR +PFFL H P Sbjct: 186 -----EDYLTDVLTSSAIEHLQQR-------------DRTRPFFLCLAHYAVHTPIQPPK 227 Query: 309 ------NAKYA--------------GSSPART-----SYGDCMVEMNDVFANLYKTLEKN 343 K A GS+ +R+ +Y + ++ L L+ Sbjct: 228 NLVEKYQVKLATQKNPKSPGEGIQEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQ 287 Query: 344 GQLDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR 398 G LD T++VFTSDNG + P P R KG T+EGG+R+PT++ W G I P+ Sbjct: 288 GILDQTIVVFTSDNGGLCTLNGKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPGKISPQ 347 Query: 399 KSDGIVDLADLFPTALDLAGHP--------GAKVANLVPKTTFIDGVDQTS--FFLGTNG 448 D D++PT L L P G +A L+ K++ + ++T ++ T+G Sbjct: 348 VLDIPAYTCDIYPTLLSLCQIPPRPTQHVDGISLAGLLTKSSSLPESERTLVWYYPHTHG 407 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLY 508 ++ + AA+R +K LI ++T +++L Sbjct: 408 SGHKPS---------AAIRQGPWK---LIH-----------------FLETDRIELYHLE 438 Query: 509 TDPQESDSIGVRHIPMGVPLQTEMHAYME 537 DP ES ++ +H + LQ E+ +E Sbjct: 439 DDPGESRNLASKHPERALQLQKELQKIIE 467 >UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR48_9BACT Length = 508 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 123/402 (30%), Positives = 182/402 (45%), Gaps = 66/402 (16%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 +KPNV+ F+ DD+G+ DVG G TP+ID +A++G+ T YS P +P+R Sbjct: 36 RKPNVIFFIADDLGYADVGCFGQKKIH---TPNIDRIATEGMKFTQHYSGSPVCAPSRCV 92 Query: 143 ILTGQYSIHHGI-----LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 ++TG++S H + L P GQ T+ +LL GY+T A GKW +G + S Sbjct: 93 LMTGKHSGHSAVRDNRELKP--EGQFPLPANTITVARLLQQNGYITGAFGKWGLGGPESS 150 Query: 198 -QPQNVGFDDFRGFNS---VSDMYTE--WRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 +P + GF F G+N +++ W D H +AL ++LP D Sbjct: 151 GKPLDQGFTRFFGYNCQRVAHNLFPTYLWDDNH---RLALDNPPIGEDQKLPADAD---- 203 Query: 252 VRGGEQQAIADITPK-YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF------ 304 + + T K Y DL + + ++F+ D PFFL++ T H Sbjct: 204 --SNDPASYKAFTGKSYAPDL---YAEQALRFIRD--NKDHPFFLFFPTIVPHVALQVPE 256 Query: 305 -------DNYPNAKYAGSS---PART---SYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 P Y G P RT +Y + M+ + +++ D+T+ Sbjct: 257 DSLKEYEGKLPETPYTGGKGYLPNRTPHAAYAAMITRMDRDLGRMLALIKELNLDDDTIF 316 Query: 352 VFTSDNGPEAEVPPHGRT-------PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGI 403 VFTSDNGP + T PFR K S +EGG+R+P V W G IQP SD + Sbjct: 317 VFTSDNGPAPQDMGGTDTKFFNSSGPFRSGKTSIYEGGMRIPLIVRWHGKIQPNSTSDRV 376 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 D PT L+L+G+ + VP T IDG+ S LG Sbjct: 377 TGFEDWLPTLLELSGNKKS-----VP--TGIDGLSFASTLLG 411 >UniRef50_C9KTU9 Twin-arginine translocation pathway signal n=5 Tax=Bacteroidales RepID=C9KTU9_9BACE Length = 453 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 123/426 (28%), Positives = 193/426 (45%), Gaps = 74/426 (17%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATIL 144 PNV++ L+DD+G D+ A TP+ID + G+ L + Y+ S SSP+RA +L Sbjct: 31 PNVLLILVDDLGLGDLSCQ---YAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAALL 87 Query: 145 TGQYSIHHGI--LMPPMYGQPGGLQG--LTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TG + G+ ++ P Q G G T+P++L + GY T IGKWH+G + P Sbjct: 88 TGCFPAMVGVPGVIRPSIDQNWGYFGPSAVTMPEVLKNGGYRTALIGKWHLGWESPNLPN 147 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD F GF ++DM ++ H +GG + Sbjct: 148 ERGFDHFHGF--LADMMDDYY---------------------------THRRQGGNYMYL 178 Query: 261 AD--ITPK-YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG--- 314 D I PK + +L W V ++ K AK PFFLY H P ++ Sbjct: 179 NDKEIDPKGHATELFTSW---SVDYIKKEAKEKNPFFLYLAYNAPHSPLQPPVEWVNKVQ 235 Query: 315 ----SSPARTSYGDCMVEMNDV-FANLYKTLEKNGQLDNTLIVFTSDNGPE-AEVPPHGR 368 S P + + ++E D + ++LE++GQL+NTL++F SDNG + + +G Sbjct: 236 ERDKSLPVKRARLIALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGGDRGSMANNG- 294 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANL 427 P RGAKG +EGG+ V + G+ + R+ + V + DL PT D P V + Sbjct: 295 -PTRGAKGDMFEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVNVP---VKHE 350 Query: 428 VPKTTFIDGV--------DQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 + + +D + D+ F+L G + F +AVR +E+K +L Sbjct: 351 IDGISVLDAIKGKTQNTEDRFVFWLRNEGGAQ------FGGKSQSAVRYNEYK--LLQNL 402 Query: 480 PYAYTQ 485 P+ Q Sbjct: 403 PFEKAQ 408 >UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 Tax=Alphaproteobacteria RepID=B9KQS8_RHOSK Length = 509 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 115/382 (30%), Positives = 182/382 (47%), Gaps = 79/382 (20%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +P+++ L+DD+G+ DVG++G V TP++D +A++G L Y+QP +PTRA ++ Sbjct: 63 RPHILYILVDDLGYADVGYHGSDVK----TPNVDRLAAEGARLMQFYTQPLCTPTRAALM 118 Query: 145 TGQYSIHHGI---LMPPMYGQPGGLQGLTT----LPQLLHDQGYVTQAIGKWHMGE-NKE 196 TG+Y + +G+ ++P GG GL T LPQ+L + GY T +GKWH+G +++ Sbjct: 119 TGRYPMRYGLQTGVIP-----SGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQK 173 Query: 197 SQPQNVGFDDFRG--------FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 P+ G D F G F + T+W + + E+ P Y +L F D Sbjct: 174 YWPRQRGVDYFYGPLVGEIDHFKHEAHGITDW---YRDNEMVKEPG---YDTEL-FGAD- 225 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 A+R E+ A TP YM +L A Y + D YP Sbjct: 226 --AIRLIEEHDSA--TPLYM-------------YLSFTAPHTP-----YQAPDKYKDLYP 263 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG---------- 358 + G R +Y + M+D + + LE+ G ++TL++F SDNG Sbjct: 264 DIADEG----RKAYAAMISCMDDQVGLVLQALERRGMREDTLVIFHSDNGGTRSKMFAGE 319 Query: 359 --PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDL 416 E+PP P R KG+ +EGG RV W G I ++ G++ + D+ PT L Sbjct: 320 GAVAGELPPR-NDPLREGKGTLYEGGTRVVALANWPGRIPAGETHGMMHVVDMLPT---L 375 Query: 417 AGHPGAKVANLVPKTTFIDGVD 438 AG A++A+ +DG+D Sbjct: 376 AGLAQAEIAH----AGQLDGMD 393 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 127/425 (29%), Positives = 185/425 (43%), Gaps = 79/425 (18%) Query: 84 KKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS-SPTRA 141 +KP NVV FL+DD+GW D+G G +P+ID +A++G+ T YS ++ SPTR Sbjct: 31 EKPWNVVFFLVDDLGWTDLGCYGSDFY---QSPNIDQLAAEGMKFTQNYSACNACSPTRG 87 Query: 142 TILTGQYSIHHGI---------------LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAI 186 +LTG Y + L PP + + Q TTLP+ L GY T + Sbjct: 88 ALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLD-QKYTTLPEALRTAGYQTFHV 146 Query: 187 GKWHMGENKESQPQNVGFD-DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKWH+G + + PQ+ GFD + G N R H P+ Sbjct: 147 GKWHLG-GRGNLPQDHGFDVNISGTNRGLP-----RSYH-----------------FPYG 183 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-- 303 D A++ A+ +Y+ D R D V + + + DKPFFLY H Sbjct: 184 GD---AMKWDSSLTEAERQDRYLTD---RMADEAVALIRQ--QQDKPFFLYCSFYSVHSP 235 Query: 304 FDNYPN--AKY----AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 P+ KY AG Y + +++ + L+++G D TLIVFTSDN Sbjct: 236 IQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTSDN 295 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS---DGIVDLADLFPTAL 414 G P RG KG WEGG RVP V W G + P S + I+ + D +PT L Sbjct: 296 G-GVRRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPG-VTPAGSVCAEPIITM-DFYPTIL 352 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-------FLNGKLAAVR 467 ++ G VA +DG+ NR+A ++ F+ +A+R Sbjct: 353 NITG-----VAGNTEHNQSVDGLSLVPLLKDPAATLNREALYWHYPHYNVFIGVPYSAIR 407 Query: 468 MDEFK 472 + E+K Sbjct: 408 VGEYK 412 >UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammaproteobacteria RepID=YDEN_ECOLI Length = 560 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 113/417 (27%), Positives = 171/417 (41%), Gaps = 106/417 (25%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGV-----------------------AVGNPTP 115 E T KPN++V +DD+G+ + F+ G A TP Sbjct: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 Query: 116 DIDAVASQGLILTSAY-SQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----T 170 + ++ +G+ T+ Y + S P+RA I+TG+ G+ Y G+ T Sbjct: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-----YSNTDAQDGIPLTET 165 Query: 171 TLPQLLHDQGYVTQAIGKWHMG--------ENKES---------------QPQNVGFDDF 207 LP+L + GY T A+GKWH+ E+K++ QPQN GFD F Sbjct: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 Query: 208 RGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKY 267 GF++ Y SP + +++P +G Y Sbjct: 226 MGFHAAGTAYYN------------SPSLFKNRERVP--------AKG------------Y 253 Query: 268 MEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-------DNYPNAKYAGSSPART 320 + D + D + +D+ D+PF LY H D Y GS A Sbjct: 254 ISD---QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN 310 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWE 380 Y + ++ + + L+KNGQ DNT+I+FTSDNG + P +G K T+ Sbjct: 311 YYA-SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYP 369 Query: 381 GGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 GG P F++WKG +QP D ++ D +PTALD A +PK +DGV Sbjct: 370 GGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADIS-------IPKDLKLDGV 419 >UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000588CF9 Length = 545 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 109/350 (31%), Positives = 162/350 (46%), Gaps = 46/350 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 + P++V L DD G+ D+G+ + TP++D +A++G+ L + Y QP +P+RA + Sbjct: 57 RPPHIVFILADDYGFNDIGYRNPAMR----TPNLDYLAAEGIKLDNYYVQPICTPSRAQL 112 Query: 144 LTGQYSIH----HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKES 197 ++G+Y IH H I+ PP QP L L TLPQ L + GY T GKWH+G KE Sbjct: 113 MSGKYQIHTGLQHSIIWPP---QPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKEC 169 Query: 198 QPQNVGFDDFRGF---NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 P N GFD F G ++TE E P Y P+ D R Sbjct: 170 WPTNRGFDSFLGILLGKGDHFLHTE--------EGGGGP----YPSTWPWEGLD---FRD 214 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY-- 312 G Q A + +R V+ + + DKP FLY + H Y Sbjct: 215 GLQSTNAYSGIYSTHVIAER-----VENIIEKHDKDKPLFLYVSFQAVHTPLQVPESYLQ 269 Query: 313 ----AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 + R Y M++ N+ K L+K G D+T++VF+SDNG + Sbjct: 270 PFESSIQDEKRRIYAGMTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNGGNIDQGA-SN 328 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK---SDGIVDLADLFPTALD 415 P RG+K + WEGGVR FV + + K S ++D++D +PT ++ Sbjct: 329 WPLRGSKTTLWEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTLIE 378 >UniRef50_Q7UIN1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UIN1_RHOBA Length = 554 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 115/386 (29%), Positives = 175/386 (45%), Gaps = 56/386 (14%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP-----TPDIDAVASQGLILTSAYSQPS- 135 T +PNV++ DD G+ G V+ NP TP++D +A +GL T+A+S S Sbjct: 54 TDTRPNVIIVYTDDQGF-------GDVSSMNPDAKFETPNMDRLAKEGLTFTNAHSSDSV 106 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMG- 192 +P+R +LTG+YS + M + L TL L D+GY T +GKWH+G Sbjct: 107 CTPSRYGLLTGRYSWRTTLKRGVMNAEGKCLIADDRMTLASFLRDEGYQTGMVGKWHLGM 166 Query: 193 ------------ENKESQPQNVGFDDFRGF-NSVSDMYTEW---RDVHVNPE--VALSPD 234 + P + GFD F G S++ W R V P+ P+ Sbjct: 167 QFPGSPKKRDWSQPVRDMPLDKGFDHFFGIPASLNYGVLAWFDGRHAAVPPKSWTGKKPN 226 Query: 235 RS--EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMED-LDQRWMDYGVKFL-------- 283 + +Y P+ + + A + + I ++ ++++ R+ D ++++ Sbjct: 227 KRHVDYRIMPPYQETETEARKRFKNTTI-EVADDFVDNQCLTRFTDEAIEWITEATATPG 285 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 ++ A + PFFLY H+ P +Y G YG+ M+E + L K LE N Sbjct: 286 NESASNAPPFFLYLPLTSPHYPVCPLPEYWGQGDC-GGYGEFMIETDHHLGRLLKHLEAN 344 Query: 344 GQLDNTLIVFTSDNGPEA-------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYW-KGMI 395 G DNTL++ TSDNGPE + H +RG K +EGG RVP W G+ Sbjct: 345 GLTDNTLVILTSDNGPEKSWKQRIDDFGHHSNGSYRGGKRDIYEGGHRVPMLARWPNGIK 404 Query: 396 QP-RKSDGIVDLADLFPTALDLAGHP 420 QP R SD +V DL T +L G P Sbjct: 405 QPGRISDALVGQVDLLATVAELLGRP 430 >UniRef50_B7PV03 Arylsulfatase B, putative n=7 Tax=Ixodes scapularis RepID=B7PV03_IXOSC Length = 588 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 105/358 (29%), Positives = 166/358 (46%), Gaps = 53/358 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++P++V L DD+GW DV +NG TP+IDA+A G+ L Y+QP +P+RA + Sbjct: 63 RQPHIVFILADDLGWNDVSYNG---CPQIRTPNIDALAWNGIRLQRYYTQPMCTPSRAAL 119 Query: 144 LTGQYSIHHGIL-MPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQ 200 +TG+Y IH G+ + +P GL LPQ L D GYV+Q +GKWH+G KE P Sbjct: 120 MTGRYPIHTGMQHFVILQNEPRGLPLKFKLLPQWLGDLGYVSQMLGKWHLGFYKKEYTPT 179 Query: 201 NVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GF G + D Y+ R ++ FS + +G + Sbjct: 180 MRGFQKHIGSWGGFVDYYSHIR-----------------FNKIGFSHSGLDFRQGLSEGR 222 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKS---DKPFFLYYGTRGCHFDN---------- 306 D Q + ++ + ++ ++ +KP FLY H N Sbjct: 223 EFD---------GQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQVPKK 273 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE--AEVP 364 Y + + RT Y + +++ + + L K G L +T++VF+SDNG + E P Sbjct: 274 YSDKYHDIGHWNRTMYAGMVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGGDTNGENP 333 Query: 365 PHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS---DGIVDLADLFPTALDLAG 418 + + PF+G K + WEGG+ VP F+ W + + + I ++D PT LAG Sbjct: 334 NYASSWPFKGQKRTLWEGGIHVPGFI-WSPLFSGMRGFDYNNIFHISDWLPTLYQLAG 390 >UniRef50_C6Y1N8 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1N8_PEDHD Length = 440 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 121/451 (26%), Positives = 195/451 (43%), Gaps = 75/451 (16%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTS-AYSQPSSSPT 139 K+PNV++ L DD+G+ D+ GNP TP +D +AS G++ T+ + P+ SP+ Sbjct: 25 KRPNVIIVLTDDMGYGDLA------CYGNPLFKTPFLDKMASNGVMATNFVTTSPTCSPS 78 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENK 195 R + LTG+Y MP + G PG + T+ ++L Y T IGKWH+G+ Sbjct: 79 RVSTLTGRYCSRSK--MPRVIG-PGDKTAIPDEEVTIAEMLKTSAYRTACIGKWHIGDYG 135 Query: 196 ESQPQNVGFDDFRGFNSVSDMYT-EWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 P GFD F G +Y+ ++R +V + + R++ K + +D + Sbjct: 136 TGLPNKQGFDLFYGM-----LYSHDFRAPYVKTDTVIKIFRNQ--KPEIYRPNDTILTKA 188 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 ++AI F+ + +PFFLY H + Sbjct: 189 YTREAIG--------------------FVKESTAKKQPFFLYLAYNMPHLPVASAVRKDS 228 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH----GRTP 370 + A G + EM+ A L+KT++ +G+ DNT+ +FTSDNGP P G T Sbjct: 229 NKSAGGELGSVIEEMDTEMAKLWKTVQDSGEADNTIFIFTSDNGPWLNAPQRMYDDGITK 288 Query: 371 ---------FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPG 421 FRG+K ++ EGG RVP VY+K + + D+ PT D G Sbjct: 289 PYHVGTAGIFRGSKATSLEGGHRVPFIVYYKNHTAQQVVRSPISNLDILPTLADWTG--- 345 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 +PK +DG + Q K +Y+ N L V+ ++K + ++ Sbjct: 346 ----TALPKRV-LDGESVVKLLSQKDYQIPHKPIYYY-NYVLEGVKDGDWKLRI-TKKDD 398 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + + G+ T +NLY DP+ Sbjct: 399 KTIEEMFHLGWDPT-------ERYNLYNDPK 422 >UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R457_9PLAN Length = 516 Score = 132 bits (332), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 145/510 (28%), Positives = 214/510 (41%), Gaps = 89/510 (17%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATIL 144 PN+V L DD+G+ DVG G TP ID +A G+ L YS P +P+R +L Sbjct: 33 PNIVFILCDDLGYGDVGCFG---QKKTRTPHIDTLARDGMRLIQHYSGAPVCAPSRCVLL 89 Query: 145 TGQYSIHHGILMPPMYGQPGG----LQGLTTLPQLLHDQGYVTQAIGKWHMGENKES-QP 199 TG +S H + QP G +G TLP LL +GYV A GKW +G + S +P Sbjct: 90 TGLHSGHSQV-RDNREAQPEGQYPLAEGTVTLPGLL--EGYVCGAFGKWGLGGPESSGKP 146 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G+N + + P+ S D +K PF+ Q Sbjct: 147 LAQGFDRFFGYNCQRQAHNYY------PQHLWSNDEKVLLKNPPFAAHQKFPADADPQNP 200 Query: 260 IADIT---PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------------- 303 A P Y DL + +KF+D+ KPFFLYY + H Sbjct: 201 AAFERYRGPDYAADLIS---EQALKFIDE--HHQKPFFLYYASPVPHLALQVPEDSLKEY 255 Query: 304 ---FDNYPNAKYAGSSP---ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 F P G P R +Y + M+ + + LEK G T++VF+SDN Sbjct: 256 AGEFSETPYLGERGYLPHPTPRAAYAAMITRMDREIGRILERLEKYGLQRRTIVVFSSDN 315 Query: 358 GPEAEVPPHGRTPF-------RGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADL 409 GP + F RG KGS +EGG+RVPT V + G++ S + D Sbjct: 316 GPLYDKLGGTDADFFQSALDLRGRKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDW 375 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK----------AEHYFL 459 PT L LAG ++ +P+ DG D + G + Q+ R+ + + Sbjct: 376 MPTLLSLAG-----MSTKIPEQA--DGRDLSPSLRG-DWQAPREFLYREFPGYGGQQFVR 427 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 +GK AVR + + ++ A + + +++L DP ES ++ Sbjct: 428 SGKWKAVRQNLVRPVPTGKKKLAEWK------------EPLAIELYDLEADPTESTNVAA 475 Query: 520 RHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 H P V ++HA M L+++ P + K Sbjct: 476 EH-PKVV---AKLHAIM--LREHQPSVEFK 499 >UniRef50_A7SPY2 Predicted protein (Fragment) n=10 Tax=Eumetazoa RepID=A7SPY2_NEMVE Length = 270 Score = 132 bits (332), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 97/291 (33%), Positives = 145/291 (49%), Gaps = 45/291 (15%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KKPN+V+ + DD+GW DV F+G PTP ID +AS+G+IL S Y P +PTRA++ Sbjct: 1 KKPNIVMIVADDLGWDDVSFHGSSQI---PTPTIDKLASEGVILNSYYVSPICTPTRASL 57 Query: 144 LTGQYSIHHGILM---PPMYG-QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKES 197 +TG++ ++ G+L+ ++G QP GL G TT PQ + GYVT IGKWH+G KE Sbjct: 58 MTGKHPMNLGMLIHTHATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEY 117 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD F GF + + Y + H + E D +D+ VR Sbjct: 118 TPTYRGFDSFYGFWNGKEDYWD----HSSQEDVWGTDL----------RDNEKPVRNESG 163 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN-------- 309 ++ + E Q + + KP +LY +G H N Sbjct: 164 HYGTEL---FAERAAQ---------IIHLHNQTKPLYLYLAQQGVHSANGNEPLQAPKRL 211 Query: 310 -AKYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 K++ SSP R Y + +++ ++K L + G L+NT++VFT+DNG Sbjct: 212 IKKFSHISSPKRRIYAAMVSSLDESVETVHKALSETGMLNNTVLVFTTDNG 262 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 115/398 (28%), Positives = 174/398 (43%), Gaps = 78/398 (19%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 KPN+++ ++DD+GW DVG+N TP+IDA+A QGL+ AY+ + +P+RA + Sbjct: 39 KPNIIMIVIDDLGWSDVGYNQTTDYF--ETPNIDALAQQGLVFDQAYAGAANCAPSRAVL 96 Query: 144 LTGQYSIHHGIL--------------MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 ++GQY HG+ + P+ + G + T+ + L GY T GKW Sbjct: 97 MSGQYGPRHGVYTVSPSDRGHAKTRKLIPIKNKRGLTTDIITIGESLKTAGYTTGTFGKW 156 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G + + Q GF D S M + + P + P + EY+ Sbjct: 157 HLGADPDKQ----GF-DVNVAGSHQGMTFHYFSPYQLPNIEDGP-KGEYL---------- 200 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------ 303 E L +D+ VK + D+PFF Y H Sbjct: 201 ------------------TERLTTEVIDW-VK-----SSKDQPFFAYVPYYTVHTPYQAV 236 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVE-MNDVFANLYKTLEKNGQLDNTLIVFTSDNG--PE 360 D G R + MVE M+D ++ L+ G +NT+++FTSDNG Sbjct: 237 VDKVNKYHEKGIKSKREATYAAMVEHMDDNVGRIFDMLDSEGLAENTVVIFTSDNGGYRM 296 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 + P TP RG KGS ++GG+RVP V W ++P V AD +PT ++L Sbjct: 297 SSFP----TPLRGGKGSYYDGGLRVPLIVRWPEKVKPGLDHTPVINADFYPTLVNLT--- 349 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 +K N V +DGVD T+ LG + R +F Sbjct: 350 KSKQPNQV-----LDGVDLTAHLLGQQDIAERDLFWHF 382 >UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAW4_FLAJ1 Length = 539 Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 141/525 (26%), Positives = 216/525 (41%), Gaps = 103/525 (19%) Query: 73 QKLAE-----LEKK-----TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVAS 122 QKLAE L +K + KKPN+++ L DD+G D+ GG PTP ID++A+ Sbjct: 40 QKLAEGKAAFLSQKDTSAASEKKPNIIILLADDLGKYDISLYGGK---STPTPQIDSLAA 96 Query: 123 QGLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMYGQPG----------------- 164 G+ T Y S SP+RA +LTG+Y G P P Sbjct: 97 SGVTFTDGYVSSSICSPSRAGLLTGRYQERFGHEYQPGDRYPKNNLEYYAFKYLLNTNSW 156 Query: 165 --------------GLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDD 206 QGL T L QGY T IGKWH+G K P + GFD Sbjct: 157 RLNPKIEYPNDASIATQGLPKSEITFADLAKKQGYSTAIIGKWHLGHTKGFFPLDRGFDY 216 Query: 207 FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK 266 GF ++ + NP++ ++ +++ + + V + I D K Sbjct: 217 HYGFYQAFSLFAPEDN---NPDI-INHHHTDFTDKTIWGNGRVGTGQIRRDSTIID-EKK 271 Query: 267 YMEDLDQRWMDYGVKFLDKMAKSDKPFFLY---------YGTRGCHFDNYPNAKYAGSSP 317 Y L +++ + F+DK +KPF LY + R ++D +PN K Sbjct: 272 Y---LTEKFAEEAEAFIDK--NKNKPFLLYVPFNAPHTPFQVRKKYYDRFPNVK----DE 322 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGS 377 + Y + ++D + ++K G +NTLI F SDNG P +G K S Sbjct: 323 NKRVYFAMISALDDAIGLIRAKVKKEGLEENTLIFFASDNGGADYTYATTNAPLKGGKFS 382 Query: 378 TWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 +EGGV VP + WKG I+P V D+F T + H G +PK DG Sbjct: 383 HFEGGVNVPFALSWKGKIKPHTIYKTPVSSLDIFST-IAAVTHSG------LPKDRVYDG 435 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 VD + N Q+++ Y+ +G A+R ++K + SG Sbjct: 436 VDLVD-VVNNNKQAHQNL--YWRSGDAKAIRSGDWKLII----------SG--------- 473 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +T + ++NL D E+ + ++ LQT + + + L K Sbjct: 474 -KTHETWLYNLAKDKSETTDLASKNPEKVKELQTALQNWEKGLIK 517 >UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO Length = 503 Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 104/355 (29%), Positives = 166/355 (46%), Gaps = 53/355 (14%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 ++ +PN+V L DD G+ DVG++G + TP +DA+++ G+ L + Y QP +PT Sbjct: 28 RQDAGQPNIVFVLADDFGFHDVGYHGSEIH----TPTLDALSASGVRLENYYVQPICTPT 83 Query: 140 RATILTGQYSIH----HGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGEN 194 R+ +++G+Y IH HGI+ QP L + TL L + GY T +GKWH+G Sbjct: 84 RSQLMSGRYQIHTGLQHGIINS---CQPNALPNDSPTLADKLKESGYATHMVGKWHLGFY 140 Query: 195 K-ESQPQNVGFDDFRGF-NSVSDMYTE---WRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 K E P N GFD + G+ N+ D + WR V Y+ +D+ Sbjct: 141 KQEYLPWNRGFDTYFGYLNAAEDYFNHNVPWRQV-------------RYLDL----RDNN 183 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----- 304 VR Q A + D+ Q + KP FLY + H Sbjct: 184 GPVRNETGQYSAHLFTGKAIDVVQS------------HNTSKPLFLYLAYQSVHAPLEVP 231 Query: 305 DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 + Y + + R ++ + +++ ANL + L+ G +NT+++F++DNG + Sbjct: 232 EKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGLWNNTVLIFSTDNGGQIHAG 291 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAG 418 + P RG K S WEGG FV + + S G++ ++D FPT + LAG Sbjct: 292 GN-NYPLRGWKASLWEGGFHGVGFVSGGALKRSGAVSKGLIHVSDWFPTLVTLAG 345 >UniRef50_A3HYT7 Arylsulphatase A n=1 Tax=Algoriphagus sp. PR1 RepID=A3HYT7_9SPHI Length = 437 Score = 132 bits (331), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 118/397 (29%), Positives = 185/397 (46%), Gaps = 57/397 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 + PN+++ + DD+G +G GG TP IDA+A+QG +A++QP +P+R I Sbjct: 29 RPPNIILIMADDLGVETIGSYGG---TSYQTPFIDAMAAQGAKFENAFAQPLCTPSRVQI 85 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVG 203 +TGQY++ + + +GQ Q TT +LL D GY T GKW +G+ +S PQ+ G Sbjct: 86 MTGQYNVRNYTV----FGQLDRSQ--TTFAKLLKDAGYKTAIAGKWQLGKESDS-PQHFG 138 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 F++ W+ H+ + + + Y + GG Q DI Sbjct: 139 FEE----------SCLWQ--HMLGATDKNGNDTRYSNPVLEINGVPKHFDGG--QFSTDI 184 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG--TRGCHFDNYPNAK-YAGSSPART 320 T D+ + F++K D+PFF YY C F P++K + SSP Sbjct: 185 TS-----------DFLIDFMEK--NKDQPFFAYYPMIITHCPFVPTPDSKDWDPSSPGSP 231 Query: 321 SY-------GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TPFR 372 +Y GD + M+ + +E+ G + T+I+FT DNG + + R + Sbjct: 232 TYKGDPQYFGDMVAYMDKTVGKIIAKVEEMGLSEETIIIFTGDNGTDQPIVSSYRGKDYP 291 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K T E G+ VP V WKG I +++ ++D +D PT LDLA G K + +P Sbjct: 292 GGKKFTTENGIHVPLVVKWKGKIDSGIQNEDLIDFSDFLPTLLDLA---GIKAVHGIP-- 346 Query: 432 TFIDGVDQTSFFLGTNGQ-SNRKAEHYFLNGKLAAVR 467 +DGV +G G N Y NG L +++ Sbjct: 347 --LDGVSFMPQLMGKEGNPRNWIYSWYSRNGDLESLQ 381 >UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG54_9BACT Length = 469 Score = 132 bits (331), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 121/395 (30%), Positives = 167/395 (42%), Gaps = 58/395 (14%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATIL 144 PNVVV DD GW D G GG V T ID +A G+ T Y+ P+ SP+RA +L Sbjct: 28 PNVVVIYFDDTGWKDFGCFGGAVD----TTHIDNLAKNGMRFTEYYAPAPNCSPSRAGLL 83 Query: 145 TGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGE---NKESQPQ 200 TG++ G+ P L T+ + L +GY T GKWH+G P Sbjct: 84 TGRFPFRLGMYSYRSKNTPMHLPDSEITIAEALKTKGYATGMFGKWHLGNLDGKSHPTPS 143 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD + ++ + NP K L + V + G Q + Sbjct: 144 EQGFDYWLACDN--------NLIKHNP------------KSLIRNGKPVGKIAGWAAQVV 183 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP--FFLYYGTRGCHFDNYPNAKYAGSSPA 318 AD ++M+ + Y + F + + D P Y RG +N A Y G Sbjct: 184 ADEANEWMKKQTSPFFAY-IAFSETHSPLDAPEELITKYIERG---ENKKRATYRG---- 235 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGST 378 T Y D V ++ KTL+ G DNTL+ SDNGP +E G RG K T Sbjct: 236 MTEYSDAAV------GSILKTLDDMGVSDNTLVFLASDNGPTSEDSCEG---LRGKKSYT 286 Query: 379 WEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 WEGG+RVP + W G ++P + + V DL PT D+ G +PK IDGV Sbjct: 287 WEGGIRVPAIIRWPGKVKPGSEYNDPVGGIDLLPTLCDIVGAE-------LPK-RHIDGV 338 Query: 438 DQTSFFLGTNGQSNRKAEHYFLNGKLAA-VRMDEF 471 S G + N +F AA +RM ++ Sbjct: 339 SIRSVLEGKPFKRNTPILSFFYRTSPAASMRMGDY 373 >UniRef50_Q7US20 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7US20_RHOBA Length = 458 Score = 131 bits (330), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 119/393 (30%), Positives = 187/393 (47%), Gaps = 78/393 (19%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +PN+V+ + DD+G +G GG V TP +D +AS G+ T AYSQP +PTR ++ Sbjct: 22 RPNIVLIMADDIGIEGLGCYGG---VSYDTPALDQLASDGVRFTHAYSQPLCTPTRVQLM 78 Query: 145 TGQYSIHH----GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM--------- 191 +G+Y+ + GIL P + T + ++GY T GKW + Sbjct: 79 SGKYNHRNWKTFGILDP----------NVKTFGHRMKEEGYATAIFGKWQLQSYDPPGYP 128 Query: 192 GENKES----QPQNVGFDDFRGFNSVSDMYTEWR-DVHVNPEVALSPDRSEYIKQLPFSK 246 G ++ P++ GFD + F++ ++TE + + NP + Sbjct: 129 GADERRGTGMHPKDAGFDQYALFHA---LHTEDKGSRYANPTML---------------- 169 Query: 247 DDVHAVRGGEQQAIADITP-KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 + A +GGE + I P ++ ED+ W+ + FL + ++P F+YY H+ Sbjct: 170 -EGEAGQGGELK----IYPGQFGEDI---WVKKTIDFLKR--DRNEPAFVYYPMALPHWP 219 Query: 306 NYP---NAKYAGSSPA--RTSYGDCMVEMNDV-FANLYKTLEKNGQLDNTLIVFTSDNGP 359 P + + S PA + Y MVE DV NL + L+ NG +NT+++F DNG Sbjct: 220 FVPTPISDDWDPSQPAVEQLKYFKDMVEYMDVAVGNLIQGLQSNGLRENTVVIFYGDNGT 279 Query: 360 EAEVPPH---GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDL 416 +V GR+ +G KG T + G+ VP V W G IQP SD ++D +D +PT ++L Sbjct: 280 HLKVVSELSDGRS-IQGGKGLTKQTGIHVPLIVSWPGHIQPNVSDKLIDASDFYPTLIEL 338 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 A G KVA + +DGV G GQ Sbjct: 339 A---GGKVA----EDPTMDGVSFAPDLFGKKGQ 364 >UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellula baltica RepID=Q7UGB8_RHOBA Length = 656 Score = 131 bits (330), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 126/477 (26%), Positives = 196/477 (41%), Gaps = 93/477 (19%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYSQP 134 ++ + +PNV++ L DD GW D+ A NP TP +DA+A++ L Y P Sbjct: 94 IQAEASDRPNVLLILTDDQGWGDLA------AHRNPKISTPTLDALANESARLDRFYVSP 147 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWH 190 +PTRA +LTG+Y G+ G G + + TTL +L GY T GKWH Sbjct: 148 VCAPTRAALLTGRYPERSGV-----AGVTGRREVMRAEETTLAELYRSAGYATGCFGKWH 202 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 G P GF++F GF H N DD Sbjct: 203 NGAQMPLHPNGQGFNEFFGFCG----------GHFN------------------LYDDAL 234 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY---------YGTRG 301 R G T Y+ D+ D V+F+ D+PFF Y + R Sbjct: 235 LERNGTPVQ----TKGYITDV---LTDAAVEFIQN--HHDRPFFCYVPFNAPHGPFQVRR 285 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDV-FANLYKTLEKNGQLDNTLIVFTSDNGPE 360 FD Y + GS +T+ MV+ D + L K L + + T++VF +DNGP Sbjct: 286 DLFDRYND----GSIDEKTAAVYAMVQNIDTNVSRLLKCLSDHSLDEETIVVFLTDNGPN 341 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 + G RG KGS EGG RVP F+ W G IQP+ + DL PT + P Sbjct: 342 GKRFNGG---MRGTKGSVHEGGCRVPCFIRWTGNIQPQSISQVAAHIDLLPTLMQWCDIP 398 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 +P +DG ++ L +G A+ L + +++ +F + Sbjct: 399 -------LPTKVPLDG--RSLVELIRDGADPTLADRSILTYRPNPMQLQKFGKAAVRTNT 449 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + T ++ + +S+F++ TD ++ I H + L++++ Y++ Sbjct: 450 HRLT------------IEKSKASLFDMTTDAGQTTDIASSHPELTKQLRSQIQKYVQ 494 >UniRef50_A6DMX8 Iduronate-sulfatase or arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX8_9BACT Length = 532 Score = 131 bits (330), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 131/482 (27%), Positives = 205/482 (42%), Gaps = 68/482 (14%) Query: 75 LAELEKKTGKK--PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 L E+ KT + PN+V+ DD+G+ D+ G A TP+ID +A G++ T +S Sbjct: 40 LNEMRPKTTQSEYPNIVLIYADDLGYGDLSSYG---ATKIKTPNIDRLAKNGILFTDGHS 96 Query: 133 -QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWH 190 + +P+R +LTG+Y + P + TT+ LL +GY T +GKWH Sbjct: 97 TSATCTPSRYALLTGEYPLRINNYSPVFCADRLIIDTKKTTIASLLKRKGYTTACVGKWH 156 Query: 191 MG--------ENKESQ--PQNVGFDDFRGFNSVSD----MYTEWRDV-HVNPEVALSPDR 235 +G NKE + P +GFD F G V+ +Y E R + ++P L+ R Sbjct: 157 LGFGDKPKPDWNKELKPGPLELGFDYFFGLPVVNSHPPFVYMENRRILGLDPNDPLTYKR 216 Query: 236 ------SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 Y+ + + +V GG+ + E L Q+ + + M + Sbjct: 217 GGKTYGKAYVGKHTSPHRGMPSVIGGKVAHDLYVDELIGEKLTQKALTW-------MNQQ 269 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 DKPFFLYY + H P+ + G S GD + E++ + +E+ G L+NT Sbjct: 270 DKPFFLYYASHNVHLPITPHPYFHGKSECGLR-GDFVEELDWSVGQIISAVERFGALENT 328 Query: 350 LIVFTSDNGP------EAEV------PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP 397 + +FTSDNG + ++ P+G+ +G K WE G RVP V W I Sbjct: 329 IFIFTSDNGAIIKGKDQGDILDQLGHKPNGK--LKGRKFGAWEAGHRVPFIVSWPNKIPA 386 Query: 398 RK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH 456 K SD ++ DL PT + G L P DG +Q LG + S R Sbjct: 387 GKTSDALIANLDLLPTFAAITGQ------KLAPHEAR-DGFNQLPLLLGKDTTSARSE-- 437 Query: 457 YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDP-QESD 515 ++ + + L Q + Y GG+ ++NL DP Q+ + Sbjct: 438 -------LIIQPHKRSHKSLRQGDWVYIPGAGDGGWVPAKKGELPKQLYNLKDDPYQQQN 490 Query: 516 SI 517 I Sbjct: 491 RI 492 >UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB_HUMAN Length = 533 Score = 131 bits (330), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 110/356 (30%), Positives = 165/356 (46%), Gaps = 54/356 (15%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 + P++V L DD+GW DVGF+G + TP +DA+A+ G++L + Y+QP +P+R+ + Sbjct: 43 RPPHLVFLLADDLGWNDVGFHGSRIR----TPHLDALAAGGVLLDNYYTQPLCTPSRSQL 98 Query: 144 LTGQYSI----HHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE-NKES 197 LTG+Y I H I+ P QP + LPQLL + GY T +GKWH+G KE Sbjct: 99 LTGRYQIRTGLQHQIIWP---CQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKEC 155 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD + G+ S+ Y S +R I L ++ + R GE+ Sbjct: 156 LPTRRGFDTYFGYLLGSEDY-------------YSHERCTLIDALNVTRCALD-FRDGEE 201 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKS-------DKPFFLYYGTRGCHF-----D 305 A K M Y K A + +KP FLY + H + Sbjct: 202 VATGY---KNM---------YSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEPLQVPE 249 Query: 306 NYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 Y R Y + M++ N+ L+ +G +NT+ +F++DNG + + Sbjct: 250 EYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQT-LAG 308 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLA-GH 419 P RG K S WEGGVR FV + Q K+ ++ ++D PT + LA GH Sbjct: 309 GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGH 364 >UniRef50_B1KD86 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD86_SHEWM Length = 484 Score = 131 bits (329), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 106/356 (29%), Positives = 159/356 (44%), Gaps = 63/356 (17%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS-S 137 E+ K+ NVV+ +DD+G MD G G + PTP+ID +A+ G+ T AY+ ++ + Sbjct: 31 EESKLKQANVVIIYVDDLGIMDTGIYG---SAQYPTPNIDKLANSGVRFTQAYANAANCA 87 Query: 138 PTRATILTGQYSIHHGIL--------------MPPMYGQPGGLQGLTTLPQLLHDQGYVT 183 P+RA+++TG HGIL + P+ LTT+ L QGY T Sbjct: 88 PSRASLMTGLTPAEHGILTVGSSERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYAT 147 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 IGKWH+G ++ P GFD T H+ P Y P Sbjct: 148 AVIGKWHLG---KTAPTEYGFD------------TAIAASHLG-----HPPSYFY----P 183 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 +SK + G E+ + D E L R V ++ + +PFFLY H Sbjct: 184 YSKGKRKLI-GLEEGGLKD------EYLSNRITREAVNYI---SSQRQPFFLYLPFYAVH 233 Query: 304 --------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 + N NA+ +Y + ++ L + L+K+GQ +NTL+VF S Sbjct: 234 TPIEAPKEWVNQHNARQQAGEIKSAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFAS 293 Query: 356 DNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLF 410 DNG A P P+RG K S +EGG+++P + W I P + V ++DLF Sbjct: 294 DNG--AYDPATSSLPYRGYKSSLFEGGIKIPLVLSWPKQIPPNSQNRTPVQMSDLF 347 >UniRef50_A6DRV5 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DRV5_9BACT Length = 505 Score = 131 bits (329), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 130/440 (29%), Positives = 207/440 (47%), Gaps = 76/440 (17%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS--SPTRA 141 +KPN+V+ L DD+G+ DV F V TP +DA+A +G+ + A++ PS+ SP+R Sbjct: 26 EKPNIVIILTDDLGYGDVSFLNPESKVR--TPHMDALAKEGVWASDAHA-PSTVCSPSRY 82 Query: 142 TILTGQY----SIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH------- 190 ++LTG+Y S+ G L P + + + TLP++L ++GY T IGKWH Sbjct: 83 SLLTGRYAWRGSLRAGRLNP--WKESAIEKDRVTLPKILKEKGYHTALIGKWHLGFEWPW 140 Query: 191 MGENKESQ------------------------PQNVGFDDFRGFNSVSDM----YTEWRD 222 MG K S+ P GFD + G + +M + E Sbjct: 141 MGGGKPSESIIGKGTSTASCEMFDWSKPIKGGPLGAGFDYYFG-DDAPNMPPYAFIENDR 199 Query: 223 VHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKF 282 + P ++ D + +KQ +H + GE+ + K M + + +D F Sbjct: 200 LTCEP---VNIDGRKLMKQQTMRGGYIHGIGPGEKGWQLN---KVMPTITAKAID----F 249 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEK 342 +D+ +K +KPFFL Y H P KY G S A YGD +++ ++ + K L+ Sbjct: 250 IDQESKKEKPFFLMYAPTSPHSPIVPLDKYKGKSLA-GPYGDFIIQTDEAIGQVVKALKN 308 Query: 343 NGQLDNTLIVFTSDNGP----EAEVPPHGRT---PFRGAKGSTWEGGVRVPTFVYW-KGM 394 +G +NTL++ +SDNGP + HG P RG K EGG RVP W KG Sbjct: 309 SGVYENTLLIISSDNGPAPFMRERIQTHGHNPSGPLRGLKRDLLEGGHRVPFIASWPKGE 368 Query: 395 IQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 I+ K D ++ DLF T +AG K+ + + + D +D + L +N ++ Sbjct: 369 IKGGKEIDALLSQTDLFAT---IAGIIDYKLEDSIAE----DSLDILA-TLRSNQTVRQE 420 Query: 454 AEHYFLNGKLAAVRMDEFKY 473 ++ NGKL +R + + Y Sbjct: 421 LVYHASNGKL-GLRQENWAY 439 >UniRef50_D2R783 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R783_9PLAN Length = 505 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 119/407 (29%), Positives = 178/407 (43%), Gaps = 44/407 (10%) Query: 71 TQQKLAELEKKTGK--KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 T + AE E K +PN+V+ DD+GW D+G + PTP++D +ASQGL LT Sbjct: 18 TNLRGAETESARAKPARPNIVILYADDMGWGDLGAQNPDSKI--PTPNLDRLASQGLRLT 75 Query: 129 SAYSQPS-SSPTRATILTGQYSIH--HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQA 185 A+S +P+R +L G+Y HGI+ + Q T+ +LL +GY T Sbjct: 76 DAHSSSGICTPSRYALLHGRYHWRKFHGIVN--SFDQSVMDDERVTMAELLKTEGYKTAC 133 Query: 186 IGKWHMGE--NKESQP------QNVGF--DDFRGFNSVS--------DMYTEWRDVHVNP 227 IGKWH+G N +P Q GF +DF + D Y + P Sbjct: 134 IGKWHLGWDWNAIKRPGAKGGAQGTGFAAEDFDWSKPIPGGPLSHGFDYYYGDDVPNFPP 193 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 DR + + + A E + + ++ D V +++K Sbjct: 194 YAWFENDRIVVPPTVRVTTTEPTAEGNWEARPGPAVKDWDFWNVMPTLTDKAVAWINKQ- 252 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 K+D+PFFLY+ H P ++ G S A +GD M + + + + L+K G + Sbjct: 253 KADEPFFLYFPFTSPHAPIVPTKEFTGKSQA-GGFGDFMTQTDATVGRVLEALDKQGLAE 311 Query: 348 NTLIVFTSDNGPEAEVPPHGRT-------PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK- 399 NTL++FT+DNGPE R P RG K WEGG RVP + W + K Sbjct: 312 NTLVIFTADNGPEHYAYERVRKFEHRSMGPLRGLKRDLWEGGHRVPMVIRWPKHVPAGKV 371 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT 446 SDG++ DL T + V +P + D +Q + GT Sbjct: 372 SDGLMSQIDLLATIATI-------VDAEIPAGSADDSYNQLPLWTGT 411 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 116/376 (30%), Positives = 175/376 (46%), Gaps = 50/376 (13%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A L + K+PNV++ DD G +DV G A TP+ID +A +G++ + Y+ P Sbjct: 18 APLFAQQQKRPNVLIIYTDDQGTLDVNCYG---AKDLHTPNIDRLAKEGVLFSQFYAAAP 74 Query: 135 SSSPTRATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 SP+RA++LTG+Y + P G G T+ ++ D GY T IGKWH+G Sbjct: 75 VCSPSRASLLTGRYPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG 134 Query: 193 ENKESQPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 + E+ P GFD GF D Y+ + ++ + H Sbjct: 135 YSPETMPNQQGFDYSFGFMGGCIDNYSHY---------------------FYWAGPNRHD 173 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + Q+ D K+ DL + ++ FL+K ++DKPFFLY+ H+ K Sbjct: 174 LWRNGQEIWED--GKFFADLTVQEVN---GFLEKNKRADKPFFLYWAINMPHYPLQGQEK 228 Query: 312 ---YAGSSPA-RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 Y PA R Y + M++ + + L++ G +NT++VF SD G E G Sbjct: 229 WRQYYKDLPAPRRMYAAAVSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFG 288 Query: 368 ----RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGA 422 P+RGAK S +EGG+RVP + W G + + D + D +PT LAG Sbjct: 289 GGGFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNEVRDQLCVNIDWYPT---LAGL--C 343 Query: 423 KVANLVPKTTFIDGVD 438 KVA +P+ IDG D Sbjct: 344 KVA--LPQRK-IDGKD 356 >UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY54_CAUSK Length = 559 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 126/419 (30%), Positives = 179/419 (42%), Gaps = 83/419 (19%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVG-NPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 + PNV+V L DD+G+ D+ FNGGGVA G PTP+ID++ G+ + Y + +P+RA Sbjct: 60 RPPNVIVILADDMGFNDITFNGGGVAGGLVPTPNIDSLGHDGVSFANGYDGNATCAPSRA 119 Query: 142 TILTGQYSIHHG----------------------ILMPPMY-----GQPGGLQGLT---- 170 TI+TG+Y+ G I++P Y P G T Sbjct: 120 TIMTGRYATRFGFEFTPAPVAFEKMVGSEGAAGDIVLPRFYPDRLKAMPPGSTAPTPDAV 179 Query: 171 ----------TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEW 220 T+ QLL +GY T GKWH+G S+P+ GFD+ GF + MY Sbjct: 180 NELSMPASEITVAQLLKTRGYHTLHFGKWHLGGKAGSRPEQKGFDESLGFIAGGSMYLPE 239 Query: 221 RDVHV-NPEVALSPDRSEYIKQLPFSK--DDVHAVRGGEQQAIADITPKYMEDLDQRWMD 277 D V N + P LP++ + R G YM D D Sbjct: 240 GDPGVENAKQPWDPIDRFLWPNLPYAVQFNGSPMFRPG----------GYMTDY---LTD 286 Query: 278 YGVKFLDKMAKSDKPFFLYYGTRGCH---------FDNYPNAKYAGSSPARTSYGDCMVE 328 VK + A ++PFF+Y+ H +D P K YG + Sbjct: 287 EAVKAV--RANRNRPFFMYFAPNAIHTPLQATKADYDALPEIK----DHRLRVYGAMVRN 340 Query: 329 MNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP-PHGRTPFRGAKGSTWEGGVRVPT 387 ++ L + L++ G NTL++FTSDNG + P P+RG K + +EGG+ P Sbjct: 341 LDRNVGRLLQALKEEGLDQNTLVIFTSDNGGANYIGLPDINRPYRGWKATFFEGGIHSPF 400 Query: 388 FVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 F+ W +I + V D+F TA AG P +PK IDGVD F G Sbjct: 401 FMRWPAVIPANSRYSAPVGHIDIFATAAAAAGAP-------LPKDRVIDGVDLVPFVQG 452 >UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF13_PLALI Length = 461 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 120/456 (26%), Positives = 185/456 (40%), Gaps = 70/456 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAY- 131 A E + ++PN+++ L DD G + G +P TP ID++ G+ Y Sbjct: 23 ATTETTSERRPNILLILSDDCGHAEFSIQG------HPRYKTPHIDSIGKNGVHFRQGYV 76 Query: 132 SQPSSSPTRATILTGQYS--IHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGK 188 S SP+RA +L G+Y H +PP Y + GL + T LPQLL + GY T A+GK Sbjct: 77 SGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLPQLLKEDGYRTIALGK 136 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G + P GF D+ GF S Y P K Sbjct: 137 WHLGYAPQFHPMERGFTDYYGFLQGSRSY------------------------FPLKKPT 172 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 ++ AI + YM D D + ++ + +P+ +Y H N Sbjct: 173 RLNQMLRDRTAIPEEQFGYMTD---HLADEAIAYIKQW--QSQPWMMYLAFNATHSPNDA 227 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 A ++ Y + ++ + L++ G +TL++F +DNG H Sbjct: 228 TAVDLQAADGNKIYA-MTIALDRAVGKVLDALKECGLSKDTLVIFINDNGGAGG---HDN 283 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANL 427 G KGSTWEGG R+P V + I + D V DLFPT LD+AG A++ + Sbjct: 284 GSLHGKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGDAELKKI 343 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDEFKYHVLIQQPYAYTQ 485 +DG+ S G++ R + Y+ +GK A+R K Sbjct: 344 PFDPEKLDGI---SLIPRMTGKTQRLVDRPLYWKSGKRWAIRQGNLK------------- 387 Query: 486 SGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 +G Q +F+L +DP E ++ H Sbjct: 388 -----AVSGNDDQGDQVELFDLSSDPDEQRNLAATH 418 >UniRef50_A7RFN2 Predicted protein n=7 Tax=Eumetazoa RepID=A7RFN2_NEMVE Length = 512 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 113/364 (31%), Positives = 168/364 (46%), Gaps = 51/364 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KKP++V + DD+GW DV F+G G PTP+ID +A G+IL + Y P +PTR+ I Sbjct: 22 KKPHIVFIVADDLGWDDVSFHGSGQI---PTPNIDGLAKTGVILNNYYVSPICTPTRSAI 78 Query: 144 LTGQYSIHHGILMPPMY-GQPGGLQGL--TTLPQLLHDQGYVTQAIGKWHMGENK-ESQP 199 +TG+Y IH G+ + QP GL GL T +PQ L GY T +GKWH+G K E P Sbjct: 79 MTGKYPIHTGMQHSVILAAQPYGL-GLNETLMPQYLKRLGYATHGVGKWHLGFFKYEYTP 137 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD + G+ Y + H N E + + L S+ DV G Sbjct: 138 IQRGFDSYFGYWCGKGDYWD----HSNNE------KYGWGLDLHDSEQDVWTEWG----- 182 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDK--PFFLYYGTRGCHFDNY-------PNA 310 Y DL + K ++ ++ + P FLY + H N+ P+ Sbjct: 183 ------HYSSDL------FAEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDL 230 Query: 311 --KYAGSSPARTSYGDCMV-EMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EVP 364 K+ R MV M+ + +L+ +N++IVFT+DNG A + Sbjct: 231 IDKFKNIKDERRRIFAAMVSSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDSN 290 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAK 423 P RG K + WEGG+R F++ + +P R ++ ++D PT +AG Sbjct: 291 MASNFPLRGVKRTLWEGGIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAGGDIHD 350 Query: 424 VANL 427 + NL Sbjct: 351 LQNL 354 >UniRef50_B4D681 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D681_9BACT Length = 536 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 117/396 (29%), Positives = 175/396 (44%), Gaps = 79/396 (19%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDV-GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS- 135 L + PN++ L DD+G+ DV N G TP++D + G+I T A+S + Sbjct: 25 LPRAHAANPNIIYILCDDLGYGDVKCLNAEGKIA---TPNMDRLGKAGMIFTDAHSSSAV 81 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGL------QGLTTLPQLLHDQGYVTQAIGKW 189 SPTR I+TG+Y+ P G GGL QG T+ +L + GY T IGKW Sbjct: 82 CSPTRYGIITGRYNWRS----PLQSGVLGGLSPRLIEQGRMTVASMLKEHGYATACIGKW 137 Query: 190 HMGEN----------------------------KESQPQNVGFDDFRGFNSVSDM--YTE 219 H+G + ++ P +VGFD + G ++ DM YT Sbjct: 138 HLGMDWAKLPGKDVTELSVEKPDQVHNVDYAAPIKNGPNSVGFDYYYGISASLDMVPYTF 197 Query: 220 WRDVHVNPEVALSPDRSEYIKQLPFSK-DDVHAVRGG-------EQQAIADITPKYMEDL 271 + HV V + D+S PF++ + H R G + + +T K ++ + Sbjct: 198 IENDHVT--VLPTVDKS-----FPFTEGRESHPTRPGPAAPGFEPRDVLPTLTRKAVDYI 250 Query: 272 DQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMND 331 QR D A++ KPFFLY H P+A++ G S + Y D ++E + Sbjct: 251 GQRTND---------AQNGKPFFLYLPLNSPHTPIAPSAEWQGKS-GISPYADFVMETDW 300 Query: 332 VFANLYKTLEKNGQLDNTLIVFTSDNGPE-----AEVPPHGRTP---FRGAKGSTWEGGV 383 + + LE+ G DNT++ SDNG AE+ G P FRG K ++GG Sbjct: 301 AIGEVLRVLEEKGLADNTIVFMASDNGCSPSADFAELAEKGHHPSYVFRGHKADIFDGGH 360 Query: 384 RVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 +P V W I+ SD +V L D T D+ G Sbjct: 361 HIPFLVRWPAKIKAGSTSDQVVCLTDFMATCADVLG 396 >UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYA6_RHOBA Length = 490 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 121/420 (28%), Positives = 180/420 (42%), Gaps = 88/420 (20%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYSQPSSSPTRAT 142 PN VV DD G+ DVG G+P TP +DA+A G+ TS Y+QP P+RA Sbjct: 23 PNFVVIFTDDQGYEDVG------CFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAA 76 Query: 143 ILTGQYSI------HHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 ++TG Y + H + P ++ + T+ ++L +GY + GKW + ++ + Sbjct: 77 LMTGCYPMRVAERGHTKQIHPILH------EDEVTIAEVLKTKGYASACFGKWDLAKHAQ 130 Query: 197 S------QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 S P GFD F G P S D V Sbjct: 131 SGFFSDLLPTGQGFDYFYG--------------------------------TPTSNDRVA 158 Query: 251 AVRGGEQQAIADITPKY-MEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 + E+ I P+ M L +R+ D + F++K ++PFF+Y H + Sbjct: 159 NLYRNEEL----IEPESDMATLTRRYTDEAISFIEK--NQNQPFFVYIPHTMPHTRLDAS 212 Query: 310 AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA-------- 361 + G S R YGD + E++ + +L + DNT ++FTSDNGP Sbjct: 213 KDFKGKS-KRGLYGDVIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHAD 271 Query: 362 --EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDL 416 + HG + P R K ST+EGGVRVP ++ G + D I D+ PT L Sbjct: 272 GHRLGDHGGSAGPLRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAAL 331 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN-RKAEHYFLNGKLAAVRMDEFKYHV 475 AG +P IDG D F G +++ KA Y+L L AVR ++K H+ Sbjct: 332 AGAE-------IPTDRVIDGEDIRHLFHGEFDKADPDKAFFYYLRVHLQAVRQGKWKLHL 384 >UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=A4XED5_NOVAD Length = 462 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 123/431 (28%), Positives = 187/431 (43%), Gaps = 58/431 (13%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 Q LA K ++PN+V + DD+G+ D G + TP ID++ + G++L YS Sbjct: 22 QALAVTRKAAPERPNIVFIMADDLGYADTSATG---SRHIRTPAIDSIGAGGVMLRQGYS 78 Query: 133 Q-PSSSPTRATILTGQYSIHHGILMPPMYG--QPGGLQ---GLTTLPQLLHDQGYVTQAI 186 P SPTR +LTG Y+ I + G P G+ T+ ++ GY T + Sbjct: 79 STPICSPTRTALLTGCYAQRFAIGVEEPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLV 138 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 GKWH+GE P G+D F G Y R V + ++ Sbjct: 139 GKWHLGEPPAHGPLKHGYDHFLGIVEGGADYFVHRMVMSGKPAGVG-----------LAE 187 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--F 304 DD R G Y+ D+ + D V+ +++ ++PFFL H + Sbjct: 188 DDAQTDRTG-----------YLTDI---FGDEAVRVIEE--GGNQPFFLSLHFTAPHWPW 231 Query: 305 DNYPNAKYAGSSPARTSY--GDC-----MVE-MNDVFANLYKTLEKNGQLDNTLIVFTSD 356 + + K A + P+ Y G+ MVE M+ A + ++++G+ DNT++VFTSD Sbjct: 232 EGREDEKLARALPSSFHYEGGNLAKYREMVETMDQNVAKVLAAIDRSGKADNTVVVFTSD 291 Query: 357 NGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALD 415 NG E PF G KG EGGVRVP V W I+ +S+ ++ D PT L Sbjct: 292 NGGERFSDTW---PFVGHKGEVLEGGVRVPLMVRWPRRIKAGSRSEQVMVSMDFLPTLLG 348 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 +AG A++ DG D ++ G R F + AAVR + KY Sbjct: 349 MAGGDAARIGRF-------DGADLSAQLAGA-APVTRTLFWRFKASEQAAVRQGDMKYLR 400 Query: 476 LIQQPYAYTQS 486 + + Y + S Sbjct: 401 MAGKEYLFDLS 411 >UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI Length = 481 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 104/362 (28%), Positives = 163/362 (45%), Gaps = 59/362 (16%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 L + G+ ++V + DD+GW D+GF+ + TP+ID +A +GL+L Y Q Sbjct: 14 NLCDDVSAAGRPRHIVFIVADDLGWNDIGFHNPDII----TPNIDKLAREGLLLNHHYVQ 69 Query: 134 PSSSPTRATILTGQY----SIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGK 188 P SP+RA ++G Y + H +++ QP L +T LPQ L + GY T +GK Sbjct: 70 PLCSPSRAAFMSGYYPFKTGLQHSVILE---NQPVCLPLNITILPQKLKELGYATHIVGK 126 Query: 189 WHMG-ENKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 WH G + P GFD F G + ++ D YT Sbjct: 127 WHNGFCSWNCTPTYRGFDSFFGYYGAMEDYYT---------------------------- 158 Query: 247 DDVHAVRGGEQQAIADITPKYMED---LDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 H +RG + TP + ++ R+ D +++ +S +P FLY + + Sbjct: 159 ---HVIRGFLDYR-NNTTPVWTDNGTYSTLRFTDVATDIIERHNQS-QPLFLYLAYQAVY 213 Query: 304 FDNYPNAKYAGSSP-----ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 AKY P R + + +++ N+ KTL + G +D+TLI+FT+DNG Sbjct: 214 GPIEVPAKYEAMYPNIKSENRRKFSGMVSALDEAVGNVTKTLRQRGLMDDTLILFTADNG 273 Query: 359 PEAEVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDL 416 V G P RG+K + +EGG R F+Y G+ + DG++ D PT Sbjct: 274 --GGVDESGNNYPLRGSKFTVYEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAA 331 Query: 417 AG 418 AG Sbjct: 332 AG 333 >UniRef50_Q7UIU1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UIU1_RHOBA Length = 529 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 138/488 (28%), Positives = 208/488 (42%), Gaps = 70/488 (14%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATI 143 +PN+++ + DD+G DV + TP + +A +GL A++ S +PTR + Sbjct: 49 RPNIILVMADDLGIGDVSPTNPDCKIK--TPRLQQMADEGLTFLDAHTPSSVCTPTRYGL 106 Query: 144 LTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMG-----ENKE 196 LTG+Y+ + + G L TL LL GY T IGKWH+G KE Sbjct: 107 LTGRYNWRSRLAKGVLSGTSEHLIPGDRATLGHLLQGAGYHTAMIGKWHLGWDWHKNGKE 166 Query: 197 --------SQPQNVGFDDFRGFNSVSDM--YTEWRDVHVNPEVALSPDRSEYI--KQLPF 244 + P N GFD + G DM Y W D V P R E + KQ P+ Sbjct: 167 IDFSKPVLNGPDNNGFDQYYGHCGSLDMPPYV-WVDTGTPTSV---PTRKEGVTKKQNPY 222 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 + G+ I + P D + ++++ K DKPFFLY H Sbjct: 223 GWYRNGPI--GDDFEIEQVLPHLF--------DKSIAYVEERVKEDKPFFLYLPLPAPHT 272 Query: 305 DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG--PEA- 361 P + +S Y D +++M+ L + K G +NTL++FTSDNG PEA Sbjct: 273 PIVPVPPFKDAS-GMNPYADFVMQMDHHMGQLLDAISKAGIDENTLVIFTSDNGCSPEAN 331 Query: 362 --EVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKG-MIQPRKSDGIVDLADLFPTALD 415 E+ HG P +RG K +EGG RVP V W G ++ + ++ + L D++ T Sbjct: 332 FGELAKHGHDPSGKYRGHKADIYEGGHRVPFIVRWPGKVVAGKTTNALTCLTDVYATLQS 391 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 + P T DG D T F G + S+R+A G A+R D +K + Sbjct: 392 ITDQPRE-------ATGGEDGFDLTDVF-GGDDSSDREALVSHSIGGSFAIRRDSWKLCL 443 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQTAG------SSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 S GG++ A +F+L TDP E +S+ + + L Sbjct: 444 ----------SHGSGGWSNPREPKAKLQGLPPMQLFDLETDPAEKNSVAKENPEVVDSLL 493 Query: 530 TEMHAYME 537 ++ Y+E Sbjct: 494 LLLNEYVE 501 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 116/428 (27%), Positives = 179/428 (41%), Gaps = 78/428 (18%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN++ L DD+GW D+G G + + TP+ID AS + TSAY+ SP+R+T++ Sbjct: 25 KPNILFILADDMGWSDLGCYGADL---HETPNIDRFASGAVRFTSAYAMSVCSPSRSTLM 81 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGL---------------TTLPQLLHDQGYVTQAIGKW 189 TG+++ + Q GG + T+ L GY+T IGKW Sbjct: 82 TGKHAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKW 141 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G+ E P+ GFD G + T W P+S Sbjct: 142 HLGD-WEHYPEAHGFDINIGGTNWGAPQTFW---------------------WPYSGSGT 179 Query: 250 HAVRGGEQQAIADITPKY-MEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD--- 305 H G E + I + + E L R D +K +D D+PFF+Y H Sbjct: 180 H---GPEFRYIPHLEYGHPGEYLTDRLTDEAIKVIDHAG--DQPFFVYLAHHAVHTPIEA 234 Query: 306 -----NYPNAKYA-GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG- 358 + +AKY G + T Y E+++ + + L++ G NT+++F SDNG Sbjct: 235 KADDIQHFDAKYRDGMNHRHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDNGG 294 Query: 359 -------PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLF 410 +P P R KG+ +EGG+RVP + W G+ D V L D+ Sbjct: 295 YIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDML 354 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA------EHYFLNGKLA 464 T L + G P P T DG+D + + + NR A +Y ++ Sbjct: 355 QTFLHITGQP--------PATDATDGMDISPLLKDPSAKLNRDALFFHYPHYYHTTTPVS 406 Query: 465 AVRMDEFK 472 A+R ++K Sbjct: 407 AIRARDWK 414 >UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG53_9BACT Length = 515 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 110/367 (29%), Positives = 169/367 (46%), Gaps = 60/367 (16%) Query: 86 PNVVVFLLDDVGWMDV-GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATI 143 PN+++ L DD+G + NG G PTP +D + +QG+ T A+S + +PTR + Sbjct: 33 PNIILILADDMGIDSIQALNG---KSGIPTPHLDRLLTQGIHFTDAHSGSAVCTPTRYGV 89 Query: 144 LTGQYSIHHGIL--MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG--------- 192 LTG+Y+ + + + +P + TLP +L +GY T IGKWH+G Sbjct: 90 LTGRYAWRSRLKKSIVRQWERPLIEKDRLTLPGMLKKKGYNTACIGKWHLGWDWPKKGGG 149 Query: 193 -----------ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI-- 239 E E P GFD + G D W+ P V + R + Sbjct: 150 FTEKMKEIDFSEKIEGGPAGCGFDYYFG-----DDVPNWQ-----PFVWIENGRMLGVPN 199 Query: 240 KQLPFSKDDVHAVRG--GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 KQL F+ H+ +G E + + PK E V+++++ A++ +PFFLY+ Sbjct: 200 KQLSFA-SHYHSGKGIGVEGWDLEAVLPKITEK--------SVEYINQQAETKQPFFLYF 250 Query: 298 GTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 H P+ + G S + Y D ++E + + K L+ G DNTL++FT+DN Sbjct: 251 SMTSPHTPIAPSKPFQGKS-GISRYADFLMETDWCVGQIMKALKDRGIADNTLLIFTADN 309 Query: 358 G--PEA------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLAD 408 G P+ E + +RG K +EGG RVP V W G I+P KSD + L D Sbjct: 310 GTSPKCNFTELREKRTDLQNHWRGMKADAFEGGHRVPFIVSWPGHIKPGSKSDQTISLVD 369 Query: 409 LFPTALD 415 + T D Sbjct: 370 IMATCAD 376 >UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA Length = 512 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 108/387 (27%), Positives = 170/387 (43%), Gaps = 68/387 (17%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPT 139 +T PNV++ DD+G+ D+ + PTP +D +A G+ T +S +P+ Sbjct: 31 ETKTPPNVLILYADDLGYGDLNLQNAESKI--PTPHLDQLARSGMRFTDGHSSSGICTPS 88 Query: 140 RATILTGQYSIH--HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH------- 190 R +LTG++ HGI+ +G+ TLP++ GY T AIGKWH Sbjct: 89 RYALLTGRHHWRDFHGIVN--AFGESVFEPEQLTLPEMFQQHGYQTAAIGKWHLGWDWDA 146 Query: 191 --------MGENKESQ---------------PQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 GE ++ P GFD + G ++ W + + Sbjct: 147 IKKPDAKTFGEGRKKGYGPEAFDWTKSIPDGPLAHGFDSYFGDTVINFPPYCWIE---DD 203 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQ-------QAIADITPKYMEDLDQRWMDYGV 280 +V +PD + K+ R G Q I T + GV Sbjct: 204 KVVKAPDTIMDTAKWKPIKEGNWECRPGPMTSDWDPYQNIPTTTAR------------GV 251 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 +F++ +SD+PFFLY+ H PN ++ G S A YGD + E +D L + L Sbjct: 252 QFIESQKESDQPFFLYFAFPAPHAPIIPNDEFDGRSGA-GPYGDYVCETDDACGKLLRAL 310 Query: 341 EKNGQLDNTLIVFTSDNGPEA------EVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKG 393 +++GQ +NT+++F++DNGPE E H + PFRG K +EGG VP ++W G Sbjct: 311 KESGQSENTIVIFSADNGPERYAYARDEKYDHWSSQPFRGLKRDLYEGGHHVPFVIHWPG 370 Query: 394 MIQP-RKSDGIVDLADLFPTALDLAGH 419 + D +V D+F T ++ GH Sbjct: 371 VTDSGSTCDALVSQVDIFATLAEMLGH 397 >UniRef50_UPI00015B51A4 PREDICTED: similar to arylsulfatase b n=1 Tax=Nasonia vitripennis RepID=UPI00015B51A4 Length = 581 Score = 129 bits (323), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 110/389 (28%), Positives = 183/389 (47%), Gaps = 43/389 (11%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 P++V+ L DD+GW DV F+G PTP+IDA+A G+IL Y+ P +P+R+ ++T Sbjct: 35 PHIVIILADDMGWNDVSFHGANEI---PTPNIDALAYNGVILNKYYTMPICTPSRSALMT 91 Query: 146 GQYSIHHGILMPPMY-GQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNV 202 G+Y I G+ PM +P G+ ++ +P+ + GY T+ +GKWH+G E P Sbjct: 92 GRYPIRDGMQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYTPVRR 151 Query: 203 GFDDF----RGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD F GF S D + W D + L D S+ +L S + + E + Sbjct: 152 GFDTFFGYYNGFISYYDYWIGWNDTNEVTGYDLHRDESDSF-ELAHSSEYFTDLITDEAE 210 Query: 259 AI----ADITPKYMEDLDQRWMDYGVKFLD---KMAKSDKPFFLYYGTRGCHFDNYPNAK 311 I + P ++E + + G K D ++ ++D + ++Y + K Sbjct: 211 KIIRNNKNAKPLFLE-ISHLAVHAGSKVHDDPLEVRRTDD-----VNASFPYIEDYQHRK 264 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-- 369 YAG M +++ + K L++ L+N++I+F SDNG V + T Sbjct: 265 YAG----------MMAALDESVGRVVKALKEAEMLENSIIIFMSDNGAPT-VGLYNNTGS 313 Query: 370 --PFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLFPTALDLAGHPGAKVA 425 P RG KG +EG R ++ +I+ R S+ ++ + D PT AG + Sbjct: 314 NYPMRGIKGGMFEGAARAAACIF-SPLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQ 372 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 + +DGV Q S + G S+R++ Sbjct: 373 SQFDGALPLDGVSQWSSIVA-GGPSSRQS 400 >UniRef50_UPI0000588E05 PREDICTED: similar to steroid sulfatase n=3 Tax=Strongylocentrotus purpuratus RepID=UPI0000588E05 Length = 596 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 114/425 (26%), Positives = 181/425 (42%), Gaps = 70/425 (16%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSP 138 E KPN++VFL+DDVG D+G G TP+ID +A +G LT + P +P Sbjct: 25 ENGERTKPNIIVFLMDDVGMGDIGCFGNTTI---NTPNIDQLAKEGAKLTQHIAHPICTP 81 Query: 139 TRATILTGQYSIHHGIL------MPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHM 191 +RA ++TG+Y+I G+ + P GL TT+ +++ D GY T IGKWH+ Sbjct: 82 SRAALMTGRYAIRSGMTSFHIMRVISFISAPAGLPSNETTIAEVVKDVGYSTALIGKWHL 141 Query: 192 G-----ENKESQPQNVGFDDFRG--FNSVSD-----MYTEWRD----------------V 223 G E+ S P + GFD F G ++ D ++ WR Sbjct: 142 GFMCEKEDDCSDPNSQGFDYFYGLPLTNILDCGHGTVFEAWRKNFYFEVTVTMVVLVLAT 201 Query: 224 HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT----------PKYMEDLDQ 273 + ++ R+ + S V + + A+ + P +L Q Sbjct: 202 FILVMYSIVGQRTLVAVVVASSSFYVFCLVMPQLIAMMNCVIVENHNVVEQPLSYTNLTQ 261 Query: 274 RWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVF 333 R + + FL++ ++PF L H + Y + S YG + E++ Sbjct: 262 RHTQHALDFLEE--HKEEPFLLVMSFLQAHTELYAEPHFLDRS-QHGIYGAAVEELDWSV 318 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT---------PFRGAKGSTWEGGVR 384 + L + G D+T + TSDNG A V + R+ ++G K + +EGG+R Sbjct: 319 GEIMGALHRMGVADDTFVYLTSDNG--AHVEEYTRSGEREGGSNGIYKGGKANCFEGGIR 376 Query: 385 VPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFF 443 VPT V + G++ P + DL PT +AG G + +P+ IDG D Sbjct: 377 VPTIVRYPGVVPPESEVSEPTSIVDLLPT---IAGLTGGE----IPRDRIIDGKDIMPLL 429 Query: 444 LGTNG 448 G G Sbjct: 430 QGQEG 434 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 103/361 (28%), Positives = 153/361 (42%), Gaps = 68/361 (18%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 ++PN+V+F +DD+GW DVGF G TP ID +A + + T+AYS P+ +P+RA Sbjct: 28 QRPNIVLFFIDDLGWRDVGFMGSDFF---ETPHIDRLADESMKFTAAYSAAPNCAPSRAC 84 Query: 143 ILTGQYSIHHGIL--------------MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 +++G Y+ HG+ + P TT+ L GY ++GK Sbjct: 85 LMSGLYTPRHGVYTVGDPARGNDRYRKLIPAENNRVLDDRFTTIADRLSQAGYRCASVGK 144 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G++ SQ GF N + + NP+++ Sbjct: 145 WHLGQSPLSQ----GFQVNIAGNQTGSPRGGYFSPYQNPQLS------------------ 182 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD--- 305 GEQ E L R +F+ S PFFLY H Sbjct: 183 -----DGEQG----------EFLTDRLTTAACQFIKDNQGS--PFFLYLTHYAVHTPLQA 225 Query: 306 -----NYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Y +K AG +Y + M+ + +TL + NT++VFTSDNG Sbjct: 226 KKEDIAYFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLREQQLDQNTIVVFTSDNGGY 285 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG-IVDLADLFPTALDLAGH 419 P P RG+KG +EGG+RVP + W G+ QP + G V DL+PT L++ Sbjct: 286 G--PATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTGEAVINVDLYPTFLEMTNI 343 Query: 420 P 420 P Sbjct: 344 P 344 >UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 Length = 495 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 110/369 (29%), Positives = 163/369 (44%), Gaps = 59/369 (15%) Query: 72 QQKL-AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 QQ L + +++ + P+++ + DD GW DVGF+G + TP++D +A G L Sbjct: 33 QQDLPVAVAQQSSQPPHILFIMSDDQGWKDVGFHGSDIR----TPNLDQLAKTGARLEQY 88 Query: 131 YSQPSSSPTRATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 YSQP +P+RA +LTG+Y +G+ L+ P G+ G LPQ L + GY T +GK Sbjct: 89 YSQPMCTPSRAALLTGRYPHRYGLQTLVIPSAGKYGLPTDEYLLPQALKEAGYETAIVGK 148 Query: 189 WHMGE-NKESQPQNVGFD-DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 WH+G + + P+ GFD + D +T H + ++ + K Sbjct: 149 WHLGHADPKYWPRQRGFDYQYGPLLGEIDYFTH--SAH---------GKVDWYRNNQLIK 197 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 ++ + Q A VK ++K P FLY H Sbjct: 198 EEGYVTTLLGQDA--------------------VKLIEKH-NPKTPLFLYLAFTAPHAPY 236 Query: 307 YPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN-GPE 360 KY + P R +Y + M+D + LEK G +NTLIVF SDN GP Sbjct: 237 QAPQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTLIVFQSDNGGPR 296 Query: 361 A-----EVPPHGRT------PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLAD 408 + EV G T P+R K S +EGG RV W G IQP + + + D Sbjct: 297 SAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPGTVVNHPIHIVD 356 Query: 409 LFPTALDLA 417 ++PT LA Sbjct: 357 MYPTLTGLA 365 >UniRef50_C3Q8V4 Arylsulfatase B n=6 Tax=Bacteroides RepID=C3Q8V4_9BACE Length = 498 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 103/343 (30%), Positives = 156/343 (45%), Gaps = 35/343 (10%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PN+V+ L DD+GW DVGF+G + TP +DA+ +G+ L Y+ P S+PTRA + Sbjct: 64 ERPNIVIVLADDLGWGDVGFHGSEIK----TPSLDALVGEGVELERFYTSPISTPTRAGL 119 Query: 144 LTGQYSIHHGI---LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQA-IGKWHMGENKESQ- 198 +TG+Y G+ ++PP + + G + T+ +L GY +A IGKWH+G K+ Sbjct: 120 MTGRYPNRFGVRSAVIPP-WREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHY 178 Query: 199 PQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P N GF F G N D + R+ + ++ D ++ Q Sbjct: 179 PMNRGFSHFYGHLNGAIDYFDLTREGEL-----------DWHNDWETCHDKGYSTELITQ 227 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 +AI I E ++ Y A+ +K LY DN+ + Sbjct: 228 EAIRCIDAYEKEGPFMLYVAYNAPHTPLQAQ-EKDIKLYT-------DNFDSL--TPKEQ 277 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGS 377 + +Y + M+ + L+K G +DNT +F SDNG A VP P RG K Sbjct: 278 KKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG-TAGVPGSSSGPLRGHKFD 336 Query: 378 TWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTALDLAG 418 W+GG P +YWK + K S + DL PT DL G Sbjct: 337 EWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 379 >UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3 Tax=Deuterostomia RepID=UPI0000586CBA Length = 596 Score = 128 bits (321), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 107/371 (28%), Positives = 170/371 (45%), Gaps = 55/371 (14%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 ++ TGK P++V + DD GW DVG++ + TP++D +AS+G+ L + Y QP S Sbjct: 91 IKGATGKPPHIVFIVADDYGWFDVGYHNSTIK----TPNLDLLASRGVKLENYYVQPICS 146 Query: 138 PTRATILTGQYSIH----HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG 192 P+R+ ++TG+Y IH H +++ P QP L TTLPQ L + GY T +GKWH+G Sbjct: 147 PSRSQLMTGRYQIHTGLQHFVIIAP---QPNCLPLNETTLPQKLKESGYATHLVGKWHLG 203 Query: 193 ENK-ESQPQNVGFDDFRGFNS-VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 K E P GFD G+ S + D +T +R P + H Sbjct: 204 FYKNECMPLQRGFDSSFGYLSGMQDYWTHFRS-----------------GSFPGFPEGNH 246 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 + G + + +Y + Q + + + ++P FLY + H Sbjct: 247 WL-GIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHGPLQVPE 305 Query: 311 KYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 KY R +Y + M++ + +L++ G ++T++VFT+DNG Sbjct: 306 KYMKPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTP---- 361 Query: 366 HGRT----PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVD-----LADLFPTALDL 416 G++ P RG K + WEGGV F+ G + P G V ++D FPT ++ Sbjct: 362 -GKSGNNWPLRGTKNTLWEGGVHGVGFI--TGPMIPAGVQGTVSKHFMHISDWFPTLIE- 417 Query: 417 AGHPGAKVANL 427 G G A L Sbjct: 418 -GVAGGNTAGL 427 >UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR02_9SPHI Length = 595 Score = 128 bits (321), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 130/489 (26%), Positives = 204/489 (41%), Gaps = 92/489 (18%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYSQPSSSPTRAT 142 PNV++ L DD G D+G +G NP TP+ID Q + LT + P +PTRA Sbjct: 29 PNVILILTDDQGIGDLGCHG------NPWLKTPNIDKFYEQSVRLTDFHVSPLCTPTRAA 82 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 I+TGQY I +G G+ +G T+ + GY T GKWH+G+N +P + Sbjct: 83 IMTGQYPIRNGA-WATYKGRDALSKGQLTMADVFKSAGYSTALFGKWHLGDNYPVRPSDS 141 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSP-DRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD HV +A + S+Y F DDV+ V +Q Sbjct: 142 GFD------------------HVVQHLAGGIGELSDYWGNSYF--DDVYYVNNQPKQFQG 181 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-----FDNY--PNAKYAG 314 Y D+ W +KF+++ K ++PFF+Y H + Y P K+ G Sbjct: 182 -----YCTDV---WFSEAMKFINQQEK-EQPFFIYLPLNAPHDPLIVDEKYAAPYKKFEG 232 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF--- 371 S + + +++ F K L+K G NT++++ SDNG G+ + Sbjct: 233 SEIIDANLYGMIANIDENFGKFRKFLKKKGLDKNTILIYMSDNGTRFGYSRDGKLGYNYH 292 Query: 372 -RGAKGSTWEGGVRVPTFVYWK--GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 +G KG +EGG RVP F+ W G+ + + DL PT L G P + Sbjct: 293 LKGMKGDKFEGGHRVPFFIQWMDGGIEGGKDIRSLSAHVDLIPTLAKLCGIP-------L 345 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-------LNGKLAAVRMDEFKYHVLIQQPY 481 PK DG+D + +R + L K V +E++ LI Sbjct: 346 PKNQAFDGIDLSGVLTKNEKPKDRSVFVHHRQDWRPPLQEKGTCVLKNEWR---LI---- 398 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +GYQ ++N+ TDP ++ ++ + + L E ++ + K Sbjct: 399 ----NGYQ--------------LYNMKTDPLQTTNVAEENKELVEALLEENKSFYQQTKT 440 Query: 542 YPPRAQIKS 550 YP ++ S Sbjct: 441 YPTFYELPS 449 >UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC8_9BACE Length = 493 Score = 128 bits (321), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 124/432 (28%), Positives = 183/432 (42%), Gaps = 62/432 (14%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 AE +K +KPN++ FL+DD+G D+ G TP+ID +A+ G++ T+ Y Sbjct: 20 CAEQKKVEEQKPNIIYFLVDDMGMGDLSLTG---QKKYETPNIDKLAADGMLFTNHYCGT 76 Query: 135 S-SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKW 189 + S P+RA ++TG+++ H + G G Q L TL +L GY T IGKW Sbjct: 77 TVSGPSRACLMTGKHTGHTSV-----RGNQPGPQLLGDNEATLASVLKGAGYKTAVIGKW 131 Query: 190 HMGENKE-SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI--KQLPFSK 246 +G PQ GFD G+ ++ W + PE E + +L ++ Sbjct: 132 GIGHPIPLDDPQRKGFDLSYGYLNM------WHAHNCFPEFLYRNGVKEELTGNKLALAE 185 Query: 247 DDVHA---VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR--- 300 D + + G A D +Y DL ++ +KF+ K+ PFF+YY Sbjct: 186 DGTNPWADMPEGTGVARMDARKQYAPDLFEK---EALKFISDNKKN--PFFIYYALNLPH 240 Query: 301 --------GCHFDNYPNAKYAGSS--PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 GC +Y NA A + M ++ +L LEK G DNT+ Sbjct: 241 ANNEAAPNGCEVPSY-NADIAAKDWPEVEKGFAQMMQIIDKQVGDLVAYLEKEGLADNTI 299 Query: 351 IVFTSDNGPEAEVPPH-----GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIV 404 I+F SDNGP E RG K W+GG+R P V W G ++ S+ + Sbjct: 300 IMFASDNGPHQEGGHKVDFFDSNADLRGKKRDMWDGGIRTPFIVKWPGKVKAGSTSNHLS 359 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF---LNG 461 D+ PT D+A V K IDG+ LG + + YF G Sbjct: 360 AFWDVLPTFCDIAK---------VEKPAGIDGLSLLPTLLGDTAKQEKHKYLYFEFYEEG 410 Query: 462 KLAAVRMDEFKY 473 AV D +KY Sbjct: 411 GKQAVVADNWKY 422 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 128 bits (321), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 113/388 (29%), Positives = 170/388 (43%), Gaps = 60/388 (15%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATIL 144 PNV++ + DD+G+ DVG G TP++DA+A +G T +S P +PTRA +L Sbjct: 19 PNVILIVADDMGYGDVGCYGNQHI---KTPNLDALAKKGARFTDFHSNGPLCTPTRAALL 75 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TG Y G+ + P + + ++ T + L GY T +GKWH+G+ P Sbjct: 76 TGCYQQRVGLHIIPKDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLGDRPAFLPP 135 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD++ G DM+ WR + P LP +RG E I Sbjct: 136 RQGFDEYFGIPYSHDMHP-WRK-------SFPP--------LPL-------MRGEE---I 169 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG------ 314 ++ P ++ L Q + VKF+ K D+PF LY H + + ++A Sbjct: 170 VELNPD-LDHLTQYCTEEAVKFISK--NKDRPFLLYMPHPMPHQPVHVSERFAKRFSKEQ 226 Query: 315 --------SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 + Y + E++ + K + G ++T + FTSDNGP Sbjct: 227 LAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVRALGIEESTFVAFTSDNGPAI----G 282 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPT--ALDLAGHPGAK 423 P RG K WEGG RVP YW+ I+P D I DLFPT A+ A P K Sbjct: 283 SAGPLRGKKRELWEGGHRVPFIAYWQEKIRPGVVIDEIAMSMDLFPTMAAMGRAPLPRKK 342 Query: 424 V--ANLVPKTTFIDGVDQTSFFLGTNGQ 449 + NL+P D + + + F + G+ Sbjct: 343 IDGVNLLPLLCEGDKLSERTVFWRSKGK 370 >UniRef50_B5CWB1 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWB1_9BACE Length = 536 Score = 128 bits (321), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 121/428 (28%), Positives = 180/428 (42%), Gaps = 52/428 (12%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 E +KPN++ L DD+ + D+ G TP +D++A QG+ + AY+ P S+ Sbjct: 34 EAHKTEKPNIIFVLADDMSYRDLSCYG---QQRYSTPHLDSLAMQGVRFSQAYAAAPESA 90 Query: 138 PTRATILTGQYSIHHGILM-PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENK 195 P+R +LTG ++ H + M GQ L T+ ++L + GY T +GKW +G + Sbjct: 91 PSRCCMLTGLHTGHSSVRMNSSARGQDNILDSDVTVAEVLKEAGYHTAFVGKWGIGLQGT 150 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS-EYIKQLPFSKDDVHAVRG 254 E P GFD GF ++ +T P+ DR Y + F + +G Sbjct: 151 EGVPYKQGFDYCFGFYDQTEAHT------YIPDYLYENDRKVMYPQNKGFEMARRYDYKG 204 Query: 255 GEQQA---------IADITPKYMEDLDQRWMDYG-VKFLDKM--AKSDKPFFLYYGTRGC 302 + Q I+++ Y + M+ + FL + AK PFFLYY T+ Sbjct: 205 NKAQNTYDKDGCLYISELKDPYGYAYSENEMEKAAMNFLKRQTEAKDKSPFFLYYATQLP 264 Query: 303 H----FDNYPNAK-YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 H D K + + + +++++ L L++ GQ +NT+I F SDN Sbjct: 265 HGPVIVDELGEMKDHPEVNQLSREWAAMVMKLDSFVGKLVAYLKQTGQYENTIIFFASDN 324 Query: 358 G------------PEAEVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI 403 G P P R PFRG K + EGG+RVP FV +P Sbjct: 325 GYSMCGYTERGNGPSWPDDPWLRNKGPFRGGKFTAQEGGLRVPFFVSCPSKFKPEVISTP 384 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL 463 V L D FPTA ++AG L P DG+ G YF GK Sbjct: 385 VWLPDFFPTAAEIAG--------LDPHARRTDGISLLPLLKGEKENYTGHKYLYFSRGKE 436 Query: 464 AAVRMDEF 471 VRM F Sbjct: 437 QGVRMGAF 444 >UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY9_9BACT Length = 590 Score = 127 bits (320), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 105/353 (29%), Positives = 166/353 (47%), Gaps = 74/353 (20%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+V+ L DD G+ D+ +G + TP +D +A G + + +PTRA++L Sbjct: 25 KPNIVLILTDDQGYGDISSHGNRMI---DTPHLDQLAEDGTRFENFFVSNVCAPTRASLL 81 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 TG+Y I G++ GL+ + T+ ++ QGY T GKWH GE+ + P Sbjct: 82 TGRYHIRTGVVQVSR-----GLEIMRSEEATIAEVFKAQGYETGLFGKWHNGEHYPNNPP 136 Query: 201 NVGFDDFRGFNS--VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD++ GF + + D + + L +++ ++K F Sbjct: 137 GQGFDEYFGFCAGHIGDFF----------DATLDHNKT-FVKTKGF-------------- 171 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY---------YGTRGCHFDNYPN 309 I D+ D + +++K + DKPFF Y Y ++D + Sbjct: 172 -ITDVL-----------TDRAIDWIEK--QQDKPFFAYIPYNAPHAPYQVEDKYYDEFAA 217 Query: 310 AKYAGSSPARTSYGDCMVE-MNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 Y+ + A +YG M+E ++D L K L+ DNT+++F +DNGP + P Sbjct: 218 KGYSAAHSA--AYG--MIENLDDNIGRLLKILDDLNLTDNTIVIFLTDNGPNS--PTRFN 271 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA---DLFPTALDLAG 418 +G+KGS EGGVRVP F+ W G I K I DLA D+ PT ++LAG Sbjct: 272 GGMKGSKGSVDEGGVRVPFFIRWPGKIA--KGRTIHDLAAHIDVLPTLMELAG 322 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 127 bits (320), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 113/403 (28%), Positives = 169/403 (41%), Gaps = 55/403 (13%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 A+L K+PN++ + DD+G+ + G GG PTP+ID + + G+ +S Y S Sbjct: 17 CAQLAIAAPKRPNILFIVADDLGYGEPGCYGGKDI---PTPNIDKLVASGVRFSSGYVSA 73 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQ---PGGLQGL----TTLPQLLHDQGYVTQAI 186 P + +RA ++TG+Y G P+ + PG GL T+ L D GY T + Sbjct: 74 PFCAASRAALMTGRYQTRFGFEYNPIGAKNADPG--TGLPVNEKTVADRLRDVGYATGLV 131 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMY--------TEWRDVHVNPEVA----LSPD 234 GKWH+G PQ GFD+F GF Y T W P+ + SPD Sbjct: 132 GKWHLGGTAPFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPD 191 Query: 235 -----RSEYIKQLPFSKDDVHAVRGGEQ-QAIADITPKYMEDLDQRWMDYGVKFLDKMAK 288 ++ + P D +R + + A++T + + F+D+ Sbjct: 192 GHTVWSTDLHENEPAYDADNPLLRNSQPVEEKANLTDAFTRE--------ACSFIDR--H 241 Query: 289 SDKPFFLYYGTRGCHF-----DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 +P+FLY H D Y R + + +++ + L + Sbjct: 242 QAQPWFLYLAYNAVHSPLQGEDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRAD 301 Query: 344 GQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DG 402 G +NTL+VF SDNG + P RG KG W+GG+R+P V WKG I + D Sbjct: 302 GLEENTLVVFLSDNGGPTKELTSSNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDA 361 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 DL TAL LAG + +DGVD G Sbjct: 362 PAISMDLTATALKLAGAETEQAK--------LDGVDLLPLLTG 396 >UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLE2_9BACT Length = 441 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 127/467 (27%), Positives = 187/467 (40%), Gaps = 83/467 (17%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATIL 144 PN+++ L DD G D G + TP ID++A G+ T AY+ S SP+RA +L Sbjct: 21 PNIIIILADDAGSSDFSCYGSKQLL---TPHIDSIAHNGIKFTQAYTASSVCSPSRAGLL 77 Query: 145 TGQYSIHHGILM-----------PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 TG+Y G L P + G P TL L + GY T IGKWH+GE Sbjct: 78 TGRYQQTFGHLANIPHSKHSANDPELLGLP---VTEITLADSLKELGYSTHCIGKWHLGE 134 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 P GFD+F GF S + Y + E+ DR +R Sbjct: 135 ADHFHPNARGFDNFYGFLSGARTY------FLGGELRGDMDR---------------IMR 173 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH----FDNYPN 309 E A+ + Y ++ + ++ + + + DKPFF+Y H + Sbjct: 174 NKE---FAEPSSGYTTEV---FTQEAIRIIQE--EQDKPFFIYLSHNAVHGPMDAKDEDI 225 Query: 310 AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT 369 Y +P R Y M ++D L + L+ + Q +NTLI F SDNG Sbjct: 226 MSYDFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGPTTHNGSSNW 285 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P RG KGS +EGG R P + W I SD + D+F T + AG LV Sbjct: 286 PLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATCIQAAG------GELV 339 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 T+ G+D + RK ++ GK ++R ++K ++L Sbjct: 340 TDRTY-HGIDLLPVINKPQETNARKL--FWSRGKNYSMRQGKWKLNIL------------ 384 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 GSS++NL D E + + + L EM + Sbjct: 385 ----------PTGSSLYNLENDQSEKHDLSEQFPEIKAQLIKEMSKW 421 >UniRef50_UPI0000E46777 PREDICTED: similar to arylsulfatase J n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46777 Length = 588 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 108/369 (29%), Positives = 170/369 (46%), Gaps = 44/369 (11%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 P+V++ + DD+G+ DVG++ TP+ID +A G+ L + Y QP +PTR+ ++T Sbjct: 94 PHVIMIIADDLGYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLIT 153 Query: 146 GQYSIHHGILMPPMY-GQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGEN-KESQPQNV 202 G+Y IH G+ ++ G+P L TTL Q L QGY T A+GKWH+G K+ P Sbjct: 154 GRYQIHTGMQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRR 213 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD 262 GF+ F F ++ W + + AL D+ L K + R + Sbjct: 214 GFESF--FGNIMGSADHW----SHNKTALFGDK------LVMGKSMYYNERIYWKHEGTF 261 Query: 263 ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG------SS 316 T Y Q L + +KP FLY H +YA + Sbjct: 262 STTLYTNRARQ---------LIRKQPRNKPLFLYLSYEAVHTPLNVPEQYAKPYEGIIHN 312 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAK 375 R Y + +++ N+ + L+ NG DN++I+FT+DNG + G P RG K Sbjct: 313 SKRRRYAGLVNILDEAVRNVTEALKYNGLYDNSVIIFTTDNGGRPKPRSVGNNWPLRGGK 372 Query: 376 GSTWEGGVRVPTFVY-----WKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 + WEGG+R FV+ W+ ++ + ++ ++D FPT + G G K+ P Sbjct: 373 STLWEGGIRGVGFVHSPLIPWE--LRGTVNRQLIHVSDWFPTI--VXGIAGGKLVTNKP- 427 Query: 431 TTFIDGVDQ 439 +DG Q Sbjct: 428 ---LDGXHQ 433 >UniRef50_A4CJK0 Arylsulfatase A n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CJK0_9FLAO Length = 516 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 124/440 (28%), Positives = 184/440 (41%), Gaps = 74/440 (16%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRAT 142 +PN+V+ DD+G+ D G G A TP ID++A+ GL T Y S + +P+R Sbjct: 35 SRPNIVIIYADDLGFGDTGAYG---ATEIQTPHIDSLAAGGLRFTRGYASSATCTPSRYA 91 Query: 143 ILTGQYSIHHG---ILMPPMYGQPGGLQGL-----TTLPQLLHDQGYVTQAIGKWHMG-- 192 +LTGQY IL PG L TLP LL GY T +GKWH+G Sbjct: 92 LLTGQYPWRKEKARIL-------PGNAPLLIDTAQATLPGLLRQAGYRTGIVGKWHLGLG 144 Query: 193 -------ENKESQPQNVGFDDFRGFNSVSDM-----YTEWRDVHVNPEVALSPDRSEYIK 240 + P VGF++ + D + V + P+ + E Sbjct: 145 TGAVDWNQAIRPGPNEVGFEESFILAATQDRVPTVYIRNGQVVGLEPDDPIQVSYEENFP 204 Query: 241 QLPFSKDDVHAVR----GGEQQAIADITPK--YME-DLDQRWMDYGVK--FLD------K 285 P + D V+ G +I + P+ +M+ RW+D + FL + Sbjct: 205 GEPTALDHPELVKMGWDHGHNNSIVNGIPRIGFMKGGQAARWVDEDMADTFLKEAQVFIR 264 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 + PFFL+Y + H P+ ++ G++ GD + E + + TL+ G Sbjct: 265 ERDPEAPFFLFYSLQQPHVPRTPHPRFVGAT-DLGPRGDAIFEADWCIGQILATLQDEGL 323 Query: 346 LDNTLIVFTSDNGP-------EAEVPPHG----RTPFRGAKGSTWEGGVRVPTFVYWKGM 394 L NTL++F+SDNGP + V +G P+RG K S +E G RVP V W G Sbjct: 324 LTNTLVIFSSDNGPVLNDGYLDQAVERNGGHSPWGPYRGGKYSLFEAGTRVPFIVSWPGT 383 Query: 395 IQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 I P SD +V DL + L G P G D + + +G+S+ Sbjct: 384 IAPGVSDAMVSQIDLLASLAHLTGVPDP-------------GTDSQNIWPALSGRSDAGR 430 Query: 455 EHYFLNG-KLAAVRMDEFKY 473 EH L A R ++ Y Sbjct: 431 EHMVLEATSRTAFRTRDWVY 450 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 111/410 (27%), Positives = 172/410 (41%), Gaps = 57/410 (13%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 K+PN+V DD G+ D GF G TP++D +AS+G+ T Y S + P+RA Sbjct: 24 AKQPNIVFLFSDDAGYADFGFQGSETM---KTPNLDQLASEGVRFTQGYVSDSTCGPSRA 80 Query: 142 TILTGQYSIHHG---ILMPPMYGQPGGLQGL--------TTLPQLLHDQGYVTQAIGKWH 190 I+TG+Y G I +P + ++G T+ + GY T GKWH Sbjct: 81 GIMTGRYQQKFGYEEINVPGYMSEHSAIKGAEMGIPLDEVTMGDYMKSLGYRTAFYGKWH 140 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI---KQLPFSKD 247 +G E P + GFD+F GF Y + VN +P+R + K+L D Sbjct: 141 LGGTDELHPMHRGFDEFYGFRGGDRSYWAY---EVN-----APERKSAVFTDKKLEHGID 192 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--FD 305 G +A+ +++E DKPFF++ H + Sbjct: 193 QFQEHEGYLTDVLAEKANQFIE-----------------KAPDKPFFIFLSFNAVHTPME 235 Query: 306 NYPN--AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 P AK+ R + ++ + L++ G D+TL+VF++DNG + Sbjct: 236 ATPEDLAKFPQLKGKRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPTDK 295 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGA 422 P G K + EGG+RVP V W + K D V DL PT G G Sbjct: 296 NASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKAGG--GE 353 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 +V + +DGVD + G N ++ ++ Y+ AA+R ++K Sbjct: 354 EV------MSELDGVDLMPYITGQNNKAPHES-MYWKKETRAAIRQGDWK 396 >UniRef50_Q9VVM4 CG7402 n=10 Tax=Drosophila RepID=Q9VVM4_DROME Length = 579 Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 114/390 (29%), Positives = 173/390 (44%), Gaps = 51/390 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+V+ L+DD+G DV F+G + TP+IDA+A G++L Y +P+RAT+L Sbjct: 27 KPNIVIILIDDMGMNDVSFHGSNQIL---TPNIDALAYNGILLNKHYVPNLCTPSRATLL 83 Query: 145 TGQYSIHHGIL-MPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQN 201 TG+Y IH G+ + +P GL Q +P++ D GY T +GKWH+G K+ P Sbjct: 84 TGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTM 143 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD G+ + Y ++ D V DR+ Y L F +D A Sbjct: 144 RGFDHHFGY---YNGYIDYYDHQVR-----MLDRN-YSAGLDFRRDLEPCPEANGTYATE 194 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN------YPNAKYAG- 314 T + ++Q DK KP F+ H N P + A Sbjct: 195 AFTSEAKRIIEQH---------DK----SKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKF 241 Query: 315 ---SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVPPHGRT- 369 P R +Y + ++ A L+ NG L+N++I+ SDNG P + + + Sbjct: 242 PHIRDPKRRTYAGMISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSN 301 Query: 370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTALDLAGHPGAKVAN 426 P+RG K S WEGG+R + W +++ R S+ + D PT LAG G + Sbjct: 302 YPYRGQKESPWEGGIRSAGAL-WSPLLKERGYVSNQAIHAVDWLPT---LAGAAGVSLPQ 357 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEH 456 +P +DG++ G R H Sbjct: 358 DLP----LDGINLWPMLSGNEEPKPRTMIH 383 >UniRef50_A4AM21 Arylsulfatase A n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AM21_9FLAO Length = 535 Score = 127 bits (318), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 133/522 (25%), Positives = 219/522 (41%), Gaps = 89/522 (17%) Query: 81 KTGKKPNVVVFLLDDVGWMDV-GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSP 138 K K PN+V L DD+G+ D+ FN G TP+ID +A G+ T A++ + +P Sbjct: 31 KKQKPPNIVYILADDLGYGDISAFNAEGKI---QTPNIDNLAKDGMKFTDAHTSSAVCTP 87 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMG---- 192 TR ILTG+Y+ I + G+ L TT+ L D GY T IGKWH+G Sbjct: 88 TRYGILTGRYNWRSPIKSGVLTGKSEALIPNSRTTVASFLSDNGYKTGFIGKWHLGWDWA 147 Query: 193 -----------------ENKE------SQPQNVGFDDFRGFNSVSDM--YTEWRDVHVNP 227 EN + + P ++GFD G + DM Y + Sbjct: 148 IKDSTNNGGEGWNATDFENLDFTKPVTNTPNDLGFDYAYGHSGSLDMAPYVYVENGMATA 207 Query: 228 EV-ALSPDRSEYI--KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLD 284 +V ++ D+ +Y ++ P + D VH ++TP + + F+ Sbjct: 208 KVDTVTVDKGKYTWWREGPTAADFVHD----------EVTPNFFRK--------SMSFIK 249 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 + ++PFFLY H P ++ G S Y D +V ++D L + LE+ G Sbjct: 250 EQGAEEQPFFLYLALPSPHTPILPTEEWQGKSNLN-PYADFVVMIDDYLGQLVEVLEQKG 308 Query: 345 QLDNTLIVFTSDNG--PEAEVPPHG------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 +NT+++FTSDNG P+A+ G +RG K +EGG R+P V W I+ Sbjct: 309 LAENTIVIFTSDNGCSPQADFKILGDLGHDPSAIYRGHKADIYEGGHRIPFVVKWPSKIE 368 Query: 397 PRK-SDGIVDLADLFPTA-----LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS 450 SD + DL T +DL + G +++P +D D+ F Sbjct: 369 SGSVSDKTICTTDLLATVADILNVDLLDNQGEDSFSILP---LLDTTDKREF-------- 417 Query: 451 NRKAE-HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 R+A H+ +NG A + + ++ + +G + + +++L Sbjct: 418 KREATVHHSINGSFALRKANWKMIFCTGSGGWSDPKPNSEG-----IEELPKFQLYDLAN 472 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 DP E ++ H + L M Y++ + P + Q + Sbjct: 473 DPSEQTNLFGHHPDIEGQLSELMLDYIDDGRSTPGKKQTNEE 514 >UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA9_RHOBA Length = 474 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 121/394 (30%), Positives = 171/394 (43%), Gaps = 62/394 (15%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATIL 144 PNV++ + DD GW DVGFNG V TP++DA+AS G+ Y + P SPTR + L Sbjct: 33 PNVILLMSDDQGWGDVGFNGNEVV---QTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCL 89 Query: 145 TGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKE------- 196 TG+Y GIL GG++ G T+ ++L +GY T GKWH+G K Sbjct: 90 TGRYPFRFGILA----AHTGGMRVGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRG 145 Query: 197 --SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR---SEYIKQLPFSKDDVHA 251 S P + GFD++ F + S + T W D + P+ S + P+ VH Sbjct: 146 FYSPPSHHGFDEY--FATTSAVPT-W-DPTITPQDWDSWGNGPGEPWKGGFPY----VHN 197 Query: 252 VRGGEQQAIADITPKYMEDLDQR-WMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP-- 308 R ++ D D R MD + F++ A KPFF T H + P Sbjct: 198 GREAKENLSGD---------DSRVIMDRVIPFIE--ANQAKPFF---ATVWFHAPHEPVV 243 Query: 309 -----NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 Y + R +Y C+ M+ L L + G NT++ F SDNGP + Sbjct: 244 AGEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKLRELGIEKNTVVFFCSDNGPSDGL 303 Query: 364 PPHGRT---PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGH 419 G PF+G K + +EGG+ VP W G I S + D PT + G Sbjct: 304 AKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPAGTSTEVRCSTVDFLPTVASIVGD 363 Query: 420 PGAKVANLVPKTTF-IDGVDQTSFFLGTNGQSNR 452 ++V K T IDG+D G +R Sbjct: 364 ------SMVQKATRPIDGIDLMPLIRGEAKDRDR 391 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 122/445 (27%), Positives = 178/445 (40%), Gaps = 120/445 (26%) Query: 44 NQYLVKPA--TTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDV 101 N++L+ A T+A NM+ H A D ++PNVV ++DD GW DV Sbjct: 5 NKHLLTLAGGVTLAANML----HAASDN--------------RQPNVVFIMVDDYGWADV 46 Query: 102 GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATILTGQYSIHHGI------ 154 G+NG TP+ID +AS+G+I T Y+ S SSP+R +++TG+Y GI Sbjct: 47 GYNGSRFY---ETPNIDRLASEGMIFTDGYAAASISSPSRVSLMTGKYPARTGITDWIPG 103 Query: 155 ----LMPPMYGQPGGLQ---------GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 L P Q L T+ + + GY T +GKWH E+ PQ Sbjct: 104 YQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHVGKWHCAEDSLYYPQY 163 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD V++ + SP+ + +GG+ + Sbjct: 164 QGFD-----------------VNIGGWLKGSPN-------------GIRRSQGGKGAYCS 193 Query: 262 DITPKYMED------LDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------------ 303 Y+ D L R D +K + K + +DKPFFLY H Sbjct: 194 PYRNPYLPDGPEGEFLTDRLGDESIKLI-KNSSADKPFFLYLAFYAVHTPIEAKPEYVKY 252 Query: 304 -------------------FDNYPNAKY-AGSSPART-----SYGDCMVEMNDVFANLYK 338 + Y NA+Y AG RT Y + M++ + + Sbjct: 253 FKWKAQRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQ 312 Query: 339 TLEKNGQLDNTLIVFTSDNG--PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 L+ NG NT++ SDNG AE P P R KG +EGG+R P + + M++ Sbjct: 313 ALKDNGLDKNTIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVE 372 Query: 397 PRK-SDGIVDLADLFPTALDLAGHP 420 V D +PT LD+AG P Sbjct: 373 AGSVCHTPVVAVDFYPTLLDMAGLP 397 >UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTV0_9BACE Length = 459 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 108/363 (29%), Positives = 161/363 (44%), Gaps = 61/363 (16%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-S 136 +E K+PN V+ + DD+G+ DVG G TP+ID +A +G++ T +S S S Sbjct: 21 VEMMAQKQPNFVIIVADDMGYGDVGIYGNEYI---KTPNIDQIAREGMMFTDFHSNGSVS 77 Query: 137 SPTRATILTGQYSIHHG---ILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKW 189 SPTR +LTG+Y G +L+ P + + GL T ++L D GY T IGKW Sbjct: 78 SPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEV-GLPSEEITFAKILGDNGYRTALIGKW 136 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G ++ P N GF F GF S + V R+ Y + ++ Sbjct: 137 HLGYLQKHHPMNFGFQKFVGFKSGN--------------VDYQSHRNRYGDMDWWDGLEM 182 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG----------- 298 + G + ++ Y+++ DKPF LY Sbjct: 183 KDMSGYTTTLLTTLSEDYIKE-----------------NKDKPFCLYIAHAAPHSPMQGP 225 Query: 299 -TRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 + + P + Y D + E++ + +TL+K +NT +VF SDN Sbjct: 226 DEKAVRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFVVFFSDN 285 Query: 358 GPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTAL 414 GP V +G + ++GAKGS WEGG RVP Y G I+ + V DLFPT L Sbjct: 286 GP---VINNGGSAGGYKGAKGSPWEGGHRVPGICYMPGTIKEGTTCEQTVMSFDLFPTML 342 Query: 415 DLA 417 D+A Sbjct: 343 DMA 345 >UniRef50_Q024K7 Sulfatase n=28 Tax=Bacteria RepID=Q024K7_SOLUE Length = 504 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 120/422 (28%), Positives = 178/422 (42%), Gaps = 82/422 (19%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS-SPTRAT 142 K PN+V DD+G+ D G A TP++D A+ G+ T+A+S ++ +P+R + Sbjct: 24 KPPNIVYMYADDLGYGDTSCYG---ATRVKTPNLDRAAAAGIRFTNAHSSSATCTPSRYS 80 Query: 143 ILTGQYSIHH---GILM--PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE---- 193 +LTG+Y+ H G+L + QPG TLP +L GY T A+GKWH+G Sbjct: 81 LLTGEYAWRHQGTGVLPGDASLIVQPGRY----TLPAMLQQAGYRTGAVGKWHLGLGGRD 136 Query: 194 ---NKESQPQ--NVGFDDFRGFNSVSD----MYTEWRDV-HVNPEVALSPDRSEYIKQLP 243 N E +P VGFD F + D ++ E R V +++P P R Y K P Sbjct: 137 LDWNGEIRPGPLEVGFDYSFIFPATGDRVPCVFVENRKVVNLDPN---DPLRVRYDKPFP 193 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMED----------------LDQRWMD---------Y 278 G + + P + D RW+D Sbjct: 194 GEPT------GAANPELLKMKPSHGHDNTIVNGISRIGYMAGGKSARWVDEDMADTITGK 247 Query: 279 GVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYK 338 V FL++ +PFFLY+ T H P+ ++ G + GD + E++ + Sbjct: 248 AVSFLEQ--NRARPFFLYFATHDIHVPRVPHPRFVGKTDM-GPRGDAIAELDWSIGRILD 304 Query: 339 TLEKNGQLDNTLIVFTSDNGP-----------EAEVPPHGRTPFRGAKGSTWEGGVRVPT 387 TL++ NTL VF+SDNGP E H P RG K S ++GG R+P Sbjct: 305 TLDRLKLTRNTLFVFSSDNGPVVDDGYRDQAVERLGDHHPAGPLRGGKYSAYDGGTRIPF 364 Query: 388 FVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 V W G ++P S + DL + L G +P+T D + LG Sbjct: 365 VVRWPGTVKPGISAAPISQVDLLASFAALTGRK-------LPETAAPDSFNVLPALLGKT 417 Query: 448 GQ 449 Q Sbjct: 418 KQ 419 >UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN Length = 569 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 108/347 (31%), Positives = 160/347 (46%), Gaps = 50/347 (14%) Query: 87 NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTG 146 +++ L DD G+ DVG++G + TP +D +A++G+ L + Y QP +P+R+ +LTG Sbjct: 48 HIIFILTDDQGYHDVGYHGSDIE----TPTLDRLAAKGVKLENYYIQPICTPSRSQLLTG 103 Query: 147 QYSIH----HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQ 200 +Y IH H I+ P QP L TLPQ L + GY T +GKWH+G KE P Sbjct: 104 RYQIHTGLQHSIIRPQ---QPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPT 160 Query: 201 NVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G D YT D P V D+H GE A Sbjct: 161 RRGFDTFLGSLTGNVDYYTY--DNCDGPGVC---------------GFDLHE---GENVA 200 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSD--KPFFLYYGTRGCHF-----DNYPNAKY 312 ++ +Y M Y + +A +P FLY + H Y Sbjct: 201 WG-LSGQYST------MLYAQRASHILASHSPQRPLFLYVAFQAVHTPLQSPREYLYRYR 253 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 + AR Y + M++ N+ L++ G +N++I+F+SDNG + P R Sbjct: 254 TMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-FSGGSNWPLR 312 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 G KG+ WEGGVR FV+ + + R S ++ + D +PT + LAG Sbjct: 313 GRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAG 359 >UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLM9_BACSE Length = 463 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 120/476 (25%), Positives = 202/476 (42%), Gaps = 81/476 (17%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRA 141 G KPN++ L DD+G+ D+ G TP+ID +A+ G T Y+ SSP+R Sbjct: 32 GDKPNIIFILADDMGYCDLSCYGNKYI---ETPNIDRLAATGTAFTQCYAGSGISSPSRC 88 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGL---------------TTLPQLLHDQGYVTQAI 186 ++TG+ + + I + GG++GL TT+ +L GY T + Sbjct: 89 ALMTGKNTGNTTIR--DNFCIAGGIEGLKGTKTIRRMHLQPNDTTIATVLGAAGYRTCLV 146 Query: 187 GKWHM-GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 KWH+ G N E+ P N GFD+F G+ +S Y+ D + P + ++ E +K+ + Sbjct: 147 NKWHLDGFNPEATPLNRGFDEFYGW-LISTAYSN--DPYYYPYWRFNNEKLENVKE---N 200 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-- 303 + D H K+ DL + +KF+++ + PFFLY H Sbjct: 201 EGDKHI--------------KHNTDLST---EDAIKFINR--NKNNPFFLYLAYDAPHEP 241 Query: 304 --FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 D Y + M+ L L++ G +NTL++F SDNG Sbjct: 242 YNIDETTWYDDEAWDMNTKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAK 301 Query: 362 EVPPH---GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAG 418 + P + +G KG +EGG+RVP V G + +K + I+ D+ PT LAG Sbjct: 302 QAPLEELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVPVQKLNNIIYFPDVMPTLAALAG 361 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + +P+ ++G++ F G ++ + ++ GK A R ++K Sbjct: 362 -----ATDKLPQK--LNGINILPLFYGQQLDTDNRLLYWEFPGKQRAARCGDWK------ 408 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 TV + A ++N+ D ES ++ ++ + EM A Sbjct: 409 --------------VVTVKKDAPLELYNIKEDMTESVNLANKYPEKVAQFEKEMKA 450 >UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN Length = 599 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 106/365 (29%), Positives = 164/365 (44%), Gaps = 83/365 (22%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 +P+++ L DD G+ DVG++G + TP +D +A++G+ L + Y QP +P+R+ + Sbjct: 75 QPHLIFILADDQGFRDVGYHGSEIK----TPTLDKLAAEGVKLENYYVQPICTPSRSQFI 130 Query: 145 TGQYSIH----HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQ 198 TG+Y IH H I+ P QP L TLPQ L + GY T +GKWH+G KE Sbjct: 131 TGKYQIHTGLQHSIIRPT---QPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187 Query: 199 PQNVGFDDFRG-FNSVSDMYTEWR---------DVHVNPEVALSPDRSEYIKQLPFSKDD 248 P GFD F G D YT ++ D++ N A D Y Q+ + Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQR-- 245 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH----- 303 QQ +A P KP FLY + H Sbjct: 246 -------VQQILASHNPT------------------------KPIFLYIAYQAVHSPLQA 274 Query: 304 ----FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 F++Y + + R Y + +++ N+ L+ G +N++I+++SDNG Sbjct: 275 PGRYFEHYRSI----ININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGG 330 Query: 360 EAEVPPHGRT--PFRGAKGSTWEGGVRVPTFVYW-----KGMIQPRKSDGIVDLADLFPT 412 + P G + P RG+KG+ WEGG+R FV+ KG + +V + D +PT Sbjct: 331 Q---PTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTV----CKELVHITDWYPT 383 Query: 413 ALDLA 417 + LA Sbjct: 384 LISLA 388 >UniRef50_A6C4B6 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4B6_9PLAN Length = 515 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 146/520 (28%), Positives = 222/520 (42%), Gaps = 107/520 (20%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATI 143 +PNVV+ L DD+G+ DV G + PTP++D A Q L+ T A++ S P+R + Sbjct: 33 RPNVVIILADDMGYGDVTALNKGSRI--PTPNLDQFARQSLVFTDAHAAGSYCVPSRYGL 90 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQ---------GLTTLPQLLHDQGYVTQAIGKWHMGEN 194 LTG+Y + G G L G T+ L+ D GY T +GKWH G + Sbjct: 91 LTGRY------MWRTRLGSGGNLANFAGTLIEPGRRTIANLMQDAGYQTGLVGKWHQGID 144 Query: 195 KESQPQNV--------GFDDFRGFNSVSDMYTEWRD------------VHVNPEVALSPD 234 + + ++ + +F+ + + +D +NP + + Sbjct: 145 WKLRDESARVQIRVDPNYQNFKNIDFAAPALKGPKDYGFAYSFGTAGSAEMNPSTFIVNN 204 Query: 235 RSEYIKQLPF--SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS--D 290 R+ I L +K+ G + IA+ M+ L + +F++ +S D Sbjct: 205 RAAVIPTLTTAEAKEKFGEWYGRDDNIIAE--GYTMDRLVPTLSNKACEFVETAVRSKPD 262 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 +PFFLYY H PN ++ G S A T YGD +VE++ L + L+ G DNTL Sbjct: 263 QPFFLYYAMTTPHNPIVPNQEFVGKSQAGT-YGDFVVELDFHVGRLLQKLKDLGIADNTL 321 Query: 351 IVFTSDNGP--------------EAEVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMI 395 I FTSDNGP + ++ H T P G KG EGG RVP V W I Sbjct: 322 IFFTSDNGPVDRTRGYPQRWVRGDTQIYGHDSTGPCSGWKGGLEEGGHRVPFIVRWAAKI 381 Query: 396 QP-RKSDGIVDLADLFPTALDLAGHPGAKVANL-VPKTTFIDGVDQTSFFLGTNGQS--- 450 +P + + D+ PT A++ N+ + T DGV SF+ G S Sbjct: 382 KPGEECATTIVFNDVLPTL--------AEMLNVKLDSNTAEDGV---SFYPALTGASRPV 430 Query: 451 --NRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSV---- 504 ++ H NG AVR FK ++I+ P TV + + V Sbjct: 431 SFHKAIIHNHHNGHF-AVRQGAFK--LIIKGP-------------KTVEEVLDARVPVKY 474 Query: 505 --FNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 ++L D ES + +H P V +MHA +LK+Y Sbjct: 475 QLYDLDKDIAESTDVSAKH-PEKV---KQMHA---LLKQY 507 >UniRef50_UPI0000DB708B PREDICTED: similar to CG7402-PA isoform 2 n=2 Tax=Apocrita RepID=UPI0000DB708B Length = 609 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 110/382 (28%), Positives = 169/382 (44%), Gaps = 50/382 (13%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILT 145 P+++VF+ DD+GW DVGF+G PTP+IDA+A G+IL Y PSS+P+R T Sbjct: 31 PHIIVFMADDLGWNDVGFHGSNQI---PTPNIDALAYNGIILNRHYVLPSSTPSRIAFFT 87 Query: 146 GQYSIHHGILMPPMY-GQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202 G Y I G+ + G+P GL + LP+ L GYVT+ IGKWHMG + P + Sbjct: 88 GLYPIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHR 147 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD 262 GFD F GF + Y ++ EY Q + D+H G+ A Sbjct: 148 GFDTFFGFYNSHITYYDY----------------EYSNQ-NMTGYDMHC---GDDPAYG- 186 Query: 263 ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF------DNYPNAKYAGSS 316 + +Y DL + + +K ++ + +P +L H D+ + Sbjct: 187 MKREYATDL---FTNEAIKIIEN-HELPRPLYLQISHLAVHAPIEQPDDSSRDEIVQIRE 242 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA--EVPPHGRT-PFRG 373 P R Y + ++++ + L + G L ++LI+F +DNG + +G P RG Sbjct: 243 PNRRKYAKMVSKLDESVGRVVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPLRG 302 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 K + +EGGVR W ++ R ++ + D PT AG + Sbjct: 303 TKYTLYEGGVR-GVAALWSSRLEKGARVFKKLIHITDWLPTLYSAAGGDLKDLGK----- 356 Query: 432 TFIDGVDQTSFFLGTNGQSNRK 453 IDG+DQ G K Sbjct: 357 --IDGIDQWRVLSEGQGHGREK 376 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 126 bits (316), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 100/354 (28%), Positives = 158/354 (44%), Gaps = 58/354 (16%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + +PNVV+ + DD G+ D GF G V TP+IDA+A++ +LT + P+ SPTR Sbjct: 27 RAADRPNVVMVITDDQGYGDCGFTGNKVV---QTPNIDALAAESSVLTDYHVAPTCSPTR 83 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 + ++TG ++ G+ + G+ T ++ D GY T GKWH+G+N + + Sbjct: 84 SALMTGHWTNRTGVWH-TISGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHLGDNYPYRAE 142 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GF +++Y H V +PD + F H G +A Sbjct: 143 DNGF---------TEVYR-----HGGGGVGQTPD---FWDNAYFDGSYFH--NGKAVKAE 183 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART 320 T + ++ G +F+ + ++D+PFF Y T H G A Sbjct: 184 GFCTDVFFKE--------GNRFIRECVEADEPFFAYIATNAPH----------GPLHAPQ 225 Query: 321 SYGDCMVEMNDVFANLY--------------KTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 Y D EMND A + K L + G DNT+ +FT+DNG + Sbjct: 226 KYIDMYPEMNDNVATFFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTDNGTAGGASVY 285 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYW--KGMIQPRKSDGIVDLADLFPTALDLAG 418 RG KGS +EGG RVP +++ G + R ++ + D+ PT LD+ G Sbjct: 286 -NAGMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVPTLLDMCG 338 >UniRef50_A7SK50 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7SK50_NEMVE Length = 630 Score = 126 bits (316), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 139/548 (25%), Positives = 217/548 (39%), Gaps = 128/548 (23%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 KKPN+++F++DD+G+ D+G G TP ID +A++G LT + S +P+RA Sbjct: 27 KKPNILIFIVDDLGYADLGCFGNDSLA---TPHIDKIATEGAKLTHNLAGESICTPSRAA 83 Query: 143 ILTGQYSIHHGILMPPMYG------------QPGGL-QGLTTLPQLLHDQGYVTQ----- 184 +LTG+Y + G L+P G GGL + TT + L + GY T Sbjct: 84 LLTGRYPVRTG-LVPSRGGLSEYIRVIIFTANSGGLPRNETTFAKALLETGYSTGPLLNY 142 Query: 185 -------------------------------------AIGKWHMGENKES------QPQN 201 +GKWH+G ++++ P N Sbjct: 143 SSWFCLKSSPGPYFLAAILNLALFWRVYDPCIKVTPGLVGKWHLGLSRDTVDDFHYHPLN 202 Query: 202 VGFDDFRGF-----------NSVSD-MYTEWRD--VHVNPEVALSPDRSEYIKQLP---- 243 GF F G +V D ++ WR VAL+ S ++ ++ Sbjct: 203 HGFQYFYGLPLTNLRNCDSGGAVLDHIFPNWRMQMAVFMIIVALTGFTSYFLNKISTLSF 262 Query: 244 --------------FSKDDVHAVRGGEQQAIADIT--PKYMEDLDQRWMDYGVKFLDKMA 287 F D V+ D+ P +++L R+ D ++F+ + Sbjct: 263 MVLLIAPLILVYPVFLVDIVYQNFNCILMRNFDLVEQPVILDNLTARFTDESIRFM--VT 320 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 D PF L H + + G S + YGD + EM+ + LE+ G + Sbjct: 321 HKDDPFLLVVSYAKVHTALFTTDYFKGHS-GHSPYGDNVEEMDWSVGQIMDALEELGVKN 379 Query: 348 NTLIVFTSDNGPEAEVPPHG-------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RK 399 NT + FTSDNGP E + ++G KG +WEGG+RVPT V W IQP + Sbjct: 380 NTFVYFTSDNGPHLEEVARNSEYEGGWKGIYKGGKGQSWEGGIRVPTLVSWPSHIQPGIE 439 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 D + DLFPT LD+AG +P IDG + N S K +++ Sbjct: 440 IDEPTNGIDLFPTVLDIAGAS-------MPNDRIIDGRNLIPLLTQQNKYSPHKFMYHYC 492 Query: 460 NGKLAAVRM------DEFKYHVLIQQPYAYTQSGY----QGGFTGTVMQTAGSSVFNLYT 509 + AVR +K H + + T++ + G + + +F + Sbjct: 493 AKAVHAVRYRPRSGHTTWKAHFMTPKYTPGTEACFGYAVCGCYESQTITHNPPLLFEITA 552 Query: 510 DPQESDSI 517 DP ES I Sbjct: 553 DPSESTPI 560 >UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9LGQ4_9BACT Length = 608 Score = 126 bits (316), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 137/493 (27%), Positives = 202/493 (40%), Gaps = 121/493 (24%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PNV+VFL DD GW D G TP+ID++A+QGL+ + + P SPTRA Sbjct: 43 QRPNVIVFLSDDQGWGDFSCTGNQSVA---TPNIDSLATQGLLFENFFVCPVCSPTRAEF 99 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGL-------TTLPQLLHDQGYVTQAIGKWHMGENKE 196 LTG+Y P G QG TT+ L GY T A GKWH G Sbjct: 100 LTGRYH--------PQSNVKGVSQGQERIDLDETTIADCLSQAGYATAAFGKWHNGMQYP 151 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFDDF GF S W + + NP + +K + DD Sbjct: 152 YHPCGRGFDDFYGFCS-----GHWGN-YFNPTLE---HNGRIVKGEGYINDD-------- 194 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-DNYPNAKYAGS 315 + K++ED +PFFLY H+ P+A + Sbjct: 195 ---FTNRALKFIED-----------------HKSQPFFLYLPYNTPHWPPQMPDAYWQRF 234 Query: 316 SP---------------ARTSYGDCMVEMNDVFANLYKTLEKNGQL---DNTLIVFTSDN 357 + A+T MVE ++ N+ + L K +L DNT++++ +DN Sbjct: 235 AEKEIVQRGQKGDKEDLAKTRSALAMVE--NIDWNVGRVLAKLDELKIADNTIVIYFNDN 292 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALD 415 GP + G +G KGST EGGVR P FV W ++ R+ + I DL+PT L Sbjct: 293 GPNSNRWNAG---MKGKKGSTDEGGVRSPLFVRWPNGVKGAGRRVNQICGAIDLYPTLLA 349 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 G AN+ K +DG + + G+ + + GK A+VR +F+ Sbjct: 350 ATGS-----ANVGDK--ILDGKNLLPIWDGSETNLGFRMLFSYWRGK-ASVRTQQFRLD- 400 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE-----SDSIGVRHIPMG--VPL 528 +G+ +F++ TDP + SD V + +G + Sbjct: 401 ---------NNGW---------------LFDMLTDPHQTKDISSDQPAVAALLLGSLIRF 436 Query: 529 QTEMHAYMEILKK 541 + EM A M+ K+ Sbjct: 437 KQEMEAEMDSTKR 449 >UniRef50_UPI000186D20A arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186D20A Length = 532 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 109/395 (27%), Positives = 180/395 (45%), Gaps = 67/395 (16%) Query: 75 LAELEKKTGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 LA + K T ++P +++ L+DD+GW D GF+G TP++DA+A G+IL Y Sbjct: 16 LAVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQI---KTPNMDALAYSGMILNRHYVL 72 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKW 189 PS +P+R+ +LTG Y I G+ P+ G G ++ L P+ L + GY T +GKW Sbjct: 73 PSCTPSRSALLTGLYPIRTGMQGMPLKG--GDVRNLPLSFKLKPEFLKNLGYRTHLVGKW 130 Query: 190 HMGENKESQ-PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 H+G + P GFD F G+ + Y ++ N VA ++ EY F D Sbjct: 131 HLGYRTINHLPNQRGFDSFFGY---YNGYVDYFKFGHNQTVA--GEKIEY-----FYGYD 180 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS---DKPFFLYYGTRGCHFD 305 +H R GE Y D D + +K+ K+ +P +LY+ H Sbjct: 181 LH--RNGEI---------YQTDKDTYATRLFTREAEKIIKNHNESEPLYLYFSHLATHTG 229 Query: 306 N----------------YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 + Y + K+ G R ++ C+ E++ + + L++ LDN+ Sbjct: 230 DDDIGMEVPEDADVNKTYGHIKHYG----RRAFAGCLEELDKSVGEVMEALKEKNMLDNS 285 Query: 350 LIVFTSDNG---PEAEVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIV 404 +I+ SDNG ++PP+ + P G K + +EGGVR ++ + + +D + Sbjct: 286 IILIMSDNGGHTVSVDLPPNWSSNWPLGGTKFTLFEGGVRSVALIWSPLLPKGVINDDFI 345 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 + D PT AG ++ DG+DQ Sbjct: 346 HITDWLPTLYSAAGGNPKELG-------IFDGIDQ 373 >UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CGJ8_9PLAN Length = 520 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 138/514 (26%), Positives = 207/514 (40%), Gaps = 103/514 (20%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS--SPTRA 141 K+ N+V L DD+G+ DV + TP ID +A++G+ T A++ PS+ +PTR Sbjct: 31 KQSNIVYILADDLGYGDVSCYNPESKIK--TPHIDRLAAEGMKFTDAHT-PSAVCTPTRY 87 Query: 142 TILTGQYSIHHGILMPPMYG--QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG---ENKE 196 ILTG+Y + + G P Q T+P LL GY T IGKWH+G +K Sbjct: 88 GILTGRYCWRTRLKYRVLDGFDPPLIEQDQVTVPSLLKKAGYDTACIGKWHLGMQWTDKN 147 Query: 197 SQP----------------------------QNVGFDDFRGFNSVSDMYTEWRDVHVNPE 228 QP GFD + G ++ +M +P Sbjct: 148 GQPVPAVPIDRRQRPRVGDDVDYTKPILGGPLTSGFDYYFGISASLNM---------SPF 198 Query: 229 VALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK 288 + DR + +P + + + D T + + VK++++ K Sbjct: 199 CFIRNDRPVILPTIPSERIQTEFLSVDQGMRSPDFT---IRSVMPTLTGEAVKYIERHGK 255 Query: 289 S--DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 ++PFFLY+ H PN ++ G S A YGD ++E++ + L++ G Sbjct: 256 ESPERPFFLYFPLTAPHLPLVPNDEFKGKS-AAGEYGDFVLEVDATVGAIMDALQRTGVA 314 Query: 347 DNTLIVFTSDNG-------PEAE------VPPH-----------GRTPFRGAKGSTWEGG 382 +NTL++FTSDNG P+ P H G RG K WEGG Sbjct: 315 ENTLVIFTSDNGGLYHWWTPQETDDLKHYKPNHRGQYVKDRGHQGNAHLRGTKADIWEGG 374 Query: 383 VRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS 441 RVP V W G + D +V+L DL T A +P D V+ Sbjct: 375 HRVPFIVRWPGKTPADSTNDELVELTDLLATC-------AAITDTKLPDGDAQDSVNILP 427 Query: 442 FFLGTNGQSNRK--AEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT--VM 497 LG + + A H+ L G +VR Q P+ GGFT V Sbjct: 428 ALLGKKSDTPLREYAIHHSLWGHF-SVR----------QGPWKMIPKRGSGGFTRAREVE 476 Query: 498 QTAGS---SVFNLYTDPQESDSIGVRHIPMGVPL 528 AG ++NL DP E+ ++ + H + PL Sbjct: 477 PAAGEPTGQLYNLKQDPSETKNVWLEHPEVVKPL 510 >UniRef50_A6BYP9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYP9_9PLAN Length = 442 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 107/362 (29%), Positives = 156/362 (43%), Gaps = 59/362 (16%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPN+++ L DDVG +G GG PTP IDA+A G+ AY+ P PTR ++ Sbjct: 28 KPNILLILADDVGSDAIGCYGG---RSYPTPHIDALAKGGMKFNHAYAMPVCHPTRVCLM 84 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH--MGENKESQPQNV 202 TG+Y G P +G+ T+ + GY T GKW M +N + P V Sbjct: 85 TGRYPFRFGKAGSKWGDFPRDAEGI-TIGNRMQQAGYATAVTGKWQLCMMKNDKQHPSRV 143 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD 262 GFD++ F + E H I Q +DD ++ G E A Sbjct: 144 GFDEWCLFG-----WHEGGRYH-----------HPLIYQNGSLRDDASSLYGPEVYA--- 184 Query: 263 ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--FDNYPNAKYAGSSPAR- 319 D+ + F+ + + KPFF YY CH D+ + A R Sbjct: 185 --------------DFLIDFMKRSHNAGKPFFAYYPMALCHDVTDDLKDEHVAYYKYGRW 230 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE----VPPHGRTP----- 370 +YG+ + M+D+ + +L + G DNTLI+FT+DNG A V +G+ Sbjct: 231 MTYGEMIASMDDMVGKVVASLNEMGVRDNTLIIFTTDNGTPAASYLTVNENGKMVRPKVV 290 Query: 371 -------FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGA 422 G KG + G RVP W G I+ + D +VD++D PT ++AG A Sbjct: 291 SVQNGKIVPGGKGKLDDTGTRVPLIANWPGHIKAGTEVDDMVDMSDYLPTVAEIAGLKEA 350 Query: 423 KV 424 V Sbjct: 351 DV 352 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 120/440 (27%), Positives = 180/440 (40%), Gaps = 103/440 (23%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPT 139 + ++PN+++ DD G DVG G + PTP ID +A +GL+ YS + +P+ Sbjct: 23 QAAERPNILIIFTDDQGINDVGCYGSEI----PTPHIDQLAKEGLLFRQYYSASAICTPS 78 Query: 140 RATILTG------QYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG 192 R ILTG Q + ++ Q G+Q G TT+ +L GY T +GKWH+G Sbjct: 79 RFGILTGRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLG 138 Query: 193 ENKES-QPQNVGFDDFRG------------FNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 ES P GFD FRG + ++ D Y R V N Sbjct: 139 HGTESFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSEN------------- 185 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 Y DL ++ FL +DKPFFL+ Sbjct: 186 --------------------------GYATDLITEEAEH---FLKDQQTTDKPFFLFLSY 216 Query: 300 RGCHF-------DNYP---------NAKYAGS--SPARTSYGDCMVEMNDVFANLYKTLE 341 HF D P + K G+ R + V ++D + +L+ Sbjct: 217 NAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLK 276 Query: 342 KNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD 401 NG NTL++F +D+G + V PFRGAK + +EGG+RVP + W G I+ Sbjct: 277 NNGLDQNTLVIFMTDHGGDY-VYGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTET 335 Query: 402 GIVDLA-DLFPTALDLAG-------HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 V A DLFPT A G ++ L+ + T + G + + LG + + R Sbjct: 336 NEVAWALDLFPTICHFANVDTDGLTLDGKDISGLLTRQTPV-GTRELYWQLGPHAELKR- 393 Query: 454 AEHYFLNGKLAAVRMDEFKY 473 G+ +A+R ++KY Sbjct: 394 -------GRWSALRQGDWKY 406 >UniRef50_Q16DZ0 Sulfatase, putative n=8 Tax=Proteobacteria RepID=Q16DZ0_ROSDO Length = 416 Score = 125 bits (315), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 112/403 (27%), Positives = 172/403 (42%), Gaps = 38/403 (9%) Query: 171 TLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGF----NSVSDMYTEWRDVHVN 226 TL ++ GY T GK H+G+ + P GFD++ + N++ YT D + Sbjct: 9 TLANMMKSLGYTTGQFGKNHLGDQNQFLPTTKGFDEYWVWLYHLNAME--YTSDPDWSDD 66 Query: 227 PEVALSPDRSEYIKQLPFSK-DDVHAVRGG--EQQAIAD---ITPKYMEDLDQRWMDYGV 280 PE I + DD R G Q I D P+ + LD + Sbjct: 67 PEFEAQYGPRNVIHAFATEEMDDTVDPRWGLVGNQRIVDDGPAPPERQKTLDDEVTARTL 126 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY-AGSSPARTSYGDCMVEMNDVFANLYKT 339 F+D+ +S+ PFF++ H + + +Y A R M E++D + Sbjct: 127 DFIDRAVESETPFFVWMAPARAHVWTHLSPEYEAMLGNGRGLQDVVMKELDDNVGKVLAR 186 Query: 340 LEKNGQLDNTLIVFTSDNGPEAEV-PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QP 397 LE DNT++VFTSDNGPE P G TPF G KG+TWEGG RVP + W G + + Sbjct: 187 LEALNISDNTIVVFTSDNGPETMTWPDGGTTPFYGEKGTTWEGGFRVPAIIKWPGKVPEG 246 Query: 398 RKSDGIVDLADLFPTALDLAGHPG----AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 R ++GI D PT + AG P + +D +Q + LG +S R Sbjct: 247 RVANGIFSGMDWMPTLVAAAGGPDNLPEVMLEGYEGYNVHLDSYNQLPYLLGGE-ESQRD 305 Query: 454 AEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 Y+ L A+ ++K H ++Q G+ G + +FNL DP E Sbjct: 306 EIVYYEGTSLQAIHYRDWKAHFVVQ----------HHGWAGPKDELNAPLLFNLRRDPYE 355 Query: 514 --SDSIGVRHIPMGV------PLQTEMHAYMEILKKYPPRAQI 548 ++ G+ I MG P + +++ + +PPR + Sbjct: 356 KAAEESGMYTIWMGKKMWAFGPAARLVQGHLQSFQAFPPRGAV 398 >UniRef50_C2G0L0 Possible Cerebroside-sulfatase n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0L0_9SPHI Length = 505 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 132/497 (26%), Positives = 217/497 (43%), Gaps = 79/497 (15%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 +++ + +KPNV++ +DD+G+ D+ GG TP +D +A+ G+ T+A++ S+ Sbjct: 21 QIQAQDKQKPNVLMIYVDDLGYGDLSIYGGQDI---ETPHLDELATSGIRFTNAHAAAST 77 Query: 137 -SPTRATILTGQ--YSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHM 191 +P+R ++TG Y ++P G + Q TLP++ H QGY T +GKWH+ Sbjct: 78 CTPSRYALMTGNNPYRAKGTGILP---GDAALIIPQDKITLPKVFHQQGYTTGIVGKWHL 134 Query: 192 GENKESQ----------PQNVGFDDFRGFNSVSDMY-TEWRDVH-VNPEVALSPDRSEYI 239 G ++ + P VG+D F + +D T + + H V A P + Y Sbjct: 135 GLGEQVEKDWNGKIAPGPLEVGYDYSFIFPATADRVPTVFLENHYVLAADAKDPIQVNYR 194 Query: 240 KQL---PFSKDD-----VHAVRG-GEQQAIADITPK--YMED-LDQRWMD--YGVKFLDK 285 +++ P K++ +HA G G I + + +M D RW D + F +K Sbjct: 195 QKIGNEPTGKENPELLKLHASPGQGHDNTIVNGIGRIGWMTGGKDARWADEELTLTFFEK 254 Query: 286 MAK-----SDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 + KPFFL Y H P + G S GD +++++ L + L Sbjct: 255 AKEFIKTNQKKPFFLCYNATEPHVPRMPATLFKGKSKLGLR-GDAILQLDYTVGQLVQEL 313 Query: 341 EKNGQLDNTLIVFTSDNGP-----------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFV 389 + NG +NT+I+FTSDNGP E +RG K S +E G RVP V Sbjct: 314 KNNGLYENTIIIFTSDNGPVLDDGYADQAVEKSANHDAFGGWRGGKYSAFEAGSRVPFLV 373 Query: 390 YWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 W +I+ ++SD ++ DL + AG G PK +D +Q +GT Sbjct: 374 SWPAVIKGGQQSDALIGQVDLLAS---FAGQLGVS----YPKDQAVDSQNQWKTLIGT-- 424 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS----SV 504 ++K Y + +IQ Y Y + V G+ + Sbjct: 425 --DKKGRTYLVKSS---------GTFSIIQGDYKYIKPRKGAKIDKAVNIELGNDEQPQL 473 Query: 505 FNLYTDPQESDSIGVRH 521 +NL TD E ++I +H Sbjct: 474 YNLRTDKAEKENIAAKH 490 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 120/420 (28%), Positives = 185/420 (44%), Gaps = 54/420 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 LE +KPN++ + DD+G+ ++G G TP ID +A++G+ T AY+ Sbjct: 7 VRLEATEKQKPNIIFIMADDLGYAELGCYGQKKI---KTPHIDKLAAEGMKFTQAYAGSM 63 Query: 135 SSSPTRATILTGQYSIHHGI----LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 P+R+ ++TGQ++ H + L +Y + TT+ ++L GY T A GKW Sbjct: 64 VCQPSRSVLMTGQHTGHTAVRANDLNQLLYEED------TTVAEVLKIAGYATGAFGKWG 117 Query: 191 MG-ENKESQPQNVGFDDFRG--FNSVSDMYTE---WRDVH--VNPEVALSPDRSEYIKQL 242 +G E +P GFDDF G + Y W + H + PE + R YI L Sbjct: 118 LGYEGTPGRPGQQGFDDFTGQLLQVHAHFYYPFWIWNNEHRLMLPENE-NNQRGRYIHDL 176 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 +D ++ + Q P + ++ L +S+KP+ + + Sbjct: 177 -IHEDAKAFIQKNKAQPFFAYLPYIIPHVE----------LVVPEESEKPYRGQFPKKQI 225 Query: 303 HFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 D P Y GS T++ + ++D + LE G DNTLI+FTSDNG + Sbjct: 226 -LD--PRPGYIGSEDGLTTFAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSDNGGQGG 282 Query: 363 VPP------HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALD 415 +G P RG KGS +EGG+RVP W G I K SD + D+ PT Sbjct: 283 TWKEMTDFFNGNAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWDVLPTLAQ 342 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-FLNGKL--AAVRMDEFK 472 +AG VP IDG+ LG Q + ++ + GK+ A+R +K Sbjct: 343 VAG-------TTVPSGVDIDGISFLPTLLGKGKQPEHEYLYWEYTRGKIRSRAIRQGNWK 395 >UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7AKS6_9PORP Length = 464 Score = 125 bits (314), Expect = 4e-27, Method: Compositional matrix adjust. Identities = 102/359 (28%), Positives = 157/359 (43%), Gaps = 62/359 (17%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SS 137 + + ++PN+++ L DD G+ D GF G A TP+ID +A++G I T A+ + SS Sbjct: 27 QDEEAQRPNILILLADDAGYADFGFMG---ATDIQTPNIDRLAAEGCIFTDAHVAATVSS 83 Query: 138 PTRATILTGQYSIHHGI---LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 P+R+ +LTG+Y +G L P G P + LP LL Y T IGKWH+G Sbjct: 84 PSRSMMLTGRYGQRYGYECNLDKPGDGLPDDEE---LLPALLKRYDYRTGCIGKWHLGSE 140 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 +P GFD F G + R +PE + D+ ++Q ++ Sbjct: 141 PSQRPNAKGFDTFYG------LLAGHRSYFYDPETS---DKDGNLQQYQYNG-------- 183 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA--------KSDKPFFLYYGTRGCHFDN 306 R + + F D++A +S++PF LY H N Sbjct: 184 -------------------RKLSFDGYFTDELASKAQQFVTESEQPFMLYMSFTAPHSPN 224 Query: 307 YPN----AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 A++ G R Y M ++ + L+ G+ DNT+I F SDNG + Sbjct: 225 EATEEDLARFEGQ--PRQKYAAMMYALDRGVGKIVDELKAAGKFDNTIIFFLSDNGG-ST 281 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHP 420 P +G KG+ +EGG RVP FV W + ++ G+ D+F T +D P Sbjct: 282 TNQSSNLPLKGFKGNKFEGGQRVPFFVVWGDRFKRDQRFTGLTSSLDIFATVVDALDIP 340 >UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacteroidetes RepID=C6I6Z4_9BACE Length = 504 Score = 125 bits (313), Expect = 5e-27, Method: Compositional matrix adjust. Identities = 130/453 (28%), Positives = 191/453 (42%), Gaps = 95/453 (20%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 +PNV+ ++DD+G+ D+G G + TP+ID + G+ T Y+ P S+P R + Sbjct: 25 RPNVIYIIMDDLGYGDIGCYG---SEKIETPNIDRLYKDGISFTQHYTGSPVSAPARCVL 81 Query: 144 LTGQYSIH-----------HGILM--PPMYGQPGGLQG-------LTTLPQLLHDQGYVT 183 +TG +S H G +M MY PG L+G TL +++ GYVT Sbjct: 82 MTGMHSGHAQIRANDEMAYRGAIMNYDSMYVHPG-LEGQYPLKAHTMTLGRMMQQAGYVT 140 Query: 184 QAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 GKW +G E P GFD F G+N ++ + P + Y+ Sbjct: 141 GCFGKWGLGAPGTEGTPNKQGFDSFYGYNCQRQAHSYY------PAFLYKNEDRVYLANK 194 Query: 243 PFSKDDVHAVRGGE---QQAIADITPK-YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 G + + A A + K Y DL D + F+ + K KPFFL + Sbjct: 195 VLDPHTTKLDAGADPRDEAAYAKFSQKEYANDLI---FDELISFVGQNRK--KPFFLMWT 249 Query: 299 TRGCHF-----------------DNYPNAKYAGSSPAR---TSYGDCMVEMNDVFANLYK 338 T H D P AG P R +Y + ++ L + Sbjct: 250 TPLPHVSLQAPEKWVKYYVGKFGDEAPYIGKAGYMPCRYPHATYAAMISYFDEQIGKLIE 309 Query: 339 TLEKNGQLDNTLIVFTSDNGPE----AEVPPHGR-TPFRGAKGSTW------EGGVRVPT 387 L+K DNT+I+FTSDNGP ++ P PFR G W EGG+R+P Sbjct: 310 KLKKERLYDNTVIMFTSDNGPTFNGGSDSPWFDSGGPFRSEYG--WGKCFVHEGGIRIPA 367 Query: 388 FVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT 446 V W G I+P +SD I D+ PT D +AN+ T DG+ SF Sbjct: 368 IVTWPGKIKPSTQSDHICGFQDVMPTLAD--------IANIACPET--DGI---SFLPAL 414 Query: 447 NGQSNRKAEHYFLNGK-------LAAVRMDEFK 472 G++ R+ EH +L + L A+RM ++K Sbjct: 415 LGETERQKEHEYLYWEYPDPTIGLKAIRMGKWK 447 >UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4V9_9PLAN Length = 480 Score = 125 bits (313), Expect = 5e-27, Method: Compositional matrix adjust. Identities = 116/402 (28%), Positives = 174/402 (43%), Gaps = 73/402 (18%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP---TPDIDAVASQGLILTSAYSQPS 135 E+ G +PN++V ++DD+G+ V GNP TP+ID +A++G+ T +S + Sbjct: 30 ERPPGDRPNLIVIMVDDMGYAGVS------CFGNPYFKTPEIDRLAAEGMKFTDFHSSGT 83 Query: 136 -SSPTRATILTGQYSIHHGI--LMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGK 188 SPTRA +LTG+Y GI ++ P+ P +GL T +LL GY T IGK Sbjct: 84 VCSPTRAGLLTGRYQQRAGIEAVIHPVSDHPEHQKGLRKSENTFAELLKQAGYRTALIGK 143 Query: 189 WHMG---ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 WH G + E P N GFD F G++S + + HV V Sbjct: 144 WHQGYPHNSAEFHPDNHGFDTFVGYHSGNIDFIS----HVGDHV---------------- 183 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-- 303 K D R Q+ Y L + Y ++F+ + ++PF LY H Sbjct: 184 KHDWWHGRKETQET------GYSTHLINQ---YALQFIKE--SRNQPFCLYLAHEAIHNP 232 Query: 304 ------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 + K A + + + ++ + + L K+G NT + Sbjct: 233 VQVPGDPIRRTEAAGWKRWKPASEAERIEKFRGMTLPVDAGVGQIREFLVKSGLDKNTFV 292 Query: 352 VFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA-DLF 410 +F SDNGP + P G +RGAKGS +EGG RVP +W G IQ + ++ D+ Sbjct: 293 LFFSDNGPSRDF-PSGSPKWRGAKGSVYEGGHRVPAIAWWPGKIQAGTETDVPAISLDVM 351 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 PT L +A +PK +DGVD + S R Sbjct: 352 PTLLGIAHID-------MPKERPLDGVDLSPVLFEQKPLSER 386 >UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica RepID=Q7UYW3_RHOBA Length = 520 Score = 124 bits (312), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 108/356 (30%), Positives = 158/356 (44%), Gaps = 62/356 (17%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATIL 144 PN+VV L DD+G+ D+G G TP++D +A G++ + AY + SP+RA +L Sbjct: 56 PNIVVILADDMGYGDMGCMGSQTLQ---TPNLDRLAESGVLCSQAYVASAVCSPSRAGLL 112 Query: 145 TGQYSIHHGI-----LMPPMYGQPGGLQGLTTLPQLLHDQ----GYVTQAIGKWHMGENK 195 T + G Y L GL T + L D GY T IGKWH+G + Sbjct: 113 TSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMGE 172 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV-RG 254 P GFD F G + S Y F H + R Sbjct: 173 MHHPNRRGFDHFCGMLTGSHHY--------------------------FPATMKHVIERN 206 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLD--KMAKSDKPFFLYYGTRGCHFDNYPN--- 309 G++ + D + +Y+ D + D G++F+D K A D+P+F+++ H + Sbjct: 207 GKR--VDDFSSEYLTDF---FTDEGLRFIDQHKSANPDQPWFVFFSYNAPHTPMHATEAD 261 Query: 310 -AKYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 A++A + R +Y M ++ + + LE+ GQ +NTL+VF SDNG A Sbjct: 262 LARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG-GATNNGSW 320 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYW-----KGMIQPRKSDGIVDLADLFPTALDLAG 418 P RG KGS EGG+RVP W G++ DG+V DL PT AG Sbjct: 321 NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLY----DGVVSSLDLLPTFCSAAG 372 >UniRef50_Q2LZ24 GA16747 n=5 Tax=Drosophila RepID=Q2LZ24_DROPS Length = 575 Score = 124 bits (312), Expect = 7e-27, Method: Compositional matrix adjust. Identities = 110/395 (27%), Positives = 170/395 (43%), Gaps = 58/395 (14%) Query: 66 AQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGL 125 A+ ET A +T +PN+++ L DD+G+ DV F GG + TP+IDA+A G Sbjct: 17 AKTDETPASAASETAETAGRPNIIIILADDMGFDDVSFRGGREFL---TPNIDALAFHGR 73 Query: 126 ILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVT 183 IL Y+ +P+R +L+G+Y IH G + +P GL T +P++ GY T Sbjct: 74 ILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYST 133 Query: 184 QAIGKWHMGENK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 IGKWH+G ++ E P GFD G+ Y + R +L D + + + Sbjct: 134 NLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYIDYYQRRSKMPARNYSLGYD---FRRNM 190 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 D RG Y+ DL + + + + ++P FL Sbjct: 191 ELECRD----RG-----------VYVTDL---LTNEAERVIREREGQEEPLFLVLSHLAT 232 Query: 303 HFDNYPN---------AKYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 H N + K+A P R Y + +++ + L GQL+N++++ Sbjct: 233 HTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGRIVSALNSTGQLENSIVI 292 Query: 353 FTSDNGPEAEVPPHG-------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS--DGI 403 F SDNG P G P RG K + WEGGVRV + W +Q R + Sbjct: 293 FYSDNG----APSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAI-WSAQLQARGNIFTQP 347 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 + +AD P+ AG +P T +DG+D Sbjct: 348 IYVADWLPSLAHAAGIE-------LPHTLELDGID 375 >UniRef50_Q15SA2 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15SA2_PSEA6 Length = 724 Score = 124 bits (311), Expect = 9e-27, Method: Compositional matrix adjust. Identities = 133/510 (26%), Positives = 202/510 (39%), Gaps = 116/510 (22%) Query: 80 KKTGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 K ++P N+V+ L DD+GW D + TP+I+A+A++G+I + AY+ P S Sbjct: 17 KAVDEQPKNIVIILADDLGWNDTTIFQPSQFL--ETPNINALAAKGMIFSQAYANSPLCS 74 Query: 138 PTRATILTGQYSIHHGI-----------LMPPMYGQPGGLQ-------------GLTTLP 173 PTRAT+LTGQ HG LMP + + L TL Sbjct: 75 PTRATLLTGQTPARHGSTAPSHHLPVVRLMPSLPASAATNRKSIAPQTVTRLDTALPTLS 134 Query: 174 QLLHDQGYVTQAIGKWHMGENKESQPQNVGFD----DFRGFNSVSDMYTEWRDVHVNPEV 229 + GY T GKWH+G + S P GFD +F+G W Sbjct: 135 SIAKANGYHTAHFGKWHLGAHPYS-PSEHGFDIDIPNFQGAGPTGGYLAPW--------- 184 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYM-EDLDQRWMDYGVKFLDKMAK 288 + +PD I P+ E +D R K++ + K Sbjct: 185 SFAPD----------------------------IQPQIAGEHIDIRLAKEAKKWIFSV-K 215 Query: 289 SDKPFFLYYGTRGCH---------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKT 339 D PFFL + H D + N + S +Y + + +D L++ Sbjct: 216 DDGPFFLNFWAFSVHAPFNADADEIDYFINKRSGFHSQRNATYAAMVKQFDDAIGVLWQA 275 Query: 340 LEKNGQLDNTLIVFTSDNGPE-----AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 L + NT+I+FTSDNG P +G K + +EGG++VPT V W G+ Sbjct: 276 LVEAKVEKNTIIIFTSDNGGNMYTVVGNTHATSNFPLKGGKATEYEGGLKVPTAVIWPGL 335 Query: 395 IQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 QP S+ + AD FPT L+ G ++ P T +DG D G ++ Sbjct: 336 TQPNTLSNTPIQTADFFPTLLN-----GVNLS--WPSTHIVDGRDIRPVLQGGTLETRAI 388 Query: 454 AEHYFLNGKL-------AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 +Y K+ A V +D +K + + + Y +SG T ++N Sbjct: 389 FTYYPAEPKVPDWLPPSATVTLDGWK----LIRTFHYGKSG-----------THLYKLYN 433 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 L DP ES ++ I L + AY+ Sbjct: 434 LNLDPSESTNLANTQIHKVSELDALLEAYL 463 >UniRef50_A6DI30 N-acetylgalactosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI30_9BACT Length = 519 Score = 124 bits (311), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 112/366 (30%), Positives = 157/366 (42%), Gaps = 43/366 (11%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 K PN + + DD GW DVG+NG + TP++DA+A+ GL Y+ S SPTRAT Sbjct: 22 KLPNFIYCITDDQGWGDVGYNGHPIL---KTPELDAMAADGLRCDRFYAAASVCSPTRAT 78 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQ--- 198 ++TG+ + I P YG+ + TL Q L GY + GKWH+GE +KE Sbjct: 79 VVTGRNNWRVNISSPTAYGEASLPKEEITLGQYLKPLGYTSAHFGKWHIGEFDKEIAKHH 138 Query: 199 ---PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 P GFD +V Y D + S D ++ + + V Sbjct: 139 YMPPWEAGFDVTFSTRNVIATY----DPYQKASKGKSGDELLKANKMLYYDNGVMIPM-- 192 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-----FDNYPNA 310 ++A+ D P D + MD F+ MAK DKPF++Y H Y Sbjct: 193 -EKALND--PSLKGDDSRIVMDRAETFIRDMAKDDKPFYIYLCFHAVHTPLVTIPEYHKK 249 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP----------- 359 Y+ +Y + ++ L K L + DNT++ ++SDNGP Sbjct: 250 FYSDLDAKSANYFSNISAIDGQMGRLRKLLRELNIADNTMLWYSSDNGPNLKKKDNIKYG 309 Query: 360 EAEVPPHGRTP------FRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPT 412 EA+ TP ++G K WEGGVRV V W MI Q D + D PT Sbjct: 310 EAQDGKFNYTPIGSTGAYKGWKRYLWEGGVRVCGLVEWPAMIKQGIDHDYPIVTTDFVPT 369 Query: 413 ALDLAG 418 AL G Sbjct: 370 ALAAVG 375 >UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ89_PLALI Length = 536 Score = 124 bits (311), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 119/413 (28%), Positives = 172/413 (41%), Gaps = 85/413 (20%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 +PNVV L DD+GW +VG G PTP+ID +AS+G+ LT YS P+ +P+R + Sbjct: 38 RPNVVFILADDLGWGEVGCFGQSKI---PTPNIDRLASRGVKLTRHYSGAPTCAPSRCVL 94 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQ-----------------GYVTQAI 186 +TG++ H I G Q LPQ Q GY T A Sbjct: 95 MTGKHLGHAEIR--------GNQQAKVKLPQFTEGQHPLSDKALTIARQFQKAGYATGAF 146 Query: 187 GKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEW-RDVHVNPEVALSPDRSEYIKQLPF 244 GKW +G +P GFD+F G+N + ++ + + + N E ++ + K +P Sbjct: 147 GKWGLGPVGSTGEPNRQGFDEFFGYNCQALAHSYFPKALWKNAESIVNNE-----KPVPG 201 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 K E + P+ + M + F+D+ +PFFLY H Sbjct: 202 HKKQPEGEVTMEAYQGENYAPRLI-------MAEALSFIDR--HHQQPFFLYLPFTEPHV 252 Query: 305 DNYPNAKYAGSSPA-------------------RTSYGDCMVEMNDVFANLYKTLEKNGQ 345 P K P R +Y + ++++ ++ +LEK+G Sbjct: 253 AMQPPPKIVEEFPVEWDERVYRGDGGYLPHPRPRAAYAAMIRDLDNHVGDVITSLEKHGL 312 Query: 346 LDNTLIVFTSDNG-------PEAEVPPHGRTP--------FRGAKGSTWEGGVRVPTFVY 390 L+ TLIVFTSDNG P+ V G P +G KGS +EGG+RVP V Sbjct: 313 LEKTLIVFTSDNGATHASANPDFHV--GGADPLFFNSTRELKGFKGSIYEGGLRVPAIVS 370 Query: 391 WKGMIQPRKSDGIVD-LADLFPTALDLAGHP---GAKVANLVPKTTFIDGVDQ 439 W G I P + D FPT + P G NL+P T DQ Sbjct: 371 WPGQIPPATTINTPSYFPDWFPTLCNATQLPLPEGLDGVNLLPLLTGKTSPDQ 423 >UniRef50_C9MKK8 Arylsulphatase A n=4 Tax=Bacteroidales RepID=C9MKK8_9BACT Length = 598 Score = 124 bits (310), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 112/423 (26%), Positives = 193/423 (45%), Gaps = 65/423 (15%) Query: 78 LEKKTGK---KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-Q 133 + +K GK KPNV++ L DD+G+ D+ G A TP+I+ +A G+ T+ ++ Sbjct: 107 MAQKAGKHLNKPNVIIILADDLGYGDIECYG---AKNVHTPNINRLAQSGIRFTNGHAVA 163 Query: 134 PSSSPTRATILTGQYSIHH-GILMPPMYGQPGGLQGLT--TLPQLLHDQGYVTQAIGKWH 190 +S+P+R ++LTG+Y+ G + P G G + + T+ + QGY T AIGKWH Sbjct: 164 ATSTPSRYSLLTGEYAWRREGTDVAP--GNAGMIIKPSQFTMADMFKSQGYATCAIGKWH 221 Query: 191 MGENKESQPQN-----------VGFDDFRGFNSVSD----MYTEWRDV-HVNPEVALSPD 234 +G ++ Q+ +GFD + +D ++ E V + +PE + Sbjct: 222 LGLGDKTGEQDWNAPLPQALGDIGFDYHYIMAATADRVPCVFIENGKVANYDPEHPIEVS 281 Query: 235 RSEYIKQLPFSKD------DVHAVRGGEQQAIADI-TPKYM----------EDLDQRWMD 277 + P +D + HA G + + I YM E++ + Sbjct: 282 YRKNFDGEPTGRDHPELLYNQHASHGHDMSIVNGIGRIGYMKGGGKALWKDENIADSILV 341 Query: 278 YGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 + +F+++ +PFF+Y+ T H +P+ ++ G + GD + + + L Sbjct: 342 HATRFMEQ--HKTEPFFMYFATNDVHVPRFPHPRFRGHNIMGVR-GDAIEQFDWTVGELL 398 Query: 338 KTLEKNGQLDNTLIVFTSDNGP--------EAEVPPHGRT---PFRGAKGSTWEGGVRVP 386 + L++ DNTL++ TSDNGP AE HG + PFRG K S +EGG +P Sbjct: 399 RKLKELHLDDNTLVILTSDNGPVVDDGYADRAEELLHGHSPAGPFRGNKYSAFEGGTAIP 458 Query: 387 TFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT 446 V W ++ +++D + +D G+ + +PK + D D S LG Sbjct: 459 LIVSWSKQVKGG------EVSDALVSQIDFLSSLGSLIHATLPKGSAPDSKDYLSTLLGR 512 Query: 447 NGQ 449 N Q Sbjct: 513 NKQ 515 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 124 bits (310), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 138/466 (29%), Positives = 205/466 (43%), Gaps = 97/466 (20%) Query: 71 TQQKLAELEK-KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS 129 TQQ LA +K K +PNVV L DD+G+ D+ G TP+ID +A G+ T Sbjct: 21 TQQALARQKKAKEQTRPNVVFILADDLGYGDLSCYG---QEKFETPNIDRLAQNGMRFTQ 77 Query: 130 AYSQPS-SSPTRATILTGQYSIHHGI-----LMPPMYGQPGGLQGLTTLPQLLHDQGYVT 183 YS + S+P+R+ ++TG +S H I L P GQ + T+ + GY T Sbjct: 78 CYSGTTVSAPSRSCLITGTHSGHTAIRGNKELAPE--GQFPLPENSQTIFNDFRNAGYRT 135 Query: 184 QAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRD-VHVNPEVALSPDRSEYIK- 240 A GKW +G P G D F G+N ++ + D + N + PD + ++ Sbjct: 136 GAFGKWGLGYIGSAGDPYKQGIDQFYGYNCQLLAHSYYPDHLWDNDKRVDLPDNNLNVQY 195 Query: 241 -QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS-DKPFFLYYG 298 + +S+D +H+ +A+A FLD+ AK D+PFF++Y Sbjct: 196 GKGTYSQDLIHS------KALA--------------------FLDEAAKEKDQPFFMWYP 229 Query: 299 TRGCHFD--------------NYPNAKYAG---SSPARTSYGDC-----------MVEMN 330 T H + YP Y G SPA G C MV Sbjct: 230 TIIPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKGGYCTQFYPHATFAAMVYRL 289 Query: 331 DVFA-NLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--------FRGAKGSTWEG 381 DV+ + + L+ G DNT+I+F+SDNGP E G P +RG K +EG Sbjct: 290 DVYVGQIVQKLKDMGVYDNTIIIFSSDNGPHME---GGADPDFFNSNGIWRGYKRDVYEG 346 Query: 382 GVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G+RVP + W G +QP ++D + DL PT ++ +P A N+ DGV Sbjct: 347 GIRVPMIISWPGHVQPSTETDFMCSFWDLMPTFREVL-NPKADTRNM-------DGVSIL 398 Query: 441 SFFLGTNGQSNRKAEHY-FL--NGKLAAVRMDEFKYHVLIQ--QPY 481 GQ + ++ FL NG+ A + D H+ I+ +PY Sbjct: 399 PLLQNRKGQKEHEYLYFEFLEMNGRQAVRKGDWKLVHMNIRGNKPY 444 >UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ23_9FIRM Length = 483 Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 112/409 (27%), Positives = 178/409 (43%), Gaps = 52/409 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRAT 142 KKPN++VFL DD G+ D+ G TP++D +A+ G T Y+ + SP+RA Sbjct: 15 KKPNIIVFLTDDQGYGDLSCMGSTDVC---TPNLDILAAGGARFTDFYAGSAVCSPSRAC 71 Query: 143 ILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ ++ + G G+ T L D GY T +GKWH+G E +P Sbjct: 72 LLTGRYPYMTGVRSILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAVPECRPT 131 Query: 201 NVGFDDFRGFNS-VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 ++GFD F GF S V+D ++ N ++P+ + ++D ++ Sbjct: 132 HMGFDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLW-------ENDERCLK------ 178 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS---- 315 T +Y +L R G++F+ + + D PF LY H+ + KY Sbjct: 179 ---YTGEYSTELFAR---KGLEFIREQVEKDMPFALYCAFNAPHYPMHAPYKYLERFKHL 232 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH--------- 366 R + ++D + L++ G ++T+I F SDNGP E Sbjct: 233 PEDRQIMAAMLSAVDDGVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYY 292 Query: 367 -GRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA-DLFPTALDLAGHPGAK 423 G T +G K S ++GG+RVP W M+ + + D+FPT ++ AG + Sbjct: 293 GGSTGGLKGHKFSLFDGGIRVPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASD 352 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 I G D T G K Y+ G+ AVR +K Sbjct: 353 YE--------ISGCDILPVM--TIGARRDKDCLYWEMGQQTAVRRGNYK 391 >UniRef50_A6DIG7 Iduronate-sulfatase or arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIG7_9BACT Length = 482 Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 109/369 (29%), Positives = 174/369 (47%), Gaps = 61/369 (16%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPT 139 + +KPN+V+++ DD G + G TP+++ + G+ T+A++ S SPT Sbjct: 20 QANEKPNIVLYVADDFGLASINALGADEKFVQ-TPNLNKLTENGIKFTNAFTTASVCSPT 78 Query: 140 RATILTGQYS----IHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGEN 194 R T+LTGQYS + G++ P + T T+ ++L QGY T A+GKWH+G Sbjct: 79 RYTMLTGQYSWKTRLKKGVVNN---NDPLIISTETITMGKMLQSQGYRTAAVGKWHLGYT 135 Query: 195 KES----------QPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 + P +VGFD G N++ D++ +V + DR ++ Sbjct: 136 DKKFDNLLGKIYPGPNDVGFDYHFGVPNNLDDLH----------KVYIENDRIYGLRS-- 183 Query: 244 FSKDDVHA----VRGGEQQAIADITPKYMEDLDQRWMDYGV-KFLDKMAK-SDKPFFLYY 297 D + A GG+ D + E++ MDY K LD + K KPFF+YY Sbjct: 184 ---DKIEAWGASFYGGQSYKGYDAPQRVCEEV----MDYTTQKALDWIKKDKSKPFFMYY 236 Query: 298 GTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 H P+ + G S + YGD + +++ + LE+ G ++NTL +FTSDN Sbjct: 237 AAAAVHHPITPSKRMKGKSGSGL-YGDFIQDLDYSVGQFIQALEQEGLMENTLFIFTSDN 295 Query: 358 G----------PEAEVPP---HGRTPFRGAKGSTWEGGVRVPTFVYW-KGMIQPRKSDGI 403 G PE + H RG K +EGG++VP ++W K + + ++SD + Sbjct: 296 GGDIPASNKDWPEYQAYDMGFHYNGKTRGDKHQIYEGGLKVPFIIHWPKKVKKGQESDHL 355 Query: 404 VDLADLFPT 412 V AD F T Sbjct: 356 VTTADFFST 364 >UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYD6_RHOBA Length = 889 Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 117/375 (31%), Positives = 157/375 (41%), Gaps = 80/375 (21%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 K+PNV+ L DD+GW D G TP+I+ +A +G+ T AYS P SPTRA+ Sbjct: 265 KRPNVLFILADDLGWSDTTLFG--TTKLYQTPNIERLAKRGMTFTRAYSSSPLCSPTRAS 322 Query: 143 ILTGQYSIHHGILMP----------PMYGQPGGLQGLTTLPQ--------------LLHD 178 +LTG HGI P P + G +T+P+ + D Sbjct: 323 VLTGLSPARHGITSPTCHLPKVVLEPKVSETGPPNKFSTVPESVTRLDTKYYTLAEMFRD 382 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDV--HVNPEVALSPDRS 236 GY T GKWH+G S P GFD DV H P A S Sbjct: 383 NGYATGHFGKWHLGPEPYS-PLEHGFD---------------VDVPHHPGPGPAGS---- 422 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 Y+ F D V I D E L+ R V+FL++ +++PFFL Sbjct: 423 -YVAPWKFKDFDHDPV-------IPD------EHLEDRMAKEAVRFLEQ--HTNEPFFLN 466 Query: 297 YGTRGCH--FDNYPNA------KYAGSSPARTSYGDCMVE-MNDVFANLYKTLEKNGQLD 347 Y H FD + P R M+E M+D L TL++ G D Sbjct: 467 YWMFSVHAPFDAKKELIEEYRDRVDPKDPQRCPTYAAMIESMDDAIGTLLDTLDRLGIAD 526 Query: 348 NTLIVFTSDNGPEAEVPPHGRT-----PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSD 401 T+IVF SDNG G T P RG K + +EGGVR P V G+++ +SD Sbjct: 527 ETIIVFASDNGGNMYNEVDGTTATSNAPLRGGKATMYEGGVRGPAIVVQPGVVESGSRSD 586 Query: 402 GIVDLADLFPTALDL 416 I+ D +PT L++ Sbjct: 587 AIIQSIDFYPTLLEM 601 >UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R921_9PLAN Length = 676 Score = 123 bits (309), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 135/499 (27%), Positives = 203/499 (40%), Gaps = 115/499 (23%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATIL 144 KPNVV L DDVGW D+ +GGGV PTP+ID + +QG+ ++ SPTRA L Sbjct: 52 KPNVVYILADDVGWGDLSVHGGGV----PTPNIDKLFAQGIEVSHFMGWCVCSPTRAMFL 107 Query: 145 TGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV-- 202 TG++ I G P G L TT+ + GY T GKWH G + ++ Sbjct: 108 TGRHPIRVG--TGPEVGGELSLDE-TTIAEGFKANGYRTGVFGKWHSGSDPDTPAFRAAF 164 Query: 203 ---------------------GFDD-FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 GFD+ + + +D + R V V+ +R Sbjct: 165 AEAFKAIPNKQFAGGHGANAHGFDEAWVYYGGGADFFNR-RTVQGRGPVSWWHNRE---- 219 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDL-DQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 F DD Y +DL QR ++F+ + D+PFF Y Sbjct: 220 ---FRPDD----------------EGYTDDLVTQR----AIEFIRE--NKDQPFFCYVPF 254 Query: 300 RGCH------------FDN-----YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEK 342 H D+ P A S + + + M++ A + LEK Sbjct: 255 HIAHAPLQAKENDLAAIDSKTAAKLPTASGKTSDEGKHIHAAMLHSMDNNIAAIRDELEK 314 Query: 343 NGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWK--GMIQPRKS 400 G DNT+ VFTSDNG + P RG K + +EGGVR+PT +YW G+ RK Sbjct: 315 LGLSDNTIFVFTSDNG---AMEAGSSLPLRGHKHTIYEGGVRLPTAIYWPKGGLTGGRKW 371 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN 460 +G+ D+FPT + A + +PKT +DG + + + Q + +YF+ Sbjct: 372 NGLCGALDMFPTLM-------AMTDSTMPKTQPLDG--KNVWPALRDNQPSPVESYYFIW 422 Query: 461 GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR 520 A+R D +K H + G + ++++ D ES++I Sbjct: 423 HDEDAIRTDRWKLHR------------FHGRY----------ELYDITIDETESNNIADS 460 Query: 521 HIPMGVPLQTEMHAYMEIL 539 H + L +M A+ E L Sbjct: 461 HPDVVKSLSAKMDAWAESL 479 >UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C383_9PLAN Length = 405 Score = 123 bits (308), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 114/404 (28%), Positives = 175/404 (43%), Gaps = 53/404 (13%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 + +KPNV++ DD G +D+ G + TP +D++A +G+ T Y S P SP+R Sbjct: 5 SSEKPNVIIIFTDDQGSVDLNCYGAKDLI---TPHMDSIARRGIRFTQFYASAPVCSPSR 61 Query: 141 ATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 A +LTG++ G+ + +G+ G T+ +++ GY T IGKWH+G E+ Sbjct: 62 AGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGYTPETM 121 Query: 199 PQNVGFD-DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GF+ F D Y+ + + N P+R + + G E Sbjct: 122 PHGQGFETSFGHMGGCIDNYSHF--FYWN-----GPNRHDLWEN------------GKEV 162 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK----YA 313 P M + Q DY K DKPFFLY+ H+ K YA Sbjct: 163 WRDGAFFPDLMVEQCQ---DYIRK------AGDKPFFLYWAINVPHYPLQGKEKWRKTYA 213 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG----RT 369 S R Y + M+D + TL+ + T+I+F SD+G E G Sbjct: 214 HLSSPRDKYAAFVSTMDDCIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFGGGGSAG 273 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P+RGAK S +EGG+RVP + W G I + D + D PT L G P + Sbjct: 274 PYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATGCDWLPTISALTGAP-------L 326 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 P +DG + + + +S + Y+ GK A+R ++K Sbjct: 327 P-AHHLDGKNLKAVIESSTAKSPHE-NFYWQIGKSWAIREGDWK 368 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA... 816 0.0 UniRef50_A5FF56 Sulfatase n=2 Tax=Bacteria RepID=A5FF56_FLAJ1 478 e-133 UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 467 e-130 UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=... 462 e-128 UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 457 e-127 UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO 455 e-126 UniRef50_Q7ULF9 Arylsulfatase n=4 Tax=Bacteria RepID=Q7ULF9_RHOBA 455 e-126 UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteri... 452 e-125 UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 449 e-124 UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN 449 e-124 UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 447 e-124 UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina D... 447 e-124 UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Ta... 447 e-124 UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 445 e-123 UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 444 e-123 UniRef50_A1WGP9 Sulfatase n=6 Tax=Proteobacteria RepID=A1WGP9_VEREI 443 e-123 UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS2... 440 e-122 UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus E... 439 e-121 UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM 439 e-121 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 437 e-121 UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LD... 436 e-121 UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bactero... 433 e-120 UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6... 431 e-119 UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT 431 e-119 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 430 e-119 UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA 430 e-119 UniRef50_B8KM61 Steryl-sulfatase n=2 Tax=gamma proteobacterium N... 427 e-118 UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 427 e-118 UniRef50_C1ZCM0 Arylsulfatase A family protein n=2 Tax=Bacteria ... 425 e-117 UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 425 e-117 UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 424 e-117 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 422 e-116 UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 422 e-116 UniRef50_C7ZGP1 Predicted protein n=3 Tax=Leotiomyceta RepID=C7Z... 421 e-116 UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 419 e-115 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 419 e-115 UniRef50_D2QZL2 Sulfatase n=8 Tax=cellular organisms RepID=D2QZL... 417 e-115 UniRef50_B3CAE2 Putative uncharacterized protein n=3 Tax=Bactero... 417 e-115 UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 416 e-114 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 416 e-114 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 414 e-114 UniRef50_B8KV72 Arylsulfatase A n=1 Tax=gamma proteobacterium NO... 414 e-114 UniRef50_Q1CY93 Sulfatase family protein n=4 Tax=Bacteria RepID=... 414 e-114 UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia caver... 413 e-114 UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LC... 413 e-114 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 413 e-114 UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT 413 e-113 UniRef50_Q0KB87 Arylsulfatase A or related enzyme n=107 Tax=cell... 413 e-113 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 412 e-113 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 411 e-113 UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y2... 411 e-113 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 410 e-113 UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomy... 409 e-112 UniRef50_B4AUP3 Sulfatase n=2 Tax=Bacteria RepID=B4AUP3_9CHRO 408 e-112 UniRef50_P15289 Arylsulfatase A component C n=34 Tax=Euteleostom... 408 e-112 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 408 e-112 UniRef50_A6DF77 Arylsulphatase A n=2 Tax=Lentisphaera araneosa H... 406 e-112 UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 406 e-112 UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 406 e-111 UniRef50_Q46SG5 Arylsulfatase n=3 Tax=Proteobacteria RepID=Q46SG... 406 e-111 UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 405 e-111 UniRef50_A0Z7U6 Arylsulfatase n=2 Tax=Gammaproteobacteria RepID=... 405 e-111 UniRef50_A6UG37 Sulfatase n=16 Tax=Bacteria RepID=A6UG37_SINMW 405 e-111 UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deute... 405 e-111 UniRef50_Q488V4 Sulfatase family protein n=30 Tax=Bacteria RepID... 404 e-111 UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=... 404 e-111 UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphing... 404 e-111 UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchi... 404 e-111 UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN... 402 e-110 UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD 401 e-110 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 400 e-110 UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 400 e-110 UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7... 400 e-110 UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 400 e-110 UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bactero... 400 e-110 UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium ... 400 e-110 UniRef50_Q7UG72 Arylsulfatase A [precursor] n=1 Tax=Rhodopirellu... 400 e-109 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 400 e-109 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 399 e-109 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 398 e-109 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 397 e-109 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 396 e-109 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 396 e-109 UniRef50_B8KM62 N-acetylgalactosamine-6-sulfatase n=1 Tax=gamma ... 396 e-109 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 396 e-108 UniRef50_UPI0001A444F6 arylsulfatase A n=1 Tax=Pectobacterium ca... 395 e-108 UniRef50_A4AM21 Arylsulfatase A n=1 Tax=Flavobacteriales bacteri... 395 e-108 UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 395 e-108 UniRef50_B8KTJ7 Arylsulfatase F n=1 Tax=gamma proteobacterium NO... 395 e-108 UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 395 e-108 UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 394 e-108 UniRef50_A9W035 Sulfatase n=6 Tax=Bacteria RepID=A9W035_METEP 394 e-108 UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW10... 393 e-108 UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 393 e-107 UniRef50_A6BZV9 Arylsulfatase n=3 Tax=Bacteria RepID=A6BZV9_9PLAN 393 e-107 UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 T... 393 e-107 UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bactero... 392 e-107 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 392 e-107 UniRef50_A6C8R8 Arylsulfatase A n=2 Tax=Planctomycetaceae RepID=... 392 e-107 UniRef50_A6QA55 Arylsulfatase n=5 Tax=Bacteria RepID=A6QA55_SULNB 391 e-107 UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN 391 e-107 UniRef50_UPI00016C41FE sulfatase n=1 Tax=Gemmata obscuriglobus U... 391 e-107 UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3... 391 e-107 UniRef50_Q024K7 Sulfatase n=28 Tax=Bacteria RepID=Q024K7_SOLUE 390 e-107 UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 390 e-107 UniRef50_C1ZFQ0 Arylsulfatase A family protein n=1 Tax=Planctomy... 390 e-106 UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 390 e-106 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 389 e-106 UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 389 e-106 UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 389 e-106 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 388 e-106 UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bactero... 388 e-106 UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT 388 e-106 UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 388 e-106 UniRef50_A4A218 Arylsulfatase A n=2 Tax=Bacteria RepID=A4A218_9PLAN 388 e-106 UniRef50_A6DG39 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 387 e-106 UniRef50_D2R783 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 387 e-106 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 387 e-106 UniRef50_A6DQE3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 387 e-106 UniRef50_B4D681 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 386 e-106 UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacteriu... 386 e-106 UniRef50_Q7UNN1 Arylsulphatase A n=3 Tax=Bacteria RepID=Q7UNN1_R... 386 e-105 UniRef50_A8G0H1 Sulfatase family protein n=5 Tax=Gammaproteobact... 386 e-105 UniRef50_B9XCM3 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XCM... 385 e-105 UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM... 385 e-105 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 385 e-105 UniRef50_B0UGK6 Sulfatase n=18 Tax=Bacteria RepID=B0UGK6_METS4 385 e-105 UniRef50_A9UPM8 Predicted protein (Fragment) n=1 Tax=Monosiga br... 384 e-105 UniRef50_A6DP41 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 384 e-105 UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 383 e-105 UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA 383 e-105 UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto... 383 e-105 UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 383 e-105 UniRef50_A6DHI4 Arylsulfatase A (ASA) n=1 Tax=Lentisphaera arane... 383 e-104 UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 382 e-104 UniRef50_A0Z6R0 Putative arylsulfatase n=1 Tax=marine gamma prot... 382 e-104 UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 382 e-104 UniRef50_C6Y1U6 Sulfatase n=2 Tax=Sphingobacteriales RepID=C6Y1U... 381 e-104 UniRef50_Q96EG1 Arylsulfatase G n=22 Tax=Euteleostomi RepID=ARSG... 381 e-104 UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bac... 381 e-104 UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT 381 e-104 UniRef50_Q7UIN1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 380 e-104 UniRef50_Q1YP24 Arylsulfatase A n=1 Tax=gamma proteobacterium HT... 380 e-104 UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica R... 380 e-104 UniRef50_UPI000180BD6E PREDICTED: similar to arylsulfatase n=1 T... 380 e-104 UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 380 e-104 UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 380 e-104 UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 380 e-104 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 380 e-104 UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase... 380 e-104 UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR4... 380 e-104 UniRef50_A6DMX8 Iduronate-sulfatase or arylsulfatase A n=1 Tax=L... 380 e-103 UniRef50_D2R2H5 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 380 e-103 UniRef50_Q7UIU1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 379 e-103 UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 379 e-103 UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 379 e-103 UniRef50_UPI00005846A1 PREDICTED: similar to arylsulfatase n=1 T... 378 e-103 UniRef50_A6LIX5 Arylsulfatase n=2 Tax=Bacteroidales RepID=A6LIX5... 378 e-103 UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=... 378 e-103 UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3... 378 e-103 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 378 e-103 UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis... 378 e-103 UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 378 e-103 UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 378 e-103 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 378 e-103 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 377 e-103 UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacter... 377 e-103 UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 377 e-103 UniRef50_A6DLD9 Sulfatase n=2 Tax=Chlamydiae/Verrucomicrobia gro... 376 e-102 UniRef50_B7S1F0 Sulfatase, putative n=1 Tax=marine gamma proteob... 376 e-102 UniRef50_C6VTS4 Sulfatase n=47 Tax=cellular organisms RepID=C6VT... 376 e-102 UniRef50_Q7UYH4 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 376 e-102 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 375 e-102 UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis... 375 e-102 UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomy... 375 e-102 UniRef50_B7AM73 Putative uncharacterized protein n=1 Tax=Bactero... 375 e-102 UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN 375 e-102 UniRef50_D2QW96 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 375 e-102 UniRef50_A0PKV5 Arylsulfatase, AslA n=5 Tax=Bacteria RepID=A0PKV... 375 e-102 UniRef50_A6DHS3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 375 e-102 UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 374 e-102 UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 374 e-102 UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 373 e-102 UniRef50_Q15US6 Sulfatase n=3 Tax=Alteromonadales RepID=Q15US6_P... 373 e-102 UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep... 373 e-101 UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisp... 372 e-101 UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2... 372 e-101 UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 372 e-101 UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO 372 e-101 UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organi... 371 e-101 UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flamme... 371 e-101 UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW... 371 e-101 UniRef50_Q1VDY3 Probable sulfatase n=2 Tax=Vibrio alginolyticus ... 371 e-101 UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1... 371 e-101 UniRef50_UPI0001745666 N-acetylgalactosamine 6-sulfate sulfatase... 371 e-101 UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7... 370 e-101 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 370 e-101 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 370 e-101 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 370 e-101 UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris ... 370 e-100 UniRef50_C9KTC2 Arylsulphatase A n=5 Tax=Bacteroides RepID=C9KTC... 369 e-100 UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 369 e-100 UniRef50_B2ULS2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 369 e-100 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 368 e-100 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 368 e-100 UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD 368 e-100 UniRef50_A4CJK0 Arylsulfatase A n=1 Tax=Robiginitalea biformata ... 368 e-100 UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bactero... 368 e-100 UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 368 e-100 UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomy... 368 e-100 UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI 368 e-100 UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 367 e-100 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 367 e-100 UniRef50_D2R207 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 366 e-100 UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB... 366 1e-99 UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomy... 366 1e-99 UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9... 366 2e-99 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 366 2e-99 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 366 2e-99 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 365 2e-99 UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME 365 2e-99 UniRef50_C2G0L0 Possible Cerebroside-sulfatase n=2 Tax=Sphingoba... 365 2e-99 UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN 365 3e-99 UniRef50_D2QCX4 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 365 3e-99 UniRef50_Q1QJ61 Sulfatase n=3 Tax=Bacteria RepID=Q1QJ61_NITHX 364 4e-99 UniRef50_B4CZ78 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 364 4e-99 UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisp... 364 5e-99 UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacte... 364 5e-99 UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 T... 364 6e-99 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 364 6e-99 UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bactero... 364 6e-99 UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 363 8e-99 UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepI... 363 8e-99 UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 363 9e-99 UniRef50_P50473 Arylsulfatase n=8 Tax=Deuterostomia RepID=ARS_STRPU 363 9e-99 UniRef50_C6Z6I9 N-acetylgalactosamine 6-sulfate sulfatase n=5 Ta... 363 9e-99 UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN 363 1e-98 UniRef50_C7RSC1 Sulfatase n=2 Tax=Bacteria RepID=C7RSC1_9PROT 363 1e-98 Sequences not found previously or not previously below threshold: UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 398 e-109 UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6... 392 e-107 UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 391 e-107 UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8... 383 e-105 UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7... 380 e-103 UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob... 376 e-103 UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 376 e-102 UniRef50_A6DF76 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 373 e-102 UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 373 e-102 UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI... 372 e-101 UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 370 e-101 UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 365 2e-99 >UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA_ECOLI Length = 551 Score = 816 bits (2109), Expect = 0.0, Method: Composition-based stats. Identities = 551/551 (100%), Positives = 551/551 (100%) Query: 1 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP 60 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP Sbjct: 1 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP 60 Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV Sbjct: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 Query: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG Sbjct: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK Sbjct: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR Sbjct: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE Sbjct: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP Sbjct: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP Sbjct: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK Sbjct: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 Query: 541 KYPPRAQIKSD 551 KYPPRAQIKSD Sbjct: 541 KYPPRAQIKSD 551 >UniRef50_A5FF56 Sulfatase n=2 Tax=Bacteria RepID=A5FF56_FLAJ1 Length = 524 Score = 478 bits (1230), Expect = e-133, Method: Composition-based stats. Identities = 195/509 (38%), Positives = 293/509 (57%), Gaps = 20/509 (3%) Query: 42 HPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDV 101 Q P + D + P + HP QDKE + KL++L+KK PN+++ L+DD+G+ D+ Sbjct: 17 SAQQNYFNPTVKVKDYLEPAIPHPDQDKEMKDKLSKLKKK----PNILIILIDDMGYGDI 72 Query: 102 GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYG 161 G GGGVA+G PTP++D +A +GL LTS Y+QP+ +P+RA I+TG+ G+ P + G Sbjct: 73 GVYGGGVAIGAPTPNMDKLAHEGLQLTSTYAQPTCTPSRAAIMTGRIPARSGLTRPTLTG 132 Query: 162 QPGGLQ---GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYT 218 + + T ++L GY + GKWH+GE+K S P VG+D++ GF SV Y Sbjct: 133 ENPKVNPWASENTTAKILSQNGYKSAISGKWHLGESKGSLPNEVGYDEWLGFGSVQSEYA 192 Query: 219 EWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP-KYMEDLDQRWMD 277 ++ + + P++ PDR +K++ ++ + V+GGE + + I+ + + +DQ + + Sbjct: 193 QFVNEWIYPDLINKPDRLAAVKKM-VDQNILTGVKGGENKVVQPISNIEELSKVDQVFAN 251 Query: 278 YGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 Y F+ + K +KPF+L + H DNY + Y G SPA Y D +VE++D+ L Sbjct: 252 YSEDFIKRSVKENKPFYLIHSFSKVHNDNYVSEGYKGKSPAAIPYKDAIVEVDDIVGRLM 311 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 K L+ DNTL+ TSDNGP +V P G TPFRG KG+TWEGGVRVP YWKGMI Sbjct: 312 KLLQDLKIDDNTLVFLTSDNGPNEDVWPDGGYTPFRGGKGTTWEGGVRVPGIAYWKGMIA 371 Query: 397 PRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 P + SDG+ D+ D+F T+L AG V + +P + +IDGVDQ SFFL G SNR A Sbjct: 372 PGRISDGLFDICDMFNTSLSAAG-----VLDKIPSSNYIDGVDQLSFFLSDKGVSNRNAV 426 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 + K A+R E+K H+ + A ++ Q M V+N+Y DP+E Sbjct: 427 FMYSETKFMAIRWQEYKVHMNVFNTSATRRNLDQSTIQSIGMS---PWVYNIYADPKEQL 483 Query: 516 SIGVRHIPMGVP-LQTEMHAYMEILKKYP 543 S G R+ G+P + + A++ +KYP Sbjct: 484 SQGHRYFEWGIPGVMGLIAAHLATYQKYP 512 >UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED1_PARD8 Length = 459 Score = 467 bits (1202), Expect = e-130, Method: Composition-based stats. Identities = 133/479 (27%), Positives = 206/479 (43%), Gaps = 59/479 (12%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 KPN V+ DD+G+ D+ G TP+ID +A +G+ LT Y Sbjct: 22 NAASDAANKPNFVIIFCDDMGYGDLSCYGNPTI---RTPNIDRMACEGMKLTQFYVGAGV 78 Query: 136 SSPTRATILTGQYSIHHGILMPPM-----YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 S+P+RA ++TG+ + +G+ + + G Q T+ ++L GY T +GKWH Sbjct: 79 STPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWH 138 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G P + GFD + G +DM P H Sbjct: 139 LGAFSPYLPTDHGFDTYFGIPYSNDM-------------------------SPVQNKGAH 173 Query: 251 AVRGGEQQAIADITPKYME----DLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 A I D E +L +R+ + V F+ +K PFFLY+ H Sbjct: 174 ARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSFIKNHSKE--PFFLYFAHTFPHIPL 231 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 Y NA++ G+S R YGD + E++ + K L +NG +NT ++FTSDNGP + Sbjct: 232 YTNARFEGTS-KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNGPWLTEHEN 290 Query: 367 GRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 G + P + KG+ WEGG RVP + G I P +D I+ DL+PT L +AG Sbjct: 291 GGSAGPLKDGKGTWWEGGFRVPAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQ--- 347 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYT 484 PK +DGV+QT S R +Y+ +L A+R E+KY+ + Sbjct: 348 ----PKDLVLDGVNQTGLLF-EEKHSARDEVYYWWGSELMAIRKGEWKYYFKTIKDQYLR 402 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 T + A ++N+ TD E ++ +H + L + + +K P Sbjct: 403 --------TCKIETPAEPLLYNVETDISERFNLADKHPEIVKLLIEAGEKHKKGMKIKP 453 >UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=Q0C069_HYPNA Length = 505 Score = 462 bits (1190), Expect = e-128, Method: Composition-based stats. Identities = 131/500 (26%), Positives = 220/500 (44%), Gaps = 54/500 (10%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 +AE E ++PN+V+ +DD+G+ D+G G +A TP++D +A +G TS Sbjct: 31 APDSVAEKEAAASEQPNIVLIFVDDMGYADIGSFGSPIA---RTPNLDRLAMEGQKWTSF 87 Query: 131 YS-QPSSSPTRATILTGQYSIHHGI---------LMPPMYGQPGGLQGLTTLPQLLHDQG 180 Y+ P +P+RA ++TG+ ++ G+ L P G G Q T+ +LL +G Sbjct: 88 YAPAPVCTPSRAGLMTGRLAVRSGMAGLVQARHVLFPTSTG--GLPQSEVTIAELLQQEG 145 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 YV+ A GKWHMG E P + GF + G +DM P +P + Sbjct: 146 YVSAAFGKWHMGHLPEFLPTSHGFQSYFGIPYSNDM--------NMPGGGETPWSIDLFF 197 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 + P ++ + E+ P L QR+ + ++F++ +PFFLY Sbjct: 198 EPPNIQNWDVPLMQDEEII---ERPADQFTLTQRYTERAIEFMETSHAEGQPFFLYLAHN 254 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 H + + + G S A +YGD + E++ + L+ NTL++FTSDNGP Sbjct: 255 MPHTPLFTSEGFTGVS-AGGAYGDVIEELDWSVGEIVDALKDMKIEKNTLVIFTSDNGPW 313 Query: 361 AEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAG 418 + H + R KG+TWEGG+RVP +W G I PR + DL PT ++G Sbjct: 314 LAMKTHSGSAGMLRDGKGTTWEGGMRVPAIFWWPGQIAPRTVTDLGSALDLMPTFAAISG 373 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 +P+ DG D + + G S R+ +Y+ + AVR ++K H Sbjct: 374 A-------RLPEDRVYDGFDLSPALF-SEGSSPRETLYYYRFTDVFAVRKGKYKAHFSTY 425 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ-------TE 531 + + ++++ DP E +I +H + + L+ Sbjct: 426 GAFG----------GSGRTELETPELYDIEADPSEQFNIAAQHPEIVMELKVLAEKQAAS 475 Query: 532 MHAYMEILKKYPPRAQIKSD 551 + L++YPP + + Sbjct: 476 VEPVENQLERYPPGEKRGEE 495 >UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPC8_9BACT Length = 598 Score = 457 bits (1177), Expect = e-127, Method: Composition-based stats. Identities = 133/462 (28%), Positives = 197/462 (42%), Gaps = 36/462 (7%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 + T KKPN +V DD G+ D+G G TP+ID +A +G T+ YS S Sbjct: 18 QATDKKPNFIVIFTDDQGYQDLGCFGSPKI---KTPEIDQMAKEGARYTNFYSANAICSA 74 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 +RA +LTG+Y +G+ G GL T+ ++L GY T IGKWH+G+ + Sbjct: 75 SRAALLTGRYPSRNGVFHVYYPGASQGLKPSEITIAEVLKTAGYRTSIIGKWHLGDRNQF 134 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P N GFD + G +DM+ + E IK SK RGG+ Sbjct: 135 LPTNQGFDSYFGIPFSNDMWMSKDLALADDIKLFGGVTVEQIKSGEASKAVKGEKRGGKV 194 Query: 258 QAIADIT----PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 + D P + QR+ D +K + + K +P+F+Y H Y + K+A Sbjct: 195 PLMRDEEVVEYPVDQTYITQRYTDEALKIIKESEKKKQPYFIYLAYAMPHVPLYASPKFA 254 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFR 372 G S AR YGD + EM+ + K L+ +G NTL++FTSDNGP G P R Sbjct: 255 GKS-ARGPYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFTSDNGPWNLGERGGSALPLR 313 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 GAK ST+EGG RVP ++W G I S I D PT LA Sbjct: 314 GAKFSTYEGGHRVPCVMWWPGTIPAGTDSAEIATTLDFMPTFAKLANAQLP--------N 365 Query: 432 TFIDGVDQTSFFL-GTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 +DG + G G+S + +++ + A+R+ K + Sbjct: 366 RTLDGKNIAPMLRDGNKGKSPYEKFYFWSKNHIEALRIGNMKL---------------RM 410 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + + +FNL D ES ++ + + + Sbjct: 411 SWDKKNNVRKETELFNLEGDIAESHNLAPQMPEKVAAMTKML 452 >UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO Length = 492 Score = 455 bits (1170), Expect = e-126, Method: Composition-based stats. Identities = 134/489 (27%), Positives = 220/489 (44%), Gaps = 41/489 (8%) Query: 66 AQDKETQQKLAELEKKTG---KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVAS 122 Q+ ET +E G +KPN ++ DD+G+ D+ G T ++D +A+ Sbjct: 27 CQNTETSPGDSEGTAAAGGIPEKPNFIIVFADDLGYGDLSSFGHPTIH---TKNLDRMAA 83 Query: 123 QGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMY-----GQPGGLQGLTTLPQLL 176 +G T+ Y +P+RA +LTG+ + +G+ + G TL + L Sbjct: 84 EGQKWTNFYVAASVCTPSRAGLLTGRLPVRNGLTSNEIGVFFPDSHNGMPASEITLAEQL 143 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T +GKWH+G +E P N GFDD+ G +DM + +R Sbjct: 144 KKAGYATGMVGKWHLGHKEEYLPPNHGFDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERY 203 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 E +K + +V +RG E+ P + +R+ D VK++ + D+PFF+Y Sbjct: 204 ESLKTEEY---NVPLIRGTEEI----ERPVNQNTITKRYNDEAVKWIRE--HKDEPFFMY 254 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H + + ++ G+S AR YGD + E++ + + LE G +NT++VFTSD Sbjct: 255 LAHSLPHVPLFTSDEFRGTS-ARGLYGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSD 313 Query: 357 NGPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTAL 414 NGP G + R KG+TWEGG+R PT + GM+ + + DLF T Sbjct: 314 NGPWLPTGISGGSAGLLREGKGTTWEGGMREPTIFWAPGMLPAKVVMDMGSTLDLFNTFS 373 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 LAG P +P +DGVD + G + +S RK Y+ L AVR+ +K H Sbjct: 374 SLAGVP-------MPDDREMDGVDLSPILFG-DAESPRKEMFYYQGADLYAVRLGAYKAH 425 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 ++ Y ++ ++N+ DP E + +H + ++ + A Sbjct: 426 FYTKEAYV---------MGAERVEHNPPLLYNVEEDPSEKYDLSGKHPEVIEEIRRVVEA 476 Query: 535 YMEILKKYP 543 + + K P Sbjct: 477 HNANMVKAP 485 >UniRef50_Q7ULF9 Arylsulfatase n=4 Tax=Bacteria RepID=Q7ULF9_RHOBA Length = 538 Score = 455 bits (1170), Expect = e-126, Method: Composition-based stats. Identities = 173/496 (34%), Positives = 273/496 (55%), Gaps = 13/496 (2%) Query: 56 DNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTP 115 +N + QD+ + KLAE++ K GK+PN++ ++DD+G+ D G GGG A+G TP Sbjct: 46 ENHEAAIPLATQDQAAEDKLAEIKAKHGKRPNILWLVVDDMGYGDPGCYGGGAAIGAATP 105 Query: 116 DIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQ---PGGLQGLTTL 172 +ID +AS+GL LTS YSQ + +PTR+ ILTG+ + G+ P + G + +L Sbjct: 106 NIDRLASEGLRLTSCYSQQTCTPTRSAILTGRLPVRTGLTRPILAGDKLTRNPWEDEVSL 165 Query: 173 PQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 P+LL D GY T GKWH+GE +P ++GFD++ G+ ++ D P++ + Sbjct: 166 PKLLSDAGYYTLLTGKWHVGEPVGMRPHDIGFDEYYGYYPAQKEISQRFDERRFPDLVNN 225 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADI-TPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 P+R+ + + H +GG + + I + + M ++ D+ ++ + ++AK D+ Sbjct: 226 PERARAFEAIAPDNHLTHGFKGGRTEKLKQIQSTEDMGRAEKVLADFTIQRIKELAKEDQ 285 Query: 292 PFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 PFFL + H DN+PN S A+ Y + + E++ + L++ L NT + Sbjct: 286 PFFLEHCFMKVHCDNFPNPDLGPLSAAKYYYKEAVAEVDLHVGEIMAALKEADVLGNTFV 345 Query: 352 VFTSDNGPEAEVPPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADL 409 FTSDNGP+ + P G TPFRGAKG+T+EGGVRVP YWKG++ R+SDG+ DL DL Sbjct: 346 FFTSDNGPQMDGWPDAGYTPFRGAKGTTFEGGVRVPGIAYWKGVVSGGRQSDGLFDLLDL 405 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMD 469 F +L LA P + +P + D +DQTSF L +GQS R+A +++ +L + RM Sbjct: 406 FGVSLKLAEIPTSD----LPVDRYYDYIDQTSFLLQDDGQSKREAVYFWWGKELMSCRMH 461 Query: 470 EFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 E+K HV P + ++ V +FNLY DP+E +G R + Sbjct: 462 EYKVHVKAVLP---ESTHMHIDYSTLVDVGLAPWLFNLYIDPKEQLPVGHRRNAWLATVL 518 Query: 530 TEMHAYMEILKKYPPR 545 ++ A+ KKYP + Sbjct: 519 GKLKAHATTFKKYPAK 534 >UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteria RepID=A4CGL5_9FLAO Length = 526 Score = 452 bits (1163), Expect = e-125, Method: Composition-based stats. Identities = 137/492 (27%), Positives = 216/492 (43%), Gaps = 43/492 (8%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 ++ ++ +ET + + + PN+V+ DD G+ DVG G A PTP++ Sbjct: 46 LLAIILLGVSCRETVKSEFAAADRADRPPNIVIIFTDDQGYSDVGVYG---ARDIPTPNL 102 Query: 118 DAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQL 175 DA+A+ GL+LT+ Y+ QP S +RA +LTG Y GI M P GL TL +L Sbjct: 103 DAMAADGLLLTNFYAAQPVCSASRAGLLTGCYPNRVGIHNALMPNSPVGLNPAEETLAEL 162 Query: 176 LHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR 235 L QGY T GKWH+G++ + P GFD+F G +DM+ Sbjct: 163 LRQQGYRTGIFGKWHLGDHPDFLPTRHGFDEFFGIPYSNDMWP----------------- 205 Query: 236 SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 + L D + EQ+ + D T + L ++ + V F+++ ++PFFL Sbjct: 206 ---LHPLQGPVFDFGPLPLYEQERVVD-TLEDQRLLTRQITERSVDFINR--HKEEPFFL 259 Query: 296 YYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 Y H + + + G S R YGD ++E++ + LE NG D+T ++FTS Sbjct: 260 YVPHPQPHVPLFVSDAFRGKS-GRGLYGDVIMEIDWSVGQVLGALEDNGLTDDTWVIFTS 318 Query: 356 DNGPEAEVPPHGR--TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPT 412 DNGP H P R KG+ WEGGVR P + + G + K D + DL PT Sbjct: 319 DNGPWLAYGNHSGRAEPLREGKGTNWEGGVREPCIMKFPGRLPRGKVLDEPLMAIDLLPT 378 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEF 471 + G P IDG + G + + A +++ +L AVR ++ Sbjct: 379 IASVTGSPQP--------GREIDGKNAWGLLSGAEARGPQDAYYFYYRVNELQAVRDGDW 430 Query: 472 KYHVLIQQPYAYTQSGYQGGFTG--TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 K + Q G G + ++NL DP E++++ RH + + Sbjct: 431 KLVLPHNYRTMQGQEPGADGLPGAYDYVDVTAPELYNLREDPGETNNLAERHPEVLAAIS 490 Query: 530 TEMHAYMEILKK 541 + + L Sbjct: 491 RKADSMRRRLGD 502 >UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG5_9BACT Length = 462 Score = 449 bits (1156), Expect = e-124, Method: Composition-based stats. Identities = 134/478 (28%), Positives = 212/478 (44%), Gaps = 64/478 (13%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 K PN+V DD+G+ D+ G A TP ID++ QG+ T YS P Sbjct: 25 PSAASSAEKPPNIVFIFADDLGYNDLSSYG---ATDIATPAIDSLGEQGIRFTDFYSASP 81 Query: 135 SSSPTRATILTGQYSIH---HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 SP+RA +LTG+Y I G+ P + G TT+ +LL + GY T +GKWH+ Sbjct: 82 VCSPSRAALLTGRYPIRQGITGVFWPQSFD--GIDPAETTIAELLQENGYRTGLVGKWHL 139 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G +++ P GF + G +DM D V Sbjct: 140 GHHQKHLPLQNGFHSYFGIPYSNDM------------------------------DMVVY 169 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 +RG + ++ ++ Y +R+ + V+F+++ D+PFFLY H Y + Sbjct: 170 MRGNDVESY-EVDQHYT---TRRYTEEAVQFIEQ--NKDQPFFLYLAHSMPHVPIYASEN 223 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-- 369 + G+S R YGD + E++ A + TL+K+ +NTL+VFTSDNGP + G + Sbjct: 224 FVGTS-KRGLYGDVIQELDWSVAQILDTLDKHQLSENTLVVFTSDNGPWTALKHLGGSAA 282 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P R K T++GG+RVP V W I S + ++ D FPT +A Sbjct: 283 PLREGKMFTFDGGMRVPCLVRWPAQIPAGQTSHAMANMMDWFPTFSRIANLDT------- 335 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 PK+ IDG+D T G+ +++ + + +G L A R ++K + + G Sbjct: 336 PKSRSIDGLDITDVLTGSGPRADNEFFFFHGDGDLRAYRDGDWKLKLPYE--------GN 387 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 Q + +FNL DP E+ + +H +Q M ++ L + PP Sbjct: 388 QAARWRQAVAAHPILLFNLAEDPGETTDLAAQHPERLAAMQARMTDFLASLGELPPEK 445 >UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN Length = 499 Score = 449 bits (1155), Expect = e-124, Method: Composition-based stats. Identities = 136/498 (27%), Positives = 206/498 (41%), Gaps = 44/498 (8%) Query: 50 PATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVA 109 A T ++ + T++ A+ K ++PN+V+ DD+G+ D+G G A Sbjct: 18 AAMTFVAFVLATTFVISSTAATEESAADAASK--RRPNIVLIFCDDLGYADIGCFG---A 72 Query: 110 VGNPTPDIDAVASQGLILTSA-YSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGL-Q 167 G TP+++ +AS+G+ T + S +RA +LTG Y GIL G+ + Sbjct: 73 KGYETPNLNKLASEGMKFTDFQVAAAVCSASRAALLTGCYPQRVGILSALGPSDSIGIAK 132 Query: 168 GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 + +LL + GY T GKWH+G +++ PQ GF + G +DM+ Sbjct: 133 NELLISELLQNLGYKTACFGKWHLGHHEQFLPQQNGFATYFGLPYSNDMW---------- 182 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 P LP + +Q L + + VKF+ Sbjct: 183 --PKHPTAKNAYPPLPLIDGNKTIELNPDQ-----------TKLTTWYTEKAVKFIHDCG 229 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 +KPFFLY H + + K+AG + R +GD + E++ + K LE G +D Sbjct: 230 --EKPFFLYVPHNMPHVPLFVSEKFAGKT-KRGLFGDVIAEIDWSVGEITKALEATGNVD 286 Query: 348 NTLIVFTSDNGPEAEVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIV 404 NTL++FTSDNGP H + FR KG+ WEGG RVP + G IQP D + Sbjct: 287 NTLVIFTSDNGPWLSYGDHAGSTGGFREGKGTVWEGGHRVPMIAKYPGTIQPGTTCDKLA 346 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG-QSNRKAEHYFLNGKL 463 DLFPT G + + IDGV G +S+ + +Y+ L Sbjct: 347 STIDLFPTIAHYCGAT-------IDPSRKIDGVSIQPLLESVEGAKSSHEFFYYYWGNGL 399 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 AVR + FK H G G G ++F+L DP E +I H Sbjct: 400 EAVRDERFKLHFPHAFRSLTGTPGTDGMPNGYTQAKTELALFDLDADPFEQTNIAADHPE 459 Query: 524 MGVPLQTEMHAYMEILKK 541 + L + L Sbjct: 460 VTARLTAAAESMRSDLGD 477 >UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTW6_9SPHI Length = 486 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 135/471 (28%), Positives = 206/471 (43%), Gaps = 41/471 (8%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 + PNVV+F +DD+G+ D+ G A+ TP++D +A++G T+ + Q S Sbjct: 31 KPAPATPPNVVLFFMDDLGYGDLSVTG---ALDYTTPNLDKMAAEGTRFTNFLAAQAVCS 87 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 +RA +LTG Y G+ P GL TL +LL ++GY T GKWH+G+NK+ Sbjct: 88 ASRAALLTGCYPNRLGLYGALGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGDNKQ 147 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD++ G DM+ +H A P P G E Sbjct: 148 FLPMQQGFDEYYGVPYSHDMWP----LHPAQAQAKYPPLRWIDGNEP----------GPE 193 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + + D + + V F+ K KPFFLY H +A++ G S Sbjct: 194 IKDLND-----AGKITGTITEKAVSFIRNHKK--KPFFLYVPHPLPHVPLATSARFKGQS 246 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH--GRTPFRGA 374 AR +GD + E++ + L++ G NTL++F SDNGP H FR Sbjct: 247 -ARGIFGDVLTELDWSVGQIMNELKQQGLDKNTLVIFISDNGPWLNYGDHAGSSGGFREG 305 Query: 375 KGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 KG+++EGG RVP V W G++ R S+ ++ D+ PT ++ G +PK Sbjct: 306 KGTSFEGGHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCGA-------RLPKQR- 357 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 IDGVD + G N + R +Y+ L AVR ++K QGG Sbjct: 358 IDGVDWVALLKGDNSVTPRDKFYYYYRKNSLEAVRQGDWKLVFAHPGRTYEGFLPGQGGK 417 Query: 493 TGTVMQTAGSS--VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 G +T + +++L DP E + +H + L+T L Sbjct: 418 PGPSTETHAIAAGLYDLRRDPGERYDVREQHPEVVARLETIAEEARADLGD 468 >UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A2W0_9PLAN Length = 477 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 123/465 (26%), Positives = 201/465 (43%), Gaps = 55/465 (11%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 + ++ KPN V+ +DD+G+ D+ G V N TP+++A+A +G+ LT Y+ P Sbjct: 21 SSCAQEVATKPNFVIINIDDLGYADIEPFGSEV---NRTPNLNAMADEGMKLTCFYAAPV 77 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQG----LTTLPQLLHDQGYVTQAIGKWHM 191 SP+RA ++TG Y L P PG +G T+ +L+ +QGY T IGKWH+ Sbjct: 78 CSPSRAALMTGCYPKRA--LTIPHVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHL 135 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G+ + P GFD + G +DM V N + + + LP +++ Sbjct: 136 GDQPDFLPTRQGFDYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVL 195 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 R + K +L + + ++F+ +KPFFLY HF YP Sbjct: 196 QR---------VLAKDQTELVTNYTEEAIQFIRD--HQEKPFFLYLPHSAVHFPMYPGDA 244 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 + G + + Y D + E++ + + L+ G TL++FTSDNG + P Sbjct: 245 FRGKN-SHGLYNDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFGAV-NKPL 302 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 R K +T+EGG+RVPT V W G + SD +V + D+ PT + LAG P Sbjct: 303 RAGKATTYEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAG-------GTTPT 355 Query: 431 TTFIDGVDQTSFFLG-TNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 IDG D G +S +++ L AVR +K + Sbjct: 356 DRKIDGADIGPILAGVKEAKSPHDVFYFYRGYDLEAVRSGPWKLRL-------------- 401 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 +++NL+ D E+ ++ + + L+ Sbjct: 402 ----------KEGALYNLHEDISEAKNVAPDNADVVERLRKIAAE 436 >UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968C90 Length = 525 Score = 447 bits (1149), Expect = e-124, Method: Composition-based stats. Identities = 133/475 (28%), Positives = 212/475 (44%), Gaps = 60/475 (12%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 E +KPN V +DD+G+ DV G TP+IDA+A++G+ T Y+ Sbjct: 64 SCTEATPTKSEKPNFVFIYMDDMGYSDVSCYGE---TRWTTPNIDALAAEGIKFTDCYAA 120 Query: 134 -PSSSPTRATILTGQYSIHHGILMPPMYGQ-PGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 P SSP+RA LTG+Y GI G T+ ++L QGY T IGKWH+ Sbjct: 121 SPISSPSRAGFLTGRYPARMGIQGVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHL 180 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G ++ P GFD++ G +DM Q+ ++V Sbjct: 181 GSREKYLPLQQGFDEYFGIPYSNDM----------------------SAQVYLRGNEVEE 218 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + ++ +++ + V ++ + K+D+PFFL+ H Y + + Sbjct: 219 FHID------------INNVTKKYTEEAVDYIRR--KADQPFFLFLAHSMMHVPIYVSDE 264 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-- 369 +AG S A YGD ++E++ + +TL + G DNTL+VFTSDNGP + P G Sbjct: 265 FAGKSGA-GIYGDAVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNGPWLQEGPLGGRAL 323 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 P R K + +EGGVRVP YWKG I+P + +V L D FPT L+G ++P Sbjct: 324 PLREGKTTAFEGGVRVPCIAYWKGQIKPVVNTDVVSLLDWFPTVTALSG-------GILP 376 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 +DG D T+ GT +++ ++ N + R ++K + G + Sbjct: 377 DVR-LDGYDLTAVLNGTGKRASEDYAYFRNNRDITDYRSGDWKI--------SLPAPGIK 427 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 G F + +FNL D E ++ ++ + ++ Y + PP Sbjct: 428 GNFWRASTAEHDTLLFNLREDIGERYNLYRKYPGKAKEMLQKLQEYTRNFGEIPP 482 >UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01N83_SOLUE Length = 461 Score = 445 bits (1146), Expect = e-123, Method: Composition-based stats. Identities = 124/464 (26%), Positives = 190/464 (40%), Gaps = 60/464 (12%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRA 141 ++PN+VV L DD+G+ D+G G +A TP+ID +A +G TS YS P SP+RA Sbjct: 25 QRQPNIVVILADDLGYGDLGCYGSPIA----TPNIDRLAEEGARFTSFYSASPVCSPSRA 80 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 ++TG+Y + + G G T+ Q+L GY T IGKWH+G P N Sbjct: 81 ALMTGRYPTRVEVPVVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGSTPGYLPTN 140 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD+F G +D+ +RG A A Sbjct: 141 RGFDEFFGVPYSADITP------------------------------CPLMRGSSVVAPA 170 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTS 321 L + + F+ + D PFFLY H + ++AG S Sbjct: 171 VD----CSTLTSSFTQEALDFMRRA--QDNPFFLYLAHTAPHLPLAASPRFAGQS-GLGM 223 Query: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEG 381 Y D + E++ + L+ G NTL++F+SDNGP + + RG KG T+EG Sbjct: 224 YADVVQELDWSTGQVMAALKATGLDSNTLVMFSSDNGPWYQ---GSQGKLRGRKGETYEG 280 Query: 382 GVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G+R P + G+I G+ DL PT LAG + +DGVD Sbjct: 281 GMREPFLARYPGVIPSGIGCAGLATTMDLLPTLARLAGAQTP--------SNPLDGVDIW 332 Query: 441 SFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 G + +R YF L R+ +K H+ A++ G + Sbjct: 333 PVLTGERAEVDRDVFLYFDAVYLQCARLGRWKLHLSRYNTKAWSPLPPGGRVN---LPLP 389 Query: 501 GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 ++++ +DPQES H + ++ + ++ PP Sbjct: 390 RPELYDVVSDPQESYDCAASHPAIVADIRARVERMVQTF---PP 430 >UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED2_PARD8 Length = 468 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 121/482 (25%), Positives = 202/482 (41%), Gaps = 72/482 (14%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 A + + ++PNV++ +DD G+ D+G G + TP ID +A +G+ LT Y Sbjct: 16 AAVATQAAERPNVIIVFIDDFGYGDLGCYGS---TKHRTPHIDQMAKEGIRLTDFYVGSS 72 Query: 135 SSSPTRATILTGQYSIHHGILMPPMY--------------GQPGGLQGLTTLPQLLHDQG 180 S+P+R+ +LTG Y + + G G T+ +L+ +QG Sbjct: 73 VSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGEITIAELMKEQG 132 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Y T IGKWH+G+ P GFD + G +DM Sbjct: 133 YATACIGKWHLGDQLPFLPTRQGFDYYYGIPYSNDM------------------------ 168 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 D + +Q + P + L R+ + V+F+ + + PFF+Y Sbjct: 169 ------DRPYCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIK--SHKESPFFIYLCHN 220 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 H + + G S YGD E++ L +TL++ G NTLI+FTSDNG + Sbjct: 221 MTHNPLAASPAFKGKSQ-NGLYGDATEELDWSMGVLLETLKEEGLDQNTLIIFTSDNGAD 279 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGH 419 P RG KG+T+EGG RVP + W I + +D +V D PT Sbjct: 280 EHFGGT-NRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYC-- 336 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 + VP IDG + + G + S + +Y+ +L AVR +KYH+ +++ Sbjct: 337 -----SYAVPSDRVIDGHNVSGILEGESMASPTETFYYYQKQQLQAVRWGNWKYHLPLKE 391 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + + ++NL D E+ ++ +H + T+M+ ++E + Sbjct: 392 RIKGPH--------FPDTEVGEARLYNLANDLSETTNVIDKHPEVV----TKMNQWIEQV 439 Query: 540 KK 541 + Sbjct: 440 RS 441 >UniRef50_A1WGP9 Sulfatase n=6 Tax=Proteobacteria RepID=A1WGP9_VEREI Length = 470 Score = 443 bits (1140), Expect = e-123, Method: Composition-based stats. Identities = 137/475 (28%), Positives = 223/475 (46%), Gaps = 35/475 (7%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 + + PN+V+ + D++GW + G GGG G PTP+IDA+A+QGL L + Sbjct: 20 HPHFCKAATMSVSTPNIVLIVADNLGWGEPGCYGGGALRGAPTPNIDALATQGLRLQNFN 79 Query: 132 SQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWH 190 + PTR+ ++TG++ I G L G P GL + TL QLL QGY + GKWH Sbjct: 80 VESDCVPTRSALMTGRHPIRTGCLQSVPPGLPQGLTRREITLAQLLSAQGYASAHYGKWH 139 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G+ P + GFD++ G +D V +P V LP+ + Sbjct: 140 LGDVPGRLPSDRGFDEWYGIARTTDESQFTSTVGFDPAVV----------DLPWI---MR 186 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 G + + +D ++ + F+ + A + +PFFLY HF P+ Sbjct: 187 GRSGQPSENLKVYDLDSRRQIDAELVEQSIAFMRRNASTGRPFFLYLPLIHLHFPTLPHP 246 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT- 369 +AG + A + D MVE++ + + L++ G +N++++F SDNGPE VP G Sbjct: 247 DFAGRTGA-GDFADSMVELDHRVGQVVRALDELGAAENSVLIFCSDNGPEFRVPYRGTAG 305 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P+ G + EG +RVP V W G I R S+ IV + DLF T +AG + Sbjct: 306 PWSGTYHTAMEGSLRVPCIVRWPGHISAARVSNEIVHVTDLFTTLAGVAGA-------RI 358 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 P+ IDGVDQ FFLG S R+ +++ +L AV+ ++K H + Sbjct: 359 PQDRPIDGVDQLPFFLGRQSASAREGFPFYIKEELRAVKWRDWKLHFY-----------W 407 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 + + + +FN+ DP+E + + + P+ + A+ + ++P Sbjct: 408 EPVVNESKGKLESPYLFNITRDPKEQMDVMAYNTWVRAPMLKLVKAFQDSFVQHP 462 >UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS23_9BACT Length = 635 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 149/477 (31%), Positives = 217/477 (45%), Gaps = 40/477 (8%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 T +KPN++ L DD+G+ D+G G + N TP++D +A +G+ LTS Y+ P +P+ Sbjct: 19 AATSQKPNIIFILADDMGYGDIGPFGSTL---NRTPNLDRMAKEGMKLTSFYAAPLCTPS 75 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 RA ILTG Y+ + GL T+ +LL QGY T AIGKWH+G+ E+ Sbjct: 76 RAQILTGCYAKRVSLPKVLSPRSEVGLNTNEQTVAKLLKRQGYATMAIGKWHVGDAPENL 135 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P GFD + G +DM E P + LP +D +Q Sbjct: 136 PTRHGFDHYLGLPYSNDMGGE-------EPGKDQPAKRGARPPLPLVRD---------EQ 179 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 I + P + L +R+ D VKF+ A +PFFLY H +P + G S Sbjct: 180 VIEVVKPADQDRLTERYTDEAVKFIR--ANDKQPFFLYLAHTAVHAPIHPGHNFRGKSR- 236 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFRGAKG 376 YGD + E++ + TL + G +NTL++F+SDNGP +G T P RG KG Sbjct: 237 NGLYGDWVEEVDWSVGKVLDTLRELGLSENTLVLFSSDNGPWLAQKTNGGTAGPLRGGKG 296 Query: 377 STWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 T+EGG+R PT +W G + + D + DL PT + LAG +PK ID Sbjct: 297 GTFEGGMREPTLAWWPGKVPAQSVCDTVAGNIDLLPTFVKLAG-------GTLPKDKKID 349 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 G D ++ LG ++ R+A +YF L AVR +K ++ Q Y G Sbjct: 350 GRDISNLLLGQTKEAQREAHYYFAGTALQAVRSGPWKLAIVPQ----YEGMGKFSENAVE 405 Query: 496 VMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL---KKYPPRAQIK 549 + ++NL D E + H L + A L KK P + Sbjct: 406 GGKPFAPRLYNLDEDIGEKTDVVAEHPDEMKRLLGYVEAMEADLGVSKKNGPGVRPP 462 >UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D764_9BACT Length = 499 Score = 439 bits (1130), Expect = e-121, Method: Composition-based stats. Identities = 137/469 (29%), Positives = 206/469 (43%), Gaps = 35/469 (7%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KPN ++ +DD+G+ D+ G + N TP++D +A +G LT Y P SP+R+ + Sbjct: 22 DKPNFIIINIDDMGYADIAPFGSKL---NRTPNLDRMAQEGRKLTCFYGAPVCSPSRSAL 78 Query: 144 LTGQYSIHHGILMPPMYGQPGGLQGLT----TLPQLLHDQGYVTQAIGKWHMGENKESQP 199 +TG Y +L P PG GL T+ +LL GY T IGKWH+G+ E P Sbjct: 79 MTGCYPKR--VLPIPSVLFPGAAVGLNPAEHTVAELLKKSGYATGCIGKWHLGDQPEFLP 136 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVN-----PEVALSPDRSEYIKQLPFSKDDVHAVRG 254 GFD + G +DM + P+ +P+ S I + + + Sbjct: 137 PRRGFDYYLGLPYSNDMGPGEDGSKSSLGDPIPKPKATPNPSAPIPETGITGNQPPLPML 196 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 ++ IA + + L R+ VKF+ + DKPFFLY HF YP ++AG Sbjct: 197 ENEKVIARVRQDEQQGLVDRYTKAAVKFITE--HKDKPFFLYLPHNAVHFPIYPGKEWAG 254 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGA 374 SP Y D + +++ + TL + D+T ++FTSDNG P P RG Sbjct: 255 KSP-NGYYSDWVEQVDWSVGQVLNTLRELKLQDHTFVLFTSDNGG---TPRAVNAPLRGF 310 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 K +TWEGG+R PT +W G I SD I + D+ PT ++LAG VP Sbjct: 311 KTTTWEGGMREPTIAWWPGKIPGGTSSDEITGMFDILPTLVNLAG-------GEVPTDHK 363 Query: 434 IDGVDQTSFFLGTNG-QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 IDG + G G +S + +YF +L VR +K G Sbjct: 364 IDGGNIWPVLAGEAGAKSPHEVFYYFNGLRLEGVRTGPWKLRF------GSAGLAEGKGP 417 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++NL TD E+ ++ H + L+ A + L + Sbjct: 418 VKKPAAPIPDQLYNLQTDIGETTNVADAHPDVVAHLRELADAMKDDLGR 466 >UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM Length = 479 Score = 439 bits (1130), Expect = e-121, Method: Composition-based stats. Identities = 120/472 (25%), Positives = 205/472 (43%), Gaps = 45/472 (9%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 + + + PNV++ DD+G+ D+G G P++D +A++G+ T+ Y+ Sbjct: 29 AVANPSHQSPNVIIIFADDMGYGDIGAYGHPTIRS---PNLDQMAAEGIKWTNFYAASSV 85 Query: 136 SSPTRATILTGQYSIHHGILMPPMY-----GQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +P+RA +LTG+ + G+ + G T+ + L ++ Y T +GKWH Sbjct: 86 CTPSRAGLLTGRLPVRSGMAHDQIRVLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWH 145 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G QP + GFD++ G +D YI+ + +KD Sbjct: 146 LGHLPGFQPLDHGFDEYFGIPYSNDH--------------DLKKELSYIQTITHAKDGDF 191 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 V + ++I + P + +R+ V F+ K S++PFFLY H + + Sbjct: 192 NVPLMQNRSIIE-RPANQNTITKRYTQEAVSFIKK--NSNQPFFLYLAHSMPHVPLFASD 248 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP 370 ++ GSS R YGD + E++ + TL + G +NTL+VFTSDNGP + HG + Sbjct: 249 QFRGSSD-RGLYGDVIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNGPWLIMGAHGGSA 307 Query: 371 --FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 + KG+++EGG+R P +W I+P + DLFPT + +AG + Sbjct: 308 GLLKSGKGTSYEGGMREPAIFWWPEKIKPAVAHNTASTLDLFPTIMSIAGID-------M 360 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 P DG D + + RK Y+ K+ AVR ++K H + Sbjct: 361 PSDRSYDGYDLSPTMF-EQKSNERKNIFYYHGDKIFAVRQGDWKVHFKTVANIYTKEQ-- 417 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 ++ VFNL DP E +G + + + + +K Sbjct: 418 ------KILTHTPPQVFNLLVDPSERFDVGAVNPAIIASAAKLIEQHQLSVK 463 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 437 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 124/503 (24%), Positives = 196/503 (38%), Gaps = 82/503 (16%) Query: 65 PAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQG 124 P+ + + KPN +V L DD G+ D+ G TP ID +A++G Sbjct: 550 PSSPTASVSPAGREKTAETTKPNFIVILTDDQGYGDLSCFGAKHV---DTPRIDQMAAEG 606 Query: 125 LILTSAY-SQPSSSPTRATILTGQYSIHHGILMPPMYG-----QPGGL-QGLTTLPQLLH 177 LTS Y + P +P+RA ++TG Y + M +G P GL T+ ++L Sbjct: 607 SRLTSFYVAAPVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLK 666 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSE 237 GY T GKWH+G+ E P GFD+F G D++ + Sbjct: 667 TAGYRTGMFGKWHLGDQPEFLPTKQGFDEFFGIPYSHDIHPFHPRQN-----------HY 715 Query: 238 YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 + LP ++D + ++ P + L +R + V F+++ D+PFFLY Sbjct: 716 HFPPLPLLQNDT----------VIEMDPD-ADFLTKRLTEQAVSFIER--NKDQPFFLYL 762 Query: 298 GTRGCHFDNYPNAKYAG-----------SSPARTSYGD-------CMVEMNDVFANLYKT 339 H + + + Y + E++ + Sbjct: 763 PHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYATRANLFRQAIAEIDWSVGQILDA 822 Query: 340 LEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR- 398 L NG + T+++FTSDNGP RG KG+T+EGG+R PT V W G I Sbjct: 823 LRSNGLDEKTMVLFTSDNGPPKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQIPAGH 882 Query: 399 KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 ++D ++ DL PT LAG +P IDG D G Q+ A Y Sbjct: 883 QNDELMTAMDLLPTFAKLAGA-------AIPTDRVIDGKDIWPTLKGET-QTPHDAFFYH 934 Query: 459 LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 +LAAVR ++K H V +++L D E ++ Sbjct: 935 RGNQLAAVRSGKWKLH---------------------VNNGVAKQLYDLENDLGEKVNVI 973 Query: 519 VRHIPMGVPLQTEMHAYMEILKK 541 + + LQ ++ + + Sbjct: 974 ETNPEVVKKLQHQLKDFAADIAS 996 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 116/504 (23%), Positives = 195/504 (38%), Gaps = 61/504 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPT 139 + PNVV+ +DD+G+ D+G G A TP+ID +A++G T A+S +P+ Sbjct: 35 AAERPPNVVLIFVDDLGYGDLGCYG---ATKLSTPNIDRLAAEGRRFTDAHSASAVCTPS 91 Query: 140 RATILTGQYSIHH----GILMP-PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 R +LTGQY + GI P P T+ ++ ++GY T +GKWH+G Sbjct: 92 RYGLLTGQYPVRAMGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFK 151 Query: 195 KES---------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIK 240 +E PQ+VGFD + G V+ + +P L Sbjct: 152 EEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSP 211 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 F ++ A+ E + VK++ + K ++PFFLY+ T Sbjct: 212 TPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFLYFATP 269 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 H P ++ G+S YGD + E++ + + ++LE NG DNTL++FTSDNG Sbjct: 270 NIHHPFTPAPRFKGTSQC-GLYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAM 328 Query: 361 A--------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFP 411 + G K WEGG RVP W G I+ +SD ++ DLF Sbjct: 329 LNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQVDLFA 388 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR-KAEHYFLNGKLAAVRMDE 470 T L +P + D ++ L + R + + A+R + Sbjct: 389 TFSALT-------EQEMPSSEQKDSINMLPALLDDPNEPLRTELVLAPRQPRNLAIRKGK 441 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTV------------------MQTAGSSVFNLYTDPQ 512 + Y + G + +++L D Sbjct: 442 WLYIGARGSGGFNGSKPQHHAWGGPAAVQFSGQKNSDIVNGRIKKNAPPAQLYDLENDRS 501 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYM 536 ++ ++ H + ++ + +Y Sbjct: 502 QTTNVFREHPEVVEEMKAMLESYR 525 >UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LDP6_PARD8 Length = 452 Score = 436 bits (1122), Expect = e-121, Method: Composition-based stats. Identities = 127/468 (27%), Positives = 202/468 (43%), Gaps = 53/468 (11%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPT 139 KPN++V DD+G+ D+ G TP+ID +A +G +S Y SSP+ Sbjct: 21 SQPTKPNIIVINCDDMGYGDLSCFGSPTI---KTPNIDRMAIEGQKWSSFYVSASVSSPS 77 Query: 140 RATILTGQYSIHHGILMPPMY-----GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 RA +LTG+ + G+ + G T+ +LL GY T IGKWH+G Sbjct: 78 RAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELTIAELLKQAGYHTACIGKWHLGHL 137 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E P GFD F G+ +DM R E IK L +K + Sbjct: 138 PEYMPLRHGFDYFYGYPYSNDM-----------------SRKEQIK-LGNTKYPYEYIIY 179 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 +++ + +Y +L Q+ + ++++ + + PFFLY H Y + + G Sbjct: 180 EQEKELEREPQQY--NLTQQVTEAAIRYIK--SNENSPFFLYLAHPMPHMPVYASTDFQG 235 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFR 372 S AR YGD + E++ + +TL+ G NTL++FTSDNGP G + P + Sbjct: 236 KS-ARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNGPWLLCKQEGGSPGPLK 294 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K S +EGG RVP + W M++P + DL PT ++AG P +P Sbjct: 295 DGKASMFEGGFRVPCIM-WGAMVKPGYITDMASTLDLLPTFCEIAGIP-------LPSDR 346 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 DG+ + + R +++ +L A+R ++K H + Y Sbjct: 347 HYDGISLLNVLKDKS-TCKRDVFYFYRGSELYAIRKGKYKAHFSYRPAYG---------- 395 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 T + +++L TDP E +I H + L +A+ LK Sbjct: 396 TTDKIIYDKPVLYDLGTDPGELYNIAEEHPDIVQELTMLANAHKASLK 443 >UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TQQ7_9BACE Length = 853 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. Identities = 133/491 (27%), Positives = 206/491 (41%), Gaps = 62/491 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT- 139 + +KPNVV+ DD G+ D+G G + TP ID +A +G+ LT Y Sbjct: 18 QARQKPNVVIIFTDDQGYQDLGCYGSPLI---QTPFIDRMAKEGIKLTDFY--------V 66 Query: 140 --------RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 RA +LTG+ + +G+ G TL + L +QGY T GKWH+ Sbjct: 67 SSSVSSASRAGLLTGRLNTRNGVKGVFFPESEGMPSEEITLAEALKEQGYTTGCFGKWHL 126 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTE----------WRDVHVNPEVALSPDRSEYIKQ 241 G+ K P + GFD + G +DMY +R+ + + + + Sbjct: 127 GDLKGHLPTDQGFDYYYGIPYSNDMYIGPSQQFASNVTFREGYNLSKAKEDQEFVRTSSR 186 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 K +A E I + P +R+ D+ + F++ ++PFF+Y Sbjct: 187 ADIKKRLNNASPLFEGDKIIEY-PCDQSTTTRRYFDHAIDFIEN--NPEQPFFVYITPSM 243 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + ++ G S R YGD + E++ L L+K +NTL++F SDNGP Sbjct: 244 PHVPLFASEQFKGKS-KRGLYGDVVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNGPWL 302 Query: 362 EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 G + P RG K S +EGGVRVP + WKG I SD IV DLFPT + AG Sbjct: 303 SFKEDGGSAEPLRGGKFSYYEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 K IDG++ +SF + + R Y G++ +R ++ Y Sbjct: 363 CQSFK--------QKIDGINISSFLKNPSLRL-RDEYVYVKGGEVHGIRKGDWVYLPKTG 413 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + +FNL D ES+++ +++ LQ M Y Sbjct: 414 N--------------SKFKKGDVPELFNLKQDIGESNNLHLQYPNKVKELQEVMKKYQST 459 Query: 539 LKKYPPRAQIK 549 P +QI+ Sbjct: 460 --STMPYSQIR 468 >UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R206_9PLAN Length = 504 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 128/471 (27%), Positives = 198/471 (42%), Gaps = 62/471 (13%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 +PNV++ +DD+G+ D+G G NPTP + +A++G+ LTS Y+ P SP+R Sbjct: 28 AEESRPNVIIINIDDLGYADIGPFGSK---KNPTPALTKMAAEGMKLTSHYAAPVCSPSR 84 Query: 141 ATILTGQYSIHHGILMPPMYGQP----GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 A +LTG Y +L P P G T+ +L GY T +GKWH+G+ E Sbjct: 85 AALLTGCYPKR--VLSIPHVLFPSAGSGLHPDEVTIADMLKASGYKTACLGKWHVGDQAE 142 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP------------- 243 P GFD + G +DM T N L ++ + P Sbjct: 143 FLPTKQGFDSYYGIPYSNDMGTATDGSKSNFGAPLPMPGAKGKGKQPAQATGELPLGSPT 202 Query: 244 --FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 + +A + + +L + + V F+ + D+PFFLY+ Sbjct: 203 GLTGNMQPPLPLLENDKVVARVRGEDQVNLTRDYTKRAVNFIRE--NKDQPFFLYFAHTA 260 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 HF YP+ ++ S R + D + E++ + L + + TL++FTSDNG Sbjct: 261 VHFPMYPSKEFRTSD--RGTLDDWVDEVDASVGEVLAALAEMKIDEKTLVIFTSDNGGSL 318 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHP 420 TP +G+KG TWEGG+RVPT W G I+ S I + DL PT G Sbjct: 319 PHGSD-NTPLKGSKGLTWEGGIRVPTIARWPGTIKGGTSTSAITGMIDLLPTIAAATGAK 377 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + +DG++Q GT +S R+ YF +L AVR D +K H+ Sbjct: 378 LPE--------RKLDGLNQLPLLNGTAKESPRREFFYFRGLELDAVRRDNWKLHL----- 424 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 A +++L +D ES ++ H + L Sbjct: 425 -------------------AKGELYDLESDIGESKNVAADHPEIVKSLTEL 456 >UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT Length = 472 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. Identities = 128/474 (27%), Positives = 205/474 (43%), Gaps = 49/474 (10%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRATI 143 KPN ++ DD G+ D+ G TP ID +A++G+ + Y S S +RA + Sbjct: 21 KPNFIIIFTDDQGYGDLSCFNPQ---GVQTPHIDQMATEGMKFNNFYVSAAVCSASRAAL 77 Query: 144 LTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 LTG Y+ GI G GL T+ +LL +Q Y T GKWH+G+ P Sbjct: 78 LTGTYNDRIGIKSAFFPGTKQGLHPDEITIAELLKEQNYATACFGKWHLGDEPSLLPSAQ 137 Query: 203 GFDDFRGFNSVSDMY-----TEWRDVHVNPEVALSPDR-------SEYIKQLPFSKDDVH 250 GFD + G +DM+ T + N + L + K+ P K + Sbjct: 138 GFDTYFGIPYSNDMFIAPHQTFAENAKFNGDWTLEKAKELQKFIAPHVNKRGPIWKSEYK 197 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 A+ + P L QR+ D +KF+DK +KPFF++ H + + Sbjct: 198 ALVPILEGEQIVEFPADQASLTQRYFDRTIKFIDK--NQNKPFFIFLTPAMPHVPLFASK 255 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT- 369 ++ G S + YGD + E++ L K L++ NTL++FTSDNGP G + Sbjct: 256 EFRGKS-KKGLYGDVIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNGPWLSYGDEGGSS 314 Query: 370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANL 427 P R K +++EGGVR+PT + G+I+ + + DL PT L V Sbjct: 315 GPLRDGKFTSYEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTFAQL-------VNTQ 367 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 VP+ IDG D + N +R + AVR ++K V Sbjct: 368 VPQDRKIDGKDISPLLKSQNHVIHRHLFF-----RDEAVRSGDWKLVVKEHH-------- 414 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 T+ + +++NL D ES+++ H + LQ+++ +++ L + Sbjct: 415 ------MTMRKGPLPALYNLKNDVAESNNLIDTHPKVAQYLQSKLDEHLKDLNE 462 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 430 bits (1107), Expect = e-119, Method: Composition-based stats. Identities = 131/476 (27%), Positives = 202/476 (42%), Gaps = 62/476 (13%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSS 137 + +PN++V + DD+G+ D+G GG PTP +D +A+ G+ T+AY S P S Sbjct: 31 QTSKSGRPNILVIMADDLGYADLGVQGG---CEIPTPHLDQLAASGIRCTNAYVSAPYCS 87 Query: 138 PTRATILTGQYSIHHGILMPPMYGQP---GGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 P+RA LTG+Y G P G+ G T+ LL +GY T IGKWH G + Sbjct: 88 PSRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGFS 147 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 K+ PQ+ GFD+F GF Y ++V A S D RG Sbjct: 148 KDHHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHD---------------MIYRG 192 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY--PNAKY 312 E + + + ++++ +KP+FLY H P+ + Sbjct: 193 REVEPQEGYA-------TDLFTNEALRWM--SGPPNKPWFLYLSYNAVHTPLEIAPHLQK 243 Query: 313 AG----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP---- 364 PAR Y + ++D + + L ++G + TLI+F SDNG P Sbjct: 244 RIPESVKLPARRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAY 303 Query: 365 -PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGA 422 P RG KG T EGG+RVP FV W G + R + + DL PT LA + A Sbjct: 304 NSGLNHPLRGDKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLAANNPA 363 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYA 482 K P IDGV+ ++LG + ++ ++ G AVR +K P + Sbjct: 364 KPQ---PLPQGIDGVNLMPYWLGQRSGAPHESL-FWRFGPQKAVRAGNWKLVDWRDFPAS 419 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + +G +++L TD E +++ H + L+T + + Sbjct: 420 ---------------KNSGWELYDLSTDISEKNNLAETHPEIVARLKTSWEKWNQS 460 >UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA Length = 489 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 128/473 (27%), Positives = 199/473 (42%), Gaps = 39/473 (8%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 A T +KPNV+V DD G+ D+G G TP++D +AS+G TS YS Sbjct: 35 SSAAESTDTTEKPNVIVIFTDDQGYNDLGCYGSP---NIKTPNLDRLASEGRRYTSFYSA 91 Query: 134 -PSSSPTRATILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWH 190 SP+RA +LTG Y G+ ++ Q G T+ L GY T +GKWH Sbjct: 92 CSVCSPSRAALLTGCYPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWH 151 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G +KE+ P + GFD + G +DM H + + + + + Sbjct: 152 LGHHKETLPTSNGFDSYYGIPYSNDM------NHPDNKRLGKMSSDDRWTDQSSAVTLWN 205 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 +++ I P + +R+ D ++F++ A DKPFFLY H Y Sbjct: 206 TPLVQDEEIIE--LPVDQRTVTRRYTDRAIEFVE--ANQDKPFFLYLPHSMPHIPLYVPE 261 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT- 369 P + +Y + ++ L +T+ G + TLIV+TSDNGP + HG + Sbjct: 262 DVYDPDP-QNAYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNGPWLQFKNHGGSA 320 Query: 370 -PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANL 427 P R KG+T+EGG RVP ++ G I S+ DL PT G Sbjct: 321 GPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASFTGV-------A 373 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAVRMDEFKYHVLIQQPYAYTQS 486 + IDG+D TS F + +S R ++ +G L +RM ++KY Q Sbjct: 374 LENDRKIDGIDLTSTFT--SDESARDEFVFYSAHGVLEGIRMGDWKY---------LRQV 422 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 +G +F+L D E +++ + + M E + Sbjct: 423 ARRGPNAKGPKPEPKVFLFDLSQDIGEKNNLVEQQPERVQKMHARMEELNEEI 475 >UniRef50_B8KM61 Steryl-sulfatase n=2 Tax=gamma proteobacterium NOR5-3 RepID=B8KM61_9GAMM Length = 500 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 152/477 (31%), Positives = 234/477 (49%), Gaps = 29/477 (6%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGG-VAVGNPTPDIDAVASQGLILTSAYSQPSS 136 KPNVV+ L D++G+ D+G G G G PTP ID +AS+G++LT + +P Sbjct: 30 TPAIAADKPNVVLMLSDNMGYGDLGVYGSGGELRGMPTPRIDQLASEGMMLTQFFVEPGC 89 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENK 195 +PTRA +LTG+YS G+ + G P LQ TL +L QGY T GKWH+G K Sbjct: 90 TPTRAALLTGRYSQRAGLGSIIIAGTPSTLQDSEVTLAELFKSQGYATAMTGKWHLGGEK 149 Query: 196 ESQPQNVGFDDFR-GFNSVSD--MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 +S P N GFD++ G +D +Y + E A++ ++ + P KD V V Sbjct: 150 QSLPINQGFDEWHVGILQTTDGVLYPDGMRRSGFSEAAIAKSQTAIWESEP-GKDVVKKV 208 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 R + + Y ++ + VK++ + AK +PFFLY G H+ P+ + Sbjct: 209 RPYDLE--------YRRHIEGDIAEASVKYIKEQAKEKEPFFLYVGWSHVHYPALPHPDF 260 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH------ 366 G S A +GD ++E++ + +++ G DNT++++ SDNGP + Sbjct: 261 EGKSSA-GLFGDAVMELDYRTGQVLDAIKEAGIEDNTIVIWLSDNGPATTQGSNNDFLGS 319 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 PFRG G EG +RVP + W I+P KS+ +V + D +PT ++ G Sbjct: 320 SAGPFRGEVGDALEGSLRVPGMIKWPAKIKPAKSNEMVAIHDFYPTLANIIGA------- 372 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQS 486 VP IDGVDQ FFLG N QS R++ F+ G++AAVR +++ + Q + Sbjct: 373 KVPTDRAIDGVDQGDFFLGKNKQSARESLITFMEGEVAAVRWKQWRIY-PKQFVASEGNP 431 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 G SV+N+ DP+E + + P + Y + L+KYP Sbjct: 432 SLMGVGAYRAEGMGYPSVYNIARDPREQWNQTAVSAFVLGPYMQIVGEYQKSLEKYP 488 >UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG4_9BACT Length = 489 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. Identities = 136/470 (28%), Positives = 208/470 (44%), Gaps = 62/470 (13%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + +KPN++ +L DD+G+ D+G G G TP ID +A +G +S Y SP+R Sbjct: 25 QAQQKPNILFYLTDDLGYGDIGCYGAE---GQYTPAIDQLAKEGTKFSSFYVHQRCSPSR 81 Query: 141 ATILTGQYSIHHG---ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 A +TG Y+ G ++ G G TLP+L+ GY T +GKWH+GE K Sbjct: 82 AAFMTGSYAHRVGLPQVIYKHREGPIGLNPSEITLPELMKTAGYNTALVGKWHLGEWKPF 141 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P N G+D F GF V E + P E K+L Sbjct: 142 HPLNHGYDYFYGFLKVI-------------EGSEKPSLIENRKELAS------------- 175 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 + E + + F+ K K+ PFFL Y H +P+ ++ G+S Sbjct: 176 ------KIQKTEGQAPGMVKAAINFMTKHKKN--PFFLVYSDPMPHAPYFPSEQFKGTS- 226 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG----RTPFRG 373 R +YG+ + E++ F +L L++ G +NT++VFTSDNGP E P R Sbjct: 227 KRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFTSDNGPPVERQKKYDVGLSGPLRD 286 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K + +EGGVRVP + W G ++ SD ++ + D+ PT +LAG VP Sbjct: 287 GKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDMLPTFCELAGVD-------VPNDR 339 Query: 433 FIDGVDQTSFFLG-TNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGG 491 IDGV+ LG ++ R+ + A + + +KY+ Q PY + G Sbjct: 340 VIDGVNILPQLLGDQESKALRETQIV----PGATIIHNGWKYYAKQQNPYNNKKPEDWNG 395 Query: 492 FTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++FNL D E+ + +H + L+ M +M LKK Sbjct: 396 L----QPAKEGALFNLKEDIGETTEVSAQHPEIAESLKKNMAKFMAELKK 441 >UniRef50_C1ZCM0 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCM0_PLALI Length = 509 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. Identities = 149/486 (30%), Positives = 219/486 (45%), Gaps = 35/486 (7%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 L KPN++V + DDVGWM+V GG + G TP+ID + +G+ TS Y+QPS + Sbjct: 17 LSASAADKPNILVIMADDVGWMNVSSYGGDIM-GIRTPNIDRIGQEGIRFTSFYAQPSCT 75 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKE 196 RA LTGQ + G+ G P GLQ TL ++L +GY T GK H+G+ +E Sbjct: 76 AGRAAFLTGQLPVRTGLTTVGTPGSPAGLQKEDITLAEILKTKGYSTAQFGKNHLGDLEE 135 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD++ G + + + +P+ P+ + + V G Sbjct: 136 HLPHRHGFDEYFG----NLYHLNGNEDLEDPDRPTDPEFRKKFDP----RGVVSGTADGP 187 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + +T K ME D + + FLD+ AK KPFFL++ + H + G S Sbjct: 188 TKDEGPLTTKRMETFDDEIVAKSLDFLDRKAKDQKPFFLWHCSARLHVFFHFKEGVRGKS 247 Query: 317 PARTS--YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRG 373 A YGD + E + L LE G NT++V+ +DNG + P G T PFRG Sbjct: 248 RAGREDVYGDALAEHDGHIGQLLAKLEATGLDKNTIVVYVTDNGAYQYMWPEGGTSPFRG 307 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT-- 431 KG+TWEGGVR P V W G + R S IVD+ DL PT AG A Sbjct: 308 DKGTTWEGGVRAPCMVRWPGAVGGRVSSEIVDMTDLLPTLASAAGETDAVEKLKKGADYG 367 Query: 432 -----TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQS 486 +DG DQT+ F G + +S RK Y+ L A+R + FK I++ Sbjct: 368 GKNYKVHLDGYDQTALFTGKSDKSARKFVFYYDETVLTAIRYESFKVTFSIKEG------ 421 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESD--SIGVRHIP----MGVPLQTEMHAYMEILK 540 G + ++ + NL DP E + ++ + P+ ++ + Sbjct: 422 ---GHWDDPLVGLGRPMITNLRMDPFERQTGDVNRQYAEHKTWVLTPIVGIAEKHLTTFR 478 Query: 541 KYPPRA 546 +P R Sbjct: 479 DFPVRQ 484 >UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UHK0_RHOBA Length = 478 Score = 425 bits (1092), Expect = e-117, Method: Composition-based stats. Identities = 124/473 (26%), Positives = 195/473 (41%), Gaps = 61/473 (12%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + PN V+ DD+G+ D+ G TP +D +A++G + SP Sbjct: 37 AAADRPPNFVLIFADDLGYGDISCYDS---SGVKTPHLDQLAAEGFRSKDFFVPANVCSP 93 Query: 139 TRATILTGQYSIHHGI-----LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +RA +LTG+Y + G+ Y G T+P+LL GY + +GKWH+G Sbjct: 94 SRAALLTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGM 153 Query: 194 N-KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + S P + GFD++ G S Y + + + ++ Sbjct: 154 ELEGSHPLDAGFDEYLGIPS------------------------NYEPRRGKNHNTLYRG 189 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 + EQ+ +A E+L +R+ D + F+++ D PFF+Y H P+ + Sbjct: 190 KQVEQKNVA------CEELTKRYTDEVIDFIERQ--KDDPFFIYVSHHIVHNPLKPSPDF 241 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 G+S YGD + E++ + +T+ G +NTL++FTSDNGP Sbjct: 242 VGTSEK-GKYGDFIKELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP---TRNGSSGELS 297 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K T EGG RVP W I P + SD + DL P +LAG P +P Sbjct: 298 GGKYCTMEGGHRVPGMFRWTSKIAPNQVSDVTLTSMDLLPLFCELAGVP-------IPDD 350 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI---QQPYAYTQSGY 488 IDG LG +S + +Y+ L AVR ++K H+ QP+ + Sbjct: 351 RQIDGKSILPVLLGQTSESPHQFLYYYNGTNLQAVREGKWKLHLPRTTDDQPFWSKKPDK 410 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 GF + +FNL D E ++ RH + L + L Sbjct: 411 TKGF----VTLNEMRLFNLDRDLGEKKNVADRHPEIVARLNEQAELIRTELGD 459 >UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ11_9BACT Length = 462 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 133/479 (27%), Positives = 205/479 (42%), Gaps = 43/479 (8%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T L L KPNV++ L DD G+ D+ G P ID +A +GL LTS Sbjct: 8 TLISLQFLMAADTSKPNVIIILTDDQGYNDLSCYGSKTIKS---PRIDQLAEEGLKLTSY 64 Query: 131 Y-SQPSSSPTRATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 Y + P S +RA +LTG+Y G+ + P G G T+ +LL GY T+A+G Sbjct: 65 YVASPVCSASRAALLTGRYPKLVGVPGVFFPNRGHKGLDPKHQTIAKLLKSVGYATKAVG 124 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS-- 245 KWH+G+ E P N GFD + G +DM + + + E +K+ + Sbjct: 125 KWHLGDELEFLPTNQGFDSYYGIPYSNDMTPAFSMKYSENCLYREGVDQEALKKAFEANK 184 Query: 246 ------KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 KD V +R E + P + +R+ D +KF+D+ S+KPFFLY Sbjct: 185 IKPVGMKDKVPLMRNDECIEM----PADQSTITKRFTDESIKFIDESTASNKPFFLYLAH 240 Query: 300 RGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 H Y + + G S A YGD + E++ + L + +NTL ++TSDNGP Sbjct: 241 SMPHTPLYVSKDFEGKS-AGGIYGDVIEEIDYNVGRIIDHLNEKNIAENTLFIYTSDNGP 299 Query: 360 EAEVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDL 416 HG + P K +++EGG RVP + W I S+ + D+FPT + Sbjct: 300 WLIKKSHGGSALPLFEGKMTSFEGGQRVPAIIRWPAKIPKDSVSNEMTLSMDIFPTLAKI 359 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVL 476 G I+G + + +N K +H + AVR +KYH Sbjct: 360 TGAKAQDAD-------LINGKNALELY---EDPANFKTKHDYFFYSPRAVRHKNWKYH-- 407 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 T +T G S+++L D ES ++ + + L+ + + Sbjct: 408 ---------QQETFKLKSTARKTKGPSLYDLSKDIGESKNLINDYPEIAAQLKNALLEH 457 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 422 bits (1086), Expect = e-116, Method: Composition-based stats. Identities = 120/464 (25%), Positives = 186/464 (40%), Gaps = 69/464 (14%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SS 137 +T +KPNV++F DD+GW + G G PTP ID++A G+ T + + S Sbjct: 34 PTQTSRKPNVIIFYADDLGWGETGIQGNPQI---PTPHIDSIAKNGVRCTQGFVAATYCS 90 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 P+RA +LTG+Y G + G TTL LH GY T +GKWH+G+ E Sbjct: 91 PSRAGLLTGRYPTRFGHEFNRIANVSGLDLQETTLADRLHGLGYKTACVGKWHLGDGPEY 150 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 +P GFD+F G + T + + +S D +E + ++ D+ Sbjct: 151 RPTKRGFDEFFGTLAN----TPFFHPTKFVDSRVSNDVAEVSDENFYTTDE--------- 197 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-- 315 + V+++ + P+FLY H KY Sbjct: 198 -----------------YAKRSVEWIGQQ--QQSPWFLYLPFNAQHAPLQAPQKYLDRFE 238 Query: 316 ---SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 P R + M M+D + + + GQ +NTL+ F SDNG + P R Sbjct: 239 SIADPKRKLFAAMMSAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQNGPLR 298 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K +T+EGG RVP V WKG + K+ D V D+ PT L AG + + Sbjct: 299 GFKMTTFEGGTRVPFLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAG-------SKIDPA 351 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGG 491 +DGVD +F + + Y+ G+ AVR ++K V Sbjct: 352 WKLDGVDLVPYFTSSIANKPHETL-YWRFGEQWAVRQGDWKLVVARGGS----------- 399 Query: 492 FTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 +++L +D ES ++ + LQ + Sbjct: 400 --------GQPELYDLASDIAESKNLASENPAKVKELQALWDQW 435 >UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1Z7_PEDHD Length = 480 Score = 422 bits (1085), Expect = e-116, Method: Composition-based stats. Identities = 119/471 (25%), Positives = 199/471 (42%), Gaps = 33/471 (7%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSS 136 + ++PNV++ +DD+G+ D G G PTP+ + A +G+ T + Q Sbjct: 19 AQTTKTQRPNVIIINMDDMGYGDTEPYG---MTGIPTPNFNKAAKEGMRFTHFNAAQAIC 75 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENK 195 SP+RA +LTG Y G+ L T+ LL GY T +GKWH+G Sbjct: 76 SPSRAALLTGCYPNRIGLRGALSPDSKIALDTAEETIASLLKKAGYKTAMLGKWHLGSKA 135 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P + GFD F G +DM+ D P+ A++ +S +LP G Sbjct: 136 PNLPLHYGFDSFYGLPYSNDMWP--VDYEGKPQAAVAGKKS--YPELPLL--------DG 183 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 ++ A TP L + V+F++ PFFLY H +A + G Sbjct: 184 DKPADYVRTPDDQAMLTGTFTRKAVRFIEN--NKSAPFFLYLAHPMPHVPLAASAAFRGK 241 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH--GRTPFRG 373 S +GD ++E++ + K+L++N NT+++ SDNGP H FRG Sbjct: 242 SEL-GLFGDVIMELDWSVGEIMKSLDRNKIASNTILIIMSDNGPWLRFGNHAGSSGGFRG 300 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K + W+GG RVP + W G ++ + ++ D+ PT L L ++ P Sbjct: 301 GKMTIWDGGTRVPCIIRWPGKVEAGSVNSNLITNMDILPTLLQL--------SHAAPPEK 352 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVL-IQQPYAYTQSGYQG 490 IDG+ LG + ++ R+ +Y+ N L AVR +K + Y G G Sbjct: 353 KIDGISFADLLLGRSDKAPRQVFYYYYNENSLKAVRYKNWKLVLPHTSVSYTSDIHGKDG 412 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++++L DP E+ + ++ + + + + Sbjct: 413 FPGAATRAEVKMALYDLAHDPGEAYDVQQQYPELVQKMLVFVEEARADMGD 463 >UniRef50_C7ZGP1 Predicted protein n=3 Tax=Leotiomyceta RepID=C7ZGP1_NECH7 Length = 446 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 135/468 (28%), Positives = 220/468 (47%), Gaps = 36/468 (7%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 K KPN+V+ L D++GW ++G GGG+ G TP ID +A++GL+L + + PTR Sbjct: 3 KDPTKPNIVLILADNLGWGELGCYGGGILRGAATPRIDKLATEGLLLHNFNVESDCVPTR 62 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 + ++TG++ I G G P GL + TLP+ L QGY T GKWH+G+ P Sbjct: 63 SALMTGRHPIRTGCRQSVPAGFPQGLTRWERTLPECLKPQGYATAHHGKWHLGDIPGRYP 122 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 + GFD++ G +D + PEVA P + + G + + Sbjct: 123 SDRGFDEWLGIPRTTDESQFTSALGYAPEVAELP-------------YIMKGIAGQDSEN 169 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 I + +D+ +D +L + K++KPFFLY+ HF P+ + G + + Sbjct: 170 ICIYDLEKRRLIDEMLVDQSKDWLSRQVKAEKPFFLYHPLVHLHFPTLPHRDFEGKT-GQ 228 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGST 378 + D M EM+ L L+ G DNT+++F SDNGPE P G P+ G + Sbjct: 229 GEFADSMAEMDYRVGELIDHLDSLGVSDNTVLIFASDNGPEFRPPYKGTAGPWSGTYHTA 288 Query: 379 WEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 EG +RVP + W G + S+ V + D+F T L++AG VP IDG+ Sbjct: 289 MEGSLRVPFIIRWPGHVPTGVTSNETVHVTDIFTTILEIAGSE-------VPSDRPIDGI 341 Query: 438 DQTSFFLG-TNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 Q SFF + +S R+ +++ +L AV+ ++K H++ ++ + Sbjct: 342 SQVSFFKDPSTVKSQREGFLFYIKEELRAVKWKDWKLHLI-----------WEPKVNQSS 390 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 + +FN+ DP+E I + + P+ + + LK P Sbjct: 391 GKLESPYLFNVVRDPKEETDILAYNTWVMQPVLKLRAEFEKSLKSDPA 438 >UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QWC8_9PLAN Length = 468 Score = 419 bits (1078), Expect = e-115, Method: Composition-based stats. Identities = 130/469 (27%), Positives = 188/469 (40%), Gaps = 73/469 (15%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 +PN+VV + DD+G+ D+G +G PTP +DA+A+ G+ TS Y P SP Sbjct: 24 AADASRPNIVVIVGDDMGYHDLGVHG---CKDIPTPHLDALATSGVRCTSGYVSGPYCSP 80 Query: 139 TRATILTGQYSIHHGILMPPMY---GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 TRA +LTG+Y G P G+ G TTL L GY T +GKWH+G ++ Sbjct: 81 TRAGLLTGRYQQRFGHEFNPGPTPTGEIGLPLSETTLADRLKKVGYKTGMVGKWHLGNDE 140 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P + GFD+F GF + Y +P + +L ++ V Sbjct: 141 KRHPLSRGFDEFFGFLGGARTYF------------ATPGNASAGTKLLRGREVVDEK--- 185 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG- 314 E L + V ++D+ S PFFLY H + KY Sbjct: 186 -------------EYLTDAFAREAVAYIDRSKAS--PFFLYLTFNAVHTPMEASQKYLDR 230 Query: 315 ----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP 370 S P R Y M M+D + LE+ L+NTLI F SDNG TP Sbjct: 231 FTAVSDPKRQKYCAMMSAMDDAVGQVVAKLEREKLLENTLIFFVSDNGGPTAANTGDNTP 290 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVP 429 RG K +TWEGG+RVP FV WKG I K+ D V D PT A A P Sbjct: 291 LRGFKATTWEGGIRVPYFVSWKGKIPAGKTYDQPVIQIDFVPT---------ALAAAGAP 341 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 DGV+ + N ++ + ++ G A+R +K + Sbjct: 342 AAEKTDGVNLLPYLTFENKEAPHASL-FWRFGPQTAIRHGNYKLVMTR------------ 388 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++++L D E+ + + L A+ + Sbjct: 389 --------DLDKPALYDLAADISETKDLSADKPEIVAQLTAAYDAWNQE 429 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 419 bits (1078), Expect = e-115, Method: Composition-based stats. Identities = 125/481 (25%), Positives = 200/481 (41%), Gaps = 54/481 (11%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 ++PN++V L DD+G+ ++G G PTP ID++AS G+ T AY P+ SP+ Sbjct: 21 SAAEQPNIIVLLADDLGYGELGCQGNPQI---PTPHIDSLASHGIRFTQAYVTAPNCSPS 77 Query: 140 RATILTGQYSIHHGILMPPMYGQ-----PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 RA +LTG+ G P+ + G T+ + LHDQGY T IGKWH+G Sbjct: 78 RAGLLTGRIPTRFGYEFNPIGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGGT 137 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 + P GFD+F GF + H + K S++ +++ Sbjct: 138 ADYHPFRHGFDEFFGFMHEGHYFVP-PPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHM 196 Query: 255 GEQQAIADITPKYM---------EDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 G + D + E L + V F+++ DKPFFLY H Sbjct: 197 GYDEPDYDANNPIIRGGQPVNETEYLTDAFTREAVSFINRH--QDKPFFLYLAYNAVHSP 254 Query: 306 NYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 K R + + M+ + K ++++G + TLIVF SDNG Sbjct: 255 LQGKKKDIQHFTQIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGP 314 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGH 419 P RG KGS +EGG+RVP + W G + P+++ D V D+FPT++ LAG Sbjct: 315 TRELTSSNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPTSVALAGA 374 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 +P+ +DG + L + A+ ++ G+ AA+R ++K + Sbjct: 375 S-------LPQN--LDGRNLLPLLLQQKTELP-VADFFWRQGRKAALRSGDWKIVQMRG- 423 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + ++NL D E+ + + LQT + + Sbjct: 424 ----------------TREKPVWELYNLANDKSETIDLATEQSEKRMELQTRWNELNAQM 467 Query: 540 K 540 K Sbjct: 468 K 468 >UniRef50_D2QZL2 Sulfatase n=8 Tax=cellular organisms RepID=D2QZL2_9PLAN Length = 529 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. Identities = 139/505 (27%), Positives = 219/505 (43%), Gaps = 34/505 (6%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 L + K+PN+V+ DDVG ++ GV G TP ID +A +G++ T Y++ Sbjct: 15 SLVASAQAQIKRPNIVIIWGDDVGQSNISAYSHGVM-GYKTPHIDRLAREGMMFTDYYAE 73 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMG 192 S + RA+ +TGQ+ + G+ + G GL+ T+ +LL GY T GK H+G Sbjct: 74 QSCTAGRASFITGQHGLRTGLTKVGLPGAALGLRKEDPTIAELLKPLGYATGQFGKNHLG 133 Query: 193 ENKESQPQNVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 + E P GFD+F G ++ ++ E D +P + +DD Sbjct: 134 DRNEFLPTVHGFDEFYGNLYHLNAEEEPEHADYPKDPAFRAKYGPRGVLDCKASDRDDPT 193 Query: 251 -AVRGGE--QQAIADITP---KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 R G+ +Q I D P K ME +D V ++ + +K+DKPFF++ HF Sbjct: 194 VDARFGKVGKQIIKDTGPLTKKRMETIDDDVASRAVDYIQRQSKADKPFFIWVNFTHMHF 253 Query: 305 DNYPNAKYAGSS-PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + + G S + Y D M++ + + K ++ G DNT +++++DNGP Sbjct: 254 RTHVKPESKGQSGRWMSEYADAMIDHDKNVGTVLKAIDDAGIADNTFVMYSTDNGPHMNS 313 Query: 364 PPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG 421 P TPFR K S WEG RVP V W I+P S+ IV D PT L +AG Sbjct: 314 WPDAAMTPFRNEKNSNWEGAYRVPCAVRWPNKIKPGSVSNQIVGHHDWLPTLLAIAGDEQ 373 Query: 422 AKVA-------NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-KLAAVRMDEFKY 473 + DG + G +S R++ Y + +L +R D +K Sbjct: 374 VTDKLLKGYKIGDMTYKVHPDGYNLVPHLTGQEEKSPRESFLYCNDDQQLVGLRYDNWKL 433 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR--------HIPMG 525 + Q+ +G ++ +FNL DP E I H + Sbjct: 434 VFMEQRA-----TGTLRVWSEPFTTLRVPKIFNLRLDPYERADITSNTYYDWLIDHAFLL 488 Query: 526 VPLQTEMHAYMEILKKYPPRAQIKS 550 VP Q + ++ K+YP R + S Sbjct: 489 VPAQDYVGKFLLTFKEYPQRQKAAS 513 >UniRef50_B3CAE2 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CAE2_9BACE Length = 467 Score = 417 bits (1072), Expect = e-115, Method: Composition-based stats. Identities = 127/479 (26%), Positives = 195/479 (40%), Gaps = 59/479 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT- 139 + KPNVV+ DD G+ D+G G + TP ID +A +GL LT Y Sbjct: 21 QAQHKPNVVIIFTDDQGYQDLGCYGSPLI---QTPSIDGMAREGLKLTDFY--------V 69 Query: 140 --------RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 RA +LTG+ + +G+ G TL + L +Q Y T GKWH+ Sbjct: 70 SASVSSASRAGLLTGRLNTRNGVKGVFFPESEGMPSEEITLAEALKEQDYATGCFGKWHL 129 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTE----------WRDVHVNPEVALSPDRSEYIKQ 241 G+ K P + GFD + G +DMY +R+ + E D Sbjct: 130 GDLKGHLPTDQGFDKYFGIPYSNDMYIGPSQKFASNAVFREGYTLSEAKADQDFVRNAPN 189 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 K +++V + P +R+ D ++F+ + +KPFF+Y Sbjct: 190 RATIKKRLNSVSPLFEGDEIIEYPCDQSTTTRRYFDKAIEFVGQ--NKEKPFFVYITPSM 247 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + ++ G S R YGD + E++ L++ G +NTL++F SDNGP Sbjct: 248 PHIPLFASEQFRGKS-KRGLYGDVVEEIDWNVGRFLDYLDQQGLAENTLVIFASDNGPWL 306 Query: 362 EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 + P RG K S +EGGVRVP + WKG I SD I+ DLFPT + G Sbjct: 307 GYKEDSGSADPLRGGKFSYYEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYVG 366 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + IDGVD +SF + + R Y G++ +R ++ Y Sbjct: 367 CKSFR--------QEIDGVDISSFLKNPSLRL-RDEYVYVRGGEVHGIRKGDWAYLPKTG 417 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + +FNL D E++++ + + LQ M Y Sbjct: 418 N--------------SKFKEGDVPELFNLKRDIGETNNLHLEYPEKVKELQEVMQLYQA 462 >UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYA6_RHOBA Length = 490 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 123/482 (25%), Positives = 187/482 (38%), Gaps = 70/482 (14%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 PN VV DD G+ DVG G TP +DA+A G+ TS Y+QP P+RA Sbjct: 20 AAPPNFVVIFTDDQGYEDVGCFGSPDI---RTPRLDAMAKGGMKFTSFYAQPICGPSRAA 76 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM------GENKE 196 ++TG Y + P + T+ ++L +GY + GKW + G + Sbjct: 77 LMTGCYPMRVAERGHTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSD 136 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD F G P S D V + E Sbjct: 137 LLPTGQGFDYFYG--------------------------------TPTSNDRVANLYRNE 164 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + + M L +R+ D + F++K ++PFF+Y H + + G S Sbjct: 165 ELIEPESD---MATLTRRYTDEAISFIEK--NQNQPFFVYIPHTMPHTRLDASKDFKGKS 219 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA----------EVPPH 366 R YGD + E++ + +L + DNT ++FTSDNGP + H Sbjct: 220 -KRGLYGDVIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRLGDH 278 Query: 367 GRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAK 423 G + P R K ST+EGGVRVP ++ G + D I D+ PT LAG Sbjct: 279 GGSAGPLRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAE--- 335 Query: 424 VANLVPKTTFIDGVDQTSFFLGT-NGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYA 482 +P IDG D F G + KA Y+L L AVR ++K H+ ++ Sbjct: 336 ----IPTDRVIDGEDIRHLFHGEFDKADPDKAFFYYLRVHLQAVRQGKWKLHLPREKEPV 391 Query: 483 YTQSGYQGGFTGTVMQT--AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + + + +L D E+ ++ + + L + + L Sbjct: 392 GAAPFGRNAHIAPKDRIGFKQPFLVDLDNDLGETTNVAAENPEVVERLLGLAESMRDDLG 451 Query: 541 KY 542 Y Sbjct: 452 DY 453 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 118/479 (24%), Positives = 198/479 (41%), Gaps = 63/479 (13%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 K+PN +V DD+G+ D+G G TP +D +A +G+ T+ YS P SP+RA+ Sbjct: 15 KQPNFIVIYCDDLGYGDLGCYGSDTV---KTPHLDGLADEGIRFTNWYSNSPVCSPSRAS 71 Query: 143 ILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ ++ G G TL + L GY T GKWH+G ++E+ P Sbjct: 72 LLTGKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEETSPN 131 Query: 201 NVGFDDFRGFN-SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD+F GF D Y+ + + +H + E + Sbjct: 132 AHGFDEFFGFKAGCVDFYSHI-----------------FYWGQAHGVNPLHDLWENETEV 174 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS---- 315 + +YM +L + V F+ + + + PFFL+ H+ + KY Sbjct: 175 WEN--GRYMTEL---ITERSVDFIQRSREQEAPFFLFASYNAPHYPMHAPQKYMDRFAHL 229 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP---------- 365 R + ++D + K L++ G ++T+I F+SDNGP +E Sbjct: 230 PWDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYY 289 Query: 366 -HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 FRG K S +EGG+R P + W + + D + + DL PT LDLAG A Sbjct: 290 GGSAGIFRGHKASLFEGGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAGVDPAA 349 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 + +DG S + + G+L AVR ++K + Sbjct: 350 GPL---QGVALDGSSLKEMLQ-MREPSPHQQLFWEYQGQL-AVREGDWKLVL-------- 396 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 G + + +L DP E ++ R+ + L ++ + E ++++ Sbjct: 397 -----NGKLDFDRVVPDQIHLSDLSRDPGERSNLADRYPEIVERLSRDVRDWYEEVQRH 450 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 117/480 (24%), Positives = 195/480 (40%), Gaps = 56/480 (11%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SS 136 LE +KPN++ + DD+G+ ++G G TP ID +A++G+ T AY+ Sbjct: 9 LEATEKQKPNIIFIMADDLGYAELGCYGQKKI---KTPHIDKLAAEGMKFTQAYAGSMVC 65 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NK 195 P+R+ ++TGQ++ H + + + TT+ ++L GY T A GKW +G Sbjct: 66 QPSRSVLMTGQHTGHTAVRANDL--NQLLYEEDTTVAEVLKIAGYATGAFGKWGLGYEGT 123 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 +P GFDDF G + + N E L +E ++ + D +H Sbjct: 124 PGRPGQQGFDDFTGQLLQVHAHFYYPFWIWNNEHRLMLPENENNQRGRYIHDLIH----- 178 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD--KPFFLYYGTRGCHFDNYPNAKYA 313 + K ++ Y + ++ + + KP+ + + P Y Sbjct: 179 --EDAKAFIQKNKAQPFFAYLPYIIPHVELVVPEESEKPYRGQFPKKQI---LDPRPGYI 233 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP------HG 367 GS T++ + ++D + LE G DNTLI+FTSDNG + +G Sbjct: 234 GSEDGLTTFAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDFFNG 293 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGHPGAKVAN 426 P RG KGS +EGG+RVP W G I K+ + + D+ PT +AG Sbjct: 294 NAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAG-------T 346 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-FLNGKL--AAVRMDEFKYHVLIQQPYAY 483 VP IDG+ LG Q + ++ + GK+ A+R +K Sbjct: 347 TVPSGVDIDGISFLPTLLGKGKQPEHEYLYWEYTRGKIRSRAIRQGNWKAV--------- 397 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +++L TD E+ ++ +H LQ M + +P Sbjct: 398 -----------QNRMNQPIELYDLGTDIGETKNLAKQHPEKIKDLQQIMQQAHSEPRDFP 446 >UniRef50_B8KV72 Arylsulfatase A n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KV72_9GAMM Length = 535 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 141/512 (27%), Positives = 229/512 (44%), Gaps = 61/512 (11%) Query: 63 QHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVAS 122 + QD ++L L + ++PN++V L DD+GW ++G GGG G PTP +D +A Sbjct: 47 EWATQDAAVDKQLRSLTARFERRPNILVILADDIGWGELGSYGGGKLKGAPTPALDQMAD 106 Query: 123 QGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQ-PGGLQGLTTLPQLLHDQGY 181 +G+ S Y++PS +PTR ++TG++ + G+ GQ G + TL ++L + GY Sbjct: 107 EGMRFLSHYTEPSCTPTRVALMTGRHPVRTGLDEVLFPGQVKGLVADEVTLAEVLSEAGY 166 Query: 182 VTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDV--HVNPEVALSPDRSEYI 239 T GKWH+GE +E QPQ GFD +N + WR+ H + E Y Sbjct: 167 ATGMFGKWHLGELQEHQPQYQGFDYAY-YNLYNGAIWPWRENATHYDTENDTGITGPPYF 225 Query: 240 KQLPFSKDDV---------HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD 290 +P + ++ A R + I ++ D D + F+ ++ Sbjct: 226 IDIPEAYEETFDIPLHGIMRAKRNTPAEEIDPLSLSRFNTFDNELTDEVIAFMRDQHEAG 285 Query: 291 KPFFLYYGTRGCHFDNYP--NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 PFF Y+ T + P ++ Y S + +V+ + A L+++L+ G +N Sbjct: 286 IPFFAYFATNTQQVFSCPDVDSPYLDKSNCQAR---QLVQHDKNMARLFESLDNMGIDEN 342 Query: 349 TLIVFTSDNGPEAEV-PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDL 406 TL+++ SDNGP + P G + RG K +EGGVR P W G I P ++ IV + Sbjct: 343 TLVLWISDNGPMNKFYPSTGFSWLRGYKSEVYEGGVRTPGIAKWPGSIAPGQTPIDIVHV 402 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN------ 460 +D + T +LAG A +P IDGVDQ S G S R ++ Sbjct: 403 SDWYTTIANLAGAKAA-----IPDDRVIDGVDQRSLLFNGEGYSRRDYVFFYRYIAYKNL 457 Query: 461 ------GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 L+A+RM + K+H+ ++NL DP ES Sbjct: 458 SSTGPASMLSAIRMGDIKFHL------------------------QSGEIYNLLRDPVES 493 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 ++ P++ + + ++KKYP R Sbjct: 494 HPGRREYLWAMQPIRRMIWEHRAMMKKYPNRV 525 >UniRef50_Q1CY93 Sulfatase family protein n=4 Tax=Bacteria RepID=Q1CY93_MYXXD Length = 553 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 152/496 (30%), Positives = 226/496 (45%), Gaps = 33/496 (6%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 K +KPN++V DD+G ++ G +G TP+ID +A +G ++T Y Q S + R Sbjct: 16 KQSRKPNILVIWGDDIGIWNISAYNQG-MMGYFTPNIDRIAKEGAMMTDCYGQQSCTAGR 74 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 A +TG + G+ M G GLQ T+ ++L GY GK H+G++ P Sbjct: 75 AAFITGMNPLRTGLTTIGMPGAKYGLQDSDPTIAEMLKPLGYTCGHFGKNHVGDSNPYLP 134 Query: 200 QNVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD-VHAVRGG- 255 GFD+F G ++ ++ E D +P ++ +DD R G Sbjct: 135 TVHGFDEFFGNLYHLNAEGEPECPDYPKDPTFKERFGPRGVLRSWATDRDDPTEDKRWGV 194 Query: 256 -EQQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 +Q I D +T K ME +D ++ + F+++ K KPFFL++ T H Y K Sbjct: 195 VGKQRIEDTGALTRKRMETVDGEFLQGTLDFMERAVKDGKPFFLWHNTTRTHVWTYLQEK 254 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE-AEVPPHGRTP 370 Y ++ Y D M E++D+ L L++ G DNTL+VF++DNG E P G +P Sbjct: 255 YRNAT-GYGLYADAMRELDDIVGVLLAKLDELGIADNTLVVFSTDNGVEKMGWPDGGNSP 313 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPG-------- 421 FRG KGSTWEGGVRVP V W G+++P R + I D PT + AG P Sbjct: 314 FRGEKGSTWEGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDLVAQCQR 373 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 A ++DG DQT G + + +G LAAVR D++K Q+ Sbjct: 374 GYKAGDKTFRVYLDGYDQTGLLAGKEKGPRHEFIYVLDSGNLAAVRYDDWKLIFSYQEG- 432 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM-------GVPLQTEMHA 534 G F+G A + NL +DP E ++ VP Q + Sbjct: 433 ----EGPDMWFSGKRFDPAWPYLINLRSDPFEYGPKAGLYLKWYGERMFTFVPAQALVQK 488 Query: 535 YMEILKKYPPRAQIKS 550 + + L YPP S Sbjct: 489 FAQSLLDYPPSQAPGS 504 >UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C581_BEUC1 Length = 458 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 125/467 (26%), Positives = 190/467 (40%), Gaps = 64/467 (13%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 ++PN+V+ DD+G+ D+G G + N TP +D +A++G+ LT Y + P SP+R Sbjct: 2 TQRPNIVLINADDLGYGDLGCYGS---MRNDTPHLDRLAAEGVRLTDFYMASPVCSPSRG 58 Query: 142 TILTGQYSIHHGI-----LMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 +LTG Y G G P GL T+ ++L D GY T AIGKWH G+ Sbjct: 59 GMLTGCYPPRIGFGEFVGRPVLFPGDPVGLDPAERTMARVLGDAGYATAAIGKWHCGDQP 118 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 E P GFD + G +DM + P LP + Sbjct: 119 EFLPTRHGFDSYFGIPFSNDMGRQREHEDWPP--------------LPLMSGESVVQEQP 164 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 +Q++ L +R+ +F+++ A +PFFLY H + A + + Sbjct: 165 DQRS-----------LTERYTVAATRFIEENAH--QPFFLYLAHMYVHVPLFVPAPFLAA 211 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 S YG + ++ + TL + G +NT++VFTSDNG A P RG K Sbjct: 212 SR-NGGYGGAVAALDWSTGVVMDTLRRLGLEENTIVVFTSDNGSRARGEGGSNDPLRGHK 270 Query: 376 GSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 TWEGG RV V W I D + DL PT A A+ + Sbjct: 271 AQTWEGGQRVACVVRWPAAIPAGGVCDAVTRSIDLLPTF-----AAVAGAADWADPARPV 325 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DGVD T+ G G + + Y+ L AVR+ ++K H+ ++ Sbjct: 326 DGVDLTALLTGA-GPAPNETFAYYYMDDLEAVRVGDWKLHLSKRRD-------------- 370 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +++L TD E+ + H + L+ L Sbjct: 371 -----PMRELYDLRTDAAETHDVAADHPDVVARLEAVAETIRADLGD 412 >UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LCL3_PARD8 Length = 476 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 128/463 (27%), Positives = 190/463 (41%), Gaps = 43/463 (9%) Query: 87 NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILT 145 N+V+ LDDVG+ D FNG A G TP+ID +A++G+ T QP S +RA +LT Sbjct: 25 NIVLINLDDVGYGDFSFNG---AYGYTTPNIDKMAAEGVRFTHFLVGQPISGASRAGLLT 81 Query: 146 GQYSIHHGILMPPMYGQPGG-LQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGF 204 G Y G P G T+ ++L +GY T GKWH+G KE P GF Sbjct: 82 GCYPNRIGFSGAPGPDSNYGVHPEEMTIAEVLKQKGYSTAIFGKWHLGSQKEFLPLQNGF 141 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 D++ G +DM+ P + E D + + G Sbjct: 142 DEYYGLPYSNDMW------------PFHPQQGEVFNFPDLPTYDGNEIIGYNTD------ 183 Query: 265 PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGD 324 L + V F+ K +KPFFLY H + K+ G S + YGD Sbjct: 184 ---QTRLTTDYTTRSVNFIKK--NKNKPFFLYLAHNMPHVPLAVSDKFKGKSE-QGLYGD 237 Query: 325 CMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFRGAKGSTWEGG 382 M+E++ ++K L + G DNTL++ TSDNGP H + R AK +T++GG Sbjct: 238 VMMEIDWSVGEIFKALRELGLEDNTLVILTSDNGPWTNYGNHAGSAGGLREAKATTFDGG 297 Query: 383 VRVPTFVYWKGM-IQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS 441 RVP +YWKG + + + DL PT ++ P IDGV Sbjct: 298 NRVPCIMYWKGKTLPGTTCNKLASNIDLLPTFAEITQAPLP--------PRKIDGVSILP 349 Query: 442 FFLGTNGQSNRKAE-HYFLNGKLAAVRMDEFKYHVLIQQ-PYAYTQSGYQGGFTG-TVMQ 498 G + R++ +Y+ L AV FK + Y + G G T ++ Sbjct: 350 LIEGKKDANPRESFVYYYRKNDLEAVTDGMFKLVFPHKYVTYGAYEPGNDGQPGKLTNLE 409 Query: 499 TAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +++L DP E ++ ++ L L Sbjct: 410 IMKPEMYDLRRDPGERYNVITQYPEEAAKLMKIADQKRHELGD 452 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 114/483 (23%), Positives = 181/483 (37%), Gaps = 57/483 (11%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 A+L K+PN++ + DD+G+ + G GG PTP+ID + + G+ +S Y S Sbjct: 17 CAQLAIAAPKRPNILFIVADDLGYGEPGCYGGKDI---PTPNIDKLVASGVRFSSGYVSA 73 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQ-----PGGLQGLTTLPQLLHDQGYVTQAIGK 188 P + +RA ++TG+Y G P+ + G T+ L D GY T +GK Sbjct: 74 PFCAASRAALMTGRYQTRFGFEYNPIGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGK 133 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMY--TEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 WH+G PQ GFD+F GF Y W PD S+ P Sbjct: 134 WHLGGTAPFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGH 193 Query: 247 DDVHAVRGGEQQAIADITP--------KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + A P + +L + F+D+ +P+FLY Sbjct: 194 TVWSTDLHENEPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRH--QAQPWFLYLA 251 Query: 299 TRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 H Y R + + +++ + L +G +NTL+VF Sbjct: 252 YNAVHSPLQGEDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVF 311 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 SDNG + P RG KG W+GG+R+P V WKG I D DL T Sbjct: 312 LSDNGGPTKELTSSNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTAT 371 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 AL LAG + +DGVD G + ++ G+ A+R ++K Sbjct: 372 ALKLAGAETEQA--------KLDGVDLLPLLTGKTTAAPHDTL-FWRVGRKNALRHGDWK 422 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + + +++L D E++++ ++ L Sbjct: 423 LLRQGSKEW---------------------QLYDLAHDVGETNNMAAQNAARVTELSALW 461 Query: 533 HAY 535 + Sbjct: 462 DKW 464 >UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT Length = 477 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 122/510 (23%), Positives = 187/510 (36%), Gaps = 118/510 (23%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 L K KPN+V L DD+G+ DV G TP+ID +A G+ T ++ P+ Sbjct: 15 LSTKAANKPNIVFILADDLGYTDVACYGSKY---YETPNIDKLAKDGIKFTDGHTCGPNC 71 Query: 137 SPTRATILTGQYSIHHGILMP--------------PMYGQPGGLQGLTTLPQLLHDQGYV 182 PTRA++++GQY G+ P+ TL Q L GY Sbjct: 72 QPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQSLKKAGYA 131 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 T GKWH+GE+KE P GFD+ ++ M + D NP+V D Sbjct: 132 TGMFGKWHLGEDKEHHPAQRGFDE-----ALVSMGVHF-DFVTNPKVDYPKD-------- 177 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 E L D + F+ + D+PFFLY Sbjct: 178 --------------------------EYLADFLTDKALDFIKR--HKDEPFFLYLPHYAV 209 Query: 303 HFDNYPNAKYAGSSPART--------SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFT 354 H + A+ +Y + +++ + L++ DNTL++F+ Sbjct: 210 HKPLQAKKELIQKFSAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTLVIFS 269 Query: 355 SDNGPEAEVPPHG---------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIV 404 SDNG G P RG KG +EGG RVP W G I K D + Sbjct: 270 SDNGGVGGYQREGIKKAGDVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVCDQPI 329 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFL-GTNGQSNRKAEHYFLNGKL 463 DL+PT L+LAG P+ +DG G + NR A ++ G L Sbjct: 330 ISIDLYPTLLELAGAKA-------PEKYPLDGTSYLKVLKSGGMKKLNRDAIYWHFPGYL 382 Query: 464 AA------------VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDP 511 A VR ++K + ++NL D Sbjct: 383 GAGADTWRTLPVGVVRCGDWKLMEF--------------------FEDHRLELYNLREDL 422 Query: 512 QESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 E++++ + L+ ++ A+ + ++ Sbjct: 423 GETNNLAAKMPEKAQELEKKLVAWQKEVQA 452 >UniRef50_Q0KB87 Arylsulfatase A or related enzyme n=107 Tax=cellular organisms RepID=Q0KB87_RALEH Length = 585 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 145/522 (27%), Positives = 223/522 (42%), Gaps = 39/522 (7%) Query: 49 KPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGV 108 PA A + +PV A +GKKPN++V DD+G ++ GV Sbjct: 62 TPAQPPAQSNLPVAPEAASA-------PVAVNTSGKKPNILVIFGDDIGQTNISAYSMGV 114 Query: 109 AVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQG 168 VG+ TP+ID +A +G+I T Y++ S + R++ +TGQ + G+ G GLQ Sbjct: 115 -VGHRTPNIDRIAREGMIFTDYYAENSCTAGRSSFITGQSPLRTGLSKVGAPGATVGLQA 173 Query: 169 L-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 T+ + L GY T GK H+G+ E P GFD+F G + E + + Sbjct: 174 RDVTIAEALKPLGYATGQFGKNHLGDRDEYLPTKHGFDEFYGNLYHLNAEEEPQRPY--- 230 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 D+++ + + +H+ G+ + +T K ME +D D KF+ K Sbjct: 231 ---WPKDKNDPFVKNFSPRGVLHSTADGKIEDTGALTTKRMETIDDETTDAAQKFITKQV 287 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART-SYGDCMVEMNDVFANLYKTLEKNGQL 346 ++DKPFF++ T H + G S Y D M+E + L KTL+ Sbjct: 288 QADKPFFVWMNTTRMHAFTHVRPSMQGQSGMPGNDYADGMIEHDGDVGKLLKTLDDLKIA 347 Query: 347 DNTLIVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIV 404 DNT++++T+DNGP P TPFR K + WEG RVP + W G I+ S+ + Sbjct: 348 DNTIVIYTTDNGPNQWSWPDAASTPFRSEKNTNWEGAFRVPAMIRWPGKIKAGTVSNEMF 407 Query: 405 DLADLFPTALDLAGHPGAKVA-------NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY 457 D FPT L G K +DG +Q ++ G + RK +Y Sbjct: 408 SGLDWFPTLLAAVGDGDIKERLLKGTSLGSKNAKVHLDGYNQLAYLTGQTNKGARKEFYY 467 Query: 458 FL-NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 F +G L A+R D++K Q T G + +FNL DP E Sbjct: 468 FNDDGVLVAMRYDDWKVVFCEQ-----TTPGGFQVWQDPFKCLRVPKIFNLRMDPYERAD 522 Query: 517 I-GVRHIPMG---VPLQT----EMHAYMEILKKYPPRAQIKS 550 I ++ L + A+++ YPP + S Sbjct: 523 IVSDQYNDWLGKNAYLTEIGTMKAAAFLQTFVNYPPSQRPAS 564 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 412 bits (1058), Expect = e-113, Method: Composition-based stats. Identities = 129/482 (26%), Positives = 202/482 (41%), Gaps = 80/482 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 +PN+V+ + DD+G+ DVGFNG PTP +D +A+ G++ T+ Y+ P SP+R Sbjct: 41 AKDRPNIVLIVADDLGYSDVGFNG---CKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97 Query: 141 ATILTGQYSIHHGILMPPMYGQ-------PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 A +LTG++ G P PG TTL L + GYVT AIGKWH+G+ Sbjct: 98 AGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGD 157 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 K P GFD++ GF+ ++ W D+ + KD + V Sbjct: 158 AKPFWPNRRGFDEWFGFSGGG--FSYWGDLGM--------------------KDPLLGVH 195 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 G++ + PK + L + VKF+ + PFFLY H ++ + Sbjct: 196 RGDEP----VDPKTLTHLTDDFSTEAVKFIQRHETE--PFFLYLAYNAPHAPDHATRAHL 249 Query: 314 GSSP-----ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 + R YG + M++ + + ++G +NT+I+F SDNG E Sbjct: 250 QKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGGRRE--HAVN 307 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANL 427 P+RG KG +EGG+RVP V W G ++ K + + DLFPTAL AG + Sbjct: 308 FPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS----- 362 Query: 428 VPKTTFIDGVDQTSFFLGTNGQ-SNRKAEHYFLNGKLA---AVRMDEFKYHVLIQQPYAY 483 + +DG + + R + G + AVR +K + Sbjct: 363 --QNDKLDGQNLLPVLTDDKQRLPERPLFWRYSMGDDSYGYAVRDGNWKLIDSRYKDRKL 420 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +F+L DP E + + +H L M A+ + P Sbjct: 421 --------------------LFDLANDPWEREDLAAQHPEQVARLSRMMEAW--DARNVP 458 Query: 544 PR 545 P+ Sbjct: 459 PK 460 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 114/473 (24%), Positives = 179/473 (37%), Gaps = 61/473 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 L L +PN++V + DD+G+ DVGFNG PTP ID++A G+ T+ Y+ Sbjct: 10 LISLNAICASRPNLIVIMADDLGYNDVGFNG---CTEIPTPGIDSIAQNGVKFTNGYTSY 66 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYG----QPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 P+RA +TG+Y G P + + T+ + L GY IGKW Sbjct: 67 SVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKW 126 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G +P GFD+F G + D+ + + + Y + Sbjct: 127 HLGAEPSLRPNKRGFDEFFGHLGGGHRFMP-EDLVIQHTEEVKNELDSYRSWI------- 178 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 D K + L + + D V F+ + KPFFL+ H Sbjct: 179 ---------TRNDTPVKTTKYLTEEFSDEAVSFIKR--NHQKPFFLFLSYNAPHLPLQAT 227 Query: 310 AKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 KY P R +Y + ++D + + ++L++ DNT++ F SDNG + Sbjct: 228 EKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPSHKN 287 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K WEGG RVP + + IQ ++ D V D+F T LA P Sbjct: 288 KSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHA 347 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK-LAAVRMDEFKYHVLIQQPYA 482 +DGV+ F G Q+ + VR +FK + + Sbjct: 348 -------DKPLDGVNLIPFITGEKTQAPHAQIFIRKFDQSRYVVRQGDFKLVIPYK---- 396 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 A ++NL D E ++I H L+ + Sbjct: 397 ----------------DAPPQLYNLSKDIGEENNIAAVHPERVKELEKVRKQW 433 >UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y214_PEDHD Length = 472 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 128/485 (26%), Positives = 194/485 (40%), Gaps = 72/485 (14%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 ++ + KT KPNV+V + DD G++D G GG PTP+IDA+A QG T AY Sbjct: 15 WTGISAAQVKTAAKPNVIVIVSDDAGYVDFGCYGGKQI---PTPNIDAIAKQGTRFTDAY 71 Query: 132 SQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGL--------QGLTTLPQLLHDQGYV 182 +P+RA ILTG+Y G G T+ + GY Sbjct: 72 VSASVCAPSRAGILTGRYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYK 131 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 T AIGKWH G+ + P N GF++F GF + ++ N Sbjct: 132 TIAIGKWHQGDEPKHFPLNRGFNEFYGFTGGHRDFFAYKGKRTNE--------------- 176 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 HA+ ++ + + L + D F+ A DKPFF+Y Sbjct: 177 -------HALYNNKEIVPENE----ITYLTDMFTDKATSFIT--ANKDKPFFMYLSYNAV 223 Query: 303 HFDNYPNA----KYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 H +YA + R +Y M ++D + TL+ N NTLI+F +DN Sbjct: 224 HTPMNAKKDLMERYASIADTGRRAYAAMMTSLDDGIGKVMATLKANQLDKNTLIIFINDN 283 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG-IVDLADLFPTALDL 416 G A V P RG KGS WEGG+RV + W G I K+D V D+ PTA+ Sbjct: 284 GG-ATVNSSDNGPLRGMKGSKWEGGIRVAMMMKWPGHIAANKTDSRPVSSLDILPTAI-- 340 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVL 476 T +DGV+ + N ++ +A Y+ G AA+R +K + Sbjct: 341 -----GAGKGKQKGTKKLDGVNLLPYLSAGNKKTPHEAL-YWRRGVAAAMREGNWKLIRV 394 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 + P +F+L D E+ ++ ++ L ++ + Sbjct: 395 KESP-----------------TVQNVLLFDLSKDLSETKNLSEKYPAKVKELLVKLAEWE 437 Query: 537 EILKK 541 + L + Sbjct: 438 KGLDQ 442 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 115/491 (23%), Positives = 175/491 (35%), Gaps = 94/491 (19%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 + + PN+V+ + DD+G+ D+ G TP ID +A+ L T +S Sbjct: 26 SAAEAAQQPPNIVLIMADDLGYGDLACYGNKQV---KTPHIDRLAASALKFTDFHSAGAM 82 Query: 136 SSPTRATILTGQYSIHHGILMPP-----MYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +PTRA +LTGQY G G T+ +LL QGY T GKWH Sbjct: 83 CTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWH 142 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G P N GFD FRG S + D N + + + S Sbjct: 143 LGYQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNEDWWHNNEISM------------- 189 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY--- 307 E+ AD+ KY V F++ A +PFFLY HF Sbjct: 190 -----EKGYTADLLSKY-----------SVAFME--ANRTRPFFLYVPHLAIHFPWQGPQ 231 Query: 308 -PNAKYAGSSPARTSYG-------------DCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 P + AG +G + ++ + L++ NTL++F Sbjct: 232 DPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIF 291 Query: 354 TSDNGPEAEVPPH-----GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLAD 408 TSDNG + P RG K + +EGG RVP + W G+I +D D Sbjct: 292 TSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAGVTDQTAHSVD 351 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 L PT AG DG+D + G+ + ++ G AVR Sbjct: 352 LLPTLAQAAGISATNFQT--------DGLDLAPLW--QTGRPLADRDLFWRMGNNRAVRR 401 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++K ++ S +++L TD E + H + + Sbjct: 402 GQWKL----------------------CLKNNRSELYHLETDLGEQQNRAAEHPEIVKSM 439 Query: 529 QTEMHAYMEIL 539 + + + Sbjct: 440 SQALKEWEADV 450 >UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI83_PLALI Length = 558 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 124/468 (26%), Positives = 191/468 (40%), Gaps = 43/468 (9%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSS 136 + +KPNVV+ DD+G+ DVG G + TP+ID +A +G+ TS Y +Q Sbjct: 100 AAEARPEKPNVVIINCDDLGYADVGAFGATIC---KTPEIDRMAREGVKATSFYVAQAVC 156 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENK 195 S +R +LTG GIL + G+ TL +L QGY T GKWH+G Sbjct: 157 SASRTALLTGCLPNRIGILGALSHVSKNGIADSEVTLGELFQSQGYSTAMYGKWHLGYQA 216 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P + GF + G +DM++ ++ Y K P Sbjct: 217 QFLPGHHGFGEALGIPYSNDMWS----------------KNPYGKFPPLPLFRQKGDSPA 260 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 E ++ D V F+D+ A DKPFF+Y H + + + Sbjct: 261 EIIGHDTDQSRFTTDFTMA----AVSFIDRHA--DKPFFIYLAHPMPHTPIFVSEERNSG 314 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT--PFRG 373 A+ Y D + E++ + +TLEK+ TL++FTSDNGP H + P R Sbjct: 315 ERAQ-LYRDVIGEIDWSVGTIRQTLEKHQLTRKTLVIFTSDNGPWLVFGNHAGSTGPLRE 373 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 KG+ W+GG RVP W G+I P D + DLFPT + G Sbjct: 374 GKGTMWDGGARVPFVACWPGVIPPDTTVDLPMATYDLFPTFAKMLGAKLP--------DH 425 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 IDGVD + +A ++ L AVR +K + + Sbjct: 426 PIDGVDIWPQLTSASKAQPHQALWFYYGRDLIAVRSGPWKLVFPHTYVHPVERGNDGQRG 485 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + +++NL +D E+ ++ +H + L AY E+ + Sbjct: 486 KLVNRKFTELALYNLDSDIGETTNLASQHPEIVKQL----EAYAEVAR 529 >UniRef50_B4AUP3 Sulfatase n=2 Tax=Bacteria RepID=B4AUP3_9CHRO Length = 570 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 142/525 (27%), Positives = 232/525 (44%), Gaps = 57/525 (10%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 +A + + KKPN++V + DDVGW ++ G +G TP+ID +AS+G++ T Y++ Sbjct: 33 NIALAQTISPKKPNILVIMGDDVGWFNISAYNRG-MMGYKTPNIDRIASEGMLFTDVYAE 91 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMG 192 S + RA +TGQ G+ + G P GL G TL +LL GY T GK H+G Sbjct: 92 QSCTAGRAAFITGQSPGRTGMTKVGLPGVPIGLSGEDPTLAELLKPLGYATGQFGKNHLG 151 Query: 193 ENKESQPQNVGFDDFRG--FNSVSDMYTEWRDVHVN--------PEVALSPDRSEYIKQ- 241 + E P GFD+F G ++ ++ E D N P L +Y+ Q Sbjct: 152 DLDEFLPTVHGFDEFYGNLYHLNAEEEPENPDYPKNEIFKQKLGPRGVLHSYSLDYVTQE 211 Query: 242 LPFSKDDVHAVRGGEQQAIAD----------ITPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 P + E + I +T + M+ +D ++D ++F++K + K Sbjct: 212 NPEITCPEENLSKYEDENIPGLGQVICNTGPLTIERMKTVDDEFLDASLEFINKTQQEGK 271 Query: 292 PFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDC----MVEMNDVFANLYKTLEKNGQLD 347 PFF+++ T H + + K +P Y D M E + L L++ G D Sbjct: 272 PFFVWFNTTRMHV--FTHLKDDSYNPDLEKYDDIYGEGMEEHDQDVGILLDYLDEQGLTD 329 Query: 348 NTLIVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVD 405 +T++++T+DNG E P G TPF G K + WEGG RVP + W G I+ + S+ I+ Sbjct: 330 DTIVIYTTDNGAEVFSWPDGGTTPFHGEKNTNWEGGFRVPAMIRWPGYIEAGQISNEIIS 389 Query: 406 LADLFPTALDLAGHPGAKVANLVPKT-----------TFIDGVDQTSFFLGTNGQSNRKA 454 D PT L AG P L + +DG + + S R+ Sbjct: 390 HQDWLPTLLAAAGAPDDIAEQLKSEDGYNAGIKTFKKIHLDGYNLLPYLTDQEYHSPRRW 449 Query: 455 EHYFLNGKL-AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ- 512 Y + +A+R+D++K Q+ + ++ + + NL DP Sbjct: 450 FVYLTDDAYPSAIRVDDWKVIFSEQRAEGFEV------WSEPYVNLRVPMILNLRRDPFE 503 Query: 513 ----ESDSIGV---RHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 ES++ RH + P Q +++ ++YPPR + S Sbjct: 504 KAPEESNNYIDWRFRHTFVIAPAQIVAQEFLDTFREYPPRQKPAS 548 >UniRef50_P15289 Arylsulfatase A component C n=34 Tax=Euteleostomi RepID=ARSA_HUMAN Length = 507 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 116/447 (25%), Positives = 195/447 (43%), Gaps = 37/447 (8%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 + PN+V+ DD+G+ D+G G + TP++D +A+ GL T Y +P+RA Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSST---TPNLDQLAAGGLRFTDFYVPVSLCTPSRAA 75 Query: 143 ILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE--SQ 198 +LTG+ + G+ + G T+ ++L +GY+T GKWH+G E Sbjct: 76 LLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFL 135 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P + GF F G D P + + +P + Sbjct: 136 PPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLAN----------- 184 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 + P ++ L+ R+M + + + D+PFFLYY + H+ + +A S Sbjct: 185 LSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAERS-G 243 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKG 376 R +GD ++E++ L + G L+ TL++FT+DNGPE G R KG Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRCGKG 303 Query: 377 STWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 +T+EGGVR P +W G I P + + DL PT LAG P +DG Sbjct: 304 TTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLP--------NVTLDG 355 Query: 437 VDQTSFFLGTNGQSNRKAEHYF-----LNGKLAAVRMDEFKYHVLIQQ-PYAYTQSGYQG 490 D + LG G+S R++ ++ + AVR ++K H Q ++ T + Sbjct: 356 FDLSPLLLG-TGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC 414 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSI 517 + ++ +++L DP E+ ++ Sbjct: 415 HASSSLTAHEPPLLYDLSKDPGENYNL 441 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 122/485 (25%), Positives = 195/485 (40%), Gaps = 70/485 (14%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A L + K+PNV++ DD G +DV G A TP+ID +A +G++ + Y+ P Sbjct: 18 APLFAQQQKRPNVLIIYTDDQGTLDVNCYG---AKDLHTPNIDRLAKEGVLFSQFYAAAP 74 Query: 135 SSSPTRATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 SP+RA++LTG+Y + P G G T+ ++ D GY T IGKWH+G Sbjct: 75 VCSPSRASLLTGRYPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG 134 Query: 193 ENKESQPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 + E+ P GFD GF D Y+ + ++ + H Sbjct: 135 YSPETMPNQQGFDYSFGFMGGCIDNYSHY---------------------FYWAGPNRHD 173 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + Q+ D K+ DL + FL+K ++DKPFFLY+ H+ K Sbjct: 174 LWRNGQEIWED--GKFFADLT---VQEVNGFLEKNKRADKPFFLYWAINMPHYPLQGQEK 228 Query: 312 ----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 Y R Y + M++ + + L++ G +NT++VF SD G E G Sbjct: 229 WRQYYKDLPAPRRMYAAAVSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFG 288 Query: 368 R----TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGA 422 P+RGAK S +EGG+RVP + W G + + D + D +PT L Sbjct: 289 GGGFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNEVRDQLCVNIDWYPTLAGLCKVALP 348 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK----LAAVRMDEFKYHVLIQ 478 + IDG D + S + G AVR +K Sbjct: 349 Q--------RKIDGKDIQQVITSSKTSSPHDIFFWQSQGTKENPQWAVRQGNWKL----- 395 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVF--NLYTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 + +T +F NL D E+ ++ +H + L+ + ++ Sbjct: 396 ---------LHNPSSAKKAETGPDDLFLVNLQQDTSEAKNLAAQHPEIVSSLKEQYLKWI 446 Query: 537 EILKK 541 + + Sbjct: 447 NEVVQ 451 >UniRef50_A6DF77 Arylsulphatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF77_9BACT Length = 518 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 119/520 (22%), Positives = 209/520 (40%), Gaps = 74/520 (14%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 KKPN++ L DD+G+ D+ T ++D +A++G+ T A+S +P+R Sbjct: 16 ADKKPNILFILADDLGYGDLSCYNDE--AKVKTANLDQLANEGMRFTDAHSPSTVCTPSR 73 Query: 141 ATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK--- 195 +I+TG+ + + + G + TLPQ+L + GY T GKWH+G + Sbjct: 74 YSIMTGRMAFRLNFKGVFTGVSGPCLITKDRLTLPQMLRNNGYETAMFGKWHIGMSFLDK 133 Query: 196 ------------------------------------ESQPQNVGFDDFRGFNSVSDMYTE 219 P N GFD F G +V ++ Sbjct: 134 NGDVIEVSEPPRKTPKLKKQEIALEAIKRVDYSKPIPDGPLNQGFDHFFG--TVCCPTSD 191 Query: 220 WRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKY-MEDLDQRWMDY 278 W +++ + P K L R G + P + E++D +++ Sbjct: 192 WLYAYIDGDRIPVPPTKIVDKALLPKHFWSFDCRAG------LLAPNFKHENVDMVFLEK 245 Query: 279 GVKFLDKMAKSD--KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANL 336 + FLD K KPFFL++ + H ++P ++ G + A +GD + + + + L Sbjct: 246 SLSFLDSHHKKQSAKPFFLFHSLQAVHLPSFPAKEFQGKTQA-GPHGDFIYQFDYIVGKL 304 Query: 337 YKTLEKNGQLDNTLIVFTSDNGPEAE--------VPPHGRTPFRGAKGSTWEGGVRVPTF 388 + L+ G +NTL++ +SDNGPE +G P+RG K WEGG RVP Sbjct: 305 VEKLKTLGMAENTLVIISSDNGPEVGTTINMRERYKHNGARPWRGVKRDNWEGGHRVPMI 364 Query: 389 VYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 +W G I+ S V L D+ T + V +P D + LG Sbjct: 365 AWWPGKIRSSSVSQQTVCLTDIMATCASI-------VNTSLPNNAAEDSFNILPILLGQT 417 Query: 448 GQSNRKAEHYFLNGKLAAVRMDEFKY--HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVF 505 ++ R+ + ++R ++KY H + A + ++ Sbjct: 418 TKAIREFTLHQTISLDLSIRHGDWKYLDHSGSGGNNYSGGRIKKALGLTNSKINAPAQLY 477 Query: 506 NLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPR 545 NL DP+E +++ +H + L+ ++ + + P R Sbjct: 478 NLKADPKEVNNLYYQHPEIAQQLKAKLEEFKTSGRSAPKR 517 >UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8S3_9PLAN Length = 481 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 130/494 (26%), Positives = 190/494 (38%), Gaps = 86/494 (17%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 + KPN +V DD+G+ D+ G TP ++ +A++G LT P + Sbjct: 33 AAQATAKPNFIVIFADDLGYGDLECYGHP---RFKTPHLNQMAAEGARLTQFNVPVPYCA 89 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLL-------HDQGYVTQAIGKWH 190 P+RAT+LTG+Y HG+ P G + + + GY T IGKWH Sbjct: 90 PSRATLLTGRYPWRHGVWYNPAPDGQQFRSG-VGIAESELLLSELLKENGYATICIGKWH 148 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G + E P GFDD+ G +DM ++ + E + + P + Sbjct: 149 LGHDPEYYPTRHGFDDYLGILYSNDM------------RPVNLMQGEKLLEYPVIQ---- 192 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 +L +R+ + VKF+ + + PFFLY H + Sbjct: 193 ------------------ANLTKRYTERAVKFIQE--NQEGPFFLYLPHAMPHKPLAASE 232 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP 370 + S A YGD + E++ ++KTL + +NTL++F SDNGP G Sbjct: 233 AFYKKSGA-GLYGDVIAELDWSVGEIFKTLRELNLDENTLVIFASDNGPWFGGNTAG--- 288 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVP 429 G K +TWEGG+RVP W G I PR+ D + D+FPT L AG P VP Sbjct: 289 LSGMKSTTWEGGLRVPMIARWPGKIPPRQVIDTVCGSIDVFPTILKQAGIP-------VP 341 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV-------------- 475 IDG D + +A + L VR +K HV Sbjct: 342 ADRVIDGKDLFPVLT-KQAPTPHQALYSMKGNSLFTVRSGPWKLHVKPSPRQVLAGKGKN 400 Query: 476 ----------LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 I PY Q G Q +FNL D E D++ H + Sbjct: 401 WIDPRGPDGITIIAPYEQAMPDQQPGI-HNGDQPVPMMLFNLQQDIAEQDNVADEHPEVV 459 Query: 526 VPLQTEMHAYMEIL 539 L H + Sbjct: 460 ARLMKLYHEMQAEV 473 >UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 Length = 495 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 119/479 (24%), Positives = 182/479 (37%), Gaps = 85/479 (17%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 + +++ + P+++ + DD GW DVGF+G + TP++D +A G L YSQP Sbjct: 39 AVAQQSSQPPHILFIMSDDQGWKDVGFHGSDI----RTPNLDQLAKTGARLEQYYSQPMC 94 Query: 137 SPTRATILTGQYSIHHGILM--PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE- 193 +P+RA +LTG+Y +G+ P G+ G LPQ L + GY T +GKWH+G Sbjct: 95 TPSRAALLTGRYPHRYGLQTLVIPSAGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHA 154 Query: 194 NKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + + P+ GFD G D +T H+ Sbjct: 155 DPKYWPRQRGFDYQYGPLLGEIDYFT-------------------------------HSA 183 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 G + K + VK ++K P FLY H KY Sbjct: 184 HGKVDWYRNNQLIKEEGYVTTLLGQDAVKLIEKH-NPKTPLFLYLAFTAPHAPYQAPQKY 242 Query: 313 AGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP-------- 359 P R +Y + M+D + LEK G +NTLIVF SDNG Sbjct: 243 LDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTLIVFQSDNGGPRSAQFTG 302 Query: 360 ----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTAL 414 P P+R K S +EGG RV W G IQP + + + D++PT Sbjct: 303 EVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPGTVVNHPIHIVDMYPTLT 362 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 LA + V K +DG++ S R Y + AA+ +++K Sbjct: 363 GLA-------SVSVGKNKPLDGLNIWPAL-SEAKPSPRSQVVYDIEPFRAALSQEDWKLV 414 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 P +FNL D E ++ ++ + L+ ++ Sbjct: 415 WKATLPSRL-------------------ELFNLSQDVSEQTNLAEQNPEIVSRLKQQIE 454 >UniRef50_Q46SG5 Arylsulfatase n=3 Tax=Proteobacteria RepID=Q46SG5_RALEJ Length = 542 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 148/536 (27%), Positives = 242/536 (45%), Gaps = 56/536 (10%) Query: 46 YLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 + +K + + + A ++T +PN++V DD+GW +V G Sbjct: 3 HTLKRLVAVTATVAAMSPFAAGAQQT-------------RPNILVIWGDDIGWENVSAYG 49 Query: 106 GGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQP-G 164 GV G TP+ID++ +G+ T Y+QPS + RA +TGQY I G+ G G Sbjct: 50 MGVM-GYTTPNIDSIGMEGIRFTDQYAQPSCTAGRAAFITGQYPIRSGMTTVGQPGDKLG 108 Query: 165 GLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRG--FNSVSDMYTEWRD 222 +L +++ GY T GK HMG+ P GFD+F G ++ ++ E D Sbjct: 109 WQPASPSLGEVMKQAGYRTGFFGKSHMGDRNSHLPTVHGFDEFFGNLYHLNTEELPENHD 168 Query: 223 VHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE-----------QQAIADITP---KYM 268 D++ K P +A + +Q I D P K M Sbjct: 169 YQAYANGYPGGDKAFAQKFAPRGVLHTYATDNDDPTDMPRFGPVGKQKIEDTGPLTKKRM 228 Query: 269 EDLDQ-RWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDC-- 325 ED D + + F+ + DKPFF++ T H + N K+ ++ T D Sbjct: 229 EDFDAAEVIPKAIDFMQGAKQKDKPFFVWLNTSRMHLYTHLNDKWRYAAAKYTHEDDMQG 288 Query: 326 --MVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTWEGG 382 M++ + + + L+++G NT++ +++DNGPE PHG T PFRG K +T+EGG Sbjct: 289 SGMLQHDHDIGLVLEYLKRSGLDKNTIVWYSTDNGPEHVSWPHGSTTPFRGEKMTTYEGG 348 Query: 383 VRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS 441 VRV + + W G+I+P + +GI D+F T +AG P K +IDG++ Sbjct: 349 VRVVSMLRWPGVIKPGQIKNGIQAHQDMFTTFAAIAGVPDVVGQMKREKHQYIDGINNLD 408 Query: 442 FFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG 501 ++ G S RK Y+ KL AVRM +K H +++ Y GT+ + Sbjct: 409 YWTGKTADSARKDFLYYYENKLTAVRMGPWKLHFSLKEDYY-----------GTLQPRSV 457 Query: 502 SSVFNLYTDPQESDS-------IGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 + +FNL +DP ES + + + P+ + ++++ + YPP KS Sbjct: 458 TMLFNLRSDPFESYDSKDAYGHLLQKAQWISGPMNELIASHLKTIADYPPVQPAKS 513 >UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG6_9BACT Length = 499 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 121/475 (25%), Positives = 208/475 (43%), Gaps = 54/475 (11%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 + PN + + DD G+ D+G G + TP+ID +A +G+ T Y++ SP Sbjct: 17 TAKAEMPNFIFIMTDDQGYGDLGCYGHPII---KTPNIDKMADRGVRFTDFYARHKCSPA 73 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 RA+++TG ++ G+ GL + + T+P++L ++GY T IGKWH+G Sbjct: 74 RASLMTGAFNFRVGVGSIVYPNSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLGHTAGYL 133 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P++ GFD + G + + + + + P I+ + D V G Sbjct: 134 PRDQGFDYYFGVPGTN--HGDAKTHKLPVAEGFKPSGEFTIED--YWADKGKGVHGNSTI 189 Query: 259 AIADIT----PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 + + P + L +R+ V+++ + DKPFFLY+ H +A + G Sbjct: 190 LMKNDNVIEWPTDITQLTKRYTHDAVRYIKE--NKDKPFFLYFAHGTPHHPYTVDAAFRG 247 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA-----EVPPHGRT 369 S YGD + E++ + K L++NG T+I FTSDNG ++ Sbjct: 248 KSD-HGLYGDMIEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKEHAEKGSNL 306 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P +G KGS+ EGGVRVP + W G + +K++ I L D+FPT LAG V Sbjct: 307 PLKGWKGSSEEGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEP-----EV 361 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG--KLAAVRMDEFKYHVLIQQPYAYTQS 486 P+ IDG + + + ++ K+ VR FKY Sbjct: 362 PQK--IDGNNIFPIMMCEPDVKSPNKYIFYAGNTPKITGVRNHRFKY------------- 406 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 T S +++++ D E+ ++ ++ + LQ M A+ + + + Sbjct: 407 -----------STKTSGLYDMHADIGETTNVADKYPEVLQELQKAMEAFQKDIDE 450 >UniRef50_A0Z7U6 Arylsulfatase n=2 Tax=Gammaproteobacteria RepID=A0Z7U6_9GAMM Length = 512 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. Identities = 141/488 (28%), Positives = 224/488 (45%), Gaps = 35/488 (7%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 KPN ++ DDVG+ +V G +G TP+ID++A G++ T AY + S + RA Sbjct: 25 ASDKPNFLMLWGDDVGYWNVSAYNQG-MMGYETPNIDSIAKDGMLFTHAYGEQSCTAGRA 83 Query: 142 TILTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +TGQ G+L + G + G Q T+ + L +GY+T GK H+G+ E P Sbjct: 84 AFVTGQSGFRTGLLKVGLPGAKEGMDQRDPTIAEYLKSKGYMTGQFGKNHLGDRDEHLPT 143 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 N GFD+F G + + + +P+ P E + + + G + Sbjct: 144 NHGFDEFIG----NLYHLNAEEEPEHPDYPKDPAFREKFGP----RGVIKSSSDGRIEDT 195 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART 320 +T K ME +D+ + FL++ K+D+PFFL+Y T H + G + Sbjct: 196 GPLTKKRMETIDEEVTAAALDFLERAVKADQPFFLWYNTTRMHVHTRLKPESEGVT-GLG 254 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTW 379 + D MVE + + + L++ G DNT++++T+DNG E P G T PFRG K + W Sbjct: 255 VFPDGMVEHDGMIGQMLDKLDELGITDNTVVMYTTDNGAEKFTWPDGGTAPFRGEKNTNW 314 Query: 380 EGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVA-------NLVPKT 431 EGG RVP V W G+I+P S+GIV D FPT G K Sbjct: 315 EGGYRVPLLVKWPGLIEPGSRSNGIVSHMDWFPTIAAALGDTDLKEQVSKGSAFGEGNSK 374 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 +DG + ++ G +S R YF +G L +R +K Q+ +++ Sbjct: 375 VHLDGYNMLPYWGGETDESPRAEFFYFSDDGNLVGMRYQRWKAVFAEQRAHSFDV----- 429 Query: 491 GFTGTVMQTAGSSVFNLYTDPQES--------DSIGVRHIPMGVPLQTEMHAYMEILKKY 542 + +Q +F+LY+DP E +H+ + VP QT + ++ +Y Sbjct: 430 -WADPFVQLRVPKIFDLYSDPFEEAEHESIHYKDWWFQHVFLLVPAQTYVGEFLGTFVEY 488 Query: 543 PPRAQIKS 550 PPR + S Sbjct: 489 PPRQKPAS 496 >UniRef50_A6UG37 Sulfatase n=16 Tax=Bacteria RepID=A6UG37_SINMW Length = 552 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. Identities = 146/529 (27%), Positives = 240/529 (45%), Gaps = 39/529 (7%) Query: 34 RKGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLL 93 RK G ++ + TI A Q+ + +GK PN++V Sbjct: 9 RKNAQGSISIDRRSLLLGGTILAAAAAANGAVAVGSAKAQE--QSSAGSGKTPNILVIFG 66 Query: 94 DDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHG 153 DD+G + G+ G TP+ID +A++G I T AY Q S + RA+ + GQ G Sbjct: 67 DDIGIPQISAYTMGLM-GYRTPNIDRIAAEGAIFTDAYGQQSCTAGRASFILGQEPFRTG 125 Query: 154 ILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNS 212 +L M G P G+Q + T+ ++ +GY T GK H+G+ E P N GFD+F G Sbjct: 126 LLTIGMPGDPHGIQDWMPTIADVMKSKGYATGQFGKNHLGDRDEHLPTNHGFDEFFGNLY 185 Query: 213 VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLD 272 + E PE P E+ K + + + G+ + + K ME +D Sbjct: 186 HLNAEEE-------PEGYFYPKDEEFRKNFG-PRGVIKSSADGKIEDTGALNTKRMETVD 237 Query: 273 QRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDV 332 + ++ F+D+ AK+DKPFF ++ + H + + G + + + D MVE + Sbjct: 238 EEFLAAAKDFIDRQAKADKPFFCWFNSTRMHVFTHLKPESMGKT-GKGIHADGMVEHDGH 296 Query: 333 FANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAKGSTWEGGVRVPTFVYW 391 L + L+ G +NT++++T+DNG E + P G T F G KG+TWEGG R+P V W Sbjct: 297 VGQLLQQLDDLGITENTIVLYTTDNGAELALWPDGAMTMFHGEKGTTWEGGFRIPMMVRW 356 Query: 392 KGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT-------FIDGVDQTSFF 443 G+++P + + V L D PT AG P K + +DG D T+ Sbjct: 357 PGVVKPGTQINDPVTLMDWMPTFATAAGIPDVKEEMKTGFKSGDKTFKVHLDGYDLTALL 416 Query: 444 LGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS 502 G + R+A +YF G L A+R +++K + + T T + + Sbjct: 417 KGEAEEPPREAVYYFDQGGNLNAIRWNDWKLSFAV--------NSEGNIATATRETPSWA 468 Query: 503 SVFNLYTDPQES--------DSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++ NL DP E R++ + VP+Q+++ + + +YP Sbjct: 469 NIANLRMDPYERGTKEGGGAMEFIARNMWLLVPIQSKIKEFFQDFDQYP 517 >UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deuterostomia RepID=GALNS_HUMAN Length = 522 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 126/490 (25%), Positives = 205/490 (41%), Gaps = 59/490 (12%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 + PN+++ L+DD+GW D+G G TP++D +A++GL+ + YS P SP+R Sbjct: 27 APQPPNILLLLMDDMGWGDLGVYGEP---SRETPNLDRMAAEGLLFPNFYSANPLCSPSR 83 Query: 141 ATILTGQYSIHHGILMPPMYGQP---------GGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 A +LTG+ I +G + + G LP+LL GYV++ +GKWH+ Sbjct: 84 AALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHL 143 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G + P GFD++ G +P P ++ +P +D Sbjct: 144 GHRPQFHPLKHGFDEWFG----------------SPNCHFGPYDNKARPNIPVYRDWEMV 187 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 R E+ I T + +L Q ++ + F+ + A+ PFFLY+ H Y + Sbjct: 188 GRYYEEFPINLKTGE--ANLTQIYLQEALDFIKRQARHH-PFFLYWAVDATHAPVYASKP 244 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP---HGR 368 + G+S R YGD + E++D + + L+ DNT + FTSDNG P Sbjct: 245 FLGTSQ-RGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSN 303 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANL 427 PF K +T+EGG+R P +W G + + S + + DLF T+L LAG Sbjct: 304 GPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTP------ 357 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ-QPYAYTQS 486 P IDG++ L G+ + Y+ L A + + K H + + Sbjct: 358 -PSDRAIDGLNLLPTLL--QGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQ 414 Query: 487 GYQGGFTGTV---------MQTAGSSVFNLYTDPQESDSI---GVRHIPMGVPLQTEMHA 534 G V T +F+L DP E + + + + + Sbjct: 415 GIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSVVQQ 474 Query: 535 YMEILKKYPP 544 + E L P Sbjct: 475 HQEALVPAQP 484 >UniRef50_Q488V4 Sulfatase family protein n=30 Tax=Bacteria RepID=Q488V4_COLP3 Length = 525 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 141/488 (28%), Positives = 215/488 (44%), Gaps = 34/488 (6%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 ++PN++ DD+G ++ G +G T +ID +A +G++ T Y + S + RA Sbjct: 35 DTERPNILAIWGDDIGQSNISAYTHG-MMGYKTTNIDRIAKEGVLFTDYYGENSCTAGRA 93 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +TGQY + G+ + G GL+ T+ +LL D+GYVT GK H+G+ E P Sbjct: 94 AFITGQYPVRTGLTKVGLPGSDKGLRAEDVTIAELLKDRGYVTGQFGKNHLGDKDEFLPT 153 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 N GFD+F G + E PE P Y K+ + +H+ G+ + Sbjct: 154 NHGFDEFLGNLYHLNAEEE-------PEHPDYPKDQAYKKRFG-PRGVIHSFADGKIEDS 205 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART 320 +T K ME +D ++ KF+DK K++KPFF+++ H + + G S Sbjct: 206 GPLTKKRMETIDDEFLAATTKFIDKAHKNNKPFFVWFNATRMHIWTHLKEESKGLSKRGG 265 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTW 379 YGD M+E + L L++ DNT++++T+DNG E P G T PF+G K +TW Sbjct: 266 IYGDGMMEHDYQVGVLLDQLDRLAIADNTIVLYTTDNGAEVFSWPDGGTIPFKGEKNTTW 325 Query: 380 EGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAK-------VANLVPKT 431 EGG RVP V W G I + +V D PT L AG K N Sbjct: 326 EGGFRVPAMVRWPGKITAGDAKIEMVSHMDWAPTLLAAAGVTDIKEKLKQGTTVNGKKYK 385 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 +DG + + G ++ R + YF G L+AVR + K IQ+ Sbjct: 386 VHLDGYNLLPYLTGATDEAPRPSYLYFTDGGDLSAVRFGDMKLQYSIQECEGLNV----- 440 Query: 491 GFTGTVMQTAGSSVFNLYTDPQES-DSIG-------VRHIPMGVPLQTEMHAYMEILKKY 542 + + + NL DP E V HI T M+ ++ Sbjct: 441 -WICPLTPLRAPLLTNLRQDPYERARDESGSYERWYVDHIFEFSRGITMTAQQMKTFVEF 499 Query: 543 PPRAQIKS 550 PPR + S Sbjct: 500 PPRQKPAS 507 >UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000586CBD Length = 525 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 121/462 (26%), Positives = 202/462 (43%), Gaps = 40/462 (8%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRA 141 K+PN+++F DD+G+ D+ G + TP++ +A+ G++LT YS P SP+RA Sbjct: 22 AKRPNIIIFYADDLGYGDLEPYGHPTSS---TPNLGRLAAGGIVLTQFYSSSPVCSPSRA 78 Query: 142 TILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMG--ENKES 197 +LTG+Y + G+ + G T + ++L +GY + A+GKWH+G N Sbjct: 79 ALLTGRYQMRSGVYPHVFNVEMSGGLPLNETLISKMLKPEGYRSAAVGKWHLGLGNNSVY 138 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P N GFD+F G + + N +P EY F+ + Sbjct: 139 LPHNHGFDEFLGLPASPSQCRCSVCFYPNVTCHRAPCSPEYSPCALFNGTTII------- 191 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 P + LD ++ +F+ ++ PFFLYY + H Y + +G+S Sbjct: 192 -----EQPADLLTLDDKYAMQSRRFIRTNVETGTPFFLYYASHHTHHPQYAGKETSGTS- 245 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAK 375 R +GD + ++ +Y+ L++NG L++T F+SDNGP + G + K Sbjct: 246 IRGRFGDSLAALDWEVGQIYEELKENGILEDTFFFFSSDNGPSLSLENFGGNAGLMKCGK 305 Query: 376 GSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 +T+EGG+RVP V+W G I P +S + D+ PT + +D Sbjct: 306 ATTYEGGIRVPAIVHWPGQITPGRSMELSSTLDVLPTIASITNAKLP--------NVTLD 357 Query: 436 GVDQTSFFLGTNGQSNRKAEHYF-----LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 G D + F S R++ Y+ K AVR ++K + Sbjct: 358 GYDMSPFLFQGM-PSLRESFFYYPSKVDTEHKSYAVRYKQYKAVFYTEGSALSNNKNKDV 416 Query: 491 GFTGTVMQT--AGSSVFNLYTDPQESDSIGVRH-IPMGVPLQ 529 GT ++T +F+L DP E +I + H + L+ Sbjct: 417 DCRGTSLRTYHDPPMLFDLEQDPSEQYNISINHSPERDIILK 458 >UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PU94_9SPHI Length = 443 Score = 404 bits (1038), Expect = e-111, Method: Composition-based stats. Identities = 126/474 (26%), Positives = 198/474 (41%), Gaps = 61/474 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 LA +PN ++ +DD+G+ DVG NG TP++D +A +G+ ++ YS Sbjct: 15 LAVFNSSAQTQPNFIIIYVDDMGYGDVGINGNP---NIETPNLDRMAMEGMRFSNYYSAS 71 Query: 134 PSSSPTRATILTGQYSIHHGI-LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 P+ + +R +LTG+Y G + Q G Q +T+ + L ++GY T GKWH+G Sbjct: 72 PACTASRYALLTGKYPSRAGFRWVLNPTDQIGIHQQESTIAERLKEKGYRTAIYGKWHLG 131 Query: 193 E-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 KE P GFD++ G +DM +P D+ Sbjct: 132 STRKEFLPLANGFDEYVGLPYSNDM-------------------------IPPKYPDIAL 166 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + G + + K L + + + + F+ K AK +PFF+Y H + + Sbjct: 167 LSGYDTLELNPDQSK----LTRLYTEKAIAFITKNAK--QPFFIYLPYAMPHTPLHASED 220 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP- 370 + G S R YGD + E++ L L++N T +VFTSDNGP +G + Sbjct: 221 FLGKS-KRGLYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNGPWLIQNQNGGSAG 279 Query: 371 -FRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLV 428 FR KGSTWEGG+R P F++ I + + D+ PT LAG Sbjct: 280 LFRDGKGSTWEGGMREPFFLWGHHTIPKGYVENEVFTALDMLPTITALAGISAG------ 333 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 IDG + + G R YF L+ +L AVR +K HV Sbjct: 334 --PNKIDGTNLKPLWSGKKDTKGRDEFFYFGLDHQLMAVRKGPWKLHVKTY--------- 382 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +FNL DP E ++ ++ M L T + + + + + Sbjct: 383 --SQLGLVYFDKQLPLLFNLDHDPSEKYNLASQYPEMVSDLTTLILSKEKEIAE 434 >UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZGR2_BRAFL Length = 598 Score = 404 bits (1038), Expect = e-111, Method: Composition-based stats. Identities = 121/492 (24%), Positives = 202/492 (41%), Gaps = 60/492 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 +++++ + KPN+V L DD GW D+G++G + TP++D +A++G+ L + Y QP Sbjct: 112 SDIQESSSGKPNIVFILADDYGWNDIGYHGSVI----RTPNLDRLAAEGVKLENYYVQPL 167 Query: 136 SSPTRATILTGQYSIHHGILMPPMY-GQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE 193 SP+R ++TG+Y I +G+ ++ QP GL TLPQ L + GY T +GKWH+G Sbjct: 168 CSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGF 227 Query: 194 NK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 K + P + GFD F G+ + ++ Y R P + + Q Sbjct: 228 YKQDYTPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQTWSGLDLRDQ----------- 276 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 + + D Y L + + ++ + + +KP FL+ + H + Sbjct: 277 ----NRPVTDQNGTYSTHL---FANKAIEIIAQQ-DKNKPMFLFLSFQAVHDPLQAPEED 328 Query: 313 AG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 S R Y M+ N+ + L++ G DNT+++F++DNG + Sbjct: 329 ISRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGGRVDRG-GI 387 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKG-MIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 P RG KGS WEGGVR FV + R SD ++ ++D FPT + LA + Sbjct: 388 NWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLA-------SG 440 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--------------------YFLNGKLAAV 466 T +DG D R+ H F AA+ Sbjct: 441 STNGTKPLDGHDVWEAISDGKPSPRREILHNIDPMFHTVPSPRPHQWGDRVFNTSVHAAI 500 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R ++K + +FN+ DP+E + +H + Sbjct: 501 RSGDWKLLTGYPGNTSRVPPPSSTKEEPADTPGKHLWLFNIREDPEERTDLSQKHPGVVQ 560 Query: 527 PLQTEMHAYMEI 538 L ++ Y Sbjct: 561 ELLEKLARYNRT 572 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 123/503 (24%), Positives = 186/503 (36%), Gaps = 80/503 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 A+ +K K PN+V DD+G+ ++G G TP++D +A +G+ T Y+ P Sbjct: 17 AQAQKAPDKLPNIVYIYADDLGYGELGCYGQQKI---KTPNLDRLAKEGIRFTQHYTGTP 73 Query: 135 SSSPTRATILTGQYSIHH---------GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQA 185 +P RA ++TG+++ H G GQ T+ +LL +GY T Sbjct: 74 VCAPARAMLMTGKHAGHSAIRGNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATAL 133 Query: 186 IGKWHMGEN-KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 GKW MG N E P GFD + G+ + + P DR + + Q Sbjct: 134 TGKWGMGMNNTEGTPTRQGFDYYYGYLDQKQAHNLY------PSHLWENDRWDTLAQPW- 186 Query: 245 SKDDVHAVRGGEQQAIADITP-KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 D+H + AD K E + + + F+D+ PFFLY H Sbjct: 187 --QDIHRKLDPAKATDADFESFKGKEYAPAKMTEKALAFIDRSKAG--PFFLYMPYTLPH 242 Query: 304 F-------------------DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 Y YA + ++Y + ++D + L+ G Sbjct: 243 VSLQAPDEYVKKYIGQFDEKPYYGEKNYASTKYPLSTYASMITFLDDQVGIILDKLKALG 302 Query: 345 QLDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-R 398 DNT+++F+SDNG + RG K +EGG+R P V W G I+P R Sbjct: 303 LDDNTIVMFSSDNGATFNGGVNPQFFNSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGR 362 Query: 399 KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 SD + DL PT +L G DG+ LG + + YF Sbjct: 363 VSDHVSAQFDLMPTLAELTGQASPPT----------DGISFLPELLGQTNRQKKHEFLYF 412 Query: 459 LN---GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 G AVRM ++K + +FNL TD ES Sbjct: 413 EYPEKGGQIAVRMGDWKGVKTDLR----------------KNPGNPWQLFNLKTDRSEST 456 Query: 516 SIGVRHIPMGVPLQTEMHAYMEI 538 + H + L + E Sbjct: 457 DVAASHPDILKKLDQIVKREHEE 479 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 401 bits (1030), Expect = e-110, Method: Composition-based stats. Identities = 124/491 (25%), Positives = 201/491 (40%), Gaps = 53/491 (10%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + A+ + ++PN+V L DD+G+ DVGFNG + TP+ID +A +G+I Y+ Sbjct: 17 TQRADAQAPKPQRPNIVFILADDLGYGDVGFNGQKLI---KTPNIDKLAKEGMIFNQFYA 73 Query: 133 -QPSSSPTRATILTGQYSIHHGILMPPMY---GQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 +P+R+++LTGQ++ H I GQ +TTL ++L GYVT A GK Sbjct: 74 GTSVCAPSRSSLLTGQHTGHTYIRGNKGVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGK 133 Query: 189 WHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 W +G E P GFD F G+N S + + + + + + ++ + Sbjct: 134 WGLGPVGSEGDPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNK-----GLIHN 188 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG--TRGCH-- 303 +A +++A++ + + + ++ Y + + + D F Y G H Sbjct: 189 KEYAPDLIQKKALSFVNAQDGKQPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKG 248 Query: 304 FDNYPNA---KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 D P A YA ++ + ++ + L+K G NTL++FTSDNGP Sbjct: 249 ADYGPGANGGGYASQDFPHATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPH 308 Query: 361 AEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTAL 414 E + FRG K +EGG+R P W I+P KSD I D+ PT Sbjct: 309 VEGGADPRFFNSGAGFRGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFA 368 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDEFK 472 +LA P IDG+ T G Q + + G AVR +K Sbjct: 369 ELANAPAP---------RNIDGISFTDALKGKAIQKKHDYLYWEFHEQGGRQAVRQGNWK 419 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 L A +++L DPQE +++ + L M Sbjct: 420 AVRL----------------KAAGNPDALVELYDLSKDPQEKNNLTPQFPEKAKELGQIM 463 Query: 533 HAYMEILKKYP 543 + +P Sbjct: 464 NRAHVSSAIFP 474 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 124/500 (24%), Positives = 190/500 (38%), Gaps = 72/500 (14%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 +L +EK+ KPN+V+ L DD+G+ D+ G V TP +D +AS+G+ LT Y+ Sbjct: 20 PELQAVEKQQAAKPNIVIILCDDLGYGDLACYGHPVI---KTPHLDQLASEGMRLTDCYA 76 Query: 133 -QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWH 190 P SP+RA +LTG+ G+ G P L+ T+ QLL GY T +GKWH Sbjct: 77 SAPVCSPSRAGLLTGRTPNRLGVYDWIPEGHPMHLKRDEVTVAQLLQQAGYDTAHVGKWH 136 Query: 191 -MGE---NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 G ++ QP + GF + H NP + + Sbjct: 137 CNGMFNSKEQPQPGDHGFRHWF------STQNNALPTHENPNNFVRNGKP---------- 180 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--- 303 + + G Q +A D G+++L + +KPFFL+ H Sbjct: 181 --LGEIEGFSCQIVA---------------DEGIRWLSDWREKEKPFFLHVCFHEPHERV 223 Query: 304 -FDNYPNAKYAGSS--PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Y S + Y + M+ L L++ DNTL+ FTSDNGPE Sbjct: 224 ASPPALVETYLDKSLYEDQAQYFANVANMDRAVGKLLIKLDELKVADNTLVFFTSDNGPE 283 Query: 361 A------EVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFP 411 +P RG K +EGG+RVP V W G I+ + V DL P Sbjct: 284 TLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEIATPVCSVDLLP 343 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK---LAAVRM 468 T ++AG VP +DG F G + + A+R Sbjct: 344 TFCEIAGV-------AVPDQRPLDGASLLPLFAGNKIERTTPLFWNYYRAYSTPRVAMRE 396 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVM----QTAGSSVFNLYTDPQESDSIGVRHIPM 524 ++K P G + + ++NL D E ++ + Sbjct: 397 GDWKVVAHWSGPEGIIPLGGNVNSVSQEIIKNAKLTKFELYNLKDDISEQHNLAWQEQKR 456 Query: 525 GVPLQTEM-HAYMEILKKYP 543 L+ ++ Y + K+ P Sbjct: 457 LDTLKKKLVQKYAAVQKEGP 476 >UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CGJ8_9PLAN Length = 520 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 120/523 (22%), Positives = 193/523 (36%), Gaps = 89/523 (17%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSS 136 + K+ N+V L DD+G+ DV TP ID +A++G+ T A++ Sbjct: 25 IAHAADKQSNIVYILADDLGYGDVSCYNPE--SKIKTPHIDRLAAEGMKFTDAHTPSAVC 82 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 +PTR ILTG+Y + + G P Q T+P LL GY T IGKWH+G Sbjct: 83 TPTRYGILTGRYCWRTRLKYRVLDGFDPPLIEQDQVTVPSLLKKAGYDTACIGKWHLGMQ 142 Query: 195 KESQ-------------------------------PQNVGFDDFRGFNSVSDMYTEWRDV 223 + P GFD + G ++ +M Sbjct: 143 WTDKNGQPVPAVPIDRRQRPRVGDDVDYTKPILGGPLTSGFDYYFGISASLNM------- 195 Query: 224 HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL 283 +P + DR + +P + + + D T + + VK++ Sbjct: 196 --SPFCFIRNDRPVILPTIPSERIQTEFLSVDQGMRSPDFT---IRSVMPTLTGEAVKYI 250 Query: 284 DKMAKS--DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLE 341 ++ K ++PFFLY+ H PN ++ G S A YGD ++E++ + L+ Sbjct: 251 ERHGKESPERPFFLYFPLTAPHLPLVPNDEFKGKSAA-GEYGDFVLEVDATVGAIMDALQ 309 Query: 342 KNGQLDNTLIVFTSDNGP------------------------EAEVPPHGRTPFRGAKGS 377 + G +NTL++FTSDNG + G RG K Sbjct: 310 RTGVAENTLVIFTSDNGGLYHWWTPQETDDLKHYKPNHRGQYVKDRGHQGNAHLRGTKAD 369 Query: 378 TWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 WEGG RVP V W G +D +V+L DL T + +P D Sbjct: 370 IWEGGHRVPFIVRWPGKTPADSTNDELVELTDLLATCAAIT-------DTKLPDGDAQDS 422 Query: 437 VDQTSFFLGTNGQSN-RKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 V+ LG + R+ + +VR +K P + + Sbjct: 423 VNILPALLGKKSDTPLREYAIHHSLWGHFSVRQGPWKMI-----PKRGSGGFTRAREVEP 477 Query: 496 VMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++NL DP E+ ++ + H + PL + + Sbjct: 478 AAGEPTGQLYNLKQDPSETKNVWLEHPEVVKPLSAILEQVQKQ 520 >UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7HQ00_PARL1 Length = 553 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 127/516 (24%), Positives = 205/516 (39%), Gaps = 89/516 (17%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 E + PN+VV L DD+G+ D+ GGG+ PTP+ID++A G TSAYS + Sbjct: 62 AAEPAGNRPPNIVVILADDLGFNDISHFGGGIV---PTPNIDSIARGGANFTSAYSGTAA 118 Query: 136 SSPTRATILTGQYSIHHGILMPPMY---------------------------------GQ 162 +P+RA I+TG+Y G P + Sbjct: 119 CAPSRAMIMTGRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFRE 178 Query: 163 PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRD 222 G TL + L +GY IGKWH+G E P GFD+ S + + D Sbjct: 179 QGLPGSEITLAEALKPKGYHNIHIGKWHLGNAPEFLPNAQGFDESVMLESGLFLPEDSPD 238 Query: 223 VHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKF 282 V VN ++ P I Q +++ G L + D +K Sbjct: 239 V-VNAKLPFDP-----IDQFLWARMQYATSYNGSAWFEP------KGYLTDFYTDEAIKA 286 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLY 337 ++ A ++PFFLY G H + + Y +V ++ + Sbjct: 287 IE--ANRNRPFFLYLAHWGVHTPLQASKADYDALSHIEDERLRVYAAMIVALDRSVGRVL 344 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVP-PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 ++L++NG +NTL++F+SDNG + P P+RG K + +EGG+RVP F W I Sbjct: 345 QSLKENGLEENTLVIFSSDNGAPGYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARIP 404 Query: 397 PRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 V D+FPT + AG +P IDG+D + + R Sbjct: 405 AGTERTTPVAHLDMFPTIVAAAG-------GELPADRVIDGIDLLPYAARGEKPAPR--P 455 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 ++ +G AV+ D +K + + + +FNL TDP E + Sbjct: 456 IFWRDGHYQAVQADGWKLQM--------------------AERPNKTWLFNLKTDPTEQN 495 Query: 516 SIGVRHIPMGVPLQTEMHAYMEILKK--YPPRAQIK 549 ++ + L+ + A+ ++ +P A++ Sbjct: 496 NVADENPEKVAELKALVEAHNATQREPLFPAVAEMP 531 >UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP2_9BACT Length = 446 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. Identities = 119/479 (24%), Positives = 187/479 (39%), Gaps = 65/479 (13%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPT 139 + KPN+V+ DD+GW DV ++G TP IDA+A G+ Y+ P+ Sbjct: 15 RAADKPNIVLVFADDMGWGDVAYHG---VEDAQTPAIDAIAKGGVWFEQGYAAASVCGPS 71 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 RA ILTG+Y G++ G + + +LL GY + A GKWH+G K P Sbjct: 72 RAGILTGRYQQLFGVVTN-GDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQFP 130 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 + GFD F GF+ + Y + +P + F++D V G Sbjct: 131 NDRGFDTFYGFHFGAHDYYRADKKLNKKKKGYAP--------IYFNQDIVDYKEG----- 177 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA- 318 + L ++ D+ V+F+++ D+PFF+Y H +Y P Sbjct: 178 ---------DYLTEKITDHAVEFIEE--NKDQPFFMYVAYNSVHSPWQVPDEYLARIPES 226 Query: 319 ----RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP---------EAEVPP 365 R + ++ M+D + L++ +NT+ VFT+DNG E + Sbjct: 227 VPAYRRLFLAMVLAMDDGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRM 286 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKV 424 FRG KG T+EGG+RVP + W I+ K + V DL PT L A + Sbjct: 287 SMSQGFRGYKGDTYEGGIRVPFCMSWPKKIKSGNKFEAPVIAYDLAPTFLSAASLEYS-- 344 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL--AAVRMDEFKYHVLIQQPYA 482 T G D + + + + L AVR ++K Q+ Sbjct: 345 ------TKQFSGKDLLPYLEDEQKGRPHETLFWHRHSGLDDYAVRHGDWKLTYNDQE--- 395 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 G + ++ +FNL DP E + L+ + E K Sbjct: 396 --------GTSKDFLKKVHLKLFNLKQDPYEKKDLADSMPEKLQQLKQLYFNWHETHAK 446 >UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2X1_9BACE Length = 494 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. Identities = 140/493 (28%), Positives = 215/493 (43%), Gaps = 84/493 (17%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 ++ L +E + G PNVVV +DD+G+ D+G G TP+IDA+A G++LT+ Y Sbjct: 57 KRYLEGVELENGDPPNVVVIYVDDMGYGDLGCTGATAIS---TPNIDALAEGGVLLTNYY 113 Query: 132 S-QPSSSPTRATILTGQYSIHH---GILMPPM------------------YGQPGGLQGL 169 + P S +RA +LTG+Y I G M Y G Sbjct: 114 APAPICSASRAGLLTGRYPIRTLTSGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDE 173 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 LP++L GY T +GKWH+G +E +P N GFD F G Sbjct: 174 ILLPEVLQQAGYETALVGKWHLGIREEERPYNRGFDLFYGALYS---------------- 217 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 DD R + P + + +F+D Sbjct: 218 -----------------DDNDPHRIYHNDEVVHDEPYDQSGMTKELTQVAKQFIDD--NQ 258 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 D PFFLYY + H+ + + ++ G+S A YGDCM E++ + TLE+NG L+NT Sbjct: 259 DGPFFLYYASPFPHWPSNASEEWLGTSQA-GIYGDCMQEVDWSVGEIMDTLEENGLLENT 317 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLAD 408 L++FTSDNGP + G+ RG K + + GG VP Y G I + DG++ D Sbjct: 318 LVIFTSDNGPWYDGATGGQ---RGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVD 374 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 +FPT L+L G +P+ IDG+D F G + S R + A+ Sbjct: 375 VFPTILNLLGIE-------LPQDRVIDGMDMWPFLTGQSD-SPRTELFLNKDKDTFALIE 426 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 D FKY +++ Y+ + + M G ++NL TDP+E+ + + Sbjct: 427 DNFKY---LERSYSENGTYW--------MLQQGPFLYNLDTDPEEAYDVTTHFPEKAEEM 475 Query: 529 QTEMHAYMEILKK 541 ++ ++ + LK+ Sbjct: 476 AQKIDSFKQSLKE 488 >UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ23_9FIRM Length = 483 Score = 400 bits (1027), Expect = e-110, Method: Composition-based stats. Identities = 112/479 (23%), Positives = 190/479 (39%), Gaps = 57/479 (11%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 KKPN++VFL DD G+ D+ G TP++D +A+ G T Y+ SP+RA Sbjct: 15 KKPNIIVFLTDDQGYGDLSCMGSTDVC---TPNLDILAAGGARFTDFYAGSAVCSPSRAC 71 Query: 143 ILTGQYSIHHGILMP--PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ + G G+ T L D GY T +GKWH+G E +P Sbjct: 72 LLTGRYPYMTGVRSILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAVPECRPT 131 Query: 201 NVGFDDFRGFN-SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 ++GFD F GF V+D ++ N ++P+ + ++D ++ Sbjct: 132 HMGFDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLW-------ENDERCLK------ 178 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AKYAGS 315 T +Y +L + G++F+ + + D PF LY H+ + ++ Sbjct: 179 ---YTGEYSTEL---FARKGLEFIREQVEKDMPFALYCAFNAPHYPMHAPYKYLERFKHL 232 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP---------- 365 R + ++D + L++ G ++T+I F SDNGP E Sbjct: 233 PEDRQIMAAMLSAVDDGVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYY 292 Query: 366 -HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 +G K S ++GG+RVP W M+ + D+FPT ++ AG + Sbjct: 293 GGSTGGLKGHKFSLFDGGIRVPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNAS- 351 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 I G D G K Y+ G+ AVR +K + + Sbjct: 352 -------DYEISGCDILPVMT--IGARRDKDCLYWEMGQQTAVRRGNYKLVI-----NGF 397 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 + G+ + +L D E ++ + L+ + + L+ Y Sbjct: 398 LRDGWSLPLDPKTETKHEVWLSDLSQDMGEEHNLVEEMPELAKELEEKALTWRRDLEAY 456 >UniRef50_Q7UG72 Arylsulfatase A [precursor] n=1 Tax=Rhodopirellula baltica RepID=Q7UG72_RHOBA Length = 503 Score = 400 bits (1027), Expect = e-109, Method: Composition-based stats. Identities = 128/483 (26%), Positives = 200/483 (41%), Gaps = 46/483 (9%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 E G +PN+VV +DD+ + D+G G A G TP++D +A++G T Sbjct: 24 APEDIAGSRPNIVVIYMDDMAYADIGPFG---AKGYSTPNLDRMANEGRKFTDF------ 74 Query: 137 SPT----------RATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQA 185 R+ +LTG Y G+ + G TT ++ GY T Sbjct: 75 ---SVSSAVCSASRSALLTGCYHRRVGLSGALGPQAKIGLAPAETTFAEVCKSAGYRTAC 131 Query: 186 IGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKWH+G + + P N GFD F G +DM+ D + P+ LP Sbjct: 132 HGKWHLGHHPKFLPTNQGFDQFYGIPYSNDMWPLHPDTIRRQQ--KDPNDPGNWPPLPI- 188 Query: 246 KDDVHAVRGGEQQAIAD-ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 + ++ G + + D + P E + V+F+ + SDKPF LY H Sbjct: 189 ---IESIAGQPPRIVNDNVQPADQEQMTVELTRRSVEFIKNQS-SDKPFLLYLPHPMVHV 244 Query: 305 DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 Y + ++ G S A +GD M+E++ + +E Q NTL++FTSDNGP Sbjct: 245 PLYVSERFRGKSGA-GLFGDVMMEVDWSVGEILSAIESIDQQKNTLVIFTSDNGPWLSYG 303 Query: 365 PHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG 421 H + P R KG+ WEGGVR PT ++W I + D+ PT ++L G Sbjct: 304 NHAGSAAPLREGKGTQWEGGVREPTLMWWPETIPAGTTCETFCSTIDVLPTIVELTGGEA 363 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNG-QSNRKAEH-YFLNGKLAAVRMDEFKYHVLIQ- 478 + IDG L G +S ++ Y+ G+L +R + FK Sbjct: 364 PE--------RKIDGHSIVDLMLDVPGAKSPHESFVGYYGGGQLQTIRNERFKLVFPHAY 415 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + + G G G M +G +++L D E+ ++ H + LQ Y + Sbjct: 416 RTLGDREPGKDGMPDGYAMTKSGLELYDLDADVSETTNVIEAHPEVVKQLQAAAEVYRQQ 475 Query: 539 LKK 541 L Sbjct: 476 LGD 478 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 400 bits (1027), Expect = e-109, Method: Composition-based stats. Identities = 119/480 (24%), Positives = 201/480 (41%), Gaps = 71/480 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 K+PN++VF DD+G+ D+G G TP +D +AS+G+ T+ YS P SP+RA+ Sbjct: 2 KRPNIIVFYCDDLGYGDLGCYGSDAM---KTPHLDQLASEGIRFTNWYSNSPVCSPSRAS 58 Query: 143 ILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ ++ G G TTL L + GY T GKWH+G + E P Sbjct: 59 LLTGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAEYGPN 118 Query: 201 NVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F GF D Y+ + + VH + E + Sbjct: 119 AHGFDQFYGFRAGCIDYYSHI-----------------FYWGQGGGVNPVHDLWRNETEV 161 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA----KYAGS 315 + E + + ++D A D+P+F+Y H+ + ++ Sbjct: 162 WENG-----EYMTEAITREATSYID-AAPDDEPYFMYVAYNAPHYPMHAPKAYLDRFPDL 215 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP---------- 365 P R + ++D + K L++ G ++T+I F+SDNGP E Sbjct: 216 PPDRRIMAAMIAAVDDGVGEIVKALKQKGAYEDTIIFFSSDNGPSTESRNWLDGTEDLYY 275 Query: 366 -HGRTPFRGAKGSTWEGGVRVPTFVYWKGMI---QPRKSDGIVDLADLFPTALDLAGHPG 421 FRG K S +EGG+R P + + + Q + SD + + D+FPT L+L+G Sbjct: 276 GGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTMLELSGIGT 335 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 + +DG G N S RK + G+L AVR ++K + Sbjct: 336 EGYS--------LDGHSVFDALSG-NALSPRKQLFWEYEGQL-AVREGKWKLVL------ 379 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 G + + + +L D E ++ ++ + L+ ++ + + L++ Sbjct: 380 -------NGKLDFSRTEADAVHLSDLEQDSSERINLVKQYPEIAQRLERDVRQWYQSLQE 432 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats. Identities = 130/472 (27%), Positives = 190/472 (40%), Gaps = 85/472 (18%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 KPN+++ L DD+G+ DVG++G PTP+ID +A++G+ ++ YS PTR Sbjct: 16 ATDKPNILIILADDLGYADVGYHGLEEI---PTPNIDRIANEGVQFSAGYSNGSICGPTR 72 Query: 141 ATILTGQYSIHHGI------LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-- 192 A +++G Y G + G + + TL Q + GY T GKWH+G Sbjct: 73 AALMSGVYQQRIGCEGICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGE 132 Query: 193 --ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +K P + GFD+F G + +Y + + Sbjct: 133 RLFDKTLMPTSRGFDEFFGILEGASLYDDTVN---------------------------R 165 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 + Q + D +Y D R V F+ + K DKPFFLY H + Sbjct: 166 ERKYIRQDTVIDYEGEYFTDAIGR---EAVSFITR--KGDKPFFLYLPFTAVHAPMQASE 220 Query: 311 KYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 KY P R + + M+D ++ LE G LDNTLIVF SDNG + + Sbjct: 221 KYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKPDNNY 280 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K +EGG+RVP V W G I K+ D V L D+FP+AL+ A Sbjct: 281 SLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAAQIT--- 337 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 VPK I+ G Q+ A + GK AVRM ++K Sbjct: 338 ----VPKD--IEAKTILPLMQGKTNQTPHPAMFWKRAGK-MAVRMGDWKL---------- 380 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 S +FNL D ES +I +H + + + Sbjct: 381 ------------SNAGGPSELFNLKQDISESRNIIDQHPDIANKMNRLWLNW 420 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. Identities = 112/485 (23%), Positives = 180/485 (37%), Gaps = 72/485 (14%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 +A K+PN+V DD G+ D GF G TP++D +AS+G+ T Y Sbjct: 15 SVACTSLSYAKQPNIVFLFSDDAGYADFGFQGSETM---KTPNLDQLASEGVRFTQGYVS 71 Query: 134 -PSSSPTRATILTGQYSIHHGILMPPMYG-----------QPGGLQGLTTLPQLLHDQGY 181 + P+RA I+TG+Y G + G + G T+ + GY Sbjct: 72 DSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAEMGIPLDEVTMGDYMKSLGY 131 Query: 182 VTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQ 241 T GKWH+G E P + GFD+F GF Y + + A+ D K+ Sbjct: 132 RTAFYGKWHLGGTDELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTD-----KK 186 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 L D G L + +F++K DKPFF++ Sbjct: 187 LEHGIDQFQEHEGY---------------LTDVLAEKANQFIEKA--PDKPFFIFLSFNA 229 Query: 302 CHFDNYPN----AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 H AK+ R + ++ + L++ G D+TL+VF++DN Sbjct: 230 VHTPMEATPEDLAKFPQLKGKRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDN 289 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDL 416 G + P G K + EGG+RVP V W + K D V DL PT Sbjct: 290 GGPTDKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKA 349 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVL 476 G + +DGVD + G N ++ ++ Y+ AA+R ++K Sbjct: 350 GGGEEVM--------SELDGVDLMPYITGQNNKAPHES-MYWKKETRAAIRQGDWKLLRF 400 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 +P ++NL D E ++ + + + ++ Sbjct: 401 PDRPA---------------------ELYNLANDIGEQHNLAAQEPERVKQMYKDFFSWE 439 Query: 537 EILKK 541 L++ Sbjct: 440 MTLER 444 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 398 bits (1022), Expect = e-109, Method: Composition-based stats. Identities = 114/477 (23%), Positives = 174/477 (36%), Gaps = 83/477 (17%) Query: 92 LLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSI 150 + DD+G+ D+ G TP +D +A+ G+ T +S SPTRA +LTG+Y Sbjct: 1 MADDLGYGDLSCYGSQNCN---TPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQ 57 Query: 151 HHGI----LMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFD 205 GI P + GLQ TL Q L D GY T GKWH+G ++ P GF Sbjct: 58 RAGIDGVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQRQYNPTFRGFQ 117 Query: 206 DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP 265 F G+ S + Y D + + A++ Sbjct: 118 QFVGYVSGNVDYFAHLDGTGVFDWWHN----------------------------AELNR 149 Query: 266 KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD-----NYPNAKYAG---SSP 317 + + D+ ++F+ + +KPFF+Y H + P K G S Sbjct: 150 EEQGYVTHLINDHALEFIRQQ--QEKPFFVYIAHEAVHSPYQGPHDQPMRKEGGGDIKSA 207 Query: 318 ART----SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 R +Y + EM+ + L++ + T I F SDNG RG Sbjct: 208 KRKDIANAYREMNTEMDKGIGQIVDVLKEVNLTEKTFIFFLSDNGANK---NGSNGKLRG 264 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 KGS WEGG RVP W G I D V DL PT L+LA +P Sbjct: 265 FKGSLWEGGHRVPAIACWPGRIPEGTVCDEPVISIDLMPTILELANA-------KIPAGH 317 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 +DGV S R+ ++ +A+R +K + Sbjct: 318 KLDGVSLVSLLKDRKSLVPRQI--FWEYNGKSAMRQGHWKLVL----------------- 358 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 + +++L D ES ++ +Q+ + A+ ++K K Sbjct: 359 --NQTRKEPIELYDLTRDMSESKNLADNQPQRVQQMQSALAAWKSDVQKTATTQPEK 413 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 118/485 (24%), Positives = 187/485 (38%), Gaps = 82/485 (16%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + + + + PNV++ + DD+G+ DVG G TP++DA+A +G T +S Sbjct: 6 NSSSLVSGREKQPPNVILIVADDMGYGDVGCYGNQHI---KTPNLDALAKKGARFTDFHS 62 Query: 133 Q-PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ----GLTTLPQLLHDQGYVTQAIG 187 P +PTRA +LTG Y G+ + P + + T + L GY T +G Sbjct: 63 NGPLCTPTRAALLTGCYQQRVGLHIIPKDQRYAMAKAMSLEEITFAEALKSVGYSTALVG 122 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+ P GFD++ G DM+ + K P Sbjct: 123 KWHLGDRPAFLPPRQGFDEYFGIPYSHDMHP-------------------WRKSFP---- 159 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 + +RG E I ++ P ++ L Q + VKF+ K D+PF LY H + Sbjct: 160 PLPLMRGEE---IVELNPD-LDHLTQYCTEEAVKFISK--NKDRPFLLYMPHPMPHQPVH 213 Query: 308 PNAKYA------------GSSPARTS--YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 + ++A G Y + E++ + K + G ++T + F Sbjct: 214 VSERFAKRFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVRALGIEESTFVAF 273 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPT 412 TSDNGP P RG K WEGG RVP YW+ I+P D I DLFPT Sbjct: 274 TSDNGPAI----GSAGPLRGKKRELWEGGHRVPFIAYWQEKIRPGVVIDEIAMSMDLFPT 329 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 + P + IDGV+ + S R ++ + A R +K Sbjct: 330 MAAMGRAPLPR--------KKIDGVNLLPLLCEGDKLSERTV--FWRSKGKKAARKGPWK 379 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + + +++L D E ++ + LQ E Sbjct: 380 LLM----------------QPTKKKRPTSIGLYHLNNDLSEQHNLAEIYPEKLKSLQLEF 423 Query: 533 HAYME 537 A+ + Sbjct: 424 AAWEK 428 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 118/470 (25%), Positives = 175/470 (37%), Gaps = 67/470 (14%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRA 141 K+PN++V + DD G+ D F G + TP++DA+A G+ T Y P SP+RA Sbjct: 36 TKRPNIIVIVSDDQGYADASFQGSKDIL---TPNLDALAKSGVRCTRGYVTAPVCSPSRA 92 Query: 142 TILTGQYSIHHG----ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 ++TG+Y G I+ T LPQ+L GY T +GKWH+G Sbjct: 93 GLMTGRYQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGLQDGC 152 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 +P GFD+F G + Y D + R Sbjct: 153 RPYERGFDEFFGIITGGHDYFVNHPEERAV------------------GDQSYKARIERN 194 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMA--KSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + + P Y+ D + V+ + + + D+P FLY H + Sbjct: 195 GPVGEAVPGYLTD---AFGADAVRIIRESHTKRPDQPLFLYLAFNAPHTPTQAPKDLVDT 251 Query: 316 SPA------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT 369 PA R +Y + M+ + L++NG +T IVF SDNG A P + T Sbjct: 252 MPATLESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFIVFFSDNGG-ANHPYYDNT 310 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P R KGS +EGG+RVP F + G I + V D+F TA LAG Sbjct: 311 PLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSLDVFATACALAGTKPE------ 364 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 + +D VD G Q + G AAV + K V + Sbjct: 365 -TSHPLDSVDMLPVLEGNARQPTHATLFWEFPGFGAAVADRDLKLVVPKKGS-------- 415 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 +F+L D E + ++ L T + + Sbjct: 416 -------------PQLFDLAVDIGEKSDLAAQNPEKVARLSTLLSEWHAQ 452 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 126/502 (25%), Positives = 198/502 (39%), Gaps = 66/502 (13%) Query: 68 DKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLIL 127 E+ K+ +PN++V L DD+G+ D+ G V P+ID A +GL L Sbjct: 17 TNESLAAEPTASVKSPARPNIMVVLCDDLGYGDLACYGHPVIQS---PNIDRFAKEGLKL 73 Query: 128 TSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQA 185 TS Y+ P+ SP+RA ++TG+ GI P + + T+ LL GY T Sbjct: 74 TSCYAAHPNCSPSRAGLMTGRTPFRVGIYNWIPMLSPMHVRKREITIATLLRQAGYATCH 133 Query: 186 IGKWHMG----ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQ 241 +GKWH+ + QP + GFD + H NP + R Sbjct: 134 VGKWHLNGMFNMVGQPQPSDHGFDHWF------STQNNALPTHENPFNFVRNARP----- 182 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 V ++G Q +A D ++L ++ +KPFF++ Sbjct: 183 -------VGPLQGFASQLVA---------------DEAEEWLTQLRDKEKPFFMFVCFHE 220 Query: 302 CHFDNYPNAKY-----AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H ++ A ++ + +M+D F + KTL+ +NTLI+FTSD Sbjct: 221 PHEPIASAERFRKLYTAPEGSTLPAHHGNVTQMDDAFGRILKTLDDQKLRENTLIIFTSD 280 Query: 357 NGPE--AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTA 413 NGP P P R KG+T+EGG+RVP V W +QP SD V D+ PT Sbjct: 281 NGPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPVCGVDILPTL 340 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL-----NGKLAAVRM 468 +A P P +DG + G RK Y+ N A+R Sbjct: 341 CAVADIPA-------PTDRVLDGTNILPLLEGK--PILRKKPLYWQFNRAKNDAKVALRD 391 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVM--QTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 E+K + P G V + G ++++ +D E+ + Sbjct: 392 GEWKLLAKLNVPSPKPSGGITTEEIDAVKNAKLEGFELYHIQSDIAETTDRAESEQEILK 451 Query: 527 PLQTEMHAYMEILKKYPPRAQI 548 ++ +M A + ++ PR Sbjct: 452 KMKQQMQAIFDEVQAEAPRWPA 473 >UniRef50_B8KM62 N-acetylgalactosamine-6-sulfatase n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KM62_9GAMM Length = 472 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 134/473 (28%), Positives = 216/473 (45%), Gaps = 33/473 (6%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 KPN+V+ +D+ G+ ++G GGG+ G TP ID +AS+G+ LT+ + +P+RA Sbjct: 6 AADKPNIVLINMDNFGYGELGVYGGGIVRGGATPRIDKLASEGIRLTNFNVEAQCTPSRA 65 Query: 142 TILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 ++TG+Y++ G P+ GL Q T+P++L D GY T GKW++G+ + P Sbjct: 66 ALMTGRYAVRTGNGTVPLQTVDYGLTQWEYTMPEMLSDAGYATAHFGKWNLGQREGRYPT 125 Query: 201 NVGFDDFRGFNSVSD--MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 N GFD++ G + +D + ++A ++ IK+ + +G + Sbjct: 126 NQGFDEWYGIPNSTDESEWPTNEMFLKWAKIAKETGKTPMIKETHV----LSGRKGSPTK 181 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 + ++D+ D G F+ + AK+ KPFFLY H P+A++ G S Sbjct: 182 EVKVFDSSVRPEIDREVTDLGKDFMTRQAKAGKPFFLYLPYTQTHAPVTPSAEFKGKS-G 240 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAKGS 377 +GD +++++ L +++ G DNT+ +FT+DNG E G P+ G+ + Sbjct: 241 NGKWGDILMQIDAYTGELLDKVDELGIADNTIFIFTADNGGEMTPTFQGWNGPWSGSYFT 300 Query: 378 TWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 EG +RVP V W G + K S+ IV DLF T ++AG VP ID Sbjct: 301 GMEGSLRVPFIVRWPGKVPAGKVSNEIVHEFDLFSTFANIAG-------GKVPTDRIIDS 353 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 D T FFLG QS R ++ + V+ +K Q G + + Sbjct: 354 KDMTDFFLGKQEQSGRDGFVIYVGDDIFGVKWQNYKMMF---------QELDGGNGSNKL 404 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH------AYMEILKKYP 543 FNLY DP+E + + M L +M L K P Sbjct: 405 NVFPFVRFFNLYEDPKEEYPLNLT-KDMIANLWVRWGTGPILVDHMASLAKEP 456 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 396 bits (1017), Expect = e-108, Method: Composition-based stats. Identities = 121/489 (24%), Positives = 191/489 (39%), Gaps = 62/489 (12%) Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 ++ + ++ +KPN++ L DD+G+ DVGFNG TP++D + Sbjct: 15 LIAISVGNASAADAGQSKADESNEKPNILFVLADDLGYNDVGFNGSTDI---KTPNLDGL 71 Query: 121 ASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHG--ILMPPMYGQPGGLQGLTTLPQLLH 177 A G+ +AY P P+RA I+TG+Y G +P G + Q + Sbjct: 72 AKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMK 131 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDR- 235 GY T A+GKWH+GE E P GFD+F GF + + E + N VA Sbjct: 132 SAGYFTGAMGKWHLGEASEYHPNKHGFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNI 191 Query: 236 SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 + Y+ L + +V E + V F+DK A KPFFL Sbjct: 192 NMYLTPLEHNGKEVRE----------------TEYITDGLSREAVNFVDKAAAKKKPFFL 235 Query: 296 YYGTRGCHFDNYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 Y H + R +Y + ++ + + L+KNGQ DNT+ Sbjct: 236 YLAYNAPHVPLQAKEEDMAMFSQIKDKKRRTYAGMVYAVDRGVGRIVEQLKKNGQFDNTV 295 Query: 351 IVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADL 409 IVFTSDNG + + P + KGS EGG R P V+W ++ V DL Sbjct: 296 IVFTSDNGGKLGQGAN-NYPLKEGKGSVQEGGFRTPMLVHWPKHMKAGSRFSHPVLALDL 354 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH---YFLNGKLAAV 466 +PT L G ++P+ +DG D + + + + AA Sbjct: 355 YPTFAGLGGA-------VLPEDKKLDGKDIWADIQANTAPHKDEFIYVLRHRNGYSDAAA 407 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R ++FK + ++N+ D E + I +H + Sbjct: 408 RRNQFKAVKNHNDDW---------------------KLYNIAQDISEDNDISAQHPDILR 446 Query: 527 PLQTEMHAY 535 + + M ++ Sbjct: 447 DMVSSMESW 455 >UniRef50_UPI0001A444F6 arylsulfatase A n=1 Tax=Pectobacterium carotovorum subsp. brasiliensis PBR1692 RepID=UPI0001A444F6 Length = 487 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. Identities = 132/498 (26%), Positives = 210/498 (42%), Gaps = 54/498 (10%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 L ++ KPNV++ DD+GW D+ G PTP +D +A+ G T+ Sbjct: 16 AGPALWQVAAAAQTKPNVIILFTDDMGWADMSVQGAKT----PTPHLDKLAATGQRWTNF 71 Query: 131 Y-SQPSSSPTRATILTGQYSIHHGILMPPMYG-----QPGG-LQGLTTLPQLLHDQGYVT 183 Y S SSP+R ++TG+ G+ + G P G ++ + L GY T Sbjct: 72 YVSSAISSPSRGGLMTGRIETKTGLYGTKIPGVFMDEDPDGFPDDEISMAESLQHNGYRT 131 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRD-VHVNPEVALSPDRSEYI--- 239 GKWH+G + P GFD++ G + +D ++ D V +N + P R E + Sbjct: 132 IMYGKWHLGTQSTAFPTRHGFDEWYGIPTSNDRFSTVVDQVEMNRLASSDPKRRELLSKM 191 Query: 240 -------KQLPFSKDDVHAVR-GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 +Q ++ H+ + G+Q A + + V+++ D+ Sbjct: 192 EEINRAPRQEYWNVPLYHSYKDNGKQVDYAVPQGFQQASFTKDVTNKAVQYIAD--NKDQ 249 Query: 292 PFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 FF+Y H + + ++ G YGD M+E++ +Y+ LE N +NT++ Sbjct: 250 SFFMYMAYPQTHVPLFTSPEFKGK--GHNPYGDVMLEIDWSVGQIYQALEANKLAENTIV 307 Query: 352 VFTSDNGPEAEVPPHGRT----PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA 407 +FTSDNGP + G P R K + +EGG RVP V WK I P+ D I Sbjct: 308 IFTSDNGPWLQYDKDGLAGSALPLRSGKSTVFEGGQRVPFIVNWKSHIAPKVVDDIGSTL 367 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVR 467 DL PT + + G A+ +DGVD ++ FL S R YF GK+ A R Sbjct: 368 DLLPTLMKITGSQHAQ--------RDLDGVDLSAAFLNGK-PSARTFMPYFYWGKMDAYR 418 Query: 468 MDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVP 527 ++K ++ G + +FNL D E + + Sbjct: 419 DGDYKVVFRDKK-------------AGIPVDLEKPLMFNLRDDVSEQHDLSAKEPDRYRA 465 Query: 528 LQTEMHAYMEIL-KKYPP 544 L + AY + L +K PP Sbjct: 466 LIEKARAYEQSLGEKKPP 483 >UniRef50_A4AM21 Arylsulfatase A n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AM21_9FLAO Length = 535 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. Identities = 117/517 (22%), Positives = 194/517 (37%), Gaps = 79/517 (15%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPT 139 K K PN+V L DD+G+ D+ TP+ID +A G+ T A+ S +PT Sbjct: 31 KKQKPPNIVYILADDLGYGDISAFNAE--GKIQTPNIDNLAKDGMKFTDAHTSSAVCTPT 88 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENK-- 195 R ILTG+Y+ I + G+ L TT+ L D GY T IGKWH+G + Sbjct: 89 RYGILTGRYNWRSPIKSGVLTGKSEALIPNSRTTVASFLSDNGYKTGFIGKWHLGWDWAI 148 Query: 196 -------------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVH-----V 225 + P ++GFD G + DM + Sbjct: 149 KDSTNNGGEGWNATDFENLDFTKPVTNTPNDLGFDYAYGHSGSLDMAPYVYVENGMATAK 208 Query: 226 NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 V + + + ++ P + D VH +++ + + F+ + Sbjct: 209 VDTVTVDKGKYTWWREGPTAADFVH------------------DEVTPNFFRKSMSFIKE 250 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 ++PFFLY H P ++ G S Y D +V ++D L + LE+ G Sbjct: 251 QGAEEQPFFLYLALPSPHTPILPTEEWQGKSNLN-PYADFVVMIDDYLGQLVEVLEQKGL 309 Query: 346 LDNTLIVFTSDNGPEAE-----VPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQP 397 +NT+++FTSDNG + + G P +RG K +EGG R+P V W I+ Sbjct: 310 AENTIVIFTSDNGCSPQADFKILGDLGHDPSAIYRGHKADIYEGGHRIPFVVKWPSKIES 369 Query: 398 R-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFF-LGTNGQSNRKAE 455 SD + DL T D+ D + R+A Sbjct: 370 GSVSDKTICTTDLLATVADILNVDLLDNQGE-------DSFSILPLLDTTDKREFKREAT 422 Query: 456 HYFLNGKLAAVRMDEFKYHVLI-QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 + A+R +K ++ + +G + + +++L DP E Sbjct: 423 VHHSINGSFALRKANWKMIFCTGSGGWSDPKPNSEG-----IEELPKFQLYDLANDPSEQ 477 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 ++ H + L M Y++ + P + Q + Sbjct: 478 TNLFGHHPDIEGQLSELMLDYIDDGRSTPGKKQTNEE 514 >UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R917_9PLAN Length = 486 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. Identities = 123/488 (25%), Positives = 184/488 (37%), Gaps = 84/488 (17%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A + ++PN+V + DD+GW DVGFNG TP+IDA+A G + Y Q Sbjct: 19 AVASQAADRQPNIVHIVADDLGWKDVGFNG---CTEIKTPNIDALAKGGAKFSQFYVQNM 75 Query: 136 SSPTRATILTGQYSIHHGILM--PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +PTRA ++TG++ +G+ P G +PQ L D GY T IGKWH+G Sbjct: 76 CTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQCLGDAGYKTAIIGKWHLGH 135 Query: 194 -NKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 +++ P+ GFD G D +T ++ F + Sbjct: 136 ADQKYWPKQRGFDYQYGAMIGELDYFTHDEHGVLD----------------WFRDNKPVH 179 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 +G I D KY+ D KPF+LY H + Sbjct: 180 EQGYTTTLIGDDAVKYIHGQD----------------GKKPFYLYLTFNAPHTPYQAPKE 223 Query: 312 YAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 Y P R +Y + +++ + L++ G +NTLI F SDNG + Sbjct: 224 YITKYLNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFA 283 Query: 367 G-------------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTA 413 G P+R KGS +EGG RV W G I+ + DG++ DL+PT Sbjct: 284 GQMADMSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTVDGMIHAVDLYPTF 343 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKY 473 LAG AK +DG + S R Y + A +R ++K Sbjct: 344 AALAGASIAKC-------KPLDGTNVWDTI-AEGKPSPRTEFFYSIEPFRAGLRQGDWKL 395 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 P + ++NL DP E ++I H +Q + Sbjct: 396 IWRTMLPSSVD-------------------LYNLAEDPYEKNNIAAAHPDKVATMQARIE 436 Query: 534 AYMEILKK 541 + K Sbjct: 437 TASKDAAK 444 >UniRef50_B8KTJ7 Arylsulfatase F n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KTJ7_9GAMM Length = 473 Score = 395 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 125/464 (26%), Positives = 204/464 (43%), Gaps = 35/464 (7%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 + PN+V+ L+D+ GW +VG GGG G PTP I ++A +GL LT+ +P +P+R+++ Sbjct: 32 QHPNIVLVLMDNFGWGEVGAYGGGALRGAPTPHIYSLAEEGLRLTNFNVEPECTPSRSSL 91 Query: 144 LTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +TG+Y+ + Y G + T+ ++L + Y T GKWH+G+ + P Sbjct: 92 MTGRYAARTRLRTDGTYRSVWYGITKWEVTIAEMLTETEYATGWFGKWHLGDTEGRYPTG 151 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD++ G SD W D R Y+ + + RG + + +A Sbjct: 152 QGFDEWYGIPRSSDR-AFWPDSTQYDGEGFPGARFNYV---------MESTRGEKPKELA 201 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTS 321 +D+ D + F+ + A + KPFF H P+ Y G + Sbjct: 202 VYDRAKRRLIDREITDKTIDFIQRKAAAKKPFFTLVSYTQTHEPVEPHPDYRGRT-GHGD 260 Query: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTWE 380 + D + + +D +L T++ ++TL +FT+DNG E G T P+RG S WE Sbjct: 261 FADVLAQTDDYVGDLLDTIDALDIAEDTLFIFTADNGREGIPGSWGFTGPWRGGMFSPWE 320 Query: 381 GGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 G +RVP + W G I S+ IV L DL PT + +P +DG+DQ Sbjct: 321 GSLRVPFLIRWPGKIPSGTVSNDIVHLVDLMPTFAAAT-------HSELPDDRILDGLDQ 373 Query: 440 TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT 499 FFLG S R++ ++ +L + +K +Y + Sbjct: 374 LPFFLGETENSPRESVMVYVGNELFGAKWRNWKILFKDMDTDSY-----------AIRDL 422 Query: 500 AGSSVFNLYTDPQESDSIGVRHI--PMGVPLQTEMHAYMEILKK 541 A S++NL DP+E G + PL + + L+ Sbjct: 423 AYPSIYNLIVDPKEEVPEGNYLPDTWVDAPLYQVVEDFEASLEA 466 >UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q6_9PLAN Length = 574 Score = 395 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 118/468 (25%), Positives = 190/468 (40%), Gaps = 51/468 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 +PNV+V L DD G+ DVGF G + TP +D +A + + LT Y P +PTRA Sbjct: 31 AESRPNVIVILTDDQGYGDVGFRGN---LKINTPHLDRMAEKSIELTRFYCSPVCAPTRA 87 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 ++LTG+ G++ G T+ +LL GY T GKWH+G+N +PQ+ Sbjct: 88 SLLTGRNYYRTGVIHTSRGG-AKMQGEEVTVAELLQQAGYQTGIFGKWHLGDNYPMRPQD 146 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GF + +H + + SPD+ K+ V A Sbjct: 147 QGFAESL--------------IHKSGGIGQSPDQPNSYFHPKLWKNGV-----------A 181 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY------AGS 315 + Y D+ + D + F+D+ K++KPFF+Y T H Y G Sbjct: 182 FQSTGYCTDV---FFDAALDFIDRQTKTEKPFFVYLATNAPHTPLEIAESYWKPYQRQGL 238 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 + +++ L LE++ + T+++F DNGP+ + G RG K Sbjct: 239 DETTARVYGMITNLDENIGKLLSHLERSALAEKTVVLFLGDNGPQQKRYTGG---LRGRK 295 Query: 376 GSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 T+EGG+RVP W G + K D I DL PT L L P++ + Sbjct: 296 SWTYEGGIRVPCLAQWPGHFREGEKIDQIAAHIDLMPTLLALT-------ETRCPESLKL 348 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DGVD + G + ++ + ++ L R + V+ ++ G G Sbjct: 349 DGVDLSPLLTGRKEKLPARSLFFQVHRGLTPQRYQNY--AVVTERFKLAGYPGTFGTENL 406 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 + ++L TDP E ++ H L + + +K Sbjct: 407 LLQAEPVLEFYDLSTDPGEQKNVLHSHPETVKALLKQYEDWFSEMKAT 454 >UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSP6_9BACT Length = 512 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 124/493 (25%), Positives = 196/493 (39%), Gaps = 70/493 (14%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTR 140 K+PN+++ DD+G+ DVG++G + TP+ID++A QG+ + Y P+R Sbjct: 17 ADKQPNIILIFADDMGYDDVGYHGNKRII---TPNIDSIAEQGVQFSQGYVSASVCGPSR 73 Query: 141 ATILTGQYSIHHGILMPPM---------YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 A +LTG Y G P Y G Q + + + L GY IGKWHM Sbjct: 74 AGLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHM 133 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G + +P G+D F GF + S YTEW + R+E ++ + Sbjct: 134 GFDMSLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPA----NKAQY 189 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----- 306 + +++ + + Y+ DL + D V F+D+ A DKPFFLY H Sbjct: 190 IDVFKEKGVKVVDENYLTDL---FTDEAVNFIDRNA--DKPFFLYLAYNAVHHPWQTTQH 244 Query: 307 -YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVP 364 + + + M++ + K L++ DNT+I+F SDNG P+ + Sbjct: 245 ALDKTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGI 304 Query: 365 PH---------------GRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLAD 408 H FRG KG T+EGG+RVP + W I + K D + D Sbjct: 305 EHSPKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISALD 364 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 L PT + AG K K DGVD + K ++ A+R Sbjct: 365 LQPTLVKAAGGNDKKPQ----KGFAYDGVDILPYL---KEDKEIKRSLFWRRDTDYAIRK 417 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++K ++FN+ DP+E ++ +H + L Sbjct: 418 GDWKLQ------------------WNDAHGPLTITLFNIKEDPEERSNLIKQHPELAQQL 459 Query: 529 QTEMHAYMEILKK 541 Q E + + Sbjct: 460 QNEFDTWDNSMPD 472 >UniRef50_A9W035 Sulfatase n=6 Tax=Bacteria RepID=A9W035_METEP Length = 564 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 139/502 (27%), Positives = 212/502 (42%), Gaps = 46/502 (9%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 G+KPN++V + DD+G ++G G+ G TP ID +A++G++ T Y++ S + RA Sbjct: 56 GQKPNIIVIMGDDIGIWNIGAYHRGMMAG-RTPHIDQLAAEGMLFTDYYAEASCTAGRAA 114 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +TG+ I G+ G G+ T+ L GY T GK H+G+ E P Sbjct: 115 FITGELPIRTGMTTVGQAGAAIGIPAEAVTIATALKGMGYATGQFGKNHLGDKNEFLPTV 174 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPE----VALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 GFD+F G+ D + E V + + DD R G+Q Sbjct: 175 HGFDEFFGYLYHLDAMEDPAHPAYPQELLNRVGPRNMVHSWATNVDDPTDDPRWGRVGKQ 234 Query: 258 QAIADIT--PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + T PK ME +D D + F+DK + KPFF++ H + + KY Sbjct: 235 RIEDAGTLYPKRMETIDDEIRDLALGFIDKAKANGKPFFVWLNPTRMHVTTHLSPKYQAM 294 Query: 316 SPARTSYG---DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TPF 371 ++ + M +++DV + K L+ G DNT++VFT+DNG E P G TPF Sbjct: 295 RNSKNGWSIQEAGMAQIDDVVGAVMKKLKDLGVDDNTIVVFTTDNGTEVFTWPDGGQTPF 354 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAKVANLVPK 430 +KG+ EGG R P V W G + D G++ D FPT + AG+P K Sbjct: 355 AQSKGTVMEGGFRAPAMVRWPGKVPAGTVDNGVISGLDWFPTLVAAAGNPDIGEELKKGK 414 Query: 431 -------TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 +DG +Q G G S R YF +L AVR+ ++KY + Q Sbjct: 415 QIADQTYKVHLDGYNQLDLITGK-GPSKRNEVWYFGESELGAVRIGDYKYRFIDQ----- 468 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD---------------SIGVRHIPMGVPL 528 GG+ G + + NL DP E + + + Sbjct: 469 -----PGGWLGDKTKPDVPYITNLRLDPFERTGWPDSGTKIGTQNYMNWFLYEFWRFTFV 523 Query: 529 QTEMHAYMEILKKYPPRAQIKS 550 Q E+ ++PP + S Sbjct: 524 QQEVEKLAMTAVEFPPMQKGAS 545 >UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAW4_FLAJ1 Length = 539 Score = 393 bits (1011), Expect = e-108, Method: Composition-based stats. Identities = 129/538 (23%), Positives = 202/538 (37%), Gaps = 86/538 (15%) Query: 46 YLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 +L P T + P Q A+ K + + KKPN+++ L DD+G D+ G Sbjct: 24 FLFWPINTDGTLIQPD-QKLAEGKAAFLSQKDTSAASEKKPNIIILLADDLGKYDISLYG 82 Query: 106 GGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMYGQP- 163 G PTP ID++A+ G+ T Y S SP+RA +LTG+Y G P P Sbjct: 83 GK---STPTPQIDSLAASGVTFTDGYVSSSICSPSRAGLLTGRYQERFGHEYQPGDRYPK 139 Query: 164 ----------------------------------GGLQGLTTLPQLLHDQGYVTQAIGKW 189 G + T L QGY T IGKW Sbjct: 140 NNLEYYAFKYLLNTNSWRLNPKIEYPNDASIATQGLPKSEITFADLAKKQGYSTAIIGKW 199 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G K P + GFD GF ++ + NP++ ++ +++ + + V Sbjct: 200 HLGHTKGFFPLDRGFDYHYGF---YQAFSLFAPEDNNPDI-INHHHTDFTDKTIWGNGRV 255 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 + I D + L +++ + F+DK +KPF LY H Sbjct: 256 GTGQIRRDSTIIDEK----KYLTEKFAEEAEAFIDK--NKNKPFLLYVPFNAPHTPFQVR 309 Query: 310 AKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 KY + Y + ++D + ++K G +NTLI F SDNG Sbjct: 310 KKYYDRFPNVKDENKRVYFAMISALDDAIGLIRAKVKKEGLEENTLIFFASDNGGADYTY 369 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K S +EGGV VP + WKG I+P V D+F T + Sbjct: 370 ATTNAPLKGGKFSHFEGGVNVPFALSWKGKIKPHTIYKTPVSSLDIFSTIAAVT------ 423 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 + +PK DGVD Y+ +G A+R ++K + + Sbjct: 424 -HSGLPKDRVYDGVDLVDVVNNNKQA---HQNLYWRSGDAKAIRSGDWKLIISGK----- 474 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 T + ++NL D E+ + ++ LQT + + + L K Sbjct: 475 ---------------THETWLYNLAKDKSETTDLASKNPEKVKELQTALQNWEKGLIK 517 >UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAY0_9PLAN Length = 466 Score = 393 bits (1010), Expect = e-107, Method: Composition-based stats. Identities = 114/506 (22%), Positives = 199/506 (39%), Gaps = 95/506 (18%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 + EK K+PN+++ D++G+ D+G G V TP +D +AS+G+ LT Y + Sbjct: 24 VTAAEKPENKRPNILLITADNLGYGDLGCYGNPVM---KTPMLDQLASEGVRLTDFYTAS 80 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQP---GGLQGLTTLPQLLHDQGYVTQAIGKWH 190 P+ + +RAT+LTG+Y G+ + G + +P+ L QGY T GKW+ Sbjct: 81 PTCTVSRATLLTGRYPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWN 140 Query: 191 MGENKESQPQNVGFDDFRGFNSVS-DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 +G + S+P GFD+F GF + + D Y + + Sbjct: 141 VGFSPGSRPTERGFDEFFGFAAGNIDYYHHYYAGRHD----------------------- 177 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 RG ++ + + + D +++ A+SD+PFF+Y HF + N Sbjct: 178 -LWRGLKEVFVEGYS-------TDLFADAACQYI--SAESDQPFFIYLPFNAPHFPSQRN 227 Query: 310 A---------------KYAGSSPA----RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 + G P + Y + ++ + K L+ +G D T+ Sbjct: 228 KQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVTALDSAIGRVLKQLDTSGLRDQTI 287 Query: 351 IVFTSDNGP----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG-IVD 405 +++ SDNG E + P R + WEGG+RVP + + G ++ + + Sbjct: 288 VIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGIRVPAIIRYPGHLKAGTVNQSPLI 347 Query: 406 LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAA 465 D+ PT + LAG P +P +DG D R + N +A Sbjct: 348 SLDILPTLITLAGGP-------LPAERILDGQDMLPALAAQTAPEPRTFFFQYRN--FSA 398 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 VR ++K + +F+L D E+ + R+ + Sbjct: 399 VRRGKYKLVRI--------------------KPNQPFMLFDLEQDLSETTDLAERNPKVL 438 Query: 526 VPLQTEMHAYMEILKKYPPRAQIKSD 551 LQ + + + R + KSD Sbjct: 439 NQLQQAYADWEREVAENEERRR-KSD 463 >UniRef50_A6BZV9 Arylsulfatase n=3 Tax=Bacteria RepID=A6BZV9_9PLAN Length = 520 Score = 393 bits (1010), Expect = e-107, Method: Composition-based stats. Identities = 108/548 (19%), Positives = 186/548 (33%), Gaps = 128/548 (23%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A + K+PN+++ + DD+GW D+G GG V TP +D +A +GL T Y+ Sbjct: 19 AVQAAEKIKRPNIILIMCDDMGWSDIGCYGGEV----QTPHLDRMAKEGLRFTQFYNNAV 74 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 TRA+++TG Y Y +P + + T+ ++L GY T GKWH+G + Sbjct: 75 CWTTRASLVTGLYP---------RYPRPHLNRNMVTIGEVLQQAGYQTALSGKWHLGRTE 125 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P GF DF G + + + +P+ + + D Sbjct: 126 STHPVYRGFQDFYGLLDGCCNF--FDPYYRDPKF-----------KRGITGDGYRFFAEN 172 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AK 311 + Y D + D+ ++ + +++DKPFFL+ H+ + K Sbjct: 173 TTRITEFPDDFYTTD---AFTDHAIQEIKTYSQTDKPFFLHLCYTAPHYPLHAKPEDIKK 229 Query: 312 YAG------------------------------------------------SSPARTSYG 323 Y G Y Sbjct: 230 YKGRYAAGWEALRNERYQRQLKMGLVDPQWKLPARDPESADWEQDKYPRDWQERRMEVYA 289 Query: 324 DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG---------------- 367 + M+ L TL++ G DNT+++F SDNGP+A P Sbjct: 290 AMIDCMDQNIGRLMATLKETGVDDNTIVMFLSDNGPDASEPGGANPEQIPGPEEYYTTCG 349 Query: 368 -------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGH 419 TPFR K EGG+ P V W G I+ + + D+ PT ++LA Sbjct: 350 PSWAFPQNTPFRRFKTWMHEGGISTPLIVRWPGKIKANSLTRQPAHIIDVMPTCVELAET 409 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 K +DG G + + N AVR ++K Sbjct: 410 DYPATFQSH-KILPVDGKSIVPILQGKIREPHDSLFWELRNN--QAVRQGKWKLV----- 461 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 +++L D E++++ ++ ++ + + + Sbjct: 462 ---------------ADRNINRWELYDLEQDRTETNNLASQYPERVAQMKADWQKWADKT 506 Query: 540 KKYPPRAQ 547 + Q Sbjct: 507 GVAQQKHQ 514 >UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 Tax=Alphaproteobacteria RepID=B9KQS8_RHOSK Length = 509 Score = 393 bits (1009), Expect = e-107, Method: Composition-based stats. Identities = 117/525 (22%), Positives = 206/525 (39%), Gaps = 98/525 (18%) Query: 42 HPNQYLVKPATT---IADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGW 98 HPN+ V + A + ++ PA+ +E +P+++ L+DD+G+ Sbjct: 29 HPNRRDVLAGSAGFLAAIAGLSILAQPARAQEVA------------RPHILYILVDDLGY 76 Query: 99 MDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM-- 156 DVG++G V TP++D +A++G L Y+QP +PTRA ++TG+Y + +G+ Sbjct: 77 ADVGYHGSDV----KTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRYPMRYGLQTGV 132 Query: 157 PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSD 215 P G+ G LPQ+L + GY T +GKWH+G +++ P+ G D F G Sbjct: 133 IPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVDYFYG------ 186 Query: 216 MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRW 275 + ++ K + H + + P Y +L + Sbjct: 187 ---------------------PLVGEIDHFKHEAHGITDWYRDNEMVKEPGYDTEL---F 222 Query: 276 MDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG-----SSPARTSYGDCMVEMN 330 ++ +++ S P ++Y H KY + R +Y + M+ Sbjct: 223 GADAIRLIEEH-DSATPLYMYLSFTAPHTPYQAPDKYKDLYPDIADEGRKAYAAMISCMD 281 Query: 331 DVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-----------RTPFRGAKGSTW 379 D + + LE+ G ++TL++F SDNG G P R KG+ + Sbjct: 282 DQVGLVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLY 341 Query: 380 EGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 EGG RV W G I ++ G++ + D+ PT LA A +DG+D Sbjct: 342 EGGTRVVALANWPGRIPAGETHGMMHVVDMLPTLAGLAQAEIAHAGQ-------LDGMDV 394 Query: 440 TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT 499 S R+ Y + A+R ++K + + Sbjct: 395 WQAI-SAGKASPREEVVYNIEPTQGALRDGKWKLY-------------------WQPILP 434 Query: 500 AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 +F+L DP E+ + + +Q + + PP Sbjct: 435 PKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARSMA--PP 477 >UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLM9_BACSE Length = 463 Score = 392 bits (1008), Expect = e-107, Method: Composition-based stats. Identities = 111/483 (22%), Positives = 193/483 (39%), Gaps = 77/483 (15%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTRA 141 G KPN++ L DD+G+ D+ G TP+ID +A+ G T Y+ SSP+R Sbjct: 32 GDKPNIIFILADDMGYCDLSCYGNKYI---ETPNIDRLAATGTAFTQCYAGSGISSPSRC 88 Query: 142 TILTGQYSIHH-------------GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 ++TG+ + + G+ + TT+ +L GY T + K Sbjct: 89 ALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQPNDTTIATVLGAAGYRTCLVNK 148 Query: 189 WHM-GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 WH+ G N E+ P N GFD+F G+ + D + P + ++ E +K+ Sbjct: 149 WHLDGFNPEATPLNRGFDEFYGWLISTAY---SNDPYYYPYWRFNNEKLENVKE------ 199 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH---- 303 + K+++ + +KF+++ + PFFLY H Sbjct: 200 --------------NEGDKHIKHNTDLSTEDAIKFINR--NKNNPFFLYLAYDAPHEPYN 243 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 D Y + M+ L L++ G +NTL++F SDNG + Sbjct: 244 IDETTWYDDEAWDMNTKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAKQA 303 Query: 364 PP---HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 P + +G KG +EGG+RVP V G + +K + I+ D+ PT LAG Sbjct: 304 PLEELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVPVQKLNNIIYFPDVMPTLAALAGAT 363 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + +P+ ++G++ F G ++ + ++ GK A R ++K Sbjct: 364 -----DKLPQK--LNGINILPLFYGQQLDTDNRLLYWEFPGKQRAARCGDWKVV------ 410 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 TV + A ++N+ D ES ++ ++ + EM A Sbjct: 411 --------------TVKKDAPLELYNIKEDMTESVNLANKYPEKVAQFEKEMKAMRIPTP 456 Query: 541 KYP 543 +P Sbjct: 457 NWP 459 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 128/521 (24%), Positives = 199/521 (38%), Gaps = 99/521 (19%) Query: 69 KETQQKLAELEKKTGK-KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLIL 127 TQQ LA +K + +PNVV L DD+G+ D+ G TP+ID +A G+ Sbjct: 19 ASTQQALARQKKAKEQTRPNVVFILADDLGYGDLSCYGQE---KFETPNIDRLAQNGMRF 75 Query: 128 TSAYS-QPSSSPTRATILTGQYSIHHGILMP---PMYGQPGGLQGLTTLPQLLHDQGYVT 183 T YS S+P+R+ ++TG +S H I GQ + T+ + GY T Sbjct: 76 TQCYSGTTVSAPSRSCLITGTHSGHTAIRGNKELAPEGQFPLPENSQTIFNDFRNAGYRT 135 Query: 184 QAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDV-HVNPEVALSPDRSEYIKQ 241 A GKW +G P G D F G+N ++ + D N + PD + ++ Sbjct: 136 GAFGKWGLGYIGSAGDPYKQGIDQFYGYNCQLLAHSYYPDHLWDNDKRVDLPDNNLNVQ- 194 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK-SDKPFFLYYGTR 300 Y +DL + FLD+ AK D+PFF++Y T Sbjct: 195 --------------------YGKGTYSQDLIHS---KALAFLDEAAKEKDQPFFMWYPTI 231 Query: 301 GCHFD--------------NYPNAKYAGSSPA---------------RTSYGDCMVEMND 331 H + YP Y G P ++ + ++ Sbjct: 232 IPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKGGYCTQFYPHATFAAMVYRLDV 291 Query: 332 VFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVP 386 + + L+ G DNT+I+F+SDNGP E + +RG K +EGG+RVP Sbjct: 292 YVGQIVQKLKDMGVYDNTIIIFSSDNGPHMEGGADPDFFNSNGIWRGYKRDVYEGGIRVP 351 Query: 387 TFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 + W G +QP +D + DL PT +V N T +DGV Sbjct: 352 MIISWPGHVQPSTETDFMCSFWDLMPTF--------REVLNPKADTRNMDGVSILPLLQN 403 Query: 446 TNGQSNRKAEH--YFLNGKLAAVRMDEFKY-HVLIQQPYAYTQSGYQGGFTGTVMQTAGS 502 GQ + + + AVR ++K H+ I+ Y Sbjct: 404 RKGQKEHEYLYFEFLEMNGRQAVRKGDWKLVHMNIRGNKPY------------------Y 445 Query: 503 SVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++NL +DP E ++ ++ L+ M +P Sbjct: 446 ELYNLASDPSEKYNVLNQYPEKADELKAIMKEAHIEDSNWP 486 >UniRef50_A6C8R8 Arylsulfatase A n=2 Tax=Planctomycetaceae RepID=A6C8R8_9PLAN Length = 510 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 120/484 (24%), Positives = 192/484 (39%), Gaps = 55/484 (11%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA-YSQP 134 + KKPN +V D++G+ D+ G V N TP ++ +A +G T + Sbjct: 38 SASAAPQQKKPNFIVIFCDNLGYGDIEPFGSTV---NRTPCLNRMAREGRKFTHYCVTAG 94 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQ------PGGL-QGLTTLPQLLHDQGYVTQAIG 187 +P+RA+I+TG YS G+ P GQ P GL T+ ++L QGY T IG Sbjct: 95 VCTPSRASIMTGCYSQRVGMHWNPRDGQVLRPISPYGLNPDEITVAEVLKKQGYKTGMIG 154 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+ P GFD F G DM + + LP + Sbjct: 155 KWHLGDQTPFLPTRQGFDYFYGIPYSDDM------TQAVGQRLGDRLDGKNWPPLPVMLN 208 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 D G ++ L + + + V+F++K ++PFFLY+ Sbjct: 209 DTVIEAGVDRNL-----------LTKDYTEKAVEFIEK--NKNQPFFLYFPQAMPGSTRK 255 Query: 308 P--NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 P + + G S +GD + E++ + L + G NTL+++TSDNG Sbjct: 256 PFASDAFRGKS-KNGPWGDSIEELDWSTGQILDKLVELGIDKNTLVIWTSDNGSPMAKDM 314 Query: 366 HG-----RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGH 419 + P G +T EG RVPT V+W + + + DL PT LAG Sbjct: 315 NSTERGTNKPLNGRGYTTSEGAFRVPTIVWWPETVPAGTVCEELATTMDLLPTFARLAG- 373 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGT-NGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 VP IDG D +G + ++ +Y+ +L AVR +K V ++ Sbjct: 374 ------GKVPSDRIIDGHDIRPLIMGEADAKTPYDGFYYYAMEQLQAVRKGPWKLFVPLK 427 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + + G + +FN+ TD ++ +H + L + Sbjct: 428 EFSRHPHFKKGEG--------SRPLLFNVVTDISSEHNVADQHPEIVKELMSLAEKARAD 479 Query: 539 LKKY 542 L Sbjct: 480 LGDT 483 >UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6LEC5_PARD8 Length = 483 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 110/493 (22%), Positives = 183/493 (37%), Gaps = 75/493 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNG-------GGVAVGNPTPDIDAVASQGLILT 128 + +++ KPN+++ L DD+G+ DV + TP++D +A QG+ T Sbjct: 22 CDAKEEAVPKPNIIILLADDLGYNDVSCYRNENFPQQSDSFPTSQTPNLDLLARQGIRFT 81 Query: 129 SAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAI 186 + Y SSP+RA ++TG+ G+ P L+ T+ ++L Y T Sbjct: 82 NFYCGAAVSSPSRAALMTGRNCTRTGVYNYLEQNSPMHLRDSEVTIAEVLKQADYATGHF 141 Query: 187 GKWHM--GENKESQPQNVGFDD-FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 GKWH+ G + P + GFD F N+ +P Sbjct: 142 GKWHLSSGRPDQPYPNDQGFDYSFYALNNS----------------------------VP 173 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 + + R GE Q + + +++LDK +PFFL H Sbjct: 174 SHHNPTNFFRNGEPQGEIE------GYSCDIVVTEALQWLDK--NKQEPFFLNVWFNEPH 225 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 F + Y C+ M+ L L++ DNT+++F SDNG Sbjct: 226 FPMEAPEELKKRHAINPEYYGCIENMDIAIGKLMNYLKEQNLEDNTIVIFASDNG---SQ 282 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGHPGA 422 + PFRG K +EGG+RVP V W + D+ PT LA P Sbjct: 283 WDYSNLPFRGEKHFNYEGGLRVPCIVRWHKHVPTGVISEFNGCFTDILPTLASLADAP-- 340 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN---GKLAAVRMDEF-KYHVLIQ 478 VP IDG+D + FLG R+ +F + +R ++ Sbjct: 341 -----VPTDRVIDGMDISPVFLGKAETLERENPLFFFRYIHDPICMIREGDWCLLGYDEP 395 Query: 479 QPYAYTQSGYQGGFTGTV------------MQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 P+A++ G + ++NL D +E + +H + Sbjct: 396 LPWAFSLDELALGKVKPWYLTKEHMEFAKKVFPKYFELYNLRDDREERIDVADKHPEIVA 455 Query: 527 PLQTEMHAYMEIL 539 L+++M + + Sbjct: 456 RLKSKMLKLKQEV 468 >UniRef50_A6QA55 Arylsulfatase n=5 Tax=Bacteria RepID=A6QA55_SULNB Length = 528 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 139/499 (27%), Positives = 228/499 (45%), Gaps = 43/499 (8%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 KKPN++V DD+GW +V G G G TP+ID++ +G+ T Y+QPS + RA+ Sbjct: 27 AKKPNILVIWGDDIGWQNVSAYGMGTM-GYTTPNIDSIGMEGIRFTDHYAQPSCTAGRAS 85 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +TGQY I G+ G P GL+ L +++ +QGY T GK H+G+N P Sbjct: 86 FITGQYPIRSGMTTVGQPGDPLGLKPESPCLAEVMKEQGYTTGQFGKNHLGDNNMHLPTV 145 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD+F G + E E ++ ++ +H+ + Sbjct: 146 HGFDEFYGNLYHLNTQEEAEQRDYQRFAKAYSGSVEAYEKKFGTRGVIHSFATDKDDPTV 205 Query: 262 D----------------ITPKYMEDLDQR-WMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 D +T + M++ D++ + F+ + K KPFF++ T H Sbjct: 206 DPRFGKVGKQIIEDTGPLTQERMKEFDEKEVIPRAFDFMIRAKKEGKPFFVWLNTTRMHL 265 Query: 305 DNYPNAKYAGSSPARTS----YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 N K+ ++ TS +G M++ + + L+KN +T++ +++DNGPE Sbjct: 266 YTRLNDKWRYAAEKFTSEVDVHGSGMLQHDHDIGLVLDFLKKNDLEKDTIVWYSTDNGPE 325 Query: 361 AEVPPHG-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAG 418 PHG TPF+ K +TWEGGVRV + + W G I+ + +GI D+F T AG Sbjct: 326 HSAWPHGATTPFKSEKMTTWEGGVRVISMIKWPGHIKKGQILNGIQSHMDMFTTLAAAAG 385 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + K +IDG++ ++ G + +S R + Y+ KL+AVRM +K+ + Sbjct: 386 VDNVAEKMMKEKKQYIDGLNNLDYWTGKSKKSARNSIFYYYESKLSAVRMGPWKFLFSTK 445 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS-------IGVRHIPMGVPLQTE 531 + Y G ++ V NL DP ES + + + + P+ Sbjct: 446 KDYY-----------GNLVPRTVPIVVNLRMDPFESYTDKESYGHLLQKVSWLMSPMGEM 494 Query: 532 MHAYMEILKKYPPRAQIKS 550 M A+++ L YPP KS Sbjct: 495 MAAHLKTLADYPPVQGGKS 513 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. Identities = 122/528 (23%), Positives = 181/528 (34%), Gaps = 125/528 (23%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPT 139 ++PN+V FL+DD+G D+G G TP+ID +A+ G T AY+ P SPT Sbjct: 32 AAEQQPNIVFFLVDDLGQRDLGCYGSTF---YETPNIDKLAADGARFTQAYAACPVCSPT 88 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL-------------------QGLTTLPQLLHDQG 180 RA+ILTG + GI G TL + L G Sbjct: 89 RASILTGLWPQRTGITDYIATDNSNGPAKWNRNTMTLPAAYRDRLALDSPTLAKSLKSAG 148 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYT--EWRDVHVNPEVALSPDRSEY 238 Y T GKWH+G + P+N GFD RG Y ++ + NP + P Sbjct: 149 YATFFAGKWHLGP-EGFYPENQGFDINRGGIERGGPYGGKQYFSPYGNPRLTDGPAG--- 204 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 E L R +F++ A +PFF Y+ Sbjct: 205 ------------------------------EHLPDRLATETCQFIE--AHQKQPFFAYFS 232 Query: 299 TRGCHFDNYPNAKYAGSSPARTS------------------------YGDCMVEMNDVFA 334 H A+ Y + M+ Sbjct: 233 FYSVHTPLQAREDLRQKYVAKREKLGLKPTWGREHMRDVRQVQEHAVYAAMVDAMDQAVG 292 Query: 335 NLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWK 392 + L++ G +NTL++FTSDNG +E P P RG KG +EGG+R P + W Sbjct: 293 KVLAKLDELGLRENTLVIFTSDNGGLSTSEGWPTSNLPLRGGKGWMYEGGIREPLVMRWP 352 Query: 393 GMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 ++ + D V D T L A+ IDGV G + Sbjct: 353 AKVKAGSTIDTPVSSPDFMATLLAATATKPAEQQQ-------IDGVSLLPLLAGEKLKER 405 Query: 452 RKAEHYFLNGKL-----AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 HY G AA+R +K ++ +FN Sbjct: 406 SLFWHYPHYGNQGGAPAAAIRRGSWKLI--------------------EWLEDGQVELFN 445 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL-----KKYPPRAQIK 549 L TD E+ ++ + + + E+HA+ + + +K P K Sbjct: 446 LATDESETTNLASKEPALVREMLAELHAWQKEVGAILPEKNPNYDPAK 493 >UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN Length = 527 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. Identities = 131/523 (25%), Positives = 213/523 (40%), Gaps = 80/523 (15%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 Q A +K PN++ L DD+G+ D+ TP +D +A G+I T A+S Sbjct: 13 QNTAHASEKAND-PNIIYILADDMGYGDIRALNPE--CKIATPHLDQLAHGGMIFTDAHS 69 Query: 133 QP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKW 189 +PTR +LTG+Y+ + ++G L T+P +L + GY T +GKW Sbjct: 70 SSSVCTPTRYGVLTGRYNWRSRLKSGVLWGLSRRLIEPDRETVPSMLKEHGYYTACVGKW 129 Query: 190 HMGENK-----------------------------ESQPQNVGFDDFRGFNSVSDMYTEW 220 H+G + ++ P +VGFD F G ++ DM Sbjct: 130 HLGMDWSLKQGGFATEQSYNKKTNPGWDVDYSKPIQNGPNSVGFDYFFGISASLDM---- 185 Query: 221 RDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGV 280 P V + DRS+ I + R G + D+ R D V Sbjct: 186 -----PPYVYIENDRSQGIPTV-----TKAFFRDGPAHKDFEAI-----DVLPRITDKTV 230 Query: 281 KFLDKMA---KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 + +D+ A K KPFF+Y+ H P ++ G S +Y D +++++D + Sbjct: 231 QIIDEHAAASKEGKPFFIYFPLNAPHTPILPTPEWQGKS-GINAYCDFVMQVDDTVGQVM 289 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEA-----EVPPHGRTP---FRGAKGSTWEGGVRVPTFV 389 + L+K G +NTL++FT+DNG E+ P FRG K +EGG RVP Sbjct: 290 QALKKQGIHENTLVIFTADNGCSPAANFKEMTDKDHQPSYQFRGHKADIYEGGHRVPFIA 349 Query: 390 YWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 W I+ SD + L DLF TA D+ G VP D V GT Sbjct: 350 NWPARIKAGTHSDQLTCLTDLFATAADIVGA-------KVPDDAGEDSVSILPAMEGTAH 402 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHV-LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNL 507 R+A + ++R D +K + +++ + G + + +++L Sbjct: 403 TPLREAAVHHSIRGAFSIRKDHWKLELCPGSGGWSFPKPG-----KDNLSELPAIQLYDL 457 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 D E ++ H + L T + +Y + + P + Q + Sbjct: 458 NHDAGEQKNVQAEHPEVVKELTTLLQSYADRGRSTPGKPQPNT 500 >UniRef50_UPI00016C41FE sulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C41FE Length = 499 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 131/510 (25%), Positives = 191/510 (37%), Gaps = 79/510 (15%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 K PN+V L DDVG+ D+G G TP++D +A QG LT A+S +PTR Sbjct: 21 DPKPPNIVFILADDVGYGDLGCYGS---TKVRTPNLDTLAKQGTRLTDAHSPAAVCTPTR 77 Query: 141 ATILTGQYSIHH--GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG------ 192 +LTGQY+ H G + T+P L GY T A+GKWH+G Sbjct: 78 YALLTGQYAWRHAPGSRILSGVAPLSIKPDTLTVPAFLKQNGYTTAAVGKWHLGLGEKET 137 Query: 193 -ENKESQPQNV--GFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 N E +P GFD + D R V+ +P+ ++ ++ + P Sbjct: 138 DYNGEIKPGAREVGFDYSFLIPATGDRTPCVFVENGRVVNYDPKDPITVSYTKKVGTEPT 197 Query: 245 SKDDVHAVRGGEQQAIADIT-----------------PKYMEDLDQRWMDYGVKFLDKMA 287 K++ + + D+T ED+ V+F+ K Sbjct: 198 GKENPELLTVQKPSLGHDMTIVNGISRIGWMSGGKAARWKDEDIADDITKKAVEFIGKA- 256 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 DKPFFLY+ T H P+ ++ G S GDC+ E++ + L++ D Sbjct: 257 -KDKPFFLYFATHDAHVPRVPHPRFKGKS-GHGLRGDCIEELDWCVGEIVAALDRYKLTD 314 Query: 348 NTLIVFTSDNGP-----------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 NTL+VFTSDNG RG KG +EGG RVP W + Sbjct: 315 NTLVVFTSDNGGVMDDGYIDGTATDTSGHKCNGALRGFKGGLYEGGHRVPFIAKWPVHVA 374 Query: 397 PRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 K SDG+V DL T + G L+P D VD ++ Sbjct: 375 AGKVSDGLVCHVDLLRTCAAILG-------KLLPSGAGPDSVDIFPTLTADRPTKPCRST 427 Query: 456 HYFLNGKLA--AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 +G A+R E+K + G +FNL DP E Sbjct: 428 LIHQSGNPNALAIRKGEWKL------------------IPNEGKKKVGPELFNLAADPTE 469 Query: 514 SDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++ + L + + E P Sbjct: 470 QKNLAADKPEVVKELAALLKSVQENPTSRP 499 >UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3ZMT9_9PLAN Length = 542 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 133/564 (23%), Positives = 204/564 (36%), Gaps = 145/564 (25%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 +PN+++ ++DD+G+ D+G++GG +A TP+IDA+A G+ + Y+ PTRAT Sbjct: 26 SDRPNIILIMVDDMGFSDLGYHGGEIA----TPNIDALAHSGVRFSQFYNNGRCCPTRAT 81 Query: 143 ILTGQYSIHHGILM--------PPMYGQPGGLQG-----LTTLPQLLHDQGYVTQAIGKW 189 ++TG Y GI G+P QG T+ + L QGY T GKW Sbjct: 82 LMTGLYPHQTGIGHMTESPGEANYGSGKPPTYQGYLNRNCVTIAEALQQQGYATLMSGKW 141 Query: 190 HMGENKES-QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 H+GEN +S P GF+ + G S + +Y F D Sbjct: 142 HLGENDKSRWPLQRGFEKYFGCLSGATLYF-------------------------FPDGD 176 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL-DKMAKSDKPFFLYYGTRGCHFDNY 307 G +Q A + T + DY ++FL ++ A +P FLY H+ Sbjct: 177 RKMTLGNQQIAEPESTTDQPFYTTDAFTDYAIRFLKEEQAGQQRPMFLYLAYTAPHWPLQ 236 Query: 308 PN----AKYAGS------------------------------------------------ 315 AKY G Sbjct: 237 AFEEDIAKYRGKYKIGWDKLREQRLERQKNLGLIAADRQLSPRTPKIPAWDELDAAQQDE 296 Query: 316 -SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP---------- 364 Y + ++ L K L+++G D+TLI+F SDNG E Sbjct: 297 MDLKMAVYAAMIDRVDQNIGRLMKHLKESGIEDDTLILFLSDNGGCQEGGVLGGAHFLDP 356 Query: 365 ----------------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDL 406 TPFR K EGG P F+ W G I R L Sbjct: 357 EQRNRQYFHGYGEAWANASNTPFRLYKHFNHEGGTATPFFMRWPGKIAARDAWCAEPAQL 416 Query: 407 ADLFPTALDLAG--HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 D+ PT LD+AG +P N +P +DGV G +R+ + A Sbjct: 417 IDVMPTILDVAGATYPAKYAENAIP---PLDGVSLRPTMQGE--PLDRQQPICIEHENNA 471 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 ++R ++K +G +Q A ++N+ D E+ ++ V H Sbjct: 472 SIRAGDWKLV-------------GRGVAAPRGVQPAKWELYNIADDRTETQNLAVEHPEK 518 Query: 525 GVPLQTEMHAYMEILKKYPPRAQI 548 L + +A+ + + YP R Sbjct: 519 VRELSQQWNAWAKRVGVYPKRQAP 542 >UniRef50_Q024K7 Sulfatase n=28 Tax=Bacteria RepID=Q024K7_SOLUE Length = 504 Score = 390 bits (1003), Expect = e-107, Method: Composition-based stats. Identities = 123/511 (24%), Positives = 193/511 (37%), Gaps = 66/511 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A K PN+V DD+G+ D G A TP++D A+ G+ T+A+S Sbjct: 16 ASRAFAAAKPPNIVYMYADDLGYGDTSCYG---ATRVKTPNLDRAAAAGIRFTNAHSSSA 72 Query: 135 SSSPTRATILTGQYSIHH-GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 + +P+R ++LTG+Y+ H G + P G TLP +L GY T A+GKWH+G Sbjct: 73 TCTPSRYSLLTGEYAWRHQGTGVLPGDASLIVQPGRYTLPAMLQQAGYRTGAVGKWHLGL 132 Query: 194 NKESQ---------PQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYI 239 P VGFD F + D + V+++P L + Sbjct: 133 GGRDLDWNGEIRPGPLEVGFDYSFIFPATGDRVPCVFVENRKVVNLDPNDPLRVRYDKPF 192 Query: 240 KQLPFSKDDVHAVR----GGEQQAIADITPK--YM----------EDLDQRWMDYGVKFL 283 P + ++ G I + + YM ED+ V FL Sbjct: 193 PGEPTGAANPELLKMKPSHGHDNTIVNGISRIGYMAGGKSARWVDEDMADTITGKAVSFL 252 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 ++ +PFFLY+ T H P+ ++ G + GD + E++ + TL++ Sbjct: 253 EQ--NRARPFFLYFATHDIHVPRVPHPRFVGKTD-MGPRGDAIAELDWSIGRILDTLDRL 309 Query: 344 GQLDNTLIVFTSDNGPEAEVP-----------PHGRTPFRGAKGSTWEGGVRVPTFVYWK 392 NTL VF+SDNGP + H P RG K S ++GG R+P V W Sbjct: 310 KLTRNTLFVFSSDNGPVVDDGYRDQAVERLGDHHPAGPLRGGKYSAYDGGTRIPFVVRWP 369 Query: 393 GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 G ++P S + DL + L G +P+T D + LG Q Sbjct: 370 GTVKPGISAAPISQVDLLASFAALTG-------RKLPETAAPDSFNVLPALLGKTKQGRP 422 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + L A ++K P G +F+L D Sbjct: 423 HIVEHATALSLIA---GDWKVIRPHTGPRRNQTGNEIG-------NDPEPQLFDLAHDIG 472 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 E +++ +H L + + + P Sbjct: 473 EQNNVAPQHPEKVQELLGMLAQIEKSPRTRP 503 >UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKD8_9BACT Length = 455 Score = 390 bits (1001), Expect = e-107, Method: Composition-based stats. Identities = 121/477 (25%), Positives = 182/477 (38%), Gaps = 80/477 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 +KPN+++ L DD+G+ D+GF G TP IDA+A G+ T Y S P+R Sbjct: 18 AAQKPNIILILADDLGYEDLGFLGAPDI---KTPHIDALARSGMNFTQGYQSASVCGPSR 74 Query: 141 ATILTGQYSIHHGILMPP--------MYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 A +LTG+Y G P + G + LL Y T IGKWHMG Sbjct: 75 AGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMG 134 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + E +P D + GF + + Y E + + Sbjct: 135 LSHEQRPTQRSVDYYYGFLNGAHSYREAKMDMKGAPMTW------------------PIF 176 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 R E + T + + D GV F+ + DKPFFLY H K Sbjct: 177 RNNEPVPFSGYT-------TEVFNDEGVNFIKR--NKDKPFFLYMSYNSVHGPWEAQPKD 227 Query: 313 AGSSPA-----RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-- 365 S R Y ++ M+D L +TL+ G +NTL++F SDNG + Sbjct: 228 LQRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAE 287 Query: 366 ------HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAG 418 RG KG T+EGG+RVP + W +I + + V D+ PT + + Sbjct: 288 RAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHI-- 345 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + P + GV+ + G K Y+ A+R ++K Sbjct: 346 ------SQAAPAKKELSGVNLMPYITGEKTSRPHKTL-YWRRDDDYAIRDKDWKL----- 393 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 T + Y G T +FNL DP E +++ +H + LQ + + Sbjct: 394 -----TWNDYNGPRT--------PMLFNLKDDPNEKNNLIHKHPEIAQKLQAKFDQW 437 >UniRef50_C1ZFQ0 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFQ0_PLALI Length = 522 Score = 390 bits (1001), Expect = e-106, Method: Composition-based stats. Identities = 130/509 (25%), Positives = 215/509 (42%), Gaps = 61/509 (11%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 ++PN+++ L DD+G+ D+ V T ID +A +G+ T A+S +PTR Sbjct: 31 AAEQPNILLILADDLGYGDLRCYNSQSKVS--TSHIDRLAREGMRFTDAHSPSTVCTPTR 88 Query: 141 ATILTGQYSIH--HGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG---EN 194 ++TGQ G + G P + G TLP +L ++GY T +GKWH+G + Sbjct: 89 YGLMTGQMPFRAPSGGTVFTGVGGPSLIAPGRLTLPMMLRERGYSTACVGKWHIGLTFFD 148 Query: 195 KESQPQN----------------------VGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 +E +P + GFD F G + T+W + + Sbjct: 149 REGRPIHSNALEAVRQVDFSRRIDGGPVDHGFDSFFG--TACCPTTDWLYAFIENDRVPV 206 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD-- 290 P + K H R G +D ME++D +++ +FL++ + + Sbjct: 207 PPTASLEKSALPKHPYAHDCRPG--LIASDFA---MEEIDLIFLEKSRQFLNQHVRQNPG 261 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 KPFFL++ T+ H ++ ++ G S A +GD ++E++ + L K+LE+ +NTL Sbjct: 262 KPFFLFHSTQAVHLPSFAAKQFQGKSEA-GPHGDFLLELDYIVGELMKSLEELHIAENTL 320 Query: 351 IVFTSDNGPEAEVPPHGRT--------PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD- 401 ++FTSDNGPE H R+ P+RG K WEGG RVP V W G ++P ++ Sbjct: 321 VIFTSDNGPEVTSVIHMRSDHGHDGARPWRGMKRDAWEGGHRVPFIVRWPGKVRPGTTNS 380 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFL 459 + L D+ T + V +P D + +L + R F Sbjct: 381 QLTSLTDVMATVAAI-------VDTQLPDHAAEDSFNMLPAWLDESAPPIRPYLLTQSFG 433 Query: 460 NGKLAAVRMDEFKY--HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 + A+R E+KY H + A ++NL TDP ES ++ Sbjct: 434 GSRTLAIRQGEWKYLDHTGSGGNRYENDPSLKPFILPDAAPDAPGQLYNLSTDPGESTNL 493 Query: 518 GVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 + L+T + + P R Sbjct: 494 YHARPEVTSRLKTLLEQSKTNGRSRPTRP 522 >UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLE2_9BACT Length = 441 Score = 390 bits (1001), Expect = e-106, Method: Composition-based stats. Identities = 117/468 (25%), Positives = 181/468 (38%), Gaps = 77/468 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 + PN+++ L DD G D G + TP ID++A G+ T AY + SP+R Sbjct: 17 ANEPPNIIIILADDAGSSDFSCYGSKQLL---TPHIDSIAHNGIKFTQAYTASSVCSPSR 73 Query: 141 ATILTGQYSIHHGILMPPMYGQP--------GGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 A +LTG+Y G L + + G TL L + GY T IGKWH+G Sbjct: 74 AGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLADSLKELGYSTHCIGKWHLG 133 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 E P GFD+F GF S + Y ++ + + Sbjct: 134 EADHFHPNARGFDNFYGFLSGARTYFLGGELRGDMD------------------------ 169 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK- 311 R + A+ + Y ++ + ++ + + + DKPFF+Y H + Sbjct: 170 RIMRNKEFAEPSSGYTTEV---FTQEAIRIIQE--EQDKPFFIYLSHNAVHGPMDAKDED 224 Query: 312 ---YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 Y +P R Y M ++D L + L+ + Q +NTLI F SDNG Sbjct: 225 IMSYDFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGPTTHNGSSN 284 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANL 427 P RG KGS +EGG R P + W I SD + D+F T + AG Sbjct: 285 WPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATCIQAAG-------GE 337 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 + G+D + RK ++ GK ++R ++K ++L Sbjct: 338 LVTDRTYHGIDLLPVINKPQETNARK--LFWSRGKNYSMRQGKWKLNIL----------- 384 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 GSS++NL D E + + + L EM + Sbjct: 385 -----------PTGSSLYNLENDQSEKHDLSEQFPEIKAQLIKEMSKW 421 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 122/490 (24%), Positives = 184/490 (37%), Gaps = 98/490 (20%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 KPN+V L+DD+G D G G A TP ID +A+QG+ T AY+ P SPTR Sbjct: 27 AAPTKPNIVFILIDDMGCKDAGCYG---ATNFSTPHIDRLANQGMRFTDAYAAPVCSPTR 83 Query: 141 ATILTGQYSIHH---------GILMPPMYGQPGG-----LQGLTTLPQLLHDQGYVTQAI 186 A+++TG++ G +P P G T+ Q LH GY I Sbjct: 84 ASLMTGKHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMI 143 Query: 187 GKWHMG--ENKESQPQNVGFDDFRG--FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 GKWH+G E +PQN GFD + + + + + D P P Sbjct: 144 GKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPFVDQQKWPYAGPLPGNPG----- 198 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 + L R D + F+ + ++PFFLY Sbjct: 199 --------------------------DYLPDRLTDEAIDFVRE--NRERPFFLYLSHWSV 230 Query: 303 HFDNYPNA------KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H + + G Y M +++ L TL++ DNTL VF SD Sbjct: 231 HGRYFAPESLIAKYRERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSD 290 Query: 357 NGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALD 415 NG E P RG+KGS +EGGVRVP V + G+++P V DLFPT LD Sbjct: 291 NGGE---RITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPTFLD 347 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH----YFLNGKL--AAVRMD 469 A + +DG G + +R A + ++ +A+R Sbjct: 348 FAERSY--------RDNKLDGHSIAGLLTGEQSELDRDALYWHFPHYWGSTRPCSAMRQG 399 Query: 470 EFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 +K ++T + +++L +DP E + L+ Sbjct: 400 RWKLV--------------------EHLETGRAQLYDLSSDPGEQRDLANEMPQQATELR 439 Query: 530 TEMHAYMEIL 539 + + + Sbjct: 440 KMLAQWRTKV 449 >UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA5_RHOBA Length = 562 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 113/481 (23%), Positives = 189/481 (39%), Gaps = 58/481 (12%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSS 136 E +PN+++ L DD+G+ D+ G TP +D +AS+GL Y+ Sbjct: 115 TEAHADDRPNIILLLADDLGYGDLSCFGSP---AVKTPHLDRLASEGLKCNRFYAGSAVC 171 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG---- 192 SPTRA++LTG+Y + GI + TT+ +LL D GY T IGKWH+G Sbjct: 172 SPTRASVLTGRYPLRFGITKHFNDRNGWLPESATTVAELLKDAGYNTAHIGKWHLGGLHV 231 Query: 193 ------ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP-FS 245 + P+ GFD + ++ P R + + F Sbjct: 232 DEPGKRLTNQPGPRQHGFDFY------------------QTQIEQQPLRGQMGRDKTLFR 273 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 K +R Q I+ P Y + D+ V+ ++K++ + PFF+ H Sbjct: 274 KGGTVLLRN--DQRISQDDPYYHKHFTDANGDFAVEMIEKLSSEEDPFFINMWWLVPHKP 331 Query: 306 NYPNAK-------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 P + + + + + M+ + + L++ DNTL++FTSDNG Sbjct: 332 YEPAPEPHWSDTAADDITDDQHRFRSMVQHMDAKVGAILRKLDELKIADNTLVLFTSDNG 391 Query: 359 PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA-DLFPTALDLA 417 E H +G K +GG+RVP V W I ++ DL PT D A Sbjct: 392 AAFEGFIHD---LKGGKTELHDGGIRVPMIVRWPDAIPAGQTSQTFSHTNDLLPTFCDAA 448 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 + +P +DG+ S + G S + F L + H Sbjct: 449 -------SVQLPSDLPLDGLSLLSHWKGGTPPSQVERGTVFWQLDL----YKSLQRHYPK 497 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 +PYA T+ +G + + +F++ DP E ++ H + L ++ ++ Sbjct: 498 PKPYA-TEVVMRGNWKLLAFKGKPVELFDVGADPNEKRNVLAEHPELVASLSAQLKDWLN 556 Query: 538 I 538 Sbjct: 557 E 557 >UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI0_9BACT Length = 456 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 110/478 (23%), Positives = 178/478 (37%), Gaps = 76/478 (15%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 + KPN++ + DD+G+ +G G + TP +D +A +GL LT Y+ + Sbjct: 13 AANSADKPNIIFIMCDDMGYGQLGSYGQKMI---KTPRLDQMAKEGLRLTDYYAGTAVCA 69 Query: 138 PTRATILTGQYSIHHGILMPPMY--GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-N 194 P+R +++TGQ+ H I Y GQ T+ + + + GY T IGKW +G Sbjct: 70 PSRCSLMTGQHVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALIGKWGLGYPG 129 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E +P GFD F G+N + + +R Sbjct: 130 SEGEPNKQGFDYFFGYNDQKHAHNHFPK---------------------------FLLRN 162 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF--------DN 306 E + + + K +E D F+ K D PFFLY H + Sbjct: 163 EETLTLKNNSGKEIEYSQYMLTDEAKGFIKK--NKDNPFFLYLAYVIPHSRLQIPGDDEC 220 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP- 365 Y K + + + ++ ++ L++ +NTL+VFTSDNG E Sbjct: 221 YLQYKDESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGAR 280 Query: 366 ----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHP 420 + P G K S +EGGVRVP +W G+I+P + S+ I DL PTA +L G Sbjct: 281 PEFFNDSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNHIGAHWDLMPTACELGGVQ 340 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF--LNGKLAAVRMDEFKYHVLIQ 478 + IDG+ G + + YF VR ++ Sbjct: 341 PPEG---------IDGISYVPLLKGNMEEQEKHDYLYFELHWPTKRGVRKGDW------- 384 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE-MHAY 535 Q + +FNL D + + ++ + + A+ Sbjct: 385 -------VALQSKTSAIDPNKDTIKLFNLKNDLGQKKDLATQYPEKVEEFKKIFLEAH 435 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 119/497 (23%), Positives = 185/497 (37%), Gaps = 89/497 (17%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 L +PNVV L+DD+GW D+ G TP++D +A QG+ T Sbjct: 25 ATASANLFSRATAQSSRPNVVFILVDDMGWGDLSIYG---RTDYETPNLDRLARQGVRFT 81 Query: 129 SAYS-QPSSSPTRATILTGQYSIHH--------GILMPPMYGQPGGLQGLTTLPQLLHDQ 179 +AY+ Q +PTR LTG+Y G P G T+ LL Sbjct: 82 NAYANQTVCTPTRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKAN 141 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GY T +GKWH G P GFD++ G S Y Sbjct: 142 GYETALVGKWHAGYPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTD--------------- 186 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 + L ++DV R G + + D V+F+ + +PF+L Sbjct: 187 RILDLYENDVPVQRSG--------------YVTDLFTDRAVEFIQR--PHSRPFYLSLHY 230 Query: 300 RGCHFDNYPNAKYAGSS----------PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 H+ A ++ ++ +Y + ++D + LE +GQ DNT Sbjct: 231 NAPHWPWQGPNDQASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNT 290 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLAD 408 L++FTSDNG E PFRG K S +EGG+RVP + + G+ Q + S+ ++ D Sbjct: 291 LVIFTSDNGGE---RFSNFGPFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFD 347 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF---LNGKLAA 465 L T L G DG + G + +R + L + A Sbjct: 348 LTATILAATGTSFH-------PNYPPDGQNLLPLLRGDRSEFSRTLFWRYGAALTTRQRA 400 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 VR ++KY + ++FNL TDP E+ + + + Sbjct: 401 VRSGDWKY----------------------WRRGNQEALFNLATDPGETTDLKDSNAQVF 438 Query: 526 VPLQTEMHAYMEILKKY 542 L+ + + + Y Sbjct: 439 TRLRNQFQHWELQMLPY 455 >UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V8P8_BACUN Length = 525 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 133/525 (25%), Positives = 205/525 (39%), Gaps = 91/525 (17%) Query: 48 VKPATTIA--DNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 +K + T+ ++P A + +++ ++PN+V+ + DD+GW DVG+ G Sbjct: 1 MKVSCTLVSVAALLPFSGSNAGN---------VQRDKSQRPNIVLVIADDMGWGDVGYQG 51 Query: 106 GGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMYGQPG 164 AV TP+IDA+A +G+ + Y S S P+RA ILTG Y G ++ Sbjct: 52 ---AVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGVYQQRFGFY-NNLHPWAK 107 Query: 165 GLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVH 224 +G +TL +++ D GY T +GKWHM ++ E P GFD F GF W D H Sbjct: 108 IPEGQSTLGEMVRDCGYATGFVGKWHMADSPEQSPNRRGFDQFYGF---------WSDTH 158 Query: 225 VNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLD 284 P Y D R GE Q + +Y+ D + V+F+D Sbjct: 159 DYYRSTDKPGVELY--------DFCPLYRNGEIQPPLHESGEYITDC---FTREAVEFID 207 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART-------SYGDCMVEMNDVFANLY 337 K A S PF L H Y R + ++ ++D + Sbjct: 208 KHASS--PFLLCLSYNAVHSPWQVPEHYVNRLEGRRFHHEDRKVFAAMVLALDDGIGRVM 265 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEA----------EVPPHGRT------PFRGAKGSTWEG 381 ++L KNG +NTL + SDNG E G T PFRG K T+EG Sbjct: 266 ESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGYKADTYEG 325 Query: 382 GVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G+RVP + W + D V D+FPT + G + + +DGV Sbjct: 326 GIRVPYIMSWPSELPQGMVYDNPVISLDIFPTVMQAVGGTSRQKYS-------LDGVSLL 378 Query: 441 SFFLGT-NGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT 499 + + Y+ + A+R ++K Q T Sbjct: 379 PYLKSEWPIDKRPHSTLYWRRDEDFAIRKGDWKLVYNDQG------------------ST 420 Query: 500 AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 +F++ D +E + + + L E A+ L PP Sbjct: 421 RKIQLFDMKDDKEEVYDLSGEYPELADSLLAEFDAWDAAL---PP 462 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 127/517 (24%), Positives = 183/517 (35%), Gaps = 115/517 (22%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 L +PN V L DD+GW DVGFNG TP++D +A +G+ T AY+ Sbjct: 26 TSLCATRVHAADRPNFVFILADDLGWKDVGFNGSTF---YETPNLDRLAREGMRFTDAYA 82 Query: 133 Q-PSSSPTRATILTGQYSIHHGI--LMPPMYGQPG-----------GLQGLTTLPQLLHD 178 SPTRA+I+TG+Y + +P +P TL + L + Sbjct: 83 ACSVCSPTRASIMTGKYPARLHLTDWLPGRPDKPDQILKHPKIITELPAAEITLAKALQE 142 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T IGKWH+G P+ GFD G + P SP ++ Sbjct: 143 GGYKTAFIGKWHLG-GLGHWPEQAGFDINIGGCGMGH-----------PSSYFSPYKNPT 190 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 +K P E L R D VKF++ PF LY Sbjct: 191 LKDGPVG-----------------------EYLADRLTDEAVKFIEN--TKGTPFLLYLS 225 Query: 299 TRGCHFDNYPNA----KY----------------------AGSSPARTSYGDCMVEMNDV 332 H KY A + Y M +++ Sbjct: 226 HYSVHTPLQAKKGLIEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIYAAMMQSLDES 285 Query: 333 FANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVY 390 + L++ G NT+I+FTSDNG AE P P R KG +EGGVR P V Sbjct: 286 VGRVLDKLKELGLDKNTVIIFTSDNGGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVK 345 Query: 391 WKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 W G+ + SD V D +PT L++AG P +DG+ T G Sbjct: 346 WPGVTKAASVSDHQVMSTDYYPTLLEIAGLPAR-------PEQHLDGISFTPALRGKEMG 398 Query: 450 SNRKAEHYFLNGKL-----AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSV 504 HY +++R ++K ++ + Sbjct: 399 ERPLFWHYPHYSNQGGAPSSSIRKGDWKLIEWYEEN--------------------RIEL 438 Query: 505 FNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 FNL D E + + L++E+ A+ +K Sbjct: 439 FNLRLDVGEKNDLASTSALKREELKSELQAWRASVKA 475 >UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP3_9BACT Length = 465 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 115/477 (24%), Positives = 190/477 (39%), Gaps = 74/477 (15%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 LA L KPN++V L DD+G+ DV ++G TP ID++A G + Y + Sbjct: 13 LASLSASAA-KPNIIVILADDLGYGDVSYHG--TLKETTTPHIDSIAQSGAWFQNGYSAA 69 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYG------QPGGLQGLTTLPQLLHDQGYVTQAIG 187 P P+RA +L+G+Y G + G +P++L +GY T +G Sbjct: 70 PVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVG 129 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH G+ + P N GF +F GFN+ + + +K + D Sbjct: 130 KWHDGDQHKFWPYNRGFQEFYGFNNGAI-------------------NNWVLKGENHTVD 170 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 + AV ++ + + +YM + + V+F+D+ PFFLY H Sbjct: 171 EWGAVHRENKR--VENSGEYM---TEAFGREAVEFIDRHKTE--PFFLYLSFNAVHGPLQ 223 Query: 308 PNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 Y R + M+D + + L K G +NT+I FTSDNG + + Sbjct: 224 APKSYTNQFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLK 283 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS--DGIVDLADLFPTALDLAGHP 420 +RG K + ++GG+ VP V WK I + + V DL T AG Sbjct: 284 GNYSFNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAGVE 343 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + +DG + + + +R Y+ N A+R +++KY Sbjct: 344 -------IKDEYKLDGRNLLPYLKNQSDFDDRN--LYWANNANIAIRDNKWKY------- 387 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + Q + +FNL DP ES+++ ++ +Q A+ Sbjct: 388 ---------------LKQAGKTYLFNLEEDPYESNNLVSQYPEKAQDMQKRHDAWQA 429 >UniRef50_A4A218 Arylsulfatase A n=2 Tax=Bacteria RepID=A4A218_9PLAN Length = 491 Score = 388 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 115/480 (23%), Positives = 198/480 (41%), Gaps = 61/480 (12%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 + K PN+V+F +D++G D+G G + + TP ID +A++G TS Y Sbjct: 31 AAQSADAKPPNIVLFFVDNLGTGDIGCYGSTL---HRTPHIDRLAAEGAKFTSFYVASGV 87 Query: 136 SSPTRATILTGQYSIHH-------GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 +P+RA ++TG Y + G+ + G TT+ ++LH GY T GK Sbjct: 88 CTPSRAALMTGCYPLRVDMHKSGEGVAVLRPLDTKGLNPKETTMAEVLHSVGYATGIFGK 147 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G+ E P GFD F G DM + R + +LP +D Sbjct: 148 WHLGDQPEFLPTQQGFDTFFGIPYSDDMTKDL--------------RPQLWPELPLMRD- 192 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 +Q I P + L +R + + F+++ ++PFF+Y P Sbjct: 193 --------EQVIE--APVDRDLLVKRCTEEAIAFIEQ--NQERPFFVYIPHTMPGSTKRP 240 Query: 309 --NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 + + G S YGD + E++ + +TL++ + TL+++TSDNG PP Sbjct: 241 FSSPAFQGKS-KNGPYGDSVEELDWSTGQVMETLKRLDLDEQTLVIWTSDNGAPHRNPPQ 299 Query: 367 G-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKV 424 G P++G +T EG +R+P + W G I + +D + DL PT LAG +K Sbjct: 300 GSNLPYQGDGYNTSEGAMRMPCVMRWPGKISAGQINDALCTTMDLLPTFGKLAGATMSK- 358 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNG---QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 T IDG + + LG + + K ++ +L A+R +K ++ + Sbjct: 359 -------TEIDGHEISRILLGESDTASPWDDKGFAFYYMDQLQAIRAGRWKLYLPLD--- 408 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + +++++ D E + H + L + Sbjct: 409 ----PKTGLRLPPAASKEGNVALYDVRNDVHEDQEVSAEHPDVVAHLTDLAQQIRREIGD 464 >UniRef50_A6DG39 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG39_9BACT Length = 473 Score = 387 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 117/480 (24%), Positives = 200/480 (41%), Gaps = 56/480 (11%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 +KPN+V+FL DD+G+ D G P ID +A +G+ T A+S + +P+R Sbjct: 22 EKPNIVIFLADDLGYGDCGAFNSQ--SKIKMPHIDRLAEEGMRFTDAHSASATCTPSRYG 79 Query: 143 ILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ--- 198 +LTG + G+ + G+P + TL LL + Y T +GKWH+G +S+ Sbjct: 80 LLTGINPVRTGVFNTLLKTGRPIIHKDEMTLADLLKVEDYETWMVGKWHLGFENKSKSLD 139 Query: 199 --------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 P + GFD F G +P + + + + D + Sbjct: 140 LSQDLRGGPLDCGFDYFFGLA---------SSASSSPLCFIKNRKIQEVSSEFVEVDKIR 190 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK--SDKPFFLYYGTRGCHFDNYP 308 + IA +ED+ R + V + + AK ++PF LY+ + H P Sbjct: 191 GSGQKSKYKIAVPKDLKLEDVSPRLSENAVGLIQEYAKSAKEQPFLLYFASIAPHQPWVP 250 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP-------EA 361 + + G S Y D +++M+D + + L+ G NT+++FTSDNG A Sbjct: 251 SENFKGKS-GLGVYADFVMQMDDELGQINQALKDTGLEKNTIVIFTSDNGTGPGAHYLMA 309 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHP 420 E H P RGAK S++EGG R+P W G+I +S +++ D+F T +L Sbjct: 310 EQGHHSSGPMRGAKASSYEGGHRMPFIAKWPGIIPVNSQSKAVINATDIFATIAELLKVD 369 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + V D + N + +R + ++RM ++K Sbjct: 370 LKEKYPQVAP----DSFSFYKNLINLNQKQSRPSMVV-----RESIRMGDWKL------- 413 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 SG + F ++ + ++NL +D E + + H + E +M+ K Sbjct: 414 ---ISSGGKKEFDS--LKMSQFKLYNLSSDLAEKNDLAPSHPERAQEMYKEFKKFMDQRK 468 >UniRef50_D2R783 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R783_9PLAN Length = 505 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 125/512 (24%), Positives = 199/512 (38%), Gaps = 66/512 (12%) Query: 70 ETQQKLAELEKKTGK--KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLIL 127 T + AE E K +PN+V+ DD+GW D+G PTP++D +ASQGL L Sbjct: 17 ATNLRGAETESARAKPARPNIVILYADDMGWGDLGAQNPD--SKIPTPNLDRLASQGLRL 74 Query: 128 TSAYSQP-SSSPTRATILTGQYSIH--HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQ 184 T A+S +P+R +L G+Y HGI+ + Q T+ +LL +GY T Sbjct: 75 TDAHSSSGICTPSRYALLHGRYHWRKFHGIVNS--FDQSVMDDERVTMAELLKTEGYKTA 132 Query: 185 AIGKWHMGENK----------------------------ESQPQNVGFDDFRGFNSVSDM 216 IGKWH+G + P + GFD + G + Sbjct: 133 CIGKWHLGWDWNAIKRPGAKGGAQGTGFAAEDFDWSKPIPGGPLSHGFDYYYGDD----- 187 Query: 217 YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWM 276 + P DR + + + A E + + ++ Sbjct: 188 -----VPNFPPYAWFENDRIVVPPTVRVTTTEPTAEGNWEARPGPAVKDWDFWNVMPTLT 242 Query: 277 DYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANL 336 D V +++K K+D+PFFLY+ H P ++ G S A +GD M + + + Sbjct: 243 DKAVAWINKQ-KADEPFFLYFPFTSPHAPIVPTKEFTGKSQA-GGFGDFMTQTDATVGRV 300 Query: 337 YKTLEKNGQLDNTLIVFTSDNGPE-------AEVPPHGRTPFRGAKGSTWEGGVRVPTFV 389 + L+K G +NTL++FT+DNGPE + P RG K WEGG RVP + Sbjct: 301 LEALDKQGLAENTLVIFTADNGPEHYAYERVRKFEHRSMGPLRGLKRDLWEGGHRVPMVI 360 Query: 390 YWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 W + K SDG++ DL T + V +P + D +Q + G Sbjct: 361 RWPKHVPAGKVSDGLMSQIDLLATIATI-------VDAEIPAGSADDSYNQLPLWTG-TA 412 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLY 508 S R + N A+R + + + +G ++NL Sbjct: 413 PSARDTLVHNTNAGGYAIRHGHWVLIDAKSGGVSKV-PAWFDEASGYTANKQPGELYNLQ 471 Query: 509 TDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 D + ++ H L+ + E + Sbjct: 472 DDLAQKHNLYADHKEKVDDLKARLQTIREKGQ 503 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 123/497 (24%), Positives = 195/497 (39%), Gaps = 98/497 (19%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 KPN++ L DD+GW D+G G + + TP+ID AS + TSAY+ SP+R+T Sbjct: 23 PDKPNILFILADDMGWSDLGCYGADL---HETPNIDRFASGAVRFTSAYAMSVCSPSRST 79 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQG---------------LTTLPQLLHDQGYVTQAIG 187 ++TG+++ + Q GG + T+ L GY+T IG Sbjct: 80 LMTGKHAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIG 139 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+ E P+ GFD G + T W P+S Sbjct: 140 KWHLGD-WEHYPEAHGFDINIGGTNWGAPQTFW---------------------WPYSGS 177 Query: 248 DVHAVRGGEQQAIADIT-PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 H G E + I + E L R D +K +D D+PFF+Y H Sbjct: 178 GTH---GPEFRYIPHLEYGHPGEYLTDRLTDEAIKVID--HAGDQPFFVYLAHHAVHTPI 232 Query: 307 YP--------NAKYA-GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 +AKY G + T Y E+++ + + L++ G NT+++F SDN Sbjct: 233 EAKADDIQHFDAKYRDGMNHRHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDN 292 Query: 358 GPEAEV--------PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLAD 408 G V P P R KG+ +EGG+RVP + W G+ D V L D Sbjct: 293 GGYIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTD 352 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE------HYFLNGK 462 + T L + G P P T DG+D + + + NR A +Y Sbjct: 353 MLQTFLHITGQP--------PATDATDGMDISPLLKDPSAKLNRDALFFHYPHYYHTTTP 404 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 ++A+R ++K + + ++NL D E + Sbjct: 405 VSAIRARDWKLLEFYEDNHL--------------------ELYNLRNDLSEKHDLAKEMP 444 Query: 523 PMGVPLQTEMHAYMEIL 539 L+ +++A+ + + Sbjct: 445 DKAAALRDQLNAWRDSV 461 >UniRef50_A6DQE3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQE3_9BACT Length = 489 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 120/498 (24%), Positives = 197/498 (39%), Gaps = 75/498 (15%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSP 138 T K PN+V DD+G+ DV + TP ID VA QG+I T +S +P Sbjct: 18 ANTDKLPNIVYIYADDLGYGDVSCLNPNGLIS--TPSIDKVAQQGMIFTDCHSSASVCTP 75 Query: 139 TRATILTGQYSIHHGILMPPMYG--QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK- 195 +R +++TG+YS + + G + G T+ LL + GY T IGKWH+G N Sbjct: 76 SRYSLMTGRYSWRSSLKKGVLTGYKKAIIEDGRMTVASLLKENGYNTAMIGKWHLGMNWA 135 Query: 196 ---------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 + P + GFD F G ++ D P + + DR+ Sbjct: 136 LNSKNNKKIDYSRAIKKTPTSNGFDYFYGISASLDF---------PPYIYIENDRA---V 183 Query: 241 QLPFSKDDVHAVRG-GEQQAIADITPKY-MEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 P D+ +G I PK+ + ++ + +++K +KPFFLY+ Sbjct: 184 GEPTEHIDLSFNQGIDRHGRPGPIEPKFKVNNVLTELTQKTTAKISELSKQEKPFFLYFS 243 Query: 299 TRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 H P ++ G S YGD ++E + + K ++ N NTL++ +SDNG Sbjct: 244 LTSPHTPCAPADEFIGKSSL-GLYGDFVMETDYRIGQVIKAIKDNDIEHNTLVIISSDNG 302 Query: 359 -----PEAEVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADL 409 G P FRG KGS +EGG RVP V W ++ +D V Sbjct: 303 CATYIGHEAFQTKGHYPSYIFRGYKGSLFEGGHRVPYIVKWPAKVKAGALNDTPVSQVGF 362 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMD 469 T ++ G +P D V L N + ++ + A+R + Sbjct: 363 LATCAEIVGAE-------LPDNAGEDSVSNLPAMLSLNKKPIWESFIHKNGRGGLAIRHN 415 Query: 470 EFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 E+K + T +++NL D +E ++ +++ + L Sbjct: 416 EWKLIL-----------------------TKVPALYNLKNDIKEQKNLALQYPEIVSRLT 452 Query: 530 TEMHAYMEILKKYPPRAQ 547 + Y++ + P Q Sbjct: 453 KLLQKYVDDGRSTPGEKQ 470 >UniRef50_B4D681 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D681_9BACT Length = 536 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 121/517 (23%), Positives = 197/517 (38%), Gaps = 72/517 (13%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 L + PN++ L DD+G+ DV TP++D + G+I T A+S Sbjct: 24 ALPRAHAANPNIIYILCDDLGYGDVKCLNAE--GKIATPNMDRLGKAGMIFTDAHSSSAV 81 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 SPTR I+TG+Y+ + + G L QG T+ +L + GY T IGKWH+G Sbjct: 82 CSPTRYGIITGRYNWRSPLQSGVLGGLSPRLIEQGRMTVASMLKEHGYATACIGKWHLGM 141 Query: 194 NK----------------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHV 225 + ++ P +VGFD + G ++ DM + Sbjct: 142 DWAKLPGKDVTELSVEKPDQVHNVDYAAPIKNGPNSVGFDYYYGISASLDMVPYTFIEND 201 Query: 226 NPEVALSPDRSEYIKQLPFSKD-DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLD 284 + V + D K PF++ + H R G D+ V ++ Sbjct: 202 HVTVLPTVD-----KSFPFTEGRESHPTRPGPAAP-----GFEPRDVLPTLTRKAVDYIG 251 Query: 285 KM---AKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLE 341 + A++ KPFFLY H P+A++ G S + Y D ++E + + + LE Sbjct: 252 QRTNDAQNGKPFFLYLPLNSPHTPIAPSAEWQGKS-GISPYADFVMETDWAIGEVLRVLE 310 Query: 342 KNGQLDNTLIVFTSDNGPE-----AEVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKG 393 + G DNT++ SDNG AE+ G P FRG K ++GG +P V W Sbjct: 311 EKGLADNTIVFMASDNGCSPSADFAELAEKGHHPSYVFRGHKADIFDGGHHIPFLVRWPA 370 Query: 394 MIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 I+ SD +V L D T D+ G +P D G + Sbjct: 371 KIKAGSTSDQVVCLTDFMATCADVLGI-------KLPDNAAEDSASLLPVLEGKADKPIH 423 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS--SVFNLYTD 510 +A + A+R +K + + G+ T ++++ D Sbjct: 424 EAVVHHSVNGSFAIRQGNWKLEL------CPSSGGWSDPRPKTAAANKLPAVQLYDMSAD 477 Query: 511 PQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 E ++ H + L M ++ + P Q Sbjct: 478 IGERKNVEAEHTEVVDRLIRLMEKFVADGRTTPGARQ 514 >UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FU81_9SPHI Length = 461 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 121/481 (25%), Positives = 201/481 (41%), Gaps = 67/481 (13%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA-YSQPSSSPTR 140 +KPN++ L DD+G+ D+G G TP +D +A++G+ T + PS +P+R Sbjct: 20 AQQKPNIIFVLTDDLGYSDLGCYGNPSIS---TPFLDKMAAKGVRATDYMVTSPSCTPSR 76 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 A++LTG+Y+ + + P G GL T+ ++L ++GY T IGKWH+G++ E P Sbjct: 77 ASLLTGRYASRYNLPDPIGPGAKNGLPAQEVTIAEMLKEKGYHTALIGKWHLGDHGEYLP 136 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G D RD +V + + R++ Sbjct: 137 NKQGFDYFYGMLYSHDY----RDPYVKTDTTIKIFRNQ---------------------- 170 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 +T L + + + +++ + K + PFFLYY H +A+ Sbjct: 171 TPVVTRPADSALSRIYTEEVKQYISQQKKGE-PFFLYYAHNMPHLPVAFSAESGRMKDLH 229 Query: 320 --TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP------- 370 G + +++ A ++ +LE+ G DNT+ +F+SDNGP E P Sbjct: 230 FAGPLGAVLEDLDRQLAIMWASLEEQGLADNTIFMFSSDNGPWIEYPVRMSGDHKTKNWH 289 Query: 371 ------FRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 FRG+K T+EGGVRVP YWKG + + D+ PT + G Sbjct: 290 VGTAGVFRGSKAQTYEGGVRVPFITYWKGHTPEGITLRNAISNVDILPTLAEWTGAS--- 346 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNG--QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 VP + +DG + + ++ + + +GK+ AVR +KY L Sbjct: 347 ----VPASRTLDGQSIAALLTSKSENITADHRPIYLVNHGKVEAVRKGSWKYRELPAGVN 402 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + Y+ A +FN+ DP E ++ L+ + L Sbjct: 403 NNSGKPYE----------AAKELFNISYDPSERTNVISEFPEKAQELKVLFDNFDASLDT 452 Query: 542 Y 542 Y Sbjct: 453 Y 453 >UniRef50_Q7UNN1 Arylsulphatase A n=3 Tax=Bacteria RepID=Q7UNN1_RHOBA Length = 529 Score = 386 bits (992), Expect = e-105, Method: Composition-based stats. Identities = 121/514 (23%), Positives = 210/514 (40%), Gaps = 66/514 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 L+E +PNV+V + DD+G+ D+G G A G TP+ID +AS+G TS Y S Sbjct: 34 LSETSAADNDRPNVIVVMADDLGYGDIGCYG---AKGLETPNIDQMASEGCRFTSGYCSA 90 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG 192 + +PTR + LTG Y+ P + G TT ++L + GY T IGKWH+G Sbjct: 91 STCTPTRYSFLTGTYAFRFPNTGIAPPNSPALIPAGTTTTARILKNAGYKTAVIGKWHLG 150 Query: 193 ENKES-----------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRS 236 +++ P +GFD + +D + +++P L Sbjct: 151 LGEKNEGPDWNGDLKPGPLEIGFDHCILLPTTNDRVPQVYVNDHNVENLDPADPLWVGNK 210 Query: 237 EYIKQLP------------FSKDDVHAVRGGEQQAIADITPK----YMEDLDQRWMDYGV 280 + + P +S + G + EDL RW++ Sbjct: 211 KPSEDHPTGITHRDTLKMDWSHGHNSTIHNGISRIGFYTGGHAARFRDEDLSDRWVEESK 270 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 +++ + ++PFFL++ + H + ++ GS+ GD + E++ L K+L Sbjct: 271 RWIAE--NREEPFFLFFASHDLHVPRVVHERFQGSTKL-GPRGDAIAELDWCVGELMKSL 327 Query: 341 EKNGQLDNTLIVFTSDNGP----------EAEVPPHG-RTPFRGAKGSTWEGGVRVPTFV 389 E+NG + T++VF SDNGP ++ H P++G K + +EGG R P Sbjct: 328 EENGLTEKTMLVFCSDNGPVLDDGYKDDANEKLGNHDPNGPYQGGKYTVYEGGTRTPFIT 387 Query: 390 YWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 G I SD +V D + + G +P +D + + +G Sbjct: 388 RMPGTIPVGVSDEMVCTIDFAASLAAMVG-------QELPNDASLDSQNVLGALMNQSGA 440 Query: 450 SNRKAEHYFLNGKL--AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNL 507 S R+ NGK+ R+ ++K Q + Y + T +++NL Sbjct: 441 SGREHLVQQDNGKVGNYGYRVGDWKLVRHDQ------KKSYNFDLSMTRKPVPQFALYNL 494 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +DP E + + +Q E+ ++ + Sbjct: 495 ESDPAEQNDLSDSEPERAKQMQQELQKLLDAGRS 528 >UniRef50_A8G0H1 Sulfatase family protein n=5 Tax=Gammaproteobacteria RepID=A8G0H1_SHESH Length = 517 Score = 386 bits (992), Expect = e-105, Method: Composition-based stats. Identities = 148/495 (29%), Positives = 232/495 (46%), Gaps = 47/495 (9%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 +PNVV +LDDV MD+ TP+ID +A +G++++ Y+Q SS+ Sbjct: 21 SAASTQPNVVAIMLDDVTTMDISAY-HRGLGAVSTPNIDRIAERGMMVSDYYAQGSSTAG 79 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 R+ +TGQY I G+ G GLQ TL ++L D+GY T +GK H+G+N + Sbjct: 80 RSAFITGQYPIRTGLTSVGQPGSTRGLQKEDPTLAEMLKDKGYATVHVGKSHLGDNNDHL 139 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK-----DDVHAVR 253 P GFD+F GF + ++H PE P+ + + + DD R Sbjct: 140 PTVHGFDEFYGFL----YHLNVMEMHEQPEFPKDPNFKGRGRNMIHTVATDKFDDTVDPR 195 Query: 254 GG--EQQAIAD---ITPKYMEDLDQRWMDYGVKFLDKMA--KSDKPFFLYYGTRGCHFDN 306 G +Q I+D + K M+ +D ++D+ + +L+K D+P+F++Y H Sbjct: 196 FGVIGKQTISDQGELGAKRMQTVDGEFLDFAINWLEKHEATNDDQPYFMWYNPTRMHQKT 255 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP- 365 + +Y G+S +Y D +VE++D L LE G++DNT+I+FTSDNG + P Sbjct: 256 HVRPEYQGASQ-HNTYYDGLVELDDQIGVLLDKLEATGEIDNTIILFTSDNGVNLDHWPD 314 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKV 424 G FRG KG+TW+GG RVP V W I + +DG++ D PT + AG K Sbjct: 315 SGAASFRGQKGTTWDGGFRVPMLVSWPAKIPQGEYTDGLMSAEDWVPTIMAAAGDADIKQ 374 Query: 425 ANLVPK-------TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 L K IDG +Q G+SNR ++ L A R+DE+K H+ Sbjct: 375 DLLTGKKINDETYKVHIDGYNQLDMLT-EGGKSNRHEFFFYNENSLNAFRVDEWKVHLKT 433 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS------IGVRHIPMGVP-LQT 530 + + + G + N+ DP E + ++ +P L Sbjct: 434 KTEWIAPADEWPLGM-----------ILNIKADPFERSPDTRGWFLWMKEKTWVLPKLLK 482 Query: 531 EMHAYMEILKKYPPR 545 + + + LK +PPR Sbjct: 483 AVGKHQQSLKAFPPR 497 >UniRef50_B9XCM3 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XCM3_9BACT Length = 565 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 144/529 (27%), Positives = 220/529 (41%), Gaps = 69/529 (13%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSP 138 KKPN++ + DDVGW ++G G+ G TP++D +ASQG+ T Y++ S + Sbjct: 29 APAQAKKPNILFIMGDDVGWFNIGAYHQGIMSG-KTPNLDKLASQGMRFTDYYAEASCTA 87 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMGENKES 197 RA +TG+ + G+ G G+ TL L QGY T GK H+G+ + Sbjct: 88 GRANFITGEIPLRTGLTTVGQAGADVGIPDKACTLATALKAQGYATGQFGKNHLGDLNKY 147 Query: 198 QPQNVGFDDFRGFNSVSDMYTE--WRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV-RG 254 P GFD+F G+ D ++ W V+ + +DD + R Sbjct: 148 LPTLHGFDEFFGYLYHLDALSDPYWYSFPVDEAYYNKFGPRSVVHCWATDQDDTTEMPRW 207 Query: 255 GE--QQAIADITP----------------------KYMEDLDQRWMDYGVKFLDKMAKSD 290 G+ +Q + D P M D+ + + F+DK K Sbjct: 208 GKVGKQKVVDEGPLPPFPDMSNVPNMHDLPFLKAKYDMTTFDEVLVKSSIDFMDKAKKDG 267 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYG---DCMVEMNDVFANLYKTLEKNGQLD 347 KPFF+++ + H + KY+ +++++G M +++D L K L+ G+ D Sbjct: 268 KPFFVWHNSTRMHVWTFLAKKYSAMQNSKSNFGLEEAGMAQLDDNVGALLKHLDDMGEAD 327 Query: 348 NTLIVFTSDNGPEAEVPPHGR-TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVD 405 NT++VFT+DNG E P G TPF+ KG+ EGG RVP W G I+P +GI Sbjct: 328 NTIVVFTTDNGAEVFTWPDGGMTPFKATKGTVGEGGFRVPCIARWPGHIKPGTVENGIFS 387 Query: 406 LADLFPTALDLAGHPGAKVA-------NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 D FPT AG+ +DG +Q + G S R YF Sbjct: 388 GLDWFPTLCAAAGNTDITDQLLKGVKFGDREYKNHLDGYNQMALLEDK-GPSARHELFYF 446 Query: 459 LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES---- 514 L AVR+D+FK+ QQP+ G+ G + T ++ N+ DP E Sbjct: 447 GGPHLGAVRLDDFKFQFY-QQPW---------GWPGEKVTTDMPTLVNIRQDPFERTPST 496 Query: 515 -------------DSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 + R V +Q E+ + YPP S Sbjct: 497 RGQSLNDLGGGYMNDFFAREFWRFVLVQQEVAKLAKTAIDYPPMQDPAS 545 >UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTV0_9BACE Length = 459 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 120/486 (24%), Positives = 189/486 (38%), Gaps = 81/486 (16%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + +E K+PN V+ + DD+G+ DVG G TP+ID +A +G++ T +S Sbjct: 17 AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYI---KTPNIDQIAREGMMFTDFHSN 73 Query: 134 -PSSSPTRATILTGQYSIHHG------ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAI 186 SSPTR +LTG+Y G + + G T ++L D GY T I Sbjct: 74 GSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEVGLPSEEITFAKILGDNGYRTALI 133 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 GKWH+G ++ P N GF F GF S + Y R+ + + + + Sbjct: 134 GKWHLGYLQKHHPMNFGFQKFVGFKSGNVDYQSHRNRYGDMDW--------------WDG 179 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 ++ + G + ++ Y+++ DKPF LY H Sbjct: 180 LEMKDMSGYTTTLLTTLSEDYIKE-----------------NKDKPFCLYIAHAAPHSPM 222 Query: 307 ------------YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFT 354 P + Y D + E++ + +TL+K +NT +VF Sbjct: 223 QGPDEKAVRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFVVFF 282 Query: 355 SDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTA 413 SDNGP ++GAKGS WEGG RVP Y G I+ + V DLFPT Sbjct: 283 SDNGPVINNG-GSAGGYKGAKGSPWEGGHRVPGICYMPGTIKEGTTCEQTVMSFDLFPTM 341 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKY 473 LD+A + +DG F G N + + K +VR ++K Sbjct: 342 LDMADI------HYDDSKKKLDGTSLVPLFKGENLAP--RLLFWGNGNKTISVRDGKWKL 393 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 Q+ +F+L DP E +++ + + L E+ Sbjct: 394 VRYNQKGGITLH------------------LFDLNNDPYEKNNLSKQEPELVERLDKEIT 435 Query: 534 AYMEIL 539 + E + Sbjct: 436 RWAESV 441 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 118/496 (23%), Positives = 194/496 (39%), Gaps = 78/496 (15%) Query: 60 PVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDA 119 P +Q + +++ T ++PNV+V L DD+ D+ GG TP++D Sbjct: 36 PAVQDGDANAKSE------SDATSRRPNVIVILADDLAVGDLA---GGDGSPTRTPNLDR 86 Query: 120 VASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQP---GGLQGLTTLPQL 175 AS+ + + AYS +P RA +LTG+Y G++ M P + TT+ + Sbjct: 87 FASESIQFSQAYSGSCVCAPARAALLTGRYPHRTGVVTLNMNRYPEMTRLRRDETTIADV 146 Query: 176 LHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR 235 L D GY T +GKWH G P + GFD+F GF D+ Sbjct: 147 LKDAGYATGLVGKWHTGRGDGFHPLDRGFDEFEGFFGSDDVGYF---------------- 190 Query: 236 SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 + PFS EQ+ I+D+ Y+ D R ++F+ + + PFFL Sbjct: 191 -----RYPFS----------EQRQISDVDESYLTDDLNR---RAIEFVRRHHEH--PFFL 230 Query: 296 YYGTRGCHFDNYPNA------KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 + H + G + + + M+ L ++ G ++T Sbjct: 231 HLAHYAPHRPLEAPPEVIARYREQGFDESTATIYAMIEVMDRGIGELLAEIDDLGLSEDT 290 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADL 409 +++F SDNGP+ RG K EGG+RVP FV W + P + D +V DL Sbjct: 291 IVLFASDNGPDPLTGERFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDL 350 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-----KLA 464 PT LDL + + +DG + + + A Sbjct: 351 MPTILDLCRVDVSMLNR-------LDGESFVPVLEDASIAHSTMRFWQWNRASPNYTHNA 403 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 AVR +K +PY + + T S +F+L DP ES + ++ + Sbjct: 404 AVRHGRYKLV----RPYVTRGAKLKDS-------TEPSVLFDLQNDPTESRDVSKQYPDI 452 Query: 525 GVPLQTEMHAYMEILK 540 + E+ + ++ Sbjct: 453 AERMSRELDRWSASVE 468 >UniRef50_B0UGK6 Sulfatase n=18 Tax=Bacteria RepID=B0UGK6_METS4 Length = 569 Score = 385 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 136/493 (27%), Positives = 206/493 (41%), Gaps = 26/493 (5%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 + +KPN++ + DD G+ D+G GGG G PTP+ID +A G+ S Y+QPS + Sbjct: 41 AQAPQQQKPNILFIVSDDTGYGDLGPYGGGEGRGMPTPNIDRLAEDGMTFFSFYAQPSCT 100 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 P RA + TG+ G+ GQ GGL TL +L GY T GKWH+GE Sbjct: 101 PGRAAMQTGRIPNRSGMTTVAFQGQGGGLPAAEWTLGSVLKQGGYKTYFTGKWHLGEADY 160 Query: 197 SQPQNVGFD--DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 + P G+D + G ++ + + L + K AV Sbjct: 161 ALPNAQGYDVMQYCGLYHLNAYTYADPTWFPDMDPELRAMFQRVTRGALSGKAGEKAVED 220 Query: 255 GEQQAIADITPKY--------MEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 + TP + D + FLD AK+ PF++ H N Sbjct: 221 FKVNGQYVNTPVVDGKAGVVGIPFFDSYVEKAALGFLDDAAKAGSPFYINVNFMKVHQPN 280 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-VPP 365 P ++ S +++ Y D +VE++ + L G NTL+ +T+DNG + P Sbjct: 281 MPAPEFEHKSLSKSKYADSVVELDARIGRIMDKLRSLGLDKNTLVFYTTDNGAWQDVYPD 340 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKV 424 G TPFRG KG+ EGG RVP W G I+P + IV DL T +AG Sbjct: 341 AGYTPFRGTKGTVREGGNRVPAMAVWPGKIKPGTKNHDIVGGLDLMATFASVAGLTLPD- 399 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA--AVRMDEFKYHVLIQQPYA 482 + + D D + LG G+S RK+ YF +L+ AVR+ +K ++ Sbjct: 400 KDRDGQPMIFDSYDMSPVLLG-TGKSARKSWFYFTEDELSPGAVRVGNYKAVFNLRGDDG 458 Query: 483 YTQSGYQ-----GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI----PMGVPLQTEMH 533 G + +F+L+ DPQE + + + V + + Sbjct: 459 AATGALAVDTNLGWKGSSKYVATVPQIFDLWQDPQERYDVFMNNYTERTWTLVTMSAAVK 518 Query: 534 AYMEILKKYPPRA 546 M+ +YPPR Sbjct: 519 NLMKTYVQYPPRK 531 >UniRef50_A9UPM8 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UPM8_MONBE Length = 497 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 126/454 (27%), Positives = 192/454 (42%), Gaps = 58/454 (12%) Query: 114 TPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMY-----------G 161 TP ++ +A+ G+ T YS SP+RA+++TG+YS+ GI + P Sbjct: 1 TPHLEKLAASGMTFTQWYSTFHVCSPSRASMMTGRYSVRSGIGIAPGVRALSSSIYPAQA 60 Query: 162 QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMY-TEW 220 G TT+ + L + GY T AIGKWH+G+ + P N GFD++ G DM + W Sbjct: 61 VGGLPLNETTMAEALKEAGYATAAIGKWHLGQREIFLPTNQGFDEYLGIPFSQDMGLSFW 120 Query: 221 RDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGV 280 ++ P P + + P + +L R+++ Sbjct: 121 FLNNLQPVEPYQPVPLPLLDGTDVIE-----------------QPVALSNLVHRYIERAT 163 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 F+ + +SD PFFLY H N + K+ GSS + + GD + EM+ + L Sbjct: 164 DFIKRSHESDTPFFLYLPFNHVHAPNSCSPKFCGSSE-QGAVGDAVQEMDWAIGRIMSYL 222 Query: 341 EKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RK 399 EK G ++TL FTSDNG G R K S WEGG +VP +W GMI+ + Sbjct: 223 EKLGLENDTLTFFTSDNGAPLLQDGAGNGVLRDGKASMWEGGFKVPALAHWPGMIKGNQV 282 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 S + AD++PT + AG P +P DG+D + LG G + ++ Sbjct: 283 SHELTSTADIYPTLMHFAGVP-------LPSDRVYDGIDLSDVLLGKEGAKGHECIMFYH 335 Query: 460 N-------GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 N G+L AVR + K + +A + Q G VFNL DP Sbjct: 336 NAVAANASGELYAVRCGDMKVY------WATASTTSQPWADGPQ---EPPLVFNLTADPG 386 Query: 513 ESDSIGVRHIPMGVP---LQTEMHAYMEILKKYP 543 E+ + G L A++ + P Sbjct: 387 ETTPLTAWTEEYGATLGVLTAAKEAHLATITPVP 420 >UniRef50_A6DP41 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DP41_9BACT Length = 534 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 128/508 (25%), Positives = 197/508 (38%), Gaps = 66/508 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 + +KP+++ L+DD+G DV + TP IDA+A G++ T ++ +PT Sbjct: 18 QAMEKPHIIYVLMDDMGQGDVSCFNP--SSKIHTPQIDALAKNGMMFTDTHTNSSVCTPT 75 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 R ILTG+Y+ + + G L G TL LL QGY T IGKWH+G + Sbjct: 76 RYGILTGRYAWRTHLKKSVIGGTSPSLIKPGRMTLASLLKGQGYHTGMIGKWHLGWDFSF 135 Query: 198 QPQN----------------------------VGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 P + GFD + S D+ + Sbjct: 136 HPDSVKIDPLYWGYTPGTKIDYAKGVENGPDVHGFDYYYSIPSSLDIPPYVYVENGRVTN 195 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 +R ++ RGG A DI ED+ + +F+ K AKS Sbjct: 196 LDISERK--------GEEGKRLWRGGPMSADFDI-----EDVTPNFFRRANQFIAKNAKS 242 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 DKPFFLY H P K+ G S Y D +++++ L KTL+ N DNT Sbjct: 243 DKPFFLYLPLPSPHTPILPIKKFQGKSGVNE-YADFILQIDSHMGELIKTLKDNNIFDNT 301 Query: 350 LIVFTSDNGPEA-----EVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWK--GMIQPRK 399 L+VFT+DNG E+ G P FRG K +EGG RVP V W G+ Sbjct: 302 LLVFTADNGISPRADIVEINNAGHFPSNGFRGRKADIFEGGHRVPYIVTWPNGGVQAGSV 361 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 S+ + D+ T D+ + +P+ D + R A + Sbjct: 362 SEQTICTTDMLATLADI-------LEVKLPENAGEDSYSTLPLLINRPYDFKRPATVHHS 414 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 A+R ++K + ++NL +DP E+ ++ Sbjct: 415 INGSYAIRQGDWKLIFCAGSGGWPKSDLT--PEMASAQGLPVIQLYNLKSDPAETVNLYA 472 Query: 520 RHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 ++ + L +M Y++ + P AQ Sbjct: 473 KYPHIVDRLTVQMQKYIDEGRSTPGEAQ 500 >UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Planctomycetaceae RepID=Q7UJQ8_RHOBA Length = 491 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 113/506 (22%), Positives = 192/506 (37%), Gaps = 76/506 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 K+PN+V L DD+G+ D+G G + TP +D +A++G+ T Y+ Sbjct: 26 PSTSAADAKRPNIVFILADDLGYGDLGCYGQELI---QTPRLDQMAAEGMRFTDFYAGNT 82 Query: 135 SSSPTRATILTGQYSIHHGILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 +P+R+ ++TG + H + P + T+ ++L GY T GKW + Sbjct: 83 VCAPSRSVLMTGMHMGHTHVRGNAGGPDMSKQSLRDENVTVAEVLQSAGYATALCGKWGL 142 Query: 192 GEN----KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 G++ ++ P+ GFD F G+ + + + PE + ++ +D Sbjct: 143 GDDALGGRDGLPRKQGFDHFYGYLNQVHAHNYY------PEFLWRNETKVALRNEVQRRD 196 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS--DKPFFLYYGTRGCH-- 303 + GG A Y DL + + F+ + A KPFFLY H Sbjct: 197 RSY---GGFTGGWATKRVDYSHDL---IANEAMGFIREKATDAATKPFFLYLSLTIPHAN 250 Query: 304 ------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 +Y S + M+ + L++ + T++ Sbjct: 251 NEGTGMSGNGQEVPDYGIYADKDWSDQDKGQAAMITRMDSDVGRILDLLKELQIDEQTVV 310 Query: 352 VFTSDNGPEAEVPPHGR-----TPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVD 405 +F+SDNGP E + + P RG K + EGG+RVP V W G P SD I Sbjct: 311 MFSSDNGPHNEGGHNPKKFDPAGPLRGMKRALTEGGIRVPLIVRWPGTTPPGAVSDHIGY 370 Query: 406 LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT-NGQSNRKAEH--YFLNGK 462 DL TA +LAG + A D + +G Q + + ++ G Sbjct: 371 FGDLMATAAELAGTDFPEDA---------DSISFAPTIVGRPEAQQTHEYLYWEFYEQGG 421 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 AVR +K I++P+ T + +++L D E+ ++ H Sbjct: 422 RQAVRRVNWKA---IREPWM----------------TGPTQLYDLKADIGETTNLASDHP 462 Query: 523 PMGVPLQTEMHAYMEILKKYPPRAQI 548 + L+T M + R Sbjct: 463 EIVKQLETLMEEAHTPHPNWQVRVPA 488 >UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA Length = 512 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 117/504 (23%), Positives = 202/504 (40%), Gaps = 64/504 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPT 139 +T PNV++ DD+G+ D+ PTP +D +A G+ T +S +P+ Sbjct: 31 ETKTPPNVLILYADDLGYGDLNLQNAE--SKIPTPHLDQLARSGMRFTDGHSSSGICTPS 88 Query: 140 RATILTGQYSIH--HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 R +LTG++ HGI+ +G+ TLP++ GY T AIGKWH+G + ++ Sbjct: 89 RYALLTGRHHWRDFHGIV--NAFGESVFEPEQLTLPEMFQQHGYQTAAIGKWHLGWDWDA 146 Query: 198 ------------------------------QPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 P GFD + G ++ W + + Sbjct: 147 IKKPDAKTFGEGRKKGYGPEAFDWTKSIPDGPLAHGFDSYFGDTVINFPPYCWIE---DD 203 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 +V +PD + K+ R G + D GV+F++ Sbjct: 204 KVVKAPDTIMDTAKWKPIKEGNWECRPGPMTSDWDPYQNIPTT-----TARGVQFIESQK 258 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 +SD+PFFLY+ H PN ++ G S A YGD + E +D L + L+++GQ + Sbjct: 259 ESDQPFFLYFAFPAPHAPIIPNDEFDGRSGA-GPYGDYVCETDDACGKLLRALKESGQSE 317 Query: 348 NTLIVFTSDNGPEA-------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRK 399 NT+++F++DNGPE + PFRG K +EGG VP ++W G+ Sbjct: 318 NTIVIFSADNGPERYAYARDEKYDHWSSQPFRGLKRDLYEGGHHVPFVIHWPGVTDSGST 377 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 D +V D+F T ++ GH +P D Q +R++ Sbjct: 378 CDALVSQVDIFATLAEMLGHS-------IPDGQAKDSRSLMPLLK-EPKQQHRQSLVQNT 429 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 + A+R ++ + G++ +++L D +S+++ Sbjct: 430 RVDVYAIRDGKWLLIDAKSGYVSGRNKGWESRRQIPADDKLPHELYDLSVDIGQSENVAG 489 Query: 520 RHIPMGVPLQTEMHAYMEILKKYP 543 H + ++ + E YP Sbjct: 490 EHPEIVERMKALLQTIREDG--YP 511 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 110/479 (22%), Positives = 180/479 (37%), Gaps = 92/479 (19%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + ++PNV++ + DD GW DV + + TP D +ASQG Y P +PTR Sbjct: 26 QAAERPNVLLIMTDDQGWGDVRSHDNPLI---ETPQQDLLASQGARFERFYVSPVCAPTR 82 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +++LTG+YS+ G+ G TT+ ++ GY T A GKWH G + P Sbjct: 83 SSLLTGRYSLRTGV-HGVTRGFENMRAEETTIAEMFKAAGYKTGAFGKWHNGRHYPMHPN 141 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD+F GF + + D ++ E Sbjct: 142 GQGFDEFFGFCGGH--WNRYFDTNL------------------------------EHNKQ 169 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS----- 315 T Y+ D+ D + F+ + D+PFF Y H KY Sbjct: 170 PVKTEGYITDV---LTDRAIDFIKQ--NKDQPFFCYVPYNAPHSPWIVPEKYWDKYANKG 224 Query: 316 --SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 AR +Y + ++D L +TL+ DNT+++F +DNGP + RG Sbjct: 225 LDDKARCAY-AMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNS---NRYNGNMRG 280 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 KGS EGG+RVP FV + G I+ I D+ PT L+L Sbjct: 281 RKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTA-------DQ 333 Query: 433 FIDGVDQTSFFLGTNGQ--------SNRKAEHYFLNGKL--AAVRMDEFKYHVLIQQPYA 482 +DG + + S+R + + +L +VR D ++ Sbjct: 334 PLDGKSLVPLLTNKSNKDWPQRMLFSDRLFRNSIPDDELPNGSVRTDRWRAAY------- 386 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + S++++ DP + ++ H + L + + + + Sbjct: 387 ---------------ERGKWSLYDMQADPSQKQNVIEAHPAVIKDLSAAYRDWFKDVSQ 430 >UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R457_9PLAN Length = 516 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 131/499 (26%), Positives = 198/499 (39%), Gaps = 65/499 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 PN+V L DD+G+ DVG G TP ID +A G+ L YS P +P+R + Sbjct: 32 PPNIVFILCDDLGYGDVGCFGQK---KTRTPHIDTLARDGMRLIQHYSGAPVCAPSRCVL 88 Query: 144 LTGQYSIHHGILMPPM---YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 LTG +S H + GQ +G TLP LL +GYV A GKW +G +P Sbjct: 89 LTGLHSGHSQVRDNREAQPEGQYPLAEGTVTLPGLL--EGYVCGAFGKWGLGGPESSGKP 146 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G+N + + P+ S D +K PF+ Q Sbjct: 147 LAQGFDRFFGYNCQRQAHNYY------PQHLWSNDEKVLLKNPPFAAHQKFPADADPQNP 200 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH---------------- 303 A + + + +KF+D+ + KPFFLYY + H Sbjct: 201 AAFERYRGPDYAADLISEQALKFIDEHHQ--KPFFLYYASPVPHLALQVPEDSLKEYAGE 258 Query: 304 ---FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Y R +Y + M+ + + LEK G T++VF+SDNGP Sbjct: 259 FSETPYLGERGYLPHPTPRAAYAAMITRMDREIGRILERLEKYGLQRRTIVVFSSDNGPL 318 Query: 361 AEVPPHGRTPF-------RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 + F RG KGS +EGG+RVPT V + G++ S + D PT Sbjct: 319 YDKLGGTDADFFQSALDLRGRKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDWMPT 378 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDE 470 L LAG ++ +P+ DG D + G + Q+ R+ + + G VR + Sbjct: 379 LLSLAG-----MSTKIPEQA--DGRDLSPSLRG-DWQAPREFLYREFPGYGGQQFVRSGK 430 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K + + +++L DP ES ++ H + L Sbjct: 431 WKAV----RQNLVRPVPTGKKKLAEWKEPLAIELYDLEADPTESTNVAAEHPKVVAKLHA 486 Query: 531 EMHAYMEILKKYPPRAQIK 549 M L+++ P + K Sbjct: 487 IM------LREHQPSVEFK 499 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 114/516 (22%), Positives = 193/516 (37%), Gaps = 94/516 (18%) Query: 70 ETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS 129 + + + ++PN+++ L DD+G+ D+G G TP+ID +A G+ TS Sbjct: 27 TSDAQTSTNRPPAPRRPNIILILADDLGYGDLGCYGQTQI---KTPNIDKLAEDGMKFTS 83 Query: 130 AYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 Y+ +P+RAT++TG+ + H I G T+ ++L GY T IGK Sbjct: 84 FYAGSTVCAPSRATLMTGKNTGHVNIRGNADLSLNG---EELTIAKILKLAGYATGCIGK 140 Query: 189 WHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 W +G E P GFD++ G+ + + P D ++ +++ Sbjct: 141 WGLGNEGSPGLPGRQGFDEYLGYLDQVQAHDYY------PTHLFRSDSKGEESKIALTEN 194 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA----KSDKPFFLYYGTRGCH 303 D AD Y D + + +L + FFLY H Sbjct: 195 D------------ADHKGLYSNDF---FTQSALNYLRINKPSKLNKHRSFFLYLPYTLPH 239 Query: 304 -----------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 + Y N ++ + + ++ + L+K+ Sbjct: 240 ANNELGNRTGNGMEVPSTEPYTNEQWPQVEKNK---AAMITRLDHYVGEIMDYLKKSKLD 296 Query: 347 DNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-S 400 +NT+++F SDNGP E + RG K +EGG+RVP V W ++ S Sbjct: 297 ENTVVIFASDNGPHKEGGVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSIS 356 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-- 458 D + D PTA ++A T IDG+ LG Q+NR Y+ Sbjct: 357 DAPLAFWDFLPTAAEIA---------RTSSPTNIDGISFLPTLLGK-AQTNRHQYLYWEF 406 Query: 459 -LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 G AVRM ++K ++NL TD E D++ Sbjct: 407 HEQGFDQAVRMGDWKAVRHGIN--------------------GPIELYNLKTDVSEKDNV 446 Query: 518 GVRHIPMGVPLQTEMHAYMEILKKYPPR--AQIKSD 551 ++ + + + ++P + A+IK D Sbjct: 447 ADKNPEVMAKIADYLKKARTDDPRWPAKTVAEIKED 482 >UniRef50_A6DHI4 Arylsulfatase A (ASA) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI4_9BACT Length = 511 Score = 383 bits (983), Expect = e-104, Method: Composition-based stats. Identities = 125/514 (24%), Positives = 201/514 (39%), Gaps = 85/514 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 +KPN+V DDVG+ DVG G PTP ID +A G+ T + S + SP+R Sbjct: 20 AAEKPNIVFIYGDDVGFGDVGVYGSEKI---PTPHIDKLAKGGIQFTDGHCSAATCSPSR 76 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES--- 197 +LTG ++ HG+ + P + TLP++L + GYVT +GKWH+G + Sbjct: 77 FAMLTGVHAFRHGVNILPPNAPLSIPTDIPTLPKMLRENGYVTGVVGKWHLGIGAKGVET 136 Query: 198 --------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 P +GFD S +D R + +P + R+ P Sbjct: 137 DWNGDVKPGPLEIGFDQMFLLPSTNDRVPCVYLDGHRVYNYDPNDPIYVGRTLESVNKPG 196 Query: 245 SKDDVHAVRGGEQQAI-----------------------ADITPKYMEDLDQRWMDYGVK 281 S A + E + E + +++ + Sbjct: 197 STQYGDARKNPELMTYYPSTHGHNNSVINGIGRIGFMSGGEKALWNDETMADVFVEKASE 256 Query: 282 FLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLE 341 F+ + AK DKPFFLY+ ++ H P+ ++ G++ GD MV+ + L K L+ Sbjct: 257 FIKEKAKGDKPFFLYFASQDIHVPRAPHPRFQGATKL-GKRGDAMVQFDWCTGALMKALD 315 Query: 342 KNGQLDNTLIVFTSDNGP-----------------EAEVPPHGRTPFRGAKGSTWEGGVR 384 + G DNT++ F+SDNGP E + G +RG K +EGG R Sbjct: 316 EAGVADNTIVFFSSDNGPVYDDGYADGSVTKTSSKETDHGHDGSGIYRGGKYQIYEGGTR 375 Query: 385 VPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFL 444 VP + W I+P SD +V+ DL+ + L GH + K ID D + FL Sbjct: 376 VPFIISWPAKIKPAVSDAMVNQVDLYTSFAKLVGHD-------LRKEEAIDSRDTLAAFL 428 Query: 445 GTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSV 504 G Q AVR ++K+ + + + Sbjct: 429 GEESQGL-DYMFNEARKTDHAVRQGKWKFISKGGKKKKKSND----------------EL 471 Query: 505 FNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++L DP E ++ + ++ + + Sbjct: 472 YDLEADPSEQKNVVKEFPEVAGDMKKLLEQVRKS 505 >UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4V9_9PLAN Length = 480 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 117/499 (23%), Positives = 190/499 (38%), Gaps = 82/499 (16%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 E+ G +PN++V ++DD+G+ V G TP+ID +A++G+ T +S Sbjct: 28 AAERPPGDRPNLIVIMVDDMGYAGVSCFGNPY---FKTPEIDRLAAEGMKFTDFHSSGTV 84 Query: 136 SSPTRATILTGQYSIHHGI--LMPPMYGQP----GGLQGLTTLPQLLHDQGYVTQAIGKW 189 SPTRA +LTG+Y GI ++ P+ P G + T +LL GY T IGKW Sbjct: 85 CSPTRAGLLTGRYQQRAGIEAVIHPVSDHPEHQKGLRKSENTFAELLKQAGYRTALIGKW 144 Query: 190 HMGE---NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 H G + E P N GFD F G++S + + HV Sbjct: 145 HQGYPHNSAEFHPDNHGFDTFVGYHSGNIDFISHVGDHV--------------------- 183 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 H G ++ + Y ++F+ + +PF LY H Sbjct: 184 --KHDWWHGRKET------QETGYSTHLINQYALQFIKESRN--QPFCLYLAHEAIHNPV 233 Query: 307 YPN------------AKYAGSSPART--SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 ++ +S A + + ++ + + L K+G NT ++ Sbjct: 234 QVPGDPIRRTEAAGWKRWKPASEAERIEKFRGMTLPVDAGVGQIREFLVKSGLDKNTFVL 293 Query: 353 FTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFP 411 F SDNGP + P G +RGAKGS +EGG RVP +W G IQ +D D+ P Sbjct: 294 FFSDNGPSRDF-PSGSPKWRGAKGSVYEGGHRVPAIAWWPGKIQAGTETDVPAISLDVMP 352 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH---YFLNGKLAAVRM 468 T L +A +PK +DGVD + S R + A+R Sbjct: 353 TLLGIAHID-------MPKERPLDGVDLSPVLFEQKPLSERPLFWASLSNNGSRSEAMRA 405 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 +K V + + ++ L DP E++++ + Sbjct: 406 GPWKLVVQHPRA------------KPGTFENEKVELYRLDQDPGEANNLSKAEPQRASRM 453 Query: 529 QTEMHAYMEILKKYPPRAQ 547 ++ + + + Sbjct: 454 LKQLKDWYQDTQNTATSQP 472 >UniRef50_A0Z6R0 Putative arylsulfatase n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z6R0_9GAMM Length = 466 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 134/441 (30%), Positives = 201/441 (45%), Gaps = 40/441 (9%) Query: 82 TGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + +KP NVV+ L+D+ G+ ++G GGGV G PTP ID++A +GL LT+ + +P+R Sbjct: 22 SAEKPANVVLVLMDNFGYGEIGVYGGGVMRGAPTPRIDSIAKEGLQLTNFNVEAECTPSR 81 Query: 141 ATILTGQYSIHH--GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 + ++TG+Y I PP G + TL +LL D GY T GKWH+G+ + Sbjct: 82 SALMTGRYGIRTRQRANQPPRGVWYGITKWEVTLAELLSDAGYATGIFGKWHLGDTEGRY 141 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH---AVRGG 255 P + GFD++ G SD A PD + + S H A +G Sbjct: 142 PTDQGFDEWIGLPRSSD-------------RAFWPDSNSFQPNSHPSAKFTHVMSASKGE 188 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + A +D+ D + F+ +M+ KPFF Y H P+ + G Sbjct: 189 QPVEGAVYDRAKRAIIDREITDQAIDFMTRMSGKGKPFFAYLPYTQTHEPVDPHPDFYG- 247 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGA 374 S S+ D + + + L T+E G ++T+ +FTSDNG E G T P+R Sbjct: 248 STGNGSFADVLAQTDVYVGELLDTVESLGIREDTIFIFTSDNGREGVPRSFGFTGPWRSG 307 Query: 375 KGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 S +EG +RVP V W G I P R S+ IV D+F T G +P Sbjct: 308 MFSPYEGSLRVPFLVRWPGKIPPGRVSNEIVHQMDVFSTVASFTGVD-------IPTDRV 360 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 IDGVDQ++FF G +S R + ++ L + +K + Y Sbjct: 361 IDGVDQSNFFRGKTEKSARDSLVIYIGNTLFGAKWRNWKILLREMDEDGYG--------- 411 Query: 494 GTVMQTAGSSVFNLYTDPQES 514 + + A SV+NL DP+E Sbjct: 412 --IKEMAYPSVYNLIVDPKEE 430 >UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZLN5_9PLAN Length = 468 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 123/515 (23%), Positives = 198/515 (38%), Gaps = 111/515 (21%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A +K+ + P++V+ + DD G+ D+ G G TP +D +A+ G LTS Y P Sbjct: 22 AVAAEKSKRPPSIVLIVSDDQGFADLSCIGDN---GCRTPRLDQLAASGTRLTSFYVSWP 78 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQP-------------------GGLQGLTTLPQL 175 + +P+RA+++TG+Y +G P G L + Sbjct: 79 ACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADV 138 Query: 176 LHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPD 234 L GYV+ GKW G+ K P GFD + GF N+ D +T R + P + Sbjct: 139 LKQAGYVSAVFGKWDGGQLKRYLPLQRGFDQYYGFANTGVDYFTHER--YGVPSM----- 191 Query: 235 RSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFF 294 + Q + Y+ DL +R ++F+D+ D+PFF Sbjct: 192 -------------------FRDNQPTEEDKGTYLTDLFER---EAIRFIDE--NHDRPFF 227 Query: 295 LYYGTRGCH-------------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFAN 335 LY H D++P + R +Y + M++ Sbjct: 228 LYLPFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESK-QEKRRQAYLAAVERMDEAIGK 286 Query: 336 LYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI 395 + L+++ DNTLI+F SDN +P RG K +EGG RVP V+W G + Sbjct: 287 VVDQLQQHQIADNTLIIFLSDN---GGGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKV 343 Query: 396 QPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 K S+ + ++FPT + G +P DG D G + S R+ Sbjct: 344 PAGKVSNQFLTSLEVFPTVIAAIG-------GKLPDDVIYDGFDMLPVLNGAS--SPREE 394 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 + G +AA R+ ++K+ V AG +F+L D E Sbjct: 395 MFWKRRGDVAA-RVGDWKW----------------------VDSAAGKGLFDLAHDIGEK 431 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 + H M L+ A+ ++ PR + Sbjct: 432 KDLSKEHPEMLAKLKARFDAWTAEMEAADPRGPFR 466 >UniRef50_C6Y1U6 Sulfatase n=2 Tax=Sphingobacteriales RepID=C6Y1U6_PEDHD Length = 523 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 126/521 (24%), Positives = 191/521 (36%), Gaps = 76/521 (14%) Query: 78 LEKKTGKK-PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 L+ + KK PN+V L DD+G+ D+ G V TP ID +A QG+ T A++ Sbjct: 30 LQAQQQKKLPNIVYILADDLGYGDIKIYNAGAKVN--TPHIDKLAEQGMRFTDAHTTSSV 87 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +P+R +ILTG+Y + + + G L +GL T+ LL Y T IGKWH+G Sbjct: 88 CTPSRYSILTGRYPWRSRLPVGVLRGYSRTLIEEGLPTVAGLLKTSSYRTAVIGKWHLGL 147 Query: 194 N-------------------------------------KESQPQNVGFDDFRGFNSVSDM 216 + P+ GFD + DM Sbjct: 148 DWMPKEAFKDSINPAFNKDRLYGITDEMNPDQIDFGRAPVRGPRTQGFDYSYVLPASLDM 207 Query: 217 YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWM 276 ++ + P S R G + D + + Sbjct: 208 PPY---AYLENDQLTEPLTGYTPGNKLASGYTGPFWRAGLKSPSFDFYG-----VLPAFT 259 Query: 277 DYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANL 336 + F+ K A + PFFLY+ H P A+Y G S A YGD + E++ + Sbjct: 260 NKATDFIKKEAATKNPFFLYFPMPAPHTPWMPTAEYRGKSQA-GEYGDYLQEVDAAVGKI 318 Query: 337 YKTLEKNGQLDNTLIVFTSDNGPE------AEVPPHGRTPFRGAKGSTWEGGVRVPTFVY 390 + L+ G NTL+VFTSDNGP + H PFRG KG +EGG RVP V Sbjct: 319 LQVLDSLGLSKNTLVVFTSDNGPYWRDDFVQQYGHHAAGPFRGMKGDAYEGGHRVPFIVR 378 Query: 391 WKGMIQPRKSDGIVDLA-DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT-NG 448 + G ++ + +L T DL G+ + D LG G Sbjct: 379 YPGKVKAGTISNVTTTLANLMATCADLTGNHAVQFETE-------DSYSILPVLLGKAAG 431 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLY 508 + + A + +R +K + S A ++NL Sbjct: 432 IAEQPAIVNISSKGFYDIRKGPWKLITGLGSGGFSVPS-----IVKAPEGQAAGQLYNLD 486 Query: 509 TDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 TD +E ++ R+ L + +K P + K Sbjct: 487 TDIKEETNLYSRYPEKVKELSALLEK----IKAAPKGKRAK 523 >UniRef50_Q96EG1 Arylsulfatase G n=22 Tax=Euteleostomi RepID=ARSG_HUMAN Length = 525 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 131/484 (27%), Positives = 208/484 (42%), Gaps = 42/484 (8%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSP 138 K G+KPN V+ L DD+GW D+G N A T ++D +AS+G+ ++ S SP Sbjct: 30 KTRGQKPNFVIILADDMGWGDLGAN---WAETKDTANLDKMASEGMRFVDFHAAASTCSP 86 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 +RA++LTG+ + +G+ GGL TTL ++L GYVT IGKWH+G + Sbjct: 87 SRASLLTGRLGLRNGVTRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSY 146 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD + G DM + +P P + L A+ E Sbjct: 147 HPNFRGFDYYFGIPYSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYEN 206 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 I + P + L Q++ + +F+ + + S +PF LY H P + + Sbjct: 207 LNIVE-QPVNLSSLAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVP-LPVTQLPAAPR 264 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP--HGRTPFRG-- 373 R+ YG + EM+ + + ++ +NT + FT DNGP A+ PF G Sbjct: 265 GRSLYGAGLWEMDSLVGQIKDKVDHT-VKENTFLWFTGDNGPWAQKCELAGSVGPFTGFW 323 Query: 374 --------AKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKV 424 AK +TWEGG RVP YW G + S ++ + D+FPT + LA Sbjct: 324 QTRQGGSPAKQTTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALA------- 376 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-----GKLAAVRMDEFKYHVLIQQ 479 +P+ DGVD + G + +R H G L VR++ +K + Sbjct: 377 QASLPQGRRFDGVDVSEVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGG 436 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI---GVRHIPMGVPLQTEMHAYM 536 A G TG +Q +FNL D E+ + G + + ++ + + Sbjct: 437 ARA------CDGSTGPELQHKFPLIFNLEDDTAEAVPLERGGAEYQAVLPEVRKVLADVL 490 Query: 537 EILK 540 + + Sbjct: 491 QDIA 494 >UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J5W3_9FLAO Length = 468 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 108/493 (21%), Positives = 177/493 (35%), Gaps = 83/493 (16%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 E K KKPN+V L DD+G+ ++G GG + TP+ID +A +G+ ++ Y Sbjct: 20 AQETKNTKKPNIVFILADDMGYNELGSYGGKII---ETPNIDQLAKEGMKFSNHYCGSNI 76 Query: 136 SSPTRATILTGQYSIHHGILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 +P+R T++TG+++ H I P G T+ ++L GY T A GKW +G Sbjct: 77 CAPSRGTLMTGKHTGHAYIRDNKPLPYEGNEPIPASEITVAEILKTAGYTTGAFGKWGLG 136 Query: 193 E-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 E P N GFD F G+N + + + Sbjct: 137 YPASEGSPNNQGFDQFYGYNGQIHAHNYFTS---------------------------YL 169 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY-PNA 310 + + A+I Y D ++F++ + PFFLY+ H + P+ Sbjct: 170 RKNDLVELNANIDAPYSVYSADIIKDRALEFVE--VNKNNPFFLYFCPTLPHNPYHQPDD 227 Query: 311 KYAGSSPART---------------SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 K +T Y ++ + L++ LDNTLI+F S Sbjct: 228 KTLEYYAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFAS 287 Query: 356 DNGPEAEVPPHG----RTPFRGAKGSTWEGGVRVPTFVYWKGM-IQPRKSDGIVDLADLF 410 DNG RG K +EGG++ P +WKG I S+ I D Sbjct: 288 DNGSALTKEEDSYLRTGGDLRGRKSEVYEGGIKSPLIAFWKGKIIPGSSSNHISAFWDFL 347 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470 PT ++ IDG+ LG + Y+ + A+R + Sbjct: 348 PTCAEIVKAKTPDN---------IDGISYLPTLLGKTDNQKQHDYLYWERSQSQAIRKGD 398 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 K + + + Q ++NL DP E +++ + Sbjct: 399 MKANFVYDKT----------------SQKQNIEIYNLAQDPFEKNNLAETMPELKAEFIK 442 Query: 531 EMHAYMEILKKYP 543 + +P Sbjct: 443 IAQTARVESEIFP 455 >UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT Length = 493 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 113/508 (22%), Positives = 199/508 (39%), Gaps = 81/508 (15%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS- 132 + +E + + PN++ L DD+G+ ++G G TP++D +A+ G+ T Y+ Sbjct: 15 SCSTVENQKDQPPNIIYILADDLGYGELGSYGQKKI---KTPNLDRLAADGMRFTQHYTG 71 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQP---------GGLQGLTTLPQLLHDQGYVT 183 P +P+R LTG ++ H I GQ + TL ++L GY T Sbjct: 72 APVCAPSRYMFLTGNHAGHAYIRGNYELGQFSDEMEGGQMPIPETTPTLAKMLKKAGYQT 131 Query: 184 QAIGKWHMGENKES-QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 IGKW +G N+ + P GFD + G+ + + P D+ + + Sbjct: 132 AMIGKWGLGMNETTGSPLLHGFDYYYGYLDQKQAHNYY------PTHLWENDKKDPLNND 185 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 F VH+ + K E R ++ ++FLD A SDKP+FLYY + Sbjct: 186 YF---LVHSPISSKANQSDFDQFKGQEYAPDRMLEKAIQFLDTTA-SDKPYFLYYPSPIP 241 Query: 303 HF-------------------DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 H N Y +Y + ++ ++ ++++ Sbjct: 242 HVSLQVPDSLVDQYRDVFEEEPYLGNKGYTAHQFPNAAYAAMITHLDSEVGKIWDSVKEK 301 Query: 344 GQLDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR 398 GQ +NTLI+F+SDNGP + RG K +EGG+R+P YWKG I+ Sbjct: 302 GQEENTLILFSSDNGPTFAGGVDPDFFNSAAGLRGLKMDVYEGGIRIPFIAYWKGKIKAG 361 Query: 399 KSDGIVD-LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH- 456 ++ D+F T +LAG + DG+ LG + + Sbjct: 362 SISDLISGHWDMFNTFAELAGQDQSAP----------DGISILPELLGESQNETHDYIYF 411 Query: 457 -YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 Y A+R++++K + + + ++NL TD E Sbjct: 412 EYPEKRGQIALRIEDWKGVKVEMKTNL----------------DSKWELYNLKTDRNEVF 455 Query: 516 SIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++ H + ++ + + ++P Sbjct: 456 NVAAEHPEIV----NKIDSLHKTAHRHP 479 >UniRef50_Q7UIN1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UIN1_RHOBA Length = 554 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 128/552 (23%), Positives = 204/552 (36%), Gaps = 73/552 (13%) Query: 42 HPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEK---KTGKKPNVVVFLLDDVGW 98 H L + ++ + + T++ +A+ T +PNV++ DD G+ Sbjct: 11 HSRLRLSQSNLSLRAIAILAVGLVCLSVSTRRAVAQQNPVIGSTDTRPNVIIVYTDDQGF 70 Query: 99 MDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMP 157 DV TP++D +A +GL T+A+S +P+R +LTG+YS + Sbjct: 71 GDVSSMNPD--AKFETPNMDRLAKEGLTFTNAHSSDSVCTPSRYGLLTGRYSWRTTLKRG 128 Query: 158 PMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV------------- 202 M + L TL L D+GY T +GKWH+G P+ Sbjct: 129 VMNAEGKCLIADDRMTLASFLRDEGYQTGMVGKWHLGMQFPGSPKKRDWSQPVRDMPLDK 188 Query: 203 GFDDFRGFNSVSDM----YTEWRDVHVNPEVALSPDRS----EYIKQLPFSKDDVHAVRG 254 GFD F G + + + + R V P+ + +Y P+ + + A + Sbjct: 189 GFDHFFGIPASLNYGVLAWFDGRHAAVPPKSWTGKKPNKRHVDYRIMPPYQETETEARKR 248 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKM--------AKSDKPFFLYYGTRGCHFDN 306 + I R+ D ++++ + A + PFFLY H+ Sbjct: 249 FKNTTIEVADDFVDNQCLTRFTDEAIEWITEATATPGNESASNAPPFFLYLPLTSPHYPV 308 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE------ 360 P +Y G YG+ M+E + L K LE NG DNTL++ TSDNGPE Sbjct: 309 CPLPEYWGQGDC-GGYGEFMIETDHHLGRLLKHLEANGLTDNTLVILTSDNGPEKSWKQR 367 Query: 361 -AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLA 417 + H +RG K +EGG RVP W I+ R SD +V DL T +L Sbjct: 368 IDDFGHHSNGSYRGGKRDIYEGGHRVPMLARWPNGIKQPGRISDALVGQVDLLATVAELL 427 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 G P +P D S L + + +R A+ ++K Sbjct: 428 GRP-------LPDEAAEDSHSFASILLDPSYEHHRVPLINHGVRGEFAITAGDWK----- 475 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + ++NL DP ES I H + L+ + + Sbjct: 476 --------------WIAPRRDNDEGELYNLANDPSESQDISSDHPTVVRRLRNALTKIVV 521 Query: 538 ILKKYPPRAQIK 549 + Q Sbjct: 522 NGRSTSGDPQPN 533 >UniRef50_Q1YP24 Arylsulfatase A n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YP24_9GAMM Length = 502 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 120/499 (24%), Positives = 201/499 (40%), Gaps = 51/499 (10%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 Q+ + K KPN ++ DD+G+ D G G + TP ID +AS G T+ Sbjct: 20 AQESKQQAPNKHKAKPNFILVYTDDMGYSDAGPFGNPLI---ETPAIDRLASSGQTWTNF 76 Query: 131 YSQ-PSSSPTRATILTGQYSIHHGILMPPM-----YGQPGGLQGLTTLPQLLHDQGYVTQ 184 Y+ P +P+R +LTG+ + G+ + + G + TTL ++ D Y T Sbjct: 77 YAAAPVCTPSRGALLTGKLPVRTGLYGDNINVFFPGSKKGMPENETTLAEVFQDNQYATG 136 Query: 185 AIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK---- 240 GKWH+G+ P GF+++ G +DM +W + P + K Sbjct: 137 MFGKWHLGDATGFYPTRHGFNEWLGIPYSNDM--DWEVEGITSSNIFFPAQDIMAKYGTV 194 Query: 241 ------------QLPFSKDDVHAVRGGEQQAIADI--TPKYMEDLDQRWMDYGVKFLDKM 286 + +H+ + + + + P + +R+ ++F+ + Sbjct: 195 SPVLQRQIFQPEINDWQVPLIHSRKLADGRFVDHEIQRPADQTLITRRYTTESIRFMREA 254 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 + KPFF+Y H + +A++AG S A YGD + E++ + + Sbjct: 255 VTAQKPFFIYLAHSMPHVPLFRSAEFAGKSKA-GIYGDVIEEIDWSLQKIIAATQALAID 313 Query: 347 DNTLIVFTSDNGPEAEVPPHGR--TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIV 404 DNT IVFTSDNGP H TP R KG+T++GG+RV T I D + Sbjct: 314 DNTYIVFTSDNGPWLIYGTHAGTATPLRDGKGTTFDGGMRVMTVFSGPD-IHQGIIDDLG 372 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 DLF T LAG +TT D VD + S R + ++ +L Sbjct: 373 SQTDLFATFTALAGFGS--------QTTAADSVDLSHTLRNGQ-PSPRTSIPFYSGSELR 423 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 A R + K H + Q Y G + + +L D E+++I + Sbjct: 424 AFRYQDHKVHFVTQGAY---------GMKPAREVHQPAMLIDLKADVGEANNIAKNNPQR 474 Query: 525 GVPLQTEMHAYMEILKKYP 543 + + + + + + P Sbjct: 475 VLEVVQQAETFKQSITVAP 493 >UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica RepID=Q7UYW3_RHOBA Length = 520 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 123/497 (24%), Positives = 187/497 (37%), Gaps = 87/497 (17%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPS 135 PN+VV L DD+G+ D+G G TP++D +A G++ + AY + Sbjct: 47 ASRAAESTPPNIVVILADDMGYGDMGCMGSQTL---QTPNLDRLAESGVLCSQAYVASAV 103 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQP---------GGLQGLTTLPQLLHDQGYVTQAI 186 SP+RA +LT + G G TL L GY T I Sbjct: 104 CSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAGYATALI 163 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 GKWH+G + P GFD F G + S Y HV Sbjct: 164 GKWHLGMGEMHHPNRRGFDHFCGMLTGSHHYFPATMKHV--------------------- 202 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM--AKSDKPFFLYYGTRGCHF 304 R G++ + D + +Y+ D + D G++F+D+ A D+P+F+++ H Sbjct: 203 ----IERNGKR--VDDFSSEYLTDF---FTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT 253 Query: 305 DNYPN----AKYAG-SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 + A++A + R +Y M ++ + + LE+ GQ +NTL+VF SDNG Sbjct: 254 PMHATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGG 313 Query: 360 EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAG 418 P RG KGS EGG+RVP W DG+V DL PT AG Sbjct: 314 ATNNGSW-NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAAG 372 Query: 419 HPGAKVANLVPKTTFI------------DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAV 466 +A+ + DG+D + NR+ Y+ AA+ Sbjct: 373 AEPLALADPMSHEDASNRKRMNRLSGTHDGIDMAPHLADGSEPPNRR--LYWRLQGQAAI 430 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 K +P +F + TD ES + ++ Sbjct: 431 LDGTDKLLRPSHRPA---------------------ELFEVSTDVSESHDLSAQNPSRFR 469 Query: 527 PLQTEMHAYMEILKKYP 543 L E+ A+ +L P Sbjct: 470 ELYDELGAWESMLTTVP 486 >UniRef50_UPI000180BD6E PREDICTED: similar to arylsulfatase n=1 Tax=Ciona intestinalis RepID=UPI000180BD6E Length = 501 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 121/490 (24%), Positives = 202/490 (41%), Gaps = 58/490 (11%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 L L + +PN V+ DDVG+ D G P ID +A++G+ T YS Sbjct: 10 LIILANQVLSRPNFVLIFADDVGYGDFQSYGHPTQERGP---IDDLAAEGMRFTQWYSAA 66 Query: 135 S-SSPTRATILTGQYSIHHGILMPPMYGQP----GGLQGLTTLPQLLHDQGYVTQAIGKW 189 S +P+RA +LTG+ IH G++ P G + TTL + L + GY T +GKW Sbjct: 67 SLCTPSRAALLTGRLPIHSGMVGPTRVLHQNDAGGLPKNETTLAEALKELGYKTGMVGKW 126 Query: 190 HMGENK------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 H+G N+ P++ GFD F G + + + P Sbjct: 127 HLGINELKQNDGRHLPKHHGFD-FVG-----------------TNLPFTFHLFCSPSEYP 168 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 K + + + I P E L + ++ +F+ + PFFLY H Sbjct: 169 VDKMKIKCFLSNKDEIIE--QPIIPEKLTDKIVEGAKQFITE--NQKNPFFLYLSLPQTH 224 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + ++ S R SYGD + EM+ + L+ NTL++F SD+GP E Sbjct: 225 VAMFCKEEFCNKS-MRGSYGDNVNEMSWAVGEVVNQLKDLNLDQNTLVMFLSDHGPAVEF 283 Query: 364 P--PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPG 421 +G K S+W+GG++VP +W G IQP +V D+FPT L LAG+ G Sbjct: 284 CYTGGSTGGLKGGKASSWDGGIKVPAVAWWPGTIQPGVKTQVVSTMDIFPTFLQLAGNEG 343 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 +DG+ + L + + ++ + +L AVR +K H Q + Sbjct: 344 NNGN--------LDGMSISDLLLSNHDNEVHEILFHYCSDRLMAVRYGRYKIHFHTQHLH 395 Query: 482 AYTQSGYQGGFTGTVMQ----------TAGSSVFNLYTDPQESDSI-GVRHIPMGVPLQT 530 + + G ++ +F++ TDP+E + + ++ Sbjct: 396 VFNSNCIDGKALENIVDYFDCYANTTTHNPPLIFDINTDPEELFPLEAAPRAHIIEEVEK 455 Query: 531 EMHAYMEILK 540 ++ + + +K Sbjct: 456 QVAKHQKTIK 465 >UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA9_RHOBA Length = 474 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 117/486 (24%), Positives = 179/486 (36%), Gaps = 67/486 (13%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 E PNV++ + DD GW DVGFNG V TP++DA+AS G+ Y+ P Sbjct: 25 AETTDTNSPNVILLMSDDQGWGDVGFNGNEVV---QTPNLDAMASAGVRFDRFYAAAPLC 81 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 SPTR + LTG+Y GIL G G T+ ++L +GY T GKWH+G K Sbjct: 82 SPTRGSCLTGRYPFRFGILAAHTGGM---RVGEITIAEMLQKRGYATGMFGKWHIGWVKP 138 Query: 197 ---------SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 S P + GFD++ S + P + P+ Sbjct: 139 DEVSTRGFYSPPSHHGFDEYFATTSAVPTWDPTITPQDWDSWGNGPGEP-WKGGFPY--- 194 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 VH R ++ D + + MD + F++ A KPFF H Sbjct: 195 -VHNGREAKENLSGDDS--------RVIMDRVIPFIE--ANQAKPFFATVWFHAPHEPVV 243 Query: 308 PNAKYAGSSPA----RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 ++ P R +Y C+ M+ L L + G NT++ F SDNGP + Sbjct: 244 AGEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKLRELGIEKNTVVFFCSDNGPSDGL 303 Query: 364 PPHG---RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGH 419 G PF+G K + +EGG+ VP W G I S + D PT + G Sbjct: 304 AKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPAGTSTEVRCSTVDFLPTVASIVGD 363 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL----AAVRMDEFKYHV 475 + A T IDG+D G +R + ++ ++K Sbjct: 364 SMVQKA-----TRPIDGIDLMPLIRGEAKDRDRDLFFGYRRLYQGIDGQSIISGDWKL-- 416 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 + +++L DP E+ + L+ ++ Sbjct: 417 -----------------LQEAKKNGRLRLYDLSKDPFETQDLSEEMPEQTEQLRKQLEEL 459 Query: 536 MEILKK 541 ++ Sbjct: 460 QASCQR 465 >UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKN7_9BACT Length = 465 Score = 380 bits (977), Expect = e-104, Method: Composition-based stats. Identities = 111/480 (23%), Positives = 200/480 (41%), Gaps = 51/480 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 + +K N+++ DD+ + +G G V TP ID++ ++G+ + Y+ + +P+R Sbjct: 16 SAEKTNIILIFADDMHYGALGVTGS-VLTKAKTPAIDSIFNEGVHFPNGYASHATCAPSR 74 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQG------LTTLPQLLHDQGYVTQAIGKWHMGEN 194 A +LTG+Y + P G +P L+ GY T AIGKWH+G + Sbjct: 75 AGLLTGRYQARFDLETLPGGTADRKKTGYGVKTSEIMIPALMKKGGYQTCAIGKWHLGSS 134 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD-DVHAVR 253 +E QP GFD + G+ Y V S + + +K LP +D ++ VR Sbjct: 135 EEFQPNARGFDHWFGYRGSCGFYQFKSQVQ-------SAKKGQELKPLPSGEDPNLDVVR 187 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 GE + L + D ++ + ++PFF+Y+ H + KY Sbjct: 188 NGESVRLEGY-------LTDHFSDEAANWIKE--NKERPFFMYFAPYNVHAPDTVPNKYI 238 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 T++ + ++ + L++ G DNTL+VF++DNG + + F+G Sbjct: 239 PK--GGTAHDGVIAALDASVQTILDALKEAGIADNTLVVFSNDNGGKKDYSKT----FKG 292 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K + +EGG+RVP + W I+ K +G+V DL PT LA +P Sbjct: 293 NKATFYEGGIRVPFAMRWPKGIEAGSKYNGVVSTLDLLPTFAALAKVD-------LPSDR 345 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG-- 490 DG + + +++ H++ NG R+ ++K + + G Sbjct: 346 VYDGQNLLPVI--KDSAKDQRQAHFWRNGAWRTARVGDWKLVWQVDRKKQKALLNKLGIK 403 Query: 491 ---GFTGTVMQTA-----GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 G T + A ++NL DP+E ++ + + + + K+ Sbjct: 404 HVKGRGVTYAERADELFLEPELYNLANDPKEESNLAQSNPEKLQEMVKIYKDWEASIPKW 463 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 380 bits (976), Expect = e-104, Method: Composition-based stats. Identities = 110/519 (21%), Positives = 184/519 (35%), Gaps = 98/519 (18%) Query: 47 LVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGG 106 L A + N+ K++Q + + PN+V+FL DD GW D+ NG Sbjct: 7 LFLLACILTGNLTASENKNPPHKKSQTR---PATQATTHPNIVIFLADDQGWGDLSHNGN 63 Query: 107 GVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGL 166 TP++D++A +G+ Y +PTRA LTG+Y G + GQ Sbjct: 64 ---TNLHTPNVDSLAKEGVKFNRFYVGAVCAPTRAAFLTGRYHARTGTIG-VSTGQERFN 119 Query: 167 QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVN 226 T+ Q GY T A GKWH G + P GFD++ GF S + + ++ Sbjct: 120 SDEYTIAQAFKAAGYATGAFGKWHNGTQYPNHPNAKGFDEYYGFTSGH--WGHYFSPMLD 177 Query: 227 PEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM 286 ++K + DD D + F+++ Sbjct: 178 -------HNGTFVKGNGYITDD--------------------------LTDKAMAFIEQQ 204 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKY-----------------AGSSPARTSYGDCMVEM 329 ++ KPFF Y H +Y + + Sbjct: 205 VQNHKPFFAYLPYCTPHSPMQVPDQYWDRFKDKQLKLHNREPDREQPDHLRAALAMCENV 264 Query: 330 NDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFV 389 + + K L D+T++++ SDNGP +G KGS EGGVR P + Sbjct: 265 DWNVGRVLKKLNSLRITDDTIVIYFSDNGPNGVRW---NGDMKGKKGSLDEGGVRSPFVI 321 Query: 390 YWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 W G + + + I DL PT DLAG P+ IDGV L + Sbjct: 322 RWPGHLPAGQEVNQIAGAIDLLPTLTDLAGIKR-------PEPKPIDGVSLKPLMLNSKA 374 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLY 508 + L ++ +VR D+++ +++++ Sbjct: 375 DWPERMIFSSLRNRV-SVRTDQYRLSR-------------------------KGELYDMH 408 Query: 509 TDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 DP + ++I + + LQ + + + + +P Sbjct: 409 ADPGQRNNIAKQKPEITAKLQQAVTDWRQSV--WPNGYP 445 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 380 bits (976), Expect = e-104, Method: Composition-based stats. Identities = 112/479 (23%), Positives = 180/479 (37%), Gaps = 90/479 (18%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 ++++T + PNV+V L DD GW D G TP D + +G +L Y P + Sbjct: 32 VQEQTQRPPNVIVILTDDQGWGDFSHTGNEYL---KTPHFDKMTEEGALLDQFYVSPVCA 88 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 PTRA++LTG+Y + G+ G+ T+ ++ + GY T GKWH G + Sbjct: 89 PTRASVLTGRYHLRTGVSF-VTRGRENMRSEEVTIAEVFKEAGYATGCFGKWHNGAHYPE 147 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 PQ GFD F GF S W + D GE Sbjct: 148 NPQGQGFDTFLGFTSG-----HWSNYF-----------------------DTELEYNGEM 179 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY----- 312 ++ + MD ++F+D A D+PF + H KY Sbjct: 180 KSTKGF-------ITDVLMDETIQFID--AHKDEPFLAFVPLNAPHTPYQVPDKYFDKYK 230 Query: 313 -------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 + + ++D L K L+ +NT++VF SDNGP+ Sbjct: 231 DIDFGYDKKQNKKIATIYGMCENIDDNLGKLMKHLKDQELEENTIVVFLSDNGPQG---A 287 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVA 425 P+RG K S EGG VP + WKG I + DL PT + LAG Sbjct: 288 RYNGPWRGGKTSVHEGGTLVPCAIQWKGHIPNSSKSSLTAHIDLMPTLMGLAGIEK---- 343 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-----KLAAVRMDEFKYHVLIQQP 480 P+ DG+D +++ +GT+ + + + AVR ++++ Sbjct: 344 ---PENIQFDGIDLSNYLMGTSDDLGERNLYTHMTNFEITADRGAVRQGDYRF------- 393 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + ++NL DP E +++ + L+T + + + Sbjct: 394 ---------------TTEYGDVGLYNLKEDPSEENNLKDQLPEKTQELKTAFENWYKDV 437 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 380 bits (976), Expect = e-104, Method: Composition-based stats. Identities = 127/496 (25%), Positives = 178/496 (35%), Gaps = 96/496 (19%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 L KPNVV+ ++DD+G D+G G TP+ID +A G+ T Y+ P Sbjct: 14 ALPASAAGKPNVVLIVIDDLGQRDLGCYGSTF---YKTPNIDRMAKDGVRFTDFYAACPV 70 Query: 136 SSPTRATILTGQYSIHHGI--LMPPMYGQPG-----------GLQGLTTLPQLLHDQGYV 182 SPTRA+I+TG+Y GI +P PG T+ + L GYV Sbjct: 71 CSPTRASIMTGKYPQRVGITDWLPGRKDLPGQRLKRPELKNELALEEVTVAETLKGHGYV 130 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 T IGKWH+G K +P+ GFD D P +P ++ + Sbjct: 131 TAHIGKWHLG-GKGFEPEKQGFD-----------VNVAGDHTGTPLSYFAPFANKAGATM 178 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 P G ++A D E L R F+ A DKPFFLY G Sbjct: 179 P-----------GLEKAAPD------EYLTDRLAAEAETFIT--ANKDKPFFLYLPHYGV 219 Query: 303 HFDNYPNAKYAGSSPARTS--------YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFT 354 H + Y + M+ + K L+ DNTL++FT Sbjct: 220 HTPLRAPQPLVDKYKTQAVHGRQSNPVYAAMVESMDAAVGRVLKRLDDLKLSDNTLVLFT 279 Query: 355 SDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLAD 408 SDNG A + P P R KG +EGGVRVP W G ++P D + D Sbjct: 280 SDNGGLATLEGMPFAPTINAPLREGKGYLYEGGVRVPLIAKWPGKVKPGTVMDQVACSID 339 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL----- 463 F T L+ G A DGV F G + HY Sbjct: 340 FFDTILEATGATSAARR---------DGVSLVPAFGGEKLKPRALYWHYPHYANQGSRPG 390 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 AVR +K + +F++ D ES ++ Sbjct: 391 GAVRAGNYKLV--------------------EYYEDGRRELFDVAKDLSESRNLAADKPD 430 Query: 524 MGVPLQTEMHAYMEIL 539 + L ++ A+ + Sbjct: 431 VVKDLAAKLDAWRTDV 446 >UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR48_9BACT Length = 508 Score = 380 bits (975), Expect = e-104, Method: Composition-based stats. Identities = 121/502 (24%), Positives = 193/502 (38%), Gaps = 65/502 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 AE +KPNV+ F+ DD+G+ DVG G TP+ID +A++G+ T YS P Sbjct: 28 AEPSPMPLRKPNVIFFIADDLGYADVGCFGQKKIH---TPNIDRIATEGMKFTQHYSGSP 84 Query: 135 SSSPTRATILTGQYSIHHGILMPPM---YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 +P+R ++TG++S H + GQ T+ +LL GY+T A GKW + Sbjct: 85 VCAPSRCVLMTGKHSGHSAVRDNRELKPEGQFPLPANTITVARLLQQNGYITGAFGKWGL 144 Query: 192 G-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 G +P + GF F G+N + ++ P + + P +D Sbjct: 145 GGPESSGKPLDQGFTRFFGYNCQRVAH------NLFPTYLWDDNHRLALDNPPIGEDQKL 198 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----DN 306 + + + ++F+ D PFFL++ T H Sbjct: 199 PADADSNDPASYKAFTGKSYAPDLYAEQALRFIRD--NKDHPFFLFFPTIVPHVALQVPE 256 Query: 307 YPNAKYAGSSPA---------------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 +Y G P +Y + M+ + +++ D+T+ Sbjct: 257 DSLKEYEGKLPETPYTGGKGYLPNRTPHAAYAAMITRMDRDLGRMLALIKELNLDDDTIF 316 Query: 352 VFTSDNGPEAEVPP-------HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGI 403 VFTSDNGP + + PFR K S +EGG+R+P V W G IQP SD + Sbjct: 317 VFTSDNGPAPQDMGGTDTKFFNSSGPFRSGKTSIYEGGMRIPLIVRWHGKIQPNSTSDRV 376 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNG 461 D PT L+L+G+ + IDG+ S LG R + + G Sbjct: 377 TGFEDWLPTLLELSGNKKSVPTG-------IDGLSFASTLLGEK-LPERPFLYREFPAYG 428 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 A+R+ +K +P + +++L TD ES + H Sbjct: 429 GQQAIRVGNWKAVRQHLKPKGNAKPNLH------------IELYDLQTDIAESHDVSDEH 476 Query: 522 IPMGVPLQTEMHAYMEILKKYP 543 + L M K +P Sbjct: 477 PDIVTKLDNLMREQHIPSKAFP 498 >UniRef50_A6DMX8 Iduronate-sulfatase or arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX8_9BACT Length = 532 Score = 380 bits (975), Expect = e-103, Method: Composition-based stats. Identities = 118/512 (23%), Positives = 195/512 (38%), Gaps = 58/512 (11%) Query: 61 VMQHPAQDKETQQK-LAELEKKTGKK--PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 + Q ++ E L E+ KT + PN+V+ DD+G+ D+ G A TP+I Sbjct: 25 IAQTQSKSAEAPVSVLNEMRPKTTQSEYPNIVLIYADDLGYGDLSSYG---ATKIKTPNI 81 Query: 118 DAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQL 175 D +A G++ T +S + +P+R +LTG+Y + P + TT+ L Sbjct: 82 DRLAKNGILFTDGHSTSATCTPSRYALLTGEYPLRINNYSPVFCADRLIIDTKKTTIASL 141 Query: 176 LHDQGYVTQAIGKWHMGENKESQP----------QNVGFDDFRGFNSVSD-----MYTEW 220 L +GY T +GKWH+G + +P +GFD F G V+ Sbjct: 142 LKRKGYTTACVGKWHLGFGDKPKPDWNKELKPGPLELGFDYFFGLPVVNSHPPFVYMENR 201 Query: 221 RDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMED--LDQRWMDY 278 R + ++P L+ R + RG + D + ++ Sbjct: 202 RILGLDPNDPLTYKRGGKTYGKAYVGKHTSPHRGMPSVIGGKVAHDLYVDELIGEKLTQK 261 Query: 279 GVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYK 338 + ++++ DKPFFLYY + H P+ + G S GD + E++ + Sbjct: 262 ALTWMNQ---QDKPFFLYYASHNVHLPITPHPYFHGKSEC-GLRGDFVEELDWSVGQIIS 317 Query: 339 TLEKNGQLDNTLIVFTSDNGPEAEVPPHG----------RTPFRGAKGSTWEGGVRVPTF 388 +E+ G L+NT+ +FTSDNG + G +G K WE G RVP Sbjct: 318 AVERFGALENTIFIFTSDNGAIIKGKDQGDILDQLGHKPNGKLKGRKFGAWEAGHRVPFI 377 Query: 389 VYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 V W I SD ++ DL PT + G A DG +Q LG + Sbjct: 378 VSWPNKIPAGKTSDALIANLDLLPTFAAITGQKLAPHEAR-------DGFNQLPLLLGKD 430 Query: 448 GQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 S R ++R ++ Y GG+ ++N Sbjct: 431 TTSARSELIIQPHKRSHKSLRQGDW----------VYIPGAGDGGWVPAKKGELPKQLYN 480 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 L DP + + + + ++ M+ Sbjct: 481 LKDDPYQQQNRINDFPERADAMASHLNKLMKQ 512 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 380 bits (975), Expect = e-103, Method: Composition-based stats. Identities = 108/528 (20%), Positives = 181/528 (34%), Gaps = 94/528 (17%) Query: 67 QDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLI 126 + + T +PN+V+ + DD+G+ ++G G TP +D +A++G+ Sbjct: 55 DSTTVSAEEPNAKDATVSRPNIVLIVADDLGYGELGCYGQTKI---RTPRLDQLAAEGIK 111 Query: 127 LTSAYS-QPSSSPTRATILTGQYSIHHGILMP---------------PMYGQPGGLQGLT 170 LT+ YS +P+R ++TG++ H + GQ Sbjct: 112 LTNFYSGNAVCAPSRCCLMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEV 171 Query: 171 TLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 T+ + L GY T A GKW +G P GFD F GFN + + Sbjct: 172 TIAEYLKSVGYRTGAFGKWGLGHFGTTGDPNEQGFDLFYGFNCQRHAHNHYP-------- 223 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 + + R E Q D T ++++ +F+ + Sbjct: 224 -----------------NFLWRNRVKEVQPGNDRTLHGETYSQDQFVNEACEFIRQSVAE 266 Query: 290 DK--PFFLYYGTRGCHF------------------DNYPNAKYAGSSPARTSYGDCMVEM 329 DK PFF Y H +Y + Y R Y + M Sbjct: 267 DKTQPFFAYLPFAVPHLSIQVPEEEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRM 326 Query: 330 NDVFANLYKTLEKNGQLDNTLIVFTSDNGP-------EAEVPPHGRTPFRGAKGSTWEGG 382 ++ + ++ G +NTLI+FTSDNGP + + +G KG EGG Sbjct: 327 DEGVGQVVDLVDSLGLGENTLIMFTSDNGPTYDRLGGSDSDYFNSASGMKGLKGQLDEGG 386 Query: 383 VRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS 441 +RVP G++ R SD I D PT D AG + DG+ Sbjct: 387 IRVPMIARQTGVVPAGRTSDWIGAWWDFLPTITDAAGVEV--------DASTTDGISFLP 438 Query: 442 FFLGTNGQSNRKAEHYFL---NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQ 498 G + Y+ A+RM +K + + Sbjct: 439 LLHGDDAAQQSHEFLYWEFPGYSGQQAIRMGNWKA----------IRKDLSKRLKKGQTE 488 Query: 499 TAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 ++++L D ES+ + H + ++ +++P R Sbjct: 489 PPAFALYDLSKDLAESNDVSASHPDVMAKIEAIAKQQHVPSEQFPLRV 536 >UniRef50_D2R2H5 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2H5_9PLAN Length = 507 Score = 380 bits (975), Expect = e-103, Method: Composition-based stats. Identities = 128/513 (24%), Positives = 194/513 (37%), Gaps = 67/513 (13%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 L + +KPN++V + DD+G+ D+G NG TP ID VA++GL TS Y S Sbjct: 14 LVAAIASSAEKPNIIVIIADDLGYGDLGCNGSQTIA---TPHIDRVAAEGLRFTSGYCSA 70 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLT-TLPQLLHDQGYVTQAIGKWHMG 192 + +PTR ++LTG Y+ P +Q T T+ LL QGY T IGKWH+G Sbjct: 71 STCTPTRYSLLTGTYAFRVKGTGIAAPNSPALIQPETVTVASLLKSQGYATACIGKWHLG 130 Query: 193 ENKES---------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEY 238 P +GFD + +D R +++P L + Sbjct: 131 LGVGKPDWNGELKPGPLEIGFDHCLLLPTTNDRVPQVFVENHRVRNLDPADPLWVGDEKP 190 Query: 239 IKQLPFSKD-------DVHAVRGGEQQAIADITPKYM---------EDLDQRWMDYGVKF 282 P D G Y +DL W+ ++ Sbjct: 191 SDDHPTGISHRSTLAMDWDYGHNGTIHNGISRIGFYTGGMKARFRDQDLADEWVKASAQW 250 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEK 342 ++ A PFFLY+ H P+ ++ G S GD ++E + L K LE+ Sbjct: 251 IE--ANKAGPFFLYFAAHDIHVPRTPHERFVGKS-GMGPRGDSILEFDWCVGELMKVLEQ 307 Query: 343 NGQLDNTLIVFTSDNGPEAEVPPHGRTP-----------FRGAKGSTWEGGVRVPTFVYW 391 + +NTL+V SDNGP + FRG K S +EGG R P V W Sbjct: 308 HQLAENTLVVICSDNGPVLNDGYKDQAVELIGKHAAAGLFRGGKYSVFEGGTRTPFIVSW 367 Query: 392 KGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 KG + SD +V D + LAG +P+ +D ++ LG + Sbjct: 368 KGRVASGVSDKLVSTIDFASSFAALAGA-------KIPEDACLDSLNLLDTLLGDKAAAG 420 Query: 452 RKAEHYFLNGKLA-AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD 510 R+ NG +R ++K P G + +F L +D Sbjct: 421 REYVLQQDNGGTKLGLRAGDWKLVRGGALPGKKKGPGAR----------EADQLFRLSSD 470 Query: 511 PQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 P E+ ++ LQ + + + P Sbjct: 471 PGETKNVAAEFPAELEKLQKLLATIIADGRTRP 503 >UniRef50_Q7UIU1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UIU1_RHOBA Length = 529 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 124/485 (25%), Positives = 190/485 (39%), Gaps = 50/485 (10%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRA 141 +PN+++ + DD+G DV TP + +A +GL A++ +PTR Sbjct: 47 ASRPNIILVMADDLGIGDVSPTNPD--CKIKTPRLQQMADEGLTFLDAHTPSSVCTPTRY 104 Query: 142 TILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ- 198 +LTG+Y+ + + G L TL LL GY T IGKWH+G + Sbjct: 105 GLLTGRYNWRSRLAKGVLSGTSEHLIPGDRATLGHLLQGAGYHTAMIGKWHLGWDWHKNG 164 Query: 199 ------------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 P N GFD + G DM P + KQ P+ Sbjct: 165 KEIDFSKPVLNGPDNNGFDQYYGHCGSLDMPPYVWVDTGTPTSVPTRKEGVTKKQNPYG- 223 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 R G I D +E + D + ++++ K DKPFFLY H Sbjct: 224 ----WYRNG---PIGDDFE--IEQVLPHLFDKSIAYVEERVKEDKPFFLYLPLPAPHTPI 274 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP-- 364 P + +S Y D +++M+ L + K G +NTL++FTSDNG E Sbjct: 275 VPVPPFKDAS-GMNPYADFVMQMDHHMGQLLDAISKAGIDENTLVIFTSDNGCSPEANFG 333 Query: 365 ---PHGRTP---FRGAKGSTWEGGVRVPTFVYWKGM-IQPRKSDGIVDLADLFPTALDLA 417 HG P +RG K +EGG RVP V W G + + ++ + L D++ T + Sbjct: 334 ELAKHGHDPSGKYRGHKADIYEGGHRVPFIVRWPGKVVAGKTTNALTCLTDVYATLQSIT 393 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 P DG D T F G + S+R+A G A+R D +K + Sbjct: 394 DQPREATGGE-------DGFDLTDVF-GGDDSSDREALVSHSIGGSFAIRRDSWKLCLSH 445 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + G +F+L TDP E +S+ + + L ++ Y+E Sbjct: 446 GSGGWSNPREPKAKLQG----LPPMQLFDLETDPAEKNSVAKENPEVVDSLLLLLNEYVE 501 Query: 538 ILKKY 542 + Sbjct: 502 TGRST 506 >UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG53_9BACT Length = 515 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 119/501 (23%), Positives = 199/501 (39%), Gaps = 62/501 (12%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + PN+++ L DD+G + G G PTP +D + +QG+ T A+S +P Sbjct: 27 AAKTETPNIILILADDMGIDSIQALNGK--SGIPTPHLDRLLTQGIHFTDAHSGSAVCTP 84 Query: 139 TRATILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK- 195 TR +LTG+Y+ + + + +P + TLP +L +GY T IGKWH+G + Sbjct: 85 TRYGVLTGRYAWRSRLKKSIVRQWERPLIEKDRLTLPGMLKKKGYNTACIGKWHLGWDWP 144 Query: 196 -------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 E P GFD + G D W+ P V + R Sbjct: 145 KKGGGFTEKMKEIDFSEKIEGGPAGCGFDYYFG-----DDVPNWQ-----PFVWIENGRM 194 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + S + +E + + + V+++++ A++ +PFFLY Sbjct: 195 LGVPNKQLS-----FASHYHSGKGIGVEGWDLEAVLPKITEKSVEYINQQAETKQPFFLY 249 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 + H P+ + G S + Y D ++E + + K L+ G DNTL++FT+D Sbjct: 250 FSMTSPHTPIAPSKPFQGKS-GISRYADFLMETDWCVGQIMKALKDRGIADNTLLIFTAD 308 Query: 357 NGPEA--------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLA 407 NG E + +RG K +EGG RVP V W G I+P KSD + L Sbjct: 309 NGTSPKCNFTELREKRTDLQNHWRGMKADAFEGGHRVPFIVSWPGHIKPGSKSDQTISLV 368 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN-RKAEHYFLNGKLAAV 466 D+ T D VA + + D V G + + +A + V Sbjct: 369 DIMATCADA-------VALTLSDSAAEDSVSLMPVLKGEDIATPLHEAVICHSISGVFVV 421 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R ++K G +++L +DP+E++++ H + Sbjct: 422 RKGKWKLQYSAGSGGLSLPKDKNAKKKG----LPTWQLYDLSSDPKETNNLINGHQEIVK 477 Query: 527 PLQTEMHAYMEILKKYPPRAQ 547 L + Y+E + P Q Sbjct: 478 DLTAILRRYIENGRSTPGTPQ 498 >UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH3_9BACT Length = 455 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 112/464 (24%), Positives = 174/464 (37%), Gaps = 67/464 (14%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 KPN++V L DD G+ DV TP DA+A G+I Y+ S TR Sbjct: 21 ADSKPNIIVILSDDQGYADVS-YNPEHDDYISTPHTDALAKSGVIFHRGYTSGSVCSTTR 79 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 + ++TG+Y +GI G G +P L + GY + A GKWH+G + P Sbjct: 80 SGLMTGRYQQRYGIYTAGEGGT-GTDLNAKFIPNYLKEAGYKSMAFGKWHLGHEMKYHPL 138 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GFDDF GF R H + D K RG E I Sbjct: 139 HRGFDDFYGFMG--------RGAHDFFRLEKEYD----------GKFGGPIYRGLE--PI 178 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS---SP 317 D L R + VKF+++ DKPFF Y H A+ + Sbjct: 179 DD-----KGYLTTRITEETVKFIEE--NKDKPFFAYVAYNAVHTPAQAPAEDIKAVSGDE 231 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGS 377 R + ++ + KTL+K+ +NT+I++ SDNG + + P RG K Sbjct: 232 TRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAKSMVAN-NKPLRGVKHD 290 Query: 378 TWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 ++GG+RVP + W I+ + + V D+ PT LD AG +P + IDG Sbjct: 291 IYDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLDAAG---------LPALSDIDG 341 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 G +R + ++++ +K Sbjct: 342 ESMLPVIRGDKDNLDRP-FFWNHGDGQTGIQLNNWKLVF--------------------- 379 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + ++ + D ES ++ H LQ ++ + Sbjct: 380 -NKGVTELYKISDDIGESKNLAASHPEKVQALQKIYDKWLSQMA 422 >UniRef50_UPI00005846A1 PREDICTED: similar to arylsulfatase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005846A1 Length = 552 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 132/493 (26%), Positives = 203/493 (41%), Gaps = 71/493 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDA-VASQGLILTSAYSQ-PSSSPTRA 141 KPN V+F DD+G+ D+ G P ID + G+ T Y +P+R Sbjct: 56 DKPNFVIFFADDMGYGDLASYGHPTQERGP---IDDVMVENGIKFTQGYVPDTVCTPSRV 112 Query: 142 TILTGQYSIHHGILMPPM-------YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 +LTG+Y + G+ + + G T+ + L ++GY T GKWH+G N Sbjct: 113 ALLTGRYPVRSGVFSGTGGSRVFLPWTRSGLPSTELTIAEALKEEGYTTGMAGKWHLGLN 172 Query: 195 KE------SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 E P + GFD F G + + + D + K + +D Sbjct: 173 SETRDDGVHLPMHHGFD-FVG------HILPFTNSMACDDTGRFVDFPDVTKCFLYKRDQ 225 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 + A P L Q +++ V F++ A PFF Y+ H Y Sbjct: 226 IVA------------QPFNHTYLTQTFVNDAVSFIEDNAHD--PFFFYFPFSHPHVPLYA 271 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 + ++AG S R YGD + EM+ + LE G NTL++F +D+GP+ E HG Sbjct: 272 SPRFAGKSQ-RGEYGDNINEMSWAVGEVIDALEAKGLSQNTLVLFLADHGPQPEYCAHGG 330 Query: 369 TP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 P F+G K +TWEGG+RVP YW G I PR+SD +V D+ T +DLA Sbjct: 331 DPSIFKGYKTNTWEGGIRVPFVAYWPGQITPRESDALVSTLDIMRTVVDLAN-------G 383 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ------- 479 +P T DG T L N S +++ +L AVR +K H + Sbjct: 384 TLPDDTAYDGEVITDVLL-KNAPSPHDVLYHYCKDRLMAVRSGPYKVHYFTHRVQTQDYF 442 Query: 480 ---------PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG----VRHIPMGV 526 P A+ Y + V + ++N+ DP E+ + V Sbjct: 443 AGECQDGGLPLAHYFDCYH-CYDSCVTEQDPPLIYNVEHDPIEAYPLNTTLDSSLAEFMV 501 Query: 527 PLQTEMHAYMEIL 539 LQ ++ A++ + Sbjct: 502 DLQEKVTAHIASV 514 >UniRef50_A6LIX5 Arylsulfatase n=2 Tax=Bacteroidales RepID=A6LIX5_PARD8 Length = 514 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 127/509 (24%), Positives = 199/509 (39%), Gaps = 62/509 (12%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 K+PNVV+ L DD+G+ DVG N TP ID +A G+ T A+S S P+R Sbjct: 20 AQKQPNVVIILADDMGYGDVGCNNP--YARVRTPAIDQLARNGIRFTDAHSAGALSGPSR 77 Query: 141 ATILTGQYSIHH-GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ- 198 ++TG+Y Y P T+ L+ + GY T +GKWH+G + + + Sbjct: 78 YGLVTGRYFFRTPKKSEYWGYLSPYIEPERLTIGSLMRNAGYTTACVGKWHLGLDWQLKD 137 Query: 199 --------------------------PQNVGFDDFRGFNSVSDMYTEWR---DVHVNPEV 229 P +GFD + DM D V+P+V Sbjct: 138 DSKPQILTPKKFGYTNTDFSAPVKRGPTELGFDYSFILPASLDMPPYAFVRNDRVVDPDV 197 Query: 230 ALSPDRSE--------YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVK 281 L+ D + +++D++ RG + E+ +D G+ Sbjct: 198 ILTADAYPKKQDETVYAWDRKHTNENDIYWERGVWWRNGEMSRSFKFEECFPTIVDEGIA 257 Query: 282 FLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLE 341 F+D+ + DKPFFLY G H P ++ GS+ +YGD M ++++V A + L+ Sbjct: 258 FIDREGRKDKPFFLYMPLTGPHTPWLPTVQFKGSTEL-GTYGDFMGDIDNVVARVNAKLK 316 Query: 342 KNGQLDNTLIVFTSDNGPEAE------VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI 395 + G NT+++F SDNG E RG KG W+GG VP V+W I Sbjct: 317 ELGLEKNTIVIFASDNGGAWEEEDIQQYGHQSNWSRRGQKGDAWDGGHHVPLIVHWPDHI 376 Query: 396 Q-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 + P V L D+ T DL G +PK D G S R Sbjct: 377 KCPGVCSQTVGLVDILATLADLTG-------QSLPKGQAEDSFSFKKVLDGDMNASTRDQ 429 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 Y A++ ++KY + V ++N+ TD ES Sbjct: 430 IMYLSGSGKLAIKKGDWKYI-----DCLGSGGFTAPARLSPVKNGPKGQLYNMRTDSLES 484 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +++ +R + L + ++ P Sbjct: 485 NNLFLREKGIANELSALLKKLLDQGYSRP 513 >UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=5 Tax=Hydra magnipapillata RepID=UPI0001927538 Length = 502 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 119/501 (23%), Positives = 204/501 (40%), Gaps = 78/501 (15%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 KP++++ + DD+GW D+ F+G PTP+ID +A+ G+IL + Y P +P+R+ Sbjct: 17 ADKPHIIMIVADDLGWNDISFHGSNEI---PTPNIDRLANNGVILDNYYVLPICTPSRSA 73 Query: 143 ILTGQYSIHHGILMPPMYG-QPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 I+TG+Y IH G+ ++G P G+ LPQ L QGY T +GKWH+G K+ P Sbjct: 74 IMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTP 133 Query: 200 QNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD + G + D + +S D+H G Sbjct: 134 TYRGFDSYYGSYLGKGDYWNH-------------------SNTETYSGLDLHDNENG--- 171 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF------DNYPNAKY 312 + + + + + ++ S +P FLY + H ++ Sbjct: 172 ----VFSQDGNYSTEMYTAEAISCINNH-NSSEPLFLYLAYQAVHSANTEEDPLQAPQEW 226 Query: 313 AG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EVP 364 R Y + M+ ++ L + LDN++I+FT+DNG A + Sbjct: 227 IDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFDYN 286 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 P RG K + +EGGVR +FVY K + PR S ++ + D PT ++LAG + Sbjct: 287 WANNFPLRGVKATLFEGGVRGVSFVYSKLIESPRVSHELIHITDWLPTLVNLAGGKVS-- 344 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN--GKLAAVRMDEFK---------- 472 F+DG DQ + + K A+R+ +K Sbjct: 345 ------DGFLDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKIIKEGNYWDG 398 Query: 473 ------YHVLIQQPYAYTQSGYQGGFTGTVM--QTAGSSVFNLYTDPQESDSIGVRHIPM 524 ++ ++Y S + G ++ +F++ DP E + + + + Sbjct: 399 WYPPPSFNEQSNNSFSYLSSTVKCGHDIPIVINHCDSYCLFHIDEDPCEINDLSKKFPEV 458 Query: 525 GVPLQTEMHAYMEILKKYPPR 545 L ++ Y + + PPR Sbjct: 459 LAELINRLNTYRQSMV--PPR 477 >UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3 Tax=Deuterostomia RepID=UPI0000586CBA Length = 596 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 120/512 (23%), Positives = 204/512 (39%), Gaps = 83/512 (16%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 ++ TGK P++V + DD GW DVG++ + TP++D +AS+G+ L + Y QP S Sbjct: 91 IKGATGKPPHIVFIVADDYGWFDVGYHNSTI----KTPNLDLLASRGVKLENYYVQPICS 146 Query: 138 PTRATILTGQYSIHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGENK 195 P+R+ ++TG+Y IH G+ + QP L TTLPQ L + GY T +GKWH+G K Sbjct: 147 PSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFYK 206 Query: 196 -ESQPQNVGFDDFRGFN-SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 E P GFD G+ + D +T +R P+ + ++ + + V Sbjct: 207 NECMPLQRGFDSSFGYLSGMQDYWTHFRS----GSFPGFPEGNHWLGIDFWDNNRV---- 258 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 +Y + Q + + + ++P FLY + H KY Sbjct: 259 ----------AWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHGPLQVPEKYM 308 Query: 314 G-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 R +Y + M++ + +L++ G ++T++VFT+DNG + Sbjct: 309 KPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTPGKSGN-N 367 Query: 369 TPFRGAKGSTWEGGVRVPTFVYW---KGMIQPRKSDGIVDLADLFPTALD-LAGHPGAKV 424 P RG K + WEGGV F+ +Q S + ++D FPT ++ +AG A + Sbjct: 368 WPLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGL 427 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY--------------------------- 457 A +D + + S RK + Sbjct: 428 A--------LDSYNMWNSIT-KGTPSPRKELLHNIDPYIRADHPFGYGYDEETDMIYPLS 478 Query: 458 ---------FLNGKLAAVRMDEFKYH--VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 F AA+R+ E+K + + + +FN Sbjct: 479 GLYPKMAAEFSTDMRAAIRVGEWKLLTGFPGRSGWYPPPEWNIHPIDPVEAANKVTWLFN 538 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + DP E + + +H + L + AY + Sbjct: 539 ITADPCEKNDLSYQHPEVVTELVGRLEAYYKT 570 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 121/503 (24%), Positives = 188/503 (37%), Gaps = 99/503 (19%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 K+ KPNV++ +DD+G D+G G TP IDA+A G T YS P S Sbjct: 22 AKEMADKPNVLLIFIDDLGKTDIGIEGSSF---YETPRIDALAKSGARFTQFYSAHPVCS 78 Query: 138 PTRATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 PTRA ++TG+ GI Q T+ Q + GY T +GKWH+G + Sbjct: 79 PTRAALMTGKMPQRLGITDWIRPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLGHKPQ 138 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR-SEYIKQLPFSKDDVHAVRGG 255 P GFD +G N + + + NP+ +P+ ++ K P Sbjct: 139 QHPAARGFDWTKGVNHGGQPSSYYF-PYKNPQKPDAPNNVPDFEKCQPE----------- 186 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA----- 310 + L ++ L + ++ +PFFL H P Sbjct: 187 -------------DYLTDVLTSSAIEHLQQRDRT-RPFFLCLAHYAVHTPIQPPKNLVEK 232 Query: 311 -------KYAGSSPA---------------RTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 + SP +Y + ++ L L+ G LD Sbjct: 233 YQVKLATQKNPKSPGEGIQEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQGILDQ 292 Query: 349 TLIVFTSDNGP-----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI 403 T++VFTSDNG P P R KG T+EGG+R+PT++ W G I P+ D Sbjct: 293 TIVVFTSDNGGLCTLNGKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPGKISPQVLDIP 352 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG--QSNRKAEHYFLNG 461 D++PT L L P T +DG+ ++ +S R Y+ + Sbjct: 353 AYTCDIYPTLLSLCQIPPR-------PTQHVDGISLAGLLTKSSSLPESERTLVWYYPHT 405 Query: 462 K------LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 AA+R +K ++T +++L DP ES Sbjct: 406 HGSGHKPSAAIRQGPWKLIHF--------------------LETDRIELYHLEDDPGESR 445 Query: 516 SIGVRHIPMGVPLQTEMHAYMEI 538 ++ +H + LQ E+ +E Sbjct: 446 NLASKHPERALQLQKELQKIIES 468 >UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SRP2_NEMVE Length = 491 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 128/498 (25%), Positives = 212/498 (42%), Gaps = 73/498 (14%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 ++ KP+++ L DD+GW DVGF+G + TP+ID +A+ G+IL + Y QP +PTR Sbjct: 20 QSSAKPHLLFVLADDLGWSDVGFHGSKI----QTPNIDRLAANGVILDNYYVQPVCTPTR 75 Query: 141 ATILTGQYSIHHGILMPPMY-GQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE-NKES 197 A+++TG+Y IH G+ ++ G+P GL LT LPQ L GY T +GKWH+G N ES Sbjct: 76 ASLMTGKYPIHTGLQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWES 135 Query: 198 QPQNVGFDDFRGFNSVS-DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD F GF S + + YT +D +++ +D+ VR Sbjct: 136 TPTYRGFDTFYGFYSGAENHYTHVQDHYLD------------------LRDNEEIVRDQN 177 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG-- 314 A + K E + + P F+Y + H +Y Sbjct: 178 GTYSAHLFTKRAEQ------------IVRAHDPSTPLFMYMAFQNVHSPVQAPKEYIDRY 225 Query: 315 ---SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 P R +Y + M+D NL + +K G +NT+++F++DNG + + P Sbjct: 226 SFIKDPLRRTYAAMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNGGVPKNGGYDY-PL 284 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 RG K + WEGGVR FV+ + Q K ++ + D +PT + LAG + + Sbjct: 285 RGRKDTLWEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAG-------GSLDE 337 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYF-------------LNGKLAAVRMDEFKYHVLI 477 +DG D ++ H + +R+ + K + + Sbjct: 338 DEDLDGYDVWESISHGVESPRKELLHNIDTINIPPGDGSLGFSTTGIGLRVGDMKLLMAV 397 Query: 478 QQPYAYTQSGYQGG------FTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 + + G + + +++N+ DP E + + + LQ Sbjct: 398 PNISYFIPPEDRNGSVDWYIHSNNKVPMVEVALYNITADPYEKHDLHDKLPDVVTRLQLR 457 Query: 532 MHAYMEILKKYPPRAQIK 549 + Y + PP + K Sbjct: 458 VEHYRKTAV--PPANKPK 473 >UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI18_9BACT Length = 562 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 119/499 (23%), Positives = 201/499 (40%), Gaps = 64/499 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 +KPN++ L DD+G DV PTP +D +A+ G++ T A++ +PT Sbjct: 27 SAAEKPNIIYLLADDMGVGDVKAYNAD--SKIPTPALDNLAANGMMFTDAHTNSSVCTPT 84 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENK-- 195 R ILTG+YS G L T+ LL +GY T IGKWH+G + Sbjct: 85 RYGILTGRYSWRTTKKSGVTQGLSPHLIDSNRETVASLLKKEGYATACIGKWHLGMDWSL 144 Query: 196 ---------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD 234 ++ P GFD + G + ++ +P + Sbjct: 145 KDGSIADSKSDQSQIDLSKEIQNGPNKNGFDYYFGMAASANH---------SPHCFIEDG 195 Query: 235 RSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS--DKP 292 + + +L D G + + ++ R+ + +++ D+P Sbjct: 196 YT--VGKLQVLDDKQRKAVGIDGKPGLVAKGFKQSEILPRFTEKTCEWVRSQVNQKPDQP 253 Query: 293 FFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 FF+Y H P+AK+ G S +S+GD +E + + K L+ G DNT+I+ Sbjct: 254 FFVYMPLNSPHSPIVPSAKFLGKS-GLSSHGDFCMETDWALGEVVKILKALGIEDNTMII 312 Query: 353 FTSDNG--PEAEVPP---HGRTP---FRGAKGSTWEGGVRVPTFVYWK-GMIQPRKSDGI 403 FT+DNG P A+ P G P +RG KG T+EGG RVP V W G+ + SD + Sbjct: 313 FTADNGTSPMAKFEPMQEQGHFPSYIYRGLKGETYEGGHRVPFIVKWPKGLAPAKTSDQL 372 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS-NRKAEHYFLNGK 462 + DL T ++ G A D + +A + + Sbjct: 373 ICTTDLMATVAEINGIALANNVGE-------DSISFLPALREQAIPELANRAIVHHSDAG 425 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 + A+R ++K + S V+ A +F++ DPQES ++ ++ Sbjct: 426 VFAIRQGKWKLLL-----DNIGGSRRSNPKDKPVIDDAEIQLFDMVNDPQESTNLSQKNP 480 Query: 523 PMGVPLQTEMHAYMEILKK 541 + L+ ++ Y+ + Sbjct: 481 EIVEGLKKQLADYINKGRS 499 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 118/481 (24%), Positives = 187/481 (38%), Gaps = 70/481 (14%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATIL 144 PNV+ L DD+G+ ++G G TP IDA+A +G+ T YS P +P+R +L Sbjct: 20 PNVIYILADDLGYGELGCYGQEKI---KTPHIDALAKEGMRFTRHYSGAPVCAPSRGVLL 76 Query: 145 TGQ-----YSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQ 198 +GQ Y I + P +P G+ TL Q+ D+GY T A GKW +G S Sbjct: 77 SGQQLSKAY-IRNNREHKPEGQEPIPEPGM-TLAQIFKDKGYATGAFGKWGLGYPGSSSD 134 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P+ +GFD F G+N ++ + P S D++ I + P G + Sbjct: 135 PKALGFDTFYGYNCQRVAHSFY------PPHMWSNDKNITINEKPV-PGHWRKAVGPDFD 187 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 Y DL +D +KF+ DKPFF Y H +P + S P Sbjct: 188 FSQFYAENYAPDL---ILDEALKFIKD--NKDKPFFAYLPFVEPHLAMHPPHSWVDSYPK 242 Query: 319 -------------------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 R Y + ++++ ++ + L++ ++NTL++FTSDNG Sbjct: 243 EWDSPKESYKAAYLPHLRPRAGYAAMISDLDEHVGSVMQLLKELDLVENTLVIFTSDNGA 302 Query: 360 E-----AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTA 413 + RG KGS +EGG+RVP +W G I + + SD + D+ T Sbjct: 303 SHCIEVDHEFFNSTKDLRGLKGSVYEGGLRVPMIAHWPGKIKKAQVSDHVSGFVDVMATF 362 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN--GKLAAVRMDEF 471 DL + + DGV G + F G+ A + + Sbjct: 363 CDLLQTEAPQTS---------DGVSFLPTLKGEKQEPQPVLAWEFQGYSGQQAIILDGRW 413 Query: 472 KYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 K + T +++L DP E + + + + Sbjct: 414 K----------GVRQNLSPRGKKKAKSTPKWELYDLNKDPNEKTDLATQMPEIVDRIHKA 463 Query: 532 M 532 M Sbjct: 464 M 464 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 109/478 (22%), Positives = 183/478 (38%), Gaps = 74/478 (15%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSP 138 K+ ++PN+++ + DD G+ D +G TP I +A++G+ T Y+ +P Sbjct: 28 AKEKSRQPNIILVMTDDQGYWDTEISGNPKI---KTPTIKKLAAEGVTFTRFYANMVCAP 84 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 TRA ++TG++ + G+ G G TT+ Q+L GY T GKWH+G + Q Sbjct: 85 TRAGLMTGRHYLRTGLYNTRFGGDTLG-PNETTIAQVLQKAGYKTGLFGKWHLGRYAQYQ 143 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 PQ GFD F G Y + + NP+ Q Sbjct: 144 PQRRGFDHFFG------HYHGHIERYTNPD-----------------------------Q 168 Query: 259 AIADITP-KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS- 316 + + TP + + + D + F+ + +PFF Y H + + G Sbjct: 169 VVVNGTPVETRGYVTDLFTDAAIDFIQR--NQQQPFFCYLAYNAPHSPFLLDTSHFGQPE 226 Query: 317 -------------PARTSY-GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 P R + + ++ + L +T+ T+++FTSDNG Sbjct: 227 GDKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSDNGG--- 283 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG 421 V + +G+K S +EGG RVP V W +D +V DLFPT LAG P Sbjct: 284 VSRGFKAGLKGSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLFPTFCQLAGVP- 342 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 VP +DG S G+S + ++ + R YH Sbjct: 343 ------VPSNVKLDGESILSLMEQGGGKSPHQYLYHTWD------RYTPNPYHRWAIHGP 390 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + G+ +++L DP E ++ ++ L+ E + + + Sbjct: 391 RFKLVGHDPQGKKKKEGEPQGQLYDLQEDPGEKKNVADQYPEKVSELRGEFLRWFQDV 448 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 115/495 (23%), Positives = 173/495 (34%), Gaps = 98/495 (19%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 + + ++PN+++ DD G DVG G + PTP ID +A +GL+ YS + Sbjct: 21 QLQAAERPNILIIFTDDQGINDVGCYGSEI----PTPHIDQLAKEGLLFRQYYSASAICT 76 Query: 138 PTRATILTGQYSIHHG-------ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 P+R ILTG+ + M + G G TT+ +L GY T +GKWH Sbjct: 77 PSRFGILTGRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWH 136 Query: 191 MGENKE-SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 +G E P GFD FRG Y Y + + Sbjct: 137 LGHGTESFLPTAHGFDLFRGHTGGCIDYFTMT----------------YGNIPDWYHNQR 180 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 H G Y DL + FL +DKPFFL+ HF + Sbjct: 181 HVSENG-----------YATDL---ITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWS 226 Query: 310 AKYAG------------------SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 R + V ++D + +L+ NG NTL+ Sbjct: 227 PGDQSPVNIMQARGDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLV 286 Query: 352 VFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLF 410 +F +D+G V PFRGAK + +EGG+RVP + W G I+ ++ + DLF Sbjct: 287 IFMTDHGG-DYVYGGNNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLF 345 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH------YFLNGKLA 464 PT A +DG D + R+ G+ + Sbjct: 346 PTICHFANVDT--------DGLTLDGKDISGLLTRQTPVGTRELYWQLGPHAELKRGRWS 397 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 A+R ++KY +F+L DP E ++ Sbjct: 398 ALRQGDWKYI---------------------QDAGGEEFLFDLKADPYEKQNLTQSQSTK 436 Query: 525 GVPLQTEMHAYMEIL 539 LQ ++ L Sbjct: 437 LTELQERRDTLVKTL 451 >UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacteria RepID=A6DR29_9BACT Length = 510 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 132/497 (26%), Positives = 191/497 (38%), Gaps = 61/497 (12%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 + + KPNV++ + DD+GW D GFNG V TP +D +A++GL L Sbjct: 9 AASAALFSPFISAESAKPNVILIMADDLGWGDTGFNGSKVI---KTPHLDQMAAEGLQLD 65 Query: 129 SAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 YS SPTRA++LTG+ G+ P Q TLP++L++QGY T G Sbjct: 66 RFYSASSVCSPTRASVLTGRNPYRTGV---PTANQGFLRPEEITLPEVLNEQGYATGHFG 122 Query: 188 KWHMG---------------ENKESQPQN-VGFDDFRGFNSVSDMYTEWRDVHVNPEVAL 231 KWH+G KE P G++D S Y + Sbjct: 123 KWHLGTLTHTEKDANRGKPGNTKEFNPPKLHGYEDAFVTESKVPTYDPMILPAKFDQGES 182 Query: 232 SPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 EY+K+ SK E + I D D + MD + F+D+ +K Sbjct: 183 KHLGWEYVKEGEESKPYGTFYWDIEGKKITDNLK---GDDSRVIMDRVLPFIDQAVADEK 239 Query: 292 PFFLYYGTRGCHFDNYPNAK----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 PF H + Y G +Y C+ M++ L K L G D Sbjct: 240 PFLSVVWFHTPHLPCVAGPRHQEMYKGHPIHLRNYAGCVTAMDEQIGRLRKHLADKGVAD 299 Query: 348 NTLIVFTSDNGPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIV 404 NT+I F SDNGPE++ P + FRG K +EGGVRVP + W ++ RK Sbjct: 300 NTMIWFCSDNGPESKERPDNGSAGHFRGRKRDLYEGGVRVPAVMVWPAKVKEARKISAPC 359 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 +D PT LD P + + DG N R E + Sbjct: 360 ITSDYMPTILDALHIPHPQASYAT------DGRSLMPII--NNEDFTRDKEIGIMFSSRI 411 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 +FK Y GG ++NL +DP E + ++ + Sbjct: 412 VWHKGDFKLL------------SYNGG--------KKYELYNLKSDPSEKTDVAAQNPEL 451 Query: 525 GVPLQTEMHAYMEILKK 541 L+ +M A+ E +K Sbjct: 452 VEKLKKDMLAWHESVKS 468 >UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD88_SHEWM Length = 500 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 112/502 (22%), Positives = 183/502 (36%), Gaps = 75/502 (14%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 + +E K ++PNV+ FL DD+G D+G G TP+ID +A++G+ + Y+ Sbjct: 24 SNIEPKVNRQPNVIYFLADDLGVGDLGSYGQQHI---RTPNIDKLAAEGMRFSRHYAGSS 80 Query: 135 SSSPTRATILTGQYSIHHGIL----------MPPMYGQPGGLQGLTTLPQLLHDQGYVTQ 184 +P+RA+++TG+ H I P GQ QG TL L GY T Sbjct: 81 VCAPSRASLMTGRDMGHTDIRGNIQLMDQPDSPEYQGQYPLAQGTITLAHLFQLAGYQTG 140 Query: 185 AIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 A GKW +G P+ +GFD F G+ + + + + D Sbjct: 141 AFGKWGLGSLQSSGNPKAMGFDQFYGYLDQRHAHNYFPQYLWDGDEVARLDNPAINVHPK 200 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 +D + D P + + +F+ + D+ FFLY H Sbjct: 201 LDRDK----SDHREYMGKDYAPY-------KILARAKEFISQ--NRDEAFFLYVPFVVPH 247 Query: 304 --------------FDNYPNA-----KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 FD + Y R + + M+ ++ L++ G Sbjct: 248 AAIQIPDKELDGYQFDETAHRLGEPRAYTPHPKPRAARAAMISRMDRDVGDIMAMLKELG 307 Query: 345 QLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK 399 DNTL++F+SDNG A + RG K + +EGG+R P W G I Sbjct: 308 LDDNTLVLFSSDNGATAAGGSDINFFNSTAGARGEKATLYEGGIRAPLIARWPGNISAGS 367 Query: 400 -SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH-- 456 SD + D+ PT L + I G+ LG ++ + Sbjct: 368 ESDHLSAFWDMLPTFAQLLDLSVPEG---------IQGISMLPTLLGKPQNQQHESLYWE 418 Query: 457 YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 +F AV M +K Y ++ ++++NL DP ES + Sbjct: 419 FFSRNPSQAVVMGNWKAI-----------RHYSKERGKGALELGATALYNLQEDPSESQN 467 Query: 517 IGVRHIPMGVPLQTEMHAYMEI 538 + +H + + M Sbjct: 468 LAAKHPELVKKAEMIMAQRQRS 489 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 113/498 (22%), Positives = 173/498 (34%), Gaps = 90/498 (18%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 KPN+V+ + DD+G ++G G TP ID +A G T YS P + Sbjct: 19 RAADAPKPNIVLIVADDLGCFELGCYGQTKI---KTPHIDKLAQGGAKFTRFYSGSPVCA 75 Query: 138 PTRATILTGQYSIHHGILMPPM---YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-E 193 P+R ++TG++S H + GQ T+ L GY T A+GKW +G Sbjct: 76 PSRCVLMTGKHSGHATVRNNVEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMF 135 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 + P GFD F G+N ++ + + D ++ Sbjct: 136 DTAGSPLKHGFDLFFGYNCQRHAHSHY-------------------PTYIYRNDKRVELK 176 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF--------- 304 G + + T + + + F++ A KPFFLY H Sbjct: 177 GNDGKTGKQFT-------QDLFEEEALGFIE--ANKAKPFFLYLPFTVPHVAVQVPEDSL 227 Query: 305 ----------DNYPNAK-YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 Y K Y Y + M+ + + L G NTL++F Sbjct: 228 NEYKGQLGDDPAYDGKKGYQPHPAPHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLF 287 Query: 354 TSDNGPEAEVPP------HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDL 406 TSDNGP V + RG KGS +EGG+RVP Y G I+ SD + Sbjct: 288 TSDNGPTHNVGGADSSFFNSAGKLRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYF 347 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAA 465 D+ PT AG + IDG+ G ++ F G A Sbjct: 348 PDVLPTLCAFAGTKAP---------SAIDGISFLPLLKGEKQPTHDFLYWEFSGYGGQQA 398 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 V E+K M + ++NL DP E + + ++ + Sbjct: 399 VIEGEWKAVR-----------------QALGMGGVKTELYNLAKDPSEKEDVAAKNPAVL 441 Query: 526 VPLQTEMHAYMEILKKYP 543 L+ + +P Sbjct: 442 ARLEKRLKNEHTPNSNFP 459 >UniRef50_A6DLD9 Sulfatase n=2 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DLD9_9BACT Length = 517 Score = 376 bits (967), Expect = e-102, Method: Composition-based stats. Identities = 124/517 (23%), Positives = 205/517 (39%), Gaps = 68/517 (13%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 +KPN+++ DD+G+ D+ GG G TP ID +A+ G+ +S Y+ + +P Sbjct: 18 SAATEKPNILIIYADDIGYGDLSCYGG---TGAQTPFIDRLANDGIRFSSGYASAATCTP 74 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGL-QGLTTLPQLLHDQGYVTQAIGKWHMG----- 192 +R ++LTG+Y+ + P + + + D GY+T +GKWH+G Sbjct: 75 SRYSLLTGEYAFRNKSAKILPGNAPLIIDPAKPNIASFMKDAGYITALVGKWHLGLGLSD 134 Query: 193 ------ENKESQPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQ 241 N + P+ +GFD + D V ++P + ++ + Sbjct: 135 GSFDWNSNIKPAPRELGFDYSFYMAATGDRVPSVYIENSEVVDLDPSDPIKVSYAKPVGT 194 Query: 242 LPFSKDDVH----------------AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 P H + ED+ +++ + F++K Sbjct: 195 EPTGISHPHLLTVQADVQHAGTIVNGISRIGTMTGGHAARFKDEDMADTYLNKAIDFINK 254 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 D+PFF+Y+ H P+ ++ GSS GD +V+ + L KTL+ N Sbjct: 255 --SKDQPFFMYFAAHDNHVPRRPHPRFQGSSSL-GPRGDAIVQFDWTVGKLIKTLKANKM 311 Query: 346 LDNTLIVFTSDNGP-----------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 NTLI+ +SDNGP PFRG K S WEGG R+P V W G Sbjct: 312 YRNTLIILSSDNGPVLFDGYWEGSEARNGDHKAAGPFRGGKYSLWEGGTRMPFIVSWPGK 371 Query: 395 IQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 IQ S ++ D+F + L G +PK+ DG + +G + R Sbjct: 372 IQSGTSSALISQVDIFASIATLIG-------KDLPKSASPDGQNMLPALMGKS-PVGRDY 423 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 A+RM ++KY P T+ G + T + G +FNL DP E+ Sbjct: 424 LVE-EALSQVALRMGDWKYIP----PGTVTERGGLDEWIKTPVHPPG-MLFNLADDPGET 477 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEI---LKKYPPRAQI 548 + + +H + + KK P +Q+ Sbjct: 478 NDLSKQHPKKVKAMLAILKKEAPSKFLNKKTPGASQL 514 >UniRef50_B7S1F0 Sulfatase, putative n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S1F0_9GAMM Length = 470 Score = 376 bits (966), Expect = e-102, Method: Composition-based stats. Identities = 132/477 (27%), Positives = 221/477 (46%), Gaps = 42/477 (8%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 KPN+V+ ++D+ G+ +VG GGG+ G PTP+ID++A++G LT+ + +P+ Sbjct: 26 TAAASKPNIVMVVMDNFGYGEVGVYGGGMLRGAPTPNIDSIATEGFQLTNFNVEAECTPS 85 Query: 140 RATILTGQYSIHHGILMP--PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 R++++TG+Y I P G TL ++L D GY T GKWH+G+ + Sbjct: 86 RSSLMTGRYGIRTRQRPNDEPRGIWYGITPWEITLAEMLSDAGYATGMFGKWHLGDEEGR 145 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P + GFD++ G + SD A PD + K V + G++ Sbjct: 146 YPTDQGFDEWYGIPNSSDQ-------------AFWPDSDSFQKDAGVEFTHVMESKRGQK 192 Query: 258 QAIADITPK-YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 D+ + + +D+ D + F+ + AK+ KPFF Y H + + G S Sbjct: 193 PKKKDVYGREKRKTIDREITDRAIDFIKRKAKAGKPFFAYLPYTQTHEPVDAHPDFKG-S 251 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAK 375 S+ D + + + L KT++ G DNT+ +FTSDNG E G T P+RG Sbjct: 252 TGNGSFADVLAQTDSYVGELLKTIDNLGFKDNTIFIFTSDNGREGIKRSFGFTGPWRGTM 311 Query: 376 GSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 + +EG +RVP + + I +K S+ IV L D+FPT L+G +P+ + Sbjct: 312 FAPYEGSLRVPFLIRYPDKIPAKKVSNDIVHLIDIFPTIAKLSG-------GEIPQDRIL 364 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DGVDQT F G + +S R++ ++ +L V+ +K + +Y Sbjct: 365 DGVDQTDFLTGKSEKSARESVIIYIGNELFGVKWRNWKMLLKEIDEDSY----------- 413 Query: 495 TVMQTAGSSVFNLYTDPQESDS--IGVRHIPMGVPLQTEMHAYMEILKK---YPPRA 546 + A S++NL DP+E + + + PL + + ++ PP Sbjct: 414 AIQTMAYPSIYNLIVDPKEEEPEKFYLDDTWVDTPLWRVVEEHTASIEADKGAPPEQ 470 >UniRef50_C6VTS4 Sulfatase n=47 Tax=cellular organisms RepID=C6VTS4_DYAFD Length = 520 Score = 376 bits (966), Expect = e-102, Method: Composition-based stats. Identities = 128/508 (25%), Positives = 199/508 (39%), Gaps = 67/508 (13%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS- 135 + + KPN+V+ LDD+G+ DVG G A TP++D +A+ G+ T+ Y+ S Sbjct: 28 APQTEKAAKPNIVIVNLDDLGYGDVGAYG---ATALKTPNMDRIANGGIRFTNGYATSST 84 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGEN 194 +P+R ++TG Y + P + T+P++L GY T +GKWH+G Sbjct: 85 CTPSRFALVTGVYPWRNKEAKILPGDAPLLIDTAQQTIPKVLKKAGYATAIVGKWHLGLG 144 Query: 195 KESQ---------PQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIK 240 P +GFD + D R V ++P + + + Sbjct: 145 NGDTDWNKEVKPGPNQLGFDYSYILAATQDRVPTVYIENTRVVGLDPNDPIRVSYKQNFE 204 Query: 241 QLPFSKDDVHAVR----GGEQQAIADITPK--YM----------EDLDQRWMDYGVKFLD 284 P KD+ ++ G Q+I + + YM E++ ++ +F+ Sbjct: 205 GEPTGKDNPELLKMKWHHGHDQSIVNGISRIGYMKGGQKAKWNDEEMADLFLTKAQQFIK 264 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 KPFFLYY + H P+ ++ G + GD + E + L TLEK G Sbjct: 265 DH--KSKPFFLYYAMQQPHVPRTPHPRFKGVT-GMGPRGDAIAEADWCLGELLNTLEKEG 321 Query: 345 QLDNTLIVFTSDNGPEAEVPPHG-----------RTPFRGAKGSTWEGGVRVPTFVYWKG 393 L+NTLI+FTSDNGP H P RG K S +E GVRVP YWKG Sbjct: 322 ILENTLIIFTSDNGPVVNDGYHDDAVEKLGKHKPAGPLRGGKYSLFEAGVRVPFITYWKG 381 Query: 394 MIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 I+P SD +V DL + L G +D + FLG + + Sbjct: 382 TIKPAVSDAVVCQLDLLSSLAHLTGQEA----------KGLDSRNYLDVFLGKTQKG--R 429 Query: 454 AEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 +E A+R ++ P + G ++NL TD + Sbjct: 430 SELILEASSRTALRQGDWLMIPPYNGPAINKMVNIELG------NAKEYQLYNLKTDIGQ 483 Query: 514 SDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++ L T + K Sbjct: 484 QHNLAKSEPERLKKLVTAFEQLQQGGAK 511 >UniRef50_Q7UYH4 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYH4_RHOBA Length = 479 Score = 376 bits (965), Expect = e-102, Method: Composition-based stats. Identities = 114/490 (23%), Positives = 192/490 (39%), Gaps = 60/490 (12%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 ++P+VVV L+DD+G+ D G TP+ID++A G+ T+A++ P +R Sbjct: 19 AAERPHVVVILVDDMGYGDPGCFNPD--SKIETPNIDSLARDGMRFTNAHAPGPLCHMSR 76 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ-- 198 ++TG+Y + + P +P TL L QGY T +GKWH+G + + Sbjct: 77 YGLMTGRYPFRTDVSVWPR--EPLIDPDQATLASLAKSQGYRTTMVGKWHLGFEERANES 134 Query: 199 --------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP--DRSEYIKQLPFSKDD 248 P + GFD F G + +D+ + ++N A+ P DR E +S Sbjct: 135 YDRPLLGGPVDRGFDHFFGIRASTDIPPYF---YINDRNAVHPPTDRIEANASEGWSPIQ 191 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM--AKSDKPFFLYYGTRGCHFDN 306 R G ++ D+ + D + + A+ P LY H Sbjct: 192 GAFWRAGGIAPDLELA-----DVLPHFTDVAISSIQAHPNAEDASPMMLYLAYPAPHTPW 246 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP------E 360 P+ ++ G S SYGD ++ ++ + TL+ + +T+++FTSDNGP Sbjct: 247 LPSPEFTGKSKVD-SYGDFVMMVDHEVGRVLDTLKVHDMERDTIVIFTSDNGPVWYENDT 305 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGH 419 FRG K WE G R+P V W G S D +V DL T D+ Sbjct: 306 ERFQHDSAGGFRGMKADAWEAGHRMPFIVRWPGHASASSSTDHLVCFTDLMATFADI--- 362 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAA---VRMDEFKYHVL 476 +P+ D L + K + + + +R ++K Sbjct: 363 ----WETELPQDAGPDSHSFLPALLQQPFEEGTKRTEFVMRAGSKSTMTIRAGDWKL--- 415 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGS-----SVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 GGF+ ++NL +DP E++++ H + LQ Sbjct: 416 -------ITGLGSGGFSKPSRIPPKPGGPTGQLYNLKSDPAEANNVYQDHPDIVQRLQKR 468 Query: 532 MHAYMEILKK 541 M ++ + Sbjct: 469 MKQIVDDGRS 478 >UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W8_9PLAN Length = 459 Score = 376 bits (965), Expect = e-102, Method: Composition-based stats. Identities = 107/489 (21%), Positives = 181/489 (37%), Gaps = 82/489 (16%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A ++ G++PN++ + DD+G+ D+G G + TP ID A+QG T AY+ Sbjct: 19 ASMQAAEGERPNIIFIMADDLGYGDLGCYGQKLM---KTPHIDQFAAQGTRFTQAYAGGS 75 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE- 193 + +RA +LTG ++ H + + T+ ++L GY +GKW +G+ Sbjct: 76 VCTASRAVLLTGLHNGHTPARDNIPHYATYLQESDVTIAEVLQKSGYRCGGVGKWSLGDA 135 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 + N GFD + G+ + + + + + E L + +Q Sbjct: 136 GTVGRATNQGFDMWFGYLNQDHAHYYFTEYLDDNEGRLELKGNTKNRQ------------ 183 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF--------- 304 +Y DL + ++F+ A +PFFLY HF Sbjct: 184 ------------QYSHDL---LTERALQFIRDSAA--QPFFLYAAYTLPHFSAKAEDPHG 226 Query: 305 ---DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 + Y + ++ + + + + TLI+FTSDNG Sbjct: 227 LAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGGHR 286 Query: 362 EVPP--HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAG 418 VP H P RG K EGG+RVP W G I K SD ++ D+ PT +LAG Sbjct: 287 GVPAQLHTNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELAG 346 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN----GKLAAVRMDEFKYH 474 + +DG+ G + + ++ AVR + +K Sbjct: 347 AQVSAN---------LDGISVLPALRGEPRKVKHEYLYWDYGHCRARYDQAVRWNNWKGI 397 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 QQ +++NL D ES + +H + + M+ Sbjct: 398 RHGQQ--------------------GEIALYNLDQDLSESRDVADKHPQVVQRIAEIMNT 437 Query: 535 YMEILKKYP 543 +YP Sbjct: 438 AAVPNPRYP 446 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 122/501 (24%), Positives = 178/501 (35%), Gaps = 73/501 (14%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 + +PNV+VFL DD+G+ ++G G TP++D +A+ G+ T YS Sbjct: 9 AASLQASARPNVIVFLADDLGYGELGCYGQKKI---KTPNLDQLAADGMRFTDFYSGHAV 65 Query: 136 SSPTRATILTGQYSIHHGILMPPMYG------------------QPGGLQGLTTLPQLLH 177 +P+R +LTG+++ H + Q T L Sbjct: 66 CAPSRCVMLTGKHTGHSFVRENSEGRAAQAKERNRIKAADGYLPQIALPASEATYASALQ 125 Query: 178 DQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T +GKW +G + E P GFD F G+ S +W+ + P D Sbjct: 126 KSGYRTACVGKWGLGHPSNEGSPNKHGFDLFYGYIS------QWQAHYYYPTYLWRNDVK 179 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDL----DQRWMDYGVKFLDKMAKSDKP 292 E P +D R + K+ME + V + D+P Sbjct: 180 E-----PLEGNDGKVGRQYAADLMEQEALKFMETTGGGPFFLYYATPVPHVSLQVPPDEP 234 Query: 293 FFLYY--GTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 Y G Y + R Y + M+ L++ GQ NTL Sbjct: 235 SLAEYKQAFAGQDPPYDGRKSYLPTEDPRAIYAAMVTRMDRTLGKFRDLLKRTGQDQNTL 294 Query: 351 IVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIV- 404 I+FTSDNG G P RG K W+GG+R P W G IQP + V Sbjct: 295 IIFTSDNGATFNGGYDREFFGGNQPLRGMKTQLWDGGIRTPFIAAWPGSIQPGQVSRFVG 354 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL--NGK 462 DLFPT ++ G P +DGV G + Y+ G Sbjct: 355 ASWDLFPTFAEIVGFPVPAG---------LDGVSILPTLKGEVATQKQHDHLYWETVAGG 405 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 AVRM +K L +A +FNL TD E+ + +H Sbjct: 406 HQAVRMGPWKGIRL----------------GVIKNPSAPVQLFNLETDVSETTDVAAQHP 449 Query: 523 PMGVPLQTEMHAYMEILKKYP 543 + + T M A ++P Sbjct: 450 DIVAKIATIMSAGRVPSAEFP 470 >UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S8Q2_NEMVE Length = 540 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 119/496 (23%), Positives = 194/496 (39%), Gaps = 77/496 (15%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 P+++ L+DD+GW DVG++ ++ TP+ID +ASQG+ L S YSQP +P+R Sbjct: 30 SMAGPPHIMFILMDDLGWSDVGYHN--ISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSR 87 Query: 141 ATILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKES 197 ++TG+Y IH G+ + G + T+PQ L GY T IGKWH+G + + Sbjct: 88 GALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDY 147 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD F GF + + R + L F +D+ A G Q Sbjct: 148 TPLRRGFDSFLGFFAGEQDHW----------------RHSKMGFLDFRRDEEPANEYGGQ 191 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-- 315 + + + + +P FL H + Sbjct: 192 ------------HSTDVFTQEAIN-IAMRHNASQPLFLLLSYAAVHTPLQAHPNDVNKIG 238 Query: 316 ---SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 R +Y M + L ++NG +NTL+++ SDNG + P R Sbjct: 239 GVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGYNWPLR 298 Query: 373 GAKGSTWEGGVRVPTFVYWKGMI---QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 G K S +EGGVRVP FV+ G + + + + + D +PT + LAG Sbjct: 299 GYKSSLFEGGVRVPAFVH--GEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEV-------- 348 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHY-----------------FLNGKLAAVRMDEFK 472 IDGVDQ S R+ + F AA+R K Sbjct: 349 -EPDIDGVDQWPTL-SEGKPSKREEILHNIDIPANQEEERMAPRGFNYYSGAALRRGHMK 406 Query: 473 YHVLIQ--QPYAYTQSGYQGGFTGTVMQTAGS----SVFNLYTDPQESDSIGVRHIPMGV 526 + Y ++G++G +++ +++N+ DP+E + + + + Sbjct: 407 LVYKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITADPEERNDLSKLNPDIVD 466 Query: 527 PLQTEMHAYMEILKKY 542 L + +Y Sbjct: 467 SLWRRLQELNATSLEY 482 >UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF13_PLALI Length = 461 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 120/471 (25%), Positives = 183/471 (38%), Gaps = 60/471 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A E + ++PN+++ L DD G + G TP ID++ G+ Y Sbjct: 23 ATTETTSERRPNILLILSDDCGHAEFSIQGHP---RYKTPHIDSIGKNGVHFRQGYVSGC 79 Query: 135 SSSPTRATILTGQYSIHHG--ILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHM 191 SP+RA +L G+Y G +PP Y + GL T LPQLL + GY T A+GKWH+ Sbjct: 80 VCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLPQLLKEDGYRTIALGKWHL 139 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G + P GF D+ GF S Y P K Sbjct: 140 GYAPQFHPMERGFTDYYGFLQGSRSY------------------------FPLKKPTRLN 175 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 ++ AI + YM D D + ++ + +P+ +Y H N A Sbjct: 176 QMLRDRTAIPEEQFGYMTD---HLADEAIAYIKQW--QSQPWMMYLAFNATHSPNDATAV 230 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 ++ Y + ++ + L++ G +TL++F +DNG H Sbjct: 231 DLQAADGNKIY-AMTIALDRAVGKVLDALKECGLSKDTLVIFINDNGGAGG---HDNGSL 286 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPK 430 G KGSTWEGG R+P V + I + D V DLFPT LD+AG A++ + Sbjct: 287 HGKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGDAELKKIPFD 346 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 +DG+ G Q Y+ +GK A+R K Sbjct: 347 PEKLDGISLIPRMTGKT-QRLVDRPLYWKSGKRWAIRQGNLKAV---------------- 389 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +G Q +F+L +DP E ++ H L+ + L+K Sbjct: 390 --SGNDDQGDQVELFDLSSDPDEQRNLAATHPDELQQLEALYRKWESTLEK 438 >UniRef50_B7AM73 Putative uncharacterized protein n=1 Tax=Bacteroides eggerthii DSM 20697 RepID=B7AM73_9BACE Length = 523 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 126/526 (23%), Positives = 218/526 (41%), Gaps = 69/526 (13%) Query: 56 DNMMPVMQHPAQDKETQQKLAELEKKTG-KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPT 114 N++P + + + +KPNV+ + DD+G D+ G A T Sbjct: 5 SNLLPYLPSAILLAGCGNHSKQTAPSSQLQKPNVIYLISDDLGIGDLSCYG---ATKVST 61 Query: 115 PDIDAVASQGLILTSAYSQPS-SSPTRATILTGQYSIH---HGILMPPMYGQPGGLQGLT 170 P+ID +A QG+ T+AY+ S S+P+R +LTG Y GI P + Sbjct: 62 PNIDRLAGQGMQFTNAYATSSTSTPSRFGLLTGMYPWRQENTGI--APGNSELIIDTACV 119 Query: 171 TLPQLLHDQGYVTQAIGKWHMG--------ENKESQPQ--NVGFDDFRGFNSVSD----- 215 T+ + ++GY T A+GKWH+G N E +P ++GFD + D Sbjct: 120 TMADMFKEEGYCTGAVGKWHLGLGPKGGTDFNHEIKPNTQDIGFDYEFIIPATVDRVPCV 179 Query: 216 MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG----GEQQAIADITPK--YM- 268 V ++P+ ++ + + + P ++ +V+ G I + P+ +M Sbjct: 180 FVENAHVVGLDPKDPITVNYNHKVGDWPTGLENPESVKMKPSQGHNNTIINGIPRIGWMT 239 Query: 269 ---------EDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 ED+ F+ ++ ++PFFLY GT+ H P+ ++AG S Sbjct: 240 GGKSALWVDEDIADIITGKAKDFI--ISHKNEPFFLYMGTQDVHVPRVPHPRFAGKS-GL 296 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP--------HGRTP- 370 GD +++++ + +TL+ DNT+ VF SDNGP + +G TP Sbjct: 297 GPRGDVILQLDWTVGEIMRTLDSLNIADNTIFVFCSDNGPVIDDGYQDQALELLNGHTPM 356 Query: 371 --FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 +RG K S+++ G R+P V W I+P K + + D++ + L + + + Sbjct: 357 KHYRGGKYSSFDAGTRIPFIVRWPNGIKPGKQQALFSMIDVYASFAAL-------LDHQL 409 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 P D DQ FLGT+ LN L+ + +KY +P + Sbjct: 410 PTGVAPDSRDQLDSFLGTDTAGCNYIVQQNLNNTLSII-QHNWKYIEPSNKPALEYWTKI 468 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + G ++NL DP E +++ H L + Sbjct: 469 EMG------NNPEPQLYNLSIDPSEKNNVAKDHPDKVKTLSALLEE 508 >UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN Length = 455 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 116/473 (24%), Positives = 180/473 (38%), Gaps = 70/473 (14%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T +A +PN+V L DD+G DV + G + TP +DA+A+ G L Sbjct: 14 TLASVATTFATDAPRPNIVFLLADDLGGADVSWRGSPI----KTPQLDALANSGAKLEQF 69 Query: 131 YSQPSSSPTRATILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 Y QP SPTR+ +LTG+Y + +G+ + + + G TL + L D GY T +GK Sbjct: 70 YVQPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQDAGYETAIVGK 129 Query: 189 WHMGE-NKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 WH+G + P GFD G +N D +T RD ++ K ++ Sbjct: 130 WHLGHVSPAYLPMARGFDHQYGHYNGALDYFTHDRD-----------GGHDWHKDDHVNR 178 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 D+ +A V+ + KP FLY H Sbjct: 179 DEGYA--------------------THLIAQEAVRVIQD-RDKKKPLFLYVPFNAVHSPL 217 Query: 307 YPNAKYA----GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 YA R +Y + +++ + +++ LDNTL +F+SDNG Sbjct: 218 QVPESYAAPYGDMKKRRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEP 277 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG 421 P RG K + +EGGVRV F WKG I P K + + + D +PT ++LAG Sbjct: 278 GKLTDNGPLRGGKHTLYEGGVRVCAFASWKGRIAPGSKVEAPLHIVDWYPTLIELAG--- 334 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 + + +DG + T S + A+R+ ++K V Sbjct: 335 ----GSLQQAKPLDGRNIWPSIT-TGEPSPHDVIVCNITPTEGAIRVGDWKLVVH----- 384 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 +FNL D E + + M L+ Sbjct: 385 ------------NIGKPREKVELFNLSDDLAEQQNRATTNAKMLRKLRNRFDQ 425 >UniRef50_D2QW96 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QW96_9PLAN Length = 481 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 120/496 (24%), Positives = 202/496 (40%), Gaps = 62/496 (12%) Query: 90 VFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQY 148 + L DD+G+ DV PTP+ID +A +G+ T A+S +P+R ++TGQ Sbjct: 2 LILADDLGYGDVRCYNPD--AKVPTPNIDRLAREGMRFTDAHSPSTVCTPSRYGLMTGQM 59 Query: 149 SIHH---GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG------------- 192 G + + G G TLP +L QGYVT A+GKWH+G Sbjct: 60 PFRVPGGGTVFTGVGGPSLITPGRLTLPAMLQQQGYVTAAVGKWHVGLTFRDSSGEPIKT 119 Query: 193 ------------ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 E P + GFD F G + T+W ++ + +P S + Sbjct: 120 SGVDAVRRVDFSRRIEGGPIDHGFDHFFG--TACCPTTDWLYAFIDGDHIPTPPTSLLKR 177 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKY-MEDLDQRWMDYGVKFLDKMAK--SDKPFFLYY 297 S R G I P + M+++D ++ ++F+ + A+ KP FL++ Sbjct: 178 STLPSHPYSEDCRLG------LIAPDFEMQEVDTIFLQKSLEFIAEHARTSPQKPLFLFH 231 Query: 298 GTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 T+ H ++ + G + +GD + + + + L + L+K+G +NTL++ TSDN Sbjct: 232 ATQAVHLPSFAGKDFRGKTEV-GPHGDFLCQFDHIVGQLMQALDKHGLAENTLVILTSDN 290 Query: 358 GPEAEVPPHGRT--------PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLAD 408 GPE H R P+RG K WEGG R+P V W G ++P + + L D Sbjct: 291 GPETTTVVHMRADHQHDGAKPWRGVKRDAWEGGHRLPLIVRWPGHVKPNTTSAELTSLTD 350 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY--FLNGKLAAV 466 + T + G + + D ++ GT R F + ++ Sbjct: 351 IMATVAAITGAE-------LDRDAAEDSLNMLPVLEGTANAPIRPYLLTQAFGGARTLSI 403 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS-SVFNLYTDPQESDSIGVRHIPMG 525 R +KY + G F +++L TDP E+ ++ ++ + Sbjct: 404 RRGNWKYIDHRGSGGNNYEQGLMKPFALPDSAPEAPGQLYDLDTDPGETTNLYLQKPEIV 463 Query: 526 VPLQTEMHAYMEILKK 541 LQT + + Sbjct: 464 KQLQTLLEQTQTTGRS 479 >UniRef50_A0PKV5 Arylsulfatase, AslA n=5 Tax=Bacteria RepID=A0PKV5_MYCUA Length = 516 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 121/504 (24%), Positives = 204/504 (40%), Gaps = 42/504 (8%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 + KPN++V DD+G ++ +G TP ID +A++G+ T AY + S + RA Sbjct: 2 SNGKPNILVIWGDDIGITNLSCY-SDGLMGYRTPHIDRIANEGMRFTDAYGEQSCTAGRA 60 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 ++GQ G+ M G G T+ LL GY T GK H+G+ + P Sbjct: 61 AFISGQSVYRTGMSKVGMPDSDIGWSGQDPTIADLLKPLGYATGQFGKNHLGDRNKHLPT 120 Query: 201 NVGFDDFRGFNSVSDMY-----TEWRDVHVNPEVALSPDRSEYIKQLPFSK-----DDVH 250 GFD+F G + ++ P +A ++ + DD Sbjct: 121 VHGFDEFFGNLCHLNAEEEPELPDYPKSDRFPVLAELNRPRGVLRCWATEEVSDEPDDPK 180 Query: 251 AVRGGEQQAI--ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 G+Q+ + +T + ME +D + V F+ + A +D PFF++ H + Sbjct: 181 YGPVGKQRIVDTGPLTKQRMETIDDETTEACVDFIKRQAAADTPFFVWMNMTHMHLRTHT 240 Query: 309 NAKYAGSSPA-RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 + G + ++ Y D M++ + L L++ G +T++++++DNGP A P G Sbjct: 241 KPESVGQAGVWQSPYHDTMIDHDRNVGQLLDALDELGIAQDTIVIYSTDNGPHANTWPDG 300 Query: 368 -RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG---- 421 TPFR K + WEG +R+P + W G I S+ I+ D PT L AG P Sbjct: 301 ATTPFRSEKNTNWEGALRIPEMIRWPGKISAGVVSNEIIQHHDRLPTFLAAAGEPDIIEK 360 Query: 422 -------AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-KLAAVRMDEFKY 473 + + +D + + G +S R+ YF + + +R +K Sbjct: 361 LKKGHKISVRGDDKEFKVHLDAFNLLPYLTGEVEESPRQGFIYFSDDCDVLGIRFHNWKI 420 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR--------HIPMG 525 Q+ G + + +FNL TDP E I H + Sbjct: 421 VFQEQR-----CQGTLQVWAEPFIPLRVPKIFNLRTDPYERADITSNTYYDWFLDHDFIA 475 Query: 526 VPLQTEMHAYMEILKKYPPRAQIK 549 ++E K++PPR Sbjct: 476 FYGTAICTQFLETFKEFPPRHPPA 499 >UniRef50_A6DHS3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHS3_9BACT Length = 524 Score = 375 bits (962), Expect = e-102, Method: Composition-based stats. Identities = 133/527 (25%), Positives = 208/527 (39%), Gaps = 82/527 (15%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-S 132 L L KPN++ L DD+G+ D+ GG + PTP +D +A +G+ T A+ S Sbjct: 11 SLFCLSLSAQDKPNIIFILADDMGYGDMSNEGGLI----PTPHLDRMADEGMKFTDAHTS 66 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +PTR ILTG+Y+ + G P Q T+ L DQGY T +GKWH Sbjct: 67 SSVCTPTRYGILTGRYNWRSSKKKGVLSGTSAPLIPQDRVTIANFLKDQGYHTGMVGKWH 126 Query: 191 MGENKES------------------------------------QPQNVGFDDFRGFNSVS 214 +G + P + GFD F G + Sbjct: 127 LGIGWQMLDEAKKPEKSFLKEGYKMKNNKQAASWKVDYSKPAITPIHNGFDYFYGIAASL 186 Query: 215 DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQR 274 DM +P V + D++ ++ + R G D T M Sbjct: 187 DM---------SPYVYIENDKA--VEMATHERGFATPYRPGATGPSFDATYCLMT----- 230 Query: 275 WMDYGVKFL-DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVF 333 + D K++ + A KPFFLY H P+ K+ G SP +T YGD ++E + V Sbjct: 231 FADKSRKYIAQQAADKSKPFFLYLPLTSPHTPIMPSEKFLGKSPTKTIYGDFVMETDWVV 290 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNG--PEAEVPPH---GRTP---FRGAKGSTWEGGVRV 385 + L+K G DNTLIVFT+DNG P +P H G +P +RG K +EGG RV Sbjct: 291 GEVMAELDKQGIADNTLIVFTADNGCSPTGSIPEHIKIGHSPNGQWRGHKADIFEGGHRV 350 Query: 386 PTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFL 444 P V W +Q +SD + AK++ + T D + Sbjct: 351 PFLVRWPAQVQTKTQSDSTICTT-----DFFATAADAAKLSASIEDTMAEDSYSFYADLT 405 Query: 445 GTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG-SS 503 +G++ R + A+R ++K ++ G+ G + Sbjct: 406 -QSGKTKRPFTIHHSINGSFAIRQGKWKLNL------CPGSGGWSAPRPGKATKGLPLIQ 458 Query: 504 VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 +++L DP E +++ H + L ++ ++ + P Q Sbjct: 459 LYDLDGDPAEKNNLQDAHPEIVDNLVNQLAKEIKAGRSTPGAPQTNE 505 >UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI1_9BACT Length = 472 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 110/473 (23%), Positives = 189/473 (39%), Gaps = 78/473 (16%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 KPN++ L DD+G+ +VG+NG + TP++D +AS+G+ T Y +P+RA++ Sbjct: 20 KPNIIYILCDDLGYGEVGYNGQKMI---QTPELDKLASKGMRFTDHYCGNAVCAPSRASL 76 Query: 144 LTGQYSIHHGILMPPMY---GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 +TG++ H I GQ TL +L+ GY T IGKW +G + P Sbjct: 77 ITGKHPGHAFIRANSPGYPDGQTPIPADSETLGKLMKRAGYATACIGKWGLGGFHNAGNP 136 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G+ + + + + R GE++ Sbjct: 137 HKQGFDHFYGYTDQRKAHNYYPE---------------------------YLWRNGEKEM 169 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----DNYPNAKYAGS 315 + + + + +K++++ K D+PFFLY H + K Sbjct: 170 LNNKNGEENDYSHDLMTVDALKYIEE--KKDQPFFLYLAYLIPHVKYQVPDLAQYKDKDW 227 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP----HGRTPF 371 + M+ + + LE+ G DNTLI+F SDNG + + Sbjct: 228 PKEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIMFNSDNGAHGKSNSEKFFNTSGDL 287 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 +G K S ++GGVR P YW G IQ SD I D+ PT +L G P Sbjct: 288 KGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFWDMMPTFSELTGEPFKGET----- 342 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFL----NGKLAAVRMDEFKYHVLIQQPYAYTQS 486 DG+ LG + + + Y+ N A+R ++K VL ++ Sbjct: 343 ----DGISMLPTLLGKDSEQKQHKYLYWELYESNKPNCAIRFGKWKGVVLDRR------- 391 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM-HAYMEI 538 + ++++ D ES ++ ++ + ++ M A+++ Sbjct: 392 -----------KGLNIELYDMSGDQSESKNLAAQYPEVVDEIRKMMVEAHVKS 433 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 118/525 (22%), Positives = 184/525 (35%), Gaps = 122/525 (23%) Query: 71 TQQKLAELEKKTGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS 129 + L+ EK KP N V L+DD+G+MDVG N TP I+ +A G+ T+ Sbjct: 19 ADRSLSAAEKPKQNKPLNFVFILVDDLGYMDVGCNNPQ--TFYETPHINQLAKTGMRFTN 76 Query: 130 AYSQ-PSSSPTRATILTGQYSIH-----------HGILMPPMYGQPGGLQGLTTLPQLLH 177 Y+ P SPTR +I+TG+Y G +P L TT+ + L Sbjct: 77 GYAANPVCSPTRYSIMTGKYPTRVDATNFFSGKRAGKFLPAPLNDKMPL-SETTIAEALK 135 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTE--WRDVHVNPEVALSPDR 235 + GY T GKWH+G +E P+ GFD RG Y + + NP + Sbjct: 136 EHGYSTFFAGKWHLGPTQEFWPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRL------ 189 Query: 236 SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 E L R +F+D A D+PFF Sbjct: 190 ---------------------------TDGLKGEHLPDRLASETAQFID--AHRDEPFFA 220 Query: 296 YYGTRGCHFDNYPN----AKYAGSS-----------------------------PARTSY 322 Y H KY + Y Sbjct: 221 YLAFYSVHTPLMGPGPLVTKYKEKAKRLGLTGKEEFADEEQVFPVDEKRRVRILQNHAVY 280 Query: 323 GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWE 380 + M+ + + LE++G +NT+++ T+DNG +E P P RG KG +E Sbjct: 281 AAMVESMDKAVGKVLQQLEESGVAENTVVMLTADNGGLSTSEGSPTSNLPLRGGKGWLYE 340 Query: 381 GGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 GG+R + W G +P D V D +PT LDLAG P + +DGV Sbjct: 341 GGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTILDLAGLP-------LKPQQHLDGVSL 393 Query: 440 TSFFLGTNGQSNRKAEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 F G ++ + A+R+ ++K Sbjct: 394 KPFLQGEAPFKRDALYWHYPHYSNQGGIPGGAIRVGDWKLI------------------- 434 Query: 494 GTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + +++L D E + ++ ++ ++H + + Sbjct: 435 -ERFEDGQVHLYHLKEDLGEKQDLAEKYPERVAAMRKQLHKWYQE 478 >UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Planctomycetaceae RepID=A6CAR8_9PLAN Length = 501 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 124/536 (23%), Positives = 193/536 (36%), Gaps = 134/536 (25%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 K PN+++ + DD G+ D+G G + TP +D +A +G LTS Y P+ +P Sbjct: 32 KAAETPPNIIMIVSDDQGYRDLGSFGSEEIM---TPHLDRLAKEGAKLTSFYVTWPACTP 88 Query: 139 TRATILTGQYSIHHGILMPPMYGQP-------------------GGLQGLTTLPQLLHDQ 179 +R ++LTG+Y +GI P G LP LL Sbjct: 89 SRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPA 148 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVS-DMYTEWRDVHVNPEVALSPDRSEY 238 GYV+ GKW +G +K P GFDDF GF + D +T R + P + + Sbjct: 149 GYVSAIYGKWDLGIHKRFLPLARGFDDFYGFTNTGIDYFTHER--YGVPSMYRNN----- 201 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 Q + Y L QR V+F+ + KPFFLY Sbjct: 202 -------------------QPTEEDKGTYCTYLFQR---EAVRFIKE--NHQKPFFLYLP 237 Query: 299 TRGCHFDNYPNAKYAG-------------------------------------------- 314 H + + + G Sbjct: 238 FNAPHGASSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVS 297 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGA 374 +S R Y + M+D + L++ DNT++VF SDNG +P +G Sbjct: 298 ASKRRLEYVASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGSGGAD---NSPLKGK 354 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 KG +EGG+RVP V + I+P D ++ +L PT L A P +P+ Sbjct: 355 KGMMFEGGIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIP-------LPENVV 407 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 IDG D +G ++ + E Y+ + A R+ +K+ Sbjct: 408 IDGYDMLPVLMGKT--TSPRNEMYWQRREDKAARVGHWKW-------------------- 445 Query: 494 GTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 V GS +F+L D E + H ++ + + + PR + Sbjct: 446 --VESEKGSGLFDLSQDIGEKHDLSPTHPKKLEEMKNHFANWKKQMADAEPRGPFR 499 >UniRef50_A6DF76 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF76_9BACT Length = 542 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 118/532 (22%), Positives = 191/532 (35%), Gaps = 92/532 (17%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 KPN+V L DD+G D G N TP+IDA+A++G+ T + PTR Sbjct: 18 AADKPNIVFILADDMGIGDTNCYGDEKCRIN-TPNIDALAAEGVRFTDFHVNSSICGPTR 76 Query: 141 ATILTGQYSIHHGILM---PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE- 196 ++TG+Y G + P + P TL ++L GY T IGKWH+G Sbjct: 77 RALMTGRYPWRFGATVNNGPWGFCGPRPNTEKYTLGKVLKKAGYNTGYIGKWHLGTTMVT 136 Query: 197 ------------------SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 P GFD DMY + Sbjct: 137 KDGKKQGLTNVDYTKPLVYGPMQFGFDYSFILPGSLDMYPYA-----------------F 179 Query: 239 IKQLPFSKDDVHAVRGGE--QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 IK + + DV A++G + A + + + F+ K SD PFFL+ Sbjct: 180 IKDNDW-QGDVSALKGWSAFNRVGAAEISFESNKVVETFYRESELFIKKQ-NSDTPFFLF 237 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H P ++ G S YGD ++E++ A + + L++ G +NTLI+F+SD Sbjct: 238 LALTSPHTPVCPGEEWNGKSEL-GPYGDFVMEVDHSIARVKQALKEKGLYENTLIIFSSD 296 Query: 357 NGPEAEVPP--------------HGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQPRK 399 +GP G P +RG K S +EGG+RVP W G + Sbjct: 297 HGPAPYAGNILKATPNQISLLEQQGHYPAGIYRGYKFSIYEGGLRVPFIASWPGKTPKGQ 356 Query: 400 -SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 + ++ DLF T +L + + D + + +RK Sbjct: 357 ICNQLIGFNDLFATFAELTNI-------KLQEDEAPDSISFARLLTKPSSNGDRKDLIM- 408 Query: 459 LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS---------------- 502 + A+R E+K + +G + Sbjct: 409 QSVTSFAIRDGEWKLCLCPGSGIPANSENGKGNDPAPNAAWKKALEEFKGKPHQTDLLKA 468 Query: 503 ---SVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 +FNL DP+E +++ ++ + + + P ++K+D Sbjct: 469 PFVQLFNLAKDPEEKNNLASKNPRQVEKMINLFKKQIADGRSTPG-PKLKND 519 >UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM29_9BACT Length = 481 Score = 373 bits (959), Expect = e-102, Method: Composition-based stats. Identities = 112/491 (22%), Positives = 189/491 (38%), Gaps = 73/491 (14%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSP 138 KK ++PN+V+ L DD+G+ D+ G TP++D +A +G+ Y + P S Sbjct: 30 KKDTERPNIVLILCDDLGYGDLACYGHKQI---KTPNLDQMAKEGIRFNHFYSAAPVCSA 86 Query: 139 TRATILTGQYSIHHGIL----MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH---- 190 +R +LTG+ G+ P + T PQLL GY T GKWH Sbjct: 87 SRVGLLTGRSPNRAGVYDWIPHSSESSSPHMRKNEITFPQLLQKAGYATCLSGKWHCNGA 146 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 + ++QPQ+ GFD + + P K+ V+ Sbjct: 147 LINTNQAQPQDAGFDYWF---------------------------ATQNNAAPSHKNPVN 179 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD--KPFFLYYGTRGCHFDNYP 308 +R G + + Q + + +++ K + +PFF+Y H Sbjct: 180 FIRNGVELGPIEGFS------CQIVTNEAINWMEDHVKQNEKQPFFIYLSFHEPHEPIAS 233 Query: 309 NAK----YAG--SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE-- 360 K Y G + + Y + ++ +L L+K DNTL++FTSDNGPE Sbjct: 234 PQKIVDTYKGIAENTNQAEYFANVENLDKAVGSLMNQLKKLKINDNTLVIFTSDNGPETL 293 Query: 361 -----AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTAL 414 A +G K T E G RVP ++W I + SD ++ D FPT Sbjct: 294 NRYEAASRSYGSPGELKGMKLWTAEAGFRVPAIMHWPEKIATGQISDQVISALDFFPTFC 353 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL---NGKLAAVRMDEF 471 DLA +K N +DG + T ++ + N + A+R ++ Sbjct: 354 DLAQASNSKSLN-------LDGSNFTPALHKKKMTRHKPLLWIYYAALNERQVAMRHGDW 406 Query: 472 KYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 K + P Y + T + ++NL D E++ + ++ + Sbjct: 407 KISAKLNLP-RYHNITSKNFPKVTAATLSDYQLYNLSKDKSEANDLSNQNPKKSAQMIKF 465 Query: 532 MH-AYMEILKK 541 + Y ++L+ Sbjct: 466 LKLQYQDLLED 476 >UniRef50_Q15US6 Sulfatase n=3 Tax=Alteromonadales RepID=Q15US6_PSEA6 Length = 526 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 130/507 (25%), Positives = 210/507 (41%), Gaps = 59/507 (11%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-S 136 L+++ +KPNVV+F +DD+G+ D+ NG A+G TP++DA+AS+G+ T A+S S Sbjct: 35 LKQQASQKPNVVIFYVDDLGYGDISPNG---AIGVDTPNLDALASKGVNFTDAHSTASTC 91 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 +P+R ++LTG++ + P G TLP +L GY T IGKWH+G + Sbjct: 92 TPSRYSLLTGEHGFRQNAAILPGDAPALIRPGKATLPSMLQKAGYTTGVIGKWHLGLGEG 151 Query: 197 S---------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQL 242 S P +GFD + D V++ + + Sbjct: 152 SVDWNQDVKPGPLEIGFDYSFLLPATGDRVPTVYLEGHEVVNLESSDPIEVSYDHKVGDR 211 Query: 243 PFSKDDVHAVRGG----EQQAIADITPKYM------------EDLDQRWMDYGVKFLDKM 286 P D+ +R Q I + + E+ + V+F+++ Sbjct: 212 PTGVDNPELLRMKADLQHSQTIVNGISRIGSMSGGEKALWVDEEFPDVFSQKAVEFIERS 271 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 K PFFL++ + H PN ++ G S GD + +M+ V + + L G Sbjct: 272 KKD--PFFLFFSFQDIHVPRLPNERFKGKS-TMGPRGDAIAQMDWVVGRVMQALTTQGVA 328 Query: 347 DNTLIVFTSDNGPEAEVPPHGRT-----------PFRGAKGSTWEGGVRVPTFVYWKGMI 395 DNTL++FTSDNGP + PFRG K S +EGG RVP VYW G Sbjct: 329 DNTLVIFTSDNGPVLDDGYDDMAAEMLGEHLPAGPFRGGKYSVFEGGTRVPMIVYWPGNT 388 Query: 396 QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 +S ++ D++ + L P AK ID +D FLG + R Sbjct: 389 THIRSSALISQVDIYASLAGLVKQPLAKTE-------AIDSLDVMHAFLGKTNNA-RTYL 440 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 G L +R +KY I + + G + +F+L D E Sbjct: 441 LEEAVGTL-GLRKHNWKYIKAISKEKGL--PNWLGNKDIEMGFALTPQLFDLTDDVGEQT 497 Query: 516 SIGVRHIPMGVPLQTEMHAYMEILKKY 542 + + + ++ ++ +E +Y Sbjct: 498 NRAKDYPALVNAMEQKIQQLIEKGFRY 524 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 373 bits (957), Expect = e-101, Method: Composition-based stats. Identities = 113/517 (21%), Positives = 187/517 (36%), Gaps = 115/517 (22%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 +K + K+P++++FL+DD+GW D+ G TP +D +A +G T AY+ P S Sbjct: 233 KKASSKRPHIILFLIDDLGWNDIACYGSQF---YETPHLDKMAKEGFRFTDAYAANPVCS 289 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGL--------------QGLTTLPQLLHDQGYVT 183 PTRA+IL G+Y G+ P G TL + L + GY T Sbjct: 290 PTRASILLGKYPSRVGLSNHSGSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYKT 349 Query: 184 QAIGKWHMGENKE----SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 IGKWH+ + + P+ GFD N + + + P + Sbjct: 350 AHIGKWHLQAHHDTSRNHFPEKHGFD----LNIAGHRMGQPGSFYFPYKSKQHPSTN--- 402 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 V + G++ + L + D + ++ + D PFFL + Sbjct: 403 ---------VPDMADGQE----------GDYLTDKLTDKAIHYIKE--NKDTPFFLNFWY 441 Query: 300 RGCHFDNYPN--------------------------AKYAGSSPARTSYGDCMVEMNDVF 333 H P +A SS SY + M++ Sbjct: 442 YTVHTPIIPRQDLKKKYEAKANELGINKNQPGIPVLKSFARSSQNNPSYAAMVEAMDENI 501 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGP----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFV 389 ++KTL++ D T+I+F SDNG P + P + K +EGG+R+P + Sbjct: 502 GRIFKTLKELQIDDETIIIFCSDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFII 561 Query: 390 YWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 W G ++ V D++PT LD+ P +DGV TS G + Sbjct: 562 KWPGKKGGKELQAPVCTTDIYPTLLDMLKLPAK-------PEQHLDGVSLTSLMNGQAKE 614 Query: 450 SNRKAEHYFL--------NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG 501 R+A G AVRM ++K +T Sbjct: 615 LQREALFIHYPHYHHINSMGPAGAVRMGDYKLV--------------------EYYETGE 654 Query: 502 SSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++NL D E +++ + ++ + + Sbjct: 655 FELYNLKEDIGEMNNLVKEQPERAAQMLKKLEQWRQQ 691 >UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF72_9BACT Length = 481 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. Identities = 117/499 (23%), Positives = 187/499 (37%), Gaps = 102/499 (20%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSP 138 ++ KPNV++ L+DD+GW D G + TP++D ++ G+ T AYS SP Sbjct: 18 QENALKPNVIMILVDDLGWTDTTCYGSDL---YQTPNVDELSRTGMRFTDAYSACTVCSP 74 Query: 139 TRATILTGQYSIH-------HGILMPPMYGQPGGLQ-----GLTTLPQLLHDQGYVTQAI 186 TR++I+TG+ + G + P + + TL + GY T I Sbjct: 75 TRSSIMTGKNPANNNLTDWITGHVKPYAKLKSPNWKMHLTAEEITLAEAFKATGYKTVHI 134 Query: 187 GKWHMGENKESQPQNVGFDD----FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 GKWH+GE S P+N GFD+ FR + + + + NP + P Sbjct: 135 GKWHLGEESVSWPENQGFDENIAGFRAGSPSAHGGGGYFSPYNNPRLKDGPKG------- 187 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 E L +R +++ AK KPFF+ Sbjct: 188 --------------------------EYLTERLAQEASQYIQSTAKLKKPFFMNLWLYNV 221 Query: 303 HFDNYPNAK---------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 H + G Y + M+D + + ++ G DNT+I+F Sbjct: 222 HTPLQARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVKDAGIEDNTIIIF 281 Query: 354 TSDNGPEAEVPPH------GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDL 406 SDNG + P R KG +EGGVRVP + W I+ + S V Sbjct: 282 NSDNGGLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKAGQTSSSPVIS 341 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTS-FFLGTNGQSNR---KAEHYFLNG- 461 D++PT LDL V K IDG+ G Q + HY L G Sbjct: 342 HDIYPTLLDLCKID-------VSKKQDIDGISLVPELLEGKTIQRDALYWHYPHYHLEGA 394 Query: 462 -KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR 520 +A+R ++K L ++ +A ++NL D E +++ + Sbjct: 395 KPYSAIRKGDWKLIFLYEESHA--------------------ELYNLRNDISERNNLAMT 434 Query: 521 HIPMGVPLQTEMHAYMEIL 539 L ++ + + + Sbjct: 435 EKRKLAELMGDLRTWKKKI 453 >UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI7_BACV8 Length = 508 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. Identities = 117/505 (23%), Positives = 194/505 (38%), Gaps = 88/505 (17%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + KPN++ + DD+G+ D+G G TP+ID +A +G+ T AY+ P S+P Sbjct: 22 AQKTPKPNIIYIMCDDMGYGDLGCYGQPYIS---TPNIDNMAKEGMRFTQAYAGSPVSAP 78 Query: 139 TRATILTGQYSIHHGILMPPMY------------------GQPGGLQGLTTLPQLLHDQG 180 +RA+ +TGQ+S H + Y GQ G +P+++ D G Sbjct: 79 SRASFMTGQHSGHCEVRGNKEYWRDAPVVMYGNNKEYAVVGQHPYDPGHVIIPEIMKDNG 138 Query: 181 YVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 Y T GKW G S P G D++ G+ + + + +N + D + Sbjct: 139 YTTGMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNF-LNRYSKSAGDTAVVR 197 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 + + + + ++ P+Y D+ + +K+LDK +PFF + Sbjct: 198 VVMDENINYPMFGKDYFKR------PQYSADMIH---EEAMKWLDKQ-DGKQPFFGIFTY 247 Query: 300 RGCH-----------------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANL 336 H + ++Y S + + ++ + Sbjct: 248 TLPHAELAQPEDSILTGYQKKFFEDKTWGGQEGSRYNPSVHTHAQFAGMITRLDYYVGEV 307 Query: 337 YKTLEKNGQLDNTLIVFTSDNGPEAEVPPH----GRTP-FRGAKGSTWEGGVRVPTFVYW 391 L++ G +NT+++FTSDNGP E GR RG K +EGG+R+P V W Sbjct: 308 LNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCYEGGIRIPFIVRW 367 Query: 392 KGMIQ-PRKSDGIVDLADLFPTALDLAGHPG--AKVANLVPKTTFIDGVDQTSFFLGTNG 448 G + +D + DL PT DLAG K N + DG+ LG G Sbjct: 368 PGKVPEGTVNDHQLAFYDLMPTFCDLAGVKNYVKKYTNKKKDVDYFDGISFAPTLLGQEG 427 Query: 449 QSNRKAEHY-FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNL 507 Q ++ F VRM ++K V P+ ++NL Sbjct: 428 QKKHDFLYWEFDETDQIGVRMGDWKMVVKKGTPF----------------------LYNL 465 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEM 532 TD E I H + ++ + Sbjct: 466 ATDIHEDHDIAAGHPDIVKQMKEII 490 >UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2_9BACT Length = 539 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 116/558 (20%), Positives = 196/558 (35%), Gaps = 96/558 (17%) Query: 35 KGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLD 94 K F D+ N + + + ++P+ + K +KPN++ + D Sbjct: 11 KSFLKTDNENLKPINMISKLTKTLLPITALGCVQGNA------MTPKKQQKPNIIYIMCD 64 Query: 95 DVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHG 153 D+G+ D+G G + TP+ID +A +G+ T AY+ P S+P+RA ++TGQ+S H Sbjct: 65 DMGYGDLGCYGQKYIL---TPNIDRMAKEGMRFTQAYAGAPVSAPSRACLMTGQHSGHTE 121 Query: 154 ILMPPMY------------------GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-N 194 + Y GQ LP+++ D GY T GKW G Sbjct: 122 VRGNKEYWTNSKPVYYGENKDFSVVGQHPYDPNHIILPEIMKDNGYRTGMFGKWAGGYEG 181 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 S P G DDF G+ + + + Y ++ + V Sbjct: 182 SLSTPDKRGVDDFYGYICQFQAHLYYPNFL----------NEYYKERGDTAVKRVVLTEN 231 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN-------- 306 D K + + + +L K DKPFF + H + Sbjct: 232 INHPMFGDEYFKRTQYSADLIHQHAMDWL-KAQTKDKPFFGVFTYTLPHAELTQPDDSLV 290 Query: 307 -------YPNAKYAGSSPAR--------TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 + + + G +R + + ++ + K L++ G DNTL+ Sbjct: 291 AFYKKQFFTDKTWGGQEGSRYNAVVHTHAQFAAMITRLDSYVGEILKLLDERGLADNTLV 350 Query: 352 VFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVD 405 +FTSDNGP E + RG K +EGG+R+P W G I+ S+ Sbjct: 351 IFTSDNGPHEEGGADPSFFNRDGKLRGIKRQCYEGGIRIPFIARWNGHIKAGVESNLPFA 410 Query: 406 LADLFPTALDLAGHPGA--KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL--NG 461 DL PT ++ G + N + DG+ + + Y+ Sbjct: 411 FYDLMPTFAEMVGVKDYVQRYRNKKKTIDYFDGISILPTLINDGIGQKKYPYLYWEFAET 470 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 AVRM ++K + P+ ++NL D E I H Sbjct: 471 DQTAVRMGDWKLITIHGIPH----------------------LYNLSNDLHEDHDIANEH 508 Query: 522 IPMGVPLQT-EMHAYMEI 538 + + + + Sbjct: 509 PDIVQKMIEIALKEHTNS 526 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 118/492 (23%), Positives = 179/492 (36%), Gaps = 82/492 (16%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 E T KPNV+ L DD+G D+G G TP+ID +A+ G+ Y+ + Sbjct: 20 EPATPAKPNVIFILADDLGIGDLGCYGQQKI---RTPNIDHLAADGMRFLQHYTGCSVCA 76 Query: 138 PTRATILTGQYSIHHGIL-----MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 P+R ++TG++ H I P GQ Q T+ +L+ + GY T IGKW +G Sbjct: 77 PSRCALMTGRHMGHAAIRDNAQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLG 136 Query: 193 ENKESQ-PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 ++ P+++GF+ G+ S +T + P + E + P + Sbjct: 137 MPEDHSSPRDMGFNYSFGYLCQSMAHTYY------PPYLWRNNERETLAGNPSYDVSMKG 190 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH-------- 303 V + + + +KF+ DKPFFLY H Sbjct: 191 VIEPKGEIYSH----------DVMASDALKFVRDHH--DKPFFLYLAFTIPHLSLQVPED 238 Query: 304 -----------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 YA + R +Y + M+ L L++ G DNTL+ Sbjct: 239 SMSEYHGQWTETPFRNTKHYANNETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVF 298 Query: 353 FTSDNG------PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVD 405 F+SDNG V FRG K +EGG+R P W G I+ +D Sbjct: 299 FSSDNGAVFPLAGTDPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASV 358 Query: 406 LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN---GK 462 D PT +L G VP DG+ LG Q + Y+ G Sbjct: 359 FYDFLPTMAELNG---------VPPPADTDGLSYLPTLLGKPAQQKQHDFLYWEYQSAGG 409 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 AVRM ++K + A V+NL +D ES + H Sbjct: 410 AVAVRMGDWKAIANKIK----------------KNPNANFEVYNLASDRTESHDVAAEHP 453 Query: 523 PMGVPLQTEMHA 534 + + + Sbjct: 454 EIVAKAREIIAR 465 >UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO Length = 503 Score = 372 bits (955), Expect = e-101, Method: Composition-based stats. Identities = 116/507 (22%), Positives = 203/507 (40%), Gaps = 69/507 (13%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 Q + ++ +PN+V L DD G+ DVG++G + TP +DA+++ G+ L + Y Sbjct: 20 QSSASAGTRQDAGQPNIVFVLADDFGFHDVGYHGSEI----HTPTLDALSASGVRLENYY 75 Query: 132 SQPSSSPTRATILTGQYSIHHGILMPPM-YGQPGGLQGL-TTLPQLLHDQGYVTQAIGKW 189 QP +PTR+ +++G+Y IH G+ + QP L TL L + GY T +GKW Sbjct: 76 VQPICTPTRSQLMSGRYQIHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKW 135 Query: 190 HMGENK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 H+G K E P N GFD + G+ + ++ Y D + ++ Sbjct: 136 HLGFYKQEYLPWNRGFDTYFGYLNAAEDYFNHNVPWRQVRYLDLRDNNGPVRN------- 188 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 T +Y L + + + + + KP FLY + H Sbjct: 189 --------------ETGQYSAHL---FTGKAIDVV-QSHNTSKPLFLYLAYQSVHAPLEV 230 Query: 309 NAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 KY R ++ + +++ ANL + L+ G +NT+++F++DNG + Sbjct: 231 PEKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGLWNNTVLIFSTDNGGQIHA 290 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGA 422 + P RG K S WEGG FV + + S G++ ++D FPT + LAG Sbjct: 291 GGN-NYPLRGWKASLWEGGFHGVGFVSGGALKRSGAVSKGLIHVSDWFPTLVTLAG---- 345 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-----------------FLNGKLAA 465 + T +DG +Q S R+ + + AA Sbjct: 346 ---GNLNGTKPLDGFNQWDTISNET-PSPREILLHNIDILYPQKGVPLYSNTWDTRVRAA 401 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS---SVFNLYTDPQESDSIGVRHI 522 +R+ ++K ++ + +Q + + +FN+ DP E + + Sbjct: 402 IRVGDYKLITGDPGNGSWVPPPDGHLYFVPEIQESAAKNVWLFNITADPNEHNDLSSEKP 461 Query: 523 PMGVPLQTEMHAYMEILKKYPPRAQIK 549 + L + + PPR Sbjct: 462 LEVLRLLQILVQFNNTAV--PPRYPAP 486 >UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organisms RepID=Q1YSH0_9GAMM Length = 557 Score = 371 bits (954), Expect = e-101, Method: Composition-based stats. Identities = 118/515 (22%), Positives = 190/515 (36%), Gaps = 84/515 (16%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGN-PTPDIDAVASQGLILTSA 130 Q + + PN+++ L DD+G+ D+ GG A G+ TP+ID +A QG+ + Sbjct: 50 QGPASAETTPAKRPPNIILILTDDMGFNDISLYNGGAADGSLQTPNIDRIAEQGIRFNNG 109 Query: 131 YSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP-------------------------- 163 Y+ + +RA++LTG+YS G+ P+Y Sbjct: 110 YAANAVCTSSRASLLTGRYSTRFGVEYTPIYKTGVRIFNWMEELNPSTPPVLVDMDLAAT 169 Query: 164 -------GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDM 216 G T+ ++L Q Y T IGKWH+G N + +P+ GFDD + Sbjct: 170 LPPIDALGMPAAEITIGEVLQQQDYYTAHIGKWHLGSNGDMRPEQQGFDDSLSMKGI--- 226 Query: 217 YTEWRDVHVNPEVALSPDRSEYIK-QLPFSKDDVHAVRGGEQQAIADITPKY--MEDLDQ 273 L PD + + ++P D G + + P + L Sbjct: 227 ------------FYLPPDHPDVVNAKIPGDSIDSMVWAVGSYEVQWNGGPPFEPKGYLTD 274 Query: 274 RWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG-----SSPARTSYGDCMVE 328 + D V ++ A +PFFLY G H + + +Y + Sbjct: 275 YFTDAAVDVIE--ANRHRPFFLYLAHWGPHNPVQASREDYDALPHIKDHRLRTYAAMLRA 332 Query: 329 MNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAKGSTWEGGVRVPT 387 ++ + +L++NG DNTLI+FTSDNG + P+RG K + +EGG VP Sbjct: 333 LDRSVEKIEASLQENGLSDNTLIIFTSDNGGAGYLDLTDLNKPYRGWKLTHFEGGTHVPY 392 Query: 388 FVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT 446 W I+ + SD + D+F T AG VP +DGV+ F G Sbjct: 393 MAKWPAQIEAGQSSDEAIHHIDMFHTIAAAAGAS-------VPTDRTLDGVNLLPFMQGK 445 Query: 447 NGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 + K ++ G V +K Q +F+ Sbjct: 446 QTGAPHKTL-FWHTGHQQTVWHQGWKMIRAEQSDKPGADPMVF--------------LFD 490 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 L DP E +++ L + + K Sbjct: 491 LNNDPTEQNNLIAEQPEKAAELTALLDTHHAQQAK 525 >UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR02_9SPHI Length = 595 Score = 371 bits (954), Expect = e-101, Method: Composition-based stats. Identities = 117/483 (24%), Positives = 188/483 (38%), Gaps = 70/483 (14%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + + PNV++ L DD G D+G +G TP+ID Q + LT + P +PTR Sbjct: 24 QKKQAPNVILILTDDQGIGDLGCHGNPWL---KTPNIDKFYEQSVRLTDFHVSPLCTPTR 80 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 A I+TGQY I +G G+ +G T+ + GY T GKWH+G+N +P Sbjct: 81 AAIMTGQYPIRNGAWAT-YKGRDALSKGQLTMADVFKSAGYSTALFGKWHLGDNYPVRPS 139 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GFD V + S+Y F DDV+ V +Q Sbjct: 140 DSGFDH-----------------VVQHLAGGIGELSDYWGNSYF--DDVYYVNNQPKQ-- 178 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY-------PNAKYA 313 + W +KF+++ + ++PFF+Y H P K+ Sbjct: 179 ------FQGYCTDVWFSEAMKFINQQ-EKEQPFFIYLPLNAPHDPLIVDEKYAAPYKKFE 231 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG----RT 369 GS + + +++ F K L+K G NT++++ SDNG G Sbjct: 232 GSEIIDANLYGMIANIDENFGKFRKFLKKKGLDKNTILIYMSDNGTRFGYSRDGKLGYNY 291 Query: 370 PFRGAKGSTWEGGVRVPTFVYWK-GMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANL 427 +G KG +EGG RVP F+ W G I+ K + DL PT L G P Sbjct: 292 HLKGMKGDKFEGGHRVPFFIQWMDGGIEGGKDIRSLSAHVDLIPTLAKLCGIP------- 344 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 +PK DG+D + +R + +++ Q Sbjct: 345 LPKNQAFDGIDLSGVLTKNEKPKDRSVFVHHRQ---------DWR---------PPLQEK 386 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 G ++N+ TDP ++ ++ + + L E ++ + K YP + Sbjct: 387 GTCVLKNEWRLINGYQLYNMKTDPLQTTNVAEENKELVEALLEENKSFYQQTKTYPTFYE 446 Query: 548 IKS 550 + S Sbjct: 447 LPS 449 >UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW5_9SPHI Length = 523 Score = 371 bits (954), Expect = e-101, Method: Composition-based stats. Identities = 105/500 (21%), Positives = 171/500 (34%), Gaps = 83/500 (16%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 +T PN++ DD+G+ ++G G TP++D +A +G+ T Y+ P +P Sbjct: 41 PRTAVSPNIIYIYADDLGYAELGCYGQQKI---RTPNLDKLAREGIRFTQHYTSMPVCAP 97 Query: 139 TRATILTGQYSIHH---------GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 R +LTG++S H G GQ G T+ +LL QGY T +GKW Sbjct: 98 ARCMLLTGKHSGHSYIRGNYEMGGFPDSLEGGQMPLYPGAFTIGRLLQQQGYKTACVGKW 157 Query: 190 HMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 MG N P GFD F G+ + + P + + + Sbjct: 158 GMGMANTTGNPNEQGFDYFYGYLDQKQAHNYY------PTHLWENGKPDKLNNPVIDVHR 211 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF---- 304 +A A + + F+ + PFFLY H Sbjct: 212 RLTPETATPEAFAYFRGN--DYAIDKLAQKAQAFIRQ--NKSGPFFLYLPFTAPHVSLQA 267 Query: 305 ---------------------DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 YA + R +Y + M+ L + L+ Sbjct: 268 PEAAVKEYIGKFGDGEQRTERPYLGEQGYASTPYPRATYAAMITHMDAQIGQLMQLLKDL 327 Query: 344 GQLDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP- 397 +NTL++F+SDNG + RG K +EGG+R P W G I+P Sbjct: 328 KIDENTLVMFSSDNGATFNGGVEAAYFNSVGKLRGLKMDVYEGGIREPMLARWPGRIKPN 387 Query: 398 RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY 457 + +D + DL T +L G+ DG+ LG + + Y Sbjct: 388 QTTDHVSVQYDLLATLAELVGYKRPFAT---------DGISFLPTLLGQSSSQKQHPFLY 438 Query: 458 FLN---GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 + G A+RM +K + +T +++L D E+ Sbjct: 439 WEYPEKGGQLAIRMGNWKAVKTNVR----------------KDRTTPWELYDLNKDVSET 482 Query: 515 DSIGVRHIPMGVPLQTEMHA 534 +I +H + + Sbjct: 483 TNIADKHPDIIRQANAIVAR 502 >UniRef50_Q1VDY3 Probable sulfatase n=2 Tax=Vibrio alginolyticus RepID=Q1VDY3_VIBAL Length = 483 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 114/449 (25%), Positives = 200/449 (44%), Gaps = 37/449 (8%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 LE + PNVVV L+D++GW ++ G G T ++D +A +G+ LT+ +P + Sbjct: 20 LEVVAEETPNVVVMLVDNLGWGELSSYGST--RGVETKNLDQLAREGVRLTNFNVEPQCT 77 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 PTR++ +TG+ ++ G G + T+ + +QGY T GKWH+G+ K Sbjct: 78 PTRSSFMTGRRALRSGTDKVVWGVPYGMVNWEITIAEKFKEQGYNTSLYGKWHLGDQKGR 137 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P + GFD++ G + +D E + P + + + A G + Sbjct: 138 FPTDQGFDEWYGIANTTDE----------SEYSSQPGYKAILPKPQI----LSARAGQDP 183 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 + + + +D +++ F+++ K +KPFF H P+ + G + Sbjct: 184 KGVKEYNLDSRRTIDSELVEHATDFINRNVKENKPFFSVITFTQPHLPTLPHPDFIGKT- 242 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKG 376 + +Y D + E++ + +EK G DNTL+++ SDNGPE +P G + P+RGA Sbjct: 243 GKGNYSDVLAEIDFRAGQVIGAIEKAGIKDNTLVIWFSDNGPEWHMPYQGSSGPWRGAYF 302 Query: 377 STWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 + EG +R P W I+P R SD I+ + DLF + + G+ +P ID Sbjct: 303 TALEGSLRTPFIASWPNHIKPGRVSDEIIHVVDLFASLSHVGGY-------KLPSDRTID 355 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 +DQ +F G + +SNR + A + + +K H + Q Sbjct: 356 SIDQWAFLKGDSEKSNRDGFIVNNGSETYAYKWENYKMHFIDQ-----------DIMPEK 404 Query: 496 VMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 ++NL DP+E + + Sbjct: 405 GRPLQIPEIYNLIDDPKEEFDLRNNATWL 433 >UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000588CF9 Length = 545 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 122/499 (24%), Positives = 198/499 (39%), Gaps = 70/499 (14%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 + + + P++V L DD G+ D+G+ TP++D +A++G+ L + Y QP Sbjct: 50 QPSRNPRRPPHIVFILADDYGFNDIGYRN----PAMRTPNLDYLAAEGIKLDNYYVQPIC 105 Query: 137 SPTRATILTGQYSIHHGILMPPMY-GQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE- 193 +P+RA +++G+Y IH G+ ++ QP L L TLPQ L + GY T GKWH+G Sbjct: 106 TPSRAQLMSGKYQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFY 165 Query: 194 NKESQPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 KE P N GFD F G D + + P Y P+ D Sbjct: 166 KKECWPTNRGFDSFLGILLGKGDHFLHTEEGGGGP----------YPSTWPWEGLDF--- 212 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 R G Q A Y + V+ + + DKP FLY + H Y Sbjct: 213 RDGLQSTNA-----YSGIYSTHVIAERVENIIEKHDKDKPLFLYVSFQAVHTPLQVPESY 267 Query: 313 AG------SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH 366 R Y M++ N+ K L+K G D+T++VF+SDNG + Sbjct: 268 LQPFESSIQDEKRRIYAGMTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNGGNIDQGA- 326 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWK---GMIQPRKSDGIVDLADLFPTALD-LAGHPGA 422 P RG+K + WEGGVR FV ++ S ++D++D +PT ++ +AG + Sbjct: 327 SNWPLRGSKTTLWEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTLIEGVAGWTLS 386 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH----------------------YFLN 460 T +DG + + + H F Sbjct: 387 --------GTKLDGYNIWETLRSGKPSARVELLHNIDPLITPPSTWPNESIAAAHNSFST 438 Query: 461 GKLAAVRMDEFKY---HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 AA+R ++K + I + + ++ +FN+ DP+E + Sbjct: 439 RTYAALRYKDWKIVTGYXSINNGWYSPAESSKQSVASEILPGKSVWLFNITRDPREFHDL 498 Query: 518 GVRHIPMGVPLQTEMHAYM 536 + + L + +Y Sbjct: 499 SNQEPAIVNFLLERLESYQ 517 >UniRef50_UPI0001745666 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745666 Length = 497 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 116/491 (23%), Positives = 186/491 (37%), Gaps = 57/491 (11%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTR 140 +PN++ L+DD+G+ D+G G TP ID +A++G+ LT Y+ +P+R Sbjct: 34 AADRPNIIYILVDDMGYGDLGCFGQKTFT---TPHIDRMAAEGMKLTRHYAGSTVCAPSR 90 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 +LTG ++ H + ++ P T+P LL GY T GK+ +G + P Sbjct: 91 CVLLTGLHTGHCRVRGNGLWTMPDSD---VTVPNLLKQAGYATACFGKYGLGKPLPDDDP 147 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHV-NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD F G+ S + + + N + + +E + +D A G +Q Sbjct: 148 NRKGFDTFFGYVDTSHAHNFYPTYLIRNGQRVALNNVTEPGSRKAGHEDTGFATVDGRRQ 207 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--------------- 303 Q D +L A +PFF+YY H Sbjct: 208 FAP-----------QLIADELQTYLRDRAAGKQPFFVYYALNMPHANNEAGKNSPLKHGM 256 Query: 304 -FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 +Y + M ++D + L+K G NTL++FTSDNGP AE Sbjct: 257 EVPSYGEYANKDWPDVEKGFASAMRFVDDQVGAVLAALKKAGLDQNTLVMFTSDNGPHAE 316 Query: 363 VPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDL 416 F G K S +GG+RVP W I+ R +S+ + DL PT DL Sbjct: 317 GGHSSDFFDSNGAFSGIKRSMTDGGIRVPLVARWPAAIKARGESEHVSGFQDLLPTVADL 376 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF----LNGKLAAVRMDEFK 472 AG DG+ G +G+ + ++ GK A +R +K Sbjct: 377 AGAKLEGET---------DGLSLVPTLTGKDGEQKQHKYLFWNFDEQGGKRAVLRW-PWK 426 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 L Q+ G ++ ++NL D E +++ + L+ M Sbjct: 427 LIHLNTGTARMGQNAG-GKPQPVQPKSLEVQLYNLEEDVGEQNNLASLQPGIVSELEGYM 485 Query: 533 HAYMEILKKYP 543 + P Sbjct: 486 KEAWRAPQTQP 496 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 114/478 (23%), Positives = 174/478 (36%), Gaps = 57/478 (11%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 L L KPN++ L DD+G+ D+G G V TP +D +A+ G+ T YS Sbjct: 9 LFALNLSAADKPNIIYILADDLGYGDLGCYGQKVI---QTPHLDKMAANGMKFTQHYSGS 65 Query: 135 -SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 P+R+ +L G++S + + M L P+ L GY T IGK MG Sbjct: 66 TVCGPSRSCLLEGKHSGNTYVRGNGMLQMRQDPHDLI-FPKALQKAGYHTAMIGKSGMGC 124 Query: 194 NKE--SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 N + + P GFD F GF S + + + P D + ++ + + +H Sbjct: 125 NTDDAALPYQKGFDYFFGFTSHTQAHWFF------PTHLWKNDGK--VTKVEYPNNTLHE 176 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG-------CHF 304 + + + Y+E + F A R Sbjct: 177 GDNYSSEVVMNEALDYVERQKDGPFFLHLAFQIPHASLRAKEEWKAKYRPILKEKLLPKK 236 Query: 305 DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 D +P+ Y +T++ + M+ L K LE G +NTLI+F SDNG E Sbjct: 237 DKHPHYSYE--REPKTTFAAMVSYMDHNVGLLNKKLEDLGLAENTLIMFASDNGAMQEGG 294 Query: 365 P-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 RG K +EGGVR P YW G I+ SD I D+ PT +LAG Sbjct: 295 HKRDSFDSNGVLRGGKRDMYEGGVRTPMIAYWPGKIKAGQTSDHISAFWDISPTVRELAG 354 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDEFKYHVL 476 + DG+ LG Q+ + +F G A+RM ++K + Sbjct: 355 AKVQEDT---------DGISFVPTLLGKGSQTKHDYLYWEFFEQGGKRAIRMGKWKLIL- 404 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + +F+L D E + + L M Sbjct: 405 ---------------YKTNTDLNPKMELFDLEADISEQKDLSKQLPEKVSALLKLMDK 447 >UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7_9SPHI Length = 589 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 114/506 (22%), Positives = 197/506 (38%), Gaps = 90/506 (17%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 K PN+++ + DD G+ D GF G TP ID +A T+ Y P +PTRA Sbjct: 28 AQKPPNIILIITDDQGYGDFGFTGNKHVS---TPTIDQLAENSFEFTNFYVSPVCAPTRA 84 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +++TG+YS+ GI G T+ +LL Y + GKWH+G+N +P + Sbjct: 85 SLMTGRYSLRTGIRDTYNGG-AMMSPDEITIAELLQKSDYTSGIFGKWHLGDNYPMRPSD 143 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GFD+ +H++ + D + Y ++ D V ++ Sbjct: 144 QGFDESL--------------IHLSGGMGQVGDFTTYFQKDRSYFDPVLWHNNRQE---- 185 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS------ 315 Y + ++F++K D+PFF Y H +Y Sbjct: 186 ----SYQGYCSDIFASAAIEFIEK--NKDQPFFTYLSFNAPHTPLQVPEEYYQKYKNIDT 239 Query: 316 -----SPARTSY-------------GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDN 357 S R Y + ++D NL+ L++ D T+I+F +DN Sbjct: 240 STGYESDERPFYPMSDSQKEEARKVYAMVENIDDNLKNLFAKLKELEIEDETIIIFLTDN 299 Query: 358 GPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDL 416 GP+ + G RG KG+ ++GG+R P ++ + + RK + + D+ PT DL Sbjct: 300 GPQQQRYLAG---LRGLKGNVYQGGIRTPLLIHIPEKLSENRKINTLSAHIDILPTIADL 356 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL------AAVRMDE 470 G +P IDG +G ++ + N K +++ E Sbjct: 357 VGIQ-------LPLDRKIDGKSLLPLLIGEVDSFENRSLFSYWNRKFPEKYSNISIQNSE 409 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K G T ++NL DP E ++ I G+ L+ Sbjct: 410 WKLV----------------GKTDYDASIEDFQLYNLKEDPYEQSNLITSKISKGLELKN 453 Query: 531 EMHA-YMEILKKY----PPRAQIKSD 551 E+ Y+E++ + PP+ + ++ Sbjct: 454 ELDQLYLELISEENLINPPKIHVGNE 479 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 118/486 (24%), Positives = 185/486 (38%), Gaps = 84/486 (17%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSS 137 ++ PN++ L+DD+G+ D+ G A +P+IDA+A+ G+ T+ Y+ S Sbjct: 35 AQERPDAPNILCILVDDLGYGDLSCQG---ATDLQSPNIDALAANGMRFTNFYANSTVCS 91 Query: 138 PTRATILTGQYSIHHG----ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 P+RA +LTG+Y G I P +P L+ GY T IGKWH+G Sbjct: 92 PSRAALLTGRYPDLVGVPGVIRQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLGL 151 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 + P + GF F+GF DM ++ D + +R E Sbjct: 152 EEPDTPNDRGFTYFKGFLG--DMMDDYWDHRRGGINWMRLNREE---------------- 193 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 I PK + D+ + FL + ++PFFLY HF P ++ Sbjct: 194 ---------IDPK--GHATDLFTDWTIDFLKERQGEEQPFFLYLAYNAPHFPIQPPREWL 242 Query: 314 GS--------SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 + R + ++ + + L+ G +NTL+VF SDNG Sbjct: 243 DKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGGAL-WYA 301 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKV 424 P RG K +EGG+RVP YWKG I P SD L DLFPT +LAG + Sbjct: 302 QSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPPEN 361 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-------FLNGKLAAVRMDEFKYHVLI 477 +DG+ G + + ++ + A R +FK +L Sbjct: 362 ---------VDGISLVPTLTGQAQDTANRYLYWVRREGGDYGGQAYYAARFGDFK--ILQ 410 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 P+ FN+ D E+ + L+ ++ ++ Sbjct: 411 NTPF------------------EPIQFFNIGQDELETTPL-ETDSEAYRALRAQLMEHIR 451 Query: 538 ILKKYP 543 P Sbjct: 452 TAGGVP 457 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 98/474 (20%), Positives = 175/474 (36%), Gaps = 64/474 (13%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 + + +PNV++ + DD G+ D+ +G TP++D +A+Q + L + + P + Sbjct: 49 AQTASESRPNVILVVTDDQGYGDMSCHGNPWLN---TPNLDRLATQSVRLENFHVDPFCT 105 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 PTRA ++TG+Y G G+ TT+ + + GY T GKWH+G+ Sbjct: 106 PTRAALMTGRYCTRVGAWA-VTEGRQLLDPDETTMAETFRESGYRTGMFGKWHLGDPPPF 164 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P+ G + T R + + +P ++Y + + G Sbjct: 165 APRERG------------LETVVRHMAGGADEIGNPTGNDYFDDTYYRNGTPESFDG--- 209 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY----- 312 Y D+ W + + F+ K +S++PFF Y T H +Y Sbjct: 210 ---------YCTDI---WFEEAIDFIQK--ESEQPFFAYIPTNAMHSPYLVADRYSDPFK 255 Query: 313 -AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG---- 367 G P R ++ + ++ L K L+++ DNT+++F SDNG Sbjct: 256 RQGIEPQRAAFYGMIQNFDENLGRLLKRLDQDNLRDNTMLIFMSDNGTAQGASEQNRKVG 315 Query: 368 -RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVA 425 RG KGS +EGG RVP F W + D + D PT ++L Sbjct: 316 FNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCDLKR---- 371 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQ 485 P DG ++ Q + ++ + Sbjct: 372 ---PADVTFDGRSMAGLLSHSSQQWPERTLVIERQPDN------------VVSATKTQGR 416 Query: 486 SGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + + ++++ DP + +I + + L+ E AY E + Sbjct: 417 AQPPFVVLTDRWRLVRDELYDIQNDPGQIKNIAAEYPEVVRELRAEYDAYFEDV 470 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 117/478 (24%), Positives = 178/478 (37%), Gaps = 62/478 (12%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T +LA +PNVV+ L DD+G+ DVG GG V TP ID +A+ G Sbjct: 32 TFGQLAGETHAQTLRPNVVMLLADDLGYRDVGCYGGPV----ETPTIDQLAAGGTRFQQF 87 Query: 131 YS-QPSSSPTRATILTGQYSIHHGI--LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 YS SP+RAT++TG++ I G+ + TL ++L D GY T +G Sbjct: 88 YSGCAVCSPSRATLMTGRHHIRAGVYSWIQDESQNSHLRLREVTLAEVLRDAGYATAHVG 147 Query: 188 KWHMG----ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 KWH+G E + P GFD + + + H NP+ + Sbjct: 148 KWHLGLPTEERDKPTPDQHGFDHWFA------TWNNAQPSHRNPDNFIRNGEP------- 194 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 V + G Q +AD ++M+ + + D+PFFL H Sbjct: 195 -----VGQLEGYSCQLVADEAIRWMDRH-------------RESDPDQPFFLNVWFHEPH 236 Query: 304 FDNYPNA----KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 KY S Y + + L L+ G +NTLIV+ SDNG Sbjct: 237 APIAAPDEVTQKYGKLSDKGAVYSGTIDNTDQAIKRLLAKLDALGVRENTLIVYASDNG- 295 Query: 360 EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 RG KG+ WEGG+RVP +W G I S+ L D+ PT L Sbjct: 296 --SYRTDRVGKLRGRKGANWEGGIRVPGIFHWPGHIPAGVVSNEPAGLVDVLPTICGL-- 351 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL---NGKLAAVRMDEFKYHV 475 + P +DG D T G R ++ + + A+R ++ Sbjct: 352 ------LKISPPQVHLDGSDLTPLLTGHADSFERHQPLFWHLQRSQPIVAMRDGDYSLVG 405 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQT-AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + + T ++NL DP ++ ++ ++ M Sbjct: 406 FRDYEMSNKNLFEEKWIPAIKNGTYHNFELYNLKDDPGQTKNLAAEQPERVEAMKQRM 463 >UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C383_9PLAN Length = 405 Score = 370 bits (949), Expect = e-100, Method: Composition-based stats. Identities = 111/450 (24%), Positives = 179/450 (39%), Gaps = 63/450 (14%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 + +KPNV++ DD G +D+ G + TP +D++A +G+ T Y+ P SP+R Sbjct: 5 SSEKPNVIIIFTDDQGSVDLNCYGAKDLI---TPHMDSIARRGIRFTQFYASAPVCSPSR 61 Query: 141 ATILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 A +LTG++ G+ +G+ G T+ +++ GY T IGKWH+G E+ Sbjct: 62 AGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGYTPETM 121 Query: 199 PQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GF+ G D Y+ + + L + E + F D Sbjct: 122 PHGQGFETSFGHMGGCIDNYSHFFYWNGPNRHDLWENGKEVWRDGAFFPD---------- 171 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK----YA 313 ++ ++ K DKPFFLY+ H+ K YA Sbjct: 172 ----------------LMVEQCQDYIRKAG--DKPFFLYWAINVPHYPLQGKEKWRKTYA 213 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR----T 369 S R Y + M+D + TL+ + T+I+F SD+G E G Sbjct: 214 HLSSPRDKYAAFVSTMDDCIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFGGGGSAG 273 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLV 428 P+RGAK S +EGG+RVP + W G I + D + D PT L G P Sbjct: 274 PYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATGCDWLPTISALTGAPLPA----- 328 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 +DG + + + +S + Y+ GK A+R ++K + T G Sbjct: 329 ---HHLDGKNLKAVIESSTAKSPHEN-FYWQIGKSWAIREGDWKLLGNPRDTSQQTPLGK 384 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 + + +L D E ++ Sbjct: 385 ENQIF----------LVDLSKDIGEKKNLA 404 >UniRef50_C9KTC2 Arylsulphatase A n=5 Tax=Bacteroides RepID=C9KTC2_9BACE Length = 501 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. Identities = 124/507 (24%), Positives = 199/507 (39%), Gaps = 62/507 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 L + PNV+ L DD+G+ D+ TP+ID + G+ T A+S Sbjct: 11 LLAATAVKAQSPNVIFILADDLGYGDISAFNPE--SKIHTPNIDNLTHSGISFTDAHSSS 68 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGG--LQGLTTLPQLLHDQGYVTQAIGKWHM 191 S+P+R +I+TG+Y + + G T+ Q+ + GY T IGKWH+ Sbjct: 69 ALSTPSRYSIITGRYPWRTTMKSGVLNGFSPAMITPDRRTIAQMFSENGYNTACIGKWHL 128 Query: 192 GENKES------------------QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP 233 G + P + GFD F G + + P V + Sbjct: 129 GWDWAYPQNAKNKQDVDFSLPIKNGPTDRGFDYFYGIPAS---------LGTAPHVYVEN 179 Query: 234 DRSEYIKQLPFS-KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP 292 D+ + + + +R G A AD P +D + +G+ +++K S KP Sbjct: 180 DKVTALPNRTIGPQKGIKLIRNG--VAGADFEP---QDCLPNIIRHGIDYINKQRDSKKP 234 Query: 293 FFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 FFLY H P KY G + YGD +V ++D+ + KTL+KN QL+NT+I+ Sbjct: 235 FFLYLPITAPHTPVLPAEKYQGQTII-GDYGDFVVMIDDMVQQIVKTLKKNNQLENTIII 293 Query: 353 FTSDNG-----PEAEVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIV 404 FTSDNG E+ G P +RG K +EGG R+P V W+G + +V Sbjct: 294 FTSDNGCAPYIGVEEMENKGHHPSYIYRGYKNDIYEGGHRIPLIVSWQGKYTNETNGSLV 353 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 L D + T + + + +D G S RK Y Sbjct: 354 SLTDFYATFAQMVNYQ-------LKDEEAVDSYSIWPIL-SKKGNSARKDLIYESGKGYL 405 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG--VRHI 522 ++R + K ++ + + + +FNL DP E +I R+ Sbjct: 406 SLRTLQLKLVF-----HSGSGGWGYPNKPADLAKLPSMQLFNLKEDPSEKKNIISNKRYK 460 Query: 523 PMGVPLQTEMHAYMEILKKYPPRAQIK 549 + + Y+E + P + Sbjct: 461 KDVDKMTQMIKKYVEEGRSTPGKRSAN 487 >UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWW9_RHOBA Length = 622 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. Identities = 109/501 (21%), Positives = 194/501 (38%), Gaps = 70/501 (13%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 +A PNV++ + DD G+ D FNG TP +D +AS+ + LT + P Sbjct: 27 IATPRPSGAASPNVILVMTDDQGYGDFSFNGNPYI---QTPALDRLASESVQLTDFHVAP 83 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 +PTR +++G + + + G+ L T+ + D GY T GKWH+G+N Sbjct: 84 MCTPTRGQLMSGLDAFRNSAI-NVSSGRTLLRHDLKTMADVFQDAGYRTGIFGKWHLGDN 142 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 +P++ GFD+ F S ++ F DD + +R Sbjct: 143 YPFRPEDRGFDETLWFPSSH-----------------INSVPDFWDNDYF--DDTY-IRN 182 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 G++ A + Y D+ + D +++ + + +D PFF + H+ + +Y Sbjct: 183 GKRVAHSG----YCTDV---FFDEAIEWAKQTSPTDSPFFAFIPLNSAHWPWFVPDQYRA 235 Query: 315 SSPAR---------------------TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 S+ + ++D L + L+++G +NT++VF Sbjct: 236 RVRTMLGDTTELKRQLDTTPSNLEDLISFLAMGLNIDDNVGTLTQYLDESGLSENTIVVF 295 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTA 413 +DNG + RG K WEGG RVP + W I +K D + + DL PT Sbjct: 296 LTDNG-STFGDHYFNAGMRGKKTQLWEGGHRVPCLIRWPEQITAQKIDDLTHVQDLLPTL 354 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKY 473 LA +P +DG LG + + RM +FK Sbjct: 355 AALA-----DCDEHLPG--PLDGTSLAPRLLGETDSLADRMLVINYS------RMPQFKV 401 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 P ++G + + ++N+ DP + ++ H + ++ + Sbjct: 402 TYTKGNPAIPRRNGAAVMWNKWRLLENK-RLYNVEQDPHQDHNVAQDHPEIVAKMRAHLA 460 Query: 534 AYMEILKK---YPPRAQIKSD 551 + + +K P R I S+ Sbjct: 461 TWWDGVKDDVMTPERVVIGSE 481 >UniRef50_B2ULS2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2ULS2_AKKM8 Length = 526 Score = 369 bits (948), Expect = e-100, Method: Composition-based stats. Identities = 131/519 (25%), Positives = 211/519 (40%), Gaps = 71/519 (13%) Query: 80 KKTGKKPN-VVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 T K P +V+ DD+G+ DVG G A G PTP ID +A QG T AYS + Sbjct: 23 TPTVKPPKAIVMIYADDLGYGDVGCYG---AKGIPTPAIDKLAEQGCRFTDAYSTTSVCT 79 Query: 138 PTRATILTGQYSIHH-GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 P+R + TG+Y G + P TLP++L GY T IGKWH+G ++ Sbjct: 80 PSRYALFTGEYPWRKEGTGILPGDAALIIDTKKPTLPKMLQSHGYKTYMIGKWHLGLGEK 139 Query: 197 SQ-----------PQNVGFDDFRGFNSVSDMYT-----EWRDVHVNPEVALSPDRSEYIK 240 + P +GFD+ F + D +++P + Sbjct: 140 GKKIDWNKHISPSPNEIGFDESFIFAATGDRVPCVILENGNVRNLDPNDPIEVSYKHNFP 199 Query: 241 QLPFSKDDVHAVR----GGEQQAIADITPK------------YMEDLDQRWMDYGVKFLD 284 LP KD+ ++ G QAI + + E+ D ++++ Sbjct: 200 GLPNGKDNKDQLKLMWSHGHNQAIINGIGRIGFMKGGRSALWKDEENADIITDKAIEYIQ 259 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 K AK+ +PFFL + T H P ++ G S GD VE++D + + L++ G Sbjct: 260 KSAKAKEPFFLMFATHDIHVPRCPEKRFVGKSR-HGVRGDVTVELDDCVRRITEALQQAG 318 Query: 345 QLDNTLIVFTSDNGPEAEVPPHG-----------RTPFRGAKGSTWEGGVRVPTFVYWKG 393 + L++F+SDNGP + PFR K S EGG R+P V W G Sbjct: 319 LEKDALVIFSSDNGPVLDDGYRDFAVRDNATHSPAGPFRAGKYSILEGGSRIPFIVKWPG 378 Query: 394 MIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 +I+P S +++ DL G ++ +F D + LG + + R Sbjct: 379 VIKPGTTSKALLNQMDL--------GASLEQLLAPGKANSFRDSENVMPALLGKSAK-GR 429 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 GK A+R ++K+ + G G G S+F+L DP+ Sbjct: 430 DYHVINSTGKALAIRHGKWKFIP----AGVAIRDGINGASAKMSKSPEGGSLFDLEKDPK 485 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 E D++ +H + ++ ++ + R + K+D Sbjct: 486 ELDNVASQHPDICEQMKAKLEEIRQ-------RPETKAD 517 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 138/574 (24%), Positives = 203/574 (35%), Gaps = 144/574 (25%) Query: 44 NQYLVKPA--TTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDV 101 N++L+ A T+A NM+ H A D ++PNVV ++DD GW DV Sbjct: 5 NKHLLTLAGGVTLAANML----HAASDN--------------RQPNVVFIMVDDYGWADV 46 Query: 102 GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMY 160 G+NG TP+ID +AS+G+I T Y+ S SSP+R +++TG+Y GI + Sbjct: 47 GYNGSRF---YETPNIDRLASEGMIFTDGYAAASISSPSRVSLMTGKYPARTGITDW-IP 102 Query: 161 GQPGGLQ--------------------GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 G GL+ T+ + + GY T +GKWH E+ PQ Sbjct: 103 GYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHVGKWHCAEDSLYYPQ 162 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD G R SP R+ Y+ P Sbjct: 163 YQGFDVNIGGWLKGSPNGIRRSQGGKGAYC-SPYRNPYLPDGPEG--------------- 206 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY-------- 312 E L R D +K + K + +DKPFFLY H +Y Sbjct: 207 --------EFLTDRLGDESIKLI-KNSSADKPFFLYLAFYAVHTPIEAKPEYVKYFKWKA 257 Query: 313 ------------------------AGSSPART-----SYGDCMVEMNDVFANLYKTLEKN 343 AG RT Y + M++ + + L+ N Sbjct: 258 QRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQALKDN 317 Query: 344 GQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KS 400 G NT++ SDNG AE P P R KG +EGG+R P + + M++ Sbjct: 318 GLDKNTIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVEAGSVC 377 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-- 458 V D +PT LD+AG P + +DG G ++ Sbjct: 378 HTPVVAVDFYPTLLDMAGLP-------LKSHQHVDGKSLLPLLKGDQAYDRGPIFFHYPH 430 Query: 459 LNGK----LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 GK AVRM ++K + + ++NL D E+ Sbjct: 431 YGGKGDTPAGAVRMGDYKLIEFYEDGHV--------------------ELYNLKNDISET 470 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILK-KYPPRAQ 547 + +Q +H + K P R Sbjct: 471 RDLSKTEKDKAAEMQKMLHRWRTDCNAKMPTRNP 504 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 120/498 (24%), Positives = 189/498 (37%), Gaps = 91/498 (18%) Query: 74 KLAELEKKTGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + + +KP NVV FL+DD+GW D+G G +P+ID +A++G+ T YS Sbjct: 21 SITQPTHAADEKPWNVVFFLVDDLGWTDLGCYGSDF---YQSPNIDQLAAEGMKFTQNYS 77 Query: 133 QP-SSSPTRATILTGQYSIHHGILMP--------------PMYGQPGGLQGLTTLPQLLH 177 + SPTR +LTG Y + P + Q TTLP+ L Sbjct: 78 ACNACSPTRGALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEALR 137 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSE 237 GY T +GKWH+G + + PQ+ GFD ++ + Sbjct: 138 TAGYQTFHVGKWHLG-GRGNLPQDHGFD---------------------VNISGTNRGLP 175 Query: 238 YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 P+ D A++ A+ +Y+ D R D V + + DKPFFLY Sbjct: 176 RSYHFPYGGD---AMKWDSSLTEAERQDRYLTD---RMADEAVALIRQQ--QDKPFFLYC 227 Query: 298 GTRGCHFDNYPNAKY--------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 H AG Y + +++ + L+++G D T Sbjct: 228 SFYSVHSPIQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRT 287 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLAD 408 LIVFTSDNG P RG KG WEGG RVP V W G+ + D Sbjct: 288 LIVFTSDNGG-VRRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMD 346 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-------FLNG 461 +PT L++ G G N +DG+ NR+A ++ F+ Sbjct: 347 FYPTILNITGVAGNTEHN-----QSVDGLSLVPLLKDPAATLNREALYWHYPHYNVFIGV 401 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 +A+R+ E+K + ++NL D E+ + + Sbjct: 402 PYSAIRVGEYKLIHYY--------------------EDGNDELYNLAEDLSETSDVSKTY 441 Query: 522 IPMGVPLQTEMHAYMEIL 539 + L+ + +++ + Sbjct: 442 PELTARLERRLQQHLKQV 459 >UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD Length = 480 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 116/497 (23%), Positives = 192/497 (38%), Gaps = 61/497 (12%) Query: 74 KLAELEKKTGKK-PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 L + K +K PNV+ L DD+G+ D+ G + TP + +A +G+ T Y+ Sbjct: 11 SLCTIGVKAQEKLPNVIFILADDLGYGDIEPYGQQII---KTPQLSKLADEGMKFTQFYT 67 Query: 133 -QPSSSPTRATILTGQYSIHHGILMP-----PMYGQPGGLQGLTTLPQLLHDQGYVTQAI 186 +P+RA+ +TGQ + I P+ GQ L ++ QL GY T Sbjct: 68 GTSVCAPSRASFITGQTTGETHIRGNEEVREPVDGQAPLLANDPSVAQLFKKAGYNTGCF 127 Query: 187 GKWHMGENK-ESQPQNVGFDDFRGFNSVSDMYTEWRDV-HVNPEVALSPDRSEYIKQLPF 244 GKW +G E P GFD F G+NS + + + E L P+ Y +Q + Sbjct: 128 GKWGLGIVPSEGNPLKQGFDTFFGYNSQFRAHRRYPAFLWHDNEKVLIPENGNYERQEVY 187 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY-----GT 299 +D + +++ + I + E W+ Y + + + D + Y Sbjct: 188 GEDLI------QEKILDYIGKQTAEKPFFMWLTYTLPHAELVVPHDSIYASYEYLPKKPY 241 Query: 300 RGCHFDN-----YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFT 354 +G +D + A Y +Y + ++ + K L+ G ++T+I+F Sbjct: 242 KGVDYDKITPKPFGWAGYMSQPHTYATYAAMVSRLDKYLGEIRKLLKVKGLDEDTIIIFA 301 Query: 355 SDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLAD 408 SDNG E + RG K +EGG+R P VYWKG I+ SD I D Sbjct: 302 SDNGAHREGGADPKFFNSSAGLRGIKRDLYEGGIRTPYIVYWKGKIKAGSVSDHIGAFWD 361 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAV 466 + PT ++ VP V LG Q K + + G AV Sbjct: 362 MMPTFAEIT------HQKYVPNRHQ---VSFLPTLLGKKQQQQHKYLYWEFHEMGGRQAV 412 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R +K L + A +++L TDP E ++ ++ + Sbjct: 413 RYKNWKGVRL----------------NVNKDKKAPIELYDLTTDPAEQHNLAEKYPKIVK 456 Query: 527 PLQTEMHAYMEILKKYP 543 ++ M + +P Sbjct: 457 KIERFMEQSHTRSELFP 473 >UniRef50_A4CJK0 Arylsulfatase A n=1 Tax=Robiginitalea biformata HTCC2501 RepID=A4CJK0_9FLAO Length = 516 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 118/509 (23%), Positives = 190/509 (37%), Gaps = 66/509 (12%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + A +PN+V+ DD+G+ D G G TP ID++A+ GL T Y+ Sbjct: 24 PQAAGGANSDASRPNIVIIYADDLGFGDTGAYGATEI---QTPHIDSLAAGGLRFTRGYA 80 Query: 133 Q-PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWH 190 + +P+R +LTGQY P + TLP LL GY T +GKWH Sbjct: 81 SSATCTPSRYALLTGQYPWRKEKARILPGNAPLLIDTAQATLPGLLRQAGYRTGIVGKWH 140 Query: 191 MGENKES---------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRS 236 +G + P VGF++ + D + V + P+ + Sbjct: 141 LGLGTGAVDWNQAIRPGPNEVGFEESFILAATQDRVPTVYIRNGQVVGLEPDDPIQVSYE 200 Query: 237 EYIKQLPFSKDDVHAVR----GGEQQAIADITPKYM------------EDLDQRWMDYGV 280 E P + D V+ G +I + P+ ED+ ++ Sbjct: 201 ENFPGEPTALDHPELVKMGWDHGHNNSIVNGIPRIGFMKGGQAARWVDEDMADTFLKEAQ 260 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 F+ + + PFFL+Y + H P+ ++ G++ GD + E + + TL Sbjct: 261 VFIRE-RDPEAPFFLFYSLQQPHVPRTPHPRFVGATDL-GPRGDAIFEADWCIGQILATL 318 Query: 341 EKNGQLDNTLIVFTSDNGP--------EAEVPPHGRTPF---RGAKGSTWEGGVRVPTFV 389 + G L NTL++F+SDNGP +A G +P+ RG K S +E G RVP V Sbjct: 319 QDEGLLTNTLVIFSSDNGPVLNDGYLDQAVERNGGHSPWGPYRGGKYSLFEAGTRVPFIV 378 Query: 390 YWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 W G I P SD +V DL + L G P D + G + Sbjct: 379 SWPGTIAPGVSDAMVSQIDLLASLAHLTGVPDPGT----------DSQNIWPALSGRSDA 428 Query: 450 SNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 A R ++ Y P + G A +++L + Sbjct: 429 GREHMVL--EATSRTAFRTRDWVYIPPQSGPPVAKNVNIELG------NAAAPQLYHLES 480 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 DP ++ ++ + L + + Sbjct: 481 DPGQTRNLAEELPQIRDSLMRQYQEIISS 509 >UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC8_9BACE Length = 493 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 118/485 (24%), Positives = 184/485 (37%), Gaps = 59/485 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-Q 133 AE +K +KPN++ FL+DD+G D+ G TP+ID +A+ G++ T+ Y Sbjct: 20 CAEQKKVEEQKPNIIYFLVDDMGMGDLSLTGQK---KYETPNIDKLAADGMLFTNHYCGT 76 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 S P+RA ++TG+++ H + Q G TL +L GY T IGKW +G Sbjct: 77 TVSGPSRACLMTGKHTGHTSVRGNQPGPQLLG-DNEATLASVLKGAGYKTAVIGKWGIGH 135 Query: 194 N-KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI--KQLPFSKDDVH 250 PQ GFD G+ ++ W + PE E + +L ++D + Sbjct: 136 PIPLDDPQRKGFDLSYGYLNM------WHAHNCFPEFLYRNGVKEELTGNKLALAEDGTN 189 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 + + + +KF+ PFF+YY H +N Sbjct: 190 PWADMPEGTGVARMDARKQYAPDLFEKEALKFISD--NKKNPFFIYYALNLPHANNEAAP 247 Query: 311 K------------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 + M ++ +L LEK G DNT+I+F SDNG Sbjct: 248 NGCEVPSYNADIAAKDWPEVEKGFAQMMQIIDKQVGDLVAYLEKEGLADNTIIMFASDNG 307 Query: 359 PEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 P E RG K W+GG+R P V W G ++ S+ + D+ PT Sbjct: 308 PHQEGGHKVDFFDSNADLRGKKRDMWDGGIRTPFIVKWPGKVKAGSTSNHLSAFWDVLPT 367 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY---FLNGKLAAVRMD 469 D+A V K IDG+ LG + + Y + G AV D Sbjct: 368 FCDIA---------KVEKPAGIDGLSLLPTLLGDTAKQEKHKYLYFEFYEEGGKQAVVAD 418 Query: 470 EFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 +KY L + G + +S++ L D E + H M ++ Sbjct: 419 NWKYIKLNVR-------------QGKGAKPVETSLYRLTDDVSEQKDVKEEHPEMVEIME 465 Query: 530 TEMHA 534 + Sbjct: 466 GYIKE 470 >UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2URC2_AKKM8 Length = 465 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 112/480 (23%), Positives = 171/480 (35%), Gaps = 80/480 (16%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 A + + PN++V L DD+G+ D+G G TP +D +A +G+ + AY P Sbjct: 19 AVAQPRLSSPPNMIVILADDLGYGDLGCTGSKQI---KTPSLDRLAREGVFCSRAYVTAP 75 Query: 135 SSSPTRATILTGQYSIHHGILMPPM-------YGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 SP+R +LTG++ +GI P G Q +P+ L GY + G Sbjct: 76 MCSPSRMGLLTGRFPKRYGITTNPNIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFG 135 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G K P GF + GF S Y P K+ Sbjct: 136 KWHLGHTKGYTPPERGFTHWWGFLGGSRHY------------------------FPVKKE 171 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 D T + L D V+FL + K KPFF++ H+ N Sbjct: 172 AEGLNPSMIVSNFTDKTD--ITYLTDDITDRAVEFLQEAGKDKKPFFMFVSYNAPHWPNE 229 Query: 308 PNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 + + R Y + M+ + L+ +G +T++VF SDNG E Sbjct: 230 AKPEDIAKFRNVQNGERRVYCAMVYAMDRGIGRILDALKADGLEKDTIVVFLSDNGGAPE 289 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKG---MIQPRKSDGIVDLADLFPTALDLAGH 419 PFRGAK +EGGVRVP + + ++ V DL P L G Sbjct: 290 A-SSCNAPFRGAKRQHFEGGVRVPFIIRYPADKRLVPGSVCRQPVSSVDLLPALLKANGR 348 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 +DG+D R ++ +AV + KY ++ + Sbjct: 349 HIP---------RKLDGMDILELVGNKGAPVPRT--FFWCTDYTSAVLTGDMKYLLVPDR 397 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG-VRHIPMGVPLQTEMHAYMEI 538 +N+ DPQE + RH L ++ Y+ Sbjct: 398 ---------------------APQFYNVADDPQEQRDLYFSRHQD-ADLLAKKLGTYLTT 435 >UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ89_PLALI Length = 536 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 126/537 (23%), Positives = 199/537 (37%), Gaps = 92/537 (17%) Query: 47 LVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGG 106 V P A +++ ++Q A + L +PNVV L DD+GW +VG G Sbjct: 2 FVLPEIRAALSVLLLIQLAA--ESLWANELTLISHQSPRPNVVFILADDLGWGEVGCFGQ 59 Query: 107 GVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPM------ 159 PTP+ID +AS+G+ LT YS P+ +P+R ++TG++ H I Sbjct: 60 SKI---PTPNIDRLASRGVKLTRHYSGAPTCAPSRCVLMTGKHLGHAEIRGNQQAKVKLP 116 Query: 160 ---YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSD 215 GQ T+ + GY T A GKW +G +P GFD+F G+N + Sbjct: 117 QFTEGQHPLSDKALTIARQFQKAGYATGAFGKWGLGPVGSTGEPNRQGFDEFFGYNCQAL 176 Query: 216 MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRW 275 ++ + ++ + K +P K E + P+ + Sbjct: 177 AHSYFPKALWKNAESIVNNE----KPVPGHKKQPEGEVTMEAYQGENYAPRLI------- 225 Query: 276 MDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK-------------------YAGSS 316 M + F+D+ + +PFFLY H P K Y Sbjct: 226 MAEALSFIDRHHQ--QPFFLYLPFTEPHVAMQPPPKIVEEFPVEWDERVYRGDGGYLPHP 283 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-----PEAEVPPHGRTP- 370 R +Y + ++++ ++ +LEK+G L+ TLIVFTSDNG + G P Sbjct: 284 RPRAAYAAMIRDLDNHVGDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGADPL 343 Query: 371 -------FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGA 422 +G KGS +EGG+RVP V W G I P + D FPT + Sbjct: 344 FFNSTRELKGFKGSIYEGGLRVPAIVSWPGQIPPATTINTPSYFPDWFPTLCNAT----- 398 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQ-----SNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 +P +DGV+ G + Y V + +FK Sbjct: 399 ----QLPLPEGLDGVNLLPLLTGKTSPDQFIRPDPMVWVYAEYTGQVCVHLGDFKVLRRG 454 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + + V+ L +DP ES ++ + + A Sbjct: 455 LRTN----------------RPGPWEVYQLVSDPGESTNLADSRPDLVTKAIEVLKA 495 >UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI Length = 481 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 121/498 (24%), Positives = 190/498 (38%), Gaps = 66/498 (13%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 L + G+ ++V + DD+GW D+GF+ + TP+ID +A +GL+L Y Q Sbjct: 14 NLCDDVSAAGRPRHIVFIVADDLGWNDIGFHNPDII----TPNIDKLAREGLLLNHHYVQ 69 Query: 134 PSSSPTRATILTGQYSIHHGILMPPM-YGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHM 191 P SP+RA ++G Y G+ + QP L +T LPQ L + GY T +GKWH Sbjct: 70 PLCSPSRAAFMSGYYPFKTGLQHSVILENQPVCLPLNITILPQKLKELGYATHIVGKWHN 129 Query: 192 GEN-KESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 G P GFD F G + ++ D YT ++ Sbjct: 130 GFCSWNCTPTYRGFDSFFGYYGAMEDYYTHVIRGFLD----------------------- 166 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 R D R+ D +++ +P FLY + + Sbjct: 167 --YRNNTTPVWTDN----GTYSTLRFTDVATDIIERH-NQSQPLFLYLAYQAVYGPIEVP 219 Query: 310 AKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 AKY S R + + +++ N+ KTL + G +D+TLI+FT+DNG + Sbjct: 220 AKYEAMYPNIKSENRRKFSGMVSALDEAVGNVTKTLRQRGLMDDTLILFTADNGGGVD-E 278 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P RG+K + +EGG R F+Y G+ + DG++ D PT AG Sbjct: 279 SGNNYPLRGSKFTVYEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAAAGGTPVS 338 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG------KLAAVRMDEFKYHVLI 477 DG++ T S R Y + AA+R+ ++K Sbjct: 339 DR---------DGINLWPSL-STASPSPRTEVVYNYDSHPQPVQGHAAIRVGDYKLIDGY 388 Query: 478 QQPYAYTQSGYQGGFTGT----VMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 P+ Q + +FNL DP E + + M L + Sbjct: 389 PGPFPDWYKPEQVTSSLNTRFSRDSANQYQLFNLKDDPNERNDLSNFRPDMVKKLAARL- 447 Query: 534 AYMEILKKYPPRAQIKSD 551 A+ + P + D Sbjct: 448 AWYKKQAVPPNFPETPDD 465 >UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XP0_PSEA6 Length = 627 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 123/499 (24%), Positives = 201/499 (40%), Gaps = 82/499 (16%) Query: 70 ETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS 129 Q + A E T KPN+V+ + DD G+ D+G + + TP+ID +A+Q LT+ Sbjct: 30 AVQNRSASAEPPT--KPNIVLIVTDDQGYGDIGRHNNPII---QTPNIDDIAAQSARLTN 84 Query: 130 AYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 + P+ SPTR+ +LTG++S+ G+ + G+ TL + L + GY T GKW Sbjct: 85 FHVDPTCSPTRSALLTGKHSLRAGVWHTIL-GRYMLGPEHVTLAESLQENGYRTGIFGKW 143 Query: 190 HMGENKESQPQNVGFDD--FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 H+G+N +PQ+ GFDD G V W + N Sbjct: 144 HLGDNYPYRPQDQGFDDVLIHGGGGVGQTPDYWGNTQFNDTYY----------------- 186 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 R G + K+ + W D KF+DK D P+F Y H Sbjct: 187 -----RNGTPE-------KFSGYATKIWFDEAKKFIDKQH--DTPYFAYIALNAPHGPYR 232 Query: 308 PNA------KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG--- 358 + G + S+ + +++ L L QLDNT+ +F +DNG Sbjct: 233 APETHIEPYEKRGLNRDMASFYGMISYIDEQVGELRAHLRAQDQLDNTIFIFMTDNGSSY 292 Query: 359 --------------PEAEVPPHG--RTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRKSD 401 P AE P+ RG KG +EGG RVP F+ + G I + Sbjct: 293 KPTDAKTHLTKRHLPLAEQYPNWQPNDNMRGYKGEVYEGGHRVPFFISYPNGNITTGDYE 352 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG 461 I D+ PT L+LA P P + +DG ++ G + +++ Sbjct: 353 AITAHFDVMPTLLELANIP--------PVNSTLDGTSLATYLKGEQANRSLESKL----S 400 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 + A V ++ YH +++P A ++ + +FNL DP + + I H Sbjct: 401 ERAIVVTNQRVYHPSVKRPIAIAFHQWR-----YISANDSEKLFNLQQDPSQQNDIKNDH 455 Query: 522 IPMGVPLQTEMHAYMEILK 540 + ++ + + ++ Sbjct: 456 PDILARMRQRKQTWWQEMQ 474 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 113/500 (22%), Positives = 186/500 (37%), Gaps = 113/500 (22%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 KPN+++ ++DD+GW DVG+ TP+IDA+A QGL+ AY+ + +P+ Sbjct: 35 ADTTKPNIIMIVIDDLGWSDVGY--NQTTDYFETPNIDALAQQGLVFDQAYAGAANCAPS 92 Query: 140 RATILTGQYSIHHGILM--------------PPMYGQPGGLQGLTTLPQLLHDQGYVTQA 185 RA +++GQY HG+ P+ + G + T+ + L GY T Sbjct: 93 RAVLMSGQYGPRHGVYTVSPSDRGHAKTRKLIPIKNKRGLTTDIITIGESLKTAGYTTGT 152 Query: 186 IGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKWH+G + P GFD S M + + P + P Sbjct: 153 FGKWHLGAD----PDKQGFDVNVA-GSHQGMTFHYFSPYQLPNIEDGPKG---------- 197 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 E L +R + ++ + D+PFF Y H Sbjct: 198 -----------------------EYLTERLTTEVIDWVK--SSKDQPFFAYVPYYTVHTP 232 Query: 306 NYP-------NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 + S +Y + M+D ++ L+ G +NT+++FTSDNG Sbjct: 233 YQAVVDKVNKYHEKGIKSKREATYAAMVEHMDDNVGRIFDMLDSEGLAENTVVIFTSDNG 292 Query: 359 PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAG 418 TP RG KGS ++GG+RVP V W ++P V AD +PT ++L Sbjct: 293 GYRM--SSFPTPLRGGKGSYYDGGLRVPLIVRWPEKVKPGLDHTPVINADFYPTLVNLTK 350 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-------------------FL 459 +DGVD T+ LG + R + F Sbjct: 351 SKQP--------NQVLDGVDLTAHLLGQQDIAERDLFWHFPVYLQAHHAPTDQGQDPLFR 402 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 +A+R ++K + ++NL D E +++ Sbjct: 403 TRPGSAIRSGDWKL--------------------LQYFENNEFELYNLANDLAEKNNLAS 442 Query: 520 RHIPMGVPLQTEMHAYMEIL 539 H L+T++ A+ + + Sbjct: 443 VHPSRVKELKTKLQAWQQQI 462 >UniRef50_D2R207 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R207_9PLAN Length = 495 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 104/500 (20%), Positives = 189/500 (37%), Gaps = 72/500 (14%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 + +PN++ + DD+G+ DVG G V TP+ID +A +GL T YS Sbjct: 28 SAQAADSDRPNIIWLMADDLGYGDVGCYGQKVIA---TPNIDQMAREGLRFTQFYSGATV 84 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPG---GLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 +P+R+ ++TG + H + G P T+ + L GY T +GKW +G Sbjct: 85 CAPSRSVLMTGLHHGHTRVRGNAGAGNPAAQALRADDFTVAKFLQQAGYRTALVGKWGLG 144 Query: 193 ENKES---QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 ++ ++ P+ GFD+F G+ + + + P + + +P Sbjct: 145 DDGQASTGLPRKQGFDEFVGYLNQRHAHNHF------PSFLWRNEEKFPLPNVP------ 192 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH------ 303 E+ + K ++ D + + F+++ ++PFFLY+ H Sbjct: 193 ----ELEEPDGSGYPKKAVQFADDLLTEEALAFVER--NREQPFFLYWTPVIPHANNERA 246 Query: 304 --------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 ++ + + ++ + L++ TL +FTS Sbjct: 247 RDLGNGAQVPDFGPYEKETWPEQDKGQAAMIHRLDTYVGRMLAKLKQLKLDQKTLFIFTS 306 Query: 356 DNGPEAEVPPHG-----RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADL 409 DNGP E + + G K S +GG+RVP +W G I P++ S+ + D Sbjct: 307 DNGPHNEARHNLERFQPSGSWTGIKRSLHDGGIRVPMICWWPGTIAPQQVSEHVGYSGDF 366 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMD 469 F TA +LA P +D + S G + + + Y Sbjct: 367 FATAAELASRPAPAG---------LDSISFASTLRGDSSKQAKHEFLY------------ 405 Query: 470 EFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 +++H + Y+G A +V++L TDPQE I + + L Sbjct: 406 -WEFHENGFSQATLCEGRYKG--IRLRDPDAPIAVYDLQTDPQERVDIAATNPALAARLD 462 Query: 530 TEMHAYMEILKKYPPRAQIK 549 + + + +P R Sbjct: 463 HYLKSARTTNEDWPARKPAA 482 >UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB_HUMAN Length = 533 Score = 366 bits (940), Expect = 1e-99, Method: Composition-based stats. Identities = 118/515 (22%), Positives = 196/515 (38%), Gaps = 78/515 (15%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 + P++V L DD+GW DVGF+G + TP +DA+A+ G++L + Y+QP +P+R+ Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGSRI----RTPHLDALAAGGVLLDNYYTQPLCTPSRS 96 Query: 142 TILTGQYSIHHGILMPPMY-GQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQ 198 +LTG+Y I G+ ++ QP + LPQLL + GY T +GKWH+G KE Sbjct: 97 QLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECL 156 Query: 199 PQNVGFDDFRGFN-SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD + G+ D Y+ R + D + +D G + Sbjct: 157 PTRRGFDTYFGYLLGSEDYYSHERCTLI--------DALNVTRCALDFRDGEEVATGYKN 208 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG--- 314 +I + + + +KP FLY + H +Y Sbjct: 209 MYSTNI-----------FTKRAIALITNH-PPEKPLFLYLALQSVHEPLQVPEEYLKPYD 256 Query: 315 --SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 R Y + M++ N+ L+ +G +NT+ +F++DNG + + P R Sbjct: 257 FIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGN-NWPLR 315 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K S WEGGVR FV + Q + ++ ++D PT + LA T Sbjct: 316 GRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLA-------RGHTNGT 368 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHY-------------------------------FLN 460 +DG D S R + F Sbjct: 369 KPLDGFDVWKTI-SEGSPSPRIELLHNIDPNFVDSSPCPRNSMAPAKDDSSLPEYSAFNT 427 Query: 461 GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS---SVFNLYTDPQESDSI 517 AA+R +K + Q + + +F++ DP+E + Sbjct: 428 SVHAAIRHGNWKLLTGYPGCGYWFPPPSQYNVSEIPSSDPPTKTLWLFDIDRDPEERHDL 487 Query: 518 GVRHIPMGVPLQTEMHAYME-ILKKYPPRAQIKSD 551 + + L + + Y + + Y P + D Sbjct: 488 SREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCD 522 >UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomyces sp. C RepID=UPI0001B577E1 Length = 746 Score = 366 bits (939), Expect = 1e-99, Method: Composition-based stats. Identities = 121/538 (22%), Positives = 203/538 (37%), Gaps = 90/538 (16%) Query: 43 PNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTG---KKPNVVVFLLDDVGWM 99 P++ A+T + V A + + E K G + PN+VV L DD+G+ Sbjct: 2 PSRRTFLAASTATLGLTAVTATTAGSAQAVPAVTVPETKDGSGTRLPNIVVVLADDLGYG 61 Query: 100 DVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPP 158 ++G G + TP +D +A++GL T AYS +P+R ++LTG ++ H + P Sbjct: 62 ELGSYGQKLIS---TPRLDRLATEGLRFTDAYSTAAVCAPSRCSLLTGLHTGHSTVRANP 118 Query: 159 MYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGE---NKESQPQNVGFDDFRGFNSVS 214 G G L TT Q+L +GY T IGKW G ++S P GF++F G+ S Sbjct: 119 SSGGQGSLTATDTTFAQVLRARGYRTAVIGKWGFGPEAAGQDSHPAARGFEEFYGYIDHS 178 Query: 215 DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQR 274 + +Y + + + A A P +E Sbjct: 179 HAH-------------------QYYPEYLWHNAVKEPIPANAGGAKAVYAPHLLEQ---- 215 Query: 275 WMDYGVKFLDKMAKSDKPFFLYYGTRGCH----FDNYPNAKYAGSSPARTSYGDCMVEMN 330 + ++F+D A PF L H + + A + + + Sbjct: 216 ---HALEFIDTHAAE--PFLLLLTPNVPHAPSDIPDSSAYADRSWTAANKGHAAQVSYFD 270 Query: 331 DVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH-----GRTPFRGAKGSTWEGGVRV 385 + + L G +T+++ TSDNGP E + P RG K + +EGGVRV Sbjct: 271 SLVGKVVDRLRSLGLEQDTVVLVTSDNGPHEEGGVNPDLFDANGPLRGYKRNLYEGGVRV 330 Query: 386 PTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 P + G +Q S+ L D+ PT +L G P T +DG+ G Sbjct: 331 PLIAWGPGRVQQGTSNRPTPLTDVLPTLAELGGAPAP---------TDVDGLSAAPLLAG 381 Query: 446 TNGQSNRKAEHYFLNGKL------------------AAVRMDEFKYHVL-IQQPYAYTQS 486 + S R Y+ +L AVR + +K ++ + Sbjct: 382 SPD-SARHGHLYWFRDELGVTSRANAQDGKRATWLAEAVRRENWKAVRFAPERDHNLPDD 440 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM-HAYMEILKKYP 543 +Q +++L TD E+ + ++ L M ++ + + P Sbjct: 441 KWQ------------VELYDLATDLGETRDVLAKNPSKAAELVALMRSSWKDTYPRTP 486 >UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9LGQ4_9BACT Length = 608 Score = 366 bits (939), Expect = 2e-99, Method: Composition-based stats. Identities = 122/488 (25%), Positives = 188/488 (38%), Gaps = 103/488 (21%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 + ++PNV+VFL DD GW D G TP+ID++A+QGL+ + + P SPT Sbjct: 39 AQNDQRPNVIVFLSDDQGWGDFSCTGNQSVA---TPNIDSLATQGLLFENFFVCPVCSPT 95 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 RA LTG+Y + GQ TT+ L GY T A GKWH G P Sbjct: 96 RAEFLTGRYHPQSNVKG-VSQGQERIDLDETTIADCLSQAGYATAAFGKWHNGMQYPYHP 154 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFDDF GF S + NP + +K + DD Sbjct: 155 CGRGFDDFYGFCSGH------WGNYFNPTLE---HNGRIVKGEGYINDD----------- 194 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-DNYPNAKYAGSSPA 318 + + +KF++ +PFFLY H+ P+A + + Sbjct: 195 ---------------FTNRALKFIED--HKSQPFFLYLPYNTPHWPPQMPDAYWQRFAEK 237 Query: 319 ----RTSYGD------------CMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 R GD + ++ + L++ DNT++++ +DNGP + Sbjct: 238 EIVQRGQKGDKEDLAKTRSALAMVENIDWNVGRVLAKLDELKIADNTIVIYFNDNGPNSN 297 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHP 420 G +G KGST EGGVR P FV W ++ R+ + I DL+PT L G Sbjct: 298 RWNAG---MKGKKGSTDEGGVRSPLFVRWPNGVKGAGRRVNQICGAIDLYPTLLAATGSA 354 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 +DG + + G+ + + GK A+VR +F+ Sbjct: 355 NV-------GDKILDGKNLLPIWDGSETNLGFRMLFSYWRGK-ASVRTQQFRL------- 399 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV-------PLQTEMH 533 +G+ +F++ TDP ++ I + + EM Sbjct: 400 ---DNNGW---------------LFDMLTDPHQTKDISSDQPAVAALLLGSLIRFKQEME 441 Query: 534 AYMEILKK 541 A M+ K+ Sbjct: 442 AEMDSTKR 449 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 366 bits (939), Expect = 2e-99, Method: Composition-based stats. Identities = 125/510 (24%), Positives = 193/510 (37%), Gaps = 113/510 (22%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 L ++ + PN+++ L+DD+GW DVGF G TP ID +A GL+ T AY+ P+ Sbjct: 34 SLAAESRRPPNIILILMDDMGWRDVGFMGNKFV---ETPHIDRLAKTGLVFTQAYASAPN 90 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQ---------------GLTTLPQLLHDQG 180 +PTRA +++GQY+ HGI QP G + T+ + L D G Sbjct: 91 CAPTRACLMSGQYAPRHGIYTVVDPRQPPGSPWHKWQAAESKSELDTNVVTIAEALRDGG 150 Query: 181 YVTQAIGKWHMGENKES--QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 Y T G W++G + P GF + + Sbjct: 151 YATAFFGMWNLGRGRTGPVTPGGQGFQ-----------------------------KVVF 181 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + L F KD+ D Y+ D R D +KF+D+ ++PFF+Y Sbjct: 182 PENLGFGKDEYF-----------DDGKHYLTD---RLTDEVLKFVDEHR--EQPFFVYLP 225 Query: 299 TRGCHFDNYPNA-------KYAGSSPART---SYGDCMVEMNDVFANLYKTLEKNGQLDN 348 H P + A +S R + + ++ + L++ DN Sbjct: 226 DHAIHAPFNPKPELLAKYERKAAASNDRRDDPACAATIEAVDHNVGRIMDHLKRLKLSDN 285 Query: 349 TLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLA 407 T+++FTSDNG + P P RG KG +EGG+RVP V G+ + D V Sbjct: 286 TVVIFTSDNGGTQQYTP----PLRGGKGELYEGGIRVPLVVAGPGVKSLGSRCDVPVSSI 341 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-----LNGK 462 DL+PT L+LAG P+ +DGV G + +F Sbjct: 342 DLYPTLLELAGI-------KPPEGQVLDGVSLAPLLQGDATLDRERLFWHFPCYVGKATP 394 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 +A+R +FK ++ +FNL DP E ++ Sbjct: 395 SSAMREGDFKLIEFFEEG-------------------GRVELFNLKNDPNEEKNLASVMP 435 Query: 523 PMGVPLQTEMHAYM-EILKKYPPRAQIKSD 551 L + A+ + PP D Sbjct: 436 DKAAALAKTLRAWQKKTNASIPPGPNPSYD 465 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 366 bits (939), Expect = 2e-99, Method: Composition-based stats. Identities = 109/510 (21%), Positives = 182/510 (35%), Gaps = 110/510 (21%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-S 132 + ++PN+V+F +DD+GW DVGF G TP ID +A + + T+AY + Sbjct: 18 SMLSQASAENQRPNIVLFFIDDLGWRDVGFMGSDF---FETPHIDRLADESMKFTAAYSA 74 Query: 133 QPSSSPTRATILTGQYSIHHGILM--------------PPMYGQPGGLQGLTTLPQLLHD 178 P+ +P+RA +++G Y+ HG+ P TT+ L Sbjct: 75 APNCAPSRACLMSGLYTPRHGVYTVGDPARGNDRYRKLIPAENNRVLDDRFTTIADRLSQ 134 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY ++GKWH+G++ P + GF N + + NP+++ Sbjct: 135 AGYRCASVGKWHLGQS----PLSQGFQVNIAGNQTGSPRGGYFSPYQNPQLSDGEQG--- 187 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 E L R +F+ PFFLY Sbjct: 188 ------------------------------EFLTDRLTTAACQFIKD--NQGSPFFLYLT 215 Query: 299 TRGCHFDNYPNAKY--------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 H + AG +Y + M+ + +TL + NT+ Sbjct: 216 HYAVHTPLQAKKEDIAYFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLREQQLDQNTI 275 Query: 351 IVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADL 409 +VFTSDNG P P RG+KG +EGG+RVP + W G+ QP + V DL Sbjct: 276 VVFTSDNGGYG--PATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTGEAVINVDL 333 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY------------ 457 +PT L++ P V ++ +DG + ++ + Sbjct: 334 YPTFLEMTNIP-------VLESELLDGESLVPLLKDPQTRLESRSLFWHFPAYLQKYQGM 386 Query: 458 ---FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 F ++ +R ++K + + ++N D ES Sbjct: 387 QQRFRTTPVSVIRQGDWKLLEFFEDGHQ--------------------ELYNTRLDIGES 426 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 + H L +H + + +K P Sbjct: 427 KELSGSHPEKTQELSQALHRWQKQVKAAIP 456 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 365 bits (938), Expect = 2e-99, Method: Composition-based stats. Identities = 108/484 (22%), Positives = 189/484 (39%), Gaps = 80/484 (16%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + +PNVV+ + DD G+ D GF G V TP+IDA+A++ +LT + P+ SPTR Sbjct: 27 RAADRPNVVMVITDDQGYGDCGFTGNKVV---QTPNIDALAAESSVLTDYHVAPTCSPTR 83 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 + ++TG ++ G+ + G+ T ++ D GY T GKWH+G+N + + Sbjct: 84 SALMTGHWTNRTGVWHT-ISGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHLGDNYPYRAE 142 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GF + H V +PD + F H + + + Sbjct: 143 DNGFTEVY--------------RHGGGGVGQTPD---FWDNAYFDGSYFHNGKAVKAEGF 185 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR- 319 + G +F+ + ++D+PFF Y T H + KY P Sbjct: 186 ----------CTDVFFKEGNRFIRECVEADEPFFAYIATNAPHGPLHAPQKYIDMYPEMN 235 Query: 320 ---TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKG 376 ++ + ++D K L + G DNT+ +FT+DNG + RG KG Sbjct: 236 DNVATFFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTDNGTAGGASVY-NAGMRGKKG 294 Query: 377 STWEGGVRVPTFVYWK--GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 S +EGG RVP +++ G + R ++ + D+ PT LD+ G P++ Sbjct: 295 SPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVPTLLDMCGVEA-------PESVKF 347 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYF--------LNGKLAAVRMDEFKYHVLIQQPYAYTQS 486 DG S S + + ++V D+++ Sbjct: 348 DGTSIVSLLKDEVDSSFNDRMLITDSQRVIDPIKWRQSSVMQDKWRLI------------ 395 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 G ++N+ DP + ++I H ++ A+ L+ P + Sbjct: 396 -------------NGKELYNIANDPGQENNIAGDHPEQVASMRAFYEAWWAELE--PTFS 440 Query: 547 QIKS 550 Q Sbjct: 441 QTTE 444 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 365 bits (938), Expect = 2e-99, Method: Composition-based stats. Identities = 111/514 (21%), Positives = 174/514 (33%), Gaps = 104/514 (20%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 + L +K + +PN+V L+DD+GW D G + TP ID +AS G+ T Sbjct: 20 SVTALHAEQKISADRPNIVFILIDDMGWPDPVSYGNQF---HDTPHIDQLASDGVRFTDF 76 Query: 131 YSQ-PSSSPTRATILTGQYSIHH-------GILMP-----PMYGQPGGLQGLTTLPQLLH 177 Y+ P SPTRA+I GQY G P P + T +LL Sbjct: 77 YAACPVCSPTRASIQAGQYQARLHLTDFIPGHWRPFEKLIVPENAPHLPLEIVTPGELLQ 136 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSE 237 Y T GKWH+G + P G+ H P +P Sbjct: 137 SANYNTAYFGKWHLGP-ESHNPDQQGYQTSLVTGG----------RHFAPRFRTTP---- 181 Query: 238 YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 + R + +AD D ++F+ + KPFF+ Sbjct: 182 -------------STRIPNKAYLADF-----------LTDKTIEFIRQ--NKSKPFFVQL 215 Query: 298 GTRGCHFDNYPN----AKYAGSSP-----ARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 H KY Y + ++D + LE+ +N Sbjct: 216 SHYAVHIPLEAKQQMIRKYQQKPKPAYGINNPVYAAMVAHVDDSVGRIVAALEELKLTEN 275 Query: 349 TLIVFTSDNGPEAEVPPHG-----RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDG 402 T+++FTSDNG + G P R KGS +EGG+RVP + W G+ Sbjct: 276 TVVIFTSDNGGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVPLIIKWPGVAAAGKTCAE 335 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA-----EHY 457 D +PT ++A + + IDG+ + NR+ HY Sbjct: 336 PTISIDFWPTFAEIA-------HTTLQEHQTIDGLSLLPLLKDPSSHLNREEIYFHYPHY 388 Query: 458 FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 + +A+R ++K ++NL D E+ ++ Sbjct: 389 HHSTPASAIRAGDWKLIEFFADGNL--------------------ELYNLQQDLSETTNL 428 Query: 518 GVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 ++ V LQ ++ + P K D Sbjct: 429 AAKNPEKAVELQQKLADWRTRTGAALPVKNPKYD 462 >UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME Length = 562 Score = 365 bits (938), Expect = 2e-99, Method: Composition-based stats. Identities = 130/546 (23%), Positives = 211/546 (38%), Gaps = 108/546 (19%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A +K+ KPN++ L DD+G+ DVGF+G PTP+IDA+A G+IL Y P Sbjct: 16 AAEVEKSPAKPNIIFILADDLGFNDVGFHGSAEI---PTPNIDALAYSGIILNRYYVAPI 72 Query: 136 SSPTRATILTGQYSIHHGILMPPMY-GQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE 193 +P+R+ ++TG+Y IH G+ +Y +P GL LPQ L++ GY + GKWH+G Sbjct: 73 CTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGH 132 Query: 194 NK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 K + P GF GF S Y + V N + D + Sbjct: 133 WKLKYTPLYRGFSSHVGFWSGHQDYNDHTAVENNQ----------------WGLD----M 172 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--------- 303 R G Q A D+ Y D+ D+ VK + + P FLY CH Sbjct: 173 RNGTQVAY-DLHGHYTTDV---ITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLP 228 Query: 304 -FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 DN + R + + +M++ + L K+ L+N++I+F+SDNG A+ Sbjct: 229 VPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQ 288 Query: 363 VPP---HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAG 418 P +G K + WEGGVR ++ + + R S+ + + D PT L+ AG Sbjct: 289 GFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAG 348 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG--KLAAVRMDEFKYHVL 476 A L + IDG + + S R + ++ AA+ + ++K + Sbjct: 349 GQPA----LSNLSKQIDGQSIWRALV-QDKASPRLNVLHNIDDIWGSAALSVGDWKL--V 401 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSS--------------------------------- 503 Y + G+ G + Sbjct: 402 KGTNYRGSWDGWYGPAGERDPRLYDWQLVGRSRAGKALEALKMLPSRADQQRIRAAATVS 461 Query: 504 --------------------VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +F++ DP E ++ ++ + L TE+ + P Sbjct: 462 CPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTELERFNATAV--P 519 Query: 544 PRAQIK 549 P + Sbjct: 520 PSNKPA 525 >UniRef50_C2G0L0 Possible Cerebroside-sulfatase n=2 Tax=Sphingobacterium spiritivorum RepID=C2G0L0_9SPHI Length = 505 Score = 365 bits (937), Expect = 2e-99, Method: Composition-based stats. Identities = 114/505 (22%), Positives = 204/505 (40%), Gaps = 69/505 (13%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS- 135 +++ + +KPNV++ +DD+G+ D+ GG TP +D +A+ G+ T+A++ S Sbjct: 21 QIQAQDKQKPNVLMIYVDDLGYGDLSIYGGQDI---ETPHLDELATSGIRFTNAHAAAST 77 Query: 136 SSPTRATILTGQYSIHH-GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 +P+R ++TG G + P Q TLP++ H QGY T +GKWH+G Sbjct: 78 CTPSRYALMTGNNPYRAKGTGILPGDAALIIPQDKITLPKVFHQQGYTTGIVGKWHLGLG 137 Query: 195 KE----------SQPQNVGFDDFRGFNSVSDMYTEWRDVH-----VNPEVALSPDRSEYI 239 ++ P VG+D F + +D + + + + + + I Sbjct: 138 EQVEKDWNGKIAPGPLEVGYDYSFIFPATADRVPTVFLENHYVLAADAKDPIQVNYRQKI 197 Query: 240 KQLPFSKDDVHAVRG------GEQQAIADITPK--YM----------EDLDQRWMDYGVK 281 P K++ ++ G I + + +M E+L + + + Sbjct: 198 GNEPTGKENPELLKLHASPGQGHDNTIVNGIGRIGWMTGGKDARWADEELTLTFFEKAKE 257 Query: 282 FLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLE 341 F+ KPFFL Y H P + G S GD +++++ L + L+ Sbjct: 258 FIKT--NQKKPFFLCYNATEPHVPRMPATLFKGKSKL-GLRGDAILQLDYTVGQLVQELK 314 Query: 342 KNGQLDNTLIVFTSDNGP-----------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVY 390 NG +NT+I+FTSDNGP E +RG K S +E G RVP V Sbjct: 315 NNGLYENTIIIFTSDNGPVLDDGYADQAVEKSANHDAFGGWRGGKYSAFEAGSRVPFLVS 374 Query: 391 WKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 W +I+ +SD ++ DL + G PK +D +Q +GT+ + Sbjct: 375 WPAVIKGGQQSDALIGQVDLLASFAGQLGVSY-------PKDQAVDSQNQWKTLIGTDKK 427 Query: 450 SNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 + + ++ ++KY + + G ++NL T Sbjct: 428 G---RTYLVKSSGTFSIIQGDYKYIKPRKGAKIDKAVNIELG------NDEQPQLYNLRT 478 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHA 534 D E ++I +H L+ + + Sbjct: 479 DKAEKENIAAKHTNKVKELEQLLQS 503 >UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN Length = 569 Score = 365 bits (936), Expect = 3e-99, Method: Composition-based stats. Identities = 120/503 (23%), Positives = 189/503 (37%), Gaps = 78/503 (15%) Query: 87 NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTG 146 +++ L DD G+ DVG++G + TP +D +A++G+ L + Y QP +P+R+ +LTG Sbjct: 48 HIIFILTDDQGYHDVGYHGSDI----ETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTG 103 Query: 147 QYSIHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVG 203 +Y IH G+ + QP L TLPQ L + GY T +GKWH+G KE P G Sbjct: 104 RYQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 163 Query: 204 FDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD 262 FD F G D YT + D V G + + Sbjct: 164 FDTFLGSLTGNVDYYT-------------------------YDNCDGPGVCGFDLHEGEN 198 Query: 263 ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-----SP 317 + + +P FLY + H +Y + Sbjct: 199 VAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHTPLQSPREYLYRYRTMGNV 258 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGS 377 AR Y + M++ N+ L++ G +N++I+F+SDNG + P RG KG+ Sbjct: 259 ARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQT-FSGGSNWPLRGRKGT 317 Query: 378 TWEGGVRVPTFVYWK-GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 WEGGVR FV+ + R S ++ + D +PT + LAG + +DG Sbjct: 318 YWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADG-------LDG 370 Query: 437 VDQTSFFLGTNGQSNRKAEH-------------------YFLNGKLAAVRMDEFKYHVLI 477 D + H + AA+R+ E+K Sbjct: 371 YDVWPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTGD 430 Query: 478 QQPYAYTQSGYQGGFTGTVMQT-------AGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 + F G+ +FN+ DP E + + + + L Sbjct: 431 PGYGDWIPPQTLATFPGSWWNLERMASVRQAVWLFNISADPYEREDLAGQRPDVVRTLLA 490 Query: 531 EMHAYMEIL--KKYP---PRAQI 548 + Y +YP PRA Sbjct: 491 RLAEYNRTAIPVRYPAENPRAHP 513 >UniRef50_D2QCX4 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QCX4_9SPHI Length = 533 Score = 365 bits (936), Expect = 3e-99, Method: Composition-based stats. Identities = 134/558 (24%), Positives = 205/558 (36%), Gaps = 154/558 (27%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 ++ K+PN++ L DD+G+ D+G GG V TP++D +A+ G+ L S Y+ PTR Sbjct: 36 QSVKRPNILYILADDMGFSDIGCYGGEVN----TPNLDKLAAGGIKLRSFYNNARCCPTR 91 Query: 141 ATILTGQYSIHHGI----LMPPMYGQPGGLQGL-----TTLPQLLHDQGYVTQAIGKWHM 191 A++LTGQY G+ MP QPG QG T+ + L + GY T +GKWH+ Sbjct: 92 ASLLTGQYPHTVGMGLMVTMPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHV 151 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 GE E P GF+ + G S + Y E + D+ Sbjct: 152 GERPEHWPLKRGFEHYFGLISGASSYYEIIPAEKGKRFIVLDDK---------------- 195 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS--DKPFFLYYGTRGCHFDNYP- 308 + YM D + DY V++L++ + DKPFF+Y HF + Sbjct: 196 ------EFTPPADGFYMTD---AFTDYAVQYLNQQKQEQADKPFFMYLAYTAPHFPLHAY 246 Query: 309 --------------------------------NAKY------------------AGSSPA 318 + +Y A Sbjct: 247 ESDIAKYEKLYAQGWDVTRTKRYQKMQQLGLIDKRYQLTPRPANVPAWNSATDKAQWIRK 306 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-------------PEAEVPP 365 Y + M+ L KTL+ NGQ DNTLIVF SDNG P ++ Sbjct: 307 MAVYAAMIDRMDQNIGRLIKTLKANGQYDNTLIVFMSDNGSSNENMESRKLNDPTKKIGE 366 Query: 366 HGR-------------TPFRGAKGSTWEGGVRVPTFVYWKGMIQP--RKSDGIVDLADLF 410 G TPFR K EGG+ P + W I+P DGI + DL Sbjct: 367 RGSYVTYDTPWANVSVTPFRKYKRFLHEGGMITPCIMQWPRNIRPAAGYVDGIGHVMDLL 426 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470 PT+L+LAG + G S+ R + + + A+R + Sbjct: 427 PTSLELAGLSAND----------LPGKSL-SYLWTPKKTEPRT--YCWEHEGNKAIRKAD 473 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K + A ++N+ TDP E++ + ++T Sbjct: 474 WKLV--------------------KDTEDADWELYNIKTDPCETNDLARNQPQRVASMRT 513 Query: 531 EMHAYMEI--LKKYPPRA 546 E + + +++ P Sbjct: 514 EFDTWAQRVGVRERPAGK 531 >UniRef50_Q1QJ61 Sulfatase n=3 Tax=Bacteria RepID=Q1QJ61_NITHX Length = 496 Score = 364 bits (935), Expect = 4e-99, Method: Composition-based stats. Identities = 130/470 (27%), Positives = 214/470 (45%), Gaps = 38/470 (8%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 + + PNVV FL+D++G+ ++G GGG+ G T IDA A +G+ L + + +P+ Sbjct: 43 RTSNGPPNVVYFLVDNLGYGELGCYGGGILRGADTRRIDAFADEGIKLLNFAPEAQCTPS 102 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 R+ ++TG+Y+I G + G+ GGL T+ +L +GY T +GKWH+GE+ Sbjct: 103 RSALMTGRYAIRSGNHTVALPGEEGGLVAWERTMGDVLSARGYATACVGKWHVGESAGRW 162 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P + GFD++ G W + + P R L K D + Sbjct: 163 PTDHGFDEWYGPPRS------WDESLWPTDPWYDPKRDPVSNMLESRKGD------RTPR 210 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 + + D+D+ + G F+ + + + FFLY+ H P A++ G S Sbjct: 211 TVKQLDLNVRRDVDRELLTRGKAFMKRSVDAKRSFFLYFNHSLMHMPTIPRAEFRGKS-G 269 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKG 376 + + DC++E++ F + TL++ DNT++VF+ DNGPE P G TP F G+ Sbjct: 270 QGDWADCLLELDSDFGEILDTLKELKVDDNTIVVFSGDNGPEELEPWRG-TPGFFDGSYF 328 Query: 377 STWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 + EG +R P V + G + P +S+ IV + D+F L AG +P ID Sbjct: 329 TGMEGSLRTPCMVRYPGRVPPGKQSNDIVHITDMFTIILQWAGA-------AMPTDRVID 381 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 G+DQ +FF G S R Y++ L V+ FK +Q+ T Sbjct: 382 GIDQRAFFEGKQNNSARDGIPYWMADTLYGVKWRNFKMVFYLQKT-----------LTEP 430 Query: 496 VMQTAGSSVFNLYTDPQESD--SIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++ + + NL DP+E + H M + +K+ P Sbjct: 431 ALKLSTPHIINLTVDPKERKAFDLPYIHSWTAAHFGRIMKDFAISVKREP 480 >UniRef50_B4CZ78 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZ78_9BACT Length = 527 Score = 364 bits (935), Expect = 4e-99, Method: Composition-based stats. Identities = 115/497 (23%), Positives = 185/497 (37%), Gaps = 55/497 (11%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 +T PN+V+ L DD+G+ VG G TP+ID +A +G T A + Sbjct: 18 APAAETTSTPNIVIILADDLGYGSVGCFGAD-GKLVRTPNIDRLAHEGRRFTDANTTSSV 76 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +PTR ++LTG+Y + + L + +L GY T AIGKWH+G Sbjct: 77 CTPTRYSLLTGRYCWRTSLKYETLNTFAPMLIEPTRYNMASMLKAHGYHTAAIGKWHLGY 136 Query: 194 NKESQ---------------PQNVGFDDFRGFN-SVSDMYTEWRDVHVNPEVALSPDRSE 237 + P +GFD + D+ + + H + ++ Sbjct: 137 GDGKKDPKYRVDYTAELAPGPNELGFDYHFAVPQNHGDVTGVYVENHYVYGLRSGKIPAD 196 Query: 238 YIKQLPFSKDDVHAVR--------GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 P D+ A G D + + + D ++++ K+ Sbjct: 197 LKLPAPVPDDENFAPTYNSESQQGHGHTPMEIDAPRRVDDRVMPELTDQAAHWIEQQ-KA 255 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 PFFLY+ H P+ G+S A +GD + E++ + +TLEK G NT Sbjct: 256 GTPFFLYFAPVAVHEPVTPSRDTRGTSQA-GRFGDWIHELDRTVGRVLETLEKQGFAQNT 314 Query: 350 LIVFTSDNG---------PEAEVPPHG---RTPFRGAKGSTWEGGVRVPTFVYWKGMIQP 397 L++FTSDNG PE + G +RG K ++GG VP W G I Sbjct: 315 LVIFTSDNGGIYEPTQKRPEMDAVHAGLAVNGQWRGGKTHVFQGGFNVPFIARWPGKIPA 374 Query: 398 RK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN-GQSNRKAE 455 S ++ L D+ T + G D + LG Q R Sbjct: 375 GTESREMISLVDVLATTAAIVGEKLPSAEKA-----AEDSCNILPALLGEKYDQPLRSDM 429 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 N + A+R +K+ + P G + + ++NL DP ES Sbjct: 430 VEHSNDGVFAIRKGPWKWIEGV--PVKQISPGLRKAHAAEFQR----QLYNLAEDPTESK 483 Query: 516 SIGVRHIPMGVPLQTEM 532 + +H + L+ + Sbjct: 484 DVSEQHPEIVKELEAAL 500 >UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY9_9BACT Length = 590 Score = 364 bits (935), Expect = 5e-99, Method: Composition-based stats. Identities = 110/472 (23%), Positives = 179/472 (37%), Gaps = 82/472 (17%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 KPN+V+ L DD G+ D+ +G + TP +D +A G + + +PTRA Sbjct: 22 AEDKPNIVLILTDDQGYGDISSHGNRMI---DTPHLDQLAEDGTRFENFFVSNVCAPTRA 78 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 ++LTG+Y I G++ G T+ ++ QGY T GKWH GE+ + P Sbjct: 79 SLLTGRYHIRTGVVQ-VSRGLEIMRSEEATIAEVFKAQGYETGLFGKWHNGEHYPNNPPG 137 Query: 202 VGFDDFRGF--NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD++ GF + D + D + ++K F Sbjct: 138 QGFDEYFGFCAGHIGDFFDATLDHN-----------KTFVKTKGF--------------- 171 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY------A 313 + D + +++K DKPFF Y H KY Sbjct: 172 -----------ITDVLTDRAIDWIEKQ--QDKPFFAYIPYNAPHAPYQVEDKYYDEFAAK 218 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 G S A ++ + ++D L K L+ DNT+++F +DNGP + P +G Sbjct: 219 GYSAAHSAAYGMIENLDDNIGRLLKILDDLNLTDNTIVIFLTDNGPNS--PTRFNGGMKG 276 Query: 374 AKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 +KGS EGGVRVP F+ W G I + R + D+ PT ++LAG V Sbjct: 277 SKGSVDEGGVRVPFFIRWPGKIAKGRTIHDLAAHIDVLPTLMELAGV-------NVDLPN 329 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 +DG TS + +L + Q P G G Sbjct: 330 KLDGRSLTSLISSSKTPKAPA-----WPERL-----------IFTQGPGTNMTPGSGAGA 373 Query: 493 T-----GTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 V+ ++++ DP + + + L+ +++ + Sbjct: 374 ARSNQYRYVLSRGEEGLYDMINDPGQEKDLKKSKKKIFDELKAAYIEWLKDV 425 >UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacteroidetes RepID=C6I6Z4_9BACE Length = 504 Score = 364 bits (935), Expect = 5e-99, Method: Composition-based stats. Identities = 120/520 (23%), Positives = 186/520 (35%), Gaps = 94/520 (18%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 + L+ + +PNV+ ++DD+G+ D+G G TP+ID + G+ T Sbjct: 11 SVFALSAKSQVKESRPNVIYIIMDDLGYGDIGCYGSEKI---ETPNIDRLYKDGISFTQH 67 Query: 131 YS-QPSSSPTRATILTGQYSIHHGIL-------------------MPPMYGQPGGLQGLT 170 Y+ P S+P R ++TG +S H I P + GQ Sbjct: 68 YTGSPVSAPARCVLMTGMHSGHAQIRANDEMAYRGAIMNYDSMYVHPGLEGQYPLKAHTM 127 Query: 171 TLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 TL +++ GYVT GKW +G E P GFD F G+N ++ + P Sbjct: 128 TLGRMMQQAGYVTGCFGKWGLGAPGTEGTPNKQGFDSFYGYNCQRQAHSYY------PAF 181 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYM-EDLDQRWMDYGVKFLDKMAK 288 + Y+ G + + A E + D + F+ + Sbjct: 182 LYKNEDRVYLANKVLDPHTTKLDAGADPRDEAAYAKFSQKEYANDLIFDELISFVGQ--N 239 Query: 289 SDKPFFLYYGTRGCHFDNYPNAKY-----------------AGSSPAR---TSYGDCMVE 328 KPFFL + T H K+ AG P R +Y + Sbjct: 240 RKKPFFLMWTTPLPHVSLQAPEKWVKYYVGKFGDEAPYIGKAGYMPCRYPHATYAAMISY 299 Query: 329 MNDVFANLYKTLEKNGQLDNTLIVFTSDNGP-----EAEVPPHGRTPFR----GAKGSTW 379 ++ L + L+K DNT+I+FTSDNGP PFR K Sbjct: 300 FDEQIGKLIEKLKKERLYDNTVIMFTSDNGPTFNGGSDSPWFDSGGPFRSEYGWGKCFVH 359 Query: 380 EGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 EGG+R+P V W G I+P +SD I D+ PT D+A + DG+ Sbjct: 360 EGGIRIPAIVTWPGKIKPSTQSDHICGFQDVMPTLADIANIACPET----------DGIS 409 Query: 439 QTSFFLGTNGQSNRKAEHYFLNGK----LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 LG + Y+ L A+RM ++K G Sbjct: 410 FLPALLGETERQKEHEYLYWEYPDPTIGLKAIRMGKWK-----------------GIVNN 452 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + +++L +D +E + H + L M Sbjct: 453 IRKGNSTMELYDLESDLREEHDVAAEHPDIVRKLTRLMEK 492 >UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186ED10 Length = 570 Score = 364 bits (934), Expect = 6e-99, Method: Composition-based stats. Identities = 125/528 (23%), Positives = 198/528 (37%), Gaps = 103/528 (19%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PN+++ L DD+GW DV F+G TP+IDA+A G+IL S Y +P+RA++ Sbjct: 45 ERPNIIIILADDLGWNDVSFHGSNQI---QTPNIDALAYNGIILNSHYVPALCTPSRASL 101 Query: 144 LTGQYSIHHGILM-PPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQ 200 +TG+Y G+ + +P GL T +P+ + GY T A+GKWH+G KE P Sbjct: 102 MTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEYTPI 161 Query: 201 NVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD G +N D Y +S Y + F D + Sbjct: 162 YRGFDSHFGHWNGFQDYYDH---------TTMSDSLKGYDMRRNFEVDYSYQ-------- 204 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG----- 314 Y D+ + +K +D P FLY H N N A Sbjct: 205 -----GMYTTDV---FTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPEDEIS 256 Query: 315 -----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EVPPH 366 + P R Y + ++++ + LEKN L+N++I+F SDNG Sbjct: 257 KHECINDPGRKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRG 316 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVA 425 P RG K S WEGGVR ++ + + R S ++ ++D PT L AG + Sbjct: 317 SNYPLRGLKESPWEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAGLNYSSTQ 376 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDEFKYHVLIQQ---- 479 + IDG+D + + S RK Y +++ +D +KY Q Sbjct: 377 LI----NKIDGIDMWNVL-SNDLPSPRKEVFNNYDEIENYSSLMIDSWKYVEGTAQEGKA 431 Query: 480 PYAYTQSGYQGGF------------------------------------------TGTVM 497 Y + + T V+ Sbjct: 432 DYWFEEPSRNNCSEYRVSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVKNKTKYVL 491 Query: 498 QTAGSS----VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +FNL DP E ++ + ++ + + + K Sbjct: 492 TCDPLLKRFCLFNLNDDPCERLNLADVFPDVVKRIKNRLLELKKSVVK 539 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 364 bits (934), Expect = 6e-99, Method: Composition-based stats. Identities = 106/485 (21%), Positives = 177/485 (36%), Gaps = 81/485 (16%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + +PNV++ + DD GW +VGF+G V TP++D A++G LT+ Y Sbjct: 35 SSVTVAVAAEPRPNVILVMTDDQGWAEVGFHGNEVL---KTPNLDRFAAEGTELTNFYVS 91 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 P +PTR++++TG+Y G G+ TT+ ++ GY T GKWH+GE Sbjct: 92 PMCTPTRSSLMTGRYHFRTG-AHDTYIGRSNMNPEETTIAEVFAGAGYRTGIFGKWHLGE 150 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 N + ++ GF + A P + + L ++ Sbjct: 151 NFPMRAEDQGFQK-----------VVVHGGGGIGQFADYPGNTYWDPTLQYN-------- 191 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 D K ++D ++F+ ++PFF Y H ++ Sbjct: 192 --------DSFKKAKGYCTDVFIDESIQFMKD--SGEQPFFCYLPLNVPHSPFDVADEFR 241 Query: 314 GS-------SPARTSYGD----CMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 P + + + + F L + +E GQ +NT+I+F SDNGP + Sbjct: 242 ADYDNQNLADPDGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMSDNGPNST 301 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPG 421 G R KGS +E G+R P + W +Q RK D DL PT D G Sbjct: 302 YFTAG---LRAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHIDLLPTLADACGI-- 356 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-------KLAAVRMDEFKYH 474 +P +DG G ++ N + R +K Sbjct: 357 -----GLPADLQVDGKSILGLLHGETQGFQQRYLFMQHNRANVPPKYENCMARRGPWKVV 411 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 G + G ++N+ DP E+ + +H + E A Sbjct: 412 -------------------GDGGEPTGFELYNIEQDPGETRDLADKHPEIVKAFVREYEA 452 Query: 535 YMEIL 539 + + + Sbjct: 453 WFDDV 457 >UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7AKS6_9PORP Length = 464 Score = 364 bits (934), Expect = 6e-99, Method: Composition-based stats. Identities = 111/476 (23%), Positives = 182/476 (38%), Gaps = 68/476 (14%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + + + ++PN+++ L DD G+ D GF G A TP+ID +A++G I T A+ Sbjct: 22 SCSSGQDEEAQRPNILILLADDAGYADFGFMG---ATDIQTPNIDRLAAEGCIFTDAHVA 78 Query: 134 P-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 SSP+R+ +LTG+Y +G G LP LL Y T IGKWH+G Sbjct: 79 ATVSSPSRSMMLTGRYGQRYGYECNLDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLG 138 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 +P GFD F G + Y +PE + D+ ++Q ++ + Sbjct: 139 SEPSQRPNAKGFDTFYGLLAGHRSYFY------DPETS---DKDGNLQQYQYNGRKL--- 186 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN--- 309 + +F+ + S++PF LY H N Sbjct: 187 -------------SFDGYFTDELASKAQQFVTE---SEQPFMLYMSFTAPHSPNEATEED 230 Query: 310 -AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 A++ G R Y M ++ + L+ G+ DNT+I F SDNG Sbjct: 231 LARFEGQP--RQKYAAMMYALDRGVGKIVDELKAAGKFDNTIIFFLSDNGGST-TNQSSN 287 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANL 427 P +G KG+ +EGG RVP FV W + ++ G+ D+F T +D P + Sbjct: 288 LPLKGFKGNKFEGGQRVPFFVVWGDRFKRDQRFTGLTSSLDIFATVVDALDIPEEGLHK- 346 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 IDGV + G + +A ++ A+R +K + Sbjct: 347 -----PIDGVSLLPYLSGEKSGNPHEAL-FWRKMDTRAIRSGSYKLIITRGVDSV----- 395 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++N+ D +E + L ++ + + K P Sbjct: 396 ----------------LYNMDQDVEEMHDLLSSEPEKARELMEQLSEWEQACCKDP 435 >UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG54_9BACT Length = 469 Score = 363 bits (933), Expect = 8e-99, Method: Composition-based stats. Identities = 118/480 (24%), Positives = 179/480 (37%), Gaps = 69/480 (14%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + PNVVV DD GW D G GG V T ID +A G+ T Y+ P+ SP Sbjct: 22 QVQAAPPNVVVIYFDDTGWKDFGCFGGAV----DTTHIDNLAKNGMRFTEYYAPAPNCSP 77 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKES 197 +RA +LTG++ G+ P L T+ + L +GY T GKWH+G Sbjct: 78 SRAGLLTGRFPFRLGMYSYRSKNTPMHLPDSEITIAEALKTKGYATGMFGKWHLGNLDGK 137 Query: 198 ---QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 P GFD + + + + NP+ + + V + G Sbjct: 138 SHPTPSEQGFDYW--------LACDNNLIKHNPKSLIRNGKP------------VGKIAG 177 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY-- 312 Q +AD ++M K PFF Y H + Sbjct: 178 WAAQVVADEANEWM------------------KKQTSPFFAYIAFSETHSPLDAPEELIT 219 Query: 313 ----AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 G + R +Y + ++ KTL+ G DNTL+ SDNGP +E G Sbjct: 220 KYIERGENKKRATYRGMTEYSDAAVGSILKTLDDMGVSDNTLVFLASDNGPTSEDSCEG- 278 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANL 427 RG K TWEGG+RVP + W G ++P + V DL PT D+ G K Sbjct: 279 --LRGKKSYTWEGGIRVPAIIRWPGKVKPGSEYNDPVGGIDLLPTLCDIVGAELPK---- 332 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAE-HYFLNGKLAAVRMDEFKY--HVLIQQPYAYT 484 IDGV S G + N ++ A++RM ++ H + Sbjct: 333 ----RHIDGVSIRSVLEGKPFKRNTPILSFFYRTSPAASMRMGDYVLIGHSDDEDRKKSH 388 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA-YMEILKKYP 543 + + ++N+ D + +I + L+ M A + + + + P Sbjct: 389 SMSAEDMPIVKSSKLVSFELYNIKNDLGQEKNIAATYPEKLAELRKIMLALHHDAISEGP 448 >UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CBC Length = 496 Score = 363 bits (933), Expect = 8e-99, Method: Composition-based stats. Identities = 120/509 (23%), Positives = 188/509 (36%), Gaps = 83/509 (16%) Query: 51 ATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAV 110 +T++ ++ ++ + KPN+V+ L DD G+ V Sbjct: 8 TSTLSTALLAILPIAKASAQ------HTTPSHPDKPNIVIILADDQGYGGVNCY--PHIK 59 Query: 111 GNPTPDIDAVASQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL 169 TP+ID +A+ G+ Y+ SSPTRA ++TG+Y G G Q Sbjct: 60 KIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTGKYQQSFGFYGLSTPHVGGIPQDQ 119 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGF-NSVSDMYTEWRDVHVNPE 228 L + L + GY T IGKWH+G+ S P N GF F GF N + D Y +P Sbjct: 120 KLLSEYLVENGYNTACIGKWHLGDYIRSHPNNRGFQTFFGFINGLHDYY--------DPL 171 Query: 229 VALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK 288 V S D L F+ D++ V ME + V F+ K A Sbjct: 172 VGGSWD--GVYNGLAFTLDNMEPVTE-------------MEYSTYEYTKRAVDFIQKNA- 215 Query: 289 SDKPFFLYYGTRGCHFDNYPNAKYAG------SSPARTSYG-DCMVEMNDVFANLYKTLE 341 D PFFLY H + G + ++ + +TLE Sbjct: 216 -DHPFFLYLPYNAIHSPLQAPEELIGELAINPQEIGKDDIARAMTFALDQGVGKVVETLE 274 Query: 342 KNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS- 400 + G DNT+I + SDNG V + FRG KGS +EGG+RVP V + + Sbjct: 275 QLGLRDNTIIFYLSDNGA---VEYSDKWEFRGRKGSYYEGGIRVPFIVSYPAKLAKGTIY 331 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN 460 + V D+ PT ++LAG A + GV+ + G + ++ Sbjct: 332 NKPVMSIDIAPTVMELAGLSHADMH----------GVNLLPYLSGKDRTEPHDVLYWSTE 381 Query: 461 GKL--------AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 K A+R ++K Y ++++ DPQ Sbjct: 382 KKSNNQVFKNEFAIRQGKWKLVSDPHFEKDYD-------------------LYDIEADPQ 422 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 E + ++ L ++ + + Sbjct: 423 EKHGLKDQYPEKYKELFGMYLNWINQMPE 451 >UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKM2_9BACT Length = 472 Score = 363 bits (933), Expect = 9e-99, Method: Composition-based stats. Identities = 112/503 (22%), Positives = 185/503 (36%), Gaps = 115/503 (22%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTRA 141 +KPN+++ L DD+G +G G TP+IDA+A++ + +AYS +P+RA Sbjct: 17 AQKPNIILILADDLGGAGLGCYGNEFFG---TPNIDALAAKSMRFDNAYSGSTVCAPSRA 73 Query: 142 TILTGQYSIHHGILM--------------PPMYGQP--------GGLQGLTTLPQLLHDQ 179 +++GQY H I P + G +G TL Q D Sbjct: 74 CLMSGQYVGRHKITWVSQFQRDYIKKKRGPNLNGFRLLQPVHPYHMPEGTITLGQAFKDA 133 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GY T GKWH+G + QP +GFD++ F + Sbjct: 134 GYATAMFGKWHLGHRPQDQPDKMGFDEYLTFQGMKHFAPYTLPN---------------- 177 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 V+ GE+ + D+T D + F+++ ++KPFFLYY Sbjct: 178 -----------KVQHGEKVYLTDLTC-----------DKAIDFMERKVAAEKPFFLYYPD 215 Query: 300 RGCHFD--------NYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 H Y K G ++D L K +++ G +NT+I Sbjct: 216 FLVHAPMEAKQAMIQYFEKKTIGQHHKSVIGAAMTKHLDDTVGRLVKKVDELGIAENTII 275 Query: 352 VFTSDNGPEAEVPPHG-------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGI 403 +FTSDNG G P+R AK S +EGG RVP +W G+ + S + Sbjct: 276 IFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSSHYEGGSRVPLIFHWPGVTEANSLSHEV 335 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF----- 458 V D++PT L +A P+ +DG+D +S + + ++ Sbjct: 336 VSGIDIYPTLLKIA-------QVAKPQEQILDGIDFSSILKNPKQKLPARDLFHYQPIYN 388 Query: 459 ---LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 ++R + KY + +FNL D + Sbjct: 389 HKVFGDASVSLRRGDMKYIYYFVEENF--------------------ELFNLKDDVSQKK 428 Query: 516 SIGVRHIPMGVPLQTEMHAYMEI 538 + + + L+ +++ Sbjct: 429 DLSADYPELCEELKKACFKHLDE 451 >UniRef50_P50473 Arylsulfatase n=8 Tax=Deuterostomia RepID=ARS_STRPU Length = 567 Score = 363 bits (932), Expect = 9e-99, Method: Composition-based stats. Identities = 129/461 (27%), Positives = 186/461 (40%), Gaps = 60/461 (13%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATI 143 KPNV++ L DD+G D+ G ID +A+QGL T YS +P+R+ I Sbjct: 66 KPNVILLLADDMGVGDLSVYGHPTQEPGF---IDQMANQGLRFTQGYSGDSVCTPSRSAI 122 Query: 144 LTGQYSIHHGILMPPMYGQP----GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE--- 196 +TG+ I G+ P G T+ + + GY T +GKWH+G N+ Sbjct: 123 VTGRQPIRTGVYGEERIFLPWTTTGLPLYEVTIAEAMKGAGYTTGMVGKWHLGINENSSS 182 Query: 197 ---SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 P N GFD F G N D ++ + + Y ++ H Sbjct: 183 DGAHLPANRGFD-FVGHNLPFGNSWRCDDTGLHQDFPDTNACFLYYNSTSVAQPFQH--- 238 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 + L Q D V F++ KPFF+Y H + + ++ Sbjct: 239 ---------------KGLTQLLRDDTVGFIEDNVN--KPFFMYVSFAHMHTSLFSSDDFS 281 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR--TPF 371 +S R YGD + EM+ + TL N DNT+I FTSD+GP E G F Sbjct: 282 CTSR-RGRYGDNLREMDQAIEQIVTTLVDNDIDDNTVIFFTSDHGPHREYCGEGGDANVF 340 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 RG KG +WEGG R+P VYW G I P S IV D+ TA++L G + +P Sbjct: 341 RGGKGQSWEGGHRIPYIVYWPGTISPGVSHEIVTSMDIIATAVNLGG-------SQLPTD 393 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ-- 489 DG S L S Y+ L AVR+ ++K H Q + + G + Sbjct: 394 RIYDGKCLKSVLL-EGASSPHDDFFYYCKDTLMAVRVGKYKAHFKTQTDSSQMKLGERCD 452 Query: 490 ------------GGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 V + +F+L DP E+ +G Sbjct: 453 GGFPLDDYFLCSDCEGDCVTEHNPPIMFDLEKDPGENYPLG 493 >UniRef50_C6Z6I9 N-acetylgalactosamine 6-sulfate sulfatase n=5 Tax=Bacteroides RepID=C6Z6I9_9BACE Length = 471 Score = 363 bits (932), Expect = 9e-99, Method: Composition-based stats. Identities = 113/495 (22%), Positives = 192/495 (38%), Gaps = 83/495 (16%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 + +K + +PN++ L DD+G+ D+ G TP+ID +A G Y+ Sbjct: 26 QNDKVSPDRPNIIFILADDMGYGDLSCYGNQYV---KTPNIDQLAETGTRFNQCYAGSGI 82 Query: 136 SSPTRATILTGQYSIHH-------------GILMPP-----MYGQPGGLQGLTTLPQLLH 177 SSP+R +LTG+ + + GI + P + + L TT+ +L Sbjct: 83 SSPSRCALLTGKNTGNTRIRDNMCTAGGIAGIKINPNGDSTIVRRANLLPQDTTIATILS 142 Query: 178 DQGYVTQAIGKWHM-GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T + KWH+ G +K + P + GFD+F G+ + + Sbjct: 143 AAGYRTCLVNKWHLDGYDKGASPNHRGFDEFYGWTIST------VHSNSPYYYPYYRFHG 196 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + + +P + + H + + + F+ + D PFFLY Sbjct: 197 DSLIHIPENANGKHGIHNN-----------------DLSTNDAIAFIKR--NKDNPFFLY 237 Query: 297 YGTRGCHFDNYPN--AKYAG---SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 H + A Y G P Y + M+ L LE+ NTLI Sbjct: 238 LAFDAPHEPYNIDQTAWYEGQETWEPNTKRYASLITHMDAAIGRLLNELEQLNLRKNTLI 297 Query: 352 VFTSDNGPEAEVPP---HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLAD 408 +F SDNG + P + F+G KG+ +EGG+RVP V GM+ + D ++ D Sbjct: 298 IFASDNGAAIQAPIKILNCNAGFKGRKGTLYEGGIRVPFIVNQPGMVPIQTLDNLIYFPD 357 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 + PT LA K +PK I+G++ F G ++ + ++ GK A R Sbjct: 358 MMPTLAALA-----KGTKHLPKQ--INGINILPLFYGKQVDTDNRLLYWEFPGKQRAARK 410 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++K T+ ++NL DP+E++++ ++ Sbjct: 411 GDWKCV--------------------TIHPNQPLELYNLKEDPEETNNLAKKYPRRVKEF 450 Query: 529 QTEMHAYMEILKKYP 543 EM +P Sbjct: 451 DEEMQRMHIPTPNWP 465 >UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN Length = 599 Score = 363 bits (932), Expect = 1e-98, Method: Composition-based stats. Identities = 113/510 (22%), Positives = 198/510 (38%), Gaps = 75/510 (14%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 E + +P+++ L DD G+ DVG++G + TP +D +A++G+ L + Y QP Sbjct: 67 EPSTTSTSQPHLIFILADDQGFRDVGYHGSEI----KTPTLDKLAAEGVKLENYYVQPIC 122 Query: 137 SPTRATILTGQYSIHHGILMPPMYG-QPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGE- 193 +P+R+ +TG+Y IH G+ + QP L TLPQ L + GY T +GKWH+G Sbjct: 123 TPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFY 182 Query: 194 NKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 KE P GFD F G D YT ++ + D E + +++ Sbjct: 183 RKECMPTRRGFDTFFGSLLGSGDYYTHYK---CDSPGMCGYDLYENDNAAWDYDNGIYST 239 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 + Q+ V+ + KP FLY + H +Y Sbjct: 240 QMYTQR---------------------VQQILASHNPTKPIFLYIAYQAVHSPLQAPGRY 278 Query: 313 -----AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 + + R Y + +++ N+ L+ G +N++I+++SDNG + Sbjct: 279 FEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-GS 337 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 P RG+KG+ WEGG+R FV+ + +V + D +PT + LA Sbjct: 338 NWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA-------EG 390 Query: 427 LVPKTTFIDGVDQT-------------------SFFLGTNGQSNRKAEHYFLNGKLAAVR 467 + + +DG D + S + +A+R Sbjct: 391 QIDEDIQLDGYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIR 450 Query: 468 MDEFKYHV--------LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 + +K + Q ++ T+ +FN+ DP E + Sbjct: 451 VQHWKLLTGNPGYSDWVPPQSFSNLGPNRWHNERITLSTGKSVWLFNITADPYERVDLSN 510 Query: 520 RHIPMGVPLQTEMHAYMEIL--KKYPPRAQ 547 R+ + L + + + +YPP+ Sbjct: 511 RYPGIVKKLLRRLSQFNKTAVPVRYPPKDP 540 >UniRef50_C7RSC1 Sulfatase n=2 Tax=Bacteria RepID=C7RSC1_9PROT Length = 574 Score = 363 bits (932), Expect = 1e-98, Method: Composition-based stats. Identities = 139/481 (28%), Positives = 216/481 (44%), Gaps = 51/481 (10%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A L + GK+PN++ + DD+GWM G+ VG TP+ID + +G + Y++ S Sbjct: 24 AALAQAPGKRPNILFIMGDDIGWMQPSIYHQGLMVG-ETPNIDRIGQEGAKFMTYYAEQS 82 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWHMGEN 194 + R TG + G++ P + G P LQ G +L + L D GY T GK H+G++ Sbjct: 83 CTAGRTAFFTGMTPLRAGMIPPQLPGSPSFLQPGTPSLAKFLLDLGYTTGEFGKNHLGDH 142 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTE--WRDVHVNPEV--ALSPDRSEYIKQL---PFSKD 247 + P GF +F G+ D + D++ +P V + P ++ ++ L P + D Sbjct: 143 SAALPTAHGFQEFWGYLYHLDAMQGVSFPDINSSPTVQAIVPPCKNTPVRGLAEVPGAVD 202 Query: 248 DVHAV------------------RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK--MA 287 + + + +T K E +D+ + FLD+ Sbjct: 203 PKTTLCMTPPRPVLACTSSDGTEKNQTCKDEGPLTLKRSETVDEEISAKVIDFLDRNDPK 262 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGS--SPARTSYG---DCMVEMNDVFANLYKTLEK 342 K++KPFF++Y H + KY + +G M +M+D + K LE Sbjct: 263 KTNKPFFVWYNPARMHITTMLSDKYMAMVGTKGGKDWGTNEAAMKQMDDNIGYVLKKLED 322 Query: 343 NGQLDNTLIVFTSDNGPEA-EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS- 400 GQLDNT++VFT+DNG E P G TPF+G K +TWEGG+R P + W G I+P Sbjct: 323 MGQLDNTIVVFTTDNGAEVITYPDGGNTPFKGGKLTTWEGGMRAPAVIRWPGHIKPGTVL 382 Query: 401 DGIVDLADLFPTALDLAGHPGAK------VANLVPK--TTFIDGVDQTSFFLGTNGQSNR 452 + I D PT +++AG +A P T ++GV+Q + G + S R Sbjct: 383 NDIFASYDWMPTFVEIAGGAKGNDLNKQIMAGKYPGIVKTKLNGVNQLDYLTGKSATSAR 442 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 A Y+ +AVR +K + A GG G V V N+ DP Sbjct: 443 DAFFYYGGPVPSAVRYKNWKIYF------AMASEANTGGLMG-VHTFHWPLVANIRRDPF 495 Query: 513 E 513 E Sbjct: 496 E 496 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA... 649 0.0 UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=... 523 e-147 UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteri... 521 e-146 UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 519 e-145 UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 517 e-145 UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 516 e-145 UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN 512 e-143 UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 510 e-143 UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 510 e-143 UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 510 e-143 UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO 509 e-142 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 506 e-142 UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA 506 e-142 UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina D... 506 e-142 UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Ta... 503 e-141 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 502 e-140 UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bactero... 499 e-139 UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LD... 497 e-139 UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus E... 495 e-138 UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 495 e-138 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 492 e-137 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 491 e-137 UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT 491 e-137 UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM 490 e-137 UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT 490 e-137 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 489 e-136 UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6... 489 e-136 UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 489 e-136 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 488 e-136 UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS2... 487 e-136 UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 486 e-135 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 484 e-135 UniRef50_B3CAE2 Putative uncharacterized protein n=3 Tax=Bactero... 484 e-135 UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 482 e-134 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 482 e-134 UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y2... 480 e-134 UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 480 e-134 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 479 e-134 UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchi... 479 e-134 UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT 479 e-133 UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomy... 478 e-133 UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LC... 478 e-133 UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN... 478 e-133 UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 478 e-133 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 478 e-133 UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 478 e-133 UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 477 e-133 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 476 e-132 UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia caver... 474 e-132 UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 473 e-132 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 473 e-132 UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase... 472 e-131 UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 472 e-131 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 472 e-131 UniRef50_A6BZV9 Arylsulfatase n=3 Tax=Bacteria RepID=A6BZV9_9PLAN 471 e-131 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 471 e-131 UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD 470 e-131 UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 470 e-131 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 469 e-130 UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium ... 469 e-130 UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7... 468 e-130 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 468 e-130 UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 468 e-130 UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 468 e-130 UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 T... 467 e-130 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 466 e-130 UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8... 466 e-129 UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 465 e-129 UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 464 e-129 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 464 e-129 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 464 e-129 UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2... 463 e-128 UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto... 463 e-128 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 463 e-128 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 462 e-128 UniRef50_D2R783 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 461 e-128 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 461 e-128 UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 461 e-128 UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 461 e-128 UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW10... 461 e-128 UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 461 e-128 UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7... 460 e-128 UniRef50_Q7UG72 Arylsulfatase A [precursor] n=1 Tax=Rhodopirellu... 460 e-128 UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW... 460 e-128 UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 459 e-128 UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN 459 e-128 UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 459 e-128 UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob... 459 e-127 UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 458 e-127 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 458 e-127 UniRef50_A4AM21 Arylsulfatase A n=1 Tax=Flavobacteriales bacteri... 458 e-127 UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphing... 458 e-127 UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3... 458 e-127 UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6... 458 e-127 UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT 457 e-127 UniRef50_A1WGP9 Sulfatase n=6 Tax=Proteobacteria RepID=A1WGP9_VEREI 457 e-127 UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM... 457 e-127 UniRef50_A5FF56 Sulfatase n=2 Tax=Bacteria RepID=A5FF56_FLAJ1 456 e-127 UniRef50_B4D681 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 456 e-127 UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 456 e-126 UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN 456 e-126 UniRef50_UPI00016C41FE sulfatase n=1 Tax=Gemmata obscuriglobus U... 456 e-126 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 455 e-126 UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 455 e-126 UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica R... 455 e-126 UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 455 e-126 UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 455 e-126 UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep... 455 e-126 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 455 e-126 UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 454 e-126 UniRef50_A6C8R8 Arylsulfatase A n=2 Tax=Planctomycetaceae RepID=... 453 e-126 UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 453 e-126 UniRef50_A6DF77 Arylsulphatase A n=2 Tax=Lentisphaera araneosa H... 453 e-125 UniRef50_Q024K7 Sulfatase n=28 Tax=Bacteria RepID=Q024K7_SOLUE 453 e-125 UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 453 e-125 UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR4... 452 e-125 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 452 e-125 UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 451 e-125 UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 451 e-125 UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 450 e-125 UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 450 e-125 UniRef50_C1ZCM0 Arylsulfatase A family protein n=2 Tax=Bacteria ... 450 e-125 UniRef50_Q0KB87 Arylsulfatase A or related enzyme n=107 Tax=cell... 450 e-125 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 450 e-125 UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bac... 449 e-125 UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 449 e-125 UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bactero... 449 e-125 UniRef50_A6DP41 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 449 e-124 UniRef50_C7ZGP1 Predicted protein n=3 Tax=Leotiomyceta RepID=C7Z... 448 e-124 UniRef50_D2QZL2 Sulfatase n=8 Tax=cellular organisms RepID=D2QZL... 448 e-124 UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 448 e-124 UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 448 e-124 UniRef50_Q7UIN1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 447 e-124 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 447 e-124 UniRef50_B8KM61 Steryl-sulfatase n=2 Tax=gamma proteobacterium N... 446 e-124 UniRef50_Q7ULF9 Arylsulfatase n=4 Tax=Bacteria RepID=Q7ULF9_RHOBA 446 e-124 UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=... 446 e-123 UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organi... 446 e-123 UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 446 e-123 UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bactero... 446 e-123 UniRef50_A4A218 Arylsulfatase A n=2 Tax=Bacteria RepID=A4A218_9PLAN 445 e-123 UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bactero... 445 e-123 UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 445 e-123 UniRef50_C6Y1U6 Sulfatase n=2 Tax=Sphingobacteriales RepID=C6Y1U... 445 e-123 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 445 e-123 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 444 e-123 UniRef50_UPI0001A444F6 arylsulfatase A n=1 Tax=Pectobacterium ca... 444 e-123 UniRef50_A6DQE3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 444 e-123 UniRef50_C1ZFQ0 Arylsulfatase A family protein n=1 Tax=Planctomy... 444 e-123 UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=... 444 e-123 UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO 444 e-123 UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 443 e-123 UniRef50_Q1CY93 Sulfatase family protein n=4 Tax=Bacteria RepID=... 443 e-123 UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisp... 443 e-123 UniRef50_Q7UNN1 Arylsulphatase A n=3 Tax=Bacteria RepID=Q7UNN1_R... 443 e-123 UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 443 e-123 UniRef50_A6DG39 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 443 e-122 UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deute... 442 e-122 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 442 e-122 UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacte... 442 e-122 UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI... 442 e-122 UniRef50_A6DF76 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 442 e-122 UniRef50_C6VTS4 Sulfatase n=47 Tax=cellular organisms RepID=C6VT... 442 e-122 UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3... 441 e-122 UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacter... 441 e-122 UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 440 e-122 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 440 e-122 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 440 e-122 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 440 e-122 UniRef50_A6DHS3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 440 e-122 UniRef50_B4AUP3 Sulfatase n=2 Tax=Bacteria RepID=B4AUP3_9CHRO 439 e-122 UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 439 e-121 UniRef50_P15289 Arylsulfatase A component C n=34 Tax=Euteleostom... 439 e-121 UniRef50_D2R2H5 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 439 e-121 UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 439 e-121 UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA 439 e-121 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 438 e-121 UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 438 e-121 UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 438 e-121 UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT 438 e-121 UniRef50_Q7UIU1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 438 e-121 UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_B... 437 e-121 UniRef50_A6DMX8 Iduronate-sulfatase or arylsulfatase A n=1 Tax=L... 437 e-121 UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomy... 436 e-121 UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB... 436 e-121 UniRef50_Q7UYH4 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 436 e-121 UniRef50_UPI0001745666 N-acetylgalactosamine 6-sulfate sulfatase... 436 e-120 UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 436 e-120 UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 436 e-120 UniRef50_Q488V4 Sulfatase family protein n=30 Tax=Bacteria RepID... 436 e-120 UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 435 e-120 UniRef50_A6UG37 Sulfatase n=16 Tax=Bacteria RepID=A6UG37_SINMW 435 e-120 UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE 435 e-120 UniRef50_UPI0001AEC7EA iduronate-sulfatase and sulfatase 1 precu... 435 e-120 UniRef50_A6DHI4 Arylsulfatase A (ASA) n=1 Tax=Lentisphaera arane... 435 e-120 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 435 e-120 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 435 e-120 UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepI... 435 e-120 UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera aran... 435 e-120 UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D... 434 e-120 UniRef50_D2R207 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 434 e-120 UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdema... 434 e-120 UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomy... 434 e-120 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 434 e-120 UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis... 434 e-120 UniRef50_Q7UN55 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 434 e-120 UniRef50_Q46SG5 Arylsulfatase n=3 Tax=Proteobacteria RepID=Q46SG... 433 e-120 UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis... 433 e-120 UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris ... 433 e-120 UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD 433 e-120 UniRef50_C6VRQ8 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 180... 433 e-119 UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1... 432 e-119 UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacteriu... 432 e-119 UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bactero... 432 e-119 UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 432 e-119 UniRef50_B8KM62 N-acetylgalactosamine-6-sulfatase n=1 Tax=gamma ... 432 e-119 UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 431 e-119 UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 431 e-119 UniRef50_A6C4B6 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 431 e-119 UniRef50_A6LIX5 Arylsulfatase n=2 Tax=Bacteroidales RepID=A6LIX5... 431 e-119 UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA 430 e-119 UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 430 e-119 UniRef50_Q7UYH3 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 430 e-119 UniRef50_A6DMV0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 429 e-119 UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomy... 429 e-118 UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN 429 e-118 UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 429 e-118 UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI 429 e-118 UniRef50_B4CZ78 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 429 e-118 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 429 e-118 UniRef50_Q7URW3 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 429 e-118 UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 T... 429 e-118 UniRef50_A6DLD9 Sulfatase n=2 Tax=Chlamydiae/Verrucomicrobia gro... 429 e-118 UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN 429 e-118 UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 428 e-118 UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 428 e-118 UniRef50_Q1YP24 Arylsulfatase A n=1 Tax=gamma proteobacterium HT... 427 e-118 UniRef50_A0Z7U6 Arylsulfatase n=2 Tax=Gammaproteobacteria RepID=... 427 e-118 UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bactero... 427 e-118 UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME 427 e-118 UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY... 426 e-117 UniRef50_A4GIB2 Putative secreted sulfatase n=1 Tax=uncultured m... 426 e-117 UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB... 426 e-117 Sequences not found previously or not previously below threshold: UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomy... 497 e-139 UniRef50_P08842 Steryl-sulfatase n=59 Tax=Coelomata RepID=STS_HUMAN 461 e-128 UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 453 e-126 UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 453 e-125 UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas nept... 446 e-123 UniRef50_A6BYR0 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 441 e-122 UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 426 e-117 >UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA_ECOLI Length = 551 Score = 649 bits (1676), Expect = 0.0, Method: Composition-based stats. Identities = 551/551 (100%), Positives = 551/551 (100%) Query: 1 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP 60 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP Sbjct: 1 MEFSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMP 60 Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV Sbjct: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 Query: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG Sbjct: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK Sbjct: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR Sbjct: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE Sbjct: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP Sbjct: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP Sbjct: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK Sbjct: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 Query: 541 KYPPRAQIKSD 551 KYPPRAQIKSD Sbjct: 541 KYPPRAQIKSD 551 >UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=Q0C069_HYPNA Length = 505 Score = 523 bits (1349), Expect = e-147, Method: Composition-based stats. Identities = 126/498 (25%), Positives = 214/498 (42%), Gaps = 50/498 (10%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 +AE E ++PN+V+ +DD+G+ D+G G +A TP++D +A +G TS Sbjct: 31 APDSVAEKEAAASEQPNIVLIFVDDMGYADIGSFGSPIA---RTPNLDRLAMEGQKWTSF 87 Query: 131 YS-QPSSSPTRATILTGQYSIHHGILMPPMYGQ-------PGGLQGLTTLPQLLHDQGYV 182 Y+ P +P+RA ++TG+ ++ G+ G Q T+ +LL +GYV Sbjct: 88 YAPAPVCTPSRAGLMTGRLAVRSGMAGLVQARHVLFPTSTGGLPQSEVTIAELLQQEGYV 147 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 + A GKWHMG E P + GF + G +DM P +P + + Sbjct: 148 SAAFGKWHMGHLPEFLPTSHGFQSYFGIPYSNDM--------NMPGGGETPWSIDLFFEP 199 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 P ++ + E+ P L QR+ + ++F++ +PFFLY Sbjct: 200 PNIQNWDVPLMQDEEII---ERPADQFTLTQRYTERAIEFMETSHAEGQPFFLYLAHNMP 256 Query: 303 HFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 H + + + G S +YGD + E++ + L+ NTL++FTSDNGP Sbjct: 257 HTPLFTSEGFTGVSAG-GAYGDVIEELDWSVGEIVDALKDMKIEKNTLVIFTSDNGPWLA 315 Query: 363 VPPHGR--TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 + H R KG+TWEGG+RVP +W G I PR + DL PT ++G Sbjct: 316 MKTHSGSAGMLRDGKGTTWEGGMRVPAIFWWPGQIAPRTVTDLGSALDLMPTFAAISGA- 374 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 +P+ DG D + G S R+ +Y+ + AVR ++K H Sbjct: 375 ------RLPEDRVYDGFDLSPALFSE-GSSPRETLYYYRFTDVFAVRKGKYKAHFSTYGA 427 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL- 539 + + + ++++ DP E +I +H + + L+ + Sbjct: 428 FGGSG----------RTELETPELYDIEADPSEQFNIAAQHPEIVMELKVLAEKQAASVE 477 Query: 540 ------KKYPPRAQIKSD 551 ++YPP + + Sbjct: 478 PVENQLERYPPGEKRGEE 495 >UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteria RepID=A4CGL5_9FLAO Length = 526 Score = 521 bits (1344), Expect = e-146, Method: Composition-based stats. Identities = 136/531 (25%), Positives = 217/531 (40%), Gaps = 48/531 (9%) Query: 19 PLMASAADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAEL 78 P +T+T F ++ + ++ ++ +ET + Sbjct: 12 PKQFLIQSMKTTSTPLSKFL-----QKFSRWCQRCVRYPLLAIILLGVSCRETVKSEFAA 66 Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 + + PN+V+ DD G+ DVG G A PTP++DA+A+ GL+LT+ Y+ P S Sbjct: 67 ADRADRPPNIVIIFTDDQGYSDVGVYG---ARDIPTPNLDAMAADGLLLTNFYAAQPVCS 123 Query: 138 PTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 +RA +LTG Y GI M P G TL +LL QGY T GKWH+G++ + Sbjct: 124 ASRAGLLTGCYPNRVGIHNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDHPD 183 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD+F G +DM+ + P Sbjct: 184 FLPTRHGFDEFFGIPYSNDMWPLHPLQGPVFDFGPLPLY--------------------- 222 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 +Q T + L ++ + V F+++ ++PFFLY H + + + G S Sbjct: 223 EQERVVDTLEDQRLLTRQITERSVDFINRH--KEEPFFLYVPHPQPHVPLFVSDAFRGKS 280 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR--TPFRGA 374 R YGD ++E++ + LE NG D+T ++FTSDNGP H P R Sbjct: 281 -GRGLYGDVIMEIDWSVGQVLGALEDNGLTDDTWVIFTSDNGPWLAYGNHSGRAEPLREG 339 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 KG+ WEGGVR P + + G + K D + DL PT + G P Sbjct: 340 KGTNWEGGVREPCIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGSPQP--------GRE 391 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 IDG + G + + A +++ +L AVR ++K + Q G Sbjct: 392 IDGKNAWGLLSGAEARGPQDAYYFYYRVNELQAVRDGDWKLVLPHNYRTMQGQEPGADGL 451 Query: 493 TGTVMQTA--GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 G ++NL DP E++++ RH + + + + L Sbjct: 452 PGAYDYVDVTAPELYNLREDPGETNNLAERHPEVLAAISRKADSMRRRLGD 502 >UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED1_PARD8 Length = 459 Score = 519 bits (1337), Expect = e-145, Method: Composition-based stats. Identities = 132/496 (26%), Positives = 207/496 (41%), Gaps = 52/496 (10%) Query: 56 DNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTP 115 + + A L KPN V+ DD+G+ D+ G TP Sbjct: 2 KTLKSLSVLTAAVLSNSLSLNAASDAAN-KPNFVIIFCDDMGYGDLSCYGNPTI---RTP 57 Query: 116 DIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMP-----PMYGQPGGLQGL 169 +ID +A +G+ LT Y S+P+RA ++TG+ + +G+ + G Q Sbjct: 58 NIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDE 117 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 T+ ++L GY T +GKWH+G P + GFD + G +DM V Sbjct: 118 VTIAKVLQQSGYATGCVGKWHLGAFSPYLPTDHGFDTYFGIPYSNDMSP----------V 167 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 + P D + +L +R+ + V F+ +K Sbjct: 168 QNKGAHARNFPPTPLIVDGKQI-----------ESEPDQGELTRRYTEKAVSFIKNHSKE 216 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 PFFLY+ H Y NA++ G+S R YGD + E++ + K L +NG +NT Sbjct: 217 --PFFLYFAHTFPHIPLYTNARFEGTS-KRGLYGDVVEEIDWSVGEVLKALRENGLDENT 273 Query: 350 LIVFTSDNGPEAEVPPHGR--TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA 407 ++FTSDNGP +G P + KG+ WEGG RVP + G I P +D I+ Sbjct: 274 FVIFTSDNGPWLTEHENGGSAGPLKDGKGTWWEGGFRVPAICWMPGKINPAINDEIMTSM 333 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVR 467 DL+PT L +AG PK +DGV+QT S R +Y+ +L A+R Sbjct: 334 DLYPTFLSMAGIEQ-------PKDLVLDGVNQTGLLF-EEKHSARDEVYYWWGSELMAIR 385 Query: 468 MDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVP 527 E+KY+ + T + A ++N+ TD E ++ +H + Sbjct: 386 KGEWKYYFKTIKDQYLR--------TCKIETPAEPLLYNVETDISERFNLADKHPEIVKL 437 Query: 528 LQTEMHAYMEILKKYP 543 L + + +K P Sbjct: 438 LIEAGEKHKKGMKIKP 453 >UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPC8_9BACT Length = 598 Score = 517 bits (1333), Expect = e-145, Method: Composition-based stats. Identities = 132/468 (28%), Positives = 197/468 (42%), Gaps = 36/468 (7%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 + T KKPN +V DD G+ D+G G TP+ID +A +G T+ YS S Sbjct: 18 QATDKKPNFIVIFTDDQGYQDLGCFGSPKI---KTPEIDQMAKEGARYTNFYSANAICSA 74 Query: 139 TRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 +RA +LTG+Y +G+ G G T+ ++L GY T IGKWH+G+ + Sbjct: 75 SRAALLTGRYPSRNGVFHVYYPGASQGLKPSEITIAEVLKTAGYRTSIIGKWHLGDRNQF 134 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P N GFD + G +DM+ + E IK SK RGG+ Sbjct: 135 LPTNQGFDSYFGIPFSNDMWMSKDLALADDIKLFGGVTVEQIKSGEASKAVKGEKRGGKV 194 Query: 258 QAIAD----ITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 + D P + QR+ D +K + + K +P+F+Y H Y + K+A Sbjct: 195 PLMRDEEVVEYPVDQTYITQRYTDEALKIIKESEKKKQPYFIYLAYAMPHVPLYASPKFA 254 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFR 372 G S AR YGD + EM+ + K L+ +G NTL++FTSDNGP G P R Sbjct: 255 GKS-ARGPYGDTVEEMDYHVGRILKHLKSSGADKNTLVIFTSDNGPWNLGERGGSALPLR 313 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 GAK ST+EGG RVP ++W G I S I D PT LA Sbjct: 314 GAKFSTYEGGHRVPCVMWWPGTIPAGTDSAEIATTLDFMPTFAKLANAQLP--------N 365 Query: 432 TFIDGVDQTSFFL-GTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 +DG + G G+S + +++ + A+R+ K + Sbjct: 366 RTLDGKNIAPMLRDGNKGKSPYEKFYFWSKNHIEALRIGNMKL---------------RM 410 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + + +FNL D ES ++ + + + + Sbjct: 411 SWDKKNNVRKETELFNLEGDIAESHNLAPQMPEKVAAMTKMLLEAEQE 458 >UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTW6_9SPHI Length = 486 Score = 516 bits (1330), Expect = e-145, Method: Composition-based stats. Identities = 129/504 (25%), Positives = 201/504 (39%), Gaps = 53/504 (10%) Query: 46 YLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 LV A T+ + + + PNVV+F +DD+G+ D+ G Sbjct: 10 RLVLSAITLVGLGLSISAWVEK------------PAPATPPNVVLFFMDDLGYGDLSVTG 57 Query: 106 GGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP- 163 A+ TP++D +A++G T+ + S +RA +LTG Y G+ P Sbjct: 58 ---ALDYTTPNLDKMAAEGTRFTNFLAAQAVCSASRAALLTGCYPNRLGLYGALGPNSPI 114 Query: 164 GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDV 223 G TL +LL ++GY T GKWH+G+NK+ P GFD++ G DM+ Sbjct: 115 GLNPNEETLAELLKERGYATGMFGKWHLGDNKQFLPMQQGFDEYYGVPYSHDMWPLHPAQ 174 Query: 224 HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL 283 +K G + + + V F+ Sbjct: 175 AQ-------------------AKYPPLRWIDGNEPGPEIKDLNDAGKITGTITEKAVSFI 215 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 KPFFLY H +A++ G S AR +GD + E++ + L++ Sbjct: 216 RNH--KKKPFFLYVPHPLPHVPLATSARFKGQS-ARGIFGDVLTELDWSVGQIMNELKQQ 272 Query: 344 GQLDNTLIVFTSDNGPEAEVPPH--GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KS 400 G NTL++F SDNGP H FR KG+++EGG RVP V W G++ S Sbjct: 273 GLDKNTLVIFISDNGPWLNYGDHAGSSGGFREGKGTSFEGGHRVPCLVRWPGVVPAGRVS 332 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN 460 + ++ D+ PT ++ G K IDGVD + G N + R +Y+ Sbjct: 333 NKLLTALDILPTVANVCGARLPK--------QRIDGVDWVALLKGDNSVTPRDKFYYYYR 384 Query: 461 -GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS--VFNLYTDPQESDSI 517 L AVR ++K QGG G +T + +++L DP E + Sbjct: 385 KNSLEAVRQGDWKLVFAHPGRTYEGFLPGQGGKPGPSTETHAIAAGLYDLRRDPGERYDV 444 Query: 518 GVRHIPMGVPLQTEMHAYMEILKK 541 +H + L+T L Sbjct: 445 REQHPEVVARLETIAEEARADLGD 468 >UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN Length = 499 Score = 512 bits (1319), Expect = e-143, Method: Composition-based stats. Identities = 132/498 (26%), Positives = 203/498 (40%), Gaps = 44/498 (8%) Query: 50 PATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVA 109 A T ++ + T++ + + ++PN+V+ DD+G+ D+G G A Sbjct: 18 AAMTFVAFVLATTFVISSTAATEES--AADAASKRRPNIVLIFCDDLGYADIGCFG---A 72 Query: 110 VGNPTPDIDAVASQGLILTSA-YSQPSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQ 167 G TP+++ +AS+G+ T + S +RA +LTG Y GIL G + Sbjct: 73 KGYETPNLNKLASEGMKFTDFQVAAAVCSASRAALLTGCYPQRVGILSALGPSDSIGIAK 132 Query: 168 GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 + +LL + GY T GKWH+G +++ PQ GF + G +DM+ + Sbjct: 133 NELLISELLQNLGYKTACFGKWHLGHHEQFLPQQNGFATYFGLPYSNDMWPKHPTAKNAY 192 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 D ++ I+ P L + + VKF+ Sbjct: 193 PPLPLIDGNKTIELNP-----------------------DQTKLTTWYTEKAVKFIHDC- 228 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 +KPFFLY H + + K+AG + R +GD + E++ + K LE G +D Sbjct: 229 -GEKPFFLYVPHNMPHVPLFVSEKFAGKT-KRGLFGDVIAEIDWSVGEITKALEATGNVD 286 Query: 348 NTLIVFTSDNGPEAEVPPH--GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIV 404 NTL++FTSDNGP H FR KG+ WEGG RVP + G IQP D + Sbjct: 287 NTLVIFTSDNGPWLSYGDHAGSTGGFREGKGTVWEGGHRVPMIAKYPGTIQPGTTCDKLA 346 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG-TNGQSNRKAEHYFLNGKL 463 DLFPT G + + IDGV +S+ + +Y+ L Sbjct: 347 STIDLFPTIAHYCGAT-------IDPSRKIDGVSIQPLLESVEGAKSSHEFFYYYWGNGL 399 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 AVR + FK H G G G ++F+L DP E +I H Sbjct: 400 EAVRDERFKLHFPHAFRSLTGTPGTDGMPNGYTQAKTELALFDLDADPFEQTNIAADHPE 459 Query: 524 MGVPLQTEMHAYMEILKK 541 + L + L Sbjct: 460 VTARLTAAAESMRSDLGD 477 >UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01N83_SOLUE Length = 461 Score = 510 bits (1314), Expect = e-143, Method: Composition-based stats. Identities = 122/465 (26%), Positives = 186/465 (40%), Gaps = 60/465 (12%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 ++PN+VV L DD+G+ D+G G +A TP+ID +A +G TS YS P SP+R Sbjct: 24 QQRQPNIVVILADDLGYGDLGCYGSPIA----TPNIDRLAEEGARFTSFYSASPVCSPSR 79 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 A ++TG+Y + + G G T+ Q+L GY T IGKWH+G P Sbjct: 80 AALMTGRYPTRVEVPVVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGSTPGYLPT 139 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 N GFD+F G +D I P + Sbjct: 140 NRGFDEFFGVPYSAD-----------------------ITPCPLMRGSSVVAP------- 169 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART 320 L + + F+ + D PFFLY H + ++AG S Sbjct: 170 ----AVDCSTLTSSFTQEALDFMRRA--QDNPFFLYLAHTAPHLPLAASPRFAGQS-GLG 222 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWE 380 Y D + E++ + L+ G NTL++F+SDNGP + RG KG T+E Sbjct: 223 MYADVVQELDWSTGQVMAALKATGLDSNTLVMFSSDNGPW---YQGSQGKLRGRKGETYE 279 Query: 381 GGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 GG+R P + G+I G+ DL PT LAG +DGVD Sbjct: 280 GGMREPFLARYPGVIPSGIGCAGLATTMDLLPTLARLAGAQTPS--------NPLDGVDI 331 Query: 440 TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT 499 G + +R YF L R+ +K H+ A++ G + Sbjct: 332 WPVLTGERAEVDRDVFLYFDAVYLQCARLGRWKLHLSRYNTKAWSPLPPGGRVN---LPL 388 Query: 500 AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 ++++ +DPQES H + ++ + ++ +PP Sbjct: 389 PRPELYDVVSDPQESYDCAASHPAIVADIRARVERMVQT---FPP 430 >UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG5_9BACT Length = 462 Score = 510 bits (1314), Expect = e-143, Method: Composition-based stats. Identities = 131/476 (27%), Positives = 202/476 (42%), Gaps = 60/476 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 K PN+V DD+G+ D+ G TP ID++ QG+ T YS P Sbjct: 25 PSAASSAEKPPNIVFIFADDLGYNDLSSYGATDIA---TPAIDSLGEQGIRFTDFYSASP 81 Query: 135 SSSPTRATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 SP+RA +LTG+Y I GI G TT+ +LL + GY T +GKWH+G Sbjct: 82 VCSPSRAALLTGRYPIRQGITGVFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGH 141 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 +++ P GF + G +DM D V +R Sbjct: 142 HQKHLPLQNGFHSYFGIPYSNDM------------------------------DMVVYMR 171 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 G + ++ +R+ + V+F+++ D+PFFLY H Y + + Sbjct: 172 GNDVESYEVD----QHYTTRRYTEEAVQFIEQ--NKDQPFFLYLAHSMPHVPIYASENFV 225 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP--HGRTPF 371 G+S R YGD + E++ A + TL+K+ +NTL+VFTSDNGP + P Sbjct: 226 GTS-KRGLYGDVIQELDWSVAQILDTLDKHQLSENTLVVFTSDNGPWTALKHLGGSAAPL 284 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 R K T++GG+RVP V W I S + ++ D FPT +A PK Sbjct: 285 REGKMFTFDGGMRVPCLVRWPAQIPAGQTSHAMANMMDWFPTFSRIANLDT-------PK 337 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 + IDG+D T G+ +++ + + +G L A R ++K G Q Sbjct: 338 SRSIDGLDITDVLTGSGPRADNEFFFFHGDGDLRAYRDGDWKL--------KLPYEGNQA 389 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 + +FNL DP E+ + +H +Q M ++ L + PP Sbjct: 390 ARWRQAVAAHPILLFNLAEDPGETTDLAAQHPERLAAMQARMTDFLASLGELPPEK 445 >UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED2_PARD8 Length = 468 Score = 510 bits (1313), Expect = e-143, Method: Composition-based stats. Identities = 119/487 (24%), Positives = 197/487 (40%), Gaps = 68/487 (13%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 A + + ++PNV++ +DD G+ D+G G + TP ID +A +G+ LT Y Sbjct: 12 AFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGST---KHRTPHIDQMAKEGIRLTDFY 68 Query: 132 S-QPSSSPTRATILTGQYSIHHGILMPPMY--------------GQPGGLQGLTTLPQLL 176 S+P+R+ +LTG Y + + G G T+ +L+ Sbjct: 69 VGSSVSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGEITIAELM 128 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 +QGY T IGKWH+G+ P GFD + G +DM Sbjct: 129 KEQGYATACIGKWHLGDQLPFLPTRQGFDYYYGIPYSNDM-------------------- 168 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 D + +Q + P + L R+ + V+F+ + PFF+Y Sbjct: 169 ----------DRPYCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIKSH--KESPFFIY 216 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 H + + G S YGD E++ L +TL++ G NTLI+FTSD Sbjct: 217 LCHNMTHNPLAASPAFKGKSQN-GLYGDATEELDWSMGVLLETLKEEGLDQNTLIIFTSD 275 Query: 357 NGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALD 415 NG + P RG KG+T+EGG RVP + W I + +D +V D PT Sbjct: 276 NGADEHFGGT-NRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAH 334 Query: 416 LAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHV 475 + VP IDG + + G + S + +Y+ +L AVR +KYH+ Sbjct: 335 YC-------SYAVPSDRVIDGHNVSGILEGESMASPTETFYYYQKQQLQAVRWGNWKYHL 387 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 +++ G + + ++NL D E+ ++ +H + + + Sbjct: 388 PLKE--------RIKGPHFPDTEVGEARLYNLANDLSETTNVIDKHPEVVTKMNQWIEQV 439 Query: 536 MEILKKY 542 + + Sbjct: 440 RSDMGDW 446 >UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO Length = 492 Score = 509 bits (1311), Expect = e-142, Method: Composition-based stats. Identities = 132/501 (26%), Positives = 219/501 (43%), Gaps = 41/501 (8%) Query: 54 IADNMMPVMQHPAQDKETQQKLAELEKKT---GKKPNVVVFLLDDVGWMDVGFNGGGVAV 110 +A M + Q+ ET +E +KPN ++ DD+G+ D+ G Sbjct: 15 MALTMCLGLMAGCQNTETSPGDSEGTAAAGGIPEKPNFIIVFADDLGYGDLSSFGHPTI- 73 Query: 111 GNPTPDIDAVASQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMY-----GQPG 164 T ++D +A++G T+ Y +P+RA +LTG+ + +G+ + G Sbjct: 74 --HTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGRLPVRNGLTSNEIGVFFPDSHNG 131 Query: 165 GLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVH 224 TL + L GY T +GKWH+G +E P N GFDD+ G +DM + Sbjct: 132 MPASEITLAEQLKKAGYATGMVGKWHLGHKEEYLPPNHGFDDYFGIPYSNDMDFTGQFTS 191 Query: 225 VNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLD 284 +R E +K ++ + E P + +R+ D VK++ Sbjct: 192 YQDYFGRYTERYESLKTEEYNVPLIRGTEEIE-------RPVNQNTITKRYNDEAVKWIR 244 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 + D+PFF+Y H + + ++ G+S AR YGD + E++ + + LE G Sbjct: 245 EH--KDEPFFMYLAHSLPHVPLFTSDEFRGTS-ARGLYGDVVEEIDHGVGQIMELLEAEG 301 Query: 345 QLDNTLIVFTSDNGPEAEVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG 402 +NT++VFTSDNGP G + R KG+TWEGG+R PT + GM+ + Sbjct: 302 LAENTIVVFTSDNGPWLPTGISGGSAGLLREGKGTTWEGGMREPTIFWAPGMLPAKVVMD 361 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK 462 + DLF T LAG P +P +DGVD + G + +S RK Y+ Sbjct: 362 MGSTLDLFNTFSSLAGVP-------MPDDREMDGVDLSPILFG-DAESPRKEMFYYQGAD 413 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 L AVR+ +K H ++ Y ++ ++N+ DP E + +H Sbjct: 414 LYAVRLGAYKAHFYTKEAY---------VMGAERVEHNPPLLYNVEEDPSEKYDLSGKHP 464 Query: 523 PMGVPLQTEMHAYMEILKKYP 543 + ++ + A+ + K P Sbjct: 465 EVIEEIRRVVEAHNANMVKAP 485 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 506 bits (1305), Expect = e-142, Method: Composition-based stats. Identities = 120/507 (23%), Positives = 191/507 (37%), Gaps = 82/507 (16%) Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 P+ + + KPN +V L DD G+ D+ G TP ID + Sbjct: 546 AANQPSSPTASVSPAGREKTAETTKPNFIVILTDDQGYGDLSCFGAKHV---DTPRIDQM 602 Query: 121 ASQGLILTSAY-SQPSSSPTRATILTGQYSIHHGI-----LMPPMYGQP-GGLQGLTTLP 173 A++G LTS Y + P +P+RA ++TG Y + + G P G T+ Sbjct: 603 AAEGSRLTSFYVAAPVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIA 662 Query: 174 QLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP 233 ++L GY T GKWH+G+ E P GFD+F G D++ + L Sbjct: 663 EVLKTAGYRTGMFGKWHLGDQPEFLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPL 722 Query: 234 DRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPF 293 +++ + ++ D L +R + V F+++ D+PF Sbjct: 723 LQNDTVIEMDPDAD----------------------FLTKRLTEQAVSFIER--NKDQPF 758 Query: 294 FLYYGTRGCHFDNYPNAKY-----------AGSSPARTSYGD-------CMVEMNDVFAN 335 FLY H + + + Y + E++ Sbjct: 759 FLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYATRANLFRQAIAEIDWSVGQ 818 Query: 336 LYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI 395 + L NG + T+++FTSDNGP RG KG+T+EGG+R PT V W G I Sbjct: 819 ILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQI 878 Query: 396 QPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 ++D ++ DL PT LAG +P IDG D G Q+ A Sbjct: 879 PAGHQNDELMTAMDLLPTFAKLAGA-------AIPTDRVIDGKDIWPTLKGET-QTPHDA 930 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 Y +LAAVR ++K H V +++L D E Sbjct: 931 FFYHRGNQLAAVRSGKWKLH---------------------VNNGVAKQLYDLENDLGEK 969 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++ + + LQ ++ + + Sbjct: 970 VNVIETNPEVVKKLQHQLKDFAADIAS 996 Score = 453 bits (1166), Expect = e-126, Method: Composition-based stats. Identities = 116/509 (22%), Positives = 193/509 (37%), Gaps = 61/509 (11%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 + PNVV+ +DD+G+ D+G G TP+ID +A++G T A+S Sbjct: 30 CGTSVAAERPPNVVLIFVDDLGYGDLGCYGATKLS---TPNIDRLAAEGRRFTDAHSASA 86 Query: 135 SSSPTRATILTGQYSIHH----GILMP-PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 +P+R +LTGQY + GI P P T+ ++ ++GY T +GKW Sbjct: 87 VCTPSRYGLLTGQYPVRAMGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKW 146 Query: 190 HMGENKES---------QPQNVGFDDFRGFN--SVSDMYTEWRD---VHVNPEVALSPDR 235 H+G +E PQ+VGFD + G + Y D +P L Sbjct: 147 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 206 Query: 236 SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 F ++ A+ E + VK++ + K ++PFFL Sbjct: 207 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFL 264 Query: 296 YYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 Y+ T H P ++ G+S YGD + E++ + + ++LE NG DNTL++FTS Sbjct: 265 YFATPNIHHPFTPAPRFKGTSQC-GLYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTS 323 Query: 356 DNGPEA--------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDL 406 DNG + G K WEGG RVP W G I+ SD ++ Sbjct: 324 DNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQ 383 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR-KAEHYFLNGKLAA 465 DLF T L +P + D ++ L + R + + A Sbjct: 384 VDLFATFSALT-------EQEMPSSEQKDSINMLPALLDDPNEPLRTELVLAPRQPRNLA 436 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGT------------------VMQTAGSSVFNL 507 +R ++ Y + G + +++L Sbjct: 437 IRKGKWLYIGARGSGGFNGSKPQHHAWGGPAAVQFSGQKNSDIVNGRIKKNAPPAQLYDL 496 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 D ++ ++ H + ++ + +Y Sbjct: 497 ENDRSQTTNVFREHPEVVEEMKAMLESYR 525 >UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA Length = 489 Score = 506 bits (1305), Expect = e-142, Method: Composition-based stats. Identities = 127/473 (26%), Positives = 197/473 (41%), Gaps = 39/473 (8%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 A T +KPNV+V DD G+ D+G G TP++D +AS+G TS YS Sbjct: 35 SSAAESTDTTEKPNVIVIFTDDQGYNDLGCYGSP---NIKTPNLDRLASEGRRYTSFYSA 91 Query: 134 -PSSSPTRATILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWH 190 SP+RA +LTG Y G+ ++ Q G T+ L GY T +GKWH Sbjct: 92 CSVCSPSRAALLTGCYPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWH 151 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G +KE+ P + GFD + G +DM H + + + + + Sbjct: 152 LGHHKETLPTSNGFDSYYGIPYSNDM------NHPDNKRLGKMSSDDRWTDQSSAVTLWN 205 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 +++ I P + +R+ D ++F++ A DKPFFLY H Y Sbjct: 206 TPLVQDEEII--ELPVDQRTVTRRYTDRAIEFVE--ANQDKPFFLYLPHSMPHIPLYVPE 261 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-- 368 P Y + ++ L +T+ G + TLIV+TSDNGP + HG Sbjct: 262 DVYDPDPQNA-YKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNGPWLQFKNHGGSA 320 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANL 427 P R KG+T+EGG RVP ++ G I S+ DL PT G Sbjct: 321 GPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASFTGV-------A 373 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAVRMDEFKYHVLIQQPYAYTQS 486 + IDG+D TS F + +S R ++ +G L +RM ++KY + + Sbjct: 374 LENDRKIDGIDLTSTFT--SDESARDEFVFYSAHGVLEGIRMGDWKYLRQVAR------- 424 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 +G +F+L D E +++ + + M E + Sbjct: 425 --RGPNAKGPKPEPKVFLFDLSQDIGEKNNLVEQQPERVQKMHARMEELNEEI 475 >UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A2W0_9PLAN Length = 477 Score = 506 bits (1304), Expect = e-142, Method: Composition-based stats. Identities = 122/470 (25%), Positives = 199/470 (42%), Gaps = 51/470 (10%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 + ++ KPN V+ +DD+G+ D+ G V N TP+++A+A +G+ LT Y+ P Sbjct: 20 SSSCAQEVATKPNFVIINIDDLGYADIEPFGSEV---NRTPNLNAMADEGMKLTCFYAAP 76 Query: 135 SSSPTRATILTGQYSIHH-GILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 SP+RA ++TG Y I G G T+ +L+ +QGY T IGKWH+G Sbjct: 77 VCSPSRAALMTGCYPKRALTIPHVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG 136 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + + P GFD + G +DM V N + + + LP +++ Sbjct: 137 DQPDFLPTRQGFDYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVLQ 196 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 R + K +L + + ++F+ +KPFFLY HF YP + Sbjct: 197 R---------VLAKDQTELVTNYTEEAIQFIRDH--QEKPFFLYLPHSAVHFPMYPGDAF 245 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 G + + Y D + E++ + + L+ G TL++FTSDNG + P R Sbjct: 246 RGKN-SHGLYNDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFG-AVNKPLR 303 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 K +T+EGG+RVPT V W G + SD +V + D+ PT + LAG P Sbjct: 304 AGKATTYEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTT-------PTD 356 Query: 432 TFIDGVDQTSFFLG-TNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 IDG D G +S +++ L AVR +K Sbjct: 357 RKIDGADIGPILAGVKEAKSPHDVFYFYRGYDLEAVRSGPWKL----------------- 399 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + +++NL+ D E+ ++ + + L+ L Sbjct: 400 -------RLKEGALYNLHEDISEAKNVAPDNADVVERLRKIAAEMDSDLG 442 >UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968C90 Length = 525 Score = 503 bits (1297), Expect = e-141, Method: Composition-based stats. Identities = 130/487 (26%), Positives = 209/487 (42%), Gaps = 60/487 (12%) Query: 62 MQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVA 121 + E +KPN V +DD+G+ DV G TP+IDA+A Sbjct: 52 LVWGGLLGTVGVSCTEATPTKSEKPNFVFIYMDDMGYSDVSCYG---ETRWTTPNIDALA 108 Query: 122 SQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQ 179 ++G+ T Y+ P SSP+RA LTG+Y GI G T+ ++L Q Sbjct: 109 AEGIKFTDCYAASPISSPSRAGFLTGRYPARMGIQGVFYPDSYTGMAPEEVTMAEVLKVQ 168 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GY T IGKWH+G ++ P GFD++ G +DM Sbjct: 169 GYATACIGKWHLGSREKYLPLQQGFDEYFGIPYSNDM----------------------- 205 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 + + + + + ++ +++ + V ++ + K+D+PFFL+ Sbjct: 206 -----------SAQVYLRGNEVEEFHIDINNVTKKYTEEAVDYIRR--KADQPFFLFLAH 252 Query: 300 RGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 H Y + ++AG S A YGD ++E++ + +TL + G DNTL+VFTSDNGP Sbjct: 253 SMMHVPIYVSDEFAGKSGA-GIYGDAVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNGP 311 Query: 360 EAEVPPHGR--TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLA 417 + P G P R K + +EGGVRVP YWKG I+P + +V L D FPT L+ Sbjct: 312 WLQEGPLGGRALPLREGKTTAFEGGVRVPCIAYWKGQIKPVVNTDVVSLLDWFPTVTALS 371 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 G +DG D T+ GT +++ ++ N + R ++K Sbjct: 372 GGILP--------DVRLDGYDLTAVLNGTGKRASEDYAYFRNNRDITDYRSGDWKI---- 419 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + G +G F + +FNL D E ++ ++ + ++ Y Sbjct: 420 ----SLPAPGIKGNFWRASTAEHDTLLFNLREDIGERYNLYRKYPGKAKEMLQKLQEYTR 475 Query: 538 ILKKYPP 544 + PP Sbjct: 476 NFGEIPP 482 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 502 bits (1294), Expect = e-140, Method: Composition-based stats. Identities = 124/481 (25%), Positives = 195/481 (40%), Gaps = 54/481 (11%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 ++PN++V L DD+G+ ++G G PTP ID++AS G+ T AY P+ SP+ Sbjct: 21 SAAEQPNIIVLLADDLGYGELGCQGNPQI---PTPHIDSLASHGIRFTQAYVTAPNCSPS 77 Query: 140 RATILTGQYSIHHGILMPPMYGQ-----PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 RA +LTG+ G P+ + G T+ + LHDQGY T IGKWH+G Sbjct: 78 RAGLLTGRIPTRFGYEFNPIGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGGT 137 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 + P GFD+F GF + H + K S++ +++ Sbjct: 138 ADYHPFRHGFDEFFGFMHEGHYFVP-PPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHM 196 Query: 255 GEQQAIADITPK---------YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 G + D E L + V F+++ DKPFFLY H Sbjct: 197 GYDEPDYDANNPIIRGGQPVNETEYLTDAFTREAVSFINRH--QDKPFFLYLAYNAVHSP 254 Query: 306 NYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 K R + + M+ + K ++++G + TLIVF SDNG Sbjct: 255 LQGKKKDIQHFTQIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGP 314 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGH 419 P RG KGS +EGG+RVP + W G + P + D V D+FPT++ LAG Sbjct: 315 TRELTSSNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPTSVALAGA 374 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 + +DG + L + A+ ++ G+ AA+R ++K + Sbjct: 375 SLPQN---------LDGRNLLPLLLQQKTELP-VADFFWRQGRKAALRSGDWKIVQMRG- 423 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + ++NL D E+ + + LQT + + Sbjct: 424 ----------------TREKPVWELYNLANDKSETIDLATEQSEKRMELQTRWNELNAQM 467 Query: 540 K 540 K Sbjct: 468 K 468 >UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TQQ7_9BACE Length = 853 Score = 499 bits (1285), Expect = e-139, Method: Composition-based stats. Identities = 132/492 (26%), Positives = 204/492 (41%), Gaps = 62/492 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT- 139 + +KPNVV+ DD G+ D+G G + TP ID +A +G+ LT Y Sbjct: 18 QARQKPNVVIIFTDDQGYQDLGCYGSPLI---QTPFIDRMAKEGIKLTDFY--------V 66 Query: 140 --------RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 RA +LTG+ + +G+ G TL + L +QGY T GKWH+ Sbjct: 67 SSSVSSASRAGLLTGRLNTRNGVKGVFFPESEGMPSEEITLAEALKEQGYTTGCFGKWHL 126 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTE----------WRDVHVNPEVALSPDRSEYIKQ 241 G+ K P + GFD + G +DMY +R+ + + + + Sbjct: 127 GDLKGHLPTDQGFDYYYGIPYSNDMYIGPSQQFASNVTFREGYNLSKAKEDQEFVRTSSR 186 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 K +A E I + P +R+ D+ + F++ ++PFF+Y Sbjct: 187 ADIKKRLNNASPLFEGDKIIEY-PCDQSTTTRRYFDHAIDFIEN--NPEQPFFVYITPSM 243 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + ++ G S R YGD + E++ L L+K +NTL++F SDNGP Sbjct: 244 PHVPLFASEQFKGKS-KRGLYGDVVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNGPWL 302 Query: 362 EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 G + P RG K S +EGGVRVP + WKG I SD IV DLFPT + AG Sbjct: 303 SFKEDGGSAEPLRGGKFSYYEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAG 362 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 IDG++ +SF + R Y G++ +R ++ Y Sbjct: 363 CQS--------FKQKIDGINISSFLKNPSL-RLRDEYVYVKGGEVHGIRKGDWVYLPKTG 413 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + +FNL D ES+++ +++ LQ M Y Sbjct: 414 N--------------SKFKKGDVPELFNLKQDIGESNNLHLQYPNKVKELQEVMKKYQST 459 Query: 539 LKKYPPRAQIKS 550 P +QI+ Sbjct: 460 --STMPYSQIRD 469 >UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKY2_PLALI Length = 483 Score = 497 bits (1281), Expect = e-139, Method: Composition-based stats. Identities = 125/497 (25%), Positives = 185/497 (37%), Gaps = 70/497 (14%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 + + L +PN+++ + DD+G+ DVGF+G PTP++ Sbjct: 5 LEAGCSQISAFAFCMLALVITPVIAADRPNILLIVGDDMGYADVGFHG---CKDIPTPNL 61 Query: 118 DAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLL 176 DA+A G+ TS Y P SPTRA +LTG+Y G P G T+ L Sbjct: 62 DALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNPSGANTGLPLTEVTIADRL 121 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T +GKWH+G PQ GF++F GF + + + + + E + D Sbjct: 122 KQVGYTTGLVGKWHLGSQPAMHPQERGFEEFIGFLGGAHSFFDAQGILRGHEPVKTID-- 179 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + V F++K DKP+FLY Sbjct: 180 ---------------------------------YTTDLFGREAVSFIEKHR--DKPWFLY 204 Query: 297 YGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 H + R +Y M+ M++ + LE GQ TL+ Sbjct: 205 LSFNAVHTPMHATEDRMAKLASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLV 264 Query: 352 VFTSDNGPEA----EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA 407 +F SDNG + TP RG+K +T EGG+RVP V W G I P D V Sbjct: 265 MFISDNGGPTMPGVTINGSINTPLRGSKRTTLEGGIRVPFVVSWPGKIAPAVFDSPVIQL 324 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVR 467 DL TAL +AG K DGV+ + G + A + G+ AVR Sbjct: 325 DLTATALAVAGVE---------KDVKSDGVNLLPYLQGKQSEVPHAALF-WRFGEQMAVR 374 Query: 468 MDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVP 527 ++K T G Q +++L D E+ + Sbjct: 375 AGDYKLVRYDSNADTLTGKGKQPVTAAR--------LYDLKEDLGETRDLAASMPEKVAE 426 Query: 528 LQTEMHAYMEILKKYPP 544 LQ + + + + PP Sbjct: 427 LQAQWDRWNQ--QNMPP 441 >UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LDP6_PARD8 Length = 452 Score = 497 bits (1281), Expect = e-139, Method: Composition-based stats. Identities = 122/468 (26%), Positives = 199/468 (42%), Gaps = 53/468 (11%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 KPN++V DD+G+ D+ G TP+ID +A +G +S Y SSP+ Sbjct: 21 SQPTKPNIIVINCDDMGYGDLSCFGSPTI---KTPNIDRMAIEGQKWSSFYVSASVSSPS 77 Query: 140 RATILTGQYSIHHGILMPPM-----YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 RA +LTG+ + G+ + G T+ +LL GY T IGKWH+G Sbjct: 78 RAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELTIAELLKQAGYHTACIGKWHLGHL 137 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E P GFD F G+ +DM + + + ++Y + + + R Sbjct: 138 PEYMPLRHGFDYFYGYPYSNDM---------SRKEQIKLGNTKYPYEYIIYEQEKELERE 188 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 +Q +L Q+ + ++++ + + PFFLY H Y + + G Sbjct: 189 PQQY-----------NLTQQVTEAAIRYIK--SNENSPFFLYLAHPMPHMPVYASTDFQG 235 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA--EVPPHGRTPFR 372 S AR YGD + E++ + +TL+ G NTL++FTSDNGP + P + Sbjct: 236 KS-ARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNGPWLLCKQEGGSPGPLK 294 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K S +EGG RVP + W M++P + DL PT ++AG P +P Sbjct: 295 DGKASMFEGGFRVPCIM-WGAMVKPGYITDMASTLDLLPTFCEIAGIP-------LPSDR 346 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 DG+ + + R +++ +L A+R ++K H + Y Sbjct: 347 HYDGISLLNVLKDKSTC-KRDVFYFYRGSELYAIRKGKYKAHFSYRPAYG---------- 395 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 T + +++L TDP E +I H + L +A+ LK Sbjct: 396 TTDKIIYDKPVLYDLGTDPGELYNIAEEHPDIVQELTMLANAHKASLK 443 >UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D764_9BACT Length = 499 Score = 495 bits (1275), Expect = e-138, Method: Composition-based stats. Identities = 132/467 (28%), Positives = 201/467 (43%), Gaps = 31/467 (6%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 KPN ++ +DD+G+ D+ G + N TP++D +A +G LT Y P SP+R+ + Sbjct: 22 DKPNFIIINIDDMGYADIAPFGSKL---NRTPNLDRMAQEGRKLTCFYGAPVCSPSRSAL 78 Query: 144 LTGQYSIHH-GILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 +TG Y I G G T+ +LL GY T IGKWH+G+ E P Sbjct: 79 MTGCYPKRVLPIPSVLFPGAAVGLNPAEHTVAELLKKSGYATGCIGKWHLGDQPEFLPPR 138 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVN-----PEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 GFD + G +DM + P+ +P+ S I + + + Sbjct: 139 RGFDYYLGLPYSNDMGPGEDGSKSSLGDPIPKPKATPNPSAPIPETGITGNQPPLPMLEN 198 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 ++ IA + + L R+ VKF+ + DKPFFLY HF YP ++AG S Sbjct: 199 EKVIARVRQDEQQGLVDRYTKAAVKFITEH--KDKPFFLYLPHNAVHFPIYPGKEWAGKS 256 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKG 376 P Y D + +++ + TL + D+T ++FTSDNG P RG K Sbjct: 257 PN-GYYSDWVEQVDWSVGQVLNTLRELKLQDHTFVLFTSDNGGTPRAV---NAPLRGFKT 312 Query: 377 STWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 +TWEGG+R PT +W G I SD I + D+ PT ++LAG VP ID Sbjct: 313 TTWEGGMREPTIAWWPGKIPGGTSSDEITGMFDILPTLVNLAGGE-------VPTDHKID 365 Query: 436 GVDQTSFFLGTNG-QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 G + G G +S + +YF +L VR +K + Sbjct: 366 GGNIWPVLAGEAGAKSPHEVFYYFNGLRLEGVRTGPWKLRFGSAGLAEGKGPVKKPAAPI 425 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++NL TD E+ ++ H + L+ A + L + Sbjct: 426 ------PDQLYNLQTDIGETTNVADAHPDVVAHLRELADAMKDDLGR 466 >UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QWC8_9PLAN Length = 468 Score = 495 bits (1274), Expect = e-138, Method: Composition-based stats. Identities = 128/492 (26%), Positives = 185/492 (37%), Gaps = 78/492 (15%) Query: 57 NMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD 116 ++ ++ T +PN+VV + DD+G+ D+G +G PTP Sbjct: 6 SLRALVALGLLTAAT-----TSMAADASRPNIVVIVGDDMGYHDLGVHG---CKDIPTPH 57 Query: 117 IDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMY---GQPGGLQGLTTL 172 +DA+A+ G+ TS Y P SPTRA +LTG+Y G P G+ G TTL Sbjct: 58 LDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNPGPTPTGEIGLPLSETTL 117 Query: 173 PQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 L GY T +GKWH+G +++ P + GFD+F GF + Y Sbjct: 118 ADRLKKVGYKTGMVGKWHLGNDEKRHPLSRGFDEFFGFLGGARTYFATPGN--------- 168 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP 292 G + E L + V ++D+ S P Sbjct: 169 -------------------ASAGTKLLRGREVVDEKEYLTDAFAREAVAYIDRSKAS--P 207 Query: 293 FFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 FFLY H + KY P R Y M M+D + LE+ L+ Sbjct: 208 FFLYLTFNAVHTPMEASQKYLDRFTAVSDPKRQKYCAMMSAMDDAVGQVVAKLEREKLLE 267 Query: 348 NTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDL 406 NTLI F SDNG TP RG K +TWEGG+RVP FV WKG I K+ D V Sbjct: 268 NTLIFFVSDNGGPTAANTGDNTPLRGFKATTWEGGIRVPYFVSWKGKIPAGKTYDQPVIQ 327 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAV 466 D PT A A P DGV+ + N ++ + + G A+ Sbjct: 328 IDFVPT---------ALAAAGAPAAEKTDGVNLLPYLTFENKEAPHASLF-WRFGPQTAI 377 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R +K + ++++L D E+ + + Sbjct: 378 RHGNYKLVM--------------------TRDLDKPALYDLAADISETKDLSADKPEIVA 417 Query: 527 PLQTEMHAYMEI 538 L A+ + Sbjct: 418 QLTAAYDAWNQE 429 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 492 bits (1267), Expect = e-137, Method: Composition-based stats. Identities = 113/486 (23%), Positives = 178/486 (36%), Gaps = 57/486 (11%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-Q 133 A+L K+PN++ + DD+G+ + G GG PTP+ID + + G+ +S Y Sbjct: 17 CAQLAIAAPKRPNILFIVADDLGYGEPGCYGGKDI---PTPNIDKLVASGVRFSSGYVSA 73 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQ-----PGGLQGLTTLPQLLHDQGYVTQAIGK 188 P + +RA ++TG+Y G P+ + G T+ L D GY T +GK Sbjct: 74 PFCAASRAALMTGRYQTRFGFEYNPIGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGK 133 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMY--TEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 WH+G PQ GFD+F GF Y W PD S+ P Sbjct: 134 WHLGGTAPFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGH 193 Query: 247 DDVHAVRGGEQQAIADITP--------KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + A P + +L + F+D+ +P+FLY Sbjct: 194 TVWSTDLHENEPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRH--QAQPWFLYLA 251 Query: 299 TRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 H Y R + + +++ + L +G +NTL+VF Sbjct: 252 YNAVHSPLQGEDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVF 311 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 SDNG + P RG KG W+GG+R+P V WKG I D DL T Sbjct: 312 LSDNGGPTKELTSSNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTAT 371 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 AL LAG + +DGVD G + + G+ A+R ++K Sbjct: 372 ALKLAGAETEQA--------KLDGVDLLPLLTGKTTAAPHDTLF-WRVGRKNALRHGDWK 422 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + +++L D E++++ ++ L Sbjct: 423 LLR---------------------QGSKEWQLYDLAHDVGETNNMAAQNAARVTELSALW 461 Query: 533 HAYMEI 538 + Sbjct: 462 DKWNSE 467 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 491 bits (1264), Expect = e-137, Method: Composition-based stats. Identities = 120/499 (24%), Positives = 189/499 (37%), Gaps = 75/499 (15%) Query: 44 NQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGF 103 N + + + + + + +T +KPNV++F DD+GW + G Sbjct: 5 NGFHFSSICLVGICLAGISSICDLAQGAEP------TQTSRKPNVIIFYADDLGWGETGI 58 Query: 104 NGGGVAVGNPTPDIDAVASQGLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPPMYGQ 162 G PTP ID++A G+ T + + SP+RA +LTG+Y G + Sbjct: 59 QGNPQI---PTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRYPTRFGHEFNRIANV 115 Query: 163 PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRD 222 G TTL LH GY T +GKWH+G+ E +P GFD+F G + T + Sbjct: 116 SGLDLQETTLADRLHGLGYKTACVGKWHLGDGPEYRPTKRGFDEFFGTLAN----TPFFH 171 Query: 223 VHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKF 282 + +S D +E + + V++ Sbjct: 172 PTKFVDSRVSNDVAEVSDENF--------------------------YTTDEYAKRSVEW 205 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLY 337 + + P+FLY H KY P R + M M+D + Sbjct: 206 IGQQ--QQSPWFLYLPFNAQHAPLQAPQKYLDRFESIADPKRKLFAAMMSAMDDAIGQVL 263 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP 397 + + GQ +NTL+ F SDNG + P RG K +T+EGG RVP V WKG + Sbjct: 264 GKVRELGQEENTLVFFISDNGGPTQGTTSQNGPLRGFKMTTFEGGTRVPFLVQWKGKLPA 323 Query: 398 RKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH 456 K+ D V D+ PT L AG + + +DGVD +F + + Sbjct: 324 GKTYDNPVINLDVLPTVLTAAG-------SKIDPAWKLDGVDLVPYFTSSIANKPHETL- 375 Query: 457 YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 Y+ G+ AVR ++K V +++L +D ES + Sbjct: 376 YWRFGEQWAVRQGDWKLVVARGGS-------------------GQPELYDLASDIAESKN 416 Query: 517 IGVRHIPMGVPLQTEMHAY 535 + + LQ + Sbjct: 417 LASENPAKVKELQALWDQW 435 >UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT Length = 477 Score = 491 bits (1264), Expect = e-137, Method: Composition-based stats. Identities = 120/517 (23%), Positives = 182/517 (35%), Gaps = 118/517 (22%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + L K KPN+V L DD+G+ DV G TP+ID +A G+ T ++ Sbjct: 11 AVFCLSTKAANKPNIVFILADDLGYTDVACYGSKY---YETPNIDKLAKDGIKFTDGHTC 67 Query: 134 -PSSSPTRATILTGQYSIHHGILMP--------------PMYGQPGGLQGLTTLPQLLHD 178 P+ PTRA++++GQY G+ P+ TL Q L Sbjct: 68 GPNCQPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQSLKK 127 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T GKWH+GE+KE P GFD+ D NP+V D Sbjct: 128 AGYATGMFGKWHLGEDKEHHPAQRGFDE------ALVSMGVHFDFVTNPKVDYPKD---- 177 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 E L D + F+ + D+PFFLY Sbjct: 178 ------------------------------EYLADFLTDKALDFIKRH--KDEPFFLYLP 205 Query: 299 TRGCHFDNYPNAKYAGS--------SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 H + +Y + +++ + L++ DNTL Sbjct: 206 HYAVHKPLQAKKELIQKFSAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTL 265 Query: 351 IVFTSDNGPEAEVPPHG---------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KS 400 ++F+SDNG G P RG KG +EGG RVP W G I Sbjct: 266 VIFSSDNGGVGGYQREGIKKAGDVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVC 325 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFL-GTNGQSNRKAEHYFL 459 D + DL+PT L+LAG P+ +DG G + NR A ++ Sbjct: 326 DQPIISIDLYPTLLELAGAKA-------PEKYPLDGTSYLKVLKSGGMKKLNRDAIYWHF 378 Query: 460 NGKLAA------------VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNL 507 G L A VR ++K + ++NL Sbjct: 379 PGYLGAGADTWRTLPVGVVRCGDWKL--------------------MEFFEDHRLELYNL 418 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 D E++++ + L+ ++ A+ + ++ P Sbjct: 419 REDLGETNNLAAKMPEKAQELEKKLVAWQKEVQAPMP 455 >UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM Length = 479 Score = 490 bits (1263), Expect = e-137, Method: Composition-based stats. Identities = 120/472 (25%), Positives = 205/472 (43%), Gaps = 45/472 (9%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 + + + PNV++ DD+G+ D+G G P++D +A++G+ T+ Y+ Sbjct: 29 AVANPSHQSPNVIIIFADDMGYGDIGAYGHPTIRS---PNLDQMAAEGIKWTNFYAASSV 85 Query: 136 SSPTRATILTGQYSIHHGILMPPMY-----GQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +P+RA +LTG+ + G+ + G T+ + L ++ Y T +GKWH Sbjct: 86 CTPSRAGLLTGRLPVRSGMAHDQIRVLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWH 145 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G QP + GFD++ G +D YI+ + +KD Sbjct: 146 LGHLPGFQPLDHGFDEYFGIPYSNDH--------------DLKKELSYIQTITHAKDGDF 191 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 V + ++I + P + +R+ V F+ K S++PFFLY H + + Sbjct: 192 NVPLMQNRSIIE-RPANQNTITKRYTQEAVSFIKK--NSNQPFFLYLAHSMPHVPLFASD 248 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP 370 ++ GSS R YGD + E++ + TL + G +NTL+VFTSDNGP + HG + Sbjct: 249 QFRGSSD-RGLYGDVIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNGPWLIMGAHGGSA 307 Query: 371 --FRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 + KG+++EGG+R P +W I+P + DLFPT + +AG + Sbjct: 308 GLLKSGKGTSYEGGMREPAIFWWPEKIKPAVAHNTASTLDLFPTIMSIAGID-------M 360 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 P DG D + + RK Y+ K+ AVR ++K H + Sbjct: 361 PSDRSYDGYDLSPTMF-EQKSNERKNIFYYHGDKIFAVRQGDWKVHFKTVANIYTKEQ-- 417 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 ++ VFNL DP E +G + + + + +K Sbjct: 418 ------KILTHTPPQVFNLLVDPSERFDVGAVNPAIIASAAKLIEQHQLSVK 463 >UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT Length = 472 Score = 490 bits (1261), Expect = e-137, Method: Composition-based stats. Identities = 124/474 (26%), Positives = 200/474 (42%), Gaps = 49/474 (10%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 KPN ++ DD G+ D+ G TP ID +A++G+ + Y S +RA + Sbjct: 21 KPNFIIIFTDDQGYGDLSCFNPQ---GVQTPHIDQMATEGMKFNNFYVSAAVCSASRAAL 77 Query: 144 LTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV 202 LTG Y+ GI G + G T+ +LL +Q Y T GKWH+G+ P Sbjct: 78 LTGTYNDRIGIKSAFFPGTKQGLHPDEITIAELLKEQNYATACFGKWHLGDEPSLLPSAQ 137 Query: 203 GFDDFRGFNSVSDMY-----TEWRDVHVNPEVALSPDR-------SEYIKQLPFSKDDVH 250 GFD + G +DM+ T + N + L + K+ P K + Sbjct: 138 GFDTYFGIPYSNDMFIAPHQTFAENAKFNGDWTLEKAKELQKFIAPHVNKRGPIWKSEYK 197 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 A+ + P L QR+ D +KF+DK +KPFF++ H + + Sbjct: 198 ALVPILEGEQIVEFPADQASLTQRYFDRTIKFIDK--NQNKPFFIFLTPAMPHVPLFASK 255 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP--PHGR 368 ++ G S YGD + E++ L K L++ NTL++FTSDNGP Sbjct: 256 EFRGKSKK-GLYGDVIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNGPWLSYGDEGGSS 314 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANL 427 P R K +++EGGVR+PT + G+I+ + + DL PT L Sbjct: 315 GPLRDGKFTSYEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTFAQLVN-------TQ 367 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSG 487 VP+ IDG D + N +R + AVR ++K V Sbjct: 368 VPQDRKIDGKDISPLLKSQNHVIHRHLFF-----RDEAVRSGDWKLVVKEHH-------- 414 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 T+ + +++NL D ES+++ H + LQ+++ +++ L + Sbjct: 415 ------MTMRKGPLPALYNLKNDVAESNNLIDTHPKVAQYLQSKLDEHLKDLNE 462 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 489 bits (1260), Expect = e-136, Method: Composition-based stats. Identities = 128/505 (25%), Positives = 199/505 (39%), Gaps = 67/505 (13%) Query: 50 PATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVA 109 PA + ++ E + +PN++V + DD+G+ D+G GG Sbjct: 7 PAIALWLALVAFCSQALLAAEDVN-----QTSKSGRPNILVIMADDLGYADLGVQGG--- 58 Query: 110 VGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQP---GG 165 PTP +D +A+ G+ T+AY P SP+RA LTG+Y G P G+ G Sbjct: 59 CEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFLTGKYQTRFGHEFNPHVGEEAKLGL 118 Query: 166 LQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHV 225 T+ LL +GY T IGKWH G +K+ PQ+ GFD+F GF Y Sbjct: 119 PLEEVTIANLLQTEGYRTALIGKWHQGFSKDHHPQSRGFDEFFGFLVGGHNYLLH----- 173 Query: 226 NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 E + + RG E + + + ++++ Sbjct: 174 ----------KEVKARFGTAHSHDMIYRGREVEPQE-------GYATDLFTNEALRWM-- 214 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAG------SSPARTSYGDCMVEMNDVFANLYKT 339 +KP+FLY H PAR Y + ++D + + Sbjct: 215 SGPPNKPWFLYLSYNAVHTPLEIAPHLQKRIPESVKLPARRGYLSLLAGLDDSIGRITQH 274 Query: 340 LEKNGQLDNTLIVFTSDNGPEAE-----VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 L ++G + TLI+F SDNG P RG KG T EGG+RVP FV W G Sbjct: 275 LSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNHPLRGDKGQTLEGGIRVPFFVSWPGQ 334 Query: 395 IQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 + R + + DL PT LA + AK P IDGV+ ++LG + + Sbjct: 335 LPARTIYEQPIISLDLLPTVCQLAANNPAKPQ---PLPQGIDGVNLMPYWLGQRSGAPHE 391 Query: 454 AEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 + + G AVR +K P + + +G +++L TD E Sbjct: 392 SLF-WRFGPQKAVRAGNWKLVDWRDFPAS---------------KNSGWELYDLSTDISE 435 Query: 514 SDSIGVRHIPMGVPLQTEMHAYMEI 538 +++ H + L+T + + Sbjct: 436 KNNLAETHPEIVARLKTSWEKWNQS 460 >UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R206_9PLAN Length = 504 Score = 489 bits (1260), Expect = e-136, Method: Composition-based stats. Identities = 128/498 (25%), Positives = 198/498 (39%), Gaps = 58/498 (11%) Query: 62 MQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVA 121 + A +PNV++ +DD+G+ D+G G NPTP + +A Sbjct: 9 LAIGAIVASFLGSFVSEILAEESRPNVIIINIDDLGYADIGPFGSK---KNPTPALTKMA 65 Query: 122 SQGLILTSAYSQPSSSPTRATILTGQYSIHH-GILMPPMYGQ-PGGLQGLTTLPQLLHDQ 179 ++G+ LTS Y+ P SP+RA +LTG Y I G T+ +L Sbjct: 66 AEGMKLTSHYAAPVCSPSRAALLTGCYPKRVLSIPHVLFPSAGSGLHPDEVTIADMLKAS 125 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GY T +GKWH+G+ E P GFD + G +DM T N L ++ Sbjct: 126 GYKTACLGKWHVGDQAEFLPTKQGFDSYYGIPYSNDMGTATDGSKSNFGAPLPMPGAKGK 185 Query: 240 KQLP---------------FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLD 284 + P + +A + + +L + + V F+ Sbjct: 186 GKQPAQATGELPLGSPTGLTGNMQPPLPLLENDKVVARVRGEDQVNLTRDYTKRAVNFIR 245 Query: 285 KMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 + D+PFFLY+ HF YP+ ++ S R + D + E++ + L + Sbjct: 246 E--NKDQPFFLYFAHTAVHFPMYPSKEFRTSD--RGTLDDWVDEVDASVGEVLAALAEMK 301 Query: 345 QLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGI 403 + TL++FTSDNG TP +G+KG TWEGG+RVPT W G I+ S I Sbjct: 302 IDEKTLVIFTSDNGGSLPHGSD-NTPLKGSKGLTWEGGIRVPTIARWPGTIKGGTSTSAI 360 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL 463 + DL PT G + +DG++Q GT +S R+ YF +L Sbjct: 361 TGMIDLLPTIAAATGAKLPE--------RKLDGLNQLPLLNGTAKESPRREFFYFRGLEL 412 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 AVR D +K H A +++L +D ES ++ H Sbjct: 413 DAVRRDNWKLH------------------------LAKGELYDLESDIGESKNVAADHPE 448 Query: 524 MGVPLQTEMHAYMEILKK 541 + L L + Sbjct: 449 IVKSLTELAATADNDLGQ 466 >UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ11_9BACT Length = 462 Score = 489 bits (1259), Expect = e-136, Method: Composition-based stats. Identities = 124/477 (25%), Positives = 196/477 (41%), Gaps = 35/477 (7%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T L L KPNV++ L DD G+ D+ G P ID +A +GL LTS Sbjct: 8 TLISLQFLMAADTSKPNVIIILTDDQGYNDLSCYGSKTIKS---PRIDQLAEEGLKLTSY 64 Query: 131 YSQ-PSSSPTRATILTGQYSIHHGILMP--PMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 Y P S +RA +LTG+Y G+ P G G T+ +LL GY T+A+G Sbjct: 65 YVASPVCSASRAALLTGRYPKLVGVPGVFFPNRGHKGLDPKHQTIAKLLKSVGYATKAVG 124 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+ E P N GFD + G +DM + + + E +K+ + Sbjct: 125 KWHLGDELEFLPTNQGFDSYYGIPYSNDMTPAFSMKYSENCLYREGVDQEALKKAFEANK 184 Query: 248 DVHAVRGGEQQAIADIT----PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 + + + P + +R+ D +KF+D+ S+KPFFLY H Sbjct: 185 IKPVGMKDKVPLMRNDECIEMPADQSTITKRFTDESIKFIDESTASNKPFFLYLAHSMPH 244 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 Y + + G S YGD + E++ + L + +NTL ++TSDNGP Sbjct: 245 TPLYVSKDFEGKSAG-GIYGDVIEEIDYNVGRIIDHLNEKNIAENTLFIYTSDNGPWLIK 303 Query: 364 PPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHP 420 HG + P K +++EGG RVP + W I S+ + D+FPT + G Sbjct: 304 KSHGGSALPLFEGKMTSFEGGQRVPAIIRWPAKIPKDSVSNEMTLSMDIFPTLAKITGAK 363 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 I+G + + + ++ AVR +KYH Sbjct: 364 AQDADL-------INGKNALELYEDPANFKTKHDYFFYSP---RAVRHKNWKYH------ 407 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 T +T G S+++L D ES ++ + + L+ + + + Sbjct: 408 -----QQETFKLKSTARKTKGPSLYDLSKDIGESKNLINDYPEIAAQLKNALLEHNK 459 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 488 bits (1257), Expect = e-136, Method: Composition-based stats. Identities = 114/503 (22%), Positives = 181/503 (35%), Gaps = 92/503 (18%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SS 136 LE +KPN++ + DD+G+ ++G G TP ID +A++G+ T AY+ Sbjct: 9 LEATEKQKPNIIFIMADDLGYAELGCYGQKKI---KTPHIDKLAAEGMKFTQAYAGSMVC 65 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NK 195 P+R+ ++TGQ++ H + + + TT+ ++L GY T A GKW +G Sbjct: 66 QPSRSVLMTGQHTGHTAVRANDL--NQLLYEEDTTVAEVLKIAGYATGAFGKWGLGYEGT 123 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 +P GFDDF G + + N E L Sbjct: 124 PGRPGQQGFDDFTGQLLQVHAHFYYPFWIWNNEHRL------------------------ 159 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK---- 311 + + + + F+ +PFF Y H + + Sbjct: 160 --MLPENENNQRGRYIHDLIHEDAKAFI--QKNKAQPFFAYLPYIIPHVELVVPEESEKP 215 Query: 312 ----------------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 Y GS T++ + ++D + LE G DNTLI+FTS Sbjct: 216 YRGQFPKKQILDPRPGYIGSEDGLTTFAGMVSRLDDHVGEIVTLLEDLGIRDNTLIIFTS 275 Query: 356 DNGPEAEVP------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLAD 408 DNG + +G P RG KGS +EGG+RVP W G I SD + D Sbjct: 276 DNGGQGGTWKEMTDFFNGNAPLRGHKGSMYEGGIRVPFIANWPGKIAAGKTSDLQIAFWD 335 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK---LAA 465 + PT +AG VP IDG+ LG Q + ++ A Sbjct: 336 VLPTLAQVAGTT-------VPSGVDIDGISFLPTLLGKGKQPEHEYLYWEYTRGKIRSRA 388 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 +R +K +++L TD E+ ++ +H Sbjct: 389 IRQGNWKAVQNRMN--------------------QPIELYDLGTDIGETKNLAKQHPEKI 428 Query: 526 VPLQTEMHAYMEILKKYPPRAQI 548 LQ M + +P + Sbjct: 429 KDLQQIMQQAHSEPRDFPQTLKP 451 >UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS23_9BACT Length = 635 Score = 487 bits (1254), Expect = e-136, Method: Composition-based stats. Identities = 145/478 (30%), Positives = 212/478 (44%), Gaps = 41/478 (8%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 T +KPN++ L DD+G+ D+G G + N TP++D +A +G+ LTS Y+ P +P+ Sbjct: 19 AATSQKPNIIFILADDMGYGDIGPFGSTL---NRTPNLDRMAKEGMKLTSFYAAPLCTPS 75 Query: 140 RATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 RA ILTG Y+ + G T+ +LL QGY T AIGKWH+G+ E+ Sbjct: 76 RAQILTGCYAKRVSLPKVLSPRSEVGLNTNEQTVAKLLKRQGYATMAIGKWHVGDAPENL 135 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P GFD + G +DM P + LP +D +Q Sbjct: 136 PTRHGFDHYLGLPYSNDM-------GGEEPGKDQPAKRGARPPLPLVRD---------EQ 179 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 I + P + L +R+ D VKF+ A +PFFLY H +P + G S Sbjct: 180 VIEVVKPADQDRLTERYTDEAVKFIR--ANDKQPFFLYLAHTAVHAPIHPGHNFRGKSRN 237 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR--TPFRGAKG 376 YGD + E++ + TL + G +NTL++F+SDNGP +G P RG KG Sbjct: 238 -GLYGDWVEEVDWSVGKVLDTLRELGLSENTLVLFSSDNGPWLAQKTNGGTAGPLRGGKG 296 Query: 377 STWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 T+EGG+R PT +W G + + D + DL PT + LAG +PK ID Sbjct: 297 GTFEGGMREPTLAWWPGKVPAQSVCDTVAGNIDLLPTFVKLAGGT-------LPKDKKID 349 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 G D ++ LG ++ R+A +YF L AVR +K ++ Q Y G Sbjct: 350 GRDISNLLLGQTKEAQREAHYYFAGTALQAVRSGPWKLAIVPQ----YEGMGKFSENAVE 405 Query: 496 VMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL----KKYPPRAQIK 549 + ++NL D E + H L + A L K P Sbjct: 406 GGKPFAPRLYNLDEDIGEKTDVVAEHPDEMKRLLGYVEAMEADLGVSKKNGPGVRPPG 463 >UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1Z7_PEDHD Length = 480 Score = 486 bits (1251), Expect = e-135, Method: Composition-based stats. Identities = 115/471 (24%), Positives = 194/471 (41%), Gaps = 33/471 (7%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 + ++PNV++ +DD+G+ D G G PTP+ + A +G+ T + Sbjct: 19 AQTTKTQRPNVIIINMDDMGYGDTEPYG---MTGIPTPNFNKAAKEGMRFTHFNAAQAIC 75 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPG-GLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 SP+RA +LTG Y G+ T+ LL GY T +GKWH+G Sbjct: 76 SPSRAALLTGCYPNRIGLRGALSPDSKIALDTAEETIASLLKKAGYKTAMLGKWHLGSKA 135 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P + GFD F G +DM+ D P+ A++ +S + G Sbjct: 136 PNLPLHYGFDSFYGLPYSNDMWP--VDYEGKPQAAVAGKKS----------YPELPLLDG 183 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 ++ A TP L + V+F++ PFFLY H +A + G Sbjct: 184 DKPADYVRTPDDQAMLTGTFTRKAVRFIEN--NKSAPFFLYLAHPMPHVPLAASAAFRGK 241 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH--GRTPFRG 373 S +GD ++E++ + K+L++N NT+++ SDNGP H FRG Sbjct: 242 SEL-GLFGDVIMELDWSVGEIMKSLDRNKIASNTILIIMSDNGPWLRFGNHAGSSGGFRG 300 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 K + W+GG RVP + W G ++ + ++ D+ PT L L+ P Sbjct: 301 GKMTIWDGGTRVPCIIRWPGKVEAGSVNSNLITNMDILPTLLQLSHAA--------PPEK 352 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQ-PYAYTQSGYQG 490 IDG+ LG + ++ R+ +Y+ N L AVR +K + Y G G Sbjct: 353 KIDGISFADLLLGRSDKAPRQVFYYYYNENSLKAVRYKNWKLVLPHTSVSYTSDIHGKDG 412 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++++L DP E+ + ++ + + + + Sbjct: 413 FPGAATRAEVKMALYDLAHDPGEAYDVQQQYPELVQKMLVFVEEARADMGD 463 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 484 bits (1246), Expect = e-135, Method: Composition-based stats. Identities = 111/478 (23%), Positives = 189/478 (39%), Gaps = 61/478 (12%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 K+PN +V DD+G+ D+G G TP +D +A +G+ T+ YS P SP+RA+ Sbjct: 15 KQPNFIVIYCDDLGYGDLGCYGSDTV---KTPHLDGLADEGIRFTNWYSNSPVCSPSRAS 71 Query: 143 ILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ G G TL + L GY T GKWH+G ++E+ P Sbjct: 72 LLTGKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEETSPN 131 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD+F GF + + + + +H + E + Sbjct: 132 AHGFDEFFGFKAGCVDFYSHI----------------FYWGQAHGVNPLHDLWENETEVW 175 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS----S 316 + + + + V F+ + + + PFFL+ H+ + KY Sbjct: 176 ENGR-----YMTELITERSVDFIQRSREQEAPFFLFASYNAPHYPMHAPQKYMDRFAHLP 230 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-----------VPP 365 R + ++D + K L++ G ++T+I F+SDNGP +E Sbjct: 231 WDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYG 290 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKV 424 FRG K S +EGG+R P + W + + D + + DL PT LDLAG A Sbjct: 291 GSAGIFRGHKASLFEGGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAGVDPAAG 350 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYT 484 + +DG S + + AVR ++K + Sbjct: 351 PL---QGVALDGSSLKEMLQ-MREPSPHQQLF-WEYQGQLAVREGDWKLVL--------- 396 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 G + + +L DP E ++ R+ + L ++ + E ++++ Sbjct: 397 ----NGKLDFDRVVPDQIHLSDLSRDPGERSNLADRYPEIVERLSRDVRDWYEEVQRH 450 >UniRef50_B3CAE2 Putative uncharacterized protein n=3 Tax=Bacteroides RepID=B3CAE2_9BACE Length = 467 Score = 484 bits (1246), Expect = e-135, Method: Composition-based stats. Identities = 127/479 (26%), Positives = 194/479 (40%), Gaps = 59/479 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT- 139 + KPNVV+ DD G+ D+G G + TP ID +A +GL LT Y Sbjct: 21 QAQHKPNVVIIFTDDQGYQDLGCYGSPLI---QTPSIDGMAREGLKLTDFY--------V 69 Query: 140 --------RATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 RA +LTG+ + +G+ G TL + L +Q Y T GKWH+ Sbjct: 70 SASVSSASRAGLLTGRLNTRNGVKGVFFPESEGMPSEEITLAEALKEQDYATGCFGKWHL 129 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTE----------WRDVHVNPEVALSPDRSEYIKQ 241 G+ K P + GFD + G +DMY +R+ + E D Sbjct: 130 GDLKGHLPTDQGFDKYFGIPYSNDMYIGPSQKFASNAVFREGYTLSEAKADQDFVRNAPN 189 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 K +++V + P +R+ D ++F+ + +KPFF+Y Sbjct: 190 RATIKKRLNSVSPLFEGDEIIEYPCDQSTTTRRYFDKAIEFVGQ--NKEKPFFVYITPSM 247 Query: 302 CHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 H + + ++ G S R YGD + E++ L++ G +NTL++F SDNGP Sbjct: 248 PHIPLFASEQFRGKS-KRGLYGDVVEEIDWNVGRFLDYLDQQGLAENTLVIFASDNGPWL 306 Query: 362 EVPPHGRT--PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 + P RG K S +EGGVRVP + WKG I SD I+ DLFPT + G Sbjct: 307 GYKEDSGSADPLRGGKFSYYEGGVRVPCILRWKGTIPAGVTSDAIIASIDLFPTIMHYVG 366 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + IDGVD +SF + R Y G++ +R ++ Y Sbjct: 367 CKSFR--------QEIDGVDISSFLKNPSL-RLRDEYVYVRGGEVHGIRKGDWAYLPKTG 417 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + +FNL D E++++ + + LQ M Y Sbjct: 418 N--------------SKFKEGDVPELFNLKRDIGETNNLHLEYPEKVKELQEVMQLYQA 462 >UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UHK0_RHOBA Length = 478 Score = 482 bits (1242), Expect = e-134, Method: Composition-based stats. Identities = 121/470 (25%), Positives = 191/470 (40%), Gaps = 55/470 (11%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + PN V+ DD+G+ D+ G TP +D +A++G + SP Sbjct: 37 AAADRPPNFVLIFADDLGYGDISCYDS---SGVKTPHLDQLAAEGFRSKDFFVPANVCSP 93 Query: 139 TRATILTGQYSIHHGI-----LMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +RA +LTG+Y + G+ Y G T+P+LL GY + +GKWH+G Sbjct: 94 SRAALLTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGM 153 Query: 194 N-KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + S P + GFD++ G S Y + + + ++ Sbjct: 154 ELEGSHPLDAGFDEYLGIPS------------------------NYEPRRGKNHNTLYRG 189 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 + EQ+ +A E+L +R+ D + F+++ D PFF+Y H P+ + Sbjct: 190 KQVEQKNVA------CEELTKRYTDEVIDFIERQ--KDDPFFIYVSHHIVHNPLKPSPDF 241 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 G+S YGD + E++ + +T+ G +NTL++FTSDNGP Sbjct: 242 VGTSEK-GKYGDFIKELDHSTGRIMQTIRDAGLDENTLVIFTSDNGPT---RNGSSGELS 297 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K T EGG RVP W I P + SD + DL P +LAG P +P Sbjct: 298 GGKYCTMEGGHRVPGMFRWTSKIAPNQVSDVTLTSMDLLPLFCELAGVP-------IPDD 350 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGG 491 IDG LG +S + +Y+ L AVR ++K H+ + Sbjct: 351 RQIDGKSILPVLLGQTSESPHQFLYYYNGTNLQAVREGKWKLHLPRTTD-DQPFWSKKPD 409 Query: 492 FTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 T + +FNL D E ++ RH + L + L Sbjct: 410 KTKGFVTLNEMRLFNLDRDLGEKKNVADRHPEIVARLNEQAELIRTELGD 459 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 482 bits (1241), Expect = e-134, Method: Composition-based stats. Identities = 110/491 (22%), Positives = 168/491 (34%), Gaps = 94/491 (19%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 + + PN+V+ + DD+G+ D+ G TP ID +A+ L T +S Sbjct: 26 SAAEAAQQPPNIVLIMADDLGYGDLACYGNKQV---KTPHIDRLAASALKFTDFHSAGAM 82 Query: 136 SSPTRATILTGQYSIHHGIL-----MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +PTRA +LTGQY G G T+ +LL QGY T GKWH Sbjct: 83 CTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWH 142 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G P N GFD FRG S + D N + + + Sbjct: 143 LGYQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNEDWWHNNE---------------- 186 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY--- 307 Y V F++ A +PFFLY HF Sbjct: 187 -------------ISMEKGYTADLLSKYSVAFME--ANRTRPFFLYVPHLAIHFPWQGPQ 231 Query: 308 -PNAKYAGSSPARTSYG-------------DCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 P + AG +G + ++ + L++ NTL++F Sbjct: 232 DPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIF 291 Query: 354 TSDNGPEAEVPPH-----GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLAD 408 TSDNG + P RG K + +EGG RVP + W G+I +D D Sbjct: 292 TSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAGVTDQTAHSVD 351 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 L PT AG DG+D + ++R ++ G AVR Sbjct: 352 LLPTLAQAAGISATNFQT--------DGLDLAPLWQTGRPLADRD--LFWRMGNNRAVRR 401 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++K ++ S +++L TD E + H + + Sbjct: 402 GQWKL----------------------CLKNNRSELYHLETDLGEQQNRAAEHPEIVKSM 439 Query: 529 QTEMHAYMEIL 539 + + + Sbjct: 440 SQALKEWEADV 450 >UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y214_PEDHD Length = 472 Score = 480 bits (1237), Expect = e-134, Method: Composition-based stats. Identities = 124/493 (25%), Positives = 187/493 (37%), Gaps = 72/493 (14%) Query: 64 HPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQ 123 ++ + KT KPNV+V + DD G++D G GG PTP+IDA+A Q Sbjct: 7 ISTLLLALWTGISAAQVKTAAKPNVIVIVSDDAGYVDFGCYGGKQI---PTPNIDAIAKQ 63 Query: 124 GLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMY--------GQPGGLQGLTTLPQ 174 G T AY +P+RA ILTG+Y G G T+ Sbjct: 64 GTRFTDAYVSASVCAPSRAGILTGRYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGN 123 Query: 175 LLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD 234 + GY T AIGKWH G+ + P N GF++F GF + ++ N Sbjct: 124 EMQANGYKTIAIGKWHQGDEPKHFPLNRGFNEFYGFTGGHRDFFAYKGKRTN-------- 175 Query: 235 RSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFF 294 HA+ ++ + + L + D F+ A DKPFF Sbjct: 176 --------------EHALYNNKEIVPENE----ITYLTDMFTDKATSFI--TANKDKPFF 215 Query: 295 LYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 +Y H R +Y M ++D + TL+ N NT Sbjct: 216 MYLSYNAVHTPMNAKKDLMERYASIADTGRRAYAAMMTSLDDGIGKVMATLKANQLDKNT 275 Query: 350 LIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG-IVDLAD 408 LI+F +DNG V P RG KGS WEGG+RV + W G I K+D V D Sbjct: 276 LIIFINDNGGAT-VNSSDNGPLRGMKGSKWEGGIRVAMMMKWPGHIAANKTDSRPVSSLD 334 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 + PT T +DGV+ + N ++ +A Y+ G AA+R Sbjct: 335 ILPT-------AIGAGKGKQKGTKKLDGVNLLPYLSAGNKKTPHEAL-YWRRGVAAAMRE 386 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 +K + + P +F+L D E+ ++ ++ L Sbjct: 387 GNWKLIRVKESPTVQN-----------------VLLFDLSKDLSETKNLSEKYPAKVKEL 429 Query: 529 QTEMHAYMEILKK 541 ++ + + L + Sbjct: 430 LVKLAEWEKGLDQ 442 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 480 bits (1236), Expect = e-134, Method: Composition-based stats. Identities = 117/526 (22%), Positives = 176/526 (33%), Gaps = 122/526 (23%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPT 139 ++PN+V FL+DD+G D+G G TP+ID +A+ G T AY+ P SPT Sbjct: 32 AAEQQPNIVFFLVDDLGQRDLGCYGSTF---YETPNIDKLAADGARFTQAYAACPVCSPT 88 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL-------------------QGLTTLPQLLHDQG 180 RA+ILTG + GI G TL + L G Sbjct: 89 RASILTGLWPQRTGITDYIATDNSNGPAKWNRNTMTLPAAYRDRLALDSPTLAKSLKSAG 148 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYT--EWRDVHVNPEVALSPDRSEY 238 Y T GKWH+G + P+N GFD RG Y ++ + NP + P Sbjct: 149 YATFFAGKWHLGP-EGFYPENQGFDINRGGIERGGPYGGKQYFSPYGNPRLTDGPAG--- 204 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 E L R +F++ A +PFF Y+ Sbjct: 205 ------------------------------EHLPDRLATETCQFIE--AHQKQPFFAYFS 232 Query: 299 TRGCHFDNYPNAKYAGSS------------------------PARTSYGDCMVEMNDVFA 334 H Y + M+ Sbjct: 233 FYSVHTPLQAREDLRQKYVAKREKLGLKPTWGREHMRDVRQVQEHAVYAAMVDAMDQAVG 292 Query: 335 NLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWK 392 + L++ G +NTL++FTSDNG +E P P RG KG +EGG+R P + W Sbjct: 293 KVLAKLDELGLRENTLVIFTSDNGGLSTSEGWPTSNLPLRGGKGWMYEGGIREPLVMRWP 352 Query: 393 GMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 ++ D V D T L A+ IDGV G + Sbjct: 353 AKVKAGSTIDTPVSSPDFMATLLAATATKPAE-------QQQIDGVSLLPLLAGEKLK-E 404 Query: 452 RKAEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVF 505 R ++ + A+R +K ++ +F Sbjct: 405 RSLFWHYPHYGNQGGAPAAAIRRGSWKLI--------------------EWLEDGQVELF 444 Query: 506 NLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 NL TD E+ ++ + + + E+HA+ + + P D Sbjct: 445 NLATDESETTNLASKEPALVREMLAELHAWQKEVGAILPEKNPNYD 490 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 479 bits (1235), Expect = e-134, Method: Composition-based stats. Identities = 125/482 (25%), Positives = 195/482 (40%), Gaps = 80/482 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 +PN+V+ + DD+G+ DVGFNG PTP +D +A+ G++ T+ Y+ P SP+R Sbjct: 41 AKDRPNIVLIVADDLGYSDVGFNG---CKEIPTPHLDELAASGVVFTNGYASHPYCSPSR 97 Query: 141 ATILTGQYSIHHGILMPPMYGQ-------PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 A +LTG++ G P PG TTL L + GYVT AIGKWH+G+ Sbjct: 98 AGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGD 157 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 K P GFD++ GF+ Y + + Sbjct: 158 AKPFWPNRRGFDEWFGFSGGGFSY--------------------------WGDLGMKDPL 191 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 G + + PK + L + VKF+ + +PFFLY H ++ + Sbjct: 192 LGVHRGDEPVDPKTLTHLTDDFSTEAVKFIQRH--ETEPFFLYLAYNAPHAPDHATRAHL 249 Query: 314 GSSP-----ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 + R YG + M++ + + ++G +NT+I+F SDNG E Sbjct: 250 QKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGGRRE--HAVN 307 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANL 427 P+RG KG +EGG+RVP V W G ++ K + + DLFPTAL AG + Sbjct: 308 FPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPS----- 362 Query: 428 VPKTTFIDGVDQTSFFLGTNGQ-SNRKAEHYFLNGKLA---AVRMDEFKYHVLIQQPYAY 483 + +DG + + R + G + AVR +K + Sbjct: 363 --QNDKLDGQNLLPVLTDDKQRLPERPLFWRYSMGDDSYGYAVRDGNWKLIDSRYKDRKL 420 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +F+L DP E + + +H L M A+ + P Sbjct: 421 --------------------LFDLANDPWEREDLAAQHPEQVARLSRMMEAW--DARNVP 458 Query: 544 PR 545 P+ Sbjct: 459 PK 460 >UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZGR2_BRAFL Length = 598 Score = 479 bits (1235), Expect = e-134, Method: Composition-based stats. Identities = 118/508 (23%), Positives = 197/508 (38%), Gaps = 65/508 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 +++++ + KPN+V L DD GW D+G++G TP++D +A++G+ L + Y QP Sbjct: 112 SDIQESSSGKPNIVFILADDYGWNDIGYHGS----VIRTPNLDRLAAEGVKLENYYVQPL 167 Query: 136 SSPTRATILTGQYSIHHGILMP-PMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 SP+R ++TG+Y I +G+ QP G TLPQ L + GY T +GKWH+G Sbjct: 168 CSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGF 227 Query: 194 NK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 K + P + GFD F G+ + ++ Y R P + + Q Sbjct: 228 YKQDYTPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQTWSGLDLRDQN---------- 277 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 +T + + + ++ + + +KP FL+ + H + Sbjct: 278 --------RPVTDQNGTYSTHLFANKAIEIIAQQ-DKNKPMFLFLSFQAVHDPLQAPEED 328 Query: 313 AGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 R Y M+ N+ + L++ G DNT+++F++DNG + Sbjct: 329 ISRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGGRVDRG-GI 387 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKG-MIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 P RG KGS WEGGVR FV + R SD ++ ++D FPT + LA Sbjct: 388 NWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASGSTNGT-- 445 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--------------------YFLNGKLAAV 466 +DG D R+ H F AA+ Sbjct: 446 -----KPLDGHDVWEAISDGKPSPRREILHNIDPMFHTVPSPRPHQWGDRVFNTSVHAAI 500 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R ++K + +FN+ DP+E + +H + Sbjct: 501 RSGDWKLLTGYPGNTSRVPPPSSTKEEPADTPGKHLWLFNIREDPEERTDLSQKHPGVVQ 560 Query: 527 PLQTEMHAYMEI-----LKKYPPRAQIK 549 L ++ Y + P+A Sbjct: 561 ELLEKLARYNRTAVPVFYPSFDPQANPA 588 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 479 bits (1233), Expect = e-133, Method: Composition-based stats. Identities = 127/529 (24%), Positives = 185/529 (34%), Gaps = 119/529 (22%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 L +PN V L DD+GW DVGFNG TP++D +A +G+ T AY+ Sbjct: 26 TSLCATRVHAADRPNFVFILADDLGWKDVGFNGSTF---YETPNLDRLAREGMRFTDAYA 82 Query: 133 Q-PSSSPTRATILTGQYSIHHGILMPPMYGQPG--------------GLQGLTTLPQLLH 177 SPTRA+I+TG+Y + + G+P TL + L Sbjct: 83 ACSVCSPTRASIMTGKYPARLHLTDW-LPGRPDKPDQILKHPKIITELPAAEITLAKALQ 141 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSE 237 + GY T IGKWH+G P+ GFD G + P SP ++ Sbjct: 142 EGGYKTAFIGKWHLGGL-GHWPEQAGFDINIGGCGMGH-----------PSSYFSPYKNP 189 Query: 238 YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 +K P E L R D VKF++ PF LY Sbjct: 190 TLKDGPVG-----------------------EYLADRLTDEAVKFIENT--KGTPFLLYL 224 Query: 298 GTRGCHFDNYPNA----KYAGSS----------------------PARTSYGDCMVEMND 331 H KY + Y M +++ Sbjct: 225 SHYSVHTPLQAKKGLIEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIYAAMMQSLDE 284 Query: 332 VFANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTFV 389 + L++ G NT+I+FTSDNG AE P P R KG +EGGVR P V Sbjct: 285 SVGRVLDKLKELGLDKNTVIIFTSDNGGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVV 344 Query: 390 YWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 W G+ + SD V D +PT L++AG P +DG+ T G Sbjct: 345 KWPGVTKAASVSDHQVMSTDYYPTLLEIAGLPAR-------PEQHLDGISFTPALRGKEM 397 Query: 449 QSNRKAEHYFLNG------KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS 502 R ++ + +++R ++K + Sbjct: 398 -GERPLFWHYPHYSNQGGAPSSSIRKGDWKLIEWY--------------------EENRI 436 Query: 503 SVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 +FNL D E + + L++E+ A+ +K P D Sbjct: 437 ELFNLRLDVGEKNDLASTSALKREELKSELQAWRASVKADMPLPNPNFD 485 >UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI83_PLALI Length = 558 Score = 478 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 118/469 (25%), Positives = 181/469 (38%), Gaps = 39/469 (8%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 + +KPNVV+ DD+G+ DVG G + TP+ID +A +G+ TS Y Sbjct: 100 AAEARPEKPNVVIINCDDLGYADVGAFGATIC---KTPEIDRMAREGVKATSFYVAQAVC 156 Query: 137 SPTRATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 S +R +LTG GIL + + G TL +L QGY T GKWH+G Sbjct: 157 SASRTALLTGCLPNRIGILGALSHVSKNGIADSEVTLGELFQSQGYSTAMYGKWHLGYQA 216 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P + GF + G +DM+++ P + + D Sbjct: 217 QFLPGHHGFGEALGIPYSNDMWSKNPYGKFPPLPLFRQKGDSPAEIIGHDTD-------- 268 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + V F+D+ A DKPFF+Y H + + + S Sbjct: 269 ------------QSRFTTDFTMAAVSFIDRHA--DKPFFIYLAHPMPHTPIFVSEE-RNS 313 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH--GRTPFRG 373 Y D + E++ + +TLEK+ TL++FTSDNGP H P R Sbjct: 314 GERAQLYRDVIGEIDWSVGTIRQTLEKHQLTRKTLVIFTSDNGPWLVFGNHAGSTGPLRE 373 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 KG+ W+GG RVP W G+I P D + DLFPT + G Sbjct: 374 GKGTMWDGGARVPFVACWPGVIPPDTTVDLPMATYDLFPTFAKMLGAKLP--------DH 425 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 IDGVD + +A ++ L AVR +K + + Sbjct: 426 PIDGVDIWPQLTSASKAQPHQALWFYYGRDLIAVRSGPWKLVFPHTYVHPVERGNDGQRG 485 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + +++NL +D E+ ++ +H + L+ L Sbjct: 486 KLVNRKFTELALYNLDSDIGETTNLASQHPEIVKQLEAYAEVARNELGD 534 >UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LCL3_PARD8 Length = 476 Score = 478 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 128/479 (26%), Positives = 186/479 (38%), Gaps = 43/479 (8%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T + N+V+ LDDVG+ D FNG A G TP+ID +A++G+ T Sbjct: 9 TAASAFSMAGIAADYTNIVLINLDDVGYGDFSFNG---AYGYTTPNIDKMAAEGVRFTHF 65 Query: 131 YS-QPSSSPTRATILTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGK 188 QP S +RA +LTG Y G P G T+ ++L +GY T GK Sbjct: 66 LVGQPISGASRAGLLTGCYPNRIGFSGAPGPDSNYGVHPEEMTIAEVLKQKGYSTAIFGK 125 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G KE P GFD++ G +DM+ EV PD Y + Sbjct: 126 WHLGSQKEFLPLQNGFDEYYGLPYSNDMWPFHP---QQGEVFNFPDLPTYDGNEIIGYNT 182 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 L + V F+ K +KPFFLY H Sbjct: 183 ------------------DQTRLTTDYTTRSVNFIKK--NKNKPFFLYLAHNMPHVPLAV 222 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH-- 366 + K+ G S + YGD M+E++ ++K L + G DNTL++ TSDNGP H Sbjct: 223 SDKFKGKSE-QGLYGDVMMEIDWSVGEIFKALRELGLEDNTLVILTSDNGPWTNYGNHAG 281 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGM-IQPRKSDGIVDLADLFPTALDLAGHPGAKVA 425 R AK +T++GG RVP +YWKG + + + DL PT ++ P Sbjct: 282 SAGGLREAKATTFDGGNRVPCIMYWKGKTLPGTTCNKLASNIDLLPTFAEITQAPLP--- 338 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAE-HYFLNGKLAAVRMDEFKYHVLIQQPYAYT 484 IDGV G + R++ +Y+ L AV FK + Sbjct: 339 -----PRKIDGVSILPLIEGKKDANPRESFVYYYRKNDLEAVTDGMFKLVFPHKYVTYGA 393 Query: 485 QSGYQGGFTGTVMQTA--GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 G G + +++L DP E ++ ++ L L Sbjct: 394 YEPGNDGQPGKLTNLEIMKPEMYDLRRDPGERYNVITQYPEEAAKLMKIADQKRHELGD 452 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 478 bits (1231), Expect = e-133, Method: Composition-based stats. Identities = 117/502 (23%), Positives = 183/502 (36%), Gaps = 78/502 (15%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QP 134 A+ +K K PN+V DD+G+ ++G G TP++D +A +G+ T Y+ P Sbjct: 17 AQAQKAPDKLPNIVYIYADDLGYGELGCYGQQKI---KTPNLDRLAKEGIRFTQHYTGTP 73 Query: 135 SSSPTRATILTGQYSIHHGILMPPM---------YGQPGGLQGLTTLPQLLHDQGYVTQA 185 +P RA ++TG+++ H I GQ T+ +LL +GY T Sbjct: 74 VCAPARAMLMTGKHAGHSAIRGNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATAL 133 Query: 186 IGKWHMGENK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 GKW MG N E P GFD + G+ + + P DR + + Q Sbjct: 134 TGKWGMGMNNTEGTPTRQGFDYYYGYLDQKQAHNLY------PSHLWENDRWDTLAQPW- 186 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 +D + + + K E + + + F+D+ PFFLY H Sbjct: 187 -QDIHRKLDPAKATDADFESFKGKEYAPAKMTEKALAFIDRSKAG--PFFLYMPYTLPHV 243 Query: 305 DNYPNAKYAGSSPAR-------------------TSYGDCMVEMNDVFANLYKTLEKNGQ 345 +Y + ++Y + ++D + L+ G Sbjct: 244 SLQAPDEYVKKYIGQFDEKPYYGEKNYASTKYPLSTYASMITFLDDQVGIILDKLKALGL 303 Query: 346 LDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-K 399 DNT+++F+SDNG + RG K +EGG+R P V W G I+P Sbjct: 304 DDNTIVMFSSDNGATFNGGVNPQFFNSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGRV 363 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 SD + DL PT +L G DG+ LG + + YF Sbjct: 364 SDHVSAQFDLMPTLAELTGQASP----------PTDGISFLPELLGQTNRQKKHEFLYFE 413 Query: 460 N---GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 G AVRM ++K + +FNL TD ES Sbjct: 414 YPEKGGQIAVRMGDWKGVKTDLR----------------KNPGNPWQLFNLKTDRSESTD 457 Query: 517 IGVRHIPMGVPLQTEMHAYMEI 538 + H + L + E Sbjct: 458 VAASHPDILKKLDQIVKREHEE 479 >UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 Length = 495 Score = 478 bits (1231), Expect = e-133, Method: Composition-based stats. Identities = 119/513 (23%), Positives = 185/513 (36%), Gaps = 86/513 (16%) Query: 42 HPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDV 101 ++L + + Q + +++ + P+++ + DD GW DV Sbjct: 7 KSRRFLF---GFLFAVFTCCITWKLLTLNQQDLPVAVAQQSSQPPHILFIMSDDQGWKDV 63 Query: 102 GFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMY- 160 GF+G + TP++D +A G L YSQP +P+RA +LTG+Y +G+ + Sbjct: 64 GFHGSDI----RTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVIPS 119 Query: 161 -GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYT 218 G+ G LPQ L + GY T +GKWH+G + + P+ GFD G Y Sbjct: 120 AGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEIDYF 179 Query: 219 EWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDY 278 H+ G + K + Sbjct: 180 ------------------------------THSAHGKVDWYRNNQLIKEEGYVTTLLGQD 209 Query: 279 GVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVF 333 VK ++K P FLY H KY P R +Y + M+D Sbjct: 210 AVKLIEKH-NPKTPLFLYLAFTAPHAPYQAPQKYLDQYKTIADPNRRAYAAMITAMDDQI 268 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG------------RTPFRGAKGSTWEG 381 + LEK G +NTLIVF SDNG G P+R K S +EG Sbjct: 269 GQVVAALEKRGMRNNTLIVFQSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEG 328 Query: 382 GVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G RV W G IQP + + + D++PT LA V K +DG++ Sbjct: 329 GTRVVALANWPGKIQPGTVVNHPIHIVDMYPTLTGLASVS-------VGKNKPLDGLNIW 381 Query: 441 SFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 S R Y + AA+ +++K P Sbjct: 382 PALS-EAKPSPRSQVVYDIEPFRAALSQEDWKLVWKATLPSRL----------------- 423 Query: 501 GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 +FNL D E ++ ++ + L+ ++ Sbjct: 424 --ELFNLSQDVSEQTNLAEQNPEIVSRLKQQIE 454 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 478 bits (1231), Expect = e-133, Method: Composition-based stats. Identities = 114/479 (23%), Positives = 180/479 (37%), Gaps = 61/479 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 L L +PN++V + DD+G+ DVGFNG PTP ID++A G+ T+ Y+ Sbjct: 10 LISLNAICASRPNLIVIMADDLGYNDVGFNG---CTEIPTPGIDSIAQNGVKFTNGYTSY 66 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYG----QPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 P+RA +TG+Y G P + + T+ + L GY IGKW Sbjct: 67 SVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKW 126 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G +P GFD+F G + D+ + + + Y + Sbjct: 127 HLGAEPSLRPNKRGFDEFFGHLGGGHRFMP-EDLVIQHTEEVKNELDSYRSWI------- 178 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 D K + L + + D V F+ + KPFFL+ H Sbjct: 179 ---------TRNDTPVKTTKYLTEEFSDEAVSFIKR--NHQKPFFLFLSYNAPHLPLQAT 227 Query: 310 AKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 KY P R +Y + ++D + + ++L++ DNT++ F SDNG + Sbjct: 228 EKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPSHKN 287 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K WEGG RVP + + IQ ++ D V D+F T LA Sbjct: 288 KSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLA------ 341 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK-LAAVRMDEFKYHVLIQQPYA 482 + +DGV+ F G Q+ + VR +FK + Sbjct: 342 -QSPTHADKPLDGVNLIPFITGEKTQAPHAQIFIRKFDQSRYVVRQGDFKLVIPY----- 395 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + A ++NL D E ++I H L+ + L Sbjct: 396 ---------------KDAPPQLYNLSKDIGEENNIAAVHPERVKELEKVRKQWDSELMD 439 >UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYA6_RHOBA Length = 490 Score = 478 bits (1230), Expect = e-133, Method: Composition-based stats. Identities = 118/482 (24%), Positives = 179/482 (37%), Gaps = 70/482 (14%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 PN VV DD G+ DVG G TP +DA+A G+ TS Y+QP P+RA Sbjct: 20 AAPPNFVVIFTDDQGYEDVGCFGSPDI---RTPRLDAMAKGGMKFTSFYAQPICGPSRAA 76 Query: 143 ILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM------GENKE 196 ++TG Y + P + T+ ++L +GY + GKW + G + Sbjct: 77 LMTGCYPMRVAERGHTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSD 136 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD F G + +D V Sbjct: 137 LLPTGQGFDYFYGTPTSND-----------------------------------RVANLY 161 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + M L +R+ D + F++K ++PFF+Y H + + G S Sbjct: 162 RNEELIEPESDMATLTRRYTDEAISFIEK--NQNQPFFVYIPHTMPHTRLDASKDFKGKS 219 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE------------AEVP 364 R YGD + E++ + +L + DNT ++FTSDNGP Sbjct: 220 -KRGLYGDVIEEIDFNVGRILDSLNELNLADNTYVLFTSDNGPWLVKNKGHADGHRLGDH 278 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAK 423 P R K ST+EGGVRVP ++ G + D I D+ PT LAG Sbjct: 279 GGSAGPLRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAE--- 335 Query: 424 VANLVPKTTFIDGVDQTSFFLGT-NGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYA 482 +P IDG D F G + KA Y+L L AVR ++K H+ ++ Sbjct: 336 ----IPTDRVIDGEDIRHLFHGEFDKADPDKAFFYYLRVHLQAVRQGKWKLHLPREKEPV 391 Query: 483 YTQSGYQGGFTGTVM--QTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + + +L D E+ ++ + + L + + L Sbjct: 392 GAAPFGRNAHIAPKDRIGFKQPFLVDLDNDLGETTNVAAENPEVVERLLGLAESMRDDLG 451 Query: 541 KY 542 Y Sbjct: 452 DY 453 >UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI0_9BACT Length = 456 Score = 477 bits (1228), Expect = e-133, Method: Composition-based stats. Identities = 109/478 (22%), Positives = 175/478 (36%), Gaps = 76/478 (15%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 + KPN++ + DD+G+ +G G + TP +D +A +GL LT Y+ + Sbjct: 13 AANSADKPNIIFIMCDDMGYGQLGSYGQKMI---KTPRLDQMAKEGLRLTDYYAGTAVCA 69 Query: 138 PTRATILTGQYSIHHGILMPPMY--GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-N 194 P+R +++TGQ+ H I Y GQ T+ + + + GY T IGKW +G Sbjct: 70 PSRCSLMTGQHVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALIGKWGLGYPG 129 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 E +P GFD F G+N + + +R Sbjct: 130 SEGEPNKQGFDYFFGYNDQKHAHNHFP---------------------------KFLLRN 162 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----- 309 E + + + K +E D F+ K D PFFLY H Sbjct: 163 EETLTLKNNSGKEIEYSQYMLTDEAKGFIKK--NKDNPFFLYLAYVIPHSRLQIPGDDEC 220 Query: 310 ---AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP- 365 K + + + ++ ++ L++ +NTL+VFTSDNG E Sbjct: 221 YLQYKDESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGAR 280 Query: 366 ----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHP 420 + P G K S +EGGVRVP +W G+I+P S+ I DL PTA +L G Sbjct: 281 PEFFNDSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNHIGAHWDLMPTACELGGVQ 340 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF--LNGKLAAVRMDEFKYHVLIQ 478 + IDG+ G + + YF VR ++ Sbjct: 341 PPEG---------IDGISYVPLLKGNMEEQEKHDYLYFELHWPTKRGVRKGDW------- 384 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE-MHAY 535 Q + +FNL D + + ++ + + A+ Sbjct: 385 -------VALQSKTSAIDPNKDTIKLFNLKNDLGQKKDLATQYPEKVEEFKKIFLEAH 435 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 476 bits (1226), Expect = e-132, Method: Composition-based stats. Identities = 108/490 (22%), Positives = 179/490 (36%), Gaps = 72/490 (14%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 +A K+PN+V DD G+ D GF G TP++D +AS+G+ T Sbjct: 10 AALSISVACTSLSYAKQPNIVFLFSDDAGYADFGFQGSETM---KTPNLDQLASEGVRFT 66 Query: 129 SAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYG-----------QPGGLQGLTTLPQLL 176 Y + P+RA I+TG+Y G + G + G T+ + Sbjct: 67 QGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAEMGIPLDEVTMGDYM 126 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T GKWH+G E P + GFD+F GF Y + + A+ D+ Sbjct: 127 KSLGYRTAFYGKWHLGGTDELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTDK- 185 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + + D ++ L + +F++K DKPFF++ Sbjct: 186 -------------------KLEHGIDQFQEHEGYLTDVLAEKANQFIEKA--PDKPFFIF 224 Query: 297 YGTRGCHFDNYPNAKYAGSSP----ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 H + P R + ++ + L++ G D+TL+V Sbjct: 225 LSFNAVHTPMEATPEDLAKFPQLKGKRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVV 284 Query: 353 FTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFP 411 F++DNG + P G K + EGG+RVP V W + K D V DL P Sbjct: 285 FSNDNGGPTDKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLP 344 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEF 471 T G +DGVD + G N ++ ++ Y+ AA+R ++ Sbjct: 345 TFFKAGGGEEVMSE--------LDGVDLMPYITGQNNKAPHESM-YWKKETRAAIRQGDW 395 Query: 472 KYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 K +P ++NL D E ++ + + + Sbjct: 396 KLLRFPDRPA---------------------ELYNLANDIGEQHNLAAQEPERVKQMYKD 434 Query: 532 MHAYMEILKK 541 ++ L++ Sbjct: 435 FFSWEMTLER 444 >UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C581_BEUC1 Length = 458 Score = 474 bits (1220), Expect = e-132, Method: Composition-based stats. Identities = 121/467 (25%), Positives = 185/467 (39%), Gaps = 64/467 (13%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 ++PN+V+ DD+G+ D+G G + N TP +D +A++G+ LT Y + P SP+R Sbjct: 2 TQRPNIVLINADDLGYGDLGCYGS---MRNDTPHLDRLAAEGVRLTDFYMASPVCSPSRG 58 Query: 142 TILTGQYSIHHGI-----LMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 +LTG Y G G P G T+ ++L D GY T AIGKWH G+ Sbjct: 59 GMLTGCYPPRIGFGEFVGRPVLFPGDPVGLDPAERTMARVLGDAGYATAAIGKWHCGDQP 118 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 E P GFD + G +DM + P +S + Sbjct: 119 EFLPTRHGFDSYFGIPFSNDMGRQREHEDWPPLPLMSGE--------------------- 157 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 L +R+ +F+++ + +PFFLY H + A + + Sbjct: 158 ----SVVQEQPDQRSLTERYTVAATRFIEE--NAHQPFFLYLAHMYVHVPLFVPAPFLAA 211 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 S YG + ++ + TL + G +NT++VFTSDNG A P RG K Sbjct: 212 SRN-GGYGGAVAALDWSTGVVMDTLRRLGLEENTIVVFTSDNGSRARGEGGSNDPLRGHK 270 Query: 376 GSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 TWEGG RV V W I D + DL PT A A+ + Sbjct: 271 AQTWEGGQRVACVVRWPAAIPAGGVCDAVTRSIDLLPTF-----AAVAGAADWADPARPV 325 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DGVD T+ G G + + Y+ L AVR+ ++K H+ ++ Sbjct: 326 DGVDLTALLTG-AGPAPNETFAYYYMDDLEAVRVGDWKLHLSKRRDPMR----------- 373 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +++L TD E+ + H + L+ L Sbjct: 374 --------ELYDLRTDAAETHDVAADHPDVVARLEAVAETIRADLGD 412 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 473 bits (1219), Expect = e-132, Method: Composition-based stats. Identities = 108/477 (22%), Positives = 167/477 (35%), Gaps = 83/477 (17%) Query: 92 LLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSI 150 + DD+G+ D+ G TP +D +A+ G+ T +S SPTRA +LTG+Y Sbjct: 1 MADDLGYGDLSCYGSQNCN---TPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQ 57 Query: 151 HHGILMPPMYG-----QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFD 205 GI G + TL Q L D GY T GKWH+G ++ P GF Sbjct: 58 RAGIDGVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQRQYNPTFRGFQ 117 Query: 206 DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP 265 F G+ S + Y D + + A++ Sbjct: 118 QFVGYVSGNVDYFAHLDGTGVFDWWHN----------------------------AELNR 149 Query: 266 KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA--------GSSP 317 + + D+ ++F+ + +KPFF+Y H S Sbjct: 150 EEQGYVTHLINDHALEFIRQQ--QEKPFFVYIAHEAVHSPYQGPHDQPMRKEGGGDIKSA 207 Query: 318 ART----SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 R +Y + EM+ + L++ + T I F SDNG RG Sbjct: 208 KRKDIANAYREMNTEMDKGIGQIVDVLKEVNLTEKTFIFFLSDNGANK---NGSNGKLRG 264 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 KGS WEGG RVP W G I D V DL PT L+LA +P Sbjct: 265 FKGSLWEGGHRVPAIACWPGRIPEGTVCDEPVISIDLMPTILELANAK-------IPAGH 317 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 +DGV S R+ ++ +A+R +K + + Sbjct: 318 KLDGVSLVSLLKDRKSLVPRQ--IFWEYNGKSAMRQGHWKLVLNQTR------------- 362 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 +++L D ES ++ +Q+ + A+ ++K K Sbjct: 363 ------KEPIELYDLTRDMSESKNLADNQPQRVQQMQSALAAWKSDVQKTATTQPEK 413 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 473 bits (1218), Expect = e-132, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 186/524 (35%), Gaps = 101/524 (19%) Query: 58 MMPVMQHPAQDKETQQKLAELE---KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPT 114 M P M + LA KPN+V L+DD+G D G G A T Sbjct: 1 MSPSMNYRQCIAAILVLLASGALHSDAAPTKPNIVFILIDDMGCKDAGCYG---ATNFST 57 Query: 115 PDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM--------------PPMY 160 P ID +A+QG+ T AY+ P SPTRA+++TG++ + P Sbjct: 58 PHIDRLANQGMRFTDAYAAPVCSPTRASLMTGKHPARLHLTNFIPQIGRQLPAGKLIPPG 117 Query: 161 GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG--ENKESQPQNVGFDDFRG--FNSVSDM 216 T+ Q LH GY IGKWH+G E +PQN GFD + + + Sbjct: 118 FNHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNY 177 Query: 217 YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWM 276 + + D P P + L R Sbjct: 178 FYPFVDQQKWPYAGPLPGNP-------------------------------GDYLPDRLT 206 Query: 277 DYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPART------SYGDCMVEMN 330 D + F+ + ++PFFLY H + R Y M ++ Sbjct: 207 DEAIDFVRE--NRERPFFLYLSHWSVHGRYFAPESLIAKYRERGLEERPAIYAAMMETVD 264 Query: 331 DVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVY 390 + L TL++ DNTL VF SDNG E P RG+KGS +EGGVRVP V Sbjct: 265 NSVGRLMATLDELNLADNTLFVFMSDNGGE---RITSMAPLRGSKGSLYEGGVRVPLIVR 321 Query: 391 WKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 + G+++P V DLFPT LD A + +DG G + Sbjct: 322 YPGVVKPNTTCSVPVISHDLFPTFLDFAERSY--------RDNKLDGHSIAGLLTGEQSE 373 Query: 450 SNRKAEHYFLNGKL------AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS 503 +R A ++ +A+R +K ++T + Sbjct: 374 LDRDALYWHFPHYWGSTRPCSAMRQGRWKLVEH--------------------LETGRAQ 413 Query: 504 VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 +++L +DP E + L+ + + + P Sbjct: 414 LYDLSSDPGEQRDLANEMPQQATELRKMLAQWRTKVGAQMPTHP 457 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 472 bits (1216), Expect = e-131, Method: Composition-based stats. Identities = 124/508 (24%), Positives = 177/508 (34%), Gaps = 98/508 (19%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 L KPNVV+ ++DD+G D+G G TP+ID +A G+ T Y+ Sbjct: 12 FFALPASAAGKPNVVLIVIDDLGQRDLGCYGSTF---YKTPNIDRMAKDGVRFTDFYAAC 68 Query: 134 PSSSPTRATILTGQYSIHHGILMP-----PMYGQPGGLQG--------LTTLPQLLHDQG 180 P SPTRA+I+TG+Y GI + GQ T+ + L G Sbjct: 69 PVCSPTRASIMTGKYPQRVGITDWLPGRKDLPGQRLKRPELKNELALEEVTVAETLKGHG 128 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 YVT IGKWH+G K +P+ GFD + + + Sbjct: 129 YVTAHIGKWHLG-GKGFEPEKQGFDVNVAGDHTGTPLSYF-------------------- 167 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 PF+ + G E+ A E L R F+ A DKPFFLY Sbjct: 168 -APFANKAGATMPGLEKAAPD-------EYLTDRLAAEAETFI--TANKDKPFFLYLPHY 217 Query: 301 GCHFDNYPNAKYAGSSPARTS--------YGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 G H + Y + M+ + K L+ DNTL++ Sbjct: 218 GVHTPLRAPQPLVDKYKTQAVHGRQSNPVYAAMVESMDAAVGRVLKRLDDLKLSDNTLVL 277 Query: 353 FTSDNGPEAE-----VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDL 406 FTSDNG A P P R KG +EGGVRVP W G ++P D + Sbjct: 278 FTSDNGGLATLEGMPFAPTINAPLREGKGYLYEGGVRVPLIAKWPGKVKPGTVMDQVACS 337 Query: 407 ADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA-- 464 D F T L+ G A DGV F G + R ++ + Sbjct: 338 IDFFDTILEATGATSAARR---------DGVSLVPAFGGEKLK-PRALYWHYPHYANQGS 387 Query: 465 ----AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR 520 AVR +K + +F++ D ES ++ Sbjct: 388 RPGGAVRAGNYKLVEYY--------------------EDGRRELFDVAKDLSESRNLAAD 427 Query: 521 HIPMGVPLQTEMHAYMEILKKYPPRAQI 548 + L ++ A+ + P Sbjct: 428 KPDVVKDLAAKLDAWRTDVGAKMPTPNP 455 >UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP2_9BACT Length = 446 Score = 472 bits (1216), Expect = e-131, Method: Composition-based stats. Identities = 118/475 (24%), Positives = 181/475 (38%), Gaps = 65/475 (13%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTR 140 KPN+V+ DD+GW DV ++G TP IDA+A G+ Y+ P+R Sbjct: 16 AADKPNIVLVFADDMGWGDVAYHG---VEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSR 72 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 A ILTG+Y G++ G + + +LL GY + A GKWH+G K P Sbjct: 73 AGILTGRYQQLFGVVTN-GDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQFPN 131 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GFD F GF+ + Y Y +K Q I Sbjct: 132 DRGFDTFYGFHFGAHDY--------------------YRADKKLNKKKKGYAPIYFNQDI 171 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA-- 318 D K + L ++ D+ V+F+++ D+PFF+Y H +Y P Sbjct: 172 VDY--KEGDYLTEKITDHAVEFIEE--NKDQPFFMYVAYNSVHSPWQVPDEYLARIPESV 227 Query: 319 ---RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP---------EAEVPPH 366 R + ++ M+D + L++ +NT+ VFT+DNG E + Sbjct: 228 PAYRRLFLAMVLAMDDGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMS 287 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVA 425 FRG KG T+EGG+RVP + W I+ K + V DL PT L A + Sbjct: 288 MSQGFRGYKGDTYEGGIRVPFCMSWPKKIKSGNKFEAPVIAYDLAPTFLSAASLEYSTKQ 347 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL--AAVRMDEFKYHVLIQQPYAY 483 G D + + + + L AVR ++K Q+ Sbjct: 348 FS--------GKDLLPYLEDEQKGRPHETLFWHRHSGLDDYAVRHGDWKLTYNDQE---- 395 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 G + ++ +FNL DP E + L+ + E Sbjct: 396 -------GTSKDFLKKVHLKLFNLKQDPYEKKDLADSMPEKLQQLKQLYFNWHET 443 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 472 bits (1215), Expect = e-131, Method: Composition-based stats. Identities = 115/506 (22%), Positives = 175/506 (34%), Gaps = 69/506 (13%) Query: 48 VKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGG 107 + +++ + K+PN++V + DD G+ D F G Sbjct: 1 MTLIDPFLMSLLRKAFTSVAALSLASSSVRADDTPTKRPNIIVIVSDDQGYADASFQGSK 60 Query: 108 VAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMY----GQ 162 + TP++DA+A G+ T Y P SP+RA ++TG+Y G + Sbjct: 61 DIL---TPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNNIVAEAALPI 117 Query: 163 PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRG-FNSVSDMYTEWR 221 T LPQ+L GY T +GKWH+G +P GFD+F G D + Sbjct: 118 AHLPSNETLLPQVLAKAGYYTAMVGKWHLGLQDGCRPYERGFDEFFGIITGGHDYFVNHP 177 Query: 222 DVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVK 281 + D + R + + P L + V+ Sbjct: 178 EERAVG-------------------DQSYKARIERNGPVGEAVP---GYLTDAFGADAVR 215 Query: 282 FLDKMAKS--DKPFFLYYGTRGCHFDNYPNAKYAG------SSPARTSYGDCMVEMNDVF 333 + + D+P FLY H S R +Y + M+ Sbjct: 216 IIRESHTKRPDQPLFLYLAFNAPHTPTQAPKDLVDTMPATLESKDRRTYAAQITSMDASV 275 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKG 393 + L++NG +T IVF SDNG A P + TP R KGS +EGG+RVP F + G Sbjct: 276 GKVRAALKENGMEKDTFIVFFSDNGG-ANHPYYDNTPLRDHKGSLYEGGIRVPFFAVYPG 334 Query: 394 MIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 I + V D+F TA LAG + +D VD G Q Sbjct: 335 HIPAGSVCELPVTSLDVFATACALAGTKP-------ETSHPLDSVDMLPVLEGNARQPTH 387 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + G AAV + K V + +F+L D Sbjct: 388 ATLFWEFPGFGAAVADRDLKLVVP---------------------KKGSPQLFDLAVDIG 426 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEI 538 E + ++ L T + + Sbjct: 427 EKSDLAAQNPEKVARLSTLLSEWHAQ 452 >UniRef50_A6BZV9 Arylsulfatase n=3 Tax=Bacteria RepID=A6BZV9_9PLAN Length = 520 Score = 471 bits (1213), Expect = e-131, Method: Composition-based stats. Identities = 104/551 (18%), Positives = 182/551 (33%), Gaps = 128/551 (23%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 A + K+PN+++ + DD+GW D+G GG V TP +D +A +GL T Y+ Sbjct: 16 SHSAVQAAEKIKRPNIILIMCDDMGWSDIGCYGGEV----QTPHLDRMAKEGLRFTQFYN 71 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 TRA+++TG Y Y +P + + T+ ++L GY T GKWH+G Sbjct: 72 NAVCWTTRASLVTGLYP---------RYPRPHLNRNMVTIGEVLQQAGYQTALSGKWHLG 122 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + + P GF DF G + + + +P+ Y Sbjct: 123 RTESTHPVYRGFQDFYGLLDGCCNFFD--PYYRDPKFKRGITGDGY-------------- 166 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 R + + D+ ++ + +++DKPFFL+ H+ + + Sbjct: 167 RFFAENTTRITEFPDDFYTTDAFTDHAIQEIKTYSQTDKPFFLHLCYTAPHYPLHAKPED 226 Query: 313 AGSSPAR----------------------------------------------------T 320 R Sbjct: 227 IKKYKGRYAAGWEALRNERYQRQLKMGLVDPQWKLPARDPESADWEQDKYPRDWQERRME 286 Query: 321 SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG------------- 367 Y + M+ L TL++ G DNT+++F SDNGP+A P Sbjct: 287 VYAAMIDCMDQNIGRLMATLKETGVDDNTIVMFLSDNGPDASEPGGANPEQIPGPEEYYT 346 Query: 368 ----------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDL 416 TPFR K EGG+ P V W G I+ + + D+ PT ++L Sbjct: 347 TCGPSWAFPQNTPFRRFKTWMHEGGISTPLIVRWPGKIKANSLTRQPAHIIDVMPTCVEL 406 Query: 417 AGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVL 476 A K +DG G + + + AVR ++K Sbjct: 407 AETDYPATFQSH-KILPVDGKSIVPILQGK-IREPHDSLF-WELRNNQAVRQGKWKLV-- 461 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 +++L D E++++ ++ ++ + + Sbjct: 462 ------------------ADRNINRWELYDLEQDRTETNNLASQYPERVAQMKADWQKWA 503 Query: 537 EILKKYPPRAQ 547 + + Q Sbjct: 504 DKTGVAQQKHQ 514 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 471 bits (1213), Expect = e-131, Method: Composition-based stats. Identities = 115/487 (23%), Positives = 183/487 (37%), Gaps = 58/487 (11%) Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 ++ + ++ +KPN++ L DD+G+ DVGFNG TP++D + Sbjct: 15 LIAISVGNASAADAGQSKADESNEKPNILFVLADDLGYNDVGFNGSTDI---KTPNLDGL 71 Query: 121 ASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLH 177 A G+ +AY P P+RA I+TG+Y G G + Q + Sbjct: 72 AKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMK 131 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSE 237 GY T A+GKWH+GE E P GFD+F GF Y + + + Sbjct: 132 SAGYFTGAMGKWHLGEASEYHPNKHGFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNI 191 Query: 238 YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 + P + + E + V F+DK A KPFFLY Sbjct: 192 NMYLTPLEHNGKEV--------------RETEYITDGLSREAVNFVDKAAAKKKPFFLYL 237 Query: 298 GTRGCHFDNYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 H + R +Y + ++ + + L+KNGQ DNT+IV Sbjct: 238 AYNAPHVPLQAKEEDMAMFSQIKDKKRRTYAGMVYAVDRGVGRIVEQLKKNGQFDNTVIV 297 Query: 353 FTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFP 411 FTSDNG + + P + KGS EGG R P V+W ++ V DL+P Sbjct: 298 FTSDNGGKLGQGAN-NYPLKEGKGSVQEGGFRTPMLVHWPKHMKAGSRFSHPVLALDLYP 356 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH---YFLNGKLAAVRM 468 T L G ++P+ +DG D + + + + AA R Sbjct: 357 TFAGLGGA-------VLPEDKKLDGKDIWADIQANTAPHKDEFIYVLRHRNGYSDAAARR 409 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++FK + ++N+ D E + I +H + + Sbjct: 410 NQFKAVKNHNDDWK---------------------LYNIAQDISEDNDISAQHPDILRDM 448 Query: 529 QTEMHAY 535 + M ++ Sbjct: 449 VSSMESW 455 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 470 bits (1211), Expect = e-131, Method: Composition-based stats. Identities = 121/526 (23%), Positives = 192/526 (36%), Gaps = 97/526 (18%) Query: 59 MPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDID 118 + ++ TQ+ A+ + ++PN+V L DD+G+ DVGFNG + TP+ID Sbjct: 5 LLLIPLLTSSFLTQR--ADAQAPKPQRPNIVFILADDLGYGDVGFNGQKLI---KTPNID 59 Query: 119 AVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPM---YGQPGGLQGLTTLPQ 174 +A +G+I Y+ +P+R+++LTGQ++ H I GQ +TTL + Sbjct: 60 KLAKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRGNKGVEPEGQQPIADSVTTLAE 119 Query: 175 LLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP 233 +L GYVT A GKW +G E P GFD F G+N S + + + + + Sbjct: 120 VLKKSGYVTAAFGKWGLGPVGSEGDPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILL 179 Query: 234 DRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPF 293 + ++ + E + F+ +PF Sbjct: 180 EGNKGL-------------------------IHNKEYAPDLIQKKALSFV-NAQDGKQPF 213 Query: 294 FLYYGTRGCHFDNYPNAK----------------------------YAGSSPARTSYGDC 325 FL+ H + YA ++ Sbjct: 214 FLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQDFPHATFAAM 273 Query: 326 MVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWE 380 + ++ + L+K G NTL++FTSDNGP E + FRG K +E Sbjct: 274 VARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGADPRFFNSGAGFRGVKRDLYE 333 Query: 381 GGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQ 439 GG+R P W I+P KSD I D+ PT +LA P IDG+ Sbjct: 334 GGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELANAPAP---------RNIDGISF 384 Query: 440 TSFFLGTNGQSNRKAEHY--FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVM 497 T G Q ++ G AVR +K Sbjct: 385 TDALKGKAIQKKHDYLYWEFHEQGGRQAVRQGNWKAVR----------------LKAAGN 428 Query: 498 QTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 A +++L DPQE +++ + L M+ +P Sbjct: 429 PDALVELYDLSKDPQEKNNLTPQFPEKAKELGQIMNRAHVSSAIFP 474 >UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Planctomycetaceae RepID=Q7UJQ8_RHOBA Length = 491 Score = 470 bits (1210), Expect = e-131, Method: Composition-based stats. Identities = 106/509 (20%), Positives = 181/509 (35%), Gaps = 76/509 (14%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 K+PN+V L DD+G+ D+G G + TP +D +A++G+ T Y+ Sbjct: 23 ATAPSTSAADAKRPNIVFILADDLGYGDLGCYGQELI---QTPRLDQMAAEGMRFTDFYA 79 Query: 133 -QPSSSPTRATILTGQYSIHHGILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 +P+R+ ++TG + H + P + T+ ++L GY T GK Sbjct: 80 GNTVCAPSRSVLMTGMHMGHTHVRGNAGGPDMSKQSLRDENVTVAEVLQSAGYATALCGK 139 Query: 189 WHMGEN----KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 W +G++ ++ P+ GFD F G+ + + + PE + ++ Sbjct: 140 WGLGDDALGGRDGLPRKQGFDHFYGYLNQVHAHNYY------PEFLWRNETKVALRNEVQ 193 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA--KSDKPFFLYYGTRGC 302 +D + K ++ + + F+ + A + KPFFLY Sbjct: 194 RRDRSYGG------FTGGWATKRVDYSHDLIANEAMGFIREKATDAATKPFFLYLSLTIP 247 Query: 303 H--------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 H +Y S + M+ + L++ + Sbjct: 248 HANNEGTGMSGNGQEVPDYGIYADKDWSDQDKGQAAMITRMDSDVGRILDLLKELQIDEQ 307 Query: 349 TLIVFTSDNGPEAEVPPH-----GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDG 402 T+++F+SDNGP E + P RG K + EGG+RVP V W G P SD Sbjct: 308 TVVMFSSDNGPHNEGGHNPKKFDPAGPLRGMKRALTEGGIRVPLIVRWPGTTPPGAVSDH 367 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-- 460 I DL TA +LAG + A D + +G Y+ Sbjct: 368 IGYFGDLMATAAELAGTDFPEDA---------DSISFAPTIVGRPEAQQTHEYLYWEFYE 418 Query: 461 -GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 G AVR +K T + +++L D E+ ++ Sbjct: 419 QGGRQAVRRVNWKAIR-------------------EPWMTGPTQLYDLKADIGETTNLAS 459 Query: 520 RHIPMGVPLQTEMHAYMEILKKYPPRAQI 548 H + L+T M + R Sbjct: 460 DHPEIVKQLETLMEEAHTPHPNWQVRVPA 488 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 469 bits (1209), Expect = e-130, Method: Composition-based stats. Identities = 112/479 (23%), Positives = 193/479 (40%), Gaps = 69/479 (14%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 K+PN++VF DD+G+ D+G G TP +D +AS+G+ T+ YS P SP+RA+ Sbjct: 2 KRPNIIVFYCDDLGYGDLGCYGSDAM---KTPHLDQLASEGIRFTNWYSNSPVCSPSRAS 58 Query: 143 ILTGQYSIHHGILMPPMY--GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ G G TTL L + GY T GKWH+G + E P Sbjct: 59 LLTGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAEYGPN 118 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD F GF + Y + VH + E + Sbjct: 119 AHGFDQFYGFRAGCIDYYSHIFYWGQGG----------------GVNPVHDLWRNETEVW 162 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS----S 316 + E + + ++D A D+P+F+Y H+ + Y Sbjct: 163 EN-----GEYMTEAITREATSYID-AAPDDEPYFMYVAYNAPHYPMHAPKAYLDRFPDLP 216 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-----------VPP 365 P R + ++D + K L++ G ++T+I F+SDNGP E Sbjct: 217 PDRRIMAAMIAAVDDGVGEIVKALKQKGAYEDTIIFFSSDNGPSTESRNWLDGTEDLYYG 276 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMI---QPRKSDGIVDLADLFPTALDLAGHPGA 422 FRG K S +EGG+R P + + + Q + SD + + D+FPT L+L+G Sbjct: 277 GSAGRFRGHKASLFEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTMLELSGIGTE 336 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYA 482 + +DG G ++ ++ AVR ++K + + ++ Sbjct: 337 GYS--------LDGHSVFDALSGNALSPRKQ--LFWEYEGQLAVREGKWKLVLNGKLDFS 386 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + + +L D E ++ ++ + L+ ++ + + L++ Sbjct: 387 -------------RTEADAVHLSDLEQDSSERINLVKQYPEIAQRLERDVRQWYQSLQE 432 >UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ23_9FIRM Length = 483 Score = 469 bits (1207), Expect = e-130, Method: Composition-based stats. Identities = 110/479 (22%), Positives = 182/479 (37%), Gaps = 57/479 (11%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRAT 142 KKPN++VFL DD G+ D+ G TP++D +A+ G T Y+ SP+RA Sbjct: 15 KKPNIIVFLTDDQGYGDLSCMGSTDVC---TPNLDILAAGGARFTDFYAGSAVCSPSRAC 71 Query: 143 ILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +LTG+Y G+ + G G+ T L D GY T +GKWH+G E +P Sbjct: 72 LLTGRYPYMTGVRSILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAVPECRPT 131 Query: 201 NVGFDDFRGFNSV-SDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 ++GFD F GF S +D ++ N ++P+ + K Sbjct: 132 HMGFDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYT----------- 180 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AKYAGS 315 E + + G++F+ + + D PF LY H+ + ++ Sbjct: 181 --------GEYSTELFARKGLEFIREQVEKDMPFALYCAFNAPHYPMHAPYKYLERFKHL 232 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-----------VP 364 R + ++D + L++ G ++T+I F SDNGP E Sbjct: 233 PEDRQIMAAMLSAVDDGVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYY 292 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAK 423 +G K S ++GG+RVP W M+ D+FPT ++ AG + Sbjct: 293 GGSTGGLKGHKFSLFDGGIRVPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNAS- 351 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 I G D G K Y+ G+ AVR +K + + Sbjct: 352 -------DYEISGCDILPVMT--IGARRDKDCLYWEMGQQTAVRRGNYKLVIN-----GF 397 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 + G+ + +L D E ++ + L+ + + L+ Y Sbjct: 398 LRDGWSLPLDPKTETKHEVWLSDLSQDMGEEHNLVEEMPELAKELEEKALTWRRDLEAY 456 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 468 bits (1206), Expect = e-130, Method: Composition-based stats. Identities = 108/552 (19%), Positives = 183/552 (33%), Gaps = 98/552 (17%) Query: 44 NQYLVKPATTIADNMMPVMQHP----AQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWM 99 NQ ++ P+ + ++ + + T +PN+V+ + DD+G+ Sbjct: 28 NQAVLMPSRKWVRWALLLVCVAGVPNLDSTTVSAEEPNAKDATVSRPNIVLIVADDLGYG 87 Query: 100 DVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPP 158 ++G G TP +D +A++G+ LT+ YS +P+R ++TG++ H + Sbjct: 88 ELGCYGQTKI---RTPRLDQLAAEGIKLTNFYSGNAVCAPSRCCLMTGKHPGHAHVRNNG 144 Query: 159 ---------------MYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNV 202 GQ T+ + L GY T A GKW +G P Sbjct: 145 DPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFGKWGLGHFGTTGDPNEQ 204 Query: 203 GFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIAD 262 GFD F GFN + + + R E Q D Sbjct: 205 GFDLFYGFNCQRHAHNHYPNFLWRN-------------------------RVKEVQPGND 239 Query: 263 ITPKYMEDLDQRWMDYGVKFLDKMAKSDK--PFFLYYGTRGCHFDNYPNAK--------- 311 T ++++ +F+ + DK PFF Y H + Sbjct: 240 RTLHGETYSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLSIQVPEEEVDAYDGVI 299 Query: 312 ---------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP--- 359 Y R Y + M++ + ++ G +NTLI+FTSDNGP Sbjct: 300 EEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQVVDLVDSLGLGENTLIMFTSDNGPTYD 359 Query: 360 ----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTAL 414 + + +G KG EGG+RVP G++ SD I D PT Sbjct: 360 RLGGSDSDYFNSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFLPTIT 419 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL---NGKLAAVRMDEF 471 D AG DG+ G + Y+ A+RM + Sbjct: 420 DAAGVEVDASTT--------DGISFLPLLHGDDAAQQSHEFLYWEFPGYSGQQAIRMGNW 471 Query: 472 KYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 K + ++++L D ES+ + H + ++ Sbjct: 472 KAIRKD----------LSKRLKKGQTEPPAFALYDLSKDLAESNDVSASHPDVMAKIEAI 521 Query: 532 MHAYMEILKKYP 543 +++P Sbjct: 522 AKQQHVPSEQFP 533 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 468 bits (1206), Expect = e-130, Method: Composition-based stats. Identities = 116/505 (22%), Positives = 187/505 (37%), Gaps = 96/505 (19%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 KPN++ L DD+GW D+G G + + TP+ID AS + TSAY+ SP+R+T Sbjct: 23 PDKPNILFILADDMGWSDLGCYGADL---HETPNIDRFASGAVRFTSAYAMSVCSPSRST 79 Query: 143 ILTGQYSIHHGILMPPMYGQPG---------------GLQGLTTLPQLLHDQGYVTQAIG 187 ++TG+++ + Q G T+ L GY+T IG Sbjct: 80 LMTGKHAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIG 139 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+ E P+ GFD G + T W + P + Sbjct: 140 KWHLGD-WEHYPEAHGFDINIGGTNWGAPQTFWWPYSGSGTHG------------PEFRY 186 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 H G E L R D +K +D D+PFF+Y H Sbjct: 187 IPHLEYGHP-----------GEYLTDRLTDEAIKVID--HAGDQPFFVYLAHHAVHTPIE 233 Query: 308 PNAKYA---------GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 A G + T Y E+++ + + L++ G NT+++F SDNG Sbjct: 234 AKADDIQHFDAKYRDGMNHRHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDNG 293 Query: 359 PEAE--------VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADL 409 +P P R KG+ +EGG+RVP + W G+ D V L D+ Sbjct: 294 GYIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDM 353 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL------NGKL 463 T L + G P P T DG+D + + + NR A + + Sbjct: 354 LQTFLHITGQP--------PATDATDGMDISPLLKDPSAKLNRDALFFHYPHYYHTTTPV 405 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 +A+R ++K + + ++NL D E + Sbjct: 406 SAIRARDWKLLEFYEDNHL--------------------ELYNLRNDLSEKHDLAKEMPD 445 Query: 524 MGVPLQTEMHAYMEILKKYPPRAQI 548 L+ +++A+ + + P+ Sbjct: 446 KAAALRDQLNAWRDSVGAVLPQPNP 470 >UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG6_9BACT Length = 499 Score = 468 bits (1204), Expect = e-130, Method: Composition-based stats. Identities = 113/475 (23%), Positives = 198/475 (41%), Gaps = 54/475 (11%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 + PN + + DD G+ D+G G + TP+ID +A +G+ T Y++ SP Sbjct: 17 TAKAEMPNFIFIMTDDQGYGDLGCYGHPII---KTPNIDKMADRGVRFTDFYARHKCSPA 73 Query: 140 RATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 RA+++TG ++ G+ G ++ + T+P++L ++GY T IGKWH+G Sbjct: 74 RASLMTGAFNFRVGVGSIVYPNSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLGHTAGYL 133 Query: 199 PQNVGFDDFRGFNSVSDM----YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 P++ GFD + G + + P + + K + ++ Sbjct: 134 PRDQGFDYYFGVPGTNHGDAKTHKLPVAEGFKPSGEFTIEDYWADKGKGVHGNSTILMKN 193 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 P + L +R+ V+++ + DKPFFLY+ H +A + G Sbjct: 194 DNVIEW----PTDITQLTKRYTHDAVRYIKE--NKDKPFFLYFAHGTPHHPYTVDAAFRG 247 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP-----PHGRT 369 S YGD + E++ + K L++NG T+I FTSDNG +++ Sbjct: 248 KSD-HGLYGDMIEEIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKEHAEKGSNL 306 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P +G KGS+ EGGVRVP + W G + K++ I L D+FPT LAG Sbjct: 307 PLKGWKGSSEEGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEPEV----- 361 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG--KLAAVRMDEFKYHVLIQQPYAYTQS 486 IDG + + + ++ K+ VR FKY Sbjct: 362 --PQKIDGNNIFPIMMCEPDVKSPNKYIFYAGNTPKITGVRNHRFKYSTKTSG------- 412 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +++++ D E+ ++ ++ + LQ M A+ + + + Sbjct: 413 -----------------LYDMHADIGETTNVADKYPEVLQELQKAMEAFQKDIDE 450 >UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CGJ8_9PLAN Length = 520 Score = 468 bits (1204), Expect = e-130, Method: Composition-based stats. Identities = 121/548 (22%), Positives = 195/548 (35%), Gaps = 92/548 (16%) Query: 53 TIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGN 112 + ++ P++ +A K N+V L DD+G+ DV Sbjct: 3 SFTQSIRPILILSGLFLSLFLPIAHAADKQS---NIVYILADDLGYGDVSCYNPE--SKI 57 Query: 113 PTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQ--PGGLQGL 169 TP ID +A++G+ T A++ +PTR ILTG+Y + + G P Q Sbjct: 58 KTPHIDRLAAEGMKFTDAHTPSAVCTPTRYGILTGRYCWRTRLKYRVLDGFDPPLIEQDQ 117 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKE-------------------------------SQ 198 T+P LL GY T IGKWH+G Sbjct: 118 VTVPSLLKKAGYDTACIGKWHLGMQWTDKNGQPVPAVPIDRRQRPRVGDDVDYTKPILGG 177 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P GFD + G ++ +M +P + DR + +P + + + Sbjct: 178 PLTSGFDYYFGISASLNM---------SPFCFIRNDRPVILPTIPSERIQTEFLSVDQGM 228 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAK--SDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 D T + + VK++++ K ++PFFLY+ H PN ++ G S Sbjct: 229 RSPDFTIR---SVMPTLTGEAVKYIERHGKESPERPFFLYFPLTAPHLPLVPNDEFKGKS 285 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP------------ 364 A YGD ++E++ + L++ G +NTL++FTSDNG Sbjct: 286 AA-GEYGDFVLEVDATVGAIMDALQRTGVAENTLVIFTSDNGGLYHWWTPQETDDLKHYK 344 Query: 365 ------------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFP 411 G RG K WEGG RVP V W G +D +V+L DL Sbjct: 345 PNHRGQYVKDRGHQGNAHLRGTKADIWEGGHRVPFIVRWPGKTPADSTNDELVELTDLLA 404 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN-RKAEHYFLNGKLAAVRMDE 470 T + D V+ LG + R+ + +VR Sbjct: 405 TCAAITDTKLPDGDAQ-------DSVNILPALLGKKSDTPLREYAIHHSLWGHFSVRQGP 457 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K P + + ++NL DP E+ ++ + H + PL Sbjct: 458 WKMI-----PKRGSGGFTRAREVEPAAGEPTGQLYNLKQDPSETKNVWLEHPEVVKPLSA 512 Query: 531 EMHAYMEI 538 + + Sbjct: 513 ILEQVQKQ 520 >UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 Tax=Alphaproteobacteria RepID=B9KQS8_RHOSK Length = 509 Score = 467 bits (1202), Expect = e-130, Method: Composition-based stats. Identities = 114/542 (21%), Positives = 200/542 (36%), Gaps = 96/542 (17%) Query: 28 PSTATARKGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPN 87 P+ T HPN+ V + + + AQ + +P+ Sbjct: 19 PTDTTPDLSA----HPNRRDVLAGSAGFLAAIAGLSILAQ---------PARAQEVARPH 65 Query: 88 VVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQ 147 ++ L+DD+G+ DVG++G V TP++D +A++G L Y+QP +PTRA ++TG+ Sbjct: 66 ILYILVDDLGYADVGYHGSDV----KTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGR 121 Query: 148 YSIHHGILMPPMY--GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGF 204 Y + +G+ + G+ G LPQ+L + GY T +GKWH+G +++ P+ G Sbjct: 122 YPMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGV 181 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 D F G + H G + Sbjct: 182 DYFYGPLVGEIDHF------------------------------KHEAHGITDWYRDNEM 211 Query: 265 PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG-----SSPAR 319 K + + ++ +++ S P ++Y H KY + R Sbjct: 212 VKEPGYDTELFGADAIRLIEEH-DSATPLYMYLSFTAPHTPYQAPDKYKDLYPDIADEGR 270 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-----------R 368 +Y + M+D + + LE+ G ++TL++F SDNG G Sbjct: 271 KAYAAMISCMDDQVGLVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRN 330 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P R KG+ +EGG RV W G I ++ G++ + D+ PT LA A Sbjct: 331 DPLREGKGTLYEGGTRVVALANWPGRIPAGETHGMMHVVDMLPTLAGLAQAEIAHAGQ-- 388 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 +DG+D S R+ Y + A+R ++K + Sbjct: 389 -----LDGMDVWQAISAGKA-SPREEVVYNIEPTQGALRDGKWKLYW------------- 429 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQI 548 + +F+L DP E+ + + +Q + + PP Sbjct: 430 ------QPILPPKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARSMA--PPLFYA 481 Query: 549 KS 550 + Sbjct: 482 NA 483 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 466 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 119/495 (24%), Positives = 190/495 (38%), Gaps = 66/495 (13%) Query: 64 HPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQ 123 A + A L + K+PNV++ DD G +DV G A TP+ID +A + Sbjct: 6 LSAMVALSCFMAAPLFAQQQKRPNVLIIYTDDQGTLDVNCYG---AKDLHTPNIDRLAKE 62 Query: 124 GLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMY--GQPGGLQGLTTLPQLLHDQG 180 G++ + Y+ P SP+RA++LTG+Y + G G T+ ++ D G Sbjct: 63 GVLFSQFYAAAPVCSPSRASLLTGRYPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGG 122 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVS-DMYTEWRDVHVNPEVALSPDRSEYI 239 Y T IGKWH+G + E+ P GFD GF D Y+ + Sbjct: 123 YTTAHIGKWHIGYSPETMPNQQGFDYSFGFMGGCIDNYSHYF------------------ 164 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 ++ + H + Q+ D K+ DL + FL+K ++DKPFFLY+ Sbjct: 165 ---YWAGPNRHDLWRNGQEIWED--GKFFADLT---VQEVNGFLEKNKRADKPFFLYWAI 216 Query: 300 RGCHFDNYPNAK----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 H+ K Y R Y + M++ + + L++ G +NT++VF S Sbjct: 217 NMPHYPLQGQEKWRQYYKDLPAPRRMYAAAVSTMDEKIGQVLQQLDRLGLAENTIVVFQS 276 Query: 356 DNGPEAE----VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLF 410 D G E P+RGAK S +EGG+RVP + W G + D + D + Sbjct: 277 DQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNEVRDQLCVNIDWY 336 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG----KLAAV 466 PT L + IDG D + S + G AV Sbjct: 337 PTLAGLCKVALPQ--------RKIDGKDIQQVITSSKTSSPHDIFFWQSQGTKENPQWAV 388 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R +K + NL D E+ ++ +H + Sbjct: 389 RQGNWKLLHNPSSAKKAETGPDDLF------------LVNLQQDTSEAKNLAAQHPEIVS 436 Query: 527 PLQTEMHAYMEILKK 541 L+ + ++ + + Sbjct: 437 SLKEQYLKWINEVVQ 451 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 466 bits (1200), Expect = e-129, Method: Composition-based stats. Identities = 102/514 (19%), Positives = 182/514 (35%), Gaps = 86/514 (16%) Query: 68 DKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLIL 127 + + + ++PN+++ L DD+G+ D+G G TP+ID +A G+ Sbjct: 25 TLTSDAQTSTNRPPAPRRPNIILILADDLGYGDLGCYGQTQI---KTPNIDKLAEDGMKF 81 Query: 128 TSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAI 186 TS Y+ +P+RAT++TG+ + H I T+ ++L GY T I Sbjct: 82 TSFYAGSTVCAPSRATLMTGKNTGHVNIRGN---ADLSLNGEELTIAKILKLAGYATGCI 138 Query: 187 GKWHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKW +G E P GFD++ G+ + + P D ++ + Sbjct: 139 GKWGLGNEGSPGLPGRQGFDEYLGYLDQVQAHDYY------PTHLFRSDSKGEESKIALT 192 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA----KSDKPFFLYYGTRG 301 ++D + + + +L + FFLY Sbjct: 193 END---------------ADHKGLYSNDFFTQSALNYLRINKPSKLNKHRSFFLYLPYTL 237 Query: 302 CH--------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 H + + + ++ + L+K+ + Sbjct: 238 PHANNELGNRTGNGMEVPSTEPYTNEQWPQVEKNKAAMITRLDHYVGEIMDYLKKSKLDE 297 Query: 348 NTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SD 401 NT+++F SDNGP E + RG K +EGG+RVP V W ++ SD Sbjct: 298 NTVVIFASDNGPHKEGGVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISD 357 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY--FL 459 + D PTA ++A IDG+ LG + + ++ Sbjct: 358 APLAFWDFLPTAAEIARTSSPTN---------IDGISFLPTLLGKAQTNRHQYLYWEFHE 408 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 G AVRM ++K ++NL TD E D++ Sbjct: 409 QGFDQAVRMGDWKAVRHGIN--------------------GPIELYNLKTDVSEKDNVAD 448 Query: 520 RHIPMGVPLQTEMHAYMEILKKYPPR--AQIKSD 551 ++ + + + ++P + A+IK D Sbjct: 449 KNPEVMAKIADYLKKARTDDPRWPAKTVAEIKED 482 >UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8S3_9PLAN Length = 481 Score = 465 bits (1198), Expect = e-129, Method: Composition-based stats. Identities = 126/495 (25%), Positives = 179/495 (36%), Gaps = 84/495 (16%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 + KPN +V DD+G+ D+ G TP ++ +A++G LT P + Sbjct: 33 AAQATAKPNFIVIFADDLGYGDLECYGHP---RFKTPHLNQMAAEGARLTQFNVPVPYCA 89 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLL-------HDQGYVTQAIGKWH 190 P+RAT+LTG+Y HG+ P G + + + GY T IGKWH Sbjct: 90 PSRATLLTGRYPWRHGVWYNPAPDGQQFRSG-VGIAESELLLSELLKENGYATICIGKWH 148 Query: 191 MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +G + E P GFDD+ G +DM Sbjct: 149 LGHDPEYYPTRHGFDDYLGILYSNDMRP-------------------------------- 176 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 Q P +L +R+ + VKF+ + + PFFLY H + Sbjct: 177 --VNLMQGEKLLEYPVIQANLTKRYTERAVKFIQE--NQEGPFFLYLPHAMPHKPLAASE 232 Query: 311 KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP 370 + S A YGD + E++ ++KTL + +NTL++F SDNGP G Sbjct: 233 AFYKKSGA-GLYGDVIAELDWSVGEIFKTLRELNLDENTLVIFASDNGPWFGGNTAG--- 288 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 G K +TWEGG+RVP W G I PR D + D+FPT L AG P VP Sbjct: 289 LSGMKSTTWEGGLRVPMIARWPGKIPPRQVIDTVCGSIDVFPTILKQAGIP-------VP 341 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 IDG D + +A + L VR +K HV G Sbjct: 342 ADRVIDGKDLFPVLT-KQAPTPHQALYSMKGNSLFTVRSGPWKLHVKPSPRQVLAGKGKN 400 Query: 490 GGFTG-----------------------TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 Q +FNL D E D++ H + Sbjct: 401 WIDPRGPDGITIIAPYEQAMPDQQPGIHNGDQPVPMMLFNLQQDIAEQDNVADEHPEVVA 460 Query: 527 PLQTEMHAYMEILKK 541 L H + Sbjct: 461 RLMKLYHEMQAEVPA 475 >UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R917_9PLAN Length = 486 Score = 464 bits (1195), Expect = e-129, Method: Composition-based stats. Identities = 121/507 (23%), Positives = 185/507 (36%), Gaps = 86/507 (16%) Query: 57 NMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD 116 N+ + A +A + ++PN+V + DD+GW DVGFNG TP+ Sbjct: 2 NLTKLELWAAVLLVAFTAVA--SQAADRQPNIVHIVADDLGWKDVGFNG---CTEIKTPN 56 Query: 117 IDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMY--GQPGGLQGLTTLPQ 174 IDA+A G + Y Q +PTRA ++TG++ +G+ + G +PQ Sbjct: 57 IDALAKGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQ 116 Query: 175 LLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALS 232 L D GY T IGKWH+G +++ P+ GFD G D +T ++ Sbjct: 117 CLGDAGYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGELDYFTHDEHGVLDWFRDNK 176 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP 292 P D VK++ KP Sbjct: 177 P-------------------------------VHEQGYTTTLIGDDAVKYIHGQ-DGKKP 204 Query: 293 FFLYYGTRGCHFDNYPNAKYAGSS-----PARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 F+LY H +Y P R +Y + +++ + L++ G + Sbjct: 205 FYLYLTFNAPHTPYQAPKEYITKYLNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLRE 264 Query: 348 NTLIVFTSDNGPEAEVPPHG-------------RTPFRGAKGSTWEGGVRVPTFVYWKGM 394 NTLI F SDNG + G P+R KGS +EGG RV W G Sbjct: 265 NTLIFFHSDNGGTKDKMFAGQMADMSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPGK 324 Query: 395 IQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 I+ + DG++ DL+PT LAG AK +DG + S R Sbjct: 325 IKAQTVDGMIHAVDLYPTFAALAGASIAKC-------KPLDGTNVWDTIA-EGKPSPRTE 376 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 Y + A +R ++K P + ++NL DP E Sbjct: 377 FFYSIEPFRAGLRQGDWKLIWRTMLPSSVD-------------------LYNLAEDPYEK 417 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++I H +Q + + K Sbjct: 418 NNIAAAHPDKVATMQARIETASKDAAK 444 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 464 bits (1195), Expect = e-129, Method: Composition-based stats. Identities = 121/500 (24%), Positives = 184/500 (36%), Gaps = 72/500 (14%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 +L +EK+ KPN+V+ L DD+G+ D+ G V TP +D +AS+G+ LT Y+ Sbjct: 20 PELQAVEKQQAAKPNIVIILCDDLGYGDLACYGHPVI---KTPHLDQLASEGMRLTDCYA 76 Query: 133 -QPSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWH 190 P SP+RA +LTG+ G+ G P + T+ QLL GY T +GKWH Sbjct: 77 SAPVCSPSRAGLLTGRTPNRLGVYDWIPEGHPMHLKRDEVTVAQLLQQAGYDTAHVGKWH 136 Query: 191 M-GE---NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 G ++ QP + GF + H NP + + Sbjct: 137 CNGMFNSKEQPQPGDHGFRHWF------STQNNALPTHENPNNFVRNGKP---------- 180 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--- 303 + Q D G+++L + +KPFFL+ H Sbjct: 181 -----------------LGEIEGFSCQIVADEGIRWLSDWREKEKPFFLHVCFHEPHERV 223 Query: 304 -FDNYPNAKYAGSS--PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 Y S + Y + M+ L L++ DNTL+ FTSDNGPE Sbjct: 224 ASPPALVETYLDKSLYEDQAQYFANVANMDRAVGKLLIKLDELKVADNTLVFFTSDNGPE 283 Query: 361 A------EVPPHGRTP--FRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFP 411 +P RG K +EGG+RVP V W G I+ + V DL P Sbjct: 284 TLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEIATPVCSVDLLP 343 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN---GKLAAVRM 468 T ++AG VP +DG F G + + A+R Sbjct: 344 TFCEIAGV-------AVPDQRPLDGASLLPLFAGNKIERTTPLFWNYYRAYSTPRVAMRE 396 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTG----TVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 ++K P G + ++NL D E ++ + Sbjct: 397 GDWKVVAHWSGPEGIIPLGGNVNSVSQEIIKNAKLTKFELYNLKDDISEQHNLAWQEQKR 456 Query: 525 GVPLQTEM-HAYMEILKKYP 543 L+ ++ Y + K+ P Sbjct: 457 LDTLKKKLVQKYAAVQKEGP 476 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 464 bits (1194), Expect = e-129, Method: Composition-based stats. Identities = 114/507 (22%), Positives = 184/507 (36%), Gaps = 89/507 (17%) Query: 59 MPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDID 118 + + L +PNVV L+DD+GW D+ G TP++D Sbjct: 15 LGMTAAGTLMATASANLFSRATAQSSRPNVVFILVDDMGWGDLSIYG---RTDYETPNLD 71 Query: 119 AVASQGLILTSAYS-QPSSSPTRATILTGQYSIHH--------GILMPPMYGQPGGLQGL 169 +A QG+ T+AY+ Q +PTR LTG+Y G P G Sbjct: 72 RLARQGVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQ 131 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 T+ LL GY T +GKWH G P GFD++ G S Y ++ Sbjct: 132 PTIASLLKANGYETALVGKWHAGYPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTDRILDL 191 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 + D+ + + + D V+F+ + Sbjct: 192 YEN-----------------------------DVPVQRSGYVTDLFTDRAVEFIQRPHS- 221 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSS----------PARTSYGDCMVEMNDVFANLYKT 339 +PF+L H+ A ++ ++ +Y + ++D + Sbjct: 222 -RPFYLSLHYNAPHWPWQGPNDQASTAFYLTNGYTVGGSQATYAAMVKSLDDGVGRVLDA 280 Query: 340 LEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-R 398 LE +GQ DNTL++FTSDNG E PFRG K S +EGG+RVP + + G+ Q + Sbjct: 281 LEASGQADNTLVIFTSDNGGERFSNF---GPFRGQKASLYEGGIRVPAIIRYPGVTQANQ 337 Query: 399 KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 S+ ++ DL T L G DG + G + +R + Sbjct: 338 VSNQVIITFDLTATILAATG-------TSFHPNYPPDGQNLLPLLRGDRSEFSRTLFWRY 390 Query: 459 L---NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 + AVR ++KY + ++FNL TDP E+ Sbjct: 391 GAALTTRQRAVRSGDWKY----------------------WRRGNQEALFNLATDPGETT 428 Query: 516 SIGVRHIPMGVPLQTEMHAYMEILKKY 542 + + + L+ + + + Y Sbjct: 429 DLKDSNAQVFTRLRNQFQHWELQMLPY 455 >UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2_9BACT Length = 539 Score = 463 bits (1191), Expect = e-128, Method: Composition-based stats. Identities = 115/558 (20%), Positives = 191/558 (34%), Gaps = 96/558 (17%) Query: 35 KGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLD 94 K F D+ N + + + ++P+ + K +KPN++ + D Sbjct: 11 KSFLKTDNENLKPINMISKLTKTLLPITALGCVQGNA------MTPKKQQKPNIIYIMCD 64 Query: 95 DVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHG 153 D+G+ D+G G + TP+ID +A +G+ T AY+ P S+P+RA ++TGQ+S H Sbjct: 65 DMGYGDLGCYGQKYIL---TPNIDRMAKEGMRFTQAYAGAPVSAPSRACLMTGQHSGHTE 121 Query: 154 ILMPPMY------------------GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-N 194 + Y GQ LP+++ D GY T GKW G Sbjct: 122 VRGNKEYWTNSKPVYYGENKDFSVVGQHPYDPNHIILPEIMKDNGYRTGMFGKWAGGYEG 181 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 S P G DDF G+ + + + Y ++ + V Sbjct: 182 SLSTPDKRGVDDFYGYICQFQAHLYYPNFL----------NEYYKERGDTAVKRVVLTEN 231 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA---- 310 D K + + + +L K DKPFF + H + Sbjct: 232 INHPMFGDEYFKRTQYSADLIHQHAMDWL-KAQTKDKPFFGVFTYTLPHAELTQPDDSLV 290 Query: 311 -------------------KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 +Y + + ++ + K L++ G DNTL+ Sbjct: 291 AFYKKQFFTDKTWGGQEGSRYNAVVHTHAQFAAMITRLDSYVGEILKLLDERGLADNTLV 350 Query: 352 VFTSDNGPEAEVPPHGR-----TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVD 405 +FTSDNGP E RG K +EGG+R+P W G I+ S+ Sbjct: 351 IFTSDNGPHEEGGADPSFFNRDGKLRGIKRQCYEGGIRIPFIARWNGHIKAGVESNLPFA 410 Query: 406 LADLFPTALDLAGHPG--AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN--G 461 DL PT ++ G + N + DG+ + + Y+ Sbjct: 411 FYDLMPTFAEMVGVKDYVQRYRNKKKTIDYFDGISILPTLINDGIGQKKYPYLYWEFAET 470 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 AVRM ++K + P+ ++NL D E I H Sbjct: 471 DQTAVRMGDWKLITIHGIPH----------------------LYNLSNDLHEDHDIANEH 508 Query: 522 IPMGVPLQTE-MHAYMEI 538 + + + + Sbjct: 509 PDIVQKMIEIALKEHTNS 526 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 463 bits (1191), Expect = e-128, Method: Composition-based stats. Identities = 101/467 (21%), Positives = 165/467 (35%), Gaps = 72/467 (15%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 + ++PNV++ + DD GW DV + + TP D +ASQG Y P +PTR Sbjct: 26 QAAERPNVLLIMTDDQGWGDVRSHDNPLI---ETPQQDLLASQGARFERFYVSPVCAPTR 82 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 +++LTG+YS+ G+ G TT+ ++ GY T A GKWH G + P Sbjct: 83 SSLLTGRYSLRTGV-HGVTRGFENMRAEETTIAEMFKAAGYKTGAFGKWHNGRHYPMHPN 141 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD+F GF F + H + + + Sbjct: 142 GQGFDEFFGFCGGH-------------------------WNRYFDTNLEHNKQPVKTE-- 174 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS----- 315 + D + F+ + D+PFF Y H KY Sbjct: 175 --------GYITDVLTDRAIDFIKQ--NKDQPFFCYVPYNAPHSPWIVPEKYWDKYANKG 224 Query: 316 --SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 AR +Y + ++D L +TL+ DNT+++F +DNGP + RG Sbjct: 225 LDDKARCAYA-MVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNS---NRYNGNMRG 280 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTT 432 KGS EGG+RVP FV + G I+ I D+ PT L+L Sbjct: 281 RKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTA-------DQ 333 Query: 433 FIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 +DG + + + + + Sbjct: 334 PLDGKSLVPLLTNKSNKDWPQRMLFSDR------------LFRNSIPDDELPNGSVRTDR 381 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + S++++ DP + ++ H + L + + + Sbjct: 382 WRAAYERGKWSLYDMQADPSQKQNVIEAHPAVIKDLSAAYRDWFKDV 428 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 463 bits (1191), Expect = e-128, Method: Composition-based stats. Identities = 117/530 (22%), Positives = 191/530 (36%), Gaps = 95/530 (17%) Query: 58 MMPVMQHPAQDKETQQKLA-ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD 116 ++ TQQ LA + + K +PNVV L DD+G+ D+ G TP+ Sbjct: 8 LLLGSALLVGMASTQQALARQKKAKEQTRPNVVFILADDLGYGDLSCYGQE---KFETPN 64 Query: 117 IDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMP---PMYGQPGGLQGLTTL 172 ID +A G+ T YS S+P+R+ ++TG +S H I GQ + T+ Sbjct: 65 IDRLAQNGMRFTQCYSGTTVSAPSRSCLITGTHSGHTAIRGNKELAPEGQFPLPENSQTI 124 Query: 173 PQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVAL 231 + GY T A GKW +G P G D F G+N ++ + D + + + Sbjct: 125 FNDFRNAGYRTGAFGKWGLGYIGSAGDPYKQGIDQFYGYNCQLLAHSYYPDHLWDNDKRV 184 Query: 232 SPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK-SD 290 + Q + FLD+ AK D Sbjct: 185 DLPDNNLNVQYG-----------------------KGTYSQDLIHSKALAFLDEAAKEKD 221 Query: 291 KPFFLYYGTRGCHFDNYPN-----AKYAGSSP------------------------ARTS 321 +PFF++Y T H + K+ G P + Sbjct: 222 QPFFMWYPTIIPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKGGYCTQFYPHAT 281 Query: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKG 376 + + ++ + + L+ G DNT+I+F+SDNGP E + +RG K Sbjct: 282 FAAMVYRLDVYVGQIVQKLKDMGVYDNTIIIFSSDNGPHMEGGADPDFFNSNGIWRGYKR 341 Query: 377 STWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 +EGG+RVP + W G +QP +D + DL PT ++ N T +D Sbjct: 342 DVYEGGIRVPMIISWPGHVQPSTETDFMCSFWDLMPTFREVL--------NPKADTRNMD 393 Query: 436 GVDQTSFFLGTNGQSNRKAEHYF--LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFT 493 GV GQ + ++ AVR ++K + + Sbjct: 394 GVSILPLLQNRKGQKEHEYLYFEFLEMNGRQAVRKGDWKLVHMNIRGNKPY--------- 444 Query: 494 GTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++NL +DP E ++ ++ L+ M +P Sbjct: 445 --------YELYNLASDPSEKYNVLNQYPEKADELKAIMKEAHIEDSNWP 486 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 462 bits (1189), Expect = e-128, Method: Composition-based stats. Identities = 114/485 (23%), Positives = 183/485 (37%), Gaps = 82/485 (16%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + + + + PNV++ + DD+G+ DVG G TP++DA+A +G T +S Sbjct: 6 NSSSLVSGREKQPPNVILIVADDMGYGDVGCYGNQHI---KTPNLDALAKKGARFTDFHS 62 Query: 133 Q-PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ----GLTTLPQLLHDQGYVTQAIG 187 P +PTRA +LTG Y G+ + P + + T + L GY T +G Sbjct: 63 NGPLCTPTRAALLTGCYQQRVGLHIIPKDQRYAMAKAMSLEEITFAEALKSVGYSTALVG 122 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+ P GFD++ G DM+ + Sbjct: 123 KWHLGDRPAFLPPRQGFDEYFGIPYSHDMHP--------------------------WRK 156 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 + + I ++ P ++ L Q + VKF+ K D+PF LY H + Sbjct: 157 SFPPLPLMRGEEIVELNP-DLDHLTQYCTEEAVKFISK--NKDRPFLLYMPHPMPHQPVH 213 Query: 308 PNAKY------------AGSSPARTS--YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 + ++ G Y + E++ + K + G ++T + F Sbjct: 214 VSERFAKRFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVRALGIEESTFVAF 273 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPT 412 TSDNGP P RG K WEGG RVP YW+ I+P D I DLFPT Sbjct: 274 TSDNGPAI----GSAGPLRGKKRELWEGGHRVPFIAYWQEKIRPGVVIDEIAMSMDLFPT 329 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 + P + IDGV+ + S R ++ + A R +K Sbjct: 330 MAAMGRAPLPR--------KKIDGVNLLPLLCEGDKLSERT--VFWRSKGKKAARKGPWK 379 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 + + T G +++L D E ++ + LQ E Sbjct: 380 LLMQPTKKKRPTSIG----------------LYHLNNDLSEQHNLAEIYPEKLKSLQLEF 423 Query: 533 HAYME 537 A+ + Sbjct: 424 AAWEK 428 >UniRef50_D2R783 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R783_9PLAN Length = 505 Score = 461 bits (1188), Expect = e-128, Method: Composition-based stats. Identities = 116/524 (22%), Positives = 192/524 (36%), Gaps = 63/524 (12%) Query: 54 IADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP 113 + + +M + + + A + +PN+V+ DD+GW D+G P Sbjct: 6 LTFGALWLMLFATNLRGAETESARAKPA---RPNIVILYADDMGWGDLGAQNPD--SKIP 60 Query: 114 TPDIDAVASQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTL 172 TP++D +ASQGL LT A+S +P+R +L G+Y + + Q T+ Sbjct: 61 TPNLDRLASQGLRLTDAHSSSGICTPSRYALLHGRYHWRKFHGIVNSFDQSVMDDERVTM 120 Query: 173 PQLLHDQGYVTQAIGKWHMGENK----------------------------ESQPQNVGF 204 +LL +GY T IGKWH+G + P + GF Sbjct: 121 AELLKTEGYKTACIGKWHLGWDWNAIKRPGAKGGAQGTGFAAEDFDWSKPIPGGPLSHGF 180 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 D + G + P DR + + + A E + + Sbjct: 181 DYYYG----------DDVPNFPPYAWFENDRIVVPPTVRVTTTEPTAEGNWEARPGPAVK 230 Query: 265 PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGD 324 ++ D V +++K K+D+PFFLY+ H P ++ G S A +GD Sbjct: 231 DWDFWNVMPTLTDKAVAWINKQ-KADEPFFLYFPFTSPHAPIVPTKEFTGKSQA-GGFGD 288 Query: 325 CMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-------VPPHGRTPFRGAKGS 377 M + + + + L+K G +NTL++FT+DNGPE P RG K Sbjct: 289 FMTQTDATVGRVLEALDKQGLAENTLVIFTADNGPEHYAYERVRKFEHRSMGPLRGLKRD 348 Query: 378 TWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 WEGG RVP + W + SDG++ DL T + + D Sbjct: 349 LWEGGHRVPMVIRWPKHVPAGKVSDGLMSQIDLLATIATIVDAEIPAGSAD-------DS 401 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 +Q + G S R + N A+R + + +G Sbjct: 402 YNQLPLWTG-TAPSARDTLVHNTNAGGYAIRHGHWVLIDAKSGG-VSKVPAWFDEASGYT 459 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 ++NL D + ++ H L+ + E + Sbjct: 460 ANKQPGELYNLQDDLAQKHNLYADHKEKVDDLKARLQTIREKGQ 503 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 461 bits (1188), Expect = e-128, Method: Composition-based stats. Identities = 122/474 (25%), Positives = 181/474 (38%), Gaps = 85/474 (17%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 KPN+++ L DD+G+ DVG++G PTP+ID +A++G+ ++ YS PTR Sbjct: 16 ATDKPNILIILADDLGYADVGYHGLEEI---PTPNIDRIANEGVQFSAGYSNGSICGPTR 72 Query: 141 ATILTGQYSIHHGILMPPMYGQP------GGLQGLTTLPQLLHDQGYVTQAIGKWHMG-- 192 A +++G Y G + G + + TL Q + GY T GKWH+G Sbjct: 73 AALMSGVYQQRIGCEGICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGE 132 Query: 193 --ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 +K P + GFD+F G + +Y + + Sbjct: 133 RLFDKTLMPTSRGFDEFFGILEGASLYDDTVN---------------------------- 164 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 R + + E V F+ + K DKPFFLY H + Sbjct: 165 --RERKYIRQDTVIDYEGEYFTDAIGREAVSFITR--KGDKPFFLYLPFTAVHAPMQASE 220 Query: 311 KYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 KY P R + + M+D ++ LE G LDNTLIVF SDNG + + Sbjct: 221 KYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKPDNNY 280 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K +EGG+RVP V W G I K+ D V L D+FP+AL+ A K Sbjct: 281 SLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAAQITVPK 340 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 G Q+ A + AVRM ++K Sbjct: 341 DIEAKT---------ILPLMQGKTNQTPHPAMF-WKRAGKMAVRMGDWKL---------- 380 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 S +FNL D ES +I +H + + + + Sbjct: 381 ------------SNAGGPSELFNLKQDISESRNIIDQHPDIANKMNRLWLNWDK 422 >UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSP6_9BACT Length = 512 Score = 461 bits (1187), Expect = e-128, Method: Composition-based stats. Identities = 119/496 (23%), Positives = 185/496 (37%), Gaps = 70/496 (14%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 K+PN+++ DD+G+ DVG++G + TP+ID++A QG+ + Y Sbjct: 14 ATFADKQPNIILIFADDMGYDDVGYHGNKRII---TPNIDSIAEQGVQFSQGYVSASVCG 70 Query: 138 PTRATILTGQYSIHHGILMPPM---------YGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 P+RA +LTG Y G P Y G Q + + + L GY IGK Sbjct: 71 PSRAGLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGK 130 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WHMG + +P G+D F GF + S YTEW + R+E ++ ++ Sbjct: 131 WHMGFDMSLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYI 190 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 G + + L + D V F+D+ +DKPFFLY H Sbjct: 191 DVFKEKGVKVVDEN-------YLTDLFTDEAVNFIDR--NADKPFFLYLAYNAVHHPWQT 241 Query: 309 NAKYAGS------SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 + + M++ + K L++ DNT+I+F SDNG Sbjct: 242 TQHALDKTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQG 301 Query: 363 VP----------------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVD 405 FRG KG T+EGG+RVP + W IQ D + Sbjct: 302 QGIEHSPKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPIS 361 Query: 406 LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAA 465 DL PT + AG K DGVD + + + R ++ A Sbjct: 362 ALDLQPTLVKAAGGNDKKPQKGF----AYDGVDILPYLK-EDKEIKRS--LFWRRDTDYA 414 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 +R ++K +FN+ DP+E ++ +H + Sbjct: 415 IRKGDWKLQWNDAHGPLTIT------------------LFNIKEDPEERSNLIKQHPELA 456 Query: 526 VPLQTEMHAYMEILKK 541 LQ E + + Sbjct: 457 QQLQNEFDTWDNSMPD 472 >UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q6_9PLAN Length = 574 Score = 461 bits (1186), Expect = e-128, Method: Composition-based stats. Identities = 112/468 (23%), Positives = 180/468 (38%), Gaps = 51/468 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 +PNV+V L DD G+ DVGF G + TP +D +A + + LT Y P +PTRA Sbjct: 31 AESRPNVIVILTDDQGYGDVGFRGN---LKINTPHLDRMAEKSIELTRFYCSPVCAPTRA 87 Query: 142 TILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQN 201 ++LTG+ G++ G T+ +LL GY T GKWH+G+N +PQ+ Sbjct: 88 SLLTGRNYYRTGVIHTSRGGAK-MQGEEVTVAELLQQAGYQTGIFGKWHLGDNYPMRPQD 146 Query: 202 VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 GF + S + P S + G Sbjct: 147 QGFAESLIHKSGGIGQS---------------------PDQPNSYFHPKLWKNG------ 179 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY------AGS 315 + + + D + F+D+ K++KPFF+Y T H Y G Sbjct: 180 -VAFQSTGYCTDVFFDAALDFIDRQTKTEKPFFVYLATNAPHTPLEIAESYWKPYQRQGL 238 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 + +++ L LE++ + T+++F DNGP+ + G RG K Sbjct: 239 DETTARVYGMITNLDENIGKLLSHLERSALAEKTVVLFLGDNGPQQKRYTGG---LRGRK 295 Query: 376 GSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 T+EGG+RVP W G + K D I DL PT L L P++ + Sbjct: 296 SWTYEGGIRVPCLAQWPGHFREGEKIDQIAAHIDLMPTLLALT-------ETRCPESLKL 348 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DGVD + G + ++ + ++ L R + ++ G G Sbjct: 349 DGVDLSPLLTGRKEKLPARSLFFQVHRGLTPQRYQNYAVV--TERFKLAGYPGTFGTENL 406 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 + ++L TDP E ++ H L + + +K Sbjct: 407 LLQAEPVLEFYDLSTDPGEQKNVLHSHPETVKALLKQYEDWFSEMKAT 454 >UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAW4_FLAJ1 Length = 539 Score = 461 bits (1186), Expect = e-128, Method: Composition-based stats. Identities = 125/534 (23%), Positives = 194/534 (36%), Gaps = 86/534 (16%) Query: 46 YLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 +L P T + P Q A+ K + + KKPN+++ L DD+G D+ G Sbjct: 24 FLFWPINTDGTLIQPD-QKLAEGKAAFLSQKDTSAASEKKPNIIILLADDLGKYDISLYG 82 Query: 106 GGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQP- 163 G PTP ID++A+ G+ T Y SP+RA +LTG+Y G P P Sbjct: 83 GK---STPTPQIDSLAASGVTFTDGYVSSSICSPSRAGLLTGRYQERFGHEYQPGDRYPK 139 Query: 164 ----------------------------------GGLQGLTTLPQLLHDQGYVTQAIGKW 189 G + T L QGY T IGKW Sbjct: 140 NNLEYYAFKYLLNTNSWRLNPKIEYPNDASIATQGLPKSEITFADLAKKQGYSTAIIGKW 199 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G K P + GFD GF ++ N ++ +++ + + V Sbjct: 200 HLGHTKGFFPLDRGFDYHYGFYQAFSLFAP----EDNNPDIINHHHTDFTDKTIWGNGRV 255 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 + I D L +++ + F+DK +KPF LY H Sbjct: 256 GTGQIRRDSTIIDEKK----YLTEKFAEEAEAFIDK--NKNKPFLLYVPFNAPHTPFQVR 309 Query: 310 AKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP 364 KY + Y + ++D + ++K G +NTLI F SDNG Sbjct: 310 KKYYDRFPNVKDENKRVYFAMISALDDAIGLIRAKVKKEGLEENTLIFFASDNGGADYTY 369 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 P +G K S +EGGV VP + WKG I+P V D+F T + Sbjct: 370 ATTNAPLKGGKFSHFEGGVNVPFALSWKGKIKPHTIYKTPVSSLDIFSTIAAVT------ 423 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAY 483 + +PK DGVD Y+ +G A+R ++K + Sbjct: 424 -HSGLPKDRVYDGVDLVDVVNNNKQA---HQNLYWRSGDAKAIRSGDWKLIISG------ 473 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 +T + ++NL D E+ + ++ LQT + + + Sbjct: 474 --------------KTHETWLYNLAKDKSETTDLASKNPEKVKELQTALQNWEK 513 >UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLE2_9BACT Length = 441 Score = 461 bits (1186), Expect = e-128, Method: Composition-based stats. Identities = 112/479 (23%), Positives = 174/479 (36%), Gaps = 77/479 (16%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-S 132 L + PN+++ L DD G D G + TP ID++A G+ T AY + Sbjct: 9 SLLCTSLLANEPPNIIIILADDAGSSDFSCYGSKQLL---TPHIDSIAHNGIKFTQAYTA 65 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQP--------GGLQGLTTLPQLLHDQGYVTQ 184 SP+RA +LTG+Y G L + + G TL L + GY T Sbjct: 66 SSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLADSLKELGYSTH 125 Query: 185 AIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 IGKWH+GE P GFD+F GF S + Y ++ + + + Sbjct: 126 CIGKWHLGEADHFHPNARGFDNFYGFLSGARTYFLGGELRGDMDRIMRN----------- 174 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 + + + ++ + + + DKPFF+Y H Sbjct: 175 ----------------KEFAEPSSGYTTEVFTQEAIRIIQE--EQDKPFFIYLSHNAVHG 216 Query: 305 DNYPNAK----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 + Y +P R Y M ++D L + L+ + Q +NTLI F SDNG Sbjct: 217 PMDAKDEDIMSYDFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGP 276 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGH 419 P RG KGS +EGG R P + W I SD + D+F T + AG Sbjct: 277 TTHNGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIAYDVFATCIQAAGG 336 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQ 479 + G+D + RK ++ GK ++R ++K ++L Sbjct: 337 E-------LVTDRTYHGIDLLPVINKPQETNARK--LFWSRGKNYSMRQGKWKLNILPTG 387 Query: 480 PYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++NL D E + + + L EM + Sbjct: 388 SS----------------------LYNLENDQSEKHDLSEQFPEIKAQLIKEMSKWKST 424 >UniRef50_P08842 Steryl-sulfatase n=59 Tax=Coelomata RepID=STS_HUMAN Length = 583 Score = 461 bits (1186), Expect = e-128, Method: Composition-based stats. Identities = 124/543 (22%), Positives = 198/543 (36%), Gaps = 87/543 (16%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 L E E +PN+++ + DD+G D G G TP+ID +AS G+ LT + Sbjct: 16 LWEAESHAASRPNIILVMADDLGIGDPGCYGNKTI---RTPNIDRLASGGVKLTQHLAAS 72 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYG-------QPGGLQGLTTLPQLLHDQGYVTQAI 186 P +P+RA +TG+Y + G+ G G T +LL DQGY T I Sbjct: 73 PLCTPSRAAFMTGRYPVRSGMASWSRTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALI 132 Query: 187 GKWHMGE------NKESQPQNVGFDDFRGF---------NSVSDMYTEWRDVHVNPEVAL 231 GKWH+G + P + GF+ F G ++T V + + Sbjct: 133 GKWHLGMSCHSKTDFCHHPLHHGFNYFYGISLTNLRDCKPGEGSVFTTGFKRLVFLPLQI 192 Query: 232 SPDRSEYIKQL----------------------------PFSKDDVHAVRGGEQQAIADI 263 + L F + Sbjct: 193 VGVTLLTLAALNCLGLLHVPLGVFFSLLFLAALILTLFLGFLHYFRPLNCFMMRNYEIIQ 252 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYG 323 P ++L QR +F+ + ++ PF L H + + +AG S YG Sbjct: 253 QPMSYDNLTQRLTVEAAQFIQR--NTETPFLLVLSYLHVHTALFSSKDFAGKSQ-HGVYG 309 Query: 324 DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE-------AEVPPHGRTPFRGAKG 376 D + EM+ + L++ ++TLI FTSD G E+ ++G K Sbjct: 310 DAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGAHVEEVSSKGEIHGGSNGIYKGGKA 369 Query: 377 STWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 + WEGG+RVP + W +IQ + D D+FPT LAG P +P+ ID Sbjct: 370 NNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAGAP-------LPEDRIID 422 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE------FKYHVLIQQPYAYTQSG-- 487 G D G + +S+ + ++ N L AVR +K +G Sbjct: 423 GRDLMPLLEGKSQRSDHEFLFHYCNAYLNAVRWHPQNSTSIWKAFFFTPNFNPVGSNGCF 482 Query: 488 ---YQGGFTGTVMQTAGSSVFNLYTDPQESDSIG-VRHI---PMGVPLQTEMHAYMEILK 540 F V +F++ DP+E + + + +Q + + L Sbjct: 483 ATHVCFCFGSYVTHHDPPLLFDISKDPRERNPLTPASEPRFYEILKVMQEAADRHTQTLP 542 Query: 541 KYP 543 + P Sbjct: 543 EVP 545 >UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7HQ00_PARL1 Length = 553 Score = 460 bits (1185), Expect = e-128, Method: Composition-based stats. Identities = 122/516 (23%), Positives = 199/516 (38%), Gaps = 89/516 (17%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 E + PN+VV L DD+G+ D+ GGG+ PTP+ID++A G TSAYS + Sbjct: 62 AAEPAGNRPPNIVVILADDLGFNDISHFGGGIV---PTPNIDSIARGGANFTSAYSGTAA 118 Query: 136 SSPTRATILTGQYSIHHGILMPPMY---------------------------------GQ 162 +P+RA I+TG+Y G P + Sbjct: 119 CAPSRAMIMTGRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFRE 178 Query: 163 PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRD 222 G TL + L +GY IGKWH+G E P GFD+ E Sbjct: 179 QGLPGSEITLAEALKPKGYHNIHIGKWHLGNAPEFLPNAQGFDESV-MLESGLFLPEDSP 237 Query: 223 VHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKF 282 VN ++ P ++ ++ + + L + D +K Sbjct: 238 DVVNAKLPFDPIDQFLWARMQYATSYNGSAWFEPK-----------GYLTDFYTDEAIKA 286 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLY 337 ++ A ++PFFLY G H + Y +V ++ + Sbjct: 287 IE--ANRNRPFFLYLAHWGVHTPLQASKADYDALSHIEDERLRVYAAMIVALDRSVGRVL 344 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVP-PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 ++L++NG +NTL++F+SDNG + P P+RG K + +EGG+RVP F W I Sbjct: 345 QSLKENGLEENTLVIFSSDNGAPGYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARIP 404 Query: 397 PRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 V D+FPT + AG +P IDG+D + + R Sbjct: 405 AGTERTTPVAHLDMFPTIVAAAGGE-------LPADRVIDGIDLLPYAARGEKPAPRP-- 455 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 ++ +G AV+ D +K + + +FNL TDP E + Sbjct: 456 IFWRDGHYQAVQADGWKL--------------------QMAERPNKTWLFNLKTDPTEQN 495 Query: 516 SIGVRHIPMGVPLQTEMHAYMEILKK--YPPRAQIK 549 ++ + L+ + A+ ++ +P A++ Sbjct: 496 NVADENPEKVAELKALVEAHNATQREPLFPAVAEMP 531 >UniRef50_Q7UG72 Arylsulfatase A [precursor] n=1 Tax=Rhodopirellula baltica RepID=Q7UG72_RHOBA Length = 503 Score = 460 bits (1185), Expect = e-128, Method: Composition-based stats. Identities = 127/501 (25%), Positives = 198/501 (39%), Gaps = 44/501 (8%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 + + PA E G +PN+VV +DD+ + D+G G A G TP++ Sbjct: 5 HLLALCLPAFFTALPASATAPEDIAGSRPNIVVIYMDDMAYADIGPFG---AKGYSTPNL 61 Query: 118 DAVASQGLILTSAYSQPSSSPT----------RATILTGQYSIHHGILMPPMY-GQPGGL 166 D +A++G T R+ +LTG Y G+ + G Sbjct: 62 DRMANEGRKFTDF---------SVSSAVCSASRSALLTGCYHRRVGLSGALGPQAKIGLA 112 Query: 167 QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVN 226 TT ++ GY T GKWH+G + + P N GFD F G +DM+ D Sbjct: 113 PAETTFAEVCKSAGYRTACHGKWHLGHHPKFLPTNQGFDQFYGIPYSNDMWPLHPDTIRR 172 Query: 227 PEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM 286 + P+ LP + + ++ P E + V+F+ Sbjct: 173 QQ--KDPNDPGNWPPLPIIESIAGQP---PRIVNDNVQPADQEQMTVELTRRSVEFIKNQ 227 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 + SDKPF LY H Y + ++ G S A +GD M+E++ + +E Q Sbjct: 228 S-SDKPFLLYLPHPMVHVPLYVSERFRGKSGA-GLFGDVMMEVDWSVGEILSAIESIDQQ 285 Query: 347 DNTLIVFTSDNGPEAEVPPH--GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGI 403 NTL++FTSDNGP H P R KG+ WEGGVR PT ++W I + Sbjct: 286 KNTLVIFTSDNGPWLSYGNHAGSAAPLREGKGTQWEGGVREPTLMWWPETIPAGTTCETF 345 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG-TNGQSNRKAEH-YFLNG 461 D+ PT ++L G + IDG L +S ++ Y+ G Sbjct: 346 CSTIDVLPTIVELTGGEAPE--------RKIDGHSIVDLMLDVPGAKSPHESFVGYYGGG 397 Query: 462 KLAAVRMDEFKYHVLIQ-QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR 520 +L +R + FK + + G G G M +G +++L D E+ ++ Sbjct: 398 QLQTIRNERFKLVFPHAYRTLGDREPGKDGMPDGYAMTKSGLELYDLDADVSETTNVIEA 457 Query: 521 HIPMGVPLQTEMHAYMEILKK 541 H + LQ Y + L Sbjct: 458 HPEVVKQLQAAAEVYRQQLGD 478 >UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW5_9SPHI Length = 523 Score = 460 bits (1184), Expect = e-128, Method: Composition-based stats. Identities = 108/537 (20%), Positives = 178/537 (33%), Gaps = 85/537 (15%) Query: 43 PNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVG 102 + + + M V+ + T + A PN++ DD+G+ ++G Sbjct: 6 AMRQPLLWVSAFLLLMGWVLVSFKPPRTTVSRDAVPRTAVS--PNIIYIYADDLGYAELG 63 Query: 103 FNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMP---- 157 G TP++D +A +G+ T Y+ P +P R +LTG++S H I Sbjct: 64 CYGQQKI---RTPNLDKLAREGIRFTQHYTSMPVCAPARCMLLTGKHSGHSYIRGNYEMG 120 Query: 158 -----PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFN 211 GQ G T+ +LL QGY T +GKW MG N P GFD F G+ Sbjct: 121 GFPDSLEGGQMPLYPGAFTIGRLLQQQGYKTACVGKWGMGMANTTGNPNEQGFDYFYGYL 180 Query: 212 SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDL 271 + + P + + + +A A Sbjct: 181 DQKQAHNYY------PTHLWENGKPDKLNNPVIDVHRRLTPETATPEAFAYFRGND--YA 232 Query: 272 DQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AKYAGS------------ 315 + F+ + PFFLY H +Y G Sbjct: 233 IDKLAQKAQAFIRQ--NKSGPFFLYLPFTAPHVSLQAPEAAVKEYIGKFGDGEQRTERPY 290 Query: 316 ---------SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP-- 364 R +Y + M+ L + L+ +NTL++F+SDNG Sbjct: 291 LGEQGYASTPYPRATYAAMITHMDAQIGQLMQLLKDLKIDENTLVMFSSDNGATFNGGVE 350 Query: 365 ---PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHP 420 + RG K +EGG+R P W G I+P + +D + DL T +L G+ Sbjct: 351 AAYFNSVGKLRGLKMDVYEGGIREPMLARWPGRIKPNQTTDHVSVQYDLLATLAELVGYK 410 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN---GKLAAVRMDEFKYHVLI 477 DG+ LG + + Y+ G A+RM +K Sbjct: 411 RPFAT---------DGISFLPTLLGQSSSQKQHPFLYWEYPEKGGQLAIRMGNWKAVKTN 461 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + +T +++L D E+ +I +H + + Sbjct: 462 VR----------------KDRTTPWELYDLNKDVSETTNIADKHPDIIRQANAIVAR 502 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 459 bits (1183), Expect = e-128, Method: Composition-based stats. Identities = 115/510 (22%), Positives = 181/510 (35%), Gaps = 89/510 (17%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP 134 L L KPN++ L DD+G+ D+G G V TP +D +A+ G+ T YS Sbjct: 9 LFALNLSAADKPNIIYILADDLGYGDLGCYGQKVI---QTPHLDKMAANGMKFTQHYSGS 65 Query: 135 -SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 P+R+ +L G++S + + M L P+ L GY T IGK MG Sbjct: 66 TVCGPSRSCLLEGKHSGNTYVRGNGMLQMRQDPHDLI-FPKALQKAGYHTAMIGKSGMGC 124 Query: 194 NKE--SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 N + + P GFD F GF S + + + + ++ + EY D+ Sbjct: 125 NTDDAALPYQKGFDYFFGFTSHTQAHWFFPTHLWKNDGKVT--KVEYPNNTLHEGDN--- 179 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + M+ + ++++ D PFFL+ + H + Sbjct: 180 ------------------YSSEVVMNEALDYVERQ--KDGPFFLHLAFQIPHASLRAKEE 219 Query: 312 YAGSS----------------------PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 + +T++ + M+ L K LE G +NT Sbjct: 220 WKAKYRPILKEKLLPKKDKHPHYSYEREPKTTFAAMVSYMDHNVGLLNKKLEDLGLAENT 279 Query: 350 LIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGI 403 LI+F SDNG E RG K +EGGVR P YW G I+ SD I Sbjct: 280 LIMFASDNGAMQEGGHKRDSFDSNGVLRGGKRDMYEGGVRTPMIAYWPGKIKAGQTSDHI 339 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY--FLNG 461 D+ PT +LAG + DG+ LG Q+ ++ F G Sbjct: 340 SAFWDISPTVRELAGAKVQEDT---------DGISFVPTLLGKGSQTKHDYLYWEFFEQG 390 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 A+RM ++K + + +F+L D E + + Sbjct: 391 GKRAIRMGKWKLIL----------------YKTNTDLNPKMELFDLEADISEQKDLSKQL 434 Query: 522 IPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 L M + P + S+ Sbjct: 435 PEKVSALLKLMDKAHTPAEN--PTFKFASE 462 >UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN Length = 527 Score = 459 bits (1183), Expect = e-128, Method: Composition-based stats. Identities = 123/524 (23%), Positives = 205/524 (39%), Gaps = 80/524 (15%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 Q A +K PN++ L DD+G+ D+ TP +D +A G+I T A+ Sbjct: 12 SQNTAHASEKAND-PNIIYILADDMGYGDIRALNPE--CKIATPHLDQLAHGGMIFTDAH 68 Query: 132 SQP-SSSPTRATILTGQYSIHHGILMPPMYG--QPGGLQGLTTLPQLLHDQGYVTQAIGK 188 S +PTR +LTG+Y+ + ++G + T+P +L + GY T +GK Sbjct: 69 SSSSVCTPTRYGVLTGRYNWRSRLKSGVLWGLSRRLIEPDRETVPSMLKEHGYYTACVGK 128 Query: 189 WHMGENK-----------------------------ESQPQNVGFDDFRGFNSVSDMYTE 219 WH+G + ++ P +VGFD F G ++ DM Sbjct: 129 WHLGMDWSLKQGGFATEQSYNKKTNPGWDVDYSKPIQNGPNSVGFDYFFGISASLDM-PP 187 Query: 220 WRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYG 279 + + + + + + P KD D+ R D Sbjct: 188 YVYIENDRSQGIPTVTKAFFRDGPAHKDFEAI------------------DVLPRITDKT 229 Query: 280 VKFLDKMA---KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANL 336 V+ +D+ A K KPFF+Y+ H P ++ G S Y D +++++D + Sbjct: 230 VQIIDEHAAASKEGKPFFIYFPLNAPHTPILPTPEWQGKSGINA-YCDFVMQVDDTVGQV 288 Query: 337 YKTLEKNGQLDNTLIVFTSDNGPEAEVP--------PHGRTPFRGAKGSTWEGGVRVPTF 388 + L+K G +NTL++FT+DNG FRG K +EGG RVP Sbjct: 289 MQALKKQGIHENTLVIFTADNGCSPAANFKEMTDKDHQPSYQFRGHKADIYEGGHRVPFI 348 Query: 389 VYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 W I+ SD + L DLF TA D+ G VP D V GT Sbjct: 349 ANWPARIKAGTHSDQLTCLTDLFATAADIVGAK-------VPDDAGEDSVSILPAMEGTA 401 Query: 448 GQSNRKAEHYFLNGKLAAVRMDEFKY-HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 R+A + ++R D +K +++ + G + + +++ Sbjct: 402 HTPLREAAVHHSIRGAFSIRKDHWKLELCPGSGGWSFPKPG-----KDNLSELPAIQLYD 456 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 L D E ++ H + L T + +Y + + P + Q + Sbjct: 457 LNHDAGEQKNVQAEHPEVVKELTTLLQSYADRGRSTPGKPQPNT 500 >UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W8_9PLAN Length = 459 Score = 459 bits (1183), Expect = e-128, Method: Composition-based stats. Identities = 102/489 (20%), Positives = 175/489 (35%), Gaps = 82/489 (16%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A ++ G++PN++ + DD+G+ D+G G + TP ID A+QG T AY+ Sbjct: 19 ASMQAAEGERPNIIFIMADDLGYGDLGCYGQKLM---KTPHIDQFAAQGTRFTQAYAGGS 75 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE- 193 + +RA +LTG ++ H + + T+ ++L GY +GKW +G+ Sbjct: 76 VCTASRAVLLTGLHNGHTPARDNIPHYATYLQESDVTIAEVLQKSGYRCGGVGKWSLGDA 135 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 + N GFD + G+ + + + + + E L + Sbjct: 136 GTVGRATNQGFDMWFGYLNQDHAHYYFTEYLDDNEGRLELKGN----------------- 178 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH---------- 303 T + + ++F+ A +PFFLY H Sbjct: 179 ----------TKNRQQYSHDLLTERALQFIRDSAA--QPFFLYAAYTLPHFSAKAEDPHG 226 Query: 304 --FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 + Y + ++ + + + + TLI+FTSDNG Sbjct: 227 LAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNELQLRERTLIIFTSDNGGHR 286 Query: 362 EVPP--HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAG 418 VP H P RG K EGG+RVP W G I SD ++ D+ PT +LAG Sbjct: 287 GVPAQLHTNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVSDEVIAFQDMLPTFAELAG 346 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN----GKLAAVRMDEFKYH 474 + +DG+ G + + ++ AVR + +K Sbjct: 347 AQVSAN---------LDGISVLPALRGEPRKVKHEYLYWDYGHCRARYDQAVRWNNWKGI 397 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 QQ +++NL D ES + +H + + M+ Sbjct: 398 RHGQQ--------------------GEIALYNLDQDLSESRDVADKHPQVVQRIAEIMNT 437 Query: 535 YMEILKKYP 543 +YP Sbjct: 438 AAVPNPRYP 446 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 459 bits (1183), Expect = e-127, Method: Composition-based stats. Identities = 109/513 (21%), Positives = 169/513 (32%), Gaps = 90/513 (17%) Query: 64 HPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQ 123 A KPN+V+ + DD+G ++G G TP ID +A Sbjct: 4 TAAVFLAVALLAPSGRAADAPKPNIVLIVADDLGCFELGCYGQTKI---KTPHIDKLAQG 60 Query: 124 GLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPM---YGQPGGLQGLTTLPQLLHDQ 179 G T YS P +P+R ++TG++S H + GQ T+ L Sbjct: 61 GAKFTRFYSGSPVCAPSRCVLMTGKHSGHATVRNNVEAKPEGQFPIRAEDVTVADALKAH 120 Query: 180 GYVTQAIGKWHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T A+GKW +G + P GFD F G+N ++ + + + ++ Sbjct: 121 GYATGAMGKWGLGMFDTAGSPLKHGFDLFFGYNCQRHAHSHYPTYIYRNDKRVELKGNDG 180 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 F++ + + + F++ A KPFFLY Sbjct: 181 KTGKQFTQ--------------------------DLFEEEALGFIE--ANKAKPFFLYLP 212 Query: 299 TRGCHFDNYPNAK--------------------YAGSSPARTSYGDCMVEMNDVFANLYK 338 H Y Y + M+ + + Sbjct: 213 FTVPHVAVQVPEDSLNEYKGQLGDDPAYDGKKGYQPHPAPHAGYAAMVTRMDRSVGRVVE 272 Query: 339 TLEKNGQLDNTLIVFTSDNGPEAEVPP------HGRTPFRGAKGSTWEGGVRVPTFVYWK 392 L G NTL++FTSDNGP V + RG KGS +EGG+RVP Y Sbjct: 273 KLNALGLEKNTLVLFTSDNGPTHNVGGADSSFFNSAGKLRGLKGSVYEGGIRVPFIAYQP 332 Query: 393 GMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 G I+ SD + D+ PT AG IDG+ G ++ Sbjct: 333 GTIKAGTESDAPLYFPDVLPTLCAFAGTKAPS---------AIDGISFLPLLKGEKQPTH 383 Query: 452 RKAEHYFLN-GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD 510 F G AV E+K + ++NL D Sbjct: 384 DFLYWEFSGYGGQQAVIEGEWKAVRQALGMGGV-----------------KTELYNLAKD 426 Query: 511 PQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 P E + + ++ + L+ + +P Sbjct: 427 PSEKEDVAAKNPAVLARLEKRLKNEHTPNSNFP 459 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 458 bits (1180), Expect = e-127, Method: Composition-based stats. Identities = 117/539 (21%), Positives = 186/539 (34%), Gaps = 125/539 (23%) Query: 71 TQQKLAELEKKTGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS 129 + L+ EK KP N V L+DD+G+MDVG N TP I+ +A G+ T+ Sbjct: 19 ADRSLSAAEKPKQNKPLNFVFILVDDLGYMDVGCNNPQ--TFYETPHINQLAKTGMRFTN 76 Query: 130 AYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQPG----------GLQGLTTLPQLLHD 178 Y+ P SPTR +I+TG+Y + G TT+ + L + Sbjct: 77 GYAANPVCSPTRYSIMTGKYPTRVDATNFFSGKRAGKFLPAPLNDKMPLSETTIAEALKE 136 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTE--WRDVHVNPEVALSPDRS 236 GY T GKWH+G +E P+ GFD RG Y + + NP + Sbjct: 137 HGYSTFFAGKWHLGPTQEFWPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRLT------ 190 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 E L R +F+D A D+PFF Y Sbjct: 191 ---------------------------DGLKGEHLPDRLASETAQFID--AHRDEPFFAY 221 Query: 297 YGTRGCHFDNYPN-------------------AKYAGSS--------------PARTSYG 323 H ++A Y Sbjct: 222 LAFYSVHTPLMGPGPLVTKYKEKAKRLGLTGKEEFADEEQVFPVDEKRRVRILQNHAVYA 281 Query: 324 DCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEG 381 + M+ + + LE++G +NT+++ T+DNG +E P P RG KG +EG Sbjct: 282 AMVESMDKAVGKVLQQLEESGVAENTVVMLTADNGGLSTSEGSPTSNLPLRGGKGWLYEG 341 Query: 382 GVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G+R + W G +P D V D +PT LDLAG P + +DGV Sbjct: 342 GIREVFLIRWPGGTEPGSVCDEPVITTDFYPTILDLAGLP-------LKPQQHLDGVSLK 394 Query: 441 SFFLGTNGQSNRKAEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 F G ++ + A+R+ ++K Sbjct: 395 PFLQGEAPFKRDALYWHYPHYSNQGGIPGGAIRVGDWKLI-------------------- 434 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI-----LKKYPPRAQI 548 + +++L D E + ++ ++ ++H + + L+ P + Sbjct: 435 ERFEDGQVHLYHLKEDLGEKQDLAEKYPERVAAMRKQLHKWYQETDAKFLQAKPGGPEP 493 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 458 bits (1180), Expect = e-127, Method: Composition-based stats. Identities = 117/538 (21%), Positives = 176/538 (32%), Gaps = 121/538 (22%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSS 137 ++PNVV ++DD GW DVG+NG TP+ID +AS+G+I T Y+ SS Sbjct: 24 AASDNRQPNVVFIMVDDYGWADVGYNGSRF---YETPNIDRLASEGMIFTDGYAAASISS 80 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGG-------------------LQGLTTLPQLLHD 178 P+R +++TG+Y GI Q G T+ + + Sbjct: 81 PSRVSLMTGKYPARTGITDWIPGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKE 140 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T +GKWH E+ PQ GFD G R SP R+ Y Sbjct: 141 HGYATYHVGKWHCAEDSLYYPQYQGFDVNIGGWLKGSP-NGIRRSQGGKGAYCSPYRNPY 199 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + P E L R D +K + + +DKPFFLY Sbjct: 200 LPDGPEG-----------------------EFLTDRLGDESIKLIKNSS-ADKPFFLYLA 235 Query: 299 TRGCHFDNYPNAKY----AGSSPART---------------------------------S 321 H +Y + Sbjct: 236 FYAVHTPIEAKPEYVKYFKWKAQRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAE 295 Query: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTW 379 Y + M++ + + L+ NG NT++ SDNG AE P P R KG + Sbjct: 296 YAALIYSMDENVGRVMQALKDNGLDKNTIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLY 355 Query: 380 EGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 EGG+R P + + M++ V D +PT LD+AG P + +DG Sbjct: 356 EGGIREPFIIKYPQMVEAGSVCHTPVVAVDFYPTLLDMAGLP-------LKSHQHVDGKS 408 Query: 439 QTSFFLGTNGQSNRKAEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGF 492 G ++ + AVRM ++K Sbjct: 409 LLPLLKGDQAYDRGPIFFHYPHYGGKGDTPAGAVRMGDYKLIEFY--------------- 453 Query: 493 TGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 + ++NL D E+ + +Q +H + P Sbjct: 454 -----EDGHVELYNLKNDISETRDLSKTEKDKAAEMQKMLHRWRTDCNAKMPTRNPHY 506 >UniRef50_A4AM21 Arylsulfatase A n=1 Tax=Flavobacteriales bacterium HTCC2170 RepID=A4AM21_9FLAO Length = 535 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 109/511 (21%), Positives = 180/511 (35%), Gaps = 67/511 (13%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPT 139 K K PN+V L DD+G+ D+ TP+ID +A G+ T A+ S +PT Sbjct: 31 KKQKPPNIVYILADDLGYGDISAFNAE--GKIQTPNIDNLAKDGMKFTDAHTSSAVCTPT 88 Query: 140 RATILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK-- 195 R ILTG+Y+ I + G+ TT+ L D GY T IGKWH+G + Sbjct: 89 RYGILTGRYNWRSPIKSGVLTGKSEALIPNSRTTVASFLSDNGYKTGFIGKWHLGWDWAI 148 Query: 196 -------------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVA 230 + P ++GFD G + DM + Sbjct: 149 KDSTNNGGEGWNATDFENLDFTKPVTNTPNDLGFDYAYGHSGSLDMAPYVYVENGMATAK 208 Query: 231 LSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD 290 + + K + + +++ + + F+ + + Sbjct: 209 VDTVTVDKGKYTWW-------------REGPTAADFVHDEVTPNFFRKSMSFIKEQGAEE 255 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 +PFFLY H P ++ G S Y D +V ++D L + LE+ G +NT+ Sbjct: 256 QPFFLYLALPSPHTPILPTEEWQGKSNLN-PYADFVVMIDDYLGQLVEVLEQKGLAENTI 314 Query: 351 IVFTSDNGPEAEV--------PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSD 401 ++FTSDNG + +RG K +EGG R+P V W I+ SD Sbjct: 315 VIFTSDNGCSPQADFKILGDLGHDPSAIYRGHKADIYEGGHRIPFVVKWPSKIESGSVSD 374 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFF-LGTNGQSNRKAEHYFLN 460 + DL T D+ D + R+A + Sbjct: 375 KTICTTDLLATVADILNVDLLDNQGE-------DSFSILPLLDTTDKREFKREATVHHSI 427 Query: 461 GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR 520 A+R +K + + + +++L DP E ++ Sbjct: 428 NGSFALRKANWKMIFCTGSGG----WSDPKPNSEGIEELPKFQLYDLANDPSEQTNLFGH 483 Query: 521 HIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 H + L M Y++ + P + Q + Sbjct: 484 HPDIEGQLSELMLDYIDDGRSTPGKKQTNEE 514 >UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PU94_9SPHI Length = 443 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 120/474 (25%), Positives = 188/474 (39%), Gaps = 61/474 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 LA +PN ++ +DD+G+ DVG NG TP++D +A +G+ ++ YS Sbjct: 15 LAVFNSSAQTQPNFIIIYVDDMGYGDVGINGNP---NIETPNLDRMAMEGMRFSNYYSAS 71 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 P+ + +R +LTG+Y G Q G Q +T+ + L ++GY T GKWH+G Sbjct: 72 PACTASRYALLTGKYPSRAGFRWVLNPTDQIGIHQQESTIAERLKEKGYRTAIYGKWHLG 131 Query: 193 E-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 KE P GFD++ G +DM + + + D E Sbjct: 132 STRKEFLPLANGFDEYVGLPYSNDMIPP---KYPDIALLSGYDTLELNPD---------- 178 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 L + + + + F+ K + +PFF+Y H + + Sbjct: 179 ----------------QSKLTRLYTEKAIAFITK--NAKQPFFIYLPYAMPHTPLHASED 220 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP- 370 + G S R YGD + E++ L L++N T +VFTSDNGP +G + Sbjct: 221 FLGKS-KRGLYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNGPWLIQNQNGGSAG 279 Query: 371 -FRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLV 428 FR KGSTWEGG+R P F++ I + + D+ PT LAG Sbjct: 280 LFRDGKGSTWEGGMREPFFLWGHHTIPKGYVENEVFTALDMLPTITALAGISAG------ 333 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL-AAVRMDEFKYHVLIQQPYAYTQSG 487 IDG + + G R YF AVR +K HV Sbjct: 334 --PNKIDGTNLKPLWSGKKDTKGRDEFFYFGLDHQLMAVRKGPWKLHVKT---------- 381 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +FNL DP E ++ ++ M L T + + + + + Sbjct: 382 -YSQLGLVYFDKQLPLLFNLDHDPSEKYNLASQYPEMVSDLTTLILSKEKEIAE 434 >UniRef50_A3ZMT9 Arylsulfatase n=2 Tax=Planctomycetaceae RepID=A3ZMT9_9PLAN Length = 542 Score = 458 bits (1178), Expect = e-127, Method: Composition-based stats. Identities = 127/565 (22%), Positives = 197/565 (34%), Gaps = 141/565 (24%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPT 139 +PN+++ ++DD+G+ D+G++GG +A TP+IDA+A G+ + Y+ PT Sbjct: 23 ADPSDRPNIILIMVDDMGFSDLGYHGGEIA----TPNIDALAHSGVRFSQFYNNGRCCPT 78 Query: 140 RATILTGQYSIHHGILM--------PPMYGQPG-----GLQGLTTLPQLLHDQGYVTQAI 186 RAT++TG Y GI G+P + T+ + L QGY T Sbjct: 79 RATLMTGLYPHQTGIGHMTESPGEANYGSGKPPTYQGYLNRNCVTIAEALQQQGYATLMS 138 Query: 187 GKWHMGENKES-QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKWH+GEN +S P GF+ + G S + +Y F Sbjct: 139 GKWHLGENDKSRWPLQRGFEKYFGCLSGATLYF-------------------------FP 173 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL-DKMAKSDKPFFLYYGTRGCHF 304 D G +Q A + T + DY ++FL ++ A +P FLY H+ Sbjct: 174 DGDRKMTLGNQQIAEPESTTDQPFYTTDAFTDYAIRFLKEEQAGQQRPMFLYLAYTAPHW 233 Query: 305 DNYPNA----KYAGSSP------------------------------------------- 317 KY G Sbjct: 234 PLQAFEEDIAKYRGKYKIGWDKLREQRLERQKNLGLIAADRQLSPRTPKIPAWDELDAAQ 293 Query: 318 ------ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP------- 364 Y + ++ L K L+++G D+TLI+F SDNG E Sbjct: 294 QDEMDLKMAVYAAMIDRVDQNIGRLMKHLKESGIEDDTLILFLSDNGGCQEGGVLGGAHF 353 Query: 365 -------------------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGI 403 TPFR K EGG P F+ W G I R Sbjct: 354 LDPEQRNRQYFHGYGEAWANASNTPFRLYKHFNHEGGTATPFFMRWPGKIAARDAWCAEP 413 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL 463 L D+ PT LD+AG +DGV G +R+ + Sbjct: 414 AQLIDVMPTILDVAGATYPAKYAE-NAIPPLDGVSLRPTMQGE--PLDRQQPICIEHENN 470 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 A++R ++K +G +Q A ++N+ D E+ ++ V H Sbjct: 471 ASIRAGDWKLV-------------GRGVAAPRGVQPAKWELYNIADDRTETQNLAVEHPE 517 Query: 524 MGVPLQTEMHAYMEILKKYPPRAQI 548 L + +A+ + + YP R Sbjct: 518 KVRELSQQWNAWAKRVGVYPKRQAP 542 >UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6LEC5_PARD8 Length = 483 Score = 458 bits (1178), Expect = e-127, Method: Composition-based stats. Identities = 107/495 (21%), Positives = 180/495 (36%), Gaps = 73/495 (14%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFN-------GGGVAVGNPTPDIDAVASQGLIL 127 + +++ KPN+++ L DD+G+ DV + TP++D +A QG+ Sbjct: 21 SCDAKEEAVPKPNIIILLADDLGYNDVSCYRNENFPQQSDSFPTSQTPNLDLLARQGIRF 80 Query: 128 TSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQA 185 T+ Y SSP+RA ++TG+ G+ P T+ ++L Y T Sbjct: 81 TNFYCGAAVSSPSRAALMTGRNCTRTGVYNYLEQNSPMHLRDSEVTIAEVLKQADYATGH 140 Query: 186 IGKWHM--GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 GKWH+ G + P + GFD +P Sbjct: 141 FGKWHLSSGRPDQPYPNDQGFDYSF---------------------------YALNNSVP 173 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 + + R GE Q + + +++LDK +PFFL H Sbjct: 174 SHHNPTNFFRNGEPQ------GEIEGYSCDIVVTEALQWLDK--NKQEPFFLNVWFNEPH 225 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 F + Y C+ M+ L L++ DNT+++F SDNG Sbjct: 226 FPMEAPEELKKRHAINPEYYGCIENMDIAIGKLMNYLKEQNLEDNTIVIFASDNG---SQ 282 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGHPGA 422 + PFRG K +EGG+RVP V W + D+ PT LA P Sbjct: 283 WDYSNLPFRGEKHFNYEGGLRVPCIVRWHKHVPTGVISEFNGCFTDILPTLASLADAP-- 340 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY---FLNGKLAAVRMDEFKYH-VLIQ 478 VP IDG+D + FLG R+ + +++ + +R ++ Sbjct: 341 -----VPTDRVIDGMDISPVFLGKAETLERENPLFFFRYIHDPICMIREGDWCLLGYDEP 395 Query: 479 QPYAYTQSGYQGGFTGTVMQTAG------------SSVFNLYTDPQESDSIGVRHIPMGV 526 P+A++ G T ++NL D +E + +H + Sbjct: 396 LPWAFSLDELALGKVKPWYLTKEHMEFAKKVFPKYFELYNLRDDREERIDVADKHPEIVA 455 Query: 527 PLQTEMHAYMEILKK 541 L+++M + + Sbjct: 456 RLKSKMLKLKQEVVA 470 >UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT Length = 493 Score = 457 bits (1176), Expect = e-127, Method: Composition-based stats. Identities = 114/511 (22%), Positives = 199/511 (38%), Gaps = 81/511 (15%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T + +E + + PN++ L DD+G+ ++G G TP++D +A+ G+ T Sbjct: 12 TFFSCSTVENQKDQPPNIIYILADDLGYGELGSYGQKKI---KTPNLDRLAADGMRFTQH 68 Query: 131 YS-QPSSSPTRATILTGQYSIHHGILMP---------PMYGQPGGLQGLTTLPQLLHDQG 180 Y+ P +P+R LTG ++ H I GQ + TL ++L G Sbjct: 69 YTGAPVCAPSRYMFLTGNHAGHAYIRGNYELGQFSDEMEGGQMPIPETTPTLAKMLKKAG 128 Query: 181 YVTQAIGKWHMGENKE-SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 Y T IGKW +G N+ P GFD + G+ + + P D+ + + Sbjct: 129 YQTAMIGKWGLGMNETTGSPLLHGFDYYYGYLDQKQAHNYY------PTHLWENDKKDPL 182 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 F VH+ + K E R ++ ++FLD A SDKP+FLYY + Sbjct: 183 NNDYF---LVHSPISSKANQSDFDQFKGQEYAPDRMLEKAIQFLDTTA-SDKPYFLYYPS 238 Query: 300 RGCHF-------------------DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTL 340 H N Y +Y + ++ ++ ++ Sbjct: 239 PIPHVSLQVPDSLVDQYRDVFEEEPYLGNKGYTAHQFPNAAYAAMITHLDSEVGKIWDSV 298 Query: 341 EKNGQLDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI 395 ++ GQ +NTLI+F+SDNGP + RG K +EGG+R+P YWKG I Sbjct: 299 KEKGQEENTLILFSSDNGPTFAGGVDPDFFNSAAGLRGLKMDVYEGGIRIPFIAYWKGKI 358 Query: 396 QPRKSDGIVD-LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 + ++ D+F T +LAG + DG+ LG + Sbjct: 359 KAGSISDLISGHWDMFNTFAELAGQDQSAP----------DGISILPELLGESQNETHDY 408 Query: 455 EH--YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + Y A+R++++K + + + ++NL TD Sbjct: 409 IYFEYPEKRGQIALRIEDWKGVKVEMKTNL----------------DSKWELYNLKTDRN 452 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 E ++ H + ++ + + ++P Sbjct: 453 EVFNVAAEHPEIV----NKIDSLHKTAHRHP 479 >UniRef50_A1WGP9 Sulfatase n=6 Tax=Proteobacteria RepID=A1WGP9_VEREI Length = 470 Score = 457 bits (1176), Expect = e-127, Method: Composition-based stats. Identities = 135/478 (28%), Positives = 220/478 (46%), Gaps = 35/478 (7%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + + PN+V+ + D++GW + G GGG G PTP+IDA+A+QGL L + Sbjct: 21 PHFCKAATMSVSTPNIVLIVADNLGWGEPGCYGGGALRGAPTPNIDALATQGLRLQNFNV 80 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 + PTR+ ++TG++ I G L G P G + TL QLL QGY + GKWH+ Sbjct: 81 ESDCVPTRSALMTGRHPIRTGCLQSVPPGLPQGLTRREITLAQLLSAQGYASAHYGKWHL 140 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G+ P + GFD++ G +D V +P V P + Sbjct: 141 GDVPGRLPSDRGFDEWYGIARTTDESQFTSTVGFDPAVVDLPW-------------IMRG 187 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 G + + +D ++ + F+ + A + +PFFLY HF P+ Sbjct: 188 RSGQPSENLKVYDLDSRRQIDAELVEQSIAFMRRNASTGRPFFLYLPLIHLHFPTLPHPD 247 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TP 370 +AG + A + D MVE++ + + L++ G +N++++F SDNGPE VP G P Sbjct: 248 FAGRTGA-GDFADSMVELDHRVGQVVRALDELGAAENSVLIFCSDNGPEFRVPYRGTAGP 306 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 + G + EG +RVP V W G I R S+ IV + DLF T +AG +P Sbjct: 307 WSGTYHTAMEGSLRVPCIVRWPGHISAARVSNEIVHVTDLFTTLAGVAGA-------RIP 359 Query: 430 KTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 + IDGVDQ FFLG S R+ +++ +L AV+ ++K H ++ Sbjct: 360 QDRPIDGVDQLPFFLGRQSASAREGFPFYIKEELRAVKWRDWKLHFY-----------WE 408 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 + + +FN+ DP+E + + + P+ + A+ + ++P Sbjct: 409 PVVNESKGKLESPYLFNITRDPKEQMDVMAYNTWVRAPMLKLVKAFQDSFVQHPNTRP 466 >UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTV0_9BACE Length = 459 Score = 457 bits (1176), Expect = e-127, Method: Composition-based stats. Identities = 120/496 (24%), Positives = 185/496 (37%), Gaps = 81/496 (16%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 + +E K+PN V+ + DD+G+ DVG G TP+ID +A +G++ T Sbjct: 14 ALAAFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYI---KTPNIDQIAREGMMFTDF 70 Query: 131 YSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP------GGLQGLTTLPQLLHDQGYVT 183 +S SSPTR +LTG+Y G+ + + G T ++L D GY T Sbjct: 71 HSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEVGLPSEEITFAKILGDNGYRT 130 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLP 243 IGKWH+G ++ P N GF F GF S + Y R+ + + + + + Sbjct: 131 ALIGKWHLGYLQKHHPMNFGFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEMKDM----- 185 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 ++ + DKPF LY H Sbjct: 186 ------------------------SGYTTTLLTTLSEDYIKE--NKDKPFCLYIAHAAPH 219 Query: 304 FDNYPNAKYA---------GSSPAR---TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 + A + R Y D + E++ + +TL+K +NT + Sbjct: 220 SPMQGPDEKAVRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFV 279 Query: 352 VFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLF 410 VF SDNGP ++GAKGS WEGG RVP Y G I+ + V DLF Sbjct: 280 VFFSDNGPVINNG-GSAGGYKGAKGSPWEGGHRVPGICYMPGTIKEGTTCEQTVMSFDLF 338 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470 PT LD+A +DG F G N + + K +VR + Sbjct: 339 PTMLDMADIHYDD------SKKKLDGTSLVPLFKGENLAP--RLLFWGNGNKTISVRDGK 390 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K Q+ +F+L DP E +++ + + L Sbjct: 391 WKLVRYNQKGGITLH------------------LFDLNNDPYEKNNLSKQEPELVERLDK 432 Query: 531 EMHAYMEILKKYPPRA 546 E+ + E + P Sbjct: 433 EITRWAESVYSEVPDQ 448 >UniRef50_A5FF56 Sulfatase n=2 Tax=Bacteria RepID=A5FF56_FLAJ1 Length = 524 Score = 456 bits (1175), Expect = e-127, Method: Composition-based stats. Identities = 186/533 (34%), Positives = 279/533 (52%), Gaps = 34/533 (6%) Query: 35 KGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLD 94 Q P + D + P + HP QDKE + KL++L+KK PN+++ L+D Sbjct: 10 LALVAEMSAQQNYFNPTVKVKDYLEPAIPHPDQDKEMKDKLSKLKKK----PNILIILID 65 Query: 95 DVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGI 154 D+G+ D+G GGGVA+G PTP++D +A +GL LTS Y+QP+ +P+RA I+TG+ G+ Sbjct: 66 DMGYGDIGVYGGGVAIGAPTPNMDKLAHEGLQLTSTYAQPTCTPSRAAIMTGRIPARSGL 125 Query: 155 LMPPMYGQPGG---LQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFN 211 P + G+ T ++L GY + GKWH+GE+K S P VG+D++ GF Sbjct: 126 TRPTLTGENPKVNPWASENTTAKILSQNGYKSAISGKWHLGESKGSLPNEVGYDEWLGFG 185 Query: 212 SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDL 271 SV Y ++ + + P++ PDR +K++ G + + + + Sbjct: 186 SVQSEYAQFVNEWIYPDLINKPDRLAAVKKMVDQNILTGVKGGENKVVQPISNIEELSKV 245 Query: 272 DQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMND 331 DQ + +Y F+ + K +KPF+L + H DNY + Y G SPA Y D +VE++D Sbjct: 246 DQVFANYSEDFIKRSVKENKPFYLIHSFSKVHNDNYVSEGYKGKSPAAIPYKDAIVEVDD 305 Query: 332 VFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TPFRGAKGSTWEGGVRVPTFVY 390 + L K L+ DNTL+ TSDNGP +V P G TPFRG KG+TWEGGVRVP Y Sbjct: 306 IVGRLMKLLQDLKIDDNTLVFLTSDNGPNEDVWPDGGYTPFRGGKGTTWEGGVRVPGIAY 365 Query: 391 WKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 WKGMI P SDG+ D+ D+F T+L AG + +P + +IDGVDQ SFFL G Sbjct: 366 WKGMIAPGRISDGLFDICDMFNTSLSAAGV-----LDKIPSSNYIDGVDQLSFFLSDKGV 420 Query: 450 SNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 SNR A + K A+R E+K H+ + A ++ Q M V+N+Y Sbjct: 421 SNRNAVFMYSETKFMAIRWQEYKVHMNVFNTSATRRNLDQSTIQSIGM---SPWVYNIYA 477 Query: 510 DPQESDSIGVRH-------IPMGVPLQTEMHAYMEILKKYPPR----AQIKSD 551 DP+E + H + + A++ +KYP + + ++ Sbjct: 478 DPKEQ--LSQGHRYFEWGIPGV----MGLIAAHLATYQKYPMKDIGLKKPGAE 524 >UniRef50_B4D681 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D681_9BACT Length = 536 Score = 456 bits (1175), Expect = e-127, Method: Composition-based stats. Identities = 111/518 (21%), Positives = 185/518 (35%), Gaps = 68/518 (13%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 L + PN++ L DD+G+ DV TP++D + G+I T A+S Sbjct: 24 ALPRAHAANPNIIYILCDDLGYGDVKCLNAE--GKIATPNMDRLGKAGMIFTDAHSSSAV 81 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPG--GLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 SPTR I+TG+Y+ + + G QG T+ +L + GY T IGKWH+G Sbjct: 82 CSPTRYGIITGRYNWRSPLQSGVLGGLSPRLIEQGRMTVASMLKEHGYATACIGKWHLGM 141 Query: 194 NK----------------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHV 225 + ++ P +VGFD + G ++ DM + Sbjct: 142 DWAKLPGKDVTELSVEKPDQVHNVDYAAPIKNGPNSVGFDYYYGISASLDMVPYTFIEND 201 Query: 226 NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 + V + D+S + S + D+ V ++ + Sbjct: 202 HVTVLPTVDKSFPFTEGRESH---------PTRPGPAAPGFEPRDVLPTLTRKAVDYIGQ 252 Query: 286 MAKS---DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEK 342 KPFFLY H P+A++ G S + Y D ++E + + + LE+ Sbjct: 253 RTNDAQNGKPFFLYLPLNSPHTPIAPSAEWQGKS-GISPYADFVMETDWAIGEVLRVLEE 311 Query: 343 NGQLDNTLIVFTSDNGPEA--------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 G DNT++ SDNG E H FRG K ++GG +P V W Sbjct: 312 KGLADNTIVFMASDNGCSPSADFAELAEKGHHPSYVFRGHKADIFDGGHHIPFLVRWPAK 371 Query: 395 IQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 I+ SD +V L D T D+ G +P D G + + Sbjct: 372 IKAGSTSDQVVCLTDFMATCADVLGIK-------LPDNAAEDSASLLPVLEGKADKPIHE 424 Query: 454 AEHYFLNGKLAAVRMDEFKY-HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 A + A+R +K ++ + + ++++ D Sbjct: 425 AVVHHSVNGSFAIRQGNWKLELCPSSGGWSDPRPKTAA-----ANKLPAVQLYDMSADIG 479 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 E ++ H + L M ++ + P Q Sbjct: 480 ERKNVEAEHTEVVDRLIRLMEKFVADGRTTPGARQQND 517 >UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP3_9BACT Length = 465 Score = 456 bits (1174), Expect = e-126, Method: Composition-based stats. Identities = 114/477 (23%), Positives = 187/477 (39%), Gaps = 74/477 (15%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 LA L KPN++V L DD+G+ DV ++G TP ID++A G + Y + Sbjct: 13 LASLSASAA-KPNIIVILADDLGYGDVSYHG--TLKETTTPHIDSIAQSGAWFQNGYSAA 69 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMY------GQPGGLQGLTTLPQLLHDQGYVTQAIG 187 P P+RA +L+G+Y G + G +P++L +GY T +G Sbjct: 70 PVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVG 129 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH G+ + P N GF +F GFN+ + + +K + D Sbjct: 130 KWHDGDQHKFWPYNRGFQEFYGFNNGAI-------------------NNWVLKGENHTVD 170 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 + AV ++ E + + + V+F+D+ +PFFLY H Sbjct: 171 EWGAVHRENKRVENS-----GEYMTEAFGREAVEFIDRH--KTEPFFLYLSFNAVHGPLQ 223 Query: 308 PNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 Y R + M+D + + L K G +NT+I FTSDNG + + Sbjct: 224 APKSYTNQFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLK 283 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK--SDGIVDLADLFPTALDLAGHP 420 +RG K + ++GG+ VP V WK I + + V DL T AG Sbjct: 284 GNYSFNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAGVE 343 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + +DG + + + +R Y+ N A+R +++KY + Sbjct: 344 -------IKDEYKLDGRNLLPYLKNQSDFDDRN--LYWANNANIAIRDNKWKYLKQAGKT 394 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 Y +FNL DP ES+++ ++ +Q A+ Sbjct: 395 Y----------------------LFNLEEDPYESNNLVSQYPEKAQDMQKRHDAWQA 429 >UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN Length = 455 Score = 456 bits (1173), Expect = e-126, Method: Composition-based stats. Identities = 114/491 (23%), Positives = 171/491 (34%), Gaps = 68/491 (13%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 T +A +PN+V L DD+G DV + G + TP +DA+A+ G L Sbjct: 12 ALTLASVATTFATDAPRPNIVFLLADDLGGADVSWRGSPI----KTPQLDALANSGAKLE 67 Query: 129 SAYSQPSSSPTRATILTGQYSIHHGILMPP--MYGQPGGLQGLTTLPQLLHDQGYVTQAI 186 Y QP SPTR+ +LTG+Y + +G+ + + G TL + L D GY T + Sbjct: 68 QFYVQPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQDAGYETAIV 127 Query: 187 GKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKWH+G + P GFD G + + Y Sbjct: 128 GKWHLGHVSPAYLPMARGFDHQYGHYNGALDYF--------------------------- 160 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 H GG D + V+ + KP FLY H Sbjct: 161 ---THDRDGGHDWHKDDHVNRDEGYATHLIAQEAVRVIQD-RDKKKPLFLYVPFNAVHSP 216 Query: 306 NYPNAKYA----GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA 361 YA R +Y + +++ + +++ LDNTL +F+SDNG Sbjct: 217 LQVPESYAAPYGDMKKRRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPE 276 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHP 420 P RG K + +EGGVRV F WKG I P K + + + D +PT ++LAG Sbjct: 277 PGKLTDNGPLRGGKHTLYEGGVRVCAFASWKGRIAPGSKVEAPLHIVDWYPTLIELAGGS 336 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + +DG + T S + A+R+ ++K V Sbjct: 337 LQQA-------KPLDGRNIWPSIT-TGEPSPHDVIVCNITPTEGAIRVGDWKLVVHNIG- 387 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 +FNL D E + + M L+ Sbjct: 388 ----------------KPREKVELFNLSDDLAEQQNRATTNAKMLRKLRNRFDQLASEAA 431 Query: 541 KYPPRAQIKSD 551 D Sbjct: 432 PAKNAGPQPKD 442 >UniRef50_UPI00016C41FE sulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C41FE Length = 499 Score = 456 bits (1173), Expect = e-126, Method: Composition-based stats. Identities = 128/514 (24%), Positives = 192/514 (37%), Gaps = 83/514 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 K PN+V L DDVG+ D+G G TP++D +A QG LT A+S +PTR Sbjct: 21 DPKPPNIVFILADDVGYGDLGCYGST---KVRTPNLDTLAKQGTRLTDAHSPAAVCTPTR 77 Query: 141 ATILTGQYSIHH--GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 +LTGQY+ H G + T+P L GY T A+GKWH+G ++ Sbjct: 78 YALLTGQYAWRHAPGSRILSGVAPLSIKPDTLTVPAFLKQNGYTTAAVGKWHLGLGEKET 137 Query: 199 -------PQNV--GFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 P GFD + D R V+ +P+ ++ ++ + P Sbjct: 138 DYNGEIKPGAREVGFDYSFLIPATGDRTPCVFVENGRVVNYDPKDPITVSYTKKVGTEPT 197 Query: 245 SKDDVHAVRGGEQQAIADIT-----------------PKYMEDLDQRWMDYGVKFLDKMA 287 K++ + + D+T ED+ V+F+ K Sbjct: 198 GKENPELLTVQKPSLGHDMTIVNGISRIGWMSGGKAARWKDEDIADDITKKAVEFIGKA- 256 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 DKPFFLY+ T H P+ ++ G S GDC+ E++ + L++ D Sbjct: 257 -KDKPFFLYFATHDAHVPRVPHPRFKGKS-GHGLRGDCIEELDWCVGEIVAALDRYKLTD 314 Query: 348 NTLIVFTSDNGPEAE-----------VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 NTL+VFTSDNG + RG KG +EGG RVP W + Sbjct: 315 NTLVVFTSDNGGVMDDGYIDGTATDTSGHKCNGALRGFKGGLYEGGHRVPFIAKWPVHVA 374 Query: 397 PR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFL-GTNGQSNRKA 454 SDG+V DL T + G L+P D VD + R Sbjct: 375 AGKVSDGLVCHVDLLRTCAAILG-------KLLPSGAGPDSVDIFPTLTADRPTKPCRST 427 Query: 455 EHYFLNGKLA-AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 + A A+R E+K + G +FNL DP E Sbjct: 428 LIHQSGNPNALAIRKGEWKLI------------------PNEGKKKVGPELFNLAADPTE 469 Query: 514 SDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 ++ + L + + +++ P Sbjct: 470 QKNLAADKPEVVKELAALL----KSVQENPTSRP 499 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 455 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 121/509 (23%), Positives = 192/509 (37%), Gaps = 62/509 (12%) Query: 59 MPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDID 118 + ++ E+ K+ +PN++V L DD+G+ D+ G V P+ID Sbjct: 8 LIIVISILFTNESLAAEPTASVKSPARPNIMVVLCDDLGYGDLACYGHPVIQS---PNID 64 Query: 119 AVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQPGG-LQGLTTLPQLL 176 A +GL LTS Y+ P+ SP+RA ++TG+ GI P + T+ LL Sbjct: 65 RFAKEGLKLTSCYAAHPNCSPSRAGLMTGRTPFRVGIYNWIPMLSPMHVRKREITIATLL 124 Query: 177 HDQGYVTQAIGKWHM-GENK---ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 GY T +GKWH+ G + QP + GFD + H NP + Sbjct: 125 RQAGYATCHVGKWHLNGMFNMVGQPQPSDHGFDHWF------STQNNALPTHENPFNFVR 178 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP 292 R Q D ++L ++ +KP Sbjct: 179 NARP---------------------------VGPLQGFASQLVADEAEEWLTQLRDKEKP 211 Query: 293 FFLYYGTRGCHFDNYPNAKYAGSSPA-----RTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 FF++ H ++ A ++ + +M+D F + KTL+ + Sbjct: 212 FFMFVCFHEPHEPIASAERFRKLYTAPEGSTLPAHHGNVTQMDDAFGRILKTLDDQKLRE 271 Query: 348 NTLIVFTSDNGPE--AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIV 404 NTLI+FTSDNGP P P R KG+T+EGG+RVP V W +QP SD V Sbjct: 272 NTLIIFTSDNGPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQPGTTSDVPV 331 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL---NG 461 D+ PT +A P P +DG + G + F N Sbjct: 332 CGVDILPTLCAVADIPA-------PTDRVLDGTNILPLLEGKPILRKKPLYWQFNRAKND 384 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG--TVMQTAGSSVFNLYTDPQESDSIGV 519 A+R E+K + P G + G ++++ +D E+ Sbjct: 385 AKVALRDGEWKLLAKLNVPSPKPSGGITTEEIDAVKNAKLEGFELYHIQSDIAETTDRAE 444 Query: 520 RHIPMGVPLQTEMHAYMEILKKYPPRAQI 548 + ++ +M A + ++ PR Sbjct: 445 SEQEILKKMKQQMQAIFDEVQAEAPRWPA 473 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 118/492 (23%), Positives = 175/492 (35%), Gaps = 82/492 (16%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSS 137 E T KPNV+ L DD+G D+G G TP+ID +A+ G+ Y+ + Sbjct: 20 EPATPAKPNVIFILADDLGIGDLGCYGQQKI---RTPNIDHLAADGMRFLQHYTGCSVCA 76 Query: 138 PTRATILTGQYSIHHGILMPP-----MYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 P+R ++TG++ H I GQ Q T+ +L+ + GY T IGKW +G Sbjct: 77 PSRCALMTGRHMGHAAIRDNAQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLG 136 Query: 193 ENKESQ-PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 ++ P+++GF+ G+ S +T + P + E + P Sbjct: 137 MPEDHSSPRDMGFNYSFGYLCQSMAHTYY------PPYLWRNNERETLAGNPS------- 183 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + I PK +KF+ DKPFFLY H Sbjct: 184 ---YDVSMKGVIEPKGEIYSHDVMASDALKFVRDHH--DKPFFLYLAFTIPHLSLQVPED 238 Query: 312 -------------------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 YA + R +Y + M+ L L++ G DNTL+ Sbjct: 239 SMSEYHGQWTETPFRNTKHYANNETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVF 298 Query: 353 FTSDNG------PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVD 405 F+SDNG V FRG K +EGG+R P W G I+ +D Sbjct: 299 FSSDNGAVFPLAGTDPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASV 358 Query: 406 LADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN---GK 462 D PT +L G P DG+ LG Q + Y+ G Sbjct: 359 FYDFLPTMAELNGVPPPADT---------DGLSYLPTLLGKPAQQKQHDFLYWEYQSAGG 409 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 AVRM ++K A V+NL +D ES + H Sbjct: 410 AVAVRMGDWKAIAN----------------KIKKNPNANFEVYNLASDRTESHDVAAEHP 453 Query: 523 PMGVPLQTEMHA 534 + + + Sbjct: 454 EIVAKAREIIAR 465 >UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica RepID=Q7UYW3_RHOBA Length = 520 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 116/497 (23%), Positives = 176/497 (35%), Gaps = 87/497 (17%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 PN+VV L DD+G+ D+G G TP++D +A G++ + AY Sbjct: 47 ASRAAESTPPNIVVILADDMGYGDMGCMGSQTL---QTPNLDRLAESGVLCSQAYVASAV 103 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQP---------GGLQGLTTLPQLLHDQGYVTQAI 186 SP+RA +LT + G G TL L GY T I Sbjct: 104 CSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAGYATALI 163 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 GKWH+G + P GFD F G + S Y Sbjct: 164 GKWHLGMGEMHHPNRRGFDHFCGMLTGSHHYFPA-------------------------- 197 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM--AKSDKPFFLYYGTRGCHF 304 ++ ++ + E L + D G++F+D+ A D+P+F+++ H Sbjct: 198 ----TMKHVIERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT 253 Query: 305 DNYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 + + R +Y M ++ + + LE+ GQ +NTL+VF SDNG Sbjct: 254 PMHATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGG 313 Query: 360 EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAG 418 P RG KGS EGG+RVP W DG+V DL PT AG Sbjct: 314 ATNNGSW-NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAAG 372 Query: 419 HPGAKVANLVPKTTFI------------DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAV 466 +A+ + DG+D + NR+ Y+ AA+ Sbjct: 373 AEPLALADPMSHEDASNRKRMNRLSGTHDGIDMAPHLADGSEPPNRR--LYWRLQGQAAI 430 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 K +P +F + TD ES + ++ Sbjct: 431 LDGTDKLLRPSHRPA---------------------ELFEVSTDVSESHDLSAQNPSRFR 469 Query: 527 PLQTEMHAYMEILKKYP 543 L E+ A+ +L P Sbjct: 470 ELYDELGAWESMLTTVP 486 >UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAY0_9PLAN Length = 466 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 112/510 (21%), Positives = 194/510 (38%), Gaps = 93/510 (18%) Query: 70 ETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTS 129 + EK K+PN+++ D++G+ D+G G V TP +D +AS+G+ LT Sbjct: 19 SVPAPVTAAEKPENKRPNILLITADNLGYGDLGCYGNPVM---KTPMLDQLASEGVRLTD 75 Query: 130 AY-SQPSSSPTRATILTGQYSIHHGILMPPMYGQ---PGGLQGLTTLPQLLHDQGYVTQA 185 Y + P+ + +RAT+LTG+Y G+ + G + +P+ L QGY T Sbjct: 76 FYTASPTCTVSRATLLTGRYPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTAC 135 Query: 186 IGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKW++G + S+P GFD+F GF + + Y Sbjct: 136 FGKWNVGFSPGSRPTERGFDEFFGFAAGNIDYYHHYYAGR-------------------- 175 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 H + G ++ + + D +++ A+SD+PFF+Y HF Sbjct: 176 ----HDLWRGLKEVFVE------GYSTDLFADAACQYI--SAESDQPFFIYLPFNAPHFP 223 Query: 306 ------------NYPNA---KYAGSSP----ARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 + G P + Y + ++ + K L+ +G Sbjct: 224 SQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVTALDSAIGRVLKQLDTSGLR 283 Query: 347 DNTLIVFTSDNGP----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG 402 D T++++ SDNG E + P R + WEGG+RVP + + G ++ + Sbjct: 284 DQTIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGIRVPAIIRYPGHLKAGTVNQ 343 Query: 403 -IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG 461 + D+ PT + LAG P +P +DG D R + N Sbjct: 344 SPLISLDILPTLITLAGGP-------LPAERILDGQDMLPALAAQTAPEPRTFFFQYRN- 395 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 +AVR ++K + +F+L D E+ + R+ Sbjct: 396 -FSAVRRGKYKLVR--------------------IKPNQPFMLFDLEQDLSETTDLAERN 434 Query: 522 IPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 + LQ + + + R + KSD Sbjct: 435 PKVLNQLQQAYADWEREVAENEERRR-KSD 463 >UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKD8_9BACT Length = 455 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 116/483 (24%), Positives = 176/483 (36%), Gaps = 80/483 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSSSPTR 140 +KPN+++ L DD+G+ D+GF G TP IDA+A G+ T Y S P+R Sbjct: 18 AAQKPNIILILADDLGYEDLGFLGAPDI---KTPHIDALARSGMNFTQGYQSASVCGPSR 74 Query: 141 ATILTGQYSIHHGILMPPMY--------GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 A +LTG+Y G P G + LL Y T IGKWHMG Sbjct: 75 AGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMG 134 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + E +P D + GF + + Y E + + R+ Sbjct: 135 LSHEQRPTQRSVDYYYGFLNGAHSYREAKMDMKGAPMTWPIFRN---------------- 178 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 + + + + D GV F+ + DKPFFLY H K Sbjct: 179 ---------NEPVPFSGYTTEVFNDEGVNFIKR--NKDKPFFLYMSYNSVHGPWEAQPKD 227 Query: 313 AGSSPA-----RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-- 365 S R Y ++ M+D L +TL+ G +NTL++F SDNG + Sbjct: 228 LQRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAE 287 Query: 366 ------HGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTALDLAG 418 RG KG T+EGG+RVP + W +I + V D+ PT + + Sbjct: 288 RAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHI-- 345 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ 478 + P + GV+ + G K Y+ A+R ++K Sbjct: 346 ------SQAAPAKKELSGVNLMPYITGEKTSRPHKTL-YWRRDDDYAIRDKDWKL----- 393 Query: 479 QPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 +FNL DP E +++ +H + LQ + + Sbjct: 394 -------------TWNDYNGPRTPMLFNLKDDPNEKNNLIHKHPEIAQKLQAKFDQWDSK 440 Query: 539 LKK 541 L Sbjct: 441 LPD 443 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 110/531 (20%), Positives = 185/531 (34%), Gaps = 115/531 (21%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 +K + K+P++++FL+DD+GW D+ G TP +D +A +G T AY+ P Sbjct: 232 PKKASSKRPHIILFLIDDLGWNDIACYGSQF---YETPHLDKMAKEGFRFTDAYAANPVC 288 Query: 137 SPTRATILTGQYSIHHGILM--------------PPMYGQPGGLQGLTTLPQLLHDQGYV 182 SPTRA+IL G+Y G+ P+ + TL + L + GY Sbjct: 289 SPTRASILLGKYPSRVGLSNHSGSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYK 348 Query: 183 TQAIGKWHMGENKE----SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 T IGKWH+ + + P+ GFD + + + + Sbjct: 349 TAHIGKWHLQAHHDTSRNHFPEKHGFDLNIAGHRMGQPGSFYFPY--------------- 393 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + + + + L + D + ++ + D PFFL + Sbjct: 394 -----------KSKQHPSTNVPDMADGQEGDYLTDKLTDKAIHYIKE--NKDTPFFLNFW 440 Query: 299 TRGCHFDNYP----NAKYAGS----------------------SPARTSYGDCMVEMNDV 332 H P KY S SY + M++ Sbjct: 441 YYTVHTPIIPRQDLKKKYEAKANELGINKNQPGIPVLKSFARSSQNNPSYAAMVEAMDEN 500 Query: 333 FANLYKTLEKNGQLDNTLIVFTSDNGP----EAEVPPHGRTPFRGAKGSTWEGGVRVPTF 388 ++KTL++ D T+I+F SDNG P + P + K +EGG+R+P Sbjct: 501 IGRIFKTLKELQIDDETIIIFCSDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFI 560 Query: 389 VYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 + W G ++ V D++PT LD+ P +DGV TS G Sbjct: 561 IKWPGKKGGKELQAPVCTTDIYPTLLDMLKLPAK-------PEQHLDGVSLTSLMNGQAK 613 Query: 449 QSNRKAEHYFL--------NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 + R+A G AVRM ++K +T Sbjct: 614 ELQREALFIHYPHYHHINSMGPAGAVRMGDYKLVEYY--------------------ETG 653 Query: 501 GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 ++NL D E +++ + ++ + + P D Sbjct: 654 EFELYNLKEDIGEMNNLVKEQPERAAQMLKKLEQWRQQSNSPKPERNPHYD 704 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 116/508 (22%), Positives = 181/508 (35%), Gaps = 91/508 (17%) Query: 73 QKLAELEKKTGKKP-NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 + + +KP NVV FL+DD+GW D+G G +P+ID +A++G+ T Y Sbjct: 20 SSITQPTHAADEKPWNVVFFLVDDLGWTDLGCYGSDF---YQSPNIDQLAAEGMKFTQNY 76 Query: 132 SQP-SSSPTRATILTGQYSIHHGILMP--------------PMYGQPGGLQGLTTLPQLL 176 S + SPTR +LTG Y + P + Q TTLP+ L Sbjct: 77 SACNACSPTRGALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEAL 136 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T +GKWH+G + + PQ+ GFD + + + S Sbjct: 137 RTAGYQTFHVGKWHLG-GRGNLPQDHGFDVNISGTNRGLPRSYHFPYGGDAMKWDS---- 191 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + L R D V + + DKPFFLY Sbjct: 192 -----------------------SLTEAERQDRYLTDRMADEAVALIRQQ--QDKPFFLY 226 Query: 297 YGTRGCHFDNYPNA----KYAG----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 H KY G Y + +++ + L+++G D Sbjct: 227 CSFYSVHSPIQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADR 286 Query: 349 TLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLA 407 TLIVFTSDNG + P RG KG WEGG RVP V W G+ + Sbjct: 287 TLIVFTSDNGGVRRKTSN-NDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITM 345 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG------ 461 D +PT L++ G A +DG+ NR+A ++ Sbjct: 346 DFYPTILNITGV-----AGNTEHNQSVDGLSLVPLLKDPAATLNREALYWHYPHYNVFIG 400 Query: 462 -KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVR 520 +A+R+ E+K + ++NL D E+ + Sbjct: 401 VPYSAIRVGEYKLIHYY--------------------EDGNDELYNLAEDLSETSDVSKT 440 Query: 521 HIPMGVPLQTEMHAYMEILKKYPPRAQI 548 + + L+ + +++ + P + Sbjct: 441 YPELTARLERRLQQHLKQVGAQMPVSNP 468 >UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA9_RHOBA Length = 474 Score = 454 bits (1169), Expect = e-126, Method: Composition-based stats. Identities = 114/486 (23%), Positives = 174/486 (35%), Gaps = 67/486 (13%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 E PNV++ + DD GW DVGFNG V TP++DA+AS G+ Y+ P Sbjct: 25 AETTDTNSPNVILLMSDDQGWGDVGFNGNEVV---QTPNLDAMASAGVRFDRFYAAAPLC 81 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 SPTR + LTG+Y GIL G G T+ ++L +GY T GKWH+G K Sbjct: 82 SPTRGSCLTGRYPFRFGILAAHTGG---MRVGEITIAEMLQKRGYATGMFGKWHIGWVKP 138 Query: 197 ---------SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 S P + GFD++ S + P + P+ + Sbjct: 139 DEVSTRGFYSPPSHHGFDEYFATTSAVPTWDPTITPQDWDSWGNGPGEP-WKGGFPYVHN 197 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 A D + MD + F++ A KPFF H Sbjct: 198 GREAKEN------------LSGDDSRVIMDRVIPFIE--ANQAKPFFATVWFHAPHEPVV 243 Query: 308 PNAKYAGSSPA----RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE--- 360 ++ P R +Y C+ M+ L L + G NT++ F SDNGP Sbjct: 244 AGEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKLRELGIEKNTVVFFCSDNGPSDGL 303 Query: 361 AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGI-VDLADLFPTALDLAGH 419 A+ PF+G K + +EGG+ VP W G I S + D PT + G Sbjct: 304 AKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPAGTSTEVRCSTVDFLPTVASIVGD 363 Query: 420 PGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL----AAVRMDEFKYHV 475 + T IDG+D G +R + ++ ++K Sbjct: 364 SMVQK-----ATRPIDGIDLMPLIRGEAKDRDRDLFFGYRRLYQGIDGQSIISGDWKLL- 417 Query: 476 LIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAY 535 + +++L DP E+ + L+ ++ Sbjct: 418 ------------------QEAKKNGRLRLYDLSKDPFETQDLSEEMPEQTEQLRKQLEEL 459 Query: 536 MEILKK 541 ++ Sbjct: 460 QASCQR 465 >UniRef50_A6C8R8 Arylsulfatase A n=2 Tax=Planctomycetaceae RepID=A6C8R8_9PLAN Length = 510 Score = 453 bits (1167), Expect = e-126, Method: Composition-based stats. Identities = 118/500 (23%), Positives = 192/500 (38%), Gaps = 59/500 (11%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 T + KKPN +V D++G+ D+ G V N TP ++ +A +G T Sbjct: 33 TPALQSASAAPQQKKPNFIVIFCDNLGYGDIEPFGSTV---NRTPCLNRMAREGRKFTHY 89 Query: 131 -YSQPSSSPTRATILTGQYSIHHGILMPPMYGQ-------PGGLQGLTTLPQLLHDQGYV 182 + +P+RA+I+TG YS G+ P GQ G T+ ++L QGY Sbjct: 90 CVTAGVCTPSRASIMTGCYSQRVGMHWNPRDGQVLRPISPYGLNPDEITVAEVLKKQGYK 149 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 T IGKWH+G+ P GFD F G DM + + L Sbjct: 150 TGMIGKWHLGDQTPFLPTRQGFDYFYGIPYSDDM------TQAVGQRLGDRLDGKNWPPL 203 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 P +D L + + + V+F++K ++PFFLY+ Sbjct: 204 PVMLNDTVI-----------EAGVDRNLLTKDYTEKAVEFIEK--NKNQPFFLYFPQAMP 250 Query: 303 HF---DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 + + + G S +GD + E++ + L + G NTL+++TSDNG Sbjct: 251 GSTRKP-FASDAFRGKS-KNGPWGDSIEELDWSTGQILDKLVELGIDKNTLVIWTSDNGS 308 Query: 360 EAEVPPHG-----RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTA 413 + P G +T EG RVPT V+W + + + DL PT Sbjct: 309 PMAKDMNSTERGTNKPLNGRGYTTSEGAFRVPTIVWWPETVPAGTVCEELATTMDLLPTF 368 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT-NGQSNRKAEHYFLNGKLAAVRMDEFK 472 LAG VP IDG D +G + ++ +Y+ +L AVR +K Sbjct: 369 ARLAGGK-------VPSDRIIDGHDIRPLIMGEADAKTPYDGFYYYAMEQLQAVRKGPWK 421 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 V +++ + + + +FN+ TD ++ +H + L + Sbjct: 422 LFVPLKEFSRHPHF--------KKGEGSRPLLFNVVTDISSEHNVADQHPEIVKELMSLA 473 Query: 533 HAYMEILKK--YPPRAQIKS 550 L +P Q + Sbjct: 474 EKARADLGDTNHPGANQRPA 493 >UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 Length = 491 Score = 453 bits (1166), Expect = e-126, Method: Composition-based stats. Identities = 133/540 (24%), Positives = 200/540 (37%), Gaps = 98/540 (18%) Query: 24 AADTPSTATARKGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTG 83 DTP++A + P++ V + + + Sbjct: 1 MTDTPASADDDR-----SQPSRRDVLAGGAGFLAAVAGLSLLSSGA---------RAADA 46 Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 +P++V L DD+G+ DVGF+G + TP++D +A+QG L Y+QP +PTRA Sbjct: 47 PRPHIVYILADDLGFADVGFHGSDI----KTPNLDHLAAQGARLGQFYTQPFCTPTRAAF 102 Query: 144 LTGQYSIHHGIL--MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQ 200 LTG+Y +H+G+ P + G LPQ L D GY T +GKWH+G +++ P+ Sbjct: 103 LTGRYPLHYGLQVGAIPSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPR 162 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD F G + H G Sbjct: 163 QRGFDSFYGPLVGEIDHF------------------------------KHEAHGVTDWYH 192 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS---- 316 + K + + V+ + P FLY H Y Sbjct: 193 DNTQVKEEGYDTELFGKEAVRLIA-AHDPKTPLFLYLAFTAPHTPFQAPQSYLDQYAHIA 251 Query: 317 -PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE-----------AEVP 364 P R +Y + M+D ++ L G +NTLIVF SDNG A Sbjct: 252 APQRRAYAAMITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDL 311 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 P P+R KGS +EGG RV W G I P ++G++ + D+ PT LAG A Sbjct: 312 PASNAPYRDGKGSLYEGGTRVVALANWPGRIAPGAAEGVMHVVDMLPTLAKLAGASLA-- 369 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYT 484 K+ +DGVD GQ+ R Y + AVR +K + P Sbjct: 370 -----KSKPLDGVDVWPAL--AAGQAGRAGIVYNVEPTQGAVRDGRWKLVWRVVLPPTA- 421 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 +F++ DP E+ + +H LQ ++ A + PP Sbjct: 422 ------------------ELFDVEADPSETTDVSAQHPEKVAELQGKVVALARTMA--PP 461 >UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG4_9BACT Length = 489 Score = 453 bits (1166), Expect = e-126, Method: Composition-based stats. Identities = 130/473 (27%), Positives = 203/473 (42%), Gaps = 62/473 (13%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 + + +KPN++ +L DD+G+ D+G G TP ID +A +G +S Y S Sbjct: 22 VSLQAQQKPNILFYLTDDLGYGDIGCYGAEGQY---TPAIDQLAKEGTKFSSFYVHQRCS 78 Query: 138 PTRATILTGQYSIHHGILMPP---MYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGEN 194 P+RA +TG Y+ G+ G G TLP+L+ GY T +GKWH+GE Sbjct: 79 PSRAAFMTGSYAHRVGLPQVIYKHREGPIGLNPSEITLPELMKTAGYNTALVGKWHLGEW 138 Query: 195 KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRG 254 K P N G+D F GF +V ++ I+ + G Sbjct: 139 KPFHPLNHGYDYFYGFL----------------KVIEGSEKPSLIENRKELASKIQKTEG 182 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG 314 + + F+ K PFFL Y H +P+ ++ G Sbjct: 183 Q----------------APGMVKAAINFMTKH--KKNPFFLVYSDPMPHAPYFPSEQFKG 224 Query: 315 SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE----VPPHGRTP 370 +S R +YG+ + E++ F +L L++ G +NT++VFTSDNGP E P Sbjct: 225 TS-KRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFTSDNGPPVERQKKYDVGLSGP 283 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVP 429 R K + +EGGVRVP + W G ++ SD ++ + D+ PT +LAG VP Sbjct: 284 LRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDMLPTFCELAGVD-------VP 336 Query: 430 KTTFIDGVDQTSFFLG-TNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 IDGV+ LG ++ R+ + A + + +KY+ Q PY + Sbjct: 337 NDRVIDGVNILPQLLGDQESKALRETQIV----PGATIIHNGWKYYAKQQNPYNNKKPED 392 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 G ++FNL D E+ + +H + L+ M +M LKK Sbjct: 393 WNGL----QPAKEGALFNLKEDIGETTEVSAQHPEIAESLKKNMAKFMAELKK 441 >UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVD2_9BACT Length = 631 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 112/524 (21%), Positives = 170/524 (32%), Gaps = 108/524 (20%) Query: 56 DNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTP 115 + + K+ KPN+V L DD+G D+ G TP Sbjct: 4 RALALSLCFAVSLFAKDGDGGASAPKSRDKPNIVFILCDDLGVNDLSCYGRK---DQQTP 60 Query: 116 DIDAVASQGLILTSAY-SQPSSSPTRATILTGQYSIHHGILMPPMYG------------- 161 ++D +A +G+ T AY + P S +RA I+TG+ I Sbjct: 61 NLDRLAGEGMRFTCAYCASPICSASRAAIMTGKAPGRVHITNFLPGRADAPSQKFIQPEI 120 Query: 162 QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWR 221 + T+ + LH GYV+ IGKWH+G K P N GFD Sbjct: 121 EGQLPLEENTIAKALHGAGYVSACIGKWHLG-GKGFLPTNQGFDYAF------------- 166 Query: 222 DVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVK 281 + A GG+ + + Sbjct: 167 --------------------AGHANTKPSATEGGKGEY--------------ELTAEAER 192 Query: 282 FLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA--RTSYGDCMVEMNDVFANLYKT 339 +L+K D PFFLY H + Y + ++D + K Sbjct: 193 WLEK--NKDHPFFLYLAHNSPHVPLAAKPELIEKHKDAWNPIYAAMIESLDDCVGRIMKK 250 Query: 340 LEKNGQLDNTLIVFTSDNGP-----EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 +++ G + T+ +FTSDNG P PFR KG EGG+R P V W G Sbjct: 251 VDELGLTEKTIFIFTSDNGGLHVYELPNTPSTYNAPFRAGKGYLEEGGLREPLIVRWPGK 310 Query: 395 IQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK 453 I+ ++ V L D PT + AG A +DGV+ G R Sbjct: 311 IKAGATNETPVVLYDFMPTLMTAAGLDVAHTVG------PLDGVNILPLLTGGTIP-PRT 363 Query: 454 AEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNL 507 +F N A+R E+K +T ++N+ Sbjct: 364 LYWHFPNYTNQGSKPAGAIRDGEWKLI--------------------QDDETGNLELYNI 403 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 DP E + + LQ ++ A+ + + A D Sbjct: 404 AADPGEKNDLAKSQSARVSELQGKLAAWRKSIGAQMGTANPNFD 447 >UniRef50_A6DF77 Arylsulphatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF77_9BACT Length = 518 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 116/520 (22%), Positives = 201/520 (38%), Gaps = 72/520 (13%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 KKPN++ L DD+G+ D+ T ++D +A++G+ T A+S +P+R Sbjct: 16 ADKKPNILFILADDLGYGDLSCYNDE--AKVKTANLDQLANEGMRFTDAHSPSTVCTPSR 73 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENK--- 195 +I+TG+ + L + TLPQ+L + GY T GKWH+G + Sbjct: 74 YSIMTGRMAFRLNFKGVFTGVSGPCLITKDRLTLPQMLRNNGYETAMFGKWHIGMSFLDK 133 Query: 196 ------------------------------------ESQPQNVGFDDFRGFNSVSDMYTE 219 P N GFD F G ++ Sbjct: 134 NGDVIEVSEPPRKTPKLKKQEIALEAIKRVDYSKPIPDGPLNQGFDHFFGTVCC--PTSD 191 Query: 220 WRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYG 279 W +++ + P K L R G E++D +++ Sbjct: 192 WLYAYIDGDRIPVPPTKIVDKALLPKHFWSFDCRAGLLAPN-----FKHENVDMVFLEKS 246 Query: 280 VKFLDKMAKSD--KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 + FLD K KPFFL++ + H ++P ++ G + A +GD + + + + L Sbjct: 247 LSFLDSHHKKQSAKPFFLFHSLQAVHLPSFPAKEFQGKTQA-GPHGDFIYQFDYIVGKLV 305 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAE--------VPPHGRTPFRGAKGSTWEGGVRVPTFV 389 + L+ G +NTL++ +SDNGPE +G P+RG K WEGG RVP Sbjct: 306 EKLKTLGMAENTLVIISSDNGPEVGTTINMRERYKHNGARPWRGVKRDNWEGGHRVPMIA 365 Query: 390 YWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 +W G I+ S V L D+ T + +P D + LG Sbjct: 366 WWPGKIRSSSVSQQTVCLTDIMATCASIVN-------TSLPNNAAEDSFNILPILLGQTT 418 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKY--HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFN 506 ++ R+ + ++R ++KY H + A + ++N Sbjct: 419 KAIREFTLHQTISLDLSIRHGDWKYLDHSGSGGNNYSGGRIKKALGLTNSKINAPAQLYN 478 Query: 507 LYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 L DP+E +++ +H + L+ ++ + + P R Sbjct: 479 LKADPKEVNNLYYQHPEIAQQLKAKLEEFKTSGRSAPKRN 518 >UniRef50_Q024K7 Sulfatase n=28 Tax=Bacteria RepID=Q024K7_SOLUE Length = 504 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 119/511 (23%), Positives = 191/511 (37%), Gaps = 66/511 (12%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 A K PN+V DD+G+ D G A TP++D A+ G+ T+A+S Sbjct: 16 ASRAFAAAKPPNIVYMYADDLGYGDTSCYG---ATRVKTPNLDRAAAAGIRFTNAHSSSA 72 Query: 135 SSSPTRATILTGQYSIHH-GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 + +P+R ++LTG+Y+ H G + P G TLP +L GY T A+GKWH+G Sbjct: 73 TCTPSRYSLLTGEYAWRHQGTGVLPGDASLIVQPGRYTLPAMLQQAGYRTGAVGKWHLGL 132 Query: 194 NKESQ---------PQNVGFDDFRGFNSVSDMYT-----EWRDVHVNPEVALSPDRSEYI 239 P VGFD F + D + V+++P L + Sbjct: 133 GGRDLDWNGEIRPGPLEVGFDYSFIFPATGDRVPCVFVENRKVVNLDPNDPLRVRYDKPF 192 Query: 240 KQLPFSKDDVHAVR----GGEQQAIADITPKYM------------EDLDQRWMDYGVKFL 283 P + ++ G I + + ED+ V FL Sbjct: 193 PGEPTGAANPELLKMKPSHGHDNTIVNGISRIGYMAGGKSARWVDEDMADTITGKAVSFL 252 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKN 343 ++ +PFFLY+ T H P+ ++ G + GD + E++ + TL++ Sbjct: 253 EQ--NRARPFFLYFATHDIHVPRVPHPRFVGKTD-MGPRGDAIAELDWSIGRILDTLDRL 309 Query: 344 GQLDNTLIVFTSDNGPEAEVP-----------PHGRTPFRGAKGSTWEGGVRVPTFVYWK 392 NTL VF+SDNGP + H P RG K S ++GG R+P V W Sbjct: 310 KLTRNTLFVFSSDNGPVVDDGYRDQAVERLGDHHPAGPLRGGKYSAYDGGTRIPFVVRWP 369 Query: 393 GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNR 452 G ++P S + DL + L G +P+T D + LG Q Sbjct: 370 GTVKPGISAAPISQVDLLASFAALTG-------RKLPETAAPDSFNVLPALLGKTKQGRP 422 Query: 453 KAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + ++ ++K P G +F+L D Sbjct: 423 HIV---EHATALSLIAGDWKVIRPHTGPRRNQTGNEIG-------NDPEPQLFDLAHDIG 472 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 E +++ +H L + + + P Sbjct: 473 EQNNVAPQHPEKVQELLGMLAQIEKSPRTRP 503 >UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4V9_9PLAN Length = 480 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 113/499 (22%), Positives = 183/499 (36%), Gaps = 82/499 (16%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PS 135 E+ G +PN++V ++DD+G+ V G TP+ID +A++G+ T +S Sbjct: 28 AAERPPGDRPNLIVIMVDDMGYAGVSCFGNPY---FKTPEIDRLAAEGMKFTDFHSSGTV 84 Query: 136 SSPTRATILTGQYSIHHGILMPPMY------GQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 SPTRA +LTG+Y GI Q G + T +LL GY T IGKW Sbjct: 85 CSPTRAGLLTGRYQQRAGIEAVIHPVSDHPEHQKGLRKSENTFAELLKQAGYRTALIGKW 144 Query: 190 HMGE---NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 H G + E P N GFD F G++S + + Sbjct: 145 HQGYPHNSAEFHPDNHGFDTFVGYHSGNIDFISH-----------------------VGD 181 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 H G ++ + Y ++F+ + +PF LY H Sbjct: 182 HVKHDWWHGRKET------QETGYSTHLINQYALQFIKESRN--QPFCLYLAHEAIHNPV 233 Query: 307 YPN------------AKYAGSSPARTS--YGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 ++ +S A + + ++ + + L K+G NT ++ Sbjct: 234 QVPGDPIRRTEAAGWKRWKPASEAERIEKFRGMTLPVDAGVGQIREFLVKSGLDKNTFVL 293 Query: 353 FTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFP 411 F SDNGP + P +RGAKGS +EGG RVP +W G IQ +D D+ P Sbjct: 294 FFSDNGPSRDFPSGSPK-WRGAKGSVYEGGHRVPAIAWWPGKIQAGTETDVPAISLDVMP 352 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH---YFLNGKLAAVRM 468 T L +A +PK +DGVD + S R + A+R Sbjct: 353 TLLGIAHID-------MPKERPLDGVDLSPVLFEQKPLSERPLFWASLSNNGSRSEAMRA 405 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 +K V + + ++ L DP E++++ + Sbjct: 406 GPWKLVVQHPRA------------KPGTFENEKVELYRLDQDPGEANNLSKAEPQRASRM 453 Query: 529 QTEMHAYMEILKKYPPRAQ 547 ++ + + + Sbjct: 454 LKQLKDWYQDTQNTATSQP 472 >UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR48_9BACT Length = 508 Score = 452 bits (1164), Expect = e-125, Method: Composition-based stats. Identities = 124/535 (23%), Positives = 201/535 (37%), Gaps = 77/535 (14%) Query: 43 PNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVG 102 P + A + + P+ + AE +KPNV+ F+ DD+G+ DVG Sbjct: 7 PRRLRFLVA--LLSVLSPLCINA----------AEPSPMPLRKPNVIFFIADDLGYADVG 54 Query: 103 FNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPM-- 159 G TP+ID +A++G+ T YS P +P+R ++TG++S H + Sbjct: 55 CFGQKKI---HTPNIDRIATEGMKFTQHYSGSPVCAPSRCVLMTGKHSGHSAVRDNRELK 111 Query: 160 -YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDFRGFNSVSDMY 217 GQ T+ +LL GY+T A GKW +G +P + GF F G+N + Sbjct: 112 PEGQFPLPANTITVARLLQQNGYITGAFGKWGLGGPESSGKPLDQGFTRFFGYNCQRVAH 171 Query: 218 TEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMD 277 ++ P + + P +D + + + Sbjct: 172 ------NLFPTYLWDDNHRLALDNPPIGEDQKLPADADSNDPASYKAFTGKSYAPDLYAE 225 Query: 278 YGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN----AKYAGSSP---------------A 318 ++F+ D PFFL++ T H +Y G P Sbjct: 226 QALRFIRD--NKDHPFFLFFPTIVPHVALQVPEDSLKEYEGKLPETPYTGGKGYLPNRTP 283 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-------HGRTPF 371 +Y + M+ + +++ D+T+ VFTSDNGP + + PF Sbjct: 284 HAAYAAMITRMDRDLGRMLALIKELNLDDDTIFVFTSDNGPAPQDMGGTDTKFFNSSGPF 343 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 R K S +EGG+R+P V W G IQP SD + D PT L+L+G+ + Sbjct: 344 RSGKTSIYEGGMRIPLIVRWHGKIQPNSTSDRVTGFEDWLPTLLELSGNKKSVPTG---- 399 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 IDG+ S LG R + + G A+R+ +K +P + Sbjct: 400 ---IDGLSFASTLLGEKLP-ERPFLYREFPAYGGQQAIRVGNWKAVRQHLKPKGNAKPNL 455 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +++L TD ES + H + L M K +P Sbjct: 456 H------------IELYDLQTDIAESHDVSDEHPDIVTKLDNLMREQHIPSKAFP 498 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 452 bits (1163), Expect = e-125, Method: Composition-based stats. Identities = 120/506 (23%), Positives = 182/506 (35%), Gaps = 97/506 (19%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 K+ KPNV++ +DD+G D+G G TP IDA+A G T YS Sbjct: 18 FPVEAKEMADKPNVLLIFIDDLGKTDIGIEGS---SFYETPRIDALAKSGARFTQFYSAH 74 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 P SPTRA ++TG+ GI Q T+ Q + GY T +GKWH+G Sbjct: 75 PVCSPTRAALMTGKMPQRLGITDWIRPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLG 134 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 + P GFD +G N + + + + ++ K P Sbjct: 135 HKPQQHPAARGFDWTKGVNHGGQPSSYYFPYKNPQKPDAPNNVPDFEKCQP--------- 185 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA-- 310 + L ++ L + +PFFL H P Sbjct: 186 ---------------EDYLTDVLTSSAIEHL-QQRDRTRPFFLCLAHYAVHTPIQPPKNL 229 Query: 311 --KYA--------GSSPA---------------RTSYGDCMVEMNDVFANLYKTLEKNGQ 345 KY SP +Y + ++ L L+ G Sbjct: 230 VEKYQVKLATQKNPKSPGEGIQEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQGI 289 Query: 346 LDNTLIVFTSDNGPE-----AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS 400 LD T++VFTSDNG P P R KG T+EGG+R+PT++ W G I P+ Sbjct: 290 LDQTIVVFTSDNGGLCTLNGKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPGKISPQVL 349 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN--GQSNRKAEHYF 458 D D++PT L L P T +DG+ ++ +S R Y+ Sbjct: 350 DIPAYTCDIYPTLLSLCQIPPR-------PTQHVDGISLAGLLTKSSSLPESERTLVWYY 402 Query: 459 LNG------KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + AA+R +K ++T +++L DP Sbjct: 403 PHTHGSGHKPSAAIRQGPWKLIHF--------------------LETDRIELYHLEDDPG 442 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEI 538 ES ++ +H + LQ E+ +E Sbjct: 443 ESRNLASKHPERALQLQKELQKIIES 468 >UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH3_9BACT Length = 455 Score = 451 bits (1162), Expect = e-125, Method: Composition-based stats. Identities = 105/464 (22%), Positives = 169/464 (36%), Gaps = 67/464 (14%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 KPN++V L DD G+ DV TP DA+A G+I Y+ S TR Sbjct: 21 ADSKPNIIVILSDDQGYADVS-YNPEHDDYISTPHTDALAKSGVIFHRGYTSGSVCSTTR 79 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 + ++TG+Y +GI G G +P L + GY + A GKWH+G + P Sbjct: 80 SGLMTGRYQQRYGIYTAGEGG-TGTDLNAKFIPNYLKEAGYKSMAFGKWHLGHEMKYHPL 138 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 + GFDDF GF F + + + G Sbjct: 139 HRGFDDFYGFMGRGAH-------------------------DFFRLEKEYDGKFGGPIYR 173 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS---SP 317 L R + VKF+++ DKPFF Y H A+ + Sbjct: 174 GLEPIDDKGYLTTRITEETVKFIEE--NKDKPFFAYVAYNAVHTPAQAPAEDIKAVSGDE 231 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGS 377 R + ++ + KTL+K+ +NT+I++ SDNG + + P RG K Sbjct: 232 TRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAKSMVAN-NKPLRGVKHD 290 Query: 378 TWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 ++GG+RVP + W I+ + + V D+ PT LD AG +P + IDG Sbjct: 291 IYDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLDAAG---------LPALSDIDG 341 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 G +R + ++++ +K Sbjct: 342 ESMLPVIRGDKDNLDRP-FFWNHGDGQTGIQLNNWKLVF--------------------- 379 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 + ++ + D ES ++ H LQ ++ + Sbjct: 380 -NKGVTELYKISDDIGESKNLAASHPEKVQALQKIYDKWLSQMA 422 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 451 bits (1161), Expect = e-125, Method: Composition-based stats. Identities = 107/519 (20%), Positives = 176/519 (33%), Gaps = 98/519 (18%) Query: 47 LVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGG 106 L A + N+ K++Q + + PN+V+FL DD GW D+ NG Sbjct: 7 LFLLACILTGNLTASENKNPPHKKSQTR---PATQATTHPNIVIFLADDQGWGDLSHNGN 63 Query: 107 GVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGL 166 TP++D++A +G+ Y +PTRA LTG+Y G + GQ Sbjct: 64 T---NLHTPNVDSLAKEGVKFNRFYVGAVCAPTRAAFLTGRYHARTG-TIGVSTGQERFN 119 Query: 167 QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVN 226 T+ Q GY T A GKWH G + P GFD++ GF S + + N Sbjct: 120 SDEYTIAQAFKAAGYATGAFGKWHNGTQYPNHPNAKGFDEYYGFTSGHWGHYFSPMLDHN 179 Query: 227 PEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM 286 + D + F+++ Sbjct: 180 GTFVKGNG-----------------------------------YITDDLTDKAMAFIEQQ 204 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKY-----------------AGSSPARTSYGDCMVEM 329 ++ KPFF Y H +Y + + Sbjct: 205 VQNHKPFFAYLPYCTPHSPMQVPDQYWDRFKDKQLKLHNREPDREQPDHLRAALAMCENV 264 Query: 330 NDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFV 389 + + K L D+T++++ SDNGP +G KGS EGGVR P + Sbjct: 265 DWNVGRVLKKLNSLRITDDTIVIYFSDNGPN---GVRWNGDMKGKKGSLDEGGVRSPFVI 321 Query: 390 YWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 W G + + + I DL PT DLAG P+ IDGV L + Sbjct: 322 RWPGHLPAGQEVNQIAGAIDLLPTLTDLAGIK-------RPEPKPIDGVSLKPLMLNSKA 374 Query: 449 QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLY 508 + + +VR D+++ +++++ Sbjct: 375 DWP-ERMIFSSLRNRVSVRTDQYRLSR-------------------------KGELYDMH 408 Query: 509 TDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 DP + ++I + + LQ + + + + +P Sbjct: 409 ADPGQRNNIAKQKPEITAKLQQAVTDWRQSV--WPNGYP 445 >UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UQ05_RHOBA Length = 525 Score = 450 bits (1159), Expect = e-125, Method: Composition-based stats. Identities = 117/576 (20%), Positives = 189/576 (32%), Gaps = 135/576 (23%) Query: 41 DHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMD 100 P+ TT ++ A +AE +PNV++FL+DD+GW D Sbjct: 13 SSPSLASSNLVTTAVL----LIATIASLGNPTTLVAEETSPAPSRPNVLLFLVDDLGWAD 68 Query: 101 VGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPM 159 +G G + TP IDA+A G+ T+AY+ P SPTRA+I+TG++ + I Sbjct: 69 LGCYGSTY---HETPQIDALAESGIRFTNAYAACPVCSPTRASIMTGRHPVRVDITDWIP 125 Query: 160 Y---------------GQPGGLQGLTTLPQLLHD-QGYVTQAIGKWHMGENKESQPQNVG 203 + T+ + L D Y T +GKWH+G+ P + G Sbjct: 126 GMSTDRAQNPRFQHVDDRDNLALDEVTIAEHLRDAADYQTFFLGKWHLGDV-GHLPTDQG 184 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 F G + NP + D Sbjct: 185 FQINIGGGHKGSPPGGYYSPWKNPYLKAKQDG---------------------------- 216 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-------- 315 E L R D V +D ++ DKPFF+ H P+ + Sbjct: 217 -----EYLTTRLTDEAVSLVDTASREDKPFFMMMSYYNVHSPITPDKRTIDHFEEKQSNS 271 Query: 316 -------------------SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 +Y + ++ + K L+++G DNTL++F SD Sbjct: 272 PELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDTSVGRIMKALKEHGVDDNTLVIFFSD 331 Query: 357 NGPEA---EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-----------KSDG 402 NG + + P +P R KG +EGG+R P V + D Sbjct: 332 NGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDS 391 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG---TNGQSNRKAEHYFL 459 + DLFPT LD+ G P + + DG+ G S R ++ Sbjct: 392 VACSTDLFPTILDVVGLP-------LQPESHADGISLLPAIAGEAAETDSSPRDLHWHYP 444 Query: 460 N------GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 + AA+R +K +T + +++L D E Sbjct: 445 HYHGSLWRPGAAIRRGNYKLIEFY--------------------ETDTAELYDLSVDMGE 484 Query: 514 SDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 + + L+ + + + P Sbjct: 485 TKDLSKTEPERFAELRDALRQWQTEMNAKMPVPNPN 520 >UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R014_9PLAN Length = 475 Score = 450 bits (1159), Expect = e-125, Method: Composition-based stats. Identities = 113/523 (21%), Positives = 177/523 (33%), Gaps = 102/523 (19%) Query: 62 MQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVA 121 A T+Q + + PN+VV L+DD+G+ D+ G TP I+ +A Sbjct: 20 CVFAAAFCATKQAFSADSTRV---PNIVVILIDDMGFSDLSCMGSTY---YETPSINKLA 73 Query: 122 SQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPG------------GLQG 168 + G+ T AYS SPTRA +LTG+Y + Sbjct: 74 ASGMRFTHAYSACTVCSPTRAAVLTGKYPARLHLTDWIPGQMSNKTKLKLPDWNKQLNLE 133 Query: 169 LTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPE 228 TL +LL GY T +IGKWH+G E +P GF G NS + + N Sbjct: 134 EITLAELLGAHGYTTASIGKWHLGP-PECEPTRQGFSLNIGGNSKGQPPSYFFPYERNGV 192 Query: 229 VALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK 288 + K E L R D F+++ Sbjct: 193 LLPGL-----------------------------AEGKPNEYLTDRLTDACEAFIEE--N 221 Query: 289 SDKPFFLYYGTRGCHFDNYPNAK-----------YAGSSPARTSYGDCMVEMNDVFANLY 337 KPFFLY H + + G+ Y + ++ + Sbjct: 222 QSKPFFLYLPHYCVHTPLQAKPELIAKYEAKNAQFPGNPQHEAKYAAMVESLDQSVGRIM 281 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP 397 L+ T+++FTSDNG P R KGS +EGGVRVP V + MI+P Sbjct: 282 AKLDALDLTKKTIVIFTSDNGGLVLREITSNLPARAGKGSAYEGGVRVPLIVSYPPMIKP 341 Query: 398 R-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ-SNRKAE 455 D DLFPT +L+G + IDG + R Sbjct: 342 GTTCDVPAISMDLFPTLAELSGAKYS---------HDIDGKSIVPLLEEKPDAFAARPLY 392 Query: 456 HYFLNG------KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 ++ + +A+R+ ++ + +++L Sbjct: 393 WHYPHYHGGGATPYSAMRVGNYRLVEF--------------------FEDGRLELYDLAH 432 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYMEILK---KYPPRAQIK 549 D E ++ + L ++ A+ + + P A+ K Sbjct: 433 DIGEMKNLAQEKPDLTEKLHRQLIAWRKSVDAQYATPREAEPK 475 >UniRef50_C1ZCM0 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCM0_PLALI Length = 509 Score = 450 bits (1159), Expect = e-125, Method: Composition-based stats. Identities = 149/486 (30%), Positives = 218/486 (44%), Gaps = 35/486 (7%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSS 137 L KPN++V + DDVGWM+V GG + G TP+ID + +G+ TS Y+QPS + Sbjct: 17 LSASAADKPNILVIMADDVGWMNVSSYGGDIM-GIRTPNIDRIGQEGIRFTSFYAQPSCT 75 Query: 138 PTRATILTGQYSIHHGILMPPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKE 196 RA LTGQ + G+ G P GLQ TL ++L +GY T GK H+G+ +E Sbjct: 76 AGRAAFLTGQLPVRTGLTTVGTPGSPAGLQKEDITLAEILKTKGYSTAQFGKNHLGDLEE 135 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD++ G + + + +P+ P+ + V G Sbjct: 136 HLPHRHGFDEYFG----NLYHLNGNEDLEDPDRPTDPEFRKKFDPRGV----VSGTADGP 187 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + +T K ME D + + FLD+ AK KPFFL++ + H + G S Sbjct: 188 TKDEGPLTTKRMETFDDEIVAKSLDFLDRKAKDQKPFFLWHCSARLHVFFHFKEGVRGKS 247 Query: 317 PART--SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRG 373 A YGD + E + L LE G NT++V+ +DNG + P G T PFRG Sbjct: 248 RAGREDVYGDALAEHDGHIGQLLAKLEATGLDKNTIVVYVTDNGAYQYMWPEGGTSPFRG 307 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT-- 431 KG+TWEGGVR P V W G + R S IVD+ DL PT AG A Sbjct: 308 DKGTTWEGGVRAPCMVRWPGAVGGRVSSEIVDMTDLLPTLASAAGETDAVEKLKKGADYG 367 Query: 432 -----TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQS 486 +DG DQT+ F G + +S RK Y+ L A+R + FK I++ Sbjct: 368 GKNYKVHLDGYDQTALFTGKSDKSARKFVFYYDETVLTAIRYESFKVTFSIKEG------ 421 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESD--SIGVRHIP----MGVPLQTEMHAYMEILK 540 G + ++ + NL DP E + ++ + P+ ++ + Sbjct: 422 ---GHWDDPLVGLGRPMITNLRMDPFERQTGDVNRQYAEHKTWVLTPIVGIAEKHLTTFR 478 Query: 541 KYPPRA 546 +P R Sbjct: 479 DFPVRQ 484 >UniRef50_Q0KB87 Arylsulfatase A or related enzyme n=107 Tax=cellular organisms RepID=Q0KB87_RALEH Length = 585 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. Identities = 139/522 (26%), Positives = 219/522 (41%), Gaps = 39/522 (7%) Query: 49 KPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGV 108 PA A + +PV A +GKKPN++V DD+G ++ Sbjct: 62 TPAQPPAQSNLPVAPEAA-------SAPVAVNTSGKKPNILVIFGDDIGQTNISAY-SMG 113 Query: 109 AVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQ 167 VG+ TP+ID +A +G+I T Y++ S + R++ +TGQ + G+ G G Sbjct: 114 VVGHRTPNIDRIAREGMIFTDYYAENSCTAGRSSFITGQSPLRTGLSKVGAPGATVGLQA 173 Query: 168 GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 T+ + L GY T GK H+G+ E P GFD+F G + E + + Sbjct: 174 RDVTIAEALKPLGYATGQFGKNHLGDRDEYLPTKHGFDEFYGNLYHLNAEEEPQRPY--- 230 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 D+++ + + +H+ G+ + +T K ME +D D KF+ K Sbjct: 231 ---WPKDKNDPFVKNFSPRGVLHSTADGKIEDTGALTTKRMETIDDETTDAAQKFITKQV 287 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTS-YGDCMVEMNDVFANLYKTLEKNGQL 346 ++DKPFF++ T H + G S + Y D M+E + L KTL+ Sbjct: 288 QADKPFFVWMNTTRMHAFTHVRPSMQGQSGMPGNDYADGMIEHDGDVGKLLKTLDDLKIA 347 Query: 347 DNTLIVFTSDNGPEAEVPPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIV 404 DNT++++T+DNGP P TPFR K + WEG RVP + W G I+ S+ + Sbjct: 348 DNTIVIYTTDNGPNQWSWPDAASTPFRSEKNTNWEGAFRVPAMIRWPGKIKAGTVSNEMF 407 Query: 405 DLADLFPTALDLAGHPGAKVA-------NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY 457 D FPT L G K +DG +Q ++ G + RK +Y Sbjct: 408 SGLDWFPTLLAAVGDGDIKERLLKGTSLGSKNAKVHLDGYNQLAYLTGQTNKGARKEFYY 467 Query: 458 FLNGK-LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 F + L A+R D++K Q T G + +FNL DP E Sbjct: 468 FNDDGVLVAMRYDDWKVVFCEQ-----TTPGGFQVWQDPFKCLRVPKIFNLRMDPYERAD 522 Query: 517 -IGVRHIPMG---VPLQT----EMHAYMEILKKYPPRAQIKS 550 + ++ L + A+++ YPP + S Sbjct: 523 IVSDQYNDWLGKNAYLTEIGTMKAAAFLQTFVNYPPSQRPAS 564 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. Identities = 119/527 (22%), Positives = 183/527 (34%), Gaps = 74/527 (14%) Query: 33 ARKGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFL 92 K P + + T+ + P + T +LA +PNVV+ L Sbjct: 2 PSKPTHLQTSPPRSPHRFWCTVLLLITPTL--------TFGQLAGETHAQTLRPNVVMLL 53 Query: 93 LDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIH 151 DD+G+ DVG GG V TP ID +A+ G YS SP+RAT++TG++ I Sbjct: 54 ADDLGYRDVGCYGGPV----ETPTIDQLAAGGTRFQQFYSGCAVCSPSRATLMTGRHHIR 109 Query: 152 HGILMPP--MYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG----ENKESQPQNVGFD 205 G+ TL ++L D GY T +GKWH+G E + P GFD Sbjct: 110 AGVYSWIQDESQNSHLRLREVTLAEVLRDAGYATAHVGKWHLGLPTEERDKPTPDQHGFD 169 Query: 206 DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP 265 + + + H NP+ + Sbjct: 170 HWF------ATWNNAQPSHRNPDNFIRNGEP---------------------------VG 196 Query: 266 KYMEDLDQRWMDYGVKFLDKM--AKSDKPFFLYYGTRGCHFDNYPNA----KYAGSSPAR 319 + Q D ++++D+ + D+PFFL H KY S Sbjct: 197 QLEGYSCQLVADEAIRWMDRHRESDPDQPFFLNVWFHEPHAPIAAPDEVTQKYGKLSDKG 256 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTW 379 Y + + L L+ G +NTLIV+ SDNG RG KG+ W Sbjct: 257 AVYSGTIDNTDQAIKRLLAKLDALGVRENTLIVYASDNG---SYRTDRVGKLRGRKGANW 313 Query: 380 EGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 EGG+RVP +W G I S+ L D+ PT L P +DG D Sbjct: 314 EGGIRVPGIFHWPGHIPAGVVSNEPAGLVDVLPTICGLLKIS--------PPQVHLDGSD 365 Query: 439 QTSFFLGTNGQSNRKAEHYFLNGKLA---AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 T G R ++ + A+R ++ + + Sbjct: 366 LTPLLTGHADSFERHQPLFWHLQRSQPIVAMRDGDYSLVGFRDYEMSNKNLFEEKWIPAI 425 Query: 496 VMQT-AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 T ++NL DP ++ ++ ++ M + K Sbjct: 426 KNGTYHNFELYNLKDDPGQTKNLAAEQPERVEAMKQRMLQINAGIMK 472 >UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J5W3_9FLAO Length = 468 Score = 449 bits (1157), Expect = e-125, Method: Composition-based stats. Identities = 106/499 (21%), Positives = 175/499 (35%), Gaps = 83/499 (16%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 + E K KKPN+V L DD+G+ ++G GG + TP+ID +A +G+ ++ Sbjct: 14 ATFGIQAQETKNTKKPNIVFILADDMGYNELGSYGGKII---ETPNIDQLAKEGMKFSNH 70 Query: 131 YSQP-SSSPTRATILTGQYSIHHGILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAI 186 Y +P+R T++TG+++ H I P G T+ ++L GY T A Sbjct: 71 YCGSNICAPSRGTLMTGKHTGHAYIRDNKPLPYEGNEPIPASEITVAEILKTAGYTTGAF 130 Query: 187 GKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKW +G E P N GFD F G+N + + Sbjct: 131 GKWGLGYPASEGSPNNQGFDQFYGYNGQIHAHNYFTS----------------------- 167 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 + + + A+I Y D ++F++ + PFFLY+ H Sbjct: 168 ----YLRKNDLVELNANIDAPYSVYSADIIKDRALEFVE--VNKNNPFFLYFCPTLPHNP 221 Query: 306 NYPNA----KYAGSSPARTS------------YGDCMVEMNDVFANLYKTLEKNGQLDNT 349 + +Y Y ++ + L++ LDNT Sbjct: 222 YHQPDDKTLEYYAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNT 281 Query: 350 LIVFTSDNGPEAEVPPHG----RTPFRGAKGSTWEGGVRVPTFVYWKGM-IQPRKSDGIV 404 LI+F SDNG RG K +EGG++ P +WKG I S+ I Sbjct: 282 LIIFASDNGSALTKEEDSYLRTGGDLRGRKSEVYEGGIKSPLIAFWKGKIIPGSSSNHIS 341 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 D PT ++ IDG+ LG + Y+ + Sbjct: 342 AFWDFLPTCAEIVKAKTPDN---------IDGISYLPTLLGKTDNQKQHDYLYWERSQSQ 392 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 A+R + K + + + Q ++NL DP E +++ + Sbjct: 393 AIRKGDMKANFVYDKT----------------SQKQNIEIYNLAQDPFEKNNLAETMPEL 436 Query: 525 GVPLQTEMHAYMEILKKYP 543 + +P Sbjct: 437 KAEFIKIAQTARVESEIFP 455 >UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI1_9BACT Length = 472 Score = 449 bits (1157), Expect = e-125, Method: Composition-based stats. Identities = 110/476 (23%), Positives = 188/476 (39%), Gaps = 78/476 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 KPN++ L DD+G+ +VG+NG + TP++D +AS+G+ T Y +P+R Sbjct: 17 AQMKPNIIYILCDDLGYGEVGYNGQKMI---QTPELDKLASKGMRFTDHYCGNAVCAPSR 73 Query: 141 ATILTGQYSIHHGILMPPMY---GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKE 196 A+++TG++ H I GQ TL +L+ GY T IGKW +G + Sbjct: 74 ASLITGKHPGHAFIRANSPGYPDGQTPIPADSETLGKLMKRAGYATACIGKWGLGGFHNA 133 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 P GFD F G+ + + + + R GE Sbjct: 134 GNPHKQGFDHFYGYTDQRKAHNYYPE---------------------------YLWRNGE 166 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF----DNYPNAKY 312 ++ + + + + +K++++ K D+PFFLY H + K Sbjct: 167 KEMLNNKNGEENDYSHDLMTVDALKYIEE--KKDQPFFLYLAYLIPHVKYQVPDLAQYKD 224 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP----HGR 368 + M+ + + LE+ G DNTLI+F SDNG + + Sbjct: 225 KDWPKEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIMFNSDNGAHGKSNSEKFFNTS 284 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANL 427 +G K S ++GGVR P YW G IQ SD I D+ PT +L G Sbjct: 285 GDLKGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFWDMMPTFSELTG--------- 335 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL----NGKLAAVRMDEFKYHVLIQQPYAY 483 P DG+ LG + + + Y+ N A+R ++K VL ++ Sbjct: 336 EPFKGETDGISMLPTLLGKDSEQKQHKYLYWELYESNKPNCAIRFGKWKGVVLDRRKGLN 395 Query: 484 TQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM-HAYMEI 538 ++++ D ES ++ ++ + ++ M A+++ Sbjct: 396 ------------------IELYDMSGDQSESKNLAAQYPEVVDEIRKMMVEAHVKS 433 >UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V8P8_BACUN Length = 525 Score = 449 bits (1157), Expect = e-125, Method: Composition-based stats. Identities = 124/510 (24%), Positives = 194/510 (38%), Gaps = 80/510 (15%) Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 ++ A + +++ ++PN+V+ + DD+GW DVG+ G AV TP+IDA+ Sbjct: 7 LVSVAALLPFSGSNAGNVQRDKSQRPNIVLVIADDMGWGDVGYQG---AVDVSTPNIDAL 63 Query: 121 ASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQ 179 A +G+ + Y S P+RA ILTG Y G ++ +G +TL +++ D Sbjct: 64 ARRGVQFSQGYVSCSISGPSRAGILTGVYQQRFGFYNN-LHPWAKIPEGQSTLGEMVRDC 122 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GY T +GKWHM ++ E P GFD F GF S D + +R Sbjct: 123 GYATGFVGKWHMADSPEQSPNRRGFDQFYGFWS--DTHDYYRSTDKPGVELY-------- 172 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 D R GE Q + E + + V+F+DK A S PF L Sbjct: 173 -------DFCPLYRNGEIQPPLHESG---EYITDCFTREAVEFIDKHASS--PFLLCLSY 220 Query: 300 RGCHFDNYPNAKYAGSSPART-------SYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 H Y R + ++ ++D + ++L KNG +NTL + Sbjct: 221 NAVHSPWQVPEHYVNRLEGRRFHHEDRKVFAAMVLALDDGIGRVMESLRKNGLEENTLFI 280 Query: 353 FTSDNGPEAEVP----------PHGR------TPFRGAKGSTWEGGVRVPTFVYWKGMIQ 396 SDNG G PFRG K T+EGG+RVP + W + Sbjct: 281 LISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGYKADTYEGGIRVPYIMSWPSELP 340 Query: 397 PR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT-NGQSNRKA 454 D V D+FPT + G + + +DGV + + Sbjct: 341 QGMVYDNPVISLDIFPTVMQAVGGTSRQKYS-------LDGVSLLPYLKSEWPIDKRPHS 393 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 Y+ + A+R ++K Q T +F++ D +E Sbjct: 394 TLYWRRDEDFAIRKGDWKLVYNDQGS------------------TRKIQLFDMKDDKEEV 435 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 + + + L E A+ L PP Sbjct: 436 YDLSGEYPELADSLLAEFDAWDAAL---PP 462 >UniRef50_A6DP41 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DP41_9BACT Length = 534 Score = 449 bits (1156), Expect = e-124, Method: Composition-based stats. Identities = 123/511 (24%), Positives = 194/511 (37%), Gaps = 66/511 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 + +KP+++ L+DD+G DV + TP IDA+A G++ T ++ +PT Sbjct: 18 QAMEKPHIIYVLMDDMGQGDVSCFNP--SSKIHTPQIDALAKNGMMFTDTHTNSSVCTPT 75 Query: 140 RATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 R ILTG+Y+ + + G L G TL LL QGY T IGKWH+G + Sbjct: 76 RYGILTGRYAWRTHLKKSVIGGTSPSLIKPGRMTLASLLKGQGYHTGMIGKWHLGWDFSF 135 Query: 198 QPQN----------------------------VGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 P + GFD + S D+ + Sbjct: 136 HPDSVKIDPLYWGYTPGTKIDYAKGVENGPDVHGFDYYYSIPSSLDIPPYVYVENGRVTN 195 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 +R + + + +ED+ + +F+ K AKS Sbjct: 196 LDISERKGEEGKRLW-------------RGGPMSADFDIEDVTPNFFRRANQFIAKNAKS 242 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNT 349 DKPFFLY H P K+ G S Y D +++++ L KTL+ N DNT Sbjct: 243 DKPFFLYLPLPSPHTPILPIKKFQGKSGVN-EYADFILQIDSHMGELIKTLKDNNIFDNT 301 Query: 350 LIVFTSDNGPEA-----EVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWK-GMIQPR-K 399 L+VFT+DNG E+ G P FRG K +EGG RVP V W G +Q Sbjct: 302 LLVFTADNGISPRADIVEINNAGHFPSNGFRGRKADIFEGGHRVPYIVTWPNGGVQAGSV 361 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 S+ + D+ T D+ +P+ D + R A + Sbjct: 362 SEQTICTTDMLATLADILEVK-------LPENAGEDSYSTLPLLINRPYDFKRPATVHHS 414 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 A+R ++K + ++NL +DP E+ ++ Sbjct: 415 INGSYAIRQGDWKLIFCAGSGGWPKSDLT--PEMASAQGLPVIQLYNLKSDPAETVNLYA 472 Query: 520 RHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 ++ + L +M Y++ + P AQ + Sbjct: 473 KYPHIVDRLTVQMQKYIDEGRSTPGEAQKNT 503 >UniRef50_C7ZGP1 Predicted protein n=3 Tax=Leotiomyceta RepID=C7ZGP1_NECH7 Length = 446 Score = 448 bits (1154), Expect = e-124, Method: Composition-based stats. Identities = 134/472 (28%), Positives = 220/472 (46%), Gaps = 36/472 (7%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 K KPN+V+ L D++GW ++G GGG+ G TP ID +A++GL+L + + PTR Sbjct: 3 KDPTKPNIVLILADNLGWGELGCYGGGILRGAATPRIDKLATEGLLLHNFNVESDCVPTR 62 Query: 141 ATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 + ++TG++ I G G P G + TLP+ L QGY T GKWH+G+ P Sbjct: 63 SALMTGRHPIRTGCRQSVPAGFPQGLTRWERTLPECLKPQGYATAHHGKWHLGDIPGRYP 122 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 + GFD++ G +D + PEVA P + + G + + Sbjct: 123 SDRGFDEWLGIPRTTDESQFTSALGYAPEVAELP-------------YIMKGIAGQDSEN 169 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 I + +D+ +D +L + K++KPFFLY+ HF P+ + G + + Sbjct: 170 ICIYDLEKRRLIDEMLVDQSKDWLSRQVKAEKPFFLYHPLVHLHFPTLPHRDFEGKT-GQ 228 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TPFRGAKGST 378 + D M EM+ L L+ G DNT+++F SDNGPE P G P+ G + Sbjct: 229 GEFADSMAEMDYRVGELIDHLDSLGVSDNTVLIFASDNGPEFRPPYKGTAGPWSGTYHTA 288 Query: 379 WEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 EG +RVP + W G + S+ V + D+F T L++AG + VP IDG+ Sbjct: 289 MEGSLRVPFIIRWPGHVPTGVTSNETVHVTDIFTTILEIAG-------SEVPSDRPIDGI 341 Query: 438 DQTSFFLGTNG-QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 Q SFF + +S R+ +++ +L AV+ ++K H++ ++ + Sbjct: 342 SQVSFFKDPSTVKSQREGFLFYIKEELRAVKWKDWKLHLI-----------WEPKVNQSS 390 Query: 497 MQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQI 548 + +FN+ DP+E I + + P+ + + LK P Sbjct: 391 GKLESPYLFNVVRDPKEETDILAYNTWVMQPVLKLRAEFEKSLKSDPAPPDP 442 >UniRef50_D2QZL2 Sulfatase n=8 Tax=cellular organisms RepID=D2QZL2_9PLAN Length = 529 Score = 448 bits (1154), Expect = e-124, Method: Composition-based stats. Identities = 132/505 (26%), Positives = 210/505 (41%), Gaps = 34/505 (6%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 L + K+PN+V+ DDVG ++ GV G TP ID +A +G++ T Y++ Sbjct: 15 SLVASAQAQIKRPNIVIIWGDDVGQSNISAYSHGVM-GYKTPHIDRLAREGMMFTDYYAE 73 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 S + RA+ +TGQ+ + G+ + G G + T+ +LL GY T GK H+G Sbjct: 74 QSCTAGRASFITGQHGLRTGLTKVGLPGAALGLRKEDPTIAELLKPLGYATGQFGKNHLG 133 Query: 193 ENKESQPQNVGFDDFRGFNSVSDM--YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 + E P GFD+F G + E D +P + +DD Sbjct: 134 DRNEFLPTVHGFDEFYGNLYHLNAEEEPEHADYPKDPAFRAKYGPRGVLDCKASDRDDPT 193 Query: 251 AVRGGEQ------QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 + + +T K ME +D V ++ + +K+DKPFF++ HF Sbjct: 194 VDARFGKVGKQIIKDTGPLTKKRMETIDDDVASRAVDYIQRQSKADKPFFIWVNFTHMHF 253 Query: 305 DNYPNAKYAGSSPA-RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + + G S + Y D M++ + + K ++ G DNT +++++DNGP Sbjct: 254 RTHVKPESKGQSGRWMSEYADAMIDHDKNVGTVLKAIDDAGIADNTFVMYSTDNGPHMNS 313 Query: 364 PPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG 421 P TPFR K S WEG RVP V W I+P S+ IV D PT L +AG Sbjct: 314 WPDAAMTPFRNEKNSNWEGAYRVPCAVRWPNKIKPGSVSNQIVGHHDWLPTLLAIAGDEQ 373 Query: 422 AKVA-------NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA-AVRMDEFKY 473 + DG + G +S R++ Y + + +R D +K Sbjct: 374 VTDKLLKGYKIGDMTYKVHPDGYNLVPHLTGQEEKSPRESFLYCNDDQQLVGLRYDNWKL 433 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV--------RHIPMG 525 + Q+ +G ++ +FNL DP E I H + Sbjct: 434 VFMEQRA-----TGTLRVWSEPFTTLRVPKIFNLRLDPYERADITSNTYYDWLIDHAFLL 488 Query: 526 VPLQTEMHAYMEILKKYPPRAQIKS 550 VP Q + ++ K+YP R + S Sbjct: 489 VPAQDYVGKFLLTFKEYPQRQKAAS 513 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 448 bits (1153), Expect = e-124, Method: Composition-based stats. Identities = 107/531 (20%), Positives = 171/531 (32%), Gaps = 104/531 (19%) Query: 54 IADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP 113 + ++ + L +K + +PN+V L+DD+GW D G + Sbjct: 3 LKSLHQSLLFAVCLLLISVTALHAEQKISADRPNIVFILIDDMGWPDPVSYGNQF---HD 59 Query: 114 TPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILM------------PPMY 160 TP ID +AS G+ T Y+ P SPTRA+I GQY + Sbjct: 60 TPHIDQLASDGVRFTDFYAACPVCSPTRASIQAGQYQARLHLTDFIPGHWRPFEKLIVPE 119 Query: 161 GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEW 220 P + T +LL Y T GKWH+G P G+ Sbjct: 120 NAPHLPLEIVTPGELLQSANYNTAYFGKWHLGPES-HNPDQQGYQTSLVTGG-------- 170 Query: 221 RDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGV 280 H P +P K L D + Sbjct: 171 --RHFAPRFRTTPSTRIPNK----------------------------AYLADFLTDKTI 200 Query: 281 KFLDKMAKSDKPFFLYYGTRGCHFDNYPNA----KYAGSSP-----ARTSYGDCMVEMND 331 +F+ + KPFF+ H KY Y + ++D Sbjct: 201 EFIRQ--NKSKPFFVQLSHYAVHIPLEAKQQMIRKYQQKPKPAYGINNPVYAAMVAHVDD 258 Query: 332 VFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-----RTPFRGAKGSTWEGGVRVP 386 + LE+ +NT+++FTSDNG + G P R KGS +EGG+RVP Sbjct: 259 SVGRIVAALEELKLTENTVVIFTSDNGGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVP 318 Query: 387 TFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLG 445 + W G+ D +PT ++A + + IDG+ Sbjct: 319 LIIKWPGVAAAGKTCAEPTISIDFWPTFAEIA-------HTTLQEHQTIDGLSLLPLLKD 371 Query: 446 TNGQSNRKAEH-----YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 + NR+ + Y + +A+R ++K Sbjct: 372 PSSHLNREEIYFHYPHYHHSTPASAIRAGDWKLIEF--------------------FADG 411 Query: 501 GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 ++NL D E+ ++ ++ V LQ ++ + P K D Sbjct: 412 NLELYNLQQDLSETTNLAAKNPEKAVELQQKLADWRTRTGAALPVKNPKYD 462 >UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD88_SHEWM Length = 500 Score = 448 bits (1152), Expect = e-124, Method: Composition-based stats. Identities = 110/520 (21%), Positives = 181/520 (34%), Gaps = 75/520 (14%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 + T + +E K ++PNV+ FL DD+G D+G G TP+I Sbjct: 6 TTLSVAVLCSIMVTSCSQSNIEPKVNRQPNVIYFLADDLGVGDLGSYGQQHI---RTPNI 62 Query: 118 DAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPP----------MYGQPGGL 166 D +A++G+ + Y+ +P+RA+++TG+ H I GQ Sbjct: 63 DKLAAEGMRFSRHYAGSSVCAPSRASLMTGRDMGHTDIRGNIQLMDQPDSPEYQGQYPLA 122 Query: 167 QGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHV 225 QG TL L GY T A GKW +G P+ +GFD F G+ + + Sbjct: 123 QGTITLAHLFQLAGYQTGAFGKWGLGSLQSSGNPKAMGFDQFYGYLDQRHAHNYFPQYLW 182 Query: 226 NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 + + D +D + D P + +F+ + Sbjct: 183 DGDEVARLDNPAINVHPKLDRDK----SDHREYMGKDYAPYK-------ILARAKEFISQ 231 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAK-------------------YAGSSPARTSYGDCM 326 D+ FFLY H K Y R + + Sbjct: 232 --NRDEAFFLYVPFVVPHAAIQIPDKELDGYQFDETAHRLGEPRAYTPHPKPRAARAAMI 289 Query: 327 VEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEG 381 M+ ++ L++ G DNTL++F+SDNG A + RG K + +EG Sbjct: 290 SRMDRDVGDIMAMLKELGLDDNTLVLFSSDNGATAAGGSDINFFNSTAGARGEKATLYEG 349 Query: 382 GVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 G+R P W G I SD + D+ PT L + I G+ Sbjct: 350 GIRAPLIARWPGNISAGSESDHLSAFWDMLPTFAQLLDLSVPEG---------IQGISML 400 Query: 441 SFFLGTNGQSNRKAEHY--FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQ 498 LG ++ ++ F AV M +K + ++ Sbjct: 401 PTLLGKPQNQQHESLYWEFFSRNPSQAVVMGNWKAIRHYSKE-----------RGKGALE 449 Query: 499 TAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 ++++NL DP ES ++ +H + + M Sbjct: 450 LGATALYNLQEDPSESQNLAAKHPELVKKAEMIMAQRQRS 489 >UniRef50_Q7UIN1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UIN1_RHOBA Length = 554 Score = 447 bits (1151), Expect = e-124, Method: Composition-based stats. Identities = 128/553 (23%), Positives = 204/553 (36%), Gaps = 73/553 (13%) Query: 42 HPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEK---KTGKKPNVVVFLLDDVGW 98 H L + ++ + + T++ +A+ T +PNV++ DD G+ Sbjct: 11 HSRLRLSQSNLSLRAIAILAVGLVCLSVSTRRAVAQQNPVIGSTDTRPNVIIVYTDDQGF 70 Query: 99 MDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMP 157 DV TP++D +A +GL T+A+S +P+R +LTG+YS + Sbjct: 71 GDVSSMNPD--AKFETPNMDRLAKEGLTFTNAHSSDSVCTPSRYGLLTGRYSWRTTLKRG 128 Query: 158 PMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNV------------- 202 M + L TL L D+GY T +GKWH+G P+ Sbjct: 129 VMNAEGKCLIADDRMTLASFLRDEGYQTGMVGKWHLGMQFPGSPKKRDWSQPVRDMPLDK 188 Query: 203 GFDDFRGFNSVSDM----YTEWRDVHVNPEVALSPDRS----EYIKQLPFSKDDVHAVRG 254 GFD F G + + + + R V P+ + +Y P+ + + A + Sbjct: 189 GFDHFFGIPASLNYGVLAWFDGRHAAVPPKSWTGKKPNKRHVDYRIMPPYQETETEARKR 248 Query: 255 GEQQAIADITPKYMEDLDQRWMDYGVKFLDKM--------AKSDKPFFLYYGTRGCHFDN 306 + I R+ D ++++ + A + PFFLY H+ Sbjct: 249 FKNTTIEVADDFVDNQCLTRFTDEAIEWITEATATPGNESASNAPPFFLYLPLTSPHYPV 308 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE------ 360 P +Y G YG+ M+E + L K LE NG DNTL++ TSDNGPE Sbjct: 309 CPLPEYWGQGDC-GGYGEFMIETDHHLGRLLKHLEANGLTDNTLVILTSDNGPEKSWKQR 367 Query: 361 -AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLA 417 + H +RG K +EGG RVP W I+ R SD +V DL T +L Sbjct: 368 IDDFGHHSNGSYRGGKRDIYEGGHRVPMLARWPNGIKQPGRISDALVGQVDLLATVAELL 427 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 G P +P D S L + + +R A+ ++K+ Sbjct: 428 GRP-------LPDEAAEDSHSFASILLDPSYEHHRVPLINHGVRGEFAITAGDWKWI--- 477 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 ++NL DP ES I H + L+ + + Sbjct: 478 ----------------APRRDNDEGELYNLANDPSESQDISSDHPTVVRRLRNALTKIVV 521 Query: 538 ILKKYPPRAQIKS 550 + Q Sbjct: 522 NGRSTSGDPQPND 534 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 106/508 (20%), Positives = 179/508 (35%), Gaps = 72/508 (14%) Query: 48 VKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGG 107 A I ++ ++ T ++PNV+V L DD+ D+ GG Sbjct: 18 FVAAILILLSLNECHGQAPAVQDGDANAKSESDATSRRPNVIVILADDLAVGDLA---GG 74 Query: 108 VAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQP--- 163 TP++D AS+ + + AYS +P RA +LTG+Y G++ M P Sbjct: 75 DGSPTRTPNLDRFASESIQFSQAYSGSCVCAPARAALLTGRYPHRTGVVTLNMNRYPEMT 134 Query: 164 GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDV 223 + TT+ +L D GY T +GKWH G P + GFD+F GF D+ Sbjct: 135 RLRRDETTIADVLKDAGYATGLVGKWHTGRGDGFHPLDRGFDEFEGFFGSDDVGYFRYPF 194 Query: 224 HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL 283 +++ + L ++F+ Sbjct: 195 SEQRQISDVDE----------------------------------SYLTDDLNRRAIEFV 220 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAK------YAGSSPARTSYGDCMVEMNDVFANLY 337 + + PFFL+ H + G + + + M+ L Sbjct: 221 RRHHEH--PFFLHLAHYAPHRPLEAPPEVIARYREQGFDESTATIYAMIEVMDRGIGELL 278 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP 397 ++ G ++T+++F SDNGP+ RG K EGG+RVP FV W + P Sbjct: 279 AEIDDLGLSEDTIVLFASDNGPDPLTGERFNRELRGTKYQVNEGGIRVPLFVRWSKRLAP 338 Query: 398 RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY 457 + D +V DL PT LDL + + +DG + + Sbjct: 339 GQRDQMVTFVDLMPTILDLCRVDVSMLNR-------LDGESFVPVLEDASIAHSTMRFWQ 391 Query: 458 F-----LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQ 512 + AAVR +K A + + +F+L DP Sbjct: 392 WNRASPNYTHNAAVRHGRYKLVRPYVTRGAKLKDSTEPSV-----------LFDLQNDPT 440 Query: 513 ESDSIGVRHIPMGVPLQTEMHAYMEILK 540 ES + ++ + + E+ + ++ Sbjct: 441 ESRDVSKQYPDIAERMSRELDRWSASVE 468 >UniRef50_B8KM61 Steryl-sulfatase n=2 Tax=gamma proteobacterium NOR5-3 RepID=B8KM61_9GAMM Length = 500 Score = 446 bits (1149), Expect = e-124, Method: Composition-based stats. Identities = 152/477 (31%), Positives = 232/477 (48%), Gaps = 29/477 (6%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGG-VAVGNPTPDIDAVASQGLILTSAYSQPSS 136 KPNVV+ L D++G+ D+G G G G PTP ID +AS+G++LT + +P Sbjct: 30 TPAIAADKPNVVLMLSDNMGYGDLGVYGSGGELRGMPTPRIDQLASEGMMLTQFFVEPGC 89 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENK 195 +PTRA +LTG+YS G+ + G P LQ TL +L QGY T GKWH+G K Sbjct: 90 TPTRAALLTGRYSQRAGLGSIIIAGTPSTLQDSEVTLAELFKSQGYATAMTGKWHLGGEK 149 Query: 196 ESQPQNVGFDDF-RGFNSVSDM--YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 +S P N GFD++ G +D Y + E A++ ++ + P KD V V Sbjct: 150 QSLPINQGFDEWHVGILQTTDGVLYPDGMRRSGFSEAAIAKSQTAIWESEP-GKDVVKKV 208 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 R + + Y ++ + VK++ + AK +PFFLY G H+ P+ + Sbjct: 209 RPYDLE--------YRRHIEGDIAEASVKYIKEQAKEKEPFFLYVGWSHVHYPALPHPDF 260 Query: 313 AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPH------ 366 G S A +GD ++E++ + +++ G DNT++++ SDNGP + Sbjct: 261 EGKSSA-GLFGDAVMELDYRTGQVLDAIKEAGIEDNTIVIWLSDNGPATTQGSNNDFLGS 319 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 PFRG G EG +RVP + W I+P KS+ +V + D +PT ++ G Sbjct: 320 SAGPFRGEVGDALEGSLRVPGMIKWPAKIKPAKSNEMVAIHDFYPTLANIIGAK------ 373 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQS 486 VP IDGVDQ FFLG N QS R++ F+ G++AAVR +++ Q + Sbjct: 374 -VPTDRAIDGVDQGDFFLGKNKQSARESLITFMEGEVAAVRWKQWR-IYPKQFVASEGNP 431 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 G SV+N+ DP+E + + P + Y + L+KYP Sbjct: 432 SLMGVGAYRAEGMGYPSVYNIARDPREQWNQTAVSAFVLGPYMQIVGEYQKSLEKYP 488 >UniRef50_Q7ULF9 Arylsulfatase n=4 Tax=Bacteria RepID=Q7ULF9_RHOBA Length = 538 Score = 446 bits (1149), Expect = e-124, Method: Composition-based stats. Identities = 171/501 (34%), Positives = 271/501 (54%), Gaps = 13/501 (2%) Query: 51 ATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAV 110 +N + QD+ + KLAE++ K GK+PN++ ++DD+G+ D G GGG A+ Sbjct: 41 TVVSLENHEAAIPLATQDQAAEDKLAEIKAKHGKRPNILWLVVDDMGYGDPGCYGGGAAI 100 Query: 111 GNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQ---PGGLQ 167 G TP+ID +AS+GL LTS YSQ + +PTR+ ILTG+ + G+ P + G + Sbjct: 101 GAATPNIDRLASEGLRLTSCYSQQTCTPTRSAILTGRLPVRTGLTRPILAGDKLTRNPWE 160 Query: 168 GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 +LP+LL D GY T GKWH+GE +P ++GFD++ G+ ++ D P Sbjct: 161 DEVSLPKLLSDAGYYTLLTGKWHVGEPVGMRPHDIGFDEYYGYYPAQKEISQRFDERRFP 220 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP-KYMEDLDQRWMDYGVKFLDKM 286 ++ +P+R+ + + H +GG + + I + M ++ D+ ++ + ++ Sbjct: 221 DLVNNPERARAFEAIAPDNHLTHGFKGGRTEKLKQIQSTEDMGRAEKVLADFTIQRIKEL 280 Query: 287 AKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQL 346 AK D+PFFL + H DN+PN S A+ Y + + E++ + L++ L Sbjct: 281 AKEDQPFFLEHCFMKVHCDNFPNPDLGPLSAAKYYYKEAVAEVDLHVGEIMAALKEADVL 340 Query: 347 DNTLIVFTSDNGPEAEVPPH-GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIV 404 NT + FTSDNGP+ + P G TPFRGAKG+T+EGGVRVP YWKG++ +SDG+ Sbjct: 341 GNTFVFFTSDNGPQMDGWPDAGYTPFRGAKGTTFEGGVRVPGIAYWKGVVSGGRQSDGLF 400 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 DL DLF +L LA P + +P + D +DQTSF L +GQS R+A +++ +L Sbjct: 401 DLLDLFGVSLKLAEIPTSD----LPVDRYYDYIDQTSFLLQDDGQSKREAVYFWWGKELM 456 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 + RM E+K HV + + ++ V +FNLY DP+E +G R Sbjct: 457 SCRMHEYKVHV---KAVLPESTHMHIDYSTLVDVGLAPWLFNLYIDPKEQLPVGHRRNAW 513 Query: 525 GVPLQTEMHAYMEILKKYPPR 545 + ++ A+ KKYP + Sbjct: 514 LATVLGKLKAHATTFKKYPAK 534 >UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=5 Tax=Hydra magnipapillata RepID=UPI0001927538 Length = 502 Score = 446 bits (1148), Expect = e-123, Method: Composition-based stats. Identities = 114/507 (22%), Positives = 192/507 (37%), Gaps = 78/507 (15%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 KP++++ + DD+GW D+ F+G PTP+ID +A+ G+IL + Y P +P+R+ Sbjct: 17 ADKPHIIMIVADDLGWNDISFHGSNEI---PTPNIDRLANNGVILDNYYVLPICTPSRSA 73 Query: 143 ILTGQYSIHHGIL--MPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 I+TG+Y IH G+ G LPQ L QGY T +GKWH+G K+ P Sbjct: 74 IMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTP 133 Query: 200 QNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 GFD + G + D + +S D+H G Sbjct: 134 TYRGFDSYYGSYLGKGDYWNH-------------------SNTETYSGLDLHDNENG--- 171 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF------DNYPNAKY 312 + + + + + ++ S +P FLY + H ++ Sbjct: 172 ----VFSQDGNYSTEMYTAEAISCINNH-NSSEPLFLYLAYQAVHSANTEEDPLQAPQEW 226 Query: 313 AG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA---EVP 364 R Y + M+ ++ L + LDN++I+FT+DNG A + Sbjct: 227 IDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFDYN 286 Query: 365 PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 P RG K + +EGGVR +FVY K + PR S ++ + D PT ++LAG + Sbjct: 287 WANNFPLRGVKATLFEGGVRGVSFVYSKLIESPRVSHELIHITDWLPTLVNLAGGKVSDG 346 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN--GKLAAVRMDEFKYHVLIQQPYA 482 +DG DQ + + K A+R+ +K Sbjct: 347 F--------LDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKIIKEGNYWDG 398 Query: 483 YTQSGYQGGFTGTV------------------MQTAGSSVFNLYTDPQESDSIGVRHIPM 524 + + +F++ DP E + + + + Sbjct: 399 WYPPPSFNEQSNNSFSYLSSTVKCGHDIPIVINHCDSYCLFHIDEDPCEINDLSKKFPEV 458 Query: 525 GVPLQTEMHAYMEILKKYPPRAQIKSD 551 L ++ Y + + PPR + D Sbjct: 459 LAELINRLNTYRQSM--VPPRNNMTID 483 >UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BZE9_HYPNA Length = 459 Score = 446 bits (1148), Expect = e-123, Method: Composition-based stats. Identities = 129/459 (28%), Positives = 187/459 (40%), Gaps = 64/459 (13%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 +E K PN+++ + DD+GW D+ NG A TP+ID + +G+ LT Y+ Sbjct: 27 ATSETAPAAAKPPNIIIIMADDLGWGDISLNG---AALIETPNIDRIGQEGIQLTDFYAG 83 Query: 134 P-SSSPTRATILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 SP+RA +LTG+Y I G+ Q G T+ ++L + GY T +GKWH+ Sbjct: 84 SNVCSPSRAALLTGRYPIRSGMQHVIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHL 143 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G +E P N GFD F G +DM PF Sbjct: 144 GHQEEYWPTNQGFDWFYGVPYSNDM-------------------------APFDLYRGKE 178 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + +P L + +F++ + DKPFFLYY H + Sbjct: 179 I---------IESPADQSQLSLNYAKAAKEFIEDSS--DKPFFLYYAETFPHIPLFVPED 227 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 +G+S A YGD + ++ + TL++ G D+TLI+FTSDNGP E F Sbjct: 228 RSGTSDA-GLYGDVVETVDAGIGIVLDTLDEAGVADDTLIIFTSDNGPWFE---GSAGEF 283 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 RG KG T EGG RVP W G I S + DL PTA L+G +P Sbjct: 284 RGRKGETHEGGFRVPFLARWPGHIPKGSVSHEMAMNIDLLPTAASLSGAT-------LPA 336 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 IDG D TS + +F ++ R F+ + + Sbjct: 337 DRVIDGKDLTSLLT-AGAPTPHDILFFFDGNEIVGARDARFRLVLNT----------FYR 385 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQ 529 + + +F+L DPQES S + L+ Sbjct: 386 TMSVPFEYFGTALLFDLEKDPQESFSFMREYPGEAERLK 424 >UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organisms RepID=Q1YSH0_9GAMM Length = 557 Score = 446 bits (1147), Expect = e-123, Method: Composition-based stats. Identities = 117/540 (21%), Positives = 188/540 (34%), Gaps = 78/540 (14%) Query: 44 NQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGF 103 L+ A + ++ + Q + + PN+++ L DD+G+ D+ Sbjct: 22 RLNLLLFAAPTLKALTSNIEANKRVVWPQGPASAETTPAKRPPNIILILTDDMGFNDISL 81 Query: 104 NGGGVAVG-NPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYG 161 GG A G TP+ID +A QG+ + Y+ + +RA++LTG+YS G+ P+Y Sbjct: 82 YNGGAADGSLQTPNIDRIAEQGIRFNNGYAANAVCTSSRASLLTGRYSTRFGVEYTPIYK 141 Query: 162 QP---------------------------------GGLQGLTTLPQLLHDQGYVTQAIGK 188 G T+ ++L Q Y T IGK Sbjct: 142 TGVRIFNWMEELNPSTPPVLVDMDLAATLPPIDALGMPAAEITIGEVLQQQDYYTAHIGK 201 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G N + +P+ GFDD H + A P S + Sbjct: 202 WHLGSNGDMRPEQQGFDDSLSMKG----IFYLPPDHPDVVNAKIPGDSIDSMVWAVGSYE 257 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 V G + L + D V ++ A +PFFLY G H Sbjct: 258 VQWNGGPPFEP--------KGYLTDYFTDAAVDVIE--ANRHRPFFLYLAHWGPHNPVQA 307 Query: 309 NAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + + +Y + ++ + +L++NG DNTLI+FTSDNG + Sbjct: 308 SREDYDALPHIKDHRLRTYAAMLRALDRSVEKIEASLQENGLSDNTLIIFTSDNGGAGYL 367 Query: 364 PPHG-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPG 421 P+RG K + +EGG VP W I+ + SD + D+F T AG Sbjct: 368 DLTDLNKPYRGWKLTHFEGGTHVPYMAKWPAQIEAGQSSDEAIHHIDMFHTIAAAAGAS- 426 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 VP +DGV+ F G + K + + +K Q Sbjct: 427 ------VPTDRTLDGVNLLPFMQGKQTGAPHKTLFWHTGHQQTVWHQG-WKMIRAEQSDK 479 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +F+L DP E +++ L + + K Sbjct: 480 PGADPMVF--------------LFDLNNDPTEQNNLIAEQPEKAAELTALLDTHHAQQAK 525 >UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R457_9PLAN Length = 516 Score = 446 bits (1147), Expect = e-123, Method: Composition-based stats. Identities = 127/496 (25%), Positives = 188/496 (37%), Gaps = 60/496 (12%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATI 143 PN+V L DD+G+ DVG G TP ID +A G+ L YS P +P+R + Sbjct: 32 PPNIVFILCDDLGYGDVGCFGQK---KTRTPHIDTLARDGMRLIQHYSGAPVCAPSRCVL 88 Query: 144 LTGQYSIHHGILMP---PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 LTG +S H + GQ +G TLP LL +GYV A GKW +G +P Sbjct: 89 LTGLHSGHSQVRDNREAQPEGQYPLAEGTVTLPGLL--EGYVCGAFGKWGLGGPESSGKP 146 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G+N + + P+ S D +K PF+ Q Sbjct: 147 LAQGFDRFFGYNCQRQAHNYY------PQHLWSNDEKVLLKNPPFAAHQKFPADADPQNP 200 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK-------- 311 A + + + +KF+D+ + KPFFLYY + H Sbjct: 201 AAFERYRGPDYAADLISEQALKFIDEHHQ--KPFFLYYASPVPHLALQVPEDSLKEYAGE 258 Query: 312 -----------YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP- 359 Y R +Y + M+ + + LEK G T++VF+SDNGP Sbjct: 259 FSETPYLGERGYLPHPTPRAAYAAMITRMDREIGRILERLEKYGLQRRTIVVFSSDNGPL 318 Query: 360 ------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 RG KGS +EGG+RVPT V + G++ S + D PT Sbjct: 319 YDKLGGTDADFFQSALDLRGRKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDWMPT 378 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDE 470 L LAG DG D + G + Q+ R+ + + G VR + Sbjct: 379 LLSLAGMSTKIPEQA-------DGRDLSPSLRG-DWQAPREFLYREFPGYGGQQFVRSGK 430 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K + +++L DP ES ++ H + L Sbjct: 431 WKAVR----QNLVRPVPTGKKKLAEWKEPLAIELYDLEADPTESTNVAAEHPKVVAKLHA 486 Query: 531 -EMHAYMEILKKYPPR 545 + + ++ PR Sbjct: 487 IMLREHQPSVEFKMPR 502 >UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLM9_BACSE Length = 463 Score = 446 bits (1147), Expect = e-123, Method: Composition-based stats. Identities = 110/483 (22%), Positives = 186/483 (38%), Gaps = 77/483 (15%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTRA 141 G KPN++ L DD+G+ D+ G TP+ID +A+ G T Y+ SSP+R Sbjct: 32 GDKPNIIFILADDMGYCDLSCYGNKYI---ETPNIDRLAATGTAFTQCYAGSGISSPSRC 88 Query: 142 TILTGQYSIHH-------------GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 ++TG+ + + G+ + TT+ +L GY T + K Sbjct: 89 ALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQPNDTTIATVLGAAGYRTCLVNK 148 Query: 189 WHM-GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 WH+ G N E+ P N GFD+F G+ + D + P + ++ E +K+ K Sbjct: 149 WHLDGFNPEATPLNRGFDEFYGWLISTAY---SNDPYYYPYWRFNNEKLENVKENEGDKH 205 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 H + +KF+++ + PFFLY H Sbjct: 206 IKHN--------------------TDLSTEDAIKFINR--NKNNPFFLYLAYDAPHEPYN 243 Query: 308 PNA----KYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 + Y + M+ L L++ G +NTL++F SDNG + Sbjct: 244 IDETTWYDDEAWDMNTKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAKQA 303 Query: 364 P---PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHP 420 P + +G KG +EGG+RVP V G + +K + I+ D+ PT LAG Sbjct: 304 PLEELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVPVQKLNNIIYFPDVMPTLAALAGAT 363 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 ++G++ F G ++ + ++ GK A R ++K Sbjct: 364 DKL-------PQKLNGINILPLFYGQQLDTDNRLLYWEFPGKQRAARCGDWKVV------ 410 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 TV + A ++N+ D ES ++ ++ + EM A Sbjct: 411 --------------TVKKDAPLELYNIKEDMTESVNLANKYPEKVAQFEKEMKAMRIPTP 456 Query: 541 KYP 543 +P Sbjct: 457 NWP 459 >UniRef50_A4A218 Arylsulfatase A n=2 Tax=Bacteria RepID=A4A218_9PLAN Length = 491 Score = 445 bits (1146), Expect = e-123, Method: Composition-based stats. Identities = 112/480 (23%), Positives = 190/480 (39%), Gaps = 61/480 (12%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 + K PN+V+F +D++G D+G G + + TP ID +A++G TS Y Sbjct: 31 AAQSADAKPPNIVLFFVDNLGTGDIGCYGSTL---HRTPHIDRLAAEGAKFTSFYVASGV 87 Query: 136 SSPTRATILTGQYSIHH-------GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGK 188 +P+RA ++TG Y + G+ + G TT+ ++LH GY T GK Sbjct: 88 CTPSRAALMTGCYPLRVDMHKSGEGVAVLRPLDTKGLNPKETTMAEVLHSVGYATGIFGK 147 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G+ E P GFD F G DM R + +LP +D+ Sbjct: 148 WHLGDQPEFLPTQQGFDTFFGIPYSDDM--------------TKDLRPQLWPELPLMRDE 193 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 P + L +R + + F+++ ++PFF+Y P Sbjct: 194 QVI-----------EAPVDRDLLVKRCTEEAIAFIEQ--NQERPFFVYIPHTMPGSTKRP 240 Query: 309 --NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVPP 365 + + G S YGD + E++ + +TL++ + TL+++TSDNG P P Sbjct: 241 FSSPAFQGKS-KNGPYGDSVEELDWSTGQVMETLKRLDLDEQTLVIWTSDNGAPHRNPPQ 299 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKV 424 P++G +T EG +R+P + W G I + D + DL PT LAG +K Sbjct: 300 GSNLPYQGDGYNTSEGAMRMPCVMRWPGKISAGQINDALCTTMDLLPTFGKLAGATMSKT 359 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNG---QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 IDG + + LG + + K ++ +L A+R +K Y Sbjct: 360 E--------IDGHEISRILLGESDTASPWDDKGFAFYYMDQLQAIRAGRWKL-------Y 404 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + +++++ D E + H + L + Sbjct: 405 LPLDPKTGLRLPPAASKEGNVALYDVRNDVHEDQEVSAEHPDVVAHLTDLAQQIRREIGD 464 >UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2X1_9BACE Length = 494 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. Identities = 138/487 (28%), Positives = 204/487 (41%), Gaps = 84/487 (17%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSS 136 +E + G PNVVV +DD+G+ D+G G TP+IDA+A G++LT+ Y+ P Sbjct: 63 VELENGDPPNVVVIYVDDMGYGDLGCTGATAIS---TPNIDALAEGGVLLTNYYAPAPIC 119 Query: 137 SPTRATILTGQYSIHH---GILMPPM------------------YGQPGGLQGLTTLPQL 175 S +RA +LTG+Y I G M Y G LP++ Sbjct: 120 SASRAGLLTGRYPIRTLTSGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDEILLPEV 179 Query: 176 LHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR 235 L GY T +GKWH+G +E +P N GFD F G D Sbjct: 180 LQQAGYETALVGKWHLGIREEERPYNRGFDLFYGALYSDDNDPH---------------- 223 Query: 236 SEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFL 295 R + P + + +F+D D PFFL Sbjct: 224 -----------------RIYHNDEVVHDEPYDQSGMTKELTQVAKQFIDD--NQDGPFFL 264 Query: 296 YYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 YY + H+ + + ++ G+S A YGDCM E++ + TLE+NG L+NTL++FTS Sbjct: 265 YYASPFPHWPSNASEEWLGTSQA-GIYGDCMQEVDWSVGEIMDTLEENGLLENTLVIFTS 323 Query: 356 DNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTAL 414 DNGP + G+ RG K + + GG VP Y G I DG++ D+FPT L Sbjct: 324 DNGPWYDGATGGQ---RGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVDVFPTIL 380 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 +L G +P+ IDG+D F G + S R + A+ D FKY Sbjct: 381 NLLGIE-------LPQDRVIDGMDMWPFLTGQSD-SPRTELFLNKDKDTFALIEDNFKYL 432 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 QG F ++NL TDP+E+ + + ++ + Sbjct: 433 ERSYSENGTYWMLQQGPF-----------LYNLDTDPEEAYDVTTHFPEKAEEMAQKIDS 481 Query: 535 YMEILKK 541 + + LK+ Sbjct: 482 FKQSLKE 488 >UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UL93_RHOBA Length = 470 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. Identities = 110/506 (21%), Positives = 176/506 (34%), Gaps = 102/506 (20%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + + ++P+++ + DD+GW D+ G V TP+IDA+A G+ +AY+ Sbjct: 35 NACLVSAEAAEQPHILFIMADDMGWKDLHCQGNDVL---RTPNIDALAEAGVRFDNAYAG 91 Query: 134 P-SSSPTRATILTGQYSIHHGILMP---------------PMYGQPGGLQGLTTLPQLLH 177 +PTRA+++TG I P TT+ + L Sbjct: 92 STVCTPTRASLMTGLAPARLHITQHGADSKSFWPDDRLIQPPPTNHELPHETTTMAERLK 151 Query: 178 DQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSE 237 GY T GKWH+G +K+ P GFD G + T + Sbjct: 152 AAGYTTGFFGKWHLGGDKKYWPTEHGFDVNVGGCGLGGPPTYF----------------- 194 Query: 238 YIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYY 297 + A K E L R D + F+ + + DKP F+ Sbjct: 195 -----------------DPYRIPALPPRKEGEYLTDRLADETIAFMRR--EKDKPMFVCL 235 Query: 298 GTRGCHFDNYPNAK----YAGSSP---ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 T H+ Y G YG + + + + L+ G D TL Sbjct: 236 WTYNPHYPFEAPEDLIEHYKGKEGTGLKNPIYGGQIEATDRGVGRVLRELDSLGIADETL 295 Query: 351 IVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDG-IVDLADL 409 +VFTSDNG + P R KG +EGG+RVP V W G+ + + V DL Sbjct: 296 VVFTSDNGGWS--GATDNRPLREGKGFLFEGGLRVPLIVRWPGVTEAATVNETPVVSMDL 353 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL--------NG 461 T LD AG A + +DG F G + R A ++ N Sbjct: 354 TATILDAAGVSLANGES-------LDGESLRPLFSGGKLE--RDALYFHYPHFAFHKDNR 404 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 + +R ++K + +++L D E+ + H Sbjct: 405 PGSVIRSGQYKLILRHD--------------------DDSVELYDLQNDLSETSDLAAVH 444 Query: 522 IPMGVPLQTEMHAYMEILKKYPPRAQ 547 + L+ + ++E P + Sbjct: 445 PDVAQELKGRLMEWLEATGAGMPEKR 470 >UniRef50_C6Y1U6 Sulfatase n=2 Tax=Sphingobacteriales RepID=C6Y1U6_PEDHD Length = 523 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. Identities = 127/540 (23%), Positives = 194/540 (35%), Gaps = 76/540 (14%) Query: 59 MPVMQHPAQDKETQQKLAELEKKTGKK-PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 V +L+ + KK PN+V L DD+G+ D+ G TP I Sbjct: 11 TWVTAFAFSVCLVLLSKTKLQAQQQKKLPNIVYILADDLGYGDIKIYNAG--AKVNTPHI 68 Query: 118 DAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYG--QPGGLQGLTTLPQ 174 D +A QG+ T A++ +P+R +ILTG+Y + + + G + +GL T+ Sbjct: 69 DKLAEQGMRFTDAHTTSSVCTPSRYSILTGRYPWRSRLPVGVLRGYSRTLIEEGLPTVAG 128 Query: 175 LLHDQGYVTQAIGKWHMGENK------------------------ESQPQN--------- 201 LL Y T IGKWH+G + E P Sbjct: 129 LLKTSSYRTAVIGKWHLGLDWMPKEAFKDSINPAFNKDRLYGITDEMNPDQIDFGRAPVR 188 Query: 202 ----VGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 GFD + DM ++ + P S R G + Sbjct: 189 GPRTQGFDYSYVLPASLDMPPYA---YLENDQLTEPLTGYTPGNKLASGYTGPFWRAGLK 245 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 D + + + F+ K A + PFFLY+ H P A+Y G S Sbjct: 246 SPSFDFYG-----VLPAFTNKATDFIKKEAATKNPFFLYFPMPAPHTPWMPTAEYRGKSQ 300 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP------EAEVPPHGRTPF 371 A YGD + E++ + + L+ G NTL+VFTSDNGP + H PF Sbjct: 301 A-GEYGDYLQEVDAAVGKILQVLDSLGLSKNTLVVFTSDNGPYWRDDFVQQYGHHAAGPF 359 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 RG KG +EGG RVP V + G ++ S+ LA+L T DL G+ + Sbjct: 360 RGMKGDAYEGGHRVPFIVRYPGKVKAGTISNVTTTLANLMATCADLTGNHAVQFETE--- 416 Query: 431 TTFIDGVDQTSFFLGTNGQ-SNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 D LG + + A + +R +K + S + Sbjct: 417 ----DSYSILPVLLGKAAGIAEQPAIVNISSKGFYDIRKGPWKLITGLGSGGFSVPSIVK 472 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 ++NL TD +E ++ R+ L + +K P + K Sbjct: 473 APEGQAAG-----QLYNLDTDIKEETNLYSRYPEKVKELSALLEK----IKAAPKGKRAK 523 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. Identities = 117/509 (22%), Positives = 172/509 (33%), Gaps = 107/509 (21%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 + +PNV+VFL DD+G+ ++G G TP++D +A+ G+ T YS Sbjct: 9 AASLQASARPNVIVFLADDLGYGELGCYGQKKI---KTPNLDQLAADGMRFTDFYSGHAV 65 Query: 136 SSPTRATILTGQYSIHHGILMPPMYG------------------QPGGLQGLTTLPQLLH 177 +P+R +LTG+++ H + Q T L Sbjct: 66 CAPSRCVMLTGKHTGHSFVRENSEGRAAQAKERNRIKAADGYLPQIALPASEATYASALQ 125 Query: 178 DQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 GY T +GKW +G + E P GFD F G+ S + + Sbjct: 126 KSGYRTACVGKWGLGHPSNEGSPNKHGFDLFYGYISQWQAHYYYPTYL------------ 173 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + D + G + + + +KF++ PFFLY Sbjct: 174 -------WRNDVKEPLEGNDGKVG-------RQYAADLMEQEALKFMETT--GGGPFFLY 217 Query: 297 YGTRGCHFDNYPNAK-----------------------YAGSSPARTSYGDCMVEMNDVF 333 Y T H Y + R Y + M+ Sbjct: 218 YATPVPHVSLQVPPDEPSLAEYKQAFAGQDPPYDGRKSYLPTEDPRAIYAAMVTRMDRTL 277 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTF 388 L++ GQ NTLI+FTSDNG G P RG K W+GG+R P Sbjct: 278 GKFRDLLKRTGQDQNTLIIFTSDNGATFNGGYDREFFGGNQPLRGMKTQLWDGGIRTPFI 337 Query: 389 VYWKGMIQPRKSDGIV-DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 W G IQP + V DLFPT ++ G P +DGV G Sbjct: 338 AAWPGSIQPGQVSRFVGASWDLFPTFAEIVGFPVPAG---------LDGVSILPTLKGEV 388 Query: 448 GQSNRKAEHYFLN--GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVF 505 + Y+ G AVRM +K +A +F Sbjct: 389 ATQKQHDHLYWETVAGGHQAVRMGPWKGIR----------------LGVIKNPSAPVQLF 432 Query: 506 NLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 NL TD E+ + +H + + T M A Sbjct: 433 NLETDVSETTDVAAQHPDIVAKIATIMSA 461 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 444 bits (1144), Expect = e-123, Method: Composition-based stats. Identities = 106/493 (21%), Positives = 179/493 (36%), Gaps = 76/493 (15%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSP 138 K+ ++PN+++ + DD G+ D +G TP I +A++G+ T Y+ +P Sbjct: 28 AKEKSRQPNIILVMTDDQGYWDTEISGNPKI---KTPTIKKLAAEGVTFTRFYANMVCAP 84 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 TRA ++TG++ + G+ G G TT+ Q+L GY T GKWH+G + Q Sbjct: 85 TRAGLMTGRHYLRTGLYNTRFGGDTLG-PNETTIAQVLQKAGYKTGLFGKWHLGRYAQYQ 143 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 PQ GFD F G + + NP+ + Sbjct: 144 PQRRGFDHFFGHYHG------HIERYTNPDQVVVNGTPV--------------------- 176 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN--------- 309 + + + D + F+ + +PFF Y H + Sbjct: 177 -------ETRGYVTDLFTDAAIDFIQR--NQQQPFFCYLAYNAPHSPFLLDTSHFGQPEG 227 Query: 310 ----AKY--AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 KY G + ++ + L +T+ T+++FTSDNG + Sbjct: 228 DKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSDNGGVSRG 287 Query: 364 PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGA 422 G +G+K S +EGG RVP V W +D +V DLFPT LAG P Sbjct: 288 FKAG---LKGSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLFPTFCQLAGVP-- 342 Query: 423 KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYA 482 VP +DG S G+S + ++ + R YH Sbjct: 343 -----VPSNVKLDGESILSLMEQGGGKSPHQYLYHTWD------RYTPNPYHRWAIHGPR 391 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK- 541 + G+ +++L DP E ++ ++ L+ E + + + Sbjct: 392 FKLVGHDPQGKKKKEGEPQGQLYDLQEDPGEKKNVADQYPEKVSELRGEFLRWFQDVTAG 451 Query: 542 ---YPPRAQIKSD 551 P + + Sbjct: 452 QVYEPAAIPVGDE 464 >UniRef50_UPI0001A444F6 arylsulfatase A n=1 Tax=Pectobacterium carotovorum subsp. brasiliensis PBR1692 RepID=UPI0001A444F6 Length = 487 Score = 444 bits (1144), Expect = e-123, Method: Composition-based stats. Identities = 126/498 (25%), Positives = 204/498 (40%), Gaps = 54/498 (10%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 L ++ KPNV++ DD+GW D+ G PTP +D +A+ G T+ Sbjct: 16 AGPALWQVAAAAQTKPNVIILFTDDMGWADMSVQGAKT----PTPHLDKLAATGQRWTNF 71 Query: 131 YSQ-PSSSPTRATILTGQYSIHHGILMPPMYG------QPGGLQGLTTLPQLLHDQGYVT 183 Y SSP+R ++TG+ G+ + G G ++ + L GY T Sbjct: 72 YVSSAISSPSRGGLMTGRIETKTGLYGTKIPGVFMDEDPDGFPDDEISMAESLQHNGYRT 131 Query: 184 QAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDR-------- 235 GKWH+G + P GFD++ G + +D ++ D +A S + Sbjct: 132 IMYGKWHLGTQSTAFPTRHGFDEWYGIPTSNDRFSTVVDQVEMNRLASSDPKRRELLSKM 191 Query: 236 ---SEYIKQLPFSKDDVHAVR-GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 + +Q ++ H+ + G+Q A + + V+++ D+ Sbjct: 192 EEINRAPRQEYWNVPLYHSYKDNGKQVDYAVPQGFQQASFTKDVTNKAVQYI--ADNKDQ 249 Query: 292 PFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 FF+Y H + + ++ G YGD M+E++ +Y+ LE N +NT++ Sbjct: 250 SFFMYMAYPQTHVPLFTSPEFKGK--GHNPYGDVMLEIDWSVGQIYQALEANKLAENTIV 307 Query: 352 VFTSDNGPEAEVP----PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA 407 +FTSDNGP + P R K + +EGG RVP V WK I P+ D I Sbjct: 308 IFTSDNGPWLQYDKDGLAGSALPLRSGKSTVFEGGQRVPFIVNWKSHIAPKVVDDIGSTL 367 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVR 467 DL PT + + G A+ +DGVD ++ FL S R YF GK+ A R Sbjct: 368 DLLPTLMKITGSQHAQ--------RDLDGVDLSAAFLNGK-PSARTFMPYFYWGKMDAYR 418 Query: 468 MDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVP 527 ++K ++ G + +FNL D E + + Sbjct: 419 DGDYKVVFRDKK-------------AGIPVDLEKPLMFNLRDDVSEQHDLSAKEPDRYRA 465 Query: 528 LQTEMHAYMEIL-KKYPP 544 L + AY + L +K PP Sbjct: 466 LIEKARAYEQSLGEKKPP 483 >UniRef50_A6DQE3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQE3_9BACT Length = 489 Score = 444 bits (1143), Expect = e-123, Method: Composition-based stats. Identities = 112/499 (22%), Positives = 191/499 (38%), Gaps = 71/499 (14%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 T K PN+V DD+G+ DV TP ID VA QG+I T +S +P Sbjct: 18 ANTDKLPNIVYIYADDLGYGDVSCLNPNGL--ISTPSIDKVAQQGMIFTDCHSSASVCTP 75 Query: 139 TRATILTGQYSIHHGILMPPMYG--QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK- 195 +R +++TG+YS + + G + G T+ LL + GY T IGKWH+G N Sbjct: 76 SRYSLMTGRYSWRSSLKKGVLTGYKKAIIEDGRMTVASLLKENGYNTAMIGKWHLGMNWA 135 Query: 196 ---------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 + P + GFD F G ++ D + + + + V + + Sbjct: 136 LNSKNNKKIDYSRAIKKTPTSNGFDYFYGISASLD-FPPYIYIENDRAVGEPTEHIDLSF 194 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 + + + ++ + +++K +KPFFLY+ Sbjct: 195 NQGIDR---------HGRPGPIEPKFKVNNVLTELTQKTTAKISELSKQEKPFFLYFSLT 245 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-- 358 H P ++ G S YGD ++E + + K ++ N NTL++ +SDNG Sbjct: 246 SPHTPCAPADEFIGKSSL-GLYGDFVMETDYRIGQVIKAIKDNDIEHNTLVIISSDNGCA 304 Query: 359 ---PEAEVPPHGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFP 411 G P FRG KGS +EGG RVP V W ++ +D V Sbjct: 305 TYIGHEAFQTKGHYPSYIFRGYKGSLFEGGHRVPYIVKWPAKVKAGALNDTPVSQVGFLA 364 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEF 471 T ++ G +P D V L N + ++ + A+R +E+ Sbjct: 365 TCAEIVGAE-------LPDNAGEDSVSNLPAMLSLNKKPIWESFIHKNGRGGLAIRHNEW 417 Query: 472 KYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 K + T +++NL D +E ++ +++ + L Sbjct: 418 KLIL-----------------------TKVPALYNLKNDIKEQKNLALQYPEIVSRLTKL 454 Query: 532 MHAYMEILKKYPPRAQIKS 550 + Y++ + P Q + Sbjct: 455 LQKYVDDGRSTPGEKQQNT 473 >UniRef50_C1ZFQ0 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZFQ0_PLALI Length = 522 Score = 444 bits (1143), Expect = e-123, Method: Composition-based stats. Identities = 125/509 (24%), Positives = 207/509 (40%), Gaps = 61/509 (11%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 ++PN+++ L DD+G+ D+ T ID +A +G+ T A+S +PTR Sbjct: 31 AAEQPNILLILADDLGYGDLRCYNSQ--SKVSTSHIDRLAREGMRFTDAHSPSTVCTPTR 88 Query: 141 ATILTGQYSIHH---GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG---EN 194 ++TGQ G + + G G TLP +L ++GY T +GKWH+G + Sbjct: 89 YGLMTGQMPFRAPSGGTVFTGVGGPSLIAPGRLTLPMMLRERGYSTACVGKWHIGLTFFD 148 Query: 195 KESQPQN----------------------VGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 +E +P + GFD F G T+W + + Sbjct: 149 REGRPIHSNALEAVRQVDFSRRIDGGPVDHGFDSFFGTACC--PTTDWLYAFIENDRVPV 206 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA--KSD 290 P + K H R G + ME++D +++ +FL++ Sbjct: 207 PPTASLEKSALPKHPYAHDCRPGLI-----ASDFAMEEIDLIFLEKSRQFLNQHVRQNPG 261 Query: 291 KPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 KPFFL++ T+ H ++ ++ G S A +GD ++E++ + L K+LE+ +NTL Sbjct: 262 KPFFLFHSTQAVHLPSFAAKQFQGKSEA-GPHGDFLLELDYIVGELMKSLEELHIAENTL 320 Query: 351 IVFTSDNGPE--------AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD- 401 ++FTSDNGPE ++ G P+RG K WEGG RVP V W G ++P ++ Sbjct: 321 VIFTSDNGPEVTSVIHMRSDHGHDGARPWRGMKRDAWEGGHRVPFIVRWPGKVRPGTTNS 380 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG 461 + L D+ T + +P D + +L + R G Sbjct: 381 QLTSLTDVMATVAAIV-------DTQLPDHAAEDSFNMLPAWLDESAPPIRPYLLTQSFG 433 Query: 462 KLA--AVRMDEFKY--HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 A+R E+KY H + A ++NL TDP ES ++ Sbjct: 434 GSRTLAIRQGEWKYLDHTGSGGNRYENDPSLKPFILPDAAPDAPGQLYNLSTDPGESTNL 493 Query: 518 GVRHIPMGVPLQTEMHAYMEILKKYPPRA 546 + L+T + + P R Sbjct: 494 YHARPEVTSRLKTLLEQSKTNGRSRPTRP 522 >UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000586CBD Length = 525 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 120/472 (25%), Positives = 201/472 (42%), Gaps = 45/472 (9%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRA 141 K+PN+++F DD+G+ D+ G + TP++ +A+ G++LT YS P SP+RA Sbjct: 22 AKRPNIIIFYADDLGYGDLEPYGHPTSS---TPNLGRLAAGGIVLTQFYSSSPVCSPSRA 78 Query: 142 TILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE--S 197 +LTG+Y + G+ + G T + ++L +GY + A+GKWH+G Sbjct: 79 ALLTGRYQMRSGVYPHVFNVEMSGGLPLNETLISKMLKPEGYRSAAVGKWHLGLGNNSVY 138 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P N GFD+F G + + N +P EY F+ Sbjct: 139 LPHNHGFDEFLGLPASPSQCRCSVCFYPNVTCHRAPCSPEYSPCALFNG----------- 187 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP 317 P + LD ++ +F+ ++ PFFLYY + H Y + +G+S Sbjct: 188 -TTIIEQPADLLTLDDKYAMQSRRFIRTNVETGTPFFLYYASHHTHHPQYAGKETSGTSI 246 Query: 318 ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE-VPPHGRTPF-RGAK 375 R +GD + ++ +Y+ L++NG L++T F+SDNGP G + K Sbjct: 247 -RGRFGDSLAALDWEVGQIYEELKENGILEDTFFFFSSDNGPSLSLENFGGNAGLMKCGK 305 Query: 376 GSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 +T+EGG+RVP V+W G I P +S + D+ PT + +D Sbjct: 306 ATTYEGGIRVPAIVHWPGQITPGRSMELSSTLDVLPTIASITNAKLP--------NVTLD 357 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLN-----GKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 G D + F S R++ Y+ + K AVR ++K + Sbjct: 358 GYDMSPFLF-QGMPSLRESFFYYPSKVDTEHKSYAVRYKQYKAVFYTEGSALSNNKNKDV 416 Query: 491 GFTGTVMQT--AGSSVFNLYTDPQESDSIGVRH-IP-----MGVPLQTEMHA 534 GT ++T +F+L DP E +I + H ++ + A Sbjct: 417 DCRGTSLRTYHDPPMLFDLEQDPSEQYNISINHSPERDIILKLTKMRADFDA 468 >UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO Length = 503 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 114/523 (21%), Positives = 203/523 (38%), Gaps = 70/523 (13%) Query: 58 MMPVMQHPAQDKETQQKLAEL-EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD 116 ++ ++ Q A ++ +PN+V L DD G+ DVG++G + TP Sbjct: 5 LLVLIAIITACAVADQSSASAGTRQDAGQPNIVFVLADDFGFHDVGYHGSEI----HTPT 60 Query: 117 IDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYG-QPGGLQGL-TTLPQ 174 +DA+++ G+ L + Y QP +PTR+ +++G+Y IH G+ + QP L TL Sbjct: 61 LDALSASGVRLENYYVQPICTPTRSQLMSGRYQIHTGLQHGIINSCQPNALPNDSPTLAD 120 Query: 175 LLHDQGYVTQAIGKWHMGENK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP 233 L + GY T +GKWH+G K E P N GFD + G+ + ++ Y Sbjct: 121 KLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGYLNAAEDYFNHNVPWRQVRYLDLR 180 Query: 234 DRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPF 293 D + ++ + + + + + + + KP Sbjct: 181 DNNGPVRN------------------------ETGQYSAHLFTGKAIDVV-QSHNTSKPL 215 Query: 294 FLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 FLY + H KY R ++ + +++ ANL + L+ G +N Sbjct: 216 FLYLAYQSVHAPLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGLWNN 275 Query: 349 TLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLA 407 T+++F++DNG + + P RG K S WEGG FV + + S G++ ++ Sbjct: 276 TVLIFSTDNGGQIHAGGN-NYPLRGWKASLWEGGFHGVGFVSGGALKRSGAVSKGLIHVS 334 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY---------- 457 D FPT + LAG + T +DG +Q S R+ + Sbjct: 335 DWFPTLVTLAG-------GNLNGTKPLDGFNQWDTISNET-PSPREILLHNIDILYPQKG 386 Query: 458 -------FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT---AGSSVFNL 507 + AA+R+ ++K ++ + +Q +FN+ Sbjct: 387 VPLYSNTWDTRVRAAIRVGDYKLITGDPGNGSWVPPPDGHLYFVPEIQESAAKNVWLFNI 446 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 DP E + + + L + + PPR Sbjct: 447 TADPNEHNDLSSEKPLEVLRLLQILVQFNNT--AVPPRYPAPD 487 >UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG53_9BACT Length = 515 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 115/503 (22%), Positives = 194/503 (38%), Gaps = 62/503 (12%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + PN+++ L DD+G + G G PTP +D + +QG+ T A+S +P Sbjct: 27 AAKTETPNIILILADDMGIDSIQALNGK--SGIPTPHLDRLLTQGIHFTDAHSGSAVCTP 84 Query: 139 TRATILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK- 195 TR +LTG+Y+ + + + +P + TLP +L +GY T IGKWH+G + Sbjct: 85 TRYGVLTGRYAWRSRLKKSIVRQWERPLIEKDRLTLPGMLKKKGYNTACIGKWHLGWDWP 144 Query: 196 -------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 E P GFD + G + P V + R Sbjct: 145 KKGGGFTEKMKEIDFSEKIEGGPAGCGFDYYFG----------DDVPNWQPFVWIENGRM 194 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 + S + +E + + + V+++++ A++ +PFFLY Sbjct: 195 LGVPNKQLS-----FASHYHSGKGIGVEGWDLEAVLPKITEKSVEYINQQAETKQPFFLY 249 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 + H P+ + G S + Y D ++E + + K L+ G DNTL++FT+D Sbjct: 250 FSMTSPHTPIAPSKPFQGKS-GISRYADFLMETDWCVGQIMKALKDRGIADNTLLIFTAD 308 Query: 357 NGPEA--------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLA 407 NG E + +RG K +EGG RVP V W G I+P KSD + L Sbjct: 309 NGTSPKCNFTELREKRTDLQNHWRGMKADAFEGGHRVPFIVSWPGHIKPGSKSDQTISLV 368 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN-RKAEHYFLNGKLAAV 466 D+ T D + A D V G + + +A + V Sbjct: 369 DIMATCADAVALTLSDSAAE-------DSVSLMPVLKGEDIATPLHEAVICHSISGVFVV 421 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGV 526 R ++K +++L +DP+E++++ H + Sbjct: 422 RKGKWKLQYSAGSGGLSLPKDKN----AKKKGLPTWQLYDLSSDPKETNNLINGHQEIVK 477 Query: 527 PLQTEMHAYMEILKKYPPRAQIK 549 L + Y+E + P Q Sbjct: 478 DLTAILRRYIENGRSTPGTPQKN 500 >UniRef50_Q1CY93 Sulfatase family protein n=4 Tax=Bacteria RepID=Q1CY93_MYXXD Length = 553 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 145/496 (29%), Positives = 217/496 (43%), Gaps = 33/496 (6%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 K +KPN++V DD+G ++ G +G TP+ID +A +G ++T Y Q S + R Sbjct: 16 KQSRKPNILVIWGDDIGIWNISAYNQG-MMGYFTPNIDRIAKEGAMMTDCYGQQSCTAGR 74 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 A +TG + G+ M G GLQ T+ ++L GY GK H+G++ P Sbjct: 75 AAFITGMNPLRTGLTTIGMPGAKYGLQDSDPTIAEMLKPLGYTCGHFGKNHVGDSNPYLP 134 Query: 200 QNVGFDDFRGFNSVSDM--YTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG-- 255 GFD+F G + E D +P ++ +DD + Sbjct: 135 TVHGFDEFFGNLYHLNAEGEPECPDYPKDPTFKERFGPRGVLRSWATDRDDPTEDKRWGV 194 Query: 256 ----EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + +T K ME +D ++ + F+++ K KPFFL++ T H Y K Sbjct: 195 VGKQRIEDTGALTRKRMETVDGEFLQGTLDFMERAVKDGKPFFLWHNTTRTHVWTYLQEK 254 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-HGRTP 370 Y ++ Y D M E++D+ L L++ G DNTL+VF++DNG E P G +P Sbjct: 255 YRNAT-GYGLYADAMRELDDIVGVLLAKLDELGIADNTLVVFSTDNGVEKMGWPDGGNSP 313 Query: 371 FRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGA------- 422 FRG KGSTWEGGVRVP V W G+++P + I D PT + AG P Sbjct: 314 FRGEKGSTWEGGVRVPCMVRWPGVVEPGRVINDIFAHEDWMPTLVSAAGGPKDLVAQCQR 373 Query: 423 -KVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 A ++DG DQT G + + +G LAAVR D++K Q+ Sbjct: 374 GYKAGDKTFRVYLDGYDQTGLLAGKEKGPRHEFIYVLDSGNLAAVRYDDWKLIFSYQEG- 432 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH-IPMGVPL------QTEMHA 534 G F+G A + NL +DP E + G + Q + Sbjct: 433 ----EGPDMWFSGKRFDPAWPYLINLRSDPFEYGPKAGLYLKWYGERMFTFVPAQALVQK 488 Query: 535 YMEILKKYPPRAQIKS 550 + + L YPP S Sbjct: 489 FAQSLLDYPPSQAPGS 504 >UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF72_9BACT Length = 481 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 112/507 (22%), Positives = 181/507 (35%), Gaps = 106/507 (20%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTRATI 143 KPNV++ L+DD+GW D G + TP++D ++ G+ T AYS SPTR++I Sbjct: 23 KPNVIMILVDDLGWTDTTCYGSDL---YQTPNVDELSRTGMRFTDAYSACTVCSPTRSSI 79 Query: 144 LTGQYSIHHGILMPPMYG------------QPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 +TG+ ++ + + TL + GY T IGKWH+ Sbjct: 80 MTGKNPANNNLTDWITGHVKPYAKLKSPNWKMHLTAEEITLAEAFKATGYKTVHIGKWHL 139 Query: 192 GENKESQPQNVGFDDFR-GFNSV---SDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 GE S P+N GFD+ GF + + + + NP + P Sbjct: 140 GEESVSWPENQGFDENIAGFRAGSPSAHGGGGYFSPYNNPRLKDGPKG------------ 187 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 E L +R +++ AK KPFF+ H Sbjct: 188 ---------------------EYLTERLAQEASQYIQSTAKLKKPFFMNLWLYNVHTPLQ 226 Query: 308 PNAKYAGSS---------PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 + Y + M+D + + ++ G DNT+I+F SDNG Sbjct: 227 ARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVKDAGIEDNTIIIFNSDNG 286 Query: 359 PEAEVPPH------GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFP 411 + P R KG +EGGVRVP + W I+ S V D++P Sbjct: 287 GLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKAGQTSSSPVISHDIYP 346 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTS-FFLGTNGQSNRKAEHYFLNG-------KL 463 T LDL + K IDG+ G Q R A ++ Sbjct: 347 TLLDLCKIDVS-------KKQDIDGISLVPELLEGKTIQ--RDALYWHYPHYHLEGAKPY 397 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 +A+R ++K L ++ +A ++NL D E +++ + Sbjct: 398 SAIRKGDWKLIFLYEESHA--------------------ELYNLRNDISERNNLAMTEKR 437 Query: 524 MGVPLQTEMHAYMEILKKYPPRAQIKS 550 L ++ + + + P Sbjct: 438 KLAELMGDLRTWKKKIGAQLPVFNPNY 464 >UniRef50_Q7UNN1 Arylsulphatase A n=3 Tax=Bacteria RepID=Q7UNN1_RHOBA Length = 529 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 119/516 (23%), Positives = 205/516 (39%), Gaps = 66/516 (12%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY- 131 L+E +PNV+V + DD+G+ D+G G A G TP+ID +AS+G TS Y Sbjct: 32 SPLSETSAADNDRPNVIVVMADDLGYGDIGCYG---AKGLETPNIDQMASEGCRFTSGYC 88 Query: 132 SQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQ-GLTTLPQLLHDQGYVTQAIGKWH 190 S + +PTR + LTG Y+ P + G TT ++L + GY T IGKWH Sbjct: 89 SASTCTPTRYSFLTGTYAFRFPNTGIAPPNSPALIPAGTTTTARILKNAGYKTAVIGKWH 148 Query: 191 MGENKES-----------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPD 234 +G +++ P +GFD + +D + +++P L Sbjct: 149 LGLGEKNEGPDWNGDLKPGPLEIGFDHCILLPTTNDRVPQVYVNDHNVENLDPADPLWVG 208 Query: 235 RSEYIKQLP------------FSKDDVHAVRGGEQQAIADITPK----YMEDLDQRWMDY 278 + + P +S + G + EDL RW++ Sbjct: 209 NKKPSEDHPTGITHRDTLKMDWSHGHNSTIHNGISRIGFYTGGHAARFRDEDLSDRWVEE 268 Query: 279 GVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYK 338 +++ + ++PFFL++ + H + ++ G S GD + E++ L K Sbjct: 269 SKRWIAE--NREEPFFLFFASHDLHVPRVVHERFQG-STKLGPRGDAIAELDWCVGELMK 325 Query: 339 TLEKNGQLDNTLIVFTSDNGPEAE-----------VPPHGRTPFRGAKGSTWEGGVRVPT 387 +LE+NG + T++VF SDNGP + P++G K + +EGG R P Sbjct: 326 SLEENGLTEKTMLVFCSDNGPVLDDGYKDDANEKLGNHDPNGPYQGGKYTVYEGGTRTPF 385 Query: 388 FVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 G I SD +V D + + G +P +D + + + Sbjct: 386 ITRMPGTIPVGVSDEMVCTIDFAASLAAMVG-------QELPNDASLDSQNVLGALMNQS 438 Query: 448 GQSNRKAEHYFLNG--KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVF 505 G S R+ NG R+ ++K Q+ Y + T +++ Sbjct: 439 GASGREHLVQQDNGKVGNYGYRVGDWKLVRHDQK------KSYNFDLSMTRKPVPQFALY 492 Query: 506 NLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 NL +DP E + + +Q E+ ++ + Sbjct: 493 NLESDPAEQNDLSDSEPERAKQMQQELQKLLDAGRS 528 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 443 bits (1140), Expect = e-123, Method: Composition-based stats. Identities = 116/482 (24%), Positives = 182/482 (37%), Gaps = 68/482 (14%) Query: 86 PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATIL 144 PNV+ L DD+G+ ++G G TP IDA+A +G+ T YS P +P+R +L Sbjct: 20 PNVIYILADDLGYGELGCYGQEKI---KTPHIDALAKEGMRFTRHYSGAPVCAPSRGVLL 76 Query: 145 TGQYSIHHGILMPPMY---GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQ 200 +GQ I + GQ + TL Q+ D+GY T A GKW +G S P+ Sbjct: 77 SGQQLSKAYIRNNREHKPEGQEPIPEPGMTLAQIFKDKGYATGAFGKWGLGYPGSSSDPK 136 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 +GFD F G+N ++ + P S D++ I + P AV Sbjct: 137 ALGFDTFYGYNCQRVAHSFY------PPHMWSNDKNITINEKPVPGHWRKAV-----GPD 185 Query: 261 ADITPKYME-DLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA- 318 D + Y E +D +KF+ DKPFF Y H +P + S P Sbjct: 186 FDFSQFYAENYAPDLILDEALKFIKD--NKDKPFFAYLPFVEPHLAMHPPHSWVDSYPKE 243 Query: 319 ------------------RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 R Y + ++++ ++ + L++ ++NTL++FTSDNG Sbjct: 244 WDSPKESYKAAYLPHLRPRAGYAAMISDLDEHVGSVMQLLKELDLVENTLVIFTSDNGAS 303 Query: 361 A-----EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTAL 414 + RG KGS +EGG+RVP +W G I + + SD + D+ T Sbjct: 304 HCIEVDHEFFNSTKDLRGLKGSVYEGGLRVPMIAHWPGKIKKAQVSDHVSGFVDVMATFC 363 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVR-MDEFK 472 DL + + DGV G + F A+ +K Sbjct: 364 DLLQTEAPQTS---------DGVSFLPTLKGEKQEPQPVLAWEFQGYSGQQAIILDGRWK 414 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 T +++L DP E + + + + M Sbjct: 415 GVR----------QNLSPRGKKKAKSTPKWELYDLNKDPNEKTDLATQMPEIVDRIHKAM 464 Query: 533 HA 534 Sbjct: 465 MK 466 >UniRef50_A6DG39 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG39_9BACT Length = 473 Score = 443 bits (1139), Expect = e-122, Method: Composition-based stats. Identities = 114/478 (23%), Positives = 196/478 (41%), Gaps = 56/478 (11%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 +KPN+V+FL DD+G+ D G P ID +A +G+ T A+S + +P+R Sbjct: 22 EKPNIVIFLADDLGYGDCGAFNSQ--SKIKMPHIDRLAEEGMRFTDAHSASATCTPSRYG 79 Query: 143 ILTGQYSIHHGILMPPMY-GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ--- 198 +LTG + G+ + G+P + TL LL + Y T +GKWH+G +S+ Sbjct: 80 LLTGINPVRTGVFNTLLKTGRPIIHKDEMTLADLLKVEDYETWMVGKWHLGFENKSKSLD 139 Query: 199 --------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 P + GFD F G +P + + + + D + Sbjct: 140 LSQDLRGGPLDCGFDYFFGLA---------SSASSSPLCFIKNRKIQEVSSEFVEVDKIR 190 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK--SDKPFFLYYGTRGCHFDNYP 308 + IA +ED+ R + V + + AK ++PF LY+ + H P Sbjct: 191 GSGQKSKYKIAVPKDLKLEDVSPRLSENAVGLIQEYAKSAKEQPFLLYFASIAPHQPWVP 250 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP-------EA 361 + + G S Y D +++M+D + + L+ G NT+++FTSDNG A Sbjct: 251 SENFKGKS-GLGVYADFVMQMDDELGQINQALKDTGLEKNTIVIFTSDNGTGPGAHYLMA 309 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHP 420 E H P RGAK S++EGG R+P W G+I +S +++ D+F T +L Sbjct: 310 EQGHHSSGPMRGAKASSYEGGHRMPFIAKWPGIIPVNSQSKAVINATDIFATIAELLKVD 369 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQP 480 + V D + N + +R + ++RM ++K Sbjct: 370 LKEKYPQVAP----DSFSFYKNLINLNQKQSRPSMVV-----RESIRMGDWKLI------ 414 Query: 481 YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 G ++ + ++NL +D E + + H + E +M+ Sbjct: 415 ------SSGGKKEFDSLKMSQFKLYNLSSDLAEKNDLAPSHPERAQEMYKEFKKFMDQ 466 >UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deuterostomia RepID=GALNS_HUMAN Length = 522 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 126/494 (25%), Positives = 205/494 (41%), Gaps = 59/494 (11%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSS 137 + PN+++ L+DD+GW D+G G TP++D +A++GL+ + YS P S Sbjct: 24 ASGAPQPPNILLLLMDDMGWGDLGVYGEP---SRETPNLDRMAAEGLLFPNFYSANPLCS 80 Query: 138 PTRATILTGQYSIHHGILMPPMYGQ---------PGGLQGLTTLPQLLHDQGYVTQAIGK 188 P+RA +LTG+ I +G + + G LP+LL GYV++ +GK Sbjct: 81 PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGK 140 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G + P GFD++ G +P P ++ +P +D Sbjct: 141 WHLGHRPQFHPLKHGFDEWFG----------------SPNCHFGPYDNKARPNIPVYRDW 184 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 R E+ I T + +L Q ++ + F+ + A+ PFFLY+ H Y Sbjct: 185 EMVGRYYEEFPINLKTGE--ANLTQIYLQEALDFIKRQARHH-PFFLYWAVDATHAPVYA 241 Query: 309 NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP--- 365 + + G+S R YGD + E++D + + L+ DNT + FTSDNG P Sbjct: 242 SKPFLGTSQ-RGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG 300 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKV 424 PF K +T+EGG+R P +W G + S + + DLF T+L LAG Sbjct: 301 GSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTP--- 357 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQ-QPYAY 483 P IDG++ L G+ + Y+ L A + + K H + Sbjct: 358 ----PSDRAIDGLNLLPTLL--QGRLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWEN 411 Query: 484 TQSGYQGGFTGTV---------MQTAGSSVFNLYTDPQESDSI---GVRHIPMGVPLQTE 531 + G V T +F+L DP E + + + + Sbjct: 412 FRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQEALSRITSV 471 Query: 532 MHAYMEILKKYPPR 545 + + E L P+ Sbjct: 472 VQQHQEALVPAQPQ 485 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 107/510 (20%), Positives = 178/510 (34%), Gaps = 110/510 (21%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-S 132 + ++PN+V+F +DD+GW DVGF G TP ID +A + + T+AY + Sbjct: 18 SMLSQASAENQRPNIVLFFIDDLGWRDVGFMGSDF---FETPHIDRLADESMKFTAAYSA 74 Query: 133 QPSSSPTRATILTGQYSIHHGILMP--------------PMYGQPGGLQGLTTLPQLLHD 178 P+ +P+RA +++G Y+ HG+ P TT+ L Sbjct: 75 APNCAPSRACLMSGLYTPRHGVYTVGDPARGNDRYRKLIPAENNRVLDDRFTTIADRLSQ 134 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY ++GKWH+G P + GF N + + NP+++ Sbjct: 135 AGYRCASVGKWHLG----QSPLSQGFQVNIAGNQTGSPRGGYFSPYQNPQLS-------- 182 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + E L R +F+ PFFLY Sbjct: 183 -------------------------DGEQGEFLTDRLTTAACQFIKD--NQGSPFFLYLT 215 Query: 299 TRGCHFDNYPNAKY--------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 H + AG +Y + M+ + +TL + NT+ Sbjct: 216 HYAVHTPLQAKKEDIAYFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLREQQLDQNTI 275 Query: 351 IVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADL 409 +VFTSDNG P P RG+KG +EGG+RVP + W G+ QP + V DL Sbjct: 276 VVFTSDNGG--YGPATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTGEAVINVDL 333 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF----------- 458 +PT L++ P + +DG + ++ + Sbjct: 334 YPTFLEMTNIPVLESEL-------LDGESLVPLLKDPQTRLESRSLFWHFPAYLQKYQGM 386 Query: 459 ----LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 ++ +R ++K + + ++N D ES Sbjct: 387 QQRFRTTPVSVIRQGDWKLLEFFEDGHQ--------------------ELYNTRLDIGES 426 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 + H L +H + + +K P Sbjct: 427 KELSGSHPEKTQELSQALHRWQKQVKAAIP 456 >UniRef50_C6I6Z4 N-acetylgalactosamine-6-sulfatase n=11 Tax=Bacteroidetes RepID=C6I6Z4_9BACE Length = 504 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 118/520 (22%), Positives = 183/520 (35%), Gaps = 94/520 (18%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 + L+ + +PNV+ ++DD+G+ D+G G TP+ID + G+ T Sbjct: 11 SVFALSAKSQVKESRPNVIYIIMDDLGYGDIGCYGSEKI---ETPNIDRLYKDGISFTQH 67 Query: 131 YS-QPSSSPTRATILTGQYSIHHGIL-------------------MPPMYGQPGGLQGLT 170 Y+ P S+P R ++TG +S H I P + GQ Sbjct: 68 YTGSPVSAPARCVLMTGMHSGHAQIRANDEMAYRGAIMNYDSMYVHPGLEGQYPLKAHTM 127 Query: 171 TLPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 TL +++ GYVT GKW +G E P GFD F G+N ++ + P Sbjct: 128 TLGRMMQQAGYVTGCFGKWGLGAPGTEGTPNKQGFDSFYGYNCQRQAHSYY------PAF 181 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYM-EDLDQRWMDYGVKFLDKMAK 288 + Y+ G + + A E + D + F+ + Sbjct: 182 LYKNEDRVYLANKVLDPHTTKLDAGADPRDEAAYAKFSQKEYANDLIFDELISFVGQ--N 239 Query: 289 SDKPFFLYYGTRGCHFDNYPNAK--------------YAGSS------PARTSYGDCMVE 328 KPFFL + T H K Y G + +Y + Sbjct: 240 RKKPFFLMWTTPLPHVSLQAPEKWVKYYVGKFGDEAPYIGKAGYMPCRYPHATYAAMISY 299 Query: 329 MNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV-----PPHGRTPFRG----AKGSTW 379 ++ L + L+K DNT+I+FTSDNGP PFR K Sbjct: 300 FDEQIGKLIEKLKKERLYDNTVIMFTSDNGPTFNGGSDSPWFDSGGPFRSEYGWGKCFVH 359 Query: 380 EGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 EGG+R+P V W G I+P SD I D+ PT D+A + DG+ Sbjct: 360 EGGIRIPAIVTWPGKIKPSTQSDHICGFQDVMPTLADIANIACPET----------DGIS 409 Query: 439 QTSFFLGTNGQSNRKAEHYFLNG----KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 LG + Y+ L A+RM ++K V Sbjct: 410 FLPALLGETERQKEHEYLYWEYPDPTIGLKAIRMGKWKGIVN-----------------N 452 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + +++L +D +E + H + L M Sbjct: 453 IRKGNSTMELYDLESDLREEHDVAAEHPDIVRKLTRLMEK 492 >UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI7_BACV8 Length = 508 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 112/505 (22%), Positives = 186/505 (36%), Gaps = 88/505 (17%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + KPN++ + DD+G+ D+G G TP+ID +A +G+ T AY+ P S+P Sbjct: 22 AQKTPKPNIIYIMCDDMGYGDLGCYGQPYIS---TPNIDNMAKEGMRFTQAYAGSPVSAP 78 Query: 139 TRATILTGQYSIHHGILMPPMY------------------GQPGGLQGLTTLPQLLHDQG 180 +RA+ +TGQ+S H + Y GQ G +P+++ D G Sbjct: 79 SRASFMTGQHSGHCEVRGNKEYWRDAPVVMYGNNKEYAVVGQHPYDPGHVIIPEIMKDNG 138 Query: 181 YVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 Y T GKW G S P G D++ G+ + + + +N + D + Sbjct: 139 YTTGMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYPNF-LNRYSKSAGDTAVVR 197 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 + + + + ++ + + +K+LDK +PFF + Sbjct: 198 VVMDENINYPMFGKDYFKRP---------QYSADMIHEEAMKWLDKQ-DGKQPFFGIFTY 247 Query: 300 RGCHFDNYPNA-----------------------KYAGSSPARTSYGDCMVEMNDVFANL 336 H + +Y S + + ++ + Sbjct: 248 TLPHAELAQPEDSILTGYQKKFFEDKTWGGQEGSRYNPSVHTHAQFAGMITRLDYYVGEV 307 Query: 337 YKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-----TPFRGAKGSTWEGGVRVPTFVYW 391 L++ G +NT+++FTSDNGP E RG K +EGG+R+P V W Sbjct: 308 LNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQCYEGGIRIPFIVRW 367 Query: 392 KGMIQPRKS-DGIVDLADLFPTALDLAGHPG--AKVANLVPKTTFIDGVDQTSFFLGTNG 448 G + D + DL PT DLAG K N + DG+ LG G Sbjct: 368 PGKVPEGTVNDHQLAFYDLMPTFCDLAGVKNYVKKYTNKKKDVDYFDGISFAPTLLGQEG 427 Query: 449 QSNRKAEHY-FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNL 507 Q ++ F VRM ++K V P+ ++NL Sbjct: 428 QKKHDFLYWEFDETDQIGVRMGDWKMVVKKGTPF----------------------LYNL 465 Query: 508 YTDPQESDSIGVRHIPMGVPLQTEM 532 TD E I H + ++ + Sbjct: 466 ATDIHEDHDIAAGHPDIVKQMKEII 490 >UniRef50_A6DF76 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF76_9BACT Length = 542 Score = 442 bits (1137), Expect = e-122, Method: Composition-based stats. Identities = 114/530 (21%), Positives = 184/530 (34%), Gaps = 88/530 (16%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 KPN+V L DD+G D G N TP+IDA+A++G+ T + PTR Sbjct: 18 AADKPNIVFILADDMGIGDTNCYGDEKCRIN-TPNIDALAAEGVRFTDFHVNSSICGPTR 76 Query: 141 ATILTGQYSIHHGIL-MPPMYGQPGGLQGL--TTLPQLLHDQGYVTQAIGKWHMG----- 192 ++TG+Y G +G G TL ++L GY T IGKWH+G Sbjct: 77 RALMTGRYPWRFGATVNNGPWGFCGPRPNTEKYTLGKVLKKAGYNTGYIGKWHLGTTMVT 136 Query: 193 -ENKE-------------SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 + K+ P GFD DMY + + +S Sbjct: 137 KDGKKQGLTNVDYTKPLVYGPMQFGFDYSFILPGSLDMYPYAFIKDNDWQGDVS------ 190 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + A + + + F+ K SD PFFL+ Sbjct: 191 ----------ALKGWSAFNRVGAAEISFESNKVVETFYRESELFIKKQ-NSDTPFFLFLA 239 Query: 299 TRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 H P ++ G S YGD ++E++ A + + L++ G +NTLI+F+SD+G Sbjct: 240 LTSPHTPVCPGEEWNGKSEL-GPYGDFVMEVDHSIARVKQALKEKGLYENTLIIFSSDHG 298 Query: 359 PEAEVPP--------------HGRTP---FRGAKGSTWEGGVRVPTFVYWKGMIQPRK-S 400 P G P +RG K S +EGG+RVP W G + Sbjct: 299 PAPYAGNILKATPNQISLLEQQGHYPAGIYRGYKFSIYEGGLRVPFIASWPGKTPKGQIC 358 Query: 401 DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN 460 + ++ DLF T +L + + D + + +RK Sbjct: 359 NQLIGFNDLFATFAELTNIK-------LQEDEAPDSISFARLLTKPSSNGDRKDLIMQSV 411 Query: 461 GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG------------------- 501 A+R E+K + +G Sbjct: 412 -TSFAIRDGEWKLCLCPGSGIPANSENGKGNDPAPNAAWKKALEEFKGKPHQTDLLKAPF 470 Query: 502 SSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 +FNL DP+E +++ ++ + + + P ++K+D Sbjct: 471 VQLFNLAKDPEEKNNLASKNPRQVEKMINLFKKQIADGRSTPG-PKLKND 519 >UniRef50_C6VTS4 Sulfatase n=47 Tax=cellular organisms RepID=C6VTS4_DYAFD Length = 520 Score = 442 bits (1137), Expect = e-122, Method: Composition-based stats. Identities = 127/526 (24%), Positives = 197/526 (37%), Gaps = 68/526 (12%) Query: 59 MPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDID 118 + T + A +K KPN+V+ LDD+G+ DVG G A TP++D Sbjct: 11 ILGTSLAVLLAATGWQFAPQTEKAA-KPNIVIVNLDDLGYGDVGAYG---ATALKTPNMD 66 Query: 119 AVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQG-LTTLPQLL 176 +A+ G+ T+ Y+ + +P+R ++TG Y + P + T+P++L Sbjct: 67 RIANGGIRFTNGYATSSTCTPSRFALVTGVYPWRNKEAKILPGDAPLLIDTAQQTIPKVL 126 Query: 177 HDQGYVTQAIGKWHMGENKESQ---------PQNVGFDDFRGFNSVSD-----MYTEWRD 222 GY T +GKWH+G P +GFD + D R Sbjct: 127 KKAGYATAIVGKWHLGLGNGDTDWNKEVKPGPNQLGFDYSYILAATQDRVPTVYIENTRV 186 Query: 223 VHVNPEVALSPDRSEYIKQLPFSKDDVHA----VRGGEQQAIADITPK------------ 266 V ++P + + + P KD+ G Q+I + + Sbjct: 187 VGLDPNDPIRVSYKQNFEGEPTGKDNPELLKMKWHHGHDQSIVNGISRIGYMKGGQKAKW 246 Query: 267 YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCM 326 E++ ++ +F+ KPFFLYY + H P+ ++ G + GD + Sbjct: 247 NDEEMADLFLTKAQQFIKDH--KSKPFFLYYAMQQPHVPRTPHPRFKGVT-GMGPRGDAI 303 Query: 327 VEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-----------RTPFRGAK 375 E + L TLEK G L+NTLI+FTSDNGP H P RG K Sbjct: 304 AEADWCLGELLNTLEKEGILENTLIIFTSDNGPVVNDGYHDDAVEKLGKHKPAGPLRGGK 363 Query: 376 GSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 S +E GVRVP YWKG I+P SD +V DL + L G +D Sbjct: 364 YSLFEAGVRVPFITYWKGTIKPAVSDAVVCQLDLLSSLAHLTGQEAKG----------LD 413 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 + FLG + + A+R ++ P + G Sbjct: 414 SRNYLDVFLGKTQKGRSELIL--EASSRTALRQGDWLMIPPYNGPAINKMVNIELG---- 467 Query: 496 VMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 ++NL TD + ++ L T + K Sbjct: 468 --NAKEYQLYNLKTDIGQQHNLAKSEPERLKKLVTAFEQLQQGGAK 511 >UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3 Tax=Deuterostomia RepID=UPI0000586CBA Length = 596 Score = 441 bits (1136), Expect = e-122, Method: Composition-based stats. Identities = 121/523 (23%), Positives = 206/523 (39%), Gaps = 84/523 (16%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 TGK P++V + DD GW DVG++ + TP++D +AS+G+ L + Y QP SP+R Sbjct: 94 ATGKPPHIVFIVADDYGWFDVGYHNSTI----KTPNLDLLASRGVKLENYYVQPICSPSR 149 Query: 141 ATILTGQYSIHHGILM-PPMYGQPG-GLQGLTTLPQLLHDQGYVTQAIGKWHMGENK-ES 197 + ++TG+Y IH G+ + QP TTLPQ L + GY T +GKWH+G K E Sbjct: 150 SQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFYKNEC 209 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD G+ S Y P P+ + ++ + + Sbjct: 210 MPLQRGFDSSFGYLSGMQDYWTHFRSGSFPGF---PEGNHWLGIDFWDNN---------- 256 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG--- 314 + + T Y + + + + + + + ++P FLY + H KY Sbjct: 257 RVAWEYTGNYSQFV---FTERAQRVIQQH-NPNQPLFLYLPLQSVHGPLQVPEKYMKPYA 312 Query: 315 --SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 R +Y + M++ + +L++ G ++T++VFT+DNG + P R Sbjct: 313 HFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTPGKSGN-NWPLR 371 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPR----KSDGIVDLADLFPTALD-LAGHPGAKVANL 427 G K + WEGGV F+ MI S + ++D FPT ++ +AG A + Sbjct: 372 GTKNTLWEGGVHGVGFITGP-MIPAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGL--- 427 Query: 428 VPKTTFIDGVDQTSFFLGTNGQSNRKAEHY------------------------------ 457 +D + + S RK + Sbjct: 428 -----ALDSYNMWNSIT-KGTPSPRKELLHNIDPYIRADHPFGYGYDEETDMIYPLSGLY 481 Query: 458 ------FLNGKLAAVRMDEFKYH--VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 F AA+R+ E+K + + + +FN+ Sbjct: 482 PKMAAEFSTDMRAAIRVGEWKLLTGFPGRSGWYPPPEWNIHPIDPVEAANKVTWLFNITA 541 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYME-ILKKYPPRAQIKSD 551 DP E + + +H + L + AY + + P +K+D Sbjct: 542 DPCEKNDLSYQHPEVVTELVGRLEAYYKTSVPVRFPNQTVKAD 584 >UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacteria RepID=A6DR29_9BACT Length = 510 Score = 441 bits (1136), Expect = e-122, Method: Composition-based stats. Identities = 127/497 (25%), Positives = 190/497 (38%), Gaps = 61/497 (12%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 + + KPNV++ + DD+GW D GFNG V TP +D +A++GL L Sbjct: 9 AASAALFSPFISAESAKPNVILIMADDLGWGDTGFNGSKVI---KTPHLDQMAAEGLQLD 65 Query: 129 SAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 YS SPTRA++LTG+ G+ P Q TLP++L++QGY T G Sbjct: 66 RFYSASSVCSPTRASVLTGRNPYRTGV---PTANQGFLRPEEITLPEVLNEQGYATGHFG 122 Query: 188 KWHMG---------------ENKESQPQN-VGFDDFRGFNSVSDMYTEWRDVHVNPEVAL 231 KWH+G KE P G++D S Y + Sbjct: 123 KWHLGTLTHTEKDANRGKPGNTKEFNPPKLHGYEDAFVTESKVPTYDPMILPAKFDQGES 182 Query: 232 SPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 EY+K+ SK E + I D D + MD + F+D+ +K Sbjct: 183 KHLGWEYVKEGEESKPYGTFYWDIEGKKITDNL---KGDDSRVIMDRVLPFIDQAVADEK 239 Query: 292 PFFLYYGTRGCHFDNYPNAK----YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 PF H + Y G +Y C+ M++ L K L G D Sbjct: 240 PFLSVVWFHTPHLPCVAGPRHQEMYKGHPIHLRNYAGCVTAMDEQIGRLRKHLADKGVAD 299 Query: 348 NTLIVFTSDNGPEAEVPP--HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIV 404 NT+I F SDNGPE++ P FRG K +EGGVRVP + W ++ RK Sbjct: 300 NTMIWFCSDNGPESKERPDNGSAGHFRGRKRDLYEGGVRVPAVMVWPAKVKEARKISAPC 359 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA 464 +D PT LD P + + DG + +++ F + + Sbjct: 360 ITSDYMPTILDALHIPHPQASYAT------DGRSLMPIINNEDFTRDKEIGIMFSSRIVW 413 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 +FK ++NL +DP E + ++ + Sbjct: 414 --HKGDFKLLSYNGG--------------------KKYELYNLKSDPSEKTDVAAQNPEL 451 Query: 525 GVPLQTEMHAYMEILKK 541 L+ +M A+ E +K Sbjct: 452 VEKLKKDMLAWHESVKS 468 >UniRef50_A6BYR0 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYR0_9PLAN Length = 658 Score = 441 bits (1136), Expect = e-122, Method: Composition-based stats. Identities = 123/546 (22%), Positives = 180/546 (32%), Gaps = 119/546 (21%) Query: 74 KLAELEKKTGKK-PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 ++ E + PNVV+FL+DD+GWMD G TP++ +A Q + T+AY+ Sbjct: 13 AISSAETVAADRAPNVVLFLVDDMGWMDSEPYGSRY---YETPNMSKLAKQSMRFTNAYA 69 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQP-------------------------GGLQ 167 P SPTRA+ILTGQY HGI + P Sbjct: 70 TPLCSPTRASILTGQYPSRHGITSATGHRPPQAENFEFLPTAAPPNQKLRMPVSKNYLEP 129 Query: 168 GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 TL + L D GY T GKWH+G +P GF+ Sbjct: 130 NQYTLAEALRDAGYRTGHFGKWHLGLTTPHRPDKQGFET-------------------VW 170 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 A P Y + + + E + R ++F++ A Sbjct: 171 HCAPDPGPPSYFSPYGVTPTGKPTAQH---RVGNITDGPDGEHITDRLTSEAIQFME--A 225 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAG---------SSPARTSYGDCMVEMNDVFANLYK 338 +PFFL H A+Y + +++ + + Sbjct: 226 HRSEPFFLNLWHYSVHGPWQHKAEYTAEFAKKQDPRKEQRNPVMASMLRNVDESLGRILQ 285 Query: 339 TLEKNGQLDNTLIVFTSDNGPEAEVP-------------------------------PHG 367 L++ DNTL +F SDNG A P Sbjct: 286 KLDELKLADNTLFIFYSDNGGNAHSWSSDDPKLKKITDKHPLYKTINSYRKWAGGEPPTN 345 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVAN 426 P R KG +EGG RVP V W G IQP SD IV DL+PT LD + Sbjct: 346 NAPLREGKGRIYEGGQRVPLMVRWPGHIQPGTTSDAIVGPIDLYPTILD-------SLKL 398 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG--KLAAVRMDEFKYHVLIQQPYAYT 484 P IDG G+ R A + +VR ++K + Y Sbjct: 399 SQPANQIIDGKSFLPVLE-QTGELERTAYFTWFPHLIPAVSVRQGDWKLIRRFEPHRLYP 457 Query: 485 QSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPP 544 + ++NL D ESD++ + L + +++ P Sbjct: 458 EIR---------------ELYNLKADISESDNLARQRPDKVRELDALIDEFVKETGALYP 502 Query: 545 RAQIKS 550 + Sbjct: 503 QPNPAY 508 >UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZLN5_9PLAN Length = 468 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 116/535 (21%), Positives = 187/535 (34%), Gaps = 110/535 (20%) Query: 55 ADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPT 114 M V + + A +K+ + P++V+ + DD G+ D+ G G T Sbjct: 4 TRLMTFVCALASALLVSN---AVAAEKSKRPPSIVLIVSDDQGFADLSCIGDN---GCRT 57 Query: 115 PDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP---------- 163 P +D +A+ G LTS Y P+ +P+RA+++TG+Y +G P Sbjct: 58 PRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEE 117 Query: 164 ---------GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVS 214 G L +L GYV+ GKW G+ K P GFD + GF + Sbjct: 118 YAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKRYLPLQRGFDQYYGFANTG 177 Query: 215 DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQR 274 Y V S + P +D L Sbjct: 178 VDYFTHERYGV---------PSMFRDNQPTEEDK-------------------GTYLTDL 209 Query: 275 WMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAG--------SS 316 + ++F+D+ D+PFFLY H + +Y Sbjct: 210 FEREAIRFIDE--NHDRPFFLYLPFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESKQE 267 Query: 317 PARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKG 376 R +Y + M++ + L+++ DNTLI+F SDN +P RG K Sbjct: 268 KRRQAYLAAVERMDEAIGKVVDQLQQHQIADNTLIIFLSDN---GGGGGADNSPLRGGKA 324 Query: 377 STWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFID 435 +EGG RVP V+W G + S+ + ++FPT + G +P D Sbjct: 325 KMFEGGNRVPCIVHWPGKVPAGKVSNQFLTSLEVFPTVIAAIGGK-------LPDDVIYD 377 Query: 436 GVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGT 495 G D G S R+ + G +AA R+ ++K+ Sbjct: 378 GFDMLPVLNG--ASSPREEMFWKRRGDVAA-RVGDWKWVDSAAGKG-------------- 420 Query: 496 VMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 +F+L D E + H M L+ A+ ++ PR + Sbjct: 421 --------LFDLAHDIGEKKDLSKEHPEMLAKLKARFDAWTAEMEAADPRGPFRD 467 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 114/513 (22%), Positives = 190/513 (37%), Gaps = 114/513 (22%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 KPN+++ ++DD+GW DVG+ TP+IDA+A QGL+ AY+ + +P+ Sbjct: 35 ADTTKPNIIMIVIDDLGWSDVGY--NQTTDYFETPNIDALAQQGLVFDQAYAGAANCAPS 92 Query: 140 RATILTGQYSIHHGILMP--------------PMYGQPGGLQGLTTLPQLLHDQGYVTQA 185 RA +++GQY HG+ P+ + G + T+ + L GY T Sbjct: 93 RAVLMSGQYGPRHGVYTVSPSDRGHAKTRKLIPIKNKRGLTTDIITIGESLKTAGYTTGT 152 Query: 186 IGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFS 245 GKWH+G + P GFD + M + + P + P Sbjct: 153 FGKWHLGAD----PDKQGFDVNVAGSHQG-MTFHYFSPYQLPNIEDGPKG---------- 197 Query: 246 KDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFD 305 E L +R + ++ + D+PFF Y H Sbjct: 198 -----------------------EYLTERLTTEVIDWVK--SSKDQPFFAYVPYYTVHTP 232 Query: 306 NYP-------NAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG 358 + S +Y + M+D ++ L+ G +NT+++FTSDNG Sbjct: 233 YQAVVDKVNKYHEKGIKSKREATYAAMVEHMDDNVGRIFDMLDSEGLAENTVVIFTSDNG 292 Query: 359 PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAG 418 TP RG KGS ++GG+RVP V W ++P V AD +PT ++L Sbjct: 293 G--YRMSSFPTPLRGGKGSYYDGGLRVPLIVRWPEKVKPGLDHTPVINADFYPTLVNLTK 350 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-------------------FL 459 +DGVD T+ LG + R + F Sbjct: 351 SKQP--------NQVLDGVDLTAHLLGQQDIAERDLFWHFPVYLQAHHAPTDQGQDPLFR 402 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 +A+R ++K + ++NL D E +++ Sbjct: 403 TRPGSAIRSGDWKLL--------------------QYFENNEFELYNLANDLAEKNNLAS 442 Query: 520 RHIPMGVPLQTEMHAYMEILK-KYPPRAQIKSD 551 H L+T++ A+ + + P + + D Sbjct: 443 VHPSRVKELKTKLQAWQQQIGADIPTKLNPEYD 475 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 114/508 (22%), Positives = 183/508 (36%), Gaps = 87/508 (17%) Query: 57 NMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPD 116 ++ + T A+ PN++ L+DD+G+ D+ G A +P+ Sbjct: 16 AILALFSIGCLAAATGTCYAQERPDA---PNILCILVDDLGYGDLSCQG---ATDLQSPN 69 Query: 117 IDAVASQGLILTSAYSQP-SSSPTRATILTGQYSIHHGI----LMPPMYGQPGGLQGLTT 171 IDA+A+ G+ T+ Y+ SP+RA +LTG+Y G+ P Sbjct: 70 IDALAANGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVIRQNPENNWGNLADDAVL 129 Query: 172 LPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVAL 231 +P L+ GY T IGKWH+G + P + GF F+GF DM ++ D + Sbjct: 130 IPSELNPAGYHTGIIGKWHLGLEEPDTPNDRGFTYFKGFLG--DMMDDYWDHRRGGINWM 187 Query: 232 SPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDK 291 +R E + + D+ + FL + ++ Sbjct: 188 RLNREEIDPK---------------------------GHATDLFTDWTIDFLKERQGEEQ 220 Query: 292 PFFLYYGTRGCHFDNYPNAKYAGS--------SPARTSYGDCMVEMNDVFANLYKTLEKN 343 PFFLY HF P ++ + R + ++ + + L+ Sbjct: 221 PFFLYLAYNAPHFPIQPPREWLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEALKTT 280 Query: 344 GQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDG 402 G +NTL+VF SDNG P RG K +EGG+RVP YWKG I P SD Sbjct: 281 GLEENTLVVFVSDNGGAL-WYAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDN 339 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY----- 457 L DLFPT +LAG + +DG+ G + + ++ Sbjct: 340 TALLMDLFPTFCELAGRKPPEN---------VDGISLVPTLTGQAQDTANRYLYWVRREG 390 Query: 458 --FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 + A R +FK FN+ D E+ Sbjct: 391 GDYGGQAYYAARFGDFKILQNT--------------------PFEPIQFFNIGQDELETT 430 Query: 516 SIGVRHIPMGVPLQTEMHAYMEILKKYP 543 + L+ ++ ++ P Sbjct: 431 PL-ETDSEAYRALRAQLMEHIRTAGGVP 457 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 110/496 (22%), Positives = 169/496 (34%), Gaps = 98/496 (19%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 + + ++PN+++ DD G DVG G + PTP ID +A +GL+ YS Sbjct: 20 SQLQAAERPNILIIFTDDQGINDVGCYGSEI----PTPHIDQLAKEGLLFRQYYSASAIC 75 Query: 137 SPTRATILTGQYSIHHG-------ILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 +P+R ILTG+ + M + G G TT+ +L GY T +GKW Sbjct: 76 TPSRFGILTGRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKW 135 Query: 190 HMGENKE-SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 H+G E P GFD FRG Y Y + + Sbjct: 136 HLGHGTESFLPTAHGFDLFRGHTGGCIDYFTMT----------------YGNIPDWYHNQ 179 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH----- 303 H G + FL +DKPFFL+ H Sbjct: 180 RHVSENG--------------YATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGW 225 Query: 304 -----FD---NYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTL 350 R + V ++D + +L+ NG NTL Sbjct: 226 SPGDQSPVNIMQARGDDLKRVGTIKDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTL 285 Query: 351 IVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADL 409 ++F +D+G + + PFRGAK + +EGG+RVP + W G I+ ++ + DL Sbjct: 286 VIFMTDHGGDYVYGGN-NQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDL 344 Query: 410 FPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH------YFLNGKL 463 FPT A +DG D + R+ G+ Sbjct: 345 FPTICHFANVDT--------DGLTLDGKDISGLLTRQTPVGTRELYWQLGPHAELKRGRW 396 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 +A+R ++KY +F+L DP E ++ Sbjct: 397 SALRQGDWKYIQDAGGEEF---------------------LFDLKADPYEKQNLTQSQST 435 Query: 524 MGVPLQTEMHAYMEIL 539 LQ ++ L Sbjct: 436 KLTELQERRDTLVKTL 451 >UniRef50_A6DHS3 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHS3_9BACT Length = 524 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 120/526 (22%), Positives = 189/526 (35%), Gaps = 80/526 (15%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-S 132 L L KPN++ L DD+G+ D+ GG PTP +D +A +G+ T A+ S Sbjct: 11 SLFCLSLSAQDKPNIIFILADDMGYGDMSNEGG----LIPTPHLDRMADEGMKFTDAHTS 66 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWH 190 +PTR ILTG+Y+ + G P Q T+ L DQGY T +GKWH Sbjct: 67 SSVCTPTRYGILTGRYNWRSSKKKGVLSGTSAPLIPQDRVTIANFLKDQGYHTGMVGKWH 126 Query: 191 MGENKES------------------------------------QPQNVGFDDFRGFNSVS 214 +G + P + GFD F G + Sbjct: 127 LGIGWQMLDEAKKPEKSFLKEGYKMKNNKQAASWKVDYSKPAITPIHNGFDYFYGIAASL 186 Query: 215 DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQR 274 DM + + +R + A Sbjct: 187 DMSPYVYIENDKAVEMATHERGFAT----------------PYRPGATGPSFDATYCLMT 230 Query: 275 WMDYGVKFL-DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVF 333 + D K++ + A KPFFLY H P+ K+ G SP +T YGD ++E + V Sbjct: 231 FADKSRKYIAQQAADKSKPFFLYLPLTSPHTPIMPSEKFLGKSPTKTIYGDFVMETDWVV 290 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAEV--------PPHGRTPFRGAKGSTWEGGVRV 385 + L+K G DNTLIVFT+DNG +RG K +EGG RV Sbjct: 291 GEVMAELDKQGIADNTLIVFTADNGCSPTGSIPEHIKIGHSPNGQWRGHKADIFEGGHRV 350 Query: 386 PTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFL 444 P V W +Q +SD + AK++ + T D + Sbjct: 351 PFLVRWPAQVQTKTQSDSTICTT-----DFFATAADAAKLSASIEDTMAEDSYSFYADLT 405 Query: 445 GTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSV 504 +G++ R + A+R ++K ++ + + + Sbjct: 406 -QSGKTKRPFTIHHSINGSFAIRQGKWKLNLCPGSGGWSAPRPGKATKGLPL-----IQL 459 Query: 505 FNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 ++L DP E +++ H + L ++ ++ + P Q Sbjct: 460 YDLDGDPAEKNNLQDAHPEIVDNLVNQLAKEIKAGRSTPGAPQTNE 505 >UniRef50_B4AUP3 Sulfatase n=2 Tax=Bacteria RepID=B4AUP3_9CHRO Length = 570 Score = 439 bits (1131), Expect = e-122, Method: Composition-based stats. Identities = 135/553 (24%), Positives = 231/553 (41%), Gaps = 55/553 (9%) Query: 46 YLVKPATTIADNMMPVMQHPAQDKETQQ--KLAELEKKTGKKPNVVVFLLDDVGWMDVGF 103 + +K T+ + + + +A + + KKPN++V + DDVGW ++ Sbjct: 3 HFIKKVVTVVATIALAINFALVNVFPWSIGNIALAQTISPKKPNILVIMGDDVGWFNISA 62 Query: 104 NGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQP 163 +G TP+ID +AS+G++ T Y++ S + RA +TGQ G+ + G P Sbjct: 63 Y-NRGMMGYKTPNIDRIASEGMLFTDVYAEQSCTAGRAAFITGQSPGRTGMTKVGLPGVP 121 Query: 164 GGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRD 222 GL G TL +LL GY T GK H+G+ E P GFD+F G + E + Sbjct: 122 IGLSGEDPTLAELLKPLGYATGQFGKNHLGDLDEFLPTVHGFDEFYGNLYHLNAEEEPEN 181 Query: 223 VHV---------------------NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIA 261 + +P+ + + L +D+ G Sbjct: 182 PDYPKNEIFKQKLGPRGVLHSYSLDYVTQENPEITCPEENLSKYEDENIPGLGQVICNTG 241 Query: 262 DITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY--AGSSPAR 319 +T + M+ +D ++D ++F++K + KPFF+++ T H + Sbjct: 242 PLTIERMKTVDDEFLDASLEFINKTQQEGKPFFVWFNTTRMHVFTHLKDDSYNPDLEKYD 301 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR-TPFRGAKGST 378 YG+ M E + L L++ G D+T++++T+DNG E P G TPF G K + Sbjct: 302 DIYGEGMEEHDQDVGILLDYLDEQGLTDDTIVIYTTDNGAEVFSWPDGGTTPFHGEKNTN 361 Query: 379 WEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKT------ 431 WEGG RVP + W G I+ + S+ I+ D PT L AG P L + Sbjct: 362 WEGGFRVPAMIRWPGYIEAGQISNEIISHQDWLPTLLAAAGAPDDIAEQLKSEDGYNAGI 421 Query: 432 -----TFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL-AAVRMDEFKYHVLIQQPYAYTQ 485 +DG + + S R+ Y + +A+R+D++K Q+ + Sbjct: 422 KTFKKIHLDGYNLLPYLTDQEYHSPRRWFVYLTDDAYPSAIRVDDWKVIFSEQRAEGFE- 480 Query: 486 SGYQGGFTGTVMQTAGSSVFNLYTDPQES-----DSIGV---RHIPMGVPLQTEMHAYME 537 ++ + + NL DP E ++ RH + P Q +++ Sbjct: 481 -----VWSEPYVNLRVPMILNLRRDPFEKAPEESNNYIDWRFRHTFVIAPAQIVAQEFLD 535 Query: 538 ILKKYPPRAQIKS 550 ++YPPR + S Sbjct: 536 TFREYPPRQKPAS 548 >UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7US96_RHOBA Length = 498 Score = 439 bits (1131), Expect = e-121, Method: Composition-based stats. Identities = 112/515 (21%), Positives = 178/515 (34%), Gaps = 105/515 (20%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ- 133 + +PN+++ +DD+GW D+G G TP ID +A++GL T+ Y+ Sbjct: 21 CCSPAQSRAGQPNILLIFIDDLGWKDIGCYGNDFV---ETPRIDQLAAEGLRFTNFYASG 77 Query: 134 PSSSPTRATILTGQYSIHHGIL-MPPMYGQP-----------GGLQGLTTLPQLLHDQGY 181 SPTR + +GQ GI P + +P T+ + L GY Sbjct: 78 AVCSPTRCALQSGQNQARIGITAHIPGHWRPFERVITPQTTMALPLDTVTIAESLKASGY 137 Query: 182 VTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQ 241 T +GKWH+G E QP G+D Sbjct: 138 TTGYVGKWHLGNGPEFQPDRQGYDFSAVIGG----------------------------- 168 Query: 242 LPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRG 301 H Q +D+ PK + D + F+ + D+PFFL Sbjct: 169 -------PHLPGRYRVQGRSDLKPKPNQYRTDFEADLCIDFMRQ--NKDQPFFLMLSPFA 219 Query: 302 CHFDN----------YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLI 351 H AK G+S Y + +D+ L +LE+ D+T+I Sbjct: 220 VHIPLAAMSEKVQKYEAMAKQTGNSLPHPVYAAMIEHCDDMVGRLVDSLEQLDIADDTMI 279 Query: 352 VFTSDNGP---------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSD 401 VFTSDNG A+ + P +G KGS EGG+RVP + ++ D Sbjct: 280 VFTSDNGGLYKRYDYRESADDLVSSQAPLKGEKGSLHEGGIRVPLIIRHPATVKSAGVCD 339 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE-----H 456 D +PT +++AG +P IDG +R A H Sbjct: 340 EPTISHDFYPTFVEMAGGE-------LPINQTIDGHSLLPLMTAPTQTLDRDALHWHYPH 392 Query: 457 YFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS 516 Y + +A+R ++K + T ++NL D E+ + Sbjct: 393 YHHDRPASAIRERDWKLIEYLDG-------------------TGDVELYNLADDLGETKN 433 Query: 517 IGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 + L+ ++ + + P D Sbjct: 434 LASEKQGRAGDLKRKLTTWRSSVLARTPIPNPSYD 468 >UniRef50_P15289 Arylsulfatase A component C n=34 Tax=Euteleostomi RepID=ARSA_HUMAN Length = 507 Score = 439 bits (1131), Expect = e-121, Method: Composition-based stats. Identities = 115/458 (25%), Positives = 196/458 (42%), Gaps = 41/458 (8%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRAT 142 + PN+V+ DD+G+ D+G G + TP++D +A+ GL T Y +P+RA Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSST---TPNLDQLAAGGLRFTDFYVPVSLCTPSRAA 75 Query: 143 ILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE--SQ 198 +LTG+ + G+ + + G T+ ++L +GY+T GKWH+G E Sbjct: 76 LLTGRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFL 135 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P + GF F G D P + + +P + Sbjct: 136 PPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLAN----------- 184 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 + P ++ L+ R+M + + + D+PFFLYY + H+ + +A S Sbjct: 185 LSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRPFFLYYASHHTHYPQFSGQSFAERS-G 243 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE--VPPHGRTPFRGAKG 376 R +GD ++E++ L + G L+ TL++FT+DNGPE R KG Sbjct: 244 RGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRMSRGGCSGLLRCGKG 303 Query: 377 STWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 +T+EGGVR P +W G I P + + DL PT LAG P +DG Sbjct: 304 TTYEGGVREPALAFWPGHIAPGVTHELASSLDLLPTLAALAGAPLP--------NVTLDG 355 Query: 437 VDQTSFFLGTNGQSNRKAEHYF-----LNGKLAAVRMDEFKYHVLIQQP-YAYTQSGYQG 490 D + LG G+S R++ ++ + AVR ++K H Q ++ T + Sbjct: 356 FDLSPLLLG-TGKSPRQSLFFYPSYPDEVRGVFAVRTGKYKAHFFTQGSAHSDTTADPAC 414 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSI----GVRHIPM 524 + ++ +++L DP E+ ++ + Sbjct: 415 HASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATPEV 452 >UniRef50_D2R2H5 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2H5_9PLAN Length = 507 Score = 439 bits (1131), Expect = e-121, Method: Composition-based stats. Identities = 128/517 (24%), Positives = 192/517 (37%), Gaps = 67/517 (12%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQ 133 L + +KPN++V + DD+G+ D+G NG TP ID VA++GL TS Y S Sbjct: 14 LVAAIASSAEKPNIIVIIADDLGYGDLGCNGSQTIA---TPHIDRVAAEGLRFTSGYCSA 70 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGG-LQGLTTLPQLLHDQGYVTQAIGKWHMG 192 + +PTR ++LTG Y+ P T+ LL QGY T IGKWH+G Sbjct: 71 STCTPTRYSLLTGTYAFRVKGTGIAAPNSPALIQPETVTVASLLKSQGYATACIGKWHLG 130 Query: 193 EN--KESQ-------PQNVGFDDFRGFNSVSDMYT-----EWRDVHVNPEVALSPDRSEY 238 K P +GFD + +D R +++P L + Sbjct: 131 LGVGKPDWNGELKPGPLEIGFDHCLLLPTTNDRVPQVFVENHRVRNLDPADPLWVGDEKP 190 Query: 239 IKQLPFS-------KDDVHAVRGGEQQAIADITPKYME---------DLDQRWMDYGVKF 282 P D G Y DL W+ ++ Sbjct: 191 SDDHPTGISHRSTLAMDWDYGHNGTIHNGISRIGFYTGGMKARFRDQDLADEWVKASAQW 250 Query: 283 LDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEK 342 ++ A PFFLY+ H P+ ++ G S GD ++E + L K LE+ Sbjct: 251 IE--ANKAGPFFLYFAAHDIHVPRTPHERFVGKS-GMGPRGDSILEFDWCVGELMKVLEQ 307 Query: 343 NGQLDNTLIVFTSDNGPEAEVPPHGRTP-----------FRGAKGSTWEGGVRVPTFVYW 391 + +NTL+V SDNGP + FRG K S +EGG R P V W Sbjct: 308 HQLAENTLVVICSDNGPVLNDGYKDQAVELIGKHAAAGLFRGGKYSVFEGGTRTPFIVSW 367 Query: 392 KGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 KG + SD +V D + LAG +P+ +D ++ LG + Sbjct: 368 KGRVASGVSDKLVSTIDFASSFAALAGAK-------IPEDACLDSLNLLDTLLGDKAAAG 420 Query: 452 RKAEHYFLNGK-LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD 510 R+ NG +R ++K P G + +F L +D Sbjct: 421 REYVLQQDNGGTKLGLRAGDWKLVRGGALPGKKKGPGAR----------EADQLFRLSSD 470 Query: 511 PQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 P E+ ++ LQ + + + P Q Sbjct: 471 PGETKNVAAEFPAELEKLQKLLATIIADGRTRPVGPQ 507 >UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYD6_RHOBA Length = 889 Score = 439 bits (1131), Expect = e-121, Method: Composition-based stats. Identities = 125/558 (22%), Positives = 179/558 (32%), Gaps = 106/558 (18%) Query: 41 DHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMD 100 +P + A + ++ E + A K+PNV+ L DD+GW D Sbjct: 223 SYPKGFFGLQVHKGAKGTVLWKNIRVKELENEPA-ATPNASASKRPNVLFILADDLGWSD 281 Query: 101 VGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMP-- 157 G TP+I+ +A +G+ T AYS P SPTRA++LTG HGI P Sbjct: 282 TTLFG--TTKLYQTPNIERLAKRGMTFTRAYSSSPLCSPTRASVLTGLSPARHGITSPTC 339 Query: 158 ----------------------PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 TL ++ D GY T GKWH+G Sbjct: 340 HLPKVVLEPKVSETGPPNKFSTVPESVTRLDTKYYTLAEMFRDNGYATGHFGKWHLGPEP 399 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 P GFD DV +P + K F D V Sbjct: 400 -YSPLEHGFD---------------VDVPHHPGPGPAGSYVAPWKFKDFDHDPVIPD--- 440 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA----K 311 E L+ R V+FL++ PFFL Y H + Sbjct: 441 -------------EHLEDRMAKEAVRFLEQHTNE--PFFLNYWMFSVHAPFDAKKELIEE 485 Query: 312 YAG----SSPAR-TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA----- 361 Y P R +Y + M+D L TL++ G D T+IVF SDNG Sbjct: 486 YRDRVDPKDPQRCPTYAAMIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEVD 545 Query: 362 EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHP 420 P RG K + +EGGVR P V G+++ SD I+ D +PT L++ Sbjct: 546 GTTATSNAPLRGGKATMYEGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTLLEMLAID 605 Query: 421 GAKVANLVPKTTFIDGVDQTSFFLGTNGQS-------NRKAEHYFLNGKLAAVRMDEFKY 473 DGV G Q +V ++K Sbjct: 606 AQ-------PNQRFDGVSIVPALQGKPLQRDAIFTYFPHDPPVPNWMPPSVSVHQGDWKL 658 Query: 474 HVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMH 533 + + +FNL D E ++ +H + + Sbjct: 659 IRIFHGG---------------PNGSHRYKLFNLKNDLGERINLAAKHPDRVQQMDKLIG 703 Query: 534 AYMEILKKYPPRAQIKSD 551 ++ K P D Sbjct: 704 QHLVETKAVRPLVNKNFD 721 >UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA Length = 512 Score = 439 bits (1130), Expect = e-121, Method: Composition-based stats. Identities = 111/506 (21%), Positives = 194/506 (38%), Gaps = 68/506 (13%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPT 139 +T PNV++ DD+G+ D+ PTP +D +A G+ T +S +P+ Sbjct: 31 ETKTPPNVLILYADDLGYGDLNLQNAE--SKIPTPHLDQLARSGMRFTDGHSSSGICTPS 88 Query: 140 RATILTGQYSIH--HGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKES 197 R +LTG++ HGI +G+ TLP++ GY T AIGKWH+G + ++ Sbjct: 89 RYALLTGRHHWRDFHGI--VNAFGESVFEPEQLTLPEMFQQHGYQTAAIGKWHLGWDWDA 146 Query: 198 ------------------------------QPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 P GFD + G ++ P Sbjct: 147 IKKPDAKTFGEGRKKGYGPEAFDWTKSIPDGPLAHGFDSYFG----------DTVINFPP 196 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVR--GGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 + D+ ++ E + + GV+F++ Sbjct: 197 YCWIEDDKVVKAPDTIMDTAKWKPIKEGNWECRPGPMTSDWDPYQNIPTTTARGVQFIES 256 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 +SD+PFFLY+ H PN ++ G S A YGD + E +D L + L+++GQ Sbjct: 257 QKESDQPFFLYFAFPAPHAPIIPNDEFDGRSGA-GPYGDYVCETDDACGKLLRALKESGQ 315 Query: 346 LDNTLIVFTSDNGPEAE-------VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QP 397 +NT+++F++DNGPE PFRG K +EGG VP ++W G+ Sbjct: 316 SENTIVIFSADNGPERYAYARDEKYDHWSSQPFRGLKRDLYEGGHHVPFVIHWPGVTDSG 375 Query: 398 RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY 457 D +V D+F T ++ G + +P D Q +R++ Sbjct: 376 STCDALVSQVDIFATLAEMLG-------HSIPDGQAKDSRSLMPLLK-EPKQQHRQSLVQ 427 Query: 458 FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSI 517 + A+R ++ + G++ +++L D +S+++ Sbjct: 428 NTRVDVYAIRDGKWLLIDAKSGYVSGRNKGWESRRQIPADDKLPHELYDLSVDIGQSENV 487 Query: 518 GVRHIPMGVPLQTEMHAYMEILKKYP 543 H + ++ + E YP Sbjct: 488 AGEHPEIVERMKALLQTIREDG--YP 511 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 438 bits (1128), Expect = e-121, Method: Composition-based stats. Identities = 111/497 (22%), Positives = 183/497 (36%), Gaps = 90/497 (18%) Query: 60 PVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDA 119 ++ ++++T + PNV+V L DD GW D G TP D Sbjct: 14 ALICVVCSLLFASCTAKVVQEQTQRPPNVIVILTDDQGWGDFSHTGNEYL---KTPHFDK 70 Query: 120 VASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQ 179 + +G +L Y P +PTRA++LTG+Y + G+ G+ T+ ++ + Sbjct: 71 MTEEGALLDQFYVSPVCAPTRASVLTGRYHLRTGV-SFVTRGRENMRSEEVTIAEVFKEA 129 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GY T GKWH G + PQ GFD F GF S ++ + Sbjct: 130 GYATGCFGKWHNGAHYPENPQGQGFDTFLGFTSGH--WSNYF------------------ 169 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 D GE ++ + MD ++F+D A D+PF + Sbjct: 170 --------DTELEYNGEMKSTK-------GFITDVLMDETIQFID--AHKDEPFLAFVPL 212 Query: 300 RGCHFDNYPNAKY------------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 H KY + + ++D L K L+ + Sbjct: 213 NAPHTPYQVPDKYFDKYKDIDFGYDKKQNKKIATIYGMCENIDDNLGKLMKHLKDQELEE 272 Query: 348 NTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA 407 NT++VF SDNGP+ P+RG K S EGG VP + WKG I + Sbjct: 273 NTIVVFLSDNGPQ---GARYNGPWRGGKTSVHEGGTLVPCAIQWKGHIPNSSKSSLTAHI 329 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-----K 462 DL PT + LAG P+ DG+D +++ +GT+ + + + Sbjct: 330 DLMPTLMGLAGIE-------KPENIQFDGIDLSNYLMGTSDDLGERNLYTHMTNFEITAD 382 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 AVR ++++ + ++NL DP E +++ + Sbjct: 383 RGAVRQGDYRF----------------------TTEYGDVGLYNLKEDPSEENNLKDQLP 420 Query: 523 PMGVPLQTEMHAYMEIL 539 L+T + + + Sbjct: 421 EKTQELKTAFENWYKDV 437 >UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM29_9BACT Length = 481 Score = 438 bits (1128), Expect = e-121, Method: Composition-based stats. Identities = 111/494 (22%), Positives = 188/494 (38%), Gaps = 73/494 (14%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPS 135 KK ++PN+V+ L DD+G+ D+ G TP++D +A +G+ Y + P Sbjct: 27 PQTKKDTERPNIVLILCDDLGYGDLACYGHKQI---KTPNLDQMAKEGIRFNHFYSAAPV 83 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQ----PGGLQGLTTLPQLLHDQGYVTQAIGKWH- 190 S +R +LTG+ G+ + P + T PQLL GY T GKWH Sbjct: 84 CSASRVGLLTGRSPNRAGVYDWIPHSSESSSPHMRKNEITFPQLLQKAGYATCLSGKWHC 143 Query: 191 ---MGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 + ++QPQ+ GFD + + P K+ Sbjct: 144 NGALINTNQAQPQDAGFDYWF---------------------------ATQNNAAPSHKN 176 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD--KPFFLYYGTRGCHFD 305 V+ +R G + + Q + + +++ K + +PFF+Y H Sbjct: 177 PVNFIRNGVELGPIE------GFSCQIVTNEAINWMEDHVKQNEKQPFFIYLSFHEPHEP 230 Query: 306 NYPNAK----YAG--SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGP 359 K Y G + + Y + ++ +L L+K DNTL++FTSDNGP Sbjct: 231 IASPQKIVDTYKGIAENTNQAEYFANVENLDKAVGSLMNQLKKLKINDNTLVIFTSDNGP 290 Query: 360 E-------AEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFP 411 E A +G K T E G RVP ++W I + SD ++ D FP Sbjct: 291 ETLNRYEAASRSYGSPGELKGMKLWTAEAGFRVPAIMHWPEKIATGQISDQVISALDFFP 350 Query: 412 TALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL---NGKLAAVRM 468 T DLA +K N +DG + T ++ + N + A+R Sbjct: 351 TFCDLAQASNSKSLN-------LDGSNFTPALHKKKMTRHKPLLWIYYAALNERQVAMRH 403 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++K Y + T + ++NL D E++ + ++ + Sbjct: 404 GDWK-ISAKLNLPRYHNITSKNFPKVTAATLSDYQLYNLSKDKSEANDLSNQNPKKSAQM 462 Query: 529 QTEMH-AYMEILKK 541 + Y ++L+ Sbjct: 463 IKFLKLQYQDLLED 476 >UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI18_9BACT Length = 562 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. Identities = 113/499 (22%), Positives = 193/499 (38%), Gaps = 64/499 (12%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPT 139 +KPN++ L DD+G DV PTP +D +A+ G++ T A++ +PT Sbjct: 27 SAAEKPNIIYLLADDMGVGDVKAYNAD--SKIPTPALDNLAANGMMFTDAHTNSSVCTPT 84 Query: 140 RATILTGQYSIHHGILMPPMYGQPG--GLQGLTTLPQLLHDQGYVTQAIGKWHMGENK-- 195 R ILTG+YS G T+ LL +GY T IGKWH+G + Sbjct: 85 RYGILTGRYSWRTTKKSGVTQGLSPHLIDSNRETVASLLKKEGYATACIGKWHLGMDWSL 144 Query: 196 ---------------------ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD 234 ++ P GFD + G + ++ + Sbjct: 145 KDGSIADSKSDQSQIDLSKEIQNGPNKNGFDYYFGMAASANHSPHCFI-----------E 193 Query: 235 RSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS--DKP 292 + +L D G + + ++ R+ + +++ D+P Sbjct: 194 DGYTVGKLQVLDDKQRKAVGIDGKPGLVAKGFKQSEILPRFTEKTCEWVRSQVNQKPDQP 253 Query: 293 FFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIV 352 FF+Y H P+AK+ G S +S+GD +E + + K L+ G DNT+I+ Sbjct: 254 FFVYMPLNSPHSPIVPSAKFLGKS-GLSSHGDFCMETDWALGEVVKILKALGIEDNTMII 312 Query: 353 FTSDNG--------PEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRKSDGI 403 FT+DNG P E +RG KG T+EGG RVP V W G+ + SD + Sbjct: 313 FTADNGTSPMAKFEPMQEQGHFPSYIYRGLKGETYEGGHRVPFIVKWPKGLAPAKTSDQL 372 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS-NRKAEHYFLNGK 462 + DL T ++ G A D + +A + + Sbjct: 373 ICTTDLMATVAEINGIALANNVGE-------DSISFLPALREQAIPELANRAIVHHSDAG 425 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 + A+R ++K + + V+ A +F++ DPQES ++ ++ Sbjct: 426 VFAIRQGKWKLLLDNIGGSRRSNPK-----DKPVIDDAEIQLFDMVNDPQESTNLSQKNP 480 Query: 523 PMGVPLQTEMHAYMEILKK 541 + L+ ++ Y+ + Sbjct: 481 EIVEGLKKQLADYINKGRS 499 >UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT Length = 500 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. Identities = 92/471 (19%), Positives = 160/471 (33%), Gaps = 40/471 (8%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + + KPN+V L DD G+ D+ G + TP +D + + + T + Sbjct: 17 AVVATAQGAPSKPNIVFILADDTGYGDLSATGNPIL---KTPHLDKLYNAAVRFTDFHVS 73 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 P+ SPTR+ ++TG++ +G+ + + T+ Q+L GY T GKWH+G+ Sbjct: 74 PTCSPTRSALMTGRHEFKNGVTHTILERER-LNPDAITIAQVLKSAGYTTGIFGKWHLGD 132 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 + QP GFD+ T P + Y Sbjct: 133 EPDHQPGQRGFDEVFIHGGGGIGQTYPGSCGDAP-------GNTYFNPAILHNGSFE--- 182 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 K + + + +++ K +PFF Y H +Y Sbjct: 183 ------------KTQGFCTDIFTNQAIHWME-SVKGKQPFFCYIPYNAAHVPVSCPDEYK 229 Query: 314 ----GS-SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR 368 G +Y + +++ + L++ G +TL+VF +DNG Sbjct: 230 KPYEGKVDDHLATYFGMVANIDENVGRVLAKLDEWGIAKDTLVVFMNDNGGHGPACKVFN 289 Query: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 RG+KGS W GG R + W P + G+ D FPT +LAG + A Sbjct: 290 AGMRGSKGSAWLGGTRAVSLWRWSDTFAPHDAAGLASNIDFFPTLAELAGATPNEKAQK- 348 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 +DG N + + + +KY + + Sbjct: 349 ----QVDGRSLLPLLRDGNAPWPERVLFTHVGRWPKGADVQAYKYAACSVRSGQWHLVSD 404 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEIL 539 + G +F++ D E + H + L E + + Sbjct: 405 GPPGKP---REKGWKLFDVSKDIGEDHDVVAEHPDVVTRLDAEYDRWWASV 452 >UniRef50_Q7UIU1 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UIU1_RHOBA Length = 529 Score = 438 bits (1127), Expect = e-121, Method: Composition-based stats. Identities = 120/494 (24%), Positives = 192/494 (38%), Gaps = 51/494 (10%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRA 141 +PN+++ + DD+G DV TP + +A +GL A++ +PTR Sbjct: 47 ASRPNIILVMADDLGIGDVSPTNPD--CKIKTPRLQQMADEGLTFLDAHTPSSVCTPTRY 104 Query: 142 TILTGQYSIHHGILMPPMYG--QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE--- 196 +LTG+Y+ + + G + TL LL GY T IGKWH+G + Sbjct: 105 GLLTGRYNWRSRLAKGVLSGTSEHLIPGDRATLGHLLQGAGYHTAMIGKWHLGWDWHKNG 164 Query: 197 ----------SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 + P N GFD + G DM P + KQ P+ Sbjct: 165 KEIDFSKPVLNGPDNNGFDQYYGHCGSLDMPPYVWVDTGTPTSVPTRKEGVTKKQNPYGW 224 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 + +E + D + ++++ K DKPFFLY H Sbjct: 225 Y----------RNGPIGDDFEIEQVLPHLFDKSIAYVEERVKEDKPFFLYLPLPAPHTPI 274 Query: 307 YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP- 365 P + +S Y D +++M+ L + K G +NTL++FTSDNG E Sbjct: 275 VPVPPFKDAS-GMNPYADFVMQMDHHMGQLLDAISKAGIDENTLVIFTSDNGCSPEANFG 333 Query: 366 ----HGRTP---FRGAKGSTWEGGVRVPTFVYWKGM-IQPRKSDGIVDLADLFPTALDLA 417 HG P +RG K +EGG RVP V W G + + ++ + L D++ T + Sbjct: 334 ELAKHGHDPSGKYRGHKADIYEGGHRVPFIVRWPGKVVAGKTTNALTCLTDVYATLQSIT 393 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLI 477 P DG D T F G + S+R+A G A+R D +K + Sbjct: 394 DQPREATGGE-------DGFDLTDVF-GGDDSSDREALVSHSIGGSFAIRRDSWKLCLSH 445 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 + G +F+L TDP E +S+ + + L ++ Y+E Sbjct: 446 GSGGWSNPREPKAKLQG----LPPMQLFDLETDPAEKNSVAKENPEVVDSLLLLLNEYVE 501 Query: 538 ILKKYPPRAQIKSD 551 + ++ +D Sbjct: 502 TGRST-EGPKVAND 514 >UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_BACV8 Length = 464 Score = 437 bits (1126), Expect = e-121, Method: Composition-based stats. Identities = 110/509 (21%), Positives = 181/509 (35%), Gaps = 101/509 (19%) Query: 62 MQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVA 121 + A + +A+ K PNV+ + DD+G D+G G TP+ID +A Sbjct: 7 ILFSAALLSSGLTMAQT-TTAEKSPNVIYIMADDLGIGDLGCYGQRQI---KTPNIDGIA 62 Query: 122 SQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYG-------QPGGLQGLTTLP 173 G+ YS S+P+R ++TG++ H I + G T+ Sbjct: 63 QNGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVA 122 Query: 174 QLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 + + YVT +GKW MG E P GFD F G+ ++ + + E + Sbjct: 123 DIFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGYLGQRFAHSYYPEFLHENEQKIM 182 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP 292 D Y ++ + F+D+ + KP Sbjct: 183 LDGKYYSH--------------------------------DLMLEKALNFIDE--NAQKP 208 Query: 293 FFLYYGTRGCHFDNY--------------------PNAKYAGSSPARTSYGDCMVEMNDV 332 FFLY+ H D Y R +Y + ++ Sbjct: 209 FFLYFSPTIPHADLDIMGEAMTEYEGEFCETPFGGSRDGYKSQQNPRAAYAAMVTYLDKS 268 Query: 333 FANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPT 387 + K L++ G D+T+IVFTSDNG +E PFRG K +EGG+R P Sbjct: 269 VGLIIKELKEKGLYDHTIIVFTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPF 328 Query: 388 FVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT 446 + W G+I ++ I D PT +L + IDG+ G Sbjct: 329 VIQWPGVIPQGVVTNHISAFWDFLPTIGELVQADIPQN---------IDGISYLPTLTGK 379 Query: 447 NGQSNRKAEHY--FLNGKLAAVRMDE-FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS 503 Q +Y F G ++ + +K L + T Sbjct: 380 GTQKEHDCIYYEFFEFGGKQSIMTPDGWKLVRLEVSDPSKTYE----------------E 423 Query: 504 VFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 ++N+YTDP E+ ++ ++ + L+ + Sbjct: 424 LYNIYTDPAETSNVIKQYPDVAKKLKNMI 452 >UniRef50_A6DMX8 Iduronate-sulfatase or arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX8_9BACT Length = 532 Score = 437 bits (1124), Expect = e-121, Method: Composition-based stats. Identities = 117/531 (22%), Positives = 194/531 (36%), Gaps = 65/531 (12%) Query: 51 ATTIADNMMPVMQHPAQDKETQQKLAEL----------EKKTGKKPNVVVFLLDDVGWMD 100 + + +TQ K AE + + PN+V+ DD+G+ D Sbjct: 8 SVAKLLILQAFTCLSVCIAQTQSKSAEAPVSVLNEMRPKTTQSEYPNIVLIYADDLGYGD 67 Query: 101 VGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPM 159 + G A TP+ID +A G++ T +S + +P+R +LTG+Y + P Sbjct: 68 LSSYG---ATKIKTPNIDRLAKNGILFTDGHSTSATCTPSRYALLTGEYPLRINNYSPVF 124 Query: 160 YGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP----------QNVGFDDFR 208 TT+ LL +GY T +GKWH+G + +P +GFD F Sbjct: 125 CADRLIIDTKKTTIASLLKRKGYTTACVGKWHLGFGDKPKPDWNKELKPGPLELGFDYFF 184 Query: 209 GFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 G V+ R + ++P L+ R + RG + Sbjct: 185 GLPVVNSHPPFVYMENRRILGLDPNDPLTYKRGGKTYGKAYVGKHTSPHRGMPSVIGGKV 244 Query: 264 TPKYMED--LDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTS 321 D + ++ + ++++ DKPFFLYY + H P+ + G S Sbjct: 245 AHDLYVDELIGEKLTQKALTWMNQQ---DKPFFLYYASHNVHLPITPHPYFHGKSEC-GL 300 Query: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG----------RTPF 371 GD + E++ + +E+ G L+NT+ +FTSDNG + G Sbjct: 301 RGDFVEELDWSVGQIISAVERFGALENTIFIFTSDNGAIIKGKDQGDILDQLGHKPNGKL 360 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 +G K WE G RVP V W I SD ++ DL PT + G A Sbjct: 361 KGRKFGAWEAGHRVPFIVSWPNKIPAGKTSDALIANLDLLPTFAAITGQKLAPHEAR--- 417 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLN-GKLAAVRMDEFKYHVLIQQPYAYTQSGYQ 489 DG +Q LG + S R + ++R ++ Y Sbjct: 418 ----DGFNQLPLLLGKDTTSARSELIIQPHKRSHKSLRQGDWVYI----------PGAGD 463 Query: 490 GGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 GG+ ++NL DP + + + + ++ M+ Sbjct: 464 GGWVPAKKGELPKQLYNLKDDPYQQQNRINDFPERADAMASHLNKLMKQYG 514 >UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ89_PLALI Length = 536 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 123/547 (22%), Positives = 199/547 (36%), Gaps = 92/547 (16%) Query: 46 YLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 V P A +++ ++Q A+ L +PNVV L DD+GW +VG G Sbjct: 1 MFVLPEIRAALSVLLLIQLAAESL--WANELTLISHQSPRPNVVFILADDLGWGEVGCFG 58 Query: 106 GGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPM----- 159 PTP+ID +AS+G+ LT YS P+ +P+R ++TG++ H I Sbjct: 59 Q---SKIPTPNIDRLASRGVKLTRHYSGAPTCAPSRCVLMTGKHLGHAEIRGNQQAKVKL 115 Query: 160 ----YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVGFDDFRGFNSVS 214 GQ T+ + GY T A GKW +G +P GFD+F G+N + Sbjct: 116 PQFTEGQHPLSDKALTIARQFQKAGYATGAFGKWGLGPVGSTGEPNRQGFDEFFGYNCQA 175 Query: 215 DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQR 274 ++ + ++ + V + + + + + Sbjct: 176 LAHSYFPKALWKNAESIVNNEKP-----------VPGHKKQPEGEVTMEAYQGENYAPRL 224 Query: 275 WMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK-------------------YAGS 315 M + F+D+ + +PFFLY H P K Y Sbjct: 225 IMAEALSFIDRHHQ--QPFFLYLPFTEPHVAMQPPPKIVEEFPVEWDERVYRGDGGYLPH 282 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-----PEAEVPPHGRTP 370 R +Y + ++++ ++ +LEK+G L+ TLIVFTSDNG + G P Sbjct: 283 PRPRAAYAAMIRDLDNHVGDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGADP 342 Query: 371 --------FRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPG 421 +G KGS +EGG+RVP V W G I P + D FPT + P Sbjct: 343 LFFNSTRELKGFKGSIYEGGLRVPAIVSWPGQIPPATTINTPSYFPDWFPTLCNATQLPL 402 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQ-----SNRKAEHYFLNGKLAAVRMDEFKYHVL 476 + +DGV+ G + Y V + +FK Sbjct: 403 PEG---------LDGVNLLPLLTGKTSPDQFIRPDPMVWVYAEYTGQVCVHLGDFKVLRR 453 Query: 477 IQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYM 536 + + V+ L +DP ES ++ + + A Sbjct: 454 GLRT----------------NRPGPWEVYQLVSDPGESTNLADSRPDLVTKAIEVLKAQT 497 Query: 537 EILKKYP 543 + +P Sbjct: 498 APNEIFP 504 >UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB_HUMAN Length = 533 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 116/515 (22%), Positives = 191/515 (37%), Gaps = 78/515 (15%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 + P++V L DD+GW DVGF+G + TP +DA+A+ G++L + Y+QP +P+R+ Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGSRI----RTPHLDALAAGGVLLDNYYTQPLCTPSRS 96 Query: 142 TILTGQYSIHHGILM-PPMYGQPGG-LQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQ 198 +LTG+Y I G+ QP LPQLL + GY T +GKWH+G KE Sbjct: 97 QLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECL 156 Query: 199 PQNVGFDDFRGFN-SVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD + G+ D Y+ R ++ + +D G + Sbjct: 157 PTRRGFDTYFGYLLGSEDYYSHERCTLIDA--------LNVTRCALDFRDGEEVATGYKN 208 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG--- 314 + + + +KP FLY + H +Y Sbjct: 209 M-----------YSTNIFTKRAIALITNH-PPEKPLFLYLALQSVHEPLQVPEEYLKPYD 256 Query: 315 --SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 R Y + M++ N+ L+ +G +NT+ +F++DNG + + P R Sbjct: 257 FIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGN-NWPLR 315 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K S WEGGVR FV + Q + ++ ++D PT + LA T Sbjct: 316 GRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLA-------RGHTNGT 368 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHY-------------------------------FLN 460 +DG D S R + F Sbjct: 369 KPLDGFDVWKTIS-EGSPSPRIELLHNIDPNFVDSSPCPRNSMAPAKDDSSLPEYSAFNT 427 Query: 461 GKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS---SVFNLYTDPQESDSI 517 AA+R +K + Q + + +F++ DP+E + Sbjct: 428 SVHAAIRHGNWKLLTGYPGCGYWFPPPSQYNVSEIPSSDPPTKTLWLFDIDRDPEERHDL 487 Query: 518 GVRHIPMGVPLQTEMHAYME-ILKKYPPRAQIKSD 551 + + L + + Y + + Y P + D Sbjct: 488 SREYPHIVTKLLSRLQFYHKHSVPVYFPAQDPRCD 522 >UniRef50_Q7UYH4 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYH4_RHOBA Length = 479 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 109/491 (22%), Positives = 182/491 (37%), Gaps = 46/491 (9%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS- 132 ++P+VVV L+DD+G+ D G TP+ID++A G+ T+A++ Sbjct: 11 AFVSSALFAAERPHVVVILVDDMGYGDPGCFNPD--SKIETPNIDSLARDGMRFTNAHAP 68 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 P +R ++TG+Y + +P TL L QGY T +GKWH+G Sbjct: 69 GPLCHMSRYGLMTGRYPFRTDV--SVWPREPLIDPDQATLASLAKSQGYRTTMVGKWHLG 126 Query: 193 ENKESQ----------PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 + + P + GFD F G + +D+ + N + DR E Sbjct: 127 FEERANESYDRPLLGGPVDRGFDHFFGIRASTDIPPYFYINDRNAVHPPT-DRIEANASE 185 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSD--KPFFLYYGTR 300 +S R G ++ D+ + D + + ++ P LY Sbjct: 186 GWSPIQGAFWRAGGIAPDLELA-----DVLPHFTDVAISSIQAHPNAEDASPMMLYLAYP 240 Query: 301 GCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPE 360 H P+ ++ G S SYGD ++ ++ + TL+ + +T+++FTSDNGP Sbjct: 241 APHTPWLPSPEFTGKSKVD-SYGDFVMMVDHEVGRVLDTLKVHDMERDTIVIFTSDNGPV 299 Query: 361 AEVP------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTA 413 FRG K WE G R+P V W G S D +V DL T Sbjct: 300 WYENDTERFQHDSAGGFRGMKADAWEAGHRMPFIVRWPGHASASSSTDHLVCFTDLMATF 359 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT--NGQSNRKAEHYFLNGKLAA-VRMDE 470 D+ +P+ D L + R K +R + Sbjct: 360 ADI-------WETELPQDAGPDSHSFLPALLQQPFEEGTKRTEFVMRAGSKSTMTIRAGD 412 Query: 471 FKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 +K + S ++NL +DP E++++ H + LQ Sbjct: 413 WKLITGLGSGGFSKPS-----RIPPKPGGPTGQLYNLKSDPAEANNVYQDHPDIVQRLQK 467 Query: 531 EMHAYMEILKK 541 M ++ + Sbjct: 468 RMKQIVDDGRS 478 >UniRef50_UPI0001745666 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745666 Length = 497 Score = 436 bits (1122), Expect = e-120, Method: Composition-based stats. Identities = 108/489 (22%), Positives = 179/489 (36%), Gaps = 53/489 (10%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTR 140 +PN++ L+DD+G+ D+G G TP ID +A++G+ LT Y+ +P+R Sbjct: 34 AADRPNIIYILVDDMGYGDLGCFGQKTFT---TPHIDRMAAEGMKLTRHYAGSTVCAPSR 90 Query: 141 ATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQP 199 +LTG ++ H + ++ T+P LL GY T GK+ +G + P Sbjct: 91 CVLLTGLHTGHCRVRGNGLWT---MPDSDVTVPNLLKQAGYATACFGKYGLGKPLPDDDP 147 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G+ S + + + ++ + + + + G + Sbjct: 148 NRKGFDTFFGYVDTSHAHNFYPTYLIRNGQRVALNNVT----------EPGSRKAGHEDT 197 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH---------------- 303 + Q D +L A +PFF+YY H Sbjct: 198 GFATVDGRRQFAPQLIADELQTYLRDRAAGKQPFFVYYALNMPHANNEAGKNSPLKHGME 257 Query: 304 FDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV 363 +Y + M ++D + L+K G NTL++FTSDNGP AE Sbjct: 258 VPSYGEYANKDWPDVEKGFASAMRFVDDQVGAVLAALKKAGLDQNTLVMFTSDNGPHAEG 317 Query: 364 PP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLA 417 F G K S +GG+RVP W I+ +S+ + DL PT DLA Sbjct: 318 GHSSDFFDSNGAFSGIKRSMTDGGIRVPLVARWPAAIKARGESEHVSGFQDLLPTVADLA 377 Query: 418 GHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF---LNGKLAAVRMDEFKYH 474 G DG+ G +G+ + ++ G AV +K Sbjct: 378 GAKLEGET---------DGLSLVPTLTGKDGEQKQHKYLFWNFDEQGGKRAVLRWPWKLI 428 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 L Q+ G ++ ++NL D E +++ + L+ M Sbjct: 429 HLNTGTARMGQNAG-GKPQPVQPKSLEVQLYNLEEDVGEQNNLASLQPGIVSELEGYMKE 487 Query: 535 YMEILKKYP 543 + P Sbjct: 488 AWRAPQTQP 496 >UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C176_9PLAN Length = 599 Score = 436 bits (1122), Expect = e-120, Method: Composition-based stats. Identities = 97/498 (19%), Positives = 176/498 (35%), Gaps = 92/498 (18%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 + KPN+++ + DD G+ D+ +G + TP++D + + L LT+ + P+ Sbjct: 21 CPADTPDSGKPNIILVITDDQGYGDIAAHGNQMI---KTPNLDQLYQKSLRLTNFHVDPT 77 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 +PTR+ ++TG+YS G+ M G+ TL ++ GY T GKWH+G+N Sbjct: 78 CAPTRSALMTGRYSTRTGVWHTIM-GRSLMDTNEVTLAEVFKSNGYRTGLFGKWHLGDNY 136 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 +PQ+ GF T + +++Y Sbjct: 137 PLRPQDQGF------------GTVVQHGGGGVGQTPDDWQNDYFSDTYLRNGKPE----- 179 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY--- 312 K+ W D +KF++ A KPFF Y T H + +Y Sbjct: 180 ----------KFQGYCTDIWFDEALKFIE--ADRTKPFFAYLSTNAPHSPYLVDPEYSDP 227 Query: 313 ---AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEA-------- 361 G ++ + +++ L + L+++G NT+++F +DNG A Sbjct: 228 YEDKGVPKKMAAFYGMITNIDENMGRLLRYLKESGLEKNTILIFMTDNGTAAGLQRPSTE 287 Query: 362 ------------------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRK-SD 401 E P RG KGS ++GG RVP +++W G + K + Sbjct: 288 DLSKKQQRRLSKGKPITLETWPGFNARMRGTKGSEYDGGHRVPCYIHWPQGGLTGGKNIN 347 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG 461 + D+ PT DL + +DG G + Sbjct: 348 QLTAHIDILPTLADLCDLTIS-------SELKLDGTSLVPILTGNKDALRNRT------- 393 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 V Q+ + + ++++ DP ++ ++ + Sbjct: 394 -----------LIVHSQRIESPEKWRKSSVMAERWRLVNEKELYDIQNDPGQTKNVAAEY 442 Query: 522 IPMGVPLQTEMHAYMEIL 539 + L E + L Sbjct: 443 AGVVKYLSAEYEKWWSSL 460 >UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2URC2_AKKM8 Length = 465 Score = 436 bits (1122), Expect = e-120, Method: Composition-based stats. Identities = 111/507 (21%), Positives = 175/507 (34%), Gaps = 87/507 (17%) Query: 52 TTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVG 111 ++ + Q +L+ PN++V L DD+G+ D+G G Sbjct: 2 MKEVRLLLILCGLFCGTAVAQPRLSS-------PPNMIVILADDLGYGDLGCTGSKQI-- 52 Query: 112 NPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPP-------MYGQP 163 TP +D +A +G+ + AY P SP+R +LTG++ +GI P Sbjct: 53 -KTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLLTGRFPKRYGITTNPNIQMDYLPESHY 111 Query: 164 GGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDV 223 G Q +P+ L GY + GKWH+G K P GF + GF S Y Sbjct: 112 GLPQTEKLIPEYLAPCGYRSAVFGKWHLGHTKGYTPPERGFTHWWGFLGGSRHYFP---- 167 Query: 224 HVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFL 283 +K+ + V + + L D V+FL Sbjct: 168 ---------------VKKEAEGLNPSMIVSNFTDKT-------DITYLTDDITDRAVEFL 205 Query: 284 DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYK 338 + K KPFF++ H+ N + + R Y + M+ + Sbjct: 206 QEAGKDKKPFFMFVSYNAPHWPNEAKPEDIAKFRNVQNGERRVYCAMVYAMDRGIGRILD 265 Query: 339 TLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKG---MI 395 L+ +G +T++VF SDNG E PFRGAK +EGGVRVP + + ++ Sbjct: 266 ALKADGLEKDTIVVFLSDNGGAPEA-SSCNAPFRGAKRQHFEGGVRVPFIIRYPADKRLV 324 Query: 396 QPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAE 455 V DL P L G +DG+D R Sbjct: 325 PGSVCRQPVSSVDLLPALLKANGRHIP---------RKLDGMDILELVGNKGAPVPRT-- 373 Query: 456 HYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESD 515 ++ +AV + KY ++ +N+ DPQE Sbjct: 374 FFWCTDYTSAVLTGDMKYLLV---------------------PDRAPQFYNVADDPQEQR 412 Query: 516 SIG-VRHIPMGVPLQTEMHAYMEILKK 541 + RH L ++ Y+ Sbjct: 413 DLYFSRHQD-ADLLAKKLGTYLTTTPA 438 >UniRef50_Q488V4 Sulfatase family protein n=30 Tax=Bacteria RepID=Q488V4_COLP3 Length = 525 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 136/494 (27%), Positives = 210/494 (42%), Gaps = 36/494 (7%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 ++PN++ DD+G ++ G +G T +ID +A +G++ T Y + S Sbjct: 30 ASGTTDTERPNILAIWGDDIGQSNISAYTHG-MMGYKTTNIDRIAKEGVLFTDYYGENSC 88 Query: 137 SPTRATILTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENK 195 + RA +TGQY + G+ + G G T+ +LL D+GYVT GK H+G+ Sbjct: 89 TAGRAAFITGQYPVRTGLTKVGLPGSDKGLRAEDVTIAELLKDRGYVTGQFGKNHLGDKD 148 Query: 196 ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 E P N GFD+F G + PE P Y K+ + +H+ G Sbjct: 149 EFLPTNHGFDEFLGNLYHLNA-------EEEPEHPDYPKDQAYKKRFG-PRGVIHSFADG 200 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 + + +T K ME +D ++ KF+DK K++KPFF+++ H + + G Sbjct: 201 KIEDSGPLTKKRMETIDDEFLAATTKFIDKAHKNNKPFFVWFNATRMHIWTHLKEESKGL 260 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGA 374 S YGD M+E + L L++ DNT++++T+DNG E P G T PF+G Sbjct: 261 SKRGGIYGDGMMEHDYQVGVLLDQLDRLAIADNTIVLYTTDNGAEVFSWPDGGTIPFKGE 320 Query: 375 KGSTWEGGVRVPTFVYWKGMIQPRKSD-GIVDLADLFPTALDLAGHPGAKVA-------N 426 K +TWEGG RVP V W G I + +V D PT L AG K N Sbjct: 321 KNTTWEGGFRVPAMVRWPGKITAGDAKIEMVSHMDWAPTLLAAAGVTDIKEKLKQGTTVN 380 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAYTQ 485 +DG + + G ++ R + YF G L+AVR + K IQ+ Sbjct: 381 GKKYKVHLDGYNLLPYLTGATDEAPRPSYLYFTDGGDLSAVRFGDMKLQYSIQECEGL-- 438 Query: 486 SGYQGGFTGTVMQTAGSSVFNLYTDPQES-DSIGVRHIPM--------GVPLQTEMHAYM 536 + + + NL DP E + T M Sbjct: 439 ----NVWICPLTPLRAPLLTNLRQDPYERARDESGSY-ERWYVDHIFEFSRGITMTAQQM 493 Query: 537 EILKKYPPRAQIKS 550 + ++PPR + S Sbjct: 494 KTFVEFPPRQKPAS 507 >UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Planctomycetaceae RepID=A6CAR8_9PLAN Length = 501 Score = 435 bits (1120), Expect = e-120, Method: Composition-based stats. Identities = 118/536 (22%), Positives = 179/536 (33%), Gaps = 132/536 (24%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 K PN+++ + DD G+ D+G G + TP +D +A +G LTS Y P+ +P Sbjct: 32 KAAETPPNIIMIVSDDQGYRDLGSFGSEEIM---TPHLDRLAKEGAKLTSFYVTWPACTP 88 Query: 139 TRATILTGQYSIHHGILMPPMYGQP-------------------GGLQGLTTLPQLLHDQ 179 +R ++LTG+Y +GI P G LP LL Sbjct: 89 SRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPA 148 Query: 180 GYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYI 239 GYV+ GKW +G +K P GFDDF GF + Y V S Y Sbjct: 149 GYVSAIYGKWDLGIHKRFLPLARGFDDFYGFTNTGIDYFTHERYGV---------PSMYR 199 Query: 240 KQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGT 299 P +D + V+F+ + KPFFLY Sbjct: 200 NNQPTEEDK-------------------GTYCTYLFQREAVRFIKE--NHQKPFFLYLPF 238 Query: 300 RGCHF-----DN-----YPNAKYAGS---------------------------------- 315 H KY Sbjct: 239 NAPHGASSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSA 298 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 S R Y + M+D + L++ DNT++VF SDNG +P +G K Sbjct: 299 SKRRLEYVASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGS---GGADNSPLKGKK 355 Query: 376 GSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 G +EGG+RVP V + I+P D ++ +L PT L A P +P+ I Sbjct: 356 GMMFEGGIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIP-------LPENVVI 408 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTG 494 DG D +G + Y+ + A R+ +K+ + Sbjct: 409 DGYDMLPVLMGKTTSPRNE--MYWQRREDKAARVGHWKWVESEKGSG------------- 453 Query: 495 TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 +F+L D E + H ++ + + + PR + Sbjct: 454 ---------LFDLSQDIGEKHDLSPTHPKKLEEMKNHFANWKKQMADAEPRGPFRD 500 >UniRef50_A6UG37 Sulfatase n=16 Tax=Bacteria RepID=A6UG37_SINMW Length = 552 Score = 435 bits (1120), Expect = e-120, Method: Composition-based stats. Identities = 145/536 (27%), Positives = 235/536 (43%), Gaps = 41/536 (7%) Query: 34 RKGFAGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLL 93 RK G ++ + TI A Q+ + +GK PN++V Sbjct: 9 RKNAQGSISIDRRSLLLGGTILAAAAAANGAVAVGSAKAQE--QSSAGSGKTPNILVIFG 66 Query: 94 DDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHG 153 DD+G + G+ G TP+ID +A++G I T AY Q S + RA+ + GQ G Sbjct: 67 DDIGIPQISAYTMGLM-GYRTPNIDRIAAEGAIFTDAYGQQSCTAGRASFILGQEPFRTG 125 Query: 154 ILMPPMYGQPGGLQG-LTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNS 212 +L M G P G+Q + T+ ++ +GY T GK H+G+ E P N GFD+F G Sbjct: 126 LLTIGMPGDPHGIQDWMPTIADVMKSKGYATGQFGKNHLGDRDEHLPTNHGFDEFFGNLY 185 Query: 213 VSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLD 272 + PE P E+ K + + + G+ + + K ME +D Sbjct: 186 HLNA-------EEEPEGYFYPKDEEFRKNFG-PRGVIKSSADGKIEDTGALNTKRMETVD 237 Query: 273 QRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDV 332 + ++ F+D+ AK+DKPFF ++ + H + + G + + + D MVE + Sbjct: 238 EEFLAAAKDFIDRQAKADKPFFCWFNSTRMHVFTHLKPESMGKT-GKGIHADGMVEHDGH 296 Query: 333 FANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAKGSTWEGGVRVPTFVYW 391 L + L+ G +NT++++T+DNG E + P G T F G KG+TWEGG R+P V W Sbjct: 297 VGQLLQQLDDLGITENTIVLYTTDNGAELALWPDGAMTMFHGEKGTTWEGGFRIPMMVRW 356 Query: 392 KGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVA-------NLVPKTTFIDGVDQTSFF 443 G+++P + V L D PT AG P K +DG D T+ Sbjct: 357 PGVVKPGTQINDPVTLMDWMPTFATAAGIPDVKEEMKTGFKSGDKTFKVHLDGYDLTALL 416 Query: 444 LGTNGQSNRKAEHYF-LNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGS 502 G + R+A +YF G L A+R +++K + + + + Sbjct: 417 KGEAEEPPREAVYYFDQGGNLNAIRWNDWKLSFAVNSEGNIATATRETPSWANIA----- 471 Query: 503 SVFNLYTDPQES--------DSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 NL DP E R++ + VP+Q+++ + + +YP Q S Sbjct: 472 ---NLRMDPYERGTKEGGGAMEFIARNMWLLVPIQSKIKEFFQDFDQYP--YQPGS 522 >UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE Length = 493 Score = 435 bits (1120), Expect = e-120, Method: Composition-based stats. Identities = 103/529 (19%), Positives = 178/529 (33%), Gaps = 119/529 (22%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-P 134 ++ + PNV+ DD+G+ D+ G TP ID +A +G+ T +Y+ P Sbjct: 17 SDAQTDKQPHPNVIFIYADDLGYTDLSCTGSRF---YETPHIDKLAREGVCFTQSYAACP 73 Query: 135 SSSPTRATILTGQYSIHHGILMPPMYGQPGGL----------------QGLTTLPQLLHD 178 SSP+RA +LTG+Y + + G + T+ + Sbjct: 74 VSSPSRAALLTGKYPARINLTDYIPGDRAYGPHKNQRLASLPFNLHLSKDEITMAEAFRQ 133 Query: 179 QGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 GY T GKWH+ E+ E P+ GFD G N+ + + NP++ P+ Sbjct: 134 NGYSTFMAGKWHLAESAEYYPEQNGFDINIGGNNTGHPSKGYFSPYGNPQLKDGPEG--- 190 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 E L R D ++++ + +KPFF+Y Sbjct: 191 ------------------------------EYLTDRLTDEVIRYISE--PKEKPFFVYLS 218 Query: 299 TRGCHFDNYPNAKYAGSSPAR-------------------------TSYGDCMVEMNDVF 333 H A+ + +Y + +++ Sbjct: 219 YYTVHLPLQAKAEKIAKYRRKLSRAVPADSSFVKKGETYHKLVQDIPAYAAMVESLDENI 278 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP-----PHGRTPFRGAKGSTWEGGVRVPTF 388 L TL ++G + T++VFTSDNG A P P R KG +EGG++VP Sbjct: 279 GRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPTSNLPLRAGKGYLYEGGIKVPAI 338 Query: 389 VYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 + W ++ R SD + D +PT + +DGV G Sbjct: 339 IRWSRHLKGRQVSDTPIIGTDYYPT-------LLDLCGLPLLPGQHVDGVSMKPVLQGGR 391 Query: 448 GQSNRKAEHYFLNGK------LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG 501 HY AA+R ++K + Sbjct: 392 LSRPSLFWHYPHYSGGLGGRPSAAIREGDYKLIEF--------------------FEDHH 431 Query: 502 SSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 ++N+ D E + + + L+ +++ + + + P Sbjct: 432 VELYNVIQDESEEKDLSQIYPEIADGLRKKLYLWYKEVGARMPVDNPHY 480 >UniRef50_UPI0001AEC7EA iduronate-sulfatase and sulfatase 1 precursor n=1 Tax=Alteromonas macleodii ATCC 27126 RepID=UPI0001AEC7EA Length = 590 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 110/580 (18%), Positives = 203/580 (35%), Gaps = 76/580 (13%) Query: 3 FSFSPKRLVVAVAAALPLMASAADTPSTATARKGFAGYDHP----------NQYLVKPAT 52 +SF+ L V + + + + A T S + + + Sbjct: 43 YSFT---LPVEIKSDQSINLTLASTKSNGAPNLTVQSWSSSLTITPNIQKTGEGVFSFVA 99 Query: 53 TIADNMMPVMQHPAQDKETQQKLAELEKK--TGKKPNVVVFLLDDVGWMDVGFNGGGVAV 110 P+ E+ + + K KPN+VV DD G+ DVG + + Sbjct: 100 PQVRKRTPLTITAKLLTESGVEFSIDAKTLIVPSKPNLVVVFTDDQGYADVGAHN--IVN 157 Query: 111 GNPTPDIDAVASQGLILTSAY-SQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL 169 TP+ID +A++G + T+ Y + P +P+RA ++TG Y G+ + Sbjct: 158 DIETPNIDKLAARGALFTNGYITAPQCTPSRAAMITGVYQQRFGVDDN---RYTPIPNNV 214 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 T+ Q + GY T +GKWH+ ++ S+P F + + + + + Sbjct: 215 VTMGQRFSELGYTTGLVGKWHLEIDQNSKPW---FRENY--PNTPIEQFNPGKLPSSVKE 269 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 P Y + A + + D +F++ Sbjct: 270 QFYPSSMGYQYNYFGYANRYWANFDLKGNETKLGWVNNTDYRLDVVSDAATQFIE--MNY 327 Query: 290 DKPFFLYYGTRGCHFDNYPNAKYAGS----SPARTSYG-DCMVEMNDVFANLYKTLEKNG 344 D+PF+L+ H Y R Y M ++ + LEK+G Sbjct: 328 DEPFYLHVAHYAPHVPLAATDDYLSLFPENDSVRRRYALAMMYAVDTGVGQIVSQLEKHG 387 Query: 345 QLDNTLIVFTSDNGP------------EAEVPPHG-RTPFRGAKGSTWEGGVRVPTFVYW 391 L+NT+I F SDNG E E TP G KG +GG++VP + W Sbjct: 388 ILENTIITFISDNGAPVGLDFTDAPVSENEAWNGSLNTPLLGEKGMLTDGGIKVPFIIQW 447 Query: 392 KGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS 450 IQ + V D+ +A+ AG + ++ +DGVD G++ Sbjct: 448 PNEIQGNTVIEEPVISLDVLYSAIKAAGAAESTLS-------ELDGVDIFPT-QGSDISG 499 Query: 451 NRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD 510 ++ +AVR+ +KY + + +F+L + Sbjct: 500 LANRPLFWRFWNQSAVRLGNYKYLKMGSEHEF---------------------LFDLTKE 538 Query: 511 PQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 +++ L+ + + + + P ++++ S Sbjct: 539 ESNQNNLISMMPVKAEELKKLYEQWNKEMLRQPEQSKLNS 578 >UniRef50_A6DHI4 Arylsulfatase A (ASA) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI4_9BACT Length = 511 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 125/518 (24%), Positives = 201/518 (38%), Gaps = 85/518 (16%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY-SQPSS 136 +KPN+V DDVG+ DVG G PTP ID +A G+ T + S + Sbjct: 16 ASLTAAEKPNIVFIYGDDVGFGDVGVYGSEKI---PTPHIDKLAKGGIQFTDGHCSAATC 72 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 SP+R +LTG ++ HG+ + P + TLP++L + GYVT +GKWH+G + Sbjct: 73 SPSRFAMLTGVHAFRHGVNILPPNAPLSIPTDIPTLPKMLRENGYVTGVVGKWHLGIGAK 132 Query: 197 S-----------QPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIK 240 P +GFD S +D R + +P + R+ Sbjct: 133 GVETDWNGDVKPGPLEIGFDQMFLLPSTNDRVPCVYLDGHRVYNYDPNDPIYVGRTLESV 192 Query: 241 QLPFSKDDVHAVRGGEQQA-----------------------IADITPKYMEDLDQRWMD 277 P S A + E + E + +++ Sbjct: 193 NKPGSTQYGDARKNPELMTYYPSTHGHNNSVINGIGRIGFMSGGEKALWNDETMADVFVE 252 Query: 278 YGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLY 337 +F+ + AK DKPFFLY+ ++ H P+ ++ G++ GD MV+ + L Sbjct: 253 KASEFIKEKAKGDKPFFLYFASQDIHVPRAPHPRFQGAT-KLGKRGDAMVQFDWCTGALM 311 Query: 338 KTLEKNGQLDNTLIVFTSDNGP-----------------EAEVPPHGRTPFRGAKGSTWE 380 K L++ G DNT++ F+SDNGP E + G +RG K +E Sbjct: 312 KALDEAGVADNTIVFFSSDNGPVYDDGYADGSVTKTSSKETDHGHDGSGIYRGGKYQIYE 371 Query: 381 GGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQT 440 GG RVP + W I+P SD +V+ DL+ + L GH + K ID D Sbjct: 372 GGTRVPFIISWPAKIKPAVSDAMVNQVDLYTSFAKLVGHD-------LRKEEAIDSRDTL 424 Query: 441 SFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTA 500 + FLG Q AVR ++K+ + + Sbjct: 425 AAFLGEESQGL-DYMFNEARKTDHAVRQGKWKFISKGGKKKKKSND-------------- 469 Query: 501 GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 +++L DP E ++ + ++ + + Sbjct: 470 --ELYDLEADPSEQKNVVKEFPEVAGDMKKLLEQVRKS 505 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 101/472 (21%), Positives = 175/472 (37%), Gaps = 62/472 (13%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 + +PNVV+ + DD G+ D GF G V TP+IDA+A++ +LT + P+ Sbjct: 23 AHSVRAADRPNVVMVITDDQGYGDCGFTGNKVV---QTPNIDALAAESSVLTDYHVAPTC 79 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 SPTR+ ++TG ++ G+ G+ T ++ D GY T GKWH+G+N Sbjct: 80 SPTRSALMTGHWTNRTGVWHTI-SGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHLGDNYP 138 Query: 197 SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGE 256 + ++ GF + ++ F H + + Sbjct: 139 YRAEDNGFTEVYRHGGGG-----------------VGQTPDFWDNAYFDGSYFHNGKAVK 181 Query: 257 QQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSS 316 + + G +F+ + ++D+PFF Y T H + KY Sbjct: 182 AE----------GFCTDVFFKEGNRFIRECVEADEPFFAYIATNAPHGPLHAPQKYIDMY 231 Query: 317 PAR----TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 P ++ + ++D K L + G DNT+ +FT+DNG + R Sbjct: 232 PEMNDNVATFFGMITNVDDNVGQTRKLLRELGVHDNTIFIFTTDNGTAGGASVY-NAGMR 290 Query: 373 GAKGSTWEGGVRVPTFVYWK--GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 G KGS +EGG RVP +++ G + R ++ + D+ PT LD+ G P+ Sbjct: 291 GKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVPTLLDMCGVEA-------PE 343 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 + DG S S Q+ + Sbjct: 344 SVKFDGTSIVSLLKDEVDSSFNDRM-----------------LITDSQRVIDPIKWRQSS 386 Query: 491 GFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 G ++N+ DP + ++I H ++ A+ L+ Sbjct: 387 VMQDKWRLINGKELYNIANDPGQENNIAGDHPEQVASMRAFYEAWWAELEPT 438 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 95/491 (19%), Positives = 173/491 (35%), Gaps = 66/491 (13%) Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 + A A+ ++ +PNV++ + DD G+ D+ +G TP++D + Sbjct: 34 WILLAACLTTCSPAWAQTASES--RPNVILVVTDDQGYGDMSCHGNPWLN---TPNLDRL 88 Query: 121 ASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQG 180 A+Q + L + + P +PTRA ++TG+Y G G+ TT+ + + G Sbjct: 89 ATQSVRLENFHVDPFCTPTRAALMTGRYCTRVGAWA-VTEGRQLLDPDETTMAETFRESG 147 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Y T GKWH+G+ P+ G + + + +P ++Y Sbjct: 148 YRTGMFGKWHLGDPPPFAPRERGLETVVRHMAGG------------ADEIGNPTGNDYFD 195 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 + + G W + + F+ +S++PFF Y T Sbjct: 196 DTYYRNGTPESFDG---------------YCTDIWFEEAIDFI--QKESEQPFFAYIPTN 238 Query: 301 GCHFDNYPNAKY------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFT 354 H +Y G P R ++ + ++ L K L+++ DNT+++F Sbjct: 239 AMHSPYLVADRYSDPFKRQGIEPQRAAFYGMIQNFDENLGRLLKRLDQDNLRDNTMLIFM 298 Query: 355 SDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLAD 408 SDNG RG KGS +EGG RVP F W + D + D Sbjct: 299 SDNGTAQGASEQNRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCHRD 358 Query: 409 LFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRM 468 PT ++L P DG ++ Q + Sbjct: 359 WLPTLIELCDLK-------RPADVTFDGRSMAGLLSHSSQQWPERTLVIERQPDN----- 406 Query: 469 DEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 ++ ++ + ++++ DP + +I + + L Sbjct: 407 -------VVSATKTQGRAQPPFVVLTDRWRLVRDELYDIQNDPGQIKNIAAEYPEVVREL 459 Query: 529 QTEMHAYMEIL 539 + E AY E + Sbjct: 460 RAEYDAYFEDV 470 >UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CBC Length = 496 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 112/511 (21%), Positives = 181/511 (35%), Gaps = 87/511 (17%) Query: 51 ATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAV 110 +T++ ++ ++ + KPN+V+ L DD G+ V Sbjct: 8 TSTLSTALLAILPIAKASAQ------HTTPSHPDKPNIVIILADDQGYGGVNCY--PHIK 59 Query: 111 GNPTPDIDAVASQGLILTSAYSQP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL 169 TP+ID +A+ G+ Y+ SSPTRA ++TG+Y G G Q Sbjct: 60 KIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTGKYQQSFGFYGLSTPHVGGIPQDQ 119 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 L + L + GY T IGKWH+G+ S P N GF F GF + Y + Sbjct: 120 KLLSEYLVENGYNTACIGKWHLGDYIRSHPNNRGFQTFFGFINGLHDYYD---------P 170 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 + L F+ D++ V ME + V F+ + Sbjct: 171 LVGGSWDGVYNGLAFTLDNMEPVTE-------------MEYSTYEYTKRAVDFI--QKNA 215 Query: 290 DKPFFLYYGTRGCHFDNYPNAKY----------AGSSPARTSYGDCMVEMNDVFANLYKT 339 D PFFLY H + G ++ + +T Sbjct: 216 DHPFFLYLPYNAIHSPLQAPEELIGELAINPQEIGKDDIAR---AMTFALDQGVGKVVET 272 Query: 340 LEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK 399 LE+ G DNT+I + SDNG V + FRG KGS +EGG+RVP V + + Sbjct: 273 LEQLGLRDNTIIFYLSDNGA---VEYSDKWEFRGRKGSYYEGGIRVPFIVSYPAKLAKGT 329 Query: 400 S-DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF 458 + V D+ PT ++LAG A + GV+ + G + ++ Sbjct: 330 IYNKPVMSIDIAPTVMELAGLSHAD----------MHGVNLLPYLSGKDRTEPHDVLYWS 379 Query: 459 LNGKL--------AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTD 510 K A+R ++K Y ++++ D Sbjct: 380 TEKKSNNQVFKNEFAIRQGKWKLVSDPHFEKDYD-------------------LYDIEAD 420 Query: 511 PQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 PQE + ++ L ++ + + Sbjct: 421 PQEKHGLKDQYPEKYKELFGMYLNWINQMPE 451 >UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ15_9BACT Length = 469 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 102/514 (19%), Positives = 184/514 (35%), Gaps = 108/514 (21%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-S 135 +KPN++ L+DD+G+ D+ G TP+ID + +G++ T YS Sbjct: 12 AGSAIANEKPNIIYLLVDDLGYGDLSLYGQK---KFSTPNIDRIGKEGMVFTDHYSGSTV 68 Query: 136 SSPTRATILTGQYSIHHGILMP------PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 +P+RA ++TG++S H + G+ +L +++ GY T IGKW Sbjct: 69 CAPSRAALMTGKHSGHGLVRGNYEVGPHGFGGELPLRPEDVSLAEVMKSAGYATGLIGKW 128 Query: 190 HMG-ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 MG + +P+ GFD GF + + + + + Sbjct: 129 GMGMDGTTGEPRKKGFDYSYGFLNQAHAHHYYPE-------------------------- 162 Query: 249 VHAVRGGEQQAIADITPKYME-DLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNY 307 + GE+ I + + + + G++F+++ DKPFFL++ H + Sbjct: 163 -YIYENGEKLMIPENKDDARGLYISDTFAEKGIEFVEE--NKDKPFFLFWAFVTPHAELL 219 Query: 308 PNAK------------------------------YAGSSPARTSYGDCMVEMNDVFANLY 337 YA R ++ + ++ +L+ Sbjct: 220 VPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFGVYASQDHPRAAFSGMITRLDKRVGDLF 279 Query: 338 KTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWK 392 LE+ G DNT+I+F+SDNGP E G K EGG+RVP V W Sbjct: 280 DKLEELGIDDNTIIMFSSDNGPHKEGGADPDFFDSNAELTGYKRDLTEGGIRVPFMVRWP 339 Query: 393 GMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN 451 +++ R KS D+ PT ++A + IDG+ G Q + Sbjct: 340 NVVKARSKSSHASAFWDVMPTIAEIANTDSPE---------DIDGLSFLPALKGEKQQVH 390 Query: 452 RKAEHYFLNGKL--AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 + F A+RM +K +++L + Sbjct: 391 KHLYWEFHERGYTEQALRMGNWKAIRHGVNS--------------------PIKLYDLIS 430 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 D E + + ++ + + + +P Sbjct: 431 DESEQNDVSAKYPATAKHITNILDTERTDSELWP 464 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 434 bits (1118), Expect = e-120, Method: Composition-based stats. Identities = 112/527 (21%), Positives = 182/527 (34%), Gaps = 128/527 (24%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATI 143 +PN + +DD+GW D+ G TP+ID + QG++ ++Y+ P SP+RA+ Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTF---YETPNIDRLCRQGMVFANSYASCPVCSPSRASC 60 Query: 144 LTGQYSIHHGILMP--------PMYGQ-------PGGLQGLTTLPQLLHDQGYVTQAIGK 188 LTG+Y G+ P+ G+ +G T+ Q L D GY T +GK Sbjct: 61 LTGKYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHLPEGEYTIAQALKDAGYDTWHVGK 120 Query: 189 WHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 WH+G +E P++ GFD G S + + + ++ P+ Sbjct: 121 WHLG-GREFYPEHFGFDVNIGGCSWGHPHDGYFSPYGIETLSEGPEG------------- 166 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK--SDKPFFLYYGTRGCHFDN 306 E L R D V+ L K S KPF++ H Sbjct: 167 --------------------EYLTDRITDEAVRLLRKRQACGSRKPFYMNLCHYAVHTPI 206 Query: 307 YPNAKYAGSSPARTS------------------------------------YGDCMVEMN 330 + + Y + ++ Sbjct: 207 QVKDEDRARFEKKARELGLDKETALVEGEFHHTEDKKGRRVVRRVIQSDPSYAGMIWNLD 266 Query: 331 DVFANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTF 388 L + L + G+ +NT++VFTSDNG +E P P KG +EGG RVP Sbjct: 267 QNIGRLLEALRECGEEENTVVVFTSDNGGLATSEGSPTCNLPASEGKGWVYEGGTRVPLI 326 Query: 389 VYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN 447 V + G + P D V D +PT L+LAG P IDG G Sbjct: 327 VKYPGRVAPGSRCDVPVTTPDFYPTFLELAGVPQKAGI-------PIDGRSIVPLLSGNP 379 Query: 448 GQSNRKAEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG 501 R ++ + +V M ++KY + Sbjct: 380 MP-ERPIFWHYPHYGNQGGTPASSVVMGDYKYIEF--------------------FEDGR 418 Query: 502 SSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQI 548 +++L D E++++ + L+ +H + + P Sbjct: 419 GELYDLKADFSETNNLCEKMPETAARLRMLLHGWQREVCARFPEENA 465 >UniRef50_D2R207 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R207_9PLAN Length = 495 Score = 434 bits (1118), Expect = e-120, Method: Composition-based stats. Identities = 104/525 (19%), Positives = 188/525 (35%), Gaps = 80/525 (15%) Query: 56 DNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTP 115 ++ + + +PN++ + DD+G+ DVG G V TP Sbjct: 7 RALLQYFAFTVVCLSANYLVRSAQAADSDRPNIIWLMADDLGYGDVGCYGQKVIA---TP 63 Query: 116 DIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPG---GLQGLTT 171 +ID +A +GL T YS +P+R+ ++TG + H + G P T Sbjct: 64 NIDQMAREGLRFTQFYSGATVCAPSRSVLMTGLHHGHTRVRGNAGAGNPAAQALRADDFT 123 Query: 172 LPQLLHDQGYVTQAIGKWHMGEN---KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPE 228 + + L GY T +GKW +G++ P+ GFD+F G+ + + + P Sbjct: 124 VAKFLQQAGYRTALVGKWGLGDDGQASTGLPRKQGFDEFVGYLNQRHAHNHF------PS 177 Query: 229 VALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAK 288 + + +P E+ + K ++ D + + F+++ Sbjct: 178 FLWRNEEKFPLPNVP----------ELEEPDGSGYPKKAVQFADDLLTEEALAFVER--N 225 Query: 289 SDKPFFLYYGTRGCH--------------FDNYPNAKYAGSSPARTSYGDCMVEMNDVFA 334 ++PFFLY+ H ++ + + ++ Sbjct: 226 REQPFFLYWTPVIPHANNERARDLGNGAQVPDFGPYEKETWPEQDKGQAAMIHRLDTYVG 285 Query: 335 NLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-----RTPFRGAKGSTWEGGVRVPTFV 389 + L++ TL +FTSDNGP E + + G K S +GG+RVP Sbjct: 286 RMLAKLKQLKLDQKTLFIFTSDNGPHNEARHNLERFQPSGSWTGIKRSLHDGGIRVPMIC 345 Query: 390 YWKGMIQPRKSDGIVDLA-DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNG 448 +W G I P++ V + D F TA +LA P +D + S G + Sbjct: 346 WWPGTIAPQQVSEHVGYSGDFFATAAELASRPAPAG---------LDSISFASTLRGDSS 396 Query: 449 QSNRKAEHYFLNGK----LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSV 504 + + Y+ + A + +K L A +V Sbjct: 397 KQAKHEFLYWEFHENGFSQATLCEGRYKGIRLR-------------------DPDAPIAV 437 Query: 505 FNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIK 549 ++L TDPQE I + + L + + + +P R Sbjct: 438 YDLQTDPQERVDIAATNPALAARLDHYLKSARTTNEDWPARKPAA 482 >UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAN4_9FIRM Length = 470 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 103/528 (19%), Positives = 178/528 (33%), Gaps = 126/528 (23%) Query: 85 KPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTRATI 143 +PNV++ L+DD+GWMD+ G TP ID + +G+ AY+ P SP+RA+I Sbjct: 4 QPNVIMILIDDLGWMDLSCQGS---SFYETPHIDQLRREGMAFDQAYAACPVCSPSRASI 60 Query: 144 LTGQYSIHHGILMPPMYGQPG--------------GLQGLTTLPQLLHDQGYVTQAIGKW 189 L+G+Y + + ++ + + GY T +GKW Sbjct: 61 LSGKYPARLKVTDWIDHENYHPCRGKLIDAPYIKELSVSEFSMAKAFQEAGYQTWHVGKW 120 Query: 190 HMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 H+G+ P++ GFD G + + + ++ P+ Sbjct: 121 HLGKEATY-PEHHGFDVNLGGSWWGHPKKGYFSPYHMENLSDGPEG-------------- 165 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 E L R + + +PFFL H Sbjct: 166 -------------------EYLTDRIGAEAAALI-RSRDPQRPFFLNLWHYAVHTPLQAK 205 Query: 310 AK----YAGSSPART--------------------------------SYGDCMVEMNDVF 333 A+ + + Y + ++D Sbjct: 206 AEDIAYFEEKAKRMGLDQQDPFEIGDPFPILQKKDKRITRRIVQSDPVYAAMIKALDDSV 265 Query: 334 ANLYKTLEKNGQLDNTLIVFTSDNGP--EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYW 391 L TL+ G ++T+++FTSDNG AE P P KG +EG VR P FV W Sbjct: 266 GQLMATLKAEGLDEDTIVIFTSDNGGLATAEHSPTCNFPLSEGKGWMYEGAVREPLFVRW 325 Query: 392 KGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS 450 G I+ S + D +PT L+L G P + DGV L + Sbjct: 326 PGKIEAGSLSHALTTSPDFYPTLLELCGLP-------LRPQQHCDGVSLAPVLLNPQAKF 378 Query: 451 NR-KAEHYFLNGKLA------AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSS 503 +R ++ + A+R ++KY + Sbjct: 379 DRGPIFWHYPHYGNQGGTPGSALRCGKWKYIEFY--------------------EDHSVR 418 Query: 504 VFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKSD 551 +F+L D E ++ + + + +H ++E + + P ++ Sbjct: 419 LFDLEQDVSEKHNVAEVYPDLVRQFHSLLHEWLEAVDAWYPEVNPHAE 466 >UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF13_PLALI Length = 461 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 110/489 (22%), Positives = 177/489 (36%), Gaps = 60/489 (12%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 ++ ++ + + A E + ++PN+++ L DD G + G TP I Sbjct: 5 LVLILAFQFTSQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQGHP---RYKTPHI 61 Query: 118 DAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQP---GGLQGLTTLP 173 D++ G+ Y SP+RA +L G+Y G G + T LP Sbjct: 62 DSIGKNGVHFRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLP 121 Query: 174 QLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSP 233 QLL + GY T A+GKWH+G + P GF D+ GF S Y Sbjct: 122 QLLKEDGYRTIALGKWHLGYAPQFHPMERGFTDYYGFLQGSRSYFPL------------- 168 Query: 234 DRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPF 293 + I + + D + ++ + +P+ Sbjct: 169 --------------KKPTRLNQMLRDRTAIPEEQFGYMTDHLADEAIAYIKQ--WQSQPW 212 Query: 294 FLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVF 353 +Y H N A ++ Y + ++ + L++ G +TL++F Sbjct: 213 MMYLAFNATHSPNDATAVDLQAADGNKIYA-MTIALDRAVGKVLDALKECGLSKDTLVIF 271 Query: 354 TSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 +DNG H G KGSTWEGG R+P V + I D V DLFPT Sbjct: 272 INDNGGA---GGHDNGSLHGKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPT 328 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFK 472 LD+AG A++ + +DG+ G Q Y+ +GK A+R K Sbjct: 329 ILDVAGLGDAELKKIPFDPEKLDGISLIPRMTGKT-QRLVDRPLYWKSGKRWAIRQGNLK 387 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 +G Q +F+L +DP E ++ H L+ Sbjct: 388 AV------------------SGNDDQGDQVELFDLSSDPDEQRNLAATHPDELQQLEALY 429 Query: 533 HAYMEILKK 541 + L+K Sbjct: 430 RKWESTLEK 438 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 120/510 (23%), Positives = 186/510 (36%), Gaps = 113/510 (22%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPS 135 L ++ + PN+++ L+DD+GW DVGF G TP ID +A GL+ T AY+ P+ Sbjct: 34 SLAAESRRPPNIILILMDDMGWRDVGFMGNKFV---ETPHIDRLAKTGLVFTQAYASAPN 90 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQPGGLQ---------------GLTTLPQLLHDQG 180 +PTRA +++GQY+ HGI QP G + T+ + L D G Sbjct: 91 CAPTRACLMSGQYAPRHGIYTVVDPRQPPGSPWHKWQAAESKSELDTNVVTIAEALRDGG 150 Query: 181 YVTQAIGKWHMGENKES--QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEY 238 Y T G W++G + P GF + + Sbjct: 151 YATAFFGMWNLGRGRTGPVTPGGQGFQ-----------------------------KVVF 181 Query: 239 IKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYG 298 + L F KD + L R D +KF+D+ ++PFF+Y Sbjct: 182 PENLGFGKD--------------EYFDDGKHYLTDRLTDEVLKFVDEHR--EQPFFVYLP 225 Query: 299 TRGCHFDNYPNAKYAGSSPARTS----------YGDCMVEMNDVFANLYKTLEKNGQLDN 348 H P + + + + ++ + L++ DN Sbjct: 226 DHAIHAPFNPKPELLAKYERKAAASNDRRDDPACAATIEAVDHNVGRIMDHLKRLKLSDN 285 Query: 349 TLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLA 407 T+++FTSDNG + P P RG KG +EGG+RVP V G+ + D V Sbjct: 286 TVVIFTSDNGGTQQYTP----PLRGGKGELYEGGIRVPLVVAGPGVKSLGSRCDVPVSSI 341 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF-----LNGK 462 DL+PT L+LAG P+ +DGV G + +F Sbjct: 342 DLYPTLLELAGIKP-------PEGQVLDGVSLAPLLQGDATLDRERLFWHFPCYVGKATP 394 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 +A+R +FK + +FNL DP E ++ Sbjct: 395 SSAMREGDFKLIEF-------------------FEEGGRVELFNLKNDPNEEKNLASVMP 435 Query: 523 PMGVPLQTEMHAYM-EILKKYPPRAQIKSD 551 L + A+ + PP D Sbjct: 436 DKAAALAKTLRAWQKKTNASIPPGPNPSYD 465 >UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S8Q2_NEMVE Length = 540 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. Identities = 116/494 (23%), Positives = 184/494 (37%), Gaps = 77/494 (15%) Query: 83 GKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRAT 142 P+++ L+DD+GW DVG++ ++ TP+ID +ASQG+ L S YSQP +P+R Sbjct: 32 AGPPHIMFILMDDLGWSDVGYHN--ISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGA 89 Query: 143 ILTGQYSIHHGILMPPMYGQP--GGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQP 199 ++TG+Y IH G+ + G + T+PQ L GY T IGKWH+G + + P Sbjct: 90 LMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYTP 149 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F GF + + + L F +D+ A G Q Sbjct: 150 LRRGFDSFLGFFAGEQDHWRHSKMGF----------------LDFRRDEEPANEYGGQ-- 191 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS---- 315 + + + + +P FL H + Sbjct: 192 ----------HSTDVFTQEAIN-IAMRHNASQPLFLLLSYAAVHTPLQAHPNDVNKIGGV 240 Query: 316 -SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGA 374 R +Y M + L ++NG +NTL+++ SDNG + P RG Sbjct: 241 SDKDRQNYLGMMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKGGGYNWPLRGY 300 Query: 375 KGSTWEGGVRVPTFVYWKG---MIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 K S +EGGVRVP FV+ G + + + + D +PT + LAG Sbjct: 301 KSSLFEGGVRVPAFVH--GEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEV---------E 349 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHY-----------------FLNGKLAAVRMDEFKYH 474 IDGVDQ S R+ + F AA+R K Sbjct: 350 PDIDGVDQWPTLS-EGKPSKREEILHNIDIPANQEEERMAPRGFNYYSGAALRRGHMKLV 408 Query: 475 VLIQQPYAYTQSGYQGGFT------GTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPL 528 + Y + +++N+ DP+E + + + + L Sbjct: 409 YKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITADPEERNDLSKLNPDIVDSL 468 Query: 529 QTEMHAYMEILKKY 542 + +Y Sbjct: 469 WRRLQELNATSLEY 482 >UniRef50_Q7UN55 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UN55_RHOBA Length = 501 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. Identities = 107/532 (20%), Positives = 174/532 (32%), Gaps = 72/532 (13%) Query: 38 AGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGK----KPNVVVFLL 93 P Q + +++ + + + +PN++ + Sbjct: 3 LARIAPIQTSSQFLRSLSRLALAFCCIAVSYRVVSGDESSKADSPASGDALRPNIIYVMA 62 Query: 94 DDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHH 152 DD+G+ D+G G TP +D +A+ G+ T Y+ P+R T+ TG++ Sbjct: 63 DDLGYGDLGCYGQTRI---QTPHLDQMAADGIRFTDHYAGHTVCRPSRLTLWTGKHVGST 119 Query: 153 GILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-------NKESQPQNVGFD 205 G++ G T+ LL D GY T +GKW +G P GFD Sbjct: 120 GLIGNAARNLTG---EQPTVASLLSDAGYATGGVGKWALGNVDVPEEIENPGHPLANGFD 176 Query: 206 DFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITP 265 + G+ + S+ + + P + S D + R + Sbjct: 177 AWTGYMNQSNAHNYY------PRFLWQNYERRFFPGNVISTDPIARGR---------VAV 221 Query: 266 KYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH--------------FDNYPNAK 311 K D F+ + PF L+ H +Y Sbjct: 222 KRESYSHDVMTDAAFDFIREHRSD--PFLLHVHWTIPHANNEGGRLNGDGMEVPDYGIYA 279 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----H 366 G + + M+ L LE+ + TL++FTSDNGP E + Sbjct: 280 DEGWPNPEKGFAAMITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNGPHHEGGHSDLFFN 339 Query: 367 GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVA 425 P +G+K S EGG+RVP W G I+P SD D PTA +LAG Sbjct: 340 SSGPLQGSKRSMHEGGIRVPFIAKWPGTIEPGTISDHPSAFWDFLPTACELAGAEPPA-- 397 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYF---LNGKLAAVRMDEFKYHVLIQQPYA 482 IDG+ L + + Y+ +R +K Sbjct: 398 -------DIDGISYLPALLDQPKKQTKHRYLYWASSEGPTSVGLRSGTWKAVNYPGGTKK 450 Query: 483 YTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + V+ G +F+L +DP E + + H L Sbjct: 451 RRSGN-----SKPVVNEDGWKLFDLASDPGEKNDVSKDHPAELERLVEMARE 497 >UniRef50_Q46SG5 Arylsulfatase n=3 Tax=Proteobacteria RepID=Q46SG5_RALEJ Length = 542 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. Identities = 143/518 (27%), Positives = 227/518 (43%), Gaps = 43/518 (8%) Query: 64 HPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQ 123 A +PN++V DD+GW +V G GV G TP+ID++ + Sbjct: 8 LVAVTATVAAMSPFAAGAQQTRPNILVIWGDDIGWENVSAYGMGVM-GYTTPNIDSIGME 66 Query: 124 GLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYV 182 G+ T Y+QPS + RA +TGQY I G+ G G +L +++ GY Sbjct: 67 GIRFTDQYAQPSCTAGRAAFITGQYPIRSGMTTVGQPGDKLGWQPASPSLGEVMKQAGYR 126 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMY--TEWRDVHVNPEVALSPDRSEYIK 240 T GK HMG+ P GFD+F G + E D D++ K Sbjct: 127 TGFFGKSHMGDRNSHLPTVHGFDEFFGNLYHLNTEELPENHDYQAYANGYPGGDKAFAQK 186 Query: 241 QLPFSKDDVHAVRGGEQQAIADITP--------------KYMEDLDQ-RWMDYGVKFLDK 285 P +A + + P K MED D + + F+ Sbjct: 187 FAPRGVLHTYATDNDDPTDMPRFGPVGKQKIEDTGPLTKKRMEDFDAAEVIPKAIDFMQG 246 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDC----MVEMNDVFANLYKTLE 341 + DKPFF++ T H + N K+ ++ T D M++ + + + L+ Sbjct: 247 AKQKDKPFFVWLNTSRMHLYTHLNDKWRYAAAKYTHEDDMQGSGMLQHDHDIGLVLEYLK 306 Query: 342 KNGQLDNTLIVFTSDNGPEAEVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS 400 ++G NT++ +++DNGPE PHG T PFRG K +T+EGGVRV + + W G+I+P + Sbjct: 307 RSGLDKNTIVWYSTDNGPEHVSWPHGSTTPFRGEKMTTYEGGVRVVSMLRWPGVIKPGQI 366 Query: 401 -DGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 +GI D+F T +AG P K +IDG++ ++ G S RK Y+ Sbjct: 367 KNGIQAHQDMFTTFAAIAGVPDVVGQMKREKHQYIDGINNLDYWTGKTADSARKDFLYYY 426 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDS--- 516 KL AVRM +K H +++ Y GT+ + + +FNL +DP ES Sbjct: 427 ENKLTAVRMGPWKLHFSLKEDYY-----------GTLQPRSVTMLFNLRSDPFESYDSKD 475 Query: 517 ----IGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 + + + P+ + ++++ + YPP KS Sbjct: 476 AYGHLLQKAQWISGPMNELIASHLKTIADYPPVQPAKS 513 >UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SRP2_NEMVE Length = 491 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. Identities = 121/499 (24%), Positives = 204/499 (40%), Gaps = 73/499 (14%) Query: 81 KTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTR 140 ++ KP+++ L DD+GW DVGF+G + TP+ID +A+ G+IL + Y QP +PTR Sbjct: 20 QSSAKPHLLFVLADDLGWSDVGFHGSKI----QTPNIDRLAANGVILDNYYVQPVCTPTR 75 Query: 141 ATILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKES 197 A+++TG+Y IH G+ ++ G LT LPQ L GY T +GKWH+G N ES Sbjct: 76 ASLMTGKYPIHTGLQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWES 135 Query: 198 QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQ 257 P GFD F GF S ++ + H + + Sbjct: 136 TPTYRGFDTFYGFYSGAENHYTHVQDH-------------------------YLDLRDNE 170 Query: 258 QAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG--- 314 + + D Y L + + + + P F+Y + H +Y Sbjct: 171 EIVRDQNGTYSAHL---FTKRAEQ-IVRAHDPSTPLFMYMAFQNVHSPVQAPKEYIDRYS 226 Query: 315 --SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFR 372 P R +Y + M+D NL + +K G +NT+++F++DNG + + P R Sbjct: 227 FIKDPLRRTYAAMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNGGVPKNGGYD-YPLR 285 Query: 373 GAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKT 431 G K + WEGGVR FV+ + Q ++ + D +PT + LAG + + Sbjct: 286 GRKDTLWEGGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAG-------GSLDED 338 Query: 432 TFIDGVDQTSFFLGTNGQSNRKAEHYF--------------LNGKLAAVRMDEFKYHVLI 477 +DG D +S RK + + +R+ + K + + Sbjct: 339 EDLDGYDVWESISHGV-ESPRKELLHNIDTINIPPGDGSLGFSTTGIGLRVGDMKLLMAV 397 Query: 478 QQPYAYTQSGYQGG------FTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTE 531 + + G + + +++N+ DP E + + + LQ Sbjct: 398 PNISYFIPPEDRNGSVDWYIHSNNKVPMVEVALYNITADPYEKHDLHDKLPDVVTRLQLR 457 Query: 532 MHAYMEILKKYPPRAQIKS 550 + Y + PP + K Sbjct: 458 VEHYRKT--AVPPANKPKD 474 >UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C383_9PLAN Length = 405 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. Identities = 105/452 (23%), Positives = 175/452 (38%), Gaps = 63/452 (13%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 + +KPNV++ DD G +D+ G + TP +D++A +G+ T Y+ P SP Sbjct: 3 AISSEKPNVIIIFTDDQGSVDLNCYGAKDLI---TPHMDSIARRGIRFTQFYASAPVCSP 59 Query: 139 TRATILTGQYSIHHGILMPPM--YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKE 196 +RA +LTG++ G+ +G+ G T+ +++ GY T IGKWH+G E Sbjct: 60 SRAGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGYTPE 119 Query: 197 SQPQNVGFDDFRGFNSVS-DMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGG 255 + P GF+ G D Y+ + + L + E + F Sbjct: 120 TMPHGQGFETSFGHMGGCIDNYSHFFYWNGPNRHDLWENGKEVWRDGAFFP--------- 170 Query: 256 EQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS 315 ++ ++ K DKPFFLY+ H+ K+ + Sbjct: 171 -----------------DLMVEQCQDYIRKA--GDKPFFLYWAINVPHYPLQGKEKWRKT 211 Query: 316 ----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE----VPPHG 367 S R Y + M+D + TL+ + T+I+F SD+G E Sbjct: 212 YAHLSSPRDKYAAFVSTMDDCIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFGGGGS 271 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVAN 426 P+RGAK S +EGG+RVP + W G I D + D PT L G P Sbjct: 272 AGPYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATGCDWLPTISALTGAPLPA--- 328 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQS 486 +DG + + + +S + Y+ GK A+R ++K + Sbjct: 329 -----HHLDGKNLKAVIESSTAKSPHEN-FYWQIGKSWAIREGDWKLL----------GN 372 Query: 487 GYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 + + + +L D E ++ Sbjct: 373 PRDTSQQTPLGKENQIFLVDLSKDIGEKKNLA 404 >UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD Length = 480 Score = 433 bits (1114), Expect = e-120, Method: Composition-based stats. Identities = 111/519 (21%), Positives = 183/519 (35%), Gaps = 103/519 (19%) Query: 73 QKLAELEKKTGKK-PNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 L + K +K PNV+ L DD+G+ D+ G + TP + +A +G+ T Y Sbjct: 10 ASLCTIGVKAQEKLPNVIFILADDLGYGDIEPYGQQII---KTPQLSKLADEGMKFTQFY 66 Query: 132 S-QPSSSPTRATILTGQYSIHHGILMP-----PMYGQPGGLQGLTTLPQLLHDQGYVTQA 185 + +P+RA+ +TGQ + I P+ GQ L ++ QL GY T Sbjct: 67 TGTSVCAPSRASFITGQTTGETHIRGNEEVREPVDGQAPLLANDPSVAQLFKKAGYNTGC 126 Query: 186 IGKWHMGENK-ESQPQNVGFDDFRGFNSVSDMYTEWRDV-HVNPEVALSPDRSEYIKQLP 243 GKW +G E P GFD F G+NS + + + E L P+ Y +Q Sbjct: 127 FGKWGLGIVPSEGNPLKQGFDTFFGYNSQFRAHRRYPAFLWHDNEKVLIPENGNYERQEV 186 Query: 244 FSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCH 303 + + + + ++ K ++KPFF++ H Sbjct: 187 YGE--------------------------DLIQEKILDYIGKQT-AEKPFFMWLTYTLPH 219 Query: 304 FDN-------------YPNAKYAG------------------SSPARTSYGDCMVEMNDV 332 + P Y G +Y + ++ Sbjct: 220 AELVVPHDSIYASYEYLPKKPYKGVDYDKITPKPFGWAGYMSQPHTYATYAAMVSRLDKY 279 Query: 333 FANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPT 387 + K L+ G ++T+I+F SDNG E + RG K +EGG+R P Sbjct: 280 LGEIRKLLKVKGLDEDTIIIFASDNGAHREGGADPKFFNSSAGLRGIKRDLYEGGIRTPY 339 Query: 388 FVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGT 446 VYWKG I+ SD I D+ PT ++ + V LG Sbjct: 340 IVYWKGKIKAGSVSDHIGAFWDMMPTFAEITHQKYVPNRHQ---------VSFLPTLLGK 390 Query: 447 NGQSNRKAEHY--FLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSV 504 Q K ++ G AVR +K + A + Sbjct: 391 KQQQQHKYLYWEFHEMGGRQAVRYKNWKGVR----------------LNVNKDKKAPIEL 434 Query: 505 FNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++L TDP E ++ ++ + ++ M + +P Sbjct: 435 YDLTTDPAEQHNLAEKYPKIVKKIERFMEQSHTRSELFP 473 >UniRef50_C6VRQ8 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VRQ8_DYAFD Length = 553 Score = 433 bits (1114), Expect = e-119, Method: Composition-based stats. Identities = 114/554 (20%), Positives = 184/554 (33%), Gaps = 138/554 (24%) Query: 72 QQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAY 131 Q+K A + ++PN+++ ++DD+G+ D+G G + TP ID +A +G+ Y Sbjct: 20 QKKSAPDRRVADQRPNIILIMVDDLGYSDIGAYGSEI----KTPHIDQLAGEGIRFREFY 75 Query: 132 SQPSSSPTRATILTGQYSIHHGI-LMPPMYGQP----GGLQGLTTLPQLLHDQGYVTQAI 186 + +PTRA+++TGQY G+ G P + T ++L + GY T Sbjct: 76 NNSICAPTRASLITGQYPHKAGVGYFNVNLGLPAYQGYLNKESLTFGEVLRNAGYSTLLS 135 Query: 187 GKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSK 246 GKWH+G + + P GFD F GF + + Y + P V L + Sbjct: 136 GKWHVGNDSTAWPNQRGFDRFYGFINGASNYFDIGKYGKGPAVELVENNKRINLPPD--- 192 Query: 247 DDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN 306 + L D+ + FLD+ +K+ KPFFLY H+ Sbjct: 193 ----------------------KYLTDEITDHALAFLDEQSKTAKPFFLYLAYNAPHWPL 230 Query: 307 YPNA----KYAG------------------------------------------------ 314 KY G Sbjct: 231 QAPEADIAKYKGRYSIGWDSLRAERLQRQKALGITDPKQSVAARDKDVTPWENVPYDEKL 290 Query: 315 -SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG------ 367 Y + ++ L + L+ + DNTLIVF SDNG + Sbjct: 291 LWERKMEIYAAMVDRVDQNIGKLREKLKALNKDDNTLIVFISDNGAQGGYAGASPRRPQR 350 Query: 368 ----------------------RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIV 404 P K + EGG+ P ++ I+ + G Sbjct: 351 NTGPAGTAGSYVYQDQPWAYVSNAPHAAYKNNMHEGGISAPFIAWFPRQIKGGQIVKGTG 410 Query: 405 DLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRK-AEHYFLNGKL 463 L DL PT DLA AN V T + G G +G+ +R ++ Sbjct: 411 HLIDLAPTFYDLAKAAYPATANGVATNT-LPGKSLVPVLTGKSGEVDRGGEPIFWERAGN 469 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 AVR ++K +++L TD E+ + + Sbjct: 470 RAVRKGKWKIVSTYPAY--------------------KWELYDLETDRGETSDVASANPN 509 Query: 524 MGVPLQTEMHAYME 537 + L + + E Sbjct: 510 VVDQLAADYFRWAE 523 >UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000588CF9 Length = 545 Score = 432 bits (1113), Expect = e-119, Method: Composition-based stats. Identities = 112/498 (22%), Positives = 183/498 (36%), Gaps = 68/498 (13%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 + + + P++V L DD G+ D+G+ TP++D +A++G+ L + Y QP Sbjct: 50 QPSRNPRRPPHIVFILADDYGFNDIGYRN----PAMRTPNLDYLAAEGIKLDNYYVQPIC 105 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPG--GLQGLTTLPQLLHDQGYVTQAIGKWHMGE- 193 +P+RA +++G+Y IH G+ ++ L TLPQ L + GY T GKWH+G Sbjct: 106 TPSRAQLMSGKYQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFY 165 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 KE P N GFD F G + + Y P+ D Sbjct: 166 KKECWPTNRGFDSFLGILLGKGDHFLHTE---------EGGGGPYPSTWPWEGLDFRDGL 216 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 + ++K DKP FLY + H Y Sbjct: 217 QSTNAYSGI-------YSTHVIAERVENIIEKH-DKDKPLFLYVSFQAVHTPLQVPESYL 268 Query: 314 G------SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 R Y M++ N+ K L+K G D+T++VF+SDNG + Sbjct: 269 QPFESSIQDEKRRIYAGMTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNGGNIDQG-AS 327 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWK---GMIQPRKSDGIVDLADLFPTALDLAGHPGAKV 424 P RG+K + WEGGVR FV ++ S ++D++D +PT ++ V Sbjct: 328 NWPLRGSKTTLWEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTLIE-------GV 380 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY-----------------------FLNG 461 A T +DG + S R + F Sbjct: 381 AGWTLSGTKLDGYNIWETLRSGK-PSARVELLHNIDPLITPPSTWPNESIAAAHNSFSTR 439 Query: 462 KLAAVRMDEFKYHVLI---QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIG 518 AA+R ++K + + ++ +FN+ DP+E + Sbjct: 440 TYAALRYKDWKIVTGYXSINNGWYSPAESSKQSVASEILPGKSVWLFNITRDPREFHDLS 499 Query: 519 VRHIPMGVPLQTEMHAYM 536 + + L + +Y Sbjct: 500 NQEPAIVNFLLERLESYQ 517 >UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FU81_9SPHI Length = 461 Score = 432 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 118/481 (24%), Positives = 195/481 (40%), Gaps = 67/481 (13%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 +KPN++ L DD+G+ D+G G TP +D +A++G+ T PS +P+R Sbjct: 20 AQQKPNIIFVLTDDLGYSDLGCYGNPSIS---TPFLDKMAAKGVRATDYMVTSPSCTPSR 76 Query: 141 ATILTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQP 199 A++LTG+Y+ + + P G + G T+ ++L ++GY T IGKWH+G++ E P Sbjct: 77 ASLLTGRYASRYNLPDPIGPGAKNGLPAQEVTIAEMLKEKGYHTALIGKWHLGDHGEYLP 136 Query: 200 QNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQA 259 GFD F G D +P V + Q P + Sbjct: 137 NKQGFDYFYGMLYSHDY--------RDPYVKTDTTIKIFRNQTPVVTRPADSA------- 181 Query: 260 IADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR 319 L + + + +++ + K +PFFLYY H +A+ Sbjct: 182 -----------LSRIYTEEVKQYISQQ-KKGEPFFLYYAHNMPHLPVAFSAESGRMKDLH 229 Query: 320 --TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTP------- 370 G + +++ A ++ +LE+ G DNT+ +F+SDNGP E P Sbjct: 230 FAGPLGAVLEDLDRQLAIMWASLEEQGLADNTIFMFSSDNGPWIEYPVRMSGDHKTKNWH 289 Query: 371 ------FRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAK 423 FRG+K T+EGGVRVP YWKG + + D+ PT + G Sbjct: 290 VGTAGVFRGSKAQTYEGGVRVPFITYWKGHTPEGITLRNAISNVDILPTLAEWTGAS--- 346 Query: 424 VANLVPKTTFIDGVDQTSFFLGTNG--QSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPY 481 VP + +DG + + ++ + + +GK+ AVR +KY L Sbjct: 347 ----VPASRTLDGQSIAALLTSKSENITADHRPIYLVNHGKVEAVRKGSWKYRELPAGVN 402 Query: 482 AYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 + Y+ +FN+ DP E ++ L+ + L Sbjct: 403 NNSGKPYEA----------AKELFNISYDPSERTNVISEFPEKAQELKVLFDNFDASLDT 452 Query: 542 Y 542 Y Sbjct: 453 Y 453 >UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC8_9BACE Length = 493 Score = 432 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 112/487 (22%), Positives = 179/487 (36%), Gaps = 55/487 (11%) Query: 71 TQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSA 130 AE +K +KPN++ FL+DD+G D+ G TP+ID +A+ G++ T+ Sbjct: 16 VSTGCAEQKKVEEQKPNIIYFLVDDMGMGDLSLTGQK---KYETPNIDKLAADGMLFTNH 72 Query: 131 YS-QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKW 189 Y S P+RA ++TG+++ H + Q G TL +L GY T IGKW Sbjct: 73 YCGTTVSGPSRACLMTGKHTGHTSVRGNQPGPQLLG-DNEATLASVLKGAGYKTAVIGKW 131 Query: 190 HMGEN-KESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDD 248 +G PQ GFD G+ ++ + + + V + +L ++D Sbjct: 132 GIGHPIPLDDPQRKGFDLSYGYLNMWHAHNCFPEFLYRNGVKEELTGN----KLALAEDG 187 Query: 249 VHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYP 308 + + + + +KF+ PFF+YY H +N Sbjct: 188 TNPWADMPEGTGVARMDARKQYAPDLFEKEALKFISD--NKKNPFFIYYALNLPHANNEA 245 Query: 309 NA------KY------AGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 Y + M ++ +L LEK G DNT+I+F SD Sbjct: 246 APNGCEVPSYNADIAAKDWPEVEKGFAQMMQIIDKQVGDLVAYLEKEGLADNTIIMFASD 305 Query: 357 NGPEAEVPP-----HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLF 410 NGP E RG K W+GG+R P V W G ++ S+ + D+ Sbjct: 306 NGPHQEGGHKVDFFDSNADLRGKKRDMWDGGIRTPFIVKWPGKVKAGSTSNHLSAFWDVL 365 Query: 411 PTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY---FLNGKLAAVR 467 PT D+A IDG+ LG + + Y + G AV Sbjct: 366 PTFCDIAKVEKPAG---------IDGLSLLPTLLGDTAKQEKHKYLYFEFYEEGGKQAVV 416 Query: 468 MDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVP 527 D +KY L + + ++ L D E + H M Sbjct: 417 ADNWKYIKLNVRQGKGAKPVETS-------------LYRLTDDVSEQKDVKEEHPEMVEI 463 Query: 528 LQTEMHA 534 ++ + Sbjct: 464 MEGYIKE 470 >UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3HWU7_9SPHI Length = 472 Score = 432 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 104/481 (21%), Positives = 171/481 (35%), Gaps = 75/481 (15%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128 + Q + K N+V+ + DD+G+ D+GF G TP +D +A+ G+ T Sbjct: 18 NLSAQSKPSPQLSPKKHYNLVLIVADDLGYGDLGFTGSTQI---KTPHLDQLATNGVTFT 74 Query: 129 SAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQPG-------GLQGLTTLPQLLHDQG 180 Y SP+RA +TG + G +PG T+ L+ G Sbjct: 75 QGYVSSAVCSPSRAGFITGINQVEFGHDNNLAGVEPGFDIAYNGMPLSQKTIADHLNKLG 134 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 YV IGKWH+G+ + P GFD+F G+ Y E + L + Sbjct: 135 YVNGLIGKWHLGKEPQFHPLKRGFDEFWGYTGGGHDYFESLPNGKGYKEPLESNFK---- 190 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 TP + + + V F+++ D+PFFL+ Sbjct: 191 -----------------------TPDPITYITDDVGNESVDFIERH--KDEPFFLFAAFN 225 Query: 301 GCHFDNYPNAKYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 H + R +Y + ++ + +LE+ G +NTL+VF S Sbjct: 226 APHTPMQALEEDLALYQHIEDKKRRTYAAMVHRLDLNVGKIMTSLEEQGLSENTLVVFFS 285 Query: 356 DNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTAL 414 DNG + P+RG KG EGG+ VP + G++ V D+ PT L Sbjct: 286 DNGGPTDSNASLNAPYRGQKGILLEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTFL 345 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYH 474 LAG + GVD G + + A+R ++K Sbjct: 346 ALAGDTETSMDMFS-------GVDLIPHLTGKTPPLADREM-TWKFTISRAIREGDWKLV 397 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 + ++NL DP E + + ++H+ L ++ Sbjct: 398 ---------------------SVPDRMPMLYNLAEDPSEQNDLALKHMDKTTYLLKKLGT 436 Query: 535 Y 535 + Sbjct: 437 W 437 >UniRef50_B8KM62 N-acetylgalactosamine-6-sulfatase n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KM62_9GAMM Length = 472 Score = 432 bits (1111), Expect = e-119, Method: Composition-based stats. Identities = 126/441 (28%), Positives = 206/441 (46%), Gaps = 26/441 (5%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRA 141 KPN+V+ +D+ G+ ++G GGG+ G TP ID +AS+G+ LT+ + +P+RA Sbjct: 6 AADKPNIVLINMDNFGYGELGVYGGGIVRGGATPRIDKLASEGIRLTNFNVEAQCTPSRA 65 Query: 142 TILTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQ 200 ++TG+Y++ G P+ G Q T+P++L D GY T GKW++G+ + P Sbjct: 66 ALMTGRYAVRTGNGTVPLQTVDYGLTQWEYTMPEMLSDAGYATAHFGKWNLGQREGRYPT 125 Query: 201 NVGFDDFRGFNSVSD--MYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 N GFD++ G + +D + ++A ++ IK+ + +G + Sbjct: 126 NQGFDEWYGIPNSTDESEWPTNEMFLKWAKIAKETGKTPMIKET----HVLSGRKGSPTK 181 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPA 318 + ++D+ D G F+ + AK+ KPFFLY H P+A++ G S Sbjct: 182 EVKVFDSSVRPEIDREVTDLGKDFMTRQAKAGKPFFLYLPYTQTHAPVTPSAEFKGKS-G 240 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAKGS 377 +GD +++++ L +++ G DNT+ +FT+DNG E G P+ G+ + Sbjct: 241 NGKWGDILMQIDAYTGELLDKVDELGIADNTIFIFTADNGGEMTPTFQGWNGPWSGSYFT 300 Query: 378 TWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDG 436 EG +RVP V W G + S+ IV DLF T ++AG VP ID Sbjct: 301 GMEGSLRVPFIVRWPGKVPAGKVSNEIVHEFDLFSTFANIAGGK-------VPTDRIIDS 353 Query: 437 VDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTV 496 D T FFLG QS R ++ + V+ +K Q G + + Sbjct: 354 KDMTDFFLGKQEQSGRDGFVIYVGDDIFGVKWQNYKMMF---------QELDGGNGSNKL 404 Query: 497 MQTAGSSVFNLYTDPQESDSI 517 FNLY DP+E + Sbjct: 405 NVFPFVRFFNLYEDPKEEYPL 425 >UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWW9_RHOBA Length = 622 Score = 431 bits (1110), Expect = e-119, Method: Composition-based stats. Identities = 104/537 (19%), Positives = 189/537 (35%), Gaps = 83/537 (15%) Query: 40 YDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWM 99 + H N+ + T +A PNV++ + DD G+ Sbjct: 3 WIHSNRRD-----------TLLCTISIAFAITTLFIATPRPSGAASPNVILVMTDDQGYG 51 Query: 100 DVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPM 159 D FNG TP +D +AS+ + LT + P +PTR +++G + + + Sbjct: 52 DFSFNGNPYI---QTPALDRLASESVQLTDFHVAPMCTPTRGQLMSGLDAFRNS-AINVS 107 Query: 160 YGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVS-DMYT 218 G+ L T+ + D GY T GKWH+G+N +P++ GFD+ F S + Sbjct: 108 SGRTLLRHDLKTMADVFQDAGYRTGIFGKWHLGDNYPFRPEDRGFDETLWFPSSHINSVP 167 Query: 219 EWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDY 278 ++ D + + + + D Sbjct: 168 DFWDNDYFDDTYIRNGKRVAHS----------------------------GYCTDVFFDE 199 Query: 279 GVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPAR------------------- 319 +++ + + +D PFF + H+ + +Y Sbjct: 200 AIEWAKQTSPTDSPFFAFIPLNSAHWPWFVPDQYRARVRTMLGDTTELKRQLDTTPSNLE 259 Query: 320 --TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGS 377 S+ + ++D L + L+++G +NT++VF +DNG + RG K Sbjct: 260 DLISFLAMGLNIDDNVGTLTQYLDESGLSENTIVVFLTDNG-STFGDHYFNAGMRGKKTQ 318 Query: 378 TWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 WEGG RVP + W I +K D + + DL PT LA +DG Sbjct: 319 LWEGGHRVPCLIRWPEQITAQKIDDLTHVQDLLPTLAALADCDEHLPG-------PLDGT 371 Query: 438 DQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVM 497 LG + + RM +FK P ++G + + Sbjct: 372 SLAPRLLGETDSLADRMLVINYS------RMPQFKVTYTKGNPAIPRRNGAAVMWNKWRL 425 Query: 498 QTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK---YPPRAQIKSD 551 ++N+ DP + ++ H + ++ + + + +K P R I S+ Sbjct: 426 LENK-RLYNVEQDPHQDHNVAQDHPEIVAKMRAHLATWWDGVKDDVMTPERVVIGSE 481 >UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UMZ5_RHOBA Length = 484 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 110/495 (22%), Positives = 172/495 (34%), Gaps = 87/495 (17%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 L + +PN+V+ L DD+G+ D+G G TP +D +A+QG+ T AY+ P Sbjct: 31 LRADSNDRPNIVLILADDLGYGDLGCYGNDEQA---TPVLDRLATQGVRWTQAYANGPEC 87 Query: 137 SPTRATILTGQYSIHHG-------ILMPPMY---------GQPGGLQGLTTLPQLLHDQG 180 SPTRA +LTG+Y H G + Y + G TL + L G Sbjct: 88 SPTRAALLTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVG 147 Query: 181 YVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIK 240 Y T GKWH+G + P GFD+ + Y + D + + Sbjct: 148 YETALFGKWHLGYEAKFSPMMHGFDEALYCIGGAMDYYHYLDSVA--------TYNLFHN 199 Query: 241 QLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTR 300 P S + D V+F+ +DKPFFLY Sbjct: 200 GRPISGE---------------------GYFTDTITDQAVRFIGDRNANDKPFFLYLPYT 238 Query: 301 GCHFDNYPN------------AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDN 348 H + ++ Y + M++ + +E++ D Sbjct: 239 APHTPYQAPGESPVDPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDR 298 Query: 349 TLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLA 407 TL++F SDNG + P RG KG +EGG+RVP W G + SD + Sbjct: 299 TLVIFASDNGGTSASR---NEPLRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITF 355 Query: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL--AA 465 DL + L AG + ++G+D S R + Sbjct: 356 DLTASMLAAAGITPTQED-------AMEGIDVLSLAANDEPVQPRTLYWRKPRDPQVWSG 408 Query: 466 VRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMG 525 +R +KY + S +FNL D E + + Sbjct: 409 MRDGNWKYVRQEKATVDGRTSI-------------QEWLFNLADDISEQTDLASQSTDEL 455 Query: 526 VPLQTEMHAYMEILK 540 L+ A+ + ++ Sbjct: 456 DRLRGRYLAWEQSVR 470 >UniRef50_A6C4B6 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4B6_9PLAN Length = 515 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 120/537 (22%), Positives = 188/537 (35%), Gaps = 80/537 (14%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 ++P +PNVV+ L DD+G+ DV PTP++ Sbjct: 6 VIPTCCLILLLLAASPTPLSAAADKPNRPNVVIILADDMGYGDVTALN--KGSRIPTPNL 63 Query: 118 DAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPMYGQ---PGGLQGLTTLP 173 D A Q L+ T A++ P+R +LTG+Y + G T+ Sbjct: 64 DQFARQSLVFTDAHAAGSYCVPSRYGLLTGRYMWRTRLGSGGNLANFAGTLIEPGRRTIA 123 Query: 174 QLLHDQGYVTQAIGKWHMGENKESQ-----------------------------PQNVGF 204 L+ D GY T +GKWH G + + + P++ GF Sbjct: 124 NLMQDAGYQTGLVGKWHQGIDWKLRDESARVQIRVDPNYQNFKNIDFAAPALKGPKDYGF 183 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 G ++M NP + +R+ I L ++ + Sbjct: 184 AYSFGTAGSAEM---------NPSTFIVNNRAAVIPTLTTAEAKEKFGEWYGRDDNIIAE 234 Query: 265 PKYMEDLDQRWMDYGVKFLDKMA--KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSY 322 M+ L + +F++ K D+PFFLYY H PN ++ G S A +Y Sbjct: 235 GYTMDRLVPTLSNKACEFVETAVRSKPDQPFFLYYAMTTPHNPIVPNQEFVGKSQA-GTY 293 Query: 323 GDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE---------------VPPHG 367 GD +VE++ L + L+ G DNTLI FTSDNGP Sbjct: 294 GDFVVELDFHVGRLLQKLKDLGIADNTLIFFTSDNGPVDRTRGYPQRWVRGDTQIYGHDS 353 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVAN 426 P G KG EGG RVP V W I+P + + D+ PT ++ Sbjct: 354 TGPCSGWKGGLEEGGHRVPFIVRWAAKIKPGEECATTIVFNDVLPTLAEMLNVK------ 407 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSN-RKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQ 485 + T DGV G + + KA + + AVR FK + + Sbjct: 408 -LDSNTAEDGVSFYPALTGASRPVSFHKAIIHNHHNGHFAVRQGAFKLIIKGPKTVEEVL 466 Query: 486 SGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKY 542 +++L D ES + +H + + Y++ + Sbjct: 467 DARVPV---------KYQLYDLDKDIAESTDVSAKHPEKVKQMHALLKQYVKAGRST 514 >UniRef50_A6LIX5 Arylsulfatase n=2 Tax=Bacteroidales RepID=A6LIX5_PARD8 Length = 514 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 121/509 (23%), Positives = 192/509 (37%), Gaps = 62/509 (12%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSPTR 140 K+PNVV+ L DD+G+ DVG N TP ID +A G+ T A+S S P+R Sbjct: 20 AQKQPNVVIILADDMGYGDVGCNNP--YARVRTPAIDQLARNGIRFTDAHSAGALSGPSR 77 Query: 141 ATILTGQYSIHHGILM-PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ- 198 ++TG+Y Y P T+ L+ + GY T +GKWH+G + + + Sbjct: 78 YGLVTGRYFFRTPKKSEYWGYLSPYIEPERLTIGSLMRNAGYTTACVGKWHLGLDWQLKD 137 Query: 199 --------------------------PQNVGFDDFRGFNSVSDMYTEWRDVHV------- 225 P +GFD + DM + Sbjct: 138 DSKPQILTPKKFGYTNTDFSAPVKRGPTELGFDYSFILPASLDMPPYAFVRNDRVVDPDV 197 Query: 226 ----NPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVK 281 + + + +++D++ RG + E+ +D G+ Sbjct: 198 ILTADAYPKKQDETVYAWDRKHTNENDIYWERGVWWRNGEMSRSFKFEECFPTIVDEGIA 257 Query: 282 FLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLE 341 F+D+ + DKPFFLY G H P ++ G S +YGD M ++++V A + L+ Sbjct: 258 FIDREGRKDKPFFLYMPLTGPHTPWLPTVQFKG-STELGTYGDFMGDIDNVVARVNAKLK 316 Query: 342 KNGQLDNTLIVFTSDNGPEAE------VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI 395 + G NT+++F SDNG E RG KG W+GG VP V+W I Sbjct: 317 ELGLEKNTIVIFASDNGGAWEEEDIQQYGHQSNWSRRGQKGDAWDGGHHVPLIVHWPDHI 376 Query: 396 Q-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 + P V L D+ T DL G +PK D G S R Sbjct: 377 KCPGVCSQTVGLVDILATLADLTG-------QSLPKGQAEDSFSFKKVLDGDMNASTRDQ 429 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 Y A++ ++KY + V ++N+ TD ES Sbjct: 430 IMYLSGSGKLAIKKGDWKYI-----DCLGSGGFTAPARLSPVKNGPKGQLYNMRTDSLES 484 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 +++ +R + L + ++ P Sbjct: 485 NNLFLREKGIANELSALLKKLLDQGYSRP 513 >UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA Length = 496 Score = 430 bits (1107), Expect = e-119, Method: Composition-based stats. Identities = 90/498 (18%), Positives = 174/498 (34%), Gaps = 47/498 (9%) Query: 50 PATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVA 109 +++ + P ++++ + PN+++ + DD G+ D+G +G Sbjct: 1 MPIKAILSVLLFLLVPCSGLRAADNGDDVDQVS--PPNIILVMTDDQGYGDLGCHGHPFL 58 Query: 110 VGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGL 169 TP++D + S+ + P+ +PTR+ +++G+ +G+ + L Sbjct: 59 ---KTPNLDRLHSESTRFNDFHVSPTCAPTRSALMSGRAPFKNGVTHTILERDRMALTS- 114 Query: 170 TTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEV 229 TT+ ++L GY T GKWH+G+ QP GFD+ + P Sbjct: 115 TTIAEVLKSAGYTTGIFGKWHLGDEDAYQPDRRGFDETFIHGAGGIGQNFAGSQSDAPGT 174 Query: 230 ALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKS 289 + + + + ++ KS Sbjct: 175 SYFN----------------------PIIKHNGTFVQTEGYCTDVFFQQALGWIRLQTKS 212 Query: 290 D-KPFFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKN 343 D KPFF Y T H +Y+ S ++ + +V ++D L L++ Sbjct: 213 DTKPFFAYIPTNAPHAPYKVEKRYSDRFRDKCSSPQSEFLGMIVNIDDNMGKLMGKLDEW 272 Query: 344 GQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDG 402 DNTL++F +DNG A+ +G KG+ EGG RVP F+ G + Sbjct: 273 DLADNTLLIFMTDNG-SAKGSKIYNAGMKGGKGTVNEGGSRVPLFMRLPGFTNSGVDIET 331 Query: 403 IVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGK 462 + DLFPT ++A +P +DG S + + + + Sbjct: 332 MTRHVDLFPTLAEIAHAE-------IPAEADLDGRSLVSLIKNPQLDWDHRFQFFHSGRW 384 Query: 463 LAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHI 522 A + P + +++L DP E+ + H Sbjct: 385 AKAGLKGK----FGKGDPNPDHSKHKNYAVRDEKWRLVNGELYDLENDPGETADVAGSHP 440 Query: 523 PMGVPLQTEMHAYMEILK 540 + + + + ++ Sbjct: 441 EVVSRMLVAFDEWWDEVR 458 >UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKN7_9BACT Length = 465 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 107/482 (22%), Positives = 195/482 (40%), Gaps = 51/482 (10%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSSSP 138 + +K N+++ DD+ + +G G V TP ID++ ++G+ + Y+ + +P Sbjct: 14 AMSAEKTNIILIFADDMHYGALGVTGS-VLTKAKTPAIDSIFNEGVHFPNGYASHATCAP 72 Query: 139 TRATILTGQYSIHHGILMPPMYGQ------PGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 +RA +LTG+Y + P G +P L+ GY T AIGKWH+G Sbjct: 73 SRAGLLTGRYQARFDLETLPGGTADRKKTGYGVKTSEIMIPALMKKGGYQTCAIGKWHLG 132 Query: 193 ENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD-DVHA 251 ++E QP GFD + G+ Y V + + +K LP +D ++ Sbjct: 133 SSEEFQPNARGFDHWFGYRGSCGFYQFKSQVQSA-------KKGQELKPLPSGEDPNLDV 185 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 VR GE + L + D ++ + ++PFF+Y+ H + K Sbjct: 186 VRNGESVRLE-------GYLTDHFSDEAANWIKE--NKERPFFMYFAPYNVHAPDTVPNK 236 Query: 312 YAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPF 371 Y T++ + ++ + L++ G DNTL+VF++DNG + + F Sbjct: 237 YIPK--GGTAHDGVIAALDASVQTILDALKEAGIADNTLVVFSNDNGGKKDYSKT----F 290 Query: 372 RGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPGAKVANLVPK 430 +G K + +EGG+RVP + W I+ K +G+V DL PT LA +P Sbjct: 291 KGNKATFYEGGIRVPFAMRWPKGIEAGSKYNGVVSTLDLLPTFAALAKVD-------LPS 343 Query: 431 TTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQG 490 DG + + +++ H++ NG R+ ++K + + G Sbjct: 344 DRVYDGQNLLPVI--KDSAKDQRQAHFWRNGAWRTARVGDWKLVWQVDRKKQKALLNKLG 401 Query: 491 GFTGTVMQTA----------GSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILK 540 ++NL DP+E ++ + + + + Sbjct: 402 IKHVKGRGVTYAERADELFLEPELYNLANDPKEESNLAQSNPEKLQEMVKIYKDWEASIP 461 Query: 541 KY 542 K+ Sbjct: 462 KW 463 >UniRef50_Q7UYH3 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYH3_RHOBA Length = 598 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 101/514 (19%), Positives = 178/514 (34%), Gaps = 72/514 (14%) Query: 68 DKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLIL 127 + ++ E ++PNV+V + DD G D GF G + TP +D + +Q L Sbjct: 18 SALSSTSVSAAETNAAERPNVIVIMSDDQGVGDYGFMGNPII---RTPSLDKMRTQSGYL 74 Query: 128 TSAYSQPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIG 187 + Y +PTRA+++TG+Y+ + G+ TL + L + GY T G Sbjct: 75 SRFYVSNVCAPTRASLMTGRYNYRTRCIDT-YVGRAMMDPDEVTLAERLSEAGYQTGIFG 133 Query: 188 KWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKD 247 KWH+G+N +P + GFD+ + +Y F Sbjct: 134 KWHLGDNYPMRPMDQGFDESLIHRGGG----------IGQPSDPIGAEGKYTDPTLFHNG 183 Query: 248 DVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDN- 306 D A+ G + D + F K +S KPFF Y T H Sbjct: 184 DEVAMEG---------------YCTDIFFDAAIDFARKQTESGKPFFTYIATNAPHGPFD 228 Query: 307 ----------------------YPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNG 344 P + + ++ L+ +L++ Sbjct: 229 DVPNELYEEYKQVDFTPILVSDLPAKRRDAEFDKLARISAMITNIDQNVGKLFASLDELK 288 Query: 345 QLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGI 403 +NT++++ +DNGP + RG K +GG+R P +W + +D + Sbjct: 289 IRENTIVLYLNDNGPNSRRYVGN---MRGNKTQVDDGGIRSPLLFHWPAKVDASDTTDVM 345 Query: 404 VDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKL 463 + DL PT LD G ++ + +DG G S + Sbjct: 346 LAHIDLMPTLLDACGVAASE-------SPALDGKSFLPLLTGEMDYSQWETRLIAFQTHR 398 Query: 464 AAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIP 523 V K+H + + G ++NL DP++ + + +H Sbjct: 399 GNVPQ---KFHHFAMHEHPWKLVHPSGFGKERFEGEPKLELYNLEDDPKQQNDLADKHPE 455 Query: 524 MGVPLQTEMHAYMEILKKY------PPRAQIKSD 551 + L+ + + + PPR I ++ Sbjct: 456 IVQRLKQAYSKWFDDVSSTRPDNYAPPRIVIGTE 489 >UniRef50_A6DMV0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMV0_9BACT Length = 443 Score = 429 bits (1105), Expect = e-119, Method: Composition-based stats. Identities = 108/497 (21%), Positives = 173/497 (34%), Gaps = 98/497 (19%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTR 140 KPN+V ++DD G+ D G TP I+ +A GL T+ Y+ P SPTR Sbjct: 16 AQDKPNIVFIIIDDFGYADSEPYGAKDI---KTPGINELAKDGLKFTNFYANAPVCSPTR 72 Query: 141 ATILTGQYSIHHGILMPPMYGQP------------------GGLQGLTTLPQLLHDQGYV 182 +TG++ G YG G L LP+LL GY Sbjct: 73 CAFITGRWQQRSGFEWALGYGGTNSQLKNGQYEAVTDIHGIGLLPEKNHLPKLLKKAGYK 132 Query: 183 TQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQL 242 T A GKWH+G + P + GFD++ G Y ++ + Sbjct: 133 TGAFGKWHLGSQDKFNPIHHGFDEYYGPLLGHCDYYTYKYYDDTYTLREG---------- 182 Query: 243 PFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGC 302 K L + V F+D+ A DKPFF+Y Sbjct: 183 -------------------AKVIKDSGYLTTNINERAVDFIDRHA--DKPFFMYVPHMAV 221 Query: 303 HFDNYPNAKYAGS-------SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTS 355 H K R Y + E++ + L++ TL V +S Sbjct: 222 HSPYQSADKKPKQITKTNLNDGNRADYAAMVEEVDKGVEMIIAKLKEKKIFHKTLFVVSS 281 Query: 356 DNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI-QPRKSDGIVDLADLFPTAL 414 DNG P K + +EGG+RVP ++W I + SD I DL T L Sbjct: 282 DNGGA---HFSDNAPLFHRKTTLFEGGIRVPCIMHWPEKIGKGVVSDQIAITMDLSKTFL 338 Query: 415 DLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN--GKLAAVRMDEFK 472 LAG DG++ N + R + + AVRM ++K Sbjct: 339 ALAGIDEPS----------YDGINLLPMMTDKNNKVERTLFWRSNSKARRQKAVRMGKWK 388 Query: 473 YHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEM 532 Y + + ++NL D E+ ++ + + ++ ++ Sbjct: 389 YILDVNCE----------------------LLYNLENDIAENKNLFYQRPEIVQQMKQKL 426 Query: 533 HAYMEILKKYPPRAQIK 549 ++ + ++ P +++ Sbjct: 427 ASWEREMDQHQPPFKVR 443 >UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA41_PLALI Length = 519 Score = 429 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 93/482 (19%), Positives = 167/482 (34%), Gaps = 72/482 (14%) Query: 79 EKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSP 138 ++ +PN+++ + DD G+ D+ +G V TP +D + Q + + P+ +P Sbjct: 39 ADESKTRPNIILMMTDDQGYGDLSLHGNPVV---KTPHLDQLGRQSVRFEQFHVSPTCAP 95 Query: 139 TRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQ 198 TRA+I+T ++ G+ + + L+ LPQ L GY T GKWH+G+ Q Sbjct: 96 TRASIMTSRHEFSSGVTHTILERERLSLKATI-LPQFLKRAGYTTGIFGKWHLGDEDAYQ 154 Query: 199 PQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQ 258 P GFD+ + P +K +R + Sbjct: 155 PGKRGFDEVFIHGGGGIGQSYPGS----------------CGDAPLNKYFNPVIRHNGKF 198 Query: 259 AIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY-----A 313 + + ++D + ++ ++PFF Y H +Y Sbjct: 199 VATN------GYCTKVFVDQAITWISSQ-PDNQPFFCYITPNAPHAPLDCPKEYYEPYLE 251 Query: 314 GSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRG 373 + + +D L K LE +T+++F +DNG A H R Sbjct: 252 HVPEDVARFYGMITHWDDQLGRLLKALEDRDISKDTIVIFMTDNG-SATGAKHFSAGMRA 310 Query: 374 AKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTF 433 KG+ +EGG+RVP F W G QP+ + D+ PT +LA P A + Sbjct: 311 NKGTPYEGGIRVPAFWSWAGHWQPQVRQEVTCHYDILPTLTELANVPVADDEKQSWQ--- 367 Query: 434 IDGVDQTSFFLGTNGQSNRKAEHYF----------------LNGKLAAVRMDEFKYHVLI 477 G G + + A+R+ ++K + Sbjct: 368 --GRSLVPLLAGRSPNWPPRPFITHVGRWPKEHDPKREPSTYQYAKCAIRLGDWKLISNV 425 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYME 537 +Q ++ L DP E ++ ++ L+ A+ Sbjct: 426 KQG------------------EPQWELYQLAEDPAEKINLAKKYPDRVEELKKIYDAWWL 467 Query: 538 IL 539 + Sbjct: 468 SV 469 >UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN Length = 569 Score = 429 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 116/503 (23%), Positives = 187/503 (37%), Gaps = 78/503 (15%) Query: 87 NVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTG 146 +++ L DD G+ DVG++G + TP +D +A++G+ L + Y QP +P+R+ +LTG Sbjct: 48 HIIFILTDDQGYHDVGYHGSDI----ETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTG 103 Query: 147 QYSIHHGILMPPMYGQPG--GLQGLTTLPQLLHDQGYVTQAIGKWHMGE-NKESQPQNVG 203 +Y IH G+ + Q TLPQ L + GY T +GKWH+G KE P G Sbjct: 104 RYQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 163 Query: 204 FDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADI 263 FD F G + + Y + + D+H Sbjct: 164 FDTFLGSLTGNVDYYTY----------------DNCDGPGVCGFDLHEGEN-------VA 200 Query: 264 TPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSP-----A 318 + + + +P FLY + H +Y A Sbjct: 201 WGLSGQYSTMLYAQRA-SHILASHSPQRPLFLYVAFQAVHTPLQSPREYLYRYRTMGNVA 259 Query: 319 RTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGST 378 R Y + M++ N+ L++ G +N++I+F+SDNG + P RG KG+ Sbjct: 260 RRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG-GSNWPLRGRKGTY 318 Query: 379 WEGGVRVPTFVYWK-GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 WEGGVR FV+ + R S ++ + D +PT + LAG + +DG Sbjct: 319 WEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADG-------LDGY 371 Query: 438 DQTSFFLGTNGQSNRKAEHY--------------------FLNGKLAAVRMDEFKYHVLI 477 D S R + + AA+R+ E+K Sbjct: 372 DVWPAIS-EGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTGD 430 Query: 478 QQPYAYTQSGYQGGFTGTVMQTAG-------SSVFNLYTDPQESDSIGVRHIPMGVPLQT 530 + F G+ +FN+ DP E + + + + L Sbjct: 431 PGYGDWIPPQTLATFPGSWWNLERMASVRQAVWLFNISADPYEREDLAGQRPDVVRTLLA 490 Query: 531 EMHAYMEIL--KKYP---PRAQI 548 + Y +YP PRA Sbjct: 491 RLAEYNRTAIPVRYPAENPRAHP 513 >UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1I8_9PLAN Length = 427 Score = 429 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 112/482 (23%), Positives = 180/482 (37%), Gaps = 84/482 (17%) Query: 90 VFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSPTRATILTGQY 148 + L DD+G+ DV TP ID +A++G++LTS + SP+RA +LTG+Y Sbjct: 2 LILADDLGYGDVSTYHP---SDVRTPQIDQLAAEGMLLTSMRANCTVCSPSRAALLTGRY 58 Query: 149 SIHHGILMP----PMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGF 204 + G+ P + TL L GY T +GKWH+G + P GF Sbjct: 59 ADRVGVPGVIRTKPEDSWGWFDPTVPTLADELKRVGYHTAIVGKWHLGLESPNTPNERGF 118 Query: 205 DDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 D F+GF DM + H G Sbjct: 119 DFFQGFLG--DMMDSYT---------------------------THLRYGNNYMRRNREV 149 Query: 265 PKYMEDLDQRWMDYGVKFL-DKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGS-------- 315 + + + D+ ++L ++ + ++PFFLY HF P A++ Sbjct: 150 IEPQGHATELFTDWASEYLVERAKQKEQPFFLYLAYNAPHFPIEPPAEWLAKVKERAPQL 209 Query: 316 SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAK 375 R + ++ + KTL++ G NT++VFTSDNG P+R K Sbjct: 210 DQKRAKNVAFVEHLDHSIGRVLKTLKETGLDQNTVVVFTSDNGGSL-PHAQNNDPWRDGK 268 Query: 376 GSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFI 434 S ++GG+RVP V W G I+ SD + DLFPT L+LAG +K + Sbjct: 269 QSHYDGGLRVPFMVRWPGQIKAGSRSDYVGLNFDLFPTFLELAGATPSK---------EL 319 Query: 435 DGVDQTSFFLGTNGQSNRKAEHYFLNGK-------LAAVRMDEFKYHVLIQQPYAYTQSG 487 D V G ++R G A+ E+K Sbjct: 320 DAVSLVPVLKGGKITTSRDLYFVRREGGVTYGGKSYEAIIRGEWKLL------------- 366 Query: 488 YQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQ 547 + ++N+ DP E+ + + + L + +++ P +A Sbjct: 367 -------QNDPYSALELYNIQNDPGETKDLAASNKKVVNELAAALRLHIQRGGATPWQAP 419 Query: 548 IK 549 + Sbjct: 420 PR 421 >UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI Length = 481 Score = 429 bits (1104), Expect = e-118, Method: Composition-based stats. Identities = 119/497 (23%), Positives = 186/497 (37%), Gaps = 64/497 (12%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 L + G+ ++V + DD+GW D+GF+ + TP+ID +A +GL+L Y Q Sbjct: 14 NLCDDVSAAGRPRHIVFIVADDLGWNDIGFHNPDII----TPNIDKLAREGLLLNHHYVQ 69 Query: 134 PSSSPTRATILTGQYSIHHGILM-PPMYGQPG-GLQGLTTLPQLLHDQGYVTQAIGKWHM 191 P SP+RA ++G Y G+ + QP +T LPQ L + GY T +GKWH Sbjct: 70 PLCSPSRAAFMSGYYPFKTGLQHSVILENQPVCLPLNITILPQKLKELGYATHIVGKWHN 129 Query: 192 GE-NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVH 250 G + P GFD F G+ + Y + P D+ Sbjct: 130 GFCSWNCTPTYRGFDSFFGYYGAMEDYYTH---------VIRGFLDYRNNTTPVWTDN-- 178 Query: 251 AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNA 310 R+ D +++ +P FLY + + A Sbjct: 179 -----------------GTYSTLRFTDVATDIIERH-NQSQPLFLYLAYQAVYGPIEVPA 220 Query: 311 KYAG-----SSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP 365 KY S R + + +++ N+ KTL + G +D+TLI+FT+DNG Sbjct: 221 KYEAMYPNIKSENRRKFSGMVSALDEAVGNVTKTLRQRGLMDDTLILFTADNGG-GVDES 279 Query: 366 HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKS-DGIVDLADLFPTALDLAGHPGAKV 424 P RG+K + +EGG R F+Y G+ + DG++ D PT AG Sbjct: 280 GNNYPLRGSKFTVYEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAAAGGTPVSD 339 Query: 425 ANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG------KLAAVRMDEFKYHVLIQ 478 DG++ T S R Y + AA+R+ ++K Sbjct: 340 R---------DGINLWPSLS-TASPSPRTEVVYNYDSHPQPVQGHAAIRVGDYKLIDGYP 389 Query: 479 QP----YAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 P Y Q + +FNL DP E + + M L + A Sbjct: 390 GPFPDWYKPEQVTSSLNTRFSRDSANQYQLFNLKDDPNERNDLSNFRPDMVKKLAARL-A 448 Query: 535 YMEILKKYPPRAQIKSD 551 + + P + D Sbjct: 449 WYKKQAVPPNFPETPDD 465 >UniRef50_B4CZ78 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CZ78_9BACT Length = 527 Score = 429 bits (1104), Expect = e-118, Method: Composition-based stats. Identities = 111/499 (22%), Positives = 176/499 (35%), Gaps = 55/499 (11%) Query: 75 LAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-Q 133 + +T PN+V+ L DD+G+ VG G TP+ID +A +G T A + Sbjct: 16 VFAPAAETTSTPNIVIILADDLGYGSVGCFGAD-GKLVRTPNIDRLAHEGRRFTDANTTS 74 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGL--QGLTTLPQLLHDQGYVTQAIGKWHM 191 +PTR ++LTG+Y + + L + +L GY T AIGKWH+ Sbjct: 75 SVCTPTRYSLLTGRYCWRTSLKYETLNTFAPMLIEPTRYNMASMLKAHGYHTAAIGKWHL 134 Query: 192 GENKE---------------SQPQNVGFDDFRGFN---------SVSDMYTEWRDVHVNP 227 G P +GFD V + Y P Sbjct: 135 GYGDGKKDPKYRVDYTAELAPGPNELGFDYHFAVPQNHGDVTGVYVENHYVYGLRSGKIP 194 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 P + + + G D + + + D ++++ Sbjct: 195 ADLKLPAPVPDDENFAPTYNSESQQGHGHTPMEIDAPRRVDDRVMPELTDQAAHWIEQQ- 253 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 K+ PFFLY+ H P+ G+S A +GD + E++ + +TLEK G Sbjct: 254 KAGTPFFLYFAPVAVHEPVTPSRDTRGTSQA-GRFGDWIHELDRTVGRVLETLEKQGFAQ 312 Query: 348 NTLIVFTSDNGP------------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMI 395 NTL++FTSDNG +RG K ++GG VP W G I Sbjct: 313 NTLVIFTSDNGGIYEPTQKRPEMDAVHAGLAVNGQWRGGKTHVFQGGFNVPFIARWPGKI 372 Query: 396 QPRK-SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTN-GQSNRK 453 S ++ L D+ T + G D + LG Q R Sbjct: 373 PAGTESREMISLVDVLATTAAIVGEKLPSAEKAAE-----DSCNILPALLGEKYDQPLRS 427 Query: 454 AEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQE 513 N + A+R +K+ + P G + + ++NL DP E Sbjct: 428 DMVEHSNDGVFAIRKGPWKWIEGV--PVKQISPGLRKAHAAEFQR----QLYNLAEDPTE 481 Query: 514 SDSIGVRHIPMGVPLQTEM 532 S + +H + L+ + Sbjct: 482 SKDVSEQHPEIVKELEAAL 500 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 429 bits (1104), Expect = e-118, Method: Composition-based stats. Identities = 103/485 (21%), Positives = 174/485 (35%), Gaps = 81/485 (16%) Query: 74 KLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ 133 + +PNV++ + DD GW +VGF+G V TP++D A++G LT+ Y Sbjct: 35 SSVTVAVAAEPRPNVILVMTDDQGWAEVGFHGNEVL---KTPNLDRFAAEGTELTNFYVS 91 Query: 134 PSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 P +PTR++++TG+Y G + G+ TT+ ++ GY T GKWH+GE Sbjct: 92 PMCTPTRSSLMTGRYHFRTGAHDTYI-GRSNMNPEETTIAEVFAGAGYRTGIFGKWHLGE 150 Query: 194 NKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253 N + ++ GF V ++Y + + Sbjct: 151 NFPMRAEDQGFQ-----------------KVVVHGGGGIGQFADYPGNTYWDPTLQY--- 190 Query: 254 GGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYA 313 D K ++D ++F+ ++PFF Y H ++ Sbjct: 191 -------NDSFKKAKGYCTDVFIDESIQFMKDS--GEQPFFCYLPLNVPHSPFDVADEFR 241 Query: 314 GSSPAR-----------TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 + + + + F L + +E GQ +NT+I+F SDNGP + Sbjct: 242 ADYDNQNLADPDGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMSDNGPNST 301 Query: 363 VPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPTALDLAGHPG 421 G R KGS +E G+R P + W +Q K D DL PT D G Sbjct: 302 YFTAG---LRAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHIDLLPTLADACGI-- 356 Query: 422 AKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG-------KLAAVRMDEFKYH 474 +P +DG G ++ N + R +K Sbjct: 357 -----GLPADLQVDGKSILGLLHGETQGFQQRYLFMQHNRANVPPKYENCMARRGPWKVV 411 Query: 475 VLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 G + G ++N+ DP E+ + +H + E A Sbjct: 412 -------------------GDGGEPTGFELYNIEQDPGETRDLADKHPEIVKAFVREYEA 452 Query: 535 YMEIL 539 + + + Sbjct: 453 WFDDV 457 >UniRef50_Q7URW3 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7URW3_RHOBA Length = 480 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 110/505 (21%), Positives = 177/505 (35%), Gaps = 56/505 (11%) Query: 61 VMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAV 120 A L +PN+VV + DD+G+ + G G PTP IDA+ Sbjct: 10 AAPSTACLFLASVALCWQGTTAASQPNLVVIIADDLGYGETGMMGN---AEIPTPAIDAL 66 Query: 121 ASQGLILTSAYSQPS-SSPTRATILTGQYSIHHGILMPP-----MYGQPGGLQGLTTLPQ 174 A G+ TS Y S SP+RA L+G+Y G + P + G T + Sbjct: 67 ARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNPTGERNNHPNAGLPPQQKTFVE 126 Query: 175 LLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTE-------WRDVHVNP 227 L GY T IGKWH+G P + GFD F GF Y W + N Sbjct: 127 HLQSAGYQTSLIGKWHLGTRPSQVPTSKGFDRFFGFLHEGHFYVPGPPFENVWTMLRDNT 186 Query: 228 EVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMA 287 + ++ + +++ + G ++ L D + + + A Sbjct: 187 LPTGRFETNQKTIRGNYARINEPDYDAGNPMLDGSEPIEHWNYLTDSITDKAIDAITQTA 246 Query: 288 KSDKPFFLYYGTRGCHFDNYPNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEK 342 KPF + H + + + P R + ++ ++ + + L++ Sbjct: 247 S--KPFAMVVSYNAVHSPMQASLEDHAAMELIDDPQRRIFAGMLIALDRGVGRIIEKLDQ 304 Query: 343 NGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSD 401 +TL+VF SDNG P RG KGS +EGGVR+P G I + D Sbjct: 305 QKLRQDTLVVFFSDNGGPTAELTSSNAPLRGGKGSLYEGGVRIPMIWSMPGTIPAGAEED 364 Query: 402 GIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNG 461 + D+ + L LA +++ DG + ++G + + + Sbjct: 365 TPILSLDIAASFLPLAVGEASQLET--------DGTNVLP-WIGRGTFKLPRTVWWRMPR 415 Query: 462 KLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRH 521 A+R ++K+ Q +FNL D ES + H Sbjct: 416 GARALRHGDWKFVQARQN--------------------QPIELFNLALDLSESKDLSDVH 455 Query: 522 IPMGVPLQTEMHAYMEILKKYPPRA 546 L A + PP Sbjct: 456 PNRLQDLLAAWDAVQAEM---PPAK 477 >UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186ED10 Length = 570 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 114/528 (21%), Positives = 183/528 (34%), Gaps = 103/528 (19%) Query: 84 KKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATI 143 ++PN+++ L DD+GW DV F+G TP+IDA+A G+IL S Y +P+RA++ Sbjct: 45 ERPNIIIILADDLGWNDVSFHGSNQI---QTPNIDALAYNGIILNSHYVPALCTPSRASL 101 Query: 144 LTGQYSIHHGILM-PPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMG-ENKESQPQ 200 +TG+Y G+ + +P G T +P+ + GY T A+GKWH+G KE P Sbjct: 102 MTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEYTPI 161 Query: 201 NVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAI 260 GFD G + Y + +S Y + F D + Sbjct: 162 YRGFDSHFGHWNGFQDYYDHTT--------MSDSLKGYDMRRNFEVDYSYQGM------- 206 Query: 261 ADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DNYPNAKYAGS 315 + +K +D P FLY H Sbjct: 207 ---------YTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPEDEISK 257 Query: 316 -----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEV---PPHG 367 P R Y + ++++ + LEKN L+N++I+F SDNG Sbjct: 258 HECINDPGRKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRGS 317 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWK--GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVA 425 P RG K S WEGGVR ++ + R S ++ ++D PT L AG + Sbjct: 318 NYPLRGLKESPWEGGVRGTAAIWSPFLNKTK-RVSKQLMHMSDWLPTLLTAAGLNYSSTQ 376 Query: 426 NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEH--YFLNGKLAAVRMDEFKYHVLIQQPYAY 483 + IDG+D + + S RK Y +++ +D +KY Q Sbjct: 377 LI----NKIDGIDMWNVLSN-DLPSPRKEVFNNYDEIENYSSLMIDSWKYVEGTAQEGKA 431 Query: 484 TQSGYQGGFTG------------------------------------------------- 494 + Sbjct: 432 DYWFEEPSRNNCSEYRVSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVKNKTKYVL 491 Query: 495 -TVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKK 541 +FNL DP E ++ + ++ + + + K Sbjct: 492 TCDPLLKRFCLFNLNDDPCERLNLADVFPDVVKRIKNRLLELKKSVVK 539 >UniRef50_A6DLD9 Sulfatase n=2 Tax=Chlamydiae/Verrucomicrobia group RepID=A6DLD9_9BACT Length = 517 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 120/520 (23%), Positives = 198/520 (38%), Gaps = 68/520 (13%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSP 138 +KPN+++ DD+G+ D+ GG G TP ID +A+ G+ +S Y+ + +P Sbjct: 18 SAATEKPNILIIYADDIGYGDLSCYGGT---GAQTPFIDRLANDGIRFSSGYASAATCTP 74 Query: 139 TRATILTGQYSIHHGILMPPMYGQP-GGLQGLTTLPQLLHDQGYVTQAIGKWHMG----- 192 +R ++LTG+Y+ + P + + D GY+T +GKWH+G Sbjct: 75 SRYSLLTGEYAFRNKSAKILPGNAPLIIDPAKPNIASFMKDAGYITALVGKWHLGLGLSD 134 Query: 193 ------ENKESQPQNVGFDDFRGFNSVSD-----MYTEWRDVHVNPEVALSPDRSEYIKQ 241 N + P+ +GFD + D V ++P + ++ + Sbjct: 135 GSFDWNSNIKPAPRELGFDYSFYMAATGDRVPSVYIENSEVVDLDPSDPIKVSYAKPVGT 194 Query: 242 LPFSKDDVH----------------AVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDK 285 P H + ED+ +++ + F++K Sbjct: 195 EPTGISHPHLLTVQADVQHAGTIVNGISRIGTMTGGHAARFKDEDMADTYLNKAIDFINK 254 Query: 286 MAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQ 345 D+PFF+Y+ H P+ ++ GSS GD +V+ + L KTL+ N Sbjct: 255 S--KDQPFFMYFAAHDNHVPRRPHPRFQGSSSL-GPRGDAIVQFDWTVGKLIKTLKANKM 311 Query: 346 LDNTLIVFTSDNGP-----------EAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGM 394 NTLI+ +SDNGP PFRG K S WEGG R+P V W G Sbjct: 312 YRNTLIILSSDNGPVLFDGYWEGSEARNGDHKAAGPFRGGKYSLWEGGTRMPFIVSWPGK 371 Query: 395 IQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKA 454 IQ S ++ D+F + L G +PK+ DG + +G + R Sbjct: 372 IQSGTSSALISQVDIFASIATLIG-------KDLPKSASPDGQNMLPALMGKS-PVGRDY 423 Query: 455 EHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES 514 A+RM ++KY P +FNL DP E+ Sbjct: 424 LVE-EALSQVALRMGDWKYI-----PPGTVTERGGLDEWIKTPVHPPGMLFNLADDPGET 477 Query: 515 DSIGVRHIPMGVPLQTEMHAYMEI---LKKYPPRAQIKSD 551 + + +H + + KK P +Q+ + Sbjct: 478 NDLSKQHPKKVKAMLAILKKEAPSKFLNKKTPGASQLGFE 517 >UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN Length = 599 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 107/511 (20%), Positives = 183/511 (35%), Gaps = 77/511 (15%) Query: 77 ELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSS 136 E + +P+++ L DD G+ DVG++G + TP +D +A++G+ L + Y QP Sbjct: 67 EPSTTSTSQPHLIFILADDQGFRDVGYHGSEI----KTPTLDKLAAEGVKLENYYVQPIC 122 Query: 137 SPTRATILTGQYSIHHGILMPPMYGQPG--GLQGLTTLPQLLHDQGYVTQAIGKWHMGE- 193 +P+R+ +TG+Y IH G+ + TLPQ L + GY T +GKWH+G Sbjct: 123 TPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFY 182 Query: 194 NKESQPQNVGFDDFRG-FNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 KE P GFD F G D YT ++ + Sbjct: 183 RKECMPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGI------ 236 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKY 312 Q + V+ + KP FLY + H +Y Sbjct: 237 -----------------YSTQMYTQR-VQQILASHNPTKPIFLYIAYQAVHSPLQAPGRY 278 Query: 313 AGSSP-----ARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHG 367 R Y + +++ N+ L+ G +N++I+++SDNG + Sbjct: 279 FEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-GS 337 Query: 368 RTPFRGAKGSTWEGGVRVPTFVYWK-GMIQPRKSDGIVDLADLFPTALDLAGHPGAKVAN 426 P RG+KG+ WEGG+R FV+ + +V + D +PT + LA Sbjct: 338 NWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLA-------EG 390 Query: 427 LVPKTTFIDGVDQTSFFLGTNGQSNRKAEHY--------------------FLNGKLAAV 466 + + +DG D +S R + + +A+ Sbjct: 391 QIDEDIQLDGYDIWETIS-EGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAI 449 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQT--------AGSSVFNLYTDPQESDSIG 518 R+ +K + +FN+ DP E + Sbjct: 450 RVQHWKLLTGNPGYSDWVPPQSFSNLGPNRWHNERITLSTGKSVWLFNITADPYERVDLS 509 Query: 519 VRHIPMGVPLQTEMHAYMEIL--KKYPPRAQ 547 R+ + L + + + +YPP+ Sbjct: 510 NRYPGIVKKLLRRLSQFNKTAVPVRYPPKDP 540 >UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S6_9BACT Length = 626 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 121/529 (22%), Positives = 178/529 (33%), Gaps = 116/529 (21%) Query: 78 LEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQ-PSS 136 E +PN+V L DD+GW D G TP+I+ +A++G+ T+AY+ P Sbjct: 20 AESSPKTRPNIVFILADDLGWSDTTLYG--TTKFFETPNIERLAARGMKFTNAYAANPVC 77 Query: 137 SPTRATILTGQYSIHHGILMPPMY------------------------GQPGGLQGLTTL 172 SPTRA+I+TG Y GI P + TL Sbjct: 78 SPTRASIMTGLYPGRLGITTPSGHVPEEKLEASLVARGSPSQKSLQATSATRLKLEYFTL 137 Query: 173 PQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALS 232 + L GY T GKWH+G P + GFD V+ Sbjct: 138 AEALKGAGYATGHFGKWHLGPEP-FDPLHQGFD-------------------VDVPHWSG 177 Query: 233 PDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKP 292 P + YI K + A G E L+ +KF+ D+P Sbjct: 178 PGPAGYIAPWKSPKFHLPAKPG--------------EQLEDLMSQEAIKFIR--VHKDEP 221 Query: 293 FFLYYGTRGCHFDNYPNAKYAGSSPART---------SYGDCMVEMNDVFANLYKTLEKN 343 F+L Y H + YG + ++D L TL++ Sbjct: 222 FYLNYWAFSVHSPWGGKPDLIEKYRRKADPNSAQRNPVYGAMVESLDDAVGRLLDTLDEL 281 Query: 344 GQLDNTLIVFTSDNGPEA------------EVPPHGRTPFRGAKGSTWEGGVRVPTFVYW 391 D+T+IVF SDNG PP P R KG+ +EGG R P V W Sbjct: 282 KLSDHTIIVFFSDNGGVNWFEPAMKEEAGMNSPPTTNAPLRAGKGTLYEGGTREPCVVVW 341 Query: 392 KGMIQ-PRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQS 450 G + ++D ++ D +PT L++AG DGV Q LGT Sbjct: 342 PGKTKAATQNDAMLCSVDFYPTLLEMAGVAAK-------PDLKFDGVSQVPALLGTGTPR 394 Query: 451 NRKAEHYFLNGKLAAV---------RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAG 501 + +Y + V R ++K F Q+ Sbjct: 395 DTLFCYYPVYSPPGHVVHTMPGVWGRRGDWKLIRY---------------FHDADDQSDR 439 Query: 502 SSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 ++NL+ D E+ + R L + A++ P Sbjct: 440 YELYNLHDDLGETKDLAARFPDKVKELNALIDAHLAETHALIPGKNPAY 488 >UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA5_RHOBA Length = 562 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 110/521 (21%), Positives = 196/521 (37%), Gaps = 63/521 (12%) Query: 38 AGYDHPNQYLVKPATTIADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVG 97 H + P + + + + E +PN+++ L DD+G Sbjct: 80 LSLTHATFHPHTPNMKHCIDSLAIAIVAVVFLGSF-----TEAHADDRPNIILLLADDLG 134 Query: 98 WMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS-QPSSSPTRATILTGQYSIHHGILM 156 + D+ G TP +D +AS+GL Y+ SPTRA++LTG+Y + GI Sbjct: 135 YGDLSCFGSP---AVKTPHLDRLASEGLKCNRFYAGSAVCSPTRASVLTGRYPLRFGITK 191 Query: 157 PPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE----------NKESQPQNVGFDD 206 + TT+ +LL D GY T IGKWH+G + P+ GFD Sbjct: 192 HFNDRNGWLPESATTVAELLKDAGYNTAHIGKWHLGGLHVDEPGKRLTNQPGPRQHGFDF 251 Query: 207 FRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK 266 + ++ P R + + + + +Q+ D P Sbjct: 252 Y------------------QTQIEQQPLRGQMGRDKTLFRKGGTVLLRNDQRISQDD-PY 292 Query: 267 YMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK-------YAGSSPAR 319 Y + D+ V+ ++K++ + PFF+ H P + + + Sbjct: 293 YHKHFTDANGDFAVEMIEKLSSEEDPFFINMWWLVPHKPYEPAPEPHWSDTAADDITDDQ 352 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTW 379 + + M+ + + L++ DNTL++FTSDNG E H +G K Sbjct: 353 HRFRSMVQHMDAKVGAILRKLDELKIADNTLVLFTSDNGAAFEGFIHD---LKGGKTELH 409 Query: 380 EGGVRVPTFVYWKGMIQPRKSDGIVDLA-DLFPTALDLAGHPGAKVANLVPKTTFIDGVD 438 +GG+RVP V W I ++ DL PT D A +P +DG+ Sbjct: 410 DGGIRVPMIVRWPDAIPAGQTSQTFSHTNDLLPTFCDAASV-------QLPSDLPLDGLS 462 Query: 439 QTSFFLGTNGQSN-RKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVM 497 S + G S + ++ +++ H +PYA T+ +G + Sbjct: 463 LLSHWKGGTPPSQVERGTVFWQLDLYKSLQR-----HYPKPKPYA-TEVVMRGNWKLLAF 516 Query: 498 QTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEI 538 + +F++ DP E ++ H + L ++ ++ Sbjct: 517 KGKPVELFDVGADPNEKRNVLAEHPELVASLSAQLKDWLNE 557 >UniRef50_Q1YP24 Arylsulfatase A n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YP24_9GAMM Length = 502 Score = 427 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 120/514 (23%), Positives = 198/514 (38%), Gaps = 47/514 (9%) Query: 54 IADNMMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNP 113 I +++ + Q+ + K KPN ++ DD+G+ D G G + Sbjct: 3 IQRSILLALLLTGCFSHAQESKQQAPNKHKAKPNFILVYTDDMGYSDAGPFGNPLI---E 59 Query: 114 TPDIDAVASQGLILTSAYSQ-PSSSPTRATILTGQYSIHHGILMPPM-----YGQPGGLQ 167 TP ID +AS G T+ Y+ P +P+R +LTG+ + G+ + + G + Sbjct: 60 TPAIDRLASSGQTWTNFYAAAPVCTPSRGALLTGKLPVRTGLYGDNINVFFPGSKKGMPE 119 Query: 168 GLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNP 227 TTL ++ D Y T GKWH+G+ P GF+++ G +DM E + + Sbjct: 120 NETTLAEVFQDNQYATGMFGKWHLGDATGFYPTRHGFNEWLGIPYSNDMDWEVEGITSSN 179 Query: 228 EVALSPD----------------RSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDL 271 + D I + G P + Sbjct: 180 IFFPAQDIMAKYGTVSPVLQRQIFQPEINDWQVPLIHSRKLADGRFVDHEIQRPADQTLI 239 Query: 272 DQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMND 331 +R+ ++F+ + + KPFF+Y H + +A++AG S A YGD + E++ Sbjct: 240 TRRYTTESIRFMREAVTAQKPFFIYLAHSMPHVPLFRSAEFAGKSKA-GIYGDVIEEIDW 298 Query: 332 VFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGR--TPFRGAKGSTWEGGVRVPTFV 389 + + DNT IVFTSDNGP H TP R KG+T++GG+RV T Sbjct: 299 SLQKIIAATQALAIDDNTYIVFTSDNGPWLIYGTHAGTATPLRDGKGTTFDGGMRVMTVF 358 Query: 390 YWKGMIQPRKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQ 449 I D + DLF T LAG A D VD + Sbjct: 359 SGPD-IHQGIIDDLGSQTDLFATFTALAGFGSQTTAA--------DSVDLSHTLRNGQ-P 408 Query: 450 SNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYT 509 S R + ++ +L A R + K H + Q Y G + + +L Sbjct: 409 SPRTSIPFYSGSELRAFRYQDHKVHFVTQGAY---------GMKPAREVHQPAMLIDLKA 459 Query: 510 DPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 D E+++I + + + + + + + P Sbjct: 460 DVGEANNIAKNNPQRVLEVVQQAETFKQSITVAP 493 >UniRef50_A0Z7U6 Arylsulfatase n=2 Tax=Gammaproteobacteria RepID=A0Z7U6_9GAMM Length = 512 Score = 427 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 145/512 (28%), Positives = 228/512 (44%), Gaps = 35/512 (6%) Query: 58 MMPVMQHPAQDKETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDI 117 M M + + L KPN ++ DDVG+ +V G +G TP+I Sbjct: 1 METFMMLLKRCLVAALLVTPLSVFASDKPNFLMLWGDDVGYWNVSAYNQG-MMGYETPNI 59 Query: 118 DAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILMPPMYG-QPGGLQGLTTLPQLL 176 D++A G++ T AY + S + RA +TGQ G+L + G + G Q T+ + L Sbjct: 60 DSIAKDGMLFTHAYGEQSCTAGRAAFVTGQSGFRTGLLKVGLPGAKEGMDQRDPTIAEYL 119 Query: 177 HDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRS 236 +GY+T GK H+G+ E P N GFD+F G + + + +P+ P Sbjct: 120 KSKGYMTGQFGKNHLGDRDEHLPTNHGFDEFIG----NLYHLNAEEEPEHPDYPKDPAFR 175 Query: 237 EYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLY 296 E K + G + +T K ME +D+ + FL++ K+D+PFFL+ Sbjct: 176 EKFGPRGVIK----SSSDGRIEDTGPLTKKRMETIDEEVTAAALDFLERAVKADQPFFLW 231 Query: 297 YGTRGCHFDNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSD 356 Y T H + G + + D MVE + + + L++ G DNT++++T+D Sbjct: 232 YNTTRMHVHTRLKPESEGVT-GLGVFPDGMVEHDGMIGQMLDKLDELGITDNTVVMYTTD 290 Query: 357 NGPEAEVPPHGRT-PFRGAKGSTWEGGVRVPTFVYWKGMIQPRK-SDGIVDLADLFPTAL 414 NG E P G T PFRG K + WEGG RVP V W G+I+P S+GIV D FPT Sbjct: 291 NGAEKFTWPDGGTAPFRGEKNTNWEGGYRVPLLVKWPGLIEPGSRSNGIVSHMDWFPTIA 350 Query: 415 DLAGHPGAKVA-------NLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL-NGKLAAV 466 G K +DG + ++ G +S R YF +G L + Sbjct: 351 AALGDTDLKEQVSKGSAFGEGNSKVHLDGYNMLPYWGGETDESPRAEFFYFSDDGNLVGM 410 Query: 467 RMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQES--------DSIG 518 R +K Q+ +++ + +Q +F+LY+DP E Sbjct: 411 RYQRWKAVFAEQRAHSF------DVWADPFVQLRVPKIFDLYSDPFEEAEHESIHYKDWW 464 Query: 519 VRHIPMGVPLQTEMHAYMEILKKYPPRAQIKS 550 +H+ + VP QT + ++ +YPPR + S Sbjct: 465 FQHVFLLVPAQTYVGEFLGTFVEYPPRQKPAS 496 >UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7AKS6_9PORP Length = 464 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 105/475 (22%), Positives = 168/475 (35%), Gaps = 64/475 (13%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 + + + ++PN+++ L DD G+ D GF G A TP+ID +A++G I T A+ Sbjct: 21 ASCSSGQDEEAQRPNILILLADDAGYADFGFMG---ATDIQTPNIDRLAAEGCIFTDAHV 77 Query: 133 QP-SSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHM 191 SSP+R+ +LTG+Y +G G LP LL Y T IGKWH+ Sbjct: 78 AATVSSPSRSMMLTGRYGQRYGYECNLDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHL 137 Query: 192 GENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHA 251 G +P GFD F G + Y + + Sbjct: 138 GSEPSQRPNAKGFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGR------------- 184 Query: 252 VRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAK 311 + +F+ + S++PF LY H N + Sbjct: 185 ------------KLSFDGYFTDELASKAQQFVTE---SEQPFMLYMSFTAPHSPNEATEE 229 Query: 312 YAGS--SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPPHGRT 369 R Y M ++ + L+ G+ DNT+I F SDNG Sbjct: 230 DLARFEGQPRQKYAAMMYALDRGVGKIVDELKAAGKFDNTIIFFLSDNGGSTT-NQSSNL 288 Query: 370 PFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLV 428 P +G KG+ +EGG RVP FV W + ++ G+ D+F T +D P + Sbjct: 289 PLKGFKGNKFEGGQRVPFFVVWGDRFKRDQRFTGLTSSLDIFATVVDALDIPEEGLHK-- 346 Query: 429 PKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGY 488 IDGV + G + +A + A+R +K + Sbjct: 347 ----PIDGVSLLPYLSGEKSGNPHEALF-WRKMDTRAIRSGSYKLIITRGVDSV------ 395 Query: 489 QGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYP 543 ++N+ D +E + L ++ + + K P Sbjct: 396 ---------------LYNMDQDVEEMHDLLSSEPEKARELMEQLSEWEQACCKDP 435 >UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME Length = 562 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 116/545 (21%), Positives = 193/545 (35%), Gaps = 104/545 (19%) Query: 76 AELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPS 135 A +K+ KPN++ L DD+G+ DVGF+G PTP+IDA+A G+IL Y P Sbjct: 16 AAEVEKSPAKPNIIFILADDLGFNDVGFHGS---AEIPTPNIDALAYSGIILNRYYVAPI 72 Query: 136 SSPTRATILTGQYSIHHGILMPPMYGQ--PGGLQGLTTLPQLLHDQGYVTQAIGKWHMGE 193 +P+R+ ++TG+Y IH G+ +Y G LPQ L++ GY + GKWH+G Sbjct: 73 CTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGH 132 Query: 194 NK-ESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAV 252 K + P GF GF S Y + + Sbjct: 133 WKLKYTPLYRGFSSHVGFWSGHQDYNDHT---------------------AVENNQWGLD 171 Query: 253 RGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF-----DNY 307 Q D+ Y D+ VK + + P FLY CH Sbjct: 172 MRNGTQVAYDLHGHYT---TDVITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLP 228 Query: 308 PNAKYAGS-----SPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAE 362 + R + + +M++ + L K+ L+N++I+F+SDNG A+ Sbjct: 229 VPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQ 288 Query: 363 V---PPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ-PRKSDGIVDLADLFPTALDLAG 418 P +G K + WEGGVR ++ + + R S+ + + D PT L+ AG Sbjct: 289 GFNLNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAG 348 Query: 419 HPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLN--GKLAAVRMDEFKYHVL 476 A IDG + + S R + ++ AA+ + ++K Sbjct: 349 GQPALSNLS----KQIDGQSIWRALV-QDKASPRLNVLHNIDDIWGSAALSVGDWKLVKG 403 Query: 477 IQQPYAYTQSGYQGGFTGTVMQT------------------------------------- 499 ++ G + Sbjct: 404 TNYRGSWDGWYGPAGERDPRLYDWQLVGRSRAGKALEALKMLPSRADQQRIRAAATVSCP 463 Query: 500 --------------AGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHAYMEILKKYPPR 545 + +F++ DP E ++ ++ + L TE+ + PP Sbjct: 464 GQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTELERFNAT--AVPPS 521 Query: 546 AQIKS 550 + Sbjct: 522 NKPAD 526 >UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY54_CAUSK Length = 559 Score = 426 bits (1096), Expect = e-117, Method: Composition-based stats. Identities = 119/517 (23%), Positives = 181/517 (35%), Gaps = 92/517 (17%) Query: 69 KETQQKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVG-NPTPDIDAVASQGLIL 127 + E + PNV+V L DD+G+ D+ FNGGGVA G PTP+ID++ G+ Sbjct: 45 AVAWSEGPEAAPSGPRPPNVIVILADDMGFNDITFNGGGVAGGLVPTPNIDSLGHDGVSF 104 Query: 128 TSAY-SQPSSSPTRATILTGQYSIHHGILMPPMY-------------------------- 160 + Y + +P+RATI+TG+Y+ G P Sbjct: 105 ANGYDGNATCAPSRATIMTGRYATRFGFEFTPAPVAFEKMVGSEGAAGDIVLPRFYPDRL 164 Query: 161 ---------------GQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFD 205 + T+ QLL +GY T GKWH+G S+P+ GFD Sbjct: 165 KAMPPGSTAPTPDAVNELSMPASEITVAQLLKTRGYHTLHFGKWHLGGKAGSRPEQKGFD 224 Query: 206 DFRGFNSVSDMYTEWRDVH-VNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADIT 264 + GF + MY D N + P LP++ + Sbjct: 225 ESLGFIAGGSMYLPEGDPGVENAKQPWDPIDRFLWPNLPYAVQFNGSPMFRP-------- 276 Query: 265 PKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPNAKYAG-----SSPAR 319 + D VK + A ++PFF+Y+ H Sbjct: 277 ---GGYMTDYLTDEAVKAVR--ANRNRPFFMYFAPNAIHTPLQATKADYDALPEIKDHRL 331 Query: 320 TSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP-PHGRTPFRGAKGST 378 YG + ++ L + L++ G NTL++FTSDNG + P P+RG K + Sbjct: 332 RVYGAMVRNLDRNVGRLLQALKEEGLDQNTLVIFTSDNGGANYIGLPDINRPYRGWKATF 391 Query: 379 WEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 +EGG+ P F+ W +I + V D+F A +PK IDGV Sbjct: 392 FEGGIHSPFFMRWPAVIPANSRYSAPVGHIDIF-------ATAAAAAGAPLPKDRVIDGV 444 Query: 438 DQTSFFLGTNGQSNRKAEHYFLNGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVM 497 D F G + + +G V ++K Q Sbjct: 445 DLVPFVQGKATGRPHQTLF-WRSGSYKVVLDGDWKLQSSEAQ------------------ 485 Query: 498 QTAGSSVFNLYTDPQESDSIGVRHIPMGVPLQTEMHA 534 +FNL DP E + + + Sbjct: 486 --NKIWLFNLAQDPTEQHELSAAQPERVKAMLALLRQ 520 >UniRef50_A4GIB2 Putative secreted sulfatase n=1 Tax=uncultured marine bacterium HF10_49E08 RepID=A4GIB2_9BACT Length = 667 Score = 426 bits (1096), Expect = e-117, Method: Composition-based stats. Identities = 117/561 (20%), Positives = 185/561 (32%), Gaps = 165/561 (29%) Query: 80 KKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQP-SSSP 138 + + +KPN+V FL+DD+GW DVG G + TP ID +A +G+ +AYS SP Sbjct: 18 QTSARKPNIVFFLVDDLGWSDVGCYGSKF---HETPAIDQLAKEGIRFDNAYSTCHVCSP 74 Query: 139 TRATILTGQYSIHHGILMPPMYGQP--------------GGLQGLTTLPQLLHDQGYVTQ 184 +RA+ILTG+Y + + G+P TL + L GY T Sbjct: 75 SRASILTGKYPARTNLTEW-LGGRPERDYEPLHHGEKLTALPDEEVTLAETLKSHGYATA 133 Query: 185 AIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPF 244 GK H+ P GFD+ ++ P Y ++LP Sbjct: 134 NYGKAHL----RVDPNAYGFDE---------------EITGWVRSYHYPFGGAYNEKLP- 173 Query: 245 SKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHF 304 K + + D + F+++ D+PFF++ H Sbjct: 174 --------------------AKKGDYYTDKLTDAALDFIER--NKDRPFFVHLEHFAVHD 211 Query: 305 DNYPNA----KYAGSSPARTS--------------------------------------- 321 KY A Sbjct: 212 PIQGRPDLVEKYRKKLAAMPKQDGPDFILESNPDGPELTTEELKALAENDELQDHQDARV 271 Query: 322 -----------YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVP------ 364 + + ++ + K L+ G DNT+++FT+DNG + Sbjct: 272 WWVKQKQDNVEFAGMLEATDESLGRIRKKLKDLGLADNTIVIFTADNGGMSASNQYRGIN 331 Query: 365 ----------PHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQP-RKSDGIVDLADLFPTA 413 P RGAKG +EGG+RVP VYW G I+P S+ +V D +PT Sbjct: 332 HPIESLDSRFASSNLPLRGAKGWNYEGGIRVPLVVYWPGRIKPDSTSNALVTGTDFYPTL 391 Query: 414 LDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSN---RKAEHYFLNGKLA---AVR 467 L++ G P IDGV G HY +G + A+R Sbjct: 392 LEMIGMPTL-------PNQHIDGVSFLPALRGKAHDRGAIYWHFPHYSNHGYQSPGGAIR 444 Query: 468 MDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPMGVP 527 + ++K + +F+L D E + + + Sbjct: 445 LGKYKLLEYY--------------------ENGSVQLFDLEKDIGEQNDLSKTKPDVKAK 484 Query: 528 LQTEMHAYMEILKKYPPRAQI 548 L +H + + P + Sbjct: 485 LLKMLHEWRREVDAKMPYPKT 505 >UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX9_9BACT Length = 467 Score = 426 bits (1096), Expect = e-117, Method: Composition-based stats. Identities = 110/502 (21%), Positives = 180/502 (35%), Gaps = 93/502 (18%) Query: 73 QKLAELEKKTGKKPNVVVFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYS 132 +KPN+++ DD G+ D+G G N TP +D +A +G TS Y+ Sbjct: 12 STFVAASLTAAEKPNILIIFTDDQGYADLGCFGSE---ENQTPVLDKLAKEGTKFTSFYA 68 Query: 133 QPSSSPTRATILTGQYSIHHGILMPPMYGQPGGLQGLTTLPQLLHDQGYVTQAIGKWHMG 192 QP P+R+ +LTG+Y G T ++L + GY T +GKW + Sbjct: 69 QPVCGPSRSALLTGRYPARSKGW--------GMPASEITFAEMLKETGYQTACVGKWDVS 120 Query: 193 ENKE---SQPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDV 249 + P GFD + G + K D+ Sbjct: 121 NRQPIIPRMPNAQGFDYYYGTLGGN----------------------------GSGKIDL 152 Query: 250 HAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFFLYYGTRGCHFDNYPN 309 + + + T + M L + + + + FL+K +KPF LY H + Sbjct: 153 Y------ENNKKERTTEDMASLTRLYTNKAIDFLEKQRDPEKPFILYLAHTMTHTVVDAS 206 Query: 310 AKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNGPEAEVPP---- 365 K+ + Y + E++ L L + NTL+++TSDNGP + Sbjct: 207 PKFKEKTGDN-LYRAAVEELDYETGRLLNKLNQLNLSKNTLVIYTSDNGPWNQPKYINGG 265 Query: 366 ------------HGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-KSDGIVDLADLFPT 412 FR K S WEGG VP + W G I +DG++ D PT Sbjct: 266 AKNDHPENSIFWGDAGEFRDGKASIWEGGAHVPCVMRWPGKIAAGKTNDGLMATIDFLPT 325 Query: 413 ALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFLNGKLA-------- 464 + G +P IDGV+Q F G + ++ R+ Y Sbjct: 326 LAAVTGAK-------IPDERVIDGVNQLGFICGKS-ETARETYIYNPGSASVQTKLVQGN 377 Query: 465 AVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGVRHIPM 524 A+R +K + + +G T ++NL D E+ ++ ++ Sbjct: 378 AIREGNWKLISPLTVGWFLEDAG-----------TGSWELYNLKEDIGETKNLAKQYPEK 426 Query: 525 GVPLQTEMHAYMEILKKYPPRA 546 L+ + + K PR Sbjct: 427 VEHLKKLLQSSEAKFPKVKPRP 448 >UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB0_50A10 RepID=A4GJF1_9BACT Length = 544 Score = 426 bits (1096), Expect = e-117, Method: Composition-based stats. Identities = 110/502 (21%), Positives = 191/502 (38%), Gaps = 87/502 (17%) Query: 82 TGKKPNVVVFLLDDVGWMDVGFNGGGVAVG-NPTPDIDAVASQGLILTSAYSQ-PSSSPT 139 +PN+++ L DD+G+ D+ + GG A G T +IDA+A G++ T Y+ + +P+ Sbjct: 56 DDNRPNIILVLADDMGYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAANATCAPS 115 Query: 140 RATILTGQYSIHHGILMPPMYGQP---------------------------------GGL 166 RA+I+TG+Y G P+ G Sbjct: 116 RASIMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMP 175 Query: 167 QGLTTLPQLLHDQGYVTQAIGKWHMGENKESQPQNVGFDDFRGFNSVSDMYTEWRDVHVN 226 T+ ++L D GY T IGKWH+G P + GF D G + + DV Sbjct: 176 TEQITIAEVLRDAGYYTAHIGKWHLGHEYGMDPMSQGFQDSLGLVGPLYLPEDHPDV--- 232 Query: 227 PEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKM 286 ++ I ++ + A G D + + D +K ++ Sbjct: 233 ----VNAKFDTRIDKMIWGMGQYSANFNGGDLFAPDK------YVTDYYTDEALKVIEN- 281 Query: 287 AKSDKPFFLYYGTRGCHFDNYP-NAKYAGSSPART----SYGDCMVEMNDVFANLYKTLE 341 ++PFFLY H + + S Y + ++ + + L+ Sbjct: 282 -NKNRPFFLYLSHWAIHNPLQALRSDFEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLK 340 Query: 342 KNGQLDNTLIVFTSDNGPEAEVPPHG-RTPFRGAKGSTWEGGVRVPTFVYWKGMIQPR-K 399 + TLI+FTSDNG + + P+RG K S ++GG+RVP + W I P K Sbjct: 341 ELDIYGKTLIIFTSDNGGANYIELNDINKPYRGWKISFFDGGIRVPYIISWPDEINPGKK 400 Query: 400 SDGIVDLADLFPTALDLAGHPGAKVANLVPKTTFIDGVDQTSFFLGTNGQSNRKAEHYFL 459 S+ V D+FPT L AG T +DGVD F + K + Sbjct: 401 SENAVHHFDIFPTILKAAGIE---------STNELDGVDLMPFIKNDSSSKPHKTLF-WR 450 Query: 460 NGKLAAVRMDEFKYHVLIQQPYAYTQSGYQGGFTGTVMQTAGSSVFNLYTDPQESDSIGV 519 +G +V + +K+ + ++ + +F+ DP E +++ Sbjct: 451 SGNHQSVLHEHWKFIISKKENFR--------------------WLFDTSADPTEKNNLVD 490 Query: 520 RHIPMGVPLQTEMHAYMEILKK 541 + + ++ + + K Sbjct: 491 SNPDVVKEIEELLVEFNSEQKD 512 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.170 0.600 Lambda K H 0.267 0.0522 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,882,003,404 Number of Sequences: 3077464 Number of extensions: 217652149 Number of successful extensions: 531108 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 3429 Number of HSP's successfully gapped in prelim test: 1808 Number of HSP's that attempted gapping in prelim test: 496229 Number of HSP's gapped (non-prelim): 9794 length of query: 551 length of database: 1,040,396,356 effective HSP length: 134 effective length of query: 417 effective length of database: 628,016,180 effective search space: 261882747060 effective search space used: 261882747060 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 97 (41.6 bits)