BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (560 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammapro... 1161 0.0 UniRef50_D2YC71 Sulfatase n=2 Tax=Vibrio mimicus RepID=D2YC71_VIBMI 749 0.0 UniRef50_D1P6M6 Putative sulfatase YdeN n=2 Tax=Providencia RepI... 409 e-112 UniRef50_C5BEH4 Sulfatase, putative n=37 Tax=Gammaproteobacteria... 407 e-112 UniRef50_B1KD78 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 281 4e-74 UniRef50_UPI0001968551 hypothetical protein BACCELL_00117 n=1 Ta... 216 1e-54 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 206 3e-51 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 201 6e-50 UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomy... 199 3e-49 UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bactero... 194 6e-48 UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 192 3e-47 UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 191 6e-47 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 191 7e-47 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 189 2e-46 UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 189 2e-46 UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellu... 186 3e-45 UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bactero... 184 5e-45 UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomy... 180 1e-43 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 179 2e-43 UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 176 2e-42 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 176 3e-42 UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica R... 174 6e-42 UniRef50_Q02AN8 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 174 8e-42 UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 174 1e-41 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 174 1e-41 UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto... 173 2e-41 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 173 2e-41 UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 172 2e-41 UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y2... 172 4e-41 UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase... 171 6e-41 UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC B... 170 2e-40 UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 170 2e-40 UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacte... 170 2e-40 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 169 3e-40 UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bactero... 168 5e-40 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 168 5e-40 UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB... 167 7e-40 UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW10... 167 8e-40 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 166 2e-39 UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organi... 166 2e-39 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 166 3e-39 UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 165 5e-39 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 164 7e-39 UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT 164 7e-39 UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 164 7e-39 UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY... 164 9e-39 UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 164 1e-38 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 163 2e-38 UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 162 2e-38 UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 162 2e-38 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 162 3e-38 UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 162 3e-38 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 161 5e-38 UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 161 8e-38 UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LD... 161 8e-38 UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisp... 160 1e-37 UniRef50_A6C1Q0 N-acetylgalactosamine 6-sulfate sulfatase n=1 Ta... 160 1e-37 UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bac... 159 3e-37 UniRef50_UPI0001968B7D hypothetical protein BACCELL_01446 n=1 Ta... 159 4e-37 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 158 4e-37 UniRef50_B1KD86 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 158 6e-37 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 157 9e-37 UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 157 1e-36 UniRef50_A6DNJ0 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 157 1e-36 UniRef50_C3Q8V4 Arylsulfatase B n=6 Tax=Bacteroides RepID=C3Q8V4... 157 1e-36 UniRef50_A4AVA7 Aryl-sulphate sulphohydrolase n=2 Tax=Bacteroide... 157 1e-36 UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepI... 156 2e-36 UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 156 2e-36 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 156 2e-36 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 155 5e-36 UniRef50_Q7UHJ6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 155 5e-36 UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 155 5e-36 UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM 154 7e-36 UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN 154 8e-36 UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6... 154 9e-36 UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 154 1e-35 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 153 2e-35 UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 153 2e-35 UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO 153 2e-35 UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 153 2e-35 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 152 3e-35 UniRef50_Q2LZ24 GA16747 n=5 Tax=Drosophila RepID=Q2LZ24_DROPS 152 3e-35 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 151 7e-35 UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 150 1e-34 UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN 150 1e-34 UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 150 1e-34 UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Ta... 149 3e-34 UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9... 149 3e-34 UniRef50_C9KTU9 Twin-arginine translocation pathway signal n=5 T... 149 4e-34 UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 148 4e-34 UniRef50_A6DHS2 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 148 5e-34 UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas nept... 148 5e-34 UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 148 5e-34 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 148 6e-34 UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 147 8e-34 UniRef50_A6LIX6 N-acetylgalactosamine 6-sulfatase n=2 Tax=Bacter... 147 8e-34 UniRef50_A6DIE0 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 147 8e-34 UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 147 1e-33 UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LC... 147 1e-33 UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomy... 147 1e-33 UniRef50_A6DGL0 Arylsulfatase A n=3 Tax=Lentisphaera araneosa HT... 146 2e-33 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 146 2e-33 UniRef50_A6DG78 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 146 2e-33 UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 146 2e-33 UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 146 3e-33 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 145 3e-33 UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 145 4e-33 UniRef50_A3HZ22 Putative exported uslfatase n=1 Tax=Algoriphagus... 144 6e-33 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 144 7e-33 UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 144 9e-33 UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7... 144 9e-33 UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 144 1e-32 UniRef50_A6DMY7 Iduronate-sulfatase and sulfatase 1 n=2 Tax=Lent... 143 1e-32 UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomy... 143 1e-32 UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisp... 143 1e-32 UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 143 1e-32 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 143 2e-32 UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 143 2e-32 UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 143 2e-32 UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 143 2e-32 UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT 143 2e-32 UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=... 143 2e-32 UniRef50_A9BNY8 Sulfatase n=11 Tax=cellular organisms RepID=A9BN... 142 3e-32 UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacteriu... 142 3e-32 UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM... 142 4e-32 UniRef50_A6DR15 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC... 142 4e-32 UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA 142 4e-32 UniRef50_B1KFX9 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 142 5e-32 UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 142 5e-32 UniRef50_A6DM48 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 141 5e-32 UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7... 141 5e-32 UniRef50_Q7UTJ1 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirel... 141 5e-32 UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep... 141 6e-32 UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deute... 140 9e-32 UniRef50_D1QVA8 N-acetylgalactosamine-6-sulfatase n=1 Tax=Prevot... 140 1e-31 UniRef50_C9KTC2 Arylsulphatase A n=5 Tax=Bacteroides RepID=C9KTC... 140 1e-31 UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirel... 140 1e-31 UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=... 140 1e-31 UniRef50_A6DFS2 N-acetylgalactosamine-6-sulfatase n=1 Tax=Lentis... 140 1e-31 UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 140 1e-31 UniRef50_B7S0F9 Sulfatase domain protein n=1 Tax=marine gamma pr... 140 1e-31 UniRef50_B5JMW2 Sulfatase domain protein n=1 Tax=Verrucomicrobia... 140 1e-31 UniRef50_C3KKB6 MIP05773p n=12 Tax=Drosophila RepID=C3KKB6_DROME 140 1e-31 UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 140 2e-31 UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 140 2e-31 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 139 2e-31 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 139 2e-31 UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomy... 139 2e-31 UniRef50_Q7UNI8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 139 2e-31 UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 139 2e-31 UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN... 139 2e-31 UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA 139 2e-31 UniRef50_A6DI98 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 139 3e-31 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 139 3e-31 UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase... 138 5e-31 UniRef50_A6DFR6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Lentis... 138 5e-31 UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D... 138 5e-31 UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7... 138 5e-31 UniRef50_C5VKQ0 N-acetylgalactosamine-6-sulfatase n=3 Tax=Prevot... 137 7e-31 UniRef50_A6DS95 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 137 8e-31 UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 137 9e-31 UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 136 2e-30 UniRef50_A6DMW5 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Lent... 136 2e-30 UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 136 2e-30 UniRef50_A6DHI2 Aryl-sulphate sulphohydrolase n=2 Tax=Lentisphae... 136 2e-30 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 136 2e-30 UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI... 135 3e-30 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 135 3e-30 UniRef50_A6DR20 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 135 3e-30 UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris ... 135 4e-30 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 135 4e-30 UniRef50_B8KKX3 Arylsulfatase B n=1 Tax=gamma proteobacterium NO... 135 4e-30 UniRef50_C7PLP2 Sulfatase n=9 Tax=Bacteroidetes RepID=C7PLP2_CHIPD 135 4e-30 UniRef50_Q7UH85 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 135 4e-30 UniRef50_C3ZQB5 Putative uncharacterized protein n=1 Tax=Branchi... 135 5e-30 UniRef50_A6CB33 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 135 5e-30 UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 135 6e-30 UniRef50_A6DG79 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 134 6e-30 UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 134 7e-30 UniRef50_UPI00005846A1 PREDICTED: similar to arylsulfatase n=1 T... 134 8e-30 UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium ... 134 9e-30 UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8... 134 1e-29 UniRef50_A3I2G9 Putative secreted sulfatase n=1 Tax=Algoriphagus... 134 1e-29 UniRef50_UPI00005887B4 PREDICTED: similar to galactosamine (N-ac... 134 1e-29 UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 134 1e-29 UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE 134 1e-29 UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA... 134 1e-29 UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=... 134 1e-29 UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_B... 133 2e-29 UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 133 2e-29 UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepI... 133 2e-29 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 133 2e-29 UniRef50_A6DMW1 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 133 2e-29 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 133 2e-29 UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bactero... 133 2e-29 UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 133 2e-29 UniRef50_P50473 Arylsulfatase n=8 Tax=Deuterostomia RepID=ARS_STRPU 132 2e-29 UniRef50_A6KZ75 Putative secreted sulfatase n=8 Tax=Bacteroides ... 132 3e-29 UniRef50_D0PR10 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 132 3e-29 UniRef50_A6DHY0 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 132 3e-29 UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 132 4e-29 UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 132 4e-29 UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 132 5e-29 UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT 132 5e-29 UniRef50_A6DQ01 N-acetylgalactosamine-4-sulfatase n=2 Tax=Lentis... 131 5e-29 UniRef50_B7RWW8 Sulfatase, putative n=1 Tax=marine gamma proteob... 131 6e-29 UniRef50_Q0SBH5 Arylsulfatase n=7 Tax=Bacteria RepID=Q0SBH5_RHOSR 131 7e-29 UniRef50_UPI0001C3580F sulfatase n=2 Tax=Clostridium hathewayi D... 131 8e-29 UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN 131 9e-29 UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 T... 130 9e-29 UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhod... 130 1e-28 UniRef50_A6DU78 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 130 1e-28 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 130 1e-28 UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flamme... 130 1e-28 UniRef50_A7VQW1 Putative uncharacterized protein n=1 Tax=Clostri... 130 1e-28 UniRef50_B4GZS4 GL22855 n=1 Tax=Drosophila persimilis RepID=B4GZ... 130 1e-28 UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis... 130 1e-28 UniRef50_Q9VVM4 CG7402 n=10 Tax=Drosophila RepID=Q9VVM4_DROME 130 1e-28 UniRef50_Q9VVM1 CG7408 n=8 Tax=Sophophora RepID=Q9VVM1_DROME 130 2e-28 UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 130 2e-28 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 130 2e-28 UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 130 2e-28 UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera... 130 2e-28 UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Ta... 129 2e-28 UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W9... 129 2e-28 UniRef50_D2R323 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 129 3e-28 UniRef50_Q2GB51 Sulfatase n=6 Tax=Proteobacteria RepID=Q2GB51_NOVAD 129 3e-28 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 129 3e-28 UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TK... 129 4e-28 UniRef50_A6DUI7 Putative exported uslfatase n=1 Tax=Lentisphaera... 129 4e-28 UniRef50_Q7UXA8 N-acetylgalactosamine-6-sulfate sulfatase n=2 Ta... 129 4e-28 UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacter... 129 4e-28 UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT 128 5e-28 UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 128 5e-28 UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 T... 128 5e-28 UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera aran... 128 5e-28 UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivoran... 128 5e-28 UniRef50_D2R5N1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 128 6e-28 UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE 128 6e-28 UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob... 128 7e-28 UniRef50_A9MER1 Putative uncharacterized protein n=2 Tax=Enterob... 127 9e-28 UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=... 127 1e-27 UniRef50_A3ZWK4 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 127 1e-27 UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 127 1e-27 UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 127 1e-27 UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus E... 126 2e-27 UniRef50_C1ZHB0 Arylsulfatase A family protein n=1 Tax=Planctomy... 126 2e-27 UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 125 3e-27 UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD 125 3e-27 UniRef50_A6DMV0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 125 3e-27 >UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammaproteobacteria RepID=YDEN_ECOLI Length = 560 Score = 1161 bits (3003), Expect = 0.0, Method: Compositional matrix adjust. Identities = 560/560 (100%), Positives = 560/560 (100%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI Sbjct: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV Sbjct: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA 180 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA Sbjct: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA 180 Query: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS Sbjct: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 Query: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ Sbjct: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 Query: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ Sbjct: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS Sbjct: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ Sbjct: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 Query: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP Sbjct: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 Query: 541 LSEVNQEKFNNIKKALSEAK 560 LSEVNQEKFNNIKKALSEAK Sbjct: 541 LSEVNQEKFNNIKKALSEAK 560 >UniRef50_D2YC71 Sulfatase n=2 Tax=Vibrio mimicus RepID=D2YC71_VIBMI Length = 577 Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust. Identities = 353/551 (64%), Positives = 442/551 (80%), Gaps = 5/551 (0%) Query: 5 LKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLT 64 K+++++TSISLIL S + A + LKATKTNVAFSD +EYSTKGKPNII+LT Sbjct: 24 FKRNLLTTSISLILVSHLLPSFASTQNSDNLKATKTNVAFSDIEISEYSTKGKPNIIILT 83 Query: 65 MDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTN 124 +DD+GYGQ+ FD+ +F+ ++M++++VVDTYKI ID+AI AA+ STPT+ L+D GV+ N Sbjct: 84 VDDMGYGQMNFDQNTFNEESMKDQKVVDTYKIPIDEAINAAKNSTPTINKLIDTGVKINN 143 Query: 125 GYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKW 184 GYVAHGVSGPSRAAI+TG+APA+FGVYSN DA+ GIP+ E FLPE+FQNHGYYTAAVGKW Sbjct: 144 GYVAHGVSGPSRAAIITGKAPAKFGVYSNIDAEQGIPVEEKFLPEIFQNHGYYTAAVGKW 203 Query: 185 HLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKN 244 HLSKISNV V E KQTRDYHDNF T+S E+WQPQNRGF+YFMGFH G AYYNSP+LF+N Sbjct: 204 HLSKISNVAVDEAKQTRDYHDNFITYSGEQWQPQNRGFNYFMGFHTHGVAYYNSPALFRN 263 Query: 245 RERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG 304 RE +PAKGY+ DQ T+EAIGVV++AK+ D PF+LYLAYNAPHLPND PAP QYQ++F TG Sbjct: 264 RENIPAKGYVIDQFTNEAIGVVNKAKSNDAPFLLYLAYNAPHLPNDAPAPKQYQQRFKTG 323 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK 364 SQTADN+YAS+Y+VDQGVKR+L QLK N QYDNT+I+FTSDNGAVIDGPLPLNG QKG+K Sbjct: 324 SQTADNFYASIYAVDQGVKRLLAQLKANDQYDNTLIMFTSDNGAVIDGPLPLNGEQKGFK 383 Query: 365 SQTYPGGTHTPMFMWWKGKLQP--GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 SQ GG HTPMF+WW G+ ++KL S+MDF+PTALDAA I IP+ LDGVSLL Sbjct: 384 SQVLSGGLHTPMFVWWNGRFHKTTKEFNKLTSSMDFFPTALDAAGIKIPEG--LDGVSLL 441 Query: 423 PWLQ-DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 P+L +K PHK+L WI Y+H FDE+NIPFW+NYHK+VR +SDDYP NP TE S F Sbjct: 442 PYLNGEKTNSSPHKSLVWIAPYAHHFDEKNIPFWNNYHKYVRSESDDYPINPYTEQFSDF 501 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 S+ VR + +SL+Y E+ ++GLYKL D++ ++ ++ P VV M+ + EF + S+ P+ Sbjct: 502 SWAVRTDRFSLIYNPEDKKIGLYKLEDVRHENEISEQYPNVVSAMKNDLAEFANKSKMPI 561 Query: 542 SEVNQEKFNNI 552 S+ N +KFN + Sbjct: 562 SKDNYDKFNKV 572 >UniRef50_D1P6M6 Putative sulfatase YdeN n=2 Tax=Providencia RepID=D1P6M6_9ENTR Length = 549 Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust. Identities = 206/492 (41%), Positives = 303/492 (61%), Gaps = 6/492 (1%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 T KPN++++ MDDLG GQL F S D + R Y I+K +EAA+ + P + Sbjct: 45 TPEKPNVLLIVMDDLGTGQLDFVLDSLDVNELSKRPAPSRYDGDINKMVEAARIAMPNVS 104 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 + G++ TN +VAH V GPSRA I TGR+PA FG YSN DA GIP LP LFQ Sbjct: 105 EMAAGGIKMTNAFVAHPVCGPSRAGIFTGRSPASFGTYSNDDAMLGIPEDIKLLPALFQE 164 Query: 174 HGYYTAAVGKWHLSKISNVP-VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY TA++GKWH +K+ P + EDKQTRDYHDN + + P RGFDY ++A+G Sbjct: 165 DGYATASIGKWHNAKVIKKPKIAEDKQTRDYHDNMISTPEPGFAPHERGFDYAYSYYASG 224 Query: 233 TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP 292 A +NSP++++N E VPA GYI+ LTDE I +D K D+PF + L+Y+ PH+P + Sbjct: 225 AALWNSPAIWRNGENVPAPGYITHLLTDETIKFIDGHK--DKPFFINLSYSVPHIPLEEA 282 Query: 293 APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 +P +Y +FNTG+ AD Y+A++ + D+G+ +I+ LK+NG+ +NT+I F SDNGAV + Sbjct: 283 SPAKYMDKFNTGNVEADKYFAALNAADEGIGKIITTLKENGELENTLIFFISDNGAVHES 342 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIP 411 P+P+N KG+K Q + GG P +W G + G D ++SA+D PTAL +A I+IP Sbjct: 343 PMPMNAMDKGFKGQMFNGGVSVPFVAYWPGHIPAGKQSDAMVSAIDILPTALQSAGITIP 402 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 LK++G +++P LQ K Q PH+ L W + + EEN FW YH+++ +Q + P Sbjct: 403 DSLKVEGKNIMPLLQGKTQKSPHQYLYWTGPGTKHYSEENQDFWHGYHEWITYQRKEAPK 462 Query: 472 NPNTEDLSQFSYTVRNNDYSL-VYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEMQGV 529 NPN E LS+ S+ VR+ +++L Y NQ L+ D + +LA+ P+ VK+++ Sbjct: 463 NPNLEKLSKGSWAVRDGEWALYFYDDGTNQPKLFNDKQDPSESIDLASKYPEKVKQLKSA 522 Query: 530 VREFIDSSQPPL 541 +++ P+ Sbjct: 523 YYQWVKDQPKPV 534 >UniRef50_C5BEH4 Sulfatase, putative n=37 Tax=Gammaproteobacteria RepID=C5BEH4_EDWI9 Length = 539 Score = 407 bits (1047), Expect = e-112, Method: Compositional matrix adjust. Identities = 207/506 (40%), Positives = 319/506 (63%), Gaps = 7/506 (1%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 T +PN++++ MDDLG GQL F + D K + R V + Y+ +DK I+AA+++ P + Sbjct: 34 TDSRPNVLLVIMDDLGTGQLDFALDALDTKALGKRPVAERYQGDLDKMIDAARRAMPNVA 93 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 L ++GV+ TN +VAH V GPSRA I TGR PA FG YSN DA G+PL T LP LFQ Sbjct: 94 QLANQGVKMTNAFVAHPVCGPSRAGIFTGRYPASFGTYSNDDAMLGVPLDITLLPALFQE 153 Query: 174 HGYYTAAVGKWHLSKISNVP-VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 +GY TA +GKWH ++I V + QTRDYHDN + S + P++RGFDY ++A+G Sbjct: 154 NGYATANIGKWHNARIDKKNFVDKADQTRDYHDNMISVSEPGYGPESRGFDYSYSYYASG 213 Query: 233 TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP 292 A +NSP++++N + V A GY++ LT+E + +D + +PF + LAY+ PH+P + Sbjct: 214 AALWNSPAIWQNGKNVAAPGYLTHNLTNETLKFLDDHQ--GKPFFISLAYSVPHIPLEQA 271 Query: 293 APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 +P +Y +F+TG+ AD Y+A+V + D+G+ +I+E+LK G+ DNT+I F SDNGAV + Sbjct: 272 SPARYMDKFHTGNAEADKYFAAVNAADEGIGQIIERLKALGELDNTLIFFISDNGAVHES 331 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIP 411 P+PLNG +G+K Q + GG H P +W + G ++SA+D PTAL AA I+IP Sbjct: 332 PMPLNGMDRGFKGQMFNGGVHVPFVAYWPKHIPAGTQSNVMVSAIDILPTALKAAGITIP 391 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 +K+DG +LP L K Q PH+ L W + + EEN PFW +Y K++ +++ P Sbjct: 392 DAMKVDGRDILPQLSGKAQTSPHRYLFWAGPGAKHYSEENQPFWFDYWKWITYEAPMPPK 451 Query: 472 NPNTEDLSQFSYTVRNNDYSL-VYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEMQGV 529 NPN E LS S+ VR+ +++L Y +N++ L+ D + +LAA PQ V EM+ Sbjct: 452 NPNLEKLSPSSWAVRDGEWTLYFYDDGSNRVQLFNDRLDPAESQDLAAKYPQRVAEMKAA 511 Query: 530 VREFIDSSQPPLSEVNQEKFNNIKKA 555 ++I + P++ Q++++ ++++ Sbjct: 512 YHDWIKTKPKPVA-WGQDRYHILEQS 536 >UniRef50_B1KD78 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD78_SHEWM Length = 483 Score = 281 bits (719), Expect = 4e-74, Method: Compositional matrix adjust. Identities = 184/494 (37%), Positives = 282/494 (57%), Gaps = 52/494 (10%) Query: 58 PNIIVLTMDDLGYGQLPFD-----KGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 PN++++ DD+G+G + + S++P+ ++ D+ + + A A+K+TPTL Sbjct: 17 PNVVIVLADDMGFGHVAMNLDLATADSYNPQNLKR----DSQRHKPELARSYAKKATPTL 72 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-GIPLTETFLPELF 171 L +EGVRFTN YV + GPSRAA+MTGR P RFG+Y+N D + G+P+ E L F Sbjct: 73 TQLANEGVRFTNAYVPSPLCGPSRAALMTGRYPQRFGIYNNADVKAAGLPVEENVLANNF 132 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T AVGKWHL+K + +++ + P +RGFD+F GF + Sbjct: 133 RKAGYRTGAVGKWHLTK---------------GEKKASYTLAQ-HPLDRGFDFFFGFDRS 176 Query: 232 GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN 291 GT YY+S L NR+ V A+GY++DQLT+ AI +++ K+ +PF LY+AYNA H P + Sbjct: 177 GTPYYDSKILELNRKPVKAEGYLTDQLTNHAIDFINQDKS--KPFFLYMAYNAVHGPLNK 234 Query: 292 PAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID 351 AP +YQ FN+G + D +Y+ +Y++DQGV +I++QL NGQ DNTII+F SDNGA Sbjct: 235 AAPKEYQAPFNSGDRYLDYFYSYLYALDQGVAKIIKQLDSNGQLDNTIIMFLSDNGAPGG 294 Query: 352 GPLPL--NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY--DKLISAMDFYPTALDAAD 407 P PL N GYK Q + GGT P+ +W L G D +IS+MD PTAL AA Sbjct: 295 KPFPLPANAPFTGYKGQVWQGGTRVPVVIWGPKALVNGGRVDDAVISSMDLIPTALAAAG 354 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 + + + LDG +LLP L K+ E + L W + SH + F+R D Sbjct: 355 VDLSDN--LDGNNLLPKL--KRVEEDERQLFWASQLSH------------HWGFIR---D 395 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 + + ++ ++ VR+ ++ L Y ++ + L+ + TD + ++A +PQVVK++ Sbjct: 396 AKGKKIDDKSTAEPAWAVRSGEWMLRYWADSKKTELFNVSTDHAEHHDIANKHPQVVKQL 455 Query: 527 QGVVREFIDSSQPP 540 + + D+ P Sbjct: 456 TADYKVWFDTLAKP 469 >UniRef50_UPI0001968551 hypothetical protein BACCELL_00117 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968551 Length = 380 Score = 216 bits (551), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 129/391 (32%), Positives = 210/391 (53%), Gaps = 43/391 (10%) Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 +P++ +++GY A GK+H K D T S P +RGFDY+ Sbjct: 1 MPQVMKDYGYANGAFGKYHNGKGM--------------DEIHTCSPGH-HPLDRGFDYWF 45 Query: 227 GFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 GF++ GT+YYNSP LF+NRE + Y +D+ T+EA+ + + P ++YLAYNA H Sbjct: 46 GFNSHGTSYYNSPILFRNRENIACAEYTTDKFTEEAVQFIRGNE--GSPKLIYLAYNALH 103 Query: 287 LPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 P PAPD+Y +F S+ + Y A ++D GV ++++L+K G+ DNT+++F SDN Sbjct: 104 GPLGAPAPDKYMSRFQYKSKLLNTYAAYTAAIDDGVAAVMKELEKIGEIDNTMLVFISDN 163 Query: 347 GAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTAL 403 GA LP NG G+K Q Y GG PMF+W+ K++ G ++++SAMD +PT Sbjct: 164 GAPGGAAAVLPKNGPFSGFKGQAYEGGIRVPMFIWYGDKIKQGYVCNEMVSAMDIFPTFF 223 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 D A IS+PK +DG SL+P L H+ L W++ + EN W Y Sbjct: 224 DEAGISLPKQQAVDGKSLMPLLHGTSTKAVHEYLVWMSQQA-----EN---WGMY----- 270 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQV 522 + + + ++ ++ VR D+ L Y +E + L+ L D ++ +NL A P++ Sbjct: 271 --------SLSDQQTAEAAFMVRKGDFMLRYIMEEDTYYLHNLKEDRKEGENLIAQYPEL 322 Query: 523 VKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 +EM+ + +++ PP+ Q+ + N++ Sbjct: 323 TQEMKTIFKDWFQQMIPPIG-WRQQLWQNVQ 352 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 206 bits (523), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 153/455 (33%), Positives = 209/455 (45%), Gaps = 91/455 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+ + DDLGY + F+ GS D KT P L L Sbjct: 39 KPNILFVLADDLGYNDVGFN-GSTDIKT-------------------------PNLDGLA 72 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPELFQN 173 G+ F YVAH GPSRAAIMTGR P + G N ++ G+ E F+ + ++ Sbjct: 73 KNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMKS 132 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY+T A+GKWHL + A E+ P GFD F GF G Sbjct: 133 AGYFTGAMGKWHLGE-----------------------ASEYHPNKHGFDEFYGFLGGGH 169 Query: 234 AYYNSPSLFK-----------------------NRERVPAKGYISDQLTDEAIGVVDRAK 270 Y+ P F+ N + V YI+D L+ EA+ VD+A Sbjct: 170 NYF--PEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVRETEYITDGLSREAVNFVDKAA 227 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN----YYASVYSVDQGVKRIL 326 +PF LYLAYNAPH+P A ++ F SQ D Y VY+VD+GV RI+ Sbjct: 228 AKKKPFFLYLAYNAPHVPLQ--AKEEDMAMF---SQIKDKKRRTYAGMVYAVDRGVGRIV 282 Query: 327 EQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 EQLKKNGQ+DNT+I+FTSDNG + G N K K GG TPM + W ++ Sbjct: 283 EQLKKNGQFDNTVIVFTSDNGGKL-GQGANNYPLKEGKGSVQEGGFRTPMLVHWPKHMKA 341 Query: 387 GN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 G+ + + A+D YPT +P+D KLDG + W + PHK+ +I H Sbjct: 342 GSRFSHPVLALDLYPTFAGLGGAVLPEDKKLDGKDI--WADIQANTAPHKD-EFIYVLRH 398 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 + N K V++ +DD+ +D+S+ Sbjct: 399 RNGYSDAAARRNQFKAVKNHNDDWKLYNIAQDISE 433 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 201 bits (511), Expect = 6e-50, Method: Compositional matrix adjust. Identities = 154/529 (29%), Positives = 239/529 (45%), Gaps = 110/529 (20%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 K +PNI+++ DDLGY + F+ + TP L Sbjct: 40 GAKDRPNIVLIVADDLGYSDVGFNG--------------------------CKEIPTPHL 73 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD------AQD--GIPLTE 164 L GV FTNGY +H PSRA ++TGR RFG SN + +D G+PL+E Sbjct: 74 DELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMPLSE 133 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 T L + + GY T A+GKWHL A+ + P RGFD Sbjct: 134 TTLADALKEAGYVTGAIGKWHLG-----------------------DAKPFWPNRRGFDE 170 Query: 225 FMGFHAAGTAYYN-----SPSLFKNRERVPAK----GYISDQLTDEAIGVVDRAKTLDQP 275 + GF G +Y+ P L +R P +++D + EA+ + R +T +P Sbjct: 171 WFGFSGGGFSYWGDLGMKDPLLGVHRGDEPVDPKTLTHLTDDFSTEAVKFIQRHET--EP 228 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 F LYLAYNAPH P D+ QK + Y A V +D+G+ R+++Q++++G Sbjct: 229 FFLYLAYNAPHAP-DHATRAHLQKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLG 287 Query: 336 DNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-IS 394 +NT+I+F SDNG + +N +G+K + GG P + W G ++ G ++ I+ Sbjct: 288 ENTMIIFYSDNGGRRE--HAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPIT 345 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 A+D +PTAL AA + ++ KLDG +LLP L D KQ P + L W S Sbjct: 346 ALDLFPTALAAAGMDPSQNDKLDGQNLLPVLTDDKQRLPERPLFWRYS------------ 393 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKD 513 DD + Y VR+ ++ L+ + ++ L+ L D +++ Sbjct: 394 ----------MGDD-----------SYGYAVRDGNWKLIDSRYKDRKLLFDLANDPWERE 432 Query: 514 NLAAANPQVVKEMQGVVREFIDSSQPP----LSEVNQEKFNNIKKALSE 558 +LAA +P+ V + ++ + + PP VN K N + E Sbjct: 433 DLAAQHPEQVARLSRMMEAWDARNVPPKWSDAHGVNVRKEENTRNEAVE 481 >UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKY2_PLALI Length = 483 Score = 199 bits (506), Expect = 3e-49, Method: Compositional matrix adjust. Identities = 143/491 (29%), Positives = 222/491 (45%), Gaps = 86/491 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+++ DD+GY + F P TP L +L Sbjct: 32 RPNILLIVGDDMGYADVGFHGCKDIP--------------------------TPNLDALA 65 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG-VYSNTDAQDGIPLTETFLPELFQNHG 175 GV+FT+GYV P+RA ++TGR RFG ++ + A G+PLTE + + + G Sbjct: 66 KSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNPSGANTGLPLTEVTIADRLKQVG 125 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T VGKWHL S PQ RGF+ F+GF ++ Sbjct: 126 YTTGLVGKWHLG-----------------------SQPAMHPQERGFEEFIGFLGGAHSF 162 Query: 236 YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD 295 +++ + + E V Y +D EA+ +++ + D+P+ LYL++NA H P + D Sbjct: 163 FDAQGILRGHEPVKTIDYTTDLFGREAVSFIEKHR--DKPWFLYLSFNAVHTPM-HATED 219 Query: 296 QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP 355 + K + Q Y A + ++D+ + ++L QL+ GQ T+++F SDNG + Sbjct: 220 RMAKLASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVT 279 Query: 356 LNGA----QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIP 411 +NG+ +G K T GG P + W GK+ P +D + +D TAL A + Sbjct: 280 INGSINTPLRGSKRTTLEGGIRVPFVVSWPGKIAPAVFDSPVIQLDLTATALAVAGVE-- 337 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 KD+K DGV+LLP+LQ K+ PH L W F E+ +Y K VR+ S+ Sbjct: 338 KDVKSDGVNLLPYLQGKQSEVPHAALFW------RFGEQMAVRAGDY-KLVRYDSN---- 386 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVV 530 + T+ V LY L DL + +LAA+ P+ V E+Q Sbjct: 387 ----------ADTLTGKGKQPVTAAR-----LYDLKEDLGETRDLAASMPEKVAELQAQW 431 Query: 531 REFIDSSQPPL 541 + + PPL Sbjct: 432 DRWNQQNMPPL 442 >UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7AKS6_9PORP Length = 464 Score = 194 bits (494), Expect = 6e-48, Method: Compositional matrix adjust. Identities = 129/399 (32%), Positives = 185/399 (46%), Gaps = 76/399 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI++L DD GY F + A TP + L Sbjct: 33 RPNILILLADDAGYADFGF--------------------------MGATDIQTPNIDRLA 66 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-QDGIPLTETFLPELFQNHG 175 EG FT+ +VA VS PSR+ ++TGR R+G N D DG+P E LP L + + Sbjct: 67 AEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECNLDKPGDGLPDDEELLPALLKRYD 126 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T +GKWHL S +P +GFD F G A +Y Sbjct: 127 YRTGCIGKWHLG-----------------------SEPSQRPNAKGFDTFYGLLAGHRSY 163 Query: 236 YNSPSLFK----------NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 + P N ++ GY +D+L +A V + +QPFMLY+++ AP Sbjct: 164 FYDPETSDKDGNLQQYQYNGRKLSFDGYFTDELASKAQQFVTES---EQPFMLYMSFTAP 220 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 H PN+ D + + Q Y A +Y++D+GV +I+++LK G++DNTII F SD Sbjct: 221 HSPNEATEEDLARFE----GQPRQKYAAMMYALDRGVGKIVDELKAAGKFDNTIIFFLSD 276 Query: 346 NGAVI---DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPT 401 NG LPL KG+K + GG P F+ W + + + L S++D + T Sbjct: 277 NGGSTTNQSSNLPL----KGFKGNKFEGGQRVPFFVVWGDRFKRDQRFTGLTSSLDIFAT 332 Query: 402 ALDAADISIPKDLK-LDGVSLLPWLQDKKQGEPHKNLTW 439 +DA DI K +DGVSLLP+L +K G PH+ L W Sbjct: 333 VVDALDIPEEGLHKPIDGVSLLPYLSGEKSGNPHEALFW 371 >UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QWC8_9PLAN Length = 468 Score = 192 bits (487), Expect = 3e-47, Method: Compositional matrix adjust. Identities = 145/484 (29%), Positives = 220/484 (45%), Gaps = 107/484 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+V+ DD+GY + +G+ + TP L +L Sbjct: 29 RPNIVVIVGDDMGY-----------------------HDLGVHGCKDI---PTPHLDALA 62 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----AQDGIPLTETFLPELFQ 172 GVR T+GYV+ P+RA ++TGR RFG N + G+PL+ET L + + Sbjct: 63 TSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNPGPTPTGEIGLPLSETTLADRLK 122 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY T VGKWHL + E+ P +RGFD F GF Sbjct: 123 KVGYKTGMVGKWHLG-----------------------NDEKRHPLSRGFDEFFGFLGGA 159 Query: 233 TAYYNSPS-------LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 Y+ +P L + RE V K Y++D EA+ +DR+K PF LYL +NA Sbjct: 160 RTYFATPGNASAGTKLLRGREVVDEKEYLTDAFAREAVAYIDRSKA--SPFFLYLTFNAV 217 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYDNTIILFTS 344 H P + A +Y +F S Y ++ S +D V +++ +L++ +NT+I F S Sbjct: 218 HTPME--ASQKYLDRFTAVSDPKRQKYCAMMSAMDDAVGQVVAKLEREKLLENTLIFFVS 275 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTAL 403 DNG N +G+K+ T+ GG P F+ WKGK+ G YD+ + +DF PTAL Sbjct: 276 DNGGPTAANTGDNTPLRGFKATTWEGGIRVPYFVSWKGKIPAGKTYDQPVIQIDFVPTAL 335 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 AA + K DGV+LLP+L + + PH +L W +F Sbjct: 336 AAAGAPAAE--KTDGVNLLPYLTFENKEAPHASLFW--------------------RF-- 371 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQV 522 P T +R+ +Y LV T + ++ LY L D+ + +L+A P++ Sbjct: 372 --------GPQT--------AIRHGNYKLVMTRDLDKPALYDLAADISETKDLSADKPEI 415 Query: 523 VKEM 526 V ++ Sbjct: 416 VAQL 419 >UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP3_9BACT Length = 465 Score = 191 bits (485), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 137/441 (31%), Positives = 204/441 (46%), Gaps = 81/441 (18%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 ++ KPNIIV+ DDLGYG + + G+ T TP + Sbjct: 18 ASAAKPNIIVILADDLGYGDVSY-HGTLKETT------------------------TPHI 52 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-------DAQDGIPLTET 165 S+ G F NGY A V GPSRA +++GR RFG Y N D + G+PL++ Sbjct: 53 DSIAQSGAWFQNGYSAAPVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQK 112 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 +PE+ GY T VGKWH D H ++ P NRGF F Sbjct: 113 LIPEILVKEGYATGMVGKWH--------------DGDQH---------KFWPYNRGFQEF 149 Query: 226 MGFHAAGTAYY----------NSPSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQ 274 GF+ + ++ + +RV G Y+++ EA+ +DR KT + Sbjct: 150 YGFNNGAINNWVLKGENHTVDEWGAVHRENKRVENSGEYMTEAFGREAVEFIDRHKT--E 207 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 PF LYL++NA H P AP Y QF + + A + S+D + +LE+L+K G Sbjct: 208 PFFLYLSFNAVHGPLQ--APKSYTNQFKHIKPENRALCLAMLKSMDDNIGLVLEKLRKEG 265 Query: 334 QYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL--QPGNYDK 391 +NTII FTSDNG + G NG +G K+ + GG H P + WK ++ Q + Sbjct: 266 LEENTIIFFTSDNGGKLKGNYSFNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTKALEA 325 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + ++D T AA + I + KLDG +LLP+L+++ + +NL +W + N Sbjct: 326 PVHSIDLAHTIFAAAGVEIKDEYKLDGRNLLPYLKNQSDFDD-RNL-------YWANNAN 377 Query: 452 IPFWDNYHKFVRHQSDDYPHN 472 I DN K+++ Y N Sbjct: 378 IAIRDNKWKYLKQAGKTYLFN 398 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 191 bits (485), Expect = 7e-47, Method: Compositional matrix adjust. Identities = 142/503 (28%), Positives = 216/503 (42%), Gaps = 115/503 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+IV+ DDLGY + F+ + P TP + S+ Sbjct: 20 RPNLIVIMADDLGYNDVGFNGCTEIP--------------------------TPGIDSIA 53 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN-----TDAQDGIPLTETFLPELF 171 GV+FTNGY ++ V GPSRA +TGR RFG N TD +P +E + E Sbjct: 54 QNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESL 113 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 GY+ +GKWHL + +P RGFD F G Sbjct: 114 TQVGYHCGIIGKWHLG-----------------------AEPSLRPNKRGFDEFFGHLGG 150 Query: 232 GTAYYNSPSLFKNRERV----------------PAK--GYISDQLTDEAIGVVDRAKTLD 273 G + + ++ E V P K Y++++ +DEA+ + R Sbjct: 151 GHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFIKRNH--Q 208 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 +PF L+L+YNAPHLP A ++Y +F + Y A V +VD GV ++++ LK+ Sbjct: 209 KPFFLFLSYNAPHLPLQ--ATEKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSLKET 266 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDK 391 DNTI+ F SDNG N KG KS + GG P M + +Q YD Sbjct: 267 NIADNTIVFFLSDNGGPSHKNKSDNFPLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDH 326 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 +S++D + T A D LDGV+L+P++ +K PH + Sbjct: 327 PVSSLDIFATIASLAQSPTHADKPLDGVNLIPFITGEKTQAPHAQI-------------- 372 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQ 510 F+R Q Y VR D+ LV ++ LY L+ D+ Sbjct: 373 ---------FIR-------------KFDQSRYVVRQGDFKLVIPYKDAPPQLYNLSKDIG 410 Query: 511 QKDNLAAANPQVVKEMQGVVREF 533 +++N+AA +P+ VKE++ V +++ Sbjct: 411 EENNIAAVHPERVKELEKVRKQW 433 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 189 bits (481), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 149/492 (30%), Positives = 214/492 (43%), Gaps = 105/492 (21%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 PT+ T KPN+I+ DDLG+G+ GI Q Sbjct: 34 PTQ--TSRKPNVIIFYADDLGWGE-----------------------TGIQGN---PQIP 65 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ-DGIPLTETFL 167 TP + S+ GVR T G+VA PSRA ++TGR P RFG N A G+ L ET L Sbjct: 66 TPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRYPTRFGHEFNRIANVSGLDLQETTL 125 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 + GY TA VGKWHL E++P RGFD F G Sbjct: 126 ADRLHGLGYKTACVGKWHLG-----------------------DGPEYRPTKRGFDEFFG 162 Query: 228 FHAAGTAYYNSPSLFKNR------ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A T +++ +R E Y +D+ ++ + + + P+ LYL Sbjct: 163 -TLANTPFFHPTKFVDSRVSNDVAEVSDENFYTTDEYAKRSVEWIGQQQ--QSPWFLYLP 219 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYDNTII 340 +NA H P AP +Y +F + + +A++ S +D + ++L ++++ GQ +NT++ Sbjct: 220 FNAQHAPLQ--APQKYLDRFESIADPKRKLFAAMMSAMDDAIGQVLGKVRELGQEENTLV 277 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFY 399 F SDNG G NG +G+K T+ GGT P + WKGKL G YD + +D Sbjct: 278 FFISDNGGPTQGTTSQNGPLRGFKMTTFEGGTRVPFLVQWKGKLPAGKTYDNPVINLDVL 337 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 PT L AA I KLDGV L+P+ +PH+ L W F E+ Sbjct: 338 PTVLTAAGSKIDPAWKLDGVDLVPYFTSSIANKPHETLYWR------FGEQ--------- 382 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV-ENNQLGLYKL-TDLQQKDNLAA 517 + VR D+ LV + Q LY L +D+ + NLA+ Sbjct: 383 -----------------------WAVRQGDWKLVVARGGSGQPELYDLASDIAESKNLAS 419 Query: 518 ANPQVVKEMQGV 529 NP VKE+Q + Sbjct: 420 ENPAKVKELQAL 431 >UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP2_9BACT Length = 446 Score = 189 bits (480), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 156/518 (30%), Positives = 228/518 (44%), Gaps = 121/518 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ DD+G+G + + +E AQ TP + ++ Sbjct: 19 KPNIVLVFADDMGWGDVAY------------------------HGVEDAQ--TPAIDAIA 52 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 GV F GY A V GPSRA I+TGR FGV +N DA GIP ++ + EL + GY Sbjct: 53 KGGVWFEQGYAAASVCGPSRAGILTGRYQQLFGVVTNGDADKGIPKSQKNIAELLKPAGY 112 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 + A GKWHL S + P +RGFD F GFH YY Sbjct: 113 KSGAFGKWHLG-----------------------SKKGQFPNDRGFDTFYGFHFGAHDYY 149 Query: 237 NS-----------PSLFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 + ++ N++ V K Y+++++TD A+ ++ K DQPF +Y+AYN Sbjct: 150 RADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFIEENK--DQPFFMYVAYN 207 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADN-YYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 + H P PD+Y + + A V ++D GV RI +LK+ +NTI +F Sbjct: 208 SVHSPWQ--VPDEYLARIPESVPAYRRLFLAMVLAMDDGVGRIRAKLKELNLDENTIFVF 265 Query: 343 TSDNGAVIDGPLPLNGAQ---------KGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKL 392 T+DNG+ G N Q +GYK TY GG P M W K++ GN ++ Sbjct: 266 TTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGGIRVPFCMSWPKKIKSGNKFEAP 325 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 + A D PT L AA + + G LLP+L+D+++G PH+ L W Sbjct: 326 VIAYDLAPTFLSAASLEYSTK-QFSGKDLLPYLEDEQKGRPHETLFW------------- 371 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLTDLQQK 512 RH D Y VR+ D+ L Y N+Q G K D +K Sbjct: 372 ---------HRHSGLD-------------DYAVRHGDWKLTY---NDQEGTSK--DFLKK 404 Query: 513 DNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 +L N +K+ ++ DS L ++ Q FN Sbjct: 405 VHLKLFN---LKQDPYEKKDLADSMPEKLQQLKQLYFN 439 >UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellula baltica RepID=Q7UGB8_RHOBA Length = 656 Score = 186 bits (471), Expect = 3e-45, Method: Compositional matrix adjust. Identities = 137/505 (27%), Positives = 231/505 (45%), Gaps = 101/505 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++++ DD G+G L + +PK STPTL +L Sbjct: 101 RPNVLLILTDDQGWGDLAAHR---NPKI-----------------------STPTLDALA 134 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 +E R YV+ V P+RAA++TGR P R GV T ++ + ET L EL+++ GY Sbjct: 135 NESARLDRFYVSP-VCAPTRAALLTGRYPERSGVAGVTGRREVMRAEETTLAELYRSAGY 193 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 T GKWH + +P+ P +GF+ F GF Y Sbjct: 194 ATGCFGKWHNG--AQMPL---------------------HPNGQGFNEFFGFCGGHFNLY 230 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 + L +N V KGYI+D LTD A+ + D+PF Y+ +NAPH P Q Sbjct: 231 DDALLERNGTPVQTKGYITDVLTDAAVEFIQNHH--DRPFFCYVPFNAPH------GPFQ 282 Query: 297 YQK----QFNTGS--QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 ++ ++N GS + YA V ++D V R+L+ L + + TI++F +DNG Sbjct: 283 VRRDLFDRYNDGSIDEKTAAVYAMVQNIDTNVSRLLKCLSDHSLDEETIVVFLTDNGP-- 340 Query: 351 DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISI 410 +G NG +G K + GG P F+ W G +QP + ++ + +D PT + DI + Sbjct: 341 NGKR-FNGGMRGTKGSVHEGGCRVPCFIRWTGNIQPQSISQVAAHIDLLPTLMQWCDIPL 399 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 P + LDG SL+ ++D +P I +Y Sbjct: 400 PTKVPLDGRSLVELIRDG--ADPTLADRSILTY--------------------------- 430 Query: 471 HNPNTEDLSQFS-YTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQG 528 PN L +F VR N + L T+E ++ L+ + TD Q ++A+++P++ K+++ Sbjct: 431 -RPNPMQLQKFGKAAVRTNTHRL--TIEKSKASLFDMTTDAGQTTDIASSHPELTKQLRS 487 Query: 529 VVREFIDSSQPPLSEVNQEKFNNIK 553 +++++ P ++ + ++++ Sbjct: 488 QIQKYVQEITPSITAIRPVPIDSMR 512 >UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V8P8_BACUN Length = 525 Score = 184 bits (468), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 135/446 (30%), Positives = 203/446 (45%), Gaps = 90/446 (20%) Query: 35 LKATKTNVAFSDFTPTEYSTKG--------KPNIIVLTMDDLGYGQLPFDKGSFDPKTME 86 +K + T V+ + P S G +PNI+++ DD+G+G Sbjct: 1 MKVSCTLVSVAALLPFSGSNAGNVQRDKSQRPNIVLVIADDMGWGD-------------- 46 Query: 87 NREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPA 146 +G A++ STP + +L GV+F+ GYV+ +SGPSRA I+TG Sbjct: 47 ---------VGYQGAVDV---STPNIDALARRGVQFSQGYVSCSISGPSRAGILTGVYQQ 94 Query: 147 RFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 RFG Y+N IP ++ L E+ ++ GY T VGKWH++ PE R D Sbjct: 95 RFGFYNNLHPWAKIPEGQSTLGEMVRDCGYATGFVGKWHMAD-----SPEQSPNRRGFDQ 149 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP----AKGYISDQLTDEA 262 F F ++ DY+ G Y+ L++N E P + YI+D T EA Sbjct: 150 FYGFWSDT-------HDYYRSTDKPGVELYDFCPLYRNGEIQPPLHESGEYITDCFTREA 202 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS---QTADNYYASVYSVD 319 + +D K PF+L L+YNA H P P+ Y + + + A V ++D Sbjct: 203 VEFID--KHASSPFLLCLSYNAVHSPWQ--VPEHYVNRLEGRRFHHEDRKVFAAMVLALD 258 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGA--------------------VIDGPLPLNGA 359 G+ R++E L+KNG +NT+ + SDNG+ + P P Sbjct: 259 DGIGRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPF--- 315 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDG 418 +GYK+ TY GG P M W +L G YD + ++D +PT + A + + LDG Sbjct: 316 -RGYKADTYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTVMQAVGGTSRQKYSLDG 374 Query: 419 VSLLPWLQ-----DKKQGEPHKNLTW 439 VSLLP+L+ DK+ PH L W Sbjct: 375 VSLLPYLKSEWPIDKR---PHSTLYW 397 >UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF13_PLALI Length = 461 Score = 180 bits (457), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 147/527 (27%), Positives = 238/527 (45%), Gaps = 128/527 (24%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + +A + TE +++ +PNI+++ DD G+ + P+ YK Sbjct: 15 SQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQG---HPR----------YK---- 57 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG-------VYS 152 TP + S+ GV F GYV+ V PSRA ++ GR RFG YS Sbjct: 58 ---------TPHIDSIGKNGVHFRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYS 108 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 T+ G+P +ET LP+L + GY T A+GKWHL A Sbjct: 109 ETN---GLPRSETLLPQLLKEDGYRTIALGKWHLG-----------------------YA 142 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYY--NSPS----LFKNRERVPAK--GYISDQLTDEAIG 264 ++ P RGF + GF +Y+ P+ + ++R +P + GY++D L DEAI Sbjct: 143 PQFHPMERGFTDYYGFLQGSRSYFPLKKPTRLNQMLRDRTAIPEEQFGYMTDHLADEAIA 202 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN--YYASVYSVDQGV 322 + + ++ QP+M+YLA+NA H PND A D Q AD YA ++D+ V Sbjct: 203 YIKQWQS--QPWMMYLAFNATHSPNDATAVDL---------QAADGNKIYAMTIALDRAV 251 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 ++L+ LK+ G +T+++F +DNG NG+ G K T+ GGT P + + Sbjct: 252 GKVLDALKECGLSKDTLVIFINDNGGAGGHD---NGSLHGKKGSTWEGGTRIPFLVQYPA 308 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADIS------IPKD-LKLDGVSLLPWLQDKKQGEPH 434 K+ G D+ + A+D +PT LD A + IP D KLDG+SL+P + K Q Sbjct: 309 KIPSGQVIDEPVIALDLFPTILDVAGLGDAELKKIPFDPEKLDGISLIPRMTGKTQRLVD 368 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + L W + + N+ V ND Sbjct: 369 RPLYWKSGKRWAIRQGNLK------------------------------AVSGND----- 393 Query: 495 TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + +Q+ L+ L +D ++ NLAA +P +++++ + R++ + + P Sbjct: 394 -DQGDQVELFDLSSDPDEQRNLAATHPDELQQLEALYRKWESTLEKP 439 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 179 bits (455), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 156/562 (27%), Positives = 242/562 (43%), Gaps = 137/562 (24%) Query: 13 SISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQ 72 +I+L LA + AF + A L A N + S G+PNI+V+ DDLGY Sbjct: 8 AIALWLA--LVAFCSQAL----LAAEDVN---------QTSKSGRPNILVIMADDLGYAD 52 Query: 73 LPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVS 132 L G P TP L L G+R TN YV+ Sbjct: 53 LGVQGGCEIP--------------------------TPHLDQLAASGIRCTNAYVSAPYC 86 Query: 133 GPSRAAIMTGRAPARFG----VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSK 188 PSRA +TG+ RFG + +A+ G+PL E + L Q GY TA +GKWH Sbjct: 87 SPSRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWH--- 143 Query: 189 ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY------------- 235 ++D+H PQ+RGFD F GF G Y Sbjct: 144 --------QGFSKDHH------------PQSRGFDEFFGFLVGGHNYLLHKEVKARFGTA 183 Query: 236 YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD 295 ++ +++ RE P +GY +D T+EA+ + ++P+ LYL+YNA H P + Sbjct: 184 HSHDMIYRGREVEPQEGYATDLFTNEALRWMSGPP--NKPWFLYLSYNAVHTPLEIAPHL 241 Query: 296 QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL- 354 Q + + Y + + +D + RI + L ++G + T+I+F SDNG P+ Sbjct: 242 QKRIPESVKLPARRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPIL 301 Query: 355 ----PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTA--LDAAD 407 LN +G K QT GG P F+ W G+L Y++ I ++D PT L A + Sbjct: 302 AYNSGLNHPLRGDKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLAANN 361 Query: 408 ISIPKDL--KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQ 465 + P+ L +DGV+L+P+ ++ G PH++L W +F + Sbjct: 362 PAKPQPLPQGIDGVNLMPYWLGQRSGAPHESLFW--------------------RFGPQK 401 Query: 466 SDDYPHNPNTEDLSQFSYTVRNNDYSLV-----YTVENNQLGLYKL-TDLQQKDNLAAAN 519 + VR ++ LV +N+ LY L TD+ +K+NLA + Sbjct: 402 A------------------VRAGNWKLVDWRDFPASKNSGWELYDLSTDISEKNNLAETH 443 Query: 520 PQVVKEMQGVVREFIDSSQPPL 541 P++V ++ ++ S+ PL Sbjct: 444 PEIVARLKTSWEKWNQSNIEPL 465 >UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKD8_9BACT Length = 455 Score = 176 bits (446), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 141/514 (27%), Positives = 214/514 (41%), Gaps = 123/514 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII++ DDLGY L F + A TP + +L Sbjct: 21 KPNIILILADDLGYEDLGF--------------------------LGAPDIKTPHIDALA 54 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---------GIPLTETFL 167 G+ FT GY + V GPSRA ++TGR FG N GIPL E + Sbjct: 55 RSGMNFTQGYQSASVCGPSRAGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMI 114 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 +L + Y T +GKWH+ + E +P R DY+ G Sbjct: 115 FDLLKPAAYTTGVIGKWHMG-----------------------LSHEQRPTQRSVDYYYG 151 Query: 228 FHAAGTAYYNSP----------SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 F +Y + +F+N E VP GY ++ DE + + R K D+PF Sbjct: 152 FLNGAHSYREAKMDMKGAPMTWPIFRNNEPVPFSGYTTEVFNDEGVNFIKRNK--DKPFF 209 Query: 278 LYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 LY++YN+ H P + P Q+ + + Y A + S+D GV R+++ LK G Y+N Sbjct: 210 LYMSYNSVHGPWE-AQPKDLQRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYEN 268 Query: 338 TIILFTSDNGA--------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL-QPGN 388 T+++F SDNGA L NG+ +G K TY GG P M W + + Sbjct: 269 TLVIFMSDNGAPNNLHEAERAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPKQST 328 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 Y +S +D PT + + + P +L GV+L+P++ +K PHK L W Sbjct: 329 YQHPVSGLDIVPTLIHISQ-AAPAKKELSGVNLMPYITGEKTSRPHKTLYW--------- 378 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG--LYKL 506 + DD Y +R+ D+ L + N L+ L Sbjct: 379 ---------------RRDDD--------------YAIRDKDWKLTWNDYNGPRTPMLFNL 409 Query: 507 T-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 D +K+NL +P++ +++Q ++ DS P Sbjct: 410 KDDPNEKNNLIHKHPEIAQKLQAKFDQW-DSKLP 442 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 176 bits (445), Expect = 3e-42, Method: Compositional matrix adjust. Identities = 149/521 (28%), Positives = 211/521 (40%), Gaps = 118/521 (22%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 +D TPT+ +PNIIV+ DD GY F +GS D Sbjct: 31 ADDTPTK-----RPNIIVIVSDDQGYADASF-QGSKD----------------------- 61 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLT- 163 TP L +L GVR T GYV V PSRA +MTGR RFG ++N A+ +P+ Sbjct: 62 --ILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNNIVAEAALPIAH 119 Query: 164 ----ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 ET LP++ GYYTA VGKWHL + +P Sbjct: 120 LPSNETLLPQVLAKAGYYTAMVGKWHLGL-----------------------QDGCRPYE 156 Query: 220 RGFDYFMGFHAAGTAYY-NSPS-------LFKNR--------ERVPAKGYISDQLTDEAI 263 RGFD F G G Y+ N P +K R E VP GY++D +A+ Sbjct: 157 RGFDEFFGIITGGHDYFVNHPEERAVGDQSYKARIERNGPVGEAVP--GYLTDAFGADAV 214 Query: 264 GVVDRAKTL--DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQG 321 ++ + T DQP LYLA+NAPH P P S+ Y A + S+D Sbjct: 215 RIIRESHTKRPDQPLFLYLAFNAPHTPTQAPKDLVDTMPATLESKDRRTYAAQITSMDAS 274 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 V ++ LK+NG +T I+F SDNG + P N + +K Y GG P F + Sbjct: 275 VGKVRAALKENGMEKDTFIVFFSDNGGA-NHPYYDNTPLRDHKGSLYEGGIRVPFFAVYP 333 Query: 382 GKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 G + G+ +L ++++D + TA A LD V +LP L+ + H L W Sbjct: 334 GHIPAGSVCELPVTSLDVFATACALAGTKPETSHPLDSVDMLPVLEGNARQPTHATLFW- 392 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + F V + D LV + + Sbjct: 393 ------------------------------------EFPGFGAAVADRDLKLVVPKKGSP 416 Query: 501 LGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 D+ +K +LAA NP+ V + ++ E+ + PL Sbjct: 417 QLFDLAVDIGEKSDLAAQNPEKVARLSTLLSEWHAQNARPL 457 >UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica RepID=Q7UYW3_RHOBA Length = 520 Score = 174 bits (442), Expect = 6e-42, Method: Compositional matrix adjust. Identities = 130/421 (30%), Positives = 185/421 (43%), Gaps = 93/421 (22%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+V+ DD+GYG D G +T++ TP L L + Sbjct: 56 PNIVVILADDMGYG----DMGCMGSQTLQ----------------------TPNLDRLAE 89 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD----------GIPLTETFL 167 GV + YVA V PSRA ++T R P RFG N +A D G+P +E L Sbjct: 90 SGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 149 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 + GY TA +GKWHL E P RGFD+F G Sbjct: 150 ADHLGAAGYATALIGKWHLGM-----------------------GEMHHPNRRGFDHFCG 186 Query: 228 FHAAGTAYYNSPSLFK-----NRERVP--AKGYISDQLTDEAIGVVDRAKTL--DQPFML 278 Y+ P+ K N +RV + Y++D TDE + +D+ K+ DQP+ + Sbjct: 187 MLTGSHHYF--PATMKHVIERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFV 244 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 + +YNAPH P D + N +Q Y A +Y++D+GV RI E L++ GQ++NT Sbjct: 245 FFSYNAPHTPMHATEAD-LARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENT 303 Query: 339 IILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMD 397 +++F SDNG + NG +G K GG PM W K G YD ++S++D Sbjct: 304 LLVFFSDNGGATNNG-SWNGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLYDGVVSSLD 362 Query: 398 FYPTALDAA--------------DISIPKDLKL-----DGVSLLPWLQDKKQGEPHKNLT 438 PT AA D S K + DG+ + P L D + P++ L Sbjct: 363 LLPTFCSAAGAEPLALADPMSHEDASNRKRMNRLSGTHDGIDMAPHLADGSE-PPNRRLY 421 Query: 439 W 439 W Sbjct: 422 W 422 >UniRef50_Q02AN8 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02AN8_SOLUE Length = 443 Score = 174 bits (441), Expect = 8e-42, Method: Compositional matrix adjust. Identities = 126/403 (31%), Positives = 181/403 (44%), Gaps = 82/403 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++V+ +DDLG L + + AA TP + +L Sbjct: 26 RPNVLVVVLDDLGCHDLGY--------------------------LGAADLKTPHIDALA 59 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-QDGIPLTETFLPELFQNHG 175 G++F N Y V P+R+AI+TGR PA GV N A GIP L + + G Sbjct: 60 ARGLKFRNWYSNAPVCAPARSAILTGRFPASAGVPDNGPALAHGIPT----LASVLKGSG 115 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T GKWHL S +E P GFD F GFH+ Y Sbjct: 116 YQTGCFGKWHLG-----------------------STDETAPTGHGFDSFYGFHSGCVDY 152 Query: 236 Y--------NSPSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 Y N L+ NR + G Y+++++ DEA G + R ++PF+ Y+A+NAPH Sbjct: 153 YSHRFYWGDNYHDLWHNRTEIFEDGRYLTERIADEAAGFIGR----NRPFLGYVAFNAPH 208 Query: 287 LPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 P PA QY+ +F + Y A + +VD G+ +I L+ G +NT++ F DN Sbjct: 209 YPMHAPA--QYKARFPNLAPERQTYAAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDN 266 Query: 347 GAVIDGPLPL---------NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAM 396 GA + L NG KGYK + GG H P F+ W ++ G + D+L +M Sbjct: 267 GATTEKRAGLNGDFATAGDNGVFKGYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSM 326 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 D PT A +P ++DG LL + PHK+L W Sbjct: 327 DILPTICRATGAPLPP--RVDGSDLLNTIASNAP-SPHKSLYW 366 >UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH3_9BACT Length = 455 Score = 174 bits (440), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 139/500 (27%), Positives = 223/500 (44%), Gaps = 109/500 (21%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNIIV+ DD GY + S++P E+ + + STP +L Sbjct: 24 KPNIIVILSDDQGYADV-----SYNP---EHDDYI----------------STPHTDALA 59 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 GV F GY + V +R+ +MTGR R+G+Y+ + G L F+P + GY Sbjct: 60 KSGVIFHRGYTSGSVCSTTRSGLMTGRYQQRYGIYTAGEGGTGTDLNAKFIPNYLKEAGY 119 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT--- 233 + A GKWHL YH P +RGFD F GF G Sbjct: 120 KSMAFGKWHLG-----------HEMKYH------------PLHRGFDDFYGFMGRGAHDF 156 Query: 234 --------AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 + P +++ E + KGY++ ++T+E + ++ K D+PF Y+AYNA Sbjct: 157 FRLEKEYDGKFGGP-IYRGLEPIDDKGYLTTRITEETVKFIEENK--DKPFFAYVAYNAV 213 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 H P PA D + +G +T D A + +D GV I++ LKK+ Y+NTII++ SD Sbjct: 214 HTPAQAPAEDI---KAVSGDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSD 270 Query: 346 NGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALD 404 NG + N +G K Y GG P M W +++ G + + ++D PT LD Sbjct: 271 NGGA-KSMVANNKPLRGVKHDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISLDILPTLLD 329 Query: 405 AADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRH 464 AA +P +DG S+LP ++ K D + PF+ N+ Sbjct: 330 AA--GLPALSDIDGESMLPVIRGDK------------------DNLDRPFFWNH------ 363 Query: 465 QSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVV 523 ++ N++ LV+ + LYK++ D+ + NLAA++P+ V Sbjct: 364 --------------GDGQTGIQLNNWKLVFNKGVTE--LYKISDDIGESKNLAASHPEKV 407 Query: 524 KEMQGVVREFIDSSQPPLSE 543 + +Q + +++ P+S+ Sbjct: 408 QALQKIYDKWLSQMATPMSK 427 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 174 bits (440), Expect = 1e-41, Method: Compositional matrix adjust. Identities = 124/396 (31%), Positives = 179/396 (45%), Gaps = 68/396 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNII++ DD GY +D + N ++ TPT+ L Sbjct: 34 QPNIILVMTDDQGY---------WDTEISGNPKI-----------------KTPTIKKLA 67 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 EGV FT Y A+ V P+RA +MTGR R G+Y+ D + ET + ++ Q GY Sbjct: 68 AEGVTFTRFY-ANMVCAPTRAGLMTGRHYLRTGLYNTRFGGDTLGPNETTIAQVLQKAGY 126 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG-FHAAGTAY 235 T GKWHL + + ++QPQ RGFD+F G +H Y Sbjct: 127 KTGLFGKWHLGRYA-----------------------QYQPQRRGFDHFFGHYHGHIERY 163 Query: 236 YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP------- 288 N + N V +GY++D TD AI + R + QPF YLAYNAPH P Sbjct: 164 TNPDQVVVNGTPVETRGYVTDLFTDAAIDFIQRNQ--QQPFFCYLAYNAPHSPFLLDTSH 221 Query: 289 NDNPAPDQY-QKQFNTGSQTAD-NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 P D+ +K G + YA + +DQ + R+L+ + T+++FTSDN Sbjct: 222 FGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVVIFTSDN 281 Query: 347 GAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDA 405 G V G KG K+ Y GGT P + W G D +++ D +PT Sbjct: 282 GGVSRG---FKAGLKGSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLFPTFCQL 338 Query: 406 ADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL--TW 439 A + +P ++KLDG S+L ++ PH+ L TW Sbjct: 339 AGVPVPSNVKLDGESILSLMEQGGGKSPHQYLYHTW 374 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 173 bits (439), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 147/488 (30%), Positives = 220/488 (45%), Gaps = 93/488 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++++ DD G+G D S D +E TP L Sbjct: 30 RPNVLLIMTDDQGWG----DVRSHDNPLIE----------------------TPQQDLLA 63 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 +G RF YV+ V P+R++++TGR R GV+ T + + ET + E+F+ GY Sbjct: 64 SQGARFERFYVSP-VCAPTRSSLLTGRYSLRTGVHGVTRGFENMRAEETTIAEMFKAAGY 122 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 T A GKWH + + P+ P +GFD F GF Y Sbjct: 123 KTGAFGKWHNGR--HYPM---------------------HPNGQGFDEFFGFCGGHWNRY 159 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 +L N++ V +GYI+D LTD AI + + K DQPF Y+ YNAPH P P++ Sbjct: 160 FDTNLEHNKQPVKTEGYITDVLTDRAIDFIKQNK--DQPFFCYVPYNAPHSP--WIVPEK 215 Query: 297 YQKQF-NTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL 354 Y ++ N G A YA V VD + R+++ L DNTI+LF +DNG + Sbjct: 216 YWDKYANKGLDDKARCAYAMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPNSNR-- 273 Query: 355 PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKD 413 NG +G K + GG P+F+ + GK++ G K I+A +D PT L+ + D Sbjct: 274 -YNGNMRGRKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSVENTAD 332 Query: 414 LKLDGVSLLPWLQDKKQGE-PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHN 472 LDG SL+P L +K + P + L +S +IP D+ P+ Sbjct: 333 QPLDGKSLVPLLTNKSNKDWPQRML-----FSDRLFRNSIP------------DDELPNG 375 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 +VR + + Y E + LY + D QK N+ A+P V+K++ R Sbjct: 376 -----------SVRTDRWRAAY--ERGKWSLYDMQADPSQKQNVIEAHPAVIKDLSAAYR 422 Query: 532 E-FIDSSQ 538 + F D SQ Sbjct: 423 DWFKDVSQ 430 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 173 bits (438), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 147/523 (28%), Positives = 223/523 (42%), Gaps = 131/523 (25%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN IV+ DDLGYG L G + T++ TP L L Sbjct: 16 QPNFIVIYCDDLGYGDL----GCYGSDTVK----------------------TPHLDGLA 49 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ---DGIPLTETFLPELFQN 173 DEG+RFTN Y V PSRA+++TG+ PAR GV A+ G+P E L + + Sbjct: 50 DEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEILGAKRGSHGLPADEVTLAKALKP 109 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA GKWHL +EE P GFD F GF A Sbjct: 110 AGYRTALYGKWHLGL-----------------------SEETSPNAHGFDEFFGFKAGCV 146 Query: 234 AYYNS-------------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLY 279 +Y+ L++N V G Y+++ +T+ ++ + R++ + PF L+ Sbjct: 147 DFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFLF 206 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 +YNAPH P AP +Y +F A + +VD GV +I++ LK+ G Y++T+ Sbjct: 207 ASYNAPHYPMH--APQKYMDRFAHLPWDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDTV 264 Query: 340 ILFTSDNGAVIDGPLPLNGAQ-----------KGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 I F+SDNG + L+G + +G+K+ + GG P + W + G Sbjct: 265 IFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEGGIREPAILSWPNGWEGGQ 324 Query: 389 Y-DKLISAMDFYPTALDAADISIP----KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 D++ + MD PT LD A + + + LDG SL LQ ++ PH+ L W Sbjct: 325 VRDEVAAMMDLAPTFLDLAGVDPAAGPLQGVALDGSSLKEMLQ-MREPSPHQQLFW---- 379 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-------V 496 +Y Q VR D+ LV V Sbjct: 380 ------------------------EY----------QGQLAVREGDWKLVLNGKLDFDRV 405 Query: 497 ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 +Q+ L L+ D ++ NLA P++V+ + VR++ + Q Sbjct: 406 VPDQIHLSDLSRDPGERSNLADRYPEIVERLSRDVRDWYEEVQ 448 >UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM29_9BACT Length = 481 Score = 172 bits (437), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 140/426 (32%), Positives = 189/426 (44%), Gaps = 95/426 (22%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 TP +PNI+++ DDLGYG L + K Q Sbjct: 26 TPQTKKDTERPNIVLILCDDLGYGDL----ACYGHK----------------------QI 59 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP------ 161 TP L + EG+RF + Y A V SR ++TGR+P R GVY D IP Sbjct: 60 KTPNLDQMAKEGIRFNHFYSAAPVCSASRVGLLTGRSPNRAGVY------DWIPHSSESS 113 Query: 162 -----LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 E P+L Q GY T GKWH N + + Q Sbjct: 114 SPHMRKNEITFPQLLQKAGYATCLSGKWHC-------------------NGALINTNQAQ 154 Query: 217 PQNRGFDY-FMGFHAAGTAYYNSPSLFKN-RERVPAKGYISDQLTDEAIG-VVDRAKTLD 273 PQ+ GFDY F + A ++ N + +N E P +G+ +T+EAI + D K + Sbjct: 155 PQDAGFDYWFATQNNAAPSHKNPVNFIRNGVELGPIEGFSCQIVTNEAINWMEDHVKQNE 214 Query: 274 -QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN-----YYASVYSVDQGVKRILE 327 QPF +YL+++ PH P +P QK +T A+N Y+A+V ++D+ V ++ Sbjct: 215 KQPFFIYLSFHEPHEPIASP-----QKIVDTYKGIAENTNQAEYFANVENLDKAVGSLMN 269 Query: 328 QLKKNGQYDNTIILFTSDNGAVIDGPLPLN------------GAQKGYKSQTYPGGTHTP 375 QLKK DNT+++FTSDN GP LN G KG K T G P Sbjct: 270 QLKKLKINDNTLVIFTSDN-----GPETLNRYEAASRSYGSPGELKGMKLWTAEAGFRVP 324 Query: 376 MFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 M W K+ G D++ISA+DF+PT D A S K L LDG + P L KK H Sbjct: 325 AIMHWPEKIATGQISDQVISALDFFPTFCDLAQASNSKSLNLDGSNFTPALHKKKMTR-H 383 Query: 435 KNLTWI 440 K L WI Sbjct: 384 KPLLWI 389 >UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y214_PEDHD Length = 472 Score = 172 bits (435), Expect = 4e-41, Method: Compositional matrix adjust. Identities = 137/467 (29%), Positives = 200/467 (42%), Gaps = 79/467 (16%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +K +T ++ + + T KPN+IV+ DD GY D G + K Sbjct: 4 IKTISTLLLALWTGISAAQVKTAAKPNVIVIVSDDAGY----VDFGCYGGK--------- 50 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 Q TP + ++ +G RFT+ YV+ V PSRA I+TGR RFG Sbjct: 51 -------------QIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFGFEH 97 Query: 153 NTD---------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 NT G+ +E + Q +GY T A+GKWH D Sbjct: 98 NTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQG--------------DE 143 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY-------NSPSLFKNRERVPAK--GYI 254 +F P NRGF+ F GF ++ N +L+ N+E VP Y+ Sbjct: 144 PKHF---------PLNRGFNEFYGFTGGHRDFFAYKGKRTNEHALYNNKEIVPENEITYL 194 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYAS 314 +D TD+A + K D+PF +YL+YNA H P N D ++ + Y A Sbjct: 195 TDMFTDKATSFITANK--DKPFFMYLSYNAVHTPM-NAKKDLMERYASIADTGRRAYAAM 251 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 + S+D G+ +++ LK N NT+I+F +DNG NG +G K + GG Sbjct: 252 MTSLDDGIGKVMATLKANQLDKNTLIIFINDNGGATVNSSD-NGPLRGMKGSKWEGGIRV 310 Query: 375 PMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 M M W G + D + +S++D PTA+ A KLDGV+LLP+L + P Sbjct: 311 AMMMKWPGHIAANKTDSRPVSSLDILPTAIGAGKGKQKGTKKLDGVNLLPYLSAGNKKTP 370 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 H+ L W + E N W K +R + N DLS+ Sbjct: 371 HEALYWRRGVAAAMREGN---W----KLIRVKESPTVQNVLLFDLSK 410 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 171 bits (433), Expect = 6e-41, Method: Compositional matrix adjust. Identities = 150/507 (29%), Positives = 221/507 (43%), Gaps = 122/507 (24%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S GKPN++++ +DDLG L +F YK TP + Sbjct: 18 SAAGKPNVVLIVIDDLGQRDLGCYGSTF-------------YK-------------TPNI 51 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP----------- 161 + +GVRFT+ Y A V P+RA+IMTG+ P R G+ + +P Sbjct: 52 DRMAKDGVRFTDFYAACPVCSPTRASIMTGKYPQRVGITDWLPGRKDLPGQRLKRPELKN 111 Query: 162 ---LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 L E + E + HGY TA +GKWHL + ++P+ Sbjct: 112 ELALEEVTVAETLKGHGYVTAHIGKWHLG------------------------GKGFEPE 147 Query: 219 NRGFDY-FMGFHAAGTAYYNSPSLFKNR--------ERVPAKGYISDQLTDEAIGVVDRA 269 +GFD G H Y +P F N+ E+ Y++D+L EA + Sbjct: 148 KQGFDVNVAGDHTGTPLSYFAP--FANKAGATMPGLEKAAPDEYLTDRLAAEAETFITAN 205 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILE 327 K D+PF LYL + H P P P D+Y+ Q G Q+ Y A V S+D V R+L+ Sbjct: 206 K--DKPFFLYLPHYGVHTPLRAPQPLVDKYKTQAVHGRQSNPVYAAMVESMDAAVGRVLK 263 Query: 328 QLKKNGQYDNTIILFTSDNG--AVIDGPLP----LNGAQKGYKSQTYPGGTHTPMFMWWK 381 +L DNT++LFTSDNG A ++G +P +N + K Y GG P+ W Sbjct: 264 RLDDLKLSDNTLVLFTSDNGGLATLEG-MPFAPTINAPLREGKGYLYEGGVRVPLIAKWP 322 Query: 382 GKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 GK++PG D++ ++DF+ T L+A + + DGVSL+P +K +P + L W Sbjct: 323 GKVKPGTVMDQVACSIDFFDTILEATGAT--SAARRDGVSLVPAFGGEKL-KP-RALYW- 377 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 YPH N S+ VR +Y LV E+ + Sbjct: 378 ---------------------------HYPHYANQG--SRPGGAVRAGNYKLVEYYEDGR 408 Query: 501 LGLYKLT-DLQQKDNLAAANPQVVKEM 526 L+ + DL + NLAA P VVK++ Sbjct: 409 RELFDVAKDLSESRNLAADKPDVVKDL 435 >UniRef50_B2URC2 Sulfatase n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2URC2_AKKM8 Length = 465 Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 121/407 (29%), Positives = 181/407 (44%), Gaps = 74/407 (18%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+IV+ DDLGYG L + Q TP+L L Sbjct: 29 PNMIVILADDLGYGDL--------------------------GCTGSKQIKTPSLDRLAR 62 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--------GIPLTETFLPE 169 EGV + YV + PSR ++TGR P R+G+ +N + Q G+P TE +PE Sbjct: 63 EGVFCSRAYVTAPMCSPSRMGLLTGRFPKRYGITTNPNIQMDYLPESHYGLPQTEKLIPE 122 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 GY +A GKWHL T+ Y P RGF ++ GF Sbjct: 123 YLAPCGYRSAVFGKWHLG-----------HTKGY------------TPPERGFTHWWGFL 159 Query: 230 AAGTAYY---------NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 Y+ N + N Y++D +TD A+ + A +PF +++ Sbjct: 160 GGSRHYFPVKKEAEGLNPSMIVSNFTDKTDITYLTDDITDRAVEFLQEAGKDKKPFFMFV 219 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 +YNAPH PN+ P+ K N + Y A VY++D+G+ RIL+ LK +G +TI+ Sbjct: 220 SYNAPHWPNEA-KPEDIAKFRNVQNGERRVYCAMVYAMDRGIGRILDALKADGLEKDTIV 278 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG--KLQPGNYDKL-ISAMD 397 +F SDNG + N +G K Q + GG P + + +L PG+ + +S++D Sbjct: 279 VFLSDNGGAPEAS-SCNAPFRGAKRQHFEGGVRVPFIIRYPADKRLVPGSVCRQPVSSVD 337 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 P L A IP+ KLDG+ +L + +K P + W T Y+ Sbjct: 338 LLPALLKANGRHIPR--KLDGMDILELVGNKGAPVP-RTFFWCTDYT 381 >UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKN7_9BACT Length = 465 Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 143/522 (27%), Positives = 227/522 (43%), Gaps = 110/522 (21%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 K NII++ DD+ YG L G+ ++ K TP + S+ Sbjct: 19 KTNIILIFADDMHYGAL-----------------------GVTGSVLTKAK-TPAIDSIF 54 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-------GVYSNTDAQDGIPLTETFLPE 169 +EGV F NGY +H PSRA ++TGR ARF G G+ +E +P Sbjct: 55 NEGVHFPNGYASHATCAPSRAGLLTGRYQARFDLETLPGGTADRKKTGYGVKTSEIMIPA 114 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 L + GY T A+GKWHL S+EE+QP RGFD++ G+ Sbjct: 115 LMKKGGYQTCAIGKWHLG-----------------------SSEEFQPNARGFDHWFGYR 151 Query: 230 AAGTAYY--------------------NSPSL--FKNRERVPAKGYISDQLTDEAIGVVD 267 + Y P+L +N E V +GY++D +DEA + Sbjct: 152 GSCGFYQFKSQVQSAKKGQELKPLPSGEDPNLDVVRNGESVRLEGYLTDHFSDEAANWIK 211 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILE 327 K ++PF +Y A P + APD ++ TA + + ++D V+ IL+ Sbjct: 212 ENK--ERPFFMYFA------PYNVHAPDTVPNKYIPKGGTAHD--GVIAALDASVQTILD 261 Query: 328 QLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 LK+ G DNT+++F++DNG D + KG K+ Y GG P M W ++ G Sbjct: 262 ALKEAGIADNTLVVFSNDNGGKKD----YSKTFKGNKATFYEGGIRVPFAMRWPKGIEAG 317 Query: 388 N-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD--KKQGEPH--KNLTWITS 442 + Y+ ++S +D PT A + +P D DG +LLP ++D K Q + H +N W T+ Sbjct: 318 SKYNGVVSTLDLLPTFAALAKVDLPSDRVYDGQNLLPVIKDSAKDQRQAHFWRNGAWRTA 377 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG 502 + W + R + + + + T L E Sbjct: 378 --------RVGDWKLVWQVDRKKQKALLNKLGIKHVKGRGVTYAERADELFLEPE----- 424 Query: 503 LYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 LY L D +++ NLA +NP+ ++EM + +++ ++S P E Sbjct: 425 LYNLANDPKEESNLAQSNPEKLQEMVKIYKDW-EASIPKWRE 465 >UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z632_9GAMM Length = 545 Score = 170 bits (430), Expect = 2e-40, Method: Compositional matrix adjust. Identities = 147/514 (28%), Positives = 217/514 (42%), Gaps = 123/514 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ DDLG+ + + G D TP+L L Sbjct: 32 KPNILIMVADDLGWADVGYHGGDID---------------------------TPSLDRLA 64 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQD--GIPLTETFLPELFQN 173 +GVR N + + P+RAA+MTGR P R GV Y D G+ E F+PE FQ Sbjct: 65 QQGVRL-NRFYTTPICSPTRAALMTGRDPIRLGVTYGVIFPWDNIGVHPDEHFMPETFQA 123 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA +GKWHL H T + P NRGF++F G Sbjct: 124 AGYQTAIIGKWHLG----------------HAQMT------YHPNNRGFEHFYGHLHTEV 161 Query: 234 AYY------NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 +Y +N + +GY + L DE + R + D+PF++Y+ + APH Sbjct: 162 GFYPPFSNQGGKDFQRNGVSIDDQGYETYLLADEVSRYI-RERDRDRPFLVYMPFIAPHT 220 Query: 288 PNDNPAP--DQYQK-----QFNTGSQTADN---------------YYASVYSVDQGVKRI 325 P D P D+Y+ QT D Y A V ++DQ + R+ Sbjct: 221 PLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVMLQPSARPMYAAVVDAMDQAIGRV 280 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ-KGYKSQTYPGGTHTPMFMWWKGKL 384 L+ L + G DNTI+LF SDNG N A +G K +T+ GG M W L Sbjct: 281 LDTLDQEGISDNTIVLFFSDNGGAAYSYGGANNAPLRGGKGETFEGGIRVTSLMRWPAML 340 Query: 385 QPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 +PG +++++S MD +PT +DAAD+ + LDG S+ L+ Q L Sbjct: 341 EPGQIFEQIMSVMDVFPTLVDAADVRPGNNFALDGRSMWTALKSGDQVPLEGPLI----- 395 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG- 502 F E IP + N F++ N ++ LV V+ Q+ Sbjct: 396 ---FGSE-IPIYGN-----------------------FNFAAFNEEWKLVQEVQQEQIAI 428 Query: 503 -----LYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 L+K+ +D + +NLAA P +V+ + + Sbjct: 429 TVTNYLFKISSDPYEHNNLAAVYPDIVENLSKAI 462 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 169 bits (428), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 131/438 (29%), Positives = 183/438 (41%), Gaps = 106/438 (24%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S +PNIIVL DDLGYG+L Q TP + Sbjct: 21 SAAEQPNIIVLLADDLGYGELGCQGN--------------------------PQIPTPHI 54 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV------YSNTDAQDGIPLTETF 166 SL G+RFT YV PSRA ++TGR P RFG N D+ G+P E Sbjct: 55 DSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEFNPIGARNEDSGTGLPPDEQT 114 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 + E + GY T +GKWHL T DYH P GFD F Sbjct: 115 IAERLHDQGYTTCLIGKWHLGG-----------TADYH------------PFRHGFDEFF 151 Query: 227 GFHAAGTAY----YNSPSLFKNRERVPAKG------------------------------ 252 GF G + Y+ + R+ +P + Sbjct: 152 GFMHEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHMGYDEPDYDANNPIIR 211 Query: 253 ---------YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD-QYQKQFN 302 Y++D T EA+ ++R + D+PF LYLAYNA H P D Q+ Q Sbjct: 212 GGQPVNETEYLTDAFTREAVSFINRHQ--DKPFFLYLAYNAVHSPLQGKKKDIQHFTQIE 269 Query: 303 TGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKG 362 + + A + S+DQ + +IL+Q++++G + T+I+F SDNG N +G Sbjct: 270 DIHRQI--FAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPTRELTSSNLPLRG 327 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 K Y GG P M W G L P D +S++D +PT++ A S+P++ LDG +L Sbjct: 328 EKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPTSVALAGASLPQN--LDGRNL 385 Query: 422 LPWLQDKKQGEPHKNLTW 439 LP L +K P + W Sbjct: 386 LPLLLQQKTELPVADFFW 403 >UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2X1_9BACE Length = 494 Score = 168 bits (426), Expect = 5e-40, Method: Compositional matrix adjust. Identities = 132/426 (30%), Positives = 186/426 (43%), Gaps = 107/426 (25%) Query: 40 TNVAFSDFTP---------TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 N+A D+ P E PN++V+ +DD+GYG L Sbjct: 44 VNIALRDYRPEGRKRYLEGVELENGDPPNVVVIYVDDMGYGDL----------------- 86 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-- 148 A STP + +L + GV TN Y + SRA ++TGR P R Sbjct: 87 ---------GCTGATAISTPNIDALAEGGVLLTNYYAPAPICSASRAGLLTGRYPIRTLT 137 Query: 149 -GVYSNTDA-------------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSK 188 G Y NT+ DG+P E LPE+ Q GY TA VGKWHL Sbjct: 138 SGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDEILLPEVLQQAGYETALVGKWHLG- 196 Query: 189 ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY---NSP-SLFKN 244 EE +P NRGFD F G A Y N P ++ N Sbjct: 197 ----------------------IREEERPYNRGFDLFYG------ALYSDDNDPHRIYHN 228 Query: 245 RERVPAKGY----ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 E V + Y ++ +LT A +D + D PF LY A PH P++ A +++ Sbjct: 229 DEVVHDEPYDQSGMTKELTQVAKQFIDDNQ--DGPFFLYYASPFPHWPSN--ASEEW--- 281 Query: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 G+ A Y + VD V I++ L++NG +NT+++FTSDNG DG G Q Sbjct: 282 --LGTSQAGIYGDCMQEVDWSVGEIMDTLEENGLLENTLVIFTSDNGPWYDGA---TGGQ 336 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGV 419 +G K Y GG+H P + G + G YD L+S +D +PT L+ I +P+D +DG+ Sbjct: 337 RGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVDVFPTILNLLGIELPQDRVIDGM 396 Query: 420 SLLPWL 425 + P+L Sbjct: 397 DMWPFL 402 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 168 bits (426), Expect = 5e-40, Method: Compositional matrix adjust. Identities = 130/414 (31%), Positives = 184/414 (44%), Gaps = 85/414 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNIIV DDLGYG L G + M+ TP L L Sbjct: 3 RPNIIVFYCDDLGYGDL----GCYGSDAMK----------------------TPHLDQLA 36 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELFQN 173 EG+RFTN Y V PSRA+++TG+ PA+ GV S G+ L +T L + Sbjct: 37 SEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALKE 96 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 HGY+TA GKWHL ++ E+ P GFD F GF A Sbjct: 97 HGYHTALFGKWHLG-----------------------ASAEYGPNAHGFDQFYGFRAGCI 133 Query: 234 AYYNS-------------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLY 279 YY+ L++N V G Y+++ +T EA +D A D+P+ +Y Sbjct: 134 DYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREATSYIDAAPD-DEPYFMY 192 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 +AYNAPH P AP Y +F A + +VD GV I++ LK+ G Y++TI Sbjct: 193 VAYNAPHYPMH--APKAYLDRFPDLPPDRRIMAAMIAAVDDGVGEIVKALKQKGAYEDTI 250 Query: 340 ILFTSDNGAVIDGPLPLNGAQ-----------KGYKSQTYPGGTHTPMFMWWKGKL--QP 386 I F+SDNG + L+G + +G+K+ + GG P + + L Q Sbjct: 251 IFFSSDNGPSTESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQQ 310 Query: 387 GNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G D++ + MD +PT L+ + I + LDG S+ L P K L W Sbjct: 311 GQISDEMFAMMDIFPTMLELSGIGT-EGYSLDGHSVFDALSGNAL-SPRKQLFW 362 >UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB0_50A10 RepID=A4GJF1_9BACT Length = 544 Score = 167 bits (424), Expect = 7e-40, Method: Compositional matrix adjust. Identities = 153/582 (26%), Positives = 246/582 (42%), Gaps = 122/582 (20%) Query: 10 VSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKG--------KPNII 61 + S+ +++ SG A+ V +NV + PT +S KG +PNII Sbjct: 5 LMVSLMVLIVSGFVAWEYKVNILVWAIPKISNVTVQENIPTTWS-KGPDTPVDDNRPNII 63 Query: 62 VLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVR 121 ++ DD+GY + G T++ K I+A KS G+ Sbjct: 64 LVLADDMGYNDISIHNGGAADGTLQT------------KNIDALAKS----------GIL 101 Query: 122 FTNGYVAHGVSGPSRAAIMTGRAPARFG-------------------------------- 149 FT GY A+ PSRA+IMTG+ P RFG Sbjct: 102 FTRGYAANATCAPSRASIMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDRE 161 Query: 150 VYSNTD--AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 V SN + G+P + + E+ ++ GYYTA +GKWHL + D ++ + D+ Sbjct: 162 VVSNMPPFMEQGMPTEQITIAEVLRDAGYYTAHIGKWHLGHEYGM----DPMSQGFQDSL 217 Query: 208 TTFS---AEEWQPQ--NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEA 262 E P N FD + G Y++ F + Y++D TDEA Sbjct: 218 GLVGPLYLPEDHPDVVNAKFDTRIDKMIWGMGQYSAN--FNGGDLFAPDKYVTDYYTDEA 275 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 + V++ K ++PF LYL++ A H P D +++ + Y + S+D+ V Sbjct: 276 LKVIENNK--NRPFFLYLSHWAIHNPLQALRSD-FEQMSHMHGHNLQVYSGMINSLDRSV 332 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-PLNGAQKGYKSQTYPGGTHTPMFMWWK 381 +I+E+LK+ Y T+I+FTSDNG L +N +G+K + GG P + W Sbjct: 333 GKIIEKLKELDIYGKTLIIFTSDNGGANYIELNDINKPYRGWKISFFDGGIRVPYIISWP 392 Query: 382 GKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 ++ PG + + D +PT L AA I + LDGV L+P++++ +PHK L W Sbjct: 393 DEINPGKKSENAVHHFDIFPTILKAAGIESTNE--LDGVDLMPFIKNDSSSKPHKTLFWR 450 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + HQS V + + + + + N Sbjct: 451 SG--------------------NHQS------------------VLHEHWKFIISKKENF 472 Query: 501 LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 L+ + D +K+NL +NP VVKE++ ++ EF + PL Sbjct: 473 RWLFDTSADPTEKNNLVDSNPDVVKEIEELLVEFNSEQKDPL 514 >UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAW4_FLAJ1 Length = 539 Score = 167 bits (424), Expect = 8e-40, Method: Compositional matrix adjust. Identities = 146/551 (26%), Positives = 229/551 (41%), Gaps = 117/551 (21%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 K + AF T +++ KPNII+L DDLG + G P Sbjct: 41 KLAEGKAAFLSQKDTSAASEKKPNIIILLADDLGKYDISLYGGKSTP------------- 87 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG------ 149 TP + SL GV FT+GYV+ + PSRA ++TGR RFG Sbjct: 88 -------------TPQIDSLAASGVTFTDGYVSSSICSPSRAGLLTGRYQERFGHEYQPG 134 Query: 150 --------------VYSNTD----------------AQDGIPLTETFLPELFQNHGYYTA 179 NT+ A G+P +E +L + GY TA Sbjct: 135 DRYPKNNLEYYAFKYLLNTNSWRLNPKIEYPNDASIATQGLPKSEITFADLAKKQGYSTA 194 Query: 180 AVGKWHLSKISNVPVPEDKQTRDYH----DNFTTFSAEEWQPQ--NRGFDYFMGFHAAGT 233 +GKWHL P D+ DYH F+ F+ E+ P N F G Sbjct: 195 IIGKWHLGHTKGF-FPLDRGF-DYHYGFYQAFSLFAPEDNNPDIINHHHTDFTDKTIWGN 252 Query: 234 AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA 293 + + ++ + K Y++++ +EA +D+ K ++PF+LY+ +NAPH P Sbjct: 253 GRVGTGQIRRDSTIIDEKKYLTEKFAEEAEAFIDKNK--NKPFLLYVPFNAPHTPFQ--V 308 Query: 294 PDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 +Y +F N + Y+A + ++D + I ++KK G +NT+I F SDNG Sbjct: 309 RKKYYDRFPNVKDENKRVYFAMISALDDAIGLIRAKVKKEGLEENTLIFFASDNGGADYT 368 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIP 411 N KG K + GG + P + WKGK++P Y +S++D + T +P Sbjct: 369 YATTNAPLKGGKFSHFEGGVNVPFALSWKGKIKPHTIYKTPVSSLDIFSTIAAVTHSGLP 428 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 KD DGV L+ + + KQ H+NL W +S D Sbjct: 429 KDRVYDGVDLVDVVNNNKQA--HQNLYW-------------------------RSGD--- 458 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVV 530 + +R+ D+ L+ + + ++ LY L D + +LA+ NP+ VKE+Q + Sbjct: 459 ----------AKAIRSGDWKLIISGKTHETWLYNLAKDKSETTDLASKNPEKVKELQTAL 508 Query: 531 REFIDSSQPPL 541 + + PL Sbjct: 509 QNWEKGLIKPL 519 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 127/407 (31%), Positives = 178/407 (43%), Gaps = 86/407 (21%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+ + +DDLGYG L A +P + +L Sbjct: 42 PNILCILVDDLGYGDL--------------------------SCQGATDLQSPNIDALAA 75 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV----YSNTDAQDG-IPLTETFLPELFQ 172 G+RFTN Y V PSRAA++TGR P GV N + G + +P Sbjct: 76 NGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVIRQNPENNWGNLADDAVLIPSELN 135 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF---- 228 GY+T +GKWHL E+ T P +RGF YF GF Sbjct: 136 PAGYHTGIIGKWHLGL-------EEPDT----------------PNDRGFTYFKGFLGDM 172 Query: 229 ------HAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 H G + + NRE + KG+ +D TD I + + +QPF LYLAY Sbjct: 173 MDDYWDHRRGGINW----MRLNREEIDPKGHATDLFTDWTIDFLKERQGEEQPFFLYLAY 228 Query: 283 NAPHLPNDNPAP---DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 NAPH P P +++ N + A N A V +D V R++E LK G +NT+ Sbjct: 229 NAPHFPIQPPREWLDKVREREPNLTEKRAKN-VAFVEHLDYSVGRVMEALKTTGLEENTL 287 Query: 340 ILFTSDNGAVI-----DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLI 393 ++F SDNG + +GPL +G K Y GG P +WKGK+ PG D Sbjct: 288 VVFVSDNGGALWYAQSNGPL------RGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTA 341 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 MD +PT + A P++ +DG+SL+P L + Q ++ L W+ Sbjct: 342 LLMDLFPTFCELAGRKPPEN--VDGISLVPTLTGQAQDTANRYLYWV 386 >UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organisms RepID=Q1YSH0_9GAMM Length = 557 Score = 166 bits (421), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 132/460 (28%), Positives = 197/460 (42%), Gaps = 84/460 (18%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K PNII++ DD+G+ + G +++ TP + Sbjct: 61 KRPPNIILILTDDMGFNDISLYNGGAADGSLQ----------------------TPNIDR 98 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV------------------------ 150 + ++G+RF NGY A+ V SRA+++TGR RFGV Sbjct: 99 IAEQGIRFNNGYAANAVCTSSRASLLTGRYSTRFGVEYTPIYKTGVRIFNWMEELNPSTP 158 Query: 151 -----------YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 DA G+P E + E+ Q YYTA +GKWHL ++ PE + Sbjct: 159 PVLVDMDLAATLPPIDAL-GMPAAEITIGEVLQQQDYYTAHIGKWHLGSNGDM-RPEQQG 216 Query: 200 TRDYHDNFTTFSAEEWQPQ-------NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKG 252 D F P D + + +N F+ KG Sbjct: 217 FDDSLSMKGIFYLPPDHPDVVNAKIPGDSIDSMVWAVGSYEVQWNGGPPFE------PKG 270 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 Y++D TD A+ V++ + +PF LYLA+ PH P D Y + Y Sbjct: 271 YLTDYFTDAAVDVIEANR--HRPFFLYLAHWGPHNPVQASRED-YDALPHIKDHRLRTYA 327 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKGYKSQTYPGG 371 A + ++D+ V++I L++NG DNT+I+FTSDNG L LN +G+K + GG Sbjct: 328 AMLRALDRSVEKIEASLQENGLSDNTLIIFTSDNGGAGYLDLTDLNKPYRGWKLTHFEGG 387 Query: 372 THTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 TH P W +++ G + D+ I +D + T AA S+P D LDGV+LLP++Q K+ Sbjct: 388 THVPYMAKWPAQIEAGQSSDEAIHHIDMFHTIAAAAGASVPTDRTLDGVNLLPFMQGKQT 447 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 G PHK L W T + W K +R + D P Sbjct: 448 GAPHKTLFWHTGHQQ-------TVWHQGWKMIRAEQSDKP 480 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 166 bits (419), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 145/525 (27%), Positives = 221/525 (42%), Gaps = 127/525 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN++++ +DDLG + + SF TP + +L Sbjct: 28 KPNVLLIFIDDLGKTDIGIEGSSF--------------------------YETPRIDALA 61 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV--YSNTDAQDGIPLTETFLPELFQNH 174 G RFT Y AH V P+RAA+MTG+ P R G+ + ++ +P +E + + FQ Sbjct: 62 KSGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWIRPESDVALPQSEVTIGQAFQEA 121 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG-- 232 GY+TA +GKWHL + P RGFD+ G + G Sbjct: 122 GYHTAYLGKWHLGH-----------------------KPQQHPAARGFDWTKGVNHGGQP 158 Query: 233 TAYY---------NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 ++YY ++P+ + E+ + Y++D LT AI + + + +PF L LA+ Sbjct: 159 SSYYFPYKNPQKPDAPNNVPDFEKCQPEDYLTDVLTSSAIEHLQQ-RDRTRPFFLCLAHY 217 Query: 284 APHLPNDNPA--PDQYQKQFNT-------------------GSQTADNYYASVYSVDQGV 322 A H P P ++YQ + T Q Y A V ++D V Sbjct: 218 AVHTPIQPPKNLVEKYQVKLATQKNPKSPGEGIQEGSAISRSQQDHPAYAAMVENLDTQV 277 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVID------GP---LPLNGAQKGYKSQTYPGGTH 373 R+L++LK G D TI++FTSDNG + GP LPL A KG+ TY GG Sbjct: 278 GRLLDELKTQGILDQTIVVFTSDNGGLCTLNGKSPGPTCNLPLR-AGKGW---TYEGGIR 333 Query: 374 TPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P ++ W GK+ P D D YPT L I +DG+SL L K P Sbjct: 334 IPTYISWPGKISPQVLDIPAYTCDIYPTLLSLCQIPPRPTQHVDGISLA-GLLTKSSSLP 392 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 T + Y H H S P S +R + L+ Sbjct: 393 ESERTLVWYYPH-----------------THGSGHKP-----------SAAIRQGPWKLI 424 Query: 494 YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 + +E +++ LY L D + NLA+ +P+ ++Q +++ I+SS Sbjct: 425 HFLETDRIELYHLEDDPGESRNLASKHPERALQLQKELQKIIESS 469 >UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAY0_9PLAN Length = 466 Score = 165 bits (417), Expect = 5e-39, Method: Compositional matrix adjust. Identities = 148/518 (28%), Positives = 213/518 (41%), Gaps = 123/518 (23%) Query: 42 VAFSDFTPTEYSTK---GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 V+FS P + K +PNI+++T D+LGYG D G + M+ Sbjct: 16 VSFSVPAPVTAAEKPENKRPNILLITADNLGYG----DLGCYGNPVMK------------ 59 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ- 157 TP L L EGVR T+ Y A SRA ++TGR P R G+ A Sbjct: 60 ----------TPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNHQLSADE 109 Query: 158 ---DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 DG+ +E +PE + GY TA GKW++ F+ S Sbjct: 110 NYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVG-------------------FSPGS--- 147 Query: 215 WQPQNRGFDYFMGFHAAGTAYYN-----SPSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 +P RGFD F GF A YY+ L++ + V +GY +D D A + + Sbjct: 148 -RPTERGFDEFFGFAAGNIDYYHHYYAGRHDLWRGLKEVFVEGYSTDLFADAACQYI--S 204 Query: 270 KTLDQPFMLYLAYNAPHLP----------NDNPAPDQYQKQFNTGSQTA---DNYYASVY 316 DQPF +YL +NAPH P N+ APD +++ QT + Y A V Sbjct: 205 AESDQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVT 264 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL----NGAQKGYKSQTYPGGT 372 ++D + R+L+QL +G D TI+++ SDNGA + L N + + GG Sbjct: 265 ALDSAIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGI 324 Query: 373 HTPMFMWWKGKLQPG--NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P + + G L+ G N LIS +D PT + A +P + LDG +LP L + Sbjct: 325 RVPAIIRYPGHLKAGTVNQSPLIS-LDILPTLITLAGGPLPAERILDGQDMLPALAAQTA 383 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 EP F+ Y F VR Y Sbjct: 384 PEPRT------------------FFFQYRNFS---------------------AVRRGKY 404 Query: 491 SLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQ 527 LV N L+ L DL + +LA NP+V+ ++Q Sbjct: 405 KLVRIKPNQPFMLFDLEQDLSETTDLAERNPKVLNQLQ 442 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 164 bits (416), Expect = 7e-39, Method: Compositional matrix adjust. Identities = 120/402 (29%), Positives = 184/402 (45%), Gaps = 74/402 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ DDLGY + + G+++ TP + + Sbjct: 19 KPNILIILADDLGYADVGYH--------------------GLEEI------PTPNIDRIA 52 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-------NTDAQDGIPLTETFLPE 169 +EGV+F+ GY + GP+RAA+M+G R G N G+P L + Sbjct: 53 NEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEGICGGRKLNEHVVVGMPREVKTLAQ 112 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 FQ GY T GKWHL + P +RGFD F G Sbjct: 113 YFQEAGYATGLFGKWHLGG-------------------ERLFDKTLMPTSRGFDEFFGIL 153 Query: 230 AAGTAYYNSPSLFKNRER--------VPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYL 280 + Y ++ NRER + +G Y +D + EA+ + R D+PF LYL Sbjct: 154 EGASLYDDT----VNRERKYIRQDTVIDYEGEYFTDAIGREAVSFITRKG--DKPFFLYL 207 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYDNTI 339 + A H P A ++Y ++F + +A++ S +D + R+ + L+ G DNT+ Sbjct: 208 PFTAVHAPMQ--ASEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTL 265 Query: 340 ILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW-KGKLQPG-NYDKLISAMD 397 I+F SDNG D LN KG K+Q Y GG P + W KG++ G D+ + MD Sbjct: 266 IVFWSDNGGKPDNNYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMD 325 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 +P+AL+AA I++PKD ++ ++LP +Q K PH + W Sbjct: 326 IFPSALEAAQITVPKD--IEAKTILPLMQGKTNQTPHPAMFW 365 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 164 bits (416), Expect = 7e-39, Method: Compositional matrix adjust. Identities = 150/567 (26%), Positives = 240/567 (42%), Gaps = 149/567 (26%) Query: 1 MKSALKKSVVSTSISL-ILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPN 59 MK+A+++ V ++ +L + + A HAAD +PN Sbjct: 6 MKTAVERIVFGGNLVWALLLTSLCATRVHAAD-------------------------RPN 40 Query: 60 IIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEG 119 + + DDLG+ + F+ +F TP L L EG Sbjct: 41 FVFILADDLGWKDVGFNGSTF--------------------------YETPNLDRLAREG 74 Query: 120 VRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY----SNTDAQDGI----------PLTET 165 +RFT+ Y A V P+RA+IMTG+ PAR + D D I P E Sbjct: 75 MRFTDAYAACSVCSPTRASIMTGKYPARLHLTDWLPGRPDKPDQILKHPKIITELPAAEI 134 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 L + Q GY TA +GKWHL + + W P+ GFD Sbjct: 135 TLAKALQEGGYKTAFIGKWHLGGLGH-----------------------W-PEQAGFDIN 170 Query: 226 MGFHAAG-TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 +G G + Y SP + P Y++D+LTDEA+ ++ K PF+LYL++ + Sbjct: 171 IGGCGMGHPSSYFSPYKNPTLKDGPVGEYLADRLTDEAVKFIENTK--GTPFLLYLSHYS 228 Query: 285 PHLP--NDNPAPDQYQKQF---------------NTGSQTADN---YYASVYSVDQGVKR 324 H P ++YQK+ NT ++ N Y A + S+D+ V R Sbjct: 229 VHTPLQAKKGLIEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIYAAMMQSLDESVGR 288 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 +L++LK+ G NT+I+FTSDNG + +G N + K Y GG P+ + W G Sbjct: 289 VLDKLKELGLDKNTVIIFTSDNGGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPG 348 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 + + D + + D+YPT L+ A + + LDG+S P L+ K+ GE + L W Sbjct: 349 VTKAASVSDHQVMSTDYYPTLLEIAGLPARPEQHLDGISFTPALRGKEMGE--RPLFW-- 404 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 Y H+ ++ P S ++R D+ L+ E N++ Sbjct: 405 HYPHYSNQGGAP----------------------------SSSIRKGDWKLIEWYEENRI 436 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQ 527 L+ L D+ +K++LA+ + +E++ Sbjct: 437 ELFNLRLDVGEKNDLASTSALKREELK 463 >UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSP6_9BACT Length = 512 Score = 164 bits (416), Expect = 7e-39, Method: Compositional matrix adjust. Identities = 128/449 (28%), Positives = 199/449 (44%), Gaps = 127/449 (28%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNII++ DD+GY + + N+ ++ TP + S+ Sbjct: 20 QPNIILIFADDMGYDDVGYHG---------NKRII-----------------TPNIDSIA 53 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD----------GIPLTETF 166 ++GV+F+ GYV+ V GPSRA ++TG RFG N + G+P +++ Sbjct: 54 EQGVQFSQGYVSASVCGPSRAGLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSM 113 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 + E + GY +GKWH+ ++ +P RG+D+F Sbjct: 114 ISEELKTLGYTNGMIGKWHMGFDMSL-----------------------RPNQRGYDFFY 150 Query: 227 GFHAAGTAYYNSPS----------LFKNRERVPA------------------KGYISDQL 258 GF Y +F+N E PA + Y++D Sbjct: 151 GFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDENYLTDLF 210 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTA-----DNYY- 312 TDEA+ +DR D+PF LYLAYNA H P +Q + +TA NY+ Sbjct: 211 TDEAVNFIDR--NADKPFFLYLAYNAVHHP--------WQTTQHALDKTAHLKDDKNYHV 260 Query: 313 --ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA-----VIDGP-----------L 354 + VY++D+G+ +++++LK+ DNTII+F SDNG+ + P + Sbjct: 261 FASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDPNRHRGGFTM 320 Query: 355 PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAA---DISI 410 G +GYK TY GG P + W ++Q G YD ISA+D PT + AA D Sbjct: 321 SSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISALDLQPTLVKAAGGNDKKP 380 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 K DGV +LP+L++ K E ++L W Sbjct: 381 QKGFAYDGVDILPYLKEDK--EIKRSLFW 407 >UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY54_CAUSK Length = 559 Score = 164 bits (415), Expect = 9e-39, Method: Compositional matrix adjust. Identities = 160/600 (26%), Positives = 238/600 (39%), Gaps = 166/600 (27%) Query: 12 TSISLILASGMAAFAAHA------ADDVKLKATKTN--VAFSDFTPTEYSTKGKPNIIVL 63 ++LI+A + A H ++L + N VA+S+ S PN+IV+ Sbjct: 8 AGLALIVAVALGWAATHKQAVFMWIAHMRLPHVEPNHAVAWSEGPEAAPSGPRPPNVIVI 67 Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DD+G+ + F+ G + TP + SL +GV F Sbjct: 68 LADDMGFNDITFNGG----------------------GVAGGLVPTPNIDSLGHDGVSFA 105 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFG-------------VYSNTDAQD------------ 158 NGY + PSRA IMTGR RFG V S A D Sbjct: 106 NGYDGNATCAPSRATIMTGRYATRFGFEFTPAPVAFEKMVGSEGAAGDIVLPRFYPDRLK 165 Query: 159 -----------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 +P +E + +L + GY+T GKWHL + Sbjct: 166 AMPPGSTAPTPDAVNELSMPASEITVAQLLKTRGYHTLHFGKWHLGGKAGS--------- 216 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--------------------YNSPSL 241 +P+ +GFD +GF A G+ Y + P+L Sbjct: 217 --------------RPEQKGFDESLGFIAGGSMYLPEGDPGVENAKQPWDPIDRFLWPNL 262 Query: 242 -----FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 F GY++D LTDEA+ V RA ++PF +Y A NA H P D Sbjct: 263 PYAVQFNGSPMFRPGGYMTDYLTDEAVKAV-RANR-NRPFFMYFAPNAIHTPLQATKAD- 319 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP- 355 Y Y A V ++D+ V R+L+ LK+ G NT+++FTSDNG LP Sbjct: 320 YDALPEIKDHRLRVYGAMVRNLDRNVGRLLQALKEEGLDQNTLVIFTSDNGGANYIGLPD 379 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN--YDKLISAMDFYPTALDAADISIPKD 413 +N +G+K+ + GG H+P FM W + P N Y + +D + TA AA +PKD Sbjct: 380 INRPYRGWKATFFEGGIHSPFFMRWPAVI-PANSRYSAPVGHIDIFATAAAAAGAPLPKD 438 Query: 414 LKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNP 473 +DGV L+P++Q K G PH+ L W +S Y Sbjct: 439 RVIDGVDLVPFVQGKATGRPHQTLFW-------------------------RSGSY---- 469 Query: 474 NTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVRE 532 V + D+ L + N++ L+ L D ++ L+AA P+ VK M ++R+ Sbjct: 470 ---------KVVLDGDWKLQSSEAQNKIWLFNLAQDPTEQHELSAAQPERVKAMLALLRQ 520 >UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI0_9BACT Length = 456 Score = 164 bits (414), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 148/513 (28%), Positives = 223/513 (43%), Gaps = 109/513 (21%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII + DD+GYGQL GS+ K ++ TP L + Sbjct: 19 KPNIIFIMCDDMGYGQL----GSYGQKMIK----------------------TPRLDQMA 52 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---AQDGIPLTETFLPELFQN 173 EG+R T+ Y V PSR ++MTG+ + N + Q+ IP + E + Sbjct: 53 KEGLRLTDYYAGTAVCAPSRCSLMTGQHVGHTYIRGNKEYPTGQEPIPAETITVAEKMKE 112 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA +GKW L + E +P +GFDYF G++ Sbjct: 113 AGYATALIGKWGLG----------------------YPGSEGEPNKQGFDYFFGYNDQKH 150 Query: 234 AYYNSPS-LFKNRERVPAKG-------YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 A+ + P L +N E + K Y LTDEA G + + K D PF LYLAY P Sbjct: 151 AHNHFPKFLLRNEETLTLKNNSGKEIEYSQYMLTDEAKGFIKKNK--DNPFFLYLAYVIP 208 Query: 286 H----LPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 H +P D+ QY+ + + + + + +D+ V IL+ LK+ +NT+++ Sbjct: 209 HSRLQIPGDDECYLQYKDE--SWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAENTLVV 266 Query: 342 FTSDNGAVIDG---PLPLN--GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA- 395 FTSDNGA +G P N G G K Y GG P W G ++PG I A Sbjct: 267 FTSDNGAHREGGARPEFFNDSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNHIGAH 326 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQ-DKKQGEPHKNLTWITSYSHWFDEENIPF 454 D PTA + + P+ +DG+S +P L+ + ++ E H L + HW + + Sbjct: 327 WDLMPTACELGGVQPPEG--IDGISYVPLLKGNMEEQEKHDYLYFEL---HWPTKRGVRK 381 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDN 514 D +V QS +PN + + F+ ++N DL QK + Sbjct: 382 GD----WVALQSKTSAIDPNKDTIKLFN--LKN--------------------DLGQKKD 415 Query: 515 LAAANPQVVKEMQGVVREFIDSSQP-PLSEVNQ 546 LA P+ V+E + + F+++ P PL E Q Sbjct: 416 LATQYPEKVEEFKKI---FLEAHTPAPLFEFGQ 445 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 163 bits (412), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 112/388 (28%), Positives = 174/388 (44%), Gaps = 72/388 (18%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 +T +PN+IV+ DDL G L GS TP L Sbjct: 51 ATSRRPNVIVILADDLAVGDLAGGDGS--------------------------PTRTPNL 84 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS----NTDAQDGIPLTETFLP 168 E ++F+ Y V P+RAA++TGR P R GV + + ET + Sbjct: 85 DRFASESIQFSQAYSGSCVCAPARAALLTGRYPHRTGVVTLNMNRYPEMTRLRRDETTIA 144 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 ++ ++ GY T VGKWH + + + P +RGFD F GF Sbjct: 145 DVLKDAGYATGLVGKWHTGR-----------------------GDGFHPLDRGFDEFEGF 181 Query: 229 HAAG-TAYYNSPSLFKNRERVPA--KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 + Y+ P F + ++ + Y++D L AI V R + PF L+LA+ AP Sbjct: 182 FGSDDVGYFRYP--FSEQRQISDVDESYLTDDLNRRAIEFVRRHH--EHPFFLHLAHYAP 237 Query: 286 HLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 H P + P P+ + G ++ YA + +D+G+ +L ++ G ++TI+LF S Sbjct: 238 HRPLEAP-PEVIARYREQGFDESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFAS 296 Query: 345 DNGAVIDGPLPLNGAQ-----KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFY 399 DNG P PL G + +G K Q GG P+F+ W +L PG D++++ +D Sbjct: 297 DNG-----PDPLTGERFNRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDLM 351 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQD 427 PT LD + + +LDG S +P L+D Sbjct: 352 PTILDLCRVDVSMLNRLDGESFVPVLED 379 >UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED1_PARD8 Length = 459 Score = 162 bits (411), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 138/484 (28%), Positives = 209/484 (43%), Gaps = 93/484 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN +++ DD+GYG L + T+ TP + + Sbjct: 30 KPNFVIIFCDDMGYGDL----SCYGNPTIR----------------------TPNIDRMA 63 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------DAQDGIPLTETFLPEL 170 EG++ T YV GVS PSRAA+MTGR P R G+Y + +++ G+ E + ++ Sbjct: 64 CEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKV 123 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 Q GY T VGKWHL S +P D Y +S + QN+G A Sbjct: 124 LQQSGYATGCVGKWHLGAFSPY-LPTDHGFDTYFG--IPYSNDMSPVQNKG--------A 172 Query: 231 AGTAYYNSPSLF--KNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 + +P + K E P +G ++ + T++A+ + +PF LY A+ PH+P Sbjct: 173 HARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSFIKNHS--KEPFFLYFAHTFPHIP 230 Query: 289 NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 Y G+ Y V +D V +L+ L++NG +NT ++FTSDNG Sbjct: 231 -------LYTNARFEGTSKRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNGP 283 Query: 349 VID--------GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYP 400 + GPL K K + GG P W GK+ P D+++++MD YP Sbjct: 284 WLTEHENGGSAGPL------KDGKGTWWEGGFRVPAICWMPGKINPAINDEIMTSMDLYP 337 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 T L A I PKDL LDGV+ L ++K + W S + W Y K Sbjct: 338 TFLSMAGIEQPKDLVLDGVNQTGLLFEEKHSARDEVYYWWGSELMAIRKGE---WKYYFK 394 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANP 520 ++ D Y E ++ L+Y VE TD+ ++ NLA +P Sbjct: 395 TIK---DQYLRTCKIETPAE----------PLLYNVE---------TDISERFNLADKHP 432 Query: 521 QVVK 524 ++VK Sbjct: 433 EIVK 436 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 162 bits (411), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 150/543 (27%), Positives = 222/543 (40%), Gaps = 143/543 (26%) Query: 42 VAFSDFTPTEYSTKGKP------NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 +AFS S KP N + + +DDLGY D G +P+T Sbjct: 13 LAFSVLADRSLSAAEKPKQNKPLNFVFILVDDLGY----MDVGCNNPQTF---------- 58 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF------- 148 TP + L G+RFTNGY A+ V P+R +IMTG+ P R Sbjct: 59 -----------YETPHINQLAKTGMRFTNGYAANPVCSPTRYSIMTGKYPTRVDATNFFS 107 Query: 149 ----GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYH 204 G + D +PL+ET + E + HGY T GKWHL Sbjct: 108 GKRAGKFLPAPLNDKMPLSETTIAEALKEHGYSTFFAGKWHLG----------------- 150 Query: 205 DNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--------YNSPSLFKNRERVPAKG-YIS 255 +E+ P+ +GFD G G Y Y +P L KG ++ Sbjct: 151 ------PTQEFWPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRLTDG-----LKGEHLP 199 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP------DQYQKQFNTGSQT-A 308 D+L E +D + D+PF YLA+ + H P P P ++ ++ TG + A Sbjct: 200 DRLASETAQFIDAHR--DEPFFAYLAFYSVHTPLMGPGPLVTKYKEKAKRLGLTGKEEFA 257 Query: 309 DN--------------------YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 D Y A V S+D+ V ++L+QL+++G +NT+++ T+DNG Sbjct: 258 DEEQVFPVDEKRRVRILQNHAVYAAMVESMDKAVGKVLQQLEESGVAENTVVMLTADNGG 317 Query: 349 V--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDA 405 + +G N +G K Y GG + W G +PG+ D+ + DFYPT LD Sbjct: 318 LSTSEGSPTSNLPLRGGKGWLYEGGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTILDL 377 Query: 406 ADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQ 465 A + + LDGVSL P+LQ + P K Y H+ ++ IP Sbjct: 378 AGLPLKPQQHLDGVSLKPFLQGEA---PFKRDALYWHYPHYSNQGGIP------------ 422 Query: 466 SDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVK 524 +R D+ L+ E+ Q+ LY L DL +K +LA P+ V Sbjct: 423 ----------------GGAIRVGDWKLIERFEDGQVHLYHLKEDLGEKQDLAEKYPERVA 466 Query: 525 EMQ 527 M+ Sbjct: 467 AMR 469 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 162 bits (410), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 126/419 (30%), Positives = 187/419 (44%), Gaps = 84/419 (20%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T S +PNI+ L DD GY F +GS +TM+ T Sbjct: 19 TSLSYAKQPNIVFLFSDDAGYADFGF-QGS---ETMK----------------------T 52 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-------------A 156 P L L EGVRFT GYV+ GPSRA IMTGR +FG Y + A Sbjct: 53 PNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGRYQQKFG-YEEINVPGYMSEHSAIKGA 111 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + GIPL E + + ++ GY TA GKWHL +E Sbjct: 112 EMGIPLDEVTMGDYMKSLGYRTAFYGKWHLG-----------------------GTDELH 148 Query: 217 PQNRGFDYFMGFHAAGTAYY----NSP----SLFKNRERVPA-------KGYISDQLTDE 261 P +RGFD F GF +Y+ N+P ++F +++ +GY++D L ++ Sbjct: 149 PMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEGYLTDVLAEK 208 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQG 321 A +++A D+PF ++L++NA H P + D +F A ++D+ Sbjct: 209 ANQFIEKAP--DKPFFIFLSFNAVHTPMEATPED--LAKFPQLKGKRKEVAAMTLALDRA 264 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 +L +LK+ G D+T+++F++DNG D N G KS GG P + W Sbjct: 265 SGAVLNKLKELGLEDDTLVVFSNDNGGPTDKNASSNYPLAGTKSNFLEGGIRVPFLVKWP 324 Query: 382 GKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 KL G YDK +S +D PT A +LDGV L+P++ + PH+++ W Sbjct: 325 AKLAAGKVYDKPVSTLDLLPTFFKAGGGEEVMS-ELDGVDLMPYITGQNNKAPHESMYW 382 >UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLE2_9BACT Length = 441 Score = 162 bits (410), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 119/404 (29%), Positives = 182/404 (45%), Gaps = 80/404 (19%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII++ DD GS D +++++ TP + S+ Sbjct: 21 PNIIIILADD---------AGSSDFSCYGSKQLL-----------------TPHIDSIAH 54 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-----DAQD----GIPLTETFLP 168 G++FT Y A V PSRA ++TGR FG +N A D G+P+TE L Sbjct: 55 NGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLA 114 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 + + GY T +GKWHL + A+ + P RGFD F GF Sbjct: 115 DSLKELGYSTHCIGKWHLGE-----------------------ADHFHPNARGFDNFYGF 151 Query: 229 HAAGTAYYNSPSLFKNRERV--------PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 + Y+ L + +R+ P+ GY ++ T EAI ++ + D+PF +YL Sbjct: 152 LSGARTYFLGGELRGDMDRIMRNKEFAEPSSGYTTEVFTQEAIRIIQEEQ--DKPFFIYL 209 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 ++NA H P D A D+ ++ + Y + ++D +L+ LK + QY+NT+I Sbjct: 210 SHNAVHGPMD--AKDEDIMSYDFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLI 267 Query: 341 LFTSDNGAVIDGPLPLNGAQ----KGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISA 395 F SDNG GP NG+ +G+K + GG TP + W K+ G + DK I A Sbjct: 268 FFMSDNG----GPTTHNGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKISAGLSSDKPIIA 323 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 D + T + AA + D G+ LLP + +K Q + L W Sbjct: 324 YDVFATCIQAAGGELVTDRTYHGIDLLPVI-NKPQETNARKLFW 366 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 161 bits (408), Expect = 5e-38, Method: Compositional matrix adjust. Identities = 128/436 (29%), Positives = 181/436 (41%), Gaps = 107/436 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ + DDLGYG+ P +++ TP + L+ Sbjct: 27 RPNILFIVADDLGYGE---------PGCYGGKDI-----------------PTPNIDKLV 60 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV------YSNTDAQDGIPLTETFLPEL 170 GVRF++GYV+ SRAA+MTGR RFG N D G+P+ E + + Sbjct: 61 ASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNPIGAKNADPGTGLPVNEKTVADR 120 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 ++ GY T VGKWHL T +H PQ RGFD F GF Sbjct: 121 LRDVGYATGLVGKWHLGG-----------TAPFH------------PQRRGFDEFFGFLH 157 Query: 231 AGTAYY---------------------------------------NSPS------LFKNR 245 G Y N P+ L +N Sbjct: 158 EGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWSTDLHENEPAYDADNPLLRNS 217 Query: 246 ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS 305 + V K ++D T EA +DR + QP+ LYLAYNA H P D Y ++F+ Sbjct: 218 QPVEEKANLTDAFTREACSFIDRHQA--QPWFLYLAYNAVHSPLQ--GEDTYMEKFSHIG 273 Query: 306 QTADNYYASVYS-VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK 364 +A+V + +D+ + ++ QL+ +G +NT+++F SDNG N +G K Sbjct: 274 DIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVFLSDNGGPTKELTSSNLPLRGGK 333 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 + GG P + WKG++ G+ D +MD TAL A + KLDGV LLP Sbjct: 334 GDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTATALKLAGAET-EQAKLDGVDLLP 392 Query: 424 WLQDKKQGEPHKNLTW 439 L K PH L W Sbjct: 393 LLTGKTTAAPHDTLFW 408 >UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteroidetes RepID=A4AQQ7_9FLAO Length = 596 Score = 161 bits (407), Expect = 8e-38, Method: Compositional matrix adjust. Identities = 126/428 (29%), Positives = 192/428 (44%), Gaps = 82/428 (19%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V+ T + + KPN++++ DD G+G L F+ + Sbjct: 21 VSCEKKTKEKNEIQTKPNVVLIMTDDQGWGDLSFNGNT---------------------- 58 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 STP + ++ G F N YV V P+RA ++TG+ AR GVYS + + Sbjct: 59 ----NLSTPNIDAIAKNGASFQNFYV-QPVCSPTRAELLTGKYAARLGVYSTSTGGERFN 113 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 ET + E+F+ GY T A GKWH S + P YH P +RG Sbjct: 114 SKETTIAEIFKKAGYKTTAYGKWH----SGMQPP-------YH------------PNSRG 150 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 FD + GF + Y SP L N E V +G++ D LT++ + + K + PF LYL Sbjct: 151 FDDYYGFTSGHWGNYFSPMLEHNGEIVKGEGFLVDDLTNKGLDFITENK--NNPFFLYLP 208 Query: 282 YNAPHLPNDNPAPDQYQKQFNT---------GSQTADNY----YASVYSVDQGVKRILEQ 328 YN PH P P++Y ++F + ++N+ A V ++D + R+ + Sbjct: 209 YNTPHSPMQ--VPNEYWERFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNMGRLTNK 266 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 LK+ G +NTII++ SDNG NG +G K T GG +P F+ WK + P N Sbjct: 267 LKELGLEENTIIVYLSDNGP---NGWRWNGGMRGRKGSTDEGGVRSPFFIQWKNTI-PKN 322 Query: 389 --YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY--S 444 ++ A+D PT A I+ P +DG L + D KN TW + + + Sbjct: 323 KKISQIAGAIDILPTLTSLAGINQPTIKSIDGKDLKTLIAD-------KNPTWESRHIVN 375 Query: 445 HWFDEENI 452 HW + +I Sbjct: 376 HWRGKTSI 383 >UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LDP6_PARD8 Length = 452 Score = 161 bits (407), Expect = 8e-38, Method: Compositional matrix adjust. Identities = 122/402 (30%), Positives = 179/402 (44%), Gaps = 86/402 (21%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 +S KPNIIV+ DD+GYG L F T++ TP Sbjct: 20 HSQPTKPNIIVINCDDMGYGDL----SCFGSPTIK----------------------TPN 53 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------DAQDGIPLTET 165 + + EG ++++ YV+ VS PSRA ++TGR R G+Y + D++ G+P E Sbjct: 54 IDRMAIEGQKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEEL 113 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 + EL + GY+TA +GKWHL + E+ P GFDYF Sbjct: 114 TIAELLKQAGYHTACIGKWHLGHLP-----------------------EYMPLRHGFDYF 150 Query: 226 MGF------------HAAGTAYYNSPSLF---KNRERVPAKGYISDQLTDEAIGVVDRAK 270 G+ T Y ++ K ER P + ++ Q+T+ AI + + Sbjct: 151 YGYPYSNDMSRKEQIKLGNTKYPYEYIIYEQEKELEREPQQYNLTQQVTEAAIRYIKSNE 210 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 + PF LYLA+ PH+P Y G Y +V +D V +IL+ LK Sbjct: 211 --NSPFFLYLAHPMPHMP-------VYASTDFQGKSARGKYGDTVEELDWSVGQILQTLK 261 Query: 331 KNGQYDNTIILFTSDNGAVI----DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 G NT+++FTSDNG + +G P G K K+ + GG P MW ++P Sbjct: 262 SEGLDKNTLVIFTSDNGPWLLCKQEGGSP--GPLKDGKASMFEGGFRVPCIMW-GAMVKP 318 Query: 387 GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 G + S +D PT + A I +P D DG+SLL L+DK Sbjct: 319 GYITDMASTLDLLPTFCEIAGIPLPSDRHYDGISLLNVLKDK 360 >UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY9_9BACT Length = 590 Score = 160 bits (406), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 115/376 (30%), Positives = 173/376 (46%), Gaps = 59/376 (15%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ DD GYG D + NR + TP L L Sbjct: 25 KPNIVLILTDDQGYG---------DISSHGNRMI-----------------DTPHLDQLA 58 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 ++G RF N +V++ V P+RA+++TGR R GV + + + E + E+F+ GY Sbjct: 59 EDGTRFENFFVSN-VCAPTRASLLTGRYHIRTGVVQVSRGLEIMRSEEATIAEVFKAQGY 117 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 T GKWH Y +N P +GFD + GF A + Sbjct: 118 ETGLFGKWH-------------NGEHYPNN----------PPGQGFDEYFGFCAGHIGDF 154 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 +L N+ V KG+I+D LTD AI +++ + D+PF Y+ YNAPH P D+ Sbjct: 155 FDATLDHNKTFVKTKGFITDVLTDRAIDWIEKQQ--DKPFFAYIPYNAPHAPYQ--VEDK 210 Query: 297 YQKQFNTGSQTADN--YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL 354 Y +F +A + Y + ++D + R+L+ L DNTI++F +DNG + P Sbjct: 211 YYDEFAAKGYSAAHSAAYGMIENLDDNIGRLLKILDDLNLTDNTIVIFLTDNGP--NSPT 268 Query: 355 PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKD 413 NG KG K GG P F+ W GK+ G L + +D PT ++ A +++ Sbjct: 269 RFNGGMKGSKGSVDEGGVRVPFFIRWPGKIAKGRTIHDLAAHIDVLPTLMELAGVNVDLP 328 Query: 414 LKLDGVSLLPWLQDKK 429 KLDG SL + K Sbjct: 329 NKLDGRSLTSLISSSK 344 >UniRef50_A6C1Q0 N-acetylgalactosamine 6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1Q0_9PLAN Length = 469 Score = 160 bits (405), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 109/348 (31%), Positives = 162/348 (46%), Gaps = 55/348 (15%) Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV---YSNTDAQDGIPL 162 Q TP + + +G FTN +VA V PSRA ++GR P + S+ +AQ+G L Sbjct: 52 QIHTPHMDQIGKQGAVFTNAFVATPVCSPSRATFLSGRFPTELKITDWISSEEAQEGAGL 111 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 T PE+ Q HGY TA +GKWHL +++ ++ P +GF Sbjct: 112 TAMTWPEVLQQHGYQTALIGKWHLGELN-----------------------QFHPHEKGF 148 Query: 223 DYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 +FMGF A GT N P+L E KG + D L D+AI + +K D+PF L L + Sbjct: 149 GHFMGFLAGGTRPMN-PTLEIKGETQKRKGSLPDLLVDDAINFIRTSK--DKPFALCLHF 205 Query: 283 NAPHLPNDNPAPDQYQKQFNTGS---------------QTADNYYASVYSVDQGVKRILE 327 APH P P P+Q + Q YYASV SVD+ + R+L+ Sbjct: 206 RAPHTPY-GPVPEQDSAHYEGMKIDVPITPGVIPEQIRQKNKEYYASVSSVDRNIGRLLK 264 Query: 328 QLKKNGQYDNTIILFTSDNG---------AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 +L + +NT+++FTSD+G +G G + + P+ M Sbjct: 265 ELDQLRLAENTLVIFTSDHGYNNGRHGVSTKGNGHWIAGGVTGPKRPNMWDTSIRVPLVM 324 Query: 379 WWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 W ++PG +D+++S +D + L A I P +LKL G+ P L Sbjct: 325 RWPAVIKPGTQFDEIVSNIDMFKFVLGALKIPQPANLKLHGIDYSPLL 372 >UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J5W3_9FLAO Length = 468 Score = 159 bits (402), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 125/417 (29%), Positives = 184/417 (44%), Gaps = 80/417 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+ + DD+GY +L GS+ K +E TP + L Sbjct: 28 KPNIVFILADDMGYNEL----GSYGGKIIE----------------------TPNIDQLA 61 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT----DAQDGIPLTETFLPELFQ 172 EG++F+N Y + PSR +MTG+ + N + + IP +E + E+ + Sbjct: 62 KEGMKFSNHYCGSNICAPSRGTLMTGKHTGHAYIRDNKPLPYEGNEPIPASEITVAEILK 121 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY T A GKW L + A E P N+GFD F G++ Sbjct: 122 TAGYTTGAFGKWGLG----------------------YPASEGSPNNQGFDQFYGYNGQI 159 Query: 233 TAYYNSPSLFKNRERV--------PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 A+ S + + V P Y +D + D A+ V+ K + PF LY Sbjct: 160 HAHNYFTSYLRKNDLVELNANIDAPYSVYSADIIKDRALEFVEVNK--NNPFFLYFCPTL 217 Query: 285 PHLPNDNP---APDQYQKQ--FNTGSQTADNY----YASVYS-VDQGVKRILEQLKKNGQ 334 PH P P + Y K+ F G ++ + YA++ S +DQ V I+ +LK+ Sbjct: 218 PHNPYHQPDDKTLEYYAKKTGFPIGDAHSEEFSVPKYAALSSRLDQQVGEIMAKLKELNL 277 Query: 335 YDNTIILFTSDNGAVI----DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD 390 DNT+I+F SDNG+ + D L G +G KS+ Y GG +P+ +WKGK+ PG+ Sbjct: 278 LDNTLIIFASDNGSALTKEEDSYLRTGGDLRGRKSEVYEGGIKSPLIAFWKGKIIPGSSS 337 Query: 391 KLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP-HKNLTWITSYSH 445 ISA DF PT + P + +DG+S LP L K + H L W S S Sbjct: 338 NHISAFWDFLPTCAEIVKAKTPDN--IDGISYLPTLLGKTDNQKQHDYLYWERSQSQ 392 >UniRef50_UPI0001968B7D hypothetical protein BACCELL_01446 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968B7D Length = 450 Score = 159 bits (401), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 117/407 (28%), Positives = 181/407 (44%), Gaps = 76/407 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN++ + +DDLG+ + ++ + +TP + L Sbjct: 30 KPNVLFIAVDDLGWSDVGYNGSTL--------------------------VATPNIDRLA 63 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPELFQNHG 175 GV FT+GYV +SGPSR +++G ++G+ N D + +P LPE + G Sbjct: 64 SMGVSFTDGYVTAPISGPSRNGMVSGMYSQKYGMQINADLKFAQVPAQHKTLPETMREAG 123 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y TA VGKWH+ + +NV E D N+ S T Sbjct: 124 YRTALVGKWHVCRDANVVFDEVYDRIDISSNYFPDS---------------------TGV 162 Query: 236 YNSPSL--FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA 293 Y+ P L +E Y++D+LT AI +++ ++ PF LYL YNA H NP Sbjct: 163 YDGPRLPILAVKESTYENEYMTDKLTSHAIQFMNK-QSNATPFFLYLGYNAVH----NPW 217 Query: 294 PDQYQKQFNTGSQTADNYY----ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 + +K ++ S D Y + + D+ + +L L+ DNTII+F SDNG Sbjct: 218 QAE-KKYYDRLSNIKDEYMRVQASLIACADENIGILLNYLENRKLIDNTIIVFVSDNGPA 276 Query: 350 IDGP---------------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLI 393 GP +G+K Y GG TPM + +K L+ G + +I Sbjct: 277 KGGPELKTWEGYDPSFEYVFGQMKTLRGHKVDLYEGGIRTPMIIAYKPLLREGKVFRDMI 336 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 S +DFYPT + IP+ LDGVSL+ +L+++ + +PH L W Sbjct: 337 STLDFYPTICEMTGSEIPEGTTLDGVSLVRYLREETKQQPHDMLFWC 383 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 158 bits (400), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 141/493 (28%), Positives = 211/493 (42%), Gaps = 101/493 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII++ +DDLG+ + ++ + D ++ TP + +L Sbjct: 39 KPNIIMIVIDDLGWSDVGYN------------QTTDYFE-------------TPNIDALA 73 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLP-------- 168 +G+ F Y PSRA +M+G+ R GVY+ + + G T +P Sbjct: 74 QQGLVFDQAYAGAANCAPSRAVLMSGQYGPRHGVYTVSPSDRGHAKTRKLIPIKNKRGLT 133 Query: 169 -------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + GY T GKWHL P +G Sbjct: 134 TDIITIGESLKTAGYTTGTFGKWHLGA---------------------------DPDKQG 166 Query: 222 FDY-FMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 FD G H T +Y SP N E P Y++++LT E I V +K DQPF Y+ Sbjct: 167 FDVNVAGSHQGMTFHYFSPYQLPNIEDGPKGEYLTERLTTEVIDWVKSSK--DQPFFAYV 224 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTG--SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 Y H P D+ K G S+ Y A V +D V RI + L G +NT Sbjct: 225 PYYTVHTPY-QAVVDKVNKYHEKGIKSKREATYAAMVEHMDDNVGRIFDMLDSEGLAENT 283 Query: 339 IILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISA 395 +++FTSDNG + P PL G + Y Y GG P+ + W K++PG ++ +I+A Sbjct: 284 VVIFTSDNGGYRMSSFPTPLRGGKGSY----YDGGLRVPLIVRWPEKVKPGLDHTPVINA 339 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 DFYPT ++ P + LDGV L L +Q ++L W + P + Sbjct: 340 -DFYPTLVNLTKSKQPNQV-LDGVDLTAHLLG-QQDIAERDLFW-----------HFPVY 385 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDN 514 H H D +P ++ +R+ D+ L+ ENN+ LY L DL +K+N Sbjct: 386 LQAH----HAPTDQGQDPLFR--TRPGSAIRSGDWKLLQYFENNEFELYNLANDLAEKNN 439 Query: 515 LAAANPQVVKEMQ 527 LA+ +P VKE++ Sbjct: 440 LASVHPSRVKELK 452 >UniRef50_B1KD86 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD86_SHEWM Length = 484 Score = 158 bits (399), Expect = 6e-37, Method: Compositional matrix adjust. Identities = 141/536 (26%), Positives = 240/536 (44%), Gaps = 107/536 (19%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 +TK N +FS E K + N++++ +DDLG ++DT Sbjct: 18 STKLNASFSPLKKEESKLK-QANVVIIYVDDLG--------------------IMDTGIY 56 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA 156 G +AQ TP + L + GVRFT Y PSRA++MTG PA G+ + + Sbjct: 57 G------SAQYPTPNIDKLANSGVRFTQAYANAANCAPSRASLMTGLTPAEHGILTVGSS 110 Query: 157 QDG-------IPLTE--------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 + G IP+T T + +LF+ GY TA +GKWHL K + D Sbjct: 111 ERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYATAVIGKWHLGKTAPTEYGFDTAIA 170 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDE 261 H + P ++G +G G K+ Y+S+++T E Sbjct: 171 ASH---LGHPPSYFYPYSKGKRKLIGLEEGG---------LKDE-------YLSNRITRE 211 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD----NYYASVYS 317 A+ + + QPF LYL + A H P + AP ++ Q N Q + Y A + + Sbjct: 212 AVNYISSQR---QPFFLYLPFYAVHTPIE--APKEWVNQHNARQQAGEIKSAAYAAMIAN 266 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 +D+ V ++L+ L K+GQ +NT+++F SDNGA P + +GYKS + GG P+ Sbjct: 267 LDRDVGKLLQALDKSGQRENTLVVFASDNGAY--DPATSSLPYRGYKSSLFEGGIKIPLV 324 Query: 378 MWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 + W ++ P + ++ M D + L + PK L L ++ L ++ + +P + Sbjct: 325 LSWPKQIPPNSQNRTPVQMSDLF---LGIKHLLQPK-LALHRQDIIS-LAEQGKEQPERP 379 Query: 437 LTW-----ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 L W I ++ + + N P+W H P + +R Y Sbjct: 380 LYWHAPIYIDQFAPYRGQPNHPYWK--------------HTP--------AAAIRLGHYK 417 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL-SEVN 545 L+++ E + L+ L D Q+K+NL NP++ +++ ++++ +S P+ SE+N Sbjct: 418 LIHSYETGKQLLFDLDKDSQEKNNLVNQNPEIREKLFKALQQWQESVNAPMVSELN 473 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 157 bits (397), Expect = 9e-37, Method: Compositional matrix adjust. Identities = 129/437 (29%), Positives = 186/437 (42%), Gaps = 104/437 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+IV DDLGYG+L G + K ++ TP L L Sbjct: 17 RPNVIVFLADDLGYGEL----GCYGQKKIK----------------------TPNLDQLA 50 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-------------AQDG---- 159 +G+RFT+ Y H V PSR ++TG+ V N++ A DG Sbjct: 51 ADGMRFTDFYSGHAVCAPSRCVMLTGKHTGHSFVRENSEGRAAQAKERNRIKAADGYLPQ 110 Query: 160 --IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 +P +E Q GY TA VGKW L SN E P Sbjct: 111 IALPASEATYASALQKSGYRTACVGKWGLGHPSN----------------------EGSP 148 Query: 218 QNRGFDYFMGFHAAGTAYYNSPS-LFKNRERVPAKG--------YISDQLTDEAIGVVDR 268 GFD F G+ + A+Y P+ L++N + P +G Y +D + EA+ ++ Sbjct: 149 NKHGFDLFYGYISQWQAHYYYPTYLWRNDVKEPLEGNDGKVGRQYAADLMEQEALKFME- 207 Query: 269 AKTLDQPFMLYLAYNAPHL----PNDNPAPDQYQKQFNTGSQTADN-------------Y 311 T PF LY A PH+ P D P+ +Y++ F D Y Sbjct: 208 -TTGGGPFFLYYATPVPHVSLQVPPDEPSLAEYKQAFAGQDPPYDGRKSYLPTEDPRAIY 266 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQ 366 A V +D+ + + + LK+ GQ NT+I+FTSDNGA +G N +G K+Q Sbjct: 267 AAMVTRMDRTLGKFRDLLKRTGQDQNTLIIFTSDNGATFNGGYDREFFGGNQPLRGMKTQ 326 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWL 425 + GG TP W G +QPG + + A D +PT + +P LDGVS+LP L Sbjct: 327 LWDGGIRTPFIAAWPGSIQPGQVSRFVGASWDLFPTFAEIVGFPVPAG--LDGVSILPTL 384 Query: 426 QDKKQGEP-HKNLTWIT 441 + + + H +L W T Sbjct: 385 KGEVATQKQHDHLYWET 401 >UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q6_9PLAN Length = 574 Score = 157 bits (397), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 112/406 (27%), Positives = 178/406 (43%), Gaps = 65/406 (16%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V++S + +PN+IV+ DD GYG + F +G+ Sbjct: 19 VSYSFGCEGTLCAESRPNVIVILTDDQGYGDVGF-RGNL--------------------- 56 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 + +TP L + ++ + T Y + V P+RA+++TGR R GV + + Sbjct: 57 ----KINTPHLDRMAEKSIELTRFYCSP-VCAPTRASLLTGRNYYRTGVIHTSRGGAKMQ 111 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + EL Q GY T GKWHL DN+ +PQ++G Sbjct: 112 GEEVTVAELLQQAGYQTGIFGKWHLG-----------------DNYPM------RPQDQG 148 Query: 222 FDYFMGFHAAGTAY-------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQ 274 F + + G Y P L+KN + GY +D D A+ +DR ++ Sbjct: 149 FAESLIHKSGGIGQSPDQPNSYFHPKLWKNGVAFQSTGYCTDVFFDAALDFIDRQTKTEK 208 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS--QTADNYYASVYSVDQGVKRILEQLKKN 332 PF +YLA NAPH P + + Y K + +T Y + ++D+ + ++L L+++ Sbjct: 209 PFFVYLATNAPHTPLE--IAESYWKPYQRQGLDETTARVYGMITNLDENIGKLLSHLERS 266 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDK 391 + T++LF DNG G +G KS TY GG P W G + G D+ Sbjct: 267 ALAEKTVVLFLGDNGPQQK---RYTGGLRGRKSWTYEGGIRVPCLAQWPGHFREGEKIDQ 323 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 + + +D PT L + P+ LKLDGV L P L +K+ P ++L Sbjct: 324 IAAHIDLMPTLLALTETRCPESLKLDGVDLSPLLTGRKEKLPARSL 369 >UniRef50_A6DNJ0 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNJ0_9BACT Length = 630 Score = 157 bits (397), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 150/489 (30%), Positives = 220/489 (44%), Gaps = 87/489 (17%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII + DDLGYG L S++P N E +A TPTL S+ Sbjct: 25 PNIIFMLADDLGYGDL----SSYNP----NAE---------GEAPNNTPIRTPTLDSMAK 67 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--DAQDGI-----PLTETFLPEL 170 GVR+T+ + A + P+R A++T R P+R G ++ + DG+ P +L E Sbjct: 68 NGVRYTDFHSAAPICSPARRALLTARYPSRLGEWAEAYRGSPDGVVAKNDPTIAMWLKEA 127 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 GY TAA GKW++ + +V P ++W + YF Sbjct: 128 ----GYATAAYGKWNIGESKDVSWP------------GAHGFDDWLIIDHNTGYFQ-HKN 170 Query: 231 AGTAYYNSPSLFK-NRERVP--AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 A P LF+ ERV Y++D TD+AI + K DQPF +YL ++ PH Sbjct: 171 ANKDCEGRPMLFETGGERVTNLEGQYLTDIWTDKAIDFIQETK--DQPFFIYLPWSIPHT 228 Query: 288 PNDNPAPDQYQKQFNTGS-----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 P +PA D F+ G+ + + Y V +D + RI + LK+ G+YDNT+I+F Sbjct: 229 PLQDPASDP-SLAFDAGAKPKTVEGREVYVKMVEYLDSHIARIFKSLKEQGKYDNTLIIF 287 Query: 343 TSDNGAVIDGPL-PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYP 400 TSDNG ++ PL K K GG P M W K++ G D+ + MD Sbjct: 288 TSDNGGMVSANCWPL----KKTKQHLEEGGIRVPFLMQWPSKIKAGTVDQRAAIMMDASV 343 Query: 401 TALDAADIS--IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 T L AAD +PKD +LDGV+L E ++ W W NY Sbjct: 344 TVLAAADAMKYVPKDRELDGVNLFA------NKEENREFGWRRRDWGWQ--------GNY 389 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAA 517 + ++S D+ + + + Y + N +S Y E LYKL+ DL +K+NL Sbjct: 390 LRQEAYRSGDW------KLIRSYQY-LGNKKWSAEYKEE-----LYKLSDDLGEKNNLKK 437 Query: 518 ANPQVVKEM 526 + P+ EM Sbjct: 438 SMPEKHAEM 446 >UniRef50_C3Q8V4 Arylsulfatase B n=6 Tax=Bacteroides RepID=C3Q8V4_9BACE Length = 498 Score = 157 bits (396), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 124/390 (31%), Positives = 178/390 (45%), Gaps = 70/390 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+++ DDLG+G + F ++ TP+L +L+ Sbjct: 65 RPNIVIVLADDLGWGDVGF---------------------------HGSEIKTPSLDALV 97 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---AQDGIPLTETFLPELFQN 173 EGV Y + +S P+RA +MTGR P RFGV S +DG+ E + ++ Sbjct: 98 GEGVELERFYTSP-ISTPTRAGLMTGRYPNRFGVRSAVIPPWREDGLDENEETMADMLAR 156 Query: 174 HGYYT-AAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 +GY A +GKWHL T+ H P NRGF +F G Sbjct: 157 NGYKNRAIIGKWHLG-----------HTKKVH-----------YPMNRGFSHFYGHLNGA 194 Query: 233 TAYYN-----SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 Y++ + E KGY ++ +T EAI +D A + PFMLY+AYNAPH Sbjct: 195 IDYFDLTREGELDWHNDWETCHDKGYSTELITQEAIRCID-AYEKEGPFMLYVAYNAPHT 253 Query: 288 PNDNPAPD--QYQKQFNT---GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 P D Y F++ Q Y A V +D+G+ I++ LKK G DNT +F Sbjct: 254 PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIF 313 Query: 343 TSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW-KGKLQPGNYDKLISA-MDFYP 400 SDNG P +G +G+K + GG H P ++W K + Q N ++ +D P Sbjct: 314 FSDNGTA-GVPGSSSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVP 372 Query: 401 TALD-AADISIPKDLKLDGVSLLPWLQDKK 429 T D D S PK + DG+S+LP L KK Sbjct: 373 TLKDLVGDHSRPKR-EYDGISILPVLNGKK 401 >UniRef50_A4AVA7 Aryl-sulphate sulphohydrolase n=2 Tax=Bacteroidetes RepID=A4AVA7_9FLAO Length = 487 Score = 157 bits (396), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 142/510 (27%), Positives = 224/510 (43%), Gaps = 104/510 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ +DDLGY + F + + TP + L Sbjct: 47 KPNIVLINIDDLGYKDVGF--------------------------MGSEYYETPNIDILA 80 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-------IPLTET---- 165 G+ FTNGY A PSRA++MTG+ R G+Y+ ++ G IP T T Sbjct: 81 KAGMIFTNGYAAASNCAPSRASLMTGKWTPRHGIYTVNSSERGKSKDRKIIPSTNTSTLS 140 Query: 166 ----FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 LPE+ Q + Y T GKWHLS+ P + G Sbjct: 141 KESMVLPEVLQLNNYKTIHAGKWHLSE---------------------------SPLDYG 173 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 FD +G G P + R P K Y++D + + I V++ KT+ +PF L A Sbjct: 174 FDINIGGGHNGHPKSYYPPYGNVKLRSPNKEYLTDLIARQTIEVLN--KTI-EPFFLNYA 230 Query: 282 YNAPHLPND--NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 A H P + +Y ++ Q Y V ++D+ + ++ LK NG Y NT+ Sbjct: 231 PYAVHTPIQPVDSILSKYNRKTAWKGQNNAKYATMVENLDRNIGLLIAALKDNGHYKNTL 290 Query: 340 ILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAM 396 I+FTSDNG + I PL + Y Y GG P F W K++ + IS + Sbjct: 291 IIFTSDNGGLYGITKQQPLRAGKGSY----YEGGIREPFFFMWNDKIKSNTKSNVPISHL 346 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D +P+ ++AA IS + LDG SLLP L+ ++ + + L W + P + Sbjct: 347 DLFPSIVEAAGISY-NETSLDGNSLLPILK-QESTKLKRPLFW-----------HFPIYL 393 Query: 457 NYHKFVRHQSDDYPHNPNTEDL--SQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKD 513 + +Q+D N N + L ++ +R D+ L Y ENN++ LY LT D+ +++ Sbjct: 394 EAY----NQND----NENRDSLFRTRPGSVIREGDWKLHYYFENNEMELYNLTYDVGERN 445 Query: 514 NLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 NL +P+ K + ++ + + P+ E Sbjct: 446 NLINTHPKKAKVLLQQLKAWWKETSAPIPE 475 >UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CBC Length = 496 Score = 156 bits (395), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 124/403 (30%), Positives = 173/403 (42%), Gaps = 76/403 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ DD GYG G++ + TP + L Sbjct: 36 KPNIVIILADDQGYG-------------------------GVNCYPHIKKIVTPNIDKLA 70 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-NTDAQDGIPLTETFLPELFQNHG 175 GV+ GY + +S P+RA +MTG+ FG Y +T GIP + L E +G Sbjct: 71 ASGVQCMQGYTSGHLSSPTRAGLMTGKYQQSFGFYGLSTPHVGGIPQDQKLLSEYLVENG 130 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y TA +GKWHL DY + P NRGF F GF Y Sbjct: 131 YNTACIGKWHLG--------------DYIRS---------HPNNRGFQTFFGFINGLHDY 167 Query: 236 YNS-------------PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 Y+ N E V Y + + T A+ + K D PF LYL Y Sbjct: 168 YDPLVGGSWDGVYNGLAFTLDNMEPVTEMEYSTYEYTKRAVDFIQ--KNADHPFFLYLPY 225 Query: 283 NAPHLPNDNPAPDQYQKQFNTGSQTA---DNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 NA H P AP++ + Q D A +++DQGV +++E L++ G DNTI Sbjct: 226 NAIHSPLQ--APEELIGELAINPQEIGKDDIARAMTFALDQGVGKVVETLEQLGLRDNTI 283 Query: 340 ILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDF 398 I + SDNGAV +G K Y GG P + + KL G Y+K + ++D Sbjct: 284 IFYLSDNGAV---EYSDKWEFRGRKGSYYEGGIRVPFIVSYPAKLAKGTIYNKPVMSIDI 340 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 PT ++ A +S + GV+LLP+L K + EPH L W T Sbjct: 341 APTVMELAGLS---HADMHGVNLLPYLSGKDRTEPHDVLYWST 380 >UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA5_RHOBA Length = 562 Score = 156 bits (395), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 145/559 (25%), Positives = 233/559 (41%), Gaps = 107/559 (19%) Query: 10 VSTSISLILASGMAAFAAHAAD-----DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLT 64 + T+ SL L+ A F H + D A V FT E +PNII+L Sbjct: 72 IRTNESLTLSLTHATFHPHTPNMKHCIDSLAIAIVAVVFLGSFT--EAHADDRPNIILLL 129 Query: 65 MDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTN 124 DDLGYG L + TP L L EG++ Sbjct: 130 ADDLGYGDL--------------------------SCFGSPAVKTPHLDRLASEGLKCNR 163 Query: 125 GYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPELFQNHGYYTAAVGK 183 Y V P+RA+++TGR P RFG+ + + ++G +P + T + EL ++ GY TA +GK Sbjct: 164 FYAGSAVCSPTRASVLTGRYPLRFGITKHFNDRNGWLPESATTVAELLKDAGYNTAHIGK 223 Query: 184 WHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF------------MG---- 227 WHL + +V P + T + P+ GFD++ MG Sbjct: 224 WHLGGL-HVDEPGKRLT------------NQPGPRQHGFDFYQTQIEQQPLRGQMGRDKT 270 Query: 228 -FHAAGTAYYNSPSLFKNRERVPA-----KGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 F GT L +N +R+ + +D D A+ ++++ + + PF + + Sbjct: 271 LFRKGGTV------LLRNDQRISQDDPYYHKHFTDANGDFAVEMIEKLSSEEDPFFINMW 324 Query: 282 YNAPHLPND-NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 + PH P + P P + + + + V +D V IL +L + DNT++ Sbjct: 325 WLVPHKPYEPAPEPHWSDTAADDITDDQHRFRSMVQHMDAKVGAILRKLDELKIADNTLV 384 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFY 399 LFTSDNGA +G KG K++ + GG PM + W + G + S D Sbjct: 385 LFTSDNGAAFEG---FIHDLKGGKTELHDGGIRVPMIVRWPDAIPAGQTSQTFSHTNDLL 441 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW--DN 457 PT DAA + +P DL LDG+SL L K G P + E FW D Sbjct: 442 PTFCDAASVQLPSDLPLDGLSL---LSHWKGGTPPSQV-----------ERGTVFWQLDL 487 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLA 516 Y RH P+ + V ++ L+ + + L+ + D +K N+ Sbjct: 488 YKSLQRHYPKPKPYA---------TEVVMRGNWKLL-AFKGKPVELFDVGADPNEKRNVL 537 Query: 517 AANPQVVKEMQGVVREFID 535 A +P++V + ++++++ Sbjct: 538 AEHPELVASLSAQLKDWLN 556 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 156 bits (394), Expect = 2e-36, Method: Compositional matrix adjust. Identities = 144/533 (27%), Positives = 219/533 (41%), Gaps = 128/533 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN IV+ DD GYG L A TP + + Sbjct: 570 KPNFIVILTDDQGYGDL--------------------------SCFGAKHVDTPRIDQMA 603 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--------FGVYSNTDAQDGIPLTETFLP 168 EG R T+ YVA V PSRA +MTG P R FGV D + G+ E + Sbjct: 604 AEGSRLTSFYVAAPVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPK-GLHPDEITIA 662 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG- 227 E+ + GY T GKWHL E+ P +GFD F G Sbjct: 663 EVLKTAGYRTGMFGKWHLG-----------------------DQPEFLPTKQGFDEFFGI 699 Query: 228 --------FHAAGTAYYNSP-SLFKNR---ERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 FH Y+ P L +N E P +++ +LT++A+ ++R K DQP Sbjct: 700 PYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDADFLTKRLTEQAVSFIERNK--DQP 757 Query: 276 FMLYLAYNAPHLP------------NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVK 323 F LYL + PH P +D A + + + A+ + ++ +D V Sbjct: 758 FFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYATRANLFRQAIAEIDWSVG 817 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 +IL+ L+ NG + T++LFTSDNG + G +G+K T+ GG P + W G+ Sbjct: 818 QILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGELRGHKGTTFEGGMREPTVVRWPGQ 877 Query: 384 LQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 + G+ D+L++AMD PT A +IP D +DG + P L+ + Q PH Sbjct: 878 IPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIWPTLKGETQ-TPHDAF----- 931 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG 502 F+ ++ +S ++ V N +Y +EN Sbjct: 932 -----------FYHRGNQLAAVRS------------GKWKLHVNNGVAKQLYDLEN---- 964 Query: 503 LYKLTDLQQKDNLAAANPQVVKEMQGVVREF----IDSSQPPLSEVNQEKFNN 551 DL +K N+ NP+VVK++Q +++F +S+P N + +N Sbjct: 965 -----DLGEKVNVIETNPEVVKKLQHQLKDFAADIASNSRPAAFNANPKSLSN 1012 Score = 124 bits (312), Expect = 8e-27, Method: Compositional matrix adjust. Identities = 123/441 (27%), Positives = 186/441 (42%), Gaps = 120/441 (27%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T + + PN++++ +DDLGYG L G + A + ST Sbjct: 32 TSVAAERPPNVVLIFVDDLGYGDL----GCYG----------------------ATKLST 65 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF----GVYSNTDAQDGIPL--- 162 P + L EG RFT+ + A V PSR ++TG+ P R G++ G+ + Sbjct: 66 PNIDRLAAEGRRFTDAHSASAVCTPSRYGLLTGQYPVRAMGGQGIWGPLPTTSGLIIDTN 125 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE--EWQ---- 216 T+T + ++F+N GY TA +GKWHL F E +WQ Sbjct: 126 TKT-IGKVFKNKGYATACLGKWHLG----------------------FKEEPCDWQVPLR 162 Query: 217 --PQNRGFDYFMGFHAAGTA----YYNSPSLF---------------------------K 243 PQ+ GFD++ G + Y N S+F K Sbjct: 163 PGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTPMFPEEASVK 222 Query: 244 NRERVPAKGYISDQLTDEAIGVV--DRA-----KTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 + R + DE G + +RA + ++PF LY A H P PAP Sbjct: 223 SPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEKKNEPFFLYFATPNIHHPF-TPAP-- 279 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID----- 351 +F SQ Y V+ +D V I++ L+ NG DNT++LFTSDNGA+++ Sbjct: 280 ---RFKGTSQCG-LYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRD 335 Query: 352 ----GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAA 406 G P NG G+K + GG P+ W GK++ G D+LIS +D + T Sbjct: 336 AIKAGHQP-NGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALT 394 Query: 407 DISIPKDLKLDGVSLLPWLQD 427 + +P + D +++LP L D Sbjct: 395 EQEMPSSEQKDSINMLPALLD 415 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 155 bits (391), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 112/403 (27%), Positives = 175/403 (43%), Gaps = 67/403 (16%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 TN + + + +PNI+V+ DDLGYG L V+ Sbjct: 17 TNESLAAEPTASVKSPARPNIMVVLCDDLGYGDL----------ACYGHPVI-------- 58 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 +P + EG++ T+ Y AH PSRA +MTGR P R G+Y + Sbjct: 59 --------QSPNIDRFAKEGLKLTSCYAAHPNCSPSRAGLMTGRTPFRVGIY------NW 104 Query: 160 IPL--------TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 IP+ E + L + GY T VGKWHL+ + N+ Sbjct: 105 IPMLSPMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNM------------------- 145 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERV--PAKGYISDQLTDEAIGVVDRA 269 + QP + GFD++ + +P F R P +G+ S + DEA + + Sbjct: 146 VGQPQPSDHGFDHWFSTQNNALPTHENPFNFVRNARPVGPLQGFASQLVADEAEEWLTQL 205 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT-GSQTADNYYASVYSVDQGVKRILEQ 328 + ++PF +++ ++ PH P + ++++K + T ++ +V +D RIL+ Sbjct: 206 RDKEKPFFMFVCFHEPHEPIA--SAERFRKLYTAPEGSTLPAHHGNVTQMDDAFGRILKT 263 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLP--LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 L +NT+I+FTSDNG I P +G + K TY GG P + W +QP Sbjct: 264 LDDQKLRENTLIIFTSDNGPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQP 323 Query: 387 GNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 G D + +D PT ADI P D LDG ++LP L+ K Sbjct: 324 GTTSDVPVCGVDILPTLCAVADIPAPTDRVLDGTNILPLLEGK 366 >UniRef50_Q7UHJ6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UHJ6_RHOBA Length = 500 Score = 155 bits (391), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 134/482 (27%), Positives = 198/482 (41%), Gaps = 98/482 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN +V DD+G+G D++ G + TP L L Sbjct: 72 RPNFVVFVADDMGWG--------------------DSHTYGHELI------QTPNLDRLA 105 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL--TETFLPELFQNH 174 +GV+FT Y A GV PSR+AI+TGR P R GVY + L +E PEL + Sbjct: 106 SQGVKFTQCYSACGVCSPSRSAILTGRTPYRNGVYRHLSGNHEAHLRASEITFPELLKEV 165 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY T VGKWHL PE P GFD++M + Sbjct: 166 GYETCHVGKWHLLSRQQFNNPEFP-----------------HPGEHGFDHWMCTQNNASP 208 Query: 235 YYNSPSLF-KNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP 292 + +P F +N E V +GY + + EA + +PF + + + PH P Sbjct: 209 SHQNPDNFVRNGEPVGQLEGYSAQLVASEAARWLKDIHDPSKPFAMTVWVHEPHSPIATD 268 Query: 293 APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 + ++Q +N Y ++ +D + +++ L DNT++ FTSDN G Sbjct: 269 S--RFQSLYN--GHENSKYMGNITQMDHALGMVMDALDAQEVTDNTLLFFTSDN-----G 319 Query: 353 PLPL----NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAAD 407 P+P +G +G K + GG P W G +QPG D + D + T LD A Sbjct: 320 PVPAFGGSSGGLRGNKRSDHEGGIRVPGVARWPGHIQPGTISDTPVIGTDVFATVLDIAG 379 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP-FWDNYHKFVRHQS 466 I +P D +DGVS+LP + K E + P FW Sbjct: 380 IPLPTDRTIDGVSMLPAFEGKPV------------------ERSTPLFWRT--------- 412 Query: 467 DDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKE 525 H ED +R D+ LV + LY++ D +++ +LAAA P+ KE Sbjct: 413 ----HVSPPED----RVALRIGDWKLVGDETLTKFQLYEIQKDWKEEHDLAAAMPEKTKE 464 Query: 526 MQ 527 M+ Sbjct: 465 MK 466 >UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 Length = 495 Score = 155 bits (391), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 124/468 (26%), Positives = 203/468 (43%), Gaps = 89/468 (19%) Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTD 155 D + TP L L G R Y + + PSRAA++TGR P R+G+ + + Sbjct: 62 DVGFHGSDIRTPNLDQLAKTGARLEQ-YYSQPMCTPSRAALLTGRYPHRYGLQTLVIPSA 120 Query: 156 AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 + G+P E LP+ + GY TA VGKWHL + ++ Sbjct: 121 GKYGLPTDEYLLPQALKEAGYETAIVGKWHLGH----------------------ADPKY 158 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPS-----LFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 P+ RGFDY G Y+ + ++N + + +GY++ L +A+ ++++ Sbjct: 159 WPRQRGFDYQYGPLLGEIDYFTHSAHGKVDWYRNNQLIKEEGYVTTLLGQDAVKLIEKHN 218 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYAS-VYSVDQGVKRILEQL 329 P LYLA+ APH P AP +Y Q+ T + YA+ + ++D + +++ L Sbjct: 219 P-KTPLFLYLAFTAPHAPYQ--APQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAAL 275 Query: 330 KKNGQYDNTIILFTSDNGA-----------VIDGPLPL-NGAQKGYKSQTYPGGTHTPMF 377 +K G +NT+I+F SDNG G +P NG + K+ Y GGT Sbjct: 276 EKRGMRNNTLIVFQSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVAL 335 Query: 378 MWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 W GK+QPG + I +D YPT A +S+ K+ LDG+++ P L + K Sbjct: 336 ANWPGKIQPGTVVNHPIHIVDMYPTLTGLASVSVGKNKPLDGLNIWPALSEAKPS----- 390 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 P + D+ F + D+ LV+ Sbjct: 391 ---------------------------------PRSQVVYDIEPFRAALSQEDWKLVWKA 417 Query: 497 E-NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI-DSSQPPL 541 ++L L+ L+ D+ ++ NLA NP++V ++ + D+ PPL Sbjct: 418 TLPSRLELFNLSQDVSEQTNLAEQNPEIVSRLKQQIEVLSRDAVLPPL 465 >UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM Length = 479 Score = 154 bits (390), Expect = 7e-36, Method: Compositional matrix adjust. Identities = 118/397 (29%), Positives = 177/397 (44%), Gaps = 65/397 (16%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I++ DD+GYG D G++ T+ +P L + Sbjct: 38 PNVIIIFADDMGYG----DIGAYGHPTIR----------------------SPNLDQMAA 71 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------DAQDGIPLTETFLPELF 171 EG+++TN Y A V PSRA ++TGR P R G+ + + G+P TE + + Sbjct: 72 EGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQIRVLFPTSTGGLPTTEITIAKAL 131 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY-HDNFTTFSAEEWQPQNRGFDYFMGFHA 230 + Y TA VGKWHL + Q D+ D + + Y Sbjct: 132 KEKDYRTALVGKWHLGHLPGF------QPLDHGFDEYFGIPYSNDHDLKKELSYIQTITH 185 Query: 231 AGTAYYNSPSLFKNR---ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 A +N P L +NR ER + I+ + T EA+ + K +QPF LYLA++ PH+ Sbjct: 186 AKDGDFNVP-LMQNRSIIERPANQNTITKRYTQEAVSFIK--KNSNQPFFLYLAHSMPHV 242 Query: 288 PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 P A DQ++ GS Y + +D V ++L L + G +NT+++FTSDNG Sbjct: 243 PLF--ASDQFR-----GSSDRGLYGDVIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNG 295 Query: 348 AVIDGPLPLNGAQKGY-------KSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYP 400 P + GA G K +Y GG P WW K++P S +D +P Sbjct: 296 -----PWLIMGAHGGSAGLLKSGKGTSYEGGMREPAIFWWPEKIKPAVAHNTASTLDLFP 350 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 T + A I +P D DG L P + ++K E KN+ Sbjct: 351 TIMSIAGIDMPSDRSYDGYDLSPTMFEQKSNE-RKNI 386 >UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN Length = 499 Score = 154 bits (389), Expect = 8e-36, Method: Compositional matrix adjust. Identities = 145/526 (27%), Positives = 213/526 (40%), Gaps = 103/526 (19%) Query: 27 AHAADDVKLKATKTNVAFSDFTPTEYS-----TKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 AHAA T S TE S +K +PNI+++ DDLGY D G F Sbjct: 16 AHAAMTFVAFVLATTFVISSTAATEESAADAASKRRPNIVLIFCDDLGYA----DIGCFG 71 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 K E TP L L EG++FT+ VA V SRAA++T Sbjct: 72 AKGYE----------------------TPNLNKLASEGMKFTDFQVAAAVCSASRAALLT 109 Query: 142 GRAPARFGVYSNTDAQD--GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 G P R G+ S D GI E + EL QN GY TA GKWHL Sbjct: 110 GCYPQRVGILSALGPSDSIGIAKNELLISELLQNLGYKTACFGKWHLG------------ 157 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGF----------HAAGTAYYNSPSLFKNR--ER 247 +H+ F PQ GF + G A AY P + N+ E Sbjct: 158 ---HHEQFL--------PQQNGFATYFGLPYSNDMWPKHPTAKNAYPPLPLIDGNKTIEL 206 Query: 248 VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 P + ++ T++A+ + ++PF LY+ +N PH+P + + G Sbjct: 207 NPDQTKLTTWYTEKAVKFIHDCG--EKPFFLYVPHNMPHVP-------LFVSEKFAGKTK 257 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLPLNGAQKGYK 364 + + +D V I + L+ G DNT+++FTSDNG + D G ++G K Sbjct: 258 RGLFGDVIAEIDWSVGEITKALEATGNVDNTLVIFTSDNGPWLSYGDHAGSTGGFREG-K 316 Query: 365 SQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 + GG PM + G +QPG DKL S +D +PT +I K+DGVS+ P Sbjct: 317 GTVWEGGHRVPMIAKYPGTIQPGTTCDKLASTIDLFPTIAHYCGATIDPSRKIDGVSIQP 376 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQ--SDDYPHNPNTEDLSQF 481 L+ + + + +W N + VR + +PH Sbjct: 377 LLESVEGAKSSHEFFYY-------------YWGNGLEAVRDERFKLHFPHA-----FRSL 418 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 + T + YT +L L+ L D ++ N+AA +P+V + Sbjct: 419 TGTPGTDGMPNGYTQAKTELALFDLDADPFEQTNIAADHPEVTARL 464 >UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6LEC5_PARD8 Length = 483 Score = 154 bits (389), Expect = 9e-36, Method: Compositional matrix adjust. Identities = 116/395 (29%), Positives = 180/395 (45%), Gaps = 55/395 (13%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII+L DDLGY + + P+ ++ TP L L Sbjct: 31 KPNIIILLADDLGYNDVSCYRNENFPQQSDS----------------FPTSQTPNLDLLA 74 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL--TETFLPELFQNH 174 +G+RFTN Y VS PSRAA+MTGR R GVY+ + + L +E + E+ + Sbjct: 75 RQGIRFTNFYCGAAVSSPSRAALMTGRNCTRTGVYNYLEQNSPMHLRDSEVTIAEVLKQA 134 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY-FMGFHAAGT 233 Y T GKWHLS S P ++ P ++GFDY F + + Sbjct: 135 DYATGHFGKWHLS--SGRP-------------------DQPYPNDQGFDYSFYALNNSVP 173 Query: 234 AYYNSPSLFKNRE-RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP 292 +++N + F+N E + +GY D + EA+ +D+ K +PF L + +N PH P + Sbjct: 174 SHHNPTNFFRNGEPQGEIEGYSCDIVVTEALQWLDKNKQ--EPFFLNVWFNEPHFPME-- 229 Query: 293 APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID- 351 AP++ +K+ + YY + ++D + +++ LK+ DNTI++F SDNG+ D Sbjct: 230 APEELKKRHAINPE----YYGCIENMDIAIGKLMNYLKEQNLEDNTIVIFASDNGSQWDY 285 Query: 352 GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISI 410 LP +G K Y GG P + W + G + D PT AD + Sbjct: 286 SNLPF----RGEKHFNYEGGLRVPCIVRWHKHVPTGVISEFNGCFTDILPTLASLADAPV 341 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 P D +DG+ + P K + +N + Y H Sbjct: 342 PTDRVIDGMDISPVFLGKAETLERENPLFFFRYIH 376 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 154 bits (388), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 118/394 (29%), Positives = 166/394 (42%), Gaps = 69/394 (17%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI++ DD G+G L + + TP + SL Sbjct: 43 PNIVIFLADDQGWGDLSHNGNT--------------------------NLHTPNVDSLAK 76 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYY 177 EGV+F YV V P+RAA +TGR AR G + Q+ E + + F+ GY Sbjct: 77 EGVKFNRFYVG-AVCAPTRAAFLTGRYHARTGTIGVSTGQERFNSDEYTIAQAFKAAGYA 135 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 T A GKWH N T + P +GFD + GF + +Y Sbjct: 136 TGAFGKWH--------------------NGTQYPN---HPNAKGFDEYYGFTSGHWGHYF 172 Query: 238 SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQY 297 SP L N V GYI+D LTD+A+ +++ +PF YL Y PH P PDQY Sbjct: 173 SPMLDHNGTFVKGNGYITDDLTDKAMAFIEQQVQNHKPFFAYLPYCTPHSPMQ--VPDQY 230 Query: 298 QKQFNTGSQTADNY-------------YASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 +F N A +VD V R+L++L D+TI+++ S Sbjct: 231 WDRFKDKQLKLHNREPDREQPDHLRAALAMCENVDWNVGRVLKKLNSLRITDDTIVIYFS 290 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTAL 403 DNG + NG KG K GG +P + W G L G +++ A+D PT Sbjct: 291 DNGP---NGVRWNGDMKGKKGSLDEGGVRSPFVIRWPGHLPAGQEVNQIAGAIDLLPTLT 347 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 D A I P+ +DGVSL P + + K P + + Sbjct: 348 DLAGIKRPEPKPIDGVSLKPLMLNSKADWPERMI 381 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 129/483 (26%), Positives = 199/483 (41%), Gaps = 100/483 (20%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 T+ PN+IV+ DD G+G + TP Sbjct: 36 TQRPPNVIVILTDDQGWGDFSHTGNEY--------------------------LKTPHFD 69 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 + +EG YV+ V P+RA+++TGR R GV T ++ + E + E+F+ Sbjct: 70 KMTEEGALLDQFYVSP-VCAPTRASVLTGRYHLRTGVSFVTRGRENMRSEEVTIAEVFKE 128 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY T GKWH + PE+ PQ +GFD F+GF + Sbjct: 129 AGYATGCFGKWH----NGAHYPEN-------------------PQGQGFDTFLGFTSGHW 165 Query: 234 AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA 293 + Y L N E KG+I+D L DE I +D K D+PF+ ++ NAPH P Sbjct: 166 SNYFDTELEYNGEMKSTKGFITDVLMDETIQFIDAHK--DEPFLAFVPLNAPHTPYQ--V 221 Query: 294 PDQY---QKQFNTGSQTADN-----YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 PD+Y K + G N Y ++D + ++++ LK +NTI++F SD Sbjct: 222 PDKYFDKYKDIDFGYDKKQNKKIATIYGMCENIDDNLGKLMKHLKDQELEENTIVVFLSD 281 Query: 346 NGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDA 405 NG NG +G K+ + GGT P + WKG + + L + +D PT + Sbjct: 282 NGP---QGARYNGPWRGGKTSVHEGGTLVPCAIQWKGHIPNSSKSSLTAHIDLMPTLMGL 338 Query: 406 ADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQ 465 A I P++++ DG+ L +L +NL Sbjct: 339 AGIEKPENIQFDGIDLSNYLMGTSDDLGERNL---------------------------- 370 Query: 466 SDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVK 524 Y H N E ++ VR DY +T E +GLY L D +++NL P+ + Sbjct: 371 ---YTHMTNFE-ITADRGAVRQGDYR--FTTEYGDVGLYNLKEDPSEENNLKDQLPEKTQ 424 Query: 525 EMQ 527 E++ Sbjct: 425 ELK 427 >UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R917_9PLAN Length = 486 Score = 153 bits (387), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 134/513 (26%), Positives = 212/513 (41%), Gaps = 120/513 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ + DDLG+ + F+ + TP + +L Sbjct: 28 QPNIVHIVADDLGWKDVGFNG--------------------------CTEIKTPNIDALA 61 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELFQN 173 G +F+ YV + P+RA +MTGR P R+G+ + T A G+ +E +P+ + Sbjct: 62 KGGAKFSQFYV-QNMCTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQCLGD 120 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA +GKWHL + +++ P+ RGFDY G Sbjct: 121 AGYKTAIIGKWHLGH----------------------ADQKYWPKQRGFDYQYGAMIGEL 158 Query: 234 AYYNSPS-----LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 Y+ F++ + V +GY + + D+A+ + + +PF LYL +NAPH P Sbjct: 159 DYFTHDEHGVLDWFRDNKPVHEQGYTTTLIGDDAVKYI-HGQDGKKPFYLYLTFNAPHTP 217 Query: 289 NDNPAPDQY-QKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 AP +Y K N T Y A V +D+ + +++ L + G +NT+I F SDNG Sbjct: 218 YQ--APKEYITKYLNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNG 275 Query: 348 AVIDG------------PLPL-NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS 394 D LP NG + K + GG+ W GK++ D +I Sbjct: 276 GTKDKMFAGQMADMSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTVDGMIH 335 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 A+D YPT A SI K LDG ++ Sbjct: 336 AVDLYPTFAALAGASIAKCKPLDGTNV--------------------------------- 362 Query: 455 WDNYHKFVRHQSDDYPHNPNTE---DLSQFSYTVRNNDYSLVY-TVENNQLGLYKLT-DL 509 WD ++ P +P TE + F +R D+ L++ T+ + + LY L D Sbjct: 363 WDTI-------AEGKP-SPRTEFFYSIEPFRAGLRQGDWKLIWRTMLPSSVDLYNLAEDP 414 Query: 510 QQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 +K+N+AAA+P V MQ + + PL+ Sbjct: 415 YEKNNIAAAHPDKVATMQARIETASKDAAKPLA 447 >UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO Length = 492 Score = 153 bits (386), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 125/404 (30%), Positives = 183/404 (45%), Gaps = 54/404 (13%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 T+T+ S+ T KPN I++ DDLGYG L SF T+ Sbjct: 30 TETSPGDSEGTAAAGGIPEKPNFIIVFADDLGYGDL----SSFGHPTIH----------- 74 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--- 154 T L + EG ++TN YVA V PSRA ++TGR P R G+ SN Sbjct: 75 -----------TKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGRLPVRNGLTSNEIGV 123 Query: 155 ---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY----HDNF 207 D+ +G+P +E L E + GY T VGKWHL +P + DY + N Sbjct: 124 FFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLGHKEEY-LPPNHGFDDYFGIPYSND 182 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNR--ERVPAKGYISDQLTDEAIGV 265 F+ + Q+ Y + + T YN P + ER + I+ + DEA+ Sbjct: 183 MDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQNTITKRYNDEAVKW 242 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRI 325 + K D+PF +YLA++ PH+P D+++ G+ Y V +D GV +I Sbjct: 243 IREHK--DEPFFMYLAHSLPHVPLFT--SDEFR-----GTSARGLYGDVVEEIDHGVGQI 293 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY----KSQTYPGGTHTPMFMWWK 381 +E L+ G +NTI++FTSDNG + P ++G G K T+ GG P W Sbjct: 294 MELLEAEGLAENTIVVFTSDNGPWL--PTGISGGSAGLLREGKGTTWEGGMREPTIFWAP 351 Query: 382 GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 G L + S +D + T A + +P D ++DGV L P L Sbjct: 352 GMLPAKVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDLSPIL 395 >UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3HWU7_9SPHI Length = 472 Score = 153 bits (386), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 122/420 (29%), Positives = 177/420 (42%), Gaps = 77/420 (18%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + S K N++++ DDLGYG L F + Q TP Sbjct: 28 QLSPKKHYNLVLIVADDLGYGDLGFTG--------------------------STQIKTP 61 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD--------AQDGIPL 162 L L GV FT GYV+ V PSRA +TG FG +N A +G+PL Sbjct: 62 HLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQVEFGHDNNLAGVEPGFDIAYNGMPL 121 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 ++ + + GY +GKWHL K ++ P RGF Sbjct: 122 SQKTIADHLNKLGYVNGLIGKWHLGK-----------------------EPQFHPLKRGF 158 Query: 223 DYFMGFHAAGTAYYNS-----------PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 D F G+ G Y+ S S FK + + YI+D + +E++ ++R K Sbjct: 159 DEFWGYTGGGHDYFESLPNGKGYKEPLESNFKTPDPIT---YITDDVGNESVDFIERHK- 214 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 D+PF L+ A+NAPH P D Q + + Y A V+ +D V +I+ L++ Sbjct: 215 -DEPFFLFAAFNAPHTPMQALEEDLALYQ-HIEDKKRRTYAAMVHRLDLNVGKIMTSLEE 272 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYD 390 G +NT+++F SDNG D LN +G K GG H P M G L G Y Sbjct: 273 QGLSENTLVVFFSDNGGPTDSNASLNAPYRGQKGILLEGGIHVPFVMNLPGLLPEGLIYQ 332 Query: 391 KLISAMDFYPTALD-AADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + ++++D PT L A D D+ GV L+P L K + +TW + S E Sbjct: 333 EQVTSLDVVPTFLALAGDTETSMDM-FSGVDLIPHLTGKTPPLADREMTWKFTISRAIRE 391 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 135/524 (25%), Positives = 230/524 (43%), Gaps = 98/524 (18%) Query: 33 VKLKATKTNVAFSDF--TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 ++++ VA S F P + +PN++++ DD +G+ D Sbjct: 1 MRIRRLSAMVALSCFMAAPLFAQQQKRPNVLIIYTDD---------QGTLD--------- 42 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 V+ Y A TP + L EGV F+ Y A V PSRA+++TGR P R + Sbjct: 43 VNCYG--------AKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGRYPQRAQL 94 Query: 151 YSNTDAQD---GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 +N +++ G+P ++ + E+F++ GY TA +GKWH+ Sbjct: 95 DNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG-------------------- 134 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYY---------NSPSLFKNRERVPAKG-YISDQ 257 +S E P +GFDY GF Y N L++N + + G + +D Sbjct: 135 --YSPET-MPNQQGFDYSFGFMGGCIDNYSHYFYWAGPNRHDLWRNGQEIWEDGKFFADL 191 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 E G +++ K D+PF LY A N PH P +++++ + Y A+V + Sbjct: 192 TVQEVNGFLEKNKRADKPFFLYWAINMPHYPLQ--GQEKWRQYYKDLPAPRRMYAAAVST 249 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID----GPLPLNGAQKGYKSQTYPGGTH 373 +D+ + ++L+QL + G +NTI++F SD G + G G +G K + GG Sbjct: 250 MDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIR 309 Query: 374 TPMFMWWKGKLQPGN--YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P + W G L P N D+L +D+YPT +++P+ K+DG + + K Sbjct: 310 VPAIIRWTGHL-PKNEVRDQLCVNIDWYPTLAGLCKVALPQR-KIDGKDIQQVITSSKTS 367 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 PH W + + +EN P W VR + HNP++ ++ +D Sbjct: 368 SPHDIFFWQSQGT----KEN-PQWA-----VRQGNWKLLHNPSSAKKAE----TGPDDLF 413 Query: 492 LVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFID 535 LV + D + NLAA +P++V ++ ++I+ Sbjct: 414 LVNLQQ----------DTSEAKNLAAQHPEIVSSLKEQYLKWIN 447 >UniRef50_Q2LZ24 GA16747 n=5 Tax=Drosophila RepID=Q2LZ24_DROPS Length = 575 Score = 152 bits (384), Expect = 3e-35, Method: Compositional matrix adjust. Identities = 129/438 (29%), Positives = 195/438 (44%), Gaps = 93/438 (21%) Query: 21 GMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSF 80 G++AF + K +T + + T T G+PNII++ DD+G+ + F G Sbjct: 3 GLSAFLLLLLCFQRAKTDETPASAASETA---ETAGRPNIIIILADDMGFDDVSFRGG-- 57 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 RE + TP + +L G R + A + PSR A++ Sbjct: 58 -------REFL-----------------TPNIDALAFHG-RILDRLYAPAMCTPSRGALL 92 Query: 141 TGRAPARFG----VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE 196 +GR P G V SN + G+ L T +PE+FQ GY T +GKWHL Sbjct: 93 SGRYPIHTGTQHFVISNEEPW-GLTLNATLMPEIFQQAGYSTNLIGKWHLG--------- 142 Query: 197 DKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGY--- 253 FS E+ P RGFDY G+ A YY + R ++PA+ Y Sbjct: 143 -------------FSRPEYTPTRRGFDYHYGYWGAYIDYY------QRRSKMPARNYSLG 183 Query: 254 -----------------ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN-DNP--A 293 ++D LT+EA V+ + ++P L L++ A H N D+P A Sbjct: 184 YDFRRNMELECRDRGVYVTDLLTNEAERVIREREGQEEPLFLVLSHLATHTANEDDPLQA 243 Query: 294 PDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 P++ ++F YA++ S +DQ V RI+ L GQ +N+I++F SDNGA G Sbjct: 244 PEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGRIVSALNSTGQLENSIVIFYSDNGAPSVG 303 Query: 353 PLPLNGAQ---KGYKSQTYPGGTHTPMFMWWKGKLQP-GN-YDKLISAMDFYPTALDAAD 407 G+ +G K+ + GG + W +LQ GN + + I D+ P+ AA Sbjct: 304 MFANTGSNWPLRGQKNTPWEGGVRVAGAI-WSAQLQARGNIFTQPIYVADWLPSLAHAAG 362 Query: 408 ISIPKDLKLDGVSLLPWL 425 I +P L+LDG+ L P L Sbjct: 363 IELPHTLELDGIDLWPQL 380 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 151 bits (381), Expect = 7e-35, Method: Compositional matrix adjust. Identities = 129/425 (30%), Positives = 181/425 (42%), Gaps = 79/425 (18%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L AT + FS T + +PN++ + +DD+G+G L G D Sbjct: 23 LMATASANLFSRAT----AQSSRPNVVFILVDDMGWGDLSI-YGRTD------------- 64 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF------ 148 TP L L +GVRFTN Y V P+R A +TGR AR Sbjct: 65 ------------YETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGLRE 112 Query: 149 --GVYSNTDAQD-GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G S + + GIP + + L + +GY TA VGKWH Y Sbjct: 113 PLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAG---------------YPP 157 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS------LFKNRERVPAKGYISDQLT 259 NF P +GFD + G + G Y+ L++N V GY++D T Sbjct: 158 NFG--------PLQKGFDEYFGHLSGGIEYFTHTGTDRILDLYENDVPVQRSGYVTDLFT 209 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLP----NDNPAPDQYQKQFNTGSQTADNYYASV 315 D A+ + R + +PF L L YNAPH P ND + Y T + Y A V Sbjct: 210 DRAVEFIQRPHS--RPFYLSLHYNAPHWPWQGPNDQASTAFYLTNGYTVGGSQATYAAMV 267 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 S+D GV R+L+ L+ +GQ DNT+++FTSDNG G +G K+ Y GG P Sbjct: 268 KSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERFSNF---GPFRGQKASLYEGGIRVP 324 Query: 376 MFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + + G Q +++I D T L A S + DG +LLP L+ + E Sbjct: 325 AIIRYPGVTQANQVSNQVIITFDLTATILAATGTSFHPNYPPDGQNLLPLLRGDRS-EFS 383 Query: 435 KNLTW 439 + L W Sbjct: 384 RTLFW 388 >UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPC8_9BACT Length = 598 Score = 150 bits (380), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 124/408 (30%), Positives = 182/408 (44%), Gaps = 70/408 (17%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSF-DPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 +T KPN IV+ DD GY L G F PK TP Sbjct: 19 ATDKKPNFIVIFTDDQGYQDL----GCFGSPKI-----------------------KTPE 51 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPE 169 + + EG R+TN Y A+ + SRAA++TGR P+R GV+ A G+ +E + E Sbjct: 52 IDQMAKEGARYTNFYSANAICSASRAALLTGRYPSRNGVFHVYYPGASQGLKPSEITIAE 111 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF----DYF 225 + + GY T+ +GKWHL N +P ++ Y FS + W ++ F Sbjct: 112 VLKTAGYRTSIIGKWHLGD-RNQFLPTNQGFDSYFG--IPFSNDMWMSKDLALADDIKLF 168 Query: 226 MGFHAAGTAYYNSPSLFKNRER---VPA------------KGYISDQLTDEAIGVVDRAK 270 G + K +R VP + YI+ + TDEA+ ++ ++ Sbjct: 169 GGVTVEQIKSGEASKAVKGEKRGGKVPLMRDEEVVEYPVDQTYITQRYTDEALKIIKESE 228 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 QP+ +YLAY PH+P Y G Y +V +D V RIL+ LK Sbjct: 229 KKKQPYFIYLAYAMPHVP-------LYASPKFAGKSARGPYGDTVEEMDYHVGRILKHLK 281 Query: 331 KNGQYDNTIILFTSDNGAVIDG-----PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 +G NT+++FTSDNG G LPL GA K TY GG P MWW G + Sbjct: 282 SSGADKNTLVIFTSDNGPWNLGERGGSALPLRGA----KFSTYEGGHRVPCVMWWPGTIP 337 Query: 386 PG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 G + ++ + +DF PT A+ +P + LDG ++ P L+D +G+ Sbjct: 338 AGTDSAEIATTLDFMPTFAKLANAQLP-NRTLDGKNIAPMLRDGNKGK 384 >UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN Length = 455 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 129/494 (26%), Positives = 213/494 (43%), Gaps = 100/494 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ L DDLG G D + + TP L +L Sbjct: 28 RPNIVFLLADDLG---------------------------GADVSWRGSPIKTPQLDALA 60 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---AQDGIPLTETFLPELFQN 173 + G + YV V P+R+A++TGR P R+G+ A G+PL E L E Q+ Sbjct: 61 NSGAKLEQFYV-QPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQD 119 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA VGKWHL +S +P R + + + N DYF G Sbjct: 120 AGYETAIVGKWHLGHVSPAYLP---MARGFDHQYGHY--------NGALDYFTHDRDGGH 168 Query: 234 AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVV-DRAKTLDQPFMLYLAYNAPHLPNDNP 292 ++ + NR+ +GY + + EA+ V+ DR K +P LY+ +NA H P Sbjct: 169 DWHKDDHV--NRD----EGYATHLIAQEAVRVIQDRDKK--KPLFLYVPFNAVHSPLQ-- 218 Query: 293 APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 P+ Y + + Y V ++D+ V +I++++++ DNT+ +F+SDNG G Sbjct: 219 VPESYAAPYGDMKKRRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEPG 278 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIP 411 L NG +G K Y GG F WKG++ PG+ + + +D+YPT ++ A S+ Sbjct: 279 KLTDNGPLRGGKHTLYEGGVRVCAFASWKGRIAPGSKVEAPLHIVDWYPTLIELAGGSLQ 338 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 + LDG ++ P + GEP PH Sbjct: 339 QAKPLDGRNIWPSI---TTGEPS-----------------------------------PH 360 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVEN-----NQLGLYKLT-DLQQKDNLAAANPQVVKE 525 + +++ +R D+ LV V N ++ L+ L+ DL ++ N A N +++++ Sbjct: 361 DVIVCNITPTEGAIRVGDWKLV--VHNIGKPREKVELFNLSDDLAEQQNRATTNAKMLRK 418 Query: 526 MQGVVREFIDSSQP 539 ++ + + P Sbjct: 419 LRNRFDQLASEAAP 432 >UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4V9_9PLAN Length = 480 Score = 150 bits (379), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 144/545 (26%), Positives = 216/545 (39%), Gaps = 128/545 (23%) Query: 35 LKATKTNVAFSDFT------PTEYSTKGKPNIIVLTMDDLGY-GQLPFDKGSFDPKTMEN 87 L T FS F E +PN+IV+ +DD+GY G F F Sbjct: 8 LTGMMTTAVFSMFCLVNLADAAERPPGDRPNLIVIMVDDMGYAGVSCFGNPYF------- 60 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 TP + L EG++FT+ + + V P+RA ++TGR R Sbjct: 61 --------------------KTPEIDRLAAEGMKFTDFHSSGTVCSPTRAGLLTGRYQQR 100 Query: 148 FGVYS-------NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 G+ + + + Q G+ +E EL + GY TA +GKWH N Sbjct: 101 AGIEAVIHPVSDHPEHQKGLRKSENTFAELLKQAGYRTALIGKWHQGYPHN--------- 151 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS------LFKNRERVPAKGYI 254 + E+ P N GFD F+G+H+ + + + R+ GY Sbjct: 152 -----------SAEFHPDNHGFDTFVGYHSGNIDFISHVGDHVKHDWWHGRKETQETGYS 200 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG---------S 305 + + A+ + ++ +QPF LYLA+ A H P P D ++ G + Sbjct: 201 THLINQYALQFIKESR--NQPFCLYLAHEAIHNPVQVPG-DPIRRTEAAGWKRWKPASEA 257 Query: 306 QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQK--GY 363 + + + VD GV +I E L K+G NT +LF SDNG D P +G+ K G Sbjct: 258 ERIEKFRGMTLPVDAGVGQIREFLVKSGLDKNTFVLFFSDNGPSRDFP---SGSPKWRGA 314 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K Y GG P WW GK+Q G D ++D PT L A I +PK+ LDGV L Sbjct: 315 KGSVYEGGHRVPAIAWWPGKIQAGTETDVPAISLDVMPTLLGIAHIDMPKERPLDGVDLS 374 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P L ++K E FW + + S Sbjct: 375 PVLFEQKP-----------------LSERPLFWASL-----------------SNNGSRS 400 Query: 483 YTVRNNDYSLVY--------TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 +R + LV T EN ++ LY+L D + +NL+ A PQ M ++++ Sbjct: 401 EAMRAGPWKLVVQHPRAKPGTFENEKVELYRLDQDPGEANNLSKAEPQRASRMLKQLKDW 460 Query: 534 IDSSQ 538 +Q Sbjct: 461 YQDTQ 465 >UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968C90 Length = 525 Score = 149 bits (376), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 118/396 (29%), Positives = 180/396 (45%), Gaps = 85/396 (21%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 V+ ++ TPT+ KPN + + MDD+GY V Y Sbjct: 61 VGVSCTEATPTKSE---KPNFVFIYMDDMGYSD------------------VSCYG---- 95 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQ 157 + +TP + +L EG++FT+ Y A +S PSRA +TGR PAR G+ D+ Sbjct: 96 ----ETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAGFLTGRYPARMGIQGVFYPDSY 151 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 G+ E + E+ + GY TA +GKWHL S E++ P Sbjct: 152 TGMAPEEVTMAEVLKVQGYATACIGKWHLG-----------------------SREKYLP 188 Query: 218 QNRGFDYFMGFHAAGTAYYN--SPSLFKNRERVPA----KGYISDQLTDEAIGVVDRAKT 271 +GFD + G Y N S ++ V ++ + T+EA+ + R Sbjct: 189 LQQGFDEYFGI-----PYSNDMSAQVYLRGNEVEEFHIDINNVTKKYTEEAVDYIRRKA- 242 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 DQPF L+LA++ H+P Y G A Y +V VD V RI+E L++ Sbjct: 243 -DQPFFLFLAHSMMHVP-------IYVSDEFAGKSGAGIYGDAVLEVDWSVGRIMETLRE 294 Query: 332 NGQYDNTIILFTSDNGAVI-DGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 G DNT+++FTSDNG + +GP LPL K+ + GG P +WKG+++ Sbjct: 295 LGLDDNTLVVFTSDNGPWLQEGPLGGRALPLREG----KTTAFEGGVRVPCIAYWKGQIK 350 Query: 386 PGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 P ++S +D++PT + A I D++LDG L Sbjct: 351 PVVNTDVVSLLDWFPT-VTALSGGILPDVRLDGYDL 385 >UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9LGQ4_9BACT Length = 608 Score = 149 bits (376), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 124/405 (30%), Positives = 181/405 (44%), Gaps = 77/405 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+IV DD G+G D N+ V +TP + SL Sbjct: 44 RPNVIVFLSDDQGWG---------DFSCTGNQSV-----------------ATPNIDSLA 77 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 +G+ F N +V V P+RA +TGR + V + Q+ I L ET + + GY Sbjct: 78 TQGLLFENFFVCP-VCSPTRAEFLTGRYHPQSNVKGVSQGQERIDLDETTIADCLSQAGY 136 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 TAA GKWH + + P YH P RGFD F GF + Y Sbjct: 137 ATAAFGKWH----NGMQYP-------YH------------PCGRGFDDFYGFCSGHWGNY 173 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 +P+L N V +GYI+D T+ A+ ++ K+ QPF LYL YN PH P PD Sbjct: 174 FNPTLEHNGRIVKGEGYINDDFTNRALKFIEDHKS--QPFFLYLPYNTPHWPPQ--MPDA 229 Query: 297 Y-----QKQFNTGSQTAD--------NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 Y +K+ Q D + A V ++D V R+L +L + DNTI+++ Sbjct: 230 YWQRFAEKEIVQRGQKGDKEDLAKTRSALAMVENIDWNVGRVLAKLDELKIADNTIVIYF 289 Query: 344 SDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG--NYDKLISAMDFYPT 401 +DNG + N KG K T GG +P+F+ W ++ +++ A+D YPT Sbjct: 290 NDNGPNSN---RWNAGMKGKKGSTDEGGVRSPLFVRWPNGVKGAGRRVNQICGAIDLYPT 346 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 L A + D LDG +LLP + D + NL + +S+W Sbjct: 347 LLAATGSANVGDKILDGKNLLP-IWDGSE----TNLGFRMLFSYW 386 >UniRef50_C9KTU9 Twin-arginine translocation pathway signal n=5 Tax=Bacteroidales RepID=C9KTU9_9BACE Length = 453 Score = 149 bits (375), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 111/404 (27%), Positives = 177/404 (43%), Gaps = 74/404 (18%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K PN++++ +DDLG G L A STP + Sbjct: 28 KAIPNVLLILVDDLGLGDL--------------------------SCQYAKDLSTPNIDR 61 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV----YSNTDAQDG-IPLTETFLPE 169 + + GVR N Y VS PSRAA++TG PA GV + D G + +PE Sbjct: 62 IFETGVRLDNFYANSSVSSPSRAALLTGCFPAMVGVPGVIRPSIDQNWGYFGPSAVTMPE 121 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ----PQNRGFDYF 225 + +N GY TA +GKWHL W+ P RGFD+F Sbjct: 122 VLKNGGYRTALIGKWHLG---------------------------WESPNLPNERGFDHF 154 Query: 226 MGFHA-AGTAYYNSPS-----LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 GF A YY ++ N + + KG+ ++ T ++ + + PF LY Sbjct: 155 HGFLADMMDDYYTHRRQGGNYMYLNDKEIDPKGHATELFTSWSVDYIKKEAKEKNPFFLY 214 Query: 280 LAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 LAYNAPH P P ++ Q++ + A + +D + ++++ L+++GQ +N Sbjct: 215 LAYNAPHSPLQPPVEWVNKVQERDKSLPVKRARLIALIEHLDYNIGKVIQSLEESGQLNN 274 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAM 396 T+++F SDNG G + NG +G K + GG H + G + G D + M Sbjct: 275 TLVIFASDNGG-DRGSMANNGPTRGAKGDMFEGGIHVACALNMPGVFEGGRRDNHFVVMM 333 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 D PT D +++P ++DG+S+L ++ K Q + + W+ Sbjct: 334 DLMPTICDF--VNVPVKHEIDGISVLDAIKGKTQNTEDRFVFWL 375 >UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI1_9BACT Length = 472 Score = 148 bits (374), Expect = 4e-34, Method: Compositional matrix adjust. Identities = 122/401 (30%), Positives = 179/401 (44%), Gaps = 72/401 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII + DDLGYG++ ++ K I+ TP L L Sbjct: 20 KPNIIYILCDDLGYGEVGYNG---------------------QKMIQ-----TPELDKLA 53 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT----DAQDGIPLTETFLPELFQ 172 +G+RFT+ Y + V PSRA+++TG+ P + +N+ D Q IP L +L + Sbjct: 54 SKGMRFTDHYCGNAVCAPSRASLITGKHPGHAFIRANSPGYPDGQTPIPADSETLGKLMK 113 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY TA +GKW L N P +GFD+F G+ Sbjct: 114 RAGYATACIGKWGLGGFHNAG----------------------NPHKQGFDHFYGYTDQR 151 Query: 233 TAYYNSPS-LFKNRERV-------PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 A+ P L++N E+ Y D +T +A+ ++ K DQPF LYLAY Sbjct: 152 KAHNYYPEYLWRNGEKEMLNNKNGEENDYSHDLMTVDALKYIEEKK--DQPFFLYLAYLI 209 Query: 285 PHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 PH+ P QY+ + + + A +D+ + I +L++ G DNT+I+F S Sbjct: 210 PHVKYQVPDLAQYKDK--DWPKEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIMFNS 267 Query: 345 DNGAV----IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFY 399 DNGA + +G KG K Y GG +PM +W G +Q G+ ISA D Sbjct: 268 DNGAHGKSNSEKFFNTSGDLKGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFWDMM 327 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTW 439 PT + P + DG+S+LP L K + + HK L W Sbjct: 328 PTFSELT--GEPFKGETDGISMLPTLLGKDSEQKQHKYLYW 366 >UniRef50_A6DHS2 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHS2_9BACT Length = 447 Score = 148 bits (374), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 143/526 (27%), Positives = 227/526 (43%), Gaps = 127/526 (24%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 ++ KPNIIV+ +DD+G+ + SFD K YK TP Sbjct: 15 FADSAKPNIIVIMVDDMGWAGI----SSFDNKY---------YK-------------TPG 48 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG--VYSNTDAQD-----GIPLTE 164 + + EG++ T+ + V P+RAA+MTGR R G V N D + GI E Sbjct: 49 IDRMAVEGMKLTDFHSNGVVCSPTRAALMTGRYQQRSGCDVVINADPKHPDHVRGIRDEE 108 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 PE ++ Y TA GKWH+ E+ P N GFD Sbjct: 109 WTFPEAMKSADYATAVFGKWHIG-----------------------YKAEFHPMNHGFDE 145 Query: 225 FMGFHAA---GTAYYNSPSLF---KNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 F+GF + ++Y+ S F + RE KG+ SD +T+ ++ ++R K ++PF L Sbjct: 146 FVGFISGNIDAQSHYDRMSTFDWWQARELKDEKGHHSDLITEHSLDFIERNK--EKPFFL 203 Query: 279 YLAYNAPHLP------------NDNPAPDQYQKQFNTGSQTADNYYASVYS--VDQGVKR 324 Y+A+ PH P N P K + + DN+ ++ VD+GV R Sbjct: 204 YVAHGTPHSPFQARGSKIQRGPNKGQVPAWAPKIEYSKTPGDDNWLMKHFTLPVDEGVNR 263 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 IL++L + NTI+ F SDNGA G + +G K Y GG P +W G++ Sbjct: 264 ILDKLVELKIDKNTIVWFLSDNGAA-KGNHSHSENTRGAKGSMYEGGHRVPALVWAPGRI 322 Query: 385 QPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 + G+ D+ + D +++ AA ++IP + +LDGV + P + + K+ Sbjct: 323 KAGSVSDQTMMTFDITASSIKAAGVAIPANHQLDGVDIHPTVFNNKK------------- 369 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 +E + W+N + S +R + LV V + L Sbjct: 370 ---LNERTL-IWEN---------------------GKGSGALRKGPWKLV--VNKKKQEL 402 Query: 504 YKLT-DLQQKDNLAAANPQVVKEM----QGVVREFIDSSQPPLSEV 544 Y L D ++ NLA + P+++KE+ Q ++ E S P SE+ Sbjct: 403 YNLADDHKESKNLAQSMPELIKELSEEYQTILNEI--SKNAPYSEI 446 >UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BZE9_HYPNA Length = 459 Score = 148 bits (374), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 114/392 (29%), Positives = 175/392 (44%), Gaps = 82/392 (20%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 N+A S+ P + PNII++ DDLG+G + + Sbjct: 25 NIATSETAP---AAAKPPNIIIIMADDLGWGDISLNG----------------------- 58 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--DAQD 158 AA TP + + EG++ T+ Y V PSRAA++TGR P R G+ +QD Sbjct: 59 ---AALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAALLTGRYPIRSGMQHVIFPHSQD 115 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 G+P E + E+ +N GY T VGKWHL EE+ P Sbjct: 116 GLPAEEITISEMLKNAGYRTGMVGKWHLGH-----------------------QEEYWPT 152 Query: 219 NRGFDYFMGFHAAGTAYYNSPS---LFKNRERVPAKGYISDQLTDEAIGVVDRAKTL--- 272 N+GFD+F G Y N + L++ +E + + +DQ + ++ AK Sbjct: 153 NQGFDWFY-----GVPYSNDMAPFDLYRGKEIIESP---ADQ-SQLSLNYAKAAKEFIED 203 Query: 273 --DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 D+PF LY A PH+P + + +G+ A Y V +VD G+ +L+ L Sbjct: 204 SSDKPFFLYYAETFPHIP-------LFVPEDRSGTSDAGLYGDVVETVDAGIGIVLDTLD 256 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD 390 + G D+T+I+FTSDNG +G G +G K +T+ GG P W G + G+ Sbjct: 257 EAGVADDTLIIFTSDNGPWFEGS---AGEFRGRKGETHEGGFRVPFLARWPGHIPKGSVS 313 Query: 391 -KLISAMDFYPTALDAADISIPKDLKLDGVSL 421 ++ +D PTA + ++P D +DG L Sbjct: 314 HEMAMNIDLLPTAASLSGATLPADRVIDGKDL 345 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 148 bits (373), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 131/435 (30%), Positives = 183/435 (42%), Gaps = 98/435 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E +T KPN+I + DDLG G L G + + + TP Sbjct: 20 EPATPAKPNVIFILADDLGIGDL----GCYGQQKIR----------------------TP 53 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------DAQDGIPLTE 164 + L +G+RF Y V PSR A+MTGR + N + Q +P Sbjct: 54 NIDHLAADGMRFLQHYTGCSVCAPSRCALMTGRHMGHAAIRDNAQRGPSEEGQRPMPQDT 113 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + L QN GYYT +GKW L +PED + P++ GF+Y Sbjct: 114 FTVARLMQNAGYYTGIIGKWGLG------MPEDHSS----------------PRDMGFNY 151 Query: 225 FMGFHAAGTAY-YNSPSLFKNRER----------VPAKGYIS--------DQLTDEAIGV 265 G+ A+ Y P L++N ER V KG I D + +A+ Sbjct: 152 SFGYLCQSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEPKGEIYSHDVMASDALKF 211 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNP--APDQYQKQ-----FNTGSQTADN------YY 312 V D+PF LYLA+ PHL P + +Y Q F A+N Y Sbjct: 212 VRDHH--DKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPFRNTKHYANNETPRAAYA 269 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPL----NGAQKGYKSQ 366 + +D+ V R++ LK+ G DNT++ F+SDNGAV + G P+ G +GYK Sbjct: 270 GMITRMDRDVGRLMALLKELGIDDNTLVFFSSDNGAVFPLAGTDPVFFQSTGGFRGYKQD 329 Query: 367 TYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG TP+ W GK++ G D+ DF PT + + P D DG+S LP L Sbjct: 330 LYEGGIRTPLIARWPGKIETGVTTDQASVFYDFLPTMAELNGVPPPAD--TDGLSYLPTL 387 Query: 426 QDK-KQGEPHKNLTW 439 K Q + H L W Sbjct: 388 LGKPAQQKQHDFLYW 402 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 148 bits (373), Expect = 6e-34, Method: Compositional matrix adjust. Identities = 146/517 (28%), Positives = 209/517 (40%), Gaps = 118/517 (22%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 + F ++ +PN+++L DDLGY D G + Sbjct: 31 LTFGQLAGETHAQTLRPNVVMLLADDLGY----RDVGCY--------------------- 65 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQD 158 TPT+ L G RF Y V PSRA +MTGR R GVYS + Sbjct: 66 --GGPVETPTIDQLAAGGTRFQQFYSGCAVCSPSRATLMTGRHHIRAGVYSWIQDESQNS 123 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE-DKQTRDYHDNFTTFSAEEWQP 217 + L E L E+ ++ GY TA VGKWHL +P E DK T D H Sbjct: 124 HLRLREVTLAEVLRDAGYATAHVGKWHLG----LPTEERDKPTPDQH------------- 166 Query: 218 QNRGFD-YFMGFHAAGTAYYNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLD-- 273 GFD +F ++ A ++ N + +N E V +GY + DEAI +DR + D Sbjct: 167 ---GFDHWFATWNNAQPSHRNPDNFIRNGEPVGQLEGYSCQLVADEAIRWMDRHRESDPD 223 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 QPF L + ++ PH P APD+ +++ S Y ++ + DQ +KR+L +L G Sbjct: 224 QPFFLNVWFHEPHAPI--AAPDEVTQKYGKLSDKGAVYSGTIDNTDQAIKRLLAKLDALG 281 Query: 334 QYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKL 392 +NT+I++ SDNG+ + G +G K + GG P W G + G ++ Sbjct: 282 VRENTLIVYASDNGSYRTDRV---GKLRGRKGANWEGGIRVPGIFHWPGHIPAGVVSNEP 338 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 +D PT IS P+ + LDG L P L T ++ F+ Sbjct: 339 AGLVDVLPTICGLLKISPPQ-VHLDGSDLTPLL---------------TGHADSFERHQP 382 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV----YTVENNQL------- 501 FW SQ +R+ DYSLV Y + N L Sbjct: 383 LFWH-------------------LQRSQPIVAMRDGDYSLVGFRDYEMSNKNLFEEKWIP 423 Query: 502 ----------GLYKLT-DLQQKDNLAAANPQVVKEMQ 527 LY L D Q NLAA P+ V+ M+ Sbjct: 424 AIKNGTYHNFELYNLKDDPGQTKNLAAEQPERVEAMK 460 >UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UM38_RHOBA Length = 667 Score = 147 bits (372), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 121/409 (29%), Positives = 183/409 (44%), Gaps = 78/409 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++V+ DD G+G L +P TP + SL Sbjct: 92 RPNVLVVLTDDQGWGDLSLHG---NPNLQ-----------------------TPHIDSLA 125 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 +GV+ N YV V P+RA +TGR R GV+S + + L+E + + FQ GY Sbjct: 126 RDGVQIKNFYVC-AVCSPTRAEFLTGRYHTRSGVFSTSAGGERFDLSERTIGDAFQAAGY 184 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 TAA GKWH S + P YH P RGFD F GF + Y Sbjct: 185 RTAAFGKWH----SGMQAP-------YH------------PNARGFDEFYGFCSGHWGNY 221 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND------ 290 SP L N E V G+I D LT AI ++R + + PF +YL N PH P Sbjct: 222 FSPMLELNGEIVKGDGFIVDDLTQHAIDFMERDR--ENPFFIYLPLNTPHSPMQVPDEDW 279 Query: 291 ----------NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 +P P+ +K+ ++ A A ++D V ++L+ L++ +NTI+ Sbjct: 280 QNFEGKEIVPDPRPENAKKEDVQHTRAA---LALCENIDDNVGQLLDALERLSLSENTIV 336 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFY 399 +F DNG +G NG +G K + GG +P + + K+ G + A+D + Sbjct: 337 VFFCDNGP--NGSR-FNGGLRGRKGAVHEGGLRSPCLIRYPSKIPAGQTVGGIAGAIDLF 393 Query: 400 PTALDAADISIPKDLK-LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 PT D D+ + LDG+SL+ L++ K +P + L + T++S F Sbjct: 394 PTLADLCDVEVGATAGPLDGISLIDGLREPKS-KPSERLIF-TAWSGKF 440 >UniRef50_A6LIX6 N-acetylgalactosamine 6-sulfatase n=2 Tax=Bacteroidales RepID=A6LIX6_PARD8 Length = 589 Score = 147 bits (372), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 123/407 (30%), Positives = 170/407 (41%), Gaps = 72/407 (17%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 ++ K PNIIV+ DD G+G L F +F TP Sbjct: 21 FAQKQLPNIIVMLSDDQGWGDLGFTGNTF--------------------------VQTPN 54 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELF 171 + + EG N YV VS P+RA +TGR R GV S T + L E + E F Sbjct: 55 IDRIAHEGTILENFYVCP-VSSPTRAEFLTGRYHVRSGVNSTTGGGERFNLGEKTIAEYF 113 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T+ GKWH S P YH P RGF+ F GF + Sbjct: 114 REAGYATSLFGKWH----SGTQYP-------YH------------PNARGFEEFYGFCSG 150 Query: 232 GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN 291 Y +P L N E + +G+I D LTD+A+ + K + PF ++L+YN PH P Sbjct: 151 HWGNYWNPVLEHNGEIISGEGFIIDDLTDKALDYIRDHK--EHPFFMFLSYNTPHSPMQV 208 Query: 292 P------APDQYQKQFNTGSQTADNYY-----ASVYSVDQGVKRILEQLKKNGQYDNTII 340 P D+ Q T + D + A ++D + R+L L TI+ Sbjct: 209 PDSWWNRVKDRTLSQRATFPEQEDTTFTKAALALAENLDWNIGRVLSLLHSLDLEQETIV 268 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFY 399 ++ SDNG NG KG K T GG +P + W G ++ G + +L A+D Sbjct: 269 IYFSDNGP---NSFRWNGGMKGRKGSTDEGGVRSPFCIRWPGHIRKGAVETQLSGAIDLI 325 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 PT L A I KLDG+ L D+K + L YS+W Sbjct: 326 PTLLGLAGIEYTPLRKLDGIDWGQRLLDEKAPAIDRVL-----YSYW 367 >UniRef50_A6DIE0 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DIE0_9BACT Length = 527 Score = 147 bits (372), Expect = 8e-34, Method: Compositional matrix adjust. Identities = 149/564 (26%), Positives = 231/564 (40%), Gaps = 139/564 (24%) Query: 47 FTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQ 106 T T + KPN++V+ +DDLGY + SF P+ + + YK Sbjct: 16 LTGTSLQAQQKPNVVVIIVDDLGYADM-----SFLPQAPTD---IKHYK----------- 56 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETF 166 TP L G F N Y + PSRA I+TG R+G Y D + P + Sbjct: 57 --TPGFDRLFATGTYFENAYATSPICSPSRAGILTGSYQQRWGNYWYGDGK--FPNNKVT 112 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 +PE+ ++GY TA GK HLS E+ P GFD ++ Sbjct: 113 IPEMLSSNGYATAKYGKTHLS-----------------------GWEKKVPTMHGFDEYL 149 Query: 227 GFHAAGTAY----------YNSPSLFK-----------------NRERVPAK---GYISD 256 GF Y Y FK N E P + +D Sbjct: 150 GFMHHTWDYIRLSQKDVDAYKKKKEFKDFGCQVIGPLVKAEGQGNEELKPVSYENSFTTD 209 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP-------------------NDNPAPDQY 297 TDEAI + R K +PF L+L+YNA H+P + N A +Y Sbjct: 210 IFTDEAINFIKRDKG-GKPFYLHLSYNAVHMPTYVVEETWAKKVGARYVPWDRNAAKWEY 268 Query: 298 ---------QKQFNTG--------SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 K F+ S+ Y A+++++D G+ R+L+ L+K+GQ +NT+I Sbjct: 269 PYWDPAQEPHKTFHKKWGHMGEYDSEGRRCYLANLFALDYGISRLLDALEKSGQRENTMI 328 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG--NYDKLISAMDF 398 +FTSDNG ++ N +G K GG P+ + G L N L+S MD Sbjct: 329 IFTSDNGGTVN-TYSNNAPLRGSKYMLGEGGIRVPVIISMPGTLPQNIVNKSALVSGMDI 387 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 PT + I+ P+D +DG+S+LP L+ +K+ + H+ + W + + W Sbjct: 388 MPTIAELTGIAAPED--IDGISMLPVLKQEKK-QHHEWVAWAKNENSWVLRRG------- 437 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL--VYTVENNQLGLYKL-TDLQQKDNL 515 K+ ++ + H +T+ N L + + L+ L TD+ + N+ Sbjct: 438 -KWKLSKNAGWGHK---------GFTIGENSEVLPGKKMTYPSGINLFNLETDIGETTNV 487 Query: 516 AAANPQVVKEMQGVVREFIDSSQP 539 A NP+VV+EM + +E+ P Sbjct: 488 ADQNPEVVQEMLALHKEWASRMIP 511 >UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R014_9PLAN Length = 475 Score = 147 bits (371), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 135/501 (26%), Positives = 207/501 (41%), Gaps = 120/501 (23%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+V+ +DD+G+ L + + TP++ L Sbjct: 41 PNIVVILIDDMGFSDL--------------------------SCMGSTYYETPSINKLAA 74 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-------GVYSNT------DAQDGIPLTE 164 G+RFT+ Y A V P+RAA++TG+ PAR G SN D + L E Sbjct: 75 SGMRFTHAYSACTVCSPTRAAVLTGKYPARLHLTDWIPGQMSNKTKLKLPDWNKQLNLEE 134 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 L EL HGY TA++GKWHL E +P +GF Sbjct: 135 ITLAELLGAHGYTTASIGKWHLGP------------------------PECEPTRQGFSL 170 Query: 225 FMGFHAAGTAYYNSPSLFKNRER----VP--AKG----YISDQLTDEAIGVVDRAKTLDQ 274 +G ++ G PS F ER +P A+G Y++D+LTD ++ ++ + Sbjct: 171 NIGGNSKG----QPPSYFFPYERNGVLLPGLAEGKPNEYLTDRLTDACEAFIEENQS--K 224 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQK------QFNTGSQTADNYYASVYSVDQGVKRILEQ 328 PF LYL + H P P+ K QF Q Y A V S+DQ V RI+ + Sbjct: 225 PFFLYLPHYCVHTPLQA-KPELIAKYEAKNAQFPGNPQHEAKYAAMVESLDQSVGRIMAK 283 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG- 387 L TI++FTSDNG ++ + N + K Y GG P+ + + ++PG Sbjct: 284 LDALDLTKKTIVIFTSDNGGLVLREITSNLPARAGKGSAYEGGVRVPLIVSYPPMIKPGT 343 Query: 388 NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 D +MD +PT + + D +DG S++P L++K + L W Y H+ Sbjct: 344 TCDVPAISMDLFPTLAELSGAKYSHD--IDGKSIVPLLEEKPDAFAARPLYW--HYPHY- 398 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT 507 H P++ +R +Y LV E+ +L LY L Sbjct: 399 ----------------HGGGATPYS-----------AMRVGNYRLVEFFEDGRLELYDLA 431 Query: 508 -DLQQKDNLAAANPQVVKEMQ 527 D+ + NLA P + +++ Sbjct: 432 HDIGEMKNLAQEKPDLTEKLH 452 >UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LCL3_PARD8 Length = 476 Score = 147 bits (370), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 116/403 (28%), Positives = 177/403 (43%), Gaps = 84/403 (20%) Query: 59 NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDE 118 NI+++ +DD+GYG F+ A +TP + + E Sbjct: 25 NIVLINLDDVGYGDFSFNG--------------------------AYGYTTPNIDKMAAE 58 Query: 119 GVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPELFQNHGY 176 GVRFT+ V +SG SRA ++TG P R G D+ G+ E + E+ + GY Sbjct: 59 GVRFTHFLVGQPISGASRAGLLTGCYPNRIGFSGAPGPDSNYGVHPEEMTIAEVLKQKGY 118 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG--------- 227 TA GKWHL S +E+ P GFD + G Sbjct: 119 STAIFGKWHLG-----------------------SQKEFLPLQNGFDEYYGLPYSNDMWP 155 Query: 228 FHAAGTAYYNSPSL--FKNRERVPAKGYISDQ------LTDEAIGVVDRAKTLDQPFMLY 279 FH +N P L + E + GY +DQ T ++ + + K ++PF LY Sbjct: 156 FHPQQGEVFNFPDLPTYDGNEII---GYNTDQTRLTTDYTTRSVNFIKKNK--NKPFFLY 210 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA+N PH+P D+++ + G Y + +D V I + L++ G DNT+ Sbjct: 211 LAHNMPHVP--LAVSDKFKGKSEQGL-----YGDVMMEIDWSVGEIFKALRELGLEDNTL 263 Query: 340 ILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAM 396 ++ TSDNG + G + K+ T+ GG P M+WKGK PG +KL S + Sbjct: 264 VILTSDNGPWTNYGNHAGSAGGLREAKATTFDGGNRVPCIMYWKGKTLPGTTCNKLASNI 323 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 D PT + +P K+DGVS+LP ++ KK P ++ + Sbjct: 324 DLLPTFAEITQAPLPPR-KIDGVSILPLIEGKKDANPRESFVY 365 >UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomyces sp. C RepID=UPI0001B577E1 Length = 746 Score = 147 bits (370), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 138/503 (27%), Positives = 208/503 (41%), Gaps = 104/503 (20%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+V+ DDLGYG+L GS+ K + STP L L Sbjct: 48 PNIVVVLADDLGYGEL----GSYGQKLI----------------------STPRLDRLAT 81 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--TDAQDGIPLTETFLPELFQNHG 175 EG+RFT+ Y V PSR +++TG V +N + Q + T+T ++ + G Sbjct: 82 EGLRFTDAYSTAAVCAPSRCSLLTGLHTGHSTVRANPSSGGQGSLTATDTTFAQVLRARG 141 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF--HAAGT 233 Y TA +GKW PE + ++ P RGF+ F G+ H+ Sbjct: 142 YRTAVIGKWGFG-------PE-------------AAGQDSHPAARGFEEFYGYIDHSHAH 181 Query: 234 AYYNSPSLFKN--RERVPAKG------YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 YY L+ N +E +PA Y L A+ +D +PF+L L N P Sbjct: 182 QYYPE-YLWHNAVKEPIPANAGGAKAVYAPHLLEQHALEFIDTHAA--EPFLLLLTPNVP 238 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADN--YYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 H P+D P Y + S TA N + A V D V +++++L+ G +T++L T Sbjct: 239 HAPSDIPDSSAYADR----SWTAANKGHAAQVSYFDSLVGKVVDRLRSLGLEQDTVVLVT 294 Query: 344 SDNGAVIDGPL-----PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDF 398 SDNG +G + NG +GYK Y GG P+ W G++Q G ++ D Sbjct: 295 SDNGPHEEGGVNPDLFDANGPLRGYKRNLYEGGVRVPLIAWGPGRVQQGTSNRPTPLTDV 354 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 PT + P D +DG+S P L H +L W F D Sbjct: 355 LPTLAELGGAPAPTD--VDGLSAAPLLAGSPDSARHGHLYW--------------FRDEL 398 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-YTVENN--------QLGLYKL-TD 508 R + D + + + VR ++ V + E + Q+ LY L TD Sbjct: 399 GVTSRANAQD------GKRATWLAEAVRRENWKAVRFAPERDHNLPDDKWQVELYDLATD 452 Query: 509 LQQKDNLAAANPQVVKEMQGVVR 531 L + ++ A NP E+ ++R Sbjct: 453 LGETRDVLAKNPSKAAELVALMR 475 >UniRef50_A6DGL0 Arylsulfatase A n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGL0_9BACT Length = 506 Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 137/545 (25%), Positives = 231/545 (42%), Gaps = 111/545 (20%) Query: 33 VKLKATKTNVAFSDFTPTEYSTK-------GKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 +K++ + ++A S + ++ + KPNII++ DD G G L M Sbjct: 1 MKIRKSFISIALSLLSLNNFAAETKKILKGAKPNIIMVLTDDQGMGDL---------SCM 51 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 N + TP + + + RFT+ V+ + P+RAAIM+GR+P Sbjct: 52 GNPIL-----------------RTPHIDKMYAKSTRFTDFQVSSTCT-PTRAAIMSGRSP 93 Query: 146 ARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G+ +D + P+ Q GY T GKWHL Sbjct: 94 FEVGISHTLMQRDRLAPAVITFPQALQKSGYKTGLFGKWHLG------------------ 135 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN----SPS---------LFKNRERVPAKG 252 EE++PQNRGFD + A G YN P+ L N V KG Sbjct: 136 -----DGEEYRPQNRGFDEVLMHGAGGIGQYNFGDFKPNATNKYFDNVLLHNDTIVQTKG 190 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTG-SQTADN 310 + +D A+ + + +Q + Y++ NAPH P AP++Y+K+F + G +Q+ Sbjct: 191 FCTDVFFKAALSWIKKQHENNQTYFAYISLNAPHGP--LIAPEKYKKRFIDEGYNQSVAA 248 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--------IDGPL-PLNGAQK 361 Y + ++D ++E+LK+ DNT+I+F +DNG + G N K Sbjct: 249 RYGMIENIDDNFGLMVEKLKEWKALDNTLIIFMTDNGMAMKSIGKKGVKGKFNAWNAGMK 308 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPK-DLKLDGV 419 G+K + GG+ P F +WKG L G + L + +D Y T + A +IP+ L G Sbjct: 309 GHKDSAWEGGSRVPSFWYWKGVLGEGVDISALSAHIDLYRTFCELAGTNIPESSLSPSGR 368 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR-HQSDDYPHNPNTEDL 478 SL+P L++ P+ WD+ F + T +L Sbjct: 369 SLIPLLEN-----PNAK------------------WDDRTLFFHRGRWGGGGRGKKTREL 405 Query: 479 SQ-FSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 ++ + VRN+ + LV ++ + L + D + N+ +P+V ++M+ ++ DS Sbjct: 406 AKYYGMAVRNSRWRLVNIMDGDGPWLSDIANDPGETKNVIEQHPEVAEKMKAQFDQWWDS 465 Query: 537 SQPPL 541 ++ L Sbjct: 466 TESLL 470 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 113/400 (28%), Positives = 173/400 (43%), Gaps = 76/400 (19%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 + E KPNI+++ DDLGYG L Y + K Sbjct: 21 ELQAVEKQQAAKPNIVIILCDDLGYGDLA------------------CYGHPVIK----- 57 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL--- 162 TP L L EG+R T+ Y + V PSRA ++TGR P R GVY +G P+ Sbjct: 58 ---TPHLDQLASEGMRLTDCYASAPVCSPSRAGLLTGRTPNRLGVYDWIP--EGHPMHLK 112 Query: 163 -TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + +L Q GY TA VGKWH + + N S E+ QP + G Sbjct: 113 RDEVTVAQLLQQAGYDTAHVGKWHCNGMFN-------------------SKEQPQPGDHG 153 Query: 222 FDYFMGFHAAGTAYYNSPSLF--KNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 F ++ + +P+ F + +G+ + DE I + + ++PF L+ Sbjct: 154 FRHWFSTQNNALPTHENPNNFVRNGKPLGEIEGFSCQIVADEGIRWLSDWREKEKPFFLH 213 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTAD--NYYASVYSVDQGVKRILEQLKKNGQYDN 337 + ++ PH +P + + S D Y+A+V ++D+ V ++L +L + DN Sbjct: 214 VCFHEPH--ERVASPPALVETYLDKSLYEDQAQYFANVANMDRAVGKLLIKLDELKVADN 271 Query: 338 TIILFTSDNGAVIDGPLPLN-------------GAQKGYKSQTYPGGTHTPMFMWWKGKL 384 T++ FTSDN GP LN G +G K Y GG P + W GK+ Sbjct: 272 TLVFFTSDN-----GPETLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKI 326 Query: 385 QPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 + G + ++D PT + A +++P LDG SLLP Sbjct: 327 KAGQEIATPVCSVDLLPTFCEIAGVAVPDQRPLDGASLLP 366 >UniRef50_A6DG78 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG78_9BACT Length = 464 Score = 146 bits (369), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 131/474 (27%), Positives = 203/474 (42%), Gaps = 118/474 (24%) Query: 4 ALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVL 63 ++ + STS+ +L+ A++AD+ KL K PN+++ Sbjct: 3 TIRNLITSTSLFFLLS-------AYSADNKKLDINK------------------PNLVIF 37 Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DD +G+ D ++++ TP + L ++GVRFT Sbjct: 38 FTDD---------QGTLDVNCYGSKDLY-----------------TPNMDKLAEDGVRFT 71 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGV--YSNTDAQD----GIPLTETFLPELFQNHGYY 177 Y AH V P+RA +MTGR P R V ++ DA+ + L E L E ++ GY Sbjct: 72 QAY-AHQVCCPARAMLMTGRHPQRSNVNHWTQGDAKGPKTRNMNLEEYTLAEALKDSGYK 130 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 TA GKWHL + ++ P +GFD F G YN Sbjct: 131 TALFGKWHLG-----------------------AHLDYGPTKQGFDEFYGIRGGFIDNYN 167 Query: 238 S--------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 L++ + V +G Y + +TD A+ +DR K + PF L+LA+N PH P Sbjct: 168 HYFLHGEGFHDLYEGTKEVFDEGKYFPNLVTDRALNFIDRNK--NNPFFLFLAFNIPHYP 225 Query: 289 NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 A ++ +++ +Y + + D + +I+ +L+++G YDNTII+F SDNG Sbjct: 226 EQ--ADPKFDERYKNMKMPRQSYAKMISTTDDHMGQIMSKLQEHGIYDNTIIIFMSDNGH 283 Query: 349 VID----------GPLPLN------------GAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 + L N G +G KS Y GG P + + KL Sbjct: 284 SRERNHIKFDNHKSGLAKNTKYGALGGGGNTGKWRGNKSNFYEGGIRVPAIITFPNKLPK 343 Query: 387 GNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G D+ I+AMD+ PT L+ +I PK +K DG SL + + PHK L W Sbjct: 344 GAVRDQAITAMDWMPTVLELCNIEPPK-IKFDGKSLTQVIASEDNPSPHKVLNW 396 >UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG4_9BACT Length = 489 Score = 146 bits (368), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 117/383 (30%), Positives = 172/383 (44%), Gaps = 75/383 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+ DDLGYG D G + A + TP + L Sbjct: 29 KPNILFYLTDDLGYG----DIGCYG----------------------AEGQYTPAIDQLA 62 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG----VYSNTDAQDGIPLTETFLPELFQ 172 EG +F++ YV H PSRAA MTG R G +Y + + G+ +E LPEL + Sbjct: 63 KEGTKFSSFYV-HQRCSPSRAAFMTGSYAHRVGLPQVIYKHREGPIGLNPSEITLPELMK 121 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY TA VGKWHL + + +H P N G+DYF GF Sbjct: 122 TAGYNTALVGKWHLG-----------EWKPFH------------PLNHGYDYFYGFLKV- 157 Query: 233 TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRA-----KTLDQPFMLYLAYNAPHL 287 PSL +NR+ + +K + +A G+V A K PF L + PH Sbjct: 158 IEGSEKPSLIENRKELASK---IQKTEGQAPGMVKAAINFMTKHKKNPFFLVYSDPMPH- 213 Query: 288 PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 AP +QF G+ NY ++ +D K +++ L + G +NTI++FTSDNG Sbjct: 214 -----APYFPSEQFK-GTSKRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFTSDNG 267 Query: 348 AVIDGP----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ-PGNYDKLISAMDFYPTA 402 ++ + L+G + K + GG P + W GK++ + D +I +D PT Sbjct: 268 PPVERQKKYDVGLSGPLRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDMLPTF 327 Query: 403 LDAADISIPKDLKLDGVSLLPWL 425 + A + +P D +DGV++LP L Sbjct: 328 CELAGVDVPNDRVIDGVNILPQL 350 >UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZLN5_9PLAN Length = 468 Score = 146 bits (368), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 116/426 (27%), Positives = 182/426 (42%), Gaps = 96/426 (22%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 +K P+I+++ DD G+ L I TP L Sbjct: 28 SKRPPSIVLIVSDDQGFADL--------------------------SCIGDNGCRTPRLD 61 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQD------------- 158 L G R T+ YV+ PSRA++MTGR P R G Y +A D Sbjct: 62 QLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEEYAVT 121 Query: 159 -----GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 G L E FL ++ + GY +A GKW ++ + Sbjct: 122 AERILGTDLQEVFLADVLKQAGYVSAVFGKWDGGQL-----------------------K 158 Query: 214 EWQPQNRGFDYFMGFHAAGTAY-----YNSPSLFK-NRERVPAKG-YISDQLTDEAIGVV 266 + P RGFD + GF G Y Y PS+F+ N+ KG Y++D EAI + Sbjct: 159 RYLPLQRGFDQYYGFANTGVDYFTHERYGVPSMFRDNQPTEEDKGTYLTDLFEREAIRFI 218 Query: 267 DRAKTLDQPFMLYLAYNAPHLPND--------NPAPDQYQKQFNTGSQTADN----YYAS 314 D + D+PF LYL +NAPH ++ AP +Y F G + Y A+ Sbjct: 219 D--ENHDRPFFLYLPFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAA 276 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 V +D+ + ++++QL+++ DNT+I+F SDNG N +G K++ + GG Sbjct: 277 VERMDEAIGKVVDQLQQHQIADNTLIIFLSDNGGGGGAD---NSPLRGGKAKMFEGGNRV 333 Query: 375 PMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P + W GK+ G ++ +++++ +PT + A +P D+ DG +LP L P Sbjct: 334 PCIVHWPGKVPAGKVSNQFLTSLEVFPTVIAAIGGKLPDDVIYDGFDMLPVLNGASS--P 391 Query: 434 HKNLTW 439 + + W Sbjct: 392 REEMFW 397 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 145 bits (367), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 134/520 (25%), Positives = 211/520 (40%), Gaps = 129/520 (24%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+++ DDLGYG L N++V TP + L Sbjct: 35 PNIVLIMADDLGYGDL---------ACYGNKQV-----------------KTPHIDRLAA 68 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG------VYSNTDAQDGIPLTETFLPELF 171 ++FT+ + A + P+RAA++TG+ RFG + ++ G+P + EL Sbjct: 69 SALKFTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELL 128 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF--- 228 + GY TA GKWHL W P N+GFD F G Sbjct: 129 KQQGYATACFGKWHLGY-----------------------QPPWLPTNQGFDLFRGLTSG 165 Query: 229 ---HAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 H + + N E KGY +D L+ ++ ++ +T +PF LY+ + A Sbjct: 166 DGDHHTHVDRSGNEDWWHNNEISMEKGYTADLLSKYSVAFMEANRT--RPFFLYVPHLAI 223 Query: 286 HLPNDNPAPDQYQKQ---FNTG--------SQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 H P P ++K ++ G + + A + S+DQ V +IL LK+ Sbjct: 224 HFPWQGPQDPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDL 283 Query: 335 YDNTIILFTSDNGAVID-----GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389 NT+++FTSDNG + + NG +G K+ Y GG P + W G + G Sbjct: 284 EQNTLVIFTSDNGGYLTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVITAGVT 343 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP--HKNLTWITSYSHWF 447 D+ ++D PT AA IS + + DG+ L P Q G P ++L W + Sbjct: 344 DQTAHSVDLLPTLAQAAGIS-ATNFQTDGLDLAPLWQ---TGRPLADRDLFWRMGNNR-- 397 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL- 506 VR + L ++NN+ LY L Sbjct: 398 ------------------------------------AVRRGQWKLC--LKNNRSELYHLE 419 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREF---IDSSQPPLSE 543 TDL ++ N AA +P++VK M ++E+ +D+S S+ Sbjct: 420 TDLGEQQNRAAEHPEIVKSMSQALKEWEADVDTSAKQFSK 459 >UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1I8_9PLAN Length = 427 Score = 145 bits (366), Expect = 4e-33, Method: Compositional matrix adjust. Identities = 142/495 (28%), Positives = 213/495 (43%), Gaps = 108/495 (21%) Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +++ DDLGYG D ++ P + TP + L EG+ Sbjct: 1 MLILADDLGYG----DVSTYHPSDVR----------------------TPQIDQLAAEGM 34 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGV--YSNTDAQDGIPLTETFLPEL---FQNHG 175 T+ V PSRAA++TGR R GV T +D + +P L + G Sbjct: 35 LLTSMRANCTVCSPSRAALLTGRYADRVGVPGVIRTKPEDSWGWFDPTVPTLADELKRVG 94 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA----- 230 Y+TA VGKWHL E T P RGFD+F GF Sbjct: 95 YHTAIVGKWHLGL-------ESPNT----------------PNERGFDFFQGFLGDMMDS 131 Query: 231 -AGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIG-VVDRAKTLDQPFMLYLAYNAPHLP 288 Y + + +NRE + +G+ ++ TD A +V+RAK +QPF LYLAYNAPH P Sbjct: 132 YTTHLRYGNNYMRRNREVIEPQGHATELFTDWASEYLVERAKQKEQPFFLYLAYNAPHFP 191 Query: 289 NDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 + PA + +++ Q A V +D + R+L+ LK+ G NT+++FTSDN Sbjct: 192 IEPPAEWLAKVKERAPQLDQKRAKNVAFVEHLDHSIGRVLKTLKETGLDQNTVVVFTSDN 251 Query: 347 GAVIDGPLPL---NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTA 402 G G LP N + K Y GG P + W G+++ G+ + D +PT Sbjct: 252 G----GSLPHAQNNDPWRDGKQSHYDGGLRVPFMVRWPGQIKAGSRSDYVGLNFDLFPTF 307 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFV 462 L+ A + K+ LD VSL+P L+ K IT+ + FV Sbjct: 308 LELAGATPSKE--LDAVSLVPVLKGGK----------ITTSRDLY-------------FV 342 Query: 463 RHQSDDYPHNPNTEDLSQFSYTVRNND-YSL--VYTVENNQLGLYKLTDLQQKDNLAAAN 519 R + + E + + + + ND YS +Y ++N D + +LAA+N Sbjct: 343 RREGGVTYGGKSYEAIIRGEWKLLQNDPYSALELYNIQN---------DPGETKDLAASN 393 Query: 520 PQVVKEMQGVVREFI 534 +VV E+ +R I Sbjct: 394 KKVVNELAAALRLHI 408 >UniRef50_A3HZ22 Putative exported uslfatase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ22_9SPHI Length = 489 Score = 144 bits (364), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 133/473 (28%), Positives = 209/473 (44%), Gaps = 107/473 (22%) Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA----------- 156 TP + L EG+ FTN Y A + P+RAA++TG+ PAR G+ A Sbjct: 67 ETPNITKLAKEGILFTNSYAAAAICSPTRAALLTGKYPARLGITDWIRAKFNQNSTSGLP 126 Query: 157 ----------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 Q +PL E + E + HGY T VGKWHL + Sbjct: 127 GEYEVFENKPLKTPKIQGFLPLEEITIAERMKAHGYGTLHVGKWHLGE------------ 174 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT--AYYNSPSLFKNRERV------PAK- 251 E + P+++GFD +G + G +Y++ K RE P K Sbjct: 175 ------------EGFYPEDQGFDVNIGGNDLGQPPSYFDPYLPAKPREFYEITTLKPRKE 222 Query: 252 -GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQK--QFNTGSQTA 308 +++D+ DE + + K + F ++ A A H P PD +K Q G+Q Sbjct: 223 GEFLTDREGDEVVNYIQNQK--GKKFFVHWAPYAVHTPIMG-KPDLVEKYEQKEPGNQRN 279 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLPLNGAQKGYKS 365 Y A V SVDQ V ++L +L++ G +NT+++FTSDNG +I D P+ N K K Sbjct: 280 PVYAALVESVDQNVGKVLSELERMGLRENTLVIFTSDNGGLIGNYDNPITNNYPLKSQKG 339 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALD--AADISIPKDLKLDGVSLL 422 Y GG P + W GK+ G D+ I MD+ PT LD D ++P +L+GVSL Sbjct: 340 YPYEGGIRIPTIVSWPGKIPQGFVDETPIITMDWIPTILDFMGEDPTLP---ELEGVSLK 396 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P L ++K ++L W +PH D+S + Sbjct: 397 PLLTERKD-LAERDLFWY----------------------------FPHY-RLSDISPYV 426 Query: 483 YTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 VR+ Y L++ ++ Q LY L D+++K N+ + + +++Q + +++ Sbjct: 427 -IVRSGGYKLIHYFDDTQDELYNLDYDMEEKVNVISTRGAIAEQLQQKIDQWL 478 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 144 bits (364), Expect = 7e-33, Method: Compositional matrix adjust. Identities = 139/548 (25%), Positives = 212/548 (38%), Gaps = 157/548 (28%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++ + +DD G+ + ++ F TP + L Sbjct: 30 QPNVVFIMVDDYGWADVGYNGSRF--------------------------YETPNIDRLA 63 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----------------- 159 EG+ FT+GY A +S PSR ++MTG+ PAR G+ TD G Sbjct: 64 SEGMIFTDGYAAASISSPSRVSLMTGKYPARTGI---TDWIPGYQYGLKPEQLKQYKMLA 120 Query: 160 ------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 +PL E + E F+ HGY T VGKWH ++ S Sbjct: 121 PEMPLNMPLEEVTMAEAFKEHGYATYHVGKWHCAEDS----------------------- 157 Query: 214 EWQPQNRGFDYFMG-----------FHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEA 262 + PQ +GFD +G G Y SP P +++D+L DE+ Sbjct: 158 LYYPQYQGFDVNIGGWLKGSPNGIRRSQGGKGAYCSPYRNPYLPDGPEGEFLTDRLGDES 217 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD------------- 309 I ++ + + D+PF LYLA+ A H P + A +Y K F +Q Sbjct: 218 IKLI-KNSSADKPFFLYLAFYAVHTPIE--AKPEYVKYFKWKAQRMGLDTIVPFTRNLEW 274 Query: 310 --------------------NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 Y A +YS+D+ V R+++ LK NG NTI+ SDNG + Sbjct: 275 YKNAEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQALKDNGLDKNTIVCLLSDNGGL 334 Query: 350 --IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAA 406 +G N + K Y GG P + + ++ G+ + A+DFYPT LD A Sbjct: 335 STAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVEAGSVCHTPVVAVDFYPTLLDMA 394 Query: 407 DISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQS 466 + + +DG SLLP L+ + +D I F Sbjct: 395 GLPLKSHQHVDGKSLLPLLKGDQA----------------YDRGPIFF------------ 426 Query: 467 DDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKE 525 YPH D + VR DY L+ E+ + LY L D+ + +L+ E Sbjct: 427 -HYPHYGGKGDTP--AGAVRMGDYKLIEFYEDGHVELYNLKNDISETRDLSKTEKDKAAE 483 Query: 526 MQGVVREF 533 MQ ++ + Sbjct: 484 MQKMLHRW 491 >UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVD2_9BACT Length = 631 Score = 144 bits (363), Expect = 9e-33, Method: Compositional matrix adjust. Identities = 134/498 (26%), Positives = 206/498 (41%), Gaps = 124/498 (24%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 ++ KPNI+ + DDLG L + R+ + TP L Sbjct: 30 SRDKPNIVFILCDDLGVNDL----------SCYGRK----------------DQQTPNLD 63 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV--------------YSNTDAQDG 159 L EG+RFT Y A + SRAAIMTG+AP R + + + + Sbjct: 64 RLAGEGMRFTCAYCASPICSASRAAIMTGKAPGRVHITNFLPGRADAPSQKFIQPEIEGQ 123 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 +PL E + + GY +A +GKWHL + + P N Sbjct: 124 LPLEENTIAKALHGAGYVSACIGKWHL------------------------GGKGFLPTN 159 Query: 220 RGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 +GFDY HA PS + G +LT EA +++ K D PF LY Sbjct: 160 QGFDYAFAGHA-----NTKPSATEG-------GKGEYELTAEAERWLEKNK--DHPFFLY 205 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA+N+PH+P P+ +K + + Y A + S+D V RI++++ + G + TI Sbjct: 206 LAHNSPHVPL-AAKPELIEKHKDAWNPI---YAAMIESLDDCVGRIMKKVDELGLTEKTI 261 Query: 340 ILFTSDNGAVIDGPLP-----LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-I 393 +FTSDNG + LP N + K GG P+ + W GK++ G ++ + Sbjct: 262 FIFTSDNGGLHVYELPNTPSTYNAPFRAGKGYLEEGGLREPLIVRWPGKIKAGATNETPV 321 Query: 394 SAMDFYPTALDAADISIPKDLK-LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 DF PT + AA + + + LDGV++LP L P + L W Sbjct: 322 VLYDFMPTLMTAAGLDVAHTVGPLDGVNILPLLTGGTI--PPRTLYW------------- 366 Query: 453 PFWDNYHKFVRHQSDDYPHNPN-TEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 H PN T S+ + +R+ ++ L+ E L LY + D Sbjct: 367 ------------------HFPNYTNQGSKPAGAIRDGEWKLIQDDETGNLELYNIAADPG 408 Query: 511 QKDNLAAANPQVVKEMQG 528 +K++LA + V E+QG Sbjct: 409 EKNDLAKSQSARVSELQG 426 >UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7_9SPHI Length = 589 Score = 144 bits (363), Expect = 9e-33, Method: Compositional matrix adjust. Identities = 131/452 (28%), Positives = 191/452 (42%), Gaps = 102/452 (22%) Query: 35 LKATKTNVAFSD-----FTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 +K+ K +++F F ++++ KP NII++ DD GYG F N+ Sbjct: 3 MKSLKKHLSFLTILLLVFCASKFTFAQKPPNIILIITDDQGYGDFGFTG---------NK 53 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 V STPT+ L + FTN YV+ V P+RA++MTGR R Sbjct: 54 HV-----------------STPTIDQLAENSFEFTNFYVSP-VCAPTRASLMTGRYSLRT 95 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 G+ + + E + EL Q Y + GKWHL N P+ Sbjct: 96 GIRDTYNGGAMMSPDEITIAELLQKSDYTSGIFGKWHLG--DNYPM-------------- 139 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAG--------TAY------YNSPSLFKNRERVPAKGYI 254 +P ++GFD + H +G T Y Y P L+ N + +GY Sbjct: 140 -------RPSDQGFDESL-IHLSGGMGQVGDFTTYFQKDRSYFDPVLWHNNRQESYQGYC 191 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT------- 307 SD AI +++ K DQPF YL++NAPH P P + YQK N + T Sbjct: 192 SDIFASAAIEFIEKNK--DQPFFTYLSFNAPHTPLQVPE-EYYQKYKNIDTSTGYESDER 248 Query: 308 ------------ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP 355 A YA V ++D +K + +LK+ D TII+F +DNG L Sbjct: 249 PFYPMSDSQKEEARKVYAMVENIDDNLKNLFAKLKELEIEDETIIIFLTDNGPQQQRYL- 307 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDL 414 +G K Y GG TP+ + KL +SA +D PT D I +P D Sbjct: 308 --AGLRGLKGNVYQGGIRTPLLIHIPEKLSENRKINTLSAHIDILPTIADLVGIQLPLDR 365 Query: 415 KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 K+DG SLLP L + +++L +S+W Sbjct: 366 KIDGKSLLPLLIGEVDSFENRSL-----FSYW 392 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 144 bits (362), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 122/449 (27%), Positives = 193/449 (42%), Gaps = 97/449 (21%) Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG----VYSN--TDAQDGIP 161 +TP L L G+RFT+ + + V P+RA ++TGR R G VY+N + G+ Sbjct: 19 NTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAGIDGVVYANPKKNRHHGLQ 78 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E L + Q+ GY T GKWHL ++ P RG Sbjct: 79 KNEITLAQCLQDAGYQTGMFGKWHLG-----------------------YQRQYNPTFRG 115 Query: 222 FDYFMGFHAAGTAYY---NSPSLFK-------NRERVPAKGYISDQLTDEAIGVVDRAKT 271 F F+G+ + Y+ + +F NRE +GY++ + D A+ + + + Sbjct: 116 FQQFVGYVSGNVDYFAHLDGTGVFDWWHNAELNREE---QGYVTHLINDHALEFIRQQQ- 171 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-------SQTADNYYASVYSVDQGVKR 324 ++PF +Y+A+ A H P P DQ ++ G A+ Y +D+G+ + Sbjct: 172 -EKPFFVYIAHEAVHSPYQGPH-DQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKGIGQ 229 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 I++ LK+ + T I F SDNGA +G NG +G+K + GG P W G++ Sbjct: 230 IVDVLKEVNLTEKTFIFFLSDNGANKNGS---NGKLRGFKGSLWEGGHRVPAIACWPGRI 286 Query: 385 QPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G D+ + ++D PT L+ A+ IP KLDGVSL+ L+D+K P + Sbjct: 287 PEGTVCDEPVISIDLMPTILELANAKIPAGHKLDGVSLVSLLKDRKSLVPRQ-------- 338 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ-LG 502 FW+ K +R + LV + + Sbjct: 339 ---------IFWEYNGK----------------------SAMRQGHWKLVLNQTRKEPIE 367 Query: 503 LYKLT-DLQQKDNLAAANPQVVKEMQGVV 530 LY LT D+ + NLA PQ V++MQ + Sbjct: 368 LYDLTRDMSESKNLADNQPQRVQQMQSAL 396 >UniRef50_A6DMY7 Iduronate-sulfatase and sulfatase 1 n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY7_9BACT Length = 483 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 131/421 (31%), Positives = 188/421 (44%), Gaps = 89/421 (21%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNIIV+ DD GY +GS + TP L +L Sbjct: 29 PNIIVIVTDDHGYADFGAYEGS------------------------SPDLKTPHLDALAK 64 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYY 177 G TNGY PSRAAI T R RF + N A + + ET + E ++ GY Sbjct: 65 NGAVVTNGYSTAPQCTPSRAAISTSRYQTRFALDDNDLAP--MDINETTIAEKLKDAGYT 122 Query: 178 TAAVGKWHLS-----------------KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 T VGKWHL+ K V +PE+ +R Y P +R Sbjct: 123 TGFVGKWHLNPNRLSTLWMKDHYSEGLKQKKVRIPEN-ISRPYF------------PMSR 169 Query: 221 GF-DYFMGFHAAGTAYYNSPSL----FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 G+ DY+ G A + Y + L K++ V K + D+ TD A+ +D K ++P Sbjct: 170 GYSDYYDG---ALSTYLRNYDLQGKAIKHQREVDKKTFRVDKQTDAALAFLD--KNHNKP 224 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNT--GSQTADNYY--ASVYSVDQGVKRILEQLKK 331 F L+L Y APH+P + +K F+ G T + AS+ ++D G+ ++E L+K Sbjct: 225 FFLHLNYYAPHVPME-----VVKKHFDRFPGEMTERRRWGLASLAAIDDGIGAVMESLRK 279 Query: 332 NGQYDNTIILFTSDNGAVI-----DGPL--------PLNGAQKGYKSQTYPGGTHTPMFM 378 +NTI+ + +DNGA + D P LNG G K GG P + Sbjct: 280 YKIEENTIVFYFADNGAPLKIKKEDLPFNAAGGWSGSLNGELVGEKGMISEGGVRVPYLV 339 Query: 379 WWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT 438 +WKGK+ Y K +S MD TAL A +++ K +LDGV L+P+L K + PH+ L Sbjct: 340 YWKGKIPAQVYHKPVSTMDAGATALALAGVTVKKG-ELDGVDLMPYLSKKNKSNPHEYLY 398 Query: 439 W 439 W Sbjct: 399 W 399 >UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA41_PLALI Length = 519 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 131/497 (26%), Positives = 198/497 (39%), Gaps = 98/497 (19%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 +K +PNII++ DD GYG L ++ VV TP L Sbjct: 42 SKTRPNIILMMTDDQGYGDL----------SLHGNPVV----------------KTPHLD 75 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 L + VRF +V+ P+RA+IMT R GV ++ + L T LP+ + Sbjct: 76 QLGRQSVRFEQFHVS-PTCAPTRASIMTSRHEFSSGVTHTILERERLSLKATILPQFLKR 134 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD--YFMGFHAA 231 GY T GKWHL + +QP RGFD + G Sbjct: 135 AGYTTGIFGKWHLG-----------------------DEDAYQPGKRGFDEVFIHGGGGI 171 Query: 232 GTAY-----------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G +Y Y +P + N + V GY + D+AI + ++ +QPF Y+ Sbjct: 172 GQSYPGSCGDAPLNKYFNPVIRHNGKFVATNGYCTKVFVDQAITWIS-SQPDNQPFFCYI 230 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 NAPH P D P + Y+ + +Y + D + R+L+ L+ +TI+ Sbjct: 231 TPNAPHAPLDCPK-EYYEPYLEHVPEDVARFYGMITHWDDQLGRLLKALEDRDISKDTIV 289 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYP 400 +F +DNG+ G + + K Y GG P F W G QP ++ D P Sbjct: 290 IFMTDNGSAT-GAKHFSAGMRANKGTPYEGGIRVPAFWSWAGHWQPQVRQEVTCHYDILP 348 Query: 401 TALDAADISIPKDLK--LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 T + A++ + D K G SL+P L + P + IT W E Sbjct: 349 TLTELANVPVADDEKQSWQGRSLVPLLAGRSPNWPPRPF--ITHVGRWPKE--------- 397 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSY---TVRNNDYSLVYTVENN--QLGLYKLT-DLQQK 512 H+P E S + Y +R D+ L+ V+ Q LY+L D +K Sbjct: 398 ------------HDPKREP-STYQYAKCAIRLGDWKLISNVKQGEPQWELYQLAEDPAEK 444 Query: 513 DNLAAANPQVVKEMQGV 529 NLA P V+E++ + Sbjct: 445 INLAKKYPDRVEELKKI 461 >UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF72_9BACT Length = 481 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 135/515 (26%), Positives = 208/515 (40%), Gaps = 126/515 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN+I++ +DDLG+ DT G D TP + L Sbjct: 23 KPNVIMILVDDLGW--------------------TDTTCYGSD------LYQTPNVDELS 56 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR-------------FGVYSNTDAQDGIPLT 163 G+RFT+ Y A V P+R++IMTG+ PA + + + + + Sbjct: 57 RTGMRFTDAYSACTVCSPTRSSIMTGKNPANNNLTDWITGHVKPYAKLKSPNWKMHLTAE 116 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E L E F+ GY T +GKWHL + + W P+N+GFD Sbjct: 117 EITLAEAFKATGYKTVHIGKWHLGE----------------------ESVSW-PENQGFD 153 Query: 224 Y-FMGFHA------AGTAY---YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 GF A G Y YN+P L + P Y++++L EA + L Sbjct: 154 ENIAGFRAGSPSAHGGGGYFSPYNNPRL----KDGPKGEYLTERLAQEASQYIQSTAKLK 209 Query: 274 QPFMLYLAYNAPHLP--NDNPAPDQYQKQFNTGSQTADNYYAS-VYSVDQGVKRILEQLK 330 +PF + L H P D+Y + G Q + YA+ V +D V +++ +K Sbjct: 210 KPFFMNLWLYNVHTPLQARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVK 269 Query: 331 KNGQYDNTIILFTSDNGAV----------IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW 380 G DNTII+F SDNG + + PL K Y GG PM + W Sbjct: 270 DAGIEDNTIIIFNSDNGGLRGNYENNRQKVTSNYPLRSG----KGDMYEGGVRVPMIIKW 325 Query: 381 KGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 K++ G + + D YPT LD I + K +DG+SL+P L + K Sbjct: 326 SRKIKAGQTSSSPVISHDIYPTLLDLCKIDVSKKQDIDGISLVPELLEGKT--------- 376 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 + + +W YPH + E +S +R D+ L++ E + Sbjct: 377 --------IQRDALYW------------HYPHY-HLEGAKPYS-AIRKGDWKLIFLYEES 414 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 LY L D+ +++NLA + + E+ G +R + Sbjct: 415 HAELYNLRNDISERNNLAMTEKRKLAELMGDLRTW 449 >UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01N83_SOLUE Length = 461 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 114/377 (30%), Positives = 162/377 (42%), Gaps = 68/377 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+V+ DDLGYG D + +TP + L Sbjct: 27 QPNIVVILADDLGYG---------------------------DLGCYGSPIATPNIDRLA 59 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-GIPLTETFLPELFQNHG 175 +EG RFT+ Y A V PSRAA+MTGR P R V D G+P +E + ++ ++ G Sbjct: 60 EEGARFTSFYSASPVCSPSRAALMTGRYPTRVEVPVVLGPGDAGLPDSEITMAQVLKSAG 119 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T+ +GKWH+ S + P NRGFD F G + Sbjct: 120 YRTSCIGKWHIG-----------------------STPGYLPTNRGFDEFFGVPYSAD-I 155 Query: 236 YNSPSLFKNRERVPA--KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA 293 P + + PA ++ T EA+ + RA+ D PF LYLA+ APHLP A Sbjct: 156 TPCPLMRGSSVVAPAVDCSTLTSSFTQEALDFMRRAQ--DNPFFLYLAHTAPHLPLA--A 211 Query: 294 PDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 ++ Q G Y V +D +++ LK G NT+++F+SDNG G Sbjct: 212 SPRFAGQSGLG-----MYADVVQELDWSTGQVMAALKATGLDSNTLVMFSSDNGPWYQGS 266 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPK 412 G +G K +TY GG P + G + G L + MD PT A P Sbjct: 267 ---QGKLRGRKGETYEGGMREPFLARYPGVIPSGIGCAGLATTMDLLPTLARLAGAQTPS 323 Query: 413 DLKLDGVSLLPWLQDKK 429 + LDGV + P L ++ Sbjct: 324 N-PLDGVDIWPVLTGER 339 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 143 bits (361), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 103/349 (29%), Positives = 151/349 (43%), Gaps = 56/349 (16%) Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--------FGV 150 D ++ TP + L EG+ F Y A + PSR I+TGR P R Sbjct: 42 DVGCYGSEIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNPTRSQDQLLGALMF 101 Query: 151 YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 S+ D GI ET + ++ Q +GY TA +GKWHL Sbjct: 102 MSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGH---------------------- 139 Query: 211 SAEEWQPQNRGFDYFMGFHAAGT------AYYNSPSLFKNRERVPAKGYISDQLTDEAIG 264 E + P GFD F G H G Y N P + N+ V GY +D +T+EA Sbjct: 140 GTESFLPTAHGFDLFRG-HTGGCIDYFTMTYGNIPDWYHNQRHVSENGYATDLITEEAEH 198 Query: 265 VVDRAKTLDQPFMLYLAYNAPHL-----PND-------NPAPDQYQKQFNTGSQTADNYY 312 + +T D+PF L+L+YNAPH P D D ++ + + Sbjct: 199 FLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTIKDKVRREFA 258 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPG 370 A S+D G+ R++ LK NG NT+++F +D+G V G N +G K+ + G Sbjct: 259 AMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGGDYVYGGN---NQPFRGAKATLFEG 315 Query: 371 GTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDG 418 G P + W GK++ G ++ A+D +PT A++ L LDG Sbjct: 316 GIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFANVDT-DGLTLDG 363 >UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C176_9PLAN Length = 599 Score = 143 bits (361), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 123/431 (28%), Positives = 186/431 (43%), Gaps = 87/431 (20%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V+ D P + GKPNII++ DD GYG D A Sbjct: 16 VSLKD-CPADTPDSGKPNIILVITDDQGYG---------------------------DIA 47 Query: 102 IEAAQK-STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGI 160 Q TP L L + +R TN +V P+R+A+MTGR R GV+ + + Sbjct: 48 AHGNQMIKTPNLDQLYQKSLRLTNFHV-DPTCAPTRSALMTGRYSTRTGVWHTIMGRSLM 106 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV-PEDK---QTRDYHDNFTTFSAEEWQ 216 E L E+F+++GY T GKWHL N P+ P+D+ + + ++WQ Sbjct: 107 DTNEVTLAEVFKSNGYRTGLFGKWHLG--DNYPLRPQDQGFGTVVQHGGGGVGQTPDDWQ 164 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 DYF S + +N + +GY +D DEA+ ++ +T +PF Sbjct: 165 N-----DYF------------SDTYLRNGKPEKFQGYCTDIWFDEALKFIEADRT--KPF 205 Query: 277 MLYLAYNAPHLPN--DNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 YL+ NAPH P D D Y+ + A +Y + ++D+ + R+L LK++G Sbjct: 206 FAYLSTNAPHSPYLVDPEYSDPYEDKGVPKKMAA--FYGMITNIDENMGRLLRYLKESGL 263 Query: 335 YDNTIILFTSDNGAVIDGPLP--------------------------LNGAQKGYKSQTY 368 NTI++F +DNG P N +G K Y Sbjct: 264 EKNTILIFMTDNGTAAGLQRPSTEDLSKKQQRRLSKGKPITLETWPGFNARMRGTKGSEY 323 Query: 369 PGGTHTPMFMWW-KGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 GG P ++ W +G L G N ++L + +D PT D D++I +LKLDG SL+P L Sbjct: 324 DGGHRVPCYIHWPQGGLTGGKNINQLTAHIDILPTLADLCDLTISSELKLDGTSLVPILT 383 Query: 427 DKKQGEPHKNL 437 K ++ L Sbjct: 384 GNKDALRNRTL 394 >UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XI1_PSEA6 Length = 510 Score = 143 bits (361), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 141/533 (26%), Positives = 217/533 (40%), Gaps = 142/533 (26%) Query: 57 KPNIIVLTMDDLGYGQL-PFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPN++++ +DDLGY + +++ SF TP + L Sbjct: 38 KPNVLLILVDDLGYSDIKAYNENSF--------------------------YDTPNIDKL 71 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGR---------------APARFGVYSNTDAQDGI 160 + V FTNGY A+ V PSR A++TG+ PAR G + + D + Sbjct: 72 ASQSVMFTNGYAANPVCSPSRFALLTGKHPTRGKATDWFPANDKPARAGRFLPAEFNDAL 131 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 PL+E L E F+ +GY TA +GKWHL K E+ P+N+ Sbjct: 132 PLSEITLAEAFKQNGYNTAFLGKWHLGK-----------------------TEDLWPENQ 168 Query: 221 GFDYFM-----GFHAAGTAYYNSPSLFKNRERV--PAKGYISDQLTDEAIGVVDRAKTLD 273 GFD + G AAG Y SP +KN P Y++ +LT+EAI +VD+ Sbjct: 169 GFDVNIAGTKNGHPAAG---YFSP--YKNARLTDGPKGEYLTQRLTNEAISLVDKYSKQT 223 Query: 274 QPFMLYLAYNAPHLPNDNPAPD--QYQKQFNTGS-----------------------QTA 308 PF + L++ H P P D +YQ + + Q Sbjct: 224 VPFFMMLSFYTVHTPLAAPNKDVQEYQAKIRQYAHNDEFQREEQVWPTAEKREVRVKQNH 283 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQ 366 Y A V +D V R+L +LK+ G ++T+++FTSDNG + +G N +G K Sbjct: 284 PTYAAMVKQMDTQVGRLLAKLKQAGMEESTLVVFTSDNGGLSSAEGSPTSNLPLRGGKGW 343 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYDKL-----ISAMDFYPTALDAADISIPKDLKLDGVSL 421 Y GG P+ + KL + L +++ D YPT L A + + LDGV L Sbjct: 344 LYEGGIRVPLLV----KLPQKKHKHLQINEPVTSTDLYPTLLSAGHLDLLPQQHLDGVDL 399 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 + G L Y H YPH N Sbjct: 400 NQYFSP---GAKRDALMRRPLYFH-----------------------YPHYSNQGGFP-- 431 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREF 533 +R ++ L+ E+ ++ LY L D+ ++ +LA P+ V ++ + E+ Sbjct: 432 GAAIRQGNWKLIERFEDGKVHLYNLANDIGEQIDLANQAPERVASLRKKLHEW 484 >UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UHK0_RHOBA Length = 478 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 127/483 (26%), Positives = 201/483 (41%), Gaps = 93/483 (19%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN +++ DDLGYG + +++ TP L L Sbjct: 43 PNFVLIFADDLGYGDI--------------------------SCYDSSGVKTPHLDQLAA 76 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG--VYSNTDAQD----GIPLTETFLPELF 171 EG R + +V V PSRAA++TGR P R G V N + G E +PEL Sbjct: 77 EGFRSKDFFVPANVCSPSRAALLTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELL 136 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA- 230 GY + VGKWHL E P + GFD ++G + Sbjct: 137 GPAGYRSLMVGKWHLG----------------------MELEGSHPLDAGFDEYLGIPSN 174 Query: 231 -AGTAYYNSPSLFKNRERVPAKGYISDQL----TDEAIGVVDRAKTLDQPFMLYLAYNAP 285 N +L++ ++ V K ++L TDE I ++R K D PF +Y++++ Sbjct: 175 YEPRRGKNHNTLYRGKQ-VEQKNVACEELTKRYTDEVIDFIERQK--DDPFFIYVSHHIV 231 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 H P P+PD G+ Y + +D RI++ ++ G +NT+++FTSD Sbjct: 232 HNPL-KPSPD------FVGTSEKGKYGDFIKELDHSTGRIMQTIRDAGLDENTLVIFTSD 284 Query: 346 NGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALD 404 NG +G +G G K T GG P W K+ P D +++MD P + Sbjct: 285 NGPTRNGS---SGELSGGKYCTMEGGHRVPGMFRWTSKIAPNQVSDVTLTSMDLLPLFCE 341 Query: 405 AADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRH 464 A + IP D ++DG S+LP L + PH+ L + + E + + Sbjct: 342 LAGVPIPDDRQIDGKSILPVLLGQTSESPHQFLYYYNGTNLQAVREG-----KWKLHLPR 396 Query: 465 QSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVV 523 +DD P D ++ T+ N++ L+ L DL +K N+A +P++V Sbjct: 397 TTDDQPFWSKKPDKTKGFVTL-------------NEMRLFNLDRDLGEKKNVADRHPEIV 443 Query: 524 KEM 526 + Sbjct: 444 ARL 446 >UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT Length = 477 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 137/516 (26%), Positives = 215/516 (41%), Gaps = 114/516 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+ + DDLGY D + K E TP + L Sbjct: 22 KPNIVFILADDLGYT----DVACYGSKYYE----------------------TPNIDKLA 55 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---------------NTDAQDGIP 161 +G++FT+G+ P+RA++M+G+ R GVY+ + +P Sbjct: 56 KDGIKFTDGHTCGPNCQPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLP 115 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 L + L + + GY T GKWHL EDK E P RG Sbjct: 116 LDKITLAQSLKKAGYATGMFGKWHLG--------EDK---------------EHHPAQRG 152 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 FD + + +P + P Y++D LTD+A+ + R K D+PF LYL Sbjct: 153 FDEALVSMGVHFDFVTNPKV-----DYPKDEYLADFLTDKALDFIKRHK--DEPFFLYLP 205 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADN-----YYASVYSVDQGVKRILEQLKKNGQYD 336 + A H P A + ++F + Q D Y A + SVD+ V R++ L + D Sbjct: 206 HYAVHKPLQ--AKKELIQKF-SAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSD 262 Query: 337 NTIILFTSDNGAVID---------GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 NT+++F+SDNG V G + N +G K Y GG P W GK+ G Sbjct: 263 NTLVIFSSDNGGVGGYQREGIKKAGDVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAG 322 Query: 388 N-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 D+ I ++D YPT L+ A P+ LDG S L L+ + +++ Sbjct: 323 KVCDQPIISIDLYPTLLELAGAKAPEKYPLDGTSYLKVLKSGGMKKLNRDAI-------- 374 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL 506 +W ++ ++ +D + P VR D+ L+ E+++L LY L Sbjct: 375 -------YW-HFPGYLGAGADTWRTLPVG--------VVRCGDWKLMEFFEDHRLELYNL 418 Query: 507 T-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 DL + +NLAA P+ +E++ + + Q P+ Sbjct: 419 REDLGETNNLAAKMPEKAQELEKKLVAWQKEVQAPM 454 >UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000586CBD Length = 525 Score = 143 bits (360), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 116/402 (28%), Positives = 167/402 (41%), Gaps = 93/402 (23%) Query: 57 KPNIIVLTMDDLGYGQL-PFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PNII+ DDLGYG L P+ + STP L L Sbjct: 24 RPNIIIFYADDLGYGDLEPYGHPT---------------------------SSTPNLGRL 56 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELFQ 172 G+ T Y + V PSRAA++TGR R GVY N + G+PL ET + ++ + Sbjct: 57 AAGGIVLTQFYSSSPVCSPSRAALLTGRYQMRSGVYPHVFNVEMSGGLPLNETLISKMLK 116 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY +AAVGKWHL +N + P N GFD F+G A+ Sbjct: 117 PEGYRSAAVGKWHLGLGNN---------------------SVYLPHNHGFDEFLGLPASP 155 Query: 233 TAYYNSPSLFKN--RERVPAKGYISDQLTDEAIGVVDRAK---TLD-------------- 273 + S + N R P S ++++ TLD Sbjct: 156 SQCRCSVCFYPNVTCHRAPCSPEYSPCALFNGTTIIEQPADLLTLDDKYAMQSRRFIRTN 215 Query: 274 ----QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 PF LY A + H P QY + +G+ + S+ ++D V +I E+L Sbjct: 216 VETGTPFFLYYASHHTHHP-------QYAGKETSGTSIRGRFGDSLAALDWEVGQIYEEL 268 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLN------GAQKGYKSQTYPGGTHTPMFMWWKGK 383 K+NG ++T F+SDNG L L G K K+ TY GG P + W G+ Sbjct: 269 KENGILEDTFFFFSSDNGPS----LSLENFGGNAGLMKCGKATTYEGGIRVPAIVHWPGQ 324 Query: 384 LQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 + PG +L S +D PT + +P ++ LDG + P+L Sbjct: 325 ITPGRSMELSSTLDVLPTIASITNAKLP-NVTLDGYDMSPFL 365 >UniRef50_A9BNY8 Sulfatase n=11 Tax=cellular organisms RepID=A9BNY8_DELAS Length = 457 Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 120/411 (29%), Positives = 169/411 (41%), Gaps = 68/411 (16%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 TE +PNI+ + DDLGY L G + + + V + Sbjct: 12 TERICMSRPNILFIVADDLGYADL----GCYGGRAADFGAV------------------S 49 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---TDAQDGIPLTETF 166 P L L G+R T GY V P+R A+ T R R + G PL E Sbjct: 50 PVLDRLAAGGLRLTQGYANSPVCSPTRFALATARYQYRLRGAAEEPINSKTRGTPLGEKL 109 Query: 167 -LP-------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA---EEW 215 LP + ++ GY TA +GKWHL Y +F + E + Sbjct: 110 GLPPDMPTVASMLRDAGYRTALIGKWHLG---------------YPPHFGPLRSGYEEYF 154 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 P + G DYF ++G L+ E +GY++D L+ ++ V R D P Sbjct: 155 GPMSGGVDYFTHLSSSGQH-----DLWVGEEEHHDEGYLTDLLSQRSVDFVHRMAQGDAP 209 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN-----YYASVYSVDQGVKRILEQLK 330 F L L Y APH P + + G D Y ++ +D+G+ I+E L+ Sbjct: 210 FFLSLHYTAPHWPWETRDDRSTAEALGAGIAHLDGGNIHQYRRMIHHMDEGIGWIVEALR 269 Query: 331 KNGQYDNTIILFTSDNGA-VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389 NGQ DNT+I+FTSDNG PL G K GG P W + PG Sbjct: 270 ANGQLDNTLIVFTSDNGGERFSDNWPLVGG----KMDLTEGGIRVPWIAHWPAVIAPGRS 325 Query: 390 D-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 + +MD+ T LDAA + P+ LDG+SLLP L+ + P + L W Sbjct: 326 SPQHCMSMDWSATVLDAAGVQAPEGHALDGISLLPVLRAEDAEFP-RTLHW 375 >UniRef50_C2FU81 Sulfatase family protein n=2 Tax=Sphingobacterium spiritivorum RepID=C2FU81_9SPHI Length = 461 Score = 142 bits (359), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 116/399 (29%), Positives = 171/399 (42%), Gaps = 79/399 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII + DDLGY L G + ++ STP L + Sbjct: 23 KPNIIFVLTDDLGYSDL----GCYGNPSI----------------------STPFLDKMA 56 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPELFQNH 174 +GVR T+ V PSRA+++TGR +R+ + A++G+P E + E+ + Sbjct: 57 AKGVRATDYMVTSPSCTPSRASLLTGRYASRYNLPDPIGPGAKNGLPAQEVTIAEMLKEK 116 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF---HAA 231 GY+TA +GKWHL E+ P +GFDYF G H Sbjct: 117 GYHTALIGKWHLG-----------------------DHGEYLPNKQGFDYFYGMLYSHDY 153 Query: 232 GTAYYNSPS---LFKNRERV---PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 Y + + +F+N+ V PA +S T+E + + K +PF LY A+N P Sbjct: 154 RDPYVKTDTTIKIFRNQTPVVTRPADSALSRIYTEEVKQYISQQKK-GEPFFLYYAHNMP 212 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 HLP A K + A + +D+ + + L++ G DNTI +F+SD Sbjct: 213 HLPVAFSAESGRMKDLHFAGPLG----AVLEDLDRQLAIMWASLEEQGLADNTIFMFSSD 268 Query: 346 NGAVIDGPLPLNGAQK-------------GYKSQTYPGGTHTPMFMWWKGKLQPG-NYDK 391 NG I+ P+ ++G K G K+QTY GG P +WKG G Sbjct: 269 NGPWIEYPVRMSGDHKTKNWHVGTAGVFRGSKAQTYEGGVRVPFITYWKGHTPEGITLRN 328 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 IS +D PT + S+P LDG S+ L K + Sbjct: 329 AISNVDILPTLAEWTGASVPASRTLDGQSIAALLTSKSE 367 >UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTV0_9BACE Length = 459 Score = 142 bits (358), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 134/520 (25%), Positives = 215/520 (41%), Gaps = 117/520 (22%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 A + F+P E + +PN +++ DD+GYG D G + + ++ Sbjct: 13 CALAAFSPVEMMAQKQPNFVIIVADDMGYG----DVGIYGNEYIK--------------- 53 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------YSNT 154 TP + + EG+ FT+ + VS P+R ++TGR R G+ + Sbjct: 54 -------TPNIDQIAREGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDK 106 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 D + G+P E ++ ++GY TA +GKWHL + ++ Sbjct: 107 DKEVGLPSEEITFAKILGDNGYRTALIGKWHLGYL-----------------------QK 143 Query: 215 WQPQNRGFDYFMGFHAAGTAY------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDR 268 P N GF F+GF + Y Y + E GY + LT + + Sbjct: 144 HHPMNFGFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEMKDMSGYTTTLLTTLSEDYIKE 203 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFN-TGSQTADNYYASVY-----SVDQGV 322 K D+PF LY+A+ APH P P + + G + +D +Y +D V Sbjct: 204 NK--DKPFCLYIAHAAPHSPMQGPDEKAVRTEATPEGDKNSDRSNKEIYKDMVEELDWSV 261 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 RILE LKK +NT ++F SDNG VI+ G KG K + GG P + G Sbjct: 262 GRILETLKKYKLDENTFVVFFSDNGPVINNGGSA-GGYKGAKGSPWEGGHRVPGICYMPG 320 Query: 383 KLQPG-NYDKLISAMDFYPTALDAADISI-PKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 ++ G ++ + + D +PT LD ADI KLDG SL+P +GE NL Sbjct: 321 TIKEGTTCEQTVMSFDLFPTMLDMADIHYDDSKKKLDGTSLVPLF----KGE---NLA-- 371 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + FW N +K + +VR+ + LV + Sbjct: 372 ---------PRLLFWGNGNKTI---------------------SVRDGKWKLVRYNQKGG 401 Query: 501 LGLYKLTDLQ----QKDNLAAANPQVVKEMQGVVREFIDS 536 + L+ L DL +K+NL+ P++V+ + + + +S Sbjct: 402 ITLH-LFDLNNDPYEKNNLSKQEPELVERLDKEITRWAES 440 >UniRef50_A6DR15 Arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DR15_9BACT Length = 526 Score = 142 bits (358), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 127/451 (28%), Positives = 182/451 (40%), Gaps = 123/451 (27%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 +P Y +PNIIV+ DD+GY L G Sbjct: 32 SPMSYDVASRPNIIVILADDMGYSDLGCYGGEI--------------------------- 64 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ------DGIP 161 TP + +L EGVRFT G+ PSRA+++TGR GV + Q + Sbjct: 65 QTPNIDALAREGVRFT-GFKNTARCTPSRASLLTGRYSHSVGVGAMQQDQHLPGYRGQLS 123 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 + E+ + HGY T VGKWH + + KQ + P +RG Sbjct: 124 ADAPTIAEILKPHGYATGVVGKWHQAVTG-----KSKQKPLF-------------PLDRG 165 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVP------AKGYISDQLTDEAIGVVDRAKTLDQP 275 FD+F G Y++ + KN E +P A Y++ L+D AI VD P Sbjct: 166 FDFFYGTWWGAKDYFSPKFMMKNSEHIPDSTTYPADFYLTHALSDSAIEFVDAQVGQQNP 225 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQK------------------------------------ 299 F LYLA+ APH P PA D+ QK Sbjct: 226 FFLYLAHYAPHAPIQAPA-DRIQKCIDRYKAGFVKLQQERFERQQELGVAPDNAATIDQS 284 Query: 300 -QFNTGSQTADN--------YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 ++NT S+ N Y A + +D+G+ ++++ LKKNGQYDNT+IL SDNG+ Sbjct: 285 SKWNTLSEADKNEWVTTMATYGAMIEIMDEGIGQLIDVLKKNGQYDNTLILVLSDNGSTP 344 Query: 351 DGPLPLNGAQ----------KGYKSQTYPGGTHTPMFMWWKGKLQ--PGNY-DKLISAMD 397 + N A G K+ GG +P+ + W KL+ G + +D Sbjct: 345 NHKGTRNLANLCATLSNTPFSGVKAHALEGGISSPLIVSWPDKLKEYAGQIRNGRCHIID 404 Query: 398 FYPTALDAADISIP------KDLKLDGVSLL 422 PT LDAA P K ++ DG++L+ Sbjct: 405 ILPTCLDAAGAKFPDAFKGIKPVQADGINLM 435 >UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA Length = 496 Score = 142 bits (357), Expect = 4e-32, Method: Compositional matrix adjust. Identities = 132/503 (26%), Positives = 209/503 (41%), Gaps = 99/503 (19%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII++ DD GYG L F TP L L Sbjct: 35 PNIILVMTDDQGYGDLGCHGHPF--------------------------LKTPNLDRLHS 68 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYY 177 E RF + +V+ P+R+A+M+GRAP + GV +D + LT T + E+ ++ GY Sbjct: 69 ESTRFNDFHVSP-TCAPTRSALMSGRAPFKNGVTHTILERDRMALTSTTIAEVLKSAGYT 127 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY------------F 225 T GKWHL + +QP RGFD F Sbjct: 128 TGIFGKWHLG-----------------------DEDAYQPDRRGFDETFIHGAGGIGQNF 164 Query: 226 MGFH--AAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVD-RAKTLDQPFMLYLAY 282 G A GT+Y+N P + N V +GY +D +A+G + + K+ +PF Y+ Sbjct: 165 AGSQSDAPGTSYFN-PIIKHNGTFVQTEGYCTDVFFQQALGWIRLQTKSDTKPFFAYIPT 223 Query: 283 NAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 NAPH P +Y +F + S + + ++D + +++ +L + DNT+++ Sbjct: 224 NAPHAPYK--VEKRYSDRFRDKCSSPQSEFLGMIVNIDDNMGKLMGKLDEWDLADNTLLI 281 Query: 342 FTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYP 400 F +DNG+ G N KG K GG+ P+FM G G + + + +D +P Sbjct: 282 FMTDNGSA-KGSKIYNAGMKGGKGTVNEGGSRVPLFMRLPGFTNSGVDIETMTRHVDLFP 340 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 T + A IP + LDG SL+ +++ + L W H F + W Sbjct: 341 TLAEIAHAEIPAEADLDGRSLVSLIKNPQ-------LDW----DHRFQFFHSGRWAKAGL 389 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV----YTVENNQLGLYKLTDLQQKDNLA 516 + D PN + +Y VR+ + LV Y +EN D + ++A Sbjct: 390 KGKFGKGD----PNPDHSKHKNYAVRDEKWRLVNGELYDLEN---------DPGETADVA 436 Query: 517 AANPQVVKEMQGVVREFIDSSQP 539 ++P+VV M E+ D +P Sbjct: 437 GSHPEVVSRMLVAFDEWWDEVRP 459 >UniRef50_B1KFX9 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KFX9_SHEWM Length = 548 Score = 142 bits (357), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 116/438 (26%), Positives = 186/438 (42%), Gaps = 99/438 (22%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+++ DD+G V T+ G+ IE TP + L Sbjct: 63 PNIVLILADDMGIND------------------VSTFGGGM---IE-----TPNIDKLAA 96 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG------------------ 159 +G FTNGY H PSRAA++TGR R G Y T DG Sbjct: 97 KGALFTNGYSGHANCAPSRAALLTGRDATRTG-YDTTPIPDGMSRIIAAIENNEDNGRPE 155 Query: 160 ------------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 +P +E +PE+ + GY+T +GKWHL + PE Sbjct: 156 MSYSAEADATNPTYDNRGLPGSEILIPEILKESGYHTMHIGKWHLGR-----SPEMMPNA 210 Query: 202 DYHDNFTTFSAEEWQP-----------QNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA 250 D + + P ++ G D F+ + +N +FK Sbjct: 211 QGFDESLMMDSGLYLPVDHPESVNAPVESSGLDRFIWATMRYSVNWNGGEIFK------P 264 Query: 251 KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN 310 GY++D T+EA ++ ++PF LYLA+ PH P D Y+ + Sbjct: 265 NGYLTDYFTEEAEKAIE--ANANRPFFLYLAHWGPHNPVQAKRAD-YEAVGDIQPHNKRV 321 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG-----AVIDGPLPLNGAQKGYKS 365 Y A + S+D+ V+R++ +L+K G DNTI++ +SDNG A+ D LN +G+K+ Sbjct: 322 YAAMLRSIDRSVERVMAKLEKQGIADNTIVILSSDNGGADYVAIND----LNKPYRGWKN 377 Query: 366 QTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 + GG P + W + ++ ++ +D PT ++ A+ +P+D ++DGV + P Sbjct: 378 TFFEGGIRVPFSVTWPNVIDESTVIEEPVNHIDLMPTIINMANADLPQDREIDGVDIAPL 437 Query: 425 LQDKKQGE-PHKNLTWIT 441 Q + + E P + W T Sbjct: 438 WQGQPELERPQNAMFWFT 455 >UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R921_9PLAN Length = 676 Score = 142 bits (357), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 124/423 (29%), Positives = 187/423 (44%), Gaps = 73/423 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN++ + DD+G+G L ++ TP + L Sbjct: 52 KPNVVYILADDVGWGDL---------------------------SVHGGGVPTPNIDKLF 84 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 +G+ ++ ++ V P+RA +TGR P R G + + L ET + E F+ +GY Sbjct: 85 AQGIEVSH-FMGWCVCSPTRAMFLTGRHPIRVGTGPEVGGE--LSLDETTIAEGFKANGY 141 Query: 177 YTAAVGKWHLSKISNVP------------VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 T GKWH + P +P + + N F E W G D+ Sbjct: 142 RTGVFGKWHSGSDPDTPAFRAAFAEAFKAIPNKQFAGGHGANAHGFD-EAWVYYGGGADF 200 Query: 225 FMGFHAAGTAYYNSPSLFKNRERVPA-KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 F G S + NRE P +GY D +T AI + K DQPF Y+ ++ Sbjct: 201 FNRRTVQGRGPV---SWWHNREFRPDDEGYTDDLVTQRAIEFIRENK--DQPFFCYVPFH 255 Query: 284 APHLP-----NDNPAPDQ--YQKQFNTGSQTADN----YYASVYSVDQGVKRILEQLKKN 332 H P ND A D K +T+D + A ++S+D + I ++L+K Sbjct: 256 IAHAPLQAKENDLAAIDSKTAAKLPTASGKTSDEGKHIHAAMLHSMDNNIAAIRDELEKL 315 Query: 333 GQYDNTIILFTSDNGAVIDG-PLPLNGAQKGYKSQTYPGGTHTPMFMWW-KGKLQPGN-Y 389 G DNTI +FTSDNGA+ G LPL +G+K Y GG P ++W KG L G + Sbjct: 316 GLSDNTIFVFTSDNGAMEAGSSLPL----RGHKHTIYEGGVRLPTAIYWPKGGLTGGRKW 371 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + L A+D +PT + D ++PK LDG ++ P L+D Q P ++ +I W DE Sbjct: 372 NGLCGALDMFPTLMAMTDSTMPKTQPLDGKNVWPALRD-NQPSPVESYYFI-----WHDE 425 Query: 450 ENI 452 + I Sbjct: 426 DAI 428 >UniRef50_A6DM48 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM48_9BACT Length = 484 Score = 141 bits (356), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 142/527 (26%), Positives = 235/527 (44%), Gaps = 106/527 (20%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 +K KPNII++ DD+ +G+L S +PK TP + Sbjct: 33 SKSKPNIILVLTDDMAWGELGM---SGNPKI-----------------------KTPNID 66 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 L E +RFTN VA PSRA IM+G+ GV + + T LP++ + Sbjct: 67 RLSKESLRFTNFNVAP-TCAPSRAQIMSGKHEFSVGVTHTILDRMNLRDDITILPQIMKQ 125 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM-GFHAAG 232 GY T VGKWHLS+ P K T + + +P RGFD + F+ G Sbjct: 126 GGYQTGMVGKWHLSE------PGHK---------TGLTGKPLEPHRRGFDTAIYTFNQLG 170 Query: 233 TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP 292 +P+L N + +GY D + DE I ++ + ++P+ YLA + PH P Sbjct: 171 RF---NPTLSHNGKNSKYEGYCGDVVFDEGIKWMESC-SKEKPYFAYLATSIPHTPL--A 224 Query: 293 APDQYQKQFNTGSQTADN---YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 AP +Y K +G++ +N YYA + +VD+ + +++ + TI++F +DNG Sbjct: 225 APQRY-KDLYSGAKLKNNEKNYYAMISAVDENIGKLMTWMASRKDDRETILIFMTDNGHA 283 Query: 350 IDGP----------LPLNG----AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LIS 394 I GP L NG +G K+Q++ G T P + W G + L S Sbjct: 284 ISGPDGAGHSRDGRLKKNGLYNFGFRGGKTQSWRGATCVPFLIRWPGVTTSNTENNTLAS 343 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 MD PT + A + I DL + GVSLLP ++ +K L +SH + Sbjct: 344 GMDILPTFAEIAGVGI-DDLGVQGVSLLPDIKGEKSNVTKDRLL----FSH------VGR 392 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR-----NNDYSLVYTVENN-QLGLYKLTD 508 W N DL + +Y R NN Y L + + + +L Y L D Sbjct: 393 W------------------NASDLME-TYKYRYAAIFNNRYRLTWGEKGHPELKDY-LND 432 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL-SEVNQEKFNNIKK 554 +++ ++ + +P++V+ + ++ ++++ + ++++Q K IK+ Sbjct: 433 REEEKDITSEHPELVQRFKQEYEKWWEAAKTGMVNDLHQLKTGKIKR 479 >UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7HQ00_PARL1 Length = 553 Score = 141 bits (356), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 138/549 (25%), Positives = 218/549 (39%), Gaps = 127/549 (23%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+V+ DDLG+ + G P TP + S+ Sbjct: 71 PNIVVILADDLGFNDISHFGGGIVP--------------------------TPNIDSIAR 104 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG--------------------------VY 151 G FT+ Y PSRA IMTGR R G + Sbjct: 105 GGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEML 164 Query: 152 SNTDA--------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 + +A + G+P +E L E + GY+ +GKWHL PE Sbjct: 165 VDREAAAKAPPFREQGLPGSEITLAEALKPKGYHNIHIGKWHLGN-----APEFLPNAQG 219 Query: 204 HDNFTTFSAEEWQPQNRG--------FDYFMGFHAAGTAY---YNSPSLFKNRERVPAKG 252 D + + P++ FD F A Y YN + F+ KG Sbjct: 220 FDESVMLESGLFLPEDSPDVVNAKLPFDPIDQFLWARMQYATSYNGSAWFE------PKG 273 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 Y++D TDEAI ++ + ++PF LYLA+ H P D Y + + Y Sbjct: 274 YLTDFYTDEAIKAIEANR--NRPFFLYLAHWGVHTPLQASKAD-YDALSHIEDERLRVYA 330 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKGYKSQTYPGG 371 A + ++D+ V R+L+ LK+NG +NT+++F+SDNGA LP +N +G+K + GG Sbjct: 331 AMIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPGYIGLPDVNKPYRGWKLTFFEGG 390 Query: 372 THTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P F W ++ G ++ +D +PT + AA +P D +DG+ LLP+ ++ Sbjct: 391 IRVPFFAKWPARIPAGTERTTPVAHLDMFPTIVAAAGGELPADRVIDGIDLLPYAARGEK 450 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 P FW + H +Q+ V+ + + Sbjct: 451 PAPRPI-----------------FWRDGH----YQA------------------VQADGW 471 Query: 491 SLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKF 549 L N+ L+ L TD +++N+A NP+ V E++ +V + + PL E Sbjct: 472 KLQMAERPNKTWLFNLKTDPTEQNNVADENPEKVAELKALVEAHNATQREPLFPAVAEMP 531 Query: 550 NNIKKALSE 558 + K L E Sbjct: 532 VTVDKTLEE 540 >UniRef50_Q7UTJ1 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirellula baltica RepID=Q7UTJ1_RHOBA Length = 637 Score = 141 bits (356), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 143/504 (28%), Positives = 209/504 (41%), Gaps = 119/504 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNIIV DD G+ L D K TP + +L Sbjct: 57 RPNIIVFYTDDHGHADLSCQGVLTDIK-------------------------TPNVDALA 91 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 GV NGY PSRA ++ G+ ++FGV +N + +G +E + E Q GY Sbjct: 92 KSGVLARNGYSTAPQCVPSRAGLLIGKFQSKFGVEANGASLEGFN-SELTIAERLQKAGY 150 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 TA GKWHL P ++ T + GF + ++ + Sbjct: 151 VTAQFGKWHLG-------PGNRIT------------------DHGFKHVFNQNSGASFSA 185 Query: 237 NSPSLFKNRE--RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP 294 N S +RE + + Y D + A ++DR D PF LY+AY APH+P D AP Sbjct: 186 NIDSDGHDREMSSLRPEMYHIDGCSKAAASIIDRYN--DDPFFLYVAYRAPHVPLD--AP 241 Query: 295 DQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--- 350 +Y +F + A + +VD GV I + L KN + T+I + DNGA + Sbjct: 242 RKYLDRFPGEMPERRRQALAMLSAVDDGVGLITDTLAKNNLTEKTLIFYIGDNGAPLKIH 301 Query: 351 -----------DGPL--PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAM 396 DG L PLNG +KG S+ GG H P + W GK+ G YD ISA+ Sbjct: 302 KLDAPGGGPGWDGSLNDPLNG-EKGMLSE---GGMHVPFVISWPGKIPAGQVYDHPISAL 357 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D T A+I D DGV+L+P+L +K PH+ L W W + I Sbjct: 358 DVAATTAAIANIPAQPD-DFDGVNLVPFLTGEKSDAPHEFLAW-----RWIAQAAI---- 407 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLA 516 + D+ +R D +Y +EN DL+++ NLA Sbjct: 408 --------REGDW-------------KLLRGGDREYLYNLEN---------DLEEQTNLA 437 Query: 517 AANPQVVKEMQGVVREFIDSSQPP 540 A +P+V + ++ + + D PP Sbjct: 438 AKHPKVARRLREKLSIWADKLDPP 461 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 141 bits (356), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 149/551 (27%), Positives = 212/551 (38%), Gaps = 136/551 (24%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P + S+K +P+II+ +DDLG+ + F Sbjct: 232 PKKASSK-RPHIILFLIDDLGWNDIACYGSQF--------------------------YE 264 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG--------- 159 TP L + EG RFT+ Y A+ V P+RA+I+ G+ P+R G+ SN G Sbjct: 265 TPHLDKMAKEGFRFTDAYAANPVCSPTRASILLGKYPSRVGL-SNHSGSSGPKGPGHKLT 323 Query: 160 -------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 +PL + L E + GY TA +GKWHL + +HD ++ Sbjct: 324 PVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHL--------------QAHHD-----TS 364 Query: 213 EEWQPQNRGFDYFMGFHAA---GTAYYNSPSLFKNRERVP--AKG----YISDQLTDEAI 263 P+ GFD + H G+ Y+ S VP A G Y++D+LTD+AI Sbjct: 365 RNHFPEKHGFDLNIAGHRMGQPGSFYFPYKSKQHPSTNVPDMADGQEGDYLTDKLTDKAI 424 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLP--------------------NDNPAPDQYQKQFNT 303 + K D PF L Y H P N N K F Sbjct: 425 HYIKENK--DTPFFLNFWYYTVHTPIIPRQDLKKKYEAKANELGINKNQPGIPVLKSFAR 482 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--IDGP------LP 355 SQ +Y A V ++D+ + RI + LK+ D TII+F SDNG + GP LP Sbjct: 483 SSQNNPSYAAMVEAMDENIGRIFKTLKELQIDDETIIIFCSDNGGLSTSTGPNCPTSQLP 542 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLK 415 L K K+ Y GG P + W GK + D YPT LD + + Sbjct: 543 L----KAGKAWVYEGGIRIPFIIKWPGKKGGKELQAPVCTTDIYPTLLDMLKLPAKPEQH 598 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 LDGVSL + + K L + H YPH + Sbjct: 599 LDGVSLTSLMNGQA-----KELQREALFIH-----------------------YPHYHHI 630 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 + + VR DY LV E + LY L D+ + +NL P+ +M + ++ Sbjct: 631 NSMGP-AGAVRMGDYKLVEYYETGEFELYNLKEDIGEMNNLVKEQPERAAQMLKKLEQWR 689 Query: 535 DSSQPPLSEVN 545 S P E N Sbjct: 690 QQSNSPKPERN 700 >UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deuterostomia RepID=GALNS_HUMAN Length = 522 Score = 140 bits (354), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 137/503 (27%), Positives = 207/503 (41%), Gaps = 85/503 (16%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI++L MDD+G+G D G + + E TP L + Sbjct: 31 PNILLLLMDDMGWG----DLGVYGEPSRE----------------------TPNLDRMAA 64 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY-SNTDAQD---------GIPLTETFL 167 EG+ F N Y A+ + PSRAA++TGR P R G Y +N A++ GIP +E L Sbjct: 65 EGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLL 124 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR----GFD 223 PEL + GY + VGKWHL P+ + D + + P + Sbjct: 125 PELLKKAGYVSKIVGKWHLGH-----RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIP 179 Query: 224 YFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 + + G Y P K E + Y+ EA+ + R + PF LY A + Sbjct: 180 VYRDWEMVGRYYEEFPINLKTGEANLTQIYL-----QEALDFIKR-QARHHPFFLYWAVD 233 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 A H P Y + G+ Y +V +D + +ILE L+ DNT + FT Sbjct: 234 ATHAP-------VYASKPFLGTSQRGRYGDAVREIDDSIGKILELLQDLHVADNTFVFFT 286 Query: 344 SDNGAVIDGPLPLNGAQKG----YKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDF 398 SDNGA + P G G K T+ GG P WW G + G +L S MD Sbjct: 287 SDNGAALIS-APEQGGSNGPFLCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMDL 345 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKK---------QGEPHKNLTWITSYSHWFDE 449 + T+L A ++ P D +DG++LLP L + +G+ T +H++ Sbjct: 346 FTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGDTLMAATLGQHKAHFWTW 405 Query: 450 ENIPFWDNYHK---FVRHQSDDYPHNPNTEDLSQFSYTVR-----NNDYSLVYTVENNQL 501 N W+N+ + F Q+ N ED ++ + L + Q Sbjct: 406 TNS--WENFRQGIDFCPGQNVSGVTTHNLEDHTKLPLIFHLGRDPGERFPLSFASAEYQE 463 Query: 502 GLYKLTDL--QQKDNLAAANPQV 522 L ++T + Q ++ L A PQ+ Sbjct: 464 ALSRITSVVQQHQEALVPAQPQL 486 >UniRef50_D1QVA8 N-acetylgalactosamine-6-sulfatase n=1 Tax=Prevotella oris F0302 RepID=D1QVA8_9BACT Length = 521 Score = 140 bits (354), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 147/528 (27%), Positives = 222/528 (42%), Gaps = 132/528 (25%) Query: 57 KPNIIVLTMDDLGY--GQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 +PNII+ +DD+G+ LPF + +TM N D Y+ TP + Sbjct: 36 RPNIILFMVDDMGWQDTSLPF----WTQRTMYN----DRYE-------------TPNMER 74 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-------NTDAQD--------- 158 L G+ F+ Y A +S PSR ++MTG AR V + +TD +D Sbjct: 75 LAARGMMFSQAY-ACPISSPSRCSLMTGSNAARHRVTNWTLEKNKSTDLKDDQLTLPEWN 133 Query: 159 --GIPLTE--------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 GI E T L Q GY+T GK H D D Sbjct: 134 YNGISGVEGCRNTYRATSFVNLLQASGYHTIHCGKAHWGA-------RDTPGED------ 180 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGT-AYYNSPSLFKNRERVPAK---------------- 251 P + GF+ + HA G A Y S + N + PAK Sbjct: 181 --------PHHWGFEVNIAGHAGGGPATYLSERHYGNTDN-PAKQHKMAIPGLEKYWDTG 231 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 ++++ LT EA+ +D+AK +QPF LY+++ A H+P D P Y K G + Sbjct: 232 TFLTEALTREALKSLDKAKLYNQPFYLYMSHYAVHIPIDR-DPRYYDKYLKKGLSEKEAA 290 Query: 312 YAS-VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP----LPL---NGAQKGY 363 YAS V +D+ + IL+ L KN + TI++F SDNG G PL N K Sbjct: 291 YASLVEGMDKSLGDILDWLDKNDETRRTIVIFMSDNGGYATGSQWRDQPLFTQNSPLKSG 350 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADI---SIPKDLKLDGV 419 K Y GG PM + W G ++PG+ + + D++PT L+ A I +P+ K+DG Sbjct: 351 KGSMYEGGIREPMIVSWSGTVKPGSVCRQYVMIEDYFPTLLEMAGIKHYKVPQ--KVDGK 408 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 S +P L K G+P + + +Y + W N + Sbjct: 409 SFIPLL--KGTGDPSRGRMLVWNYPN--------VWGNVGPGI----------------- 441 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 + +R D+ L+Y + ++ LY + D+ + NLAA P +VK++ Sbjct: 442 SLNCAIREGDWKLIYNYKTHEKELYDIPNDIGEAHNLAAERPSIVKKL 489 >UniRef50_C9KTC2 Arylsulphatase A n=5 Tax=Bacteroides RepID=C9KTC2_9BACE Length = 501 Score = 140 bits (354), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 124/421 (29%), Positives = 185/421 (43%), Gaps = 83/421 (19%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLGYG D +F+P++ + TP + +L Sbjct: 22 PNVIFILADDLGYG----DISAFNPES---------------------KIHTPNIDNLTH 56 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELFQNH 174 G+ FT+ + + +S PSR +I+TGR P R + S N + I + ++F + Sbjct: 57 SGISFTDAHSSSALSTPSRYSIITGRYPWRTTMKSGVLNGFSPAMITPDRRTIAQMFSEN 116 Query: 175 GYYTAAVGKWHLSKISNVPV-PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA-G 232 GY TA +GKWHL P ++KQ D+ P +RGFDYF G A+ G Sbjct: 117 GYNTACIGKWHLGWDWAYPQNAKNKQDVDFSLPIKN------GPTDRGFDYFYGIPASLG 170 Query: 233 TAYY-----NSPSLFKNRERVPAKGY----------------ISDQLTDEAIGVVDRAKT 271 TA + + + NR P KG + I +++ + Sbjct: 171 TAPHVYVENDKVTALPNRTIGPQKGIKLIRNGVAGADFEPQDCLPNIIRHGIDYINKQRD 230 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 +PF LYL APH P PA ++YQ Q G +Y V +D V++I++ LKK Sbjct: 231 SKKPFFLYLPITAPHTPVL-PA-EKYQGQTIIG-----DYGDFVVMIDDMVQQIVKTLKK 283 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQ-------------KGYKSQTYPGGTHTPMFM 378 N Q +NTII+FTSDNG P G + +GYK+ Y GG P+ + Sbjct: 284 NNQLENTIIIFTSDNGCA-----PYIGVEEMENKGHHPSYIYRGYKNDIYEGGHRIPLIV 338 Query: 379 WWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT 438 W+GK L+S DFY T + + + +D S+ P L KK K+L Sbjct: 339 SWQGKYTNETNGSLVSLTDFYATFAQMVNYQLKDEEAVDSYSIWPILS-KKGNSARKDLI 397 Query: 439 W 439 + Sbjct: 398 Y 398 >UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirellula baltica RepID=Q7URY7_RHOBA Length = 490 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 137/463 (29%), Positives = 204/463 (44%), Gaps = 89/463 (19%) Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG--------- 159 TP L +L + G+ F+N Y P+RA++++G+ R +Y+ + G Sbjct: 59 TPNLDALAERGMVFSNAYSCAANCAPARASLLSGQYSPRHEIYNVGTERRGNPKHGTLQH 118 Query: 160 IPLTETFLPEL------FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 IP TET ++ ++ GY T +GKWHLS + P+P Sbjct: 119 IPGTETLSSDIQTWAHQVRDAGYRTGIIGKWHLS---DDPLP------------------ 157 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPA------KGYISDQLTDEAIGVV 266 GFD + AGT + P F +VP Y++D+LTDEAIG + Sbjct: 158 ------YGFD----INVAGTHSGSPPKGYFPPHPKVPGLQDTSDDEYLTDRLTDEAIGFI 207 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY---ASVYSVDQGVK 323 + + + + LYL++ A H P PD K T ++ A + SVD+GV Sbjct: 208 EANQ--EWSWFLYLSHFAVHTPL-QAKPDLVAKYKAKQPGTLHDHAVMAAMIESVDEGVG 264 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 R++E L++ G +NT I+FTSDNG GP +GYK Y GG P F+ W G Sbjct: 265 RMVETLRELGLEENTAIVFTSDNGGF--GPATSMKPLRGYKGTYYEGGIREPFFVTWPGV 322 Query: 384 LQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE-PHKNLTWIT 441 + G D + A D YPT ++ +P D LDGVSL+P L K++G + L W Sbjct: 323 VDAGTKSDVPVIAADLYPTFIEMTGAKLPADQPLDGVSLMPLL--KQEGSLADRELYW-- 378 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSD-DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + P + + Q D Y P +R+ + L E+ Sbjct: 379 ---------HFPAYLQSYSVTDGQRDLLYRSRPCG--------IIRDGRWKLHEYFEDGG 421 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVV---REFIDSSQP 539 L LY L TD + +NLA ANP + + + RE I +S P Sbjct: 422 LELYDLVTDPGESNNLADANPIKTQALHSKLVAWRERIGASMP 464 >UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=Q482D6_COLP3 Length = 492 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 129/428 (30%), Positives = 182/428 (42%), Gaps = 100/428 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN+++L +DD G R+ + TY TP + L Sbjct: 30 KPNVVMLLVDDFG------------------RQDLSTYGSNF--------YETPNIDQLA 63 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQDGIPLTETFLPELFQNHG 175 +G++F N Y AH PSR AI +G P R+GV + +PL+ E + G Sbjct: 64 ADGMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGVPQGERVGKHHLPLSAVTFGEHLKEAG 123 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD--YFMGFHAAGT 233 Y T +GKWHL K E P +GFD G A Sbjct: 124 YQTGYIGKWHLGK------------------------EGGDPTKQGFDSSIMAGHWGAPP 159 Query: 234 AYY----NSPSLFKNRERVPAKG----YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 +YY KN+ +G Y++D+LTDEA+ +++ K DQPF+L LA+ A Sbjct: 160 SYYFPYTKMSKSGKNKGFAKVEGSEEEYLTDRLTDEALTFIEQKK--DQPFLLVLAHYAV 217 Query: 286 HLP---------------------NDNPAPDQYQKQFNTG----SQTADNYYASVYSVDQ 320 H P N P D + +TG Q +Y A V SVD Sbjct: 218 HTPIEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDI 277 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ-------KGYKSQTYPGGTH 373 V RI +QLK+ G DNTII+ TSD+G + L N + K Y GGT Sbjct: 278 SVGRIEQQLKRLGLEDNTIIILTSDHGGLSSRGLKSNRVLATSNNPYRHGKGWIYDGGTR 337 Query: 374 TPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISI-PKDLKLDGVSLLPWLQDKKQG 431 P+ + W K++ G+ ++ ++ D YPT L A +S+ PKD + DGVS L L + Sbjct: 338 VPLIVKWPEKVKAGSISQVQVTGTDHYPTILQMAGLSLSPKDHQ-DGVSYLAALNSDET- 395 Query: 432 EPHKNLTW 439 P K + W Sbjct: 396 -PRKAMFW 402 >UniRef50_A6DFS2 N-acetylgalactosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFS2_9BACT Length = 497 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 114/377 (30%), Positives = 169/377 (44%), Gaps = 45/377 (11%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN I + DD+GYG L E D K TP L + Sbjct: 21 KPNFIFMMADDMGYGDL------------EAYGYNDKLK-------------TPNLNEMA 55 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 G+ FT+ Y V P+R + TGR P R G++ Q + E LPE+ + HGY Sbjct: 56 ANGMLFTSFYSQASVCSPTRFSCYTGRHPFRTGIWEAN--QGSLRDEEITLPEVLKKHGY 113 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDY---HDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 T GKWHL ++ + P H+N +EW + F + G Sbjct: 114 ATGHFGKWHLGQMVDDPTLGKGARMPMAPPHEN----GVDEWFAVHSCVPTFNPYGPNGE 169 Query: 234 -AYYNSPSLFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP-N 289 A + + + N RV G S + D AI +++A + PF+ Y+ +N PH P Sbjct: 170 EAAESDNAYYHNGVRVTDNLVGDSSRIIMDRAIPFIEKAVQNETPFITYIWFNTPHAPVT 229 Query: 290 DNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 NP Q+Q + A YY+++ +D+ V R+ +L++ G +NT++ FTSDNG V Sbjct: 230 GNP---QWQSTYEPIVGKAWQYYSNLADMDKQVGRLRSKLQELGVANNTVLCFTSDNGPV 286 Query: 350 IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADI 408 G +G + K + GG P + W K++ G+ IS D++ TALDAA I Sbjct: 287 SHGS---SGPFRASKRHLFDGGVRVPGIIEWPAKVRKGSETAAISCTTDYFLTALDAAGI 343 Query: 409 SIPKDLKLDGVSLLPWL 425 K+DG SL+P L Sbjct: 344 DYQSPYKMDGQSLVPIL 360 >UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UL93_RHOBA Length = 470 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 126/465 (27%), Positives = 201/465 (43%), Gaps = 98/465 (21%) Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN-TDAQD--------- 158 TP + +L + GVRF N Y V P+RA++MTG APAR + + D++ Sbjct: 72 TPNIDALAEAGVRFDNAYAGSTVCTPTRASLMTGLAPARLHITQHGADSKSFWPDDRLIQ 131 Query: 159 ------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 +P T + E + GY T GKWHL DK+ Sbjct: 132 PPPTNHELPHETTTMAERLKAAGYTTGFFGKWHLGG--------DKK------------- 170 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA------KGYISDQLTDEAIGVV 266 + P GFD +G G P+ F + R+PA Y++D+L DE I + Sbjct: 171 --YWPTEHGFDVNVG----GCGLGGPPTYF-DPYRIPALPPRKEGEYLTDRLADETIAFM 223 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 R K D+P + L PH P + P + Y+ + TG + Y + + D+GV R Sbjct: 224 RREK--DKPMFVCLWTYNPHYPFEAPEDLIEHYKGKEGTGLKNP-IYGGQIEATDRGVGR 280 Query: 325 ILEQLKKNGQYDNTIILFTSDN----GAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW 380 +L +L G D T+++FTSDN GA + PL + K + GG P+ + W Sbjct: 281 VLRELDSLGIADETLVVFTSDNGGWSGATDNRPL------REGKGFLFEGGLRVPLIVRW 334 Query: 381 KGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G + ++ + +MD T LDAA +S+ LDG SL P K Sbjct: 335 PGVTEAATVNETPVVSMDLTATILDAAGVSLANGESLDGESLRPLFSGGKL--------- 385 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 E + +Y F H+ D+ P + +R+ Y L+ +++ Sbjct: 386 ----------ERDALYFHYPHFAFHK-DNRPGS-----------VIRSGQYKLILRHDDD 423 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + LY L DL + +LAA +P V +E++G + E+++++ + E Sbjct: 424 SVELYDLQNDLSETSDLAAVHPDVAQELKGRLMEWLEATGAGMPE 468 >UniRef50_B7S0F9 Sulfatase domain protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7S0F9_9GAMM Length = 602 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 121/407 (29%), Positives = 173/407 (42%), Gaps = 91/407 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI++L +DD GY L + GS P TP L ++ Sbjct: 15 KPNILLLALDDFGYNDLAINNGSDSP--------------------------TPRLDAIA 48 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 +GVRFT Y + SR A++TGR PAR G + + D +T LP+ + GY Sbjct: 49 AQGVRFTRHYAESSCTA-SRVALLTGRYPARVGAHPYLNGIDHELMT---LPDALGSEGY 104 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 VGKWH T D H E +P+ +GFD++ GF Y Sbjct: 105 IRHMVGKWH--------------TGDSH--------RESRPEYQGFDHWFGF--INQLYL 140 Query: 237 NSPSLFKNRERVPA-----------------KGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 P N R +G+++D LTD A+ V+ R + P+ LY Sbjct: 141 RGPHRSANYRRGKPTYINPWLENELGDLQQYEGHLTDILTDRALDVIKREQN---PWFLY 197 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 L+Y APH P + A ++ +++ A Y A +D + RI++ L ++G+ DNT+ Sbjct: 198 LSYYAPHTPIEPAA--RFSERY--ADDPAGRYQAMKDQLDSNIGRIIDWLTESGEIDNTM 253 Query: 340 ILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DF 398 I+ SDNG P N G K+ GG TP+ + W G G D I+ + D Sbjct: 254 IIVVSDNGGTAKS-WPSNLPFYGSKATYTEGGVRTPLLLSWPGHWPVGQQDDQIAMIFDL 312 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 YPT L A + P+ LDG L P + L W YSH Sbjct: 313 YPTILAALGKAQPEG--LDGADLF------DSDRPPRTLKW---YSH 348 >UniRef50_B5JMW2 Sulfatase domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JMW2_9BACT Length = 594 Score = 140 bits (353), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 118/417 (28%), Positives = 182/417 (43%), Gaps = 73/417 (17%) Query: 44 FSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 +S TP ++ G P N++V+ DD G+G D ++ Sbjct: 15 YSLLTPLSAASGGNPPNVLVILADDQGWG---------------------------DLSL 47 Query: 103 EAAQK-STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 +Q +TPTL L +G +F N YV V P+RA +TGR R GVY + + Sbjct: 48 HGSQNLNTPTLDRLAQQGAQFENFYV-QPVCSPTRAEFLTGRYYPRGGVYDTGAGGERLD 106 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + ++F+ GY TAA GKWH + P YH P RG Sbjct: 107 ADEETIAQVFRTAGYATAAFGKWH----NGTQAP-------YH------------PNTRG 143 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 FD + GF + Y L N V + GY+ D LT + +++ PF YLA Sbjct: 144 FDEYYGFTSGHWGSYFDALLDHNGSLVQSAGYLPDTLTTATLDFIEQQTADQTPFFAYLA 203 Query: 282 YNAPHLPNDNPAPD----QYQKQFNTGSQTADN-------YYASVYSVDQGVKRILEQLK 330 PH P D +K + + AD A V ++D V R+L++++ Sbjct: 204 LPTPHSPMQTTDEDWARFANKKLTSLATNPADENPDHTRAALAMVENIDANVGRLLDRIQ 263 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NY 389 + +NTI+++ +DNG N +G K T GGT +P+F+ + K+QPG Sbjct: 264 ELDIEENTIVVYFTDNGP---NGWRYNANMRGRKGSTDEGGTRSPLFIRYPQKIQPGATL 320 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 + + S++D PT A+I+ LDG+SL P LQ+ P + + +S+W Sbjct: 321 NTIASSIDLLPTLGQLANITWQPAQTLDGISLAPQLQNPNLRLPDRTI-----FSYW 372 >UniRef50_C3KKB6 MIP05773p n=12 Tax=Drosophila RepID=C3KKB6_DROME Length = 564 Score = 140 bits (352), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 120/416 (28%), Positives = 178/416 (42%), Gaps = 89/416 (21%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E + +PNII++ DD+G+ + F G RE + TP Sbjct: 20 ESAAARRPNIIIIMADDMGFDDVSFRGG---------REFL-----------------TP 53 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG----VYSNTDAQDGIPLTETF 166 + +L G R + A + PSR A+++GR P G V SN + + L T Sbjct: 54 NIDALAYHG-RLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPW-ALTLNATL 111 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 +PE+F+ GY T VGKWHL FS E+ P RGFDY Sbjct: 112 MPEIFKEAGYSTNLVGKWHLG----------------------FSRPEYTPTRRGFDYHF 149 Query: 227 GFHAAGTAYYNSPSLFKNRERVPAKGY--------------------ISDQLTDEAIGVV 266 G+ A Y F+ R ++P Y ++D LT EA ++ Sbjct: 150 GYWGAYIDY------FQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLI 203 Query: 267 DRAKTLDQPFMLYLAYNAPHLPN-DNP--APDQYQKQFNTGSQTADNYYASVYS-VDQGV 322 +QP L L++ A H N D+P AP++ ++F+ YA++ S +DQ V Sbjct: 204 KDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSV 263 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ---KGYKSQTYPGGTHTPMFMW 379 RI+ L Q +N+I++F SDNGA G G+ +G K+ + GG +W Sbjct: 264 GRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVAGAIW 323 Query: 380 WKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 G G+ + + + D+ PT AADI + LKLDG+ L P L PH Sbjct: 324 SSGLQARGSIFRQPLYVADWLPTLSRAADIELDSSLKLDGIDLWPELSGSADA-PH 378 >UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UQ05_RHOBA Length = 525 Score = 140 bits (352), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 130/540 (24%), Positives = 218/540 (40%), Gaps = 149/540 (27%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+++ +DDLG+ L G + TY TP + +L Sbjct: 53 RPNVLLFLVDDLGWADL----GCYG----------STYH------------ETPQIDALA 86 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV----------------YSNTDAQDGI 160 + G+RFTN Y A V P+RA+IMTGR P R + + + D +D + Sbjct: 87 ESGIRFTNAYAACPVCSPTRASIMTGRHPVRVDITDWIPGMSTDRAQNPRFQHVDDRDNL 146 Query: 161 PLTETFLPELFQNHG-YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 L E + E ++ Y T +GKWHL + ++P + Sbjct: 147 ALDEVTIAEHLRDAADYQTFFLGKWHLGDVGHLPT------------------------D 182 Query: 220 RGFDYFMGFHAAGT---AYYN---SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 +GF +G G+ YY+ +P L ++ Y++ +LTDEA+ +VD A D Sbjct: 183 QGFQINIGGGHKGSPPGGYYSPWKNPYLKAKQD----GEYLTTRLTDEAVSLVDTASRED 238 Query: 274 QPFMLYLAY----------------------NAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 +PF + ++Y N+P L D P + + G Q Y Sbjct: 239 KPFFMMMSYYNVHSPITPDKRTIDHFEEKQSNSPELQGDTPTIAE-RDAVTRGRQDNPAY 297 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID----GPLPLNGAQKGYKSQT 367 + V +VD V RI++ LK++G DNT+++F SDNG + GP N + K Sbjct: 298 ASMVKAVDTSVGRIMKALKEHGVDDNTLVIFFSDNGGLSTLRKFGPT-CNSPLRAGKGWL 356 Query: 368 YPGGTHTPMFMWWKGKL-----------QPGNYDKLISAMDFYPTALDAADISIPKDLKL 416 Y GG P+ + + QP D + + D +PT LD + + + Sbjct: 357 YEGGIREPLLVRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVGLPLQPESHA 416 Query: 417 DGVSLLPWL--QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPN 474 DG+SLLP + + + ++L W YPH Sbjct: 417 DGISLLPAIAGEAAETDSSPRDLHW----------------------------HYPHYHG 448 Query: 475 TEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREF 533 + L + +R +Y L+ E + LY L+ D+ + +L+ P+ E++ +R++ Sbjct: 449 S--LWRPGAAIRRGNYKLIEFYETDTAELYDLSVDMGETKDLSKTEPERFAELRDALRQW 506 >UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W8_9PLAN Length = 459 Score = 140 bits (352), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 132/499 (26%), Positives = 198/499 (39%), Gaps = 116/499 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNII + DDLGYG L G + K M+ TP + Sbjct: 28 RPNIIFIMADDLGYGDL----GCYGQKLMK----------------------TPHIDQFA 61 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFL-------PE 169 +G RFT Y V SRA ++TG +T A+D IP T+L E Sbjct: 62 AQGTRFTQAYAGGSVCTASRAVLLTGLHNG------HTPARDNIPHYATYLQESDVTIAE 115 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 + Q GY VGKW L V + N+GFD + G+ Sbjct: 116 VLQKSGYRCGGVGKWSLGDAGTVG----------------------RATNQGFDMWFGYL 153 Query: 230 AAGTA-YYNSPSLFKNRERVPAKG-------YISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A YY + L N R+ KG Y D LT+ A+ + + QPF LY A Sbjct: 154 NQDHAHYYFTEYLDDNEGRLELKGNTKNRQQYSHDLLTERALQFIRDSAA--QPFFLYAA 211 Query: 282 YNAPHL------PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 Y PH P+ PD + Y A ++ +D+ V RI+ + + Sbjct: 212 YTLPHFSAKAEDPHGLAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNELQLR 271 Query: 336 DNTIILFTSDNGAVIDGPLPL--NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKL 392 + T+I+FTSDNG P L NG +G+K GG P W G + G D++ Sbjct: 272 ERTLIIFTSDNGGHRGVPAQLHTNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVSDEV 331 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 I+ D PT + A + + LDG+S+LP L+ + + H+ L W Sbjct: 332 IAFQDMLPTFAELAGAQVSAN--LDGISVLPALRGEPRKVKHEYLYW------------- 376 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQ 511 DY H +++ VR N++ + + ++ LY L DL + Sbjct: 377 ---------------DYGHC-----RARYDQAVRWNNWKGIRHGQQGEIALYNLDQDLSE 416 Query: 512 KDNLAAANPQVVKEMQGVV 530 ++A +PQVV+ + ++ Sbjct: 417 SRDVADKHPQVVQRIAEIM 435 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 124/497 (24%), Positives = 201/497 (40%), Gaps = 108/497 (21%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 +++ +PN+I++ DD GYG + + +TP L Sbjct: 52 ASESRPNVILVVTDDQGYGDMSCHGNPW--------------------------LNTPNL 85 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQ 172 L + VR N +V + P+RAA+MTGR R G ++ T+ + + ET + E F+ Sbjct: 86 DRLATQSVRLENFHVDPFCT-PTRAALMTGRYCTRVGAWAVTEGRQLLDPDETTMAETFR 144 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY T GKWHL P P + P+ RG + + A G Sbjct: 145 ESGYRTGMFGKWHLGD----PPP-------------------FAPRERGLETVVRHMAGG 181 Query: 233 TAYYNSP--------SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 +P + ++N GY +D +EAI + K +QPF Y+ NA Sbjct: 182 ADEIGNPTGNDYFDDTYYRNGTPESFDGYCTDIWFEEAIDFIQ--KESEQPFFAYIPTNA 239 Query: 285 PHLP--NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 H P + D +++Q + A +Y + + D+ + R+L++L ++ DNT+++F Sbjct: 240 MHSPYLVADRYSDPFKRQGIEPQRAA--FYGMIQNFDENLGRLLKRLDQDNLRDNTMLIF 297 Query: 343 TSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN--YDKLISA 395 SDNG + N +G K Y GG P F W K GN D+L Sbjct: 298 MSDNGTAQGASEQNRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWD-GNRPVDQLTCH 356 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 D+ PT ++ D+ P D+ DG S+ L Q P + L Sbjct: 357 RDWLPTLIELCDLKRPADVTFDGRSMAGLLSHSSQQWPERTLV----------------- 399 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV----YTVENNQLGLYKLTDLQQ 511 + Q D+ T+ +Q + V + + LV Y ++N D Q Sbjct: 400 ------IERQPDNVVSATKTQGRAQPPFVVLTDRWRLVRDELYDIQN---------DPGQ 444 Query: 512 KDNLAAANPQVVKEMQG 528 N+AA P+VV+E++ Sbjct: 445 IKNIAAEYPEVVRELRA 461 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 138/519 (26%), Positives = 210/519 (40%), Gaps = 128/519 (24%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E + K KPNII + DDLGY +L G + K ++ TP Sbjct: 10 EATEKQKPNIIFIMADDLGYAEL----GCYGQKKIK----------------------TP 43 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 + L EG++FT Y V PSR+ +MTG+ V +N D + +T + E+ Sbjct: 44 HIDKLAAEGMKFTQAYAGSMVCQPSRSVLMTGQHTGHTAVRAN-DLNQLLYEEDTTVAEV 102 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 + GY T A GKW L + +P +GFD F G Sbjct: 103 LKIAGYATGAFGKWGLG----------------------YEGTPGRPGQQGFDDFTGQLL 140 Query: 231 AGTAYYNSPSLFKNRER---VPA-----KG-YISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A++ P N E +P +G YI D + ++A + + K QPF YL Sbjct: 141 QVHAHFYYPFWIWNNEHRLMLPENENNQRGRYIHDLIHEDAKAFIQKNKA--QPFFAYLP 198 Query: 282 YNAPHL----PNDNPAPDQYQKQFN-----------TGSQTADNYYASVYS-VDQGVKRI 325 Y PH+ P ++ P Y+ QF GS+ +A + S +D V I Sbjct: 199 YIIPHVELVVPEESEKP--YRGQFPKKQILDPRPGYIGSEDGLTTFAGMVSRLDDHVGEI 256 Query: 326 LEQLKKNGQYDNTIILFTSDNGA----------VIDGPLPLNGAQKGYKSQTYPGGTHTP 375 + L+ G DNT+I+FTSDNG +G PL +G+K Y GG P Sbjct: 257 VTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDFFNGNAPL----RGHKGSMYEGGIRVP 312 Query: 376 MFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 W GK+ G L I+ D PT A ++P + +DG+S LP L K + H Sbjct: 313 FIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAGTTVPSGVDIDGISFLPTLLGKGKQPEH 372 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + L +W+ +R S +R ++ V Sbjct: 373 EYL----------------YWEYTRGKIR------------------SRAIRQGNWKAVQ 398 Query: 495 TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 N + LY L TD+ + NLA +P+ +K++Q ++++ Sbjct: 399 NRMNQPIELYDLGTDIGETKNLAKQHPEKIKDLQQIMQQ 437 >UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI83_PLALI Length = 558 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 135/495 (27%), Positives = 208/495 (42%), Gaps = 103/495 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN++++ DDLGY D G+F A TP + + Sbjct: 107 KPNVVIINCDDLGYA----DVGAFG----------------------ATICKTPEIDRMA 140 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD--AQDGIPLTETFLPELFQNH 174 EGV+ T+ YVA V SR A++TG P R G+ +++GI +E L ELFQ+ Sbjct: 141 REGVKATSFYVAQAVCSASRTALLTGCLPNRIGILGALSHVSKNGIADSEVTLGELFQSQ 200 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY TA GKWHL Y F P + GF +G + Sbjct: 201 GYSTAMYGKWHLG---------------YQAQFL--------PGHHGFGEALGIPYSNDM 237 Query: 235 YYNSP-------SLFKNRERVPAK--GYISDQ------LTDEAIGVVDRAKTLDQPFMLY 279 + +P LF+ + PA+ G+ +DQ T A+ +DR D+PF +Y Sbjct: 238 WSKNPYGKFPPLPLFRQKGDSPAEIIGHDTDQSRFTTDFTMAAVSFIDRHA--DKPFFIY 295 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA+ PH P ++ N+G + A Y + +D V I + L+K+ T+ Sbjct: 296 LAHPMPH------TPIFVSEERNSGER-AQLYRDVIGEIDWSVGTIRQTLEKHQLTRKTL 348 Query: 340 ILFTSDNG--AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAM 396 ++FTSDNG V G + K + GG P W G + P D ++ Sbjct: 349 VIFTSDNGPWLVFGNHAGSTGPLREGKGTMWDGGARVPFVACWPGVIPPDTTVDLPMATY 408 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D +PT +P D +DGV + P L + +PH+ L W ++ + I Sbjct: 409 DLFPTFAKMLGAKLP-DHPIDGVDIWPQLTSASKAQPHQAL-WF-----YYGRDLIAVRS 461 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTV-RNNDYSLVYTVEN--NQLGLYKL-TDLQQK 512 K V +PH + + V R ND V +L LY L +D+ + Sbjct: 462 GPWKLV------FPHT--------YVHPVERGNDGQRGKLVNRKFTELALYNLDSDIGET 507 Query: 513 DNLAAANPQVVKEMQ 527 NLA+ +P++VK+++ Sbjct: 508 TNLASQHPEIVKQLE 522 >UniRef50_Q7UNI8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UNI8_RHOBA Length = 530 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 105/394 (26%), Positives = 168/394 (42%), Gaps = 68/394 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+I++ DDLGY Q G FD ++ TP L L Sbjct: 53 RPNVILVMADDLGYAQ----TGYFDHPLLK----------------------TPELDRLA 86 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 G+R Y A V P+RA+++TGR+P R GV S+ A + E + + GY Sbjct: 87 AGGLRLDRFYAASAVCSPTRASVLTGRSPERTGVPSHGHA---LRHQERSIASAIRKAGY 143 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 T GKWHL+ + VP + + P GFD ++ T ++ Sbjct: 144 VTGHFGKWHLNGLRGPGVP-------------ILGEDRYHPGRFGFDVWLSV----TNFF 186 Query: 237 NSPSLFKNRERVPA-KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP------- 288 + L R + A +G S+ + EA+ R+ QPF + Y PH P Sbjct: 187 DRNPLMSRRGKFEAFEGDSSEVIVKEALDFASRSARSQQPFFAVVWYGTPHSPFVASDED 246 Query: 289 --------NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 D + Q+ + + + N+Y + ++D+ + + E L++ DNT+I Sbjct: 247 MQTLVDAGEDEDEKARLQRIVDGMDEGSRNHYGELVAMDRSLGTLREGLERLRVLDNTLI 306 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFY 399 F SDNG + +G+KSQ Y GG P + W ++Q G ++ A D Sbjct: 307 WFCSDNGGLKGIEPSTTEPLRGHKSQLYEGGLRVPCVLHWPDQIQAGRISRVPACATDIA 366 Query: 400 PTALDAADISIPKD---LKLDGVSLLPWLQDKKQ 430 PT +D + +P D +DG+SLLP +Q + + Sbjct: 367 PTLVDL--LGLPGDSLTTPVDGISLLPVIQGQDE 398 >UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D3U0_9BACT Length = 467 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 109/401 (27%), Positives = 170/401 (42%), Gaps = 78/401 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN I + DDLG+G + F G+ TP L L Sbjct: 40 RPNFIFILADDLGWGDVGFHHGNV---------------------------PTPNLDHLA 72 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 EG+ YV + V P+R A ++GR +RF V + + + L ++ GY Sbjct: 73 GEGLELMQHYV-YPVCSPTRCAFLSGRYASRFSVTTPQNPR-AFRWDTVTLARALKSVGY 130 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 TA GKWHL S EW PQ GFD+ G A G + Sbjct: 131 DTALCGKWHLG-----------------------SKPEWGPQKFGFDHSYGSLAGGVGPW 167 Query: 237 N--------SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 + + + ++ + + +G+++D +T EA+ ++ D+PF LY+ + A H+P Sbjct: 168 DHHYKIGEFTQTWHRDGKLIEEQGHVTDLITKEAVEWLE--SRTDKPFFLYVPFTAVHIP 225 Query: 289 NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 P + + + +Y A+V +D V +IL L+K G+ NT+++F SDNGA Sbjct: 226 IREPDEILQRVPASITKPSLRHYGANVMHLDDSVGKILVALEKTGKAGNTLVIFGSDNGA 285 Query: 349 VID-------------GPLPLNGAQK---GYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL 392 + P P G+ + G K + Y GG HT W G+L+PG + L Sbjct: 286 IPGVENNDPLYPPDHYPPGPAGGSNEPLHGMKGEVYEGGIHTAAVARWPGQLKPGKFLGL 345 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 D+ PT A KDLK DG ++ P L + +P Sbjct: 346 AHITDWMPTFCALAGYKPEKDLKWDGQNIWPQLTGAEPVKP 386 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 126/419 (30%), Positives = 179/419 (42%), Gaps = 99/419 (23%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+ + DDLGYG+L G + + ++ TP L L Sbjct: 27 PNIVYIYADDLGYGEL----GCYGQQKIK----------------------TPNLDRLAK 60 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----------AQDGIPLTETFL 167 EG+RFT Y V P+RA +MTG+ + N + Q +P E + Sbjct: 61 EGIRFTQHYTGTPVCAPARAMLMTGKHAGHSAIRGNFELGGFRDEEERGQMPLPANELTV 120 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYH--------------------DNF 207 EL + GY TA GKW + ++N +Q DY+ D + Sbjct: 121 AELLKQKGYATALTGKWGMG-MNNTEGTPTRQGFDYYYGYLDQKQAHNLYPSHLWENDRW 179 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVD 267 T A+ WQ +R D A A + S FK +E PAK +T++A+ +D Sbjct: 180 DTL-AQPWQDIHRKLDP----AKATDADFES---FKGKEYAPAK------MTEKALAFID 225 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQK----QFNTGSQTADNYYAS--------- 314 R+K PF LY+ Y PH+ APD+Y K QF+ + YAS Sbjct: 226 RSKA--GPFFLYMPYTLPHVSLQ--APDEYVKKYIGQFDEKPYYGEKNYASTKYPLSTYA 281 Query: 315 --VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLN--GAQKGYKSQT 367 + +D V IL++LK G DNTI++F+SDNGA +G P N +G K Sbjct: 282 SMITFLDDQVGIILDKLKALGLDDNTIVMFSSDNGATFNGGVNPQFFNSVAGLRGLKMDV 341 Query: 368 YPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG P + W GK++PG +SA D PT + + P DG+S LP L Sbjct: 342 YEGGIREPFIVRWPGKIKPGRVSDHVSAQFDLMPTLAELTGQASPPT---DGISFLPEL 397 >UniRef50_Q7UYS6 Arylsulfatase A n=4 Tax=Bacteria RepID=Q7UYS6_RHOBA Length = 512 Score = 139 bits (351), Expect = 2e-31, Method: Compositional matrix adjust. Identities = 124/426 (29%), Positives = 177/426 (41%), Gaps = 87/426 (20%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 TK PN+++L DDLGYG L ++N E ++ TP L Sbjct: 32 TKTPPNVLILYADDLGYGDL----------NLQNAE---------------SKIPTPHLD 66 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR-FGVYSNTDAQDGIPLTETFLPELFQ 172 L G+RFT+G+ + G+ PSR A++TGR R F N + + LPE+FQ Sbjct: 67 QLARSGMRFTDGHSSSGICTPSRYALLTGRHHWRDFHGIVNAFGESVFEPEQLTLPEMFQ 126 Query: 173 NHGYYTAAVGKWHLS-KISNVPVPEDKQTRDYHDNFTTFSAEEWQ------PQNRGFDYF 225 HGY TAA+GKWHL + P+ K + A +W P GFD + Sbjct: 127 QHGYQTAAIGKWHLGWDWDAIKKPDAKTFGEGRKKGYGPEAFDWTKSIPDGPLAHGFDSY 186 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ---------------------------- 257 G Y ++ + V A I D Sbjct: 187 FGDTVINFPPY---CWIEDDKVVKAPDTIMDTAKWKPIKEGNWECRPGPMTSDWDPYQNI 243 Query: 258 --LTDEAIGVVDRAKTLDQPFMLYLAYNAPH---LPNDNPAPDQYQKQFNTGSQTADNYY 312 T + ++ K DQPF LY A+ APH +PND +F+ G A Y Sbjct: 244 PTTTARGVQFIESQKESDQPFFLYFAFPAPHAPIIPND---------EFD-GRSGAGPYG 293 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA-----VIDGPLPLNGAQ--KGYKS 365 V D ++L LK++GQ +NTI++F++DNG D +Q +G K Sbjct: 294 DYVCETDDACGKLLRALKESGQSENTIVIFSADNGPERYAYARDEKYDHWSSQPFRGLKR 353 Query: 366 QTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 Y GG H P + W G G+ D L+S +D + T + SIP D SL+P Sbjct: 354 DLYEGGHHVPFVIHWPGVTDSGSTCDALVSQVDIFATLAEMLGHSIPDGQAKDSRSLMPL 413 Query: 425 LQDKKQ 430 L++ KQ Sbjct: 414 LKEPKQ 419 >UniRef50_A6DI98 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI98_9BACT Length = 468 Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 128/426 (30%), Positives = 177/426 (41%), Gaps = 112/426 (26%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S+ K NII + DDLGYG+L GS+ + ++ TP L Sbjct: 18 SSDTKTNIIFILADDLGYGEL----GSYGQEKIK----------------------TPEL 51 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQ 172 + G+RFTN Y + SR +MTG+ A N D IP + L + Sbjct: 52 DKMAASGIRFTNHYSGYTTCTMSRKVLMTGKHIA------NLPMGDRIP--SITIAGLLK 103 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 N GY TA +GKW + + R HDN P+ GFD+ + G Sbjct: 104 NAGYKTAMIGKWGM------------KGRPGHDN---------SPEKHGFDHVFTYDNQG 142 Query: 233 TAYYNSPS-LFKNRERV---------PAKGYIS---------DQLTDEAIGVVDRAKTLD 273 A++ P L++N E++ GYI D+ T +A+G ++ K D Sbjct: 143 FAHFYYPEYLWRNGEKIHYPTNKNLLTEDGYIKEKHDGVYSHDEFTKDALGFIEENK--D 200 Query: 274 QPFMLYLAYNAPH----LPNDNPAP------------------------DQYQKQFNTGS 305 +PF LYL Y PH +P+D+ P QY K + Sbjct: 201 RPFFLYLPYTIPHAEITVPHDSVEPYLKLNWPETPKIIGGGGSKDPGYGSQYVKGYCGQK 260 Query: 306 QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-----NGAQ 360 Y + +D+ V RIL+ LKK +NT++LF SDNGA +G L +G Sbjct: 261 YPHAAYAGMISRMDRDVGRILDLLKKLKIEENTLVLFGSDNGASPEGGQTLEFFQSSGKL 320 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGV 419 +G K Y GT TP +W +Q G ISA DF TA D A I P+ DGV Sbjct: 321 RGDKRSIYEAGTRTPFIAYWPKTIQAGQKTGHISAYCDFVATACDVAGIKTPEH--SDGV 378 Query: 420 SLLPWL 425 S LP L Sbjct: 379 SYLPSL 384 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 139 bits (350), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 111/383 (28%), Positives = 163/383 (42%), Gaps = 57/383 (14%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+I++ DD G+ ++ F EV+ TP L Sbjct: 46 RPNVILVMTDDQGWAEVGF----------HGNEVL----------------KTPNLDRFA 79 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 EG TN YV+ + P+R+++MTGR R G + + + ET + E+F GY Sbjct: 80 AEGTELTNFYVSP-MCTPTRSSLMTGRYHFRTGAHDTYIGRSNMNPEETTIAEVFAGAGY 138 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 T GKWHL + N P+ + Q F + DY G Y+ Sbjct: 139 RTGIFGKWHLGE--NFPMRAEDQ------GFQKVVVHGGGGIGQFADY------PGNTYW 184 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 + P+L N AKGY +D DE+I + + +QPF YL N PH P D D+ Sbjct: 185 D-PTLQYNDSFKKAKGYCTDVFIDESIQFMKDSG--EQPFFCYLPLNVPHSPFD--VADE 239 Query: 297 YQKQFNT-------GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 ++ ++ G + Y + D R+LE ++ GQ +NTIILF SDNG Sbjct: 240 FRADYDNQNLADPDGRKWVAPIYGMITQFDGAFGRLLEAVEDMGQRENTIILFMSDNGP- 298 Query: 350 IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADI 408 + K Y G +P + W LQ G +D +D PT DA I Sbjct: 299 --NSTYFTAGLRAKKGSVYENGIRSPFVIQWPKTLQGGRKFDTPAMHIDLLPTLADACGI 356 Query: 409 SIPKDLKLDGVSLLPWLQDKKQG 431 +P DL++DG S+L L + QG Sbjct: 357 GLPADLQVDGKSILGLLHGETQG 379 >UniRef50_UPI0001C36AAF N-acetylgalactosamine 6-sulfate sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36AAF Length = 468 Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 108/372 (29%), Positives = 162/372 (43%), Gaps = 85/372 (22%) Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTE 164 ++ TP L L +GVRF N + V P+RA+++TG+ P++ G+ +G Sbjct: 27 SEIQTPNLDKLAKQGVRFDNFFCTSPVCSPARASLLTGKIPSQHGILDYLSGGNGGASQA 86 Query: 165 TFLPELFQNH----------GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 E ++H GY GKWHL + Sbjct: 87 AI--EFLKDHRGYTDILAEEGYTCGLSGKWHL-------------------------GDG 119 Query: 215 WQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQ 274 PQ +GF ++ G YYN+P +F+N +++ KGYI+D +TDEAI +DR K +Q Sbjct: 120 GHPQ-KGFSFWYAHQKGGGPYYNAP-MFRNGQKIEEKGYITDVITDEAISFIDREKNKEQ 177 Query: 275 PFMLYLAYNAPHLPNDNPAPDQY-----QKQFNTGSQ--------------------TAD 309 PF L + Y APH P N P +Y F T Q + Sbjct: 178 PFYLSVHYTAPHSPWINCHPKKYTDLYEDCPFETCPQGEVHPWAKTEVIAGYQKPRESLI 237 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI---------DGPLPLNGAQ 360 Y+A+V ++D V RIL++L++ ++T+I+F+SDNG +G PLN Sbjct: 238 GYFAAVTAMDDNVGRILKKLEEENLMEDTLIIFSSDNGFNCGHHGIWGKGNGTFPLN--- 294 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNY--DKLISAMDFYPTALDAADISIPKDLKLDG 418 Y P+ M KG + P N+ D++ S DF PT LD + KL G Sbjct: 295 ------MYDSSVKVPLIMCHKGHI-PENHVCDEMHSGYDFMPTPLDYLGFKNDEADKLPG 347 Query: 419 VSLLPWLQDKKQ 430 S L L ++Q Sbjct: 348 KSFLSALMGQEQ 359 >UniRef50_A6DFR6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFR6_9BACT Length = 573 Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 109/380 (28%), Positives = 167/380 (43%), Gaps = 71/380 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++++ DD GYG++ +K I+ TP + L Sbjct: 20 RPNVVLILTDDQGYGEVAAHG---------------------NKIIQ-----TPEMDKLY 53 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 EGVR N Y + + PSRAA++TGR +R GV+ ++ I E + + F GY Sbjct: 54 REGVRLDN-YHVNSICSPSRAALVTGRYASRVGVWHTLGGRNIIRKDEKTIADHFVAAGY 112 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY 236 T VGKWHL N P ++P++RGF F G + Sbjct: 113 KTGMVGKWHLG--DNAP---------------------YRPEDRGFQDV--FRIGGGSIG 147 Query: 237 NSPSLFKNR----------ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 P +KN + V KG+ +D D A+ V+ K PF L+++ APH Sbjct: 148 QLPDYWKNDLWDGHYWNKGQWVKTKGFCTDVQFDYALDFVEENKK--SPFFLFISTTAPH 205 Query: 287 LPN--DNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 P D + Y+K A +Y V ++D + R+ +L++ +NTI++F+S Sbjct: 206 SPTGADKKYLEPYEKLGLDKGICA--FYGMVTNIDDNIGRLRNKLRELKLEENTILIFSS 263 Query: 345 DNGAVIDGPL-PLNGAQKGYKSQTYPGGTHTPMFMWW-KGKLQPG-NYDKLISAMDFYPT 401 DNG+ D NG +G K Y GG P F++W KG G D++ + +D PT Sbjct: 264 DNGSACDKKGDSFNGGMQGKKGSLYEGGHRVPCFLYWPKGGWIGGKQLDQVTAHIDILPT 323 Query: 402 ALDAADISIPKDLKLDGVSL 421 L A I P + DG+ L Sbjct: 324 LLKACAIENPLNTAFDGIEL 343 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 122/525 (23%), Positives = 209/525 (39%), Gaps = 136/525 (25%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN + + MDD+G+ L +F TP + L Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTF--------------------------YETPNIDRLC 37 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----------------I 160 +G+ F N Y + V PSRA+ +TG+ PAR GV D + + Sbjct: 38 RQGMVFANSYASCPVCSPSRASCLTGKYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHL 97 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P E + + ++ GY T VGKWHL E+ P++ Sbjct: 98 PEGEYTIAQALKDAGYDTWHVGKWHL------------------------GGREFYPEHF 133 Query: 221 GFDYFMGFHAAGTAY--YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD--QPF 276 GFD +G + G + Y SP + P Y++D++TDEA+ ++ + + +PF Sbjct: 134 GFDVNIGGCSWGHPHDGYFSPYGIETLSEGPEGEYLTDRITDEAVRLLRKRQACGSRKPF 193 Query: 277 MLYLAYNAPHLP-----NDNPAPDQYQKQFNTGSQTA----------------------- 308 + L + A H P D ++ ++ +TA Sbjct: 194 YMNLCHYAVHTPIQVKDEDRARFEKKARELGLDKETALVEGEFHHTEDKKGRRVVRRVIQ 253 Query: 309 --DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG--AVIDGPLPLNGAQKGYK 364 +Y ++++DQ + R+LE L++ G+ +NT+++FTSDNG A +G N K Sbjct: 254 SDPSYAGMIWNLDQNIGRLLEALRECGEEENTVVVFTSDNGGLATSEGSPTCNLPASEGK 313 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GGT P+ + + G++ PG+ D ++ DFYPT L+ A + + +DG S++P Sbjct: 314 GWVYEGGTRVPLIVKYPGRVAPGSRCDVPVTTPDFYPTFLELAGVPQKAGIPIDGRSIVP 373 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L P + + W Y H+ ++ P + Sbjct: 374 LLSGNPM--PERPIFW--HYPHYGNQGGTP----------------------------AS 401 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQ 527 +V DY + E+ + LY L D + +NL P+ ++ Sbjct: 402 SVVMGDYKYIEFFEDGRGELYDLKADFSETNNLCEKMPETAARLR 446 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 138 bits (348), Expect = 5e-31, Method: Compositional matrix adjust. Identities = 140/548 (25%), Positives = 217/548 (39%), Gaps = 141/548 (25%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 +T +PNI+++ DDLGYG+L G + + TP L Sbjct: 69 ATVSRPNIVLIVADDLGYGEL----GCYGQTKIR----------------------TPRL 102 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----------------A 156 L EG++ TN Y + V PSR +MTG+ P V +N D Sbjct: 103 DQLAAEGIKLTNFYSGNAVCAPSRCCLMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPG 162 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 Q +P+ E + E ++ GY T A GKW L +F T Sbjct: 163 QYPLPVDEVTIAEYLKSVGYRTGAFGKWGLG------------------HFGTTG----D 200 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPS-LFKNRERVPAKG---------YISDQLTDEAIGVV 266 P +GFD F GF+ A+ + P+ L++NR + G Y DQ +EA + Sbjct: 201 PNEQGFDLFYGFNCQRHAHNHYPNFLWRNRVKEVQPGNDRTLHGETYSQDQFVNEACEFI 260 Query: 267 DRAKTLD--QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD--------------N 310 ++ D QPF YL + PHL P++ ++ + AD Sbjct: 261 RQSVAEDKTQPFFAYLPFAVPHLSIQ--VPEEEVDAYDGVIEEADYEHHGYLKHPRPRAG 318 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGA----------Q 360 Y A V +D+GV ++++ + G +NT+I+FTSDNG D L G+ Sbjct: 319 YAAMVTRMDEGVGQVVDLVDSLGLGENTLIMFTSDNGPTYD---RLGGSDSDYFNSASGM 375 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGV 419 KG K Q GG PM G + G I A DF PT DAA + + DG+ Sbjct: 376 KGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFLPTITDAAGVEVDAS-TTDGI 434 Query: 420 SLLPWLQ-DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL 478 S LP L D + H+ L W ++P + + Sbjct: 435 SFLPLLHGDDAAQQSHEFLYW----------------------------EFPGYSGQQAI 466 Query: 479 SQFSYTVRNNDYSLVYT---VENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR-EF 533 ++ D S E LY L+ DL + ++++A++P V+ +++ + + + Sbjct: 467 RMGNWKAIRKDLSKRLKKGQTEPPAFALYDLSKDLAESNDVSASHPDVMAKIEAIAKQQH 526 Query: 534 IDSSQPPL 541 + S Q PL Sbjct: 527 VPSEQFPL 534 >UniRef50_C5VKQ0 N-acetylgalactosamine-6-sulfatase n=3 Tax=Prevotella RepID=C5VKQ0_9BACT Length = 520 Score = 137 bits (346), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 143/528 (27%), Positives = 217/528 (41%), Gaps = 132/528 (25%) Query: 57 KPNIIVLTMDDLGY--GQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 +PNII+ +DD+G+ LPF D T NR+ TP + Sbjct: 43 QPNIILFMVDDMGWQDTSLPFA----DSITANNRKY-----------------DTPNMER 81 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--------DAQDGIPLTE-- 164 L EG+ FT+ Y A +S PSR ++MTG AR V + T +DG+ L + Sbjct: 82 LASEGMMFTDAY-ATPISSPSRCSLMTGMNMARHRVTNWTLHRDKMTDGKRDGVTLPDWN 140 Query: 165 -----------------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 +F+ +L +N GY+T GK H I P + Sbjct: 141 YNGIAQSGNVAHTTKAISFV-QLLKNVGYHTIHCGKAHWGAID---TPGE---------- 186 Query: 208 TTFSAEEWQPQNRGFDY-FMGFHAAGTAYY--------------NSPSLFKNRERVPAKG 252 P + GFD G A G A Y SP ER G Sbjct: 187 --------NPCHFGFDVNITGTAAGGLATYLSERNYGFAKDGKPTSPFAIPGLERYWGTG 238 Query: 253 -YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 + ++ LT EAI +++AK DQPF LY+++ A H+P D + S+ Y Sbjct: 239 IFATEALTQEAIASLEKAKKYDQPFYLYMSHYAVHVPIDRDMRFYPTYRARGLSEKEAAY 298 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI------DGPL-PLNGAQKGYK 364 + + +D+ + +++ + K G TII+F SDNG + DG L N K K Sbjct: 299 ASLIAGMDKSLGDLMDWVAKAGLKRETIIIFMSDNGGLASSSYWRDGELYTQNAPLKSGK 358 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADIS---IPKDLKLDGVS 420 Y GG P + W ++P I D YPT L A I +P+ K+DG Sbjct: 359 GSLYEGGIRVPFIVKWNNIVKPNTRSHAPIIIEDLYPTLLSMAGIKNYHVPQ--KIDGQD 416 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS- 479 + P L+ K+QG+ + L W +YP+ + E L Sbjct: 417 ITPILRGKQQGDKKRQLIW----------------------------NYPNIWDGEGLGI 448 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEM 526 + +R + L+Y+ Q LY L+ DL +K+NLA+++PQ+V+ + Sbjct: 449 SLNCAIREGQWKLIYSYLTGQKELYDLSSDLSEKNNLASSHPQLVERL 496 >UniRef50_A6DS95 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DS95_9BACT Length = 491 Score = 137 bits (346), Expect = 8e-31, Method: Compositional matrix adjust. Identities = 132/505 (26%), Positives = 209/505 (41%), Gaps = 80/505 (15%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII + DD GYG + + TP + L Sbjct: 28 PNIIFILTDDQGYGDMAVHGHPY--------------------------LETPNMDRLHS 61 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYY 177 E VRF YV+ S P+RAA+MTG R GV ++ + + ++ + GY Sbjct: 62 ESVRFDRFYVSPSCS-PTRAALMTGMHEFRNGVTHTVQPREKLYKGALTIADILKEGGYK 120 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 T VGKWHL + + + PQ RGFD++ +A G + Sbjct: 121 TGFVGKWHLG-----------------------NDKGYAPQYRGFDWYAK-NAKGPHNHF 156 Query: 238 SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQY 297 + +N +R KG+ D DEA+ + A +QPF LYL +PH P AP+ Sbjct: 157 DVEMIRNGKRFQTKGFREDAFFDEAMTFMKEAG--EQPFFLYLCTYSPHTPLG--APEDL 212 Query: 298 QKQFNTGSQTADN--YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP 355 K++ ++ Y A + ++D + R+ + LKK YD+TI++F +DNG + G Sbjct: 213 LKKYKAKGLNDNHAAYLAMIENIDDNLGRLDQFLKKENLYDDTILIFMNDNGVTV-GLDV 271 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLK 415 N +G K + GGT W K QP + L + +D PT + A + +P+ ++ Sbjct: 272 YNADMRGPKCTIWEGGTRAFSLWRWPKKWQPKTVENLTAHLDVLPTLCELAGVDVPEKVQ 331 Query: 416 --LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNP 473 L+G SL P L K H N + W N +R + H+ Sbjct: 332 GELEGYSLSPLLNGKDW--EHNNRLLFHNVGRW-PSGTAAAHKNAMCGIRKGNFLLVHSQ 388 Query: 474 NTEDLSQFSY-----TVRNNDYSL---VYTVENNQLGL-----YKLTDLQQK----DNLA 516 ED Y T+RN YT N Q ++L D+++ ++LA Sbjct: 389 GCEDPICEKYPSQCTTLRNVAKGFKHATYTKTNAQFHWGVSEGWQLYDVKKDPSNLNDLA 448 Query: 517 AANPQVVKEMQGVVREFIDSSQPPL 541 A+P++V E++ ++ D P + Sbjct: 449 NAHPELVDELKQAYSKWWDKQFPVM 473 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 137 bits (346), Expect = 9e-31, Method: Compositional matrix adjust. Identities = 122/427 (28%), Positives = 176/427 (41%), Gaps = 97/427 (22%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S KPNII + DDLGYG L G + K ++ TP L Sbjct: 15 SAADKPNIIYILADDLGYGDL----GCYGQKVIQ----------------------TPHL 48 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQ 172 + G++FT Y V GPSR+ ++ G+ V N Q + P+ Q Sbjct: 49 DKMAANGMKFTQHYSGSTVCGPSRSCLLEGKHSGNTYVRGNGMLQMRQDPHDLIFPKALQ 108 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY+TA +GK S + D P +GFDYF GF + Sbjct: 109 KAGYHTAMIGK------SGMGCNTDDAAL---------------PYQKGFDYFFGFTSHT 147 Query: 233 TAYYNSPS-LFKNRERV-----------PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 A++ P+ L+KN +V Y S+ + +EA+ V+R K D PF L+L Sbjct: 148 QAHWFFPTHLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYVERQK--DGPFFLHL 205 Query: 281 AYNAPH---------------------LPNDNPAPD-QYQKQFNTGSQTADNYYASVYSV 318 A+ PH LP + P Y+++ T + A V + Sbjct: 206 AFQIPHASLRAKEEWKAKYRPILKEKLLPKKDKHPHYSYEREPKT------TFAAMVSYM 259 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG-----PLPLNGAQKGYKSQTYPGGTH 373 D V + ++L+ G +NT+I+F SDNGA+ +G NG +G K Y GG Sbjct: 260 DHNVGLLNKKLEDLGLAENTLIMFASDNGAMQEGGHKRDSFDSNGVLRGGKRDMYEGGVR 319 Query: 374 TPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 TPM +W GK++ G ISA D PT + A + +D DG+S +P L K Sbjct: 320 TPMIAYWPGKIKAGQTSDHISAFWDISPTVRELAGAKVQED--TDGISFVPTLLGKGSQT 377 Query: 433 PHKNLTW 439 H L W Sbjct: 378 KHDYLYW 384 >UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED2_PARD8 Length = 468 Score = 136 bits (343), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 131/496 (26%), Positives = 204/496 (41%), Gaps = 114/496 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+I++ +DD GYG L G + + + TP + + Sbjct: 25 RPNVIIVFIDDFGYGDL----GCYG----------------------STKHRTPHIDQMA 58 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---------------AQDGIP 161 EG+R T+ YV VS PSR+A++TG P R ++ N D + G+ Sbjct: 59 KEGIRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLN 118 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + EL + GY TA +GKWHL +P + P +G Sbjct: 119 PGEITIAELMKEQGYATACIGKWHLG--DQLP---------------------FLPTRQG 155 Query: 222 FDYFMGFHAAG---TAYYNSPSLFKNRERVPAKGY--ISDQLTDEAIGVVDRAKTLDQPF 276 FDY+ G + Y P + + V G+ ++ + T++ + + K + PF Sbjct: 156 FDYYYGIPYSNDMDRPYCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIKSHK--ESPF 213 Query: 277 MLYLAYNAPHLP-NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 +YL +N H P +PA F SQ Y + +D + +LE LK+ G Sbjct: 214 FIYLCHNMTHNPLAASPA-------FKGKSQNG-LYGDATEELDWSMGVLLETLKEEGLD 265 Query: 336 DNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKL 392 NT+I+FTSDNGA G N +G K TY GG P M W K+ G D L Sbjct: 266 QNTLIIFTSDNGADEHFGG---TNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNL 322 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 +++MDF PT ++P D +DG ++ L+ + P + T Y + + Sbjct: 323 VTSMDFLPTLAHYCSYAVPSDRVIDGHNVSGILEGESMASPTE-----TFYYYQKQQLQA 377 Query: 453 PFWDN--YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLTDLQ 510 W N YH ++ + PH P+TE Y + N DL Sbjct: 378 VRWGNWKYHLPLKERIKG-PHFPDTEVGEARLYNLAN--------------------DLS 416 Query: 511 QKDNLAAANPQVVKEM 526 + N+ +P+VV +M Sbjct: 417 ETTNVIDKHPEVVTKM 432 >UniRef50_A6DMW5 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW5_9BACT Length = 525 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 127/469 (27%), Positives = 193/469 (41%), Gaps = 94/469 (20%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P + PNII + DD G+ L + + R+ VD Sbjct: 24 PKDAGASRSPNIIFIITDDHGFNDL---------EATDLRDEVDM--------------- 59 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLP 168 P + L G T Y PSRA I+TGR +FG+ N + +P + + Sbjct: 60 -PNIHKLTSNGALMTQAYCTAPQCVPSRAGIVTGRYQQKFGLERNGEGP--LPKNQQSIA 116 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE-------WQPQNRG 221 + GY T VGKWHL P K T+ + + T A+ +PQ G Sbjct: 117 MRLKKRGYKTGHVGKWHLE-------PNRKTTKWFEETGCTSMADAPKEAVNAHRPQGFG 169 Query: 222 FDYFMGFHAAGTAYY-----NSPSL------FKNRERVPAKGYISDQL-TDEAIGVVDRA 269 +D + +G Y+ N SL +K E+ Q+ T+ A+ ++R Sbjct: 170 YDEYA--QGSGNTYWSNFDVNGKSLKPQQINYKIYEKRAHTNKFRLQIQTELALNFIERH 227 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAP----------------DQYQKQFNTGSQTADNY-- 311 PF LYLAY PH P + P D +K + S A Y Sbjct: 228 APKPDPFFLYLAYYGPHSPLNAPKSLTDQVLSAEALATKGYDNTKKPYTNRSNFARPYTE 287 Query: 312 -------YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGA 359 A + +D GV +I++QL+K+G+ DNTI+ F DNGA + DG + N Sbjct: 288 AEVRQQGLALLKGIDNGVGKIIQQLEKSGELDNTILFFMGDNGAPLSTKSWDGSV--NDP 345 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGV 419 G K GG P F++WKG ++ + K ++ +D TA+ A KD LDGV Sbjct: 346 WIGSKGIILEGGARIPYFVYWKGVIEAQVFKKAVNTLDAGATAVALAGGDPSKDSMLDGV 405 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD 468 L+P+L K Q E ++ + Y + + I D +KF+ H++ + Sbjct: 406 DLMPFLTGKNQRELNRYI-----YQRYMNTAAI--IDGKYKFMTHENGE 447 >UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C586_BEUC1 Length = 478 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 125/458 (27%), Positives = 190/458 (41%), Gaps = 106/458 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+++ +DDLG+ D G F E TP + +L Sbjct: 15 RPNIVLVVVDDLGW----RDLGCFGSTFYE----------------------TPHIDALA 48 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ-----------DGIPLTET 165 G RFT+ Y A V P+RA+++TG+ PAR GV + G+P E Sbjct: 49 ASGTRFTHSYAAAPVCSPTRASLLTGKYPARVGVTNWIGGHAIGALRDVPYFHGLPQDEY 108 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 L + GY T VGKWHL ++ P++ GFD Sbjct: 109 ALARALRAGGYRTWHVGKWHLGGGRHL------------------------PEHHGFDLN 144 Query: 226 MGFHAAGT-AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 +G A+G+ Y +P E P +++D+LTD A+ +V + D PF+L L + A Sbjct: 145 VGGSASGSPVSYYAPYGIGALEDAPDGEFLTDRLTDVAVDLVRSSD--DAPFLLNLWHYA 202 Query: 285 PHLPNDNPA--PDQYQKQFNT------------------------------GSQTADNYY 312 H P + PA ++Y+ + T Q+ Y Sbjct: 203 VHTPIEAPAHLVEKYRHKAETLGLPTHGPDAVEAGEHMPARHLRSERVRRRRIQSDPTYA 262 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPG 370 A + ++D V R++ L+ G+ D+T+I+FTSDNG + +G N K G Sbjct: 263 AMLETLDGAVGRLVTALRDVGKLDDTLIVFTSDNGGLSTAEGSPTCNAPLSEGKGWMADG 322 Query: 371 GTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 GT P + W G++ G L ++ DFYPT L AA ++ + +DGV+L P Sbjct: 323 GTRVPTIVSWPGRVPAGARSDLPFTSPDFYPTLLAAAGLTQLPEQHVDGVNLWP----AW 378 Query: 430 QGEPHKNLTWITSYSHWFDEENIP---FWDNYHKFVRH 464 QG P Y H+ ++ P D K VRH Sbjct: 379 QGAPLDRGPIFWHYPHYSNQGGAPSAAVRDGRWKLVRH 416 >UniRef50_A6DHI2 Aryl-sulphate sulphohydrolase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI2_9BACT Length = 493 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 118/495 (23%), Positives = 212/495 (42%), Gaps = 96/495 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KP+II++ +DDLG+ L + + +P + +L Sbjct: 22 KPHIILINIDDLGWTDLSYQGSKY--------------------------YESPNIDALA 55 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLP-------- 168 G+ F GY A PSRA++++G+ R VY+ + G +P Sbjct: 56 KSGMIFDQGYAAAANCAPSRASLISGQQSPRTEVYTVGNPARGASNKRKLIPSPNIDFVD 115 Query: 169 -------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 + + GY TA +GK+H++K P G Sbjct: 116 ADNFTIADAMNSAGYLTATLGKYHVAK---------------------------DPLTHG 148 Query: 222 FDYFMGFHAAGTAY---YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 + +G G Y Y+SP + N + Y+ D LTDEAIG+ + QP + Sbjct: 149 WKINVGGREFGGPYNGGYHSPYEYPNLKETEKGRYLCDHLTDEAIGIF-KEHGAQQPIFM 207 Query: 279 YLAYNAPHLP-NDNPAPD-QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYD 336 Y Y H P +P + +Y+ + T Y A + ++D V R++ L++ G + Sbjct: 208 YFPYYTIHAPIQGHPKFEPKYKAKAKTKGHFNPKYAAMIEALDHNVGRLVAALEEQGLRE 267 Query: 337 NTIILFTSDNGAVIDGPL--PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-I 393 T+I+FTSDNG + PL + Y Y GG P F W G ++ G+ ++ + Sbjct: 268 KTLIMFTSDNGGHMKFSRQEPLRAGKGSY----YEGGIRVPFFASWPGVIEAGSRSQVPV 323 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 + +DFYPT + A + +P D +DG S LP L+ E ++L Y H+ P Sbjct: 324 TGLDFYPTVCELAGVELPDDKVVDGKSFLPLLKS----EVDEDLKNRALYWHF------P 373 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQK 512 + + + ++ P + + ++ +R+ + L + E++ + LY + +D +K Sbjct: 374 IYLQAYL----KPNEKPESRDPLFRTRPGSVIRHGKWKLHHYFEDDGVELYDINSDRSEK 429 Query: 513 DNLAAANPQVVKEMQ 527 ++L++ P+VV +++ Sbjct: 430 NDLSSEYPEVVSKLR 444 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 136 bits (342), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 137/514 (26%), Positives = 214/514 (41%), Gaps = 121/514 (23%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 ++ PNII++ MDD+G+ + F M N+ V TP + Sbjct: 39 SRRPPNIILILMDDMGWRDVGF---------MGNKFV-----------------ETPHID 72 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--GIPLTE------- 164 L G+ FT Y + P+RA +M+G+ R G+Y+ D + G P + Sbjct: 73 RLAKTGLVFTQAYASAPNCAPTRACLMSGQYAPRHGIYTVVDPRQPPGSPWHKWQAAESK 132 Query: 165 -------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 + E ++ GY TA G W+L + PV Q F P Sbjct: 133 SELDTNVVTIAEALRDGGYATAFFGMWNLGRGRTGPVTPGGQ------GFQKVVF----P 182 Query: 218 QNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 +N GF Y++ K Y++D+LTDE + VD + +QPF Sbjct: 183 ENLGF--------GKDEYFDD-----------GKHYLTDRLTDEVLKFVDEHR--EQPFF 221 Query: 278 LYLAYNAPHLPNDNPAPD---QYQKQFNTGSQTADN--YYASVYSVDQGVKRILEQLKKN 332 +YL +A H P NP P+ +Y+++ + D+ A++ +VD V RI++ LK+ Sbjct: 222 VYLPDHAIHAPF-NPKPELLAKYERKAAASNDRRDDPACAATIEAVDHNVGRIMDHLKRL 280 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDK 391 DNT+++FTSDNG PL G K + Y GG P+ + G G+ D Sbjct: 281 KLSDNTVVIFTSDNGGTQQYTPPLRGG----KGELYEGGIRVPLVVAGPGVKSLGSRCDV 336 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 +S++D YPT L+ A I P+ LDGVSL P LQ D E Sbjct: 337 PVSSIDLYPTLLELAGIKPPEGQVLDGVSLAPLLQGDAT----------------LDRER 380 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-YTVENNQLGLYKL-TDL 509 + FW H P + S +R D+ L+ + E ++ L+ L D Sbjct: 381 L-FW---------------HFPCYVGKATPSSAMREGDFKLIEFFEEGGRVELFNLKNDP 424 Query: 510 QQKDNLAAANPQVVKEMQGVVREF---IDSSQPP 540 ++ NLA+ P + +R + ++S PP Sbjct: 425 NEEKNLASVMPDKAAALAKTLRAWQKKTNASIPP 458 >UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI7_BACV8 Length = 508 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 141/537 (26%), Positives = 210/537 (39%), Gaps = 134/537 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII + DD+GYG L G + + STP + ++ Sbjct: 27 KPNIIYIMCDDMGYGDL----GCYGQPYI----------------------STPNIDNMA 60 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLT------------- 163 EG+RFT Y VS PSRA+ MTG+ V N + P+ Sbjct: 61 KEGMRFTQAYAGSPVSAPSRASFMTGQHSGHCEVRGNKEYWRDAPVVMYGNNKEYAVVGQ 120 Query: 164 ------ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 +PE+ +++GY T GKW +V P+ + +Y+ F A + P Sbjct: 121 HPYDPGHVIIPEIMKDNGYTTGMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYP 180 Query: 218 Q--NRGFDYFMGFHAAGTAYY--------NSPSLFKNRERVPAKGYISDQLTDEAIGVVD 267 NR A TA N P K+ + P Y +D + +EA+ +D Sbjct: 181 NFLNR-----YSKSAGDTAVVRVVMDENINYPMFGKDYFKRPQ--YSADMIHEEAMKWLD 233 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPD---QYQKQF----NTGSQTADNYYASVYS--- 317 + QPF Y PH P YQK+F G Q Y SV++ Sbjct: 234 KQDG-KQPFFGIFTYTLPHAELAQPEDSILTGYQKKFFEDKTWGGQEGSRYNPSVHTHAQ 292 Query: 318 -------VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKS 365 +D V +L +LK+ G +NTI++FTSDNG +G +G +G K Sbjct: 293 FAGMITRLDYYVGEVLNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKR 352 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADI--------SIPKDLK- 415 Q Y GG P + W GK+ G D ++ D PT D A + + KD+ Sbjct: 353 QCYEGGIRIPFIVRWPGKVPEGTVNDHQLAFYDLMPTFCDLAGVKNYVKKYTNKKKDVDY 412 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 DG+S P L ++ + H L W FDE + Sbjct: 413 FDGISFAPTLLGQEGQKKHDFLYWE------FDETD------------------------ 442 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 VR D+ +V V+ LY L TD+ + ++AA +P +VK+M+ ++R Sbjct: 443 ------QIGVRMGDWKMV--VKKGTPFLYNLATDIHEDHDIAAGHPDIVKQMKEIIR 491 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 138/526 (26%), Positives = 221/526 (42%), Gaps = 105/526 (19%) Query: 44 FSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 S T ++ KP N++ +DDLG+ L G + D Y+ Sbjct: 19 LSSITQPTHAADEKPWNVVFFLVDDLGWTDL----GCYGS---------DFYQ------- 58 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG--- 159 +P + L EG++FT Y A P+R A++TG PAR + TD G Sbjct: 59 ------SPNIDQLAAEGMKFTQNYSACNACSPTRGALLTGMYPARTHL---TDWIPGWAK 109 Query: 160 ----IPLTE-----------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYH 204 PL T LPE + GY T VGKWHL N+P Q + Sbjct: 110 SYTDFPLKPPEWKKHLDQKYTTLPEALRTAGYQTFHVGKWHLGGRGNLP-----QDHGFD 164 Query: 205 DNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIG 264 N + NRG F G A SL + + Y++D++ DEA+ Sbjct: 165 VNISG--------TNRGLPRSYHFPYGGDAMKWDSSLTEAERQ---DRYLTDRMADEAVA 213 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLP-NDNPAPDQYQKQFNTGSQTADNYYAS-VYSVDQGV 322 ++ + + D+PF LY ++ + H P P + K G + + YA+ + SVD+ + Sbjct: 214 LIRQQQ--DKPFFLYCSFYSVHSPIQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSVDEAI 271 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 R+ QLK++G D T+I+FTSDNG V N +G K Q + GGT P + W G Sbjct: 272 GRVRAQLKESGIADRTLIVFTSDNGGV-RRKTSNNDPLRGEKGQHWEGGTRVPAIVLWPG 330 Query: 383 KLQPGN-YDKLISAMDFYPTALDAADIS--IPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G+ + I MDFYPT L+ ++ + +DG+SL+P L+D + L W Sbjct: 331 VTPAGSVCAEPIITMDFYPTILNITGVAGNTEHNQSVDGLSLVPLLKDPAATLNREALYW 390 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 Y H+ +P+ +R +Y L++ E+ Sbjct: 391 --HYPHYNVFIGVPY----------------------------SAIRVGEYKLIHYYEDG 420 Query: 500 QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFID--SSQPPLS 542 LY L DL + +++ P++ ++ +++ + +Q P+S Sbjct: 421 NDELYNLAEDLSETSDVSKTYPELTARLERRLQQHLKQVGAQMPVS 466 >UniRef50_A6DR20 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DR20_9BACT Length = 608 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 144/540 (26%), Positives = 226/540 (41%), Gaps = 131/540 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 K N+I++ DDLG V DT +G K + TP L L Sbjct: 18 KANVILILADDLG--------------------VSDT-SLGGSKLYQ-----TPNLERLA 51 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETF---------- 166 GV FTN Y A + P+R++I+TG+ PAR G + + + LT Sbjct: 52 KRGVYFTNAYAASPLCSPTRSSILTGQNPARTGFTAPHGHLENVVLTARAGKAAAPSKRQ 111 Query: 167 ---------------LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 L ++F+N GY TA GKWHL K Sbjct: 112 VSPVSVNRLSTEYLSLGKVFKNAGYKTAHFGKWHLGK----------------------- 148 Query: 212 AEEWQPQNRGFD----YFMGFHAAGTAYYNSPSLFKN-RERVPAKGYISDQLTDEAIGVV 266 + P GFD ++ G AG+ + +P + N +E P + +I D+L DE + Sbjct: 149 -SPYSPLEHGFDIDIPHWPGPGPAGS--FVAPWRYPNFKENYPGE-HIDDRLGDEIAKYI 204 Query: 267 DRAKTLDQPFMLYLAYNAPHLPND--NPAPDQYQKQFNTGSQTADNYYAS-VYSVDQGVK 323 K DQPF + + H P + D+Y+K + + + YA+ V S+D + Sbjct: 205 SENK--DQPFFINFWQFSVHAPFNAKQELIDKYRKLIDKNNPQHNPVYAAMVESMDDSIG 262 Query: 324 RILEQLKKNGQYDNTIILFTSDNG----AVIDGPLPL-NGAQKGYKSQTYPGGTHTPMFM 378 ++++ L+ N + TII+F SDNG +V+DG N +G K+ Y GGTH P + Sbjct: 263 KVIDALETNKLMEKTIIVFFSDNGGNIHSVVDGTTATSNKPFRGGKASIYEGGTHVPAIV 322 Query: 379 WWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 W + + G D LI + D Y + L+ A + + D +S +P L K QG K Sbjct: 323 VWPNQTKTGVRNDSLIQSEDLYASILEMAALPVDYQQAKDSISFVPVL--KGQGAKRKQ- 379 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 + +Y +PH+PN D S +R D+ L+ Sbjct: 380 --VFTY-------------------------FPHSPNVPDCVPPSAALRIGDWKLIKVFH 412 Query: 498 NN-----QLGLYKLTDLQ-QKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 +N + LY L + Q + NLA+ NP+ VK M V+ +++ S + + K+N+ Sbjct: 413 DNPDLTDRFELYNLANDQGEMLNLASQNPEKVKSMNEVIDQYLSKSA-CIKPIKNPKYNS 471 >UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C383_9PLAN Length = 405 Score = 135 bits (341), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 111/402 (27%), Positives = 171/402 (42%), Gaps = 74/402 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN+I++ DD +GS D ++++ TP + S+ Sbjct: 8 KPNVIIIFTDD---------QGSVDLNCYGAKDLI-----------------TPHMDSIA 41 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA---QDGIPLTETFLPELFQN 173 G+RFT Y + V PSRA ++TGR PAR GV N + + G+P + + E+ Q Sbjct: 42 RRGIRFTQFYASAPVCSPSRAGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQ 101 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA +GKWHL PE P +GF+ G H G Sbjct: 102 AGYQTAHIGKWHLGY-----TPET------------------MPHGQGFETSFG-HMGGC 137 Query: 234 A-------YYNSPS---LFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 Y+N P+ L++N + V G + D + ++ + +A D+PF LY A Sbjct: 138 IDNYSHFFYWNGPNRHDLWENGKEVWRDGAFFPDLMVEQCQDYIRKAG--DKPFFLYWAI 195 Query: 283 NAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 N PH P ++++K + S D Y A V ++D + +L L + TII+F Sbjct: 196 NVPHYPLQ--GKEKWRKTYAHLSSPRDKYAAFVSTMDDCIGEVLATLDACQLREKTIIIF 253 Query: 343 TSDNGAVID----GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMD 397 SD+G + G G +G K + GG P + W G + G D+L + D Sbjct: 254 QSDHGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATGCD 313 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 + PT +P LDG +L ++ PH+N W Sbjct: 314 WLPTISALTGAPLPAH-HLDGKNLKAVIESSTAKSPHENFYW 354 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 107/393 (27%), Positives = 169/393 (43%), Gaps = 67/393 (17%) Query: 49 PTEYSTKG--KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQ 106 P +S + +PN++++ DD GYG F +K ++ Sbjct: 21 PLAHSVRAADRPNVVMVITDDQGYGDCGFTG---------------------NKVVQ--- 56 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETF 166 TP + +L E T+ +VA S P+R+A+MTG R GV+ + + E Sbjct: 57 --TPNIDALAAESSVLTDYHVAPTCS-PTRSALMTGHWTNRTGVWHTISGRSMLRDNEVT 113 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD--Y 224 E+F + GY T GKWHL DN+ ++ ++ GF Y Sbjct: 114 FGEIFSDAGYQTGMFGKWHLG-----------------DNY------PYRAEDNGFTEVY 150 Query: 225 FMGFHAAG-------TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 G G AY++ S F N + V A+G+ +D E + D+PF Sbjct: 151 RHGGGGVGQTPDFWDNAYFDG-SYFHNGKAVKAEGFCTDVFFKEGNRFIRECVEADEPFF 209 Query: 278 LYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 Y+A NAPH P AP +Y + + ++ + +VD V + + L++ G +DN Sbjct: 210 AYIATNAPHGPLH--APQKYIDMYPEMNDNVATFFGMITNVDDNVGQTRKLLRELGVHDN 267 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW--KGKLQPGNYDKLISA 395 TI +FT+DNG G N +G K Y GG P M + G + + L A Sbjct: 268 TIFIFTTDNGTA-GGASVYNAGMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHA 326 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 +D PT LD + P+ +K DG S++ L+D+ Sbjct: 327 VDVVPTLLDMCGVEAPESVKFDGTSIVSLLKDE 359 >UniRef50_B8KKX3 Arylsulfatase B n=1 Tax=gamma proteobacterium NOR5-3 RepID=B8KKX3_9GAMM Length = 507 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 103/395 (26%), Positives = 163/395 (41%), Gaps = 96/395 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++++ DD+G+ D + TP + L Sbjct: 40 RPNVVIILADDMGWN---------------------------DVGYHGSDIHTPHIDQLA 72 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--GIPLTETFLPELFQNH 174 EG+ Y A P+RAA+++G++ G+YS + G+ L + +P F++ Sbjct: 73 AEGLELDRFY-AQTACSPTRAALLSGQSSQSLGIYSPLSKLNPTGLALDQKIMPAYFRDA 131 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY T VGKWHL F E++P RGFD+F G G Sbjct: 132 GYQTFMVGKWHLG----------------------FYEPEYRPLARGFDHFYGNLTGGVG 169 Query: 235 YYN-----SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN 289 Y+N +N + + +GY S L I + + + ++P LY A+NAPHLPN Sbjct: 170 YWNHVHGGGLDWQRNGKTLRQEGY-STHLQSAEITRLIQQRDPEKPLFLYAAFNAPHLPN 228 Query: 290 DNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 + PA D + + + + A V +D + +++E L G +NT+I F SDNG + Sbjct: 229 EAPA-DTLARYAHIENPNRRIHAAMVTELDSAIGQLMETLSTEGMLENTLIWFMSDNGGL 287 Query: 350 IDGPLP------------------------------LNGA------QKGYKSQTYPGGTH 373 +P L+G +KG K Y GG Sbjct: 288 NRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNALDGGSDNSPHRKG-KQSIYEGGAR 346 Query: 374 TPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADI 408 P F++WKG+L P ++++ D PT L A DI Sbjct: 347 VPSFVYWKGRLSPERITQMVTVKDVLPTLLSATDI 381 >UniRef50_C7PLP2 Sulfatase n=9 Tax=Bacteroidetes RepID=C7PLP2_CHIPD Length = 563 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 153/595 (25%), Positives = 229/595 (38%), Gaps = 175/595 (29%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T S +PNI+++ DDLGY L G + + T Sbjct: 26 TTVSKDERPNIVLILADDLGYSDL----GCY-----------------------GGEIQT 58 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTG--RAPARFGVYSNTDAQDG----IPLT 163 P L L G+RF + Y PSRA+++TG A G + A+ G I Sbjct: 59 PNLDYLAANGLRFRHFYNTSRCC-PSRASLLTGLYNQQAGIGEMTTARAEAGYRGYITEN 117 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK-QTRDYHDNFTTFSAEEWQPQNRGF 222 L E+ ++ GY+TA GKWH+S P + + ++ + FS E P NRGF Sbjct: 118 TVTLAEVLKDAGYHTAMSGKWHVSNTVEQSTPAAQLKWLNHQASHPYFSPVEQYPVNRGF 177 Query: 223 DYFMGFHAAGTAYYNSPSLFKNR---ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 + + G Y++ SL E VP Y +D + D A+ V D+PF LY Sbjct: 178 EKYYGNIFGVVDYFDPFSLVNGTTPVESVPKDYYHTDAINDTAVSYVRALSKEDKPFFLY 237 Query: 280 LAYNAPHLPNDNPAPD--QYQKQFNTG--------------------------------- 304 +A+ APH P D +Y++ + G Sbjct: 238 VAHTAPHWPLQALPEDIKKYEQTYKGGWDVIREARYKRMVAQGLIDPKTTPLSPRINNQL 297 Query: 305 -----------SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA----- 348 ++ + A V +DQG+ R+++ L++ G+ DNTII+F SDNGA Sbjct: 298 SWDKNPDKDWDARAMAVHAAMVDRMDQGIGRLIQTLRETGKLDNTIIIFLSDNGASPENC 357 Query: 349 --------------------------VIDGPLP------------LNGAQKGYKSQTYPG 370 V+ GP +N + K+Q+Y G Sbjct: 358 MRYGPGFDRPGQTRDGKEISYPVKKDVLPGPQTTFASIGERWANVVNTPYQYAKAQSYEG 417 Query: 371 GTHTPMFMWW-KGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLK------LDGVSLL 422 G TPM +W KG G Y D+L MDF PT L+ A S P+ K GVSLL Sbjct: 418 GVRTPMIAYWPKGIKAKGAYADQLAHVMDFMPTFLNVAKASYPQTYKGHSITPSTGVSLL 477 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P + K+ EP ++ + N H R+ Sbjct: 478 PAFEGKQ--EPGHDVLY-----------------NEHFNARY------------------ 500 Query: 483 YTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 VR D+ LV ++ LYK+ D + ++LAA +P VV M R++ ++ Sbjct: 501 --VRAGDWKLVSLSGDSTWHLYKINQDETELNDLAAQHPDVVARMTAQWRQWANT 553 >UniRef50_Q7UH85 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UH85_RHOBA Length = 470 Score = 135 bits (340), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 123/419 (29%), Positives = 174/419 (41%), Gaps = 78/419 (18%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII+ DD G+G ++ TP L + Sbjct: 26 PNIILCMADDQGWGDTGYNG--------------------------HPHLRTPHLDQMAA 59 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPELFQNHGY 176 EGV FT Y A + P+RA+ TGR P RFGV T A G + TE + + + HGY Sbjct: 60 EGVTFTRFYAAAAMCSPTRASCYTGRNPYRFGV---TFAMKGMLEPTEIPITTVLKQHGY 116 Query: 177 YTAAVGKWHLSKISNVPVPEDKQ---------------TRDYHDNFTTFS-AEEWQP--- 217 T GKWHL +S +++ RD F T S W P Sbjct: 117 TTGHFGKWHLGTLSKTVGDQNRWGTFAKQPERYYCPPWERDVDVCFVTESKVPTWNPLVH 176 Query: 218 -------QNR-----GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 QN+ G Y G Y+ P +E G S + D AI Sbjct: 177 PGPISKKQNQQAPKQGQPY-------GNEYFTGPG---QKETDNMDGDDSRVIMDRAIPF 226 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRI 325 + A D+PF + ++ PH P P +Y++ + + A +YYA + ++D+ V R+ Sbjct: 227 IRDAVQNDRPFFAAVWFHTPHSPVIG-GP-KYREMYREQPEPAQHYYACLTAMDEQVGRL 284 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDG-PLPLNGAQ--KGYKSQTYPGGTHTPMFMWWKG 382 +LK G DNT++ F SDNG G P + A+ KGYK GG P M W Sbjct: 285 RAELKSLGVADNTMLCFCSDNGPARQGSPRHVGSAKNLKGYKLSIDEGGIRVPGLMVWPN 344 Query: 383 KL-QPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 K+ P D D++PT LDA + +P D DG SL+P L ++ E K L ++ Sbjct: 345 KVDSPRTLDAPCFTTDYFPTILDAIGVDLPTDRTYDGTSLIP-LVTRQTNERQKPLGFL 402 >UniRef50_C3ZQB5 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZQB5_BRAFL Length = 560 Score = 135 bits (340), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 114/407 (28%), Positives = 178/407 (43%), Gaps = 77/407 (18%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 + A KT + + + + + KPNII++ DD+G+G D S+ T E E Sbjct: 80 IGALKTKTSRDERSKLKTTVPVKPNIILMLADDMGWG----DLCSYGHPTQECGE----- 130 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 IDK + EG+RFT Y A + PSRAAI+TGR P R GV+ + Sbjct: 131 ---IDK--------------MAAEGMRFTQWYSADSLCSPSRAAILTGRLPVRVGVWGGS 173 Query: 155 D-----AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + G+P ET + EL + GY T VGKWHL + V + + D Sbjct: 174 RVFLPASTGGLPRDETTIAELLKEAGYATGMVGKWHLGAVHFVHM-------ESPDPMLC 226 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 F Y+N+ +L + R ++ +++ + Sbjct: 227 FK-----------------------YWNA-TLVQQPFR---HDNLTTSFLQDSVAFMHNN 259 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 K D PF LYL++ H+ D + ++++ G Y + +D V +L+ L Sbjct: 260 K--DTPFFLYLSFA--HMHTDMFSAPRFRETSRRG-----RYGDGLRELDWAVGEVLKTL 310 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLP--LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 T+++F SD+G ++ NG KG K+ T+ GG P WW G + PG Sbjct: 311 VSLQIQHRTLVIFLSDHGGHLEICTEGGSNGILKGGKASTWDGGLRVPGIAWWPGVVAPG 370 Query: 388 NYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 + L+S+MD + TA + A ++ PKD DG SL+P L +K P Sbjct: 371 QVSQHLVSSMDVFQTAAELAGVTPPKDRIYDGKSLVPILLEKTAAVP 417 >UniRef50_A6CB33 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CB33_9PLAN Length = 590 Score = 135 bits (339), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 102/353 (28%), Positives = 162/353 (45%), Gaps = 44/353 (12%) Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFL 167 +TP L ++ G R + YV+ V P+RA +MTGR R + + TE + Sbjct: 56 NTPHLDAMAANGARLSRFYVS-PVCTPTRANLMTGRYNYRTRAIDTYIGRAMLEPTEVTI 114 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 E GY T GKWHL P Q + + + QP + Sbjct: 115 AEALAPAGYRTGIFGKWHLGD----SYPLRPQDQGFQEVLVHRGGGIGQPSDP------- 163 Query: 228 FHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 G Y P LF N E+ +GY +D D A+ +++ ++ D+P +Y+A NAPH Sbjct: 164 --PEGAGKYTDPVLFHNGEKKQMQGYCTDIYFDHALKFLEQNESQDKPTFMYIATNAPHG 221 Query: 288 P-NDNPA-----------PDQY----------QKQFNTGSQTADNYYASVYSVDQGVKRI 325 P +D P D Y +KQF+ S+ ++ + ++DQ + ++ Sbjct: 222 PFHDVPEDLRKKYQAMDLTDAYGFDMNPKRKNEKQFDKTSRV----FSMIENIDQNIGKL 277 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 + LKK DNT++LF +DNG +GP + G +G K GG + + W +L+ Sbjct: 278 FQHLKKIDALDNTLVLFLNDNGP--NGPRYV-GEHRGAKGSVNEGGIRSVLIAHWPAQLK 334 Query: 386 PGNYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 G + I+A D +PT L A + P LKLDG+++LP L++K P ++L Sbjct: 335 AGTVNPTIAAHYDLFPTILAATGVEKPAGLKLDGINVLPLLKNKADQWPERSL 387 >UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG54_9BACT Length = 469 Score = 135 bits (339), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 128/480 (26%), Positives = 199/480 (41%), Gaps = 78/480 (16%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN++V+ DD G+ G+ D ++N L Sbjct: 28 PNVVVIYFDDTGWKDFGCFGGAVDTTHIDN---------------------------LAK 60 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPELFQNHG 175 G+RFT Y PSRA ++TGR P R G+YS + + +P +E + E + G Sbjct: 61 NGMRFTEYYAPAPNCSPSRAGLLTGRFPFRLGMYSYRSKNTPMHLPDSEITIAEALKTKG 120 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T GKWHL + P P +GFDY++ Sbjct: 121 YATGMFGKWHLGNLDGKSHP--------------------TPSEQGFDYWLACD-NNLIK 159 Query: 236 YNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP 294 +N SL +N + V G+ + + DEA + K PF Y+A++ H P D P Sbjct: 160 HNPKSLIRNGKPVGKIAGWAAQVVADEA---NEWMKKQTSPFFAYIAFSETHSPLDAPE- 215 Query: 295 DQYQKQFNTGSQTADNYYASV--YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 + K G Y + YS D V IL+ L G DNT++ SDNG + Sbjct: 216 ELITKYIERGENKKRATYRGMTEYS-DAAVGSILKTLDDMGVSDNTLVFLASDNGPTSED 274 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIP 411 +G KS T+ GG P + W GK++PG+ Y+ + +D PT D +P Sbjct: 275 SCE---GLRGKKSYTWEGGIRVPAIIRWPGKVKPGSEYNDPVGGIDLLPTLCDIVGAELP 331 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 K +DGVS+ L +G+P K T I S+ + +Y + H D Sbjct: 332 KR-HIDGVSIRSVL----EGKPFKRNTPILSFFYRTSPAASMRMGDY-VLIGHSDD---- 381 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 ED + S+++ D +V + + LY + DL Q+ N+AA P+ + E++ ++ Sbjct: 382 ----EDRKK-SHSMSAEDMPIVKSSKLVSFELYNIKNDLGQEKNIAATYPEKLAELRKIM 436 >UniRef50_A6DG79 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG79_9BACT Length = 486 Score = 134 bits (338), Expect = 6e-30, Method: Compositional matrix adjust. Identities = 113/398 (28%), Positives = 165/398 (41%), Gaps = 69/398 (17%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN I L DDLGYG F+ +K I+ TP L ++ Sbjct: 31 RPNFIFLMADDLGYGDTGFNG---------------------NKIIK-----TPHLDNMA 64 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPELFQNHG 175 EG RFT+ Y V P+R + +TGR R+G+ D G +P E + L + G Sbjct: 65 KEGARFTHFYSIGPVCAPTRGSALTGRHYMRYGM---MDVNVGKLPHQEITIARLCKQQG 121 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQ---------TRDYHDNFTT-FSAEEWQPQNRGFDYF 225 Y T GKWHL +S + P K RDY D F T S W P Sbjct: 122 YTTGHFGKWHLGTLSKIESPRHKNPAKDFAPPWERDYDDAFATEISVPTWDP-------- 173 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 AAG + + N ++V G S + D AI + +A T + FM + ++ Sbjct: 174 ----AAGRYPKHDSPYWHNGQKVTDNLLGDDSRVIMDRAIPFIRKAVTDKKSFMTTIWFH 229 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 PH P A +Y K + + +YY + ++D+ + R+ E+L+K NTII F Sbjct: 230 TPHSP--VVAGPEYLKMYEGYKEGEQHYYGCITAMDEQIGRLREELRKLNVDQNTIIWFC 287 Query: 344 SDNGAVIDGP------------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YD 390 SDNG G G +G K Y GG P + W GK+ G + Sbjct: 288 SDNGPEGRGNPKKKYDAYHGAFYGTAGKLRGRKRSLYNGGVCVPALVSWPGKIDAGKVIN 347 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 S +D+ + L ++ P LDG +++P L K Sbjct: 348 TPCSTLDYLESTLAQMNVKYPDSRPLDGENIIPILLGK 385 >UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UH46_RHOBA Length = 490 Score = 134 bits (338), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 134/510 (26%), Positives = 213/510 (41%), Gaps = 115/510 (22%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+++ DDLG+G F+ + TP L +L + Sbjct: 32 PNIVLMMCDDLGWGDTGFNGNTI--------------------------IQTPELDALAN 65 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYY 177 EG + Y V P+RA+ +TGR R G+++ +P E L + + GY Sbjct: 66 EGTVLDHFYSVGPVCSPTRASFLTGRHYFRMGIWTANKGH--LPSQEFTLARMLKTRGYA 123 Query: 178 TAAVGKWHLSKISNVPVPEDKQT-----------RDYHDNFTTFSA-EEWQPQNRGFDYF 225 T GKWHL +S + K RDY +F T SA W P Sbjct: 124 TGHFGKWHLGTLSRTVSAKGKGRRPDLHYAPPWERDYDASFVTESAVCTWDPG------- 176 Query: 226 MGFHAAGTAYY-NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 +G A YY N + +N G S L D A+ ++ A DQPF+ + ++A Sbjct: 177 IGKRARNNPYYENGVATDEN-----VLGCDSRVLMDRALPFIEAAAERDQPFLSVIWFHA 231 Query: 285 PHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 PH D A +Y ++ G A +YY + +VD V R+ ++L G DNT++ F S Sbjct: 232 PH--EDIQAGPEYLAKYE-GHGEAAHYYGCITAVDDQVGRLRKKLASLGVADNTLLFFCS 288 Query: 345 DNGAVIDGPLPLN----------GAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLI 393 DNG +G P N G G K GG P F+ W G++ G + + Sbjct: 289 DNGP--EGGEPSNRMKTRRAGSAGEFSGRKRSVLDGGVRVPAFVHWPGQIPAGVRLNAPL 346 Query: 394 SAMDFYPT--ALDAADISIPKDLKLDGVSLLP-WLQDKKQGEPHKNLTWITSYSHWFDEE 450 S MD PT A+ A+ ++P L LDG ++LP W ++ Q E+ Sbjct: 347 SVMDLLPTVAAITGAE-TLPNRL-LDGENVLPIWKGEQAQ-----------------REK 387 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV-ENNQLGLYKLT-D 508 +IPF QF+ VR ++ + ++++ L+ L+ D Sbjct: 388 SIPF----------------------RYGQFACLVRGKHKLIIESPNDDSKDRLFDLSKD 425 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + + +NLA P++ M+ + F++S++ Sbjct: 426 VSESNNLANQKPELTASMRTELLGFLESAK 455 >UniRef50_UPI00005846A1 PREDICTED: similar to arylsulfatase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005846A1 Length = 552 Score = 134 bits (337), Expect = 8e-30, Method: Compositional matrix adjust. Identities = 109/415 (26%), Positives = 172/415 (41%), Gaps = 90/415 (21%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 + KPN ++ DD+GYG L S+ T E + D Sbjct: 55 RDKPNFVIFFADDMGYGDL----ASYGHPTQERGPIDDV--------------------- 89 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA--------QDGIPLTETF 166 +++ G++FT GYV V PSR A++TGR P R GV+S T + G+P TE Sbjct: 90 MVENGIKFTQGYVPDTVCTPSRVALLTGRYPVRSGVFSGTGGSRVFLPWTRSGLPSTELT 149 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF- 225 + E + GY T GKWHL + +TRD + P + GFD+ Sbjct: 150 IAEALKEEGYTTGMAGKWHLGL--------NSETRDDGVHL---------PMHHGFDFVG 192 Query: 226 --------MGFHAAGTAYYNSPSLFK----NRERVPAK----GYISDQLTDEAIGVVDRA 269 M G + + P + K R+++ A+ Y++ ++A+ ++ Sbjct: 193 HILPFTNSMACDDTGR-FVDFPDVTKCFLYKRDQIVAQPFNHTYLTQTFVNDAVSFIE-- 249 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 PF Y ++ PH+P Y G Y ++ + V +++ L Sbjct: 250 DNAHDPFFFYFPFSHPHVP-------LYASPRFAGKSQRGEYGDNINEMSWAVGEVIDAL 302 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ-------KGYKSQTYPGGTHTPMFMWWKG 382 + G NT++LF +D+G P P A KGYK+ T+ GG P +W G Sbjct: 303 EAKGLSQNTLVLFLADHG-----PQPEYCAHGGDPSIFKGYKTNTWEGGIRVPFVAYWPG 357 Query: 383 KLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 ++ P D L+S +D T +D A+ ++P D DG + L K PH L Sbjct: 358 QITPRESDALVSTLDIMRTVVDLANGTLPDDTAYDGEVITDVLL-KNAPSPHDVL 411 >UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ23_9FIRM Length = 483 Score = 134 bits (337), Expect = 9e-30, Method: Compositional matrix adjust. Identities = 114/403 (28%), Positives = 160/403 (39%), Gaps = 92/403 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNIIV DD GYG L + + TP L L Sbjct: 16 KPNIIVFLTDDQGYGDL--------------------------SCMGSTDVCTPNLDILA 49 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-------NTDAQDGIPLTETFLPE 169 G RFT+ Y V PSRA ++TGR P GV S T GIP + L + Sbjct: 50 AGGARFTDFYAGSAVCSPSRACLLTGRYPYMTGVRSILGGIKTTTGLNPGIPTFASALKD 109 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 L GY T VGKWHL + E +P + GFDYF GF Sbjct: 110 L----GYTTGMVGKWHLGAVP-----------------------ECRPTHMGFDYFCGFL 142 Query: 230 AAGTAYYN---------------SPSLFKNRERVP--AKGYISDQLTDEAIGVVDRAKTL 272 + Y++ + L++N ER Y ++ + + + Sbjct: 143 SGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLEFIREQVEK 202 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 D PF LY A+NAPH P AP +Y ++F + A + +VD GV I+ LK+ Sbjct: 203 DMPFALYCAFNAPHYPMH--APYKYLERFKHLPEDRQIMAAMLSAVDDGVGEIMNYLKRR 260 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLN-----------GAQKGYKSQTYPGGTHTPMFMWWK 381 G +++TII F SDNG + L+ G KG+K + GG P W Sbjct: 261 GIFNDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGHKFSLFDGGIRVPAIFSWP 320 Query: 382 GKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 + G + D +PT ++AA + D ++ G +LP Sbjct: 321 AMVPAGQVISEPCMGTDIFPTFINAAGGN-ASDYEISGCDILP 362 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 131/493 (26%), Positives = 207/493 (41%), Gaps = 100/493 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNII++ DDLGYG L G + Q TP + L Sbjct: 42 RPNIILILADDLGYGDL----GCYG----------------------QTQIKTPNIDKLA 75 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-AQDGIPLTETFLPELFQNHG 175 ++G++FT+ Y V PSRA +MTG+ + N D + +G LT + ++ + G Sbjct: 76 EDGMKFTSFYAGSTVCAPSRATLMTGKNTGHVNIRGNADLSLNGEELT---IAKILKLAG 132 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T +GKW L + +P + +Y A ++ P + F + Sbjct: 133 YATGCIGKWGLGNEGSPGLPGRQGFDEYLGYLDQVQAHDYYPTHL-------FRSDSKGE 185 Query: 236 YNSPSLFKNRERVPAKG-YISDQLTDEAIGV--VDRAKTLD--QPFMLYLAYNAPHLPN- 289 + +L +N KG Y +D T A+ +++ L+ + F LYL Y PH N Sbjct: 186 ESKIALTEN--DADHKGLYSNDFFTQSALNYLRINKPSKLNKHRSFFLYLPYTLPHANNE 243 Query: 290 ---------DNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 + P+ + Y + Q N A + +D V I++ LKK+ +NT++ Sbjct: 244 LGNRTGNGMEVPSTEPYTNE--QWPQVEKNKAAMITRLDHYVGEIMDYLKKSKLDENTVV 301 Query: 341 LFTSDNGAVIDG---PLPLN--GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLIS 394 +F SDNG +G P N G +G K Y GG P + W +++ G+ D ++ Sbjct: 302 IFASDNGPHKEGGVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLA 361 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 DF PTA + A S P + +DG+S LP L K Q H+ L W Sbjct: 362 FWDFLPTAAEIARTSSPTN--IDGISFLPTLLGKAQTNRHQYLYW--------------- 404 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKD 513 +H+ F VR D+ V N + LY L TD+ +KD Sbjct: 405 --EFHE------------------QGFDQAVRMGDWKAVRHGINGPIELYNLKTDVSEKD 444 Query: 514 NLAAANPQVVKEM 526 N+A NP+V+ ++ Sbjct: 445 NVADKNPEVMAKI 457 >UniRef50_A3I2G9 Putative secreted sulfatase n=1 Tax=Algoriphagus sp. PR1 RepID=A3I2G9_9SPHI Length = 512 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 138/517 (26%), Positives = 209/517 (40%), Gaps = 95/517 (18%) Query: 47 FTPTEYSTKGKPNIIVLTMDDLGYG--QLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 F + T PN+IV +DDLG+ LP D S T+ N+ Sbjct: 18 FASCKKETVQPPNVIVFLVDDLGWNDTSLPMDGQS----TLYNQ---------------- 57 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLT- 163 + TP L L +G FT+ ++ + PSR +++TG+ R V + Q + T Sbjct: 58 -RYQTPNLEKLASQGKMFTHAR-SNAICVPSRVSLLTGQNFMRHQVKGDIIEQYNVRKTL 115 Query: 164 -----------ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 E LP + + GY T GK+H + PEDK Sbjct: 116 WFPPGKVIEKPENMLPAMLKKQGYRTIISGKYHACDL----CPEDKSP------------ 159 Query: 213 EEWQPQNRGFDYFM---GFHAAGTAYYNSPSLFKNRERVPAKG---------YISDQLTD 260 P+ GFD + GF A + Y KN E P G ++++ LT Sbjct: 160 ---TPEAAGFDVNIAGTGFGAPKSYYGIDSFQRKNTETQPMPGLESYFGKEIHLTEALTI 216 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYAS-VYSVD 319 EA+ A QPF LYL+++A H P P + G A+ YA+ + VD Sbjct: 217 EALKASKVAVDKGQPFFLYLSHHAVHTPIQEQKPYRENYTLTEGEPEAEAAYATMIEGVD 276 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVID--------GPLPLNGAQKGYKSQTYPGG 371 + +++ L G +NT+++F SDNG + G N + K+ Y GG Sbjct: 277 NSLGEVIKALDDWGIANNTLLIFYSDNGGRVLFRGKKSLYGDFEFNYPLRSGKASNYEGG 336 Query: 372 THTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P + W GK++ D + D Y T L+A IP D +DG+S LP L+ + Sbjct: 337 IRVPCVVRWPGKVKKQTVSDAPLVIEDIYTTVLEATHTKIPDDYAIDGMSWLPVLEKE-- 394 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 EP K F + ++ F+ Y R Y N + + S + D+ Sbjct: 395 -EPQKA----------FKDRSMFFFMPY----RFDGVTYNGNDFSNGGVKPSASFIKGDW 439 Query: 491 SLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEM 526 L+Y E LY L D+ ++ NL A+ P+ KEM Sbjct: 440 KLIYFFEEEIFELYNLKEDVGEQKNLFASQPEKAKEM 476 >UniRef50_UPI00005887B4 PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI00005887B4 Length = 465 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 53/339 (15%) Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY----------SNTDA 156 K TP L + EG+ + Y A+ + PSRAA++TGR P R G Y S Sbjct: 19 KETPNLDQMAAEGILLPDFYAANPLGSPSRAALLTGRLPIRNGFYTTNGHAHNAWSQQIV 78 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + GIP +E LP+L + GY + VGKWHL + ++ Sbjct: 79 KGGIPDSEILLPKLLKLSGYKSKIVGKWHLGHLP-----------------------QYL 115 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 P GFD + G + ++++ E I E + ++++ QPF Sbjct: 116 PLKHGFDEWFGAPNCHIKSLPNIPVYRDSEM------IGRYFEQEGLNFIEKSAEAKQPF 169 Query: 277 MLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYD 336 LY +A H P Y + G Y +V +D+GV +IL +LK+ Sbjct: 170 FLYWTPDATHEP-------VYASKPFLGRSQRGLYGDAVIELDEGVGQILGKLKELQIDT 222 Query: 337 NTIILFTSDNGAVI----DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL 392 NT ++FTSDNGA +G NG K TY GG P WW ++PG Sbjct: 223 NTFVVFTSDNGAATYAKENG--GTNGPYLCGKRTTYEGGMRVPTIAWWPTHIKPGRVTHQ 280 Query: 393 I-SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 I + MD + TAL+ A I P D +DG SLLP L + ++ Sbjct: 281 IGNIMDLFTTALNLAHIRPPSDRFIDGQSLLPALLNGEE 319 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 138/499 (27%), Positives = 211/499 (42%), Gaps = 120/499 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ + +DD+G+ DP + N+ TP + L Sbjct: 34 RPNIVFILIDDMGWP---------DPVSYGNQF-----------------HDTPHIDQLA 67 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG---------IPLTETFL 167 +GVRFT+ Y A V P+RA+I G+ AR + TD G +P L Sbjct: 68 SDGVRFTDFYAACPVCSPTRASIQAGQYQARLHL---TDFIPGHWRPFEKLIVPENAPHL 124 Query: 168 P-------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P EL Q+ Y TA GKWHL S+ P D+Q Y + T + P+ R Sbjct: 125 PLEIVTPGELLQSANYNTAYFGKWHLGPESHNP---DQQ--GYQTSLVT-GGRHFAPRFR 178 Query: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 +PS R+P K Y++D LTD+ I + + K+ +PF + L Sbjct: 179 ----------------TTPS-----TRIPNKAYLADFLTDKTIEFIRQNKS--KPFFVQL 215 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADN-----YYASVYSVDQGVKRILEQLKKNGQY 335 ++ A H+P + A Q +++ + A Y A V VD V RI+ L++ Sbjct: 216 SHYAVHIPLE--AKQQMIRKYQQKPKPAYGINNPVYAAMVAHVDDSVGRIVAALEELKLT 273 Query: 336 DNTIILFTSDNGAVIDG-----PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NY 389 +NT+++FTSDNG + + N + K Y GG P+ + W G G Sbjct: 274 ENTVVIFTSDNGGLRQSFSGGDIVSTNAPLRDEKGSLYEGGIRVPLIIKWPGVAAAGKTC 333 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + ++DF+PT + A ++ + +DG+SLLP L+D SH + Sbjct: 334 AEPTISIDFWPTFAEIAHTTLQEHQTIDGLSLLPLLKDPS--------------SH-LNR 378 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TD 508 E I F YPH ++ S +R D+ L+ + L LY L D Sbjct: 379 EEIYF-------------HYPHYHHSTPAS----AIRAGDWKLIEFFADGNLELYNLQQD 421 Query: 509 LQQKDNLAAANPQVVKEMQ 527 L + NLAA NP+ E+Q Sbjct: 422 LSETTNLAAKNPEKAVELQ 440 >UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE Length = 493 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 140/519 (26%), Positives = 207/519 (39%), Gaps = 135/519 (26%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLGY L F TP + L Sbjct: 27 PNVIFIYADDLGYTDLSCTGSRF--------------------------YETPHIDKLAR 60 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV--YSNTDAQDG---------IPLT--- 163 EGV FT Y A VS PSRAA++TG+ PAR + Y D G +P Sbjct: 61 EGVCFTQSYAACPVSSPSRAALLTGKYPARINLTDYIPGDRAYGPHKNQRLASLPFNLHL 120 Query: 164 ---ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 E + E F+ +GY T GKWHL++ + E+ P+ Sbjct: 121 SKDEITMAEAFRQNGYSTFMAGKWHLAE-----------------------SAEYYPEQN 157 Query: 221 GFDYFMGFHAAG--TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 GFD +G + G + Y SP + P Y++D+LTDE I + K ++PF + Sbjct: 158 GFDINIGGNNTGHPSKGYFSPYGNPQLKDGPEGEYLTDRLTDEVIRYISEPK--EKPFFV 215 Query: 279 YLAYNAPHLP----------------NDNPAPDQYQKQFNTGSQTADN---YYASVYSVD 319 YL+Y HLP PA + K+ T + + Y A V S+D Sbjct: 216 YLSYYTVHLPLQAKAEKIAKYRRKLSRAVPADSSFVKKGETYHKLVQDIPAYAAMVESLD 275 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGA---------VIDGPLPLNGAQKGYKSQTYPG 370 + + R+L+ L ++G + TI++FTSDNG + LPL A KGY Y G Sbjct: 276 ENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSNTTRNIPTSNLPLR-AGKGY---LYEG 331 Query: 371 GTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P + W L+ D I D+YPT LD + + +DGVS+ P LQ + Sbjct: 332 GIKVPAIIRWSRHLKGRQVSDTPIIGTDYYPTLLDLCGLPLLPGQHVDGVSMKPVLQGGR 391 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 P +L W YPH + S +R D Sbjct: 392 LSRP--SLFW----------------------------HYPHYSGGLG-GRPSAAIREGD 420 Query: 490 YSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEMQ 527 Y L+ E++ + LY + D ++ +L+ P++ ++ Sbjct: 421 YKLIEFFEDHHVELYNVIQDESEEKDLSQIYPEIADGLR 459 >UniRef50_P25549 Arylsulfatase n=54 Tax=Proteobacteria RepID=ASLA_ECOLI Length = 551 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 121/450 (26%), Positives = 182/450 (40%), Gaps = 114/450 (25%) Query: 18 LASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDK 77 +A M H A D K T+ +A E T KPN++V +DD+G+ + F+ Sbjct: 54 IADNMMPVMQHPAQD---KETQQKLA-----ELEKKTGKKPNVVVFLLDDVGWMDVGFNG 105 Query: 78 GSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRA 137 G A TP + ++ +G+ T+ Y + S P+RA Sbjct: 106 GGV-----------------------AVGNPTPDIDAVASQGLILTSAY-SQPSSSPTRA 141 Query: 138 AIMTGRAPARFGV-----YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNV 192 I+TG+ G+ Y G+ T LP+L + GY T A+GKWH+ Sbjct: 142 TILTGQYSIHHGILMPPMYGQPGGLQGL----TTLPQLLHDQGYVTQAIGKWHMG----- 192 Query: 193 PVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN------------SPS 240 E+K++ QPQN GFD F GF++ Y SP Sbjct: 193 ---ENKES---------------QPQNVGFDDFRGFNSVSDMYTEWRDVHVNPEVALSPD 234 Query: 241 LFKNRERVP--------AKG------------YISD---QLTDEAIGVVDRAKTLDQPFM 277 + +++P +G Y+ D + D + +D+ D+PF Sbjct: 235 RSEYIKQLPFSKDDVHAVRGGEQQAIADITPKYMEDLDQRWMDYGVKFLDKMAKSDKPFF 294 Query: 278 LYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA-SVYSVDQGVKRILEQLKKNGQYD 336 LY H D Y GS A Y + ++ + + L+KNGQ D Sbjct: 295 LYYGTRGCHF-------DNYPNAKYAGSSPARTSYGDCMVEMNDVFANLYKTLEKNGQLD 347 Query: 337 NTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM 396 NT+I+FTSDNG + P +G K T+ GG P F++WKG +QP D ++ Sbjct: 348 NTLIVFTSDNGPEAEVPPHGRTPFRGAKGSTWEGGVRVPTFVYWKGMIQPRKSDGIVDLA 407 Query: 397 DFYPTALDAADIS-------IPKDLKLDGV 419 D +PTALD A +PK +DGV Sbjct: 408 DLFPTALDLAGHPGAKVANLVPKTTFIDGV 437 >UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=Q0C069_HYPNA Length = 505 Score = 134 bits (336), Expect = 1e-29, Method: Compositional matrix adjust. Identities = 111/397 (27%), Positives = 166/397 (41%), Gaps = 66/397 (16%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E + +PNI+++ +DD+GY D GSF + TP Sbjct: 39 EAAASEQPNIVLIFVDDMGYA----DIGSFG----------------------SPIARTP 72 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ--------DGIPL 162 L L EG ++T+ Y V PSRA +MTGR R G+ A+ G+P Sbjct: 73 NLDRLAMEGQKWTSFYAPAPVCTPSRAGLMTGRLAVRSGMAGLVQARHVLFPTSTGGLPQ 132 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD-------NFTTFSAEEW 215 +E + EL Q GY +AA GKWH+ + +P + Y N W Sbjct: 133 SEVTIAELLQQEGYVSAAFGKWHMGHLPEF-LPTSHGFQSYFGIPYSNDMNMPGGGETPW 191 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERV--PAKGY-ISDQLTDEAIGVVDRAKTL 272 D F F ++ P L ++ E + PA + ++ + T+ AI ++ + Sbjct: 192 S-----IDLF--FEPPNIQNWDVP-LMQDEEIIERPADQFTLTQRYTERAIEFMETSHAE 243 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 QPF LYLA+N PH P + + TG Y + +D V I++ LK Sbjct: 244 GQPFFLYLAHNMPHTP-------LFTSEGFTGVSAGGAYGDVIEELDWSVGEIVDALKDM 296 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGY----KSQTYPGGTHTPMFMWWKGKLQPGN 388 NT+++FTSDNG + + + G K T+ GG P WW G++ P Sbjct: 297 KIEKNTLVIFTSDNGPWLA--MKTHSGSAGMLRDGKGTTWEGGMRVPAIFWWPGQIAPRT 354 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 L SA+D PT + +P+D DG L P L Sbjct: 355 VTDLGSALDLMPTFAAISGARLPEDRVYDGFDLSPAL 391 >UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_BACV8 Length = 464 Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 115/413 (27%), Positives = 172/413 (41%), Gaps = 83/413 (20%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLG G L G + + Q TP + + Sbjct: 30 PNVIYIMADDLGIGDL----GCYGQR----------------------QIKTPNIDGIAQ 63 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD--AQDGI------PLTETFLPE 169 G++F Y VS PSR A++TG+ + N DG+ P E + + Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 +F+ Y T VGKW + E P GFDYF G+ Sbjct: 124 IFKTKNYVTGCVGKWGMGG----------------------PGTEGMPGKHGFDYFYGYL 161 Query: 230 AAGTAYYNSPS-LFKNRERVPAKG--YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 A+ P L +N +++ G Y D + ++A+ +D + +PF LY + PH Sbjct: 162 GQRFAHSYYPEFLHENEQKIMLDGKYYSHDLMLEKALNFID--ENAQKPFFLYFSPTIPH 219 Query: 287 LPND--NPAPDQYQKQF--------NTGSQTADN----YYASVYSVDQGVKRILEQLKKN 332 D A +Y+ +F G ++ N Y A V +D+ V I+++LK+ Sbjct: 220 ADLDIMGEAMTEYEGEFCETPFGGSRDGYKSQQNPRAAYAAMVTYLDKSVGLIIKELKEK 279 Query: 333 GQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 G YD+TII+FTSDNG +G NG +G K Y GG TP + W G + G Sbjct: 280 GLYDHTIIVFTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQG 339 Query: 388 NYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 ISA DF PT + IP++ +DG+S LP L K + H + + Sbjct: 340 VVTNHISAFWDFLPTIGELVQADIPQN--IDGISYLPTLTGKGTQKEHDCIYY 390 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 135/524 (25%), Positives = 209/524 (39%), Gaps = 130/524 (24%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P+ + + +PNI+ +DDLG D G + E Sbjct: 28 PSTIAAEQQPNIVFFLVDDLGQ----RDLGCYGSTFYE---------------------- 61 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV--YSNTDAQDG------- 159 TP + L +G RFT Y A V P+RA+I+TG P R G+ Y TD +G Sbjct: 62 TPNIDKLAADGARFTQAYAACPVCSPTRASILTGLWPQRTGITDYIATDNSNGPAKWNRN 121 Query: 160 -----------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 + L L + ++ GY T GKWHL Sbjct: 122 TMTLPAAYRDRLALDSPTLAKSLKSAGYATFFAGKWHLGP-------------------- 161 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAY----YNSPSLFKNRERVPAKGYISDQLTDEAIG 264 E + P+N+GFD G G Y Y SP PA ++ D+L E Sbjct: 162 ----EGFYPENQGFDINRGGIERGGPYGGKQYFSPYGNPRLTDGPAGEHLPDRLATETCQ 217 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLP----------------NDNPAPDQYQKQFNTGSQTA 308 ++ + QPF Y ++ + H P P ++ Q Sbjct: 218 FIEAHQ--KQPFFAYFSFYSVHTPLQAREDLRQKYVAKREKLGLKPTWGREHMRDVRQVQ 275 Query: 309 DN--YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYK 364 ++ Y A V ++DQ V ++L +L + G +NT+++FTSDNG + +G N +G K Sbjct: 276 EHAVYAAMVDAMDQAVGKVLAKLDELGLRENTLVIFTSDNGGLSTSEGWPTSNLPLRGGK 335 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GG P+ M W K++ G+ D +S+ DF T L A + ++DGVSLLP Sbjct: 336 GWMYEGGIREPLVMRWPAKVKAGSTIDTPVSSPDFMATLLAATATKPAEQQQIDGVSLLP 395 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L +K E ++L W Y H+ ++ P + Sbjct: 396 LLAGEKLKE--RSLFW--HYPHYGNQGGAP----------------------------AA 423 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 +R + L+ +E+ Q+ L+ L TD + NLA+ P +V+EM Sbjct: 424 AIRRGSWKLIEWLEDGQVELFNLATDESETTNLASKEPALVREM 467 >UniRef50_B8KHZ9 Arylsulfatase A n=2 Tax=Gammaproteobacteria RepID=B8KHZ9_9GAMM Length = 483 Score = 133 bits (335), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 117/410 (28%), Positives = 181/410 (44%), Gaps = 95/410 (23%) Query: 59 NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDE 118 N++++ +DDLGYG D G++ + ++ TP + L E Sbjct: 30 NVLLIYVDDLGYG----DTGAYGHRVVK----------------------TPHIDRLAAE 63 Query: 119 GVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--DAQDGIPLTETFLPELFQNHGY 176 G+RFT Y + PSRA ++TGR P R GV S D+Q + ET L +L + GY Sbjct: 64 GMRFTQFYAPSALCSPSRAGLLTGRTPYRTGVESWIPDDSQVALGHNETTLADLAKARGY 123 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA------ 230 TA +GKWHL+ H T QP++ GFD+ G A Sbjct: 124 RTAVIGKWHLNG-------------GLHMQGTP------QPRDFGFDHQYGLAAWVKNAS 164 Query: 231 ---------AGTAYYNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G + ++ +++N E V P K Y ++ ++DEAI + AK PF L L Sbjct: 165 VRESKELPRRGAMFPDN--MYRNNEAVGPTKKYSAELVSDEAIDWLSGAK---DPFFLLL 219 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADN------------------YYASVYSVDQGV 322 Y+ H P +P Q Q + DN YYA+V +D + Sbjct: 220 TYSEVHTPIASPPEYLAQYQDYLTQEARDNPLLFYFDWRNRPWRGRGEYYANVSYMDAQL 279 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQ-----KGYKSQTYPGGTHT 374 R++E L+ G D+T+I+F+SDNG V D P L A +G K + GG Sbjct: 280 GRVIEYLRGKGVLDDTLIIFSSDNGPVTDAALTPWELGMAGETAGLRGKKRFLFEGGLRV 339 Query: 375 PMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 P + + +++ G + + +A+D +PT +++ + LDG SL P Sbjct: 340 PGIIRYPERIEAGRVESRPATALDVFPTLAQWLGVAVDSSVPLDGESLWP 389 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 143/538 (26%), Positives = 215/538 (39%), Gaps = 139/538 (25%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++ + DDLGYG L + + E TP + L Sbjct: 36 RPNVVFILADDLGYGDL----SCYGQEKFE----------------------TPNIDRLA 69 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----AQDGIPLTETFLPELFQ 172 G+RFT Y VS PSR+ ++TG + N + Q +P + F+ Sbjct: 70 QNGMRFTQCYSGTTVSAPSRSCLITGTHSGHTAIRGNKELAPEGQFPLPENSQTIFNDFR 129 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 N GY T A GKW L I + P +G D F G++ Sbjct: 130 NAGYRTGAFGKWGLGYIGSAG----------------------DPYKQGIDQFYGYNCQL 167 Query: 233 TAYYNSPS-LFKNRERVP----------AKG-YISDQLTDEAIGVVDRA-KTLDQPFMLY 279 A+ P L+ N +RV KG Y D + +A+ +D A K DQPF ++ Sbjct: 168 LAHSYYPDHLWDNDKRVDLPDNNLNVQYGKGTYSQDLIHSKALAFLDEAAKEKDQPFFMW 227 Query: 280 LAYNAPH----LPNDN------------------PAPDQYQKQ-FNTGSQTADNYYASVY 316 PH +P D+ P ++K + T + A VY Sbjct: 228 YPTIIPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKGGYCTQFYPHATFAAMVY 287 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGG 371 +D V +I+++LK G YDNTII+F+SDNG ++G NG +GYK Y GG Sbjct: 288 RLDVYVGQIVQKLKDMGVYDNTIIIFSSDNGPHMEGGADPDFFNSNGIWRGYKRDVYEGG 347 Query: 372 THTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPK--DLKLDGVSLLPWLQDK 428 PM + W G +QP D + S D PT + + PK +DGVS+LP LQ++ Sbjct: 348 IRVPMIISWPGHVQPSTETDFMCSFWDLMPTFREVLN---PKADTRNMDGVSILPLLQNR 404 Query: 429 KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNN 488 K + H+ L Y +F+ VR Sbjct: 405 KGQKEHEYL--------------------YFEFLEMNGRQ---------------AVRKG 429 Query: 489 DYSLVY-TVENNQ--LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE-FIDSSQPPL 541 D+ LV+ + N+ LY L +D +K N+ P+ E++ +++E I+ S PL Sbjct: 430 DWKLVHMNIRGNKPYYELYNLASDPSEKYNVLNQYPEKADELKAIMKEAHIEDSNWPL 487 >UniRef50_A6DMW1 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW1_9BACT Length = 585 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 133/539 (24%), Positives = 215/539 (39%), Gaps = 131/539 (24%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P+ KPN+IV+ +DD+G D ++ K + Sbjct: 2 PSALIAAKKPNVIVILIDDMGL----MDSSTYGSKFYQ---------------------- 35 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV------------------ 150 T + L EG+ FT+ Y A + P+RA+IM+G+ P+R + Sbjct: 36 TANMSRLAKEGMLFTDAYAASPLCSPTRASIMSGQYPSRLHMTVAVTPKSKEKPKALAPA 95 Query: 151 ----YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 ++++ +PL L E Q+ GY TA +GKWHL++ Sbjct: 96 PNQYCGKVESKNHMPLAVYTLAEALQDSGYTTAHIGKWHLTE------------------ 137 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHA-AGTAYYNSPSLFKNRERV--------PAKGYISDQ 257 + +N+GFD+ +G G Y SP K ++ P Y++++ Sbjct: 138 -----NPKHNAENQGFDFVIGGAGLPGPPDYYSPYKRKGKKAKGINNLSPGPKGEYLNER 192 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP---NDNPAPDQYQKQFNTGSQTADNYYAS 314 L E+I + + ++PF L L + A H P + P +++ Q Sbjct: 193 LAKESIKWIKSVQDSNKPFYLNLWHYAVHGPVIEKKDLMPKYLERRDPNNPQRCPEMGTM 252 Query: 315 VYSVDQGVKRILEQLKK---NGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQ 366 + S+D V +L+ L K DNT+I+ TSDNG VI N +G K+ Sbjct: 253 IDSMDNSVGMLLDWLDKPENKAVKDNTLIILTSDNGGVIHKETNGNTWTSNRPLRGGKAN 312 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 TY GGT P + W ++ G+ + ++D YPT L+A +I K L DG S+LP L Sbjct: 313 TYEGGTRVPWIVRWPDTIKAGSVCTTPVQSIDIYPTVLEAVNIKAKKGLTFDGQSILPLL 372 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + +K H+ + T + H F P S +V Sbjct: 373 EQRKM--EHQPI--FTDFQHLFGVMCAP----------------------------SSSV 400 Query: 486 RNNDYSLV---YTVENNQLGLYKLTDLQ----QKDNLAAANPQVVKEMQGVVREFIDSS 537 R D L+ + Q Y+L DL+ + NLAA P+ VKE+ ++ I + Sbjct: 401 RVGDMKLIRFYHAGPKAQSHAYELFDLKRDLYESINLAAYMPEKVKELDRLIEAHIKET 459 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 141/521 (27%), Positives = 212/521 (40%), Gaps = 108/521 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+ + DD+G+ L G + E TP + Sbjct: 25 KPNILFILADDMGWSDL----GCYGADLHE----------------------TPNIDRFA 58 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--FGVYSNTDAQDG--------------- 159 VRFT+ Y A V PSR+ +MTG+ AR F +++ AQ+G Sbjct: 59 SGAVRFTSAY-AMSVCSPSRSTLMTGKHAARLHFTIWAE-GAQEGGAKNRELREAESIWN 116 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE--WQP 217 +P +E + ++ GY TA +GKWHL + P + D + T + A + W P Sbjct: 117 LPNSEKTIATYLKSAGYLTALIGKWHLGDWEHYP---EAHGFDINIGGTNWGAPQTFWWP 173 Query: 218 QNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 + G G + P L E Y++D+LTDEAI V+D A DQPF Sbjct: 174 -------YSGSGTHGPEFRYIPHL----EYGHPGEYLTDRLTDEAIKVIDHAG--DQPFF 220 Query: 278 LYLAYNAPHLPNDNPAPD--QYQKQFNTGSQTADNYYASV-YSVDQGVKRILEQLKKNGQ 334 +YLA++A H P + A D + ++ G YA++ +D+ V R+LE LK+ G Sbjct: 221 VYLAHHAVHTPIEAKADDIQHFDAKYRDGMNHRHTIYAAMNKELDENVGRVLEHLKERGL 280 Query: 335 YDNTIILFTSDNGAVI--------DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 NT+++F SDNG I + P+ N + K Y GG P+ + W G Sbjct: 281 DKNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVPLIIRWPGVTPN 340 Query: 387 G-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 G D+ + D T L P DG+ + P L+D P L + H Sbjct: 341 GATCDEPVILTDMLQTFLHITG-QPPATDATDGMDISPLLKD-----PSAKLNRDALFFH 394 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK 505 YPH +T + +R D+ L+ E+N L LY Sbjct: 395 -----------------------YPHYYHT---TTPVSAIRARDWKLLEFYEDNHLELYN 428 Query: 506 L-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 L DL +K +LA P ++ + + DS L + N Sbjct: 429 LRNDLSEKHDLAKEMPDKAAALRDQLNAWRDSVGAVLPQPN 469 >UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLM9_BACSE Length = 463 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 118/415 (28%), Positives = 170/415 (40%), Gaps = 87/415 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII + DD+GY D + K +E TP + L Sbjct: 34 KPNIIFILADDMGY----CDLSCYGNKYIE----------------------TPNIDRLA 67 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP--------------L 162 G FT Y G+S PSR A+MTG+ + N GI Sbjct: 68 ATGTAFTQCYAGSGISSPSRCALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQP 127 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 +T + + GY T V KWHL D F E P NRGF Sbjct: 128 NDTTIATVLGAAGYRTCLVNKWHL------------------DGFN----PEATPLNRGF 165 Query: 223 DYFMGFHAAGTAYYNSPSLF-------------KNRERVPAKGYISDQLTDEAIGVVDRA 269 D F G+ + TAY N P + K E + +D T++AI ++R Sbjct: 166 DEFYGWLIS-TAYSNDPYYYPYWRFNNEKLENVKENEGDKHIKHNTDLSTEDAIKFINRN 224 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 K + PF LYLAY+APH P + Y + Y + + +D+ + R+L +L Sbjct: 225 K--NNPFFLYLAYDAPHEPYNIDETTWYDDE--AWDMNTKRYASLITHMDRAIGRLLAEL 280 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLP---LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 + G +NT+++F SDNGA PL G+ KG K Q Y GG P + GK+ Sbjct: 281 DRLGLRENTLVIFASDNGAAKQAPLEELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVPV 340 Query: 387 GNYDKLISAMDFYPT--ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 + +I D PT AL A +P+ KL+G+++LP ++ ++ L W Sbjct: 341 QKLNNIIYFPDVMPTLAALAGATDKLPQ--KLNGINILPLFYGQQLDTDNRLLYW 393 >UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UMZ5_RHOBA Length = 484 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 118/448 (26%), Positives = 182/448 (40%), Gaps = 98/448 (21%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 + A +HA L+A + +PNI+++ DDLGYG L G + Sbjct: 18 LVALCSHACVPTLLRADSND---------------RPNIVLILADDLGYGDL----GCY- 57 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 G D +++TP L L +GVR+T Y P+RAA++T Sbjct: 58 ---------------GND------EQATPVLDRLATQGVRWTQAYANGPECSPTRAALLT 96 Query: 142 GRAPARFG----------VYSNTDA-------QDGIPLTETFLPELFQNHGYYTAAVGKW 184 GR G V DA + G+P L + + GY TA GKW Sbjct: 97 GRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFGKW 156 Query: 185 HLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN------S 238 HL Y F+ P GFD + YY+ + Sbjct: 157 HLG---------------YEAKFS--------PMMHGFDEALYCIGGAMDYYHYLDSVAT 193 Query: 239 PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP---D 295 +LF N + +GY +D +TD+A+ + D+PF LYL Y APH P P D Sbjct: 194 YNLFHNGRPISGEGYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHTPYQAPGESPVD 253 Query: 296 QYQKQFNTGSQTADN---YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 Q AD Y A V +D+G+ ++L ++++ D T+++F SDNG Sbjct: 254 PLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVIFASDNGGT--- 310 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIP 411 N +G+K Q + GG P+ W G L G D++ D + L AA I+ Sbjct: 311 SASRNEPLRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAGITPT 370 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 ++ ++G+ +L + + +P + L W Sbjct: 371 QEDAMEGIDVLSLAANDEPVQP-RTLYW 397 >UniRef50_P50473 Arylsulfatase n=8 Tax=Deuterostomia RepID=ARS_STRPU Length = 567 Score = 132 bits (333), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 111/411 (27%), Positives = 175/411 (42%), Gaps = 86/411 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN+I+L DD+G G L ++ P + M Sbjct: 66 KPNVILLLADDMGVGDL---------------------------SVYGHPTQEPGFIDQM 98 Query: 117 -DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-----AQDGIPLTETFLPEL 170 ++G+RFT GY V PSR+AI+TGR P R GVY G+PL E + E Sbjct: 99 ANQGLRFTQGYSGDSVCTPSRSAIVTGRQPIRTGVYGEERIFLPWTTTGLPLYEVTIAEA 158 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF----- 225 + GY T VGKWHL + E+ + H P NRGFD+ Sbjct: 159 MKGAGYTTGMVGKWHLG------INENSSSDGAH-----------LPANRGFDFVGHNLP 201 Query: 226 ---------MGFHA------AGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 G H A YYNS S+ + + KG ++ L D+ +G ++ Sbjct: 202 FGNSWRCDDTGLHQDFPDTNACFLYYNSTSVAQPFQH---KG-LTQLLRDDTVGFIE--D 255 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 +++PF +Y+++ H+ + D + G Y ++ +DQ +++I+ L Sbjct: 256 NVNKPFFMYVSF--AHMHTSLFSSDDFSCTSRRG-----RYGDNLREMDQAIEQIVTTLV 308 Query: 331 KNGQYDNTIILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 N DNT+I FTSD+G + G +G K Q++ GG P ++W G + PG Sbjct: 309 DNDIDDNTVIFFTSDHGPHREYCGEGGDANVFRGGKGQSWEGGHRIPYIVYWPGTISPGV 368 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 +++++MD TA++ +P D DG L L + PH + + Sbjct: 369 SHEIVTSMDIIATAVNLGGSQLPTDRIYDGKCLKSVLLEGAS-SPHDDFFY 418 >UniRef50_A6KZ75 Putative secreted sulfatase n=8 Tax=Bacteroides RepID=A6KZ75_BACV8 Length = 517 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 142/547 (25%), Positives = 224/547 (40%), Gaps = 140/547 (25%) Query: 53 STKGKPNIIVLTMDDLGY--GQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 +T+ +PNII+ +DD+G+ LPF T K ++A E TP Sbjct: 29 TTQQRPNIILFMVDDMGWQDTSLPFW----------------TQKTHYNEAYE-----TP 67 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------------------- 150 + L +G+ FT Y + +S +R +++TG R V Sbjct: 68 NMERLAKKGMMFTQAYACN-ISSATRCSLITGANNTRHRVTNWTLEKNKATDRPSNTIEL 126 Query: 151 ----YSNTDAQDGIPLT--ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYH 204 Y+ G P T T EL + +GY+T GK H I P + Sbjct: 127 PDWNYNGVSQVTGTPNTFVGTSFVELLRQNGYHTIHCGKAHFGSID---TPGE------- 176 Query: 205 DNFTTFSAEEWQPQNRGFDYFMGFHAAG--TAYYNSPSLFKNRERVP------------- 249 P + GF+ + HAAG Y + + R+ P Sbjct: 177 -----------NPTHWGFEVNIAGHAAGGLATYLSEQNYGHTRDGKPYSLMAIPGLEDYW 225 Query: 250 -AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTA 308 + ++ LT EAI +D+AK +QPF LY+A+ A H+P D + K G Sbjct: 226 GTGIFATEALTQEAIKALDKAKKYNQPFYLYMAHYAIHVPVDKDM-RFFPKYIKKGLSDK 284 Query: 309 DNYYAS-VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----------LPL 356 + YAS + +D+ + ++ L+KN + DNT+I+F SDNG + P PL Sbjct: 285 EAAYASLIEGMDKSLGDLMNWLEKNDEADNTVIIFMSDNGGLAAEPGWRDGQIHTQNAPL 344 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLK 415 N K Y GG PM + W G + P DK + DFYPT L+ A I+ K + Sbjct: 345 NSG----KGSLYEGGIREPMIVSWPGVVTPNTRCDKYLIIEDFYPTILEMAGITNYKTVN 400 Query: 416 -LDGVSLLPWLQDKKQGEPHKN--LTWITSYSHWFDEENIP-FWDNYHKFVRHQSDDYPH 471 +DG+S +P L K G+P K L W N P W N D P Sbjct: 401 PIDGISFMPLL--KGTGDPSKGRALVW-----------NFPNIWGN----------DGPG 437 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 ++R +++ LVY E + L+ + D+ +K+++A +P +VK + + Sbjct: 438 -------INLDCSIRKDEWKLVYYYETGKKELFNIPNDISEKNDVAKQHPGIVKRLSKEL 490 Query: 531 REFIDSS 537 ++ ++ Sbjct: 491 GNYLRAT 497 >UniRef50_D0PR10 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR10_9SPHI Length = 607 Score = 132 bits (333), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 108/399 (27%), Positives = 174/399 (43%), Gaps = 73/399 (18%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V FS S KPN+I++ DD+GYG D N+++ Sbjct: 15 VFFSFLYIKSCSDIDKPNVIIILTDDMGYG---------DIAAHGNKDI----------- 54 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 STP + L DE +R TN +V + PSR+A+MTG+ R GV+ + + Sbjct: 55 ------STPHIDQLHDESLRLTNFHV-NPTCAPSRSALMTGKDANRVGVWHTVMGRSLLY 107 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + ++F + Y T GKWHL DN+ + PQ RG Sbjct: 108 EEEETMADIFSANNYATGLFGKWHLG-----------------DNYP------FAPQYRG 144 Query: 222 FDYFMGFHAAGTA----YYNSPSL----FKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 F + G Y+N+ +N + +GY +D EA+ + K + Sbjct: 145 FQEVLTHGGGGVGQTPDYWNNDYFDDVYLRNGQEEKFEGYCTDVWFREALTFIKENK--E 202 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN--YYASVYSVDQGVKRILEQLKK 331 PF+ Y++ NAPH P + P Y + + D +Y + ++D + + ++L++ Sbjct: 203 NPFLCYISTNAPHTPLN--VPSSYAEPYLKKGIQEDRAKFYGMISNIDDNIGLLRKKLEE 260 Query: 332 NGQYDNTIILFTSD----NGAVIDGPLPLNG---AQKGYKSQTYPGGTHTPMFMWWK-GK 383 G DNTI++F SD NGA + G L+G +G K Y GG P +++WK G Sbjct: 261 WGIADNTILIFMSDNGTANGATLKGKQLLSGYNANMRGVKGSPYDGGHRVPFYVYWKNGN 320 Query: 384 LQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 L G + ++L + +D PT + +S + DG+ L Sbjct: 321 LNHGMDINQLTAHIDVLPTLIKMCGLSNVPTINFDGIDL 359 >UniRef50_A6DHY0 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHY0_9BACT Length = 507 Score = 132 bits (332), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 127/502 (25%), Positives = 200/502 (39%), Gaps = 113/502 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 K N + + DD GYG F + +KI TP L + Sbjct: 19 KLNYVFMMTDDQGYGDTGF----------------NGHKI----------IKTPHLDQMA 52 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 EG + T Y V P+R +TGR R+G++ +P E L + + GY Sbjct: 53 KEGAKLTQFYAGGPVCSPTRGTYLTGRHYYRYGIWGANVGH--LPKEEITLASVLKQQGY 110 Query: 177 YTAAVGKWHLSKIS--------------NVPVPEDKQTRDYHDNFTTFSA-EEWQPQNRG 221 T GKWHL ++ N P + RDY ++F S+ W P + Sbjct: 111 VTGHFGKWHLGTLNKDYSTKGESRKPTENFAPPWE---RDYDESFVVESSVSTWDPASEK 167 Query: 222 FDYFM-GFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 +++ G GT SL+ RV + D+AI ++RA + PF+ + Sbjct: 168 NPFYINGVPMKGT----EESLYGGAARV---------VVDKAIPFMERAVSEGNPFLAVV 214 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 +NAPH P A +Y + + + A +YY + +D+ V RI +L++ G NT++ Sbjct: 215 WFNAPHEPIK--AGPKYLEMYKEHGEAA-HYYGCLTEMDEQVGRIRAKLREMGVEKNTVL 271 Query: 341 LFTSDNG----AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISA 395 F SDNG +G K Y GG P W GK+Q G+ D +S Sbjct: 272 FFCSDNGPEGKKAKGAKAGTTSGLRGRKRSLYDGGVRVPALAEWPGKIQAGSVIDAAMST 331 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 +D+ PT + + +P + LDG ++L L GE + + IPF Sbjct: 332 LDYLPTVIALQNHQMPDERPLDGENILALL----TGEESQR------------KRGIPF- 374 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDN 514 + + + DY LVY E LY L+ D +++N Sbjct: 375 ----------------------IHRGKAVLNRGDYKLVYPKE-----LYALSNDWSEENN 407 Query: 515 LAAANPQVVKEMQGVVREFIDS 536 +A+ P++V EM + F+ S Sbjct: 408 IASQYPEIVAEMSKELEAFVLS 429 >UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX9_9BACT Length = 467 Score = 132 bits (332), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 117/411 (28%), Positives = 181/411 (44%), Gaps = 87/411 (21%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V S F + KPNI+++ DD GY L G F Sbjct: 9 VLLSTFVAASLTAAEKPNILIIFTDDQGYADL----GCFG-------------------- 44 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 + + TP L L EG +FT+ Y A V GPSR+A++TGR PAR + G+P Sbjct: 45 --SEENQTPVLDKLAKEGTKFTSFY-AQPVCGPSRSALLTGRYPARSKGW-------GMP 94 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 +E E+ + GY TA VGKW +S + +P P +G Sbjct: 95 ASEITFAEMLKETGYQTACVGKWDVSNRQPI-IPR-------------------MPNAQG 134 Query: 222 FDYFMGFHAAGTAYYNSPSLFKN--RERVPAK-GYISDQLTDEAIGVVDRAKTLDQPFML 278 FDY+ G G L++N +ER ++ T++AI +++ + ++PF+L Sbjct: 135 FDYYYG--TLGGNGSGKIDLYENNKKERTTEDMASLTRLYTNKAIDFLEKQRDPEKPFIL 192 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY-ASVYSVDQGVKRILEQLKKNGQYDN 337 YLA+ H D A ++++ +T DN Y A+V +D R+L +L + N Sbjct: 193 YLAHTMTHTVVD--ASPKFKE------KTGDNLYRAAVEELDYETGRLLNKLNQLNLSKN 244 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKG-----------------YKSQTYPGGTHTPMFMWW 380 T++++TSDNG + P +NG K K+ + GG H P M W Sbjct: 245 TLVIYTSDNGP-WNQPKYINGGAKNDHPENSIFWGDAGEFRDGKASIWEGGAHVPCVMRW 303 Query: 381 KGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 GK+ G D L++ +DF PT IP + +DGV+ L ++ K + Sbjct: 304 PGKIAAGKTNDGLMATIDFLPTLAAVTGAKIPDERVIDGVNQLGFICGKSE 354 >UniRef50_A6CAR8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Planctomycetaceae RepID=A6CAR8_9PLAN Length = 501 Score = 132 bits (331), Expect = 4e-29, Method: Compositional matrix adjust. Identities = 123/448 (27%), Positives = 180/448 (40%), Gaps = 121/448 (27%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII++ DD GY L GSF + E++ TP L L Sbjct: 38 PNIIMIVSDDQGYRDL----GSFG-----SEEIM-----------------TPHLDRLAK 71 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQD----------------- 158 EG + T+ YV PSR +++TGR P R G+Y +A D Sbjct: 72 EGAKLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTFERI 131 Query: 159 -GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 G+ + E LP L + GY +A GKW L H F P Sbjct: 132 GGMDVREKLLPALLKPAGYVSAIYGKWDLG---------------IHKRFL--------P 168 Query: 218 QNRGFDYFMGFHAAGTAY-----YNSPSLFKNRERVPA-KG-YISDQLTDEAIGVVDRAK 270 RGFD F GF G Y Y PS+++N + KG Y + EA+ + + Sbjct: 169 LARGFDDFYGFTNTGIDYFTHERYGVPSMYRNNQPTEEDKGTYCTYLFQREAVRFI--KE 226 Query: 271 TLDQPFMLYLAYNAPH--------LPNDNPAPDQYQKQF-----------NTG------- 304 +PF LYL +NAPH + AP++Y+ + TG Sbjct: 227 NHQKPFFLYLPFNAPHGASSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRER 286 Query: 305 ------------SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 S+ Y AS+ +D + +L L + DNTI++F SDNG Sbjct: 287 PDGPVIHQGVSASKRRLEYVASITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGGSGGA 346 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIP 411 N KG K + GG P + + K++PG D+L+++++ PT L A I +P Sbjct: 347 D---NSPLKGKKGMMFEGGIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIPLP 403 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 +++ +DG +LP L K P + W Sbjct: 404 ENVVIDGYDMLPVLMGKTT-SPRNEMYW 430 >UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG5_9BACT Length = 462 Score = 132 bits (331), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 111/384 (28%), Positives = 165/384 (42%), Gaps = 82/384 (21%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S + PNI+ + DDLGY L + A +TP + Sbjct: 30 SAEKPPNIVFIFADDLGYNDL--------------------------SSYGATDIATPAI 63 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ--DGIPLTETFLPEL 170 SL ++G+RFT+ Y A V PSRAA++TGR P R G+ Q DGI ET + EL Sbjct: 64 DSLGEQGIRFTDFYSASPVCSPSRAALLTGRYPIRQGITGVFWPQSFDGIDPAETTIAEL 123 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 Q +GY T VGKWHL +H ++ P GF + G Sbjct: 124 LQENGYRTGLVGKWHLG---------------HH--------QKHLPLQNGFHSYFGI-- 158 Query: 231 AGTAYYNSPSLFKNRERVPAKGYISDQ------LTDEAIGVVDRAKTLDQPFMLYLAYNA 284 Y N + + Y DQ T+EA+ +++ K DQPF LYLA++ Sbjct: 159 ---PYSNDMDMVVYMRGNDVESYEVDQHYTTRRYTEEAVQFIEQNK--DQPFFLYLAHSM 213 Query: 285 PHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 PH+P Y + G+ Y + +D V +IL+ L K+ +NT+++FTS Sbjct: 214 PHVP-------IYASENFVGTSKRGLYGDVIQELDWSVAQILDTLDKHQLSENTLVVFTS 266 Query: 345 DNGA------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMD 397 DNG + PL K T+ GG P + W ++ G + + MD Sbjct: 267 DNGPWTALKHLGGSAAPLREG----KMFTFDGGMRVPCLVRWPAQIPAGQTSHAMANMMD 322 Query: 398 FYPTALDAADISIPKDLKLDGVSL 421 ++PT A++ PK +DG+ + Sbjct: 323 WFPTFSRIANLDTPKSRSIDGLDI 346 >UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT Length = 500 Score = 132 bits (331), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 112/408 (27%), Positives = 173/408 (42%), Gaps = 75/408 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+ + DD GYG L A TP L L Sbjct: 28 KPNIVFILADDTGYGDL--------------------------SATGNPILKTPHLDKLY 61 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 + VRFT+ +V+ S P+R+A+MTGR + GV ++ + + ++ ++ GY Sbjct: 62 NAAVRFTDFHVSPTCS-PTRSALMTGRHEFKNGVTHTILERERLNPDAITIAQVLKSAGY 120 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD--YFMGFHAAGTA 234 T GKWHL D D+ QP RGFD + G G Sbjct: 121 TTGIFGKWHLG--------------DEPDH---------QPGQRGFDEVFIHGGGGIGQT 157 Query: 235 Y-----------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 Y Y +P++ N +G+ +D T++AI ++ K QPF Y+ YN Sbjct: 158 YPGSCGDAPGNTYFNPAILHNGSFEKTQGFCTDIFTNQAIHWMESVKG-KQPFFCYIPYN 216 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADN---YYASVYSVDQGVKRILEQLKKNGQYDNTII 340 A H+P PD+Y+K + + D+ Y+ V ++D+ V R+L +L + G +T++ Sbjct: 217 AAHVPVS--CPDEYKKPYE--GKVDDHLATYFGMVANIDENVGRVLAKLDEWGIAKDTLV 272 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYP 400 +F +DNG N +G K + GGT W P + L S +DF+P Sbjct: 273 VFMNDNGGHGPACKVFNAGMRGSKGSAWLGGTRAVSLWRWSDTFAPHDAAGLASNIDFFP 332 Query: 401 TALDAADISIPKDL--KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 T + A + + ++DG SLLP L+D P + L T W Sbjct: 333 TLAELAGATPNEKAQKQVDGRSLLPLLRDGNAPWPERVL--FTHVGRW 378 >UniRef50_A6DQ01 N-acetylgalactosamine-4-sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQ01_9BACT Length = 616 Score = 131 bits (330), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 102/381 (26%), Positives = 164/381 (43%), Gaps = 69/381 (18%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 + KPNII++ DD GYG L + ++ TP + Sbjct: 19 QAKPNIIIVMTDDQGYGDL----------SCHGNPIL----------------KTPQIDE 52 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 + +R TN Y P+R+A+MTGR AR GV+ + + E + + +++ Sbjct: 53 FYKDALRLTN-YHVDPTCAPTRSALMTGRYSARVGVWHTVQGRHLMREREITMANILKDN 111 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY T GKWHL A ++P++RGF + + A G Sbjct: 112 GYATGIFGKWHLG-----------------------DAYPYRPEDRGFTHVVTHGAGGVG 148 Query: 235 ---------YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 Y+N + + N E V +G+ +D DEA + + +PF ++ NAP Sbjct: 149 QVPDYWGNDYFND-TYYVNGEFVKFEGFCTDVWFDEAKKFMKTQISKKKPFFTFITPNAP 207 Query: 286 HLPNDNPAPDQYQKQFN---TGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 H P AP +Y +N + ++ + ++D + E LK G DNT+++F Sbjct: 208 HGPMR--APQKYLDMYNQTKVKGTKLEAFFGMITNIDDNFGELREFLKDEGVADNTLLIF 265 Query: 343 TSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP-MFMWWKGKLQPG-NYDKLISAMDFYP 400 T+DNG+ G N G K+ + GG P +F W KG L G + D+L + MD P Sbjct: 266 TTDNGSS-SGIGVYNAGMTGAKNSNFDGGHRVPFIFTWPKGNLMGGRDIDQLTAHMDILP 324 Query: 401 TALDAADISIPKDLKLDGVSL 421 + ++ + PK + DG SL Sbjct: 325 SFIEMFGLKAPK-IDFDGTSL 344 >UniRef50_B7RWW8 Sulfatase, putative n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RWW8_9GAMM Length = 486 Score = 131 bits (330), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 114/426 (26%), Positives = 182/426 (42%), Gaps = 100/426 (23%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S PNI+ + DD+GYG + G +P + +TP + Sbjct: 56 SVNTAPNILFILYDDMGYGDI--GAGETNPDVI----------------------ATPNI 91 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------------YSNTDAQDG 159 +L G+ ++ + V PSRA +TGR R G+ S+ + G Sbjct: 92 DALAAAGLVLSDFHSPAPVCTPSRAGYLTGRLAPRAGLPDVVFPSGSTKAFISSLLLKSG 151 Query: 160 ----IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 +P E + E+ + GY T VGKWHL + Sbjct: 152 SPVRLPAEEITVAEVLRAAGYRTGMVGKWHLGD-----------------------SRPS 188 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSP----SLFKNRE-RVPA---KGYISDQLTDEAIGVVD 267 P + GF+++ G A Y++ +L++NR VPA + Y+S++ T+EA+ + Sbjct: 189 LPNDLGFEHYYG------ALYSNDMEPFALYRNRVVEVPAPVDQSYLSERYTEEALAFL- 241 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILE 327 + D+ F LY A+N PH P + + G+ Y + +D GV ++E Sbjct: 242 --RASDERFFLYFAHNFPHDP-------LHSRDGRLGTSDGGLYGDVLEEIDDGVGILVE 292 Query: 328 QLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 +L+ +G+ DNT+I+ TSDNG G G Q+G K T+ GG P W ++ G Sbjct: 293 ELRYSGKLDNTLIIITSDNGPWFLGN---AGDQRGRKGNTFEGGMRVPFIAHWPAEIPQG 349 Query: 388 NYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 + ++ +D PT LD + P D LDG S+LP L K PH+ Y H+ Sbjct: 350 RSEPAMAMGIDLLPTVLDILALPAPNDRILDGRSMLPTLT-KGAASPHQ-------YLHY 401 Query: 447 FDEENI 452 +D E + Sbjct: 402 YDGETL 407 >UniRef50_Q0SBH5 Arylsulfatase n=7 Tax=Bacteria RepID=Q0SBH5_RHOSR Length = 790 Score = 131 bits (329), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 136/509 (26%), Positives = 195/509 (38%), Gaps = 164/509 (32%) Query: 47 FTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQ 106 + P + G PNI+V+ +DD+GY D G F ++ Sbjct: 43 WAPETKAPAGAPNIVVILIDDMGYS----DIGPF-----------------------GSE 75 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET- 165 TP L L D G R +N Y V P+RAA++TG P R G S + G P Sbjct: 76 IDTPNLNRLADSGYRLSN-YHTTSVCSPARAALLTGLNPHRAGYGSVANFDPGFPGLRME 134 Query: 166 ------FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 L E+ + +GY T AVGKWHL++ +N+ +TRD W P Sbjct: 135 LADDALSLAEILRANGYATHAVGKWHLARDTNL---APGRTRD-----------SW-PLQ 179 Query: 220 RGFDYFMGFHAAGTAYYNSPSLFKNR-----ERVPAKGYISDQLTDEAIGVVD--RAKTL 272 RGFD + G ++Y L + E P+ Y++D +TD+A+ + RA Sbjct: 180 RGFDSYYGSLEGLNSFYYPNELISDNSVVDVEEYPSDYYVTDDITDKAVSRIKSLRAHDA 239 Query: 273 DQPFMLYLAYNAPHLPNDNPAPD--QYQKQ---------------------FNTGSQTAD 309 D+PF LY ++ A H P+ D +Y+ + F G+Q A+ Sbjct: 240 DKPFFLYFSHIAMHGPHQAKPEDLAKYRGRYTEGWDAVRRSRFDAQLEAGFFPPGTQMAE 299 Query: 310 N----------------------------YYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 Y A V S+DQ V R+L+ L + G+ DNTII+ Sbjct: 300 RNTEPGYDAPPWDELTEEEQLRFARYQEVYAAMVDSIDQSVGRVLDTLDELGETDNTIIV 359 Query: 342 FTSDN---------------GAVIDGPLP------------LNGAQK------------- 361 FTSDN + GP+P L G++K Sbjct: 360 FTSDNGGTAEGGAAGTRSYFSQFVHGPVPSDWVRDVPHDEELIGSEKIGVHYPRGWGQAS 419 Query: 362 -----GYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-----AMDFYPTALDAADISIP 411 YK QT+ GG P + W L + D + D PT LD A + P Sbjct: 420 NTPFRFYKGQTFAGGVRVPFLLSWPAGLGRADGDTGLRKQYSYVTDITPTLLDLAGLETP 479 Query: 412 KDL------KLDGVSLLPWLQDKKQGEPH 434 + DGVS+ L+D Q PH Sbjct: 480 SHRNGLPAQERDGVSIAEVLRDAAQPTPH 508 >UniRef50_UPI0001C3580F sulfatase n=2 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C3580F Length = 471 Score = 131 bits (329), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 96/357 (26%), Positives = 172/357 (48%), Gaps = 60/357 (16%) Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 D ++ ++ + Q TP + +E V TN Y + P RA++MTG+ P R G+ Sbjct: 10 ADQWRAEAVGSLGSDQVVTPNIDRFSEESVCCTNAYSTFPLCSPHRASLMTGKYPFRLGM 69 Query: 151 YSNTDA--QDGIPLT--ETFLPELFQNHGYYTAAVGKWHL--SKISNVPVPEDKQTRDYH 204 ++N ++ I L ET + + ++ G+ T +GKWHL S+++ P P+ Sbjct: 70 WTNCKIGLEEKIMLKPQETCIANVLKDAGFATGYIGKWHLDASELNFSPHPKS------- 122 Query: 205 DNFTTFSAEEWQP------QNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQL 258 A EW + +GFDYF+ + A + P + + E G S + Sbjct: 123 ------GAGEWDAYTPPGERRQGFDYFLSYGACDD--HLDPHYWLDDETQIKPGKWSAEF 174 Query: 259 -TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT-------------- 303 TD+AI +++ K ++PF L+++YN PHLP + P++Y ++F Sbjct: 175 ETDKAIEYMNQKKDGEEPFALFVSYNPPHLPYEL-VPERYYEKFKNLKVHYRPNVPESMR 233 Query: 304 ------GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLN 357 +QT Y+A+V+ +D+ RIL LK+NG + T+++ ++D+G ++ Sbjct: 234 EEGGLLETQTR-QYFAAVHGIDEQFGRILAWLKENGMEEKTLVVLSADHGEML------- 285 Query: 358 GAQKGYKSQT--YPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPK 412 G S+ Y H P+ KG+L+PG D + ++ D PT L+ D+++P+ Sbjct: 286 -GSHGLMSKNIWYDEALHIPLIFRQKGRLKPGKNDVIFASPDHMPTLLELLDLAVPE 341 >UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN Length = 527 Score = 131 bits (329), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 137/544 (25%), Positives = 226/544 (41%), Gaps = 113/544 (20%) Query: 44 FSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIE 103 FS T PNII + DD+GYG + R + KI Sbjct: 11 FSQNTAHASEKANDPNIIYILADDMGYGDI--------------RALNPECKI------- 49 Query: 104 AAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR----APARFGVYSNTDAQDG 159 +TP L L G+ FT+ + + V P+R ++TGR + + GV + Sbjct: 50 ----ATPHLDQLAHGGMIFTDAHSSSSVCTPTRYGVLTGRYNWRSRLKSGVLWGLSRRLI 105 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVP----VPEDKQTRDYHDNFTTFSAEEW 215 P ET +P + + HGYYTA VGKWHL ++ E + + + ++ Sbjct: 106 EPDRET-VPSMLKEHGYYTACVGKWHLGMDWSLKQGGFATEQSYNKKTNPGWDVDYSKPI 164 Query: 216 Q--PQNRGFDYFMGFHAAGTAYYNSPSLFKNRER---VPA--KGYISD------------ 256 Q P + GFDYF G A+ P ++ +R +P K + D Sbjct: 165 QNGPNSVGFDYFFGISAS---LDMPPYVYIENDRSQGIPTVTKAFFRDGPAHKDFEAIDV 221 Query: 257 --QLTDEAIGVVDR---AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 ++TD+ + ++D A +PF +Y NAPH P P P ++Q G + Y Sbjct: 222 LPRITDKTVQIIDEHAAASKEGKPFFIYFPLNAPHTPI-LPTP-EWQ-----GKSGINAY 274 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--------VIDGPLPLNGAQKGY 363 V VD V ++++ LKK G ++NT+++FT+DNG + D + +G+ Sbjct: 275 CDFVMQVDDTVGQVMQALKKQGIHENTLVIFTADNGCSPAANFKEMTDKDHQPSYQFRGH 334 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K+ Y GG P W +++ G + D+L D + TA D +P D D VS+L Sbjct: 335 KADIYEGGHRVPFIANWPARIKAGTHSDQLTCLTDLFATAADIVGAKVPDDAGEDSVSIL 394 Query: 423 PWLQDK-----KQGEPHKNLTWITSYS--HWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 P ++ ++ H ++ S HW E P + +P P Sbjct: 395 PAMEGTAHTPLREAAVHHSIRGAFSIRKDHW-KLELCPGSGGW---------SFP-KPGK 443 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 ++LS+ + LY L D ++ N+ A +P+VVKE+ +++ + Sbjct: 444 DNLSELP-----------------AIQLYDLNHDAGEQKNVQAEHPEVVKELTTLLQSYA 486 Query: 535 DSSQ 538 D + Sbjct: 487 DRGR 490 >UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186ED10 Length = 570 Score = 130 bits (328), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 143/550 (26%), Positives = 223/550 (40%), Gaps = 115/550 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNII++ DDLG+ + F + Q TP + +L Sbjct: 46 RPNIIIILADDLGWNDVSFHGSN--------------------------QIQTPNIDALA 79 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPELFQN 173 G+ + YV + PSRA++MTG+ P G+ G+PL ET +PE F Sbjct: 80 YNGIILNSHYVP-ALCTPSRASLMTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNK 138 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 +GY T AVGKWHL F +E+ P RGFD G Sbjct: 139 NGYATHAVGKWHLG----------------------FFKKEYTPIYRGFDSHFGHWNGFQ 176 Query: 234 AYYNSPSLFKNRERVPAKG-----------YISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 YY+ ++ + + + Y +D T EAI ++D + P LYL++ Sbjct: 177 DYYDHTTMSDSLKGYDMRRNFEVDYSYQGMYTTDVFTKEAIKIIDNHNSQKGPLFLYLSH 236 Query: 283 NAPHLPN-DNP--AP-DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 APH N DNP AP D+ K Y A V +D+ V +++ L+KN +N+ Sbjct: 237 LAPHSGNPDNPFQAPEDEISKHECINDPGRKIYAAMVTKLDESVGQVVSALEKNKMLNNS 296 Query: 339 IILFTSDNGAVIDGPLPLNGAQ---KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LIS 394 II+F SDNGA G G+ +G K + GG +W + K L+ Sbjct: 297 IIIFMSDNGAATYGLHSNRGSNYPLRGLKESPWEGGVRGTAAIWSPFLNKTKRVSKQLMH 356 Query: 395 AMDFYPTALDAADISIPKDL---KLDGVSLLPWLQD---KKQGEPHKNLTWITSYS---- 444 D+ PT L AA ++ K+DG+ + L + + E N I +YS Sbjct: 357 MSDWLPTLLTAAGLNYSSTQLINKIDGIDMWNVLSNDLPSPRKEVFNNYDEIENYSSLMI 416 Query: 445 ----------------HWFDEENIPFWDNYHKF---------VRHQSDDYPHNP---NTE 476 +WF+E P +N ++ +R S NP ++ Sbjct: 417 DSWKYVEGTAQEGKADYWFEE---PSRNNCSEYRVSNEDIFRLRRDSTIICDNPTFSSSL 473 Query: 477 DLSQFSYTVRNNDYSLVYTVEN--NQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREF 533 +++ ++T N V T + + L+ L D ++ NLA P VVK ++ + E Sbjct: 474 SITRNNHTDVKNKTKYVLTCDPLLKRFCLFNLNDDPCERLNLADVFPDVVKRIKNRLLEL 533 Query: 534 IDSSQPPLSE 543 S PL++ Sbjct: 534 KKSVVKPLNK 543 >UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhodopirellula baltica RepID=Q7ULE7_RHOBA Length = 1049 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 124/409 (30%), Positives = 172/409 (42%), Gaps = 62/409 (15%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN++V+ DD G+ L + +N EV D TP + L Sbjct: 581 KPNVVVILTDDQGWADL----------SCQN-EVDDI--------------QTPHIDGLA 615 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 GVR TN YV PSRA ++TGR R G+ + D +P + E Q GY Sbjct: 616 ARGVRCTNAYVTAPQCSPSRAGLITGRYQQRLGIDTIPDMP--LPTNAVTIAEHLQPKGY 673 Query: 177 YTAAVGKWHLSK--------ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD-YFMG 227 T VGKWHL +P K R + E + P +GFD Y+ G Sbjct: 674 KTGFVGKWHLEPNVTCIDWMRRELPAMAGKPRRKVRIPWNKI--EPYSPSQQGFDEYYWG 731 Query: 228 FHAAGTAYYN--SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 ++ S L + + + + D T+ A+ + R DQPF L L Y P Sbjct: 732 ERTNYRTNFDLTSGELLAEMKPIRDERFRIDVQTNAAVKFIQRNH--DQPFYLQLNYYGP 789 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNY-YASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 H P + A +Y +F Y A + ++D GV +I++QLK G DNT+I+ TS Sbjct: 790 HTPLE--ATQKYLDRFPGPMPERRRYALAMISAIDDGVGQIVDQLKAEGVLDNTLIVMTS 847 Query: 345 DNGAVI-----DGPL---------PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NY 389 DNGA + D P+ LN G K GG PM +L G Y Sbjct: 848 DNGAPLKMTKTDSPINGDAGGWDGSLNDPWVGEKGMLSEGGIRVPMIWSLPTQLPSGITY 907 Query: 390 DKLISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEPHKNL 437 D +SA+D P+ L A +P D DG+ L+P L D Q P + L Sbjct: 908 DWPVSALDIAPSVLKLAGGELPSGDAAFDGIDLIPRLND-IQNPPTRTL 955 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 91/385 (23%), Positives = 143/385 (37%), Gaps = 96/385 (24%) Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET 165 Q TP L L G+ FTN + P R+A+ TGRAP + G+Y N + + Sbjct: 53 QTITPNLDRLAASGILFTNAHCPAPACNPCRSAVFTGRAPNQSGLYDNRQQMREVMPDDV 112 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LP+ +NHGY+ + GK H S +E+ P+ + F Sbjct: 113 ILPQYMRNHGYHASGSGKL------------------LHYFIDAASWDEYFPKAESENPF 154 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAKGYISD----QLTDEAIG---VVDR------AKTL 272 +Y S + P + +D +TDE G V + + Sbjct: 155 ------PQTFYPSQRPVNLKRGGPWQYVETDWAALDVTDEEFGGDWAVSQWIGEQLQQKH 208 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQF--------------------NTGSQTADN-Y 311 DQPF L PH P P +Y + F G + A N Y Sbjct: 209 DQPFFLGCGIYRPHEPWF--VPKKYFEPFPLDSIQLPPGYLENDLDDVPPIGQRAARNRY 266 Query: 312 YASVYSVD---QGVK--------------RILEQLKKNGQYDNTIILFTSDNGAVIDGPL 354 +A + D QG++ R+L+ L+ DNTI++ SD+G + Sbjct: 267 FAHIQKQDQWKQGIQGYLASIHFADAMLGRLLDALESGPNADNTIVVLWSDHGWQL---- 322 Query: 355 PLNGAQKGYKSQT-YPGGTHTPMFMWWKGKLQP---------GNYDKLISAMDFYPTALD 404 G ++ ++ T + G T P+ + P D ++ + +PT LD Sbjct: 323 ---GEKEHWQKYTPWRGVTRVPLMIRVPKTSSPSLPNGTPIGARCDAPVNLLSLFPTVLD 379 Query: 405 AADISIPKDLKLDGVSLLPWLQDKK 429 +P + DG SLLP L++ K Sbjct: 380 LC--QLPSNPVNDGPSLLPLLKEPK 402 >UniRef50_A6DU78 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DU78_9BACT Length = 466 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 130/508 (25%), Positives = 200/508 (39%), Gaps = 128/508 (25%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K +PNII L +D+ GY F+ M N+ ++ TP + Sbjct: 19 KKQPNIIYLMLDEWGY---------FESSHMNNKYLI-----------------TPNIDQ 52 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 EG+RFTN Y GP+RA ++TG+ + +N D I ET L + + Sbjct: 53 FATEGMRFTNAYAGAPTCGPTRAVLLTGKHMGHTSMRTN-DGYSAIRADETTLGSMLKKK 111 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY T GKW + VPE N GFD F G++ A Sbjct: 112 GYVTGGFGKWGVGARGTSGVPE----------------------NHGFDQFFGYYDQVHA 149 Query: 235 YYNSPS-LFKNRERVPAKG-----------YISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 + P L N + P KG + + + DE+I + + K D PF YL + Sbjct: 150 HTYFPEYLIHNSKEFPLKGNSNKDRYNGETHAQNVIFDESIKFIKKNK--DVPFFCYLPW 207 Query: 283 NAPH----LPNDNPAPDQYQKQFNTGSQTAD----NYYASVYSVDQGVKRILEQLKKNGQ 334 PH + D+P+ ++ + T Q+ D Y A ++ VD+ + I LK+ Sbjct: 208 TPPHGHWGIKKDDPSWQLFKDRPWTAGQSRDTDSRGYAAFMHMVDRQIGEIASLLKELDI 267 Query: 335 YDNTIILFTSDNGA-----VIDGPL--------PLNGAQ-KGYKSQTYPGGTHTPMFMWW 380 DNT+ DNG D P P G + + K Y GG PM++ W Sbjct: 268 DDNTVFFLCGDNGGSDYFKTKDHPHGFFAPNLNPETGERFRAGKRSLYEGGLKVPMYVRW 327 Query: 381 KGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 GK++ + D L D PT D A P + DG+++LP L K + HK + W Sbjct: 328 PGKVKASSVSDHLFYFPDIMPTLADIAKTDPP---ETDGLTILPTLLSKDGQKNHKFMYW 384 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 ++ R + +R + + L T + + Sbjct: 385 --------------------EYYRQTA------------------LRMDQWKLYRTDKKS 406 Query: 500 QLGLYKLT-DLQQKDNLAAANPQVVKEM 526 LY L+ D+Q+ N+A +PQV++EM Sbjct: 407 SWELYDLSKDIQELHNIAKDHPQVLQEM 434 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 130 bits (328), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 108/398 (27%), Positives = 177/398 (44%), Gaps = 84/398 (21%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K PN+I++ DD+GYG D G + N+ + TP L + Sbjct: 16 KQPPNVILIVADDMGYG----DVGCYG-----NQHI-----------------KTPNLDA 49 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ-----DGIPLTETFLPE 169 L +G RFT+ + + P+RAA++TG R G++ Q + L E E Sbjct: 50 LAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIPKDQRYAMAKAMSLEEITFAE 109 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF- 228 ++ GY TA VGKWHL D+ + P +GFD + G Sbjct: 110 ALKSVGYSTALVGKWHLG---------DRPA--------------FLPPRQGFDEYFGIP 146 Query: 229 -----HAAGTAYYNSPSLFKNRERV---PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 H ++ P L + E V P +++ T+EA+ + + K D+PF+LY+ Sbjct: 147 YSHDMHPWRKSFPPLP-LMRGEEIVELNPDLDHLTQYCTEEAVKFISKNK--DRPFLLYM 203 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADN----------YYASVYSVDQGVKRILEQLK 330 + PH P +++ K+F+ A Y A++ +D V I++ ++ Sbjct: 204 PHPMPHQPVH--VSERFAKRFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVR 261 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NY 389 G ++T + FTSDNG I PL +G K + + GG P +W+ K++PG Sbjct: 262 ALGIEESTFVAFTSDNGPAIGSAGPL----RGKKRELWEGGHRVPFIAYWQEKIRPGVVI 317 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 D++ +MD +PT +P+ K+DGV+LLP L + Sbjct: 318 DEIAMSMDLFPTMAAMGRAPLPRK-KIDGVNLLPLLCE 354 >UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR02_9SPHI Length = 595 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 108/384 (28%), Positives = 167/384 (43%), Gaps = 70/384 (18%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K PN+I++ DD G G L + TP + Sbjct: 26 KQAPNVILILTDDQGIGDLGCHGNPW--------------------------LKTPNIDK 59 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 ++ VR T+ +V+ + P+RAAIMTG+ P R G ++ +D + + + ++F++ Sbjct: 60 FYEQSVRLTDFHVSP-LCTPTRAAIMTGQYPIRNGAWATYKGRDALSKGQLTMADVFKSA 118 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY TA GKWHL N PV +P + GFD+ + A G Sbjct: 119 GYSTALFGKWHLG--DNYPV---------------------RPSDSGFDHVVQHLAGGIG 155 Query: 235 --------YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 Y + N + +GY +D EA+ +++ + +QPF +YL NAPH Sbjct: 156 ELSDYWGNSYFDDVYYVNNQPKQFQGYCTDVWFSEAMKFINQQEK-EQPFFIYLPLNAPH 214 Query: 287 LP--NDNPAPDQYQKQFNTGSQTAD-NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 P D Y+K GS+ D N Y + ++D+ + + LKK G NTI+++ Sbjct: 215 DPLIVDEKYAAPYKK--FEGSEIIDANLYGMIANIDENFGKFRKFLKKKGLDKNTILIYM 272 Query: 344 SDNGAVI----DGPLPLNGAQKGYKSQTYPGGTHTPMFM-WWKGKLQPGNYDKLISA-MD 397 SDNG DG L N KG K + GG P F+ W G ++ G + +SA +D Sbjct: 273 SDNGTRFGYSRDGKLGYNYHLKGMKGDKFEGGHRVPFFIQWMDGGIEGGKDIRSLSAHVD 332 Query: 398 FYPTALDAADISIPKDLKLDGVSL 421 PT I +PK+ DG+ L Sbjct: 333 LIPTLAKLCGIPLPKNQAFDGIDL 356 >UniRef50_A7VQW1 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VQW1_9CLOT Length = 588 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 108/384 (28%), Positives = 166/384 (43%), Gaps = 74/384 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN++ + DD GYG L N ++ TP + Sbjct: 5 RPNVVFVLTDDQGYGDL---------GCTGNPDI-----------------QTPQIDEFY 38 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 E VR T+ +VA + P+R AI TGR P R GV++ + + ET L E+F+++GY Sbjct: 39 KEAVRLTDYHVAP-LCAPTRGAIFTGRRPLRNGVWATCWGRSILHEGETTLAEVFRDNGY 97 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY- 235 T GKWHL DN+ ++PQ+RGF + G Sbjct: 98 ATGLFGKWHLG-----------------DNYP------YRPQDRGFTEVVAHKGGGVGQT 134 Query: 236 -------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 Y S ++N + +GY +D D A ++ LD+PF + NAPH P Sbjct: 135 PDFWGNNYFEDSYYQNGKLTRYEGYCTDVWFDAAERFIE--SHLDEPFFACITTNAPHEP 192 Query: 289 NDNPAPDQYQKQFNTGSQTAD-NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN- 346 ++Y + +Y + ++D R+ ++L G DNT+++F +DN Sbjct: 193 --YLVEEKYAAPYRENENIVHPEFYGMISNIDLNFGRLRKKLSDWGIEDNTVLIFMTDNG 250 Query: 347 ---GAVIDGPLPL----NGAQKGYKSQTYPGGTHTPMFMWW-KGKLQPG-NYDKLISAMD 397 G IDG + N +G K+ Y GG P F+ W G L G + + +D Sbjct: 251 TSGGCEIDGNEHVLRGYNAGMRGMKTSYYDGGHRVPFFIRWPNGGLDGGRDVEDTSYHVD 310 Query: 398 FYPTALDAADISIPKDLKLDGVSL 421 F+PT D +S+P L+LDGVSL Sbjct: 311 FFPTLADLCGLSMPP-LQLDGVSL 333 >UniRef50_B4GZS4 GL22855 n=1 Tax=Drosophila persimilis RepID=B4GZS4_DROPE Length = 559 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 114/408 (27%), Positives = 177/408 (43%), Gaps = 90/408 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E KPNII++ +DD+G+ + F + Q TP Sbjct: 21 EGEASAKPNIIIILIDDMGFNDVSFHGSN--------------------------QILTP 54 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFL 167 + +L G+ YV + + PSRA ++TG+ P G+ TD G+P E + Sbjct: 55 NIDALAYNGILLNRHYVPN-LCTPSRATLLTGKYPIHTGMQHFVIVTDEPWGLPRQERLM 113 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 PELF++ GY T VGKWHL F ++ P RGFD+ G Sbjct: 114 PELFRDAGYATHLVGKWHLG----------------------FWRKDLTPTMRGFDHHFG 151 Query: 228 FHAAGTAYYN----------SPSLFKNRERVPAK----GYISDQLTDEAIGVVDRAKTLD 273 ++ YY+ S L R+ P + Y ++ T EA V++R Sbjct: 152 YYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKVIERHDK-S 210 Query: 274 QPFMLYLAYNAPHLPN-DNP--APDQYQKQFNTGSQTADNYYAS-VYSVDQGVKRILEQL 329 +P + L++ A H N DNP AP++ +F YA + S+D+ V + + L Sbjct: 211 RPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSVGQTMRAL 270 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389 NG +N+I+L SDNGA G G+ ++ Q +G + Sbjct: 271 ADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRGQ--------------RGYVS---- 312 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 ++ I A+D+ PT AA +S+P DL+LDG++L P L Q +P +NL Sbjct: 313 NQAIHAIDWLPTLAAAAGVSLPSDLRLDGLNLWPSLSASAQPQP-RNL 359 >UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S8Q2_NEMVE Length = 540 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 125/501 (24%), Positives = 211/501 (42%), Gaps = 75/501 (14%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S G P+I+ + MDDLG+ + + I A++ TP + Sbjct: 30 SMAGPPHIMFILMDDLGWSDVGYHN--------------------ISHAVK-----TPNI 64 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPE 169 L +GV+ + Y + + PSR A+MTG+ P G+ N + G+P +P+ Sbjct: 65 DKLASQGVKLMS-YYSQPMCTPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQ 123 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 + GY T+ +GKWHL F ++ P RGFD F+GF Sbjct: 124 KLRTLGYRTSMIGKWHLG----------------------FFDWDYTPLRRGFDSFLGFF 161 Query: 230 AAGTAYYNSPSL----FKNRERVPAKGY----ISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A ++ + F+ R+ PA Y +D T EAI + R QP L L+ Sbjct: 162 AGEQDHWRHSKMGFLDFR-RDEEPANEYGGQHSTDVFTQEAINIAMRHNA-SQPLFLLLS 219 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 Y A H P P+ K + NY + + D + R+++ K+NG ++NT+++ Sbjct: 220 YAAVHTPL-QAHPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGLWNNTLMI 278 Query: 342 FTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL---QPGNYDKLISAMDF 398 + SDNGA N +GYKS + GG P F+ G++ + G + L D+ Sbjct: 279 WASDNGAQPGKGGGYNWPLRGYKSSLFEGGVRVPAFV--HGEMLQRKGGTVNDLFHVTDW 336 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 YPT + A + D +DGV P L + K + + L I ++ +E P NY Sbjct: 337 YPTLVKLAGGEVEPD--IDGVDQWPTLSEGKPSKREEILHNIDIPANQEEERMAPRGFNY 394 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN-----QLGLYKLT-DLQQK 512 + + D + + +V + + +L LY +T D +++ Sbjct: 395 YSGAALRRGHMKLVYKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITADPEER 454 Query: 513 DNLAAANPQVVKEMQGVVREF 533 ++L+ NP +V + ++E Sbjct: 455 NDLSKLNPDIVDSLWRRLQEL 475 >UniRef50_Q9VVM4 CG7402 n=10 Tax=Drosophila RepID=Q9VVM4_DROME Length = 579 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 115/408 (28%), Positives = 183/408 (44%), Gaps = 79/408 (19%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 YSTK PNI+++ +DD+G + F + Q TP Sbjct: 24 YSTK--PNIVIILIDDMGMNDVSFHGSN--------------------------QILTPN 55 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLP 168 + +L G+ YV + + PSRA ++TG+ P G+ TD G+P E +P Sbjct: 56 IDALAYNGILLNKHYVPN-LCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMP 114 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 E+F++ GY T VGKWHL F ++ P RGFD+ G+ Sbjct: 115 EIFRDAGYSTHLVGKWHLG----------------------FWRKDLTPTMRGFDHHFGY 152 Query: 229 HAAGTAYYNSPSLFKNR------------ERVP-AKG-YISDQLTDEAIGVVDRAKTLDQ 274 + YY+ +R E P A G Y ++ T EA ++++ + Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQHDK-SK 211 Query: 275 PFMLYLAYNAPHLPN-DNP--APDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLK 330 P + L++ A H N D+P AP++ +F + Y + S+D+ V + + LK Sbjct: 212 PLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALK 271 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQ---KGYKSQTYPGGTHTPMFMWWKGKLQPG 387 NG +N+IIL SDNGA G G+ +G K + GG + +W L+ Sbjct: 272 DNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALW-SPLLKER 330 Query: 388 NY--DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 Y ++ I A+D+ PT AA +S+P+DL LDG++L P L ++ +P Sbjct: 331 GYVSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKP 378 >UniRef50_Q9VVM1 CG7408 n=8 Tax=Sophophora RepID=Q9VVM1_DROME Length = 585 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 115/397 (28%), Positives = 171/397 (43%), Gaps = 79/397 (19%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 +T KPNII++ DDLG+ + F +GS + TP + Sbjct: 30 ATSDKPNIIIIMADDLGFDDVSF-RGSNN-------------------------FLTPNI 63 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPE 169 +L GV N YVA + PSRAA++TG+ P G+ D G+PL ET + E Sbjct: 64 DALAYSGVILNNLYVA-PMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAE 122 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 +F+ +GY T+ +GKWHL S + P RGFD +G+ Sbjct: 123 IFRENGYRTSLLGKWHLG----------------------LSQRNFTPTERGFDRHLGYL 160 Query: 230 AAGTAYYNSP---------------SLFKNRERVPAKGYISDQLTDEAIGVVDR--AKTL 272 A YY SL + V Y++D LTD A+ ++ +K Sbjct: 161 GAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHV--GHYVTDLLTDAAVKEIEDHGSKNS 218 Query: 273 DQPFMLYLAYNAPHLPNDN---PAPDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQ 328 QP L L + APH ND+ AP + +F S YYA++ S +D+ V +++ Sbjct: 219 SQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDA 278 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ---KGYKSQTYPGGTHTPMFMWWKGKLQ 385 L + N+IILF SDNG G + +G K+ + G + +W + Sbjct: 279 LARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWSTEFER 338 Query: 386 PGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 G+ + + I D PT AA IS L LDG++L Sbjct: 339 LGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNL 375 >UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA9_RHOBA Length = 474 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 107/398 (26%), Positives = 174/398 (43%), Gaps = 54/398 (13%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E + PN+I+L DD G+G + F+ EVV TP Sbjct: 26 ETTDTNSPNVILLMSDDQGWGDVGFN----------GNEVV----------------QTP 59 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 L ++ GVRF Y A + P+R + +TGR P RFG+ + G+ + E + E+ Sbjct: 60 NLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAAHTG--GMRVGEITIAEM 117 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY-----HDNF-----TTFSAEEWQPQNR 220 Q GY T GKWH+ + P++ TR + H F TT + W P Sbjct: 118 LQKRGYATGMFGKWHIGWVK----PDEVSTRGFYSPPSHHGFDEYFATTSAVPTWDPTIT 173 Query: 221 GFDYFMGFHAAGTAYYNS-PSLFKNRE-RVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 D+ + G + P + RE + G S + D I ++ + +PF Sbjct: 174 PQDWDSWGNGPGEPWKGGFPYVHNGREAKENLSGDDSRVIMDRVIPFIEANQA--KPFFA 231 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 + ++APH P A ++++K + NYY + ++DQ V R+ +L++ G NT Sbjct: 232 TVWFHAPHEPV--VAGEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKLRELGIEKNT 289 Query: 339 IILFTSDNG---AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-IS 394 ++ F SDNG + + G KG+K Y GG P W G + G ++ S Sbjct: 290 VVFFCSDNGPSDGLAKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPAGTSTEVRCS 349 Query: 395 AMDFYPT-ALDAADISIPKDLK-LDGVSLLPWLQDKKQ 430 +DF PT A D + K + +DG+ L+P ++ + + Sbjct: 350 TVDFLPTVASIVGDSMVQKATRPIDGIDLMPLIRGEAK 387 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 137/518 (26%), Positives = 217/518 (41%), Gaps = 126/518 (24%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 SD PT KPNI+ + +DD+G D G + A Sbjct: 25 SDAAPT------KPNIVFILIDDMGCK----DAGCYG----------------------A 52 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----- 159 STP + L ++G+RFT+ Y A V P+RA++MTG+ PAR + +N Q G Sbjct: 53 TNFSTPHIDRLANQGMRFTDAYAA-PVCSPTRASLMTGKHPARLHL-TNFIPQIGRQLPA 110 Query: 160 -----------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 +PL E + + GY A +GKWHL + Sbjct: 111 GKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEH------------------ 152 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKG--------YISDQLTD 260 E++PQNRGFD + G Y P F ++++ P G Y+ D+LTD Sbjct: 153 ---GPEYRPQNRGFDRVVLSEHHGIFNYFYP--FVDQQKWPYAGPLPGNPGDYLPDRLTD 207 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFN-TGSQTADNYYASVY-SV 318 EAI V + ++PF LYL++ + H AP+ ++ G + YA++ +V Sbjct: 208 EAIDFVRENR--ERPFFLYLSHWSVH--GRYFAPESLIAKYRERGLEERPAIYAAMMETV 263 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGA-VIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 D V R++ L + DNT+ +F SDNG I PL G+ K Y GG P+ Sbjct: 264 DNSVGRLMATLDELNLADNTLFVFMSDNGGERITSMAPLRGS----KGSLYEGGVRVPLI 319 Query: 378 MWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 + + G ++P + + + D +PT LD A+ S +D KLDG S+ L ++ Sbjct: 320 VRYPGVVKPNTTCSVPVISHDLFPTFLDFAERSY-RDNKLDGHSIAGLLTGEQSELDRDA 378 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W H P+ ++ +R + LV + Sbjct: 379 LYW-------------------------------HFPHYWGSTRPCSAMRQGRWKLVEHL 407 Query: 497 ENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 E + LY L +D ++ +LA PQ E++ ++ ++ Sbjct: 408 ETGRAQLYDLSSDPGEQRDLANEMPQQATELRKMLAQW 445 >UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYD6_RHOBA Length = 889 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 135/543 (24%), Positives = 225/543 (41%), Gaps = 126/543 (23%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 TP ++K +PN++ + DDLG+ DT G K + Sbjct: 258 TPNASASK-RPNVLFILADDLGWS--------------------DTTLFGTTKLYQ---- 292 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-----------A 156 TP + L G+ FT Y + + P+RA+++TG +PAR G+ S T + Sbjct: 293 -TPNIERLAKRGMTFTRAYSSSPLCSPTRASVLTGLSPARHGITSPTCHLPKVVLEPKVS 351 Query: 157 QDGIPLTETFLP--------------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 + G P + +P E+F+++GY T GKWHL Sbjct: 352 ETGPPNKFSTVPESVTRLDTKYYTLAEMFRDNGYATGHFGKWHL---------------- 395 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFH--AAGTAYYNSPSLFKNRERVPA--KGYISDQL 258 E + P GFD + H Y +P FK+ + P ++ D++ Sbjct: 396 --------GPEPYSPLEHGFDVDVPHHPGPGPAGSYVAPWKFKDFDHDPVIPDEHLEDRM 447 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFN-TGSQTADNYYASV 315 EA+ +++ ++PF L + H P D ++Y+ + + Q Y A + Sbjct: 448 AKEAVRFLEQHT--NEPFFLNYWMFSVHAPFDAKKELIEEYRDRVDPKDPQRCPTYAAMI 505 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA----VIDGPLPL-NGAQKGYKSQTYPG 370 S+D + +L+ L + G D TII+F SDNG +DG N +G K+ Y G Sbjct: 506 ESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEVDGTTATSNAPLRGGKATMYEG 565 Query: 371 GTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P + G ++ G+ D +I ++DFYPT L+ I + + DGVS++P L Sbjct: 566 GVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTLLEMLAIDAQPNQRFDGVSIVPAL---- 621 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 QG+P + I +Y +PH+P + S +V D Sbjct: 622 QGKPLQRDA-IFTY-------------------------FPHDPPVPNWMPPSVSVHQGD 655 Query: 490 YSLVYTVENNQLG--LYKL----TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + L+ G YKL DL ++ NLAA +P V++M ++ + + ++ Sbjct: 656 WKLIRIFHGGPNGSHRYKLFNLKNDLGERINLAAKHPDRVQQMDKLIGQHLVETKAVRPL 715 Query: 544 VNQ 546 VN+ Sbjct: 716 VNK 718 >UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW2_9BACT Length = 479 Score = 130 bits (326), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 110/400 (27%), Positives = 168/400 (42%), Gaps = 84/400 (21%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ + DD+G ++D G D + TP L L Sbjct: 27 RPNILFIVADDMG--------------------IMDLGVYGSDYYL------TPNLNKLA 60 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF---------GVYSN-----TDAQDGIPL 162 + +RF Y A V P+R AI+TGR P R +Y N + + L Sbjct: 61 SQSMRFDRAYAASHVCSPTRGAILTGRYPQRIHLTDALPWDRLYKNPKMIPPNHVKELSL 120 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 + Q + Y TA GKWHL + F F+ +E + GF Sbjct: 121 KLPTFARVLQKNDYRTAMFGKWHLGN---------------EERF--FTGKEHKAY--GF 161 Query: 223 DYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 D G AY KG ++LT+ + + K +PFML L + Sbjct: 162 DEAFGVSGKAKAY--------------DKGV--NELTERTLRFLKENK--KKPFMLCLMH 203 Query: 283 NAPHLPNDNP--APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 + PH+P P A Y Q Y + D +K++L+ L+ G DNT++ Sbjct: 204 HVPHVPVACPPYAKALYDSVPKGKHQKNSKYAGMISHFDNSIKKVLDALRALGLDDNTVV 263 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFY 399 + TSDNG + + L N G K Y GGT P+ + W GK+ PG+ +K ++ + DF+ Sbjct: 264 IVTSDNGGLSN--LSSNKPYNGGKGSLYEGGTRVPLLIRWPGKITPGSVNKSVVISNDFF 321 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 PT L+ A + + + LDG S++P L+ K G+ + L W Sbjct: 322 PTFLELAGLPLMPEAHLDGKSMMPLLKGKTLGK--RTLYW 359 >UniRef50_A0Q2E3 N-acetylgalactosamine 6-sulfate sulfatase n=3 Tax=Firmicutes RepID=A0Q2E3_CLONN Length = 483 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 99/357 (27%), Positives = 157/357 (43%), Gaps = 64/357 (17%) Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLP 168 TPTL SL + G+RF N + V P+RA+I TGR P++ G++ D + TE +L Sbjct: 31 TPTLDSLANNGIRFENFFCVSPVCSPARASIYTGRIPSQHGIHDWLDEWNNGYTTEEYLK 90 Query: 169 ------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 ++ +GY A GKWHL +PQN GF Sbjct: 91 GQSTFVDILAKNGYECAMSGKWHLGVAD-------------------------KPQN-GF 124 Query: 223 DYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 Y+ G YY +P ++K+ + + Y++D +TD + +++ + D PF L L Y Sbjct: 125 KYWYSHQKGGGPYYGAP-MYKDGTLIHEERYVTDVMTDYGLEFIEKQRDSDNPFYLSLNY 183 Query: 283 NAPHLP---------------------------NDNPAPDQYQKQFNTGSQTADNYYASV 315 APH P ND + K + + Y+A++ Sbjct: 184 TAPHAPWSPENHPKELLDLYKDCEFKSCPKDGKNDWSIDYIFPKTEDERREVLRGYFAAL 243 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK-SQTYPGGTHT 374 SVD +KR++++LK+ G +NT+I+FTSDNG + G + G G + Sbjct: 244 TSVDNNIKRVIDKLKEMGVLENTLIIFTSDNGMNM-GHHGIFGKGNGTSPVNMFDTSVKI 302 Query: 375 PMFMWWKGKLQPGNYDKLISAMDFYPTALDAADI--SIPKDLKLDGVSLLPWLQDKK 429 P F+ G ++P L+S D PT ++ I I + +KL G S L+ +K Sbjct: 303 PCFITKIGDIKPQVSTDLLSHYDIRPTLMEYLGIEDEIDEGVKLPGRSFASLLRGEK 359 >UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W906_ENT38 Length = 501 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 119/415 (28%), Positives = 180/415 (43%), Gaps = 93/415 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN++++ DDLGYG L Y I K TP + L Sbjct: 35 KPNVVIILADDLGYGDL------------------GIYGHPIVK--------TPNIDKLA 68 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL--TETFLPELFQNH 174 EGVRF+ Y + PSRA ++TGR P R G+ S I L E + ++ Sbjct: 69 QEGVRFSQYYAPAPLCSPSRAGLLTGRTPFRTGIRSWIPTNKNIALGRNEKTIASYLKDQ 128 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM----GFHA 230 GY TA +GKWHL N V D HD + Q ++ GFDY + GF Sbjct: 129 GYDTAMMGKWHL----NAGV-------DRHD--------QPQAEDAGFDYTLVNAAGFVT 169 Query: 231 A-----------GTAYYNSPSLFKNRERVPAKGYISDQ-LTDEAIGVVDRAKTLDQPFML 278 + G Y N ++N + + IS + ++ EAI ++ K ++PF + Sbjct: 170 SDLDKAKERPRNGVVYPNG--FYRNGKALGTVNQISGEFVSQEAINWLNDKKD-NKPFFM 226 Query: 279 YLAYNAPHLPNDNPAP---------DQYQKQ---------FNTGSQTADNYYASVYSVDQ 320 Y+A+ H P +P +Y+KQ + + YYA++ +D+ Sbjct: 227 YVAFTEVHTPLASPKKYLEIYKNYMSEYEKQHPDMFYADWVDKPYRGPGEYYANISYMDE 286 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNG---AQKGYKSQTYPGGT 372 V ++L ++K GQ DNTII+FTSDNG V L + G +G K + GG Sbjct: 287 QVGKVLAKIKSMGQEDNTIIIFTSDNGPVTREARKWYELNMAGETDGLRGRKDNLWEGGI 346 Query: 373 HTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 P + + L G D +S +D PT + ++P D +DG S++P L+ Sbjct: 347 RVPAIIKYGQHLHAGTVTDTPVSGLDILPTLAELTHFNLPTDRIIDGESIVPVLE 401 >UniRef50_D2R323 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R323_9PLAN Length = 631 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 113/419 (26%), Positives = 171/419 (40%), Gaps = 78/419 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+V DD G+G F + STP + S+ Sbjct: 51 QPNIVVFLADDAGWGDYSFSGNT--------------------------NLSTPHIDSIA 84 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGY 176 G +V V P+RA +TGR R GV + Q+ + L+E L + + GY Sbjct: 85 RGGASIDRFFVC-SVCSPTRAEFLTGRYHQRGGVRGVSTGQERLDLSERTLADSLRAAGY 143 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW--QPQNRGFDYFMGFHAAGTA 234 T A GKWH + +W P RGFD + G+ + Sbjct: 144 ATGAFGKWH-------------------------NGSQWPYHPNARGFDEYFGYTSGHWG 178 Query: 235 YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP 294 Y +P L N + +GYI D TD AI ++ +K ++PF Y+ + PH P P+ Sbjct: 179 EYFNPPLEHNGKLNNYEGYIVDICTDRAITFIEASK--NKPFFCYVPFTTPHSPWSVPSA 236 Query: 295 DQYQKQFNTGSQTADNY-----------YASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 D + Q + A N A V + D+ V R+L +L + +NTI+++ Sbjct: 237 DWKRFQDKPLEKRATNLKQEQLDQTRCALAMVENQDRNVGRVLSKLDELKLRENTIVVYF 296 Query: 344 SDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTA 402 SDNG G KG K T GG + ++ W ++ + I+ A+D PT Sbjct: 297 SDNGP---NSARWTGGMKGKKGTTDEGGVRSVCYIQWPKRIAAAQTIQPIAGAIDLLPTL 353 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL--TW-----ITSYSHWFDEENIPF 454 L A + +L LDG L P L ++ P + L TW S +H DE+ + F Sbjct: 354 LSLAGVKHVGELPLDGRDLAPLLTGQQPEWPERLLFTTWAGKVSARSQTHRLDEQGLLF 412 >UniRef50_Q2GB51 Sulfatase n=6 Tax=Proteobacteria RepID=Q2GB51_NOVAD Length = 491 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 110/405 (27%), Positives = 159/405 (39%), Gaps = 77/405 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ + DDLGY L + + E TP L L Sbjct: 55 RPNILYIMADDLGYADL----SCYGRRDFE----------------------TPVLDKLA 88 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT----DAQDGIPLTETFLPELFQ 172 +G+RFTN Y V +R ++TGR R V G+P + LP L Sbjct: 89 AQGLRFTNAYANSAVCTATRVGLITGRYQYRLPVGLEEPLAFRPNIGLPPSHPTLPSLLA 148 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY T+ +GKWHL + ++ P G+ F G + G Sbjct: 149 KAGYRTSLIGKWHLGSL-----------------------PDFDPLKSGYQTFWGIRSGG 185 Query: 233 TAYY------NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 YY P L+ V GY++D L D A+ + A + + P+ + L + APH Sbjct: 186 VDYYTHATSNGQPDLWDGPTPVERAGYLTDLLADRAVSEIREASSGEAPWFMSLHFTAPH 245 Query: 287 LPNDNPAPDQYQKQ----------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYD 336 P + P + F+ +A Y A V +D + R+LE LK N Sbjct: 246 WPWEGPDDASESARIAKLKDPSALFHFDGGSAAIYAAMVRRLDYQIGRVLEALKANRAEQ 305 Query: 337 NTIILFTSDNGA-VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLIS 394 +TI++FTSDNG P + G K++ GG P + W G + G D I Sbjct: 306 DTIVVFTSDNGGERFSDTWPFS----GRKTELLEGGLRIPAIVRWPGVTRAGTTSDAQII 361 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 +MD+ PT L AA + DGV + P L E + L W Sbjct: 362 SMDWLPTFLAAAGSAPDPGHPSDGVDVTPALGGGSLAE--RALFW 404 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 129 bits (324), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 124/499 (24%), Positives = 202/499 (40%), Gaps = 116/499 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI++ +DDLG+ + F F TP + L Sbjct: 29 RPNIVLFFIDDLGWRDVGFMGSDF--------------------------FETPHIDRLA 62 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-------IPLTE----- 164 DE ++FT Y A PSRA +M+G R GVY+ D G IP Sbjct: 63 DESMKFTAAYSAAPNCAPSRACLMSGLYTPRHGVYTVGDPARGNDRYRKLIPAENNRVLD 122 Query: 165 ---TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 T + + GY A+VGKWHL + P ++G Sbjct: 123 DRFTTIADRLSQAGYRCASVGKWHLGQ---------------------------SPLSQG 155 Query: 222 FDYFMGFHAAGT------AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 F + + G+ + Y +P L + +++D+LT A + + P Sbjct: 156 FQVNIAGNQTGSPRGGYFSPYQNPQLSDGEQ----GEFLTDRLTTAACQFIKDNQ--GSP 209 Query: 276 FMLYLAYNAPHLPNDNPAPD--QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 F LYL + A H P D +Q + Y A + S+DQ + R+L+ L++ Sbjct: 210 FFLYLTHYAVHTPLQAKKEDIAYFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLREQQ 269 Query: 334 QYDNTIILFTSDNGAVIDGP----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN- 388 NTI++FTSDNG GP LPL G+ K Y GG P+ + W G QPG+ Sbjct: 270 LDQNTIVVFTSDNGGY--GPATSMLPLRGS----KGMLYEGGIRVPLLIKWPGVTQPGST 323 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 + + +D YPT L+ +I + + LDG SL+P L+D + ++L W Sbjct: 324 TGEAVINVDLYPTFLEMTNIPVLESELLDGESLVPLLKDPQTRLESRSLFW--------- 374 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT- 507 + P + ++ ++ + P + +R D+ L+ E+ LY Sbjct: 375 --HFPAYLQKYQGMQQRFRTTPVS-----------VIRQGDWKLLEFFEDGHQELYNTRL 421 Query: 508 DLQQKDNLAAANPQVVKEM 526 D+ + L+ ++P+ +E+ Sbjct: 422 DIGESKELSGSHPEKTQEL 440 >UniRef50_B0TKJ5 Sulfatase n=2 Tax=Gammaproteobacteria RepID=B0TKJ5_SHEHH Length = 492 Score = 129 bits (323), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 125/441 (28%), Positives = 187/441 (42%), Gaps = 101/441 (22%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 A + L + TNV+ + T T+ KPN+++ +DDLGYG L Sbjct: 7 ASAIVLASLSTNVSAAQTTVTD-----KPNVVIFYVDDLGYGDLA--------------- 46 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 TY I K TP + L EG++FT Y + PSRA ++TGR P R G Sbjct: 47 ---TYGHNIVK--------TPNIDKLAAEGIKFTQYYSPAPLCSPSRAGMLTGRTPYRTG 95 Query: 150 VYSNT-DAQD-GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + S D Q+ I E L + ++ GY TA GK HL N Sbjct: 96 IRSWIPDGQNVHIGKEEITLAHMLKDEGYDTAITGKLHL-------------------NG 136 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKN----------------RERVPA- 250 + Q + GF++ F G N+ + KN R VP Sbjct: 137 GAHMKDHPQASDLGFEH--SFIIPGGWAKNAKTEAKNADGSLRHGKIHVDNFWRNGVPVG 194 Query: 251 --KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA--------------- 293 + +D + +EAIG +D + D+PF LY+ ++ H P +P Sbjct: 195 ETDQFSADLVANEAIGWLDD-QGGDKPFFLYVPFSEVHTPIASPQKYLDMYGDYLTDFAK 253 Query: 294 --PDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 PD + + N + Y+A++ +D + R++++LK G+YDNTIILF+SDNG V Sbjct: 254 ENPDLFHWDWVNQPYRGQGEYFANITYMDAQLGRVIDKLKAMGEYDNTIILFSSDNGPVT 313 Query: 351 ---DGPLPLN-----GAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPT 401 P LN G +G K + GG PM M + G ++ + D+ I +D PT Sbjct: 314 REARKPYELNMAGETGGLRGRKDNLFEGGIRVPMIMKYHGHVKAETDSDEPIYGLDIVPT 373 Query: 402 ALDAADISIPKDLKLDGVSLL 422 + P D +DGVS + Sbjct: 374 LSELIGFDTPSDRTIDGVSFV 394 >UniRef50_A6DUI7 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DUI7_9BACT Length = 516 Score = 129 bits (323), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 138/547 (25%), Positives = 218/547 (39%), Gaps = 148/547 (27%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 K N+I + DDLG+ + F+ F TP L L Sbjct: 30 KLNVIFMIADDLGWMDVGFNGNKF--------------------------VETPNLDKLA 63 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------------------------- 150 EG+ FTNGY + + P+RAA TG++PA G+ Sbjct: 64 SEGMVFTNGYASGPLCSPTRAAFHTGKSPATMGINVPVTKGLKGKTPGAYPMGGDKLKTK 123 Query: 151 -------------YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPED 197 Y+NT GI E + + Q+ Y TA++GKWH+ Sbjct: 124 VGQRDIRHRLLPAYTNT----GIDPQEVTIADCLQSADYVTASIGKWHMG---------- 169 Query: 198 KQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK-----NRERVPAKG 252 + S + P+ G+D + AG Y+ PS F + + P Sbjct: 170 ----------LSHSDPKADPREYGYD----INIAGGDYHGPPSWFSPYRIHSLKNGPKGE 215 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN-- 310 +++++LT EAI ++ K D+PF LYL Y H P+ A ++Y K+F+ QT D+ Sbjct: 216 HLTERLTREAINFMEENK--DKPFFLYLPYYQVHSPHG--AREEYIKKFDH-KQTPDSKM 270 Query: 311 ---YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNG---- 358 Y A V +D+ V I + LKK+G NT+++F+SDNG ++ + LP N Sbjct: 271 NSIYAAMVMHLDESVGLINDYLKKSGLDKNTLLIFSSDNGPLVYQRAGNQVLPRNTRLTF 330 Query: 359 --AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLK 415 +G+K Y GT P G + + + I D Y T + +++P++ K Sbjct: 331 AEPLRGWKGSVYEAGTRVPYIFKLPGVIPANSISQTPIITHDLYATICEFTGVAVPEEQK 390 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 ++G SL P L K + +L W W +I W D P Sbjct: 391 VEGESLFPLLTQSKALQ-RTSLFWHNPKYSWSLNSDI-LW-----------ADRP----- 432 Query: 476 EDLSQFSYTVRNNDYSLVYTVEN---NQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 + +R Y L+Y E L LY L D + NL + P+ E++ + Sbjct: 433 ------ACAIRKGKYKLIYYFERKGERTLELYDLDNDQGETKNLVSDLPEKALELETELL 486 Query: 532 EFIDSSQ 538 ++D +Q Sbjct: 487 AWLDQTQ 493 >UniRef50_Q7UXA8 N-acetylgalactosamine-6-sulfate sulfatase n=2 Tax=Bacteria RepID=Q7UXA8_RHOBA Length = 495 Score = 129 bits (323), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 132/547 (24%), Positives = 216/547 (39%), Gaps = 134/547 (24%) Query: 16 LILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPF 75 + L + + F A + + A + VA S KPNI+ + DD G+G L Sbjct: 23 VCLTTVVILFVLAGATESRCAAAEDTVA--------SSVGKKPNILFIFADDWGWGDLSC 74 Query: 76 DKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPS 135 + TP + L EG F VA GV PS Sbjct: 75 HGHPY--------------------------VRTPNIDRLAREGTDFERFTVASGVCSPS 108 Query: 136 RAAIMTGRAPARFGV-----YSNTDAQDGIP----LTETFLPELFQNHGYYTAAVGKWHL 186 R A+MTG PAR + + ++A+ +P + LP L Q+ GY TA GKWHL Sbjct: 109 RTAVMTGHFPARHNIDGHFAWVPSNAKRNMPDWLDPSAVTLPRLLQSGGYKTAHFGKWHL 168 Query: 187 SKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRE 246 SN +P+ P G+D + F+ +G E Sbjct: 169 ---SNDMIPDSP-----------------TPAAYGYDRYGAFNCSG-------------E 195 Query: 247 RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGS 305 ++P + +E I ++ A + PF + L + PH P +Y+ +F ++G Sbjct: 196 QMPVH-----EDANETIRFIEEAHSKGDPFFVNLWVHEPHTPFH--VIPKYRWRFRDSGL 248 Query: 306 QTADNYYASVYS-VDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--------------- 349 AD YA+V S D + +L+ L + + T+++F+SDNG Sbjct: 249 SEADEIYAAVLSHADDRIGEVLDALDRLELTNKTLVIFSSDNGPARGSANAKLELSYDTA 308 Query: 350 ------IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD--KLISAMDFYPT 401 I + +KGYK+ + GG + P + W GK+ G D +ISA+D PT Sbjct: 309 TGAGFGIGASKGITAGRKGYKASLFEGGINVPFIVRWPGKVAAGKTDDSAMISAVDLLPT 368 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 D A + +P + DG+S + L+ + K L W S + W +++ P H + Sbjct: 369 FCDIAGVELPSAYQADGISQVSALKGQPTTGRTKPLFWKYS-ARWPAQKSRP-----HHW 422 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANP 520 SY V N + L+ +++ + LY + +D + +L + P Sbjct: 423 A-------------------SYCVVNERWKLLANQDSSYVELYDIVSDPFESTDLKESQP 463 Query: 521 QVVKEMQ 527 V ++ Sbjct: 464 DAVTKLS 470 >UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacteria RepID=A6DR29_9BACT Length = 510 Score = 129 bits (323), Expect = 4e-28, Method: Compositional matrix adjust. Identities = 118/432 (27%), Positives = 189/432 (43%), Gaps = 73/432 (16%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +K K+ + + F+P + KPN+I++ DDLG+G F+ GS Sbjct: 1 MKTKSLLIAASAALFSPFISAESAKPNVILIMADDLGWGDTGFN-GS------------- 46 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 K I+ TP L + EG++ Y A V P+RA+++TGR P R GV Sbjct: 47 -------KVIK-----TPHLDQMAAEGLQLDRFYSASSVCSPTRASVLTGRNPYRTGV-- 92 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK------------QT 200 T Q + E LPE+ GY T GKWHL +++ ++ + Sbjct: 93 PTANQGFLRPEEITLPEVLNEQGYATGHFGKWHLGTLTHTEKDANRGKPGNTKEFNPPKL 152 Query: 201 RDYHDNFTTFSA-------------EEWQPQNRGFDYFMGFHAA---GTAYYNSPSLFKN 244 Y D F T S ++ + ++ G++Y + GT Y++ + Sbjct: 153 HGYEDAFVTESKVPTYDPMILPAKFDQGESKHLGWEYVKEGEESKPYGTFYWD---IEGK 209 Query: 245 RERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG 304 + KG S + D + +D+A ++PF+ + ++ PHLP A ++Q+ + Sbjct: 210 KITDNLKGDDSRVIMDRVLPFIDQAVADEKPFLSVVWFHTPHLP--CVAGPRHQEMYKGH 267 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ---K 361 NY V ++D+ + R+ + L G DNT+I F SDNG P NG+ + Sbjct: 268 PIHLRNYAGCVTAMDEQIGRLRKHLADKGVADNTMIWFCSDNGPE-SKERPDNGSAGHFR 326 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA----MDFYPTALDAADISIPK-DLKL 416 G K Y GG P M W K++ + ISA D+ PT LDA I P+ Sbjct: 327 GRKRDLYEGGVRVPAVMVWPAKVKEA---RKISAPCITSDYMPTILDALHIPHPQASYAT 383 Query: 417 DGVSLLPWLQDK 428 DG SL+P + ++ Sbjct: 384 DGRSLMPIINNE 395 >UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT Length = 493 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 126/422 (29%), Positives = 184/422 (43%), Gaps = 87/422 (20%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNII + DDLGYG+L GS+ K ++ TP L L Sbjct: 27 PNIIYILADDLGYGEL----GSYGQKKIK----------------------TPNLDRLAA 60 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTG--------RAPARFGVYSNTDAQDGIPLTETF--L 167 +G+RFT Y V PSR +TG R G +S+ +P+ ET L Sbjct: 61 DGMRFTQHYTGAPVCAPSRYMFLTGNHAGHAYIRGNYELGQFSDEMEGGQMPIPETTPTL 120 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVP------------EDKQTRDYHDNFTTFSAEEW 215 ++ + GY TA +GKW L P + KQ +Y+ + ++ Sbjct: 121 AKMLKKAGYQTAMIGKWGLGMNETTGSPLLHGFDYYYGYLDQKQAHNYYPTHL-WENDKK 179 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSL--FKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 P N DYF+ H+ ++ N FK +E P D++ ++AI +D + D Sbjct: 180 DPLNN--DYFL-VHSPISSKANQSDFDQFKGQEYAP------DRMLEKAIQFLDTTAS-D 229 Query: 274 QPFMLYLAYNAPHLPNDNPAP--DQYQKQFN----------TGSQTADNYYASVYS-VDQ 320 +P+ LY PH+ P DQY+ F T Q + YA++ + +D Sbjct: 230 KPYFLYYPSPIPHVSLQVPDSLVDQYRDVFEEEPYLGNKGYTAHQFPNAAYAAMITHLDS 289 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQ--KGYKSQTYPGGTHTP 375 V +I + +K+ GQ +NT+ILF+SDNG G P N A +G K Y GG P Sbjct: 290 EVGKIWDSVKEKGQEENTLILFSSDNGPTFAGGVDPDFFNSAAGLRGLKMDVYEGGIRIP 349 Query: 376 MFMWWKGKLQPGNYDKLISA-MDFYPT--ALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 +WKGK++ G+ LIS D + T L D S P DG+S+LP L + Q E Sbjct: 350 FIAYWKGKIKAGSISDLISGHWDMFNTFAELAGQDQSAP-----DGISILPELLGESQNE 404 Query: 433 PH 434 H Sbjct: 405 TH 406 >UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S6_9BACT Length = 626 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 128/425 (30%), Positives = 182/425 (42%), Gaps = 108/425 (25%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E S K +PNI+ + DDLG+ DT G K E TP Sbjct: 21 ESSPKTRPNIVFILADDLGWS--------------------DTTLYGTTKFFE-----TP 55 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-----------------N 153 + L G++FTN Y A+ V P+RA+IMTG P R G+ + Sbjct: 56 NIERLAARGMKFTNAYAANPVCSPTRASIMTGLYPGRLGITTPSGHVPEEKLEASLVARG 115 Query: 154 TDAQDGIPLT-------ETF-LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 + +Q + T E F L E + GY T GKWHL Sbjct: 116 SPSQKSLQATSATRLKLEYFTLAEALKGAGYATGHFGKWHLGP----------------- 158 Query: 206 NFTTFSAEEWQPQNRGFD----YFMGFHAAG-TAYYNSPSLFKNRERVPAK--GYISDQL 258 E + P ++GFD ++ G AG A + SP +PAK + D + Sbjct: 159 -------EPFDPLHQGFDVDVPHWSGPGPAGYIAPWKSPKF-----HLPAKPGEQLEDLM 206 Query: 259 TDEAIGVVDRAKTLDQPFML-YLAYNAPHLPNDNPAPD---QYQKQFNTGS-QTADNYYA 313 + EAI + K D+PF L Y A++ H P PD +Y+++ + S Q Y A Sbjct: 207 SQEAIKFIRVHK--DEPFYLNYWAFSV-HSPWGG-KPDLIEKYRRKADPNSAQRNPVYGA 262 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV------------IDGPLPLNGAQK 361 V S+D V R+L+ L + D+TII+F SDNG V ++ P N + Sbjct: 263 MVESLDDAVGRLLDTLDELKLSDHTIIVFFSDNGGVNWFEPAMKEEAGMNSPPTTNAPLR 322 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVS 420 K Y GGT P + W GK + D ++ ++DFYPT L+ A ++ DLK DGVS Sbjct: 323 AGKGTLYEGGTREPCVVVWPGKTKAATQNDAMLCSVDFYPTLLEMAGVAAKPDLKFDGVS 382 Query: 421 LLPWL 425 +P L Sbjct: 383 QVPAL 387 >UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 Tax=Alphaproteobacteria RepID=B9KQS8_RHOSK Length = 509 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 98/393 (24%), Positives = 164/393 (41%), Gaps = 73/393 (18%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P +P+I+ + +DDLGY D + Sbjct: 55 PARAQEVARPHILYILVDDLGYA---------------------------DVGYHGSDVK 87 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---TDAQDGIPLTET 165 TP + L EG R Y + P+RAA+MTGR P R+G+ + + + G+ E Sbjct: 88 TPNVDRLAAEGARLMQFYT-QPLCTPTRAALMTGRYPMRYGLQTGVIPSGGRYGLDTAEV 146 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LP++ + GY TA VGKWHL + +++ P+ RG DYF Sbjct: 147 LLPQVLKEAGYKTALVGKWHLGH----------------------ADQKYWPRQRGVDYF 184 Query: 226 MGFHAAGTAYYNSPS-----LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G ++ + +++ E V GY ++ +AI +++ + P +YL Sbjct: 185 YGPLVGEIDHFKHEAHGITDWYRDNEMVKEPGYDTELFGADAIRLIEEHDSA-TPLYMYL 243 Query: 281 AYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 ++ APH P APD+Y+ + + + Y A + +D V +L+ L++ G ++T+ Sbjct: 244 SFTAPHTPYQ--APDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVLQALERRGMREDTL 301 Query: 340 ILFTSDNG----------AVIDGPL-PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 ++F SDNG + G L P N + K Y GGT W G++ G Sbjct: 302 VIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALANWPGRIPAGE 361 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 ++ +D PT A I +LDG+ + Sbjct: 362 THGMMHVVDMLPTLAGLAQAEIAHAGQLDGMDV 394 >UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ15_9BACT Length = 469 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 122/430 (28%), Positives = 185/430 (43%), Gaps = 101/430 (23%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK-STPTLLSL 115 KPNII L +DDLGYG L ++ +K STP + + Sbjct: 20 KPNIIYLLVDDLGYGDL---------------------------SLYGQKKFSTPNIDRI 52 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-----QDGIPLT--ETFLP 168 EG+ FT+ Y V PSRAA+MTG+ V N + +PL + L Sbjct: 53 GKEGMVFTDHYSGSTVCAPSRAALMTGKHSGHGLVRGNYEVGPHGFGGELPLRPEDVSLA 112 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 E+ ++ GY T +GKW + +P+ +GFDY GF Sbjct: 113 EVMKSAGYATGLIGKWGMG----------------------MDGTTGEPRKKGFDYSYGF 150 Query: 229 HAAGTAYYNSPS-LFKNRERV-------PAKG-YISDQLTDEAIGVVDRAKTLDQPFMLY 279 A++ P +++N E++ A+G YISD ++ I V+ K D+PF L+ Sbjct: 151 LNQAHAHHYYPEYIYENGEKLMIPENKDDARGLYISDTFAEKGIEFVEENK--DKPFFLF 208 Query: 280 LAYNAPH----LPNDN---------PAPDQYQKQFNTGSQTADNYYAS-----------V 315 A+ PH +P+D+ P KQ G+ YAS + Sbjct: 209 WAFVTPHAELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFGVYASQDHPRAAFSGMI 268 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPG 370 +D+ V + ++L++ G DNTII+F+SDNG +G N GYK G Sbjct: 269 TRLDKRVGDLFDKLEELGIDDNTIIMFSSDNGPHKEGGADPDFFDSNAELTGYKRDLTEG 328 Query: 371 GTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P + W ++ + SA D PT + A+ P+D +DG+S LP L+ +K Sbjct: 329 GIRVPFMVRWPNVVKARSKSSHASAFWDVMPTIAEIANTDSPED--IDGLSFLPALKGEK 386 Query: 430 QGEPHKNLTW 439 Q + HK+L W Sbjct: 387 Q-QVHKHLYW 395 >UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=A4XED5_NOVAD Length = 462 Score = 128 bits (322), Expect = 5e-28, Method: Compositional matrix adjust. Identities = 100/387 (25%), Positives = 160/387 (41%), Gaps = 77/387 (19%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PNI+ + DDLGY DT G + TP + S+ Sbjct: 34 RPNIVFIMADDLGY--------------------ADTSATG------SRHIRTPAIDSIG 67 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS------NTDAQDGIPLTETFLPEL 170 GV GY + + P+R A++TG RF + N A G+PL + + Sbjct: 68 AGGVMLRQGYSSTPICSPTRTALLTGCYAQRFAIGVEEPLGPNAPAGIGVPLDRPTIASV 127 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 + GY T+ VGKWHL + P G+D+F+G Sbjct: 128 MKALGYRTSLVGKWHLGE-----------------------PPAHGPLKHGYDHFLGIVE 164 Query: 231 AGTAYY-------NSPS---LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G Y+ P+ L ++ + GY++D DEA+ V++ +QPF L L Sbjct: 165 GGADYFVHRMVMSGKPAGVGLAEDDAQTDRTGYLTDIFGDEAVRVIEEGG--NQPFFLSL 222 Query: 281 AYNAPHLP----NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYD 336 + APH P D F+ Y V ++DQ V ++L + ++G+ D Sbjct: 223 HFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMVETMDQNVAKVLAAIDRSGKAD 282 Query: 337 NTIILFTSDNGA-VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLIS 394 NT+++FTSDNG P G+K + GG P+ + W +++ G+ ++++ Sbjct: 283 NTVVVFTSDNGGERFSDTWPF----VGHKGEVLEGGVRVPLMVRWPRRIKAGSRSEQVMV 338 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSL 421 +MDF PT L A + + DG L Sbjct: 339 SMDFLPTLLGMAGGDAARIGRFDGADL 365 >UniRef50_D2R5N1 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R5N1_9PLAN Length = 600 Score = 128 bits (321), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 109/398 (27%), Positives = 167/398 (41%), Gaps = 75/398 (18%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 T +PN++++ DD GYG D G+ +E TP+L Sbjct: 25 TSKRPNVLLIITDDQGYG----DIGAHGNTMIE----------------------TPSLN 58 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 +L + R TN +V +R+A+MTGR R GV+ + + ET +P+LF + Sbjct: 59 ALAKQATRLTNFHV-DPTCAETRSALMTGRYSCRTGVWHTIMGRSILRRDETTMPQLFSS 117 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ-PQNRGFDYFMGFHAAG 232 GY T GKWHL P R + ++ Q P + G DYF Sbjct: 118 GGYRTGMFGKWHLGD----SYPYRPMDRGFGESLVIAGGGVGQSPDHWGNDYF------- 166 Query: 233 TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVD--RAKTLDQPFMLYLAYNAPHLPND 290 + +N + KGY +D A ++ +A + QPF Y+A NAPH P Sbjct: 167 -----DDTYLRNGQPEAQKGYCTDVFFQNAKAFIEQSQASSGKQPFFCYIATNAPHAPY- 220 Query: 291 NPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG-- 347 N P + G Q N+YA + ++D+ + ++L L + D+TI++F +DNG Sbjct: 221 NVDPQLAEPYLKKGVPQPMANFYAMISNIDENIGKLLAFLDERKLSDDTIVIFMTDNGTA 280 Query: 348 -------------------AVIDGPLPLNGAQKGYKSQT---YPGGTHTPMFMWWKGKLQ 385 A D P NG G ++Q Y GG P F+ W G Sbjct: 281 EGAGRSGRPGGKAKGKNLPATTDEP-KWNGYTAGMRAQKGSQYEGGHRVPCFIRWPGGKL 339 Query: 386 PGNYD--KLISAMDFYPTALDAADISIPKDLKLDGVSL 421 P +++ +L + D P+ L D+ P L+LDG L Sbjct: 340 PVDHEVKQLTAHFDLLPSLLKWCDVEKPAALELDGQPL 377 >UniRef50_Q01RE9 Sulfatase n=4 Tax=Bacteria RepID=Q01RE9_SOLUE Length = 499 Score = 128 bits (321), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 107/392 (27%), Positives = 159/392 (40%), Gaps = 95/392 (24%) Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLP 168 TP L +L +G N +V + PSRA+I+TG R + N A IP F P Sbjct: 55 TPHLDTLARDGAHLKNAFVCTALCSPSRASILTGVYAHRHHIVDNNTA---IPRGTRFFP 111 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 +L Q GY T VGKWH+ + + P P GFD ++ F Sbjct: 112 QLLQRAGYKTGFVGKWHMGREGDDPQP-------------------------GFDKWVSF 146 Query: 229 HAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL- 287 G+ L + + VP KGYI+D+LTD A+ + R +QP+ LYL++ A H Sbjct: 147 RGQGSYLPERNGLNVDGKHVPQKGYITDELTDYALDWL-RTVPKEQPYFLYLSHKAVHAD 205 Query: 288 --------------------------PNDNPAPDQYQKQFNTG-------------SQTA 308 PN P Q Q N+ + Sbjct: 206 FIPADRHKGAYAKETFRPPTTMDESGPNAQHRPMWVQNQRNSWHGVDFPYHSDLDVGEYY 265 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG------AVIDGPLPLNGAQKG 362 Y ++ VD V R+L+ L++ GQ D+T++++ DNG +ID Sbjct: 266 KRYAETLLGVDDSVDRMLDALRERGQLDSTLVIYMGDNGFQFGEHGLID----------- 314 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 K Y P+ G D++++ +D PT LDAA +IP+ LDG S+ Sbjct: 315 -KRTAYEESMRVPLLARCPEMFSGGRVVDRMVAGLDIMPTVLDAAGAAIPQ--GLDGRSM 371 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 LP L + + +P + Y +W E N P Sbjct: 372 LPLL--RGENDPQWRTQLLYEY-YW--ERNFP 398 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 128 bits (321), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 113/418 (27%), Positives = 168/418 (40%), Gaps = 89/418 (21%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNI+++ DDLG F+ G + ++ TP + L Sbjct: 25 KPNIVLIVADDLGC----FELGCYGQTKIK----------------------TPHIDKLA 58 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGR----APARFGVYSNTDAQDGIPLTETFLPELFQ 172 G +FT Y V PSR +MTG+ A R V + + Q I + + + + Sbjct: 59 QGGAKFTRFYSGSPVCAPSRCVLMTGKHSGHATVRNNVEAKPEGQFPIRAEDVTVADALK 118 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 HGY T A+GKW L P GFD F G++ Sbjct: 119 AHGYATGAMGKWGLGMFDTA----------------------GSPLKHGFDLFFGYNCQR 156 Query: 233 TAYYNSPS-LFKNRERVPAKG--------YISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 A+ + P+ +++N +RV KG + D +EA+G ++ K +PF LYL + Sbjct: 157 HAHSHYPTYIYRNDKRVELKGNDGKTGKQFTQDLFEEEALGFIEANKA--KPFFLYLPFT 214 Query: 284 APHLP---------------NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 PH+ D+PA D +K + Y A V +D+ V R++E+ Sbjct: 215 VPHVAVQVPEDSLNEYKGQLGDDPAYDG-KKGYQPHPAPHAGYAAMVTRMDRSVGRVVEK 273 Query: 329 LKKNGQYDNTIILFTSDNGAV--IDGP----LPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 L G NT++LFTSDNG + G G +G K Y GG P + G Sbjct: 274 LNALGLEKNTLVLFTSDNGPTHNVGGADSSFFNSAGKLRGLKGSVYEGGIRVPFIAYQPG 333 Query: 383 KLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 ++ G D + D PT A P +DG+S LP L+ +KQ H L W Sbjct: 334 TIKAGTESDAPLYFPDVLPTLCAFAGTKAPS--AIDGISFLPLLKGEKQ-PTHDFLYW 388 >UniRef50_A9MER1 Putative uncharacterized protein n=2 Tax=Enterobacteriaceae RepID=A9MER1_SALAR Length = 430 Score = 127 bits (320), Expect = 9e-28, Method: Compositional matrix adjust. Identities = 97/323 (30%), Positives = 142/323 (43%), Gaps = 50/323 (15%) Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFL 167 +TP L L EGV+F N + V GP+R+ + TGR P + G Y N A + E L Sbjct: 27 TTPVLDQLAREGVKFENAFTVQPVCGPARSCLQTGRYPTQNGCYRNNIA---MRQDEVTL 83 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 +LF GY TA +GKWHL+ + PV E + G+ Y++ Sbjct: 84 AKLFNQAGYDTAYIGKWHLADLDEKPVLEALRG--------------------GWQYWLA 123 Query: 228 FHAAGTAYYNSPSLFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 A + F + + P GY D T A+ + + + D PF+L+L+Y P Sbjct: 124 ADALEHTSHPYGGHFFDNDNQPVHFDGYRVDDQTTFALDYL-KNRQRDNPFLLFLSYLEP 182 Query: 286 HLPNDNP---APDQYQKQFNTGS-------------QTADNYYASVYSVDQGVKRILEQL 329 H ND APD Y ++F T S Q +YY ++D+ + RI++ L Sbjct: 183 HFQNDMARFVAPDGYAERFQTASVPPDLINRPGDWPQNLPDYYGMCQNLDENLGRIVDYL 242 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN- 388 K +G+YDNTIILF SD+G YK + P G G Sbjct: 243 KSSGEYDNTIILFFSDHGC------HFRTRNDEYKRSCHESSIRIPCVA-RGGPFSGGRT 295 Query: 389 YDKLISAMDFYPTALDAADISIP 411 + L++ +D T L AA I++P Sbjct: 296 VEHLVTLLDIPVTMLSAAGITVP 318 >UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=5 Tax=Hydra magnipapillata RepID=UPI0001927538 Length = 502 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 137/519 (26%), Positives = 220/519 (42%), Gaps = 96/519 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KP+II++ DDLG+ + F + P TP + L Sbjct: 19 KPHIIMIVADDLGWNDISFHGSNEIP--------------------------TPNIDRLA 52 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPELFQN 173 + GV N YV + PSR+AIMTGR P G+ +T G+ L E FLP+ + Sbjct: 53 NNGVILDNYYVL-PICTPSRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQ 111 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY T VGKWHL F A+++ P RGFD + G + Sbjct: 112 QGYKTHGVGKWHLG----------------------FFAKQYTPTYRGFDSYYGSYLGKG 149 Query: 234 AYYNSPS--------LFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 Y+N + L N V ++ Y ++ T EAI ++ + +P LYLAY Sbjct: 150 DYWNHSNTETYSGLDLHDNENGVFSQDGNYSTEMYTAEAISCINNHNS-SEPLFLYLAYQ 208 Query: 284 APHLPN--DNP--APDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKKNGQYDNT 338 A H N ++P AP ++ +F+ YA++ +D GV R+ + L + DN+ Sbjct: 209 AVHSANTEEDPLQAPQEWIDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAEKKMLDNS 268 Query: 339 IILFTSDNGAVIDGPLPLNGAQ----KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS 394 II+FT+DNG +G N A +G K+ + GG F++ K P +LI Sbjct: 269 IIIFTTDNGGPANG-FDYNWANNFPLRGVKATLFEGGVRGVSFVYSKLIESPRVSHELIH 327 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI---------TSYSH 445 D+ PT ++ A + D LDG LQ+K+ + ++ L I Sbjct: 328 ITDWLPTLVNLAGGKV-SDGFLDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVGS 386 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY---TVR-NNDYSLVYTVENNQL 501 W + +WD ++ P + N + + FSY TV+ +D +V ++ Sbjct: 387 WKIIKEGNYWDGWYP---------PPSFNEQSNNSFSYLSSTVKCGHDIPIVINHCDSYC 437 Query: 502 GLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + D + ++L+ P+V+ E+ + + S PP Sbjct: 438 LFHIDEDPCEINDLSKKFPEVLAELINRLNTYRQSMVPP 476 >UniRef50_A3ZWK4 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3ZWK4_9PLAN Length = 442 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 126/449 (28%), Positives = 189/449 (42%), Gaps = 53/449 (11%) Query: 104 AAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLT 163 AA TP L EGVRFT Y A V P+R +++TGR P R VY+ G P+ Sbjct: 15 AAPIHTPNLDQAAAEGVRFTRFYAAAPVCSPTRCSVLTGRNPNRSAVYAW-----GWPIR 69 Query: 164 --ETFLPELFQNHGYYTAAVGKWHLSKI-SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 E L E Q GY T+ GKWHL + + PV P Sbjct: 70 PQEITLAERLQAAGYATSHFGKWHLGSVRKDSPV---------------------SPGKC 108 Query: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 GFD ++ +A Y N P + V G SD D AI + ++PF + Sbjct: 109 GFDDWI---SAPNFYDNDPIMSDQGRAVQYHGESSDVTADLAIDWIRAQAKEEKPFFSVV 165 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 + +PH P+ A D ++ + +YY V +D+ +I LK+ G DNTI+ Sbjct: 166 WFGSPHSPHI--AADADRELYKDEPAKFRDYYGEVTGIDRAYGKIRSTLKELGISDNTIL 223 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL-QPGNYDKLISAMDFY 399 + SDNGA D G + K Y GG P + W + P + D + Sbjct: 224 WYCSDNGA--DKAKGSAGPFREKKGSIYEGGLLVPGILDWPARFPAPQTTSLRATTCDIF 281 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS------YSHWFDEENI- 452 PT L AA +S K LDG++LLP L K P W T+ S EE + Sbjct: 282 PTVLAAAGLSPDKQRPLDGINLLPLLTAKTDMRPQPIGFWQTANGGKPVRSDAMMEELLN 341 Query: 453 ---PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL-YKLTD 508 D V + D P P + D ++ + D+ L + +EN + + ++L D Sbjct: 342 QQATGGDLPADEVSLHAADLPKPPVSIDTLAGHASLTSGDWKL-HRIENKKGAVRFELYD 400 Query: 509 LQ----QKDNLAAANPQVVKEMQGVVREF 533 L +K+N+ P++ +++ + R++ Sbjct: 401 LAADPYEKENVLKQYPEIAEKLTKLQRDW 429 >UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKM2_9BACT Length = 472 Score = 127 bits (319), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 126/515 (24%), Positives = 209/515 (40%), Gaps = 115/515 (22%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII++ DDLG L G + + TP + +L Sbjct: 19 KPNIILILADDLGGAGL----GCYGNEFF----------------------GTPNIDALA 52 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ------------DGIPLTE 164 + +RF N Y V PSRA +M+G+ R + + Q +G L + Sbjct: 53 AKSMRFDNAYSGSTVCAPSRACLMSGQYVGRHKITWVSQFQRDYIKKKRGPNLNGFRLLQ 112 Query: 165 TFLP-----------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 P + F++ GY TA GKWHL P+D Sbjct: 113 PVHPYHMPEGTITLGQAFKDAGYATAMFGKWHLGH-----RPQD---------------- 151 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 QP GFD ++ F G ++ +P N+ + K Y++D D+AI ++R + Sbjct: 152 --QPDKMGFDEYLTFQ--GMKHF-APYTLPNKVQHGEKVYLTDLTCDKAIDFMERKVAAE 206 Query: 274 QPFMLYLAYNAPHLPND-NPAPDQYQKQFNTGSQTADNYYASVYS-VDQGVKRILEQLKK 331 +PF LY H P + A QY ++ G A++ +D V R+++++ + Sbjct: 207 KPFFLYYPDFLVHAPMEAKQAMIQYFEKKTIGQHHKSVIGAAMTKHLDDTVGRLVKKVDE 266 Query: 332 NGQYDNTIILFTSDNGAV---IDGPLPLNGAQ----KGYKSQTYPGGTHTPMFMWWKGKL 384 G +NTII+FTSDNG + DG G + KS Y GG+ P+ W G Sbjct: 267 LGIAENTIIIFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKSSHYEGGSRVPLIFHWPGVT 326 Query: 385 QPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 + + +++S +D YPT L A ++ P++ LDG+ L++ KQ P ++L Sbjct: 327 EANSLSHEVVSGIDIYPTLLKIAQVAKPQEQILDGIDFSSILKNPKQKLPARDLF----- 381 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 +Y H+ S ++R D +Y L Sbjct: 382 -------------HYQPIYNHKV-----------FGDASVSLRRGDMKYIYYFVEENFEL 417 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 + L D+ QK +L+A P++ +E++ + +D + Sbjct: 418 FNLKDDVSQKKDLSADYPELCEELKKACFKHLDET 452 >UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 Length = 491 Score = 127 bits (318), Expect = 1e-27, Method: Compositional matrix adjust. Identities = 103/396 (26%), Positives = 161/396 (40%), Gaps = 73/396 (18%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +P+I+ + DDLG+ + F + TP L L Sbjct: 48 RPHIVYILADDLGFADVGF---------------------------HGSDIKTPNLDHLA 80 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---TDAQDGIPLTETFLPELFQN 173 +G R Y P+RAA +TGR P +G+ + A+ G+ E LP+ ++ Sbjct: 81 AQGARLGQFYT-QPFCTPTRAAFLTGRYPLHYGLQVGAIPSGAKYGLATDEFLLPQALKD 139 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA VGKWHL +++ P+ RGFD F G Sbjct: 140 VGYRTALVGKWHLGHAD----------------------QKFWPRQRGFDSFYGPLVGEI 177 Query: 234 AYYNSPS-----LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 ++ + + + +V +GY ++ EA+ ++ A P LYLA+ APH P Sbjct: 178 DHFKHEAHGVTDWYHDNTQVKEEGYDTELFGKEAVRLI-AAHDPKTPLFLYLAFTAPHTP 236 Query: 289 NDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 AP Y Q+ + + Y A + ++D + ++ L G +NT+I+F SDNG Sbjct: 237 FQ--APQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVAALTSRGMRENTLIVFHSDNG 294 Query: 348 A----------VIDGPLPL-NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM 396 + G LP N + K Y GGT W G++ PG + ++ + Sbjct: 295 GTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWPGRIAPGAAEGVMHVV 354 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 D PT A S+ K LDGV + P L + G Sbjct: 355 DMLPTLAKLAGASLAKSKPLDGVDVWPALAAGQAGR 390 >UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D764_9BACT Length = 499 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 119/423 (28%), Positives = 168/423 (39%), Gaps = 105/423 (24%) Query: 57 KPNIIVLTMDDLGYGQL-PFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPN I++ +DD+GY + PF + NR TP L + Sbjct: 23 KPNFIIINIDDMGYADIAPFG-------SKLNR--------------------TPNLDRM 55 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPELFQ 172 EG + T Y A V PSR+A+MTG P R + A G+ E + EL + Sbjct: 56 AQEGRKLTCFYGAP-VCSPSRSALMTGCYPKRVLPIPSVLFPGAAVGLNPAEHTVAELLK 114 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF---H 229 GY T +GKWHL E+ P RGFDY++G + Sbjct: 115 KSGYATGCIGKWHLG-----------------------DQPEFLPPRRGFDYYLGLPYSN 151 Query: 230 AAGTAYYNSPSLFKN-----------RERVPAKGYISDQ---------------LTDEAI 263 G S S + +P G +Q DE Sbjct: 152 DMGPGEDGSKSSLGDPIPKPKATPNPSAPIPETGITGNQPPLPMLENEKVIARVRQDEQQ 211 Query: 264 GVVDR---------AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYAS 314 G+VDR + D+PF LYL +NA H P Y + G Y Sbjct: 212 GLVDRYTKAAVKFITEHKDKPFFLYLPHNAVHFP-------IYPGKEWAGKSPNGYYSDW 264 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 V VD V ++L L++ D+T +LFTSDNG P +N +G+K+ T+ GG Sbjct: 265 VEQVDWSVGQVLNTLRELKLQDHTFVLFTSDNGGT---PRAVNAPLRGFKTTTWEGGMRE 321 Query: 375 PMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE- 432 P WW GK+ G I+ M D PT ++ A +P D K+DG ++ P L + + Sbjct: 322 PTIAWWPGKIPGGTSSDEITGMFDILPTLVNLAGGEVPTDHKIDGGNIWPVLAGEAGAKS 381 Query: 433 PHK 435 PH+ Sbjct: 382 PHE 384 >UniRef50_C1ZHB0 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZHB0_PLALI Length = 637 Score = 126 bits (317), Expect = 2e-27, Method: Compositional matrix adjust. Identities = 141/549 (25%), Positives = 223/549 (40%), Gaps = 145/549 (26%) Query: 35 LKATKTNVAFSDFTP------TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 L + T V + TP T + +PNIIV+ +DDLG+ Sbjct: 32 LTTSPTPVVLAQITPETSVDTTSAALADRPNIIVIMVDDLGWR----------------- 74 Query: 89 EVVDTYKIGIDKAIEAAQKS-TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 D +I ++ S TP + +L GV FT Y + + PSRA+++TG+ PAR Sbjct: 75 ----------DTSIYGSKSSRTPHIDALAARGVIFTQAYSSSSLDEPSRASLLTGKWPAR 124 Query: 148 FGVYSNTDAQDG------------------IPLTETFLP-------ELFQNHGYYTAAVG 182 + + + G P + T LP E+ QN GY TA +G Sbjct: 125 LKLTQSRELNPGEILEPSLPQTALSHISMITPTSRTQLPGDELTAAEILQNAGYATAFMG 184 Query: 183 KWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLF 242 +W+L + ++ QP+N+GF H ++ S F Sbjct: 185 EWNLGENAS------------------------QPENQGFS-----HVVCSSPLTSQPQF 215 Query: 243 KNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF- 301 A + D LT +AI ++ +PF L L Y + P P+ D Q + Sbjct: 216 -------AGQHADDLLTQQAINWMETNS--KEPFFLNLWYQSVGAPFQAPSGDIQQARTL 266 Query: 302 ---NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA----VIDGPL 354 + Q A A + ++DQ V I+ L++ TII+FTSDNG I+G L Sbjct: 267 ADPSQDPQQAPVMAAMIAALDQRVGLIVAALERLQLTQRTIIVFTSDNGGNMTDTIEGDL 326 Query: 355 PL-NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPK 412 N +G K Y GG+ P+ + W G P + D +SA+D PT +D A +IP Sbjct: 327 LTSNRPLRGGKGSMYEGGSRVPLIVVWPGVATPARSCDDAVSAVDLLPTLVDMARGTIPA 386 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH- 471 ++DGVSL P L + + FD I +H YPH Sbjct: 387 GHQIDGVSLKPAL----------------TGATGFDRGAI-----FHH--------YPHY 417 Query: 472 NPNTEDLSQFSYTVRNNDYSLV-----YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKE 525 NP T S VR+ + L+ + + +++ +Y L D ++ NLA + + Sbjct: 418 NPTTGTTPAIS--VRSENMKLIRFFGGHVTQTDRIEVYDLQNDPGERINLARSRRDEIVR 475 Query: 526 MQGVVREFI 534 + +++ F+ Sbjct: 476 LTNLIQNFL 484 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 114/431 (26%), Positives = 179/431 (41%), Gaps = 103/431 (23%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLGYG+L G + + ++ TP + +L Sbjct: 20 PNVIYILADDLGYGEL----GCYGQEKIK----------------------TPHIDALAK 53 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----AQDGIPLTETFLPELFQN 173 EG+RFT Y V PSR +++G+ ++ + +N + Q+ IP L ++F++ Sbjct: 54 EGMRFTRHYSGAPVCAPSRGVLLSGQQLSKAYIRNNREHKPEGQEPIPEPGMTLAQIFKD 113 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY T A GKW L + P+ GFD F G++ Sbjct: 114 KGYATGAFGKWGLG----------------------YPGSSSDPKALGFDTFYGYNCQRV 151 Query: 234 A--------YYNSPSLFKNRERVP-----------------AKGYISDQLTDEAIGVVDR 268 A + N ++ N + VP A+ Y D + DEA+ + Sbjct: 152 AHSFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENYAPDLILDEALKFIKD 211 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQT-----------ADNYYASV 315 K D+PF YL + PHL P D Y K++++ ++ Y A + Sbjct: 212 NK--DKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAAYLPHLRPRAGYAAMI 269 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV----IDGPLPLNGAQ--KGYKSQTYP 369 +D+ V +++ LK+ +NT+++FTSDNGA +D N + +G K Y Sbjct: 270 SDLDEHVGSVMQLLKELDLVENTLVIFTSDNGASHCIEVDHEF-FNSTKDLRGLKGSVYE 328 Query: 370 GGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 GG PM W GK++ +S +D T D P+ DGVS LP L+ + Sbjct: 329 GGLRVPMIAHWPGKIKKAQVSDHVSGFVDVMATFCDLLQTEAPQ--TSDGVSFLPTLKGE 386 Query: 429 KQGEPHKNLTW 439 KQ EP L W Sbjct: 387 KQ-EPQPVLAW 396 >UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD Length = 480 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 121/431 (28%), Positives = 186/431 (43%), Gaps = 102/431 (23%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLGYG ++ Y I K TP L L D Sbjct: 24 PNVIFILADDLGYGD------------------IEPYGQQIIK--------TPQLSKLAD 57 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGI----PL--TETFLPELF 171 EG++FT Y V PSRA+ +TG+ + N + ++ + PL + + +LF Sbjct: 58 EGMKFTQFYTGTSVCAPSRASFITGQTTGETHIRGNEEVREPVDGQAPLLANDPSVAQLF 117 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T GKW L + + E P +GFD F G+++ Sbjct: 118 KKAGYNTGCFGKWGLGIVPS----------------------EGNPLKQGFDTFFGYNSQ 155 Query: 232 GTAYYNSPS-LFKNRERV--PAKG-------YISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A+ P+ L+ + E+V P G Y D + ++ + + + +T ++PF ++L Sbjct: 156 FRAHRRYPAFLWHDNEKVLIPENGNYERQEVYGEDLIQEKILDYIGK-QTAEKPFFMWLT 214 Query: 282 YNAPH----LPNDN----------------------PAPDQYQKQFNTGSQTADNYYASV 315 Y PH +P+D+ P P + + + T Y A V Sbjct: 215 YTLPHAELVVPHDSIYASYEYLPKKPYKGVDYDKITPKPFGWAG-YMSQPHTYATYAAMV 273 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQ--KGYKSQTYPG 370 +D+ + I + LK G ++TII+F SDNGA +G P N + +G K Y G Sbjct: 274 SRLDKYLGEIRKLLKVKGLDEDTIIIFASDNGAHREGGADPKFFNSSAGLRGIKRDLYEG 333 Query: 371 GTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDL-KLDGVSLLPWLQDK 428 G TP ++WKGK++ G+ I A D PT A+I+ K + VS LP L K Sbjct: 334 GIRTPYIVYWKGKIKAGSVSDHIGAFWDMMPT---FAEITHQKYVPNRHQVSFLPTLLGK 390 Query: 429 KQGEPHKNLTW 439 KQ + HK L W Sbjct: 391 KQQQQHKYLYW 401 >UniRef50_A6DMV0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMV0_9BACT Length = 443 Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 133/524 (25%), Positives = 209/524 (39%), Gaps = 127/524 (24%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T + KPNI+ + +DD GY D + K ++ T Sbjct: 12 TTLVAQDKPNIVFIIIDDFGYA----DSEPYGAKDIK----------------------T 45 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-----YSNTDAQ------- 157 P + L +G++FTN Y V P+R A +TGR R G Y T++Q Sbjct: 46 PGINELAKDGLKFTNFYANAPVCSPTRCAFITGRWQQRSGFEWALGYGGTNSQLKNGQYE 105 Query: 158 -----DGIPLT--ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 GI L + LP+L + GY T A GKWHL Sbjct: 106 AVTDIHGIGLLPEKNHLPKLLKKAGYKTGAFGKWHLG----------------------- 142 Query: 211 SAEEWQPQNRGFDYFMG---FHAAGTAYYNSPSLFKNRE---RVPAKGYISDQLTDEAIG 264 S +++ P + GFD + G H Y + RE + GY++ + + A+ Sbjct: 143 SQDKFNPIHHGFDEYYGPLLGHCDYYTYKYYDDTYTLREGAKVIKDSGYLTTNINERAVD 202 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLP--NDNPAPDQYQK-QFNTGSQTADNYYASVYSVDQG 321 +DR D+PF +Y+ + A H P + + P Q K N G++ +Y A V VD+G Sbjct: 203 FIDRHA--DKPFFMYVPHMAVHSPYQSADKKPKQITKTNLNDGNRA--DYAAMVEEVDKG 258 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 V+ I+ +LK+ + T+ + +SDNG N K+ + GG P M W Sbjct: 259 VEMIIAKLKEKKIFHKTLFVVSSDNGG---AHFSDNAPLFHRKTTLFEGGIRVPCIMHWP 315 Query: 382 GKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 K+ G D++ MD T L A I P DG++LLP + DK Sbjct: 316 EKIGKGVVSDQIAITMDLSKTFLALAGIDEP---SYDGINLLPMMTDKNN---------- 362 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 E FW + K R ++ + ++ Y + N L+Y +EN Sbjct: 363 -------KVERTLFWRSNSKARRQKA---------VRMGKWKYILDVN-CELLYNLEN-- 403 Query: 501 LGLYKLTDLQQKDNLAAANPQVVKEMQGVVREF---IDSSQPPL 541 D+ + NL P++V++M+ + + +D QPP Sbjct: 404 -------DIAENKNLFYQRPEIVQQMKQKLASWEREMDQHQPPF 440 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammapro... 785 0.0 UniRef50_D2YC71 Sulfatase n=2 Tax=Vibrio mimicus RepID=D2YC71_VIBMI 616 e-175 UniRef50_D1P6M6 Putative sulfatase YdeN n=2 Tax=Providencia RepI... 535 e-150 UniRef50_C5BEH4 Sulfatase, putative n=37 Tax=Gammaproteobacteria... 534 e-150 UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT 469 e-130 UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomy... 468 e-130 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 465 e-129 UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT 463 e-129 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 460 e-128 UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto... 458 e-127 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 457 e-127 UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 456 e-126 UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase... 454 e-126 UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 449 e-124 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 449 e-124 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 447 e-124 UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 446 e-123 UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 445 e-123 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 445 e-123 UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 444 e-123 UniRef50_B1KD78 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 442 e-122 UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep... 442 e-122 UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 442 e-122 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 441 e-122 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 441 e-122 UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 440 e-122 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 437 e-121 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 437 e-121 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 436 e-120 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 436 e-120 UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 436 e-120 UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 436 e-120 UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellu... 435 e-120 UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D... 434 e-120 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 434 e-120 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 433 e-120 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 433 e-120 UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y2... 433 e-119 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 431 e-119 UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 431 e-119 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 431 e-119 UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6... 431 e-119 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 431 e-119 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 430 e-119 UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 430 e-119 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 430 e-119 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 430 e-118 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 430 e-118 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 430 e-118 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 429 e-118 UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 429 e-118 UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 428 e-118 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 428 e-118 UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7... 428 e-118 UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 427 e-118 UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 426 e-118 UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 426 e-117 UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM... 426 e-117 UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob... 426 e-117 UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB... 425 e-117 UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 425 e-117 UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 T... 425 e-117 UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 425 e-117 UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN 424 e-117 UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomy... 424 e-117 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 424 e-117 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 424 e-117 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 423 e-117 UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD 423 e-117 UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 423 e-116 UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA 421 e-116 UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8... 421 e-116 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 420 e-116 UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisp... 419 e-115 UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 419 e-115 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 419 e-115 UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bactero... 419 e-115 UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 419 e-115 UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 418 e-115 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 418 e-115 UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisp... 418 e-115 UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 418 e-115 UniRef50_Q7UHJ6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 416 e-115 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 416 e-114 UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 416 e-114 UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 416 e-114 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 415 e-114 UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 415 e-114 UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 414 e-114 UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirel... 414 e-114 UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=... 414 e-114 UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchi... 414 e-114 UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 413 e-114 UniRef50_A6DHI2 Aryl-sulphate sulphohydrolase n=2 Tax=Lentisphae... 413 e-114 UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 413 e-113 UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LD... 413 e-113 UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 412 e-113 UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 412 e-113 UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW10... 412 e-113 UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 411 e-113 UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 411 e-113 UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 411 e-113 UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT 411 e-113 UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7... 411 e-113 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 411 e-113 UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN 411 e-113 UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN... 410 e-113 UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organi... 410 e-113 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 410 e-113 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 409 e-112 UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 409 e-112 UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 409 e-112 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 408 e-112 UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 408 e-112 UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 408 e-112 UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA 408 e-112 UniRef50_A4AVA7 Aryl-sulphate sulphohydrolase n=2 Tax=Bacteroide... 408 e-112 UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 408 e-112 UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 407 e-112 UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica R... 407 e-112 UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacter... 406 e-111 UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 405 e-111 UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO 405 e-111 UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bactero... 405 e-111 UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9... 405 e-111 UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE 405 e-111 UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Ta... 405 e-111 UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 404 e-111 UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepI... 404 e-111 UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 403 e-111 UniRef50_A6DS95 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 402 e-110 UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomy... 402 e-110 UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bactero... 401 e-110 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 401 e-110 UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium ... 401 e-110 UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 400 e-110 UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 T... 400 e-110 UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=... 400 e-110 UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 400 e-109 UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 399 e-109 UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris ... 398 e-109 UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis... 398 e-109 UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 397 e-109 UniRef50_A6DGL0 Arylsulfatase A n=3 Tax=Lentisphaera araneosa HT... 397 e-109 UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina D... 396 e-109 UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteri... 396 e-108 UniRef50_B1KFX9 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 396 e-108 UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bactero... 396 e-108 UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 396 e-108 UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 396 e-108 UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 395 e-108 UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis... 395 e-108 UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI... 395 e-108 UniRef50_B5JMW2 Sulfatase domain protein n=1 Tax=Verrucomicrobia... 395 e-108 UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 395 e-108 UniRef50_A3ZWK4 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 394 e-108 UniRef50_B4D433 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 394 e-108 UniRef50_Q7UYH3 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 394 e-108 UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 394 e-108 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 394 e-108 UniRef50_A6DUI7 Putative exported uslfatase n=1 Tax=Lentisphaera... 393 e-107 UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=... 393 e-107 UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 393 e-107 UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM 393 e-107 UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomy... 393 e-107 UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacte... 393 e-107 UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bac... 392 e-107 UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 392 e-107 UniRef50_A6CA27 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 391 e-107 UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera aran... 391 e-107 UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 391 e-107 UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 391 e-107 UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 391 e-107 UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 391 e-107 UniRef50_A6DNI9 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 391 e-107 UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 390 e-107 UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas nept... 390 e-107 UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW... 390 e-106 UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LC... 389 e-106 UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7... 389 e-106 UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera... 389 e-106 UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhod... 389 e-106 UniRef50_A6DMW1 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 389 e-106 UniRef50_C1ZIS7 Arylsulfatase A family protein n=1 Tax=Planctomy... 388 e-106 UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomy... 388 e-106 UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT 388 e-106 UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 388 e-106 UniRef50_Q7UN55 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 388 e-106 UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 387 e-106 UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2... 387 e-106 UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY... 386 e-106 UniRef50_A6DHS2 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 386 e-106 UniRef50_A6DMX7 N-acetyl-galactosamine-6-sulfatase (GALNS) n=2 T... 386 e-105 UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 385 e-105 UniRef50_A6DJ37 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 385 e-105 UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W9... 385 e-105 UniRef50_D2R323 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 385 e-105 UniRef50_D2QXE9 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 385 e-105 UniRef50_A6DR20 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 385 e-105 UniRef50_Q7UXA8 N-acetylgalactosamine-6-sulfate sulfatase n=2 Ta... 385 e-105 UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_B... 384 e-105 UniRef50_B1KD86 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 384 e-105 UniRef50_A6CB33 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 384 e-105 UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3... 383 e-105 UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 383 e-105 UniRef50_C7PRW9 Sulfatase n=1 Tax=Chitinophaga pinensis DSM 2588... 383 e-105 UniRef50_A7VQW1 Putative uncharacterized protein n=1 Tax=Clostri... 383 e-104 UniRef50_A4GIB1 Arylsulfatase n=2 Tax=Bacteria RepID=A4GIB1_9BACT 383 e-104 UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR4... 383 e-104 UniRef50_A6DHY0 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 382 e-104 UniRef50_A9BNY8 Sulfatase n=11 Tax=cellular organisms RepID=A9BN... 382 e-104 UniRef50_Q7UYW2 Arylsulfatase (A or B) n=2 Tax=Planctomycetaceae... 382 e-104 UniRef50_Q2GB51 Sulfatase n=6 Tax=Proteobacteria RepID=Q2GB51_NOVAD 382 e-104 UniRef50_A6KZI6 Sulfatase n=6 Tax=Bacteroides RepID=A6KZI6_BACV8 382 e-104 UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 381 e-104 UniRef50_UPI0001745D5D N-acetylgalactosamine 6-sulfate sulfatase... 381 e-104 UniRef50_Q7UER7 Sulfatase 1 n=8 Tax=Bacteria RepID=Q7UER7_RHOBA 381 e-104 UniRef50_A6DMV0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 381 e-104 UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO 381 e-104 UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bactero... 380 e-104 UniRef50_D0PR10 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 380 e-104 UniRef50_UPI0001745666 N-acetylgalactosamine 6-sulfate sulfatase... 379 e-103 UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus E... 379 e-103 UniRef50_A6LIX6 N-acetylgalactosamine 6-sulfatase n=2 Tax=Bacter... 379 e-103 UniRef50_A3HZ22 Putative exported uslfatase n=1 Tax=Algoriphagus... 378 e-103 UniRef50_A6DFR6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Lentis... 378 e-103 UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI 378 e-103 UniRef50_A6DQ01 N-acetylgalactosamine-4-sulfatase n=2 Tax=Lentis... 378 e-103 UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivoran... 378 e-103 UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 378 e-103 UniRef50_C5VKQ0 N-acetylgalactosamine-6-sulfatase n=3 Tax=Prevot... 377 e-103 UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD 377 e-103 UniRef50_A6DNJ0 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 377 e-103 UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN 377 e-103 UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 377 e-103 UniRef50_C6VSQ8 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 180... 377 e-103 UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia caver... 376 e-103 UniRef50_B7FQ28 Arylsulfatase n=1 Tax=Phaeodactylum tricornutum ... 376 e-102 UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 375 e-102 UniRef50_D1QVA8 N-acetylgalactosamine-6-sulfatase n=1 Tax=Prevot... 375 e-102 UniRef50_A6DG78 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 375 e-102 UniRef50_A6DFS2 N-acetylgalactosamine-6-sulfatase n=1 Tax=Lentis... 375 e-102 UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=... 375 e-102 UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomy... 375 e-102 UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flamme... 375 e-102 UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 374 e-102 UniRef50_D2R207 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 374 e-102 Sequences not found previously or not previously below threshold: UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdema... 414 e-114 UniRef50_A6BYR0 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 398 e-109 UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT 381 e-104 >UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammaproteobacteria RepID=YDEN_ECOLI Length = 560 Score = 785 bits (2028), Expect = 0.0, Method: Composition-based stats. Identities = 560/560 (100%), Positives = 560/560 (100%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI Sbjct: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV Sbjct: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA 180 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA Sbjct: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA 180 Query: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS Sbjct: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 Query: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ Sbjct: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 Query: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ Sbjct: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS Sbjct: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ Sbjct: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 Query: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP Sbjct: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 Query: 541 LSEVNQEKFNNIKKALSEAK 560 LSEVNQEKFNNIKKALSEAK Sbjct: 541 LSEVNQEKFNNIKKALSEAK 560 >UniRef50_D2YC71 Sulfatase n=2 Tax=Vibrio mimicus RepID=D2YC71_VIBMI Length = 577 Score = 616 bits (1590), Expect = e-175, Method: Composition-based stats. Identities = 353/552 (63%), Positives = 442/552 (80%), Gaps = 5/552 (0%) Query: 4 ALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVL 63 K+++++TSISLIL S + A + LKATKTNVAFSD +EYSTKGKPNII+L Sbjct: 23 KFKRNLLTTSISLILVSHLLPSFASTQNSDNLKATKTNVAFSDIEISEYSTKGKPNIIIL 82 Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 T+DD+GYGQ+ FD+ +F+ ++M++++VVDTYKI ID+AI AA+ STPT+ L+D GV+ Sbjct: 83 TVDDMGYGQMNFDQNTFNEESMKDQKVVDTYKIPIDEAINAAKNSTPTINKLIDTGVKIN 142 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGK 183 NGYVAHGVSGPSRAAI+TG+APA+FGVYSN DA+ GIP+ E FLPE+FQNHGYYTAAVGK Sbjct: 143 NGYVAHGVSGPSRAAIITGKAPAKFGVYSNIDAEQGIPVEEKFLPEIFQNHGYYTAAVGK 202 Query: 184 WHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK 243 WHLSKISNV V E KQTRDYHDNF T+S E+WQPQNRGF+YFMGFH G AYYNSP+LF+ Sbjct: 203 WHLSKISNVAVDEAKQTRDYHDNFITYSGEQWQPQNRGFNYFMGFHTHGVAYYNSPALFR 262 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 NRE +PAKGY+ DQ T+EAIGVV++AK+ D PF+LYLAYNAPHLPND PAP QYQ++F T Sbjct: 263 NRENIPAKGYVIDQFTNEAIGVVNKAKSNDAPFLLYLAYNAPHLPNDAPAPKQYQQRFKT 322 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY 363 GSQTADN+YAS+Y+VDQGVKR+L QLK N QYDNT+I+FTSDNGAVIDGPLPLNG QKG+ Sbjct: 323 GSQTADNFYASIYAVDQGVKRLLAQLKANDQYDNTLIMFTSDNGAVIDGPLPLNGEQKGF 382 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGN--YDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 KSQ GG HTPMF+WW G+ ++KL S+MDF+PTALDAA I IP+ LDGVSL Sbjct: 383 KSQVLSGGLHTPMFVWWNGRFHKTTKEFNKLTSSMDFFPTALDAAGIKIPEG--LDGVSL 440 Query: 422 LPWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 LP+L +K PHK+L WI Y+H FDE+NIPFW+NYHK+VR +SDDYP NP TE S Sbjct: 441 LPYLNGEKTNSSPHKSLVWIAPYAHHFDEKNIPFWNNYHKYVRSESDDYPINPYTEQFSD 500 Query: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 FS+ VR + +SL+Y E+ ++GLYKL D++ ++ ++ P VV M+ + EF + S+ P Sbjct: 501 FSWAVRTDRFSLIYNPEDKKIGLYKLEDVRHENEISEQYPNVVSAMKNDLAEFANKSKMP 560 Query: 541 LSEVNQEKFNNI 552 +S+ N +KFN + Sbjct: 561 ISKDNYDKFNKV 572 >UniRef50_D1P6M6 Putative sulfatase YdeN n=2 Tax=Providencia RepID=D1P6M6_9ENTR Length = 549 Score = 535 bits (1378), Expect = e-150, Method: Composition-based stats. Identities = 208/528 (39%), Positives = 310/528 (58%), Gaps = 7/528 (1%) Query: 27 AHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTME 86 + T+ T KPN++++ MDDLG GQL F S D + Sbjct: 18 PSVKKSLLAGLIATSCLVPPIAANAGGTPEKPNVLLIVMDDLGTGQLDFVLDSLDVNELS 77 Query: 87 NREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPA 146 R Y I+K +EAA+ + P + + G++ TN +VAH V GPSRA I TGR+PA Sbjct: 78 KRPAPSRYDGDINKMVEAARIAMPNVSEMAAGGIKMTNAFVAHPVCGPSRAGIFTGRSPA 137 Query: 147 RFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVP-VPEDKQTRDYHD 205 FG YSN DA GIP LP LFQ GY TA++GKWH +K+ P + EDKQTRDYHD Sbjct: 138 SFGTYSNDDAMLGIPEDIKLLPALFQEDGYATASIGKWHNAKVIKKPKIAEDKQTRDYHD 197 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 N + + P RGFDY ++A+G A +NSP++++N E VPA GYI+ LTDE I Sbjct: 198 NMISTPEPGFAPHERGFDYAYSYYASGAALWNSPAIWRNGENVPAPGYITHLLTDETIKF 257 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRI 325 +D K D+PF + L+Y+ PH+P + +P +Y +FNTG+ AD Y+A++ + D+G+ +I Sbjct: 258 IDGHK--DKPFFINLSYSVPHIPLEEASPAKYMDKFNTGNVEADKYFAALNAADEGIGKI 315 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 + LK+NG+ +NT+I F SDNGAV + P+P+N KG+K Q + GG P +W G + Sbjct: 316 ITTLKENGELENTLIFFISDNGAVHESPMPMNAMDKGFKGQMFNGGVSVPFVAYWPGHIP 375 Query: 386 PG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 G D ++SA+D PTAL +A I+IP LK++G +++P LQ K Q PH+ L W + Sbjct: 376 AGKQSDAMVSAIDILPTALQSAGITIPDSLKVEGKNIMPLLQGKTQKSPHQYLYWTGPGT 435 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-YTVENNQLGL 503 + EEN FW YH+++ +Q + P NPN E LS+ S+ VR+ +++L Y NQ L Sbjct: 436 KHYSEENQDFWHGYHEWITYQRKEAPKNPNLEKLSKGSWAVRDGEWALYFYDDGTNQPKL 495 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 + D + +LA+ P+ VK+++ +++ P+ Q+++ Sbjct: 496 FNDKQDPSESIDLASKYPEKVKQLKSAYYQWVKDQPKPV-VWGQDRYQ 542 >UniRef50_C5BEH4 Sulfatase, putative n=37 Tax=Gammaproteobacteria RepID=C5BEH4_EDWI9 Length = 539 Score = 534 bits (1376), Expect = e-150, Method: Composition-based stats. Identities = 206/507 (40%), Positives = 319/507 (62%), Gaps = 7/507 (1%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 T +PN++++ MDDLG GQL F + D K + R V + Y+ +DK I+AA+++ P + Sbjct: 34 TDSRPNVLLVIMDDLGTGQLDFALDALDTKALGKRPVAERYQGDLDKMIDAARRAMPNVA 93 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 L ++GV+ TN +VAH V GPSRA I TGR PA FG YSN DA G+PL T LP LFQ Sbjct: 94 QLANQGVKMTNAFVAHPVCGPSRAGIFTGRYPASFGTYSNDDAMLGVPLDITLLPALFQE 153 Query: 174 HGYYTAAVGKWHLSKIS-NVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 +GY TA +GKWH ++I V + QTRDYHDN + S + P++RGFDY ++A+G Sbjct: 154 NGYATANIGKWHNARIDKKNFVDKADQTRDYHDNMISVSEPGYGPESRGFDYSYSYYASG 213 Query: 233 TAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP 292 A +NSP++++N + V A GY++ LT+E + +D + +PF + LAY+ PH+P + Sbjct: 214 AALWNSPAIWQNGKNVAAPGYLTHNLTNETLKFLDDHQ--GKPFFISLAYSVPHIPLEQA 271 Query: 293 APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 +P +Y +F+TG+ AD Y+A+V + D+G+ +I+E+LK G+ DNT+I F SDNGAV + Sbjct: 272 SPARYMDKFHTGNAEADKYFAAVNAADEGIGQIIERLKALGELDNTLIFFISDNGAVHES 331 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIP 411 P+PLNG +G+K Q + GG H P +W + G ++SA+D PTAL AA I+IP Sbjct: 332 PMPLNGMDRGFKGQMFNGGVHVPFVAYWPKHIPAGTQSNVMVSAIDILPTALKAAGITIP 391 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 +K+DG +LP L K Q PH+ L W + + EEN PFW +Y K++ +++ P Sbjct: 392 DAMKVDGRDILPQLSGKAQTSPHRYLFWAGPGAKHYSEENQPFWFDYWKWITYEAPMPPK 451 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVEN-NQLGLYKLT-DLQQKDNLAAANPQVVKEMQGV 529 NPN E LS S+ VR+ +++L + + N++ L+ D + +LAA PQ V EM+ Sbjct: 452 NPNLEKLSPSSWAVRDGEWTLYFYDDGSNRVQLFNDRLDPAESQDLAAKYPQRVAEMKAA 511 Query: 530 VREFIDSSQPPLSEVNQEKFNNIKKAL 556 ++I + P++ Q++++ ++++ Sbjct: 512 YHDWIKTKPKPVA-WGQDRYHILEQSA 537 >UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT Length = 477 Score = 469 bits (1207), Expect = e-130, Method: Composition-based stats. Identities = 128/537 (23%), Positives = 211/537 (39%), Gaps = 111/537 (20%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 ++ + + + F + + KPNI+ + DDLGY + + Sbjct: 1 MRFLLSLLLMAVFCLSTKAA-NKPNIVFILADDLGYTDVACYGSKY-------------- 45 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY--- 151 TP + L +G++FT+G+ P+RA++M+G+ R GVY Sbjct: 46 ------------YETPNIDKLAKDGIKFTDGHTCGPNCQPTRASLMSGQYGPRTGVYTVG 93 Query: 152 ------------SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 + +PL + L + + GY T GKWHL + Sbjct: 94 SIDRFAWQTRSLHPVENVTKLPLDKITLAQSLKKAGYATGMFGKWHLGED---------- 143 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLT 259 +E P RGFD + + +P + P Y++D LT Sbjct: 144 -------------KEHHPAQRGFDEALVSMGVHFDFVTNP-----KVDYPKDEYLADFLT 185 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYS 317 D+A+ + R K D+PF LYL + A H P ++ + Y A + S Sbjct: 186 DKALDFIKRHK--DEPFFLYLPHYAVHKPLQAKKELIQKFSAKQGVDGHHNPTYAAMIAS 243 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID---------GPLPLNGAQKGYKSQTY 368 VD+ V R++ L + DNT+++F+SDNG V G + N +G K Y Sbjct: 244 VDESVGRVVALLDELKLSDNTLVIFSSDNGGVGGYQREGIKKAGDVTDNNPLRGGKGMLY 303 Query: 369 PGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ- 426 GG P W GK+ G D+ I ++D YPT L+ A P+ LDG S L L+ Sbjct: 304 EGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYPTLLELAGAKAPEKYPLDGTSYLKVLKS 363 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + + W ++ ++ +D + P VR Sbjct: 364 GGMKKLNRDAIYW-----------------HFPGYLGAGADTWRTLPVG--------VVR 398 Query: 487 NNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 D+ L+ E+++L LY L DL + +NLAA P+ +E++ + + Q P+ Sbjct: 399 CGDWKLMEFFEDHRLELYNLREDLGETNNLAAKMPEKAQELEKKLVAWQKEVQAPMP 455 >UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKY2_PLALI Length = 483 Score = 468 bits (1204), Expect = e-130, Method: Composition-based stats. Identities = 141/510 (27%), Positives = 212/510 (41%), Gaps = 110/510 (21%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T +PNI+++ DD+GY + F T Sbjct: 25 TPVIAADRPNILLIVGDDMGYADVGFHG--------------------------CKDIPT 58 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQDGIPLTETFLP 168 P L +L GV+FT+GYV P+RA ++TGR RFG ++ + A G+PLTE + Sbjct: 59 PNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNPSGANTGLPLTEVTIA 118 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 + + GY T VGKWHL S PQ RGF+ F+GF Sbjct: 119 DRLKQVGYTTGLVGKWHLG-----------------------SQPAMHPQERGFEEFIGF 155 Query: 229 HAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 +++++ + + E V Y +D EA+ +++ + D+P+ LYL++NA H P Sbjct: 156 LGGAHSFFDAQGILRGHEPVKTIDYTTDLFGREAVSFIEKHR--DKPWFLYLSFNAVHTP 213 Query: 289 NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 D+ K + Q Y A + ++D+ + ++L QL+ GQ T+++F SDNG Sbjct: 214 MHA-TEDRMAKLASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQKQKTLVMFISDNGG 272 Query: 349 VIDGPLPLNG----AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALD 404 + +NG +G K T GG P + W GK+ P +D + +D TAL Sbjct: 273 PTMPGVTINGSINTPLRGSKRTTLEGGIRVPFVVSWPGKIAPAVFDSPVIQLDLTATALA 332 Query: 405 AADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRH 464 A + KD+K DGV+LLP+LQ K+ PH L W Sbjct: 333 VAGVE--KDVKSDGVNLLPYLQGKQSEVPHAALFWRFGE--------------------- 369 Query: 465 QSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN------------NQLGLYKLT-DLQQ 511 VR DY LV N LY L DL + Sbjct: 370 -----------------QMAVRAGDYKLVRYDSNADTLTGKGKQPVTAARLYDLKEDLGE 412 Query: 512 KDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 +LAA+ P+ V E+Q + + PPL Sbjct: 413 TRDLAASMPEKVAELQAQWDRWNQQNMPPL 442 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 465 bits (1198), Expect = e-129, Method: Composition-based stats. Identities = 150/532 (28%), Positives = 235/532 (44%), Gaps = 110/532 (20%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + K +PNI+++ DDLGY + F+ + T Sbjct: 37 SVVGAKDRPNIVLIVADDLGYSDVGFNG--------------------------CKEIPT 70 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--------GIP 161 P L L GV FTNGY +H PSRA ++TGR RFG SN + G+P Sbjct: 71 PHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMP 130 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 L+ET L + + GY T A+GKWHL A+ + P RG Sbjct: 131 LSETTLADALKEAGYVTGAIGKWHLGD-----------------------AKPFWPNRRG 167 Query: 222 FDYFMGFHAAGTAYYNSPSL-------FKNRERVPAK--GYISDQLTDEAIGVVDRAKTL 272 FD + GF G +Y+ + + E V K +++D + EA+ + R +T Sbjct: 168 FDEWFGFSGGGFSYWGDLGMKDPLLGVHRGDEPVDPKTLTHLTDDFSTEAVKFIQRHET- 226 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 +PF LYLAYNAPH P D+ QK + Y A V +D+G+ R+++Q++++ Sbjct: 227 -EPFFLYLAYNAPHAP-DHATRAHLQKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRES 284 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDK 391 G +NT+I+F SDNG + +N +G+K + GG P + W G ++ G + Sbjct: 285 GLGENTMIIFYSDNGGRRE--HAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEES 342 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 I+A+D +PTAL AA + ++ KLDG +LLP L D KQ P + L W S Sbjct: 343 PITALDLFPTALAAAGMDPSQNDKLDGQNLLPVLTDDKQRLPERPLFWRYS--------- 393 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 + Y VR+ ++ L+ + ++ L+ L D Sbjct: 394 ------------------------MGDDSYGYAVRDGNWKLIDSRYKDRKLLFDLANDPW 429 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPP----LSEVNQEKFNNIKKALSE 558 ++++LAA +P+ V + ++ + + PP VN K N + E Sbjct: 430 EREDLAAQHPEQVARLSRMMEAWDARNVPPKWSDAHGVNVRKEENTRNEAVE 481 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 463 bits (1193), Expect = e-129, Method: Composition-based stats. Identities = 146/586 (24%), Positives = 240/586 (40%), Gaps = 149/586 (25%) Query: 1 MKSALKKSVVSTSISL-ILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPN 59 MK+A+++ V ++ +L + + A HAAD +PN Sbjct: 6 MKTAVERIVFGGNLVWALLLTSLCATRVHAAD-------------------------RPN 40 Query: 60 IIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEG 119 + + DDLG+ + F+ +F TP L L EG Sbjct: 41 FVFILADDLGWKDVGFNGSTF--------------------------YETPNLDRLAREG 74 Query: 120 VRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ--------------DGIPLTET 165 +RFT+ Y A V P+RA+IMTG+ PAR + + +P E Sbjct: 75 MRFTDAYAACSVCSPTRASIMTGKYPARLHLTDWLPGRPDKPDQILKHPKIITELPAAEI 134 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 L + Q GY TA +GKWHL + + P+ GFD Sbjct: 135 TLAKALQEGGYKTAFIGKWHLGGLGH------------------------WPEQAGFDIN 170 Query: 226 MGFHAAGT-AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 +G G + Y SP + P Y++D+LTDEA+ ++ K PF+LYL++ + Sbjct: 171 IGGCGMGHPSSYFSPYKNPTLKDGPVGEYLADRLTDEAVKFIENTK--GTPFLLYLSHYS 228 Query: 285 PHLPNDNPA--PDQYQKQFNTGSQTAD------------------NYYASVYSVDQGVKR 324 H P ++YQK+ T Y A + S+D+ V R Sbjct: 229 VHTPLQAKKGLIEKYQKKVMQLPPTKGPEFVTEGNTNARQVQNQPIYAAMMQSLDESVGR 288 Query: 325 ILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 +L++LK+ G NT+I+FTSDNG +G N + K Y GG P+ + W G Sbjct: 289 VLDKLKELGLDKNTVIIFTSDNGGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPG 348 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 + + D + + D+YPT L+ A + + LDG+S P L+ K+ GE + L W Sbjct: 349 VTKAASVSDHQVMSTDYYPTLLEIAGLPARPEQHLDGISFTPALRGKEMGE--RPLFW-- 404 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 Y H+ ++ P S ++R D+ L+ E N++ Sbjct: 405 HYPHYSNQGGAP----------------------------SSSIRKGDWKLIEWYEENRI 436 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 L+ L D+ +K++LA+ + +E++ ++ + S + + N Sbjct: 437 ELFNLRLDVGEKNDLASTSALKREELKSELQAWRASVKADMPLPNP 482 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 460 bits (1183), Expect = e-128, Method: Composition-based stats. Identities = 144/554 (25%), Positives = 226/554 (40%), Gaps = 124/554 (22%) Query: 23 AAFAAHAADDVKLK--ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSF 80 + +H A + L A + + + S G+PNI+V+ DDLGY L G Sbjct: 1 MSLGSHPAIALWLALVAFCSQALLAAEDVNQTSKSGRPNILVIMADDLGYADLGVQGGC- 59 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 + TP L L G+R TN YV+ PSRA + Sbjct: 60 -------------------------EIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFL 94 Query: 141 TGRAPARFGVYSNT----DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE 196 TG+ RFG N +A+ G+PL E + L Q GY TA +GKWH Sbjct: 95 TGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQG--------- 145 Query: 197 DKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-------------YNSPSLFK 243 +++ PQ+RGFD F GF G Y ++ +++ Sbjct: 146 --------------FSKDHHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYR 191 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 RE P +GY +D T+EA+ + + ++P+ LYL+YNA H P + Q + + Sbjct: 192 GREVEPQEGYATDLFTNEALRWM--SGPPNKPWFLYLSYNAVHTPLEIAPHLQKRIPESV 249 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-----PLNG 358 Y + + +D + RI + L ++G + T+I+F SDNG P+ LN Sbjct: 250 KLPARRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNH 309 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAA----DISIPKD 413 +G K QT GG P F+ W G+L ++ I ++D PT A P Sbjct: 310 PLRGDKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLAANNPAKPQPLP 369 Query: 414 LKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNP 473 +DGV+L+P+ ++ G PH++L W Sbjct: 370 QGIDGVNLMPYWLGQRSGAPHESLFWRFG------------------------------- 398 Query: 474 NTEDLSQFSYTVRNNDYSLVYTVE-----NNQLGLYKL-TDLQQKDNLAAANPQVVKEMQ 527 VR ++ LV + N+ LY L TD+ +K+NLA +P++V ++ Sbjct: 399 -------PQKAVRAGNWKLVDWRDFPASKNSGWELYDLSTDISEKNNLAETHPEIVARLK 451 Query: 528 GVVREFIDSSQPPL 541 ++ S+ PL Sbjct: 452 TSWEKWNQSNIEPL 465 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 131/491 (26%), Positives = 205/491 (41%), Gaps = 92/491 (18%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 +PN++++ DD G+G + + TP Sbjct: 25 LQAAERPNVLLIMTDDQGWGDVRSH--------------------------DNPLIETPQ 58 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELF 171 L +G RF YV V P+R++++TGR R GV+ T + + ET + E+F Sbjct: 59 QDLLASQGARFERFYV-SPVCAPTRSSLLTGRYSLRTGVHGVTRGFENMRAEETTIAEMF 117 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T A GKWH + P +GFD F GF Sbjct: 118 KAAGYKTGAFGKWHNGRHY-----------------------PMHPNGQGFDEFFGFCGG 154 Query: 232 GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN 291 Y +L N++ V +GYI+D LTD AI + + K DQPF Y+ YNAPH P Sbjct: 155 HWNRYFDTNLEHNKQPVKTEGYITDVLTDRAIDFIKQNK--DQPFFCYVPYNAPHSPWI- 211 Query: 292 PAPDQYQKQF--NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 P++Y ++ A YA V VD + R+++ L DNTI+LF +DNG Sbjct: 212 -VPEKYWDKYANKGLDDKARCAYAMVECVDDNLGRLMQTLDDLKLSDNTIVLFLTDNGPN 270 Query: 350 IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI-SAMDFYPTALDAADI 408 + NG +G K + GG P+F+ + GK++ G K I + +D PT L+ + Sbjct: 271 SN---RYNGNMRGRKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPTLLELCSV 327 Query: 409 SIPKDLKLDGVSLLPWLQDKK-QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 D LDG SL+P L +K + P + L + + ++ +P Sbjct: 328 ENTADQPLDGKSLVPLLTNKSNKDWPQRMLFSDRLFRNSIPDDELP-------------- 373 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 + +VR + + Y E + LY + D QK N+ A+P V+K++ Sbjct: 374 --------------NGSVRTDRWRAAY--ERGKWSLYDMQADPSQKQNVIEAHPAVIKDL 417 Query: 527 QGVVREFIDSS 537 R++ Sbjct: 418 SAAYRDWFKDV 428 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 457 bits (1175), Expect = e-127, Method: Composition-based stats. Identities = 145/528 (27%), Positives = 213/528 (40%), Gaps = 106/528 (20%) Query: 25 FAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKT 84 F++ + L + + T KPN+I+ DDLG+G+ Sbjct: 9 FSSICLVGICLAGISSICDLAQGAEPT-QTSRKPNVIIFYADDLGWGETGIQG------- 60 Query: 85 MENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRA 144 Q TP + S+ GVR T G+VA PSRA ++TGR Sbjct: 61 -------------------NPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRY 101 Query: 145 PARFGVYSNTDAQ-DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 P RFG N A G+ L ET L + GY TA VGKWHL Sbjct: 102 PTRFGHEFNRIANVSGLDLQETTLADRLHGLGYKTACVGKWHLGD--------------- 146 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNR-------ERVPAKGYISD 256 E++P RGFD F G A + P+ F + E Y +D Sbjct: 147 --------GPEYRPTKRGFDEFFGTLA--NTPFFHPTKFVDSRVSNDVAEVSDENFYTTD 196 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN-YYASV 315 + ++ + + + P+ LYL +NA H P AP +Y +F + + + A + Sbjct: 197 EYAKRSVEWIGQQQQS--PWFLYLPFNAQHAPLQ--APQKYLDRFESIADPKRKLFAAMM 252 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 ++D + ++L ++++ GQ +NT++ F SDNG G NG +G+K T+ GGT P Sbjct: 253 SAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQNGPLRGFKMTTFEGGTRVP 312 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + WKGKL G YD + +D PT L AA I KLDGV L+P+ +PH Sbjct: 313 FLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAGSKIDPAWKLDGVDLVPYFTSSIANKPH 372 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + L W + VR D+ LV Sbjct: 373 ETLYWRFGE--------------------------------------QWAVRQGDWKLVV 394 Query: 495 TVEN-NQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 Q LY L +D+ + NLA+ NP VKE+Q + ++ P Sbjct: 395 ARGGSGQPELYDLASDIAESKNLASENPAKVKELQALWDQWSHEQAAP 442 >UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QWC8_9PLAN Length = 468 Score = 456 bits (1174), Expect = e-126, Method: Composition-based stats. Identities = 143/519 (27%), Positives = 216/519 (41%), Gaps = 107/519 (20%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L+A + T + + +PNI+V+ DD+GY L Sbjct: 7 LRALVALGLLTAATTSMAADASRPNIVVIVGDDMGYHDLGVHG----------------- 49 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP L +L GVR T+GYV+ P+RA ++TGR RFG N Sbjct: 50 ---------CKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 100 Query: 155 D----AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 + G+PL+ET L + + GY T VGKWHL Sbjct: 101 GPTPTGEIGLPLSETTLADRLKKVGYKTGMVGKWHLGNDEKR------------------ 142 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP-------SLFKNRERVPAKGYISDQLTDEAI 263 P +RGFD F GF Y+ +P L + RE V K Y++D EA+ Sbjct: 143 -----HPLSRGFDEFFGFLGGARTYFATPGNASAGTKLLRGREVVDEKEYLTDAFAREAV 197 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD-NYYASVYSVDQGV 322 +DR+K PF LYL +NA H P + A +Y +F S Y A + ++D V Sbjct: 198 AYIDRSKAS--PFFLYLTFNAVHTPME--ASQKYLDRFTAVSDPKRQKYCAMMSAMDDAV 253 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 +++ +L++ +NT+I F SDNG N +G+K+ T+ GG P F+ WKG Sbjct: 254 GQVVAKLEREKLLENTLIFFVSDNGGPTAANTGDNTPLRGFKATTWEGGIRVPYFVSWKG 313 Query: 383 KLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 K+ G YD+ + +DF PT A P K DGV+LLP+L + + PH +L W Sbjct: 314 KIPAGKTYDQPVIQIDFVPT--ALAAAGAPAAEKTDGVNLLPYLTFENKEAPHASLFWRF 371 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 +R+ +Y LV T + ++ Sbjct: 372 G--------------------------------------PQTAIRHGNYKLVMTRDLDKP 393 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 LY L D+ + +L+A P++V ++ + + P Sbjct: 394 ALYDLAADISETKDLSADKPEIVAQLTAAYDAWNQENIP 432 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 454 bits (1168), Expect = e-126, Method: Composition-based stats. Identities = 144/541 (26%), Positives = 220/541 (40%), Gaps = 116/541 (21%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 V F S GKPN++++ +DDLG L +F Sbjct: 6 AVLALGFFALPASAAGKPNVVLIVIDDLGQRDLGCYGSTF-------------------- 45 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGI 160 TP + + +GVRFT+ Y A V P+RA+IMTG+ P R G+ + + Sbjct: 46 ------YKTPNIDRMAKDGVRFTDFYAACPVCSPTRASIMTGKYPQRVGITDWLPGRKDL 99 Query: 161 P--------------LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 P L E + E + HGY TA +GKWHL Sbjct: 100 PGQRLKRPELKNELALEEVTVAETLKGHGYVTAHIGKWHLG------------------- 140 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTA-YYNSP------SLFKNRERVPAKGYISDQLT 259 + ++P+ +GFD + GT Y +P + E+ Y++D+L Sbjct: 141 -----GKGFEPEKQGFDVNVAGDHTGTPLSYFAPFANKAGATMPGLEKAAPDEYLTDRLA 195 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYS 317 EA + K D+PF LYL + H P P P D+Y+ Q G Q+ Y A V S Sbjct: 196 AEAETFITANK--DKPFFLYLPHYGVHTPLRAPQPLVDKYKTQAVHGRQSNPVYAAMVES 253 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID-----GPLPLNGAQKGYKSQTYPGGT 372 +D V R+L++L DNT++LFTSDNG + +N + K Y GG Sbjct: 254 MDAAVGRVLKRLDDLKLSDNTLVLFTSDNGGLATLEGMPFAPTINAPLREGKGYLYEGGV 313 Query: 373 HTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P+ W GK++PG D++ ++DF+ T L+A + + DGVSL+P +K Sbjct: 314 RVPLIAKWPGKVKPGTVMDQVACSIDFFDTILEATGAT--SAARRDGVSLVPAFGGEKLK 371 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + L W Y H+ ++ S+ VR +Y Sbjct: 372 P--RALYW--HYPHYANQG----------------------------SRPGGAVRAGNYK 399 Query: 492 LVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 LV E+ + L+ + DL + NLAA P VVK++ + + + N + Sbjct: 400 LVEYYEDGRRELFDVAKDLSESRNLAADKPDVVKDLAAKLDAWRTDVGAKMPTPNPDYRP 459 Query: 551 N 551 N Sbjct: 460 N 460 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 449 bits (1156), Expect = e-124, Method: Composition-based stats. Identities = 138/571 (24%), Positives = 214/571 (37%), Gaps = 130/571 (22%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M A + A + P+ + + +PNI+ +DDLG L +F Sbjct: 1 MKPSHLSAIRLSLIYAVVSTFLCCATLPSTIAAEQQPNIVFFLVDDLGQRDLGCYGSTF- 59 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 TP + L +G RFT Y A V P+RA+I+T Sbjct: 60 -------------------------YETPNIDKLAADGARFTQAYAACPVCSPTRASILT 94 Query: 142 GRAPARFGV-----YSNTDA---------------QDGIPLTETFLPELFQNHGYYTAAV 181 G P R G+ N++ +D + L L + ++ GY T Sbjct: 95 GLWPQRTGITDYIATDNSNGPAKWNRNTMTLPAAYRDRLALDSPTLAKSLKSAGYATFFA 154 Query: 182 GKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY----YN 237 GKWHL E + P+N+GFD G G Y Y Sbjct: 155 GKWHLG------------------------PEGFYPENQGFDINRGGIERGGPYGGKQYF 190 Query: 238 SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--D 295 SP PA ++ D+L E ++ + QPF Y ++ + H P Sbjct: 191 SPYGNPRLTDGPAGEHLPDRLATETCQFIEAHQ--KQPFFAYFSFYSVHTPLQAREDLRQ 248 Query: 296 QYQKQFNTGS----------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 +Y + Q Y A V ++DQ V ++L +L + G +NT+ Sbjct: 249 KYVAKREKLGLKPTWGREHMRDVRQVQEHAVYAAMVDAMDQAVGKVLAKLDELGLRENTL 308 Query: 340 ILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAM 396 ++FTSDNG +G N +G K Y GG P+ M W K++ G D +S+ Sbjct: 309 VIFTSDNGGLSTSEGWPTSNLPLRGGKGWMYEGGIREPLVMRWPAKVKAGSTIDTPVSSP 368 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 DF T L A + ++DGVSLLP L +K E ++L W Y H+ ++ P Sbjct: 369 DFMATLLAATATKPAEQQQIDGVSLLPLLAGEKLKE--RSLFW--HYPHYGNQGGAP--- 421 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNL 515 + +R + L+ +E+ Q+ L+ L TD + NL Sbjct: 422 -------------------------AAAIRRGSWKLIEWLEDGQVELFNLATDESETTNL 456 Query: 516 AAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 A+ P +V+EM + + L E N Sbjct: 457 ASKEPALVREMLAELHAWQKEVGAILPEKNP 487 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 449 bits (1155), Expect = e-124, Method: Composition-based stats. Identities = 122/535 (22%), Positives = 211/535 (39%), Gaps = 114/535 (21%) Query: 33 VKLKATKTNVAFSDF--TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 ++++ VA S F P + +PN++++ DD G + Sbjct: 1 MRIRRLSAMVALSCFMAAPLFAQQQKRPNVLIIYTDDQGTLDVNCYG------------- 47 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 A TP + L EGV F+ Y A V PSRA+++TGR P R + Sbjct: 48 -------------AKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGRYPQRAQL 94 Query: 151 YSNT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 +N + G+P ++ + E+F++ GY TA +GKWH+ Sbjct: 95 DNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG-------------------- 134 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---------PSLFKNRERVPAKG-YISDQ 257 + E P +GFDY GF Y+ L++N + + G + +D Sbjct: 135 ---YSPETMPNQQGFDYSFGFMGGCIDNYSHYFYWAGPNRHDLWRNGQEIWEDGKFFADL 191 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 E G +++ K D+PF LY A N PH P +++++ + Y A+V + Sbjct: 192 TVQEVNGFLEKNKRADKPFFLYWAINMPHYPLQGQ--EKWRQYYKDLPAPRRMYAAAVST 249 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID----GPLPLNGAQKGYKSQTYPGGTH 373 +D+ + ++L+QL + G +NTI++F SD G + G G +G K + GG Sbjct: 250 MDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIR 309 Query: 374 TPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P + W G L D+L +D+YPT +++P+ K+DG + + K Sbjct: 310 VPAIIRWTGHLPKNEVRDQLCVNIDWYPTLAGLCKVALPQ-RKIDGKDIQQVITSSKTSS 368 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 PH W + + VR ++ L Sbjct: 369 PHDIFFWQSQ---------------------------------GTKENPQWAVRQGNWKL 395 Query: 493 VYTV------ENNQLGLY--KL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 ++ E L+ L D + NLAA +P++V ++ ++I+ Sbjct: 396 LHNPSSAKKAETGPDDLFLVNLQQDTSEAKNLAAQHPEIVSSLKEQYLKWINEVV 450 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 139/530 (26%), Positives = 215/530 (40%), Gaps = 116/530 (21%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 T + F + +PN+IV+ DDLGY + F+ + Sbjct: 3 ATYIIFILISLNAICAS-RPNLIVIMADDLGYNDVGFNGCT------------------- 42 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----- 153 + TP + S+ GV+FTNGY ++ V GPSRA +TGR RFG N Sbjct: 43 -------EIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNPQWNL 95 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 TD +P +E + E GY+ +GKWHL ++ Sbjct: 96 TDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGAEPSL--------------------- 134 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLF------------------KNRERVPAKGYIS 255 +P RGFD F G G + + +N V Y++ Sbjct: 135 --RPNKRGFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLT 192 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT-ADNYYAS 314 ++ +DEA+ + R +PF L+L+YNAPHLP A ++Y +F Y A Sbjct: 193 EEFSDEAVSFIKRNHQ--KPFFLFLSYNAPHLPLQ--ATEKYLARFPHIKDPKRKTYAAM 248 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 V +VD GV ++++ LK+ DNTI+ F SDNG N KG KS + GG Sbjct: 249 VSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPSHKNKSDNFPLKGQKSDVWEGGFRV 308 Query: 375 PMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P M + +Q D +S++D + T A D LDGV+L+P++ +K P Sbjct: 309 PFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHADKPLDGVNLIPFITGEKTQAP 368 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 H + Q Y VR D+ LV Sbjct: 369 HAQIF------------------------------------IRKFDQSRYVVRQGDFKLV 392 Query: 494 YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 ++ LY L+ D+ +++N+AA +P+ VKE++ V +++ P+ Sbjct: 393 IPYKDAPPQLYNLSKDIGEENNIAAVHPERVKELEKVRKQWDSELMDPIF 442 >UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 Length = 495 Score = 446 bits (1147), Expect = e-123, Method: Composition-based stats. Identities = 127/519 (24%), Positives = 206/519 (39%), Gaps = 116/519 (22%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 + P+I+ + DD G+ + F + Sbjct: 39 AVAQQSSQPPHILFIMSDDQGWKDVGFH---------------------------GSDIR 71 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTET 165 TP L L G R Y + PSRAA++TGR P R+G+ + + + G+P E Sbjct: 72 TPNLDQLAKTGARLEQYYS-QPMCTPSRAALLTGRYPHRYGLQTLVIPSAGKYGLPTDEY 130 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LP+ + GY TA VGKWHL ++ P+ RGFDY Sbjct: 131 LLPQALKEAGYETAIVGKWHLGHAD----------------------PKYWPRQRGFDYQ 168 Query: 226 MGFHAAGTAYYNSP-----SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G Y+ ++N + + +GY++ L +A+ ++++ P LYL Sbjct: 169 YGPLLGEIDYFTHSAHGKVDWYRNNQLIKEEGYVTTLLGQDAVKLIEKH-NPKTPLFLYL 227 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 A+ APH P AP +Y Q+ T Y A + ++D + +++ L+K G +NT+ Sbjct: 228 AFTAPHAPYQ--APQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTL 285 Query: 340 ILFTSDNGAVIDGPLP------------LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 I+F SDNG NG + K+ Y GGT W GK+QPG Sbjct: 286 IVFQSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPG 345 Query: 388 NY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 + I +D YPT A +S+ K+ LDG+++ P L + K Sbjct: 346 TVVNHPIHIVDMYPTLTGLASVSVGKNKPLDGLNIWPALSEAKPS--------------- 390 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE-NNQLGLYK 505 P + D+ F + D+ LV+ ++L L+ Sbjct: 391 -----------------------PRSQVVYDIEPFRAALSQEDWKLVWKATLPSRLELFN 427 Query: 506 LT-DLQQKDNLAAANPQVVKEMQGVVREF-IDSSQPPLS 542 L+ D+ ++ NLA NP++V ++ + D+ PPL Sbjct: 428 LSQDVSEQTNLAEQNPEIVSRLKQQIEVLSRDAVLPPLF 466 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 445 bits (1144), Expect = e-123, Method: Composition-based stats. Identities = 142/562 (25%), Positives = 216/562 (38%), Gaps = 133/562 (23%) Query: 37 ATKTNVAFSDFTPTEYSTKGKP------NIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 +AFS S KP N + + +DDLGY + + Sbjct: 8 TLMLMLAFSVLADRSLSAAEKPKQNKPLNFVFILVDDLGYMDVGCNN------------- 54 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-- 148 TP + L G+RFTNGY A+ V P+R +IMTG+ P R Sbjct: 55 ------------PQTFYETPHINQLAKTGMRFTNGYAANPVCSPTRYSIMTGKYPTRVDA 102 Query: 149 ---------GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 G + D +PL+ET + E + HGY T GKWHL Sbjct: 103 TNFFSGKRAGKFLPAPLNDKMPLSETTIAEALKEHGYSTFFAGKWHLGPT---------- 152 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY----YNSPSLFKNRERVPAKGYIS 255 +E+ P+ +GFD G G Y Y SP ++ Sbjct: 153 -------------QEFWPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRLTDGLKGEHLP 199 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGS-------- 305 D+L E +D + D+PF YLA+ + H P P P +Y+++ Sbjct: 200 DRLASETAQFIDAHR--DEPFFAYLAFYSVHTPLMGPGPLVTKYKEKAKRLGLTGKEEFA 257 Query: 306 -----------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 Q Y A V S+D+ V ++L+QL+++G +NT+++ T+DNG Sbjct: 258 DEEQVFPVDEKRRVRILQNHAVYAAMVESMDKAVGKVLQQLEESGVAENTVVMLTADNGG 317 Query: 349 --VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDA 405 +G N +G K Y GG + W G +PG+ D+ + DFYPT LD Sbjct: 318 LSTSEGSPTSNLPLRGGKGWLYEGGIREVFLIRWPGGTEPGSVCDEPVITTDFYPTILDL 377 Query: 406 ADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQ 465 A + + LDGVSL P+LQ + L W Y H+ ++ IP Sbjct: 378 AGLPLKPQQHLDGVSLKPFLQGEA-PFKRDALYW--HYPHYSNQGGIP------------ 422 Query: 466 SDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVK 524 +R D+ L+ E+ Q+ LY L DL +K +LA P+ V Sbjct: 423 ----------------GGAIRVGDWKLIERFEDGQVHLYHLKEDLGEKQDLAEKYPERVA 466 Query: 525 EMQGVVREFIDSSQPPLSEVNQ 546 M+ + ++ + + Sbjct: 467 AMRKQLHKWYQETDAKFLQAKP 488 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 445 bits (1144), Expect = e-123, Method: Composition-based stats. Identities = 146/550 (26%), Positives = 213/550 (38%), Gaps = 116/550 (21%) Query: 15 SLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLP 74 ++ + + A V + ++ +D TPT +PNIIV+ DD GY Sbjct: 1 MTLIDPFLMSLLRKAFTSVAALSLASSSVRADDTPT-----KRPNIIVIVSDDQGYADAS 55 Query: 75 FDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGP 134 F TP L +L GVR T GYV V P Sbjct: 56 FQGS--------------------------KDILTPNLDALAKSGVRCTRGYVTAPVCSP 89 Query: 135 SRAAIMTGRAPARFGVYSNTDAQD-----GIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 SRA +MTGR RFG ++N A+ +P ET LP++ GYYTA VGKWHL Sbjct: 90 SRAGLMTGRYQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGLQ 149 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY-NSPS-------- 240 + +P RGFD F G G Y+ N P Sbjct: 150 -----------------------DGCRPYERGFDEFFGIITGGHDYFVNHPEERAVGDQS 186 Query: 241 ----LFKNRERVPA-KGYISDQLTDEAIGVVDRAKTL--DQPFMLYLAYNAPHLPNDNPA 293 + +N A GY++D +A+ ++ + T DQP LYLA+NAPH P P Sbjct: 187 YKARIERNGPVGEAVPGYLTDAFGADAVRIIRESHTKRPDQPLFLYLAFNAPHTPTQAPK 246 Query: 294 PDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 S+ Y A + S+D V ++ LK+NG +T I+F SDNG + P Sbjct: 247 DLVDTMPATLESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFIVFFSDNGGA-NHP 305 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPK 412 N + +K Y GG P F + G + G+ + ++++D + TA A Sbjct: 306 YYDNTPLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSLDVFATACALAGTKPET 365 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHN 472 LD V +LP L+ + H L W Sbjct: 366 SHPLDSVDMLPVLEGNARQPTHATLFW--------------------------------- 392 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR 531 + F V + D LV + L+ L D+ +K +LAA NP+ V + ++ Sbjct: 393 ----EFPGFGAAVADRDLKLVVP-KKGSPQLFDLAVDIGEKSDLAAQNPEKVARLSTLLS 447 Query: 532 EFIDSSQPPL 541 E+ + PL Sbjct: 448 EWHAQNARPL 457 >UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q6_9PLAN Length = 574 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 124/515 (24%), Positives = 200/515 (38%), Gaps = 81/515 (15%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 V++S + +PN+IV+ DD GYG + F Sbjct: 11 WFAGFLLLVSYSFGCEGTLCAESRPNVIVILTDDQGYGDVGFRG---------------- 54 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 + +TP L + ++ + T Y V P+RA+++TGR R GV Sbjct: 55 ----------NLKINTPHLDRMAEKSIELTRFYC-SPVCAPTRASLLTGRNYYRTGVIHT 103 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + + E + EL Q GY T GKWHL Sbjct: 104 SRGGAKMQGEEVTVAELLQQAGYQTGIFGKWHLGDNY----------------------- 140 Query: 214 EWQPQNRGFDYFMGFHAAGTAY-------YNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 +PQ++GF + + G Y P L+KN + GY +D D A+ + Sbjct: 141 PMRPQDQGFAESLIHKSGGIGQSPDQPNSYFHPKLWKNGVAFQSTGYCTDVFFDAALDFI 200 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRIL 326 DR ++PF +YLA NAPH P + Q +T Y + ++D+ + ++L Sbjct: 201 DRQTKTEKPFFVYLATNAPHTPLEIAESYWKPYQRQGLDETTARVYGMITNLDENIGKLL 260 Query: 327 EQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 L+++ + T++LF DNG G +G KS TY GG P W G + Sbjct: 261 SHLERSALAEKTVVLFLGDNGPQ---QKRYTGGLRGRKSWTYEGGIRVPCLAQWPGHFRE 317 Query: 387 G-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 G D++ + +D PT L + P+ LKLDGV L P L +K+ P ++L + Sbjct: 318 GEKIDQIAAHIDLMPTLLALTETRCPESLKLDGVDLSPLLTGRKEKLPARSLFFQVHRGL 377 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK 505 + + R + YP TE+L + V L Y Sbjct: 378 TPQR----YQNYAVVTERFKLAGYPGTFGTENLLLQAEPV---------------LEFYD 418 Query: 506 L-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 L TD ++ N+ ++P+ VK + ++ + Sbjct: 419 LSTDPGEQKNVLHSHPETVKALLKQYEDWFSEMKA 453 >UniRef50_B1KD78 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD78_SHEWM Length = 483 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 179/513 (34%), Positives = 276/513 (53%), Gaps = 45/513 (8%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV-DTYKIGIDKAIEAAQ 106 + + + PN++++ DD+G+G + + + + + D+ + + A A+ Sbjct: 7 SSAIAAQQTPPNVVIVLADDMGFGHVAMNLDLATADSYNPQNLKRDSQRHKPELARSYAK 66 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-GIPLTET 165 K+TPTL L +EGVRFTN YV + GPSRAA+MTGR P RFG+Y+N D + G+P+ E Sbjct: 67 KATPTLTQLANEGVRFTNAYVPSPLCGPSRAALMTGRYPQRFGIYNNADVKAAGLPVEEN 126 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 L F+ GY T AVGKWHL+K Q P +RGFD+F Sbjct: 127 VLANNFRKAGYRTGAVGKWHLTKGEKKASYTLAQ----------------HPLDRGFDFF 170 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 GF +GT YY+S L NR+ V A+GY++DQLT+ AI + + +PF LY+AYNA Sbjct: 171 FGFDRSGTPYYDSKILELNRKPVKAEGYLTDQLTNHAIDFI--NQDKSKPFFLYMAYNAV 228 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 H P + AP +YQ FN+G + D +Y+ +Y++DQGV +I++QL NGQ DNTII+F SD Sbjct: 229 HGPLNKAAPKEYQAPFNSGDRYLDYFYSYLYALDQGVAKIIKQLDSNGQLDNTIIMFLSD 288 Query: 346 NGAVIDGPLPL--NGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGNYDK-LISAMDFYPT 401 NGA P PL N GYK Q + GGT P+ +W + G D +IS+MD PT Sbjct: 289 NGAPGGKPFPLPANAPFTGYKGQVWQGGTRVPVVIWGPKALVNGGRVDDAVISSMDLIPT 348 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 AL AA + + + LDG +LLP L K+ E + L W + SH + Sbjct: 349 ALAAAGVDLSDN--LDGNNLLPKL--KRVEEDERQLFWASQLSHHWGFIR---------- 394 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANP 520 D + + ++ ++ VR+ ++ L Y ++ + L+ + TD + ++A +P Sbjct: 395 -----DAKGKKIDDKSTAEPAWAVRSGEWMLRYWADSKKTELFNVSTDHAEHHDIANKHP 449 Query: 521 QVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 QVVK++ + + D+ P ++ + ++ Sbjct: 450 QVVKQLTADYKVWFDTLAKPAG-WDKRYWEQLE 481 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 134/568 (23%), Positives = 205/568 (36%), Gaps = 125/568 (22%) Query: 28 HAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 H + K F + ++ +P+II+ +DDLG+ + F Sbjct: 210 HYTSEGSSILAKRVAQFIAQELPKKASSKRPHIILFLIDDLGWNDIACYGSQF------- 262 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 TP L + EG RFT+ Y A+ V P+RA+I+ G+ P+R Sbjct: 263 -------------------YETPHLDKMAKEGFRFTDAYAANPVCSPTRASILLGKYPSR 303 Query: 148 FGVYSNTDA---------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNV 192 G+ +++ + + +PL + L E + GY TA +GKWHL + Sbjct: 304 VGLSNHSGSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHLQAHHDT 363 Query: 193 PVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA---AGTAYYNSPSLFKNRERVP 249 P+ GFD + H G+ Y+ S VP Sbjct: 364 SRNHF-------------------PEKHGFDLNIAGHRMGQPGSFYFPYKSKQHPSTNVP 404 Query: 250 ------AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN--DNPAPDQYQKQ- 300 Y++D+LTD+AI + K D PF L Y H P +Y+ + Sbjct: 405 DMADGQEGDYLTDKLTDKAIHYIKENK--DTPFFLNFWYYTVHTPIIPRQDLKKKYEAKA 462 Query: 301 -----------------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 F SQ +Y A V ++D+ + RI + LK+ D TII+F Sbjct: 463 NELGINKNQPGIPVLKSFARSSQNNPSYAAMVEAMDENIGRIFKTLKELQIDDETIIIFC 522 Query: 344 SDNGAVIDGPLP----LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFY 399 SDNG + P K K+ Y GG P + W GK + D Y Sbjct: 523 SDNGGLSTSTGPNCPTSQLPLKAGKAWVYEGGIRIPFIIKWPGKKGGKELQAPVCTTDIY 582 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 PT LD + + LDGVSL + + + + L Sbjct: 583 PTLLDMLKLPAKPEQHLDGVSLTSLMNGQAKELQREALF--------------------- 621 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAA 518 YPH + + + VR DY LV E + LY L D+ + +NL Sbjct: 622 -------IHYPHYHHINSMG-PAGAVRMGDYKLVEYYETGEFELYNLKEDIGEMNNLVKE 673 Query: 519 NPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 P+ +M + ++ S P E N Sbjct: 674 QPERAAQMLKKLEQWRQQSNSPKPERNP 701 >UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP2_9BACT Length = 446 Score = 442 bits (1137), Expect = e-122, Method: Composition-based stats. Identities = 143/520 (27%), Positives = 219/520 (42%), Gaps = 123/520 (23%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 KPNI+++ DD+G+G + + TP + Sbjct: 16 AADKPNIVLVFADDMGWGDVAYHG--------------------------VEDAQTPAID 49 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 ++ GV F GY A V GPSRA I+TGR FGV +N DA GIP ++ + EL + Sbjct: 50 AIAKGGVWFEQGYAAASVCGPSRAGILTGRYQQLFGVVTNGDADKGIPKSQKNIAELLKP 109 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY + A GKWHL P +RGFD F GFH Sbjct: 110 AGYKSGAFGKWHLGSKKGQF-----------------------PNDRGFDTFYGFHFGAH 146 Query: 234 AYY-----------NSPSLFKNRERVPAKG--YISDQLTDEAIGVVDRAKTLDQPFMLYL 280 YY ++ N++ V K Y+++++TD A+ ++ K DQPF +Y+ Sbjct: 147 DYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFIEENK--DQPFFMYV 204 Query: 281 AYNAPHLPNDNPAPDQYQKQFNT-GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 AYN+ H P PD+Y + + A V ++D GV RI +LK+ +NTI Sbjct: 205 AYNSVHSPWQ--VPDEYLARIPESVPAYRRLFLAMVLAMDDGVGRIRAKLKELNLDENTI 262 Query: 340 ILFTSDNGAVIDGPLPLNG---------AQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-Y 389 +FT+DNG+ G N +GYK TY GG P M W K++ GN + Sbjct: 263 FVFTTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGGIRVPFCMSWPKKIKSGNKF 322 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + + A D PT L AA + + G LLP+L+D+++G PH+ L W Sbjct: 323 EAPVIAYDLAPTFLSAASLEY-STKQFSGKDLLPYLEDEQKGRPHETLFW---------- 371 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN---------Q 500 H + D Y VR+ D+ L Y + Sbjct: 372 ---------------------HRHSGLD----DYAVRHGDWKLTYNDQEGTSKDFLKKVH 406 Query: 501 LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 L L+ L D +K +LA + P+ +++++ + + ++ Sbjct: 407 LKLFNLKQDPYEKKDLADSMPEKLQQLKQLYFNWHETHAK 446 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 441 bits (1134), Expect = e-122, Method: Composition-based stats. Identities = 130/513 (25%), Positives = 193/513 (37%), Gaps = 116/513 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 E + K KPNII + DDLGY +L G + K + TP Sbjct: 10 EATEKQKPNIIFIMADDLGYAEL----GCYGQK----------------------KIKTP 43 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 + L EG++FT Y V PSR+ +MTG+ V +N D + +T + E+ Sbjct: 44 HIDKLAAEGMKFTQAYAGSMVCQPSRSVLMTGQHTGHTAVRAN-DLNQLLYEEDTTVAEV 102 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 + GY T A GKW L + +P +GFD F G Sbjct: 103 LKIAGYATGAFGKWGLG----------------------YEGTPGRPGQQGFDDFTGQLL 140 Query: 231 AGTAYYNSPSLFKNRE---------RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A++ P N E YI D + ++A + + K QPF YL Sbjct: 141 QVHAHFYYPFWIWNNEHRLMLPENENNQRGRYIHDLIHEDAKAFIQKNKA--QPFFAYLP 198 Query: 282 YNAPHLPNDNPAPDQ--YQKQFNTGS--QTADNY----------YASVYSVDQGVKRILE 327 Y PH+ P + Y+ QF Y V +D V I+ Sbjct: 199 YIIPHVELVVPEESEKPYRGQFPKKQILDPRPGYIGSEDGLTTFAGMVSRLDDHVGEIVT 258 Query: 328 QLKKNGQYDNTIILFTSDNGA------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 L+ G DNT+I+FTSDNG + N +G+K Y GG P W Sbjct: 259 LLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDFFNGNAPLRGHKGSMYEGGIRVPFIANWP 318 Query: 382 GKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 GK+ G L I+ D PT A ++P + +DG+S LP L K + H+ L W Sbjct: 319 GKIAAGKTSDLQIAFWDVLPTLAQVAGTTVPSGVDIDGISFLPTLLGKGKQPEHEYLYWE 378 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + S +R ++ V N Sbjct: 379 YTRGKIR----------------------------------SRAIRQGNWKAVQNRMNQP 404 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 + LY L TD+ + NLA +P+ +K++Q ++++ Sbjct: 405 IELYDLGTDIGETKNLAKQHPEKIKDLQQIMQQ 437 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 441 bits (1134), Expect = e-122, Method: Composition-based stats. Identities = 134/524 (25%), Positives = 212/524 (40%), Gaps = 96/524 (18%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 + KPNII++ +DDLG+ + +++ + TP + Sbjct: 35 ADTTKPNIIMIVIDDLGWSDVGYNQTT-------------------------DYFETPNI 69 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA---------------Q 157 +L +G+ F Y PSRA +M+G+ R GVY+ + + + Sbjct: 70 DALAQQGLVFDQAYAGAANCAPSRAVLMSGQYGPRHGVYTVSPSDRGHAKTRKLIPIKNK 129 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 G+ + E + GY T GKWHL P Sbjct: 130 RGLTTDIITIGESLKTAGYTTGTFGKWHLGA---------------------------DP 162 Query: 218 QNRGFDYFM-GFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 +GFD + G H T +Y SP N E P Y++++LT E I V +K DQPF Sbjct: 163 DKQGFDVNVAGSHQGMTFHYFSPYQLPNIEDGPKGEYLTERLTTEVIDWVKSSK--DQPF 220 Query: 277 MLYLAYNAPHLPNDNPAPD--QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 Y+ Y H P +Y ++ S+ Y A V +D V RI + L G Sbjct: 221 FAYVPYYTVHTPYQAVVDKVNKYHEK-GIKSKREATYAAMVEHMDDNVGRIFDMLDSEGL 279 Query: 335 YDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS 394 +NT+++FTSDNG P +G K Y GG P+ + W K++PG + Sbjct: 280 AENTVVIFTSDNGGYRMSSFPT--PLRGGKGSYYDGGLRVPLIVRWPEKVKPGLDHTPVI 337 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 DFYPT ++ P + LDGV L L + Q ++L W + P Sbjct: 338 NADFYPTLVNLTKSKQP-NQVLDGVDLTAHLLGQ-QDIAERDLFW-----------HFPV 384 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKD 513 + H Q D ++ +R+ D+ L+ ENN+ LY L DL +K+ Sbjct: 385 YLQAHHAPTDQGQD------PLFRTRPGSAIRSGDWKLLQYFENNEFELYNLANDLAEKN 438 Query: 514 NLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQEKFNNIKKAL 556 NLA+ +P VKE++ ++ + + ++N E + + L Sbjct: 439 NLASVHPSRVKELKTKLQAWQQQIGADIPTKLNPEYDAKVNQQL 482 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 133/510 (26%), Positives = 195/510 (38%), Gaps = 114/510 (22%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 S P +T PNI++ DD G+G L + + Sbjct: 31 SQTRPATQAT-THPNIVIFLADDQGWGDLSHNGNT------------------------- 64 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTE 164 TP + SL EGV+F YV V P+RAA +TGR AR G + Q+ E Sbjct: 65 -NLHTPNVDSLAKEGVKFNRFYVGA-VCAPTRAAFLTGRYHARTGTIGVSTGQERFNSDE 122 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + + F+ GY T A GKWH P +GFD Sbjct: 123 YTIAQAFKAAGYATGAFGKWHNGTQY-----------------------PNHPNAKGFDE 159 Query: 225 FMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 + GF + +Y SP L N V GYI+D LTD+A+ +++ +PF YL Y Sbjct: 160 YYGFTSGHWGHYFSPMLDHNGTFVKGNGYITDDLTDKAMAFIEQQVQNHKPFFAYLPYCT 219 Query: 285 PHLPNDNPAPDQYQKQFNT-------------GSQTADNYYASVYSVDQGVKRILEQLKK 331 PH P PDQY +F A +VD V R+L++L Sbjct: 220 PHSPMQ--VPDQYWDRFKDKQLKLHNREPDREQPDHLRAALAMCENVDWNVGRVLKKLNS 277 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YD 390 D+TI+++ SDNG + NG KG K GG +P + W G L G + Sbjct: 278 LRITDDTIVIYFSDNGP---NGVRWNGDMKGKKGSLDEGGVRSPFVIRWPGHLPAGQEVN 334 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 ++ A+D PT D A I P+ +DGVSL P + + K P + + Sbjct: 335 QIAGAIDLLPTLTDLAGIKRPEPKPIDGVSLKPLMLNSKADWPERMIF------------ 382 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDL 509 + +VR + Y L + LY + D Sbjct: 383 --------------------------SSLRNRVSVRTDQYRLSR-----KGELYDMHADP 411 Query: 510 QQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 Q++N+A P++ ++Q V ++ S P Sbjct: 412 GQRNNIAKQKPEITAKLQQAVTDWRQSVWP 441 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 437 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 136/529 (25%), Positives = 203/529 (38%), Gaps = 111/529 (20%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 + T A ++ + +PN++ + +DD+G+G L Sbjct: 16 GMTAAGTLMATASANLFSRATAQSSRPNVVFILVDDMGWGDLSIYG-------------- 61 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--FG 149 TP L L +GVRFTN Y V P+R A +TGR AR G Sbjct: 62 ------------RTDYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLTGRYQARLPVG 109 Query: 150 VYSNTDAQD-------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 + A+ GIP + + L + +GY TA VGKWH N Sbjct: 110 LREPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAGYPPN----------- 158 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP------SLFKNRERVPAKGYISD 256 + P +GFD + G + G Y+ L++N V GY++D Sbjct: 159 ------------FGPLQKGFDEYFGHLSGGIEYFTHTGTDRILDLYENDVPVQRSGYVTD 206 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP----APDQYQKQFNTGSQTADNYY 312 TD A+ + R + +PF L L YNAPH P P + Y T + Y Sbjct: 207 LFTDRAVEFIQRPHS--RPFYLSLHYNAPHWPWQGPNDQASTAFYLTNGYTVGGSQATYA 264 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 A V S+D GV R+L+ L+ +GQ DNT+++FTSDNG G +G K+ Y GG Sbjct: 265 AMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERFSNF---GPFRGQKASLYEGGI 321 Query: 373 HTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P + + G Q +++I D T L A S + DG +LLP L+ + Sbjct: 322 RVPAIIRYPGVTQANQVSNQVIITFDLTATILAATGTSFHPNYPPDGQNLLPLLRGDRS- 380 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 E + L W + L+ VR+ D+ Sbjct: 381 EFSRTLFWRYGAA---------------------------------LTTRQRAVRSGDWK 407 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 Y NQ L+ L TD + +L +N QV ++ + + P Sbjct: 408 --YWRRGNQEALFNLATDPGETTDLKDSNAQVFTRLRNQFQHWELQMLP 454 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 437 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 123/520 (23%), Positives = 207/520 (39%), Gaps = 101/520 (19%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + +PNI++ +DDLG+ + F F TP Sbjct: 23 ASAENQRPNIVLFFIDDLGWRDVGFMGSDF--------------------------FETP 56 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-------IPLT 163 + L DE ++FT Y A PSRA +M+G R GVY+ D G IP Sbjct: 57 HIDRLADESMKFTAAYSAAPNCAPSRACLMSGLYTPRHGVYTVGDPARGNDRYRKLIPAE 116 Query: 164 E--------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 T + + GY A+VGKWHL + Sbjct: 117 NNRVLDDRFTTIADRLSQAGYRCASVGKWHLGQ--------------------------- 149 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLD 273 P ++GF + + G+ S ++N + + +++D+LT A + + Sbjct: 150 SPLSQGFQVNIAGNQTGSPRGGYFSPYQNPQLSDGEQGEFLTDRLTTAACQFIKDNQGS- 208 Query: 274 QPFMLYLAYNAPHLPNDNPAPD--QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 PF LYL + A H P D +Q + Y A + S+DQ + R+L+ L++ Sbjct: 209 -PFFLYLTHYAVHTPLQAKKEDIAYFQSKPAGKLHQHATYAAMIRSMDQSIGRVLQTLRE 267 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD- 390 NTI++FTSDNG GP +G K Y GG P+ + W G QPG+ Sbjct: 268 QQLDQNTIVVFTSDNGGY--GPATSMLPLRGSKGMLYEGGIRVPLLIKWPGVTQPGSTTG 325 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 + + +D YPT L+ +I + + LDG SL+P L+D + ++L W Sbjct: 326 EAVINVDLYPTFLEMTNIPVLESELLDGESLVPLLKDPQTRLESRSLFW----------- 374 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DL 509 + P + ++ ++ + P +R D+ L+ E+ LY D+ Sbjct: 375 HFPAYLQKYQGMQQRFRTTP-----------VSVIRQGDWKLLEFFEDGHQELYNTRLDI 423 Query: 510 QQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQEK 548 + L+ ++P+ +E+ + + + + E+N E Sbjct: 424 GESKELSGSHPEKTQELSQALHRWQKQVKAAIPAELNPEY 463 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 436 bits (1122), Expect = e-120, Method: Composition-based stats. Identities = 133/550 (24%), Positives = 215/550 (39%), Gaps = 128/550 (23%) Query: 29 AADDVKLKATKTNVAFSDF---TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 A D ++ +A S + + PNI+++ DDLGYG L Sbjct: 3 APDLMRSVLFALFIAVSCLLIRFSAAEAAQQPPNIVLIMADDLGYGDLACYG-------- 54 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 Q TP + L ++FT+ + A + P+RAA++TG+ Sbjct: 55 ------------------NKQVKTPHIDRLAASALKFTDFHSAGAMCTPTRAAMLTGQYQ 96 Query: 146 ARFG------VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 RFG + ++ G+P + EL + GY TA GKWHL Sbjct: 97 QRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWHLG------------ 144 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGY 253 W P N+GFD F G + ++ + N E KGY Sbjct: 145 -----------YQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNEDWWHNNEISMEKGY 193 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--------DQYQKQFNTGS 305 +D L+ ++ ++ +T +PF LY+ + A H P P D + ++ Sbjct: 194 TADLLSKYSVAFMEANRT--RPFFLYVPHLAIHFPWQGPQDPPHRKAGQDYHAGKWGIIP 251 Query: 306 QTAD---NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID-----GPLPLN 357 + + A + S+DQ V +IL LK+ NT+++FTSDNG + + N Sbjct: 252 DPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGGYLTYGKNFQNISSN 311 Query: 358 GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLD 417 G +G K+ Y GG P + W G + G D+ ++D PT AA IS + + D Sbjct: 312 GPLRGQKATLYEGGHRVPCLISWPGVITAGVTDQTAHSVDLLPTLAQAAGISA-TNFQTD 370 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G+ L P Q + ++L W + Sbjct: 371 GLDLAPLWQTGR-PLADRDLFWRMGNNR-------------------------------- 397 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF--- 533 VR + L ++NN+ LY L TDL ++ N AA +P++VK M ++E+ Sbjct: 398 ------AVRRGQWKL--CLKNNRSELYHLETDLGEQQNRAAEHPEIVKSMSQALKEWEAD 449 Query: 534 IDSSQPPLSE 543 +D+S S+ Sbjct: 450 VDTSAKQFSK 459 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 126/531 (23%), Positives = 207/531 (38%), Gaps = 112/531 (21%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 V + + KPNI+ + +DD+G D G + Sbjct: 11 IAAILVLLASGALHSDAAPTKPNIVFILIDDMG----CKDAGCYG--------------- 51 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---- 152 A STP + L ++G+RFT+ Y A V P+RA++MTG+ PAR + + Sbjct: 52 -------ATNFSTPHIDRLANQGMRFTDAYAA-PVCSPTRASLMTGKHPARLHLTNFIPQ 103 Query: 153 -----------NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 +PL E + + GY A +GKWHL + Sbjct: 104 IGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEH----------- 152 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKG--------Y 253 E++PQNRGFD + G Y P F ++++ P G Y Sbjct: 153 ----------GPEYRPQNRGFDRVVLSEHHGIFNYFYP--FVDQQKWPYAGPLPGNPGDY 200 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA 313 + D+LTDEAI V + ++PF LYL++ + H P + + + Y A Sbjct: 201 LPDRLTDEAIDFVRENR--ERPFFLYLSHWSVHGRYFAPESLIAKYRERGLEERPAIYAA 258 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTH 373 + +VD V R++ L + DNT+ +F SDNG + +G K Y GG Sbjct: 259 MMETVDNSVGRLMATLDELNLADNTLFVFMSDNGGER---ITSMAPLRGSKGSLYEGGVR 315 Query: 374 TPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P+ + + G ++P + + D +PT LD A+ S +D KLDG S+ L ++ Sbjct: 316 VPLIVRYPGVVKPNTTCSVPVISHDLFPTFLDFAERSY-RDNKLDGHSIAGLLTGEQSEL 374 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 L W H P+ ++ +R + L Sbjct: 375 DRDALYW-------------------------------HFPHYWGSTRPCSAMRQGRWKL 403 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 V +E + LY L +D ++ +LA PQ E++ ++ ++ + Sbjct: 404 VEHLETGRAQLYDLSSDPGEQRDLANEMPQQATELRKMLAQWRTKVGAQMP 454 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 121/498 (24%), Positives = 191/498 (38%), Gaps = 115/498 (23%) Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DDLGYG L +TP L L G+RFT Sbjct: 1 MADDLGYGDLSCYGS--------------------------QNCNTPHLDRLAANGIRFT 34 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------DAQDGIPLTETFLPELFQNHGYY 177 + + + V P+RA ++TGR R G+ + G+ E L + Q+ GY Sbjct: 35 DFHSSGAVCSPTRAGLLTGRYQQRAGIDGVVYANPKKNRHHGLQKNEITLAQCLQDAGYQ 94 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 T GKWHL ++ P RGF F+G+ + Y+ Sbjct: 95 TGMFGKWHLGYQR-----------------------QYNPTFRGFQQFVGYVSGNVDYFA 131 Query: 238 SP------SLFKNRE-RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND 290 + N E +GY++ + D A+ + + + ++PF +Y+A+ A H P Sbjct: 132 HLDGTGVFDWWHNAELNREEQGYVTHLINDHALEFIRQQQ--EKPFFVYIAHEAVHSPYQ 189 Query: 291 NPAPDQYQK------QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 P +K + A+ Y +D+G+ +I++ LK+ + T I F S Sbjct: 190 GPHDQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKGIGQIVDVLKEVNLTEKTFIFFLS 249 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTAL 403 DNGA +G NG +G+K + GG P W G++ G D+ + ++D PT L Sbjct: 250 DNGANKNG---SNGKLRGFKGSLWEGGHRVPAIACWPGRIPEGTVCDEPVISIDLMPTIL 306 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 + A+ IP KLDGVSL+ L+D+K P + + W Sbjct: 307 ELANAKIPAGHKLDGVSLVSLLKDRKSLVP-RQIFWEY---------------------- 343 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV-ENNQLGLYKLT-DLQQKDNLAAANPQ 521 +R + LV + LY LT D+ + NLA PQ Sbjct: 344 ----------------NGKSAMRQGHWKLVLNQTRKEPIELYDLTRDMSESKNLADNQPQ 387 Query: 522 VVKEMQGVVREFIDSSQP 539 V++MQ + + Q Sbjct: 388 RVQQMQSALAAWKSDVQK 405 >UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R014_9PLAN Length = 475 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 128/552 (23%), Positives = 211/552 (38%), Gaps = 111/552 (20%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSF 80 + H + L + AF + + PNI+V+ +DD+G+ L ++ Sbjct: 4 LVQHLLHYLTTLTLTSCVFAAAFCATKQAFSADSTRVPNIVVILIDDMGFSDLSCMGSTY 63 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 TP++ L G+RFT+ Y A V P+RAA++ Sbjct: 64 --------------------------YETPSINKLAASGMRFTHAYSACTVCSPTRAAVL 97 Query: 141 TGRAPARFGVYSNTDAQDG-------------IPLTETFLPELFQNHGYYTAAVGKWHLS 187 TG+ PAR + Q + L E L EL HGY TA++GKWHL Sbjct: 98 TGKYPARLHLTDWIPGQMSNKTKLKLPDWNKQLNLEEITLAELLGAHGYTTASIGKWHLG 157 Query: 188 KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRER 247 E +P +GF +G ++ G +N Sbjct: 158 ------------------------PPECEPTRQGFSLNIGGNSKGQPPSYFFPYERNGVL 193 Query: 248 VP------AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQK 299 +P Y++D+LTD ++ ++ +PF LYL + H P +Y+ Sbjct: 194 LPGLAEGKPNEYLTDRLTDACEAFIEENQS--KPFFLYLPHYCVHTPLQAKPELIAKYEA 251 Query: 300 Q---FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 + F Q Y A V S+DQ V RI+ +L TI++FTSDNG ++ + Sbjct: 252 KNAQFPGNPQHEAKYAAMVESLDQSVGRIMAKLDALDLTKKTIVIFTSDNGGLVLREITS 311 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLK 415 N + K Y GG P+ + + ++PG D +MD +PT + + D Sbjct: 312 NLPARAGKGSAYEGGVRVPLIVSYPPMIKPGTTCDVPAISMDLFPTLAELSGAKYSHD-- 369 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 +DG S++P L++K + L W Y H+ P+ Sbjct: 370 IDGKSIVPLLEEKPDAFAARPLYW--HYPHYHGGGATPY--------------------- 406 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 +R +Y LV E+ +L LY L D+ + NLA P + +++ + + Sbjct: 407 -------SAMRVGNYRLVEFFEDGRLELYDLAHDIGEMKNLAQEKPDLTEKLHRQLIAWR 459 Query: 535 DSSQPPLSEVNQ 546 S + + Sbjct: 460 KSVDAQYATPRE 471 >UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellula baltica RepID=Q7UGB8_RHOBA Length = 656 Score = 435 bits (1118), Expect = e-120, Method: Composition-based stats. Identities = 128/536 (23%), Positives = 218/536 (40%), Gaps = 90/536 (16%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 A + T + +PN++++ DD G+G L Sbjct: 73 CAKKICRTVVMVLFVIGAGTSIQAEASDRPNVLLILTDDQGWGDLAAH------------ 120 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 + STPTL +L +E R YV V P+RAA++TGR P R Sbjct: 121 --------------RNPKISTPTLDALANESARLDRFYV-SPVCAPTRAALLTGRYPERS 165 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 GV T ++ + ET L EL+++ GY T GKWH Sbjct: 166 GVAGVTGRREVMRAEETTLAELYRSAGYATGCFGKWHNGAQM------------------ 207 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDR 268 P +GF+ F GF Y+ L +N V KGYI+D LTD A+ + Sbjct: 208 -----PLHPNGQGFNEFFGFCGGHFNLYDDALLERNGTPVQTKGYITDVLTDAAVEFIQN 262 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 D+PF Y+ +NAPH P + + + YA V ++D V R+L+ Sbjct: 263 HH--DRPFFCYVPFNAPHGPFQVRRDLFDRYNDGSIDEKTAAVYAMVQNIDTNVSRLLKC 320 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 L + + TI++F +DNG NG +G K + GG P F+ W G +QP + Sbjct: 321 LSDHSLDEETIVVFLTDNGP---NGKRFNGGMRGTKGSVHEGGCRVPCFIRWTGNIQPQS 377 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 ++ + +D PT + DI +P + LDG SL+ ++D D Sbjct: 378 ISQVAAHIDLLPTLMQWCDIPLPTKVPLDGRSLVELIRDGADPT-------------LAD 424 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-T 507 + + N + + VR N + L T+E ++ L+ + T Sbjct: 425 RSILTYRPNPMQLQKF----------------GKAAVRTNTHRL--TIEKSKASLFDMTT 466 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK---ALSEAK 560 D Q ++A+++P++ K+++ +++++ P ++ + ++++ +AK Sbjct: 467 DAGQTTDIASSHPELTKQLRSQIQKYVQEITPSITAIRPVPIDSMRSVYLPAVDAK 522 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 123/548 (22%), Positives = 210/548 (38%), Gaps = 136/548 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN + + MDD+G+ L +F TP + L Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTF--------------------------YETPNIDRLC 37 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD----------------GI 160 +G+ F N Y + V PSRA+ +TG+ PAR GV D + + Sbjct: 38 RQGMVFANSYASCPVCSPSRASCLTGKYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHL 97 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P E + + ++ GY T VGKWHL E+ P++ Sbjct: 98 PEGEYTIAQALKDAGYDTWHVGKWHLGGR------------------------EFYPEHF 133 Query: 221 GFDYFMGFHAAGTAY--YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD--QPF 276 GFD +G + G + Y SP + P Y++D++TDEA+ ++ + + +PF Sbjct: 134 GFDVNIGGCSWGHPHDGYFSPYGIETLSEGPEGEYLTDRITDEAVRLLRKRQACGSRKPF 193 Query: 277 MLYLAYNAPHLPNDNPAPDQ------------------YQKQFNTGSQTADN-------- 310 + L + A H P D+ + +F+ Sbjct: 194 YMNLCHYAVHTPIQVKDEDRARFEKKARELGLDKETALVEGEFHHTEDKKGRRVVRRVIQ 253 Query: 311 ----YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYK 364 Y ++++DQ + R+LE L++ G+ +NT+++FTSDNG +G N K Sbjct: 254 SDPSYAGMIWNLDQNIGRLLEALRECGEEENTVVVFTSDNGGLATSEGSPTCNLPASEGK 313 Query: 365 SQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GGT P+ + + G++ PG D ++ DFYPT L+ A + + +DG S++P Sbjct: 314 GWVYEGGTRVPLIVKYPGRVAPGSRCDVPVTTPDFYPTFLELAGVPQKAGIPIDGRSIVP 373 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L P + + W Y H+ ++ P + Sbjct: 374 LLSG--NPMPERPIFW--HYPHYGNQGGTP----------------------------AS 401 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 +V DY + E+ + LY L D + +NL P+ ++ ++ + Sbjct: 402 SVVMGDYKYIEFFEDGRGELYDLKADFSETNNLCEKMPETAARLRMLLHGWQREVCARFP 461 Query: 543 EVNQEKFN 550 E N E Sbjct: 462 EENAEYVQ 469 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. Identities = 143/554 (25%), Positives = 216/554 (38%), Gaps = 130/554 (23%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 + + S + + KPN IV+ DD GYG L Sbjct: 550 PSSPTASVSPAGREKTAETTKPNFIVILTDDQGYGDLSCFG------------------- 590 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--------F 148 A TP + + EG R T+ YVA V PSRA +MTG P R F Sbjct: 591 -------AKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRAGLMTGCYPKRIDMAMGSNF 643 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 GV D + G+ E + E+ + GY T GKWHL Sbjct: 644 GVLLAGDPK-GLHPDEITIAEVLKTAGYRTGMFGKWHLGDQ------------------- 683 Query: 209 TFSAEEWQPQNRGFDYFMG---------FHAAGTAYYNSP-SLFKNR---ERVPAKGYIS 255 E+ P +GFD F G FH Y+ P L +N E P +++ Sbjct: 684 ----PEFLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDADFLT 739 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD------------QYQKQFNT 303 +LT++A+ ++R K DQPF LYL + PH P P + + Sbjct: 740 KRLTEQAVSFIERNK--DQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNID 797 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY 363 + A+ + ++ +D V +IL+ L+ NG + T++LFTSDNG + G +G+ Sbjct: 798 YATRANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGELRGH 857 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K T+ GG P + W G++ G D+L++AMD PT A +IP D +DG + Sbjct: 858 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIW 917 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P L+ + Q PH + Sbjct: 918 PTLKGETQ-TPHDAFFYHRGNQL------------------------------------- 939 Query: 483 YTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF----IDSS 537 VR+ + L + LY L DL +K N+ NP+VVK++Q +++F +S Sbjct: 940 AAVRSGKWKLHVNNGVAK-QLYDLENDLGEKVNVIETNPEVVKKLQHQLKDFAADIASNS 998 Query: 538 QPPLSEVNQEKFNN 551 +P N + +N Sbjct: 999 RPAAFNANPKSLSN 1012 Score = 369 bits (949), Expect = e-100, Method: Composition-based stats. Identities = 128/557 (22%), Positives = 205/557 (36%), Gaps = 89/557 (15%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + +K + T + + PN++++ +DDLGYG L Sbjct: 12 INSSAMKLYAVALMMLLGCGTSVAAERPPNVVLIFVDDLGYGDLGCYG------------ 59 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF- 148 A + STP + L EG RFT+ + A V PSR ++TG+ P R Sbjct: 60 --------------ATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYGLLTGQYPVRAM 105 Query: 149 ---GVYSNTDAQDGI--PLTETFLPELFQNHGYYTAAVGKWHLSKISNV--------PVP 195 G++ G+ + ++F+N GY TA +GKWHL P P Sbjct: 106 GGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKEEPCDWQVPLRPGP 165 Query: 196 EDKQTRDYHDNFTTFSAEEWQPQN----RGFDYFMGFHAAGTAYYNSPSL---------- 241 +D Y S + N G+D G +P Sbjct: 166 QDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTPMFPEEASVKSPN 225 Query: 242 -----FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 K E + + LT+ A+ + K ++PF LY A H P PAP Sbjct: 226 RFSGALKAHEIYDDEKTGT-LLTERAVKWITEKK--NEPFFLYFATPNIHHPF-TPAP-- 279 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID----- 351 +F SQ Y V+ +D V I++ L+ NG DNT++LFTSDNGA+++ Sbjct: 280 ---RFKGTSQ-CGLYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRD 335 Query: 352 ---GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAAD 407 NG G+K + GG P+ W GK++ G D+LIS +D + T + Sbjct: 336 AIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLISQVDLFATFSALTE 395 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 +P + D +++LP L D L + + + Sbjct: 396 QEMPSSEQKDSINMLPALLDDPNEPLRTELVLAPRQPRNLAIRKGKWLYIGARGSGGFNG 455 Query: 468 DYPHNPNTEDLSQFSYT------VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANP 520 P + + ++ + N LY L D Q N+ +P Sbjct: 456 SKPQHHAWGGPAAVQFSGQKNSDIVNGRIK----KNAPPAQLYDLENDRSQTTNVFREHP 511 Query: 521 QVVKEMQGVVREFIDSS 537 +VV+EM+ ++ + Sbjct: 512 EVVEEMKAMLESYRPKQ 528 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. Identities = 144/524 (27%), Positives = 216/524 (41%), Gaps = 131/524 (25%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PN IV+ DDLGYG L G + T TP L L Sbjct: 15 KQPNFIVIYCDDLGYGDL----GCYGSDT----------------------VKTPHLDGL 48 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV---YSNTDAQDGIPLTETFLPELFQ 172 DEG+RFTN Y V PSRA+++TG+ PAR GV G+P E L + + Sbjct: 49 ADEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEILGAKRGSHGLPADEVTLAKALK 108 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY TA GKWHL +EE P GFD F GF A Sbjct: 109 PAGYRTALYGKWHLGL-----------------------SEETSPNAHGFDEFFGFKAGC 145 Query: 233 TAYYNS-------------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFML 278 +Y+ L++N V G Y+++ +T+ ++ + R++ + PF L Sbjct: 146 VDFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFL 205 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 + +YNAPH P AP +Y +F A + +VD GV +I++ LK+ G Y++T Sbjct: 206 FASYNAPHYPMH--APQKYMDRFAHLPWDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDT 263 Query: 339 IILFTSDNGAVIDGPLPLNGA-----------QKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 +I F+SDNG + L+G +G+K+ + GG P + W + G Sbjct: 264 VIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEGGIREPAILSWPNGWEGG 323 Query: 388 NY-DKLISAMDFYPTALDAADISIP----KDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 D++ + MD PT LD A + + + LDG SL LQ ++ PH+ L W Sbjct: 324 QVRDEVAAMMDLAPTFLDLAGVDPAAGPLQGVALDGSSLKEMLQ-MREPSPHQQLFWEY- 381 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE----- 497 Q VR D+ LV + Sbjct: 382 -------------------------------------QGQLAVREGDWKLVLNGKLDFDR 404 Query: 498 --NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 +Q+ L L+ D ++ NLA P++V+ + VR++ + Q Sbjct: 405 VVPDQIHLSDLSRDPGERSNLADRYPEIVERLSRDVRDWYEEVQ 448 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 433 bits (1114), Expect = e-120, Method: Composition-based stats. Identities = 129/533 (24%), Positives = 208/533 (39%), Gaps = 97/533 (18%) Query: 37 ATKTNVAFSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 A S T ++ KP N++ +DDLG+ L F Sbjct: 12 AVILLCFLSSITQPTHAADEKPWNVVFFLVDDLGWTDLGCYGSDF--------------- 56 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD 155 +P + L EG++FT Y A P+R A++TG PAR + Sbjct: 57 -----------YQSPNIDQLAAEGMKFTQNYSACNACSPTRGALLTGMYPARTHLTDWIP 105 Query: 156 A---------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 + + T LPE + GY T VGKWHL N+P Q Sbjct: 106 GWAKSYTDFPLKPPEWKKHLDQKYTTLPEALRTAGYQTFHVGKWHLGGRGNLP-----QD 160 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTD 260 + N + NRG F G A SL E Y++D++ D Sbjct: 161 HGFDVNISG--------TNRGLPRSYHFPYGGDAMKWDSSL---TEAERQDRYLTDRMAD 209 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPND--NPAPDQYQKQFNTGSQTADNYYASVYSV 318 EA+ ++ + + D+PF LY ++ + H P +Y+ Y A + SV Sbjct: 210 EAVALIRQQQ--DKPFFLYCSFYSVHSPIQGRPDLVKKYKGLPAGKRHKNPEYAAMIQSV 267 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 D+ + R+ QLK++G D T+I+FTSDNG V N +G K Q + GGT P + Sbjct: 268 DEAIGRVRAQLKESGIADRTLIVFTSDNGGVRRKT-SNNDPLRGEKGQHWEGGTRVPAIV 326 Query: 379 WWKGKLQPGNYD-KLISAMDFYPTALDAADIS--IPKDLKLDGVSLLPWLQDKKQGEPHK 435 W G G+ + I MDFYPT L+ ++ + +DG+SL+P L+D + Sbjct: 327 LWPGVTPAGSVCAEPIITMDFYPTILNITGVAGNTEHNQSVDGLSLVPLLKDPAATLNRE 386 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 L W Y H+ +P+ +R +Y L++ Sbjct: 387 ALYW--HYPHYNVFIGVPY----------------------------SAIRVGEYKLIHY 416 Query: 496 VENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 E+ LY L DL + +++ P++ ++ +++ + + N + Sbjct: 417 YEDGNDELYNLAEDLSETSDVSKTYPELTARLERRLQQHLKQVGAQMPVSNPQ 469 >UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y214_PEDHD Length = 472 Score = 433 bits (1114), Expect = e-119, Method: Composition-based stats. Identities = 138/532 (25%), Positives = 211/532 (39%), Gaps = 114/532 (21%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +K +T ++ + + T KPN+IV+ DD GY G Sbjct: 3 GIKTISTLLLALWTGISAAQVKTAAKPNVIVIVSDDAGYVDFGCYGG------------- 49 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 Q TP + ++ +G RFT+ YV+ V PSRA I+TGR RFG Sbjct: 50 -------------KQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFGFE 96 Query: 152 SNTDA---------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 NT G+ +E + Q +GY T A+GKWH Sbjct: 97 HNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQGD-------------- 142 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY-------NSPSLFKNRERVPAKG--Y 253 + P NRGF+ F GF ++ N +L+ N+E VP Y Sbjct: 143 ---------EPKHFPLNRGFNEFYGFTGGHRDFFAYKGKRTNEHALYNNKEIVPENEITY 193 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA 313 ++D TD+A + K D+PF +YL+YNA H P + D ++ + Y A Sbjct: 194 LTDMFTDKATSFITANK--DKPFFMYLSYNAVHTPMNAKK-DLMERYASIADTGRRAYAA 250 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTH 373 + S+D G+ +++ LK N NT+I+F +DNG NG +G K + GG Sbjct: 251 MMTSLDDGIGKVMATLKANQLDKNTLIIFINDNGGATVNS-SDNGPLRGMKGSKWEGGIR 309 Query: 374 TPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 M M W G + D + +S++D PTA+ A KLDGV+LLP+L + Sbjct: 310 VAMMMKWPGHIAANKTDSRPVSSLDILPTAIGAGKGKQKGTKKLDGVNLLPYLSAGNKKT 369 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 PH+ L W + +R ++ L Sbjct: 370 PHEALYWRRGV--------------------------------------AAAMREGNWKL 391 Query: 493 VYTVENNQLG---LYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + E+ + L+ L+ DL + NL+ P VKE+ + E+ P Sbjct: 392 IRVKESPTVQNVLLFDLSKDLSETKNLSEKYPAKVKELLVKLAEWEKGLDQP 443 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 431 bits (1110), Expect = e-119, Method: Composition-based stats. Identities = 126/523 (24%), Positives = 212/523 (40%), Gaps = 69/523 (13%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 + E KPNI+++ DDLGYG L Sbjct: 21 ELQAVEKQQAAKPNIVIILCDDLGYGDLACYG--------------------------HP 54 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL--T 163 TP L L EG+R T+ Y + V PSRA ++TGR P R GVY + L Sbjct: 55 VIKTPHLDQLASEGMRLTDCYASAPVCSPSRAGLLTGRTPNRLGVYDWIPEGHPMHLKRD 114 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E + +L Q GY TA VGKWH + + N S E+ QP + GF Sbjct: 115 EVTVAQLLQQAGYDTAHVGKWHCNGMFN-------------------SKEQPQPGDHGFR 155 Query: 224 YFMGFHAAGTAYYNSPSLF-KNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 ++ + +P+ F +N + + +G+ + DE I + + ++PF L++ Sbjct: 156 HWFSTQNNALPTHENPNNFVRNGKPLGEIEGFSCQIVADEGIRWLSDWREKEKPFFLHVC 215 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 ++ PH +P + + Y+A+V ++D+ V ++L +L + DNT++ Sbjct: 216 FHEPHERVASPPALVETYLDKSLYEDQAQYFANVANMDRAVGKLLIKLDELKVADNTLVF 275 Query: 342 FTSDNGAVI--------DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKL 392 FTSDNG G +G K Y GG P + W GK++ G Sbjct: 276 FTSDNGPETLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEIATP 335 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 + ++D PT + A +++P LDG SLLP K E L W +Y + + Sbjct: 336 VCSVDLLPTFCEIAGVAVPDQRPLDGASLLPLFAGNKI-ERTTPLFW--NYYRAYSTPRV 392 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQ 511 + K V H S P +++ S + + + + LY L D+ + Sbjct: 393 AMREGDWKVVAHWSGPEGIIPLGGNVNSVSQEI-------IKNAKLTKFELYNLKDDISE 445 Query: 512 KDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 + NLA + + ++ + + + Q + +++ +K Sbjct: 446 QHNLAWQEQKRLDTLKKKLVQKYAAVQKEGPVWDTSEYDQSRK 488 >UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4V9_9PLAN Length = 480 Score = 431 bits (1110), Expect = e-119, Method: Composition-based stats. Identities = 135/541 (24%), Positives = 207/541 (38%), Gaps = 120/541 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTK------GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 L T FS F + +PN+IV+ +DD+GY + + Sbjct: 8 LTGMMTTAVFSMFCLVNLADAAERPPGDRPNLIVIMVDDMGYAGVSCFGNPY-------- 59 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 TP + L EG++FT+ + + V P+RA ++TGR R Sbjct: 60 ------------------FKTPEIDRLAAEGMKFTDFHSSGTVCSPTRAGLLTGRYQQRA 101 Query: 149 GVY-------SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 G+ + + Q G+ +E EL + GY TA +GKWH N Sbjct: 102 GIEAVIHPVSDHPEHQKGLRKSENTFAELLKQAGYRTALIGKWHQGYPHNSA-------- 153 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYIS 255 E+ P N GFD F+G+H+ + + + R+ GY + Sbjct: 154 ------------EFHPDNHGFDTFVGYHSGNIDFISHVGDHVKHDWWHGRKETQETGYST 201 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ--------T 307 + A+ + ++ +QPF LYLA+ A H P P + + + Sbjct: 202 HLINQYALQFIKESR--NQPFCLYLAHEAIHNPVQVPGDPIRRTEAAGWKRWKPASEAER 259 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQT 367 + + VD GV +I E L K+G NT +LF SDNG D P +G K Sbjct: 260 IEKFRGMTLPVDAGVGQIREFLVKSGLDKNTFVLFFSDNGPSRDFPSGSP-KWRGAKGSV 318 Query: 368 YPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y GG P WW GK+Q G D ++D PT L A I +PK+ LDGV L P L Sbjct: 319 YEGGHRVPAIAWWPGKIQAGTETDVPAISLDVMPTLLGIAHIDMPKERPLDGVDLSPVLF 378 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 ++K + L W + + S +R Sbjct: 379 EQK-PLSERPLFWA---------------------------------SLSNNGSRSEAMR 404 Query: 487 NNDYSLVY--------TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 + LV T EN ++ LY+L D + +NL+ A PQ M ++++ + Sbjct: 405 AGPWKLVVQHPRAKPGTFENEKVELYRLDQDPGEANNLSKAEPQRASRMLKQLKDWYQDT 464 Query: 538 Q 538 Q Sbjct: 465 Q 465 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 145/565 (25%), Positives = 208/565 (36%), Gaps = 148/565 (26%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +KL + F S +PNIIVL DDLGYG+L Sbjct: 1 MKLLSVLALFCSLTFFLNSLSAAEQPNIIVLLADDLGYGELGCQG--------------- 45 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY- 151 Q TP + SL G+RFT YV PSRA ++TGR P RFG Sbjct: 46 -----------NPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEF 94 Query: 152 -----SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 N D+ G+P E + E + GY T +GKWHL ++ Sbjct: 95 NPIGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGGTAD--------------- 139 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAY------------------------------- 235 + P GFD F GF G + Sbjct: 140 --------YHPFRHGFDEFFGFMHEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLI 191 Query: 236 ------YNSPSL------FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 Y+ P + + V Y++D T EA+ ++R + D+PF LYLAYN Sbjct: 192 YSTHMGYDEPDYDANNPIIRGGQPVNETEYLTDAFTREAVSFINRHQ--DKPFFLYLAYN 249 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADN-YYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 A H P D + F + A + S+DQ + +IL+Q++++G + T+I+F Sbjct: 250 AVHSPLQGKKKDI--QHFTQIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVF 307 Query: 343 TSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPT 401 SDNG N +G K Y GG P M W G L P D +S++D +PT Sbjct: 308 LSDNGGPTRELTSSNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPT 367 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 ++ A S+P++ LDG +LLP L +K P + W Sbjct: 368 SVALAGASLPQN--LDGRNLLPLLLQQKTELPVADFFWRQG------------------- 406 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY---TVENNQLGLYKL-TDLQQKDNLAA 517 +R+ D+ +V T E LY L D + +LA Sbjct: 407 -------------------RKAALRSGDWKIVQMRGTREKPVWELYNLANDKSETIDLAT 447 Query: 518 ANPQVVKEMQGVVREFIDSSQPPLS 542 + E+Q E +P L Sbjct: 448 EQSEKRMELQTRWNELNAQMKPALF 472 >UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6LEC5_PARD8 Length = 483 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 122/499 (24%), Positives = 209/499 (41%), Gaps = 57/499 (11%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 + KPNII+L DDLGY + + P+ ++ Sbjct: 22 CDAKEEAVPKPNIIILLADDLGYNDVSCYRNENFPQQSDSFPTS---------------- 65 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--DAQDGIPLTET 165 TP L L +G+RFTN Y VS PSRAA+MTGR R GVY+ ++ + +E Sbjct: 66 QTPNLDLLARQGIRFTNFYCGAAVSSPSRAALMTGRNCTRTGVYNYLEQNSPMHLRDSEV 125 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY- 224 + E+ + Y T GKWHLS ++ P ++GFDY Sbjct: 126 TIAEVLKQADYATGHFGKWHLSSGR---------------------PDQPYPNDQGFDYS 164 Query: 225 FMGFHAAGTAYYNSPSLFKNRERVPA-KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 F + + +++N + F+N E +GY D + EA+ +D+ K +PF L + +N Sbjct: 165 FYALNNSVPSHHNPTNFFRNGEPQGEIEGYSCDIVVTEALQWLDKNKQ--EPFFLNVWFN 222 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 PH P + AP++ +K+ + YY + ++D + +++ LK+ DNTI++F Sbjct: 223 EPHFPME--APEELKKRHAINPE----YYGCIENMDIAIGKLMNYLKEQNLEDNTIVIFA 276 Query: 344 SDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTA 402 SDNG+ D N +G K Y GG P + W + G + D PT Sbjct: 277 SDNGSQWD---YSNLPFRGEKHFNYEGGLRVPCIVRWHKHVPTGVISEFNGCFTDILPTL 333 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFV 462 AD +P D +DG+ + P K + +N + Y H D + ++ Sbjct: 334 ASLADAPVPTDRVIDGMDISPVFLGKAETLERENPLFFFRYIH--DPICMIREGDWCLLG 391 Query: 463 RHQSDDYPHNPNTEDLSQ-FSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANP 520 + + + + L + + + V LY L D +++ ++A +P Sbjct: 392 YDEPLPWAFSLDELALGKVKPWYLTKEHMEFAKKVFPKYFELYNLRDDREERIDVADKHP 451 Query: 521 QVVKEMQGVVREFIDSSQP 539 ++V ++ + + Sbjct: 452 EIVARLKSKMLKLKQEVVA 470 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. Identities = 122/502 (24%), Positives = 198/502 (39%), Gaps = 107/502 (21%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 KPNI+++ DDLGY + + + TP + Sbjct: 16 ATDKPNILIILADDLGYADVGYHG--------------------------LEEIPTPNID 49 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-------GIPLTETF 166 + +EGV+F+ GY + GP+RAA+M+G R G + G+P Sbjct: 50 RIANEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEGICGGRKLNEHVVVGMPREVKT 109 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 L + FQ GY T GKWHL + P +RGFD F Sbjct: 110 LAQYFQEAGYATGLFGKWHLGGER-------------------LFDKTLMPTSRGFDEFF 150 Query: 227 GFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 G + Y ++ + ++ Y +D + EA+ + R D+PF LYL Sbjct: 151 GILEGASLYDDTVNRERKYIRQDTVIDYEGEYFTDAIGREAVSFITR--KGDKPFFLYLP 208 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADN-YYASVYSVDQGVKRILEQLKKNGQYDNTII 340 + A H P A ++Y ++F + + A + ++D + R+ + L+ G DNT+I Sbjct: 209 FTAVHAPMQ--ASEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLI 266 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGNY-DKLISAMDF 398 +F SDNG D LN KG K+Q Y GG P + W G++ G D+ + MD Sbjct: 267 VFWSDNGGKPDNNYSLNHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDI 326 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 +P+AL+AA I++PKD ++ ++LP +Q K PH + W Sbjct: 327 FPSALEAAQITVPKD--IEAKTILPLMQGKTNQTPHPAMFW------------------- 365 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAA 517 VR D+ L + L+ L D+ + N+ Sbjct: 366 -------------------KRAGKMAVRMGDWKL--SNAGGPSELFNLKQDISESRNIID 404 Query: 518 ANPQVVKEMQGVVREFIDSSQP 539 +P + +M + + + P Sbjct: 405 QHPDIANKMNRLWLNWDKKNVP 426 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 430 bits (1107), Expect = e-119, Method: Composition-based stats. Identities = 134/509 (26%), Positives = 198/509 (38%), Gaps = 106/509 (20%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 +PNII++ DD GY + + TP Sbjct: 28 AKEKSRQPNIILVMTDDQGYWD--------------------------TEISGNPKIKTP 61 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 T+ L EGV FT Y V P+RA +MTGR R G+Y+ D + ET + ++ Sbjct: 62 TIKKLAAEGVTFTRFYANM-VCAPTRAGLMTGRHYLRTGLYNTRFGGDTLGPNETTIAQV 120 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 Q GY T GKWHL + + ++QPQ RGFD+F G + Sbjct: 121 LQKAGYKTGLFGKWHLGRYA-----------------------QYQPQRRGFDHFFGHYH 157 Query: 231 AGTAYYNSPS-LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP- 288 Y +P + N V +GY++D TD AI + R + QPF YLAYNAPH P Sbjct: 158 GHIERYTNPDQVVVNGTPVETRGYVTDLFTDAAIDFIQRNQQ--QPFFCYLAYNAPHSPF 215 Query: 289 ------NDNPAPDQYQKQF--NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 P D+ +++ YA + +DQ + R+L+ + T++ Sbjct: 216 LLDTSHFGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVV 275 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFY 399 +FTSDNG V G KG K+ Y GGT P + W G D +++ D + Sbjct: 276 IFTSDNGGVSRG---FKAGLKGSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLF 332 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 PT A + +P ++KLDG S+L ++ PH+ L Sbjct: 333 PTFCQLAGVPVPSNVKLDGESILSLMEQGGGKSPHQYLY--------------------- 371 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL--------GLYKL-TDLQ 510 H D Y NP + + + LV + LY L D Sbjct: 372 ----HTWDRYTPNPY------HRWAIHGPRFKLVGHDPQGKKKKEGEPQGQLYDLQEDPG 421 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQP 539 +K N+A P+ V E++G + Sbjct: 422 EKKNVADQYPEKVSELRGEFLRWFQDVTA 450 >UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH3_9BACT Length = 455 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 132/529 (24%), Positives = 219/529 (41%), Gaps = 108/529 (20%) Query: 35 LKATKTNVAFSDFTPTEYS-TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 + + T + + + KPNIIV+ DD GY + ++ Sbjct: 1 MINSFTKLFLALLCVNFVALADSKPNIIVILSDDQGYADVSYN----------------- 43 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 STP +L GV F GY + V +R+ +MTGR R+G+Y+ Sbjct: 44 -------PEHDDYISTPHTDALAKSGVIFHRGYTSGSVCSTTRSGLMTGRYQQRYGIYTA 96 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + G L F+P + GY + A GKWHL Sbjct: 97 GEGGTGTDLNAKFIPNYLKEAGYKSMAFGKWHLGHEMK---------------------- 134 Query: 214 EWQPQNRGFDYFMGFHAAGTAYY----------NSPSLFKNRERVPAKGYISDQLTDEAI 263 + P +RGFD F GF G + +++ E + KGY++ ++T+E + Sbjct: 135 -YHPLHRGFDDFYGFMGRGAHDFFRLEKEYDGKFGGPIYRGLEPIDDKGYLTTRITEETV 193 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVK 323 ++ K D+PF Y+AYNA H P PA D + +G +T D A + +D GV Sbjct: 194 KFIEENK--DKPFFAYVAYNAVHTPAQAPAEDI---KAVSGDETRDILVAMLKHLDLGVG 248 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 I++ LKK+ Y+NTII++ SDNG + N +G K Y GG P M W + Sbjct: 249 EIVKTLKKHDIYENTIIIYLSDNGGA-KSMVANNKPLRGVKHDIYDGGIRVPFLMSWPAQ 307 Query: 384 LQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 ++ G + + ++D PT LDAA +P +DG S+LP ++ K + W Sbjct: 308 IKAGQDTQSPVISLDILPTLLDAAG--LPALSDIDGESMLPVIRGDKDNL-DRPFFW--- 361 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG 502 + ++ N++ LV+ Sbjct: 362 ----------------------------------NHGDGQTGIQLNNWKLVFN--KGVTE 385 Query: 503 LYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 LYK++ D+ + NLAA++P+ V+ +Q + +++ P+S+ K++ Sbjct: 386 LYKISDDIGESKNLAASHPEKVQALQKIYDKWLSQMATPMSKNTIVKWD 434 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 129/521 (24%), Positives = 206/521 (39%), Gaps = 102/521 (19%) Query: 28 HAADDVKLKATKTNVAFSDFTP--TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 ++ L ++ F+ T + T+ PN+IV+ DD G+G + Sbjct: 8 YSLKGKALICVVCSLLFASCTAKVVQEQTQRPPNVIVILTDDQGWGDFSHTGNEY----- 62 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TP + +EG YV V P+RA+++TGR Sbjct: 63 ---------------------LKTPHFDKMTEEGALLDQFYV-SPVCAPTRASVLTGRYH 100 Query: 146 ARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 R GV T ++ + E + E+F+ GY T GKWH PE+ Sbjct: 101 LRTGVSFVTRGRENMRSEEVTIAEVFKEAGYATGCFGKWHNG----AHYPEN-------- 148 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 PQ +GFD F+GF + + Y L N E KG+I+D L DE I Sbjct: 149 -----------PQGQGFDTFLGFTSGHWSNYFDTELEYNGEMKSTKGFITDVLMDETIQF 197 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS--------QTADNYYASVYS 317 +D K D+PF+ ++ NAPH P PD+Y ++ + Y + Sbjct: 198 IDAHK--DEPFLAFVPLNAPHTPYQ--VPDKYFDKYKDIDFGYDKKQNKKIATIYGMCEN 253 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 +D + ++++ LK +NTI++F SDNG NG +G K+ + GGT P Sbjct: 254 IDDNLGKLMKHLKDQELEENTIVVFLSDNGPQ---GARYNGPWRGGKTSVHEGGTLVPCA 310 Query: 378 MWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 + WKG + + L + +D PT + A I P++++ DG+ L +L +NL Sbjct: 311 IQWKGHIPNSSKSSLTAHIDLMPTLMGLAGIEKPENIQFDGIDLSNYLMGTSDDLGERNL 370 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 Y H N E ++ VR DY +T E Sbjct: 371 -------------------------------YTHMTNFE-ITADRGAVRQGDYR--FTTE 396 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 +GLY L D +++NL P+ +E++ + Sbjct: 397 YGDVGLYNLKEDPSEENNLKDQLPEKTQELKTAFENWYKDV 437 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 430 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 141/559 (25%), Positives = 206/559 (36%), Gaps = 147/559 (26%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 A+ F + +PNI+ + DDLGYG+ G Sbjct: 7 ASVILTLFLFCAQLAIAAPKRPNILFIVADDLGYGEPGCYGG------------------ 48 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-- 154 TP + L+ GVRF++GYV+ SRAA+MTGR RFG N Sbjct: 49 --------KDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNPIG 100 Query: 155 ----DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 D G+P+ E + + ++ GY T VGKWHL + Sbjct: 101 AKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLGGTA-------------------- 140 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP------------------------------- 239 + PQ RGFD F GF G Y P Sbjct: 141 ---PFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWS 197 Query: 240 --------------SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 L +N + V K ++D T EA +DR + QP+ LYLAYNA Sbjct: 198 TDLHENEPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRHQA--QPWFLYLAYNAV 255 Query: 286 HLPNDNPAPDQYQKQFNTGSQ-TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 H P D Y ++F+ + A + +D+ + ++ QL+ +G +NT+++F S Sbjct: 256 HSPLQG--EDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVFLS 313 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTAL 403 DNG N +G K + GG P + WKG++ G D +MD TAL Sbjct: 314 DNGGPTKELTSSNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTATAL 373 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 A + KLDGV LLP L K PH L W + Sbjct: 374 KLAGAET-EQAKLDGVDLLPLLTGKTTAAPHDTLFWRVGRKN------------------ 414 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQV 522 +R+ D+ L+ + + LY L D+ + +N+AA N Sbjct: 415 --------------------ALRHGDWKLLR-QGSKEWQLYDLAHDVGETNNMAAQNAAR 453 Query: 523 VKEMQGVVREFIDSSQPPL 541 V E+ + ++ PL Sbjct: 454 VTELSALWDKWNSEQIDPL 472 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 430 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 134/530 (25%), Positives = 205/530 (38%), Gaps = 76/530 (14%) Query: 26 AAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + H L + F ++ +PN+++L DDLGY + G Sbjct: 15 SPHRFWCTVLLLITPTLTFGQLAGETHAQTLRPNVVMLLADDLGYRDVGCYGG------- 67 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TPT+ L G RF Y V PSRA +MTGR Sbjct: 68 --------------------PVETPTIDQLAAGGTRFQQFYSGCAVCSPSRATLMTGRHH 107 Query: 146 ARFGVYSNT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 R GVYS + L E L E+ ++ GY TA VGKWHL Sbjct: 108 IRAGVYSWIQDESQNSHLRLREVTLAEVLRDAGYATAHVGKWHLG--------------- 152 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMG-FHAAGTAYYNSPSLFKNRERV-PAKGYISDQLTD 260 T ++ P GFD++ ++ A ++ N + +N E V +GY + D Sbjct: 153 ----LPTEERDKPTPDQHGFDHWFATWNNAQPSHRNPDNFIRNGEPVGQLEGYSCQLVAD 208 Query: 261 EAIGVVDRAK--TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 EAI +DR + DQPF L + ++ PH P APD+ +++ S Y ++ + Sbjct: 209 EAIRWMDRHRESDPDQPFFLNVWFHEPHAPI--AAPDEVTQKYGKLSDKGAVYSGTIDNT 266 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 DQ +KR+L +L G +NT+I++ SDNG+ + G +G K + GG P Sbjct: 267 DQAIKRLLAKLDALGVRENTLIVYASDNGSYRTDRV---GKLRGRKGANWEGGIRVPGIF 323 Query: 379 WWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG-EPHKN 436 W G + G ++ +D PT IS P+ + LDG L P L E H+ Sbjct: 324 HWPGHIPAGVVSNEPAGLVDVLPTICGLLKISPPQ-VHLDGSDLTPLLTGHADSFERHQP 382 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W + DY + ++ ++N Y Sbjct: 383 LFW------HLQRSQPIVAMRDGDYSLVGFRDYEMSNKNLFEEKWIPAIKNGTY------ 430 Query: 497 ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 + LY L D Q NLAA P+ V+ M+ + + + + Sbjct: 431 --HNFELYNLKDDPGQTKNLAAEQPERVEAMKQRMLQINAGIMKDAMDWH 478 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 430 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 137/580 (23%), Positives = 211/580 (36%), Gaps = 151/580 (26%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L V + S +PN++ + +DD G+ + ++ F Sbjct: 8 LLTLAGGVTLAANMLHAASDNRQPNVVFIMVDDYGWADVGYNGSRF-------------- 53 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + L EG+ FT+GY A +S PSR ++MTG+ PAR G+ Sbjct: 54 ------------YETPNIDRLASEGMIFTDGYAAASISSPSRVSLMTGKYPARTGITDWI 101 Query: 155 DA--------------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 +PL E + E F+ HGY T VGKWH ++ S Sbjct: 102 PGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHVGKWHCAEDS---- 157 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA-----------AGTAYYNSPSLFK 243 + PQ +GFD +G G Y SP Sbjct: 158 -------------------LYYPQYQGFDVNIGGWLKGSPNGIRRSQGGKGAYCSPYRNP 198 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 P +++D+L DE+I ++ + + D+PF LYLA+ A H P + A +Y K F Sbjct: 199 YLPDGPEGEFLTDRLGDESIKLI-KNSSADKPFFLYLAFYAVHTPIE--AKPEYVKYFKW 255 Query: 304 GSQTAD---------------------------------NYYASVYSVDQGVKRILEQLK 330 +Q Y A +YS+D+ V R+++ LK Sbjct: 256 KAQRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQALK 315 Query: 331 KNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 NG NTI+ SDNG +G N + K Y GG P + + ++ G+ Sbjct: 316 DNGLDKNTIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVEAGS 375 Query: 389 Y-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 + A+DFYPT LD A + + +DG SLLP L+ + + Sbjct: 376 VCHTPVVAVDFYPTLLDMAGLPLKSHQHVDGKSLLPLLKGDQAYDRGPIFF--------- 426 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL- 506 YPH D + VR DY L+ E+ + LY L Sbjct: 427 --------------------HYPHYGGKGDT--PAGAVRMGDYKLIEFYEDGHVELYNLK 464 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 D+ + +L+ EMQ ++ + + N Sbjct: 465 NDISETRDLSKTEKDKAAEMQKMLHRWRTDCNAKMPTRNP 504 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 125/521 (23%), Positives = 216/521 (41%), Gaps = 68/521 (13%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + + TN + + + +PNI+V+ DDLGYG L Sbjct: 10 IVISILFTNESLAAEPTASVKSPARPNIMVVLCDDLGYGDLACYG--------------- 54 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 +P + EG++ T+ Y AH PSRA +MTGR P R G+Y+ Sbjct: 55 -----------HPVIQSPNIDRFAKEGLKLTSCYAAHPNCSPSRAGLMTGRTPFRVGIYN 103 Query: 153 NTD--AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 + + E + L + GY T VGKWHL+ + N+ Sbjct: 104 WIPMLSPMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNM------------------ 145 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERV-PAKGYISDQLTDEAIGVVDR 268 + QP + GFD++ + +P + +N V P +G+ S + DEA + + Sbjct: 146 -VGQPQPSDHGFDHWFSTQNNALPTHENPFNFVRNARPVGPLQGFASQLVADEAEEWLTQ 204 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILE 327 + ++PF +++ ++ PH P + ++++K + T ++ +V +D RIL+ Sbjct: 205 LRDKEKPFFMFVCFHEPHEPI--ASAERFRKLYTAPEGSTLPAHHGNVTQMDDAFGRILK 262 Query: 328 QLKKNGQYDNTIILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 L +NT+I+FTSDNG I P +G + K TY GG P + W +Q Sbjct: 263 TLDDQKLRENTLIIFTSDNGPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWPEHVQ 322 Query: 386 PGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 PG D + +D PT ADI P D LDG ++LP L+ K K L W ++ Sbjct: 323 PGTTSDVPVCGVDILPTLCAVADIPAPTDRVLDGTNILPLLEGKPI-LRKKPLYW--QFN 379 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLY 504 ++ + D K + + P + S + + V + LY Sbjct: 380 RAKNDAKVALRDGEWKLLAKLNVPSP---------KPSGGITTEEIDAVKNAKLEGFELY 430 Query: 505 KL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 + +D+ + + A + +++K+M+ ++ D Q Sbjct: 431 HIQSDIAETTDRAESEQEILKKMKQQMQAIFDEVQAEAPRW 471 >UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R917_9PLAN Length = 486 Score = 429 bits (1103), Expect = e-118, Method: Composition-based stats. Identities = 128/535 (23%), Positives = 212/535 (39%), Gaps = 112/535 (20%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 + KL+ + + + +PNI+ + DDLG+ + F+ + Sbjct: 2 NLTKLELWAAVLLVAFTAVASQAADRQPNIVHIVADDLGWKDVGFNGCT----------- 50 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 + TP + +L G +F+ YV + + P+RA +MTGR P R+G+ Sbjct: 51 ---------------EIKTPNIDALAKGGAKFSQFYVQN-MCTPTRACLMTGRFPYRYGL 94 Query: 151 YS---NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + T A G+ +E +P+ + GY TA +GKWHL Sbjct: 95 QTIVIPTAAGYGLDTSEYLMPQCLGDAGYKTAIIGKWHLGHAD----------------- 137 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-----SLFKNRERVPAKGYISDQLTDEA 262 +++ P+ RGFDY G Y+ F++ + V +GY + + D+A Sbjct: 138 -----QKYWPKQRGFDYQYGAMIGELDYFTHDEHGVLDWFRDNKPVHEQGYTTTLIGDDA 192 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 + + + +PF LYL +NAPH P P + K N T Y A V +D+ + Sbjct: 193 VKYI-HGQDGKKPFYLYLTFNAPHTPYQAPK-EYITKYLNIAEPTRRTYAAMVDCLDENI 250 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-------------LNGAQKGYKSQTYP 369 +++ L + G +NT+I F SDNG D NG + K + Sbjct: 251 GKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAGQMADMSKVVLPCDNGPYRNGKGSLFE 310 Query: 370 GGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 GG+ W GK++ D +I A+D YPT A SI K LDG ++ + + K Sbjct: 311 GGSRVCALANWPGKIKAQTVDGMIHAVDLYPTFAALAGASIAKCKPLDGTNVWDTIAEGK 370 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 P + + F +R D Sbjct: 371 -PSPRTEFFY-------------------------------------SIEPFRAGLRQGD 392 Query: 490 YSLVY-TVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 + L++ T+ + + LY L D +K+N+AAA+P V MQ + + PL+ Sbjct: 393 WKLIWRTMLPSSVDLYNLAEDPYEKNNIAAAHPDKVATMQARIETASKDAAKPLA 447 >UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI0_9BACT Length = 456 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 140/523 (26%), Positives = 208/523 (39%), Gaps = 117/523 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 ++ KPNII + DD+GYGQL GS+ K TP Sbjct: 13 AANSADKPNIIFIMCDDMGYGQL----GSYGQKM----------------------IKTP 46 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---AQDGIPLTETFL 167 L + EG+R T+ Y V PSR ++MTG+ + N + Q+ IP + Sbjct: 47 RLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTGQHVGHTYIRGNKEYPTGQEPIPAETITV 106 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 E + GY TA +GKW L + E +P +GFDYF G Sbjct: 107 AEKMKEAGYATALIGKWGLG----------------------YPGSEGEPNKQGFDYFFG 144 Query: 228 FHAAGTAYYNSPS-LFKNRERVPAK-------GYISDQLTDEAIGVVDRAKTLDQPFMLY 279 ++ A+ + P L +N E + K Y LTDEA G + + K D PF LY Sbjct: 145 YNDQKHAHNHFPKFLLRNEETLTLKNNSGKEIEYSQYMLTDEAKGFIKKNK--DNPFFLY 202 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNT--GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 LAY PH P D+ Q+ + + + +D+ V IL+ LK+ +N Sbjct: 203 LAYVIPHSRLQIPGDDECYLQYKDESWPEKQKKHAGMISRLDKDVGSILDLLKEMNLAEN 262 Query: 338 TIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL 392 T+++FTSDNGA +G +G G K Y GG P W G ++PG Sbjct: 263 TLVVFTSDNGAHREGGARPEFFNDSGPLSGIKRSMYEGGVRVPFIAHWPGVIKPGQVSNH 322 Query: 393 I-SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEE 450 I + D PTA + + P+ +DG+S +P L+ ++ E H L + HW + Sbjct: 323 IGAHWDLMPTACELGGVQPPEG--IDGISYVPLLKGNMEEQEKHDYLYFEL---HWPTKR 377 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY------TVENNQLGLY 504 VR D+ + + + L+ Sbjct: 378 G---------------------------------VRKGDWVALQSKTSAIDPNKDTIKLF 404 Query: 505 KL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 L DL QK +LA P+ V+E + + E + PL E Q Sbjct: 405 NLKNDLGQKKDLATQYPEKVEEFKKIFLEAH--TPAPLFEFGQ 445 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 140/535 (26%), Positives = 219/535 (40%), Gaps = 122/535 (22%) Query: 45 SDFTPTEYST-KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIE 103 + F P E KPN++++ +DDLG + + SF Sbjct: 15 APFFPVEAKEMADKPNVLLIFIDDLGKTDIGIEGSSF----------------------- 51 Query: 104 AAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--GIP 161 TP + +L G RFT Y AH V P+RAA+MTG+ P R G+ + +P Sbjct: 52 ---YETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWIRPESDVALP 108 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 +E + + FQ GY+TA +GKWHL + P RG Sbjct: 109 QSEVTIGQAFQEAGYHTAYLGKWHLGH-----------------------KPQQHPAARG 145 Query: 222 FDYFMGFHAAG--TAYY---------NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 FD+ G + G ++YY ++P+ + E+ + Y++D LT AI + + + Sbjct: 146 FDWTKGVNHGGQPSSYYFPYKNPQKPDAPNNVPDFEKCQPEDYLTDVLTSSAIEHL-QQR 204 Query: 271 TLDQPFMLYLAYNAPHLPNDNPA--PDQYQKQFNTGS-------------------QTAD 309 +PF L LA+ A H P P ++YQ + T Q Sbjct: 205 DRTRPFFLCLAHYAVHTPIQPPKNLVEKYQVKLATQKNPKSPGEGIQEGSAISRSQQDHP 264 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLP---LNGAQKGYK 364 Y A V ++D V R+L++LK G D TI++FTSDNG ++G P N + K Sbjct: 265 AYAAMVENLDTQVGRLLDELKTQGILDQTIVVFTSDNGGLCTLNGKSPGPTCNLPLRAGK 324 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 TY GG P ++ W GK+ P D D YPT L I +DG+SL Sbjct: 325 GWTYEGGIRIPTYISWPGKISPQVLDIPAYTCDIYPTLLSLCQIPPRPTQHVDGISLAGL 384 Query: 425 LQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L E + L W ++H + S Sbjct: 385 LTKSSSLPESERTLVWYYPHTHGSGH------------------------------KPSA 414 Query: 484 TVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 +R + L++ +E +++ LY L D + NLA+ +P+ ++Q +++ I+SS Sbjct: 415 AIRQGPWKLIHFLETDRIELYHLEDDPGESRNLASKHPERALQLQKELQKIIESS 469 >UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7HQ00_PARL1 Length = 553 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 135/558 (24%), Positives = 215/558 (38%), Gaps = 127/558 (22%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 E + PNI+V+ DDLG+ + G Sbjct: 62 AAEPAGNRPPNIVVILADDLGFNDISHFGGGI--------------------------VP 95 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG---------------VYSN 153 TP + S+ G FT+ Y PSRA IMTGR R G ++ N Sbjct: 96 TPNIDSIARGGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTPTPPGMTRIVDMFYN 155 Query: 154 TDAQD-------------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 + G+P +E L E + GY+ +GKWHL Sbjct: 156 DGTRTHEMLVDREAAAKAPPFREQGLPGSEITLAEALKPKGYHNIHIGKWHLGN-----A 210 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQN--------RGFDYFMGFHAAGTAY---YNSPSLFK 243 PE D + + P++ FD F A Y YN + F+ Sbjct: 211 PEFLPNAQGFDESVMLESGLFLPEDSPDVVNAKLPFDPIDQFLWARMQYATSYNGSAWFE 270 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 KGY++D TDEAI ++ + ++PF LYLA+ H P D Y + Sbjct: 271 ------PKGYLTDFYTDEAIKAIEANR--NRPFFLYLAHWGVHTPLQASKAD-YDALSHI 321 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKG 362 + Y A + ++D+ V R+L+ LK+NG +NT+++F+SDNGA LP +N +G Sbjct: 322 EDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPGYIGLPDVNKPYRG 381 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 +K + GG P F W ++ G ++ +D +PT + AA +P D +DG+ L Sbjct: 382 WKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHLDMFPTIVAAAGGELPADRVIDGIDL 441 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 LP+ ++ P + + W + Sbjct: 442 LPYAARGEKPAP-RPIFWRDGHY------------------------------------- 463 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 V+ + + L N+ L+ L TD +++N+A NP+ V E++ +V + + P Sbjct: 464 -QAVQADGWKLQMAERPNKTWLFNLKTDPTEQNNVADENPEKVAELKALVEAHNATQREP 522 Query: 541 LSEVNQEKFNNIKKALSE 558 L E + K L E Sbjct: 523 LFPAVAEMPVTVDKTLEE 540 >UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAY0_9PLAN Length = 466 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 139/528 (26%), Positives = 210/528 (39%), Gaps = 121/528 (22%) Query: 42 VAFSDFTPTEYSTK---GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 V+FS P + K +PNI+++T D+LGYG L G + M Sbjct: 16 VSFSVPAPVTAAEKPENKRPNILLITADNLGYGDL----GCYGNPVM------------- 58 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-- 156 TP L L EGVR T+ Y A SRA ++TGR P R G+ A Sbjct: 59 ---------KTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNHQLSADE 109 Query: 157 --QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 DG+ +E +PE + GY TA GKW++ + Sbjct: 110 NYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVG-----------------------FSPG 146 Query: 215 WQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 +P RGFD F GF A YY+ L++ + V +GY +D D A + + Sbjct: 147 SRPTERGFDEFFGFAAGNIDYYHHYYAGRHDLWRGLKEVFVEGYSTDLFADAACQYI--S 204 Query: 270 KTLDQPFMLYLAYNAPHLP----------NDNPAPDQYQKQFNTGSQ---TADNYYASVY 316 DQPF +YL +NAPH P N+ APD +++ Q + Y A V Sbjct: 205 AESDQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQERYRAVVT 264 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL----NGAQKGYKSQTYPGGT 372 ++D + R+L+QL +G D TI+++ SDNGA + L N + + GG Sbjct: 265 ALDSAIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEVASNKPLRDGGVTLWEGGI 324 Query: 373 HTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P + + G L+ G ++ + ++D PT + A +P + LDG +LP L + Sbjct: 325 RVPAIIRYPGHLKAGTVNQSPLISLDILPTLITLAGGPLPAERILDGQDMLPALAAQTAP 384 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 EP ++S VR Y Sbjct: 385 EPRTFFFQYRNFS---------------------------------------AVRRGKYK 405 Query: 492 LVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 LV N L+ L DL + +LA NP+V+ ++Q ++ Sbjct: 406 LVRIKPNQPFMLFDLEQDLSETTDLAERNPKVLNQLQQAYADWEREVA 453 >UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP3_9BACT Length = 465 Score = 426 bits (1097), Expect = e-118, Method: Composition-based stats. Identities = 138/520 (26%), Positives = 216/520 (41%), Gaps = 115/520 (22%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 + ++ KPNIIV+ DDLGYG + + + + Sbjct: 14 ASLSASAAKPNIIVILADDLGYGDVSYHG-------------------------TLKETT 48 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-------GIP 161 TP + S+ G F NGY A V GPSRA +++GR RFG Y N G+P Sbjct: 49 TPHIDSIAQSGAWFQNGYSAAPVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLP 108 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 L++ +PE+ GY T VGKWH ++ P NRG Sbjct: 109 LSQKLIPEILVKEGYATGMVGKWHDGDQ-----------------------HKFWPYNRG 145 Query: 222 FDYFMGFHAAGTAYY----------NSPSLFKNRERVPAKG-YISDQLTDEAIGVVDRAK 270 F F GF+ + ++ + +RV G Y+++ EA+ +DR K Sbjct: 146 FQEFYGFNNGAINNWVLKGENHTVDEWGAVHRENKRVENSGEYMTEAFGREAVEFIDRHK 205 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY-YASVYSVDQGVKRILEQL 329 T +PF LYL++NA H P AP Y QF A + S+D + +LE+L Sbjct: 206 T--EPFFLYLSFNAVHGPLQ--APKSYTNQFKHIKPENRALCLAMLKSMDDNIGLVLEKL 261 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN- 388 +K G +NTII FTSDNG + G NG +G K+ + GG H P + WK ++ Sbjct: 262 RKEGLEENTIIFFTSDNGGKLKGNYSFNGKYRGEKNTVFDGGLHVPYAVQWKAQIPAQTK 321 Query: 389 -YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 + + ++D T AA + I + KLDG +LLP+L+++ + +NL W Sbjct: 322 ALEAPVHSIDLAHTIFAAAGVEIKDEYKLDGRNLLPYLKNQSDFD-DRNLYW-------- 372 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL- 506 + + +R+N + Y + + L+ L Sbjct: 373 ------------------------------ANNANIAIRDNKWK--YLKQAGKTYLFNLE 400 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 D + +NL + P+ ++MQ + ++ P L N Sbjct: 401 EDPYESNNLVSQYPEKAQDMQKRHDAWQANNAPQLFGWNP 440 >UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA5_RHOBA Length = 562 Score = 426 bits (1096), Expect = e-117, Method: Composition-based stats. Identities = 137/563 (24%), Positives = 225/563 (39%), Gaps = 95/563 (16%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAAD-----DVKLKATKTNVAFSDFTPTEYSTK 55 + A + T+ SL L+ A F H + D A V FT E Sbjct: 63 LPHARVSLHIRTNESLTLSLTHATFHPHTPNMKHCIDSLAIAIVAVVFLGSFT--EAHAD 120 Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PNII+L DDLGYG L TP L L Sbjct: 121 DRPNIILLLADDLGYGDLSCFGS--------------------------PAVKTPHLDRL 154 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPELFQNH 174 EG++ Y V P+RA+++TGR P RFG+ + + ++G +P + T + EL ++ Sbjct: 155 ASEGLKCNRFYAGSAVCSPTRASVLTGRYPLRFGITKHFNDRNGWLPESATTVAELLKDA 214 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF--------- 225 GY TA +GKWHL + + D + P+ GFD++ Sbjct: 215 GYNTAHIGKWHLGGL-------------HVDEPGKRLTNQPGPRQHGFDFYQTQIEQQPL 261 Query: 226 MGFHAAGTAYY--NSPSLFKNRERVPAKG-----YISDQLTDEAIGVVDRAKTLDQPFML 278 G + L +N +R+ + +D D A+ ++++ + + PF + Sbjct: 262 RGQMGRDKTLFRKGGTVLLRNDQRISQDDPYYHKHFTDANGDFAVEMIEKLSSEEDPFFI 321 Query: 279 YLAYNAPHLPND-NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 + + PH P + P P + + + + V +D V IL +L + DN Sbjct: 322 NMWWLVPHKPYEPAPEPHWSDTAADDITDDQHRFRSMVQHMDAKVGAILRKLDELKIADN 381 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM- 396 T++LFTSDNGA +G + KG K++ + GG PM + W + G + S Sbjct: 382 TLVLFTSDNGAAFEGFIHD---LKGGKTELHDGGIRVPMIVRWPDAIPAGQTSQTFSHTN 438 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW- 455 D PT DAA + +P DL LDG+SLL + E FW Sbjct: 439 DLLPTFCDAASVQLPSDLPLDGLSLLSHWKGGTPPSQ--------------VERGTVFWQ 484 Query: 456 -DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKD 513 D Y RH P+ V ++ L+ + + L+ + D +K Sbjct: 485 LDLYKSLQRHYPKPKPYATE---------VVMRGNWKLL-AFKGKPVELFDVGADPNEKR 534 Query: 514 NLAAANPQVVKEMQGVVREFIDS 536 N+ A +P++V + ++++++ Sbjct: 535 NVLAEHPELVASLSAQLKDWLNE 557 >UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTV0_9BACE Length = 459 Score = 426 bits (1095), Expect = e-117, Method: Composition-based stats. Identities = 131/541 (24%), Positives = 212/541 (39%), Gaps = 118/541 (21%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 A + F+P E + +PN +++ DD+GYG + + Sbjct: 10 AATCALAAFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEY------------------ 51 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------Y 151 TP + + EG+ FT+ + VS P+R ++TGR R G+ Sbjct: 52 --------IKTPNIDQIAREGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPR 103 Query: 152 SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 + D + G+P E ++ ++GY TA +GKWHL + Sbjct: 104 DDKDKEVGLPSEEITFAKILGDNGYRTALIGKWHLGYL---------------------- 141 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAY------YNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 ++ P N GF F+GF + Y Y + E GY + LT + Sbjct: 142 -QKHHPMNFGFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEMKDMSGYTTTLLTTLSEDY 200 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQ------KQFNTGSQTADNYYASVYSVD 319 + K D+PF LY+A+ APH P P + N+ + Y V +D Sbjct: 201 IKENK--DKPFCLYIAHAAPHSPMQGPDEKAVRTEATPEGDKNSDRSNKEIYKDMVEELD 258 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW 379 V RILE LKK +NT ++F SDNG VI+ G KG K + GG P + Sbjct: 259 WSVGRILETLKKYKLDENTFVVFFSDNGPVINNG-GSAGGYKGAKGSPWEGGHRVPGICY 317 Query: 380 WKGKLQPG-NYDKLISAMDFYPTALDAADISIPKD-LKLDGVSLLPWLQDKKQGEPHKNL 437 G ++ G ++ + + D +PT LD ADI KLDG SL+P + + Sbjct: 318 MPGTIKEGTTCEQTVMSFDLFPTMLDMADIHYDDSKKKLDGTSLVPLFKGENLA------ 371 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 + FW N +K + +VR+ + LV + Sbjct: 372 ------------PRLLFWGNGNKTI---------------------SVRDGKWKLVRYNQ 398 Query: 498 NN--QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 L L+ L D +K+NL+ P++V+ + + + +S SEV + +++ Sbjct: 399 KGGITLHLFDLNNDPYEKNNLSKQEPELVERLDKEITRWAESV---YSEVPDQFARKVQR 455 Query: 555 A 555 Sbjct: 456 T 456 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 426 bits (1095), Expect = e-117, Method: Composition-based stats. Identities = 129/554 (23%), Positives = 205/554 (37%), Gaps = 128/554 (23%) Query: 35 LKATKTNVAFSDFTPTEYSTKG-KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 + +A + P+ + KPNI+++ DDLG F+ G + Sbjct: 2 FRTAAVFLAVALLAPSGRAADAPKPNIVLIVADDLG----CFELGCYGQ----------- 46 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 + TP + L G +FT Y V PSR +MTG+ V +N Sbjct: 47 -----------TKIKTPHIDKLAQGGAKFTRFYSGSPVCAPSRCVLMTGKHSGHATVRNN 95 Query: 154 T----DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + Q I + + + + HGY T A+GKW L Sbjct: 96 VEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMFDTAG---------------- 139 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSL-FKNRERVPAKG--------YISDQLTD 260 P GFD F G++ A+ + P+ ++N +RV KG + D + Sbjct: 140 ------SPLKHGFDLFFGYNCQRHAHSHYPTYIYRNDKRVELKGNDGKTGKQFTQDLFEE 193 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQ------------FNTGSQ 306 EA+G ++ K +PF LYL + PH+ P ++Y+ Q + Sbjct: 194 EALGFIEANKA--KPFFLYLPFTVPHVAVQVPEDSLNEYKGQLGDDPAYDGKKGYQPHPA 251 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP------LPLNGAQ 360 Y A V +D+ V R++E+L G NT++LFTSDNG + G Sbjct: 252 PHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNGPTHNVGGADSSFFNSAGKL 311 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGV 419 +G K Y GG P + G ++ G D + D PT A P +DG+ Sbjct: 312 RGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGTKAPS--AIDGI 369 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 S LP L+ +KQ H L W S Sbjct: 370 SFLPLLKGEKQPT-HDFLYWEFSGYG---------------------------------- 394 Query: 480 QFSYTVRNNDYSLVYT---VENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR-EFI 534 V ++ V + + LY L D +K+++AA NP V+ ++ ++ E Sbjct: 395 -GQQAVIEGEWKAVRQALGMGGVKTELYNLAKDPSEKEDVAAKNPAVLARLEKRLKNEHT 453 Query: 535 DSSQPPLSEVNQEK 548 +S PL ++ +K Sbjct: 454 PNSNFPLQTIDPKK 467 >UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB0_50A10 RepID=A4GJF1_9BACT Length = 544 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. Identities = 147/591 (24%), Positives = 239/591 (40%), Gaps = 114/591 (19%) Query: 10 VSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYST-------KGKPNIIV 62 + S+ +++ SG A+ V +NV + PT +S +PNII+ Sbjct: 5 LMVSLMVLIVSGFVAWEYKVNILVWAIPKISNVTVQENIPTTWSKGPDTPVDDNRPNIIL 64 Query: 63 LTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRF 122 + DD+GY + G T + +L G+ F Sbjct: 65 VLADDMGYNDISIHNG----------------------GAADGTLQTKNIDALAKSGILF 102 Query: 123 TNGYVAHGVSGPSRAAIMTGRAPARFG--------------------------------V 150 T GY A+ PSRA+IMTG+ P RFG V Sbjct: 103 TRGYAANATCAPSRASIMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREV 162 Query: 151 YSNTDA--QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 SN + G+P + + E+ ++ GYYTA +GKWHL + P + +D Sbjct: 163 VSNMPPFMEQGMPTEQITIAEVLRDAGYYTAHIGKWHLGHEYGM-DPMSQGFQDSLGLVG 221 Query: 209 TFSAEEWQPQ--NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 E P N FD + G Y++ F + Y++D TDEA+ V+ Sbjct: 222 PLYLPEDHPDVVNAKFDTRIDKMIWGMGQYSA--NFNGGDLFAPDKYVTDYYTDEALKVI 279 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRIL 326 + K ++PF LYL++ A H P D +++ + Y + S+D+ V +I+ Sbjct: 280 ENNK--NRPFFLYLSHWAIHNPLQALRSD-FEQMSHMHGHNLQVYSGMINSLDRSVGKII 336 Query: 327 EQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 E+LK+ Y T+I+FTSDNG L +N +G+K + GG P + W ++ Sbjct: 337 EKLKELDIYGKTLIIFTSDNGGANYIELNDINKPYRGWKISFFDGGIRVPYIISWPDEIN 396 Query: 386 PGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 PG + + D +PT L AA I +LDGV L+P++++ +PHK L W Sbjct: 397 PGKKSENAVHHFDIFPTILKAAGIE--STNELDGVDLMPFIKNDSSSKPHKTLFWR---- 450 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLY 504 S +V + + + + + N L+ Sbjct: 451 ----------------------------------SGNHQSVLHEHWKFIISKKENFRWLF 476 Query: 505 KLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 + D +K+NL +NP VVKE++ ++ EF + PL + + I K Sbjct: 477 DTSADPTEKNNLVDSNPDVVKEIEELLVEFNSEQKDPLFPSSYDTPIMIDK 527 >UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVD2_9BACT Length = 631 Score = 425 bits (1093), Expect = e-117, Method: Composition-based stats. Identities = 138/555 (24%), Positives = 216/555 (38%), Gaps = 128/555 (23%) Query: 32 DVKLKATKTNVAFSDFT------PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + A A S F + ++ KPNI+ + DDLG L Sbjct: 2 FPRALALSLCFAVSLFAKDGDGGASAPKSRDKPNIVFILCDDLGVNDLSCYG-------- 53 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 + TP L L EG+RFT Y A + SRAAIMTG+AP Sbjct: 54 ------------------RKDQQTPNLDRLAGEGMRFTCAYCASPICSASRAAIMTGKAP 95 Query: 146 ARFGVYSNTDAQ--------------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISN 191 R + + + +PL E + + GY +A +GKWHL Sbjct: 96 GRVHITNFLPGRADAPSQKFIQPEIEGQLPLEENTIAKALHGAGYVSACIGKWHLG---- 151 Query: 192 VPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAK 251 + + P N+GFDY HA PS + + Sbjct: 152 --------------------GKGFLPTNQGFDYAFAGHAN-----TKPSATEGGKGEYE- 185 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 LT EA +++ K D PF LYLA+N+PH+P P+ +K + + Y Sbjct: 186 ------LTAEAERWLEKNK--DHPFFLYLAHNSPHVPL-AAKPELIEKHKDAWNP---IY 233 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA-----VIDGPLPLNGAQKGYKSQ 366 A + S+D V RI++++ + G + TI +FTSDNG + + P N + K Sbjct: 234 AAMIESLDDCVGRIMKKVDELGLTEKTIFIFTSDNGGLHVYELPNTPSTYNAPFRAGKGY 293 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDL-KLDGVSLLPW 424 GG P+ + W GK++ G ++ + DF PT + AA + + + LDGV++LP Sbjct: 294 LEEGGLREPLIVRWPGKIKAGATNETPVVLYDFMPTLMTAAGLDVAHTVGPLDGVNILPL 353 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L P + L W +P+ N S+ + Sbjct: 354 LTGGTI--PPRTLYW----------------------------HFPNYTNQG--SKPAGA 381 Query: 485 VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 +R+ ++ L+ E L LY + D +K++LA + V E+QG + + S + Sbjct: 382 IRDGEWKLIQDDETGNLELYNIAADPGEKNDLAKSQSARVSELQGKLAAWRKSIGAQMGT 441 Query: 544 VNQEKFNNIKKALSE 558 N + K L E Sbjct: 442 ANPNFDSAFHKRLYE 456 >UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 Tax=Alphaproteobacteria RepID=B9KQS8_RHOSK Length = 509 Score = 425 bits (1093), Expect = e-117, Method: Composition-based stats. Identities = 112/519 (21%), Positives = 194/519 (37%), Gaps = 113/519 (21%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P +P+I+ + +DDLGY + + + Sbjct: 55 PARAQEVARPHILYILVDDLGYADVGYH---------------------------GSDVK 87 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTET 165 TP + L EG R Y + P+RAA+MTGR P R+G+ + + G+ E Sbjct: 88 TPNVDRLAAEGARLMQFY-TQPLCTPTRAALMTGRYPMRYGLQTGVIPSGGRYGLDTAEV 146 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LP++ + GY TA VGKWHL +++ P+ RG DYF Sbjct: 147 LLPQVLKEAGYKTALVGKWHLGHAD----------------------QKYWPRQRGVDYF 184 Query: 226 MGFHAAGTAYYNSP-----SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G ++ +++ E V GY ++ +AI +++ + P +YL Sbjct: 185 YGPLVGEIDHFKHEAHGITDWYRDNEMVKEPGYDTELFGADAIRLIEEHDSA-TPLYMYL 243 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 ++ APH P APD+Y+ + + Y A + +D V +L+ L++ G ++T+ Sbjct: 244 SFTAPHTPYQ--APDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVLQALERRGMREDTL 301 Query: 340 ILFTSDNGAVIDGPL-----------PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 ++F SDNG P N + K Y GGT W G++ G Sbjct: 302 VIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALANWPGRIPAGE 361 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 ++ +D PT A I +LDG+ + + K + + Sbjct: 362 THGMMHVVDMLPTLAGLAQAEIAHAGQLDGMDVWQAISAGKASPREEVVY---------- 411 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY-TVENNQLGLYKL- 506 ++ +R+ + L + + ++ L+ L Sbjct: 412 ----------------------------NIEPTQGALRDGKWKLYWQPILPPKVELFDLE 443 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 D + +L+A P+ + MQ V + S PPL N Sbjct: 444 ADPSETTDLSAKEPEQLARMQARVIDLARSMAPPLFYAN 482 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 425 bits (1092), Expect = e-117, Method: Composition-based stats. Identities = 125/548 (22%), Positives = 211/548 (38%), Gaps = 111/548 (20%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 + ++ + + + +PNI+ + +DD+G+ F Sbjct: 8 QSLLFAVCLLLISVTALHAEQKISADRPNIVFILIDDMGWPDPVSYGNQFHD-------- 59 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 TP + L +GVRFT+ Y A V P+RA+I G+ AR + Sbjct: 60 ------------------TPHIDQLASDGVRFTDFYAACPVCSPTRASIQAGQYQARLHL 101 Query: 151 YSNTDAQD-------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPED 197 +PL EL Q+ Y TA GKWHL S+ P Sbjct: 102 TDFIPGHWRPFEKLIVPENAPHLPLEIVTPGELLQSANYNTAYFGKWHLGPESHNP---- 157 Query: 198 KQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ 257 + + + P+ R +PS R+P K Y++D Sbjct: 158 --DQQGYQTSLVTGGRHFAPRFR----------------TTPS-----TRIPNKAYLADF 194 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA--PDQYQKQFN-TGSQTADNYYAS 314 LTD+ I + + K+ +PF + L++ A H+P + +YQ++ Y A Sbjct: 195 LTDKTIEFIRQNKS--KPFFVQLSHYAVHIPLEAKQQMIRKYQQKPKPAYGINNPVYAAM 252 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG-----PLPLNGAQKGYKSQTYP 369 V VD V RI+ L++ +NT+++FTSDNG + + N + K Y Sbjct: 253 VAHVDDSVGRIVAALEELKLTENTVVIFTSDNGGLRQSFSGGDIVSTNAPLRDEKGSLYE 312 Query: 370 GGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 GG P+ + W G G + ++DF+PT + A ++ + +DG+SLLP L+D Sbjct: 313 GGIRVPLIIKWPGVAAAGKTCAEPTISIDFWPTFAEIAHTTLQEHQTIDGLSLLPLLKDP 372 Query: 429 KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNN 488 + + + YPH + S + +R Sbjct: 373 SSHLNREEIYF----------------------------HYPHYHH----STPASAIRAG 400 Query: 489 DYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 D+ L+ + L LY L DL + NLAA NP+ E+Q + ++ + L V Sbjct: 401 DWKLIEFFADGNLELYNLQQDLSETTNLAAKNPEKAVELQQKLADWRTRTGAALP-VKNP 459 Query: 548 KFNNIKKA 555 K++ + + Sbjct: 460 KYDPARAS 467 >UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN Length = 455 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. Identities = 121/522 (23%), Positives = 209/522 (40%), Gaps = 104/522 (19%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + + T + +PNI+ L DDLG + + Sbjct: 11 IALTLASVATTFATDAPRPNIVFLLADDLGGADVSW------------------------ 46 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DA 156 + TP L +L + G + YV V P+R+A++TGR P R+G+ A Sbjct: 47 ---RGSPIKTPQLDALANSGAKLEQFYV-QPVCSPTRSALLTGRYPMRYGLQVGVVRPWA 102 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 G+PL E L E Q+ GY TA VGKWHL +S + Sbjct: 103 DYGLPLDERTLAEALQDAGYETAIVGKWHLGHVS----------------------PAYL 140 Query: 217 PQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 P RGFD+ G + Y+ K+ +GY + + EA+ V+ + Sbjct: 141 PMARGFDHQYGHYNGALDYFTHDRDGGHDWHKDDHVNRDEGYATHLIAQEAVRVIQD-RD 199 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 +P LY+ +NA H P P+ Y + + Y V ++D+ V +I++++++ Sbjct: 200 KKKPLFLYVPFNAVHSPLQ--VPESYAAPYGDMKKRRQAYAGMVAALDEAVGQIVDEIQR 257 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YD 390 DNT+ +F+SDNG G L NG +G K Y GG F WKG++ PG+ + Sbjct: 258 QEMLDNTLFIFSSDNGGPEPGKLTDNGPLRGGKHTLYEGGVRVCAFASWKGRIAPGSKVE 317 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 + +D+YPT ++ A S+ + LDG ++ P + + Sbjct: 318 APLHIVDWYPTLIELAGGSLQQAKPLDGRNIWPSITTGEPS------------------- 358 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT---VENNQLGLYKLT 507 PH+ +++ +R D+ LV ++ L+ L+ Sbjct: 359 -------------------PHDVIVCNITPTEGAIRVGDWKLVVHNIGKPREKVELFNLS 399 Query: 508 -DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 DL ++ N A N +++++++ + + P + Q K Sbjct: 400 DDLAEQQNRATTNAKMLRKLRNRFDQLASEAAPAKNAGPQPK 441 >UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA41_PLALI Length = 519 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. Identities = 127/537 (23%), Positives = 198/537 (36%), Gaps = 93/537 (17%) Query: 23 AAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDP 82 F A + + + + S K +PNII++ DD GYG L Sbjct: 12 VCFRQLAIFSMIATVIGAGLTIARIVEADES-KTRPNIILMMTDDQGYGDLSLHG----- 65 Query: 83 KTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTG 142 TP L L + VRF +V P+RA+IMT Sbjct: 66 ---------------------NPVVKTPHLDQLGRQSVRFEQFHV-SPTCAPTRASIMTS 103 Query: 143 RAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 R GV ++ + L T LP+ + GY T GKWHL Sbjct: 104 RHEFSSGVTHTILERERLSLKATILPQFLKRAGYTTGIFGKWHLGD-------------- 149 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-------------YNSPSLFKNRERVP 249 + +QP RGFD G Y +P + N + V Sbjct: 150 ---------EDAYQPGKRGFDEVFIHGGGGIGQSYPGSCGDAPLNKYFNPVIRHNGKFVA 200 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD 309 GY + D+AI + ++ +QPF Y+ NAPH P D P + Y+ + Sbjct: 201 TNGYCTKVFVDQAITWIS-SQPDNQPFFCYITPNAPHAPLDCPK-EYYEPYLEHVPEDVA 258 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYP 369 +Y + D + R+L+ L+ +TI++F +DNG+ G + + K Y Sbjct: 259 RFYGMITHWDDQLGRLLKALEDRDISKDTIVIFMTDNGSAT-GAKHFSAGMRANKGTPYE 317 Query: 370 GGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLK--LDGVSLLPWLQD 427 GG P F W G QP ++ D PT + A++ + D K G SL+P L Sbjct: 318 GGIRVPAFWSWAGHWQPQVRQEVTCHYDILPTLTELANVPVADDEKQSWQGRSLVPLLAG 377 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 + P + IT W E + + +++ + +R Sbjct: 378 RSPNWPPRPF--ITHVGRWPKEHDPKREPSTYQYAK-------------------CAIRL 416 Query: 488 NDYSLVYTVENN--QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 D+ L+ V+ Q LY+L D +K NLA P V+E++ + + S P + Sbjct: 417 GDWKLISNVKQGEPQWELYQLAEDPAEKINLAKKYPDRVEELKKIYDAWWLSVVPKM 473 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. Identities = 128/522 (24%), Positives = 191/522 (36%), Gaps = 102/522 (19%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 KPNI+ + DD+G+ L A TP + Sbjct: 23 PDKPNILFILADDMGWSDLGCYG--------------------------ADLHETPNIDR 56 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ----------------D 158 VRFT+ Y A V PSR+ +MTG+ AR + Sbjct: 57 FASGAVRFTSAY-AMSVCSPSRSTLMTGKHAARLHFTIWAEGAQEGGAKNRELREAESIW 115 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 +P +E + ++ GY TA +GKWHL + P + H W Sbjct: 116 NLPNSEKTIATYLKSAGYLTALIGKWHLGDWEHYP--------EAHGFDINIGGTNWGAP 167 Query: 219 NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 + + G G + P L E Y++D+LTDEAI V+D DQPF + Sbjct: 168 QTFWWPYSGSGTHGPEFRYIPHL----EYGHPGEYLTDRLTDEAIKVID--HAGDQPFFV 221 Query: 279 YLAYNAPHLPNDNPAPDQ---YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 YLA++A H P + A D K + + Y A +D+ V R+LE LK+ G Sbjct: 222 YLAHHAVHTPIEAKADDIQHFDAKYRDGMNHRHTIYAAMNKELDENVGRVLEHLKERGLD 281 Query: 336 DNTIILFTSDNGAVI--------DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 NT+++F SDNG I + P+ N + K Y GG P+ + W G G Sbjct: 282 KNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVPLIIRWPGVTPNG 341 Query: 388 -NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 D+ + D T L P DG+ + P L+D L + Sbjct: 342 ATCDEPVILTDMLQTFLHITG-QPPATDATDGMDISPLLKDPSAKLNRDALFF------- 393 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL 506 H P+ + +R D+ L+ E+N L LY L Sbjct: 394 ------------------------HYPHYYHTTTPVSAIRARDWKLLEFYEDNHLELYNL 429 Query: 507 -TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 DL +K +LA P ++ + + DS L + N + Sbjct: 430 RNDLSEKHDLAKEMPDKAAALRDQLNAWRDSVGAVLPQPNPD 471 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 137/521 (26%), Positives = 206/521 (39%), Gaps = 122/521 (23%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T S +PNI+ L DD GY F T Sbjct: 19 TSLSYAKQPNIVFLFSDDAGYADFGFQGS--------------------------ETMKT 52 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY------------SNTDAQ 157 P L L EGVRFT GYV+ GPSRA IMTGR +FG + A+ Sbjct: 53 PNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAE 112 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 GIPL E + + ++ GY TA GKWHL +E P Sbjct: 113 MGIPLDEVTMGDYMKSLGYRTAFYGKWHLG-----------------------GTDELHP 149 Query: 218 QNRGFDYFMGFHAAGTAYY----NSPS----LFKNRE-------RVPAKGYISDQLTDEA 262 +RGFD F GF +Y+ N+P +F +++ +GY++D L ++A Sbjct: 150 MHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEGYLTDVLAEKA 209 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 ++ K D+PF ++L++NA H P + D +F A ++D+ Sbjct: 210 NQFIE--KAPDKPFFIFLSFNAVHTPMEATPEDL--AKFPQLKGKRKEVAAMTLALDRAS 265 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 +L +LK+ G D+T+++F++DNG D N G KS GG P + W Sbjct: 266 GAVLNKLKELGLEDDTLVVFSNDNGGPTDKNASSNYPLAGTKSNFLEGGIRVPFLVKWPA 325 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 KL G DK +S +D PT A +LDGV L+P++ + PH+++ W Sbjct: 326 KLAAGKVYDKPVSTLDLLPTFFKAGGGEEVMS-ELDGVDLMPYITGQNNKAPHESMYW-- 382 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 + +R D+ L+ + Sbjct: 383 ------------------------------------KKETRAAIRQGDWKLLR-FPDRPA 405 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 LY L D+ ++ NLAA P+ VK+M + + + PL Sbjct: 406 ELYNLANDIGEQHNLAAQEPERVKQMYKDFFSWEMTLERPL 446 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 423 bits (1089), Expect = e-117, Method: Composition-based stats. Identities = 121/498 (24%), Positives = 193/498 (38%), Gaps = 84/498 (16%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + +++ +PN+I++ DD GYG + +T Sbjct: 49 AQTASESRPNVILVVTDDQGYGDMSCHG--------------------------NPWLNT 82 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPE 169 P L L + VR N +V P+RAA+MTGR R G ++ T+ + + ET + E Sbjct: 83 PNLDRLATQSVRLENFHV-DPFCTPTRAALMTGRYCTRVGAWAVTEGRQLLDPDETTMAE 141 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 F+ GY T GKWHL P ++ + + E P G DYF Sbjct: 142 TFRESGYRTGMFGKWHLGDPPPF-APRERGLETVVRHMAGGADEIGNPT--GNDYF---- 194 Query: 230 AAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN 289 + ++N GY +D +EAI + K +QPF Y+ NA H P Sbjct: 195 --------DDTYYRNGTPESFDGYCTDIWFEEAIDFI--QKESEQPFFAYIPTNAMHSPY 244 Query: 290 DNPAPDQYQKQFNTGS--QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 D+Y F +Y + + D+ + R+L++L ++ DNT+++F SDNG Sbjct: 245 --LVADRYSDPFKRQGIEPQRAAFYGMIQNFDENLGRLLKRLDQDNLRDNTMLIFMSDNG 302 Query: 348 AVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPT 401 + N +G K Y GG P F W K D+L D+ PT Sbjct: 303 TAQGASEQNRKVGFNAGMRGKKGSVYEGGHRVPCFASWPAKWDGNRPVDQLTCHRDWLPT 362 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 ++ D+ P D+ DG S+ L Q P + L Sbjct: 363 LIELCDLKRPADVTFDGRSMAGLLSHSSQQWPERTLV----------------------- 399 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANP 520 + Q D+ T+ +Q + V + + LV LY + D Q N+AA P Sbjct: 400 IERQPDNVVSATKTQGRAQPPFVVLTDRWRLVRD------ELYDIQNDPGQIKNIAAEYP 453 Query: 521 QVVKEMQGVVREFIDSSQ 538 +VV+E++ + + Sbjct: 454 EVVRELRAEYDAYFEDVH 471 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 423 bits (1089), Expect = e-117, Method: Composition-based stats. Identities = 127/560 (22%), Positives = 199/560 (35%), Gaps = 134/560 (23%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + L T+ + + +PNI+ + DDLGYG + F+ Sbjct: 5 LLLIPLLTSSFLTQRADAQAPKPQRPNIVFILADDLGYGDVGFNG--------------- 49 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP + L EG+ F Y V PSR++++TG+ + Sbjct: 50 -----------QKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRG 98 Query: 153 N----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 N + Q I + T L E+ + GY TAA GKW L + + Sbjct: 99 NKGVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLGPVGS----------------- 141 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRER---------VPAKGYISDQL 258 E P +GFD F G++ A+ P L+ N ++ + K Y D + Sbjct: 142 -----EGDPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGLIHNKEYAPDLI 196 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ---YQKQFNTGSQTADNY---- 311 +A+ V A+ QPF L+L Y PH P Y+ +F +Y Sbjct: 197 QKKALSFV-NAQDGKQPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGA 255 Query: 312 ---------------YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--- 353 A V +D V +++ LKK G NT+++FTSDNG ++G Sbjct: 256 NGGGYASQDFPHATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGADP 315 Query: 354 --LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISI 410 +G K Y GG P W ++PG+ I A D PT + A+ Sbjct: 316 RFFNSGAGFRGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELANAPA 375 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 P +DG+S L+ K + H L W Sbjct: 376 P--RNIDGISFTDALKGKAIQKKHDYLYWEFHEQG------------------------- 408 Query: 471 HNPNTEDLSQFSYTVRNNDYSLVYTV----ENNQLGLYKLT-DLQQKDNLAAANPQVVKE 525 VR ++ V + + LY L+ D Q+K+NL P+ KE Sbjct: 409 ----------GRQAVRQGNWKAVRLKAAGNPDALVELYDLSKDPQEKNNLTPQFPEKAKE 458 Query: 526 MQGVV-REFIDSSQPPLSEV 544 + ++ R + S+ P + Sbjct: 459 LGQIMNRAHVSSAIFPFGSL 478 >UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED1_PARD8 Length = 459 Score = 423 bits (1087), Expect = e-116, Method: Composition-based stats. Identities = 131/523 (25%), Positives = 206/523 (39%), Gaps = 105/523 (20%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 T ++ S KPN +++ DD+GYG L Sbjct: 11 TAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYG-------------------- 50 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---- 153 TP + + EG++ T YV GVS PSRAA+MTGR P R G+Y + Sbjct: 51 ------NPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAV 104 Query: 154 --TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 +++ G+ E + ++ Q GY T VGKWHL S Sbjct: 105 LFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFS--------------------- 143 Query: 212 AEEWQPQNRGFDYFMGFH-----------AAGTAYYNSPSLFKNRERV---PAKGYISDQ 257 + P + GFD + G A + L + +++ P +G ++ + Sbjct: 144 --PYLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRR 201 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 T++A+ + +PF LY A+ PH+P Y G+ Y V Sbjct: 202 YTEKAVSFIKNHSK--EPFFLYFAHTFPHIPL-------YTNARFEGTSKRGLYGDVVEE 252 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTP 375 +D V +L+ L++NG +NT ++FTSDNG + G K K + GG P Sbjct: 253 IDWSVGEVLKALRENGLDENTFVIFTSDNGPWLTEHENGGSAGPLKDGKGTWWEGGFRVP 312 Query: 376 MFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 W GK+ P D+++++MD YPT L A I PKDL LDGV+ L ++K + Sbjct: 313 AICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKDLVLDGVNQTGLLFEEKHSARDE 372 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 W W E + + + D Y E ++ L+Y Sbjct: 373 VYYW------WGSELMAIRKGEWKYYFKTIKDQYLRTCKIETPAEP----------LLYN 416 Query: 496 VENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 VE TD+ ++ NLA +P++VK + + + Sbjct: 417 VE---------TDISERFNLADKHPEIVKLLIEAGEKHKKGMK 450 >UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA Length = 496 Score = 421 bits (1084), Expect = e-116, Method: Composition-based stats. Identities = 126/520 (24%), Positives = 204/520 (39%), Gaps = 93/520 (17%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 PNII++ DD GYG L F TP L L Sbjct: 33 SPPNIILVMTDDQGYGDLGCHGHPF--------------------------LKTPNLDRL 66 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHG 175 E RF + +V P+R+A+M+GRAP + GV +D + LT T + E+ ++ G Sbjct: 67 HSESTRFNDFHV-SPTCAPTRSALMSGRAPFKNGVTHTILERDRMALTSTTIAEVLKSAG 125 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T GKWHL + +QP RGFD A G Sbjct: 126 YTTGIFGKWHLGD-----------------------EDAYQPDRRGFDETFIHGAGGIGQ 162 Query: 236 -------------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD-QPFMLYLA 281 Y +P + N V +GY +D +A+G + D +PF Y+ Sbjct: 163 NFAGSQSDAPGTSYFNPIIKHNGTFVQTEGYCTDVFFQQALGWIRLQTKSDTKPFFAYIP 222 Query: 282 YNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 NAPH P +Y +F + S + + ++D + +++ +L + DNT++ Sbjct: 223 TNAPHAPY--KVEKRYSDRFRDKCSSPQSEFLGMIVNIDDNMGKLMGKLDEWDLADNTLL 280 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFY 399 +F +DNG G N KG K GG+ P+FM G G + + +D + Sbjct: 281 IFMTDNG-SAKGSKIYNAGMKGGKGTVNEGGSRVPLFMRLPGFTNSGVDIETMTRHVDLF 339 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 PT + A IP + LDG SL+ +++ + H+ F Sbjct: 340 PTLAEIAHAEIPAEADLDGRSLVSLIKNPQLDWDHRFQF---------------FHSGRW 384 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAA 518 + +PN + +Y VR+ + LV LY L D + ++A + Sbjct: 385 AKAGLKGKFGKGDPNPDHSKHKNYAVRDEKWRLV------NGELYDLENDPGETADVAGS 438 Query: 519 NPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSE 558 +P+VV M E+ D +P + +N++ ++ K + Sbjct: 439 HPEVVSRMLVAFDEWWDEVRPLM--INEDAPLDVGKPFRD 476 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 129/546 (23%), Positives = 211/546 (38%), Gaps = 96/546 (17%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 V T + + + +PNII++ DDLGYG L G + Sbjct: 16 SMVSFSGLLTLTSDAQTSTNRPPAPRRPNIILILADDLGYGDL----GCYGQ-------- 63 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 Q TP + L ++G++FT+ Y V PSRA +MTG+ + Sbjct: 64 --------------TQIKTPNIDKLAEDGMKFTSFYAGSTVCAPSRATLMTGKNTGHVNI 109 Query: 151 YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 N D + E + ++ + GY T +GKW L + +P + +Y Sbjct: 110 RGNADL--SLNGEELTIAKILKLAGYATGCIGKWGLGNEGSPGLPGRQGFDEYLGYLDQV 167 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 A ++ P + F + + +L +N + Y +D T A+ + K Sbjct: 168 QAHDYYPTHL-------FRSDSKGEESKIALTEN-DADHKGLYSNDFFTQSALNYLRINK 219 Query: 271 TLD----QPFMLYLAYNAPHLPND--------NPAPDQYQKQFNTGSQTADNYYASVYSV 318 + F LYL Y PH N+ P Q N A + + Sbjct: 220 PSKLNKHRSFFLYLPYTLPHANNELGNRTGNGMEVPSTEPYTNEQWPQVEKNKAAMITRL 279 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTH 373 D V I++ LKK+ +NT+++F SDNG +G G +G K Y GG Sbjct: 280 DHYVGEIMDYLKKSKLDENTVVIFASDNGPHKEGGVNPKYFNSAGGLRGIKRDLYEGGIR 339 Query: 374 TPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P + W +++ G+ D ++ DF PTA + A S P + +DG+S LP L K Q Sbjct: 340 VPFIVRWPARVKAGSISDAPLAFWDFLPTAAEIARTSSPTN--IDGISFLPTLLGKAQTN 397 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 H+ L W F VR D+ Sbjct: 398 RHQYLYWEFHEQG-----------------------------------FDQAVRMGDWKA 422 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFI-DSSQPP---LSEVNQE 547 V N + LY L TD+ +KDN+A NP+V+ ++ +++ D + P ++E+ ++ Sbjct: 423 VRHGINGPIELYNLKTDVSEKDNVADKNPEVMAKIADYLKKARTDDPRWPAKTVAEIKED 482 Query: 548 KFNNIK 553 + +K Sbjct: 483 QETKVK 488 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. Identities = 144/522 (27%), Positives = 208/522 (39%), Gaps = 131/522 (25%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PNIIV DDLGYG L G + M TP L L Sbjct: 2 KRPNIIVFYCDDLGYGDL----GCYGSDAM----------------------KTPHLDQL 35 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPELFQ 172 EG+RFTN Y V PSRA+++TG+ PA+ GV S + G+ L +T L + Sbjct: 36 ASEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALK 95 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 HGY+TA GKWHL + E+ P GFD F GF A Sbjct: 96 EHGYHTALFGKWHLGASA-----------------------EYGPNAHGFDQFYGFRAGC 132 Query: 233 TAYYNS-------------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFML 278 YY+ L++N V G Y+++ +T EA +D A D+P+ + Sbjct: 133 IDYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREATSYID-AAPDDEPYFM 191 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 Y+AYNAPH P AP Y +F A + +VD GV I++ LK+ G Y++T Sbjct: 192 YVAYNAPHYPMH--APKAYLDRFPDLPPDRRIMAAMIAAVDDGVGEIVKALKQKGAYEDT 249 Query: 339 IILFTSDNGAVIDGP-----------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL--Q 385 II F+SDNG + G +G+K+ + GG P + + L Q Sbjct: 250 IIFFSSDNGPSTESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQ 309 Query: 386 PGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 G D++ + MD +PT L+ + I + LDG S+ L P K L W Sbjct: 310 QGQISDEMFAMMDIFPTMLELSGIGT-EGYSLDGHSVFDALSGNAL-SPRKQLFWEY--- 364 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-------VE 497 + VR + LV E Sbjct: 365 -----------------------------------EGQLAVREGKWKLVLNGKLDFSRTE 389 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + + L L D ++ NL P++ + ++ VR++ S Q Sbjct: 390 ADAVHLSDLEQDSSERINLVKQYPEIAQRLERDVRQWYQSLQ 431 >UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF72_9BACT Length = 481 Score = 419 bits (1078), Expect = e-115, Method: Composition-based stats. Identities = 129/520 (24%), Positives = 197/520 (37%), Gaps = 110/520 (21%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN+I++ +DDLG+ TP + L Sbjct: 23 KPNVIMILVDDLGWTDTTCYGS--------------------------DLYQTPNVDELS 56 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-------------DAQDGIPLT 163 G+RFT+ Y A V P+R++IMTG+ PA + + + + Sbjct: 57 RTGMRFTDAYSACTVCSPTRSSIMTGKNPANNNLTDWITGHVKPYAKLKSPNWKMHLTAE 116 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E L E F+ GY T +GKWHL + S P+N+GFD Sbjct: 117 EITLAEAFKATGYKTVHIGKWHLGEESVS-----------------------WPENQGFD 153 Query: 224 YFM-GFHAA-----GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 + GF A G Y SP + P Y++++L EA + L +PF Sbjct: 154 ENIAGFRAGSPSAHGGGGYFSPYNNPRLKDGPKGEYLTERLAQEASQYIQSTAKLKKPFF 213 Query: 278 LYLAYNAPHLPNDNPAP--DQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 + L H P D+Y + T Y A V +D V +++ +K G Sbjct: 214 MNLWLYNVHTPLQARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVKDAGI 273 Query: 335 YDNTIILFTSDNGAVIDG------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 DNTII+F SDNG + + N + K Y GG PM + W K++ G Sbjct: 274 EDNTIIIFNSDNGGLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKAGQ 333 Query: 389 Y-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 + + D YPT LD I + K +DG+SL+P L + K + L W Y H+ Sbjct: 334 TSSSPVISHDIYPTLLDLCKIDVSKKQDIDGISLVPELLEGKTIQ-RDALYW--HYPHYH 390 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL- 506 E P+ +R D+ L++ E + LY L Sbjct: 391 LEGAKPY----------------------------SAIRKGDWKLIFLYEESHAELYNLR 422 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 D+ +++NLA + + E+ G +R + L N Sbjct: 423 NDISERNNLAMTEKRKLAELMGDLRTWKKKIGAQLPVFNP 462 >UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UL93_RHOBA Length = 470 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 122/517 (23%), Positives = 194/517 (37%), Gaps = 106/517 (20%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 +P+I+ + DD+G+ L Sbjct: 37 CLVSAEAAEQPHILFIMADDMGWKDLHCQG--------------------------NDVL 70 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN-------------- 153 TP + +L + GVRF N Y V P+RA++MTG APAR + + Sbjct: 71 RTPNIDALAEAGVRFDNAYAGSTVCTPTRASLMTGLAPARLHITQHGADSKSFWPDDRLI 130 Query: 154 --TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 +P T + E + GY T GKWHL Sbjct: 131 QPPPTNHELPHETTTMAERLKAAGYTTGFFGKWHLGGDKK-------------------- 170 Query: 212 AEEWQPQNRGFDYFMGFHA-AGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 + P GFD +G G Y P Y++D+L DE I + R K Sbjct: 171 ---YWPTEHGFDVNVGGCGLGGPPTYFDPYRIPALPPRKEGEYLTDRLADETIAFMRREK 227 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 D+P + L PH P + P + Y+ + T Y + + D+GV R+L + Sbjct: 228 --DKPMFVCLWTYNPHYPFEAPEDLIEHYKGKEGT-GLKNPIYGGQIEATDRGVGRVLRE 284 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 L G D T+++FTSDNG N + K + GG P+ + W G + Sbjct: 285 LDSLGIADETLVVFTSDNGGW--SGATDNRPLREGKGFLFEGGLRVPLIVRWPGVTEAAT 342 Query: 389 YDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 ++ + +MD T LDAA +S+ LDG SL P K L + Sbjct: 343 VNETPVVSMDLTATILDAAGVSLANGESLDGESLRPLFSGGK--LERDALYF-------- 392 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL- 506 YPH +D ++ +R+ Y L+ +++ + LY L Sbjct: 393 --------------------HYPHFAFHKD-NRPGSVIRSGQYKLILRHDDDSVELYDLQ 431 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 DL + +LAA +P V +E++G + E+++++ + E Sbjct: 432 NDLSETSDLAAVHPDVAQELKGRLMEWLEATGAGMPE 468 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 125/524 (23%), Positives = 203/524 (38%), Gaps = 118/524 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 ++ PNII++ MDD+G+ + F F TP Sbjct: 36 AAESRRPPNIILILMDDMGWRDVGFMGNKF--------------------------VETP 69 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----------- 159 + L G+ FT Y + P+RA +M+G+ R G+Y+ D + Sbjct: 70 HIDRLAKTGLVFTQAYASAPNCAPTRACLMSGQYAPRHGIYTVVDPRQPPGSPWHKWQAA 129 Query: 160 -----IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 + + E ++ GY TA G W+L + PV Q Sbjct: 130 ESKSELDTNVVTIAEALRDGGYATAFFGMWNLGRGRTGPVTPGGQGFQKVVF-------- 181 Query: 215 WQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQ 274 P+N GF Y++ K Y++D+LTDE + VD + +Q Sbjct: 182 --PENLGF--------GKDEYFDD-----------GKHYLTDRLTDEVLKFVDEHR--EQ 218 Query: 275 PFMLYLAYNAPHLPNDNPAPD---QYQKQFNTGSQTADN--YYASVYSVDQGVKRILEQL 329 PF +YL +A H P NP P+ +Y+++ + D+ A++ +VD V RI++ L Sbjct: 219 PFFVYLPDHAIHAPF-NPKPELLAKYERKAAASNDRRDDPACAATIEAVDHNVGRIMDHL 277 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-N 388 K+ DNT+++FTSDNG +G K + Y GG P+ + G G Sbjct: 278 KRLKLSDNTVVIFTSDNGGTQQ----YTPPLRGGKGELYEGGIRVPLVVAGPGVKSLGSR 333 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 D +S++D YPT L+ A I P+ LDGVSL P LQ + + L W Sbjct: 334 CDVPVSSIDLYPTLLELAGIKPPEGQVLDGVSLAPLLQGDATLD-RERLFW--------- 383 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN-NQLGLYKL- 506 H P + S +R D+ L+ E ++ L+ L Sbjct: 384 ----------------------HFPCYVGKATPSSAMREGDFKLIEFFEEGGRVELFNLK 421 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 D ++ NLA+ P + +R + + + ++ Sbjct: 422 NDPNEEKNLASVMPDKAAALAKTLRAWQKKTNASIPPGPNPSYD 465 >UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V8P8_BACUN Length = 525 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 142/543 (26%), Positives = 231/543 (42%), Gaps = 118/543 (21%) Query: 35 LKATKTNVAFSDFTPTEYSTKG--------KPNIIVLTMDDLGYGQLPFDKGSFDPKTME 86 +K + T V+ + P S G +PNI+++ DD+G+G + + Sbjct: 1 MKVSCTLVSVAALLPFSGSNAGNVQRDKSQRPNIVLVIADDMGWGDVGYQG--------- 51 Query: 87 NREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPA 146 A STP + +L GV+F+ GYV+ +SGPSRA I+TG Sbjct: 52 -----------------AVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGVYQQ 94 Query: 147 RFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 RFG Y+N IP ++ L E+ ++ GY T VGKWH++ PE R D Sbjct: 95 RFGFYNNLHPWAKIPEGQSTLGEMVRDCGYATGFVGKWHMADS-----PEQSPNRRGFDQ 149 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA----KGYISDQLTDEA 262 F F ++ DY+ G Y+ L++N E P YI+D T EA Sbjct: 150 FYGFWSDTH-------DYYRSTDKPGVELYDFCPLYRNGEIQPPLHESGEYITDCFTREA 202 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT---GSQTADNYYASVYSVD 319 + +D+ + PF+L L+YNA H P P+ Y + + + A V ++D Sbjct: 203 VEFIDKHASS--PFLLCLSYNAVHSPWQ--VPEHYVNRLEGRRFHHEDRKVFAAMVLALD 258 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLN----------------GAQKGY 363 G+ R++E L+KNG +NT+ + SDNG+ + + G +GY Sbjct: 259 DGIGRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGY 318 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K+ TY GG P M W +L G YD + ++D +PT + A + + LDGVSLL Sbjct: 319 KADTYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTVMQAVGGTSRQKYSLDGVSLL 378 Query: 423 PWLQDKK--QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 P+L+ + PH L W Sbjct: 379 PYLKSEWPIDKRPHSTLYWR--------------------------------------RD 400 Query: 481 FSYTVRNNDYSLVYTVENN--QLGLYKLTDLQQK-DNLAAANPQVVKEMQGVVREFIDSS 537 + +R D+ LVY + + ++ L+ + D +++ +L+ P++ + + D++ Sbjct: 401 EDFAIRKGDWKLVYNDQGSTRKIQLFDMKDDKEEVYDLSGEYPELADSLLAEFDAW-DAA 459 Query: 538 QPP 540 PP Sbjct: 460 LPP 462 >UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XI1_PSEA6 Length = 510 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 135/550 (24%), Positives = 211/550 (38%), Gaps = 122/550 (22%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 S KPN++++ +DDLGY + E Sbjct: 26 SVLNSCAAQVVTKPNVLLILVDDLGYSDIKAYN-------------------------EN 60 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRA---------------PARFG 149 + TP + L + V FTNGY A+ V PSR A++TG+ PAR G Sbjct: 61 SFYDTPNIDKLASQSVMFTNGYAANPVCSPSRFALLTGKHPTRGKATDWFPANDKPARAG 120 Query: 150 VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + + D +PL+E L E F+ +GY TA +GKWHL K ++ Sbjct: 121 RFLPAEFNDALPLSEITLAEAFKQNGYNTAFLGKWHLGKTEDL----------------- 163 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGT--AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVD 267 P+N+GFD + G A Y SP P Y++ +LT+EAI +VD Sbjct: 164 ------WPENQGFDVNIAGTKNGHPAAGYFSPYKNARLTDGPKGEYLTQRLTNEAISLVD 217 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPD--QYQKQFNTG--------------------- 304 + PF + L++ H P P D +YQ + Sbjct: 218 KYSKQTVPFFMMLSFYTVHTPLAAPNKDVQEYQAKIRQYAHNDEFQREEQVWPTAEKREV 277 Query: 305 --SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQ 360 Q Y A V +D V R+L +LK+ G ++T+++FTSDNG +G N Sbjct: 278 RVKQNHPTYAAMVKQMDTQVGRLLAKLKQAGMEESTLVVFTSDNGGLSSAEGSPTSNLPL 337 Query: 361 KGYKSQTYPGGTHTPMFMWWK-GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGV 419 +G K Y GG P+ + K + ++ +++ D YPT L A + + LDGV Sbjct: 338 RGGKGWLYEGGIRVPLLVKLPQKKHKHLQINEPVTSTDLYPTLLSAGHLDLLPQQHLDGV 397 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 L + G L Y H YPH N Sbjct: 398 DLNQYF---SPGAKRDALMRRPLYFH-----------------------YPHYSNQGGF- 430 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 +R ++ L+ E+ ++ LY L D+ ++ +LA P+ V ++ + E+ + Sbjct: 431 -PGAAIRQGNWKLIERFEDGKVHLYNLANDIGEQIDLANQAPERVASLRKKLHEWYQQTS 489 Query: 539 PPLSEVNQEK 548 + K Sbjct: 490 ARFLKAKGNK 499 >UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKD8_9BACT Length = 455 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 135/511 (26%), Positives = 206/511 (40%), Gaps = 122/511 (23%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 KPNII++ DDLGY L F + A TP + Sbjct: 18 AAQKPNIILILADDLGYEDLGF--------------------------LGAPDIKTPHID 51 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---------GIPLTE 164 +L G+ FT GY + V GPSRA ++TGR FG N GIPL E Sbjct: 52 ALARSGMNFTQGYQSASVCGPSRAGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDE 111 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + +L + Y T +GKWH+ E +P R DY Sbjct: 112 QMIFDLLKPAAYTTGVIGKWHMGLS-----------------------HEQRPTQRSVDY 148 Query: 225 FMGFHAAGTAYYNSP----------SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQ 274 + GF +Y + +F+N E VP GY ++ DE + + R K D+ Sbjct: 149 YYGFLNGAHSYREAKMDMKGAPMTWPIFRNNEPVPFSGYTTEVFNDEGVNFIKRNK--DK 206 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 PF LY++YN+ H P + P Q+ + + Y A + S+D GV R+++ LK G Sbjct: 207 PFFLYMSYNSVHGPWEAQ-PKDLQRSDHIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGI 265 Query: 335 YDNTIILFTSDNGAV--------IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 Y+NT+++F SDNGA L NG+ +G K TY GG P M W + Sbjct: 266 YENTLVIFMSDNGAPNNLHEAERAGDYLASNGSLRGRKGDTYEGGIRVPYIMSWPQVIPK 325 Query: 387 G-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 Y +S +D PT + + + P +L GV+L+P++ +K PHK L W + Sbjct: 326 QSTYQHPVSGLDIVPTLIHISQAA-PAKKELSGVNLMPYITGEKTSRPHKTLYWRRDDDY 384 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ--LGL 503 +R+ D+ L + N L Sbjct: 385 --------------------------------------AIRDKDWKLTWNDYNGPRTPML 406 Query: 504 YKLTD-LQQKDNLAAANPQVVKEMQGVVREF 533 + L D +K+NL +P++ +++Q ++ Sbjct: 407 FNLKDDPNEKNNLIHKHPEIAQKLQAKFDQW 437 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 147/540 (27%), Positives = 222/540 (41%), Gaps = 116/540 (21%) Query: 32 DVKLKATKTNVAFSDFTPTEY-STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 + + N + +D ++ + KPNI+ + DDLGY + F+ + Sbjct: 13 GTLIAISVGNASAADAGQSKADESNEKPNILFVLADDLGYNDVGFNGST----------- 61 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 TP L L G+ F YVAH GPSRAAIMTGR P + G Sbjct: 62 ---------------DIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGA 106 Query: 151 YSN---TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 N ++ G+ E F+ + ++ GY+T A+GKWHL + Sbjct: 107 QFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGE------------------- 147 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---------------------PSLFKNRE 246 A E+ P GFD F GF G Y+ L N + Sbjct: 148 ----ASEYHPNKHGFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGK 203 Query: 247 RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ 306 V YI+D L+ EA+ VD+A +PF LYLAYNAPH+P D F+ Sbjct: 204 EVRETEYITDGLSREAVNFVDKAAAKKKPFFLYLAYNAPHVPLQAKEED--MAMFSQIKD 261 Query: 307 -TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKS 365 Y VY+VD+GV RI+EQLKKNGQ+DNT+I+FTSDNG + G N K K Sbjct: 262 KKRRTYAGMVYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNGGKL-GQGANNYPLKEGKG 320 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 GG TPM + W ++ G+ + A+D YPT +P+D KLDG + Sbjct: 321 SVQEGGFRTPMLVHWPKHMKAGSRFSHPVLALDLYPTFAGLGGAVLPEDKKLDGKDIWAD 380 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 +Q + + + + + D Sbjct: 381 IQANTAPHKDEFIYVLRHRNGYSD----------------------------------AA 406 Query: 485 VRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREF-IDSSQPPLS 542 R N + V ++ LY + D+ + ++++A +P ++++M + + ++ QP Sbjct: 407 ARRNQFKAVKNHNDD-WKLYNIAQDISEDNDISAQHPDILRDMVSSMESWSWNNQQPKWF 465 >UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY9_9BACT Length = 590 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 131/529 (24%), Positives = 209/529 (39%), Gaps = 94/529 (17%) Query: 42 VAFSDFTPTEYS-TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 + F + + + KPNI+++ DD GYG + Sbjct: 9 LGLCAFALSPAALAEDKPNIVLILTDDQGYGDISSHG----------------------- 45 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGI 160 TP L L ++G RF N +V++ V P+RA+++TGR R GV + + + Sbjct: 46 ---NRMIDTPHLDQLAEDGTRFENFFVSN-VCAPTRASLLTGRYHIRTGVVQVSRGLEIM 101 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 E + E+F+ GY T GKWH + P + Sbjct: 102 RSEEATIAEVFKAQGYETGLFGKWHNGEHY-----------------------PNNPPGQ 138 Query: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 GFD + GF A + +L N+ V KG+I+D LTD AI +++ + D+PF Y+ Sbjct: 139 GFDEYFGFCAGHIGDFFDATLDHNKTFVKTKGFITDVLTDRAIDWIEKQQ--DKPFFAYI 196 Query: 281 AYNAPHLPNDNPAPDQYQKQF--NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 YNAPH P D+Y +F S Y + ++D + R+L+ L DNT Sbjct: 197 PYNAPHAPYQ--VEDKYYDEFAAKGYSAAHSAAYGMIENLDDNIGRLLKILDDLNLTDNT 254 Query: 339 IILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMD 397 I++F +DNG + P NG KG K GG P F+ W GK+ G L + +D Sbjct: 255 IVIFLTDNGP--NSPTRFNGGMKGSKGSVDEGGVRVPFFIRWPGKIAKGRTIHDLAAHID 312 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 PT ++ A +++ KLDG SL + K + P W Sbjct: 313 VLPTLMELAGVNVDLPNKLDGRSLTSLISSSKTPK-------------------APAWPE 353 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLA 516 F + + + R+N Y V + + GLY + D Q+ +L Sbjct: 354 RLIFTQGPGTNM-------TPGSGAGAARSNQYRYVLS--RGEEGLYDMINDPGQEKDLK 404 Query: 517 AANPQVVKEMQGVVREFIDSSQPPLS-----EVNQEKFNNIKKALSEAK 560 + ++ E++ E++ V ++F EAK Sbjct: 405 KSKKKIFDELKAAYIEWLKDVSAGWEPNTTIPVGYKEFPATYLQAVEAK 453 >UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UQ05_RHOBA Length = 525 Score = 418 bits (1074), Expect = e-115, Method: Composition-based stats. Identities = 128/603 (21%), Positives = 226/603 (37%), Gaps = 141/603 (23%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 M + +S S + +S + A + T + + +P +PN+ Sbjct: 1 MCRQMLRSHCPVSSPSLASSNLVTTAVLLIATIASLGNPTTLVAEETSP----APSRPNV 56 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 ++ +DDLG+ L ++ TP + +L + G+ Sbjct: 57 LLFLVDDLGWADLGCYGSTYH--------------------------ETPQIDALAESGI 90 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA----------------QDGIPLTE 164 RFTN Y A V P+RA+IMTGR P R + +D + L E Sbjct: 91 RFTNAYAACPVCSPTRASIMTGRHPVRVDITDWIPGMSTDRAQNPRFQHVDDRDNLALDE 150 Query: 165 TFLPELFQN-HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 + E ++ Y T +GKWHL + ++ P ++GF Sbjct: 151 VTIAEHLRDAADYQTFFLGKWHLGDVGHL------------------------PTDQGFQ 186 Query: 224 YFMGFHAAGTAYYNSPSLFKNR--ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 +G G+ S +KN + Y++ +LTDEA+ +VD A D+PF + ++ Sbjct: 187 INIGGGHKGSPPGGYYSPWKNPYLKAKQDGEYLTTRLTDEAVSLVDTASREDKPFFMMMS 246 Query: 282 YNAPHLPN--DNPAPDQYQKQFNTGS-------------------QTADNYYASVYSVDQ 320 Y H P D D ++++ + Q Y + V +VD Sbjct: 247 YYNVHSPITPDKRTIDHFEEKQSNSPELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDT 306 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLPLNGAQKGYKSQTYPGGTHTPMF 377 V RI++ LK++G DNT+++F SDNG + N + K Y GG P+ Sbjct: 307 SVGRIMKALKEHGVDDNTLVIFFSDNGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLL 366 Query: 378 MWWKGKLQPGNYDKLISA-----------MDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 + + G ++ +S D +PT LD + + + DG+SLLP + Sbjct: 367 VRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVGLPLQPESHADGISLLPAIA 426 Query: 427 DK--KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 + + ++L W YPH + L + Sbjct: 427 GEAAETDSSPRDLHW----------------------------HYPHYHGS--LWRPGAA 456 Query: 485 VRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 +R +Y L+ E + LY L+ D+ + +L+ P+ E++ +R++ + Sbjct: 457 IRRGNYKLIEFYETDTAELYDLSVDMGETKDLSKTEPERFAELRDALRQWQTEMNAKMPV 516 Query: 544 VNQ 546 N Sbjct: 517 PNP 519 >UniRef50_Q7UHJ6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UHJ6_RHOBA Length = 500 Score = 416 bits (1071), Expect = e-115, Method: Composition-based stats. Identities = 137/542 (25%), Positives = 212/542 (39%), Gaps = 91/542 (16%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 M S+ + S++ S + M +A + + + T + +PN Sbjct: 18 MPSSTEPCSFSSTTSRTDCANMKTTSAISIASLFVCMLATQPF--AMADANAADAARPNF 75 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +V DD+G+G TP L L +GV Sbjct: 76 VVFVADDMGWGD--------------------------SHTYGHELIQTPNLDRLASQGV 109 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--GIPLTETFLPELFQNHGYYT 178 +FT Y A GV PSR+AI+TGR P R GVY + + +E PEL + GY T Sbjct: 110 KFTQCYSACGVCSPSRSAILTGRTPYRNGVYRHLSGNHEAHLRASEITFPELLKEVGYET 169 Query: 179 AAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS 238 VGKWHL PE P GFD++M + + + Sbjct: 170 CHVGKWHLLSRQQFNNPEFP-----------------HPGEHGFDHWMCTQNNASPSHQN 212 Query: 239 PSLF-KNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 P F +N E V +GY + + EA + +PF + + + PH P + + Sbjct: 213 PDNFVRNGEPVGQLEGYSAQLVASEAARWLKDIHDPSKPFAMTVWVHEPHSPIATDS--R 270 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 +Q +N + Y ++ +D + +++ L DNT++ FTSDNG V Sbjct: 271 FQSLYNGHENS--KYMGNITQMDHALGMVMDALDAQEVTDNTLLFFTSDNGPVPAFG-GS 327 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLK 415 +G +G K + GG P W G +QPG D + D + T LD A I +P D Sbjct: 328 SGGLRGNKRSDHEGGIRVPGVARWPGHIQPGTISDTPVIGTDVFATVLDIAGIPLPTDRT 387 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 +DGVS+LP + K E L W T S D Sbjct: 388 IDGVSMLPAFEGK-PVERSTPLFWRTHVSPPEDR-------------------------- 420 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQ-GVVREF 533 +R D+ LV + LY++ D +++ +LAAA P+ KEM+ +++ + Sbjct: 421 -------VALRIGDWKLVGDETLTKFQLYEIQKDWKEEHDLAAAMPEKTKEMKDQLMKVW 473 Query: 534 ID 535 D Sbjct: 474 RD 475 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 416 bits (1070), Expect = e-114, Method: Composition-based stats. Identities = 113/507 (22%), Positives = 186/507 (36%), Gaps = 98/507 (19%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 +PN++++ DD GYG F TP Sbjct: 25 SVRAADRPNVVMVITDDQGYGDCGFTG--------------------------NKVVQTP 58 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 + +L E T+ +VA P+R+A+MTG R GV+ + + E E+ Sbjct: 59 NIDALAAESSVLTDYHVA-PTCSPTRSALMTGHWTNRTGVWHTISGRSMLRDNEVTFGEI 117 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 F + GY T GKWHL ++ ++ GF Sbjct: 118 FSDAGYQTGMFGKWHLGDNY-----------------------PYRAEDNGFTEVYRHGG 154 Query: 231 AGTAY--------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 G Y S F N + V A+G+ +D E + D+PF Y+A Sbjct: 155 GGVGQTPDFWDNAYFDGSYFHNGKAVKAEGFCTDVFFKEGNRFIRECVEADEPFFAYIAT 214 Query: 283 NAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 NAPH P AP +Y + + ++ + +VD V + + L++ G +DNTI +F Sbjct: 215 NAPHGPLH--APQKYIDMYPEMNDNVATFFGMITNVDDNVGQTRKLLRELGVHDNTIFIF 272 Query: 343 TSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK--GKLQPGNYDKLISAMDFYP 400 T+DNG G N +G K Y GG P M + G + + L A+D P Sbjct: 273 TTDNG-TAGGASVYNAGMRGKKGSPYEGGHRVPFVMHYPEGGFAKSRTNNTLCHAVDVVP 331 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 T LD + P+ +K DG S++ L+D+ + + S Sbjct: 332 TLLDMCGVEAPESVKFDGTSIVSLLKDEVDSSFNDRMLITDS------------------ 373 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAAN 519 + + +V + + L+ N LY + D Q++N+A + Sbjct: 374 -----------QRVIDPIKWRQSSVMQDKWRLI-----NGKELYNIANDPGQENNIAGDH 417 Query: 520 PQVVKEMQGVVREFIDSSQPPLSEVNQ 546 P+ V M+ + +P S+ + Sbjct: 418 PEQVASMRAFYEAWWAELEPTFSQTTE 444 >UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W8_9PLAN Length = 459 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 123/502 (24%), Positives = 191/502 (38%), Gaps = 104/502 (20%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 + + +PNII + DDLGYG L G + K M Sbjct: 19 ASMQAAEGERPNIIFIMADDLGYGDL----GCYGQKLM---------------------- 52 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-QDGIPLTETF 166 TP + +G RFT Y V SRA ++TG N + ++ Sbjct: 53 KTPHIDQFAAQGTRFTQAYAGGSVCTASRAVLLTGLHNGHTPARDNIPHYATYLQESDVT 112 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 + E+ Q GY VGKW L V N+GFD + Sbjct: 113 IAEVLQKSGYRCGGVGKWSLGDAGTVGRA----------------------TNQGFDMWF 150 Query: 227 GFHAAGTA-YYNSPSLFKNRERVPAKG-------YISDQLTDEAIGVVDRAKTLDQPFML 278 G+ A YY + L N R+ KG Y D LT+ A+ + + QPF L Sbjct: 151 GYLNQDHAHYYFTEYLDDNEGRLELKGNTKNRQQYSHDLLTERALQFIRDSAA--QPFFL 208 Query: 279 YLAYNAPHL------PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 Y AY PH P+ PD + Y A ++ +D+ V RI+ + + Sbjct: 209 YAAYTLPHFSAKAEDPHGLAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDVGRIMSLVNEL 268 Query: 333 GQYDNTIILFTSDNGAVIDGP--LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY- 389 + T+I+FTSDNG P L NG +G+K GG P W G + G Sbjct: 269 QLRERTLIIFTSDNGGHRGVPAQLHTNGPLRGFKRDLTEGGIRVPFIANWPGTIPAGKVS 328 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 D++I+ D PT + A + + LDG+S+LP L+ + + H+ L W + Sbjct: 329 DEVIAFQDMLPTFAELAGAQVSAN--LDGISVLPALRGEPRKVKHEYLYWDYGHCR---- 382 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TD 508 +++ VR N++ + + ++ LY L D Sbjct: 383 -----------------------------ARYDQAVRWNNWKGIRHGQQGEIALYNLDQD 413 Query: 509 LQQKDNLAAANPQVVKEMQGVV 530 L + ++A +PQVV+ + ++ Sbjct: 414 LSESRDVADKHPQVVQRIAEIM 435 >UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYD6_RHOBA Length = 889 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 122/554 (22%), Positives = 209/554 (37%), Gaps = 125/554 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 S +PN++ + DDLG+ + TP Sbjct: 260 NASASKRPNVLFILADDLGWSDTTLFGTT-------------------------KLYQTP 294 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---------------- 154 + L G+ FT Y + + P+RA+++TG +PAR G+ S T Sbjct: 295 NIERLAKRGMTFTRAYSSSPLCSPTRASVLTGLSPARHGITSPTCHLPKVVLEPKVSETG 354 Query: 155 ---------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 ++ + L E+F+++GY T GKWHL Sbjct: 355 PPNKFSTVPESVTRLDTKYYTLAEMFRDNGYATGHFGKWHLG------------------ 396 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGT--AYYNSPSLFK--NRERVPAKGYISDQLTDE 261 E + P GFD + H Y +P FK + + V ++ D++ E Sbjct: 397 ------PEPYSPLEHGFDVDVPHHPGPGPAGSYVAPWKFKDFDHDPVIPDEHLEDRMAKE 450 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTA-DNYYASVYSV 318 A+ +++ ++PF L + H P D ++Y+ + + Y A + S+ Sbjct: 451 AVRFLEQH--TNEPFFLNYWMFSVHAPFDAKKELIEEYRDRVDPKDPQRCPTYAAMIESM 508 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQTYPGGTH 373 D + +L+ L + G D TII+F SDNG + N +G K+ Y GG Sbjct: 509 DDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEVDGTTATSNAPLRGGKATMYEGGVR 568 Query: 374 TPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P + G ++ G D +I ++DFYPT L+ I + + DGVS++P LQ K Sbjct: 569 GPAIVVQPGVVESGSRSDAIIQSIDFYPTLLEMLAIDAQPNQRFDGVSIVPALQGK--PL 626 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 + +PH+P + S +V D+ L Sbjct: 627 QRDAIF----------------------------TYFPHDPPVPNWMPPSVSVHQGDWKL 658 Query: 493 VYTVENN-----QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + + L+ L DL ++ NLAA +P V++M ++ + + ++ VN+ Sbjct: 659 IRIFHGGPNGSHRYKLFNLKNDLGERINLAAKHPDRVQQMDKLIGQHLVETKAVRPLVNK 718 Query: 547 EKFNNIKKALSEAK 560 A +E K Sbjct: 719 NFDPAKYNAGAEGK 732 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 415 bits (1066), Expect = e-114, Method: Composition-based stats. Identities = 126/531 (23%), Positives = 192/531 (36%), Gaps = 89/531 (16%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 ++ +K+ V + + +PN+I++ DD G+ ++ F Sbjct: 18 PETNMSIKSIVWIVVCLSSVTVAVAAEPRPNVILVMTDDQGWAEVGFHG----------- 66 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 TP L EG TN YV + P+R+++MTGR R Sbjct: 67 ---------------NEVLKTPNLDRFAAEGTELTNFYV-SPMCTPTRSSLMTGRYHFRT 110 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 G + + + ET + E+F GY T GKWHL E+ R F Sbjct: 111 GAHDTYIGRSNMNPEETTIAEVFAGAGYRTGIFGKWHLG--------ENFPMRAEDQGFQ 162 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDR 268 + DY G Y++ P+L N AKGY +D DE+I + Sbjct: 163 KVVVHGGGGIGQFADY------PGNTYWD-PTLQYNDSFKKAKGYCTDVFIDESIQFMKD 215 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-------SQTADNYYASVYSVDQG 321 + +QPF YL N PH P D D+++ ++ + Y + D Sbjct: 216 --SGEQPFFCYLPLNVPHSPFD--VADEFRADYDNQNLADPDGRKWVAPIYGMITQFDGA 271 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 R+LE ++ GQ +NTIILF SDNG + K Y G +P + W Sbjct: 272 FGRLLEAVEDMGQRENTIILFMSDNGP---NSTYFTAGLRAKKGSVYENGIRSPFVIQWP 328 Query: 382 GKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 LQ G +D +D PT DA I +P DL++DG S+L L + QG + L Sbjct: 329 KTLQGGRKFDTPAMHIDLLPTLADACGIGLPADLQVDGKSILGLLHGETQGFQQRYLF-- 386 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-VENN 499 HN + R + +V E Sbjct: 387 ----------------------------MQHNRANVPPKYENCMARRGPWKVVGDGGEPT 418 Query: 500 QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKF 549 LY + D + +LA +P++VK + D L N + Sbjct: 419 GFELYNIEQDPGETRDLADKHPEIVKAFVREYEAWFDDVTTQLRRDNGVPY 469 >UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 Length = 491 Score = 415 bits (1066), Expect = e-114, Method: Composition-based stats. Identities = 116/513 (22%), Positives = 188/513 (36%), Gaps = 114/513 (22%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + +P+I+ + DDLG+ + F + T Sbjct: 41 ARAADAPRPHIVYILADDLGFADVGFH---------------------------GSDIKT 73 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY---SNTDAQDGIPLTETF 166 P L L +G R Y P+RAA +TGR P +G+ + A+ G+ E Sbjct: 74 PNLDHLAAQGARLGQFY-TQPFCTPTRAAFLTGRYPLHYGLQVGAIPSGAKYGLATDEFL 132 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 LP+ ++ GY TA VGKWHL +++ P+ RGFD F Sbjct: 133 LPQALKDVGYRTALVGKWHLGHAD----------------------QKFWPRQRGFDSFY 170 Query: 227 GFHAAGTAYYNSP-----SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 G ++ + + +V +GY ++ EA+ ++ A P LYLA Sbjct: 171 GPLVGEIDHFKHEAHGVTDWYHDNTQVKEEGYDTELFGKEAVRLI-AAHDPKTPLFLYLA 229 Query: 282 YNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 + APH P AP Y Q+ + + Y A + ++D + ++ L G +NT+I Sbjct: 230 FTAPHTPFQ--APQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVAALTSRGMRENTLI 287 Query: 341 LFTSDNGAVIDGPLP-----------LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389 +F SDNG N + K Y GGT W G++ PG Sbjct: 288 VFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWPGRIAPGAA 347 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + ++ +D PT A S+ K LDGV + P L + G Sbjct: 348 EGVMHVVDMLPTLAKLAGASLAKSKPLDGVDVWPALAAGQAGR----------------- 390 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE-NNQLGLYKL-T 507 ++ VR+ + LV+ V L+ + Sbjct: 391 ----------------------AGIVYNVEPTQGAVRDGRWKLVWRVVLPPTAELFDVEA 428 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 D + +++A +P+ V E+QG V + PP Sbjct: 429 DPSETTDVSAQHPEKVAELQGKVVALARTMAPP 461 >UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAN4_9FIRM Length = 470 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 127/541 (23%), Positives = 210/541 (38%), Gaps = 132/541 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+I++ +DDLG+ L SF TP + L Sbjct: 4 QPNVIMILIDDLGWMDLSCQGSSF--------------------------YETPHIDQLR 37 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP--------------- 161 EG+ F Y A V PSRA+I++G+ PAR V D ++ P Sbjct: 38 REGMAFDQAYAACPVCSPSRASILSGKYPARLKVTDWIDHENYHPCRGKLIDAPYIKELS 97 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 ++E + + FQ GY T VGKWHL K + P++ G Sbjct: 98 VSEFSMAKAFQEAGYQTWHVGKWHLGKEAT------------------------YPEHHG 133 Query: 222 FDYFMGFHAAGTAY--YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 FD +G G Y SP +N P Y++D++ EA ++ R++ +PF L Sbjct: 134 FDVNLGGSWWGHPKKGYFSPYHMENLSDGPEGEYLTDRIGAEAAALI-RSRDPQRPFFLN 192 Query: 280 LAYNAPHLPNDNPAPD------------------------------QYQKQFNTGSQTAD 309 L + A H P A D + ++ Q+ Sbjct: 193 LWHYAVHTPLQAKAEDIAYFEEKAKRMGLDQQDPFEIGDPFPILQKKDKRITRRIVQSDP 252 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQT 367 Y A + ++D V +++ LK G ++TI++FTSDNG + N K Sbjct: 253 VYAAMIKALDDSVGQLMATLKAEGLDEDTIVIFTSDNGGLATAEHSPTCNFPLSEGKGWM 312 Query: 368 YPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y G P+F+ W GK++ G+ L ++ DFYPT L+ + + DGVSL P L Sbjct: 313 YEGAVREPLFVRWPGKIEAGSLSHALTTSPDFYPTLLELCGLPLRPQQHCDGVSLAPVLL 372 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + + + W Y H+ ++ P +R Sbjct: 373 NPQAKFDRGPIFW--HYPHYGNQGGTP----------------------------GSALR 402 Query: 487 NNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 + + E++ + L+ L D+ +K N+A P +V++ ++ E++++ EVN Sbjct: 403 CGKWKYIEFYEDHSVRLFDLEQDVSEKHNVAEVYPDLVRQFHSLLHEWLEAVDAWYPEVN 462 Query: 546 Q 546 Sbjct: 463 P 463 >UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG5_9BACT Length = 462 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 130/536 (24%), Positives = 204/536 (38%), Gaps = 130/536 (24%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 S + PNI+ + DDLGY L + A Sbjct: 25 PSAASSAEKPPNIVFIFADDLGYNDLS--------------------------SYGATDI 58 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTET 165 +TP + SL ++G+RFT+ Y A V PSRAA++TGR P R G+ + DGI ET Sbjct: 59 ATPAIDSLGEQGIRFTDFYSASPVCSPSRAALLTGRYPIRQGITGVFWPQSFDGIDPAET 118 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 + EL Q +GY T VGKWHL ++ P GF + Sbjct: 119 TIAELLQENGYRTGLVGKWHLGHH-----------------------QKHLPLQNGFHSY 155 Query: 226 MGFHAAGTAYYNSPSL--FKNRERVP----AKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 G Y N + + V + Y + + T+EA+ +++ K DQPF LY Sbjct: 156 FGI-----PYSNDMDMVVYMRGNDVESYEVDQHYTTRRYTEEAVQFIEQNK--DQPFFLY 208 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA++ PH+P Y + G+ Y + +D V +IL+ L K+ +NT+ Sbjct: 209 LAHSMPHVPI-------YASENFVGTSKRGLYGDVIQELDWSVAQILDTLDKHQLSENTL 261 Query: 340 ILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAM 396 ++FTSDNG + K T+ GG P + W ++ G + + M Sbjct: 262 VVFTSDNGPWTALKHLGGSAAPLREGKMFTFDGGMRVPCLVRWPAQIPAGQTSHAMANMM 321 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D++PT A++ PK +DG+ + L ++ + Sbjct: 322 DWFPTFSRIANLDTPKSRSIDGLDITDVLTGSGPRADNEFFFFHGDGDLR---------- 371 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG------------LY 504 R+ D+ L E NQ L+ Sbjct: 372 ---------------------------AYRDGDWKLKLPYEGNQAARWRQAVAAHPILLF 404 Query: 505 KLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSEA 559 L D + +LAA +P+ + MQ + +F+ S L E+ EK +K E+ Sbjct: 405 NLAEDPGETTDLAAQHPERLAAMQARMTDFLAS----LGELPPEKI--TRKPGDES 454 >UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirellula baltica RepID=Q7URY7_RHOBA Length = 490 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 139/550 (25%), Positives = 215/550 (39%), Gaps = 104/550 (18%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 L A V + E + PN++ + +DD G+ F F Sbjct: 10 ALPAFLFAVVLVSTSTAETPSTEHPNVLFIYLDDYGWRDATFMGSDF------------- 56 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP L +L + G+ F+N Y P+RA++++G+ R +Y+ Sbjct: 57 -------------YETPNLDALAERGMVFSNAYSCAANCAPARASLLSGQYSPRHEIYNV 103 Query: 154 TDAQDG---------IPLTET------FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 + G IP TET ++ GY T +GKWHLS Sbjct: 104 GTERRGNPKHGTLQHIPGTETLSSDIQTWAHQVRDAGYRTGIIGKWHLSD---------- 153 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT---AYYNSPSLFKNRERVPAKGYIS 255 P GFD + +G+ Y+ + Y++ Sbjct: 154 -----------------DPLPYGFDINVAGTHSGSPPKGYFPPHPKVPGLQDTSDDEYLT 196 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD---QYQKQFNTGSQTADNYY 312 D+LTDEAIG ++ + + LYL++ A H P PD +Y+ + Sbjct: 197 DRLTDEAIGFIEANQEWS--WFLYLSHFAVHTPLQA-KPDLVAKYKAKQPGTLHDHAVMA 253 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 A + SVD+GV R++E L++ G +NT I+FTSDNG GP +GYK Y GG Sbjct: 254 AMIESVDEGVGRMVETLRELGLEENTAIVFTSDNGGF--GPATSMKPLRGYKGTYYEGGI 311 Query: 373 HTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P F+ W G + G D + A D YPT ++ +P D LDGVSL+P L+ ++ Sbjct: 312 REPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAKLPADQPLDGVSLMPLLK-QEGS 370 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + L W + P + + Q D S+ +R+ + Sbjct: 371 LADRELYW-----------HFPAYLQSYSVTDGQRDLLY-------RSRPCGIIRDGRWK 412 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 L E+ L LY L TD + +NLA ANP + + + + + + E Sbjct: 413 LHEYFEDGGLELYDLVTDPGESNNLADANPIKTQALHSKLVAWRERIGASMPT---EPNP 469 Query: 551 NIKKALSEAK 560 N SEAK Sbjct: 470 NHD-PASEAK 478 >UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=Q0C069_HYPNA Length = 505 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 120/533 (22%), Positives = 198/533 (37%), Gaps = 99/533 (18%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 E + +PNI+++ +DD+GY + + + Sbjct: 34 SVAEKEAAASEQPNIVLIFVDDMGYADIG--------------------------SFGSP 67 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ-------- 157 TP L L EG ++T+ Y V PSRA +MTGR R G+ A+ Sbjct: 68 IARTPNLDRLAMEGQKWTSFYAPAPVCTPSRAGLMTGRLAVRSGMAGLVQARHVLFPTST 127 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD--NFTTFSAEEW 215 G+P +E + EL Q GY +AA GKWH+ + +P + Y + Sbjct: 128 GGLPQSEVTIAELLQQEGYVSAAFGKWHMGHLPEF-LPTSHGFQSYFGIPYSNDMNMPGG 186 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSLFKNR--ERVPAKGYISDQLTDEAIGVVDRAKTLD 273 D F F ++ P + ER + ++ + T+ AI ++ + Sbjct: 187 GETPWSIDLF--FEPPNIQNWDVPLMQDEEIIERPADQFTLTQRYTERAIEFMETSHAEG 244 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 QPF LYLA+N PH P + + TG Y + +D V I++ LK Sbjct: 245 QPFFLYLAHNMPHTPL-------FTSEGFTGVSAGGAYGDVIEELDWSVGEIVDALKDMK 297 Query: 334 QYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK 391 NT+++FTSDNG + G + K T+ GG P WW G++ P Sbjct: 298 IEKNTLVIFTSDNGPWLAMKTHSGSAGMLRDGKGTTWEGGMRVPAIFWWPGQIAPRTVTD 357 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 L SA+D PT + +P+D DG L P L + P + L + Sbjct: 358 LGSALDLMPTFAAISGARLPEDRVYDGFDLSPALFSEGS-SPRETLYYYRFTD------- 409 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT----------VENNQL 501 + VR Y ++ E Sbjct: 410 ------------------------------VFAVRKGKYKAHFSTYGAFGGSGRTELETP 439 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 LY + D ++ N+AA +P++V E++ + + S +P +++ + + Sbjct: 440 ELYDIEADPSEQFNIAAQHPEIVMELKVLAEKQAASVEPVENQLERYPPGEKR 492 >UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZGR2_BRAFL Length = 598 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 123/549 (22%), Positives = 202/549 (36%), Gaps = 117/549 (21%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + S+ GKPNI+ + DD G+ + + TP Sbjct: 115 QESSSGKPNIVFILADDYGWNDIGYHGSV---------------------------IRTP 147 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFL 167 L L EGV+ N YV + PSR +MTGR R+G+ G+PL E L Sbjct: 148 NLDRLAAEGVKLENYYV-QPLCSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTL 206 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 P+ + GY T VGKWHL F +++ P +RGFD F G Sbjct: 207 PQRLKEGGYSTHIVGKWHLG----------------------FYKQDYTPTHRGFDTFYG 244 Query: 228 FHAAGTAYYNS---------PSLF-------KNRERVPAKG-YISDQLTDEAIGVVDRAK 270 + Y+ P + +NR G Y + ++AI ++ + Sbjct: 245 YLTGAEDYWTHRQKGGLPGQPQTWSGLDLRDQNRPVTDQNGTYSTHLFANKAIEII-AQQ 303 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 ++P L+L++ A H P P D + + Y A +DQ V + LK Sbjct: 304 DKNKPMFLFLSFQAVHDPLQAPEEDI-SRYSHISDTNRRVYAAMTTIMDQAVGNVTRALK 362 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG-KLQPGNY 389 + G +DNT+++F++DNG +D +N +G+K + GG F+ K + Sbjct: 363 QYGLWDNTVLIFSTDNGGRVDRG-GINWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTS 421 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 D LI D++PT + A S LDG + + D K + L I H Sbjct: 422 DALIHISDWFPTLVGLASGSTNGTKPLDGHDVWEAISDGKPSPRREILHNIDPMFHTVPS 481 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN---------- 499 W + + +R+ D+ L+ N Sbjct: 482 PRPHQWGDRVF-----------------NTSVHAAIRSGDWKLLTGYPGNTSRVPPPSST 524 Query: 500 ----------QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL-----SE 543 L L+ + D +++ +L+ +P VV+E+ + + ++ P + Sbjct: 525 KEEPADTPGKHLWLFNIREDPEERTDLSQKHPGVVQELLEKLARYNRTAVPVFYPSFDPQ 584 Query: 544 VNQEKFNNI 552 N NI Sbjct: 585 ANPALHGNI 593 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 139/563 (24%), Positives = 208/563 (36%), Gaps = 140/563 (24%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 +L A E +T KPN+I + DDLG G L G + + Sbjct: 3 RLTALFFAALAGCAFAAEPATPAKPNVIFILADDLGIGDL----GCYGQQ---------- 48 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 + TP + L +G+RF Y V PSR A+MTGR + N Sbjct: 49 ------------KIRTPNIDHLAADGMRFLQHYTGCSVCAPSRCALMTGRHMGHAAIRDN 96 Query: 154 T------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + Q +P + L QN GYYT +GKW L Sbjct: 97 AQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLGMPE----------------- 139 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKNRER----------------VPA 250 + P++ GF+Y G+ A+ Y P L++N ER + Sbjct: 140 -----DHSSPRDMGFNYSFGYLCQSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEP 194 Query: 251 KG--YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA-------------PD 295 KG Y D + +A+ V D+PF LYLA+ PHL P P Sbjct: 195 KGEIYSHDVMASDALKFVRDHH--DKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPF 252 Query: 296 QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGP 353 + K + Y + +D+ V R++ LK+ G DNT++ F+SDNGA + G Sbjct: 253 RNTKHYANNETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVFFSSDNGAVFPLAGT 312 Query: 354 LP----LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADI 408 P G +GYK Y GG TP+ W GK++ G S DF PT + + Sbjct: 313 DPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASVFYDFLPTMAELNGV 372 Query: 409 SIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 P D DG+S LP L K Q + H L W Sbjct: 373 PPPADT--DGLSYLPTLLGKPAQQKQHDFLYWEYQ------------------------- 405 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTV----ENNQLGLYKL-TDLQQKDNLAAANPQV 522 + + VR D+ + N +Y L +D + ++AA +P++ Sbjct: 406 ----------SAGGAVAVRMGDWKAIANKIKKNPNANFEVYNLASDRTESHDVAAEHPEI 455 Query: 523 VKEMQGVVREFIDSSQPPLSEVN 545 V + + ++ + + P+ E N Sbjct: 456 VAKAREIIAR--EHTPSPIKEWN 476 >UniRef50_A6DHI2 Aryl-sulphate sulphohydrolase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI2_9BACT Length = 493 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 118/532 (22%), Positives = 212/532 (39%), Gaps = 89/532 (16%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 K A F KP+II++ +DDLG+ L + + Sbjct: 2 IIKIQCALVFFLALAGFAAEKPHIILINIDDLGWTDLSYQGSKYYES------------- 48 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA 156 P + +L G+ F GY A PSRA++++G+ R VY+ + Sbjct: 49 -------------PNIDALAKSGMIFDQGYAAAANCAPSRASLISGQQSPRTEVYTVGNP 95 Query: 157 QDG-------IPLTET--------FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 G IP + + + GY TA +GK+H++K Sbjct: 96 ARGASNKRKLIPSPNIDFVDADNFTIADAMNSAGYLTATLGKYHVAKDPLT--------- 146 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDE 261 N G F G + G Y+SP + N + Y+ D LTDE Sbjct: 147 ------------HGWKINVGGREFGGPYNGG---YHSPYEYPNLKETEKGRYLCDHLTDE 191 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA--PDQYQKQFNTGSQTADNYYASVYSVD 319 AIG + + QP +Y Y H P +Y+ + T Y A + ++D Sbjct: 192 AIG-IFKEHGAQQPIFMYFPYYTIHAPIQGHPKFEPKYKAKAKTKGHFNPKYAAMIEALD 250 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW 379 V R++ L++ G + T+I+FTSDNG + + K Y GG P F Sbjct: 251 HNVGRLVAALEEQGLREKTLIMFTSDNGGHMK--FSRQEPLRAGKGSYYEGGIRVPFFAS 308 Query: 380 WKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNL 437 W G ++ G+ + ++ +DFYPT + A + +P D +DG S LP L+ + + ++ L Sbjct: 309 WPGVIEAGSRSQVPVTGLDFYPTVCELAGVELPDDKVVDGKSFLPLLKSEVDEDLKNRAL 368 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 W + + ++ P + + ++ +R+ + L + E Sbjct: 369 YWHFPI---------------YLQAYLKPNEKPESRDPLFRTRPGSVIRHGKWKLHHYFE 413 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQE 547 ++ + LY + D +K++L++ P+VV +++ + + + + E+N + Sbjct: 414 DDGVELYDINSDRSEKNDLSSEYPEVVSKLRNKLDSWRNGIGAFIPTELNPD 465 >UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01N83_SOLUE Length = 461 Score = 413 bits (1062), Expect = e-113, Method: Composition-based stats. Identities = 125/495 (25%), Positives = 196/495 (39%), Gaps = 91/495 (18%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 + +PNI+V+ DDLGYG L + +TP + Sbjct: 23 GQQRQPNIVVILADDLGYGDLGCY---------------------------GSPIATPNI 55 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-GIPLTETFLPELF 171 L +EG RFT+ Y A V PSRAA+MTGR P R V D G+P +E + ++ Sbjct: 56 DRLAEEGARFTSFYSASPVCSPSRAALMTGRYPTRVEVPVVLGPGDAGLPDSEITMAQVL 115 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 ++ GY T+ +GKWH+ S + P NRGFD F G + Sbjct: 116 KSAGYRTSCIGKWHIG-----------------------STPGYLPTNRGFDEFFGVPYS 152 Query: 232 GTAYYNSPSLFKNRERVPAK---GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 L + V ++ T EA+ + RA+ D PF LYLA+ APHLP Sbjct: 153 AD--ITPCPLMRGSSVVAPAVDCSTLTSSFTQEALDFMRRAQ--DNPFFLYLAHTAPHLP 208 Query: 289 NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 A ++ Q Y V +D +++ LK G NT+++F+SDNG Sbjct: 209 L--AASPRFAGQ-----SGLGMYADVVQELDWSTGQVMAALKATGLDSNTLVMFSSDNGP 261 Query: 349 VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAAD 407 G G +G K +TY GG P + G + G L + MD PT A Sbjct: 262 WYQGSQ---GKLRGRKGETYEGGMREPFLARYPGVIPSGIGCAGLATTMDLLPTLARLAG 318 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 P + LDGV + P L ++ + + + + W R+ + Sbjct: 319 AQTPSN-PLDGVDIWPVLTGERAEVDRDVFLYFDAV--YLQCARLGRWK--LHLSRYNTK 373 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEM 526 + P + + + LY ++D Q+ + AA++P +V ++ Sbjct: 374 AWSPLPPGGRV----------------NLPLPRPELYDVVSDPQESYDCAASHPAIVADI 417 Query: 527 QGVVREFIDSSQPPL 541 + V + + P + Sbjct: 418 RARVERMVQTFPPGI 432 >UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LDP6_PARD8 Length = 452 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 135/532 (25%), Positives = 208/532 (39%), Gaps = 135/532 (25%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 +S KPNIIV+ DD+GYG L TP Sbjct: 20 HSQPTKPNIIVINCDDMGYGDLSCFGS--------------------------PTIKTPN 53 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN------TDAQDGIPLTET 165 + + EG ++++ YV+ VS PSRA ++TGR R G+Y + D++ G+P E Sbjct: 54 IDRMAIEGQKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEEL 113 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 + EL + GY+TA +GKWHL + E+ P GFDYF Sbjct: 114 TIAELLKQAGYHTACIGKWHLGHL-----------------------PEYMPLRHGFDYF 150 Query: 226 MGFHA------------AGTAYYNSPSLFKNR---ERVPAKGYISDQLTDEAIGVVDRAK 270 G+ T Y +++ ER P + ++ Q+T+ AI + + Sbjct: 151 YGYPYSNDMSRKEQIKLGNTKYPYEYIIYEQEKELEREPQQYNLTQQVTEAAIRYIKSNE 210 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 PF LYLA+ PH+P Y G Y +V +D V +IL+ LK Sbjct: 211 NS--PFFLYLAHPMPHMPV-------YASTDFQGKSARGKYGDTVEELDWSVGQILQTLK 261 Query: 331 KNGQYDNTIILFTSDNGAVI----DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 G NT+++FTSDNG + +G P G K K+ + GG P M W ++P Sbjct: 262 SEGLDKNTLVIFTSDNGPWLLCKQEGGSP--GPLKDGKASMFEGGFRVPCIM-WGAMVKP 318 Query: 387 GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 G + S +D PT + A I +P D DG+SLL L+DK + + S + Sbjct: 319 GYITDMASTLDLLPTFCEIAGIPLPSDRHYDGISLLNVLKDKSTCKRDVFYFYRGSELY- 377 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN-------- 498 +R Y ++ Sbjct: 378 -------------------------------------AIRKGKYKAHFSYRPAYGTTDKI 400 Query: 499 --NQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 ++ LY L TD + N+A +P +V+E+ + S + S +Q+ Sbjct: 401 IYDKPVLYDLGTDPGELYNIAEEHPDIVQELTMLANAHKASLKIAKSIFDQK 452 >UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED2_PARD8 Length = 468 Score = 412 bits (1060), Expect = e-113, Method: Composition-based stats. Identities = 128/551 (23%), Positives = 209/551 (37%), Gaps = 117/551 (21%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +KL + +V +PN+I++ +DD GYG L + Sbjct: 1 MKLISNIISVLAFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGST------------- 47 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 + TP + + EG+R T+ YV VS PSR+A++TG P R ++ Sbjct: 48 -------------KHRTPHIDQMAKEGIRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHV 94 Query: 153 NTD---------------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPED 197 N D + G+ E + EL + GY TA +GKWHL Sbjct: 95 NADPTPLMSKGRQVLFPASHKGLNPGEITIAELMKEQGYATACIGKWHLGDQ-------- 146 Query: 198 KQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG---TAYYNSPSLFKNRERVPAKGY- 253 + P +GFDY+ G + Y P + + V G+ Sbjct: 147 ---------------LPFLPTRQGFDYYYGIPYSNDMDRPYCPLPLMEQEEVIVAPVGHD 191 Query: 254 -ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 ++ + T++ + + K PF +YL +N H P G Y Sbjct: 192 SLTIRYTNKTVEFIKSHKES--PFFIYLCHNMTHNPLAA-------SPAFKGKSQNGLYG 242 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 + +D + +LE LK+ G NT+I+FTSDNGA + N +G K TY GG Sbjct: 243 DATEELDWSMGVLLETLKEEGLDQNTLIIFTSDNGA-DEHFGGTNRPLRGQKGTTYEGGF 301 Query: 373 HTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P M W K+ G D L+++MDF PT ++P D +DG ++ L+ + Sbjct: 302 RVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCSYAVPSDRVIDGHNVSGILEGESMA 361 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNY-HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 P + + + + W N+ + + PH P+T Sbjct: 362 SPTETFYY-----YQKQQLQAVRWGNWKYHLPLKERIKGPHFPDT--------------- 401 Query: 491 SLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV----- 544 E + LY L DL + N+ +P+VV +M ++I+ + + + Sbjct: 402 ------EVGEARLYNLANDLSETTNVIDKHPEVVTKM----NQWIEQVRSDMGDWGYEGR 451 Query: 545 NQEKFNNIKKA 555 NQ I + Sbjct: 452 NQRPAGIIDEP 462 >UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM29_9BACT Length = 481 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. Identities = 139/507 (27%), Positives = 211/507 (41%), Gaps = 79/507 (15%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 TP +PNI+++ DDLGYG L Q Sbjct: 26 TPQTKKDTERPNIVLILCDDLGYGDLACYG--------------------------HKQI 59 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-----GIPL 162 TP L + EG+RF + Y A V SR ++TGR+P R GVY + Sbjct: 60 KTPNLDQMAKEGIRFNHFYSAAPVCSASRVGLLTGRSPNRAGVYDWIPHSSESSSPHMRK 119 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 E P+L Q GY T GKWH N + + QPQ+ GF Sbjct: 120 NEITFPQLLQKAGYATCLSGKWHC-------------------NGALINTNQAQPQDAGF 160 Query: 223 DYFMGF-HAAGTAYYNSPSLFKNR-ERVPAKGYISDQLTDEAIGVVDRA--KTLDQPFML 278 DY+ + A ++ N + +N E P +G+ +T+EAI ++ + QPF + Sbjct: 161 DYWFATQNNAAPSHKNPVNFIRNGVELGPIEGFSCQIVTNEAINWMEDHVKQNEKQPFFI 220 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 YL+++ PH P +P + + Y+A+V ++D+ V ++ QLKK DNT Sbjct: 221 YLSFHEPHEPIASPQKIVDTYKGIAENTNQAEYFANVENLDKAVGSLMNQLKKLKINDNT 280 Query: 339 IILFTSDNGA-------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-D 390 +++FTSDNG G KG K T G P M W K+ G D Sbjct: 281 LVIFTSDNGPETLNRYEAASRSYGSPGELKGMKLWTAEAGFRVPAIMHWPEKIATGQISD 340 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 ++ISA+DF+PT D A S K L LDG + P L KK+ HK L WI Y +E Sbjct: 341 QVISALDFFPTFCDLAQASNSKSLNLDGSNFTPALH-KKKMTRHKPLLWI--YYAALNER 397 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DL 509 + K + HN +++ + + + LY L+ D Sbjct: 398 QVAMRHGDWKISAKLNLPRYHNITSKNFPKVTAA------------TLSDYQLYNLSKDK 445 Query: 510 QQKDNLAAANPQVVKEMQGVVR-EFID 535 + ++L+ NP+ +M ++ ++ D Sbjct: 446 SEANDLSNQNPKKSAQMIKFLKLQYQD 472 >UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAW4_FLAJ1 Length = 539 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. Identities = 137/550 (24%), Positives = 222/550 (40%), Gaps = 115/550 (20%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 K + AF T +++ KPNII+L DDLG + G Sbjct: 41 KLAEGKAAFLSQKDTSAASEKKPNIIILLADDLGKYDISLYGG----------------- 83 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG------ 149 TP + SL GV FT+GYV+ + PSRA ++TGR RFG Sbjct: 84 ---------KSTPTPQIDSLAASGVTFTDGYVSSSICSPSRAGLLTGRYQERFGHEYQPG 134 Query: 150 --------------VYSNTDAQD----------------GIPLTETFLPELFQNHGYYTA 179 NT++ G+P +E +L + GY TA Sbjct: 135 DRYPKNNLEYYAFKYLLNTNSWRLNPKIEYPNDASIATQGLPKSEITFADLAKKQGYSTA 194 Query: 180 AVGKWHLSKISNVPVPEDKQTR---DYHDNFTTFSAEEWQPQ--NRGFDYFMGFHAAGTA 234 +GKWHL P D+ ++ F+ F+ E+ P N F G Sbjct: 195 IIGKWHLGHTKGF-FPLDRGFDYHYGFYQAFSLFAPEDNNPDIINHHHTDFTDKTIWGNG 253 Query: 235 YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP 294 + + ++ + K Y++++ +EA +D+ K ++PF+LY+ +NAPH P Sbjct: 254 RVGTGQIRRDSTIIDEKKYLTEKFAEEAEAFIDKNK--NKPFLLYVPFNAPHTPFQVR-- 309 Query: 295 DQYQKQFNTGSQTADN-YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 +Y +F Y+A + ++D + I ++KK G +NT+I F SDNG Sbjct: 310 KKYYDRFPNVKDENKRVYFAMISALDDAIGLIRAKVKKEGLEENTLIFFASDNGGADYTY 369 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPK 412 N KG K + GG + P + WKGK++P +S++D + T +PK Sbjct: 370 ATTNAPLKGGKFSHFEGGVNVPFALSWKGKIKPHTIYKTPVSSLDIFSTIAAVTHSGLPK 429 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHN 472 D DGV L+ + + KQ H+NL W + Sbjct: 430 DRVYDGVDLVDVVNNNKQA--HQNLYWRSGD----------------------------- 458 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR 531 + +R+ D+ L+ + + ++ LY L D + +LA+ NP+ VKE+Q ++ Sbjct: 459 ---------AKAIRSGDWKLIISGKTHETWLYNLAKDKSETTDLASKNPEKVKELQTALQ 509 Query: 532 EFIDSSQPPL 541 + PL Sbjct: 510 NWEKGLIKPL 519 >UniRef50_A6DPC8 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DPC8_9BACT Length = 598 Score = 411 bits (1058), Expect = e-113, Method: Composition-based stats. Identities = 129/532 (24%), Positives = 206/532 (38%), Gaps = 102/532 (19%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + +T KPN IV+ DD GY L Sbjct: 1 MKKLISILVLGLAYLHLQATDKKPNFIVIFTDDQGYQDLGCFGS---------------- 44 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN- 153 + TP + + EG R+TN Y A+ + SRAA++TGR P+R GV+ Sbjct: 45 ----------PKIKTPEIDQMAKEGARYTNFYSANAICSASRAALLTGRYPSRNGVFHVY 94 Query: 154 -TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 A G+ +E + E+ + GY T+ +GKWHL N +P ++ Y FS Sbjct: 95 YPGASQGLKPSEITIAEVLKTAGYRTSIIGKWHLGD-RNQFLPTNQGFDSYFG--IPFSN 151 Query: 213 EEWQPQNRGFDYFMGFHAA----------------GTAYYNSPSLFKNRERVP---AKGY 253 + W ++ + G L ++ E V + Y Sbjct: 152 DMWMSKDLALADDIKLFGGVTVEQIKSGEASKAVKGEKRGGKVPLMRDEEVVEYPVDQTY 211 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA 313 I+ + TDEA+ ++ ++ QP+ +YLAY PH+P Y G Y Sbjct: 212 ITQRYTDEALKIIKESEKKKQPYFIYLAYAMPHVPL-------YASPKFAGKSARGPYGD 264 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG-AQKGYKSQTYPGGT 372 +V +D V RIL+ LK +G NT+++FTSDNG G + +G K TY GG Sbjct: 265 TVEEMDYHVGRILKHLKSSGADKNTLVIFTSDNGPWNLGERGGSALPLRGAKFSTYEGGH 324 Query: 373 HTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P MWW G + G ++ + +DF PT A+ +P + LDG ++ P L+D +G Sbjct: 325 RVPCVMWWPGTIPAGTDSAEIATTLDFMPTFAKLANAQLP-NRTLDGKNIAPMLRDGNKG 383 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + + S +H +R + Sbjct: 384 KSPYEKFYFWSKNHIE------------------------------------ALRIGNMK 407 Query: 492 LVYTVEN-----NQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 L + + + L+ L D+ + NLA P+ V M ++ E Sbjct: 408 LRMSWDKKNNVRKETELFNLEGDIAESHNLAPQMPEKVAAMTKMLLEAEQEQ 459 >UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Planctomycetaceae RepID=Q7UJQ8_RHOBA Length = 491 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 121/537 (22%), Positives = 191/537 (35%), Gaps = 128/537 (23%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 + F+ T + +PNI+ + DDLGYG L G + + Sbjct: 20 LGFATAPSTSAADAKRPNIVFILADDLGYGDL----GCYGQEL----------------- 58 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--- 158 TP L + EG+RFT+ Y + V PSR+ +MTG V N D Sbjct: 59 -----IQTPRLDQMAAEGMRFTDFYAGNTVCAPSRSVLMTGMHMGHTHVRGNAGGPDMSK 113 Query: 159 -GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 + + E+ Q+ GY TA GKW L + P Sbjct: 114 QSLRDENVTVAEVLQSAGYATALCGKWGLGD-------------------DALGGRDGLP 154 Query: 218 QNRGFDYFMGFHAAGTAYYNSPS-LFKNRERVPAKG----------------------YI 254 + +GFD+F G+ A+ P L++N +V + Y Sbjct: 155 RKQGFDHFYGYLNQVHAHNYYPEFLWRNETKVALRNEVQRRDRSYGGFTGGWATKRVDYS 214 Query: 255 SDQLTDEAIGVVDRAKT--LDQPFMLYLAYNAPHLPND--------NPAPDQYQKQFNTG 304 D + +EA+G + T +PF LYL+ PH N+ PD Sbjct: 215 HDLIANEAMGFIREKATDAATKPFFLYLSLTIPHANNEGTGMSGNGQEVPDYGIYADKDW 274 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-----LNGA 359 S A + +D V RIL+ LK+ + T+++F+SDNG +G G Sbjct: 275 SDQDKGQAAMITRMDSDVGRILDLLKELQIDEQTVVMFSSDNGPHNEGGHNPKKFDPAGP 334 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDG 418 +G K GG P+ + W G PG I D TA + A P+D D Sbjct: 335 LRGMKRALTEGGIRVPLIVRWPGTTPPGAVSDHIGYFGDLMATAAELAGTDFPEDA--DS 392 Query: 419 VSLLPWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 +S P + + + + H+ L W Sbjct: 393 ISFAPTIVGRPEAQQTHEYLYWEFYEQG-------------------------------- 420 Query: 478 LSQFSYTVRNNDYSLV-YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 VR ++ + LY L D+ + NLA+ +P++VK+++ ++ E Sbjct: 421 ---GRQAVRRVNWKAIREPWMTGPTQLYDLKADIGETTNLASDHPEIVKQLETLMEE 474 >UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S6_9BACT Length = 626 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 133/549 (24%), Positives = 202/549 (36%), Gaps = 134/549 (24%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 E S K +PNI+ + DDLG+ + T Sbjct: 20 AESSPKTRPNIVFILADDLGWSDTTLYGTT-------------------------KFFET 54 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-------------- 155 P + L G++FTN Y A+ V P+RA+IMTG P R G+ + + Sbjct: 55 PNIERLAARGMKFTNAYAANPVCSPTRASIMTGLYPGRLGITTPSGHVPEEKLEASLVAR 114 Query: 156 -----------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYH 204 + + L L E + GY T GKWHL Sbjct: 115 GSPSQKSLQATSATRLKLEYFTLAEALKGAGYATGHFGKWHLG----------------- 157 Query: 205 DNFTTFSAEEWQPQNRGFD----YFMG-FHAAGTAYYNSPSLFKNRERVPAKGYISDQLT 259 E + P ++GFD ++ G A A + SP + D ++ Sbjct: 158 -------PEPFDPLHQGFDVDVPHWSGPGPAGYIAPWKSPKFHL---PAKPGEQLEDLMS 207 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPN--DNPAPDQYQKQFNTGS-QTADNYYASVY 316 EAI + K D+PF L + H P ++Y+++ + S Q Y A V Sbjct: 208 QEAIKFIRVHK--DEPFYLNYWAFSVHSPWGGKPDLIEKYRRKADPNSAQRNPVYGAMVE 265 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV------------IDGPLPLNGAQKGYK 364 S+D V R+L+ L + D+TII+F SDNG V ++ P N + K Sbjct: 266 SLDDAVGRLLDTLDELKLSDHTIIVFFSDNGGVNWFEPAMKEEAGMNSPPTTNAPLRAGK 325 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GGT P + W GK + D ++ ++DFYPT L+ A ++ DLK DGVS +P Sbjct: 326 GTLYEGGTREPCVVVWPGKTKAATQNDAMLCSVDFYPTLLEMAGVAAKPDLKFDGVSQVP 385 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L G P L P + V + Sbjct: 386 ALLG--TGTPRDTLFCYY-----------PVYSPPGHVVHTMPGVWG------------- 419 Query: 484 TVRNNDYSLVYTVEN-----NQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 R D+ L+ + ++ LY L DL + +LAA P VKE+ ++ + + Sbjct: 420 --RRGDWKLIRYFHDADDQSDRYELYNLHDDLGETKDLAARFPDKVKELNALIDAHLAET 477 Query: 538 QPPLSEVNQ 546 + N Sbjct: 478 HALIPGKNP 486 >UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT Length = 500 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 118/515 (22%), Positives = 193/515 (37%), Gaps = 100/515 (19%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 T KPNI+ + DD GYG L A Sbjct: 20 ATAQGAPSKPNIVFILADDTGYGDLS--------------------------ATGNPILK 53 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLP 168 TP L L + VRFT+ +V P+R+A+MTGR + GV ++ + + Sbjct: 54 TPHLDKLYNAAVRFTDFHV-SPTCSPTRSALMTGRHEFKNGVTHTILERERLNPDAITIA 112 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 ++ ++ GY T GKWHL + QP RGFD Sbjct: 113 QVLKSAGYTTGIFGKWHLGD-----------------------EPDHQPGQRGFDEVFIH 149 Query: 229 HAAGTAY-------------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 G Y +P++ N +G+ +D T++AI ++ K QP Sbjct: 150 GGGGIGQTYPGSCGDAPGNTYFNPAILHNGSFEKTQGFCTDIFTNQAIHWMESVK-GKQP 208 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFN-TGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 F Y+ YNA H+P PD+Y+K + Y+ V ++D+ V R+L +L + G Sbjct: 209 FFCYIPYNAAHVPVSC--PDEYKKPYEGKVDDHLATYFGMVANIDENVGRVLAKLDEWGI 266 Query: 335 YDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS 394 +T+++F +DNG N +G K + GGT W P + L S Sbjct: 267 AKDTLVVFMNDNGGHGPACKVFNAGMRGSKGSAWLGGTRAVSLWRWSDTFAPHDAAGLAS 326 Query: 395 AMDFYPTALDAADISI--PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 +DF+PT + A + ++DG SLLP L+D P + L Sbjct: 327 NIDFFPTLAELAGATPNEKAQKQVDGRSLLPLLRDGNAPWPERVLF-------------- 372 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ-----LGLYKLT 507 +P + + + +VR+ + LV + L+ ++ Sbjct: 373 -----------THVGRWPKGADVQAYKYAACSVRSGQWHLVSDGPPGKPREKGWKLFDVS 421 Query: 508 -DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 D+ + ++ A +P VV + + S P + Sbjct: 422 KDIGEDHDVVAEHPDVVTRLDAEYDRWWASVVPMM 456 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 411 bits (1057), Expect = e-113, Method: Composition-based stats. Identities = 136/606 (22%), Positives = 224/606 (36%), Gaps = 151/606 (24%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 +K + ++V+ S + + + A + + ++ + +T +PNI Sbjct: 23 LKHNMNQAVLMPSRKWVRWALLLVCVAGVPN------LDSTTVSAEEPNAKDATVSRPNI 76 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +++ DDLGYG+L G + + TP L L EG+ Sbjct: 77 VLIVADDLGYGEL----GCYGQ----------------------TKIRTPRLDQLAAEGI 110 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----------------AQDGIPLTE 164 + TN Y + V PSR +MTG+ P V +N D Q +P+ E Sbjct: 111 KLTNFYSGNAVCAPSRCCLMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDE 170 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + E ++ GY T A GKW L P +GFD Sbjct: 171 VTIAEYLKSVGYRTGAFGKWGLGHFGTTG----------------------DPNEQGFDL 208 Query: 225 FMGFHAAGTAYYNSPS-LFKNRER---------VPAKGYISDQLTDEAIGVVDRAKTLD- 273 F GF+ A+ + P+ L++NR + + + Y DQ +EA + ++ D Sbjct: 209 FYGFNCQRHAHNHYPNFLWRNRVKEVQPGNDRTLHGETYSQDQFVNEACEFIRQSVAEDK 268 Query: 274 -QPFMLYLAYNAPHLPNDNPAP--DQYQKQ----------FNTGSQTADNYYASVYSVDQ 320 QPF YL + PHL P D Y + + Y A V +D+ Sbjct: 269 TQPFFAYLPFAVPHLSIQVPEEEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRMDE 328 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLN-------GAQKGYKSQTYPGGTH 373 GV ++++ + G +NT+I+FTSDNG D + KG K Q GG Sbjct: 329 GVGQVVDLVDSLGLGENTLIMFTSDNGPTYDRLGGSDSDYFNSASGMKGLKGQLDEGGIR 388 Query: 374 TPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 PM G + G I A DF PT DAA + + DG+S LP L + Sbjct: 389 VPMIARQTGVVPAGRTSDWIGAWWDFLPTITDAAGVEVDASTT-DGISFLPLLHGDDAAQ 447 Query: 433 P-HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 H+ L W +R ++ Sbjct: 448 QSHEFLYWEFPGY-----------------------------------SGQQAIRMGNWK 472 Query: 492 LVYTV----------ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR-EFIDSSQP 539 + E LY L+ DL + ++++A++P V+ +++ + + + + S Q Sbjct: 473 AIRKDLSKRLKKGQTEPPAFALYDLSKDLAESNDVSASHPDVMAKIEAIAKQQHVPSEQF 532 Query: 540 PLSEVN 545 PL ++ Sbjct: 533 PLRVLD 538 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 130/552 (23%), Positives = 213/552 (38%), Gaps = 108/552 (19%) Query: 2 KSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNII 61 K ++K S +I ++L+ A A D ++ +T +PN+I Sbjct: 10 KPSMKFSPFVAAILILLSLNECHGQAPAVQDGD----------ANAKSESDATSRRPNVI 59 Query: 62 VLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVR 121 V+ DDL G L GS TP L E ++ Sbjct: 60 VILADDLAVGDLAGGDGS--------------------------PTRTPNLDRFASESIQ 93 Query: 122 FTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----IPLTETFLPELFQNHGYY 177 F+ Y V P+RAA++TGR P R GV + + + ET + ++ ++ GY Sbjct: 94 FSQAYSGSCVCAPARAALLTGRYPHRTGVVTLNMNRYPEMTRLRRDETTIADVLKDAGYA 153 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG-TAYY 236 T VGKWH + + + P +RGFD F GF + Y+ Sbjct: 154 TGLVGKWHTGR-----------------------GDGFHPLDRGFDEFEGFFGSDDVGYF 190 Query: 237 NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 P + + + Y++D L AI V R + PF L+LA+ APH P + P Sbjct: 191 RYPFSEQRQISDVDESYLTDDLNRRAIEFVRRHH--EHPFFLHLAHYAPHRPLEAPPEVI 248 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 + + ++ YA + +D+G+ +L ++ G ++TI+LF SDNG Sbjct: 249 ARYREQGFDESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFASDNGPDPLTGERF 308 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKL 416 N +G K Q GG P+F+ W +L PG D++++ +D PT LD + + +L Sbjct: 309 NRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDLMPTILDLCRVDVSMLNRL 368 Query: 417 DGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTE 476 DG S +P L+D W + S +Y HN Sbjct: 369 DGESFVPVLEDASIAH-STMRFWQWN---------------------RASPNYTHN---- 402 Query: 477 DLSQFSYTVRNNDYSLVYTVE------NNQLG---LYKL-TDLQQKDNLAAANPQVVKEM 526 VR+ Y LV + L+ L D + +++ P + + M Sbjct: 403 ------AAVRHGRYKLVRPYVTRGAKLKDSTEPSVLFDLQNDPTESRDVSKQYPDIAERM 456 Query: 527 QGVVREFIDSSQ 538 + + S + Sbjct: 457 SRELDRWSASVE 468 >UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN Length = 499 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 134/516 (25%), Positives = 202/516 (39%), Gaps = 77/516 (14%) Query: 27 AHAADDVKLKATKTNVAFSDFTPTEYST-----KGKPNIIVLTMDDLGYGQLPFDKGSFD 81 AHAA T S TE S K +PNI+++ DDLGY + Sbjct: 16 AHAAMTFVAFVLATTFVISSTAATEESAADAASKRRPNIVLIFCDDLGYADIGCFG---- 71 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 A TP L L EG++FT+ VA V SRAA++T Sbjct: 72 ----------------------AKGYETPNLNKLASEGMKFTDFQVAAAVCSASRAALLT 109 Query: 142 GRAPARFGVYSNTDAQD--GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 G P R G+ S D GI E + EL QN GY TA GKWHL +Q Sbjct: 110 GCYPQRVGILSALGPSDSIGIAKNELLISELLQNLGYKTACFGKWHLGH--------HEQ 161 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNR--ERVPAKGYISDQ 257 + F T+ + D + A AY P + N+ E P + ++ Sbjct: 162 FLPQQNGFATYFGLPYSN-----DMWPKHPTAKNAYPPLPLIDGNKTIELNPDQTKLTTW 216 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 T++A+ + ++PF LY+ +N PH+P + + G + + Sbjct: 217 YTEKAVKFI--HDCGEKPFFLYVPHNMPHVPL-------FVSEKFAGKTKRGLFGDVIAE 267 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTP 375 +D V I + L+ G DNT+++FTSDNG + G + K + GG P Sbjct: 268 IDWSVGEITKALEATGNVDNTLVIFTSDNGPWLSYGDHAGSTGGFREGKGTVWEGGHRVP 327 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 M + G +QPG DKL S +D +PT +I K+DGVS+ P L+ + + Sbjct: 328 MIAKYPGTIQPGTTCDKLASTIDLFPTIAHYCGATIDPSRKIDGVSIQPLLESVEGAKSS 387 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 Y +W + + + H P T+ + Y Sbjct: 388 HEFF----YYYWGNGLEAVRDERFKLHFPHAFRSLTGTPGTDGMPNG------------Y 431 Query: 495 TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGV 529 T +L L+ L D ++ N+AA +P+V + Sbjct: 432 TQAKTELALFDLDADPFEQTNIAADHPEVTARLTAA 467 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 133/560 (23%), Positives = 209/560 (37%), Gaps = 154/560 (27%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 ++ + + + PNI+ + DDLGYG+L G + + Sbjct: 10 TISITCTAQAQKAPDKLPNIVYIYADDLGYGEL----GCYGQQ----------------- 48 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----- 155 + TP L L EG+RFT Y V P+RA +MTG+ + N + Sbjct: 49 -----KIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGKHAGHSAIRGNFELGGFR 103 Query: 156 -----AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 Q +P E + EL + GY TA GKW + + Sbjct: 104 DEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNT------------------- 144 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNR------------------------ 245 E P +GFDY+ G+ A+ P L++N Sbjct: 145 ---EGTPTRQGFDYYYGYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDA 201 Query: 246 --ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ--- 300 E K Y ++T++A+ +DR+K PF LY+ Y PH+ APD+Y K+ Sbjct: 202 DFESFKGKEYAPAKMTEKALAFIDRSKAG--PFFLYMPYTLPHVSLQ--APDEYVKKYIG 257 Query: 301 ------------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 + + Y + + +D V IL++LK G DNTI++F+SDNGA Sbjct: 258 QFDEKPYYGEKNYASTKYPLSTYASMITFLDDQVGIILDKLKALGLDDNTIVMFSSDNGA 317 Query: 349 VIDG---PLPLN--GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTA 402 +G P N +G K Y GG P + W GK++PG +SA D PT Sbjct: 318 TFNGGVNPQFFNSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGRVSDHVSAQFDLMPTL 377 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 + + P DG+S LP L + + H+ L + Sbjct: 378 AELTGQASP---PTDGISFLPELLGQTNRQKKHEFLYFEYP------------------- 415 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV----ENNQLGLYKL-TDLQQKDNLA 516 VR D+ V T N L+ L TD + ++A Sbjct: 416 ----------------EKGGQIAVRMGDWKGVKTDLRKNPGNPWQLFNLKTDRSESTDVA 459 Query: 517 AANPQVVKEMQGVVREFIDS 536 A++P ++K++ +V+ + Sbjct: 460 ASHPDILKKLDQIVKREHEE 479 >UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organisms RepID=Q1YSH0_9GAMM Length = 557 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 140/535 (26%), Positives = 211/535 (39%), Gaps = 102/535 (19%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 + K PNII++ DD+G+ + G Sbjct: 54 SAETTPAKRPPNIILILTDDMGFNDISLYNG----------------------GAADGSL 91 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-------- 159 TP + + ++G+RF NGY A+ V SRA+++TGR RFGV + G Sbjct: 92 QTPNIDRIAEQGIRFNNGYAANAVCTSSRASLLTGRYSTRFGVEYTPIYKTGVRIFNWME 151 Query: 160 --------------------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVP 193 +P E + E+ Q YYTA +GKWHL ++ Sbjct: 152 ELNPSTPPVLVDMDLAATLPPIDALGMPAAEITIGEVLQQQDYYTAHIGKWHLGSNGDM- 210 Query: 194 VPEDKQTRDYHDNFTTFSAEEWQPQ----NRGFDYFMGFHAAGTAYYNSPSLFKNRERVP 249 PE + D F P D A +Y + Sbjct: 211 RPEQQGFDDSLSMKGIFYLPPDHPDVVNAKIPGDSIDSMVWAVGSY---EVQWNGGPPFE 267 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD 309 KGY++D TD A+ V++ + +PF LYLA+ PH P D Y + Sbjct: 268 PKGYLTDYFTDAAVDVIEANRH--RPFFLYLAHWGPHNPVQASRED-YDALPHIKDHRLR 324 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKGYKSQTY 368 Y A + ++D+ V++I L++NG DNT+I+FTSDNG L LN +G+K + Sbjct: 325 TYAAMLRALDRSVEKIEASLQENGLSDNTLIIFTSDNGGAGYLDLTDLNKPYRGWKLTHF 384 Query: 369 PGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 GGTH P W +++ G D+ I +D + T AA S+P D LDGV+LLP++Q Sbjct: 385 EGGTHVPYMAKWPAQIEAGQSSDEAIHHIDMFHTIAAAAGASVPTDRTLDGVNLLPFMQG 444 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 K+ G PHK L W T + W K +R + D P Sbjct: 445 KQTGAPHKTLFWHTGH-------QQTVWHQGWKMIRAEQSDKPGA--------------- 482 Query: 488 NDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 + + L+ L D +++NL A P+ E+ ++ PL Sbjct: 483 ----------DPMVFLFDLNNDPTEQNNLIAEQPEKAAELTALLDTHHAQQAKPL 527 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 410 bits (1053), Expect = e-113, Method: Composition-based stats. Identities = 136/511 (26%), Positives = 205/511 (40%), Gaps = 98/511 (19%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 +A + T PNI+ + +DDLGYG L Sbjct: 26 LAAATGTCYAQERPDAPNILCILVDDLGYGDLSCQG------------------------ 61 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV----YSNTDAQ 157 A +P + +L G+RFTN Y V PSRAA++TGR P GV N + Sbjct: 62 --ATDLQSPNIDALAANGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVIRQNPENN 119 Query: 158 DG-IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 G + +P GY+T +GKWHL E Sbjct: 120 WGNLADDAVLIPSELNPAGYHTGIIGKWHLGL-----------------------EEPDT 156 Query: 217 PQNRGFDYFMGFHAA-GTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 P +RGF YF GF Y++ + NRE + KG+ +D TD I + + Sbjct: 157 PNDRGFTYFKGFLGDMMDDYWDHRRGGINWMRLNREEIDPKGHATDLFTDWTIDFLKERQ 216 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 +QPF LYLAYNAPH P P D+ +++ ++ A V +D V R++E Sbjct: 217 GEEQPFFLYLAYNAPHFPIQPPREWLDKVREREPNLTEKRAKNVAFVEHLDYSVGRVMEA 276 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 LK G +NT+++F SDNG + NG +G K Y GG P +WKGK+ PG Sbjct: 277 LKTTGLEENTLVVFVSDNGGAL-WYAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGT 335 Query: 389 Y-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 D MD +PT + A P++ +DG+SL+P L + Q ++ L W Sbjct: 336 TSDNTALLMDLFPTFCELAGRKPPEN--VDGISLVPTLTGQAQDTANRYLYW-------- 385 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL- 506 VR + DY Q Y R D+ ++ + + + Sbjct: 386 --------------VRREGGDYG--------GQAYYAARFGDFKILQNTPFEPIQFFNIG 423 Query: 507 TDLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 D + L + + + ++ + E I ++ Sbjct: 424 QDELETTPL-ETDSEAYRALRAQLMEHIRTA 453 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 121/524 (23%), Positives = 197/524 (37%), Gaps = 110/524 (20%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 ++ +PNI+++ DD G + ++ T Sbjct: 20 SQLQAAERPNILIIFTDDQGINDVGCY---------------------------GSEIPT 52 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-----GVY---SNTDAQDGIP 161 P + L EG+ F Y A + PSR I+TGR P R G S+ D GI Sbjct: 53 PHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNPTRSQDQLLGALMFMSDIDQNRGIQ 112 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 ET + ++ Q +GY TA +GKWHL E + P G Sbjct: 113 PGETTIADVLQQNGYQTALLGKWHLGH----------------------GTESFLPTAHG 150 Query: 222 FDYFMGFHAAGTAYY-----NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 FD F G Y+ N P + N+ V GY +D +T+EA + +T D+PF Sbjct: 151 FDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSENGYATDLITEEAEHFLKDQQTTDKPF 210 Query: 277 MLYLAYNAPH-----LPNDNPAPDQYQKQFNTGSQ-------TADNYYASVYSVDQGVKR 324 L+L+YNAPH P D + Q + + + + A S+D G+ R Sbjct: 211 FLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTIKDKVRREFAAMTVSLDDGIGR 270 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 ++ LK NG NT+++F +D+G N +G K+ + GG P + W GK+ Sbjct: 271 VMSSLKNNGLDQNTLVIFMTDHGGDYVYG-GNNQPFRGAKATLFEGGIRVPCIIRWPGKI 329 Query: 385 QPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 + G +++ A+D +PT A++ L LDG + L ++ + L W Sbjct: 330 KAGTETNEVAWALDLFPTICHFANVDT-DGLTLDGKDISGLLT-RQTPVGTRELYWQLG- 386 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 P+ E +R D+ + + L Sbjct: 387 -----------------------------PHAELKRGRWSALRQGDWKYIQDAGGEEF-L 416 Query: 504 YKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + L D +K NL + + E+Q + + P + + Sbjct: 417 FDLKADPYEKQNLTQSQSTKLTELQERRDTLVKTLTPQVKSIAP 460 >UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLE2_9BACT Length = 441 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 128/526 (24%), Positives = 202/526 (38%), Gaps = 114/526 (21%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 FS T PNII++ DD G Sbjct: 6 CLFSLLC-TSLLANEPPNIIIILADDAGSSDFSCYGS----------------------- 41 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--- 158 Q TP + S+ G++FT Y A V PSRA ++TGR FG +N Sbjct: 42 ---KQLLTPHIDSIAHNGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSA 98 Query: 159 ------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 G+P+TE L + + GY T +GKWHL + A Sbjct: 99 NDPELLGLPVTEITLADSLKELGYSTHCIGKWHLGE-----------------------A 135 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERV--------PAKGYISDQLTDEAIG 264 + + P RGFD F GF + Y+ L + +R+ P+ GY ++ T EAI Sbjct: 136 DHFHPNARGFDNFYGFLSGARTYFLGGELRGDMDRIMRNKEFAEPSSGYTTEVFTQEAIR 195 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 ++ D+PF +YL++NA H P D A D+ ++ + Y + ++D Sbjct: 196 IIQE--EQDKPFFIYLSHNAVHGPMD--AKDEDIMSYDFKNPLRKKYSGLMKNLDDQTGL 251 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 +L+ LK + QY+NT+I F SDNG N +G+K + GG TP + W K+ Sbjct: 252 LLQALKDSKQYENTLIFFMSDNGGPTTHNGSSNWPLRGFKGSEFEGGNRTPFLLQWPEKI 311 Query: 385 QPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G + DK I A D + T + AA + D G+ LLP + +K Q + L W Sbjct: 312 SAGLSSDKPIIAYDVFATCIQAAGGELVTDRTYHGIDLLPVI-NKPQETNARKLFWSRG- 369 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 +Y++R + L + L Sbjct: 370 -------------------------------------KNYSMRQGKWKL--NILPTGSSL 390 Query: 504 YKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 Y L D +K +L+ P++ ++ + ++ + L + +K Sbjct: 391 YNLENDQSEKHDLSEQFPEIKAQLIKEMSKWKSTHAEALWQTGYKK 436 >UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C176_9PLAN Length = 599 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 130/525 (24%), Positives = 197/525 (37%), Gaps = 108/525 (20%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 P + GKPNII++ DD GYG + Sbjct: 21 CPADTPDSGKPNIILVITDDQGYGDIAAHG--------------------------NQMI 54 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFL 167 TP L L + +R TN +V P+R+A+MTGR R GV+ + + E L Sbjct: 55 KTPNLDQLYQKSLRLTNFHV-DPTCAPTRSALMTGRYSTRTGVWHTIMGRSLMDTNEVTL 113 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 E+F+++GY T GKWHL N P+ Q + P + DYF Sbjct: 114 AEVFKSNGYRTGLFGKWHLGD--NYPLRPQDQGFGTVVQHGGGGVGQ-TPDDWQNDYF-- 168 Query: 228 FHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 S + +N + +GY +D DEA+ ++ +T +PF YL+ NAPH Sbjct: 169 ----------SDTYLRNGKPEKFQGYCTDIWFDEALKFIEADRT--KPFFAYLSTNAPHS 216 Query: 288 PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 P + + +Y + ++D+ + R+L LK++G NTI++F +DNG Sbjct: 217 PYLVDPEYSDPYEDKGVPKKMAAFYGMITNIDENMGRLLRYLKESGLEKNTILIFMTDNG 276 Query: 348 AVIDGPLP--------------------------LNGAQKGYKSQTYPGGTHTPMFMWWK 381 P N +G K Y GG P ++ W Sbjct: 277 TAAGLQRPSTEDLSKKQQRRLSKGKPITLETWPGFNARMRGTKGSEYDGGHRVPCYIHWP 336 Query: 382 --GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G N ++L + +D PT D D++I +LKLDG SL+P L K ++ L Sbjct: 337 QGGLTGGKNINQLTAHIDILPTLADLCDLTISSELKLDGTSLVPILTGNKDALRNRTLI- 395 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 H E+ W +V + LV N Sbjct: 396 ----VHSQRIESPEKWR-------------------------KSSVMAERWRLV-----N 421 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + LY + D Q N+AA VVK + ++ S P S Sbjct: 422 EKELYDIQNDPGQTKNVAAEYAGVVKYLSAEYEKWWSSLTPVFSR 466 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 408 bits (1050), Expect = e-112, Method: Composition-based stats. Identities = 137/564 (24%), Positives = 218/564 (38%), Gaps = 146/564 (25%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 +A +PN+IV DDLGYG+L G + K Sbjct: 2 LAAVVTVAASLQASARPNVIVFLADDLGYGEL----GCYGQK------------------ 39 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--- 158 + TP L L +G+RFT+ Y H V PSR ++TG+ V N++ + Sbjct: 40 ----KIKTPNLDQLAADGMRFTDFYSGHAVCAPSRCVMLTGKHTGHSFVRENSEGRAAQA 95 Query: 159 ----------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 +P +E Q GY TA VGKW L SN Sbjct: 96 KERNRIKAADGYLPQIALPASEATYASALQKSGYRTACVGKWGLGHPSN----------- 144 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPAKG--------Y 253 E P GFD F G+ + A+Y P L++N + P +G Y Sbjct: 145 -----------EGSPNKHGFDLFYGYISQWQAHYYYPTYLWRNDVKEPLEGNDGKVGRQY 193 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD----QYQ----------- 298 +D + EA+ ++ T PF LY A PH+ P + +Y+ Sbjct: 194 AADLMEQEALKFME--TTGGGPFFLYYATPVPHVSLQVPPDEPSLAEYKQAFAGQDPPYD 251 Query: 299 --KQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--- 353 K + Y A V +D+ + + + LK+ GQ NT+I+FTSDNGA +G Sbjct: 252 GRKSYLPTEDPRAIYAAMVTRMDRTLGKFRDLLKRTGQDQNTLIIFTSDNGATFNGGYDR 311 Query: 354 --LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISI 410 N +G K+Q + GG TP W G +QPG + + A D +PT + + Sbjct: 312 EFFGGNQPLRGMKTQLWDGGIRTPFIAAWPGSIQPGQVSRFVGASWDLFPTFAEIVGFPV 371 Query: 411 PKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY 469 P LDGVS+LP L+ + + H +L W Sbjct: 372 PAG--LDGVSILPTLKGEVATQKQHDHLYW------------------------------ 399 Query: 470 PHNPNTEDLSQFSYTVRNNDYSLVY----TVENNQLGLYKL-TDLQQKDNLAAANPQVVK 524 E ++ VR + + + + L+ L TD+ + ++AA +P +V Sbjct: 400 ------ETVAGGHQAVRMGPWKGIRLGVIKNPSAPVQLFNLETDVSETTDVAAQHPDIVA 453 Query: 525 EMQGVVRE-FIDSSQPPLSEVNQE 547 ++ ++ + S++ P+ E+++ Sbjct: 454 KIATIMSAGRVPSAEFPMGELDRP 477 >UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTW6_9SPHI Length = 486 Score = 408 bits (1050), Expect = e-112, Method: Composition-based stats. Identities = 116/526 (22%), Positives = 199/526 (37%), Gaps = 100/526 (19%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 + PN+++ MDDLGYG L A +TP L Sbjct: 33 APATPPNVVLFFMDDLGYGDLSVTG--------------------------ALDYTTPNL 66 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPEL 170 + EG RFTN A V SRAA++TG P R G+Y ++ G+ E L EL Sbjct: 67 DKMAAEGTRFTNFLAAQAVCSASRAALLTGCYPNRLGLYGALGPNSPIGLNPNEETLAEL 126 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG--- 227 + GY T GKWHL +++ P +GFD + G Sbjct: 127 LKERGYATGMFGKWHLGD-----------------------NKQFLPMQQGFDEYYGVPY 163 Query: 228 -------FHAAGTAYYNSPSLFKNRERVPA------KGYISDQLTDEAIGVVDRAKTLDQ 274 A A Y E P G I+ +T++A+ + K + Sbjct: 164 SHDMWPLHPAQAQAKYPPLRWIDGNEPGPEIKDLNDAGKITGTITEKAVSFIRNHK--KK 221 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 PF LY+ + PH+P A +++ Q + + +D V +I+ +LK+ G Sbjct: 222 PFFLYVPHPLPHVPLATSA--RFKGQ-----SARGIFGDVLTELDWSVGQIMNELKQQGL 274 Query: 335 YDNTIILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DK 391 NT+++F SDNG ++ +G + K ++ GG P + W G + G +K Sbjct: 275 DKNTLVIFISDNGPWLNYGDHAGSSGGFREGKGTSFEGGHRVPCLVRWPGVVPAGRVSNK 334 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 L++A+D PT + +PK +DGV + L+ P + + Sbjct: 335 LLTALDILPTVANVCGARLPKQR-IDGVDWVALLKGDNSVTPRDKFYYYYRKNSLEAVRQ 393 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 + + R P S ++ + LY L D Sbjct: 394 GDWKLVFAHPGRTYEGFLPGQGGKPGPSTETHAIAAG--------------LYDLRRDPG 439 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN-NIKKA 555 ++ ++ +P+VV ++ + + ++ L + Q++ N+++ Sbjct: 440 ERYDVREQHPEVVARLETI----AEEARADLGDELQKRTGANVREP 481 >UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI1_9BACT Length = 472 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 137/515 (26%), Positives = 208/515 (40%), Gaps = 115/515 (22%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 + KPNII + DDLGYG++ ++ TP L Sbjct: 17 AQMKPNIIYILCDDLGYGEVGYNG--------------------------QKMIQTPELD 50 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----TDAQDGIPLTETFLPE 169 L +G+RFT+ Y + V PSRA+++TG+ P + +N D Q IP L + Sbjct: 51 KLASKGMRFTDHYCGNAVCAPSRASLITGKHPGHAFIRANSPGYPDGQTPIPADSETLGK 110 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 L + GY TA +GKW L N P +GFD+F G+ Sbjct: 111 LMKRAGYATACIGKWGLGGFHNAG----------------------NPHKQGFDHFYGYT 148 Query: 230 AAGTAYYNSP-SLFKNRER-------VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A+ P L++N E+ Y D +T +A+ ++ K DQPF LYLA Sbjct: 149 DQRKAHNYYPEYLWRNGEKEMLNNKNGEENDYSHDLMTVDALKYIEEKK--DQPFFLYLA 206 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 Y PH+ P QY+ + + + A +D+ + I +L++ G DNT+I+ Sbjct: 207 YLIPHVKYQVPDLAQYKDK--DWPKEMKIHAAMTSRMDRDIGTIARRLEELGIADNTLIM 264 Query: 342 FTSDNGAVIDGP----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM- 396 F SDNGA +G KG K Y GG +PM +W G +Q G+ ISA Sbjct: 265 FNSDNGAHGKSNSEKFFNTSGDLKGLKRSMYDGGVRSPMIAYWPGTIQAGSVSDHISAFW 324 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFW 455 D PT + P + DG+S+LP L K + + HK L W Sbjct: 325 DMMPTFSELTG--EPFKGETDGISMLPTLLGKDSEQKQHKYLYW---------------- 366 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN--QLGLYKLT-DLQQK 512 + ++ + +R + V + LY ++ D + Sbjct: 367 ------------------ELYESNKPNCAIRFGKWKGVVLDRRKGLNIELYDMSGDQSES 408 Query: 513 DNLAAANPQVVKEMQGVVRE------FIDSSQPPL 541 NLAA P+VV E++ ++ E + D PL Sbjct: 409 KNLAAQYPEVVDEIRKMMVEAHVKSPYWDKDFKPL 443 >UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA Length = 489 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 119/519 (22%), Positives = 202/519 (38%), Gaps = 81/519 (15%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 T KPN+IV+ DD GY L Sbjct: 37 AAESTDTTEKPNVIVIFTDDQGYNDLGCYGS--------------------------PNI 70 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTE 164 TP L L EG R+T+ Y A V PSRAA++TG P R G++ + + G+ E Sbjct: 71 KTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHVLFPQSTYGLHPDE 130 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF-- 222 + + ++ GY TA VGKWHL +P Y+ +S + P N+ Sbjct: 131 VTIADHLKSAGYATACVGKWHLGHHKET-LPTSNGFDSYYG--IPYSNDMNHPDNKRLGK 187 Query: 223 ---DYFMGFHAAGTAYYNSPSLFKNRERVP---AKGYISDQLTDEAIGVVDRAKTLDQPF 276 D ++ +N+P L ++ E + + ++ + TD AI V+ + D+PF Sbjct: 188 MSSDDRWTDQSSAVTLWNTP-LVQDEEIIELPVDQRTVTRRYTDRAIEFVEANQ--DKPF 244 Query: 277 MLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYD 336 LYL ++ PH+P P D Y + Y + +D V R+++ ++ G + Sbjct: 245 FLYLPHSMPHIPLYVP-EDVY------DPDPQNAYKCVIEHIDTEVGRLVQTVRDLGLSE 297 Query: 337 NTIILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLI 393 T+I++TSDNG + G + K T+ GG P MW G++ G + Sbjct: 298 KTLIVYTSDNGPWLQFKNHGGSAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFA 357 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 + MD PT +++ D K+DG+ L + ++ S + + Sbjct: 358 TNMDLLPTIASFTGVALENDRKIDGIDLTSTFTSDESA---RDEFVFYSAHGVLEGIRMG 414 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQK 512 W + R + P ++ L+ L+ D+ +K Sbjct: 415 DWKYLRQVARRGPNAKGPKPEP------------------------KVFLFDLSQDIGEK 450 Query: 513 DNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 +NL P+ V++M + E + V ++K N+ Sbjct: 451 NNLVEQQPERVQKMHARMEELNEEITANARPVWRKKVNS 489 >UniRef50_A4AVA7 Aryl-sulphate sulphohydrolase n=2 Tax=Bacteroidetes RepID=A4AVA7_9FLAO Length = 487 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 134/516 (25%), Positives = 218/516 (42%), Gaps = 96/516 (18%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPNI+++ +DDLGY + F + TP + L Sbjct: 46 RKPNIVLINIDDLGYKDVGFMGSEY--------------------------YETPNIDIL 79 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-------IPLTETF-- 166 G+ FTNGY A PSRA++MTG+ R G+Y+ ++ G IP T T Sbjct: 80 AKAGMIFTNGYAAASNCAPSRASLMTGKWTPRHGIYTVNSSERGKSKDRKIIPSTNTSTL 139 Query: 167 ------LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 LPE+ Q + Y T GKWHLS+ P + Sbjct: 140 SKESMVLPEVLQLNNYKTIHAGKWHLSE---------------------------SPLDY 172 Query: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 GFD +G G P + R P K Y++D + + I V+ KT++ PF L Sbjct: 173 GFDINIGGGHNGHPKSYYPPYGNVKLRSPNKEYLTDLIARQTIEVL--NKTIE-PFFLNY 229 Query: 281 AYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 A A H P +Y ++ Q Y V ++D+ + ++ LK NG Y NT Sbjct: 230 APYAVHTPIQPVDSILSKYNRKTAWKGQNNAKYATMVENLDRNIGLLIAALKDNGHYKNT 289 Query: 339 IILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMD 397 +I+FTSDNG + + + K Y GG P F W K++ + IS +D Sbjct: 290 LIIFTSDNGGLY--GITKQQPLRAGKGSYYEGGIREPFFFMWNDKIKSNTKSNVPISHLD 347 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 +P+ ++AA IS + LDG SLLP L+ + + L W + Sbjct: 348 LFPSIVEAAGISY-NETSLDGNSLLPILKQESTKLK-RPLFW-----------------H 388 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLA 516 + ++ + + N ++ ++ +R D+ L Y ENN++ LY LT D+ +++NL Sbjct: 389 FPIYLEAYNQNDNENRDSLFRTRPGSVIREGDWKLHYYFENNEMELYNLTYDVGERNNLI 448 Query: 517 AANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNI 552 +P+ K + ++ + + P+ E ++ ++ Sbjct: 449 NTHPKKAKVLLQQLKAWWKETSAPIPEQLNPEYASL 484 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 128/542 (23%), Positives = 203/542 (37%), Gaps = 127/542 (23%) Query: 47 FTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQ 106 S KPNII + DDLGYG L G + K Sbjct: 9 LFALNLSAADKPNIIYILADDLGYGDL----GCYGQKV---------------------- 42 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETF 166 TP L + G++FT Y V GPSR+ ++ G+ V N Q + Sbjct: 43 IQTPHLDKMAANGMKFTQHYSGSTVCGPSRSCLLEGKHSGNTYVRGNGMLQMRQDPHDLI 102 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 P+ Q GY+TA +GK + ++ + P +GFDYF Sbjct: 103 FPKALQKAGYHTAMIGKSGMGCNTD---------------------DAALPYQKGFDYFF 141 Query: 227 GFHAAGTAYYNSP-SLFKNRERV-----------PAKGYISDQLTDEAIGVVDRAKTLDQ 274 GF + A++ P L+KN +V Y S+ + +EA+ V+R K D Sbjct: 142 GFTSHTQAHWFFPTHLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYVERQK--DG 199 Query: 275 PFMLYLAYNAPHLPNDN-----------------PAPDQYQKQFNTGSQTADNYYASVYS 317 PF L+LA+ PH P D++ ++ + + A V Sbjct: 200 PFFLHLAFQIPHASLRAKEEWKAKYRPILKEKLLPKKDKHP-HYSYEREPKTTFAAMVSY 258 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-----LNGAQKGYKSQTYPGGT 372 +D V + ++L+ G +NT+I+F SDNGA+ +G NG +G K Y GG Sbjct: 259 MDHNVGLLNKKLEDLGLAENTLIMFASDNGAMQEGGHKRDSFDSNGVLRGGKRDMYEGGV 318 Query: 373 HTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 TPM +W GK++ G ISA D PT + A + +D DG+S +P L K Sbjct: 319 RTPMIAYWPGKIKAGQTSDHISAFWDISPTVRELAGAKVQEDT--DGISFVPTLLGKGSQ 376 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 H L W +R + Sbjct: 377 TKHDYLYWEFFEQG-----------------------------------GKRAIRMGKWK 401 Query: 492 LVYTVENN----QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 L+ N ++ L+ L D+ ++ +L+ P+ V + ++ + ++ P + Sbjct: 402 LILYKTNTDLNPKMELFDLEADISEQKDLSKQLPEKVSALLKLMDKAHTPAENPTFKFAS 461 Query: 547 EK 548 E+ Sbjct: 462 ER 463 >UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKM2_9BACT Length = 472 Score = 407 bits (1047), Expect = e-112, Method: Composition-based stats. Identities = 125/550 (22%), Positives = 210/550 (38%), Gaps = 116/550 (21%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 K S T + + KPNII++ DDLG G + + Sbjct: 1 MKIYFILSCLCFTLFGAQ-KPNIILILADDLG----GAGLGCYGNEFFG----------- 44 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-- 155 TP + +L + +RF N Y V PSRA +M+G+ R + + Sbjct: 45 -----------TPNIDALAAKSMRFDNAYSGSTVCAPSRACLMSGQYVGRHKITWVSQFQ 93 Query: 156 ---------------------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 +P L + F++ GY TA GKWHL Sbjct: 94 RDYIKKKRGPNLNGFRLLQPVHPYHMPEGTITLGQAFKDAGYATAMFGKWHLGH------ 147 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYI 254 + QP GFD ++ F + +P N+ + K Y+ Sbjct: 148 -----------------RPQDQPDKMGFDEYLTFQGMK---HFAPYTLPNKVQHGEKVYL 187 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP-APDQYQKQFNTGSQTADNYYA 313 +D D+AI ++R ++PF LY H P + A QY ++ G A Sbjct: 188 TDLTCDKAIDFMERKVAAEKPFFLYYPDFLVHAPMEAKQAMIQYFEKKTIGQHHKSVIGA 247 Query: 314 SVY-SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV---IDGPLP----LNGAQKGYKS 365 ++ +D V R+++++ + G +NTII+FTSDNG + DG N + KS Sbjct: 248 AMTKHLDDTVGRLVKKVDELGIAENTIIIFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKS 307 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 Y GG+ P+ W G + + +++S +D YPT L A ++ P++ LDG+ Sbjct: 308 SHYEGGSRVPLIFHWPGVTEANSLSHEVVSGIDIYPTLLKIAQVAKPQEQILDGIDFSSI 367 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L++ KQ P ++L Y N + S + Sbjct: 368 LKNPKQKLPARDLF-----------------------------HYQPIYNHKVFGDASVS 398 Query: 485 VRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 +R D +Y L+ L D+ QK +L+A P++ +E++ + +D + Sbjct: 399 LRRGDMKYIYYFVEENFELFNLKDDVSQKKDLSADYPELCEELKKACFKHLDETDALRMT 458 Query: 544 VNQEKFNNIK 553 +N + +K Sbjct: 459 LNPDYDPKLK 468 >UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica RepID=Q7UYW3_RHOBA Length = 520 Score = 407 bits (1046), Expect = e-112, Method: Composition-based stats. Identities = 134/539 (24%), Positives = 208/539 (38%), Gaps = 130/539 (24%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 + + PNI+V+ DD+GYG + + + Sbjct: 47 ASRAAESTPPNIVVILADDMGYGDMGC--------------------------MGSQTLQ 80 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---------- 158 TP L L + GV + YVA V PSRA ++T R P RFG N +A D Sbjct: 81 TPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELL 140 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 G+P +E L + GY TA +GKWHL E P Sbjct: 141 GLPTSEKTLADHLGAAGYATALIGKWHLGM-----------------------GEMHHPN 177 Query: 219 NRGFDYFMGFHAAGTAYYNSPS---LFKNRERVPA--KGYISDQLTDEAIGVVDRAK--T 271 RGFD+F G Y+ + + +N +RV Y++D TDE + +D+ K Sbjct: 178 RRGFDHFCGMLTGSHHYFPATMKHVIERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSAN 237 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 DQP+ ++ +YNAPH P D + N +Q Y A +Y++D+GV RI E L++ Sbjct: 238 PDQPWFVFFSYNAPHTPMHATEADL-ARFANIQNQKRRTYAAMMYALDRGVGRIREHLEE 296 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YD 390 GQ++NT+++F SDNG + NG +G K GG PM W K G YD Sbjct: 297 TGQWENTLLVFFSDNGGATNNG-SWNGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLYD 355 Query: 391 KLISAMDFYPTALDAADISI-----------PKDLKL--------DGVSLLPWLQDKKQG 431 ++S++D PT AA + K DG+ + P L D + Sbjct: 356 GVVSSLDLLPTFCSAAGAEPLALADPMSHEDASNRKRMNRLSGTHDGIDMAPHLADGSEP 415 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 P++ L W Q + + Sbjct: 416 -PNRRLYWRL--------------------------------------QGQAAILDGTDK 436 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKF 549 L+ L+++ TD+ + +L+A NP +E+ + + +S + + Sbjct: 437 LLRPSHR-PAELFEVSTDVSESHDLSAQNPSRFRELYDELGAW-ESMLTTVPLWGSSPY 493 >UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacteria RepID=A6DR29_9BACT Length = 510 Score = 406 bits (1044), Expect = e-111, Method: Composition-based stats. Identities = 124/536 (23%), Positives = 206/536 (38%), Gaps = 99/536 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +K K+ + + F+P + KPN+I++ DDLG+G F+ Sbjct: 1 MKTKSLLIAASAALFSPFISAESAKPNVILIMADDLGWGDTGFNGS-------------- 46 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L + EG++ Y A V P+RA+++TGR P R GV Sbjct: 47 ------------KVIKTPHLDQMAAEGLQLDRFYSASSVCSPTRASVLTGRNPYRTGV-- 92 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT------------ 200 T Q + E LPE+ GY T GKWHL +++ ++ Sbjct: 93 PTANQGFLRPEEITLPEVLNEQGYATGHFGKWHLGTLTHTEKDANRGKPGNTKEFNPPKL 152 Query: 201 RDYHDNFTTFSA-EEWQPQ------NRGFDYFMGFHAAGTAYYNSPS--LFKNRERVPA- 250 Y D F T S + P ++G +G+ + P + + E Sbjct: 153 HGYEDAFVTESKVPTYDPMILPAKFDQGESKHLGWEYVKEGEESKPYGTFYWDIEGKKIT 212 Query: 251 ---KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 KG S + D + +D+A ++PF+ + ++ PHLP A ++Q+ + Sbjct: 213 DNLKGDDSRVIMDRVLPFIDQAVADEKPFLSVVWFHTPHLP--CVAGPRHQEMYKGHPIH 270 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL--PLNGAQKGYKS 365 NY V ++D+ + R+ + L G DNT+I F SDNG G +G K Sbjct: 271 LRNYAGCVTAMDEQIGRLRKHLADKGVADNTMIWFCSDNGPESKERPDNGSAGHFRGRKR 330 Query: 366 QTYPGGTHTPMFMWWKGKLQ-PGNYDKLISAMDFYPTALDAADISIPK-DLKLDGVSLLP 423 Y GG P M W K++ D+ PT LDA I P+ DG SL+P Sbjct: 331 DLYEGGVRVPAVMVWPAKVKEARKISAPCITSDYMPTILDALHIPHPQASYATDGRSLMP 390 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 + ++ + +S W Sbjct: 391 IINNEDFTRDKEIGIMFSSRIVW------------------------------------- 413 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 D+ L+ + LY L +D +K ++AA NP++V++++ + + +S + Sbjct: 414 --HKGDFKLLSYNGGKKYELYNLKSDPSEKTDVAAQNPELVEKLKKDMLAWHESVK 467 >UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7US96_RHOBA Length = 498 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 120/552 (21%), Positives = 203/552 (36%), Gaps = 117/552 (21%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 ++ +A S G+PNI+++ +DDLG+ + F Sbjct: 8 RVSPIAIAIAMIFCCSPAQSRAGQPNILLIFIDDLGWKDIGCYGNDF------------- 54 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP + L EG+RFTN Y + V P+R A+ +G+ AR G+ ++ Sbjct: 55 -------------VETPRIDQLAAEGLRFTNFYASGAVCSPTRCALQSGQNQARIGITAH 101 Query: 154 TDAQD-------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 +PL + E + GY T VGKWHL Sbjct: 102 IPGHWRPFERVITPQTTMALPLDTVTIAESLKASGYTTGYVGKWHLG------------- 148 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTD 260 + E+QP +G+D+ + + P + Y +D D Sbjct: 149 ----------NGPEFQPDRQGYDFSAVIGGPHLPGRYRVQGRSDLKPKPNQ-YRTDFEAD 197 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ-----TADNYYASV 315 I + + K DQPF L L+ A H+P ++ QK Q Y A + Sbjct: 198 LCIDFMRQNK--DQPFFLMLSPFAVHIPL-AAMSEKVQKYEAMAKQTGNSLPHPVYAAMI 254 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA---------VIDGPLPLNGAQKGYKSQ 366 D V R+++ L++ D+T+I+FTSDNG D + KG K Sbjct: 255 EHCDDMVGRLVDSLEQLDIADDTMIVFTSDNGGLYKRYDYRESADDLVSSQAPLKGEKGS 314 Query: 367 TYPGGTHTPMFMWWKGKLQ-PGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 + GG P+ + ++ G D+ + DFYPT ++ A +P + +DG SLLP + Sbjct: 315 LHEGGIRVPLIIRHPATVKSAGVCDEPTISHDFYPTFVEMAGGELPINQTIDGHSLLPLM 374 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 Q L W YPH + + + + Sbjct: 375 TAPTQTLDRDALHW----------------------------HYPHYHH----DRPASAI 402 Query: 486 RNNDYSLVYTVE-NNQLGLYKLTD-LQQKDNLAAANPQVVKEMQGVVREFIDS--SQPPL 541 R D+ L+ ++ + LY L D L + NLA+ +++ + + S ++ P+ Sbjct: 403 RERDWKLIEYLDGTGDVELYNLADDLGETKNLASEKQGRAGDLKRKLTTWRSSVLARTPI 462 Query: 542 SEVNQEKFNNIK 553 + + + Sbjct: 463 PNPSYDPERAHE 474 >UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO Length = 492 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 134/517 (25%), Positives = 207/517 (40%), Gaps = 78/517 (15%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 T+T+ S+ T KPN I++ DDLGYG L Sbjct: 30 TETSPGDSEGTAAAGGIPEKPNFIIVFADDLGYGDLS----------------------- 66 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---- 153 + T L + EG ++TN YVA V PSRA ++TGR P R G+ SN Sbjct: 67 ---SFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGRLPVRNGLTSNEIGV 123 Query: 154 --TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD----NF 207 D+ +G+P +E L E + GY T VGKWHL +P + DY N Sbjct: 124 FFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLGHKEEY-LPPNHGFDDYFGIPYSND 182 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP---AKGYISDQLTDEAIG 264 F+ + Q+ Y + + T YN P L + E + + I+ + DEA+ Sbjct: 183 MDFTGQFTSYQDYFGRYTERYESLKTEEYNVP-LIRGTEEIERPVNQNTITKRYNDEAVK 241 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 + K D+PF +YLA++ PH+P + G+ Y V +D GV + Sbjct: 242 WIREHK--DEPFFMYLAHSLPHVPL-------FTSDEFRGTSARGLYGDVVEEIDHGVGQ 292 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 I+E L+ G +NTI++FTSDNG G + K T+ GG P W G Sbjct: 293 IMELLEAEGLAENTIVVFTSDNGPWLPTGISGGSAGLLREGKGTTWEGGMREPTIFWAPG 352 Query: 383 KLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 L + S +D + T A + +P D ++DGV L P L + + + + Sbjct: 353 MLPAKVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDLSPILFGDAESPRKEMFYYQGA 412 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG 502 + + ++ HNP L+Y VE Sbjct: 413 DLYAVRLGAYKAHFYTKEAYVMGAERVEHNP-----------------PLLYNVEE---- 451 Query: 503 LYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 D +K +L+ +P+V++E++ VV + Sbjct: 452 -----DPSEKYDLSGKHPEVIEEIRRVVEAHNANMVK 483 >UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLM9_BACSE Length = 463 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. Identities = 125/519 (24%), Positives = 194/519 (37%), Gaps = 122/519 (23%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPNII + DD+GY L + TP + L Sbjct: 33 DKPNIIFILADDMGYCDLSCYGNKY--------------------------IETPNIDRL 66 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--------------TDAQDGIP 161 G FT Y G+S PSR A+MTG+ + N T + + Sbjct: 67 AATGTAFTQCYAGSGISSPSRCALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQ 126 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 +T + + GY T V KWHL + E P NRG Sbjct: 127 PNDTTIATVLGAAGYRTCLVNKWHLDGFN----------------------PEATPLNRG 164 Query: 222 FDYFMGFHAAGTAYYNSPSLF----------KNRERVPAKGYI---SDQLTDEAIGVVDR 268 FD F G+ + TAY N P + +N + +I +D T++AI ++R Sbjct: 165 FDEFYGWLIS-TAYSNDPYYYPYWRFNNEKLENVKENEGDKHIKHNTDLSTEDAIKFINR 223 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 K + PF LYLAY+APH P + Y + Y + + +D+ + R+L + Sbjct: 224 NK--NNPFFLYLAYDAPHEPYNIDETTWYDDE--AWDMNTKRYASLITHMDRAIGRLLAE 279 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPL---PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 L + G +NT+++F SDNGA PL G+ KG K Q Y GG P + GK+ Sbjct: 280 LDRLGLRENTLVIFASDNGAAKQAPLEELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVP 339 Query: 386 PGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 + +I D PT A + KL+G+++LP ++ ++ L W Sbjct: 340 VQKLNNIIYFPDVMPTLAALAGATDKLPQKLNGINILPLFYGQQLDTDNRLLYW------ 393 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK 505 + R D+ +V ++ L LY Sbjct: 394 -------------------------------EFPGKQRAARCGDWKVVTVKKDAPLELYN 422 Query: 506 LT-DLQQKDNLAAANPQVVKEMQGVVRE-FIDSSQPPLS 542 + D+ + NLA P+ V + + ++ I + PL Sbjct: 423 IKEDMTESVNLANKYPEKVAQFEKEMKAMRIPTPNWPLP 461 >UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9LGQ4_9BACT Length = 608 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 136/569 (23%), Positives = 216/569 (37%), Gaps = 129/569 (22%) Query: 20 SGMAAFAAHAADDVKLKATKTNV---AFSDFTPT--EYSTKGKPNIIVLTMDDLGYGQLP 74 +G + ++ + + + A + P+ +PN+IV DD G+G Sbjct: 2 AGPTWLKTESLVNMNFRLLNSAIWLLAICCWFPSFVVAQNDQRPNVIVFLSDDQGWGDFS 61 Query: 75 FDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGP 134 +TP + SL +G+ F N +V V P Sbjct: 62 CTG--------------------------NQSVATPNIDSLATQGLLFENFFV-CPVCSP 94 Query: 135 SRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 +RA +TGR + V + Q+ I L ET + + GY TAA GKWH Sbjct: 95 TRAEFLTGRYHPQSNVKGVSQGQERIDLDETTIADCLSQAGYATAAFGKWHNGMQY---- 150 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYI 254 + P RGFD F GF + Y +P+L N V +GYI Sbjct: 151 -------------------PYHPCGRGFDDFYGFCSGHWGNYFNPTLEHNGRIVKGEGYI 191 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS--------- 305 +D T+ A+ ++ K+ QPF LYL YN PH P PD Y ++F Sbjct: 192 NDDFTNRALKFIEDHKS--QPFFLYLPYNTPHWPPQM--PDAYWQRFAEKEIVQRGQKGD 247 Query: 306 ----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQK 361 + A V ++D V R+L +L + DNTI+++ +DNG + N K Sbjct: 248 KEDLAKTRSALAMVENIDWNVGRVLAKLDELKIADNTIVIYFNDNGPNSN---RWNAGMK 304 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQ--PGNYDKLISAMDFYPTALDAADISIPKDLKLDGV 419 G K T GG +P+F+ W ++ +++ A+D YPT L A + D LDG Sbjct: 305 GKKGSTDEGGVRSPLFVRWPNGVKGAGRRVNQICGAIDLYPTLLAATGSANVGDKILDGK 364 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 +LLP + NL + +S+W Sbjct: 365 NLLPIWDGSET-----NLGFRMLFSYW--------------------------------- 386 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVK-------EMQGVVR 531 + +VR + L +N L+ LTD Q ++++ P V + + Sbjct: 387 RGKASVRTQQFRL-----DNNGWLFDMLTDPHQTKDISSDQPAVAALLLGSLIRFKQEME 441 Query: 532 EFIDSSQPPLSEVNQEKFNNIKKALSEAK 560 +DS++ P S V F + +A+ Sbjct: 442 AEMDSTKRPFS-VGHPDFAYTQLPARDAQ 469 >UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE Length = 493 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 137/569 (24%), Positives = 215/569 (37%), Gaps = 130/569 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTKGK---PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +K +A T T + K PN+I + DDLGY L F Sbjct: 1 MKRLILPIACGICTVTSDAQTDKQPHPNVIFIYADDLGYTDLSCTGSRF----------- 49 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 TP + L EGV FT Y A VS PSRAA++TG+ PAR + Sbjct: 50 ---------------YETPHIDKLAREGVCFTQSYAACPVSSPSRAALLTGKYPARINLT 94 Query: 152 SNTDAQD-----------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 + E + E F+ +GY T GKWHL++ + Sbjct: 95 DYIPGDRAYGPHKNQRLASLPFNLHLSKDEITMAEAFRQNGYSTFMAGKWHLAESA---- 150 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA--YYNSPSLFKNRERVPAKG 252 E+ P+ GFD +G + G Y SP + P Sbjct: 151 -------------------EYYPEQNGFDINIGGNNTGHPSKGYFSPYGNPQLKDGPEGE 191 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD--QYQKQ---------- 300 Y++D+LTDE I + K ++PF +YL+Y HLP A +Y+++ Sbjct: 192 YLTDRLTDEVIRYISEPK--EKPFFVYLSYYTVHLPLQAKAEKIAKYRRKLSRAVPADSS 249 Query: 301 -------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 ++ Q Y A V S+D+ + R+L+ L ++G + TI++FTSDNG + Sbjct: 250 FVKKGETYHKLVQDIPAYAAMVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSN 309 Query: 354 LP-----LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAAD 407 N + K Y GG P + W L+ D I D+YPT LD Sbjct: 310 TTRNIPTSNLPLRAGKGYLYEGGIKVPAIIRWSRHLKGRQVSDTPIIGTDYYPTLLDLCG 369 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 + + +DGVS+ P LQ + P +L W Sbjct: 370 LPLLPGQHVDGVSMKPVLQGGRLSRP--SLFW---------------------------- 399 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEM 526 YPH + S +R DY L+ E++ + LY + D ++ +L+ P++ + Sbjct: 400 HYPHYSGGLG-GRPSAAIREGDYKLIEFFEDHHVELYNVIQDESEEKDLSQIYPEIADGL 458 Query: 527 QGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 + + + + N + +K + Sbjct: 459 RKKLYLWYKEVGARMPVDNPHYVSPVKDS 487 >UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968C90 Length = 525 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 132/531 (24%), Positives = 219/531 (41%), Gaps = 102/531 (19%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V+ ++ TPT+ KPN + + MDD+GY + Sbjct: 63 VSCTEATPTK---SEKPNFVFIYMDDMGYSDVSCYG------------------------ 95 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--TDAQDG 159 + +TP + +L EG++FT+ Y A +S PSRA +TGR PAR G+ D+ G Sbjct: 96 --ETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAGFLTGRYPARMGIQGVFYPDSYTG 153 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + E + E+ + GY TA +GKWHL S E++ P Sbjct: 154 MAPEEVTMAEVLKVQGYATACIGKWHLG-----------------------SREKYLPLQ 190 Query: 220 RGFDYFMGFHAAGTAYYNSPSLFKNRERVPA----KGYISDQLTDEAIGVVDRAKTLDQP 275 +GFD + G + S ++ V ++ + T+EA+ + R DQP Sbjct: 191 QGFDEYFGIPYSNDM---SAQVYLRGNEVEEFHIDINNVTKKYTEEAVDYIRR--KADQP 245 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 F L+LA++ H+P Y G A Y +V VD V RI+E L++ G Sbjct: 246 FFLFLAHSMMHVPI-------YVSDEFAGKSGAGIYGDAVLEVDWSVGRIMETLRELGLD 298 Query: 336 DNTIILFTSDNGAVI-DGPLPLNG-AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI 393 DNT+++FTSDNG + +GPL + K+ + GG P +WKG+++P ++ Sbjct: 299 DNTLVVFTSDNGPWLQEGPLGGRALPLREGKTTAFEGGVRVPCIAYWKGQIKPVVNTDVV 358 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 S +D++PT + +P D++LDG L L + + Sbjct: 359 SLLDWFPTVTALSGGILP-DVRLDGYDLTAVLNGTGKRASEDYAYF-------------- 403 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQK 512 R+ D + +S + ++ N + + + L+ L D+ ++ Sbjct: 404 ---------RNNRDITDYRSGDWKISLPAPGIKGNFWRA--STAEHDTLLFNLREDIGER 452 Query: 513 DNLAAANPQVVKEMQGVVREF---IDSSQPPLSEVNQEKFNNIKKALSEAK 560 NL P KEM ++E+ P L + ++K EAK Sbjct: 453 YNLYRKYPGKAKEMLQKLQEYTRNFGEIPPGLVMTGNDASKYLRKQRQEAK 503 >UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA9_RHOBA Length = 474 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 111/515 (21%), Positives = 189/515 (36%), Gaps = 85/515 (16%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 T + E + PN+I+L DD G+G + F+ Sbjct: 15 TLAVPATQLIAETTDTNSPNVILLMSDDQGWGDVGFNG---------------------- 52 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 TP L ++ GVRF Y A + P+R + +TGR P RFG+ G Sbjct: 53 ----NEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGIL--AAHTGG 106 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKI--------SNVPVPEDKQTRDYHDNFTTFS 211 + + E + E+ Q GY T GKWH+ + P +Y TT + Sbjct: 107 MRVGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEYFA--TTSA 164 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAK--GYISDQLTDEAIGVVDRA 269 W P D+ + G + N G S + D I ++ Sbjct: 165 VPTWDPTITPQDWDSWGNGPGEPWKGGFPYVHNGREAKENLSGDDSRVIMDRVIPFIEAN 224 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 + +PF + ++APH P A ++++K + NYY + ++DQ V R+ +L Sbjct: 225 QA--KPFFATVWFHAPHEP--VVAGEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKL 280 Query: 330 KKNGQYDNTIILFTSDNGAV---IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 ++ G NT++ F SDNG + G KG+K Y GG P W G + Sbjct: 281 RELGIEKNTVVFFCSDNGPSDGLAKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPA 340 Query: 387 GNYDKL-ISAMDFYPTALDAADISI--PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G ++ S +DF PT S+ +DG+ L+P ++ + + Sbjct: 341 GTSTEVRCSTVDFLPTVASIVGDSMVQKATRPIDGIDLMPLIRGEAKDRDRDLFFGYRRL 400 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-YTVENNQLG 502 D ++ + D+ L+ +N +L Sbjct: 401 YQGID---------------------------------GQSIISGDWKLLQEAKKNGRLR 427 Query: 503 LYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 LY L+ D + +L+ P+ ++++ + E S Sbjct: 428 LYDLSKDPFETQDLSEEMPEQTEQLRKQLEELQAS 462 >UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CBC Length = 496 Score = 404 bits (1039), Expect = e-111, Method: Composition-based stats. Identities = 138/532 (25%), Positives = 204/532 (38%), Gaps = 107/532 (20%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 +A + T S KPNI+++ DD GYG + Sbjct: 21 IAKASAQHTTPSHPDKPNIVIILADDQGYGGVNCY------------------------- 55 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-NTDAQDGI 160 + TP + L GV+ GY + +S P+RA +MTG+ FG Y +T GI Sbjct: 56 PHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTGKYQQSFGFYGLSTPHVGGI 115 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P + L E +GY TA +GKWHL P NR Sbjct: 116 PQDQKLLSEYLVENGYNTACIGKWHLGDYIRS-----------------------HPNNR 152 Query: 221 GFDYFMGFHAAGTAYYNS-------------PSLFKNRERVPAKGYISDQLTDEAIGVVD 267 GF F GF YY+ N E V Y + + T A+ + Sbjct: 153 GFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLDNMEPVTEMEYSTYEYTKRAVDFI- 211 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ---TADNYYASVYSVDQGVKR 324 K D PF LYL YNA H P AP++ + Q D A +++DQGV + Sbjct: 212 -QKNADHPFFLYLPYNAIHSPLQ--APEELIGELAINPQEIGKDDIARAMTFALDQGVGK 268 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 ++E L++ G DNTII + SDNGAV +G K Y GG P + + KL Sbjct: 269 VVETLEQLGLRDNTIIFYLSDNGAV---EYSDKWEFRGRKGSYYEGGIRVPFIVSYPAKL 325 Query: 385 QPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G +K + ++D PT ++ A +S + GV+LLP+L K + EPH L W T Sbjct: 326 AKGTIYNKPVMSIDIAPTVMELAGLS---HADMHGVNLLPYLSGKDRTEPHDVLYWSTE- 381 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-VENNQLG 502 + + + + +R + LV Sbjct: 382 ----------------------------KKSNNQVFKNEFAIRQGKWKLVSDPHFEKDYD 413 Query: 503 LYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 LY + D Q+K L P+ KE+ G+ +I+ L+ + ++ Sbjct: 414 LYDIEADPQEKHGLKDQYPEKYKELFGMYLNWINQMPEELANGENARLKGME 465 >UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteroidetes RepID=A4AQQ7_9FLAO Length = 596 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 129/521 (24%), Positives = 205/521 (39%), Gaps = 112/521 (21%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V+ T + + KPN++++ DD G+G L F+ + Sbjct: 21 VSCEKKTKEKNEIQTKPNVVLIMTDDQGWGDLSFNGNT---------------------- 58 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 STP + ++ G F N YV V P+RA ++TG+ AR GVYS + + Sbjct: 59 ----NLSTPNIDAIAKNGASFQNFYV-QPVCSPTRAELLTGKYAARLGVYSTSTGGERFN 113 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 ET + E+F+ GY T A GKWH + P +RG Sbjct: 114 SKETTIAEIFKKAGYKTTAYGKWHSGMQ-----------------------PPYHPNSRG 150 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 FD + GF + Y SP L N E V +G++ D LT++ + + K + PF LYL Sbjct: 151 FDDYYGFTSGHWGNYFSPMLEHNGEIVKGEGFLVDDLTNKGLDFITENK--NNPFFLYLP 208 Query: 282 YNAPHLPNDNPAPD-----------QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 YN PH P P +YQ A V ++D + R+ +LK Sbjct: 209 YNTPHSPMQVPNEYWERFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNMGRLTNKLK 268 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-Y 389 + G +NTII++ SDNG NG +G K T GG +P F+ WK + Sbjct: 269 ELGLEENTIIVYLSDNGP---NGWRWNGGMRGRKGSTDEGGVRSPFFIQWKNTIPKNKKI 325 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 ++ A+D PT A I+ P +DG L + DK +++ +HW Sbjct: 326 SQIAGAIDILPTLTSLAGINQPTIKSIDGKDLKTLIADKNPTWESRHIV-----NHW--- 377 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TD 508 + ++R Y L +N+ LY + D Sbjct: 378 ------------------------------RGKTSIRTQKYRL-----DNENRLYDMQND 402 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKF 549 + Q+ +L++ PQ+ + + ++ + E N+ F Sbjct: 403 IGQRTDLSSELPQLTDSLVNIKNIWLKDAVTVKPE-NKRPF 442 >UniRef50_A6DS95 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DS95_9BACT Length = 491 Score = 402 bits (1034), Expect = e-110, Method: Composition-based stats. Identities = 127/517 (24%), Positives = 197/517 (38%), Gaps = 71/517 (13%) Query: 48 TPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQ 106 T + + K PNII + DD GYG + + Sbjct: 17 TLCSMAKQSKSPNIIFILTDDQGYGDMAVHGHPY-------------------------- 50 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETF 166 TP + L E VRF YV P+RAA+MTG R GV ++ + Sbjct: 51 LETPNMDRLHSESVRFDRFYV-SPSCSPTRAALMTGMHEFRNGVTHTVQPREKLYKGALT 109 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 + ++ + GY T VGKWHL + PQ RGFD + Sbjct: 110 IADILKEGGYKTGFVGKWHLGNDKG-----------------------YAPQYRGFD-WY 145 Query: 227 GFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 +A G + + +N +R KG+ D DEA+ + A +QPF LYL +PH Sbjct: 146 AKNAKGPHNHFDVEMIRNGKRFQTKGFREDAFFDEAMTFMKEA--GEQPFFLYLCTYSPH 203 Query: 287 LPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 P P + + + Y A + ++D + R+ + LKK YD+TI++F +DN Sbjct: 204 TPLGAPEDLLKKYKAKGLNDNHAAYLAMIENIDDNLGRLDQFLKKENLYDDTILIFMNDN 263 Query: 347 GAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAA 406 G V G N +G K + GGT W K QP + L + +D PT + A Sbjct: 264 G-VTVGLDVYNADMRGPKCTIWEGGTRAFSLWRWPKKWQPKTVENLTAHLDVLPTLCELA 322 Query: 407 DISIPKDLK--LDGVSLLPWLQDKKQGEPHKNLT-----WITSYSHWFDEENIPFWDNYH 459 + +P+ ++ L+G SL P L K ++ L W + + Sbjct: 323 GVDVPEKVQGELEGYSLSPLLNGKDWEHNNRLLFHNVGRWPSGTAAAHKNAMCGIRKGNF 382 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ--------LGLYKL-TDLQ 510 V Q + P V YT N Q LY + D Sbjct: 383 LLVHSQGCEDPICEKYPSQCTTLRNVAKGFKHATYTKTNAQFHWGVSEGWQLYDVKKDPS 442 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 ++LA A+P++V E++ ++ D P + + + Sbjct: 443 NLNDLANAHPELVDELKQAYSKWWDKQFPVMVKRGGD 479 >UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF13_PLALI Length = 461 Score = 402 bits (1033), Expect = e-110, Method: Composition-based stats. Identities = 136/524 (25%), Positives = 228/524 (43%), Gaps = 122/524 (23%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + +A + TE +++ +PNI+++ DD G+ + Sbjct: 15 SQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQG---------------------- 52 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA--- 156 + TP + S+ GV F GYV+ V PSRA ++ GR RFG N Sbjct: 53 ----HPRYKTPHIDSIGKNGVHFRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYS 108 Query: 157 -QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 +G+P +ET LP+L + GY T A+GKWHL A ++ Sbjct: 109 ETNGLPRSETLLPQLLKEDGYRTIALGKWHLG-----------------------YAPQF 145 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSP------SLFKNRERVPAK--GYISDQLTDEAIGVVD 267 P RGF + GF +Y+ + ++R +P + GY++D L DEAI + Sbjct: 146 HPMERGFTDYYGFLQGSRSYFPLKKPTRLNQMLRDRTAIPEEQFGYMTDHLADEAIAYIK 205 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILE 327 + ++ QP+M+YLA+NA H PND A D + + YA ++D+ V ++L+ Sbjct: 206 QWQS--QPWMMYLAFNATHSPNDATAVDL-------QAADGNKIYAMTIALDRAVGKVLD 256 Query: 328 QLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 LK+ G +T+++F +DNG NG+ G K T+ GGT P + + K+ G Sbjct: 257 ALKECGLSKDTLVIFINDNGGA---GGHDNGSLHGKKGSTWEGGTRIPFLVQYPAKIPSG 313 Query: 388 NY-DKLISAMDFYPTALDAADI------SIP-KDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 D+ + A+D +PT LD A + IP KLDG+SL+P + K Q + L W Sbjct: 314 QVIDEPVIALDLFPTILDVAGLGDAELKKIPFDPEKLDGISLIPRMTGKTQRLVDRPLYW 373 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY--TVE 497 S + +R + V + Sbjct: 374 --------------------------------------KSGKRWAIRQGNLKAVSGNDDQ 395 Query: 498 NNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 +Q+ L+ L +D ++ NLAA +P +++++ + R++ + + P Sbjct: 396 GDQVELFDLSSDPDEQRNLAATHPDELQQLEALYRKWESTLEKP 439 >UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7AKS6_9PORP Length = 464 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. Identities = 141/543 (25%), Positives = 222/543 (40%), Gaps = 111/543 (20%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 D + + + + + + +PNI++L DD GY F Sbjct: 6 IDRLFVVSLGAITGLASCSSGQDEEAQRPNILILLADDAGYADFGF-------------- 51 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 + A TP + L EG FT+ +VA VS PSR+ ++TGR R+G Sbjct: 52 ------------MGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYG 99 Query: 150 VYSNTD-AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 N D DG+P E LP L + + Y T +GKWHL + Sbjct: 100 YECNLDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGSEPSQ---------------- 143 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS----------LFKNRERVPAKGYISDQL 258 +P +GFD F G A +Y+ P N ++ GY +D+L Sbjct: 144 -------RPNAKGFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFDGYFTDEL 196 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 +A V +QPFMLY+++ APH PN+ D Q Y A +Y++ Sbjct: 197 ASKAQQFVTE---SEQPFMLYMSFTAPHSPNEATEEDL----ARFEGQPRQKYAAMMYAL 249 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 D+GV +I+++LK G++DNTII F SDNG N KG+K + GG P F+ Sbjct: 250 DRGVGKIVDELKAAGKFDNTIIFFLSDNGGSTTNQ-SSNLPLKGFKGNKFEGGQRVPFFV 308 Query: 379 WWKGKLQP-GNYDKLISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEPHKN 436 W + + + L S++D + T +DA DI +DGVSLLP+L +K G PH+ Sbjct: 309 VWGDRFKRDQRFTGLTSSLDIFATVVDALDIPEEGLHKPIDGVSLLPYLSGEKSGNPHEA 368 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W + +R+ Y L+ T Sbjct: 369 LFWRKMDTR--------------------------------------AIRSGSYKLIITR 390 Query: 497 ENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 + + LY + D+++ +L ++ P+ +E+ + E+ + +E + I Sbjct: 391 GVDSV-LYNMDQDVEEMHDLLSSEPEKARELMEQLSEWEQACCKD-PLWIEEGWAEITNG 448 Query: 556 LSE 558 L E Sbjct: 449 LHE 451 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 401 bits (1030), Expect = e-110, Method: Composition-based stats. Identities = 132/544 (24%), Positives = 204/544 (37%), Gaps = 135/544 (24%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + + +PN++ + DDLGYG L + TP Sbjct: 30 KAKEQTRPNVVFILADDLGYGDLSCYG--------------------------QEKFETP 63 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----TDAQDGIPLTETF 166 + L G+RFT Y VS PSR+ ++TG + N + Q +P Sbjct: 64 NIDRLAQNGMRFTQCYSGTTVSAPSRSCLITGTHSGHTAIRGNKELAPEGQFPLPENSQT 123 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 + F+N GY T A GKW L I + P +G D F Sbjct: 124 IFNDFRNAGYRTGAFGKWGLGYIGSAG----------------------DPYKQGIDQFY 161 Query: 227 GFHAAGTAYYNSPS-LFKNRERVP-----------AKGYISDQLTDEAIGVVDR-AKTLD 273 G++ A+ P L+ N +RV Y D + +A+ +D AK D Sbjct: 162 GYNCQLLAHSYYPDHLWDNDKRVDLPDNNLNVQYGKGTYSQDLIHSKALAFLDEAAKEKD 221 Query: 274 QPFMLYLAYNAPHLPNDNPAP---DQYQKQFNTGS--------------------QTADN 310 QPF ++ PH P +++ ++ Sbjct: 222 QPFFMWYPTIIPHAELIVPEDSIIKKFRGKYPEKPYRGVEPGSPAFRKGGYCTQFYPHAT 281 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKS 365 + A VY +D V +I+++LK G YDNTII+F+SDNG ++G NG +GYK Sbjct: 282 FAAMVYRLDVYVGQIVQKLKDMGVYDNTIIIFSSDNGPHMEGGADPDFFNSNGIWRGYKR 341 Query: 366 QTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 Y GG PM + W G +QP D + S D PT + + +DGVS+LP Sbjct: 342 DVYEGGIRVPMIISWPGHVQPSTETDFMCSFWDLMPTFREVLN-PKADTRNMDGVSILPL 400 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 LQ++K + H+ L + Sbjct: 401 LQNRKGQKEHEYLYFEFL-----------------------------------EMNGRQA 425 Query: 485 VRNNDYSLVY-TVENNQL--GLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE-FIDSSQP 539 VR D+ LV+ + N+ LY L +D +K N+ P+ E++ +++E I+ S Sbjct: 426 VRKGDWKLVHMNIRGNKPYYELYNLASDPSEKYNVLNQYPEKADELKAIMKEAHIEDSNW 485 Query: 540 PLSE 543 PL Sbjct: 486 PLFR 489 >UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ23_9FIRM Length = 483 Score = 401 bits (1030), Expect = e-110, Method: Composition-based stats. Identities = 123/517 (23%), Positives = 185/517 (35%), Gaps = 109/517 (21%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPNIIV DD GYG L + TP L L Sbjct: 15 KKPNIIVFLTDDQGYGDLSCMGST--------------------------DVCTPNLDIL 48 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA---QDGIPLTETFLPELFQ 172 G RFT+ Y V PSRA ++TGR P GV S G+ + Sbjct: 49 AAGGARFTDFYAGSAVCSPSRACLLTGRYPYMTGVRSILGGIKTTTGLNPGIPTFASALK 108 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 + GY T VGKWHL + E +P + GFDYF GF + Sbjct: 109 DLGYTTGMVGKWHLGAV-----------------------PECRPTHMGFDYFCGFLSGV 145 Query: 233 TAYYNS---------------PSLFKNRERV--PAKGYISDQLTDEAIGVVDRAKTLDQP 275 Y++ L++N ER Y ++ + + + D P Sbjct: 146 NDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLEFIREQVEKDMP 205 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 F LY A+NAPH P AP +Y ++F + A + +VD GV I+ LK+ G + Sbjct: 206 FALYCAFNAPHYPMH--APYKYLERFKHLPEDRQIMAAMLSAVDDGVGEIMNYLKRRGIF 263 Query: 336 DNTIILFTSDNGAVIDGP-----------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 ++TII F SDNG + G KG+K + GG P W + Sbjct: 264 NDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGHKFSLFDGGIRVPAIFSWPAMV 323 Query: 385 QPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G + D +PT ++AA + D ++ G +LP + + + L W Sbjct: 324 PAGQVISEPCMGTDIFPTFINAAGGNA-SDYEISGCDILPVMTIGARRDK-DCLYWEMGQ 381 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 N + P +P TE +++ L Sbjct: 382 QTAVRRGN---YKLVINGFLRDGWSLPLDPKTET--------------------KHEVWL 418 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 L+ D+ ++ NL P++ KE++ + + Sbjct: 419 SDLSQDMGEEHNLVEEMPELAKELEEKALTWRRDLEA 455 >UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSP6_9BACT Length = 512 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 132/561 (23%), Positives = 225/561 (40%), Gaps = 154/561 (27%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 +PNII++ DD+GY + + + TP + Sbjct: 17 ADKQPNIILIFADDMGYDDVGYHG--------------------------NKRIITPNID 50 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD----------GIPLT 163 S+ ++GV+F+ GYV+ V GPSRA ++TG RFG N + G+P + Sbjct: 51 SIAEQGVQFSQGYVSASVCGPSRAGLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQS 110 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 ++ + E + GY +GKWH+ ++ +P RG+D Sbjct: 111 QSMISEELKTLGYTNGMIGKWHMGFDMSL-----------------------RPNQRGYD 147 Query: 224 YFMGFHAAGTAY----------YNSPSLFKNRERVPA------------------KGYIS 255 +F GF Y + +F+N E PA + Y++ Sbjct: 148 FFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDENYLT 207 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASV 315 D TDEA+ +DR D+PF LYLAYNA H P + + + + V Sbjct: 208 DLFTDEAVNFIDRN--ADKPFFLYLAYNAVHHPWQTTQHALDKTAHLKDDKNYHVFASMV 265 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP----------------LPLNGA 359 Y++D+G+ +++++LK+ DNTII+F SDNG+ + G Sbjct: 266 YAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDPNRHRGGFTMSSTGI 325 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADI---SIPKDLK 415 +GYK TY GG P + W ++Q G YD ISA+D PT + AA K Sbjct: 326 FRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISALDLQPTLVKAAGGNDKKPQKGFA 385 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 DGV +LP+L++ K+ + ++L W + Sbjct: 386 YDGVDILPYLKEDKEIK--RSLFWRRDTDY------------------------------ 413 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQ--LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVRE 532 +R D+ L + + + L+ + D +++ NL +P++ +++Q Sbjct: 414 --------AIRKGDWKLQWNDAHGPLTITLFNIKEDPEERSNLIKQHPELAQQLQNEFDT 465 Query: 533 FIDSSQPPLSEVNQEKFNNIK 553 + +S P +E +N ++ Sbjct: 466 WDNSM--PDNEWWGGPWNRLR 484 >UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186ED10 Length = 570 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. Identities = 134/572 (23%), Positives = 213/572 (37%), Gaps = 114/572 (19%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 + S F +PNII++ DDLG+ + F + Sbjct: 31 ITLSIFFQNVVDGIERPNIIIILADDLGWNDVSFHGSN---------------------- 68 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQD 158 Q TP + +L G+ + YV + PSRA++MTG+ P G+ Sbjct: 69 ----QIQTPNIDALAYNGIILNSHYV-PALCTPSRASLMTGKYPTSLGMQHLVILSPEPW 123 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 G+PL ET +PE F +GY T AVGKWHL F +E+ P Sbjct: 124 GLPLNETLMPEYFNKNGYATHAVGKWHLG----------------------FFKKEYTPI 161 Query: 219 NRGFDYFMGFHAAGTAYYNSPSLFKNRERVP-----------AKGYISDQLTDEAIGVVD 267 RGFD G YY+ ++ + + Y +D T EAI ++D Sbjct: 162 YRGFDSHFGHWNGFQDYYDHTTMSDSLKGYDMRRNFEVDYSYQGMYTTDVFTKEAIKIID 221 Query: 268 RAKTLDQPFMLYLAYNAPHL-----PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 + P LYL++ APH P P D+ K Y A V +D+ V Sbjct: 222 NHNSQKGPLFLYLSHLAPHSGNPDNPFQAP-EDEISKHECINDPGRKIYAAMVTKLDESV 280 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQKGYKSQTYPGGTHTPMFMW 379 +++ L+KN +N+II+F SDNGA G N +G K + GG +W Sbjct: 281 GQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRGSNYPLRGLKESPWEGGVRGTAAIW 340 Query: 380 WKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDL---KLDGVSLLPWLQDKKQGEPHK 435 + +L+ D+ PT L AA ++ K+DG+ + L + P K Sbjct: 341 SPFLNKTKRVSKQLMHMSDWLPTLLTAAGLNYSSTQLINKIDGIDMWNVLSN-DLPSPRK 399 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY-------------------------- 469 + + I W + DY Sbjct: 400 EVFNNYDEIENYSSLMIDSWKYVEGTAQEGKADYWFEEPSRNNCSEYRVSNEDIFRLRRD 459 Query: 470 -------PHNPNTEDLSQFSYTVRNNDYSLVYTVEN--NQLGLYKLT-DLQQKDNLAAAN 519 P ++ +++ ++T N V T + + L+ L D ++ NLA Sbjct: 460 STIICDNPTFSSSLSITRNNHTDVKNKTKYVLTCDPLLKRFCLFNLNDDPCERLNLADVF 519 Query: 520 PQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 P VVK ++ + E S PL++ ++ ++N Sbjct: 520 PDVVKRIKNRLLELKKSVVKPLNKP-EDPYSN 550 >UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=5 Tax=Hydra magnipapillata RepID=UPI0001927538 Length = 502 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. Identities = 131/520 (25%), Positives = 212/520 (40%), Gaps = 78/520 (15%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 KP+II++ DDLG+ + F + + TP + Sbjct: 17 ADKPHIIMIVADDLGWNDISFHGSN--------------------------EIPTPNIDR 50 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPELF 171 L + GV N YV + PSR+AIMTGR P G+ +T G+ L E FLP+ Sbjct: 51 LANNGVILDNYYVL-PICTPSRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYL 109 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T VGKWHL F A+++ P RGFD + G + Sbjct: 110 KQQGYKTHGVGKWHLG----------------------FFAKQYTPTYRGFDSYYGSYLG 147 Query: 232 GTAYYNSPSLF----------KNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 Y+N + +N Y ++ T EAI + +P LYLA Sbjct: 148 KGDYWNHSNTETYSGLDLHDNENGVFSQDGNYSTEMYTAEAISCI-NNHNSSEPLFLYLA 206 Query: 282 YNAPHLPNDN----PAPDQYQKQFNTGS-QTADNYYASVYSVDQGVKRILEQLKKNGQYD 336 Y A H N AP ++ +F+ + Y A + +D GV R+ + L + D Sbjct: 207 YQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAEKKMLD 266 Query: 337 NTIILFTSDNGAVIDG---PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI 393 N+II+FT+DNG +G N +G K+ + GG F++ K P +LI Sbjct: 267 NSIIIFTTDNGGPANGFDYNWANNFPLRGVKATLFEGGVRGVSFVYSKLIESPRVSHELI 326 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 D+ PT ++ A + D LDG LQ+K+ + ++ L I + + Sbjct: 327 HITDWLPTLVNLAGGKV-SDGFLDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALRVG 385 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSY---TVRNN-DYSLVYTVENNQLGLYKL-TD 508 W + P + N + + FSY TV+ D +V ++ L+ + D Sbjct: 386 SWKIIKEGNYWDGWYPPPSFNEQSNNSFSYLSSTVKCGHDIPIVINHCDS-YCLFHIDED 444 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 + ++L+ P+V+ E+ + + S PP + + + Sbjct: 445 PCEINDLSKKFPEVLAELINRLNTYRQSMVPPRNNMTIDP 484 >UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UHK0_RHOBA Length = 478 Score = 400 bits (1027), Expect = e-109, Method: Composition-based stats. Identities = 122/493 (24%), Positives = 197/493 (39%), Gaps = 91/493 (18%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + + PN +++ DDLGYG + S T Sbjct: 35 SIAAADRPPNFVLIFADDLGYGDISCYDSS--------------------------GVKT 68 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ------DGIPLT 163 P L L EG R + +V V PSRAA++TGR P R G+ + G Sbjct: 69 PHLDQLAAEGFRSKDFFVPANVCSPSRAALLTGRYPMRCGMPVARNENVAKYKDYGFAPD 128 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E +PEL GY + VGKWHL E P + GFD Sbjct: 129 EITIPELLGPAGYRSLMVGKWHLG----------------------MELEGSHPLDAGFD 166 Query: 224 YFMGFHAAGTAYY--NSPSLFKNRERVPAK---GYISDQLTDEAIGVVDRAKTLDQPFML 278 ++G + N +L++ ++ ++ + TDE I ++R K D PF + Sbjct: 167 EYLGIPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRYTDEVIDFIERQK--DDPFFI 224 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 Y++++ H P P+PD G+ Y + +D RI++ ++ G +NT Sbjct: 225 YVSHHIVHNPL-KPSPD------FVGTSEKGKYGDFIKELDHSTGRIMQTIRDAGLDENT 277 Query: 339 IILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI-SAMD 397 +++FTSDNG +G +G G K T GG P W K+ P + ++MD Sbjct: 278 LVIFTSDNGPTRNG---SSGELSGGKYCTMEGGHRVPGMFRWTSKIAPNQVSDVTLTSMD 334 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 P + A + IP D ++DG S+LP L + PH+ L + + E Sbjct: 335 LLPLFCELAGVPIPDDRQIDGKSILPVLLGQTSESPHQFLYYYNGTNLQAVRE-----GK 389 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLA 516 + + +DD P D ++ T+ N++ L+ L DL +K N+A Sbjct: 390 WKLHLPRTTDDQPFWSKKPDKTKGFVTL-------------NEMRLFNLDRDLGEKKNVA 436 Query: 517 AANPQVVKEMQGV 529 +P++V + Sbjct: 437 DRHPEIVARLNEQ 449 >UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1I8_9PLAN Length = 427 Score = 399 bits (1026), Expect = e-109, Method: Composition-based stats. Identities = 127/491 (25%), Positives = 195/491 (39%), Gaps = 98/491 (19%) Query: 62 VLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVR 121 ++ DDLGYG + + TP + L EG+ Sbjct: 2 LILADDLGYGDVS--------------------------TYHPSDVRTPQIDQLAAEGML 35 Query: 122 FTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-----DAQDGIPLTETFLPELFQNHGY 176 T+ V PSRAA++TGR R GV D+ T L + + GY Sbjct: 36 LTSMRANCTVCSPSRAALLTGRYADRVGVPGVIRTKPEDSWGWFDPTVPTLADELKRVGY 95 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA-- 234 +TA VGKWHL S P RGFD+F GF Sbjct: 96 HTAIVGKWHLGLES-----------------------PNTPNERGFDFFQGFLGDMMDSY 132 Query: 235 ----YYNSPSLFKNRERVPAKGYISDQLTDEAIGVV-DRAKTLDQPFMLYLAYNAPHLPN 289 Y + + +NRE + +G+ ++ TD A + +RAK +QPF LYLAYNAPH P Sbjct: 133 TTHLRYGNNYMRRNREVIEPQGHATELFTDWASEYLVERAKQKEQPFFLYLAYNAPHFPI 192 Query: 290 DNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 + PA + +++ Q A V +D + R+L+ LK+ G NT+++FTSDNG Sbjct: 193 EPPAEWLAKVKERAPQLDQKRAKNVAFVEHLDHSIGRVLKTLKETGLDQNTVVVFTSDNG 252 Query: 348 AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAA 406 + N + K Y GG P + W G+++ G+ + D +PT L+ A Sbjct: 253 GSLP-HAQNNDPWRDGKQSHYDGGLRVPFMVRWPGQIKAGSRSDYVGLNFDLFPTFLELA 311 Query: 407 DISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQS 466 + K+ LD VSL+P L+ K E + + ++ Sbjct: 312 GATPSKE--LDAVSLVPVLKGGKITTSRDLYF-------VRREGGVTYGGKSYE------ 356 Query: 467 DDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKE 525 + ++ L+ + L LY + D + +LAA+N +VV E Sbjct: 357 -----------------AIIRGEWKLLQNDPYSALELYNIQNDPGETKDLAASNKKVVNE 399 Query: 526 MQGVVREFIDS 536 + +R I Sbjct: 400 LAAALRLHIQR 410 >UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C383_9PLAN Length = 405 Score = 398 bits (1024), Expect = e-109, Method: Composition-based stats. Identities = 121/492 (24%), Positives = 185/492 (37%), Gaps = 121/492 (24%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 + KPN+I++ DD G D + K + TP + Sbjct: 5 SSEKPNVIIIFTDDQG----SVDLNCYGAKDLI----------------------TPHMD 38 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPEL 170 S+ G+RFT Y + V PSRA ++TGR PAR GV N + G+P + + E+ Sbjct: 39 SIARRGIRFTQFYASAPVCSPSRAGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEM 98 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 Q GY TA +GKWHL E P +GF+ G Sbjct: 99 MQQAGYQTAHIGKWHLG-----------------------YTPETMPHGQGFETSFGHMG 135 Query: 231 AGTA------YYNSP---SLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYL 280 Y+N P L++N + V G + D + ++ + K D+PF LY Sbjct: 136 GCIDNYSHFFYWNGPNRHDLWENGKEVWRDGAFFPDLMVEQCQDYIR--KAGDKPFFLYW 193 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 A N PH P ++++K + S D Y A V ++D + +L L + TII Sbjct: 194 AINVPHYPLQGK--EKWRKTYAHLSSPRDKYAAFVSTMDDCIGEVLATLDACQLREKTII 251 Query: 341 LFTSDNGAVID----GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISA 395 +F SD+G + G G +G K + GG P + W G + G D+L + Sbjct: 252 IFQSDHGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLATG 311 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 D+ PT +P LDG +L ++ PH+N W Sbjct: 312 CDWLPTISALTGAPLPAH-HLDGKNLKAVIESSTAKSPHENFYWQIG------------- 357 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN----------NQLGLYK 505 S+ +R D+ L+ + NQ+ L Sbjct: 358 -------------------------KSWAIREGDWKLLGNPRDTSQQTPLGKENQIFLVD 392 Query: 506 LT-DLQQKDNLA 516 L+ D+ +K NLA Sbjct: 393 LSKDIGEKKNLA 404 >UniRef50_A6BYR0 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYR0_9PLAN Length = 658 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. Identities = 126/590 (21%), Positives = 212/590 (35%), Gaps = 147/590 (24%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L +A S + + PN+++ +DD+G+ + Sbjct: 5 LVVLFCMIAIS--SAETVAADRAPNVVLFLVDDMGWMDSEPYGSRY-------------- 48 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + L + +RFTN Y + P+RA+I+TG+ P+R G+ S T Sbjct: 49 ------------YETPNMSKLAKQSMRFTNAYAT-PLCSPTRASILTGQYPSRHGITSAT 95 Query: 155 DAQDG--------------------------IPLTETFLPELFQNHGYYTAAVGKWHLSK 188 + + + L E ++ GY T GKWHL Sbjct: 96 GHRPPQAENFEFLPTAAPPNQKLRMPVSKNYLEPNQYTLAEALRDAGYRTGHFGKWHLGL 155 Query: 189 ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERV 248 + + + + + + P YF + T + N Sbjct: 156 TT-----PHRPDKQGFETVWHCAPDPGPPS-----YFSPYGVTPTGKPTAQHRVGNITDG 205 Query: 249 PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP---DQYQKQFNTGS 305 P +I+D+LT EAI ++ ++ +PF L L + + H P + A + +KQ Sbjct: 206 PDGEHITDRLTSEAIQFMEAHRS--EPFFLNLWHYSVHGPWQHKAEYTAEFAKKQDPRKE 263 Query: 306 QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG------------- 352 Q + + +VD+ + RIL++L + DNT+ +F SDNG Sbjct: 264 QRNPVMASMLRNVDESLGRILQKLDELKLADNTLFIFYSDNGGNAHSWSSDDPKLKKITD 323 Query: 353 ------------------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLI 393 P N + K + Y GG P+ + W G +QPG D ++ Sbjct: 324 KHPLYKTINSYRKWAGGEPPTNNAPLREGKGRIYEGGQRVPLMVRWPGHIQPGTTSDAIV 383 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 +D YPT LD+ +S P + +DG S LP L+ + E TW Sbjct: 384 GPIDLYPTILDSLKLSQPANQIIDGKSFLPVLEQTGELERTAYFTWFPHLI--------- 434 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL-----GLYKL-T 507 + +VR D+ L+ E ++L LY L Sbjct: 435 ---------------------------PAVSVRQGDWKLIRRFEPHRLYPEIRELYNLKA 467 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ---EKFNNIKK 554 D+ + DNLA P V+E+ ++ EF+ + + N + NI Sbjct: 468 DISESDNLARQRPDKVRELDALIDEFVKETGALYPQPNPAYKPRPGNIDN 517 >UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S8Q2_NEMVE Length = 540 Score = 398 bits (1023), Expect = e-109, Method: Composition-based stats. Identities = 125/505 (24%), Positives = 207/505 (40%), Gaps = 73/505 (14%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 S G P+I+ + MDDLG+ + + S TP Sbjct: 29 LSMAGPPHIMFILMDDLGWSDVGYHNISHA-------------------------VKTPN 63 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLP 168 + L +GV+ + Y + PSR A+MTG+ P G+ N + G+P +P Sbjct: 64 IDKLASQGVKLMSYYS-QPMCTPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIP 122 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 + + GY T+ +GKWHL F ++ P RGFD F+GF Sbjct: 123 QKLRTLGYRTSMIGKWHLG----------------------FFDWDYTPLRRGFDSFLGF 160 Query: 229 HAAGTAYYNSPS---LFKNRERVPAKGY----ISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 A ++ L R+ PA Y +D T EAI + R QP L L+ Sbjct: 161 FAGEQDHWRHSKMGFLDFRRDEEPANEYGGQHSTDVFTQEAINIAMRH-NASQPLFLLLS 219 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 Y A H P P+ K + NY + + D + R+++ K+NG ++NT+++ Sbjct: 220 YAAVHTPLQA-HPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGLWNNTLMI 278 Query: 342 FTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL---QPGNYDKLISAMDF 398 + SDNGA N +GYKS + GG P F+ G++ + G + L D+ Sbjct: 279 WASDNGAQPGKGGGYNWPLRGYKSSLFEGGVRVPAFVH--GEMLQRKGGTVNDLFHVTDW 336 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 YPT + A + D +DGV P L + K + + L I ++ +E P NY Sbjct: 337 YPTLVKLAGGEVEPD--IDGVDQWPTLSEGKPSKREEILHNIDIPANQEEERMAPRGFNY 394 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ-----LGLYKLT-DLQQK 512 + + D + + +V + ++ L LY +T D +++ Sbjct: 395 YSGAALRRGHMKLVYKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITADPEER 454 Query: 513 DNLAAANPQVVKEMQGVVREFIDSS 537 ++L+ NP +V + ++E +S Sbjct: 455 NDLSKLNPDIVDSLWRRLQELNATS 479 >UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG4_9BACT Length = 489 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 121/504 (24%), Positives = 205/504 (40%), Gaps = 98/504 (19%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + KPNI+ DDLGYG + G + + T Sbjct: 22 VSLQAQQKPNILFYLTDDLGYGDI----GCYGAEGQY----------------------T 55 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG----VYSNTDAQDGIPLTET 165 P + L EG +F++ YV PSRAA MTG R G +Y + + G+ +E Sbjct: 56 PAIDQLAKEGTKFSSFYVHQR-CSPSRAAFMTGSYAHRVGLPQVIYKHREGPIGLNPSEI 114 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LPEL + GY TA VGKWHL + + + P N G+DYF Sbjct: 115 TLPELMKTAGYNTALVGKWHLGE-----------------------WKPFHPLNHGYDYF 151 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ----LTDEAIGVVDRAKTLDQPFMLYLA 281 GF PSL +NR+ + +K ++ + AI + + K PF L + Sbjct: 152 YGFLKV-IEGSEKPSLIENRKELASKIQKTEGQAPGMVKAAINFMTKHKK--NPFFLVYS 208 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 PH P + + G+ NY ++ +D K +++ L + G +NTI++ Sbjct: 209 DPMPHAPY-------FPSEQFKGTSKRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVV 261 Query: 342 FTSDNGAVIDGP----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAM 396 FTSDNG ++ + L+G + K + GG P + W GK++ + D +I + Sbjct: 262 FTSDNGPPVERQKKYDVGLSGPLRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGII 321 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D PT + A + +P D +DGV++LP L ++ + + Sbjct: 322 DMLPTFCELAGVDVPNDRVIDGVNILPQLLGDQESKALRE---------TQIVPGATIIH 372 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNL 515 N K+ Q + Y +N ED + + L+ L D+ + + Sbjct: 373 NGWKYYAKQQNPY-NNKKPEDWNGLQPA--------------KEGALFNLKEDIGETTEV 417 Query: 516 AAANPQVVKEMQGVVREFIDSSQP 539 +A +P++ + ++ + +F+ + Sbjct: 418 SAQHPEIAESLKKNMAKFMAELKK 441 >UniRef50_A6DGL0 Arylsulfatase A n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGL0_9BACT Length = 506 Score = 397 bits (1020), Expect = e-109, Method: Composition-based stats. Identities = 128/540 (23%), Positives = 213/540 (39%), Gaps = 107/540 (19%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGK-------PNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 +K++ + ++A S + ++ + K PNII++ DD G G L Sbjct: 1 MKIRKSFISIALSLLSLNNFAAETKKILKGAKPNIIMVLTDDQGMGDLSC---------- 50 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 + TP + + + RFT+ V+ P+RAAIM+GR+P Sbjct: 51 ----------------MGNPILRTPHIDKMYAKSTRFTDFQVSST-CTPTRAAIMSGRSP 93 Query: 146 ARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G+ +D + P+ Q GY T GKWHL Sbjct: 94 FEVGISHTLMQRDRLAPAVITFPQALQKSGYKTGLFGKWHLGD----------------- 136 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS-------------LFKNRERVPAKG 252 EE++PQNRGFD + A G YN L N V KG Sbjct: 137 ------GEEYRPQNRGFDEVLMHGAGGIGQYNFGDFKPNATNKYFDNVLLHNDTIVQTKG 190 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF--NTGSQTADN 310 + +D A+ + + +Q + Y++ NAPH P AP++Y+K+F +Q+ Sbjct: 191 FCTDVFFKAALSWIKKQHENNQTYFAYISLNAPHGPLI--APEKYKKRFIDEGYNQSVAA 248 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG---------AVIDGPLPLNGAQK 361 Y + ++D ++E+LK+ DNT+I+F +DNG V N K Sbjct: 249 RYGMIENIDDNFGLMVEKLKEWKALDNTLIIFMTDNGMAMKSIGKKGVKGKFNAWNAGMK 308 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPK-DLKLDGV 419 G+K + GG+ P F +WKG L G L + +D Y T + A +IP+ L G Sbjct: 309 GHKDSAWEGGSRVPSFWYWKGVLGEGVDISALSAHIDLYRTFCELAGTNIPESSLSPSGR 368 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 SL+P L++ + L + E Sbjct: 369 SLIPLLENPNAKWDDRTLFFHRG---------------------RWGGGGRGKKTRELAK 407 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + VRN+ + LV ++ + L + D + N+ +P+V ++M+ ++ DS++ Sbjct: 408 YYGMAVRNSRWRLVNIMDGDGPWLSDIANDPGETKNVIEQHPEVAEKMKAQFDQWWDSTE 467 >UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A2W0_9PLAN Length = 477 Score = 396 bits (1019), Expect = e-109, Method: Composition-based stats. Identities = 123/532 (23%), Positives = 193/532 (36%), Gaps = 133/532 (25%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +L A V + + KPN +++ +DDLGY + + Sbjct: 6 SRLLALLIVVGWLVSSSCAQEVATKPNFVIINIDDLGYADIEPFGSEVN----------- 54 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L ++ DEG++ T Y A V PSRAA+MTG P R Sbjct: 55 ---------------RTPNLNAMADEGMKLTCFYAA-PVCSPSRAALMTGCYPKRALTIP 98 Query: 153 NT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + +G+ E + EL + GY TA +GKWHL Sbjct: 99 HVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLGDQ-------------------- 138 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGT---------AYYNSPSLFKNRERVP----------- 249 ++ P +GFDY+ G + + Y +P + + P Sbjct: 139 ---PDFLPTRQGFDYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVL 195 Query: 250 ------AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 + + T+EAI + + ++PF LYL ++A H P Y Sbjct: 196 QRVLAKDQTELVTNYTEEAIQFIRDHQ--EKPFFLYLPHSAVHFPM-------YPGDAFR 246 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY 363 G + Y V VD V ++L+ LK G T+++FTSDNG +N + Sbjct: 247 GKNSHGLYNDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFG-AVNKPLRAG 305 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K+ TY GG P + W GK+ G + D ++ +D PT + A + P D K+DG + Sbjct: 306 KATTYEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTTPTDRKIDGADIG 365 Query: 423 PWLQD-KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 P L K+ PH + Y Sbjct: 366 PILAGVKEAKSPHDVFYFYRGYDL------------------------------------ 389 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 VR+ + L + LY L D+ + N+A N VV+ ++ + E Sbjct: 390 -EAVRSGPWKLRL----KEGALYNLHEDISEAKNVAPDNADVVERLRKIAAE 436 >UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteria RepID=A4CGL5_9FLAO Length = 526 Score = 396 bits (1019), Expect = e-108, Method: Composition-based stats. Identities = 108/526 (20%), Positives = 187/526 (35%), Gaps = 94/526 (17%) Query: 24 AFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPK 83 + L + S+F + + PNI+++ DD GY + Sbjct: 41 CVRYPLLAIILLGVSCRETVKSEFAAADRA-DRPPNIVIIFTDDQGYSDVGVYG------ 93 Query: 84 TMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR 143 A TP L ++ +G+ TN Y A V SRA ++TG Sbjct: 94 --------------------ARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTGC 133 Query: 144 APARFGVYSN--TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 P R G+++ ++ G+ E L EL + GY T GKWHL Sbjct: 134 YPNRVGIHNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDH------------ 181 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-----------SLFKNR---ER 247 ++ P GFD F G + + P L++ + Sbjct: 182 -----------PDFLPTRHGFDEFFGIPYSNDMWPLHPLQGPVFDFGPLPLYEQERVVDT 230 Query: 248 VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 + + ++ Q+T+ ++ ++R K ++PF LY+ + PH+P + G Sbjct: 231 LEDQRLLTRQITERSVDFINRHK--EEPFFLYVPHPQPHVPL-------FVSDAFRGKSG 281 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKS 365 Y + +D V ++L L+ NG D+T ++FTSDNG + + K Sbjct: 282 RGLYGDVIMEIDWSVGQVLGALEDNGLTDDTWVIFTSDNGPWLAYGNHSGRAEPLREGKG 341 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 + GG P M + G+L G D+ + A+D PT P ++DG + Sbjct: 342 TNWEGGVREPCIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGSPQP-GREIDGKNAWGL 400 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L + P + + +E ++ + H D +Y Sbjct: 401 LSGAEARGPQDAYY----FYYRVNELQAVRDGDWKLVLPHNYRTMQGQEPGADGLPGAY- 455 Query: 485 VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGV 529 V+ LY L D + +NLA +P+V+ + Sbjct: 456 ---------DYVDVTAPELYNLREDPGETNNLAERHPEVLAAISRK 492 >UniRef50_B1KFX9 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KFX9_SHEWM Length = 548 Score = 396 bits (1018), Expect = e-108, Method: Composition-based stats. Identities = 128/608 (21%), Positives = 225/608 (37%), Gaps = 135/608 (22%) Query: 8 SVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGK-------PNI 60 ++ S ++ ++ A+ + F + K +K VA + + + K PNI Sbjct: 6 ALYSVALIVLSAALLWTFRFDVLVTLASKKSKKPVAENQEINWQKGPEQKDSTKTDLPNI 65 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +++ DD+G + G TP + L +G Sbjct: 66 VLILADDMGINDVSTFGG--------------------------GMIETPNIDKLAAKGA 99 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA------------------------ 156 FTNGY H PSRAA++TGR R G + Sbjct: 100 LFTNGYSGHANCAPSRAALLTGRDATRTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYS 159 Query: 157 -----------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G+P +E +PE+ + GY+T +GKWHL + PE D Sbjct: 160 AEADATNPTYDNRGLPGSEILIPEILKESGYHTMHIGKWHLGRS-----PEMMPNAQGFD 214 Query: 206 NFTTFSAEEWQPQNR-----------GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYI 254 + + P + G D F+ + +N E GY+ Sbjct: 215 ESLMMDSGLYLPVDHPESVNAPVESSGLDRFIWATMRYSVNWN------GGEIFKPNGYL 268 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYAS 314 +D T+EA ++ ++PF LYLA+ PH P D Y+ + Y A Sbjct: 269 TDYFTEEAEKAIEAN--ANRPFFLYLAHWGPHNPVQAKRAD-YEAVGDIQPHNKRVYAAM 325 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKGYKSQTYPGGTH 373 + S+D+ V+R++ +L+K G DNTI++ +SDNG + LN +G+K+ + GG Sbjct: 326 LRSIDRSVERVMAKLEKQGIADNTIVILSSDNGGADYVAINDLNKPYRGWKNTFFEGGIR 385 Query: 374 TPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ-G 431 P + W + ++ ++ +D PT ++ A+ +P+D ++DGV + P Q + + Sbjct: 386 VPFSVTWPNVIDESTVIEEPVNHIDLMPTIINMANADLPQDREIDGVDIAPLWQGQPELE 445 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 P + W T V++ + Sbjct: 446 RPQNAMFWFTGDYR--------------------------------------VVQSKGWK 467 Query: 492 LVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 L ++ Q L+ L D ++ NLA + + E+ ++ ++ + E Sbjct: 468 LQQNPKSGQTFLHNLNVDPTEQKNLADSESAKLAELTKLIDAHFANAVDVIGESTIAAPI 527 Query: 551 NIKKALSE 558 I K L E Sbjct: 528 TIDKHLGE 535 >UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2X1_9BACE Length = 494 Score = 396 bits (1017), Expect = e-108, Method: Composition-based stats. Identities = 146/566 (25%), Positives = 224/566 (39%), Gaps = 133/566 (23%) Query: 17 ILASGMAAFAAHAADDVKLKATKTNVAFSDFTP---------TEYSTKGKPNIIVLTMDD 67 I A+ + F AA ++ N+A D+ P E PN++V+ +DD Sbjct: 25 IAAAILVVFLLTAAVLNRVV----NIALRDYRPEGRKRYLEGVELENGDPPNVVVIYVDD 80 Query: 68 LGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYV 127 +GYG L A STP + +L + GV TN Y Sbjct: 81 MGYGDLGCTG--------------------------ATAISTPNIDALAEGGVLLTNYYA 114 Query: 128 AHGVSGPSRAAIMTGRAPARF---GVYSNTDA-------------------QDGIPLTET 165 + SRA ++TGR P R G Y NT+ DG+P E Sbjct: 115 PAPICSASRAGLLTGRYPIRTLTSGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDEI 174 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LPE+ Q GY TA VGKWHL EE +P NRGFD F Sbjct: 175 LLPEVLQQAGYETALVGKWHLGI-----------------------REEERPYNRGFDLF 211 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAKGY----ISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 G + + ++ N E V + Y ++ +LT A +D + D PF LY A Sbjct: 212 YGALYSDDN--DPHRIYHNDEVVHDEPYDQSGMTKELTQVAKQFIDDNQ--DGPFFLYYA 267 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 PH P++ A +++ + A Y + VD V I++ L++NG +NT+++ Sbjct: 268 SPFPHWPSN--ASEEWLG-----TSQAGIYGDCMQEVDWSVGEIMDTLEENGLLENTLVI 320 Query: 342 FTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYP 400 FTSDNG DG G Q+G K Y GG+H P + G + G D L+S +D +P Sbjct: 321 FTSDNGPWYDGA---TGGQRGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVDVFP 377 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 T L+ I +P+D +DG+ + P+L + P L D++ ++ K Sbjct: 378 TILNLLGIELPQDRVIDGMDMWPFLTGQSD-SPRTELFLN------KDKDTFALIEDNFK 430 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAAN 519 ++ + + L LY L TD ++ ++ Sbjct: 431 YLERSYSENGTY-----------------WML-----QQGPFLYNLDTDPEEAYDVTTHF 468 Query: 520 PQVVKEMQGVVREFIDSSQPPLSEVN 545 P+ +EM + F S + + Sbjct: 469 PEKAEEMAQKIDSFKQSLKENIRGWK 494 >UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZLN5_9PLAN Length = 468 Score = 396 bits (1017), Expect = e-108, Method: Composition-based stats. Identities = 121/551 (21%), Positives = 206/551 (37%), Gaps = 140/551 (25%) Query: 33 VKLKATKTNVAFSDFTPTEYST---KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 +L +A + + K P+I+++ DD G+ L Sbjct: 4 TRLMTFVCALASALLVSNAVAAEKSKRPPSIVLIVSDDQGFADLSC-------------- 49 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 I TP L L G R T+ YV+ PSRA++MTGR P R G Sbjct: 50 ------------IGDNGCRTPRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNG 97 Query: 150 VYSNTDAQD--------------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 Y + G L E FL ++ + GY +A GKW Sbjct: 98 TYDMIRNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKW----- 152 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKN 244 + + P RGFD + GF G Y+ PS+F++ Sbjct: 153 ------------------DGGQLKRYLPLQRGFDQYYGFANTGVDYFTHERYGVPSMFRD 194 Query: 245 RERVPAKG--YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND--------NPAP 294 + Y++D EAI +D D+PF LYL +NAPH ++ AP Sbjct: 195 NQPTEEDKGTYLTDLFEREAIRFIDENH--DRPFFLYLPFNAPHSASNLDRSIRGFAQAP 252 Query: 295 DQYQKQFN----TGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 +Y F + Y A+V +D+ + ++++QL+++ DNT+I+F SDN Sbjct: 253 QEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQLQQHQIADNTLIIFLSDN---G 309 Query: 351 DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADIS 409 G N +G K++ + GG P + W GK+ G ++ +++++ +PT + A Sbjct: 310 GGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLTSLEVFPTVIAAIGGK 369 Query: 410 IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY 469 +P D+ DG +LP L P + + W Sbjct: 370 LPDDVIYDGFDMLPVLNG--ASSPREEMFW------------------------------ 397 Query: 470 PHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQG 528 + R D+ V + GL+ L D+ +K +L+ +P+++ +++ Sbjct: 398 --------KRRGDVAARVGDWKWVDSAAGK--GLFDLAHDIGEKKDLSKEHPEMLAKLKA 447 Query: 529 VVREFIDSSQP 539 + + Sbjct: 448 RFDAWTAEMEA 458 >UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UMZ5_RHOBA Length = 484 Score = 396 bits (1017), Expect = e-108, Method: Composition-based stats. Identities = 118/556 (21%), Positives = 204/556 (36%), Gaps = 140/556 (25%) Query: 21 GMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSF 80 + A +HA L+A + +PNI+++ DDLGYG L G + Sbjct: 17 MLVALCSHACVPTLLRAD---------------SNDRPNIVLILADDLGYGDL----GCY 57 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 +++TP L L +GVR+T Y P+RAA++ Sbjct: 58 GND----------------------EQATPVLDRLATQGVRWTQAYANGPECSPTRAALL 95 Query: 141 TGRAPARFG-----------------VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGK 183 TGR G + + + G+P L + + GY TA GK Sbjct: 96 TGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFGK 155 Query: 184 WHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS----- 238 WHL + + P GFD + YY+ Sbjct: 156 WHLGYEAK-----------------------FSPMMHGFDEALYCIGGAMDYYHYLDSVA 192 Query: 239 -PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQY 297 +LF N + +GY +D +TD+A+ + D+PF LYL Y APH P P Sbjct: 193 TYNLFHNGRPISGEGYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHTPYQAPGESPV 252 Query: 298 Q------KQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID 351 + + Y A V +D+G+ ++L ++++ D T+++F SDNG Sbjct: 253 DPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVIFASDNGGTSA 312 Query: 352 GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISI 410 N +G+K Q + GG P+ W G L G D++ D + L AA I+ Sbjct: 313 SR---NEPLRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAGITP 369 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 ++ ++G+ +L + + +P + L W + W Sbjct: 370 TQEDAMEGIDVLSLAANDEPVQP-RTLYWRKP-------RDPQVWSG------------- 408 Query: 471 HNPNTEDLSQFSYTVRNNDYSLVYTVENN-------QLGLYKLT-DLQQKDNLAAANPQV 522 +R+ ++ V + Q L+ L D+ ++ +LA+ + Sbjct: 409 --------------MRDGNWKYVRQEKATVDGRTSIQEWLFNLADDISEQTDLASQSTDE 454 Query: 523 VKEMQGVVREFIDSSQ 538 + ++G + S + Sbjct: 455 LDRLRGRYLAWEQSVR 470 >UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8S3_9PLAN Length = 481 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. Identities = 120/522 (22%), Positives = 185/522 (35%), Gaps = 98/522 (18%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 T KPN IV+ DDLGYG L Sbjct: 28 QQTLFAAQATAKPNFIVIFADDLGYGDLECYG--------------------------HP 61 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET 165 + TP L + EG R T V PSRA ++TGR P R GV+ N + Sbjct: 62 RFKTPHLNQMAAEGARLTQFNVPVPYCAPSRATLLTGRYPWRHGVWYNPAPDGQQFRSGV 121 Query: 166 FLP-------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 + EL + +GY T +GKWHL E+ P Sbjct: 122 GIAESELLLSELLKENGYATICIGKWHLGHD-----------------------PEYYPT 158 Query: 219 NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA---KGYISDQLTDEAIGVVDRAKTLDQP 275 GFD ++G + +L + + + + ++ + T+ A+ + + P Sbjct: 159 RHGFDDYLGILYSNDMR--PVNLMQGEKLLEYPVIQANLTKRYTERAVKFIQENQEG--P 214 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 F LYL + PH P + A Y + +D V I + L++ Sbjct: 215 FFLYLPHAMPHKPLAA-------SEAFYKKSGAGLYGDVIAELDWSVGEIFKTLRELNLD 267 Query: 336 DNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLIS 394 +NT+++F SDNG G G KS T+ GG PM W GK+ P D + Sbjct: 268 ENTLVIFASDNGPWFGGN---TAGLSGMKSTTWEGGLRVPMIARWPGKIPPRQVIDTVCG 324 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT--------------WI 440 ++D +PT L A I +P D +DG L P L K+ PH+ L W Sbjct: 325 SIDVFPTILKQAGIPVPADRVIDGKDLFPVLT-KQAPTPHQALYSMKGNSLFTVRSGPWK 383 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + N+ R P + + + N D + Sbjct: 384 LHVKPSPRQVLAGKGKNWID-PRGPDGITIIAPYEQAMPDQQPGIHNGD-------QPVP 435 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 + L+ L D+ ++DN+A +P+VV + + E + Sbjct: 436 MMLFNLQQDIAEQDNVADEHPEVVARLMKLYHEMQAEVPASI 477 >UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SRP2_NEMVE Length = 491 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. Identities = 122/532 (22%), Positives = 198/532 (37%), Gaps = 75/532 (14%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 + F + KP+++ + DDLG+ + F Sbjct: 8 HCFFLCLNVVVLQSSAKPHLLFVLADDLGWSDVGFH------------------------ 43 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQ 157 ++ TP + L GV N YV V P+RA++MTG+ P G+ + Sbjct: 44 ---GSKIQTPNIDRLAANGVILDNYYV-QPVCTPTRASLMTGKYPIHTGLQHGIIHNGRP 99 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 G+PL T LP+ + GY T +GKWHL F E P Sbjct: 100 YGLPLNLTLLPQKLRKAGYSTHMLGKWHLG----------------------FYNWESTP 137 Query: 218 QNRGFDYFMGFHAAGTAYY-----NSPSLFKNRERVPAKG--YISDQLTDEAIGVVDRAK 270 RGFD F GF++ +Y + L N E V + Y + T A + RA Sbjct: 138 TYRGFDTFYGFYSGAENHYTHVQDHYLDLRDNEEIVRDQNGTYSAHLFTKRA-EQIVRAH 196 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT-ADNYYASVYSVDQGVKRILEQL 329 P +Y+A+ H P AP +Y +++ Y A V +D + + Sbjct: 197 DPSTPLFMYMAFQNVHSPVQ--APKEYIDRYSFIKDPLRRTYAAMVTIMDDALGNLTRAF 254 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-N 388 K G ++NTI++F++DNG V + +G K + GG F+ Q G Sbjct: 255 DKAGLWENTILIFSTDNGGVPKNG-GYDYPLRGRKDTLWEGGVRGVAFVHGVALEQSGVK 313 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 L+ D+YPT + A S+ +D LDG + + + + L I + + Sbjct: 314 CKALMHVTDWYPTLVSLAGGSLDEDEDLDGYDVWESISHGVESPRKELLHNIDTINIPPG 373 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ-----LGL 503 + ++ F PN + D+ Y NN+ + L Sbjct: 374 DGSLGFSTTGIGLRVGDMKLLMAVPNISYFIPPEDRNGSVDW---YIHSNNKVPMVEVAL 430 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 Y +T D +K +L P VV +Q V + ++ PP ++ + K Sbjct: 431 YNITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPANKPKDPYARQVAK 482 >UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI7_BACV8 Length = 508 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. Identities = 131/535 (24%), Positives = 198/535 (37%), Gaps = 130/535 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPNII + DD+GYG L + STP + ++ Sbjct: 27 KPNIIYIMCDDMGYGDLGCYGQPY--------------------------ISTPNIDNMA 60 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-------------------AQ 157 EG+RFT Y VS PSRA+ MTG+ V N + Q Sbjct: 61 KEGMRFTQAYAGSPVSAPSRASFMTGQHSGHCEVRGNKEYWRDAPVVMYGNNKEYAVVGQ 120 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 +PE+ +++GY T GKW +V P+ + +Y+ F A + P Sbjct: 121 HPYDPGHVIIPEIMKDNGYTTGMFGKWAGGYEGSVSTPDKRGIDEYYGYICQFQAHLYYP 180 Query: 218 QNRGFDYFMGFHAAGTA--------YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 F A TA N P K+ + P Y +D + +EA+ +D+ Sbjct: 181 N---FLNRYSKSAGDTAVVRVVMDENINYPMFGKDYFKRP--QYSADMIHEEAMKWLDKQ 235 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQ---YQKQFNTGS--------------QTADNYY 312 QPF Y PH P YQK+F T + Sbjct: 236 -DGKQPFFGIFTYTLPHAELAQPEDSILTGYQKKFFEDKTWGGQEGSRYNPSVHTHAQFA 294 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQT 367 + +D V +L +LK+ G +NTI++FTSDNG +G +G +G K Q Sbjct: 295 GMITRLDYYVGEVLNKLKEKGLDENTIVIFTSDNGPHEEGGADPTFFGRDGKLRGLKRQC 354 Query: 368 YPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISI--------PKDLK-LD 417 Y GG P + W GK+ G + ++ D PT D A + KD+ D Sbjct: 355 YEGGIRIPFIVRWPGKVPEGTVNDHQLAFYDLMPTFCDLAGVKNYVKKYTNKKKDVDYFD 414 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G+S P L ++ + H L W + Sbjct: 415 GISFAPTLLGQEGQKKHDFLYWEFDETDQIG----------------------------- 445 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 VR D+ +V V+ LY L TD+ + ++AA +P +VK+M+ ++R Sbjct: 446 -------VRMGDWKMV--VKKGTPFLYNLATDIHEDHDIAAGHPDIVKQMKEIIR 491 >UniRef50_B5JMW2 Sulfatase domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JMW2_9BACT Length = 594 Score = 395 bits (1015), Expect = e-108, Method: Composition-based stats. Identities = 127/523 (24%), Positives = 201/523 (38%), Gaps = 111/523 (21%) Query: 44 FSDFTPTEYSTKG-KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 +S TP ++ G PN++V+ DD G+G L Sbjct: 15 YSLLTPLSAASGGNPPNVLVILADDQGWGDLSLHGS------------------------ 50 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL 162 +TPTL L +G +F N YV V P+RA +TGR R GVY + + Sbjct: 51 --QNLNTPTLDRLAQQGAQFENFYV-QPVCSPTRAEFLTGRYYPRGGVYDTGAGGERLDA 107 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 E + ++F+ GY TAA GKWH + + P RGF Sbjct: 108 DEETIAQVFRTAGYATAAFGKWHNGTQA-----------------------PYHPNTRGF 144 Query: 223 DYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 D + GF + Y L N V + GY+ D LT + +++ PF YLA Sbjct: 145 DEYYGFTSGHWGSYFDALLDHNGSLVQSAGYLPDTLTTATLDFIEQQTADQTPFFAYLAL 204 Query: 283 NAPHLPNDNPAPDQYQ-----------KQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 PH P D + + A V ++D V R+L+++++ Sbjct: 205 PTPHSPMQTTDEDWARFANKKLTSLATNPADENPDHTRAALAMVENIDANVGRLLDRIQE 264 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYD 390 +NTI+++ +DNG N +G K T GGT +P+F+ + K+QPG + Sbjct: 265 LDIEENTIVVYFTDNGP---NGWRYNANMRGRKGSTDEGGTRSPLFIRYPQKIQPGATLN 321 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 + S++D PT A+I+ LDG+SL P LQ+ P + + S+W Sbjct: 322 TIASSIDLLPTLGQLANITWQPAQTLDGISLAPQLQNPNLRLPDRTIF-----SYWSGRI 376 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDL 509 + R DY L ++Q LY + TD Sbjct: 377 SA---------------------------------RTQDYRL-----DHQGQLYHIPTDR 398 Query: 510 QQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNI 552 Q +L+ +P++ +Q + + P N ++ I Sbjct: 399 GQTTDLSTKHPELTASLQSQIDAWRSELLTP-DAANPDRPLTI 440 >UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C586_BEUC1 Length = 478 Score = 395 bits (1014), Expect = e-108, Method: Composition-based stats. Identities = 126/549 (22%), Positives = 208/549 (37%), Gaps = 132/549 (24%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 +PNI+++ +DDLG+ L +F TP + + Sbjct: 13 PDRPNIVLVVVDDLGWRDLGCFGSTF--------------------------YETPHIDA 46 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-----------QDGIPLT 163 L G RFT+ Y A V P+RA+++TG+ PAR GV + G+P Sbjct: 47 LAASGTRFTHSYAAAPVCSPTRASLLTGKYPARVGVTNWIGGHAIGALRDVPYFHGLPQD 106 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E L + GY T VGKWHL ++ P++ GFD Sbjct: 107 EYALARALRAGGYRTWHVGKWHLGGGRHL------------------------PEHHGFD 142 Query: 224 YFMGFHAAGTA-YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 +G A+G+ Y +P E P +++D+LTD A+ +V + D PF+L L + Sbjct: 143 LNVGGSASGSPVSYYAPYGIGALEDAPDGEFLTDRLTDVAVDLVR--SSDDAPFLLNLWH 200 Query: 283 NAPHLPNDNPA--PDQYQKQFNTGS------------------------------QTADN 310 A H P + PA ++Y+ + T Q+ Sbjct: 201 YAVHTPIEAPAHLVEKYRHKAETLGLPTHGPDAVEAGEHMPARHLRSERVRRRRIQSDPT 260 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTY 368 Y A + ++D V R++ L+ G+ D+T+I+FTSDNG +G N K Sbjct: 261 YAAMLETLDGAVGRLVTALRDVGKLDDTLIVFTSDNGGLSTAEGSPTCNAPLSEGKGWMA 320 Query: 369 PGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 GGT P + W G++ G D ++ DFYPT L AA ++ + +DGV+L P Q Sbjct: 321 DGGTRVPTIVSWPGRVPAGARSDLPFTSPDFYPTLLAAAGLTQLPEQHVDGVNLWPAWQG 380 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 + W Y H+ ++ P S VR+ Sbjct: 381 --APLDRGPIFW--HYPHYSNQGGAP----------------------------SAAVRD 408 Query: 488 NDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + LV L+ + D+ + +++ VV + + ++ + Sbjct: 409 GRWKLVRHFGIEHDELFDVVADVSESHDVSGRRRDVVARLSVTLDSWLADVGALIPRRTT 468 Query: 547 EKFNNIKKA 555 + + Sbjct: 469 PPPDTFDRP 477 >UniRef50_A3ZWK4 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3ZWK4_9PLAN Length = 442 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 123/490 (25%), Positives = 183/490 (37%), Gaps = 71/490 (14%) Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DD G+G + ++ AA TP L EGVRFT Sbjct: 1 MADDQGWGDVGYN--------------------------HAAPIHTPNLDQAAAEGVRFT 34 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGK 183 Y A V P+R +++TGR P R VY+ I E L E Q GY T+ GK Sbjct: 35 RFYAAAPVCSPTRCSVLTGRNPNRSAVYAW---GWPIRPQEITLAERLQAAGYATSHFGK 91 Query: 184 WHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK 243 WHL + P GFD ++ +A Y N P + Sbjct: 92 WHLGSVRKDS--------------------PVSPGKCGFDDWI---SAPNFYDNDPIMSD 128 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 V G SD D AI + ++PF + + +PH P+ D ++ + Sbjct: 129 QGRAVQYHGESSDVTADLAIDWIRAQAKEEKPFFSVVWFGSPHSPHIAADAD--RELYKD 186 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY 363 +YY V +D+ +I LK+ G DNTI+ + SDNGA D G + Sbjct: 187 EPAKFRDYYGEVTGIDRAYGKIRSTLKELGISDNTILWYCSDNGA--DKAKGSAGPFREK 244 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K Y GG P + W + L + D +PT L AA +S K LDG++LL Sbjct: 245 KGSIYEGGLLVPGILDWPARFPAPQTTSLRATTCDIFPTVLAAAGLSPDKQRPLDGINLL 304 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDE----------ENIPFWDNYHKFVRHQSDDYPHN 472 P L K P W T+ + D V + D P Sbjct: 305 PLLTAKTDMRPQPIGFWQTANGGKPVRSDAMMEELLNQQATGGDLPADEVSLHAADLPKP 364 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENN---QLGLYKL-TDLQQKDNLAAANPQVVKEMQG 528 P + D ++ + D+ L + LY L D +K+N+ P++ +++ Sbjct: 365 PVSIDTLAGHASLTSGDWKLHRIENKKGAVRFELYDLAADPYEKENVLKQYPEIAEKLTK 424 Query: 529 VVREFIDSSQ 538 + R++ S Sbjct: 425 LQRDWRLSVV 434 >UniRef50_B4D433 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D433_9BACT Length = 465 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 125/518 (24%), Positives = 197/518 (38%), Gaps = 107/518 (20%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 ++ KPN+I +DDLG L SF T Sbjct: 19 AADASPAKPNVIFFLVDDLGATDLSCFGSSF--------------------------YQT 52 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-------------DA 156 P + L +G++FT+ Y A V P+RA+I++GR PA + D Sbjct: 53 PNIDRLAQDGLKFTHAYSACTVCSPTRASIISGRYPAELHLTDWIAGHKRPKAKLRIPDW 112 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + + LP+ GY T A+GKWHL E Sbjct: 113 TQHLTHDVSTLPQAMHAAGYTTCAIGKWHLG--------------------------EDG 146 Query: 217 PQNRGFDYFMGFHAAGT-AYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 P+ GFD + + G A Y SP + P ++SD+LT EA +++ K + P Sbjct: 147 PEKYGFDVAIADNGKGQPATYFSPYKNPHLSDGPPGEFLSDRLTTEAEKFIEQNK--EHP 204 Query: 276 FMLYLAYNAPHLPN--DNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKN 332 F LY A+ A H P +Y++ + Q Y + + SVD + + +L + Sbjct: 205 FFLYFAHYAVHTPLMGKPAVIAKYKEHVSPNDPQHNPVYASLIESVDDSLGHLRAKLDEL 264 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DK 391 D TII+FTSDNG +I + N + K Y GG P + G Q G Sbjct: 265 KLSDKTIIIFTSDNGGLILNQVTSNLGMRAGKGSAYEGGVRVPAIAFVPGVTQAGTVATT 324 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + +MD+ T LD A + GVSL P L + + L W Y H+ Sbjct: 325 PVISMDWTATMLDLAGAKPLDQQR--GVSLAPVLHGGQISL--RALFW--HYPHYHPGGA 378 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQ 510 P+ + +++ LV E+N + LY L+ D + Sbjct: 379 TPY----------------------------CAMLEDNWRLVEFFEDNHVELYHLSDDPE 410 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 +K +LAA+ +E++ + + ++ L N + Sbjct: 411 EKHDLAASQSAKAEELKARLHAWRETMHAQLPTPNPDY 448 >UniRef50_Q7UYH3 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYH3_RHOBA Length = 598 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 115/535 (21%), Positives = 197/535 (36%), Gaps = 118/535 (22%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + ++ + + E + +PN+IV+ DD G G F Sbjct: 18 SALSSTSVSAAETNAAERPNVIVIMSDDQGVGDYGF------------------------ 53 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 + TP+L + + + YV++ V P+RA++MTGR R + Sbjct: 54 --MGNPIIRTPSLDKMRTQSGYLSRFYVSN-VCAPTRASLMTGRYNYRTRCIDTYVGRAM 110 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + E L E GY T GKWHL +P + Sbjct: 111 MDPDEVTLAERLSEAGYQTGIFGKWHLGDNY-----------------------PMRPMD 147 Query: 220 RGFDYFMGFHAAGTAY----------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 +GFD + G Y P+LF N + V +GY +D D AI + Sbjct: 148 QGFDESLIHRGGGIGQPSDPIGAEGKYTDPTLFHNGDEVAMEGYCTDIFFDAAIDFARKQ 207 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQ-----------------KQFNTGSQTADNYY 312 +PF Y+A NAPH P D+ + Y+ K+ + Sbjct: 208 TESGKPFFTYIATNAPHGPFDDVPNELYEEYKQVDFTPILVSDLPAKRRDAEFDKLARIS 267 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 A + ++DQ V ++ L + +NTI+L+ +DNG G +G K+Q GG Sbjct: 268 AMITNIDQNVGKLFASLDELKIRENTIVLYLNDNGP---NSRRYVGNMRGNKTQVDDGGI 324 Query: 373 HTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 +P+ W K+ D +++ +D PT LDA ++ + LDG S LP L Sbjct: 325 RSPLLFHWPAKVDASDTTDVMLAHIDLMPTLLDACGVAASESPALDGKSFLPLLTG---- 380 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 E + W+ + + P + + + + + Sbjct: 381 -----------------EMDYSQWETRLIAFQTHRGNVPQKFH-------HFAMHEHPWK 416 Query: 492 LVY--------TVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 LV+ +L LY L D +Q+++LA +P++V+ ++ ++ D Sbjct: 417 LVHPSGFGKERFEGEPKLELYNLEDDPKQQNDLADKHPEIVQRLKQAYSKWFDDV 471 >UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG54_9BACT Length = 469 Score = 394 bits (1012), Expect = e-108, Method: Composition-based stats. Identities = 119/481 (24%), Positives = 194/481 (40%), Gaps = 74/481 (15%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 PN++V+ DD G+ G+ D T + + Sbjct: 25 AAPPNVVVIYFDDTGWKDFGCFGGAVD---------------------------TTHIDN 57 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPELFQ 172 L G+RFT Y PSRA ++TGR P R G+YS + + +P +E + E + Sbjct: 58 LAKNGMRFTEYYAPAPNCSPSRAGLLTGRFPFRLGMYSYRSKNTPMHLPDSEITIAEALK 117 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY T GKWHL + P P +GFDY++ Sbjct: 118 TKGYATGMFGKWHLGNLDGKSHPT--------------------PSEQGFDYWLACDNNL 157 Query: 233 TAYYNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN 291 + N SL +N + V G+ + + DEA + + + PF Y+A++ H P D Sbjct: 158 IKH-NPKSLIRNGKPVGKIAGWAAQVVADEANEWMKKQTS---PFFAYIAFSETHSPLDA 213 Query: 292 PAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID 351 P + ++ Y D V IL+ L G DNT++ SDNG + Sbjct: 214 PEELITKYIERGENKKRATYRGMTEYSDAAVGSILKTLDDMGVSDNTLVFLASDNGPTSE 273 Query: 352 GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISI 410 +G KS T+ GG P + W GK++PG+ Y+ + +D PT D + Sbjct: 274 DSCE---GLRGKKSYTWEGGIRVPAIIRWPGKVKPGSEYNDPVGGIDLLPTLCDIVGAEL 330 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 PK +DGVS+ L+ K P K T I S+ + +Y Sbjct: 331 PK-RHIDGVSIRSVLEGK----PFKRNTPILSFFYRTSPAASMRMGDYVLI--------- 376 Query: 471 HNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGV 529 + ++ + S+++ D +V + + LY + DL Q+ N+AA P+ + E++ + Sbjct: 377 -GHSDDEDRKKSHSMSAEDMPIVKSSKLVSFELYNIKNDLGQEKNIAATYPEKLAELRKI 435 Query: 530 V 530 + Sbjct: 436 M 436 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 394 bits (1012), Expect = e-108, Method: Composition-based stats. Identities = 117/510 (22%), Positives = 200/510 (39%), Gaps = 126/510 (24%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K PN+I++ DD+GYG + G + + TP L + Sbjct: 16 KQPPNVILIVADDMGYGDV----GCYGNQ----------------------HIKTPNLDA 49 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-----GIPLTETFLPE 169 L +G RFT+ + + P+RAA++TG R G++ Q + L E E Sbjct: 50 LAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIPKDQRYAMAKAMSLEEITFAE 109 Query: 170 LFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH 229 ++ GY TA VGKWHL + P +GFD + G Sbjct: 110 ALKSVGYSTALVGKWHLGD-----------------------RPAFLPPRQGFDEYFGIP 146 Query: 230 AAGTAY-----YNSPSLFKNRERV---PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 + + + L + E V P +++ T+EA+ + + K D+PF+LY+ Sbjct: 147 YSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAVKFISKNK--DRPFLLYMP 204 Query: 282 YNAPHLPNDNPAPDQYQKQFN----------TGSQTADNYYASVYSVDQGVKRILEQLKK 331 + PH P +++ K+F+ Y A++ +D V I++ ++ Sbjct: 205 HPMPHQPVH--VSERFAKRFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSVGEIIKAVRA 262 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-D 390 G ++T + FTSDNG I G +G K + + GG P +W+ K++PG D Sbjct: 263 LGIEESTFVAFTSDNGPAI----GSAGPLRGKKRELWEGGHRVPFIAYWQEKIRPGVVID 318 Query: 391 KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 ++ +MD +PT +P+ K+DGV+LLP L + + + + W Sbjct: 319 EIAMSMDLFPTMAAMGRAPLPR-KKIDGVNLLPLLCEGDKLS-ERTVFWR---------- 366 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ----LGLYKL 506 S+ R + L+ + +GLY L Sbjct: 367 ----------------------------SKGKKAARKGPWKLLMQPTKKKRPTSIGLYHL 398 Query: 507 -TDLQQKDNLAAANPQVVKEMQGVVREFID 535 DL ++ NLA P+ +K +Q + Sbjct: 399 NNDLSEQHNLAEIYPEKLKSLQLEFAAWEK 428 >UniRef50_A6DUI7 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DUI7_9BACT Length = 516 Score = 393 bits (1010), Expect = e-107, Method: Composition-based stats. Identities = 127/555 (22%), Positives = 210/555 (37%), Gaps = 130/555 (23%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + + K N+I + DDLG+ + F+ F T Sbjct: 23 AKEAQHEKLNVIFMIADDLGWMDVGFNGNKF--------------------------VET 56 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD-------------- 155 P L L EG+ FTNGY + + P+RAA TG++PA G+ Sbjct: 57 PNLDKLASEGMVFTNGYASGPLCSPTRAAFHTGKSPATMGINVPVTKGLKGKTPGAYPMG 116 Query: 156 ---------------------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 GI E + + Q+ Y TA++GKWH+ + P Sbjct: 117 GDKLKTKVGQRDIRHRLLPAYTNTGIDPQEVTIADCLQSADYVTASIGKWHMGLSHSDPK 176 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM-GFHAAGTAYYNSPSLFKNRERVPAKGY 253 + P+ G+D + G G + SP + + P + Sbjct: 177 AD--------------------PREYGYDINIAGGDYHGPPSWFSPYRIHSLKNGPKGEH 216 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT----GSQTAD 309 ++++LT EAI ++ K D+PF LYL Y H P+ A ++Y K+F+ S+ Sbjct: 217 LTERLTREAINFMEENK--DKPFFLYLPYYQVHSPHG--AREEYIKKFDHKQTPDSKMNS 272 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA-----------VIDGPLPLNG 358 Y A V +D+ V I + LKK+G NT+++F+SDNG + L Sbjct: 273 IYAAMVMHLDESVGLINDYLKKSGLDKNTLLIFSSDNGPLVYQRAGNQVLPRNTRLTFAE 332 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLD 417 +G+K Y GT P G + + + I D Y T + +++P++ K++ Sbjct: 333 PLRGWKGSVYEAGTRVPYIFKLPGVIPANSISQTPIITHDLYATICEFTGVAVPEEQKVE 392 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G SL P L + + +L W W +I + D Sbjct: 393 GESLFPLLT-QSKALQRTSLFWHNPKYSWSLNSDILWAD--------------------- 430 Query: 478 LSQFSYTVRNNDYSLVYTVENN---QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 + + +R Y L+Y E L LY L D + NL + P+ E++ + + Sbjct: 431 --RPACAIRKGKYKLIYYFERKGERTLELYDLDNDQGETKNLVSDLPEKALELETELLAW 488 Query: 534 IDSSQPPLSEVNQEK 548 +D +Q N + Sbjct: 489 LDQTQAWKPIDNPDY 503 >UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=Q482D6_COLP3 Length = 492 Score = 393 bits (1010), Expect = e-107, Method: Composition-based stats. Identities = 138/542 (25%), Positives = 206/542 (38%), Gaps = 127/542 (23%) Query: 43 AFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 S T + KPN+++L +DD G L +F Sbjct: 16 TCSQAVATPDKSTSKPNVVMLLVDDFGRQDLSTYGSNF---------------------- 53 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQDGIP 161 TP + L +G++F N Y AH PSR AI +G P R+GV + +P Sbjct: 54 ----YETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGVPQGERVGKHHLP 109 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 L+ E + GY T +GKWHL K E P +G Sbjct: 110 LSAVTFGEHLKEAGYQTGYIGKWHLGK------------------------EGGDPTKQG 145 Query: 222 FDYFM--GFHAAGTAYYNSPSLF----KNRERVPAKG----YISDQLTDEAIGVVDRAKT 271 FD + G A +YY + KN+ +G Y++D+LTDEA+ +++ K Sbjct: 146 FDSSIMAGHWGAPPSYYFPYTKMSKSGKNKGFAKVEGSEEEYLTDRLTDEALTFIEQKK- 204 Query: 272 LDQPFMLYLAYNAPHLPND---------------------NPAPDQYQKQ----FNTGSQ 306 DQPF+L LA+ A H P + P D + ++ Q Sbjct: 205 -DQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGYHKTIQ 263 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-------PLNGA 359 +Y A V SVD V RI +QLK+ G DNTII+ TSD+G + L N Sbjct: 264 NNPDYAAMVESVDISVGRIEQQLKRLGLEDNTIIILTSDHGGLSSRGLKSNRVLATSNNP 323 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDG 418 + K Y GGT P+ + W K++ G+ ++ ++ D YPT L A +S+ DG Sbjct: 324 YRHGKGWIYDGGTRVPLIVKWPEKVKAGSISQVQVTGTDHYPTILQMAGLSLSPKDHQDG 383 Query: 419 VSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL 478 VS L L P K + W P ++ Sbjct: 384 VSYLAALN--SDETPRKAMFW----------------------------HSPAARPSKTG 413 Query: 479 SQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 S + ++ L+ ++ LY L D + +NLA P+ EM + + D Sbjct: 414 DTNSSAIIEGEWKLLDFWSTGKVELYNLKDDKSEANNLAKLMPEKTAEMLAKLTNWKDDI 473 Query: 538 QP 539 Sbjct: 474 DA 475 >UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UM38_RHOBA Length = 667 Score = 393 bits (1009), Expect = e-107, Method: Composition-based stats. Identities = 118/505 (23%), Positives = 189/505 (37%), Gaps = 114/505 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + +PN++V+ DD G+G L TP Sbjct: 86 SRPSGSRPNVLVVLTDDQGWGDLSLHG--------------------------NPNLQTP 119 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 + SL +GV+ N YV V P+RA +TGR R GV+S + + L+E + + Sbjct: 120 HIDSLARDGVQIKNFYV-CAVCSPTRAEFLTGRYHTRSGVFSTSAGGERFDLSERTIGDA 178 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 FQ GY TAA GKWH + + P RGFD F GF + Sbjct: 179 FQAAGYRTAAFGKWHSGMQA-----------------------PYHPNARGFDEFYGFCS 215 Query: 231 AGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND 290 Y SP L N E V G+I D LT AI ++R + PF +YL N PH P Sbjct: 216 GHWGNYFSPMLELNGEIVKGDGFIVDDLTQHAIDFMER--DRENPFFIYLPLNTPHSPMQ 273 Query: 291 NPAPD-------------QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 P D + + Q A ++D V ++L+ L++ +N Sbjct: 274 VPDEDWQNFEGKEIVPDPRPENAKKEDVQHTRAALALCENIDDNVGQLLDALERLSLSEN 333 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAM 396 TI++F DNG NG +G K + GG +P + + K+ G + A+ Sbjct: 334 TIVVFFCDNGP---NGSRFNGGLRGRKGAVHEGGLRSPCLIRYPSKIPAGQTVGGIAGAI 390 Query: 397 DFYPTALDAADISIPKDL-KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 D +PT D D+ + LDG+SL+ L++ K + + Sbjct: 391 DLFPTLADLCDVEVGATAGPLDGISLIDGLREPKSKPSERLIF----------------- 433 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDN 514 ++VR+N Y + L+ + D + + Sbjct: 434 ---------------------TAWSGKFSVRSNRYRYHANGD-----LFDIVADPGETGS 467 Query: 515 LAAANPQVVKEMQGVVREFIDSSQP 539 +A P ++ + +++ ++P Sbjct: 468 VAEDQPVATARLKKALEDWVKETKP 492 >UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM Length = 479 Score = 393 bits (1009), Expect = e-107, Method: Composition-based stats. Identities = 125/526 (23%), Positives = 205/526 (38%), Gaps = 107/526 (20%) Query: 43 AFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 A++ P+ S PN+I++ DD+GYG + A Sbjct: 27 AYAVANPSHQS----PNVIIIFADDMGYGDIG--------------------------AY 56 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN------TDA 156 +P L + EG+++TN Y A V PSRA ++TGR P R G+ + + Sbjct: 57 GHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQIRVLFPTS 116 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 G+P TE + + + Y TA VGKWHL +P + D + Sbjct: 117 TGGLPTTEITIAKALKEKDYRTALVGKWHLGH-----LPGFQPLDHGFDEYFGIPYSNDH 171 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPSLFKNR---ERVPAKGYISDQLTDEAIGVVDRAKTLD 273 + Y A +N P L +NR ER + I+ + T EA+ + + + Sbjct: 172 DLKKELSYIQTITHAKDGDFNVP-LMQNRSIIERPANQNTITKRYTQEAVSFIKKNS--N 228 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 QPF LYLA++ PH+P A DQ++ S Y + +D V ++L L + G Sbjct: 229 QPFFLYLAHSMPHVPL--FASDQFRG-----SSDRGLYGDVIEEIDWSVGQVLSTLSEQG 281 Query: 334 QYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK 391 +NT+++FTSDNG + G K K +Y GG P WW K++P Sbjct: 282 ISENTLVVFTSDNGPWLIMGAHGGSAGLLKSGKGTSYEGGMREPAIFWWPEKIKPAVAHN 341 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 S +D +PT + A I +P D DG L P + ++K E + Sbjct: 342 TASTLDLFPTIMSIAGIDMPSDRSYDGYDLSPTMFEQKSNERKNIFYYH----------- 390 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL-------VYTVENN----- 499 + VR D+ + +YT E Sbjct: 391 ---------------------------GDKIFAVRQGDWKVHFKTVANIYTKEQKILTHT 423 Query: 500 QLGLYK-LTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 ++ L D ++ ++ A NP ++ ++ + S +P +++ Sbjct: 424 PPQVFNLLVDPSERFDVGAVNPAIIASAAKLIEQHQLSVKPVENQL 469 >UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomyces sp. C RepID=UPI0001B577E1 Length = 746 Score = 393 bits (1009), Expect = e-107, Method: Composition-based stats. Identities = 135/548 (24%), Positives = 206/548 (37%), Gaps = 105/548 (19%) Query: 25 FAAHAADDVKLKATKTNVAFSDFTPTEYSTKGK-------PNIIVLTMDDLGYGQLPFDK 77 AA A T T + P + K PNI+V+ DDLGYG+L Sbjct: 8 LAASTATLGLTAVTATTAGSAQAVPAVTVPETKDGSGTRLPNIVVVLADDLGYGEL---- 63 Query: 78 GSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRA 137 GS+ K STP L L EG+RFT+ Y V PSR Sbjct: 64 GSYGQKL----------------------ISTPRLDRLATEGLRFTDAYSTAAVCAPSRC 101 Query: 138 AIMTGRAPARFGVYSNT--DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVP 195 +++TG V +N Q + T+T ++ + GY TA +GKW + Sbjct: 102 SLLTGLHTGHSTVRANPSSGGQGSLTATDTTFAQVLRARGYRTAVIGKWGFGPEA----- 156 Query: 196 EDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKN--RERVPAKG 252 + ++ P RGF+ F G+ A+ Y L+ N +E +PA Sbjct: 157 ---------------AGQDSHPAARGFEEFYGYIDHSHAHQYYPEYLWHNAVKEPIPANA 201 Query: 253 ------YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ 306 Y L A+ +D +PF+L L N PH P+D P Y + + + Sbjct: 202 GGAKAVYAPHLLEQHALEFIDTHAA--EPFLLLLTPNVPHAPSDIPDSSAYADR--SWTA 257 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQK 361 + A V D V +++++L+ G +T++L TSDNG +G NG + Sbjct: 258 ANKGHAAQVSYFDSLVGKVVDRLRSLGLEQDTVVLVTSDNGPHEEGGVNPDLFDANGPLR 317 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 GYK Y GG P+ W G++Q G ++ D PT + P D +DG+S Sbjct: 318 GYKRNLYEGGVRVPLIAWGPGRVQQGTSNRPTPLTDVLPTLAELGGAPAPTD--VDGLSA 375 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 P L H +L W N + + + Sbjct: 376 APLLAGSPDSARHGHLYWFRDELGVTSRANA--------------------QDGKRATWL 415 Query: 482 SYTVRNNDYSLVYTVENN---------QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 + VR ++ V Q+ LY L TDL + ++ A NP E+ ++R Sbjct: 416 AEAVRRENWKAVRFAPERDHNLPDDKWQVELYDLATDLGETRDVLAKNPSKAAELVALMR 475 Query: 532 EFIDSSQP 539 + P Sbjct: 476 SSWKDTYP 483 >UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z632_9GAMM Length = 545 Score = 393 bits (1009), Expect = e-107, Method: Composition-based stats. Identities = 132/524 (25%), Positives = 205/524 (39%), Gaps = 123/524 (23%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + KPNI+++ DDLG+ + + G TP Sbjct: 26 ASAQSQKPNILIMVADDLGWADVGYHGG---------------------------DIDTP 58 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFL 167 +L L +GVR Y + P+RAA+MTGR P R GV G+ E F+ Sbjct: 59 SLDRLAQQGVRLNRFYTT-PICSPTRAALMTGRDPIRLGVTYGVIFPWDNIGVHPDEHFM 117 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 PE FQ GY TA +GKWHL + P NRGF++F G Sbjct: 118 PETFQAAGYQTAIIGKWHLGHAQMT----------------------YHPNNRGFEHFYG 155 Query: 228 FHAAGTAYYNS------PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 +Y +N + +GY + L DE + + D+PF++Y+ Sbjct: 156 HLHTEVGFYPPFSNQGGKDFQRNGVSIDDQGYETYLLADEVSRYIRE-RDRDRPFLVYMP 214 Query: 282 YNAPHLPNDNPAP--DQYQK--------------------QFNTGSQTADNYYASVYSVD 319 + APH P D P D+Y+ + Y A V ++D Sbjct: 215 FIAPHTPLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVMLQPSARPMYAAVVDAMD 274 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG-PLPLNGAQKGYKSQTYPGGTHTPMFM 378 Q + R+L+ L + G DNTI+LF SDNG N +G K +T+ GG M Sbjct: 275 QAIGRVLDTLDQEGISDNTIVLFFSDNGGAAYSYGGANNAPLRGGKGETFEGGIRVTSLM 334 Query: 379 WWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 W L+PG ++++S MD +PT +DAAD+ + LDG S+ L+ Q L Sbjct: 335 RWPAMLEPGQIFEQIMSVMDVFPTLVDAADVRPGNNFALDGRSMWTALKSGDQVPLEGPL 394 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 + + + F++ N ++ LV V+ Sbjct: 395 IFGSEIPIY--------------------------------GNFNFAAFNEEWKLVQEVQ 422 Query: 498 NNQLG------LYKL-TDLQQKDNLAAANPQVVKEMQGVVREFI 534 Q+ L+K+ +D + +NLAA P +V+ + + + Sbjct: 423 QEQIAITVTNYLFKISSDPYEHNNLAAVYPDIVENLSKAILNWR 466 >UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J5W3_9FLAO Length = 468 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 138/545 (25%), Positives = 205/545 (37%), Gaps = 124/545 (22%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 + + +A E KPNI+ + DD+GY +L G Sbjct: 2 NTKNILTFAILIATFGIQAQETKNTKKPNIVFILADDMGYNELGSYGGKI---------- 51 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 TP + L EG++F+N Y + PSR +MTG+ + Sbjct: 52 ----------------IETPNIDQLAKEGMKFSNHYCGSNICAPSRGTLMTGKHTGHAYI 95 Query: 151 YSN----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 N + + IP +E + E+ + GY T A GKW L Sbjct: 96 RDNKPLPYEGNEPIPASEITVAEILKTAGYTTGAFGKWGLG------------------- 136 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERV--------PAKGYISDQL 258 + A E P N+GFD F G++ A+ S + + V P Y +D + Sbjct: 137 ---YPASEGSPNNQGFDQFYGYNGQIHAHNYFTSYLRKNDLVELNANIDAPYSVYSADII 193 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP---DQYQKQFNTGSQTAD------ 309 D A+ V+ K + PF LY PH P P + Y K+ A Sbjct: 194 KDRALEFVEVNK--NNPFFLYFCPTLPHNPYHQPDDKTLEYYAKKTGFPIGDAHSEEFSV 251 Query: 310 -NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI----DGPLPLNGAQKGYK 364 Y A +DQ V I+ +LK+ DNT+I+F SDNG+ + D L G +G K Sbjct: 252 PKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSALTKEEDSYLRTGGDLRGRK 311 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLP 423 S+ Y GG +P+ +WKGK+ PG+ ISA DF PT + P + +DG+S LP Sbjct: 312 SEVYEGGIKSPLIAFWKGKIIPGSSSNHISAFWDFLPTCAEIVKAKTPDN--IDGISYLP 369 Query: 424 WLQDKKQGEP-HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 L K + H L W S S Sbjct: 370 TLLGKTDNQKQHDYLYWERSQ--------------------------------------S 391 Query: 483 YTVRNNDYSL--VYTV--ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR-EFIDS 536 +R D VY + + +Y L D +K+NLA P++ E + + ++S Sbjct: 392 QAIRKGDMKANFVYDKTSQKQNIEIYNLAQDPFEKNNLAETMPELKAEFIKIAQTARVES 451 Query: 537 SQPPL 541 PL Sbjct: 452 EIFPL 456 >UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD88_SHEWM Length = 500 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 128/565 (22%), Positives = 200/565 (35%), Gaps = 114/565 (20%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M + V T+ + S+ P +PN+I DDLG G L GS+ Sbjct: 1 MKCRCTTLSVAVLCSIMVTSCSQSNIEP---KVNRQPNVIYFLADDLGVGDL----GSYG 53 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 + TP + L EG+RF+ Y V PSRA++MT Sbjct: 54 QQ----------------------HIRTPNIDKLAAEGMRFSRHYAGSSVCAPSRASLMT 91 Query: 142 GRAPARFGVYSNT-----------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 GR + N Q + L LFQ GY T A GKW L + Sbjct: 92 GRDMGHTDIRGNIQLMDQPDSPEYQGQYPLAQGTITLAHLFQLAGYQTGAFGKWGLGSLQ 151 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF-DYFMGFHAAGTAYYNSPSLFKNRERVP 249 + P+ ++ A + PQ D P L +++ Sbjct: 152 SSGNPKAMGFDQFYGYLDQRHAHNYFPQYLWDGDEVARLDNPAIN--VHPKLDRDKSDHR 209 Query: 250 A---KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT--- 303 K Y ++ A + + + D+ F LY+ + PH P + QF+ Sbjct: 210 EYMGKDYAPYKILARAKEFISQNR--DEAFFLYVPFVVPHAAIQIPDKELDGYQFDETAH 267 Query: 304 ----------GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 + A + +D+ V I+ LK+ G DNT++LF+SDNGA G Sbjct: 268 RLGEPRAYTPHPKPRAARAAMISRMDRDVGDIMAMLKELGLDDNTLVLFSSDNGATAAGG 327 Query: 354 LPLN-----GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAAD 407 +N +G K+ Y GG P+ W G + G+ +SA D PT D Sbjct: 328 SDINFFNSTAGARGEKATLYEGGIRAPLIARWPGNISAGSESDHLSAFWDMLPTFAQLLD 387 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 +S+P+ + G+S+LP L K Q + H++L W Sbjct: 388 LSVPEG--IQGISMLPTLLGKPQNQQHESLYWEFF------------------------- 420 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTV---------ENNQLGLYKL-TDLQQKDNLAA 517 S V ++ + E LY L D + NLAA Sbjct: 421 ----------SRNPSQAVVMGNWKAIRHYSKERGKGALELGATALYNLQEDPSESQNLAA 470 Query: 518 ANPQVVKEMQGVVREFIDSSQPPLS 542 +P++VK+ + ++ + S P + Sbjct: 471 KHPELVKKAEMIMAQRQRSPHLPWN 495 >UniRef50_A6CA27 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CA27_9PLAN Length = 491 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 111/532 (20%), Positives = 187/532 (35%), Gaps = 75/532 (14%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 + L +PNII+ DD G+G+ F Sbjct: 8 FIPLLLACCLTGSGSLLHAAEQQSTRPNIILCMTDDQGWGETGF---------------- 51 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 + TP L + +R Y A V P+R + +TGR P RF + Sbjct: 52 ----------MGHPILKTPHLDEMAASSLRLDRFYAAAPVCSPTRGSFLTGRHPNRFACF 101 Query: 152 SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 S + E + E ++ GY T GKWHL + S Sbjct: 102 SWGHT---LRPQEVTVAEAVKSVGYTTGHFGKWHLGSVQ--------------------S 138 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 P N GFD ++ ++ Y N P + N KG S D A+ + +A Sbjct: 139 NSPVSPGNSGFDEWV---SSPNFYENDPYMSHNGVVKQLKGESSRVTVDAALDFIKQADK 195 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 +PF+ + + PH P++ A + + + NY+ + VD+ + + QL+ Sbjct: 196 DKKPFLAVIWFGNPHTPHE--AVSELKDLYPDQKPNFQNYFGEISGVDRAMGHLRSQLRD 253 Query: 332 NGQYDNTIILFTSDNGA------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL- 384 G +NT++ FTSDNG + G G+K + GG P + W + Sbjct: 254 LGLAENTLLWFTSDNGPRPPQFKTEEARSQATGGLAGFKGNLWEGGVRVPSLIEWPAVIK 313 Query: 385 QPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 +P + +D YPT L + +LDGVSLLP ++ + W Sbjct: 314 KPEVSNVPCGTIDIYPTVLAMTGAKVSHQPQLDGVSLLPLIEGQMTARGRPMGFWTYPEK 373 Query: 445 HWFDE----------ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + P N +D + + + ++ L+ Sbjct: 374 GHPKRSTDILLALQKQQSPGHPNPKGPAPDADAGSLKTQYPKDKLPGAAALVDGNFKLLK 433 Query: 495 ---TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 + LY L D +K +L+ +PQ +K+M+ + ++ S L+ Sbjct: 434 METKRGKPRYTLYDLVKDPAEKQDLSQVDPQRLKKMKAALADWQHSVVDSLN 485 >UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ15_9BACT Length = 469 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 125/541 (23%), Positives = 201/541 (37%), Gaps = 135/541 (24%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 KPNII L +DDLGYG L + STP Sbjct: 14 SAIANEKPNIIYLLVDDLGYGDLSLYG--------------------------QKKFSTP 47 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-------DAQDGIPLT 163 + + EG+ FT+ Y V PSRAA+MTG+ V N + + Sbjct: 48 NIDRIGKEGMVFTDHYSGSTVCAPSRAALMTGKHSGHGLVRGNYEVGPHGFGGELPLRPE 107 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 + L E+ ++ GY T +GKW + P+ +GFD Sbjct: 108 DVSLAEVMKSAGYATGLIGKWGMGMDGTTGE----------------------PRKKGFD 145 Query: 224 YFMGFHAAGTAYYNSPSL-FKNRERVPAKG--------YISDQLTDEAIGVVDRAKTLDQ 274 Y GF A++ P ++N E++ YISD ++ I V+ K D+ Sbjct: 146 YSYGFLNQAHAHHYYPEYIYENGEKLMIPENKDDARGLYISDTFAEKGIEFVEENK--DK 203 Query: 275 PFMLYLAYNAPHLPNDNPAP------------------------DQYQKQFNTGSQTADN 310 PF L+ A+ PH P D + + Sbjct: 204 PFFLFWAFVTPHAELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFGVYASQDHPRAA 263 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKS 365 + + +D+ V + ++L++ G DNTII+F+SDNG +G N GYK Sbjct: 264 FSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNGPHKEGGADPDFFDSNAELTGYKR 323 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPW 424 GG P + W ++ + SA D PT + A+ P+D +DG+S LP Sbjct: 324 DLTEGGIRVPFMVRWPNVVKARSKSSHASAFWDVMPTIAEIANTDSPED--IDGLSFLPA 381 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L+ +KQ + HK+L W + ++ Sbjct: 382 LKGEKQ-QVHKHLYWEFHERGYTEQ----------------------------------A 406 Query: 485 VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR-EFIDSSQPPLS 542 +R ++ + N+ + LY L +D ++++++A P K + ++ E DS PL Sbjct: 407 LRMGNWKAIRHGVNSPIKLYDLISDESEQNDVSAKYPATAKHITNILDTERTDSELWPLK 466 Query: 543 E 543 E Sbjct: 467 E 467 >UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWW9_RHOBA Length = 622 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 116/535 (21%), Positives = 198/535 (37%), Gaps = 106/535 (19%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + + T + + P+ ++ PN+I++ DD GYG F+ + Sbjct: 13 LCTISIAFAITTLFIATPRPSGAAS---PNVILVMTDDQGYGDFSFNGNPY--------- 60 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 TP L L E V+ T+ +VA + P+R +M+G R Sbjct: 61 -----------------IQTPALDRLASESVQLTDFHVA-PMCTPTRGQLMSGLDAFRNS 102 Query: 150 VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + + + + + ++FQ+ GY T GKWHL Sbjct: 103 AINVSSGRTLLRHDLKTMADVFQDAGYRTGIFGKWHLGDNY------------------- 143 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTA--------YYNSPSLFKNRERVPAKGYISDQLTDE 261 ++P++RGFD + F ++ Y + +N +RV GY +D DE Sbjct: 144 ----PFRPEDRGFDETLWFPSSHINSVPDFWDNDYFDDTYIRNGKRVAHSGYCTDVFFDE 199 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-------SQTADN---- 310 AI + D PF ++ N+ H P PDQY+ + T + D Sbjct: 200 AIEWAKQTSPTDSPFFAFIPLNSAHWPW--FVPDQYRARVRTMLGDTTELKRQLDTTPSN 257 Query: 311 ------YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK 364 + A ++D V + + L ++G +NTI++F +DNG+ G N +G K Sbjct: 258 LEDLISFLAMGLNIDDNVGTLTQYLDESGLSENTIVVFLTDNGSTF-GDHYFNAGMRGKK 316 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 +Q + GG P + W ++ D L D PT AD LDG SL P Sbjct: 317 TQLWEGGHRVPCLIRWPEQITAQKIDDLTHVQDLLPTLAALADCDEHLPGPLDGTSLAPR 376 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L + + L + + +F + P P + Sbjct: 377 LLGETDSLADRML--------------VINYSRMPQFKVTYTKGNPAIP-----RRNGAA 417 Query: 485 VRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 V N + L+ LY + D Q N+A +P++V +M+ + + D + Sbjct: 418 VMWNKWRLLENKR-----LYNVEQDPHQDHNVAQDHPEIVAKMRAHLATWWDGVK 467 >UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3HWU7_9SPHI Length = 472 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. Identities = 129/511 (25%), Positives = 197/511 (38%), Gaps = 109/511 (21%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 N++ + S K N++++ DDLGYG L F + Sbjct: 18 NLSAQSKPSPQLSPKKHYNLVLIVADDLGYGDLGFTGST--------------------- 56 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA---- 156 Q TP L L GV FT GYV+ V PSRA +TG FG +N Sbjct: 57 -----QIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQVEFGHDNNLAGVEPG 111 Query: 157 ----QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 +G+PL++ + + GY +GKWHL K Sbjct: 112 FDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLGK-----------------------E 148 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNS--------PSLFKNRERVPAKGYISDQLTDEAIG 264 ++ P RGFD F G+ G Y+ S L N + YI+D + +E++ Sbjct: 149 PQFHPLKRGFDEFWGYTGGGHDYFESLPNGKGYKEPLESNFKTPDPITYITDDVGNESVD 208 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 ++R K D+PF L+ A+NAPH P D Q + + Y A V+ +D V + Sbjct: 209 FIERHK--DEPFFLFAAFNAPHTPMQALEEDLALYQ-HIEDKKRRTYAAMVHRLDLNVGK 265 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 I+ L++ G +NT+++F SDNG D LN +G K GG H P M G L Sbjct: 266 IMTSLEEQGLSENTLVVFFSDNGGPTDSNASLNAPYRGQKGILLEGGIHVPFVMNLPGLL 325 Query: 385 QPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G Y + ++++D PT L A + GV L+P L K + +TW + Sbjct: 326 PEGLIYQEQVTSLDVVPTFLALAGDTETSMDMFSGVDLIPHLTGKTPPLADREMTWKFTI 385 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 S +R D+ LV +V + L Sbjct: 386 SR--------------------------------------AIREGDWKLV-SVPDRMPML 406 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREF 533 Y L D ++++LA + + + + Sbjct: 407 YNLAEDPSEQNDLALKHMDKTTYLLKKLGTW 437 >UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D3U0_9BACT Length = 467 Score = 391 bits (1005), Expect = e-107, Method: Composition-based stats. Identities = 117/524 (22%), Positives = 192/524 (36%), Gaps = 122/524 (23%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + + + +PN I + DDLG+G + F G+ Sbjct: 23 SLFGLVVSSLAADTAPLRPNFIFILADDLGWGDVGFHHGN-------------------- 62 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 TP L L EG+ YV + V P+R A ++GR +RF V + + + Sbjct: 63 -------VPTPNLDHLAGEGLELMQHYV-YPVCSPTRCAFLSGRYASRFSVTTPQNPRA- 113 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 L ++ GY TA GKWHL S EW PQ Sbjct: 114 FRWDTVTLARALKSVGYDTALCGKWHLG-----------------------SKPEWGPQK 150 Query: 220 RGFDYFMGFHAAGTAYYNSPS--------LFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 GFD+ G A G ++ ++ + + +G+++D +T EA+ ++ Sbjct: 151 FGFDHSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIEEQGHVTDLITKEAVEWLE--SR 208 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 D+PF LY+ + A H+P P + + + +Y A+V +D V +IL L+K Sbjct: 209 TDKPFFLYVPFTAVHIPIREPDEILQRVPASITKPSLRHYGANVMHLDDSVGKILVALEK 268 Query: 332 NGQYDNTIILFTSDNGA----------------VIDGPLPLNGAQKGYKSQTYPGGTHTP 375 G+ NT+++F SDNGA N G K + Y GG HT Sbjct: 269 TGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYPPGPAGGSNEPLHGMKGEVYEGGIHTA 328 Query: 376 MFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 W G+L+PG + L D+ PT A KDLK DG ++ P L + +P Sbjct: 329 AVARWPGQLKPGKFLGLAHITDWMPTFCALAGYKPEKDLKWDGQNIWPQLTGAEPVKPR- 387 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 S +R+ D+ LV + Sbjct: 388 ------------------------------------TIYVAGPGFRSKALRDGDWKLVLS 411 Query: 496 VENN------QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 ++ L+ + D + ++A P +V ++ + + Sbjct: 412 QTKGSKNSPPKVELFNIGADPTEHTDVAGQFPDIVGRLRIKLEQ 455 >UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1Z7_PEDHD Length = 480 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 117/522 (22%), Positives = 205/522 (39%), Gaps = 78/522 (14%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + + +PN+I++ MDD+GYG + T Sbjct: 19 AQTTKTQRPNVIIINMDDMGYGD--------------------------TEPYGMTGIPT 52 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFL 167 P EG+RFT+ A + PSRAA++TG P R G+ + D++ + E + Sbjct: 53 PNFNKAAKEGMRFTHFNAAQAICSPSRAALLTGCYPNRIGLRGALSPDSKIALDTAEETI 112 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 L + GY TA +GKWHL + H F +F + DY Sbjct: 113 ASLLKKAGYKTAMLGKWHLGSKA--------PNLPLHYGFDSFYGLPYSNDMWPVDYEGK 164 Query: 228 FHA--AGTAYYNSPSLFKNRERV------PAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 A AG Y L + + ++ T +A+ ++ K+ PF LY Sbjct: 165 PQAAVAGKKSYPELPLLDGDKPADYVRTPDDQAMLTGTFTRKAVRFIENNKSA--PFFLY 222 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA+ PH+P A G + + +D V I++ L +N NTI Sbjct: 223 LAHPMPHVPLAASAA-------FRGKSELGLFGDVIMELDWSVGEIMKSLDRNKIASNTI 275 Query: 340 ILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAM 396 ++ SDNG + +G +G K + GGT P + W GK++ G+ + LI+ M Sbjct: 276 LIIMSDNGPWLRFGNHAGSSGGFRGGKMTIWDGGTRVPCIIRWPGKVEAGSVNSNLITNM 335 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D PT L + + P+ K+DG+S L + P + Y + + + Sbjct: 336 DILPTLLQLSHAAPPE-KKIDGISFADLLLGRSDKAPRQVFY----YYYNENSLKAVRYK 390 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNL 515 N+ + H S Y + + +D + T ++ LY L D + ++ Sbjct: 391 NWKLVLPHTSVSYTSDIHGKDGFPGAAT-----------RAEVKMALYDLAHDPGEAYDV 439 Query: 516 AAANPQVVKEMQGVVREFIDSSQPPLS-EVNQEKFNNIKKAL 556 P++V++M F++ ++ + ++ K N+++ Sbjct: 440 QQQYPELVQKM----LVFVEEARADMGDDLTGRKGKNLRQPA 477 >UniRef50_A6DNI9 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNI9_9BACT Length = 500 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 116/580 (20%), Positives = 199/580 (34%), Gaps = 141/580 (24%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + T + + PNI+ + DDLG+ +F Sbjct: 1 MKLILRSFILLFSLSTLNAKEMPPNIVFILADDLGWADPSCYGSTFH------------- 47 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + SL GV+ +N + V P+RA++MTG R G+ Sbjct: 48 -------------ETPHIDSLAKRGVKLSNFHSTSPVCSPARASLMTGLYAERLGMTQPA 94 Query: 155 -----------DAQDGIPLTET--------------FLPELFQNHGYYTAAVGKWHLSKI 189 G P + ++ + GY T GKWHL Sbjct: 95 CHINLVSLKAHTPDKGWPHQKVISPKSTTRLDTVFPTYAKVLKAQGYVTGHYGKWHLGH- 153 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--YNSPSLFKNRER 247 E + P GFD + + Y P + + Sbjct: 154 -----------------------EPYTPLEHGFDVDVPHTKSHGPKGSYFGPKKYSDSFT 190 Query: 248 VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGS 305 + ++ D++ EAI + K D+PF+L + H P D+Y+K+ Sbjct: 191 LKKGEHLEDRMGQEAIEFIKENK--DRPFLLNYWAFSVHSPMFAKLDLLDKYRKKATKLP 248 Query: 306 ----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG--------- 352 Q + + + D V +L+ + + G D TII+ +SDNG I+ Sbjct: 249 TDAQQRNPIFAGMIETFDDNVGLLLKAIDEAGIADRTIIVLSSDNGGTIESAYTHEAYWG 308 Query: 353 ----------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPT 401 P N K K + GGT P + W GK++ G D S +D +PT Sbjct: 309 NGTVEEIVDIPATSNYPLKSGKGTIHDGGTAVPFIVVWPGKIKAGTKSDSYFSGVDVFPT 368 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 ++ A +P + +DGVS +P L ++ W Sbjct: 369 FVEMAGAKMPSGVAIDGVSQVPALITGEEVRDTLYGYW---------------------- 406 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ-----LGLYKL-TDLQQKDNL 515 P+ + S S +R+ DY LV + + L+ + D+ + N+ Sbjct: 407 --------PNYLVERNGSIPSAWIRHGDYKLVSYFFDGKNNKHRYELFDIKNDIGENHNI 458 Query: 516 AAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 AA NP+ V ++ ++++ ++ L ++N K Sbjct: 459 AAQNPERVAKLSAMLKQHFVETEAVLPKLNPNYDPQAKAP 498 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 390 bits (1003), Expect = e-107, Method: Composition-based stats. Identities = 125/537 (23%), Positives = 201/537 (37%), Gaps = 129/537 (24%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLGYG+L G + + + TP + +L Sbjct: 20 PNVIYILADDLGYGEL----GCYGQE----------------------KIKTPHIDALAK 53 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----TDAQDGIPLTETFLPELFQN 173 EG+RFT Y V PSR +++G+ ++ + +N + Q+ IP L ++F++ Sbjct: 54 EGMRFTRHYSGAPVCAPSRGVLLSGQQLSKAYIRNNREHKPEGQEPIPEPGMTLAQIFKD 113 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY T A GKW L + P+ GFD F G++ Sbjct: 114 KGYATGAFGKWGLGYPGSSS----------------------DPKALGFDTFYGYNCQRV 151 Query: 234 AY-YNSPSLFKNRERVP------------------------AKGYISDQLTDEAIGVVDR 268 A+ + P ++ N + + A+ Y D + DEA+ + Sbjct: 152 AHSFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENYAPDLILDEALKFIKD 211 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGS-----------QTADNYYASV 315 K D+PF YL + PHL P D Y K++++ + Y A + Sbjct: 212 NK--DKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAAYLPHLRPRAGYAAMI 269 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQTYPG 370 +D+ V +++ LK+ +NT+++FTSDNGA +G K Y G Sbjct: 270 SDLDEHVGSVMQLLKELDLVENTLVIFTSDNGASHCIEVDHEFFNSTKDLRGLKGSVYEG 329 Query: 371 GTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G PM W GK++ +S +D T D P+ DGVS LP L+ +K Sbjct: 330 GLRVPMIAHWPGKIKKAQVSDHVSGFVDVMATFCDLLQTEAPQTS--DGVSFLPTLKGEK 387 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 Q EP L W + + D K VR Sbjct: 388 Q-EPQPVLAWEFQG---YSGQQAIILDGRWKGVRQNLSP------------------RGK 425 Query: 490 YSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVV---REFIDSSQPPLS 542 + LY L D +K +LA P++V + + R ++ P++ Sbjct: 426 KK---AKSTPKWELYDLNKDPNEKTDLATQMPEIVDRIHKAMMKNRSHSETFNMPMA 479 >UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BZE9_HYPNA Length = 459 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 126/512 (24%), Positives = 198/512 (38%), Gaps = 101/512 (19%) Query: 24 AFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPK 83 AA + N+A S+ P + PNII++ DDLG+G + + Sbjct: 8 WTAAIMLTAACAASPAANIATSETAP---AAAKPPNIIIIMADDLGWGDISLNG------ 58 Query: 84 TMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR 143 AA TP + + EG++ T+ Y V PSRAA++TGR Sbjct: 59 --------------------AALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAALLTGR 98 Query: 144 APARFGVYSNT--DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 P R G+ +QDG+P E + E+ +N GY T VGKWHL Sbjct: 99 YPIRSGMQHVIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLGHQ------------ 146 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ---L 258 EE+ P N+GFD+F G + L++ +E + + S Sbjct: 147 -----------EEYWPTNQGFDWFYGVPYSNDMAPF--DLYRGKEIIESPADQSQLSLNY 193 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 A ++ + D+PF LY A PH+P P + +G+ A Y V +V Sbjct: 194 AKAAKEFIED--SSDKPFFLYYAETFPHIPLFVP-------EDRSGTSDAGLYGDVVETV 244 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 D G+ +L+ L + G D+T+I+FTSDNG +G G +G K +T+ GG P Sbjct: 245 DAGIGIVLDTLDEAGVADDTLIIFTSDNGPWFEGSA---GEFRGRKGETHEGGFRVPFLA 301 Query: 379 WWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 W G + G+ ++ +D PTA + ++P D +DG L L PH L Sbjct: 302 RWPGHIPKGSVSHEMAMNIDLLPTAASLSGATLPADRVIDGKDLTSLLTAGA-PTPHDIL 360 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 + + + F + F R S + + Sbjct: 361 FFFDG-NEIVGARDARFRLVLNTFYRTMSVPFEYF------------------------- 394 Query: 498 NNQLGLYKL-TDLQQKDNLAAANPQVVKEMQG 528 L+ L D Q+ + P + ++ Sbjct: 395 -GTALLFDLEKDPQESFSFMREYPGEAERLKS 425 >UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW5_9SPHI Length = 523 Score = 390 bits (1001), Expect = e-106, Method: Composition-based stats. Identities = 118/546 (21%), Positives = 188/546 (34%), Gaps = 129/546 (23%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 V K +T V+ T S PNII + DDLGY +L G + + Sbjct: 26 VSFKPPRTTVSRDAVPRTAVS----PNIIYIYADDLGYAEL----GCYGQQ--------- 68 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 + TP L L EG+RFT Y + V P+R ++TG+ + Sbjct: 69 -------------KIRTPNLDKLAREGIRFTQHYTSMPVCAPARCMLLTGKHSGHSYIRG 115 Query: 153 NT----------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 N Q + + L Q GY TA VGKW + + P ++ Sbjct: 116 NYEMGGFPDSLEGGQMPLYPGAFTIGRLLQQQGYKTACVGKWGMGMANTTGNPNEQGFDY 175 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA------------ 250 ++ A + P + + N+P + +R P Sbjct: 176 FYGYLDQKQAHNYYPTHL-------WENGKPDKLNNPVIDVHRRLTPETATPEAFAYFRG 228 Query: 251 KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP--APDQYQKQFNTGSQ-- 306 Y D+L +A + + K+ PF LYL + APH+ P A +Y +F G Q Sbjct: 229 NDYAIDKLAQKAQAFIRQNKSG--PFFLYLPFTAPHVSLQAPEAAVKEYIGKFGDGEQRT 286 Query: 307 ---------------TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID 351 Y A + +D + ++++ LK +NT+++F+SDNGA + Sbjct: 287 ERPYLGEQGYASTPYPRATYAAMITHMDAQIGQLMQLLKDLKIDENTLVMFSSDNGATFN 346 Query: 352 GP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISA-MDFYPTALDA 405 G G +G K Y GG PM W G+++P +S D T + Sbjct: 347 GGVEAAYFNSVGKLRGLKMDVYEGGIREPMLARWPGRIKPNQTTDHVSVQYDLLATLAEL 406 Query: 406 ADISIPKDLKLDGVSLLPWLQDKKQGEP-HKNLTWITSYSHWFDEENIPFWDNYHKFVRH 464 P DG+S LP L + + H L W Sbjct: 407 VGYKRP--FATDGISFLPTLLGQSSSQKQHPFLYWEYP---------------------- 442 Query: 465 QSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT----VENNQLGLYKLT-DLQQKDNLAAAN 519 +R ++ V T LY L D+ + N+A + Sbjct: 443 -------------EKGGQLAIRMGNWKAVKTNVRKDRTTPWELYDLNKDVSETTNIADKH 489 Query: 520 PQVVKE 525 P ++++ Sbjct: 490 PDIIRQ 495 >UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LCL3_PARD8 Length = 476 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 125/497 (25%), Positives = 201/497 (40%), Gaps = 99/497 (19%) Query: 59 NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDE 118 NI+++ +DD+GYG F+ A +TP + + E Sbjct: 25 NIVLINLDDVGYGDFSFNG--------------------------AYGYTTPNIDKMAAE 58 Query: 119 GVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPELFQNHGY 176 GVRFT+ V +SG SRA ++TG P R G D+ G+ E + E+ + GY Sbjct: 59 GVRFTHFLVGQPISGASRAGLLTGCYPNRIGFSGAPGPDSNYGVHPEEMTIAEVLKQKGY 118 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG--------- 227 TA GKWHL S +E+ P GFD + G Sbjct: 119 STAIFGKWHLG-----------------------SQKEFLPLQNGFDEYYGLPYSNDMWP 155 Query: 228 FHAAGTAYYNSPSL--FKNRERVPAKGYISD------QLTDEAIGVVDRAKTLDQPFMLY 279 FH +N P L + E + GY +D T ++ + + K ++PF LY Sbjct: 156 FHPQQGEVFNFPDLPTYDGNEII---GYNTDQTRLTTDYTTRSVNFIKKNK--NKPFFLY 210 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA+N PH+P D+++ + Y + +D V I + L++ G DNT+ Sbjct: 211 LAHNMPHVPL--AVSDKFKGK-----SEQGLYGDVMMEIDWSVGEIFKALRELGLEDNTL 263 Query: 340 ILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAM 396 ++ TSDNG + G + K+ T+ GG P M+WKGK PG +KL S + Sbjct: 264 VILTSDNGPWTNYGNHAGSAGGLREAKATTFDGGNRVPCIMYWKGKTLPGTTCNKLASNI 323 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D PT + +P K+DGVS+LP ++ KK P ++ Y + ++ Sbjct: 324 DLLPTFAEITQAPLPP-RKIDGVSILPLIEGKKDANPRESFV----YYYRKNDLEAVTDG 378 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNL 515 + H+ Y D T +E + +Y L D ++ N+ Sbjct: 379 MFKLVFPHKYVTYGAYEPGNDGQPGKLT----------NLEIMKPEMYDLRRDPGERYNV 428 Query: 516 AAANPQVVKEMQGVVRE 532 P+ ++ + + Sbjct: 429 ITQYPEEAAKLMKIADQ 445 >UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7_9SPHI Length = 589 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 128/550 (23%), Positives = 207/550 (37%), Gaps = 128/550 (23%) Query: 41 NVAFSDFTPTEYS-TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + F ++++ + PNII++ DD GYG F Sbjct: 14 TILLLVFCASKFTFAQKPPNIILIITDDQGYGDFGFTG---------------------- 51 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 STPT+ L + FTN YV V P+RA++MTGR R G+ + Sbjct: 52 ----NKHVSTPTIDQLAENSFEFTNFYV-SPVCAPTRASLMTGRYSLRTGIRDTYNGGAM 106 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + E + EL Q Y + GKWHL +P + Sbjct: 107 MSPDEITIAELLQKSDYTSGIFGKWHLGDNY-----------------------PMRPSD 143 Query: 220 RGFDYFMGFHAAGTAY-------------YNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 +GFD + + G Y P L+ N + +GY SD AI + Sbjct: 144 QGFDESLIHLSGGMGQVGDFTTYFQKDRSYFDPVLWHNNRQESYQGYCSDIFASAAIEFI 203 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF--------------------NTGSQ 306 ++ K DQPF YL++NAPH P P++Y +++ ++ + Sbjct: 204 EKNK--DQPFFTYLSFNAPHTPLQ--VPEEYYQKYKNIDTSTGYESDERPFYPMSDSQKE 259 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQ 366 A YA V ++D +K + +LK+ D TII+F +DNG L +G K Sbjct: 260 EARKVYAMVENIDDNLKNLFAKLKELEIEDETIIIFLTDNGPQQQRYL---AGLRGLKGN 316 Query: 367 TYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG TP+ + KL + L + +D PT D I +P D K+DG SLLP L Sbjct: 317 VYQGGIRTPLLIHIPEKLSENRKINTLSAHIDILPTIADLVGIQLPLDRKIDGKSLLPLL 376 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + +++L S+W + + + ++ Sbjct: 377 IGEVDSFENRSLF-----SYWNRKFPEKY--------------------------SNISI 405 Query: 486 RNNDYSLV----YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 +N+++ LV Y LY L D ++ NL + E++ + + Sbjct: 406 QNSEWKLVGKTDYDASIEDFQLYNLKEDPYEQSNLITSKISKGLELKNELDQLYLELISE 465 Query: 541 LSEVNQEKFN 550 + +N K + Sbjct: 466 ENLINPPKIH 475 >UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW2_9BACT Length = 479 Score = 389 bits (1000), Expect = e-106, Method: Composition-based stats. Identities = 116/531 (21%), Positives = 193/531 (36%), Gaps = 123/531 (23%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 + + + +PNI+ + DD+G L + Sbjct: 12 CTIALASLNLLNAAQRPNILFIVADDMGIMDLGVYGSDY--------------------- 50 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA----- 156 TP L L + +RF Y A V P+R AI+TGR P R + Sbjct: 51 -----YLTPNLNKLASQSMRFDRAYAASHVCSPTRGAILTGRYPQRIHLTDALPWDRLYK 105 Query: 157 -QDGIPLTET--------FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 IP + Q + Y TA GKWHL ++ + Sbjct: 106 NPKMIPPNHVKELSLKLPTFARVLQKNDYRTAMFGKWHLGNEERFFTGKEHKA------- 158 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVD 267 GFD G AY KG ++LT+ + + Sbjct: 159 ------------YGFDEAFGVSGKAKAY--------------DKG--VNELTERTLRFLK 190 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNP--APDQYQKQFNTGSQTADNYYASVYSVDQGVKRI 325 K +PFML L ++ PH+P P A Y Q Y + D +K++ Sbjct: 191 ENKK--KPFMLCLMHHVPHVPVACPPYAKALYDSVPKGKHQKNSKYAGMISHFDNSIKKV 248 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 L+ L+ G DNT+++ TSDNG + L N G K Y GGT P+ + W GK+ Sbjct: 249 LDALRALGLDDNTVVIVTSDNGGL--SNLSSNKPYNGGKGSLYEGGTRVPLLIRWPGKIT 306 Query: 386 PGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 PG+ +K ++ + DF+PT L+ A + + + LDG S++P L+ K G+ + L W Sbjct: 307 PGSVNKSVVISNDFFPTFLELAGLPLMPEAHLDGKSMMPLLKGKTLGK--RTLYW----- 359 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLY 504 +PH ++ + D L++ +E++ ++ Sbjct: 360 -----------------------HFPH------RGTPGSSIIDGDLKLIHKIESDTYEMF 390 Query: 505 KLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP----PLSEVNQEKFN 550 L D + +NL P+ +Q ++ + P + + ++ Sbjct: 391 DLNSDPYEANNLFEKQPEQASRLQKMLARHLKEVAAQEMSPNPQWDPKRPK 441 >UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhodopirellula baltica RepID=Q7ULE7_RHOBA Length = 1049 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 131/522 (25%), Positives = 191/522 (36%), Gaps = 98/522 (18%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 D T KPN++V+ DD G+ L E Sbjct: 570 DRTAQAVIPASKPNVVVILTDDQGWADLSCQN-------------------------EVD 604 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET 165 TP + L GVR TN YV PSRA ++TGR R G+ + D +P Sbjct: 605 DIQTPHIDGLAARGVRCTNAYVTAPQCSPSRAGLITGRYQQRLGIDTIPDMP--LPTNAV 662 Query: 166 FLPELFQNHGYYTAAVGKWHLS------KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + E Q GY T VGKWHL +P E + P Sbjct: 663 TIAEHLQPKGYKTGFVGKWHLEPNVTCIDWMRRELPAMAGKPRRKVRIPWNKIEPYSPSQ 722 Query: 220 RGFD-YFMGFHAAGTAYYN--SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 +GFD Y+ G ++ S L + + + + D T+ A+ + R DQPF Sbjct: 723 QGFDEYYWGERTNYRTNFDLTSGELLAEMKPIRDERFRIDVQTNAAVKFIQRNH--DQPF 780 Query: 277 MLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY-ASVYSVDQGVKRILEQLKKNGQY 335 L L Y PH P + A +Y +F Y A + ++D GV +I++QLK G Sbjct: 781 YLQLNYYGPHTPLE--ATQKYLDRFPGPMPERRRYALAMISAIDDGVGQIVDQLKAEGVL 838 Query: 336 DNTIILFTSDNGAVI--------------DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 DNT+I+ TSDNGA + LN G K GG PM Sbjct: 839 DNTLIVMTSDNGAPLKMTKTDSPINGDAGGWDGSLNDPWVGEKGMLSEGGIRVPMIWSLP 898 Query: 382 GKLQPG-NYDKLISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEPHKNLTW 439 +L G YD +SA+D P+ L A +P D DG+ L+P L D Q P + L + Sbjct: 899 TQLPSGITYDWPVSALDIAPSVLKLAGGELPSGDAAFDGIDLIPRLND-IQNPPTRTLYF 957 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 FWD +R + ++ + Sbjct: 958 R-------------FWD-------------------------QAAIRRGKWKYIFAGDGR 979 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + L+ L +D + NLA P++ ++ + + P Sbjct: 980 RF-LFDLESDQHEHRNLAEEYPELANKLHASLASWTSELSPK 1020 Score = 209 bits (532), Expect = 3e-52, Method: Composition-based stats. Identities = 109/568 (19%), Positives = 178/568 (31%), Gaps = 147/568 (25%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 F+D PT PN++ + MDDL G Sbjct: 15 LAAPSTFADSPPTPSG----PNVLFIAMDDL-----NDWIGCLG---------------- 49 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ 157 Q TP L L G+ FTN + P R+A+ TGRAP + G+Y N Sbjct: 50 -----GHPQTITPNLDRLAASGILFTNAHCPAPACNPCRSAVFTGRAPNQSGLYDNRQQM 104 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 + + LP+ +NHGY+ + GK + + + F +E P Sbjct: 105 REVMPDDVILPQYMRNHGYHASGSGK---------LLHYFIDAASWDEYFPKAESENPFP 155 Query: 218 QNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIG-VVDR--AKTLDQ 274 Q F P + + + D A+ + + DQ Sbjct: 156 Q-----TFYPSQRPVNLKRGGPWQYVETDWAALDVTDEEFGGDWAVSQWIGEQLQQKHDQ 210 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQF--------------------------------- 301 PF L PH P P +Y + F Sbjct: 211 PFFLGCGIYRPHEPW--FVPKKYFEPFPLDSIQLPPGYLENDLDDVPPIGQRAARNRYFA 268 Query: 302 -----NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 + Q Y AS++ D + R+L+ L+ DNTI++ SD+G L Sbjct: 269 HIQKQDQWKQGIQGYLASIHFADAMLGRLLDALESGPNADNTIVVLWSDHG------WQL 322 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQP---------GNYDKLISAMDFYPTALDAAD 407 + K + G T P+ + P D ++ + +PT LD Sbjct: 323 GEKEHWQKYTPWRGVTRVPLMIRVPKTSSPSLPNGTPIGARCDAPVNLLSLFPTVLDLC- 381 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 +P + DG SLLP L++ K D Sbjct: 382 -QLPSNPVNDGPSLLPLLKEPKT------------------------------------D 404 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 + HN T +Y V + ++ ++ LY + D + +NLA P+ Sbjct: 405 TWKHNSVTYLSHPGAYAVSGRTHRYIH-YQDGSEELYNIEADPYEWNNLATK-PES---- 458 Query: 527 QGVVREFIDSSQPPLSEVNQEKFNNIKK 554 + +F +S ++ + ++ K Sbjct: 459 SEQLAQFRSTSPTKFAKRIEPSVKSLAK 486 >UniRef50_A6DMW1 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW1_9BACT Length = 585 Score = 389 bits (999), Expect = e-106, Method: Composition-based stats. Identities = 132/548 (24%), Positives = 206/548 (37%), Gaps = 131/548 (23%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P+ KPN+IV+ +DD+G F Sbjct: 2 PSALIAAKKPNVIVILIDDMGLMDSSTYGSKF--------------------------YQ 35 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY----------------- 151 T + L EG+ FT+ Y A + P+RA+IM+G+ P+R + Sbjct: 36 TANMSRLAKEGMLFTDAYAASPLCSPTRASIMSGQYPSRLHMTVAVTPKSKEKPKALAPA 95 Query: 152 -----SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 ++++ +PL L E Q+ GY TA +GKWHL Sbjct: 96 PNQYCGKVESKNHMPLAVYTLAEALQDSGYTTAHIGKWHL-------------------- 135 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHA-AGTAYYNSPSL--------FKNRERVPAKGYISDQ 257 + +N+GFD+ +G G Y SP N P Y++++ Sbjct: 136 ---TENPKHNAENQGFDFVIGGAGLPGPPDYYSPYKRKGKKAKGINNLSPGPKGEYLNER 192 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA---PDQYQKQFNTGSQTADNYYAS 314 L E+I + + ++PF L L + A H P P +++ Q Sbjct: 193 LAKESIKWIKSVQDSNKPFYLNLWHYAVHGPVIEKKDLMPKYLERRDPNNPQRCPEMGTM 252 Query: 315 VYSVDQGVKRILEQLKK---NGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQ 366 + S+D V +L+ L K DNT+I+ TSDNG VI N +G K+ Sbjct: 253 IDSMDNSVGMLLDWLDKPENKAVKDNTLIILTSDNGGVIHKETNGNTWTSNRPLRGGKAN 312 Query: 367 TYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 TY GGT P + W ++ G+ + ++D YPT L+A +I K L DG S+LP L Sbjct: 313 TYEGGTRVPWIVRWPDTIKAGSVCTTPVQSIDIYPTVLEAVNIKAKKGLTFDGQSILPLL 372 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + +K H+ + T + H F P S +V Sbjct: 373 EQRKME--HQPIF--TDFQHLFGVMCAP----------------------------SSSV 400 Query: 486 RNNDYSLVYTVENNQ------LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 R D L+ L+ L DL + NLAA P+ VKE+ ++ I + Sbjct: 401 RVGDMKLIRFYHAGPKAQSHAYELFDLKRDLYESINLAAYMPEKVKELDRLIEAHIKETA 460 Query: 539 PPLSEVNQ 546 + N+ Sbjct: 461 ALVPIANK 468 >UniRef50_C1ZIS7 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZIS7_PLALI Length = 631 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 118/540 (21%), Positives = 197/540 (36%), Gaps = 101/540 (18%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 S+ P + +PNI+V+ DDLG+ L F Sbjct: 24 SEANPPSAPRQNRPNIVVILADDLGWADLGCYGNPFH----------------------- 60 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----- 159 TP L L +G+R T Y A V P+RAA++TG+ PAR + + Sbjct: 61 ---KTPHLDQLARDGIRCTQAYAACPVCSPTRAALLTGQNPARLHLTDWLPGRGNRNDQA 117 Query: 160 ---------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 +P LP + +++GY T ++GKWHL ++ P+ H Sbjct: 118 LRVPEIRNSLPQGIMTLPGVLKSNGYQTCSIGKWHLGGGASGPL--------QHGFHEQI 169 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 + +E R F F A E +P Y++D L D+A+ +++ Sbjct: 170 AGDERGSPARWFAPFGPQAATNGEKDRQGKPIPGLEDIPDGKYLTDALADKAVAFIEKQ- 228 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT----ADNYYASVYSVDQGVKRIL 326 T ++PF LYL + A H P + AP++ ++F + Y A +Y +D V +++ Sbjct: 229 TAEKPFFLYLPHFAVHTPMN--APEETIQKFRDNKPPGVVRNEIYAAMLYHLDAAVGKVM 286 Query: 327 EQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 L + G NTI++FTSDNG + + P +N + K Y GG P+ + + Sbjct: 287 NSLTEKGFAKNTIVVFTSDNGGLATIEGKNTPATINAPLREGKGWLYEGGIRVPLIVSFP 346 Query: 382 GKLQPGNYDKLISAMDF-YPT------ALDAADI--SIPKDLKLDGVSLLPWLQDKKQGE 432 + G S D T L A I + + LDG+++ E Sbjct: 347 KHIPDG------STTDVPMTTLDLLPSLLSLAGIQYQVDANSPLDGMNISDIWTGNATPE 400 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 K Y H YPH N +R + Sbjct: 401 LKKAAFERPLYWH-----------------------YPHYANQGGF--PGGVIRQGPWKY 435 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 + + + L+ + D + N A P+ + + + + S + N + N Sbjct: 436 IENYQTGRKELFLVDKDPGEGRNRAPDEPEKITQFAAQLAAWKQSISAQETVPNPDYIPN 495 >UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI83_PLALI Length = 558 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 123/500 (24%), Positives = 202/500 (40%), Gaps = 97/500 (19%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + KPN++++ DDLGY + A A T Sbjct: 100 AAEARPEKPNVVIINCDDLGYADVG--------------------------AFGATICKT 133 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFL 167 P + + EGV+ T+ YVA V SR A++TG P R G+ + +++GI +E L Sbjct: 134 PEIDRMAREGVKATSFYVAQAVCSASRTALLTGCLPNRIGILGALSHVSKNGIADSEVTL 193 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 ELFQ+ GY TA GKWHL + ++ P + GF +G Sbjct: 194 GELFQSQGYSTAMYGKWHLGYQA-----------------------QFLPGHHGFGEALG 230 Query: 228 FHAAGTAYYNSPS-------LFKNRERVPAK--GYISD------QLTDEAIGVVDRAKTL 272 + + +P LF+ + PA+ G+ +D T A+ +DR Sbjct: 231 IPYSNDMWSKNPYGKFPPLPLFRQKGDSPAEIIGHDTDQSRFTTDFTMAAVSFIDRH--A 288 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 D+PF +YLA+ PH P + + + A Y + +D V I + L+K+ Sbjct: 289 DKPFFIYLAHPMPHTPI-------FVSEERNSGERAQLYRDVIGEIDWSVGTIRQTLEKH 341 Query: 333 GQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNY 389 T+++FTSDNG + G + K + GG P W G + P Sbjct: 342 QLTRKTLVIFTSDNGPWLVFGNHAGSTGPLREGKGTMWDGGARVPFVACWPGVIPPDTTV 401 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 D ++ D +PT +P D +DGV + P L + +PH+ L W ++ Sbjct: 402 DLPMATYDLFPTFAKMLGAKLP-DHPIDGVDIWPQLTSASKAQPHQAL-WF-----YYGR 454 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TD 508 + I K V + +P + LV + +L LY L +D Sbjct: 455 DLIAVRSGPWKLVFPHTYVHPVERGNDGQRG----------KLV-NRKFTELALYNLDSD 503 Query: 509 LQQKDNLAAANPQVVKEMQG 528 + + NLA+ +P++VK+++ Sbjct: 504 IGETTNLASQHPEIVKQLEA 523 >UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT Length = 493 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 130/550 (23%), Positives = 197/550 (35%), Gaps = 147/550 (26%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 F + E PNII + DDLGYG+L GS+ K Sbjct: 11 FTFFSCSTVENQKDQPPNIIYILADDLGYGEL----GSYGQK------------------ 48 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------- 154 + TP L L +G+RFT Y V PSR +TG + N Sbjct: 49 ----KIKTPNLDRLAADGMRFTQHYTGAPVCAPSRYMFLTGNHAGHAYIRGNYELGQFSD 104 Query: 155 ---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 Q IP T L ++ + GY TA +GKW L Sbjct: 105 EMEGGQMPIPETTPTLAKMLKKAGYQTAMIGKWGLGMNETTG------------------ 146 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVP--------------------- 249 P GFDY+ G+ A+ P L++N ++ P Sbjct: 147 ----SPLLHGFDYYYGYLDQKQAHNYYPTHLWENDKKDPLNNDYFLVHSPISSKANQSDF 202 Query: 250 ----AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNT 303 + Y D++ ++AI +D + D+P+ LY PH+ P DQY+ F Sbjct: 203 DQFKGQEYAPDRMLEKAIQFLDTTAS-DKPYFLYYPSPIPHVSLQVPDSLVDQYRDVFEE 261 Query: 304 GSQ-----------TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 Y A + +D V +I + +K+ GQ +NT+ILF+SDNG G Sbjct: 262 EPYLGNKGYTAHQFPNAAYAAMITHLDSEVGKIWDSVKEKGQEENTLILFSSDNGPTFAG 321 Query: 353 P-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAA 406 +G K Y GG P +WKGK++ G+ LIS D + T + A Sbjct: 322 GVDPDFFNSAAGLRGLKMDVYEGGIRIPFIAYWKGKIKAGSISDLISGHWDMFNTFAELA 381 Query: 407 DISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQS 466 DG+S+LP L + Q E H + + Sbjct: 382 GQDQSAP---DGISILPELLGESQNETHDYIYFEYP------------------------ 414 Query: 467 DDYPHNPNTEDLSQFSYTVRNNDYSLV----YTVENNQLGLYKL-TDLQQKDNLAAANPQ 521 + +R D+ V T +++ LY L TD + N+AA +P+ Sbjct: 415 -----------EKRGQIALRIEDWKGVKVEMKTNLDSKWELYNLKTDRNEVFNVAAEHPE 463 Query: 522 VVKEMQGVVR 531 +V ++ + + Sbjct: 464 IVNKIDSLHK 473 >UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R457_9PLAN Length = 516 Score = 388 bits (997), Expect = e-106, Method: Composition-based stats. Identities = 123/562 (21%), Positives = 195/562 (34%), Gaps = 84/562 (14%) Query: 28 HAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 A + T + + PNI+ + DDLGYG + G F K Sbjct: 3 WCAMVRSTGVWLAALLLIGSTALVRAEELPPNIVFILCDDLGYGDV----GCFGQK---- 54 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 + TP + +L +G+R Y V PSR ++TG Sbjct: 55 ------------------KTRTPHIDTLARDGMRLIQHYSGAPVCAPSRCVLLTGLHSGH 96 Query: 148 FGVYSN----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 V N + Q + LP L GY A GKW L + P + + Sbjct: 97 SQVRDNREAQPEGQYPLAEGTVTLPGLL--EGYVCGAFGKWGLGGPESSGKPLAQGFDRF 154 Query: 204 HDNFTTFSAEEWQPQN-RGFDYFMGFHA---AGTAYYNSPSLFKNR---ERVPAKGYISD 256 A + PQ+ D + A + + + +N ER Y +D Sbjct: 155 FGYNCQRQAHNYYPQHLWSNDEKVLLKNPPFAAHQKFPADADPQNPAAFERYRGPDYAAD 214 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNT----------- 303 ++++A+ +D +PF LY A PHL P +Y +F+ Sbjct: 215 LISEQALKFIDEHHQ--KPFFLYYASPVPHLALQVPEDSLKEYAGEFSETPYLGERGYLP 272 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG----- 358 Y A + +D+ + RILE+L+K G TI++F+SDNG + D + Sbjct: 273 HPTPRAAYAAMITRMDREIGRILERLEKYGLQRRTIVVFSSDNGPLYDKLGGTDADFFQS 332 Query: 359 --AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLK 415 +G K Y GG P + + G + G + D+ PT L A +S + Sbjct: 333 ALDLRGRKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDWMPTLLSLAGMSTKIPEQ 392 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 DG L P L+ Q P + L W + Sbjct: 393 ADGRDLSPSLRGDWQA-PREFLYREFPGYGGQQFVRSGKWKAVRQ--------------- 436 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGV-VREF 533 V L E + LY L D + N+AA +P+VV ++ + +RE Sbjct: 437 ----NLVRPVPTGKKKLAEWKEPLAIELYDLEADPTESTNVAAEHPKVVAKLHAIMLREH 492 Query: 534 IDSSQPPLSEVNQEKFNNIKKA 555 S + + ++ E+ K A Sbjct: 493 QPSVEFKMPRLDDEQAAAHKAA 514 >UniRef50_Q7UN55 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UN55_RHOBA Length = 501 Score = 388 bits (996), Expect = e-106, Method: Composition-based stats. Identities = 126/550 (22%), Positives = 212/550 (38%), Gaps = 107/550 (19%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKG---KPNIIVLTMDDLGYGQLPFDKG 78 + + + A + + V+ + + + G +PNII + DDLGYG L G Sbjct: 16 LRSLSRLALAFCCIAVSYRVVSGDESSKADSPASGDALRPNIIYVMADDLGYGDL----G 71 Query: 79 SFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAA 138 + + TP L + +G+RFT+ Y H V PSR Sbjct: 72 CYGQ----------------------TRIQTPHLDQMAADGIRFTDHYAGHTVCRPSRLT 109 Query: 139 IMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 + TG+ G+ N A + + + L + GY T VGKW L NV VPE+ Sbjct: 110 LWTGKHVGSTGLIGN--AARNLTGEQPTVASLLSDAGYATGGVGKWALG---NVDVPEEI 164 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS-LFKNRER---------- 247 + P GFD + G+ A+ P L++N ER Sbjct: 165 ENPG-------------HPLANGFDAWTGYMNQSNAHNYYPRFLWQNYERRFFPGNVIST 211 Query: 248 ---------VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND-------- 290 V + Y D +TD A + ++ PF+L++ + PH N+ Sbjct: 212 DPIARGRVAVKRESYSHDVMTDAAFDFIREHRSD--PFLLHVHWTIPHANNEGGRLNGDG 269 Query: 291 NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 PD + A + +D+ + R+++ L++ + T+++FTSDNG Sbjct: 270 MEVPDYGIYADEGWPNPEKGFAAMITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNGPHH 329 Query: 351 DGPLP-----LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALD 404 +G +G +G K + GG P W G ++PG D + DF PTA + Sbjct: 330 EGGHSDLFFNSSGPLQGSKRSMHEGGIRVPFIAKWPGTIEPGTISDHPSAFWDFLPTACE 389 Query: 405 AADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 A P D +DG+S LP L D+ K+ H+ L W +S W Sbjct: 390 LAGAEPPAD--IDGISYLPALLDQPKKQTKHRYLYWASSEGPTSVGLRSGTW-------- 439 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQV 522 ++ +YP V + + L+ L +D +K++++ +P Sbjct: 440 -KAVNYPGGTKKRRSGNSKPVVNEDGWK-----------LFDLASDPGEKNDVSKDHPAE 487 Query: 523 VKEMQGVVRE 532 ++ + + RE Sbjct: 488 LERLVEMARE 497 >UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R921_9PLAN Length = 676 Score = 387 bits (995), Expect = e-106, Method: Composition-based stats. Identities = 122/523 (23%), Positives = 202/523 (38%), Gaps = 100/523 (19%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 + KPN++ + DD+G+G L G TP Sbjct: 47 ATEPAKPNVVYILADDVGWGDLSVHGG---------------------------GVPTPN 79 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELF 171 + L +G+ ++ + V P+RA +TGR P R G + + L ET + E F Sbjct: 80 IDKLFAQGIEVSHF-MGWCVCSPTRAMFLTGRHPIRVGTGPEVGGE--LSLDETTIAEGF 136 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + +GY T GKWH + P + A GFD ++ Sbjct: 137 KANGYRTGVFGKWHSGSDPDTPAFRAAFAEAFKAIPNKQFAGGHGANAHGFDEAWVYYGG 196 Query: 232 GTAYYN--------SPSLFKNRERVPA-KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 G ++N S + NRE P +GY D +T AI + K DQPF Y+ + Sbjct: 197 GADFFNRRTVQGRGPVSWWHNREFRPDDEGYTDDLVTQRAIEFIRENK--DQPFFCYVPF 254 Query: 283 NAPHLPNDNP-----------APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 + H P A T + + A ++S+D + I ++L+K Sbjct: 255 HIAHAPLQAKENDLAAIDSKTAAKLPTASGKTSDEGKHIHAAMLHSMDNNIAAIRDELEK 314 Query: 332 NGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK--GKLQPGNY 389 G DNTI +FTSDNGA+ G + +G+K Y GG P ++W G + Sbjct: 315 LGLSDNTIFVFTSDNGAMEAG---SSLPLRGHKHTIYEGGVRLPTAIYWPKGGLTGGRKW 371 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + L A+D +PT + D ++PK LDG ++ P L+D Q P ++ +I Sbjct: 372 NGLCGALDMFPTLMAMTDSTMPKTQPLDGKNVWPALRD-NQPSPVESYYFIWHDED---- 426 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-D 508 +R + + L + LY +T D Sbjct: 427 ----------------------------------AIRTDRWKLHRFH--GRYELYDITID 450 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQEKFN 550 + +N+A ++P VVK + + + +S +S + +K++ Sbjct: 451 ETESNNIADSHPDVVKSLSAKMDAWAESLGAAISHQPAPKKYH 493 >UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2_9BACT Length = 539 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 125/570 (21%), Positives = 196/570 (34%), Gaps = 130/570 (22%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 T + + KPNII + DD+GYG L + Sbjct: 38 TALGCVQGNAMTPKKQQKPNIIYIMCDDMGYGDLGCYGQKY------------------- 78 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---- 155 TP + + EG+RFT Y VS PSRA +MTG+ V N + Sbjct: 79 -------ILTPNIDRMAKEGMRFTQAYAGAPVSAPSRACLMTGQHSGHTEVRGNKEYWTN 131 Query: 156 ---------------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 Q LPE+ +++GY T GKW ++ P+ + Sbjct: 132 SKPVYYGENKDFSVVGQHPYDPNHIILPEIMKDNGYRTGMFGKWAGGYEGSLSTPDKRGV 191 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG-------TAYYNSPSLFKNRERVPAKGY 253 D++ F A + P + + T N P E Y Sbjct: 192 DDFYGYICQFQAHLYYPNF--LNEYYKERGDTAVKRVVLTENINHPMF--GDEYFKRTQY 247 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ---YQKQFNTGS----- 305 +D + A+ + +A+T D+PF Y PH P Y+KQF T Sbjct: 248 SADLIHQHAMDWL-KAQTKDKPFFGVFTYTLPHAELTQPDDSLVAFYKKQFFTDKTWGGQ 306 Query: 306 ---------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--- 353 T + A + +D V IL+ L + G DNT+++FTSDNG +G Sbjct: 307 EGSRYNAVVHTHAQFAAMITRLDSYVGEILKLLDERGLADNTLVIFTSDNGPHEEGGADP 366 Query: 354 --LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISI 410 +G +G K Q Y GG P W G ++ G + D PT + + Sbjct: 367 SFFNRDGKLRGIKRQCYEGGIRIPFIARWNGHIKAGVESNLPFAFYDLMPTFAEMVGVKD 426 Query: 411 PKDL---------KLDGVSLLPWLQDKKQGEPH-KNLTWITSYSHWFDEENIPFWDNYHK 460 DG+S+LP L + G+ L W + + Sbjct: 427 YVQRYRNKKKTIDYFDGISILPTLINDGIGQKKYPYLYWEFAETD--------------- 471 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAAN 519 VR D+ L+ LY L DL + ++A + Sbjct: 472 ---------------------QTAVRMGDWKLITIH--GIPHLYNLSNDLHEDHDIANEH 508 Query: 520 PQVVKEMQGV-VREFIDSSQPPLSEVNQEK 548 P +V++M + ++E +S P++ + +K Sbjct: 509 PDIVQKMIEIALKEHTNSELFPVTMPSLDK 538 >UniRef50_B0SY54 Sulfatase n=7 Tax=Alphaproteobacteria RepID=B0SY54_CAUSK Length = 559 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 149/592 (25%), Positives = 232/592 (39%), Gaps = 132/592 (22%) Query: 12 TSISLILASGMAAFAAHAAD------DVKLKATKTN--VAFSDFTPTEYSTKGKPNIIVL 63 ++LI+A + A H ++L + N VA+S+ S PN+IV+ Sbjct: 8 AGLALIVAVALGWAATHKQAVFMWIAHMRLPHVEPNHAVAWSEGPEAAPSGPRPPNVIVI 67 Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DD+G+ + F+ G + TP + SL +GV F Sbjct: 68 LADDMGFNDITFNGG----------------------GVAGGLVPTPNIDSLGHDGVSFA 105 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGV--------------------------------- 150 NGY + PSRA IMTGR RFG Sbjct: 106 NGYDGNATCAPSRATIMTGRYATRFGFEFTPAPVAFEKMVGSEGAAGDIVLPRFYPDRLK 165 Query: 151 ---------YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 + + +P +E + +L + GY+T GKWHL + PE K Sbjct: 166 AMPPGSTAPTPDAVNELSMPASEITVAQLLKTRGYHTLHFGKWHLGGKAG-SRPEQKGFD 224 Query: 202 D--YHDNFTTFSAEEWQP----QNRGFDYFMGFHAAGTAY---YNSPSLFKNRERVPAKG 252 + + E P + +D F Y +N +F+ G Sbjct: 225 ESLGFIAGGSMYLPEGDPGVENAKQPWDPIDRFLWPNLPYAVQFNGSPMFRPG------G 278 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 Y++D LTDEA+ V + ++PF +Y A NA H P D Y Y Sbjct: 279 YMTDYLTDEAVKAVRANR--NRPFFMYFAPNAIHTPLQATKAD-YDALPEIKDHRLRVYG 335 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQKGYKSQTYPGG 371 A V ++D+ V R+L+ LK+ G NT+++FTSDNG LP +N +G+K+ + GG Sbjct: 336 AMVRNLDRNVGRLLQALKEEGLDQNTLVIFTSDNGGANYIGLPDINRPYRGWKATFFEGG 395 Query: 372 THTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 H+P FM W + Y + +D + TA AA +PKD +DGV L+P++Q K Sbjct: 396 IHSPFFMRWPAVIPANSRYSAPVGHIDIFATAAAAAGAPLPKDRVIDGVDLVPFVQGKAT 455 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 G PH+ L W + V + D+ Sbjct: 456 GRPHQTLFWRSGSYK--------------------------------------VVLDGDW 477 Query: 491 SLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 L + N++ L+ L D ++ L+AA P+ VK M ++R+ + P+ Sbjct: 478 KLQSSEAQNKIWLFNLAQDPTEQHELSAAQPERVKAMLALLRQQDAQNAKPI 529 >UniRef50_A6DHS2 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHS2_9BACT Length = 447 Score = 386 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 130/525 (24%), Positives = 213/525 (40%), Gaps = 123/525 (23%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 ++ KPNIIV+ +DD+G+ + + TP Sbjct: 15 FADSAKPNIIVIMVDDMGWAGISSFDNKY--------------------------YKTPG 48 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG--VYSNTDAQ-----DGIPLTE 164 + + EG++ T+ + V P+RAA+MTGR R G V N D + GI E Sbjct: 49 IDRMAVEGMKLTDFHSNGVVCSPTRAALMTGRYQQRSGCDVVINADPKHPDHVRGIRDEE 108 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 PE ++ Y TA GKWH+ E+ P N GFD Sbjct: 109 WTFPEAMKSADYATAVFGKWHIG-----------------------YKAEFHPMNHGFDE 145 Query: 225 FMGFHAAGTA---YYNSP---SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 F+GF + +Y+ ++ RE KG+ SD +T+ ++ ++R K ++PF L Sbjct: 146 FVGFISGNIDAQSHYDRMSTFDWWQARELKDEKGHHSDLITEHSLDFIERNK--EKPFFL 203 Query: 279 YLAYNAPHLPNDN------------PAPDQYQKQFNTGSQTADNYYA--SVYSVDQGVKR 324 Y+A+ PH P P K + + DN+ VD+GV R Sbjct: 204 YVAHGTPHSPFQARGSKIQRGPNKGQVPAWAPKIEYSKTPGDDNWLMKHFTLPVDEGVNR 263 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 IL++L + NTI+ F SDNGA G + +G K Y GG P +W G++ Sbjct: 264 ILDKLVELKIDKNTIVWFLSDNGAA-KGNHSHSENTRGAKGSMYEGGHRVPALVWAPGRI 322 Query: 385 QPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 + G+ D+ + D +++ AA ++IP + +LDGV + P + + K+ + L W Sbjct: 323 KAGSVSDQTMMTFDITASSIKAAGVAIPANHQLDGVDIHPTVFNNKKLN-ERTLIW---- 377 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 + + S +R + LV + L Sbjct: 378 ---------------------------------ENGKGSGALRKGPWKLVVN--KKKQEL 402 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFID--SSQPPLSEVN 545 Y L D ++ NLA + P+++KE+ + ++ S P SE+ Sbjct: 403 YNLADDHKESKNLAQSMPELIKELSEEYQTILNEISKNAPYSEIK 447 >UniRef50_A6DMX7 N-acetyl-galactosamine-6-sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX7_9BACT Length = 578 Score = 386 bits (992), Expect = e-105, Method: Composition-based stats. Identities = 120/553 (21%), Positives = 201/553 (36%), Gaps = 124/553 (22%) Query: 39 KTNVAFSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 K S FT + KP N++ + DDLG+ + Sbjct: 4 KYCFLLSFFTVGLIAAADKPMNVVFILADDLGWSDTELYGQT------------------ 45 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ 157 TP ++ L G F Y + P+RA+ +TG+ PAR G Sbjct: 46 -------KLYKTPNIMRLAKMGCTFDRAYSNSPLCSPTRASFLTGQTPARHGSTQPRHHT 98 Query: 158 -------------------------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNV 192 + + ++ + GY T GKWHL Sbjct: 99 KTVALKAELAKKARPTEKALPVSTATRLDTNFPTIGKMMKQAGYETGHFGKWHLG----- 153 Query: 193 PVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--YNSPSLFKNRERVPA 250 E + P GFD + H Y +P ++ + Sbjct: 154 -------------------PEPYSPLQHGFDVDIPHHTGAGPGKSYVAPWSQEHIKPNYE 194 Query: 251 KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTA 308 K YI D++ +E + VD + D+PF + + H P D D+Y+K + S+ Sbjct: 195 KEYIEDRMVEECLKWVDGL-SGDKPFFMNYWMFSVHAPFDAKQELIDKYKKVIDPNSKQR 253 Query: 309 DN-YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL------PLNGAQK 361 Y A V S+D V +LE L+ G DNT+I+FTSDNG I L N Sbjct: 254 SALYAAMVQSLDDAVGALLEGLESRGLMDNTVIIFTSDNGGNIYSQLDEGIVPTSNFPLS 313 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 G K+ GG P + W G + G D+++ DFY T + + I++P+ +DG+ Sbjct: 314 GGKASMCEGGVRVPCTVVWPGVTKAGSRSDEIVQTSDFYTTIIKGSGIALPEGHVVDGID 373 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 + P L+ +K L ++++ +P W Sbjct: 374 IRPALKGEK-------LDRKAIFTYFPCIVPVPEWL-----------------------P 403 Query: 481 FSYTVRNNDYSLVYTVENNQ-----LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFI 534 S +V + + LV + LY L D+ +++NLA +NP+++K + ++ ++ Sbjct: 404 PSMSVHSGKWKLVRVFFGGENGEHDYKLYDLSNDIAEENNLADSNPELLKRLDNLIEAYL 463 Query: 535 DSSQPPLSEVNQE 547 + N + Sbjct: 464 TETNAVTPVPNPD 476 >UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX9_9BACT Length = 467 Score = 385 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 122/542 (22%), Positives = 204/542 (37%), Gaps = 129/542 (23%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 + V S F + KPNI+++ DD GY L Sbjct: 5 LRKLVLLSTFVAASLTAAEKPNILIIFTDDQGYADLGCFGS------------------- 45 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ 157 + TP L L EG +FT+ Y A V GPSR+A++TGR PAR Sbjct: 46 -------EENQTPVLDKLAKEGTKFTSFY-AQPVCGPSRSALLTGRYPARS-------KG 90 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 G+P +E E+ + GY TA VGKW D P Sbjct: 91 WGMPASEITFAEMLKETGYQTACVGKW--------------------DVSNRQPIIPRMP 130 Query: 218 QNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKG---YISDQLTDEAIGVVDRAKTLDQ 274 +GFDY+ G + L++N ++ ++ T++AI +++ + ++ Sbjct: 131 NAQGFDYYYGTLGGNGS--GKIDLYENNKKERTTEDMASLTRLYTNKAIDFLEKQRDPEK 188 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 PF+LYLA+ H D A +++++ + Y A+V +D R+L +L + Sbjct: 189 PFILYLAHTMTHTVVD--ASPKFKEKTGD-----NLYRAAVEELDYETGRLLNKLNQLNL 241 Query: 335 YDNTIILFTSDNGAVIDGPLPLNGA-----------------QKGYKSQTYPGGTHTPMF 377 NT++++TSDNG + P +NG + K+ + GG H P Sbjct: 242 SKNTLVIYTSDNGPW-NQPKYINGGAKNDHPENSIFWGDAGEFRDGKASIWEGGAHVPCV 300 Query: 378 MWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 M W GK+ G D L++ +DF PT IP + +DGV+ L ++ K + Sbjct: 301 MRWPGKIAAGKTNDGLMATIDFLPTLAAVTGAKIPDERVIDGVNQLGFICGKSETARETY 360 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV--- 493 + P + + + +R ++ L+ Sbjct: 361 IY------------------------------NPGSASVQTKLVQGNAIREGNWKLISPL 390 Query: 494 ------YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 LY L D+ + NLA P+ V+ ++ ++ SS+ +V Sbjct: 391 TVGWFLEDAGTGSWELYNLKEDIGETKNLAKQYPEKVEHLKKLL----QSSEAKFPKVKP 446 Query: 547 EK 548 Sbjct: 447 RP 448 >UniRef50_A6DJ37 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ37_9BACT Length = 469 Score = 385 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 113/553 (20%), Positives = 201/553 (36%), Gaps = 125/553 (22%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +++ + + + KPN++ + +DDLG+ L G F Sbjct: 2 NIQTTMLFIGLGIASLANIALAQSNKPNVLFVFIDDLGWKDLGCYGGKF----------- 50 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 TP SL EG++FT Y A V P+RA++++G+ AR GV+ Sbjct: 51 ---------------IETPAADSLAAEGMKFTQAY-ASPVCSPTRASLISGQNAARHGVW 94 Query: 152 SNTDAQDG-------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 D I ++ GY +GKWH + Sbjct: 95 EVIGVNDRPYAKMSSPLRKLEIDENIQTYADILNKEGYTCGLIGKWHAGR---------- 144 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQL 258 PQ GF ++N E + + Sbjct: 145 -----------------TPQAHGF---CKIDKKIHDPVLKKYAYENDEHKVGE------I 178 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN--DNPAPDQYQKQFNT--GSQTADNYYAS 314 T +I + + K D PF L ++++A H P + ++Y+K+ + NY A Sbjct: 179 TANSIEFLRKNK--DNPFFLCVSHHAAHAPLIARDDLINKYRKKLRKTGITDVHPNYAAL 236 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--------DGPLPLNGAQKGYKSQ 366 V D+ + +L++LK DNT+++F SDNG +I + + K Sbjct: 237 VEMADESLGMLLDELKALKLEDNTMVVFYSDNGGMIKDMYLKQPEALATTMAPLRWQKGS 296 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG P + W GK++PG +++++ D + T +D +IP++ DG+SL+P L Sbjct: 297 LYEGGIRVPFIVKWPGKVKPGTSSEQMLNSFDLFSTFVDVCGGTIPQEQVTDGLSLVPVL 356 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + + + L W H P + + + Sbjct: 357 RGETELLERDTLYW-------------------------------HFPTSMWTRSPAGAI 385 Query: 486 RNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS--SQPPLS 542 R DY L+ E+ ++ L+ L D+ + NL + + E+ + + S +Q P Sbjct: 386 RKGDYKLIEHFEDGRIELFNLKDDIGETVNLLYSESEKASELLSALTAWRRSLDAQMPTP 445 Query: 543 EVNQEKFNNIKKA 555 N + + A Sbjct: 446 NPNYDPVRAHEHA 458 >UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W906_ENT38 Length = 501 Score = 385 bits (990), Expect = e-105, Method: Composition-based stats. Identities = 124/538 (23%), Positives = 202/538 (37%), Gaps = 104/538 (19%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 + NV ++ + + KPN++++ DDLGYG L Sbjct: 16 MTGGMGNVLAAEQSANQL---NKPNVVIILADDLGYGDLGIYG----------------- 55 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + L EGVRF+ Y + PSRA ++TGR P R G+ S Sbjct: 56 ---------HPIVKTPNIDKLAQEGVRFSQYYAPAPLCSPSRAGLLTGRTPFRTGIRSWI 106 Query: 155 DAQDGIPL--TETFLPELFQNHGYYTAAVGKWHLS---KISNVPVPEDKQTRDYHDNFTT 209 I L E + ++ GY TA +GKWHL+ + P ED N Sbjct: 107 PTNKNIALGRNEKTIASYLKDQGYDTAMMGKWHLNAGVDRHDQPQAEDAGFDYTLVNAAG 166 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLT-DEAIGVVDR 268 F + D G Y N ++N + + IS + EAI ++ Sbjct: 167 FVTSD-------LDKAKERPRNGVVYPNG--FYRNGKALGTVNQISGEFVSQEAINWLND 217 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNP-----------------APD-QYQKQFNTGSQTADN 310 K ++PF +Y+A+ H P +P PD Y + + Sbjct: 218 -KKDNKPFFMYVAFTEVHTPLASPKKYLEIYKNYMSEYEKQHPDMFYADWVDKPYRGPGE 276 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL--------NGAQKG 362 YYA++ +D+ V ++L ++K GQ DNTII+FTSDNG V +G Sbjct: 277 YYANISYMDEQVGKVLAKIKSMGQEDNTIIIFTSDNGPVTREARKWYELNMAGETDGLRG 336 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 K + GG P + + L G D +S +D PT + ++P D +DG S+ Sbjct: 337 RKDNLWEGGIRVPAIIKYGQHLHAGTVTDTPVSGLDILPTLAELTHFNLPTDRIIDGESI 396 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 +P L+ + L + D P + D+ Sbjct: 397 VPVLEGQTMNRQQPLLF---------------------------AIDMPFQDDPTDM--- 426 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + +R+ D+ +++ + LY L D + N P + ++M + + S + Sbjct: 427 -WALRDGDWKMIFDRNSKPKYLYNLKLDRGETMNQLGKQPVLEQKMIAALARYQSSIE 483 >UniRef50_D2R323 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R323_9PLAN Length = 631 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 124/542 (22%), Positives = 197/542 (36%), Gaps = 115/542 (21%) Query: 12 TSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKG--KPNIIVLTMDDLG 69 T+ S +L S HA L T S + T + + +PNI+V DD G Sbjct: 6 TTSSAMLTSHY--QLTHALFPTWLMFTAIVAFLSSASSTFAAEREVTQPNIVVFLADDAG 63 Query: 70 YGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAH 129 +G F + STP + S+ G +V Sbjct: 64 WGDYSFSGNT--------------------------NLSTPHIDSIARGGASIDRFFV-C 96 Query: 130 GVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 V P+RA +TGR R GV + Q+ + L+E L + + GY T A GKWH Sbjct: 97 SVCSPTRAEFLTGRYHQRGGVRGVSTGQERLDLSERTLADSLRAAGYATGAFGKWHNGSQ 156 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP 249 + P RGFD + G+ + Y +P L N + Sbjct: 157 W-----------------------PYHPNARGFDEYFGYTSGHWGEYFNPPLEHNGKLNN 193 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTA- 308 +GYI D TD AI ++ +K ++PF Y+ + PH P P+ D + Q + A Sbjct: 194 YEGYIVDICTDRAITFIEASK--NKPFFCYVPFTTPHSPWSVPSADWKRFQDKPLEKRAT 251 Query: 309 ----------DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG 358 A V + D+ V R+L +L + +NTI+++ SDNG G Sbjct: 252 NLKQEQLDQTRCALAMVENQDRNVGRVLSKLDELKLRENTIVVYFSDNGP---NSARWTG 308 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLD 417 KG K T GG + ++ W ++ + I+ A+D PT L A + +L LD Sbjct: 309 GMKGKKGTTDEGGVRSVCYIQWPKRIAAAQTIQPIAGAIDLLPTLLSLAGVKHVGELPLD 368 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G L P L ++ P + L Sbjct: 369 GRDLAPLLTGQQPEWPERLLF--------------------------------------T 390 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 + R+ + L + Q L+ + +D Q + P++ EM+ V ++ Sbjct: 391 TWAGKVSARSQTHRL-----DEQGLLFDMQSDPGQTTPVNDREPKLTAEMKSAVAKWKAE 445 Query: 537 SQ 538 + Sbjct: 446 ME 447 >UniRef50_D2QXE9 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QXE9_9PLAN Length = 495 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 103/523 (19%), Positives = 185/523 (35%), Gaps = 85/523 (16%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 +PN++V+ +DDLG+G +TP + Sbjct: 37 AARQPNVVVVFIDDLGWGDFSCFGNKEG--------------------------ATPHID 70 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ---------DGIPLTE 164 + EG+RF+ YV+ + PSR ++ TG+ P R+ + S +++ + + Sbjct: 71 RMAAEGIRFSQFYVSSPICSPSRCSLTTGQYPQRWKITSFLNSRADNARRGVANWLDPEA 130 Query: 165 TFLPELFQNHGYYTAAVGKWHLS---KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 + + Q HGY T GKWHL + + P NF A+ + Sbjct: 131 PTMARILQQHGYRTGHFGKWHLGGQRDVDDAPAIAKYGFDASLTNFEGMGAKLLPLTLKP 190 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 D G + +P + R ++ + D AI +D A +PF + + Sbjct: 191 GDSVPGKIWSDAERLGAPVTWMQRSKI------TGGFVDGAIAFIDAATRDGKPFYVNVW 244 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG-QYDNTII 340 + H P P G + Y A + ++D + ++ + L+ +NTI+ Sbjct: 245 PDDVHSPFWPPVETW-------GENKRELYLAVLEAMDLQLGKLFDHLRSRDELRENTIV 297 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP---GNYDKLI--SA 395 L SDNG + G +G K++ + GG +P+ +W + G ++ S Sbjct: 298 LICSDNGP--EAGAGSAGPFRGGKTELFEGGIRSPLIVWSPALVAAEQRGKANETAVLST 355 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 +D P+ A +P D++ DG L + L W Sbjct: 356 LDLLPSLAKLAGAPLPADVQFDGEECSATLLGRGNESRTAPLFW---------------- 399 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDN 514 + D +R + L+ + LY L +D + N Sbjct: 400 --------RRPPDRKTAGAKGRRVLPDLAMREGKWKLLCDYDGAGALLYDLESDRGETKN 451 Query: 515 LAAANPQVVKEMQGVVREFIDSSQPPL-SEVNQEKFNNIKKAL 556 LA +P K MQ + + S E+ QE +++K Sbjct: 452 LAQQHPDRTKAMQAKLLAWHSSMPADRGPELGQETLESLRKTA 494 >UniRef50_A6DR20 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DR20_9BACT Length = 608 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 134/541 (24%), Positives = 208/541 (38%), Gaps = 129/541 (23%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 + K N+I++ DDLG TP L Sbjct: 16 QEKANVILILADDLGVSDTSLGGSKL--------------------------YQTPNLER 49 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET--------- 165 L GV FTN Y A + P+R++I+TG+ PAR G + + + LT Sbjct: 50 LAKRGVYFTNAYAASPLCSPTRSSILTGQNPARTGFTAPHGHLENVVLTARAGKAAAPSK 109 Query: 166 ----------------FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 L ++F+N GY TA GKWHL K Sbjct: 110 RQVSPVSVNRLSTEYLSLGKVFKNAGYKTAHFGKWHLGKS-------------------- 149 Query: 210 FSAEEWQPQNRGFD----YFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 + P GFD ++ G AG+ + +P + N + +I D+L DE Sbjct: 150 ----PYSPLEHGFDIDIPHWPGPGPAGS--FVAPWRYPNFKENYPGEHIDDRLGDEIAKY 203 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNT-GSQTADNYYASVYSVDQGV 322 + K DQPF + + H P + D+Y+K + Q Y A V S+D + Sbjct: 204 ISENK--DQPFFINFWQFSVHAPFNAKQELIDKYRKLIDKNNPQHNPVYAAMVESMDDSI 261 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDG-----PLPLNGAQKGYKSQTYPGGTHTPMF 377 ++++ L+ N + TII+F SDNG I N +G K+ Y GGTH P Sbjct: 262 GKVIDALETNKLMEKTIIVFFSDNGGNIHSVVDGTTATSNKPFRGGKASIYEGGTHVPAI 321 Query: 378 MWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 + W + + G D LI + D Y + L+ A + + D +S +P L+ QG K Sbjct: 322 VVWPNQTKTGVRNDSLIQSEDLYASILEMAALPVDYQQAKDSISFVPVLKG--QGAKRKQ 379 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 + +PH+PN D S +R D+ L+ Sbjct: 380 VF----------------------------TYFPHSPNVPDCVPPSAALRIGDWKLIKVF 411 Query: 497 ENN-----QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 +N + LY L D + NLA+ NP+ VK M V+ +++ S + + K+N Sbjct: 412 HDNPDLTDRFELYNLANDQGEMLNLASQNPEKVKSMNEVIDQYLSKS-ACIKPIKNPKYN 470 Query: 551 N 551 + Sbjct: 471 S 471 >UniRef50_Q7UXA8 N-acetylgalactosamine-6-sulfate sulfatase n=2 Tax=Bacteria RepID=Q7UXA8_RHOBA Length = 495 Score = 385 bits (989), Expect = e-105, Method: Composition-based stats. Identities = 121/554 (21%), Positives = 201/554 (36%), Gaps = 130/554 (23%) Query: 16 LILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPF 75 + L + + F A + + A + VA S KPNI+ + DD G+G L Sbjct: 23 VCLTTVVILFVLAGATESRCAAAEDTVA--------SSVGKKPNILFIFADDWGWGDLSC 74 Query: 76 DKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPS 135 + TP + L EG F VA GV PS Sbjct: 75 HGHPY--------------------------VRTPNIDRLAREGTDFERFTVASGVCSPS 108 Query: 136 RAAIMTGRAPARFGV-----YSNTDAQDGIP----LTETFLPELFQNHGYYTAAVGKWHL 186 R A+MTG PAR + + ++A+ +P + LP L Q+ GY TA GKWHL Sbjct: 109 RTAVMTGHFPARHNIDGHFAWVPSNAKRNMPDWLDPSAVTLPRLLQSGGYKTAHFGKWHL 168 Query: 187 SKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRE 246 S P P G+D + F+ +G Sbjct: 169 SNDMIPDSPT--------------------PAAYGYDRYGAFNCSGEQMPVHED------ 202 Query: 247 RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ 306 +E I ++ A + PF + L + PH P +++ + + S+ Sbjct: 203 ------------ANETIRFIEEAHSKGDPFFVNLWVHEPHTPFHVIPKYRWRFRDSGLSE 250 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG-------------- 352 + Y A + D + +L+ L + + T+++F+SDNG Sbjct: 251 ADEIYAAVLSHADDRIGEVLDALDRLELTNKTLVIFSSDNGPARGSANAKLELSYDTATG 310 Query: 353 -------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK--LISAMDFYPTAL 403 + +KGYK+ + GG + P + W GK+ G D +ISA+D PT Sbjct: 311 AGFGIGASKGITAGRKGYKASLFEGGINVPFIVRWPGKVAAGKTDDSAMISAVDLLPTFC 370 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 D A + +P + DG+S + L+ + K L W S + W Sbjct: 371 DIAGVELPSAYQADGISQVSALKGQPTTGRTKPLFWKYSARWPAQKSRPHHWA------- 423 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQV 522 SY V N + L+ +++ + LY + +D + +L + P Sbjct: 424 ------------------SYCVVNERWKLLANQDSSYVELYDIVSDPFESTDLKESQPDA 465 Query: 523 VKEMQGVVREFIDS 536 V ++ + ++ S Sbjct: 466 VTKLSKQLTDWKAS 479 >UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_BACV8 Length = 464 Score = 384 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 125/536 (23%), Positives = 194/536 (36%), Gaps = 123/536 (22%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 + + + S T + +T K PN+I + DDLG G L G + + Sbjct: 3 NTRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDL----GCYGQR------- 51 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 Q TP + + G++F Y VS PSR A++TG+ + Sbjct: 52 ---------------QIKTPNIDGIAQNGMKFMQHYSGSTVSAPSRCALITGKHMGHAAI 96 Query: 151 YSNTDA--------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 N + +P E + ++F+ Y T VGKW + Sbjct: 97 RGNAKVAGSDGLLYETPLPAGEVTVADIFKTKNYVTGCVGKWGMGGPGT----------- 145 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRER---VPAKGYISDQLT 259 E P GFDYF G+ A+ P E+ + K Y D + Sbjct: 146 -----------EGMPGKHGFDYFYGYLGQRFAHSYYPEFLHENEQKIMLDGKYYSHDLML 194 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN--PAPDQYQKQF------------NTGS 305 ++A+ +D +PF LY + PH D A +Y+ +F + Sbjct: 195 EKALNFIDENAQ--KPFFLYFSPTIPHADLDIMGEAMTEYEGEFCETPFGGSRDGYKSQQ 252 Query: 306 QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQ 360 Y A V +D+ V I+++LK+ G YD+TII+FTSDNG +G NG Sbjct: 253 NPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHSEGGHDPSYFDSNGPF 312 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGV 419 +G K Y GG TP + W G + G ISA DF PT + IP++ +DG+ Sbjct: 313 RGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTIGELVQADIPQN--IDGI 370 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 S LP L K + H + + P Sbjct: 371 SYLPTLTGKGTQKEHDCIYYEFFEFGGKQSIMTP-------------------------- 404 Query: 480 QFSYTVRNNDYSLVY----TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 + + LV LY + TD + N+ P V K+++ ++ Sbjct: 405 --------DGWKLVRLEVSDPSKTYEELYNIYTDPAETSNVIKQYPDVAKKLKNMI 452 >UniRef50_B1KD86 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD86_SHEWM Length = 484 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 128/541 (23%), Positives = 225/541 (41%), Gaps = 111/541 (20%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 + +TK N +FS E K N++++ +DDLG + Sbjct: 16 VISTKLNASFSPLKKEESKLKQA-NVVIIYVDDLGIMDTGIYGSA--------------- 59 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 Q TP + L + GVRFT Y PSRA++MTG PA G+ + Sbjct: 60 -----------QYPTPNIDKLANSGVRFTQAYANAANCAPSRASLMTGLTPAEHGILTVG 108 Query: 155 DAQDG---------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 ++ G + T + +LF+ GY TA +GKWHL K + D Sbjct: 109 SSERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYATAVIGKWHLGKTAPTEYGFDTA 168 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLT 259 H + P ++G +G G Y+S+++T Sbjct: 169 IAASHLGHPPSY---FYPYSKGKRKLIGLEEGGL----------------KDEYLSNRIT 209 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ----TADNYYASV 315 EA+ + + QPF LYL + A H P + AP ++ Q N Q + Y A + Sbjct: 210 REAVNYISSQR---QPFFLYLPFYAVHTPIE--APKEWVNQHNARQQAGEIKSAAYAAMI 264 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 ++D+ V ++L+ L K+GQ +NT+++F SDNGA P + +GYKS + GG P Sbjct: 265 ANLDRDVGKLLQALDKSGQRENTLVVFASDNGAY--DPATSSLPYRGYKSSLFEGGIKIP 322 Query: 376 MFMWWKGKLQPGNYD-KLISAMDFYPTALDAADIS--IPKDLKLDGVSLLPWLQDKKQGE 432 + + W ++ P + + + D + I + L L ++ L ++ + + Sbjct: 323 LVLSWPKQIPPNSQNRTPVQMSDLF------LGIKHLLQPKLALHRQDIIS-LAEQGKEQ 375 Query: 433 PHKNLTW-----ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 P + L W I ++ + + N P+W + + +R Sbjct: 376 PERPLYWHAPIYIDQFAPYRGQPNHPYWKH----------------------TPAAAIRL 413 Query: 488 NDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL-SEVN 545 Y L+++ E + L+ L D Q+K+NL NP++ +++ ++++ +S P+ SE+N Sbjct: 414 GHYKLIHSYETGKQLLFDLDKDSQEKNNLVNQNPEIREKLFKALQQWQESVNAPMVSELN 473 Query: 546 Q 546 Sbjct: 474 P 474 >UniRef50_A6CB33 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CB33_9PLAN Length = 590 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 114/550 (20%), Positives = 194/550 (35%), Gaps = 95/550 (17%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 ++ V T+ N+I++ DD G F Sbjct: 4 LSHCRIVYALLIVLTVSLLATQLQAAQHTNVILIMTDDQGGWDYGFQG------------ 51 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 +TP L ++ G R + YV V P+RA +MTGR R Sbjct: 52 --------------NKHLNTPHLDAMAANGARLSRFYV-SPVCTPTRANLMTGRYNYRTR 96 Query: 150 VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + + TE + E GY T GKWHL P Q + + + Sbjct: 97 AIDTYIGRAMLEPTEVTIAEALAPAGYRTGIFGKWHLGD----SYPLRPQDQGFQEVLVH 152 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 QP + G Y P LF N E+ +GY +D D A+ +++ Sbjct: 153 RGGGIGQPSD---------PPEGAGKYTDPVLFHNGEKKQMQGYCTDIYFDHALKFLEQN 203 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS-------------------QTADN 310 ++ D+P +Y+A NAPH P + P+ +K++ Sbjct: 204 ESQDKPTFMYIATNAPHGPFHD-VPEDLRKKYQAMDLTDAYGFDMNPKRKNEKQFDKTSR 262 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPG 370 ++ + ++DQ + ++ + LKK DNT++LF +DNG G +G K G Sbjct: 263 VFSMIENIDQNIGKLFQHLKKIDALDNTLVLFLNDNGP---NGPRYVGEHRGAKGSVNEG 319 Query: 371 GTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G + + W +L+ G + + + D +PT L A + P LKLDG+++LP L++K Sbjct: 320 GIRSVLIAHWPAQLKAGTVNPTIAAHYDLFPTILAATGVEKPAGLKLDGINVLPLLKNKA 379 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 P ++L E + + V D Sbjct: 380 DQWPERSLFLQWH------------------------------RGDEPQPRTNAAVVTQD 409 Query: 490 YSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 Y + ++ + L+ L D ++ +LAAA ++ M + + Sbjct: 410 YKMTFSKPDEPGKLFHLQNDPAERQDLAAAKTKLASHMTEQYNNWFQDVSSTRPDNYAPP 469 Query: 549 FNNIKKALSE 558 +I E Sbjct: 470 RIHIGNPKEE 479 >UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3 Tax=Deuterostomia RepID=UPI0000586CBA Length = 596 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 119/564 (21%), Positives = 202/564 (35%), Gaps = 81/564 (14%) Query: 21 GMAAFAAHAADDVKLKATKTNVAFSDFTPT--------EYSTKGKPNIIVLTMDDLGYGQ 72 M F A +K+K T T + +T P+I+ + DD G+ Sbjct: 54 SMEQFKAKELKMLKVKRCSCAFLVEALTVTVLICTGLIKGATGKPPHIVFIVADDYGWFD 113 Query: 73 LPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVS 132 + + + TP L L GV+ N YV + Sbjct: 114 VGYHNST---------------------------IKTPNLDLLASRGVKLENYYV-QPIC 145 Query: 133 GPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 PSR+ +MTGR G+ + +PL ET LP+ + GY T VGKWHL Sbjct: 146 SPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFY 205 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP 249 N +P + + + W G F GF ++ + N Sbjct: 206 KNECMPLQRGFDSSFGYLSGMQ-DYWTHFRSG--SFPGFPEGN--HWLGIDFWDNNRVAW 260 Query: 250 A--KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ- 306 Y T+ A V+ + +QP LYL + H P P++Y K + Sbjct: 261 EYTGNYSQFVFTERAQRVI-QQHNPNQPLFLYLPLQSVHGPLQ--VPEKYMKPYAHFQDV 317 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQ 366 Y V ++D+ V ++++ L++ G +++T+++FT+DNG G N +G K+ Sbjct: 318 GRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTP-GKSGNNWPLRGTKNT 376 Query: 367 TYPGGTHTPMFMWWKGKLQPG----NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 + GG H F+ + G + D++PT ++ L LD ++ Sbjct: 377 LWEGGVHGVGFITGP-MIPAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGLALDSYNMW 435 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 + K P K L D +D + S YP E + Sbjct: 436 NSIT-KGTPSPRKELLHNIDPYIRADHPFGYGYDEETDMIYPLSGLYPK-MAAEFSTDMR 493 Query: 483 YTVRNNDYSLVYTVE----------------------NNQLGLYKLT-DLQQKDNLAAAN 519 +R ++ L+ N L+ +T D +K++L+ + Sbjct: 494 AAIRVGEWKLLTGFPGRSGWYPPPEWNIHPIDPVEAANKVTWLFNITADPCEKNDLSYQH 553 Query: 520 PQVVKEMQGVVREFIDSSQPPLSE 543 P+VV E+ G + + +S P Sbjct: 554 PEVVTELVGRLEAYYKTSVPVRFP 577 >UniRef50_A6DKN7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKN7_9BACT Length = 465 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 135/523 (25%), Positives = 223/523 (42%), Gaps = 106/523 (20%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 + K NII++ DD+ YG L + TP + Sbjct: 16 SAEKTNIILIFADDMHYGALGVTGSVL------------------------TKAKTPAID 51 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-------GVYSNTDAQDGIPLTETF 166 S+ +EGV F NGY +H PSRA ++TGR ARF G G+ +E Sbjct: 52 SIFNEGVHFPNGYASHATCAPSRAGLLTGRYQARFDLETLPGGTADRKKTGYGVKTSEIM 111 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 +P L + GY T A+GKWHL S+EE+QP RGFD++ Sbjct: 112 IPALMKKGGYQTCAIGKWHLG-----------------------SSEEFQPNARGFDHWF 148 Query: 227 GFHAAGTAYYN--------------------SPSL--FKNRERVPAKGYISDQLTDEAIG 264 G+ + Y P+L +N E V +GY++D +DEA Sbjct: 149 GYRGSCGFYQFKSQVQSAKKGQELKPLPSGEDPNLDVVRNGESVRLEGYLTDHFSDEAAN 208 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 + K ++PF +Y A H P + P++Y + T + ++D V+ Sbjct: 209 WIKENK--ERPFFMYFAPYNVHAP--DTVPNKYIPKGGTAHD------GVIAALDASVQT 258 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 IL+ LK+ G DNT+++F++DNG D + KG K+ Y GG P M W + Sbjct: 259 ILDALKEAGIADNTLVVFSNDNGGKKD----YSKTFKGNKATFYEGGIRVPFAMRWPKGI 314 Query: 385 QPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 + G+ Y+ ++S +D PT A + +P D DG +LLP ++D + + + W Sbjct: 315 EAGSKYNGVVSTLDLLPTFAALAKVDLPSDRVYDGQNLLPVIKDSAKDQ-RQAHFWR--- 370 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS--YTVRNNDYSLVYTVENNQL 501 + + + W + R + + + + Y R ++ L + Sbjct: 371 NGAWRTARVGDWKLVWQVDRKKQKALLNKLGIKHVKGRGVTYAERADELFL-------EP 423 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 LY L D +++ NLA +NP+ ++EM + +++ ++S P E Sbjct: 424 ELYNLANDPKEESNLAQSNPEKLQEMVKIYKDW-EASIPKWRE 465 >UniRef50_C7PRW9 Sulfatase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRW9_CHIPD Length = 460 Score = 383 bits (984), Expect = e-105, Method: Composition-based stats. Identities = 122/522 (23%), Positives = 190/522 (36%), Gaps = 125/522 (23%) Query: 48 TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQK 107 T KPNII + DDLGYG + + Sbjct: 10 TILTQGQTHKPNIIFILADDLGYGNISAYNS-------------------------KSPV 44 Query: 108 STPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFL 167 TP + L EG++F N Y + V PSR A++TG+ + NT + + ++ L Sbjct: 45 KTPNIDRLGQEGIQFKNFYSGNTVCAPSRCALLTGKHMGHAYIRGNT--RLPLRAEDSTL 102 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 +L Q +GY T GKW L + P+ +GFD F G Sbjct: 103 AQLLQGNGYRTGMFGKWGLGESGTTG----------------------SPEIKGFDTFFG 140 Query: 228 FHAAGTAY-YNSPSLFKNRE----RVPAKG--YISDQLTDEAIGVVDRAKTLDQPFMLYL 280 + A+ Y + LF+ +E RVP Y D++ A+ ++ K D+PF L+L Sbjct: 141 YLNQQHAHNYYTDYLFEVKEGQISRVPRDTNVYSQDEILQHALSFINDNK--DKPFFLFL 198 Query: 281 AYNAPHLPNDNPAPD-----------------QYQKQ---FNTGSQTADNYYASVYSVDQ 320 + PH PA D Y+++ + + + A V +D+ Sbjct: 199 PFTLPHAELAPPATDMQAFLNADGSSKLGPETPYERKNGTYRSQENPHAAFAAMVTKLDR 258 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTP 375 V I +K+ G DNT I FTSDNG +G NG KG K Y GG P Sbjct: 259 NVGEISALIKQLGLDDNTYIFFTSDNGPHREGGADPIYFDSNGPLKGIKRDLYEGGIRVP 318 Query: 376 MFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + + GK+ G + D PT D + +DG+S L K H Sbjct: 319 LLVRAPGKVSAGQVSTIPWAFWDVLPTLSDITHSPVLSG--IDGLSYTKALNGTKPARQH 376 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + W + + +D+ L+ Sbjct: 377 DHFYWQF-----------------------------------NEGGLQEALLKDDWKLIR 401 Query: 495 TVENN---QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVRE 532 + + LY L+ D+ ++ +LA PQ VK + G++ + Sbjct: 402 FKKRGTPERFELYHLSEDIGEEHDLATKYPQKVKALSGLMLQ 443 >UniRef50_A7VQW1 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VQW1_9CLOT Length = 588 Score = 383 bits (983), Expect = e-104, Method: Composition-based stats. Identities = 118/505 (23%), Positives = 191/505 (37%), Gaps = 110/505 (21%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 + +PN++ + DD GYG L TP + Sbjct: 2 NEKRPNVVFVLTDDQGYGDLGCTG--------------------------NPDIQTPQID 35 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 E VR T+ +VA + P+R AI TGR P R GV++ + + ET L E+F++ Sbjct: 36 EFYKEAVRLTDYHVA-PLCAPTRGAIFTGRRPLRNGVWATCWGRSILHEGETTLAEVFRD 94 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 +GY T GKWHL ++PQ+RGF + G Sbjct: 95 NGYATGLFGKWHLGDNY-----------------------PYRPQDRGFTEVVAHKGGGV 131 Query: 234 AY--------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 Y S ++N + +GY +D D A ++ LD+PF + NAP Sbjct: 132 GQTPDFWGNNYFEDSYYQNGKLTRYEGYCTDVWFDAAERFIESH--LDEPFFACITTNAP 189 Query: 286 HLPNDNPAPDQYQKQFNTGSQT-ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 H P ++Y + +Y + ++D R+ ++L G DNT+++F + Sbjct: 190 HEPY--LVEEKYAAPYRENENIVHPEFYGMISNIDLNFGRLRKKLSDWGIEDNTVLIFMT 247 Query: 345 DNGAVIDGPL--------PLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGN-YDKLIS 394 DNG + N +G K+ Y GG P F+ W G L G + Sbjct: 248 DNGTSGGCEIDGNEHVLRGYNAGMRGMKTSYYDGGHRVPFFIRWPNGGLDGGRDVEDTSY 307 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 +DF+PT D +S+P L+LDGVSL L ++ + S Sbjct: 308 HVDFFPTLADLCGLSMPP-LQLDGVSLKGVLTGEEALPKGRVEFMQYHQS---------- 356 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKD 513 T S++ +V ++ + LV LY + D Q Sbjct: 357 --------------------TVVPSKWESSVVSDQWRLVR-----GKELYDIKADPGQNR 391 Query: 514 NLAAANPQVVKEMQGVVREFIDSSQ 538 ++A +P+VV+ ++ + Q Sbjct: 392 DIAGQHPEVVRRLRAAHEAYWQEMQ 416 >UniRef50_A4GIB1 Arylsulfatase n=2 Tax=Bacteria RepID=A4GIB1_9BACT Length = 608 Score = 383 bits (983), Expect = e-104, Method: Composition-based stats. Identities = 114/527 (21%), Positives = 204/527 (38%), Gaps = 103/527 (19%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K T T S + GKPN++++ DD GY ++ Sbjct: 1 MKITSTLFLLSIG----LTVFGKPNVLIIMTDDQGYPEVSAHG----------------- 39 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP L L + +R ++ +VA + P+R ++TG AR G + + Sbjct: 40 ---------NPVLQTPNLDRLHGQSLRLSDYHVA-PMCTPTRGQLLTGLDAARNGAVNVS 89 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 + + + + ++ GY T GKWHL Sbjct: 90 SGRALLRPEVSTIANYYEEAGYSTGVFGKWHLGANY-----------------------P 126 Query: 215 WQPQNRGFDYFMGFHAA---------GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 ++PQ+RGF + + ++ G Y++ N + +GY +D +EA+ Sbjct: 127 FRPQDRGFQESVWYPSSSIPSVPAYWGNDYFDD-VYIHNGKEKRFEGYCADVFFNEAMRF 185 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPD-----------QYQKQFNTGSQTADNYYAS 314 + + +PFM YLA N PH P D ++ N + Y Sbjct: 186 MSESAKSKKPFMCYLATNTPHGPFWPKEEDRKEIAEVLAQSKFDNLDNNLKKRLALYLGM 245 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 + ++D + +L+ LK+ ++TI++F +DNG+++ GP N +G K++ + GG Sbjct: 246 IRNIDWNMGNLLKFLKEENLAEDTILIFKTDNGSLL-GPQYFNAGMRGKKTEIWEGGHRV 304 Query: 375 PMFMWWK--GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P F+ W G + + L D PT LD I K+ K DG+SL L+ KK+ Sbjct: 305 PCFIRWPNGGFGKARDIGGLTQVQDILPTVLDLCGIKPRKNTKFDGISLASVLRGKKKVS 364 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 + + + +P + N YP + + V + L Sbjct: 365 EDRTII--------INYSRMPGFSN-----------YPSPHSQTQMRADQAAVLWKRWRL 405 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + LY L +D Q+ N+ +P+VV +M+ + + D + Sbjct: 406 LEDR-----ELYDLASDPLQQKNVIDQHPEVVAKMRQQLYSWWDGVK 447 >UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR48_9BACT Length = 508 Score = 383 bits (983), Expect = e-104, Method: Composition-based stats. Identities = 108/541 (19%), Positives = 189/541 (34%), Gaps = 120/541 (22%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPN+I DDLGY + G F K + TP + + Sbjct: 36 RKPNVIFFIADDLGYADV----GCFGQK----------------------KIHTPNIDRI 69 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----TDAQDGIPLTETFLPELF 171 EG++FT Y V PSR +MTG+ V N + Q +P + L Sbjct: 70 ATEGMKFTQHYSGSPVCAPSRCVLMTGKHSGHSAVRDNRELKPEGQFPLPANTITVARLL 129 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 Q +GY T A GKW L + P D+ + A P + Sbjct: 130 QQNGYITGAFGKWGLGGPESSGKPLDQGFTRFFGYNCQRVAHNLFPT-------YLWDDN 182 Query: 232 GTAYYNSPSLFKNR--------------ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 ++P + +++ + K Y D ++A+ + K D PF Sbjct: 183 HRLALDNPPIGEDQKLPADADSNDPASYKAFTGKSYAPDLYAEQALRFIRDNK--DHPFF 240 Query: 278 LYLAYNAPHLPNDNPAP--DQYQKQ-----------FNTGSQTADNYYASVYSVDQGVKR 324 L+ PH+ P +Y+ + + Y A + +D+ + R Sbjct: 241 LFFPTIVPHVALQVPEDSLKEYEGKLPETPYTGGKGYLPNRTPHAAYAAMITRMDRDLGR 300 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVID-------GPLPLNGAQKGYKSQTYPGGTHTPMF 377 +L +K+ D+TI +FTSDNG +G + K+ Y GG P+ Sbjct: 301 MLALIKELNLDDDTIFVFTSDNGPAPQDMGGTDTKFFNSSGPFRSGKTSIYEGGMRIPLI 360 Query: 378 MWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 + W GK+QP + ++ D+ PT L+ + +DG+S L +K P + Sbjct: 361 VRWHGKIQPNSTSDRVTGFEDWLPTLLELSGNKKSVPTGIDGLSFASTLLGEK--LPERP 418 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 + ++ +R ++ V Sbjct: 419 FLYREFPAY----------------------------------GGQQAIRVGNWKAVRQH 444 Query: 497 --------ENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR-EFIDSSQPPLSEVNQ 546 N + LY L TD+ + +++ +P +V ++ ++R + I S P +++ Sbjct: 445 LKPKGNAKPNLHIELYDLQTDIAESHDVSDEHPDIVTKLDNLMREQHIPSKAFPFPALDK 504 Query: 547 E 547 Sbjct: 505 P 505 >UniRef50_A6DHY0 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHY0_9BACT Length = 507 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 114/499 (22%), Positives = 185/499 (37%), Gaps = 99/499 (19%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K N + + DD GYG F+ TP L Sbjct: 17 SEKLNYVFMMTDDQGYGDTGFNG--------------------------HKIIKTPHLDQ 50 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 + EG + T Y V P+R +TGR R+G++ +P E L + + Sbjct: 51 MAKEGAKLTQFYAGGPVCSPTRGTYLTGRHYYRYGIWGANVG--HLPKEEITLASVLKQQ 108 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA--- 231 GY T GKWHL + N ++R +NF P R +D ++ Sbjct: 109 GYVTGHFGKWHLGTL-NKDYSTKGESRKPTENFAP-------PWERDYDESFVVESSVST 160 Query: 232 ------GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 +Y + K E G + + D+AI ++RA + PF+ + +NAP Sbjct: 161 WDPASEKNPFYINGVPMKGTEESLYGG-AARVVVDKAIPFMERAVSEGNPFLAVVWFNAP 219 Query: 286 HLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 H P A +Y + + + A +YY + +D+ V RI +L++ G NT++ F SD Sbjct: 220 HEPI--KAGPKYLEMYKEHGE-AAHYYGCLTEMDEQVGRIRAKLREMGVEKNTVLFFCSD 276 Query: 346 NGAV----IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYP 400 NG +G K Y GG P W GK+Q G+ D +S +D+ P Sbjct: 277 NGPEGKKAKGAKAGTTSGLRGRKRSLYDGGVRVPALAEWPGKIQAGSVIDAAMSTLDYLP 336 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 T + + +P + LDG ++L L ++ Sbjct: 337 TVIALQNHQMPDERPLDGENILALLTGEESQRKR-------------------------- 370 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAAN 519 + + + DY LVY LY L D +++N+A+ Sbjct: 371 -------------GIPFIHRGKAVLNRGDYKLVY-----PKELYALSNDWSEENNIASQY 412 Query: 520 PQVVKEMQGVVREFIDSSQ 538 P++V EM + F+ S + Sbjct: 413 PEIVAEMSKELEAFVLSMK 431 >UniRef50_A9BNY8 Sulfatase n=11 Tax=cellular organisms RepID=A9BNY8_DELAS Length = 457 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 123/529 (23%), Positives = 190/529 (35%), Gaps = 112/529 (21%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 + A TE +PNI+ + DDLGY L G + Sbjct: 1 MSAAASRPQPCTERICMSRPNILFIVADDLGYADLGCYGGRAADFGAVS----------- 49 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------- 150 P L L G+R T GY V P+R A+ T R R Sbjct: 50 -----------PVLDRLAAGGLRLTQGYANSPVCSPTRFALATARYQYRLRGAAEEPINS 98 Query: 151 ---YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + + G+P + + ++ GY TA +GKWHL Sbjct: 99 KTRGTPLGEKLGLPPDMPTVASMLRDAGYRTALIGKWHLG-------------------- 138 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYISDQLTDE 261 + P G++ + G + G Y+ L+ E +GY++D L+ Sbjct: 139 ---YPPHFGPLRSGYEEYFGPMSGGVDYFTHLSSSGQHDLWVGEEEHHDEGYLTDLLSQR 195 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP-----APDQYQKQFNTGSQTADNYYASVY 316 ++ V R D PF L L Y APH P + A + Y ++ Sbjct: 196 SVDFVHRMAQGDAPFFLSLHYTAPHWPWETRDDRSTAEALGAGIAHLDGGNIHQYRRMIH 255 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPM 376 +D+G+ I+E L+ NGQ DNT+I+FTSDNG N G K GG P Sbjct: 256 HMDEGIGWIVEALRANGQLDNTLIVFTSDNGGER---FSDNWPLVGGKMDLTEGGIRVPW 312 Query: 377 FMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 W + PG + +MD+ T LDAA + P+ LDG+SLLP L+ + P + Sbjct: 313 IAHWPAVIAPGRSSPQHCMSMDWSATVLDAAGVQAPEGHALDGISLLPVLRAEDAEFP-R 371 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 L W + + +R+ D+ + Sbjct: 372 TLHWRMKH------------------------------------RGQRALRDGDWKYLRV 395 Query: 496 VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + L+ L D +++ N AA P+ + M+ ++ PP+ E Sbjct: 396 --DGIDYLFDLAADERERANQAARAPERLAAMRSAWEDWNQGM-PPIPE 441 >UniRef50_Q7UYW2 Arylsulfatase (A or B) n=2 Tax=Planctomycetaceae RepID=Q7UYW2_RHOBA Length = 484 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 122/556 (21%), Positives = 202/556 (36%), Gaps = 105/556 (18%) Query: 20 SGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGS 79 S + FA A L A+ ++ A +D + +PNI+ + DDL + L Sbjct: 2 SNPSFFAKAVAAFCCLVASDSHFAVAD-------AENRPNILFILADDLAWSDLGCYG-- 52 Query: 80 FDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAI 139 TP L L EG RFT Y + SRA+I Sbjct: 53 ------------------------HPWHDTPHLDRLASEGARFTQAYSPAPICSASRASI 88 Query: 140 MTGRAPARFG---VYSNTDAQD---------------GIPLTETFLPELFQNHGYYTAAV 181 +TG+ PAR V N + +PL E + E ++ GY TA Sbjct: 89 LTGKTPARLQFEFVTKNEPGRQKIDFPTPMSAPPLTLNLPLEEQTIAECLKDEGYQTAFF 148 Query: 182 GKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSL 241 GKWH+S H + P +GF++ A Y P Sbjct: 149 GKWHVSS---------------HHERYLGWSPTHGPAKQGFEF------AEEDYGAHPYD 187 Query: 242 FKNR---ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP---APD 295 +K + D + + + D+P+ + H P P + Sbjct: 188 WKRSPVATIKEPGRFAPDSMVQRVGAFLRQDH--DRPYFAMASSFYVHTPVRTPCQWLRE 245 Query: 296 QYQKQFNTGSQTADN---YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 +Y + S+ +N Y A + + D V +IL L+ +G+ D TI++ SDNG + Sbjct: 246 KYDARVPATSKKRNNRIEYAAFLETFDHHVGQILNSLEASGRADRTIVILNSDNGGHPE- 304 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIP 411 N +G K Y GG PM + W G +QP D+ + D PT + A + P Sbjct: 305 -YTANAPLRGSKWNLYEGGIRVPMIVRWPGVVQPKTEIDRPVIGYDLLPTMVALAGGNPP 363 Query: 412 KDLKLDGVSLLPWLQDKKQGE-PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 K DG S L+ +L W Y H + D Sbjct: 364 K---CDGESFAGSLRGDSPPTNEQHSLIWHFPYYHPEN------------GFAKAPDSIG 408 Query: 471 HNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGV 529 + ++ +R + L+ E+N++ LY L D+ + +L+ + E+Q Sbjct: 409 IDDFATSRTRPQSAIRRGQFKLLQFAEDNRVELYDLSNDIGELHDLSTQQADLASELQQE 468 Query: 530 VREFI--DSSQPPLSE 543 +R+ + +++ P+++ Sbjct: 469 LRQTLTRQNARFPMAK 484 >UniRef50_Q2GB51 Sulfatase n=6 Tax=Proteobacteria RepID=Q2GB51_NOVAD Length = 491 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 119/527 (22%), Positives = 180/527 (34%), Gaps = 114/527 (21%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L + +PNI+ + DDLGY L Sbjct: 33 LGGAAATMVLGAAPAIASKRARRPNILYIMADDLGYADLSCYG----------------- 75 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP L L +G+RFTN Y V +R ++TGR R V Sbjct: 76 ---------RRDFETPVLDKLAAQGLRFTNAYANSAVCTATRVGLITGRYQYRLPVGLEE 126 Query: 155 D----AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 G+P + LP L GY T+ +GKWHL + Sbjct: 127 PLAFRPNIGLPPSHPTLPSLLAKAGYRTSLIGKWHLGSL--------------------- 165 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYISDQLTDEAIG 264 ++ P G+ F G + G YY P L+ V GY++D L D A+ Sbjct: 166 --PDFDPLKSGYQTFWGIRSGGVDYYTHATSNGQPDLWDGPTPVERAGYLTDLLADRAVS 223 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ----------FNTGSQTADNYYAS 314 + A + + P+ + L + APH P + P + F+ +A Y A Sbjct: 224 EIREASSGEAPWFMSLHFTAPHWPWEGPDDASESARIAKLKDPSALFHFDGGSAAIYAAM 283 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 V +D + R+LE LK N +TI++FTSDNG G K++ GG Sbjct: 284 VRRLDYQIGRVLEALKANRAEQDTIVVFTSDNGGER---FSDTWPFSGRKTELLEGGLRI 340 Query: 375 PMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P + W G + G D I +MD+ PT L AA + DGV + P L E Sbjct: 341 PAIVRWPGVTRAGTTSDAQIISMDWLPTFLAAAGSAPDPGHPSDGVDVTPALGGGSLAE- 399 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 + L W ++ VR + + Sbjct: 400 -RALFWRY------------------------------------KNRAQRAVRRGNLKYL 422 Query: 494 YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 EN L+ + D ++ NL P+ ++ ++ + P Sbjct: 423 RIAENE--FLFDVAADPLERANLKDRQPEDFAALKAAWEKWNATMLP 467 >UniRef50_A6KZI6 Sulfatase n=6 Tax=Bacteroides RepID=A6KZI6_BACV8 Length = 473 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 121/527 (22%), Positives = 200/527 (37%), Gaps = 119/527 (22%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 TK KPNI+ + DDLG+ L + TP + Sbjct: 28 TKEKPNIVFILADDLGWTDLGVMGSDY--------------------------YETPNID 61 Query: 114 SLMDEGVRFT------NGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-------- 159 L EG+ F +MTG R GVY+ + G Sbjct: 62 RLATEGILFDNAYAAAANSAPSRAC------MMTGMYTPRHGVYTVSPPDRGDRTKRKYI 115 Query: 160 -IPLTE------TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 IP E + E Q GY +GKWHL Sbjct: 116 AIPNVEDVCADFVTMAEALQEQGYQCGHIGKWHLGD----------------------DE 153 Query: 213 EEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKNRERVPA-------KGYISDQLTDEAIG 264 + P ++GF + +G + AG Y Y P ++ + Y++D+LT+EA+ Sbjct: 154 DGTGPLSQGFIWNVGGNRAGAPYSYFYPYCLPDKSKCHVGLEEGILGEYLTDRLTEEAVS 213 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGV 322 + PF L+L+++A H P ++Y+ + Y A + +D V Sbjct: 214 FIKSHSEG--PFFLHLSHHAVHTVLQAPDSLINKYRNKTPGKYHKNPIYAAMIEKLDDSV 271 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 RI + +K G D TI++F SDNG P+ N G K Y GG+ P+ + W G Sbjct: 272 GRICQVIKTLGIADRTIVIFYSDNGGSE--PVTDNYPLNGGKGMPYEGGSRVPLIIRWTG 329 Query: 383 KLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 K++ G I+ +DFYPT + A IP + LDG + + + E ++L W Sbjct: 330 KIEGGIRSSVPITGVDFYPTFVTLAQGKIPAN--LDGKDIFTLINN---NETERDLFW-- 382 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 + P + + P+ ++R+ D+ L+Y E+ + Sbjct: 383 ---------HFPAYLESYLNGGRDFRAKPY-----------SSIRSGDWKLIYHYEDKSM 422 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQ 546 L+ L DL + +L+ +NP E+ + ++I + P+ ++N Sbjct: 423 ELFNLKNDLGESQDLSGSNPVKRGELYQKLMKWIQETHAPIPVKLNP 469 >UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYA6_RHOBA Length = 490 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 122/504 (24%), Positives = 189/504 (37%), Gaps = 90/504 (17%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 + PN +V+ DD GY + TP Sbjct: 17 FCVAAPPNFVVIFTDDQGYEDVGCFGS--------------------------PDIRTPR 50 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPEL 170 L ++ G++FT+ Y A + GPSRAA+MTG P R +T + E + E+ Sbjct: 51 LDAMAKGGMKFTSFY-AQPICGPSRAALMTGCYPMRVAERGHTKQIHPILHEDEVTIAEV 109 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 + GY +A GKW L+K + D P +GFDYF G Sbjct: 110 LKTKGYASACFGKWDLAKHAQSGFFSD-----------------LLPTGQGFDYFYGTPT 152 Query: 231 AGTAYYNSPSLFKNRERVPAK---GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 + N L++N E + + ++ + TDEAI +++ + +QPF +Y+ + PH Sbjct: 153 SNDRVAN---LYRNEELIEPESDMATLTRRYTDEAISFIEKNQ--NQPFFVYIPHTMPHT 207 Query: 288 PNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 D + G Y + +D V RIL+ L + DNT +LFTSDNG Sbjct: 208 RLDA-------SKDFKGKSKRGLYGDVIEEIDFNVGRILDSLNELNLADNTYVLFTSDNG 260 Query: 348 AV------------IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLIS 394 + G + K T+ GG P +W GK+ G D + + Sbjct: 261 PWLVKNKGHADGHRLGDHGGSAGPLRSGKVSTFEGGVRVPAILWAPGKVPAGTVCDSIAT 320 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIP 453 MD PT A IP D +DG + + + +P K Sbjct: 321 TMDVMPTLAALAGAEIPTDRVIDGEDIRHLFHGEFDKADPDKAFF--------------- 365 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQK 512 ++ H Q H P ++ + RN + + Q L L DL + Sbjct: 366 YYLRVHLQAVRQGKWKLHLPREKEPVGAAPFGRNAHIAPKDRIGFKQPFLVDLDNDLGET 425 Query: 513 DNLAAANPQVVKEMQGVVREFIDS 536 N+AA NP+VV+ + G+ D Sbjct: 426 TNVAAENPEVVERLLGLAESMRDD 449 >UniRef50_UPI0001745D5D N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745D5D Length = 562 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 118/502 (23%), Positives = 184/502 (36%), Gaps = 104/502 (20%) Query: 60 IIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEG 119 II + DDL G L G + K TP L + EG Sbjct: 120 IIYILSDDLAQGDL----GCYGQKL----------------------IKTPNLDRMAAEG 153 Query: 120 VRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----TDAQDGIPLTETFLPELFQNHG 175 RFT Y V PSR+++MTG + +N + Q +P + ++ + G Sbjct: 154 TRFTQAYCGTSVCAPSRSSLMTGLHMGHCPIRANREIKPEGQMPLPADTLTVAQVLKGAG 213 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y TA VGKW + P +GFD+F G + A+ Sbjct: 214 YATACVGKWGMGMFDTTG----------------------SPLKKGFDHFYGHNCQRKAH 251 Query: 236 -YNSPSLFKNRERVPAKG--YISDQLTDEAIGVVDRA--KTLDQPFMLYLAYNAPHLPND 290 Y P ++ + ++V G Y+ D +E++ V K DQPF L+ A PH Sbjct: 252 NYFPPYIWNDDQQVALDGKTYVQDLFANESLKWVREQKRKAPDQPFFLFYAITLPHGDYQ 311 Query: 291 NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 Y Q + T Y A V +D V R+L+ LK+ +NT+++ + DNG+ Sbjct: 312 TDNLGIYADQ-KDWTPTQKAYAAMVTRLDSDVGRLLDLLKELKIDENTLVMTSGDNGSSF 370 Query: 351 D--------GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPT 401 + G +G+K Y GG W G + G D+ + DF PT Sbjct: 371 PPDSELGRLFDQAMGGKLRGFKRGMYEGGLRQASIARWPGAIPAGRVSDEPWAFWDFLPT 430 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 A D A +P K DG+SL+ +L+ + W + Sbjct: 431 AADLAGAKLPSGYKPDGLSLVSFLKGGPAP-RREYFYWELHENASLQALRF--------- 480 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANP 520 + ++ V N + LY L TD + N+AAA+P Sbjct: 481 -------------------------DQNWKAVRNGPNQPVELYDLATDESEAHNVAAAHP 515 Query: 521 QVVKEMQGVVR-EFIDSSQPPL 541 V +++ +DS+ P+ Sbjct: 516 DRVTRALELMKSARVDSADFPM 537 >UniRef50_Q7UER7 Sulfatase 1 n=8 Tax=Bacteria RepID=Q7UER7_RHOBA Length = 553 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 122/552 (22%), Positives = 207/552 (37%), Gaps = 126/552 (22%) Query: 40 TNVAFSDFTPTEYSTKG-KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 + F+ + T+ + + +PN++++ +DDLG + + F Sbjct: 39 ATITFACLSITQANAQDDRPNVVLILVDDLGLHDIGIEGSKFH----------------- 81 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD 158 TP + +L G+RFT GY V PSRA+I G+ AR G+ A+ Sbjct: 82 ---------QTPHIDALAKRGMRFTAGYANCRVCSPSRASIQLGQFTARHGITDWIGAKT 132 Query: 159 G-----------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 G +P + LPE + GY T GKWHL ++ Sbjct: 133 GMDFNRGDELLPAEYVHAMPAKDVTLPEALRESGYKTFFAGKWHLGGEGSM--------- 183 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNR--ERVPAKGYISDQLT 259 P + GFD +G H G+ + FKN E P ++ +L Sbjct: 184 ---------------PTDHGFDINIGGHHRGSPPGGFFAPFKNPVMEDGPDGESLTRRLG 228 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPND------------NPAPDQYQKQFNTGS-- 305 E ++ DQP+ L++ A H P PAP +F Sbjct: 229 KETASFIEGQ--DDQPYFAMLSFYAVHGPIQTTQELWQKYRESAPAPPADGNRFKIDRTL 286 Query: 306 -----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL--PLNG 358 Q Y + ++D V ++ ++ +G+ DNT+++FT DNG V G N Sbjct: 287 PVRQIQDNPVYAGMMETLDNAVGDVMAAIEASGKADNTLVIFTGDNGGVSSGDAYSTSNL 346 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLD 417 +G K + + GG P ++ + D + D YPT LD ++ + +D Sbjct: 347 PHRGGKGRQWEGGLREPYYVSMPAIVPENSTSDVPVIGSDLYPTILDVCNLPLRPQQHID 406 Query: 418 GVSLLPWLQDKKQG-EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTE 476 G SL L K ++L W Y H+ ++ P Sbjct: 407 GRSLETVLAGGKDELLEQRSLIW--HYPHYGNQGGEP----------------------- 441 Query: 477 DLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFID 535 S +R DY L++ ++ LY L TD+ ++++LA+ P+ V M+ + ++ Sbjct: 442 -----SSVIRTGDYKLIHYHLDSHDELYHLPTDIGEQNDLASEQPERVAAMRKELLAYLK 496 Query: 536 SSQPPLSEVNQE 547 S + + Sbjct: 497 SVDAKFPQPDPR 508 >UniRef50_A6DMV0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMV0_9BACT Length = 443 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 125/526 (23%), Positives = 200/526 (38%), Gaps = 129/526 (24%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 T + KPNI+ + +DD GY + A T Sbjct: 12 TTLVAQDKPNIVFIIIDDFGYAD--------------------------SEPYGAKDIKT 45 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-----YSNTDAQ------- 157 P + L +G++FTN Y V P+R A +TGR R G Y T++Q Sbjct: 46 PGINELAKDGLKFTNFYANAPVCSPTRCAFITGRWQQRSGFEWALGYGGTNSQLKNGQYE 105 Query: 158 -------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 G+ + LP+L + GY T A GKWHL Sbjct: 106 AVTDIHGIGLLPEKNHLPKLLKKAGYKTGAFGKWHLG----------------------- 142 Query: 211 SAEEWQPQNRGFDYFMGFHAAG------TAYYNSPSLFKNRERVPAKGYISDQLTDEAIG 264 S +++ P + GFD + G Y ++ +L + + + GY++ + + A+ Sbjct: 143 SQDKFNPIHHGFDEYYGPLLGHCDYYTYKYYDDTYTLREGAKVIKDSGYLTTNINERAVD 202 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF---NTGSQTADNYYASVYSVDQG 321 +DR D+PF +Y+ + A H P + D+ KQ N +Y A V VD+G Sbjct: 203 FIDRH--ADKPFFMYVPHMAVHSPYQSA--DKKPKQITKTNLNDGNRADYAAMVEEVDKG 258 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 V+ I+ +LK+ + T+ + +SDNG N K+ + GG P M W Sbjct: 259 VEMIIAKLKEKKIFHKTLFVVSSDNGGA---HFSDNAPLFHRKTTLFEGGIRVPCIMHWP 315 Query: 382 GKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 K+ G D++ MD T L A I P DG++LLP + DK + + L W Sbjct: 316 EKIGKGVVSDQIAITMDLSKTFLALAGIDEPS---YDGINLLPMMTDKN-NKVERTLFWR 371 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + ++ VR + + V N Sbjct: 372 ----------------------------------SNSKARRQKAVRMGKWKYILDV--NC 395 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF---IDSSQPPLS 542 LY L D+ + NL P++V++M+ + + +D QPP Sbjct: 396 ELLYNLENDIAENKNLFYQRPEIVQQMKQKLASWEREMDQHQPPFK 441 >UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT Length = 472 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 117/550 (21%), Positives = 209/550 (38%), Gaps = 122/550 (22%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 K + P YS KPN I++ DD GYG L F+P+ Sbjct: 3 KLLITLLSLIPLVYSNDIKPNFIIIFTDDQGYGDLS----CFNPQ--------------- 43 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDA 156 TP + + EG++F N YV+ V SRAA++TG R G+ S Sbjct: 44 -------GVQTPHIDQMATEGMKFNNFYVSAAVCSASRAALLTGTYNDRIGIKSAFFPGT 96 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + G+ E + EL + Y TA GKWHL ++ +P + Y +S + + Sbjct: 97 KQGLHPDEITIAELLKEQNYATACFGKWHLGDEPSL-LPSAQGFDTYFG--IPYSNDMFI 153 Query: 217 PQNRGFDYFMGFHA------------------------AGTAYYNSPSLFKNRERVP--- 249 ++ F F+ + Y + + + V Sbjct: 154 APHQTFAENAKFNGDWTLEKAKELQKFIAPHVNKRGPIWKSEYKALVPILEGEQIVEFPA 213 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD 309 + ++ + D I +D+ + ++PF ++L PH+P + + G Sbjct: 214 DQASLTQRYFDRTIKFIDKNQ--NKPFFIFLTPAMPHVPL-------FASKEFRGKSKKG 264 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--LPLNGAQKGYKSQT 367 Y + +D R+++ LK+ NT+++FTSDNG + +G + K + Sbjct: 265 LYGDVIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNGPWLSYGDEGGSSGPLRDGKFTS 324 Query: 368 YPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y GG P W G ++ + ++L S +D PT + +P+D K+DG + P L+ Sbjct: 325 YEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTFAQLVNTQVPQDRKIDGKDISPLLK 384 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + H++L + VR Sbjct: 385 SQNHVI-HRHLFFRDE-----------------------------------------AVR 402 Query: 487 NNDYSLV-----YTVENNQL-GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFID---S 536 + D+ LV T+ L LY L D+ + +NL +P+V + +Q + E + Sbjct: 403 SGDWKLVVKEHHMTMRKGPLPALYNLKNDVAESNNLIDTHPKVAQYLQSKLDEHLKDLNE 462 Query: 537 SQPPLSEVNQ 546 + P++++N+ Sbjct: 463 NSRPMADLNE 472 >UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO Length = 503 Score = 381 bits (978), Expect = e-104, Method: Composition-based stats. Identities = 121/545 (22%), Positives = 218/545 (40%), Gaps = 82/545 (15%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYST---KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + L A T A +D + T G+PNI+ + DD G+ + + Sbjct: 2 CKCLLVLIAIITACAVADQSSASAGTRQDAGQPNIVFVLADDFGFHDVGYH--------- 52 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 ++ TPTL +L GVR N YV + P+R+ +M+GR Sbjct: 53 ------------------GSEIHTPTLDALSASGVRLENYYV-QPICTPTRSQLMSGRYQ 93 Query: 146 ARFGVYS---NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 G+ N+ + +P L + + GY T VGKWHL Sbjct: 94 IHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLG--------------- 138 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK-----------NRERVPAK 251 F +E+ P NRGFD + G+ A Y+N ++ R Sbjct: 139 -------FYKQEYLPWNRGFDTYFGYLNAAEDYFNHNVPWRQVRYLDLRDNNGPVRNETG 191 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADN 310 Y + T +AI VV ++ +P LYLAY + H P + P++Y+ ++ N + Sbjct: 192 QYSAHLFTGKAIDVV-QSHNTSKPLFLYLAYQSVHAPLE--VPEKYEHKYRNITDKNRRT 248 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPG 370 + V ++D+GV + + LK G ++NT+++F++DNG I N +G+K+ + G Sbjct: 249 FAGMVSALDEGVANLTQALKDKGLWNNTVLIFSTDNGGQIHAG-GNNYPLRGWKASLWEG 307 Query: 371 GTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G H F+ + G K LI D++PT + A ++ LDG + + ++ Sbjct: 308 GFHGVGFVSGGALKRSGAVSKGLIHVSDWFPTLVTLAGGNLNGTKPLDGFNQWDTISNET 367 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 P + L + + + ++ +P + N + V D Sbjct: 368 -PSPREIL--LHNIDILYPQKGVPLYSNTWDTRVRAAIRVGDYKLITGDPGNGSWVPPPD 424 Query: 490 YSLVYTVE-----NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 L + E + L+ +T D + ++L++ P V + ++ +F +++ PP Sbjct: 425 GHLYFVPEIQESAAKNVWLFNITADPNEHNDLSSEKPLEVLRLLQILVQFNNTAVPPRYP 484 Query: 544 VNQEK 548 + Sbjct: 485 APDPR 489 >UniRef50_B5CWC8 Putative uncharacterized protein n=1 Tax=Bacteroides plebeius DSM 17135 RepID=B5CWC8_9BACE Length = 493 Score = 380 bits (976), Expect = e-104, Method: Composition-based stats. Identities = 127/533 (23%), Positives = 187/533 (35%), Gaps = 129/533 (24%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 + KPNII +DD+G G L + TP + Sbjct: 28 EQKPNIIYFLVDDMGMGDLSLTG--------------------------QKKYETPNIDK 61 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 L +G+ FTN Y VSGPSRA +MTG+ V N + E L + + Sbjct: 62 LAADGMLFTNHYCGTTVSGPSRACLMTGKHTGHTSVRGNQPGPQLLGDNEATLASVLKGA 121 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY TA +GKW + P+P D PQ +GFD G+ A Sbjct: 122 GYKTAVIGKWGIGH----PIPLD------------------DPQRKGFDLSYGYLNMWHA 159 Query: 235 YYNSPS-LFKNRERVP------------------------------AKGYISDQLTDEAI 263 + P L++N + K Y D EA+ Sbjct: 160 HNCFPEFLYRNGVKEELTGNKLALAEDGTNPWADMPEGTGVARMDARKQYAPDLFEKEAL 219 Query: 264 GVVDRAKTLDQPFMLYLAYNAPH-----LPNDNPAPDQYQK-QFNTGSQTADNYYASVYS 317 + K PF +Y A N PH PN P + + + Sbjct: 220 KFISDNK--KNPFFIYYALNLPHANNEAAPNGCEVPSYNADIAAKDWPEVEKGFAQMMQI 277 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-----LNGAQKGYKSQTYPGGT 372 +D+ V ++ L+K G DNTII+F SDNG +G N +G K + GG Sbjct: 278 IDKQVGDLVAYLEKEGLADNTIIMFASDNGPHQEGGHKVDFFDSNADLRGKKRDMWDGGI 337 Query: 373 HTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKK-Q 430 TP + W GK++ G+ +SA D PT D A + P +DG+SLLP L + Sbjct: 338 RTPFIVKWPGKVKAGSTSNHLSAFWDVLPTFCDIAKVEKPAG--IDGLSLLPTLLGDTAK 395 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 E HK L + W VR P + Sbjct: 396 QEKHKYLYFEFYEEGGKQAVVADNWKYIKLNVRQGKGAKPVETS---------------- 439 Query: 491 SLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 LY+LT D+ ++ ++ +P++V+ ++ +I P Sbjct: 440 ------------LYRLTDDVSEQKDVKEEHPEMVE----IMEGYIKEGHTPFP 476 >UniRef50_D0PR10 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR10_9SPHI Length = 607 Score = 380 bits (976), Expect = e-104, Method: Composition-based stats. Identities = 119/529 (22%), Positives = 192/529 (36%), Gaps = 106/529 (20%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V FS S KPN+I++ DD+GYG + Sbjct: 15 VFFSFLYIKSCSDIDKPNVIIILTDDMGYGDIAAHG------------------------ 50 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 STP + L DE +R TN +V PSR+A+MTG+ R GV+ + + Sbjct: 51 --NKDISTPHIDQLHDESLRLTNFHVN-PTCAPSRSALMTGKDANRVGVWHTVMGRSLLY 107 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 E + ++F + Y T GKWHL + PQ RG Sbjct: 108 EEEETMADIFSANNYATGLFGKWHLGDNY-----------------------PFAPQYRG 144 Query: 222 FDYFMGFHAAGT----AYYNSPS----LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 F + G Y+N+ +N + +GY +D EA+ + K + Sbjct: 145 FQEVLTHGGGGVGQTPDYWNNDYFDDVYLRNGQEEKFEGYCTDVWFREALTFIKENK--E 202 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 PF+ Y++ NAPH P + P+ + +Y + ++D + + ++L++ G Sbjct: 203 NPFLCYISTNAPHTPLNVPSSYAEPYLKKGIQEDRAKFYGMISNIDDNIGLLRKKLEEWG 262 Query: 334 QYDNTIILFTSDNGAVIDGPL-------PLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQ 385 DNTI++F SDNG L N +G K Y GG P +++WK G L Sbjct: 263 IADNTILIFMSDNGTANGATLKGKQLLSGYNANMRGVKGSPYDGGHRVPFYVYWKNGNLN 322 Query: 386 PG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG-EPHKNLTWITSY 443 G + ++L + +D PT + +S + DG+ L + + ++ L Sbjct: 323 HGMDINQLTAHIDVLPTLIKMCGLSNVPTINFDGIDLSQIFLGSDENLDVNRILI----- 377 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 E W N + V + L+ N L Sbjct: 378 GDSQRLETPKKWRNSY-------------------------VMMGQWRLI-----NGTEL 407 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 Y L D Q N+ Q+VK+++ + P K N Sbjct: 408 YNLKRDPSQVKNVFDLEHQIVKQLKEAYEKHWAEISPSFHRFAYIKLGN 456 >UniRef50_UPI0001745666 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745666 Length = 497 Score = 379 bits (975), Expect = e-103, Method: Composition-based stats. Identities = 127/531 (23%), Positives = 199/531 (37%), Gaps = 112/531 (21%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 +PNII + +DD+GYG L G F KT TP + Sbjct: 34 AADRPNIIYILVDDMGYGDL----GCFGQKTFT----------------------TPHID 67 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 + EG++ T Y V PSR ++TG V N +P ++ +P L + Sbjct: 68 RMAAEGMKLTRHYAGSTVCAPSRCVLLTGLHTGHCRVRGN--GLWTMPDSDVTVPNLLKQ 125 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 GY TA GK+ L K P+P+D P +GFD F G+ Sbjct: 126 AGYATACFGKYGLGK----PLPDD------------------DPNRKGFDTFFGYVDTSH 163 Query: 234 AYYNSP-SLFKNRERV---------------PAKGYIS---------DQLTDEAIGVVDR 268 A+ P L +N +RV G+ + + DE + Sbjct: 164 AHNFYPTYLIRNGQRVALNNVTEPGSRKAGHEDTGFATVDGRRQFAPQLIADELQTYLRD 223 Query: 269 AKTLDQPFMLYLAYNAPHLPNDN----------PAPDQYQKQFNTGSQTADNYYASVYSV 318 QPF +Y A N PH N+ P + + +++ V Sbjct: 224 RAAGKQPFFVYYALNMPHANNEAGKNSPLKHGMEVPSYGEYANKDWPDVEKGFASAMRFV 283 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-----LNGAQKGYKSQTYPGGTH 373 D V +L LKK G NT+++FTSDNG +G NGA G K GG Sbjct: 284 DDQVGAVLAALKKAGLDQNTLVMFTSDNGPHAEGGHSSDFFDSNGAFSGIKRSMTDGGIR 343 Query: 374 TPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKK-QG 431 P+ W ++ + +S D PT D A + + + DG+SL+P L K + Sbjct: 344 VPLVARWPAAIKARGESEHVSGFQDLLPTVADLAGAKL--EGETDGLSLVPTLTGKDGEQ 401 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + HK L W ++ + + W K + + N Q Sbjct: 402 KQHKYLFW--NFDEQGGKRAVLRW--PWKLIHLNTGTARMGQNAGGKPQPVQP------- 450 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE-FIDSSQPP 540 ++ ++ LY L D+ +++NLA+ P +V E++G ++E + P Sbjct: 451 -----KSLEVQLYNLEEDVGEQNNLASLQPGIVSELEGYMKEAWRAPQTQP 496 >UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D764_9BACT Length = 499 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 123/567 (21%), Positives = 192/567 (33%), Gaps = 131/567 (23%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + KPN I++ +DD+GY + + Sbjct: 1 MKPLRFLFSTLCLLAGAALAADKPNFIIINIDDMGYADIAPFGSKLN------------- 47 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF----GV 150 TP L + EG + T Y V PSR+A+MTG P R V Sbjct: 48 -------------RTPNLDRMAQEGRKLTCFY-GAPVCSPSRSALMTGCYPKRVLPIPSV 93 Query: 151 YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 A G+ E + EL + GY T +GKWHL Sbjct: 94 L-FPGAAVGLNPAEHTVAELLKKSGYATGCIGKWHLGDQ--------------------- 131 Query: 211 SAEEWQPQNRGFDYFMGFHAAG-----------------------------------TAY 235 E+ P RGFDY++G + T Sbjct: 132 --PEFLPPRRGFDYYLGLPYSNDMGPGEDGSKSSLGDPIPKPKATPNPSAPIPETGITGN 189 Query: 236 YNSPSLFKNRE-----RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND 290 + +N + R + + D+ T A+ + K D+PF LYL +NA H P Sbjct: 190 QPPLPMLENEKVIARVRQDEQQGLVDRYTKAAVKFITEHK--DKPFFLYLPHNAVHFPI- 246 Query: 291 NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 Y + G Y V VD V ++L L++ D+T +LFTSDNG Sbjct: 247 ------YPGKEWAGKSPNGYYSDWVEQVDWSVGQVLNTLRELKLQDHTFVLFTSDNGGT- 299 Query: 351 DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADIS 409 P +N +G+K+ T+ GG P WW GK+ G D++ D PT ++ A Sbjct: 300 --PRAVNAPLRGFKTTTWEGGMREPTIAWWPGKIPGGTSSDEITGMFDILPTLVNLAGGE 357 Query: 410 IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY 469 +P D K+DG ++ P L + + + + + P+ + + Sbjct: 358 VPTDHKIDGGNIWPVLAGEAGAKSPHEVFYYFNGLRLEGVRTGPWKLRFGSAGLAEGKGP 417 Query: 470 PHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQG 528 P Q LY L TD+ + N+A A+P VV ++ Sbjct: 418 VKKPAAPIPDQ----------------------LYNLQTDIGETTNVADAHPDVVAHLRE 455 Query: 529 VVREFIDSSQPPLSEVNQEKFNNIKKA 555 + D ++ Sbjct: 456 LADAMKDDLGRDGKGPGVRPLGRVENP 482 >UniRef50_A6LIX6 N-acetylgalactosamine 6-sulfatase n=2 Tax=Bacteroidales RepID=A6LIX6_PARD8 Length = 589 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 131/531 (24%), Positives = 200/531 (37%), Gaps = 113/531 (21%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 KL + + ++ K PNIIV+ DD G+G L F +F Sbjct: 2 TKLGVLFSVLGVGAGCIPAFAQKQLPNIIVMLSDDQGWGDLGFTGNTF------------ 49 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP + + EG N YV VS P+RA +TGR R GV S Sbjct: 50 --------------VQTPNIDRIAHEGTILENFYV-CPVSSPTRAEFLTGRYHVRSGVNS 94 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 T + L E + E F+ GY T+ GKWH Sbjct: 95 TTGGGERFNLGEKTIAEYFREAGYATSLFGKWHSGTQY---------------------- 132 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTL 272 + P RGF+ F GF + Y +P L N E + +G+I D LTD+A+ + K Sbjct: 133 -PYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGEGFIIDDLTDKALDYIRDHK-- 189 Query: 273 DQPFMLYLAYNAPHLPNDNP------APDQYQKQFNTGSQTADNYY-----ASVYSVDQG 321 + PF ++L+YN PH P P D+ Q T + D + A ++D Sbjct: 190 EHPFFMFLSYNTPHSPMQVPDSWWNRVKDRTLSQRATFPEQEDTTFTKAALALAENLDWN 249 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 + R+L L TI+++ SDNG NG KG K T GG +P + W Sbjct: 250 IGRVLSLLHSLDLEQETIVIYFSDNGP---NSFRWNGGMKGRKGSTDEGGVRSPFCIRWP 306 Query: 382 GKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 G ++ G + +L A+D PT L A I KLDG+ L D+K + L Sbjct: 307 GHIRKGAVETQLSGAIDLIPTLLGLAGIEYTPLRKLDGIDWGQRLLDEKAPAIDRVL--- 363 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 YS+W + ++ + Y + + + Sbjct: 364 --YSYWGGKTSV--------------------------------------RISYYLLDAE 383 Query: 501 LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI-DSSQPPLSEVNQEKF 549 LYK D +Q+ +++ P++ + M+ + D + + F Sbjct: 384 DHLYKTDIDREQRKDVSDKEPEIYERMK-RYSNWFKDELLADFPKKDTRPF 433 >UniRef50_A3HZ22 Putative exported uslfatase n=1 Tax=Algoriphagus sp. PR1 RepID=A3HZ22_9SPHI Length = 489 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 133/528 (25%), Positives = 214/528 (40%), Gaps = 129/528 (24%) Query: 59 NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDE 118 N++++ +DDLG+ + F TP + L E Sbjct: 44 NVLIIHVDDLGWADIEPLGSDF--------------------------YETPNITKLAKE 77 Query: 119 GVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG------------------- 159 G+ FTN Y A + P+RAA++TG+ PAR G+ A+ Sbjct: 78 GILFTNSYAAAAICSPTRAALLTGKYPARLGITDWIRAKFNQNSTSGLPGEYEVFENKPL 137 Query: 160 --------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 +PL E + E + HGY T VGKWHL + Sbjct: 138 KTPKIQGFLPLEEITIAERMKAHGYGTLHVGKWHLGE----------------------- 174 Query: 212 AEEWQPQNRGFDYFMGFHAAGT--AYYNSPSLFKNRE--------RVPAKGYISDQLTDE 261 E + P+++GFD +G + G +Y++ K RE +++D+ DE Sbjct: 175 -EGFYPEDQGFDVNIGGNDLGQPPSYFDPYLPAKPREFYEITTLKPRKEGEFLTDREGDE 233 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN--YYASVYSVD 319 + + K F ++ A A H P PD +K N Y A V SVD Sbjct: 234 VVNYIQNQKGKK--FFVHWAPYAVHTPI-MGKPDLVEKYEQKEPGNQRNPVYAALVESVD 290 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLPLNGAQKGYKSQTYPGGTHTPM 376 Q V ++L +L++ G +NT+++FTSDNG +I D P+ N K K Y GG P Sbjct: 291 QNVGKVLSELERMGLRENTLVIFTSDNGGLIGNYDNPITNNYPLKSQKGYPYEGGIRIPT 350 Query: 377 FMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 + W GK+ G D+ I MD+ PT LD P +L+GVSL P L ++K + Sbjct: 351 IVSWPGKIPQGFVDETPIITMDWIPTILDFMG-EDPTLPELEGVSLKPLLTERKD-LAER 408 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 +L W + H+ + P+ VR+ Y L++ Sbjct: 409 DLFWY--FPHYRLSDISPY----------------------------VIVRSGGYKLIHY 438 Query: 496 VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 ++ Q LY L D+++K N+ + + +++Q + +++ S L Sbjct: 439 FDDTQDELYNLDYDMEEKVNVISTRGAIAEQLQQKIDQWLVYSNARLP 486 >UniRef50_A6DFR6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFR6_9BACT Length = 573 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 113/515 (21%), Positives = 203/515 (39%), Gaps = 104/515 (20%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PN++++ DD GYG++ TP + L Sbjct: 19 DRPNVVLILTDDQGYGEVAAHGNKI--------------------------IQTPEMDKL 52 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHG 175 EGVR N +V + PSRAA++TGR +R GV+ ++ I E + + F G Sbjct: 53 YREGVRLDNYHVNS-ICSPSRAALVTGRYASRVGVWHTLGGRNIIRKDEKTIADHFVAAG 111 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT-- 233 Y T VGKWHL + ++P++RGF Sbjct: 112 YKTGMVGKWHLGDNA-----------------------PYRPEDRGFQDVFRIGGGSIGQ 148 Query: 234 --AYY----NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL 287 Y+ + + V KG+ +D D A+ V+ K PF L+++ APH Sbjct: 149 LPDYWKNDLWDGHYWNKGQWVKTKGFCTDVQFDYALDFVEENKKS--PFFLFISTTAPHS 206 Query: 288 PNDNPAPDQYQKQFNTGSQTAD--NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 P A +Y + + +Y V ++D + R+ +L++ +NTI++F+SD Sbjct: 207 PTG--ADKKYLEPYEKLGLDKGICAFYGMVTNIDDNIGRLRNKLRELKLEENTILIFSSD 264 Query: 346 NGAVIDG-PLPLNGAQKGYKSQTYPGGTHTPMFMWWK--GKLQPGNYDKLISAMDFYPTA 402 NG+ D NG +G K Y GG P F++W G + D++ + +D PT Sbjct: 265 NGSACDKKGDSFNGGMQGKKGSLYEGGHRVPCFLYWPKGGWIGGKQLDQVTAHIDILPTL 324 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFV 462 L A I P + DG+ L + +P + L+ + + ++ + F Sbjct: 325 LKACAIENPLNTAFDGIEL-----NGIIAKPAQKLSRLLITENKANKRDQEFQ------- 372 Query: 463 RHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQ 521 + V +++ L+ + LY + D QK+++A + + Sbjct: 373 -------------------NSVVLTDEWRLI-----DGQKLYDVKNDFTQKNDIAKEHNE 408 Query: 522 VVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKAL 556 VK ++ ++ S + +E+ ++ + L Sbjct: 409 RVKSLRKSYSQWYTSIKARFNELTPIDIDHEQAEL 443 >UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI Length = 481 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 128/515 (24%), Positives = 206/515 (40%), Gaps = 78/515 (15%) Query: 51 EYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + S G+P +I+ + DDLG+ + F T Sbjct: 18 DVSAAGRPRHIVFIVADDLGWNDIGFH---------------------------NPDIIT 50 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETF 166 P + L EG+ + YV + PSRAA M+G P + G+ + + +PL T Sbjct: 51 PNIDKLAREGLLLNHHYV-QPLCSPSRAAFMSGYYPFKTGLQHSVILENQPVCLPLNITI 109 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 LP+ + GY T VGKWH F + P RGFD F Sbjct: 110 LPQKLKELGYATHIVGKWHNG----------------------FCSWNCTPTYRGFDSFF 147 Query: 227 GFHAAGTAYYNSP-----SLFKNRERVPAKG--YISDQLTDEAIGVVDRAKTLDQPFMLY 279 G++ A YY N V Y + + TD A +++R QP LY Sbjct: 148 GYYGAMEDYYTHVIRGFLDYRNNTTPVWTDNGTYSTLRFTDVATDIIERH-NQSQPLFLY 206 Query: 280 LAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 LAY A + P + PA +Y+ + N S+ + V ++D+ V + + L++ G D+T Sbjct: 207 LAYQAVYGPIEVPA--KYEAMYPNIKSENRRKFSGMVSALDEAVGNVTKTLRQRGLMDDT 264 Query: 339 IILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMD 397 +ILFT+DNG +D N +G K Y GGT FM+ G + G D +I A+D Sbjct: 265 LILFTADNGGGVDES-GNNYPLRGSKFTVYEGGTRAVGFMYGSGLQKTGTVFDGMIHAVD 323 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 + PT AA + D DG++L P L P + + + + Sbjct: 324 WLPTLTAAAGGTPVSDR--DGINLWPSLS-TASPSPRTEVVYNYDSHPQPVQGHAAIRVG 380 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLA 516 +K + +P E ++ T + D NQ L+ L D ++++L+ Sbjct: 381 DYKLIDGYPGPFPDWYKPEQVTSSLNTRFSRD-------SANQYQLFNLKDDPNERNDLS 433 Query: 517 AANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 P +VK++ + + + PP + +N Sbjct: 434 NFRPDMVKKLAARLAWYKKQAVPPNFPETPDDLSN 468 >UniRef50_A6DQ01 N-acetylgalactosamine-4-sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQ01_9BACT Length = 616 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 112/497 (22%), Positives = 192/497 (38%), Gaps = 103/497 (20%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 + KPNII++ DD GYG L TP + Sbjct: 18 AQAKPNIIIVMTDDQGYGDLSCHG--------------------------NPILKTPQID 51 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 + +R TN +V P+R+A+MTGR AR GV+ + + E + + ++ Sbjct: 52 EFYKDALRLTNYHV-DPTCAPTRSALMTGRYSARVGVWHTVQGRHLMREREITMANILKD 110 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGT 233 +GY T GKWHL A ++P++RGF + + A G Sbjct: 111 NGYATGIFGKWHLGD-----------------------AYPYRPEDRGFTHVVTHGAGGV 147 Query: 234 AY--------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 Y + + + N E V +G+ +D DEA + + +PF ++ NAP Sbjct: 148 GQVPDYWGNDYFNDTYYVNGEFVKFEGFCTDVWFDEAKKFMKTQISKKKPFFTFITPNAP 207 Query: 286 HLPNDNPAPDQYQKQFNTGS---QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 H P AP +Y +N + ++ + ++D + E LK G DNT+++F Sbjct: 208 HGPMR--APQKYLDMYNQTKVKGTKLEAFFGMITNIDDNFGELREFLKDEGVADNTLLIF 265 Query: 343 TSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGN-YDKLISAMDFYP 400 T+DNG G N G K+ + GG P W G L G D+L + MD P Sbjct: 266 TTDNG-SSSGIGVYNAGMTGAKNSNFDGGHRVPFIFTWPKGNLMGGRDIDQLTAHMDILP 324 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 + ++ + PK + DG SL ++ + + L + ++ W N Sbjct: 325 SFIEMFGLKAPK-IDFDGTSLEKIIKGDQTALRDRVLLVESQ-----RVKDPEKWRN--- 375 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAAN 519 V ++ + L+ N LY + D QK+++++ + Sbjct: 376 ----------------------TAVMSDQWRLL-----NAKQLYNIRKDPAQKNDVSSQH 408 Query: 520 PQVVKEMQGVVREFIDS 536 P+V + + + + Sbjct: 409 PEVKQRLLAAYDKRWED 425 >UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=A4XED5_NOVAD Length = 462 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 115/526 (21%), Positives = 192/526 (36%), Gaps = 116/526 (22%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 + ++ T + +PNI+ + DDLGY Sbjct: 13 ISATALLSGQALAVTRKAAPERPNIVFIMADDLGYADTS--------------------- 51 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY---- 151 A + TP + S+ GV GY + + P+R A++TG RF + Sbjct: 52 -----ATGSRHIRTPAIDSIGAGGVMLRQGYSSTPICSPTRTALLTGCYAQRFAIGVEEP 106 Query: 152 --SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 N A G+PL + + + GY T+ VGKWHL + Sbjct: 107 LGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKWHLGEP-------------------- 146 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNS----------PSLFKNRERVPAKGYISDQLT 259 P G+D+F+G G Y+ L ++ + GY++D Sbjct: 147 ---PAHGPLKHGYDHFLGIVEGGADYFVHRMVMSGKPAGVGLAEDDAQTDRTGYLTDIFG 203 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ----FNTGSQTADNYYASV 315 DEA+ V++ +QPF L L + APH P + ++ + F+ Y V Sbjct: 204 DEAVRVIEE--GGNQPFFLSLHFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMV 261 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 ++DQ V ++L + ++G+ DNT+++FTSDNG G+K + GG P Sbjct: 262 ETMDQNVAKVLAAIDRSGKADNTVVVFTSDNGGER---FSDTWPFVGHKGEVLEGGVRVP 318 Query: 376 MFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + + W +++ G+ + + +MDF PT L A + + DG L L Sbjct: 319 LMVRWPRRIKAGSRSEQVMVSMDFLPTLLGMAGGDAARIGRFDGADLSAQLAGAAPVT-- 376 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + L W S VR D + Sbjct: 377 RTLFWRFKASE------------------------------------QAAVRQGDMKYLR 400 Query: 495 TVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 + L+ L+ D +++ NLA ANP V M+ + ++ P Sbjct: 401 MA--GKEYLFDLSQDEREQANLAPANPDKVNAMRALWDDWNREMMP 444 >UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XP0_PSEA6 Length = 627 Score = 378 bits (970), Expect = e-103, Method: Composition-based stats. Identities = 108/528 (20%), Positives = 192/528 (36%), Gaps = 108/528 (20%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 T + KPNI+++ DD GYG + Sbjct: 26 TVCSAVQNRSASAEPPTKPNIVLIVTDDQGYGDIGRHN---------------------- 63 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 TP + + + R TN +V P+R+A++TG+ R GV+ + Sbjct: 64 ----NPIIQTPNIDDIAAQSARLTNFHV-DPTCSPTRSALLTGKHSLRAGVWHTILGRYM 118 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + L E Q +GY T GKWHL ++PQ+ Sbjct: 119 LGPEHVTLAESLQENGYRTGIFGKWHLGDNY-----------------------PYRPQD 155 Query: 220 RGFDYFMGFHAAGT----AYY----NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 +GFD + G Y+ + + ++N GY + DEA +D+ Sbjct: 156 QGFDDVLIHGGGGVGQTPDYWGNTQFNDTYYRNGTPEKFSGYATKIWFDEAKKFIDKQH- 214 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 D P+ Y+A NAPH P P + ++ ++Y + +D+ V + L+ Sbjct: 215 -DTPYFAYIALNAPHGPYRAPETHIEPYEKRGLNRDMASFYGMISYIDEQVGELRAHLRA 273 Query: 332 NGQYDNTIILFTSDNG-------------------AVIDGPLPLNGAQKGYKSQTYPGGT 372 Q DNTI +F +DNG A N +GYK + Y GG Sbjct: 274 QDQLDNTIFIFMTDNGSSYKPTDAKTHLTKRHLPLAEQYPNWQPNDNMRGYKGEVYEGGH 333 Query: 373 HTPMFMWWK-GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P F+ + G + G+Y+ + + D PT L+ A+I P + LDG SL +L+ ++ Sbjct: 334 RVPFFISYPNGNITTGDYEAITAHFDVMPTLLELANIP-PVNSTLDGTSLATYLKGEQA- 391 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 +++L S + + + + + + + Sbjct: 392 --NRSLESKLSERAIVVTNQRVYHPSVKRPI---------------------AIAFHQWR 428 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + N+ L+ L D Q++++ +P ++ M+ + + Q Sbjct: 429 YISA--NDSEKLFNLQQDPSQQNDIKNDHPDILARMRQRKQTWWQEMQ 474 >UniRef50_C5VKQ0 N-acetylgalactosamine-6-sulfatase n=3 Tax=Prevotella RepID=C5VKQ0_9BACT Length = 520 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 130/555 (23%), Positives = 198/555 (35%), Gaps = 90/555 (16%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 L T F +PNII+ +DD+G+ Sbjct: 17 SSTLLITTSIAALGISFPAKAQQVNTQPNIILFMVDDMGWQDTSL--------------- 61 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 + TP + L EG+ FT+ Y +S PSR ++MTG AR V Sbjct: 62 ----PFADSITANNRKYDTPNMERLASEGMMFTDAYAT-PISSPSRCSLMTGMNMARHRV 116 Query: 151 YSNTDAQDGIPL---TETFLP-----------------------ELFQNHGYYTAAVGKW 184 + T +D + LP +L +N GY+T GK Sbjct: 117 TNWTLHRDKMTDGKRDGVTLPDWNYNGIAQSGNVAHTTKAISFVQLLKNVGYHTIHCGKA 176 Query: 185 HLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKN 244 H I D + T R GF G SP Sbjct: 177 HWGAIDTPGENPCHFGFDVNITGTAAGGLATYLSER----NYGFAKDGKP--TSPFAIPG 230 Query: 245 RERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 ER G + ++ LT EAI +++AK DQPF LY+++ A H+P D + Sbjct: 231 LERYWGTGIFATEALTQEAIASLEKAKKYDQPFYLYMSHYAVHVPIDRDMRFYPTYRARG 290 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-------LPL 356 S+ Y + + +D+ + +++ + K G TII+F SDNG + Sbjct: 291 LSEKEAAYASLIAGMDKSLGDLMDWVAKAGLKRETIIIFMSDNGGLASSSYWRDGELYTQ 350 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADIS-IPKDL 414 N K K Y GG P + W ++P I D YPT L A I Sbjct: 351 NAPLKSGKGSLYEGGIRVPFIVKWNNIVKPNTRSHAPIIIEDLYPTLLSMAGIKNYHVPQ 410 Query: 415 KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPN 474 K+DG + P L+ K+QG+ + L W WD + Sbjct: 411 KIDGQDITPILRGKQQGDKKRQLIWNYP----------NIWDGEGLGI------------ 448 Query: 475 TEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 + +R + L+Y+ Q LY L +DL +K+NLA+++PQ+V+ + + Sbjct: 449 -----SLNCAIREGQWKLIYSYLTGQKELYDLSSDLSEKNNLASSHPQLVERLYRHLTSK 503 Query: 534 IDSSQPPLSEVNQEK 548 + V EK Sbjct: 504 LHKMNAQKPIVEGEK 518 >UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD Length = 480 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 126/542 (23%), Positives = 197/542 (36%), Gaps = 139/542 (25%) Query: 44 FSDFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 + + K PN+I + DDLGYG + + Sbjct: 9 LASLCTIGVKAQEKLPNVIFILADDLGYGDI--------------------------EPY 42 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN------TDA 156 TP L L DEG++FT Y V PSRA+ +TG+ + N D Sbjct: 43 GQQIIKTPQLSKLADEGMKFTQFYTGTSVCAPSRASFITGQTTGETHIRGNEEVREPVDG 102 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 Q + + + +LF+ GY T GKW L + + E Sbjct: 103 QAPLLANDPSVAQLFKKAGYNTGCFGKWGLGIVPS----------------------EGN 140 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPS-LFKNRERV--PAKG-------YISDQLTDEAIGVV 266 P +GFD F G+++ A+ P+ L+ + E+V P G Y D + ++ + + Sbjct: 141 PLKQGFDTFFGYNSQFRAHRRYPAFLWHDNEKVLIPENGNYERQEVYGEDLIQEKILDYI 200 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ-------------------------F 301 + T ++PF ++L Y PH P Y + Sbjct: 201 GKQ-TAEKPFFMWLTYTLPHAELVVPHDSIYASYEYLPKKPYKGVDYDKITPKPFGWAGY 259 Query: 302 NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPL 356 + T Y A V +D+ + I + LK G ++TII+F SDNGA +G Sbjct: 260 MSQPHTYATYAAMVSRLDKYLGEIRKLLKVKGLDEDTIIIFASDNGAHREGGADPKFFNS 319 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLK 415 + +G K Y GG TP ++WKGK++ G+ I A D PT + + Sbjct: 320 SAGLRGIKRDLYEGGIRTPYIVYWKGKIKAGSVSDHIGAFWDMMPTFAEITHQKYVPNRH 379 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 VS LP L KKQ + HK L W Sbjct: 380 Q--VSFLPTLLGKKQQQQHKYLYWEFH--------------------------------- 404 Query: 476 EDLSQFSYTVRNNDYSLVY----TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 VR ++ V + + LY L TD ++ NLA P++VK+++ + Sbjct: 405 --EMGGRQAVRYKNWKGVRLNVNKDKKAPIELYDLTTDPAEQHNLAEKYPKIVKKIERFM 462 Query: 531 RE 532 + Sbjct: 463 EQ 464 >UniRef50_A6DNJ0 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNJ0_9BACT Length = 630 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 138/526 (26%), Positives = 206/526 (39%), Gaps = 95/526 (18%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 K + + S F+ ++ PNII + DDLGYG L Sbjct: 5 KISLVVIFLSAFSL--FAEAKPPNIIFMLADDLGYGDLSSYN-----------------P 45 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT- 154 +A TPTL S+ GVR+T+ + A + P+R A++T R P+R G ++ Sbjct: 46 NAEGEAPNNTPIRTPTLDSMAKNGVRYTDFHSAAPICSPARRALLTARYPSRLGEWAEAY 105 Query: 155 -DAQDG-IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 + DG + + + + GY TAA GKW++ + +V P D Sbjct: 106 RGSPDGVVAKNDPTIAMWLKEAGYATAAYGKWNIGESKDVSWPGAHGFDD---------- 155 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK-NRERVP--AKGYISDQLTDEAIGVVDRA 269 W + YF A P LF+ ERV Y++D TD+AI + Sbjct: 156 --WLIIDHNTGYFQ-HKNANKDCEGRPMLFETGGERVTNLEGQYLTDIWTDKAIDFIQET 212 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQ----YQKQFNTGSQTADNYYASVYSVDQGVKRI 325 K DQPF +YL ++ PH P +PA D + + Y V +D + RI Sbjct: 213 K--DQPFFIYLPWSIPHTPLQDPASDPSLAFDAGAKPKTVEGREVYVKMVEYLDSHIARI 270 Query: 326 LEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 + LK+ G+YDNT+I+FTSDNG ++ K K GG P M W K++ Sbjct: 271 FKSLKEQGKYDNTLIIFTSDNGGMVSANC---WPLKKTKQHLEEGGIRVPFLMQWPSKIK 327 Query: 386 PGNYDK-LISAMDFYPTALDAADIS--IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 G D+ MD T L AAD +PKD +LDGV+L E ++ W Sbjct: 328 AGTVDQRAAIMMDASVTVLAAADAMKYVPKDRELDGVNLF------ANKEENREFGWRRR 381 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE----- 497 W R+ D+ L+ + + Sbjct: 382 DWGWQGNYL-----------------------------RQEAYRSGDWKLIRSYQYLGNK 412 Query: 498 ----NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + LYKL+ DL +K+NL + P+ EM E+ Sbjct: 413 KWSAEYKEELYKLSDDLGEKNNLKKSMPEKHAEMVKSFDEWKAQVV 458 >UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN Length = 527 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 128/553 (23%), Positives = 210/553 (37%), Gaps = 99/553 (17%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 A FS T PNII + DD+GYG + Sbjct: 3 AAFLPLFLFSQNTAHASEKANDPNIIYILADDMGYGDI---------------------- 40 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--- 152 + +TP L L G+ FT+ + + V P+R ++TGR R + S Sbjct: 41 ---RALNPECKIATPHLDQLAHGGMIFTDAHSSSSVCTPTRYGVLTGRYNWRSRLKSGVL 97 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 ++ I +P + + HGYYTA VGKWHL ++ + Y+ Sbjct: 98 WGLSRRLIEPDRETVPSMLKEHGYYTACVGKWHLGMDWSLKQGGFATEQSYNKKTNPGWD 157 Query: 213 EEWQ------PQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP-----AKGYISD----- 256 ++ P + GFDYF G A+ P ++ +R K + D Sbjct: 158 VDYSKPIQNGPNSVGFDYFFGISASLDM---PPYVYIENDRSQGIPTVTKAFFRDGPAHK 214 Query: 257 ---------QLTDEAIGVVDRA---KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG 304 ++TD+ + ++D +PF +Y NAPH P P P+ G Sbjct: 215 DFEAIDVLPRITDKTVQIIDEHAAASKEGKPFFIYFPLNAPHTPI-LPTPEW------QG 267 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--------VIDGPLPL 356 + Y V VD V ++++ LKK G ++NT+++FT+DNG + D Sbjct: 268 KSGINAYCDFVMQVDDTVGQVMQALKKQGIHENTLVIFTADNGCSPAANFKEMTDKDHQP 327 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLK 415 + +G+K+ Y GG P W +++ G D+L D + TA D +P D Sbjct: 328 SYQFRGHKADIYEGGHRVPFIANWPARIKAGTHSDQLTCLTDLFATAADIVGAKVPDDAG 387 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 D VS+LP ++ H L + +I + P Sbjct: 388 EDSVSILPAMEG----TAHTPLREAAVHHSIRGAFSIRKDHWKLELCPGSGGWSFPKPGK 443 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 ++LS+ + LY L D ++ N+ A +P+VVKE+ +++ + Sbjct: 444 DNLSELPA-----------------IQLYDLNHDAGEQKNVQAEHPEVVKELTTLLQSYA 486 Query: 535 D--SSQPPLSEVN 545 D S P + N Sbjct: 487 DRGRSTPGKPQPN 499 >UniRef50_Q7UH46 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UH46_RHOBA Length = 490 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 131/533 (24%), Positives = 209/533 (39%), Gaps = 111/533 (20%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 S PNI+++ DDLG+G F+ + TP Sbjct: 26 VSASDCPNIVLMMCDDLGWGDTGFNGNTI--------------------------IQTPE 59 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELF 171 L +L +EG + Y V P+RA+ +TGR R G++ T + +P E L + Sbjct: 60 LDALANEGTVLDHFYSVGPVCSPTRASFLTGRHYFRMGIW--TANKGHLPSQEFTLARML 117 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQ-----------TRDYHDNFTTFSA-EEWQPQN 219 + GY T GKWHL +S + K RDY +F T SA W P Sbjct: 118 KTRGYATGHFGKWHLGTLSRTVSAKGKGRRPDLHYAPPWERDYDASFVTESAVCTWDPG- 176 Query: 220 RGFDYFMGFHAAGTAYYNSPSLFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFM 277 +G A YY +N G S L D A+ ++ A DQPF+ Sbjct: 177 ------IGKRARNNPYY------ENGVATDENVLGCDSRVLMDRALPFIEAAAERDQPFL 224 Query: 278 LYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 + ++APH D A +Y ++ + A +YY + +VD V R+ ++L G DN Sbjct: 225 SVIWFHAPH--EDIQAGPEYLAKYEGHGE-AAHYYGCITAVDDQVGRLRKKLASLGVADN 281 Query: 338 TIILFTSDNGAVIDGP--------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-N 388 T++ F SDNG P G G K GG P F+ W G++ G Sbjct: 282 TLLFFCSDNGPEGGEPSNRMKTRRAGSAGEFSGRKRSVLDGGVRVPAFVHWPGQIPAGVR 341 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 + +S MD PT + LDG ++LP + ++ Sbjct: 342 LNAPLSVMDLLPTVAAITGAETLPNRLLDGENVLPIWKGEQAQ----------------R 385 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV--ENNQLGLYKL 506 E++IPF QF+ VR + L+ ++++ L+ L Sbjct: 386 EKSIPF----------------------RYGQFACLVR-GKHKLIIESPNDDSKDRLFDL 422 Query: 507 T-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSE 558 + D+ + +NLA P++ M+ + F++S++ S +E N K + + Sbjct: 423 SKDVSESNNLANQKPELTASMRTELLGFLESAKA--SHAGEEYEGNDTKPVEK 473 >UniRef50_C6VSQ8 Sulfatase n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6VSQ8_DYAFD Length = 457 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 122/511 (23%), Positives = 188/511 (36%), Gaps = 127/511 (24%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PNI+ + DDLGYG + TP + +L Sbjct: 23 PNIVFILADDLGYGDIGAHG--------------------------QKLLRTPNIDALAK 56 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGI--------------PLT 163 EG+ FT+ + V PSR+ ++TG + N + GI Sbjct: 57 EGMIFTDIHAGAPVCSPSRSVLITGLHTGHTTIRGNATIRGGIVGNKGKQTVRRANLAAG 116 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 + + +L GY TA GKWHL + P +RGFD Sbjct: 117 DFTVGKLMAQSGYTTALTGKWHLDGYDTLAT----------------------PIHRGFD 154 Query: 224 YFMGFHAAGTAYYNSPSL----FKNRERVPA-------KGYISDQLT-DEAIGVVDRAKT 271 F G+ A Y + + N KGY +D LT DE++ + K Sbjct: 155 QFSGWLIAYPGTYANGYWPAKRYVNGVLKDVEQNENGRKGYYADDLTTDESLAFLAAQKD 214 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 +PF+L + YN+PH P D Y+ + Q Y A V+ +D+ V +I + L + Sbjct: 215 AKKPFVLMINYNSPHSPLDAADSSAYKDR--DWPQDMKIYGAQVHHLDENVGKIKKYLTE 272 Query: 332 NGQYDNTIILFTSDNGAVIDGP---------LPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 +G NTI+ F SDNG +G NG +GYK Y GG PM +W G Sbjct: 273 SGLAKNTIVFFCSDNGPRSEGTPQQTAIAEFFDSNGRLRGYKRDMYEGGIRVPMVVWAPG 332 Query: 383 KLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 ++PG+ + D PT D A + DG S+L ++ K +P + L W Sbjct: 333 IVKPGSVSSEPAYFADIMPTFADIAGSKV--SYTTDGASVLASIKGKAAWQP-RFLYWEF 389 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 F VR + V +L Sbjct: 390 F-----------------------------------EKGFEQAVRYGKWKAVKA--KGKL 412 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 LY L D+ + ++++A NP +V +++ ++ Sbjct: 413 ELYDLDKDISETNDVSADNPAIVAKIENYLK 443 >UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C581_BEUC1 Length = 458 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 116/495 (23%), Positives = 181/495 (36%), Gaps = 118/495 (23%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PNI+++ DDLGYG L + TP L L Sbjct: 3 QRPNIVLINADDLGYGDLGCYGSMRND--------------------------TPHLDRL 36 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-------GIPLTETFLP 168 EGVR T+ Y+A V PSR ++TG P R G G+ E + Sbjct: 37 AAEGVRLTDFYMASPVCSPSRGGMLTGCYPPRIGFGEFVGRPVLFPGDPVGLDPAERTMA 96 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 + + GY TAA+GKWH E+ P GFD + G Sbjct: 97 RVLGDAGYATAAIGKWHCGDQ-----------------------PEFLPTRHGFDSYFGI 133 Query: 229 -----HAAGTAYYNSPSL-FKNRERV----PAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 + + P L + E V P + ++++ T A ++ QPF L Sbjct: 134 PFSNDMGRQREHEDWPPLPLMSGESVVQEQPDQRSLTERYTVAATRFIEEN--AHQPFFL 191 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 YLA+ H+P PAP + Y +V ++D +++ L++ G +NT Sbjct: 192 YLAHMYVHVPLFVPAP-------FLAASRNGGYGGAVAALDWSTGVVMDTLRRLGLEENT 244 Query: 339 IILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMD 397 I++FTSDNG+ G N +G+K+QT+ GG + W + G D + ++D Sbjct: 245 IVVFTSDNGSRARGEGGSNDPLRGHKAQTWEGGQRVACVVRWPAAIPAGGVCDAVTRSID 304 Query: 398 FYPTALDAADISIPKD--LKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 PT A + D +DGV L L P++ + Sbjct: 305 LLPTFAAVAGAADWADPARPVDGVDLTALLTG-AGPAPNETFAYYYMDDL---------- 353 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL-GLYKL-TDLQQKD 513 VR D+ L + + + LY L TD + Sbjct: 354 ---------------------------EAVRVGDWKLHLSKRRDPMRELYDLRTDAAETH 386 Query: 514 NLAAANPQVVKEMQG 528 ++AA +P VV ++ Sbjct: 387 DVAADHPDVVARLEA 401 >UniRef50_B7FQ28 Arylsulfatase n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7FQ28_PHATR Length = 564 Score = 376 bits (965), Expect = e-102, Method: Composition-based stats. Identities = 121/560 (21%), Positives = 212/560 (37%), Gaps = 72/560 (12%) Query: 11 STSISLILASGMAAFAAHAADDVKLKATKTN--VAFSDFTPTEYSTKGKPNIIVLTMDDL 68 + + LA+ + + + + T N A T T S+ +P+I+++ MDDL Sbjct: 15 ISLRTFYLAASLCSGKTWCLPETQGSTTSDNHASAVEGITNTNSSSFFRPHILMIIMDDL 74 Query: 69 GYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVA 128 G L E + TP L +G+ YV Sbjct: 75 GSHDLGIH--------------------------ENSGIQTPHADQLARDGLYLDQYYVL 108 Query: 129 HGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHL 186 P+RA++++GR P G ++ N G+PL E LP++ + GY AVGKWH+ Sbjct: 109 -PYCSPTRASLLSGRYPLHTGCHTIVNDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHV 167 Query: 187 SKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN--RGFDYFMGFHAAGTAYYNSPSLFKN 244 P + + + + + RG Y M + A G + L Sbjct: 168 GHSRWTQTPTFRGFQSFFGFYLGAQDYNTHIKQGERGNAYEMHWDARGKCGRDCSRLVDE 227 Query: 245 RERVPAKGYISDQLTDEAIGVVDRA-KTLDQPFMLYLAYNAPHLPNDNPAPDQYQK---- 299 R Y + T EAI V++ + +P LYLA+ A H P+ P+ Y+K Sbjct: 228 R-----GNYSTHVFTREAIRVIENHPQRPHEPLFLYLAHQAVHWPDQ--VPETYRKFYEG 280 Query: 300 -QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLP 355 ++ + Y + + D+ + + + L+ G ++NT+++FT+DNG Sbjct: 281 ATYSNWTDQRKTYAGMLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCAAQGS 340 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN------------YDKLISAMDFYPTAL 403 N ++G K Y GGT F+ + Y K+ +D+ PT Sbjct: 341 SNYPKRGGKCTVYEGGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLA 400 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 + P LDGV+ + ++ P Y+++ ++ P H + Sbjct: 401 RMTGAT-PNGKPLDGVNQWDSMLQREPSAPPPREEVFVGYAYFGNQWYGPAIRYKHWKLI 459 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQV 522 P + F + LY L +D + N+A++ P + Sbjct: 460 QGQSGGPETSHDLPPGSFLPAPGG---------APGEYQLYDLQSDPSETQNIASSYPLI 510 Query: 523 VKEMQGVVREFIDSSQPPLS 542 V+ +QG + E+ S PP+S Sbjct: 511 VQILQGKLIEYHASFVPPIS 530 >UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CGJ8_9PLAN Length = 520 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 125/546 (22%), Positives = 204/546 (37%), Gaps = 115/546 (21%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P ++ + NI+ + DDLGYG + ++P+ ++ Sbjct: 24 PIAHAADKQSNIVYILADDLGYGDVS----CYNPE---------------------SKIK 58 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQDG--IPLTET 165 TP + L EG++FT+ + V P+R I+TGR R + Y D D I + Sbjct: 59 TPHIDRLAAEGMKFTDAHTPSAVCTPTRYGILTGRYCWRTRLKYRVLDGFDPPLIEQDQV 118 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVP-------VPEDKQTRDYHDNFTTFSAE-EWQP 217 +P L + GY TA +GKWHL VP D++ R + ++ P Sbjct: 119 TVPSLLKKAGYDTACIGKWHLGMQWTDKNGQPVPAVPIDRRQRPRVGDDVDYTKPILGGP 178 Query: 218 QNRGFDYFMGFHAA-GTAYY-----NSPSLFK--NRERVPAKGYISDQ------------ 257 GFDY+ G A+ + + + P + ER+ + DQ Sbjct: 179 LTSGFDYYFGISASLNMSPFCFIRNDRPVILPTIPSERIQTEFLSVDQGMRSPDFTIRSV 238 Query: 258 ---LTDEAIGVVDRA--KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 LT EA+ ++R ++ ++PF LY APHLP D+++ + A Y Sbjct: 239 MPTLTGEAVKYIERHGKESPERPFFLYFPLTAPHLPLVPN--DEFKGK-----SAAGEYG 291 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV----------------------- 349 V VD V I++ L++ G +NT+++FTSDNG + Sbjct: 292 DFVLEVDATVGAIMDALQRTGVAENTLVIFTSDNGGLYHWWTPQETDDLKHYKPNHRGQY 351 Query: 350 -IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPTALDAAD 407 D N +G K+ + GG P + W GK D+L+ D T D Sbjct: 352 VKDRGHQGNAHLRGTKADIWEGGHRVPFIVRWPGKTPADSTNDELVELTDLLATCAAITD 411 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 +P D V++LP L KK P + S F P+ + + Sbjct: 412 TKLPDGDAQDSVNILPALLGKKSDTPLREYAIHHSLWGHFSVRQGPWKMIPKRGSGGFTR 471 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEM 526 P + + LY L D + N+ +P+VVK + Sbjct: 472 AREVEPAAGEPTG---------------------QLYNLKQDPSETKNVWLEHPEVVKPL 510 Query: 527 QGVVRE 532 ++ + Sbjct: 511 SAILEQ 516 >UniRef50_D1QVA8 N-acetylgalactosamine-6-sulfatase n=1 Tax=Prevotella oris F0302 RepID=D1QVA8_9BACT Length = 521 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 128/574 (22%), Positives = 208/574 (36%), Gaps = 122/574 (21%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M F + + +A + + +PNII+ +DD+G+ Sbjct: 1 MKTFQPIPMGHFSVALSAMFLAVASSARAQDRVDNRPNIILFMVDDMGWQDTSL------ 54 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 + + + TP + L G+ F+ Y A +S PSR ++MT Sbjct: 55 -------------PFWTQRTMYNDRYETPNMERLAARGMMFSQAY-ACPISSPSRCSLMT 100 Query: 142 GRAPARFGVYSNT---DAQDGIPLTETFLPE-----------------------LFQNHG 175 G AR V + T + + + LPE L Q G Sbjct: 101 GSNAARHRVTNWTLEKNKSTDLKDDQLTLPEWNYNGISGVEGCRNTYRATSFVNLLQASG 160 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM-GFHAAGTA 234 Y+T GK H P + GF+ + G G A Sbjct: 161 YHTIHCGKAHWGARDTPGE---------------------DPHHWGFEVNIAGHAGGGPA 199 Query: 235 YYNSPSLFKNRER---------------VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 Y S + N + ++++ LT EA+ +D+AK +QPF LY Sbjct: 200 TYLSERHYGNTDNPAKQHKMAIPGLEKYWDTGTFLTEALTREALKSLDKAKLYNQPFYLY 259 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 +++ A H+P D P Y K G S+ Y + V +D+ + IL+ L KN + T Sbjct: 260 MSHYAVHIPIDRD-PRYYDKYLKKGLSEKEAAYASLVEGMDKSLGDILDWLDKNDETRRT 318 Query: 339 IILFTSDNGAVIDGP-------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD- 390 I++F SDNG G N K K Y GG PM + W G ++PG+ Sbjct: 319 IVIFMSDNGGYATGSQWRDQPLFTQNSPLKSGKGSMYEGGIREPMIVSWSGTVKPGSVCR 378 Query: 391 KLISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + + D++PT L+ A I K K+DG S +P L+ + L W Sbjct: 379 QYVMIEDYFPTLLEMAGIKHYKVPQKVDGKSFIPLLKGTGDPSRGRMLVWNYP------- 431 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TD 508 W N + + +R D+ L+Y + ++ LY + D Sbjct: 432 ---NVWGNVGPGI-----------------SLNCAIREGDWKLIYNYKTHEKELYDIPND 471 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 + + NLAA P +VK++ + ++ Sbjct: 472 IGEAHNLAAERPSIVKKLSKKLGNYLRKVAAQRP 505 >UniRef50_A6DG78 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG78_9BACT Length = 464 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 139/576 (24%), Positives = 215/576 (37%), Gaps = 158/576 (27%) Query: 4 ALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVL 63 ++ + STS+ +L+ A++AD+ KL KPN+++ Sbjct: 3 TIRNLITSTSLFFLLS-------AYSADNKKLDI------------------NKPNLVIF 37 Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DD G D + K + TP + L ++GVRFT Sbjct: 38 FTDDQG----TLDVNCYGSKDLY----------------------TPNMDKLAEDGVRFT 71 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA------QDGIPLTETFLPELFQNHGYY 177 Y V P+RA +MTGR P R V T + L E L E ++ GY Sbjct: 72 QAYAHQ-VCCPARAMLMTGRHPQRSNVNHWTQGDAKGPKTRNMNLEEYTLAEALKDSGYK 130 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 TA GKWHL + + P +GFD F G YN Sbjct: 131 TALFGKWHLGAHLD-----------------------YGPTKQGFDEFYGIRGGFIDNYN 167 Query: 238 S--------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP 288 L++ + V +G Y + +TD A+ +DR K + PF L+LA+N PH P Sbjct: 168 HYFLHGEGFHDLYEGTKEVFDEGKYFPNLVTDRALNFIDRNK--NNPFFLFLAFNIPHYP 225 Query: 289 NDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 A ++ +++ +Y + + D + +I+ +L+++G YDNTII+F SDNG Sbjct: 226 EQ--ADPKFDERYKNMKMPRQSYAKMISTTDDHMGQIMSKLQEHGIYDNTIIIFMSDNGH 283 Query: 349 VIDGPL----------------------PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 + G +G KS Y GG P + + KL Sbjct: 284 SRERNHIKFDNHKSGLAKNTKYGALGGGGNTGKWRGNKSNFYEGGIRVPAIITFPNKLPK 343 Query: 387 GNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 G D+ I+AMD+ PT L+ +I PK +K DG SL + + PHK L W Sbjct: 344 GAVRDQAITAMDWMPTVLELCNIEPPK-IKFDGKSLTQVIASEDNPSPHKVLNWQWH--- 399 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK 505 ++ +R + L L Sbjct: 400 -----------------------------------LAWAIRQGSWKL-MGRGTEPTFLGN 423 Query: 506 LTDLQ-QKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 L D Q +K N P++VK + + +++ P Sbjct: 424 LDDKQPEKTNYLTEKPELVKTLHQLHKQWALDVDAP 459 >UniRef50_A6DFS2 N-acetylgalactosamine-6-sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFS2_9BACT Length = 497 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 119/498 (23%), Positives = 196/498 (39%), Gaps = 83/498 (16%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 KPN I + DD+GYG L + TP L Sbjct: 18 ADQKPNFIFMMADDMGYGDLEAYG-------------------------YNDKLKTPNLN 52 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQN 173 + G+ FT+ Y V P+R + TGR P R G++ Q + E LPE+ + Sbjct: 53 EMAANGMLFTSFYSQASVCSPTRFSCYTGRHPFRTGIW--EANQGSLRDEEITLPEVLKK 110 Query: 174 HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA-G 232 HGY T GKWHL ++ + P P G D + H+ Sbjct: 111 HGYATGHFGKWHLGQMVDDPTLGK-----------GARMPMAPPHENGVDEWFAVHSCVP 159 Query: 233 TAYYNSPS----------LFKNRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 T P+ + N RV G S + D AI +++A + PF+ Y+ Sbjct: 160 TFNPYGPNGEEAAESDNAYYHNGVRVTDNLVGDSSRIIMDRAIPFIEKAVQNETPFITYI 219 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 +N PH P Q+Q + A YY+++ +D+ V R+ +L++ G +NT++ Sbjct: 220 WFNTPHAPVTGN--PQWQSTYEPIVGKAWQYYSNLADMDKQVGRLRSKLQELGVANNTVL 277 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFY 399 FTSDNG V G +G + K + GG P + W K++ G+ IS D++ Sbjct: 278 CFTSDNGPVSHG---SSGPFRASKRHLFDGGVRVPGIIEWPAKVRKGSETAAISCTTDYF 334 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 TALDAA I K+DG SL+P L + + + + Sbjct: 335 LTALDAAGIDYQSPYKMDGQSLVPILIEDESTRREPLMF--------QSHGSQVVLGEKF 386 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAA 518 K +R + + + + + GL+ L+D+ +++N+AA+ Sbjct: 387 KAMRVYEGSFSQSHAEDAGLKL-----------------GEWGLFNRLSDVGEENNVAAS 429 Query: 519 NPQVVKEMQGVVREFIDS 536 P+++ + + + S Sbjct: 430 KPEILSQFSRIFETWDAS 447 >UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000586CBD Length = 525 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 126/534 (23%), Positives = 210/534 (39%), Gaps = 98/534 (18%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 +PNII+ DDLGYG L + STP Sbjct: 18 TTGRAKRPNIIIFYADDLGYGDL--------------------------EPYGHPTSSTP 51 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFL 167 L L G+ T Y + V PSRAA++TGR R GVY N + G+PL ET + Sbjct: 52 NLGRLAAGGIVLTQFYSSSPVCSPSRAALLTGRYQMRSGVYPHVFNVEMSGGLPLNETLI 111 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 ++ + GY +AAVGKWHL +N + P N GFD F+G Sbjct: 112 SKMLKPEGYRSAAVGKWHLGLGNNSV---------------------YLPHNHGFDEFLG 150 Query: 228 --------------------FHAAGTAYYNSPSLFKNRERVPAKG---YISDQLTDEAIG 264 A + Y+ +LF + + D+ ++ Sbjct: 151 LPASPSQCRCSVCFYPNVTCHRAPCSPEYSPCALFNGTTIIEQPADLLTLDDKYAMQSRR 210 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 + PF LY A + H P QY + +G+ + S+ ++D V + Sbjct: 211 FIRTNVETGTPFFLYYASHHTHHP-------QYAGKETSGTSIRGRFGDSLAALDWEVGQ 263 Query: 325 ILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 I E+LK+NG ++T F+SDNG ++ G K K+ TY GG P + W G Sbjct: 264 IYEELKENGILEDTFFFFSSDNGPSLSLENFGGNAGLMKCGKATTYEGGIRVPAIVHWPG 323 Query: 383 KLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 ++ PG +L S +D PT + +P ++ LDG + P+L + ++ + S Sbjct: 324 QITPGRSMELSSTLDVLPTIASITNAKLP-NVTLDGYDMSPFLF-QGMPSLRESFFYYPS 381 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG 502 + + Y K V + N +D+ ++R ++ Sbjct: 382 KVDTEHKSYAVRYKQY-KAVFYTEGSALSNNKNKDVDCRGTSLRT---------YHDPPM 431 Query: 503 LYKLT-DLQQKDNLAAAN-PQ--VVKEMQGVVREFIDSSQPPLSEVNQEKFNNI 552 L+ L D ++ N++ + P+ ++ ++ + +F SE+N+ + N+ Sbjct: 432 LFDLEQDPSEQYNISINHSPERDIILKLTKMRADFDAKMVFAPSEMNKPRDKNL 485 >UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ89_PLALI Length = 536 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 127/601 (21%), Positives = 215/601 (35%), Gaps = 167/601 (27%) Query: 10 VSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLG 69 + ++S++L +AA + A + + +PN++ + DDLG Sbjct: 7 IRAALSVLLLIQLAAESLWANELTLISH----------------QSPRPNVVFILADDLG 50 Query: 70 YGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAH 129 +G++ G F ++ TP + L GV+ T Y Sbjct: 51 WGEV----GCFGQ----------------------SKIPTPNIDRLASRGVKLTRHYSGA 84 Query: 130 GVSGPSRAAIMTGRAPARFGVYSN----------TDAQDGIPLTETFLPELFQNHGYYTA 179 PSR +MTG+ + N T+ Q + + FQ GY T Sbjct: 85 PTCAPSRCVLMTGKHLGHAEIRGNQQAKVKLPQFTEGQHPLSDKALTIARQFQKAGYATG 144 Query: 180 AVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNS 238 A GKW L + + P +GFD F G++ A+ Y Sbjct: 145 AFGKWGLGPVGSTGE----------------------PNRQGFDEFFGYNCQALAHSYFP 182 Query: 239 PSLFKNRERV-----------------------PAKGYISDQLTDEAIGVVDRAKTLDQP 275 +L+KN E + + Y + EA+ +DR QP Sbjct: 183 KALWKNAESIVNNEKPVPGHKKQPEGEVTMEAYQGENYAPRLIMAEALSFIDRHHQ--QP 240 Query: 276 FMLYLAYNAPHLPNDNPA------PDQYQKQ-------FNTGSQTADNYYASVYSVDQGV 322 F LYL + PH+ P P ++ ++ + + Y A + +D V Sbjct: 241 FFLYLPFTEPHVAMQPPPKIVEEFPVEWDERVYRGDGGYLPHPRPRAAYAAMIRDLDNHV 300 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDG-----------PLPLNGA--QKGYKSQTYP 369 ++ L+K+G + T+I+FTSDNGA PL N KG+K Y Sbjct: 301 GDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGADPLFFNSTRELKGFKGSIYE 360 Query: 370 GGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 GG P + W G++ P + D++PT +A + +P+ LDGV+LLP L K Sbjct: 361 GGLRVPAIVSWPGQIPPATTINTPSYFPDWFPTLCNATQLPLPEG--LDGVNLLPLLTGK 418 Query: 429 KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNN 488 + +F+R + + T V Sbjct: 419 TSPD---------------------------QFIRPDPMVWVYAEYT-----GQVCVHLG 446 Query: 489 DYSL----VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR-EFIDSSQPPLS 542 D+ + + T +Y+L +D + NLA + P +V + V++ + + P+ Sbjct: 447 DFKVLRRGLRTNRPGPWEVYQLVSDPGESTNLADSRPDLVTKAIEVLKAQTAPNEIFPMP 506 Query: 543 E 543 E Sbjct: 507 E 507 >UniRef50_D0PR02 N-acetylgalactosamine-4-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR02_9SPHI Length = 595 Score = 375 bits (963), Expect = e-102, Method: Composition-based stats. Identities = 125/502 (24%), Positives = 200/502 (39%), Gaps = 106/502 (21%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 K PN+I++ DD G G L TP + Sbjct: 26 KQAPNVILILTDDQGIGDLGCHG--------------------------NPWLKTPNIDK 59 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 ++ VR T+ +V + P+RAAIMTG+ P R G ++ +D + + + ++F++ Sbjct: 60 FYEQSVRLTDFHV-SPLCTPTRAAIMTGQYPIRNGAWATYKGRDALSKGQLTMADVFKSA 118 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG-- 232 GY TA GKWHL N PV +P + GFD+ + A G Sbjct: 119 GYSTALFGKWHLGD--NYPV---------------------RPSDSGFDHVVQHLAGGIG 155 Query: 233 --TAYYNSPS----LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 + Y+ + + N + +GY +D EA+ + + +QPF +YL NAPH Sbjct: 156 ELSDYWGNSYFDDVYYVNNQPKQFQGYCTDVWFSEAMKFI-NQQEKEQPFFIYLPLNAPH 214 Query: 287 LPNDNPAPDQYQ---KQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 P ++Y K+F N Y + ++D+ + + LKK G NTI+++ Sbjct: 215 DPLI--VDEKYAAPYKKFEGSEIIDANLYGMIANIDENFGKFRKFLKKKGLDKNTILIYM 272 Query: 344 SDNGA----VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW-KGKLQPGN-YDKLISAMD 397 SDNG DG L N KG K + GG P F+ W G ++ G L + +D Sbjct: 273 SDNGTRFGYSRDGKLGYNYHLKGMKGDKFEGGHRVPFFIQWMDGGIEGGKDIRSLSAHVD 332 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 PT I +PK+ DG+ L L ++ + Sbjct: 333 LIPTLAKLCGIPLPKNQAFDGIDLSGVLTKNEKPKDR----------------------- 369 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLA 516 FV H+ D P L + V N++ L+ N LY + TD Q N+A Sbjct: 370 -SVFVHHRQDWRPP------LQEKGTCVLKNEWRLI-----NGYQLYNMKTDPLQTTNVA 417 Query: 517 AANPQVVKEMQGVVREFIDSSQ 538 N ++V+ + + F ++ Sbjct: 418 EENKELVEALLEENKSFYQQTK 439 >UniRef50_A6DG53 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG53_9BACT Length = 515 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 125/587 (21%), Positives = 212/587 (36%), Gaps = 138/587 (23%) Query: 10 VSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLG 69 ++T SL A G+A L A T++AF + PNII++ DD+G Sbjct: 1 MNTKQSLFNAVGIA----------VLAAMMTHLAFG------AAKTETPNIILILADDMG 44 Query: 70 YGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAH 129 + G TP L L+ +G+ FT+ + Sbjct: 45 IDSIQALNGKSG-------------------------IPTPHLDRLLTQGIHFTDAHSGS 79 Query: 130 GVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET---FLPELFQNHGYYTAAVGKWHL 186 V P+R ++TGR R + + Q PL E LP + + GY TA +GKWHL Sbjct: 80 AVCTPTRYGVLTGRYAWRSRLKKSIVRQWERPLIEKDRLTLPGMLKKKGYNTACIGKWHL 139 Query: 187 SKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY---------- 236 + + E P GFDY+ G + Sbjct: 140 GWDWPK---KGGGFTEKMKEIDFSEKIEGGPAGCGFDYYFGDDVPNWQPFVWIENGRMLG 196 Query: 237 ------NSPSLFKNRERVPAKGYISD----QLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 + S + + + + +G+ + ++T++++ +++ QPF LY + +PH Sbjct: 197 VPNKQLSFASHYHSGKGIGVEGWDLEAVLPKITEKSVEYINQQAETKQPFFLYFSMTSPH 256 Query: 287 LPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 P P G Y + D V +I++ LK G DNT+++FT+DN Sbjct: 257 TPIAPSKP-------FQGKSGISRYADFLMETDWCVGQIMKALKDRGIADNTLLIFTADN 309 Query: 347 GAVIDGPLP--------LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMD 397 G L +G K+ + GG P + W G ++PG+ D+ IS +D Sbjct: 310 GTSPKCNFTELREKRTDLQNHWRGMKADAFEGGHRVPFIVSWPGHIKPGSKSDQTISLVD 369 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 T DA +++ D VSL+P L+ + P Sbjct: 370 IMATCADAVALTLSDSAAEDSVSLMPVLKGEDIATP------------------------ 405 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN----------------QL 501 H + VR + L Y+ + Sbjct: 406 ------------LHEAVICHSISGVFVVRKGKWKLQYSAGSGGLSLPKDKNAKKKGLPTW 453 Query: 502 GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFID--SSQPPLSEVN 545 LY L +D ++ +NL + ++VK++ ++R +I+ S P + N Sbjct: 454 QLYDLSSDPKETNNLINGHQEIVKDLTAILRRYIENGRSTPGTPQKN 500 >UniRef50_D2R207 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R207_9PLAN Length = 495 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 124/522 (23%), Positives = 195/522 (37%), Gaps = 124/522 (23%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + + +PNII L DDLGYG + G + K +T Sbjct: 29 AQAADSDRPNIIWLMADDLGYGDV----GCYGQKV----------------------IAT 62 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----IPLTET 165 P + + EG+RFT Y V PSR+ +MTG V N A + + + Sbjct: 63 PNIDQMAREGLRFTQFYSGATVCAPSRSVLMTGLHHGHTRVRGNAGAGNPAAQALRADDF 122 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 + + Q GY TA VGKW L A P+ +GFD F Sbjct: 123 TVAKFLQQAGYRTALVGKWGLGDDG--------------------QASTGLPRKQGFDEF 162 Query: 226 MGFHAAGTAYYNSPS-LFKNRERVPAKG------------------YISDQLTDEAIGVV 266 +G+ A+ + PS L++N E+ P + D LT+EA+ V Sbjct: 163 VGYLNQRHAHNHFPSFLWRNEEKFPLPNVPELEEPDGSGYPKKAVQFADDLLTEEALAFV 222 Query: 267 DRAKTLDQPFMLYLAYNAPHLPND--------NPAPDQYQKQFNTGSQTADNYYASVYSV 318 +R + +QPF LY PH N+ PD + T + A ++ + Sbjct: 223 ERNR--EQPFFLYWTPVIPHANNERARDLGNGAQVPDFGPYEKETWPEQDKGQAAMIHRL 280 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-----NGAQKGYKSQTYPGGTH 373 D V R+L +LK+ T+ +FTSDNG + L +G+ G K + GG Sbjct: 281 DTYVGRMLAKLKQLKLDQKTLFIFTSDNGPHNEARHNLERFQPSGSWTGIKRSLHDGGIR 340 Query: 374 TPMFMWWKGKLQPGNYDKLISAM-DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 PM WW G + P + + DF+ TA + A S P LD +S L+ + Sbjct: 341 VPMICWWPGTIAPQQVSEHVGYSGDFFATAAELA--SRPAPAGLDSISFASTLRGDSSKQ 398 Query: 433 -PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 H+ L W + + T+ Y Sbjct: 399 AKHEFLYWEFHENGFS----------------------------------QATLCEGRYK 424 Query: 492 LVYTVENN-QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 + + + + +Y L TD Q++ ++AA NP + + ++ Sbjct: 425 GIRLRDPDAPIAVYDLQTDPQERVDIAATNPALAARLDHYLK 466 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammapro... 621 e-176 UniRef50_D2YC71 Sulfatase n=2 Tax=Vibrio mimicus RepID=D2YC71_VIBMI 518 e-145 UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT 502 e-140 UniRef50_D1P6M6 Putative sulfatase YdeN n=2 Tax=Providencia RepI... 500 e-140 UniRef50_C5BEH4 Sulfatase, putative n=37 Tax=Gammaproteobacteria... 496 e-138 UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT 489 e-136 UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 488 e-136 UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase... 486 e-136 UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Plancto... 485 e-135 UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomy... 483 e-135 UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomy... 481 e-134 UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 478 e-133 UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 477 e-133 UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD 474 e-132 UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomy... 473 e-132 UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina ... 471 e-131 UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 469 e-130 UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 468 e-130 UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces mari... 468 e-130 UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 468 e-130 UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Pro... 467 e-130 UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyc... 466 e-130 UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=B... 466 e-130 UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 466 e-130 UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planct... 466 e-130 UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 T... 466 e-129 UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 466 e-129 UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 465 e-129 UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 T... 465 e-129 UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 465 e-129 UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria Rep... 464 e-129 UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 463 e-128 UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 463 e-128 UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Plancto... 463 e-128 UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y2... 462 e-128 UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 462 e-128 UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bac... 460 e-128 UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellu... 460 e-128 UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchi... 459 e-127 UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6... 458 e-127 UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 458 e-127 UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bactero... 458 e-127 UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 458 e-127 UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC1... 457 e-127 UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 457 e-127 UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_R... 456 e-127 UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 456 e-127 UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID... 456 e-126 UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LE... 456 e-126 UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flamme... 455 e-126 UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 455 e-126 UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Plancto... 455 e-126 UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 455 e-126 UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=... 455 e-126 UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD 455 e-126 UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 454 e-126 UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 T... 454 e-126 UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 453 e-126 UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi D... 453 e-125 UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 453 e-125 UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisp... 453 e-125 UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 453 e-125 UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN 451 e-125 UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 451 e-125 UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF8... 451 e-125 UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 451 e-125 UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepI... 451 e-125 UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria ... 450 e-125 UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 450 e-125 UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 450 e-125 UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglob... 450 e-125 UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 Re... 449 e-124 UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirel... 449 e-124 UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LD... 449 e-124 UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus... 448 e-124 UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM... 447 e-124 UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 447 e-124 UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 446 e-124 UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 T... 446 e-124 UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Ta... 446 e-123 UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA 446 e-123 UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodop... 445 e-123 UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 445 e-123 UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9... 445 e-123 UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA 445 e-123 UniRef50_Q7UHJ6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 444 e-123 UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomy... 444 e-123 UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica R... 444 e-123 UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN 444 e-123 UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisp... 444 e-123 UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 T... 444 e-123 UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 444 e-123 UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 444 e-123 UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis... 443 e-123 UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteri... 443 e-123 UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN... 443 e-123 UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomy... 443 e-123 UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=... 443 e-123 UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdema... 443 e-122 UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7... 442 e-122 UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 442 e-122 UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO 441 e-122 UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 441 e-122 UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 441 e-122 UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 440 e-122 UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9... 440 e-122 UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=... 440 e-122 UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bactero... 440 e-122 UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 440 e-122 UniRef50_A6DHI2 Aryl-sulphate sulphohydrolase n=2 Tax=Lentisphae... 439 e-121 UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7... 438 e-121 UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 438 e-121 UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT 437 e-121 UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 436 e-121 UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica R... 436 e-121 UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE 436 e-120 UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 436 e-120 UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 436 e-120 UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina D... 435 e-120 UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 435 e-120 UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium sp... 434 e-120 UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM 434 e-120 UniRef50_B1KD78 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 434 e-120 UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 433 e-120 UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 432 e-119 UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 432 e-119 UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacter... 431 e-119 UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB... 431 e-119 UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_B... 431 e-119 UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 430 e-119 UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bactero... 430 e-119 UniRef50_A6DNI9 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 430 e-119 UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodop... 430 e-119 UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155... 429 e-119 UniRef50_A6DS95 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 429 e-118 UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica ... 429 e-118 UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis... 428 e-118 UniRef50_A6BYR0 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 428 e-118 UniRef50_A7RFN2 Predicted protein n=7 Tax=Eumetazoa RepID=A7RFN2... 428 e-118 UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 ... 428 e-118 UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 428 e-118 UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM ... 427 e-118 UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 427 e-118 UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium ... 427 e-118 UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organi... 426 e-118 UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW10... 426 e-118 UniRef50_B4D433 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 426 e-117 UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3J... 425 e-117 UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 424 e-117 UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 424 e-117 UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME 424 e-117 UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 1233... 424 e-117 UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 424 e-117 UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO 424 e-117 UniRef50_B7QJZ0 Arylsulfatase B, putative n=9 Tax=Ixodes scapula... 423 e-117 UniRef50_A6CA27 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 423 e-117 UniRef50_A4AVA7 Aryl-sulphate sulphohydrolase n=2 Tax=Bacteroide... 423 e-117 UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB... 423 e-116 UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 423 e-116 UniRef50_B1KFX9 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 422 e-116 UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera... 422 e-116 UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LC... 422 e-116 UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7... 421 e-116 UniRef50_A6DUI7 Putative exported uslfatase n=1 Tax=Lentisphaera... 421 e-116 UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bactero... 421 e-116 UniRef50_Q7UYH3 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 421 e-116 UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 421 e-116 UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas nept... 421 e-116 UniRef50_A6CB33 Arylsulfatase n=1 Tax=Planctomyces maris DSM 879... 421 e-116 UniRef50_A6DGL0 Arylsulfatase A n=3 Tax=Lentisphaera araneosa HT... 421 e-116 UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3... 420 e-116 UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_B... 420 e-116 UniRef50_D2R323 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 419 e-116 UniRef50_B5JMW2 Sulfatase domain protein n=1 Tax=Verrucomicrobia... 419 e-115 UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Ta... 419 e-115 UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW... 419 e-115 UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 R... 419 e-115 UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN 419 e-115 UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera aran... 418 e-115 UniRef50_Q2GB51 Sulfatase n=6 Tax=Proteobacteria RepID=Q2GB51_NOVAD 418 e-115 UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 418 e-115 UniRef50_A6DJ37 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 418 e-115 UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris ... 418 e-115 UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT 418 e-115 UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR4... 418 e-115 UniRef50_C1ZIS7 Arylsulfatase A family protein n=1 Tax=Planctomy... 418 e-115 UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepI... 418 e-115 UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=... 418 e-115 UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhod... 418 e-115 UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT 417 e-115 UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphing... 417 e-115 UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428... 417 e-115 UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica Rep... 416 e-115 UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacte... 416 e-114 UniRef50_A6DMX7 N-acetyl-galactosamine-6-sulfatase (GALNS) n=2 T... 416 e-114 UniRef50_A3ZWK4 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 415 e-114 UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 415 e-114 UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI 415 e-114 UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI... 415 e-114 UniRef50_Q7UN55 N-acetylgalactosamine 6-sulfate sulfatase (GALNS... 414 e-114 UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomy... 414 e-114 UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bactero... 414 e-114 UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN 414 e-114 UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bactero... 414 e-114 UniRef50_D2QXE9 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 414 e-114 UniRef50_Q9VVM4 CG7402 n=10 Tax=Drosophila RepID=Q9VVM4_DROME 413 e-114 UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus E... 413 e-114 UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS2... 413 e-114 UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Ta... 413 e-114 UniRef50_A6DMX6 Arylsulphatase A n=1 Tax=Lentisphaera araneosa H... 413 e-113 UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN 413 e-113 UniRef50_A9BNY8 Sulfatase n=11 Tax=cellular organisms RepID=A9BN... 412 e-113 UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 Rep... 411 e-113 UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bac... 411 e-113 UniRef50_UPI0001745D5D N-acetylgalactosamine 6-sulfate sulfatase... 411 e-113 UniRef50_A4GIB1 Arylsulfatase n=2 Tax=Bacteria RepID=A4GIB1_9BACT 410 e-113 UniRef50_D1QVA8 N-acetylgalactosamine-6-sulfatase n=1 Tax=Prevot... 410 e-113 UniRef50_A6DF76 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 410 e-113 UniRef50_D2A5L7 Putative uncharacterized protein GLEAN_15152 n=2... 410 e-113 UniRef50_A6DMW1 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 T... 410 e-113 UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC... 410 e-113 UniRef50_D0PR10 N-acetylgalactosamine-6-sulfate sulfatase n=1 Ta... 410 e-113 UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6... 410 e-113 UniRef50_A3ZY29 Aryl-sulphate sulphohydrolase n=1 Tax=Blastopire... 410 e-113 UniRef50_C7PRW9 Sulfatase n=1 Tax=Chitinophaga pinensis DSM 2588... 409 e-112 UniRef50_A6KZI6 Sulfatase n=6 Tax=Bacteroides RepID=A6KZI6_BACV8 409 e-112 UniRef50_Q7UXA8 N-acetylgalactosamine-6-sulfate sulfatase n=2 Ta... 409 e-112 UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2... 409 e-112 UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomy... 409 e-112 UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomy... 409 e-112 UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1... 409 e-112 UniRef50_A6DQ01 N-acetylgalactosamine-4-sulfatase n=2 Tax=Lentis... 408 e-112 UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=... 408 e-112 UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia caver... 408 e-112 UniRef50_A6C8R8 Arylsulfatase A n=2 Tax=Planctomycetaceae RepID=... 408 e-112 UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD 408 e-112 UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 407 e-112 UniRef50_A7VQW1 Putative uncharacterized protein n=1 Tax=Clostri... 407 e-112 UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8... 406 e-112 UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6... 406 e-112 UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W9... 406 e-112 UniRef50_B7PV03 Arylsulfatase B, putative n=7 Tax=Ixodes scapula... 406 e-112 UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomy... 406 e-112 UniRef50_Q7UER7 Sulfatase 1 n=8 Tax=Bacteria RepID=Q7UER7_RHOBA 406 e-111 UniRef50_A6CA66 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Ta... 406 e-111 UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HT... 406 e-111 UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deute... 406 e-111 UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivoran... 406 e-111 UniRef50_C5VKQ0 N-acetylgalactosamine-6-sulfatase n=3 Tax=Prevot... 405 e-111 Sequences not found previously or not previously below threshold: UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HT... 414 e-114 >UniRef50_P77318 Uncharacterized sulfatase ydeN n=81 Tax=Gammaproteobacteria RepID=YDEN_ECOLI Length = 560 Score = 621 bits (1603), Expect = e-176, Method: Composition-based stats. Identities = 560/560 (100%), Positives = 560/560 (100%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI Sbjct: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV Sbjct: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA 180 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA Sbjct: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAA 180 Query: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS Sbjct: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 Query: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ Sbjct: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 Query: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ Sbjct: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS Sbjct: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVS 420 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ Sbjct: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 Query: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP Sbjct: 481 FSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 Query: 541 LSEVNQEKFNNIKKALSEAK 560 LSEVNQEKFNNIKKALSEAK Sbjct: 541 LSEVNQEKFNNIKKALSEAK 560 >UniRef50_D2YC71 Sulfatase n=2 Tax=Vibrio mimicus RepID=D2YC71_VIBMI Length = 577 Score = 518 bits (1336), Expect = e-145, Method: Composition-based stats. Identities = 353/554 (63%), Positives = 442/554 (79%), Gaps = 5/554 (0%) Query: 2 KSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNII 61 K+++++TSISLIL S + A + LKATKTNVAFSD +EYSTKGKPNII Sbjct: 21 NMKFKRNLLTTSISLILVSHLLPSFASTQNSDNLKATKTNVAFSDIEISEYSTKGKPNII 80 Query: 62 VLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVR 121 +LT+DD+GYGQ+ FD+ +F+ ++M++++VVDTYKI ID+AI AA+ STPT+ L+D GV+ Sbjct: 81 ILTVDDMGYGQMNFDQNTFNEESMKDQKVVDTYKIPIDEAINAAKNSTPTINKLIDTGVK 140 Query: 122 FTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAV 181 NGYVAHGVSGPSRAAI+TG+APA+FGVYSN DA+ GIP+ E FLPE+FQNHGYYTAAV Sbjct: 141 INNGYVAHGVSGPSRAAIITGKAPAKFGVYSNIDAEQGIPVEEKFLPEIFQNHGYYTAAV 200 Query: 182 GKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSL 241 GKWHLSKISNV V E KQTRDYHDNF T+S E+WQPQNRGF+YFMGFH G AYYNSP+L Sbjct: 201 GKWHLSKISNVAVDEAKQTRDYHDNFITYSGEQWQPQNRGFNYFMGFHTHGVAYYNSPAL 260 Query: 242 FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF 301 F+NRE +PAKGY+ DQ T+EAIGVV++AK+ D PF+LYLAYNAPHLPND PAP QYQ++F Sbjct: 261 FRNRENIPAKGYVIDQFTNEAIGVVNKAKSNDAPFLLYLAYNAPHLPNDAPAPKQYQQRF 320 Query: 302 NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQK 361 TGSQTADN+YAS+Y+VDQGVKR+L QLK N QYDNT+I+FTSDNGAVIDGPLPLNG QK Sbjct: 321 KTGSQTADNFYASIYAVDQGVKRLLAQLKANDQYDNTLIMFTSDNGAVIDGPLPLNGEQK 380 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGN--YDKLISAMDFYPTALDAADISIPKDLKLDGV 419 G+KSQ GG HTPMF+WW G+ ++KL S+MDF+PTALDAA I IP+ LDGV Sbjct: 381 GFKSQVLSGGLHTPMFVWWNGRFHKTTKEFNKLTSSMDFFPTALDAAGIKIPEG--LDGV 438 Query: 420 SLLPWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL 478 SLLP+L +K PHK+L WI Y+H FDE+NIPFW+NYHK+VR +SDDYP NP TE Sbjct: 439 SLLPYLNGEKTNSSPHKSLVWIAPYAHHFDEKNIPFWNNYHKYVRSESDDYPINPYTEQF 498 Query: 479 SQFSYTVRNNDYSLVYTVENNQLGLYKLTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 S FS+ VR + +SL+Y E+ ++GLYKL D++ ++ ++ P VV M+ + EF + S+ Sbjct: 499 SDFSWAVRTDRFSLIYNPEDKKIGLYKLEDVRHENEISEQYPNVVSAMKNDLAEFANKSK 558 Query: 539 PPLSEVNQEKFNNI 552 P+S+ N +KFN + Sbjct: 559 MPISKDNYDKFNKV 572 >UniRef50_B9XGT6 Sulfatase n=3 Tax=Bacteria RepID=B9XGT6_9BACT Length = 477 Score = 502 bits (1294), Expect = e-140, Method: Composition-based stats. Identities = 126/539 (23%), Positives = 204/539 (37%), Gaps = 111/539 (20%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 ++ + + + F + + KPNI+ + DDLGY + + Sbjct: 1 MRFLLSLLLMAVFCLSTKAAN-KPNIVFILADDLGYTDVACYGSKY-------------- 45 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY--- 151 TP + L +G++FT+G+ P+RA++M+G+ R GVY Sbjct: 46 ------------YETPNIDKLAKDGIKFTDGHTCGPNCQPTRASLMSGQYGPRTGVYTVG 93 Query: 152 ------------SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 + +PL + L + + GY T GKWHL + Sbjct: 94 SIDRFAWQTRSLHPVENVTKLPLDKITLAQSLKKAGYATGMFGKWHLGED---------- 143 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLT 259 +E P RGFD + + +P + P Y++D LT Sbjct: 144 -------------KEHHPAQRGFDEALVSMGVHFDFVTNP-----KVDYPKDEYLADFLT 185 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASVYS 317 D+A+ + R K D+PF LYL + A H P ++ + Y A + S Sbjct: 186 DKALDFIKRHK--DEPFFLYLPHYAVHKPLQAKKELIQKFSAKQGVDGHHNPTYAAMIAS 243 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID---------GPLPLNGAQKGYKSQTY 368 VD+ V R++ L + DNT+++F+SDNG V G + N +G K Y Sbjct: 244 VDESVGRVVALLDELKLSDNTLVIFSSDNGGVGGYQREGIKKAGDVTDNNPLRGGKGMLY 303 Query: 369 PGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ- 426 GG P W GK+ G D+ I ++D YPT L+ A P+ LDG S L L+ Sbjct: 304 EGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYPTLLELAGAKAPEKYPLDGTSYLKVLKS 363 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + + W + VR Sbjct: 364 GGMKKLNRDAIYWHFPGYLGAGADTWRTL-------------------------PVGVVR 398 Query: 487 NNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 D+ L+ E+++L LY L DL + +NLAA P+ +E++ + + Q P+ Sbjct: 399 CGDWKLMEFFEDHRLELYNLREDLGETNNLAAKMPEKAQELEKKLVAWQKEVQAPMPTA 457 >UniRef50_D1P6M6 Putative sulfatase YdeN n=2 Tax=Providencia RepID=D1P6M6_9ENTR Length = 549 Score = 500 bits (1288), Expect = e-140, Method: Composition-based stats. Identities = 207/528 (39%), Positives = 309/528 (58%), Gaps = 7/528 (1%) Query: 26 AAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + T+ T KPN++++ MDDLG GQL F S D + Sbjct: 17 LPSVKKSLLAGLIATSCLVPPIAANAGGTPEKPNVLLIVMDDLGTGQLDFVLDSLDVNEL 76 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 R Y I+K +EAA+ + P + + G++ TN +VAH V GPSRA I TGR+P Sbjct: 77 SKRPAPSRYDGDINKMVEAARIAMPNVSEMAAGGIKMTNAFVAHPVCGPSRAGIFTGRSP 136 Query: 146 ARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKI-SNVPVPEDKQTRDYH 204 A FG YSN DA GIP LP LFQ GY TA++GKWH +K+ + EDKQTRDYH Sbjct: 137 ASFGTYSNDDAMLGIPEDIKLLPALFQEDGYATASIGKWHNAKVIKKPKIAEDKQTRDYH 196 Query: 205 DNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIG 264 DN + + P RGFDY ++A+G A +NSP++++N E VPA GYI+ LTDE I Sbjct: 197 DNMISTPEPGFAPHERGFDYAYSYYASGAALWNSPAIWRNGENVPAPGYITHLLTDETIK 256 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 +D K D+PF + L+Y+ PH+P + +P +Y +FNTG+ AD Y+A++ + D+G+ + Sbjct: 257 FIDGHK--DKPFFINLSYSVPHIPLEEASPAKYMDKFNTGNVEADKYFAALNAADEGIGK 314 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 I+ LK+NG+ +NT+I F SDNGAV + P+P+N KG+K Q + GG P +W G + Sbjct: 315 IITTLKENGELENTLIFFISDNGAVHESPMPMNAMDKGFKGQMFNGGVSVPFVAYWPGHI 374 Query: 385 QPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G D ++SA+D PTAL +A I+IP LK++G +++P LQ K Q PH+ L W Sbjct: 375 PAGKQSDAMVSAIDILPTALQSAGITIPDSLKVEGKNIMPLLQGKTQKSPHQYLYWTGPG 434 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-YTVENNQLG 502 + + EEN FW YH+++ +Q + P NPN E LS+ S+ VR+ +++L Y NQ Sbjct: 435 TKHYSEENQDFWHGYHEWITYQRKEAPKNPNLEKLSKGSWAVRDGEWALYFYDDGTNQPK 494 Query: 503 LYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKF 549 L+ D + +LA+ P+ VK+++ +++ P+ Q+++ Sbjct: 495 LFNDKQDPSESIDLASKYPEKVKQLKSAYYQWVKDQPKPV-VWGQDRY 541 >UniRef50_C5BEH4 Sulfatase, putative n=37 Tax=Gammaproteobacteria RepID=C5BEH4_EDWI9 Length = 539 Score = 496 bits (1277), Expect = e-138, Method: Composition-based stats. Identities = 211/541 (39%), Positives = 328/541 (60%), Gaps = 8/541 (1%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYS-TKGKPNIIVLTMDDLGYGQLPFDKGSF 80 M F++ L A S + + T +PN++++ MDDLG GQL F + Sbjct: 1 MTFFSSKKKQLAGLVAAACLCGASSASAAPAAATDSRPNVLLVIMDDLGTGQLDFALDAL 60 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 D K + R V + Y+ +DK I+AA+++ P + L ++GV+ TN +VAH V GPSRA I Sbjct: 61 DTKALGKRPVAERYQGDLDKMIDAARRAMPNVAQLANQGVKMTNAFVAHPVCGPSRAGIF 120 Query: 141 TGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSK-ISNVPVPEDKQ 199 TGR PA FG YSN DA G+PL T LP LFQ +GY TA +GKWH ++ V + Q Sbjct: 121 TGRYPASFGTYSNDDAMLGVPLDITLLPALFQENGYATANIGKWHNARIDKKNFVDKADQ 180 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLT 259 TRDYHDN + S + P++RGFDY ++A+G A +NSP++++N + V A GY++ LT Sbjct: 181 TRDYHDNMISVSEPGYGPESRGFDYSYSYYASGAALWNSPAIWQNGKNVAAPGYLTHNLT 240 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVD 319 +E + +D + +PF + LAY+ PH+P + +P +Y +F+TG+ AD Y+A+V + D Sbjct: 241 NETLKFLDDHQ--GKPFFISLAYSVPHIPLEQASPARYMDKFHTGNAEADKYFAAVNAAD 298 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW 379 +G+ +I+E+LK G+ DNT+I F SDNGAV + P+PLNG +G+K Q + GG H P + Sbjct: 299 EGIGQIIERLKALGELDNTLIFFISDNGAVHESPMPLNGMDRGFKGQMFNGGVHVPFVAY 358 Query: 380 WKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT 438 W + G + ++SA+D PTAL AA I+IP +K+DG +LP L K Q PH+ L Sbjct: 359 WPKHIPAGTQSNVMVSAIDILPTALKAAGITIPDAMKVDGRDILPQLSGKAQTSPHRYLF 418 Query: 439 WITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-YTVE 497 W + + EEN PFW +Y K++ +++ P NPN E LS S+ VR+ +++L Y Sbjct: 419 WAGPGAKHYSEENQPFWFDYWKWITYEAPMPPKNPNLEKLSPSSWAVRDGEWTLYFYDDG 478 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKAL 556 +N++ L+ D + +LAA PQ V EM+ ++I + P++ Q++++ ++++ Sbjct: 479 SNRVQLFNDRLDPAESQDLAAKYPQRVAEMKAAYHDWIKTKPKPVA-WGQDRYHILEQSA 537 Query: 557 S 557 Sbjct: 538 R 538 >UniRef50_B9XK50 Sulfatase n=2 Tax=Bacteria RepID=B9XK50_9BACT Length = 500 Score = 489 bits (1259), Expect = e-136, Method: Composition-based stats. Identities = 133/576 (23%), Positives = 217/576 (37%), Gaps = 123/576 (21%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M V + + T +PN + + DDLG+ + F+ +F Sbjct: 3 MKIMKTAVERIVFGGNLVWALLLTSLCATRVHAADRPNFVFILADDLGWKDVGFNGSTF- 61 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 TP L L EG+RFT+ Y A V P+RA+IMT Sbjct: 62 -------------------------YETPNLDRLAREGMRFTDAYAACSVCSPTRASIMT 96 Query: 142 GRAPARFGVYSNTDAQ--------------DGIPLTETFLPELFQNHGYYTAAVGKWHLS 187 G+ PAR + + +P E L + Q GY TA +GKWHL Sbjct: 97 GKYPARLHLTDWLPGRPDKPDQILKHPKIITELPAAEITLAKALQEGGYKTAFIGKWHLG 156 Query: 188 KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA-AGTAYYNSPSLFKNRE 246 + P+ GFD +G + Y SP + Sbjct: 157 GLG------------------------HWPEQAGFDINIGGCGMGHPSSYFSPYKNPTLK 192 Query: 247 RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA--PDQYQKQFNTG 304 P Y++D+LTDEA+ ++ K PF+LYL++ + H P ++YQK+ Sbjct: 193 DGPVGEYLADRLTDEAVKFIENTK--GTPFLLYLSHYSVHTPLQAKKGLIEKYQKKVMQL 250 Query: 305 S------------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 Q Y A + S+D+ V R+L++LK+ G NT+I+FTSDN Sbjct: 251 PPTKGPEFVTEGNTNARQVQNQPIYAAMMQSLDESVGRVLDKLKELGLDKNTVIIFTSDN 310 Query: 347 GA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTAL 403 G +G N + K Y GG P+ + W G + + + + D+YPT L Sbjct: 311 GGLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPTLL 370 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 + A + + LDG+S P L+ K+ G + L W + Sbjct: 371 EIAGLPARPEQHLDGISFTPALRGKEMG--ERPLFWHYPHYSNQGGA------------- 415 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQV 522 S ++R D+ L+ E N++ L+ L D+ +K++LA+ + Sbjct: 416 -----------------PSSSIRKGDWKLIEWYEENRIELFNLRLDVGEKNDLASTSALK 458 Query: 523 VKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSE 558 +E++ ++ + S + + N Sbjct: 459 REELKSELQAWRASVKADMPLPNPNFDPKADGPFKR 494 >UniRef50_D2R322 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R322_9PLAN Length = 513 Score = 488 bits (1257), Expect = e-136, Method: Composition-based stats. Identities = 133/576 (23%), Positives = 206/576 (35%), Gaps = 130/576 (22%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M A + A + P+ + + +PNI+ +DDLG L +F Sbjct: 1 MKPSHLSAIRLSLIYAVVSTFLCCATLPSTIAAEQQPNIVFFLVDDLGQRDLGCYGSTF- 59 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 TP + L +G RFT Y A V P+RA+I+T Sbjct: 60 -------------------------YETPNIDKLAADGARFTQAYAACPVCSPTRASILT 94 Query: 142 GRAPARFGVYS-----NTDAQ---------------DGIPLTETFLPELFQNHGYYTAAV 181 G P R G+ N++ D + L L + ++ GY T Sbjct: 95 GLWPQRTGITDYIATDNSNGPAKWNRNTMTLPAAYRDRLALDSPTLAKSLKSAGYATFFA 154 Query: 182 GKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY----YN 237 GKWHL E + P+N+GFD G G Y Y Sbjct: 155 GKWHLG------------------------PEGFYPENQGFDINRGGIERGGPYGGKQYF 190 Query: 238 SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--D 295 SP PA ++ D+L E ++ + QPF Y ++ + H P Sbjct: 191 SPYGNPRLTDGPAGEHLPDRLATETCQFIEAHQK--QPFFAYFSFYSVHTPLQAREDLRQ 248 Query: 296 QYQKQFNTGS----------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 +Y + Q Y A V ++DQ V ++L +L + G +NT+ Sbjct: 249 KYVAKREKLGLKPTWGREHMRDVRQVQEHAVYAAMVDAMDQAVGKVLAKLDELGLRENTL 308 Query: 340 ILFTSDNGAVIDGPL--PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAM 396 ++FTSDNG + N +G K Y GG P+ M W K++ G D +S+ Sbjct: 309 VIFTSDNGGLSTSEGWPTSNLPLRGGKGWMYEGGIREPLVMRWPAKVKAGSTIDTPVSSP 368 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 DF T L A + ++DGVSLLP L +K ++L W + Sbjct: 369 DFMATLLAATATKPAEQQQIDGVSLLPLLAGEK--LKERSLFWHYPHYGNQGGA------ 420 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNL 515 + +R + L+ +E+ Q+ L+ L TD + NL Sbjct: 421 ------------------------PAAAIRRGSWKLIEWLEDGQVELFNLATDESETTNL 456 Query: 516 AAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 A+ P +V+EM + + L E N Sbjct: 457 ASKEPALVREMLAELHAWQKEVGAILPEKNPNYDPA 492 >UniRef50_UPI00016C4991 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4991 Length = 596 Score = 486 bits (1252), Expect = e-136, Method: Composition-based stats. Identities = 142/543 (26%), Positives = 214/543 (39%), Gaps = 116/543 (21%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 V F S GKPN++++ +DDLG L +F Sbjct: 4 AFAVLALGFFALPASAAGKPNVVLIVIDDLGQRDLGCYGSTF------------------ 45 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD 158 TP + + +GVRFT+ Y A V P+RA+IMTG+ P R G+ + Sbjct: 46 --------YKTPNIDRMAKDGVRFTDFYAACPVCSPTRASIMTGKYPQRVGITDWLPGRK 97 Query: 159 GIP--------------LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYH 204 +P L E + E + HGY TA +GKWHL Sbjct: 98 DLPGQRLKRPELKNELALEEVTVAETLKGHGYVTAHIGKWHLGG---------------- 141 Query: 205 DNFTTFSAEEWQPQNRGFDYFM-GFHAAGTAYYNSPS------LFKNRERVPAKGYISDQ 257 + ++P+ +GFD + G H Y +P E+ Y++D+ Sbjct: 142 --------KGFEPEKQGFDVNVAGDHTGTPLSYFAPFANKAGATMPGLEKAAPDEYLTDR 193 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYASV 315 L EA + K D+PF LYL + H P P P D+Y+ Q G Q+ Y A V Sbjct: 194 LAAEAETFITANK--DKPFFLYLPHYGVHTPLRAPQPLVDKYKTQAVHGRQSNPVYAAMV 251 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-----PLNGAQKGYKSQTYPG 370 S+D V R+L++L DNT++LFTSDNG + +N + K Y G Sbjct: 252 ESMDAAVGRVLKRLDDLKLSDNTLVLFTSDNGGLATLEGMPFAPTINAPLREGKGYLYEG 311 Query: 371 GTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P+ W GK++PG D++ ++DF+ T L+A + DGVSL+P +K Sbjct: 312 GVRVPLIAKWPGKVKPGTVMDQVACSIDFFDTILEATGATSAARR--DGVSLVPAFGGEK 369 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 + L W + S+ VR + Sbjct: 370 LKP--RALYWHYPHY------------------------------ANQGSRPGGAVRAGN 397 Query: 490 YSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 Y LV E+ + L+ + DL + NLAA P VVK++ + + + N + Sbjct: 398 YKLVEYYEDGRRELFDVAKDLSESRNLAADKPDVVKDLAAKLDAWRTDVGAKMPTPNPDY 457 Query: 549 FNN 551 N Sbjct: 458 RPN 460 >UniRef50_A6CBI6 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CBI6_9PLAN Length = 599 Score = 485 bits (1249), Expect = e-135, Method: Composition-based stats. Identities = 133/516 (25%), Positives = 203/516 (39%), Gaps = 88/516 (17%) Query: 43 AFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 + +PN++++ DD G+G + Sbjct: 16 TLILSRGSFLQAAERPNVLLIMTDDQGWGDVRSH-------------------------- 49 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL 162 + TP L +G RF YV V P+R++++TGR R GV+ T + + Sbjct: 50 DNPLIETPQQDLLASQGARFERFYV-SPVCAPTRSSLLTGRYSLRTGVHGVTRGFENMRA 108 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 ET + E+F+ GY T A GKWH + P +GF Sbjct: 109 EETTIAEMFKAAGYKTGAFGKWHNGRHY-----------------------PMHPNGQGF 145 Query: 223 DYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 D F GF Y +L N++ V +GYI+D LTD AI + + K DQPF Y+ Y Sbjct: 146 DEFFGFCGGHWNRYFDTNLEHNKQPVKTEGYITDVLTDRAIDFIKQNK--DQPFFCYVPY 203 Query: 283 NAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 NAPH P P + A YA V VD + R+++ L DNTI+LF Sbjct: 204 NAPHSPWIVPEKYWDKYANKGLDDKARCAYAMVECVDDNLGRLMQTLDDLKLSDNTIVLF 263 Query: 343 TSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI-SAMDFYPT 401 +DNG + NG +G K + GG P+F+ + GK++ G K I + +D PT Sbjct: 264 LTDNGPNSN---RYNGNMRGRKGSIHEGGIRVPLFVRYPGKIKAGTVVKPIAAHIDILPT 320 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKK-QGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 L+ + D LDG SL+P L +K + P + L Sbjct: 321 LLELCSVENTADQPLDGKSLVPLLTNKSNKDWPQRMLF---------------------- 358 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAAN 519 D + D + +VR + + Y E + LY + D QK N+ A+ Sbjct: 359 ------SDRLFRNSIPDDELPNGSVRTDRWRAAY--ERGKWSLYDMQADPSQKQNVIEAH 410 Query: 520 PQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 P V+K++ R++ E + K+ Sbjct: 411 PAVIKDLSAAYRDWFKDVSQAGFEPIPIPAGHPKEQ 446 >UniRef50_C1ZKY2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZKY2_PLALI Length = 483 Score = 483 bits (1244), Expect = e-135, Method: Composition-based stats. Identities = 142/523 (27%), Positives = 211/523 (40%), Gaps = 112/523 (21%) Query: 39 KTNVAFSDFTP--TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 + AF T +PNI+++ DD+GY + F Sbjct: 12 ISAFAFCMLALVITPVIAADRPNILLIVGDDMGYADVGFHGC------------------ 53 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA 156 TP L +L GV+FT+GYV P+RA ++TGR RFG N Sbjct: 54 --------KDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNPSG 105 Query: 157 -QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 G+PLTE + + + GY T VGKWHL Sbjct: 106 ANTGLPLTEVTIADRLKQVGYTTGLVGKWHLGSQ-----------------------PAM 142 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 PQ RGF+ F+GF +++++ + + E V Y +D EA+ +++ + D+P Sbjct: 143 HPQERGFEEFIGFLGGAHSFFDAQGILRGHEPVKTIDYTTDLFGREAVSFIEKHR--DKP 200 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 + LYL++NA H P D+ K + Q Y A + ++D+ + ++L QL+ GQ Sbjct: 201 WFLYLSFNAVHTPMHAT-EDRMAKLASISDQERRTYAAMMLAMDEAIGKVLTQLETTGQK 259 Query: 336 DNTIILFTSDNGAVIDGPLPLNG----AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK 391 T+++F SDNG + +NG +G K T GG P + W GK+ P +D Sbjct: 260 QKTLVMFISDNGGPTMPGVTINGSINTPLRGSKRTTLEGGIRVPFVVSWPGKIAPAVFDS 319 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + +D TAL A + KD+K DGV+LLP+LQ K+ PH L W Sbjct: 320 PVIQLDLTATALAVAGVE--KDVKSDGVNLLPYLQGKQSEVPHAALFWRFGE-------- 369 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN------------N 499 VR DY LV N Sbjct: 370 ------------------------------QMAVRAGDYKLVRYDSNADTLTGKGKQPVT 399 Query: 500 QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 LY L DL + +LAA+ P+ V E+Q + + PPL Sbjct: 400 AARLYDLKEDLGETRDLAASMPEKVAELQAQWDRWNQQNMPPL 442 >UniRef50_C1ZF72 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF72_PLALI Length = 470 Score = 481 bits (1238), Expect = e-134, Method: Composition-based stats. Identities = 145/528 (27%), Positives = 213/528 (40%), Gaps = 106/528 (20%) Query: 25 FAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKT 84 F++ + L + + T KPN+I+ DDLG+G+ Sbjct: 9 FSSICLVGICLAGISSICDLAQGAEP-TQTSRKPNVIIFYADDLGWGETGIQG------- 60 Query: 85 MENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRA 144 Q TP + S+ GVR T G+VA PSRA ++TGR Sbjct: 61 -------------------NPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRY 101 Query: 145 PARFGVYSNTDAQ-DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 P RFG N A G+ L ET L + GY TA VGKWHL Sbjct: 102 PTRFGHEFNRIANVSGLDLQETTLADRLHGLGYKTACVGKWHLGD--------------- 146 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNR-------ERVPAKGYISD 256 E++P RGFD F G A + P+ F + E Y +D Sbjct: 147 --------GPEYRPTKRGFDEFFGTLA--NTPFFHPTKFVDSRVSNDVAEVSDENFYTTD 196 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASV 315 + ++ + + + P+ LYL +NA H P AP +Y +F + + A + Sbjct: 197 EYAKRSVEWIGQQQQS--PWFLYLPFNAQHAPLQ--APQKYLDRFESIADPKRKLFAAMM 252 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 ++D + ++L ++++ GQ +NT++ F SDNG G NG +G+K T+ GGT P Sbjct: 253 SAMDDAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQNGPLRGFKMTTFEGGTRVP 312 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + WKGKL G YD + +D PT L AA I KLDGV L+P+ +PH Sbjct: 313 FLVQWKGKLPAGKTYDNPVINLDVLPTVLTAAGSKIDPAWKLDGVDLVPYFTSSIANKPH 372 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + L W + VR D+ LV Sbjct: 373 ETLYWRFGE--------------------------------------QWAVRQGDWKLVV 394 Query: 495 -TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + Q LY L +D+ + NLA+ NP VKE+Q + ++ P Sbjct: 395 ARGGSGQPELYDLASDIAESKNLASENPAKVKELQALWDQWSHEQAAP 442 >UniRef50_Q7UGD7 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UGD7_RHOBA Length = 543 Score = 478 bits (1232), Expect = e-133, Method: Composition-based stats. Identities = 149/532 (28%), Positives = 234/532 (43%), Gaps = 110/532 (20%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + K +PNI+++ DDLGY + F+ + T Sbjct: 37 SVVGAKDRPNIVLIVADDLGYSDVGFNGC--------------------------KEIPT 70 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--------GIP 161 P L L GV FTNGY +H PSRA ++TGR RFG SN + G+P Sbjct: 71 PHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHGSNPEPDTQWHGEDTPGMP 130 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 L+ET L + + GY T A+GKWHL A+ + P RG Sbjct: 131 LSETTLADALKEAGYVTGAIGKWHLGD-----------------------AKPFWPNRRG 167 Query: 222 FDYFMGFHAAGTAYYNSPSL-------FKNRERVPAK--GYISDQLTDEAIGVVDRAKTL 272 FD + GF G +Y+ + + E V K +++D + EA+ + R +T Sbjct: 168 FDEWFGFSGGGFSYWGDLGMKDPLLGVHRGDEPVDPKTLTHLTDDFSTEAVKFIQRHET- 226 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 +PF LYLAYNAPH P+ QK + Y A V +D+G+ R+++Q++++ Sbjct: 227 -EPFFLYLAYNAPHAPDHATR-AHLQKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRES 284 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDK 391 G +NT+I+F SDNG + +N +G+K + GG P + W G ++ G + Sbjct: 285 GLGENTMIIFYSDNGGRRE--HAVNFPYRGHKGMLFEGGIRVPFLVSWPGTVRSGMKEES 342 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 I+A+D +PTAL AA + ++ KLDG +LLP L D KQ P + L W S Sbjct: 343 PITALDLFPTALAAAGMDPSQNDKLDGQNLLPVLTDDKQRLPERPLFWRYSM-------- 394 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 + Y VR+ ++ L+ + ++ L+ L D Sbjct: 395 -------------------------GDDSYGYAVRDGNWKLIDSRYKDRKLLFDLANDPW 429 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPP----LSEVNQEKFNNIKKALSE 558 ++++LAA +P+ V + ++ + + PP VN K N + E Sbjct: 430 EREDLAAQHPEQVARLSRMMEAWDARNVPPKWSDAHGVNVRKEENTRNEAVE 481 >UniRef50_D2QWC8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QWC8_9PLAN Length = 468 Score = 477 bits (1229), Expect = e-133, Method: Composition-based stats. Identities = 145/532 (27%), Positives = 218/532 (40%), Gaps = 107/532 (20%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 L+A + T + + +PNI+V+ DD+GY L Sbjct: 6 SLRALVALGLLTAATTSMAADASRPNIVVIVGDDMGYHDLGVHGC--------------- 50 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP L +L GVR T+GYV+ P+RA ++TGR RFG N Sbjct: 51 -----------KDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFN 99 Query: 154 TD----AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + G+PL+ET L + + GY T VGKWHL Sbjct: 100 PGPTPTGEIGLPLSETTLADRLKKVGYKTGMVGKWHLGNDEKR----------------- 142 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-------SLFKNRERVPAKGYISDQLTDEA 262 P +RGFD F GF Y+ +P L + RE V K Y++D EA Sbjct: 143 ------HPLSRGFDEFFGFLGGARTYFATPGNASAGTKLLRGREVVDEKEYLTDAFAREA 196 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQG 321 + +DR+K PF LYL +NA H P + A +Y +F Y A + ++D Sbjct: 197 VAYIDRSKAS--PFFLYLTFNAVHTPME--ASQKYLDRFTAVSDPKRQKYCAMMSAMDDA 252 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 V +++ +L++ +NT+I F SDNG N +G+K+ T+ GG P F+ WK Sbjct: 253 VGQVVAKLEREKLLENTLIFFVSDNGGPTAANTGDNTPLRGFKATTWEGGIRVPYFVSWK 312 Query: 382 GKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 GK+ G YD+ + +DF PTAL A P K DGV+LLP+L + + PH +L W Sbjct: 313 GKIPAGKTYDQPVIQIDFVPTALAA--AGAPAAEKTDGVNLLPYLTFENKEAPHASLFWR 370 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 +R+ +Y LV T + ++ Sbjct: 371 FG--------------------------------------PQTAIRHGNYKLVMTRDLDK 392 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 LY L D+ + +L+A P++V ++ + + P N Sbjct: 393 PALYDLAADISETKDLSADKPEIVAQLTAAYDAWNQENIPAAWGAPSRAIGN 444 >UniRef50_C7PJ01 Sulfatase n=2 Tax=Bacteroidetes RepID=C7PJ01_CHIPD Length = 452 Score = 474 bits (1221), Expect = e-132, Method: Composition-based stats. Identities = 122/535 (22%), Positives = 210/535 (39%), Gaps = 114/535 (21%) Query: 33 VKLKATKTNVAFSDF--TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 ++++ VA S F P + +PN++++ DD G + Sbjct: 1 MRIRRLSAMVALSCFMAAPLFAQQQKRPNVLIIYTDDQGTLDVNCYG------------- 47 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 A TP + L EGV F+ Y A V PSRA+++TGR P R + Sbjct: 48 -------------AKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGRYPQRAQL 94 Query: 151 YSNTD---AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 +N G+P ++ + E+F++ GY TA +GKWH+ Sbjct: 95 DNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIG-------------------- 134 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---------PSLFKNRERVPAKG-YISDQ 257 + E P +GFDY GF Y+ L++N + + G + +D Sbjct: 135 ---YSPETMPNQQGFDYSFGFMGGCIDNYSHYFYWAGPNRHDLWRNGQEIWEDGKFFADL 191 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 E G +++ K D+PF LY A N PH P +++++ + Y A+V + Sbjct: 192 TVQEVNGFLEKNKRADKPFFLYWAINMPHYPLQGQ--EKWRQYYKDLPAPRRMYAAAVST 249 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID----GPLPLNGAQKGYKSQTYPGGTH 373 +D+ + ++L+QL + G +NTI++F SD G + G G +G K + GG Sbjct: 250 MDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFGGGGFTGPYRGAKFSLFEGGIR 309 Query: 374 TPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P + W G L D+L +D+YPT +++P+ K+DG + + K Sbjct: 310 VPAIIRWTGHLPKNEVRDQLCVNIDWYPTLAGLCKVALPQ-RKIDGKDIQQVITSSKTSS 368 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 PH W + + VR ++ L Sbjct: 369 PHDIFFWQSQ---------------------------------GTKENPQWAVRQGNWKL 395 Query: 493 VYTV------ENNQLGLY--KL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 ++ E L+ L D + NLAA +P++V ++ ++I+ Sbjct: 396 LHNPSSAKKAETGPDDLFLVNLQQDTSEAKNLAAQHPEIVSSLKEQYLKWINEVV 450 >UniRef50_C1ZAC9 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZAC9_PLALI Length = 479 Score = 473 bits (1219), Expect = e-132, Method: Composition-based stats. Identities = 143/554 (25%), Positives = 223/554 (40%), Gaps = 124/554 (22%) Query: 23 AAFAAHAADDVKLK--ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSF 80 + +H A + L A + + + S G+PNI+V+ DDLGY L G Sbjct: 1 MSLGSHPAIALWLALVAFCSQALLAAEDVNQTSKSGRPNILVIMADDLGYADLGVQGGC- 59 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 + TP L L G+R TN YV+ PSRA + Sbjct: 60 -------------------------EIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFL 94 Query: 141 TGRAPARFGVYSNTD----AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE 196 TG+ RFG N A+ G+PL E + L Q GY TA +GKWH Sbjct: 95 TGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQG--------- 145 Query: 197 DKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-------------YNSPSLFK 243 +++ PQ+RGFD F GF G Y ++ +++ Sbjct: 146 --------------FSKDHHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYR 191 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 RE P +GY +D T+EA+ + ++P+ LYL+YNA H P + Q + + Sbjct: 192 GREVEPQEGYATDLFTNEALRWMSG--PPNKPWFLYLSYNAVHTPLEIAPHLQKRIPESV 249 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV-----IDGPLPLNG 358 Y + + +D + RI + L ++G + T+I+F SDNG + LN Sbjct: 250 KLPARRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNH 309 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAAD----ISIPKD 413 +G K QT GG P F+ W G+L ++ I ++D PT A P Sbjct: 310 PLRGDKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLAANNPAKPQPLP 369 Query: 414 LKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNP 473 +DGV+L+P+ ++ G PH++L W Sbjct: 370 QGIDGVNLMPYWLGQRSGAPHESLFWRFG------------------------------- 398 Query: 474 NTEDLSQFSYTVRNNDYSLVYTVE-----NNQLGLYKL-TDLQQKDNLAAANPQVVKEMQ 527 VR ++ LV + N+ LY L TD+ +K+NLA +P++V ++ Sbjct: 399 -------PQKAVRAGNWKLVDWRDFPASKNSGWELYDLSTDISEKNNLAETHPEIVARLK 451 Query: 528 GVVREFIDSSQPPL 541 ++ S+ PL Sbjct: 452 TSWEKWNQSNIEPL 465 >UniRef50_A3ZUT0 Arylsulphatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZUT0_9PLAN Length = 457 Score = 471 bits (1214), Expect = e-131, Method: Composition-based stats. Identities = 120/529 (22%), Positives = 197/529 (37%), Gaps = 108/529 (20%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 V + + KPNI+ + +DD+G Sbjct: 11 IAAILVLLASGALHSDAAPTKPNIVFILIDDMGCKDAGCYG------------------- 51 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---- 152 A STP + L ++G+RFT+ Y A V P+RA++MTG+ PAR + + Sbjct: 52 -------ATNFSTPHIDRLANQGMRFTDAYAA-PVCSPTRASLMTGKHPARLHLTNFIPQ 103 Query: 153 -----------NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 +PL E + + GY A +GKWHL + Sbjct: 104 IGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEHG---------- 153 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP------SLFKNRERVPAKGYIS 255 E++PQNRGFD + G Y P + Y+ Sbjct: 154 -----------PEYRPQNRGFDRVVLSEHHGIFNYFYPFVDQQKWPYAGPLPGNPGDYLP 202 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASV 315 D+LTDEAI V + ++PF LYL++ + H P + + + Y A + Sbjct: 203 DRLTDEAIDFVRENR--ERPFFLYLSHWSVHGRYFAPESLIAKYRERGLEERPAIYAAMM 260 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 +VD V R++ L + DNT+ +F SDNG + +G K Y GG P Sbjct: 261 ETVDNSVGRLMATLDELNLADNTLFVFMSDNGGER---ITSMAPLRGSKGSLYEGGVRVP 317 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + + + G ++P + + D +PT LD A+ S + KLDG S+ L ++ Sbjct: 318 LIVRYPGVVKPNTTCSVPVISHDLFPTFLDFAERSYRDN-KLDGHSIAGLLTGEQSELDR 376 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 L W H P+ ++ +R + LV Sbjct: 377 DALYW-------------------------------HFPHYWGSTRPCSAMRQGRWKLVE 405 Query: 495 TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 +E + LY L +D ++ +LA PQ E++ ++ ++ + Sbjct: 406 HLETGRAQLYDLSSDPGEQRDLANEMPQQATELRKMLAQWRTKVGAQMP 454 >UniRef50_B8HPF9 Sulfatase n=2 Tax=Bacteria RepID=B8HPF9_CYAP4 Length = 495 Score = 469 bits (1208), Expect = e-130, Method: Composition-based stats. Identities = 126/519 (24%), Positives = 205/519 (39%), Gaps = 116/519 (22%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 + P+I+ + DD G+ + F Sbjct: 39 AVAQQSSQPPHILFIMSDDQGWKDVGFHGS---------------------------DIR 71 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTET 165 TP L L G R Y + PSRAA++TGR P R+G+ + + + G+P E Sbjct: 72 TPNLDQLAKTGARLEQYYS-QPMCTPSRAALLTGRYPHRYGLQTLVIPSAGKYGLPTDEY 130 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 LP+ + GY TA VGKWHL ++ P+ RGFDY Sbjct: 131 LLPQALKEAGYETAIVGKWHLGHAD----------------------PKYWPRQRGFDYQ 168 Query: 226 MGFHAAGTAYYNSP-----SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 G Y+ ++N + + +GY++ L +A+ ++++ P LYL Sbjct: 169 YGPLLGEIDYFTHSAHGKVDWYRNNQLIKEEGYVTTLLGQDAVKLIEKH-NPKTPLFLYL 227 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 A+ APH P AP +Y Q+ T Y A + ++D + +++ L+K G +NT+ Sbjct: 228 AFTAPHAPYQ--APQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTL 285 Query: 340 ILFTSDNGAVIDGPLPL------------NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 I+F SDNG NG + K+ Y GGT W GK+QPG Sbjct: 286 IVFQSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPG 345 Query: 388 NY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 + I +D YPT A +S+ K+ LDG+++ P L + K + + Sbjct: 346 TVVNHPIHIVDMYPTLTGLASVSVGKNKPLDGLNIWPALSEAKPSPRSQVVY-------- 397 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE-NNQLGLYK 505 D+ F + D+ LV+ ++L L+ Sbjct: 398 ------------------------------DIEPFRAALSQEDWKLVWKATLPSRLELFN 427 Query: 506 LT-DLQQKDNLAAANPQVVKEMQGVVREF-IDSSQPPLS 542 L+ D+ ++ NLA NP++V ++ + D+ PPL Sbjct: 428 LSQDVSEQTNLAEQNPEIVSRLKQQIEVLSRDAVLPPLF 466 >UniRef50_A6LED1 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED1_PARD8 Length = 459 Score = 468 bits (1206), Expect = e-130, Method: Composition-based stats. Identities = 126/537 (23%), Positives = 201/537 (37%), Gaps = 131/537 (24%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 T ++ S KPN +++ DD+GYG L Sbjct: 10 LTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYG------------------- 50 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---- 152 TP + + EG++ T YV GVS PSRAA+MTGR P R G+Y Sbjct: 51 -------NPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVA 103 Query: 153 --NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 +++ G+ E + ++ Q GY T VGKWHL S Sbjct: 104 VLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFS-------------------- 143 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTA-----------YYNSPSLFKNRERV---PAKGYISD 256 + P + GFD + G + + L + +++ P +G ++ Sbjct: 144 ---PYLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTR 200 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVY 316 + T++A+ + +PF LY A+ PH+P Y G+ Y V Sbjct: 201 RYTEKAVSFIKNH--SKEPFFLYFAHTFPHIPL-------YTNARFEGTSKRGLYGDVVE 251 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--LPLNGAQKGYKSQTYPGGTHT 374 +D V +L+ L++NG +NT ++FTSDNG + G K K + GG Sbjct: 252 EIDWSVGEVLKALRENGLDENTFVIFTSDNGPWLTEHENGGSAGPLKDGKGTWWEGGFRV 311 Query: 375 PMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 P W GK+ P D+++++MD YPT L A I PKDL LDGV+ L ++K Sbjct: 312 PAICWMPGKINPAINDEIMTSMDLYPTFLSMAGIEQPKDLVLDGVNQTGLLFEEKHSARD 371 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + W S +R ++ + Sbjct: 372 EVYYWWGSEL--------------------------------------MAIRKGEWKYYF 393 Query: 495 TVENNQ------------LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 +Q LY + TD+ ++ NLA +P++VK + + + Sbjct: 394 KTIKDQYLRTCKIETPAEPLLYNVETDISERFNLADKHPEIVKLLIEAGEKHKKGMK 450 >UniRef50_A6BZT7 Putative arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZT7_9PLAN Length = 459 Score = 468 bits (1205), Expect = e-130, Method: Composition-based stats. Identities = 128/535 (23%), Positives = 191/535 (35%), Gaps = 117/535 (21%) Query: 44 FSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIE 103 E + K KPNII + DDLGY +L Sbjct: 3 LLASVRLEATEKQKPNIIFIMADDLGYAELGCYG-------------------------- 36 Query: 104 AAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLT 163 + TP + L EG++FT Y V PSR+ +MTG+ V +N D + Sbjct: 37 QKKIKTPHIDKLAAEGMKFTQAYAGSMVCQPSRSVLMTGQHTGHTAVRAN-DLNQLLYEE 95 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 +T + E+ + GY T A GKW L +P +GFD Sbjct: 96 DTTVAEVLKIAGYATGAFGKWGLGYEGT----------------------PGRPGQQGFD 133 Query: 224 YFMGFHAAGTAYYNSPSLFKN---------RERVPAKGYISDQLTDEAIGVVDRAKTLDQ 274 F G A++ P N E YI D + ++A + + K Q Sbjct: 134 DFTGQLLQVHAHFYYPFWIWNNEHRLMLPENENNQRGRYIHDLIHEDAKAFIQKNKA--Q 191 Query: 275 PFMLYLAYNAPHLPNDNPAPDQ--YQKQFNTGS--QTADNY----------YASVYSVDQ 320 PF YL Y PH+ P + Y+ QF Y V +D Sbjct: 192 PFFAYLPYIIPHVELVVPEESEKPYRGQFPKKQILDPRPGYIGSEDGLTTFAGMVSRLDD 251 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGA------VIDGPLPLNGAQKGYKSQTYPGGTHT 374 V I+ L+ G DNT+I+FTSDNG + N +G+K Y GG Sbjct: 252 HVGEIVTLLEDLGIRDNTLIIFTSDNGGQGGTWKEMTDFFNGNAPLRGHKGSMYEGGIRV 311 Query: 375 PMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P W GK+ G L I+ D PT A ++P + +DG+S LP L K + Sbjct: 312 PFIANWPGKIAAGKTSDLQIAFWDVLPTLAQVAGTTVPSGVDIDGISFLPTLLGKGKQPE 371 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 H+ L W S +R ++ V Sbjct: 372 HEYLYWEY----------------------------------TRGKIRSRAIRQGNWKAV 397 Query: 494 YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 N + LY L TD+ + NLA +P+ +K++Q ++++ + + + Sbjct: 398 QNRMNQPIELYDLGTDIGETKNLAKQHPEKIKDLQQIMQQAHSEPRD-FPQTLKP 451 >UniRef50_D2R014 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R014_9PLAN Length = 475 Score = 468 bits (1204), Expect = e-130, Method: Composition-based stats. Identities = 126/550 (22%), Positives = 207/550 (37%), Gaps = 111/550 (20%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSF 80 + H + L + AF + + PNI+V+ +DD+G+ L ++ Sbjct: 4 LVQHLLHYLTTLTLTSCVFAAAFCATKQAFSADSTRVPNIVVILIDDMGFSDLSCMGSTY 63 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 TP++ L G+RFT+ Y A V P+RAA++ Sbjct: 64 --------------------------YETPSINKLAASGMRFTHAYSACTVCSPTRAAVL 97 Query: 141 TGRAPARFGVYSNTDAQDG-------------IPLTETFLPELFQNHGYYTAAVGKWHLS 187 TG+ PAR + Q + L E L EL HGY TA++GKWHL Sbjct: 98 TGKYPARLHLTDWIPGQMSNKTKLKLPDWNKQLNLEEITLAELLGAHGYTTASIGKWHLG 157 Query: 188 KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRER 247 E +P +GF +G ++ G +N Sbjct: 158 ------------------------PPECEPTRQGFSLNIGGNSKGQPPSYFFPYERNGVL 193 Query: 248 VP------AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQK 299 +P Y++D+LTD ++ ++ +PF LYL + H P +Y+ Sbjct: 194 LPGLAEGKPNEYLTDRLTDACEAFIEENQS--KPFFLYLPHYCVHTPLQAKPELIAKYEA 251 Query: 300 Q---FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 + F Q Y A V S+DQ V RI+ +L TI++FTSDNG ++ + Sbjct: 252 KNAQFPGNPQHEAKYAAMVESLDQSVGRIMAKLDALDLTKKTIVIFTSDNGGLVLREITS 311 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLK 415 N + K Y GG P+ + + ++PG D +MD +PT + + D Sbjct: 312 NLPARAGKGSAYEGGVRVPLIVSYPPMIKPGTTCDVPAISMDLFPTLAELSGAKYSHD-- 369 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 +DG S++P L++K + L W + H Sbjct: 370 IDGKSIVPLLEEKPDAFAARPLYWHYPHYHGGGAT------------------------- 404 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 +R +Y LV E+ +L LY L D+ + NLA P + +++ + + Sbjct: 405 -----PYSAMRVGNYRLVEFFEDGRLELYDLAHDIGEMKNLAQEKPDLTEKLHRQLIAWR 459 Query: 535 DSSQPPLSEV 544 S + Sbjct: 460 KSVDAQYATP 469 >UniRef50_UPI0000E0F7DD aryl-sulphate sulphohydrolase n=3 Tax=Proteobacteria RepID=UPI0000E0F7DD Length = 493 Score = 467 bits (1203), Expect = e-130, Method: Composition-based stats. Identities = 132/535 (24%), Positives = 208/535 (38%), Gaps = 99/535 (18%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + KPNII++ +DDLG+ + +++ + T Sbjct: 32 AVIADTTKPNIIMIVIDDLGWSDVGYNQTT-------------------------DYFET 66 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA------------- 156 P + +L +G+ F Y PSRA +M+G+ R GVY+ + + Sbjct: 67 PNIDALAQQGLVFDQAYAGAANCAPSRAVLMSGQYGPRHGVYTVSPSDRGHAKTRKLIPI 126 Query: 157 --QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 + G+ + E + GY T GKWHL Sbjct: 127 KNKRGLTTDIITIGESLKTAGYTTGTFGKWHLGAD------------------------- 161 Query: 215 WQPQNRGFDYFM-GFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 P +GFD + G H T +Y SP N E P Y++++LT E I V +K D Sbjct: 162 --PDKQGFDVNVAGSHQGMTFHYFSPYQLPNIEDGPKGEYLTERLTTEVIDWVKSSK--D 217 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTA-DNYYASVYSVDQGVKRILEQLKKN 332 QPF Y+ Y H P + Y A V +D V RI + L Sbjct: 218 QPFFAYVPYYTVHTPYQAVVDKVNKYHEKGIKSKREATYAAMVEHMDDNVGRIFDMLDSE 277 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL 392 G +NT+++FTSDNG P +G K Y GG P+ + W K++PG Sbjct: 278 GLAENTVVIFTSDNGGYRMSSFP--TPLRGGKGSYYDGGLRVPLIVRWPEKVKPGLDHTP 335 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 + DFYPT ++ P + LDGV L L + Q ++L W Sbjct: 336 VINADFYPTLVNLTKSKQP-NQVLDGVDLTAHLLGQ-QDIAERDLFWHFPVY-------- 385 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQ 511 ++ + ++ +R+ D+ L+ ENN+ LY L DL + Sbjct: 386 ---------LQAHHAPTDQGQDPLFRTRPGSAIRSGDWKLLQYFENNEFELYNLANDLAE 436 Query: 512 KDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQEKFNN-----IKKALSEAK 560 K+NLA+ +P VKE++ ++ + + ++N E I+K L +AK Sbjct: 437 KNNLASVHPSRVKELKTKLQAWQQQIGADIPTKLNPEYDAKVNQQLIRKQLLKAK 491 >UniRef50_A6CEC4 Aryl-sulphate sulphohydrolase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CEC4_9PLAN Length = 467 Score = 466 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 121/541 (22%), Positives = 199/541 (36%), Gaps = 101/541 (18%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + + + T + +PNI++ +DDLG+ + F F Sbjct: 2 LEGIPMLRTLVFCCHLSMLSQASAENQRPNIVLFFIDDLGWRDVGFMGSDF--------- 52 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 TP + L DE ++FT Y A PSRA +M+G R G Sbjct: 53 -----------------FETPHIDRLADESMKFTAAYSAAPNCAPSRACLMSGLYTPRHG 95 Query: 150 VYSNTDAQDG---------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 VY+ D G + T + + GY A+VGKWHL Sbjct: 96 VYTVGDPARGNDRYRKLIPAENNRVLDDRFTTIADRLSQAGYRCASVGKWHLG------- 148 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--YNSPSLFKNRERVPAKG 252 P ++GF + + G+ Y SP Sbjct: 149 --------------------QSPLSQGFQVNIAGNQTGSPRGGYFSPYQNPQLSDGEQGE 188 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD--QYQKQFNTGSQTADN 310 +++D+LT A + + PF LYL + A H P D +Q + Sbjct: 189 FLTDRLTTAACQFIKDNQGS--PFFLYLTHYAVHTPLQAKKEDIAYFQSKPAGKLHQHAT 246 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPG 370 Y A + S+DQ + R+L+ L++ NTI++FTSDNG GP +G K Y G Sbjct: 247 YAAMIRSMDQSIGRVLQTLREQQLDQNTIVVFTSDNGGY--GPATSMLPLRGSKGMLYEG 304 Query: 371 GTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P+ + W G QPG+ + + +D YPT L+ +I + + LDG SL+P L+D + Sbjct: 305 GIRVPLLIKWPGVTQPGSTTGEAVINVDLYPTFLEMTNIPVLESELLDGESLVPLLKDPQ 364 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 ++L W + + +R D Sbjct: 365 TRLESRSLFWHFPAYLQKYQGMQQR----------------------FRTTPVSVIRQGD 402 Query: 490 YSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQE 547 + L+ E+ LY D+ + L+ ++P+ +E+ + + + + E+N E Sbjct: 403 WKLLEFFEDGHQELYNTRLDIGESKELSGSHPEKTQELSQALHRWQKQVKAAIPAELNPE 462 Query: 548 K 548 Sbjct: 463 Y 463 >UniRef50_Q7UHJ9 Iduronate-sulfatase or arylsulfatase A n=4 Tax=Bacteria RepID=Q7UHJ9_RHOBA Length = 1012 Score = 466 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 138/553 (24%), Positives = 212/553 (38%), Gaps = 130/553 (23%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 + + S + + KPN IV+ DD GYG L Sbjct: 551 SSPTASVSPAGREKTAETTKPNFIVILTDDQGYGDLSCFG-------------------- 590 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--------FG 149 A TP + + EG R T+ YVA V PSRA +MTG P R FG Sbjct: 591 ------AKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRAGLMTGCYPKRIDMAMGSNFG 644 Query: 150 VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 V D + G+ E + E+ + GY T GKWHL Sbjct: 645 VLLAGDPK-GLHPDEITIAEVLKTAGYRTGMFGKWHLGDQ-------------------- 683 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSP----------SLFKNRERV---PAKGYISD 256 E+ P +GFD F G + + P L +N + P +++ Sbjct: 684 ---PEFLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDADFLTK 740 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD------------QYQKQFNTG 304 +LT++A+ ++R K DQPF LYL + PH P P + + Sbjct: 741 RLTEQAVSFIERNK--DQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDY 798 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK 364 + A+ + ++ +D V +IL+ L+ NG + T++LFTSDNG + G +G+K Sbjct: 799 ATRANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGELRGHK 858 Query: 365 SQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 T+ GG P + W G++ G D+L++AMD PT A +IP D +DG + P Sbjct: 859 GTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIWP 918 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L+ + Q PH + Sbjct: 919 TLKGETQ-TPHDAFFYHRGNQL-------------------------------------A 940 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS----Q 538 VR+ + L + LY L DL +K N+ NP+VVK++Q +++F + Sbjct: 941 AVRSGKWKLHVNNGVAK-QLYDLENDLGEKVNVIETNPEVVKKLQHQLKDFAADIASNSR 999 Query: 539 PPLSEVNQEKFNN 551 P N + +N Sbjct: 1000 PAAFNANPKSLSN 1012 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 121/567 (21%), Positives = 200/567 (35%), Gaps = 115/567 (20%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + +K + T + + PN++++ +DDLGYG L Sbjct: 12 INSSAMKLYAVALMMLLGCGTSVAAERPPNVVLIFVDDLGYGDLGCYG------------ 59 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF- 148 A + STP + L EG RFT+ + A V PSR ++TG+ P R Sbjct: 60 --------------ATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYGLLTGQYPVRAM 105 Query: 149 ---GVYSNTDAQDGI--PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 G++ G+ + ++F+N GY TA +GKWHL + Sbjct: 106 GGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKEEPCDWQVPLRPG- 164 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA-------------YYNSPSLFKNRERVPA 250 PQ+ GFD++ G + Y S L + V Sbjct: 165 -------------PQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSP 211 Query: 251 -----------------------KGYISD----QLTDEAIGVVDRAKTLDQPFMLYLAYN 283 + Y + LT+ A+ + ++PF LY A Sbjct: 212 TPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE--KKNEPFFLYFATP 269 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 H P G+ Y V+ +D V I++ L+ NG DNT++LFT Sbjct: 270 NIHHPFTPAP-------RFKGTSQCGLYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLFT 322 Query: 344 SDNGAVID--------GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLIS 394 SDNGA+++ NG G+K + GG P+ W GK++ G D+LIS Sbjct: 323 SDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAGTQSDQLIS 382 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 +D + T + +P + D +++LP L D L + Sbjct: 383 QVDLFATFSALTEQEMPSSEQKDSINMLPALLDDPNEPLRTELVLAPRQPRNLAIRKGKW 442 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYT------VRNNDYSLVYTVENNQLGLYKL-T 507 + + P + + ++ + N LY L Sbjct: 443 LYIGARGSGGFNGSKPQHHAWGGPAAVQFSGQKNSDIVNGRIK----KNAPPAQLYDLEN 498 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFI 534 D Q N+ +P+VV+EM+ ++ + Sbjct: 499 DRSQTTNVFREHPEVVEEMKAMLESYR 525 >UniRef50_A6DKB8 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKB8_9BACT Length = 465 Score = 466 bits (1201), Expect = e-130, Method: Composition-based stats. Identities = 137/530 (25%), Positives = 212/530 (40%), Gaps = 116/530 (21%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 T + F + +PN+IV+ DDLGY + F+ + Sbjct: 3 ATYIIFILISLNAICAS-RPNLIVIMADDLGYNDVGFNGCT------------------- 42 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---- 154 + TP + S+ GV+FTNGY ++ V GPSRA +TGR RFG N Sbjct: 43 -------EIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNPQWNL 95 Query: 155 -DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 D +P +E + E GY+ +GKWHL + Sbjct: 96 TDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLG-----------------------AEP 132 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLF------------------KNRERVPAKGYIS 255 +P RGFD F G G + + +N V Y++ Sbjct: 133 SLRPNKRGFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLT 192 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT-ADNYYAS 314 ++ +DEA+ + R +PF L+L+YNAPHLP ++Y +F Y A Sbjct: 193 EEFSDEAVSFIKRNHQ--KPFFLFLSYNAPHLPLQAT--EKYLARFPHIKDPKRKTYAAM 248 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 V +VD GV ++++ LK+ DNTI+ F SDNG N KG KS + GG Sbjct: 249 VSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPSHKNKSDNFPLKGQKSDVWEGGFRV 308 Query: 375 PMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P M + +Q D +S++D + T A D LDGV+L+P++ +K P Sbjct: 309 PFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHADKPLDGVNLIPFITGEKTQAP 368 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 H + Q Y VR D+ LV Sbjct: 369 HAQIFIR------------------------------------KFDQSRYVVRQGDFKLV 392 Query: 494 YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 ++ LY L+ D+ +++N+AA +P+ VKE++ V +++ P+ Sbjct: 393 IPYKDAPPQLYNLSKDIGEENNIAAVHPERVKELEKVRKQWDSELMDPIF 442 >UniRef50_A6CAW6 N-acetylgalactosamine-4-sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAW6_9PLAN Length = 472 Score = 466 bits (1200), Expect = e-130, Method: Composition-based stats. Identities = 144/564 (25%), Positives = 206/564 (36%), Gaps = 146/564 (25%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +KL + F S +PNIIVL DDLGYG+L Sbjct: 1 MKLLSVLALFCSLTFFLNSLSAAEQPNIIVLLADDLGYGELGCQG--------------- 45 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 Q TP + SL G+RFT YV PSRA ++TGR P RFG Sbjct: 46 -----------NPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEF 94 Query: 153 NT------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 N D+ G+P E + E + GY T +GKWHL ++ Sbjct: 95 NPIGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGGTAD--------------- 139 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS-------------------------- 240 + P GFD F GF G + P Sbjct: 140 --------YHPFRHGFDEFFGFMHEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLI 191 Query: 241 -----------------LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 + + + V Y++D T EA+ ++R + D+PF LYLAYN Sbjct: 192 YSTHMGYDEPDYDANNPIIRGGQPVNETEYLTDAFTREAVSFINRHQ--DKPFFLYLAYN 249 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 A H P D Q + A + S+DQ + +IL+Q++++G + T+I+F Sbjct: 250 AVHSPLQGKKKDI-QHFTQIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFL 308 Query: 344 SDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPTA 402 SDNG N +G K Y GG P M W G L P D +S++D +PT+ Sbjct: 309 SDNGGPTRELTSSNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSLDIFPTS 368 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFV 462 + A S+P++ LDG +LLP L +K P + W Sbjct: 369 VALAGASLPQN--LDGRNLLPLLLQQKTELPVADFFWR---------------------- 404 Query: 463 RHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY---TVENNQLGLYKL-TDLQQKDNLAAA 518 +R+ D+ +V T E LY L D + +LA Sbjct: 405 ----------------QGRKAALRSGDWKIVQMRGTREKPVWELYNLANDKSETIDLATE 448 Query: 519 NPQVVKEMQGVVREFIDSSQPPLS 542 + E+Q E +P L Sbjct: 449 QSEKRMELQTRWNELNAQMKPALF 472 >UniRef50_A6C4W7 Twin-arginine translocation pathway signal n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W7_9PLAN Length = 459 Score = 466 bits (1199), Expect = e-129, Method: Composition-based stats. Identities = 130/548 (23%), Positives = 210/548 (38%), Gaps = 125/548 (22%) Query: 29 AADDVKLKATKTNVAFSDF---TPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 A D ++ +A S + + PNI+++ DDLGYG L Sbjct: 3 APDLMRSVLFALFIAVSCLLIRFSAAEAAQQPPNIVLIMADDLGYGDLACYG-------- 54 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 Q TP + L ++FT+ + A + P+RAA++TG+ Sbjct: 55 ------------------NKQVKTPHIDRLAASALKFTDFHSAGAMCTPTRAAMLTGQYQ 96 Query: 146 ARFG------VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 RFG + ++ G+P + EL + GY TA GKWHL Sbjct: 97 QRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWHLG------------ 144 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGY 253 W P N+GFD F G + ++ + N E KGY Sbjct: 145 -----------YQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNEDWWHNNEISMEKGY 193 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--------DQYQKQFNTGS 305 +D L+ ++ ++ +T +PF LY+ + A H P P D + ++ Sbjct: 194 TADLLSKYSVAFMEANRT--RPFFLYVPHLAIHFPWQGPQDPPHRKAGQDYHAGKWGIIP 251 Query: 306 QT---ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID-----GPLPLN 357 + + A + S+DQ V +IL LK+ NT+++FTSDNG + + N Sbjct: 252 DPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGGYLTYGKNFQNISSN 311 Query: 358 GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLD 417 G +G K+ Y GG P + W G + G D+ ++D PT AA IS + + D Sbjct: 312 GPLRGQKATLYEGGHRVPCLISWPGVITAGVTDQTAHSVDLLPTLAQAAGISA-TNFQTD 370 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G+ L P Q + ++L W Sbjct: 371 GLDLAPLWQTG-RPLADRDLFWRMGN---------------------------------- 395 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 + VR + L ++NN+ LY L TDL ++ N AA +P++VK M ++E+ Sbjct: 396 ----NRAVRRGQWKL--CLKNNRSELYHLETDLGEQQNRAAEHPEIVKSMSQALKEWEAD 449 Query: 537 SQPPLSEV 544 + Sbjct: 450 VDTSAKQF 457 >UniRef50_B4CYA9 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYA9_9BACT Length = 490 Score = 466 bits (1199), Expect = e-129, Method: Composition-based stats. Identities = 144/556 (25%), Positives = 211/556 (37%), Gaps = 116/556 (20%) Query: 15 SLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLP 74 ++ + + A V + ++ +D TPT +PNIIV+ DD GY Sbjct: 1 MTLIDPFLMSLLRKAFTSVAALSLASSSVRADDTPT-----KRPNIIVIVSDDQGYADAS 55 Query: 75 FDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGP 134 F TP L +L GVR T GYV V P Sbjct: 56 FQGS--------------------------KDILTPNLDALAKSGVRCTRGYVTAPVCSP 89 Query: 135 SRAAIMTGRAPARFGVYSNTDAQD-----GIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 SRA +MTGR RFG ++N A+ +P ET LP++ GYYTA VGKWHL Sbjct: 90 SRAGLMTGRYQERFGHHNNIVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGLQ 149 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS--------- 240 +P RGFD F G G Y+ + Sbjct: 150 DG-----------------------CRPYERGFDEFFGIITGGHDYFVNHPEERAVGDQS 186 Query: 241 ----LFKNRERVPA-KGYISDQLTDEAIGVVDRAKT--LDQPFMLYLAYNAPHLPNDNPA 293 + +N A GY++D +A+ ++ + T DQP LYLA+NAPH P P Sbjct: 187 YKARIERNGPVGEAVPGYLTDAFGADAVRIIRESHTKRPDQPLFLYLAFNAPHTPTQAPK 246 Query: 294 PDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 S+ Y A + S+D V ++ LK+NG +T I+F SDNG + P Sbjct: 247 DLVDTMPATLESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFIVFFSDNGGA-NHP 305 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPK 412 N + +K Y GG P F + G + G+ + ++++D + TA A Sbjct: 306 YYDNTPLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSLDVFATACALAGTKPET 365 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHN 472 LD V +LP L+ + H L W Sbjct: 366 SHPLDSVDMLPVLEGNARQPTHATLFWEFP------------------------------ 395 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR 531 F V + D LV + L+ L D+ +K +LAA NP+ V + ++ Sbjct: 396 -------GFGAAVADRDLKLVV-PKKGSPQLFDLAVDIGEKSDLAAQNPEKVARLSTLLS 447 Query: 532 EFIDSSQPPLSEVNQE 547 E+ + PL + Sbjct: 448 EWHAQNARPLWGPGSQ 463 >UniRef50_A6C284 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6C284_9PLAN Length = 605 Score = 465 bits (1198), Expect = e-129, Method: Composition-based stats. Identities = 133/536 (24%), Positives = 197/536 (36%), Gaps = 120/536 (22%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 T PNI++ DD G+G L + Sbjct: 31 SQTRPATQATTHPNIVIFLADDQGWGDLSHNG--------------------------NT 64 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET 165 TP + SL EGV+F YV V P+RAA +TGR AR G + Q+ E Sbjct: 65 NLHTPNVDSLAKEGVKFNRFYVGA-VCAPTRAAFLTGRYHARTGTIGVSTGQERFNSDEY 123 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 + + F+ GY T A GKWH P +GFD + Sbjct: 124 TIAQAFKAAGYATGAFGKWHNGTQY-----------------------PNHPNAKGFDEY 160 Query: 226 MGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 GF + +Y SP L N V GYI+D LTD+A+ +++ +PF YL Y P Sbjct: 161 YGFTSGHWGHYFSPMLDHNGTFVKGNGYITDDLTDKAMAFIEQQVQNHKPFFAYLPYCTP 220 Query: 286 HLPNDNPAPDQYQKQFNTG-------------SQTADNYYASVYSVDQGVKRILEQLKKN 332 H P PDQY +F A +VD V R+L++L Sbjct: 221 HSPMQ--VPDQYWDRFKDKQLKLHNREPDREQPDHLRAALAMCENVDWNVGRVLKKLNSL 278 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDK 391 D+TI+++ SDNG + NG KG K GG +P + W G L G ++ Sbjct: 279 RITDDTIVIYFSDNGP---NGVRWNGDMKGKKGSLDEGGVRSPFVIRWPGHLPAGQEVNQ 335 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + A+D PT D A I P+ +DGVSL P + + K P + + Sbjct: 336 IAGAIDLLPTLTDLAGIKRPEPKPIDGVSLKPLMLNSKADWPERMIF------------- 382 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 + +VR + Y L + LY + D Sbjct: 383 -------------------------SSLRNRVSVRTDQYRLSR-----KGELYDMHADPG 412 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKF-------NNIKKALSEA 559 Q++N+A P++ ++Q V ++ S P + F + + +A Sbjct: 413 QRNNIAKQKPEITAKLQQAVTDWRQSVWPNGYPEDTRPFLIGYGGARSTQLPARDA 468 >UniRef50_Q3M597 Twin-arginine translocation pathway signal n=2 Tax=Nostocaceae RepID=Q3M597_ANAVT Length = 457 Score = 465 bits (1197), Expect = e-129, Method: Composition-based stats. Identities = 134/528 (25%), Positives = 201/528 (38%), Gaps = 111/528 (21%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + T A ++ + +PN++ + +DD+G+G L + Sbjct: 17 MTAAGTLMATASANLFSRATAQSSRPNVVFILVDDMGWGDLSIYGRT------------- 63 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--FGV 150 TP L L +GVRFTN Y V P+R A +TGR AR G+ Sbjct: 64 -------------DYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGL 110 Query: 151 YSNTDAQD-------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 A+ GIP + + L + +GY TA VGKWH Sbjct: 111 REPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAG---------------- 154 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP------SLFKNRERVPAKGYISDQ 257 + P +GFD + G + G Y+ L++N V GY++D Sbjct: 155 -------YPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTDRILDLYENDVPVQRSGYVTDL 207 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ----YQKQFNTGSQTADNYYA 313 TD A+ + R + +PF L L YNAPH P P Y T + Y A Sbjct: 208 FTDRAVEFIQRPHS--RPFYLSLHYNAPHWPWQGPNDQASTAFYLTNGYTVGGSQATYAA 265 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTH 373 V S+D GV R+L+ L+ +GQ DNT+++FTSDNG G +G K+ Y GG Sbjct: 266 MVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERFSNF---GPFRGQKASLYEGGIR 322 Query: 374 TPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P + + G Q +++I D T L A S + DG +LLP L+ + E Sbjct: 323 VPAIIRYPGVTQANQVSNQVIITFDLTATILAATGTSFHPNYPPDGQNLLPLLRGD-RSE 381 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 + L W + L+ VR+ D+ Sbjct: 382 FSRTLFWRYGAA---------------------------------LTTRQRAVRSGDWKY 408 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 NQ L+ L TD + +L +N QV ++ + + P Sbjct: 409 WR--RGNQEALFNLATDPGETTDLKDSNAQVFTRLRNQFQHWELQMLP 454 >UniRef50_A6C861 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Bacteria RepID=A6C861_9PLAN Length = 498 Score = 465 bits (1196), Expect = e-129, Method: Composition-based stats. Identities = 136/569 (23%), Positives = 209/569 (36%), Gaps = 133/569 (23%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKP------NIIVLTMDDLGYGQLPFDKGSFDPK 83 + +AFS S KP N + + +DDLGY + + Sbjct: 1 MQLMSRCTLMLMLAFSVLADRSLSAAEKPKQNKPLNFVFILVDDLGYMDVGCNN------ 54 Query: 84 TMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR 143 TP + L G+RFTNGY A+ V P+R +IMTG+ Sbjct: 55 -------------------PQTFYETPHINQLAKTGMRFTNGYAANPVCSPTRYSIMTGK 95 Query: 144 APARFGVYSNTDAQDG-----------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNV 192 P R + + +PL+ET + E + HGY T GKWHL Sbjct: 96 YPTRVDATNFFSGKRAGKFLPAPLNDKMPLSETTIAEALKEHGYSTFFAGKWHLGPTQ-- 153 Query: 193 PVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY----YNSPSLFKNRERV 248 E+ P+ +GFD G G Y Y SP Sbjct: 154 ---------------------EFWPEKQGFDINRGGWHRGGPYGGGKYFSPYGNPRLTDG 192 Query: 249 PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGS- 305 ++ D+L E +D + D+PF YLA+ + H P P P +Y+++ Sbjct: 193 LKGEHLPDRLASETAQFIDAHR--DEPFFAYLAFYSVHTPLMGPGPLVTKYKEKAKRLGL 250 Query: 306 ------------------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 Q Y A V S+D+ V ++L+QL+++G +NT+++ Sbjct: 251 TGKEEFADEEQVFPVDEKRRVRILQNHAVYAAMVESMDKAVGKVLQQLEESGVAENTVVM 310 Query: 342 FTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDF 398 T+DNG +G N +G K Y GG + W G +PG+ D+ + DF Sbjct: 311 LTADNGGLSTSEGSPTSNLPLRGGKGWLYEGGIREVFLIRWPGGTEPGSVCDEPVITTDF 370 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 YPT LD A + + LDGVSL P+LQ + L W + Sbjct: 371 YPTILDLAGLPLKPQQHLDGVSLKPFLQGEA-PFKRDALYWHYPHYSNQGGI-------- 421 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAA 517 +R D+ L+ E+ Q+ LY L DL +K +LA Sbjct: 422 ----------------------PGGAIRVGDWKLIERFEDGQVHLYHLKEDLGEKQDLAE 459 Query: 518 ANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 P+ V M+ + ++ + + Sbjct: 460 KYPERVAAMRKQLHKWYQETDAKFLQAKP 488 >UniRef50_A6DGD3 Putative exported uslfatase n=3 Tax=Bacteria RepID=A6DGD3_9BACT Length = 713 Score = 464 bits (1195), Expect = e-129, Method: Composition-based stats. Identities = 125/564 (22%), Positives = 198/564 (35%), Gaps = 125/564 (22%) Query: 44 FSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIE 103 F + ++ +P+II+ +DDLG+ + F Sbjct: 226 FIAQELPKKASSKRPHIILFLIDDLGWNDIACYGSQF----------------------- 262 Query: 104 AAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA------- 156 TP L + EG RFT+ Y A+ V P+RA+I+ G+ P+R G+ +++ + Sbjct: 263 ---YETPHLDKMAKEGFRFTDAYAANPVCSPTRASILLGKYPSRVGLSNHSGSSGPKGPG 319 Query: 157 --------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 + +PL + L E + GY TA +GKWHL Sbjct: 320 HKLTPVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHL-------------------QAH 360 Query: 209 TFSAEEWQPQNRGFDYFM-GFHAAGTAYYNSPSL--------FKNRERVPAKGYISDQLT 259 ++ P+ GFD + G + P + Y++D+LT Sbjct: 361 HDTSRNHFPEKHGFDLNIAGHRMGQPGSFYFPYKSKQHPSTNVPDMADGQEGDYLTDKLT 420 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQ----------------- 300 D+AI + K D PF L Y H P +Y+ + Sbjct: 421 DKAIHYIKENK--DTPFFLNFWYYTVHTPIIPRQDLKKKYEAKANELGINKNQPGIPVLK 478 Query: 301 -FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL----P 355 F SQ +Y A V ++D+ + RI + LK+ D TII+F SDNG + Sbjct: 479 SFARSSQNNPSYAAMVEAMDENIGRIFKTLKELQIDDETIIIFCSDNGGLSTSTGPNCPT 538 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLK 415 K K+ Y GG P + W GK + D YPT LD + + Sbjct: 539 SQLPLKAGKAWVYEGGIRIPFIIKWPGKKGGKELQAPVCTTDIYPTLLDMLKLPAKPEQH 598 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 LDGVSL + + + + L + H Sbjct: 599 LDGVSLTSLMNGQAKELQREALFIHYPHYHHI---------------------------- 630 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFI 534 + + VR DY LV E + LY L D+ + +NL P+ +M + ++ Sbjct: 631 -NSMGPAGAVRMGDYKLVEYYETGEFELYNLKEDIGEMNNLVKEQPERAAQMLKKLEQWR 689 Query: 535 DSSQPPLSEVNQEKFNNIKKALSE 558 S P E N + + Sbjct: 690 QQSNSPKPERNPHYDPQKDYRIKK 713 >UniRef50_A6CBM1 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6CBM1_9PLAN Length = 497 Score = 463 bits (1191), Expect = e-128, Method: Composition-based stats. Identities = 124/523 (23%), Positives = 209/523 (39%), Gaps = 69/523 (13%) Query: 47 FTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQ 106 E KPNI+++ DDLGYG L Sbjct: 22 LQAVEKQQAAKPNIVIILCDDLGYGDLACYG--------------------------HPV 55 Query: 107 KSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT--DAQDGIPLTE 164 TP L L EG+R T+ Y + V PSRA ++TGR P R GVY + E Sbjct: 56 IKTPHLDQLASEGMRLTDCYASAPVCSPSRAGLLTGRTPNRLGVYDWIPEGHPMHLKRDE 115 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + +L Q GY TA VGKWH + + N S E+ QP + GF + Sbjct: 116 VTVAQLLQQAGYDTAHVGKWHCNGMFN-------------------SKEQPQPGDHGFRH 156 Query: 225 FMGFHAAGTAYYNSPS-LFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAY 282 + + +P+ +N + + +G+ + DE I + + ++PF L++ + Sbjct: 157 WFSTQNNALPTHENPNNFVRNGKPLGEIEGFSCQIVADEGIRWLSDWREKEKPFFLHVCF 216 Query: 283 NAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILF 342 + PH +P + + Y+A+V ++D+ V ++L +L + DNT++ F Sbjct: 217 HEPHERVASPPALVETYLDKSLYEDQAQYFANVANMDRAVGKLLIKLDELKVADNTLVFF 276 Query: 343 TSDNGAVI--------DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLI 393 TSDNG G +G K Y GG P + W GK++ G + Sbjct: 277 TSDNGPETLNRYGKGSRRSWGSPGVLRGMKLHIYEGGIRVPGIVRWPGKIKAGQEIATPV 336 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 ++D PT + A +++P LDG SLLP K E L W +Y + + Sbjct: 337 CSVDLLPTFCEIAGVAVPDQRPLDGASLLPLFAGNKI-ERTTPLFW--NYYRAYSTPRVA 393 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQK 512 + K V H S P +++ S + + + + LY L D+ ++ Sbjct: 394 MREGDWKVVAHWSGPEGIIPLGGNVNSVSQEI-------IKNAKLTKFELYNLKDDISEQ 446 Query: 513 DNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 NLA + + ++ + + + Q + +++ +K Sbjct: 447 HNLAWQEQKRLDTLKKKLVQKYAAVQKEGPVWDTSEYDQSRKK 489 >UniRef50_A6C4Q6 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q6_9PLAN Length = 574 Score = 463 bits (1191), Expect = e-128, Method: Composition-based stats. Identities = 124/518 (23%), Positives = 201/518 (38%), Gaps = 81/518 (15%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 V++S + +PN+IV+ DD GYG + F Sbjct: 11 WFAGFLLLVSYSFGCEGTLCAESRPNVIVILTDDQGYGDVGFRGNL-------------- 56 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 + +TP L + ++ + T Y V P+RA+++TGR R GV Sbjct: 57 ------------KINTPHLDRMAEKSIELTRFYC-SPVCAPTRASLLTGRNYYRTGVIHT 103 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + + E + EL Q GY T GKWHL Sbjct: 104 SRGGAKMQGEEVTVAELLQQAGYQTGIFGKWHLGDNY----------------------- 140 Query: 214 EWQPQNRGFDYFMGFHAAGTAY-------YNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 +PQ++GF + + G Y P L+KN + GY +D D A+ + Sbjct: 141 PMRPQDQGFAESLIHKSGGIGQSPDQPNSYFHPKLWKNGVAFQSTGYCTDVFFDAALDFI 200 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRIL 326 DR ++PF +YLA NAPH P + Q +T Y + ++D+ + ++L Sbjct: 201 DRQTKTEKPFFVYLATNAPHTPLEIAESYWKPYQRQGLDETTARVYGMITNLDENIGKLL 260 Query: 327 EQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 L+++ + T++LF DNG G +G KS TY GG P W G + Sbjct: 261 SHLERSALAEKTVVLFLGDNGPQQK---RYTGGLRGRKSWTYEGGIRVPCLAQWPGHFRE 317 Query: 387 G-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 G D++ + +D PT L + P+ LKLDGV L P L +K+ P ++L + Sbjct: 318 GEKIDQIAAHIDLMPTLLALTETRCPESLKLDGVDLSPLLTGRKEKLPARSLFFQVHRGL 377 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK 505 + + R + YP TE+L + V L Y Sbjct: 378 TPQR----YQNYAVVTERFKLAGYPGTFGTENLLLQAEPV---------------LEFYD 418 Query: 506 L-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 L TD ++ N+ ++P+ VK + ++ + + Sbjct: 419 LSTDPGEQKNVLHSHPETVKALLKQYEDWFSEMKATRN 456 >UniRef50_A6C1V3 Putative secreted sulfatase ydeN n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C1V3_9PLAN Length = 470 Score = 463 bits (1191), Expect = e-128, Method: Composition-based stats. Identities = 126/540 (23%), Positives = 205/540 (37%), Gaps = 97/540 (17%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 + +A S T ++ KP N++ +DDLG+ L F Sbjct: 5 IPSLFSQAVILLCFLSSITQPTHAADEKPWNVVFFLVDDLGWTDLGCYGSDF-------- 56 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 +P + L EG++FT Y A P+R A++TG PAR Sbjct: 57 ------------------YQSPNIDQLAAEGMKFTQNYSACNACSPTRGALLTGMYPART 98 Query: 149 GVYSNTDA---------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVP 193 + + + T LPE + GY T VGKWHL N Sbjct: 99 HLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEALRTAGYQTFHVGKWHLGGRGN-- 156 Query: 194 VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGY 253 +P+D NRG F G A SL E Y Sbjct: 157 LPQDHGFDVNISG-----------TNRGLPRSYHFPYGGDAMKWDSSL---TEAERQDRY 202 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNY 311 ++D++ DEA+ ++ + + D+PF LY ++ + H P +Y+ Y Sbjct: 203 LTDRMADEAVALIRQQQ--DKPFFLYCSFYSVHSPIQGRPDLVKKYKGLPAGKRHKNPEY 260 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGG 371 A + SVD+ + R+ QLK++G D T+I+FTSDNG V N +G K Q + GG Sbjct: 261 AAMIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNGGVRRK-TSNNDPLRGEKGQHWEGG 319 Query: 372 THTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADIS--IPKDLKLDGVSLLPWLQDK 428 T P + W G G+ + I MDFYPT L+ ++ + +DG+SL+P L+D Sbjct: 320 TRVPAIVLWPGVTPAGSVCAEPIITMDFYPTILNITGVAGNTEHNQSVDGLSLVPLLKDP 379 Query: 429 KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNN 488 + L W + + F +R Sbjct: 380 AATLNREALYWHYPHYNVFIGV------------------------------PYSAIRVG 409 Query: 489 DYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 +Y L++ E+ LY L DL + +++ P++ ++ +++ + + N + Sbjct: 410 EYKLIHYYEDGNDELYNLAEDLSETSDVSKTYPELTARLERRLQQHLKQVGAQMPVSNPQ 469 >UniRef50_C6Y214 Sulfatase n=3 Tax=Sphingobacteriaceae RepID=C6Y214_PEDHD Length = 472 Score = 462 bits (1190), Expect = e-128, Method: Composition-based stats. Identities = 138/534 (25%), Positives = 210/534 (39%), Gaps = 114/534 (21%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 +K +T ++ + + T KPN+IV+ DD GY G Sbjct: 1 MKGIKTISTLLLALWTGISAAQVKTAAKPNVIVIVSDDAGYVDFGCYGG----------- 49 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 Q TP + ++ +G RFT+ YV+ V PSRA I+TGR RFG Sbjct: 50 ---------------KQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFG 94 Query: 150 VYSNTDA---------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 NT G+ +E + Q +GY T A+GKWH Sbjct: 95 FEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQGD------------ 142 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY-------NSPSLFKNRERVPAKG- 252 + P NRGF+ F GF ++ N +L+ N+E VP Sbjct: 143 -----------EPKHFPLNRGFNEFYGFTGGHRDFFAYKGKRTNEHALYNNKEIVPENEI 191 Query: 253 -YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 Y++D TD+A + K D+PF +YL+YNA H P + D ++ + Y Sbjct: 192 TYLTDMFTDKATSFITANK--DKPFFMYLSYNAVHTPMNAKK-DLMERYASIADTGRRAY 248 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGG 371 A + S+D G+ +++ LK N NT+I+F +DNG NG +G K + GG Sbjct: 249 AAMMTSLDDGIGKVMATLKANQLDKNTLIIFINDNGGATV-NSSDNGPLRGMKGSKWEGG 307 Query: 372 THTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 M M W G + D +S++D PTA+ A KLDGV+LLP+L + Sbjct: 308 IRVAMMMKWPGHIAANKTDSRPVSSLDILPTAIGAGKGKQKGTKKLDGVNLLPYLSAGNK 367 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 PH+ L W + +R ++ Sbjct: 368 KTPHEALYWRRGV--------------------------------------AAAMREGNW 389 Query: 491 SLVYTVENNQLG---LYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 L+ E+ + L+ L+ DL + NL+ P VKE+ + E+ P Sbjct: 390 KLIRVKESPTVQNVLLFDLSKDLSETKNLSEKYPAKVKELLVKLAEWEKGLDQP 443 >UniRef50_B4D464 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D464_9BACT Length = 474 Score = 462 bits (1190), Expect = e-128, Method: Composition-based stats. Identities = 141/561 (25%), Positives = 207/561 (36%), Gaps = 147/561 (26%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 A+ F + +PNI+ + DDLGYG+ G Sbjct: 7 ASVILTLFLFCAQLAIAAPKRPNILFIVADDLGYGEPGCYGG------------------ 48 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-- 154 TP + L+ GVRF++GYV+ SRAA+MTGR RFG N Sbjct: 49 --------KDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNPIG 100 Query: 155 ----DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 D G+P+ E + + ++ GY T VGKWHL + Sbjct: 101 AKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLGGTA-------------------- 140 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP------------------------------- 239 + PQ RGFD F GF G Y P Sbjct: 141 ---PFHPQRRGFDEFFGFLHEGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWS 197 Query: 240 --------------SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAP 285 L +N + V K ++D T EA +DR + QP+ LYLAYNA Sbjct: 198 TDLHENEPAYDADNPLLRNSQPVEEKANLTDAFTREACSFIDRHQA--QPWFLYLAYNAV 255 Query: 286 HLPNDNPAPDQYQKQFNTGSQ-TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 H P D Y ++F+ + A + +D+ + ++ QL+ +G +NT+++F S Sbjct: 256 HSPLQG--EDTYMEKFSHIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVFLS 313 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTAL 403 DNG N +G K + GG P + WKG++ G D +MD TAL Sbjct: 314 DNGGPTKELTSSNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISMDLTATAL 373 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 A + KLDGV LLP L K PH L W + Sbjct: 374 KLAGAET-EQAKLDGVDLLPLLTGKTTAAPHDTLFWRVGRKN------------------ 414 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQV 522 +R+ D+ L+ + + LY L D+ + +N+AA N Sbjct: 415 --------------------ALRHGDWKLLR-QGSKEWQLYDLAHDVGETNNMAAQNAAR 453 Query: 523 VKEMQGVVREFIDSSQPPLSE 543 V E+ + ++ PL + Sbjct: 454 VTELSALWDKWNSEQIDPLWK 474 >UniRef50_B5JJG5 Sulfatase, putative n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JJG5_9BACT Length = 462 Score = 460 bits (1185), Expect = e-128, Method: Composition-based stats. Identities = 127/536 (23%), Positives = 199/536 (37%), Gaps = 120/536 (22%) Query: 43 AFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 S + PNI+ + DDLGY L Sbjct: 20 TLLTPPSAASSAEKPPNIVFIFADDLGYNDLSSYG------------------------- 54 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDAQDGI 160 A +TP + SL ++G+RFT+ Y A V PSRAA++TGR P R G+ + DGI Sbjct: 55 -ATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRAALLTGRYPIRQGITGVFWPQSFDGI 113 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 ET + EL Q +GY T VGKWHL P Sbjct: 114 DPAETTIAELLQENGYRTGLVGKWHLGHHQK-----------------------HLPLQN 150 Query: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVP-AKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 GF + G + + E + Y + + T+EA+ +++ K DQPF LY Sbjct: 151 GFHSYFGIPYSNDMDMVVYMRGNDVESYEVDQHYTTRRYTEEAVQFIEQNK--DQPFFLY 208 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 LA++ PH+P Y + G+ Y + +D V +IL+ L K+ +NT+ Sbjct: 209 LAHSMPHVPI-------YASENFVGTSKRGLYGDVIQELDWSVAQILDTLDKHQLSENTL 261 Query: 340 ILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAM 396 ++FTSDNG + K T+ GG P + W ++ G + + M Sbjct: 262 VVFTSDNGPWTALKHLGGSAAPLREGKMFTFDGGMRVPCLVRWPAQIPAGQTSHAMANMM 321 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D++PT A++ PK +DG+ + L ++ + Sbjct: 322 DWFPTFSRIANLDTPKSRSIDGLDITDVLTGSGPRADNEFFFFHGDGDL----------- 370 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN------------QLGLY 504 R+ D+ L E N + L+ Sbjct: 371 --------------------------RAYRDGDWKLKLPYEGNQAARWRQAVAAHPILLF 404 Query: 505 KL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSEA 559 L D + +LAA +P+ + MQ + +F+ S L E+ EK +K E+ Sbjct: 405 NLAEDPGETTDLAAQHPERLAAMQARMTDFLAS----LGELPPEKI--TRKPGDES 454 >UniRef50_Q7UGB8 Arylsulfatase homolog b1498 n=1 Tax=Rhodopirellula baltica RepID=Q7UGB8_RHOBA Length = 656 Score = 460 bits (1183), Expect = e-128, Method: Composition-based stats. Identities = 127/536 (23%), Positives = 216/536 (40%), Gaps = 90/536 (16%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 A + T + +PN++++ DD G+G L Sbjct: 73 CAKKICRTVVMVLFVIGAGTSIQAEASDRPNVLLILTDDQGWGDLAAH------------ 120 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 + STPTL +L +E R YV V P+RAA++TGR P R Sbjct: 121 --------------RNPKISTPTLDALANESARLDRFYV-SPVCAPTRAALLTGRYPERS 165 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 GV T ++ + ET L EL+++ GY T GKWH Sbjct: 166 GVAGVTGRREVMRAEETTLAELYRSAGYATGCFGKWHNGAQM------------------ 207 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDR 268 P +GF+ F GF Y+ L +N V KGYI+D LTD A+ + Sbjct: 208 -----PLHPNGQGFNEFFGFCGGHFNLYDDALLERNGTPVQTKGYITDVLTDAAVEFIQN 262 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 D+PF Y+ +NAPH P + + + YA V ++D V R+L+ Sbjct: 263 HH--DRPFFCYVPFNAPHGPFQVRRDLFDRYNDGSIDEKTAAVYAMVQNIDTNVSRLLKC 320 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 L + + TI++F +DNG NG +G K + GG P F+ W G +QP + Sbjct: 321 LSDHSLDEETIVVFLTDNGP---NGKRFNGGMRGTKGSVHEGGCRVPCFIRWTGNIQPQS 377 Query: 389 YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 ++ + +D PT + DI +P + LDG SL+ ++D Sbjct: 378 ISQVAAHIDLLPTLMQWCDIPLPTKVPLDGRSLVELIRDGADPT---------------- 421 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-T 507 + +S + VR N + L T+E ++ L+ + T Sbjct: 422 -------------LADRSILTYRPNPMQLQKFGKAAVRTNTHRL--TIEKSKASLFDMTT 466 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK---ALSEAK 560 D Q ++A+++P++ K+++ +++++ P ++ + ++++ +AK Sbjct: 467 DAGQTTDIASSHPELTKQLRSQIQKYVQEITPSITAIRPVPIDSMRSVYLPAVDAK 522 >UniRef50_C3ZGR2 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3ZGR2_BRAFL Length = 598 Score = 459 bits (1181), Expect = e-127, Method: Composition-based stats. Identities = 120/543 (22%), Positives = 199/543 (36%), Gaps = 112/543 (20%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 + S+ GKPNI+ + DD G+ + + TP Sbjct: 115 QESSSGKPNIVFILADDYGWNDIGYHGSV---------------------------IRTP 147 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFL 167 L L EGV+ N YV + PSR +MTGR R+G+ G+PL E L Sbjct: 148 NLDRLAAEGVKLENYYV-QPLCSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTL 206 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 P+ + GY T VGKWHL F +++ P +RGFD F G Sbjct: 207 PQRLKEGGYSTHIVGKWHLG----------------------FYKQDYTPTHRGFDTFYG 244 Query: 228 FHAAGTAYYNS---------PSLF-------KNRERVPAKG-YISDQLTDEAIGVVDRAK 270 + Y+ P + +NR G Y + ++AI ++ + Sbjct: 245 YLTGAEDYWTHRQKGGLPGQPQTWSGLDLRDQNRPVTDQNGTYSTHLFANKAIEII-AQQ 303 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLK 330 ++P L+L++ A H P P D + + Y A +DQ V + LK Sbjct: 304 DKNKPMFLFLSFQAVHDPLQAPEEDI-SRYSHISDTNRRVYAAMTTIMDQAVGNVTRALK 362 Query: 331 KNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG-KLQPGNY 389 + G +DNT+++F++DNG +D +N +G+K + GG F+ K + Sbjct: 363 QYGLWDNTVLIFSTDNGGRVD-RGGINWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTS 421 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 D LI D++PT + A S LDG + + D K + L I H Sbjct: 422 DALIHISDWFPTLVGLASGSTNGTKPLDGHDVWEAISDGKPSPRREILHNIDPMFHTVPS 481 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY--------------- 494 W + + +R+ D+ L+ Sbjct: 482 PRPHQWGDRV-----------------FNTSVHAAIRSGDWKLLTGYPGNTSRVPPPSST 524 Query: 495 -----TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 L L+ + D +++ +L+ +P VV+E+ + + ++ P + Sbjct: 525 KEEPADTPGKHLWLFNIREDPEERTDLSQKHPGVVQELLEKLARYNRTAVPVFYPSFDPQ 584 Query: 549 FNN 551 N Sbjct: 585 ANP 587 >UniRef50_A6LEC5 Arylsulfatase A n=2 Tax=Parabacteroides RepID=A6LEC5_PARD8 Length = 483 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 120/513 (23%), Positives = 207/513 (40%), Gaps = 55/513 (10%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + + + + KPNII+L DDLGY + + P+ ++ Sbjct: 7 FVAVNSAALMLSTVSCDAKEEAVPKPNIIILLADDLGYNDVSCYRNENFPQQSDSFPTS- 65 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L L +G+RFTN Y VS PSRAA+MTGR R GVY+ Sbjct: 66 ---------------QTPNLDLLARQGIRFTNFYCGAAVSSPSRAALMTGRNCTRTGVYN 110 Query: 153 NT--DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 ++ + +E + E+ + Y T GKWHL ++ Sbjct: 111 YLEQNSPMHLRDSEVTIAEVLKQADYATGHFGKWHL---------------------SSG 149 Query: 211 SAEEWQPQNRGFDY-FMGFHAAGTAYYNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDR 268 ++ P ++GFDY F + + +++N + F+N E +GY D + EA+ +D+ Sbjct: 150 RPDQPYPNDQGFDYSFYALNNSVPSHHNPTNFFRNGEPQGEIEGYSCDIVVTEALQWLDK 209 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 K +PF L + +N PH P + AP++ +K+ + YY + ++D + +++ Sbjct: 210 NKQ--EPFFLNVWFNEPHFPME--APEELKKRHAINPE----YYGCIENMDIAIGKLMNY 261 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 LK+ DNTI++F SDNG+ D N +G K Y GG P + W + G Sbjct: 262 LKEQNLEDNTIVIFASDNGSQWD---YSNLPFRGEKHFNYEGGLRVPCIVRWHKHVPTGV 318 Query: 389 YDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 + D PT AD +P D +DG+ + P K + +N + Y H Sbjct: 319 ISEFNGCFTDILPTLASLADAPVPTDRVIDGMDISPVFLGKAETLERENPLFFFRYIHDP 378 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT 507 + + + + Y + V LY L Sbjct: 379 ICMIREGDWCLLGYDEPLPWAFSLDELALGKVKPWY-LTKEHMEFAKKVFPKYFELYNLR 437 Query: 508 -DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 D +++ ++A +P++V ++ + + Sbjct: 438 DDREERIDVADKHPEIVARLKSKMLKLKQEVVA 470 >UniRef50_A6C4Q9 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4Q9_9PLAN Length = 490 Score = 458 bits (1179), Expect = e-127, Method: Composition-based stats. Identities = 119/551 (21%), Positives = 200/551 (36%), Gaps = 114/551 (20%) Query: 26 AAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + ++ + + + +PNI+ + +DD+G+ F Sbjct: 3 LKSLHQSLLFAVCLLLISVTALHAEQKISADRPNIVFILIDDMGWPDPVSYGNQFHD--- 59 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TP + L +GVRFT+ Y A V P+RA+I G+ Sbjct: 60 -----------------------TPHIDQLASDGVRFTDFYAACPVCSPTRASIQAGQYQ 96 Query: 146 ARFGVYSNTDAQD-------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNV 192 AR + +PL EL Q+ Y TA GKWHL Sbjct: 97 ARLHLTDFIPGHWRPFEKLIVPENAPHLPLEIVTPGELLQSANYNTAYFGKWHLG----- 151 Query: 193 PVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKG 252 E P +G+ + A + R+P K Sbjct: 152 -------------------PESHNPDQQGYQTSLVTGGRHFAPRFRTTP---STRIPNKA 189 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT----- 307 Y++D LTD+ I + + K+ +PF + L++ A H+P + A Q +++ + Sbjct: 190 YLADFLTDKTIEFIRQNKS--KPFFVQLSHYAVHIPLE--AKQQMIRKYQQKPKPAYGIN 245 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-----LNGAQKG 362 Y A V VD V RI+ L++ +NT+++FTSDNG + N + Sbjct: 246 NPVYAAMVAHVDDSVGRIVAALEELKLTENTVVIFTSDNGGLRQSFSGGDIVSTNAPLRD 305 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 K Y GG P+ + W G G + ++DF+PT + A ++ + +DG+SL Sbjct: 306 EKGSLYEGGIRVPLIIKWPGVAAAGKTCAEPTISIDFWPTFAEIAHTTLQEHQTIDGLSL 365 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 LP L+D + + + + H Sbjct: 366 LPLLKDPSSHLNREEIYFHYPHYHHST--------------------------------P 393 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + +R D+ L+ + L LY L DL + NLAA NP+ E+Q + ++ + Sbjct: 394 ASAIRAGDWKLIEFFADGNLELYNLQQDLSETTNLAAKNPEKAVELQQKLADWRTRTGAA 453 Query: 541 LSEVNQEKFNN 551 L N + Sbjct: 454 LPVKNPKYDPA 464 >UniRef50_B5CXC7 Putative uncharacterized protein n=2 Tax=Bacteroides RepID=B5CXC7_9BACE Length = 509 Score = 458 bits (1178), Expect = e-127, Method: Composition-based stats. Identities = 130/582 (22%), Positives = 204/582 (35%), Gaps = 147/582 (25%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L V + S +PN++ + +DD G+ + ++ F Sbjct: 8 LLTLAGGVTLAANMLHAASDNRQPNVVFIMVDDYGWADVGYNGSRF-------------- 53 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + L EG+ FT+GY A +S PSR ++MTG+ PAR G+ Sbjct: 54 ------------YETPNIDRLASEGMIFTDGYAAASISSPSRVSLMTGKYPARTGITDWI 101 Query: 155 DA--------------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 +PL E + E F+ HGY T VGKWH Sbjct: 102 PGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHVGKWHC-------- 153 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH-----------AAGTAYYNSPSLFK 243 + PQ +GFD +G G Y SP Sbjct: 154 ---------------AEDSLYYPQYQGFDVNIGGWLKGSPNGIRRSQGGKGAYCSPYRNP 198 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--------- 294 P +++D+L DE+I ++ + + D+PF LYLA+ A H P + Sbjct: 199 YLPDGPEGEFLTDRLGDESIKLI-KNSSADKPFFLYLAFYAVHTPIEAKPEYVKYFKWKA 257 Query: 295 ---------------DQYQ---KQFNTGSQT----ADNYYASVYSVDQGVKRILEQLKKN 332 + Y+ + + Y A +YS+D+ V R+++ LK N Sbjct: 258 QRMGLDTIVPFTRNLEWYKNAEYKAGHWKERTIQSDAEYAALIYSMDENVGRVMQALKDN 317 Query: 333 GQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY- 389 G NTI+ SDNG +G N + K Y GG P + + ++ G+ Sbjct: 318 GLDKNTIVCLLSDNGGLSTAEGSPTCNAPLRAGKGWLYEGGIREPFIIKYPQMVEAGSVC 377 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + A+DFYPT LD A + + +DG SLLP L+ + + Y D Sbjct: 378 HTPVVAVDFYPTLLDMAGLPLKSHQHVDGKSLLPLLKGDQAYDRGPIFFHYPHYGGKGD- 436 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TD 508 + VR DY L+ E+ + LY L D Sbjct: 437 ------------------------------TPAGAVRMGDYKLIEFYEDGHVELYNLKND 466 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFN 550 + + +L+ EMQ ++ + + N Sbjct: 467 ISETRDLSKTEKDKAAEMQKMLHRWRTDCNAKMPTRNPHYVP 508 >UniRef50_D2R917 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R917_9PLAN Length = 486 Score = 458 bits (1178), Expect = e-127, Method: Composition-based stats. Identities = 128/537 (23%), Positives = 212/537 (39%), Gaps = 114/537 (21%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + KL+ + + + +PNI+ + DDLG+ + F+ + Sbjct: 1 MNLTKLELWAAVLLVAFTAVASQAADRQPNIVHIVADDLGWKDVGFNGCT---------- 50 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 + TP + +L G +F+ YV + + P+RA +MTGR P R+G Sbjct: 51 ----------------EIKTPNIDALAKGGAKFSQFYVQN-MCTPTRACLMTGRFPYRYG 93 Query: 150 VYS---NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 + + T A G+ +E +P+ + GY TA +GKWHL Sbjct: 94 LQTIVIPTAAGYGLDTSEYLMPQCLGDAGYKTAIIGKWHLGHAD---------------- 137 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-----SLFKNRERVPAKGYISDQLTDE 261 +++ P+ RGFDY G Y+ F++ + V +GY + + D+ Sbjct: 138 ------QKYWPKQRGFDYQYGAMIGELDYFTHDEHGVLDWFRDNKPVHEQGYTTTLIGDD 191 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQ 320 A+ + +PF LYL +NAPH P AP +Y ++ N T Y A V +D+ Sbjct: 192 AVKYIHGQ-DGKKPFYLYLTFNAPHTPYQ--APKEYITKYLNIAEPTRRTYAAMVDCLDE 248 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-------------NGAQKGYKSQT 367 + +++ L + G +NT+I F SDNG D NG + K Sbjct: 249 NIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAGQMADMSKVVLPCDNGPYRNGKGSL 308 Query: 368 YPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 + GG+ W GK++ D +I A+D YPT A SI K LDG ++ + + Sbjct: 309 FEGGSRVCALANWPGKIKAQTVDGMIHAVDLYPTFAALAGASIAKCKPLDGTNVWDTIAE 368 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 K + + F +R Sbjct: 369 GKPSPRTEFFY--------------------------------------SIEPFRAGLRQ 390 Query: 488 NDYSLV-YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 D+ L+ T+ + + LY L D +K+N+AAA+P V MQ + + PL+ Sbjct: 391 GDWKLIWRTMLPSSVDLYNLAEDPYEKNNIAAAHPDKVATMQARIETASKDAAKPLA 447 >UniRef50_UPI0001788C38 sulfatase n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001788C38 Length = 452 Score = 457 bits (1177), Expect = e-127, Method: Composition-based stats. Identities = 141/524 (26%), Positives = 211/524 (40%), Gaps = 131/524 (25%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PN IV+ DDLGYG L TP L L Sbjct: 15 KQPNFIVIYCDDLGYGDLGCYGS--------------------------DTVKTPHLDGL 48 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD---AQDGIPLTETFLPELFQ 172 DEG+RFTN Y V PSRA+++TG+ PAR GV G+P E L + + Sbjct: 49 ADEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEILGAKRGSHGLPADEVTLAKALK 108 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 GY TA GKWHL +EE P GFD F GF A Sbjct: 109 PAGYRTALYGKWHLGL-----------------------SEETSPNAHGFDEFFGFKAGC 145 Query: 233 TAYYNS-------------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFML 278 +Y+ L++N V G Y+++ +T+ ++ + R++ + PF L Sbjct: 146 VDFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFL 205 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 + +YNAPH P AP +Y +F A + +VD GV +I++ LK+ G Y++T Sbjct: 206 FASYNAPHYPMH--APQKYMDRFAHLPWDRQVMAAMIAAVDDGVGKIVKALKEAGCYEDT 263 Query: 339 IILFTSDNGAVIDGP-----------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 +I F+SDNG + G +G+K+ + GG P + W + G Sbjct: 264 VIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFEGGIREPAILSWPNGWEGG 323 Query: 388 NY-DKLISAMDFYPTALDAADISIP----KDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 D++ + MD PT LD A + + + LDG SL LQ ++ PH+ L W Sbjct: 324 QVRDEVAAMMDLAPTFLDLAGVDPAAGPLQGVALDGSSLKEMLQ-MREPSPHQQLFWEY- 381 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE----- 497 Q VR D+ LV + Sbjct: 382 -------------------------------------QGQLAVREGDWKLVLNGKLDFDR 404 Query: 498 --NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 +Q+ L L+ D ++ NLA P++V+ + VR++ + Q Sbjct: 405 VVPDQIHLSDLSRDPGERSNLADRYPEIVERLSRDVRDWYEEVQ 448 >UniRef50_A6C4L0 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4L0_9PLAN Length = 413 Score = 457 bits (1177), Expect = e-127, Method: Composition-based stats. Identities = 120/498 (24%), Positives = 189/498 (37%), Gaps = 115/498 (23%) Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DDLGYG L TP L L G+RFT Sbjct: 1 MADDLGYGDLSCYGSQNCN--------------------------TPHLDRLAANGIRFT 34 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------DAQDGIPLTETFLPELFQNHGYY 177 + + + V P+RA ++TGR R G+ + G+ E L + Q+ GY Sbjct: 35 DFHSSGAVCSPTRAGLLTGRYQQRAGIDGVVYANPKKNRHHGLQKNEITLAQCLQDAGYQ 94 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 T GKWHL ++ P RGF F+G+ + Y+ Sbjct: 95 TGMFGKWHLGYQR-----------------------QYNPTFRGFQQFVGYVSGNVDYFA 131 Query: 238 S------PSLFKNRE-RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND 290 + N E +GY++ + D A+ + + + ++PF +Y+A+ A H P Sbjct: 132 HLDGTGVFDWWHNAELNREEQGYVTHLINDHALEFIRQQQ--EKPFFVYIAHEAVHSPYQ 189 Query: 291 NPAPDQYQK------QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 P +K + A+ Y +D+G+ +I++ LK+ + T I F S Sbjct: 190 GPHDQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKGIGQIVDVLKEVNLTEKTFIFFLS 249 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTAL 403 DNGA + NG +G+K + GG P W G++ G D+ + ++D PT L Sbjct: 250 DNGANKN---GSNGKLRGFKGSLWEGGHRVPAIACWPGRIPEGTVCDEPVISIDLMPTIL 306 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 + A+ IP KLDGVSL+ L+D+K P + + W Sbjct: 307 ELANAKIPAGHKLDGVSLVSLLKDRKSLVPRQ-IFWEY---------------------- 343 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-VENNQLGLYKLT-DLQQKDNLAAANPQ 521 +R + LV + LY LT D+ + NLA PQ Sbjct: 344 ----------------NGKSAMRQGHWKLVLNQTRKEPIELYDLTRDMSESKNLADNQPQ 387 Query: 522 VVKEMQGVVREFIDSSQP 539 V++MQ + + Q Sbjct: 388 RVQQMQSALAAWKSDVQK 405 >UniRef50_Q7UPG6 Arylsulphatase A n=2 Tax=Bacteria RepID=Q7UPG6_RHOBA Length = 485 Score = 456 bits (1175), Expect = e-127, Method: Composition-based stats. Identities = 135/530 (25%), Positives = 205/530 (38%), Gaps = 76/530 (14%) Query: 26 AAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + H L + F ++ +PN+++L DDLGY + G Sbjct: 15 SPHRFWCTVLLLITPTLTFGQLAGETHAQTLRPNVVMLLADDLGYRDVGCYGGP------ 68 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TPT+ L G RF Y V PSRA +MTGR Sbjct: 69 ---------------------VETPTIDQLAAGGTRFQQFYSGCAVCSPSRATLMTGRHH 107 Query: 146 ARFGVYSNTD---AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 R GVYS + L E L E+ ++ GY TA VGKWHL Sbjct: 108 IRAGVYSWIQDESQNSHLRLREVTLAEVLRDAGYATAHVGKWHLG--------------- 152 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMG-FHAAGTAYYNSPSLFKNRERV-PAKGYISDQLTD 260 T ++ P GFD++ ++ A ++ N + +N E V +GY + D Sbjct: 153 ----LPTEERDKPTPDQHGFDHWFATWNNAQPSHRNPDNFIRNGEPVGQLEGYSCQLVAD 208 Query: 261 EAIGVVDRAK--TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 EAI +DR + DQPF L + ++ PH P APD+ +++ S Y ++ + Sbjct: 209 EAIRWMDRHRESDPDQPFFLNVWFHEPHAPI--AAPDEVTQKYGKLSDKGAVYSGTIDNT 266 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 DQ +KR+L +L G +NT+I++ SDNG+ + G +G K + GG P Sbjct: 267 DQAIKRLLAKLDALGVRENTLIVYASDNGSYRTDRV---GKLRGRKGANWEGGIRVPGIF 323 Query: 379 WWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG-EPHKN 436 W G + G ++ +D PT IS P+ + LDG L P L E H+ Sbjct: 324 HWPGHIPAGVVSNEPAGLVDVLPTICGLLKISPPQ-VHLDGSDLTPLLTGHADSFERHQP 382 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W S + DY + ++ ++N Y Sbjct: 383 LFWHLQRSQPIVAMRDGDYSLV------GFRDYEMSNKNLFEEKWIPAIKNGTYH----- 431 Query: 497 ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 LY L D Q NLAA P+ V+ M+ + + + + Sbjct: 432 ---NFELYNLKDDPGQTKNLAAEQPERVEAMKQRMLQINAGIMKDAMDWH 478 >UniRef50_Q7UQ05 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UQ05_RHOBA Length = 525 Score = 456 bits (1174), Expect = e-127, Method: Composition-based stats. Identities = 125/603 (20%), Positives = 220/603 (36%), Gaps = 141/603 (23%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 M + +S S + +S + A + T + + +P +PN+ Sbjct: 1 MCRQMLRSHCPVSSPSLASSNLVTTAVLLIATIASLGNPTTLVAEETSP----APSRPNV 56 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 ++ +DDLG+ L ++ TP + +L + G+ Sbjct: 57 LLFLVDDLGWADLGCYGSTYH--------------------------ETPQIDALAESGI 90 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA----------------QDGIPLTE 164 RFTN Y A V P+RA+IMTGR P R + +D + L E Sbjct: 91 RFTNAYAACPVCSPTRASIMTGRHPVRVDITDWIPGMSTDRAQNPRFQHVDDRDNLALDE 150 Query: 165 TFLPELFQN-HGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 + E ++ Y T +GKWHL + P ++GF Sbjct: 151 VTIAEHLRDAADYQTFFLGKWHLGDVG------------------------HLPTDQGFQ 186 Query: 224 YFMGFHAAGTAYYNSPSLFKNR--ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 +G G+ S +KN + Y++ +LTDEA+ +VD A D+PF + ++ Sbjct: 187 INIGGGHKGSPPGGYYSPWKNPYLKAKQDGEYLTTRLTDEAVSLVDTASREDKPFFMMMS 246 Query: 282 YNAPHLPNDNPAP--DQYQKQFNTGS-------------------QTADNYYASVYSVDQ 320 Y H P D ++++ + Q Y + V +VD Sbjct: 247 YYNVHSPITPDKRTIDHFEEKQSNSPELQGDTPTIAERDAVTRGRQDNPAYASMVKAVDT 306 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVID---GPLPLNGAQKGYKSQTYPGGTHTPMF 377 V RI++ LK++G DNT+++F SDNG + N + K Y GG P+ Sbjct: 307 SVGRIMKALKEHGVDDNTLVIFFSDNGGLSTLRKFGPTCNSPLRAGKGWLYEGGIREPLL 366 Query: 378 MWWKGKLQPG-----------NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 + + G D + + D +PT LD + + + DG+SLLP + Sbjct: 367 VRLPKTMPGGATNETVSHQPKTVDSVACSTDLFPTILDVVGLPLQPESHADGISLLPAIA 426 Query: 427 DK--KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 + + ++L W + H L + Sbjct: 427 GEAAETDSSPRDLHWHYPHYH------------------------------GSLWRPGAA 456 Query: 485 VRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 +R +Y L+ E + LY L+ D+ + +L+ P+ E++ +R++ + Sbjct: 457 IRRGNYKLIEFYETDTAELYDLSVDMGETKDLSKTEPERFAELRDALRQWQTEMNAKMPV 516 Query: 544 VNQ 546 N Sbjct: 517 PNP 519 >UniRef50_A6C430 Arylsulphatase A n=2 Tax=Planctomycetaceae RepID=A6C430_9PLAN Length = 503 Score = 456 bits (1173), Expect = e-126, Method: Composition-based stats. Identities = 124/525 (23%), Positives = 209/525 (39%), Gaps = 68/525 (12%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 A + + TN + + + +PNI+V+ DDLGYG L Sbjct: 6 LALIIVISILFTNESLAAEPTASVKSPARPNIMVVLCDDLGYGDLACYG----------- 54 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 +P + EG++ T+ Y AH PSRA +MTGR P R Sbjct: 55 ---------------HPVIQSPNIDRFAKEGLKLTSCYAAHPNCSPSRAGLMTGRTPFRV 99 Query: 149 GVYSNTD--AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 G+Y+ + + E + L + GY T VGKWHL+ + N+ Sbjct: 100 GIYNWIPMLSPMHVRKREITIATLLRQAGYATCHVGKWHLNGMFNMVGQP---------- 149 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERV-PAKGYISDQLTDEAIG 264 QP + GFD++ + +P + +N V P +G+ S + DEA Sbjct: 150 ---------QPSDHGFDHWFSTQNNALPTHENPFNFVRNARPVGPLQGFASQLVADEAEE 200 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVK 323 + + + ++PF +++ ++ PH P + ++++K + T ++ +V +D Sbjct: 201 WLTQLRDKEKPFFMFVCFHEPHEPIASA--ERFRKLYTAPEGSTLPAHHGNVTQMDDAFG 258 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 RIL+ L +NT+I+FTSDNG I P +G + K TY GG P + W Sbjct: 259 RILKTLDDQKLRENTLIIFTSDNGPAITRRHPHGSSGPLRDKKGATYEGGIRVPGIVQWP 318 Query: 382 GKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 +QPG + +D PT ADI P D LDG ++LP L+ K K L W Sbjct: 319 EHVQPGTTSDVPVCGVDILPTLCAVADIPAPTDRVLDGTNILPLLEGKPI-LRKKPLYWQ 377 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + N K + + S + + V + Sbjct: 378 FN-----------RAKNDAKVALRDGEWKLLAKLNVPSPKPSGGITTEEIDAVKNAKLEG 426 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 LY + +D+ + + A + +++K+M+ ++ D Q Sbjct: 427 FELYHIQSDIAETTDRAESEQEILKKMKQQMQAIFDEVQAEAPRW 471 >UniRef50_A6LED2 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LED2_PARD8 Length = 468 Score = 456 bits (1173), Expect = e-126, Method: Composition-based stats. Identities = 123/561 (21%), Positives = 197/561 (35%), Gaps = 131/561 (23%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +KL + +V +PN+I++ +DD GYG L + Sbjct: 1 MKLISNIISVLAFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGST------------- 47 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 + TP + + EG+R T+ YV VS PSR+A++TG P R ++ Sbjct: 48 -------------KHRTPHIDQMAKEGIRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHV 94 Query: 153 NTDA---------------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPED 197 N D G+ E + EL + GY TA +GKWHL Sbjct: 95 NADPTPLMSKGRQVLFPASHKGLNPGEITIAELMKEQGYATACIGKWHLGDQ-------- 146 Query: 198 KQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---PSLFKNRERVPAKGY- 253 + P +GFDY+ G + P + + V G+ Sbjct: 147 ---------------LPFLPTRQGFDYYYGIPYSNDMDRPYCPLPLMEQEEVIVAPVGHD 191 Query: 254 -ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 ++ + T++ + + K PF +YL +N H P G Y Sbjct: 192 SLTIRYTNKTVEFIKSHKES--PFFIYLCHNMTHNPLAASP-------AFKGKSQNGLYG 242 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 + +D + +LE LK+ G NT+I+FTSDNGA + N +G K TY GG Sbjct: 243 DATEELDWSMGVLLETLKEEGLDQNTLIIFTSDNGAD-EHFGGTNRPLRGQKGTTYEGGF 301 Query: 373 HTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P M W K+ G D L+++MDF PT ++P D +DG ++ L+ + Sbjct: 302 RVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAHYCSYAVPSDRVIDGHNVSGILEGESMA 361 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 P + + Q VR ++ Sbjct: 362 SPTETFYYY-------------------------------------QKQQLQAVRWGNWK 384 Query: 492 LVYTVEN------------NQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 ++ + LY L DL + N+ +P+VV +M + + Sbjct: 385 YHLPLKERIKGPHFPDTEVGEARLYNLANDLSETTNVIDKHPEVVTKMNQWIEQVRSDMG 444 Query: 539 PPLSE-VNQEKFNNIKKALSE 558 E NQ I + Sbjct: 445 DWGYEGRNQRPAGIIDEPFPR 465 >UniRef50_D0PR28 N-acetylgalactosamine 6-sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR28_9SPHI Length = 602 Score = 455 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 123/539 (22%), Positives = 201/539 (37%), Gaps = 102/539 (18%) Query: 28 HAADDVKLKATKTNVAFSDFTP--TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 ++ L ++ F+ T + T+ PN+IV+ DD G+G + Sbjct: 8 YSLKGKALICVVCSLLFASCTAKVVQEQTQRPPNVIVILTDDQGWGDFSHTGNEY----- 62 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TP + +EG YV V P+RA+++TGR Sbjct: 63 ---------------------LKTPHFDKMTEEGALLDQFYV-SPVCAPTRASVLTGRYH 100 Query: 146 ARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 R GV T ++ + E + E+F+ GY T GKWH Sbjct: 101 LRTGVSFVTRGRENMRSEEVTIAEVFKEAGYATGCFGKWHNGAHY--------------- 145 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGV 265 PQ +GFD F+GF + + Y L N E KG+I+D L DE I Sbjct: 146 --------PENPQGQGFDTFLGFTSGHWSNYFDTELEYNGEMKSTKGFITDVLMDETIQF 197 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS--------QTADNYYASVYS 317 +D K D+PF+ ++ NAPH P PD+Y ++ + Y + Sbjct: 198 IDAHK--DEPFLAFVPLNAPHTPYQ--VPDKYFDKYKDIDFGYDKKQNKKIATIYGMCEN 253 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 +D + ++++ LK +NTI++F SDNG NG +G K+ + GGT P Sbjct: 254 IDDNLGKLMKHLKDQELEENTIVVFLSDNGPQ---GARYNGPWRGGKTSVHEGGTLVPCA 310 Query: 378 MWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 + WKG + + L + +D PT + A I P++++ DG+ L +L +NL Sbjct: 311 IQWKGHIPNSSKSSLTAHIDLMPTLMGLAGIEKPENIQFDGIDLSNYLMGTSDDLGERNL 370 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 + + VR DY +T E Sbjct: 371 YTHMTNFEITADRG--------------------------------AVRQGDYR--FTTE 396 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 +GLY L D +++NL P+ +E++ + + + Sbjct: 397 YGDVGLYNLKEDPSEENNLKDQLPEKTQELKTAFENWYKDVTSAGFSDLKIPMGYTESP 455 >UniRef50_A6CAY0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAY0_9PLAN Length = 466 Score = 455 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 132/534 (24%), Positives = 203/534 (38%), Gaps = 118/534 (22%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + L + + T E +PNI+++T D+LGYG L Sbjct: 10 ILLFVWVSFSVPAPVTAAEKPENKRPNILLITADNLGYGDLGCYG--------------- 54 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L L EGVR T+ Y A SRA ++TGR P R G+ Sbjct: 55 -----------NPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNH 103 Query: 153 NTDA----QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 A DG+ +E +PE + GY TA GKW++ Sbjct: 104 QLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVG--------------------- 142 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAI 263 + +P RGFD F GF A YY+ L++ + V +GY +D D A Sbjct: 143 --FSPGSRPTERGFDEFFGFAAGNIDYYHHYYAGRHDLWRGLKEVFVEGYSTDLFADAAC 200 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLP----------NDNPAPDQYQKQF---NTGSQTADN 310 + + DQPF +YL +NAPH P N+ APD +++ + Sbjct: 201 QYI--SAESDQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQER 258 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP----LNGAQKGYKSQ 366 Y A V ++D + R+L+QL +G D TI+++ SDNGA + N + Sbjct: 259 YRAVVTALDSAIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEVASNKPLRDGGVT 318 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 + GG P + + G L+ G ++ + ++D PT + A +P + LDG +LP L Sbjct: 319 LWEGGIRVPAIIRYPGHLKAGTVNQSPLISLDILPTLITLAGGPLPAERILDGQDMLPAL 378 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + EP ++ V Sbjct: 379 AAQTAPEPRTFFFQYRNF---------------------------------------SAV 399 Query: 486 RNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 R Y LV N L+ L DL + +LA NP+V+ ++Q ++ Sbjct: 400 RRGKYKLVRIKPNQPFMLFDLEQDLSETTDLAERNPKVLNQLQQAYADWEREVA 453 >UniRef50_A6C3C8 Putative uncharacterized protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C3C8_9PLAN Length = 600 Score = 455 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 131/521 (25%), Positives = 195/521 (37%), Gaps = 91/521 (17%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 +PNII++ DD GY + + TP Sbjct: 28 AKEKSRQPNIILVMTDDQGYWDT--------------------------EISGNPKIKTP 61 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPEL 170 T+ L EGV FT Y V P+RA +MTGR R G+Y+ D + ET + ++ Sbjct: 62 TIKKLAAEGVTFTRFYANM-VCAPTRAGLMTGRHYLRTGLYNTRFGGDTLGPNETTIAQV 120 Query: 171 FQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA 230 Q GY T GKWHL + + ++QPQ RGFD+F G + Sbjct: 121 LQKAGYKTGLFGKWHLGRYA-----------------------QYQPQRRGFDHFFGHYH 157 Query: 231 AGTAYYNSPS-LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLP- 288 Y +P + N V +GY++D TD AI + R + QPF YLAYNAPH P Sbjct: 158 GHIERYTNPDQVVVNGTPVETRGYVTDLFTDAAIDFIQRNQQ--QPFFCYLAYNAPHSPF 215 Query: 289 ------NDNPAPDQYQKQF--NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 P D+ +++ YA + +DQ + R+L+ + T++ Sbjct: 216 LLDTSHFGQPEGDKLIEKYLAKGLPLREARIYAMIERIDQNLSRLLQTVHDLKLDQETVV 275 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFY 399 +FTSDNG V KG K+ Y GGT P + W G D +++ D + Sbjct: 276 IFTSDNGGVSR---GFKAGLKGSKASAYEGGTRVPFVVRWTDHFPAGKTTDAMVAQTDLF 332 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 PT A + +P ++KLDG S+L ++ PH+ L Sbjct: 333 PTFCQLAGVPVPSNVKLDGESILSLMEQGGGKSPHQYLYHTWDRYTPNPYHRWAIHGPRF 392 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAA 518 K V H LY L D +K N+A Sbjct: 393 KLVGHDPQGKKKKEGEPQG-----------------------QLYDLQEDPGEKKNVADQ 429 Query: 519 NPQVVKEMQGVVREFIDSSQP-PLSEVNQEKFNNIKKALSE 558 P+ V E++G + + E + ++ E Sbjct: 430 YPEKVSELRGEFLRWFQDVTAGQVYEPAAIPVGDEQEPEVE 470 >UniRef50_Q7UJ66 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UJ66_RHOBA Length = 616 Score = 455 bits (1172), Expect = e-126, Method: Composition-based stats. Identities = 119/529 (22%), Positives = 193/529 (36%), Gaps = 80/529 (15%) Query: 17 ILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFD 76 H+ + + +++ +PN+I++ DD GYG + Sbjct: 16 CNCPHTIPIIDHSMHHRIWILLAACLTTCSPAWAQTASESRPNVILVVTDDQGYGDMSCH 75 Query: 77 KGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSR 136 +TP L L + VR N +V P+R Sbjct: 76 G--------------------------NPWLNTPNLDRLATQSVRLENFHV-DPFCTPTR 108 Query: 137 AAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE 196 AA+MTGR R G ++ T+ + + ET + E F+ GY T GKWHL P Sbjct: 109 AALMTGRYCTRVGAWAVTEGRQLLDPDETTMAETFRESGYRTGMFGKWHLGDP-PPFAPR 167 Query: 197 DKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISD 256 ++ + + E P G DYF + ++N GY +D Sbjct: 168 ERGLETVVRHMAGGADEIGNPT--GNDYF------------DDTYYRNGTPESFDGYCTD 213 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVY 316 +EAI + K +QPF Y+ NA H P + +Y + Sbjct: 214 IWFEEAIDFI--QKESEQPFFAYIPTNAMHSPYLVADRYSDPFKRQGIEPQRAAFYGMIQ 271 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGG 371 + D+ + R+L++L ++ DNT+++F SDNG + N +G K Y GG Sbjct: 272 NFDENLGRLLKRLDQDNLRDNTMLIFMSDNGTAQGASEQNRKVGFNAGMRGKKGSVYEGG 331 Query: 372 THTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P F W K D+L D+ PT ++ D+ P D+ DG S+ L Q Sbjct: 332 HRVPCFASWPAKWDGNRPVDQLTCHRDWLPTLIELCDLKRPADVTFDGRSMAGLLSHSSQ 391 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 P + L + Q D+ T+ +Q + V + + Sbjct: 392 QWPERTLV-----------------------IERQPDNVVSATKTQGRAQPPFVVLTDRW 428 Query: 491 SLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 LV LY + D Q N+AA P+VV+E++ + + Sbjct: 429 RLVRD------ELYDIQNDPGQIKNIAAEYPEVVRELRAEYDAYFEDVH 471 >UniRef50_Q0C069 Sulfatase family protein n=3 Tax=Bacteria RepID=Q0C069_HYPNA Length = 505 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 119/559 (21%), Positives = 197/559 (35%), Gaps = 141/559 (25%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 E + +PNI+++ +DD+GY + Sbjct: 34 SVAEKEAAASEQPNIVLIFVDDMGYADIGSFGS--------------------------P 67 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--------NTDAQ 157 TP L L EG ++T+ Y V PSRA +MTGR R G+ + Sbjct: 68 IARTPNLDRLAMEGQKWTSFYAPAPVCTPSRAGLMTGRLAVRSGMAGLVQARHVLFPTST 127 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 G+P +E + EL Q GY +AA GKWH+ + E+ P Sbjct: 128 GGLPQSEVTIAELLQQEGYVSAAFGKWHMGHL-----------------------PEFLP 164 Query: 218 QNRGFDYFMGFHAAGTAYY---------------------NSPSLFKNRERVP---AKGY 253 + GF + G + L ++ E + + Sbjct: 165 TSHGFQSYFGIPYSNDMNMPGGGETPWSIDLFFEPPNIQNWDVPLMQDEEIIERPADQFT 224 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA 313 ++ + T+ AI ++ + QPF LYLA+N PH P + + TG Y Sbjct: 225 LTQRYTERAIEFMETSHAEGQPFFLYLAHNMPHTPL-------FTSEGFTGVSAGGAYGD 277 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGG 371 + +D V I++ LK NT+++FTSDNG + G + K T+ GG Sbjct: 278 VIEELDWSVGEIVDALKDMKIEKNTLVIFTSDNGPWLAMKTHSGSAGMLRDGKGTTWEGG 337 Query: 372 THTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P WW G++ P L SA+D PT + +P+D DG L P L + Sbjct: 338 MRVPAIFWWPGQIAPRTVTDLGSALDLMPTFAAISGARLPEDRVYDGFDLSPALFSEGSS 397 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + + + VR Y Sbjct: 398 PRETLYYYRFTDV--------------------------------------FAVRKGKYK 419 Query: 492 LVYT----------VENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 ++ E LY + D ++ N+AA +P++V E++ + + S +P Sbjct: 420 AHFSTYGAFGGSGRTELETPELYDIEADPSEQFNIAAQHPEIVMELKVLAEKQAASVEPV 479 Query: 541 LSEVNQEKFNNIKKALSEA 559 +++ E++ +K E Sbjct: 480 ENQL--ERYPPGEKRGEEG 496 >UniRef50_C6W2Y9 Sulfatase n=15 Tax=Bacteroidetes RepID=C6W2Y9_DYAFD Length = 481 Score = 455 bits (1171), Expect = e-126, Method: Composition-based stats. Identities = 125/557 (22%), Positives = 195/557 (35%), Gaps = 134/557 (24%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + L T+ + + +PNI+ + DDLGYG + F+ Sbjct: 5 LLLIPLLTSSFLTQRADAQAPKPQRPNIVFILADDLGYGDVGFNG--------------- 49 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP + L EG+ F Y V PSR++++TG+ + Sbjct: 50 -----------QKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRG 98 Query: 153 N----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 N + Q I + T L E+ + GY TAA GKW L + + Sbjct: 99 NKGVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLGPVGS----------------- 141 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRER---------VPAKGYISDQL 258 E P +GFD F G++ A+ P L+ N ++ + K Y D + Sbjct: 142 -----EGDPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKILLEGNKGLIHNKEYAPDLI 196 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD---QYQKQFNTGSQ--------- 306 +A+ V A+ QPF L+L Y PH P Y+ +F Sbjct: 197 QKKALSFV-NAQDGKQPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGA 255 Query: 307 ----------TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--- 353 + A V +D V +++ LKK G NT+++FTSDNG ++G Sbjct: 256 NGGGYASQDFPHATFAAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNGPHVEGGADP 315 Query: 354 --LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI-SAMDFYPTALDAADISI 410 +G K Y GG P W ++PG+ I + D PT + A+ Sbjct: 316 RFFNSGAGFRGVKRDLYEGGIREPFAARWPAAIKPGSKSDYIGAFWDILPTFAELANAPA 375 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 P +DG+S L+ K + H L W Sbjct: 376 P--RNIDGISFTDALKGKAIQKKHDYLYWEFHE--------------------------- 406 Query: 471 HNPNTEDLSQFSYTVRNNDYSLVYTVENNQL----GLYKLT-DLQQKDNLAAANPQVVKE 525 VR ++ V LY L+ D Q+K+NL P+ KE Sbjct: 407 --------QGGRQAVRQGNWKAVRLKAAGNPDALVELYDLSKDPQEKNNLTPQFPEKAKE 458 Query: 526 MQGVV-REFIDSSQPPL 541 + ++ R + S+ P Sbjct: 459 LGQIMNRAHVSSAIFPF 475 >UniRef50_A6DKC9 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKC9_9BACT Length = 454 Score = 454 bits (1168), Expect = e-126, Method: Composition-based stats. Identities = 129/542 (23%), Positives = 208/542 (38%), Gaps = 113/542 (20%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K T + KPNI+++ DDLGY + + Sbjct: 1 MKIFITLLFSCSLLW----ATDKPNILIILADDLGYADVGYHG----------------- 39 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 + TP + + +EGV+F+ GY + GP+RAA+M+G R G Sbjct: 40 ---------LEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEGIC 90 Query: 155 DAQD-------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + G+P L + FQ GY T GKWHL Sbjct: 91 GGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGER----------------- 133 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEA 262 + P +RGFD F G + Y ++ + ++ Y +D + EA Sbjct: 134 --LFDKTLMPTSRGFDEFFGILEGASLYDDTVNRERKYIRQDTVIDYEGEYFTDAIGREA 191 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQG 321 + + R D+PF LYL + A H P A ++Y ++F + A + ++D Sbjct: 192 VSFITR--KGDKPFFLYLPFTAVHAPMQ--ASEKYMQRFAHIADPNRRVFAAMLSAMDDN 247 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 + R+ + L+ G DNT+I+F SDNG D LN KG K+Q Y GG P + W Sbjct: 248 IGRVFDALEHQGILDNTLIVFWSDNGGKPDNNYSLNHPLKGQKTQFYEGGIRVPACVRWP 307 Query: 382 -GKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G++ G D+ + MD +P+AL+AA I++PKD ++ ++LP +Q K PH + W Sbjct: 308 KGQIPAGKTLDQPVFLMDIFPSALEAAQITVPKD--IEAKTILPLMQGKTNQTPHPAMFW 365 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 + VR D+ L + Sbjct: 366 KRA--------------------------------------GKMAVRMGDWKL--SNAGG 385 Query: 500 QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSE 558 L+ L D+ + N+ +P + +M + + + P +K IK L Sbjct: 386 PSELFNLKQDISESRNIIDQHPDIANKMNRLWLNWDKKNVPAYF--GHDKALPIKVPLLH 443 Query: 559 AK 560 K Sbjct: 444 RK 445 >UniRef50_B9KQS8 Twin-arginine translocation pathway signal n=2 Tax=Alphaproteobacteria RepID=B9KQS8_RHOSK Length = 509 Score = 454 bits (1168), Expect = e-126, Method: Composition-based stats. Identities = 116/548 (21%), Positives = 198/548 (36%), Gaps = 118/548 (21%) Query: 25 FAAHAADDVKLKATKTNVAFSD-----FTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGS 79 +AH L + +A P +P+I+ + +DDLGY + + Sbjct: 26 LSAHPNRRDVLAGSAGFLAAIAGLSILAQPARAQEVARPHILYILVDDLGYADVGYHGS- 84 Query: 80 FDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAI 139 TP + L EG R Y + P+RAA+ Sbjct: 85 --------------------------DVKTPNVDRLAAEGARLMQFY-TQPLCTPTRAAL 117 Query: 140 MTGRAPARFGVYS---NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE 196 MTGR P R+G+ + + + G+ E LP++ + GY TA VGKWHL Sbjct: 118 MTGRYPMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHAD------ 171 Query: 197 DKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAK 251 +++ P+ RG DYF G ++ +++ E V Sbjct: 172 ----------------QKYWPRQRGVDYFYGPLVGEIDHFKHEAHGITDWYRDNEMVKEP 215 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTADN 310 GY ++ +AI +++ P +YL++ APH P APD+Y+ + + Sbjct: 216 GYDTELFGADAIRLIEEH-DSATPLYMYLSFTAPHTPYQ--APDKYKDLYPDIADEGRKA 272 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-----------NGA 359 Y A + +D V +L+ L++ G ++T+++F SDNG N Sbjct: 273 YAAMISCMDDQVGLVLQALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDP 332 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGV 419 + K Y GGT W G++ G ++ +D PT A I +LDG+ Sbjct: 333 LREGKGTLYEGGTRVVALANWPGRIPAGETHGMMHVVDMLPTLAGLAQAEIAHAGQLDGM 392 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 + + K + + I Sbjct: 393 DVWQAISAGKASPREEVVYNIE-------------------------------------- 414 Query: 480 QFSYTVRNNDYSLVY-TVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 +R+ + L + + ++ L+ L D + +L+A P+ + MQ V + S Sbjct: 415 PTQGALRDGKWKLYWQPILPPKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARSM 474 Query: 538 QPPLSEVN 545 PPL N Sbjct: 475 APPLFYAN 482 >UniRef50_B4CVD2 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CVD2_9BACT Length = 631 Score = 453 bits (1167), Expect = e-126, Method: Composition-based stats. Identities = 137/553 (24%), Positives = 212/553 (38%), Gaps = 128/553 (23%) Query: 34 KLKATKTNVAFSDFT------PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 + A A S F + ++ KPNI+ + DDLG L Sbjct: 4 RALALSLCFAVSLFAKDGDGGASAPKSRDKPNIVFILCDDLGVNDLSCYGRKDQ------ 57 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 TP L L EG+RFT Y A + SRAAIMTG+AP R Sbjct: 58 --------------------QTPNLDRLAGEGMRFTCAYCASPICSASRAAIMTGKAPGR 97 Query: 148 FGVYSNTDAQ--------------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVP 193 + + + +PL E + + GY +A +GKWHL Sbjct: 98 VHITNFLPGRADAPSQKFIQPEIEGQLPLEENTIAKALHGAGYVSACIGKWHLGG----- 152 Query: 194 VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGY 253 + + P N+GFDY HA PS + + Y Sbjct: 153 -------------------KGFLPTNQGFDYAFAGHAN-----TKPSATEGGK----GEY 184 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA 313 +LT EA +++ K D PF LYLA+N+PH+P P+ +K + Y A Sbjct: 185 ---ELTAEAERWLEKNK--DHPFFLYLAHNSPHVPL-AAKPELIEKHKDAW---NPIYAA 235 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA-----VIDGPLPLNGAQKGYKSQTY 368 + S+D V RI++++ + G + TI +FTSDNG + + P N + K Sbjct: 236 MIESLDDCVGRIMKKVDELGLTEKTIFIFTSDNGGLHVYELPNTPSTYNAPFRAGKGYLE 295 Query: 369 PGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDL-KLDGVSLLPWLQ 426 GG P+ + W GK++ G ++ + DF PT + AA + + + LDGV++LP L Sbjct: 296 EGGLREPLIVRWPGKIKAGATNETPVVLYDFMPTLMTAAGLDVAHTVGPLDGVNILPLLT 355 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + L W T S+ + +R Sbjct: 356 GGTIPP--RTLYWHFPNY------------------------------TNQGSKPAGAIR 383 Query: 487 NNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 + ++ L+ E L LY + D +K++LA + V E+QG + + S + N Sbjct: 384 DGEWKLIQDDETGNLELYNIAADPGEKNDLAKSQSARVSELQGKLAAWRKSIGAQMGTAN 443 Query: 546 QEKFNNIKKALSE 558 + K L E Sbjct: 444 PNFDSAFHKRLYE 456 >UniRef50_UPI0001C366AB sulfatase n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C366AB Length = 470 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 120/546 (21%), Positives = 205/546 (37%), Gaps = 136/546 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN + + MDD+G+ L +F TP + L Sbjct: 4 QPNFLFIFMDDMGWRDLACTGSTF--------------------------YETPNIDRLC 37 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD----------------GI 160 +G+ F N Y + V PSRA+ +TG+ PAR GV D + + Sbjct: 38 RQGMVFANSYASCPVCSPSRASCLTGKYPARLGVTDWIDMEGTSHPLKGKLIDAPYIKHL 97 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P E + + ++ GY T VGKWHL E+ P++ Sbjct: 98 PEGEYTIAQALKDAGYDTWHVGKWHLGGR------------------------EFYPEHF 133 Query: 221 GFDYFMGFHAAGTAY--YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD--QPF 276 GFD +G + G + Y SP + P Y++D++TDEA+ ++ + + +PF Sbjct: 134 GFDVNIGGCSWGHPHDGYFSPYGIETLSEGPEGEYLTDRITDEAVRLLRKRQACGSRKPF 193 Query: 277 MLYLAYNAPHLPNDNPAPDQ------------------YQKQFNTGSQTADN-------- 310 + L + A H P D+ + +F+ Sbjct: 194 YMNLCHYAVHTPIQVKDEDRARFEKKARELGLDKETALVEGEFHHTEDKKGRRVVRRVIQ 253 Query: 311 ----YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYK 364 Y ++++DQ + R+LE L++ G+ +NT+++FTSDNG +G N K Sbjct: 254 SDPSYAGMIWNLDQNIGRLLEALRECGEEENTVVVFTSDNGGLATSEGSPTCNLPASEGK 313 Query: 365 SQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GGT P+ + + G++ PG D ++ DFYPT L+ A + + +DG S++P Sbjct: 314 GWVYEGGTRVPLIVKYPGRVAPGSRCDVPVTTPDFYPTFLELAGVPQKAGIPIDGRSIVP 373 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L P + + W + + Sbjct: 374 LLSGN--PMPERPIFWHYPHYGNQGGT------------------------------PAS 401 Query: 484 TVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 +V DY + E+ + LY L D + +NL P+ ++ ++ + Sbjct: 402 SVVMGDYKYIEFFEDGRGELYDLKADFSETNNLCEKMPETAARLRMLLHGWQREVCARFP 461 Query: 543 EVNQEK 548 E N E Sbjct: 462 EENAEY 467 >UniRef50_A6DHI0 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI0_9BACT Length = 456 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 135/532 (25%), Positives = 202/532 (37%), Gaps = 117/532 (21%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 ++ F ++ KPNII + DD+GYGQL Sbjct: 4 ISVFVFLMFAANSADKPNIIFIMCDDMGYGQLGSYG------------------------ 39 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---TDAQD 158 TP L + EG+R T+ Y V PSR ++MTG+ + N Q+ Sbjct: 40 --QKMIKTPRLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTGQHVGHTYIRGNKEYPTGQE 97 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 IP + E + GY TA +GKW L + P Sbjct: 98 PIPAETITVAEKMKEAGYATALIGKWGLGYPGSEGE----------------------PN 135 Query: 219 NRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPAK-------GYISDQLTDEAIGVVDRAK 270 +GFDYF G++ A+ + P L +N E + K Y LTDEA G + + K Sbjct: 136 KQGFDYFFGYNDQKHAHNHFPKFLLRNEETLTLKNNSGKEIEYSQYMLTDEAKGFIKKNK 195 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQ--YQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 D PF LYLAY PH P D+ Q + + + + + +D+ V IL+ Sbjct: 196 --DNPFFLYLAYVIPHSRLQIPGDDECYLQYKDESWPEKQKKHAGMISRLDKDVGSILDL 253 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 LK+ +NT+++FTSDNGA +G +G G K Y GG P W G Sbjct: 254 LKEMNLAENTLVVFTSDNGAHREGGARPEFFNDSGPLSGIKRSMYEGGVRVPFIAHWPGV 313 Query: 384 LQPGNYDKLI-SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWIT 441 ++PG I + D PTA + + P+ +DG+S +P L+ ++ E H L + Sbjct: 314 IKPGQVSNHIGAHWDLMPTACELGGVQPPEG--IDGISYVPLLKGNMEEQEKHDYLYFEL 371 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY------T 495 VR D+ + Sbjct: 372 H------------------------------------WPTKRGVRKGDWVALQSKTSAID 395 Query: 496 VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + + L+ L DL QK +LA P+ V+E + + E + PL E Q Sbjct: 396 PNKDTIKLFNLKNDLGQKKDLATQYPEKVEEFKKIFLE--AHTPAPLFEFGQ 445 >UniRef50_A6DF72 Putative secreted sulfatase ydeN n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF72_9BACT Length = 481 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 127/535 (23%), Positives = 196/535 (36%), Gaps = 112/535 (20%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 KPN+I++ +DDLG+ TP + L Sbjct: 23 KPNVIMILVDDLGWTDTTCYGS--------------------------DLYQTPNVDELS 56 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-------------QDGIPLT 163 G+RFT+ Y A V P+R++IMTG+ PA + + + Sbjct: 57 RTGMRFTDAYSACTVCSPTRSSIMTGKNPANNNLTDWITGHVKPYAKLKSPNWKMHLTAE 116 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E L E F+ GY T +GKWHL + S P+N+GFD Sbjct: 117 EITLAEAFKATGYKTVHIGKWHLGEESVS-----------------------WPENQGFD 153 Query: 224 YFM-GFHAA-----GTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 + GF A G Y SP + P Y++++L EA + L +PF Sbjct: 154 ENIAGFRAGSPSAHGGGGYFSPYNNPRLKDGPKGEYLTERLAQEASQYIQSTAKLKKPFF 213 Query: 278 LYLAYNAPHLPNDNPAP--DQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 + L H P D+Y + T Y A V +D V +++ +K G Sbjct: 214 MNLWLYNVHTPLQARQEKIDKYTRLIQKGYQHTNPVYAAMVEHMDDAVGTVMQAVKDAGI 273 Query: 335 YDNTIILFTSDNGAVIDGP------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG- 387 DNTII+F SDNG + + N + K Y GG PM + W K++ G Sbjct: 274 EDNTIIIFNSDNGGLRGNYENNRQKVTSNYPLRSGKGDMYEGGVRVPMIIKWSRKIKAGQ 333 Query: 388 NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP-WLQDKKQGEPHKNLTWITSYSHW 446 + + D YPT LD I + K +DG+SL+P L+ K L W + H Sbjct: 334 TSSSPVISHDIYPTLLDLCKIDVSKKQDIDGISLVPELLEGKTIQ--RDALYWHYPHYHL 391 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL 506 + +R D+ L++ E + LY L Sbjct: 392 EGA------------------------------KPYSAIRKGDWKLIFLYEESHAELYNL 421 Query: 507 -TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSEAK 560 D+ +++NLA + + E+ G +R + L N ++ +K Sbjct: 422 RNDISERNNLAMTEKRKLAELMGDLRTWKKKIGAQLPVFNPNYNFEKEQNWIFSK 476 >UniRef50_B4D4S5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S5_9BACT Length = 486 Score = 453 bits (1165), Expect = e-125, Method: Composition-based stats. Identities = 125/542 (23%), Positives = 188/542 (34%), Gaps = 126/542 (23%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 KPNI+ + DD+G+ L A TP + Sbjct: 23 PDKPNILFILADDMGWSDLGCYG--------------------------ADLHETPNIDR 56 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ----------------D 158 VRFT+ Y V PSR+ +MTG+ AR + Sbjct: 57 FASGAVRFTSAYA-MSVCSPSRSTLMTGKHAARLHFTIWAEGAQEGGAKNRELREAESIW 115 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 +P +E + ++ GY TA +GKWHL E P+ Sbjct: 116 NLPNSEKTIATYLKSAGYLTALIGKWHLGD------------------------WEHYPE 151 Query: 219 NRGFD------------YFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 GFD F ++ + + E Y++D+LTDEAI V+ Sbjct: 152 AHGFDINIGGTNWGAPQTFWWPYSGSGTHGPEFRYIPHLEYGHPGEYLTDRLTDEAIKVI 211 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPD--QYQKQFNTGSQTADN-YYASVYSVDQGVK 323 D DQPF +YLA++A H P + A D + ++ G Y A +D+ V Sbjct: 212 D--HAGDQPFFVYLAHHAVHTPIEAKADDIQHFDAKYRDGMNHRHTIYAAMNKELDENVG 269 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVID--------GPLPLNGAQKGYKSQTYPGGTHTP 375 R+LE LK+ G NT+++F SDNG I P+ N + K Y GG P Sbjct: 270 RVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKGALYEGGIRVP 329 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + + W G G D+ + D T L P DG+ + P L+D Sbjct: 330 LIIRWPGVTPNGATCDEPVILTDMLQTFLHITG-QPPATDATDGMDISPLLKDPSAKLNR 388 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 L + + + +R D+ L+ Sbjct: 389 DALFFHYPHYYHTT-------------------------------TPVSAIRARDWKLLE 417 Query: 495 TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 E+N L LY L DL +K +LA P ++ + + DS L + N + Sbjct: 418 FYEDNHLELYNLRNDLSEKHDLAKEMPDKAAALRDQLNAWRDSVGAVLPQPNPDFKGGKP 477 Query: 554 KA 555 K Sbjct: 478 KP 479 >UniRef50_A3ZMN6 Arylsulfatase B n=3 Tax=Bacteria RepID=A3ZMN6_9PLAN Length = 455 Score = 451 bits (1161), Expect = e-125, Method: Composition-based stats. Identities = 121/529 (22%), Positives = 210/529 (39%), Gaps = 104/529 (19%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 ++ + + T + +PNI+ L DDLG + + Sbjct: 4 IRSLILTIALTLASVATTFATDAPRPNIVFLLADDLGGADVSWRGSP------------- 50 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L +L + G + YV V P+R+A++TGR P R+G+ Sbjct: 51 --------------IKTPQLDALANSGAKLEQFYV-QPVCSPTRSALLTGRYPMRYGLQV 95 Query: 153 N---TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 A G+PL E L E Q+ GY TA VGKWHL +S Sbjct: 96 GVVRPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVS------------------- 136 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAIG 264 + P RGFD+ G + Y+ K+ +GY + + EA+ Sbjct: 137 ---PAYLPMARGFDHQYGHYNGALDYFTHDRDGGHDWHKDDHVNRDEGYATHLIAQEAVR 193 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 V+ + +P LY+ +NA H P P+ Y + + Y V ++D+ V + Sbjct: 194 VIQD-RDKKKPLFLYVPFNAVHSPLQ--VPESYAAPYGDMKKRRQAYAGMVAALDEAVGQ 250 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 I++++++ DNT+ +F+SDNG G L NG +G K Y GG F WKG++ Sbjct: 251 IVDEIQRQEMLDNTLFIFSSDNGGPEPGKLTDNGPLRGGKHTLYEGGVRVCAFASWKGRI 310 Query: 385 QPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 PG+ + + +D+YPT ++ A S+ + LDG ++ P + + PH + Sbjct: 311 APGSKVEAPLHIVDWYPTLIELAGGSLQQAKPLDGRNIWPSITTG-EPSPHDVIV----- 364 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT---VENNQ 500 +++ +R D+ LV + Sbjct: 365 --------------------------------CNITPTEGAIRVGDWKLVVHNIGKPREK 392 Query: 501 LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 + L+ L+ DL ++ N A N +++++++ + + P + Q K Sbjct: 393 VELFNLSDDLAEQQNRATTNAKMLRKLRNRFDQLASEAAPAKNAGPQPK 441 >UniRef50_A6DKP2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP2_9BACT Length = 446 Score = 451 bits (1161), Expect = e-125, Method: Composition-based stats. Identities = 142/536 (26%), Positives = 220/536 (41%), Gaps = 127/536 (23%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + F KPNI+++ DD+G+G + + Sbjct: 1 MKFLFSLMGFVAL----LRAADKPNIVLVFADDMGWGDVAYHG----------------- 39 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + ++ GV F GY A V GPSRA I+TGR FGV +N Sbjct: 40 ---------VEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGILTGRYQQLFGVVTNG 90 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 DA GIP ++ + EL + GY + A GKWHL Sbjct: 91 DADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQF--------------------- 129 Query: 215 WQPQNRGFDYFMGFHAAGTAYY-----------NSPSLFKNRERVPAKG--YISDQLTDE 261 P +RGFD F GFH YY ++ N++ V K Y+++++TD Sbjct: 130 --PNDRGFDTFYGFHFGAHDYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDH 187 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQ 320 A+ ++ K DQPF +Y+AYN+ H P PD+Y + + A V ++D Sbjct: 188 AVEFIEENK--DQPFFMYVAYNSVHSPWQ--VPDEYLARIPESVPAYRRLFLAMVLAMDD 243 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG---------AQKGYKSQTYPGG 371 GV RI +LK+ +NTI +FT+DNG+ G N +GYK TY GG Sbjct: 244 GVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGG 303 Query: 372 THTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P M W K++ G ++ + A D PT L AA + + G LLP+L+D+++ Sbjct: 304 IRVPFCMSWPKKIKSGNKFEAPVIAYDLAPTFLSAASLEY-STKQFSGKDLLPYLEDEQK 362 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 G PH+ L W Y VR+ D+ Sbjct: 363 GRPHETLFWHRHSGLDD-----------------------------------YAVRHGDW 387 Query: 491 SLVYTVENN---------QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 L Y + L L+ L D +K +LA + P+ +++++ + + ++ Sbjct: 388 KLTYNDQEGTSKDFLKKVHLKLFNLKQDPYEKKDLADSMPEKLQQLKQLYFNWHET 443 >UniRef50_B9XF83 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XF83_9BACT Length = 488 Score = 451 bits (1161), Expect = e-125, Method: Composition-based stats. Identities = 126/550 (22%), Positives = 202/550 (36%), Gaps = 96/550 (17%) Query: 24 AFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPK 83 V T + + + +PNII++ DDLGYG L Sbjct: 9 CRKLLLWSMVSFSGLLTLTSDAQTSTNRPPAPRRPNIILILADDLGYGDLGCYG------ 62 Query: 84 TMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR 143 Q TP + L ++G++FT+ Y V PSRA +MTG+ Sbjct: 63 --------------------QTQIKTPNIDKLAEDGMKFTSFYAGSTVCAPSRATLMTGK 102 Query: 144 APARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 + N D + E + ++ + GY T +GKW L + +P + +Y Sbjct: 103 NTGHVNIRGNAD--LSLNGEELTIAKILKLAGYATGCIGKWGLGNEGSPGLPGRQGFDEY 160 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAI 263 A ++ P + F + + +L +N Y +D T A+ Sbjct: 161 LGYLDQVQAHDYYPTHL-------FRSDSKGEESKIALTENDAD-HKGLYSNDFFTQSAL 212 Query: 264 GVVDRAK----TLDQPFMLYLAYNAPHLPND--------NPAPDQYQKQFNTGSQTADNY 311 + K + F LYL Y PH N+ P Q N Sbjct: 213 NYLRINKPSKLNKHRSFFLYLPYTLPHANNELGNRTGNGMEVPSTEPYTNEQWPQVEKNK 272 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQ 366 A + +D V I++ LKK+ +NT+++F SDNG +G G +G K Sbjct: 273 AAMITRLDHYVGEIMDYLKKSKLDENTVVIFASDNGPHKEGGVNPKYFNSAGGLRGIKRD 332 Query: 367 TYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG P + W +++ G+ D ++ DF PTA + A S P + +DG+S LP L Sbjct: 333 LYEGGIRVPFIVRWPARVKAGSISDAPLAFWDFLPTAAEIARTSSPTN--IDGISFLPTL 390 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 K Q H+ L W F V Sbjct: 391 LGKAQTNRHQYLYWEFHE-----------------------------------QGFDQAV 415 Query: 486 RNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 R D+ V N + LY L TD+ +KDN+A NP+V+ + + +++ ++ Sbjct: 416 RMGDWKAVRHGINGPIELYNLKTDVSEKDNVADKNPEVMAK----IADYLKKARTDDPRW 471 Query: 545 NQEKFNNIKK 554 + IK+ Sbjct: 472 PAKTVAEIKE 481 >UniRef50_Q7UL93 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UL93_RHOBA Length = 470 Score = 451 bits (1160), Expect = e-125, Method: Composition-based stats. Identities = 121/546 (22%), Positives = 201/546 (36%), Gaps = 106/546 (19%) Query: 19 ASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKG 78 M + ++L + +P+I+ + DD+G+ L Sbjct: 8 TLIMRTTNMNETRTIRLWVGLLLTFCWNACLVSAEAAEQPHILFIMADDMGWKDLHCQG- 66 Query: 79 SFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAA 138 TP + +L + GVRF N Y V P+RA+ Sbjct: 67 -------------------------NDVLRTPNIDALAEAGVRFDNAYAGSTVCTPTRAS 101 Query: 139 IMTGRAPARFGVYSN----------------TDAQDGIPLTETFLPELFQNHGYYTAAVG 182 +MTG APAR + + +P T + E + GY T G Sbjct: 102 LMTGLAPARLHITQHGADSKSFWPDDRLIQPPPTNHELPHETTTMAERLKAAGYTTGFFG 161 Query: 183 KWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHA-AGTAYYNSPSL 241 KWHL +++ P GFD +G G Y P Sbjct: 162 KWHLGGD-----------------------KKYWPTEHGFDVNVGGCGLGGPPTYFDPYR 198 Query: 242 FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQK 299 Y++D+L DE I + R K D+P + L PH P + P + Y+ Sbjct: 199 IPALPPRKEGEYLTDRLADETIAFMRREK--DKPMFVCLWTYNPHYPFEAPEDLIEHYKG 256 Query: 300 QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGA 359 + TG Y + + D+GV R+L +L G D T+++FTSDNG N Sbjct: 257 KEGTG-LKNPIYGGQIEATDRGVGRVLRELDSLGIADETLVVFTSDNGGW--SGATDNRP 313 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDG 418 + K + GG P+ + W G + ++ + +MD T LDAA +S+ LDG Sbjct: 314 LREGKGFLFEGGLRVPLIVRWPGVTEAATVNETPVVSMDLTATILDAAGVSLANGESLDG 373 Query: 419 VSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL 478 SL P K L + + + + Sbjct: 374 ESLRPLFSGGK--LERDALYFHYPHFAFHKD----------------------------- 402 Query: 479 SQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 ++ +R+ Y L+ +++ + LY L DL + +LAA +P V +E++G + E+++++ Sbjct: 403 NRPGSVIRSGQYKLILRHDDDSVELYDLQNDLSETSDLAAVHPDVAQELKGRLMEWLEAT 462 Query: 538 QPPLSE 543 + E Sbjct: 463 GAGMPE 468 >UniRef50_D2QTW6 Sulfatase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTW6_9SPHI Length = 486 Score = 451 bits (1160), Expect = e-125, Method: Composition-based stats. Identities = 115/556 (20%), Positives = 197/556 (35%), Gaps = 102/556 (18%) Query: 28 HAADDVKLKATKTNVAFSDFTPTE----YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPK 83 +L + + + + + PN+++ MDDLGYG L Sbjct: 4 IPIRLSRLVLSAITLVGLGLSISAWVEKPAPATPPNVVLFFMDDLGYGDLSVTG------ 57 Query: 84 TMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR 143 A +TP L + EG RFTN A V SRAA++TG Sbjct: 58 --------------------ALDYTTPNLDKMAAEGTRFTNFLAAQAVCSASRAALLTGC 97 Query: 144 APARFGVYS--NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 P R G+Y ++ G+ E L EL + GY T GKWHL Sbjct: 98 YPNRLGLYGALGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGDN------------ 145 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP----------SLFKNRERVP-- 249 +++ P +GFD + G + + P E P Sbjct: 146 -----------KQFLPMQQGFDEYYGVPYSHDMWPLHPAQAQAKYPPLRWIDGNEPGPEI 194 Query: 250 ----AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS 305 G I+ +T++A+ + K +PF LY+ + PH+P A G Sbjct: 195 KDLNDAGKITGTITEKAVSFIRNHKK--KPFFLYVPHPLPHVPLATSA-------RFKGQ 245 Query: 306 QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGY 363 + + +D V +I+ +LK+ G NT+++F SDNG + +G + Sbjct: 246 SARGIFGDVLTELDWSVGQIMNELKQQGLDKNTLVIFISDNGPWLNYGDHAGSSGGFREG 305 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K ++ GG P + W G + G +KL++A+D PT + +PK +DGV + Sbjct: 306 KGTSFEGGHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCGARLPKQR-IDGVDWV 364 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 L+ P + + + + R P S + Sbjct: 365 ALLKGDNSVTPRDKFYYYYRKNSLEAVRQGDWKLVFAHPGRTYEGFLPGQGGKPGPSTET 424 Query: 483 YTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 + + LY L D ++ ++ +P+VV ++ + E Sbjct: 425 HAIAAG--------------LYDLRRDPGERYDVREQHPEVVARLETIAEEARADLGD-- 468 Query: 542 SEVNQEKFNNIKKALS 557 E+ + N+++ Sbjct: 469 -ELQKRTGANVREPGR 483 >UniRef50_C1ZCL4 Arylsulfatase A family protein n=2 Tax=Bacteria RepID=C1ZCL4_PLALI Length = 470 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. Identities = 135/541 (24%), Positives = 210/541 (38%), Gaps = 121/541 (22%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 A + F KPN++++ +DDLG + + SF Sbjct: 8 ALLLFMGAPFFPVEAKEMADKPNVLLIFIDDLGKTDIGIEGSSF---------------- 51 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--T 154 TP + +L G RFT Y AH V P+RAA+MTG+ P R G+ Sbjct: 52 ----------YETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWIRP 101 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 ++ +P +E + + FQ GY+TA +GKWHL + Sbjct: 102 ESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLGH-----------------------KPQ 138 Query: 215 WQPQNRGFDYFMGFHAAGTA--YYNS---------PSLFKNRERVPAKGYISDQLTDEAI 263 P RGFD+ G + G YY P+ + E+ + Y++D LT AI Sbjct: 139 QHPAARGFDWTKGVNHGGQPSSYYFPYKNPQKPDAPNNVPDFEKCQPEDYLTDVLTSSAI 198 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQT-------------- 307 + + + +PF L LA+ A H P P ++YQ + T Sbjct: 199 EHL-QQRDRTRPFFLCLAHYAVHTPIQPPKNLVEKYQVKLATQKNPKSPGEGIQEGSAIS 257 Query: 308 -----ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-----N 357 Y A V ++D V R+L++LK G D TI++FTSDNG + N Sbjct: 258 RSQQDHPAYAAMVENLDTQVGRLLDELKTQGILDQTIVVFTSDNGGLCTLNGKSPGPTCN 317 Query: 358 GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLD 417 + K TY GG P ++ W GK+ P D D YPT L I +D Sbjct: 318 LPLRAGKGWTYEGGIRIPTYISWPGKISPQVLDIPAYTCDIYPTLLSLCQIPPRPTQHVD 377 Query: 418 GVSLLPWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTE 476 G+SL L E + L W ++H Sbjct: 378 GISLAGLLTKSSSLPESERTLVWYYPHTHGSG---------------------------- 409 Query: 477 DLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFID 535 + S +R + L++ +E +++ LY L D + NLA+ +P+ ++Q +++ I+ Sbjct: 410 --HKPSAAIRQGPWKLIHFLETDRIELYHLEDDPGESRNLASKHPERALQLQKELQKIIE 467 Query: 536 S 536 S Sbjct: 468 S 468 >UniRef50_Q15XG7 Sulfatase n=2 Tax=Bacteria RepID=Q15XG7_PSEA6 Length = 471 Score = 450 bits (1158), Expect = e-125, Method: Composition-based stats. Identities = 136/532 (25%), Positives = 200/532 (37%), Gaps = 122/532 (22%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 T S +PNI+ L DD GY F Sbjct: 8 VVAALSISVACTSLSYAKQPNIVFLFSDDAGYADFGFQGS-------------------- 47 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG-----VYSN 153 TP L L EGVRFT GYV+ GPSRA IMTGR +FG V Sbjct: 48 ------ETMKTPNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGY 101 Query: 154 T-------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 A+ GIPL E + + ++ GY TA GKWHL Sbjct: 102 MSEHSAIKGAEMGIPLDEVTMGDYMKSLGYRTAFYGKWHLGGT----------------- 144 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYY--------------NSPSLFKN-RERVPAK 251 +E P +RGFD F GF +Y+ L + + Sbjct: 145 ------DELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHE 198 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 GY++D L ++A ++ K D+PF ++L++NA H P + D +F Sbjct: 199 GYLTDVLAEKANQFIE--KAPDKPFFIFLSFNAVHTPMEATPED--LAKFPQLKGKRKEV 254 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGG 371 A ++D+ +L +LK+ G D+T+++F++DNG D N G KS GG Sbjct: 255 AAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPTDKNASSNYPLAGTKSNFLEGG 314 Query: 372 THTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P + W KL G DK +S +D PT A +LDGV L+P++ + Sbjct: 315 IRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKAGGGEEVMS-ELDGVDLMPYITGQNN 373 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 PH+++ W + +R D+ Sbjct: 374 KAPHESMYW--------------------------------------KKETRAAIRQGDW 395 Query: 491 SLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 L+ + LY L D+ ++ NLAA P+ VK+M + + + PL Sbjct: 396 KLLR-FPDRPAELYNLANDIGEQHNLAAQEPERVKQMYKDFFSWEMTLERPL 446 >UniRef50_A6DKP3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKP3_9BACT Length = 465 Score = 450 bits (1157), Expect = e-125, Method: Composition-based stats. Identities = 136/534 (25%), Positives = 212/534 (39%), Gaps = 116/534 (21%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 ++ S KPNIIV+ DDLGYG + + + Sbjct: 1 MRNLALQFLCFVLASLSASAA-KPNIIVILADDLGYGDVSYHGTLKETT----------- 48 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP + S+ G F NGY A V GPSRA +++GR RFG Y N Sbjct: 49 --------------TPHIDSIAQSGAWFQNGYSAAPVCGPSRAGLLSGRYQQRFGYYDNI 94 Query: 155 DAQD-------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 G+PL++ +PE+ GY T VGKWH Sbjct: 95 GPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVGKWHDGDQ------------------ 136 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYY-----------NSPSLFKNRERVPAKGYISD 256 ++ P NRGF F GF+ + +N+ + Y+++ Sbjct: 137 -----HKFWPYNRGFQEFYGFNNGAINNWVLKGENHTVDEWGAVHRENKRVENSGEYMTE 191 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT-GSQTADNYYASV 315 EA+ +DR KT +PF LYL++NA H P AP Y QF + A + Sbjct: 192 AFGREAVEFIDRHKT--EPFFLYLSFNAVHGPLQ--APKSYTNQFKHIKPENRALCLAML 247 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 S+D + +LE+L+K G +NTII FTSDNG + G NG +G K+ + GG H P Sbjct: 248 KSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLKGNYSFNGKYRGEKNTVFDGGLHVP 307 Query: 376 MFMWWKGKLQPGN--YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 + WK ++ + + ++D T AA + I + KLDG +LLP+L+++ Sbjct: 308 YAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAGVEIKDEYKLDGRNLLPYLKNQSD-FD 366 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 +NL W + + +R+N + + Sbjct: 367 DRNLYW--------------------------------------ANNANIAIRDNKWKYL 388 Query: 494 YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + + L+ L D + +NL + P+ ++MQ + ++ P L N Sbjct: 389 K--QAGKTYLFNLEEDPYESNNLVSQYPEKAQDMQKRHDAWQANNAPQLFGWNP 440 >UniRef50_UPI00016C5053 Arylsulfatase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5053 Length = 467 Score = 450 bits (1157), Expect = e-125, Method: Composition-based stats. Identities = 128/554 (23%), Positives = 203/554 (36%), Gaps = 128/554 (23%) Query: 35 LKATKTNVAFSDFTPTEYST-KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 + +A + P+ + KPNI+++ DDLG +L Sbjct: 2 FRTAAVFLAVALLAPSGRAADAPKPNIVLIVADDLGCFELGCYG---------------- 45 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 + TP + L G +FT Y V PSR +MTG+ V +N Sbjct: 46 ----------QTKIKTPHIDKLAQGGAKFTRFYSGSPVCAPSRCVLMTGKHSGHATVRNN 95 Query: 154 ----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + Q I + + + + HGY T A+GKW L Sbjct: 96 VEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMFDTAGS--------------- 140 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPAKG--------YISDQLTD 260 P GFD F G++ A+ + P +++N +RV KG + D + Sbjct: 141 -------PLKHGFDLFFGYNCQRHAHSHYPTYIYRNDKRVELKGNDGKTGKQFTQDLFEE 193 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQ------------FNTGSQ 306 EA+G ++ K +PF LYL + PH+ P ++Y+ Q + Sbjct: 194 EALGFIEANKA--KPFFLYLPFTVPHVAVQVPEDSLNEYKGQLGDDPAYDGKKGYQPHPA 251 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL------PLNGAQ 360 Y A V +D+ V R++E+L G NT++LFTSDNG + G Sbjct: 252 PHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNGPTHNVGGADSSFFNSAGKL 311 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGV 419 +G K Y GG P + G ++ G D + D PT A P +DG+ Sbjct: 312 RGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGTKAPS--AIDGI 369 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 S LP L+ +KQ H L W S Sbjct: 370 SFLPLLKGEKQPT-HDFLYWEFS-----------------------------------GY 393 Query: 480 QFSYTVRNNDYSLVYT---VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR-EFI 534 V ++ V + + LY L D +K+++AA NP V+ ++ ++ E Sbjct: 394 GGQQAVIEGEWKAVRQALGMGGVKTELYNLAKDPSEKEDVAAKNPAVLARLEKRLKNEHT 453 Query: 535 DSSQPPLSEVNQEK 548 +S PL ++ +K Sbjct: 454 PNSNFPLQTIDPKK 467 >UniRef50_A6C4V9 Sulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4V9_9PLAN Length = 480 Score = 449 bits (1156), Expect = e-124, Method: Composition-based stats. Identities = 134/542 (24%), Positives = 208/542 (38%), Gaps = 120/542 (22%) Query: 34 KLKATKTNVAFSDFTPTEYSTK------GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 L T FS F + +PN+IV+ +DD+GY + + Sbjct: 7 LLTGMMTTAVFSMFCLVNLADAAERPPGDRPNLIVIMVDDMGYAGVSCFGNPY------- 59 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 TP + L EG++FT+ + + V P+RA ++TGR R Sbjct: 60 -------------------FKTPEIDRLAAEGMKFTDFHSSGTVCSPTRAGLLTGRYQQR 100 Query: 148 FGVY-------SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 G+ + + Q G+ +E EL + GY TA +GKWH N Sbjct: 101 AGIEAVIHPVSDHPEHQKGLRKSENTFAELLKQAGYRTALIGKWHQGYPHNSA------- 153 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYI 254 E+ P N GFD F+G+H+ + + + R+ GY Sbjct: 154 -------------EFHPDNHGFDTFVGYHSGNIDFISHVGDHVKHDWWHGRKETQETGYS 200 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ-------- 306 + + A+ + ++ +QPF LYLA+ A H P P + + + Sbjct: 201 THLINQYALQFIKESR--NQPFCLYLAHEAIHNPVQVPGDPIRRTEAAGWKRWKPASEAE 258 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQ 366 + + VD GV +I E L K+G NT +LF SDNG D P +G K Sbjct: 259 RIEKFRGMTLPVDAGVGQIREFLVKSGLDKNTFVLFFSDNGPSRDFPSGSPK-WRGAKGS 317 Query: 367 TYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG P WW GK+Q G D ++D PT L A I +PK+ LDGV L P L Sbjct: 318 VYEGGHRVPAIAWWPGKIQAGTETDVPAISLDVMPTLLGIAHIDMPKERPLDGVDLSPVL 377 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 +++ + L W + ++ E + Sbjct: 378 F-EQKPLSERPLFWASLSNNGSRSE---------------------------------AM 403 Query: 486 RNNDYSLVY--------TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 R + LV T EN ++ LY+L D + +NL+ A PQ M ++++ Sbjct: 404 RAGPWKLVVQHPRAKPGTFENEKVELYRLDQDPGEANNLSKAEPQRASRMLKQLKDWYQD 463 Query: 537 SQ 538 +Q Sbjct: 464 TQ 465 >UniRef50_Q7URY7 Aryl-sulphate sulphohydrolase n=1 Tax=Rhodopirellula baltica RepID=Q7URY7_RHOBA Length = 490 Score = 449 bits (1156), Expect = e-124, Method: Composition-based stats. Identities = 127/549 (23%), Positives = 204/549 (37%), Gaps = 102/549 (18%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 L A V + E + PN++ + +DD G+ F F Sbjct: 10 ALPAFLFAVVLVSTSTAETPSTEHPNVLFIYLDDYGWRDATFMGSDF------------- 56 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP L +L + G+ F+N Y P+RA++++G+ R +Y+ Sbjct: 57 -------------YETPNLDALAERGMVFSNAYSCAANCAPARASLLSGQYSPRHEIYNV 103 Query: 154 TDAQDG---------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 + G + ++ GY T +GKWHLS Sbjct: 104 GTERRGNPKHGTLQHIPGTETLSSDIQTWAHQVRDAGYRTGIIGKWHLSDD--------- 154 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA---YYNSPSLFKNRERVPAKGYIS 255 P GFD + +G+ Y+ + Y++ Sbjct: 155 ------------------PLPYGFDINVAGTHSGSPPKGYFPPHPKVPGLQDTSDDEYLT 196 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTADNYYA 313 D+LTDEAIG ++ + + LYL++ A H P +Y+ + A Sbjct: 197 DRLTDEAIGFIEANQEWS--WFLYLSHFAVHTPLQAKPDLVAKYKAKQPGTLHDHAVMAA 254 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTH 373 + SVD+GV R++E L++ G +NT I+FTSDNG GP +GYK Y GG Sbjct: 255 MIESVDEGVGRMVETLRELGLEENTAIVFTSDNGGF--GPATSMKPLRGYKGTYYEGGIR 312 Query: 374 TPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P F+ W G + G D + A D YPT ++ +P D LDGVSL+P L+ ++ Sbjct: 313 EPFFVTWPGVVDAGTKSDVPVIAADLYPTFIEMTGAKLPADQPLDGVSLMPLLK-QEGSL 371 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 + L W D ++ + +R+ + L Sbjct: 372 ADRELYWHFPAYLQSYSVTDGQRDLLYRS------------------RPCGIIRDGRWKL 413 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 E+ L LY L TD + +NLA ANP + + + + + + + Sbjct: 414 HEYFEDGGLELYDLVTDPGESNNLADANPIKTQALHSKLVAWRERIGASMPTEPNPNHD- 472 Query: 552 IKKALSEAK 560 SEAK Sbjct: 473 ---PASEAK 478 >UniRef50_A6LDP6 Arylsulfatase A n=4 Tax=Bacteroidales RepID=A6LDP6_PARD8 Length = 452 Score = 449 bits (1155), Expect = e-124, Method: Composition-based stats. Identities = 129/520 (24%), Positives = 199/520 (38%), Gaps = 131/520 (25%) Query: 53 STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTL 112 S KPNIIV+ DD+GYG L TP + Sbjct: 21 SQPTKPNIIVINCDDMGYGDLSCFGS--------------------------PTIKTPNI 54 Query: 113 LSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS------NTDAQDGIPLTETF 166 + EG ++++ YV+ VS PSRA ++TGR R G+Y D++ G+P E Sbjct: 55 DRMAIEGQKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELT 114 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 + EL + GY+TA +GKWHL + E+ P GFDYF Sbjct: 115 IAELLKQAGYHTACIGKWHLGHL-----------------------PEYMPLRHGFDYFY 151 Query: 227 GFHA------------AGTAYYNSPSLFKNRERV--PAKGY-ISDQLTDEAIGVVDRAKT 271 G+ T Y +++ + + + Y ++ Q+T+ AI + + Sbjct: 152 GYPYSNDMSRKEQIKLGNTKYPYEYIIYEQEKELEREPQQYNLTQQVTEAAIRYIKSNEN 211 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 PF LYLA+ PH+P Y G Y +V +D V +IL+ LK Sbjct: 212 S--PFFLYLAHPMPHMPV-------YASTDFQGKSARGKYGDTVEELDWSVGQILQTLKS 262 Query: 332 NGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389 G NT+++FTSDNG + G K K+ + GG P M W ++PG Sbjct: 263 EGLDKNTLVIFTSDNGPWLLCKQEGGSPGPLKDGKASMFEGGFRVPCIM-WGAMVKPGYI 321 Query: 390 DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDE 449 + S +D PT + A I +P D DG+SLL L+DK + + S + Sbjct: 322 TDMASTLDLLPTFCEIAGIPLPSDRHYDGISLLNVLKDKSTCKRDVFYFYRGSELY---- 377 Query: 450 ENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN----------N 499 +R Y ++ + Sbjct: 378 ----------------------------------AIRKGKYKAHFSYRPAYGTTDKIIYD 403 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + LY L TD + N+A +P +V+E+ + S + Sbjct: 404 KPVLYDLGTDPGELYNIAEEHPDIVQELTMLANAHKASLK 443 >UniRef50_Q01N83 Sulfatase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01N83_SOLUE Length = 461 Score = 448 bits (1154), Expect = e-124, Method: Composition-based stats. Identities = 121/494 (24%), Positives = 184/494 (37%), Gaps = 91/494 (18%) Query: 54 TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLL 113 + +PNI+V+ DDLGYG L +TP + Sbjct: 24 QQRQPNIVVILADDLGYGDLGCYGSP---------------------------IATPNID 56 Query: 114 SLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-GIPLTETFLPELFQ 172 L +EG RFT+ Y A V PSRAA+MTGR P R V D G+P +E + ++ + Sbjct: 57 RLAEEGARFTSFYSASPVCSPSRAALMTGRYPTRVEVPVVLGPGDAGLPDSEITMAQVLK 116 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 + GY T+ +GKWH+ + P NRGFD F G + Sbjct: 117 SAGYRTSCIGKWHIGST-----------------------PGYLPTNRGFDEFFGVPYSA 153 Query: 233 TAYYNSPSLFKNRERVPAK---GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN 289 L + V ++ T EA+ + RA+ D PF LYLA+ APHLP Sbjct: 154 DI--TPCPLMRGSSVVAPAVDCSTLTSSFTQEALDFMRRAQ--DNPFFLYLAHTAPHLPL 209 Query: 290 DNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 G Y V +D +++ LK G NT+++F+SDNG Sbjct: 210 AASP-------RFAGQSGLGMYADVVQELDWSTGQVMAALKATGLDSNTLVMFSSDNGPW 262 Query: 350 IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADI 408 G +G K +TY GG P + G + G L + MD PT A Sbjct: 263 YQ---GSQGKLRGRKGETYEGGMREPFLARYPGVIPSGIGCAGLATTMDLLPTLARLAGA 319 Query: 409 SIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD 468 P + LDGV + P L ++ + + + S Sbjct: 320 QTPSN-PLDGVDIWPVLTGERAEVDRDVFLYFDAVYLQCARLGRWKLHLSRYNTKAWSPL 378 Query: 469 YPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEMQ 527 P + + LY ++D Q+ + AA++P +V +++ Sbjct: 379 PPGGRV--------------------NLPLPRPELYDVVSDPQESYDCAASHPAIVADIR 418 Query: 528 GVVREFIDSSQPPL 541 V + + P + Sbjct: 419 ARVERMVQTFPPGI 432 >UniRef50_C9KTV0 Arylsulfatase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTV0_9BACE Length = 459 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 127/542 (23%), Positives = 204/542 (37%), Gaps = 115/542 (21%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 A + F+P E + +PN +++ DD+GYG + + Sbjct: 9 VAATCALAAFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEY----------------- 51 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV------- 150 TP + + EG+ FT+ + VS P+R ++TGR R G+ Sbjct: 52 ---------IKTPNIDQIAREGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVP 102 Query: 151 YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 + D + G+P E ++ ++GY TA +GKWHL + Sbjct: 103 RDDKDKEVGLPSEEITFAKILGDNGYRTALIGKWHLGYLQK------------------- 143 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAY------YNSPSLFKNRERVPAKGYISDQLTDEAIG 264 P N GF F+GF + Y Y + E GY + LT + Sbjct: 144 ----HHPMNFGFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLEMKDMSGYTTTLLTTLSED 199 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQ------KQFNTGSQTADNYYASVYSV 318 + K D+PF LY+A+ APH P P + N+ + Y V + Sbjct: 200 YIKENK--DKPFCLYIAHAAPHSPMQGPDEKAVRTEATPEGDKNSDRSNKEIYKDMVEEL 257 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 D V RILE LKK +NT ++F SDNG VI+ G KG K + GG P Sbjct: 258 DWSVGRILETLKKYKLDENTFVVFFSDNGPVIN-NGGSAGGYKGAKGSPWEGGHRVPGIC 316 Query: 379 WWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKD-LKLDGVSLLPWLQDKKQGEPHKN 436 + G ++ G ++ + + D +PT LD ADI KLDG SL+P + + + Sbjct: 317 YMPGTIKEGTTCEQTVMSFDLFPTMLDMADIHYDDSKKKLDGTSLVPLFKGENLAP--RL 374 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W + +VR+ + LV Sbjct: 375 LFW-------------------------------------GNGNKTISVRDGKWKLVRYN 397 Query: 497 ENNQL--GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 + + L+ L D +K+NL+ P++V+ + + + +S + + K Sbjct: 398 QKGGITLHLFDLNNDPYEKNNLSKQEPELVERLDKEITRWAESVYSEVPDQFARKVQRTN 457 Query: 554 KA 555 K Sbjct: 458 KK 459 >UniRef50_A6C4W8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C4W8_9PLAN Length = 459 Score = 447 bits (1150), Expect = e-124, Method: Composition-based stats. Identities = 120/530 (22%), Positives = 189/530 (35%), Gaps = 106/530 (20%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 + + +PNII + DDLGYG L Sbjct: 9 WLIVCLVCLPASMQAAEGERPNIIFIMADDLGYGDLGCYG-------------------- 48 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA- 156 TP + +G RFT Y V SRA ++TG N Sbjct: 49 ------QKLMKTPHIDQFAAQGTRFTQAYAGGSVCTASRAVLLTGLHNGHTPARDNIPHY 102 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + ++ + E+ Q GY VGKW L V Sbjct: 103 ATYLQESDVTIAEVLQKSGYRCGGVGKWSLGDAGTVGRA--------------------- 141 Query: 217 PQNRGFDYFMGFHAAGTA-YYNSPSLFKNRERVPAKG-------YISDQLTDEAIGVVDR 268 N+GFD + G+ A YY + L N R+ KG Y D LT+ A+ + Sbjct: 142 -TNQGFDMWFGYLNQDHAHYYFTEYLDDNEGRLELKGNTKNRQQYSHDLLTERALQFIRD 200 Query: 269 AKTLDQPFMLYLAYNAPH------LPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 + QPF LY AY PH P+ PD + Y A ++ +D+ V Sbjct: 201 SAA--QPFFLYAAYTLPHFSAKAEDPHGLAVPDTEPYSDRDWDIKSKKYAAMIHRLDRDV 258 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPL--PLNGAQKGYKSQTYPGGTHTPMFMWW 380 RI+ + + + T+I+FTSDNG P NG +G+K GG P W Sbjct: 259 GRIMSLVNELQLRERTLIIFTSDNGGHRGVPAQLHTNGPLRGFKRDLTEGGIRVPFIANW 318 Query: 381 KGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G + G D++I+ D PT + A + + LDG+S+LP L+ + + H+ L W Sbjct: 319 PGTIPAGKVSDEVIAFQDMLPTFAELAGAQVSAN--LDGISVLPALRGEPRKVKHEYLYW 376 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 + ++ VR N++ + + Sbjct: 377 DYGHCRA---------------------------------RYDQAVRWNNWKGIRHGQQG 403 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 ++ LY L DL + ++A +PQVV+ + ++ + P + + Sbjct: 404 EIALYNLDQDLSESRDVADKHPQVVQRIAEIMNT--AAVPNPRYPIGTKY 451 >UniRef50_Q7UYD6 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYD6_RHOBA Length = 889 Score = 446 bits (1149), Expect = e-124, Method: Composition-based stats. Identities = 120/559 (21%), Positives = 207/559 (37%), Gaps = 125/559 (22%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 S +PN++ + DDLG+ + Sbjct: 255 PAATPNASASKRPNVLFILADDLGWSDTTLFGTT-------------------------K 289 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY-------------- 151 TP + L G+ FT Y + + P+RA+++TG +PAR G+ Sbjct: 290 LYQTPNIERLAKRGMTFTRAYSSSPLCSPTRASVLTGLSPARHGITSPTCHLPKVVLEPK 349 Query: 152 -----------SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 + ++ + L E+F+++GY T GKWHL Sbjct: 350 VSETGPPNKFSTVPESVTRLDTKYYTLAEMFRDNGYATGHFGKWHLG------------- 396 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA--YYNSPSLFK--NRERVPAKGYISD 256 E + P GFD + H Y +P FK + + V ++ D Sbjct: 397 -----------PEPYSPLEHGFDVDVPHHPGPGPAGSYVAPWKFKDFDHDPVIPDEHLED 445 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTA-DNYYA 313 ++ EA+ +++ ++PF L + H P D ++Y+ + + Y A Sbjct: 446 RMAKEAVRFLEQH--TNEPFFLNYWMFSVHAPFDAKKELIEEYRDRVDPKDPQRCPTYAA 503 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQTY 368 + S+D + +L+ L + G D TII+F SDNG + N +G K+ Y Sbjct: 504 MIESMDDAIGTLLDTLDRLGIADETIIVFASDNGGNMYNEVDGTTATSNAPLRGGKATMY 563 Query: 369 PGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 GG P + G ++ G D +I ++DFYPT L+ I + + DGVS++P LQ Sbjct: 564 EGGVRGPAIVVQPGVVESGSRSDAIIQSIDFYPTLLEMLAIDAQPNQRFDGVSIVPALQG 623 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 K + +PH+P + S +V Sbjct: 624 K--PLQRDAIF----------------------------TYFPHDPPVPNWMPPSVSVHQ 653 Query: 488 NDYSLVYTVENNQ-----LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 D+ L+ L+ L DL ++ NLAA +P V++M ++ + + ++ Sbjct: 654 GDWKLIRIFHGGPNGSHRYKLFNLKNDLGERINLAAKHPDRVQQMDKLIGQHLVETKAVR 713 Query: 542 SEVNQEKFNNIKKALSEAK 560 VN+ A +E K Sbjct: 714 PLVNKNFDPAKYNAGAEGK 732 >UniRef50_A6CD52 Twin-arginine translocation pathway signal n=2 Tax=Bacteria RepID=A6CD52_9PLAN Length = 460 Score = 446 bits (1148), Expect = e-124, Method: Composition-based stats. Identities = 122/540 (22%), Positives = 198/540 (36%), Gaps = 114/540 (21%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 + + AF + +PNI+++ DD G + Sbjct: 8 SILMFLSLFAFCS----QLQAAERPNILIIFTDDQGINDVGCYGS--------------- 48 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG---- 149 + TP + L EG+ F Y A + PSR I+TGR P R Sbjct: 49 ------------EIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNPTRSQDQLL 96 Query: 150 ----VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 S+ D GI ET + ++ Q +GY TA +GKWHL Sbjct: 97 GALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGH----------------- 139 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY-----NSPSLFKNRERVPAKGYISDQLTD 260 E + P GFD F G Y+ N P + N+ V GY +D +T+ Sbjct: 140 -----GTESFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSENGYATDLITE 194 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPH-----LPNDNPAPDQYQKQFNTGSQ-------TA 308 EA + +T D+PF L+L+YNAPH P D + Q + + + Sbjct: 195 EAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTIKDKVR 254 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTY 368 + A S+D G+ R++ LK NG NT+++F +D+G N +G K+ + Sbjct: 255 REFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGGDYVYG-GNNQPFRGAKATLF 313 Query: 369 PGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 GG P + W GK++ G +++ A+D +PT A++ L LDG + L Sbjct: 314 EGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFANVDT-DGLTLDGKDISGLLT- 371 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 ++ + L W P+ E +R Sbjct: 372 RQTPVGTRELYWQLG------------------------------PHAELKRGRWSALRQ 401 Query: 488 NDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 D+ + + L+ L D +K NL + + E+Q + + P + + Sbjct: 402 GDWKYIQDAGGEEF-LFDLKADPYEKQNLTQSQSTKLTELQERRDTLVKTLTPQVKSIAP 460 >UniRef50_UPI0001968C90 hypothetical protein BACCELL_02360 n=1 Tax=Bacteroides cellulosilyticus DSM 14838 RepID=UPI0001968C90 Length = 525 Score = 446 bits (1148), Expect = e-123, Method: Composition-based stats. Identities = 121/526 (23%), Positives = 199/526 (37%), Gaps = 97/526 (18%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 T + KPN + + MDD+GY + + Sbjct: 64 SCTEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWT----------------------- 100 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--TDAQDGIPLT 163 TP + +L EG++FT+ Y A +S PSRA +TGR PAR G+ D+ G+ Sbjct: 101 ---TPNIDALAAEGIKFTDCYAASPISSPSRAGFLTGRYPARMGIQGVFYPDSYTGMAPE 157 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E + E+ + GY TA +GKWHL S E++ P +GFD Sbjct: 158 EVTMAEVLKVQGYATACIGKWHLG-----------------------SREKYLPLQQGFD 194 Query: 224 YFMGFHAAGTAYYNSPSLFKNRERVP---AKGYISDQLTDEAIGVVDRAKTLDQPFMLYL 280 + G + ++ + E ++ + T+EA+ + R DQPF L+L Sbjct: 195 EYFGIPYSNDM--SAQVYLRGNEVEEFHIDINNVTKKYTEEAVDYIRR--KADQPFFLFL 250 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 A++ H+P Y G A Y +V VD V RI+E L++ G DNT++ Sbjct: 251 AHSMMHVPI-------YVSDEFAGKSGAGIYGDAVLEVDWSVGRIMETLRELGLDDNTLV 303 Query: 341 LFTSDNGAVIDGP--LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDF 398 +FTSDNG + + K+ + GG P +WKG+++P ++S +D+ Sbjct: 304 VFTSDNGPWLQEGPLGGRALPLREGKTTAFEGGVRVPCIAYWKGQIKPVVNTDVVSLLDW 363 Query: 399 YPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 +PT + +P D++LDG L L + + + Sbjct: 364 FPTVTALSGGILP-DVRLDGYDLTAVLNGTGKRASEDYAYFRNNRDITDYRSGDWKISLP 422 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAA 517 ++ + L+ L D+ ++ NL Sbjct: 423 APGIKGNFWRASTAEHDTL-------------------------LFNLREDIGERYNLYR 457 Query: 518 ANPQVVKEMQGVVREFID---SSQPPLSEVNQEKFNNIKKALSEAK 560 P KEM ++E+ P L + ++K EAK Sbjct: 458 KYPGKAKEMLQKLQEYTRNFGEIPPGLVMTGNDASKYLRKQRQEAK 503 >UniRef50_Q7UTH7 Arylsulfatase A n=2 Tax=Bacteria RepID=Q7UTH7_RHOBA Length = 496 Score = 446 bits (1147), Expect = e-123, Method: Composition-based stats. Identities = 126/552 (22%), Positives = 212/552 (38%), Gaps = 102/552 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKG---------KPNIIVLTMDDLGYGQLPFDKGSFDPK 83 + +KA + + F + PNII++ DD GYG L Sbjct: 1 MPIKAILSVLLFLLVPCSGLRAADNGDDVDQVSPPNIILVMTDDQGYGDLGCHG------ 54 Query: 84 TMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGR 143 TP L L E RF + +V P+R+A+M+GR Sbjct: 55 --------------------HPFLKTPNLDRLHSESTRFNDFHV-SPTCAPTRSALMSGR 93 Query: 144 APARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 AP + GV +D + LT T + E+ ++ GY T GKWHL Sbjct: 94 APFKNGVTHTILERDRMALTSTTIAEVLKSAGYTTGIFGKWHLGD--------------- 138 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-------------YNSPSLFKNRERVPA 250 + +QP RGFD A G Y +P + N V Sbjct: 139 --------EDAYQPDRRGFDETFIHGAGGIGQNFAGSQSDAPGTSYFNPIIKHNGTFVQT 190 Query: 251 KGYISDQLTDEAIGVVDRAKTLD-QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTA 308 +GY +D +A+G + D +PF Y+ NAPH P +Y +F S Sbjct: 191 EGYCTDVFFQQALGWIRLQTKSDTKPFFAYIPTNAPHAPY--KVEKRYSDRFRDKCSSPQ 248 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTY 368 + + ++D + +++ +L + DNT+++F +DNG+ + N KG K Sbjct: 249 SEFLGMIVNIDDNMGKLMGKLDEWDLADNTLLIFMTDNGSAKGSKI-YNAGMKGGKGTVN 307 Query: 369 PGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQD 427 GG+ P+FM G G + + +D +PT + A IP + LDG SL+ +++ Sbjct: 308 EGGSRVPLFMRLPGFTNSGVDIETMTRHVDLFPTLAEIAHAEIPAEADLDGRSLVSLIKN 367 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 + H+ + + + +PN + +Y VR+ Sbjct: 368 PQLDWDHRFQFFHSG---------------RWAKAGLKGKFGKGDPNPDHSKHKNYAVRD 412 Query: 488 NDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + LV LY L D + ++A ++P+VV M E+ D +P + +N+ Sbjct: 413 EKWRLV------NGELYDLENDPGETADVAGSHPEVVSRMLVAFDEWWDEVRPLM--INE 464 Query: 547 EKFNNIKKALSE 558 + ++ K + Sbjct: 465 DAPLDVGKPFRD 476 >UniRef50_Q7UZ43 N-acetylgalactosamine-4-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UZ43_RHOBA Length = 608 Score = 445 bits (1146), Expect = e-123, Method: Composition-based stats. Identities = 114/539 (21%), Positives = 190/539 (35%), Gaps = 99/539 (18%) Query: 31 DDVKLKATKTNVAFSDFTPT-EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 L + +PN++++ DD GYG F Sbjct: 4 WLFCLAFVFAGSLWEGLPLAHSVRAADRPNVVMVITDDQGYGDCGFTG------------ 51 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 TP + +L E T+ +VA P+R+A+MTG R G Sbjct: 52 --------------NKVVQTPNIDALAAESSVLTDYHVA-PTCSPTRSALMTGHWTNRTG 96 Query: 150 VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 V+ + + E E+F + GY T GKWHL Sbjct: 97 VWHTISGRSMLRDNEVTFGEIFSDAGYQTGMFGKWHLGDNY------------------- 137 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAY--------YNSPSLFKNRERVPAKGYISDQLTDE 261 ++ ++ GF G Y S F N + V A+G+ +D E Sbjct: 138 ----PYRAEDNGFTEVYRHGGGGVGQTPDFWDNAYFDGSYFHNGKAVKAEGFCTDVFFKE 193 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQG 321 + D+PF Y+A NAPH P AP +Y + + ++ + +VD Sbjct: 194 GNRFIRECVEADEPFFAYIATNAPHGPLH--APQKYIDMYPEMNDNVATFFGMITNVDDN 251 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 V + + L++ G +DNTI +FT+DNG G N +G K Y GG P M + Sbjct: 252 VGQTRKLLRELGVHDNTIFIFTTDNG-TAGGASVYNAGMRGKKGSPYEGGHRVPFVMHYP 310 Query: 382 --GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G + + L A+D PT LD + P+ +K DG S++ L+D+ + + Sbjct: 311 EGGFAKSRTNNTLCHAVDVVPTLLDMCGVEAPESVKFDGTSIVSLLKDEVDSSFNDRMLI 370 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 S + + +V + + L+ N Sbjct: 371 TDSQR-----------------------------VIDPIKWRQSSVMQDKWRLI-----N 396 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALS 557 LY + D Q++N+A +P+ V M+ + +P S+ + + + Sbjct: 397 GKELYNIANDPGQENNIAGDHPEQVASMRAFYEAWWAELEPTFSQTTEMTVGHPDHPVV 455 >UniRef50_Q15XH3 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XH3_PSEA6 Length = 500 Score = 445 bits (1146), Expect = e-123, Method: Composition-based stats. Identities = 147/545 (26%), Positives = 223/545 (40%), Gaps = 116/545 (21%) Query: 27 AHAADDVKLKATKTNVAFSDFTPTEY-STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + + + N + +D ++ + KPNI+ + DDLGY + F+ + Sbjct: 8 STLLWGTLIAISVGNASAADAGQSKADESNEKPNILFVLADDLGYNDVGFNGST------ 61 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TP L L G+ F YVAH GPSRAAIMTGR P Sbjct: 62 --------------------DIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGRYP 101 Query: 146 ARFGVYSN---TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 + G N ++ G+ E F+ + ++ GY+T A+GKWHL + Sbjct: 102 HKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGE-------------- 147 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---------------------PSL 241 A E+ P GFD F GF G Y+ L Sbjct: 148 ---------ASEYHPNKHGFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMYLTPL 198 Query: 242 FKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF 301 N + V YI+D L+ EA+ VD+A +PF LYLAYNAPH+P D F Sbjct: 199 EHNGKEVRETEYITDGLSREAVNFVDKAAAKKKPFFLYLAYNAPHVPLQAKEED--MAMF 256 Query: 302 NTGSQ-TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 + Y VY+VD+GV RI+EQLKKNGQ+DNT+I+FTSDNG + G N Sbjct: 257 SQIKDKKRRTYAGMVYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNGGKL-GQGANNYPL 315 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGV 419 K K GG TPM + W ++ G + + A+D YPT +P+D KLDG Sbjct: 316 KEGKGSVQEGGFRTPMLVHWPKHMKAGSRFSHPVLALDLYPTFAGLGGAVLPEDKKLDGK 375 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 + +Q + + + + + D Sbjct: 376 DIWADIQANTAPHKDEFIYVLRHRNGYSD------------------------------- 404 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREF-IDSS 537 R N + V ++ LY + D+ + ++++A +P ++++M + + ++ Sbjct: 405 ---AAARRNQFKAVKNHNDD-WKLYNIAQDISEDNDISAQHPDILRDMVSSMESWSWNNQ 460 Query: 538 QPPLS 542 QP Sbjct: 461 QPKWF 465 >UniRef50_A4CMB1 Arylsulphatase A n=6 Tax=Bacteria RepID=A4CMB1_9FLAO Length = 459 Score = 445 bits (1145), Expect = e-123, Method: Composition-based stats. Identities = 132/536 (24%), Positives = 205/536 (38%), Gaps = 103/536 (19%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYS--TKGKP---NIIVLTMDDLGYGQLPFD 76 M + + L A + + + +P NI+ + +DDLGYG L Sbjct: 1 MNFHQSDSRRLCPLNAILALFSIGCLAAATGTCYAQERPDAPNILCILVDDLGYGDLSCQ 60 Query: 77 KGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSR 136 A +P + +L G+RFTN Y V PSR Sbjct: 61 G--------------------------ATDLQSPNIDALAANGMRFTNFYANSTVCSPSR 94 Query: 137 AAIMTGRAPARFGV----YSNTDAQDG-IPLTETFLPELFQNHGYYTAAVGKWHLSKISN 191 AA++TGR P GV N + G + +P GY+T +GKWHL Sbjct: 95 AALLTGRYPDLVGVPGVIRQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLGL--- 151 Query: 192 VPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP------SLFKNR 245 E P +RGF YF GF Y + NR Sbjct: 152 --------------------EEPDTPNDRGFTYFKGFLGDMMDDYWDHRRGGINWMRLNR 191 Query: 246 ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNT 303 E + KG+ +D TD I + + +QPF LYLAYNAPH P P D+ +++ Sbjct: 192 EEIDPKGHATDLFTDWTIDFLKERQGEEQPFFLYLAYNAPHFPIQPPREWLDKVREREPN 251 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY 363 ++ A V +D V R++E LK G +NT+++F SDNG + NG +G Sbjct: 252 LTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNGGAL-WYAQSNGPLRGG 310 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K Y GG P +WKGK+ PG D MD +PT + A P++ +DG+SL+ Sbjct: 311 KQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRKPPEN--VDGISLV 368 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P L + Q ++ L W+ + + Sbjct: 369 PTLTGQAQDTANRYLYWVRREGGDYGGQAY------------------------------ 398 Query: 483 YTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 Y R D+ ++ + + + D + L + + + ++ + E I ++ Sbjct: 399 YAARFGDFKILQNTPFEPIQFFNIGQDELETTPL-ETDSEAYRALRAQLMEHIRTA 453 >UniRef50_Q7UKJ5 Arylsulfatase A n=3 Tax=Bacteria RepID=Q7UKJ5_RHOBA Length = 489 Score = 445 bits (1144), Expect = e-123, Method: Composition-based stats. Identities = 117/532 (21%), Positives = 200/532 (37%), Gaps = 103/532 (19%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 T KPN+IV+ DD GY L Sbjct: 35 SSAAESTDTTEKPNVIVIFTDDQGYNDLGCYGS--------------------------P 68 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---TDAQDGIPL 162 TP L L EG R+T+ Y A V PSRAA++TG P R G++ + + G+ Sbjct: 69 NIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHVLFPQSTYGLHP 128 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 E + + ++ GY TA VGKWHL +P Y+ +S + P N+ Sbjct: 129 DEVTIADHLKSAGYATACVGKWHLGHHKET-LPTSNGFDSYYG--IPYSNDMNHPDNKRL 185 Query: 223 -----DYFMGFHAAGTAYYNSPSLFKNRERVP---AKGYISDQLTDEAIGVVDRAKTLDQ 274 D ++ +N+P L ++ E + + ++ + TD AI V+ + D+ Sbjct: 186 GKMSSDDRWTDQSSAVTLWNTP-LVQDEEIIELPVDQRTVTRRYTDRAIEFVEANQ--DK 242 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 PF LYL ++ PH+P P D Y + Y + +D V R+++ ++ G Sbjct: 243 PFFLYLPHSMPHIPLYVP-EDVYD------PDPQNAYKCVIEHIDTEVGRLVQTVRDLGL 295 Query: 335 YDNTIILFTSDNGAVID--GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDK 391 + T+I++TSDNG + G + K T+ GG P MW G++ G + Sbjct: 296 SEKTLIVYTSDNGPWLQFKNHGGSAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNA 355 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + MD PT +++ D K+DG+ L + + + + Sbjct: 356 FATNMDLLPTIASFTGVALENDRKIDGIDLTSTFTSD-ESARDEFVFYSAHGVLEG---- 410 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY-----------TVENNQ 500 +R D+ + + Sbjct: 411 ---------------------------------IRMGDWKYLRQVARRGPNAKGPKPEPK 437 Query: 501 LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 + L+ L+ D+ +K+NL P+ V++M + E + V ++K N+ Sbjct: 438 VFLFDLSQDIGEKNNLVEQQPERVQKMHARMEELNEEITANARPVWRKKVNS 489 >UniRef50_Q7UHJ6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UHJ6_RHOBA Length = 500 Score = 444 bits (1143), Expect = e-123, Method: Composition-based stats. Identities = 138/561 (24%), Positives = 214/561 (38%), Gaps = 92/561 (16%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 M S+ + S++ S + M +A + + + T + +PN Sbjct: 18 MPSSTEPCSFSSTTSRTDCANMKTTSAISIASLFVCMLATQPF--AMADANAADAARPNF 75 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +V DD+G+G TP L L +GV Sbjct: 76 VVFVADDMGWGD--------------------------SHTYGHELIQTPNLDRLASQGV 109 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--GIPLTETFLPELFQNHGYYT 178 +FT Y A GV PSR+AI+TGR P R GVY + + +E PEL + GY T Sbjct: 110 KFTQCYSACGVCSPSRSAILTGRTPYRNGVYRHLSGNHEAHLRASEITFPELLKEVGYET 169 Query: 179 AAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS 238 VGKWHL PE P GFD++M + + + Sbjct: 170 CHVGKWHLLSRQQFNNPEFP-----------------HPGEHGFDHWMCTQNNASPSHQN 212 Query: 239 PS-LFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 P +N E V +GY + + EA + +PF + + + PH P + Sbjct: 213 PDNFVRNGEPVGQLEGYSAQLVASEAARWLKDIHDPSKPFAMTVWVHEPHSPI--ATDSR 270 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 +Q +N Y ++ +D + +++ L DNT++ FTSDNG V Sbjct: 271 FQSLYNG--HENSKYMGNITQMDHALGMVMDALDAQEVTDNTLLFFTSDNGPVPAFG-GS 327 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLK 415 +G +G K + GG P W G +QPG D + D + T LD A I +P D Sbjct: 328 SGGLRGNKRSDHEGGIRVPGVARWPGHIQPGTISDTPVIGTDVFATVLDIAGIPLPTDRT 387 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 +DGVS+LP + K E L W T S D Sbjct: 388 IDGVSMLPAFEGK-PVERSTPLFWRTHVSPPEDRV------------------------- 421 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE-F 533 +R D+ LV + LY++ D +++ +LAAA P+ KEM+ + + + Sbjct: 422 --------ALRIGDWKLVGDETLTKFQLYEIQKDWKEEHDLAAAMPEKTKEMKDQLMKVW 473 Query: 534 ID-SSQPPLSEVNQEKFNNIK 553 D ++ P E+ + Sbjct: 474 RDIETEGPDHWWKNERQKPAR 494 >UniRef50_C1ZGF2 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZGF2_PLALI Length = 490 Score = 444 bits (1143), Expect = e-123, Method: Composition-based stats. Identities = 125/564 (22%), Positives = 207/564 (36%), Gaps = 120/564 (21%) Query: 19 ASGMAAFAAHAADDVKL-KATKTNVAFSDFTPTEYSTKGK--PNIIVLTMDDLGYGQLPF 75 + M F + + A + + + + PNII++ MDD+G+ + F Sbjct: 1 MTIMTRFLTTSIRQFTMRVAILCLTLWLPLHADSLAAESRRPPNIILILMDDMGWRDVGF 60 Query: 76 DKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPS 135 F TP + L G+ FT Y + P+ Sbjct: 61 MGNKF--------------------------VETPHIDRLAKTGLVFTQAYASAPNCAPT 94 Query: 136 RAAIMTGRAPARFGVYSNTDAQDG----------------IPLTETFLPELFQNHGYYTA 179 RA +M+G+ R G+Y+ D + + + E ++ GY TA Sbjct: 95 RACLMSGQYAPRHGIYTVVDPRQPPGSPWHKWQAAESKSELDTNVVTIAEALRDGGYATA 154 Query: 180 AVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP 239 G W+L + PV Q P+N GF Y++ Sbjct: 155 FFGMWNLGRGRTGPVTPGGQGFQKVVF----------PENLGF--------GKDEYFD-- 194 Query: 240 SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQY 297 K Y++D+LTDE + VD + +QPF +YL +A H P + +Y Sbjct: 195 ---------DGKHYLTDRLTDEVLKFVDEHR--EQPFFVYLPDHAIHAPFNPKPELLAKY 243 Query: 298 QKQFNTGSQTA--DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP 355 +++ + A++ +VD V RI++ LK+ DNT+++FTSDNG Sbjct: 244 ERKAAASNDRRDDPACAATIEAVDHNVGRIMDHLKRLKLSDNTVVIFTSDNGGTQQ---- 299 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDL 414 +G K + Y GG P+ + G G D +S++D YPT L+ A I P+ Sbjct: 300 YTPPLRGGKGELYEGGIRVPLVVAGPGVKSLGSRCDVPVSSIDLYPTLLELAGIKPPEGQ 359 Query: 415 KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPN 474 LDGVSL P LQ + + L W H P Sbjct: 360 VLDGVSLAPLLQGDATLDRER-LFW-------------------------------HFPC 387 Query: 475 TEDLSQFSYTVRNNDYSLVYTV-ENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 + S +R D+ L+ E ++ L+ L D ++ NLA+ P + +R Sbjct: 388 YVGKATPSSAMREGDFKLIEFFEEGGRVELFNLKNDPNEEKNLASVMPDKAAALAKTLRA 447 Query: 533 FIDSSQPPLSE-VNQEKFNNIKKA 555 + + + N ++ Sbjct: 448 WQKKTNASIPPGPNPSYDPQAERP 471 >UniRef50_Q7UL40 Arylsulfatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UL40_RHOBA Length = 592 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 123/535 (22%), Positives = 184/535 (34%), Gaps = 85/535 (15%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 ++ +K+ V + + +PN+I++ DD G+ ++ F Sbjct: 18 PETNMSIKSIVWIVVCLSSVTVAVAAEPRPNVILVMTDDQGWAEVGFHG----------- 66 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 TP L EG TN YV + P+R+++MTGR R Sbjct: 67 ---------------NEVLKTPNLDRFAAEGTELTNFYV-SPMCTPTRSSLMTGRYHFRT 110 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 G + + + ET + E+F GY T GKWHL + + + + Sbjct: 111 GAHDTYIGRSNMNPEETTIAEVFAGAGYRTGIFGKWHLGENFPMRAEDQGFQKVVVHGGG 170 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDR 268 P N Y P+L N AKGY +D DE+I + Sbjct: 171 GIGQFADYPGN---------------TYWDPTLQYNDSFKKAKGYCTDVFIDESIQFMKD 215 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQ--YQKQFNTGSQTA---DNYYASVYSVDQGVK 323 + +QPF YL N PH P D + Y Q Y + D Sbjct: 216 --SGEQPFFCYLPLNVPHSPFDVADEFRADYDNQNLADPDGRKWVAPIYGMITQFDGAFG 273 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 R+LE ++ GQ +NTIILF SDNG + K Y G +P + W Sbjct: 274 RLLEAVEDMGQRENTIILFMSDNGPNST---YFTAGLRAKKGSVYENGIRSPFVIQWPKT 330 Query: 384 LQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 LQ G +D +D PT DA I +P DL++DG S+L L + QG + L Sbjct: 331 LQGGRKFDTPAMHIDLLPTLADACGIGLPADLQVDGKSILGLLHGETQGFQQRYLFMQ-- 388 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-VENNQL 501 HN + R + +V E Sbjct: 389 ----------------------------HNRANVPPKYENCMARRGPWKVVGDGGEPTGF 420 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 LY + D + +LA +P++VK + D L N + Sbjct: 421 ELYNIEQDPGETRDLADKHPEIVKAFVREYEAWFDDVTTQLRRDNGVPYPTELNP 475 >UniRef50_D2QZX4 Sulfatase n=10 Tax=Bacteria RepID=D2QZX4_9PLAN Length = 499 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 132/539 (24%), Positives = 201/539 (37%), Gaps = 99/539 (18%) Query: 27 AHAADDVKLKATKTNVAFSDFTPTEYSTKG-----KPNIIVLTMDDLGYGQLPFDKGSFD 81 AHAA T S TE S +PNI+++ DDLGY + Sbjct: 16 AHAAMTFVAFVLATTFVISSTAATEESAADAASKRRPNIVLIFCDDLGYADIGCFG---- 71 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 A TP L L EG++FT+ VA V SRAA++T Sbjct: 72 ----------------------AKGYETPNLNKLASEGMKFTDFQVAAAVCSASRAALLT 109 Query: 142 GRAPARFGVYSNTDAQD--GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQ 199 G P R G+ S D GI E + EL QN GY TA GKWHL Sbjct: 110 GCYPQRVGILSALGPSDSIGIAKNELLISELLQNLGYKTACFGKWHLGHH---------- 159 Query: 200 TRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP---------SLFKNRERV-- 248 E++ PQ GF + G + + P L + + Sbjct: 160 -------------EQFLPQQNGFATYFGLPYSNDMWPKHPTAKNAYPPLPLIDGNKTIEL 206 Query: 249 -PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 P + ++ T++A+ + ++PF LY+ +N PH+P + + G Sbjct: 207 NPDQTKLTTWYTEKAVKFI--HDCGEKPFFLYVPHNMPHVPL-------FVSEKFAGKTK 257 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKS 365 + + +D V I + L+ G DNT+++FTSDNG + G + K Sbjct: 258 RGLFGDVIAEIDWSVGEITKALEATGNVDNTLVIFTSDNGPWLSYGDHAGSTGGFREGKG 317 Query: 366 QTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 + GG PM + G +QPG DKL S +D +PT +I K+DGVS+ P Sbjct: 318 TVWEGGHRVPMIAKYPGTIQPGTTCDKLASTIDLFPTIAHYCGATIDPSRKIDGVSIQPL 377 Query: 425 LQD-KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L+ + H+ + +W + + + H P T+ + Sbjct: 378 LESVEGAKSSHEFFYY-----YWGNGLEAVRDERFKLHFPHAFRSLTGTPGTDGMPNG-- 430 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 YT +L L+ L D ++ N+AA +P+V + L Sbjct: 431 ----------YTQAKTELALFDLDADPFEQTNIAADHPEVTARLTAAAESMRSDLGDSL 479 >UniRef50_A6DMY9 Putative uncharacterized protein n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMY9_9BACT Length = 590 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 131/535 (24%), Positives = 210/535 (39%), Gaps = 94/535 (17%) Query: 36 KATKTNVAFSDFTPTEYS-TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 K + F + + + KPNI+++ DD GYG + Sbjct: 3 KVFIKWLGLCAFALSPAALAEDKPNIVLILTDDQGYGDISSHG----------------- 45 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP L L ++G RF N +V++ V P+RA+++TGR R GV + Sbjct: 46 ---------NRMIDTPHLDQLAEDGTRFENFFVSN-VCAPTRASLLTGRYHIRTGVVQVS 95 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 + + E + E+F+ GY T GKWH + Sbjct: 96 RGLEIMRSEEATIAEVFKAQGYETGLFGKWHNGEHY-----------------------P 132 Query: 215 WQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQ 274 P +GFD + GF A + +L N+ V KG+I+D LTD AI +++ + D+ Sbjct: 133 NNPPGQGFDEYFGFCAGHIGDFFDATLDHNKTFVKTKGFITDVLTDRAIDWIEKQQ--DK 190 Query: 275 PFMLYLAYNAPHLPNDNPAPDQYQKQF--NTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 PF Y+ YNAPH P D+Y +F S Y + ++D + R+L+ L Sbjct: 191 PFFAYIPYNAPHAPYQ--VEDKYYDEFAAKGYSAAHSAAYGMIENLDDNIGRLLKILDDL 248 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL-QPGNYDK 391 DNTI++F +DNG + P NG KG K GG P F+ W GK+ + Sbjct: 249 NLTDNTIVIFLTDNGP--NSPTRFNGGMKGSKGSVDEGGVRVPFFIRWPGKIAKGRTIHD 306 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 L + +D PT ++ A +++ KLDG SL + K + Sbjct: 307 LAAHIDVLPTLMELAGVNVDLPNKLDGRSLTSLISSSKTPK------------------- 347 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQ 510 P W F + + + R+N Y V + + GLY + D Sbjct: 348 APAWPERLIFTQGPGTNMTPGS-------GAGAARSNQYRYVLSR--GEEGLYDMINDPG 398 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-----EVNQEKFNNIKKALSEAK 560 Q+ +L + ++ E++ E++ V ++F EAK Sbjct: 399 QEKDLKKSKKKIFDELKAAYIEWLKDVSAGWEPNTTIPVGYKEFPATYLQAVEAK 453 >UniRef50_UPI000186ED10 arylsulfatase B precursor, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186ED10 Length = 570 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 131/600 (21%), Positives = 216/600 (36%), Gaps = 111/600 (18%) Query: 14 ISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQL 73 + +I + F ++ + S F +PNII++ DDLG+ + Sbjct: 3 LIIIHVFQLNKFYLLLKMLIRENILLIFITLSIFFQNVVDGIERPNIIIILADDLGWNDV 62 Query: 74 PFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSG 133 F + Q TP + +L G+ + YV + Sbjct: 63 SFHGSN--------------------------QIQTPNIDALAYNGIILNSHYVPA-LCT 95 Query: 134 PSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 PSRA++MTG+ P G+ G+PL ET +PE F +GY T AVGKWHL Sbjct: 96 PSRASLMTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLG--- 152 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP- 249 F +E+ P RGFD G YY+ ++ + + Sbjct: 153 -------------------FFKKEYTPIYRGFDSHFGHWNGFQDYYDHTTMSDSLKGYDM 193 Query: 250 ----------AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL-----PNDNPAP 294 Y +D T EAI ++D + P LYL++ APH P P Sbjct: 194 RRNFEVDYSYQGMYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAP-E 252 Query: 295 DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG-- 352 D+ K Y A V +D+ V +++ L+KN +N+II+F SDNGA G Sbjct: 253 DEISKHECINDPGRKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLH 312 Query: 353 -PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISI 410 N +G K + GG +W + +L+ D+ PT L AA ++ Sbjct: 313 SNRGSNYPLRGLKESPWEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAGLNY 372 Query: 411 PKDL---KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 K+DG+ + L + + ++ + ++D Sbjct: 373 SSTQLINKIDGIDMWNVLSNDLPSPRKEVFNNYDEIENYSSLMIDSWKYVEGTAQEGKAD 432 Query: 468 DYPHNPNTEDLSQFSYT------VRNN--------------------------DYSLVYT 495 + P+ + S++ + +R + V T Sbjct: 433 YWFEEPSRNNCSEYRVSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVKNKTKYVLT 492 Query: 496 VEN--NQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNI 552 + + L+ L D ++ NLA P VVK ++ + E S PL++ N + Sbjct: 493 CDPLLKRFCLFNLNDDPCERLNLADVFPDVVKRIKNRLLELKKSVVKPLNKPEDPYSNPM 552 >UniRef50_A6DSH3 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSH3_9BACT Length = 455 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 132/530 (24%), Positives = 218/530 (41%), Gaps = 108/530 (20%) Query: 35 LKATKTNVAFSDFTPTEYS-TKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 + + T + + + KPNIIV+ DD GY + ++ Sbjct: 1 MINSFTKLFLALLCVNFVALADSKPNIIVILSDDQGYADVSYN----------------- 43 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 STP +L GV F GY + V +R+ +MTGR R+G+Y+ Sbjct: 44 -------PEHDDYISTPHTDALAKSGVIFHRGYTSGSVCSTTRSGLMTGRYQQRYGIYTA 96 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + G L F+P + GY + A GKWHL Sbjct: 97 GEGGTGTDLNAKFIPNYLKEAGYKSMAFGKWHLGHEMK---------------------- 134 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYN----------SPSLFKNRERVPAKGYISDQLTDEAI 263 + P +RGFD F GF G + +++ E + KGY++ ++T+E + Sbjct: 135 -YHPLHRGFDDFYGFMGRGAHDFFRLEKEYDGKFGGPIYRGLEPIDDKGYLTTRITEETV 193 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVK 323 ++ K D+PF Y+AYNA H P PA D + +G +T D A + +D GV Sbjct: 194 KFIEENK--DKPFFAYVAYNAVHTPAQAPAEDI---KAVSGDETRDILVAMLKHLDLGVG 248 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 I++ LKK+ Y+NTII++ SDNG + N +G K Y GG P M W + Sbjct: 249 EIVKTLKKHDIYENTIIIYLSDNGGA-KSMVANNKPLRGVKHDIYDGGIRVPFLMSWPAQ 307 Query: 384 LQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS 442 ++ G + + ++D PT LDAA +P +DG S+LP ++ K + W Sbjct: 308 IKAGQDTQSPVISLDILPTLLDAAG--LPALSDIDGESMLPVIRGDKDNL-DRPFFWNHG 364 Query: 443 YSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG 502 ++ N++ LV+ Sbjct: 365 -------------------------------------DGQTGIQLNNWKLVFN--KGVTE 385 Query: 503 LYKLTD-LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 LYK++D + + NLAA++P+ V+ +Q + +++ P+S+ K++ Sbjct: 386 LYKISDDIGESKNLAASHPEKVQALQKIYDKWLSQMATPMSKNTIVKWDP 435 >UniRef50_Q7UJQ8 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=4 Tax=Planctomycetaceae RepID=Q7UJQ8_RHOBA Length = 491 Score = 444 bits (1142), Expect = e-123, Method: Composition-based stats. Identities = 121/556 (21%), Positives = 191/556 (34%), Gaps = 128/556 (23%) Query: 23 AAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDP 82 AA + + + F+ T + +PNI+ + DDLGYG L Sbjct: 1 MRLAAVLRFSFPVLTSLLVLGFATAPSTSAADAKRPNIVFILADDLGYGDLGCYG----- 55 Query: 83 KTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTG 142 TP L + EG+RFT+ Y + V PSR+ +MTG Sbjct: 56 ---------------------QELIQTPRLDQMAAEGMRFTDFYAGNTVCAPSRSVLMTG 94 Query: 143 RAPARFGVYSNTDAQDG----IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 V N D + + E+ Q+ GY TA GKW L + Sbjct: 95 MHMGHTHVRGNAGGPDMSKQSLRDENVTVAEVLQSAGYATALCGKWGLGDDA-------- 146 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPAKG----- 252 + P+ +GFD+F G+ A+ P L++N +V + Sbjct: 147 -----------LGGRDGLPRKQGFDHFYGYLNQVHAHNYYPEFLWRNETKVALRNEVQRR 195 Query: 253 -----------------YISDQLTDEAIGVVDRAK--TLDQPFMLYLAYNAPHLPNDNP- 292 Y D + +EA+G + +PF LYL+ PH N+ Sbjct: 196 DRSYGGFTGGWATKRVDYSHDLIANEAMGFIREKATDAATKPFFLYLSLTIPHANNEGTG 255 Query: 293 -------APDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 PD S A + +D V RIL+ LK+ + T+++F+SD Sbjct: 256 MSGNGQEVPDYGIYADKDWSDQDKGQAAMITRMDSDVGRILDLLKELQIDEQTVVMFSSD 315 Query: 346 NGAVIDGPLPL-----NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM-DFY 399 NG +G G +G K GG P+ + W G PG I D Sbjct: 316 NGPHNEGGHNPKKFDPAGPLRGMKRALTEGGIRVPLIVRWPGTTPPGAVSDHIGYFGDLM 375 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQD-KKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 TA + A P+D D +S P + + + H+ L W Sbjct: 376 ATAAELAGTDFPEDA--DSISFAPTIVGRPEAQQTHEYLYWEF----------------- 416 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY-TVENNQLGLYKLT-DLQQKDNLA 516 VR ++ + LY L D+ + NLA Sbjct: 417 ------------------YEQGGRQAVRRVNWKAIREPWMTGPTQLYDLKADIGETTNLA 458 Query: 517 AANPQVVKEMQGVVRE 532 + +P++VK+++ ++ E Sbjct: 459 SDHPEIVKQLETLMEE 474 >UniRef50_A7SRP2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7SRP2_NEMVE Length = 491 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 119/533 (22%), Positives = 193/533 (36%), Gaps = 69/533 (12%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 + F + KP+++ + DDLG+ + F Sbjct: 5 LSFHCFFLCLNVVVLQSSAKPHLLFVLADDLGWSDVGFHGS------------------- 45 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NT 154 + TP + L GV N YV V P+RA++MTG+ P G+ + Sbjct: 46 --------KIQTPNIDRLAANGVILDNYYV-QPVCTPTRASLMTGKYPIHTGLQHGIIHN 96 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 G+PL T LP+ + GY T +GKWHL F E Sbjct: 97 GRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLG----------------------FYNWE 134 Query: 215 WQPQNRGFDYFMGFHAAGTAYYNSP-----SLFKNRERVPA--KGYISDQLTDEAIGVVD 267 P RGFD F GF++ +Y L N E V Y + T A + Sbjct: 135 STPTYRGFDTFYGFYSGAENHYTHVQDHYLDLRDNEEIVRDQNGTYSAHLFTKRAEQ-IV 193 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT-ADNYYASVYSVDQGVKRIL 326 RA P +Y+A+ H P AP +Y +++ Y A V +D + + Sbjct: 194 RAHDPSTPLFMYMAFQNVHSPVQ--APKEYIDRYSFIKDPLRRTYAAMVTIMDDALGNLT 251 Query: 327 EQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 K G ++NTI++F++DNG V + +G K + GG F+ Q Sbjct: 252 RAFDKAGLWENTILIFSTDNGGVP-KNGGYDYPLRGRKDTLWEGGVRGVAFVHGVALEQS 310 Query: 387 GN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSH 445 G L+ D+YPT + A S+ +D LDG + + + + L I + + Sbjct: 311 GVKCKALMHVTDWYPTLVSLAGGSLDEDEDLDGYDVWESISHGVESPRKELLHNIDTINI 370 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLG--L 503 + ++ F PN + D+ + + + L Sbjct: 371 PPGDGSLGFSTTGIGLRVGDMKLLMAVPNISYFIPPEDRNGSVDWYIHSNNKVPMVEVAL 430 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 Y +T D +K +L P VV +Q V + ++ PP ++ + K Sbjct: 431 YNITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPANKPKDPYARQVAKQ 483 >UniRef50_A4CGL5 Arylsulfatase A (Precursor) n=2 Tax=Flavobacteria RepID=A4CGL5_9FLAO Length = 526 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 108/541 (19%), Positives = 184/541 (34%), Gaps = 97/541 (17%) Query: 25 FAAHAADDVKLKATKTNVAFSDFTPTEYSTKGK----PNIIVLTMDDLGYGQLPFDKGSF 80 + L V+ + +E++ + PNI+++ DD GY + Sbjct: 37 WCQRCVRYPLLAIILLGVSCRETVKSEFAAADRADRPPNIVIIFTDDQGYSDVGVYG--- 93 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 A TP L ++ +G+ TN Y A V SRA ++ Sbjct: 94 -----------------------ARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLL 130 Query: 141 TGRAPARFGVYSN--TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 TG P R G+++ ++ G+ E L EL + GY T GKWHL Sbjct: 131 TGCYPNRVGIHNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDH--------- 181 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-----------SLFKNRER 247 ++ P GFD F G + + P L++ Sbjct: 182 --------------PDFLPTRHGFDEFFGIPYSNDMWPLHPLQGPVFDFGPLPLYEQERV 227 Query: 248 V---PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG 304 V + ++ Q+T+ ++ ++R K ++PF LY+ + PH+P + G Sbjct: 228 VDTLEDQRLLTRQITERSVDFINRHK--EEPFFLYVPHPQPHVPL-------FVSDAFRG 278 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKG 362 Y + +D V ++L L+ NG D+T ++FTSDNG + + Sbjct: 279 KSGRGLYGDVIMEIDWSVGQVLGALEDNGLTDDTWVIFTSDNGPWLAYGNHSGRAEPLRE 338 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 K + GG P M + G+L G D+ + A+D PT P ++DG + Sbjct: 339 GKGTNWEGGVREPCIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGSPQP-GREIDGKNA 397 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 L + P + + + + R P Sbjct: 398 WGLLSGAEARGPQDAYYFYYRVNELQAVRDGDWKLVLPHNYRTMQGQEPGADGLPGAYD- 456 Query: 482 SYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 V+ LY L D + +NLA +P+V+ + Sbjct: 457 -------------YVDVTAPELYNLREDPGETNNLAERHPEVLAAISRKADSMRRRLGDA 503 Query: 541 L 541 L Sbjct: 504 L 504 >UniRef50_C6VYN4 Sulfatase n=3 Tax=Sphingobacteriales RepID=C6VYN4_DYAFD Length = 497 Score = 443 bits (1141), Expect = e-123, Method: Composition-based stats. Identities = 134/569 (23%), Positives = 204/569 (35%), Gaps = 155/569 (27%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +KL T K PNI+ + DDLGYG+L Sbjct: 1 MKLLNLFLLTISITCTAQAQKAPDKLPNIVYIYADDLGYGELGCYG-------------- 46 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 + TP L L EG+RFT Y V P+RA +MTG+ + Sbjct: 47 ------------QQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGKHAGHSAIR 94 Query: 152 SNTD----------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 N + Q +P E + EL + GY TA GKW + + Sbjct: 95 GNFELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNT---------- 144 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNR--------------- 245 E P +GFDY+ G+ A+ P L++N Sbjct: 145 ------------EGTPTRQGFDYYYGYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRK 192 Query: 246 -----------ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP 294 E K Y ++T++A+ +DR+K PF LY+ Y PH+ AP Sbjct: 193 LDPAKATDADFESFKGKEYAPAKMTEKALAFIDRSKAG--PFFLYMPYTLPHVSLQ--AP 248 Query: 295 DQYQKQ---------------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 D+Y K+ + + Y + + +D V IL++LK G DNTI Sbjct: 249 DEYVKKYIGQFDEKPYYGEKNYASTKYPLSTYASMITFLDDQVGIILDKLKALGLDDNTI 308 Query: 340 ILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS 394 ++F+SDNGA +G +G K Y GG P + W GK++PG +S Sbjct: 309 VMFSSDNGATFNGGVNPQFFNSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGRVSDHVS 368 Query: 395 A-MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ-GEPHKNLTWITSYSHWFDEENI 452 A D PT + + P DG+S LP L + + H+ L + Sbjct: 369 AQFDLMPTLAELTGQASP---PTDGISFLPELLGQTNRQKKHEFLYFEYPEKG------- 418 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV----ENNQLGLYKL-T 507 VR D+ V T N L+ L T Sbjct: 419 ----------------------------GQIAVRMGDWKGVKTDLRKNPGNPWQLFNLKT 450 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 D + ++AA++P ++K++ +V+ + Sbjct: 451 DRSESTDVAASHPDILKKLDQIVKREHEE 479 >UniRef50_C1ZA41 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA41_PLALI Length = 519 Score = 443 bits (1140), Expect = e-123, Method: Composition-based stats. Identities = 125/537 (23%), Positives = 196/537 (36%), Gaps = 93/537 (17%) Query: 23 AAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDP 82 F A + + + + S K +PNII++ DD GYG L Sbjct: 12 VCFRQLAIFSMIATVIGAGLTIARIVEADES-KTRPNIILMMTDDQGYGDLSLHG----- 65 Query: 83 KTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTG 142 TP L L + VRF +V P+RA+IMT Sbjct: 66 ---------------------NPVVKTPHLDQLGRQSVRFEQFHV-SPTCAPTRASIMTS 103 Query: 143 RAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 R GV ++ + L T LP+ + GY T GKWHL Sbjct: 104 RHEFSSGVTHTILERERLSLKATILPQFLKRAGYTTGIFGKWHLGD-------------- 149 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-------------YNSPSLFKNRERVP 249 + +QP RGFD G Y +P + N + V Sbjct: 150 ---------EDAYQPGKRGFDEVFIHGGGGIGQSYPGSCGDAPLNKYFNPVIRHNGKFVA 200 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD 309 GY + D+AI + ++ +QPF Y+ NAPH P D P + Y+ + Sbjct: 201 TNGYCTKVFVDQAITWIS-SQPDNQPFFCYITPNAPHAPLDCPK-EYYEPYLEHVPEDVA 258 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYP 369 +Y + D + R+L+ L+ +TI++F +DNG+ + + K Y Sbjct: 259 RFYGMITHWDDQLGRLLKALEDRDISKDTIVIFMTDNGSATGAKH-FSAGMRANKGTPYE 317 Query: 370 GGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLK--LDGVSLLPWLQD 427 GG P F W G QP ++ D PT + A++ + D K G SL+P L Sbjct: 318 GGIRVPAFWSWAGHWQPQVRQEVTCHYDILPTLTELANVPVADDEKQSWQGRSLVPLLAG 377 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 + P + T W E + + +++ + +R Sbjct: 378 RSPNWPPRPFI--THVGRWPKEHDPKREPSTYQYAK-------------------CAIRL 416 Query: 488 NDYSLVYTVENN--QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 D+ L+ V+ Q LY+L D +K NLA P V+E++ + + S P + Sbjct: 417 GDWKLISNVKQGEPQWELYQLAEDPAEKINLAKKYPDRVEELKKIYDAWWLSVVPKM 473 >UniRef50_C6D6K5 Sulfatase n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D6K5_PAESJ Length = 434 Score = 443 bits (1140), Expect = e-123, Method: Composition-based stats. Identities = 140/522 (26%), Positives = 202/522 (38%), Gaps = 131/522 (25%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PNIIV DDLGYG L TP L L Sbjct: 2 KRPNIIVFYCDDLGYGDLGCYGS--------------------------DAMKTPHLDQL 35 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD---GIPLTETFLPELFQ 172 EG+RFTN Y V PSRA+++TG+ PA+ GV S + G+ L +T L + Sbjct: 36 ASEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVTSILGGKRGTKGLSLEQTTLASALK 95 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 HGY+TA GKWHL + E+ P GFD F GF A Sbjct: 96 EHGYHTALFGKWHLGASA-----------------------EYGPNAHGFDQFYGFRAGC 132 Query: 233 TAYYNS-------------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFML 278 YY+ L++N V G Y+++ +T EA +D A D+P+ + Sbjct: 133 IDYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREATSYID-AAPDDEPYFM 191 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 Y+AYNAPH P AP Y +F A + +VD GV I++ LK+ G Y++T Sbjct: 192 YVAYNAPHYPMH--APKAYLDRFPDLPPDRRIMAAMIAAVDDGVGEIVKALKQKGAYEDT 249 Query: 339 IILFTSDNGAVIDGP-----------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 II F+SDNG + G +G+K+ + GG P + + L Sbjct: 250 IIFFSSDNGPSTESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILSYPAGLAEQ 309 Query: 388 N---YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 D++ + MD +PT L+ + I + LDG S+ L P K L W Sbjct: 310 QGQISDEMFAMMDIFPTMLELSGIGT-EGYSLDGHSVFDALSG-NALSPRKQLFWEYE-- 365 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT-------VE 497 VR + LV E Sbjct: 366 ------------------------------------GQLAVREGKWKLVLNGKLDFSRTE 389 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + + L L D ++ NL P++ + ++ VR++ S Q Sbjct: 390 ADAVHLSDLEQDSSERINLVKQYPEIAQRLERDVRQWYQSLQ 431 >UniRef50_B9YAN4 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9YAN4_9FIRM Length = 470 Score = 443 bits (1139), Expect = e-122, Method: Composition-based stats. Identities = 122/541 (22%), Positives = 203/541 (37%), Gaps = 132/541 (24%) Query: 57 KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLM 116 +PN+I++ +DDLG+ L SF TP + L Sbjct: 4 QPNVIMILIDDLGWMDLSCQGSSF--------------------------YETPHIDQLR 37 Query: 117 DEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG---------------IP 161 EG+ F Y A V PSRA+I++G+ PAR V D ++ + Sbjct: 38 REGMAFDQAYAACPVCSPSRASILSGKYPARLKVTDWIDHENYHPCRGKLIDAPYIKELS 97 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 ++E + + FQ GY T VGKWHL E P++ G Sbjct: 98 VSEFSMAKAFQEAGYQTWHVGKWHLG------------------------KEATYPEHHG 133 Query: 222 FDYFMGFHAAGTAY--YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 FD +G G Y SP +N P Y++D++ EA ++ R++ +PF L Sbjct: 134 FDVNLGGSWWGHPKKGYFSPYHMENLSDGPEGEYLTDRIGAEAAALI-RSRDPQRPFFLN 192 Query: 280 LAYNAPHLPNDNPAPD--QYQKQFN-------------------TGSQTA---------D 309 L + A H P A D ++++ Sbjct: 193 LWHYAVHTPLQAKAEDIAYFEEKAKRMGLDQQDPFEIGDPFPILQKKDKRITRRIVQSDP 252 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKSQT 367 Y A + ++D V +++ LK G ++TI++FTSDNG + N K Sbjct: 253 VYAAMIKALDDSVGQLMATLKAEGLDEDTIVIFTSDNGGLATAEHSPTCNFPLSEGKGWM 312 Query: 368 YPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y G P+F+ W GK++ G+ L ++ DFYPT L+ + + DGVSL P L Sbjct: 313 YEGAVREPLFVRWPGKIEAGSLSHALTTSPDFYPTLLELCGLPLRPQQHCDGVSLAPVLL 372 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + + + W + +R Sbjct: 373 NPQAKFDRGPIFWHYPHYGNQGGT------------------------------PGSALR 402 Query: 487 NNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 + + E++ + L+ L D+ +K N+A P +V++ ++ E++++ EVN Sbjct: 403 CGKWKYIEFYEDHSVRLFDLEQDVSEKHNVAEVYPDLVRQFHSLLHEWLEAVDAWYPEVN 462 Query: 546 Q 546 Sbjct: 463 P 463 >UniRef50_A7HQ00 Steryl-sulfatase n=4 Tax=Proteobacteria RepID=A7HQ00_PARL1 Length = 553 Score = 442 bits (1138), Expect = e-122, Method: Composition-based stats. Identities = 131/555 (23%), Positives = 207/555 (37%), Gaps = 113/555 (20%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 E + PNI+V+ DDLG+ + G Sbjct: 58 GPAVAAEPAGNRPPNIVVILADDLGFNDISHFGGGI------------------------ 93 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD------ 158 TP + S+ G FT+ Y PSRA IMTGR R G Sbjct: 94 --VPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTPTPPGMTRIVD 151 Query: 159 ----------------------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 G+P +E L E + GY+ +GKWHL + Sbjct: 152 MFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSEITLAEALKPKGYHNIHIGKWHLG-NA 210 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQ----NRGFDYFMGFHAAGTAYYNSPSLFKNRE 246 +P + + + E P FD F A Y S + Sbjct: 211 PEFLPNAQGFDESVMLESGLFLPEDSPDVVNAKLPFDPIDQFLWARMQYATS---YNGSA 267 Query: 247 RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ 306 KGY++D TDEAI ++ + ++PF LYLA+ H P D Y + + Sbjct: 268 WFEPKGYLTDFYTDEAIKAIEANR--NRPFFLYLAHWGVHTPLQASKAD-YDALSHIEDE 324 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-NGAQKGYKS 365 Y A + ++D+ V R+L+ LK+NG +NT+++F+SDNGA LP N +G+K Sbjct: 325 RLRVYAAMIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPGYIGLPDVNKPYRGWKL 384 Query: 366 QTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 + GG P F W ++ G ++ +D +PT + AA +P D +DG+ LLP+ Sbjct: 385 TFFEGGIRVPFFAKWPARIPAGTERTTPVAHLDMFPTIVAAAGGELPADRVIDGIDLLPY 444 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 ++ P + + W + Sbjct: 445 AARGEKPAP-RPIFWRDGHY--------------------------------------QA 465 Query: 485 VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 V+ + + L N+ L+ L TD +++N+A NP+ V E++ +V + + PL Sbjct: 466 VQADGWKLQMAERPNKTWLFNLKTDPTEQNNVADENPEKVAELKALVEAHNATQREPLFP 525 Query: 544 VNQEKFNNIKKALSE 558 E + K L E Sbjct: 526 AVAEMPVTVDKTLEE 540 >UniRef50_Q7US96 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7US96_RHOBA Length = 498 Score = 442 bits (1137), Expect = e-122, Method: Composition-based stats. Identities = 115/552 (20%), Positives = 192/552 (34%), Gaps = 113/552 (20%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 ++ +A S G+PNI+++ +DDLG+ + Sbjct: 8 RVSPIAIAIAMIFCCSPAQSRAGQPNILLIFIDDLGWKDIGCYG---------------- 51 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP + L EG+RFTN Y + V P+R A+ +G+ AR G+ ++ Sbjct: 52 ----------NDFVETPRIDQLAAEGLRFTNFYASGAVCSPTRCALQSGQNQARIGITAH 101 Query: 154 TDAQDG-------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 +PL + E + GY T VGKWHL Sbjct: 102 IPGHWRPFERVITPQTTMALPLDTVTIAESLKASGYTTGYVGKWHLG------------- 148 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTD 260 + E+QP +G+D+ + + P Y +D D Sbjct: 149 ----------NGPEFQPDRQGYDFSAVIGGPHLPGRYRVQGRSDLKPKP-NQYRTDFEAD 197 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQ--TADNYYASVY 316 I + + K DQPF L L+ A H+P + +Y+ Y A + Sbjct: 198 LCIDFMRQNK--DQPFFLMLSPFAVHIPLAAMSEKVQKYEAMAKQTGNSLPHPVYAAMIE 255 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGA---------VIDGPLPLNGAQKGYKSQT 367 D V R+++ L++ D+T+I+FTSDNG D + KG K Sbjct: 256 HCDDMVGRLVDSLEQLDIADDTMIVFTSDNGGLYKRYDYRESADDLVSSQAPLKGEKGSL 315 Query: 368 YPGGTHTPMFMWWKGKLQ-PGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 + GG P+ + ++ G D+ + DFYPT ++ A +P + +DG SLLP + Sbjct: 316 HEGGIRVPLIIRHPATVKSAGVCDEPTISHDFYPTFVEMAGGELPINQTIDGHSLLPLMT 375 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 Q L W + H + + +R Sbjct: 376 APTQTLDRDALHWHYPHYHH--------------------------------DRPASAIR 403 Query: 487 NNDYSLVYTVE-NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 D+ L+ ++ + LY L DL + NLA+ +++ + + S Sbjct: 404 ERDWKLIEYLDGTGDVELYNLADDLGETKNLASEKQGRAGDLKRKLTTWRSSVLARTPIP 463 Query: 545 NQEKFNNIKKAL 556 N Sbjct: 464 NPSYDPERAHEW 475 >UniRef50_A4CMB0 Arylsulfatase A n=4 Tax=Bacteria RepID=A4CMB0_9FLAO Length = 492 Score = 441 bits (1136), Expect = e-122, Method: Composition-based stats. Identities = 127/532 (23%), Positives = 195/532 (36%), Gaps = 122/532 (22%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 S+ T KPN I++ DDLGYG L Sbjct: 37 SEGTAAAGGIPEKPNFIIVFADDLGYGDLSSFG--------------------------H 70 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN------TDAQD 158 T L + EG ++TN YVA V PSRA ++TGR P R G+ SN D+ + Sbjct: 71 PTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGRLPVRNGLTSNEIGVFFPDSHN 130 Query: 159 GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 G+P +E L E + GY T VGKWHL EE+ P Sbjct: 131 GMPASEITLAEQLKKAGYATGMVGKWHLGH-----------------------KEEYLPP 167 Query: 219 NRGFDYFMGFHAAGTAYY-------------------------NSPSLFKNRERVP---A 250 N GFD + G + + + L + E + Sbjct: 168 NHGFDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVN 227 Query: 251 KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN 310 + I+ + DEA+ + K D+PF +YLA++ PH+P + G+ Sbjct: 228 QNTITKRYNDEAVKWIREHK--DEPFFMYLAHSLPHVPL-------FTSDEFRGTSARGL 278 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTY 368 Y V +D GV +I+E L+ G +NTI++FTSDNG G + K T+ Sbjct: 279 YGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNGPWLPTGISGGSAGLLREGKGTTW 338 Query: 369 PGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 GG P W G L + S +D + T A + +P D ++DGV L P L Sbjct: 339 EGGMREPTIFWAPGMLPAKVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDLSPILFGD 398 Query: 429 KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNN 488 + + + + + + + Sbjct: 399 AESPRKEMFYYQGADLYAVRLGAYKAHFYTKEAYVMGA---------------------- 436 Query: 489 DYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 VE+N LY + D +K +L+ +P+V++E++ VV + Sbjct: 437 -----ERVEHNPPLLYNVEEDPSEKYDLSGKHPEVIEEIRRVVEAHNANMVK 483 >UniRef50_Q7UYA5 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA5_RHOBA Length = 562 Score = 441 bits (1136), Expect = e-122, Method: Composition-based stats. Identities = 135/563 (23%), Positives = 224/563 (39%), Gaps = 95/563 (16%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAAD-----DVKLKATKTNVAFSDFTPTEYSTK 55 + A + T+ SL L+ A F H + D A V FT E Sbjct: 63 LPHARVSLHIRTNESLTLSLTHATFHPHTPNMKHCIDSLAIAIVAVVFLGSFT--EAHAD 120 Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 +PNII+L DDLGYG L TP L L Sbjct: 121 DRPNIILLLADDLGYGDLSCFGS--------------------------PAVKTPHLDRL 154 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-IPLTETFLPELFQNH 174 EG++ Y V P+RA+++TGR P RFG+ + + ++G +P + T + EL ++ Sbjct: 155 ASEGLKCNRFYAGSAVCSPTRASVLTGRYPLRFGITKHFNDRNGWLPESATTVAELLKDA 214 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF--------- 225 GY TA +GKWHL + + D + P+ GFD++ Sbjct: 215 GYNTAHIGKWHLGGL-------------HVDEPGKRLTNQPGPRQHGFDFYQTQIEQQPL 261 Query: 226 MGFHAAGTAYY--NSPSLFKNRERVPAK-----GYISDQLTDEAIGVVDRAKTLDQPFML 278 G + L +N +R+ + +D D A+ ++++ + + PF + Sbjct: 262 RGQMGRDKTLFRKGGTVLLRNDQRISQDDPYYHKHFTDANGDFAVEMIEKLSSEEDPFFI 321 Query: 279 YLAYNAPHLPNDN-PAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 + + PH P + P P + + + + V +D V IL +L + DN Sbjct: 322 NMWWLVPHKPYEPAPEPHWSDTAADDITDDQHRFRSMVQHMDAKVGAILRKLDELKIADN 381 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAM- 396 T++LFTSDNGA +G + KG K++ + GG PM + W + G + S Sbjct: 382 TLVLFTSDNGAAFEGFIHD---LKGGKTELHDGGIRVPMIVRWPDAIPAGQTSQTFSHTN 438 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP--HKNLTWITSYSHWFDEENIPF 454 D PT DAA + +P DL LDG+SLL + + W Sbjct: 439 DLLPTFCDAASVQLPSDLPLDGLSLLSHWKGGTPPSQVERGTVFWQL------------- 485 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKD 513 D Y RH P+ V ++ L+ + + L+ + D +K Sbjct: 486 -DLYKSLQRHYPKPKPYATE---------VVMRGNWKLL-AFKGKPVELFDVGADPNEKR 534 Query: 514 NLAAANPQVVKEMQGVVREFIDS 536 N+ A +P++V + ++++++ Sbjct: 535 NVLAEHPELVASLSAQLKDWLNE 557 >UniRef50_A7IPG5 Sulfatase n=3 Tax=Bacteria RepID=A7IPG5_XANP2 Length = 491 Score = 441 bits (1134), Expect = e-122, Method: Composition-based stats. Identities = 117/526 (22%), Positives = 188/526 (35%), Gaps = 115/526 (21%) Query: 38 TKTNVAFSDFTPTEYST-KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 S + + +P+I+ + DDLG+ + F Sbjct: 28 LAAVAGLSLLSSGARAADAPRPHIVYILADDLGFADVGFHGS------------------ 69 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY---SN 153 TP L L +G R Y P+RAA +TGR P +G+ Sbjct: 70 ---------DIKTPNLDHLAAQGARLGQFY-TQPFCTPTRAAFLTGRYPLHYGLQVGAIP 119 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + A+ G+ E LP+ ++ GY TA VGKWHL + Sbjct: 120 SGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHAD----------------------Q 157 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFKNRERVPAKGYISDQLTDEAIGVVDR 268 ++ P+ RGFD F G ++ + + +V +GY ++ EA+ ++ Sbjct: 158 KFWPRQRGFDSFYGPLVGEIDHFKHEAHGVTDWYHDNTQVKEEGYDTELFGKEAVRLI-A 216 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS-QTADNYYASVYSVDQGVKRILE 327 A P LYLA+ APH P AP Y Q+ + Y A + ++D + ++ Sbjct: 217 AHDPKTPLFLYLAFTAPHTPFQ--APQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVA 274 Query: 328 QLKKNGQYDNTIILFTSDNGAVIDGPLP-----------LNGAQKGYKSQTYPGGTHTPM 376 L G +NT+I+F SDNG N + K Y GGT Sbjct: 275 ALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVA 334 Query: 377 FMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 W G++ PG + ++ +D PT A S+ K LDGV + P L + G Sbjct: 335 LANWPGRIAPGAAEGVMHVVDMLPTLAKLAGASLAKSKPLDGVDVWPALAAGQAGRA--- 391 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 ++ VR+ + LV+ V Sbjct: 392 ------------------------------------GIVYNVEPTQGAVRDGRWKLVWRV 415 Query: 497 ENNQL-GLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 L+ + D + +++A +P+ V E+QG V + PP Sbjct: 416 VLPPTAELFDVEADPSETTDVSAQHPEKVAELQGKVVALARTMAPP 461 >UniRef50_A6DKD8 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKD8_9BACT Length = 455 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 141/548 (25%), Positives = 217/548 (39%), Gaps = 125/548 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + F+ T KPNII++ DDLGY L F Sbjct: 1 MKLIFSLIFFTYSTLAL--AAQKPNIILILADDLGYEDLGFLG----------------- 41 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 A TP + +L G+ FT GY + V GPSRA ++TGR FG N Sbjct: 42 ---------APDIKTPHIDALARSGMNFTQGYQSASVCGPSRAGLLTGRYQQLFGSGENP 92 Query: 155 D---------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 GIPL E + +L + Y T +GKWH+ Sbjct: 93 PETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMGL----------------- 135 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS----------PSLFKNRERVPAKGYIS 255 + E +P R DY+ GF +Y + +F+N E VP GY + Sbjct: 136 ------SHEQRPTQRSVDYYYGFLNGAHSYREAKMDMKGAPMTWPIFRNNEPVPFSGYTT 189 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASV 315 + DE + + R K D+PF LY++YN+ H P + D + + + Y A + Sbjct: 190 EVFNDEGVNFIKRNK--DKPFFLYMSYNSVHGPWEAQPKDLQRS-DHIKKKWRRIYSAML 246 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP--------LPLNGAQKGYKSQT 367 S+D GV R+++ LK G Y+NT+++F SDNGA + L NG+ +G K T Sbjct: 247 ISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAERAGDYLASNGSLRGRKGDT 306 Query: 368 YPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y GG P M W + Y +S +D PT + + + P +L GV+L+P++ Sbjct: 307 YEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHISQAA-PAKKELSGVNLMPYIT 365 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 +K PHK L W + +R Sbjct: 366 GEKTSRPHKTLYWRRDDDY--------------------------------------AIR 387 Query: 487 NNDYSLVYTVENNQ--LGLYKLTD-LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + D+ L + N L+ L D +K+NL +P++ +++Q ++ DS P Sbjct: 388 DKDWKLTWNDYNGPRTPMLFNLKDDPNEKNNLIHKHPEIAQKLQAKFDQW-DSKLPDNKW 446 Query: 544 VNQEKFNN 551 N Sbjct: 447 WGGPSNRN 454 >UniRef50_A9LGQ4 Secreted arylsulfatase n=4 Tax=Bacteria RepID=A9LGQ4_9BACT Length = 608 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 130/570 (22%), Positives = 208/570 (36%), Gaps = 129/570 (22%) Query: 19 ASGMAAFAAHAADDVKLKATKTNVAFSDFTP-----TEYSTKGKPNIIVLTMDDLGYGQL 73 +G + ++ + + + +PN+IV DD G+G Sbjct: 1 MAGPTWLKTESLVNMNFRLLNSAIWLLAICCWFPSFVVAQNDQRPNVIVFLSDDQGWGDF 60 Query: 74 PFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSG 133 +TP + SL +G+ F N +V V Sbjct: 61 SCTG--------------------------NQSVATPNIDSLATQGLLFENFFV-CPVCS 93 Query: 134 PSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVP 193 P+RA +TGR + V + Q+ I L ET + + GY TAA GKWH Sbjct: 94 PTRAEFLTGRYHPQSNVKGVSQGQERIDLDETTIADCLSQAGYATAAFGKWHNGMQY--- 150 Query: 194 VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGY 253 + P RGFD F GF + Y +P+L N V +GY Sbjct: 151 --------------------PYHPCGRGFDDFYGFCSGHWGNYFNPTLEHNGRIVKGEGY 190 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS-------- 305 I+D T+ A+ ++ K+ QPF LYL YN PH P PD Y ++F Sbjct: 191 INDDFTNRALKFIEDHKS--QPFFLYLPYNTPHWP--PQMPDAYWQRFAEKEIVQRGQKG 246 Query: 306 -----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 + A V ++D V R+L +L + DNTI+++ +DNG + N Sbjct: 247 DKEDLAKTRSALAMVENIDWNVGRVLAKLDELKIADNTIVIYFNDNGPNSN---RWNAGM 303 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQ--PGNYDKLISAMDFYPTALDAADISIPKDLKLDG 418 KG K T GG +P+F+ W ++ +++ A+D YPT L A + D LDG Sbjct: 304 KGKKGSTDEGGVRSPLFVRWPNGVKGAGRRVNQICGAIDLYPTLLAATGSANVGDKILDG 363 Query: 419 VSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL 478 +LLP + + L Sbjct: 364 KNLLPIWDGSETNLGFRMLF--------------------------------------SY 385 Query: 479 SQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVK-------EMQGVV 530 + +VR + L +N L+ LTD Q ++++ P V + + Sbjct: 386 WRGKASVRTQQFRL-----DNNGWLFDMLTDPHQTKDISSDQPAVAALLLGSLIRFKQEM 440 Query: 531 REFIDSSQPPLSEVNQEKFNNIKKALSEAK 560 +DS++ P S + + F + +A+ Sbjct: 441 EAEMDSTKRPFSVGHPD-FAYTQLPARDAQ 469 >UniRef50_UPI0001927538 PREDICTED: similar to CG8646 CG8646-PA n=5 Tax=Hydra magnipapillata RepID=UPI0001927538 Length = 502 Score = 440 bits (1133), Expect = e-122, Method: Composition-based stats. Identities = 128/526 (24%), Positives = 209/526 (39%), Gaps = 80/526 (15%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 KP+II++ DDLG+ + F + + TP + Sbjct: 17 ADKPHIIMIVADDLGWNDISFHGSN--------------------------EIPTPNIDR 50 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELF 171 L + GV N YV + PSR+AIMTGR P G+ G+ L E FLP+ Sbjct: 51 LANNGVILDNYYVL-PICTPSRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYL 109 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T VGKWHL F A+++ P RGFD + G + Sbjct: 110 KQQGYKTHGVGKWHLG----------------------FFAKQYTPTYRGFDSYYGSYLG 147 Query: 232 GTAYYNSPSLF----------KNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 Y+N + +N Y ++ T EAI + +P LYLA Sbjct: 148 KGDYWNHSNTETYSGLDLHDNENGVFSQDGNYSTEMYTAEAISCI-NNHNSSEPLFLYLA 206 Query: 282 YNAPHL------PNDNPAPDQYQKQFNTGS-QTADNYYASVYSVDQGVKRILEQLKKNGQ 334 Y A H P AP ++ +F+ + Y A + +D GV R+ + L + Sbjct: 207 YQAVHSANTEEDPLQ--APQEWIDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAEKKM 264 Query: 335 YDNTIILFTSDNGAVIDG---PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK 391 DN+II+FT+DNG +G N +G K+ + GG F++ K P + Sbjct: 265 LDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATLFEGGVRGVSFVYSKLIESPRVSHE 324 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 LI D+ PT ++ A + LDG LQ+K+ + ++ L I + Sbjct: 325 LIHITDWLPTLVNLAGGKVSDGF-LDGFDQWATLQNKQSSQRNEVLLNIDEKVWKNEALR 383 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY---TVRNNDYSLVYTVENNQLGLYKL-T 507 + W + P + N + + FSY TV+ + + L+ + Sbjct: 384 VGSWKIIKEGNYWDGWYPPPSFNEQSNNSFSYLSSTVKCGHDIPIVINHCDSYCLFHIDE 443 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 D + ++L+ P+V+ E+ + + S PP + + + ++ K Sbjct: 444 DPCEINDLSKKFPEVLAELINRLNTYRQSMVPPRNNMTIDPRSDPK 489 >UniRef50_A7V8P8 Putative uncharacterized protein n=1 Tax=Bacteroides uniformis ATCC 8492 RepID=A7V8P8_BACUN Length = 525 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 140/559 (25%), Positives = 228/559 (40%), Gaps = 139/559 (24%) Query: 35 LKATKTNVAFSDFTPTEYSTK--------GKPNIIVLTMDDLGYGQLPFDKGSFDPKTME 86 +K + T V+ + P S +PNI+++ DD+G+G + + Sbjct: 1 MKVSCTLVSVAALLPFSGSNAGNVQRDKSQRPNIVLVIADDMGWGDVGYQG--------- 51 Query: 87 NREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPA 146 A STP + +L GV+F+ GYV+ +SGPSRA I+TG Sbjct: 52 -----------------AVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGVYQQ 94 Query: 147 RFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 RFG Y+N IP ++ L E+ ++ GY T VGKWH++ Sbjct: 95 RFGFYNNLHPWAKIPEGQSTLGEMVRDCGYATGFVGKWHMAD------------------ 136 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAY-----------YNSPSLFKNRERVPA----K 251 + E P RGFD F GF + Y Y+ L++N E P Sbjct: 137 -----SPEQSPNRRGFDQFYGFWSDTHDYYRSTDKPGVELYDFCPLYRNGEIQPPLHESG 191 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS---QTA 308 YI+D T EA+ +D+ + PF+L L+YNA H P P+ Y + + Sbjct: 192 EYITDCFTREAVEFIDKHASS--PFLLCLSYNAVHSPWQ--VPEHYVNRLEGRRFHHEDR 247 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-------------- 354 + A V ++D G+ R++E L+KNG +NT+ + SDNG+ + Sbjct: 248 KVFAAMVLALDDGIGRVMESLRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNT 307 Query: 355 --PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIP 411 G +GYK+ TY GG P M W +L G YD + ++D +PT + A + Sbjct: 308 TMSSPGPFRGYKADTYEGGIRVPYIMSWPSELPQGMVYDNPVISLDIFPTVMQAVGGTSR 367 Query: 412 KDLKLDGVSLLPWLQDKKQ--GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY 469 + LDGVSLLP+L+ + PH L W Sbjct: 368 QKYSLDGVSLLPYLKSEWPIDKRPHSTLYWRRDED------------------------- 402 Query: 470 PHNPNTEDLSQFSYTVRNNDYSLVYTVENN--QLGLYKLTDLQQK-DNLAAANPQVVKEM 526 + +R D+ LVY + + ++ L+ + D +++ +L+ P++ + Sbjct: 403 -------------FAIRKGDWKLVYNDQGSTRKIQLFDMKDDKEEVYDLSGEYPELADSL 449 Query: 527 QGVVREFIDSSQPPLSEVN 545 + + P ++ Sbjct: 450 LAEFDAWDAALPPCTNQTT 468 >UniRef50_A6DLE2 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DLE2_9BACT Length = 441 Score = 440 bits (1132), Expect = e-122, Method: Composition-based stats. Identities = 126/533 (23%), Positives = 201/533 (37%), Gaps = 116/533 (21%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + PNII++ DD G Sbjct: 1 MKILFCLFSLLCTSLL---ANEPPNIIIILADDAGSSDFSCYGS---------------- 41 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 Q TP + S+ G++FT Y A V PSRA ++TGR FG +N Sbjct: 42 ----------KQLLTPHIDSIAHNGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANI 91 Query: 155 DAQD---------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G+P+TE L + + GY T +GKWHL + Sbjct: 92 PHSKHSANDPELLGLPVTEITLADSLKELGYSTHCIGKWHLGE----------------- 134 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY-------NSPSLFKNRE-RVPAKGYISDQ 257 A+ + P RGFD F GF + Y+ + + +N+E P+ GY ++ Sbjct: 135 ------ADHFHPNARGFDNFYGFLSGARTYFLGGELRGDMDRIMRNKEFAEPSSGYTTEV 188 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 T EAI ++ D+PF +YL++NA H P D D F + Y + + Sbjct: 189 FTQEAIRIIQE--EQDKPFFIYLSHNAVHGPMDAKDEDIMSYDFK--NPLRKKYSGLMKN 244 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 +D +L+ LK + QY+NT+I F SDNG N +G+K + GG TP Sbjct: 245 LDDQTGLLLQALKDSKQYENTLIFFMSDNGGPTTHNGSSNWPLRGFKGSEFEGGNRTPFL 304 Query: 378 MWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 + W K+ G + DK I A D + T + AA + D G+ LLP + +K Q + Sbjct: 305 LQWPEKISAGLSSDKPIIAYDVFATCIQAAGGELVTDRTYHGIDLLPVI-NKPQETNARK 363 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W +Y++R + L Sbjct: 364 LFWSRG--------------------------------------KNYSMRQGKWKLNILP 385 Query: 497 ENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 + LY L D +K +L+ P++ ++ + ++ + L + +K Sbjct: 386 TGSS--LYNLENDQSEKHDLSEQFPEIKAQLIKEMSKWKSTHAEALWQTGYKK 436 >UniRef50_A6DHI2 Aryl-sulphate sulphohydrolase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI2_9BACT Length = 493 Score = 439 bits (1129), Expect = e-121, Method: Composition-based stats. Identities = 116/532 (21%), Positives = 212/532 (39%), Gaps = 89/532 (16%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 K A F KP+II++ +DDLG+ L + + Sbjct: 2 IIKIQCALVFFLALAGFAAEKPHIILINIDDLGWTDLSYQGSKY---------------- 45 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA 156 +P + +L G+ F GY A PSRA++++G+ R VY+ + Sbjct: 46 ----------YESPNIDALAKSGMIFDQGYAAAANCAPSRASLISGQQSPRTEVYTVGNP 95 Query: 157 QDG---------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 G + + + + GY TA +GK+H++K + Sbjct: 96 ARGASNKRKLIPSPNIDFVDADNFTIADAMNSAGYLTATLGKYHVAKDPLTHGWKI---- 151 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDE 261 N G F G + G Y+SP + N + Y+ D LTDE Sbjct: 152 -----------------NVGGREFGGPYNGG---YHSPYEYPNLKETEKGRYLCDHLTDE 191 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA--PDQYQKQFNTGSQTADNYYASVYSVD 319 AIG + + QP +Y Y H P +Y+ + T Y A + ++D Sbjct: 192 AIG-IFKEHGAQQPIFMYFPYYTIHAPIQGHPKFEPKYKAKAKTKGHFNPKYAAMIEALD 250 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW 379 V R++ L++ G + T+I+FTSDNG + + K Y GG P F Sbjct: 251 HNVGRLVAALEEQGLREKTLIMFTSDNGGHMK--FSRQEPLRAGKGSYYEGGIRVPFFAS 308 Query: 380 WKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNL 437 W G ++ G+ + ++ +DFYPT + A + +P D +DG S LP L+ + + ++ L Sbjct: 309 WPGVIEAGSRSQVPVTGLDFYPTVCELAGVELPDDKVVDGKSFLPLLKSEVDEDLKNRAL 368 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 W + ++ P + + ++ +R+ + L + E Sbjct: 369 YWHFPIY---------------LQAYLKPNEKPESRDPLFRTRPGSVIRHGKWKLHHYFE 413 Query: 498 NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQE 547 ++ + LY + D +K++L++ P+VV +++ + + + + E+N + Sbjct: 414 DDGVELYDINSDRSEKNDLSSEYPEVVSKLRNKLDSWRNGIGAFIPTELNPD 465 >UniRef50_Q7UX95 Arylsulfatase n=3 Tax=Planctomycetaceae RepID=Q7UX95_RHOBA Length = 538 Score = 438 bits (1128), Expect = e-121, Method: Composition-based stats. Identities = 132/606 (21%), Positives = 221/606 (36%), Gaps = 151/606 (24%) Query: 1 MKSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNI 60 +K + ++V+ S + + + A + + ++ + +T +PNI Sbjct: 23 LKHNMNQAVLMPSRKWVRWALLLVCVAGVPN------LDSTTVSAEEPNAKDATVSRPNI 76 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +++ DDLGYG+L + TP L L EG+ Sbjct: 77 VLIVADDLGYGELGCYG--------------------------QTKIRTPRLDQLAAEGI 110 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD----------------AQDGIPLTE 164 + TN Y + V PSR +MTG+ P V +N D Q +P+ E Sbjct: 111 KLTNFYSGNAVCAPSRCCLMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDE 170 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + E ++ GY T A GKW L P +GFD Sbjct: 171 VTIAEYLKSVGYRTGAFGKWGLGHFGTTG----------------------DPNEQGFDL 208 Query: 225 FMGFHAAGTAYYNSPS-LFKNRER---------VPAKGYISDQLTDEAIGVVDRAKTLDQ 274 F GF+ A+ + P+ L++NR + + + Y DQ +EA + ++ D+ Sbjct: 209 FYGFNCQRHAHNHYPNFLWRNRVKEVQPGNDRTLHGETYSQDQFVNEACEFIRQSVAEDK 268 Query: 275 --PFMLYLAYNAPHLPNDNPAPDQYQKQ------------FNTGSQTADNYYASVYSVDQ 320 PF YL + PHL P + + + Y A V +D+ Sbjct: 269 TQPFFAYLPFAVPHLSIQVPEEEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRMDE 328 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLN-------GAQKGYKSQTYPGGTH 373 GV ++++ + G +NT+I+FTSDNG D + KG K Q GG Sbjct: 329 GVGQVVDLVDSLGLGENTLIMFTSDNGPTYDRLGGSDSDYFNSASGMKGLKGQLDEGGIR 388 Query: 374 TPMFMWWKGKLQPGNYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 PM G + G I A DF PT DAA + + DG+S LP L + Sbjct: 389 VPMIARQTGVVPAGRTSDWIGAWWDFLPTITDAAGVEVDASTT-DGISFLPLLHGDDAAQ 447 Query: 433 P-HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 H+ L W +R ++ Sbjct: 448 QSHEFLYWEFP-----------------------------------GYSGQQAIRMGNWK 472 Query: 492 LVYTV----------ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVV-REFIDSSQP 539 + E LY L+ DL + ++++A++P V+ +++ + ++ + S Q Sbjct: 473 AIRKDLSKRLKKGQTEPPAFALYDLSKDLAESNDVSASHPDVMAKIEAIAKQQHVPSEQF 532 Query: 540 PLSEVN 545 PL ++ Sbjct: 533 PLRVLD 538 >UniRef50_B4D4S6 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D4S6_9BACT Length = 626 Score = 438 bits (1126), Expect = e-121, Method: Composition-based stats. Identities = 131/567 (23%), Positives = 205/567 (36%), Gaps = 133/567 (23%) Query: 34 KLKATKTNVAFSDFTPTEYSTKG-KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +L + S+ +PNI+ + DDLG+ + Sbjct: 3 RLLFFLLLIVCGAVARGAESSPKTRPNIVFILADDLGWSDTTLYGTT------------- 49 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP + L G++FTN Y A+ V P+RA+IMTG P R G+ + Sbjct: 50 ------------KFFETPNIERLAARGMKFTNAYAANPVCSPTRASIMTGLYPGRLGITT 97 Query: 153 NTD-------------------------AQDGIPLTETFLPELFQNHGYYTAAVGKWHLS 187 + + + L L E + GY T GKWHL Sbjct: 98 PSGHVPEEKLEASLVARGSPSQKSLQATSATRLKLEYFTLAEALKGAGYATGHFGKWHLG 157 Query: 188 KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA-GTAYYNSPSLFKNRE 246 E + P ++GFD + + G A Y +P +K+ + Sbjct: 158 ------------------------PEPFDPLHQGFDVDVPHWSGPGPAGYIAP--WKSPK 191 Query: 247 ---RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQF 301 + D ++ EAI + K D+PF L + H P ++Y+++ Sbjct: 192 FHLPAKPGEQLEDLMSQEAIKFIRVHK--DEPFYLNYWAFSVHSPWGGKPDLIEKYRRKA 249 Query: 302 NTGS-QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV----------- 349 + S Q Y A V S+D V R+L+ L + D+TII+F SDNG V Sbjct: 250 DPNSAQRNPVYGAMVESLDDAVGRLLDTLDELKLSDHTIIVFFSDNGGVNWFEPAMKEEA 309 Query: 350 -IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAAD 407 ++ P N + K Y GGT P + W GK + D ++ ++DFYPT L+ A Sbjct: 310 GMNSPPTTNAPLRAGKGTLYEGGTREPCVVVWPGKTKAATQNDAMLCSVDFYPTLLEMAG 369 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 ++ DLK DGVS +P L G P L Sbjct: 370 VAAKPDLKFDGVSQVPALLG--TGTPRDTLFCYYPVY----------------------- 404 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN-----NQLGLYKL-TDLQQKDNLAAANPQ 521 P + R D+ L+ + ++ LY L DL + +LAA P Sbjct: 405 ---SPPGHVVHTMPGVWGRRGDWKLIRYFHDADDQSDRYELYNLHDDLGETKDLAARFPD 461 Query: 522 VVKEMQGVVREFIDSSQPPLSEVNQEK 548 VKE+ ++ + + + N Sbjct: 462 KVKELNALIDAHLAETHALIPGKNPAY 488 >UniRef50_B4CZ54 Sulfatase n=3 Tax=Bacteria RepID=B4CZ54_9BACT Length = 500 Score = 437 bits (1124), Expect = e-121, Method: Composition-based stats. Identities = 116/524 (22%), Positives = 192/524 (36%), Gaps = 100/524 (19%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 + T KPNI+ + DD GYG L Sbjct: 9 SLVLCCCLAVVATAQGAPSKPNIVFILADDTGYGDLSATG-------------------- 48 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ 157 TP L L + VRFT+ +V P+R+A+MTGR + GV + Sbjct: 49 ------NPILKTPHLDKLYNAAVRFTDFHV-SPTCSPTRSALMTGRHEFKNGVTHTILER 101 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 + + + ++ ++ GY T GKWHL + QP Sbjct: 102 ERLNPDAITIAQVLKSAGYTTGIFGKWHLGD-----------------------EPDHQP 138 Query: 218 QNRGFDYFMGFHAAGTAY-------------YNSPSLFKNRERVPAKGYISDQLTDEAIG 264 RGFD G Y +P++ N +G+ +D T++AI Sbjct: 139 GQRGFDEVFIHGGGGIGQTYPGSCGDAPGNTYFNPAILHNGSFEKTQGFCTDIFTNQAIH 198 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-SQTADNYYASVYSVDQGVK 323 ++ + QPF Y+ YNA H+P PD+Y+K + Y+ V ++D+ V Sbjct: 199 WME-SVKGKQPFFCYIPYNAAHVP--VSCPDEYKKPYEGKVDDHLATYFGMVANIDENVG 255 Query: 324 RILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK 383 R+L +L + G +T+++F +DNG N +G K + GGT W Sbjct: 256 RVLAKLDEWGIAKDTLVVFMNDNGGHGPACKVFNAGMRGSKGSAWLGGTRAVSLWRWSDT 315 Query: 384 LQPGNYDKLISAMDFYPTALDAADISI--PKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 P + L S +DF+PT + A + ++DG SLLP L+D P + L Sbjct: 316 FAPHDAAGLASNIDFFPTLAELAGATPNEKAQKQVDGRSLLPLLRDGNAPWPERVLF--- 372 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ- 500 +P + + + +VR+ + LV + Sbjct: 373 ----------------------THVGRWPKGADVQAYKYAACSVRSGQWHLVSDGPPGKP 410 Query: 501 ----LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 L+ ++ D+ + ++ A +P VV + + S P Sbjct: 411 REKGWKLFDVSKDIGEDHDVVAEHPDVVTRLDAEYDRWWASVVP 454 >UniRef50_A6C176 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C176_9PLAN Length = 599 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 125/539 (23%), Positives = 197/539 (36%), Gaps = 108/539 (20%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 L + P + GKPNII++ DD GYG + Sbjct: 7 LLTFILILLVSLKDCPADTPDSGKPNIILVITDDQGYGDIAAHG---------------- 50 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP L L + +R TN +V P+R+A+MTGR R GV+ Sbjct: 51 ----------NQMIKTPNLDQLYQKSLRLTNFHV-DPTCAPTRSALMTGRYSTRTGVWHT 99 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + + E L E+F+++GY T GKWHL + P+D+ + + Sbjct: 100 IMGRSLMDTNEVTLAEVFKSNGYRTGLFGKWHLGDNYPL-RPQDQGFGTVVQHGGGGVGQ 158 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 P + DYF S + +N + +GY +D DEA+ ++ Sbjct: 159 T--PDDWQNDYF------------SDTYLRNGKPEKFQGYCTDIWFDEALKFIE--ADRT 202 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 +PF YL+ NAPH P + + +Y + ++D+ + R+L LK++G Sbjct: 203 KPFFAYLSTNAPHSPYLVDPEYSDPYEDKGVPKKMAAFYGMITNIDENMGRLLRYLKESG 262 Query: 334 QYDNTIILFTSDNGAVIDGP--------------------------LPLNGAQKGYKSQT 367 NTI++F +DNG N +G K Sbjct: 263 LEKNTILIFMTDNGTAAGLQRPSTEDLSKKQQRRLSKGKPITLETWPGFNARMRGTKGSE 322 Query: 368 YPGGTHTPMFMWWK--GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG P ++ W G N ++L + +D PT D D++I +LKLDG SL+P L Sbjct: 323 YDGGHRVPCYIHWPQGGLTGGKNINQLTAHIDILPTLADLCDLTISSELKLDGTSLVPIL 382 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 K ++ L + ++ +V Sbjct: 383 TGNKDALRNRTLIVHSQRIESPEK------------------------------WRKSSV 412 Query: 486 RNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + LV N+ LY + D Q N+AA VVK + ++ S P S Sbjct: 413 MAERWRLV-----NEKELYDIQNDPGQTKNVAAEYAGVVKYLSAEYEKWWSSLTPVFSR 466 >UniRef50_Q7UYW3 Arylsulfatase B n=1 Tax=Rhodopirellula baltica RepID=Q7UYW3_RHOBA Length = 520 Score = 436 bits (1123), Expect = e-121, Method: Composition-based stats. Identities = 132/551 (23%), Positives = 203/551 (36%), Gaps = 130/551 (23%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 + + + PNI+V+ DD+GYG + Sbjct: 35 VFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGS------------------ 76 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA 156 TP L L + GV + YVA V PSRA ++T R P RFG N +A Sbjct: 77 --------QTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNA 128 Query: 157 QD----------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 D G+P +E L + GY TA +GKWHL Sbjct: 129 SDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGM------------------ 170 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---PSLFKNRERVPA--KGYISDQLTDE 261 E P RGFD+F G Y+ + + +N +RV Y++D TDE Sbjct: 171 -----GEMHHPNRRGFDHFCGMLTGSHHYFPATMKHVIERNGKRVDDFSSEYLTDFFTDE 225 Query: 262 AIGVVDRAK--TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVD 319 + +D+ K DQP+ ++ +YNAPH P D + N +Q Y A +Y++D Sbjct: 226 GLRFIDQHKSANPDQPWFVFFSYNAPHTPMHATEADL-ARFANIQNQKRRTYAAMMYALD 284 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW 379 +GV RI E L++ GQ++NT+++F SDNG + NG +G K GG PM Sbjct: 285 RGVGRIREHLEETGQWENTLLVFFSDNGGATN-NGSWNGPLRGVKGSMREGGIRVPMIWT 343 Query: 380 WKGKLQPGN-YDKLISAMDFYPTALDAADISIPK-------------------DLKLDGV 419 W K G YD ++S++D PT AA DG+ Sbjct: 344 WPAKFPAGVLYDGVVSSLDLLPTFCSAAGAEPLALADPMSHEDASNRKRMNRLSGTHDGI 403 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 + P L D + + L W Sbjct: 404 DMAPHLADGSEPPNRR-LYWRL-------------------------------------- 424 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 Q + + L+ L+++ TD+ + +L+A NP +E+ + + +S Sbjct: 425 QGQAAILDGTDKLLRPSHR-PAELFEVSTDVSESHDLSAQNPSRFRELYDELGAW-ESML 482 Query: 539 PPLSEVNQEKF 549 + + Sbjct: 483 TTVPLWGSSPY 493 >UniRef50_C6I9F7 Sulfatase n=4 Tax=Bacteroides RepID=C6I9F7_9BACE Length = 493 Score = 436 bits (1122), Expect = e-120, Method: Composition-based stats. Identities = 134/567 (23%), Positives = 210/567 (37%), Gaps = 130/567 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTKGK---PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +K +A T T + K PN+I + DDLGY L F Sbjct: 1 MKRLILPIACGICTVTSDAQTDKQPHPNVIFIYADDLGYTDLSCTGSRF----------- 49 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 TP + L EGV FT Y A VS PSRAA++TG+ PAR + Sbjct: 50 ---------------YETPHIDKLAREGVCFTQSYAACPVSSPSRAALLTGKYPARINLT 94 Query: 152 SNTDAQD-----------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 + E + E F+ +GY T GKWHL Sbjct: 95 DYIPGDRAYGPHKNQRLASLPFNLHLSKDEITMAEAFRQNGYSTFMAGKWHL-------- 146 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA--YYNSPSLFKNRERVPAKG 252 + E+ P+ GFD +G + G Y SP + P Sbjct: 147 ---------------AESAEYYPEQNGFDINIGGNNTGHPSKGYFSPYGNPQLKDGPEGE 191 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD--QYQKQ---------- 300 Y++D+LTDE I + K ++PF +YL+Y HLP A +Y+++ Sbjct: 192 YLTDRLTDEVIRYISEPK--EKPFFVYLSYYTVHLPLQAKAEKIAKYRRKLSRAVPADSS 249 Query: 301 -------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 ++ Q Y A V S+D+ + R+L+ L ++G + TI++FTSDNG + Sbjct: 250 FVKKGETYHKLVQDIPAYAAMVESLDENIGRLLDTLHRSGLDERTIVVFTSDNGGMATSN 309 Query: 354 -----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAAD 407 N + K Y GG P + W L+ D I D+YPT LD Sbjct: 310 TTRNIPTSNLPLRAGKGYLYEGGIKVPAIIRWSRHLKGRQVSDTPIIGTDYYPTLLDLCG 369 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 + + +DGVS+ P LQ + P +L W + Sbjct: 370 LPLLPGQHVDGVSMKPVLQGGRLSRP--SLFWHYPHYSGGLGG----------------- 410 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEM 526 + S +R DY L+ E++ + LY + D ++ +L+ P++ + Sbjct: 411 ------------RPSAAIREGDYKLIEFFEDHHVELYNVIQDESEEKDLSQIYPEIADGL 458 Query: 527 QGVVREFIDSSQPPLSEVNQEKFNNIK 553 + + + + N + +K Sbjct: 459 RKKLYLWYKEVGARMPVDNPHYVSPVK 485 >UniRef50_Q15XI1 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XI1_PSEA6 Length = 510 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 130/571 (22%), Positives = 208/571 (36%), Gaps = 126/571 (22%) Query: 26 AAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 H L V S KPN++++ +DDLGY + Sbjct: 7 RRHLQRLSYLLFITLIVCESVLNSCAAQVVTKPNVLLILVDDLGYSDIKAYN-------- 58 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 E + TP + L + V FTNGY A+ V PSR A++TG+ P Sbjct: 59 -----------------ENSFYDTPNIDKLASQSVMFTNGYAANPVCSPSRFALLTGKHP 101 Query: 146 ARFGVYSNTDAQD---------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 R A D +PL+E L E F+ +GY TA +GKWHL K Sbjct: 102 TRGKATDWFPANDKPARAGRFLPAEFNDALPLSEITLAEAFKQNGYNTAFLGKWHLGKTE 161 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA--YYNSPSLFKNRERV 248 ++ P+N+GFD + G Y SP Sbjct: 162 DL-----------------------WPENQGFDVNIAGTKNGHPAAGYFSPYKNARLTDG 198 Query: 249 PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD--QYQKQFNTG-- 304 P Y++ +LT+EAI +VD+ PF + L++ H P P D +YQ + Sbjct: 199 PKGEYLTQRLTNEAISLVDKYSKQTVPFFMMLSFYTVHTPLAAPNKDVQEYQAKIRQYAH 258 Query: 305 ---------------------SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 Q Y A V +D V R+L +LK+ G ++T+++FT Sbjct: 259 NDEFQREEQVWPTAEKREVRVKQNHPTYAAMVKQMDTQVGRLLAKLKQAGMEESTLVVFT 318 Query: 344 SDNGA--VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGNYDKLISAMDFYP 400 SDNG +G N +G K Y GG P+ + K + ++ +++ D YP Sbjct: 319 SDNGGLSSAEGSPTSNLPLRGGKGWLYEGGIRVPLLVKLPQKKHKHLQINEPVTSTDLYP 378 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQ--DKKQGEPHKNLTWITSYSHWFDEENIPFWDNY 458 T L A + + LDGV L + K+ + L + + Sbjct: 379 TLLSAGHLDLLPQQHLDGVDLNQYFSPGAKRDALMRRPLYFHYPHYSNQGGF-------- 430 Query: 459 HKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAA 517 +R ++ L+ E+ ++ LY L D+ ++ +LA Sbjct: 431 ----------------------PGAAIRQGNWKLIERFEDGKVHLYNLANDIGEQIDLAN 468 Query: 518 ANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 P+ V ++ + E+ + + K Sbjct: 469 QAPERVASLRKKLHEWYQQTSARFLKAKGNK 499 >UniRef50_Q7UPK7 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UPK7_RHOBA Length = 482 Score = 436 bits (1121), Expect = e-120, Method: Composition-based stats. Identities = 124/552 (22%), Positives = 207/552 (37%), Gaps = 108/552 (19%) Query: 2 KSALKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNII 61 K ++K S +I ++L+ A A D ++ +T +PN+I Sbjct: 10 KPSMKFSPFVAAILILLSLNECHGQAPAVQD----------GDANAKSESDATSRRPNVI 59 Query: 62 VLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVR 121 V+ DDL G L + + TP L E ++ Sbjct: 60 VILADDLAVGDL--------------------------AGGDGSPTRTPNLDRFASESIQ 93 Query: 122 FTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----IPLTETFLPELFQNHGYY 177 F+ Y V P+RAA++TGR P R GV + + + ET + ++ ++ GY Sbjct: 94 FSQAYSGSCVCAPARAALLTGRYPHRTGVVTLNMNRYPEMTRLRRDETTIADVLKDAGYA 153 Query: 178 TAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN 237 T VGKWH + + P +RGFD F GF + Y Sbjct: 154 TGLVGKWHTG-----------------------RGDGFHPLDRGFDEFEGFFGSDDVGYF 190 Query: 238 SPSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 + R+ Y++D L AI V R + PF L+LA+ APH P + P Sbjct: 191 RYPFSEQRQISDVDESYLTDDLNRRAIEFVRRHH--EHPFFLHLAHYAPHRPLEAPPEVI 248 Query: 297 YQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL 356 + + ++ YA + +D+G+ +L ++ G ++TI+LF SDNG Sbjct: 249 ARYREQGFDESTATIYAMIEVMDRGIGELLAEIDDLGLSEDTIVLFASDNGPDPLTGERF 308 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKL 416 N +G K Q GG P+F+ W +L PG D++++ +D PT LD + + +L Sbjct: 309 NRELRGTKYQVNEGGIRVPLFVRWSKRLAPGQRDQMVTFVDLMPTILDLCRVDVSMLNRL 368 Query: 417 DGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTE 476 DG S +P L+D + H + + Sbjct: 369 DGESFVPVLEDAS--------------------------------IAHSTMRFWQWNRAS 396 Query: 477 DLSQFSYTVRNNDYSLVYTV---------ENNQLGLYKL-TDLQQKDNLAAANPQVVKEM 526 + VR+ Y LV L+ L D + +++ P + + M Sbjct: 397 PNYTHNAAVRHGRYKLVRPYVTRGAKLKDSTEPSVLFDLQNDPTESRDVSKQYPDIAERM 456 Query: 527 QGVVREFIDSSQ 538 + + S + Sbjct: 457 SRELDRWSASVE 468 >UniRef50_A4A2W0 Arylsulfatase A n=1 Tax=Blastopirellula marina DSM 3645 RepID=A4A2W0_9PLAN Length = 477 Score = 435 bits (1120), Expect = e-120, Method: Composition-based stats. Identities = 124/537 (23%), Positives = 194/537 (36%), Gaps = 133/537 (24%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 +L A V + + KPN +++ +DDLGY + + Sbjct: 2 PVSLSRLLALLIVVGWLVSSSCAQEVATKPNFVIINIDDLGYADIEPFGSEVN------- 54 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR- 147 TP L ++ DEG++ T Y A V PSRAA+MTG P R Sbjct: 55 -------------------RTPNLNAMADEGMKLTCFYAA-PVCSPSRAALMTGCYPKRA 94 Query: 148 FGVYSN--TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 + +G+ E + EL + GY TA +GKWHL Sbjct: 95 LTIPHVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLGDQ---------------- 138 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGT---------AYYNSP------------SLFKN 244 ++ P +GFDY+ G + + Y +P L +N Sbjct: 139 -------PDFLPTRQGFDYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQPPLPLLRN 191 Query: 245 RERVP-----AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQK 299 + + + T+EAI + + ++PF LYL ++A H P Y Sbjct: 192 ETVLQRVLAKDQTELVTNYTEEAIQFIRDHQ--EKPFFLYLPHSAVHFPM-------YPG 242 Query: 300 QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGA 359 G + Y V VD V ++L+ LK G T+++FTSDNG +N Sbjct: 243 DAFRGKNSHGLYNDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFGA-VNKP 301 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDG 418 + K+ TY GG P + W GK+ G + D ++ +D PT + A + P D K+DG Sbjct: 302 LRAGKATTYEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTTPTDRKIDG 361 Query: 419 VSLLPWLQD-KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 + P L K+ PH + Y Sbjct: 362 ADIGPILAGVKEAKSPHDVFYFYRGYDLE------------------------------- 390 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 VR+ + L + LY L D+ + N+A N VV+ ++ + E Sbjct: 391 ------AVRSGPWKL----RLKEGALYNLHEDISEAKNVAPDNADVVERLRKIAAEM 437 >UniRef50_A6DKM2 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DKM2_9BACT Length = 472 Score = 435 bits (1119), Expect = e-120, Method: Composition-based stats. Identities = 121/550 (22%), Positives = 202/550 (36%), Gaps = 116/550 (21%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 K S T + + KPNII++ DDLG L F Sbjct: 1 MKIYFILSCLCFTLFGAQ-KPNIILILADDLGGAGLGCYGNEFFG--------------- 44 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---- 153 TP + +L + +RF N Y V PSRA +M+G+ R + Sbjct: 45 -----------TPNIDALAAKSMRFDNAYSGSTVCAPSRACLMSGQYVGRHKITWVSQFQ 93 Query: 154 -------------------TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 +P L + F++ GY TA GKWHL Sbjct: 94 RDYIKKKRGPNLNGFRLLQPVHPYHMPEGTITLGQAFKDAGYATAMFGKWHLGH------ 147 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYI 254 + QP GFD ++ F + +P N+ + K Y+ Sbjct: 148 -----------------RPQDQPDKMGFDEYLTFQG---MKHFAPYTLPNKVQHGEKVYL 187 Query: 255 SDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP-APDQYQKQFNTGSQTADNY-Y 312 +D D+AI ++R ++PF LY H P + A QY ++ G Sbjct: 188 TDLTCDKAIDFMERKVAAEKPFFLYYPDFLVHAPMEAKQAMIQYFEKKTIGQHHKSVIGA 247 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-------DGPLPLNGAQKGYKS 365 A +D V R+++++ + G +NTII+FTSDNG + N + KS Sbjct: 248 AMTKHLDDTVGRLVKKVDELGIAENTIIIFTSDNGGLGYKSDGGYGDKGTSNYPYRSAKS 307 Query: 366 QTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 Y GG+ P+ W G + + +++S +D YPT L A ++ P++ LDG+ Sbjct: 308 SHYEGGSRVPLIFHWPGVTEANSLSHEVVSGIDIYPTLLKIAQVAKPQEQILDGIDFSSI 367 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L++ KQ P ++L + + + Sbjct: 368 LKNPKQKLPARDLFHYQPIYNHKVFGDASV-----------------------------S 398 Query: 485 VRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 +R D +Y L+ L D+ QK +L+A P++ +E++ + +D + Sbjct: 399 LRRGDMKYIYYFVEENFELFNLKDDVSQKKDLSADYPELCEELKKACFKHLDETDALRMT 458 Query: 544 VNQEKFNNIK 553 +N + +K Sbjct: 459 LNPDYDPKLK 468 >UniRef50_UPI00017445FC Arylsulfatase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017445FC Length = 481 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 132/565 (23%), Positives = 213/565 (37%), Gaps = 146/565 (25%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 +A +PN+IV DDLGYG+L Sbjct: 1 MLAAVVTVAASLQASARPNVIVFLADDLGYGELGCYG----------------------- 37 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-- 158 + TP L L +G+RFT+ Y H V PSR ++TG+ V N++ + Sbjct: 38 ---QKKIKTPNLDQLAADGMRFTDFYSGHAVCAPSRCVMLTGKHTGHSFVRENSEGRAAQ 94 Query: 159 -----------------GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 +P +E Q GY TA VGKW L SN Sbjct: 95 AKERNRIKAADGYLPQIALPASEATYASALQKSGYRTACVGKWGLGHPSN---------- 144 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPAKG-------- 252 E P GFD F G+ + A+Y P L++N + P +G Sbjct: 145 ------------EGSPNKHGFDLFYGYISQWQAHYYYPTYLWRNDVKEPLEGNDGKVGRQ 192 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT--------- 303 Y +D + EA+ ++ T PF LY A PH+ P + ++ Sbjct: 193 YAADLMEQEALKFME--TTGGGPFFLYYATPVPHVSLQVPPDEPSLAEYKQAFAGQDPPY 250 Query: 304 --------GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-- 353 Y A V +D+ + + + LK+ GQ NT+I+FTSDNGA +G Sbjct: 251 DGRKSYLPTEDPRAIYAAMVTRMDRTLGKFRDLLKRTGQDQNTLIIFTSDNGATFNGGYD 310 Query: 354 ---LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI-SAMDFYPTALDAADIS 409 N +G K+Q + GG TP W G +QPG + + ++ D +PT + Sbjct: 311 REFFGGNQPLRGMKTQLWDGGIRTPFIAAWPGSIQPGQVSRFVGASWDLFPTFAEIVGFP 370 Query: 410 IPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD 468 +P LDGVS+LP L+ + + H +L W T Sbjct: 371 VPAG--LDGVSILPTLKGEVATQKQHDHLYWET--------------------------- 401 Query: 469 YPHNPNTEDLSQFSYTVRNNDYSLVY----TVENNQLGLYKL-TDLQQKDNLAAANPQVV 523 ++ VR + + + + L+ L TD+ + ++AA +P +V Sbjct: 402 ---------VAGGHQAVRMGPWKGIRLGVIKNPSAPVQLFNLETDVSETTDVAAQHPDIV 452 Query: 524 KEMQGVVRE-FIDSSQPPLSEVNQE 547 ++ ++ + S++ P+ E+++ Sbjct: 453 AKIATIMSAGRVPSAEFPMGELDRP 477 >UniRef50_A0YAF7 Arylsulfatase A n=4 Tax=Bacteria RepID=A0YAF7_9GAMM Length = 479 Score = 434 bits (1117), Expect = e-120, Method: Composition-based stats. Identities = 117/537 (21%), Positives = 195/537 (36%), Gaps = 137/537 (25%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 + PN+I++ DD+GYG + Sbjct: 29 AVANPSHQSPNVIIIFADDMGYGDIGAYG--------------------------HPTIR 62 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS------NTDAQDGIPL 162 +P L + EG+++TN Y A V PSRA ++TGR P R G+ + G+P Sbjct: 63 SPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQIRVLFPTSTGGLPT 122 Query: 163 TETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF 222 TE + + + Y TA VGKWHL + +QP + GF Sbjct: 123 TEITIAKALKEKDYRTALVGKWHLGHL-----------------------PGFQPLDHGF 159 Query: 223 DYFMGFHAAGT-----------------AYYNSPSLFKNRERVP---AKGYISDQLTDEA 262 D + G + + L +NR + + I+ + T EA Sbjct: 160 DEYFGIPYSNDHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNTITKRYTQEA 219 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 + + + +QPF LYLA++ PH+P + GS Y + +D V Sbjct: 220 VSFIKKN--SNQPFFLYLAHSMPHVPL-------FASDQFRGSSDRGLYGDVIEEIDWSV 270 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAV--IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW 380 ++L L + G +NT+++FTSDNG + G K K +Y GG P WW Sbjct: 271 GQVLSTLSEQGISENTLVVFTSDNGPWLIMGAHGGSAGLLKSGKGTSYEGGMREPAIFWW 330 Query: 381 KGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 K++P S +D +PT + A I +P D DG L P + ++K E + Sbjct: 331 PEKIKPAVAHNTASTLDLFPTIMSIAGIDMPSDRSYDGYDLSPTMFEQKSNERKNIFYYH 390 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL-------V 493 + VR D+ + + Sbjct: 391 GDKI--------------------------------------FAVRQGDWKVHFKTVANI 412 Query: 494 YTVENN-----QLGLYK-LTDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 YT E ++ L D ++ ++ A NP ++ ++ + S +P +++ Sbjct: 413 YTKEQKILTHTPPQVFNLLVDPSERFDVGAVNPAIIASAAKLIEQHQLSVKPVENQL 469 >UniRef50_B1KD78 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD78_SHEWM Length = 483 Score = 434 bits (1116), Expect = e-120, Method: Composition-based stats. Identities = 177/517 (34%), Positives = 273/517 (52%), Gaps = 45/517 (8%) Query: 44 FSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE-VVDTYKIGIDKAI 102 + + + PN++++ DD+G+G + + + + D+ + + A Sbjct: 3 LMLGSSAIAAQQTPPNVVIVLADDMGFGHVAMNLDLATADSYNPQNLKRDSQRHKPELAR 62 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD-GIP 161 A+K+TPTL L +EGVRFTN YV + GPSRAA+MTGR P RFG+Y+N D + G+P Sbjct: 63 SYAKKATPTLTQLANEGVRFTNAYVPSPLCGPSRAALMTGRYPQRFGIYNNADVKAAGLP 122 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 + E L F+ GY T AVGKWHL P +RG Sbjct: 123 VEENVLANNFRKAGYRTGAVGKWHL----------------TKGEKKASYTLAQHPLDRG 166 Query: 222 FDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLA 281 FD+F GF +GT YY+S L NR+ V A+GY++DQLT+ AI + + +PF LY+A Sbjct: 167 FDFFFGFDRSGTPYYDSKILELNRKPVKAEGYLTDQLTNHAIDFI--NQDKSKPFFLYMA 224 Query: 282 YNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIIL 341 YNA H P + AP +YQ FN+G + D +Y+ +Y++DQGV +I++QL NGQ DNTII+ Sbjct: 225 YNAVHGPLNKAAPKEYQAPFNSGDRYLDYFYSYLYALDQGVAKIIKQLDSNGQLDNTIIM 284 Query: 342 FTSDNGAVIDGPLPL--NGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGNYDK-LISAMD 397 F SDNGA P PL N GYK Q + GGT P+ +W + G D +IS+MD Sbjct: 285 FLSDNGAPGGKPFPLPANAPFTGYKGQVWQGGTRVPVVIWGPKALVNGGRVDDAVISSMD 344 Query: 398 FYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDN 457 PTAL AA + + + LDG +LLP L K+ E + L W + SH + Sbjct: 345 LIPTALAAAGVDLSDN--LDGNNLLPKL--KRVEEDERQLFWASQLSHHWG--------- 391 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLA 516 D + + ++ ++ VR+ ++ L Y ++ + L+ + TD + ++A Sbjct: 392 ------FIRDAKGKKIDDKSTAEPAWAVRSGEWMLRYWADSKKTELFNVSTDHAEHHDIA 445 Query: 517 AANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 +PQVVK++ + + D+ P + ++ + ++ Sbjct: 446 NKHPQVVKQLTADYKVWFDTLAKP-AGWDKRYWEQLE 481 >UniRef50_A6DM29 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DM29_9BACT Length = 481 Score = 433 bits (1115), Expect = e-120, Method: Composition-based stats. Identities = 141/524 (26%), Positives = 216/524 (41%), Gaps = 79/524 (15%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 + + A ++ TP +PNI+++ DDLGYG L Sbjct: 9 RFLSVLALFISMNLYAQTPQTKKDTERPNIVLILCDDLGYGDLACYG------------- 55 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 Q TP L + EG+RF + Y A V SR ++TGR+P R GV Sbjct: 56 -------------HKQIKTPNLDQMAKEGIRFNHFYSAAPVCSASRVGLLTGRSPNRAGV 102 Query: 151 YSNTDAQD-----GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 Y + E P+L Q GY T GKWH Sbjct: 103 YDWIPHSSESSSPHMRKNEITFPQLLQKAGYATCLSGKWH-------------------C 143 Query: 206 NFTTFSAEEWQPQNRGFDYFMGF-HAAGTAYYNSPSLFKNR-ERVPAKGYISDQLTDEAI 263 N + + QPQ+ GFDY+ + A ++ N + +N E P +G+ +T+EAI Sbjct: 144 NGALINTNQAQPQDAGFDYWFATQNNAAPSHKNPVNFIRNGVELGPIEGFSCQIVTNEAI 203 Query: 264 GVVDRA--KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQG 321 ++ + QPF +YL+++ PH P +P + + Y+A+V ++D+ Sbjct: 204 NWMEDHVKQNEKQPFFIYLSFHEPHEPIASPQKIVDTYKGIAENTNQAEYFANVENLDKA 263 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDG-------PLPLNGAQKGYKSQTYPGGTHT 374 V ++ QLKK DNT+++FTSDNG G KG K T G Sbjct: 264 VGSLMNQLKKLKINDNTLVIFTSDNGPETLNRYEAASRSYGSPGELKGMKLWTAEAGFRV 323 Query: 375 PMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P M W K+ G D++ISA+DF+PT D A S K L LDG + P L KK+ Sbjct: 324 PAIMHWPEKIATGQISDQVISALDFFPTFCDLAQASNSKSLNLDGSNFTPALH-KKKMTR 382 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 HK L WI Y +E + K + HN + + ++ V Sbjct: 383 HKPLLWI--YYAALNERQVAMRHGDWKISAKLNLPRYHN------------ITSKNFPKV 428 Query: 494 YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVR-EFID 535 + LY L+ D + ++L+ NP+ +M ++ ++ D Sbjct: 429 TAATLSDYQLYNLSKDKSEANDLSNQNPKKSAQMIKFLKLQYQD 472 >UniRef50_A6DHI1 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DHI1_9BACT Length = 472 Score = 432 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 138/548 (25%), Positives = 214/548 (39%), Gaps = 123/548 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K T + + KPNII + DDLGYG++ ++ Sbjct: 1 MKILFTVLTLISMPLL---AQMKPNIIYILCDDLGYGEVGYNG----------------- 40 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN- 153 TP L L +G+RFT+ Y + V PSRA+++TG+ P + +N Sbjct: 41 ---------QKMIQTPELDKLASKGMRFTDHYCGNAVCAPSRASLITGKHPGHAFIRANS 91 Query: 154 ---TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 D Q IP L +L + GY TA +GKW L N Sbjct: 92 PGYPDGQTPIPADSETLGKLMKRAGYATACIGKWGLGGFHNAGN---------------- 135 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERV-------PAKGYISDQLTDEA 262 P +GFD+F G+ A+ P L++N E+ Y D +T +A Sbjct: 136 ------PHKQGFDHFYGYTDQRKAHNYYPEYLWRNGEKEMLNNKNGEENDYSHDLMTVDA 189 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGV 322 + ++ DQPF LYLAY PH+ P QY+ + + + A +D+ + Sbjct: 190 LKYIEE--KKDQPFFLYLAYLIPHVKYQVPDLAQYKDK--DWPKEMKIHAAMTSRMDRDI 245 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGP----LPLNGAQKGYKSQTYPGGTHTPMFM 378 I +L++ G DNT+I+F SDNGA +G KG K Y GG +PM Sbjct: 246 GTIARRLEELGIADNTLIMFNSDNGAHGKSNSEKFFNTSGDLKGLKRSMYDGGVRSPMIA 305 Query: 379 WWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPHKN 436 +W G +Q G+ IS D PT + P + DG+S+LP L K + + HK Sbjct: 306 YWPGTIQAGSVSDHISAFWDMMPTFSELTG--EPFKGETDGISMLPTLLGKDSEQKQHKY 363 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV--Y 494 L W + ++ + +R + V Sbjct: 364 LYWE----------------------------------LYESNKPNCAIRFGKWKGVVLD 389 Query: 495 TVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVV-REFIDSSQPPLSEVNQEKFNNI 552 + + LY ++ D + NLAA P+VV E++ ++ + S ++ Sbjct: 390 RRKGLNIELYDMSGDQSESKNLAAQYPEVVDEIRKMMVEAHVKS----------PYWDKD 439 Query: 553 KKALSEAK 560 K L AK Sbjct: 440 FKPLYNAK 447 >UniRef50_B4DBQ5 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4DBQ5_9BACT Length = 483 Score = 432 bits (1112), Expect = e-119, Method: Composition-based stats. Identities = 135/563 (23%), Positives = 202/563 (35%), Gaps = 140/563 (24%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 +L A E +T KPN+I + DDLG G L Sbjct: 3 RLTALFFAALAGCAFAAEPATPAKPNVIFILADDLGIGDLGCYG---------------- 46 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 + TP + L +G+RF Y V PSR A+MTGR + N Sbjct: 47 ----------QQKIRTPNIDHLAADGMRFLQHYTGCSVCAPSRCALMTGRHMGHAAIRDN 96 Query: 154 T------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + Q +P + L QN GYYT +GKW L Sbjct: 97 AQRGPSEEGQRPMPQDTFTVARLMQNAGYYTGIIGKWGLGMPE----------------- 139 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKNRER----------------VPA 250 + P++ GF+Y G+ A+ Y P L++N ER + Sbjct: 140 -----DHSSPRDMGFNYSFGYLCQSMAHTYYPPYLWRNNERETLAGNPSYDVSMKGVIEP 194 Query: 251 KG--YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA-------------PD 295 KG Y D + +A+ V D+PF LYLA+ PHL P P Sbjct: 195 KGEIYSHDVMASDALKFVRDHH--DKPFFLYLAFTIPHLSLQVPEDSMSEYHGQWTETPF 252 Query: 296 QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG------AV 349 + K + Y + +D+ V R++ LK+ G DNT++ F+SDNG Sbjct: 253 RNTKHYANNETPRAAYAGMITRMDRDVGRLMALLKELGIDDNTLVFFSSDNGAVFPLAGT 312 Query: 350 IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADI 408 G +GYK Y GG TP+ W GK++ G D+ DF PT + + Sbjct: 313 DPVFFQSTGGFRGYKQDLYEGGIRTPLIARWPGKIETGVTTDQASVFYDFLPTMAELNGV 372 Query: 409 SIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 P D DG+S LP L K Q + H L W + Sbjct: 373 PPPADT--DGLSYLPTLLGKPAQQKQHDFLYWEYQSAG---------------------- 408 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVYTV----ENNQLGLYKL-TDLQQKDNLAAANPQV 522 + VR D+ + N +Y L +D + ++AA +P++ Sbjct: 409 -------------GAVAVRMGDWKAIANKIKKNPNANFEVYNLASDRTESHDVAAEHPEI 455 Query: 523 VKEMQGVVREFIDSSQPPLSEVN 545 V + + ++ + + P+ E N Sbjct: 456 VAKAREIIA--REHTPSPIKEWN 476 >UniRef50_A6DR29 N-acetylgalactosamine-6-sulfatase n=3 Tax=Bacteria RepID=A6DR29_9BACT Length = 510 Score = 431 bits (1110), Expect = e-119, Method: Composition-based stats. Identities = 116/536 (21%), Positives = 200/536 (37%), Gaps = 99/536 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 +K K+ + + F+P + KPN+I++ DDLG+G F+ Sbjct: 1 MKTKSLLIAASAALFSPFISAESAKPNVILIMADDLGWGDTGFNGS-------------- 46 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L + EG++ Y A V P+RA+++TGR P R GV + Sbjct: 47 ------------KVIKTPHLDQMAAEGLQLDRFYSASSVCSPTRASVLTGRNPYRTGVPT 94 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT--- 209 + E LPE+ GY T GKWHL +++ ++ F Sbjct: 95 ANQGF--LRPEEITLPEVLNEQGYATGHFGKWHLGTLTHTEKDANRGKPGNTKEFNPPKL 152 Query: 210 ----------FSAEEWQPQ------NRGFDYFMGFHAAGTA----YYNSPSLFKNRERVP 249 + P ++G +G+ Y + +++ Sbjct: 153 HGYEDAFVTESKVPTYDPMILPAKFDQGESKHLGWEYVKEGEESKPYGTFYWDIEGKKIT 212 Query: 250 A--KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 KG S + D + +D+A ++PF+ + ++ PHLP ++Q+ + Sbjct: 213 DNLKGDDSRVIMDRVLPFIDQAVADEKPFLSVVWFHTPHLPCVAGP--RHQEMYKGHPIH 270 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG--PLPLNGAQKGYKS 365 NY V ++D+ + R+ + L G DNT+I F SDNG G +G K Sbjct: 271 LRNYAGCVTAMDEQIGRLRKHLADKGVADNTMIWFCSDNGPESKERPDNGSAGHFRGRKR 330 Query: 366 QTYPGGTHTPMFMWWKGKLQ-PGNYDKLISAMDFYPTALDAADISIPK-DLKLDGVSLLP 423 Y GG P M W K++ D+ PT LDA I P+ DG SL+P Sbjct: 331 DLYEGGVRVPAVMVWPAKVKEARKISAPCITSDYMPTILDALHIPHPQASYATDGRSLMP 390 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 + ++ + +S W Sbjct: 391 IINNEDFTRDKEIGIMFSSRIVW------------------------------------- 413 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 D+ L+ + LY L +D +K ++AA NP++V++++ + + +S + Sbjct: 414 --HKGDFKLLSYNGGKKYELYNLKSDPSEKTDVAAQNPELVEKLKKDMLAWHESVK 467 >UniRef50_A4GJF1 Sulfatase n=1 Tax=uncultured marine bacterium EB0_50A10 RepID=A4GJF1_9BACT Length = 544 Score = 431 bits (1109), Expect = e-119, Method: Composition-based stats. Identities = 147/591 (24%), Positives = 239/591 (40%), Gaps = 114/591 (19%) Query: 10 VSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYS-------TKGKPNIIV 62 + S+ +++ SG A+ V +NV + PT +S +PNII+ Sbjct: 5 LMVSLMVLIVSGFVAWEYKVNILVWAIPKISNVTVQENIPTTWSKGPDTPVDDNRPNIIL 64 Query: 63 LTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRF 122 + DD+GY + G T T + +L G+ F Sbjct: 65 VLADDMGYNDISIHNGGAADGT----------------------LQTKNIDALAKSGILF 102 Query: 123 TNGYVAHGVSGPSRAAIMTGRAPARFG--------------------------------V 150 T GY A+ PSRA+IMTG+ P RFG V Sbjct: 103 TRGYAANATCAPSRASIMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREV 162 Query: 151 YSNTDA--QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 SN + G+P + + E+ ++ GYYTA +GKWHL + P + +D Sbjct: 163 VSNMPPFMEQGMPTEQITIAEVLRDAGYYTAHIGKWHLGHEYGM-DPMSQGFQDSLGLVG 221 Query: 209 TFSAEEWQPQ--NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 E P N FD + G Y++ F + Y++D TDEA+ V+ Sbjct: 222 PLYLPEDHPDVVNAKFDTRIDKMIWGMGQYSAN--FNGGDLFAPDKYVTDYYTDEALKVI 279 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRIL 326 + K ++PF LYL++ A H P D +++ + Y + S+D+ V +I+ Sbjct: 280 ENNK--NRPFFLYLSHWAIHNPLQALRSD-FEQMSHMHGHNLQVYSGMINSLDRSVGKII 336 Query: 327 EQLKKNGQYDNTIILFTSDNGAVIDGPLPL-NGAQKGYKSQTYPGGTHTPMFMWWKGKLQ 385 E+LK+ Y T+I+FTSDNG L N +G+K + GG P + W ++ Sbjct: 337 EKLKELDIYGKTLIIFTSDNGGANYIELNDINKPYRGWKISFFDGGIRVPYIISWPDEIN 396 Query: 386 PG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 PG + + D +PT L AA I +LDGV L+P++++ +PHK L W + Sbjct: 397 PGKKSENAVHHFDIFPTILKAAGIE--STNELDGVDLMPFIKNDSSSKPHKTLFWRSGN- 453 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLY 504 +V + + + + + N L+ Sbjct: 454 -------------------------------------HQSVLHEHWKFIISKKENFRWLF 476 Query: 505 KLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 + D +K+NL +NP VVKE++ ++ EF + PL + + I K Sbjct: 477 DTSADPTEKNNLVDSNPDVVKEIEELLVEFNSEQKDPLFPSSYDTPIMIDK 527 >UniRef50_Q64YV7 Arylsulfatase n=5 Tax=Bacteroides RepID=Q64YV7_BACFR Length = 489 Score = 431 bits (1108), Expect = e-119, Method: Composition-based stats. Identities = 131/568 (23%), Positives = 202/568 (35%), Gaps = 135/568 (23%) Query: 27 AHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTME 86 L + + + + +PN++ + DDLGYG L Sbjct: 6 QKLLLGSALLVGMASTQQALARQKKAKEQTRPNVVFILADDLGYGDLSCYG--------- 56 Query: 87 NREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPA 146 + TP + L G+RFT Y VS PSR+ ++TG Sbjct: 57 -----------------QEKFETPNIDRLAQNGMRFTQCYSGTTVSAPSRSCLITGTHSG 99 Query: 147 RFGVYSN----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 + N + Q +P + F+N GY T A GKW L I + Sbjct: 100 HTAIRGNKELAPEGQFPLPENSQTIFNDFRNAGYRTGAFGKWGLGYIGSAG--------- 150 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKNRERVP-----------A 250 P +G D F G++ A+ Y L+ N +RV Sbjct: 151 -------------DPYKQGIDQFYGYNCQLLAHSYYPDHLWDNDKRVDLPDNNLNVQYGK 197 Query: 251 KGYISDQLTDEAIGVVDR-AKTLDQPFMLYLAYNAPHLPNDNPAP---DQYQKQFNTGS- 305 Y D + +A+ +D AK DQPF ++ PH P +++ ++ Sbjct: 198 GTYSQDLIHSKALAFLDEAAKEKDQPFFMWYPTIIPHAELIVPEDSIIKKFRGKYPEKPY 257 Query: 306 -------------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDN 346 + A VY +D V +I+++LK G YDNTII+F+SDN Sbjct: 258 RGVEPGSPAFRKGGYCTQFYPHATFAAMVYRLDVYVGQIVQKLKDMGVYDNTIIIFSSDN 317 Query: 347 GAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYP 400 G ++G NG +GYK Y GG PM + W G +QP D + S D P Sbjct: 318 GPHMEGGADPDFFNSNGIWRGYKRDVYEGGIRVPMIISWPGHVQPSTETDFMCSFWDLMP 377 Query: 401 TALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHK 460 T + + +DGVS+LP LQ++K + H+ L + Sbjct: 378 TFREVLN-PKADTRNMDGVSILPLLQNRKGQKEHEYLYFEFLE----------------- 419 Query: 461 FVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY---TVENNQLGLYKL-TDLQQKDNLA 516 VR D+ LV+ LY L +D +K N+ Sbjct: 420 ------------------MNGRQAVRKGDWKLVHMNIRGNKPYYELYNLASDPSEKYNVL 461 Query: 517 AANPQVVKEMQGVV-REFIDSSQPPLSE 543 P+ E++ ++ I+ S PL Sbjct: 462 NQYPEKADELKAIMKEAHIEDSNWPLFR 489 >UniRef50_A6DTN4 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DTN4_9BACT Length = 482 Score = 430 bits (1107), Expect = e-119, Method: Composition-based stats. Identities = 120/542 (22%), Positives = 195/542 (35%), Gaps = 125/542 (23%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 S KPNII + DDLGYG L Sbjct: 8 LLFALNLSAADKPNIIYILADDLGYGDLGCYG--------------------------QK 41 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET 165 TP L + G++FT Y V GPSR+ ++ G+ V N Q + Sbjct: 42 VIQTPHLDKMAANGMKFTQHYSGSTVCGPSRSCLLEGKHSGNTYVRGNGMLQMRQDPHDL 101 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYF 225 P+ Q GY+TA +GK + ++ + P +GFDYF Sbjct: 102 IFPKALQKAGYHTAMIGKSGMGCNTD---------------------DAALPYQKGFDYF 140 Query: 226 MGFHAAGTAYYNSPSL------------FKNRERVPAKGYISDQLTDEAIGVVDRAKTLD 273 GF + A++ P+ + N Y S+ + +EA+ V+R K D Sbjct: 141 FGFTSHTQAHWFFPTHLWKNDGKVTKVEYPNNTLHEGDNYSSEVVMNEALDYVERQK--D 198 Query: 274 QPFMLYLAYNAPHLPNDNPAPDQYQKQ----------------FNTGSQTADNYYASVYS 317 PF L+LA+ PH + + + ++ + + A V Sbjct: 199 GPFFLHLAFQIPHASLRAKEEWKAKYRPILKEKLLPKKDKHPHYSYEREPKTTFAAMVSY 258 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-----LNGAQKGYKSQTYPGGT 372 +D V + ++L+ G +NT+I+F SDNGA+ +G NG +G K Y GG Sbjct: 259 MDHNVGLLNKKLEDLGLAENTLIMFASDNGAMQEGGHKRDSFDSNGVLRGGKRDMYEGGV 318 Query: 373 HTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 TPM +W GK++ G IS D PT + A + +D DG+S +P L K Sbjct: 319 RTPMIAYWPGKIKAGQTSDHISAFWDISPTVRELAGAKVQEDT--DGISFVPTLLGKGSQ 376 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 H L W +R + Sbjct: 377 TKHDYLYWEF-----------------------------------FEQGGKRAIRMGKWK 401 Query: 492 LVYTVE----NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 L+ N ++ L+ L D+ ++ +L+ P+ V + ++ + ++ P + Sbjct: 402 LILYKTNTDLNPKMELFDLEADISEQKDLSKQLPEKVSALLKLMDKAHTPAENPTFKFAS 461 Query: 547 EK 548 E+ Sbjct: 462 ER 463 >UniRef50_B0NLM9 Putative uncharacterized protein n=1 Tax=Bacteroides stercoris ATCC 43183 RepID=B0NLM9_BACSE Length = 463 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 122/518 (23%), Positives = 188/518 (36%), Gaps = 120/518 (23%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPNII + DD+GY L + TP + L Sbjct: 33 DKPNIIFILADDMGYCDLSCYGNKY--------------------------IETPNIDRL 66 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--------------TDAQDGIP 161 G FT Y G+S PSR A+MTG+ + N T + + Sbjct: 67 AATGTAFTQCYAGSGISSPSRCALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQ 126 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 +T + + GY T V KWHL E P NRG Sbjct: 127 PNDTTIATVLGAAGYRTCLVNKWHLDG----------------------FNPEATPLNRG 164 Query: 222 FDYFMGFH----AAGTAYYNSPSLFKNRERVPAKG--------YISDQLTDEAIGVVDRA 269 FD F G+ + YY F N + K + +D T++AI ++R Sbjct: 165 FDEFYGWLISTAYSNDPYYYPYWRFNNEKLENVKENEGDKHIKHNTDLSTEDAIKFINRN 224 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 K + PF LYLAY+APH P + Y + + Y + + +D+ + R+L +L Sbjct: 225 K--NNPFFLYLAYDAPHEPYNIDETTWYDDEAWDMNTKR--YASLITHMDRAIGRLLAEL 280 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPL---PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP 386 + G +NT+++F SDNGA PL G+ KG K Q Y GG P + GK+ Sbjct: 281 DRLGLRENTLVIFASDNGAAKQAPLEELGCKGSLKGMKGQLYEGGIRVPFIVNQPGKVPV 340 Query: 387 GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHW 446 + +I D PT A + KL+G+++LP ++ ++ L W Sbjct: 341 QKLNNIIYFPDVMPTLAALAGATDKLPQKLNGINILPLFYGQQLDTDNRLLYWEFP---- 396 Query: 447 FDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL 506 R D+ +V ++ L LY + Sbjct: 397 ---------------------------------GKQRAARCGDWKVVTVKKDAPLELYNI 423 Query: 507 T-DLQQKDNLAAANPQVVKEMQGVVREFI-DSSQPPLS 542 D+ + NLA P+ V + + ++ + PL Sbjct: 424 KEDMTESVNLANKYPEKVAQFEKEMKAMRIPTPNWPLP 461 >UniRef50_A6DNI9 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DNI9_9BACT Length = 500 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 114/580 (19%), Positives = 198/580 (34%), Gaps = 141/580 (24%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + T + + PNI+ + DDLG+ +F Sbjct: 1 MKLILRSFILLFSLSTLNAKEMPPNIVFILADDLGWADPSCYGSTFH------------- 47 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN- 153 TP + SL GV+ +N + V P+RA++MTG R G+ Sbjct: 48 -------------ETPHIDSLAKRGVKLSNFHSTSPVCSPARASLMTGLYAERLGMTQPA 94 Query: 154 ------------------------TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 + + ++ + GY T GKWHL Sbjct: 95 CHINLVSLKAHTPDKGWPHQKVISPKSTTRLDTVFPTYAKVLKAQGYVTGHYGKWHLGH- 153 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--YNSPSLFKNRER 247 E + P GFD + + Y P + + Sbjct: 154 -----------------------EPYTPLEHGFDVDVPHTKSHGPKGSYFGPKKYSDSFT 190 Query: 248 VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGS 305 + ++ D++ EAI + K D+PF+L + H P D+Y+K+ Sbjct: 191 LKKGEHLEDRMGQEAIEFIKENK--DRPFLLNYWAFSVHSPMFAKLDLLDKYRKKATKLP 248 Query: 306 ----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG--------- 352 Q + + + D V +L+ + + G D TII+ +SDNG I+ Sbjct: 249 TDAQQRNPIFAGMIETFDDNVGLLLKAIDEAGIADRTIIVLSSDNGGTIESAYTHEAYWG 308 Query: 353 ----------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPT 401 P N K K + GGT P + W GK++ G D S +D +PT Sbjct: 309 NGTVEEIVDIPATSNYPLKSGKGTIHDGGTAVPFIVVWPGKIKAGTKSDSYFSGVDVFPT 368 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 ++ A +P + +DGVS +P L ++ W Sbjct: 369 FVEMAGAKMPSGVAIDGVSQVPALITGEEVRDTLYGYW---------------------- 406 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ-----LGLYKL-TDLQQKDNL 515 P+ + S S +R+ DY LV + + L+ + D+ + N+ Sbjct: 407 --------PNYLVERNGSIPSAWIRHGDYKLVSYFFDGKNNKHRYELFDIKNDIGENHNI 458 Query: 516 AAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 AA NP+ V ++ ++++ ++ L ++N K Sbjct: 459 AAQNPERVAKLSAMLKQHFVETEAVLPKLNPNYDPQAKAP 498 >UniRef50_Q7UYA9 N-acetylgalactosamine-6-sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYA9_RHOBA Length = 474 Score = 430 bits (1106), Expect = e-119, Method: Composition-based stats. Identities = 108/526 (20%), Positives = 190/526 (36%), Gaps = 85/526 (16%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 + L T + E + PN+I+L DD G+G + F+ Sbjct: 4 PSRISFLCFAFTLAVPATQLIAETTDTNSPNVILLMSDDQGWGDVGFNG----------- 52 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 TP L ++ GVRF Y A + P+R + +TGR P RF Sbjct: 53 ---------------NEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRF 97 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISN--------VPVPEDKQT 200 G+ G+ + E + E+ Q GY T GKWH+ + P Sbjct: 98 GIL--AAHTGGMRVGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGF 155 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAK--GYISDQL 258 +Y + + + +D + + G + N G S + Sbjct: 156 DEYFATTSAVPTWDPTITPQDWDSW--GNGPGEPWKGGFPYVHNGREAKENLSGDDSRVI 213 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 D I ++ + +PF + ++APH P A ++++K + NYY + ++ Sbjct: 214 MDRVIPFIEANQA--KPFFATVWFHAPHEP--VVAGEEFKKLYPKAGSKRKNYYGCITAM 269 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAV---IDGPLPLNGAQKGYKSQTYPGGTHTP 375 DQ V R+ +L++ G NT++ F SDNG + G KG+K Y GG P Sbjct: 270 DQQVGRLRAKLRELGIEKNTVVFFCSDNGPSDGLAKKGVASAGPFKGHKHTMYEGGLLVP 329 Query: 376 MFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISI--PKDLKLDGVSLLPWLQDKKQGE 432 W G + G ++ S +DF PT S+ +DG+ L+P ++ + + Sbjct: 330 ACAEWPGTIPAGTSTEVRCSTVDFLPTVASIVGDSMVQKATRPIDGIDLMPLIRGEAKDR 389 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 D ++ + D+ L Sbjct: 390 DRDLFFGYRRLYQGID---------------------------------GQSIISGDWKL 416 Query: 493 V-YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 + +N +L LY L+ D + +L+ P+ ++++ + E S Sbjct: 417 LQEAKKNGRLRLYDLSKDPFETQDLSEEMPEQTEQLRKQLEELQAS 462 >UniRef50_A6DSP6 Sulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSP6_9BACT Length = 512 Score = 429 bits (1105), Expect = e-119, Method: Composition-based stats. Identities = 132/565 (23%), Positives = 222/565 (39%), Gaps = 154/565 (27%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 +PNII++ DD+GY + + + TP Sbjct: 14 ATFADKQPNIILIFADDMGYDDVGYHG--------------------------NKRIITP 47 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD----------GI 160 + S+ ++GV+F+ GYV+ V GPSRA ++TG RFG N + G+ Sbjct: 48 NIDSIAEQGVQFSQGYVSASVCGPSRAGLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGL 107 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P +++ + E + GY +GKWH+ ++ +P R Sbjct: 108 PQSQSMISEELKTLGYTNGMIGKWHMGFDMSL-----------------------RPNQR 144 Query: 221 GFDYFMGFHAAGTAY----------YNSPSLFKNRERVP------------------AKG 252 G+D+F GF Y + +F+N E P + Sbjct: 145 GYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDEN 204 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY 312 Y++D TDEA+ +DR D+PF LYLAYNA H P + + + Sbjct: 205 YLTDLFTDEAVNFIDRN--ADKPFFLYLAYNAVHHPWQTTQHALDKTAHLKDDKNYHVFA 262 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL----------------PL 356 + VY++D+G+ +++++LK+ DNTII+F SDNG+ + Sbjct: 263 SMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHSPKDPNRHRGGFTMSS 322 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADI---SIPK 412 G +GYK TY GG P + W ++Q G YD ISA+D PT + AA K Sbjct: 323 TGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISALDLQPTLVKAAGGNDKKPQK 382 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHN 472 DGV +LP+L K+ E ++L W + Sbjct: 383 GFAYDGVDILPYL--KEDKEIKRSLFWRRDTDY--------------------------- 413 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENNQ--LGLYKLT-DLQQKDNLAAANPQVVKEMQGV 529 +R D+ L + + + L+ + D +++ NL +P++ +++Q Sbjct: 414 -----------AIRKGDWKLQWNDAHGPLTITLFNIKEDPEERSNLIKQHPELAQQLQNE 462 Query: 530 VREFIDSSQPPLSEVNQEKFNNIKK 554 + +S P +E +N ++ Sbjct: 463 FDTWDNSM--PDNEWWGGPWNRLRH 485 >UniRef50_A6DS95 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DS95_9BACT Length = 491 Score = 429 bits (1105), Expect = e-118, Method: Composition-based stats. Identities = 127/519 (24%), Positives = 199/519 (38%), Gaps = 71/519 (13%) Query: 46 DFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 T + + K PNII + DD GYG + + Sbjct: 15 SMTLCSMAKQSKSPNIIFILTDDQGYGDMAVHGHPY------------------------ 50 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTE 164 TP + L E VRF YV P+RAA+MTG R GV ++ + Sbjct: 51 --LETPNMDRLHSESVRFDRFYV-SPSCSPTRAALMTGMHEFRNGVTHTVQPREKLYKGA 107 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 + ++ + GY T VGKWHL + + + PQ RGFD Sbjct: 108 LTIADILKEGGYKTGFVGKWHLG-----------------------NDKGYAPQYRGFD- 143 Query: 225 FMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 + +A G + + +N +R KG+ D DEA+ + A +QPF LYL + Sbjct: 144 WYAKNAKGPHNHFDVEMIRNGKRFQTKGFREDAFFDEAMTFMKEA--GEQPFFLYLCTYS 201 Query: 285 PHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 PH P P + + + Y A + ++D + R+ + LKK YD+TI++F + Sbjct: 202 PHTPLGAPEDLLKKYKAKGLNDNHAAYLAMIENIDDNLGRLDQFLKKENLYDDTILIFMN 261 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALD 404 DNG V G N +G K + GGT W K QP + L + +D PT + Sbjct: 262 DNG-VTVGLDVYNADMRGPKCTIWEGGTRAFSLWRWPKKWQPKTVENLTAHLDVLPTLCE 320 Query: 405 AADISIPKDLK--LDGVSLLPWLQDKKQGEPHKNLT-----WITSYSHWFDEENIPFWDN 457 A + +P+ ++ L+G SL P L K ++ L W + + Sbjct: 321 LAGVDVPEKVQGELEGYSLSPLLNGKDWEHNNRLLFHNVGRWPSGTAAAHKNAMCGIRKG 380 Query: 458 YHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ--------LGLYKL-TD 508 V Q + P V YT N Q LY + D Sbjct: 381 NFLLVHSQGCEDPICEKYPSQCTTLRNVAKGFKHATYTKTNAQFHWGVSEGWQLYDVKKD 440 Query: 509 LQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 ++LA A+P++V E++ ++ D P + + + Sbjct: 441 PSNLNDLANAHPELVDELKQAYSKWWDKQFPVMVKRGGD 479 >UniRef50_Q7UHK0 Arylsulphatase A n=1 Tax=Rhodopirellula baltica RepID=Q7UHK0_RHOBA Length = 478 Score = 429 bits (1104), Expect = e-118, Method: Composition-based stats. Identities = 121/542 (22%), Positives = 200/542 (36%), Gaps = 92/542 (16%) Query: 31 DDVKLKATKTNVAFS-DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 L S + + PN +++ DDLGYG + Sbjct: 15 QSCFLICLLMTFVMSWQVGSSIAAADRPPNFVLIFADDLGYGDISCYDS----------- 63 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 + TP L L EG R + +V V PSRAA++TGR P R G Sbjct: 64 ---------------SGVKTPHLDQLAAEGFRSKDFFVPANVCSPSRAALLTGRYPMRCG 108 Query: 150 VYSNTDAQ------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 + + G E +PEL GY + VGKWHL Sbjct: 109 MPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGME-------------- 154 Query: 204 HDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYY--NSPSLFKNRERVPAK---GYISDQL 258 E P + GFD ++G + N +L++ ++ ++ + Sbjct: 155 --------LEGSHPLDAGFDEYLGIPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 206 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 TDE I ++R K D PF +Y++++ H P P+PD G+ Y + + Sbjct: 207 TDEVIDFIERQK--DDPFFIYVSHHIVHNPL-KPSPD------FVGTSEKGKYGDFIKEL 257 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 D RI++ ++ G +NT+++FTSDNG + +G G K T GG P Sbjct: 258 DHSTGRIMQTIRDAGLDENTLVIFTSDNGPTRN---GSSGELSGGKYCTMEGGHRVPGMF 314 Query: 379 WWKGKLQPGNYDKLI-SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNL 437 W K+ P + ++MD P + A + IP D ++DG S+LP L + PH+ L Sbjct: 315 RWTSKIAPNQVSDVTLTSMDLLPLFCELAGVPIPDDRQIDGKSILPVLLGQTSESPHQFL 374 Query: 438 TWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE 497 + + + + +DD P D ++ T+ Sbjct: 375 YY-----YNGTNLQAVREGKWKLHLPRTTDDQPFWSKKPDKTKGFVTL------------ 417 Query: 498 NNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKAL 556 N++ L+ L DL +K N+A +P++V + + + + Sbjct: 418 -NEMRLFNLDRDLGEKKNVADRHPEIVARLNEQAELIRTELGDVQTIGTDQYPIRLSNPQ 476 Query: 557 SE 558 Sbjct: 477 ER 478 >UniRef50_A7S8Q2 Predicted protein n=2 Tax=Nematostella vectensis RepID=A7S8Q2_NEMVE Length = 540 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 119/519 (22%), Positives = 207/519 (39%), Gaps = 69/519 (13%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 G P+I+ + MDDLG+ + + S TP + Sbjct: 32 AGPPHIMFILMDDLGWSDVGYHNISHA-------------------------VKTPNIDK 66 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELF 171 L +GV+ + Y + PSR A+MTG+ P G+ N + G+P +P+ Sbjct: 67 LASQGVKLMSYYS-QPMCTPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKL 125 Query: 172 QNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAA 231 + GY T+ +GKWHL F ++ P RGFD F+GF A Sbjct: 126 RTLGYRTSMIGKWHLG----------------------FFDWDYTPLRRGFDSFLGFFAG 163 Query: 232 GTAYYNSP-----SLFKNRER--VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNA 284 ++ ++ E + +D T EAI + QP L L+Y A Sbjct: 164 EQDHWRHSKMGFLDFRRDEEPANEYGGQHSTDVFTQEAIN-IAMRHNASQPLFLLLSYAA 222 Query: 285 PHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTS 344 H P P+ K + NY + + D + R+++ K+NG ++NT++++ S Sbjct: 223 VHTPLQA-HPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGLWNNTLMIWAS 281 Query: 345 DNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG-KLQPGNYDKLISAMDFYPTAL 403 DNGA N +GYKS + GG P F+ + + + G + L D+YPT + Sbjct: 282 DNGAQPGKGGGYNWPLRGYKSSLFEGGVRVPAFVHGEMLQRKGGTVNDLFHVTDWYPTLV 341 Query: 404 DAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 A + D +DGV P L + K + + L I ++ +E P NY+ Sbjct: 342 KLAGGEVEPD--IDGVDQWPTLSEGKPSKREEILHNIDIPANQEEERMAPRGFNYYSGAA 399 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN-----QLGLYKLT-DLQQKDNLAA 517 + D + + +V + + +L LY +T D +++++L+ Sbjct: 400 LRRGHMKLVYKMGDAGWYQLPENGHRGPVVEEMVKDRLPIVELALYNITADPEERNDLSK 459 Query: 518 ANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKAL 556 NP +V + ++E +S + + + + L Sbjct: 460 LNPDIVDSLWRRLQELNATSLEYRLQPEDPRSIALAERL 498 >UniRef50_A6BYR0 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BYR0_9PLAN Length = 658 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 126/595 (21%), Positives = 209/595 (35%), Gaps = 147/595 (24%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 L +A S + PN+++ +DD+G+ + Sbjct: 4 FLVVLFCMIAISS--AETVAADRAPNVVLFLVDDMGWMDSEPYGSRY------------- 48 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP + L + +RFTN Y + P+RA+I+TG+ P+R G+ S Sbjct: 49 -------------YETPNMSKLAKQSMRFTNAYAT-PLCSPTRASILTGQYPSRHGITSA 94 Query: 154 TDAQ--------------------------DGIPLTETFLPELFQNHGYYTAAVGKWHLS 187 T + + + + L E ++ GY T GKWHL Sbjct: 95 TGHRPPQAENFEFLPTAAPPNQKLRMPVSKNYLEPNQYTLAEALRDAGYRTGHFGKWHLG 154 Query: 188 KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRER 247 P+ + YF + T + N Sbjct: 155 LT-TPHRPDKQGFETVWHCAPDPGPPS---------YFSPYGVTPTGKPTAQHRVGNITD 204 Query: 248 VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP---DQYQKQFNTG 304 P +I+D+LT EAI ++ ++ +PF L L + + H P + A + +KQ Sbjct: 205 GPDGEHITDRLTSEAIQFMEAHRS--EPFFLNLWHYSVHGPWQHKAEYTAEFAKKQDPRK 262 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-------------- 350 Q + + +VD+ + RIL++L + DNT+ +F SDNG Sbjct: 263 EQRNPVMASMLRNVDESLGRILQKLDELKLADNTLFIFYSDNGGNAHSWSSDDPKLKKIT 322 Query: 351 -----------------DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKL 392 P N + K + Y GG P+ + W G +QPG D + Sbjct: 323 DKHPLYKTINSYRKWAGGEPPTNNAPLREGKGRIYEGGQRVPLMVRWPGHIQPGTTSDAI 382 Query: 393 ISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 + +D YPT LD+ +S P + +DG S LP L+ + E TW Sbjct: 383 VGPIDLYPTILDSLKLSQPANQIIDGKSFLPVLEQTGELERTAYFTWFPHLI-------- 434 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL-----GLYKLT 507 + +VR D+ L+ E ++L LY L Sbjct: 435 ----------------------------PAVSVRQGDWKLIRRFEPHRLYPEIRELYNLK 466 Query: 508 -DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ---EKFNNIKKALSE 558 D+ + DNLA P V+E+ ++ EF+ + + N + NI + Sbjct: 467 ADISESDNLARQRPDKVRELDALIDEFVKETGALYPQPNPAYKPRPGNIDNKGRD 521 >UniRef50_A7RFN2 Predicted protein n=7 Tax=Eumetazoa RepID=A7RFN2_NEMVE Length = 512 Score = 428 bits (1102), Expect = e-118, Method: Composition-based stats. Identities = 120/547 (21%), Positives = 192/547 (35%), Gaps = 82/547 (14%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 ++ + KP+I+ + DDLG+ + F Sbjct: 1 MQYLVFLFGLMSLPFVALTANKKPHIVFIVADDLGWDDVSFHGSG--------------- 45 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 Q TP + L GV N YV + P+R+AIMTG+ P G+ + Sbjct: 46 -----------QIPTPNIDGLAKTGVILNNYYV-SPICTPTRSAIMTGKYPIHTGMQHSV 93 Query: 155 D---AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 G+ L ET +P+ + GY T VGKWHL F Sbjct: 94 ILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLG----------------------FF 131 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSP---------SLFKNRERVPAKG--YISDQLTD 260 E+ P RGFD + G+ Y++ L + + V + Y SD + Sbjct: 132 KYEYTPIQRGFDSYFGYWCGKGDYWDHSNNEKYGWGLDLHDSEQDVWTEWGHYSSDLFAE 191 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHL-----PNDNPAPDQYQKQFNTGSQTADNYYASV 315 +A+ V+ P LYL + A H P P PD K N + + A V Sbjct: 192 KAVNVISTH-NASVPLFLYLPFQAVHSANFIQPLQAP-PDLIDKFKNIKDERRRIFAAMV 249 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQKGYKSQTYPGGT 372 S+D +K++++ LK Y+N+II+FT+DNG +G + N +G K + GG Sbjct: 250 SSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEGGI 309 Query: 373 HTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 F+ +PG +L+ D+ PT A I LDG L + Sbjct: 310 RGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAGGDIHDLQNLDGFDLWDSISTDAMS 369 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS----YTVRN 487 + + I + W W ++ S YP E + V+ Sbjct: 370 PREEMVHNIDPVN-WEAAYRFREWKIVVNQTKYMSGWYPLPNIEEREPHPATLRDAVVKC 428 Query: 488 NDYS--LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 V ++ L+ + D + NLA +++ M + + P + Sbjct: 429 GPPPEIPVNCTASDGPCLFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMVPIRNTP 488 Query: 545 NQEKFNN 551 N Sbjct: 489 LDPSANP 495 >UniRef50_C6Y1Z7 Sulfatase n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6Y1Z7_PEDHD Length = 480 Score = 428 bits (1101), Expect = e-118, Method: Composition-based stats. Identities = 119/554 (21%), Positives = 206/554 (37%), Gaps = 109/554 (19%) Query: 35 LKATKTNVAFSDFTPT---EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +K ++F F + + +PN+I++ MDD+GYG + Sbjct: 1 MKTGLFILSFCCFFAAGRAQTTKTQRPNVIIINMDDMGYGDTEPYGMT------------ 48 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 TP EG+RFT+ A + PSRAA++TG P R G+ Sbjct: 49 --------------GIPTPNFNKAAKEGMRFTHFNAAQAICSPSRAALLTGCYPNRIGLR 94 Query: 152 S--NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + D++ + E + L + GY TA +GKWHL Sbjct: 95 GALSPDSKIALDTAEETIASLLKKAGYKTAMLGKWHLG---------------------- 132 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAY-----------------YNSPSLFKNRERV---- 248 S P + GFD F G + + Y L + Sbjct: 133 -SKAPNLPLHYGFDSFYGLPYSNDMWPVDYEGKPQAAVAGKKSYPELPLLDGDKPADYVR 191 Query: 249 --PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ 306 + ++ T +A+ ++ K+ PF LYLA+ PH+P A G Sbjct: 192 TPDDQAMLTGTFTRKAVRFIENNKSA--PFFLYLAHPMPHVPLAASA-------AFRGKS 242 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYK 364 + + +D V I++ L +N NTI++ SDNG + +G +G K Sbjct: 243 ELGLFGDVIMELDWSVGEIMKSLDRNKIASNTILIIMSDNGPWLRFGNHAGSSGGFRGGK 302 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 + GGT P + W GK++ G+ + LI+ MD PT L + + P+ K+DG+S Sbjct: 303 MTIWDGGTRVPCIIRWPGKVEAGSVNSNLITNMDILPTLLQLSHAAPPE-KKIDGISFAD 361 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 L + P + + + + + N+ + H S Y + + +D + Sbjct: 362 LLLGRSDKAPRQVFYYYYN----ENSLKAVRYKNWKLVLPHTSVSYTSDIHGKDGFPGAA 417 Query: 484 TVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 T ++ LY L D + ++ P++V++M V E Sbjct: 418 -----------TRAEVKMALYDLAHDPGEAYDVQQQYPELVQKMLVFVEEARADMGD--- 463 Query: 543 EVNQEKFNNIKKAL 556 ++ K N+++ Sbjct: 464 DLTGRKGKNLRQPA 477 >UniRef50_Q7UYA6 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UYA6_RHOBA Length = 490 Score = 428 bits (1100), Expect = e-118, Method: Composition-based stats. Identities = 123/526 (23%), Positives = 192/526 (36%), Gaps = 95/526 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 ++L+ F PN +V+ DD GY + Sbjct: 3 LRLQVLMCVCLMQGFCVAA-----PPNFVVIFTDDQGYEDVGCFGS-------------- 43 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L ++ G++FT+ Y + GPSRAA+MTG P R Sbjct: 44 ------------PDIRTPRLDAMAKGGMKFTSFYA-QPICGPSRAALMTGCYPMRVAERG 90 Query: 153 NTDAQDG-IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 +T + E + E+ + GY +A GKW L+K + D Sbjct: 91 HTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSDL------------- 137 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAK---GYISDQLTDEAIGVVDR 268 P +GFDYF G + N L++N E + + ++ + TDEAI +++ Sbjct: 138 ----LPTGQGFDYFYGTPTSNDRVAN---LYRNEELIEPESDMATLTRRYTDEAISFIEK 190 Query: 269 AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQ 328 + +QPF +Y+ + PH D + G Y + +D V RIL+ Sbjct: 191 NQ--NQPFFVYIPHTMPHTRLDA-------SKDFKGKSKRGLYGDVIEEIDFNVGRILDS 241 Query: 329 LKKNGQYDNTIILFTSDNGAV------------IDGPLPLNGAQKGYKSQTYPGGTHTPM 376 L + DNT +LFTSDNG + G + K T+ GG P Sbjct: 242 LNELNLADNTYVLFTSDNGPWLVKNKGHADGHRLGDHGGSAGPLRSGKVSTFEGGVRVPA 301 Query: 377 FMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK-KQGEPH 434 +W GK+ G D + + MD PT A IP D +DG + + + +P Sbjct: 302 ILWAPGKVPAGTVCDSIATTMDVMPTLAALAGAEIPTDRVIDGEDIRHLFHGEFDKADPD 361 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 K + + H Q H P ++ + RN + Sbjct: 362 KAFFY---------------YLRVHLQAVRQGKWKLHLPREKEPVGAAPFGRNAHIAPKD 406 Query: 495 TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 + Q L L DL + N+AA NP+VV+ + G+ D Sbjct: 407 RIGFKQPFLVDLDNDLGETTNVAAENPEVVERLLGLAESMRDDLGD 452 >UniRef50_A6C8S3 Arylsulphatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C8S3_9PLAN Length = 481 Score = 427 bits (1099), Expect = e-118, Method: Composition-based stats. Identities = 120/522 (22%), Positives = 185/522 (35%), Gaps = 98/522 (18%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 T KPN IV+ DDLGYG L Sbjct: 28 QQTLFAAQATAKPNFIVIFADDLGYGDLECYG--------------------------HP 61 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTET 165 + TP L + EG R T V PSRA ++TGR P R GV+ N + Sbjct: 62 RFKTPHLNQMAAEGARLTQFNVPVPYCAPSRATLLTGRYPWRHGVWYNPAPDGQQFRSGV 121 Query: 166 FLP-------ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQ 218 + EL + +GY T +GKWHL E+ P Sbjct: 122 GIAESELLLSELLKENGYATICIGKWHLGHD-----------------------PEYYPT 158 Query: 219 NRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA---KGYISDQLTDEAIGVVDRAKTLDQP 275 GFD ++G + +L + + + + ++ + T+ A+ + + P Sbjct: 159 RHGFDDYLGILYSNDMR--PVNLMQGEKLLEYPVIQANLTKRYTERAVKFIQENQEG--P 214 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 F LYL + PH P + A Y + +D V I + L++ Sbjct: 215 FFLYLPHAMPHKPLAA-------SEAFYKKSGAGLYGDVIAELDWSVGEIFKTLRELNLD 267 Query: 336 DNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLIS 394 +NT+++F SDNG G G KS T+ GG PM W GK+ P D + Sbjct: 268 ENTLVIFASDNGPWFGGN---TAGLSGMKSTTWEGGLRVPMIARWPGKIPPRQVIDTVCG 324 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT--------------WI 440 ++D +PT L A I +P D +DG L P L K+ PH+ L W Sbjct: 325 SIDVFPTILKQAGIPVPADRVIDGKDLFPVLT-KQAPTPHQALYSMKGNSLFTVRSGPWK 383 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + N+ R P + + + N D + Sbjct: 384 LHVKPSPRQVLAGKGKNWID-PRGPDGITIIAPYEQAMPDQQPGIHNGD-------QPVP 435 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 + L+ L D+ ++DN+A +P+VV + + E + Sbjct: 436 MMLFNLQQDIAEQDNVADEHPEVVARLMKLYHEMQAEVPASI 477 >UniRef50_D2R1I8 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R1I8_9PLAN Length = 427 Score = 427 bits (1099), Expect = e-118, Method: Composition-based stats. Identities = 127/505 (25%), Positives = 195/505 (38%), Gaps = 98/505 (19%) Query: 62 VLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVR 121 ++ DDLGYG + + TP + L EG+ Sbjct: 2 LILADDLGYGDVSTY--------------------------HPSDVRTPQIDQLAAEGML 35 Query: 122 FTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-----DAQDGIPLTETFLPELFQNHGY 176 T+ V PSRAA++TGR R GV D+ T L + + GY Sbjct: 36 LTSMRANCTVCSPSRAALLTGRYADRVGVPGVIRTKPEDSWGWFDPTVPTLADELKRVGY 95 Query: 177 YTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA-- 234 +TA VGKWHL S P RGFD+F GF Sbjct: 96 HTAIVGKWHLGLES-----------------------PNTPNERGFDFFQGFLGDMMDSY 132 Query: 235 ----YYNSPSLFKNRERVPAKGYISDQLTDEAIGVV-DRAKTLDQPFMLYLAYNAPHLPN 289 Y + + +NRE + +G+ ++ TD A + +RAK +QPF LYLAYNAPH P Sbjct: 133 TTHLRYGNNYMRRNREVIEPQGHATELFTDWASEYLVERAKQKEQPFFLYLAYNAPHFPI 192 Query: 290 DNPAP--DQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 + PA + +++ Q A V +D + R+L+ LK+ G NT+++FTSDNG Sbjct: 193 EPPAEWLAKVKERAPQLDQKRAKNVAFVEHLDHSIGRVLKTLKETGLDQNTVVVFTSDNG 252 Query: 348 AVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAA 406 + N + K Y GG P + W G+++ G+ + D +PT L+ A Sbjct: 253 GSLP-HAQNNDPWRDGKQSHYDGGLRVPFMVRWPGQIKAGSRSDYVGLNFDLFPTFLELA 311 Query: 407 DISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQS 466 + K +LD VSL+P L+ K + ++ Sbjct: 312 GATPSK--ELDAVSLVPVLKGGKITTSRDLYFVRREGGVTYGGKSYE------------- 356 Query: 467 DDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKE 525 + ++ L+ + L LY + D + +LAA+N +VV E Sbjct: 357 -----------------AIIRGEWKLLQNDPYSALELYNIQNDPGETKDLAASNKKVVNE 399 Query: 526 MQGVVREFIDSSQPPLSEVNQEKFN 550 + +R I + K Sbjct: 400 LAAALRLHIQRGGATPWQAPPRKPA 424 >UniRef50_C5EQ23 Arylsulfatase E n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EQ23_9FIRM Length = 483 Score = 427 bits (1098), Expect = e-118, Method: Composition-based stats. Identities = 123/517 (23%), Positives = 182/517 (35%), Gaps = 109/517 (21%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 KPNIIV DD GYG L + TP L L Sbjct: 15 KKPNIIVFLTDDQGYGDLSCMGST--------------------------DVCTPNLDIL 48 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA---QDGIPLTETFLPELFQ 172 G RFT+ Y V PSRA ++TGR P GV S G+ + Sbjct: 49 AAGGARFTDFYAGSAVCSPSRACLLTGRYPYMTGVRSILGGIKTTTGLNPGIPTFASALK 108 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 + GY T VGKWHL + E +P + GFDYF GF + Sbjct: 109 DLGYTTGMVGKWHLGAV-----------------------PECRPTHMGFDYFCGFLSGV 145 Query: 233 TAYYNS---------------PSLFKNRER--VPAKGYISDQLTDEAIGVVDRAKTLDQP 275 Y++ L++N ER Y ++ + + + D P Sbjct: 146 NDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLEFIREQVEKDMP 205 Query: 276 FMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQY 335 F LY A+NAPH P AP +Y ++F + A + +VD GV I+ LK+ G + Sbjct: 206 FALYCAFNAPHYPMH--APYKYLERFKHLPEDRQIMAAMLSAVDDGVGEIMNYLKRRGIF 263 Query: 336 DNTIILFTSDNGAVIDGP-----------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 ++TII F SDNG + G KG+K + GG P W + Sbjct: 264 NDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGHKFSLFDGGIRVPAIFSWPAMV 323 Query: 385 QPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G + D +PT ++AA + D ++ G +LP + L W Sbjct: 324 PAGQVISEPCMGTDIFPTFINAAGGNA-SDYEISGCDILPVMTIGA-RRDKDCLYWEMGQ 381 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 N + P +P TE ++ L Sbjct: 382 QTAVRRGN---YKLVINGFLRDGWSLPLDPKTETKH--------------------EVWL 418 Query: 504 YKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 L+ D+ ++ NL P++ KE++ + + Sbjct: 419 SDLSQDMGEEHNLVEEMPELAKELEEKALTWRRDLEA 455 >UniRef50_Q1YSH0 Sulfatase family protein n=4 Tax=cellular organisms RepID=Q1YSH0_9GAMM Length = 557 Score = 426 bits (1097), Expect = e-118, Method: Composition-based stats. Identities = 141/552 (25%), Positives = 214/552 (38%), Gaps = 102/552 (18%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 + K PNII++ DD+G+ + G + Sbjct: 51 GPASAETTPAKRPPNIILILTDDMGFNDISLYNGGAADGS-------------------- 90 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG----- 159 TP + + ++G+RF NGY A+ V SRA+++TGR RFGV + G Sbjct: 91 --LQTPNIDRIAEQGIRFNNGYAANAVCTSSRASLLTGRYSTRFGVEYTPIYKTGVRIFN 148 Query: 160 -----------------------------IPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 +P E + E+ Q YYTA +GKWHL Sbjct: 149 WMEELNPSTPPVLVDMDLAATLPPIDALGMPAAEITIGEVLQQQDYYTAHIGKWHLGSNG 208 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQ----NRGFDYFMGFHAAGTAYYNSPSLFKNRE 246 ++ PE + D F P D A +Y + Sbjct: 209 DM-RPEQQGFDDSLSMKGIFYLPPDHPDVVNAKIPGDSIDSMVWAVGSY---EVQWNGGP 264 Query: 247 RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ 306 KGY++D TD A+ V++ + +PF LYLA+ PH P D Y + Sbjct: 265 PFEPKGYLTDYFTDAAVDVIEANR--HRPFFLYLAHWGPHNPVQASRED-YDALPHIKDH 321 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-NGAQKGYKS 365 Y A + ++D+ V++I L++NG DNT+I+FTSDNG L N +G+K Sbjct: 322 RLRTYAAMLRALDRSVEKIEASLQENGLSDNTLIIFTSDNGGAGYLDLTDLNKPYRGWKL 381 Query: 366 QTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 + GGTH P W +++ G D+ I +D + T AA S+P D LDGV+LLP+ Sbjct: 382 THFEGGTHVPYMAKWPAQIEAGQSSDEAIHHIDMFHTIAAAAGASVPTDRTLDGVNLLPF 441 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 +Q K+ G PHK L W T + W K +R + D P Sbjct: 442 MQGKQTGAPHKTLFWHTGH-------QQTVWHQGWKMIRAEQSDKPGA------------ 482 Query: 485 VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + + L+ L D +++NL A P+ E+ ++ PL + Sbjct: 483 -------------DPMVFLFDLNNDPTEQNNLIAEQPEKAAELTALLDTHHAQQAKPLWD 529 Query: 544 VNQEKFNNIKKA 555 I K Sbjct: 530 SALNAPQLIDKP 541 >UniRef50_A5FAW4 Sulfatase n=1 Tax=Flavobacterium johnsoniae UW101 RepID=A5FAW4_FLAJ1 Length = 539 Score = 426 bits (1097), Expect = e-118, Method: Composition-based stats. Identities = 134/566 (23%), Positives = 216/566 (38%), Gaps = 159/566 (28%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 AF T +++ KPNII+L DDLG + G Sbjct: 47 AAFLSQKDTSAASEKKPNIIILLADDLGKYDISLYGGKST-------------------- 86 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD--- 158 TP + SL GV FT+GYV+ + PSRA ++TGR RFG + Sbjct: 87 ------PTPQIDSLAASGVTFTDGYVSSSICSPSRAGLLTGRYQERFGHEYQPGDRYPKN 140 Query: 159 ---------------------------------GIPLTETFLPELFQNHGYYTAAVGKWH 185 G+P +E +L + GY TA +GKWH Sbjct: 141 NLEYYAFKYLLNTNSWRLNPKIEYPNDASIATQGLPKSEITFADLAKKQGYSTAIIGKWH 200 Query: 186 LSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFH---------------- 229 L + + P +RGFDY GF+ Sbjct: 201 LGHT-----------------------KGFFPLDRGFDYHYGFYQAFSLFAPEDNNPDII 237 Query: 230 AAGTAYYNSPSLFKNR-----------ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFML 278 + +++ N + K Y++++ +EA +D+ K ++PF+L Sbjct: 238 NHHHTDFTDKTIWGNGRVGTGQIRRDSTIIDEKKYLTEKFAEEAEAFIDKNK--NKPFLL 295 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNT-GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 Y+ +NAPH P +Y +F + Y+A + ++D + I ++KK G +N Sbjct: 296 YVPFNAPHTPFQVR--KKYYDRFPNVKDENKRVYFAMISALDDAIGLIRAKVKKEGLEEN 353 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAM 396 T+I F SDNG N KG K + GG + P + WKGK++P +S++ Sbjct: 354 TLIFFASDNGGADYTYATTNAPLKGGKFSHFEGGVNVPFALSWKGKIKPHTIYKTPVSSL 413 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D + T +PKD DGV L+ + + KQ H+NL W + + Sbjct: 414 DIFSTIAAVTHSGLPKDRVYDGVDLVDVVNNNKQA--HQNLYWRSGDAK----------- 460 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNL 515 +R+ D+ L+ + + ++ LY L D + +L Sbjct: 461 ---------------------------AIRSGDWKLIISGKTHETWLYNLAKDKSETTDL 493 Query: 516 AAANPQVVKEMQGVVREFIDSSQPPL 541 A+ NP+ VKE+Q ++ + PL Sbjct: 494 ASKNPEKVKELQTALQNWEKGLIKPL 519 >UniRef50_B4D433 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D433_9BACT Length = 465 Score = 426 bits (1095), Expect = e-117, Method: Composition-based stats. Identities = 122/530 (23%), Positives = 193/530 (36%), Gaps = 107/530 (20%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 ++ KPN+I +DDLG L SF T Sbjct: 19 AADASPAKPNVIFFLVDDLGATDLSCFGSSF--------------------------YQT 52 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ------------ 157 P + L +G++FT+ Y A V P+RA+I++GR PA + Sbjct: 53 PNIDRLAQDGLKFTHAYSACTVCSPTRASIISGRYPAELHLTDWIAGHKRPKAKLRIPDW 112 Query: 158 -DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + + LP+ GY T A+GKWHL E Sbjct: 113 TQHLTHDVSTLPQAMHAAGYTTCAIGKWHLG--------------------------EDG 146 Query: 217 PQNRGFDYFMGFHA-AGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQP 275 P+ GFD + + A Y SP + P ++SD+LT EA +++ K + P Sbjct: 147 PEKYGFDVAIADNGKGQPATYFSPYKNPHLSDGPPGEFLSDRLTTEAEKFIEQNK--EHP 204 Query: 276 FMLYLAYNAPHLPN--DNPAPDQYQKQ-FNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 F LY A+ A H P +Y++ Q Y + + SVD + + +L + Sbjct: 205 FFLYFAHYAVHTPLMGKPAVIAKYKEHVSPNDPQHNPVYASLIESVDDSLGHLRAKLDEL 264 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DK 391 D TII+FTSDNG +I + N + K Y GG P + G Q G Sbjct: 265 KLSDKTIIIFTSDNGGLILNQVTSNLGMRAGKGSAYEGGVRVPAIAFVPGVTQAGTVATT 324 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + +MD+ T LD A + GVSL P L + + L W + H Sbjct: 325 PVISMDWTATMLDLAGAKPLDQQR--GVSLAPVLHGGQISL--RALFWHYPHYHPGGAT- 379 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQ 510 + +++ LV E+N + LY L+ D + Sbjct: 380 -----------------------------PYCAMLEDNWRLVEFFEDNHVELYHLSDDPE 410 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKALSEAK 560 +K +LAA+ +E++ + + ++ L N + + K Sbjct: 411 EKHDLAASQSAKAEELKARLHAWRETMHAQLPTPNPDYDPAHANDGPKKK 460 >UniRef50_Q3JD43 Sulfatase n=2 Tax=Nitrosococcus oceani RepID=Q3JD43_NITOC Length = 440 Score = 425 bits (1094), Expect = e-117, Method: Composition-based stats. Identities = 115/527 (21%), Positives = 198/527 (37%), Gaps = 126/527 (23%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 + K PN+I++ DD+GYG + Sbjct: 7 SSSLVSGREKQPPNVILIVADDMGYGDVGCYG--------------------------NQ 40 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG-----I 160 TP L +L +G RFT+ + + P+RAA++TG R G++ Q + Sbjct: 41 HIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIPKDQRYAMAKAM 100 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 L E E ++ GY TA VGKWHL + P + Sbjct: 101 SLEEITFAEALKSVGYSTALVGKWHLGD-----------------------RPAFLPPRQ 137 Query: 221 GFDYFMGFHAAGTAY-----YNSPSLFKNRERV---PAKGYISDQLTDEAIGVVDRAKTL 272 GFD + G + + + L + E V P +++ T+EA+ + + K Sbjct: 138 GFDEYFGIPYSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAVKFISKNK-- 195 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQF----------NTGSQTADNYYASVYSVDQGV 322 D+PF+LY+ + PH P +++ K+F Y A++ +D V Sbjct: 196 DRPFLLYMPHPMPHQPVH--VSERFAKRFSKEQLAAIKGEDKKSRKFLYSATIEEIDWSV 253 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 I++ ++ G ++T + FTSDNG I G +G K + + GG P +W+ Sbjct: 254 GEIIKAVRALGIEESTFVAFTSDNGPAI----GSAGPLRGKKRELWEGGHRVPFIAYWQE 309 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 K++PG D++ +MD +PT +P+ K+DGV+LLP L + + + W + Sbjct: 310 KIRPGVVIDEIAMSMDLFPTMAAMGRAPLPR-KKIDGVNLLPLLC-EGDKLSERTVFWRS 367 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ- 500 + R + L+ + Sbjct: 368 --------------------------------------KGKKAARKGPWKLLMQPTKKKR 389 Query: 501 ---LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 +GLY L DL ++ NLA P+ +K +Q + ++ Sbjct: 390 PTSIGLYHLNNDLSEQHNLAEIYPEKLKSLQLEFAAWEKYVDAGRAQ 436 >UniRef50_A6DSG4 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG4_9BACT Length = 489 Score = 424 bits (1092), Expect = e-117, Method: Composition-based stats. Identities = 119/531 (22%), Positives = 207/531 (38%), Gaps = 100/531 (18%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + + + F+ + KPNI+ DDLGYG + Sbjct: 4 LNRQFVTSLACLAFFTGV--VSLQAQQKPNILFYLTDDLGYGDIGCYGAEGQY------- 54 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 TP + L EG +F++ YV PSRAA MTG R G Sbjct: 55 -------------------TPAIDQLAKEGTKFSSFYVHQR-CSPSRAAFMTGSYAHRVG 94 Query: 150 ----VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 +Y + + G+ +E LPEL + GY TA VGKWHL + Sbjct: 95 LPQVIYKHREGPIGLNPSEITLPELMKTAGYNTALVGKWHLGE----------------- 137 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ----LTDE 261 + + P N G+DYF GF PSL +NR+ + +K ++ + Sbjct: 138 ------WKPFHPLNHGYDYFYGFLKV-IEGSEKPSLIENRKELASKIQKTEGQAPGMVKA 190 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQG 321 AI + + K PF L + PH P + + G+ NY ++ +D Sbjct: 191 AINFMTKHKKN--PFFLVYSDPMPHAPY-------FPSEQFKGTSKRGNYGEVIHEIDWQ 241 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGP----LPLNGAQKGYKSQTYPGGTHTPMF 377 K +++ L + G +NTI++FTSDNG ++ + L+G + K + GG P Sbjct: 242 FKHLMDALDELGLKENTIVVFTSDNGPPVERQKKYDVGLSGPLRDGKWTNFEGGVRVPFI 301 Query: 378 MWWKGKLQP-GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 + W GK++ + D +I +D PT + A + +P D +DGV++LP L ++ + + Sbjct: 302 IRWPGKVKVDASSDAMIGIIDMLPTFCELAGVDVPNDRVIDGVNILPQLLGDQESKALRE 361 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 + W Y K ++ P D++ + Sbjct: 362 -----TQIVPGATIIHNGWKYYAKQQNPYNNKKPE-----------------DWNGLQPA 399 Query: 497 ENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + L+ L D+ + ++A +P++ + ++ + +F+ + Sbjct: 400 KEGA--LFNLKEDIGETTEVSAQHPEIAESLKKNMAKFMAELKKNSRPAGD 448 >UniRef50_A3ZLN5 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZLN5_9PLAN Length = 468 Score = 424 bits (1092), Expect = e-117, Method: Composition-based stats. Identities = 119/556 (21%), Positives = 205/556 (36%), Gaps = 140/556 (25%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKG---KPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 +L +A + + + P+I+++ DD G+ L + Sbjct: 3 STRLMTFVCALASALLVSNAVAAEKSKRPPSIVLIVSDDQGFADLSCIGDNGC------- 55 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 TP L L G R T+ YV+ PSRA++MTGR P R Sbjct: 56 -------------------RTPRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRN 96 Query: 149 GVYSNTDAQD--------------------GIPLTETFLPELFQNHGYYTAAVGKWHLSK 188 G Y + G L E FL ++ + GY +A GKW Sbjct: 97 GTYDMIRNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKW---- 152 Query: 189 ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS-----PSLFK 243 + + P RGFD + GF G Y+ PS+F+ Sbjct: 153 -------------------DGGQLKRYLPLQRGFDQYYGFANTGVDYFTHERYGVPSMFR 193 Query: 244 NRERVPAK--GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND--------NPA 293 + + Y++D EAI +D D+PF LYL +NAPH ++ A Sbjct: 194 DNQPTEEDKGTYLTDLFEREAIRFIDENH--DRPFFLYLPFNAPHSASNLDRSIRGFAQA 251 Query: 294 PDQYQKQFNTGSQT----ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV 349 P +Y F G Y A+V +D+ + ++++QL+++ DNT+I+F SDN Sbjct: 252 PQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQLQQHQIADNTLIIFLSDN--- 308 Query: 350 IDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADI 408 G N +G K++ + GG P + W GK+ G ++ +++++ +PT + A Sbjct: 309 GGGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLTSLEVFPTVIAAIGG 368 Query: 409 SIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD 468 +P D+ DG +LP L P + + W Sbjct: 369 KLPDDVIYDGFDMLPVLNG--ASSPREEMFW----------------------------- 397 Query: 469 YPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQ 527 + R D+ V + L+ L D+ +K +L+ +P+++ +++ Sbjct: 398 ---------KRRGDVAARVGDWKWVDSAAGKG--LFDLAHDIGEKKDLSKEHPEMLAKLK 446 Query: 528 GVVREFIDSSQPPLSE 543 + + Sbjct: 447 ARFDAWTAEMEAADPR 462 >UniRef50_Q8SZ72 RE14504p n=18 Tax=Neoptera RepID=Q8SZ72_DROME Length = 562 Score = 424 bits (1091), Expect = e-117, Method: Composition-based stats. Identities = 119/580 (20%), Positives = 200/580 (34%), Gaps = 110/580 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + + + KPNII + DDLG+ + F Sbjct: 1 MWFLWLICLLLPIIDAAEVEKSPAKPNIIFILADDLGFNDVGFHGS-------------- 46 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 A+ TP + +L G+ YVA + PSR+A+MTG+ P G+ Sbjct: 47 ------------AEIPTPNIDALAYSGIILNRYYVA-PICTPSRSALMTGKYPIHTGMQH 93 Query: 153 NT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 G+PL E LP+ GY + GKWHL Sbjct: 94 TVLYAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGH--------------------- 132 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSL--------FKNRERVPAK---GYISDQL 258 ++ P RGF +GF + Y + ++ +N +V Y +D + Sbjct: 133 -WKLKYTPLYRGFSSHVGFWSGHQDYNDHTAVENNQWGLDMRNGTQVAYDLHGHYTTDVI 191 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHL--PNDN-PAPD-QYQKQFNTGSQTADNYYAS 314 TD ++ V+ P LY+A+ A H P + P PD K + + + A Sbjct: 192 TDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAM 251 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQKGYKSQTYPGG 371 V +D V +I++QL+K+ +N+II+F+SDNG G N KG K+ + GG Sbjct: 252 VSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLKGVKNTLWEGG 311 Query: 372 THTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIP---KDLKLDGVSLLPWLQD 427 MW + ++ + +D+ PT L+AA ++DG S+ L Sbjct: 312 VRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGQPALSNLSKQIDGQSIWRALVQ 371 Query: 428 KKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY--PHNPNTEDLSQFSYTV 485 K L I + R D + P L + Sbjct: 372 DKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDWQLVG 431 Query: 486 RNNDYSLVYTVE---------------------------------NNQLGLYKLT-DLQQ 511 R+ + ++ + L+ + D + Sbjct: 432 RSRAGKALEALKMLPSRADQQRIRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCE 491 Query: 512 KDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 + NLA P+VV + + F ++ PP ++ + + Sbjct: 492 QYNLAKQYPEVVNALMTELERFNATAVPPSNKPADPRADP 531 >UniRef50_C5C586 Sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C586_BEUC1 Length = 478 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 123/559 (22%), Positives = 202/559 (36%), Gaps = 132/559 (23%) Query: 45 SDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEA 104 + +PNI+++ +DDLG+ L +F Sbjct: 3 ASNEQVRVPEPDRPNIVLVVVDDLGWRDLGCFGSTF------------------------ 38 Query: 105 AQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA-------- 156 TP + +L G RFT+ Y A V P+RA+++TG+ PAR GV + Sbjct: 39 --YETPHIDALAASGTRFTHSYAAAPVCSPTRASLLTGKYPARVGVTNWIGGHAIGALRD 96 Query: 157 ---QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 G+P E L + GY T VGKWHL Sbjct: 97 VPYFHGLPQDEYALARALRAGGYRTWHVGKWHLGGGR----------------------- 133 Query: 214 EWQPQNRGFDYFMGFHAAGTA-YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTL 272 P++ GFD +G A+G+ Y +P E P +++D+LTD A+ +V + Sbjct: 134 -HLPEHHGFDLNVGGSASGSPVSYYAPYGIGALEDAPDGEFLTDRLTDVAVDLVR--SSD 190 Query: 273 DQPFMLYLAYNAPHLPNDNPA--PDQYQKQFNTGS------------------------- 305 D PF+L L + A H P + PA ++Y+ + T Sbjct: 191 DAPFLLNLWHYAVHTPIEAPAHLVEKYRHKAETLGLPTHGPDAVEAGEHMPARHLRSERV 250 Query: 306 -----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNG 358 Q+ Y A + ++D V R++ L+ G+ D+T+I+FTSDNG +G N Sbjct: 251 RRRRIQSDPTYAAMLETLDGAVGRLVTALRDVGKLDDTLIVFTSDNGGLSTAEGSPTCNA 310 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLD 417 K GGT P + W G++ G D ++ DFYPT L AA ++ + +D Sbjct: 311 PLSEGKGWMADGGTRVPTIVSWPGRVPAGARSDLPFTSPDFYPTLLAAAGLTQLPEQHVD 370 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 GV+L P Q + W + Sbjct: 371 GVNLWPAWQG--APLDRGPIFWHYPHYSNQGGA--------------------------- 401 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 S VR+ + LV L+ + D+ + +++ VV + + ++ Sbjct: 402 ---PSAAVRDGRWKLVRHFGIEHDELFDVVADVSESHDVSGRRRDVVARLSVTLDSWLAD 458 Query: 537 SQPPLSEVNQEKFNNIKKA 555 + + + Sbjct: 459 VGALIPRRTTPPPDTFDRP 477 >UniRef50_A4AQQ7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteroidetes RepID=A4AQQ7_9FLAO Length = 596 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 126/528 (23%), Positives = 198/528 (37%), Gaps = 113/528 (21%) Query: 35 LKATKTNVAFSDFTPTEYST--KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 T S T+ + KPN++++ DD G+G L F+ Sbjct: 12 FILLLTLFIVSCEKKTKEKNEIQTKPNVVLIMTDDQGWGDLSFNG--------------- 56 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 STP + ++ G F N YV V P+RA ++TG+ AR GVYS Sbjct: 57 -----------NTNLSTPNIDAIAKNGASFQNFYV-QPVCSPTRAELLTGKYAARLGVYS 104 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 + + ET + E+F+ GY T A GKWH Sbjct: 105 TSTGGERFNSKETTIAEIFKKAGYKTTAYGKWHSGMQ----------------------- 141 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTL 272 + P +RGFD + GF + Y SP L N E V +G++ D LT++ + + K Sbjct: 142 PPYHPNSRGFDDYYGFTSGHWGNYFSPMLEHNGEIVKGEGFLVDDLTNKGLDFITENK-- 199 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQ-----------YQKQFNTGSQTADNYYASVYSVDQG 321 + PF LYL YN PH P P YQ A V ++D Sbjct: 200 NNPFFLYLPYNTPHSPMQVPNEYWERFEKKKLDMRYQGNEEESENFTRAALAMVENIDFN 259 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 + R+ +LK+ G +NTII++ SDNG NG +G K T GG +P F+ WK Sbjct: 260 MGRLTNKLKELGLEENTIIVYLSDNGP---NGWRWNGGMRGRKGSTDEGGVRSPFFIQWK 316 Query: 382 GKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 + ++ A+D PT A I+ P +DG L + DK +++ Sbjct: 317 NTIPKNKKISQIAGAIDILPTLTSLAGINQPTIKSIDGKDLKTLIADKNPTWESRHIVNH 376 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ 500 + ++R Y L +N+ Sbjct: 377 --------------------------------------WRGKTSIRTQKYRL-----DNE 393 Query: 501 LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQE 547 LY + D+ Q+ +L++ PQ+ + + ++ + E + Sbjct: 394 NRLYDMQNDIGQRTDLSSELPQLTDSLVNIKNIWLKDAVTVKPENKRP 441 >UniRef50_Q9NJU8 Sulfatase 1 n=2 Tax=Coelomata RepID=Q9NJU8_HELPO Length = 503 Score = 424 bits (1090), Expect = e-117, Method: Composition-based stats. Identities = 118/548 (21%), Positives = 211/548 (38%), Gaps = 82/548 (14%) Query: 29 AADDVKLKATKTNVAFSDFTPTEY---STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 + L A T A +D + G+PNI+ + DD G+ + + Sbjct: 2 CKCLLVLIAIITACAVADQSSASAGTRQDAGQPNIVFVLADDFGFHDVGYHGS------- 54 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 + TPTL +L GVR N YV + P+R+ +M+GR Sbjct: 55 --------------------EIHTPTLDALSASGVRLENYYV-QPICTPTRSQLMSGRYQ 93 Query: 146 ARFGVYS---NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 G+ N+ + +P L + + GY T VGKWHL Sbjct: 94 IHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLG--------------- 138 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK-----------NRERVPAK 251 F +E+ P NRGFD + G+ A Y+N ++ R Sbjct: 139 -------FYKQEYLPWNRGFDTYFGYLNAAEDYFNHNVPWRQVRYLDLRDNNGPVRNETG 191 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT-ADN 310 Y + T +AI VV ++ +P LYLAY + H P + P++Y+ ++ + Sbjct: 192 QYSAHLFTGKAIDVV-QSHNTSKPLFLYLAYQSVHAPLE--VPEKYEHKYRNITDKNRRT 248 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPG 370 + V ++D+GV + + LK G ++NT+++F++DNG I N +G+K+ + G Sbjct: 249 FAGMVSALDEGVANLTQALKDKGLWNNTVLIFSTDNGGQIHAG-GNNYPLRGWKASLWEG 307 Query: 371 GTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G H F+ + G LI D++PT + A ++ LDG + + ++ Sbjct: 308 GFHGVGFVSGGALKRSGAVSKGLIHVSDWFPTLVTLAGGNLNGTKPLDGFNQWDTISNET 367 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 L I ++ +P + N + V D Sbjct: 368 PSPREILLHNIDILYP---QKGVPLYSNTWDTRVRAAIRVGDYKLITGDPGNGSWVPPPD 424 Query: 490 YSLVYTVE-----NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 L + E + L+ +T D + ++L++ P V + ++ +F +++ PP Sbjct: 425 GHLYFVPEIQESAAKNVWLFNITADPNEHNDLSSEKPLEVLRLLQILVQFNNTAVPPRYP 484 Query: 544 VNQEKFNN 551 + + Sbjct: 485 APDPRCDP 492 >UniRef50_B7QJZ0 Arylsulfatase B, putative n=9 Tax=Ixodes scapularis RepID=B7QJZ0_IXOSC Length = 529 Score = 423 bits (1089), Expect = e-117, Method: Composition-based stats. Identities = 116/565 (20%), Positives = 195/565 (34%), Gaps = 105/565 (18%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 + S+ PNII + DDLG+ + F Sbjct: 1 MCLILALQTAVFSKSSTVPPNIIFILADDLGWADVSFRG--------------------- 39 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTD 155 Q TP L L +G+ N YV + PSR A+M+G P G+ Sbjct: 40 -----DPQIPTPNLDVLASQGIILNNYYV-QPLCAPSRGALMSGLYPIHTGLQHLVPGPG 93 Query: 156 AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 G+P T +PE +N GY T +GKWHL + E + Sbjct: 94 EPWGLPTNLTIMPEYLKNLGYATHMIGKWHLG----------------------YHKESY 131 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSP---------SLFKNRERVPAKG--YISDQLTDEAIG 264 P RGFD F G+ G YY+ ++N V +G Y ++ T +A Sbjct: 132 TPTRRGFDSFYGYLNGGEDYYDHTILWSNASGLDFWENTTPVRNEGNHYSTELFTKKAQS 191 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLP---NDNPAPDQYQKQFNTGSQ-TADNYYASVYSVDQ 320 ++ + +P LY ++ A H + AP F + + +VY +D+ Sbjct: 192 LI-KHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNRSIHAGAVYELDK 250 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQKGYKSQTYPGGTHTPMF 377 V ++E L K G N+I++F++DNG + G N +G K + GG F Sbjct: 251 SVGLVMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGYNWPLRGSKETNWEGGARGAAF 310 Query: 378 MWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKN 436 +W + G ++++ D+ PT AA ++ +DG + L + + + Sbjct: 311 VWSPLLFKSGRLSNQMMHITDWLPTLYSAAGGNVSTLGNIDGKDMWKALSEDLESPRQEV 370 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL------------------ 478 L I + F D P Sbjct: 371 LINIDPIENSSALIVGRHKVVLGSFNEGSHDMRMKAPGGSRPVDGLDQMMLSSRTGKVLK 430 Query: 479 -------------SQFSYTVRNNDYSLVYTV-ENNQLGLYKLT-DLQQKDNLAAANPQVV 523 + VR + Y+ + + L D + +NLAA+N + Sbjct: 431 DFYNVRQLTVRPNWRNEAVVRCDRYAPPNNFVAASPPYYFDLEHDPCELNNLAASNVTEL 490 Query: 524 KEMQGVVREFIDSSQPPLSEVNQEK 548 +E+ ++E+ PP ++ + Sbjct: 491 EELIKKIKEYAKGMVPPANKPLDPR 515 >UniRef50_A6CA27 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CA27_9PLAN Length = 491 Score = 423 bits (1088), Expect = e-117, Method: Composition-based stats. Identities = 108/536 (20%), Positives = 186/536 (34%), Gaps = 75/536 (13%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 + L +PNII+ DD G+G+ F Sbjct: 8 FIPLLLACCLTGSGSLLHAAEQQSTRPNIILCMTDDQGWGETGFMG-------------- 53 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 TP L + +R Y A V P+R + +TGR P RF + Sbjct: 54 ------------HPILKTPHLDEMAASSLRLDRFYAAAPVCSPTRGSFLTGRHPNRFACF 101 Query: 152 SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 S + E + E ++ GY T GKWHL + + Sbjct: 102 SWGHT---LRPQEVTVAEAVKSVGYTTGHFGKWHLGSVQSNS------------------ 140 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 P N GFD ++ ++ Y N P + N KG S D A+ + +A Sbjct: 141 --PVSPGNSGFDEWV---SSPNFYENDPYMSHNGVVKQLKGESSRVTVDAALDFIKQADK 195 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 +PF+ + + PH P++ + + + NY+ + VD+ + + QL+ Sbjct: 196 DKKPFLAVIWFGNPHTPHEAV--SELKDLYPDQKPNFQNYFGEISGVDRAMGHLRSQLRD 253 Query: 332 NGQYDNTIILFTSDNGA------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL- 384 G +NT++ FTSDNG + G G+K + GG P + W + Sbjct: 254 LGLAENTLLWFTSDNGPRPPQFKTEEARSQATGGLAGFKGNLWEGGVRVPSLIEWPAVIK 313 Query: 385 QPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS 444 +P + +D YPT L + +LDGVSLLP ++ + W Sbjct: 314 KPEVSNVPCGTIDIYPTVLAMTGAKVSHQPQLDGVSLLPLIEGQMTARGRPMGFWTYPEK 373 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNP----------NTEDLSQFSYTVRNNDYSLVY 494 + + P +D + + + ++ L+ Sbjct: 374 GHPKRSTDILLALQKQQSPGHPNPKGPAPDADAGSLKTQYPKDKLPGAAALVDGNFKLLK 433 Query: 495 ---TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 + LY L D +K +L+ +PQ +K+M+ + ++ S L+ + Sbjct: 434 METKRGKPRYTLYDLVKDPAEKQDLSQVDPQRLKKMKAALADWQHSVVDSLNGKDY 489 >UniRef50_A4AVA7 Aryl-sulphate sulphohydrolase n=2 Tax=Bacteroidetes RepID=A4AVA7_9FLAO Length = 487 Score = 423 bits (1088), Expect = e-117, Method: Composition-based stats. Identities = 134/556 (24%), Positives = 220/556 (39%), Gaps = 100/556 (17%) Query: 16 LILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPF 75 L LA +A + + L +S KPNI+++ +DDLGY + F Sbjct: 10 LFLAEDKSAKMRFSPTHLFLLVLSIVFLWSC----GDKRIRKPNIVLINIDDLGYKDVGF 65 Query: 76 DKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPS 135 + TP + L G+ FTNGY A PS Sbjct: 66 MGSEY--------------------------YETPNIDILAKAGMIFTNGYAAASNCAPS 99 Query: 136 RAAIMTGRAPARFGVYSNTDAQDG---------------IPLTETFLPELFQNHGYYTAA 180 RA++MTG+ R G+Y+ ++ G + LPE+ Q + Y T Sbjct: 100 RASLMTGKWTPRHGIYTVNSSERGKSKDRKIIPSTNTSTLSKESMVLPEVLQLNNYKTIH 159 Query: 181 VGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS 240 GKWHLS+ P + GFD +G G P Sbjct: 160 AGKWHLSES---------------------------PLDYGFDINIGGGHNGHPKSYYPP 192 Query: 241 LFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQ 298 + R P K Y++D + + I V+++ PF L A A H P +Y Sbjct: 193 YGNVKLRSPNKEYLTDLIARQTIEVLNKTIE---PFFLNYAPYAVHTPIQPVDSILSKYN 249 Query: 299 KQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG 358 ++ Q Y V ++D+ + ++ LK NG Y NT+I+FTSDNG + + Sbjct: 250 RKTAWKGQNNAKYATMVENLDRNIGLLIAALKDNGHYKNTLIIFTSDNGGL--YGITKQQ 307 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLD 417 + K Y GG P F W K++ + IS +D +P+ ++AA IS + LD Sbjct: 308 PLRAGKGSYYEGGIREPFFFMWNDKIKSNTKSNVPISHLDLFPSIVEAAGISYNETS-LD 366 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G SLLP L+ + + L W + + + N ++ Sbjct: 367 GNSLLPILKQESTKLK-RPLFWHFPIY-----------------LEAYNQNDNENRDSLF 408 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 ++ +R D+ L Y ENN++ LY LT D+ +++NL +P+ K + ++ + Sbjct: 409 RTRPGSVIREGDWKLHYYFENNEMELYNLTYDVGERNNLINTHPKKAKVLLQQLKAWWKE 468 Query: 537 SQPPLSEVNQEKFNNI 552 + P+ E ++ ++ Sbjct: 469 TSAPIPEQLNPEYASL 484 >UniRef50_P15848 Arylsulfatase B n=32 Tax=Euteleostomi RepID=ARSB_HUMAN Length = 533 Score = 423 bits (1088), Expect = e-116, Method: Composition-based stats. Identities = 108/548 (19%), Positives = 186/548 (33%), Gaps = 107/548 (19%) Query: 50 TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKST 109 + P+++ L DDLG+ + F + T Sbjct: 37 SGAGASRPPHLVFLLADDLGWNDVGFHGS---------------------------RIRT 69 Query: 110 PTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETF 166 P L +L GV N Y + PSR+ ++TGR R G+ +PL E Sbjct: 70 PHLDALAAGGVLLDNYY-TQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 128 Query: 167 LPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM 226 LP+L + GY T VGKWHL +E P RGFD + Sbjct: 129 LPQLLKEAGYTTHMVGKWHLGM----------------------YRKECLPTRRGFDTYF 166 Query: 227 GFHAAGTAYYNSPS--------------LFKNRERVPAK---GYISDQLTDEAIGVVDRA 269 G+ YY+ F++ E V Y ++ T AI ++ Sbjct: 167 GYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALIT-N 225 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF-NTGSQTADNYYASVYSVDQGVKRILEQ 328 ++P LYLA + H P P++Y K + + +Y V +D+ V + Sbjct: 226 HPPEKPLFLYLALQSVHEPLQ--VPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAA 283 Query: 329 LKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN 388 LK +G ++NT+ +F++DNG N +G K + GG F+ Q G Sbjct: 284 LKSSGLWNNTVFIFSTDNGGQTLAG-GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGV 342 Query: 389 YD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWF 447 + +LI D+ PT + A LDG + + + + L I Sbjct: 343 KNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNIDPNFVDS 402 Query: 448 DEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV-------------- 493 + DD + + +R+ ++ L+ Sbjct: 403 S-------PCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTGYPGCGYWFPPPS 455 Query: 494 ---------YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 L L+ + D +++ +L+ P +V ++ ++ + S P Sbjct: 456 QYNVSEIPSSDPPTKTLWLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFP 515 Query: 544 VNQEKFNN 551 + + Sbjct: 516 AQDPRCDP 523 >UniRef50_A6DG54 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DG54_9BACT Length = 469 Score = 423 bits (1087), Expect = e-116, Method: Composition-based stats. Identities = 116/508 (22%), Positives = 191/508 (37%), Gaps = 74/508 (14%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 K + PN++V+ DD G+ G+ Sbjct: 6 KQLSLLITLIFIGLYIQVQAAPPNVVVIYFDDTGWKDFGCFGGA---------------- 49 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT- 154 T + +L G+RFT Y PSRA ++TGR P R G+YS Sbjct: 50 -----------VDTTHIDNLAKNGMRFTEYYAPAPNCSPSRAGLLTGRFPFRLGMYSYRS 98 Query: 155 -DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + +P +E + E + GY T GKWHL + P Sbjct: 99 KNTPMHLPDSEITIAEALKTKGYATGMFGKWHLGNLDGKSHP------------------ 140 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDRAKTL 272 P +GFDY++ + N SL +N + V G+ + + DEA + + + Sbjct: 141 --TPSEQGFDYWLACDNNLIKH-NPKSLIRNGKPVGKIAGWAAQVVADEANEWMKKQTS- 196 Query: 273 DQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKN 332 PF Y+A++ H P D P + ++ Y D V IL+ L Sbjct: 197 --PFFAYIAFSETHSPLDAPEELITKYIERGENKKRATYRGMTEYSDAAVGSILKTLDDM 254 Query: 333 GQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDK 391 G DNT++ SDNG + +G KS T+ GG P + W GK++PG+ Y+ Sbjct: 255 GVSDNTLVFLASDNGPTSED---SCEGLRGKKSYTWEGGIRVPAIIRWPGKVKPGSEYND 311 Query: 392 LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEEN 451 + +D PT D +PK +DGVS+ L+ K L++ S Sbjct: 312 PVGGIDLLPTLCDIVGAELPK-RHIDGVSIRSVLEGKPFKRNTPILSFFYRTSPAASMRM 370 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 D + ++ + S+++ D +V + + LY + DL Sbjct: 371 --------------GDYVLIGHSDDEDRKKSHSMSAEDMPIVKSSKLVSFELYNIKNDLG 416 Query: 511 QKDNLAAANPQVVKEMQGVVREFIDSSQ 538 Q+ N+AA P+ + E++ ++ + Sbjct: 417 QEKNIAATYPEKLAELRKIMLALHHDAI 444 >UniRef50_B1KFX9 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KFX9_SHEWM Length = 548 Score = 422 bits (1086), Expect = e-116, Method: Composition-based stats. Identities = 125/604 (20%), Positives = 225/604 (37%), Gaps = 127/604 (21%) Query: 8 SVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGK-------PNI 60 ++ S ++ ++ A+ + F + K +K VA + + + K PNI Sbjct: 6 ALYSVALIVLSAALLWTFRFDVLVTLASKKSKKPVAENQEINWQKGPEQKDSTKTDLPNI 65 Query: 61 IVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGV 120 +++ DD+G + G TP + L +G Sbjct: 66 VLILADDMGINDVSTFGGGM--------------------------IETPNIDKLAAKGA 99 Query: 121 RFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA------------------------ 156 FTNGY H PSRAA++TGR R G + Sbjct: 100 LFTNGYSGHANCAPSRAALLTGRDATRTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYS 159 Query: 157 -----------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G+P +E +PE+ + GY+T +GKWHL + + +P + + Sbjct: 160 AEADATNPTYDNRGLPGSEILIPEILKESGYHTMHIGKWHLGRSPEM-MPNAQGFDESLM 218 Query: 206 NFTTFSAEEWQPQN-------RGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQL 258 + P++ G D F+ A + E GY++D Sbjct: 219 MDSGLYLPVDHPESVNAPVESSGLDRFIW------ATMRYSVNWNGGEIFKPNGYLTDYF 272 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 T+EA ++ ++PF LYLA+ PH P D Y+ + Y A + S+ Sbjct: 273 TEEAEKAIEAN--ANRPFFLYLAHWGPHNPVQAKRAD-YEAVGDIQPHNKRVYAAMLRSI 329 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-NGAQKGYKSQTYPGGTHTPMF 377 D+ V+R++ +L+K G DNTI++ +SDNG + N +G+K+ + GG P Sbjct: 330 DRSVERVMAKLEKQGIADNTIVILSSDNGGADYVAINDLNKPYRGWKNTFFEGGIRVPFS 389 Query: 378 MWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ-GEPHK 435 + W + ++ ++ +D PT ++ A+ +P+D ++DGV + P Q + + P Sbjct: 390 VTWPNVIDESTVIEEPVNHIDLMPTIINMANADLPQDREIDGVDIAPLWQGQPELERPQN 449 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 + W T V++ + L Sbjct: 450 AMFWFTGDY--------------------------------------RVVQSKGWKLQQN 471 Query: 496 VENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 ++ Q L+ L D ++ NLA + + E+ ++ ++ + E I K Sbjct: 472 PKSGQTFLHNLNVDPTEQKNLADSESAKLAELTKLIDAHFANAVDVIGESTIAAPITIDK 531 Query: 555 ALSE 558 L E Sbjct: 532 HLGE 535 >UniRef50_A6DMW2 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW2_9BACT Length = 479 Score = 422 bits (1086), Expect = e-116, Method: Composition-based stats. Identities = 115/537 (21%), Positives = 192/537 (35%), Gaps = 123/537 (22%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 K + + + +PNI+ + DD+G L + Sbjct: 6 KNIFLVCTIALASLNLLNAAQRPNILFIVADDMGIMDLGVYGSDY--------------- 50 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD 155 TP L L + +RF Y A V P+R AI+TGR P R + Sbjct: 51 -----------YLTPNLNKLASQSMRFDRAYAASHVCSPTRGAILTGRYPQRIHLTDALP 99 Query: 156 ------AQDGIPLTET--------FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTR 201 IP + Q + Y TA GKWHL ++ Sbjct: 100 WDRLYKNPKMIPPNHVKELSLKLPTFARVLQKNDYRTAMFGKWHLGNEERFFTGKEH--- 156 Query: 202 DYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDE 261 + GFD G AY KG ++LT+ Sbjct: 157 ----------------KAYGFDEAFGVSGKAKAY--------------DKG--VNELTER 184 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP--APDQYQKQFNTGSQTADNYYASVYSVD 319 + + K +PFML L ++ PH+P P A Y Q Y + D Sbjct: 185 TLRFLKENKK--KPFMLCLMHHVPHVPVACPPYAKALYDSVPKGKHQKNSKYAGMISHFD 242 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMW 379 +K++L+ L+ G DNT+++ TSDNG + L N G K Y GGT P+ + Sbjct: 243 NSIKKVLDALRALGLDDNTVVIVTSDNGGL--SNLSSNKPYNGGKGSLYEGGTRVPLLIR 300 Query: 380 WKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT 438 W GK+ PG+ +K ++ + DF+PT L+ A + + + LDG S++P L+ K G+ + L Sbjct: 301 WPGKITPGSVNKSVVISNDFFPTFLELAGLPLMPEAHLDGKSMMPLLKGKTLGK--RTLY 358 Query: 439 WITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN 498 W + ++ + D L++ +E+ Sbjct: 359 WHFPHRG----------------------------------TPGSSIIDGDLKLIHKIES 384 Query: 499 NQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP----PLSEVNQEKFN 550 + ++ L D + +NL P+ +Q ++ + P + + ++ Sbjct: 385 DTYEMFDLNSDPYEANNLFEKQPEQASRLQKMLARHLKEVAAQEMSPNPQWDPKRPK 441 >UniRef50_A6LCL3 Arylsulfatase A n=9 Tax=Bacteroidales RepID=A6LCL3_PARD8 Length = 476 Score = 422 bits (1085), Expect = e-116, Method: Composition-based stats. Identities = 117/518 (22%), Positives = 192/518 (37%), Gaps = 93/518 (17%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + NI+++ +DD+GYG F+ Sbjct: 1 MKNKLMLLTAASAFSMAGIAADYTNIVLINLDDVGYGDFSFNG----------------- 43 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS-- 152 A +TP + + EGVRFT+ V +SG SRA ++TG P R G Sbjct: 44 ---------AYGYTTPNIDKMAAEGVRFTHFLVGQPISGASRAGLLTGCYPNRIGFSGAP 94 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 D+ G+ E + E+ + GY TA GKWHL Sbjct: 95 GPDSNYGVHPEEMTIAEVLKQKGYSTAIFGKWHLGSQ----------------------- 131 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSP-----------SLFKNRERVPAKGYISDQLTD- 260 +E+ P GFD + G + + P + E + + TD Sbjct: 132 KEFLPLQNGFDEYYGLPYSNDMWPFHPQQGEVFNFPDLPTYDGNEIIGYNTDQTRLTTDY 191 Query: 261 --EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 ++ + + K ++PF LYLA+N PH+P G Y + + Sbjct: 192 TTRSVNFIKKNK--NKPFFLYLAHNMPHVPLAVSDK-------FKGKSEQGLYGDVMMEI 242 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPM 376 D V I + L++ G DNT+++ TSDNG G + K+ T+ GG P Sbjct: 243 DWSVGEIFKALRELGLEDNTLVILTSDNGPWTNYGNHAGSAGGLREAKATTFDGGNRVPC 302 Query: 377 FMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 M+WKGK PG +KL S +D PT + +P K+DGVS+LP ++ KK P + Sbjct: 303 IMYWKGKTLPGTTCNKLASNIDLLPTFAEITQAPLPP-RKIDGVSILPLIEGKKDANPRE 361 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 + + + + F + P N + Sbjct: 362 SFVYYYRKNDLEAVTDGMFKLVFPHKYVTYGAYEPGNDGQPGK--------------LTN 407 Query: 496 VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 +E + +Y L D ++ N+ P+ ++ + + Sbjct: 408 LEIMKPEMYDLRRDPGERYNVITQYPEEAAKLMKIADQ 445 >UniRef50_A3I2R7 Arylsulfatase n=2 Tax=Bacteroidetes RepID=A3I2R7_9SPHI Length = 589 Score = 421 bits (1084), Expect = e-116, Method: Composition-based stats. Identities = 128/554 (23%), Positives = 199/554 (35%), Gaps = 127/554 (22%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 + T + + PNII++ DD GYG F Sbjct: 10 LSFLTILLLVFCASKFTFAQKPPNIILIITDDQGYGDFGFTG------------------ 51 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD 155 STPT+ L + FTN YV V P+RA++MTGR R G+ + Sbjct: 52 --------NKHVSTPTIDQLAENSFEFTNFYV-SPVCAPTRASLMTGRYSLRTGIRDTYN 102 Query: 156 AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 + E + EL Q Y + GKWHL Sbjct: 103 GGAMMSPDEITIAELLQKSDYTSGIFGKWHLGDNY-----------------------PM 139 Query: 216 QPQNRGFDYFMGFHAAGTAY-------------YNSPSLFKNRERVPAKGYISDQLTDEA 262 +P ++GFD + + G Y P L+ N + +GY SD A Sbjct: 140 RPSDQGFDESLIHLSGGMGQVGDFTTYFQKDRSYFDPVLWHNNRQESYQGYCSDIFASAA 199 Query: 263 IGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT------------------- 303 I +++ K DQPF YL++NAPH P P++Y +++ Sbjct: 200 IEFIEKNK--DQPFFTYLSFNAPHTPLQ--VPEEYYQKYKNIDTSTGYESDERPFYPMSD 255 Query: 304 -GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKG 362 + A YA V ++D +K + +LK+ D TII+F +DNG L +G Sbjct: 256 SQKEEARKVYAMVENIDDNLKNLFAKLKELEIEDETIIIFLTDNGPQQQRYL---AGLRG 312 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSL 421 K Y GG TP+ + KL + L + +D PT D I +P D K+DG SL Sbjct: 313 LKGNVYQGGIRTPLLIHIPEKLSENRKINTLSAHIDILPTIADLVGIQLPLDRKIDGKSL 372 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 LP L + +++L + NI Sbjct: 373 LPLLIGEVDSFENRSLFSYWNRKFPEKYSNI----------------------------- 403 Query: 482 SYTVRNNDYSLV----YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDS 536 +++N+++ LV Y LY L D ++ NL + E++ + + Sbjct: 404 --SIQNSEWKLVGKTDYDASIEDFQLYNLKEDPYEQSNLITSKISKGLELKNELDQLYLE 461 Query: 537 SQPPLSEVNQEKFN 550 + +N K + Sbjct: 462 LISEENLINPPKIH 475 >UniRef50_A6DUI7 Putative exported uslfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DUI7_9BACT Length = 516 Score = 421 bits (1084), Expect = e-116, Method: Composition-based stats. Identities = 126/576 (21%), Positives = 208/576 (36%), Gaps = 131/576 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTK-GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 +K + + F + K N+I + DDLG+ + F+ F Sbjct: 7 MKPVLFLILIAPFALFAKEAQHEKLNVIFMIADDLGWMDVGFNGNKF------------- 53 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP L L EG+ FTNGY + + P+RAA TG++PA G+ Sbjct: 54 -------------VETPNLDKLASEGMVFTNGYASGPLCSPTRAAFHTGKSPATMGINVP 100 Query: 154 TD-----------------------------------AQDGIPLTETFLPELFQNHGYYT 178 GI E + + Q+ Y T Sbjct: 101 VTKGLKGKTPGAYPMGGDKLKTKVGQRDIRHRLLPAYTNTGIDPQEVTIADCLQSADYVT 160 Query: 179 AAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM-GFHAAGTAYYN 237 A++GKWH+ S + P+ G+D + G G + Sbjct: 161 ASIGKWHMGLSH--------------------SDPKADPREYGYDINIAGGDYHGPPSWF 200 Query: 238 SPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQY 297 SP + + P +++++LT EAI ++ K D+PF LYL Y H P+ ++Y Sbjct: 201 SPYRIHSLKNGPKGEHLTERLTREAINFMEENK--DKPFFLYLPYYQVHSPHGAR--EEY 256 Query: 298 QKQFNTGSQT----ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA----- 348 K+F+ Y A V +D+ V I + LKK+G NT+++F+SDNG Sbjct: 257 IKKFDHKQTPDSKMNSIYAAMVMHLDESVGLINDYLKKSGLDKNTLLIFSSDNGPLVYQR 316 Query: 349 ------VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPT 401 + L +G+K Y GT P G + + + I D Y T Sbjct: 317 AGNQVLPRNTRLTFAEPLRGWKGSVYEAGTRVPYIFKLPGVIPANSISQTPIITHDLYAT 376 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 + +++P++ K++G SL P L + + +L W W Sbjct: 377 ICEFTGVAVPEEQKVEGESLFPLLT-QSKALQRTSLFWHNPKYSWS-------------- 421 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN---QLGLYKL-TDLQQKDNLAA 517 N + + + +R Y L+Y E L LY L D + NL + Sbjct: 422 ---------LNSDILWADRPACAIRKGKYKLIYYFERKGERTLELYDLDNDQGETKNLVS 472 Query: 518 ANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIK 553 P+ E++ + ++D +Q N + Sbjct: 473 DLPEKALELETELLAWLDQTQAWKPIDNPDYDAKYD 508 >UniRef50_D0TQQ7 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TQQ7_9BACE Length = 853 Score = 421 bits (1084), Expect = e-116, Method: Composition-based stats. Identities = 111/556 (19%), Positives = 195/556 (35%), Gaps = 152/556 (27%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 F+ + KPN++++ DD GY L Sbjct: 11 LFSAGLLQARQKPNVVIIFTDDQGYQDLGCYGS--------------------------P 44 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN-TDAQDGIPLTE 164 TP + + EG++ T+ YV+ VS SRA ++TGR R GV +G+P E Sbjct: 45 LIQTPFIDRMAKEGIKLTDFYVSSSVSSASRAGLLTGRLNTRNGVKGVFFPESEGMPSEE 104 Query: 165 TFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDY 224 L E + GY T GKWHL + + P ++GFDY Sbjct: 105 ITLAEALKEQGYTTGCFGKWHLGDL-----------------------KGHLPTDQGFDY 141 Query: 225 FMGFHAAGTAY-----------------------------------------YNSPSLFK 243 + G + Y N+ LF+ Sbjct: 142 YYGIPYSNDMYIGPSQQFASNVTFREGYNLSKAKEDQEFVRTSSRADIKKRLNNASPLFE 201 Query: 244 NRERVP---AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 + + + + + D AI ++ +QPF +Y+ + PH+P + + Sbjct: 202 GDKIIEYPCDQSTTTRRYFDHAIDFIENN--PEQPFFVYITPSMPHVPL-------FASE 252 Query: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNG 358 G Y V +D V R+++ L K +NT+++F SDNG + Sbjct: 253 QFKGKSKRGLYGDVVEEIDWNVGRLIDYLDKKKLAENTLVIFASDNGPWLSFKEDGGSAE 312 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLD 417 +G K Y GG P + WKG + G D +++++D +PT + A K+D Sbjct: 313 PLRGGKFSYYEGGVRVPCIIRWKGSIPAGVTSDAIVASIDLFPTIMHYAGCQ-SFKQKID 371 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G+++ +L++ + + H Sbjct: 372 GINISSFLKNPSLRLRDEYVYVKGGEVHG------------------------------- 400 Query: 478 LSQFSYTVRNNDYSLV------YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVV 530 +R D+ + + + L+ L D+ + +NL P VKE+Q V+ Sbjct: 401 -------IRKGDWVYLPKTGNSKFKKGDVPELFNLKQDIGESNNLHLQYPNKVKELQEVM 453 Query: 531 REFIDSSQPPLSEVNQ 546 +++ +S P S++ Sbjct: 454 KKYQSTSTMPYSQIRD 469 >UniRef50_Q7UYH3 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UYH3_RHOBA Length = 598 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 113/546 (20%), Positives = 196/546 (35%), Gaps = 118/546 (21%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + ++ + + E + +PN+IV+ DD G G F Sbjct: 18 SALSSTSVSAAETNAAERPNVIVIMSDDQGVGDYGFMG---------------------- 55 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 TP+L + + + YV++ V P+RA++MTGR R + Sbjct: 56 ----NPIIRTPSLDKMRTQSGYLSRFYVSN-VCAPTRASLMTGRYNYRTRCIDTYVGRAM 110 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + E L E GY T GKWHL +P + Sbjct: 111 MDPDEVTLAERLSEAGYQTGIFGKWHLGDNY-----------------------PMRPMD 147 Query: 220 RGFDYFMGFHAAGTAY----------YNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRA 269 +GFD + G Y P+LF N + V +GY +D D AI + Sbjct: 148 QGFDESLIHRGGGIGQPSDPIGAEGKYTDPTLFHNGDEVAMEGYCTDIFFDAAIDFARKQ 207 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQY-----------------QKQFNTGSQTADNYY 312 +PF Y+A NAPH P D+ + Y K+ + Sbjct: 208 TESGKPFFTYIATNAPHGPFDDVPNELYEEYKQVDFTPILVSDLPAKRRDAEFDKLARIS 267 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 A + ++DQ V ++ L + +NTI+L+ +DNG + +G K+Q GG Sbjct: 268 AMITNIDQNVGKLFASLDELKIRENTIVLYLNDNGPNSRRYVGN---MRGNKTQVDDGGI 324 Query: 373 HTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 +P+ W K+ D +++ +D PT LDA ++ + LDG S LP L + Sbjct: 325 RSPLLFHWPAKVDASDTTDVMLAHIDLMPTLLDACGVAASESPALDGKSFLPLLTGE--- 381 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + W+ + + P + + + + + Sbjct: 382 ------------------MDYSQWETRLIAFQTHRGNVPQKFH-------HFAMHEHPWK 416 Query: 492 LVYTVENN--------QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 LV+ +L LY L D +Q+++LA +P++V+ ++ ++ D Sbjct: 417 LVHPSGFGKERFEGEPKLELYNLEDDPKQQNDLADKHPEIVQRLKQAYSKWFDDVSSTRP 476 Query: 543 EVNQEK 548 + Sbjct: 477 DNYAPP 482 >UniRef50_D2R457 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R457_9PLAN Length = 516 Score = 421 bits (1083), Expect = e-116, Method: Composition-based stats. Identities = 121/564 (21%), Positives = 193/564 (34%), Gaps = 84/564 (14%) Query: 28 HAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 A + T + + PNI+ + DDLGYG + Sbjct: 3 WCAMVRSTGVWLAALLLIGSTALVRAEELPPNIVFILCDDLGYGDVGCFG---------- 52 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 + TP + +L +G+R Y V PSR ++TG Sbjct: 53 ----------------QKKTRTPHIDTLARDGMRLIQHYSGAPVCAPSRCVLLTGLHSGH 96 Query: 148 FGVYSN----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDY 203 V N + Q + LP L GY A GKW L + P + + Sbjct: 97 SQVRDNREAQPEGQYPLAEGTVTLPGLL--EGYVCGAFGKWGLGGPESSGKPLAQGFDRF 154 Query: 204 HDNFTTFSAEEWQPQN-RGFDYFMGFHA---AGTAYYNSPSLFKNR---ERVPAKGYISD 256 A + PQ+ D + A + + + +N ER Y +D Sbjct: 155 FGYNCQRQAHNYYPQHLWSNDEKVLLKNPPFAAHQKFPADADPQNPAAFERYRGPDYAAD 214 Query: 257 QLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNT----------- 303 ++++A+ +D +PF LY A PHL P +Y +F+ Sbjct: 215 LISEQALKFIDEHHQ--KPFFLYYASPVPHLALQVPEDSLKEYAGEFSETPYLGERGYLP 272 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG----- 358 Y A + +D+ + RILE+L+K G TI++F+SDNG + D + Sbjct: 273 HPTPRAAYAAMITRMDREIGRILERLEKYGLQRRTIVVFSSDNGPLYDKLGGTDADFFQS 332 Query: 359 --AQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLK 415 +G K Y GG P + + G + G L D+ PT L A +S + Sbjct: 333 ALDLRGRKGSVYEGGIRVPTIVKFPGVVPAGTTSSTLGGFEDWMPTLLSLAGMSTKIPEQ 392 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 DG L P L+ Q P + L W + + Sbjct: 393 ADGRDLSPSLRGDWQA-PREFLYREFPGYGGQQFVRSGKWKAVRQNLVRP---------- 441 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGV-VREF 533 V L E + LY L D + N+AA +P+VV ++ + +RE Sbjct: 442 ---------VPTGKKKLAEWKEPLAIELYDLEADPTESTNVAAEHPKVVAKLHAIMLREH 492 Query: 534 IDSSQPPLSEVNQEKFNNIKKALS 557 S + + ++ E+ K A Sbjct: 493 QPSVEFKMPRLDDEQAAAHKAAGK 516 >UniRef50_Q0BZE9 Sulfatase family protein n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BZE9_HYPNA Length = 459 Score = 421 bits (1082), Expect = e-116, Method: Composition-based stats. Identities = 120/490 (24%), Positives = 188/490 (38%), Gaps = 98/490 (20%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 + T + PNII++ DDLG+G + + AA Sbjct: 27 ATSETAPAAAKPPNIIIIMADDLGWGDISLNG--------------------------AA 60 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN--TDAQDGIPLT 163 TP + + EG++ T+ Y V PSRAA++TGR P R G+ +QDG+P Sbjct: 61 LIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAALLTGRYPIRSGMQHVIFPHSQDGLPAE 120 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFD 223 E + E+ +N GY T VGKWHL EE+ P N+GFD Sbjct: 121 EITISEMLKNAGYRTGMVGKWHLGHQ-----------------------EEYWPTNQGFD 157 Query: 224 YFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ---LTDEAIGVVDRAKTLDQPFMLYL 280 +F G + L++ +E + + S A ++ + D+PF LY Sbjct: 158 WFYGVPYSNDMAPF--DLYRGKEIIESPADQSQLSLNYAKAAKEFIED--SSDKPFFLYY 213 Query: 281 AYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 A PH+P P +G+ A Y V +VD G+ +L+ L + G D+T+I Sbjct: 214 AETFPHIPLFVPED-------RSGTSDAGLYGDVVETVDAGIGIVLDTLDEAGVADDTLI 266 Query: 341 LFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFY 399 +FTSDNG + G +G K +T+ GG P W G + G+ ++ +D Sbjct: 267 IFTSDNGPWFE---GSAGEFRGRKGETHEGGFRVPFLARWPGHIPKGSVSHEMAMNIDLL 323 Query: 400 PTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYH 459 PTA + ++P D +DG L L PH L + + F + Sbjct: 324 PTAASLSGATLPADRVIDGKDLTSLLTAGA-PTPHDILFFFDGNEI-VGARDARFRLVLN 381 Query: 460 KFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAA 518 F R S + + L+ L D Q+ + Sbjct: 382 TFYRTMSVPFEYFGTAL--------------------------LFDLEKDPQESFSFMRE 415 Query: 519 NPQVVKEMQG 528 P + ++ Sbjct: 416 YPGEAERLKS 425 >UniRef50_A6CB33 Arylsulfatase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CB33_9PLAN Length = 590 Score = 421 bits (1082), Expect = e-116, Method: Composition-based stats. Identities = 114/560 (20%), Positives = 193/560 (34%), Gaps = 115/560 (20%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 ++ V T+ N+I++ DD G F Sbjct: 4 LSHCRIVYALLIVLTVSLLATQLQAAQHTNVILIMTDDQGGWDYGFQG------------ 51 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 +TP L ++ G R + YV V P+RA +MTGR R Sbjct: 52 --------------NKHLNTPHLDAMAANGARLSRFYV-SPVCTPTRANLMTGRYNYRTR 96 Query: 150 VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + + TE + E GY T GKWHL Sbjct: 97 AIDTYIGRAMLEPTEVTIAEALAPAGYRTGIFGKWHLGDSY------------------- 137 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAY----------YNSPSLFKNRERVPAKGYISDQLT 259 +PQ++GF + G Y P LF N E+ +GY +D Sbjct: 138 ----PLRPQDQGFQEVLVHRGGGIGQPSDPPEGAGKYTDPVLFHNGEKKQMQGYCTDIYF 193 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS-------------- 305 D A+ +++ ++ D+P +Y+A NAPH P P+ +K++ Sbjct: 194 DHALKFLEQNESQDKPTFMYIATNAPHGPFH-DVPEDLRKKYQAMDLTDAYGFDMNPKRK 252 Query: 306 -----QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQ 360 ++ + ++DQ + ++ + LKK DNT++LF +DNG G Sbjct: 253 NEKQFDKTSRVFSMIENIDQNIGKLFQHLKKIDALDNTLVLFLNDNGP---NGPRYVGEH 309 Query: 361 KGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGV 419 +G K GG + + W +L+ G + + + D +PT L A + P LKLDG+ Sbjct: 310 RGAKGSVNEGGIRSVLIAHWPAQLKAGTVNPTIAAHYDLFPTILAATGVEKPAGLKLDGI 369 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 ++LP L++K P ++L E Sbjct: 370 NVLPLLKNKADQWPERSLFLQWH------------------------------RGDEPQP 399 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + + V DY + ++ + L+ L D ++ +LAAA ++ M + Sbjct: 400 RTNAAVVTQDYKMTFSKPDEPGKLFHLQNDPAERQDLAAAKTKLASHMTEQYNNWFQDVS 459 Query: 539 PPLSEVNQEKFNNIKKALSE 558 + +I E Sbjct: 460 STRPDNYAPPRIHIGNPKEE 479 >UniRef50_A6DGL0 Arylsulfatase A n=3 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DGL0_9BACT Length = 506 Score = 421 bits (1082), Expect = e-116, Method: Composition-based stats. Identities = 127/540 (23%), Positives = 211/540 (39%), Gaps = 107/540 (19%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGK-------PNIIVLTMDDLGYGQLPFDKGSFDPKTM 85 +K++ + ++A S + ++ + K PNII++ DD G G L Sbjct: 1 MKIRKSFISIALSLLSLNNFAAETKKILKGAKPNIIMVLTDDQGMGDLSCMG-------- 52 Query: 86 ENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAP 145 TP + + + RFT+ V P+RAAIM+GR+P Sbjct: 53 ------------------NPILRTPHIDKMYAKSTRFTDFQV-SSTCTPTRAAIMSGRSP 93 Query: 146 ARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 G+ +D + P+ Q GY T GKWHL Sbjct: 94 FEVGISHTLMQRDRLAPAVITFPQALQKSGYKTGLFGKWHLGD----------------- 136 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS-------------LFKNRERVPAKG 252 EE++PQNRGFD + A G YN L N V KG Sbjct: 137 ------GEEYRPQNRGFDEVLMHGAGGIGQYNFGDFKPNATNKYFDNVLLHNDTIVQTKG 190 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQF--NTGSQTADN 310 + +D A+ + + +Q + Y++ NAPH P AP++Y+K+F +Q+ Sbjct: 191 FCTDVFFKAALSWIKKQHENNQTYFAYISLNAPHGPLI--APEKYKKRFIDEGYNQSVAA 248 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG---------AVIDGPLPLNGAQK 361 Y + ++D ++E+LK+ DNT+I+F +DNG V N K Sbjct: 249 RYGMIENIDDNFGLMVEKLKEWKALDNTLIIFMTDNGMAMKSIGKKGVKGKFNAWNAGMK 308 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPK-DLKLDGV 419 G+K + GG+ P F +WKG L G L + +D Y T + A +IP+ L G Sbjct: 309 GHKDSAWEGGSRVPSFWYWKGVLGEGVDISALSAHIDLYRTFCELAGTNIPESSLSPSGR 368 Query: 420 SLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLS 479 SL+P L++ + L + + Sbjct: 369 SLIPLLENPNAKWDDRTLFFHRGRWGGGGRGKKTRELAKY-------------------- 408 Query: 480 QFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + VRN+ + LV ++ + L + D + N+ +P+V ++M+ ++ DS++ Sbjct: 409 -YGMAVRNSRWRLVNIMDGDGPWLSDIANDPGETKNVIEQHPEVAEKMKAQFDQWWDSTE 467 >UniRef50_UPI0000586CBA PREDICTED: similar to arylsulfatase B n=3 Tax=Deuterostomia RepID=UPI0000586CBA Length = 596 Score = 420 bits (1081), Expect = e-116, Method: Composition-based stats. Identities = 114/583 (19%), Positives = 196/583 (33%), Gaps = 117/583 (20%) Query: 20 SGMAAFAAHAADDVKLKATKTNVAFSDFTPT--------EYSTKGKPNIIVLTMDDLGYG 71 M F A +K+K T T + +T P+I+ + DD G+ Sbjct: 53 PSMEQFKAKELKMLKVKRCSCAFLVEALTVTVLICTGLIKGATGKPPHIVFIVADDYGWF 112 Query: 72 QLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGV 131 + + + TP L L GV+ N YV + Sbjct: 113 DVGYHNST---------------------------IKTPNLDLLASRGVKLENYYV-QPI 144 Query: 132 SGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSK 188 PSR+ +MTGR G+ + +PL ET LP+ + GY T VGKWHL Sbjct: 145 CSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLG- 203 Query: 189 ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---------- 238 F E P RGFD G+ + Y+ Sbjct: 204 ---------------------FYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPGFP 242 Query: 239 -------PSLFKNRERVPA--KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPN 289 + N Y T+ A V+ + +QP LYL + H P Sbjct: 243 EGNHWLGIDFWDNNRVAWEYTGNYSQFVFTERAQRVI-QQHNPNQPLFLYLPLQSVHGPL 301 Query: 290 DNPAPDQYQKQFNTGSQ-TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA 348 P++Y K + Y V ++D+ V ++++ L++ G +++T+++FT+DNG Sbjct: 302 Q--VPEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGG 359 Query: 349 VIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG----NYDKLISAMDFYPTALD 404 N +G K+ + GG H F+ + G + D++PT ++ Sbjct: 360 TPGKS-GNNWPLRGTKNTLWEGGVHGVGFITGP-MIPAGVQGTVSKHFMHISDWFPTLIE 417 Query: 405 -AADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 A + L LD ++ + + L I Y + + Sbjct: 418 GVAGGNT-AGLALDSYNMWNSITKGTPSPRKELLHNIDPYIRADHPFGYGYDEETDMIYP 476 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVE----------------------NNQL 501 + +R ++ L+ N Sbjct: 477 LSGLYPKMAAEFSTDMR--AAIRVGEWKLLTGFPGRSGWYPPPEWNIHPIDPVEAANKVT 534 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 L+ +T D +K++L+ +P+VV E+ G + + +S P Sbjct: 535 WLFNITADPCEKNDLSYQHPEVVTELVGRLEAYYKTSVPVRFP 577 >UniRef50_A6KWS8 Arylsulfatase n=6 Tax=Bacteroides RepID=A6KWS8_BACV8 Length = 464 Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats. Identities = 122/538 (22%), Positives = 192/538 (35%), Gaps = 123/538 (22%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGK-PNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 + + + S T + +T K PN+I + DDLG G L Sbjct: 1 MKNTRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYG----------- 49 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 Q TP + + G++F Y VS PSR A++TG+ Sbjct: 50 ---------------QRQIKTPNIDGIAQNGMKFMQHYSGSTVSAPSRCALITGKHMGHA 94 Query: 149 GVYSNTDA--------QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 + N + +P E + ++F+ Y T VGKW + Sbjct: 95 AIRGNAKVAGSDGLLYETPLPAGEVTVADIFKTKNYVTGCVGKWGMGGPGT--------- 145 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRER---VPAKGYISDQ 257 E P GFDYF G+ A+ P E+ + K Y D Sbjct: 146 -------------EGMPGKHGFDYFYGYLGQRFAHSYYPEFLHENEQKIMLDGKYYSHDL 192 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND--NPAPDQYQKQ------------FNT 303 + ++A+ +D +PF LY + PH D A +Y+ + + + Sbjct: 193 MLEKALNFIDENAQ--KPFFLYFSPTIPHADLDIMGEAMTEYEGEFCETPFGGSRDGYKS 250 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-----PLNG 358 Y A V +D+ V I+++LK+ G YD+TII+FTSDNG +G NG Sbjct: 251 QQNPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTSDNGVHSEGGHDPSYFDSNG 310 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLD 417 +G K Y GG TP + W G + G IS DF PT + IP++ +D Sbjct: 311 PFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDFLPTIGELVQADIPQN--ID 368 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G+S LP L K + H + + P Sbjct: 369 GISYLPTLTGKGTQKEHDCIYYEFFEFGGKQSIMTP------------------------ 404 Query: 478 LSQFSYTVRNNDYSLVYTVENNQ----LGLYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 + + LV ++ LY + TD + N+ P V K+++ ++ Sbjct: 405 ----------DGWKLVRLEVSDPSKTYEELYNIYTDPAETSNVIKQYPDVAKKLKNMI 452 >UniRef50_D2R323 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R323_9PLAN Length = 631 Score = 419 bits (1079), Expect = e-116, Method: Composition-based stats. Identities = 125/556 (22%), Positives = 198/556 (35%), Gaps = 115/556 (20%) Query: 12 TSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKG--KPNIIVLTMDDLG 69 T+ S +L S HA L T S + T + + +PNI+V DD G Sbjct: 6 TTSSAMLTSHYQ--LTHALFPTWLMFTAIVAFLSSASSTFAAEREVTQPNIVVFLADDAG 63 Query: 70 YGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAH 129 +G F STP + S+ G +V Sbjct: 64 WGDYSFSG--------------------------NTNLSTPHIDSIARGGASIDRFFV-C 96 Query: 130 GVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 V P+RA +TGR R GV + Q+ + L+E L + + GY T A GKWH Sbjct: 97 SVCSPTRAEFLTGRYHQRGGVRGVSTGQERLDLSERTLADSLRAAGYATGAFGKWHNGSQ 156 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP 249 + P RGFD + G+ + Y +P L N + Sbjct: 157 W-----------------------PYHPNARGFDEYFGYTSGHWGEYFNPPLEHNGKLNN 193 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTA- 308 +GYI D TD AI ++ +K ++PF Y+ + PH P P+ D + Q + A Sbjct: 194 YEGYIVDICTDRAITFIEASK--NKPFFCYVPFTTPHSPWSVPSADWKRFQDKPLEKRAT 251 Query: 309 ----------DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG 358 A V + D+ V R+L +L + +NTI+++ SDNG G Sbjct: 252 NLKQEQLDQTRCALAMVENQDRNVGRVLSKLDELKLRENTIVVYFSDNGP---NSARWTG 308 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLD 417 KG K T GG + ++ W ++ + I+ A+D PT L A + +L LD Sbjct: 309 GMKGKKGTTDEGGVRSVCYIQWPKRIAAAQTIQPIAGAIDLLPTLLSLAGVKHVGELPLD 368 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G L P L ++ P + L Sbjct: 369 GRDLAPLLTGQQPEWPERLLF--------------------------------------T 390 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 + R+ + L + Q L+ + +D Q + P++ EM+ V ++ Sbjct: 391 TWAGKVSARSQTHRL-----DEQGLLFDMQSDPGQTTPVNDREPKLTAEMKSAVAKWKAE 445 Query: 537 SQPPLSEVNQEKFNNI 552 + L+ Sbjct: 446 MERELALDATPPLEAT 461 >UniRef50_B5JMW2 Sulfatase domain protein n=1 Tax=Verrucomicrobiae bacterium DG1235 RepID=B5JMW2_9BACT Length = 594 Score = 419 bits (1079), Expect = e-115, Method: Composition-based stats. Identities = 124/537 (23%), Positives = 200/537 (37%), Gaps = 111/537 (20%) Query: 33 VKLKATKTNVAFSDFTPTEYSTK-GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 + + +S TP ++ PN++V+ DD G+G L Sbjct: 4 LLIILFSFTTHYSLLTPLSAASGGNPPNVLVILADDQGWGDLSLHGS------------- 50 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 +TPTL L +G +F N YV V P+RA +TGR R GVY Sbjct: 51 -------------QNLNTPTLDRLAQQGAQFENFYV-QPVCSPTRAEFLTGRYYPRGGVY 96 Query: 152 SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 + + E + ++F+ GY TAA GKWH + Sbjct: 97 DTGAGGERLDADEETIAQVFRTAGYATAAFGKWHNGTQA--------------------- 135 Query: 212 AEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 + P RGFD + GF + Y L N V + GY+ D LT + +++ Sbjct: 136 --PYHPNTRGFDEYYGFTSGHWGSYFDALLDHNGSLVQSAGYLPDTLTTATLDFIEQQTA 193 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQ-----------KQFNTGSQTADNYYASVYSVDQ 320 PF YLA PH P D + + A V ++D Sbjct: 194 DQTPFFAYLALPTPHSPMQTTDEDWARFANKKLTSLATNPADENPDHTRAALAMVENIDA 253 Query: 321 GVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWW 380 V R+L+++++ +NTI+++ +DNG N +G K T GGT +P+F+ + Sbjct: 254 NVGRLLDRIQELDIEENTIVVYFTDNGP---NGWRYNANMRGRKGSTDEGGTRSPLFIRY 310 Query: 381 KGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 K+QPG + + S++D PT A+I+ LDG+SL P LQ+ P + + Sbjct: 311 PQKIQPGATLNTIASSIDLLPTLGQLANITWQPAQTLDGISLAPQLQNPNLRLPDRTIF- 369 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 + R DY L ++ Sbjct: 370 -------------------------------------SYWSGRISARTQDYRL-----DH 387 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 Q LY + TD Q +L+ +P++ +Q + + P + N ++ I Sbjct: 388 QGQLYHIPTDRGQTTDLSTKHPELTASLQSQIDAWRSELLTPDA-ANPDRPLTIGYP 443 >UniRef50_Q7UM38 N-acetylgalactosamine 6-sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UM38_RHOBA Length = 667 Score = 419 bits (1078), Expect = e-115, Method: Composition-based stats. Identities = 120/554 (21%), Positives = 196/554 (35%), Gaps = 122/554 (22%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 L + + +PN++V+ DD G+G L Sbjct: 66 WSCLLTSWIGFGTSCRADSDSRPSGSRPNVLVVLTDDQGWGDLSLHG------------- 112 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 TP + SL +GV+ N YV V P+RA +TGR R GV Sbjct: 113 -------------NPNLQTPHIDSLARDGVQIKNFYV-CAVCSPTRAEFLTGRYHTRSGV 158 Query: 151 YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 +S + + L+E + + FQ GY TAA GKWH + Sbjct: 159 FSTSAGGERFDLSERTIGDAFQAAGYRTAAFGKWHSGMQA-------------------- 198 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAK 270 + P RGFD F GF + Y SP L N E V G+I D LT AI ++R Sbjct: 199 ---PYHPNARGFDEFYGFCSGHWGNYFSPMLELNGEIVKGDGFIVDDLTQHAIDFMER-- 253 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQ--------YQKQFNTGSQTA-----DNYYASVYS 317 + PF +YL N PH P P D ++ A + Sbjct: 254 DRENPFFIYLPLNTPHSPMQVPDEDWQNFEGKEIVPDPRPENAKKEDVQHTRAALALCEN 313 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 +D V ++L+ L++ +NTI++F DNG NG +G K + GG +P Sbjct: 314 IDDNVGQLLDALERLSLSENTIVVFFCDNGP---NGSRFNGGLRGRKGAVHEGGLRSPCL 370 Query: 378 MWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDL-KLDGVSLLPWLQDKKQGEPHK 435 + + K+ G + A+D +PT D D+ + LDG+SL+ L++ K + Sbjct: 371 IRYPSKIPAGQTVGGIAGAIDLFPTLADLCDVEVGATAGPLDGISLIDGLREPKSKPSER 430 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 + ++VR+N Y Sbjct: 431 LIF--------------------------------------TAWSGKFSVRSNRYRYHAN 452 Query: 496 VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP--------PLSEVNQ 546 + L+ + D + ++A P ++ + +++ ++P + V Sbjct: 453 GD-----LFDIVADPGETGSVAEDQPVATARLKKALEDWVKETKPRDRSHSEEQVFPVGH 507 Query: 547 EKFNNIKKALSEAK 560 + +A+ Sbjct: 508 PDHPWTQLPARDAQ 521 >UniRef50_D2QTW5 Sulfatase n=2 Tax=Sphingobacteriales RepID=D2QTW5_9SPHI Length = 523 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 116/572 (20%), Positives = 183/572 (31%), Gaps = 132/572 (23%) Query: 21 GMAAFAAHAADDVKLKATKTNVAFSDFTP-------TEYSTKGKPNIIVLTMDDLGYGQL 73 A V S P T PNII + DDLGY +L Sbjct: 3 AFLAMRQPLLWVSAFLLLMGWVLVSFKPPRTTVSRDAVPRTAVSPNIIYIYADDLGYAEL 62 Query: 74 PFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSG 133 + TP L L EG+RFT Y + V Sbjct: 63 GCYG--------------------------QQKIRTPNLDKLAREGIRFTQHYTSMPVCA 96 Query: 134 PSRAAIMTGRAPARFGVYSNT----------DAQDGIPLTETFLPELFQNHGYYTAAVGK 183 P+R ++TG+ + N Q + + L Q GY TA VGK Sbjct: 97 PARCMLLTGKHSGHSYIRGNYEMGGFPDSLEGGQMPLYPGAFTIGRLLQQQGYKTACVGK 156 Query: 184 WHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK 243 W + + P ++ ++ A + P + + N+P + Sbjct: 157 WGMGMANTTGNPNEQGFDYFYGYLDQKQAHNYYPTHL-------WENGKPDKLNNPVIDV 209 Query: 244 NRERVPA------------KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN 291 +R P Y D+L +A + + K+ PF LYL + APH+ Sbjct: 210 HRRLTPETATPEAFAYFRGNDYAIDKLAQKAQAFIRQNKSG--PFFLYLPFTAPHVSLQA 267 Query: 292 P--APDQYQKQFNTGSQ-----------------TADNYYASVYSVDQGVKRILEQLKKN 332 P A +Y +F G Q Y A + +D + ++++ LK Sbjct: 268 PEAAVKEYIGKFGDGEQRTERPYLGEQGYASTPYPRATYAAMITHMDAQIGQLMQLLKDL 327 Query: 333 GQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 +NT+++F+SDNGA +G G +G K Y GG PM W G+++P Sbjct: 328 KIDENTLVMFSSDNGATFNGGVEAAYFNSVGKLRGLKMDVYEGGIREPMLARWPGRIKPN 387 Query: 388 NYDKLISA-MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP-HKNLTWITSYSH 445 +S D T + P DG+S LP L + + H L W Sbjct: 388 QTTDHVSVQYDLLATLAELVGYKRP--FATDGISFLPTLLGQSSSQKQHPFLYWEYPEKG 445 Query: 446 WFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL----VYTVENNQL 501 +R ++ V Sbjct: 446 -----------------------------------GQLAIRMGNWKAVKTNVRKDRTTPW 470 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQGVVRE 532 LY L D+ + N+A +P ++++ +V Sbjct: 471 ELYDLNKDVSETTNIADKHPDIIRQANAIVAR 502 >UniRef50_B1KD88 Sulfatase n=1 Tax=Shewanella woodyi ATCC 51908 RepID=B1KD88_SHEWM Length = 500 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 123/560 (21%), Positives = 190/560 (33%), Gaps = 115/560 (20%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 V + E +PN+I DDLG G L Sbjct: 5 CTTLSVAVLCSIMVTSCSQSNIEPKVNRQPNVIYFLADDLGVGDLGSYG----------- 53 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 TP + L EG+RF+ Y V PSRA++MTGR Sbjct: 54 ---------------QQHIRTPNIDKLAAEGMRFSRHYAGSSVCAPSRASLMTGRDMGHT 98 Query: 149 GVYSNT-----------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPED 197 + N Q + L LFQ GY T A GKW L + + P+ Sbjct: 99 DIRGNIQLMDQPDSPEYQGQYPLAQGTITLAHLFQLAGYQTGAFGKWGLGSLQSSGNPKA 158 Query: 198 KQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYN---SPSLFKNRERVPA---K 251 ++ A + PQ + G A P L +++ K Sbjct: 159 MGFDQFYGYLDQRHAHNYFPQYL----WDGDEVARLDNPAINVHPKLDRDKSDHREYMGK 214 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT-------- 303 Y ++ A + + + D+ F LY+ + PH P + QF+ Sbjct: 215 DYAPYKILARAKEFISQNR--DEAFFLYVPFVVPHAAIQIPDKELDGYQFDETAHRLGEP 272 Query: 304 -----GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP----- 353 + A + +D+ V I+ LK+ G DNT++LF+SDNGA G Sbjct: 273 RAYTPHPKPRAARAAMISRMDRDVGDIMAMLKELGLDDNTLVLFSSDNGATAAGGSDINF 332 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPK 412 +G K+ Y GG P+ W G + G+ D L + D PT D+S+P+ Sbjct: 333 FNSTAGARGEKATLYEGGIRAPLIARWPGNISAGSESDHLSAFWDMLPTFAQLLDLSVPE 392 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHN 472 + G+S+LP L K Q + H++L W Sbjct: 393 G--IQGISMLPTLLGKPQNQQHESLYWEF------------------------------- 419 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTV---------ENNQLGLYKL-TDLQQKDNLAAANPQV 522 S V ++ + E LY L D + NLAA +P++ Sbjct: 420 ----FSRNPSQAVVMGNWKAIRHYSKERGKGALELGATALYNLQEDPSESQNLAAKHPEL 475 Query: 523 VKEMQGVVREFIDSSQPPLS 542 VK+ + ++ + S P + Sbjct: 476 VKKAEMIMAQRQRSPHLPWN 495 >UniRef50_Q5FYB0 Arylsulfatase J n=81 Tax=Eumetazoa RepID=ARSJ_HUMAN Length = 599 Score = 419 bits (1077), Expect = e-115, Method: Composition-based stats. Identities = 113/568 (19%), Positives = 196/568 (34%), Gaps = 89/568 (15%) Query: 16 LILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPF 75 IL + + + + + ++ +P++I + DD G+ + + Sbjct: 34 WILCLLTYGYLSWGQALEEEEEGALLAQAGEKLEPSTTSTSQPHLIFILADDQGFRDVGY 93 Query: 76 DKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPS 135 + TPTL L EGV+ N YV + PS Sbjct: 94 HGS---------------------------EIKTPTLDKLAAEGVKLENYYV-QPICTPS 125 Query: 136 RAAIMTGRAPARFGVYSN---TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNV 192 R+ +TG+ G+ + + +PL LP+ + GY T VGKWHL Sbjct: 126 RSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLG----- 180 Query: 193 PVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS----------PSLF 242 F +E P RGFD F G YY L+ Sbjct: 181 -----------------FYRKECMPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLY 223 Query: 243 KNRERVPA---KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQK 299 +N Y + T + + + +P LY+AY A H P AP +Y + Sbjct: 224 ENDNAAWDYDNGIYSTQMYTQR-VQQILASHNPTKPIFLYIAYQAVHSPLQ--APGRYFE 280 Query: 300 QFNTGSQ-TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNG 358 + + Y A + +D+ + + LK G Y+N+II+++SDNG N Sbjct: 281 HYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-GSNW 339 Query: 359 AQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLD 417 +G K + GG F+ G +L+ D+YPT + A+ I +D++LD Sbjct: 340 PLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLD 399 Query: 418 GVSLLPWLQDKKQGEPHKNLT-------------WITSYSHWFDEENIPFWDNYHKFVRH 464 G + + + + L W Y W + K + Sbjct: 400 GYDIWETISEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTG 459 Query: 465 QSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVV 523 P + N +L + L+ +T D ++ +L+ P +V Sbjct: 460 NPGYSDWVPPQSFSNLGPNRWHNERITL---STGKSVWLFNITADPYERVDLSNRYPGIV 516 Query: 524 KEMQGVVREFIDSSQPPLSEVNQEKFNN 551 K++ + +F ++ P + N Sbjct: 517 KKLLRRLSQFNKTAVPVRYPPKDPRSNP 544 >UniRef50_A6DJ15 Putative arylsulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ15_9BACT Length = 469 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 122/550 (22%), Positives = 204/550 (37%), Gaps = 135/550 (24%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 + KPNII L +DDLGYG L Sbjct: 5 LLAFLMVAGSAIANEKPNIIYLLVDDLGYGDLSLYG------------------------ 40 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT------- 154 + STP + + EG+ FT+ Y V PSRAA+MTG+ V N Sbjct: 41 --QKKFSTPNIDRIGKEGMVFTDHYSGSTVCAPSRAALMTGKHSGHGLVRGNYEVGPHGF 98 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 + + + L E+ ++ GY T +GKW + Sbjct: 99 GGELPLRPEDVSLAEVMKSAGYATGLIGKWGMGMDGTTGE-------------------- 138 Query: 215 WQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERVPAKG--------YISDQLTDEAIGV 265 P+ +GFDY GF A++ P +++N E++ YISD ++ I Sbjct: 139 --PRKKGFDYSYGFLNQAHAHHYYPEYIYENGEKLMIPENKDDARGLYISDTFAEKGIEF 196 Query: 266 VDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQ----------------------F 301 V+ K D+PF L+ A+ PH P ++++ + + Sbjct: 197 VEENK--DKPFFLFWAFVTPHAELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFGVY 254 Query: 302 NTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPL 356 + + + +D+ V + ++L++ G DNTII+F+SDNG +G Sbjct: 255 ASQDHPRAAFSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNGPHKEGGADPDFFDS 314 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLK 415 N GYK GG P + W ++ + S D PT + A+ P+D Sbjct: 315 NAELTGYKRDLTEGGIRVPFMVRWPNVVKARSKSSHASAFWDVMPTIAEIANTDSPED-- 372 Query: 416 LDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNT 475 +DG+S LP L+ +KQ HK+L W + ++ Sbjct: 373 IDGLSFLPALKGEKQQ-VHKHLYWEFHERGYTEQ-------------------------- 405 Query: 476 EDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFI 534 +R ++ + N+ + LY L +D ++++++A P K + ++ Sbjct: 406 --------ALRMGNWKAIRHGVNSPIKLYDLISDESEQNDVSAKYPATAKHITNILDTER 457 Query: 535 -DSSQPPLSE 543 DS PL E Sbjct: 458 TDSELWPLKE 467 >UniRef50_Q2GB51 Sulfatase n=6 Tax=Proteobacteria RepID=Q2GB51_NOVAD Length = 491 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 117/531 (22%), Positives = 178/531 (33%), Gaps = 114/531 (21%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 L + +PNI+ + DDLGY L Sbjct: 33 LGGAAATMVLGAAPAIASKRARRPNILYIMADDLGYADLSCYG----------------- 75 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR--FGVYS 152 TP L L +G+RFTN Y V +R ++TGR R G+ Sbjct: 76 ---------RRDFETPVLDKLAAQGLRFTNAYANSAVCTATRVGLITGRYQYRLPVGLEE 126 Query: 153 NTD--AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 G+P + LP L GY T+ +GKWHL + Sbjct: 127 PLAFRPNIGLPPSHPTLPSLLAKAGYRTSLIGKWHLGSL--------------------- 165 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYISDQLTDEAIG 264 ++ P G+ F G + G YY P L+ V GY++D L D A+ Sbjct: 166 --PDFDPLKSGYQTFWGIRSGGVDYYTHATSNGQPDLWDGPTPVERAGYLTDLLADRAVS 223 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT----------ADNYYAS 314 + A + + P+ + L + APH P + P + A Y A Sbjct: 224 EIREASSGEAPWFMSLHFTAPHWPWEGPDDASESARIAKLKDPSALFHFDGGSAAIYAAM 283 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHT 374 V +D + R+LE LK N +TI++FTSDNG G K++ GG Sbjct: 284 VRRLDYQIGRVLEALKANRAEQDTIVVFTSDNGGER---FSDTWPFSGRKTELLEGGLRI 340 Query: 375 PMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 P + W G + G D I +MD+ PT L AA + DGV + P L Sbjct: 341 PAIVRWPGVTRAGTTSDAQIISMDWLPTFLAAAGSAPDPGHPSDGVDVTPALGGG--SLA 398 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 + L W ++ VR + + Sbjct: 399 ERALFWRY------------------------------------KNRAQRAVRRGNLKYL 422 Query: 494 YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 EN L+ + D ++ NL P+ ++ ++ + P + Sbjct: 423 RIAENE--FLFDVAADPLERANLKDRQPEDFAALKAAWEKWNATMLPLDPQ 471 >UniRef50_Q7UMZ5 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UMZ5_RHOBA Length = 484 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 113/570 (19%), Positives = 192/570 (33%), Gaps = 128/570 (22%) Query: 20 SGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGS 79 + T + +PNI+++ DDLGYG L Sbjct: 1 MSLLHLGPRVFPLTVWMLVALCSHACVPTLLRADSNDRPNIVLILADDLGYGDLGCYGND 60 Query: 80 FDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAI 139 TP L L +GVR+T Y P+RAA+ Sbjct: 61 EQA--------------------------TPVLDRLATQGVRWTQAYANGPECSPTRAAL 94 Query: 140 MTGRAPARFG-----------------VYSNTDAQDGIPLTETFLPELFQNHGYYTAAVG 182 +TGR G + + + G+P L + + GY TA G Sbjct: 95 LTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPANRPTLAKRLSSVGYETALFG 154 Query: 183 KWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS---- 238 KWHL + + P GFD + YY+ Sbjct: 155 KWHLGYEAK-----------------------FSPMMHGFDEALYCIGGAMDYYHYLDSV 191 Query: 239 --PSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ 296 +LF N + +GY +D +TD+A+ + D+PF LYL Y APH P P Sbjct: 192 ATYNLFHNGRPISGEGYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHTPYQAPGESP 251 Query: 297 YQ------KQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 + + Y A V +D+G+ ++L ++++ D T+++F SDNG Sbjct: 252 VDPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLVIFASDNGGTS 311 Query: 351 DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADIS 409 N +G+K Q + GG P+ W G L G D++ D + L AA I+ Sbjct: 312 ASR---NEPLRGFKGQAFEGGIRVPLIARWPGHLPEGVVSDQVTITFDLTASMLAAAGIT 368 Query: 410 IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDY 469 ++ ++G+ +L + + +P + L W Sbjct: 369 PTQEDAMEGIDVLSLAANDEPVQP-RTLYWRKPRDPQV---------------------- 405 Query: 470 PHNPNTEDLSQFSYTVRNNDYSLVYTVENN-------QLGLYKLT-DLQQKDNLAAANPQ 521 +R+ ++ V + Q L+ L D+ ++ +LA+ + Sbjct: 406 ------------WSGMRDGNWKYVRQEKATVDGRTSIQEWLFNLADDISEQTDLASQSTD 453 Query: 522 VVKEMQGVVREFIDSS---QPPLSEVNQEK 548 + ++G + S + K Sbjct: 454 ELDRLRGRYLAWEQSVRNNRRGRPGWAPSK 483 >UniRef50_A6DJ37 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ37_9BACT Length = 469 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 108/546 (19%), Positives = 192/546 (35%), Gaps = 123/546 (22%) Query: 32 DVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVV 91 +++ + + + KPN++ + +DDLG+ L G F Sbjct: 2 NIQTTMLFIGLGIASLANIALAQSNKPNVLFVFIDDLGWKDLGCYGGKF----------- 50 Query: 92 DTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY 151 TP SL EG++FT Y V P+RA++++G+ AR GV+ Sbjct: 51 ---------------IETPAADSLAAEGMKFTQAYA-SPVCSPTRASLISGQNAARHGVW 94 Query: 152 SNTDAQDG-------------IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 D I ++ GY +GKWH + Sbjct: 95 EVIGVNDRPYAKMSSPLRKLEIDENIQTYADILNKEGYTCGLIGKWHAGR---------- 144 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQL 258 PQ GF ++N E + + Sbjct: 145 -----------------TPQAHGF---CKIDKKIHDPVLKKYAYENDEHKVGE------I 178 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFN--TGSQTADNYYAS 314 T +I + + K D PF L ++++A H P ++Y+K+ + NY A Sbjct: 179 TANSIEFLRKNK--DNPFFLCVSHHAAHAPLIARDDLINKYRKKLRKTGITDVHPNYAAL 236 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAV--------IDGPLPLNGAQKGYKSQ 366 V D+ + +L++LK DNT+++F SDNG + + + K Sbjct: 237 VEMADESLGMLLDELKALKLEDNTMVVFYSDNGGMIKDMYLKQPEALATTMAPLRWQKGS 296 Query: 367 TYPGGTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 Y GG P + W GK++PG +++++ D + T +D +IP++ DG+SL+P L Sbjct: 297 LYEGGIRVPFIVKWPGKVKPGTSSEQMLNSFDLFSTFVDVCGGTIPQEQVTDGLSLVPVL 356 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + + + L W S W + + Sbjct: 357 RGETELLERDTLYWHFPTSMWTRS-------------------------------PAGAI 385 Query: 486 RNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEV 544 R DY L+ E+ ++ L+ L D+ + NL + + E+ + + S + Sbjct: 386 RKGDYKLIEHFEDGRIELFNLKDDIGETVNLLYSESEKASELLSALTAWRRSLDAQMPTP 445 Query: 545 NQEKFN 550 N Sbjct: 446 NPNYDP 451 >UniRef50_A6C383 Sulfatase (Fragment) n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C383_9PLAN Length = 405 Score = 418 bits (1076), Expect = e-115, Method: Composition-based stats. Identities = 119/494 (24%), Positives = 181/494 (36%), Gaps = 121/494 (24%) Query: 52 YSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPT 111 + KPN+I++ DD G L A TP Sbjct: 3 AISSEKPNVIIIFTDDQGSVDLNCYG--------------------------AKDLITPH 36 Query: 112 LLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT---DAQDGIPLTETFLP 168 + S+ G+RFT Y + V PSRA ++TGR PAR GV N + G+P + + Sbjct: 37 MDSIARRGIRFTQFYASAPVCSPSRAGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIA 96 Query: 169 ELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGF 228 E+ Q GY TA +GKWHL E P +GF+ G Sbjct: 97 EMMQQAGYQTAHIGKWHLG-----------------------YTPETMPHGQGFETSFGH 133 Query: 229 HAAGTAYYNS---------PSLFKNRERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFML 278 Y+ L++N + V G + D + ++ + K D+PF L Sbjct: 134 MGGCIDNYSHFFYWNGPNRHDLWENGKEVWRDGAFFPDLMVEQCQDYIR--KAGDKPFFL 191 Query: 279 YLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNT 338 Y A N PH P ++++K + S D Y A V ++D + +L L + T Sbjct: 192 YWAINVPHYPLQGK--EKWRKTYAHLSSPRDKYAAFVSTMDDCIGEVLATLDACQLREKT 249 Query: 339 IILFTSDNGAVID----GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLI 393 II+F SD+G + G G +G K + GG P + W G + G D+L Sbjct: 250 IIIFQSDHGHSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAEGEVRDQLA 309 Query: 394 SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIP 453 + D+ PT +P LDG +L ++ PH+N W Sbjct: 310 TGCDWLPTISALTGAPLPAH-HLDGKNLKAVIESSTAKSPHENFYWQIG----------- 357 Query: 454 FWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN----------NQLGL 503 S+ +R D+ L+ + NQ+ L Sbjct: 358 ---------------------------KSWAIREGDWKLLGNPRDTSQQTPLGKENQIFL 390 Query: 504 YKLT-DLQQKDNLA 516 L+ D+ +K NLA Sbjct: 391 VDLSKDIGEKKNLA 404 >UniRef50_A6DI94 Arylsulfatase A n=2 Tax=Bacteria RepID=A6DI94_9BACT Length = 472 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 114/550 (20%), Positives = 203/550 (36%), Gaps = 122/550 (22%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 K + P YS KPN I++ DD GYG L Sbjct: 3 KLLITLLSLIPLVYSNDIKPNFIIIFTDDQGYGDLSCFN--------------------- 41 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NTDA 156 TP + + EG++F N YV+ V SRAA++TG R G+ S Sbjct: 42 -----PQGVQTPHIDQMATEGMKFNNFYVSAAVCSASRAALLTGTYNDRIGIKSAFFPGT 96 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + G+ E + EL + Y TA GKWHL ++ +P + Y +S + + Sbjct: 97 KQGLHPDEITIAELLKEQNYATACFGKWHLGDEPSL-LPSAQGFDTYFG--IPYSNDMFI 153 Query: 217 PQNRGFDYFMGFHA------------------------AGTAYYNSPSLFKNRERVP--- 249 ++ F F+ + Y + + + V Sbjct: 154 APHQTFAENAKFNGDWTLEKAKELQKFIAPHVNKRGPIWKSEYKALVPILEGEQIVEFPA 213 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTAD 309 + ++ + D I +D+ + ++PF ++L PH+P + + G Sbjct: 214 DQASLTQRYFDRTIKFIDKNQ--NKPFFIFLTPAMPHVPL-------FASKEFRGKSKKG 264 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQT 367 Y + +D R+++ LK+ NT+++FTSDNG + +G + K + Sbjct: 265 LYGDVIKEIDFHTGRLIKHLKEKELDQNTLVIFTSDNGPWLSYGDEGGSSGPLRDGKFTS 324 Query: 368 YPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y GG P W G ++ + ++L S +D PT + +P+D K+DG + P L+ Sbjct: 325 YEGGVRMPTVFWGPGLIKANSVCNQLASTIDLLPTFAQLVNTQVPQDRKIDGKDISPLLK 384 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + H++L + VR Sbjct: 385 SQNH-VIHRHLFFRDE-----------------------------------------AVR 402 Query: 487 NNDYSLV-----YTVENNQ-LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ- 538 + D+ LV T+ LY L D+ + +NL +P+V + +Q + E + Sbjct: 403 SGDWKLVVKEHHMTMRKGPLPALYNLKNDVAESNNLIDTHPKVAQYLQSKLDEHLKDLNE 462 Query: 539 --PPLSEVNQ 546 P++++N+ Sbjct: 463 NSRPMADLNE 472 >UniRef50_B9XR48 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XR48_9BACT Length = 508 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 107/569 (18%), Positives = 185/569 (32%), Gaps = 106/569 (18%) Query: 21 GMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSF 80 + + + + KPN+I DDLGY + Sbjct: 1 MPKHLLPRRLRFLVALLSVLSPLCINAAEPSPMPLRKPNVIFFIADDLGYADVGCFG--- 57 Query: 81 DPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIM 140 + TP + + EG++FT Y V PSR +M Sbjct: 58 -----------------------QKKIHTPNIDRIATEGMKFTQHYSGSPVCAPSRCVLM 94 Query: 141 TGRAPARFGVYSN----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPE 196 TG+ V N + Q +P + L Q +GY T A GKW L + P Sbjct: 95 TGKHSGHSAVRDNRELKPEGQFPLPANTITVARLLQQNGYITGAFGKWGLGGPESSGKPL 154 Query: 197 DKQTRDYHDNFTTFSAEEWQPQNRGFDYF---MGFHAAGTAYYNSPSLFKNR----ERVP 249 D+ + A P D + G N + Sbjct: 155 DQGFTRFFGYNCQRVAHNLFPTYLWDDNHRLALDNPPIGEDQKLPADADSNDPASYKAFT 214 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQ------- 300 K Y D ++A+ + K D PF L+ PH+ P +Y+ + Sbjct: 215 GKSYAPDLYAEQALRFIRDNK--DHPFFLFFPTIVPHVALQVPEDSLKEYEGKLPETPYT 272 Query: 301 ----FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP- 355 + Y A + +D+ + R+L +K+ D+TI +FTSDNG Sbjct: 273 GGKGYLPNRTPHAAYAAMITRMDRDLGRMLALIKELNLDDDTIFVFTSDNGPAPQDMGGT 332 Query: 356 ------LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADI 408 +G + K+ Y GG P+ + W GK+QP D++ D+ PT L+ + Sbjct: 333 DTKFFNSSGPFRSGKTSIYEGGMRIPLIVRWHGKIQPNSTSDRVTGFEDWLPTLLELSGN 392 Query: 409 SIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD 468 +DG+S L +K P + + ++ Sbjct: 393 KKSVPTGIDGLSFASTLLGEK--LPERPFLYREFPAY----------------------- 427 Query: 469 YPHNPNTEDLSQFSYTVRNNDYSLVYTV--------ENNQLGLYKL-TDLQQKDNLAAAN 519 +R ++ V N + LY L TD+ + +++ + Sbjct: 428 -----------GGQQAIRVGNWKAVRQHLKPKGNAKPNLHIELYDLQTDIAESHDVSDEH 476 Query: 520 PQVVKEMQGVV-REFIDSSQPPLSEVNQE 547 P +V ++ ++ + I S P +++ Sbjct: 477 PDIVTKLDNLMREQHIPSKAFPFPALDKP 505 >UniRef50_C1ZIS7 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZIS7_PLALI Length = 631 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 116/552 (21%), Positives = 201/552 (36%), Gaps = 88/552 (15%) Query: 28 HAADDVKLKATKTNVAFSDFT---PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKT 84 H V + V F+ + P + +PNI+V+ DDLG+ L F Sbjct: 4 HLLLSVLVFLCGIAVNFAQASEANPPSAPRQNRPNIVVILADDLGWADLGCYGNPFH--- 60 Query: 85 MENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRA 144 TP L L +G+R T Y A V P+RAA++TG+ Sbjct: 61 -----------------------KTPHLDQLARDGIRCTQAYAACPVCSPTRAALLTGQN 97 Query: 145 PARFGVYSNTDAQDG--------------IPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 PAR + + +P LP + +++GY T ++GKWHL + Sbjct: 98 PARLHLTDWLPGRGNRNDQALRVPEIRNSLPQGIMTLPGVLKSNGYQTCSIGKWHLGGGA 157 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPA 250 + P+ H + +E R F F A E +P Sbjct: 158 SGPL--------QHGFHEQIAGDERGSPARWFAPFGPQAATNGEKDRQGKPIPGLEDIPD 209 Query: 251 KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTA 308 Y++D L D+A+ +++ T ++PF LYL + A H P + P +++ G Sbjct: 210 GKYLTDALADKAVAFIEKQ-TAEKPFFLYLPHFAVHTPMNAPEETIQKFRDNKPPGVVRN 268 Query: 309 DNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGY 363 + Y A +Y +D V +++ L + G NTI++FTSDNG + + P +N + Sbjct: 269 EIYAAMLYHLDAAVGKVMNSLTEKGFAKNTIVVFTSDNGGLATIEGKNTPATINAPLREG 328 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADIS--IPKDLKLDGVS 420 K Y GG P+ + + + G D ++ +D P+ L A I + + LDG++ Sbjct: 329 KGWLYEGGIRVPLIVSFPKHIPDGSTTDVPMTTLDLLPSLLSLAGIQYQVDANSPLDGMN 388 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 + E K Y H+ Sbjct: 389 ISDIWTGNATPELKKAAFERPLYWHYP-------------------------HYANQGGF 423 Query: 481 FSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 +R + + + + L+ + D + N A P+ + + + + S Sbjct: 424 PGGVIRQGPWKYIENYQTGRKELFLVDKDPGEGRNRAPDEPEKITQFAAQLAAWKQSISA 483 Query: 540 PLSEVNQEKFNN 551 + N + N Sbjct: 484 QETVPNPDYIPN 495 >UniRef50_UPI0001BC7CBC sulfatase n=1 Tax=Bacteroides sp. D2 RepID=UPI0001BC7CBC Length = 496 Score = 418 bits (1075), Expect = e-115, Method: Composition-based stats. Identities = 136/537 (25%), Positives = 198/537 (36%), Gaps = 107/537 (19%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M A + +A + T S KPNI+++ DD GYG + Sbjct: 1 MKISARKTSTLSTALLAILPIAKASAQHTTPSHPDKPNIVIILADDQGYGGVNCY----- 55 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 + TP + L GV+ GY + +S P+RA +MT Sbjct: 56 --------------------PHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMT 95 Query: 142 GRAPARFGVYS-NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQT 200 G+ FG Y +T GIP + L E +GY TA +GKWHL Sbjct: 96 GKYQQSFGFYGLSTPHVGGIPQDQKLLSEYLVENGYNTACIGKWHLGDYIRS-------- 147 Query: 201 RDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS-------------PSLFKNRER 247 P NRGF F GF YY+ N E Sbjct: 148 ---------------HPNNRGFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLDNMEP 192 Query: 248 VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP---DQYQKQFNTG 304 V Y + + T A+ + K D PF LYL YNA H P P + G Sbjct: 193 VTEMEYSTYEYTKRAVDFI--QKNADHPFFLYLPYNAIHSPLQAPEELIGELAINPQEIG 250 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK 364 A +++DQGV +++E L++ G DNTII + SDNGAV +G K Sbjct: 251 KDDIAR--AMTFALDQGVGKVVETLEQLGLRDNTIIFYLSDNGAV---EYSDKWEFRGRK 305 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GG P + + KL G +K + ++D PT ++ A + + GV+LLP Sbjct: 306 GSYYEGGIRVPFIVSYPAKLAKGTIYNKPVMSIDIAPTVMELAGL---SHADMHGVNLLP 362 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 +L K + EPH L W T + + + + Sbjct: 363 YLSGKDRTEPHDVLYWSTE-----------------------------KKSNNQVFKNEF 393 Query: 484 TVRNNDYSLVYTVENNQ-LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 +R + LV + LY + D Q+K L P+ KE+ G+ +I+ Sbjct: 394 AIRQGKWKLVSDPHFEKDYDLYDIEADPQEKHGLKDQYPEKYKELFGMYLNWINQMP 450 >UniRef50_Q482D6 Sulfatase family protein n=2 Tax=Bacteria RepID=Q482D6_COLP3 Length = 492 Score = 418 bits (1074), Expect = e-115, Method: Composition-based stats. Identities = 132/542 (24%), Positives = 200/542 (36%), Gaps = 127/542 (23%) Query: 43 AFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAI 102 S T + KPN+++L +DD G L +F Sbjct: 16 TCSQAVATPDKSTSKPNVVMLLVDDFGRQDLSTYGSNF---------------------- 53 Query: 103 EAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQDGIP 161 TP + L +G++F N Y AH PSR AI +G P R+GV + +P Sbjct: 54 ----YETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGVPQGERVGKHHLP 109 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRG 221 L+ E + GY T +GKWHL E P +G Sbjct: 110 LSAVTFGEHLKEAGYQTGYIGKWHLG------------------------KEGGDPTKQG 145 Query: 222 FDYFM--GFHAAGTAYYNSPSLF----KN----RERVPAKGYISDQLTDEAIGVVDRAKT 271 FD + G A +YY + KN + + Y++D+LTDEA+ +++ Sbjct: 146 FDSSIMAGHWGAPPSYYFPYTKMSKSGKNKGFAKVEGSEEEYLTDRLTDEALTFIEQ--K 203 Query: 272 LDQPFMLYLAYNAPHLPNDNPA-------------------PDQYQKQ------FNTGSQ 306 DQPF+L LA+ A H P + P ++ Q Sbjct: 204 KDQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGIANAGPKSDADLIKDSTGYHKTIQ 263 Query: 307 TADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-------LPLNGA 359 +Y A V SVD V RI +QLK+ G DNTII+ TSD+G + N Sbjct: 264 NNPDYAAMVESVDISVGRIEQQLKRLGLEDNTIIILTSDHGGLSSRGLKSNRVLATSNNP 323 Query: 360 QKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDG 418 + K Y GGT P+ + W K++ G+ ++ ++ D YPT L A +S+ DG Sbjct: 324 YRHGKGWIYDGGTRVPLIVKWPEKVKAGSISQVQVTGTDHYPTILQMAGLSLSPKDHQDG 383 Query: 419 VSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDL 478 VS L L P K + W + + + Sbjct: 384 VSYLAAL--NSDETPRKAMFWHSPAARPSKTGDTN------------------------- 416 Query: 479 SQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 S + ++ L+ ++ LY L D + +NLA P+ EM + + D Sbjct: 417 ---SSAIIEGEWKLLDFWSTGKVELYNLKDDKSEANNLAKLMPEKTAEMLAKLTNWKDDI 473 Query: 538 QP 539 Sbjct: 474 DA 475 >UniRef50_Q7ULE7 Iduronate-sulfatase and sulfatase 1 n=1 Tax=Rhodopirellula baltica RepID=Q7ULE7_RHOBA Length = 1049 Score = 418 bits (1074), Expect = e-115, Method: Composition-based stats. Identities = 125/527 (23%), Positives = 185/527 (35%), Gaps = 102/527 (19%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 D T KPN++V+ DD G+ L Sbjct: 566 FQIGDRTAQAVIPASKPNVVVILTDDQGWADLSCQN------------------------ 601 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIP 161 E TP + L GVR TN YV PSRA ++TGR R G+ + D +P Sbjct: 602 -EVDDIQTPHIDGLAARGVRCTNAYVTAPQCSPSRAGLITGRYQQRLGIDTIPD--MPLP 658 Query: 162 LTETFLPELFQNHGYYTAAVGKWHLS------KISNVPVPEDKQTRDYHDNFTTFSAEEW 215 + E Q GY T VGKWHL +P E + Sbjct: 659 TNAVTIAEHLQPKGYKTGFVGKWHLEPNVTCIDWMRRELPAMAGKPRRKVRIPWNKIEPY 718 Query: 216 QPQNRGFDYFMGFHAAGTAYYNSPSLFKNR-----ERVPAKGYISDQLTDEAIGVVDRAK 270 P +GFD + T Y + L + + + + D T+ A+ + R Sbjct: 719 SPSQQGFDEYY--WGERTNYRTNFDLTSGELLAEMKPIRDERFRIDVQTNAAVKFIQRNH 776 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYY-ASVYSVDQGVKRILEQL 329 DQPF L L Y PH P + +Y +F Y A + ++D GV +I++QL Sbjct: 777 --DQPFYLQLNYYGPHTPLEAT--QKYLDRFPGPMPERRRYALAMISAIDDGVGQIVDQL 832 Query: 330 KKNGQYDNTIILFTSDNGAVI--------------DGPLPLNGAQKGYKSQTYPGGTHTP 375 K G DNT+I+ TSDNGA + LN G K GG P Sbjct: 833 KAEGVLDNTLIVMTSDNGAPLKMTKTDSPINGDAGGWDGSLNDPWVGEKGMLSEGGIRVP 892 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEP 433 M +L G YD +SA+D P+ L A +P D DG+ L+P L + Q P Sbjct: 893 MIWSLPTQLPSGITYDWPVSALDIAPSVLKLAGGELPSGDAAFDGIDLIPRL-NDIQNPP 951 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 + L + +R + + Sbjct: 952 TRTLYFRF--------------------------------------WDQAAIRRGKWKYI 973 Query: 494 YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 + + + L+ L +D + NLA P++ ++ + + P Sbjct: 974 FAGDGRRF-LFDLESDQHEHRNLAEEYPELANKLHASLASWTSELSP 1019 Score = 209 bits (532), Expect = 3e-52, Method: Composition-based stats. Identities = 108/573 (18%), Positives = 181/573 (31%), Gaps = 143/573 (24%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGK--PNIIVLTMDDLGYGQ-LPFDKGSFDPKTMENREV 90 L A+ + + + S PN++ + MDDL + G Sbjct: 5 LLLASMIWILLAAPSTFADSPPTPSGPNVLFIAMDDL--NDWIGCLGG------------ 50 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 Q TP L L G+ FTN + P R+A+ TGRAP + G+ Sbjct: 51 -------------HPQTITPNLDRLAASGILFTNAHCPAPACNPCRSAVFTGRAPNQSGL 97 Query: 151 YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 Y N + + LP+ +NHGY+ + GK + + + F Sbjct: 98 YDNRQQMREVMPDDVILPQYMRNHGYHASGSGK---------LLHYFIDAASWDEYFPKA 148 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAI-GVVDR- 268 +E PQ F P + + + D A+ + Sbjct: 149 ESENPFPQ-----TFYPSQRPVNLKRGGPWQYVETDWAALDVTDEEFGGDWAVSQWIGEQ 203 Query: 269 -AKTLDQPFMLYLAYNAPHLPNDNPA-------------PDQYQK--------------- 299 + DQPF L PH P P P Y + Sbjct: 204 LQQKHDQPFFLGCGIYRPHEPWFVPKKYFEPFPLDSIQLPPGYLENDLDDVPPIGQRAAR 263 Query: 300 --------QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID 351 + + Q Y AS++ D + R+L+ L+ DNTI++ SD+G + Sbjct: 264 NRYFAHIQKQDQWKQGIQGYLASIHFADAMLGRLLDALESGPNADNTIVVLWSDHGWQLG 323 Query: 352 GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGK----LQPGN-----YDKLISAMDFYPTA 402 K + G T P+ + L G D ++ + +PT Sbjct: 324 EKEHW------QKYTPWRGVTRVPLMIRVPKTSSPSLPNGTPIGARCDAPVNLLSLFPTV 377 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFV 462 LD +P + DG SLLP L++ K Sbjct: 378 LDLC--QLPSNPVNDGPSLLPLLKEPKT-------------------------------- 403 Query: 463 RHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQ 521 D + HN T +Y V + ++ ++ LY + D + +NLA P+ Sbjct: 404 ----DTWKHNSVTYLSHPGAYAVSGRTHRYIHY-QDGSEELYNIEADPYEWNNLATK-PE 457 Query: 522 VVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKK 554 + +F +S ++ + ++ K Sbjct: 458 S----SEQLAQFRSTSPTKFAKRIEPSVKSLAK 486 >UniRef50_C0BKJ9 Sulfatase n=2 Tax=Bacteroidetes RepID=C0BKJ9_9BACT Length = 493 Score = 417 bits (1072), Expect = e-115, Method: Composition-based stats. Identities = 122/535 (22%), Positives = 187/535 (34%), Gaps = 107/535 (20%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 F + E PNII + DDLGYG+L Sbjct: 6 LVFIGFTFFSCSTVENQKDQPPNIIYILADDLGYGELGSYG------------------- 46 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT-- 154 + TP L L +G+RFT Y V PSR +TG + N Sbjct: 47 -------QKKIKTPNLDRLAADGMRFTQHYTGAPVCAPSRYMFLTGNHAGHAYIRGNYEL 99 Query: 155 --------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 Q IP T L ++ + GY TA +GKW L P Y+ Sbjct: 100 GQFSDEMEGGQMPIPETTPTLAKMLKKAGYQTAMIGKWGLGMNETTGSPLLHGFDYYYGY 159 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLF------KNRERVPAKGYISDQLTD 260 A + P + Y+ S + ++ + Y D++ + Sbjct: 160 LDQKQAHNYYPTHLW--ENDKKDPLNNDYFLVHSPISSKANQSDFDQFKGQEYAPDRMLE 217 Query: 261 EAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQ-----------T 307 +AI +D D+P+ LY PH+ P DQY+ F Sbjct: 218 KAIQFLDT-TASDKPYFLYYPSPIPHVSLQVPDSLVDQYRDVFEEEPYLGNKGYTAHQFP 276 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKG 362 Y A + +D V +I + +K+ GQ +NT+ILF+SDNG G +G Sbjct: 277 NAAYAAMITHLDSEVGKIWDSVKEKGQEENTLILFSSDNGPTFAGGVDPDFFNSAAGLRG 336 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSL 421 K Y GG P +WKGK++ G+ LIS D + T + A DG+S+ Sbjct: 337 LKMDVYEGGIRIPFIAYWKGKIKAGSISDLISGHWDMFNTFAELAGQDQSAP---DGISI 393 Query: 422 LPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 LP L + Q E H + + + Sbjct: 394 LPELLGESQNETHDYIYFEYPEK-----------------------------------RG 418 Query: 482 SYTVRNNDYSLV----YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVR 531 +R D+ V T +++ LY L TD + N+AA +P++V ++ + + Sbjct: 419 QIALRIEDWKGVKVEMKTNLDSKWELYNLKTDRNEVFNVAAEHPEIVNKIDSLHK 473 >UniRef50_C5PU94 N-acetylgalactosamine-6-sulfatase n=1 Tax=Sphingobacterium spiritivorum ATCC 33861 RepID=C5PU94_9SPHI Length = 443 Score = 417 bits (1072), Expect = e-115, Method: Composition-based stats. Identities = 115/527 (21%), Positives = 185/527 (35%), Gaps = 115/527 (21%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + + S + +PN I++ +DD+GYG + + Sbjct: 1 MNSIRYLITLFICLLAVFNSSAQTQPNFIIIYVDDMGYGDVGING--------------- 45 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L + EG+RF+N Y A SR A++TG+ P+R G Sbjct: 46 -----------NPNIETPNLDRMAMEGMRFSNYYSASPACTASRYALLTGKYPSRAGFRW 94 Query: 153 NTDAQD--GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 + D GI E+ + E + GY TA GKWHL Sbjct: 95 VLNPTDQIGIHQQESTIAERLKEKGYRTAIYGKWHLGSTR-------------------- 134 Query: 211 SAEEWQPQNRGFDYFMGFHAAGT---AYYNSPSL---FKNRERVPAKGYISDQLTDEAIG 264 +E+ P GFD ++G + Y +L + E P + ++ T++AI Sbjct: 135 --KEFLPLANGFDEYVGLPYSNDMIPPKYPDIALLSGYDTLELNPDQSKLTRLYTEKAIA 192 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 + + QPF +YL Y PH P G Y V +D + R Sbjct: 193 FITKNAK--QPFFIYLPYAMPHTPLHASED-------FLGKSKRGLYGDVVQELDHHIGR 243 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 +L LK+N T ++FTSDNG + + G + K T+ GG P F+W Sbjct: 244 LLTFLKENKLDQQTYVVFTSDNGPWLIQNQNGGSAGLFRDGKGSTWEGGMREPFFLWGHH 303 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 + G +++ +A+D PT A IS + K+DG +L P KK + + Sbjct: 304 TIPKGYVENEVFTALDMLPTITALAGISAGPN-KIDGTNLKPLWSGKKDTKGRDEFFYF- 361 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL--------- 492 L VR + L Sbjct: 362 -----------------------------------GLDHQLMAVRKGPWKLHVKTYSQLG 386 Query: 493 VYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + + L+ L D +K NLA+ P++V ++ ++ Sbjct: 387 LVYFDKQLPLLFNLDHDPSEKYNLASQYPEMVSDLTTLILSKEKEIA 433 >UniRef50_B4D3U0 Sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D3U0_9BACT Length = 467 Score = 417 bits (1072), Expect = e-115, Method: Composition-based stats. Identities = 116/524 (22%), Positives = 188/524 (35%), Gaps = 122/524 (23%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 + + + +PN I + DDLG+G + F Sbjct: 23 SLFGLVVSSLAADTAPLRPNFIFILADDLGWGDVGFH----------------------- 59 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 TP L L EG+ YV + V P+R A ++GR +RF V + Sbjct: 60 ----HGNVPTPNLDHLAGEGLELMQHYV-YPVCSPTRCAFLSGRYASRFSVTT-PQNPRA 113 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 L ++ GY TA GKWHL S EW PQ Sbjct: 114 FRWDTVTLARALKSVGYDTALCGKWHLG-----------------------SKPEWGPQK 150 Query: 220 RGFDYFMGFHAAGTAYYNSPS--------LFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 GFD+ G A G ++ ++ + + +G+++D +T EA+ ++ Sbjct: 151 FGFDHSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIEEQGHVTDLITKEAVEWLE--SR 208 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 D+PF LY+ + A H+P P + + + +Y A+V +D V +IL L+K Sbjct: 209 TDKPFFLYVPFTAVHIPIREPDEILQRVPASITKPSLRHYGANVMHLDDSVGKILVALEK 268 Query: 332 NGQYDNTIILFTSDNGA----------------VIDGPLPLNGAQKGYKSQTYPGGTHTP 375 G+ NT+++F SDNGA N G K + Y GG HT Sbjct: 269 TGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYPPGPAGGSNEPLHGMKGEVYEGGIHTA 328 Query: 376 MFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 W G+L+PG + L D+ PT A KDLK DG ++ P L + +P Sbjct: 329 AVARWPGQLKPGKFLGLAHITDWMPTFCALAGYKPEKDLKWDGQNIWPQLTGAEPVKPRT 388 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 S +R+ D+ LV + Sbjct: 389 IYV-------------------------------------AGPGFRSKALRDGDWKLVLS 411 Query: 496 VENN------QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 ++ L+ + D + ++A P +V ++ + + Sbjct: 412 QTKGSKNSPPKVELFNIGADPTEHTDVAGQFPDIVGRLRIKLEQ 455 >UniRef50_Q7UWW9 Arylsulfatase n=1 Tax=Rhodopirellula baltica RepID=Q7UWW9_RHOBA Length = 622 Score = 416 bits (1071), Expect = e-115, Method: Composition-based stats. Identities = 115/543 (21%), Positives = 194/543 (35%), Gaps = 106/543 (19%) Query: 28 HAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 + + T + + P+ ++ PN+I++ DD GYG F+ + Sbjct: 11 TLLCTISIAFAITTLFIATPRPSGAAS---PNVILVMTDDQGYGDFSFNGNPY------- 60 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 TP L L E V+ T+ +VA + P+R +M+G R Sbjct: 61 -------------------IQTPALDRLASESVQLTDFHVA-PMCTPTRGQLMSGLDAFR 100 Query: 148 FGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + + + + + ++FQ+ GY T GKWHL Sbjct: 101 NSAINVSSGRTLLRHDLKTMADVFQDAGYRTGIFGKWHLGDNY----------------- 143 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAG--------TAYYNSPSLFKNRERVPAKGYISDQLT 259 ++P++RGFD + F ++ Y + +N +RV GY +D Sbjct: 144 ------PFRPEDRGFDETLWFPSSHINSVPDFWDNDYFDDTYIRNGKRVAHSGYCTDVFF 197 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTG-------SQTADN-- 310 DEAI + D PF ++ N+ H P PDQY+ + T + D Sbjct: 198 DEAIEWAKQTSPTDSPFFAFIPLNSAHWPW--FVPDQYRARVRTMLGDTTELKRQLDTTP 255 Query: 311 --------YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKG 362 + A ++D V + + L ++G +NTI++F +DNG+ N +G Sbjct: 256 SNLEDLISFLAMGLNIDDNVGTLTQYLDESGLSENTIVVFLTDNGSTFGDHY-FNAGMRG 314 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K+Q + GG P + W ++ D L D PT AD LDG SL Sbjct: 315 KKTQLWEGGHRVPCLIRWPEQITAQKIDDLTHVQDLLPTLAALADCDEHLPGPLDGTSLA 374 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P L + + L S F N + Sbjct: 375 PRLLGETDSLADRMLVINYSRMPQFKVTYTK-------------------GNPAIPRRNG 415 Query: 483 YTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPL 541 V N + L+ LY + D Q N+A +P++V +M+ + + D + + Sbjct: 416 AAVMWNKWRLLENKR-----LYNVEQDPHQDHNVAQDHPEIVAKMRAHLATWWDGVKDDV 470 Query: 542 SEV 544 Sbjct: 471 MTP 473 >UniRef50_A0Z632 Arylsulfatase B n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z632_9GAMM Length = 545 Score = 416 bits (1070), Expect = e-114, Method: Composition-based stats. Identities = 139/574 (24%), Positives = 215/574 (37%), Gaps = 134/574 (23%) Query: 34 KLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 LKA + + KPNI+++ DDLG+ + + G Sbjct: 11 LLKALLMMCVILGVPAS--AQSQKPNILIMVADDLGWADVGYHGG--------------- 53 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP+L L +GVR Y + P+RAA+MTGR P R GV Sbjct: 54 ------------DIDTPSLDRLAQQGVRLNRFYTT-PICSPTRAALMTGRDPIRLGVTYG 100 Query: 154 TDAQD---GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 G+ E F+PE FQ GY TA +GKWHL Sbjct: 101 VIFPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMT------------------ 142 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYISDQLTDEAIG 264 + P NRGF++F G +Y +N + +GY + L DE Sbjct: 143 ----YHPNNRGFEHFYGHLHTEVGFYPPFSNQGGKDFQRNGVSIDDQGYETYLLADEVSR 198 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQK--------------------QFN 302 + + D+PF++Y+ + APH P D P D+Y+ + Sbjct: 199 YIRE-RDRDRPFLVYMPFIAPHTPLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVM 257 Query: 303 TGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLP-LNGAQK 361 Y A V ++DQ + R+L+ L + G DNTI+LF SDNG N + Sbjct: 258 LQPSARPMYAAVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGGAAYSYGGANNAPLR 317 Query: 362 GYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVS 420 G K +T+ GG M W L+PG ++++S MD +PT +DAAD+ + LDG S Sbjct: 318 GGKGETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVDAADVRPGNNFALDGRS 377 Query: 421 LLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQ 480 + L+ Q L + + + Sbjct: 378 MWTALKSGDQVPLEGPLIFGSEIPIYG--------------------------------N 405 Query: 481 FSYTVRNNDYSLVYTVENNQL------GLYKL-TDLQQKDNLAAANPQVVKEMQGVVREF 533 F++ N ++ LV V+ Q+ L+K+ +D + +NLAA P +V+ + + + Sbjct: 406 FNFAAFNEEWKLVQEVQQEQIAITVTNYLFKISSDPYEHNNLAAVYPDIVENLSKAILNW 465 Query: 534 ID---------SSQPPLSEVNQEKFNNIKKALSE 558 PP + + L E Sbjct: 466 RALYPINGTRSQLVPPPGWRAPHDWASYPIPLEE 499 >UniRef50_A6DMX7 N-acetyl-galactosamine-6-sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX7_9BACT Length = 578 Score = 416 bits (1069), Expect = e-114, Method: Composition-based stats. Identities = 117/551 (21%), Positives = 196/551 (35%), Gaps = 124/551 (22%) Query: 41 NVAFSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 S FT + KP N++ + DDLG+ + Sbjct: 6 CFLLSFFTVGLIAAADKPMNVVFILADDLGWSDTELYGQT-------------------- 45 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ-- 157 TP ++ L G F Y + P+RA+ +TG+ PAR G Sbjct: 46 -----KLYKTPNIMRLAKMGCTFDRAYSNSPLCSPTRASFLTGQTPARHGSTQPRHHTKT 100 Query: 158 -----------------------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 + + ++ + GY T GKWHL Sbjct: 101 VALKAELAKKARPTEKALPVSTATRLDTNFPTIGKMMKQAGYETGHFGKWHLG------- 153 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--YNSPSLFKNRERVPAKG 252 E + P GFD + H Y +P ++ + K Sbjct: 154 -----------------PEPYSPLQHGFDVDIPHHTGAGPGKSYVAPWSQEHIKPNYEKE 196 Query: 253 YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQTA-D 309 YI D++ +E + VD + D+PF + + H P D D+Y+K + S+ Sbjct: 197 YIEDRMVEECLKWVDGL-SGDKPFFMNYWMFSVHAPFDAKQELIDKYKKVIDPNSKQRSA 255 Query: 310 NYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP------LPLNGAQKGY 363 Y A V S+D V +LE L+ G DNT+I+FTSDNG I N G Sbjct: 256 LYAAMVQSLDDAVGALLEGLESRGLMDNTVIIFTSDNGGNIYSQLDEGIVPTSNFPLSGG 315 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K+ GG P + W G + G D+++ DFY T + + I++P+ +DG+ + Sbjct: 316 KASMCEGGVRVPCTVVWPGVTKAGSRSDEIVQTSDFYTTIIKGSGIALPEGHVVDGIDIR 375 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFS 482 P L+ +K K + +P + S Sbjct: 376 PALKGEK--LDRKAIF----------------------------TYFPCIVPVPEWLPPS 405 Query: 483 YTVRNNDYSLVYTVENNQ-----LGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 +V + + LV + LY L D+ +++NLA +NP+++K + ++ ++ Sbjct: 406 MSVHSGKWKLVRVFFGGENGEHDYKLYDLSNDIAEENNLADSNPELLKRLDNLIEAYLTE 465 Query: 537 SQPPLSEVNQE 547 + N + Sbjct: 466 TNAVTPVPNPD 476 >UniRef50_A3ZWK4 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3ZWK4_9PLAN Length = 442 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 122/498 (24%), Positives = 187/498 (37%), Gaps = 71/498 (14%) Query: 64 TMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFT 123 DD G+G + ++ AA TP L EGVRFT Sbjct: 1 MADDQGWGDVGYN--------------------------HAAPIHTPNLDQAAAEGVRFT 34 Query: 124 NGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGK 183 Y A V P+R +++TGR P R VY+ I E L E Q GY T+ GK Sbjct: 35 RFYAAAPVCSPTRCSVLTGRNPNRSAVYAW---GWPIRPQEITLAERLQAAGYATSHFGK 91 Query: 184 WHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK 243 WHL + P GFD ++ +A Y N P + Sbjct: 92 WHLGSVRKDS--------------------PVSPGKCGFDDWI---SAPNFYDNDPIMSD 128 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 V G SD D AI + ++PF + + +PH P+ D ++ + Sbjct: 129 QGRAVQYHGESSDVTADLAIDWIRAQAKEEKPFFSVVWFGSPHSPHIAADAD--RELYKD 186 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGY 363 +YY V +D+ +I LK+ G DNTI+ + SDNGA D G + Sbjct: 187 EPAKFRDYYGEVTGIDRAYGKIRSTLKELGISDNTILWYCSDNGA--DKAKGSAGPFREK 244 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPTALDAADISIPKDLKLDGVSLL 422 K Y GG P + W + L + D +PT L AA +S K LDG++LL Sbjct: 245 KGSIYEGGLLVPGILDWPARFPAPQTTSLRATTCDIFPTVLAAAGLSPDKQRPLDGINLL 304 Query: 423 PWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRH----------QSDDYPHN 472 P L K P W T+ + + ++ + D P Sbjct: 305 PLLTAKTDMRPQPIGFWQTANGGKPVRSDAMMEELLNQQATGGDLPADEVSLHAADLPKP 364 Query: 473 PNTEDLSQFSYTVRNNDYSLVYTVENN---QLGLYKL-TDLQQKDNLAAANPQVVKEMQG 528 P + D ++ + D+ L + LY L D +K+N+ P++ +++ Sbjct: 365 PVSIDTLAGHASLTSGDWKLHRIENKKGAVRFELYDLAADPYEKENVLKQYPEIAEKLTK 424 Query: 529 VVREFIDSSQPPLSEVNQ 546 + R++ S L+ + Sbjct: 425 LQRDWRLSVVNSLNGADY 442 >UniRef50_A6DMX9 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX9_9BACT Length = 467 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 117/538 (21%), Positives = 192/538 (35%), Gaps = 127/538 (23%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 V S F + KPNI+++ DD GY L + Sbjct: 8 LVLLSTFVAASLTAAEKPNILIIFTDDQGYADLGCFGSEEN------------------- 48 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGI 160 TP L L EG +FT+ Y V GPSR+A++TGR PAR G+ Sbjct: 49 -------QTPVLDKLAKEGTKFTSFYA-QPVCGPSRSALLTGRYPARS-------KGWGM 93 Query: 161 PLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNR 220 P +E E+ + GY TA VGKW D P + Sbjct: 94 PASEITFAEMLKETGYQTACVGKW--------------------DVSNRQPIIPRMPNAQ 133 Query: 221 GFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGY---ISDQLTDEAIGVVDRAKTLDQPFM 277 GFDY+ G + L++N ++ ++ T++AI +++ + ++PF+ Sbjct: 134 GFDYYYGTLGGNGS--GKIDLYENNKKERTTEDMASLTRLYTNKAIDFLEKQRDPEKPFI 191 Query: 278 LYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 LYLA+ H D + Y A+V +D R+L +L + N Sbjct: 192 LYLAHTMTHTVVDASPK-------FKEKTGDNLYRAAVEELDYETGRLLNKLNQLNLSKN 244 Query: 338 TIILFTSDNGAVID----------------GPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 T++++TSDNG G + K+ + GG H P M W Sbjct: 245 TLVIYTSDNGPWNQPKYINGGAKNDHPENSIFWGDAGEFRDGKASIWEGGAHVPCVMRWP 304 Query: 382 GKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI 440 GK+ G + L++ +DF PT IP + +DGV+ L ++ K + + Sbjct: 305 GKIAAGKTNDGLMATIDFLPTLAAVTGAKIPDERVIDGVNQLGFICGKSETARETYIYN- 363 Query: 441 TSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV------- 493 P + + + +R ++ L+ Sbjct: 364 -----------------------------PGSASVQTKLVQGNAIREGNWKLISPLTVGW 394 Query: 494 --YTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 LY L D+ + NLA P+ V+ ++ ++ SS+ +V Sbjct: 395 FLEDAGTGSWELYNLKEDIGETKNLAKQYPEKVEHLKKLL----QSSEAKFPKVKPRP 448 >UniRef50_B6RB10 Arylsulfatase n=7 Tax=Coelomata RepID=B6RB10_HALDI Length = 481 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 127/531 (23%), Positives = 205/531 (38%), Gaps = 78/531 (14%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKP-NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 V + S G+P +I+ + DDLG+ + F Sbjct: 2 FVQLLCTVLVIINLCDDVSAAGRPRHIVFIVADDLGWNDIGFH----------------- 44 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN 153 TP + L EG+ + YV + PSRAA M+G P + G+ + Sbjct: 45 ----------NPDIITPNIDKLAREGLLLNHHYV-QPLCSPSRAAFMSGYYPFKTGLQHS 93 Query: 154 T---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 + +PL T LP+ + GY T VGKWH F Sbjct: 94 VILENQPVCLPLNITILPQKLKELGYATHIVGKWHNG----------------------F 131 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP-----SLFKNRERVPAK--GYISDQLTDEAI 263 + P RGFD F G++ A YY N V Y + + TD A Sbjct: 132 CSWNCTPTYRGFDSFFGYYGAMEDYYTHVIRGFLDYRNNTTPVWTDNGTYSTLRFTDVAT 191 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQ-TADNYYASVYSVDQGV 322 +++R QP LYLAY A + P + PA +Y+ + + V ++D+ V Sbjct: 192 DIIERH-NQSQPLFLYLAYQAVYGPIEVPA--KYEAMYPNIKSENRRKFSGMVSALDEAV 248 Query: 323 KRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKG 382 + + L++ G D+T+ILFT+DNG +D N +G K Y GGT FM+ G Sbjct: 249 GNVTKTLRQRGLMDDTLILFTADNGGGVDES-GNNYPLRGSKFTVYEGGTRAVGFMYGSG 307 Query: 383 KLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWIT 441 + G D +I A+D+ PT AA + D DG++L P L + + Sbjct: 308 LQKTGTVFDGMIHAVDWLPTLTAAAGGTPVSDR--DGINLWPSLSTASPSPRTEVVYNYD 365 Query: 442 SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL 501 S+ + + +K + +P E ++ T + D NQ Sbjct: 366 SHP-QPVQGHAAIRVGDYKLIDGYPGPFPDWYKPEQVTSSLNTRFSRD-------SANQY 417 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 L+ L D ++++L+ P +VK++ + + + PP + +N Sbjct: 418 QLFNLKDDPNERNDLSNFRPDMVKKLAARLAWYKKQAVPPNFPETPDDLSN 468 >UniRef50_A6KZI7 Arylsulfatase n=23 Tax=Bacteroidales RepID=A6KZI7_BACV8 Length = 508 Score = 415 bits (1067), Expect = e-114, Method: Composition-based stats. Identities = 130/560 (23%), Positives = 198/560 (35%), Gaps = 127/560 (22%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + L + +AF T KPNII + DD+GYG L + Sbjct: 1 MKNRNLFLLTSGLAFPLLGAYAQKTP-KPNIIYIMCDDMGYGDLGCYGQPY--------- 50 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 STP + ++ EG+RFT Y VS PSRA+ MTG+ Sbjct: 51 -----------------ISTPNIDNMAKEGMRFTQAYAGSPVSAPSRASFMTGQHSGHCE 93 Query: 150 VYSN-------------------TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKIS 190 V N Q +PE+ +++GY T GKW Sbjct: 94 VRGNKEYWRDAPVVMYGNNKEYAVVGQHPYDPGHVIIPEIMKDNGYTTGMFGKWAGGYEG 153 Query: 191 NVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA------YYNSPSLFKN 244 +V P+ + +Y+ F A + P F A TA N Sbjct: 154 SVSTPDKRGIDEYYGYICQFQAHLYYPN---FLNRYSKSAGDTAVVRVVMDENINYPMFG 210 Query: 245 RERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ---YQKQF 301 ++ Y +D + +EA+ +D+ QPF Y PH P YQK+F Sbjct: 211 KDYFKRPQYSADMIHEEAMKWLDKQ-DGKQPFFGIFTYTLPHAELAQPEDSILTGYQKKF 269 Query: 302 NTGS--------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 T + + +D V +L +LK+ G +NTI++FTSDNG Sbjct: 270 FEDKTWGGQEGSRYNPSVHTHAQFAGMITRLDYYVGEVLNKLKEKGLDENTIVIFTSDNG 329 Query: 348 AVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKL-ISAMDFYPT 401 +G +G +G K Q Y GG P + W GK+ G + ++ D PT Sbjct: 330 PHEEGGADPTFFGRDGKLRGLKRQCYEGGIRIPFIVRWPGKVPEGTVNDHQLAFYDLMPT 389 Query: 402 ALDAADISI---------PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENI 452 D A + DG+S P L ++ + H L W + Sbjct: 390 FCDLAGVKNYVKKYTNKKKDVDYFDGISFAPTLLGQEGQKKHDFLYWEFDETDQIG---- 445 Query: 453 PFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQ 511 VR D+ +V V+ LY L TD+ + Sbjct: 446 --------------------------------VRMGDWKMV--VKKGTPFLYNLATDIHE 471 Query: 512 KDNLAAANPQVVKEMQGVVR 531 ++AA +P +VK+M+ ++R Sbjct: 472 DHDIAAGHPDIVKQMKEIIR 491 >UniRef50_A6DJ11 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DJ11_9BACT Length = 462 Score = 414 bits (1066), Expect = e-114, Method: Composition-based stats. Identities = 104/555 (18%), Positives = 178/555 (32%), Gaps = 153/555 (27%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 + + + KPN+I++ DD GY L Sbjct: 3 LFSLLTLISLQFLMAADTSKPNVIIILTDDQGYNDLSCYGS------------------- 43 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ 157 +P + L +EG++ T+ YVA V SRAA++TGR P GV Sbjct: 44 -------KTIKSPRIDQLAEEGLKLTSYYVASPVCSASRAALLTGRYPKLVGVPGVFFPN 96 Query: 158 D---GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 G+ + +L ++ GY T AVGKWHL E Sbjct: 97 RGHKGLDPKHQTIAKLLKSVGYATKAVGKWHLGD-----------------------ELE 133 Query: 215 WQPQNRGFDYFMGFHAAGT------------------------------------AYYNS 238 + P N+GFD + G + + Sbjct: 134 FLPTNQGFDSYYGIPYSNDMTPAFSMKYSENCLYREGVDQEALKKAFEANKIKPVGMKDK 193 Query: 239 PSLFKNRERVP---AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPD 295 L +N E + + I+ + TDE+I +D + ++PF LYLA++ PH P Sbjct: 194 VPLMRNDECIEMPADQSTITKRFTDESIKFIDESTASNKPFFLYLAHSMPHTPL------ 247 Query: 296 QYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGP 353 Y + G Y + +D V RI++ L + +NT+ ++TSDNG + Sbjct: 248 -YVSKDFEGKSAGGIYGDVIEEIDYNVGRIIDHLNEKNIAENTLFIYTSDNGPWLIKKSH 306 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPTALDAADISIPK 412 K ++ GG P + W K+ +++ +MD +PT Sbjct: 307 GGSALPLFEGKMTSFEGGQRVPAIIRWPAKIPKDSVSNEMTLSMDIFPTLAKITGAKAQD 366 Query: 413 DLKLDGVSLLPWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 ++G + L +D H + Sbjct: 367 ADLINGKNALELYEDPANFKTKHDYFFYS------------------------------- 395 Query: 472 NPNTEDLSQFSYTVRNNDYSLVY---------TVENNQLGLYKLT-DLQQKDNLAAANPQ 521 VR+ ++ + LY L+ D+ + NL P+ Sbjct: 396 ----------PRAVRHKNWKYHQQETFKLKSTARKTKGPSLYDLSKDIGESKNLINDYPE 445 Query: 522 VVKEMQGVVREFIDS 536 + +++ + E Sbjct: 446 IAAQLKNALLEHNKK 460 >UniRef50_Q7UN55 N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Rhodopirellula baltica RepID=Q7UN55_RHOBA Length = 501 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 120/550 (21%), Positives = 202/550 (36%), Gaps = 107/550 (19%) Query: 22 MAAFAAHAADDVKLKATKTNVA---FSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKG 78 + + + A + + V+ S +PNII + DDLGYG L Sbjct: 16 LRSLSRLALAFCCIAVSYRVVSGDESSKADSPASGDALRPNIIYVMADDLGYGDLGCYG- 74 Query: 79 SFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAA 138 + TP L + +G+RFT+ Y H V PSR Sbjct: 75 -------------------------QTRIQTPHLDQMAADGIRFTDHYAGHTVCRPSRLT 109 Query: 139 IMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 + TG+ G+ N A + + + L + GY T VGKW L + E+ Sbjct: 110 LWTGKHVGSTGLIGN--AARNLTGEQPTVASLLSDAGYATGGVGKWALGNVDVPEEIENP 167 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRER---------- 247 P GFD + G+ A+ P L++N ER Sbjct: 168 G----------------HPLANGFDAWTGYMNQSNAHNYYPRFLWQNYERRFFPGNVIST 211 Query: 248 ---------VPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND-------- 290 V + Y D +TD A + ++ PF+L++ + PH N+ Sbjct: 212 DPIARGRVAVKRESYSHDVMTDAAFDFIREHRSD--PFLLHVHWTIPHANNEGGRLNGDG 269 Query: 291 NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 PD + A + +D+ + R+++ L++ + T+++FTSDNG Sbjct: 270 MEVPDYGIYADEGWPNPEKGFAAMITRMDRDMGRLMDLLEELKLSEKTLVIFTSDNGPHH 329 Query: 351 DGPLP-----LNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALD 404 +G +G +G K + GG P W G ++PG D + DF PTA + Sbjct: 330 EGGHSDLFFNSSGPLQGSKRSMHEGGIRVPFIAKWPGTIEPGTISDHPSAFWDFLPTACE 389 Query: 405 AADISIPKDLKLDGVSLLPWLQDK-KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVR 463 A P D +DG+S LP L D+ K+ H+ L W +S W + Sbjct: 390 LAGAEPPAD--IDGISYLPALLDQPKKQTKHRYLYWASSEGPTSVGLRSGTWKAVNYPGG 447 Query: 464 HQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQV 522 + R+ + V + L+ L +D +K++++ +P Sbjct: 448 TKKR------------------RSGNSKPVVN--EDGWKLFDLASDPGEKNDVSKDHPAE 487 Query: 523 VKEMQGVVRE 532 ++ + + RE Sbjct: 488 LERLVEMARE 497 >UniRef50_C1ZI83 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZI83_PLALI Length = 558 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 121/525 (23%), Positives = 202/525 (38%), Gaps = 99/525 (18%) Query: 27 AHAADDVKLKATKTNVAFSDFTP--TEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKT 84 + + + + T + KPN++++ DDLGY + + Sbjct: 75 SWFLCHLAISLSLCLWQVDSITKVMAAEARPEKPNVVIINCDDLGYADVGAFGATIC--- 131 Query: 85 MENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRA 144 TP + + EGV+ T+ YVA V SR A++TG Sbjct: 132 -----------------------KTPEIDRMAREGVKATSFYVAQAVCSASRTALLTGCL 168 Query: 145 PARFGVYSNTD--AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRD 202 P R G+ +++GI +E L ELFQ+ GY TA GKWHL + Sbjct: 169 PNRIGILGALSHVSKNGIADSEVTLGELFQSQGYSTAMYGKWHLGYQA------------ 216 Query: 203 YHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS-------LFKNRERVPAK--GY 253 ++ P + GF +G + + +P LF+ + PA+ G+ Sbjct: 217 -----------QFLPGHHGFGEALGIPYSNDMWSKNPYGKFPPLPLFRQKGDSPAEIIGH 265 Query: 254 ISDQ------LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 +DQ T A+ +DR D+PF +YLA+ PH P + + + Sbjct: 266 DTDQSRFTTDFTMAAVSFIDRH--ADKPFFIYLAHPMPHTPI-------FVSEERNSGER 316 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKS 365 A Y + +D V I + L+K+ T+++FTSDNG + G + K Sbjct: 317 AQLYRDVIGEIDWSVGTIRQTLEKHQLTRKTLVIFTSDNGPWLVFGNHAGSTGPLREGKG 376 Query: 366 QTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 + GG P W G + P D ++ D +PT +P D +DGV + P Sbjct: 377 TMWDGGARVPFVACWPGVIPPDTTVDLPMATYDLFPTFAKMLGAKLP-DHPIDGVDIWPQ 435 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L + +PH+ L W + P+ + H + Sbjct: 436 LTSASKAQPHQAL-WFYYGRDLIAVRSGPWKLVFPHTYVHPVERGNDGQRG--------- 485 Query: 485 VRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQG 528 LV + +L LY L +D+ + NLA+ +P++VK+++ Sbjct: 486 ------KLV-NRKFTELALYNLDSDIGETTNLASQHPEIVKQLEA 523 >UniRef50_A7AKS6 Putative uncharacterized protein n=3 Tax=Bacteroidales RepID=A7AKS6_9PORP Length = 464 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 142/543 (26%), Positives = 221/543 (40%), Gaps = 111/543 (20%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 D + + + + + + +PNI++L DD GY F Sbjct: 6 IDRLFVVSLGAITGLASCSSGQDEEAQRPNILILLADDAGYADFGFMG------------ 53 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 A TP + L EG FT+ +VA VS PSR+ ++TGR R+G Sbjct: 54 --------------ATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYG 99 Query: 150 VYSNTD-AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 N D DG+P E LP L + + Y T +GKWHL Sbjct: 100 YECNLDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLG--------------------- 138 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS----------LFKNRERVPAKGYISDQL 258 S +P +GFD F G A +Y+ P N ++ GY +D+L Sbjct: 139 --SEPSQRPNAKGFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQYNGRKLSFDGYFTDEL 196 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSV 318 +A V +QPFMLY+++ APH PN+ D Q Y A +Y++ Sbjct: 197 ASKAQQFVTE---SEQPFMLYMSFTAPHSPNEATEEDL----ARFEGQPRQKYAAMMYAL 249 Query: 319 DQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFM 378 D+GV +I+++LK G++DNTII F SDNG N KG+K + GG P F+ Sbjct: 250 DRGVGKIVDELKAAGKFDNTIIFFLSDNGGSTT-NQSSNLPLKGFKGNKFEGGQRVPFFV 308 Query: 379 WWKGKLQP-GNYDKLISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEPHKN 436 W + + + L S++D + T +DA DI +DGVSLLP+L +K G PH+ Sbjct: 309 VWGDRFKRDQRFTGLTSSLDIFATVVDALDIPEEGLHKPIDGVSLLPYLSGEKSGNPHEA 368 Query: 437 LTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTV 496 L W + +R+ Y L+ T Sbjct: 369 LFWR--------------------------------------KMDTRAIRSGSYKLIITR 390 Query: 497 ENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNIKKA 555 + + LY + D+++ +L ++ P+ +E+ + E+ + +E + I Sbjct: 391 GVDSV-LYNMDQDVEEMHDLLSSEPEKARELMEQLSEWEQACCKD-PLWIEEGWAEITNG 448 Query: 556 LSE 558 L E Sbjct: 449 LHE 451 >UniRef50_Q5FYB1 Arylsulfatase I n=5 Tax=Chordata RepID=ARSI_HUMAN Length = 569 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 109/519 (21%), Positives = 189/519 (36%), Gaps = 79/519 (15%) Query: 59 NIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDE 118 +II + DD GY + + TPTL L + Sbjct: 48 HIIFILTDDQGYHDVGYHGS---------------------------DIETPTLDRLAAK 80 Query: 119 GVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN---TDAQDGIPLTETFLPELFQNHG 175 GV+ N Y+ + PSR+ ++TGR G+ + + +PL + LP+ Q G Sbjct: 81 GVKLENYYI-QPICTPSRSQLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAG 139 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y T VGKWHL F +E P RGFD F+G Y Sbjct: 140 YSTHMVGKWHLG----------------------FYRKECLPTRRGFDTFLGSLTGNVDY 177 Query: 236 YNSPS---------LFKNRERVPA---KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYN 283 Y + E V Y + A + + + +P LY+A+ Sbjct: 178 YTYDNCDGPGVCGFDLHEGENVAWGLSGQYSTMLYAQRA-SHILASHSPQRPLFLYVAFQ 236 Query: 284 APHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFT 343 A H P +P Y+ + G+ Y A V +D+ V+ I LK+ G Y+N++I+F+ Sbjct: 237 AVHTPLQSPREYLYRYRT-MGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFS 295 Query: 344 SDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGNYDKLISAMDFYPTA 402 SDNG N +G K + GG F+ K + L+ D+YPT Sbjct: 296 SDNGGQT-FSGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTL 354 Query: 403 LDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYS----HWFDEENIPFWDNY 458 + A + LDG + P + + + + L I H E W+ Sbjct: 355 VGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTA 414 Query: 459 HKFVRHQSDDYPHNPNT---EDLSQFSYTVRNNDYSLVYTVEN--NQLGLYKLT-DLQQK 512 + + + + + + + + + + + L+ ++ D ++ Sbjct: 415 VQAAIRVGEWKLLTGDPGYGDWIPPQTLATFPGSWWNLERMASVRQAVWLFNISADPYER 474 Query: 513 DNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 ++LA P VV+ + + E+ ++ P + + Sbjct: 475 EDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513 >UniRef50_A6P2X1 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6P2X1_9BACE Length = 494 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 132/518 (25%), Positives = 201/518 (38%), Gaps = 120/518 (23%) Query: 56 GKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSL 115 PN++V+ +DD+GYG L + STP + +L Sbjct: 69 DPPNVVVIYVDDMGYGDLGCTGATA--------------------------ISTPNIDAL 102 Query: 116 MDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF---GVYSNTDA---------------- 156 + GV TN Y + SRA ++TGR P R G Y NT+ Sbjct: 103 AEGGVLLTNYYAPAPICSASRAGLLTGRYPIRTLTSGAYMNTEGLSGHLANLLEVVKGTY 162 Query: 157 ---QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 DG+P E LPE+ Q GY TA VGKWHL E Sbjct: 163 PYQNDGLPTDEILLPEVLQQAGYETALVGKWHLG-----------------------IRE 199 Query: 214 EWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGY----ISDQLTDEAIGVVDRA 269 E +P NRGFD F G + + ++ N E V + Y ++ +LT A +D Sbjct: 200 EERPYNRGFDLFYGALYS--DDNDPHRIYHNDEVVHDEPYDQSGMTKELTQVAKQFIDDN 257 Query: 270 KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQL 329 + D PF LY A PH P++ G+ A Y + VD V I++ L Sbjct: 258 Q--DGPFFLYYASPFPHWPSNASEE-------WLGTSQAGIYGDCMQEVDWSVGEIMDTL 308 Query: 330 KKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY 389 ++NG +NT+++FTSDNG D G Q+G K Y GG+H P + G + G Sbjct: 309 EENGLLENTLVIFTSDNGPWYD---GATGGQRGRKDTNYNGGSHVPFIAYMPGTIPEGEV 365 Query: 390 -DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFD 448 D L+S +D +PT L+ I +P+D +DG+ + P+L + + + Sbjct: 366 YDGLMSGVDVFPTILNLLGIELPQDRVIDGMDMWPFLTGQSDSPRTELFLNKDKDTFALI 425 Query: 449 EENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-T 507 E+N + + + + L LY L T Sbjct: 426 EDNFKYLERSYSENGTY------------------------WML-----QQGPFLYNLDT 456 Query: 508 DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 D ++ ++ P+ +EM + F S + + Sbjct: 457 DPEEAYDVTTHFPEKAEEMAQKIDSFKQSLKENIRGWK 494 >UniRef50_D2QXE9 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QXE9_9PLAN Length = 495 Score = 414 bits (1064), Expect = e-114, Method: Composition-based stats. Identities = 106/562 (18%), Positives = 190/562 (33%), Gaps = 93/562 (16%) Query: 16 LILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPF 75 L ++ A V + A +PN++V+ +DDLG+G Sbjct: 7 WELRGLLSWGLAGCLWLVAVLVAGEAFA--------DDAARQPNVVVVFIDDLGWGDFSC 58 Query: 76 DKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPS 135 TP + + EG+RF+ YV+ + PS Sbjct: 59 FGNKEGA--------------------------TPHIDRMAAEGIRFSQFYVSSPICSPS 92 Query: 136 RAAIMTGRAPARFGVYSNTDAQ---------DGIPLTETFLPELFQNHGYYTAAVGKWHL 186 R ++ TG+ P R+ + S +++ + + + + Q HGY T GKWHL Sbjct: 93 RCSLTTGQYPQRWKITSFLNSRADNARRGVANWLDPEAPTMARILQQHGYRTGHFGKWHL 152 Query: 187 S---KISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK 243 + + P NF A+ + D G + +P + Sbjct: 153 GGQRDVDDAPAIAKYGFDASLTNFEGMGAKLLPLTLKPGDSVPGKIWSDAERLGAPVTWM 212 Query: 244 NRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 R ++ D AI +D A +PF + + + H P P Sbjct: 213 QRSKITGG------FVDGAIAFIDAATRDGKPFYVNVWPDDVHSPFWPPVETW------- 259 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNG-QYDNTIILFTSDNGAVIDGPLPLNGAQKG 362 G + Y A + ++D + ++ + L+ +NTI+L SDNG + G +G Sbjct: 260 GENKRELYLAVLEAMDLQLGKLFDHLRSRDELRENTIVLICSDNGP--EAGAGSAGPFRG 317 Query: 363 YKSQTYPGGTHTPMFMWWKGKLQP---GNYDK--LISAMDFYPTALDAADISIPKDLKLD 417 K++ + GG +P+ +W + G ++ ++S +D P+ A +P D++ D Sbjct: 318 GKTELFEGGIRSPLIVWSPALVAAEQRGKANETAVLSTLDLLPSLAKLAGAPLPADVQFD 377 Query: 418 GVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTED 477 G L + L W Sbjct: 378 GEECSATLLGRGNESRTAPLFWRRPPDRKTAGAKGRRVL--------------------- 416 Query: 478 LSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 +R + L+ + LY L +D + NLA +P K MQ + + S Sbjct: 417 ---PDLAMREGKWKLLCDYDGAGALLYDLESDRGETKNLAQQHPDRTKAMQAKLLAWHSS 473 Query: 537 SQPPL-SEVNQEKFNNIKKALS 557 E+ QE +++K Sbjct: 474 MPADRGPELGQETLESLRKTAK 495 >UniRef50_Q9VVM4 CG7402 n=10 Tax=Drosophila RepID=Q9VVM4_DROME Length = 579 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 117/582 (20%), Positives = 207/582 (35%), Gaps = 119/582 (20%) Query: 38 TKTNVAFSDFTPTEYST--KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 + S Y + KPNI+++ +DD+G + F + Sbjct: 6 ILVLLVVSSILSLAYGSGYSTKPNIVIILIDDMGMNDVSFHGSN---------------- 49 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT- 154 Q TP + +L G+ YV + + PSRA ++TG+ P G+ Sbjct: 50 ----------QILTPNIDALAYNGILLNKHYVPN-LCTPSRATLLTGKYPIHTGMQHFVI 98 Query: 155 --DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 D G+P E +PE+F++ GY T VGKWHL F Sbjct: 99 ITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLG----------------------FWR 136 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYNSP------------SLFKNRERVPA--KGYISDQL 258 ++ P RGFD+ G++ YY+ ++ E P Y ++ Sbjct: 137 KDLTPTMRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAF 196 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHL-----PNDNPAPDQYQKQFNTGSQTADNYYA 313 T EA ++++ +P + L++ A H P P ++ K + Y Sbjct: 197 TSEAKRIIEQH-DKSKPLFMVLSHLAVHTGNEDSPMQAP-EEEVAKFPHIRDPKRRTYAG 254 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLPLNGAQKGYKSQTYPG 370 + S+D+ V + + LK NG +N+IIL SDNGA N +G K + G Sbjct: 255 MISSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEG 314 Query: 371 GTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G + +W + G ++ I A+D+ PT AA +S+P+DL LDG++L P L + Sbjct: 315 GIRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNE 374 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT----- 484 + +P + + + + + D + T + + Sbjct: 375 EPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLGESYEQHV 434 Query: 485 ------------------VRNNDYSLVYTVEN----------------NQLGLYKL-TDL 509 +R T + L D Sbjct: 435 LASDVQSLLGNRGLTKDRIRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFDLAKDP 494 Query: 510 QQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 ++ NLA P ++++ + + ++ P + + N Sbjct: 495 CERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRAN 536 >UniRef50_B4D764 Steryl-sulfatase n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D764_9BACT Length = 499 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 121/566 (21%), Positives = 191/566 (33%), Gaps = 129/566 (22%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K + + KPN I++ +DD+GY + + Sbjct: 1 MKPLRFLFSTLCLLAGAALAADKPNFIIINIDDMGYADIAPFGSKLN------------- 47 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF-GVYSN 153 TP L + EG + T Y V PSR+A+MTG P R + S Sbjct: 48 -------------RTPNLDRMAQEGRKLTCFY-GAPVCSPSRSALMTGCYPKRVLPIPSV 93 Query: 154 --TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFS 211 A G+ E + EL + GY T +GKWHL Sbjct: 94 LFPGAAVGLNPAEHTVAELLKKSGYATGCIGKWHLGDQ---------------------- 131 Query: 212 AEEWQPQNRGFDYFMGFHAAGT-----------------------------------AYY 236 E+ P RGFDY++G + Sbjct: 132 -PEFLPPRRGFDYYLGLPYSNDMGPGEDGSKSSLGDPIPKPKATPNPSAPIPETGITGNQ 190 Query: 237 NSPSLFKNRE-----RVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDN 291 + +N + R + + D+ T A+ + K D+PF LYL +NA H P Sbjct: 191 PPLPMLENEKVIARVRQDEQQGLVDRYTKAAVKFITEHK--DKPFFLYLPHNAVHFPI-- 246 Query: 292 PAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVID 351 Y + G Y V VD V ++L L++ D+T +LFTSDNG Sbjct: 247 -----YPGKEWAGKSPNGYYSDWVEQVDWSVGQVLNTLRELKLQDHTFVLFTSDNGGT-- 299 Query: 352 GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISI 410 P +N +G+K+ T+ GG P WW GK+ G D++ D PT ++ A + Sbjct: 300 -PRAVNAPLRGFKTTTWEGGMREPTIAWWPGKIPGGTSSDEITGMFDILPTLVNLAGGEV 358 Query: 411 PKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYP 470 P D K+DG ++ P L + + + + + P+ + + Sbjct: 359 PTDHKIDGGNIWPVLAGEAGAKSPHEVFYYFNGLRLEGVRTGPWKLRFGSAGLAEGKGPV 418 Query: 471 HNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGV 529 P LY L TD+ + N+A A+P VV ++ + Sbjct: 419 KKPAAPI----------------------PDQLYNLQTDIGETTNVADAHPDVVAHLREL 456 Query: 530 VREFIDSSQPPLSEVNQEKFNNIKKA 555 D ++ Sbjct: 457 ADAMKDDLGRDGKGPGVRPLGRVENP 482 >UniRef50_B9XS23 Sulfatase n=1 Tax=bacterium Ellin514 RepID=B9XS23_9BACT Length = 635 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 118/550 (21%), Positives = 194/550 (35%), Gaps = 107/550 (19%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 ++ V FS +T KPNII + DD+GYG + + + Sbjct: 1 MRTCLWFFVVLFSMGMAHA-ATSQKPNIIFILADDMGYGDIGPFGSTLN----------- 48 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP L + EG++ T+ Y A + PSRA I+TG R + Sbjct: 49 ---------------RTPNLDRMAKEGMKLTSFYAA-PLCTPSRAQILTGCYAKRVSLPK 92 Query: 153 NTDAQD--GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 + G+ E + +L + GY T A+GKWH+ Sbjct: 93 VLSPRSEVGLNTNEQTVAKLLKRQGYATMAIGKWHVGD---------------------- 130 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP--------------SLFKNRERVP-----AK 251 A E P GFD+++G + P L ++ + + + Sbjct: 131 -APENLPTRHGFDHYLGLPYSNDMGGEEPGKDQPAKRGARPPLPLVRDEQVIEVVKPADQ 189 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNY 311 ++++ TDEA+ + QPF LYLA+ A H P G Y Sbjct: 190 DRLTERYTDEAVKFIRAN--DKQPFFLYLAHTAVHAPIHP-------GHNFRGKSRNGLY 240 Query: 312 YASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI--DGPLPLNGAQKGYKSQTYP 369 V VD V ++L+ L++ G +NT++LF+SDNG + G +G K T+ Sbjct: 241 GDWVEEVDWSVGKVLDTLRELGLSENTLVLFSSDNGPWLAQKTNGGTAGPLRGGKGGTFE 300 Query: 370 GGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 GG P WW GK+ + D + +D PT + A ++PKD K+DG + L + Sbjct: 301 GGMREPTLAWWPGKVPAQSVCDTVAGNIDLLPTFVKLAGGTLPKDKKIDGRDISNLLLGQ 360 Query: 429 KQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNN 488 + + + + + Q + + R Sbjct: 361 TKEAQREAHYYFAG-----TALQAVRSGPWKLAIVPQYEGMGKFSENAVEGGKPFAPR-- 413 Query: 489 DYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS-QPPLSEVNQ 546 LY L D+ +K ++ A +P +K + G V + Sbjct: 414 --------------LYNLDEDIGEKTDVVAEHPDEMKRLLGYVEAMEADLGVSKKNGPGV 459 Query: 547 EKFNNIKKAL 556 + K L Sbjct: 460 RPPGRVAKPL 469 >UniRef50_A3HWU7 N-acetylgalactosamine 6-sulfatase (GALNS) n=2 Tax=Bacteria RepID=A3HWU7_9SPHI Length = 472 Score = 413 bits (1063), Expect = e-114, Method: Composition-based stats. Identities = 128/520 (24%), Positives = 198/520 (38%), Gaps = 109/520 (20%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 N++ + S K N++++ DDLGYG L F + Sbjct: 18 NLSAQSKPSPQLSPKKHYNLVLIVADDLGYGDLGFTGST--------------------- 56 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA---- 156 Q TP L L GV FT GYV+ V PSRA +TG FG +N Sbjct: 57 -----QIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQVEFGHDNNLAGVEPG 111 Query: 157 ----QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 +G+PL++ + + GY +GKWHL Sbjct: 112 FDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLG-----------------------KE 148 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYY--------NSPSLFKNRERVPAKGYISDQLTDEAIG 264 ++ P RGFD F G+ G Y+ L N + YI+D + +E++ Sbjct: 149 PQFHPLKRGFDEFWGYTGGGHDYFESLPNGKGYKEPLESNFKTPDPITYITDDVGNESVD 208 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKR 324 ++R K D+PF L+ A+NAPH P D Q + + Y A V+ +D V + Sbjct: 209 FIERHK--DEPFFLFAAFNAPHTPMQALEEDLALYQ-HIEDKKRRTYAAMVHRLDLNVGK 265 Query: 325 ILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKL 384 I+ L++ G +NT+++F SDNG D LN +G K GG H P M G L Sbjct: 266 IMTSLEEQGLSENTLVVFFSDNGGPTDSNASLNAPYRGQKGILLEGGIHVPFVMNLPGLL 325 Query: 385 QPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSY 443 G Y + ++++D PT L A + GV L+P L K + +TW + Sbjct: 326 PEGLIYQEQVTSLDVVPTFLALAGDTETSMDMFSGVDLIPHLTGKTPPLADREMTWKFTI 385 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGL 503 S +R D+ LV +V + L Sbjct: 386 --------------------------------------SRAIREGDWKLV-SVPDRMPML 406 Query: 504 YKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 Y L D ++++LA + + + + + P+ Sbjct: 407 YNLAEDPSEQNDLALKHMDKTTYLLKKLGTWDVNLPHPVF 446 >UniRef50_A6DMX6 Arylsulphatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMX6_9BACT Length = 484 Score = 413 bits (1062), Expect = e-113, Method: Composition-based stats. Identities = 107/550 (19%), Positives = 193/550 (35%), Gaps = 125/550 (22%) Query: 51 EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTP 110 KPNI+++ +DDLG L F+ P Sbjct: 14 AVQAADKPNIVLIMVDDLGGRDLAVYGNKFNES--------------------------P 47 Query: 111 TLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQD------------ 158 + L +G+ F Y A V +RA+I +G+ PAR G++ Sbjct: 48 NIDKLATQGMVFDQAYAA-PVCSATRASIQSGQTPARVGIFDFIPGHWRPYEKVTVPHHK 106 Query: 159 --GIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 +P L + ++ GY T GKWHL+ + Sbjct: 107 IQHLPENIFTLGDAMKSAGYKTGYFGKWHLNDRTAKGKEARH-----------------T 149 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 P RG+D ++ G +F+ ++ +S LTD + + K DQPF Sbjct: 150 PDERGYDKSYMYNGGG----FYRPVFQPAYKLDKPKRLSQVLTDMGVDFIKENK--DQPF 203 Query: 277 MLYLAYNAPHLPNDNPAP--DQYQ-KQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNG 333 L++++ H+ D D+Y K+ + Y A + D V ++++ + G Sbjct: 204 FLFVSHYDVHVQLDADKDLIDKYLNKKRDPNYPGNAVYAAMIEHTDDSVGQLMKAIDDQG 263 Query: 334 QYDNTIILFTSDNGA-----------------------VIDGPLPLNGAQKGYKSQTYPG 370 DNT+ +F SDNG + N + K Y G Sbjct: 264 LADNTLFIFYSDNGGVDNRYDDIPLLGGRSVNVYPEGHPLRYVATSNAPLRSGKGTVYEG 323 Query: 371 GTHTPMFMWWKGKLQPGNYDKLI-SAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P+ + W GK+ PG + + S+ DFYP+ L+ PK+ LDGVS++P L K Sbjct: 324 GIRVPLIVRWPGKVSPGTRSEAVFSSSDFYPSFLEVTKTQAPKNQVLDGVSMVPALT-KN 382 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 +P + + H + +R D Sbjct: 383 SFDPEREVFTHYPVYHHDE--------------------------------PMSALRKGD 410 Query: 490 YSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEK 548 + ++ + + + LY + D+ + + + P EM+ + ++ ++ + N + Sbjct: 411 WKIIENLVSKEFYLYNIKYDVNEMVDHKVSLPAKFAEMKAALVKWQKETKAQMPVPNPKF 470 Query: 549 FNNIKKALSE 558 + + + Sbjct: 471 DPSKRYQWGK 480 >UniRef50_A6CEL4 Arylsulfatase A n=4 Tax=Bacteria RepID=A6CEL4_9PLAN Length = 527 Score = 413 bits (1061), Expect = e-113, Method: Composition-based stats. Identities = 125/552 (22%), Positives = 206/552 (37%), Gaps = 97/552 (17%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 A FS T PNII + DD+GYG + Sbjct: 3 AAFLPLFLFSQNTAHASEKANDPNIIYILADDMGYGDI---------------------- 40 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--- 152 + +TP L L G+ FT+ + + V P+R ++TGR R + S Sbjct: 41 ---RALNPECKIATPHLDQLAHGGMIFTDAHSSSSVCTPTRYGVLTGRYNWRSRLKSGVL 97 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 ++ I +P + + HGYYTA VGKWHL ++ + Y+ Sbjct: 98 WGLSRRLIEPDRETVPSMLKEHGYYTACVGKWHLGMDWSLKQGGFATEQSYNKKTNPGWD 157 Query: 213 EEW------QPQNRGFDYFMGFHAAGTAYYNSPSLFKNRE----RVPAKGYISD------ 256 ++ P + GFDYF G A+ +N K + D Sbjct: 158 VDYSKPIQNGPNSVGFDYFFGISASLDM--PPYVYIENDRSQGIPTVTKAFFRDGPAHKD 215 Query: 257 --------QLTDEAIGVVDRAK---TLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS 305 ++TD+ + ++D +PF +Y NAPH P P P+ G Sbjct: 216 FEAIDVLPRITDKTVQIIDEHAAASKEGKPFFIYFPLNAPHTPI-LPTPEW------QGK 268 Query: 306 QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--------VIDGPLPLN 357 + Y V VD V ++++ LKK G ++NT+++FT+DNG + D + Sbjct: 269 SGINAYCDFVMQVDDTVGQVMQALKKQGIHENTLVIFTADNGCSPAANFKEMTDKDHQPS 328 Query: 358 GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKL 416 +G+K+ Y GG P W +++ G D+L D + TA D +P D Sbjct: 329 YQFRGHKADIYEGGHRVPFIANWPARIKAGTHSDQLTCLTDLFATAADIVGAKVPDDAGE 388 Query: 417 DGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTE 476 D VS+LP ++ + + H D++ + S + + Sbjct: 389 DSVSILPAMEGTAHTPLREA-----AVHHSIRGAFSIRKDHWKLELCPGSGGWSFPKPGK 443 Query: 477 DLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFID 535 D + LY L D ++ N+ A +P+VVKE+ +++ + D Sbjct: 444 DNLSELPAI----------------QLYDLNHDAGEQKNVQAEHPEVVKELTTLLQSYAD 487 Query: 536 --SSQPPLSEVN 545 S P + N Sbjct: 488 RGRSTPGKPQPN 499 >UniRef50_A9BNY8 Sulfatase n=11 Tax=cellular organisms RepID=A9BNY8_DELAS Length = 457 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. Identities = 123/529 (23%), Positives = 189/529 (35%), Gaps = 112/529 (21%) Query: 39 KTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGI 98 + A TE +PNI+ + DDLGY L G + Sbjct: 1 MSAAASRPQPCTERICMSRPNILFIVADDLGYADLGCYGGRAADFGAVS----------- 49 Query: 99 DKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-------- 150 P L L G+R T GY V P+R A+ T R R Sbjct: 50 -----------PVLDRLAAGGLRLTQGYANSPVCSPTRFALATARYQYRLRGAAEEPINS 98 Query: 151 ---YSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNF 207 + + G+P + + ++ GY TA +GKWHL Sbjct: 99 KTRGTPLGEKLGLPPDMPTVASMLRDAGYRTALIGKWHLG-------------------- 138 Query: 208 TTFSAEEWQPQNRGFDYFMGFHAAGTAYYNS------PSLFKNRERVPAKGYISDQLTDE 261 + P G++ + G + G Y+ L+ E +GY++D L+ Sbjct: 139 ---YPPHFGPLRSGYEEYFGPMSGGVDYFTHLSSSGQHDLWVGEEEHHDEGYLTDLLSQR 195 Query: 262 AIGVVDRAKTLDQPFMLYLAYNAPHLPNDNP-----APDQYQKQFNTGSQTADNYYASVY 316 ++ V R D PF L L Y APH P + A + Y ++ Sbjct: 196 SVDFVHRMAQGDAPFFLSLHYTAPHWPWETRDDRSTAEALGAGIAHLDGGNIHQYRRMIH 255 Query: 317 SVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPM 376 +D+G+ I+E L+ NGQ DNT+I+FTSDNG N G K GG P Sbjct: 256 HMDEGIGWIVEALRANGQLDNTLIVFTSDNGGER---FSDNWPLVGGKMDLTEGGIRVPW 312 Query: 377 FMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHK 435 W + PG + +MD+ T LDAA + P+ LDG+SLLP L+ + P + Sbjct: 313 IAHWPAVIAPGRSSPQHCMSMDWSATVLDAAGVQAPEGHALDGISLLPVLRAEDAEFP-R 371 Query: 436 NLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYT 495 L W + +R+ D+ + Sbjct: 372 TLHWRM------------------------------------KHRGQRALRDGDWKYLRV 395 Query: 496 VENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSE 543 + L+ L D +++ N AA P+ + M+ ++ PP+ E Sbjct: 396 --DGIDYLFDLAADERERANQAARAPERLAAMRSAWEDWNQGM-PPIPE 441 >UniRef50_D2R921 Sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R921_9PLAN Length = 676 Score = 411 bits (1058), Expect = e-113, Method: Composition-based stats. Identities = 123/550 (22%), Positives = 207/550 (37%), Gaps = 100/550 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + + + S + KPN++ + DD+G+G L G Sbjct: 28 LLIPLFVLSTLLSAKGTVWATEPAKPNVVYILADDVGWGDLSVHGG-------------- 73 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP + L +G+ ++ + V P+RA +TGR P R G + Sbjct: 74 -------------GVPTPNIDKLFAQGIEVSHF-MGWCVCSPTRAMFLTGRHPIRVG--T 117 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 + + L ET + E F+ +GY T GKWH + P + A Sbjct: 118 GPEVGGELSLDETTIAEGFKANGYRTGVFGKWHSGSDPDTPAFRAAFAEAFKAIPNKQFA 177 Query: 213 EEWQPQNRGFDYFMGFHAAGTAYYN--------SPSLFKNRERVP-AKGYISDQLTDEAI 263 GFD ++ G ++N S + NRE P +GY D +T AI Sbjct: 178 GGHGANAHGFDEAWVYYGGGADFFNRRTVQGRGPVSWWHNREFRPDDEGYTDDLVTQRAI 237 Query: 264 GVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQ-----------KQFNTGSQTADNYY 312 + K DQPF Y+ ++ H P D T + + Sbjct: 238 EFIRENK--DQPFFCYVPFHIAHAPLQAKENDLAAIDSKTAAKLPTASGKTSDEGKHIHA 295 Query: 313 ASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGT 372 A ++S+D + I ++L+K G DNTI +FTSDNGA+ + +G+K Y GG Sbjct: 296 AMLHSMDNNIAAIRDELEKLGLSDNTIFVFTSDNGAM---EAGSSLPLRGHKHTIYEGGV 352 Query: 373 HTPMFMWWK--GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ 430 P ++W G ++ L A+D +PT + D ++PK LDG ++ P L+D Q Sbjct: 353 RLPTAIYWPKGGLTGGRKWNGLCGALDMFPTLMAMTDSTMPKTQPLDGKNVWPALRD-NQ 411 Query: 431 GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDY 490 P ++ +I +R + + Sbjct: 412 PSPVESYYFIWHDED--------------------------------------AIRTDRW 433 Query: 491 SLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS-EVNQEK 548 L + LY +T D + +N+A ++P VVK + + + +S +S + +K Sbjct: 434 KLHRFH--GRYELYDITIDETESNNIADSHPDVVKSLSAKMDAWAESLGAAISHQPAPKK 491 Query: 549 FNNIKKALSE 558 ++ E Sbjct: 492 YHVPAAPDGE 501 >UniRef50_A3J5W3 Putative arylsulfatase n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J5W3_9FLAO Length = 468 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 130/533 (24%), Positives = 198/533 (37%), Gaps = 123/533 (23%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + + +A E KPNI+ + DD+GY +L G Sbjct: 1 MNTKNILTFAILIATFGIQAQETKNTKKPNIVFILADDMGYNELGSYGG----------- 49 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 TP + L EG++F+N Y + PSR +MTG+ Sbjct: 50 ---------------KIIETPNIDQLAKEGMKFSNHYCGSNICAPSRGTLMTGKHTGHAY 94 Query: 150 VYSN----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 + N + + IP +E + E+ + GY T A GKW L Sbjct: 95 IRDNKPLPYEGNEPIPASEITVAEILKTAGYTTGAFGKWGLG------------------ 136 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKG--------YISDQ 257 + A E P N+GFD F G++ A+ S + + V Y +D Sbjct: 137 ----YPASEGSPNNQGFDQFYGYNGQIHAHNYFTSYLRKNDLVELNANIDAPYSVYSADI 192 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP---DQYQKQ-------FNTGSQT 307 + D A+ V+ K + PF LY PH P P + Y K+ ++ + Sbjct: 193 IKDRALEFVEVNK--NNPFFLYFCPTLPHNPYHQPDDKTLEYYAKKTGFPIGDAHSEEFS 250 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI----DGPLPLNGAQKGY 363 Y A +DQ V I+ +LK+ DNT+I+F SDNG+ + D L G +G Sbjct: 251 VPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSALTKEEDSYLRTGGDLRGR 310 Query: 364 KSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLL 422 KS+ Y GG +P+ +WKGK+ PG+ IS DF PT + P + +DG+S L Sbjct: 311 KSEVYEGGIKSPLIAFWKGKIIPGSSSNHISAFWDFLPTCAEIVKAKTPDN--IDGISYL 368 Query: 423 PWLQDKKQ-GEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQF 481 P L K + H L W S Sbjct: 369 PTLLGKTDNQKQHDYLYWERS--------------------------------------Q 390 Query: 482 SYTVRNNDYSLVYTVE----NNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGV 529 S +R D + + + +Y L D +K+NLA P++ E + Sbjct: 391 SQAIRKGDMKANFVYDKTSQKQNIEIYNLAQDPFEKNNLAETMPELKAEFIKI 443 >UniRef50_UPI0001745D5D N-acetylgalactosamine 6-sulfate sulfatase (GALNS) n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745D5D Length = 562 Score = 411 bits (1056), Expect = e-113, Method: Composition-based stats. Identities = 116/502 (23%), Positives = 181/502 (36%), Gaps = 104/502 (20%) Query: 60 IIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEG 119 II + DDL G L TP L + EG Sbjct: 120 IIYILSDDLAQGDLGCYG--------------------------QKLIKTPNLDRMAAEG 153 Query: 120 VRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN----TDAQDGIPLTETFLPELFQNHG 175 RFT Y V PSR+++MTG + +N + Q +P + ++ + G Sbjct: 154 TRFTQAYCGTSVCAPSRSSLMTGLHMGHCPIRANREIKPEGQMPLPADTLTVAQVLKGAG 213 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY 235 Y TA VGKW + P +GFD+F G + A+ Sbjct: 214 YATACVGKWGMGMFDTTGS----------------------PLKKGFDHFYGHNCQRKAH 251 Query: 236 -YNSPSLFKNRERV--PAKGYISDQLTDEAIGVVDRA--KTLDQPFMLYLAYNAPHLPND 290 Y P ++ + ++V K Y+ D +E++ V K DQPF L+ A PH Sbjct: 252 NYFPPYIWNDDQQVALDGKTYVQDLFANESLKWVREQKRKAPDQPFFLFYAITLPHGDYQ 311 Query: 291 NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI 350 Y Q + T Y A V +D V R+L+ LK+ +NT+++ + DNG+ Sbjct: 312 TDNLGIYADQ-KDWTPTQKAYAAMVTRLDSDVGRLLDLLKELKIDENTLVMTSGDNGSSF 370 Query: 351 D--------GPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPT 401 + G +G+K Y GG W G + G D+ + DF PT Sbjct: 371 PPDSELGRLFDQAMGGKLRGFKRGMYEGGLRQASIARWPGAIPAGRVSDEPWAFWDFLPT 430 Query: 402 ALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKF 461 A D A +P K DG+SL+ +L+ + W + Sbjct: 431 AADLAGAKLPSGYKPDGLSLVSFLKGG-PAPRREYFYWELHENASLQALRF--------- 480 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANP 520 + ++ V N + LY L TD + N+AAA+P Sbjct: 481 -------------------------DQNWKAVRNGPNQPVELYDLATDESEAHNVAAAHP 515 Query: 521 QVVKEMQGVVR-EFIDSSQPPL 541 V +++ +DS+ P+ Sbjct: 516 DRVTRALELMKSARVDSADFPM 537 >UniRef50_A4GIB1 Arylsulfatase n=2 Tax=Bacteria RepID=A4GIB1_9BACT Length = 608 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 111/526 (21%), Positives = 196/526 (37%), Gaps = 101/526 (19%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K T T S KPN++++ DD GY ++ Sbjct: 1 MKITSTLFLLSIGLTVFG----KPNVLIIMTDDQGYPEVSAHG----------------- 39 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 TP L L + +R ++ +VA + P+R ++TG AR G + + Sbjct: 40 ---------NPVLQTPNLDRLHGQSLRLSDYHVA-PMCTPTRGQLLTGLDAARNGAVNVS 89 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEE 214 + + + + ++ GY T GKWHL Sbjct: 90 SGRALLRPEVSTIANYYEEAGYSTGVFGKWHLGANY-----------------------P 126 Query: 215 WQPQNRGFDYFMGFHAAGTA--------YYNSPSLFKNRERVPAKGYISDQLTDEAIGVV 266 ++PQ+RGF + + ++ Y N + +GY +D +EA+ + Sbjct: 127 FRPQDRGFQESVWYPSSSIPSVPAYWGNDYFDDVYIHNGKEKRFEGYCADVFFNEAMRFM 186 Query: 267 DRAKTLDQPFMLYLAYNAPHLPNDNPAPD-----------QYQKQFNTGSQTADNYYASV 315 + +PFM YLA N PH P D ++ N + Y + Sbjct: 187 SESAKSKKPFMCYLATNTPHGPFWPKEEDRKEIAEVLAQSKFDNLDNNLKKRLALYLGMI 246 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 ++D + +L+ LK+ ++TI++F +DNG+++ GP N +G K++ + GG P Sbjct: 247 RNIDWNMGNLLKFLKEENLAEDTILIFKTDNGSLL-GPQYFNAGMRGKKTEIWEGGHRVP 305 Query: 376 MFMWWK--GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEP 433 F+ W G + + L D PT LD I K+ K DG+SL L+ KK+ Sbjct: 306 CFIRWPNGGFGKARDIGGLTQVQDILPTVLDLCGIKPRKNTKFDGISLASVLRGKKKVSE 365 Query: 434 HKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLV 493 + + S +YP + + V + L+ Sbjct: 366 DRTIIINYS-------------------RMPGFSNYPSPHSQTQMRADQAAVLWKRWRLL 406 Query: 494 YTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 LY L +D Q+ N+ +P+VV +M+ + + D + Sbjct: 407 EDR-----ELYDLASDPLQQKNVIDQHPEVVAKMRQQLYSWWDGVK 447 >UniRef50_D1QVA8 N-acetylgalactosamine-6-sulfatase n=1 Tax=Prevotella oris F0302 RepID=D1QVA8_9BACT Length = 521 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 122/573 (21%), Positives = 203/573 (35%), Gaps = 120/573 (20%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 M F + + +A + + +PNII+ +DD+G+ Sbjct: 1 MKTFQPIPMGHFSVALSAMFLAVASSARAQDRVDNRPNIILFMVDDMGWQDTS------- 53 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 + + + TP + L G+ F+ Y +S PSR ++MT Sbjct: 54 ------------LPFWTQRTMYNDRYETPNMERLAARGMMFSQAYA-CPISSPSRCSLMT 100 Query: 142 GRAPARFGVYSNTDAQDG---IPLTETFLPE-----------------------LFQNHG 175 G AR V + T ++ + + LPE L Q G Sbjct: 101 GSNAARHRVTNWTLEKNKSTDLKDDQLTLPEWNYNGISGVEGCRNTYRATSFVNLLQASG 160 Query: 176 YYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFM-GFHAAGTA 234 Y+T GK H P + GF+ + G G A Sbjct: 161 YHTIHCGKAHWGARDTPGE---------------------DPHHWGFEVNIAGHAGGGPA 199 Query: 235 YYNSPSLFKN---------------RERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 Y S + N + ++++ LT EA+ +D+AK +QPF LY Sbjct: 200 TYLSERHYGNTDNPAKQHKMAIPGLEKYWDTGTFLTEALTREALKSLDKAKLYNQPFYLY 259 Query: 280 LAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTI 339 +++ A H+P D + S+ Y + V +D+ + IL+ L KN + TI Sbjct: 260 MSHYAVHIPIDRDPRYYDKYLKKGLSEKEAAYASLVEGMDKSLGDILDWLDKNDETRRTI 319 Query: 340 ILFTSDNGAVIDGP-------LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYD-K 391 ++F SDNG G N K K Y GG PM + W G ++PG+ + Sbjct: 320 VIFMSDNGGYATGSQWRDQPLFTQNSPLKSGKGSMYEGGIREPMIVSWSGTVKPGSVCRQ 379 Query: 392 LISAMDFYPTALDAADISIPK-DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEE 450 + D++PT L+ A I K K+DG S +P L+ + L W Sbjct: 380 YVMIEDYFPTLLEMAGIKHYKVPQKVDGKSFIPLLKGTGDPSRGRMLVWNYPNV------ 433 Query: 451 NIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDL 509 N + +R D+ L+Y + ++ LY + D+ Sbjct: 434 ---------------------WGNVGPGISLNCAIREGDWKLIYNYKTHEKELYDIPNDI 472 Query: 510 QQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 + NLAA P +VK++ + ++ Sbjct: 473 GEAHNLAAERPSIVKKLSKKLGNYLRKVAAQRP 505 >UniRef50_A6DF76 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DF76_9BACT Length = 542 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 124/559 (22%), Positives = 207/559 (37%), Gaps = 84/559 (15%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K T + + T + KPNI+ + DD+G G Sbjct: 1 MKHLFTIIYIAIVTLSL--AADKPNIVFILADDMGIGDTNCYGD---------------- 42 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNT 154 E + +TP + +L EGVRFT+ +V + GP+R A+MTGR P RFG N Sbjct: 43 --------EKCRINTPNIDALAAEGVRFTDFHVNSSICGPTRRALMTGRYPWRFGATVN- 93 Query: 155 DAQDGI----PLTET-FLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 + G P TE L ++ + GY T +GKWHL + + N Sbjct: 94 NGPWGFCGPRPNTEKYTLGKVLKKAGYNTGYIGKWHLGTTMVTKDGKKQG----LTNVDY 149 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKN--RERVPAKGYIS------------ 255 + P GFDY + Y + + + KG+ + Sbjct: 150 TKPLVYGPMQFGFDYSFILPGSLDMYPYAFIKDNDWQGDVSALKGWSAFNRVGAAEISFE 209 Query: 256 -----DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN 310 + E+ + + D PF L+LA +PH P + G Sbjct: 210 SNKVVETFYRESELFIKKQ-NSDTPFFLFLALTSPHTPVCP-------GEEWNGKSELGP 261 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----------------DGP 353 Y V VD + R+ + LK+ G Y+NT+I+F+SD+G Sbjct: 262 YGDFVMEVDHSIARVKQALKEKGLYENTLIIFSSDHGPAPYAGNILKATPNQISLLEQQG 321 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPK 412 G +GYK Y GG P W GK G ++LI D + T + +I + + Sbjct: 322 HYPAGIYRGYKFSIYEGGLRVPFIASWPGKTPKGQICNQLIGFNDLFATFAELTNIKLQE 381 Query: 413 DLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD-YPH 471 D D +S L K+L + S + + S++ + Sbjct: 382 DEAPDSISFARLLTKPSSNGDRKDLIMQSVTSFAIRDGEWKLCLCPGSGIPANSENGKGN 441 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVV 530 +P + + ++ + L+ L D ++K+NLA+ NP+ V++M + Sbjct: 442 DPAPNAAWKKALEEFKGKPHQTDLLKAPFVQLFNLAKDPEEKNNLASKNPRQVEKMINLF 501 Query: 531 REFIDSSQ-PPLSEVNQEK 548 ++ I + P ++ +K Sbjct: 502 KKQIADGRSTPGPKLKNDK 520 >UniRef50_D2A5L7 Putative uncharacterized protein GLEAN_15152 n=2 Tax=Tribolium castaneum RepID=D2A5L7_TRICA Length = 563 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 115/576 (19%), Positives = 193/576 (33%), Gaps = 108/576 (18%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + + + T ++ ST KPNII++ DDLGY + F Sbjct: 14 TVMVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGS-------------- 59 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 +Q TP L + G+ Y PSR A++TG+ P R G+ Sbjct: 60 ------------SQIPTPNLAKMATRGIILDRFY-TQSTCTPSRTALLTGQYPIRSGMQG 106 Query: 153 NT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 +PL +P FQN GY T VGKWHL Sbjct: 107 YPLKAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAY------------------- 147 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSL--FKNRERVP--------------AKGY 253 +E P +GFD G+ Y++ S N V Y Sbjct: 148 ---KEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRY 204 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL-----PNDNPAPDQYQKQFNTGSQTA 308 ++ T+ ++ V++ + P L +++ A H P DQ +F+ Sbjct: 205 ATELFTERSLDVIEGH-DVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPR 263 Query: 309 DN-YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLPLNGAQKGYK 364 Y V +D + RI+ +L + DN+I+LF SDNGA G N +G K Sbjct: 264 RRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSGSNWPLRGVK 323 Query: 365 SQTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 + GG ++ + G + LI D+ PT AA + ++DG+ Sbjct: 324 FSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAGGDVAHLGQIDGIDQWD 383 Query: 424 WLQDKKQGEPHKNLT------------------------WITSYSHWFDEENIPFWDNYH 459 L + + L + + F+ ++ + + Sbjct: 384 ALTNNNPSNRTEILINIDEVDENFAIIRDKFKLIQGTPNYYYQQTLLFNCKSGSYHEGTF 443 Query: 460 KFVRHQSDDYPHNPNTEDLSQFS--YTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLA 516 S P NP + R D + + L+ L D + N+ Sbjct: 444 DQYYGDSGRGPENPTPNPNHTTTDLSWCRAPDQTPILNCTKG--CLFDLDKDPCETTNII 501 Query: 517 AANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNNI 552 + P++ ++ + +F P ++ K + I Sbjct: 502 ESEPEIANQLYEKIAQFWKELVPQRNKDTDPKSDPI 537 >UniRef50_A6DMW1 N-acetyl-galactosamine-6-sulfatase (GALNS) n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DMW1_9BACT Length = 585 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 126/548 (22%), Positives = 203/548 (37%), Gaps = 131/548 (23%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P+ KPN+IV+ +DD+G F Sbjct: 2 PSALIAAKKPNVIVILIDDMGLMDSSTYGSKF--------------------------YQ 35 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY----------------- 151 T + L EG+ FT+ Y A + P+RA+IM+G+ P+R + Sbjct: 36 TANMSRLAKEGMLFTDAYAASPLCSPTRASIMSGQYPSRLHMTVAVTPKSKEKPKALAPA 95 Query: 152 -----SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDN 206 ++++ +PL L E Q+ GY TA +GKWHL Sbjct: 96 PNQYCGKVESKNHMPLAVYTLAEALQDSGYTTAHIGKWHL-------------------- 135 Query: 207 FTTFSAEEWQPQNRGFDYFMGFHA-AGTAYYNSPSLFKNRE--------RVPAKGYISDQ 257 + +N+GFD+ +G G Y SP K ++ P Y++++ Sbjct: 136 ---TENPKHNAENQGFDFVIGGAGLPGPPDYYSPYKRKGKKAKGINNLSPGPKGEYLNER 192 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA---PDQYQKQFNTGSQTADNYYAS 314 L E+I + + ++PF L L + A H P P +++ Q Sbjct: 193 LAKESIKWIKSVQDSNKPFYLNLWHYAVHGPVIEKKDLMPKYLERRDPNNPQRCPEMGTM 252 Query: 315 VYSVDQGVKRILEQLKK---NGQYDNTIILFTSDNGA-----VIDGPLPLNGAQKGYKSQ 366 + S+D V +L+ L K DNT+I+ TSDNG N +G K+ Sbjct: 253 IDSMDNSVGMLLDWLDKPENKAVKDNTLIILTSDNGGVIHKETNGNTWTSNRPLRGGKAN 312 Query: 367 TYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 TY GGT P + W ++ G+ + ++D YPT L+A +I K L DG S+LP L Sbjct: 313 TYEGGTRVPWIVRWPDTIKAGSVCTTPVQSIDIYPTVLEAVNIKAKKGLTFDGQSILPLL 372 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + +K H+ + + + S +V Sbjct: 373 EQRKME--HQPIFTDFQHLFGV------------------------------MCAPSSSV 400 Query: 486 RNNDYSLVYTVENNQ------LGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 R D L+ L+ L DL + NLAA P+ VKE+ ++ I + Sbjct: 401 RVGDMKLIRFYHAGPKAQSHAYELFDLKRDLYESINLAAYMPEKVKELDRLIEAHIKETA 460 Query: 539 PPLSEVNQ 546 + N+ Sbjct: 461 ALVPIANK 468 >UniRef50_A6DFN4 Arylsulfatase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DFN4_9BACT Length = 481 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 120/539 (22%), Positives = 192/539 (35%), Gaps = 133/539 (24%) Query: 58 PNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMD 117 PN+I + DDLGYG+L + TP + +L Sbjct: 20 PNVIYILADDLGYGELGCYG--------------------------QEKIKTPHIDALAK 53 Query: 118 EGVRFTNGYVAHGVSGPSRAAIMTGR-----APARFGVYSNTDAQDGIPLTETFLPELFQ 172 EG+RFT Y V PSR +++G+ R + Q+ IP L ++F+ Sbjct: 54 EGMRFTRHYSGAPVCAPSRGVLLSGQQLSKAY-IRNNREHKPEGQEPIPEPGMTLAQIFK 112 Query: 173 NHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG 232 + GY T A GKW L + P+ GFD F G++ Sbjct: 113 DKGYATGAFGKWGLGYPGSSS----------------------DPKALGFDTFYGYNCQR 150 Query: 233 TAY-YNSPSLFKNRERV------------------------PAKGYISDQLTDEAIGVVD 267 A+ + P ++ N + + A+ Y D + DEA+ + Sbjct: 151 VAHSFYPPHMWSNDKNITINEKPVPGHWRKAVGPDFDFSQFYAENYAPDLILDEALKFIK 210 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNP-------------APDQYQKQFNTGSQTADNYYAS 314 K D+PF YL + PHL P + Y+ + + Y A Sbjct: 211 DNK--DKPFFAYLPFVEPHLAMHPPHSWVDSYPKEWDSPKESYKAAYLPHLRPRAGYAAM 268 Query: 315 VYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI-----DGPLPLNGAQKGYKSQTYP 369 + +D+ V +++ LK+ +NT+++FTSDNGA +G K Y Sbjct: 269 ISDLDEHVGSVMQLLKELDLVENTLVIFTSDNGASHCIEVDHEFFNSTKDLRGLKGSVYE 328 Query: 370 GGTHTPMFMWWKGKLQPGNYDKLIS-AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDK 428 GG PM W GK++ +S +D T D P+ DGVS LP L+ + Sbjct: 329 GGLRVPMIAHWPGKIKKAQVSDHVSGFVDVMATFCDLLQTEAPQTS--DGVSFLPTLKGE 386 Query: 429 KQGEPHKNLTWIT-SYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRN 487 KQ EP L W YS W + + + + Sbjct: 387 KQ-EPQPVLAWEFQGYSGQQAIILDGRWKGVRQNLSPRGKKKAKS--------------- 430 Query: 488 NDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVRE---FIDSSQPPLS 542 + LY L D +K +LA P++V + + + ++ P++ Sbjct: 431 ----------TPKWELYDLNKDPNEKTDLATQMPEIVDRIHKAMMKNRSHSETFNMPMA 479 >UniRef50_D0PR10 N-acetylgalactosamine-6-sulfate sulfatase n=1 Tax=Flammeovirga yaeyamensis RepID=D0PR10_9SPHI Length = 607 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 117/534 (21%), Positives = 187/534 (35%), Gaps = 104/534 (19%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 V FS S KPN+I++ DD+GYG + Sbjct: 9 STIVLLVFFSFLYIKSCSDIDKPNVIIILTDDMGYGDIAAHG------------------ 50 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTD 155 STP + L DE +R TN +V PSR+A+MTG+ R GV+ Sbjct: 51 --------NKDISTPHIDQLHDESLRLTNFHVN-PTCAPSRSALMTGKDANRVGVWHTVM 101 Query: 156 AQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEW 215 + + E + ++F + Y T GKWHL + Sbjct: 102 GRSLLYEEEETMADIFSANNYATGLFGKWHLGDNY-----------------------PF 138 Query: 216 QPQNRGFDYFMGFHAAGT----AYYNSPS----LFKNRERVPAKGYISDQLTDEAIGVVD 267 PQ RGF + G Y+N+ +N + +GY +D EA+ + Sbjct: 139 APQYRGFQEVLTHGGGGVGQTPDYWNNDYFDDVYLRNGQEEKFEGYCTDVWFREALTFIK 198 Query: 268 RAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILE 327 K + PF+ Y++ NAPH P + P+ + +Y + ++D + + + Sbjct: 199 ENK--ENPFLCYISTNAPHTPLNVPSSYAEPYLKKGIQEDRAKFYGMISNIDDNIGLLRK 256 Query: 328 QLKKNGQYDNTIILFTSDNGAVIDGP-------LPLNGAQKGYKSQTYPGGTHTPMFMWW 380 +L++ G DNTI++F SDNG N +G K Y GG P +++W Sbjct: 257 KLEEWGIADNTILIFMSDNGTANGATLKGKQLLSGYNANMRGVKGSPYDGGHRVPFYVYW 316 Query: 381 K-GKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLT 438 K G L G + ++L + +D PT + +S + DG+ L + Sbjct: 317 KNGNLNHGMDINQLTAHIDVLPTLIKMCGLSNVPTINFDGIDLSQIFLGSDENLD----V 372 Query: 439 WITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVEN 498 E W N + V + L+ Sbjct: 373 NRILIGDSQRLETPKKWRNSY-------------------------VMMGQWRLI----- 402 Query: 499 NQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 N LY L D Q N+ Q+VK+++ + P K N Sbjct: 403 NGTELYNLKRDPSQVKNVFDLEHQIVKQLKEAYEKHWAEISPSFHRFAYIKLGN 456 >UniRef50_D2R206 Steryl-sulfatase n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R206_9PLAN Length = 504 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 114/560 (20%), Positives = 189/560 (33%), Gaps = 114/560 (20%) Query: 29 AADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENR 88 F + + +PN+I++ +DDLGY + Sbjct: 4 PILVRLAIGAIVASFLGSFVSEILAEESRPNVIIINIDDLGYADIGPFGS---------- 53 Query: 89 EVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF 148 + TP L + EG++ T+ Y A V PSRAA++TG P R Sbjct: 54 ----------------KKNPTPALTKMAAEGMKLTSHYAA-PVCSPSRAALLTGCYPKRV 96 Query: 149 -GVYSN--TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 + A G+ E + ++ + GY TA +GKWH+ + +P + Y+ Sbjct: 97 LSIPHVLFPSAGSGLHPDEVTIADMLKASGYKTACLGKWHVGDQAE-FLPTKQGFDSYYG 155 Query: 206 NFT-------------TFSAEEWQPQNRG---------FDYFMGFHAAGTAYYNSP-SLF 242 F A P +G + +G T P L Sbjct: 156 IPYSNDMGTATDGSKSNFGAPLPMPGAKGKGKQPAQATGELPLGSPTGLTGNMQPPLPLL 215 Query: 243 KNRERV-----PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQY 297 +N + V + ++ T A+ + K DQPF LY A+ A H P Y Sbjct: 216 ENDKVVARVRGEDQVNLTRDYTKRAVNFIRENK--DQPFFLYFAHTAVHFPM-------Y 266 Query: 298 QKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLN 357 + + V VD V +L L + + T+++FTSDNG + N Sbjct: 267 PSKEFR-TSDRGTLDDWVDEVDASVGEVLAALAEMKIDEKTLVIFTSDNGGSLP-HGSDN 324 Query: 358 GAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKL 416 KG K T+ GG P W G ++ G + +D PT A +P+ KL Sbjct: 325 TPLKGSKGLTWEGGIRVPTIARWPGTIKGGTSTSAITGMIDLLPTIAAATGAKLPE-RKL 383 Query: 417 DGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTE 476 DG++ LP L + P + + Sbjct: 384 DGLNQLPLLNGTAKESPRREFFYFRGLELD------------------------------ 413 Query: 477 DLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFID 535 VR +++ L + LY L +D+ + N+AA +P++VK + + + Sbjct: 414 -------AVRRDNWKLHLA----KGELYDLESDIGESKNVAADHPEIVKSLTELAATADN 462 Query: 536 SSQPPLSEVNQEKFNNIKKA 555 ++ Sbjct: 463 DLGQKGIGPGVRPLGTVEAP 482 >UniRef50_A3ZY29 Aryl-sulphate sulphohydrolase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZY29_9PLAN Length = 498 Score = 410 bits (1054), Expect = e-113, Method: Composition-based stats. Identities = 124/567 (21%), Positives = 194/567 (34%), Gaps = 107/567 (18%) Query: 25 FAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKT 84 F + + A S + + PNI+++ DD G+ + + F Sbjct: 6 FMSRCFRYSTMFAWLAICIVSLNLSLAVAAQQPPNIVLIFADDQGWRDIGYQGRGF---- 61 Query: 85 MENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRA 144 TP L L EG+ FT+GY + G PSRA +++G Sbjct: 62 ----------------------IETPNLDRLAGEGMVFTSGYASAGNCAPSRACLISGNY 99 Query: 145 PARFGVYSNT---------------DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKI 189 R VY+ + G+ + E Q GY T GKWHL+ Sbjct: 100 TPRHDVYAVGSTDRGKQREMRLVPAPNKSGLAKENVTMAEALQAAGYVTGHFGKWHLAGP 159 Query: 190 SNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVP 249 P +GFD G S N++ P Sbjct: 160 EGA-----------------------LPSEQGFDVTFDSFGEGELREGS---EGNKKGPP 193 Query: 250 AKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTGSQT 307 LT +A ++ + D+PF YLA++A H P A ++++ + Sbjct: 194 DDPKGVFTLTRKACEFIEANQ--DRPFFCYLAHHAIHGPLQGRAETLEKFKAKTRRKLDP 251 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQT 367 Y A Y +D V +L +L + D T++ FTSDNGA +G K Sbjct: 252 GAMYAACTYDLDASVGMLLAKLDELKLADKTLVAFTSDNGAT---QAASQEPLRGSKGGY 308 Query: 368 YPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQ 426 Y GG P+ + W G QP D + +DFYPT L AA +P LDG SLLP L Sbjct: 309 YEGGIREPLIIRWPGVTQPSSTSDVPVINVDFYPTFLAAAGAPVPAGKILDGESLLPLLS 368 Query: 427 DKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVR 486 + W R + T S+ +R Sbjct: 369 G-AGPLKRTGIFWHFPGY----------------LDRPVIRGRELDVQTGFRSRPVSVIR 411 Query: 487 NNDYSLVYTVE-------------NNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVRE 532 D+ L E N+ + LY L D+ ++ +LA E+ + Sbjct: 412 KGDWKLHLFHEEWLLDGGRENLAANHAVELYNLAADIGERHDLATVETAKRDELLNDLLA 471 Query: 533 FIDSSQPPLS-EVNQEKFNNIKKALSE 558 ++ +S+ + + N + +K + Sbjct: 472 WLAASEAKIPTQPNPKFDPATRKDAGK 498 >UniRef50_C7PRW9 Sulfatase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PRW9_CHIPD Length = 460 Score = 409 bits (1053), Expect = e-112, Method: Composition-based stats. Identities = 124/545 (22%), Positives = 191/545 (35%), Gaps = 128/545 (23%) Query: 38 TKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIG 97 T A T KPNII + DDLGYG + Sbjct: 1 MLTGCAL-LTTILTQGQTHKPNIIFILADDLGYGNISAYNSK------------------ 41 Query: 98 IDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ 157 + TP + L EG++F N Y + V PSR A++TG+ + NT + Sbjct: 42 -------SPVKTPNIDRLGQEGIQFKNFYSGNTVCAPSRCALLTGKHMGHAYIRGNT--R 92 Query: 158 DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQP 217 + ++ L +L Q +GY T GKW L + PE K Sbjct: 93 LPLRAEDSTLAQLLQGNGYRTGMFGKWGLGESGTTGSPEIK------------------- 133 Query: 218 QNRGFDYFMGFHAAGTAY-YNSPSLFKNRE------RVPAKGYISDQLTDEAIGVVDRAK 270 GFD F G+ A+ Y + LF+ +E Y D++ A+ ++ K Sbjct: 134 ---GFDTFFGYLNQQHAHNYYTDYLFEVKEGQISRVPRDTNVYSQDEILQHALSFINDNK 190 Query: 271 TLDQPFMLYLAYNAPHLPNDNPAPD-----------------QYQKQ---FNTGSQTADN 310 D+PF L+L + PH PA D Y+++ + + Sbjct: 191 --DKPFFLFLPFTLPHAELAPPATDMQAFLNADGSSKLGPETPYERKNGTYRSQENPHAA 248 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----LPLNGAQKGYKS 365 + A V +D+ V I +K+ G DNT I FTSDNG +G NG KG K Sbjct: 249 FAAMVTKLDRNVGEISALIKQLGLDDNTYIFFTSDNGPHREGGADPIYFDSNGPLKGIKR 308 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNYDK-LISAMDFYPTALDAADISIPKDLKLDGVSLLPW 424 Y GG P+ + GK+ G + D PT D + +DG+S Sbjct: 309 DLYEGGIRVPLLVRAPGKVSAGQVSTIPWAFWDVLPTLSDITHSPVLSG--IDGLSYTKA 366 Query: 425 LQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYT 484 L K H + W + Sbjct: 367 LNGTKPARQHDHFYWQFNEGGL-----------------------------------QEA 391 Query: 485 VRNNDYSLVYTVENN---QLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPP 540 + +D+ L+ + + LY L+ D+ ++ +LA PQ VK + G++ + Sbjct: 392 LLKDDWKLIRFKKRGTPERFELYHLSEDIGEEHDLATKYPQKVKALSGLMLQ--SKMPAE 449 Query: 541 LSEVN 545 E + Sbjct: 450 NPEFD 454 >UniRef50_A6KZI6 Sulfatase n=6 Tax=Bacteroides RepID=A6KZI6_BACV8 Length = 473 Score = 409 bits (1053), Expect = e-112, Method: Composition-based stats. Identities = 118/549 (21%), Positives = 198/549 (36%), Gaps = 118/549 (21%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 V + TK KPNI+ + DDLG+ L + Sbjct: 10 TVVLCTVGIQQSFSIDIVTKEKPNIVFILADDLGWTDLGVMGSDY--------------- 54 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFT------NGYVAHGVSGPSRAAIMTGRAPARFG 149 TP + L EG+ F +MTG R G Sbjct: 55 -----------YETPNIDRLATEGILFDNAYAAAANSAPSRAC------MMTGMYTPRHG 97 Query: 150 VYSNTDAQDG---------IP------LTETFLPELFQNHGYYTAAVGKWHLSKISNVPV 194 VY+ + G IP + E Q GY +GKWHL Sbjct: 98 VYTVSPPDRGDRTKRKYIAIPNVEDVCADFVTMAEALQEQGYQCGHIGKWHLGDDE---- 153 Query: 195 PEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKNRERVPA--- 250 + P ++GF + +G + AG Y Y P ++ + Sbjct: 154 ------------------DGTGPLSQGFIWNVGGNRAGAPYSYFYPYCLPDKSKCHVGLE 195 Query: 251 ----KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAP--DQYQKQFNTG 304 Y++D+LT+EA+ + PF L+L+++A H P ++Y+ + Sbjct: 196 EGILGEYLTDRLTEEAVSFIKSHSEG--PFFLHLSHHAVHTVLQAPDSLINKYRNKTPGK 253 Query: 305 SQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYK 364 Y A + +D V RI + +K G D TI++F SDNG P+ N G K Sbjct: 254 YHKNPIYAAMIEKLDDSVGRICQVIKTLGIADRTIVIFYSDNGGS--EPVTDNYPLNGGK 311 Query: 365 SQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLP 423 Y GG+ P+ + W GK++ G I+ +DFYPT + A IP + LDG + Sbjct: 312 GMPYEGGSRVPLIIRWTGKIEGGIRSSVPITGVDFYPTFVTLAQGKIPAN--LDGKDIFT 369 Query: 424 WLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSY 483 + + + ++L W + N + ++ Sbjct: 370 LINNNET---ERDLFWHFPAYL----------------------ESYLNGGRDFRAKPYS 404 Query: 484 TVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLS 542 ++R+ D+ L+Y E+ + L+ L DL + +L+ +NP E+ + ++I + P+ Sbjct: 405 SIRSGDWKLIYHYEDKSMELFNLKNDLGESQDLSGSNPVKRGELYQKLMKWIQETHAPIP 464 Query: 543 EVNQEKFNN 551 + Sbjct: 465 VKLNPYYRE 473 >UniRef50_Q7UXA8 N-acetylgalactosamine-6-sulfate sulfatase n=2 Tax=Bacteria RepID=Q7UXA8_RHOBA Length = 495 Score = 409 bits (1053), Expect = e-112, Method: Composition-based stats. Identities = 118/563 (20%), Positives = 194/563 (34%), Gaps = 126/563 (22%) Query: 13 SISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPT----EYSTKGKPNIIVLTMDDL 68 + I + + V + S S KPNI+ + DD Sbjct: 8 TEDFIKSFPLQCVRYVCLTTVVILFVLAGATESRCAAAEDTVASSVGKKPNILFIFADDW 67 Query: 69 GYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVA 128 G+G L + TP + L EG F VA Sbjct: 68 GWGDLSCHGHPY--------------------------VRTPNIDRLAREGTDFERFTVA 101 Query: 129 HGVSGPSRAAIMTGRAPARF---GVYSNTDA------QDGIPLTETFLPELFQNHGYYTA 179 GV PSR A+MTG PAR G ++ + D + + LP L Q+ GY TA Sbjct: 102 SGVCSPSRTAVMTGHFPARHNIDGHFAWVPSNAKRNMPDWLDPSAVTLPRLLQSGGYKTA 161 Query: 180 AVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP 239 GKWHLS P P G+D + F+ +G Sbjct: 162 HFGKWHLSNDMIPDSP--------------------TPAAYGYDRYGAFNCSGEQMPVHE 201 Query: 240 SLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQK 299 +E I ++ A + PF + L + PH P +++ Sbjct: 202 D------------------ANETIRFIEEAHSKGDPFFVNLWVHEPHTPFHVIPKYRWRF 243 Query: 300 QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG------- 352 + + S+ + Y A + D + +L+ L + + T+++F+SDNG Sbjct: 244 RDSGLSEADEIYAAVLSHADDRIGEVLDALDRLELTNKTLVIFSSDNGPARGSANAKLEL 303 Query: 353 --------------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDK--LISAM 396 + +KGYK+ + GG + P + W GK+ G D +ISA+ Sbjct: 304 SYDTATGAGFGIGASKGITAGRKGYKASLFEGGINVPFIVRWPGKVAAGKTDDSAMISAV 363 Query: 397 DFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWD 456 D PT D A + +P + DG+S + L+ + K L W S + W Sbjct: 364 DLLPTFCDIAGVELPSAYQADGISQVSALKGQPTTGRTKPLFWKYSARWPAQKSRPHHWA 423 Query: 457 NYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNL 515 SY V N + L+ +++ + LY + +D + +L Sbjct: 424 -------------------------SYCVVNERWKLLANQDSSYVELYDIVSDPFESTDL 458 Query: 516 AAANPQVVKEMQGVVREFIDSSQ 538 + P V ++ + ++ S Sbjct: 459 KESQPDAVTKLSKQLTDWKASLP 481 >UniRef50_C9MNT2 Arylsulfatase n=4 Tax=Bacteroidales RepID=C9MNT2_9BACT Length = 539 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. Identities = 126/579 (21%), Positives = 196/579 (33%), Gaps = 130/579 (22%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 K T + + KPNII + DD+GYG L + Sbjct: 29 KLTKTLLPITALGCVQGNAMTPKKQQKPNIIYIMCDDMGYGDLGCYGQKY---------- 78 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 TP + + EG+RFT Y VS PSRA +MTG+ V Sbjct: 79 ----------------ILTPNIDRMAKEGMRFTQAYAGAPVSAPSRACLMTGQHSGHTEV 122 Query: 151 YSN-------------------TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISN 191 N Q LPE+ +++GY T GKW + Sbjct: 123 RGNKEYWTNSKPVYYGENKDFSVVGQHPYDPNHIILPEIMKDNGYRTGMFGKWAGGYEGS 182 Query: 192 VPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAG-------TAYYNSPSLFKN 244 + P+ + D++ F A + P + + T N P Sbjct: 183 LSTPDKRGVDDFYGYICQFQAHLYYPNFL--NEYYKERGDTAVKRVVLTENINHPMF--G 238 Query: 245 RERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQ---YQKQF 301 E Y +D + A+ + +A+T D+PF Y PH P Y+KQF Sbjct: 239 DEYFKRTQYSADLIHQHAMDWL-KAQTKDKPFFGVFTYTLPHAELTQPDDSLVAFYKKQF 297 Query: 302 NTGS--------------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG 347 T T + A + +D V IL+ L + G DNT+++FTSDNG Sbjct: 298 FTDKTWGGQEGSRYNAVVHTHAQFAAMITRLDSYVGEILKLLDERGLADNTLVIFTSDNG 357 Query: 348 AVIDGP-----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPT 401 +G +G +G K Q Y GG P W G ++ G + + D PT Sbjct: 358 PHEEGGADPSFFNRDGKLRGIKRQCYEGGIRIPFIARWNGHIKAGVESNLPFAFYDLMPT 417 Query: 402 ALDAADISIPKDL---------KLDGVSLLPWLQDKKQGEPH-KNLTWITSYSHWFDEEN 451 + + DG+S+LP L + G+ L W + + Sbjct: 418 FAEMVGVKDYVQRYRNKKKTIDYFDGISILPTLINDGIGQKKYPYLYWEFAETD------ 471 Query: 452 IPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQ 510 VR D+ L+ LY L DL Sbjct: 472 ------------------------------QTAVRMGDWKLITIH--GIPHLYNLSNDLH 499 Query: 511 QKDNLAAANPQVVKEMQG-VVREFIDSSQPPLSEVNQEK 548 + ++A +P +V++M ++E +S P++ + +K Sbjct: 500 EDHDIANEHPDIVQKMIEIALKEHTNSELFPVTMPSLDK 538 >UniRef50_C1ZF13 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZF13_PLALI Length = 461 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. Identities = 136/542 (25%), Positives = 230/542 (42%), Gaps = 122/542 (22%) Query: 30 ADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENRE 89 + + +A + TE +++ +PNI+++ DD G+ + Sbjct: 5 LVLILAFQFTSQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQG------------ 52 Query: 90 VVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFG 149 + TP + S+ GV F GYV+ V PSRA ++ GR RFG Sbjct: 53 --------------HPRYKTPHIDSIGKNGVHFRQGYVSGCVCSPSRAGLLAGRYQQRFG 98 Query: 150 VYSNTDA----QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHD 205 N +G+P +ET LP+L + GY T A+GKWHL Sbjct: 99 HEFNIPPAYSETNGLPRSETLLPQLLKEDGYRTIALGKWHLG------------------ 140 Query: 206 NFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPS------LFKNRERVPAK--GYISDQ 257 A ++ P RGF + GF +Y+ + ++R +P + GY++D Sbjct: 141 -----YAPQFHPMERGFTDYYGFLQGSRSYFPLKKPTRLNQMLRDRTAIPEEQFGYMTDH 195 Query: 258 LTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYS 317 L DEAI + + ++ QP+M+YLA+NA H PND A D + Y A + Sbjct: 196 LADEAIAYIKQWQS--QPWMMYLAFNATHSPNDATAVDL------QAADGNKIY-AMTIA 246 Query: 318 VDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMF 377 +D+ V ++L+ LK+ G +T+++F +DNG NG+ G K T+ GGT P Sbjct: 247 LDRAVGKVLDALKECGLSKDTLVIFINDNGGA---GGHDNGSLHGKKGSTWEGGTRIPFL 303 Query: 378 MWWKGKLQPGNY-DKLISAMDFYPTALDAADI------SIP-KDLKLDGVSLLPWLQDKK 429 + + K+ G D+ + A+D +PT LD A + IP KLDG+SL+P + K Sbjct: 304 VQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGDAELKKIPFDPEKLDGISLIPRMTGKT 363 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 Q + L W + + +R + Sbjct: 364 QRLVDRPLYWKSG--------------------------------------KRWAIRQGN 385 Query: 490 YSLVY--TVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQ 546 V + +Q+ L+ L +D ++ NLAA +P +++++ + R++ + + P + Sbjct: 386 LKAVSGNDDQGDQVELFDLSSDPDEQRNLAATHPDELQQLEALYRKWESTLEKPRWGSSP 445 Query: 547 EK 548 K Sbjct: 446 GK 447 >UniRef50_C1ZJ89 Arylsulfatase A family protein n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ89_PLALI Length = 536 Score = 409 bits (1052), Expect = e-112, Method: Composition-based stats. Identities = 121/603 (20%), Positives = 207/603 (34%), Gaps = 160/603 (26%) Query: 28 HAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMEN 87 A + L +++ +PN++ + DDLG+G++ Sbjct: 9 AALSVLLLIQLAAESLWANELTLISHQSPRPNVVFILADDLGWGEVGCFG---------- 58 Query: 88 REVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPAR 147 ++ TP + L GV+ T Y PSR +MTG+ Sbjct: 59 ----------------QSKIPTPNIDRLASRGVKLTRHYSGAPTCAPSRCVLMTGKHLGH 102 Query: 148 FGVYSN----------TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPED 197 + N T+ Q + + FQ GY T A GKW L + + Sbjct: 103 AEIRGNQQAKVKLPQFTEGQHPLSDKALTIARQFQKAGYATGAFGKWGLGPVGSTGE--- 159 Query: 198 KQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKNRERV-------- 248 P +GFD F G++ A+ Y +L+KN E + Sbjct: 160 -------------------PNRQGFDEFFGYNCQALAHSYFPKALWKNAESIVNNEKPVP 200 Query: 249 ---------------PAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA 293 + Y + EA+ +DR QPF LYL + PH+ P Sbjct: 201 GHKKQPEGEVTMEAYQGENYAPRLIMAEALSFIDRHHQ--QPFFLYLPFTEPHVAMQPPP 258 Query: 294 ------PDQYQKQ-------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTII 340 P ++ ++ + + Y A + +D V ++ L+K+G + T+I Sbjct: 259 KIVEEFPVEWDERVYRGDGGYLPHPRPRAAYAAMIRDLDNHVGDVITSLEKHGLLEKTLI 318 Query: 341 LFTSDNGAV-----IDGPLPLNGA--------QKGYKSQTYPGGTHTPMFMWWKGKLQPG 387 +FTSDNGA D + KG+K Y GG P + W G++ P Sbjct: 319 VFTSDNGATHASANPDFHVGGADPLFFNSTRELKGFKGSIYEGGLRVPAIVSWPGQIPPA 378 Query: 388 -NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE---PHKNLTWITSY 443 + D++PT +A + +P+ LDGV+LLP L K + + W+ + Sbjct: 379 TTINTPSYFPDWFPTLCNATQLPLPEG--LDGVNLLPLLTGKTSPDQFIRPDPMVWVYAE 436 Query: 444 SHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL----VYTVENN 499 V D+ + + T Sbjct: 437 Y-----------------------------------TGQVCVHLGDFKVLRRGLRTNRPG 461 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFID-SSQPPLSEVNQEKFNNIKKALS 557 +Y+L +D + NLA + P +V + V++ + P+ E + ++K Sbjct: 462 PWEVYQLVSDPGESTNLADSRPDLVTKAIEVLKAQTAPNEIFPMPECD---LPVLEKGAP 518 Query: 558 EAK 560 +AK Sbjct: 519 KAK 521 >UniRef50_UPI0000588CF9 PREDICTED: similar to arylsulfatase B n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000588CF9 Length = 545 Score = 409 bits (1051), Expect = e-112, Method: Composition-based stats. Identities = 107/571 (18%), Positives = 203/571 (35%), Gaps = 66/571 (11%) Query: 5 LKKSVVSTSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLT 64 +S V S L AF + + + + A + + P+I+ + Sbjct: 6 RTRSFVLLLSSAALLIIYLAFERYFWRVAPVVSALSGGAAGVGFQPSRNPRRPPHIVFIL 65 Query: 65 MDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTN 124 DD G+ + + TP L L EG++ N Sbjct: 66 ADDYGFNDIGY---------------------------RNPAMRTPNLDYLAAEGIKLDN 98 Query: 125 GYVAHGVSGPSRAAIMTGRAPARFGVYS---NTDAQDGIPLTETFLPELFQNHGYYTAAV 181 YV + PSRA +M+G+ G+ + +PL LP+ + GY T Sbjct: 99 YYV-QPICTPSRAQLMSGKYQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMA 157 Query: 182 GKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSL 241 GKWHL P ++ + D+F+ G Y S Sbjct: 158 GKWHLGFYKKECWPTNRGFDSFLGILLGKG-----------DHFLHTEEGGGGPYPSTWP 206 Query: 242 FKNRERVPA--------KGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPA 293 ++ + Y + + + ++++ D+P LY+++ A H P P Sbjct: 207 WEGLDFRDGLQSTNAYSGIYSTHVIAERVENIIEKH-DKDKPLFLYVSFQAVHTPLQVPE 265 Query: 294 PDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 + + + Y Y +D+ V I ++LKK G +D+T+++F+SDNG ID Sbjct: 266 SYLQPFESSIQDEKRRIYAGMTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNGGNID-Q 324 Query: 354 LPLNGAQKGYKSQTYPGGTHTPMFMWWK---GKLQPGNYDKLISAMDFYPTALD-AADIS 409 N +G K+ + GG F+ +++ +LI D+YPT ++ A + Sbjct: 325 GASNWPLRGSKTTLWEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTLIEGVAGWT 384 Query: 410 IPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITS--------YSHWFDEENIPFWDNYHKF 461 + KLDG ++ L+ K + L I + + F + Sbjct: 385 L-SGTKLDGYNIWETLRSGKPSARVELLHNIDPLITPPSTWPNESIAAAHNSFSTRTYAA 443 Query: 462 VRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANP 520 +R++ + + +S + + + L+ +T D ++ +L+ P Sbjct: 444 LRYKDWKIVTGYXSINNGWYSPAESSKQSVASEILPGKSVWLFNITRDPREFHDLSNQEP 503 Query: 521 QVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 +V + + + + P L K N Sbjct: 504 AIVNFLLERLESYQSGASPVLYPDIDTKANP 534 >UniRef50_A6DQ01 N-acetylgalactosamine-4-sulfatase n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQ01_9BACT Length = 616 Score = 408 bits (1050), Expect = e-112, Method: Composition-based stats. Identities = 111/519 (21%), Positives = 194/519 (37%), Gaps = 107/519 (20%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 + A S F + KPNII++ DD GYG L Sbjct: 1 MNKLVFLCIFALSPFLLAQA----KPNIIIVMTDDQGYGDLSCHG--------------- 41 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS 152 TP + + +R TN +V P+R+A+MTGR AR GV+ Sbjct: 42 -----------NPILKTPQIDEFYKDALRLTNYHV-DPTCAPTRSALMTGRYSARVGVWH 89 Query: 153 NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSA 212 + + E + + +++GY T GKWHL Sbjct: 90 TVQGRHLMREREITMANILKDNGYATGIFGKWHLGDAY---------------------- 127 Query: 213 EEWQPQNRGFDYFMGFHAAGTAY--------YNSPSLFKNRERVPAKGYISDQLTDEAIG 264 ++P++RGF + + A G Y + + + N E V +G+ +D DEA Sbjct: 128 -PYRPEDRGFTHVVTHGAGGVGQVPDYWGNDYFNDTYYVNGEFVKFEGFCTDVWFDEAKK 186 Query: 265 VVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGS---QTADNYYASVYSVDQG 321 + + +PF ++ NAPH P AP +Y +N + ++ + ++D Sbjct: 187 FMKTQISKKKPFFTFITPNAPHGPM--RAPQKYLDMYNQTKVKGTKLEAFFGMITNIDDN 244 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWK 381 + E LK G DNT+++FT+DNG+ + N G K+ + GG P W Sbjct: 245 FGELREFLKDEGVADNTLLIFTTDNGSSSGIGV-YNAGMTGAKNSNFDGGHRVPFIFTWP 303 Query: 382 -GKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTW 439 G L G D+L + MD P+ ++ + PK + DG SL ++ + + L Sbjct: 304 KGNLMGGRDIDQLTAHMDILPSFIEMFGLKAPK-IDFDGTSLEKIIKGDQTALRDRVLLV 362 Query: 440 ITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENN 499 + + + V ++ + L+ N Sbjct: 363 ESQ------------------------------RVKDPEKWRNTAVMSDQWRLL-----N 387 Query: 500 QLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSS 537 LY + D QK+++++ +P+V + + + + Sbjct: 388 AKQLYNIRKDPAQKNDVSSQHPEVKQRLLAAYDKRWEDL 426 >UniRef50_UPI0000586CBD PREDICTED: similar to MGC86251 protein n=5 Tax=Strongylocentrotus purpuratus RepID=UPI0000586CBD Length = 525 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 124/551 (22%), Positives = 210/551 (38%), Gaps = 99/551 (17%) Query: 35 LKATKTNVAFSDFTPT-EYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDT 93 + + + S +PNII+ DDLGYG L Sbjct: 1 MLVLSSFLFVSLIINCFTTGRAKRPNIIIFYADDLGYGDLEPYG---------------- 44 Query: 94 YKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS- 152 STP L L G+ T Y + V PSRAA++TGR R GVY Sbjct: 45 ----------HPTSSTPNLGRLAAGGIVLTQFYSSSPVCSPSRAALLTGRYQMRSGVYPH 94 Query: 153 --NTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 N + G+PL ET + ++ + GY +AAVGKWHL +N Sbjct: 95 VFNVEMSGGLPLNETLISKMLKPEGYRSAAVGKWHLGLGNNSV----------------- 137 Query: 211 SAEEWQPQNRGFDYFMG--------------------FHAAGTAYYNSPSLFKNRERVPA 250 + P N GFD F+G A + Y+ +LF + Sbjct: 138 ----YLPHNHGFDEFLGLPASPSQCRCSVCFYPNVTCHRAPCSPEYSPCALFNGTTIIEQ 193 Query: 251 KG---YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQT 307 + D+ ++ + PF LY A + H P QY + +G+ Sbjct: 194 PADLLTLDDKYAMQSRRFIRTNVETGTPFFLYYASHHTHHP-------QYAGKETSGTSI 246 Query: 308 ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGA--VIDGPLPLNGAQKGYKS 365 + S+ ++D V +I E+LK+NG ++T F+SDNG ++ G K K+ Sbjct: 247 RGRFGDSLAALDWEVGQIYEELKENGILEDTFFFFSSDNGPSLSLENFGGNAGLMKCGKA 306 Query: 366 QTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWL 425 TY GG P + W G++ PG +L S +D PT + +P ++ LDG + P+L Sbjct: 307 TTYEGGIRVPAIVHWPGQITPGRSMELSSTLDVLPTIASITNAKLP-NVTLDGYDMSPFL 365 Query: 426 QDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTV 485 + ++ + S + + Y + +N N + + + Sbjct: 366 F-QGMPSLRESFFYYPSKVDTEHKSYAVRYKQYKAVFYTEGSALSNNKNKDVDCRGTS-- 422 Query: 486 RNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAAN-PQ--VVKEMQGVVREFIDSSQPPL 541 ++ L+ L D ++ N++ + P+ ++ ++ + +F Sbjct: 423 --------LRTYHDPPMLFDLEQDPSEQYNISINHSPERDIILKLTKMRADFDAKMVFAP 474 Query: 542 SEVNQEKFNNI 552 SE+N+ + N+ Sbjct: 475 SEMNKPRDKNL 485 >UniRef50_C5C581 Cerebroside-sulfatase n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C581_BEUC1 Length = 458 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 113/509 (22%), Positives = 176/509 (34%), Gaps = 118/509 (23%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 +PNI+++ DDLGYG L + TP L Sbjct: 2 TQRPNIVLINADDLGYGDLGCYGSMRND--------------------------TPHLDR 35 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN-------TDAQDGIPLTETFL 167 L EGVR T+ Y+A V PSR ++TG P R G G+ E + Sbjct: 36 LAAEGVRLTDFYMASPVCSPSRGGMLTGCYPPRIGFGEFVGRPVLFPGDPVGLDPAERTM 95 Query: 168 PELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMG 227 + + GY TAA+GKWH E+ P GFD + G Sbjct: 96 ARVLGDAGYATAAIGKWHCGDQ-----------------------PEFLPTRHGFDSYFG 132 Query: 228 FHAAG-------TAYYNSPSLFKNR---ERVPAKGYISDQLTDEAIGVVDRAKTLDQPFM 277 + + L + P + ++++ T A ++ QPF Sbjct: 133 IPFSNDMGRQREHEDWPPLPLMSGESVVQEQPDQRSLTERYTVAATRFIEEN--AHQPFF 190 Query: 278 LYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDN 337 LYLA+ H+P PAP + Y +V ++D +++ L++ G +N Sbjct: 191 LYLAHMYVHVPLFVPAP-------FLAASRNGGYGGAVAALDWSTGVVMDTLRRLGLEEN 243 Query: 338 TIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAM 396 TI++FTSDNG+ G N +G+K+QT+ GG + W + G D + ++ Sbjct: 244 TIVVFTSDNGSRARGEGGSNDPLRGHKAQTWEGGQRVACVVRWPAAIPAGGVCDAVTRSI 303 Query: 397 DFYPTALDAADISIPKD--LKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPF 454 D PT A + D +DGV L L + Sbjct: 304 DLLPTFAAVAGAADWADPARPVDGVDLTALLTGAGPAPNETFAYYYMDDLE--------- 354 Query: 455 WDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQL-GLYKL-TDLQQK 512 VR D+ L + + + LY L TD + Sbjct: 355 -----------------------------AVRVGDWKLHLSKRRDPMRELYDLRTDAAET 385 Query: 513 DNLAAANPQVVKEMQGVVREFIDSSQPPL 541 ++AA +P VV ++ V Sbjct: 386 HDVAADHPDVVARLEAVAETIRADLGDAR 414 >UniRef50_A6C8R8 Arylsulfatase A n=2 Tax=Planctomycetaceae RepID=A6C8R8_9PLAN Length = 510 Score = 408 bits (1049), Expect = e-112, Method: Composition-based stats. Identities = 108/562 (19%), Positives = 191/562 (33%), Gaps = 138/562 (24%) Query: 42 VAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKA 101 V + + + + KPN IV+ D+LGYG + + + Sbjct: 32 VTPALQSASAAPQQKKPNFIVIFCDNLGYGDIEPFGSTVN-------------------- 71 Query: 102 IEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN-------- 153 TP L + EG +FT+ V GV PSRA+IMTG R G++ N Sbjct: 72 ------RTPCLNRMAREGRKFTHYCVTAGVCTPSRASIMTGCYSQRVGMHWNPRDGQVLR 125 Query: 154 TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE 213 + G+ E + E+ + GY T +GKWHL + Sbjct: 126 PISPYGLNPDEITVAEVLKKQGYKTGMIGKWHLGDQT----------------------- 162 Query: 214 EWQPQNRGFDYFMGFHAAGTA---------------YYNSPSLFKNRERVP---AKGYIS 255 + P +GFDYF G + + + N + + ++ Sbjct: 163 PFLPTRQGFDYFYGIPYSDDMTQAVGQRLGDRLDGKNWPPLPVMLNDTVIEAGVDRNLLT 222 Query: 256 DQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASV 315 T++A+ +++ K +QPF LY P + G + S+ Sbjct: 223 KDYTEKAVEFIEKNK--NQPFFLYFPQAMP-----GSTRKPFASDAFRGKSKNGPWGDSI 275 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL-----NGAQKGYKSQTYPG 370 +D +IL++L + G NT++++TSDNG+ + + N G T G Sbjct: 276 EELDWSTGQILDKLVELGIDKNTLVIWTSDNGSPMAKDMNSTERGTNKPLNGRGYTTSEG 335 Query: 371 GTHTPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 P +WW + G ++L + MD PT A +P D +DG + P + + Sbjct: 336 AFRVPTIVWWPETVPAGTVCEELATTMDLLPTFARLAGGKVPSDRIIDGHDIRPLIMGEA 395 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 + + + + VR Sbjct: 396 DAKTPYDGFYYY------------------------------------AMEQLQAVRKGP 419 Query: 490 YSL------------VYTVENNQLGLYK-LTDLQQKDNLAAANPQVVKEMQGVVREFIDS 536 + L E ++ L+ +TD+ + N+A +P++VKE+ + + Sbjct: 420 WKLFVPLKEFSRHPHFKKGEGSRPLLFNVVTDISSEHNVADQHPEIVKELMSLAEKARAD 479 Query: 537 SQPPL-SEVNQEKFNNIKKALS 557 NQ + + Sbjct: 480 LGDTNHPGANQRPAARVDHPVP 501 >UniRef50_C7M5R4 Sulfatase n=4 Tax=Bacteroidetes RepID=C7M5R4_CAPOD Length = 480 Score = 408 bits (1048), Expect = e-112, Method: Composition-based stats. Identities = 127/550 (23%), Positives = 200/550 (36%), Gaps = 138/550 (25%) Query: 35 LKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTY 94 +K ++ S T + + PN+I + DDLGYG + Sbjct: 1 MKYLLFSLLASLCTIGVKAQEKLPNVIFILADDLGYGDIEPYG----------------- 43 Query: 95 KIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSN- 153 TP L L DEG++FT Y V PSRA+ +TG+ + N Sbjct: 44 ---------QQIIKTPQLSKLADEGMKFTQFYTGTSVCAPSRASFITGQTTGETHIRGNE 94 Query: 154 -----TDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 D Q + + + +LF+ GY T GKW L + + Sbjct: 95 EVREPVDGQAPLLANDPSVAQLFKKAGYNTGCFGKWGLGIVPS----------------- 137 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSP-SLFKNRERV--PAKG-------YISDQL 258 E P +GFD F G+++ A+ P L+ + E+V P G Y D + Sbjct: 138 -----EGNPLKQGFDTFFGYNSQFRAHRRYPAFLWHDNEKVLIPENGNYERQEVYGEDLI 192 Query: 259 TDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ------------------ 300 ++ + + + T ++PF ++L Y PH P Y Sbjct: 193 QEKILDYIGKQ-TAEKPFFMWLTYTLPHAELVVPHDSIYASYEYLPKKPYKGVDYDKITP 251 Query: 301 -------FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP 353 + + T Y A V +D+ + I + LK G ++TII+F SDNGA +G Sbjct: 252 KPFGWAGYMSQPHTYATYAAMVSRLDKYLGEIRKLLKVKGLDEDTIIIFASDNGAHREGG 311 Query: 354 -----LPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLI-SAMDFYPTALDAAD 407 + +G K Y GG TP ++WKGK++ G+ I + D PT + Sbjct: 312 ADPKFFNSSAGLRGIKRDLYEGGIRTPYIVYWKGKIKAGSVSDHIGAFWDMMPTFAEITH 371 Query: 408 ISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSD 467 + VS LP L KKQ + HK L W Sbjct: 372 QKYVPNRHQ--VSFLPTLLGKKQQQQHKYLYWEFHE------------------------ 405 Query: 468 DYPHNPNTEDLSQFSYTVRNNDYSLVY----TVENNQLGLYKL-TDLQQKDNLAAANPQV 522 VR ++ V + + LY L TD ++ NLA P++ Sbjct: 406 -----------MGGRQAVRYKNWKGVRLNVNKDKKAPIELYDLTTDPAEQHNLAEKYPKI 454 Query: 523 VKEMQGVVRE 532 VK+++ + + Sbjct: 455 VKKIERFMEQ 464 >UniRef50_A6DSG6 Arylsulfatase A n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DSG6_9BACT Length = 499 Score = 407 bits (1047), Expect = e-112, Method: Composition-based stats. Identities = 128/537 (23%), Positives = 193/537 (35%), Gaps = 109/537 (20%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 +T + F + PN I + DD GYG L Sbjct: 2 TFRTLIISLSFLLGFTAKAEMPNFIFIMTDDQGYGDLGCYG------------------- 42 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYS--NT 154 TP + + D GVRFT+ Y H P+RA++MTG R GV S Sbjct: 43 -------HPIIKTPNIDKMADRGVRFTDFYARHK-CSPARASLMTGAFNFRVGVGSIVYP 94 Query: 155 DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAE- 213 ++ G+ +PE+ + GY TA +GKWHL + +P D+ Y T + Sbjct: 95 NSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLGHTAGY-LPRDQGFDYYFGVPGTNHGDA 153 Query: 214 --EWQPQNRGF---------DYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQ---LT 259 P GF DY+ + NS L KN + I+ T Sbjct: 154 KTHKLPVAEGFKPSGEFTIEDYWA--DKGKGVHGNSTILMKNDNVIEWPTDITQLTKRYT 211 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVD 319 +A+ + K D+PF LY A+ PH P A G Y + +D Sbjct: 212 HDAVRYIKENK--DKPFFLYFAHGTPHHPYTVDA-------AFRGKSDHGLYGDMIEEID 262 Query: 320 QGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-----PLNGAQKGYKSQTYPGGTHT 374 V +++ L++NG TII FTSDNGA N KG+K + GG Sbjct: 263 WSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKEHAEKGSNLPLKGWKGSSEEGGVRV 322 Query: 375 PMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQ-GE 432 P + W G L G +++ S MD +PT A I K+DG ++ P + + Sbjct: 323 PFVLSWPGTLPEGKKTNEIASLMDIFPTYAALAGIEPEVPQKIDGNNIFPIMMCEPDVKS 382 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 P+K + + + VRN+ + Sbjct: 383 PNKYIFY------------------------------------AGNTPKITGVRNHRFKY 406 Query: 493 VYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFID-----SSQPPLSE 543 LY + D+ + N+A P+V++E+Q + F S + P E Sbjct: 407 STKTSG----LYDMHADIGETTNVADKYPEVLQELQKAMEAFQKDIDENSMEAPFDE 459 >UniRef50_A7VQW1 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VQW1_9CLOT Length = 588 Score = 407 bits (1046), Expect = e-112, Method: Composition-based stats. Identities = 117/504 (23%), Positives = 191/504 (37%), Gaps = 110/504 (21%) Query: 55 KGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLS 114 + +PN++ + DD GYG L TP + Sbjct: 3 EKRPNVVFVLTDDQGYGDLGCTG--------------------------NPDIQTPQIDE 36 Query: 115 LMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPLTETFLPELFQNH 174 E VR T+ +VA + P+R AI TGR P R GV++ + + ET L E+F+++ Sbjct: 37 FYKEAVRLTDYHVA-PLCAPTRGAIFTGRRPLRNGVWATCWGRSILHEGETTLAEVFRDN 95 Query: 175 GYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTA 234 GY T GKWHL ++PQ+RGF + G Sbjct: 96 GYATGLFGKWHLGDNY-----------------------PYRPQDRGFTEVVAHKGGGVG 132 Query: 235 --------YYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPH 286 Y S ++N + +GY +D D A ++ LD+PF + NAPH Sbjct: 133 QTPDFWGNNYFEDSYYQNGKLTRYEGYCTDVWFDAAERFIESH--LDEPFFACITTNAPH 190 Query: 287 LPNDNPAPDQYQKQFNTGSQT-ADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSD 345 P ++Y + +Y + ++D R+ ++L G DNT+++F +D Sbjct: 191 EPY--LVEEKYAAPYRENENIVHPEFYGMISNIDLNFGRLRKKLSDWGIEDNTVLIFMTD 248 Query: 346 NGAVIDGPL--------PLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGN-YDKLISA 395 NG + N +G K+ Y GG P F+ W G L G + Sbjct: 249 NGTSGGCEIDGNEHVLRGYNAGMRGMKTSYYDGGHRVPFFIRWPNGGLDGGRDVEDTSYH 308 Query: 396 MDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFW 455 +DF+PT D +S+P L+LDGVSL L ++ + Sbjct: 309 VDFFPTLADLCGLSMPP-LQLDGVSLKGVLTGEEALPKGRVEFMQYH------------- 354 Query: 456 DNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDN 514 +T S++ +V ++ + LV LY + D Q + Sbjct: 355 -----------------QSTVVPSKWESSVVSDQWRLVRGK-----ELYDIKADPGQNRD 392 Query: 515 LAAANPQVVKEMQGVVREFIDSSQ 538 +A +P+VV+ ++ + Q Sbjct: 393 IAGQHPEVVRRLRAAHEAYWQEMQ 416 >UniRef50_A6CGJ8 Arylsulfatase A n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CGJ8_9PLAN Length = 520 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 121/552 (21%), Positives = 188/552 (34%), Gaps = 119/552 (21%) Query: 49 PTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKS 108 P ++ + NI+ + DDLGYG + ++ Sbjct: 24 PIAHAADKQSNIVYILADDLGYGDVSCYN-------------------------PESKIK 58 Query: 109 TPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV-YSNTDAQDG--IPLTET 165 TP + L EG++FT+ + V P+R I+TGR R + Y D D I + Sbjct: 59 TPHIDRLAAEGMKFTDAHTPSAVCTPTRYGILTGRYCWRTRLKYRVLDGFDPPLIEQDQV 118 Query: 166 FLPELFQNHGYYTAAVGKWHLSKISN-------VPVPEDKQTRDYHDNFTTFSAE-EWQP 217 +P L + GY TA +GKWHL VP D++ R + ++ P Sbjct: 119 TVPSLLKKAGYDTACIGKWHLGMQWTDKNGQPVPAVPIDRRQRPRVGDDVDYTKPILGGP 178 Query: 218 QNRGFDYFMGFHAAGTAYYNSPSLFKNRER-----------------VPAKGYISDQ--- 257 GFDY+ G A+ +N V D Sbjct: 179 LTSGFDYYFGISASLNMSPF--CFIRNDRPVILPTIPSERIQTEFLSVDQGMRSPDFTIR 236 Query: 258 -----LTDEAIGVVDRA--KTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADN 310 LT EA+ ++R ++ ++PF LY APHLP G A Sbjct: 237 SVMPTLTGEAVKYIERHGKESPERPFFLYFPLTAPHLPLVPNDE-------FKGKSAAGE 289 Query: 311 YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG------------------ 352 Y V VD V I++ L++ G +NT+++FTSDNG + Sbjct: 290 YGDFVLEVDATVGAIMDALQRTGVAENTLVIFTSDNGGLYHWWTPQETDDLKHYKPNHRG 349 Query: 353 ------PLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQP-GNYDKLISAMDFYPTALDA 405 N +G K+ + GG P + W GK D+L+ D T Sbjct: 350 QYVKDRGHQGNAHLRGTKADIWEGGHRVPFIVRWPGKTPADSTNDELVELTDLLATCAAI 409 Query: 406 ADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQ 465 D +P D V++LP L KK P + S F P+ + Sbjct: 410 TDTKLPDGDAQDSVNILPALLGKKSDTPLREYAIHHSLWGHFSVRQGPWKMIPKRGSGGF 469 Query: 466 SDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVK 524 + P + + LY L D + N+ +P+VVK Sbjct: 470 TRAREVEPAAGEPTG---------------------QLYNLKQDPSETKNVWLEHPEVVK 508 Query: 525 EMQGVVREFIDS 536 + ++ + Sbjct: 509 PLSAILEQVQKQ 520 >UniRef50_Q15XP0 Sulfatase n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15XP0_PSEA6 Length = 627 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 105/528 (19%), Positives = 190/528 (35%), Gaps = 108/528 (20%) Query: 40 TNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGID 99 T + KPNI+++ DD GYG + Sbjct: 26 TVCSAVQNRSASAEPPTKPNIVLIVTDDQGYGDIGRHN---------------------- 63 Query: 100 KAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDG 159 TP + + + R TN +V P+R+A++TG+ R GV+ + Sbjct: 64 ----NPIIQTPNIDDIAAQSARLTNFHV-DPTCSPTRSALLTGKHSLRAGVWHTILGRYM 118 Query: 160 IPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQN 219 + L E Q +GY T GKWHL ++PQ+ Sbjct: 119 LGPEHVTLAESLQENGYRTGIFGKWHLGDNY-----------------------PYRPQD 155 Query: 220 RGFDYFMGFHAAGT----AYY----NSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKT 271 +GFD + G Y+ + + ++N GY + DEA +D+ Sbjct: 156 QGFDDVLIHGGGGVGQTPDYWGNTQFNDTYYRNGTPEKFSGYATKIWFDEAKKFIDKQH- 214 Query: 272 LDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKK 331 D P+ Y+A NAPH P P + ++ ++Y + +D+ V + L+ Sbjct: 215 -DTPYFAYIALNAPHGPYRAPETHIEPYEKRGLNRDMASFYGMISYIDEQVGELRAHLRA 273 Query: 332 NGQYDNTIILFTSDNG-------------------AVIDGPLPLNGAQKGYKSQTYPGGT 372 Q DNTI +F +DNG A N +GYK + Y GG Sbjct: 274 QDQLDNTIFIFMTDNGSSYKPTDAKTHLTKRHLPLAEQYPNWQPNDNMRGYKGEVYEGGH 333 Query: 373 HTPMFMWWK-GKLQPGNYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQG 431 P F+ + G + G+Y+ + + D PT L+ A+I P + LDG SL +L+ ++ Sbjct: 334 RVPFFISYPNGNITTGDYEAITAHFDVMPTLLELANIP-PVNSTLDGTSLATYLKGEQAN 392 Query: 432 EPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYS 491 + + ++ + + + + + Sbjct: 393 R------------------------SLESKLSERAIVVTNQRVYHPSVKRPIAIAFHQWR 428 Query: 492 LVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 + ++ + L+ L D Q++++ +P ++ M+ + + Q Sbjct: 429 YISANDSEK--LFNLQQDPSQQNDIKNDHPDILARMRQRKQTWWQEMQ 474 >UniRef50_A4W906 Sulfatase n=43 Tax=Enterobacteriaceae RepID=A4W906_ENT38 Length = 501 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 113/527 (21%), Positives = 188/527 (35%), Gaps = 101/527 (19%) Query: 46 DFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAA 105 + KPN++++ DDLGYG L Sbjct: 24 LAAEQSANQLNKPNVVIILADDLGYGDLGIYG--------------------------HP 57 Query: 106 QKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQDGIPL--T 163 TP + L EGVRF+ Y + PSRA ++TGR P R G+ S I L Sbjct: 58 IVKTPNIDKLAQEGVRFSQYYAPAPLCSPSRAGLLTGRTPFRTGIRSWIPTNKNIALGRN 117 Query: 164 ETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGF- 222 E + ++ GY TA +GKWHL+ + + + GF Sbjct: 118 EKTIASYLKDQGYDTAMMGKWHLNAGVDRHDQPQAEDAGFDYTLV---------NAAGFV 168 Query: 223 --DYFMGFHAAGTAYYNSPSLFKNRERV-PAKGYISDQLTDEAIGVVDRAKTLDQPFMLY 279 D ++N + + + ++ EAI ++ K ++PF +Y Sbjct: 169 TSDLDKAKERPRNGVVYPNGFYRNGKALGTVNQISGEFVSQEAINWLND-KKDNKPFFMY 227 Query: 280 LAYNAPHLPNDNP-----------------APD-QYQKQFNTGSQTADNYYASVYSVDQG 321 +A+ H P +P PD Y + + YYA++ +D+ Sbjct: 228 VAFTEVHTPLASPKKYLEIYKNYMSEYEKQHPDMFYADWVDKPYRGPGEYYANISYMDEQ 287 Query: 322 VKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPL--------NGAQKGYKSQTYPGGTH 373 V ++L ++K GQ DNTII+FTSDNG V +G K + GG Sbjct: 288 VGKVLAKIKSMGQEDNTIIIFTSDNGPVTREARKWYELNMAGETDGLRGRKDNLWEGGIR 347 Query: 374 TPMFMWWKGKLQPGNY-DKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGE 432 P + + L G D +S +D PT + ++P D +DG S++P L+ + Sbjct: 348 VPAIIKYGQHLHAGTVTDTPVSGLDILPTLAELTHFNLPTDRIIDGESIVPVLEGQTMNR 407 Query: 433 PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSL 492 L I +D + +R+ D+ + Sbjct: 408 QQPLLFAIDMPF-------------------------------QDDPTDMWALRDGDWKM 436 Query: 493 VYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQ 538 ++ + LY L D + N P + ++M + + S + Sbjct: 437 IFDRNSKPKYLYNLKLDRGETMNQLGKQPVLEQKMIAALARYQSSIE 483 >UniRef50_B7PV03 Arylsulfatase B, putative n=7 Tax=Ixodes scapularis RepID=B7PV03_IXOSC Length = 588 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 122/591 (20%), Positives = 218/591 (36%), Gaps = 118/591 (19%) Query: 22 MAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFD 81 + A A + A + S +P +P+I+ + DDLG+ + ++ Sbjct: 34 LVNIATIGAVILLFIALAAMLLTSRRSPV-----RQPHIVFILADDLGWNDVSYNGC--- 85 Query: 82 PKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMT 141 Q TP + +L G+R Y + PSRAA+MT Sbjct: 86 -----------------------PQIRTPNIDALAWNGIRLQRYY-TQPMCTPSRAALMT 121 Query: 142 GRAPARFGVYSNT---DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDK 198 GR P G+ + G+PL LP+ + GY + +GKWHL Sbjct: 122 GRYPIHTGMQHFVILQNEPRGLPLKFKLLPQWLGDLGYVSQMLGKWHLG----------- 170 Query: 199 QTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFK--------------N 244 F +E+ P RGF +G YY+ K + Sbjct: 171 -----------FYKKEYTPTMRGFQKHIGSWGGFVDYYSHIRFNKIGFSHSGLDFRQGLS 219 Query: 245 RERVPAKGYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHL-----PNDNPAPDQYQK 299 R Y ++ +T+ A V++ L++P LYLA+ APH P P +Y Sbjct: 220 EGREFDGQYYTEFMTEAATRVIE-NHPLEKPLFLYLAHLAPHGANRHDPLQ--VPKKYSD 276 Query: 300 QFNTGSQTADN-YYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG---PLP 355 +++ Y V ++D+ V ++E L K G +T+++F+SDNG +G Sbjct: 277 KYHDIGHWNRTMYAGMVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGGDTNGENPNYA 336 Query: 356 LNGAQKGYKSQTYPGGTHTPMFMWWK--GKLQPGNYDKLISAMDFYPTALDAADISIPKD 413 + KG K + GG H P F+W ++ +Y+ + D+ PT A Sbjct: 337 SSWPFKGQKRTLWEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAGGDPSDL 396 Query: 414 LKLDGVSLLPWLQDKKQGEPHKNLTWIT-----------SYSHWFDEENIPFWDNYHKFV 462 +DG+S L L + + + L I + D + + Sbjct: 397 GDIDGISHLDSLSRRSETPRKELLINIDPIENVSAIIEGHFKLVSGVVRGGVLDEWFQVP 456 Query: 463 RHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQ---------------------L 501 + + DY + S + +RN + + E+ Sbjct: 457 GNITWDYNRARQECETSLVARVLRNAGHDVACGSEDGSFPTPIKCGKRDPSKPCVPTVAP 516 Query: 502 GLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQPPLSEVNQEKFNN 551 L+ L+ D + +N+A + +V++ + G + + ++S PP + + E+ + Sbjct: 517 CLFDLSKDPCEYNNIAEQHNEVLQRLLGKLEGYRETSVPPGNLPSDEQADA 567 >UniRef50_UPI0001B577E1 arylsulfatase precursor n=1 Tax=Streptomyces sp. C RepID=UPI0001B577E1 Length = 746 Score = 406 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 131/558 (23%), Positives = 206/558 (36%), Gaps = 98/558 (17%) Query: 11 STSISLILASGMAAFAAHAADDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGY 70 S LA+ A A + + A + + S PNI+V+ DDLGY Sbjct: 1 MPSRRTFLAASTATLGLTAVTATTAGSAQAVPAVTVPETKDGSGTRLPNIVVVLADDLGY 60 Query: 71 GQLPFDKGSFDPKTMENREVVDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHG 130 G+L STP L L EG+RFT+ Y Sbjct: 61 GELGSYG--------------------------QKLISTPRLDRLATEGLRFTDAYSTAA 94 Query: 131 VSGPSRAAIMTGRAPARFGVYSNT--DAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSK 188 V PSR +++TG V +N Q + T+T ++ + GY TA +GKW Sbjct: 95 VCAPSRCSLLTGLHTGHSTVRANPSSGGQGSLTATDTTFAQVLRARGYRTAVIGKWGFG- 153 Query: 189 ISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY-YNSPSLFKN--R 245 + ++ P RGF+ F G+ A+ Y L+ N + Sbjct: 154 -------------------PEAAGQDSHPAARGFEEFYGYIDHSHAHQYYPEYLWHNAVK 194 Query: 246 ERVPAKG------YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQK 299 E +PA Y L A+ +D +PF+L L N PH P+D P Y Sbjct: 195 EPIPANAGGAKAVYAPHLLEQHALEFIDTHAA--EPFLLLLTPNVPHAPSDIPDSSAYAD 252 Query: 300 QFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGP-----L 354 + + + + A V D V +++++L+ G +T++L TSDNG +G Sbjct: 253 R--SWTAANKGHAAQVSYFDSLVGKVVDRLRSLGLEQDTVVLVTSDNGPHEEGGVNPDLF 310 Query: 355 PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLISAMDFYPTALDAADISIPKDL 414 NG +GYK Y GG P+ W G++Q G ++ D PT + P D Sbjct: 311 DANGPLRGYKRNLYEGGVRVPLIAWGPGRVQQGTSNRPTPLTDVLPTLAELGGAPAPTD- 369 Query: 415 KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPN 474 +DG+S P L H +L W N + Sbjct: 370 -VDGLSAAPLLAGSPDSARHGHLYWFRDELGVTSRANA--------------------QD 408 Query: 475 TEDLSQFSYTVRNNDYSLVY-------TVENNQL--GLYKL-TDLQQKDNLAAANPQVVK 524 + + + VR ++ V + +++ LY L TDL + ++ A NP Sbjct: 409 GKRATWLAEAVRRENWKAVRFAPERDHNLPDDKWQVELYDLATDLGETRDVLAKNPSKAA 468 Query: 525 EMQGVVREFIDSSQPPLS 542 E+ ++R + P Sbjct: 469 ELVALMRSSWKDTYPRTP 486 >UniRef50_Q7UER7 Sulfatase 1 n=8 Tax=Bacteria RepID=Q7UER7_RHOBA Length = 553 Score = 406 bits (1045), Expect = e-111, Method: Composition-based stats. Identities = 119/572 (20%), Positives = 205/572 (35%), Gaps = 128/572 (22%) Query: 32 DVKLKATKTNVAFSDFTPTEY-STKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 L + F+ + T+ + +PN++++ +DDLG + + F Sbjct: 31 STLLWMLFATITFACLSITQANAQDDRPNVVLILVDDLGLHDIGIEGSKFH--------- 81 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 TP + +L G+RFT GY V PSRA+I G+ AR G+ Sbjct: 82 -----------------QTPHIDALAKRGMRFTAGYANCRVCSPSRASIQLGQFTARHGI 124 Query: 151 YSNTDAQDGI-----------------PLTETFLPELFQNHGYYTAAVGKWHLSKISNVP 193 A+ G+ P + LPE + GY T GKWHL ++ Sbjct: 125 TDWIGAKTGMDFNRGDELLPAEYVHAMPAKDVTLPEALRESGYKTFFAGKWHLGGEGSM- 183 Query: 194 VPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAY--YNSPSLFKNRERVPAK 251 P + GFD +G H G+ + +P E P Sbjct: 184 -----------------------PTDHGFDINIGGHHRGSPPGGFFAPFKNPVMEDGPDG 220 Query: 252 GYISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPND------------NPAPDQYQK 299 ++ +L E ++ DQP+ L++ A H P PAP Sbjct: 221 ESLTRRLGKETASFIEGQ--DDQPYFAMLSFYAVHGPIQTTQELWQKYRESAPAPPADGN 278 Query: 300 QFNTGS-------QTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDG 352 +F Q Y + ++D V ++ ++ +G+ DNT+++FT DNG V G Sbjct: 279 RFKIDRTLPVRQIQDNPVYAGMMETLDNAVGDVMAAIEASGKADNTLVIFTGDNGGVSSG 338 Query: 353 PL--PLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPG-NYDKLISAMDFYPTALDAADIS 409 N +G K + + GG P ++ + D + D YPT LD ++ Sbjct: 339 DAYSTSNLPHRGGKGRQWEGGLREPYYVSMPAIVPENSTSDVPVIGSDLYPTILDVCNLP 398 Query: 410 IPKDLKLDGVSLLPWLQDKKQGE-PHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDD 468 + +DG SL L K ++L W + E Sbjct: 399 LRPQQHIDGRSLETVLAGGKDELLEQRSLIWHYPHYGNQGGE------------------ 440 Query: 469 YPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLYKL-TDLQQKDNLAAANPQVVKEMQ 527 S +R DY L++ ++ LY L TD+ ++++LA+ P+ V M+ Sbjct: 441 ------------PSSVIRTGDYKLIHYHLDSHDELYHLPTDIGEQNDLASEQPERVAAMR 488 Query: 528 GVVREFIDSSQPPLSEVNQ--EKFNNIKKALS 557 + ++ S + + + ++ Sbjct: 489 KELLAYLKSVDAKFPQPDPRFDPEKAKQRWAR 520 >UniRef50_A6CA66 N-acetylgalactosamine 6-sulfatase (GALNS) n=3 Tax=Bacteria RepID=A6CA66_9PLAN Length = 478 Score = 406 bits (1044), Expect = e-111, Method: Composition-based stats. Identities = 107/527 (20%), Positives = 182/527 (34%), Gaps = 80/527 (15%) Query: 37 ATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKI 96 V+ + GKPNII++ DD G+G ++ Sbjct: 13 VVFALVSTGASFAAKAERSGKPNIILVMADDQGWGDTSYNG------------------- 53 Query: 97 GIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDA 156 TP L ++ + F Y V P+RA++MTGR P R V ++ Sbjct: 54 -------HPFVKTPELDAMAKDAFVFDRFYAGAPVCSPTRASVMTGRNPNRTKVTNH--- 103 Query: 157 QDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQ 216 + E + E + GY T GK HL Sbjct: 104 GRYMRPHEQTIAETLKAAGYVTGIFGKVHLGSGQPDS--------------------PCN 143 Query: 217 PQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISDQLTDEAIGVVDRAKTLDQPF 276 P GFD ++ + N P L + + KG S L D+ + +++ K +QP Sbjct: 144 PSGMGFDEWVIGLN---FFDNDPYLSRMGKIEHRKGKGSVILMDDTLEFLEKHKDGEQPI 200 Query: 277 MLYLAYNAPHLPND--NPAPDQYQKQFNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQ 334 + + +PH P+ P Y+ + YY + +DQ V R+ L+ Sbjct: 201 FTVVWFPSPHDPHAEVPEGPSLYKGK------PHAGYYREITLLDQQVGRLRRALRNMNI 254 Query: 335 YDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGNYDKLIS 394 +NTI+ + SDNG ++ + K Y GG P + W + G ++ Sbjct: 255 AENTIVWYCSDNGGLVKETSGG----REKKGSIYEGGLRVPGIIEWPARKLKGRTSVPVA 310 Query: 395 AMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPHKNLTWI----------TSYS 444 D YPT L A + + LDG+ + + W Sbjct: 311 TFDIYPTLLSLAGVELYAPHPLDGMDVSGIITGSVTKRSKPMGFWHQLQRGQGTRSDQIQ 370 Query: 445 HWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVYTVENNQLGLY 504 E+ H VR + D E+ + + + L + + LY Sbjct: 371 KAIMEKQQAGAPLPHDSVRMRKDVDEFPQFPEETTTGHAAWNDWPWKLHRI-KGTKFELY 429 Query: 505 KLT-DLQQKDNLAAANPQ---VVKEMQGVVREFIDSSQPPLSEVNQE 547 L+ D +K +L+ NP+ VK+MQ + ++ S ++ + + Sbjct: 430 NLSDDPMEKTDLS-KNPEQAQRVKQMQQELDAWMRSVIRSINGKDYQ 475 >UniRef50_A6DI18 Arylsulfatase A n=2 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DI18_9BACT Length = 562 Score = 406 bits (1044), Expect = e-111, Method: Composition-based stats. Identities = 110/561 (19%), Positives = 195/561 (34%), Gaps = 128/561 (22%) Query: 33 VKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVD 92 K+ + K + F + S KPNII L DD+G G + Sbjct: 7 SKMTSLKKISFLASFVCSLVSAAEKPNIIYLLADDMGVGDVKAYN--------------- 51 Query: 93 TYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARF---- 148 ++ TP L +L G+ FT+ + V P+R I+TGR R Sbjct: 52 ----------ADSKIPTPALDNLAANGMMFTDAHTNSSVCTPTRYGILTGRYSWRTTKKS 101 Query: 149 GVYSNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFT 208 GV + I + L + GY TA +GKWHL ++ ++ Sbjct: 102 GVTQGL-SPHLIDSNRETVASLLKKEGYATACIGKWHLGMDWSLKDGSIADSKSDQSQID 160 Query: 209 TFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKNRERVPAKGYISD------------ 256 + P GFDY+ G A +A ++ ++ V + D Sbjct: 161 LSKEIQNGPNKNGFDYYFGM--AASANHSPHCFIEDGYTVGKLQVLDDKQRKAVGIDGKP 218 Query: 257 --------------QLTDEAIGVVDR--AKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQ 300 + T++ V + DQPF +Y+ N+PH P A Sbjct: 219 GLVAKGFKQSEILPRFTEKTCEWVRSQVNQKPDQPFFVYMPLNSPHSPIVPSAK------ 272 Query: 301 FNTGSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNG--------AVIDG 352 G ++ D + +++ LK G DNT+I+FT+DNG + + Sbjct: 273 -FLGKSGLSSHGDFCMETDWALGEVVKILKALGIEDNTMIIFTADNGTSPMAKFEPMQEQ 331 Query: 353 PLPLNGAQKGYKSQTYPGGTHTPMFMWWK-GKLQPGNYDKLISAMDFYPTALDAADISIP 411 + +G K +TY GG P + W G D+LI D T + I++ Sbjct: 332 GHFPSYIYRGLKGETYEGGHRVPFIVKWPKGLAPAKTSDQLICTTDLMATVAEINGIALA 391 Query: 412 KDLKLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPH 471 ++ D +S LP L+++ E + Sbjct: 392 NNVGEDSISFLPALREQAIPE------------------------------------LAN 415 Query: 472 NPNTEDLSQFSYTVRNNDYSLVYT---------------VENNQLGLYKL-TDLQQKDNL 515 + +R + L+ +++ ++ L+ + D Q+ NL Sbjct: 416 RAIVHHSDAGVFAIRQGKWKLLLDNIGGSRRSNPKDKPVIDDAEIQLFDMVNDPQESTNL 475 Query: 516 AAANPQVVKEMQGVVREFIDS 536 + NP++V+ ++ + ++I+ Sbjct: 476 SQKNPEIVEGLKKQLADYINK 496 >UniRef50_P34059 N-acetylgalactosamine-6-sulfatase n=23 Tax=Deuterostomia RepID=GALNS_HUMAN Length = 522 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 121/539 (22%), Positives = 192/539 (35%), Gaps = 99/539 (18%) Query: 41 NVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYKIGIDK 100 + S PNI++L MDD+G+G L Sbjct: 14 LLVLSAAGMGASGAPQPPNILLLLMDDMGWGDLGVYG----------------------- 50 Query: 101 AIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVYSNTDAQ--- 157 + TP L + EG+ F N Y A+ + PSRAA++TGR P R G Y+ Sbjct: 51 ---EPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTNAHARNA 107 Query: 158 -------DGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTTF 210 GIP +E LPEL + GY + VGKWHL Sbjct: 108 YTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH---------------------- 145 Query: 211 SAEEWQPQNRGFDYFMGFHAAGTAYYNSP-----SLFKNRERV------------PAKGY 253 ++ P GFD + G Y++ ++++ E V + Sbjct: 146 -RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEAN 204 Query: 254 ISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNTGSQTADNYYA 313 ++ EA+ + R PF LY A +A H P Y + G+ Y Sbjct: 205 LTQIYLQEALDFIKRQARHH-PFFLYWAVDATHAPV-------YASKPFLGTSQRGRYGD 256 Query: 314 SVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVI---DGPLPLNGAQKGYKSQTYPG 370 +V +D + +ILE L+ DNT + FTSDNGA + NG K T+ G Sbjct: 257 AVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEG 316 Query: 371 GTHTPMFMWWKGKLQPGNYD-KLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKK 429 G P WW G + G +L S MD + T+L A ++ P D +DG++LLP L + Sbjct: 317 GMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLL-QG 375 Query: 430 QGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNND 489 + + +++S + + Sbjct: 376 RLMDRPIFYYRGDTLMAATLGQHKAHFWTWTNSWENFRQGIDFCPGQNVSGVTTHNLEDH 435 Query: 490 YSLVYTVENNQLGLYKLTDLQQKDNL---AAANPQVVKEMQGVVREFIDSSQPPLSEVN 545 L + D ++ L +A + + + VV++ ++ P ++N Sbjct: 436 TKLPLIFHLGR-------DPGERFPLSFASAEYQEALSRITSVVQQHQEALVPAQPQLN 487 >UniRef50_A4XED5 Sulfatase n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=A4XED5_NOVAD Length = 462 Score = 406 bits (1043), Expect = e-111, Method: Composition-based stats. Identities = 113/526 (21%), Positives = 190/526 (36%), Gaps = 116/526 (22%) Query: 36 KATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREVVDTYK 95 + ++ T + +PNI+ + DDLGY Sbjct: 13 ISATALLSGQALAVTRKAAPERPNIVFIMADDLGYADTSATGS----------------- 55 Query: 96 IGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGVY---- 151 TP + S+ GV GY + + P+R A++TG RF + Sbjct: 56 ---------RHIRTPAIDSIGAGGVMLRQGYSSTPICSPTRTALLTGCYAQRFAIGVEEP 106 Query: 152 --SNTDAQDGIPLTETFLPELFQNHGYYTAAVGKWHLSKISNVPVPEDKQTRDYHDNFTT 209 N A G+PL + + + GY T+ VGKWHL + Sbjct: 107 LGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKWHLGEP-------------------- 146 Query: 210 FSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLF----------KNRERVPAKGYISDQLT 259 P G+D+F+G G Y+ + ++ + GY++D Sbjct: 147 ---PAHGPLKHGYDHFLGIVEGGADYFVHRMVMSGKPAGVGLAEDDAQTDRTGYLTDIFG 203 Query: 260 DEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQY----QKQFNTGSQTADNYYASV 315 DEA+ V++ +QPF L L + APH P + ++ F+ Y V Sbjct: 204 DEAVRVIEE--GGNQPFFLSLHFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMV 261 Query: 316 YSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPLPLNGAQKGYKSQTYPGGTHTP 375 ++DQ V ++L + ++G+ DNT+++FTSDNG G+K + GG P Sbjct: 262 ETMDQNVAKVLAAIDRSGKADNTVVVFTSDNGGER---FSDTWPFVGHKGEVLEGGVRVP 318 Query: 376 MFMWWKGKLQPG-NYDKLISAMDFYPTALDAADISIPKDLKLDGVSLLPWLQDKKQGEPH 434 + + W +++ G ++++ +MDF PT L A + + DG L L Sbjct: 319 LMVRWPRRIKAGSRSEQVMVSMDFLPTLLGMAGGDAARIGRFDGADLSAQLAG--AAPVT 376 Query: 435 KNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPNTEDLSQFSYTVRNNDYSLVY 494 + L W S VR D + Sbjct: 377 RTLFWRFKASE------------------------------------QAAVRQGDMKYLR 400 Query: 495 TVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREFIDSSQP 539 + L+ L+ D +++ NLA ANP V M+ + ++ P Sbjct: 401 MA--GKEYLFDLSQDEREQANLAPANPDKVNAMRALWDDWNREMMP 444 >UniRef50_C5VKQ0 N-acetylgalactosamine-6-sulfatase n=3 Tax=Prevotella RepID=C5VKQ0_9BACT Length = 520 Score = 405 bits (1041), Expect = e-111, Method: Composition-based stats. Identities = 128/557 (22%), Positives = 195/557 (35%), Gaps = 90/557 (16%) Query: 31 DDVKLKATKTNVAFSDFTPTEYSTKGKPNIIVLTMDDLGYGQLPFDKGSFDPKTMENREV 90 L T F +PNII+ +DD+G+ Sbjct: 17 SSTLLITTSIAALGISFPAKAQQVNTQPNIILFMVDDMGWQDTS---------------- 60 Query: 91 VDTYKIGIDKAIEAAQKSTPTLLSLMDEGVRFTNGYVAHGVSGPSRAAIMTGRAPARFGV 150 + TP + L EG+ FT+ Y +S PSR ++MTG AR V Sbjct: 61 ---LPFADSITANNRKYDTPNMERLASEGMMFTDAYAT-PISSPSRCSLMTGMNMARHRV 116 Query: 151 YSNTDAQDGIPL---TETFLP-----------------------ELFQNHGYYTAAVGKW 184 + T +D + LP +L +N GY+T GK Sbjct: 117 TNWTLHRDKMTDGKRDGVTLPDWNYNGIAQSGNVAHTTKAISFVQLLKNVGYHTIHCGKA 176 Query: 185 HLSKISNVPVPEDKQTRDYHDNFTTFSAEEWQPQNRGFDYFMGFHAAGTAYYNSPSLFKN 244 H I D + T R GF A SP Sbjct: 177 HWGAIDTPGENPCHFGFDVNITGTAAGGLATYLSER----NYGF--AKDGKPTSPFAIPG 230 Query: 245 RERVPAKG-YISDQLTDEAIGVVDRAKTLDQPFMLYLAYNAPHLPNDNPAPDQYQKQFNT 303 ER G + ++ LT EAI +++AK DQPF LY+++ A H+P D + Sbjct: 231 LERYWGTGIFATEALTQEAIASLEKAKKYDQPFYLYMSHYAVHVPIDRDMRFYPTYRARG 290 Query: 304 GSQTADNYYASVYSVDQGVKRILEQLKKNGQYDNTIILFTSDNGAVIDGPL-------PL 356 S+ Y + + +D+ + +++ + K G TII+F SDNG + Sbjct: 291 LSEKEAAYASLIAGMDKSLGDLMDWVAKAGLKRETIIIFMSDNGGLASSSYWRDGELYTQ 350 Query: 357 NGAQKGYKSQTYPGGTHTPMFMWWKGKLQPGN-YDKLISAMDFYPTALDAADIS-IPKDL 414 N K K Y GG P + W ++P I D YPT L A I Sbjct: 351 NAPLKSGKGSLYEGGIRVPFIVKWNNIVKPNTRSHAPIIIEDLYPTLLSMAGIKNYHVPQ 410 Query: 415 KLDGVSLLPWLQDKKQGEPHKNLTWITSYSHWFDEENIPFWDNYHKFVRHQSDDYPHNPN 474 K+DG + P L+ K+QG+ + L W Sbjct: 411 KIDGQDITPILRGKQQGDKKRQLIWNYPN---------------------------IWDG 443 Query: 475 TEDLSQFSYTVRNNDYSLVYTVENNQLGLYKLT-DLQQKDNLAAANPQVVKEMQGVVREF 533 + +R + L+Y+ Q LY L+ DL +K+NLA+++PQ+V+ + + Sbjct: 444 EGLGISLNCAIREGQWKLIYSYLTGQKELYDLSSDLSEKNNLASSHPQLVERLYRHLTSK 503 Query: 534 IDSSQPPLSEVNQEKFN 550 + V EK Sbjct: 504 LHKMNAQKPIVEGEKRK 520 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.160 0.556 Lambda K H 0.267 0.0489 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 4,016,004,080 Number of Sequences: 3077464 Number of extensions: 225651994 Number of successful extensions: 449526 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 3759 Number of HSP's successfully gapped in prelim test: 1952 Number of HSP's that attempted gapping in prelim test: 414178 Number of HSP's gapped (non-prelim): 11816 length of query: 560 length of database: 1,040,396,356 effective HSP length: 134 effective length of query: 426 effective length of database: 628,016,180 effective search space: 267534892680 effective search space used: 267534892680 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 96 (41.3 bits)