BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (350 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_C9R2J3 A/G-specific adenine glycosylase n=1 Tax=Aggrega... 431 e-119 UniRef50_P44320 A/G-specific adenine glycosylase n=308 Tax=Gamma... 410 e-113 UniRef50_A0Z8D4 A/G-specific adenine glycosylase MutY n=2 Tax=Ga... 370 e-101 UniRef50_B8KUN1 A/G-specific adenine glycosylase n=2 Tax=Gammapr... 360 6e-98 UniRef50_Q3KJJ2 A/G-specific adenine glycosylase n=5 Tax=Gammapr... 359 1e-97 UniRef50_UPI0000E105D2 A/G-specific adenine glycosylase n=1 Tax=... 353 4e-96 UniRef50_C7R6W3 A/G-specific adenine glycosylase n=1 Tax=Kangiel... 353 5e-96 UniRef50_A9KCK4 A/G-specific adenine DNA glycosylase n=6 Tax=Cox... 340 5e-92 UniRef50_Q2SMB7 A/G-specific adenine glycosylase n=10 Tax=Gammap... 332 8e-90 UniRef50_Q89A45 A/G-specific adenine glycosylase n=6 Tax=Buchner... 327 3e-88 UniRef50_Q3J8X1 A/G-specific DNA-adenine glycosylase n=13 Tax=Ga... 326 9e-88 UniRef50_B5JTP2 A/G-specific adenine glycosylase n=1 Tax=gamma p... 324 3e-87 UniRef50_A8PLA9 A/G-specific adenine glycosylase n=1 Tax=Rickett... 323 5e-87 UniRef50_B0RRL4 A/G-specific adenine glycosylase n=15 Tax=Gammap... 317 4e-85 UniRef50_D1RGZ1 A/G-specific adenine glycosylase n=1 Tax=Legione... 312 1e-83 UniRef50_Q2YCY4 A/G-specific DNA-adenine glycosylase n=4 Tax=Nit... 310 4e-83 UniRef50_C6N1N5 A/G specific adenine glycosylase n=1 Tax=Legione... 308 2e-82 UniRef50_Q7VRG8 A/G-specific adenine glycosylase n=2 Tax=Candida... 305 1e-81 UniRef50_A1WW55 A/G-specific DNA-adenine glycosylase n=1 Tax=Hal... 287 3e-76 UniRef50_D0L1V0 A/G-specific adenine glycosylase n=1 Tax=Halothi... 286 8e-76 UniRef50_A1KUL5 Adenine glycosylase n=32 Tax=Neisseriaceae RepID... 285 2e-75 UniRef50_A5EV94 A-G-specific adenine glycosylase n=1 Tax=Dichelo... 278 2e-73 UniRef50_C3X6B5 A/G-specific adenine glycosylase n=1 Tax=Oxaloba... 275 1e-72 UniRef50_A1VKP4 A/G-specific DNA-adenine glycosylase n=6 Tax=Com... 275 2e-72 UniRef50_UPI0001BC503C adenine glycosylase n=1 Tax=Neisseria muc... 272 1e-71 UniRef50_Q3A7R2 A/G-specific DNA-adenine glycosylase n=2 Tax=Des... 268 2e-70 UniRef50_UPI0000E87CC6 A/G-specific adenine glycosylase n=1 Tax=... 267 4e-70 UniRef50_A3MA77 A/G specific adenine glycosylase n=18 Tax=Acinet... 265 2e-69 UniRef50_C9PAA7 A/G-specific adenine glycosylase n=2 Tax=Vibrio ... 262 1e-68 UniRef50_Q1Q8E2 A/G-specific DNA-adenine glycosylase n=4 Tax=Mor... 259 7e-68 UniRef50_A6GM64 Probable a/g-specific adenine glycosylase protei... 254 2e-66 UniRef50_A3ZT46 A/G-specific adenine glycosylase n=1 Tax=Blastop... 249 1e-64 UniRef50_D0GU15 A/G-specific adenine glycosylase n=1 Tax=Vibrio ... 244 4e-63 UniRef50_Q2LY55 A/g-specific DNA glycosylase n=1 Tax=Syntrophus ... 241 2e-62 UniRef50_Q1DAW2 A/G-specific adenine glycosylase n=1 Tax=Myxococ... 241 2e-62 UniRef50_B1Y3P3 A/G-specific adenine glycosylase n=8 Tax=Burkhol... 239 1e-61 UniRef50_Q7UYZ0 A/G-specific adenine glycosylase n=1 Tax=Rhodopi... 236 7e-61 UniRef50_B6BVY5 A/G-specific adenine glycosylase n=1 Tax=beta pr... 236 1e-60 UniRef50_C0EDR1 Putative uncharacterized protein n=2 Tax=Clostri... 235 2e-60 UniRef50_Q7NMA0 A/G-specific adenine glycosylase n=1 Tax=Gloeoba... 235 2e-60 UniRef50_A9EQ53 A/G-specific adenine glycosylase n=1 Tax=Sorangi... 234 4e-60 UniRef50_B5EQC0 A/G-specific adenine glycosylase n=3 Tax=Acidith... 234 4e-60 UniRef50_A4T064 A/G-specific adenine glycosylase n=2 Tax=Polynuc... 234 5e-60 UniRef50_D0MK94 A/G-specific adenine glycosylase n=1 Tax=Rhodoth... 230 7e-59 UniRef50_B8HXE9 A/G-specific adenine glycosylase n=31 Tax=Cyanob... 227 6e-58 UniRef50_B8FCI1 A/G-specific adenine glycosylase n=1 Tax=Desulfa... 227 6e-58 UniRef50_C5EQF4 Putative uncharacterized protein n=2 Tax=Clostri... 226 1e-57 UniRef50_D2R2B9 HhH-GPD family protein n=1 Tax=Pirellula staleyi... 225 1e-57 UniRef50_B7GII9 A/G-specific adenine glycosylase n=76 Tax=Bacill... 225 2e-57 UniRef50_C1ZJ43 A/G-specific DNA-adenine glycosylase n=1 Tax=Pla... 225 2e-57 UniRef50_C8WEE4 A/G-specific adenine glycosylase n=3 Tax=Zymomon... 225 2e-57 UniRef50_Q67NN1 A/G-specific adenine glycosylase n=1 Tax=Symbiob... 223 6e-57 UniRef50_A7HQZ1 A/G-specific adenine glycosylase n=45 Tax=Alphap... 222 2e-56 UniRef50_C0QGE4 MutY n=1 Tax=Desulfobacterium autotrophicum HRM2... 221 4e-56 UniRef50_Q2KBH8 A/G-specific adenine glycosylase protein n=11 Ta... 221 4e-56 UniRef50_B3EUI2 A/G-specific adenine glycosylase n=1 Tax=Candida... 219 1e-55 UniRef50_A6CCE7 A/G-specific adenine glycosylase n=1 Tax=Plancto... 219 2e-55 UniRef50_Q2S0Q6 A/G-specific adenine glycosylase n=1 Tax=Salinib... 219 2e-55 UniRef50_C9LNH1 A/G-specific adenine glycosylase n=1 Tax=Dialist... 218 2e-55 UniRef50_C9MR28 A/G-specific adenine glycosylase n=4 Tax=Prevote... 218 3e-55 UniRef50_A3EPM8 Putative A/G-specific DNA glycosylase n=2 Tax=Le... 218 4e-55 UniRef50_A2RLW5 A/G-specific adenine glycosylase n=4 Tax=Lactoco... 217 6e-55 UniRef50_C5VJP5 A/G-specific adenine glycosylase n=1 Tax=Prevote... 216 1e-54 UniRef50_A6TW33 A/G-specific adenine glycosylase n=1 Tax=Alkalip... 215 2e-54 UniRef50_A0LLV9 A/G-specific DNA-adenine glycosylase n=1 Tax=Syn... 215 2e-54 UniRef50_A0L468 A/G-specific DNA-adenine glycosylase n=1 Tax=Mag... 214 3e-54 UniRef50_B9XFP7 A/G-specific adenine glycosylase n=1 Tax=bacteri... 214 3e-54 UniRef50_A6LDG5 A/G-specific adenine glycosylase n=33 Tax=Bacter... 214 5e-54 UniRef50_B1LXG3 A/G-specific adenine glycosylase n=74 Tax=Bacter... 214 5e-54 UniRef50_D2MN98 Putative A/G-specific adenine glycosylase n=1 Ta... 213 6e-54 UniRef50_Q5WIF8 A/G-specific adenine DNA glycosylase n=11 Tax=Ba... 213 7e-54 UniRef50_D2RMB6 A/G-specific adenine glycosylase n=2 Tax=Acidami... 213 8e-54 UniRef50_C8WS48 A/G-specific adenine glycosylase n=2 Tax=Alicycl... 213 8e-54 UniRef50_A0M7B9 A/G-specific adenine glycosylase n=16 Tax=Bacter... 213 1e-53 UniRef50_D1N1B7 A/G-specific adenine glycosylase n=2 Tax=Bacteri... 211 4e-53 UniRef50_C3J8R7 A/G-specific adenine glycosylase n=2 Tax=Bacteri... 211 4e-53 UniRef50_A8R947 Putative uncharacterized protein n=1 Tax=Eubacte... 210 5e-53 UniRef50_A9NG45 A/G-specific adenine DNA glycosylase n=1 Tax=Ach... 210 5e-53 UniRef50_C4L361 A/G-specific adenine glycosylase n=1 Tax=Exiguob... 210 6e-53 UniRef50_Q03V36 A/G-specific DNA-adenine glycosylase n=2 Tax=Leu... 210 7e-53 UniRef50_B9E7Y8 A/G-specific adenine glycosylase homolog n=4 Tax... 210 8e-53 UniRef50_Q1MS05 A/G-specific DNA glycosylase n=1 Tax=Lawsonia in... 209 9e-53 UniRef50_C6ACA0 A/G-specific adenine glycosylase MutY n=6 Tax=Rh... 209 1e-52 UniRef50_Q2G9R8 A/G-specific DNA-adenine glycosylase n=7 Tax=Alp... 208 2e-52 UniRef50_Q8CV61 A/G-specific adenine glycosylase n=1 Tax=Oceanob... 208 3e-52 UniRef50_C9LHH9 A/G-specific adenine glycosylase n=4 Tax=Bactero... 208 3e-52 UniRef50_B0MYW3 Putative uncharacterized protein n=1 Tax=Alistip... 207 3e-52 UniRef50_Q3SPD3 A/G-specific DNA-adenine glycosylase n=2 Tax=Nit... 207 4e-52 UniRef50_C9LTK5 A/G-specific adenine glycosylase n=1 Tax=Selenom... 207 4e-52 UniRef50_C2H8F6 A/G-specific adenine glycosylase n=16 Tax=Entero... 207 4e-52 UniRef50_C6XRC5 HhH-GPD family protein n=1 Tax=Hirschia baltica ... 207 6e-52 UniRef50_D1W5W3 A/G-specific adenine glycosylase n=1 Tax=Prevote... 206 9e-52 UniRef50_C6X0N8 A/G-specific adenine glycosylase n=1 Tax=Flavoba... 206 1e-51 UniRef50_A7VHH6 Putative uncharacterized protein n=1 Tax=Clostri... 206 1e-51 UniRef50_B8FVN1 A/G-specific adenine glycosylase n=2 Tax=Desulfi... 205 2e-51 UniRef50_Q3ANF7 Mutator mutT protein n=19 Tax=Cyanobacteria RepI... 205 2e-51 UniRef50_C7PL91 A/G-specific adenine glycosylase n=1 Tax=Chitino... 204 3e-51 UniRef50_C6D1C3 A/G-specific adenine glycosylase n=4 Tax=Bacilla... 204 3e-51 UniRef50_A8MJ42 A/G-specific adenine glycosylase n=2 Tax=Firmicu... 204 3e-51 UniRef50_C4G9U8 Putative uncharacterized protein n=1 Tax=Shuttle... 204 4e-51 UniRef50_C2MB00 A/G-specific adenine glycosylase n=1 Tax=Porphyr... 204 5e-51 UniRef50_B3QVZ4 A/G-specific adenine glycosylase n=1 Tax=Chloroh... 203 6e-51 UniRef50_B8DFN3 A/G-specific adenine glycosylase n=14 Tax=Lister... 202 1e-50 UniRef50_UPI0001910D7D adenine DNA glycosylase n=1 Tax=Salmonell... 202 1e-50 UniRef50_C2KZ90 Putative uncharacterized protein n=1 Tax=Oribact... 202 1e-50 UniRef50_B4D777 HhH-GPD family protein n=1 Tax=Chthoniobacter fl... 202 2e-50 UniRef50_A4VV79 A/G-specific DNA glycosylase n=98 Tax=Lactobacil... 201 2e-50 UniRef50_Q11NU4 A/G-specific adenine glycosylase n=5 Tax=Bactero... 201 3e-50 UniRef50_C6LA88 A/G-specific adenine glycosylase n=20 Tax=Clostr... 201 3e-50 UniRef50_A6NQR4 Putative uncharacterized protein n=1 Tax=Bactero... 201 3e-50 UniRef50_UPI00016C4021 A/G-specific adenine glycosylase n=1 Tax=... 201 3e-50 UniRef50_UPI000196B020 hypothetical protein CATMIT_00928 n=1 Tax... 201 4e-50 UniRef50_Q9UIF7 A/G-specific adenine DNA glycosylase n=51 Tax=Ch... 201 5e-50 UniRef50_C6HWU0 Putative A/G-specific DNA glycosylase n=1 Tax=Le... 200 7e-50 UniRef50_Q02CA8 A/G-specific DNA-adenine glycosylase n=1 Tax=Can... 199 9e-50 UniRef50_C6XY88 Helix-hairpin-helix motif protein n=4 Tax=Sphing... 199 1e-49 UniRef50_B9Y8G7 Putative uncharacterized protein n=1 Tax=Holdema... 199 1e-49 UniRef50_O31584 Probable A/G-specific adenine glycosylase yfhQ n... 199 2e-49 UniRef50_C8US26 A/G-specific DNA glycosylase n=18 Tax=Lactobacil... 198 2e-49 UniRef50_Q04T89 A/G-specific DNA glycosylase n=4 Tax=Leptospira ... 197 4e-49 UniRef50_C0BLH9 A/G-specific adenine glycosylase n=2 Tax=Flavoba... 197 6e-49 UniRef50_D1U9Y5 A/G-specific adenine glycosylase n=1 Tax=Desulfo... 196 8e-49 UniRef50_C9PXZ8 A/G-specific adenine glycosylase n=2 Tax=Prevote... 196 1e-48 UniRef50_A5IU34 A/G-specific DNA-adenine glycosylase n=56 Tax=St... 195 2e-48 UniRef50_Q8A7M6 A/G-specific adenine glycosylase n=2 Tax=Bactero... 195 2e-48 UniRef50_C4FU23 Putative uncharacterized protein n=1 Tax=Catonel... 195 2e-48 UniRef50_Q2N933 Putative uncharacterized protein n=1 Tax=Erythro... 195 3e-48 UniRef50_D2QT13 A/G-specific adenine glycosylase n=1 Tax=Spiroso... 195 3e-48 UniRef50_A9BD72 A/G-specific DNA glycosylase n=3 Tax=Cyanobacter... 194 4e-48 UniRef50_UPI0001BC63FA A/G-specific adenine glycosylase n=3 Tax=... 194 4e-48 UniRef50_Q0EVU8 Adenine glycosylase n=1 Tax=Mariprofundus ferroo... 194 6e-48 UniRef50_Q1ITJ1 A/G-specific DNA glycosylase n=1 Tax=Candidatus ... 193 7e-48 UniRef50_Q1GD89 A/G-specific DNA-adenine glycosylase n=11 Tax=Rh... 193 8e-48 UniRef50_A8ZZH0 A/G-specific adenine glycosylase n=1 Tax=Desulfo... 193 9e-48 UniRef50_B8J1Y9 A/G-specific adenine glycosylase n=12 Tax=Desulf... 192 1e-47 UniRef50_B7CBH3 Putative uncharacterized protein n=1 Tax=Eubacte... 192 1e-47 UniRef50_Q7VGS8 A/G-specific adenine glycosylase n=537 Tax=Helic... 191 3e-47 UniRef50_B8JJK1 Novel protein similar to H.sapiens MUTYH, mutY h... 190 6e-47 UniRef50_A6G959 A/G-specific adenine glycosylase n=1 Tax=Plesioc... 190 7e-47 UniRef50_B0SFS5 A/G-specific DNA glycosylase n=2 Tax=Leptospira ... 190 8e-47 UniRef50_C6XF00 A/G-specific adenine glycosylase n=1 Tax=Candida... 189 2e-46 UniRef50_C9KJM4 A/G-specific adenine glycosylase n=3 Tax=Clostri... 188 3e-46 UniRef50_Q0P4V1 Putative uncharacterized protein MGC145569 n=1 T... 188 3e-46 UniRef50_C0YGU7 Possible A/G-specific adenine glycosylase n=1 Ta... 187 3e-46 UniRef50_C4V5S7 Adenine glycosylase n=1 Tax=Selenomonas flueggei... 187 4e-46 UniRef50_Q6AKD3 Related to A/G-specific adenine glycosylase n=1 ... 187 4e-46 UniRef50_Q03P68 A/G-specific DNA-adenine glycosylase n=2 Tax=Lac... 187 5e-46 UniRef50_Q3KMR2 A/G-specific adenine DNA glycosylase n=9 Tax=Chl... 187 6e-46 UniRef50_C7G751 A/G-specific adenine glycosylase n=1 Tax=Rosebur... 186 8e-46 UniRef50_Q6MHU9 A/G-specific adenine glycosylase n=1 Tax=Bdellov... 186 9e-46 UniRef50_A6FJZ0 A/G-specific adenine glycosylase n=7 Tax=Alphapr... 185 2e-45 UniRef50_A9T1M0 Predicted protein n=1 Tax=Physcomitrella patens ... 184 3e-45 UniRef50_B7A8Y1 HhH-GPD family protein n=1 Tax=Thermus aquaticus... 184 4e-45 UniRef50_C1F8W0 A/G-specific adenine glycosylase, putative n=1 T... 183 7e-45 UniRef50_D1BPM8 A/G-specific adenine glycosylase n=3 Tax=Veillon... 183 7e-45 UniRef50_C1XGJ6 A/G-specific DNA-adenine glycosylase n=1 Tax=Mei... 183 8e-45 UniRef50_Q7MUV2 A/G-specific adenine glycosylase n=2 Tax=Porphyr... 182 1e-44 UniRef50_C8P1Q1 A/G-specific adenine glycosylase n=1 Tax=Erysipe... 182 1e-44 UniRef50_A7RIG3 Predicted protein n=1 Tax=Nematostella vectensis... 182 2e-44 UniRef50_A6Q673 A/G-specific adenine glycosylase n=2 Tax=Epsilon... 181 3e-44 UniRef50_Q254A0 Adenine/guanine glycosylase n=5 Tax=Chlamydophil... 181 4e-44 UniRef50_Q5HN66 A/G-specific adenine glycosylase n=5 Tax=Staphyl... 181 4e-44 UniRef50_A1W1M2 A/G-specific adenine glycosylase n=13 Tax=Campyl... 180 6e-44 UniRef50_C7JG78 DNA glycosylase A/G-specific MutY n=12 Tax=Alpha... 179 1e-43 UniRef50_B0D6Q9 Predicted protein n=1 Tax=Laccaria bicolor S238N... 179 2e-43 UniRef50_Q5SH32 A/G-specific adenine glycosylase (MutY) n=2 Tax=... 179 2e-43 UniRef50_C5YTP8 Putative uncharacterized protein Sb08g006760 n=1... 179 2e-43 UniRef50_Q1J2G1 A/G-specific adenine glycosylase n=3 Tax=Deinoco... 178 3e-43 UniRef50_B2UNR9 HhH-GPD family protein n=1 Tax=Akkermansia mucin... 177 4e-43 UniRef50_Q9ZMT8 A/G-SPECIFIC ADENINE GLYCOSYLASE n=21 Tax=Helico... 177 5e-43 UniRef50_C5ZWA7 A/G-specific adenine glycosylase n=3 Tax=Helicob... 176 9e-43 UniRef50_D0J8Q9 A/G-specific adenine glycosylase n=2 Tax=Blattab... 174 5e-42 UniRef50_UPI0001746831 A/G-specific adenine glycosylase n=1 Tax=... 173 8e-42 UniRef50_Q6MF36 Putative A/G-specific adenine glycosylase, mutY ... 172 2e-41 UniRef50_C3XHP3 A/G-specific adenine glycosylase n=1 Tax=Helicob... 171 4e-41 UniRef50_B2AMG7 Predicted CDS Pa_5_7310 n=1 Tax=Podospora anseri... 170 8e-41 UniRef50_Q2QW21 Os12g0211400 protein n=6 Tax=Poaceae RepID=Q2QW2... 169 9e-41 UniRef50_Q1ZXP7 Putative uncharacterized protein MYH n=1 Tax=Dic... 168 3e-40 UniRef50_C9RK91 HhH-GPD family protein n=1 Tax=Fibrobacter succi... 167 5e-40 UniRef50_B3T7B3 Putative HhH-GPD superfamily base excision DNA r... 167 6e-40 UniRef50_B7P4H9 A/G-specific adenine glycosylase muty, putative ... 166 9e-40 UniRef50_UPI0001926FFC PREDICTED: similar to mutY homolog, parti... 166 1e-39 UniRef50_C4PWX5 A/g-specific adenine dna glycosylase (Ec 3.2.2.-... 166 1e-39 UniRef50_A9KT85 A/G-specific adenine glycosylase n=3 Tax=Clostri... 166 1e-39 UniRef50_B2W3B1 A/G-specific adenine glycosylase n=2 Tax=Pleospo... 165 3e-39 UniRef50_UPI0000E49729 PREDICTED: hypothetical protein, partial ... 164 4e-39 UniRef50_A1A5M6 Mutyh protein n=4 Tax=Eukaryota RepID=A1A5M6_RAT 164 4e-39 UniRef50_B9L504 Catalytic Domain Of MutyY n=4 Tax=Bacteria RepID... 164 7e-39 UniRef50_B3DVQ4 A/G-specific DNA glycosylase n=1 Tax=Methylacidi... 163 1e-38 UniRef50_Q2IJ90 A/G-specific DNA-adenine glycosylase n=5 Tax=Cys... 162 2e-38 UniRef50_A4NPF3 A/G-specific adenine glycosylase n=3 Tax=Haemoph... 162 2e-38 UniRef50_A8I815 DNA repair glycosylase (Fragment) n=1 Tax=Chlamy... 161 3e-38 UniRef50_A6DT67 Adenine glycosylase n=1 Tax=Lentisphaera araneos... 161 3e-38 UniRef50_B9KEL2 A/G-specific adenine glycosylase n=3 Tax=Campylo... 161 3e-38 UniRef50_B1ZMV1 HhH-GPD family protein n=2 Tax=Verrucomicrobia R... 161 3e-38 UniRef50_A6Q1G1 A/G-specific adenine glycosylase n=1 Tax=Nitrati... 160 6e-38 UniRef50_D1PLU8 A/G-specific adenine glycosylase n=2 Tax=Ruminoc... 160 8e-38 UniRef50_P46230 A/G-specific adenine glycosylase (Fragment) n=14... 159 1e-37 UniRef50_Q10159 A/G-specific adenine DNA glycosylase n=1 Tax=Sch... 159 1e-37 UniRef50_Q1V0L8 A/G-specific adenine glycosylase n=3 Tax=Candida... 158 3e-37 UniRef50_A8Q4W7 Putative uncharacterized protein n=1 Tax=Malasse... 157 4e-37 UniRef50_C0ACJ1 HhH-GPD family protein n=1 Tax=Opitutaceae bacte... 156 1e-36 UniRef50_A7NN47 HhH-GPD family protein n=2 Tax=Bacteria RepID=A7... 154 3e-36 UniRef50_D2VUC5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 154 5e-36 UniRef50_UPI000034F1ED adenine-DNA glycosylase-related / MYH-rel... 154 6e-36 UniRef50_Q6IEF2 A/G-specific adenine glycosylase, putative n=3 T... 153 7e-36 UniRef50_B9SF01 A/G-specific adenine glycosylase muty, putative ... 153 8e-36 UniRef50_Q7M9R1 A/G-SPECIFIC ADENINE GLYCOSYLASE EC 3.2.2 n=1 Ta... 152 1e-35 UniRef50_C9M6M4 A/G-specific adenine glycosylase n=1 Tax=Jonquet... 152 2e-35 UniRef50_A9UV22 Predicted protein (Fragment) n=1 Tax=Monosiga br... 152 2e-35 UniRef50_Q4P7F9 Putative uncharacterized protein n=1 Tax=Ustilag... 152 2e-35 UniRef50_C7YS47 Putative uncharacterized protein n=2 Tax=Nectria... 149 2e-34 UniRef50_A9AYZ2 HhH-GPD family protein n=5 Tax=Chloroflexi (clas... 147 5e-34 UniRef50_A4AHC9 Adenine glycosylase n=1 Tax=marine actinobacteri... 145 2e-33 UniRef50_B6K934 Helix-hairpin-helix motif-containing protein n=3... 145 2e-33 UniRef50_B7G7W7 Predicted protein n=1 Tax=Phaeodactylum tricornu... 144 5e-33 UniRef50_D1Y8G9 A/G-specific DNA-adenine glycosylase n=1 Tax=Pyr... 144 5e-33 UniRef50_D0RRM3 A/G-specific adenine glycosylase n=1 Tax=alpha p... 144 6e-33 UniRef50_UPI0000510111 A/G-specific adenine glycosylase n=1 Tax=... 142 1e-32 UniRef50_Q4Q851 A/G-specific adenine glycosylase, putative n=3 T... 142 2e-32 UniRef50_D1Z9Z8 Whole genome shotgun sequence assembly, scaffold... 141 3e-32 UniRef50_B6K1Z7 A/G-specific adenine DNA glycosylase n=1 Tax=Sch... 140 7e-32 UniRef50_C8NK18 A/G-specific adenine glycosylase n=8 Tax=Coryneb... 139 1e-31 UniRef50_D1V787 HhH-GPD family protein n=4 Tax=Actinomycetales R... 137 6e-31 UniRef50_A1RYX1 HhH-GPD family protein n=1 Tax=Thermofilum pende... 137 7e-31 UniRef50_Q5KQ30 A/G-specific adenine DNA glycosylase, putative n... 136 1e-30 UniRef50_B3DTE4 A/G-specific DNA glycosylase n=6 Tax=Bifidobacte... 135 2e-30 UniRef50_C1EFR0 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 135 2e-30 UniRef50_A0LRD5 HhH-GPD family protein n=5 Tax=Actinomycetales R... 134 3e-30 UniRef50_C6A996 A/G-specific adenine glycosylase n=9 Tax=Bifidob... 134 4e-30 UniRef50_B7KLS5 HhH-GPD family protein n=1 Tax=Cyanothece sp. PC... 134 5e-30 UniRef50_B7GUG9 HhH-GPD family protein n=2 Tax=Bifidobacterium R... 134 7e-30 UniRef50_UPI0001699F40 A/G-specific adenine glycosylase n=1 Tax=... 132 2e-29 UniRef50_A0JRA8 HhH-GPD family protein n=19 Tax=Actinomycetales ... 132 3e-29 UniRef50_A2PD72 A/G-specific adenine glycosylase n=1 Tax=Vibrio ... 130 6e-29 UniRef50_Q2QW16 Retrotransposon protein, putative, unclassified ... 130 7e-29 UniRef50_C9SE24 A/G-specific adenine DNA glycosylase n=2 Tax=Sor... 129 1e-28 UniRef50_Q97IY7 A/G-specific DNA glycosylase n=1 Tax=Clostridium... 129 1e-28 UniRef50_D1BVG2 HhH-GPD family protein n=39 Tax=Actinomycetales ... 129 2e-28 UniRef50_Q0W1Z4 Putative DNA glycosylase n=1 Tax=uncultured meth... 127 6e-28 UniRef50_Q9P9L6 U/G & T/G mismatch specific glycosylase (Pa-MIG)... 127 7e-28 UniRef50_A1A319 Probable A/G-specific adenine glycosylase n=3 Ta... 127 8e-28 UniRef50_C6WGJ6 HhH-GPD family protein n=48 Tax=Actinomycetales ... 125 3e-27 UniRef50_C8RS99 A/G-specific DNA glycosylase n=1 Tax=Corynebacte... 122 1e-26 UniRef50_B8P2D0 Predicted protein n=1 Tax=Postia placenta Mad-69... 122 2e-26 UniRef50_Q8II68 A/G-specific adenine glycosylase, putative n=1 T... 122 2e-26 UniRef50_B3L5A3 A/g-specific adenine glycosylase, putative n=6 T... 119 2e-25 UniRef50_Q027X1 Helix-hairpin-helix motif n=1 Tax=Candidatus Sol... 116 1e-24 UniRef50_C0QTN1 T/G-specific DNA glycosylase n=1 Tax=Persephonel... 116 1e-24 UniRef50_A1AUQ8 HhH-GPD family protein n=2 Tax=Desulfuromonadale... 116 1e-24 >UniRef50_C9R2J3 A/G-specific adenine glycosylase n=1 Tax=Aggregatibacter actinomycetemcomitans D11S-1 RepID=C9R2J3_AGGAD Length = 396 Score = 431 bits (1108), Expect = e-119, Method: Compositional matrix adjust. Identities = 212/359 (59%), Positives = 255/359 (71%), Gaps = 23/359 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ VL WY K+GRK LPWQ +KT Y VWLSEVMLQQTQV TVIPYFERF+ FP +T L Sbjct: 28 FAHAVLTWYAKFGRKNLPWQQNKTLYGVWLSEVMLQQTQVTTVIPYFERFVKTFPNLTAL 87 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANAPLDEVLHLWTGLGYYARARNLHKAAQ + HGG+FP FE+V ALPGVGRSTAGA+ Sbjct: 88 ANAPLDEVLHLWTGLGYYARARNLHKAAQIMRDQHGGEFPTEFEQVWALPGVGRSTAGAV 147 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS L +PILDGNVKRVL+R +AVSGWPG+K+ E+ LW L+ QVTP V FNQAMM Sbjct: 148 LSSCLNAPYPILDGNVKRVLSRYFAVSGWPGEKKTEDHLWHLTAQVTPTEQVADFNQAMM 207 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAM+CTRSKPKC LCPL++ C A A +W +YPG+KPK+ LPER YFLLL+ + ++ Sbjct: 208 DIGAMVCTRSKPKCELCPLKSDCKANAEQNWQVYPGRKPKKVLPERESYFLLLEKDGKIA 267 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR +GLWGGLYCFPQF++++ L +LA I + TAFRHTFSHFHLDI P++ Sbjct: 268 LEQRDNAGLWGGLYCFPQFSEKQELLAYLAANGI--RQYQEWTAFRHTFSHFHLDIYPIY 325 Query: 306 LPVSSFTGCMDEGNA---------------------LWYNLAQPPSVGLAAPVERLLQQ 343 V + +E + WY+ P +GLAAPV+ LL Q Sbjct: 326 AQVDTPKALAEENRSHRQKVAETAGEYQSDLLSAVKYWYDPQNPEPIGLAAPVKNLLTQ 384 >UniRef50_P44320 A/G-specific adenine glycosylase n=308 Tax=Gammaproteobacteria RepID=MUTY_HAEIN Length = 378 Score = 410 bits (1054), Expect = e-113, Method: Compositional matrix adjust. Identities = 210/360 (58%), Positives = 253/360 (70%), Gaps = 24/360 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ VL WYDK+GRK LPWQ +KT Y VWLSEVMLQQTQVATVIPYFERF+ FP +T L Sbjct: 11 FAKSVLAWYDKFGRKHLPWQQNKTLYGVWLSEVMLQQTQVATVIPYFERFIKTFPNITAL 70 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA DEVLHLWTGLGYYARARNLHKAAQ+V G FP FE+V AL GVGRSTAGAI Sbjct: 71 ANASQDEVLHLWTGLGYYARARNLHKAAQKVRDEFNGNFPTNFEQVWALSGVGRSTAGAI 130 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS L + +PILDGNVKRVLAR +AV GW G+K+VEN+LW+L+EQVTP V FNQAMM Sbjct: 131 LSSVLNQPYPILDGNVKRVLARYFAVEGWSGEKKVENRLWALTEQVTPTTRVADFNQAMM 190 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAM+C R+KPKC LCPL C+A N +W +P KKPK+ +PE+T YFL+L +V Sbjct: 191 DIGAMVCMRTKPKCDLCPLNIDCLAYKNTNWEKFPAKKPKKAMPEKTTYFLILSKNGKVC 250 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR SGLWGGL+CFPQF D+ SL +LAQ ++ + + +FRHTFSHFHLDI P++ Sbjct: 251 LEQRENSGLWGGLFCFPQFEDKSSLLHFLAQEKVT--HYQEWPSFRHTFSHFHLDIHPIY 308 Query: 306 LPVSSFTGCMDEGN---------------------ALWYNLAQPPSVGLAAPVERLLQQL 344 + S T C+++ N WY+ P +GLA PV+ LL Q Sbjct: 309 AEMES-TLCVEQANLDWRKVMESTKEYQSNLSSAVKYWYDPQNPEPIGLAQPVKNLLIQF 367 >UniRef50_A0Z8D4 A/G-specific adenine glycosylase MutY n=2 Tax=Gammaproteobacteria RepID=A0Z8D4_9GAMM Length = 349 Score = 370 bits (951), Expect = e-101, Method: Compositional matrix adjust. Identities = 182/344 (52%), Positives = 239/344 (69%), Gaps = 3/344 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F++++LDW+D +GR LPWQ KT Y+VWLSE+MLQQTQV TVIPYF+RF+ RFP+V Sbjct: 2 TSFASRLLDWFDLHGRHDLPWQKSKTAYRVWLSEIMLQQTQVQTVIPYFDRFLTRFPSVE 61 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA AP D+VLHLWTGLGYYARARNLH+AA+ V + GG+FP+ + + LPGVGRSTAG Sbjct: 62 DLAAAPEDDVLHLWTGLGYYARARNLHRAAKLVVSDFGGQFPQDLDGLLQLPGVGRSTAG 121 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILSL++G PILDGNVKRVLAR VSGWPG NKLW+L+E+ TP V + Q Sbjct: 122 AILSLAMGIRAPILDGNVKRVLARHDVVSGWPGTTTTLNKLWALAEEYTPTARVADYTQG 181 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED- 242 +MDLGA +CTRS P C CPL N C A + A +PGKKP++TLPER YF++ Q D Sbjct: 182 IMDLGATLCTRSAPGCQHCPLTNTCQARSRCEQAHFPGKKPRKTLPERETYFVIFQRPDG 241 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT-QLTAFRHTFSHFHLDI 301 VL+ +RP G+WGGL+CFP+ D ++ LA + +A T L +H+FSH+ L I Sbjct: 242 TVLMEKRPEQGIWGGLWCFPESQDGDASPAHLATMRASAQEPTIYLDPIKHSFSHYRLTI 301 Query: 302 VPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 P+ L + + G + ++ W + P S+GL PV ++L+ L Sbjct: 302 RPILLKIIRASEGVAEVNSSQWIDPQTPGSIGLPVPVTQVLKTL 345 >UniRef50_B8KUN1 A/G-specific adenine glycosylase n=2 Tax=Gammaproteobacteria RepID=B8KUN1_9GAMM Length = 363 Score = 360 bits (923), Expect = 6e-98, Method: Compositional matrix adjust. Identities = 178/341 (52%), Positives = 231/341 (67%), Gaps = 4/341 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++++L+W+D +GRK LPWQ D TPY+VW+SE+MLQQTQVATVIPY+ERFM FPTV L Sbjct: 11 LASRLLNWFDDHGRKDLPWQTDTTPYRVWISEIMLQQTQVATVIPYYERFMGEFPTVEAL 70 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 + AP D+VL LW+GLGYYARARNLH+ A+ V GG+FP+T + + LPG+GRSTAGAI Sbjct: 71 SAAPEDDVLKLWSGLGYYARARNLHRGAKMVTGDLGGEFPDTVDGLCTLPGIGRSTAGAI 130 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S+++G PILDGNVKRVLAR +AV GWPGK V +LW +E TP V + QA+M Sbjct: 131 ISIAMGGRAPILDGNVKRVLARHHAVDGWPGKSGVAAELWGHAEAHTPNTRVADYTQAIM 190 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF-LLLQHEDEV 244 DLGA +CTR +P+C +CPL + C A YPGKKPK+T P R+ +F L++ H V Sbjct: 191 DLGATLCTRRRPQCLVCPLVDTCHAGRAGDPEQYPGKKPKRTTPTRSAFFALVVDHTGAV 250 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD-NLTQLTAFRHTFSHFHLDIVP 303 LL +RPPSG+WGGL+ PQ D L LAQR D + QL HTFSHF L I P Sbjct: 251 LLQKRPPSGIWGGLWSLPQSPDRGELEN-LAQRFGVVDPEVEQLPMIEHTFSHFRLAITP 309 Query: 304 MWLPVSS-FTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + +S+ T + W+ Q G+ APV +LL + Sbjct: 310 LRIRLSTAHTSVSEPTEYKWHQFTQSLPGGIPAPVAKLLAE 350 >UniRef50_Q3KJJ2 A/G-specific adenine glycosylase n=5 Tax=Gammaproteobacteria RepID=Q3KJJ2_PSEPF Length = 355 Score = 359 bits (921), Expect = 1e-97, Method: Compositional matrix adjust. Identities = 182/344 (52%), Positives = 234/344 (68%), Gaps = 1/344 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M+A QFS VL+W+D++GR LPWQ + PY+VW+SE+MLQQTQV+TV+ YF+RFMA P Sbjct: 1 MRAEQFSTAVLEWFDRHGRHDLPWQQNINPYRVWVSEIMLQQTQVSTVLNYFDRFMAALP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA AP DEVLHLWTGLGYY RARNL K A+ V + +GG+FP E++ LPG+G S Sbjct: 61 TVEALAAAPEDEVLHLWTGLGYYTRARNLQKTAKIVVSQYGGEFPRDVEKLTELPGIGLS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S+S+G PILDGNVKRVLAR A G+PG+ +V +LW+ +E+ TP V + Sbjct: 121 TAGAIGSISMGLRAPILDGNVKRVLARFTAQEGYPGEPKVAKQLWANAERFTPQDRVNAY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QAMMDLGA +CTRSKP C LCPL+ GC A YP KP++ +P++ LL + Sbjct: 181 TQAMMDLGATLCTRSKPSCLLCPLEKGCEAHMLGLETRYPIPKPRKAIPQKRTLMPLLAN 240 Query: 241 ED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 ED +LL +RP SGLWGGL+ P+ D + L+ AQ + L + HTFSHF L Sbjct: 241 EDGAILLYRRPSSGLWGGLWSLPELDDLDDLQHLAAQHSLELGEQQALPSLVHTFSHFQL 300 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 I P + V G + E + LWYNLA PP +GLAAPV+ LL++ Sbjct: 301 SIEPWLVQVQEAGGHVAEADWLWYNLATPPRLGLAAPVKTLLER 344 >UniRef50_UPI0000E105D2 A/G-specific adenine glycosylase n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105D2 Length = 375 Score = 353 bits (907), Expect = 4e-96, Method: Compositional matrix adjust. Identities = 182/372 (48%), Positives = 236/372 (63%), Gaps = 29/372 (7%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A F+ L W+D +GRK LPWQ + + Y VW+SE+MLQQTQV TVIPYFERFM FPTV Sbjct: 2 AFDFAQTALAWFDLHGRKHLPWQQNISAYSVWISEIMLQQTQVTTVIPYFERFMRSFPTV 61 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 DLANAP ++VLH WTGLGYYARARNLHKAAQQ+A H G FP FEEV ALPG+GRSTA Sbjct: 62 NDLANAPQEDVLHHWTGLGYYARARNLHKAAQQIAEHHNGVFPTDFEEVLALPGIGRSTA 121 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAIL+++ ++ PILDGNVKRVLAR +AV GWPG K+VE++LW + ++TP+ + + Q Sbjct: 122 GAILAIAEHQNHPILDGNVKRVLARFFAVEGWPGSKKVEDELWHFAGELTPSERIADYTQ 181 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 MMDLGA +CTRSKPKC +CPLQ+ C+A A+ + P KKPK+T+P + ++ D Sbjct: 182 VMMDLGATLCTRSKPKCEVCPLQSRCLAFASGRQSELPHKKPKKTIPSKYTTVIIPMLFD 241 Query: 243 EVLLAQRPPSGLWGGLYCF-------PQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 VL+ +RP G+WGGL+ F Q DE Q + L F+HTFS Sbjct: 242 RVLMTKRPEEGIWGGLWWFGGEFTPTAQKVDEHPTIYTALGEQFDVVSSDMLPEFKHTFS 301 Query: 296 HFHLDIVPMWLPVSS-------------------FTGC-MDEG--NALWYNLAQPPSVGL 333 HFHL I P+ L +++ TG + +G + W + QP +GL Sbjct: 302 HFHLHIQPVILYLTTDKVSEAVLSNSMPQMTLEDLTGAGIRDGQVHQRWVDYRQPADIGL 361 Query: 334 AAPVERLLQQLR 345 P + QL+ Sbjct: 362 CKPALTIFNQLQ 373 >UniRef50_C7R6W3 A/G-specific adenine glycosylase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R6W3_KANKD Length = 355 Score = 353 bits (906), Expect = 5e-96, Method: Compositional matrix adjust. Identities = 175/355 (49%), Positives = 229/355 (64%), Gaps = 23/355 (6%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A+ F+ QV+D+Y +GRK LPWQ + PY++WLSE+MLQQTQV TVIPYFERF+ FPTV Sbjct: 7 ATNFARQVIDYYHAFGRKHLPWQNTQDPYRIWLSEIMLQQTQVTTVIPYFERFVESFPTV 66 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 DLA+A D+V+ W+GLGYY+RARNLHKAA+ + T GG FP+ E + LPG+GRSTA Sbjct: 67 LDLAHATDDQVMQHWSGLGYYSRARNLHKAAKIIETEFGGDFPQDPEVIETLPGIGRSTA 126 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI S + + ILDGNVKRVLARCYA+ GWPG +V LW +E TP +NQ Sbjct: 127 GAIASFAFDQPTAILDGNVKRVLARCYAIEGWPGNGKVLKALWERAEANTPTQETAAYNQ 186 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE- 241 AMMDLGA++CTR+KP C CPL C+A NNS YPGKKPK+ P + Y+L+ +E Sbjct: 187 AMMDLGAVVCTRTKPNCPDCPLSKHCLAYHNNSIEQYPGKKPKKARPSKAVYWLISINET 246 Query: 242 ---DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 +VLL +RPPSG+WGGL+ P+ + Q+ I+ D ++QL F H FSH+ Sbjct: 247 SDQTQVLLHKRPPSGIWGGLWALPE----------VEQQDISPDKISQLDPFVHKFSHYD 296 Query: 299 LDIVPMWLPV---------SSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 L + P+ L S M+ A W++ Q +GL PV +LL +L Sbjct: 297 LQVQPLLLSSKADKALVEKKSHPSIMEPAQADWFSQNQLSEIGLPTPVSKLLSKL 351 >UniRef50_A9KCK4 A/G-specific adenine DNA glycosylase n=6 Tax=Coxiella burnetii RepID=A9KCK4_COXBN Length = 354 Score = 340 bits (872), Expect = 5e-92, Method: Compositional matrix adjust. Identities = 176/347 (50%), Positives = 233/347 (67%), Gaps = 3/347 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + QF+ VL W+D+YGR LPWQ TPY+VW+SE+MLQQTQV+TVIPYFERF+ RFP Sbjct: 1 MDSKQFAQGVLRWFDRYGRHDLPWQKKLTPYRVWVSEIMLQQTQVSTVIPYFERFIKRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA APLDE+L W+GLGYYARARNLH+AAQ + + G+FP T E +++LPG+GRS Sbjct: 61 TVGALALAPLDEILAHWSGLGYYARARNLHRAAQIIHVTYHGRFPSTVETLSSLPGIGRS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+LSL + ++ ILDGNVKRVLAR A+ ++ N LW+L+E+ TP + Sbjct: 121 TAGAVLSLGMHQYAVILDGNVKRVLARYNALDVPINQQVGINILWNLAEKYTPKNRCWDY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMMD+GAMICTR+KPKCSLCPL++ C A + +P KK K ++ Y LLL++ Sbjct: 181 NQAMMDIGAMICTRTKPKCSLCPLKSSCKAHRLSQQMNFPIKKAKTARAQKAAYLLLLRN 240 Query: 241 -EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQR-QIAADNLTQLTAFRHTFSHFH 298 E+LL +RPP+G+WGGL+ FPQ EE + +W + A + + H FSHF Sbjct: 241 SRGEILLEKRPPTGIWGGLWSFPQCPIEEDIDKWCQTKLGFEAVICERWNSIFHQFSHFE 300 Query: 299 LDIVPMWLPVSSFTGCMDEG-NALWYNLAQPPSVGLAAPVERLLQQL 344 +I P+ L + + M E +WY G+AAPV RLL+QL Sbjct: 301 FEIKPVLLQIETRQPRMMECPPQIWYKEHSALPGGIAAPVARLLKQL 347 >UniRef50_Q2SMB7 A/G-specific adenine glycosylase n=10 Tax=Gammaproteobacteria RepID=Q2SMB7_HAHCH Length = 388 Score = 332 bits (852), Expect = 8e-90, Method: Compositional matrix adjust. Identities = 164/351 (46%), Positives = 223/351 (63%), Gaps = 12/351 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F+A++L W+D++GR LPWQ +TPY VW+SE+MLQQTQV+TVIPYF +FM FPTV Sbjct: 29 NSFAARLLTWFDQHGRHDLPWQDPRTPYHVWISEIMLQQTQVSTVIPYFIKFMESFPTVA 88 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A D VL W GLGYYARARNLHKAA+ + G+FP T E + LPG+GRSTAG Sbjct: 89 ALAEADQDTVLSHWAGLGYYARARNLHKAAKTIVEKFNGEFPNTLETIQELPGIGRSTAG 148 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS+ G PILDGNVKRVL R A+ GWPGK+E+E +LW L++ TP V + QA Sbjct: 149 AILSMGFGIRAPILDGNVKRVLCRHDAIEGWPGKREIETRLWELADAYTPEERVTDYTQA 208 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED- 242 +MDLGA +CTRSKP C+ CP++ C A + P K + +P + L+LQ+ED Sbjct: 209 IMDLGATLCTRSKPACARCPMETTCQGLAQDMTDQLPTPKAGKKIPTHERFVLVLQNEDG 268 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADE----ESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 +LL +RPP+G+WGGL+ P+ ++L+ W +++ + FRHTFSH+H Sbjct: 269 HILLERRPPTGIWGGLWSLPESEKNMDLPDALQSWKEHKRLQLTQYELASPFRHTFSHYH 328 Query: 299 LDIVPMWLPVSSFTGCMD------EGNALWYNLAQPPSVGLAAPVERLLQQ 343 L + P+ + D E ++ W++ Q GL AP+E+ L Q Sbjct: 329 LILKPVIASTADIERPDDLAFIVGESDSQWFDPDQELP-GLPAPIEKWLSQ 378 >UniRef50_Q89A45 A/G-specific adenine glycosylase n=6 Tax=Buchnera aphidicola RepID=MUTY_BUCBP Length = 351 Score = 327 bits (839), Expect = 3e-88, Method: Compositional matrix adjust. Identities = 144/346 (41%), Positives = 219/346 (63%), Gaps = 3/346 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M F +L+WY +GRKTLPWQI K PYK W+SE+MLQQTQV TVIPY+ +F+ RFP Sbjct: 1 MTTLVFYQTILNWYHHFGRKTLPWQIKKNPYKTWISEIMLQQTQVKTVIPYYCKFIKRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + L+++PLD +L+LW+GLGYY RARN++K A+ + G FP ++ E+ LPG+G+S Sbjct: 61 NIDTLSDSPLDSILNLWSGLGYYTRARNIYKTAKILKQKFNGIFPNSYAEIIKLPGIGKS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAILS + ILDGN+KRVL R Y+++ K +E LW E +TP +F Sbjct: 121 TAGAILSFGFNLYSCILDGNIKRVLIRYYSIN--INNKYIEKLLWKTIESITPIYHTNKF 178 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS-WALYPGKKPKQTLPERTGYFLLLQ 239 NQA++D+GA+IC +S PKC++CPL++ C + NN + + K K +P+ +FL+LQ Sbjct: 179 NQALIDIGALICLKSNPKCNICPLKSTCKSYLNNKLFQINCKKNKKHIIPKTKYWFLILQ 238 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +++ + L +R G+W L+CFPQF + + W+ + + L F+H SH L Sbjct: 239 YKNYIFLEKRQNLGIWKKLFCFPQFIRQNDILSWIQKNNTKIKKINILNEFKHKLSHLTL 298 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 I P+W+ ++ + + +WYNL P +GL PV +++ +++ Sbjct: 299 YINPIWIIINKISIFSNNNKTIWYNLNNPQCIGLPTPVTKIITKIK 344 >UniRef50_Q3J8X1 A/G-specific DNA-adenine glycosylase n=13 Tax=Gammaproteobacteria RepID=Q3J8X1_NITOC Length = 354 Score = 326 bits (835), Expect = 9e-88, Method: Compositional matrix adjust. Identities = 165/352 (46%), Positives = 224/352 (63%), Gaps = 11/352 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M S FS ++L W+D YGRK LPWQ + T Y+VW+SE+MLQQTQVATVIPY++RF+ RFP Sbjct: 1 MNKSDFSQRLLTWFDAYGRKDLPWQQNPTLYRVWVSEIMLQQTQVATVIPYYQRFIERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA+A +DE+L LW GLGYYARAR LH+AA+ HGG+ P T E + LPG+GRS Sbjct: 61 SLPALAHASVDEILGLWAGLGYYARARRLHQAARIAWETHGGELPATLEALMELPGIGRS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GAIL+L+LG+ +PILDGNVKRVL R A+ WPG+ +VE +LW + + P + + Sbjct: 121 TGGAILALALGQRYPILDGNVKRVLTRQEAIEHWPGQPKVEKQLWQRAATLLPRTRLADY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLLQ 239 QA+MDLGA +CTR +P C CP++ C A + YP +P++ LP R T +LL Sbjct: 181 TQAIMDLGATVCTRHRPHCPSCPVKKTCQAHLQENPEAYPRSRPRKRLPLRATCMLILLN 240 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA--ADNLTQLTAFRHTFSHF 297 + EVLL +RPP G+WGGL+ FP+ + W Q Q + RH F+HF Sbjct: 241 DQGEVLLERRPPVGIWGGLWSFPECPPQTEAALW-CQEQFGWPIGEVQHWPPLRHHFTHF 299 Query: 298 HLDIVPMWLPVSSFTG----CMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 LDI P+ ++ G M+ + +WY + GL AP RLL++LR Sbjct: 300 TLDIQPV---IARIRGEARQVMEPNSQVWYKMEPMYKRGLPAPTLRLLKRLR 348 >UniRef50_B5JTP2 A/G-specific adenine glycosylase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JTP2_9GAMM Length = 354 Score = 324 bits (830), Expect = 3e-87, Method: Compositional matrix adjust. Identities = 174/353 (49%), Positives = 223/353 (63%), Gaps = 7/353 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS ++L WY GR LPWQ DKTPY+VW+SE+MLQQTQVATVIPY++RFMA FP Sbjct: 1 MSERSFSERLLAWYRVSGRHDLPWQQDKTPYRVWVSEIMLQQTQVATVIPYYQRFMASFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ DLA+A D+VL W+GLGYYAR RNLHKAA+Q A G P+ E + ALPG+G S Sbjct: 61 TLEDLASATQDQVLQHWSGLGYYARGRNLHKAARQ-AHEAWGDLPDDVEALEALPGIGHS 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVER 179 TAGAI S++ G+ ILDGNVKRVLAR AVSGWPG+ V +LW ++ TP AV Sbjct: 120 TAGAISSIAFGRRALILDGNVKRVLARHRAVSGWPGRTAVSRELWQWADLYTPSAVQAGE 179 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIA-AANNSWALYPGKKPKQTLPERTGYFLLL 238 + QA+MDLGA +CTR++P+C CP+ C+ + A K K + +L+ Sbjct: 180 YAQAIMDLGATLCTRTQPRCGECPVAEDCLGRGSGEPTAFPGKKPKKDKPVKSVRMWLVT 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + EVLL QRP GLWGGL+ P+ +E L WL++R L FRH+FSHFH Sbjct: 240 NKKGEVLLQQRPAQGLWGGLWVLPEQRVDEPLDAWLSRRGWLGAEAVPLPKFRHSFSHFH 299 Query: 299 LDIVPMW---LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 LD+ P+ + + T C D WYN + P G+AAPV RLL+ L T + Sbjct: 300 LDVEPLLVQDIDSARVTECADHSPEQWYN-PRSPQGGIAAPVARLLKSLPTDS 351 >UniRef50_A8PLA9 A/G-specific adenine glycosylase n=1 Tax=Rickettsiella grylli RepID=A8PLA9_9COXI Length = 354 Score = 323 bits (828), Expect = 5e-87, Method: Compositional matrix adjust. Identities = 162/346 (46%), Positives = 221/346 (63%), Gaps = 5/346 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +QF ++L W+ + GRK LPWQ ++PY WLSE+MLQQTQV TVIPYF+RF FPT++ Sbjct: 6 AQFQKKILTWFHQSGRKHLPWQQVQSPYFTWLSEIMLQQTQVTTVIPYFQRFTHHFPTLS 65 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LANA LDEV+ LW+GLGYYARAR+LH+ AQ + + GKFP+ + LPG+GRSTAG Sbjct: 66 SLANASLDEVIRLWSGLGYYARARHLHRCAQIIEEKYKGKFPQALILLQNLPGIGRSTAG 125 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQ 182 AI +L+ + ILDGNVKRV +R + +SGWPG V +LW+L+E+ TP V + Q Sbjct: 126 AIRALAFNQPAAILDGNVKRVFSRFHTLSGWPGLTHVNKQLWTLAERYTPHNKHVRHYTQ 185 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQHE 241 AMMDLGA+ICT +C+ CPLQ C A + YP KP + P R +L+ Sbjct: 186 AMMDLGALICTPKHAQCTECPLQRHCKAYKEDCVIHYPSSKPVKMNPSRAITMVILINAR 245 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI--AADNLTQLTAFRHTFSHFHL 299 E+LL +RPP G+WGGL+ P+ E ++ + + ++ +HTFSHF L Sbjct: 246 QEILLEKRPPRGIWGGLWSLPECTITEKIKNFCKKNYHCETEKSIKNYPFLKHTFSHFQL 305 Query: 300 DIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL 344 +I P+ + V ++ E N + WYN Q S+GLAAPV++LL QL Sbjct: 306 EIRPVAIRVKKWSPPQMESNPIVWYNKQQLASIGLAAPVKKLLNQL 351 >UniRef50_B0RRL4 A/G-specific adenine glycosylase n=15 Tax=Gammaproteobacteria RepID=B0RRL4_XANCB Length = 430 Score = 317 bits (812), Expect = 4e-85, Method: Compositional matrix adjust. Identities = 159/350 (45%), Positives = 216/350 (61%), Gaps = 6/350 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + F ++L W+D +GR LPWQ + PY+VWLSE+MLQQTQVA VIPYF++F+A FP Sbjct: 81 LTTDAFVDRLLHWFDGHGRHDLPWQHPRAPYRVWLSEIMLQQTQVAVVIPYFQKFVASFP 140 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ DLA A D V+ W GLGYYARARNLH AA+Q LH G+ P F+ + ALPG+GRS Sbjct: 141 TLADLAAADNDTVMAHWAGLGYYARARNLHAAAKQCVALHAGELPRDFDALLALPGIGRS 200 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQ---VTPAVGV 177 TAGAILS + FPI+DGNVKRVL R + ++G+PG VE +LW L+ PA + Sbjct: 201 TAGAILSQAWNDRFPIMDGNVKRVLTRIHGIAGYPGLPVVEKQLWQLAANHVAHVPAGRL 260 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 + QA MD GA +CTR++P C +CPLQ C+A P KP + LPER LL Sbjct: 261 ADYTQAQMDFGATLCTRARPACMVCPLQENCVARREGLVEALPTPKPGKQLPEREATALL 320 Query: 238 LQH-EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-DNLTQLTAFRHTFS 295 L++ +E+LL +RPP+G+W L+ PQ + LR+W A D ++ HTFS Sbjct: 321 LENAHNEILLQRRPPTGIWASLWTLPQAETDSDLREWFAAHIDGDYDRADEMPMIVHTFS 380 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL 344 H+ L + P+ L + + + + L W A ++GL AP+ +LL L Sbjct: 381 HYRLHLQPLRLRKVALRQVLRDNDDLRWVARADLATLGLPAPIRKLLDAL 430 >UniRef50_D1RGZ1 A/G-specific adenine glycosylase n=1 Tax=Legionella longbeachae D-4968 RepID=D1RGZ1_LEGLO Length = 357 Score = 312 bits (799), Expect = 1e-83, Method: Compositional matrix adjust. Identities = 152/339 (44%), Positives = 215/339 (63%), Gaps = 2/339 (0%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 QFS +L W+D +GRK LPWQ +TPY+VW+SE+MLQQTQV TVIPYFERFM RFP ++D Sbjct: 9 QFSKPLLLWFDLHGRKNLPWQHPRTPYRVWVSEIMLQQTQVQTVIPYFERFMQRFPNISD 68 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA+A D+VL LW+GLGYY+RARNLH+ A+ + H G FP + + LPG+G ST+ A Sbjct: 69 LAHAQEDDVLSLWSGLGYYSRARNLHQTAKLILQHHQGIFPNDSKLLNELPGIGPSTSAA 128 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 ILS + K ILDGNVKRVL R + ++G+P + +V+ KLW L++ P + QA+ Sbjct: 129 ILSQAFNKPAAILDGNVKRVLTRFFRITGYPEQSQVKKKLWELADLCMPQENCADYTQAI 188 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED-E 243 MDLGA C P CS CPL C+A + L+P KK K+++P + L+L +E Sbjct: 189 MDLGATCCITKNPHCSSCPLHINCLAFKHKEQHLFPTKKIKKSIPIQYQQLLVLHNEQGY 248 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFHLDIV 302 + L +RP GLWGGL+C P + ++ ++ ++ ++ L AF+H FSHFHL+I Sbjct: 249 IYLEKRPSPGLWGGLWCLPNISTDKCPLDFIRVNYDLSGEHPQPLLAFKHRFSHFHLEIN 308 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + + + F + E W+ Q S+GLA P ++L Sbjct: 309 ALSIKTTFFGRQLSENQGQWFAKEQLNSLGLAQPTRKIL 347 >UniRef50_Q2YCY4 A/G-specific DNA-adenine glycosylase n=4 Tax=Nitrosomonadaceae RepID=Q2YCY4_NITMU Length = 383 Score = 310 bits (795), Expect = 4e-83, Method: Compositional matrix adjust. Identities = 144/315 (45%), Positives = 205/315 (65%), Gaps = 7/315 (2%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A+ F+ +++ W ++GR LPWQ + Y +WLSE+MLQQTQV TVIPY+ RF+ FP + Sbjct: 12 ANSFATRLIRWQREHGRHHLPWQNTRDAYSIWLSEIMLQQTQVGTVIPYYRRFLQCFPDI 71 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+APLDEV+ W+GLGYY+RARNLHKAAQ++ HGG FPE + LPG+GRSTA Sbjct: 72 QSLASAPLDEVMVQWSGLGYYSRARNLHKAAQRIVGEHGGIFPEEVAIIRQLPGIGRSTA 131 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-------V 175 AI + GK ILDGNVKR+L+RC+ + G+PG+K+VE +LW +E + P Sbjct: 132 AAIAVFAFGKRAAILDGNVKRILSRCFGIEGYPGEKQVEAQLWQKAEALLPKGDESPIER 191 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 +E + QA+MDLGA IC R++P C CPL+ C+A +N P +P++ LPER Sbjct: 192 DIEGYTQALMDLGATICIRARPMCGSCPLRLECVAFRDNRAGSLPTPRPRKILPEREAVL 251 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 LL + ++LL +RP +G+WG L+ P+ E++ ++ + I ++Q+ A HTF+ Sbjct: 252 LLAVAQGKILLEKRPSTGIWGALWSLPEMGMNENVIEYCLRFGINVRPMSQMEALTHTFT 311 Query: 296 HFHLDIVPMWLPVSS 310 HF L I P+ L V S Sbjct: 312 HFRLRIYPLILQVIS 326 >UniRef50_C6N1N5 A/G specific adenine glycosylase n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N1N5_9GAMM Length = 348 Score = 308 bits (788), Expect = 2e-82, Method: Compositional matrix adjust. Identities = 155/343 (45%), Positives = 211/343 (61%), Gaps = 2/343 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QFS +L+W+ +GRK LPWQ+ ++PY+VW+SE+MLQQTQV TVIPYFERFM RFP++ Sbjct: 6 EQFSKPLLNWFSLHGRKNLPWQLPRSPYRVWVSEIMLQQTQVQTVIPYFERFMQRFPSIQ 65 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A DEVL LW+GLGYY+RARNLHK AQ +AT + G FP+ V LPG+G STA Sbjct: 66 DLAQANEDEVLSLWSGLGYYSRARNLHKTAQIIATEYQGIFPDELALVHELPGIGASTAA 125 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS + + ILDGNVKRVL R + + G P + V+ KLW L++ P + QA Sbjct: 126 AILSQAFNQPTAILDGNVKRVLTRFFMIQGHPEQALVKKKLWELAQACMPQEDCADYTQA 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL-QHED 242 +MDLGA CT C CPLQ+ C+A + LYP KK K+ +P + L+L + Sbjct: 186 IMDLGATCCTTKNFNCLNCPLQDNCLALKHQEQHLYPTKKIKKPVPVQQQQLLVLCNKQG 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFHLDI 301 + L +RPP+GLWGGL+C P + ++ + + +N L AF+H FSHF L+I Sbjct: 246 HIYLEKRPPTGLWGGLWCLPSIDEGNCPLDFIRTEYDLVGENPQPLIAFKHRFSHFLLEI 305 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + + E W+ S+GLA P ++L L Sbjct: 306 NALKIITLALGTKLGEAKGQWFAKENLGSLGLATPTSKILSLL 348 >UniRef50_Q7VRG8 A/G-specific adenine glycosylase n=2 Tax=Candidatus Blochmannia RepID=Q7VRG8_BLOFL Length = 355 Score = 305 bits (782), Expect = 1e-81, Method: Compositional matrix adjust. Identities = 149/352 (42%), Positives = 220/352 (62%), Gaps = 13/352 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++L W LPWQI+KT YK+W+SEVMLQQTQVATVIPY+++F+ +FPT++ L Sbjct: 7 FSHKILSWSKNAHILKLPWQINKTIYKIWISEVMLQQTQVATVIPYYQKFIKKFPTISKL 66 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A ++E+L++W+GLGYY RA NLHK A + H G FP F + + PG+GRSTAGAI Sbjct: 67 AEANINEILYIWSGLGYYKRALNLHKTATIIIHHHNGVFPNNFNILLSFPGIGRSTAGAI 126 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQVTPA-VGVERFNQA 183 LSL+L K FPILDGN+KR+L R Y+++ ++ NKLWSL + + P FNQA Sbjct: 127 LSLALNKRFPILDGNIKRILIRYYSLNNQQTSPTKINNKLWSLIDSLLPLDSNYAIFNQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGK-----KPKQTLPERTGYFLLL 238 MMDLG +ICT S P+C++CPL + C + N+ L K PKQ ++ Y+L+L Sbjct: 187 MMDLGRLICTHSNPQCNICPLNSHCQSFLTNNINLLTQKYATIHNPKQ---KKIIYWLIL 243 Query: 239 --QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 +H + + L QR +W L+CFP+F + +L WL++ + + + +H S+ Sbjct: 244 LVKHRNIIYLTQRLQETIWNKLFCFPEFYNRTTLNTWLSKYNLHNNLKINMPIIKHNISN 303 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 L+I P + +++ +E N +WYNL +P +GL PV +L+ ++ Sbjct: 304 IALEIQPTLININNTIFNTNEKN-IWYNLNKPAIIGLPKPVNTILKIIQNKT 354 >UniRef50_A1WW55 A/G-specific DNA-adenine glycosylase n=1 Tax=Halorhodospira halophila SL1 RepID=A1WW55_HALHL Length = 358 Score = 287 bits (735), Expect = 3e-76, Method: Compositional matrix adjust. Identities = 159/344 (46%), Positives = 208/344 (60%), Gaps = 4/344 (1%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 Q++ W ++GR LPWQ TPY+VW+SE+MLQQT+V TV+PYFERFM R+P V LA A Sbjct: 16 QLIAWQRQHGRNDLPWQQPATPYRVWISEIMLQQTRVETVVPYFERFMERYPDVAALAAA 75 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 LD+VL LW GLGYYARARNLH AAQ++ T GG+ P + LPG+G STAGAI SL Sbjct: 76 ELDDVLALWAGLGYYARARNLHAAAQRIQTDWGGQLPAELSALQTLPGIGPSTAGAIRSL 135 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 G+ PILDGNVKRVLAR V GWPG+ V +LW+LS +TP RFNQ +MDLG Sbjct: 136 GHGQPAPILDGNVKRVLARLAGVEGWPGRSPVAKQLWALSAALTPEAECRRFNQGLMDLG 195 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 A++CT P C+ CPL C A A + YP +P + P R LL++H D +LL + Sbjct: 196 ALVCTPRDPACNACPLAASCTARAAGNPETYPAPRPARQRPRREVRLLLIEHADALLLER 255 Query: 249 RPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPV 308 RP +G+WGGL+ P+ E + + L A H +HF L + P L Sbjct: 256 RPATGIWGGLWSLPECPPSEDPVTRALRLGARCEPAGDLPARHHALTHFELIMQPTRLRW 315 Query: 309 SSFTGCMDEGNA--LWYNLAQPPSVGLAAPVERLLQQLRTGAPV 350 ++ T + E + +W+ Q GL AP+ R+L+ G PV Sbjct: 316 NAATPDIGEPDPQRIWFRPGQDTLPGLPAPILRILRD--AGYPV 357 >UniRef50_D0L1V0 A/G-specific adenine glycosylase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L1V0_HALNC Length = 381 Score = 286 bits (732), Expect = 8e-76, Method: Compositional matrix adjust. Identities = 160/370 (43%), Positives = 218/370 (58%), Gaps = 33/370 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M AS F +++LDW+D++GR LPWQ +TPY+VW+SE+MLQQTQVATVI YF RFM RFP Sbjct: 1 MSASDFHSRLLDWFDRHGRHDLPWQHPRTPYRVWISEIMLQQTQVATVIGYFNRFMQRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA AP+DEVL LW+GLGYYARARNLH AAQ +A PET ALP VG S Sbjct: 61 SLDVLAAAPVDEVLALWSGLGYYARARNLHAAAQIMAQ---QGVPETRAGWQALPSVGPS 117 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI++ + ILDGNVKRVLAR + + L+ +++ TP V + Sbjct: 118 TAAAIMAQAFDVPETILDGNVKRVLARHAGIDRPIEQASTIQALYEVAKLHTPQTRVADY 177 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA +CTR P CS CP+ C+A A+N P ++ +Q +P R F+ ++ Sbjct: 178 TQAIMDLGATLCTRHSPGCSACPVSADCVAFASNRVESLPVRRRRQPVPTRRAVFMAIED 237 Query: 241 EDE-VLLAQRPPSGLWGGLYCFPQF-------ADEESLRQWLAQRQIAADNLTQ------ 286 E E ++L +RPP+G+WGGL+C P++ + E +L ++ +Q A DN Q Sbjct: 238 EAERLMLIRRPPTGIWGGLWCLPEYIPADDCVSQEHTLDSAVSGKQ-AKDNSGQQSMLSI 296 Query: 287 -----------LTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNL----AQPPSV 331 LT F H F+H+ LD + VS + D + LW L A+ P + Sbjct: 297 HRLKQRGPTSPLTTFEHRFTHYLLDARIDHMVVSRTSSVEDNPDVLWLPLIELAARAPLL 356 Query: 332 GLAAPVERLL 341 GL P+ R L Sbjct: 357 GLPKPMSRFL 366 >UniRef50_A1KUL5 Adenine glycosylase n=32 Tax=Neisseriaceae RepID=A1KUL5_NEIMF Length = 349 Score = 285 bits (728), Expect = 2e-75, Method: Compositional matrix adjust. Identities = 153/346 (44%), Positives = 203/346 (58%), Gaps = 14/346 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W ++GR LPWQ+ K PY VWLSE+MLQQTQVATV+ Y+ RF+ +FPTV L Sbjct: 10 FSERLIRWQKQHGRHHLPWQV-KNPYCVWLSEIMLQQTQVATVLDYYPRFLEKFPTVQTL 68 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP DEVL LW GLGYY+RARNLHKAAQQV GG FP +++ L GVGRSTA AI Sbjct: 69 AAAPQDEVLSLWAGLGYYSRARNLHKAAQQVVRQFGGTFPSERKDLETLCGVGRSTAAAI 128 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER--FNQA 183 + S + ILDGNVKRVL R +A G P K+ EN LW+L+E + P+ + + Q Sbjct: 129 CAFSFNRRETILDGNVKRVLCRVFARDGNPQDKKFENSLWTLAESLLPSENADMPAYTQG 188 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED- 242 +MDLGA +C R+KP C CP+ + C A N A P KK +P Y+L++++ + Sbjct: 189 LMDLGATVCKRTKPLCHQCPMADICEAKKQNRTAELPRKKTAAEVPTLPLYWLIIRNRNG 248 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +LL +RP G+WGGLYC P F L + A+ + ++ + TA H +H L I Sbjct: 249 AILLEKRPAKGIWGGLYCVPCFESLNGLSDFAAKFSLTMADMDEQTALTHRLTHRLLMIT 308 Query: 303 PMWLPVSSFTGCM---DEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 P F G M + +W GL P+E L R Sbjct: 309 P-------FEGQMPSESPSDGIWIKPTHLKDYGLPKPLEIYLNGNR 347 >UniRef50_A5EV94 A-G-specific adenine glycosylase n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EV94_DICNV Length = 347 Score = 278 bits (712), Expect = 2e-73, Method: Compositional matrix adjust. Identities = 140/304 (46%), Positives = 192/304 (63%), Gaps = 5/304 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M AS FSA+++ W +GR LPWQ + PY VWLSE+MLQQTQV TVIPYF+ F+ARFP Sbjct: 1 MNAS-FSARLISWQRTFGRHELPWQ-NCNPYHVWLSEIMLQQTQVVTVIPYFQCFIARFP 58 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA AP +EVL LW GLGYYARARNLH AAQ V G+ P T ++ L GVGRS Sbjct: 59 DIDTLAAAPEEEVLFLWRGLGYYARARNLHAAAQYVVHQLNGQLPNTRAQLEQLKGVGRS 118 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVE 178 TA AI + GK I DGNV+RVL R + + + + + +LW+L+E + P A + Sbjct: 119 TAAAICVFAFGKKEAICDGNVRRVLTRHHGILDFIEAPKTQQQLWTLAEALLPDAADDLR 178 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 + Q +MDLG++ICTR++PKC+ CP++ C A N P +KPK+ P R GYF+L Sbjct: 179 SYTQGLMDLGSLICTRARPKCADCPVKTDCYALKNQQQMQLPRRKPKKAPPARNGYFVLC 238 Query: 239 QHED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 +++D E+ L +R G+WGGL+ P F E + W+A + + + + H F+HF Sbjct: 239 RNDDGEIYLQKRARRGIWGGLWSLPWFETEHEAKMWIAAQGFSWQSSSDSPVMMHRFTHF 298 Query: 298 HLDI 301 HL + Sbjct: 299 HLSM 302 >UniRef50_C3X6B5 A/G-specific adenine glycosylase n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X6B5_OXAFO Length = 373 Score = 275 bits (704), Expect = 1e-72, Method: Compositional matrix adjust. Identities = 143/351 (40%), Positives = 208/351 (59%), Gaps = 13/351 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F++ ++ W KYGR +LPWQ + Y++WLSE+MLQQTQV+TVIPY+ RF+ FP V Sbjct: 14 ASFASILVAWQKKYGRHSLPWQNTRDAYRIWLSEIMLQQTQVSTVIPYYLRFLEFFPDVY 73 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A DEV+ W+GLGYY+RARNLHK A+ V +GG FP E + LPG+G+STA Sbjct: 74 SLAAASNDEVMKYWSGLGYYSRARNLHKCARIVVNEYGGIFPSDPESLEKLPGIGKSTAA 133 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI + S G ILDGNV RV +R + + +K +N+ W L+ ++ P +E + Q Sbjct: 134 AIAAFSAGAKAAILDGNVVRVFSRVFGIRDSITEKAGKNRFWELAYELLPETDIEAYTQG 193 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLGA IC RSKP C CP + C+A + N + P +K K+ P + L+L+ + Sbjct: 194 LMDLGATICVRSKPDCVKCPFSHCCVALSENRISELPVRKIKKASPLKKIIMLVLESRGQ 253 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA-----------ADNLTQLTAFRH 292 +LL +RP SG+WGGLY P++ + +LR+ + ++A A ++T L F H Sbjct: 254 ILLEKRPESGIWGGLYSLPEWETDVNLRK--HKDEMASLKTHVSSFGEAVSVTFLEPFVH 311 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 FSHF L + P + +S + +WY++ L APV +LL Sbjct: 312 VFSHFRLQVTPCVVTLSRRNRAIAPDAYVWYDIMSVNEAPLPAPVRKLLND 362 >UniRef50_A1VKP4 A/G-specific DNA-adenine glycosylase n=6 Tax=Comamonadaceae RepID=A1VKP4_POLNA Length = 384 Score = 275 bits (702), Expect = 2e-72, Method: Compositional matrix adjust. Identities = 153/349 (43%), Positives = 199/349 (57%), Gaps = 14/349 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F+ +++ W +GR +LPWQ + PY+VWLSE+MLQQTQVATV+ Y++RF+ RFPTV+ Sbjct: 25 SLFANEIVRWQQSHGRNSLPWQNTRDPYRVWLSEIMLQQTQVATVLAYYDRFVQRFPTVS 84 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A DEV+ LW GLGYY+RARNLH+ AQ V LH G+FP + E++ LPG+GRSTA Sbjct: 85 DLAAATQDEVMALWGGLGYYSRARNLHRCAQDVMALHAGQFPRSAEQLQTLPGIGRSTAA 144 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP----AVGVER 179 AI S + ILDGNVKRVL R S + E LW+ + + P A + R Sbjct: 145 AIASFCFAERVAILDGNVKRVLTRVLGFSDDLAQSANERALWNQATDLLPHDNLARAMPR 204 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 + Q +MDLGA ICT +PKC LCP+Q C A YP K K + L Q Sbjct: 205 YTQGLMDLGATICTGRQPKCLLCPVQALCRGRAEGQPEKYPVKTRKLKRSTQALSLLWAQ 264 Query: 240 HED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 D V L +RP +G+W GLYC P F + LR + Q + L AF H +H Sbjct: 265 KPDGSVWLEKRPATGIWSGLYCLPVFESDAELRALVPQS--LEGRMEALPAFVHVLTHKD 322 Query: 299 LDIVPMWLPVSSFTGCM-----DEGN-ALWYNLAQPPSVGLAAPVERLL 341 L + P W+ M EG A W+ A P +GL AP+ +LL Sbjct: 323 LCLSP-WIAGFQADQAMPETMVSEGRPAAWFGPAAWPELGLPAPIRKLL 370 >UniRef50_UPI0001BC503C adenine glycosylase n=1 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC503C Length = 359 Score = 272 bits (696), Expect = 1e-71, Method: Compositional matrix adjust. Identities = 152/351 (43%), Positives = 202/351 (57%), Gaps = 17/351 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W ++GR LPWQ+ + PY VWLSE+MLQQTQVATV+ Y+ RF+ +FPTV L Sbjct: 7 FSERLIRWQKQHGRHHLPWQV-QNPYCVWLSEIMLQQTQVATVLDYYPRFLEKFPTVQTL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP DEVL LW GLGYY+RARNLHKAA+QV G FP +++ L GVGRSTA AI Sbjct: 66 AAAPQDEVLSLWAGLGYYSRARNLHKAARQVVEEFDGTFPSERKDLETLCGVGRSTAAAI 125 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER--FNQA 183 + + + ILDGNVKRVL R +A G P K+ EN LW+L+E + P + + Q Sbjct: 126 CAFAFNRRETILDGNVKRVLCRVFARDGNPQDKKFENSLWTLAESLLPPENADMPAYTQG 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK---QTLPERTGYFLLLQH 240 +MDLGA +C R+KP C CP+ + C A N A P KK QTLP Y+L+++ Sbjct: 186 LMDLGATVCKRTKPLCHQCPMADICEAKKQNRIAELPRKKTATEVQTLP---LYWLIIRD 242 Query: 241 ED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-------DNLTQLTAFRH 292 +D +LL +RP G+WGGLYC P F + + I + D+L++ H Sbjct: 243 QDGAILLEKRPAKGIWGGLYCVPCFEKLDEAYACAEKLGIFSECAASPWDDLSEQPTLTH 302 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 +H L I P SS + N LW GL P+ L+Q Sbjct: 303 RLTHRLLMITPFEAQTSSSENTVLPPNCLWVKPENLTDYGLPKPLADYLKQ 353 >UniRef50_Q3A7R2 A/G-specific DNA-adenine glycosylase n=2 Tax=Desulfuromonadales RepID=Q3A7R2_PELCD Length = 352 Score = 268 bits (686), Expect = 2e-70, Method: Compositional matrix adjust. Identities = 138/305 (45%), Positives = 192/305 (62%), Gaps = 8/305 (2%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A + +LDWY + GR LPW+ + PY++WLSE+MLQQT V VIPY+ERF+A FP+V Sbjct: 8 ADTMNRCLLDWYGRCGRD-LPWRRTRDPYRIWLSEIMLQQTGVTAVIPYYERFLAAFPSV 66 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA APL++VL LW GLGYY RAR LH+AA +V + HGG+FPET E + ALPG+GRSTA Sbjct: 67 AALAAAPLEQVLELWAGLGYYRRARFLHEAACKVVSEHGGQFPETPEAIQALPGIGRSTA 126 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI+S++ + PILDGNV+RVL R A+SG P +VE +LW ++ +TP + Q Sbjct: 127 GAIVSIAFDRKAPILDGNVRRVLCRLLAISGDPRSSKVEKRLWQCADALTPEDRPHDYAQ 186 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+MDLGA +C +P C CPL C A P + ++T+P +L++ E Sbjct: 187 AIMDLGATVCKPRRPDCQACPLSGLCQAFWQGIQEQLPQRATRKTVPLVQQVAVLIEREG 246 Query: 243 EVLLAQRPPSGLWGGLYCFPQFA------DEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 L+ +RP G+ GGL+ FP A E + R LA ++A L+ + RH +SH Sbjct: 247 RYLVRRRPLDGMLGGLWEFPSVAVPEGKTAETAARTLLAGEGLSA-GLSPVGTVRHAYSH 305 Query: 297 FHLDI 301 F +++ Sbjct: 306 FRVEL 310 >UniRef50_UPI0000E87CC6 A/G-specific adenine glycosylase n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87CC6 Length = 343 Score = 267 bits (683), Expect = 4e-70, Method: Compositional matrix adjust. Identities = 128/340 (37%), Positives = 204/340 (60%), Gaps = 1/340 (0%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +V+ W+ GR LPWQ PY +W+SE+MLQQTQV++VIPY++RF+ FPTV L Sbjct: 4 IAEKVITWHKSSGRHDLPWQKINDPYLIWISEIMLQQTQVSSVIPYYQRFIKTFPTVEKL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A D V+ W+GLGYY RA+ + + A+ + + KFP++ E++ +LPG+G+STAGAI Sbjct: 64 AFADHDVVMKHWSGLGYYRRAKFIMQTAKIIVQQYQSKFPDSVEKLLSLPGIGKSTAGAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 + + G PI+D NVKRV R Y + WPGK + + LWSL+EQ P+ ++ + QA+M Sbjct: 124 CAFAFGGIEPIMDANVKRVFCRFYGIMEWPGKAQTQKYLWSLAEQNLPSNNIQIYTQALM 183 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA +C S+P CS CPLQ C++ +N + P KKPK+TL + L+++ +++L Sbjct: 184 DLGATLCKGSQPVCSQCPLQLKCVSFKSNLCHVIPAKKPKKTLITKQVNVLIVEANNQIL 243 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQ-RQIAADNLTQLTAFRHTFSHFHLDIVPM 304 +R S +W GL+ P+ D ++W+ + N+ + A FSH+ L + Sbjct: 244 FQKRSDSSVWEGLWSTPEIQDFSYTKKWIKDVLGLTKFNVVKQGAHLAIFSHYKLQLNYN 303 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + ++ + LW + + G+ P+++LLQ + Sbjct: 304 HILLTETFSHQQLKDFLWIDRPKIGGEGIPTPIKKLLQNI 343 >UniRef50_A3MA77 A/G specific adenine glycosylase n=18 Tax=Acinetobacter RepID=A3MA77_ACIBT Length = 355 Score = 265 bits (677), Expect = 2e-69, Method: Compositional matrix adjust. Identities = 151/292 (51%), Positives = 187/292 (64%), Gaps = 4/292 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +L+W+D++GR LPWQ+ PYKVW+SE+MLQQTQV TV+ YF+RFM RFPTV L Sbjct: 6 FSDALLNWFDQHGRHDLPWQVADDPYKVWVSEIMLQQTQVKTVLQYFDRFMERFPTVEAL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A DEV W GLGYYARARNLHKAA VA GKFPET EE ALPG+GRSTAGA+ Sbjct: 66 GYATWDEVAPYWAGLGYYARARNLHKAAGLVA--QQGKFPETLEEWIALPGIGRSTAGAL 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +SL L ++ I+DGNVKRVLAR +A+ K + E ++W L+E++ P + QA+M Sbjct: 124 MSLGLRQYGVIMDGNVKRVLARFFAIEDDLSKPQHEREMWKLAEELCPTQRNHDYTQAIM 183 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA ICT KP C CP+Q C A P KKPK+T P +T L++Q EDE Sbjct: 184 DLGATICTPKKPLCLYCPMQAHCQAYQQGLEQELPFKKPKKTPPVKTADVLIIQCEDEWF 243 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 QR GLWGGL+C P +E R L+Q Q Q H+F+HF Sbjct: 244 WQQRQAHGLWGGLFCLPILENEHE-RLKLSQ-QFKLQPQPQTFQISHSFTHF 293 >UniRef50_C9PAA7 A/G-specific adenine glycosylase n=2 Tax=Vibrio RepID=C9PAA7_VIBFU Length = 341 Score = 262 bits (670), Expect = 1e-68, Method: Compositional matrix adjust. Identities = 135/340 (39%), Positives = 194/340 (57%), Gaps = 13/340 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QF +L W +GR LPWQ + +PY+V +SEVMLQQTQV TVIPYFER+MA FPT+ Sbjct: 9 KQFQEHLLTWQRHHGRHDLPWQQNPSPYRVLVSEVMLQQTQVVTVIPYFERWMASFPTIE 68 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LANA D V++ W GLGYY+RARNL KAA + G+FP + +PGVGR TAG Sbjct: 69 ALANATEDAVMNHWQGLGYYSRARNLRKAALYIQDTWNGEFPADVNALQNIPGVGRYTAG 128 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI + + + PI+DGNVKR+ R + + G PG ++ +LWS +E TP ++ Q Sbjct: 129 AIAAFAFNTYGPIVDGNVKRLFCRYFGIEGVPGTSAMDKQLWSTAEAYTPTHNNRQYAQG 188 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 ++D+GA +C P C C C A N + P KPK+ P + G+FL L+ + + Sbjct: 189 LLDMGATLCKPKNPTCDACSFTTTCKAYQTNRVSELPTPKPKKVTPTKPGHFLWLESDGK 248 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +LL +R G+WG L+C PQ +L Q+ +++ AF+HTF+H+ LD Sbjct: 249 LLLEKRADDGIWGALWCLPQI--------YLEPEQL-GEHIKLKGAFKHTFTHYKLD-GK 298 Query: 304 MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 +W + ++ W L Q VGL P+ +++ Sbjct: 299 VWQ-IDKLG--VENPQQQWMPLHQLDDVGLPKPIRTFIEK 335 >UniRef50_Q1Q8E2 A/G-specific DNA-adenine glycosylase n=4 Tax=Moraxellaceae RepID=Q1Q8E2_PSYCK Length = 453 Score = 259 bits (663), Expect = 7e-68, Method: Compositional matrix adjust. Identities = 136/264 (51%), Positives = 170/264 (64%), Gaps = 7/264 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKT----PYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F+ ++L W+ GR LPWQ +T PY VWLSEVMLQQTQV TV+PYF RFM FPT Sbjct: 53 FAPRLLAWFADNGRHDLPWQQHQTDTPNPYIVWLSEVMLQQTQVTTVLPYFARFMDSFPT 112 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATL--HGGKFPETFEEVAALPGVGR 119 V DLA A D + W GLGYYARARNLHK A+Q+ + G FP+T A+ GVG Sbjct: 113 VQDLAAAEWDTIAEHWAGLGYYARARNLHKGAKQLVAVIDETGDFPQTLAGWEAISGVGP 172 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 STAGAI+S+ L ++ I DGNVKRVL R A+ G K LW+L+E++TP Sbjct: 173 STAGAIMSMGLHRYGVICDGNVKRVLTRWAAIDGDITKSATTKDLWALAERLTPREQSGL 232 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 F QAMMD+GA +CTRSKP C LCPLQ+ C+A A YP K KQ P + LL++ Sbjct: 233 FAQAMMDMGATLCTRSKPACLLCPLQDDCLAHAQGRETEYPVKAKKQPKPSKFSNALLIE 292 Query: 240 HED-EVLLAQRPPSGLWGGLYCFP 262 +E+ E+L QRP +G+WGGL+ P Sbjct: 293 NENGEILWLQRPDNGIWGGLWSLP 316 >UniRef50_A6GM64 Probable a/g-specific adenine glycosylase protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GM64_9BURK Length = 377 Score = 254 bits (650), Expect = 2e-66, Method: Compositional matrix adjust. Identities = 136/295 (46%), Positives = 175/295 (59%), Gaps = 26/295 (8%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F ++ W ++GR+TLPWQ YKVWLSEVMLQQTQV TV+ Y+ RF+ +PTV DL Sbjct: 9 FGEVLVRWQKQHGRQTLPWQHTGDAYKVWLSEVMLQQTQVTTVLAYYARFLQAYPTVADL 68 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP +V+ LW GLGYY RARNLH A+QVA GG+FP T E+ +LPG+G+STAGAI Sbjct: 69 AGAPEQDVMQLWAGLGYYTRARNLHACAKQVAARFGGQFPRTVAELESLPGIGQSTAGAI 128 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 SL+ G PILDGNVKRV R Y + G+P + ++ LW+++E P +NQA+M Sbjct: 129 ASLAYGVQAPILDGNVKRVFCRYYGIEGYPEQTTIKKTLWAIAEANVPEQQPGVYNQALM 188 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE-- 243 DLGA C P CS CPL C+A L P KPK+ PE +F+ + EDE Sbjct: 189 DLGATCCVPRNPACSACPLMQSCVALQKGMVGLLPTPKPKKARPEL--HFVSMIVEDERG 246 Query: 244 -VLLAQRPPSGLWGGLYCFP--------QFADEESLRQWLAQRQIAADNLTQLTA 289 VLL + G+W GL+ P FAD+ DNLT++ A Sbjct: 247 AVLLELQTGKGVWQGLWTTPFEACGSQGGFADD-------------FDNLTRIAA 288 >UniRef50_A3ZT46 A/G-specific adenine glycosylase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT46_9PLAN Length = 358 Score = 249 bits (636), Expect = 1e-64, Method: Compositional matrix adjust. Identities = 136/305 (44%), Positives = 180/305 (59%), Gaps = 9/305 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +F Q+L WY R LPW+ D+ PY+VW+SE+MLQQTQVATV YFERF A FPTVTD Sbjct: 9 KFQRQILAWYGGAARD-LPWRADRDPYRVWISEIMLQQTQVATVRAYFERFSAAFPTVTD 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A EVL LW GLGYY RAR LH AAQ +A H GKFP F + ALPGVGR TAGA Sbjct: 68 LAAADEAEVLRLWEGLGYYRRARQLHAAAQVIADEHRGKFPREFAAILALPGVGRYTAGA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ + PIL+ N R+ AR A P K + LW +E + P V FNQA+ Sbjct: 128 ICSIAYDQPAPILEANTIRLHARLLAYREDPTKTAGQRLLWQFAEHILPTEDVSSFNQAL 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LG+ ICT P+C +CP+ C A N+ A P K K +RT ++++ + EV Sbjct: 188 MELGSEICTPRNPQCGVCPVATLCQAKRENAVAEIPVPKKKMQYEDRTEVAIIVRRKKEV 247 Query: 245 LLAQRPPSGLWGGLYCFPQF---ADEESLRQWLAQR-----QIAADNLTQLTAFRHTFSH 296 LL Q W GL+ FP+F ++ E L LA + ++A ++LT +H + Sbjct: 248 LLRQCGADERWAGLWDFPRFQVASEAELLSGELASQLEERTGLSAAIGSRLTTIKHGVTK 307 Query: 297 FHLDI 301 + + + Sbjct: 308 YRITL 312 >UniRef50_D0GU15 A/G-specific adenine glycosylase n=1 Tax=Vibrio mimicus MB-451 RepID=D0GU15_VIBMI Length = 231 Score = 244 bits (622), Expect = 4e-63, Method: Compositional matrix adjust. Identities = 119/224 (53%), Positives = 158/224 (70%), Gaps = 5/224 (2%) Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS K ILDGNVKR LARC+AV GWPG+K VEN+LW +E TP+V V+++NQAM Sbjct: 1 MLSSVYKKPHAILDGNVKRTLARCFAVEGWPGQKSVENQLWHYAEMHTPSVDVDKYNQAM 60 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MD+GAMICTRSKPKCSLCP+++ C+A + YPGKKPK P + +F++L H + V Sbjct: 61 MDMGAMICTRSKPKCSLCPVESFCLAKQQGNPQEYPGKKPKTDKPVKATWFVMLYHNNAV 120 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ---LTAFRHTFSHFHLDI 301 L QRP +G+WGGLYCFPQ ++ +++ + QR I + + L AFRHTFSH+HLDI Sbjct: 121 WLEQRPQTGIWGGLYCFPQ-SEIANIQTTIDQRAIGDNAIKSQKTLIAFRHTFSHYHLDI 179 Query: 302 VPMWLPVSSFTGCMDEGN-ALWYNLAQPPSVGLAAPVERLLQQL 344 P+ L +S + EG+ LWYNL+QP +GLAAPV++LL L Sbjct: 180 TPILLELSRKPDIIMEGSKGLWYNLSQPDEIGLAAPVKQLLHSL 223 >UniRef50_Q2LY55 A/g-specific DNA glycosylase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LY55_SYNAS Length = 373 Score = 241 bits (616), Expect = 2e-62, Method: Compositional matrix adjust. Identities = 126/311 (40%), Positives = 184/311 (59%), Gaps = 15/311 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A + + + WY+K ++ LPW+ PY +WLSE+MLQQTQV VIPY+ RF+ +FPT Sbjct: 21 NAQRIATLLTAWYEK-NKRLLPWRSTSDPYAIWLSEIMLQQTQVEAVIPYYRRFLEQFPT 79 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +LA APL+ VL +W +GYY+RAR+LH A+ + HGG+FP ++ ALPG+G T Sbjct: 80 IEELARAPLEAVLKVWEKMGYYSRARHLHATARLILESHGGRFPANPVDLMALPGIGSYT 139 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 +GAILS++ GK P +DGNVKRVL+R + V L +L+E++ PA RFN Sbjct: 140 SGAILSIAFGKSVPAVDGNVKRVLSRLFFVDSPVDLTSTRRLLSALAEKLVPARQPGRFN 199 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH- 240 QA+M+LGA++C P CS CPLQ+ C+A A + P + + P R +++ Sbjct: 200 QALMELGAVLCRPKTPLCSDCPLQSICLAYAGSGQHFLPVSRKRTKRPHREAVAAVIRDS 259 Query: 241 EDEVLLAQRPPSGLWGGLYCFP----------QFADEESLRQWLAQRQIAADNLTQLTAF 290 E +L+ +RP +G GGL+ FP A E R+ L A D+L L Sbjct: 260 EQRLLVIRRPAAGFLGGLWTFPGGMLNPGEIVTEAVERRCREGLNITVAAGDSLMTL--- 316 Query: 291 RHTFSHFHLDI 301 + T++HFHL + Sbjct: 317 QQTYTHFHLTL 327 >UniRef50_Q1DAW2 A/G-specific adenine glycosylase n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DAW2_MYXXD Length = 390 Score = 241 bits (616), Expect = 2e-62, Method: Compositional matrix adjust. Identities = 121/260 (46%), Positives = 165/260 (63%), Gaps = 3/260 (1%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +LDWYD+ ++ LPW+ + Y +WLSEVMLQQTQV+TVIPY+ERF+ARFPT LA+AP Sbjct: 41 LLDWYDR-NKRDLPWRRTRDSYAIWLSEVMLQQTQVSTVIPYWERFLARFPTARALASAP 99 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 LD+VL W GLGYY+RARNLH+AAQ+V GG P T E+ LPG GR TAGA+ S++ Sbjct: 100 LDDVLAGWKGLGYYSRARNLHRAAQEVVARFGGTLPSTAAELLELPGFGRYTAGAVASIA 159 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G+ P++DGNV RV +R + V G PG ++ E LW+L+ + FNQA+M+ GA Sbjct: 160 FGEEAPLVDGNVARVFSRIFEVEGLPGDRQREATLWALATALVKGERPGDFNQALMEHGA 219 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 C P C LCP++ C+A P K + T + T + H +L A+R Sbjct: 220 TTCRPENPLCLLCPVRGACVAFRKGRVDELPPAKVRATPKKLTLAVAVWPHAGTLLFARR 279 Query: 250 PPSGLWGGLYCFP--QFADE 267 +GL+GGL+ P + AD+ Sbjct: 280 ADAGLFGGLWELPAAEIADD 299 >UniRef50_B1Y3P3 A/G-specific adenine glycosylase n=8 Tax=Burkholderiales RepID=B1Y3P3_LEPCP Length = 384 Score = 239 bits (610), Expect = 1e-61, Method: Compositional matrix adjust. Identities = 137/306 (44%), Positives = 180/306 (58%), Gaps = 8/306 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +A ++ W ++GR LPWQ + PY+VWLSEVMLQQTQVATV+ Y+ERF+ RFP + L Sbjct: 12 IAADLVRWQRQHGRHGLPWQASRDPYRVWLSEVMLQQTQVATVLGYYERFLQRFPDIAAL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A APLD+VL LW+GLGYY+RARNLH+ AQ V HG P E++ LPG+G STA AI Sbjct: 72 AAAPLDDVLALWSGLGYYSRARNLHRCAQVVVAEHGAALPRRAEQLVELPGIGPSTAAAI 131 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVERFNQA 183 S G+ I DGNVKRVLAR A G + LW+ ++++ P A + + Q Sbjct: 132 ASFCHGERVSIFDGNVKRVLARLLAFEGDLAQAGAAKVLWAQADRLVPTDAADMPAYTQG 191 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLGA +CT P+C CPLQ C A A S YP K K R + L L D Sbjct: 192 LMDLGATVCTPRDPQCPACPLQRACRAHAAGSVLTYPIKSRKLKRSRRENWCLWLSLGDA 251 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA----ADNL--TQLTAFRHTFSHF 297 V L QRP +G+W GL+ +P F D +++ + AQ A A+ L +L F H +H Sbjct: 252 VWLQQRPDTGIWAGLWAWPLFDDTDAVERLTAQLLAALGGHAERLQANRLPGFVHVLTHL 311 Query: 298 HLDIVP 303 + P Sbjct: 312 DWTLHP 317 >UniRef50_Q7UYZ0 A/G-specific adenine glycosylase n=1 Tax=Rhodopirellula baltica RepID=Q7UYZ0_RHOBA Length = 367 Score = 236 bits (603), Expect = 7e-61, Method: Compositional matrix adjust. Identities = 123/307 (40%), Positives = 182/307 (59%), Gaps = 16/307 (5%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++L W+ R LPW+ D +PY+VW+SE+M QQTQVATV+PYFERF++ +PT+ DLA+A Sbjct: 4 RLLSWFSDNARD-LPWRRDHSPYRVWISEIMCQQTQVATVLPYFERFLSTYPTIRDLADA 62 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 +++ +W GLGYY RAR+LH AA+++ H G+FPE+F++V ALPG+GR TAGAI S+ Sbjct: 63 DESQLMRMWEGLGYYRRARSLHAAAKKMVEEHNGEFPESFDDVLALPGIGRYTAGAIQSI 122 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-------AVGVERFN 181 S K FPIL+GN +RV +R ++ P +K + +LW LS+++ P + G FN Sbjct: 123 SRNKAFPILEGNTQRVFSRWIGLTVPPTEKVAQARLWELSDKMLPPRKADDRSNGPAGFN 182 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA M+LGA+IC+ PKC CP+ C A PGK K RT +++ + Sbjct: 183 QAAMELGALICSPRSPKCDECPVATMCHANQMGLQDEIPGKISKVQYESRTELAVVISRD 242 Query: 242 DEVLLAQRPPSGLWGGLYCF-----PQFADEESLRQWLAQRQIAADNLTQ--LTAFRHTF 294 D L+ P + G+ F P+ AD + WLA Q+ D L +H Sbjct: 243 DRYLVRTIPEGVRFAGMIDFPRAGPPEAADVVGMESWLAD-QLGGDVRIGMCLKTIKHAV 301 Query: 295 SHFHLDI 301 + + + + Sbjct: 302 TRYRMTL 308 >UniRef50_B6BVY5 A/G-specific adenine glycosylase n=1 Tax=beta proteobacterium KB13 RepID=B6BVY5_9PROT Length = 294 Score = 236 bits (602), Expect = 1e-60, Method: Compositional matrix adjust. Identities = 122/306 (39%), Positives = 179/306 (58%), Gaps = 14/306 (4%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQTQV+TVIP++ +F+ RF + LA A DEV+ W+GLG+Y+RARNLHK A+ +A Sbjct: 1 MLQQTQVSTVIPFYIKFIDRFQNIKHLAEASEDEVMSYWSGLGFYSRARNLHKTARIIAE 60 Query: 99 LHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK 158 + KFP+TFE + LPG+GRSTAGAILS K F ILDGNVKRVL R + + Sbjct: 61 QYSCKFPDTFESLIQLPGIGRSTAGAILSFCFKKKFAILDGNVKRVLTRFFGIQESISLA 120 Query: 159 EVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 + E LW LSEQ+ P ++ + Q +MD GA +CT P+C CP+ CIA NN Sbjct: 121 KTEKDLWDLSEQLLPDGDIDIYTQGIMDFGATLCTPKNPQCHSCPMNQTCIAKQNNLTEA 180 Query: 219 YPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQ 278 P K +T +R+ F + + ++LL + +G+W GL+ PQ + +L ++ Sbjct: 181 IPVKNKTKTKEDRSTEFYIYECNKQILLVKN-RTGVWSGLWIPPQ-------KNYLGKKI 232 Query: 279 IAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVE 338 NL + + FSH+ L + +++ + M +G LW++ + P +GL AP++ Sbjct: 233 ----NLIKQGERQCVFSHYRLKYKYFLIKITNKSDLMVDG--LWFDWKEIPELGLPAPIK 286 Query: 339 RLLQQL 344 LL L Sbjct: 287 SLLISL 292 >UniRef50_C0EDR1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=C0EDR1_9CLOT Length = 365 Score = 235 bits (599), Expect = 2e-60, Method: Compositional matrix adjust. Identities = 132/331 (39%), Positives = 186/331 (56%), Gaps = 9/331 (2%) Query: 2 QASQFSAQV----LDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMA 57 QA + AQ+ L WYD R LPW+ TPY+VW+SE+MLQQT+V+ V+PY+ERF++ Sbjct: 13 QAQELLAQIPPLLLTWYDHSAR-ILPWRQQPTPYRVWVSEIMLQQTRVSAVMPYYERFLS 71 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 PTV LA+AP + +L LW GLGYY R RN+ KAA+ V HGG+ P +FEE+ LPG+ Sbjct: 72 ALPTVGALADAPEEVLLKLWEGLGYYNRVRNMQKAARAVMEQHGGELPASFEELVKLPGI 131 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G TAGA+ S++ G P +DGNV R+L+R V+ +L +Q+ PA V Sbjct: 132 GEYTAGAVASIAYGLRVPAVDGNVLRILSRWLLSRADVTMPPVKRAYQALVQQMLPAERV 191 Query: 178 ERFNQAMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 FNQA+M+LGA +C P C CP+ C A A P K PK+ E L Sbjct: 192 GDFNQALMELGATVCLPNGDPLCESCPVAGLCRARERGCAAELPVKAPKKPRREEKRTIL 251 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 L+ E +VLL +RP +GL GL+ + S ++ A + ++ +L A +H FSH Sbjct: 252 LVVSEGKVLLTKRPANGLLAGLWEYLNLEGHLSAQEAAAHCGVPVSSVRRLGAAKHIFSH 311 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQ 327 + + VS FT +G +W + Q Sbjct: 312 IEWKMRGYLIQVSGFT---PQGECVWADQLQ 339 >UniRef50_Q7NMA0 A/G-specific adenine glycosylase n=1 Tax=Gloeobacter violaceus RepID=Q7NMA0_GLOVI Length = 375 Score = 235 bits (599), Expect = 2e-60, Method: Compositional matrix adjust. Identities = 135/306 (44%), Positives = 185/306 (60%), Gaps = 15/306 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + AQ+L+WY + GR LPW+ + PY +W+SE+MLQQTQV TV+PY++R++A PT Sbjct: 18 QVVRLRAQLLEWYGRMGRD-LPWRRTRDPYAIWISEIMLQQTQVKTVLPYYQRWLAALPT 76 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV-AALPGVGRS 120 V LA A L+ VL LW GLGYY RARNLHKAAQ + HGG FPET +++ ALPG+GRS Sbjct: 77 VAALAAAELEAVLKLWEGLGYYTRARNLHKAAQVIVKEHGGVFPETAQQLQQALPGIGRS 136 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S + G+ ILD N +RVL R +AV P + E KLW +S+++ F Sbjct: 137 TAGAIASSAFGRCEAILDANARRVLGRLFAVGDPPARAEA--KLWEISQRLVDPQAPHNF 194 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MDLGA +CT P C LCP Q C+ + +P + + E G + ++ Sbjct: 195 NQALMDLGATVCTARSPLCLLCPWQVDCLGRRSGDPTHFPVRPARAVRSEIAGVSVAIEC 254 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFAD-------EESLRQWLAQRQIAADNLTQLTAFRHT 293 + + LL +RP GL GL+ FP F + EE++R R ++L +L H Sbjct: 255 QGKFLLVRRPERGLLAGLWEFP-FVESVGGGEPEETVRVAFGNR---LESLERLGQVEHE 310 Query: 294 FSHFHL 299 F+H HL Sbjct: 311 FTHRHL 316 >UniRef50_A9EQ53 A/G-specific adenine glycosylase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EQ53_SORC5 Length = 396 Score = 234 bits (597), Expect = 4e-60, Method: Compositional matrix adjust. Identities = 127/306 (41%), Positives = 179/306 (58%), Gaps = 12/306 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +A + W+ + R LPW+ + PY +WLSEVMLQQT+V TVIPY+ERF+AR+PTV Sbjct: 35 EIAAALEAWFGRVARD-LPWRRTRDPYAIWLSEVMLQQTRVETVIPYYERFLARYPTVFA 93 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA+A +D+VL LW+GLGYY RAR LH AA++V H G P + ALPGVG TAGA Sbjct: 94 LASAEIDDVLSLWSGLGYYRRARVLHLAAREVTARHDGALPRDVSALLALPGVGAYTAGA 153 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT----PAVGVERF 180 I S++ + P++DGNV RVL+R + KLWS +E++ +V RF Sbjct: 154 IASIAYDQPVPLVDGNVARVLSRIEGIDDDIRSASGTRKLWSTAERLVRGSADSVHPGRF 213 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LGA +CT P+C CP+ C+A A A P PK+ +P +++ Sbjct: 214 NQALMELGATVCTPRNPRCDACPVDGACVARATGRQAELPVIAPKRDVPAVAMVAAVVRS 273 Query: 241 EDEVLLAQRPPSGLWGGLYCFP-----QFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 D VL +R GL+GGL+ P AD +L + + + AA L ++ RH + Sbjct: 274 GDRVLFVRRAEGGLFGGLWEPPMVEARSLADARALLELVGVARDAA--LREVGRVRHILT 331 Query: 296 HFHLDI 301 H LD+ Sbjct: 332 HRRLDV 337 >UniRef50_B5EQC0 A/G-specific adenine glycosylase n=3 Tax=Acidithiobacillus RepID=B5EQC0_ACIF5 Length = 369 Score = 234 bits (596), Expect = 4e-60, Method: Compositional matrix adjust. Identities = 149/363 (41%), Positives = 194/363 (53%), Gaps = 27/363 (7%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WY ++GR LPW+ + PY++WL+E+MLQQTQV + PY+ RF+ P LA AP Sbjct: 8 LLAWYTRHGRHDLPWRQTRDPYRLWLAEIMLQQTQVESAKPYYVRFLQALPNWQALAAAP 67 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D VL LW+GLGYYARARN +AAQ V T G FP+T E+ LPGVGRSTA A+L+ + Sbjct: 68 QDRVLALWSGLGYYARARNAQRAAQTVMTGFAGHFPDTLEKAITLPGVGRSTAAAVLASA 127 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G ILD N +RVL R +A+ P + LW+L+ +TP +NQA+ DLGA Sbjct: 128 FGHRQAILDANARRVLIRSHAIDADPKAAATQQWLWTLASALTPE-DAHGYNQAIQDLGA 186 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL-LQHEDEVLLAQ 248 MICT +P+C CPL + C A A P + T P+R +FLL E +LL + Sbjct: 187 MICTPRQPRCPDCPLASRCKAHAQGRSHALPVTVHRPTKPQRRAFFLLAADMEGRILLEK 246 Query: 249 RPPSGLWGGLYCFPQ------------------FADEESLRQWLAQR-QIAADNLTQLTA 289 RP SG+WGGL+C PQ D ++RQ L + +LTA Sbjct: 247 RPDSGIWGGLWCLPQATPDSGELSILCNEPPSPLEDTPAIRQSLQEAWSRRLRLDLRLTA 306 Query: 290 F----RHTFSHFHLDIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL 344 RH F+HF L V D G L WY A GL PV R+L+ L Sbjct: 307 LGEEQRHVFTHFQLRFRCAHARVLG-GAVADSGATLRWYRPADALKQGLPTPVRRILRNL 365 Query: 345 RTG 347 G Sbjct: 366 GGG 368 >UniRef50_A4T064 A/G-specific adenine glycosylase n=2 Tax=Polynucleobacter necessarius RepID=A4T064_POLSQ Length = 381 Score = 234 bits (596), Expect = 5e-60, Method: Compositional matrix adjust. Identities = 143/364 (39%), Positives = 192/364 (52%), Gaps = 26/364 (7%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W+ + GR LPWQ ++ PY VW+SE+MLQQTQVATV+ + RFM RFPTV L Sbjct: 9 FSKKLIAWHAQSGRSGLPWQGNRDPYAVWVSEIMLQQTQVATVLERYPRFMKRFPTVKKL 68 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A +D+VL W GLGYY+RARNLH AQQ+ GKFP+ + L G+GRSTAGAI Sbjct: 69 AAADVDDVLAEWAGLGYYSRARNLHACAQQIVREFAGKFPQDPALLEQLKGIGRSTAGAI 128 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVERFNQA 183 + + + PILD NVKR+LAR +A+ G K V + LW L+ ++ P + + QA Sbjct: 129 AAFAFHERAPILDANVKRILARLFAIEGAIQDKAVNDSLWKLATELLPLKPQDMPTYTQA 188 Query: 184 MMDLGAMICTRSKPKC----SLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 +MD GA CT KP C CP C A ++ P K K PE + LLL+ Sbjct: 189 LMDFGATWCTSRKPVCLSGEKKCPFAKDCQANLSDQVLALPKKVIKSKSPEFDCHMLLLR 248 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQ---------FADEESLRQWLAQRQIAADN----LTQ 286 + VLL +RP +WGGL+ P+ F EE L + + L Sbjct: 249 SGNSVLLQKRPDKAIWGGLWSLPESPWAPRDLSFLKEELSSSNLLSLTLPEEKSVLFLRN 308 Query: 287 LTA------FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 T +H F+H L + +W + T N W L Q GL P++ L Sbjct: 309 CTPPNRGFYIKHVFTHRCL-WMQIWEVNAMKTIPFTNPNLKWVPLKQLGRHGLPQPIKVL 367 Query: 341 LQQL 344 LQ L Sbjct: 368 LQGL 371 >UniRef50_D0MK94 A/G-specific adenine glycosylase n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MK94_RHOM4 Length = 383 Score = 230 bits (586), Expect = 7e-59, Method: Compositional matrix adjust. Identities = 123/300 (41%), Positives = 175/300 (58%), Gaps = 9/300 (3%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 ++DWY ++ R LPW+ + PY++W++EVMLQQT+V PY+ERF+ FPTV LA A Sbjct: 25 LIDWYRRHARD-LPWRRTRDPYRIWVAEVMLQQTRVDQAGPYYERFLRAFPTVEALAAAS 83 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 LD+VL W GLGYYARARNLH+AA+Q+ HGG+ P T+E + LPGVG TA A+ S++ Sbjct: 84 LDDVLRCWEGLGYYARARNLHRAARQLVAEHGGRLPTTYEALRRLPGVGPYTAAAVASIA 143 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G+ +LDGNV RVL R AV+ L +++ + FNQA+M+LGA Sbjct: 144 FGEPRAVLDGNVIRVLTRVLAVADDARASATRRALQEVADALISDEEPGTFNQALMELGA 203 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE-RTGYFLLLQHEDEVLLAQ 248 +CT +P+C+ CPL+ C A A +P + P+ ++P LL E VL+ + Sbjct: 204 TVCTPVQPRCNDCPLREVCRARAMGDPTAFPVQTPRASVPHYEVALGLLFNEEGAVLIQR 263 Query: 249 RPPSGLWGGLYCFPQFADE--ESL-----RQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 RP GL GGL+ FP E ESL R+ + + L RH ++HF + + Sbjct: 264 RPEDGLLGGLWEFPGGKREPGESLEAACARELHEELGVRVAVGPCLATVRHAYTHFRVTL 323 >UniRef50_B8HXE9 A/G-specific adenine glycosylase n=31 Tax=Cyanobacteria RepID=B8HXE9_CYAP4 Length = 386 Score = 227 bits (578), Expect = 6e-58, Method: Compositional matrix adjust. Identities = 128/310 (41%), Positives = 181/310 (58%), Gaps = 20/310 (6%) Query: 8 AQVLD-------WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 AQ+LD WY ++GR +LPW+ PY +W+SE+MLQQTQV TVIPY++R++A P Sbjct: 32 AQILDLQRSLLHWYRQHGR-SLPWRETSDPYAIWVSEIMLQQTQVQTVIPYYQRWLAALP 90 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ +A A +VL LW GLGYY+RARNLH+AAQ + G+FP E V LPG+GR+ Sbjct: 91 TIATVAAAEQQQVLKLWQGLGYYSRARNLHQAAQLIQQEFAGQFPSQLEAVLKLPGIGRT 150 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAG ILS + + ILDGNVKRVLAR A+ P ++ + LW S+++ F Sbjct: 151 TAGGILSSAFAQPVAILDGNVKRVLARLLALPVPP--RKAKGFLWQWSDRLLDRTQPREF 208 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQ 239 NQA+MDLGA +C KP C LCP N C A N + P + LP ++ G ++ Sbjct: 209 NQALMDLGATVCVPKKPDCPLCPWSNHCQALQLNLQSELPVTETAAPLPHKSIGVAVIWN 268 Query: 240 HEDEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIAADNLTQ------LTAFR 291 E+L+ +RP GL GGL+ FP + E++ + QR+I + + L Sbjct: 269 DRGEILIDRRPQKGLLGGLWEFPGGKIEPGETVMACI-QREIREELAIEIEVGEPLITID 327 Query: 292 HTFSHFHLDI 301 H ++HF + + Sbjct: 328 HAYTHFKVTL 337 >UniRef50_B8FCI1 A/G-specific adenine glycosylase n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FCI1_DESAA Length = 369 Score = 227 bits (578), Expect = 6e-58, Method: Compositional matrix adjust. Identities = 115/313 (36%), Positives = 182/313 (58%), Gaps = 14/313 (4%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + FS +L WY++ R LPW+ PY +W+SE+MLQQTQV TVIPYF R+M FP + Sbjct: 12 SKDFSRSLLRWYEENAR-DLPWRRTSDPYAIWVSEIMLQQTQVKTVIPYFLRWMDAFPNI 70 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 + LA APLD+VL +W GLGYY+RARN+HKAA+++ G+ P T++ + LPG+G TA Sbjct: 71 SSLAEAPLDDVLKMWEGLGYYSRARNMHKAAKEIMDRLDGRMPRTYKGLLELPGIGAYTA 130 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GA+ S++ + P++D NVKRV AR + + ++ L+E + P+ FNQ Sbjct: 131 GAVCSIAYNQDVPLVDANVKRVFARILDMEKPVEQTAATREIRGLAESLIPSGKAGLFNQ 190 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAAN---NSWALYPGKKPKQTLPERTGYFLLLQ 239 A+M+LGA++CT P C CP+ C+A +S + P KK Q L G + Sbjct: 191 ALMELGALVCTPKNPDCKGCPVSVHCLALKEQTVDSRPVLPPKKKTQALEVSAGVCV--- 247 Query: 240 HEDEVLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRH 292 + ++L+ +R P GL GL+ FP + ++ +R++ + +I + ++T +H Sbjct: 248 RDRKILIQKRLPKGLMAGLWEFPGGKLNPGESPEQALVREFAEELEIDIECGEKITVIQH 307 Query: 293 TFSHFHLDIVPMW 305 ++ F + + W Sbjct: 308 AYTRFRVRLHVFW 320 >UniRef50_C5EQF4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C5EQF4_9FIRM Length = 391 Score = 226 bits (575), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 125/324 (38%), Positives = 179/324 (55%), Gaps = 15/324 (4%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WY R +LPW+ D PY+VW+SE+MLQQT+V V PYFERFM FP V DLA A Sbjct: 36 LLAWYGSRAR-SLPWRDDPKPYRVWISEIMLQQTRVEAVKPYFERFMKAFPEVRDLAGAE 94 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D ++ +W GLGYY RARNL AA+ V +GG P +FEE+ LPG+G TAGAI S++ Sbjct: 95 DDYLMKMWEGLGYYNRARNLKAAARMVMEEYGGCLPASFEELIKLPGIGSYTAGAIASIA 154 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G P +DGNV RV++R G K V+ + + V P+ R+NQ ++++GA Sbjct: 155 FGIPMPAVDGNVLRVISRVLGDRGDIRKASVKAGMEQELKAVMPSGDASRYNQGLIEIGA 214 Query: 190 MICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT--LPERTGYFLLLQHEDEVLL 246 ++C +P+C CPL + C+ N W P K P + + ERT +++++D+V + Sbjct: 215 LVCIPGGEPRCGECPLASVCLTRRNGWWKEIPYKSPNKARKIEERT--VFIIEYQDKVAI 272 Query: 247 AQRPPSGLWGGLYCFPQ---FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +RP +GL GLY FP D + + L +T L A +H FSH + Sbjct: 273 HKRPMNGLLAGLYEFPNACGHPDADEIPAVLGMNPGCIKAVTPLPAAKHVFSHVEWHMTG 332 Query: 304 MWLPVSS------FTGCMDEGNAL 321 + V F G +EG + Sbjct: 333 YRVVVDEAAADPVFPGGREEGKGV 356 >UniRef50_D2R2B9 HhH-GPD family protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2B9_9PLAN Length = 398 Score = 225 bits (574), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 123/317 (38%), Positives = 177/317 (55%), Gaps = 17/317 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F ++L W+ ++ R LPW+ + Y++W+SEVMLQQTQVATVI YF+RF+ FPTV Sbjct: 27 ARFQQRMLAWFAEHARD-LPWRRSRDLYRIWISEVMLQQTQVATVIDYFQRFLVEFPTVV 85 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VL W GLGYY RAR+LH AA+++ G+FPET +EV +LPG+GR TAG Sbjct: 86 HLAEADEQQVLRQWEGLGYYRRARSLHAAAKKIVHDFRGQFPETLDEVMSLPGIGRYTAG 145 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS+ L PIL+ N RV AR + + LW+++EQ+ P V FNQA Sbjct: 146 AILSIGLDAKLPILEANTIRVYARLAGYTREATSTAGQKFLWNIAEQILPDKKVGFFNQA 205 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 MM+LG+ +CT P C CP C+A + P + T +L+ + Sbjct: 206 MMELGSALCTPRTPSCDQCPASTWCVARREQAVEQIPLLSKRMVYEYLTEVAVLVVDRQQ 265 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEES-----------LRQWLAQR-----QIAADNLTQL 287 VLL Q P W GL+ FP+F+ E++ L +L +R IA+ Q Sbjct: 266 VLLRQCQPGERWAGLWDFPRFSLEKTMLGAKSTTETDLENYLIERVESLTGIASMIHDQW 325 Query: 288 TAFRHTFSHFHLDIVPM 304 + +H + + + + PM Sbjct: 326 PSLKHGVTKYRITLRPM 342 >UniRef50_B7GII9 A/G-specific adenine glycosylase n=76 Tax=Bacillaceae RepID=B7GII9_ANOFW Length = 373 Score = 225 bits (574), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 116/302 (38%), Positives = 178/302 (58%), Gaps = 10/302 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QF ++DW+ + ++ LPW+ DK PYK+W+SEVMLQQT+V TVIPYF +F+ +FPT+ Sbjct: 28 EQFQHDLIDWF-RTEQRDLPWRKDKDPYKIWVSEVMLQQTRVDTVIPYFYQFIEKFPTLD 86 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A +EVL W GLGYY+R RNLH A ++V +GG+ P + E+ ++L GVG T G Sbjct: 87 ALADAKEEEVLKAWEGLGYYSRVRNLHAAVKEVKEKYGGRVPASKEQFSSLKGVGPYTTG 146 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+LS++ G P +DGNV RVL+R + ++ + K + + FNQA Sbjct: 147 AVLSIAYGIPEPAVDGNVMRVLSRIFYITDDIARASTRKKFEQIVSCIISHDDPSDFNQA 206 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY--FLLLQHE 241 +M+LGA++CT P C LCP+Q C A A A P K K P+ + +L E Sbjct: 207 LMELGALVCTPKNPSCFLCPVQRHCRAFAEGVEAELP-VKTKGKAPKHVAFRAIVLTNEE 265 Query: 242 DEVLLAQRPPSGLWGGLYCFPQ-----FADEESLRQWLAQR-QIAADNLTQLTAFRHTFS 295 ++ + +RP SGL L+ FP ++E++ + +++R + ++ ++ F H FS Sbjct: 266 GKIRIEKRPSSGLLANLWQFPNDEYAPTSNEQAFIKEISERYHVVLRSIQRIGTFEHVFS 325 Query: 296 HF 297 H Sbjct: 326 HI 327 >UniRef50_C1ZJ43 A/G-specific DNA-adenine glycosylase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ43_PLALI Length = 381 Score = 225 bits (574), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 110/264 (41%), Positives = 159/264 (60%), Gaps = 1/264 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 MQ ++F Q+L WY K+GR LPW+ PY +W+SE+MLQQT V VIPYFERFMA+FP Sbjct: 1 MQKTKFQKQLLAWYAKHGR-PLPWRASHDPYSIWISEIMLQQTTVTAVIPYFERFMAKFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 +V LA+AP +EVL LW GLGYY+RARNLH++A+ + + G FP++ E++ LPG+GR Sbjct: 60 SVQALASAPEEEVLKLWEGLGYYSRARNLHQSARVLMERYQGVFPQSVEQLLELPGIGRY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S + PI++ N +R+ AR G + LW +E + + Sbjct: 120 TAGAISSFAFRLPAPIVEANTQRLYARILGYDGDLKNAAGQKALWGFAESIVSGKEPDLI 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LG+++C P C CP+Q C A A P + + + LL+++ Sbjct: 180 NQALMELGSLVCKPIDPLCDQCPVQQHCRAFQEARQAEIPRAQARPVITPLVDATLLIEY 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQF 264 + E+ L QR W GL+ FP++ Sbjct: 240 QGELFLRQREKPERWAGLWDFPRY 263 >UniRef50_C8WEE4 A/G-specific adenine glycosylase n=3 Tax=Zymomonas mobilis RepID=C8WEE4_ZYMMN Length = 373 Score = 225 bits (573), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 116/302 (38%), Positives = 180/302 (59%), Gaps = 21/302 (6%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDK-----TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 + + +L+WY ++ R LPW+ + PY+VWLSE+MLQQT A PY+ +F+ R+ Sbjct: 9 EITTDLLNWYQRHAR-ILPWRTEAQQNKVDPYRVWLSEIMLQQTTTAHAAPYYLKFVERW 67 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PTV LA A +V+ W GLGYY+RARNL K A++V GGKFP+ + + ALPG+GR Sbjct: 68 PTVEALAAAQEADVMAEWAGLGYYSRARNLIKCAKEVVA-SGGKFPDNEQGLLALPGIGR 126 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAV-SGWPGKKEVENKLWSLSEQVTPAVGVE 178 TA AI++++ GK ++D NV+RV++R +A+ + P + + + ++++TP + Sbjct: 127 YTAAAIVAIAFGKRAVVVDANVERVVSRLFAIETPLPASRPI---IAEETDKLTPDLAAG 183 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 F QAMMD+GA IC +P C++CP+ C + A++P K PK+ P R GY ++ Sbjct: 184 DFAQAMMDIGATICVNRQPTCAICPMMPHCEGQKTGNPAVFPIKAPKKIRPTRIGYAFIV 243 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI---AADNLTQLTAFRHTFS 295 +D++LL +RP GL GG+ P +WL Q +I A+D T+ A + F Sbjct: 244 TSKDQILLIRRPDKGLLGGMRALPS-------SEWLDQSKITTTASDAETEQQAIKEAFP 296 Query: 296 HF 297 F Sbjct: 297 SF 298 >UniRef50_Q67NN1 A/G-specific adenine glycosylase n=1 Tax=Symbiobacterium thermophilum RepID=Q67NN1_SYMTH Length = 365 Score = 223 bits (569), Expect = 6e-57, Method: Compositional matrix adjust. Identities = 126/317 (39%), Positives = 180/317 (56%), Gaps = 25/317 (7%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + ++LDWY R LPW+ + PY +W+SE+MLQQT+V TVIPY+ER+MARFPT+ L Sbjct: 1 MADRLLDWYSASARD-LPWRRTRDPYHIWVSEIMLQQTRVETVIPYYERWMARFPTLEAL 59 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+AP ++VL W GLGYY+RARNLH AA++V +GG P+ + VA+L G+G TAGAI Sbjct: 60 ADAPEEQVLKAWEGLGYYSRARNLHAAAREVVARYGGTVPDDPDAVASLKGIGPYTAGAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS++ + P +DGNV RV+AR YA+ + + L + P FNQA+M Sbjct: 120 LSIAFNRPVPAVDGNVLRVIARLYAIVDDIAQLATRRTIEELVRAMIPQDRPGDFNQALM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIA--AANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 DLGA ICT +P+C LCP+++ C A + G+ L ER +++ E Sbjct: 180 DLGATICTPRRPRCLLCPVRDLCEGRRAGRETELPVKGRAKAPRLVERAA--AVIEREGR 237 Query: 244 VLLAQRPPSGLWGGLYCF----------PQFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 +L+ +RP GL GL+ P+ A + LR L ++L +T H Sbjct: 238 LLIVRRPQEGLLAGLWELPGGDVPPGVGPEEALQTLLRNALGAEVAVGEHLADVT---HV 294 Query: 294 FSH-------FHLDIVP 303 FSH F ++VP Sbjct: 295 FSHLVWHLRCFRAEVVP 311 >UniRef50_A7HQZ1 A/G-specific adenine glycosylase n=45 Tax=Alphaproteobacteria RepID=A7HQZ1_PARL1 Length = 615 Score = 222 bits (565), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 141/348 (40%), Positives = 190/348 (54%), Gaps = 46/348 (13%) Query: 7 SAQVLDWYDKYGRKTLPWQIDK----TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +A +L WYDK+ R LPW+ K PY VWLSE+MLQQT VATV PYF F+ R+P V Sbjct: 248 AAPLLAWYDKHAR-VLPWRARKGERADPYAVWLSEIMLQQTTVATVGPYFTGFLKRWPNV 306 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA AP +EV+ W GLGYY+RARNLH A++V++ +GGKFP+T E + +LPG+G TA Sbjct: 307 EALAAAPQEEVMKAWAGLGYYSRARNLHACAKEVSSEYGGKFPDTVEGLESLPGIGPYTA 366 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAV-SGWPGKK-EVENKLWSLSEQVTPAVGVERF 180 AI +++ G+ ++DGNV+RV+AR + + + P K ++ K +L TP F Sbjct: 367 AAIAAIAFGRAATVVDGNVERVVARLFEIETPLPAAKPDIREKARTL----TPEQRAGDF 422 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QAMMDLGA ICT P C+ CP+ + C A A + L P + PK+ P R G L Sbjct: 423 AQAMMDLGATICTPRSPACNRCPINDLCDARAAGTQNLLPARAPKKARPTRRGACFWLVR 482 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL----------AQRQIAADNLTQLTA- 289 E V L +RP GL GG+ P +ES R +R +A + L Sbjct: 483 EGHVWLRRRPDKGLLGGMLEVPGTPWDESDRHRTVIELSHDENGGRRGVAGEVLDHAPME 542 Query: 290 ---------FRHTFSHFHLD-----------IVP----MWLPVSSFTG 313 HTF+HFHL+ IVP MW+P+ G Sbjct: 543 AEWRLVPGLVEHTFTHFHLELEVFTATTRKKIVPGREGMWVPLEEVAG 590 >UniRef50_C0QGE4 MutY n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGE4_DESAH Length = 364 Score = 221 bits (562), Expect = 4e-56, Method: Compositional matrix adjust. Identities = 119/308 (38%), Positives = 174/308 (56%), Gaps = 16/308 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L WY + R+ LPW+ PY +W+SEVMLQQTQV TV+PY+ RFM FPTV Sbjct: 3 EIQTALLKWYHTHHRE-LPWRKTANPYSIWVSEVMLQQTQVKTVVPYYGRFMEAFPTVAS 61 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A L++VL LW GLGYYARARN HKA Q V T G P + ALPGVG A A Sbjct: 62 LARADLEQVLKLWEGLGYYARARNFHKACQTVTTDLKGIIPRDLKGFKALPGVGDYIAAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 +LS++ ++DGNVKRVLAR + V+ P K+ + K + ++ P F Sbjct: 122 VLSIAFNIPLAVVDGNVKRVLARVFTMDDPVNHGPSHKKFQAKADLILDRSCPGA----F 177 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LGA++C+ P C++CPL C A ++S YP + + +P +++ Sbjct: 178 NQAVMELGALVCSPRNPGCTICPLGQYCRALESDSVDKYPLRNKTKPVPTVHIAAGVVKK 237 Query: 241 EDEVLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 D++L+ R P GL GGL+ FP + A +R+ + + + L +H Sbjct: 238 GDKLLITLRKPEGLLGGLWEFPGGKVKTGEQASSACVRELAEETGLRVAVTSHLARVKHA 297 Query: 294 FSHFHLDI 301 ++HF +++ Sbjct: 298 YTHFKIEM 305 >UniRef50_Q2KBH8 A/G-specific adenine glycosylase protein n=11 Tax=Alphaproteobacteria RepID=Q2KBH8_RHIEC Length = 367 Score = 221 bits (562), Expect = 4e-56, Method: Compositional matrix adjust. Identities = 135/336 (40%), Positives = 185/336 (55%), Gaps = 34/336 (10%) Query: 10 VLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 +LDWYD++ R LPW+I PY+VWLSEVMLQQT V V PYFE+F+ R+P Sbjct: 14 LLDWYDRHHRD-LPWRISPGMAARGVKPDPYRVWLSEVMLQQTTVQAVKPYFEKFLERWP 72 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 VTDLA A D V+ W GLGYYARARNL K A+ VA HGG FP+T E + +LPG+G Sbjct: 73 EVTDLAAAESDAVMATWAGLGYYARARNLKKCAEAVAKEHGGVFPDTEEGLKSLPGIGDY 132 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ +++ + ++DGNV+RV++R YA+ P K +L +TP+ Sbjct: 133 TAAAVAAIAFNRQAAVMDGNVERVISRLYAIEAPLPAAKPAMKAKVAL---LTPSGRPGD 189 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTG-YFLLL 238 F QAMMDLGA ICT +P CSLCP +N C A + L+P K K+ P R G F+ + Sbjct: 190 FAQAMMDLGATICTPKRPACSLCPFRNACQALKLSDPELFPVKAAKKEKPVRHGAAFIAV 249 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFR------- 291 + E+LL +R SGL GG+ P A W A ++ + + F Sbjct: 250 SGDGEILLRRRAESGLLGGMTEVPTTA-------WTA--RLDGETSSAAAPFEAAWQACG 300 Query: 292 ---HTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYN 324 H F+HF L + + +++ D N W+ Sbjct: 301 TVIHVFTHFELRLSIWRVAIAAKPKSRDGVNDEWWE 336 >UniRef50_B3EUI2 A/G-specific adenine glycosylase n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3EUI2_AMOA5 Length = 373 Score = 219 bits (559), Expect = 1e-55, Method: Compositional matrix adjust. Identities = 112/304 (36%), Positives = 176/304 (57%), Gaps = 19/304 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ ++++WY + R LPW+ K PYK+WLSE++LQQT+VA +PY++RF+ +PT+ DL Sbjct: 20 FAIKLIEWYQHHHR-ALPWRETKDPYKIWLSEIILQQTRVAQGLPYYQRFIENYPTIHDL 78 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +L +W GLGYY RARNLH A+ + T GKFP ++ + +LPG+G TA AI Sbjct: 79 ASASETAILRVWQGLGYYTRARNLHACARTIVTQFQGKFPNNYKALLSLPGIGVYTAAAI 138 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ + P++DGNV RVLAR + + + ++ L++ + + +NQA+M Sbjct: 139 ASIAFKEPIPVIDGNVYRVLARIFDIETAINSTKGKHIFNQLAQTLISKTAPDIYNQAIM 198 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ-HEDEV 244 + GA+ CT KP C+ C + C A N L P K+ K + +R ++L +Q +D++ Sbjct: 199 EFGAIQCTPLKPLCNTCIFKMDCSAFLANKQHLLPVKEAKVKIKQRFFHYLCIQLDDDQL 258 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL------------TAFRH 292 + R P +W GLY F + EES R+ Q + D L QL T ++H Sbjct: 259 FMKSRKPGDIWTGLYDF--YLVEESERKEFDQLE---DELVQLIKKHQLYIEKVPTVYKH 313 Query: 293 TFSH 296 +H Sbjct: 314 ILTH 317 >UniRef50_A6CCE7 A/G-specific adenine glycosylase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCE7_9PLAN Length = 408 Score = 219 bits (557), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 111/264 (42%), Positives = 151/264 (57%), Gaps = 1/264 (0%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +F Q+ WY + R LPW+ P+ VW+SE+MLQQT VA VIPYF+RFM+RFP V Sbjct: 15 KFRRQLQSWYVSHQRD-LPWRRQHDPHAVWISEIMLQQTVVAAVIPYFKRFMSRFPDVET 73 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A EVL W GLGYY+RARN+HKAA+++A G+FP E + LPG+GR TAGA Sbjct: 74 LAAADESEVLQHWEGLGYYSRARNIHKAAKRIAGELEGRFPRDVESLQKLPGIGRYTAGA 133 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S + PI++ N R+ +R + P K +N+LW +E + P FNQA+ Sbjct: 134 ICSFAYDTRAPIVEANTLRLYSRLIGLEEDPRSKSGQNQLWEFAELILPRKSPGEFNQAL 193 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+++CT P C CP+ GC A L P K + + T + + V Sbjct: 194 MDLGSLVCTPQNPGCEDCPVNAGCEAFLRQRQHLIPVPKVRPEITPLTDVSIAVFSGSHV 253 Query: 245 LLAQRPPSGLWGGLYCFPQFADEE 268 ++ QR W GL+ FP+ EE Sbjct: 254 MIRQRTAGERWAGLWDFPRLTLEE 277 >UniRef50_Q2S0Q6 A/G-specific adenine glycosylase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S0Q6_SALRD Length = 354 Score = 219 bits (557), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 118/298 (39%), Positives = 164/298 (55%), Gaps = 9/298 (3%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +LDWYD + ++++PW+ PY++W+SE+MLQQT+V TV Y+ RF+ FPTV LA+A Sbjct: 1 MLDWYDTH-KRSMPWRETDDPYRIWVSEIMLQQTRVDTVRDYYHRFLEAFPTVEALADAD 59 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D VL W GLG+YARAR+LH AAQ V H G P T + + L GVG TA A+LS++ Sbjct: 60 RDTVLKHWEGLGFYARARHLHTAAQHVVDEHDGTVPSTMDAIKDLKGVGPYTAAAVLSIA 119 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 K +LDGNV RVL+R +AV E L L+ ++ FNQAMM+LGA Sbjct: 120 YRKPHAVLDGNVTRVLSRVFAVDEDATTSAAEGHLRDLANELLDPDRPGDFNQAMMELGA 179 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP-ERTGYFLLLQHEDEVLLAQ 248 ++CT P C CPL C A + YP + +P E L+ D +L+ + Sbjct: 180 LVCTPRTPHCDRCPLNAVCRAHDAGTEEDYPITPESEPVPHEDIAVGLVFDDNDRLLIQR 239 Query: 249 RPPSGLWGGLYCFP---QFADEE----SLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 RP GL GGL+ FP Q DE R+ + + ++ H +SHF + Sbjct: 240 RPDEGLLGGLWEFPGGKQEGDESMEAACRREVREELGVGMTDVEPFYTLSHAYSHFKI 297 >UniRef50_C9LNH1 A/G-specific adenine glycosylase n=1 Tax=Dialister invisus DSM 15470 RepID=C9LNH1_9FIRM Length = 351 Score = 218 bits (556), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 125/337 (37%), Positives = 185/337 (54%), Gaps = 13/337 (3%) Query: 9 QVLDWYDKYGRKTLPWQIDK--TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 ++L W+D+ R+ LPW+ + PY VW+SE+MLQQT+ V PYFE + RFPT+ LA Sbjct: 13 KLLAWFDQ-NRRDLPWREGRPRNPYYVWVSEIMLQQTRTEAVKPYFESWKRRFPTIEALA 71 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A +VLH W GLGYY+RARNLHKAA+++A +GG PE ++V ALPG+G TAGAIL Sbjct: 72 EAKEADVLHAWQGLGYYSRARNLHKAAREIAEKYGGAIPEDKKDVRALPGIGEYTAGAIL 131 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S++ GKH +DGNV RV AR Y + K ++ +L E+ PA + FN+A+MD Sbjct: 132 SMAYGKHEAAVDGNVLRVYARLYGIESDILKSAGRKEITTLVEKTLPARAGD-FNEALMD 190 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LG+ +C PKC CPL C A + P + PK+ E ++ E + L Sbjct: 191 LGSEVCVPKHPKCEKCPLHGECAALRLGLENVLPIRTPKKKQKESIVCCAVMIKEGKALF 250 Query: 247 AQRPPSGLWGGLYCFPQ-FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 RP G+ ++ P +D E + ++ + ++ + H F+H +W Sbjct: 251 HLRPSKGMLSSMWELPMVLSDSEKDGVKMLEKLLGGKAGERIWTYTHVFTHR------IW 304 Query: 306 -LPVSSFTGCMD-EGNALWYNLAQPPSVGLAAPVERL 340 + F G + EG+ WY+ + + LA P +L Sbjct: 305 HMKAYLFHGAKEPEGDYRWYSCEEYREIPLAGPHAKL 341 >UniRef50_C9MR28 A/G-specific adenine glycosylase n=4 Tax=Prevotella RepID=C9MR28_9BACT Length = 343 Score = 218 bits (554), Expect = 3e-55, Method: Compositional matrix adjust. Identities = 116/263 (44%), Positives = 156/263 (59%), Gaps = 11/263 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F++ +L WYD + LPW+ K Y +WLSEV+LQQT++ + Y++ FM ++PTV DL Sbjct: 3 FASTLLLWYDN-NERPLPWRETKDAYAIWLSEVILQQTRIVQGMSYWQHFMRQWPTVYDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA DEVL W GLGYY+RARNLHKAAQQV L G FP T +E+ L GVG TA AI Sbjct: 62 ANASEDEVLKAWQGLGYYSRARNLHKAAQQVVELKG--FPHTAKELKKLKGVGEYTAAAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 S + + ++DGNV RVL+R + GKKE + SLS+ + P R+N Sbjct: 120 ASFAFDEKIAVVDGNVYRVLSRYKGIDTPIDTTSGKKEFQ----SLSQTLLPTHDSARYN 175 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MD GA+ CT + P C +CPL C+A P K K ER ++ +Q+ Sbjct: 176 QAIMDFGAIQCTPTSPHCDICPLCESCVAFREQRINQLPVKSKKIHQQERHLTYIYIQYN 235 Query: 242 DEVLLAQRPPSGLWGGLYCFPQF 264 DE++L QR +W GL+ PQ Sbjct: 236 DEIVLHQRGAGDIWQGLWELPQL 258 >UniRef50_A3EPM8 Putative A/G-specific DNA glycosylase n=2 Tax=Leptospirillum sp. Group II RepID=A3EPM8_9BACT Length = 355 Score = 218 bits (554), Expect = 4e-55, Method: Compositional matrix adjust. Identities = 133/348 (38%), Positives = 189/348 (54%), Gaps = 38/348 (10%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L WY + R +LPW+I++TPY+VW+SE+MLQQT V V+ YFERF+ RFP V Sbjct: 25 RLRDDLLVWYKEVSR-SLPWRINRTPYRVWVSEIMLQQTTVRAVLGYFERFLERFPDVDS 83 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP+++VL LW GLGYY RARNLH+AA+ +A+ G FPET E LPGVGRSTAGA Sbjct: 84 LAEAPVEDVLKLWEGLGYYQRARNLHRAARIIAS---GGFPETVEGWMNLPGVGRSTAGA 140 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVEN-KLWSLSEQ-VTPAVGVERFNQ 182 + S++LG+ PILD NV+RVL R +S PG E+ LW LS+ VT A NQ Sbjct: 141 VCSIALGQETPILDVNVRRVLGRLRGIS--PGDAVRESPDLWELSKAFVTEASDPGEVNQ 198 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT--------LPERTGY 234 A+M++GA++C KP C++CP C + L P +K K+ +P Sbjct: 199 ALMEIGAVVCLPRKPLCTVCPWSLDCASCGAPEEILNPPRKKKEKQIRIRTALIPSDGSG 258 Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 + L+Q D +L GLW D S+ + Q+ + H + Sbjct: 259 YFLVQGRDRLL------EGLW----------DVFSVSGPPGEGQMPFGKVL------HEY 296 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 SHF ++ + S + E ++LW + V L ++L+ Sbjct: 297 SHFREEVFLVREERSLLEAALGETSSLWLAKGEESPVALTGVARKILR 344 >UniRef50_A2RLW5 A/G-specific adenine glycosylase n=4 Tax=Lactococcus lactis RepID=A2RLW5_LACLM Length = 386 Score = 217 bits (552), Expect = 6e-55, Method: Compositional matrix adjust. Identities = 114/268 (42%), Positives = 156/268 (58%), Gaps = 14/268 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F +L WYD +K LPW+ PYK+W+SE+M QQTQV TVIPYFERFM ++PT Sbjct: 7 KIKEFQKDLLSWYDA-NKKPLPWRQTTEPYKIWISEIMSQQTQVETVIPYFERFMKKYPT 65 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A E+L LW GLGYY+RARNL AAQ+V + GKFP+ +E+ +L G+G T Sbjct: 66 VESLAQADDTELLKLWEGLGYYSRARNLKIAAQEVVNEYNGKFPDNLKEILSLRGIGPYT 125 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGV 177 A AI S+S P +DGN+ RV +R + +S +K + L +L + P Sbjct: 126 AAAIASISFDLAEPAIDGNLMRVTSRIFELECDISKSSSRKIFDEHLRTLVSKKRPG--- 182 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 FNQ +MDLG+++C+ KPKC CPL C A A+ YP K K L ++ Y+ Sbjct: 183 -DFNQGLMDLGSLVCSPKKPKCETCPLNKYCGAVADGKQLNYPVKTKK--LKQKELYYTA 239 Query: 238 LQHED---EVLLAQRPPSGLWGGLYCFP 262 E+ E L +RP GL ++ FP Sbjct: 240 FALENSLGEYYLEKRPSKGLLADMWTFP 267 >UniRef50_C5VJP5 A/G-specific adenine glycosylase n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VJP5_9BACT Length = 334 Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 112/258 (43%), Positives = 157/258 (60%), Gaps = 3/258 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L W+ GR +LPW+ K Y +WLSEV+LQQT++ + Y+ERFMA++PTV DL Sbjct: 3 FAVTLLQWFKNNGR-SLPWRETKDAYAIWLSEVILQQTRIVQGMSYWERFMAKWPTVNDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A +EVL W GLGYY+RARNLH AAQQV L G FP+TF+E+ L GVG TA AI Sbjct: 62 AAATENEVLKAWQGLGYYSRARNLHTAAQQVMEL--GGFPQTFKELKTLKGVGDYTAAAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ G+ ++DGNV RVL+R Y + E + + +L++ + P +N+A+M Sbjct: 120 ASIAFGEPVAVVDGNVYRVLSRYYGIETPIDSTEGKKEFQTLAQSLLPINEPADYNEAIM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA CT + P CS CPL C+A P K K ER +L +++E ++ Sbjct: 180 DFGATQCTPNSPHCSACPLCETCVAFREQRIDELPVKSKKVKQRERHFTYLCIEYEGKIA 239 Query: 246 LAQRPPSGLWGGLYCFPQ 263 + QR +W GL+ FPQ Sbjct: 240 IHQRGAGDIWQGLWEFPQ 257 >UniRef50_A6TW33 A/G-specific adenine glycosylase n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TW33_ALKMQ Length = 352 Score = 215 bits (547), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 111/303 (36%), Positives = 173/303 (57%), Gaps = 8/303 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS Q+++W+ + ++ +PW+ K PY +W+SE+MLQQT+V TVI Y++ FM +FP Sbjct: 1 MTEESFSYQLIEWF-REEKRWMPWRETKDPYCIWVSEIMLQQTRVETVISYYQNFMKKFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A +EVL W GLGYY+R RNLH+AA ++ +H G P+ + + LPG+G Sbjct: 60 TIETLARASQEEVLKSWEGLGYYSRGRNLHRAANEIVLIHEGNVPKDKKILLKLPGIGPY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAILS++ + P +DGNV RV++R + + +K+V N++ L Q+ P F Sbjct: 120 TAGAILSIAYNQKEPAVDGNVLRVMSRLFNIQEDIMEKKVVNEVTDLVFQLMPQDNGGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 +A+M+LGA +C KP+C LCP+ N C A + P + K + L + + Sbjct: 180 TEALMELGATVCVPQKPRCRLCPVHNQCKAHHLDIQETLPIRIKKTKVKNYHKGILWMVY 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQF-----ADEESLRQWLAQRQI--AADNLTQLTAFRHT 293 +L+ Q P GL GGL+ P DE+++ Q Q ++ L + +H Sbjct: 240 NGTILVKQNPQKGLLGGLWALPTIDLMHKVDEKAVIQEDFQEEVGQVVVELEYIGKEKHV 299 Query: 294 FSH 296 F+H Sbjct: 300 FTH 302 >UniRef50_A0LLV9 A/G-specific DNA-adenine glycosylase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LLV9_SYNFM Length = 388 Score = 215 bits (547), Expect = 2e-54, Method: Compositional matrix adjust. Identities = 111/302 (36%), Positives = 170/302 (56%), Gaps = 8/302 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 Q +L W+D+ ++ LPW+ PY++W+SE+MLQQTQV T++PYF R+M RFP V Sbjct: 9 QIQTLLLSWFDE-NQRPLPWREKYRPYEIWISEIMLQQTQVKTMLPYFRRWMERFPDVQS 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 +A+A DEVL W GLGYY+RA N+ + A+ + HGG FP+ + +PG+G TAGA Sbjct: 68 IADAREDEVLKHWEGLGYYSRAVNIRRTAEIIVRHHGGTFPKAHSTILGMPGIGPYTAGA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ + P++DGNV+R+LAR + + +K +W+ +E++ PA +FNQA+ Sbjct: 128 ISSIAFNEDRPLVDGNVERILARLFNLDTPVEEKNTRKFIWNTAEELIPAGRARQFNQAL 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLGA +C +P C CPL C + + P ++ + +L H V Sbjct: 188 MDLGATVCLPRRPACEKCPLNGLCESRRMGTADRRPVTNRRKDIASIEVAVGILHHRGRV 247 Query: 245 LLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIAADNLT-----QLTAFRHTFSHF 297 L+ +RP SGL L+ FP + ES Q L + L +L + RH ++ F Sbjct: 248 LIQKRPASGLMPNLWEFPGGKIHPGESPEQALIREFREELELEVRCRERLASIRHNYTSF 307 Query: 298 HL 299 + Sbjct: 308 RV 309 >UniRef50_A0L468 A/G-specific DNA-adenine glycosylase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L468_MAGSM Length = 365 Score = 214 bits (546), Expect = 3e-54, Method: Compositional matrix adjust. Identities = 139/338 (41%), Positives = 194/338 (57%), Gaps = 21/338 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + ++L +YD+YGR LPW+ + Y++WLSE+MLQQT V TV+PY+E+F++ FP++T Sbjct: 8 ADLAQRLLAYYDEYGRD-LPWRQQQDLYRIWLSEIMLQQTGVKTVMPYYEKFLSHFPSIT 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A ++VL W GLGYY RAR LH+AAQQV HGG FPE +V ALPG+G STA Sbjct: 67 QLAAASQEQVLAQWQGLGYYRRARMLHQAAQQVVQQHGGLFPEEITQVQALPGIGPSTAA 126 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVER 179 AIL++ + ILDGNV RVLAR V PGK+ +LW ++ Q+T Sbjct: 127 AILAIGRNQAHTILDGNVMRVLARLLTLELPVDSTPGKQ----RLWQVARQLTSQQRPGD 182 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLL 238 + QA+MDLGA +CTRS+P CS CP C A + SWA YP K+ K+ P ++LL Sbjct: 183 YAQAIMDLGATLCTRSQPACSRCPWGGACAARQHGSWAEYPKKREKKPKPHHYQCMWVLL 242 Query: 239 QHEDEVLLAQRPPSGLWGGLY---------CFPQFADEESLRQWLAQRQIAADNLTQLTA 289 + + L +RP GL GGL+ P + L I L + Sbjct: 243 DTQQRIFLRKRPLEGLLGGLWEPLGEPLLETPPLGNLVQRASHHLTALGIQGQPLLEAQP 302 Query: 290 FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQ 327 H F+HF L + P+ + +S +++ N W+ LAQ Sbjct: 303 VDHIFTHFRLTVYPILVVAASGAPILNDAN--WWPLAQ 338 >UniRef50_B9XFP7 A/G-specific adenine glycosylase n=1 Tax=bacterium Ellin514 RepID=B9XFP7_9BACT Length = 392 Score = 214 bits (546), Expect = 3e-54, Method: Compositional matrix adjust. Identities = 110/317 (34%), Positives = 170/317 (53%), Gaps = 17/317 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++ ++ +L W+ + R LPW+ + PY +W+SE+MLQQTQV TVIPY+E +M P Sbjct: 34 LKIHALASALLHWFSQSARD-LPWRRTRDPYAIWVSEIMLQQTQVKTVIPYWEHWMQNLP 92 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A + + LW GLGYY R RN+ +AAQ++ + H G FP F+ + AL G+GR Sbjct: 93 TIQSLAEAAPERIHKLWEGLGYYTRVRNMQQAAQEIMSRHDGSFPSDFDSILALKGIGRY 152 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV----- 175 TAGAI S++ + P+LDGNV RVL R + ++ P K +LWSL+E + + Sbjct: 153 TAGAIASIAFNQPKPLLDGNVIRVLTRLFGIAENPRDKITNEQLWSLAEALVVSASNSNN 212 Query: 176 ------GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 NQ++M+LGA+ICT +P+C +CP++ CIA P + Sbjct: 213 NKSHPSACSHLNQSLMELGALICTPRQPQCLICPVRQDCIAHHTGKVESLPNLGERTAAT 272 Query: 230 ERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL----- 284 +R + +++H+ L+ QRP + L+ FP L QI L Sbjct: 273 QRKFFAFVIEHQHRFLVTQRPAGVVNAHLWEFPNLEVNPDLPHPTPATQIDPLRLTLHSP 332 Query: 285 TQLTAFRHTFSHFHLDI 301 T L +HT + + + + Sbjct: 333 TPLCTIKHTITRYRITL 349 >UniRef50_A6LDG5 A/G-specific adenine glycosylase n=33 Tax=Bacteroidetes RepID=A6LDG5_PARD8 Length = 359 Score = 214 bits (544), Expect = 5e-54, Method: Compositional matrix adjust. Identities = 114/302 (37%), Positives = 174/302 (57%), Gaps = 11/302 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + S +++WY+ Y R+ LPW+ + PY +W+SE++LQQT+V + YF RF RFP V Sbjct: 9 EISRILVEWYETYKRE-LPWRETRDPYIIWISEIILQQTRVVQGLEYFLRFTERFPDVAS 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A DEVL W GLGYY+RARNLH AA+ + G FPE ++EV +L G+G TA A Sbjct: 68 LAVAEEDEVLKYWQGLGYYSRARNLHAAAKSIMERFNGVFPENYKEVLSLKGIGEYTAAA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I+S + + P++DGNV RVL+R +AV + + + L+ + NQA+ Sbjct: 128 IVSFAWNQPCPVVDGNVYRVLSRLFAVDTPIDTTKGKKQFAELAGMILDPKNAGTHNQAI 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA+ C P C +CPL++ C+A A+ + YP K+ K +R ++L + ++++ Sbjct: 188 MELGALQCVPQNPDCGVCPLKDKCVAFASGNVQAYPVKQNKTKTRDRYFHYLYIIYKEQT 247 Query: 245 LLAQRPPSGLWGGLYCFPQFA-----DEESLRQWLAQRQIAAD----NLTQ-LTAFRHTF 294 + +R +W GLY FP D L + A R + D ++TQ L+ +HT Sbjct: 248 WMNRRTGKDIWTGLYEFPLIETDHAMDFSGLCETQAFRNLLGDAGKLSITQGLSNVKHTL 307 Query: 295 SH 296 SH Sbjct: 308 SH 309 >UniRef50_B1LXG3 A/G-specific adenine glycosylase n=74 Tax=Bacteria RepID=B1LXG3_METRJ Length = 464 Score = 214 bits (544), Expect = 5e-54, Method: Compositional matrix adjust. Identities = 129/310 (41%), Positives = 175/310 (56%), Gaps = 28/310 (9%) Query: 9 QVLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +L WYD++ R+ LPW+ PY++WLSEVMLQQT +A V PYFERF+ RFP + Sbjct: 36 DLLAWYDRH-RRVLPWRALAGAAPDPYRIWLSEVMLQQTTIAAVRPYFERFLTRFPDIFA 94 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP + V+ W GLGYY+RARNLH A+ VA GG+FP+T E + LPG+G TAGA Sbjct: 95 LAEAPEEAVMSAWAGLGYYSRARNLHACARTVAA-AGGRFPDTAEGLRKLPGIGAYTAGA 153 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 I +++ + +DGNV+RVL+R YAV PG + ++ L++ + P F QA Sbjct: 154 IAAIAFDRQEAAVDGNVERVLSRAYAVEAPLPGSRP---EIRRLTQALVPTDRPGDFAQA 210 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGK-KPKQTLPERTGYFLLLQHED 242 +MDLGA ICT +P C+LCP C A + + +P K K + R F+ ++ D Sbjct: 211 LMDLGATICTPKRPACALCPWMRPCRARSLGTQESFPRKIKVAKGALRRGAAFVAIRSGD 270 Query: 243 E-VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-TQLTA--------FRH 292 E VLL RPP GL G + P A W +AA L L A RH Sbjct: 271 EAVLLRTRPPEGLLGNMAEPPGSA-------WEPDYDVAAALLDAPLDARWKRLPGLVRH 323 Query: 293 TFSHFHLDIV 302 F+HF L++ Sbjct: 324 GFTHFPLELT 333 >UniRef50_D2MN98 Putative A/G-specific adenine glycosylase n=1 Tax=Bulleidia extructa W1219 RepID=D2MN98_9FIRM Length = 346 Score = 213 bits (543), Expect = 6e-54, Method: Compositional matrix adjust. Identities = 115/305 (37%), Positives = 164/305 (53%), Gaps = 17/305 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 Q +F Q++ WY + + PW++ K PY VW+SE+MLQQT++ TV+P +ERFM P Sbjct: 3 FQKKEFQ-QLIQWYQAH-HQNYPWRLTKDPYAVWISEIMLQQTRIETVLPKYERFMKELP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ DLA D ++H W GLGYY+RARNL KAA Q+ H G FP EE+ AL G+G Sbjct: 61 SIHDLATVSEDHLMHFWEGLGYYSRARNLQKAALQIEDRHHGIFPHQLEEIQALTGIGDY 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLAR---CYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 TAGAI S S G P +DGNV RV +R Y P K + + +Q+ P Sbjct: 121 TAGAIASFSFGVGVPAIDGNVLRVYSRHEGLYQNVLDPSVKSL------VKQQMLPLYTK 174 Query: 178 ER------FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER 231 E+ FNQA+M+LG IC P C CP+Q GC + ++ P + K + Sbjct: 175 EKHSDNGDFNQAIMELGEQICLPKNPDCQNCPIQKGCFSFLHHKQGELPIRISKTKKKKE 234 Query: 232 TGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFR 291 FL+ ++ +L+ +RP +GL LY + +L Q QI + L++ + Sbjct: 235 EHSFLIFYTKEAILVHKRPVNGLLANLYEPVNLDFFIDIEDFLQQHQIPVLSYQTLSSHK 294 Query: 292 HTFSH 296 H FSH Sbjct: 295 HIFSH 299 >UniRef50_Q5WIF8 A/G-specific adenine DNA glycosylase n=11 Tax=Bacillales RepID=Q5WIF8_BACSK Length = 385 Score = 213 bits (543), Expect = 7e-54, Method: Compositional matrix adjust. Identities = 121/307 (39%), Positives = 176/307 (57%), Gaps = 14/307 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F Q+++WY + R+ LPW+ PY +W+SE+MLQQT+V TVIPY+E+FM +FP + Sbjct: 37 SDFRRQLIEWYQAHKRE-LPWRESNDPYHIWVSEIMLQQTRVDTVIPYYEQFMRKFPEME 95 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A +E+L +W GLGYY+R RNL A ++V +G P+T +E+ L GVG TAG Sbjct: 96 DLAYAEEEEILKVWEGLGYYSRVRNLQAAVREVVEHYGSVVPDTRKEIEQLKGVGPYTAG 155 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK----LWSLSEQVTPAVGVER 179 AILS++ K P +DGNV RVL+R + + GK + K L+ L ++ P+ Sbjct: 156 AILSIAYAKAEPAVDGNVMRVLSRVFCMEDDIGKPQTRKKHEAILYELIDKSDPS----S 211 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE-RTGYFLLL 238 FNQ +M+LGA++CT + P C LCP++ C+A P K K+ FLL Sbjct: 212 FNQGLMELGALVCTPTSPGCLLCPVRTQCLAYERGQQERLPIKMKKKKAKSIELESFLLK 271 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQR-QIAADNLTQLTAFRHTF 294 + E+L+ +RP GL GL+ P D +Q LA++ I A+ +H F Sbjct: 272 TEKGELLIEKRPDKGLLAGLWQLPVLEGRFDPGERQQKLAEKYHIEAEPSAANFQVKHIF 331 Query: 295 SHFHLDI 301 SH +I Sbjct: 332 SHLIWEI 338 >UniRef50_D2RMB6 A/G-specific adenine glycosylase n=2 Tax=Acidaminococcus RepID=D2RMB6_ACIFE Length = 352 Score = 213 bits (542), Expect = 8e-54, Method: Compositional matrix adjust. Identities = 129/353 (36%), Positives = 188/353 (53%), Gaps = 18/353 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID--KTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 M+ + + +L W+D R+ LPW+ + + PY VW+SE+MLQQT+ TV YF+R+M + Sbjct: 1 MKRNDWPETLLAWFDG-SRRALPWREEHPRNPYHVWVSEIMLQQTRTETVKGYFQRWMEQ 59 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+ DLA AP ++VL W GLGYY+RARNLHKAA+QV GG+ P + + +L G+G Sbjct: 60 FPTIRDLAQAPEEQVLRAWQGLGYYSRARNLHKAARQVMAEWGGQLPRERKALGSLAGIG 119 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAV----SGWPGKKEVENKLWSLSEQVTPA 174 T GAILS++ G+ P +DGN+ RVL+R Y V SG GKK + +L+E+ P Sbjct: 120 AYTVGAILSMAFGEKIPAVDGNLLRVLSRLYGVEEDISGTQGKKTIT----ALAEEAIPG 175 Query: 175 VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY 234 FN+A+MDLGA +C P+C CPL C A P KKPK E Sbjct: 176 DRPGDFNEALMDLGAEVCIPRHPRCEACPLTAFCQAWKEGKTETLPVKKPKAPQKELAAA 235 Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFAD--EESLRQWLAQRQIAADNLTQLTAFRH 292 L+ E L +RP G+ ++ FP + E R L + + + + RH Sbjct: 236 CGLVVREGRYLFHKRPAKGMLASMWEFPMVLEPGAEKARSSLV-KLLETEAGPAVWQHRH 294 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 F+H ++ + + G G W++ + V LA P +L L+ Sbjct: 295 VFTHQIWNMTAYVMERAEVPG----GEWSWFSPEEWEKVPLAGPHAKLADWLK 343 >UniRef50_C8WS48 A/G-specific adenine glycosylase n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WS48_ALIAD Length = 382 Score = 213 bits (542), Expect = 8e-54, Method: Compositional matrix adjust. Identities = 116/281 (41%), Positives = 166/281 (59%), Gaps = 11/281 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ + WY + R LPW+ PY + +SE MLQQT+V TVIPY+ RFM RFPT L Sbjct: 8 FAHTLEAWYTQTSRD-LPWRRTADPYAILVSETMLQQTRVETVIPYYHRFMERFPTPLHL 66 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +D+VL +W GLGYY RARNL A + V HGG+ P+ +E+ ALPG+G T GA+ Sbjct: 67 ADADIDDVLKMWEGLGYYRRARNLKAAMEVVRDRHGGRIPDHPDELKALPGIGPYTLGAV 126 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVS---GWPG-KKEVENKLWSLSEQVTPAVGVERFN 181 LS++ + +P +DGNV RV++R A+ P K+++E + E+ TP V Sbjct: 127 LSIAFNRPYPAVDGNVLRVMSRYRAIEEPVDLPKVKRQIEQDVAETLERGTPRV----LT 182 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+M+LGA++CT KP+CS CP+ +GC A A+ P + PK+ +T L L Sbjct: 183 QALMELGALVCTPKKPRCSACPVVSGCAARAHGLTDALPKRLPKRARRRQTVVALWLTRG 242 Query: 242 DEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIA 280 D QRP GL GG++ P + D + +W A+ + A Sbjct: 243 DSFWAEQRPEGGLLGGMWQLPSVELDDPSTSLEWHARARFA 283 >UniRef50_A0M7B9 A/G-specific adenine glycosylase n=16 Tax=Bacteroidetes RepID=A0M7B9_GRAFK Length = 350 Score = 213 bits (541), Expect = 1e-53, Method: Compositional matrix adjust. Identities = 127/339 (37%), Positives = 184/339 (54%), Gaps = 24/339 (7%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 S ++ WY + ++ LPW+ PY +WLSE+MLQQT++ +PY+ +F+ +P+V DL Sbjct: 3 LSNRLTHWYLQ-NKRDLPWRKTHEPYHIWLSEIMLQQTRIEQGLPYYNKFIQAYPSVFDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA +EVL LW GLGYY+RARNLH+ A+ VA GKFP T++ + L GVG TA AI Sbjct: 62 ANATPEEVLKLWQGLGYYSRARNLHETAKYVAFELNGKFPGTYKGLLKLKGVGDYTASAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 S+ + ++DGNV RVL+R + ++ G KE + L ++ PA FN Sbjct: 122 ASICYNEPVAVVDGNVYRVLSRIFGIDTPINSAAGIKEFKLLAQELLDKNDPAT----FN 177 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+M+ GA+ C KPKC +CP + C+A N+ P K K + +R +L+ HE Sbjct: 178 QAIMEFGALHCKPQKPKCEICPFNDSCLALKNDKIKELPVKLKKNKVKKRYFNYLVFNHE 237 Query: 242 DE-VLLAQRPPSGLWGGLYCFP-----QFADEESLRQWLAQRQIAADNLTQLTAFR---- 291 +E +L QR G+W GLY FP + DE+ L + A + + +L F Sbjct: 238 NETTILEQRVGKGIWNGLYQFPLVESERLIDEKELVTFEAFKNSITGSGVELELFNEIPV 297 Query: 292 -HTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPP 329 H SH HL I W+ S G E + N+ P Sbjct: 298 IHKLSHQHL-ITRFWVVNS---GTASENSVKLKNVTDYP 332 >UniRef50_D1N1B7 A/G-specific adenine glycosylase n=2 Tax=Bacteria RepID=D1N1B7_9BACT Length = 356 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 116/298 (38%), Positives = 171/298 (57%), Gaps = 11/298 (3%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 A +L+WYD++ R LPW+ PY+VW+SE+MLQQT+V V PY++RF+ P + LA Sbjct: 11 APLLEWYDRHAR-ILPWRESPEPYRVWISEIMLQQTRVEAVKPYYDRFLKELPDLHALAE 69 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A ++L LW GLGYY R RNL KAA+ + + +GG+FP E + +LPG+G TAGAI S Sbjct: 70 ASEPQLLKLWEGLGYYNRVRNLQKAARVIVSEYGGEFPRDVETLRSLPGIGEYTAGAIAS 129 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 +S + P +DGNV RV++R A +V+ + QV PA F Q++M+L Sbjct: 130 ISFNRPEPAVDGNVLRVVSRLAASREDISSPKVKGAISDALRQVYPAGRCGDFTQSLMEL 189 Query: 188 GAMICT-RSKPKCSLCPLQNGCIAAANNSWA---LYPGKKPKQTLPERTGYFLLLQHEDE 243 GA +C P+C+ CPL C A + PG+KP++ P RT LL+ D Sbjct: 190 GATVCLPNGAPRCAECPLAELCAGRREGIAAGLPVKPGRKPRRIEP-RT--VFLLRCGDR 246 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESL---RQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + L +RP +GL GL+ FP A ++ +W+A + I A+ + +H F+H Sbjct: 247 IALLRRPGNGLLAGLWEFPNTAGSLTVAEAAEWMAGQGIFAERIIATGGAKHVFTHLE 304 >UniRef50_C3J8R7 A/G-specific adenine glycosylase n=2 Tax=Bacteria RepID=C3J8R7_9PORP Length = 357 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 132/354 (37%), Positives = 188/354 (53%), Gaps = 22/354 (6%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 Q + ++DW+++ R+ LPW+ + PY +WLSEV+LQQT+V YF RF+ FP+VTD Sbjct: 11 QLARALVDWFEENARE-LPWRTTRDPYHIWLSEVILQQTRVNQGHDYFLRFVEAFPSVTD 69 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A D+VL LW GLGYY+RA NLH+AAQ +AT H G FP+ F + ALPGVG TAGA Sbjct: 70 LAQASEDQVLSLWQGLGYYSRAHNLHRAAQVIATEHQGIFPKDFALIRALPGVGDYTAGA 129 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWS-LSEQVTPAVGVER 179 I S + +P +DGNV RV++R A + GK+ + L+ Q+ P+ + Sbjct: 130 IASFAFDMPYPAVDGNVLRVVSRLLASELPIDTLSGKQLCTQAVEELLATQLPPS----K 185 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 QA+++LGA++CT P+CS CP + C A L P K K TL +R ++ Sbjct: 186 LGQALIELGALVCTPQSPQCSACPASSWCRVAELPLARLLPIKGKKLTLRDRYFSYIYTT 245 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADE-----ESLRQWLAQRQIAADNL----TQLTAF 290 +VLL +R + +W GLY FP + + + WL + L T T Sbjct: 246 CHQQVLLEKRGRNDIWRGLYQFPLLESDSPQTPDEVAHWLEHELSYSGELRLAPTTFT-L 304 Query: 291 RHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 H SH L I + S + W N ++ L P+ R L+QL Sbjct: 305 EHRLSHQILHITIYTAELES--DAVGSTKYQWVNQSELGDYALPTPLVRFLEQL 356 >UniRef50_A8R947 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8R947_9FIRM Length = 348 Score = 210 bits (535), Expect = 5e-53, Method: Compositional matrix adjust. Identities = 119/323 (36%), Positives = 174/323 (53%), Gaps = 13/323 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q + Q+LDWY + R LPW+ TPY+VW+SE+MLQQT+V V PYFERF+A P + Sbjct: 8 EQLTKQLLDWYKQNAR-VLPWREKATPYRVWVSEIMLQQTRVEAVKPYFERFIATIPDIP 66 Query: 64 DLANAPLDEVL-HLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LAN DEVL LW GLGYY R N+ K AQ HGGK P T+ E+ LPG+G TA Sbjct: 67 TLANTE-DEVLAKLWEGLGYYRRVYNMKKCAQVCVEKHGGKLPNTYAELLELPGIGAYTA 125 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI S++ G+ +DGNV RV +R + K+ V+ + + + P FNQ Sbjct: 126 GAIASIAFGECVAAVDGNVLRVFSRVLVLEEDILKESVKRQYAKIVQMYIPKHESSAFNQ 185 Query: 183 AMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 A+M+LGA IC + P+C++CP+ + C P K K+T +L+++ Sbjct: 186 ALMELGATICVPNAAPRCNICPIADNCKGYQCGKAHYLPNKAAKKTRRIEKKTICVLRYK 245 Query: 242 DEVLLAQRPPSGLWGGLYCF---PQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF- 297 D+ + QR GL GLY F + ++++ ++ + N+ +L +H FSH Sbjct: 246 DKFHVRQRASQGLLAGLYEFDWLEGYQSKKAVSEYY--KMYVVKNIRKLPDAKHVFSHIE 303 Query: 298 -HLD--IVPMWLPVSSFTGCMDE 317 H+ +V + +P C E Sbjct: 304 WHMKGYLVEVEMPCEEGVWCTLE 326 >UniRef50_A9NG45 A/G-specific adenine DNA glycosylase n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NG45_ACHLI Length = 334 Score = 210 bits (535), Expect = 5e-53, Method: Compositional matrix adjust. Identities = 110/292 (37%), Positives = 169/292 (57%), Gaps = 7/292 (2%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++ DWY + ++ LP++ PY +W+SE+MLQQTQV T++P+++RF+ +PT+ DLA A Sbjct: 5 RLFDWY-QTNKRDLPFRKTDNPYHIWVSEIMLQQTQVDTMLPFYDRFLTIYPTIQDLARA 63 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 +DE+L + G+GYY R R LHK AQ V H GK PE + ++ +PG+G TAGAI+S+ Sbjct: 64 DIDEILKVVQGIGYYRRFRMLHKGAQYVIDHHDGKLPEDYFKILKIPGIGAYTAGAIMSI 123 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS-EQVTPAVGVERFNQAMMDL 187 + K +P DGNV RVL+R + + + KL ++ E + + + Q+MM+L Sbjct: 124 AFHKPYPATDGNVIRVLSRVKMLEDDFRLDKNKKKLNEMNKELIENSNNPYLYTQSMMEL 183 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA +C S P C CPLQ C+A NN YP P +T E + +L++++ L+ Sbjct: 184 GATVCKVSNPLCDTCPLQEVCLANINNVQQNYPKMSPLKTKKEIQYFTFILEYKEGFLMR 243 Query: 248 QRPPSGLWGGLYCFPQFADEESLRQWLAQRQ---IAADNLTQLTAFRHTFSH 296 +R L G Y F Q + +SL L Q Q + L +L +H F+H Sbjct: 244 KR-TEDLLHGFYEFVQI-ESDSLNGALTQAQDLGLEIQILDELQPIKHVFTH 293 >UniRef50_C4L361 A/G-specific adenine glycosylase n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L361_EXISA Length = 344 Score = 210 bits (535), Expect = 6e-53, Method: Compositional matrix adjust. Identities = 111/293 (37%), Positives = 164/293 (55%), Gaps = 11/293 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+ +++ W+++ ++ LPW+ K PY+VW+SEVMLQQT+V TVIPY+ RFM RFPT+ D Sbjct: 15 HFNEELVTWFNR-EKRDLPWRHAKNPYRVWVSEVMLQQTRVDTVIPYYNRFMERFPTLED 73 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A DEV+ W GLGYY+R RNLH+A ++VA+++ G PE E L GVG T GA Sbjct: 74 LAAADTDEVVKYWEGLGYYSRVRNLHEAVKEVASVYEGIVPEEKERFEKLKGVGPYTTGA 133 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS++ + P +DGNV RV++R + + + + ++ FN+ + Sbjct: 134 VLSIAYNQPEPAVDGNVMRVMSRQFGIYDDIAMPKTRKIFEQVVRRLMDPAHASDFNEGV 193 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA +CT P CSLCP+Q+ C A A++ P K K L ++ + +V Sbjct: 194 MELGATVCTPKNPMCSLCPVQDTCYAYAHHVQDELPVKTKKGAARIEMYDALCIEKDGKV 253 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 QR GL G++ +P D + Q N T L +H FSH Sbjct: 254 AYEQRADKGLLAGMWQYP-LMDRGTGEQL---------NGTYLGQVKHVFSHI 296 >UniRef50_Q03V36 A/G-specific DNA-adenine glycosylase n=2 Tax=Leuconostoc mesenteroides RepID=Q03V36_LEUMM Length = 342 Score = 210 bits (534), Expect = 7e-53, Method: Compositional matrix adjust. Identities = 127/343 (37%), Positives = 185/343 (53%), Gaps = 19/343 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +F +L+WYDK GR LPW+++ PY+V +SE+MLQQT+V TVIPYF+RFM PTV D Sbjct: 11 EFRKTLLNWYDKEGRAHLPWRVNHDPYRVLVSEIMLQQTRVDTVIPYFDRFMTSLPTVGD 70 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP D VL LW GLGYY+RARNL KAAQ V G++PE+ +++ +LPGVG TA A Sbjct: 71 LATAPEDVVLKLWEGLGYYSRARNLQKAAQFVVNELHGQWPESSDDLQSLPGVGPYTAAA 130 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER---FN 181 I S+S + P +DGN RV +R + + + + + P V ER FN Sbjct: 131 IASISFNEVVPAVDGNQYRVFSRLLKIDADIADTKSRKIFYDI---IAPIVDPERPGDFN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ T P P++ A + YP K KQ ++ + + + Sbjct: 188 QAIMDLGSSYMTTKNPDSLYSPVREFNAAFRDGVEDQYPVKTTKQRPVKQLFMASVFEKD 247 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 ++L +RP SGL G + FP L Q + I L + H F+H +I Sbjct: 248 GKLLFEKRPDSGLLAGFWTFP-------LTQIESIESITGQQLN-IKPVTHIFTHRRWEI 299 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 WL V T +++ N +++ + ++ L +LL++L Sbjct: 300 ---WL-VKQETAMLND-NQQYFSSDEWSALSLPKVQHKLLEKL 337 >UniRef50_B9E7Y8 A/G-specific adenine glycosylase homolog n=4 Tax=Staphylococcaceae RepID=B9E7Y8_MACCJ Length = 344 Score = 210 bits (534), Expect = 8e-53, Method: Compositional matrix adjust. Identities = 113/309 (36%), Positives = 170/309 (55%), Gaps = 17/309 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + QFS +LDW+ K ++ +PW+ K PYK+WLSEVMLQQTQV TV PY+ +F RFP Sbjct: 2 LNKQQFSQHLLDWFYK-NKREMPWRETKDPYKIWLSEVMLQQTQVNTVKPYYLKFTERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA+A +DEV W GLGYY+R RN H A ++V + G P E+ L GVG Sbjct: 61 DIRTLASAEIDEVTKYWEGLGYYSRVRNFHSAVKEVQESYNGVVPNNPEDFLKLKGVGPY 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARC----YAVSGWPGKKEVENKLWSLSEQVTPAVG 176 T GA++S++ P +DGNV RV +R + +S ++ E+K+ + + A G Sbjct: 121 TQGAVMSIAFNHQIPAVDGNVYRVFSRLDNDDFDISSSSARRHFEDKVMDV---IPKAAG 177 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 FN+A+M+LGA +CT P C CP+Q C + + L P K K L Sbjct: 178 --DFNEALMELGATVCTPKSPLCMFCPVQQHCESYEAGTVQLRPVKLKKIKKKTEHWKVL 235 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQF---ADEESLRQWLAQR-QIAADNLTQLTAFRH 292 ++ H+D++ + QRP +GL ++ FP F ++++ + L + AD L +H Sbjct: 236 VIMHDDKIYIEQRPSTGLLQSMWQFPMFNIDTHQDTIEEQLNMSLYVEADPFMSL---KH 292 Query: 293 TFSHFHLDI 301 F+H D+ Sbjct: 293 QFTHVTWDM 301 >UniRef50_Q1MS05 A/G-specific DNA glycosylase n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MS05_LAWIP Length = 363 Score = 209 bits (533), Expect = 9e-53, Method: Compositional matrix adjust. Identities = 107/302 (35%), Positives = 173/302 (57%), Gaps = 15/302 (4%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L+W+ K ++ LPW+ PY VW++EVM+QQTQ+ + YF R++ +FP + +A AP Sbjct: 8 LLNWFYK-NKRALPWRETYLPYHVWIAEVMMQQTQMDRGVEYFLRWIKQFPDIASVAYAP 66 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 +++L W GLGYY R R+L AAQ + + G FPE +E++ LPG+G TAGAI S + Sbjct: 67 EEKLLSAWEGLGYYRRVRHLQSAAQVIMHKYNGTFPERYEDILELPGIGPYTAGAIASTA 126 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 + FP +DGNV+RVL+R + ++ K+ ++KL+ L +Q+ P FNQ++M+LGA Sbjct: 127 FNQDFPCIDGNVERVLSRIFDINTHIKKEPTKSKLYDLVKQLMPKKNARDFNQSVMELGA 186 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYP---GKKPKQTLPERTGYFLLLQHEDEVLL 246 ++C + KP C +CP+ + C + N + P K P L TG +LQ ++ + Sbjct: 187 LVC-KKKPMCLICPVYSMCNSRINGTQLSRPVLAKKTPTTRLKMVTG---ILQCNQKIFI 242 Query: 247 AQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 QR + +WG L+ FP E +R W Q + +T H ++H+H+ Sbjct: 243 QQRLDNNIWGRLWEFPNGCIETGETPESAIIRNWNEQLGFSIQIENIITTIIHNYTHYHI 302 Query: 300 DI 301 + Sbjct: 303 TL 304 >UniRef50_C6ACA0 A/G-specific adenine glycosylase MutY n=6 Tax=Rhizobiales RepID=C6ACA0_BARGA Length = 352 Score = 209 bits (533), Expect = 1e-52, Method: Compositional matrix adjust. Identities = 111/268 (41%), Positives = 160/268 (59%), Gaps = 13/268 (4%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERF 55 + S+ +L WYD+ R LPW+I PY+VWLSE+MLQQT V TV PYF++F Sbjct: 3 EISSHLLAWYDQNHRH-LPWRITPEKQKQGIRPDPYQVWLSEIMLQQTTVETVKPYFKKF 61 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 + +P ++ LA A D+++ W GLGYY+RARNL K AQQ+ + G+FP++ +E+ L Sbjct: 62 LKLWPDLSSLAKASQDDIMKAWAGLGYYSRARNLKKCAQQLVENYAGQFPQSVKELRTLA 121 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TA AI S++ ++DGNV+RV+AR +A++ K +VE K ++++T Sbjct: 122 GIGDYTAAAIASIAFNHPVAVVDGNVERVVARLFAITSILQKAKVEIK--EKTQKITALN 179 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY- 234 F QAMMDLGA +CT KP C CPLQ C AA +P K PK+ P +TG Sbjct: 180 RPGDFAQAMMDLGATVCTPRKPSCYTCPLQCLCKAAKTQQQESFPVKAPKKERPSKTGIA 239 Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFP 262 F+ L + ++ L +R L GG+ P Sbjct: 240 FVALNKKRQIYLEKRQTQKLLGGMTQIP 267 >UniRef50_Q2G9R8 A/G-specific DNA-adenine glycosylase n=7 Tax=Alphaproteobacteria RepID=Q2G9R8_NOVAD Length = 359 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 125/300 (41%), Positives = 170/300 (56%), Gaps = 19/300 (6%) Query: 10 VLDWYDKYGRK----TLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +LDWYD + RK LP + + PY+VWLSEVMLQQT VA V PYFE+F +PTV DL Sbjct: 20 LLDWYDAHARKLPWRRLPGEARQDPYRVWLSEVMLQQTTVAAVGPYFEKFTRLWPTVGDL 79 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A +V+ W GLGYYARARNL A+ VA + GG FP++ + + ALPG+G TA A+ Sbjct: 80 AAADDGDVMAAWAGLGYYARARNLLACARAVAAM-GGTFPDSEDGLRALPGLGEYTAAAV 138 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +++ G+ ++D NV+RV+AR +A+ + +L + QVTP F QAMM Sbjct: 139 AAIAFGRRAVVVDANVERVIARLFAIDEPLPAGKAAIRL--AAGQVTPEERAGDFAQAMM 196 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA +CT P+C LCPL+ C A A + P K ++ P R G ++ E VL Sbjct: 197 DLGATVCTARSPRCMLCPLREHCRALAEGAPERLPVKAARKAKPVRQGRAYWIEREGRVL 256 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL----TAFRHTFSHFHLDI 301 L +RP G+ GG+ P W A R AD + RH F+HF L++ Sbjct: 257 LVRRPGRGMLGGMRALPDDG-------WSA-RGDGADAIGGEWRGGGVVRHGFTHFDLEL 308 >UniRef50_Q8CV61 A/G-specific adenine glycosylase n=1 Tax=Oceanobacillus iheyensis RepID=Q8CV61_OCEIH Length = 354 Score = 208 bits (529), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 117/310 (37%), Positives = 169/310 (54%), Gaps = 21/310 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F +L+WY ++ LPW+ + PYK W+SE+MLQQT+V TVIPYF RFM ++PTV DL Sbjct: 13 FQTDLLEWY-YLNKRDLPWRREPNPYKTWVSEIMLQQTKVDTVIPYFNRFMEKYPTVYDL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A +VL W GLGYY+RARNL A ++V + G+ P +E+A+L G+G T GAI Sbjct: 72 AKADEQDVLKSWEGLGYYSRARNLQTAVREVVDTYNGEIPNNEKELASLKGIGPYTKGAI 131 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS++ + P +DGNV RV +R + K+ + ++ ++ FNQA+M Sbjct: 132 LSIAFNQPVPAVDGNVLRVFSRILQIEDDIAKQSTKKEIEQYVGEIISHQDPSSFNQAIM 191 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED-EV 244 DLGA ICT KP C CP+ C A + P KK + + LL++++ E Sbjct: 192 DLGATICTPKKPTCMFCPVMEHCQAFQHGIQEQLPIKKKAKKQKIKQYVVLLIRNDQGEY 251 Query: 245 LLAQRPPSGLWGGLYCFPQFADEE----SLRQWL-------AQRQIAADNLTQLTAFRHT 293 ++ +R GL L+ FP +E + QW+ Q + DNL +HT Sbjct: 252 VIEKRSDQGLLANLWQFPMVPLDEVGIDQIEQWMYDKCGIDLQMESDCDNL------KHT 305 Query: 294 FSHF--HLDI 301 FSH +DI Sbjct: 306 FSHIIWEMDI 315 >UniRef50_C9LHH9 A/G-specific adenine glycosylase n=4 Tax=Bacteroidales RepID=C9LHH9_9BACT Length = 356 Score = 208 bits (529), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 115/320 (35%), Positives = 178/320 (55%), Gaps = 17/320 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +A F+ Q+ +WY ++ LPW+ K PY++W+SE++LQQT+V Y+ RF+ RFP+ Sbjct: 12 EAHYFAEQIEEWYLN-NKRELPWRDIKDPYRIWISEIILQQTRVVQGYDYYLRFIDRFPS 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A DEVL LW GLGYY+RARNL+ AA+ + + G+FP ++++ L GVG T Sbjct: 71 VEDLAKADEDEVLKLWQGLGYYSRARNLYSAAKSI--VERGEFPTNYKDIRMLKGVGDYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S + + ++DGNV RVLAR + ++ E + +L++ + ++N Sbjct: 129 AAAIASFAYNLPYAVVDGNVYRVLARYWGITTPIDTTEGKKLFAALAQNLLDKKNPAQYN 188 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MD GA+ C + P C+ CPLQ CIA S + P K K + R +L ++++ Sbjct: 189 QALMDFGALQCVPNNPNCNECPLQANCIAFLETSVSSLPIKAKKTAIRTRFLVYLYIRYK 248 Query: 242 DEVLLAQRPPSGLWGGLY--CFPQFADEESLRQWLAQRQIAADNLTQLTAFR-------H 292 D +LL +R +W GLY +F++ L I NL + AFR H Sbjct: 249 DRILLHRRERGDIWQGLYEPLLIEFSEHAELPSI-----IQKANLPKQAAFRLAVKEIKH 303 Query: 293 TFSHFHLDIVPMWLPVSSFT 312 +H L + + ++ FT Sbjct: 304 VLTHQVLIVDCYEVDLTKFT 323 >UniRef50_B0MYW3 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYW3_9BACT Length = 358 Score = 207 bits (528), Expect = 3e-52, Method: Compositional matrix adjust. Identities = 107/262 (40%), Positives = 163/262 (62%), Gaps = 9/262 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + ++++L+WY + GR LPW+ + PY++W+SEV+LQQT+VA + Y+ RF+ FP V Sbjct: 13 EVASRLLEWYGREGRD-LPWRRTRDPYRIWISEVILQQTRVAQGMSYYHRFLELFPDVAA 71 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA+AP D VL W GLGYY+RARNL AA+++ HGG FP + +V ALPGVG TA A Sbjct: 72 LASAPEDLVLKCWQGLGYYSRARNLLAAARRIVETHGGVFPTAYADVRALPGVGDYTAAA 131 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 I S++ + LDGNV RVL+R Y + G++ SL ++ P + + Sbjct: 132 ICSIAYEEPCAALDGNVFRVLSRLYDLDTPIDTTSGRRTFAALADSLIDRQRPGL----Y 187 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MD GA+ C ++P+C+ CPL++ C+A A + + P K+ + + R +L ++ Sbjct: 188 NQAIMDFGALCCLPAQPRCTECPLRDRCLAFAARTVDVRPVKQGRTAVEPRYFNYLHVEC 247 Query: 241 EDEVLLAQRPPSGLWGGLYCFP 262 DE++L +R +W GLY FP Sbjct: 248 GDELVLRRRGAGDIWQGLYEFP 269 >UniRef50_Q3SPD3 A/G-specific DNA-adenine glycosylase n=2 Tax=Nitrobacter RepID=Q3SPD3_NITWN Length = 415 Score = 207 bits (528), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 129/333 (38%), Positives = 182/333 (54%), Gaps = 44/333 (13%) Query: 10 VLDWYDKYGRKTLPWQ------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +L+WYD+ R+ LPW+ +D PY+VWLSE+MLQQT V TV PYFE+F+AR+P V Sbjct: 25 LLEWYDR-NRRLLPWRALPGEPVD--PYRVWLSEIMLQQTTVKTVGPYFEKFLARWPDVA 81 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A A LD++L +W GLGYY+RARNLH A +V HGG+FP+T E++ ALPG+G TA Sbjct: 82 AMARASLDDILRMWAGLGYYSRARNLHACAVKVLRDHGGRFPDTEEDLRALPGIGPYTAA 141 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAV-----------------------SGWPGKKEV 160 AI +++ + +DGN++RV++R +AV SG GK Sbjct: 142 AIAAIAFNRRTMPVDGNIERVVSRLFAVDEPLPKAKPRIHTLAATLLGPSRSGRDGKSRA 201 Query: 161 EN-KLWSLSEQVTPAVGVE----RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 + K + + + A V+ QA+MDLGA ICT KP C LCPL + C A A Sbjct: 202 GDVKTRAGRDGKSRASDVKTRAGDIAQALMDLGAAICTPKKPSCVLCPLSDDCAARARGD 261 Query: 216 WALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFA------DEES 269 +P K PK+ R G +++ EVL+ RP GL GG+ P D + Sbjct: 262 QETFPRKTPKKAGELRRGAAFVVRRGSEVLVRTRPAKGLLGGMTEVPTSTWLAAQDDTAA 321 Query: 270 LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L+Q + A + A H F+HF L++ Sbjct: 322 LKQAPCL-ESAPRWRRKAGAVTHVFTHFPLELA 353 >UniRef50_C9LTK5 A/G-specific adenine glycosylase n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTK5_9FIRM Length = 404 Score = 207 bits (528), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 127/311 (40%), Positives = 178/311 (57%), Gaps = 22/311 (7%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + +L W+ R LPW+ + TPY+VW+SE+MLQQT+V V PYFERF+A P V Sbjct: 17 DEIAEPLLRWFHSEKR-ALPWREEPTPYRVWVSEIMLQQTRVEAVKPYFERFVAALPDVR 75 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A + ++ LW GLGYY+RAR+L AA+ + + HGG+ P F+ + ALPG+GR TAG Sbjct: 76 ALARADENTLMKLWEGLGYYSRARHLQSAARLICSDHGGEIPAHFDGLLALPGIGRYTAG 135 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWS-LSEQVTPAVGVE 178 A+ S++ G+ P +DGNV RV+ R A + K+ VE L + L E Sbjct: 136 AVASIAFGERRPAVDGNVLRVIMRLLACPADILKESTKRAVEEALIARLPEDAG------ 189 Query: 179 RFNQAMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKK-PKQTLPERTGYFL 236 FNQA+M+LGA+IC R C CPL+ C+A N A P K PK+ E+ FL Sbjct: 190 NFNQALMELGALICLPRGAAHCPSCPLERLCLAKEANLQAELPQKTPPKRRRTEKLTIFL 249 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-----DNLTQLTAFR 291 L ++ D++ L +RP GL GL+ FP A E L++ A+ + A L L A + Sbjct: 250 LAKN-DKIALEKRPAQGLLAGLWGFP--AMEGHLKKKEAEEALQAIGLIPAKLHALPAAQ 306 Query: 292 HTFSHFHLDIV 302 H FSH +V Sbjct: 307 HIFSHITWQMV 317 >UniRef50_C2H8F6 A/G-specific adenine glycosylase n=16 Tax=Enterococcus RepID=C2H8F6_ENTFC Length = 392 Score = 207 bits (527), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 108/285 (37%), Positives = 161/285 (56%), Gaps = 9/285 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F Q + WY++ R LPW+ ++ PY++W+SE+MLQQT+V TVI YF RFM FPT Sbjct: 12 ETKEFQDQFIQWYEQEKR-NLPWRYNRDPYRIWISEIMLQQTRVDTVIDYFYRFMEWFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +LA AP +++L W GLGYY+RARN+ AA+Q+ + GK P+T EE+++L G+G T Sbjct: 71 IEELATAPEEKLLKAWEGLGYYSRARNIQAAAKQIMSEFDGKMPQTPEEISSLKGIGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GAI S++ G P +DGNV RV++R + + K ++ FN Sbjct: 131 TGAIASIAFGLPEPAVDGNVMRVVSRLFCIEADIAKASSRKIFDEAMRKIIDETYPGEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF--LLLQ 239 QAMMDLG+ ICT + PKC CP+Q C+A +P K K+ P+ Y L Sbjct: 191 QAMMDLGSAICTPTSPKCEACPIQAFCLANKRGIQTSFPV-KTKKAKPKNVYYISAALQN 249 Query: 240 HEDEVLLAQRPPSGLWGGLYCFP--QFADEESLR---QWLAQRQI 279 H +R L ++ FP + EE R +W A++++ Sbjct: 250 HSGAYYFEERDSQKLLANMWTFPMVEVTQEEYERLKKEWEAKQEV 294 >UniRef50_C6XRC5 HhH-GPD family protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XRC5_HIRBI Length = 361 Score = 207 bits (526), Expect = 6e-52, Method: Compositional matrix adjust. Identities = 123/307 (40%), Positives = 168/307 (54%), Gaps = 24/307 (7%) Query: 9 QVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 ++L WYD+ R+ LPW++ PY+VWLSE+MLQQT V PYF++F + Sbjct: 19 ELLQWYDQMARR-LPWRLSPQARKNGDMPDPYRVWLSEIMLQQTTVPHAAPYFDKFTRLW 77 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P V DLA A D+V+ W GLGYYARARNLHK A+ +A L FP T E + ALPGVG Sbjct: 78 PKVEDLAAAHNDDVMREWAGLGYYARARNLHKCAKVLAGLDA--FPNTIEALLALPGVGP 135 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAV------SGWP-GKKEVENKLWSLSEQVT 172 TA A+ S++ +DGN++RV++R A+ +GW KKE+ ++ +L Q + Sbjct: 136 YTAAAVGSIAFDLPVAPVDGNIERVISRLMAIAGDGSAAGWAQDKKEITQRVQTLVPQRS 195 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT 232 F QAMMDLGA +CT P C +CP + C+A + YP K ++ P R Sbjct: 196 -----GDFAQAMMDLGASVCTPKSPNCMICPWMDICLARQEGNQESYPAKPKRKPQPIRN 250 Query: 233 GYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRH 292 G ++ HE +VLL +RP SGL GG+ EE L Q + RH Sbjct: 251 GLAFVIFHEGKVLLQRRPDSGLLGGMLMPLSSPWEEGGEYTLEDYQPVKADWHYRGQARH 310 Query: 293 TFSHFHL 299 F+HF L Sbjct: 311 VFTHFAL 317 >UniRef50_D1W5W3 A/G-specific adenine glycosylase n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W5W3_9BACT Length = 369 Score = 206 bits (524), Expect = 9e-52, Method: Compositional matrix adjust. Identities = 115/299 (38%), Positives = 169/299 (56%), Gaps = 4/299 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M +S F+ +L W+ + GR ++PW+ PY +W+SEV+LQQT++ Y+ERFMARFP Sbjct: 33 MGSSPFTLALLRWFSENGR-SMPWRETTDPYAIWISEVILQQTRIQQGWAYWERFMARFP 91 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 V DLA A DEVL LW GLGYY+RARNLH AA+QV L G FP T E + AL GVG Sbjct: 92 KVEDLAAASEDEVLRLWQGLGYYSRARNLHHAAKQVVEL--GHFPNTMEGLKALKGVGDY 149 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI S++ G ++DGNV RVLAR Y + E + + +L++ + PA + Sbjct: 150 TAAAIGSIAFGLPVAVVDGNVYRVLARHYGIYTPINTTEGKKEFAALAQSLLPATEPSAY 209 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MD GA+ CT + P+C +CPL + C+A P K + R ++ ++ Sbjct: 210 NQAIMDFGAIQCTPTSPRCLICPLTDSCMALRTGKVDELPIKLKTLKIKTRKLVYVYVRW 269 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + + + +R +W GL+ P + E L + Q + + + R + F+L Sbjct: 270 QGKTAIHRRAAGDIWQGLWE-PLLYENEELPDFPGQLLLLKKGVKHVLTHRILQADFYL 327 >UniRef50_C6X0N8 A/G-specific adenine glycosylase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0N8_FLAB3 Length = 372 Score = 206 bits (524), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 109/288 (37%), Positives = 161/288 (55%), Gaps = 6/288 (2%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 +L WY+ +GR LP++ PY++W+ E++ QQT++A + +++ F+ARFP V LANA Sbjct: 16 NLLAWYNTHGRD-LPFRKTGDPYRIWVCEIIFQQTRIAQGLNHYQNFIARFPDVHTLANA 74 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 DEVL W GLGYY+RA NLHKAA Q+ +GG FP+ ++++ L GVG+ TA AI S+ Sbjct: 75 ETDEVLLYWKGLGYYSRALNLHKAAMQIIHEYGGIFPKNYDDILKLRGVGKYTAAAIASI 134 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 G H P +DGN RVL+R +A + N L+ ++ P FN+AMMDLG Sbjct: 135 CFGAHIPAVDGNFYRVLSRVFAEDFDVSNSKAFNYFSGLALRMMPQNKAGHFNEAMMDLG 194 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYP--GKKPKQTLPERTGYFLLLQHEDEVLL 246 + +C PKC +CPL C+A +P KK K T E T YF ++ + L+ Sbjct: 195 SEVCRPRNPKCEICPLNADCLAFNLGLIPKFPVKTKKTKPTDLELTYYF--VEFGERFLI 252 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 QR +W L+ FP E+ +Q + + + LT H + Sbjct: 253 RQRNEDHIWKKLFEFP-IRIPENFKQDITHHKTVSHQLTHKNLTIHIY 299 >UniRef50_A7VHH6 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VHH6_9CLOT Length = 365 Score = 206 bits (523), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 114/318 (35%), Positives = 167/318 (52%), Gaps = 11/318 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q + ++ WY ++ R LPW+ DK PY VWLSE+MLQQT+V V Y+ F+ PT+ Sbjct: 21 KQIADNLIVWYPEHAR-DLPWRKDKEPYHVWLSEIMLQQTRVEAVKEYYRTFLQELPTIE 79 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA D ++ LW GLGYY RARNL KAA +V G FP + E+ +LPG+G TAG Sbjct: 80 ALAEVDDDRLMKLWEGLGYYNRARNLKKAAGEVVIEWQGNFPAEYNEILSLPGIGEYTAG 139 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S+ P +DGNV RV R K+ V+ K+ QV + Q+ Sbjct: 140 AIGSICFDLPTPAVDGNVLRVYTRVMEDPSNIDKQAVKKKIREELLQVYRYGHCDMLTQS 199 Query: 184 MMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M++GA IC PKC +CPLQ C A ++SW YP ++ K+ L+L++ED Sbjct: 200 LMEVGATICLPNGAPKCEVCPLQELCKAHKHDSWQQYPVREAKKKRKVEEKAVLMLRYED 259 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQ---WLAQRQIAADNLTQLTAFRHTFSHFHL 299 +V + +R GL GL+ FP S ++ ++ + + + T + H FSH Sbjct: 260 KVAIRKRTEKGLLHGLWEFPNLPGSYSTQEILSYVTSKNLHPKEIWMETTYTHIFSHVE- 318 Query: 300 DIVPMWLPVSSFTGCMDE 317 W + + CM++ Sbjct: 319 -----WHMKAFYMECMEQ 331 >UniRef50_B8FVN1 A/G-specific adenine glycosylase n=2 Tax=Desulfitobacterium hafniense RepID=B8FVN1_DESHD Length = 401 Score = 205 bits (522), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 109/256 (42%), Positives = 150/256 (58%), Gaps = 1/256 (0%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 S++++ W+++ R LPW+ K PY +W+SEVMLQQTQV T IPY+ RFM RFPT++ LA Sbjct: 6 SSKLVQWFNQVKR-DLPWRRTKDPYAIWVSEVMLQQTQVVTAIPYYLRFMGRFPTLSHLA 64 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A +EVL LW GLGYY+RAR L + A+ V G+ P+ ++ + + GVG TA AI Sbjct: 65 EAEQEEVLELWRGLGYYSRARRLWEGARYVVETAEGRMPKDYQSLLHIKGVGEYTAAAIA 124 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S++ + P++DGNVKRVL+R K +V P FNQ MM+ Sbjct: 125 SIAYEEQVPVMDGNVKRVLSRILRWEEDVEKARSRRFFLEYLGEVIPGDCPGDFNQGMME 184 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA +CT P+C CPLQ C A +YP KK K+ L+L H+ +VLL Sbjct: 185 LGATVCTPKHPRCEQCPLQADCEGFALGDPQVYPVKKSKEKPGSAWRPTLILLHQGQVLL 244 Query: 247 AQRPPSGLWGGLYCFP 262 +RP +GL L+ FP Sbjct: 245 KKRPSTGLLADLWEFP 260 >UniRef50_Q3ANF7 Mutator mutT protein n=19 Tax=Cyanobacteria RepID=Q3ANF7_SYNSC Length = 396 Score = 205 bits (521), Expect = 2e-51, Method: Compositional matrix adjust. Identities = 129/322 (40%), Positives = 173/322 (53%), Gaps = 31/322 (9%) Query: 5 QFSAQVLDWYDKYGRKT---LPW-------------QIDKTPYKVWLSEVMLQQTQVATV 48 + SA +L W+ +GR+ PW Q+D PY +W++EVMLQQTQ+A Sbjct: 29 ELSACLLSWWQAHGRRDPVQKPWMFKPAGTWPEAVHQLD--PYGIWIAEVMLQQTQLAVA 86 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 +PY+ R+M FPTV LA A LDEV W GLGYY+R R LH+AAQ+ L G +P + Sbjct: 87 LPYWMRWMEAFPTVETLAAASLDEVRLQWQGLGYYSRVRRLHEAAQR---LVGRPWPRSL 143 Query: 109 EEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS 168 EE ALPG+GR+TAG+ILS + PILDGNVKRVLAR A + P + + W S Sbjct: 144 EEWMALPGIGRTTAGSILSSAFNLRLPILDGNVKRVLARLTAHARPPARDDA--LFWCWS 201 Query: 169 EQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 E + V NQA+MDLGA +CT +P C CP + C A A+ +P + L Sbjct: 202 EALLDPVRARDTNQALMDLGATLCTPRQPACHRCPWHSQCAAYASGDPCRWPVTNAPKPL 261 Query: 229 P-ERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIA 280 P + G ++L EVL+ QR GL GG++ FP E+ R+ + IA Sbjct: 262 PFQVIGVGVVLNAAGEVLIDQRLEEGLLGGMWEFPGGKQEQGETIETCIARELKEELGIA 321 Query: 281 ADNLTQLTAFRHTFSHFHLDIV 302 +L H +SH L V Sbjct: 322 VTVGAELITVDHAYSHKKLRFV 343 >UniRef50_C7PL91 A/G-specific adenine glycosylase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PL91_CHIPD Length = 354 Score = 204 bits (520), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 96/256 (37%), Positives = 153/256 (59%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L+W D +++PW+ +K PY++WLSE++LQQT+V Y+E+F+ +PTV +L Sbjct: 5 FTNALLEWNDNENTRSMPWKGEKDPYRIWLSEIILQQTRVEQGWAYYEKFILNYPTVQEL 64 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP + V LW GLGYYAR +N+ AA+Q+A+ + G FP T+E + +L GVG T+ AI Sbjct: 65 AAAPEEAVFRLWQGLGYYARCKNMLAAAKQIASQYHGHFPNTYETIQSLKGVGPYTSAAI 124 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + +LDGNV RVL+R + + + + L++++ P +NQ++M Sbjct: 125 ASFAFNLPHAVLDGNVFRVLSRFFDIDTPIDTTAGKKQFTDLAQELLPHGKSASYNQSIM 184 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA++C +P C CPL C AL P K K + +R Y+L+LQH++ V Sbjct: 185 DFGAVVCKPQQPACKSCPLAAKCKGYQQGLTALLPVKSKKLVIKKRYFYYLVLQHKENVY 244 Query: 246 LAQRPPSGLWGGLYCF 261 + +R + +W L+ F Sbjct: 245 IRKRTENDIWQNLHEF 260 >UniRef50_C6D1C3 A/G-specific adenine glycosylase n=4 Tax=Bacillales RepID=C6D1C3_PAESJ Length = 398 Score = 204 bits (520), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 110/263 (41%), Positives = 154/263 (58%), Gaps = 5/263 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++L WY + R LPW++++ PY+VW+SE+MLQQT+V TVIPY+ERFM +FPTV L Sbjct: 13 FSRELLTWYRRIKRD-LPWRMNQDPYRVWVSEIMLQQTRVDTVIPYYERFMNKFPTVRAL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP EVL W GLGYY+RARNL A++V +GG P+ VA L GVG T GAI Sbjct: 72 AEAPEPEVLKCWEGLGYYSRARNLQAGAREVVERYGGIVPDDKVAVAGLKGVGPYTTGAI 131 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S++ + P +DGNV RVL+R + + K + L+ + P FNQA+M Sbjct: 132 MSIAFNRPEPAVDGNVMRVLSRYFCLEDDIAKPATRVGIEKLAVSLIPEGAAGDFNQALM 191 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGK-KPKQTLPERTGYFLLLQ---HE 241 +LGA++CT P C CP+ C A P K K K PE+ ++ +E Sbjct: 192 ELGALVCTPKSPSCLPCPVMEHCEARLAGRETELPIKTKAKPPRPEKRVAAIIAGTGVYE 251 Query: 242 DEVLLAQRPPSGLWGGLYCFPQF 264 ++L+ QRP +GL ++ P Sbjct: 252 GKILVRQRPETGLLAQMWELPHL 274 >UniRef50_A8MJ42 A/G-specific adenine glycosylase n=2 Tax=Firmicutes RepID=A8MJ42_ALKOO Length = 544 Score = 204 bits (519), Expect = 3e-51, Method: Compositional matrix adjust. Identities = 111/302 (36%), Positives = 169/302 (55%), Gaps = 5/302 (1%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++L+WY K R LPW+ ++ PY+VWLSE+MLQQT+V TVI Y+ RF+ FPT+ LA A Sbjct: 200 KLLNWYQKNARD-LPWRKNQDPYRVWLSEIMLQQTRVDTVIDYYNRFLQAFPTIDALALA 258 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 + VL LW GLGYY+RARNLHK A+ + + G FPET EE+ LPG+G TAGAI S+ Sbjct: 259 DEERVLKLWEGLGYYSRARNLHKTAKIIVAQYEGNFPETHEELLKLPGIGSYTAGAIASI 318 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 S +DGNV RV++R +++V+ ++ + +V P F Q++M+LG Sbjct: 319 SFNLPVAAVDGNVLRVVSRITEDYRCIDEEKVKKEMGNQLAEVYPENQCGDFTQSLMELG 378 Query: 189 AMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 A IC P C+ CP C+A N++ P +K K R + + +V L Sbjct: 379 ATICLPNGAPLCNECPAIEICMANKNDTQIFLPVRKEKTARKTRELTVFVFVSQGKVALQ 438 Query: 248 QRPPSGLWGGLYCFPQ---FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 +R G+ GL+ P ++E+++ +L + + + H F+H D+V Sbjct: 439 KRTEKGVLEGLWELPNRDGMSEEDAVSLYLEELGTLEYRIEKQKKAHHIFTHIRWDMVCY 498 Query: 305 WL 306 ++ Sbjct: 499 YV 500 >UniRef50_C4G9U8 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9U8_9FIRM Length = 374 Score = 204 bits (519), Expect = 4e-51, Method: Compositional matrix adjust. Identities = 112/307 (36%), Positives = 171/307 (55%), Gaps = 11/307 (3%) Query: 11 LDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPL 70 +DW+ + ++ LPW+ D+ PY +W+SE+MLQQT+V V PY++ F+A P++ DLA P Sbjct: 15 MDWF-QVNKRDLPWRRDRDPYHIWVSEIMLQQTRVEAVKPYYQGFLAALPSIADLAACPQ 73 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 D++ LW GLGYY+R RN+ KAA+ + GG+ P ++ + +LPG+G TAGAI S++ Sbjct: 74 DQLNKLWEGLGYYSRVRNMQKAAEVIMEKFGGQMPSDYDAILSLPGIGPYTAGAISSIAF 133 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAM 190 P +DGNV R+LAR + + + E V PA G FNQAMM++GA Sbjct: 134 DLPAPAVDGNVLRILARVSEDDIDIKSDLAKKRAQAALEGVMPASGSGIFNQAMMEIGAT 193 Query: 191 ICT-RSKPKCSLCPLQNGCIAAANNSWALYPGK---KPKQTLPERTGYFLLLQHEDEVLL 246 C P C CP + C+A + SW P + KP++ + ERT ++++ V++ Sbjct: 194 ACLPNGDPLCGQCPWKKYCLANRHGSWDHLPVRSKGKPRR-IEERT--VFIVRNGSRVVI 250 Query: 247 AQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +RP GL GLY FP + EE+ + Q + +L RH FSH ++ Sbjct: 251 DRRPDRGLLAGLYEFPNVREHLSEEAALAYAEQMGYEPLRIRRLEEARHIFSHVEWRMIG 310 Query: 304 MWLPVSS 310 + VS Sbjct: 311 YEIRVSD 317 >UniRef50_C2MB00 A/G-specific adenine glycosylase n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2MB00_9PORP Length = 363 Score = 204 bits (518), Expect = 5e-51, Method: Compositional matrix adjust. Identities = 115/264 (43%), Positives = 160/264 (60%), Gaps = 12/264 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S ++ WY + RK LPW+ PY++WLSEV+LQQT++ Y+ RF+ +PTV+ Sbjct: 2 STLLTKLHSWYQENHRK-LPWRDIHDPYRIWLSEVILQQTRIDQGTSYYLRFVEHYPTVS 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA APLDEVL LW GLGYY+RARNLH+AAQ + GG FP ++ V ALPG+G TAG Sbjct: 61 DLAIAPLDEVLKLWEGLGYYSRARNLHRAAQLIVHELGGTFPADYQSVRALPGIGDYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVER 179 A+LS + + +P +DGNV RVL+R YA + GKK L E+ P G+ Sbjct: 121 AVLSFAYDQPYPAVDGNVLRVLSRLYASEEPIDTTQGKKYYTALARQLVEKA-PHPGLH- 178 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL-PERTGYFLLL 238 NQAM++LGA+ICT C+ CP+++ C +A + A P K K ++ P GY +L Sbjct: 179 -NQAMIELGALICTPQLCDCTRCPVRSECPSADSPERASLPRKALKLSVQPRHLGYLFVL 237 Query: 239 QH---EDEVLLAQRPPSGLWGGLY 259 + + + +L QRP +W LY Sbjct: 238 KRGLDQWQTILYQRPTGDIWAKLY 261 >UniRef50_B3QVZ4 A/G-specific adenine glycosylase n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QVZ4_CHLT3 Length = 360 Score = 203 bits (517), Expect = 6e-51, Method: Compositional matrix adjust. Identities = 104/299 (34%), Positives = 170/299 (56%), Gaps = 8/299 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L W+ ++ LPW+ +TPYK+W+SE+MLQQTQVATVIPY+ERF+ FP + LANA Sbjct: 7 LLSWF-LLNKRDLPWRKKRTPYKIWVSEIMLQQTQVATVIPYYERFLNAFPNLESLANAD 65 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 +++++ +W GLGYY R +N+ +AA+ + H G FP E+ L G+G TA I S++ Sbjct: 66 INKLMKIWEGLGYYTRVKNMQEAAKTILQKHNGVFPSKKTELLQLKGIGDYTAAIIASIA 125 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 +H +DGNV RV++R A++ + + +++++ FN+AMM++GA Sbjct: 126 FKEHCAAVDGNVLRVISRLNAINAPIQLNTTKQTIRIVAQELLSLEHPGEFNEAMMEVGA 185 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 +IC P C +CP+ C A P K + +P ++ +D VL+A R Sbjct: 186 LICKPKNPTCDICPISLHCQAYKKGLEHKLPVKLKRAEIPHYHIAAGVIYKDDFVLIALR 245 Query: 250 PPSGLWGGLYCFPQFADE--ESL-----RQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 P +GL G L+ FP + ESL R+ + + + L +L + +H ++HF + + Sbjct: 246 PANGLLGNLWEFPGGKQQQGESLEDCCKREIFEETGLHVNVLEKLISVKHAYTHFKITL 304 >UniRef50_B8DFN3 A/G-specific adenine glycosylase n=14 Tax=Listeria RepID=B8DFN3_LISMH Length = 362 Score = 202 bits (515), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 116/314 (36%), Positives = 172/314 (54%), Gaps = 19/314 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F ++ WY+ ++ LPW+ + PY++W+SE+MLQQT+V TVIPYF RFM +FPT Sbjct: 10 KITAFQEALVSWYEA-NKRVLPWRENTEPYRIWVSEIMLQQTKVDTVIPYFNRFMTQFPT 68 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + D A ++L W GLGYY+R RNL A +QV G+ P + +L GVG T Sbjct: 69 MEDFVQADEADILKAWEGLGYYSRVRNLQTAMKQVMADFSGEVPTDLTTILSLKGVGPYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVS----GWPGKKEVENKLWSLSEQVTPAVGV 177 AGAILS++ + P +DGNV RV+AR +S +K E L+ L ++ PA Sbjct: 129 AGAILSIAYNQAEPAVDGNVMRVIARVLEISEDIMKASTRKIFEEVLYQLIDKENPAA-- 186 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 FNQ +M++GA++CT +KP C LCPLQ C A N YP K K + ++ Sbjct: 187 --FNQGLMEIGALVCTPTKPMCMLCPLQPFCDAHKNGVETNYPVKIKKVKTKTKELLSII 244 Query: 238 LQHED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI----AADNLTQ---LTA 289 + ED +V + +RP +GL ++ FP + +A+ Q D L + + Sbjct: 245 VISEDGKVAIEKRPENGLLANMWQFPTIEISKKENDEVAKLQFLHNYGLDVLLEDEPIAH 304 Query: 290 FRHTFSHF--HLDI 301 +H FSH +DI Sbjct: 305 IKHVFSHLVWKMDI 318 >UniRef50_UPI0001910D7D adenine DNA glycosylase n=1 Tax=Salmonella enterica subsp. enterica serovar Typhi str. AG3 RepID=UPI0001910D7D Length = 139 Score = 202 bits (514), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 91/110 (82%), Positives = 96/110 (87%) Query: 214 NSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQW 273 SW+ YPGKKPKQTLPERTGYFLLLQH E+ LAQRPPSGLWGGLYCFPQFA E+ LR+W Sbjct: 30 ESWSRYPGKKPKQTLPERTGYFLLLQHNQEIFLAQRPPSGLWGGLYCFPQFASEDELREW 89 Query: 274 LAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWY 323 LAQR + ADNLTQL AFRHTFSHFHLDIVPMWLPVSS CMDEG ALWY Sbjct: 90 LAQRHVNADNLTQLNAFRHTFSHFHLDIVPMWLPVSSLDACMDEGGALWY 139 >UniRef50_C2KZ90 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KZ90_9FIRM Length = 391 Score = 202 bits (514), Expect = 1e-50, Method: Compositional matrix adjust. Identities = 111/305 (36%), Positives = 169/305 (55%), Gaps = 13/305 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L WY++ R+ LPW+ D PY WLSE+MLQQT+V V Y+ RF++ P + DL Sbjct: 19 FAKALLSWYNE-NRRILPWREDPGPYHTWLSEIMLQQTRVEAVKGYYSRFLSALPEIADL 77 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA ++VL LW GLGYY+RARNL KAA+ + T + G+ P+TF+E+ LPG+G TA AI Sbjct: 78 ANAEEEKVLKLWEGLGYYSRARNLQKAAKTIMTEYAGEMPKTFQELKKLPGIGEYTAAAI 137 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S+ + P LDGN+ R+ AR + + E + +S ++ + FNQA+M Sbjct: 138 ASIVYKEEIPALDGNLLRIFARLTSYPKVVLEPEGKKLAFSYFQEKMQGIDPGDFNQALM 197 Query: 186 DLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 DLG+ +C + + C CPL+ C +A + YP K+ L++H +E+ Sbjct: 198 DLGSGVCLPKGEIGCKTCPLKAFCSSAKAGNPMDYPKLPEKKKRKVEQYSIFLIRHREEL 257 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-------TQLTAFRHTFSHF 297 +L +R GL GLY F + S ++ L A +NL L +H F+H Sbjct: 258 MLKKRENKGLLAGLYEFYKLEGHCSEKEALE----AVENLGLRSLRIKALGEAKHIFTHK 313 Query: 298 HLDIV 302 +++ Sbjct: 314 EWEMI 318 >UniRef50_B4D777 HhH-GPD family protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D777_9BACT Length = 341 Score = 202 bits (513), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 121/337 (35%), Positives = 175/337 (51%), Gaps = 32/337 (9%) Query: 13 WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDE 72 W+ ++GR LPW+ PY + +SE MLQQTQV TV Y+ R++ RFP LA A + Sbjct: 27 WFRQHGRD-LPWRRTHDPYAIMVSEFMLQQTQVVTVRDYYARWLERFPDFNALAAASEAD 85 Query: 73 VLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGK 132 VLH+W GLGYYARARNLH+AA+QVA LH G+ P ++ALPGVGR TAGA+ + + + Sbjct: 86 VLHVWQGLGYYARARNLHRAAKQVADLHSGQLPNDLVAISALPGVGRYTAGAVATFAFDQ 145 Query: 133 HFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMIC 192 PI+D N+ RV+AR + K LW +E++ PA N A+M+LGA++C Sbjct: 146 ATPIIDANIARVIARLLDLQEPIDTKRGSEILWLTAEELLPAKSGRVHNSALMELGALLC 205 Query: 193 TRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQ---TLPERTGYFLLLQHEDEVLLAQR 249 T P+C +CP++ C + S P KKP+ L E + + D LL ++ Sbjct: 206 TPRAPQCPICPIREHCRTKSPES---LPRKKPRPKTIALAENCAWIV----NDGNLLLEQ 258 Query: 250 PPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH--LDIVPMWLP 307 W GL+ P + + L AF + F++ L + +P Sbjct: 259 QTGSRWRGLWKLPVIGESHPRNKL-------------LLAFDYPFTNHRVTLSVYASRVP 305 Query: 308 VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + S N W L S+ LAAP R +++L Sbjct: 306 MES------RPNQQWVPLTSIDSIALAAPHRRAIKRL 336 >UniRef50_A4VV79 A/G-specific DNA glycosylase n=98 Tax=Lactobacillales RepID=A4VV79_STRSY Length = 410 Score = 201 bits (512), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 108/264 (40%), Positives = 156/264 (59%), Gaps = 14/264 (5%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F +LDWYD ++ LPW+ K PY +W+SE+MLQQT+V TVIPY+ERF+ PT++DL Sbjct: 44 FRKALLDWYDA-NKRDLPWRRTKDPYAIWVSEIMLQQTRVDTVIPYYERFLHHLPTISDL 102 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP + +L LW GLGYY+R RN+ KAAQQ+ G+FP T +++L G+G TAGAI Sbjct: 103 AQAPEEVILKLWEGLGYYSRVRNMQKAAQQMVEDFDGQFPTTHAAISSLKGIGPYTAGAI 162 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVS---GWPGKKEVENKLWSLSEQVTPAVGVER--- 179 S++ P +DGNV RVL+R + V G P +++ + L + ER Sbjct: 163 SSIAFNLPEPAVDGNVMRVLSRLFEVDYDIGLPANRKIFQAMMEL------LIDSERPGD 216 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT-LPERTGYFLLL 238 FNQA+MDLG+ I + P+ P++ A N + YP K PK+ +P FL+ Sbjct: 217 FNQALMDLGSDIESPVNPRPQDSPVKAFSAAYLNGTMDKYPIKAPKKKPIPVAYQGFLIR 276 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFP 262 +++ LL + +GL G + FP Sbjct: 277 NKDNQFLLEKNNEAGLLSGFWSFP 300 >UniRef50_Q11NU4 A/G-specific adenine glycosylase n=5 Tax=Bacteroidetes RepID=Q11NU4_CYTH3 Length = 355 Score = 201 bits (512), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 105/301 (34%), Positives = 174/301 (57%), Gaps = 13/301 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F +++ WY K ++ LPW+ + PY +WLSE++LQQT+V +PY+ F+ FPTV Sbjct: 8 KKFPKELIKWYLK-NKRELPWRDTRDPYPIWLSEIILQQTRVQQGMPYYFSFLKAFPTVK 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A A ++L LW GLGYY+RARNLHK A QV + GG FP +++E+ L GVG TA Sbjct: 67 HMAKASEKDILSLWQGLGYYSRARNLHKTALQVMSQFGGSFPGSYKELLDLKGVGPYTAA 126 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S + + ++DGNV RVL+R + + + + +L++Q+ P + +NQA Sbjct: 127 AIASFAYKEQVAVVDGNVYRVLSRVFGIYEDITQNSSKKTFAALAQQLIPQKDPDIYNQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+ GA+ CT ++PKC C C+A + P K ++ ER + ++ + + Sbjct: 187 IMEFGALHCTPAEPKCGDCCFAEICVAHLTGDQRVLPVKAKQKAKQERYMTYFMITNNNT 246 Query: 244 VLLAQRPPSGLWGGLYCF-----PQFAD-EESLRQW--LAQRQIAADNLTQLTAFRHTFS 295 VL+ +R + +W GL+ F P+ A E+L+ L ++ I+ +++T ++H S Sbjct: 247 VLMKERGSNDIWQGLFEFLLVETPKAASLTEALKLVPDLDKKNISGESIT----YKHILS 302 Query: 296 H 296 H Sbjct: 303 H 303 >UniRef50_C6LA88 A/G-specific adenine glycosylase n=20 Tax=Clostridiales RepID=C6LA88_9FIRM Length = 365 Score = 201 bits (512), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 115/300 (38%), Positives = 169/300 (56%), Gaps = 19/300 (6%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L W+D R LPW+ TPY+VW+SE+MLQQT+V V P+F+RF P V LA Sbjct: 14 LLAWFDANAR-VLPWRDSPTPYRVWVSEIMLQQTRVEAVKPFFQRFTEALPDVAALAACE 72 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 +++L LW GLGYY R RN+ KAAQ V +GG+ P +E++ L G+G TAGAI S++ Sbjct: 73 EEKLLKLWEGLGYYNRVRNMQKAAQTVMEEYGGELPADYEKLLKLKGIGSYTAGAIASIA 132 Query: 130 LGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 P +DGNV RV++R A + KKEVE+++ ++ P FNQA+M Sbjct: 133 FQIPVPAVDGNVLRVISRITASEKDILKASVKKEVEDEI----REIIPPERAGAFNQALM 188 Query: 186 DLGAMICTRSKP-KCSLCPLQNGCIAAANNSWALYPGK---KPKQTLPERTGYFLLLQHE 241 +LGA++C + P KC CPL C+A + P K KP++ + ERT L+++ Sbjct: 189 ELGAVVCVPNGPAKCDACPLYGQCLARERGIVSSLPKKSAAKPRR-VQERT--VLIIRDG 245 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + V + +RPP GL GLY P +E + + L ++ + + L +H FSH Sbjct: 246 ERVAIHKRPPKGLLAGLYELPNVEGHLGQEEVLKLLKKKHFSPIRIQPLEEAKHIFSHVE 305 >UniRef50_A6NQR4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NQR4_9BACE Length = 355 Score = 201 bits (512), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 119/300 (39%), Positives = 158/300 (52%), Gaps = 12/300 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q +L WY + R LPW+ D TPY VW+SE+MLQQT+VA V+ Y+ RFM P V Sbjct: 2 EQLPIPLLAWYHENAR-VLPWRSDPTPYHVWVSEIMLQQTRVAAVMGYYSRFMEALPDVA 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA D ++ LW GLGYY+RARNL KAA QV +GG P ++EE+ L GVG TAG Sbjct: 61 ALAAVEDDTLMKLWQGLGYYSRARNLKKAAGQVMERYGGAIPASYEELLTLAGVGEYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ G P +DGNV RV+AR G + ++ +++ P FNQA Sbjct: 121 AISSIAFGIPVPAVDGNVLRVVARIAGDEGDITLPATKKRMGQALQEIIPTAMPGAFNQA 180 Query: 184 MMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT--LPERTGYFLLLQH 240 MM+LGA +C P C CP C A + P K PK+ + ERT Y + H Sbjct: 181 MMELGATVCLPNGAPLCDRCPAAGFCAALIQDKINSLPVKAPKKARRVEERTVYLIF--H 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEES--LRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 E V L +RP GL GL+ +P E++ L W I + +H F+H Sbjct: 239 EGRVALRRRPDKGLLAGLWEYPNELAEKTGILESW----GIVPASQRWGGTGKHIFTHIE 294 >UniRef50_UPI00016C4021 A/G-specific adenine glycosylase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4021 Length = 375 Score = 201 bits (511), Expect = 3e-50, Method: Compositional matrix adjust. Identities = 117/307 (38%), Positives = 171/307 (55%), Gaps = 15/307 (4%) Query: 9 QVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 ++L+W+DK+ R LPW+ + Y+VW+SEVMLQQT VA V+PYFERF+A FP V Sbjct: 18 KLLNWFDKHQRD-LPWRRTVNGARDAYQVWVSEVMLQQTTVAAVVPYFERFLAAFPDVRA 76 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A VL LW GLGYY RAR+LH AA+ + H G P+ + LPGVGR GA Sbjct: 77 LAAADEQRVLKLWEGLGYYRRARHLHAAAKLLVEAHNGDLPDDPDVWEPLPGVGRYILGA 136 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS + + PI++ N RVLAR +A G P + E + +W+ +E V PA FNQ++ Sbjct: 137 VLSQAFDRPLPIVEANSLRVLARLFAYPGDPREGEGKVWVWAAAETVLPAKRAGDFNQSL 196 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA++CT + P C CP+++ C A P KK + + E ++++ V Sbjct: 197 MELGALVCTPTAPACDRCPVRDNCEAKRLGRQDQIPPKKKQPAITEVAEVGVVIRDGANV 256 Query: 245 LLAQRPP-SGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT--------QLTAFRHTFS 295 LL QRP +G W ++ P E A ++A + LT ++ RH + Sbjct: 257 LLCQRPANAGRWQNMWEIPHAPRTEGEDVSAAAVRVARE-LTGFDVEPGAEIRTVRHGVT 315 Query: 296 HFHLDIV 302 + + +V Sbjct: 316 RYAITLV 322 >UniRef50_UPI000196B020 hypothetical protein CATMIT_00928 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B020 Length = 333 Score = 201 bits (511), Expect = 4e-50, Method: Compositional matrix adjust. Identities = 106/294 (36%), Positives = 163/294 (55%), Gaps = 15/294 (5%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++++WY K ++ LP++ PYK+W+SE+MLQQT V VIPY+ RF+ RFP V L++A Sbjct: 6 ELINWY-KENKRVLPFRDIDDPYKIWVSEIMLQQTTVTAVIPYYNRFIERFPDVKTLSDA 64 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 ++EV W GLGYY RA +LHK+AQ + FP+ +E++ +L GVG TA AI+S Sbjct: 65 SIEEVYKYWEGLGYYRRALHLHKSAQMIE----NHFPKDYEDILSLVGVGPYTASAIMSF 120 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 + K + +DGN RVL+R YA+ + K+ + ++ + NQ +MD Sbjct: 121 AYHKPYIAVDGNALRVLSRLYAIEDNIASNKTVKKITEIGNKLVVGYDSAQINQGIMDFA 180 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYP---GKKPKQTLPERTGYFLLLQHEDEVL 245 IC P C CPLQ+ C A + N + P K K+++ TG ++ +EDEV+ Sbjct: 181 NAICLPVHPHCEECPLQSHCQAYSTNKQEVLPINIKKVNKKSISYITG---IVVYEDEVM 237 Query: 246 LAQRPPSGLWGGLYCFPQFADEES---LRQWLAQRQIAADNLTQLTAFRHTFSH 296 L + +GL +Y PQ+ E + + I + L+ L F+H FSH Sbjct: 238 LIKN-KNGLLENMYLLPQYEVESPYSFIEAFKENYGIELEALSHLKDFKHVFSH 290 >UniRef50_Q9UIF7 A/G-specific adenine DNA glycosylase n=51 Tax=Chordata RepID=MUTYH_HUMAN Length = 546 Score = 201 bits (510), Expect = 5e-50, Method: Compositional matrix adjust. Identities = 131/361 (36%), Positives = 181/361 (50%), Gaps = 64/361 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ--------IDKTPYKVWLSEVMLQQTQVATVIPYFE 53 + + F +L WYD+ R LPW+ +D+ Y VW+SEVMLQQTQVATVI Y+ Sbjct: 89 EVTAFRGSLLSWYDQEKRD-LPWRRRAEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYT 147 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 +M ++PT+ DLA+A L+EV LW GLGYY+R R L + A++V GG P T E + Sbjct: 148 GWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQ 207 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 LPGVGR TAGAI S++ G+ ++DGNV RVL R A+ P V +LW L++Q+ Sbjct: 208 LLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLV 267 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA---------------------- 210 FNQA M+LGA +CT +P CS CP+++ C A Sbjct: 268 DPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVE 327 Query: 211 -AANNSWAL------------------YPGKKPKQTLPERTGYFLLLQHED----EVLLA 247 A N+ +P K ++ E + +L+ ++LL Sbjct: 328 ECAPNTGQCHLCLPPSEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLV 387 Query: 248 QRPPSGLWGGLYCFPQFA---DEESLRQWLAQR------QIAADNLTQLTAFRHTFSHFH 298 QRP SGL GL+ FP E+ R+ L Q + A +L L HTFSH Sbjct: 388 QRPNSGLLAGLWEFPSVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIK 447 Query: 299 L 299 L Sbjct: 448 L 448 >UniRef50_C6HWU0 Putative A/G-specific DNA glycosylase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HWU0_9BACT Length = 359 Score = 200 bits (508), Expect = 7e-50, Method: Compositional matrix adjust. Identities = 136/345 (39%), Positives = 187/345 (54%), Gaps = 23/345 (6%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 A +L WYD G + LPW+ PY++W+SE+MLQQT VATV+ +++RF++RFP V LA Sbjct: 2 APLLAWYDA-GHRDLPWRQGCDPYRIWVSEIMLQQTTVATVLKFYDRFLSRFPDVGALAG 60 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A L EVL W+GLGYY RARNLH+AA+ VA G FPE+ A LPG+GRSTAGAI S Sbjct: 61 ADLTEVLRFWSGLGYYQRARNLHRAARIVA--ERGSFPESLAGWAELPGIGRSTAGAIFS 118 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV---TPAVGVERFNQAM 184 +S PILD NV+RV+ R +AV +K+ E +LW LS+ P G NQA+ Sbjct: 119 ISRNLWAPILDANVRRVVERFFAVG--KEEKKREARLWELSDSFGRENPRPG--DTNQAL 174 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIA---AANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 M+LGA +C + P+CS+CPL++ C + L ++ ++ P R +LL E Sbjct: 175 MELGATVCLPASPRCSICPLRSSCRSRDPEPERVILLPERRRERRPAPPRRDRLVLLPAE 234 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT--QLTAFRHTFSHFHL 299 +L R L GL+ A L A +L HT+SHF Sbjct: 235 GPLLFVPRGEGRLLEGLFDLYGIAASSIPPGALLDEGPYAGFRVAEELFTVTHTYSHFR- 293 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 ++V + V TG EG A +G+ P+ R L+ L Sbjct: 294 EVVHVVGLVGRGTGSAGEG-------AMAGDLGVVLPLSRALEGL 331 >UniRef50_Q02CA8 A/G-specific DNA-adenine glycosylase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02CA8_SOLUE Length = 337 Score = 199 bits (507), Expect = 9e-50, Method: Compositional matrix adjust. Identities = 119/339 (35%), Positives = 177/339 (52%), Gaps = 31/339 (9%) Query: 13 WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDE 72 WY + + LPW+ + PY+VW+SE+MLQQT+ VIPY+E+F+ARFP+V LA AP +E Sbjct: 13 WY-RAAHRDLPWRSSRDPYRVWISEIMLQQTRAQAVIPYYEKFLARFPSVESLAAAPEEE 71 Query: 73 VLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGK 132 VL LW+GLGYY+RARNL AA+Q+ G FP ++ + ALPG+G TA AI S++ G Sbjct: 72 VLALWSGLGYYSRARNLRNAARQITAAAG--FPREYDAIRALPGIGDYTAAAIASIAFGL 129 Query: 133 HFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE----RFNQAMMDLG 188 +LDGNV RV+AR + G + ++++Q A V+ FNQA+M+LG Sbjct: 130 PHAVLDGNVLRVVARVENDAADIGSSRTRERFRAIAQQWLEAKEVKHDPGHFNQALMELG 189 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 A +C +P C +CP+ C A + A P K KQ +G L+++ +LL + Sbjct: 190 ATVCVPRRPLCLVCPISACCRARQEGTAAQLPVKLRKQEPVRISGVLLVVRQRGRILLRR 249 Query: 249 RPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH----FHLDIVP 303 R S G + P EE + +R + RHT +H F + Sbjct: 250 REDSQSRMAGFWELP--TPEELPGSEVGER---------IGEIRHTITHHRYTFEVRAAT 298 Query: 304 MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 +P + G W+ +AQ + + + LQ Sbjct: 299 AAVPAT--------GGFAWFTIAQLAEIPFSTTARKALQ 329 >UniRef50_C6XY88 Helix-hairpin-helix motif protein n=4 Tax=Sphingobacteriaceae RepID=C6XY88_PEDHD Length = 349 Score = 199 bits (506), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 114/304 (37%), Positives = 165/304 (54%), Gaps = 20/304 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F +++++WY + R LPW+ Y +WLSEV+LQQT+V +PYF F+ +PTV D Sbjct: 3 FQSEIVNWYLNHKRD-LPWRGTTDAYIIWLSEVILQQTRVDQGLPYFNNFLQNYPTVLDF 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +VL LW GLGYY+R RN+ A+QV LHGG FP ++++ L G+G TA AI Sbjct: 62 ASASETQVLKLWQGLGYYSRGRNMLFTARQVRDLHGGVFPVRYDQLIKLKGIGEYTAAAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 S S + +LDGNV RVL+R + ++ GKK+ + SL P+V +N Sbjct: 122 ASFSSNESKAVLDGNVFRVLSRYFGIESPINSSTGKKQFADLAQSLISGQQPSV----YN 177 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF--LLLQ 239 QA+M+ GA+ C P C +CP+Q+ C A ++ P K K L +RT YF L Sbjct: 178 QAIMEFGALQCKPKSPNCGICPVQDSCFAQKHHLVGTLPVKLNK--LKKRTRYFNYFLCM 235 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQ-------FADEESLRQWLAQRQIAADNLTQLTAFRH 292 D +L+ +R P +W LY FP D E L + AA + L+ +H Sbjct: 236 EGDNILVKKRSPGDIWQELYDFPLIETDRPFLEDPEKFAPLLQESFGAACKVRTLSHQKH 295 Query: 293 TFSH 296 +H Sbjct: 296 LLTH 299 >UniRef50_B9Y8G7 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8G7_9FIRM Length = 397 Score = 199 bits (505), Expect = 1e-49, Method: Compositional matrix adjust. Identities = 114/296 (38%), Positives = 164/296 (55%), Gaps = 20/296 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F ++ WY K + LP++ + PY +W+SE+M QQT++ +++PY++R+M R+PTV L Sbjct: 18 FQEELCAWY-KIHHRELPFRQSRDPYAIWVSEIMAQQTRIDSMLPYYKRWMERWPTVEAL 76 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP++++LH+W GLGYY RAR LH A+ V +GG P E++ +PG+G TAGAI Sbjct: 77 AEAPIEDILHVWQGLGYYNRARKLHAGAKVVVERYGGLLPADVEQLRTIPGIGFYTAGAI 136 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLW----SLSEQVTPAVGVERFN 181 S++ G P +DGNV RV R KK + +W E PAV F Sbjct: 137 GSIAYGLRAPAVDGNVLRVTTRVLQYGEDITKKTTADYVWRQVYDWMEGSNPAV----FT 192 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QAMM++GA++CT P+C LCPL C A + SW+ YP KK + E Y ++++ Sbjct: 193 QAMMEIGALVCTPKNPQCLLCPLAPFCGAGQDGSWSQYPVKKKAKPPLELQLYTYWIEND 252 Query: 242 -DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 ++LL+ GL GLY PQ A L Q D L +H FSH Sbjct: 253 RRQILLSDDWSDGLMEGLYRLPQAA--APLEQ--------TDQLDFAGQRKHVFSH 298 >UniRef50_O31584 Probable A/G-specific adenine glycosylase yfhQ n=8 Tax=Bacillaceae RepID=YFHQ_BACSU Length = 369 Score = 199 bits (505), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 117/310 (37%), Positives = 173/310 (55%), Gaps = 18/310 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 QF ++ W+++ R LPW+ D+ PYKVW+SEVMLQQT+V TVIPYF RF+ +FPTV Sbjct: 15 QFRDDLISWFEREQR-VLPWREDQDPYKVWVSEVMLQQTRVETVIPYFLRFVEQFPTVEA 73 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA+A ++VL W GLGYY+R RNL A ++V +GG P ++ L GVG T GA Sbjct: 74 LADADEEKVLKAWEGLGYYSRVRNLQSAVKEVKQEYGGIVPPDEKDFGGLKGVGPYTKGA 133 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAV---SGWPGKKEV-ENKLWSLSEQVTPAVGVERF 180 +LS++ K P +DGNV RV++R ++ P + + E+ + + + P+ F Sbjct: 134 VLSIAYNKPIPAVDGNVMRVMSRILSIWDDIAKPKTRTIFEDAIRAFISKEKPS----EF 189 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQ +M+LGA+ICT P C LCP+Q C A + P K K+ +T ++L Sbjct: 190 NQGLMELGALICTPKSPSCLLCPVQQHCSAFEEGTERELPVKSKKKKPGIKTMAAIVLTD 249 Query: 241 ED-EVLLAQRPPSGLWGGLYCFPQFADEESLR----QWLA----QRQIAADNLTQLTAFR 291 ED +V + +RP GL L+ FP ++ ++ Q +A + I AD Sbjct: 250 EDGQVYIHKRPSKGLLANLWEFPNLETQKGIKTEREQLIAFLENEYGIQADISDLQGVVE 309 Query: 292 HTFSHFHLDI 301 H F+H +I Sbjct: 310 HVFTHLVWNI 319 >UniRef50_C8US26 A/G-specific DNA glycosylase n=18 Tax=Lactobacillales RepID=C8US26_LACRG Length = 372 Score = 198 bits (504), Expect = 2e-49, Method: Compositional matrix adjust. Identities = 122/309 (39%), Positives = 172/309 (55%), Gaps = 16/309 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F +LDWYD + R TLPW+ D PY V +SE+MLQQTQV TVIPY+ RFMA+FPT Sbjct: 14 KVAAFQHALLDWYDHHAR-TLPWRQDHDPYHVMVSELMLQQTQVQTVIPYYNRFMAQFPT 72 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA AP VL W GLGYY+RAR L +AA+Q+ +GGK+P+T E+ L G+G T Sbjct: 73 VEALAAAPEATVLKAWEGLGYYSRARRLQQAAKQIVNDYGGKWPQTAAELQTLAGIGPYT 132 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S+S G+ P +DGN RV AR + V + + L + P FN Sbjct: 133 AGAIASISFGEVVPAIDGNAFRVFARLFKVDADIARPQTRKVFDDLIRPLMPKERPGDFN 192 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF-LLLQH 240 QA+MDLG+ + S P + P++ + + YP K K+ P YF L+++ Sbjct: 193 QAVMDLGSSYMSASHPDPAHSPVRAFDASFRDGVVQDYPVKT-KKPRPVIHRYFALVIRS 251 Query: 241 EDEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIAAD--NLTQLT-------- 288 + LL QRP G+ L+ FP A+ E+ + +I+A +L+ +T Sbjct: 252 KAGYLLEQRPGKGMLADLWMFPLIDMAELEATMESEQLDEISARFADLSGMTLTFADLGR 311 Query: 289 -AFRHTFSH 296 +HTF+H Sbjct: 312 PQVQHTFTH 320 >UniRef50_Q04T89 A/G-specific DNA glycosylase n=4 Tax=Leptospira RepID=Q04T89_LEPBJ Length = 372 Score = 197 bits (502), Expect = 4e-49, Method: Compositional matrix adjust. Identities = 98/258 (37%), Positives = 147/258 (56%), Gaps = 2/258 (0%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L W+ K RK LP++I+K Y++W+SE+MLQQT+VA ++P +E F+ RFP Sbjct: 12 ELRENLLSWFQKNKRK-LPFRINKNAYRIWVSEIMLQQTRVAAMLPIYETFLKRFPDPKA 70 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 L +A +EV+ W GLGYY+RARNL K A+ + + G+FPE +EE ++PGVG TA A Sbjct: 71 LQDASEEEVMKYWKGLGYYSRARNLKKGAELLVEKYEGRFPEDYEEALSIPGVGSYTASA 130 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS++ GK + +LDGNVKRVL+R + + P L L+++ N+AM Sbjct: 131 VLSIAYGKPYAVLDGNVKRVLSRLFLIESDPNSNSTNQVLADLAQKFLTPGDPGNHNEAM 190 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA++C P CS CP +N C A P K + + FL L+ + Sbjct: 191 MELGALVCI-PVPNCSACPFENHCEAKGAGKEKEIPVTKSVENWVDLDLNFLFLKSSGRI 249 Query: 245 LLAQRPPSGLWGGLYCFP 262 LL + + +Y P Sbjct: 250 LLVKYTTRRFFKTIYSLP 267 >UniRef50_C0BLH9 A/G-specific adenine glycosylase n=2 Tax=Flavobacteria RepID=C0BLH9_9BACT Length = 350 Score = 197 bits (500), Expect = 6e-49, Method: Compositional matrix adjust. Identities = 108/280 (38%), Positives = 151/280 (53%), Gaps = 4/280 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ WY +++LPW+ K PY +WLSE++LQQT+VA PY+ +F FPTV DL Sbjct: 3 FSQEIIHWYQA-NKRSLPWRETKDPYHIWLSEIILQQTRVAQGTPYYVKFTDAFPTVFDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP ++VL LW GLGYY+RARNLH AAQ V G FP ++ + L GVG TA AI Sbjct: 62 AAAPEEQVLKLWQGLGYYSRARNLHAAAQYVVNELNGVFPANYKALLQLKGVGDYTASAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S+ +LDGNV RVL+R + + E +++ +NQA+M Sbjct: 122 ASICFNSPEAVLDGNVYRVLSRYFGIDVPVNTTEGARLFKAIALDQLDLANPSDYNQAIM 181 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT-LPERTGYFLLLQHEDEV 244 + GA+ CT C CPLQ+ C+A + P K PKQ + YF+ + Sbjct: 182 EFGALQCTPKLTDCPSCPLQSTCVAYRTQKVTVLPVKLPKQKPVKVYHYYFVFVDPFGNT 241 Query: 245 LLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIAAD 282 L +R G+W GLY FP F+ ++ L + A D Sbjct: 242 LYEKRTGKGIWEGLYQFPLATFSAPRTVEGLLTSMETAID 281 >UniRef50_D1U9Y5 A/G-specific adenine glycosylase n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U9Y5_9DELT Length = 369 Score = 196 bits (499), Expect = 8e-49, Method: Compositional matrix adjust. Identities = 113/281 (40%), Positives = 166/281 (59%), Gaps = 10/281 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + F++ +LDWYD GR LPW+ + PY+VW+SE+M QQTQ+ V+ YF+R+MAR+P Sbjct: 1 MDGNLFTSHLLDWYDANGRD-LPWRREPNPYRVWISEIMAQQTQLDRVVGYFDRWMARYP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA A ++VL LW GLGYY+RARN+ K+A +A HG FP + ALPGVG Sbjct: 60 DLQSLALAREEDVLKLWEGLGYYSRARNILKSASVLAHAHGCVFPSDPIAIRALPGVGAY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ S++ G P +D NV RV AR + + V + + P F Sbjct: 120 TAGAVASIAFGLCEPAVDANVLRVFARLLDLDAPVAETGVRQTVERTVRALIPEDRPGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN---NSWALYPGKKPKQTLPERTGYFLL 237 NQA+M+LGA++C + +P+C CP++ C+A A +S + P + + TG +L Sbjct: 180 NQALMELGALVCAK-RPRCGECPVRAHCLAHARGVADSRPVLPPSRKAIRIEMATG--VL 236 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADE--ESLRQWLAQ 276 ++H VL+ +R P +W GL+ FP E E+ RQ LA+ Sbjct: 237 VRH-GRVLIQKRRPGDVWPGLWEFPGGCVEPGETPRQALAR 276 >UniRef50_C9PXZ8 A/G-specific adenine glycosylase n=2 Tax=Prevotella RepID=C9PXZ8_9BACT Length = 403 Score = 196 bits (497), Expect = 1e-48, Method: Compositional matrix adjust. Identities = 110/279 (39%), Positives = 156/279 (55%), Gaps = 20/279 (7%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +L W+ +GR +LPW+ K PY +WLSEV+LQQT+V+ + Y++RFM +PTV L Sbjct: 7 FSTTLLHWFQHHGR-SLPWRETKDPYAIWLSEVILQQTRVSQGLAYWQRFMHNYPTVNAL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A DEVL LW GLGYY+RARNLH+AA+Q+ L G FP T + ++ L GVG TA AI Sbjct: 66 AAATEDEVLRLWQGLGYYSRARNLHQAAKQIVEL--GYFPNTHDAISKLKGVGPYTAAAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ ++DGNV RVLAR + + E + + +L++ + P RFN+A+M Sbjct: 124 ASIAFNLPVAVVDGNVYRVLARFFGIDTPINSTEGKKQFATLAQSLLPHHAPARFNEAIM 183 Query: 186 DLGAMIC----------------TRSKPK-CSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 D GA+ C T P C+ CPL C+A A P K Q Sbjct: 184 DFGALQCLPVKGETTKTNNNIATTNESPAFCNDCPLNGQCVAYAQGLVRSLPVKTKSQAP 243 Query: 229 PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADE 267 +R ++ L+ EV + +RP +W GL+ + DE Sbjct: 244 KQRRMGYIYLRCRGEVAIRKRPAGDIWQGLWEPLLYEDE 282 >UniRef50_A5IU34 A/G-specific DNA-adenine glycosylase n=56 Tax=Staphylococcus RepID=A5IU34_STAA9 Length = 345 Score = 195 bits (496), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 120/352 (34%), Positives = 187/352 (53%), Gaps = 24/352 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q S F ++ W+D+ ++ +PW+ PY +WLSEVMLQQTQV TVI Y+ RF+ RFPT Sbjct: 3 QQSSFKENLIHWFDE-NQREMPWRQTTNPYYIWLSEVMLQQTQVKTVIDYYHRFVERFPT 61 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V L+ A DEVL W GLGYY+RARN H A ++V + G P+ ++ AL GVG T Sbjct: 62 VEVLSQASEDEVLKYWEGLGYYSRARNFHTAIKEVYDKYEGLVPKDPDQFKALKGVGPYT 121 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE--R 179 A++S++ +DGNV RV +R ++ ++++ S +++ P V E Sbjct: 122 QAAVMSIAYNVPLATVDGNVFRVWSR---LNDDYRDIKLQSTRKSYEQELLPYVTTEAGT 178 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYP--GKKPKQTLPERTGYFLL 237 FNQAMM+LGA+ICT P C CP+Q C A ++ P K + + E++ FL+ Sbjct: 179 FNQAMMELGALICTPKNPLCLFCPVQENCEAFDKGTFEKLPVKSKNVSKKVIEQS-VFLI 237 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL---TQLTAFRHTF 294 ++ + LL +R L G++ FP F E + R+ +I D T + +H F Sbjct: 238 RNNQGQYLLQKR-SEKLLHGMWQFPMFESEHARRK--MTEKIGHDIQPVETPIFELKHQF 294 Query: 295 SHFHLDIVPMWLPVSSFTGCMD----EGNALWYNLAQPPSVGLAAPVERLLQ 342 +H I V + +G ++ + +W+NL+ P+ ++ Q Sbjct: 295 THLTWKI-----KVYAASGAINIETLPDDMIWFNLSDRDQYTFPVPMSKIYQ 341 >UniRef50_Q8A7M6 A/G-specific adenine glycosylase n=2 Tax=Bacteroides RepID=Q8A7M6_BACTN Length = 357 Score = 195 bits (496), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 99/257 (38%), Positives = 145/257 (56%), Gaps = 5/257 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++WY +Y R LPW+ PY++W+SE++LQQT+VA YF RF+ RFP V L Sbjct: 13 FSEAIVEWYKEYKRD-LPWRESSDPYRIWISEIILQQTRVAQGYDYFLRFIKRFPDVKAL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A DEV+ W GLGYY+RARNLH AA+ + G FPET+ EV AL GVG TA AI Sbjct: 72 ADADEDEVMKYWQGLGYYSRARNLHAAAKSM----NGVFPETYPEVLALKGVGEYTAAAI 127 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + G + ++DGNV RVL+R + + E + +L++++ +NQ +M Sbjct: 128 CSFAYGMPYAVVDGNVYRVLSRYFGIDTPIDSTEGKKLFAALADEMLDKKQPALYNQGIM 187 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA+ CT P C CPL + C A + P K+ K R ++ ++ Sbjct: 188 DFGAIQCTPQSPDCLFCPLADSCSALSTGRVTQLPVKQHKTKTTNRYFNYIYVRAGAHTY 247 Query: 246 LAQRPPSGLWGGLYCFP 262 + +R + +W L+ P Sbjct: 248 INKRTANDIWKNLFELP 264 >UniRef50_C4FU23 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FU23_9FIRM Length = 410 Score = 195 bits (496), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 120/294 (40%), Positives = 169/294 (57%), Gaps = 32/294 (10%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F +L WYDK ++ LPW+ + PY +W+SE+MLQQTQVATVIPY++RFMA PTV DL Sbjct: 46 FRKDLLAWYDK-NKRDLPWRKSQDPYAIWVSEIMLQQTQVATVIPYYQRFMAALPTVADL 104 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP + +L+LW GLGYY+R RN+ AAQQV GG+ P+ + + +L G+G TA AI Sbjct: 105 AQAPEETLLNLWQGLGYYSRVRNMQAAAQQVMADFGGRMPDQVDSLLSLKGIGPYTAAAI 164 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVS---GWPGKKEVENKLWSLSEQVTPAVGVER--- 179 S++ G+ P LDGN+ R++AR + + P ++V E + + +R Sbjct: 165 ASMAFGRVAPALDGNLFRIVARLFRLKDDIALPKSRKV------FMEILDILIDPDRPGD 218 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL-----YP--GKKPKQTLPERT 232 FNQAMMDLGA + T S P+ PL AA + S+ + YP KK KQT + T Sbjct: 219 FNQAMMDLGACVMTPSNPRPDNHPL-----AAYDASYQVGDSDQYPVKSKKVKQTRHDLT 273 Query: 233 GYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ 286 YF +L + LL + + L GL+ FP L Q + ++ A LT+ Sbjct: 274 AYF-VLDSQGNWLLRRHGEAELLTGLWHFPM------LEQSMVYEEVTAAELTE 320 >UniRef50_Q2N933 Putative uncharacterized protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2N933_ERYLH Length = 347 Score = 195 bits (495), Expect = 3e-48, Method: Compositional matrix adjust. Identities = 121/301 (40%), Positives = 166/301 (55%), Gaps = 13/301 (4%) Query: 7 SAQVLDWYDKYGRKTLPWQID-----KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 S +LDWYD++ R LPW+ PY+VWLSE+MLQQT VA V PYF RF +PT Sbjct: 13 SGVLLDWYDRHARD-LPWRAQPGEPAADPYRVWLSEIMLQQTTVAAVKPYFARFTEVWPT 71 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A ++V+ W GLGYY+RARNL AA+ V L G FPET E+ LPGVG T Sbjct: 72 VEALAEASEEDVMAAWAGLGYYSRARNLLMAAKAVGDL--GGFPETEAELRRLPGVGEYT 129 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI +++ G+ ++D NV+RV++R +A+ P K + + ++ +TP F Sbjct: 130 AAAIAAIAFGRRAVVVDANVERVVSRLFAIEE-PLPK-ARRAIRAHADAITPDRRAGDFA 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QAMMDLG+ +CT P+C LCPL C A +P K P++ P R G ++ E Sbjct: 188 QAMMDLGSQVCTTRAPQCLLCPLARFCEARTQEP-ERFPVKPPRKAKPLRKGRAFWIERE 246 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 V L +R SG+ GG+ P D S R + A + H F+HF L++ Sbjct: 247 GTVWLVRREGSGMLGGMRSLPD--DGWSARSDGSGELPLAGEWENAGSVFHVFTHFALEL 304 Query: 302 V 302 Sbjct: 305 T 305 >UniRef50_D2QT13 A/G-specific adenine glycosylase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QT13_9SPHI Length = 362 Score = 195 bits (495), Expect = 3e-48, Method: Compositional matrix adjust. Identities = 99/257 (38%), Positives = 149/257 (57%), Gaps = 2/257 (0%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ + WY + R LPW+ PY +WLSE++LQQT+VA PY+ERF+ +PT++D+ Sbjct: 14 FAPTLEQWYSVHKRD-LPWRHTLDPYYIWLSEIILQQTRVAQGKPYYERFVDAYPTISDM 72 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA ++L LW GLGYY+RARNLH+ A+ V GKFP T+ ++ + G+G TA A+ Sbjct: 73 ANADERKLLRLWQGLGYYSRARNLHQTARYVTEKLDGKFPNTYHDLLKMKGIGAYTAAAV 132 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-PAVGVERFNQAM 184 S + G+ P++DGNV RVLAR + ++ + +L+ ++ AV +NQA+ Sbjct: 133 ASFAFGERVPVVDGNVYRVLARVFGITEDITTTTAKKTFAALATRLMQAAVDPATYNQAI 192 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+ GA+ CT P C LCP+Q C+A P K K + ER +++ Q + Sbjct: 193 MEFGAIHCTPVAPDCLLCPVQQQCVAYLTGRQHQLPVKSKKAPVRERFFSYVVFQQGGRL 252 Query: 245 LLAQRPPSGLWGGLYCF 261 L +R +W LY F Sbjct: 253 ALQERVARDIWQNLYDF 269 >UniRef50_A9BD72 A/G-specific DNA glycosylase n=3 Tax=Cyanobacteria RepID=A9BD72_PROM4 Length = 399 Score = 194 bits (493), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 117/323 (36%), Positives = 171/323 (52%), Gaps = 28/323 (8%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDK-----------TPYKVWLSEVMLQQTQVATVIPYFER 54 + +L W+ GR +PW++ K Y + ++EVMLQQTQ+ V+PY+E+ Sbjct: 24 LRSTLLQWFKSNGRHYIPWKLTKDGTLPNENQYLAVYPILVAEVMLQQTQLKVVLPYWEK 83 Query: 55 FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG-------GKFPET 107 +M PT+ DLA A D+VL LW GLGYY+RAR LH ++ + L G +P+ Sbjct: 84 WMLALPTLVDLAKAEEDKVLLLWQGLGYYSRARRLHVTSRILLNLIGIPNSLNPANWPKD 143 Query: 108 FEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 E LPG+GR+TAG+I+S + P+LDGNVKRVL R + P K +LW L Sbjct: 144 LESWMNLPGIGRNTAGSIISSAFNLPSPLLDGNVKRVLTRLIGSTKTPNKDLA--RLWKL 201 Query: 168 SEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT 227 S+ + FNQA+MDLGA ICT+ P C+ CP QN C A + + P K K Sbjct: 202 SDLLLDKNLPRTFNQALMDLGATICTKYNPICTNCPWQNYCSAYNSGNPENLPVKGQKLI 261 Query: 228 LPERT-GYFLLLQHEDEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQR-----QI 279 L + G L+L +VL+ QR G GG++ FP + +ES+ +A+ + Sbjct: 262 LSKAVIGVGLILNKNQDVLIDQRLDEGSMGGMWEFPGGKKEKDESIEMTIARELREELGV 321 Query: 280 AADNLTQLTAFRHTFSHFHLDIV 302 +L F H+++H L + Sbjct: 322 EVKVGKKLIEFDHSYTHKKLHFI 344 >UniRef50_UPI0001BC63FA A/G-specific adenine glycosylase n=3 Tax=Fusobacterium RepID=UPI0001BC63FA Length = 365 Score = 194 bits (493), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 117/313 (37%), Positives = 168/313 (53%), Gaps = 5/313 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + + ++L++YDK+ R L W+ + Y W+SE+MLQQT+V V PYF RF+ P Sbjct: 10 EMGKIAKKLLEYYDKHKR-DLAWRGEVPAYYTWISEIMLQQTRVEAVKPYFARFIEELPN 68 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LAN ++++ LW GLGYY+RARNL KAA Q+ +GG+ P+ +E+ L G+G T Sbjct: 69 IESLANCEEEKLMKLWQGLGYYSRARNLKKAACQIVEFYGGELPKEKKELLHLAGIGPYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ GK +DGNV RV++R +AV G + + K+ L+ Q P FN Sbjct: 129 AGAISSIAYGKKETAVDGNVIRVMSRLFAVDGNVLEGKGRQKIEELTYQELPEDRAGDFN 188 Query: 182 QAMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA IC C LCPL C A YP KK K+ LLL Sbjct: 189 QALMDLGATICIPNGAALCHLCPLHLECQANLKKEVEKYPEKKKKKERKLERQTILLLSD 248 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESL---RQWLAQRQIAADNLTQLTAFRHTFSHF 297 + L +R GL GL+ FP SL R++L ++ I+ + + H FSH Sbjct: 249 GQKFALEKRKEKGLLAGLWQFPMLEGRLSLQEVREYLKEKGISYSGIEEYEPAIHIFSHV 308 Query: 298 HLDIVPMWLPVSS 310 +V + V Sbjct: 309 EWHMVSYIIEVEK 321 >UniRef50_Q0EVU8 Adenine glycosylase n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EVU8_9PROT Length = 307 Score = 194 bits (492), Expect = 6e-48, Method: Compositional matrix adjust. Identities = 104/257 (40%), Positives = 153/257 (59%), Gaps = 16/257 (6%) Query: 11 LDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPL 70 + WY ++ R LPW+ PY++W+SE+MLQQTQV TV+P + ++ FPT+ LA A Sbjct: 1 MAWYARHARD-LPWRHTTDPYRIWISEIMLQQTQVKTVLPRYLAWLDTFPTIESLAAASA 59 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 D++L W GLGYY RAR +H+AA + G+FP F+++ +LPG+GRSTAGAI S Sbjct: 60 DDLLKAWEGLGYYRRARFIHQAAATIMEGFEGRFPRDFDDIVSLPGIGRSTAGAIASFCY 119 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENK-LWSLSEQVTPAVGVER-FNQAMMDLG 188 G P+LDGNVKRVL R W G+ + +K LW L++Q G +NQAMM+LG Sbjct: 120 GASTPVLDGNVKRVLKR------WHGQPDASDKALWLLAQQAINISGKPGIWNQAMMELG 173 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH---EDEVL 245 A C+ P C CP+ C +A + + ++ ++ +P ++ + H + Sbjct: 174 ASACSAKSPDCGACPVNAFCASA----FQVEVSQERRKAVPVLDVHWRIDLHICPIKGIW 229 Query: 246 LAQRPPSGLWGGLYCFP 262 L QRP +G+WGGL+ P Sbjct: 230 LTQRPDAGIWGGLWTPP 246 >UniRef50_Q1ITJ1 A/G-specific DNA glycosylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1ITJ1_ACIBL Length = 324 Score = 193 bits (491), Expect = 7e-48, Method: Compositional matrix adjust. Identities = 106/261 (40%), Positives = 152/261 (58%), Gaps = 14/261 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 AS +L WY ++ R+ LPW+ + PY +W+SE+MLQQT+VA V+ + +F+A+FP V Sbjct: 12 ASDLQKSLLSWY-RHSRRNLPWRRTRDPYAIWISEIMLQQTRVAAVLDKYAQFLAQFPNV 70 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+A LDEVL +W+GLGYY RAR LH+AAQ V GKFP+T LPG+GR T+ Sbjct: 71 KALADASLDEVLTVWSGLGYYRRARALHQAAQMVVHHLHGKFPDTAAGWRQLPGIGRYTS 130 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI S++ + ++DGNV+RVL R G++ +LW +EQ+ +NQ Sbjct: 131 AAIASIAFNEPAAVVDGNVERVLERL------DGERHEGERLWERAEQLLAKRAPGDWNQ 184 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMM+LGA IC P+C +CP+ C L +PK+ E ++ L ++ Sbjct: 185 AMMELGATICLPQNPQCLVCPVNGPC----KTRGPLQSRPQPKRKRAEL--WYALYARKN 238 Query: 243 EVLLAQRPPS-GLWGGLYCFP 262 VLL QRP L G++ P Sbjct: 239 SVLLVQRPADHSLMAGMWELP 259 >UniRef50_Q1GD89 A/G-specific DNA-adenine glycosylase n=11 Tax=Rhodobacterales RepID=Q1GD89_SILST Length = 353 Score = 193 bits (490), Expect = 8e-48, Method: Compositional matrix adjust. Identities = 126/310 (40%), Positives = 172/310 (55%), Gaps = 18/310 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKT---------PYKVWLSEVMLQQTQVATVIPY 51 + A Q S +L WYD + R+ +PW++ PY++WLSEVMLQQT VA V Y Sbjct: 6 VSAEQLSQDLLVWYDTHARE-MPWRVGPAARASGVRPDPYRIWLSEVMLQQTTVAAVKDY 64 Query: 52 FERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV 111 FERF R+P V DLA A +V+ W GLGYYARARNL K A+ VA G FP+ +E + Sbjct: 65 FERFTRRWPRVGDLAAAEDGDVMAEWAGLGYYARARNLLKCARVVAEEFEGVFPDAYEGL 124 Query: 112 AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAV-SGWPGKKEVENKLWSLSEQ 170 ALPG+G TA AI +++ + +LDGNV+RV+AR + P K V L + + Sbjct: 125 IALPGIGPYTAAAISAIAFDRPETVLDGNVERVMARLHDEHEPLPAVKPV---LKAHAAH 181 Query: 171 VTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE 230 +TP+ + QA+MDLGA ICT P C +CP ++ C A + P K PK+ P Sbjct: 182 LTPSARPGDYAQAVMDLGATICTPKSPACGICPWRDPCRARVKGTAPELPKKTPKKPKPT 241 Query: 231 RTGY-FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA 289 R G+ +L E + LL +RP GL GG+ +P ++ + AD Sbjct: 242 RYGFVYLARSAEGDWLLERRPDKGLLGGMLGWPGSEWNDAPTE---TPPFDADWQDLGAE 298 Query: 290 FRHTFSHFHL 299 RHTF+HFHL Sbjct: 299 VRHTFTHFHL 308 >UniRef50_A8ZZH0 A/G-specific adenine glycosylase n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZH0_DESOH Length = 360 Score = 193 bits (490), Expect = 9e-48, Method: Compositional matrix adjust. Identities = 110/308 (35%), Positives = 160/308 (51%), Gaps = 8/308 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A F ++L WY + R LPW+ K PY +W+SEVMLQQTQVATV+ Y+ RF+ FP Sbjct: 4 FSAGPFQRRLLRWYTAHQRD-LPWRRSKNPYHIWVSEVMLQQTQVATVVDYYRRFLQAFP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA A L +VL LW GLGYYARA NLHKAA+Q+ + P T E LPGVG Sbjct: 63 DIGTLAVAELQDVLKLWEGLGYYARAANLHKAARQIVAGGKKRVPRTPETFGRLPGVGDY 122 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 A+ S++ G P++DGNVKRVLAR + + + + + F Sbjct: 123 INAAVSSIAFGHPLPVVDGNVKRVLARLFLLDEPVNRPSNHRVFLEKARLLLAFKDPGTF 182 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMM+LGA++C +P C CP+ + C A +P + + P L++ Sbjct: 183 NQAMMELGALVCKPGRPLCDQCPVASFCGAHQAGRVTDFPRRLAARKNPHHHLAVGLVKK 242 Query: 241 EDEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIAADNLT-----QLTAFRHT 293 + L+ +RP +GL GL+ P + E+ + + + LT +L H Sbjct: 243 GNRFLIVRRPATGLLAGLWEMPGGRVEKPENPADACCRAVLESVGLTVFPGPRLARVAHA 302 Query: 294 FSHFHLDI 301 ++HF + + Sbjct: 303 YTHFKITM 310 >UniRef50_B8J1Y9 A/G-specific adenine glycosylase n=12 Tax=Desulfovibrionales RepID=B8J1Y9_DESDA Length = 435 Score = 192 bits (489), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 111/302 (36%), Positives = 170/302 (56%), Gaps = 15/302 (4%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +LDW+ + R LPW+ TPY+VW+SEVMLQQTQ+ + YF R+M RFP + LA A Sbjct: 58 LLDWFAVHQR-ALPWRNSYTPYEVWISEVMLQQTQMERGVRYFIRWMERFPDIAALAAAH 116 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 ++VL +W GLGYY+RAR++ AA+++ H G FP + ALPGVG TAGAI S++ Sbjct: 117 EEDVLRMWEGLGYYSRARHILAAARKIMAEHNGIFPRDPAAIRALPGVGPYTAGAIASIA 176 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKL---WSLSEQVTPAVGVERFNQAMMD 186 K P +D NV+RV++R + V P K+E + W+L ++ P NQAMM+ Sbjct: 177 FEKKLPCVDANVERVVSRIFDVDS-PVKQEPAAGVIHQWAL--RLVPEGKARAHNQAMME 233 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA++C R KP C+ CPL CI+ P + T+ +L+ +++ + Sbjct: 234 LGALVC-RKKPCCASCPLGAFCISHHLGIEDQRPVPGKRATVTPVNAVTGVLRRGEKIFV 292 Query: 247 AQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +RP SG+WG L+ FP EE +R+++ + + + RH ++ + L Sbjct: 293 QKRPASGVWGNLWEFPGGRVEEGESPEQAVVREFMEETGFTVNVAARHGIIRHGYTTYRL 352 Query: 300 DI 301 + Sbjct: 353 TL 354 >UniRef50_B7CBH3 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CBH3_9FIRM Length = 328 Score = 192 bits (488), Expect = 1e-47, Method: Compositional matrix adjust. Identities = 103/300 (34%), Positives = 168/300 (56%), Gaps = 19/300 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F ++DWY K R L +++ K PY++W+SE+M QQT++ ++PYF+R++ + P + Sbjct: 2 KEFQKDLIDWYIKNHR-PLEFRLKKDPYEIWISEIMAQQTRIEAMLPYFKRWIQQLPDIE 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A +++ LW GLGYY+R +N+ K A + + GK P T EE+ LPG+G TAG Sbjct: 61 SVAKCDDEKLNKLWQGLGYYSRCKNIKKCAIECVEKYSGKLPCTKEELLKLPGIGPYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFNQ 182 AI S++ G+ +DGNV RV +R Y + K V+ ++ L ++ P+ + +NQ Sbjct: 121 AIASIANGQRVSAVDGNVIRVFSRLYNIFEDVTKTSVKKQIEELVDESLPSKEEISYYNQ 180 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA+IC P+C LCP++ C A S Y KK + + + Y L+ H+D Sbjct: 181 AIMELGALICIPKNPRCELCPVKKYCDAKDPGSLP-YKAKKKMRKIEYKHLYILV--HKD 237 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH--FHLD 300 +V + +R +GL GLY F + E + +L + H FSH +H+D Sbjct: 238 KVHVVKRADTGLLAGLYGFDETKPEHIIES------------IELEPYIHIFSHVEWHMD 285 >UniRef50_Q7VGS8 A/G-specific adenine glycosylase n=537 Tax=Helicobacter RepID=Q7VGS8_HELHP Length = 350 Score = 191 bits (486), Expect = 3e-47, Method: Compositional matrix adjust. Identities = 109/275 (39%), Positives = 155/275 (56%), Gaps = 16/275 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQI---DKTPYKVWLSEVMLQQTQVATVIP-YFERFMAR 58 A + +L+WY K+GR LPW+ PY V++SE+MLQQTQV V+ Y+ FM Sbjct: 14 AQTWRENILNWYKKHGRVYLPWRNLSEKNAPYGVYVSEIMLQQTQVKRVLEHYYMPFMTA 73 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+T LANA + +L LW GLGYY R RN+ K+A+ P T+ E+ +LPG+G Sbjct: 74 FPTLTSLANAQEESLLKLWEGLGYYTRVRNMQKSARICCQRFNNTLPHTYAELISLPGIG 133 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 +AGAIL L ++ +DGN++RV R +A+S P +K +E W L E P + Sbjct: 134 AYSAGAILCFGLRQNVAFVDGNIRRVFCRIFALSS-PTQKSLEELAWILLE---PKYSFD 189 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL- 237 +NQA++D+GAMICT P C +CPLQ C + LYP K Q P L Sbjct: 190 -YNQALLDIGAMICTPKSPSCLICPLQQLC--EGQITPELYPTPKKSQLQPLHLHLALCI 246 Query: 238 ---LQHEDEVLLAQRPPSGLWGGLYCFPQFADEES 269 + ++ E+ L R L+ GLY P+ ++EE+ Sbjct: 247 SQNINNKTEIGLL-RSTQNLYRGLYNLPRLSEEEA 280 >UniRef50_B8JJK1 Novel protein similar to H.sapiens MUTYH, mutY homolog (E. coli) (MUTYH) n=2 Tax=Clupeocephala RepID=B8JJK1_DANRE Length = 526 Score = 190 bits (483), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 130/415 (31%), Positives = 196/415 (47%), Gaps = 76/415 (18%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTP--------YKVWLSEVMLQQTQVATVIPYFE 53 + S F + ++ WYD+ R+ LPW+ T Y VW+SE+MLQQTQVATVI Y+ Sbjct: 55 EISVFRSDLMKWYDENKRE-LPWRTLATTEQDDNIRTYAVWVSEIMLQQTQVATVIDYYN 113 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEE-VA 112 R+M R+PTV LA A L+EV +W+GLGYY+R R LH+ AQ+V + G+ P+T + Sbjct: 114 RWMKRWPTVEKLAAATLEEVNQMWSGLGYYSRGRRLHEGAQKVVSELDGQMPKTTAGLLK 173 Query: 113 ALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 LPGVGR TAGAI S++LG+ +DGNV RVL R A+ V + LW +++ + Sbjct: 174 QLPGVGRYTAGAIGSIALGQVTGAVDGNVIRVLCRVRAIGADSSSPAVTDALWRIADALV 233 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQN-------------------------- 206 FNQAMM+LGA +CT P CS CP+Q Sbjct: 234 DPERPGDFNQAMMELGARVCTPKSPVCSQCPIQTHCHAFKKISMKQEMDCKRLLNKLATN 293 Query: 207 ---------GCIAAANNSWAL------------YPGKKPKQTLPERTGYFLLLQH----- 240 C++A + + L YP K K+ +++H Sbjct: 294 PKNPVPDIENCMSAGSCNLCLSEDWDPQLGVQNYPRKPVKKAPRVEQTLTCIVEHQRAGE 353 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQF---ADE----------ESLRQWLAQRQIAADNLTQL 287 E E LL QRP GL G++ P AD + +++WL + + ++ + Sbjct: 354 ESEYLLTQRPSKGLLAGMWELPSVLLQADISENKYKELICDMMQKWL-ETPLDTHSVQFV 412 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 H FSH H + + VS + + W + ++ V+++++ Sbjct: 413 GEVVHIFSHIHQTYIVFSVHVSDCSDREQKQKTCWLTKSALQKAAVSTGVKKIMK 467 >UniRef50_A6G959 A/G-specific adenine glycosylase n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G959_9DELT Length = 378 Score = 190 bits (482), Expect = 7e-47, Method: Compositional matrix adjust. Identities = 103/253 (40%), Positives = 146/253 (57%), Gaps = 3/253 (1%) Query: 13 WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDE 72 W+ + R LPW+ + PY +W+SE+MLQQT+V TV Y++ F+ RFPTV LA A Sbjct: 23 WFKRDARD-LPWRRTRDPYAIWVSEIMLQQTRVDTVENYWQPFLDRFPTVESLAAAEQQA 81 Query: 73 VLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGK 132 VL W+GLGYY RA+ LH+ AQ V GG+ P T + + A+PG+GR TAGAI S++ + Sbjct: 82 VLEAWSGLGYYRRAKLLHRGAQYVHEELGGEVPGTADALRAIPGIGRYTAGAIASIAFDQ 141 Query: 133 HFPILDGNVKRVLARCYAVSGWPGKKEVENKL-WSLSEQVTPAVGVERFNQAMMDLGAMI 191 P++DGNV RV +R A+ P +++ + + W V QA+M+LGA + Sbjct: 142 PAPLVDGNVARVHSRLAAIEA-PAEQDAKAEAHWRFVAGVLEHGEPRVLAQALMELGATV 200 Query: 192 CTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPP 251 CT P C CP++ C A A P K K+ E L L+ D++L+ +RP Sbjct: 201 CTPRSPTCLTCPVREHCRARARGLQDQIPAPKVKKKATEHHLLALALRRGDKLLIERRPD 260 Query: 252 SGLWGGLYCFPQF 264 GL GGL+C P F Sbjct: 261 EGLLGGLWCLPVF 273 >UniRef50_B0SFS5 A/G-specific DNA glycosylase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SFS5_LEPBA Length = 353 Score = 190 bits (482), Expect = 8e-47, Method: Compositional matrix adjust. Identities = 105/255 (41%), Positives = 145/255 (56%), Gaps = 9/255 (3%) Query: 12 DWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD 71 DWY + R LP++ K Y +W+SEVMLQQT+VA ++P FE F+ RFP LA A + Sbjct: 9 DWYLLHKR-DLPFRKKKQAYPIWISEVMLQQTRVAAMLPLFENFVNRFPNPESLAKATEE 67 Query: 72 EVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLG 131 EVL W GLGYY+RARN+ KAA Q+ + G FP+ + V LPG+G TA AILS+S Sbjct: 68 EVLSFWKGLGYYSRARNIRKAAIQIVQQYNGSFPKDLDSVLKLPGIGNYTARAILSISYD 127 Query: 132 KHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 +LDGNVKRVL+R Y + G +KE++ K P NQA+M+L Sbjct: 128 LPLAVLDGNVKRVLSRYYGYTKNILGPQAEKELQLKADGFLNLDFPG----DHNQAVMEL 183 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA IC PKC +CPL +GC A + P ++ KQ TG ++Q ++ +LL Sbjct: 184 GATICLPESPKCLVCPLMDGCYARIHGKTKEIPLREKKQKQVLLTGEIWVIQQKNLILLI 243 Query: 248 QRPPSGLWGGLYCFP 262 + + G++ P Sbjct: 244 KEKKNRFLKGMFHLP 258 >UniRef50_C6XF00 A/G-specific adenine glycosylase n=1 Tax=Candidatus Liberibacter asiaticus str. psy62 RepID=C6XF00_LIBAP Length = 356 Score = 189 bits (479), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 101/269 (37%), Positives = 153/269 (56%), Gaps = 18/269 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDK--------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 +++LDWYD R LPW+ +PYKVW+SE+MLQQT V TV PYF++FM Sbjct: 8 IQSKILDWYDTNHR-VLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQ 66 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 ++PT+ L++A +E+L W GLGYY RARNL K A + + G FP E + LPG+ Sbjct: 67 KWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGI 126 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPG---KKEVENKLWSLSEQVTPA 174 G TA AI++++ ++D N++R+++R + + P K ++N ++ P Sbjct: 127 GDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIK-PAPLYHKTIKNYARKITSTSRPG 185 Query: 175 VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTG- 233 F QAMMDLGA+ICT +KP C LCP+Q C+ + L K+ P RTG Sbjct: 186 ----DFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGA 241 Query: 234 YFLLLQHEDEVLLAQRPPSGLWGGLYCFP 262 F+ + +++ +LL +R + L G+ P Sbjct: 242 VFIAITNDNRILLRKRTNTRLLEGMDELP 270 >UniRef50_C9KJM4 A/G-specific adenine glycosylase n=3 Tax=Clostridiales RepID=C9KJM4_9FIRM Length = 370 Score = 188 bits (477), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 118/307 (38%), Positives = 159/307 (51%), Gaps = 20/307 (6%) Query: 4 SQFSAQVLDWYD-KYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 S +L WY+ + + LPW+ + TPY WLSE+MLQQT+ + VIPY+ RF+ P + Sbjct: 19 SSIVGPLLTWYEGQSTLRCLPWREEVTPYHTWLSEIMLQQTRASAVIPYYTRFLEALPDI 78 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA + ++ LW GLGYY+RARNL KAA + +GGK P F + LPG+GR TA Sbjct: 79 ESLARCDDELLMKLWQGLGYYSRARNLKKAAGVICQEYGGKLPADFAALLELPGIGRYTA 138 Query: 123 GAILSLSLGKHFPILDGNVKRVLAR---CYAVSGWPG-KKEVENKL--WSLSEQVTPAVG 176 AI S++ G P +DGN+ RV R C P ++ VE L + S Q A Sbjct: 139 SAIGSIAFGLPLPAVDGNILRVTMRVLQCGEDIAQPAVRRAVEEALAPYYPSGQAAGA-- 196 Query: 177 VERFNQAMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYP--GKKPKQTLPERTG 233 NQA MDLGA IC P C+ CPL C+A S P KK K+ + ++T Sbjct: 197 ---LNQAFMDLGATICLPHGAPHCAACPLARLCLAHDAGSEQQLPVKSKKAKRRVEQKT- 252 Query: 234 YFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQRQIAADNLTQLTAF 290 LLL + L +RP GL GL+ P + + WLA R + + +L Sbjct: 253 -VLLLHQGRALALTKRPNKGLLAGLWELPNLPGHLTQAEVEAWLAARGLRTLTIARLPDA 311 Query: 291 RHTFSHF 297 RH FSH Sbjct: 312 RHIFSHV 318 >UniRef50_Q0P4V1 Putative uncharacterized protein MGC145569 n=1 Tax=Xenopus (Silurana) tropicalis RepID=Q0P4V1_XENTR Length = 520 Score = 188 bits (477), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 97/211 (45%), Positives = 134/211 (63%), Gaps = 10/211 (4%) Query: 9 QVLDWYDKYGRKTLPWQ--------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++L WYDK ++ LPW+ +D+ Y VW+SEVMLQQTQVATVI Y+ ++M +P Sbjct: 54 KLLAWYDK-SKRDLPWRTMACTEPDLDRKAYAVWVSEVMLQQTQVATVIDYYNKWMKVWP 112 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA-LPGVGR 119 T+ DLA + L+EV +W+GLGYY+R R L + A++V GG P + +E+ LPGVGR Sbjct: 113 TMEDLARSSLEEVNEMWSGLGYYSRGRRLQEGAKKVVLELGGSMPRSADELQKLLPGVGR 172 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TAGAI S+S G+ ++DGNV RVL+R + V +KLW+L+ + Sbjct: 173 YTAGAIASISYGQVTGVVDGNVIRVLSRLRCIGADSSTLAVSDKLWNLANALVDPDRPGD 232 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 FNQ MM+LGA +CT KP C+ CPLQ C A Sbjct: 233 FNQGMMELGATVCTPKKPLCTACPLQGQCKA 263 >UniRef50_C0YGU7 Possible A/G-specific adenine glycosylase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YGU7_9FLAO Length = 359 Score = 187 bits (476), Expect = 3e-46, Method: Compositional matrix adjust. Identities = 100/263 (38%), Positives = 141/263 (53%), Gaps = 5/263 (1%) Query: 3 ASQF---SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 AS+F ++L+WY R LP++ K PYK+W+ E++ QQT++ + ++ F+ RF Sbjct: 22 ASEFLHIGNRLLEWYRNNAR-DLPFRQTKDPYKIWICEIVFQQTRINQGLNHYNNFIKRF 80 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P V LA A +EVL W GLGYY+RA N+HKAAQQ+ + G FP +EE+ L GVG+ Sbjct: 81 PDVKTLAEADENEVLLYWKGLGYYSRAINIHKAAQQIMNDYQGVFPHQYEEILKLKGVGK 140 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ S+ G P +DGN RVL+R +A L+ V P V Sbjct: 141 YTAAAVSSICFGGRMPAVDGNFYRVLSRIFADDFDISNSRAFTYFSELAALVMPE-NVGD 199 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 FNQAMMDLG+ IC P C CPL C+A + + YP K K + + + Sbjct: 200 FNQAMMDLGSEICKPKNPLCGECPLNEDCLAFSLQKISDYPVKTKKVKAGDLALTYYFVH 259 Query: 240 HEDEVLLAQRPPSGLWGGLYCFP 262 + L+ QR +W L+ FP Sbjct: 260 RNGKFLIRQRADDFIWKKLFEFP 282 >UniRef50_C4V5S7 Adenine glycosylase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V5S7_9FIRM Length = 369 Score = 187 bits (476), Expect = 4e-46, Method: Compositional matrix adjust. Identities = 107/288 (37%), Positives = 147/288 (51%), Gaps = 5/288 (1%) Query: 20 KTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTG 79 + LPW+ + TPY VW+SE+MLQQT+ A V Y+ RF+ P++ DLA D ++ LW G Sbjct: 30 RDLPWRDEPTPYHVWISEIMLQQTRAAVVRAYYLRFLTALPSIHDLAAVNDDALMKLWQG 89 Query: 80 LGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDG 139 LGYY+RARNL +AAQ + HGG P F + LPG+GR TA AI S + G+ P +DG Sbjct: 90 LGYYSRARNLKRAAQVIVKEHGGDLPNDFNALLTLPGIGRYTASAIASFAYGQPHPAVDG 149 Query: 140 NVKRVLARCYAVSGWPGKKEVENKL-WSLSEQVTPAVGVERFNQAMMDLGAMICT-RSKP 197 N RV AR A GK + L +LS N+A MDLGA IC P Sbjct: 150 NFLRVAARITANPIDIGKDSTKRSLEAALSVSYPEGRDAGLLNEAFMDLGATICLPHGAP 209 Query: 198 KCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGG 257 C CP C+A + YP K + + L+L D + + +RP GL G Sbjct: 210 LCHSCPAAQLCLAHDRGTEQDYPVKSAAKARRKEKHTVLILSCGDRIAIRKRPTKGLLAG 269 Query: 258 LYCFPQFADEES---LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L+ +P ++S +R L ++ L RH FSH ++ Sbjct: 270 LWEYPHLDGKQSKRAVRAHLEDEGFRILSIDPLPPARHIFSHIEWELT 317 >UniRef50_Q6AKD3 Related to A/G-specific adenine glycosylase n=1 Tax=Desulfotalea psychrophila RepID=Q6AKD3_DESPS Length = 366 Score = 187 bits (476), Expect = 4e-46, Method: Compositional matrix adjust. Identities = 96/269 (35%), Positives = 150/269 (55%), Gaps = 3/269 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F +Q+L W+ + + LPW+ PY VW+SE+MLQQTQ+ + YF R++ RFP V Sbjct: 11 DRFQSQLLTWF-RLQDRFLPWRQTYDPYHVWISEIMLQQTQMDRGVSYFNRWIERFPQVE 69 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A A E+ +W GLGYYARARNLH+AA+++ G+ P + + +LPG+G TA Sbjct: 70 AVAEADEQEIFKMWEGLGYYARARNLHRAAKKIVEEFAGELPCDIDLLRSLPGIGPYTAA 129 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ P +D NV R+ +R + + + +V + ++ P+ +NQA Sbjct: 130 AIGSVACNIDIPTIDANVARIFSRLFDIDRPVRETQVARAIEKVACDCLPSGRARHWNQA 189 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLG ++C P+C+LCP+Q C+A A + L P PKQ L + E Sbjct: 190 LMDLGGLVCLPRAPRCTLCPIQEMCLAFARQTVQLRPVPLPKQKLIHIRRVAIFQVSEGR 249 Query: 244 VLLAQRPPSGLWGGLYCFPQ--FADEESL 270 +L+ LW GL+ FP +EE++ Sbjct: 250 LLIQPSRHPTLWQGLWEFPHTDLQEEEAV 278 >UniRef50_Q03P68 A/G-specific DNA-adenine glycosylase n=2 Tax=Lactobacillales RepID=Q03P68_LACBA Length = 379 Score = 187 bits (475), Expect = 5e-46, Method: Compositional matrix adjust. Identities = 110/263 (41%), Positives = 152/263 (57%), Gaps = 11/263 (4%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +F +L WYD+ GR LPW+ D+ PY VW+SE+MLQQTQV TVIPY+ FMA FPTV D Sbjct: 11 EFRQTLLAWYDREGRD-LPWRHDQDPYHVWVSEIMLQQTQVQTVIPYYLNFMAMFPTVAD 69 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP +++L W GLGYY+RARNL +AA+Q+ + GK+P+T E+ L G+G TAGA Sbjct: 70 LAQAPEEQLLKAWQGLGYYSRARNLQRAARQLVDDYRGKWPQTAAELLDLTGIGPYTAGA 129 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-PAVGVER---F 180 I S++ G+ P +DGN RV +R + K + L EQV P V +R F Sbjct: 130 IASIAFGEVVPAVDGNAFRVFSRLLLIDADIAKPQTRK----LFEQVIQPIVDPQRPGAF 185 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF-LLLQ 239 NQA+MDLG+ T + P++ + N A +P K K P Y+ L++ Sbjct: 186 NQAIMDLGSSYMTAKNSDPAHSPVRAFDASYRENCVADFPVKT-KAPRPRPVPYYGLIVT 244 Query: 240 HEDEVLLAQRPPSGLWGGLYCFP 262 L+ QR + + G + FP Sbjct: 245 SPAGYLMVQRDATQMLTGYWTFP 267 >UniRef50_Q3KMR2 A/G-specific adenine DNA glycosylase n=9 Tax=Chlamydia RepID=Q3KMR2_CHLTA Length = 368 Score = 187 bits (474), Expect = 6e-46, Method: Compositional matrix adjust. Identities = 87/247 (35%), Positives = 145/247 (58%), Gaps = 1/247 (0%) Query: 18 GRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLW 77 +++ PW+ TPY+VW+SEVMLQQT+ V+PYF ++M RFPT+ DLA A +V+ LW Sbjct: 25 SKRSFPWRDSPTPYRVWVSEVMLQQTRAEVVVPYFLKWMERFPTLQDLAQARESDVVQLW 84 Query: 78 TGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPIL 137 GLGYY+RARNL A+ + + GG+ P ++++ G+G TA AIL+ + + P + Sbjct: 85 EGLGYYSRARNLLAGARVITEIFGGEIPNDLALLSSIKGIGSYTANAILAFAFKQKNPAV 144 Query: 138 DGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKP 197 DGNV RV++R +A+ + ++ L E + P + ++ ++LGA IC + +P Sbjct: 145 DGNVLRVMSRLFAIEESIDRMNTRREITGLCESLLPDQDPQVIAESFIELGARIC-KKQP 203 Query: 198 KCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGG 257 C CPL++ C A + YP + + + +++ ++D+VL+ +R + G Sbjct: 204 LCEQCPLRSFCTAYRQGTMEQYPVRNTRAAISRLFRAVVIVLYKDQVLMTKREEKEIMAG 263 Query: 258 LYCFPQF 264 LY FP + Sbjct: 264 LYEFPYY 270 >UniRef50_C7G751 A/G-specific adenine glycosylase n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G751_9FIRM Length = 395 Score = 186 bits (473), Expect = 8e-46, Method: Compositional matrix adjust. Identities = 111/325 (34%), Positives = 157/325 (48%), Gaps = 31/325 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q + +L W+ + R LPW+ TPY+VW+SE+MLQQT+V V PYFERF P Sbjct: 8 KQLVSPLLQWFLNHAR-VLPWREKPTPYRVWVSEIMLQQTRVEAVKPYFERFTTALPDAK 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA P DE+L LW GLGYY R RN+ KAA +V +GG+ P +E++ L G+G TAG Sbjct: 67 ALAVCPEDELLKLWEGLGYYNRVRNMQKAAVEVVENYGGQLPADYEKLLKLKGIGHYTAG 126 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER---- 179 A+ S++ G P +DGNV RVL R A K+ N++ L E++ Sbjct: 127 AVASIAYGIPVPAVDGNVLRVLTRVSADDTDIMKQSFRNEMEKLLEKLMHGADGRNEKNV 186 Query: 180 ----------------------FNQAMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSW 216 FNQA+M+LGA IC P C+ CP ++ C A Sbjct: 187 FPWMEDADDLRAKVYHQNLAGAFNQALMELGATICVPNGAPLCTECPWKDFCEAKKQELI 246 Query: 217 ALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQW 273 P K + L+L+ +++V + +RP GL GLY P +E + Sbjct: 247 GQIPVKSKAKPRKIEKKTVLILRDDEKVAIRKRPAKGLLAGLYELPNVEGNMGQEEVLSM 306 Query: 274 LAQRQIAADNLTQLTAFRHTFSHFH 298 + + A + L +H FSH Sbjct: 307 VKEMGYAPIRIQPLGEAKHIFSHIE 331 >UniRef50_Q6MHU9 A/G-specific adenine glycosylase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MHU9_BDEBA Length = 333 Score = 186 bits (473), Expect = 9e-46, Method: Compositional matrix adjust. Identities = 124/339 (36%), Positives = 183/339 (53%), Gaps = 27/339 (7%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 Q+ WY+K +++LPW+ +K PY++WLSEVMLQQT V VIPYFE+F+ +FPTV DLANA Sbjct: 19 QLTQWYNK-NKRSLPWRENKNPYRIWLSEVMLQQTTVVAVIPYFEKFLQKFPTVQDLANA 77 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 P +V+ W GLGYY+RARNLHKAA+ +A FP+T E+ LPG G T+ A+ S+ Sbjct: 78 PEADVMEAWAGLGYYSRARNLHKAAKALAAG---GFPKTAAELLELPGFGPYTSRAVASI 134 Query: 129 SLGKHFPILDGNVKRVLARCYA--VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 + G+ +LDGNV RVL+R Y + W GK + L +S++++ + NQ +M+ Sbjct: 135 AFGEKVGVLDGNVIRVLSRRYGLKLEWWNGKG--RDHLQKISDELSLLGQADVVNQGLME 192 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA +CT K C LCP C++ N P KKP++ L+ +V L Sbjct: 193 LGATVCTPQKVMCMLCPWAATCVSREKNLVEKLPLKKPRKESEVWVWKPLVAIKNQKVAL 252 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWL 306 Q + G FP E S+ + + A N+T H DI Sbjct: 253 VQNDYAPFLKGQMIFP---GEISMEKNKPKAYDAKHNIT------------HHDIFIQIT 297 Query: 307 PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 S TG N W +L + V ++ ++++L +++ Sbjct: 298 EKKSLTG----KNLQWVDLKELKKVNPSSLLQKVLHKVK 332 >UniRef50_A6FJZ0 A/G-specific adenine glycosylase n=7 Tax=Alphaproteobacteria RepID=A6FJZ0_9RHOB Length = 352 Score = 185 bits (470), Expect = 2e-45, Method: Compositional matrix adjust. Identities = 130/305 (42%), Positives = 166/305 (54%), Gaps = 17/305 (5%) Query: 8 AQVLDWYDKYGRKTLPWQIDKT---------PYKVWLSEVMLQQTQVATVIPYFERFMAR 58 A +L WYD + R LPW++ PY+VWLSEVMLQQT VA V YFERF A Sbjct: 8 ADLLAWYDAHARD-LPWRVAPAARLAGTRPDPYRVWLSEVMLQQTTVAAVRAYFERFTAL 66 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 +PTV LA AP +V+ W GLGYYARARNL K A+ VA HGG+FPET E + ALPG+G Sbjct: 67 WPTVEALAAAPDAQVMGEWAGLGYYARARNLLKCARVVAEDHGGRFPETREALMALPGIG 126 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAV-SGWPGKKEVENKLWSLSEQVTPAVGV 177 T AI +++ ++DGNV+RV+AR + + + P K +L + +TP Sbjct: 127 PYTGAAIAAIAFDAPEVVVDGNVERVMARVHDIRTPLPAAKP---ELVRAAAALTPKRRP 183 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 QA+MDLGA +CT P C +CP C A A P K PK+ P R G + Sbjct: 184 GCHAQAVMDLGATVCTPRAPACGICPWWGACAARKAGVQAELPHKTPKKPKPLRFGIAYV 243 Query: 238 LQHEDEV-LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + D LL RP GL GG+ +P ES + I AD RHTF+H Sbjct: 244 ARRADGAWLLETRPDKGLLGGMLGWPGSDWTESAP--IPAPPIEADWTELGEEARHTFTH 301 Query: 297 FHLDI 301 FHL + Sbjct: 302 FHLRL 306 >UniRef50_A9T1M0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T1M0_PHYPA Length = 627 Score = 184 bits (468), Expect = 3e-45, Method: Compositional matrix adjust. Identities = 123/373 (32%), Positives = 181/373 (48%), Gaps = 64/373 (17%) Query: 31 YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD----------------EVL 74 Y VW+SE+MLQQT+VATVI Y++R+M +PTV LA A + EV Sbjct: 234 YAVWVSEMMLQQTRVATVISYYQRWMRTWPTVYALAQATQEVSSSAFIPNSSVQSCLEVN 293 Query: 75 HLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHF 134 +W GLGYY RAR L + A+++ GG+FP T EE+ +PG+G TAGAI S++ + Sbjct: 294 SVWAGLGYYRRARFLLEGAKKIVEEFGGEFPRTAEELQKVPGIGTYTAGAIASIAFKQVV 353 Query: 135 PILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTR 194 P++DGNV RVL R A+S P W+L+ Q+ FNQA+M+LGA ICT Sbjct: 354 PVVDGNVIRVLCRLRAISLNPKASTTVKLFWALASQLVEEYRPGDFNQALMELGATICTP 413 Query: 195 SKPKCSLCPLQNGCIA--------AANNSWALYPGKKPKQTLPERTGYFLLLQHEDE--- 243 + P C+LC + + C A S +P K PK E +++ E + Sbjct: 414 TSPSCALCSVSSQCAALKLVHRVEKNVTSVTCFPAKVPKSAPREEYVAVCIIELESQTKM 473 Query: 244 ------VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL------------- 284 +LL QRPP GL GL+ FP A ++ R + DN Sbjct: 474 SKAASSLLLVQRPPKGLLAGLWEFPS-APLDNPNSTEKSRLMEMDNYLCDTLGFKIGSGG 532 Query: 285 ------TQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGN---------ALWYNLAQPP 329 + + H FSH L + W+ +S TG +++ + W +L Sbjct: 533 YTVMKREEFGTYVHVFSHIRLHMFIQWMLLS--TGEIEDKSTPKMQRSVVTKWVDLDSMN 590 Query: 330 SVGLAAPVERLLQ 342 S+GL + V+++ + Sbjct: 591 SLGLTSGVKKVYE 603 >UniRef50_B7A8Y1 HhH-GPD family protein n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A8Y1_THEAQ Length = 333 Score = 184 bits (467), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 115/294 (39%), Positives = 161/294 (54%), Gaps = 23/294 (7%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WY K+ R LPW+ +K PY++ +SEV+LQQTQV IPY+ RF+ RFPT+ L AP Sbjct: 8 LLAWYRKHAR-PLPWRGEKDPYRILVSEVLLQQTQVRQAIPYYRRFLQRFPTLKALGEAP 66 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 L+EVL +W G GYY RA +LH+ AQQV L P++F ++ LPG+G TA A+ S++ Sbjct: 67 LEEVLRVWQGAGYYRRAVHLHRLAQQVEAL-----PQSFAQLKGLPGLGPYTAAAVASMA 121 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL-SEQVTPAVGVERFNQAMMDLG 188 G+ +DGNV+RVLAR +A+ G K ++ SL E+ P +NQA+M+LG Sbjct: 122 FGERVAAVDGNVRRVLARLFALEG-ASPKALQGLAQSLMPEEAHPG----EWNQALMELG 176 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 A +C KP C CPL + C + YP + ++ ER +LL + L Sbjct: 177 ATVCLPRKPLCGACPLASRC--RGKEAPERYPLPQRRKVREERLAALVLLGRKGVYLERL 234 Query: 249 RPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA-FRHTFSHFHLDI 301 G + GLY P F EE L R A +L RH +H L + Sbjct: 235 ---EGRFQGLYGVPLFPAEE-----LPVRAEALGVAPRLLGEVRHALTHRRLLV 280 >UniRef50_C1F8W0 A/G-specific adenine glycosylase, putative n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F8W0_ACIC5 Length = 354 Score = 183 bits (465), Expect = 7e-45, Method: Compositional matrix adjust. Identities = 101/254 (39%), Positives = 147/254 (57%), Gaps = 13/254 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F + WY + R LPW+ + PY +W+SE+MLQQT+VA V+ Y++RFM +FPT Sbjct: 4 EIASFQRDISAWYRQNARD-LPWRRTRDPYAIWVSEIMLQQTRVAAVMEYYQRFMGQFPT 62 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA+AP + VL LW+GLGYY RAR +H AA V HGGK P T ++ LPG+G T Sbjct: 63 IEALASAPEESVLALWSGLGYYRRARMMHHAAHIVVAEHGGKMPATAAQLRKLPGIGDYT 122 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCY---AVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 + A+ S+S + P++DGNV+RVL R AV G P + + L + ++++ Sbjct: 123 SAAVASISFDEPVPVIDGNVERVLLRLRGEPAVKGHPDAPGLSD-LKAAAQELLDTEQPG 181 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FNQAMM+LGA +C P C+ CP++ C + P KK + + Y L+ Sbjct: 182 DFNQAMMELGATVCLPRAPLCAECPVRAYCRTQGEHETG--PAKKMRSV---QVSYALIR 236 Query: 239 QH---EDEVLLAQR 249 Q +LLAQR Sbjct: 237 QAPGPRTAILLAQR 250 >UniRef50_D1BPM8 A/G-specific adenine glycosylase n=3 Tax=Veillonella RepID=D1BPM8_VEIPT Length = 365 Score = 183 bits (465), Expect = 7e-45, Method: Compositional matrix adjust. Identities = 96/254 (37%), Positives = 144/254 (56%), Gaps = 1/254 (0%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 Q+L WYD + R+ LPW+ PYK+W+SEVM QQT++ + PY++ +M FPT+ DLA A Sbjct: 12 QLLAWYDVHKRE-LPWRDCGDPYKIWVSEVMSQQTRIEAMKPYYDNWMRLFPTLEDLAKA 70 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 DEV+H W GLGYY+RARNL + V +GG P + + +L GVG TAGA+LS+ Sbjct: 71 SEDEVVHAWQGLGYYSRARNLRLGVKDVVENYGGIVPHDRKTMESLKGVGSYTAGAVLSM 130 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 + + +DGNV R+ AR Y + + + + ++ E+ P FNQA+MD G Sbjct: 131 AYNEPEAAVDGNVLRIYARLYRIFDDILSTKGKKTITAIVEETLPHDRPGDFNQALMDFG 190 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 + +C P+C CP+ N C A + P + K + E + +L ++ LL + Sbjct: 191 SAVCIPKTPRCGECPIVNMCAAYQHEDTDKLPVRIKKTKVVEVPLFVGILNYKGYYLLHK 250 Query: 249 RPPSGLWGGLYCFP 262 RP GL ++ FP Sbjct: 251 RPNRGLLRSMWEFP 264 >UniRef50_C1XGJ6 A/G-specific DNA-adenine glycosylase n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XGJ6_MEIRU Length = 336 Score = 183 bits (465), Expect = 8e-45, Method: Compositional matrix adjust. Identities = 122/308 (39%), Positives = 171/308 (55%), Gaps = 21/308 (6%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + +S + +L WY ++ R+ LPW+ + PY+V LSEV+LQQT+V IPY+ RF+ RFP Sbjct: 6 LMSSDLHSALLTWYQRHQRR-LPWRGEPDPYRVLLSEVLLQQTRVEQAIPYYHRFLQRFP 64 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A + VL +W G GYY RARNLH+ AQQV GG P++ + ALPG+G Sbjct: 65 TLEALAQAEQEAVLQVWQGCGYYTRARNLHRLAQQVVAA-GGVLPQSARGLRALPGIGPY 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGW--PGKKEVENK----LWSLSEQVTPA 174 TA A+ S++ G+ +DGNV+RVL+R A W P K+V+ L +L +Q Sbjct: 124 TAAAVASIAFGEPAAAVDGNVRRVLSRLLA---WEHPTPKQVQEAADALLSALVQQKDAR 180 Query: 175 VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK-PKQTLPERTG 233 G +NQA+M+LGA +CT P C CP+ C A S A YP + KQ E Sbjct: 181 PG--DWNQALMELGATVCTPQNPGCGGCPVAAFCQGKA--SPAHYPASRVRKQKSLELVA 236 Query: 234 YFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 L+LQ + V L QR + GGL+ P +L + LA + D+ + RH Sbjct: 237 --LVLQGPEGVYLEQR-QGPVLGGLWGVPMEEGPGALERLLA--RFGLDSAELVGPVRHE 291 Query: 294 FSHFHLDI 301 F+H L I Sbjct: 292 FTHRKLHI 299 >UniRef50_Q7MUV2 A/G-specific adenine glycosylase n=2 Tax=Porphyromonas gingivalis RepID=Q7MUV2_PORGI Length = 407 Score = 182 bits (463), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 111/326 (34%), Positives = 169/326 (51%), Gaps = 24/326 (7%) Query: 12 DWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD 71 +WYD ++ LPW+ PY++W+SEV+LQQT+V Y+ RF+ FP V L+ A D Sbjct: 58 EWYDA-NKRDLPWRQTDDPYRIWISEVILQQTRVEQGRDYYHRFIECFPDVHSLSLASED 116 Query: 72 EVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLG 131 EVL W GLGYY+RARNLH+AA+ + + GG P T +E+ LPG+G TA A+LS + Sbjct: 117 EVLKQWEGLGYYSRARNLHRAARMIVSDFGGCIPRTRQEILRLPGIGDYTAAAVLSFAYD 176 Query: 132 KHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL-SEQVTPAVGVERFNQAMMDLGAM 190 F +DGN+ RV++R + P KL+S ++ + R NQA+M+ GA+ Sbjct: 177 LPFAAVDGNIFRVISRLMNLDT-PIDTPAGKKLFSFWADALLDREAPARHNQAIMEFGAL 235 Query: 191 ICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE----VLL 246 CT + P C LCP++ C+A P KK + R YF+ ++ + Sbjct: 236 HCTPTSPSCLLCPVRRFCMADTAGCVDALPVKKGGLRITNRYLYFIYIRVITPTGVYTYI 295 Query: 247 AQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQI---------AADNLTQLTAFRHTFS 295 +RP +W GLY FP + +D L L ++ + D+L F+H + Sbjct: 296 RRRPSGDIWQGLYEFPCVELSDHAVLETLLLSPELGNLLRSISGSMDSLP-FKTFKHQLT 354 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNAL 321 H +L W+ + T +D+ L Sbjct: 355 HRNL-----WIHGYTLTARLDKAPDL 375 >UniRef50_C8P1Q1 A/G-specific adenine glycosylase n=1 Tax=Erysipelothrix rhusiopathiae ATCC 19414 RepID=C8P1Q1_ERYRH Length = 327 Score = 182 bits (463), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 102/299 (34%), Positives = 154/299 (51%), Gaps = 17/299 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+ +++ WY + ++ LP++ K PYK+W+SE+M QQTQ+AT+IPY++R++ R+P V Sbjct: 2 NFTERLMGWYHE-NKRVLPFRSQKDPYKIWVSEIMAQQTQIATMIPYYDRWIKRYPDVET 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LANA +DEVL +W GLGYY RARNLH AQ V + G P +E+ +PG+G T+ A Sbjct: 61 LANAEIDEVLKMWEGLGYYRRARNLHAGAQYVMEHYEGTLPADKKELMKIPGIGDYTSSA 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE--RFNQ 182 I S++ +DGNVKRV+AR + + + ++ G + F Q Sbjct: 121 IASIAFSLPEIAIDGNVKRVMARYLNYTENVNTRACHKYFETFLKKELLLNGADPSDFTQ 180 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA++CT S C CP + C P + +P L+ Q + Sbjct: 181 ALMELGALVCTPSNTNCEGCPFKEMCACYRGECIGTIPFIPKAKPVPIYEKSVLIYQKDG 240 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +L++ GL GL PQ A D + L +H FSH +I Sbjct: 241 RILISDDHEDGLMEGLLRLPQI-----------DGIYATDPILTL---KHKFSHLQWNI 285 >UniRef50_A7RIG3 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RIG3_NEMVE Length = 470 Score = 182 bits (461), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 122/353 (34%), Positives = 175/353 (49%), Gaps = 60/353 (16%) Query: 9 QVLDWYDKYGRKTLPWQIDKTP-------YKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +L WYD +++LPW+ T Y VW+SE+MLQQTQVATV+ Y+ R+M +P+ Sbjct: 37 NLLRWYD-INKRSLPWRAYATEQDANIRAYAVWVSEIMLQQTQVATVVDYYNRWMKNWPS 95 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV-AALPGVGRS 120 + LA A L+EV W+GLGYY+RAR LH+AA +V GK P ++ LPGVG Sbjct: 96 LEALARASLEEVNECWSGLGYYSRARRLHEAAIKVVNELDGKIPTNAAKLQKELPGVGLY 155 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ G+ ++DGNV RVL+R + + WSL+ ++ P F Sbjct: 156 TAGAIASIAFGEATGVVDGNVIRVLSRLRRIGADMTSNTTMDHFWSLAHRLVPNDRPGDF 215 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGC---------------------IAAANNS---- 215 NQAMM+ GA +CT P+CS C L++ C I+A + Sbjct: 216 NQAMMEFGATLCTPKTPQCSKCVLRSSCQAHSQVEDFKDKFTKRITGERISACDQDIEDC 275 Query: 216 ---------W------ALYPGKKPKQTLPERTGYFLLLQ---HEDE---VLLAQRPPSGL 254 W YP K K+ E L++Q +ED+ LL QRP SGL Sbjct: 276 SLCLPPSEPWNPEIGVCNYPMKPKKKEAREEVFTVLIVQEECNEDDSGNFLLVQRPESGL 335 Query: 255 WGGLYCFPQFADEESLRQWLA--QRQIAADNLTQ---LTAFRHTFSHFHLDIV 302 GL+ FP E+ ++ ++ +N+ + + H FSH H V Sbjct: 336 LAGLWEFPNLEKEKINEDDVSALAKEYGLENVKKRNNVGELIHKFSHRHHKYV 388 >UniRef50_A6Q673 A/G-specific adenine glycosylase n=2 Tax=Epsilonproteobacteria RepID=A6Q673_SULNB Length = 326 Score = 181 bits (460), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 102/257 (39%), Positives = 147/257 (57%), Gaps = 17/257 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP-YFERFMARFPTV 62 + ++ WY+ YGR LPW+ PY ++LSEVMLQQTQV TV+ Y+ F+ F ++ Sbjct: 12 KEIHQKIRTWYEAYGRHDLPWRSTNNPYHIYLSEVMLQQTQVKTVLERYYFPFLQAFSSL 71 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 L NAPLD+VL W GLGYY RA+NLH+ A V L P +E+ LPG+G++TA Sbjct: 72 EALGNAPLDDVLKQWEGLGYYNRAKNLHRTAGLVNEL-----PPEIDELVKLPGIGKNTA 126 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI + + + P+++ NVKR+L R + + + E KLW + +NQ Sbjct: 127 HAIAAFAFHQPVPVMEANVKRILCRMHRL-----RTPNEKKLWKFAYASVDKEDPFNYNQ 181 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMMD+GA +C P+C+ CPL+N C N + YP KK K+T+P R ++ ++D Sbjct: 182 AMMDIGATLCLPKNPQCNRCPLENICKGKNNPEY--YPLKK-KKTVPAREENIVIYLYDD 238 Query: 243 EVLLAQRPPS---GLWG 256 + L QR GLWG Sbjct: 239 RLSLQQRSGKFLHGLWG 255 >UniRef50_Q254A0 Adenine/guanine glycosylase n=5 Tax=Chlamydophila RepID=Q254A0_CHLFF Length = 369 Score = 181 bits (459), Expect = 4e-44, Method: Compositional matrix adjust. Identities = 99/314 (31%), Positives = 165/314 (52%), Gaps = 16/314 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +A +F + L + +++ PW+ D +PY VW+SEVMLQQT+ V+ YF +M RFPT Sbjct: 9 RAKKFPVEKLKQWFIDNKRSFPWRDDPSPYNVWVSEVMLQQTRAEVVVKYFLEWMKRFPT 68 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA A +EV+ W GLGYY R RNL A+ V GGK P+ ++ + G+G T Sbjct: 69 IESLATANEEEVIKAWEGLGYYTRVRNLLLGARMVMKDFGGKLPDDPLDLMQIKGLGPYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AIL+ + + +DGNV RV++R + + + + ++ ++ + P + Sbjct: 129 VHAILAFAFKRRTAAVDGNVLRVISRVFLIDASIDLESTKTWIFRITLSILPVKDPQVVA 188 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 +A+++LGA +C RS PKC +CPL + C A P + ++ + + ++ +E Sbjct: 189 EALIELGACVCKRS-PKCEICPLNSVCGAFKEKKQKSLPIRHARKKVVSLFRWVAIILYE 247 Query: 242 DEVLLAQRPPSGLWGGLYCFP--------QFADEESLRQ----WLAQRQIAADNLTQLTA 289 D ++L QR P + GLY FP AD +L Q ++ I L + Sbjct: 248 DFIVLEQRKPEEMMAGLYEFPYIEVGSLDDLADINALVQEMECYVGSSLIFCGELEEQ-- 305 Query: 290 FRHTFSHFHLDIVP 303 RH+F+H+ + ++P Sbjct: 306 -RHSFTHYKVRLIP 318 >UniRef50_Q5HN66 A/G-specific adenine glycosylase n=5 Tax=Staphylococcus epidermidis RepID=Q5HN66_STAEQ Length = 347 Score = 181 bits (458), Expect = 4e-44, Method: Compositional matrix adjust. Identities = 114/358 (31%), Positives = 181/358 (50%), Gaps = 38/358 (10%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F + DW+ K ++ +PW+ PY +WLSEVMLQQTQV TVI Y+ RF+ RFPT+ Sbjct: 5 NSFKKDIEDWFHK-NQRDMPWRETTNPYYIWLSEVMLQQTQVNTVIDYYYRFIHRFPTIQ 63 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L+ A DEVL W GLGYY+RARN H A ++V + G+ P E L GVG T Sbjct: 64 SLSEANEDEVLKYWEGLGYYSRARNFHTAVKEVNNNYDGEVPYDPESFKKLKGVGPYTQA 123 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVERFN 181 A++S++ +DGNV RV +R ++ ++++ + +++ P V FN Sbjct: 124 AVMSIAFNHPLATVDGNVFRVWSR---LNNDYRDIKLQSTRKAFEQELHPYVLKDAGTFN 180 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKP---KQTLPERTGYFLLL 238 QAMM+LGA++CT P C CP+Q C A + P K K+T+ ++ FL+ Sbjct: 181 QAMMELGALVCTPKSPLCLFCPIQEHCEAFHMGTTQELPVKTKSLNKKTIEQKV--FLIR 238 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ--------LTAF 290 + LL +R L G++ FP +R+ + +D+L + + Sbjct: 239 NDNGQYLLEKR-KEKLLNGMWQFP-------MREQTNANDVISDDLGKSIETINEPVFKL 290 Query: 291 RHTFSHFHLDIVPMWLPVSSFTGCMD-EGNAL-----WYNLAQPPSVGLAAPVERLLQ 342 +H F+H +I V + T ++ + N L W+NL P++++ + Sbjct: 291 KHQFTHLTWEI-----KVYNVTAPLNIKENDLPKQMTWFNLDDREQYTFPVPMDKIYK 343 >UniRef50_A1W1M2 A/G-specific adenine glycosylase n=13 Tax=Campylobacter jejuni RepID=A1W1M2_CAMJJ Length = 339 Score = 180 bits (457), Expect = 6e-44, Method: Compositional matrix adjust. Identities = 107/314 (34%), Positives = 167/314 (53%), Gaps = 33/314 (10%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ-------------IDKTPYKVWLSEVMLQQTQVATV 48 + + +L WY+K GRKTLPW+ ID+ Y V++SE+MLQQTQV +V Sbjct: 5 ELEKLQENLLLWYEKNGRKTLPWRNLQSQNCDESLKHIDRA-YGVYISEIMLQQTQVKSV 63 Query: 49 IPYFER-FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 + F F+ +FPT+ LANA DE+L W GLGYY RARNL KAA + K P+ Sbjct: 64 LERFYFPFLQKFPTLESLANANEDELLKAWQGLGYYTRARNLKKAALECVDKFEAKLPKE 123 Query: 108 FEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 E++ L G+G TAGAI + +DGN++RVL+R +A+ P KE+E + Sbjct: 124 VEDLKKLSGIGAYTAGAIACFGYDQKVSFVDGNIRRVLSRLFALEN-PSMKELEKR---- 178 Query: 168 SEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT 227 ++++ NQA++D+GA++C KC +CPL + C + LYP + K+ Sbjct: 179 AKELLNLNHAFDHNQALLDIGALVCVSKNAKCGICPLYDFCQGKFHTE--LYP--RAKKI 234 Query: 228 LPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL 287 L E L L ++ + L+ G+Y FP F + E +++ D + + Sbjct: 235 LYESVNLNLFLFEFNKKFAIGQSQDKLYKGMYNFPFFKEGEY--------KLSKD-MGFV 285 Query: 288 TAFRHTFSHFHLDI 301 F+H+++ + L+I Sbjct: 286 GEFKHSYTKYKLNI 299 >UniRef50_C7JG78 DNA glycosylase A/G-specific MutY n=12 Tax=Alphaproteobacteria RepID=C7JG78_ACEP3 Length = 357 Score = 179 bits (454), Expect = 1e-43, Method: Compositional matrix adjust. Identities = 117/305 (38%), Positives = 162/305 (53%), Gaps = 18/305 (5%) Query: 10 VLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +L WYD++ R+TLPW++ PY+VWLSE+MLQQT V V PY+ RF +FPTV L Sbjct: 9 LLHWYDRH-RRTLPWRVVGQSHPDPYRVWLSEIMLQQTTVKAVAPYYLRFTEKFPTVQAL 67 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A ++VL W GLGYY+RARNLH AQ V L G FP+ + + LPG+G TA A+ Sbjct: 68 ASADREDVLAAWAGLGYYSRARNLHACAQAVVAL--GGFPQDVQGLRVLPGIGPYTAAAV 125 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSG--WPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 +++ G +DGNV+RV AR +A++ P +K++ +L+ F QA Sbjct: 126 AAIAFGVPVVPVDGNVERVTARLFAITEPLPPARKKLAQLAITLNADREAQERPSDFAQA 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-ED 242 + DLG+ +C+ P C LCP Q C A P K PK P R G L Q Sbjct: 186 LFDLGSSLCSPRAPACGLCPWQGECAGHKQGIAAELPRKLPKAERPVRYGAAFLAQDAAG 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQR--QIA--ADNLTQLTAFRHTFSHF- 297 ++LL +R GL + P E L W Q+A A +H F+HF Sbjct: 246 QILLRKRAEKGLLAAMTELP--GTEWRLENWSEAEILQVAPFAAGWKLAGRVKHVFTHFT 303 Query: 298 -HLDI 301 +LD+ Sbjct: 304 LYLDV 308 >UniRef50_B0D6Q9 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0D6Q9_LACBS Length = 562 Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 108/311 (34%), Positives = 161/311 (51%), Gaps = 58/311 (18%) Query: 10 VLDWYDK-YGRKTLPWQIDKTP-----------YKVWLSEVMLQQTQVATVIPYFERFMA 57 +L WY + + +PW+ P Y+VW+SE+MLQQTQVATVIPY+ R+M Sbjct: 92 LLRWYRTVHDTRRMPWRKPYNPSCGAEERAQRAYEVWVSEIMLQQTQVATVIPYYTRWME 151 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA-LPG 116 ++PT+ LA+A +D+V LW GLGYY+RA L AQ+ +GG+ P+ +E+ A +PG Sbjct: 152 KYPTIRHLASANVDQVNALWKGLGYYSRASRLLAGAQKAIQKYGGRLPDNAKEMEANIPG 211 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL--------- 167 +GR +AGAI S++ G+ P+LDGNV R+L+R A+ P K + LW Sbjct: 212 IGRYSAGAICSIAYGEKVPVLDGNVHRLLSRVLALHAPPKAKSTLDILWDAATVMVQIEE 271 Query: 168 SEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN----------NSWA 217 ++ +P NQA+++LG+ +C P+C CP+QN C A N N+ Sbjct: 272 ADTTSPPQYAGDINQALIELGSTVCKVRDPECGTCPIQNWCSAYQNSEPKESAKEDNTHM 331 Query: 218 L-------------------YPGKKPKQTLPERTGYFLL-----LQHEDE--VLLAQRPP 251 + YP K ++ E + +Q+ DE L +RP Sbjct: 332 VDIEDICGLCEPLTGAGVMSYPMKTERKKAREELDIVNVVEWRPIQNSDERWFLFRRRPS 391 Query: 252 SGLWGGLYCFP 262 +GL GLY FP Sbjct: 392 TGLLAGLYEFP 402 >UniRef50_Q5SH32 A/G-specific adenine glycosylase (MutY) n=2 Tax=Thermus thermophilus RepID=Q5SH32_THET8 Length = 325 Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 111/294 (37%), Positives = 163/294 (55%), Gaps = 23/294 (7%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WY + R LPW+ +K PY+V +SEV+LQQT+V +PY+ RF+ RFPT+ LA A Sbjct: 8 LLAWYRENAR-PLPWRGEKDPYRVLVSEVLLQQTRVEQALPYYRRFLERFPTLKALAAAS 66 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 L+EVL +W G GYY RA +LH+ A+ V L P +F E+ LPG+G TA A+ S++ Sbjct: 67 LEEVLRVWQGAGYYRRAEHLHRLARSVEEL-----PPSFAELRGLPGLGPYTAAAVASIA 121 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFNQAMMDLG 188 G+ +DGNV+RVL+R +A P +KE L++L++ + P V +NQA+M+LG Sbjct: 122 FGERVAAVDGNVRRVLSRLFARES-PKEKE----LFALAQGLLPEGVDPGVWNQALMELG 176 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 A +C +P+C CPL G + YP + ++ ER +LL + L Sbjct: 177 ATVCLPKRPRCGACPL--GAFCRGKEAPGRYPAPRKRRAKEERLVALVLLGRKGVHLERL 234 Query: 249 RPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ-LTAFRHTFSHFHLDI 301 G + GLY P F EE L R+ A ++ L RH +H L + Sbjct: 235 ---EGRFQGLYGVPLFPPEE-----LPGREAAFGVRSRPLGEVRHALTHRRLRV 280 >UniRef50_C5YTP8 Putative uncharacterized protein Sb08g006760 n=1 Tax=Sorghum bicolor RepID=C5YTP8_SORBI Length = 494 Score = 179 bits (453), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 120/338 (35%), Positives = 177/338 (52%), Gaps = 45/338 (13%) Query: 6 FSAQVLDWYDKYGRKTLPWQI-----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 AQ+L WYD + R+ LPW+ ++ Y VW+SEVMLQQT+V V+ Y+ER+MAR+P Sbjct: 64 LRAQLLRWYDAH-RRDLPWRCASGGEEERAYAVWVSEVMLQQTRVPVVVGYYERWMARWP 122 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA A +EV +W GLGYY RAR L + A+Q+ + G FP T + + G+G Sbjct: 123 TVRSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQI--IEKGVFPRTASALREVRGIGDY 180 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ + P++DGNV RV++R YA++ P + + W L+ Q+ + F Sbjct: 181 TAGAIASIAFNEVVPVVDGNVVRVISRLYAIADNPKESSTVKRFWELAGQMVDPLRPGDF 240 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA----NNSWA------LYPGKKPKQTLPE 230 NQAMM+LGA +C+++KP CS C + + C A A N S + P KP++ Sbjct: 241 NQAMMELGATLCSKTKPGCSECAVSSHCQALALSHENASVQVTDFPRVVPKAKPRRDFAA 300 Query: 231 RTGYFLLLQHEDEV----------LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA 280 + E+E LL +RP GL GL+ FP +E + +R+ Sbjct: 301 VCVVQIAQGLEEEAPDAKDNSNLFLLIKRPEEGLLAGLWEFPLILVDEGKTDFQNRRKAM 360 Query: 281 ADNLTQLTAFR-----------------HTFSHFHLDI 301 L++L +F H FSH L + Sbjct: 361 DKYLSKLLSFDVGQRSDVIFREDVGEHVHIFSHIRLTM 398 >UniRef50_Q1J2G1 A/G-specific adenine glycosylase n=3 Tax=Deinococcus RepID=Q1J2G1_DEIGD Length = 343 Score = 178 bits (451), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 105/250 (42%), Positives = 141/250 (56%), Gaps = 13/250 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 S +L W+D+ GR LPW++ + PY+VW+SEV+LQQTQVA YFERF+ F Sbjct: 7 SSLRTVLLAWFDRAGR-ALPWRVGPEGRRDPYRVWVSEVLLQQTQVARGRVYFERFLEAF 65 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PTV LA+AP++ VL W G GYYARARNLH+AA +A P T++ ALPGVG Sbjct: 66 PTVEALADAPIEAVLKAWEGCGYYARARNLHRAAGVMAR---EGVPTTYDGWRALPGVGP 122 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ SL+ G+ + DGNV+RVLAR + P + + L + P Sbjct: 123 YTAAAVASLACGEARAVNDGNVRRVLARLHG-ERQPTAAWAQARADELLDPARPGA---- 177 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 N+A+MDLGA +CT P+C CPL+ C A A +P K + + E LLL Sbjct: 178 CNEALMDLGATVCTPKAPQCGECPLRRWCAAFQLGDPAAFPAPKVRAAVREVRAVALLLG 237 Query: 240 HEDEVLLAQR 249 E +L +R Sbjct: 238 DAQEAVLERR 247 >UniRef50_B2UNR9 HhH-GPD family protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNR9_AKKM8 Length = 341 Score = 177 bits (450), Expect = 4e-43, Method: Compositional matrix adjust. Identities = 114/341 (33%), Positives = 171/341 (50%), Gaps = 13/341 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F +++W+ + GR PW+ P+ + +SE+MLQQT + TV+ ++R+M +FPT L Sbjct: 13 FRNALVEWFRREGRD-YPWRRTTDPWHILVSELMLQQTTIPTVLGRYDRWMRQFPTPAHL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A L W GLGYY R R+L A+++ GG+FP+ E + LPG+G T+GA+ Sbjct: 72 AAVDEQTALRSWEGLGYYRRVRSLQAIAREIVNEFGGRFPDNAEGLKRLPGIGPYTSGAL 131 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS + K PI+D NV RVLAR S E + LWS +E + FN A+M Sbjct: 132 LSFAFNKAAPIVDANVARVLARIDNYSVPVDSTEGQKYLWSRAESLVDPEHAREFNSAIM 191 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 +LG C+ S P C LCP++ C A + P K PK + + +L VL Sbjct: 192 ELGQTCCSISSPDCLLCPVRPFCSAERPET---LPVKNPKPQVTRVEHHDILYIRGKSVL 248 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 LA+ P G+Y FPQ DE +L +Q + +T +R T H+ P+ Sbjct: 249 LAKCPEGKRHAGMYRFPQREDEHTLSLPHVLKQTYS-----ITRYRVTRYIHHVTDTPLL 303 Query: 306 LPVSSFTGCMDEGNALWYNLAQPPSVGLAAP-VERLLQQLR 345 F +D+ + L +A P L +P + +LL +R Sbjct: 304 REGEEFVP-LDKIHGL--PMASPDRKALNSPTLGKLLDHIR 341 >UniRef50_Q9ZMT8 A/G-SPECIFIC ADENINE GLYCOSYLASE n=21 Tax=Helicobacter RepID=Q9ZMT8_HELPJ Length = 328 Score = 177 bits (449), Expect = 5e-43, Method: Compositional matrix adjust. Identities = 104/299 (34%), Positives = 165/299 (55%), Gaps = 31/299 (10%) Query: 10 VLDWYDKYGRKTLPWQIDK---TPYKVWLSEVMLQQTQVATVIP-YFERFMARFPTVTDL 65 +L WY+++GRK LP++ K PY+V++SEVM QQTQ++TVI ++ F+ FPT+ DL Sbjct: 8 LLKWYEEFGRKDLPFRNLKGINAPYEVYISEVMSQQTQISTVIERFYPPFLKAFPTLKDL 67 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANAPL+EVL LW GLGYY+RA+NL K+A+ H + P ++ + LPG+G TA AI Sbjct: 68 ANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLKLPGIGAYTANAI 127 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE---NKLWSLSEQVTPAVGVERFNQ 182 L + +D NVKRVL R + + K+++ N +L+E NQ Sbjct: 128 LCFGFREKSACVDANVKRVLLRLFGLDPNIHAKDLQIKANDFLNLNESFN-------HNQ 180 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A++DLGA+IC+ KPKC++CP C+ N + KK ++ + E +++Q+ Sbjct: 181 ALIDLGALICS-PKPKCAICPFNPYCLGK--NHLERHTLKKKQEIIQEERYLGVVIQNNQ 237 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 L ++ L+ G++ FP + + L L +H+ + F L++ Sbjct: 238 IAL--EKIEQKLYLGMHHFPNLKENLEFK------------LPFLGTIKHSHTKFKLNL 282 >UniRef50_C5ZWA7 A/G-specific adenine glycosylase n=3 Tax=Helicobacter RepID=C5ZWA7_9HELI Length = 332 Score = 176 bits (447), Expect = 9e-43, Method: Compositional matrix adjust. Identities = 105/263 (39%), Positives = 148/263 (56%), Gaps = 21/263 (7%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTP----YKVWLSEVMLQQTQVATVIP-YFERFMARFPTV 62 + +L WY+K GRK+LPW+ DKT Y+V +SE+MLQQTQV T++ ++ F+ +FPT+ Sbjct: 8 SSILLWYEKKGRKSLPWR-DKTSKNRAYRVLVSEIMLQQTQVKTILERFYFPFLEKFPTL 66 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 L+ A +EVL W GLGYY RARNL K A+ H G P+ E + LPG+GR TA Sbjct: 67 ETLSKAKEEEVLLQWRGLGYYTRARNLLKCAKICCESHKGILPKDIESLQKLPGIGRYTA 126 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI + +D N+KR+L R +A+ P K +E+K ++ P NQ Sbjct: 127 GAIACFGYDRAVSFVDSNIKRILTRFFALQS-PSPKLLESKAKTILNTQEPF----NHNQ 181 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKP---KQTLPERTGYFLLLQ 239 A++D+GA +CT PKC+ CPLQ C N ++ P KK KQTL G F+ Sbjct: 182 ALLDIGATLCTPKNPKCTQCPLQPFCQGKENPTFYT-PTKKTALLKQTL--EIGIFI--- 235 Query: 240 HEDEVLLAQRPPSGLWGGLYCFP 262 +D + + L+ LY FP Sbjct: 236 -QDSKFALTKSQTKLYYNLYNFP 257 >UniRef50_D0J8Q9 A/G-specific adenine glycosylase n=2 Tax=Blattabacterium RepID=D0J8Q9_BLASP Length = 351 Score = 174 bits (440), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 104/311 (33%), Positives = 162/311 (52%), Gaps = 22/311 (7%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 FS ++++WY RK LPW+ K PY V +SE MLQQT+V+ + Y+ F+ +FP + Sbjct: 2 NFSKKIINWYKNNYRK-LPWRETKNPYYVLVSEFMLQQTRVSKTLKYYLDFVKKFPNLEK 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQ-VATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A EVL W GLGYY RAR LH A+Q V G FP+T++E+ G+G T Sbjct: 61 LAYAEEKEVLKEWEGLGYYTRARYLHLFAKQLVKEKPSGFFPKTYKELIKYKGIGPYTGA 120 Query: 124 AILSLSLGKHFPILDGNVKRVLAR----CYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 A+ S+ + P LDGN RV +R + ++ + K + + + P + Sbjct: 121 AVASICFDEVIPALDGNAYRVFSRYLGIYHNITSFRAKNIFRFFILKIMDHNDPGI---- 176 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK-PKQTLPERTGYFLLL 238 FNQA+MDLG+ +CT K KC CP+++ C + N + P K K+++ R Y++ + Sbjct: 177 FNQAVMDLGSTLCTPRKAKCFSCPVKSSCFSLKNGTVYDLPVKNIIKKSVLHRFFYYVFI 236 Query: 239 -QHEDEVLLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAF 290 ++ + + +R +W GLY FP E++L + W R I+ + Q+ Sbjct: 237 WDKKNHLCIQKRNNQDIWKGLYDFPLIESEKNLSISEIKDKIWTRFRIISDSIIYQVI-- 294 Query: 291 RHTFSHFHLDI 301 H +H L I Sbjct: 295 -HKLTHQTLSI 304 >UniRef50_UPI0001746831 A/G-specific adenine glycosylase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746831 Length = 343 Score = 173 bits (438), Expect = 8e-42, Method: Compositional matrix adjust. Identities = 104/255 (40%), Positives = 140/255 (54%), Gaps = 8/255 (3%) Query: 13 WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP--YFERFMARFPTVTDLANAPL 70 W+ GR PW+ PY + +SE+MLQQTQ++TV+ Y+ R+M RFP LA A Sbjct: 19 WFAVSGRD-YPWRRTTDPYAILVSEMMLQQTQISTVLDRGYYARWMERFPNTATLAVADE 77 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 EVL +W GLGYY RAR L + AQ+V + HGG FP T E V ALPGVG TAGA+ S + Sbjct: 78 AEVLRVWEGLGYYRRARFLQQMAQRVESEHGGIFPATLEGVRALPGVGDYTAGAVCSFAH 137 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER-FNQAMMDLGA 189 PI+DGNV RVL+R + + +E +LW+ S + A R FN A+M+LG Sbjct: 138 DAAAPIVDGNVARVLSRVWDDATPVDSREGMARLWTRSRALVEAAQSPRVFNSALMELGQ 197 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 IC S P C CP+Q C+ S + K KQT+ + Q +E +L Q+ Sbjct: 198 TICRVSAPNCGSCPVQPHCLTRDPASLPV----KSKQTVITAVTERVYFQIGEEGVLLQQ 253 Query: 250 PPSGLWGGLYCFPQF 264 GL+ P+ Sbjct: 254 ESGKRRTGLWKLPEL 268 >UniRef50_Q6MF36 Putative A/G-specific adenine glycosylase, mutY n=2 Tax=Parachlamydiaceae RepID=Q6MF36_PARUW Length = 352 Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 110/312 (35%), Positives = 164/312 (52%), Gaps = 16/312 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +S F + L + ++ LPW+ + TPY VW+SEVMLQQTQVA VIPYF R+M FP++ Sbjct: 2 SSSFPNEKLKKWFLSSKRDLPWRQNPTPYAVWISEVMLQQTQVAVVIPYFNRWMQAFPSI 61 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A L+EV+ W GLGYY+RAR+LH+ AQ + G+ P +E+ + G+G T Sbjct: 62 EALAIASLEEVIKAWEGLGYYSRARHLHQGAQYLVEHCQGELPADEKELKKIKGLGPYTI 121 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAILS + + +DGNV RVLAR + + K KL +E + P ++ Sbjct: 122 GAILSFAFHQKKAAVDGNVLRVLARYFQIEEDIAKTSTIKKLRDFAESILPDEESWISSE 181 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL----LL 238 A+++LGA IC+R K C CPL++ C + + K P +T +T Y ++ Sbjct: 182 ALIELGATICSR-KAICQECPLKSNCQSYLHG----LSDKLPIKTKAIKTQYLYRAVAIV 236 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQF-ADEESLRQWLAQRQIAAD-NLT-----QLTAFR 291 + D L+ + + LY FP F +E + Q I NLT L++ Sbjct: 237 KCGDYFLVKKGKKGEIMSDLYEFPFFEVSQEGITFKELQENIKLHWNLTVHLLQSLSSIE 296 Query: 292 HTFSHFHLDIVP 303 H F+ + + P Sbjct: 297 HGFTRYQARLDP 308 >UniRef50_C3XHP3 A/G-specific adenine glycosylase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XHP3_9HELI Length = 356 Score = 171 bits (433), Expect = 4e-41, Method: Compositional matrix adjust. Identities = 103/268 (38%), Positives = 144/268 (53%), Gaps = 13/268 (4%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQI---DKTPYKVWLSEVMLQQTQVATVIP-YFERFMAR 58 A Q+ Q+L+WY GR +LPW+ + PY V++SE+MLQQTQV V YF F+ Sbjct: 22 AKQWQIQLLEWYAIQGRISLPWRNLKGENAPYGVYVSEIMLQQTQVKRVQEHYFAPFLNA 81 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+ LA A LD +L W GLGYY+RARN+ K A H P T +++ LPG+G Sbjct: 82 FPTLESLAKANLDSILKQWEGLGYYSRARNMQKTAIICCEKHNATLPNTRQDLLTLPGIG 141 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T+GAIL + +DGN++RVL R +A+ P +K ++ + L + Sbjct: 142 AYTSGAILCFGFHQSVSFVDGNIRRVLCRIFALRE-PNQKLLDELAFLLLDTKHSF---- 196 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA++DLGAMICT P C +CP+QN C N + LYP K P L Sbjct: 197 DYNQALLDLGAMICTPKSPSCLICPVQNLCNGKVNPT--LYPTPKTSSLTPLTLHLLLYK 254 Query: 239 QHEDEV--LLAQRPPSGLWGGLYCFPQF 264 + + + + GL+ GLY PQ Sbjct: 255 DSQGRIAFVYEKGDKGGLYQGLYNLPQL 282 >UniRef50_B2AMG7 Predicted CDS Pa_5_7310 n=1 Tax=Podospora anserina RepID=B2AMG7_PODAN Length = 582 Score = 170 bits (430), Expect = 8e-41, Method: Compositional matrix adjust. Identities = 102/246 (41%), Positives = 149/246 (60%), Gaps = 27/246 (10%) Query: 8 AQVLDWYDK-YGRKTLPWQ-----------IDKTPYKVWLSEVMLQQTQVATVIPYFERF 55 A +L W+ K + +PW+ + + Y+VW+SE+MLQQT+VATVI Y+ ++ Sbjct: 123 ASLLRWFQKEQTTRLMPWRKPFLTNPSRADLSRRAYEVWISEIMLQQTRVATVIAYWNKW 182 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA-L 114 M ++PT+ DLA A +EV+++WTGLGYY+RAR +H AQ+V T G P+T E + + Sbjct: 183 MTKWPTIEDLAQATEEEVVNMWTGLGYYSRARRIHAGAQKVVTEMQGLLPDTVEGLMKHV 242 Query: 115 PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQ--- 170 PGVGR TAGAI ++ G+ P++DGNV RVL+R + G G K V + LW +++ Sbjct: 243 PGVGRYTAGAISAIVFGEAEPMVDGNVMRVLSRQMGLMGDVKGDKRVVDVLWEAADRLVK 302 Query: 171 -VTPAVGVER-----FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKP 224 V A G E + QA+M+LG+ ICT KP+C CP+ C+A A AL G Sbjct: 303 VVAEADGEEGEKPGLWGQALMELGSTICT-PKPQCGKCPVTESCMAYAEG-LALASGL-- 358 Query: 225 KQTLPE 230 KQ +P+ Sbjct: 359 KQAVPD 364 >UniRef50_Q2QW21 Os12g0211400 protein n=6 Tax=Poaceae RepID=Q2QW21_ORYSJ Length = 474 Score = 169 bits (429), Expect = 9e-41, Method: Compositional matrix adjust. Identities = 119/350 (34%), Positives = 171/350 (48%), Gaps = 54/350 (15%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTP--------------YKVWLSEVMLQQTQVATV 48 A+ A++L WYD R+ LPW+ P Y VW+SEVMLQQT+V V Sbjct: 54 AAAVRAELLRWYDA-NRRDLPWRRAAEPPAGSGSGRGEEQRAYAVWVSEVMLQQTRVPVV 112 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 + Y+ R+MAR+PTV LA A +EV +W GLGYY RAR L + A+Q+ + G+FP T Sbjct: 113 VDYYSRWMARWPTVDSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQI--VEKGEFPCTA 170 Query: 109 EEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS 168 + + G+G TAGAI S++ + P++DGNV RV++R YA+ P + + W L+ Sbjct: 171 STLREVRGIGDYTAGAIASIAFNEVVPVVDGNVVRVISRFYAIPDNPKESSTVKRFWQLT 230 Query: 169 EQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSW----------AL 218 ++ FNQAMM+LGA +C+++KP CS CP+ + C A A +S + Sbjct: 231 GELVDPSRPGDFNQAMMELGATLCSKTKPGCSQCPVSSHCQALALSSQNASVKVTDFPRV 290 Query: 219 YPGKKPKQTLPE----------RTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEE 268 P KP+ G ++ LL +RP GL GL+ FP E Sbjct: 291 VPKAKPRSDFAAVCVVQISQGFGEGIAEAEGKDNLFLLIKRPEEGLLAGLWEFPSVLVNE 350 Query: 269 SLRQWLAQRQIAADNLTQLTAFR-----------------HTFSHFHLDI 301 L +R+ L QL + H FSH L + Sbjct: 351 GKTDTLNRRKEMDKYLKQLLSIDVTRRSSVILREDVGQHVHIFSHIRLTM 400 >UniRef50_Q1ZXP7 Putative uncharacterized protein MYH n=1 Tax=Dictyostelium discoideum RepID=Q1ZXP7_DICDI Length = 574 Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 86/235 (36%), Positives = 137/235 (58%), Gaps = 11/235 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ-----IDKT--PYKVWLSEVMLQQTQVATVIPYFER 54 + + +L WY+K ++ LPW+ +D+ Y+VW+SE+MLQQT+VATVI YF + Sbjct: 95 EIQEIRESMLGWYEK-NKRDLPWRKHDNSLDENVIAYRVWVSEIMLQQTRVATVIEYFNK 153 Query: 55 FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAAL 114 ++ ++PT+ DLA+ ++EV +W+GLGYY RA+NL ++ V K P + + + Sbjct: 154 WIEKWPTINDLASTTIEEVNKVWSGLGYYRRAKNLWLGSKYVVDNFNSKIPSDVKSLLEI 213 Query: 115 PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA 174 G+G TAGAI S++ K P++DGNV RVL+R ++ P W L + + Sbjct: 214 NGIGPYTAGAISSIAFNKPVPLVDGNVIRVLSRVRSIGANPKLSSTVKLFWELGNDLVES 273 Query: 175 VGVE-RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 V FNQ++M+LGA IC+ P+C CP+Q+ C A + P +PK ++ Sbjct: 274 VENPCNFNQSLMELGATICSVQSPQCKQCPIQSNCQAYQQEKQFIKP--EPKNSI 326 >UniRef50_C9RK91 HhH-GPD family protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK91_FIBSS Length = 384 Score = 167 bits (423), Expect = 5e-40, Method: Compositional matrix adjust. Identities = 114/297 (38%), Positives = 151/297 (50%), Gaps = 27/297 (9%) Query: 22 LPWQ---ID--KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHL 76 LPW+ +D + PY VW+SE MLQQTQV+TV YF R+M RFP V LA A EV Sbjct: 33 LPWRPAGLDAPRDPYAVWISETMLQQTQVSTVRDYFTRWMKRFPDVEALAKAEEAEVFKY 92 Query: 77 WTGLGYYARARNLHKAAQQVATLHGG----KFPETFEEVAALPGVGRSTAGAILSLSLGK 132 W GLGYY+RARN+ K A+ VA L K PET +E+ ALPG+G TAGAILSL+ + Sbjct: 93 WQGLGYYSRARNILKTAKIVAALRQAQEPCKMPETRKELEALPGIGAYTAGAILSLAYHQ 152 Query: 133 HFPILDGNVKRVLARCYAVSGWPGKKEVENK-----LWSLSEQVTPAVGVERFNQAMMDL 187 ILDGN+ R+ +R Y + P K NK W + +V + N+A+M+L Sbjct: 153 REAILDGNLVRIFSRLYELDFLPTDKGSANKNCTEIYWEYAREVADSPKAYMHNEALMEL 212 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 G +C P C CPL+ C A +P K K+T G L+++ DE +LA Sbjct: 213 GRTVCKTKSPLCETCPLRGECRAFQECRTTEFPPAK-KRTEKSWHGTVLVVESADEKILA 271 Query: 248 QRPPSGLWGGLYCFPQFADEESLRQWL------AQRQIAADNLTQLT---AFRHTFS 295 P F ES R A+ I AD + + FRH+ + Sbjct: 272 VNGGQKFLDKQLALPHF---ESARHATVALPAKAEDYINADEVKSVEYCGTFRHSIT 325 >UniRef50_B3T7B3 Putative HhH-GPD superfamily base excision DNA repair protein n=1 Tax=uncultured marine microorganism HF4000_APKG3D20 RepID=B3T7B3_9ZZZZ Length = 349 Score = 167 bits (422), Expect = 6e-40, Method: Compositional matrix adjust. Identities = 101/265 (38%), Positives = 143/265 (53%), Gaps = 6/265 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F +L WYD + R+ LPW+ + YK +SE MLQQT+VATV+PYFER++ +FP Sbjct: 16 SAFGENLLAWYDDH-RRDLPWRKQPSLYKTVVSEFMLQQTRVATVLPYFERWLKQFPDFA 74 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A + VL W GLGYY+RARNL K AQQ+ L K P PGVG A Sbjct: 75 ALADASEEAVLKAWEGLGYYSRARNLRKLAQQIEALE--KIPADSAAWEGFPGVGPYVAA 132 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 A+ S+S G ++DGNV RVLAR ++ + + KL +++++ +NQ Sbjct: 133 AVTSISFGTKAAVVDGNVVRVLARMLSIDEQFRDNATAQRKLRPIAQELLHYNRPGDYNQ 192 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C R P CS CP C + YP + K+++ T L + + Sbjct: 193 AVMELGATVCHRRSPLCSDCPALYVCQSGQRGDPESYP-RTAKKSIERITVDRLWITCNN 251 Query: 243 EVLLAQRP-PSGLWGGLYCFPQFAD 266 +LL + P S GL P+ D Sbjct: 252 SLLLREAPADSKRLAGLMELPKLED 276 >UniRef50_B7P4H9 A/G-specific adenine glycosylase muty, putative n=1 Tax=Ixodes scapularis RepID=B7P4H9_IXOSC Length = 345 Score = 166 bits (421), Expect = 9e-40, Method: Compositional matrix adjust. Identities = 122/357 (34%), Positives = 175/357 (49%), Gaps = 52/357 (14%) Query: 14 YDKYGRKTLPW----QIDKTP----YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +D + LPW + D P Y VW+SE+MLQQT+V TVI Y++R+M ++PTV DL Sbjct: 2 FDFGDARQLPWRDIAKADGDPNQKAYAVWVSEIMLQQTRVTTVIEYYKRWMKKWPTVVDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVA-ALPGVGRSTAGA 124 A A ++EVL +W GLGYY RAR LHK AQ+V GG FP T + +A +PGVG TA A Sbjct: 62 ARASIEEVLQVWAGLGYYQRARRLHKGAQKVVRDLGGLFPNTPKHLAREIPGVGCYTAAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S++ G +DGNV RV +R + G E LW+ + + V FNQA+ Sbjct: 122 VASIAFGHRAGAVDGNVARVYSRMRLLGATLGSSPSERALWAAANEAVCPVRPGDFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA +CT + ++ LYP K K+ T +L+ D Sbjct: 182 MELGARVCT----------------LMSWDATQLYPHKAAKKEPRRETHAVSVLRSGDRY 225 Query: 245 LLAQRPPSG-LWGGLYCFP----------QFADEESLRQWLAQRQIAADNLT-------- 285 LL ++ G L GL+ FP + + +E R L QI A L Sbjct: 226 LLLRKTGQGRLLEGLWEFPNRQVDPQGSAKKSQDEERRVVL---QIGAQLLNSAGGKKRA 282 Query: 286 -QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 +L H FSH H + C EG+ W + A+ S G++ ++++L Sbjct: 283 RRLGEVTHLFSHIHATYCVYETTLDE--ECETEGS--WMSAAEVESSGVSTAMKKVL 335 >UniRef50_UPI0001926FFC PREDICTED: similar to mutY homolog, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926FFC Length = 424 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 97/277 (35%), Positives = 146/277 (52%), Gaps = 31/277 (11%) Query: 33 VWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKA 92 VW+SE+MLQQTQVATVI Y+ +M ++P++ LA+A L+EV W+GLGYY+R R LH+A Sbjct: 88 VWVSEIMLQQTQVATVINYYNNWMKKWPSIASLASATLEEVNEAWSGLGYYSRGRRLHEA 147 Query: 93 AQQVATLHGGKFPETFEE-VAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAV 151 A+++ G P++ + V LPG+G TA AI S++ + ++DGNV RVL R + Sbjct: 148 AKKLIE-SGQPMPKSSKSLVKELPGIGPYTASAIASIAFNEVCGVVDGNVIRVLTRIRMI 206 Query: 152 SGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAA 211 K V + +W L+ + FNQ MM+LGA +CT P+CS CPL C++ Sbjct: 207 GADSSSKAVNDFIWELANTIVDEERPGDFNQGMMELGATVCTPKSPQCSQCPLSTLCMSY 266 Query: 212 ANN---------SWALYPGKKPKQTLPERTGYFLL--------------------LQHED 242 SW + + + E TG ++ +Q+ + Sbjct: 267 KKGIFLFVIPLVSWINQICIRQQYLVEEGTGLGVMNFPRKNKNKSPLAEEYDVAVIQNNN 326 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI 279 + LL QRP GL GL+ FP F E+ + + Q+ Sbjct: 327 KFLLVQRPNKGLLAGLWEFPLFLRTENSKNEILSLQV 363 >UniRef50_C4PWX5 A/g-specific adenine dna glycosylase (Ec 3.2.2.-) (Muty homolog)(Mmyh)-related n=1 Tax=Schistosoma mansoni RepID=C4PWX5_SCHMA Length = 302 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 93/234 (39%), Positives = 132/234 (56%), Gaps = 27/234 (11%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ---IDKTPY----------------------KVWLS 36 Q + +L WYD+ ++ LPW+ ++ P VW+S Sbjct: 11 QIEKLRESLLLWYDR-SKRDLPWRRMALNPDPNLRGYAGMFDHSLDFVFHILYMCSVWVS 69 Query: 37 EVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQV 96 EVMLQQTQV TVI Y++R+M ++P+V LA+A LD+V LW+GLGYY+RAR LHK AQ++ Sbjct: 70 EVMLQQTQVKTVIDYYDRWMKKWPSVDQLASASLDDVNSLWSGLGYYSRARLLHKGAQKI 129 Query: 97 ATLHGGKFPETFEEVA-ALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 G FP++ E + ++PGVGR TAGAI S++ + P+LDGNV RVL R + Sbjct: 130 VDEFNGIFPQSAEVLKHSIPGVGRYTAGAIASIAFNQCTPVLDGNVIRVLTRLRQIGSPV 189 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCI 209 LW+L+ ++ FNQA+M+LGA+ CT P C CPL + Sbjct: 190 QLPTTMEYLWNLTTKLVDPNRPGDFNQALMELGAVCCTPKNPDCIKCPLNKANV 243 >UniRef50_A9KT85 A/G-specific adenine glycosylase n=3 Tax=Clostridiales RepID=A9KT85_CLOPH Length = 350 Score = 166 bits (420), Expect = 1e-39, Method: Compositional matrix adjust. Identities = 107/308 (34%), Positives = 159/308 (51%), Gaps = 9/308 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WYD R L W+ + PY VW+SE+MLQQT+V V YF+RF+ PT+ DLA Sbjct: 12 LLHWYDLNAR-ILEWRSNPKPYYVWISEIMLQQTRVEAVKSYFDRFIKELPTIKDLAAVE 70 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D ++ LW GLGYY RARNL KAA V + G+ P EE+ LPG+G T+GAI S++ Sbjct: 71 EDRLMKLWEGLGYYNRARNLKKAAIIVMEQYNGELPANREELKKLPGIGSYTSGAIGSIA 130 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 +DGNV RV+ R K++V+ +L E + P +NQ++M+LGA Sbjct: 131 FQLPVAAVDGNVLRVMKRIAGSFDDITKEKVKKELEEDIEAIIPKDRPGDYNQSLMELGA 190 Query: 190 MICT-RSKPKCSLCPLQNGCIAAANNSWALYP--GKKPKQTLPERTGYFLLLQHEDEVLL 246 +C KP C+ CP+ + C A N P KK ++ + E+T L+++ + Sbjct: 191 TVCLPNGKPLCNQCPVMHLCKAFHNGDELKIPVKPKKKERVIEEKT--VLVMECAGYYAI 248 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT---QLTAFRHTFSHFHLDIVP 303 +R GL GL+ P + S+ Q + A +L +H FSH ++ Sbjct: 249 RKRTEKGLLHGLWELPNIEGKLSINQMEETIKTLAPYAKVERKLGEAKHIFSHKEWHMIG 308 Query: 304 MWLPVSSF 311 ++ V Sbjct: 309 YYIKVERI 316 >UniRef50_B2W3B1 A/G-specific adenine glycosylase n=2 Tax=Pleosporineae RepID=B2W3B1_PYRTR Length = 441 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 86/201 (42%), Positives = 128/201 (63%), Gaps = 21/201 (10%) Query: 26 IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYAR 85 + K Y+VW+SEVMLQQT+V+TVIPYF ++A++PTV DLA A D+VL +W GLGYY+R Sbjct: 24 LSKRAYEVWVSEVMLQQTRVSTVIPYFNNWIAKWPTVQDLAAANHDDVLSVWKGLGYYSR 83 Query: 86 ARNLHKAAQQV--ATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKR 143 A LH+ A+++ A+ G P E+ PG+GR TAGA+ S++ G+ P+LDGNV R Sbjct: 84 ATRLHEGAKKMMSASEDAGTIPSGAAELQEFPGIGRYTAGAVSSIAFGEAEPVLDGNVAR 143 Query: 144 VLARCYAVSGWPGKKEVENKLWSLSEQV--------------TPAVGVERFNQAMMDLGA 189 VL+R + K+ + LW +++Q+ TP + +NQA+M+LG+ Sbjct: 144 VLSRQLGLYVDVKDKKATDILWDMADQLIKYASDFPKTKTSATPGL----WNQALMELGS 199 Query: 190 MICTRSKPKCSLCPLQNGCIA 210 +CT +P+C+ CP+Q C A Sbjct: 200 TVCT-PRPRCNDCPIQATCRA 219 >UniRef50_UPI0000E49729 PREDICTED: hypothetical protein, partial n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49729 Length = 425 Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 108/333 (32%), Positives = 154/333 (46%), Gaps = 65/333 (19%) Query: 31 YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLH 90 Y VW+SE+M QQTQVATVI Y+ ++M ++PT+ L+ A L+EV +W GLGYY+R + L Sbjct: 16 YAVWVSEIMCQQTQVATVIDYYNKWMKKWPTLESLSKASLEEVREVWAGLGYYSRGQRLF 75 Query: 91 KAAQQVATLHGGKFPETFEEV-AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCY 149 + A +V G+ P T E++ LPGVGR TAGAI S+S + ++DGNV RVL+R Sbjct: 76 EGACKVQNELDGQIPGTAEQLRKELPGVGRYTAGAIASISFSEATGVVDGNVIRVLSRLR 135 Query: 150 AVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCI 209 + + V +W L+ + FNQ+MM+LGA +C P+C CP+Q+ C Sbjct: 136 MIGADFTTQNVMTAIWDLANAIVDPDRPGDFNQSMMELGATVCHPKSPQCPSCPVQSHCR 195 Query: 210 AAA-----NNSWALYPGKKPKQTLPER----------TGYFLLL---------------- 238 A A K Q+LPE G L + Sbjct: 196 AIQQMDQLTRDLATKLTKSNGQSLPEEKDIVDIECAADGCSLCMDGPVDANLGVMNYPRK 255 Query: 239 --------------------QHEDEVLLAQRPPSGLWGGLYCFP--QFADEES------- 269 E+E L+ QRP +GL G++ FP + A+E S Sbjct: 256 PKKKPLKQQVIAVCIVERETNDEEEYLIVQRPDTGLLAGMWEFPSIEIAEETSRQKSRNK 315 Query: 270 ----LRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 L+ L + + F H FSH H Sbjct: 316 MDSYLKDTLNMTLKNVKDRKHIIQFVHMFSHIH 348 >UniRef50_A1A5M6 Mutyh protein n=4 Tax=Eukaryota RepID=A1A5M6_RAT Length = 184 Score = 164 bits (415), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 85/173 (49%), Positives = 113/173 (65%), Gaps = 1/173 (0%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQTQVATVI Y+ R+M ++PT+ DLA+A L+EV LW+GLGYY+R R L + A++V Sbjct: 1 MLQQTQVATVIDYYTRWMQKWPTLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVE 60 Query: 99 LHGGKFPETFEEVAAL-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGK 157 GG P T E + L PGVGR TAGAI S++ + ++DGNV RVL R A+ P Sbjct: 61 ELGGHVPRTAETLQQLLPGVGRYTAGAIASIAFDQVTGVVDGNVIRVLCRVRAIGADPTS 120 Query: 158 KEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 V + LW L++Q+ FNQA M+LGA +CT +P CS CP+Q+ C A Sbjct: 121 SFVSHHLWDLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRA 173 >UniRef50_B9L504 Catalytic Domain Of MutyY n=4 Tax=Bacteria RepID=B9L504_THERP Length = 358 Score = 164 bits (414), Expect = 7e-39, Method: Compositional matrix adjust. Identities = 92/223 (41%), Positives = 133/223 (59%), Gaps = 6/223 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q +++DWY + R LPW+ + PY++ +SEVMLQQTQV VIPY+E F+ARFPT Sbjct: 63 QIRAVQRRLVDWYRREAR-DLPWRRTRDPYRILVSEVMLQQTQVERVIPYYEVFLARFPT 121 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A L EV+ +W GLGY RA L +AA+++ GG+FP + LPGVGR T Sbjct: 122 VEALASAALAEVIAVWGGLGYNRRAVYLWRAAREIVERWGGRFPGERRLLERLPGVGRYT 181 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 AGA+ + G+ D N+ RVL R + PG++E++ L+E+V P + Sbjct: 182 AGAVACFAFGERVAFWDTNIARVLRRVFLGPEARPGRRELDE----LAERVLPLDRAYEW 237 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 NQA+M+LGA IC+ +P+C +CPL C + + G + Sbjct: 238 NQALMELGARICSARRPRCEICPLCGLCRSVGTEGESRRRGGR 280 >UniRef50_B3DVQ4 A/G-specific DNA glycosylase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVQ4_METI4 Length = 355 Score = 163 bits (412), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 104/297 (35%), Positives = 147/297 (49%), Gaps = 35/297 (11%) Query: 13 WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDE 72 WY + G K PW+ TPY V +SE MLQQTQ TV+PYF +M +FP LA A E Sbjct: 29 WYGENGFK-YPWRKSPTPYSVVVSEFMLQQTQAETVVPYFLAWMEKFPDWESLAKAQEKE 87 Query: 73 VLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGK 132 VL W GLGYY+RARNLH+ A + G+ P EE+ PG+G TA A+ SL+ G+ Sbjct: 88 VLRAWEGLGYYSRARNLHRIAVMLYHERKGELPSDPEELVKFPGIGPYTANAVASLAFGR 147 Query: 133 HFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQAMMDLGAMI 191 P LDGNV RV+AR ++ KKE ++ + + + FN A+MD G + Sbjct: 148 KIPALDGNVIRVMARLMNINQPIHKKETIRTIFQVVDSLMQGEAEASLFNSALMDFGRAV 207 Query: 192 CTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-------GYFLLLQHEDEV 244 C P+CS C L+ C A K+P + LP RT + + ED+ Sbjct: 208 CKPKYPRCSACVLKEMCKA-----------KQP-ELLPLRTKIEIEEKKEKIAIIREDDR 255 Query: 245 LLAQRPPS--GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 Q+ S + GL+ FP F + + + L + RH+F+ + + Sbjct: 256 FWLQQANSNQNRYRGLWLFPYFDPD------------VMEEMAPLFSLRHSFTRYKI 300 >UniRef50_Q2IJ90 A/G-specific DNA-adenine glycosylase n=5 Tax=Cystobacterineae RepID=Q2IJ90_ANADE Length = 403 Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 104/248 (41%), Positives = 144/248 (58%), Gaps = 12/248 (4%) Query: 9 QVLDWYDKYGRKTLPW---QIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++L W+D G + LPW Q PY+VWL+EVMLQQTQVAT PY+ RF+ R+PT+ L Sbjct: 30 RLLAWWDA-GHRPLPWRQPQRGADPYRVWLAEVMLQQTQVATATPYWLRFVERWPTLEAL 88 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A ++VL W+GLGYYAR RNL AA++ HGG P + E + ALPG G TAGA+ Sbjct: 89 AAARDEDVLAAWSGLGYYARCRNLLAAAREALRRHGG-LPSSHEALRALPGFGPYTAGAV 147 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ P +DGNV RVL+R + V G P + ++ +L+ + NQA+M Sbjct: 148 ASIAFAAPVPAVDGNVTRVLSRLFLVEGDPAARGARARVAALAAALVDRERPGDLNQALM 207 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL---LLQHED 242 +LGA +C R +P C+ CP+ C A A A P P + PE+ L ++ E Sbjct: 208 ELGATVC-RPRPDCARCPVAARCAARAAGRAAELP---PPRRRPEKVRQVLACAVVVREG 263 Query: 243 EVLLAQRP 250 V L +RP Sbjct: 264 RVALVRRP 271 >UniRef50_A4NPF3 A/G-specific adenine glycosylase n=3 Tax=Haemophilus influenzae RepID=A4NPF3_HAEIN Length = 197 Score = 162 bits (410), Expect = 2e-38, Method: Compositional matrix adjust. Identities = 87/186 (46%), Positives = 115/186 (61%), Gaps = 24/186 (12%) Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 FNQAMMD+GAM+CTR+KPKC LCPL C+A N +W +P KKPK+ +PE+T YFL+L Sbjct: 4 FNQAMMDIGAMVCTRTKPKCDLCPLNIDCLAYKNTNWEKFPAKKPKKAMPEKTTYFLILS 63 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +V L QR SGLWGGL+CFPQF D+ SL +LAQ ++ + + +FRHTFSHFHL Sbjct: 64 KNGKVCLEQRENSGLWGGLFCFPQFEDKSSLLHFLAQEKVT--HYQEWPSFRHTFSHFHL 121 Query: 300 DIVPMWLPVSSFTGCMDEGN---------------------ALWYNLAQPPSVGLAAPVE 338 DI P++ + S T C+++ N WY+ P +GLA PV+ Sbjct: 122 DIHPIYAEMES-TLCVEQANLDWRKVMESTKEYQSNLSSAVKYWYDPQNPEPIGLAQPVK 180 Query: 339 RLLQQL 344 LL Q Sbjct: 181 NLLIQF 186 >UniRef50_A8I815 DNA repair glycosylase (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8I815_CHLRE Length = 793 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 100/245 (40%), Positives = 135/245 (55%), Gaps = 37/245 (15%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTP------------------------------YKVW 34 QF ++L WYD R LPW+ +TP Y VW Sbjct: 68 QFRRELLSWYDGNHR-VLPWR--RTPHTQRGAAADGEGGEDGVGPAPAELPPQQFAYWVW 124 Query: 35 LSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQ 94 +SEVMLQQTQVATVIPYF R+++R+PTV+DLA A + V +W GLGYY RAR L + A+ Sbjct: 125 VSEVMLQQTQVATVIPYFRRWVSRWPTVSDLAAADTEAVNSMWAGLGYYRRARYLLEGAK 184 Query: 95 QVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGW 154 VA GG FP + +E+ +PGVG T+ A+ S++ G +DGNV RVL+R A+ G Sbjct: 185 FVAGQLGGTFPTSAQELLKIPGVGPYTSAAVASIAFGSPAAAVDGNVIRVLSRLRALPGD 244 Query: 155 PGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANN 214 P K + +++ +V +NQA+M+LGA +C P CS CP + C AAA Sbjct: 245 PTKLGAAHT--AMAGEVLDGGRPGCYNQALMELGATVCRPVNPDCSACPARPVCRAAAE- 301 Query: 215 SWALY 219 W Y Sbjct: 302 -WTAY 305 >UniRef50_A6DT67 Adenine glycosylase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DT67_9BACT Length = 357 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 92/215 (42%), Positives = 131/215 (60%), Gaps = 12/215 (5%) Query: 22 LPWQID-KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGL 80 LPW+ + + Y+V +SEVMLQQT VATV+P +E F +FP + LANA +++ W GL Sbjct: 23 LPWRSEVRDLYRVLVSEVMLQQTTVATVLPRYESFFEKFPDLASLANADENDLALAWKGL 82 Query: 81 GYYARARNLHKAAQQVATLH--GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILD 138 GYY RA+NL+KA V +H GG+FP+ EE+ +PGVG TA A+ ++ + +D Sbjct: 83 GYYRRAQNLYKA---VTMIHQSGGEFPDGEEELQKVPGVGPYTAAALTAIGRNQLALAVD 139 Query: 139 GNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE-----RFNQAMMDLGAMICT 193 GN++RVL+R + + G K ++ + SL + T A +E +FN+A+MDLG IC Sbjct: 140 GNLQRVLSRYFFIEVEQGPK-LQKAVHSLIQNKTFAKTLELCGPRKFNEALMDLGRAICK 198 Query: 194 RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 PKC CPLQN C A + L P KK KQ + Sbjct: 199 PRNPKCGECPLQNSCEARLKHKVELVPVKKEKQKV 233 >UniRef50_B9KEL2 A/G-specific adenine glycosylase n=3 Tax=Campylobacter RepID=B9KEL2_CAMLR Length = 342 Score = 161 bits (408), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 90/282 (31%), Positives = 147/282 (52%), Gaps = 29/282 (10%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYK-------------------VWLSEVMLQQTQVATVIP 50 +L WY++ GRK+LPW+I Y+ V++SE+MLQQTQV +V+ Sbjct: 8 ILKWYEQNGRKSLPWRILHEEYRKYGSEDDLKKLKNIDIAYAVYVSEIMLQQTQVKSVLQ 67 Query: 51 -YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFE 109 Y+ +F+A+FP++ L+ A DEVL W GLGYY RARN+HK A+ + P + Sbjct: 68 NYYFQFLAQFPSLKALSMASEDEVLKAWQGLGYYTRARNMHKCAKICVQEFDARLPLEIK 127 Query: 110 EVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSE 169 E+ LPG+G TAGAI + +D N+KRVL+R Y + P K + K ++ Sbjct: 128 ELQKLPGIGEYTAGAIACFGFLQAKSFVDANIKRVLSRFYGLQN-PNSKILVQK----AK 182 Query: 170 QVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 + NQA++D+GA++C KC +CPL+ C + + + K+T Sbjct: 183 EFLNYNNAFEHNQALLDIGALVCLPKNAKCEICPLK--CFCTGKDEYEKFHV--SKKTQY 238 Query: 230 ERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLR 271 E +L+ ++E L + L+ +Y F ++ +++ + Sbjct: 239 ENIVLKILIVQKNEQFLLVKSKEKLYFNMYNFLEYKNQKKAK 280 >UniRef50_B1ZMV1 HhH-GPD family protein n=2 Tax=Verrucomicrobia RepID=B1ZMV1_OPITP Length = 344 Score = 161 bits (407), Expect = 3e-38, Method: Compositional matrix adjust. Identities = 98/244 (40%), Positives = 137/244 (56%), Gaps = 5/244 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F + +L WY + R+ LPW+ + + YK +SE MLQQTQV TV+PY++R++A P Sbjct: 12 AEFQSALLGWYRAHQRR-LPWRTEPSLYKTVVSEFMLQQTQVKTVLPYYDRWLAVLPNFA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VL LW GLGYY+RARNLH+ +Q + L K P T E LPGVG TA Sbjct: 71 ALAGASEAQVLKLWEGLGYYSRARNLHQLSQAIHAL--PKPPRTPEAWRELPGVGPYTAA 128 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS-LSEQVTPAVGVERFNQ 182 AI S++ G +DGNV R+LAR A K ++ L++ + NQ Sbjct: 129 AITSIAFGAPAACVDGNVVRILARLTANRTLFRDSASAAKTFTPLAQALLSPSAPGDHNQ 188 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMM+LGA +C R P C CP++ C AA YP PK+ + +RT + Q + Sbjct: 189 AMMELGATVCVRQNPLCLTCPVRPFCAAARAGDPESYPRLAPKK-MEKRTVTRVWCQRDG 247 Query: 243 EVLL 246 +LL Sbjct: 248 ALLL 251 >UniRef50_A6Q1G1 A/G-specific adenine glycosylase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q1G1_NITSB Length = 310 Score = 160 bits (405), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 83/200 (41%), Positives = 125/200 (62%), Gaps = 12/200 (6%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVI-PYFERFMARFPTVTDLANA 68 +L+W++K+GR LPW+ + YK++LSE+MLQQTQV+ V Y+ +F+ RFPT+ LA A Sbjct: 5 LLEWFEKHGRHELPWRQTQDVYKIYLSEIMLQQTQVSRVEGEYYPKFLKRFPTLKALAQA 64 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 +EVL LW+GLGYY+RARNL Q A + P+ +E+ LPG+G TA AI + Sbjct: 65 SENEVLALWSGLGYYSRARNL----LQCAKICKDTLPKEPKELMKLPGIGTYTANAICAF 120 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 + + ++D N+KRV+ R +A+ +KEV+ K + P ++ N A+MDLG Sbjct: 121 AYNQPVAVVDTNIKRVIMRFFALQ---DEKEVQQKAQMILNTNEP----KKHNLALMDLG 173 Query: 189 AMICTRSKPKCSLCPLQNGC 208 +++CT P C CP+Q C Sbjct: 174 SLLCTPKNPLCDQCPIQQWC 193 >UniRef50_D1PLU8 A/G-specific adenine glycosylase n=2 Tax=Ruminococcaceae RepID=D1PLU8_9FIRM Length = 342 Score = 160 bits (404), Expect = 8e-38, Method: Compositional matrix adjust. Identities = 97/290 (33%), Positives = 149/290 (51%), Gaps = 4/290 (1%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L W+ + ++ LP++ + + Y +W+SE+MLQQT+VA IPY+ RF+A P LA+ Sbjct: 9 LLQWF-QANKRLLPFRQEPSAYHIWVSEIMLQQTRVAAAIPYYNRFIAALPDPAALASCE 67 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D + LW GLGYY R N+ KAA+ V +GG P ++ + +LPG+G TAGAI S++ Sbjct: 68 PDALRKLWQGLGYYNRVNNMQKAARIVCEQYGGDLPSDYDALRSLPGIGDYTAGAIASIA 127 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G P +DGNV RV AR Y + + + P +N+A+M+LGA Sbjct: 128 FGIPAPAVDGNVLRVFARLYNDDADIMQPATKRLFTGRVLEQMPKETPGPYNEALMELGA 187 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 ++C P+C CPL C+ A A P K + L++ +LL +R Sbjct: 188 LVCVPGMPRCEACPLAALCLGYAAGRQADLPVKPAPKVKTPVAVTVALVESPAGLLLQRR 247 Query: 250 PPSGLWGGLY---CFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 P GL GL+ + + L + L++ + L A RH F+H Sbjct: 248 PARGLLAGLWQPAAWETVMSQAELAEELSKIGVQVTWGDALPAARHVFTH 297 >UniRef50_P46230 A/G-specific adenine glycosylase (Fragment) n=145 Tax=cellular organisms RepID=MUTY_AERHY Length = 99 Score = 159 bits (403), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 71/93 (76%), Positives = 83/93 (89%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F+ ++LDWY +GRKTLPWQ DKTPY+VW+SE+MLQQTQVATVIPY++RFMARFP V Sbjct: 7 TTFATRILDWYQIHGRKTLPWQQDKTPYRVWVSEIMLQQTQVATVIPYYQRFMARFPDVQ 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQV 96 LA AP+DEVLH WTGLGYYARARNLHKAAQQ+ Sbjct: 67 ALAQAPIDEVLHHWTGLGYYARARNLHKAAQQI 99 >UniRef50_Q10159 A/G-specific adenine DNA glycosylase n=1 Tax=Schizosaccharomyces pombe RepID=MYH1_SCHPO Length = 461 Score = 159 bits (402), Expect = 1e-37, Method: Compositional matrix adjust. Identities = 113/371 (30%), Positives = 174/371 (46%), Gaps = 75/371 (20%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQ-------IDKTP-----------YKVWLSEVMLQQ 42 ++ +F ++ +YDK R LPW+ + +P Y+V +SE+MLQQ Sbjct: 16 LEVERFRESLIQFYDKTKR-ILPWRKKECIPPSEDSPLEDWEQPVQRLYEVLVSEIMLQQ 74 Query: 43 TQVATVIPYFERFMARFPTVTDLANAPLD-EVLHLWTGLGYYARARNLHKAAQQVATLHG 101 T+V TV Y+ ++M PT+ A A + +V+ LW+G+G+Y R + LH+A Q +A LH Sbjct: 75 TRVETVKRYYTKWMETLPTLKSCAEAEYNTQVMPLWSGMGFYTRCKRLHQACQHLAKLHP 134 Query: 102 GKFPETFEEVA-ALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEV 160 + P T +E A +PGVG TAGA+LS++ + I+DGNV RVL+R A+ K + Sbjct: 135 SEIPRTGDEWAKGIPGVGPYTAGAVLSIAWKQPTGIVDGNVIRVLSRALAIHSDCSKGKA 194 Query: 161 ENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN------- 213 +W L+ ++ V FNQA+M+LGA+ CT P+CS+CP+ C A Sbjct: 195 NALIWKLANELVDPVRPGDFNQALMELGAITCTPQSPRCSVCPISEICKAYQEQNVIRDG 254 Query: 214 -------------------------NSW--ALYPGKKPKQTLPERTGYFLLLQHEDE--- 243 +W A YP K E ++ Q D Sbjct: 255 NTIKYDIEDVPCNICITDIPSKEDLQNWVVARYPVHPAKTKQREERALVVIFQKTDPSTK 314 Query: 244 ---VLLAQRPPSGLWGGLYCFP--QFADE-----------ESLRQWLAQRQIAADNLTQL 287 L+ +RP +GL GL+ FP +F E +S+ QW++ + Q Sbjct: 315 EKFFLIRKRPSAGLLAGLWDFPTIEFGQESWPKDMDAEFQKSIAQWISNDSRSLIKKYQS 374 Query: 288 TA-FRHTFSHF 297 + H FSH Sbjct: 375 RGRYLHIFSHI 385 >UniRef50_Q1V0L8 A/G-specific adenine glycosylase n=3 Tax=Candidatus Pelagibacter RepID=Q1V0L8_PELUB Length = 326 Score = 158 bits (399), Expect = 3e-37, Method: Compositional matrix adjust. Identities = 102/353 (28%), Positives = 170/353 (48%), Gaps = 37/353 (10%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFM 56 M S + ++L WYD +++LPW+ + K Y +SE MLQQTQVATVIPYF F+ Sbjct: 1 MTKSILTKKILYWYDN-NKRSLPWRKELSQVKREYFTLVSEFMLQQTQVATVIPYFNNFI 59 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 P + L+ ++L W GLGYY+R +NL K AQ + + P T +E+ LPG Sbjct: 60 KDIPDIKSLSKIKEHKLLKYWEGLGYYSRVKNLKKTAQILEKNFNRRLPNTIDELKLLPG 119 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T+ AI++++ K F LDGN++RV+ R + P K+ ++ L + + + Sbjct: 120 IGDYTSNAIMAIAFNKPFIPLDGNIERVIKRLLNLK-LP-KEITKDNLVKNKKILGNSTR 177 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 + QA+M+LGA+IC P C CPL C + + + +K K F Sbjct: 178 ASDYAQALMELGALICRPKNPLCHQCPLIKNCKSFKKKDFEIIKERKKK------IDKFY 231 Query: 237 LLQ---HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-DNLTQLTAFRH 292 LL+ ++ LL + L FP +QI+ DNL + F+ Sbjct: 232 LLEVYKKNNKFLLIKNTKFNFLKNLQIFP-------------MKQISKPDNLAKALNFK- 277 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 S+ +++++ + S G + N+ W + ++ L +++ L+ Sbjct: 278 -ISNMNMNVI---IKFSKLKGEI--SNSSWIDSSKLEDYTLPTFTKKIFNYLK 324 >UniRef50_A8Q4W7 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8Q4W7_MALGO Length = 625 Score = 157 bits (398), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 84/202 (41%), Positives = 125/202 (61%), Gaps = 14/202 (6%) Query: 25 QIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA 84 QI + Y+VW+SE+MLQQT+V T Y+ +++ +P++ LA+A +D+VL W GLGYY Sbjct: 114 QIQRRAYEVWISEIMLQQTRVETARSYWLKWIEAWPSIDALADASVDDVLAAWRGLGYYG 173 Query: 85 RARNLHKAAQQVATLHG----GKFPETFEEVA-ALPGVGRSTAGAILSLSLGKHFPILDG 139 RAR +H+AAQ++ +H G++PE E+ +PGVG TAGAI S+ G PILDG Sbjct: 174 RARRIHEAAQKI--MHDPSMKGQWPEYAHELCEKIPGVGPYTAGAISSIVFGHAVPILDG 231 Query: 140 NVKRVLARCYAVSGWPGKKEVENKLWSLS----EQVTPAVGVE---RFNQAMMDLGAMIC 192 NV RVL+R + P K + LW ++ E V P + R+NQ +M+LG+ +C Sbjct: 232 NVARVLSRQTGLYADPRSKSTNDLLWYMARMLVEHVAPNHRSDVPGRWNQGLMELGSTLC 291 Query: 193 TRSKPKCSLCPLQNGCIAAANN 214 T ++P C CP+Q+ C A + Sbjct: 292 TPTRPACDTCPIQSTCAVYAES 313 >UniRef50_C0ACJ1 HhH-GPD family protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ACJ1_9BACT Length = 355 Score = 156 bits (394), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 89/219 (40%), Positives = 120/219 (54%), Gaps = 6/219 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F +L WY R+TLPW+ + YK +SE MLQQTQV TV+PYF R++ FP Sbjct: 16 ASFQHALLAWYHA-SRRTLPWREAPSLYKTVVSEFMLQQTQVKTVLPYFARWLDAFPDFA 74 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA AP + VL W GLGYY RARNLHK A+ + L + P ++ ALPG+G T+ Sbjct: 75 ALAAAPEERVLKHWEGLGYYTRARNLHKLARALVAL--PEIPRRPDDWLALPGIGPYTSA 132 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKE--VENKLWSLSEQVTPAVGVERFN 181 AI S++ G+ ++DGNV R+L R A G P + L+ + N Sbjct: 133 AITSIAFGEPVAVVDGNVVRILTRLTA-DGTPFRDSATASKHFVPLANAIVNPAHPGDHN 191 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYP 220 QAMM+LGA +C R P C++CP+ C A YP Sbjct: 192 QAMMELGATMCHRQNPLCTVCPVIRFCAARREGDPETYP 230 >UniRef50_A7NN47 HhH-GPD family protein n=2 Tax=Bacteria RepID=A7NN47_ROSCS Length = 317 Score = 154 bits (390), Expect = 3e-36, Method: Compositional matrix adjust. Identities = 83/224 (37%), Positives = 126/224 (56%), Gaps = 6/224 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L W+ + R LPW+ + PY++ ++EVMLQQTQV V+P + F+ RFPT+ LA AP Sbjct: 24 LLKWFSEAARD-LPWRRTRDPYRILVAEVMLQQTQVDRVLPKYAAFLERFPTLHTLAEAP 82 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 EV+ +W GLGY RA NL +AA+ + +GG FP + LPG+G TAGA+ + Sbjct: 83 TAEVIRMWAGLGYNRRAVNLQRAARAICARYGGVFPRDVATLVTLPGIGSYTAGAVACFA 142 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 + +D N++RV+ R + P + E L +L+ P +NQA+M+LG+ Sbjct: 143 FEQDVAFMDTNIRRVIRRVFTD---PTETVNERALLALARAALPVGRSWMWNQALMELGS 199 Query: 190 MICTRSKPKCSLCPLQNGC--IAAANNSWALYPGKKPKQTLPER 231 ++CT P C CPL++ C AA S + ++ L ER Sbjct: 200 LVCTADAPACWRCPLRDQCRDYAARRESDERFASAPVRKRLAER 243 >UniRef50_D2VUC5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VUC5_NAEGR Length = 616 Score = 154 bits (389), Expect = 5e-36, Method: Compositional matrix adjust. Identities = 101/325 (31%), Positives = 146/325 (44%), Gaps = 72/325 (22%) Query: 9 QVLDWYDKYGRKTLPW------------------QIDKTPYKVWLSEVMLQQTQVATVIP 50 +++WYD +K LPW ++ + Y+V +SE+MLQQTQV TV Sbjct: 43 NLINWYDN-NKKDLPWRKTIKSADDEDDDVKATEELHQRAYEVLVSEIMLQQTQVETVKT 101 Query: 51 YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGG-------- 102 YF ++M RFPTV DLA+A +D+ + +W+GLGYY RAR L+ AA+ + Sbjct: 102 YFLKWMKRFPTVKDLADATVDDAVGIWSGLGYYRRARYLYNAAKYIVEKINKPIKNEEII 161 Query: 103 ---------------KFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLAR 147 K P+ +++ LPGVGR TAGAI S++ + ++DGNV RV AR Sbjct: 162 SDDEDDDEEKKRIIYKLPDNVKDLMQLPGVGRYTAGAIASIAFSRTASVVDGNVFRVFAR 221 Query: 148 CYAVSG------------WPGKKEVENKLWSLSEQVTPAVGVE---------RFNQAMMD 186 + WP + + + V +E FNQA+M+ Sbjct: 222 LKRIEEDIAVNKTANTVFWPMADNLIQHYRDIEDTVLEKEDLELKHRYNVTGDFNQAVME 281 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAAN------NSWALYPGKKPKQTLPERTGYFLLLQH 240 LG +C PKC+ CPL C A + +YP K K + + Sbjct: 282 LGRTVCIPRNPKCTECPLAEVCEANKALEKEEIATVEIYPVKNKKTKKRTQVVSCCIFYK 341 Query: 241 E---DEVLLAQRPPSGLWGGLYCFP 262 E D L +RP L GG + FP Sbjct: 342 EGDVDSCLFEKRPKETLLGGTWHFP 366 >UniRef50_UPI000034F1ED adenine-DNA glycosylase-related / MYH-related n=1 Tax=Arabidopsis thaliana RepID=UPI000034F1ED Length = 630 Score = 154 bits (388), Expect = 6e-36, Method: Compositional matrix adjust. Identities = 118/368 (32%), Positives = 176/368 (47%), Gaps = 66/368 (17%) Query: 10 VLDWYDKYGRKTLPW-------QIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +LDWYD ++ LPW + ++ Y+VW+SE+MLQQT+V TV+ Y++R+M ++PT+ Sbjct: 135 LLDWYD-VNKRDLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKRWMQKWPTI 193 Query: 63 TDLANAPLD-------------------EVLHLWTGLGYYARARNLHKAAQQVATLHGGK 103 DL A L+ EV +W GLGYY RAR L + A+ V G Sbjct: 194 YDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKMVVAGTEG- 252 Query: 104 FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK 163 FP + + G+G+ TAGAI S++ + P++DGNV RVLAR A+S P + Sbjct: 253 FPNQASSLMKVKGIGQYTAGAIASIAFNEAVPVVDGNVIRVLARLKAISANPKDRLTARN 312 Query: 164 LWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC----IAAANNSWAL- 218 W L+ Q+ FNQ++M+LGA +CT SKP CS CP+ + C ++ N + ++ Sbjct: 313 FWKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSEENRTISVT 372 Query: 219 -YPGK----KPKQTLPERTGYFLLLQHEDE-------VLLAQRPPSGLWGGLYCFPQF-- 264 YP K KP+ +L H E +L +RP GL GL+ FP Sbjct: 373 DYPTKVIKAKPRHDF---CCVCVLEIHNLERNQSGGRFVLVKRPEQGLLAGLWEFPSVIL 429 Query: 265 ---ADEESLRQWLA-------------QRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPV 308 AD + R + ++ + +L F H F+H + L V Sbjct: 430 NEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYVELLVV 489 Query: 309 SSFTGCMD 316 G D Sbjct: 490 QLTGGTED 497 >UniRef50_Q6IEF2 A/G-specific adenine glycosylase, putative n=3 Tax=Trypanosoma RepID=Q6IEF2_9TRYP Length = 510 Score = 153 bits (387), Expect = 7e-36, Method: Compositional matrix adjust. Identities = 107/353 (30%), Positives = 164/353 (46%), Gaps = 70/353 (19%) Query: 11 LDWYDKYGRKTLPWQID-----------------KTPYKVWLSEVMLQQTQVATVIPYFE 53 ++W+ R+ LPW+ PY VW+SEVM QQTQ+ TVIPYF+ Sbjct: 72 IEWFHTNQRRDLPWRQTFSDTDACHPVVPVVTPLTDPYHVWVSEVMSQQTQMDTVIPYFK 131 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 R++A FP + LA A V+ +W+G+GYY RA L K A+ V GG P T ++ A Sbjct: 132 RWIAVFPDIATLARATEASVMAVWSGMGYYRRALYLKKGAEHVMKHFGGSLPTTAAQLRA 191 Query: 114 LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGW-PGKKEVENKLWSLSEQVT 172 +PG+G T+ AI S+ + +DGNV RVL+R + P + +++ +++ Sbjct: 192 IPGIGLYTSAAIASICFRERIISVDGNVVRVLSRLRCERNFDPKSAKSIKEVFHWGQEIM 251 Query: 173 PAVGVER---FNQAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPK--- 225 +R FNQ +M++GA +C S +P C CPL C A A A+ G+ P Sbjct: 252 GEGPCDRPGDFNQGLMEIGARVCKPSGRPLCEECPLHRYCGAYA----AMQSGEIPSIEG 307 Query: 226 ------QTLPERTGYFLLLQHE------------DEVLLAQRPPSGLWGGLYCFP----- 262 +TL ++ + L HE ++ +RP GL GG+ FP Sbjct: 308 VIPLRAKTLKKKKEHVFCLVHEFREKTAGDSALKRRFIVVRRPEEGLLGGMLEFPSRTYV 367 Query: 263 -------------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 DE LR+ LA N+ ++ +H FSH + ++ Sbjct: 368 SSPDGGTDIKGSRDVTDE--LRKKLAA---GHKNVVEVGHVQHIFSHIDMRVL 415 >UniRef50_B9SF01 A/G-specific adenine glycosylase muty, putative n=3 Tax=core eudicotyledons RepID=B9SF01_RICCO Length = 775 Score = 153 bits (387), Expect = 8e-36, Method: Compositional matrix adjust. Identities = 110/293 (37%), Positives = 158/293 (53%), Gaps = 35/293 (11%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT--------------PYKVWLSEVMLQQTQVAT 47 + + +L+WYD+ R+ LPW+ KT Y +W+SEVMLQQT+V T Sbjct: 35 ETQKIRESLLEWYDQNQRQ-LPWRRQKTTNPSQESEEEKEKRAYGIWVSEVMLQQTRVQT 93 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 VI Y+ R+M ++PT+ LA A L+EV +W GLGYY RAR L + A+ + GG FP T Sbjct: 94 VIDYYNRWMLKWPTIHHLAQASLEEVNEIWAGLGYYRRARFLLEGAKMIVA--GGGFPNT 151 Query: 108 FEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 + +PG+G TAGAI S++ + P++DGNV RVL R A+S P KLW L Sbjct: 152 VSSLRKVPGIGDYTAGAIASIAFKEVVPVVDGNVVRVLTRLRAISANPKDSMTVKKLWKL 211 Query: 168 SEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC--IAAANNSWAL----YPG 221 + Q+ FNQ++M+LGA +C S P CS CP+ + C ++ +N ++ YP Sbjct: 212 AAQLVDPCRPGDFNQSLMELGATVCAPSNPSCSSCPVSSQCRVLSISNQDKSILVTDYPT 271 Query: 222 K----KPKQTLPERTGYFLL--------LQHEDEVLLAQRPPSGLWGGLYCFP 262 K KPK +L + + + LL +RP GL GL+ FP Sbjct: 272 KVVKVKPKHEFSAVCVVEILGSCGPVDNQKTDSKFLLVKRPDDGLLAGLWEFP 324 >UniRef50_Q7M9R1 A/G-SPECIFIC ADENINE GLYCOSYLASE EC 3.2.2 n=1 Tax=Wolinella succinogenes RepID=Q7M9R1_WOLSU Length = 320 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 95/251 (37%), Positives = 135/251 (53%), Gaps = 16/251 (6%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP-YFERFMARFPTVTDLANA 68 +L WY GR LPW+ PY++ +SE+MLQQTQV TV+ ++ F+ RFPT+ +A A Sbjct: 9 LLLWYQANGRHHLPWRHPLHPYEILISEMMLQQTQVNTVLERFYYPFLERFPTLESIARA 68 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 E+L W GLGYY+RARNLH A+ T G P + E+ LPG+G TA AI Sbjct: 69 EESEILLAWRGLGYYSRARNLHALAK---TCQQG-LPRSVSELEGLPGIGAYTARAIACF 124 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLG 188 + ILDGN+KR+L+R +A+ G G++E LW +E+ + NQA++D+G Sbjct: 125 GFRESVAILDGNIKRILSRFFALLG-VGERE----LWRRAEEFLNPLAAFDHNQALLDVG 179 Query: 189 AMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 A++C P C CPL C + P K + E ++ + E V + Q Sbjct: 180 ALLCKPKNPLCQECPLSPWC-KGKEDPLRYTPSKTRRYE--ELELFYGICIQEGRVAMVQ 236 Query: 249 ---RPPSGLWG 256 R GLWG Sbjct: 237 SQERLYKGLWG 247 >UniRef50_C9M6M4 A/G-specific adenine glycosylase n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6M4_9BACT Length = 354 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 96/280 (34%), Positives = 148/280 (52%), Gaps = 10/280 (3%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 S + +WY + GR LPW++ PY VW+SE MLQQTQV+ V+P + +++ FP LA Sbjct: 13 SRALAEWYRENGRH-LPWRLTAGPYAVWVSETMLQQTQVSRVVPLYLAWLSAFPNCRSLA 71 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A VL LW GLGYY+RARN+ +A+ + + P +A LPG G TA A+ Sbjct: 72 EADESAVLSLWRGLGYYSRARNMLASARLIVSAGYDGPPNDQTFLAKLPGFGPYTAAAVR 131 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWP----GKKEVENKLWSLSEQVTPAVGVERFNQ 182 +L+ + LDGN++RV +R + P G K + + SL + +P + Sbjct: 132 ALAYDEPTAALDGNLRRVSSRLTDLDKDPALSEGNKVCQRAVESLMKFQSPRL----LTN 187 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+MDLG+ C +P+C LCPL+ C+A + AL P ++ + + R G LL + Sbjct: 188 ALMDLGSGPCA-PRPRCLLCPLEPYCLARKRGTTALRPVRRAARPIRRRFGAALLFSTQT 246 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD 282 ++ + QRP GLW + P + + R++AA Sbjct: 247 KLAVRQRPKGGLWSEFWEIPWEVGSDGEDSGVTARRLAAS 286 >UniRef50_A9UV22 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UV22_MONBE Length = 221 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 82/216 (37%), Positives = 117/216 (54%), Gaps = 18/216 (8%) Query: 10 VLDWYDKYGRKTLPWQ----------------IDKTPYKVWLSEVMLQQTQVATVIPYFE 53 ++ WY ++ R LPW+ + Y VW+SEVM QQTQ++ V YFE Sbjct: 7 LIQWYHQHQR-VLPWRQPSVRRSAESMTHGQDSNTMAYAVWISEVMCQQTQISVVTDYFE 65 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 R++A++PTV LA A L +V W GLGYY+RA LH+AAQ + G FP + Sbjct: 66 RWIAKWPTVQALAAAQLSDVHQAWAGLGYYSRATRLHEAAQYIVNQLDGSFPTVARLIFC 125 Query: 114 L-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 PGVG TA A+ S+ K ++DGNV RVL+R ++ + + +W ++ + Sbjct: 126 WPPGVGPYTASAVASIVFAKRVGVVDGNVNRVLSRLGGIAVPLTQDAAKKWMWRQADTLA 185 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 + NQAMM+LGA+ICT P+C CPL C Sbjct: 186 DHAAPGQINQAMMELGALICTPKSPQCHACPLAEHC 221 >UniRef50_Q4P7F9 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P7F9_USTMA Length = 539 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 99/275 (36%), Positives = 146/275 (53%), Gaps = 37/275 (13%) Query: 10 VLDWYDKYG-RKTLPWQID------------------KTPYKVWLSEVMLQQTQVATVIP 50 +L W++ ++ +PW+ D K Y+VW+SE+MLQQT+V TV Sbjct: 91 LLTWFETVSSKRDMPWRADFIDASHYSDAAKLREARKKRAYQVWISEIMLQQTRVETVKT 150 Query: 51 YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQ-VATLH-GGKFPETF 108 Y+ +M ++PT+ LA A + VL W GLGYY+RA +H AA++ VA H G PET Sbjct: 151 YWLNWMNKWPTIEALAAADPEAVLAAWRGLGYYSRATRIHTAAKRVVADPHMMGLLPETP 210 Query: 109 EEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 +E+ +PGVG TAGAI S+ G+ PI+DGNV RVL+R A+ P K + +W+ Sbjct: 211 QELEKNVPGVGPYTAGAISSIVFGQAVPIIDGNVVRVLSRQLALYANPKTKLTSHLMWAT 270 Query: 168 SEQVTPAVGVER---------FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN-NSWA 217 + ++ R +NQA+M+LG+ +CT +KP+C CP+Q C A + Sbjct: 271 ATKLVHKASALRGGKATVPGQWNQALMELGSTVCTPTKPRCDECPIQATCSAHQEAQADT 330 Query: 218 LYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPS 252 + KP Q +L ED L +R PS Sbjct: 331 IKANNKPNQ-----AAGAEVLDIEDICTLCERFPS 360 >UniRef50_C7YS47 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YS47_NECH7 Length = 618 Score = 149 bits (375), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 87/242 (35%), Positives = 138/242 (57%), Gaps = 40/242 (16%) Query: 10 VLDWYDKYG-RKTLPWQ-----------------IDKTPYKVWLSEVMLQQTQVATVIPY 51 +LDW+D ++++PW+ +++ Y+VW+SE+MLQQT+VA VI Y Sbjct: 129 LLDWFDGVSTKRSMPWRKAWINPKDHLPDDLRNLLERRAYEVWISEIMLQQTRVAVVIDY 188 Query: 52 FERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT--LHGGKFPETFE 109 + ++MA++P++ DLA A D+VL W GLGYY+RA +H+A++ V G P + Sbjct: 189 WNKWMAKWPSIHDLAAASADDVLSAWRGLGYYSRATRIHEASKLVVKDPAMKGLLPSCTQ 248 Query: 110 EV-AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSL 167 ++ A +PGVGR TAGAI ++ G+ P++DGNV RVL+R + G K V + LW+ Sbjct: 249 DLEAKVPGVGRYTAGAISAIVFGQAAPMVDGNVLRVLSRQLGLLGNVKTTKVVIDTLWAA 308 Query: 168 SEQVTPAVGVE-----------------RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 ++ + AV + R+ QA+M+LG+ +C KP C CP+ + C A Sbjct: 309 ADALVKAVARDETGTPDGEEAETSDRPGRWGQALMELGSTVCV-PKPNCGECPITSTCRA 367 Query: 211 AA 212 A Sbjct: 368 YA 369 >UniRef50_A9AYZ2 HhH-GPD family protein n=5 Tax=Chloroflexi (class) RepID=A9AYZ2_HERA2 Length = 323 Score = 147 bits (372), Expect = 5e-34, Method: Compositional matrix adjust. Identities = 83/213 (38%), Positives = 117/213 (54%), Gaps = 5/213 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S +L W+ GR LPW+ + PY + +SE MLQQTQV VIP +E F+A FPTV Sbjct: 5 STLQIDLLAWFQANGRD-LPWRRTRNPYYILVSETMLQQTQVDRVIPKYEAFLALFPTVE 63 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT----LHGGKFPETFEEVAALPGVGR 119 LA+A +V+ W GLGY RA NL +AAQ + FP T E + LPG+G Sbjct: 64 ALASASTADVIRSWQGLGYNRRAVNLQRAAQAIVAAGYPADPAGFPATPEGLRNLPGIGA 123 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 T+GA+ + + LD N++RV+ R + E L ++Q+ P Sbjct: 124 YTSGAVACFAFERDVAFLDTNIRRVVRRLLVGPEDAPPETNEQTLIDYAQQLIPQGQGWA 183 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA 212 +NQA+M+LGA+IC+ +KP+C CP+ C A A Sbjct: 184 WNQAIMELGALICSAAKPQCWRCPVNQHCRAYA 216 >UniRef50_A4AHC9 Adenine glycosylase n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AHC9_9ACTN Length = 297 Score = 145 bits (366), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 80/208 (38%), Positives = 119/208 (57%), Gaps = 5/208 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 ++FS +W++++GR LPW+ D + + +SE+MLQQT VA VIP E+++ R+PT Sbjct: 11 AEFSLVTREWFNEHGRD-LPWRHDGFGAWGILVSEIMLQQTPVARVIPRLEQWLDRWPTP 69 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA E + W LGY RA NLH AA + H G PE + + ALPG+G TA Sbjct: 70 AALAADSPGEAVRAWERLGYPRRALNLHAAATAITKNHDGTVPEDVQSLLALPGIGDYTA 129 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA--VGVERF 180 A+ + + G P++D NV+RV+AR A G G L ++ E + P V + Sbjct: 130 RAVAAFAYGHRHPVVDTNVRRVIARAVAGQGEAGPPSTRRDLAAM-ELLLPEDPVAAQLT 188 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGC 208 N A+M+LGA++CT +P C CP++ C Sbjct: 189 NAAVMELGAIVCTAKRPLCDECPVRELC 216 >UniRef50_B6K934 Helix-hairpin-helix motif-containing protein n=3 Tax=Toxoplasma gondii RepID=B6K934_TOXGO Length = 833 Score = 145 bits (365), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 81/205 (39%), Positives = 119/205 (58%), Gaps = 8/205 (3%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 + + D+ G K ++ +PY VW+SEVMLQQTQV TVI Y++R+M+R+PTV DL Sbjct: 244 GEAQEKVDRAGSKEATRRV--SPYGVWVSEVMLQQTQVCTVIDYWQRWMSRWPTVGDLVK 301 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A +EV +W+GLGYY RAR L K AQ V G+ P E++ ++PG+G T GAI + Sbjct: 302 ATEEEVSQMWSGLGYYRRARQLLKGAQTVVQEFDGELPGDVEKLLSIPGIGPYTGGAISA 361 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER---FNQAM 184 ++ G +DGNV RVLAR ++ + + S + P + R +A+ Sbjct: 362 IAFGNRAAAVDGNVLRVLARLLGLAAPADSRAL---AMFCSRWMPPFLDPRRPGASTEAL 418 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCI 209 ++LGA ICT P C CP++ C+ Sbjct: 419 IELGATICTPRAPSCLSCPVRQFCL 443 >UniRef50_B7G7W7 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G7W7_PHATR Length = 645 Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 80/170 (47%), Positives = 103/170 (60%), Gaps = 8/170 (4%) Query: 29 TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARN 88 T Y VW+SE+MLQQT+V V+PY+ R+M RFPTV DLA A D V W GLG+Y RAR Sbjct: 190 TAYGVWVSEIMLQQTRVEAVVPYWVRWMIRFPTVHDLALADEDAVNAHWAGLGFYRRARL 249 Query: 89 LHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARC 148 LH AA+ + G PE +E+ LPGVGR TA AI S++ + P++DGNV RVLAR Sbjct: 250 LHSAAKYIVNDCNGALPENVQELLHLPGVGRYTASAIASIAFNVNVPVVDGNVCRVLARL 309 Query: 149 YAVSG---WPGKKEVENKLWSLSEQVTPA---VGVERFNQAMMDLGAMIC 192 ++ P K +N W L+ Q+ A NQA+M+LGA C Sbjct: 310 RGIANNIKAPALK--DNHGWDLAAQIVSAGDGSAPGEVNQAIMELGATYC 357 >UniRef50_D1Y8G9 A/G-specific DNA-adenine glycosylase n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y8G9_9BACT Length = 365 Score = 144 bits (363), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 101/294 (34%), Positives = 151/294 (51%), Gaps = 14/294 (4%) Query: 1 MQASQFSAQVLD----WYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFM 56 M A +F + + WY+ + R LPW++D+ PY + +SE MLQQTQV V ++ R+M Sbjct: 1 MSAEKFHLEAAEALTAWYNSHKRD-LPWRLDRDPYHILVSEAMLQQTQVERVKSFYARWM 59 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 RFPT+T LA+A D+VL W GLGYY+RARNL +AA+ V+ P E + +LPG Sbjct: 60 ERFPTLTSLASASEDDVLACWQGLGYYSRARNLRRAARLVSGAGLKTLPADEEFLRSLPG 119 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T GA+ S++ P +DGNV+RV +R + P + + + + + + Sbjct: 120 LGPYTVGAVCSIAFDLPVPAIDGNVRRVFSRLLDMPDDPARAKGTALIAAHAAAILKLGS 179 Query: 177 VERFNQAMMDLGAMICT-RSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 QA M+LGA +CT + +C CP+ C A A + A P + + R G Sbjct: 180 PHILTQAFMELGATVCTPGTTCQCGQCPVSRLCAAQAAGTQAQRPVSTRRNVVERRRGAA 239 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA 289 LL+ + +RP GLW Y P E + A L++LTA Sbjct: 240 LLILVPHGCAVRRRPAGGLWSRFYEIPWLRGEAD--------ESAESCLSRLTA 285 >UniRef50_D0RRM3 A/G-specific adenine glycosylase n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RRM3_9RICK Length = 337 Score = 144 bits (362), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 12/219 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTP----YKVWLSEVMLQQTQVATVIPYFERFMARF 59 S S ++L WY KY R LPW+ K YKV LSE MLQQTQV T +P+F F + Sbjct: 3 SSISKKILAWYQKYQR-DLPWRKYKNQKDRHYKVLLSEFMLQQTQVKTALPFFNNFYKKI 61 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 ++ L+ +V LW GLGYY RA+ L + ++ + + K P +E++ ALPG+G Sbjct: 62 NSLEKLSKTSQAKVNKLWQGLGYYRRAKFLLETSKIIKKKYNYKLPSQYEDLIALPGIGD 121 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA AILS++ K+ +DGNV+RV+ R + +SG + K+ + E++ Sbjct: 122 YTAKAILSIAFDKNEIGIDGNVERVVTRIFNISG-------KKKILNYVERLKVEKKASF 174 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 Q +M++GA+IC P C+ C L C ++N + + Sbjct: 175 LMQGLMEVGALICKPKLPLCNDCFLNKNCKYSSNKKFNI 213 >UniRef50_UPI0000510111 A/G-specific adenine glycosylase n=1 Tax=Brevibacterium linens BL2 RepID=UPI0000510111 Length = 345 Score = 142 bits (359), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 80/204 (39%), Positives = 117/204 (57%), Gaps = 11/204 (5%) Query: 10 VLDWYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++ W++ R LPW+ D T + V +SE+M QQT V+ V P + +M ++PT DLA A Sbjct: 33 IITWFESAARP-LPWRDADTTAWAVLVSEIMSQQTPVSRVEPRWREWMQKWPTPADLAQA 91 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 P EVLH W LGY RA L +AA+ + G P+T +E+ LPG+G TA A+ S Sbjct: 92 PTAEVLHRWDRLGYPRRALRLQEAARVITEELDGHVPQTAKELERLPGIGSYTAAAVTSF 151 Query: 129 SLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + G+ +LD NV+RVL R +A S PG+ E E W + Q P +++N + Sbjct: 152 AHGERTTVLDTNVRRVLIRLFAGRDRPSPSPGRAETE---W--AGQFVPETEHKQWNAGV 206 Query: 185 MDLGAMICTRSKPKCSLCPLQNGC 208 M+ GA++CT P+C CPL + C Sbjct: 207 MEFGALVCTARNPQCETCPLNDIC 230 >UniRef50_Q4Q851 A/G-specific adenine glycosylase, putative n=3 Tax=Leishmania RepID=Q4Q851_LEIMA Length = 501 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 113/363 (31%), Positives = 167/363 (46%), Gaps = 70/363 (19%) Query: 9 QVLDWYDKYGRKTLPWQ-------------IDK------------TPYKVWLSEVMLQQT 43 +V+ W+ ++ R+ LPW+ +D PY+VW+SEVM QQT Sbjct: 34 EVIAWWRQHQRQDLPWRQTLPRGSGDLSGAVDGEKHRTSATTARYDPYQVWVSEVMSQQT 93 Query: 44 QVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGK 103 ++ TVIPY+ +M RFP++ LA + DEV +W G+GYY RA L K A+ + K Sbjct: 94 RMETVIPYYVAWMKRFPSIEALAASTEDEVKSVWAGMGYYRRAMYLRKGAKYLLERSKEK 153 Query: 104 -------FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPG 156 P + EE+ +PG+G T+ AI S+ GK +DGNV RVL+R + Sbjct: 154 EGTAAVCMPSSQEELLKVPGIGPYTSAAITSMCFGKPVCSVDGNVIRVLSRLRGARDFDP 213 Query: 157 K-----KEVE---NKLWSLSEQVTPAVGVE--RFNQAMMDLGAMICTRS-KPKCSLCPLQ 205 K KE +L S + V + NQ +M+LGA +C S P C+ CPLQ Sbjct: 214 KVPANVKEAAAWGQRLMGNSPTTSAVVCQDPSALNQGLMELGASVCRPSGAPLCTSCPLQ 273 Query: 206 NGCIAAA---NNSWALYPGKKPKQT--LPERTGYFLLLQHE---------DEVLLAQRPP 251 C A+A G P + + +R+ L + HE ++ +RP Sbjct: 274 RFCRASALLRGGDIEAIEGVIPVRAAKMAKRSARKLCVVHEMSASRIGDARRFVVVRRPA 333 Query: 252 SGLWGGLYCFPQFA-----DEESLR-------QWLAQRQIAADNLTQLTA-FRHTFSHFH 298 +GL GG+ FP DEE++ LA + A +L RH FSH Sbjct: 334 NGLLGGMLEFPTVNASAADDEEAVVPLLKHPLSMLAWKARARPTFARLCGNVRHIFSHIV 393 Query: 299 LDI 301 +D+ Sbjct: 394 MDV 396 >UniRef50_D1Z9Z8 Whole genome shotgun sequence assembly, scaffold_13 n=1 Tax=Sordaria macrospora RepID=D1Z9Z8_SORMA Length = 693 Score = 141 bits (356), Expect = 3e-32, Method: Compositional matrix adjust. Identities = 85/208 (40%), Positives = 117/208 (56%), Gaps = 28/208 (13%) Query: 31 YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLH 90 Y+V LSE MLQQT+VA VI Y+ +++A PT+ LA + VL LW+GLGYY+RA L Sbjct: 214 YEVLLSETMLQQTRVAPVIAYYSKWLAILPTMQSLAASNPATVLSLWSGLGYYSRATRLQ 273 Query: 91 KAAQQVA-----TLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVL 145 AQQV + G P T E++ LPGVGR TAGA+ + G+ P++DGNV RVL Sbjct: 274 TLAQQVCPPLEQGGYDGLLPHTVEKLMQLPGVGRYTAGAVACIVYGRAEPMVDGNVIRVL 333 Query: 146 ARCYAVSGWPGKKEVENKLWSLSEQVTPAV-------------GVE--------RFNQAM 184 AR + G KEV +W ++++ AV G E ++ Q + Sbjct: 334 ARQLGIRGDVKSKEVLKVMWEAAKRLVEAVAWDGTDATDEERKGKEPPVSDRPGKWGQGL 393 Query: 185 MDLGAMICTR--SKPKCSLCPLQNGCIA 210 M+LGA +C +KPKC LCP++ C A Sbjct: 394 MELGATVCLGGLAKPKCGLCPIKGTCRA 421 >UniRef50_B6K1Z7 A/G-specific adenine DNA glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1Z7_SCHJY Length = 470 Score = 140 bits (353), Expect = 7e-32, Method: Compositional matrix adjust. Identities = 84/229 (36%), Positives = 128/229 (55%), Gaps = 21/229 (9%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID--KTP----------------YKVWLSEVMLQQT 43 + F Q+LD+Y+ ++ LPW+ D K P Y+V +SE+MLQQT Sbjct: 13 EVKSFRKQLLDFYEN-TKRNLPWRKDPYKPPQKDDPDDRKKLAVQRFYEVLVSEIMLQQT 71 Query: 44 QVATVIPYFERFMARFPTVTDLANAPLD-EVLHLWTGLGYYARARNLHKAAQQVATLHGG 102 +V TV Y++++M PT+ A A D EV+ LW+G+G+Y R + LH A + +A L Sbjct: 72 RVETVKRYYDKWMKTLPTILHCAQADYDTEVMPLWSGMGFYGRCKRLHSACKYLAVLPAE 131 Query: 103 KFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE 161 + P + E +A +PGVG TAGA+LS++ G ++DGNV+RVL+R A+ K + Sbjct: 132 EIPTSPERLAKNVPGVGPYTAGAVLSIAWGIPTGVVDGNVQRVLSRLLALHCNVTKGKPN 191 Query: 162 NKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 +W ++ + FNQA+M+LGA+ CT C CP+ N C A Sbjct: 192 AFVWQMANLLVDPNFPGNFNQALMELGAVTCTPQTFNCPGCPVSNICKA 240 >UniRef50_C8NK18 A/G-specific adenine glycosylase n=8 Tax=Corynebacterium RepID=C8NK18_COREF Length = 308 Score = 139 bits (351), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 81/212 (38%), Positives = 118/212 (55%), Gaps = 10/212 (4%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTP-YKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + + VL W+ + + L W+ TP + + LSEVM QQT VA V P + +MA + Sbjct: 16 MSYNALQSSVLRWF-RLNARDLAWRDPDTPAWGILLSEVMSQQTPVARVEPIWLEWMATW 74 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT A A DEVL W LGY RA L + A+ + HGG+ P+T +E+ ALPG+G Sbjct: 75 PTPQAFAEASTDEVLRAWGKLGYPRRALRLLECARVIVDKHGGRVPDTVDELLALPGIGD 134 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYA---VSGWPGKKEVENKLWSLSEQVTPAVG 176 TA A+ + + G++ P++D NV+RV R + G KKE+ + + PA Sbjct: 135 YTARAVAAFAFGQNVPVVDTNVRRVYRRAVEGRFLQGTASKKELVD-----VAAILPADS 189 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 F+ +M+LGA+ICT + PKC+ CPL C Sbjct: 190 GPEFSAGIMELGALICTATSPKCASCPLLELC 221 >UniRef50_D1V787 HhH-GPD family protein n=4 Tax=Actinomycetales RepID=D1V787_9ACTO Length = 332 Score = 137 bits (345), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 84/225 (37%), Positives = 129/225 (57%), Gaps = 8/225 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQI-DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+ VL+W+ R+ LPW+ + P+ V +SEVMLQQT V V+P +E ++AR+P Sbjct: 42 FAEPVLEWFGLVARE-LPWRRPEAGPWAVLVSEVMLQQTPVNRVLPVYEAWLARWPDPAA 100 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA P E + +W LGY RA LH+AA + HGG P+ +++ ALPG+G TA A Sbjct: 101 LAAEPSGEAVRMWGRLGYPRRALRLHQAATAIVERHGGAVPDNLDDLLALPGIGTYTARA 160 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWP-GKKEVENKLWSLSEQVTPA--VGVERFN 181 + + + + P++D NV+R++AR A+ G G V K +L E + PA R + Sbjct: 161 VTAFAFRQRQPVIDVNVRRLVAR--AIEGRAEGPVAVSRKDLALVEDLLPADAETAARAS 218 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCI-AAANNSWALYPGKKPK 225 A M+LGA++C P+C+ CP++ C A + A P +KP+ Sbjct: 219 AAFMELGALVCVARAPRCAGCPVRERCAWLLAGSPPADGPARKPQ 263 >UniRef50_A1RYX1 HhH-GPD family protein n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1RYX1_THEPD Length = 253 Score = 137 bits (344), Expect = 7e-31, Method: Compositional matrix adjust. Identities = 74/204 (36%), Positives = 114/204 (55%), Gaps = 9/204 (4%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + + ++ +WY + GR PW+ + PY + +E +LQ+T+ TV FE F +R+P+ Sbjct: 41 EVARRLAEWYRRRGR-DFPWRHTRDPYVILATEFLLQRTRAETVAKVFEEFFSRYPSPES 99 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LANA +E+ ++ LG RA L +AA+++ HGG P + EE+ L GVG A A Sbjct: 100 LANADPEELRKFFSRLGLVRRADALREAAREIVERHGGSVPRSKEELLKLKGVGPYIASA 159 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +L + P++D NV+RVL R +G ++E E L L P + A+ Sbjct: 160 VLCFAYSAPVPVVDTNVERVLGR---AAGASSREEAEAFLERLLRHGNP----REISLAL 212 Query: 185 MDLGAMICTRSKPKCSLCPLQNGC 208 +DLGA++CTR KPKC CPL + C Sbjct: 213 IDLGALVCTR-KPKCPECPLSDLC 235 >UniRef50_Q5KQ30 A/G-specific adenine DNA glycosylase, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KQ30_CRYNE Length = 568 Score = 136 bits (342), Expect = 1e-30, Method: Compositional matrix adjust. Identities = 81/222 (36%), Positives = 126/222 (56%), Gaps = 28/222 (12%) Query: 10 VLDWYDKYGRKT-LPWQIDKTP-----------YKVWLSEVMLQQTQVATVIPYFERFMA 57 +L+W++ K +PW+ P Y++W+SEVMLQQTQV TVI Y++R+M Sbjct: 96 LLEWFECVREKRGMPWRKKYDPSLSFEEKGQRAYEIWVSEVMLQQTQVTTVIAYWQRWME 155 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVA--TLHGGKFPETFEEV-AAL 114 R+PT++DLA A ++ GLGYY RAR+L A+ V + + G+ P+ + + Sbjct: 156 RWPTISDLAKADVE-----VRGLGYYRRARSLLAGAKTVMGNSKYNGRLPDDPAVLEKEI 210 Query: 115 PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA 174 GVGR TAGAI S++ G PI+DGN+ R+L R AV LW +++++ Sbjct: 211 DGVGRYTAGAICSMAYGVRTPIVDGNIHRLLTRLLAVHAPQTGPATIKFLWRVADELIKH 270 Query: 175 VG--------VERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 + V +NQA+M+LG+ +C + P+C +CPL+ C Sbjct: 271 LPSGDKHNNVVGDWNQALMELGSQVCKPANPECGVCPLKKAC 312 >UniRef50_B3DTE4 A/G-specific DNA glycosylase n=6 Tax=Bifidobacterium RepID=B3DTE4_BIFLD Length = 328 Score = 135 bits (341), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 99/319 (31%), Positives = 155/319 (48%), Gaps = 51/319 (15%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + S ++ W++ R LPW+ + TP+ V +SEVM QQTQ++ V+PY+ +M R+ Sbjct: 1 MNDTAISLRLGAWWEANARD-LPWRFGRATPWGVLVSEVMSQQTQMSRVVPYWTDWMERW 59 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA AP EV+ W LGY RA L + A+ VA + K P T++E+ ALPG+G Sbjct: 60 PDARALAEAPKAEVITAWGRLGYPRRALRLQECARVVAEDYADKLPRTYDELTALPGIGD 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCY------AVSGWPGKKEVENKLWSLSEQVTP 173 TA A++S + G+ ++D N++RVL+R + + P ++ + N++ E + Sbjct: 120 YTASAVMSFAFGERIAVIDTNIRRVLSRVFLGVESRGGATSPAERALANRMLPKDEILGC 179 Query: 174 AVGV--------------------------ER----FNQAMMDLGAMICTRSKPKCSLCP 203 V ER +NQ++M+LGA+ICT P C CP Sbjct: 180 DADVADNAGSAEHVVNSTIRGGKRSRLHRGERPSVTWNQSVMELGAVICTAKSPLCDTCP 239 Query: 204 LQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP-PSGLWGGLYCFP 262 + + C A PG ++T P + Q VL A R P+G P Sbjct: 240 IADDCAFLK----AGRPGLGERRTRPRQRFQGTDRQVRGLVLAALRELPAG-----ATLP 290 Query: 263 QFADEESLRQWLAQRQIAA 281 + E++ + W Q Q+AA Sbjct: 291 R---EDADKLWKDQIQLAA 306 >UniRef50_C1EFR0 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EFR0_9CHLO Length = 574 Score = 135 bits (341), Expect = 2e-30, Method: Compositional matrix adjust. Identities = 96/266 (36%), Positives = 127/266 (47%), Gaps = 34/266 (12%) Query: 27 DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARA 86 D+ Y VW+SE+M QQTQ+ V Y+ R+ R+P V+ LA+A +EV LW GLGYY RA Sbjct: 180 DQYAYGVWVSEIMSQQTQIERVAQYWLRWTERWPDVSSLASATREEVNELWAGLGYYRRA 239 Query: 87 RNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLA 146 L A+ V FP + +A +PGVG TA AI S++ + +DGNV RV Sbjct: 240 GFLLDGARHVVDNCDTIFPNDAKGLANVPGVGPYTAAAIASIAFDEPVAAVDGNVIRVCT 299 Query: 147 RCYAVSGWPGKKEVENKL---------WSLSEQVTPAVGVERFNQAMMDLGAMICTRSKP 197 R AV+G + + W + + P FNQAMM+LGA +CT P Sbjct: 300 RLAAVTGGGDAAKPSSDAAKAVRLCADWLIDDSTRPG----DFNQAMMELGATVCTPKAP 355 Query: 198 KCSLCPLQNGCIAAANNSWALYPGKKPKQ-----------------TLPERTG---YFLL 237 C CPL+ GC AA S A G K + ER G + Sbjct: 356 ACGQCPLRVGCAGAALES-AGSSGFKVTDLPEKEKKPEKREERVAVRVVERKGGRRDGVP 414 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQ 263 + LL +RP GL GGL+ FP Sbjct: 415 GPADSSFLLVRRPEGGLLGGLWEFPS 440 >UniRef50_A0LRD5 HhH-GPD family protein n=5 Tax=Actinomycetales RepID=A0LRD5_ACIC1 Length = 335 Score = 134 bits (338), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 83/211 (39%), Positives = 120/211 (56%), Gaps = 8/211 (3%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A + +VL WY ++GR+ LPW+ D TP++V +SEVMLQQT V+ V+P + + AR+PT Sbjct: 42 AQRVVHRVLAWYRRHGRRDLPWRRSDVTPWQVLVSEVMLQQTPVSRVLPVYAVWTARWPT 101 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA A + + W LGY RA LH+AA+ + GG P+ +A LPG+GR T Sbjct: 102 PQSLAAATPADAVRAWGRLGYPRRALWLHQAARAIVDRFGGIVPDEPGVLATLPGIGRYT 161 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYA----VSGWPGKKEVENKLWSLSEQVTPAVGV 177 A A+ + + + +LD NV+RVLAR +G P E + L + A Sbjct: 162 AAAVAAFAYRRRVAVLDTNVRRVLARFLTGVPHPTGTPRAAEHRSLDALLPKNADRAA-- 219 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 +F+ A+M+LGA+ICT P C+ CPL C Sbjct: 220 -QFSVALMELGALICTSRSPGCARCPLTTDC 249 >UniRef50_C6A996 A/G-specific adenine glycosylase n=9 Tax=Bifidobacteriaceae RepID=C6A996_BIFLB Length = 329 Score = 134 bits (338), Expect = 4e-30, Method: Compositional matrix adjust. Identities = 89/256 (34%), Positives = 136/256 (53%), Gaps = 12/256 (4%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A Q ++ W+ R LPW+ +T P+ V + EVM QQTQ++ V+PY+E +M+++P Sbjct: 38 ARQALRELATWWQTSARD-LPWRYGRTTPWGVLVCEVMSQQTQMSRVVPYWEAWMSQWPD 96 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA A EV+ W LGY RA L + A+ VA + + P ++E+ ALP VG T Sbjct: 97 AASLAAAEKSEVIRAWGRLGYPRRALRLQECAEVVARDYDDRLPREYDELMALPSVGDYT 156 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLWSLSEQVTPAVGVER- 179 A A+LS + G+ ++D N++R L+R + V G + +L+ V P + Sbjct: 157 ASAVLSFAYGERVAVIDTNIRRALSRAFLGVESLGGSCTPLER--ALAWVVLPKAAEQSV 214 Query: 180 -FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA+M+LGA++CT P+C CPLQ C A PG K+T P + + Sbjct: 215 LWNQAVMELGALVCTAKAPQCEQCPLQPQC----EFVKAGMPGLGEKRTRPRQRFHGTDR 270 Query: 239 QHEDEVLLAQRP-PSG 253 Q +L A R P+G Sbjct: 271 QVRGSILQALRALPAG 286 >UniRef50_B7KLS5 HhH-GPD family protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KLS5_CYAP7 Length = 230 Score = 134 bits (337), Expect = 5e-30, Method: Compositional matrix adjust. Identities = 67/194 (34%), Positives = 110/194 (56%), Gaps = 1/194 (0%) Query: 20 KTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTG 79 + PW+ + PY + ++E++LQ+T TV P + F+ R+PT+ L+ A E+ L Sbjct: 34 RDFPWRHTRDPYAILIAEILLQRTSARTVAPVYLEFLRRYPTLMTLSQAREHELSDLMRP 93 Query: 80 LGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDG 139 LG +RA NL + A L+GG+ P++ EE+ LPGVG+ TA A+ + + G +LD Sbjct: 94 LGLRSRAANLKRLAITAIALYGGELPDSEEELLKLPGVGKYTARAVCANAYGHPLAVLDV 153 Query: 140 NVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKC 199 NV R+L R + G ++ + LWS+++ V +R+N ++D GA +C +KP C Sbjct: 154 NVARILRRFFGFDGTKIERR-DAFLWSVAQAVALKRETDRWNLTLIDFGAEVCRATKPNC 212 Query: 200 SLCPLQNGCIAAAN 213 CPL+ C + N Sbjct: 213 RDCPLRGKCQFSLN 226 >UniRef50_B7GUG9 HhH-GPD family protein n=2 Tax=Bifidobacterium RepID=B7GUG9_BIFLI Length = 318 Score = 134 bits (336), Expect = 7e-30, Method: Compositional matrix adjust. Identities = 98/312 (31%), Positives = 158/312 (50%), Gaps = 47/312 (15%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + S ++ W++ R LPW+ + TP+ V +SEVM QQTQ++ V+PY+ +MAR+ Sbjct: 1 MNDTAISLRLGAWWEANARD-LPWRFGRATPWGVLVSEVMSQQTQMSRVVPYWNDWMARW 59 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA AP +V+ W LGY RA L + A+ VA +G + P T++E+ ALPG+G Sbjct: 60 PDARALAAAPKADVITAWGRLGYPRRALRLRECARVVAEEYGDELPRTYDELVALPGIGD 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCY------AVSGWPGKKEVENKLWS------- 166 TA A+LS + G+ ++D N++RVL+R + + P ++ + N++ Sbjct: 120 YTASAVLSFAFGERIAVIDTNIRRVLSRVFLGTESRGGAASPAERALANRMLPQDRVCGD 179 Query: 167 ----------------LSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 L P+V +NQ++M+LGA++CT P C +CP+ + C Sbjct: 180 GADCTDHAYRSGEHTFLQRSEPPSVT---WNQSVMELGAVVCTAKTPLCEICPIADDCAF 236 Query: 211 AANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRP-PSGLWGGLYCFPQFADEES 269 A PG ++T P + Q VL A R P+G+ P+ E++ Sbjct: 237 LK----AGRPGLGERRTRPRQRFQGTDRQVRGLVLAALRELPAGV-----TLPR---EDA 284 Query: 270 LRQWLAQRQIAA 281 + W Q Q+AA Sbjct: 285 DKLWKDQIQLAA 296 >UniRef50_UPI0001699F40 A/G-specific adenine glycosylase n=1 Tax=Endoriftia persephone 'Hot96_1+Hot96_2' RepID=UPI0001699F40 Length = 180 Score = 132 bits (331), Expect = 2e-29, Method: Compositional matrix adjust. Identities = 61/98 (62%), Positives = 73/98 (74%) Query: 23 PWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGY 82 P + PY+VW+SE+MLQQTQVA VIPYF RFMARFP + LA A DEVLHLW+GLGY Sbjct: 4 PGRASGDPYRVWISEIMLQQTQVAKVIPYFNRFMARFPDLASLAAAAEDEVLHLWSGLGY 63 Query: 83 YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 YARARNL KA + + G+FPE E + +LPG+GRS Sbjct: 64 YARARNLRKADEVLQGHFSGRFPEAIEALESLPGIGRS 101 >UniRef50_A0JRA8 HhH-GPD family protein n=19 Tax=Actinomycetales RepID=A0JRA8_ARTS2 Length = 347 Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 73/199 (36%), Positives = 115/199 (57%), Gaps = 3/199 (1%) Query: 12 DWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPL 70 DW+ R LPW+ + +P+ V +SE+MLQQT V V+P +E ++ R+P+ LA Sbjct: 49 DWFGTTARD-LPWRDPECSPWGVLVSEIMLQQTPVVRVLPVWEDWLRRWPSPAHLATEAS 107 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 E + W LGY RA LH AA + H G P T++E+ LPGVG TA A+ + + Sbjct: 108 GEAVRHWGRLGYPRRALRLHAAAVAIVEKHDGGVPGTYDELLELPGVGSYTAAAVAAFAF 167 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE-RFNQAMMDLGA 189 G+ ++D N++RV AR ++ + P + ++ +E + VG+ R+N A+M+LGA Sbjct: 168 GRRETVVDTNIRRVHARLFSGTALPSQSLTAAEMRLAAELLPADVGLSVRWNAAVMELGA 227 Query: 190 MICTRSKPKCSLCPLQNGC 208 ++CT PKC CP++ C Sbjct: 228 LVCTARAPKCGECPVRGAC 246 >UniRef50_A2PD72 A/G-specific adenine glycosylase n=1 Tax=Vibrio cholerae 1587 RepID=A2PD72_VIBCH Length = 154 Score = 130 bits (327), Expect = 6e-29, Method: Compositional matrix adjust. Identities = 67/141 (47%), Positives = 92/141 (65%), Gaps = 5/141 (3%) Query: 208 CIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADE 267 C+A + YPGKKPK P + +F++L H++ V L QRP SG+WGGLYCFPQ ++ Sbjct: 7 CLAKQQGNPQEYPGKKPKTDKPVKATWFVMLYHDNAVWLEQRPQSGIWGGLYCFPQ-SEI 65 Query: 268 ESLRQWLAQRQIAADNLTQ---LTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGN-ALWY 323 +++ + QR I +T L AFRHTFSH+HLDI P+ L +S + EG+ LWY Sbjct: 66 ANIQTTIDQRAIGDSTITSQKTLIAFRHTFSHYHLDITPILLQLSRKPDIVMEGSKGLWY 125 Query: 324 NLAQPPSVGLAAPVERLLQQL 344 NL+QP +GLAAPV++LL L Sbjct: 126 NLSQPDEIGLAAPVKQLLHSL 146 >UniRef50_Q2QW16 Retrotransposon protein, putative, unclassified n=43 Tax=Spermatophyta RepID=Q2QW16_ORYSJ Length = 1795 Score = 130 bits (327), Expect = 7e-29, Method: Compositional matrix adjust. Identities = 73/183 (39%), Positives = 107/183 (58%), Gaps = 19/183 (10%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQI---------------DKTPYKVWLSEVMLQQTQVAT 47 A+ A++L WYD R+ LPW+ +K Y VW+SEVMLQQT+V Sbjct: 52 AAAVRAELLRWYDA-NRRDLPWRRAAEPAGSGSGSGRGEEKRAYAVWVSEVMLQQTRVPV 110 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 V+ Y+ R+MAR+PTV LA A +EV +W GLGYY RAR L + A+Q+ + G+FP T Sbjct: 111 VVDYYSRWMARWPTVDSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQI--VEKGEFPRT 168 Query: 108 FEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 + + G+G TAGAI S++ + P++DGNV RV++R YA+ P + + + Sbjct: 169 ASALREVRGIGDYTAGAIASIAFNEVVPVVDGNV-RVISRLYAIPDNPKESSTVKRFCTR 227 Query: 168 SEQ 170 +Q Sbjct: 228 RQQ 230 >UniRef50_C9SE24 A/G-specific adenine DNA glycosylase n=2 Tax=Sordariomycetes RepID=C9SE24_VERA1 Length = 424 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 80/199 (40%), Positives = 111/199 (55%), Gaps = 24/199 (12%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQT+VA VI Y+ R+ A++PT+ DLA A D VL W GLGYY+RA +H+AA+ V Sbjct: 1 MLQQTRVAVVIDYWNRWTAKWPTIQDLAAAEPDHVLSAWRGLGYYSRATRIHEAARLVVA 60 Query: 99 LHG--GKFPETFEEV-AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSG-W 154 G P+ E+ + +PGVGR TAGAI ++ G P++DGNV RVL+R V G Sbjct: 61 DDEMQGLLPQDVAELESKVPGVGRYTAGAISAIVFGHAAPMVDGNVLRVLSRQLGVFGNV 120 Query: 155 PGKKEVENKLWSLSEQVTPAVGVE-------------------RFNQAMMDLGAMICTRS 195 K + LW+ ++ + AV + R+ QA+M+LG+ ICT Sbjct: 121 KTDKTTIDMLWAAADALVKAVASDGDTGGSSRDDQPPVSDRPGRWGQALMELGSTICT-P 179 Query: 196 KPKCSLCPLQNGCIAAANN 214 KP C+ CP+ C A A Sbjct: 180 KPDCAQCPITASCRAYAEG 198 >UniRef50_Q97IY7 A/G-specific DNA glycosylase n=1 Tax=Clostridium acetobutylicum RepID=Q97IY7_CLOAB Length = 215 Score = 129 bits (325), Expect = 1e-28, Method: Compositional matrix adjust. Identities = 63/208 (30%), Positives = 108/208 (51%), Gaps = 3/208 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVI-PYFERFMARFP 60 + F +LDWYDK ++ PW+ PYKV +SE++LQQT V V+ PYF R + ++ Sbjct: 7 EIYMFQCFLLDWYDK-NKRNFPWRYTFDPYKVLVSEILLQQTNVDKVVEPYF-RIINKYK 64 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + +LA + + +++ G+G + RA L A + + P+ ++E+ + G+G Sbjct: 65 NIHELAESEDVFLKNVFKGIGLFYRADRLKNIAGNIVNYNKKVIPDKWDELIKIKGIGYY 124 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 A+L K + +LD NV R+ R + + + + KL+ ++ + P + Sbjct: 125 ICSALLCFGFNKPYAVLDTNVIRIFERIFDIKSEKKRPRDDIKLFEFAQLLIPEDRYVDY 184 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGC 208 N A++D GA ICT PKCS C + C Sbjct: 185 NYAILDFGACICTMYNPKCSQCIFRFNC 212 >UniRef50_D1BVG2 HhH-GPD family protein n=39 Tax=Actinomycetales RepID=D1BVG2_XYLCX Length = 581 Score = 129 bits (324), Expect = 2e-28, Method: Compositional matrix adjust. Identities = 76/203 (37%), Positives = 113/203 (55%), Gaps = 5/203 (2%) Query: 9 QVLDWYDKYGRKTLPWQI-DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 +V+ W+D R+ LPW+ D+TP+ V +SEVMLQQT V V P + +M R+PT +DLA Sbjct: 299 RVVRWFDG-ARRDLPWRAADRTPWGVLVSEVMLQQTPVVRVEPAWRAWMERWPTPSDLAA 357 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A +VL W LGY RA L + A+ + H G P+ + ALPGVG TA A+ + Sbjct: 358 ASTADVLRAWDRLGYPRRALRLQECARVLVERHDGAVPDDEAALRALPGVGEYTAAAVRA 417 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVERFNQAMM 185 + G+ ++D NV+RVLAR P + + + V P + A M Sbjct: 418 FAFGRRAVVVDTNVRRVLARAVGGVALPAPSPTAAER-ATATAVVPHDDDAAAAWAAASM 476 Query: 186 DLGAMICTRSKPKCSLCPLQNGC 208 +LGA++CT P+C+ CP+++ C Sbjct: 477 ELGALVCTARSPRCAECPVRDLC 499 >UniRef50_Q0W1Z4 Putative DNA glycosylase n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W1Z4_UNCMA Length = 212 Score = 127 bits (319), Expect = 6e-28, Method: Compositional matrix adjust. Identities = 67/209 (32%), Positives = 114/209 (54%), Gaps = 2/209 (0%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L+W+ K R PW+ +PY V ++E MLQ+T+ V+P + +F+ ++P V Sbjct: 3 EIIKNLLNWHKKNYR-PYPWRNVSSPYMVMIAEFMLQRTRADQVVPVYNQFLKKYPDVDQ 61 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A ++++ LG Y RA + AA+ + G P+ E++ +PGVG AGA Sbjct: 62 LAEADIEDIKATLKPLGLYWRANHFKMAAEYIQRTFSGNIPDNKEDLKNIPGVGDYAAGA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 IL+++ K I+D N+ RVL R Y + G G+ + ++ L+ Q+ + A+ Sbjct: 122 ILAVAFRKKSCIVDSNIARVLNRYYGL-GLNGEIRRKKEIVELACQLFNHKEPNKILFAI 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 +D A++CT PK +CPL+N C+ N Sbjct: 181 IDFSAIVCTPVNPKHGICPLKNNCLYYKN 209 >UniRef50_Q9P9L6 U/G & T/G mismatch specific glycosylase (Pa-MIG) n=2 Tax=Thermoprotei RepID=Q9P9L6_PYRAE Length = 230 Score = 127 bits (318), Expect = 7e-28, Method: Compositional matrix adjust. Identities = 63/210 (30%), Positives = 114/210 (54%), Gaps = 4/210 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F ++ WY ++G K LPW+ P+ V ++ ++L++T V V+ + F+ R+P+ Sbjct: 13 TKFRDAIIKWYREFGEKDLPWRKAGDPWAVLVAALLLRKTTVKQVVDIYREFLRRYPSPA 72 Query: 64 DLANAPLDEVLHLWTGLGY-YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+A ++E+ + LG + RA L K ++++ G+ P + + +LPGVG A Sbjct: 73 RLADASVEEIKAIIQPLGMEHVRATLLKKLSEELVRRFNGQIPCDRDALKSLPGVGDYAA 132 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG--VERF 180 +L + GK P+LD N+ RV+ R + + + + +LW+ + + P + F Sbjct: 133 SEVLLTACGKPEPLLDRNMIRVIERVFGIKSKKRRPHTDRELWNFARSLVPRDPELAKEF 192 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNG-CI 209 N ++D +CT PKCSLCPL N C+ Sbjct: 193 NFGVLDFARKVCTAKSPKCSLCPLANNVCV 222 >UniRef50_A1A319 Probable A/G-specific adenine glycosylase n=3 Tax=Bifidobacterium RepID=A1A319_BIFAA Length = 340 Score = 127 bits (318), Expect = 8e-28, Method: Compositional matrix adjust. Identities = 82/241 (34%), Positives = 128/241 (53%), Gaps = 19/241 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A + + + W+ R LPW+ +T P+ V +SEVM QQTQ+ V+PY+ +M R+P Sbjct: 39 DADEIATTLAQWWQTNARD-LPWRFGRTTPWGVLVSEVMSQQTQMGRVVPYWTAWMDRWP 97 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 LA+AP +V+ W LGY RA L + A +A + + P T++E+ ALPG+G Sbjct: 98 DAAALADAPKSDVITAWGRLGYPRRALRLQECAHVIAYDYADELPHTYDELLALPGIGDY 157 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLWSLSEQVTP---AVG 176 TA A+LS + G+ ++D N++RVL+R + V G + +L++++ P + Sbjct: 158 TASAVLSFAFGERIAVVDTNIRRVLSRAFVGVESLGGSASPAER--ALAKRLLPDDDSAK 215 Query: 177 VERF-------NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 RF NQA+M+LGA +CT P C CP+ C N PG ++T P Sbjct: 216 CRRFDRPSVVWNQAVMELGATVCTAKSPLCEACPIAGKCAFLRNGR----PGLGQRRTRP 271 Query: 230 E 230 Sbjct: 272 R 272 >UniRef50_C6WGJ6 HhH-GPD family protein n=48 Tax=Actinomycetales RepID=C6WGJ6_ACTMD Length = 329 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 76/210 (36%), Positives = 117/210 (55%), Gaps = 4/210 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 +++ VL+ + + LPW+ + T + V +SE+MLQQT VA V P + ++ ++P Sbjct: 36 ESAHLPPSVLNTWFAATARDLPWRDPECTAWGVLVSEIMLQQTPVARVEPIWRVWLDKWP 95 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 +D+A A EVL +W LGY RA LH AAQ VA H P+ E + ALPG+G Sbjct: 96 RPSDMAAASQGEVLRMWGKLGYPRRALRLHAAAQAVAAEHDDVVPDDVETLLALPGIGAY 155 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVE 178 TA A+ + + G+ P++D NV+RV+AR +G G L + E + P Sbjct: 156 TARAVAAFAYGRRCPVVDTNVRRVVARAVHGAGDAGPPSTTRDLRDV-EALLPEDEASAA 214 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 ++ A+M+LGA++CT P+CS CP+ C Sbjct: 215 TYSAALMELGALVCTARTPRCSACPVLGSC 244 >UniRef50_C8RS99 A/G-specific DNA glycosylase n=1 Tax=Corynebacterium jeikeium ATCC 43734 RepID=C8RS99_CORJE Length = 385 Score = 122 bits (307), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 83/252 (32%), Positives = 115/252 (45%), Gaps = 47/252 (18%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 + + A++ WY + R LPW+ D TP+ + LSEVM QQT VA VIP + ++ R+ Sbjct: 50 VDSPDLPAKLNAWYARNARD-LPWRHPDTTPWAILLSEVMSQQTPVARVIPLWRAWLERW 108 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT DLA AP E+L +W LGY RA L + A H G P E+ ALPG+G Sbjct: 109 PTPADLAAAPRSEILRMWANLGYPRRALRLKECAIACVERHDGAVPHDIAELEALPGIGH 168 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSG----WPGKKEVENKLWSLSEQVTP-- 173 TA A+ + + G+ P++D NV+RV R V G P + + SL V P Sbjct: 169 YTARAVAAFAFGQAVPVVDTNVRRVYRRL--VDGRYLQGPARARDLADVASLLPHVDPDP 226 Query: 174 --------------------------------AVGVER-----FNQAMMDLGAMICTRSK 196 A +R A+M+LGA+ICT Sbjct: 227 RLDGRQLLPPPQPQPTDTGTGTDSCAGGSTATARDADRDAANLMCAALMELGALICTAKS 286 Query: 197 PKCSLCPLQNGC 208 P C CP+ + C Sbjct: 287 PACDQCPVIDDC 298 >UniRef50_B8P2D0 Predicted protein n=1 Tax=Postia placenta Mad-698-R RepID=B8P2D0_POSPM Length = 562 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 96/311 (30%), Positives = 140/311 (45%), Gaps = 73/311 (23%) Query: 2 QASQFSAQVLDWYDK-YGRKTLPWQIDKTPYK----------------VWLSEVMLQQTQ 44 + S +L+WY+ + + +PW + PY VW+SE+MLQQTQ Sbjct: 77 EPSPLREALLEWYEGVHAARGMPW---RKPYDPSMNNDQRAQRAYEVLVWVSEIMLQQTQ 133 Query: 45 VATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKF 104 VATVIPY+ R+M ++PT+ DLA + ++ V +W GLGYY+RA L AQ+ GG+ Sbjct: 134 VATVIPYYNRWMKKYPTIRDLAASDIETVNSIWRGLGYYSRAARLLAGAQKAVRDFGGRL 193 Query: 105 PETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK 163 P+ +++ A +PG+GR +AGAI S++ R AV P K+ + Sbjct: 194 PDNAKDMEANIPGIGRYSAGAICSIAYND--------------RILAVHAPPKSKQTLDI 239 Query: 164 LW-SLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC--IAAANNSWALY- 219 LW + V NQA+++LG S P C L +G AA+ S A Sbjct: 240 LWQGATAMVEGGTRPGDLNQALIELG------STPWCQAVKLADGKEPEGAASKSGAEVL 293 Query: 220 ------------PGKKPKQTLPERTGYFLLLQHEDEV----------------LLAQRPP 251 P +P + P R + D V LL +RP Sbjct: 294 DIEEICTLCEPLPVGRPVTSFPMRAEKKKAREELDIVNVIEWRSQADGGGRWFLLVRRPE 353 Query: 252 SGLWGGLYCFP 262 GL GGL+ FP Sbjct: 354 GGLLGGLHEFP 364 >UniRef50_Q8II68 A/G-specific adenine glycosylase, putative n=1 Tax=Plasmodium falciparum 3D7 RepID=Q8II68_PLAF7 Length = 613 Score = 122 bits (306), Expect = 2e-26, Method: Compositional matrix adjust. Identities = 61/180 (33%), Positives = 98/180 (54%), Gaps = 2/180 (1%) Query: 31 YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLH 90 Y++++SE+MLQQT+V TV+ ++ ++M ++ + DLA LD+VL LW GLGYY RA+NL Sbjct: 212 YQIYISEIMLQQTKVHTVLNFYLKWMNKWNNIFDLAKCNLDDVLILWKGLGYYNRAKNLL 271 Query: 91 KAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA 150 + + V + G FP + + LPG+G T+ AI + +D N+ R+ +R Sbjct: 272 ECCKIVVDKYNGIFPNDLKLLKTLPGIGDYTSKAICIHLYNRKDICIDTNIIRIFSRITD 331 Query: 151 VSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCI 209 + + +SE + +QA MDLG+ +C S P CS CP+ C+ Sbjct: 332 TINYYNSGTLLKHCEKVSEILCSGESNYSDLSQAFMDLGSSVCNNS-PDCSQCPINKYCM 390 >UniRef50_B3L5A3 A/g-specific adenine glycosylase, putative n=6 Tax=Plasmodium RepID=B3L5A3_PLAKH Length = 582 Score = 119 bits (298), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 60/180 (33%), Positives = 100/180 (55%), Gaps = 2/180 (1%) Query: 31 YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLH 90 Y++++SE+MLQQT+V V+ ++ ++M ++ T+ +LA + LD+VL +W GLGYY RA+NL Sbjct: 177 YQIYVSEIMLQQTRVHAVVNFYLKWMNKWGTIFELAKSNLDDVLIVWKGLGYYNRAKNLL 236 Query: 91 KAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYA 150 + V + G FP + + LPG+G T+ AI + +D NV R+ +R Sbjct: 237 DCCKHVVDKYDGVFPNDLKLLKELPGIGDYTSKAICIHLYNRKDICIDTNVIRIFSRITD 296 Query: 151 VSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCI 209 + G + +S + +QA+MDLG+ IC + P+C+ CPL C+ Sbjct: 297 TINYSGSTVLTKHCERVSHFLCEGDSNYSDLSQALMDLGSSICNGT-PQCAQCPLNKHCL 355 >UniRef50_Q027X1 Helix-hairpin-helix motif n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q027X1_SOLUE Length = 219 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 65/203 (32%), Positives = 108/203 (53%), Gaps = 6/203 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L W+ GR + W+ ++ PY V +SE++L++T V + F+ RFP L+ A Sbjct: 14 LLKWFRSSGR-SFYWRENRDPYVVLVSELLLKKTAAPVVDRFLPAFLKRFPDFASLSRAR 72 Query: 70 LDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 ++ + LG RA L AQ ++ G P T +++ ALPGVG TA ++L + Sbjct: 73 HATLVRILQPLGLSDQRAGQLRALAQAISGSKDGCVPATRQDLLALPGVGEYTANSLLCV 132 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT---PAVGVERFNQAMM 185 S G+ P++D NV R++ R + + + ++W L+ +T P VE N A++ Sbjct: 133 SFGQAVPVVDTNVARIVMRAFGIGHSRCEARRSPEIWGLAADITGNVPTRAVE-VNWALL 191 Query: 186 DLGAMICTRSKPKCSLCPLQNGC 208 DLGA +CT P+C CP+ + C Sbjct: 192 DLGANVCTARTPRCRDCPVSSIC 214 >UniRef50_C0QTN1 T/G-specific DNA glycosylase n=1 Tax=Persephonella marina EX-H1 RepID=C0QTN1_PERMH Length = 228 Score = 116 bits (291), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 61/216 (28%), Positives = 104/216 (48%), Gaps = 8/216 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 F ++ W++ + R PW+ +K PYKV ++E+ L +T + V ++ F+ +FP Sbjct: 15 FDVKHFHEKLFSWWEDHKR-IYPWRFEKDPYKVLIAEIFLHRTNSSQVEKVYKDFIDKFP 73 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + L A +E+ L LG R +K + +GG P +E+ +LPG+G Sbjct: 74 DIRSLLKAKKEEISPLLQSLGLKWRQELFNKMILILREKYGGNIPLNHKELKSLPGIGDY 133 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLAR--CYAVSGWPGKKEV-ENKLWSLSEQVTPAVGV 177 A A++ +L P+LD N+ RV R C ++ + + N ++ L + P V Sbjct: 134 IAAAVIIFTLNNPLPLLDSNIVRVTGRLFCLKITDSSRRSRLFRNYIYCLIYKNDPRV-- 191 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 F A++D A++C P C CPL+N C N Sbjct: 192 --FYYALIDFAALVCKPKDPDCDKCPLRNFCCIYKN 225 >UniRef50_A1AUQ8 HhH-GPD family protein n=2 Tax=Desulfuromonadales RepID=A1AUQ8_PELPD Length = 299 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 56/130 (43%), Positives = 79/130 (60%) Query: 20 KTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTG 79 + +PW+ PY + +SEVMLQQTQV V + +F+ FPTV LA APL+E+L LW G Sbjct: 38 RPMPWRETSDPYHILVSEVMLQQTQVERVKAKYAQFLETFPTVRHLATAPLEELLRLWQG 97 Query: 80 LGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDG 139 LGY RA L + A+Q+ LH G FP T E+ LPG+G TA A+ + + G P ++ Sbjct: 98 LGYNRRAIALKRCAEQIHILHEGNFPTTIHELQNLPGIGPYTARAVAAFAFGLPEPFIET 157 Query: 140 NVKRVLARCY 149 N++ V + Sbjct: 158 NIRTVFIHFF 167 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_A0Z8D4 A/G-specific adenine glycosylase MutY n=2 Tax=Ga... 423 e-117 UniRef50_C9R2J3 A/G-specific adenine glycosylase n=1 Tax=Aggrega... 423 e-117 UniRef50_B8KUN1 A/G-specific adenine glycosylase n=2 Tax=Gammapr... 413 e-114 UniRef50_C7R6W3 A/G-specific adenine glycosylase n=1 Tax=Kangiel... 412 e-113 UniRef50_Q3KJJ2 A/G-specific adenine glycosylase n=5 Tax=Gammapr... 410 e-113 UniRef50_D1RGZ1 A/G-specific adenine glycosylase n=1 Tax=Legione... 409 e-113 UniRef50_Q2SMB7 A/G-specific adenine glycosylase n=10 Tax=Gammap... 405 e-112 UniRef50_C6N1N5 A/G specific adenine glycosylase n=1 Tax=Legione... 405 e-111 UniRef50_P44320 A/G-specific adenine glycosylase n=308 Tax=Gamma... 405 e-111 UniRef50_A9KCK4 A/G-specific adenine DNA glycosylase n=6 Tax=Cox... 400 e-110 UniRef50_UPI0000E105D2 A/G-specific adenine glycosylase n=1 Tax=... 398 e-109 UniRef50_Q3J8X1 A/G-specific DNA-adenine glycosylase n=13 Tax=Ga... 392 e-107 UniRef50_A8PLA9 A/G-specific adenine glycosylase n=1 Tax=Rickett... 391 e-107 UniRef50_Q67NN1 A/G-specific adenine glycosylase n=1 Tax=Symbiob... 389 e-107 UniRef50_Q2YCY4 A/G-specific DNA-adenine glycosylase n=4 Tax=Nit... 384 e-105 UniRef50_B5JTP2 A/G-specific adenine glycosylase n=1 Tax=gamma p... 382 e-105 UniRef50_B0RRL4 A/G-specific adenine glycosylase n=15 Tax=Gammap... 381 e-104 UniRef50_UPI0001BC503C adenine glycosylase n=1 Tax=Neisseria muc... 380 e-104 UniRef50_A1KUL5 Adenine glycosylase n=32 Tax=Neisseriaceae RepID... 379 e-104 UniRef50_Q3A7R2 A/G-specific DNA-adenine glycosylase n=2 Tax=Des... 378 e-103 UniRef50_C9PAA7 A/G-specific adenine glycosylase n=2 Tax=Vibrio ... 378 e-103 UniRef50_C3X6B5 A/G-specific adenine glycosylase n=1 Tax=Oxaloba... 376 e-103 UniRef50_Q89A45 A/G-specific adenine glycosylase n=6 Tax=Buchner... 375 e-102 UniRef50_UPI0000E87CC6 A/G-specific adenine glycosylase n=1 Tax=... 374 e-102 UniRef50_Q2S0Q6 A/G-specific adenine glycosylase n=1 Tax=Salinib... 371 e-101 UniRef50_A5EV94 A-G-specific adenine glycosylase n=1 Tax=Dichelo... 370 e-101 UniRef50_B8FCI1 A/G-specific adenine glycosylase n=1 Tax=Desulfa... 368 e-100 UniRef50_B7GII9 A/G-specific adenine glycosylase n=76 Tax=Bacill... 368 e-100 UniRef50_A8MJ42 A/G-specific adenine glycosylase n=2 Tax=Firmicu... 368 e-100 UniRef50_D0L1V0 A/G-specific adenine glycosylase n=1 Tax=Halothi... 367 e-100 UniRef50_A1WW55 A/G-specific DNA-adenine glycosylase n=1 Tax=Hal... 367 e-100 UniRef50_D2RMB6 A/G-specific adenine glycosylase n=2 Tax=Acidami... 367 e-100 UniRef50_D0MK94 A/G-specific adenine glycosylase n=1 Tax=Rhodoth... 366 e-100 UniRef50_A1VKP4 A/G-specific DNA-adenine glycosylase n=6 Tax=Com... 366 e-100 UniRef50_A2RLW5 A/G-specific adenine glycosylase n=4 Tax=Lactoco... 363 6e-99 UniRef50_C0EDR1 Putative uncharacterized protein n=2 Tax=Clostri... 362 1e-98 UniRef50_Q03V36 A/G-specific DNA-adenine glycosylase n=2 Tax=Leu... 362 1e-98 UniRef50_A6TW33 A/G-specific adenine glycosylase n=1 Tax=Alkalip... 359 6e-98 UniRef50_Q2LY55 A/g-specific DNA glycosylase n=1 Tax=Syntrophus ... 358 2e-97 UniRef50_C4L361 A/G-specific adenine glycosylase n=1 Tax=Exiguob... 358 2e-97 UniRef50_A3ZT46 A/G-specific adenine glycosylase n=1 Tax=Blastop... 357 3e-97 UniRef50_C5EQF4 Putative uncharacterized protein n=2 Tax=Clostri... 357 4e-97 UniRef50_Q8CV61 A/G-specific adenine glycosylase n=1 Tax=Oceanob... 355 1e-96 UniRef50_C9LNH1 A/G-specific adenine glycosylase n=1 Tax=Dialist... 354 2e-96 UniRef50_D1N1B7 A/G-specific adenine glycosylase n=2 Tax=Bacteri... 354 2e-96 UniRef50_A4T064 A/G-specific adenine glycosylase n=2 Tax=Polynuc... 354 3e-96 UniRef50_C5VJP5 A/G-specific adenine glycosylase n=1 Tax=Prevote... 352 1e-95 UniRef50_C0QGE4 MutY n=1 Tax=Desulfobacterium autotrophicum HRM2... 352 1e-95 UniRef50_A6LDG5 A/G-specific adenine glycosylase n=33 Tax=Bacter... 352 1e-95 UniRef50_A5IU34 A/G-specific DNA-adenine glycosylase n=56 Tax=St... 352 1e-95 UniRef50_B3QVZ4 A/G-specific adenine glycosylase n=1 Tax=Chloroh... 351 1e-95 UniRef50_C1ZJ43 A/G-specific DNA-adenine glycosylase n=1 Tax=Pla... 351 3e-95 UniRef50_B9XFP7 A/G-specific adenine glycosylase n=1 Tax=bacteri... 349 8e-95 UniRef50_A4VV79 A/G-specific DNA glycosylase n=98 Tax=Lactobacil... 349 1e-94 UniRef50_C2H8F6 A/G-specific adenine glycosylase n=16 Tax=Entero... 348 1e-94 UniRef50_A9EQ53 A/G-specific adenine glycosylase n=1 Tax=Sorangi... 348 2e-94 UniRef50_Q5WIF8 A/G-specific adenine DNA glycosylase n=11 Tax=Ba... 348 2e-94 UniRef50_C8US26 A/G-specific DNA glycosylase n=18 Tax=Lactobacil... 348 3e-94 UniRef50_A7RIG3 Predicted protein n=1 Tax=Nematostella vectensis... 347 3e-94 UniRef50_Q1DAW2 A/G-specific adenine glycosylase n=1 Tax=Myxococ... 347 3e-94 UniRef50_B8HXE9 A/G-specific adenine glycosylase n=31 Tax=Cyanob... 347 3e-94 UniRef50_A6NQR4 Putative uncharacterized protein n=1 Tax=Bactero... 347 4e-94 UniRef50_D1W5W3 A/G-specific adenine glycosylase n=1 Tax=Prevote... 346 5e-94 UniRef50_B8JJK1 Novel protein similar to H.sapiens MUTYH, mutY h... 346 6e-94 UniRef50_C6LA88 A/G-specific adenine glycosylase n=20 Tax=Clostr... 346 9e-94 UniRef50_A8R947 Putative uncharacterized protein n=1 Tax=Eubacte... 344 2e-93 UniRef50_A0LLV9 A/G-specific DNA-adenine glycosylase n=1 Tax=Syn... 344 2e-93 UniRef50_Q1MS05 A/G-specific DNA glycosylase n=1 Tax=Lawsonia in... 344 3e-93 UniRef50_O31584 Probable A/G-specific adenine glycosylase yfhQ n... 344 4e-93 UniRef50_D1BPM8 A/G-specific adenine glycosylase n=3 Tax=Veillon... 343 4e-93 UniRef50_A3MA77 A/G specific adenine glycosylase n=18 Tax=Acinet... 343 4e-93 UniRef50_C6D1C3 A/G-specific adenine glycosylase n=4 Tax=Bacilla... 343 5e-93 UniRef50_Q9UIF7 A/G-specific adenine DNA glycosylase n=51 Tax=Ch... 343 6e-93 UniRef50_C6ACA0 A/G-specific adenine glycosylase MutY n=6 Tax=Rh... 343 7e-93 UniRef50_C4FU23 Putative uncharacterized protein n=1 Tax=Catonel... 343 7e-93 UniRef50_C4G9U8 Putative uncharacterized protein n=1 Tax=Shuttle... 342 1e-92 UniRef50_C9LHH9 A/G-specific adenine glycosylase n=4 Tax=Bactero... 342 1e-92 UniRef50_A6GM64 Probable a/g-specific adenine glycosylase protei... 341 2e-92 UniRef50_B9E7Y8 A/G-specific adenine glycosylase homolog n=4 Tax... 341 3e-92 UniRef50_C9MR28 A/G-specific adenine glycosylase n=4 Tax=Prevote... 340 5e-92 UniRef50_B8DFN3 A/G-specific adenine glycosylase n=14 Tax=Lister... 340 5e-92 UniRef50_A6CCE7 A/G-specific adenine glycosylase n=1 Tax=Plancto... 339 6e-92 UniRef50_A7VHH6 Putative uncharacterized protein n=1 Tax=Clostri... 339 6e-92 UniRef50_UPI00016C4021 A/G-specific adenine glycosylase n=1 Tax=... 339 7e-92 UniRef50_C3J8R7 A/G-specific adenine glycosylase n=2 Tax=Bacteri... 339 7e-92 UniRef50_Q7NMA0 A/G-specific adenine glycosylase n=1 Tax=Gloeoba... 339 8e-92 UniRef50_Q5HN66 A/G-specific adenine glycosylase n=5 Tax=Staphyl... 339 1e-91 UniRef50_Q1Q8E2 A/G-specific DNA-adenine glycosylase n=4 Tax=Mor... 339 1e-91 UniRef50_Q7VRG8 A/G-specific adenine glycosylase n=2 Tax=Candida... 338 2e-91 UniRef50_C7G751 A/G-specific adenine glycosylase n=1 Tax=Rosebur... 338 2e-91 UniRef50_A0M7B9 A/G-specific adenine glycosylase n=16 Tax=Bacter... 338 3e-91 UniRef50_B0MYW3 Putative uncharacterized protein n=1 Tax=Alistip... 337 5e-91 UniRef50_Q11NU4 A/G-specific adenine glycosylase n=5 Tax=Bactero... 334 2e-90 UniRef50_A6G959 A/G-specific adenine glycosylase n=1 Tax=Plesioc... 334 2e-90 UniRef50_B9Y8G7 Putative uncharacterized protein n=1 Tax=Holdema... 334 2e-90 UniRef50_A8ZZH0 A/G-specific adenine glycosylase n=1 Tax=Desulfo... 334 3e-90 UniRef50_A9KT85 A/G-specific adenine glycosylase n=3 Tax=Clostri... 334 4e-90 UniRef50_A9T1M0 Predicted protein n=1 Tax=Physcomitrella patens ... 334 4e-90 UniRef50_B8J1Y9 A/G-specific adenine glycosylase n=12 Tax=Desulf... 333 7e-90 UniRef50_D2MN98 Putative A/G-specific adenine glycosylase n=1 Ta... 333 9e-90 UniRef50_Q7UYZ0 A/G-specific adenine glycosylase n=1 Tax=Rhodopi... 332 9e-90 UniRef50_C8WS48 A/G-specific adenine glycosylase n=2 Tax=Alicycl... 332 1e-89 UniRef50_Q8A7M6 A/G-specific adenine glycosylase n=2 Tax=Bactero... 332 1e-89 UniRef50_Q03P68 A/G-specific DNA-adenine glycosylase n=2 Tax=Lac... 331 2e-89 UniRef50_B5EQC0 A/G-specific adenine glycosylase n=3 Tax=Acidith... 331 2e-89 UniRef50_UPI000196B020 hypothetical protein CATMIT_00928 n=1 Tax... 330 3e-89 UniRef50_A7HQZ1 A/G-specific adenine glycosylase n=45 Tax=Alphap... 330 4e-89 UniRef50_C4V5S7 Adenine glycosylase n=1 Tax=Selenomonas flueggei... 329 7e-89 UniRef50_C6XRC5 HhH-GPD family protein n=1 Tax=Hirschia baltica ... 329 7e-89 UniRef50_B1Y3P3 A/G-specific adenine glycosylase n=8 Tax=Burkhol... 329 1e-88 UniRef50_D1U9Y5 A/G-specific adenine glycosylase n=1 Tax=Desulfo... 328 1e-88 UniRef50_C9LTK5 A/G-specific adenine glycosylase n=1 Tax=Selenom... 328 2e-88 UniRef50_UPI0001BC63FA A/G-specific adenine glycosylase n=3 Tax=... 328 2e-88 UniRef50_Q0P4V1 Putative uncharacterized protein MGC145569 n=1 T... 328 3e-88 UniRef50_C7PL91 A/G-specific adenine glycosylase n=1 Tax=Chitino... 327 4e-88 UniRef50_D2R2B9 HhH-GPD family protein n=1 Tax=Pirellula staleyi... 325 1e-87 UniRef50_C9KJM4 A/G-specific adenine glycosylase n=3 Tax=Clostri... 325 1e-87 UniRef50_C8WEE4 A/G-specific adenine glycosylase n=3 Tax=Zymomon... 324 2e-87 UniRef50_Q6AKD3 Related to A/G-specific adenine glycosylase n=1 ... 324 3e-87 UniRef50_C0BLH9 A/G-specific adenine glycosylase n=2 Tax=Flavoba... 324 3e-87 UniRef50_Q2KBH8 A/G-specific adenine glycosylase protein n=11 Ta... 324 3e-87 UniRef50_Q02CA8 A/G-specific DNA-adenine glycosylase n=1 Tax=Can... 324 3e-87 UniRef50_Q7MUV2 A/G-specific adenine glycosylase n=2 Tax=Porphyr... 324 4e-87 UniRef50_B6BVY5 A/G-specific adenine glycosylase n=1 Tax=beta pr... 323 6e-87 UniRef50_B7CBH3 Putative uncharacterized protein n=1 Tax=Eubacte... 322 1e-86 UniRef50_B4D777 HhH-GPD family protein n=1 Tax=Chthoniobacter fl... 322 1e-86 UniRef50_C9PXZ8 A/G-specific adenine glycosylase n=2 Tax=Prevote... 322 1e-86 UniRef50_B8FVN1 A/G-specific adenine glycosylase n=2 Tax=Desulfi... 321 2e-86 UniRef50_Q2QW21 Os12g0211400 protein n=6 Tax=Poaceae RepID=Q2QW2... 321 2e-86 UniRef50_A9NG45 A/G-specific adenine DNA glycosylase n=1 Tax=Ach... 321 2e-86 UniRef50_B3EUI2 A/G-specific adenine glycosylase n=1 Tax=Candida... 321 2e-86 UniRef50_C6X0N8 A/G-specific adenine glycosylase n=1 Tax=Flavoba... 319 9e-86 UniRef50_C6XY88 Helix-hairpin-helix motif protein n=4 Tax=Sphing... 317 3e-85 UniRef50_C2KZ90 Putative uncharacterized protein n=1 Tax=Oribact... 317 4e-85 UniRef50_A0L468 A/G-specific DNA-adenine glycosylase n=1 Tax=Mag... 315 1e-84 UniRef50_Q6MF36 Putative A/G-specific adenine glycosylase, mutY ... 315 2e-84 UniRef50_D2QT13 A/G-specific adenine glycosylase n=1 Tax=Spiroso... 315 2e-84 UniRef50_C0YGU7 Possible A/G-specific adenine glycosylase n=1 Ta... 315 2e-84 UniRef50_Q2N933 Putative uncharacterized protein n=1 Tax=Erythro... 314 2e-84 UniRef50_Q2G9R8 A/G-specific DNA-adenine glycosylase n=7 Tax=Alp... 314 3e-84 UniRef50_C1XGJ6 A/G-specific DNA-adenine glycosylase n=1 Tax=Mei... 314 3e-84 UniRef50_Q3KMR2 A/G-specific adenine DNA glycosylase n=9 Tax=Chl... 313 5e-84 UniRef50_Q254A0 Adenine/guanine glycosylase n=5 Tax=Chlamydophil... 313 5e-84 UniRef50_B1LXG3 A/G-specific adenine glycosylase n=74 Tax=Bacter... 312 1e-83 UniRef50_C8P1Q1 A/G-specific adenine glycosylase n=1 Tax=Erysipe... 311 2e-83 UniRef50_B2UNR9 HhH-GPD family protein n=1 Tax=Akkermansia mucin... 311 3e-83 UniRef50_A9BD72 A/G-specific DNA glycosylase n=3 Tax=Cyanobacter... 311 3e-83 UniRef50_C1F8W0 A/G-specific adenine glycosylase, putative n=1 T... 309 1e-82 UniRef50_A3EPM8 Putative A/G-specific DNA glycosylase n=2 Tax=Le... 309 1e-82 UniRef50_Q3SPD3 A/G-specific DNA-adenine glycosylase n=2 Tax=Nit... 307 3e-82 UniRef50_Q3ANF7 Mutator mutT protein n=19 Tax=Cyanobacteria RepI... 307 4e-82 UniRef50_C5YTP8 Putative uncharacterized protein Sb08g006760 n=1... 307 4e-82 UniRef50_Q04T89 A/G-specific DNA glycosylase n=4 Tax=Leptospira ... 305 1e-81 UniRef50_C2MB00 A/G-specific adenine glycosylase n=1 Tax=Porphyr... 304 2e-81 UniRef50_Q1GD89 A/G-specific DNA-adenine glycosylase n=11 Tax=Rh... 303 6e-81 UniRef50_D1PLU8 A/G-specific adenine glycosylase n=2 Tax=Ruminoc... 303 6e-81 UniRef50_Q1ITJ1 A/G-specific DNA glycosylase n=1 Tax=Candidatus ... 300 7e-80 UniRef50_B9SF01 A/G-specific adenine glycosylase muty, putative ... 299 1e-79 UniRef50_C7JG78 DNA glycosylase A/G-specific MutY n=12 Tax=Alpha... 299 1e-79 UniRef50_UPI000034F1ED adenine-DNA glycosylase-related / MYH-rel... 299 1e-79 UniRef50_Q0EVU8 Adenine glycosylase n=1 Tax=Mariprofundus ferroo... 297 3e-79 UniRef50_B0SFS5 A/G-specific DNA glycosylase n=2 Tax=Leptospira ... 296 7e-79 UniRef50_D0J8Q9 A/G-specific adenine glycosylase n=2 Tax=Blattab... 296 1e-78 UniRef50_B3DVQ4 A/G-specific DNA glycosylase n=1 Tax=Methylacidi... 294 2e-78 UniRef50_C6XF00 A/G-specific adenine glycosylase n=1 Tax=Candida... 294 2e-78 UniRef50_UPI0001926FFC PREDICTED: similar to mutY homolog, parti... 293 6e-78 UniRef50_B0D6Q9 Predicted protein n=1 Tax=Laccaria bicolor S238N... 292 1e-77 UniRef50_D2VUC5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 292 2e-77 UniRef50_C6HWU0 Putative A/G-specific DNA glycosylase n=1 Tax=Le... 291 2e-77 UniRef50_Q10159 A/G-specific adenine DNA glycosylase n=1 Tax=Sch... 291 3e-77 UniRef50_A6FJZ0 A/G-specific adenine glycosylase n=7 Tax=Alphapr... 288 2e-76 UniRef50_C1EFR0 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 288 2e-76 UniRef50_B7P4H9 A/G-specific adenine glycosylase muty, putative ... 288 3e-76 UniRef50_Q9ZMT8 A/G-SPECIFIC ADENINE GLYCOSYLASE n=21 Tax=Helico... 287 4e-76 UniRef50_A1W1M2 A/G-specific adenine glycosylase n=13 Tax=Campyl... 286 1e-75 UniRef50_Q5SH32 A/G-specific adenine glycosylase (MutY) n=2 Tax=... 286 1e-75 UniRef50_C5ZWA7 A/G-specific adenine glycosylase n=3 Tax=Helicob... 285 2e-75 UniRef50_UPI0000E49729 PREDICTED: hypothetical protein, partial ... 284 5e-75 UniRef50_Q7VGS8 A/G-specific adenine glycosylase n=537 Tax=Helic... 282 1e-74 UniRef50_Q6IEF2 A/G-specific adenine glycosylase, putative n=3 T... 281 2e-74 UniRef50_Q1V0L8 A/G-specific adenine glycosylase n=3 Tax=Candida... 281 3e-74 UniRef50_Q5KQ30 A/G-specific adenine DNA glycosylase, putative n... 281 3e-74 UniRef50_B7A8Y1 HhH-GPD family protein n=1 Tax=Thermus aquaticus... 281 4e-74 UniRef50_UPI0001746831 A/G-specific adenine glycosylase n=1 Tax=... 279 1e-73 UniRef50_B3T7B3 Putative HhH-GPD superfamily base excision DNA r... 278 2e-73 UniRef50_B6K1Z7 A/G-specific adenine DNA glycosylase n=1 Tax=Sch... 277 5e-73 UniRef50_Q6MHU9 A/G-specific adenine glycosylase n=1 Tax=Bdellov... 276 1e-72 UniRef50_C3XHP3 A/G-specific adenine glycosylase n=1 Tax=Helicob... 272 1e-71 UniRef50_A6Q1G1 A/G-specific adenine glycosylase n=1 Tax=Nitrati... 271 2e-71 UniRef50_B1ZMV1 HhH-GPD family protein n=2 Tax=Verrucomicrobia R... 271 2e-71 UniRef50_Q1J2G1 A/G-specific adenine glycosylase n=3 Tax=Deinoco... 266 7e-70 UniRef50_B9KEL2 A/G-specific adenine glycosylase n=3 Tax=Campylo... 266 1e-69 UniRef50_Q4Q851 A/G-specific adenine glycosylase, putative n=3 T... 266 1e-69 UniRef50_Q1ZXP7 Putative uncharacterized protein MYH n=1 Tax=Dic... 265 2e-69 UniRef50_Q7M9R1 A/G-SPECIFIC ADENINE GLYCOSYLASE EC 3.2.2 n=1 Ta... 265 2e-69 UniRef50_A6Q673 A/G-specific adenine glycosylase n=2 Tax=Epsilon... 263 6e-69 UniRef50_Q2IJ90 A/G-specific DNA-adenine glycosylase n=5 Tax=Cys... 262 1e-68 UniRef50_C0ACJ1 HhH-GPD family protein n=1 Tax=Opitutaceae bacte... 262 2e-68 UniRef50_C4PWX5 A/g-specific adenine dna glycosylase (Ec 3.2.2.-... 260 3e-68 UniRef50_A8I815 DNA repair glycosylase (Fragment) n=1 Tax=Chlamy... 255 2e-66 UniRef50_C9M6M4 A/G-specific adenine glycosylase n=1 Tax=Jonquet... 254 3e-66 UniRef50_D1Y8G9 A/G-specific DNA-adenine glycosylase n=1 Tax=Pyr... 253 9e-66 UniRef50_A6DT67 Adenine glycosylase n=1 Tax=Lentisphaera araneos... 252 1e-65 UniRef50_Q9SU12 Adenine DNA glycosylase like protein n=1 Tax=Ara... 252 2e-65 UniRef50_A7NN47 HhH-GPD family protein n=2 Tax=Bacteria RepID=A7... 247 4e-64 UniRef50_C9RK91 HhH-GPD family protein n=1 Tax=Fibrobacter succi... 247 5e-64 UniRef50_B6K934 Helix-hairpin-helix motif-containing protein n=3... 246 8e-64 UniRef50_B2AMG7 Predicted CDS Pa_5_7310 n=1 Tax=Podospora anseri... 246 1e-63 UniRef50_A9AYZ2 HhH-GPD family protein n=5 Tax=Chloroflexi (clas... 246 1e-63 UniRef50_Q4P7F9 Putative uncharacterized protein n=1 Tax=Ustilag... 245 1e-63 UniRef50_B9L504 Catalytic Domain Of MutyY n=4 Tax=Bacteria RepID... 243 6e-63 UniRef50_D0GU15 A/G-specific adenine glycosylase n=1 Tax=Vibrio ... 243 8e-63 UniRef50_C8NK18 A/G-specific adenine glycosylase n=8 Tax=Coryneb... 239 1e-61 UniRef50_B3DTE4 A/G-specific DNA glycosylase n=6 Tax=Bifidobacte... 238 2e-61 UniRef50_UPI0000510111 A/G-specific adenine glycosylase n=1 Tax=... 237 3e-61 UniRef50_B8P2D0 Predicted protein n=1 Tax=Postia placenta Mad-69... 237 5e-61 UniRef50_A1A5M6 Mutyh protein n=4 Tax=Eukaryota RepID=A1A5M6_RAT 235 1e-60 UniRef50_A9UV22 Predicted protein (Fragment) n=1 Tax=Monosiga br... 235 2e-60 UniRef50_A4AHC9 Adenine glycosylase n=1 Tax=marine actinobacteri... 233 7e-60 UniRef50_D1BVG2 HhH-GPD family protein n=39 Tax=Actinomycetales ... 233 9e-60 UniRef50_B7GUG9 HhH-GPD family protein n=2 Tax=Bifidobacterium R... 232 1e-59 UniRef50_C6A996 A/G-specific adenine glycosylase n=9 Tax=Bifidob... 232 1e-59 UniRef50_B2W3B1 A/G-specific adenine glycosylase n=2 Tax=Pleospo... 231 3e-59 UniRef50_A1RYX1 HhH-GPD family protein n=1 Tax=Thermofilum pende... 230 6e-59 UniRef50_C7YS47 Putative uncharacterized protein n=2 Tax=Nectria... 229 1e-58 UniRef50_Q18HE6 A/G-specific adenine glycosylase n=12 Tax=Haloba... 228 2e-58 UniRef50_A1A319 Probable A/G-specific adenine glycosylase n=3 Ta... 228 3e-58 UniRef50_C1MWM2 Predicted protein n=1 Tax=Micromonas pusilla CCM... 225 1e-57 UniRef50_C8RS99 A/G-specific DNA glycosylase n=1 Tax=Corynebacte... 225 2e-57 UniRef50_A8Q4W7 Putative uncharacterized protein n=1 Tax=Malasse... 223 7e-57 UniRef50_B8DWV0 Endonuclease III n=11 Tax=Bifidobacteriaceae Rep... 222 2e-56 UniRef50_D0RRM3 A/G-specific adenine glycosylase n=1 Tax=alpha p... 221 3e-56 UniRef50_D1Z9Z8 Whole genome shotgun sequence assembly, scaffold... 221 3e-56 UniRef50_A0LRD5 HhH-GPD family protein n=5 Tax=Actinomycetales R... 219 1e-55 UniRef50_B7KLS5 HhH-GPD family protein n=1 Tax=Cyanothece sp. PC... 218 2e-55 UniRef50_Q03K95 DNA-(Apurinic or apyrimidinic site) lyase / endo... 218 2e-55 UniRef50_Q5M3Y9 Endonuclease III, DNA repair n=21 Tax=Firmicutes... 218 3e-55 UniRef50_C8WS61 Endonuclease III n=10 Tax=Firmicutes RepID=C8WS6... 217 4e-55 UniRef50_D1V787 HhH-GPD family protein n=4 Tax=Actinomycetales R... 217 4e-55 UniRef50_O26858 Endonuclease III n=4 Tax=Euryarchaeota RepID=O26... 217 4e-55 UniRef50_C6WGJ6 HhH-GPD family protein n=48 Tax=Actinomycetales ... 216 9e-55 UniRef50_Q0W1Z4 Putative DNA glycosylase n=1 Tax=uncultured meth... 216 1e-54 UniRef50_Q11NT5 DNA-(Apurinic or apyrimidinic site) lyase / endo... 216 1e-54 UniRef50_Q9P9L6 U/G & T/G mismatch specific glycosylase (Pa-MIG)... 215 2e-54 UniRef50_B7G7W7 Predicted protein n=1 Tax=Phaeodactylum tricornu... 213 8e-54 UniRef50_Q1ASN5 DNA-(Apurinic or apyrimidinic site) lyase / endo... 213 8e-54 UniRef50_A4E9V4 Putative uncharacterized protein n=1 Tax=Collins... 213 9e-54 UniRef50_C5WHA6 Endonuclease III n=7 Tax=Firmicutes RepID=C5WHA6... 212 1e-53 UniRef50_C0QG99 Putative endonuclease III n=1 Tax=Desulfobacteri... 212 2e-53 UniRef50_Q9WYK0 Endonuclease III n=12 Tax=Thermotogaceae RepID=E... 211 3e-53 Sequences not found previously or not previously below threshold: >UniRef50_A0Z8D4 A/G-specific adenine glycosylase MutY n=2 Tax=Gammaproteobacteria RepID=A0Z8D4_9GAMM Length = 349 Score = 423 bits (1089), Expect = e-117, Method: Composition-based stats. Identities = 181/344 (52%), Positives = 239/344 (69%), Gaps = 3/344 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F++++LDW+D +GR LPWQ KT Y+VWLSE+MLQQTQV TVIPYF+RF+ RFP+V Sbjct: 2 TSFASRLLDWFDLHGRHDLPWQKSKTAYRVWLSEIMLQQTQVQTVIPYFDRFLTRFPSVE 61 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA AP D+VLHLWTGLGYYARARNLH+AA+ V + GG+FP+ + + LPGVGRSTAG Sbjct: 62 DLAAAPEDDVLHLWTGLGYYARARNLHRAAKLVVSDFGGQFPQDLDGLLQLPGVGRSTAG 121 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILSL++G PILDGNVKRVLAR VSGWPG NKLW+L+E+ TP V + Q Sbjct: 122 AILSLAMGIRAPILDGNVKRVLARHDVVSGWPGTTTTLNKLWALAEEYTPTARVADYTQG 181 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-ED 242 +MDLGA +CTRS P C CPL N C A + A +PGKKP++TLPER YF++ Q + Sbjct: 182 IMDLGATLCTRSAPGCQHCPLTNTCQARSRCEQAHFPGKKPRKTLPERETYFVIFQRPDG 241 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT-QLTAFRHTFSHFHLDI 301 VL+ +RP G+WGGL+CFP+ D ++ LA + +A T L +H+FSH+ L I Sbjct: 242 TVLMEKRPEQGIWGGLWCFPESQDGDASPAHLATMRASAQEPTIYLDPIKHSFSHYRLTI 301 Query: 302 VPMWLPVSSFT-GCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 P+ L + + G + ++ W + P S+GL PV ++L+ L Sbjct: 302 RPILLKIIRASEGVAEVNSSQWIDPQTPGSIGLPVPVTQVLKTL 345 >UniRef50_C9R2J3 A/G-specific adenine glycosylase n=1 Tax=Aggregatibacter actinomycetemcomitans D11S-1 RepID=C9R2J3_AGGAD Length = 396 Score = 423 bits (1087), Expect = e-117, Method: Composition-based stats. Identities = 212/359 (59%), Positives = 255/359 (71%), Gaps = 23/359 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ VL WY K+GRK LPWQ +KT Y VWLSEVMLQQTQV TVIPYFERF+ FP +T L Sbjct: 28 FAHAVLTWYAKFGRKNLPWQQNKTLYGVWLSEVMLQQTQVTTVIPYFERFVKTFPNLTAL 87 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANAPLDEVLHLWTGLGYYARARNLHKAAQ + HGG+FP FE+V ALPGVGRSTAGA+ Sbjct: 88 ANAPLDEVLHLWTGLGYYARARNLHKAAQIMRDQHGGEFPTEFEQVWALPGVGRSTAGAV 147 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS L +PILDGNVKRVL+R +AVSGWPG+K+ E+ LW L+ QVTP V FNQAMM Sbjct: 148 LSSCLNAPYPILDGNVKRVLSRYFAVSGWPGEKKTEDHLWHLTAQVTPTEQVADFNQAMM 207 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAM+CTRSKPKC LCPL++ C A A +W +YPG+KPK+ LPER YFLLL+ + ++ Sbjct: 208 DIGAMVCTRSKPKCELCPLKSDCKANAEQNWQVYPGRKPKKVLPERESYFLLLEKDGKIA 267 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR +GLWGGLYCFPQF++++ L +LA I + TAFRHTFSHFHLDI P++ Sbjct: 268 LEQRDNAGLWGGLYCFPQFSEKQELLAYLAANGI--RQYQEWTAFRHTFSHFHLDIYPIY 325 Query: 306 LPVSSFTGCMDEGNA---------------------LWYNLAQPPSVGLAAPVERLLQQ 343 V + +E + WY+ P +GLAAPV+ LL Q Sbjct: 326 AQVDTPKALAEENRSHRQKVAETAGEYQSDLLSAVKYWYDPQNPEPIGLAAPVKNLLTQ 384 >UniRef50_B8KUN1 A/G-specific adenine glycosylase n=2 Tax=Gammaproteobacteria RepID=B8KUN1_9GAMM Length = 363 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 173/338 (51%), Positives = 227/338 (67%), Gaps = 2/338 (0%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++++L+W+D +GRK LPWQ D TPY+VW+SE+MLQQTQVATVIPY+ERFM FPTV L Sbjct: 11 LASRLLNWFDDHGRKDLPWQTDTTPYRVWISEIMLQQTQVATVIPYYERFMGEFPTVEAL 70 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 + AP D+VL LW+GLGYYARARNLH+ A+ V GG+FP+T + + LPG+GRSTAGAI Sbjct: 71 SAAPEDDVLKLWSGLGYYARARNLHRGAKMVTGDLGGEFPDTVDGLCTLPGIGRSTAGAI 130 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S+++G PILDGNVKRVLAR +AV GWPGK V +LW +E TP V + QA+M Sbjct: 131 ISIAMGGRAPILDGNVKRVLARHHAVDGWPGKSGVAAELWGHAEAHTPNTRVADYTQAIM 190 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF-LLLQHEDEV 244 DLGA +CTR +P+C +CPL + C A YPGKKPK+T P R+ +F L++ H V Sbjct: 191 DLGATLCTRRRPQCLVCPLVDTCHAGRAGDPEQYPGKKPKRTTPTRSAFFALVVDHTGAV 250 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 LL +RPPSG+WGGL+ PQ D L + + + QL HTFSHF L I P+ Sbjct: 251 LLQKRPPSGIWGGLWSLPQSPDRGELENLAQRFGVVDPEVEQLPMIEHTFSHFRLAITPL 310 Query: 305 WLPVSS-FTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + +S+ T + W+ Q G+ APV +LL Sbjct: 311 RIRLSTAHTSVSEPTEYKWHQFTQSLPGGIPAPVAKLL 348 >UniRef50_C7R6W3 A/G-specific adenine glycosylase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R6W3_KANKD Length = 355 Score = 412 bits (1059), Expect = e-113, Method: Composition-based stats. Identities = 173/360 (48%), Positives = 229/360 (63%), Gaps = 23/360 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A+ F+ QV+D+Y +GRK LPWQ + PY++WLSE+MLQQTQV TVIPYFERF+ FPT Sbjct: 6 SATNFARQVIDYYHAFGRKHLPWQNTQDPYRIWLSEIMLQQTQVTTVIPYFERFVESFPT 65 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA+A D+V+ W+GLGYY+RARNLHKAA+ + T GG FP+ E + LPG+GRST Sbjct: 66 VLDLAHATDDQVMQHWSGLGYYSRARNLHKAAKIIETEFGGDFPQDPEVIETLPGIGRST 125 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S + + ILDGNVKRVLARCYA+ GWPG +V LW +E TP +N Sbjct: 126 AGAIASFAFDQPTAILDGNVKRVLARCYAIEGWPGNGKVLKALWERAEANTPTQETAAYN 185 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QAMMDLGA++CTR+KP C CPL C+A NNS YPGKKPK+ P + Y+L+ +E Sbjct: 186 QAMMDLGAVVCTRTKPNCPDCPLSKHCLAYHNNSIEQYPGKKPKKARPSKAVYWLISINE 245 Query: 242 ----DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 +VLL +RPPSG+WGGL+ P+ ++ I+ D ++QL F H FSH+ Sbjct: 246 TSDQTQVLLHKRPPSGIWGGLWALPEVEQQD----------ISPDKISQLDPFVHKFSHY 295 Query: 298 HLDIVPM---------WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 L + P+ + S M+ A W++ Q +GL PV +LL +L + Sbjct: 296 DLQVQPLLLSSKADKALVEKKSHPSIMEPAQADWFSQNQLSEIGLPTPVSKLLSKLLGDS 355 >UniRef50_Q3KJJ2 A/G-specific adenine glycosylase n=5 Tax=Gammaproteobacteria RepID=Q3KJJ2_PSEPF Length = 355 Score = 410 bits (1055), Expect = e-113, Method: Composition-based stats. Identities = 182/344 (52%), Positives = 233/344 (67%), Gaps = 1/344 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M+A QFS VL+W+D++GR LPWQ + PY+VW+SE+MLQQTQV+TV+ YF+RFMA P Sbjct: 1 MRAEQFSTAVLEWFDRHGRHDLPWQQNINPYRVWVSEIMLQQTQVSTVLNYFDRFMAALP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA AP DEVLHLWTGLGYY RARNL K A+ V + +GG+FP E++ LPG+G S Sbjct: 61 TVEALAAAPEDEVLHLWTGLGYYTRARNLQKTAKIVVSQYGGEFPRDVEKLTELPGIGLS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S+S+G PILDGNVKRVLAR A G+PG+ +V +LW+ +E+ TP V + Sbjct: 121 TAGAIGSISMGLRAPILDGNVKRVLARFTAQEGYPGEPKVAKQLWANAERFTPQDRVNAY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP-ERTGYFLLLQ 239 QAMMDLGA +CTRSKP C LCPL+ GC A YP KP++ +P +RT LL Sbjct: 181 TQAMMDLGATLCTRSKPSCLLCPLEKGCEAHMLGLETRYPIPKPRKAIPQKRTLMPLLAN 240 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + +LL +RP SGLWGGL+ P+ D + L+ AQ + L + HTFSHF L Sbjct: 241 EDGAILLYRRPSSGLWGGLWSLPELDDLDDLQHLAAQHSLELGEQQALPSLVHTFSHFQL 300 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 I P + V G + E + LWYNLA PP +GLAAPV+ LL++ Sbjct: 301 SIEPWLVQVQEAGGHVAEADWLWYNLATPPRLGLAAPVKTLLER 344 >UniRef50_D1RGZ1 A/G-specific adenine glycosylase n=1 Tax=Legionella longbeachae D-4968 RepID=D1RGZ1_LEGLO Length = 357 Score = 409 bits (1051), Expect = e-113, Method: Composition-based stats. Identities = 152/342 (44%), Positives = 215/342 (62%), Gaps = 2/342 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QFS +L W+D +GRK LPWQ +TPY+VW+SE+MLQQTQV TVIPYFERFM RFP ++ Sbjct: 8 EQFSKPLLLWFDLHGRKNLPWQHPRTPYRVWVSEIMLQQTQVQTVIPYFERFMQRFPNIS 67 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA+A D+VL LW+GLGYY+RARNLH+ A+ + H G FP + + LPG+G ST+ Sbjct: 68 DLAHAQEDDVLSLWSGLGYYSRARNLHQTAKLILQHHQGIFPNDSKLLNELPGIGPSTSA 127 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS + K ILDGNVKRVL R + ++G+P + +V+ KLW L++ P + QA Sbjct: 128 AILSQAFNKPAAILDGNVKRVLTRFFRITGYPEQSQVKKKLWELADLCMPQENCADYTQA 187 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE-D 242 +MDLGA C P CS CPL C+A + L+P KK K+++P + L+L +E Sbjct: 188 IMDLGATCCITKNPHCSSCPLHINCLAFKHKEQHLFPTKKIKKSIPIQYQQLLVLHNEQG 247 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFHLDI 301 + L +RP GLWGGL+C P + ++ ++ ++ ++ L AF+H FSHFHL+I Sbjct: 248 YIYLEKRPSPGLWGGLWCLPNISTDKCPLDFIRVNYDLSGEHPQPLLAFKHRFSHFHLEI 307 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + + F + E W+ Q S+GLA P ++L Sbjct: 308 NALSIKTTFFGRQLSENQGQWFAKEQLNSLGLAQPTRKILSY 349 >UniRef50_Q2SMB7 A/G-specific adenine glycosylase n=10 Tax=Gammaproteobacteria RepID=Q2SMB7_HAHCH Length = 388 Score = 405 bits (1042), Expect = e-112, Method: Composition-based stats. Identities = 163/351 (46%), Positives = 223/351 (63%), Gaps = 12/351 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F+A++L W+D++GR LPWQ +TPY VW+SE+MLQQTQV+TVIPYF +FM FPTV Sbjct: 29 NSFAARLLTWFDQHGRHDLPWQDPRTPYHVWISEIMLQQTQVSTVIPYFIKFMESFPTVA 88 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A D VL W GLGYYARARNLHKAA+ + G+FP T E + LPG+GRSTAG Sbjct: 89 ALAEADQDTVLSHWAGLGYYARARNLHKAAKTIVEKFNGEFPNTLETIQELPGIGRSTAG 148 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS+ G PILDGNVKRVL R A+ GWPGK+E+E +LW L++ TP V + QA Sbjct: 149 AILSMGFGIRAPILDGNVKRVLCRHDAIEGWPGKREIETRLWELADAYTPEERVTDYTQA 208 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED- 242 +MDLGA +CTRSKP C+ CP++ C A + P K + +P + L+LQ+ED Sbjct: 209 IMDLGATLCTRSKPACARCPMETTCQGLAQDMTDQLPTPKAGKKIPTHERFVLVLQNEDG 268 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADE----ESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 +LL +RPP+G+WGGL+ P+ ++L+ W +++ + FRHTFSH+H Sbjct: 269 HILLERRPPTGIWGGLWSLPESEKNMDLPDALQSWKEHKRLQLTQYELASPFRHTFSHYH 328 Query: 299 LDIVPMWLPVSSFTGC------MDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 L + P+ + + E ++ W++ Q GL AP+E+ L Q Sbjct: 329 LILKPVIASTADIERPDDLAFIVGESDSQWFDPDQELP-GLPAPIEKWLSQ 378 >UniRef50_C6N1N5 A/G specific adenine glycosylase n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N1N5_9GAMM Length = 348 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 155/343 (45%), Positives = 211/343 (61%), Gaps = 2/343 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QFS +L+W+ +GRK LPWQ+ ++PY+VW+SE+MLQQTQV TVIPYFERFM RFP++ Sbjct: 6 EQFSKPLLNWFSLHGRKNLPWQLPRSPYRVWVSEIMLQQTQVQTVIPYFERFMQRFPSIQ 65 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A DEVL LW+GLGYY+RARNLHK AQ +AT + G FP+ V LPG+G STA Sbjct: 66 DLAQANEDEVLSLWSGLGYYSRARNLHKTAQIIATEYQGIFPDELALVHELPGIGASTAA 125 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS + + ILDGNVKRVL R + + G P + V+ KLW L++ P + QA Sbjct: 126 AILSQAFNQPTAILDGNVKRVLTRFFMIQGHPEQALVKKKLWELAQACMPQEDCADYTQA 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL-QHED 242 +MDLGA CT C CPLQ+ C+A + LYP KK K+ +P + L+L + Sbjct: 186 IMDLGATCCTTKNFNCLNCPLQDNCLALKHQEQHLYPTKKIKKPVPVQQQQLLVLCNKQG 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFHLDI 301 + L +RPP+GLWGGL+C P + ++ + + +N L AF+H FSHF L+I Sbjct: 246 HIYLEKRPPTGLWGGLWCLPSIDEGNCPLDFIRTEYDLVGENPQPLIAFKHRFSHFLLEI 305 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + + E W+ S+GLA P ++L L Sbjct: 306 NALKIITLALGTKLGEAKGQWFAKENLGSLGLATPTSKILSLL 348 >UniRef50_P44320 A/G-specific adenine glycosylase n=308 Tax=Gammaproteobacteria RepID=MUTY_HAEIN Length = 378 Score = 405 bits (1042), Expect = e-111, Method: Composition-based stats. Identities = 210/360 (58%), Positives = 253/360 (70%), Gaps = 24/360 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ VL WYDK+GRK LPWQ +KT Y VWLSEVMLQQTQVATVIPYFERF+ FP +T L Sbjct: 11 FAKSVLAWYDKFGRKHLPWQQNKTLYGVWLSEVMLQQTQVATVIPYFERFIKTFPNITAL 70 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA DEVLHLWTGLGYYARARNLHKAAQ+V G FP FE+V AL GVGRSTAGAI Sbjct: 71 ANASQDEVLHLWTGLGYYARARNLHKAAQKVRDEFNGNFPTNFEQVWALSGVGRSTAGAI 130 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS L + +PILDGNVKRVLAR +AV GW G+K+VEN+LW+L+EQVTP V FNQAMM Sbjct: 131 LSSVLNQPYPILDGNVKRVLARYFAVEGWSGEKKVENRLWALTEQVTPTTRVADFNQAMM 190 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAM+C R+KPKC LCPL C+A N +W +P KKPK+ +PE+T YFL+L +V Sbjct: 191 DIGAMVCMRTKPKCDLCPLNIDCLAYKNTNWEKFPAKKPKKAMPEKTTYFLILSKNGKVC 250 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR SGLWGGL+CFPQF D+ SL +LAQ ++ + + +FRHTFSHFHLDI P++ Sbjct: 251 LEQRENSGLWGGLFCFPQFEDKSSLLHFLAQEKVT--HYQEWPSFRHTFSHFHLDIHPIY 308 Query: 306 LPVSSFTGCMDEGNA---------------------LWYNLAQPPSVGLAAPVERLLQQL 344 + S T C+++ N WY+ P +GLA PV+ LL Q Sbjct: 309 AEMES-TLCVEQANLDWRKVMESTKEYQSNLSSAVKYWYDPQNPEPIGLAQPVKNLLIQF 367 >UniRef50_A9KCK4 A/G-specific adenine DNA glycosylase n=6 Tax=Coxiella burnetii RepID=A9KCK4_COXBN Length = 354 Score = 400 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 177/353 (50%), Positives = 234/353 (66%), Gaps = 3/353 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + QF+ VL W+D+YGR LPWQ TPY+VW+SE+MLQQTQV+TVIPYFERF+ RFP Sbjct: 1 MDSKQFAQGVLRWFDRYGRHDLPWQKKLTPYRVWVSEIMLQQTQVSTVIPYFERFIKRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA APLDE+L W+GLGYYARARNLH+AAQ + + G+FP T E +++LPG+GRS Sbjct: 61 TVGALALAPLDEILAHWSGLGYYARARNLHRAAQIIHVTYHGRFPSTVETLSSLPGIGRS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+LSL + ++ ILDGNVKRVLAR A+ ++ N LW+L+E+ TP + Sbjct: 121 TAGAVLSLGMHQYAVILDGNVKRVLARYNALDVPINQQVGINILWNLAEKYTPKNRCWDY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMMD+GAMICTR+KPKCSLCPL++ C A + +P KK K ++ Y LLL++ Sbjct: 181 NQAMMDIGAMICTRTKPKCSLCPLKSSCKAHRLSQQMNFPIKKAKTARAQKAAYLLLLRN 240 Query: 241 -EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLA-QRQIAADNLTQLTAFRHTFSHFH 298 E+LL +RPP+G+WGGL+ FPQ EE + +W + A + + H FSHF Sbjct: 241 SRGEILLEKRPPTGIWGGLWSFPQCPIEEDIDKWCQTKLGFEAVICERWNSIFHQFSHFE 300 Query: 299 LDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV 350 +I P+ L + + M E +WY G+AAPV RLL+QL V Sbjct: 301 FEIKPVLLQIETRQPRMMECPPQIWYKEHSALPGGIAAPVARLLKQLTQQTAV 353 >UniRef50_UPI0000E105D2 A/G-specific adenine glycosylase n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105D2 Length = 375 Score = 398 bits (1024), Expect = e-109, Method: Composition-based stats. Identities = 180/370 (48%), Positives = 232/370 (62%), Gaps = 29/370 (7%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+ L W+D +GRK LPWQ + + Y VW+SE+MLQQTQV TVIPYFERFM FPTV D Sbjct: 4 DFAQTALAWFDLHGRKHLPWQQNISAYSVWISEIMLQQTQVTTVIPYFERFMRSFPTVND 63 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LANAP ++VLH WTGLGYYARARNLHKAAQQ+A H G FP FEEV ALPG+GRSTAGA Sbjct: 64 LANAPQEDVLHHWTGLGYYARARNLHKAAQQIAEHHNGVFPTDFEEVLALPGIGRSTAGA 123 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 IL+++ ++ PILDGNVKRVLAR +AV GWPG K+VE++LW + ++TP+ + + Q M Sbjct: 124 ILAIAEHQNHPILDGNVKRVLARFFAVEGWPGSKKVEDELWHFAGELTPSERIADYTQVM 183 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLGA +CTRSKPKC +CPLQ+ C+A A+ + P KKPK+T+P + ++ D V Sbjct: 184 MDLGATLCTRSKPKCEVCPLQSRCLAFASGRQSELPHKKPKKTIPSKYTTVIIPMLFDRV 243 Query: 245 LLAQRPPSGLWGGLYCF-----P--QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ +RP G+WGGL+ F P Q DE Q + L F+HTFSHF Sbjct: 244 LMTKRPEEGIWGGLWWFGGEFTPTAQKVDEHPTIYTALGEQFDVVSSDMLPEFKHTFSHF 303 Query: 298 HLDIVPMW-------------------LPVSSFTGC---MDEGNALWYNLAQPPSVGLAA 335 HL I P+ + + TG + + W + QP +GL Sbjct: 304 HLHIQPVILYLTTDKVSEAVLSNSMPQMTLEDLTGAGIRDGQVHQRWVDYRQPADIGLCK 363 Query: 336 PVERLLQQLR 345 P + QL+ Sbjct: 364 PALTIFNQLQ 373 >UniRef50_Q3J8X1 A/G-specific DNA-adenine glycosylase n=13 Tax=Gammaproteobacteria RepID=Q3J8X1_NITOC Length = 354 Score = 392 bits (1007), Expect = e-107, Method: Composition-based stats. Identities = 163/348 (46%), Positives = 221/348 (63%), Gaps = 3/348 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M S FS ++L W+D YGRK LPWQ + T Y+VW+SE+MLQQTQVATVIPY++RF+ RFP Sbjct: 1 MNKSDFSQRLLTWFDAYGRKDLPWQQNPTLYRVWVSEIMLQQTQVATVIPYYQRFIERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA+A +DE+L LW GLGYYARAR LH+AA+ HGG+ P T E + LPG+GRS Sbjct: 61 SLPALAHASVDEILGLWAGLGYYARARRLHQAARIAWETHGGELPATLEALMELPGIGRS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GAIL+L+LG+ +PILDGNVKRVL R A+ WPG+ +VE +LW + + P + + Sbjct: 121 TGGAILALALGQRYPILDGNVKRVLTRQEAIEHWPGQPKVEKQLWQRAATLLPRTRLADY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLLQ 239 QA+MDLGA +CTR +P C CP++ C A + YP +P++ LP R T +LL Sbjct: 181 TQAIMDLGATVCTRHRPHCPSCPVKKTCQAHLQENPEAYPRSRPRKRLPLRATCMLILLN 240 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFH 298 + EVLL +RPP G+WGGL+ FP+ + W Q + RH F+HF Sbjct: 241 DQGEVLLERRPPVGIWGGLWSFPECPPQTEAALWCQEQFGWPIGEVQHWPPLRHHFTHFT 300 Query: 299 LDIVPMWLPVS-SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 LDI P+ + M+ + +WY + GL AP RLL++LR Sbjct: 301 LDIQPVIARIRGEARQVMEPNSQVWYKMEPMYKRGLPAPTLRLLKRLR 348 >UniRef50_A8PLA9 A/G-specific adenine glycosylase n=1 Tax=Rickettsiella grylli RepID=A8PLA9_9COXI Length = 354 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 162/349 (46%), Positives = 222/349 (63%), Gaps = 5/349 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + +QF ++L W+ + GRK LPWQ ++PY WLSE+MLQQTQV TVIPYF+RF FP Sbjct: 3 LTPAQFQKKILTWFHQSGRKHLPWQQVQSPYFTWLSEIMLQQTQVTTVIPYFQRFTHHFP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T++ LANA LDEV+ LW+GLGYYARAR+LH+ AQ + + GKFP+ + LPG+GRS Sbjct: 63 TLSSLANASLDEVIRLWSGLGYYARARHLHRCAQIIEEKYKGKFPQALILLQNLPGIGRS 122 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVER 179 TAGAI +L+ + ILDGNVKRV +R + +SGWPG V +LW+L+E+ TP V Sbjct: 123 TAGAIRALAFNQPAAILDGNVKRVFSRFHTLSGWPGLTHVNKQLWTLAERYTPHNKHVRH 182 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLL 238 + QAMMDLGA+ICT +C+ CPLQ C A + YP KP + P R +L+ Sbjct: 183 YTQAMMDLGALICTPKHAQCTECPLQRHCKAYKEDCVIHYPSSKPVKMNPSRAITMVILI 242 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQ--IAADNLTQLTAFRHTFSH 296 E+LL +RPP G+WGGL+ P+ E ++ + + ++ +HTFSH Sbjct: 243 NARQEILLEKRPPRGIWGGLWSLPECTITEKIKNFCKKNYHCETEKSIKNYPFLKHTFSH 302 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL 344 F L+I P+ + V ++ E N + WYN Q S+GLAAPV++LL QL Sbjct: 303 FQLEIRPVAIRVKKWSPPQMESNPIVWYNKQQLASIGLAAPVKKLLNQL 351 >UniRef50_Q67NN1 A/G-specific adenine glycosylase n=1 Tax=Symbiobacterium thermophilum RepID=Q67NN1_SYMTH Length = 365 Score = 389 bits (999), Expect = e-107, Method: Composition-based stats. Identities = 123/346 (35%), Positives = 181/346 (52%), Gaps = 8/346 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + ++LDWY R LPW+ + PY +W+SE+MLQQT+V TVIPY+ER+MARFPT+ L Sbjct: 1 MADRLLDWYSASAR-DLPWRRTRDPYHIWVSEIMLQQTRVETVIPYYERWMARFPTLEAL 59 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+AP ++VL W GLGYY+RARNLH AA++V +GG P+ + VA+L G+G TAGAI Sbjct: 60 ADAPEEQVLKAWEGLGYYSRARNLHAAAREVVARYGGTVPDDPDAVASLKGIGPYTAGAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS++ + P +DGNV RV+AR YA+ + + L + P FNQA+M Sbjct: 120 LSIAFNRPVPAVDGNVLRVIARLYAIVDDIAQLATRRTIEELVRAMIPQDRPGDFNQALM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA ICT +P+C LCP+++ C P K + +++ E +L Sbjct: 180 DLGATICTPRRPRCLLCPVRDLCEGRRAGRETELPVKGRAKAPRLVERAAAVIEREGRLL 239 Query: 246 LAQRPPSGLWGGLYCF------PQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFH 298 + +RP GL GL+ P EE+L+ L L H FSH Sbjct: 240 IVRRPQEGLLAGLWELPGGDVPPGVGPEEALQTLLRNALGAEVAVGEHLADVTHVFSHLV 299 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + V T + + W + + A ++ +++ Sbjct: 300 WHLRCFRAEVVPGTEVAERADVRWVKVDALGAYPFPAIYHKVFERV 345 >UniRef50_Q2YCY4 A/G-specific DNA-adenine glycosylase n=4 Tax=Nitrosomonadaceae RepID=Q2YCY4_NITMU Length = 383 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. Identities = 148/354 (41%), Positives = 214/354 (60%), Gaps = 15/354 (4%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A+ F+ +++ W ++GR LPWQ + Y +WLSE+MLQQTQV TVIPY+ RF+ FP + Sbjct: 12 ANSFATRLIRWQREHGRHHLPWQNTRDAYSIWLSEIMLQQTQVGTVIPYYRRFLQCFPDI 71 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+APLDEV+ W+GLGYY+RARNLHKAAQ++ HGG FPE + LPG+GRSTA Sbjct: 72 QSLASAPLDEVMVQWSGLGYYSRARNLHKAAQRIVGEHGGIFPEEVAIIRQLPGIGRSTA 131 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-------V 175 AI + GK ILDGNVKR+L+RC+ + G+PG+K+VE +LW +E + P Sbjct: 132 AAIAVFAFGKRAAILDGNVKRILSRCFGIEGYPGEKQVEAQLWQKAEALLPKGDESPIER 191 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 +E + QA+MDLGA IC R++P C CPL+ C+A +N P +P++ LPER Sbjct: 192 DIEGYTQALMDLGATICIRARPMCGSCPLRLECVAFRDNRAGSLPTPRPRKILPEREAVL 251 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 LL + ++LL +RP +G+WG L+ P+ E++ ++ + I ++Q+ A HTF+ Sbjct: 252 LLAVAQGKILLEKRPSTGIWGALWSLPEMGMNENVIEYCLRFGINVRPMSQMEALTHTFT 311 Query: 296 HFHLDIVPMWLPVSSFTG--------CMDEGNALWYNLAQPPSVGLAAPVERLL 341 HF L I P+ L V S +W + +PV ++L Sbjct: 312 HFRLRIYPLILQVISRPPDHLTPEVLSQPRRPCVWRMPEDALKAAIPSPVRKVL 365 >UniRef50_B5JTP2 A/G-specific adenine glycosylase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JTP2_9GAMM Length = 354 Score = 382 bits (982), Expect = e-105, Method: Composition-based stats. Identities = 177/353 (50%), Positives = 227/353 (64%), Gaps = 7/353 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS ++L WY GR LPWQ DKTPY+VW+SE+MLQQTQVATVIPY++RFMA FP Sbjct: 1 MSERSFSERLLAWYRVSGRHDLPWQQDKTPYRVWVSEIMLQQTQVATVIPYYQRFMASFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ DLA+A D+VL W+GLGYYAR RNLHKAA+Q G P+ E + ALPG+G S Sbjct: 61 TLEDLASATQDQVLQHWSGLGYYARGRNLHKAARQAHEAW-GDLPDDVEALEALPGIGHS 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVER 179 TAGAI S++ G+ ILDGNVKRVLAR AVSGWPG+ V +LW ++ TP AV Sbjct: 120 TAGAISSIAFGRRALILDGNVKRVLARHRAVSGWPGRTAVSRELWQWADLYTPSAVQAGE 179 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLL 238 + QA+MDLGA +CTR++P+C CP+ C+ + +PGKKPK+ P + +L+ Sbjct: 180 YAQAIMDLGATLCTRTQPRCGECPVAEDCLGRGSGEPTAFPGKKPKKDKPVKSVRMWLVT 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + EVLL QRP GLWGGL+ P+ +E L WL++R L FRH+FSHFH Sbjct: 240 NKKGEVLLQQRPAQGLWGGLWVLPEQRVDEPLDAWLSRRGWLGAEAVPLPKFRHSFSHFH 299 Query: 299 LDIVPMWLP---VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 LD+ P+ + + T C D WYN P G+AAPV RLL+ L T + Sbjct: 300 LDVEPLLVQDIDSARVTECADHSPEQWYNPRSP-QGGIAAPVARLLKSLPTDS 351 >UniRef50_B0RRL4 A/G-specific adenine glycosylase n=15 Tax=Gammaproteobacteria RepID=B0RRL4_XANCB Length = 430 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 159/350 (45%), Positives = 216/350 (61%), Gaps = 6/350 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + F ++L W+D +GR LPWQ + PY+VWLSE+MLQQTQVA VIPYF++F+A FP Sbjct: 81 LTTDAFVDRLLHWFDGHGRHDLPWQHPRAPYRVWLSEIMLQQTQVAVVIPYFQKFVASFP 140 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ DLA A D V+ W GLGYYARARNLH AA+Q LH G+ P F+ + ALPG+GRS Sbjct: 141 TLADLAAADNDTVMAHWAGLGYYARARNLHAAAKQCVALHAGELPRDFDALLALPGIGRS 200 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSE---QVTPAVGV 177 TAGAILS + FPI+DGNVKRVL R + ++G+PG VE +LW L+ PA + Sbjct: 201 TAGAILSQAWNDRFPIMDGNVKRVLTRIHGIAGYPGLPVVEKQLWQLAANHVAHVPAGRL 260 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 + QA MD GA +CTR++P C +CPLQ C+A P KP + LPER LL Sbjct: 261 ADYTQAQMDFGATLCTRARPACMVCPLQENCVARREGLVEALPTPKPGKQLPEREATALL 320 Query: 238 LQH-EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-DNLTQLTAFRHTFS 295 L++ +E+LL +RPP+G+W L+ PQ + LR+W A D ++ HTFS Sbjct: 321 LENAHNEILLQRRPPTGIWASLWTLPQAETDSDLREWFAAHIDGDYDRADEMPMIVHTFS 380 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL 344 H+ L + P+ L + + + + L W A ++GL AP+ +LL L Sbjct: 381 HYRLHLQPLRLRKVALRQVLRDNDDLRWVARADLATLGLPAPIRKLLDAL 430 >UniRef50_UPI0001BC503C adenine glycosylase n=1 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC503C Length = 359 Score = 380 bits (975), Expect = e-104, Method: Composition-based stats. Identities = 148/348 (42%), Positives = 198/348 (56%), Gaps = 11/348 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W ++GR LPWQ+ + PY VWLSE+MLQQTQVATV+ Y+ RF+ +FPTV L Sbjct: 7 FSERLIRWQKQHGRHHLPWQV-QNPYCVWLSEIMLQQTQVATVLDYYPRFLEKFPTVQTL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP DEVL LW GLGYY+RARNLHKAA+QV G FP +++ L GVGRSTA AI Sbjct: 66 AAAPQDEVLSLWAGLGYYSRARNLHKAARQVVEEFDGTFPSERKDLETLCGVGRSTAAAI 125 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE--RFNQA 183 + + + ILDGNVKRVL R +A G P K+ EN LW+L+E + P + + Q Sbjct: 126 CAFAFNRRETILDGNVKRVLCRVFARDGNPQDKKFENSLWTLAESLLPPENADMPAYTQG 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED- 242 +MDLGA +C R+KP C CP+ + C A N A P KK + Y+L+++ +D Sbjct: 186 LMDLGATVCKRTKPLCHQCPMADICEAKKQNRIAELPRKKTATEVQTLPLYWLIIRDQDG 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI-------AADNLTQLTAFRHTFS 295 +LL +RP G+WGGLYC P F + + I D+L++ H + Sbjct: 246 AILLEKRPAKGIWGGLYCVPCFEKLDEAYACAEKLGIFSECAASPWDDLSEQPTLTHRLT 305 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 H L I P SS + N LW GL P+ L+Q Sbjct: 306 HRLLMITPFEAQTSSSENTVLPPNCLWVKPENLTDYGLPKPLADYLKQ 353 >UniRef50_A1KUL5 Adenine glycosylase n=32 Tax=Neisseriaceae RepID=A1KUL5_NEIMF Length = 349 Score = 379 bits (974), Expect = e-104, Method: Composition-based stats. Identities = 150/343 (43%), Positives = 200/343 (58%), Gaps = 8/343 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W ++GR LPWQ+ K PY VWLSE+MLQQTQVATV+ Y+ RF+ +FPTV L Sbjct: 10 FSERLIRWQKQHGRHHLPWQV-KNPYCVWLSEIMLQQTQVATVLDYYPRFLEKFPTVQTL 68 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP DEVL LW GLGYY+RARNLHKAAQQV GG FP +++ L GVGRSTA AI Sbjct: 69 AAAPQDEVLSLWAGLGYYSRARNLHKAAQQVVRQFGGTFPSERKDLETLCGVGRSTAAAI 128 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE--RFNQA 183 + S + ILDGNVKRVL R +A G P K+ EN LW+L+E + P+ + + Q Sbjct: 129 CAFSFNRRETILDGNVKRVLCRVFARDGNPQDKKFENSLWTLAESLLPSENADMPAYTQG 188 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-ED 242 +MDLGA +C R+KP C CP+ + C A N A P KK +P Y+L++++ Sbjct: 189 LMDLGATVCKRTKPLCHQCPMADICEAKKQNRTAELPRKKTAAEVPTLPLYWLIIRNRNG 248 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +LL +RP G+WGGLYC P F L + A+ + ++ + TA H +H L I Sbjct: 249 AILLEKRPAKGIWGGLYCVPCFESLNGLSDFAAKFSLTMADMDEQTALTHRLTHRLLMIT 308 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 P + + +W GL P+E L R Sbjct: 309 PFEGQM----PSESPSDGIWIKPTHLKDYGLPKPLEIYLNGNR 347 >UniRef50_Q3A7R2 A/G-specific DNA-adenine glycosylase n=2 Tax=Desulfuromonadales RepID=Q3A7R2_PELCD Length = 352 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 142/344 (41%), Positives = 200/344 (58%), Gaps = 7/344 (2%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A + +LDWY + GR LPW+ + PY++WLSE+MLQQT V VIPY+ERF+A FP+V Sbjct: 8 ADTMNRCLLDWYGRCGR-DLPWRRTRDPYRIWLSEIMLQQTGVTAVIPYYERFLAAFPSV 66 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA APL++VL LW GLGYY RAR LH+AA +V + HGG+FPET E + ALPG+GRSTA Sbjct: 67 AALAAAPLEQVLELWAGLGYYRRARFLHEAACKVVSEHGGQFPETPEAIQALPGIGRSTA 126 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI+S++ + PILDGNV+RVL R A+SG P +VE +LW ++ +TP + Q Sbjct: 127 GAIVSIAFDRKAPILDGNVRRVLCRLLAISGDPRSSKVEKRLWQCADALTPEDRPHDYAQ 186 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+MDLGA +C +P C CPL C A P + ++T+P +L++ E Sbjct: 187 AIMDLGATVCKPRRPDCQACPLSGLCQAFWQGIQEQLPQRATRKTVPLVQQVAVLIEREG 246 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADN-----LTQLTAFRHTFSHF 297 L+ +RP G+ GGL+ FP A E A R + A L+ + RH +SHF Sbjct: 247 RYLVRRRPLDGMLGGLWEFPSVAVPEGKTAETAARTLLAGEGLSAGLSPVGTVRHAYSHF 306 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 +++ + DE + W + + L ++ L Sbjct: 307 RVELHVFACRENRAGMVADEEH-RWLSPQELADWPLHGSHKKAL 349 >UniRef50_C9PAA7 A/G-specific adenine glycosylase n=2 Tax=Vibrio RepID=C9PAA7_VIBFU Length = 341 Score = 378 bits (971), Expect = e-103, Method: Composition-based stats. Identities = 134/343 (39%), Positives = 193/343 (56%), Gaps = 13/343 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + QF +L W +GR LPWQ + +PY+V +SEVMLQQTQV TVIPYFER+MA FP Sbjct: 6 LTPKQFQEHLLTWQRHHGRHDLPWQQNPSPYRVLVSEVMLQQTQVVTVIPYFERWMASFP 65 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LANA D V++ W GLGYY+RARNL KAA + G+FP + +PGVGR Sbjct: 66 TIEALANATEDAVMNHWQGLGYYSRARNLRKAALYIQDTWNGEFPADVNALQNIPGVGRY 125 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI + + + PI+DGNVKR+ R + + G PG ++ +LWS +E TP ++ Sbjct: 126 TAGAIAAFAFNTYGPIVDGNVKRLFCRYFGIEGVPGTSAMDKQLWSTAEAYTPTHNNRQY 185 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 Q ++D+GA +C P C C C A N + P KPK+ P + G+FL L+ Sbjct: 186 AQGLLDMGATLCKPKNPTCDACSFTTTCKAYQTNRVSELPTPKPKKVTPTKPGHFLWLES 245 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + ++LL +R G+WG L+C PQ +L Q+ +++ AF+HTF+H+ LD Sbjct: 246 DGKLLLEKRADDGIWGALWCLPQI--------YLEPEQL-GEHIKLKGAFKHTFTHYKLD 296 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + ++ W L Q VGL P+ +++ Sbjct: 297 GKVWQID----KLGVENPQQQWMPLHQLDDVGLPKPIRTFIEK 335 >UniRef50_C3X6B5 A/G-specific adenine glycosylase n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X6B5_OXAFO Length = 373 Score = 376 bits (966), Expect = e-103, Method: Composition-based stats. Identities = 142/349 (40%), Positives = 205/349 (58%), Gaps = 9/349 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F++ ++ W KYGR +LPWQ + Y++WLSE+MLQQTQV+TVIPY+ RF+ FP V Sbjct: 14 ASFASILVAWQKKYGRHSLPWQNTRDAYRIWLSEIMLQQTQVSTVIPYYLRFLEFFPDVY 73 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A DEV+ W+GLGYY+RARNLHK A+ V +GG FP E + LPG+G+STA Sbjct: 74 SLAAASNDEVMKYWSGLGYYSRARNLHKCARIVVNEYGGIFPSDPESLEKLPGIGKSTAA 133 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI + S G ILDGNV RV +R + + +K +N+ W L+ ++ P +E + Q Sbjct: 134 AIAAFSAGAKAAILDGNVVRVFSRVFGIRDSITEKAGKNRFWELAYELLPETDIEAYTQG 193 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLGA IC RSKP C CP + C+A + N + P +K K+ P + L+L+ + Sbjct: 194 LMDLGATICVRSKPDCVKCPFSHCCVALSENRISELPVRKIKKASPLKKIIMLVLESRGQ 253 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI---------AADNLTQLTAFRHTF 294 +LL +RP SG+WGGLY P++ + +LR+ + A ++T L F H F Sbjct: 254 ILLEKRPESGIWGGLYSLPEWETDVNLRKHKDEMASLKTHVSSFGEAVSVTFLEPFVHVF 313 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 SHF L + P + +S + +WY++ L APV +LL Sbjct: 314 SHFRLQVTPCVVTLSRRNRAIAPDAYVWYDIMSVNEAPLPAPVRKLLND 362 >UniRef50_Q89A45 A/G-specific adenine glycosylase n=6 Tax=Buchnera aphidicola RepID=MUTY_BUCBP Length = 351 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 145/346 (41%), Positives = 217/346 (62%), Gaps = 3/346 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M F +L+WY +GRKTLPWQI K PYK W+SE+MLQQTQV TVIPY+ +F+ RFP Sbjct: 1 MTTLVFYQTILNWYHHFGRKTLPWQIKKNPYKTWISEIMLQQTQVKTVIPYYCKFIKRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + L+++PLD +L+LW+GLGYY RARN++K A+ + G FP ++ E+ LPG+G+S Sbjct: 61 NIDTLSDSPLDSILNLWSGLGYYTRARNIYKTAKILKQKFNGIFPNSYAEIIKLPGIGKS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAILS + ILDGN+KRVL R Y++ K +E LW E +TP +F Sbjct: 121 TAGAILSFGFNLYSCILDGNIKRVLIRYYSI--NINNKYIEKLLWKTIESITPIYHTNKF 178 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK-PKQTLPERTGYFLLLQ 239 NQA++D+GA+IC +S PKC++CPL++ C + NN KK K +P+ +FL+LQ Sbjct: 179 NQALIDIGALICLKSNPKCNICPLKSTCKSYLNNKLFQINCKKNKKHIIPKTKYWFLILQ 238 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +++ + L +R G+W L+CFPQF + + W+ + + L F+H SH L Sbjct: 239 YKNYIFLEKRQNLGIWKKLFCFPQFIRQNDILSWIQKNNTKIKKINILNEFKHKLSHLTL 298 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 I P+W+ ++ + + +WYNL P +GL PV +++ +++ Sbjct: 299 YINPIWIIINKISIFSNNNKTIWYNLNNPQCIGLPTPVTKIITKIK 344 >UniRef50_UPI0000E87CC6 A/G-specific adenine glycosylase n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87CC6 Length = 343 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 128/340 (37%), Positives = 204/340 (60%), Gaps = 1/340 (0%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +V+ W+ GR LPWQ PY +W+SE+MLQQTQV++VIPY++RF+ FPTV L Sbjct: 4 IAEKVITWHKSSGRHDLPWQKINDPYLIWISEIMLQQTQVSSVIPYYQRFIKTFPTVEKL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A D V+ W+GLGYY RA+ + + A+ + + KFP++ E++ +LPG+G+STAGAI Sbjct: 64 AFADHDVVMKHWSGLGYYRRAKFIMQTAKIIVQQYQSKFPDSVEKLLSLPGIGKSTAGAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 + + G PI+D NVKRV R Y + WPGK + + LWSL+EQ P+ ++ + QA+M Sbjct: 124 CAFAFGGIEPIMDANVKRVFCRFYGIMEWPGKAQTQKYLWSLAEQNLPSNNIQIYTQALM 183 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA +C S+P CS CPLQ C++ +N + P KKPK+TL + L+++ +++L Sbjct: 184 DLGATLCKGSQPVCSQCPLQLKCVSFKSNLCHVIPAKKPKKTLITKQVNVLIVEANNQIL 243 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQ-RQIAADNLTQLTAFRHTFSHFHLDIVPM 304 +R S +W GL+ P+ D ++W+ + N+ + A FSH+ L + Sbjct: 244 FQKRSDSSVWEGLWSTPEIQDFSYTKKWIKDVLGLTKFNVVKQGAHLAIFSHYKLQLNYN 303 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + ++ + LW + + G+ P+++LLQ + Sbjct: 304 HILLTETFSHQQLKDFLWIDRPKIGGEGIPTPIKKLLQNI 343 >UniRef50_Q2S0Q6 A/G-specific adenine glycosylase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S0Q6_SALRD Length = 354 Score = 371 bits (952), Expect = e-101, Method: Composition-based stats. Identities = 120/343 (34%), Positives = 173/343 (50%), Gaps = 10/343 (2%) Query: 11 LDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPL 70 LDWYD + R ++PW+ PY++W+SE+MLQQT+V TV Y+ RF+ FPTV LA+A Sbjct: 2 LDWYDTHKR-SMPWRETDDPYRIWVSEIMLQQTRVDTVRDYYHRFLEAFPTVEALADADR 60 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 D VL W GLG+YARAR+LH AAQ V H G P T + + L GVG TA A+LS++ Sbjct: 61 DTVLKHWEGLGFYARARHLHTAAQHVVDEHDGTVPSTMDAIKDLKGVGPYTAAAVLSIAY 120 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAM 190 K +LDGNV RVL+R +AV E L L+ ++ FNQAMM+LGA+ Sbjct: 121 RKPHAVLDGNVTRVLSRVFAVDEDATTSAAEGHLRDLANELLDPDRPGDFNQAMMELGAL 180 Query: 191 ICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQHEDEVLLAQR 249 +CT P C CPL C A + YP + +P L+ D +L+ +R Sbjct: 181 VCTPRTPHCDRCPLNAVCRAHDAGTEEDYPITPESEPVPHEDIAVGLVFDDNDRLLIQRR 240 Query: 250 PPSGLWGGLYCFPQFADE-------ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 P GL GGL+ FP E R+ + + ++ H +SHF + + Sbjct: 241 PDEGLLGGLWEFPGGKQEGDESMEAACRREVREELGVGMTDVEPFYTLSHAYSHFKITLH 300 Query: 303 PMWLPVSSFTG-CMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 ++ ++ W + + RL+++L Sbjct: 301 AFRGRLADGPPEAREDQPFRWVTVDELDDYAFPRANRRLIEEL 343 >UniRef50_A5EV94 A-G-specific adenine glycosylase n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EV94_DICNV Length = 347 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 147/345 (42%), Positives = 204/345 (59%), Gaps = 12/345 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M AS FSA+++ W +GR LPWQ + PY VWLSE+MLQQTQV TVIPYF+ F+ARFP Sbjct: 1 MNAS-FSARLISWQRTFGRHELPWQ-NCNPYHVWLSEIMLQQTQVVTVIPYFQCFIARFP 58 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA AP +EVL LW GLGYYARARNLH AAQ V G+ P T ++ L GVGRS Sbjct: 59 DIDTLAAAPEEEVLFLWRGLGYYARARNLHAAAQYVVHQLNGQLPNTRAQLEQLKGVGRS 118 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVE 178 TA AI + GK I DGNV+RVL R + + + + + +LW+L+E + P A + Sbjct: 119 TAAAICVFAFGKKEAICDGNVRRVLTRHHGILDFIEAPKTQQQLWTLAEALLPDAADDLR 178 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 + Q +MDLG++ICTR++PKC+ CP++ C A N P +KPK+ P R GYF+L Sbjct: 179 SYTQGLMDLGSLICTRARPKCADCPVKTDCYALKNQQQMQLPRRKPKKAPPARNGYFVLC 238 Query: 239 QHED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 +++D E+ L +R G+WGGL+ P F E + W+A + + + + H F+HF Sbjct: 239 RNDDGEIYLQKRARRGIWGGLWSLPWFETEHEAKMWIAAQGFSWQSSSDSPVMMHRFTHF 298 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 HL + T + + +Y + L AP+ LL Sbjct: 299 HLSMY-------LLTAQTTQKSTDFYAPDAVAQMALPAPLMTLLN 336 >UniRef50_B8FCI1 A/G-specific adenine glycosylase n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FCI1_DESAA Length = 369 Score = 368 bits (946), Expect = e-100, Method: Composition-based stats. Identities = 119/358 (33%), Positives = 196/358 (54%), Gaps = 12/358 (3%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + FS +L WY++ R LPW+ PY +W+SE+MLQQTQV TVIPYF R+M FP + Sbjct: 12 SKDFSRSLLRWYEENAR-DLPWRRTSDPYAIWVSEIMLQQTQVKTVIPYFLRWMDAFPNI 70 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 + LA APLD+VL +W GLGYY+RARN+HKAA+++ G+ P T++ + LPG+G TA Sbjct: 71 SSLAEAPLDDVLKMWEGLGYYSRARNMHKAAKEIMDRLDGRMPRTYKGLLELPGIGAYTA 130 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GA+ S++ + P++D NVKRV AR + + ++ L+E + P+ FNQ Sbjct: 131 GAVCSIAYNQDVPLVDANVKRVFARILDMEKPVEQTAATREIRGLAESLIPSGKAGLFNQ 190 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA++CT P C CP+ C+A + P PK+ + + Sbjct: 191 ALMELGALVCTPKNPDCKGCPVSVHCLALKEQTVDSRPVLPPKKKTQALEVSAGVCVRDR 250 Query: 243 EVLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 ++L+ +R P GL GL+ FP + ++ +R++ + +I + ++T +H ++ Sbjct: 251 KILIQKRLPKGLMAGLWEFPGGKLNPGESPEQALVREFAEELEIDIECGEKITVIQHAYT 310 Query: 296 HFHLDIVPMW--LPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL-RTGAP 349 F + + W + + T + + W + + + + RL+Q L + G P Sbjct: 311 RFRVRLHVFWCSMKKPAQTPALHAAEEIRWVSPKELDGLAFPSADRRLIQMLMKDGIP 368 >UniRef50_B7GII9 A/G-specific adenine glycosylase n=76 Tax=Bacillaceae RepID=B7GII9_ANOFW Length = 373 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 116/347 (33%), Positives = 181/347 (52%), Gaps = 10/347 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 QF ++DW+ R LPW+ DK PYK+W+SEVMLQQT+V TVIPYF +F+ +FP Sbjct: 25 FHIEQFQHDLIDWFRTEQR-DLPWRKDKDPYKIWVSEVMLQQTRVDTVIPYFYQFIEKFP 83 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA+A +EVL W GLGYY+R RNLH A ++V +GG+ P + E+ ++L GVG Sbjct: 84 TLDALADAKEEEVLKAWEGLGYYSRVRNLHAAVKEVKEKYGGRVPASKEQFSSLKGVGPY 143 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GA+LS++ G P +DGNV RVL+R + ++ + K + + F Sbjct: 144 TTGAVLSIAYGIPEPAVDGNVMRVLSRIFYITDDIARASTRKKFEQIVSCIISHDDPSDF 203 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL-Q 239 NQA+M+LGA++CT P C LCP+Q C A A A P K + ++L Sbjct: 204 NQALMELGALVCTPKNPSCFLCPVQRHCRAFAEGVEAELPVKTKGKAPKHVAFRAIVLTN 263 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFA------DEESLRQWLAQRQIAADNLTQLTAFRHT 293 E ++ + +RP SGL L+ FP ++ +++ + + ++ ++ F H Sbjct: 264 EEGKIRIEKRPSSGLLANLWQFPNDEYAPTSNEQAFIKEISERYHVVLRSIQRIGTFEHV 323 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 FSH I ++ M + W + + +++ Sbjct: 324 FSHIVWHIEAY--KGNALELQMGDETNKWVTVDELNQYAFPVIYQKI 368 >UniRef50_A8MJ42 A/G-specific adenine glycosylase n=2 Tax=Firmicutes RepID=A8MJ42_ALKOO Length = 544 Score = 368 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 114/344 (33%), Positives = 180/344 (52%), Gaps = 11/344 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + ++L+WY K R LPW+ ++ PY+VWLSE+MLQQT+V TVI Y+ RF+ FPT+ Sbjct: 195 KRLPEKLLNWYQKNAR-DLPWRKNQDPYRVWLSEIMLQQTRVDTVIDYYNRFLQAFPTID 253 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A + VL LW GLGYY+RARNLHK A+ + + G FPET EE+ LPG+G TAG Sbjct: 254 ALALADEERVLKLWEGLGYYSRARNLHKTAKIIVAQYEGNFPETHEELLKLPGIGSYTAG 313 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S+S +DGNV RV++R +++V+ ++ + +V P F Q+ Sbjct: 314 AIASISFNLPVAAVDGNVLRVVSRITEDYRCIDEEKVKKEMGNQLAEVYPENQCGDFTQS 373 Query: 184 MMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M+LGA IC P C+ CP C+A N++ P +K K R + + Sbjct: 374 LMELGATICLPNGAPLCNECPAIEICMANKNDTQIFLPVRKEKTARKTRELTVFVFVSQG 433 Query: 243 EVLLAQRPPSGLWGGLYCFPQ---FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +V L +R G+ GL+ P ++E+++ +L + + + H F+H Sbjct: 434 KVALQKRTEKGVLEGLWELPNRDGMSEEDAVSLYLEELGTLEYRIEKQKKAHHIFTHIRW 493 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPP-SVGLAAPVERLLQ 342 D+V ++ + G + +W + + ++ L Sbjct: 494 DMVCYYVHCTHPFG-----SYIWAGSEELEKEYSVPTAFKKFLS 532 >UniRef50_D0L1V0 A/G-specific adenine glycosylase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L1V0_HALNC Length = 381 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 156/374 (41%), Positives = 217/374 (58%), Gaps = 31/374 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M AS F +++LDW+D++GR LPWQ +TPY+VW+SE+MLQQTQVATVI YF RFM RFP Sbjct: 1 MSASDFHSRLLDWFDRHGRHDLPWQHPRTPYRVWISEIMLQQTQVATVIGYFNRFMQRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA AP+DEVL LW+GLGYYARARNLH AAQ +A PET ALP VG S Sbjct: 61 SLDVLAAAPVDEVLALWSGLGYYARARNLHAAAQIMAQQ---GVPETRAGWQALPSVGPS 117 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI++ + ILDGNVKRVLAR + + L+ +++ TP V + Sbjct: 118 TAAAIMAQAFDVPETILDGNVKRVLARHAGIDRPIEQASTIQALYEVAKLHTPQTRVADY 177 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA +CTR P CS CP+ C+A A+N P ++ +Q +P R F+ ++ Sbjct: 178 TQAIMDLGATLCTRHSPGCSACPVSADCVAFASNRVESLPVRRRRQPVPTRRAVFMAIED 237 Query: 241 E-DEVLLAQRPPSGLWGGLYCFPQF-------ADEESLRQWLAQRQIAADNLTQ------ 286 E + ++L +RPP+G+WGGL+C P++ + E +L ++ +Q ++ Q Sbjct: 238 EAERLMLIRRPPTGIWGGLWCLPEYIPADDCVSQEHTLDSAVSGKQAKDNSGQQSMLSIH 297 Query: 287 ----------LTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNL----AQPPSVG 332 LT F H F+H+ LD + VS + D + LW L A+ P +G Sbjct: 298 RLKQRGPTSPLTTFEHRFTHYLLDARIDHMVVSRTSSVEDNPDVLWLPLIELAARAPLLG 357 Query: 333 LAAPVERLLQQLRT 346 L P+ R L Sbjct: 358 LPKPMSRFLSDYPA 371 >UniRef50_A1WW55 A/G-specific DNA-adenine glycosylase n=1 Tax=Halorhodospira halophila SL1 RepID=A1WW55_HALHL Length = 358 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 159/349 (45%), Positives = 207/349 (59%), Gaps = 4/349 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q++ W ++GR LPWQ TPY+VW+SE+MLQQT+V TV+PYFERFM R+P V Sbjct: 11 QALQEQLIAWQRQHGRNDLPWQQPATPYRVWISEIMLQQTRVETVVPYFERFMERYPDVA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A LD+VL LW GLGYYARARNLH AAQ++ T GG+ P + LPG+G STAG Sbjct: 71 ALAAAELDDVLALWAGLGYYARARNLHAAAQRIQTDWGGQLPAELSALQTLPGIGPSTAG 130 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI SL G+ PILDGNVKRVLAR V GWPG+ V +LW+LS +TP RFNQ Sbjct: 131 AIRSLGHGQPAPILDGNVKRVLARLAGVEGWPGRSPVAKQLWALSAALTPEAECRRFNQG 190 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLGA++CT P C+ CPL C A A + YP +P + P R LL++H D Sbjct: 191 LMDLGALVCTPRDPACNACPLAASCTARAAGNPETYPAPRPARQRPRREVRLLLIEHADA 250 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +LL +RP +G+WGGL+ P+ E + + L A H +HF L + P Sbjct: 251 LLLERRPATGIWGGLWSLPECPPSEDPVTRALRLGARCEPAGDLPARHHALTHFELIMQP 310 Query: 304 MWLPVSSFTGCMDEGNAL--WYNLAQPPSVGLAAPVERLLQQLRTGAPV 350 L ++ T + E + W+ Q GL AP+ R+L+ G PV Sbjct: 311 TRLRWNAATPDIGEPDPQRIWFRPGQDTLPGLPAPILRILRD--AGYPV 357 >UniRef50_D2RMB6 A/G-specific adenine glycosylase n=2 Tax=Acidaminococcus RepID=D2RMB6_ACIFE Length = 352 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 124/349 (35%), Positives = 185/349 (53%), Gaps = 10/349 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID--KTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 M+ + + +L W+D R+ LPW+ + + PY VW+SE+MLQQT+ TV YF+R+M + Sbjct: 1 MKRNDWPETLLAWFDG-SRRALPWREEHPRNPYHVWVSEIMLQQTRTETVKGYFQRWMEQ 59 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+ DLA AP ++VL W GLGYY+RARNLHKAA+QV GG+ P + + +L G+G Sbjct: 60 FPTIRDLAQAPEEQVLRAWQGLGYYSRARNLHKAARQVMAEWGGQLPRERKALGSLAGIG 119 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T GAILS++ G+ P +DGN+ RVL+R Y V + + + +L+E+ P Sbjct: 120 AYTVGAILSMAFGEKIPAVDGNLLRVLSRLYGVEEDISGTQGKKTITALAEEAIPGDRPG 179 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FN+A+MDLGA +C P+C CPL C A P KKPK E L+ Sbjct: 180 DFNEALMDLGAEVCIPRHPRCEACPLTAFCQAWKEGKTETLPVKKPKAPQKELAAACGLV 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADE--ESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 E L +RP G+ ++ FP + E R L + + + + RH F+H Sbjct: 240 VREGRYLFHKRPAKGMLASMWEFPMVLEPGAEKARSSLVKL-LETEAGPAVWQHRHVFTH 298 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 ++ + + G G W++ + V LA P +L L+ Sbjct: 299 QIWNMTAYVMERAEVPG----GEWSWFSPEEWEKVPLAGPHAKLADWLK 343 >UniRef50_D0MK94 A/G-specific adenine glycosylase n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MK94_RHOM4 Length = 383 Score = 366 bits (939), Expect = e-100, Method: Composition-based stats. Identities = 126/345 (36%), Positives = 183/345 (53%), Gaps = 10/345 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 ++DWY ++ R LPW+ + PY++W++EVMLQQT+V PY+ERF+ FPTV LA A Sbjct: 25 LIDWYRRHAR-DLPWRRTRDPYRIWVAEVMLQQTRVDQAGPYYERFLRAFPTVEALAAAS 83 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 LD+VL W GLGYYARARNLH+AA+Q+ HGG+ P T+E + LPGVG TA A+ S++ Sbjct: 84 LDDVLRCWEGLGYYARARNLHRAARQLVAEHGGRLPTTYEALRRLPGVGPYTAAAVASIA 143 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G+ +LDGNV RVL R AV+ L +++ + FNQA+M+LGA Sbjct: 144 FGEPRAVLDGNVIRVLTRVLAVADDARASATRRALQEVADALISDEEPGTFNQALMELGA 203 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE-RTGYFLLLQHEDEVLLAQ 248 +CT +P+C+ CPL+ C A A +P + P+ ++P LL E VL+ + Sbjct: 204 TVCTPVQPRCNDCPLREVCRARAMGDPTAFPVQTPRASVPHYEVALGLLFNEEGAVLIQR 263 Query: 249 RPPSGLWGGLYCFPQFADE-------ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 RP GL GGL+ FP E R+ + + L RH ++HF + + Sbjct: 264 RPEDGLLGGLWEFPGGKREPGESLEAACARELHEELGVRVAVGPCLATVRHAYTHFRVTL 323 Query: 302 VPMWLPVSSFTGCMDEG-NALWYNLAQPPSVGLAAPVERLLQQLR 345 + G W L + RL++ L+ Sbjct: 324 YAFPCTLLEGVPRSRAGLPLRWVPLNELDHYAFPRANRRLIELLK 368 >UniRef50_A1VKP4 A/G-specific DNA-adenine glycosylase n=6 Tax=Comamonadaceae RepID=A1VKP4_POLNA Length = 384 Score = 366 bits (939), Expect = e-100, Method: Composition-based stats. Identities = 149/347 (42%), Positives = 196/347 (56%), Gaps = 14/347 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +++ W +GR +LPWQ + PY+VWLSE+MLQQTQVATV+ Y++RF+ RFPTV+DL Sbjct: 27 FANEIVRWQQSHGRNSLPWQNTRDPYRVWLSEIMLQQTQVATVLAYYDRFVQRFPTVSDL 86 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A DEV+ LW GLGYY+RARNLH+ AQ V LH G+FP + E++ LPG+GRSTA AI Sbjct: 87 AAATQDEVMALWGGLGYYSRARNLHRCAQDVMALHAGQFPRSAEQLQTLPGIGRSTAAAI 146 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE----RFN 181 S + ILDGNVKRVL R S + E LW+ + + P + R+ Sbjct: 147 ASFCFAERVAILDGNVKRVLTRVLGFSDDLAQSANERALWNQATDLLPHDNLARAMPRYT 206 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH- 240 Q +MDLGA ICT +PKC LCP+Q C A YP K K + L Q Sbjct: 207 QGLMDLGATICTGRQPKCLLCPVQALCRGRAEGQPEKYPVKTRKLKRSTQALSLLWAQKP 266 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + V L +RP +G+W GLYC P F + LR + Q + L AF H +H L Sbjct: 267 DGSVWLEKRPATGIWSGLYCLPVFESDAELRALVPQS--LEGRMEALPAFVHVLTHKDLC 324 Query: 301 IVPMWLPVSSFTGCMDEG------NALWYNLAQPPSVGLAAPVERLL 341 + P W+ M E A W+ A P +GL AP+ +LL Sbjct: 325 LSP-WIAGFQADQAMPETMVSEGRPAAWFGPAAWPELGLPAPIRKLL 370 >UniRef50_A2RLW5 A/G-specific adenine glycosylase n=4 Tax=Lactococcus lactis RepID=A2RLW5_LACLM Length = 386 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 122/359 (33%), Positives = 172/359 (47%), Gaps = 21/359 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F +L WYD +K LPW+ PYK+W+SE+M QQTQV TVIPYFERFM ++PT Sbjct: 7 KIKEFQKDLLSWYDAN-KKPLPWRQTTEPYKIWISEIMSQQTQVETVIPYFERFMKKYPT 65 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A E+L LW GLGYY+RARNL AAQ+V + GKFP+ +E+ +L G+G T Sbjct: 66 VESLAQADDTELLKLWEGLGYYSRARNLKIAAQEVVNEYNGKFPDNLKEILSLRGIGPYT 125 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S+S P +DGN+ RV +R + + K + FN Sbjct: 126 AAAIASISFDLAEPAIDGNLMRVTSRIFELECDISKSSSRKIFDEHLRTLVSKKRPGDFN 185 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 Q +MDLG+++C+ KPKC CPL C A A+ YP K K E L++ Sbjct: 186 QGLMDLGSLVCSPKKPKCETCPLNKYCGAVADGKQLNYPVKTKKLKQKELYYTAFALENS 245 Query: 242 -DEVLLAQRPPSGLWGGLYCFPQFADEE------------SLRQWLAQRQIAADNLTQLT 288 E L +RP GL ++ FP S + + + + Sbjct: 246 LGEYYLEKRPSKGLLADMWTFPLTELSSVDFEKIITDGAISKNITMPELPESISKMEYFG 305 Query: 289 AFRHTFSHFHLDIVPMWLPVSSFTGCMDE-----GNALWYNLAQPPS--VGLAAPVERL 340 F H FSH IV + + DE ++Y+ + V L + ++ Sbjct: 306 NFTHIFSHQKWHIVLIKVKAEEKFEIADELLTADKKWVYYDSKKLGENVVNLNQDLSKI 364 >UniRef50_C0EDR1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=C0EDR1_9CLOT Length = 365 Score = 362 bits (930), Expect = 1e-98, Method: Composition-based stats. Identities = 130/344 (37%), Positives = 191/344 (55%), Gaps = 6/344 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +Q +L WYD + + LPW+ TPY+VW+SE+MLQQT+V+ V+PY+ERF++ PTV Sbjct: 19 AQIPPLLLTWYD-HSARILPWRQQPTPYRVWVSEIMLQQTRVSAVMPYYERFLSALPTVG 77 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+AP + +L LW GLGYY R RN+ KAA+ V HGG+ P +FEE+ LPG+G TAG Sbjct: 78 ALADAPEEVLLKLWEGLGYYNRVRNMQKAARAVMEQHGGELPASFEELVKLPGIGEYTAG 137 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ S++ G P +DGNV R+L+R V+ +L +Q+ PA V FNQA Sbjct: 138 AVASIAYGLRVPAVDGNVLRILSRWLLSRADVTMPPVKRAYQALVQQMLPAERVGDFNQA 197 Query: 184 MMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M+LGA +C P C CP+ C A A P K PK+ E LL+ E Sbjct: 198 LMELGATVCLPNGDPLCESCPVAGLCRARERGCAAELPVKAPKKPRREEKRTILLVVSEG 257 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +VLL +RP +GL GL+ + S ++ A + ++ +L A +H FSH + Sbjct: 258 KVLLTKRPANGLLAGLWEYLNLEGHLSAQEAAAHCGVPVSSVRRLGAAKHIFSHIEWKMR 317 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQ-PPSVGLAAPVERLLQQLR 345 + VS FT +G +W + Q + + ++ + LR Sbjct: 318 GYLIQVSGFTP---QGECVWADQLQFEEQYSVPSAFQQYTKLLR 358 >UniRef50_Q03V36 A/G-specific DNA-adenine glycosylase n=2 Tax=Leuconostoc mesenteroides RepID=Q03V36_LEUMM Length = 342 Score = 362 bits (929), Expect = 1e-98, Method: Composition-based stats. Identities = 121/343 (35%), Positives = 174/343 (50%), Gaps = 13/343 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +F +L+WYDK GR LPW+++ PY+V +SE+MLQQT+V TVIPYF+RFM PT Sbjct: 8 TIKEFRKTLLNWYDKEGRAHLPWRVNHDPYRVLVSEIMLQQTRVDTVIPYFDRFMTSLPT 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA AP D VL LW GLGYY+RARNL KAAQ V G++PE+ +++ +LPGVG T Sbjct: 68 VGDLATAPEDVVLKLWEGLGYYSRARNLQKAAQFVVNELHGQWPESSDDLQSLPGVGPYT 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S+S + P +DGN RV +R + + + + + FN Sbjct: 128 AAAIASISFNEVVPAVDGNQYRVFSRLLKIDADIADTKSRKIFYDIIAPIVDPERPGDFN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ T P P++ A + YP K KQ ++ + + + Sbjct: 188 QAIMDLGSSYMTTKNPDSLYSPVREFNAAFRDGVEDQYPVKTTKQRPVKQLFMASVFEKD 247 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 ++L +RP SGL G + FP E Q I H F+H +I Sbjct: 248 GKLLFEKRPDSGLLAGFWTFPLTQIESIESITGQQLNIK--------PVTHIFTHRRWEI 299 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 WL M N +++ + ++ L +LL++L Sbjct: 300 ---WLVKQET--AMLNDNQQYFSSDEWSALSLPKVQHKLLEKL 337 >UniRef50_A6TW33 A/G-specific adenine glycosylase n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TW33_ALKMQ Length = 352 Score = 359 bits (923), Expect = 6e-98, Method: Composition-based stats. Identities = 112/351 (31%), Positives = 183/351 (52%), Gaps = 10/351 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS Q+++W+ + R +PW+ K PY +W+SE+MLQQT+V TVI Y++ FM +FP Sbjct: 1 MTEESFSYQLIEWFREEKRW-MPWRETKDPYCIWVSEIMLQQTRVETVISYYQNFMKKFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A +EVL W GLGYY+R RNLH+AA ++ +H G P+ + + LPG+G Sbjct: 60 TIETLARASQEEVLKSWEGLGYYSRGRNLHRAANEIVLIHEGNVPKDKKILLKLPGIGPY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAILS++ + P +DGNV RV++R + + +K+V N++ L Q+ P F Sbjct: 120 TAGAILSIAYNQKEPAVDGNVLRVMSRLFNIQEDIMEKKVVNEVTDLVFQLMPQDNGGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 +A+M+LGA +C KP+C LCP+ N C A + P + K + L + + Sbjct: 180 TEALMELGATVCVPQKPRCRLCPVHNQCKAHHLDIQETLPIRIKKTKVKNYHKGILWMVY 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA-----DEESL--RQWLAQRQIAADNLTQLTAFRHT 293 +L+ Q P GL GGL+ P DE+++ + + L + +H Sbjct: 240 NGTILVKQNPQKGLLGGLWALPTIDLMHKVDEKAVIQEDFQEEVGQVVVELEYIGKEKHV 299 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 F+H + + + W + Q ++ +++ ++ Sbjct: 300 FTHQRWQMSIFKGRSNDHLRVKEP--YQWVPIGQLETLTFPIVYRKVIDKI 348 >UniRef50_Q2LY55 A/g-specific DNA glycosylase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LY55_SYNAS Length = 373 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. Identities = 128/352 (36%), Positives = 192/352 (54%), Gaps = 10/352 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A + + + WY+K R LPW+ PY +WLSE+MLQQTQV VIPY+ RF+ +FPT Sbjct: 21 NAQRIATLLTAWYEKNKR-LLPWRSTSDPYAIWLSEIMLQQTQVEAVIPYYRRFLEQFPT 79 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +LA APL+ VL +W +GYY+RAR+LH A+ + HGG+FP ++ ALPG+G T Sbjct: 80 IEELARAPLEAVLKVWEKMGYYSRARHLHATARLILESHGGRFPANPVDLMALPGIGSYT 139 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 +GAILS++ GK P +DGNVKRVL+R + V L +L+E++ PA RFN Sbjct: 140 SGAILSIAFGKSVPAVDGNVKRVLSRLFFVDSPVDLTSTRRLLSALAEKLVPARQPGRFN 199 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH- 240 QA+M+LGA++C P CS CPLQ+ C+A A + P + + P R +++ Sbjct: 200 QALMELGAVLCRPKTPLCSDCPLQSICLAYAGSGQHFLPVSRKRTKRPHREAVAAVIRDS 259 Query: 241 EDEVLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 E +L+ +RP +G GGL+ FP + E R+ I L + T Sbjct: 260 EQRLLVIRRPAAGFLGGLWTFPGGMLNPGEIVTEAVERRCREGLNITVAAGDSLMTLQQT 319 Query: 294 FSHFHLDIVPMW-LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 ++HFHL + + + N W + ++ + R+L+ L Sbjct: 320 YTHFHLTLHVFAGTILDGVPDSPQKDNWRWVSPGDIRNLPFSRAELRILETL 371 >UniRef50_C4L361 A/G-specific adenine glycosylase n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L361_EXISA Length = 344 Score = 358 bits (918), Expect = 2e-97, Method: Composition-based stats. Identities = 115/346 (33%), Positives = 179/346 (51%), Gaps = 18/346 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F+ +++ W+++ R LPW+ K PY+VW+SEVMLQQT+V TVIPY+ RFM RFPT Sbjct: 12 NIHHFNEELVTWFNREKR-DLPWRHAKNPYRVWVSEVMLQQTRVDTVIPYYNRFMERFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + DLA A DEV+ W GLGYY+R RNLH+A ++VA+++ G PE E L GVG T Sbjct: 71 LEDLAAADTDEVVKYWEGLGYYSRVRNLHEAVKEVASVYEGIVPEEKERFEKLKGVGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GA+LS++ + P +DGNV RV++R + + + + ++ FN Sbjct: 131 TGAVLSIAYNQPEPAVDGNVMRVMSRQFGIYDDIAMPKTRKIFEQVVRRLMDPAHASDFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 + +M+LGA +CT P CSLCP+Q+ C A A++ P K K L ++ + Sbjct: 191 EGVMELGATVCTPKNPMCSLCPVQDTCYAYAHHVQDELPVKTKKGAARIEMYDALCIEKD 250 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +V QR GL G++ +P + N T L +H FSH I Sbjct: 251 GKVAYEQRADKGLLAGMWQYPLMDRGTGEQL----------NGTYLGQVKHVFSHIVWYI 300 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + + +W + ++ ++ ++ L++L G Sbjct: 301 DLYRVE------TLPTDDVVWLDASEREVKTVSVAQQK-LERLAGG 339 >UniRef50_A3ZT46 A/G-specific adenine glycosylase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT46_9PLAN Length = 358 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 139/350 (39%), Positives = 187/350 (53%), Gaps = 10/350 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F Q+L WY R LPW+ D+ PY+VW+SE+MLQQTQVATV YFERF A FPTVT Sbjct: 8 RKFQRQILAWYGGAAR-DLPWRADRDPYRVWISEIMLQQTQVATVRAYFERFSAAFPTVT 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A EVL LW GLGYY RAR LH AAQ +A H GKFP F + ALPGVGR TAG Sbjct: 67 DLAAADEAEVLRLWEGLGYYRRARQLHAAAQVIADEHRGKFPREFAAILALPGVGRYTAG 126 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ + PIL+ N R+ AR A P K + LW +E + P V FNQA Sbjct: 127 AICSIAYDQPAPILEANTIRLHARLLAYREDPTKTAGQRLLWQFAEHILPTEDVSSFNQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LG+ ICT P+C +CP+ C A N+ A P K K +RT ++++ + E Sbjct: 187 LMELGSEICTPRNPQCGVCPVATLCQAKRENAVAEIPVPKKKMQYEDRTEVAIIVRRKKE 246 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLR--------QWLAQRQIAADNLTQLTAFRHTFS 295 VLL Q W GL+ FP+F Q + ++A ++LT +H + Sbjct: 247 VLLRQCGADERWAGLWDFPRFQVASEAELLSGELASQLEERTGLSAAIGSRLTTIKHGVT 306 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQQL 344 + + + + D L W + + L+ ++ Q L Sbjct: 307 KYRITLHCHEAEGMTGRLRKDATAPLVWAAVDKLADFALSTTGRKIAQLL 356 >UniRef50_C5EQF4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C5EQF4_9FIRM Length = 391 Score = 357 bits (916), Expect = 4e-97, Method: Composition-based stats. Identities = 122/330 (36%), Positives = 176/330 (53%), Gaps = 11/330 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +L WY R +LPW+ D PY+VW+SE+MLQQT+V V PYFERFM FP Sbjct: 28 RLKAMEKPLLAWYGSRAR-SLPWRDDPKPYRVWISEIMLQQTRVEAVKPYFERFMKAFPE 86 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A D ++ +W GLGYY RARNL AA+ V +GG P +FEE+ LPG+G T Sbjct: 87 VRDLAGAEDDYLMKMWEGLGYYNRARNLKAAARMVMEEYGGCLPASFEELIKLPGIGSYT 146 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ G P +DGNV RV++R G K V+ + + V P+ R+N Sbjct: 147 AGAIASIAFGIPMPAVDGNVLRVISRVLGDRGDIRKASVKAGMEQELKAVMPSGDASRYN 206 Query: 182 QAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 Q ++++GA++C +P+C CPL + C+ N W P K P + ++++ Sbjct: 207 QGLIEIGALVCIPGGEPRCGECPLASVCLTRRNGWWKEIPYKSPNKARKIEERTVFIIEY 266 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQF---ADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 +D+V + +RP +GL GLY FP D + + L +T L A +H FSH Sbjct: 267 QDKVAIHKRPMNGLLAGLYEFPNACGHPDADEIPAVLGMNPGCIKAVTPLPAAKHVFSHV 326 Query: 298 HLDIVPMWLPVSS------FTGCMDEGNAL 321 + + V F G +EG + Sbjct: 327 EWHMTGYRVVVDEAAADPVFPGGREEGKGV 356 >UniRef50_Q8CV61 A/G-specific adenine glycosylase n=1 Tax=Oceanobacillus iheyensis RepID=Q8CV61_OCEIH Length = 354 Score = 355 bits (912), Expect = 1e-96, Method: Composition-based stats. Identities = 115/349 (32%), Positives = 181/349 (51%), Gaps = 9/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F +L+WY R LPW+ + PYK W+SE+MLQQT+V TVIPYF RFM ++PT Sbjct: 9 NIPVFQTDLLEWYYLNKR-DLPWRREPNPYKTWVSEIMLQQTKVDTVIPYFNRFMEKYPT 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A +VL W GLGYY+RARNL A ++V + G+ P +E+A+L G+G T Sbjct: 68 VYDLAKADEQDVLKSWEGLGYYSRARNLQTAVREVVDTYNGEIPNNEKELASLKGIGPYT 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GAILS++ + P +DGNV RV +R + K+ + ++ ++ FN Sbjct: 128 KGAILSIAFNQPVPAVDGNVLRVFSRILQIEDDIAKQSTKKEIEQYVGEIISHQDPSSFN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLGA ICT KP C CP+ C A + P KK + + LL++++ Sbjct: 188 QAIMDLGATICTPKKPTCMFCPVMEHCQAFQHGIQEQLPIKKKAKKQKIKQYVVLLIRND 247 Query: 242 -DEVLLAQRPPSGLWGGLYCFPQFADEE----SLRQWL-AQRQIAADNLTQLTAFRHTFS 295 E ++ +R GL L+ FP +E + QW+ + I + +HTFS Sbjct: 248 QGEYVIEKRSDQGLLANLWQFPMVPLDEVGIDQIEQWMYDKCGIDLQMESDCDNLKHTFS 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 H ++ S+ +++ + ++ + + +++++ L Sbjct: 308 HIIWEMDIK--KASTSQNFLNKDSLMFIAKEEIDNFPFPVSHQKMMKFL 354 >UniRef50_C9LNH1 A/G-specific adenine glycosylase n=1 Tax=Dialister invisus DSM 15470 RepID=C9LNH1_9FIRM Length = 351 Score = 354 bits (909), Expect = 2e-96, Method: Composition-based stats. Identities = 122/340 (35%), Positives = 182/340 (53%), Gaps = 9/340 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK--TPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 ++ ++L W+D+ R+ LPW+ + PY VW+SE+MLQQT+ V PYFE + RFPT Sbjct: 8 QEWPHKLLAWFDQN-RRDLPWREGRPRNPYYVWVSEIMLQQTRTEAVKPYFESWKRRFPT 66 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA A +VLH W GLGYY+RARNLHKAA+++A +GG PE ++V ALPG+G T Sbjct: 67 IEALAEAKEADVLHAWQGLGYYSRARNLHKAAREIAEKYGGAIPEDKKDVRALPGIGEYT 126 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAILS++ GKH +DGNV RV AR Y + K ++ +L E+ PA FN Sbjct: 127 AGAILSMAYGKHEAAVDGNVLRVYARLYGIESDILKSAGRKEITTLVEKTLPA-RAGDFN 185 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 +A+MDLG+ +C PKC CPL C A + P + PK+ E ++ E Sbjct: 186 EALMDLGSEVCVPKHPKCEKCPLHGECAALRLGLENVLPIRTPKKKQKESIVCCAVMIKE 245 Query: 242 DEVLLAQRPPSGLWGGLYCFPQ-FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + L RP G+ ++ P +D E + ++ + ++ + H F+H Sbjct: 246 GKALFHLRPSKGMLSSMWELPMVLSDSEKDGVKMLEKLLGGKAGERIWTYTHVFTHRIWH 305 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + + EG+ WY+ + + LA P +L Sbjct: 306 MKAYLFHGAKEP----EGDYRWYSCEEYREIPLAGPHAKL 341 >UniRef50_D1N1B7 A/G-specific adenine glycosylase n=2 Tax=Bacteria RepID=D1N1B7_9BACT Length = 356 Score = 354 bits (909), Expect = 2e-96, Method: Composition-based stats. Identities = 116/356 (32%), Positives = 175/356 (49%), Gaps = 14/356 (3%) Query: 1 MQASQFSA---QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMA 57 M + A +L+WYD++ R LPW+ PY+VW+SE+MLQQT+V V PY++RF+ Sbjct: 1 MDSESLQAIVAPLLEWYDRHAR-ILPWRESPEPYRVWISEIMLQQTRVEAVKPYYDRFLK 59 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 P + LA A ++L LW GLGYY R RNL KAA+ + + +GG+FP E + +LPG+ Sbjct: 60 ELPDLHALAEASEPQLLKLWEGLGYYNRVRNLQKAARVIVSEYGGEFPRDVETLRSLPGI 119 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G TAGAI S+S + P +DGNV RV++R A +V+ + QV PA Sbjct: 120 GEYTAGAIASISFNRPEPAVDGNVLRVVSRLAASREDISSPKVKGAISDALRQVYPAGRC 179 Query: 178 ERFNQAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 F Q++M+LGA +C P+C+ CPL C A P K ++ Sbjct: 180 GDFTQSLMELGATVCLPNGAPRCAECPLAELCAGRREGIAAGLPVKPGRKPRRIEPRTVF 239 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHT 293 LL+ D + L +RP +GL GL+ FP A +W+A + I A+ + +H Sbjct: 240 LLRCGDRIALLRRPGNGLLAGLWEFPNTAGSLTVAEAAEWMAGQGIFAERIIATGGAKHV 299 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPP-SVGLAAPVERLLQQLRTGA 348 F+H + + G W + + L ++ L + Sbjct: 300 FTHLEWHMSGFLADCK-----AENGGFHWVTRDELEREISLPTAFRPFVEALDAES 350 >UniRef50_A4T064 A/G-specific adenine glycosylase n=2 Tax=Polynucleobacter necessarius RepID=A4T064_POLSQ Length = 381 Score = 354 bits (909), Expect = 3e-96, Method: Composition-based stats. Identities = 139/367 (37%), Positives = 192/367 (52%), Gaps = 26/367 (7%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 FS +++ W+ + GR LPWQ ++ PY VW+SE+MLQQTQVATV+ + RFM RFPTV Sbjct: 6 IDSFSKKLIAWHAQSGRSGLPWQGNRDPYAVWVSEIMLQQTQVATVLERYPRFMKRFPTV 65 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A +D+VL W GLGYY+RARNLH AQQ+ GKFP+ + L G+GRSTA Sbjct: 66 KKLAAADVDDVLAEWAGLGYYSRARNLHACAQQIVREFAGKFPQDPALLEQLKGIGRSTA 125 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVERF 180 GAI + + + PILD NVKR+LAR +A+ G K V + LW L+ ++ P + + Sbjct: 126 GAIAAFAFHERAPILDANVKRILARLFAIEGAIQDKAVNDSLWKLATELLPLKPQDMPTY 185 Query: 181 NQAMMDLGAMICTRSKPKC----SLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 QA+MD GA CT KP C CP C A ++ P K K PE + L Sbjct: 186 TQALMDFGATWCTSRKPVCLSGEKKCPFAKDCQANLSDQVLALPKKVIKSKSPEFDCHML 245 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA------- 289 LL+ + VLL +RP +WGGL+ P+ +L + +++ L+ Sbjct: 246 LLRSGNSVLLQKRPDKAIWGGLWSLPESPWAPRDLSFLKEELSSSNLLSLTLPEEKSVLF 305 Query: 290 ------------FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 +H F+H L + +W + T N W L Q GL P+ Sbjct: 306 LRNCTPPNRGFYIKHVFTHRCLWMQ-IWEVNAMKTIPFTNPNLKWVPLKQLGRHGLPQPI 364 Query: 338 ERLLQQL 344 + LLQ L Sbjct: 365 KVLLQGL 371 >UniRef50_C5VJP5 A/G-specific adenine glycosylase n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VJP5_9BACT Length = 334 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 123/341 (36%), Positives = 180/341 (52%), Gaps = 9/341 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L W+ GR +LPW+ K Y +WLSEV+LQQT++ + Y+ERFMA++PTV DL Sbjct: 3 FAVTLLQWFKNNGR-SLPWRETKDAYAIWLSEVILQQTRIVQGMSYWERFMAKWPTVNDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A +EVL W GLGYY+RARNLH AAQQV L G FP+TF+E+ L GVG TA AI Sbjct: 62 AAATENEVLKAWQGLGYYSRARNLHTAAQQVMEL--GGFPQTFKELKTLKGVGDYTAAAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ G+ ++DGNV RVL+R Y + E + + +L++ + P +N+A+M Sbjct: 120 ASIAFGEPVAVVDGNVYRVLSRYYGIETPIDSTEGKKEFQTLAQSLLPINEPADYNEAIM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA CT + P CS CPL C+A P K K ER +L +++E ++ Sbjct: 180 DFGATQCTPNSPHCSACPLCETCVAFREQRIDELPVKSKKVKQRERHFTYLCIEYEGKIA 239 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 + QR +W GL+ FPQ +E + + A L + +H +H L Sbjct: 240 IHQRGAGDIWQGLWEFPQ--EEHLTSSEDSAWKTEAQLLQK--GVKHILTHQILLADIYL 295 Query: 306 LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 + +W + L +E LL+ + Sbjct: 296 WQPTRRPQL--PSEFIWIEKQDLENYALPRLIEILLKAVPA 334 >UniRef50_C0QGE4 MutY n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGE4_DESAH Length = 364 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 118/350 (33%), Positives = 177/350 (50%), Gaps = 9/350 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L WY + R+ LPW+ PY +W+SEVMLQQTQV TV+PY+ RFM FPTV Sbjct: 3 EIQTALLKWYHTHHRE-LPWRKTANPYSIWVSEVMLQQTQVKTVVPYYGRFMEAFPTVAS 61 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A L++VL LW GLGYYARARN HKA Q V T G P + ALPGVG A A Sbjct: 62 LARADLEQVLKLWEGLGYYARARNFHKACQTVTTDLKGIIPRDLKGFKALPGVGDYIAAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS++ ++DGNVKRVLAR + + K + ++ + FNQA+ Sbjct: 122 VLSIAFNIPLAVVDGNVKRVLARVFTMDDPVNHGPSHKKFQAKADLILDRSCPGAFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA++C+ P C++CPL C A ++S YP + + +P +++ D++ Sbjct: 182 MELGALVCSPRNPGCTICPLGQYCRALESDSVDKYPLRNKTKPVPTVHIAAGVVKKGDKL 241 Query: 245 LLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ R P GL GGL+ FP + A +R+ + + + L +H ++HF Sbjct: 242 LITLRKPEGLLGGLWEFPGGKVKTGEQASSACVRELAEETGLRVAVTSHLARVKHAYTHF 301 Query: 298 HLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 +++ +S + W + + + L Sbjct: 302 KIEMDIFNCQYISGNVRLNGPVDHRWIFPHEIRQYPFPKANLKFIPLLEN 351 >UniRef50_A6LDG5 A/G-specific adenine glycosylase n=33 Tax=Bacteroidetes RepID=A6LDG5_PARD8 Length = 359 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 116/352 (32%), Positives = 184/352 (52%), Gaps = 13/352 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + S +++WY+ Y R+ LPW+ + PY +W+SE++LQQT+V + YF RF RFP V Sbjct: 9 EISRILVEWYETYKRE-LPWRETRDPYIIWISEIILQQTRVVQGLEYFLRFTERFPDVAS 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A DEVL W GLGYY+RARNLH AA+ + G FPE ++EV +L G+G TA A Sbjct: 68 LAVAEEDEVLKYWQGLGYYSRARNLHAAAKSIMERFNGVFPENYKEVLSLKGIGEYTAAA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I+S + + P++DGNV RVL+R +AV + + + L+ + NQA+ Sbjct: 128 IVSFAWNQPCPVVDGNVYRVLSRLFAVDTPIDTTKGKKQFAELAGMILDPKNAGTHNQAI 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA+ C P C +CPL++ C+A A+ + YP K+ K +R ++L + ++++ Sbjct: 188 MELGALQCVPQNPDCGVCPLKDKCVAFASGNVQAYPVKQNKTKTRDRYFHYLYIIYKEQT 247 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLR-----QWLAQRQIAADNLTQ-----LTAFRHTF 294 + +R +W GLY FP + ++ + A R + D L+ +HT Sbjct: 248 WMNRRTGKDIWTGLYEFPLIETDHAMDFSGLCETQAFRNLLGDAGKLSITQGLSNVKHTL 307 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 SH L + + + GN L + + L++L Sbjct: 308 SHQILYASFYQIEIEQVPESL--GNYLSLPCRDIEKYAVPRLIHIYLEKLNG 357 >UniRef50_A5IU34 A/G-specific DNA-adenine glycosylase n=56 Tax=Staphylococcus RepID=A5IU34_STAA9 Length = 345 Score = 352 bits (903), Expect = 1e-95, Method: Composition-based stats. Identities = 115/349 (32%), Positives = 171/349 (48%), Gaps = 10/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q S F ++ W+D+ R+ +PW+ PY +WLSEVMLQQTQV TVI Y+ RF+ RFPT Sbjct: 3 QQSSFKENLIHWFDENQRE-MPWRQTTNPYYIWLSEVMLQQTQVKTVIDYYHRFVERFPT 61 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V L+ A DEVL W GLGYY+RARN H A ++V + G P+ ++ AL GVG T Sbjct: 62 VEVLSQASEDEVLKYWEGLGYYSRARNFHTAIKEVYDKYEGLVPKDPDQFKALKGVGPYT 121 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVER 179 A++S++ +DGNV RV +R + S +++ P V Sbjct: 122 QAAVMSIAYNVPLATVDGNVFRVWSRLNDDYRDIKLQSTRK---SYEQELLPYVTTEAGT 178 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK-QTLPERTGYFLLL 238 FNQAMM+LGA+ICT P C CP+Q C A ++ P K FL+ Sbjct: 179 FNQAMMELGALICTPKNPLCLFCPVQENCEAFDKGTFEKLPVKSKNVSKKVIEQSVFLIR 238 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-TQLTAFRHTFSHF 297 ++ + LL +R L G++ FP F E + R+ + + T + +H F+H Sbjct: 239 NNQGQYLLQKRSEK-LLHGMWQFPMFESEHARRKMTEKIGHDIQPVETPIFELKHQFTHL 297 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 I + + +W+NL+ P+ ++ Q + Sbjct: 298 TWKIKVYAASGA-INIETLPDDMIWFNLSDRDQYTFPVPMSKIYQFING 345 >UniRef50_B3QVZ4 A/G-specific adenine glycosylase n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QVZ4_CHLT3 Length = 360 Score = 351 bits (902), Expect = 1e-95, Method: Composition-based stats. Identities = 106/349 (30%), Positives = 180/349 (51%), Gaps = 9/349 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +L W+ R LPW+ +TPYK+W+SE+MLQQTQVATVIPY+ERF+ FP + Sbjct: 2 SIEKDLLSWFLLNKR-DLPWRKKRTPYKIWVSEIMLQQTQVATVIPYYERFLNAFPNLES 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LANA +++++ +W GLGYY R +N+ +AA+ + H G FP E+ L G+G TA Sbjct: 61 LANADINKLMKIWEGLGYYTRVKNMQEAAKTILQKHNGVFPSKKTELLQLKGIGDYTAAI 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ +H +DGNV RV++R A++ + + +++++ FN+AM Sbjct: 121 IASIAFKEHCAAVDGNVLRVISRLNAINAPIQLNTTKQTIRIVAQELLSLEHPGEFNEAM 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M++GA+IC P C +CP+ C A P K + +P ++ +D V Sbjct: 181 MEVGALICKPKNPTCDICPISLHCQAYKKGLEHKLPVKLKRAEIPHYHIAAGVIYKDDFV 240 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+A RP +GL G L+ FP ++ R+ + + + L +L + +H ++HF Sbjct: 241 LIALRPANGLLGNLWEFPGGKQQQGESLEDCCKREIFEETGLHVNVLEKLISVKHAYTHF 300 Query: 298 HLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + +S W + S ++++++L+ Sbjct: 301 KITLHAYRCNYISGSPEPRASQALKWVRIEDLTSYAFPKANKKIIEKLQ 349 >UniRef50_C1ZJ43 A/G-specific DNA-adenine glycosylase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ43_PLALI Length = 381 Score = 351 bits (900), Expect = 3e-95, Method: Composition-based stats. Identities = 119/373 (31%), Positives = 190/373 (50%), Gaps = 31/373 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 MQ ++F Q+L WY K+GR LPW+ PY +W+SE+MLQQT V VIPYFERFMA+FP Sbjct: 1 MQKTKFQKQLLAWYAKHGR-PLPWRASHDPYSIWISEIMLQQTTVTAVIPYFERFMAKFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 +V LA+AP +EVL LW GLGYY+RARNLH++A+ + + G FP++ E++ LPG+GR Sbjct: 60 SVQALASAPEEEVLKLWEGLGYYSRARNLHQSARVLMERYQGVFPQSVEQLLELPGIGRY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S + PI++ N +R+ AR G + LW +E + + Sbjct: 120 TAGAISSFAFRLPAPIVEANTQRLYARILGYDGDLKNAAGQKALWGFAESIVSGKEPDLI 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LG+++C P C CP+Q C A A P + + + LL+++ Sbjct: 180 NQALMELGSLVCKPIDPLCDQCPVQQHCRAFQEARQAEIPRAQARPVITPLVDATLLIEY 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQ---FADEESLRQWLAQRQIA----------------- 280 + E+ L QR W GL+ FP+ F E + ++ ++ ++ Sbjct: 240 QGELFLRQREKPERWAGLWDFPRYTLFDPENTSEEFQKEKDVSTSALALSLKARVQEQLA 299 Query: 281 --ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMD--------EGNALWYNLAQPPS 330 +T+ + H + + + + +S + + W+ S Sbjct: 300 VHPGEVTEFSRLTHGVTRYRITLHAFGCDLSDGVASRQSKALYEQLKSHGGWFGCESLDS 359 Query: 331 VGLAAPVERLLQQ 343 + + +L++Q Sbjct: 360 LAVPVTTRKLVKQ 372 >UniRef50_B9XFP7 A/G-specific adenine glycosylase n=1 Tax=bacterium Ellin514 RepID=B9XFP7_9BACT Length = 392 Score = 349 bits (896), Expect = 8e-95, Method: Composition-based stats. Identities = 115/360 (31%), Positives = 185/360 (51%), Gaps = 18/360 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++ ++ +L W+ + R LPW+ + PY +W+SE+MLQQTQV TVIPY+E +M P Sbjct: 34 LKIHALASALLHWFSQSAR-DLPWRRTRDPYAIWVSEIMLQQTQVKTVIPYWEHWMQNLP 92 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A + + LW GLGYY R RN+ +AAQ++ + H G FP F+ + AL G+GR Sbjct: 93 TIQSLAEAAPERIHKLWEGLGYYTRVRNMQQAAQEIMSRHDGSFPSDFDSILALKGIGRY 152 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-------- 172 TAGAI S++ + P+LDGNV RVL R + ++ P K +LWSL+E + Sbjct: 153 TAGAIASIAFNQPKPLLDGNVIRVLTRLFGIAENPRDKITNEQLWSLAEALVVSASNSNN 212 Query: 173 ---PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 NQ++M+LGA+ICT +P+C +CP++ CIA P + Sbjct: 213 NKSHPSACSHLNQSLMELGALICTPRQPQCLICPVRQDCIAHHTGKVESLPNLGERTAAT 272 Query: 230 ERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL----- 284 +R + +++H+ L+ QRP + L+ FP L QI L Sbjct: 273 QRKFFAFVIEHQHRFLVTQRPAGVVNAHLWEFPNLEVNPDLPHPTPATQIDPLRLTLHSP 332 Query: 285 TQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 T L +HT + + + + +++ W LA+ + + +++L++L Sbjct: 333 TPLCTIKHTITRYRITLQVFRATLAN-PDTQTHLPTQWKTLAELHQLPFPSAHKKILKKL 391 >UniRef50_A4VV79 A/G-specific DNA glycosylase n=98 Tax=Lactobacillales RepID=A4VV79_STRSY Length = 410 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 116/359 (32%), Positives = 180/359 (50%), Gaps = 23/359 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + F +LDWYD R LPW+ K PY +W+SE+MLQQT+V TVIPY+ERF+ PT Sbjct: 40 KIQAFRKALLDWYDANKR-DLPWRRTKDPYAIWVSEIMLQQTRVDTVIPYYERFLHHLPT 98 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 ++DLA AP + +L LW GLGYY+R RN+ KAAQQ+ G+FP T +++L G+G T Sbjct: 99 ISDLAQAPEEVILKLWEGLGYYSRVRNMQKAAQQMVEDFDGQFPTTHAAISSLKGIGPYT 158 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ P +DGNV RVL+R + V G ++ E + + FN Sbjct: 159 AGAISSIAFNLPEPAVDGNVMRVLSRLFEVDYDIGLPANRKIFQAMMELLIDSERPGDFN 218 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQT-LPERTGYFLLLQH 240 QA+MDLG+ I + P+ P++ A N + YP K PK+ +P FL+ Sbjct: 219 QALMDLGSDIESPVNPRPQDSPVKAFSAAYLNGTMDKYPIKAPKKKPIPVAYQGFLIRNK 278 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL---------------- 284 +++ LL + +GL G + FP + + ++ ++A + + Sbjct: 279 DNQFLLEKNNEAGLLSGFWSFPLLEKGAIVDKQVSLFEVAEEVVQPDIRQSFTELYGLTV 338 Query: 285 ----TQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVER 339 + +H FSH I + + + W ++ P+ A P ++ Sbjct: 339 DWQEQEFGIVQHIFSHRKWQIEMVEGVAETLNLP-ESKELKWVSVEDFPTYPFAKPQQK 396 >UniRef50_C2H8F6 A/G-specific adenine glycosylase n=16 Tax=Enterococcus RepID=C2H8F6_ENTFC Length = 392 Score = 348 bits (894), Expect = 1e-94, Method: Composition-based stats. Identities = 115/372 (30%), Positives = 174/372 (46%), Gaps = 30/372 (8%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F Q + WY++ R LPW+ ++ PY++W+SE+MLQQT+V TVI YF RFM FPT Sbjct: 12 ETKEFQDQFIQWYEQEKR-NLPWRYNRDPYRIWISEIMLQQTRVDTVIDYFYRFMEWFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +LA AP +++L W GLGYY+RARN+ AA+Q+ + GK P+T EE+++L G+G T Sbjct: 71 IEELATAPEEKLLKAWEGLGYYSRARNIQAAAKQIMSEFDGKMPQTPEEISSLKGIGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GAI S++ G P +DGNV RV++R + + K ++ FN Sbjct: 131 TGAIASIAFGLPEPAVDGNVMRVVSRLFCIEADIAKASSRKIFDEAMRKIIDETYPGEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ-H 240 QAMMDLG+ ICT + PKC CP+Q C+A +P K K LQ H Sbjct: 191 QAMMDLGSAICTPTSPKCEACPIQAFCLANKRGIQTSFPVKTKKAKPKNVYYISAALQNH 250 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFAD--------------------------EESLRQWL 274 +R L ++ FP E+ Sbjct: 251 SGAYYFEERDSQKLLANMWTFPMVEVTQEEYERLKKEWEAKQEVDLFDDLVAEDGKELPF 310 Query: 275 AQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCM--DEGNALWYNLAQPPSVG 332 ++++ L H FSH ++ + + + + W A SV Sbjct: 311 EKQELFIWQTRHLGEVTHVFSHLKWHVLLFYGRATEEAEQEFTENKTSKWLKPAAFDSVV 370 Query: 333 LAAPVERLLQQL 344 +L++QL Sbjct: 371 FPKVQMKLVEQL 382 >UniRef50_A9EQ53 A/G-specific adenine glycosylase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EQ53_SORC5 Length = 396 Score = 348 bits (894), Expect = 2e-94, Method: Composition-based stats. Identities = 131/351 (37%), Positives = 187/351 (53%), Gaps = 14/351 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +A + W+ + R LPW+ + PY +WLSEVMLQQT+V TVIPY+ERF+AR+PTV Sbjct: 34 REIAAALEAWFGRVAR-DLPWRRTRDPYAIWLSEVMLQQTRVETVIPYYERFLARYPTVF 92 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A +D+VL LW+GLGYY RAR LH AA++V H G P + ALPGVG TAG Sbjct: 93 ALASAEIDDVLSLWSGLGYYRRARVLHLAAREVTARHDGALPRDVSALLALPGVGAYTAG 152 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT----PAVGVER 179 AI S++ + P++DGNV RVL+R + KLWS +E++ +V R Sbjct: 153 AIASIAYDQPVPLVDGNVARVLSRIEGIDDDIRSASGTRKLWSTAERLVRGSADSVHPGR 212 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 FNQA+M+LGA +CT P+C CP+ C+A A A P PK+ +P +++ Sbjct: 213 FNQALMELGATVCTPRNPRCDACPVDGACVARATGRQAELPVIAPKRDVPAVAMVAAVVR 272 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADE--ESLRQWLAQRQIAAD-NLTQLTAFRHTFSH 296 D VL +R GL+GGL+ P R L +A D L ++ RH +H Sbjct: 273 SGDRVLFVRRAEGGLFGGLWEPPMVEARSLADARALLELVGVARDAALREVGRVRHILTH 332 Query: 297 FHLDIVPMWLPVS-----SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 LD+ V+ A W + +G++ ++L Sbjct: 333 RRLDVTVARAEVAAPWRFRAKLASPYEKAAWLDPGAL-DIGVSKLARKVLS 382 >UniRef50_Q5WIF8 A/G-specific adenine DNA glycosylase n=11 Tax=Bacillales RepID=Q5WIF8_BACSK Length = 385 Score = 348 bits (893), Expect = 2e-94, Method: Composition-based stats. Identities = 119/345 (34%), Positives = 179/345 (51%), Gaps = 8/345 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F Q+++WY + R+ LPW+ PY +W+SE+MLQQT+V TVIPY+E+FM +FP + Sbjct: 37 SDFRRQLIEWYQAHKRE-LPWRESNDPYHIWVSEIMLQQTRVDTVIPYYEQFMRKFPEME 95 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A +E+L +W GLGYY+R RNL A ++V +G P+T +E+ L GVG TAG Sbjct: 96 DLAYAEEEEILKVWEGLGYYSRVRNLQAAVREVVEHYGSVVPDTRKEIEQLKGVGPYTAG 155 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS++ K P +DGNV RVL+R + + GK + K ++ ++ FNQ Sbjct: 156 AILSIAYAKAEPAVDGNVMRVLSRVFCMEDDIGKPQTRKKHEAILYELIDKSDPSSFNQG 215 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE-RTGYFLLLQHED 242 +M+LGA++CT + P C LCP++ C+A P K K+ FLL + Sbjct: 216 LMELGALVCTPTSPGCLLCPVRTQCLAYERGQQERLPIKMKKKKAKSIELESFLLKTEKG 275 Query: 243 EVLLAQRPPSGLWGGLYCFP----QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 E+L+ +RP GL GL+ P +F E ++ + I A+ +H FSH Sbjct: 276 ELLIEKRPDKGLLAGLWQLPVLEGRFDPGERQQKLAEKYHIEAEPSAANFQVKHIFSHLI 335 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 +I + N + ++LL Sbjct: 336 WEIELYVGRANRNGEL--PANCRIIKAEELEQYAFPVSQQKLLNH 378 >UniRef50_C8US26 A/G-specific DNA glycosylase n=18 Tax=Lactobacillales RepID=C8US26_LACRG Length = 372 Score = 348 bits (892), Expect = 3e-94, Method: Composition-based stats. Identities = 121/362 (33%), Positives = 175/362 (48%), Gaps = 18/362 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F +LDWYD + R TLPW+ D PY V +SE+MLQQTQV TVIPY+ RFMA+FPT Sbjct: 14 KVAAFQHALLDWYDHHAR-TLPWRQDHDPYHVMVSELMLQQTQVQTVIPYYNRFMAQFPT 72 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA AP VL W GLGYY+RAR L +AA+Q+ +GGK+P+T E+ L G+G T Sbjct: 73 VEALAAAPEATVLKAWEGLGYYSRARRLQQAAKQIVNDYGGKWPQTAAELQTLAGIGPYT 132 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S+S G+ P +DGN RV AR + V + + L + P FN Sbjct: 133 AGAIASISFGEVVPAIDGNAFRVFARLFKVDADIARPQTRKVFDDLIRPLMPKERPGDFN 192 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ + S P + P++ + + YP K K + L+++ + Sbjct: 193 QAVMDLGSSYMSASHPDPAHSPVRAFDASFRDGVVQDYPVKTKKPRPVIHRYFALVIRSK 252 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL-------------- 287 LL QRP G+ L+ FP D L + Q+ + Sbjct: 253 AGYLLEQRPGKGMLADLWMFPLI-DMAELEATMESEQLDEISARFADLSGMTLTFADLGR 311 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 +HTF+H + + ++ W + + L ++L + L Sbjct: 312 PQVQHTFTHQRWQLTLIGADAAAAELKFMPA--RWVPVEDFGKMALPTVQKKLNRALGLT 369 Query: 348 AP 349 P Sbjct: 370 EP 371 >UniRef50_A7RIG3 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RIG3_NEMVE Length = 470 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 122/403 (30%), Positives = 176/403 (43%), Gaps = 63/403 (15%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 +L WYD R +LPW+ T Y VW+SE+MLQQTQVATV+ Y+ R Sbjct: 30 DIQLIRENLLRWYDINKR-SLPWRAYATEQDANIRAYAVWVSEIMLQQTQVATVVDYYNR 88 Query: 55 FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA- 113 +M +P++ LA A L+EV W+GLGYY+RAR LH+AA +V GK P ++ Sbjct: 89 WMKNWPSLEALARASLEEVNECWSGLGYYSRARRLHEAAIKVVNELDGKIPTNAAKLQKE 148 Query: 114 LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP 173 LPGVG TAGAI S++ G+ ++DGNV RVL+R + + WSL+ ++ P Sbjct: 149 LPGVGLYTAGAIASIAFGEATGVVDGNVIRVLSRLRRIGADMTSNTTMDHFWSLAHRLVP 208 Query: 174 AVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA--------------------- 212 FNQAMM+ GA +CT P+CS C L++ C A + Sbjct: 209 NDRPGDFNQAMMEFGATLCTPKTPQCSKCVLRSSCQAHSQVEDFKDKFTKRITGERISAC 268 Query: 213 -------------------NNSWALYPGKKPKQTLPERTGYFLLLQHE------DEVLLA 247 YP K K+ E L++Q E LL Sbjct: 269 DQDIEDCSLCLPPSEPWNPEIGVCNYPMKPKKKEAREEVFTVLIVQEECNEDDSGNFLLV 328 Query: 248 QRPPSGLWGGLYCFPQFA----DEESLRQWLAQRQIA-ADNLTQLTAFRHTFSHFHLDIV 302 QRP SGL GL+ FP +E+ + + + + H FSH H V Sbjct: 329 QRPESGLLAGLWEFPNLEKEKINEDDVSALAKEYGLENVKKRNNVGELIHKFSHRHHKYV 388 Query: 303 PMWLPVSSFTGCMDEGNAL---WYNLAQPPSVGLAAPVERLLQ 342 + W N + S ++ ++++ + Sbjct: 389 VELFSCKKVNTTETNASGQPMKWVNPEELDSSAISTAMKKVFK 431 >UniRef50_Q1DAW2 A/G-specific adenine glycosylase n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DAW2_MYXXD Length = 390 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 130/346 (37%), Positives = 184/346 (53%), Gaps = 8/346 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +LDWYD+ R LPW+ + Y +WLSEVMLQQTQV+TVIPY+ERF+ARFPT Sbjct: 35 ASVRGPLLDWYDRNKR-DLPWRRTRDSYAIWLSEVMLQQTQVSTVIPYWERFLARFPTAR 93 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+APLD+VL W GLGYY+RARNLH+AAQ+V GG P T E+ LPG GR TAG Sbjct: 94 ALASAPLDDVLAGWKGLGYYSRARNLHRAAQEVVARFGGTLPSTAAELLELPGFGRYTAG 153 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ S++ G+ P++DGNV RV +R + V G PG ++ E LW+L+ + FNQA Sbjct: 154 AVASIAFGEEAPLVDGNVARVFSRIFEVEGLPGDRQREATLWALATALVKGERPGDFNQA 213 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+ GA C P C LCP++ C+A P K + T + T + H Sbjct: 214 LMEHGATTCRPENPLCLLCPVRGACVAFRKGRVDELPPAKVRATPKKLTLAVAVWPHAGT 273 Query: 244 VLLAQRPPSGLWGGLYCFP------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 +L A+R +GL+GGL+ P D E+ + A + L + +H Sbjct: 274 LLFARRADAGLFGGLWELPAAEIADDAPDSEARARLSAALGVDVTLEGALGTVKRQLTHR 333 Query: 298 HLDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQ 342 L + + + W +G++ ++R L Sbjct: 334 DLSLRLLRVSGPRRPASTPAFQELRWCAPDDAEKLGMSTAMQRALD 379 >UniRef50_B8HXE9 A/G-specific adenine glycosylase n=31 Tax=Cyanobacteria RepID=B8HXE9_CYAP4 Length = 386 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 133/353 (37%), Positives = 184/353 (52%), Gaps = 12/353 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q +L WY ++GR +LPW+ PY +W+SE+MLQQTQV TVIPY++R++A PT Sbjct: 33 QILDLQRSLLHWYRQHGR-SLPWRETSDPYAIWVSEIMLQQTQVQTVIPYYQRWLAALPT 91 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +A A +VL LW GLGYY+RARNLH+AAQ + G+FP E V LPG+GR+T Sbjct: 92 IATVAAAEQQQVLKLWQGLGYYSRARNLHQAAQLIQQEFAGQFPSQLEAVLKLPGIGRTT 151 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AG ILS + + ILDGNVKRVLAR A+ P K + LW S+++ FN Sbjct: 152 AGGILSSAFAQPVAILDGNVKRVLARLLALPVPPRKAKG--FLWQWSDRLLDRTQPREFN 209 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLLQH 240 QA+MDLGA +C KP C LCP N C A N + P + LP + G ++ Sbjct: 210 QALMDLGATVCVPKKPDCPLCPWSNHCQALQLNLQSELPVTETAAPLPHKSIGVAVIWND 269 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHT 293 E+L+ +RP GL GGL+ FP E R+ + I + L H Sbjct: 270 RGEILIDRRPQKGLLGGLWEFPGGKIEPGETVMACIQREIREELAIEIEVGEPLITIDHA 329 Query: 294 FSHFHLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 ++HF + + VS + W L + E+++ LR Sbjct: 330 YTHFKVTLNVHHCRYVSGEPQPLGCDEVRWVTLEEIDQYPFPKANEQIIAALR 382 >UniRef50_A6NQR4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NQR4_9BACE Length = 355 Score = 347 bits (890), Expect = 4e-94, Method: Composition-based stats. Identities = 116/329 (35%), Positives = 160/329 (48%), Gaps = 7/329 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q +L WY + R LPW+ D TPY VW+SE+MLQQT+VA V+ Y+ RFM P V Sbjct: 2 EQLPIPLLAWYHENAR-VLPWRSDPTPYHVWVSEIMLQQTRVAAVMGYYSRFMEALPDVA 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA D ++ LW GLGYY+RARNL KAA QV +GG P ++EE+ L GVG TAG Sbjct: 61 ALAAVEDDTLMKLWQGLGYYSRARNLKKAAGQVMERYGGAIPASYEELLTLAGVGEYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ G P +DGNV RV+AR G + ++ +++ P FNQA Sbjct: 121 AISSIAFGIPVPAVDGNVLRVVARIAGDEGDITLPATKKRMGQALQEIIPTAMPGAFNQA 180 Query: 184 MMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 MM+LGA +C P C CP C A + P K PK+ L+ HE Sbjct: 181 MMELGATVCLPNGAPLCDRCPAAGFCAALIQDKINSLPVKAPKKARRVEERTVYLIFHEG 240 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 V L +RP GL GL+ +P E++ L I + +H F+H + Sbjct: 241 RVALRRRPDKGLLAGLWEYPNELAEKT--GILESWGIVPASQRWGGTGKHIFTHIEWHMT 298 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSV 331 + + +W + + Sbjct: 299 ARMVQAETEELPEG---WVWADGPALRQI 324 >UniRef50_D1W5W3 A/G-specific adenine glycosylase n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W5W3_9BACT Length = 369 Score = 346 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 122/345 (35%), Positives = 181/345 (52%), Gaps = 12/345 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M +S F+ +L W+ + GR ++PW+ PY +W+SEV+LQQT++ Y+ERFMARFP Sbjct: 33 MGSSPFTLALLRWFSENGR-SMPWRETTDPYAIWISEVILQQTRIQQGWAYWERFMARFP 91 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 V DLA A DEVL LW GLGYY+RARNLH AA+QV L G FP T E + AL GVG Sbjct: 92 KVEDLAAASEDEVLRLWQGLGYYSRARNLHHAAKQVVEL--GHFPNTMEGLKALKGVGDY 149 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI S++ G ++DGNV RVLAR Y + E + + +L++ + PA + Sbjct: 150 TAAAIGSIAFGLPVAVVDGNVYRVLARHYGIYTPINTTEGKKEFAALAQSLLPATEPSAY 209 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MD GA+ CT + P+C +CPL + C+A P K + R ++ ++ Sbjct: 210 NQAIMDFGAIQCTPTSPRCLICPLTDSCMALRTGKVDELPIKLKTLKIKTRKLVYVYVRW 269 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + + + +R +W GL+ P + E L + Q + +H +H L Sbjct: 270 QGKTAIHRRAAGDIWQGLWE-PLLYENEELPDFPGQLLLLKK------GVKHVLTHRILQ 322 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 L + + +W ++ L +E LL+ L Sbjct: 323 ADFYLLKANEKPAL--PDDYIWVEESELDQYALPRLIELLLESLP 365 >UniRef50_B8JJK1 Novel protein similar to H.sapiens MUTYH, mutY homolog (E. coli) (MUTYH) n=2 Tax=Clupeocephala RepID=B8JJK1_DANRE Length = 526 Score = 346 bits (888), Expect = 6e-94, Method: Composition-based stats. Identities = 126/414 (30%), Positives = 187/414 (45%), Gaps = 74/414 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTP--------YKVWLSEVMLQQTQVATVIPYFE 53 + S F + ++ WYD+ R+ LPW+ T Y VW+SE+MLQQTQVATVI Y+ Sbjct: 55 EISVFRSDLMKWYDENKRE-LPWRTLATTEQDDNIRTYAVWVSEIMLQQTQVATVIDYYN 113 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 R+M R+PTV LA A L+EV +W+GLGYY+R R LH+ AQ+V + G+ P+T + Sbjct: 114 RWMKRWPTVEKLAAATLEEVNQMWSGLGYYSRGRRLHEGAQKVVSELDGQMPKTTAGLLK 173 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 LPGVGR TAGAI S++LG+ +DGNV RVL R A+ V + LW +++ + Sbjct: 174 QLPGVGRYTAGAIGSIALGQVTGAVDGNVIRVLCRVRAIGADSSSPAVTDALWRIADALV 233 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS----------------- 215 FNQAMM+LGA +CT P CS CP+Q C A S Sbjct: 234 DPERPGDFNQAMMELGARVCTPKSPVCSQCPIQTHCHAFKKISMKQEMDCKRLLNKLATN 293 Query: 216 ------------------------------WALYPGKKPKQTLPERTGYFLLLQH----- 240 YP K K+ +++H Sbjct: 294 PKNPVPDIENCMSAGSCNLCLSEDWDPQLGVQNYPRKPVKKAPRVEQTLTCIVEHQRAGE 353 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL------------AQRQIAADNLTQLT 288 E E LL QRP GL G++ P + + + + + ++ + Sbjct: 354 ESEYLLTQRPSKGLLAGMWELPSVLLQADISENKYKELICDMMQKWLETPLDTHSVQFVG 413 Query: 289 AFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 H FSH H + + VS + + W + ++ V+++++ Sbjct: 414 EVVHIFSHIHQTYIVFSVHVSDCSDREQKQKTCWLTKSALQKAAVSTGVKKIMK 467 >UniRef50_C6LA88 A/G-specific adenine glycosylase n=20 Tax=Clostridiales RepID=C6LA88_9FIRM Length = 365 Score = 346 bits (887), Expect = 9e-94, Method: Composition-based stats. Identities = 112/349 (32%), Positives = 172/349 (49%), Gaps = 9/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +L W+D R LPW+ TPY+VW+SE+MLQQT+V V P+F+RF P Sbjct: 6 ELESIVQPLLAWFDANAR-VLPWRDSPTPYRVWVSEIMLQQTRVEAVKPFFQRFTEALPD 64 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA +++L LW GLGYY R RN+ KAAQ V +GG+ P +E++ L G+G T Sbjct: 65 VAALAACEEEKLLKLWEGLGYYNRVRNMQKAAQTVMEEYGGELPADYEKLLKLKGIGSYT 124 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ P +DGNV RV++R A K V+ ++ ++ P FN Sbjct: 125 AGAIASIAFQIPVPAVDGNVLRVISRITASEKDILKASVKKEVEDEIREIIPPERAGAFN 184 Query: 182 QAMMDLGAMICTRSKP-KCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+M+LGA++C + P KC CPL C+A + P K + + L+++ Sbjct: 185 QALMELGAVVCVPNGPAKCDACPLYGQCLARERGIVSSLPKKSAAKPRRVQERTVLIIRD 244 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + V + +RPP GL GLY P +E + + L ++ + + L +H FSH Sbjct: 245 GERVAIHKRPPKGLLAGLYELPNVEGHLGQEEVLKLLKKKHFSPIRIQPLEEAKHIFSHV 304 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQLR 345 + + V E L+ + + A + L+ Sbjct: 305 EWHMTGYVITVEE---PERETEYLFVEPRETEENYPVPAAFSAYTKYLQ 350 >UniRef50_A8R947 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8R947_9FIRM Length = 348 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 116/337 (34%), Positives = 170/337 (50%), Gaps = 9/337 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + Q+LDWY + R LPW+ TPY+VW+SE+MLQQT+V V PYFERF+A P Sbjct: 6 DKEQLTKQLLDWYKQNAR-VLPWREKATPYRVWVSEIMLQQTRVEAVKPYFERFIATIPD 64 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LAN + + LW GLGYY R N+ K AQ HGGK P T+ E+ LPG+G T Sbjct: 65 IPTLANTEDEVLAKLWEGLGYYRRVYNMKKCAQVCVEKHGGKLPNTYAELLELPGIGAYT 124 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ G+ +DGNV RV +R + K+ V+ + + + P FN Sbjct: 125 AGAIASIAFGECVAAVDGNVLRVFSRVLVLEEDILKESVKRQYAKIVQMYIPKHESSAFN 184 Query: 182 QAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+M+LGA IC + P+C++CP+ + C P K K+T +L++ Sbjct: 185 QALMELGATICVPNAAPRCNICPIADNCKGYQCGKAHYLPNKAAKKTRRIEKKTICVLRY 244 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI-AADNLTQLTAFRHTFSHFHL 299 +D+ + QR GL GLY F +S + ++ N+ +L +H FSH Sbjct: 245 KDKFHVRQRASQGLLAGLYEFDWLEGYQSKKAVSEYYKMYVVKNIRKLPDAKHVFSHIEW 304 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQP-PSVGLAA 335 + + V M +W L + S + Sbjct: 305 HMKGYLVEVE-----MPCEEGVWCTLEELKASYAIPT 336 >UniRef50_A0LLV9 A/G-specific DNA-adenine glycosylase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LLV9_SYNFM Length = 388 Score = 344 bits (884), Expect = 2e-93, Method: Composition-based stats. Identities = 115/355 (32%), Positives = 186/355 (52%), Gaps = 11/355 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 Q +L W+D+ R LPW+ PY++W+SE+MLQQTQV T++PYF R+M RFP V Sbjct: 9 QIQTLLLSWFDENQR-PLPWREKYRPYEIWISEIMLQQTQVKTMLPYFRRWMERFPDVQS 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 +A+A DEVL W GLGYY+RA N+ + A+ + HGG FP+ + +PG+G TAGA Sbjct: 68 IADAREDEVLKHWEGLGYYSRAVNIRRTAEIIVRHHGGTFPKAHSTILGMPGIGPYTAGA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ + P++DGNV+R+LAR + + +K +W+ +E++ PA +FNQA+ Sbjct: 128 ISSIAFNEDRPLVDGNVERILARLFNLDTPVEEKNTRKFIWNTAEELIPAGRARQFNQAL 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLGA +C +P C CPL C + + P ++ + +L H V Sbjct: 188 MDLGATVCLPRRPACEKCPLNGLCESRRMGTADRRPVTNRRKDIASIEVAVGILHHRGRV 247 Query: 245 LLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ +RP SGL L+ FP + ++ +R++ + ++ +L + RH ++ F Sbjct: 248 LIQKRPASGLMPNLWEFPGGKIHPGESPEQALIREFREELELEVRCRERLASIRHNYTSF 307 Query: 298 HLDIVPMW-LPVSSFTGCMDEG--NALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 + + P S + A W + + A +L+ + P Sbjct: 308 RVLLHAFLCRPADSRPRPVLRSAVEARWVVVEELDQYAFPAANRKLIDLVSGRKP 362 >UniRef50_Q1MS05 A/G-specific DNA glycosylase n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MS05_LAWIP Length = 363 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. Identities = 108/355 (30%), Positives = 183/355 (51%), Gaps = 20/355 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L+W+ K R LPW+ PY VW++EVM+QQTQ+ + YF R++ +FP + Sbjct: 2 KEIQQALLNWFYKNKR-ALPWRETYLPYHVWIAEVMMQQTQMDRGVEYFLRWIKQFPDIA 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A AP +++L W GLGYY R R+L AAQ + + G FPE +E++ LPG+G TAG Sbjct: 61 SVAYAPEEKLLSAWEGLGYYRRVRHLQSAAQVIMHKYNGTFPERYEDILELPGIGPYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S + + FP +DGNV+RVL+R + ++ K+ ++KL+ L +Q+ P FNQ+ Sbjct: 121 AIASTAFNQDFPCIDGNVERVLSRIFDINTHIKKEPTKSKLYDLVKQLMPKKNARDFNQS 180 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++C KP C +CP+ + C + N + P K +LQ + Sbjct: 181 VMELGALVCK-KKPMCLICPVYSMCNSRINGTQLSRPVLAKKTPTTRLKMVTGILQCNQK 239 Query: 244 VLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + QR + +WG L+ FP + + +R W Q + +T H ++H Sbjct: 240 IFIQQRLDNNIWGRLWEFPNGCIETGETPESAIIRNWNEQLGFSIQIENIITTIIHNYTH 299 Query: 297 FHLDIVPMWLPVSS-----------FTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 +H+ + + S + + W + + ++ L +P +L Sbjct: 300 YHITLYCFDICFSQNITNTLSVVLPNPTRLSASSYRWVSQKELQTIPLPSPHRKL 354 >UniRef50_O31584 Probable A/G-specific adenine glycosylase yfhQ n=8 Tax=Bacillaceae RepID=YFHQ_BACSU Length = 369 Score = 344 bits (882), Expect = 4e-93, Method: Composition-based stats. Identities = 114/348 (32%), Positives = 169/348 (48%), Gaps = 13/348 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 QF ++ W+++ R LPW+ D+ PYKVW+SEVMLQQT+V TVIPYF RF+ +FPT Sbjct: 12 DIQQFRDDLISWFEREQR-VLPWREDQDPYKVWVSEVMLQQTRVETVIPYFLRFVEQFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A ++VL W GLGYY+R RNL A ++V +GG P ++ L GVG T Sbjct: 71 VEALADADEEKVLKAWEGLGYYSRVRNLQSAVKEVKQEYGGIVPPDEKDFGGLKGVGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GA+LS++ K P +DGNV RV++R ++ K + FN Sbjct: 131 KGAVLSIAYNKPIPAVDGNVMRVMSRILSIWDDIAKPKTRTIFEDAIRAFISKEKPSEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 Q +M+LGA+ICT P C LCP+Q C A + P K K+ +T ++L E Sbjct: 191 QGLMELGALICTPKSPSCLLCPVQQHCSAFEEGTERELPVKSKKKKPGIKTMAAIVLTDE 250 Query: 242 D-EVLLAQRPPSGLWGGLYCFPQFADEESLRQ--------WLAQRQIAADNLTQLTAFRH 292 D +V + +RP GL L+ FP ++ ++ + I AD H Sbjct: 251 DGQVYIHKRPSKGLLANLWEFPNLETQKGIKTEREQLIAFLENEYGIQADISDLQGVVEH 310 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 F+H +I + V + + +++ Sbjct: 311 VFTHLVWNISVFFGKVKQVSDTSKLKK---VTKEELEQFAFPVSHQKI 355 >UniRef50_D1BPM8 A/G-specific adenine glycosylase n=3 Tax=Veillonella RepID=D1BPM8_VEIPT Length = 365 Score = 343 bits (881), Expect = 4e-93, Method: Composition-based stats. Identities = 107/355 (30%), Positives = 167/355 (47%), Gaps = 19/355 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++ Q+L WYD + R+ LPW+ PYK+W+SEVM QQT++ + PY++ +M FPT+ Sbjct: 7 PKWVPQLLAWYDVHKRE-LPWRDCGDPYKIWVSEVMSQQTRIEAMKPYYDNWMRLFPTLE 65 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A DEV+H W GLGYY+RARNL + V +GG P + + +L GVG TAG Sbjct: 66 DLAKASEDEVVHAWQGLGYYSRARNLRLGVKDVVENYGGIVPHDRKTMESLKGVGSYTAG 125 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+LS++ + +DGNV R+ AR Y + + + + ++ E+ P FNQA Sbjct: 126 AVLSMAYNEPEAAVDGNVLRIYARLYRIFDDILSTKGKKTITAIVEETLPHDRPGDFNQA 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MD G+ +C P+C CP+ N C A + P + K + E + +L ++ Sbjct: 186 LMDFGSAVCIPKTPRCGECPIVNMCAAYQHEDTDKLPVRIKKTKVVEVPLFVGILNYKGY 245 Query: 244 VLLAQRPPSGLWGGLYCFPQFA-------DEESLRQWLAQRQIAADNLTQL-TAFRHTFS 295 LL +RP GL ++ FP E L + + T L H FS Sbjct: 246 YLLHKRPNRGLLRSMWEFPSVEMVSAFEDGERGLEELVQPLGFELSLQTVLVKEITHIFS 305 Query: 296 HFHLDIVPMWLPVSSFTGCMD----------EGNALWYNLAQPPSVGLAAPVERL 340 H + ++ + + + + A P +L Sbjct: 306 HRKWFMKAFRGDLTYTGNANNISIADIQQQLPKDWMLIKRDEFSDYAWAGPHGKL 360 >UniRef50_A3MA77 A/G specific adenine glycosylase n=18 Tax=Acinetobacter RepID=A3MA77_ACIBT Length = 355 Score = 343 bits (881), Expect = 4e-93, Method: Composition-based stats. Identities = 155/349 (44%), Positives = 202/349 (57%), Gaps = 6/349 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS +L+W+D++GR LPWQ+ PYKVW+SE+MLQQTQV TV+ YF+RFM RFP Sbjct: 1 MPEFSFSDALLNWFDQHGRHDLPWQVADDPYKVWVSEIMLQQTQVKTVLQYFDRFMERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV L A DEV W GLGYYARARNLHKAA VA GKFPET EE ALPG+GRS Sbjct: 61 TVEALGYATWDEVAPYWAGLGYYARARNLHKAAGLVAQQ--GKFPETLEEWIALPGIGRS 118 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA++SL L ++ I+DGNVKRVLAR +A+ K + E ++W L+E++ P + Sbjct: 119 TAGALMSLGLRQYGVIMDGNVKRVLARFFAIEDDLSKPQHEREMWKLAEELCPTQRNHDY 178 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA ICT KP C CP+Q C A P KKPK+T P +T L++Q Sbjct: 179 TQAIMDLGATICTPKKPLCLYCPMQAHCQAYQQGLEQELPFKKPKKTPPVKTADVLIIQC 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 EDE QR GLWGGL+C P +E + Q ++ T H+F+HF Sbjct: 239 EDEWFWQQRQAHGLWGGLFCLPILENEHERLKLSQQFKLQPQPQT--FQISHSFTHFTWL 296 Query: 301 I--VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + E W + Q + G+ +++L+ R Sbjct: 297 LNAHVFHVEPDQKEHLAIELEGQWLSPEQAIAKGVPTAMKKLISTSRHD 345 >UniRef50_C6D1C3 A/G-specific adenine glycosylase n=4 Tax=Bacillales RepID=C6D1C3_PAESJ Length = 398 Score = 343 bits (881), Expect = 5e-93, Method: Composition-based stats. Identities = 119/383 (31%), Positives = 174/383 (45%), Gaps = 47/383 (12%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++L WY + R LPW++++ PY+VW+SE+MLQQT+V TVIPY+ERFM +FPTV L Sbjct: 13 FSRELLTWYRRIKR-DLPWRMNQDPYRVWVSEIMLQQTRVDTVIPYYERFMNKFPTVRAL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP EVL W GLGYY+RARNL A++V +GG P+ VA L GVG T GAI Sbjct: 72 AEAPEPEVLKCWEGLGYYSRARNLQAGAREVVERYGGIVPDDKVAVAGLKGVGPYTTGAI 131 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S++ + P +DGNV RVL+R + + K + L+ + P FNQA+M Sbjct: 132 MSIAFNRPEPAVDGNVMRVLSRYFCLEDDIAKPATRVGIEKLAVSLIPEGAAGDFNQALM 191 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL----QHE 241 +LGA++CT P C CP+ C A P K + ++ +E Sbjct: 192 ELGALVCTPKSPSCLPCPVMEHCEARLAGRETELPIKTKAKPPRPEKRVAAIIAGTGVYE 251 Query: 242 DEVLLAQRPPSGLWGGLYCFPQF----------------------ADEESLRQWLAQRQI 279 ++L+ QRP +GL ++ P A E R + + Sbjct: 252 GKILVRQRPETGLLAQMWELPHLLVPKEQEQIVDAAPETDGSTAAAGEWLTRSMEEETGL 311 Query: 280 AADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMD--------------------EGN 319 H FSH H + + + G + +G Sbjct: 312 LVRPERWFMDADHVFSHIHWKMRIYLADLGALAGAGEAAAGASLAAETAAAYEAGGKQGE 371 Query: 320 ALWYNLAQPPSVGLAAPVERLLQ 342 W ++ R+L+ Sbjct: 372 YRWIGPEDMETLAFPNLFIRILR 394 >UniRef50_Q9UIF7 A/G-specific adenine DNA glycosylase n=51 Tax=Chordata RepID=MUTYH_HUMAN Length = 546 Score = 343 bits (880), Expect = 6e-93, Method: Composition-based stats. Identities = 135/412 (32%), Positives = 193/412 (46%), Gaps = 65/412 (15%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI--------DKTPYKVWLSEVMLQQTQVATVIPYFE 53 + + F +L WYD+ R LPW+ D+ Y VW+SEVMLQQTQVATVI Y+ Sbjct: 89 EVTAFRGSLLSWYDQEKR-DLPWRRRAEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYT 147 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 +M ++PT+ DLA+A L+EV LW GLGYY+R R L + A++V GG P T E + Sbjct: 148 GWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQ 207 Query: 114 L-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 L PGVGR TAGAI S++ G+ ++DGNV RVL R A+ P V +LW L++Q+ Sbjct: 208 LLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLV 267 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL-------------- 218 FNQA M+LGA +CT +P CS CP+++ C A Sbjct: 268 DPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVE 327 Query: 219 ---------------------------YPGKKPKQTLPERTGYFLLLQHED----EVLLA 247 +P K ++ E + +L+ ++LL Sbjct: 328 ECAPNTGQCHLCLPPSEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLV 387 Query: 248 QRPPSGLWGGLYCFPQFADEESL---RQWLAQR------QIAADNLTQLTAFRHTFSHFH 298 QRP SGL GL+ FP E S R+ L Q + A +L L HTFSH Sbjct: 388 QRPNSGLLAGLWEFPSVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIK 447 Query: 299 LDIVPMWLPVSSFTGCMD-EGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 L L + T A W + + ++ ++++ + + P Sbjct: 448 LTYQVYGLALEGQTPVTTVPPGARWLTQEEFHTAAVSTAMKKVFRVYQGQQP 499 >UniRef50_C6ACA0 A/G-specific adenine glycosylase MutY n=6 Tax=Rhizobiales RepID=C6ACA0_BARGA Length = 352 Score = 343 bits (879), Expect = 7e-93, Method: Composition-based stats. Identities = 121/348 (34%), Positives = 182/348 (52%), Gaps = 18/348 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERF 55 + S+ +L WYD+ R LPW+I PY+VWLSE+MLQQT V TV PYF++F Sbjct: 3 EISSHLLAWYDQNHRH-LPWRITPEKQKQGIRPDPYQVWLSEIMLQQTTVETVKPYFKKF 61 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 + +P ++ LA A D+++ W GLGYY+RARNL K AQQ+ + G+FP++ +E+ L Sbjct: 62 LKLWPDLSSLAKASQDDIMKAWAGLGYYSRARNLKKCAQQLVENYAGQFPQSVKELRTLA 121 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TA AI S++ ++DGNV+RV+AR +A++ K +VE + ++++T Sbjct: 122 GIGDYTAAAIASIAFNHPVAVVDGNVERVVARLFAITSILQKAKVE--IKEKTQKITALN 179 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 F QAMMDLGA +CT KP C CPLQ C AA +P K PK+ P +TG Sbjct: 180 RPGDFAQAMMDLGATVCTPRKPSCYTCPLQCLCKAAKTQQQESFPVKAPKKERPSKTGIA 239 Query: 236 LL-LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 + L + ++ L +R L GG+ P + Q N H F Sbjct: 240 FVALNKKRQIYLEKRQTQKLLGGMTQIPNNI--GINNENGLQNAPFTANWQFKGQITHVF 297 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 +HF L + + + M+ N W N+ L +++ + Sbjct: 298 THFSLKLDVYY---TEDIHEMNCENGWWCNIQHLAEEALPTVMKKAIS 342 >UniRef50_C4FU23 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FU23_9FIRM Length = 410 Score = 343 bits (879), Expect = 7e-93, Method: Composition-based stats. Identities = 122/364 (33%), Positives = 176/364 (48%), Gaps = 24/364 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + F +L WYDK R LPW+ + PY +W+SE+MLQQTQVATVIPY++RFMA PT Sbjct: 42 RVHAFRKDLLAWYDKNKR-DLPWRKSQDPYAIWVSEIMLQQTQVATVIPYYQRFMAALPT 100 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA AP + +L+LW GLGYY+R RN+ AAQQV GG+ P+ + + +L G+G T Sbjct: 101 VADLAQAPEETLLNLWQGLGYYSRVRNMQAAAQQVMADFGGRMPDQVDSLLSLKGIGPYT 160 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S++ G+ P LDGN+ R++AR + + + + + + FN Sbjct: 161 AAAIASMAFGRVAPALDGNLFRIVARLFRLKDDIALPKSRKVFMEILDILIDPDRPGDFN 220 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY-FLLLQH 240 QAMMDLGA + T S P+ PL + YP K K + +L Sbjct: 221 QAMMDLGACVMTPSNPRPDNHPLAAYDASYQVGDSDQYPVKSKKVKQTRHDLTAYFVLDS 280 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADE------------ESLRQWLAQRQIAADN----- 283 + LL + + L GL+ FP E L WL + + + Sbjct: 281 QGNWLLRRHGEAELLTGLWHFPMLEQSMVYEEVTAAELTEPLLDWLREEALVEGDSPLSS 340 Query: 284 ---LTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 L +H FSH + + L + +G + EG W + L+ E+L Sbjct: 341 QVVSPALGQVKHVFSHRIWQVQLVGLRLD-TSGPLQEG-WCWVAPEDVGQLPLSTLQEKL 398 Query: 341 LQQL 344 +Q L Sbjct: 399 MQVL 402 >UniRef50_C4G9U8 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9U8_9FIRM Length = 374 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 109/320 (34%), Positives = 164/320 (51%), Gaps = 5/320 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 S +DW+ R LPW+ D+ PY +W+SE+MLQQT+V V PY++ F+A P Sbjct: 5 FDLSPLVTPAMDWFQVNKR-DLPWRRDRDPYHIWVSEIMLQQTRVEAVKPYYQGFLAALP 63 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ DLA P D++ LW GLGYY+R RN+ KAA+ + GG+ P ++ + +LPG+G Sbjct: 64 SIADLAACPQDQLNKLWEGLGYYSRVRNMQKAAEVIMEKFGGQMPSDYDAILSLPGIGPY 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ P +DGNV R+LAR + + + E V PA G F Sbjct: 124 TAGAISSIAFDLPAPAVDGNVLRILARVSEDDIDIKSDLAKKRAQAALEGVMPASGSGIF 183 Query: 181 NQAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 NQAMM++GA C P C CP + C+A + SW P + + +++ Sbjct: 184 NQAMMEIGATACLPNGDPLCGQCPWKKYCLANRHGSWDHLPVRSKGKPRRIEERTVFIVR 243 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + V++ +RP GL GLY FP + EE+ + Q + +L RH FSH Sbjct: 244 NGSRVVIDRRPDRGLLAGLYEFPNVREHLSEEAALAYAEQMGYEPLRIRRLEEARHIFSH 303 Query: 297 FHLDIVPMWLPVSSFTGCMD 316 ++ + VS + Sbjct: 304 VEWRMIGYEIRVSDAELAKE 323 >UniRef50_C9LHH9 A/G-specific adenine glycosylase n=4 Tax=Bacteroidales RepID=C9LHH9_9BACT Length = 356 Score = 342 bits (878), Expect = 1e-92, Method: Composition-based stats. Identities = 115/347 (33%), Positives = 186/347 (53%), Gaps = 14/347 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +A F+ Q+ +WY ++ LPW+ K PY++W+SE++LQQT+V Y+ RF+ RFP+ Sbjct: 12 EAHYFAEQIEEWY-LNNKRELPWRDIKDPYRIWISEIILQQTRVVQGYDYYLRFIDRFPS 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A DEVL LW GLGYY+RARNL+ AA+ + G+FP ++++ L GVG T Sbjct: 71 VEDLAKADEDEVLKLWQGLGYYSRARNLYSAAKSIVER--GEFPTNYKDIRMLKGVGDYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S + + ++DGNV RVLAR + ++ E + +L++ + ++N Sbjct: 129 AAAIASFAYNLPYAVVDGNVYRVLARYWGITTPIDTTEGKKLFAALAQNLLDKKNPAQYN 188 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MD GA+ C + P C+ CPLQ CIA S + P K K + R +L ++++ Sbjct: 189 QALMDFGALQCVPNNPNCNECPLQANCIAFLETSVSSLPIKAKKTAIRTRFLVYLYIRYK 248 Query: 242 DEVLLAQRPPSGLWGGLYC--FPQFADEESLRQWLAQRQIAADNLTQLT--AFRHTFSHF 297 D +LL +R +W GLY +F++ L + + + +L +H +H Sbjct: 249 DRILLHRRERGDIWQGLYEPLLIEFSEHAELPSIIQKANLPKQAAFRLAVKEIKHVLTHQ 308 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 L + + ++ FT + + + P RL++QL Sbjct: 309 VLIVDCYEVDLTKFTTIE---KYSFIPEKERANY----PASRLVEQL 348 >UniRef50_A6GM64 Probable a/g-specific adenine glycosylase protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GM64_9BURK Length = 377 Score = 341 bits (876), Expect = 2e-92, Method: Composition-based stats. Identities = 140/369 (37%), Positives = 190/369 (51%), Gaps = 33/369 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + F ++ W ++GR+TLPWQ YKVWLSEVMLQQTQV TV+ Y+ RF+ +P Sbjct: 4 LTGEPFGEVLVRWQKQHGRQTLPWQHTGDAYKVWLSEVMLQQTQVTTVLAYYARFLQAYP 63 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV DLA AP +V+ LW GLGYY RARNLH A+QVA GG+FP T E+ +LPG+G+S Sbjct: 64 TVADLAGAPEQDVMQLWAGLGYYTRARNLHACAKQVAARFGGQFPRTVAELESLPGIGQS 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI SL+ G PILDGNVKRV R Y + G+P + ++ LW+++E P + Sbjct: 124 TAGAIASLAYGVQAPILDGNVKRVFCRYYGIEGYPEQTTIKKTLWAIAEANVPEQQPGVY 183 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MDLGA C P CS CPL C+A L P KPK+ PE ++++ Sbjct: 184 NQALMDLGATCCVPRNPACSACPLMQSCVALQKGMVGLLPTPKPKKARPELHFVSMIVED 243 Query: 241 E-DEVLLAQRPPSGLWGGLYCFP--------QFADEES-----LRQWLAQRQIAADN--- 283 E VLL + G+W GL+ P FAD+ W+ + + A Sbjct: 244 ERGAVLLELQTGKGVWQGLWTTPFEACGSQGGFADDFDNLTRIAAVWVERYGLQAHRVEL 303 Query: 284 ------LTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 L H +H + + + V +W + V Sbjct: 304 EHQLRALQGQPWLVHELTHRKMHFKLLRVKV----------PGVWVGCHVVSEKPVPKIV 353 Query: 338 ERLLQQLRT 346 +LL+Q R Sbjct: 354 HKLLEQSRA 362 >UniRef50_B9E7Y8 A/G-specific adenine glycosylase homolog n=4 Tax=Staphylococcaceae RepID=B9E7Y8_MACCJ Length = 344 Score = 341 bits (874), Expect = 3e-92, Method: Composition-based stats. Identities = 109/344 (31%), Positives = 164/344 (47%), Gaps = 8/344 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + QFS +LDW+ K R+ +PW+ K PYK+WLSEVMLQQTQV TV PY+ +F RFP Sbjct: 2 LNKQQFSQHLLDWFYKNKRE-MPWRETKDPYKIWLSEVMLQQTQVNTVKPYYLKFTERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA+A +DEV W GLGYY+R RN H A ++V + G P E+ L GVG Sbjct: 61 DIRTLASAEIDEVTKYWEGLGYYSRVRNFHSAVKEVQESYNGVVPNNPEDFLKLKGVGPY 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GA++S++ P +DGNV RV +R V P F Sbjct: 121 TQGAVMSIAFNHQIPAVDGNVYRVFSRLDNDDFDISSSSARRHFEDKVMDVIPKA-AGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 N+A+M+LGA +CT P C CP+Q C + + L P K K L++ H Sbjct: 180 NEALMELGATVCTPKSPLCMFCPVQQHCESYEAGTVQLRPVKLKKIKKKTEHWKVLVIMH 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-DNLTQLTAFRHTFSHFHL 299 +D++ + QRP +GL ++ FP F + Q ++ + +H F+H Sbjct: 240 DDKIYIEQRPSTGLLQSMWQFPMFNIDTHQDTIEEQLNMSLYVEADPFMSLKHQFTHVTW 299 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 D+ A + ++ + + ++ ++ Sbjct: 300 DMEVYRAVALGRVD-----KARFIDIEEKERFNFPVSMTKIFKK 338 >UniRef50_C9MR28 A/G-specific adenine glycosylase n=4 Tax=Prevotella RepID=C9MR28_9BACT Length = 343 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 122/340 (35%), Positives = 175/340 (51%), Gaps = 9/340 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F++ +L WYD + LPW+ K Y +WLSEV+LQQT++ + Y++ FM ++PTV DL Sbjct: 3 FASTLLLWYD-NNERPLPWRETKDAYAIWLSEVILQQTRIVQGMSYWQHFMRQWPTVYDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA DEVL W GLGYY+RARNLHKAAQQV L G FP T +E+ L GVG TA AI Sbjct: 62 ANASEDEVLKAWQGLGYYSRARNLHKAAQQVVELKG--FPHTAKELKKLKGVGEYTAAAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + + ++DGNV RVL+R + + + SLS+ + P R+NQA+M Sbjct: 120 ASFAFDEKIAVVDGNVYRVLSRYKGIDTPIDTTSGKKEFQSLSQTLLPTHDSARYNQAIM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA+ CT + P C +CPL C+A P K K ER ++ +Q+ DE++ Sbjct: 180 DFGAIQCTPTSPHCDICPLCESCVAFREQRINQLPVKSKKIHQQERHLTYIYIQYNDEIV 239 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR +W GL+ PQ + + + +H +H L Sbjct: 240 LHQRGAGDIWQGLWELPQLEH----LSKEDGEEWKREAIEVCKDVKHVLTHQLL--FANL 293 Query: 306 LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + + +W + L +E LL +LR Sbjct: 294 YYWAPQQRPILSADFIWIPTEKIADYALPRLMEILLDRLR 333 >UniRef50_B8DFN3 A/G-specific adenine glycosylase n=14 Tax=Listeria RepID=B8DFN3_LISMH Length = 362 Score = 340 bits (872), Expect = 5e-92, Method: Composition-based stats. Identities = 112/346 (32%), Positives = 171/346 (49%), Gaps = 12/346 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F ++ WY+ R LPW+ + PY++W+SE+MLQQT+V TVIPYF RFM +FPT Sbjct: 10 KITAFQEALVSWYEANKR-VLPWRENTEPYRIWVSEIMLQQTKVDTVIPYFNRFMTQFPT 68 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + D A ++L W GLGYY+R RNL A +QV G+ P + +L GVG T Sbjct: 69 MEDFVQADEADILKAWEGLGYYSRVRNLQTAMKQVMADFSGEVPTDLTTILSLKGVGPYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAILS++ + P +DGNV RV+AR +S K + Q+ FN Sbjct: 129 AGAILSIAYNQAEPAVDGNVMRVIARVLEISEDIMKASTRKIFEEVLYQLIDKENPAAFN 188 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 Q +M++GA++CT +KP C LCPLQ C A N YP K K + +++ E Sbjct: 189 QGLMEIGALVCTPTKPMCMLCPLQPFCDAHKNGVETNYPVKIKKVKTKTKELLSIIVISE 248 Query: 242 D-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-------TQLTAFRHT 293 D +V + +RP +GL ++ FP + +A+ Q + + +H Sbjct: 249 DGKVAIEKRPENGLLANMWQFPTIEISKKENDEVAKLQFLHNYGLDVLLEDEPIAHIKHV 308 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVER 339 FSH + + V+ N + + + P ++ Sbjct: 309 FSHLVWKMD---IRVAKLQSANPNENWCFETEEEMKRLAFPVPYQK 351 >UniRef50_A6CCE7 A/G-specific adenine glycosylase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCE7_9PLAN Length = 408 Score = 339 bits (871), Expect = 6e-92, Method: Composition-based stats. Identities = 122/394 (30%), Positives = 181/394 (45%), Gaps = 52/394 (13%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F Q+ WY + R LPW+ P+ VW+SE+MLQQT VA VIPYF+RFM+RFP Sbjct: 12 RRQKFRRQLQSWYVSHQR-DLPWRRQHDPHAVWISEIMLQQTVVAAVIPYFKRFMSRFPD 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A EVL W GLGYY+RARN+HKAA+++A G+FP E + LPG+GR T Sbjct: 71 VETLAAADESEVLQHWEGLGYYSRARNIHKAAKRIAGELEGRFPRDVESLQKLPGIGRYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S + PI++ N R+ +R + P K +N+LW +E + P FN Sbjct: 131 AGAICSFAYDTRAPIVEANTLRLYSRLIGLEEDPRSKSGQNQLWEFAELILPRKSPGEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+++CT P C CP+ GC A L P K + + T + + Sbjct: 191 QALMDLGSLVCTPQNPGCEDCPVNAGCEAFLRQRQHLIPVPKVRPEITPLTDVSIAVFSG 250 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEE--------------------------------- 268 V++ QR W GL+ FP+ EE Sbjct: 251 SHVMIRQRTAGERWAGLWDFPRLTLEEMNGSPHPTIVRKKTGRQKELFSAQQSAGVEIPE 310 Query: 269 --------SLRQWLAQRQIAADNLTQL-TAFRHTFSHFHLDIVPMWLPV---------SS 310 L +L + ++ QL T RH+ + + + ++ + + Sbjct: 311 GLSPAVISRLENYLKEEAGIEASMQQLMTEIRHSVTRYKIRLLCFVAQLDPRAAKKKTTQ 370 Query: 311 FTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 W ++ + S L+ + + L Sbjct: 371 LNSPPGNSEYQWVSVNELDSYPLSVTGRKFAKLL 404 >UniRef50_A7VHH6 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VHH6_9CLOT Length = 365 Score = 339 bits (871), Expect = 6e-92, Method: Composition-based stats. Identities = 116/348 (33%), Positives = 173/348 (49%), Gaps = 13/348 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q + ++ WY ++ R LPW+ DK PY VWLSE+MLQQT+V V Y+ F+ PT+ Sbjct: 21 KQIADNLIVWYPEHAR-DLPWRKDKEPYHVWLSEIMLQQTRVEAVKEYYRTFLQELPTIE 79 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA D ++ LW GLGYY RARNL KAA +V G FP + E+ +LPG+G TAG Sbjct: 80 ALAEVDDDRLMKLWEGLGYYNRARNLKKAAGEVVIEWQGNFPAEYNEILSLPGIGEYTAG 139 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S+ P +DGNV RV R K+ V+ K+ QV + Q+ Sbjct: 140 AIGSICFDLPTPAVDGNVLRVYTRVMEDPSNIDKQAVKKKIREELLQVYRYGHCDMLTQS 199 Query: 184 MMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M++GA IC PKC +CPLQ C A ++SW YP ++ K+ L+L++ED Sbjct: 200 LMEVGATICLPNGAPKCEVCPLQELCKAHKHDSWQQYPVREAKKKRKVEEKAVLMLRYED 259 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +V + +R GL GL+ FP + + ++ + + + T + H FSH Sbjct: 260 KVAIRKRTEKGLLHGLWEFPNLPGSYSTQEILSYVTSKNLHPKEIWMETTYTHIFSHVEW 319 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQP-PSVGLA---APVERLLQQ 343 + ++ + W L + + + AP + LL Sbjct: 320 HMKAFYMECMEQQA----KDLRWVTLEELKQEIAIPSAFAPFKDLLNS 363 >UniRef50_UPI00016C4021 A/G-specific adenine glycosylase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4021 Length = 375 Score = 339 bits (871), Expect = 7e-92, Method: Composition-based stats. Identities = 120/351 (34%), Positives = 182/351 (51%), Gaps = 14/351 (3%) Query: 7 SAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 ++L+W+DK+ R LPW+ + Y+VW+SEVMLQQT VA V+PYFERF+A FP V Sbjct: 16 RTKLLNWFDKHQR-DLPWRRTVNGARDAYQVWVSEVMLQQTTVAAVVPYFERFLAAFPDV 74 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A VL LW GLGYY RAR+LH AA+ + H G P+ + LPGVGR Sbjct: 75 RALAAADEQRVLKLWEGLGYYRRARHLHAAAKLLVEAHNGDLPDDPDVWEPLPGVGRYIL 134 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GA+LS + + PI++ N RVLAR +A G P + E + +W+ +E V PA FNQ Sbjct: 135 GAVLSQAFDRPLPIVEANSLRVLARLFAYPGDPREGEGKVWVWAAAETVLPAKRAGDFNQ 194 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 ++M+LGA++CT + P C CP+++ C A P KK + + E ++++ Sbjct: 195 SLMELGALVCTPTAPACDRCPVRDNCEAKRLGRQDQIPPKKKQPAITEVAEVGVVIRDGA 254 Query: 243 EVLLAQRPPS-GLWGGLYCFPQFADEE-------SLRQWLAQRQIAADNLTQLTAFRHTF 294 VLL QRP + G W ++ P E ++R + ++ RH Sbjct: 255 NVLLCQRPANAGRWQNMWEIPHAPRTEGEDVSAAAVRVARELTGFDVEPGAEIRTVRHGV 314 Query: 295 SHFHLDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + +V + + NA W + +++P +L+ L Sbjct: 315 TRYAITLVCVGSVLRGGAFAAGHYANAKWVAPQELGDYPVSSPQRKLMTAL 365 >UniRef50_C3J8R7 A/G-specific adenine glycosylase n=2 Tax=Bacteria RepID=C3J8R7_9PORP Length = 357 Score = 339 bits (871), Expect = 7e-92, Method: Composition-based stats. Identities = 126/350 (36%), Positives = 180/350 (51%), Gaps = 12/350 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q + ++DW+++ R+ LPW+ + PY +WLSEV+LQQT+V YF RF+ FP+VT Sbjct: 10 QQLARALVDWFEENARE-LPWRTTRDPYHIWLSEVILQQTRVNQGHDYFLRFVEAFPSVT 68 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A D+VL LW GLGYY+RA NLH+AAQ +AT H G FP+ F + ALPGVG TAG Sbjct: 69 DLAQASEDQVLSLWQGLGYYSRAHNLHRAAQVIATEHQGIFPKDFALIRALPGVGDYTAG 128 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-PAVGVERFNQ 182 AI S + +P +DGNV RV++R A + E++ + + Q Sbjct: 129 AIASFAFDMPYPAVDGNVLRVVSRLLASELPIDTLSGKQLCTQAVEELLATQLPPSKLGQ 188 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+++LGA++CT P+CS CP + C A L P K K TL +R ++ Sbjct: 189 ALIELGALVCTPQSPQCSACPASSWCRVAELPLARLLPIKGKKLTLRDRYFSYIYTTCHQ 248 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADE-----ESLRQWLAQRQIAADNLTQLT---AFRHTF 294 +VLL +R + +W GLY FP + + + WL + L H Sbjct: 249 QVLLEKRGRNDIWRGLYQFPLLESDSPQTPDEVAHWLEHELSYSGELRLAPTTFTLEHRL 308 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 SH L I + S + W N ++ L P+ R L+QL Sbjct: 309 SHQILHITIYTAELES--DAVGSTKYQWVNQSELGDYALPTPLVRFLEQL 356 >UniRef50_Q7NMA0 A/G-specific adenine glycosylase n=1 Tax=Gloeobacter violaceus RepID=Q7NMA0_GLOVI Length = 375 Score = 339 bits (870), Expect = 8e-92, Method: Composition-based stats. Identities = 135/356 (37%), Positives = 191/356 (53%), Gaps = 11/356 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + AQ+L+WY + GR LPW+ + PY +W+SE+MLQQTQV TV+PY++R++A PT Sbjct: 18 QVVRLRAQLLEWYGRMGR-DLPWRRTRDPYAIWISEIMLQQTQVKTVLPYYQRWLAALPT 76 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV-AALPGVGRS 120 V LA A L+ VL LW GLGYY RARNLHKAAQ + HGG FPET +++ ALPG+GRS Sbjct: 77 VAALAAAELEAVLKLWEGLGYYTRARNLHKAAQVIVKEHGGVFPETAQQLQQALPGIGRS 136 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S + G+ ILD N +RVL R +AV P + E KLW +S+++ F Sbjct: 137 TAGAIASSAFGRCEAILDANARRVLGRLFAVGDPPAR--AEAKLWEISQRLVDPQAPHNF 194 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MDLGA +CT P C LCP Q C+ + +P + + E G + ++ Sbjct: 195 NQALMDLGATVCTARSPLCLLCPWQVDCLGRRSGDPTHFPVRPARAVRSEIAGVSVAIEC 254 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + + LL +RP GL GL+ FP E ++L +L H F+H Sbjct: 255 QGKFLLVRRPERGLLAGLWEFPFVESVGGGEPEETVRVAFGNRLESLERLGQVEHEFTHR 314 Query: 298 HLDIVPMWLPVSSFTGCMDEG----NALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 HL + + + + W + V ++ + L+ P Sbjct: 315 HLTAQVLRAQWIAAPAALPKVFDCREHTWQPPECWLKFPMPGYVHKICKLLKEALP 370 >UniRef50_Q5HN66 A/G-specific adenine glycosylase n=5 Tax=Staphylococcus epidermidis RepID=Q5HN66_STAEQ Length = 347 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 106/348 (30%), Positives = 159/348 (45%), Gaps = 10/348 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F + DW+ K R +PW+ PY +WLSEVMLQQTQV TVI Y+ RF+ RFPT+ Sbjct: 5 NSFKKDIEDWFHKNQR-DMPWRETTNPYYIWLSEVMLQQTQVNTVIDYYYRFIHRFPTIQ 63 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L+ A DEVL W GLGYY+RARN H A ++V + G+ P E L GVG T Sbjct: 64 SLSEANEDEVLKYWEGLGYYSRARNFHTAVKEVNNNYDGEVPYDPESFKKLKGVGPYTQA 123 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A++S++ +DGNV RV +R + FNQA Sbjct: 124 AVMSIAFNHPLATVDGNVFRVWSRLNNDYRDIKLQSTRKAFEQELHPYV-LKDAGTFNQA 182 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKP---KQTLPERTGYFLLLQH 240 MM+LGA++CT P C CP+Q C A + P K K+T+ ++ FL+ Sbjct: 183 MMELGALVCTPKSPLCLFCPIQEHCEAFHMGTTQELPVKTKSLNKKTIEQK--VFLIRND 240 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-TQLTAFRHTFSHFHL 299 + LL +R L G++ FP + + + + + +H F+H Sbjct: 241 NGQYLLEKRKEK-LLNGMWQFPMREQTNANDVISDDLGKSIETINEPVFKLKHQFTHLTW 299 Query: 300 DIVPMWLPV-SSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 +I + + W+NL P++++ + + Sbjct: 300 EIKVYNVTAPLNIKENDLPKQMTWFNLDDREQYTFPVPMDKIYKFIEG 347 >UniRef50_Q1Q8E2 A/G-specific DNA-adenine glycosylase n=4 Tax=Moraxellaceae RepID=Q1Q8E2_PSYCK Length = 453 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 153/403 (37%), Positives = 208/403 (51%), Gaps = 64/403 (15%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 F+ ++L W+ GR LPWQ PY VWLSEVMLQQTQV TV+PYF RFM F Sbjct: 51 DAFAPRLLAWFADNGRHDLPWQQHQTDTPNPYIVWLSEVMLQQTQVTTVLPYFARFMDSF 110 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG--GKFPETFEEVAALPGV 117 PTV DLA A D + W GLGYYARARNLHK A+Q+ + G FP+T A+ GV Sbjct: 111 PTVQDLAAAEWDTIAEHWAGLGYYARARNLHKGAKQLVAVIDETGDFPQTLAGWEAISGV 170 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G STAGAI+S+ L ++ I DGNVKRVL R A+ G K LW+L+E++TP Sbjct: 171 GPSTAGAIMSMGLHRYGVICDGNVKRVLTRWAAIDGDITKSATTKDLWALAERLTPREQS 230 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 F QAMMD+GA +CTRSKP C LCPLQ+ C+A A YP K KQ P + LL Sbjct: 231 GLFAQAMMDMGATLCTRSKPACLLCPLQDDCLAHAQGRETEYPVKAKKQPKPSKFSNALL 290 Query: 238 LQHE-DEVLLAQRPPSGLWGGLYCFPQ--------------------------------- 263 +++E E+L QRP +G+WGGL+ P Sbjct: 291 IENENGEILWLQRPDNGIWGGLWSLPLAFIEKISGKVAVKDADKNKNKGPIAANDNQLLE 350 Query: 264 ------------FADEESLRQWLAQRQIAADNLTQL----TAFRHTFSHFHLDIVPMWL- 306 E+ + +WL + +++A +++ +H+ +HFH + P + Sbjct: 351 VASNEKIYETEFTTAEQIINEWLDKNKLSAKSVSNTLLDDAPIKHSLTHFHWYLTPQSVT 410 Query: 307 ----PVSSFTGCMD--EGNALWYNLAQP-PSVGLAAPVERLLQ 342 V+ T + E N W N ++GL + ++L+ Sbjct: 411 LNDKQVTEITKALQAAEININWLNTDDAQATLGLPRAMVKILE 453 >UniRef50_Q7VRG8 A/G-specific adenine glycosylase n=2 Tax=Candidatus Blochmannia RepID=Q7VRG8_BLOFL Length = 355 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 148/350 (42%), Positives = 216/350 (61%), Gaps = 9/350 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++L W LPWQI+KT YK+W+SEVMLQQTQVATVIPY+++F+ +FPT++ L Sbjct: 7 FSHKILSWSKNAHILKLPWQINKTIYKIWISEVMLQQTQVATVIPYYQKFIKKFPTISKL 66 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A ++E+L++W+GLGYY RA NLHK A + H G FP F + + PG+GRSTAGAI Sbjct: 67 AEANINEILYIWSGLGYYKRALNLHKTATIIIHHHNGVFPNNFNILLSFPGIGRSTAGAI 126 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQVTPAV-GVERFNQA 183 LSL+L K FPILDGN+KR+L R Y+++ ++ NKLWSL + + P FNQA Sbjct: 127 LSLALNKRFPILDGNIKRILIRYYSLNNQQTSPTKINNKLWSLIDSLLPLDSNYAIFNQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGK-----KPKQTLPERTGYFLLL 238 MMDLG +ICT S P+C++CPL + C + N+ L K PKQ LL+ Sbjct: 187 MMDLGRLICTHSNPQCNICPLNSHCQSFLTNNINLLTQKYATIHNPKQK-KIIYWLILLV 245 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 +H + + L QR +W L+CFP+F + +L WL++ + + + +H S+ Sbjct: 246 KHRNIIYLTQRLQETIWNKLFCFPEFYNRTTLNTWLSKYNLHNNLKINMPIIKHNISNIA 305 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 L+I P + +++ +E N +WYNL +P +GL PV +L+ ++ Sbjct: 306 LEIQPTLININNTIFNTNEKN-IWYNLNKPAIIGLPKPVNTILKIIQNKT 354 >UniRef50_C7G751 A/G-specific adenine glycosylase n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G751_9FIRM Length = 395 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 112/339 (33%), Positives = 160/339 (47%), Gaps = 31/339 (9%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + +L W+ + R LPW+ TPY+VW+SE+MLQQT+V V PYFERF P Sbjct: 6 DLKQLVSPLLQWFLNHAR-VLPWREKPTPYRVWVSEIMLQQTRVEAVKPYFERFTTALPD 64 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA P DE+L LW GLGYY R RN+ KAA +V +GG+ P +E++ L G+G T Sbjct: 65 AKALAVCPEDELLKLWEGLGYYNRVRNMQKAAVEVVENYGGQLPADYEKLLKLKGIGHYT 124 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG----- 176 AGA+ S++ G P +DGNV RVL R A K+ N++ L E++ Sbjct: 125 AGAVASIAYGIPVPAVDGNVLRVLTRVSADDTDIMKQSFRNEMEKLLEKLMHGADGRNEK 184 Query: 177 ---------------------VERFNQAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANN 214 FNQA+M+LGA IC P C+ CP ++ C A Sbjct: 185 NVFPWMEDADDLRAKVYHQNLAGAFNQALMELGATICVPNGAPLCTECPWKDFCEAKKQE 244 Query: 215 SWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFA---DEESLR 271 P K + L+L+ +++V + +RP GL GLY P +E + Sbjct: 245 LIGQIPVKSKAKPRKIEKKTVLILRDDEKVAIRKRPAKGLLAGLYELPNVEGNMGQEEVL 304 Query: 272 QWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSS 310 + + A + L +H FSH + + V Sbjct: 305 SMVKEMGYAPIRIQPLGEAKHIFSHIEWHMTGYAVRVEE 343 >UniRef50_A0M7B9 A/G-specific adenine glycosylase n=16 Tax=Bacteroidetes RepID=A0M7B9_GRAFK Length = 350 Score = 338 bits (866), Expect = 3e-91, Method: Composition-based stats. Identities = 125/355 (35%), Positives = 181/355 (50%), Gaps = 20/355 (5%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 S ++ WY + R LPW+ PY +WLSE+MLQQT++ +PY+ +F+ +P+V DL Sbjct: 3 LSNRLTHWYLQNKR-DLPWRKTHEPYHIWLSEIMLQQTRIEQGLPYYNKFIQAYPSVFDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA +EVL LW GLGYY+RARNLH+ A+ VA GKFP T++ + L GVG TA AI Sbjct: 62 ANATPEEVLKLWQGLGYYSRARNLHETAKYVAFELNGKFPGTYKGLLKLKGVGDYTASAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S+ + ++DGNV RVL+R + + + L++++ FNQA+M Sbjct: 122 ASICYNEPVAVVDGNVYRVLSRIFGIDTPINSAAGIKEFKLLAQELLDKNDPATFNQAIM 181 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE-V 244 + GA+ C KPKC +CP + C+A N+ P K K + +R +L+ HE+E Sbjct: 182 EFGALHCKPQKPKCEICPFNDSCLALKNDKIKELPVKLKKNKVKKRYFNYLVFNHENETT 241 Query: 245 LLAQRPPSGLWGGLYCFPQFA-----DEESLRQWLAQRQIAADNLTQLT-----AFRHTF 294 +L QR G+W GLY FP DE+ L + A + + +L H Sbjct: 242 ILEQRVGKGIWNGLYQFPLVESERLIDEKELVTFEAFKNSITGSGVELELFNEIPVIHKL 301 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ-QLRTGA 348 SH HL I W+ S G E + L + +E L L G Sbjct: 302 SHQHL-ITRFWVVNS---GTASENS---VKLKNVTDYPVPVLIENFLNEYLPDGY 349 >UniRef50_B0MYW3 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYW3_9BACT Length = 358 Score = 337 bits (864), Expect = 5e-91, Method: Composition-based stats. Identities = 117/350 (33%), Positives = 184/350 (52%), Gaps = 13/350 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + ++++L+WY + GR LPW+ + PY++W+SEV+LQQT+VA + Y+ RF+ FP V Sbjct: 12 PEVASRLLEWYGREGR-DLPWRRTRDPYRIWISEVILQQTRVAQGMSYYHRFLELFPDVA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+AP D VL W GLGYY+RARNL AA+++ HGG FP + +V ALPGVG TA Sbjct: 71 ALASAPEDLVLKCWQGLGYYSRARNLLAAARRIVETHGGVFPTAYADVRALPGVGDYTAA 130 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ + LDGNV RVL+R Y + +L++ + +NQA Sbjct: 131 AICSIAYEEPCAALDGNVFRVLSRLYDLDTPIDTTSGRRTFAALADSLIDRQRPGLYNQA 190 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MD GA+ C ++P+C+ CPL++ C+A A + + P K+ + + R +L ++ DE Sbjct: 191 IMDFGALCCLPAQPRCTECPLRDRCLAFAARTVDVRPVKQGRTAVEPRYFNYLHVECGDE 250 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ-------LTAFR---HT 293 ++L +R +W GLY FP E + + L + + H Sbjct: 251 LVLRRRGAGDIWQGLYEFPLIETPEPVEYTVLAAMPEFCALFKDAGTVCLAGTVKMPVHQ 310 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 SH + V + V +T + E L + ++ ER L + Sbjct: 311 LSHRAIHAVFYRIVVERWTPSLRE--MLVISRETLGDYAVSRLTERYLSR 358 >UniRef50_Q11NU4 A/G-specific adenine glycosylase n=5 Tax=Bacteroidetes RepID=Q11NU4_CYTH3 Length = 355 Score = 334 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 106/346 (30%), Positives = 177/346 (51%), Gaps = 7/346 (2%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +F +++ WY K R+ LPW+ + PY +WLSE++LQQT+V +PY+ F+ FPTV Sbjct: 7 KKKFPKELIKWYLKNKRE-LPWRDTRDPYPIWLSEIILQQTRVQQGMPYYFSFLKAFPTV 65 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 +A A ++L LW GLGYY+RARNLHK A QV + GG FP +++E+ L GVG TA Sbjct: 66 KHMAKASEKDILSLWQGLGYYSRARNLHKTALQVMSQFGGSFPGSYKELLDLKGVGPYTA 125 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI S + + ++DGNV RVL+R + + + + +L++Q+ P + +NQ Sbjct: 126 AAIASFAYKEQVAVVDGNVYRVLSRVFGIYEDITQNSSKKTFAALAQQLIPQKDPDIYNQ 185 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+ GA+ CT ++PKC C C+A + P K ++ ER + ++ + + Sbjct: 186 AIMEFGALHCTPAEPKCGDCCFAEICVAHLTGDQRVLPVKAKQKAKQERYMTYFMITNNN 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL----TAFRHTFSHFH 298 VL+ +R + +W GL+ F ++ A + + + + ++H SH Sbjct: 246 TVLMKERGSNDIWQGLFEFLLVETPKAASLTEALKLVPDLDKKNISGESITYKHILSHQI 305 Query: 299 LDIVPMWLPVSSFT-GCMDEG-NALWYNLAQPPSVGLAAPVERLLQ 342 L V C+ + N + P + + L+ Sbjct: 306 LHAKIYMAEVEKKVYNCIKKTYNLEEIAFNELPVTPKPVLITKYLK 351 >UniRef50_A6G959 A/G-specific adenine glycosylase n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G959_9DELT Length = 378 Score = 334 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 118/361 (32%), Positives = 172/361 (47%), Gaps = 22/361 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + + W+ + R LPW+ + PY +W+SE+MLQQT+V TV Y++ F+ RFPT Sbjct: 12 TVRRVAKALGAWFKRDAR-DLPWRRTRDPYAIWVSEIMLQQTRVDTVENYWQPFLDRFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A VL W+GLGYY RA+ LH+ AQ V GG+ P T + + A+PG+GR T Sbjct: 71 VESLAAAEQQAVLEAWSGLGYYRRAKLLHRGAQYVHEELGGEVPGTADALRAIPGIGRYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ + P++DGNV RV +R A+ + W V Sbjct: 131 AGAIASIAFDQPAPLVDGNVARVHSRLAAIEAPAEQDAKAEAHWRFVAGVLEHGEPRVLA 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+M+LGA +CT P C CP++ C A A P K K+ E L L+ Sbjct: 191 QALMELGATVCTPRSPTCLTCPVREHCRARARGLQDQIPAPKVKKKATEHHLLALALRRG 250 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADE------------ESLRQWLAQRQIAADNLT---- 285 D++L+ +RP GL GGL+C P F + LR L ++ A + Sbjct: 251 DKLLIERRPDEGLLGGLWCLPVFEAPSPVSPAPKGAKLDDLRASLERQAGEALGVKLRLE 310 Query: 286 --QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNAL---WYNLAQPPSVGLAAPVERL 340 +H FSH ++P + G+ L W P+ G+ +L Sbjct: 311 PAPAPRVKHVFSHRIWHLLPWRARANPAPRTRSRGDDLRLAWLARGSDPAGGIPTLTRKL 370 Query: 341 L 341 L Sbjct: 371 L 371 >UniRef50_B9Y8G7 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8G7_9FIRM Length = 397 Score = 334 bits (857), Expect = 2e-90, Method: Composition-based stats. Identities = 111/343 (32%), Positives = 169/343 (49%), Gaps = 12/343 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++ F ++ WY + R+ LP++ + PY +W+SE+M QQT++ +++PY++R+M R+P Sbjct: 13 VKKMNFQEELCAWYKIHHRE-LPFRQSRDPYAIWVSEIMAQQTRIDSMLPYYKRWMERWP 71 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA AP++++LH+W GLGYY RAR LH A+ V +GG P E++ +PG+G Sbjct: 72 TVEALAEAPIEDILHVWQGLGYYNRARKLHAGAKVVVERYGGLLPADVEQLRTIPGIGFY 131 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ G P +DGNV RV R KK + +W F Sbjct: 132 TAGAIGSIAYGLRAPAVDGNVLRVTTRVLQYGEDITKKTTADYVWRQVYDWMEGSNPAVF 191 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QAMM++GA++CT P+C LCPL C A + SW+ YP KK + E Y +++ Sbjct: 192 TQAMMEIGALVCTPKNPQCLLCPLAPFCGAGQDGSWSQYPVKKKAKPPLELQLYTYWIEN 251 Query: 241 EDE-VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + +LL+ GL GLY PQ A D L +H FSH Sbjct: 252 DRRQILLSDDWSDGLMEGLYRLPQAAAPLE----------QTDQLDFAGQRKHVFSHRVW 301 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + + W Q ++ + + L Sbjct: 302 KMECYRGQWDDQVQETLPEHCFWIAKDQLDTLPIVGAHRKWLD 344 >UniRef50_A8ZZH0 A/G-specific adenine glycosylase n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZH0_DESOH Length = 360 Score = 334 bits (857), Expect = 3e-90, Method: Composition-based stats. Identities = 115/359 (32%), Positives = 167/359 (46%), Gaps = 13/359 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A F ++L WY + R LPW+ K PY +W+SEVMLQQTQVATV+ Y+ RF+ FP Sbjct: 4 FSAGPFQRRLLRWYTAHQR-DLPWRRSKNPYHIWVSEVMLQQTQVATVVDYYRRFLQAFP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA A L +VL LW GLGYYARA NLHKAA+Q+ + P T E LPGVG Sbjct: 63 DIGTLAVAELQDVLKLWEGLGYYARAANLHKAARQIVAGGKKRVPRTPETFGRLPGVGDY 122 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 A+ S++ G P++DGNVKRVLAR + + + + + F Sbjct: 123 INAAVSSIAFGHPLPVVDGNVKRVLARLFLLDEPVNRPSNHRVFLEKARLLLAFKDPGTF 182 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMM+LGA++C +P C CP+ + C A +P + + P L++ Sbjct: 183 NQAMMELGALVCKPGRPLCDQCPVASFCGAHQAGRVTDFPRRLAARKNPHHHLAVGLVKK 242 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHT 293 + L+ +RP +GL GL+ P E+ R L + +L H Sbjct: 243 GNRFLIVRRPATGLLAGLWEMPGGRVEKPENPADACCRAVLESVGLTVFPGPRLARVAHA 302 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNAL---WYNLAQPPSVGLAAPVERLLQQLRTGAP 349 ++HF + + V +G + W + + + L P Sbjct: 303 YTHFKITMDLFACDV--VSGRVKRNGYQAHHWIRMKDIGQYPFHRAMHKAFAALAGALP 359 >UniRef50_A9KT85 A/G-specific adenine glycosylase n=3 Tax=Clostridiales RepID=A9KT85_CLOPH Length = 350 Score = 334 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 106/331 (32%), Positives = 159/331 (48%), Gaps = 7/331 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WYD R L W+ + PY VW+SE+MLQQT+V V YF+RF+ PT+ DLA Sbjct: 12 LLHWYDLNAR-ILEWRSNPKPYYVWISEIMLQQTRVEAVKSYFDRFIKELPTIKDLAAVE 70 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D ++ LW GLGYY RARNL KAA V + G+ P EE+ LPG+G T+GAI S++ Sbjct: 71 EDRLMKLWEGLGYYNRARNLKKAAIIVMEQYNGELPANREELKKLPGIGSYTSGAIGSIA 130 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 +DGNV RV+ R K++V+ +L E + P +NQ++M+LGA Sbjct: 131 FQLPVAAVDGNVLRVMKRIAGSFDDITKEKVKKELEEDIEAIIPKDRPGDYNQSLMELGA 190 Query: 190 MICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQ 248 +C KP C+ CP+ + C A N P K K+ L+++ + + Sbjct: 191 TVCLPNGKPLCNQCPVMHLCKAFHNGDELKIPVKPKKKERVIEEKTVLVMECAGYYAIRK 250 Query: 249 RPPSGLWGGLYCFPQFADEESL---RQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 R GL GL+ P + S+ + + A +L +H FSH ++ + Sbjct: 251 RTEKGLLHGLWELPNIEGKLSINQMEETIKTLAPYAKVERKLGEAKHIFSHKEWHMIGYY 310 Query: 306 LPVSSFTGCMDEGNALWYNLAQP-PSVGLAA 335 + V + +W + L Sbjct: 311 IKVERIED-VPIPELIWVDNQSIRKDYSLPT 340 >UniRef50_A9T1M0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T1M0_PHYPA Length = 627 Score = 334 bits (856), Expect = 4e-90, Method: Composition-based stats. Identities = 131/455 (28%), Positives = 195/455 (42%), Gaps = 113/455 (24%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID-----KTP-------------------------- 30 + A +L WYD R LPW+I+ + P Sbjct: 156 EVMTIRASLLSWYDSNHR-VLPWRINLYSCLENPSYSADGAGEICKEESGVGVMNDASLS 214 Query: 31 -------------------YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD 71 Y VW+SE+MLQQT+VATVI Y++R+M +PTV LA A + Sbjct: 215 GVLCRDSMHNATETEGERAYAVWVSEMMLQQTRVATVISYYQRWMRTWPTVYALAQATQE 274 Query: 72 ----------------EVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 EV +W GLGYY RAR L + A+++ GG+FP T EE+ +P Sbjct: 275 VSSSAFIPNSSVQSCLEVNSVWAGLGYYRRARFLLEGAKKIVEEFGGEFPRTAEELQKVP 334 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TAGAI S++ + P++DGNV RVL R A+S P W+L+ Q+ Sbjct: 335 GIGTYTAGAIASIAFKQVVPVVDGNVIRVLCRLRAISLNPKASTTVKLFWALASQLVEEY 394 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN--------NSWALYPGKKPKQT 227 FNQA+M+LGA ICT + P C+LC + + C A S +P K PK Sbjct: 395 RPGDFNQALMELGATICTPTSPSCALCSVSSQCAALKLVHRVEKNVTSVTCFPAKVPKSA 454 Query: 228 LPERTGYFLLLQHEDE---------VLLAQRPPSGLWGGLYCFPQFADE----------- 267 E +++ E + +LL QRPP GL GL+ FP + Sbjct: 455 PREEYVAVCIIELESQTKMSKAASSLLLVQRPPKGLLAGLWEFPSAPLDNPNSTEKSRLM 514 Query: 268 -------ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGN- 319 ++L + + + H FSH L + W+ +S TG +++ + Sbjct: 515 EMDNYLCDTLGFKIGSGGYTVMKREEFGTYVHVFSHIRLHMFIQWMLLS--TGEIEDKST 572 Query: 320 --------ALWYNLAQPPSVGLAAPVERLLQQLRT 346 W +L S+GL + V+++ + Sbjct: 573 PKMQRSVVTKWVDLDSMNSLGLTSGVKKVYEMFSA 607 >UniRef50_B8J1Y9 A/G-specific adenine glycosylase n=12 Tax=Desulfovibrionales RepID=B8J1Y9_DESDA Length = 435 Score = 333 bits (854), Expect = 7e-90, Method: Composition-based stats. Identities = 113/361 (31%), Positives = 181/361 (50%), Gaps = 21/361 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +LDW+ + R LPW+ TPY+VW+SEVMLQQTQ+ + YF R+M RFP + Sbjct: 51 IPALQNALLDWFAVHQR-ALPWRNSYTPYEVWISEVMLQQTQMERGVRYFIRWMERFPDI 109 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A ++VL +W GLGYY+RAR++ AA+++ H G FP + ALPGVG TA Sbjct: 110 AALAAAHEEDVLRMWEGLGYYSRARHILAAARKIMAEHNGIFPRDPAAIRALPGVGPYTA 169 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI S++ K P +D NV+RV++R + V ++ + + ++ P NQ Sbjct: 170 GAIASIAFEKKLPCVDANVERVVSRIFDVDSPVKQEPAAGVIHQWALRLVPEGKARAHNQ 229 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMM+LGA++C R KP C+ CPL CI+ P + T+ +L+ + Sbjct: 230 AMMELGALVC-RKKPCCASCPLGAFCISHHLGIEDQRPVPGKRATVTPVNAVTGVLRRGE 288 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAADNLTQLTAFRHTFS 295 ++ + +RP SG+WG L+ FP EE +R+++ + + + RH ++ Sbjct: 289 KIFVQKRPASGVWGNLWEFPGGRVEEGESPEQAVVREFMEETGFTVNVAARHGIIRHGYT 348 Query: 296 HFHLDIVPMWLPVSS-FTGCMDE-----------GNALWYNLAQPPSVGLAAPVERLLQQ 343 + L + L +++ C E W + + + + A +L Sbjct: 349 TYRLTLHCFGLDLAAPGKDCTPETCPAPPVLTAATQYRWASPQELEDLAMPAAHRKLADS 408 Query: 344 L 344 L Sbjct: 409 L 409 >UniRef50_D2MN98 Putative A/G-specific adenine glycosylase n=1 Tax=Bulleidia extructa W1219 RepID=D2MN98_9FIRM Length = 346 Score = 333 bits (853), Expect = 9e-90, Method: Composition-based stats. Identities = 114/338 (33%), Positives = 166/338 (49%), Gaps = 8/338 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 Q +F Q++ WY + + PW++ K PY VW+SE+MLQQT++ TV+P +ERFM P Sbjct: 3 FQKKEFQ-QLIQWYQAH-HQNYPWRLTKDPYAVWISEIMLQQTRIETVLPKYERFMKELP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ DLA D ++H W GLGYY+RARNL KAA Q+ H G FP EE+ AL G+G Sbjct: 61 SIHDLATVSEDHLMHFWEGLGYYSRARNLQKAALQIEDRHHGIFPHQLEEIQALTGIGDY 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSG---WPGKKEVENKLWSLSEQVTPAVGV 177 TAGAI S S G P +DGNV RV +R + P K + + Sbjct: 121 TAGAIASFSFGVGVPAIDGNVLRVYSRHEGLYQNVLDPSVKSLVKQQMLPLYTKEKHSDN 180 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 FNQA+M+LG IC P C CP+Q GC + ++ P + K + FL+ Sbjct: 181 GDFNQAIMELGEQICLPKNPDCQNCPIQKGCFSFLHHKQGELPIRISKTKKKKEEHSFLI 240 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 ++ +L+ +RP +GL LY + +L Q QI + L++ +H FSH Sbjct: 241 FYTKEAILVHKRPVNGLLANLYEPVNLDFFIDIEDFLQQHQIPVLSYQTLSSHKHIFSHR 300 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAA 335 I + V W + + ++ Sbjct: 301 IWQIQAYAIEVKELFPF---PEYQWIEKKKSQHLAFSS 335 >UniRef50_Q7UYZ0 A/G-specific adenine glycosylase n=1 Tax=Rhodopirellula baltica RepID=Q7UYZ0_RHOBA Length = 367 Score = 332 bits (852), Expect = 9e-90, Method: Composition-based stats. Identities = 123/357 (34%), Positives = 194/357 (54%), Gaps = 19/357 (5%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++L W+ R LPW+ D +PY+VW+SE+M QQTQVATV+PYFERF++ +PT+ DL Sbjct: 1 MRKRLLSWFSDNAR-DLPWRRDHSPYRVWISEIMCQQTQVATVLPYFERFLSTYPTIRDL 59 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +++ +W GLGYY RAR+LH AA+++ H G+FPE+F++V ALPG+GR TAGAI Sbjct: 60 ADADESQLMRMWEGLGYYRRARSLHAAAKKMVEEHNGEFPESFDDVLALPGIGRYTAGAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER------ 179 S+S K FPIL+GN +RV +R ++ P +K + +LW LS+++ P + Sbjct: 120 QSISRNKAFPILEGNTQRVFSRWIGLTVPPTEKVAQARLWELSDKMLPPRKADDRSNGPA 179 Query: 180 -FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FNQA M+LGA+IC+ PKC CP+ C A PGK K RT +++ Sbjct: 180 GFNQAAMELGALICSPRSPKCDECPVATMCHANQMGLQDEIPGKISKVQYESRTELAVVI 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEES-----LRQWLA-QRQIAADNLTQLTAFRH 292 +D L+ P + G+ FP+ E+ + WLA Q L +H Sbjct: 240 SRDDRYLVRTIPEGVRFAGMIDFPRAGPPEAADVVGMESWLADQLGGDVRIGMCLKTIKH 299 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEG-----NALWYNLAQPPSVGLAAPVERLLQQL 344 + + + + + + E + W + + + ++ ++++ L Sbjct: 300 AVTRYRMTLHVHIGEWIADGDRVAESGPIPDSWQWATVDELADMPMSVTGRKIVRLL 356 >UniRef50_C8WS48 A/G-specific adenine glycosylase n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WS48_ALIAD Length = 382 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 121/365 (33%), Positives = 183/365 (50%), Gaps = 25/365 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F+ + WY + R LPW+ PY + +SE MLQQT+V TVIPY+ RFM RFPT Sbjct: 6 AAFAHTLEAWYTQTSR-DLPWRRTADPYAILVSETMLQQTRVETVIPYYHRFMERFPTPL 64 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A +D+VL +W GLGYY RARNL A + V HGG+ P+ +E+ ALPG+G T G Sbjct: 65 HLADADIDDVLKMWEGLGYYRRARNLKAAMEVVRDRHGGRIPDHPDELKALPGIGPYTLG 124 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+LS++ + +P +DGNV RV++R A+ +V+ ++ + QA Sbjct: 125 AVLSIAFNRPYPAVDGNVLRVMSRYRAIEEPVDLPKVKRQIEQDVAETLERGTPRVLTQA 184 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++CT KP+CS CP+ +GC A A+ P + PK+ +T L L D Sbjct: 185 LMELGALVCTPKKPRCSACPVVSGCAARAHGLTDALPKRLPKRARRRQTVVALWLTRGDS 244 Query: 244 VLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQ-----------------RQIAADNL 284 QRP GL GG++ P + D + +W A+ R+ A Sbjct: 245 FWAEQRPEGGLLGGMWQLPSVELDDPSTSLEWHARARFAELRFGHAQRGEAVREPEAMAF 304 Query: 285 TQLTAFRHTFSHFHLDIVPMW-LPVSSFTGCMDEGN----ALWYNLAQPPSVGLAAPVER 339 ++ H F+H +V + G E + W + + + Sbjct: 305 ERVCEVSHAFTHLDWTVVVFAPVEYDQAGGQPLEIHPGHEGAWVGFEEISNFVWPKVYQD 364 Query: 340 LLQQL 344 +L+QL Sbjct: 365 VLRQL 369 >UniRef50_Q8A7M6 A/G-specific adenine glycosylase n=2 Tax=Bacteroides RepID=Q8A7M6_BACTN Length = 357 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 111/352 (31%), Positives = 168/352 (47%), Gaps = 17/352 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++WY +Y R LPW+ PY++W+SE++LQQT+VA YF RF+ RFP V L Sbjct: 13 FSEAIVEWYKEYKR-DLPWRESSDPYRIWISEIILQQTRVAQGYDYFLRFIKRFPDVKAL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A DEV+ W GLGYY+RARNLH AA+ + G FPET+ EV AL GVG TA AI Sbjct: 72 ADADEDEVMKYWQGLGYYSRARNLHAAAKSM----NGVFPETYPEVLALKGVGEYTAAAI 127 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + G + ++DGNV RVL+R + + E + +L++++ +NQ +M Sbjct: 128 CSFAYGMPYAVVDGNVYRVLSRYFGIDTPIDSTEGKKLFAALADEMLDKKQPALYNQGIM 187 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA+ CT P C CPL + C A + P K+ K R ++ ++ Sbjct: 188 DFGAIQCTPQSPDCLFCPLADSCSALSTGRVTQLPVKQHKTKTTNRYFNYIYVRAGAHTY 247 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA----------FRHTFS 295 + +R + +W L+ P +L + +L +H S Sbjct: 248 INKRTANDIWKNLFELPLIETPVALSEEEFLALPEFRSLIASGENPVIRLVCRGVKHVLS 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 H I V+ N L + ++ V+ L+Q + Sbjct: 308 HRV--IYANLYEVALPEDTKSFANFQRIRLEELDKYAISKLVQTLIQVISNK 357 >UniRef50_Q03P68 A/G-specific DNA-adenine glycosylase n=2 Tax=Lactobacillales RepID=Q03P68_LACBA Length = 379 Score = 331 bits (850), Expect = 2e-89, Method: Composition-based stats. Identities = 117/368 (31%), Positives = 176/368 (47%), Gaps = 22/368 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +F +L WYD+ GR LPW+ D+ PY VW+SE+MLQQTQV TVIPY+ FMA FPT Sbjct: 8 TIREFRQTLLAWYDREGR-DLPWRHDQDPYHVWVSEIMLQQTQVQTVIPYYLNFMAMFPT 66 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA AP +++L W GLGYY+RARNL +AA+Q+ + GK+P+T E+ L G+G T Sbjct: 67 VADLAQAPEEQLLKAWQGLGYYSRARNLQRAARQLVDDYRGKWPQTAAELLDLTGIGPYT 126 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ G+ P +DGN RV +R + K + + + + FN Sbjct: 127 AGAIASIAFGEVVPAVDGNAFRVFSRLLLIDADIAKPQTRKLFEQVIQPIVDPQRPGAFN 186 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ T + P++ + N A +P K Y L++ Sbjct: 187 QAIMDLGSSYMTAKNSDPAHSPVRAFDASYRENCVADFPVKTKAPRPRPVPYYGLIVTSP 246 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLT---------- 288 L+ QR + + G + FP +++ Q A Q +L Q Sbjct: 247 AGYLMVQRDATQMLTGYWTFPLIQQADLQDATDQQPATLQTDIAHLEQRFMTDYQLPITL 306 Query: 289 ------AFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 HT++H + + +SS W + ++ L E+L+ Sbjct: 307 TPLSGQPVSHTYTHQRWQVKLLVAELSSAADLTFYP-GKWVAASTIKTLPLPKVQEKLMT 365 Query: 343 QLRT-GAP 349 + + G P Sbjct: 366 RYQNQGHP 373 >UniRef50_B5EQC0 A/G-specific adenine glycosylase n=3 Tax=Acidithiobacillus RepID=B5EQC0_ACIF5 Length = 369 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 144/366 (39%), Positives = 189/366 (51%), Gaps = 25/366 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +L WY ++GR LPW+ + PY++WL+E+MLQQTQV + PY+ RF+ P L Sbjct: 4 IAEPLLAWYTRHGRHDLPWRQTRDPYRLWLAEIMLQQTQVESAKPYYVRFLQALPNWQAL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP D VL LW+GLGYYARARN +AAQ V T G FP+T E+ LPGVGRSTA A+ Sbjct: 64 AAAPQDRVLALWSGLGYYARARNAQRAAQTVMTGFAGHFPDTLEKAITLPGVGRSTAAAV 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 L+ + G ILD N +RVL R +A+ P + LW+L+ +TP +NQA+ Sbjct: 124 LASAFGHRQAILDANARRVLIRSHAIDADPKAAATQQWLWTLASALTP-EDAHGYNQAIQ 182 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-EDEV 244 DLGAMICT +P+C CPL + C A A P + T P+R +FLL E + Sbjct: 183 DLGAMICTPRQPRCPDCPLASRCKAHAQGRSHALPVTVHRPTKPQRRAFFLLAADMEGRI 242 Query: 245 LLAQRPPSGLWGGLYCFPQ------------------FADEESLRQWLAQRQIAADNLTQ 286 LL +RP SG+WGGL+C PQ D ++RQ L + L Sbjct: 243 LLEKRPDSGIWGGLWCLPQATPDSGELSILCNEPPSPLEDTPAIRQSLQEAWSRRLRLDL 302 Query: 287 LTAF-----RHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 RH F+HF L V WY A GL PV R+L Sbjct: 303 RLTALGEEQRHVFTHFQLRFRCAHARVLGGAVADSGATLRWYRPADALKQGLPTPVRRIL 362 Query: 342 QQLRTG 347 + L G Sbjct: 363 RNLGGG 368 >UniRef50_UPI000196B020 hypothetical protein CATMIT_00928 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B020 Length = 333 Score = 330 bits (847), Expect = 3e-89, Method: Composition-based stats. Identities = 105/343 (30%), Positives = 171/343 (49%), Gaps = 16/343 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++++WY + R LP++ PYK+W+SE+MLQQT V VIPY+ RF+ RFP V L Sbjct: 3 IQEELINWYKENKR-VLPFRDIDDPYKIWVSEIMLQQTTVTAVIPYYNRFIERFPDVKTL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ++A ++EV W GLGYY RA +LHK+AQ + FP+ +E++ +L GVG TA AI Sbjct: 62 SDASIEEVYKYWEGLGYYRRALHLHKSAQMIENH----FPKDYEDILSLVGVGPYTASAI 117 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S + K + +DGN RVL+R YA+ + K+ + ++ + NQ +M Sbjct: 118 MSFAYHKPYIAVDGNALRVLSRLYAIEDNIASNKTVKKITEIGNKLVVGYDSAQINQGIM 177 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D IC P C CPLQ+ C A + N + P K + ++ +EDEV+ Sbjct: 178 DFANAICLPVHPHCEECPLQSHCQAYSTNKQEVLPINIKKVNKKSISYITGIVVYEDEVM 237 Query: 246 LAQRPPSGLWGGLYCFPQFADEES---LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L + +GL +Y PQ+ E + + I + L+ L F+H FSH + Sbjct: 238 LIKN-KNGLLENMYLLPQYEVESPYSFIEAFKENYGIELEALSHLKDFKHVFSHRTWYMH 296 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + +Y + + + +++L+ + Sbjct: 297 VYIFKANKHSIN-------FYTNKEQEEITIPTAHKKILKYYK 332 >UniRef50_A7HQZ1 A/G-specific adenine glycosylase n=45 Tax=Alphaproteobacteria RepID=A7HQZ1_PARL1 Length = 615 Score = 330 bits (847), Expect = 4e-89, Method: Composition-based stats. Identities = 135/358 (37%), Positives = 188/358 (52%), Gaps = 29/358 (8%) Query: 7 SAQVLDWYDKYGRKTLPWQIDK----TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +A +L WYDK+ R LPW+ K PY VWLSE+MLQQT VATV PYF F+ R+P V Sbjct: 248 AAPLLAWYDKHAR-VLPWRARKGERADPYAVWLSEIMLQQTTVATVGPYFTGFLKRWPNV 306 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA AP +EV+ W GLGYY+RARNLH A++V++ +GGKFP+T E + +LPG+G TA Sbjct: 307 EALAAAPQEEVMKAWAGLGYYSRARNLHACAKEVSSEYGGKFPDTVEGLESLPGIGPYTA 366 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI +++ G+ ++DGNV+RV+AR + + + + + +TP F Q Sbjct: 367 AAIAAIAFGRAATVVDGNVERVVARLFEIETPLPAAKP--DIREKARTLTPEQRAGDFAQ 424 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMMDLGA ICT P C+ CP+ + C A A + L P + PK+ P R G L E Sbjct: 425 AMMDLGATICTPRSPACNRCPINDLCDARAAGTQNLLPARAPKKARPTRRGACFWLVREG 484 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWL----------AQRQIAADNLTQLTA--- 289 V L +RP GL GG+ P +ES R +R +A + L Sbjct: 485 HVWLRRRPDKGLLGGMLEVPGTPWDESDRHRTVIELSHDENGGRRGVAGEVLDHAPMEAE 544 Query: 290 -------FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 HTF+HFHL++ EG +W L + L + ++ Sbjct: 545 WRLVPGLVEHTFTHFHLELEVFTATTRKKIVPGREG--MWVPLEEVAGEALPTVMRKV 600 >UniRef50_C4V5S7 Adenine glycosylase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V5S7_9FIRM Length = 369 Score = 329 bits (845), Expect = 7e-89, Method: Composition-based stats. Identities = 114/344 (33%), Positives = 161/344 (46%), Gaps = 16/344 (4%) Query: 8 AQVLDWYDKYGR-KTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 +L W+ + LPW+ + TPY VW+SE+MLQQT+ A V Y+ RF+ P++ DLA Sbjct: 17 RALLTWHKSAPDTRDLPWRDEPTPYHVWISEIMLQQTRAAVVRAYYLRFLTALPSIHDLA 76 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 D ++ LW GLGYY+RARNL +AAQ + HGG P F + LPG+GR TA AI Sbjct: 77 AVNDDALMKLWQGLGYYSRARNLKRAAQVIVKEHGGDLPNDFNALLTLPGIGRYTASAIA 136 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQAMM 185 S + G+ P +DGN RV AR A GK + L + P N+A M Sbjct: 137 SFAYGQPHPAVDGNFLRVAARITANPIDIGKDSTKRSLEAALSVSYPEGRDAGLLNEAFM 196 Query: 186 DLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 DLGA IC P C CP C+A + YP K + + L+L D + Sbjct: 197 DLGATICLPHGAPLCHSCPAAQLCLAHDRGTEQDYPVKSAAKARRKEKHTVLILSCGDRI 256 Query: 245 LLAQRPPSGLWGGLYCFPQFADEES---LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 + +RP GL GL+ +P ++S +R L ++ L RH FSH ++ Sbjct: 257 AIRKRPTKGLLAGLWEYPHLDGKQSKRAVRAHLEDEGFRILSIDPLPPARHIFSHIEWEL 316 Query: 302 VPMWLPVSSF---------TGCMDEGNALWYNLAQPPS-VGLAA 335 + V+ T + LW A+ + A Sbjct: 317 TGWAVTVAEKNEPPLMAAETASDAPSSLLWVRRAELAETYSIPA 360 >UniRef50_C6XRC5 HhH-GPD family protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XRC5_HIRBI Length = 361 Score = 329 bits (845), Expect = 7e-89, Method: Composition-based stats. Identities = 122/348 (35%), Positives = 176/348 (50%), Gaps = 20/348 (5%) Query: 7 SAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 + ++L WYD+ R+ LPW++ PY+VWLSE+MLQQT V PYF++F Sbjct: 17 AHELLQWYDQMARR-LPWRLSPQARKNGDMPDPYRVWLSEIMLQQTTVPHAAPYFDKFTR 75 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 +P V DLA A D+V+ W GLGYYARARNLHK A+ +A L FP T E + ALPGV Sbjct: 76 LWPKVEDLAAAHNDDVMREWAGLGYYARARNLHKCAKVLAGL--DAFPNTIEALLALPGV 133 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK---EVENKLWSLSEQVTPA 174 G TA A+ S++ +DGN++RV++R A++G + + ++ + + P Sbjct: 134 GPYTAAAVGSIAFDLPVAPVDGNIERVISRLMAIAGDGSAAGWAQDKKEITQRVQTLVPQ 193 Query: 175 VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY 234 F QAMMDLGA +CT P C +CP + C+A + YP K ++ P R G Sbjct: 194 -RSGDFAQAMMDLGASVCTPKSPNCMICPWMDICLARQEGNQESYPAKPKRKPQPIRNGL 252 Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 ++ HE +VLL +RP SGL GG+ EE L Q + RH F Sbjct: 253 AFVIFHEGKVLLQRRPDSGLLGGMLMPLSSPWEEGGEYTLEDYQPVKADWHYRGQARHVF 312 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 +HF L ++ + W ++ GL + L+ Sbjct: 313 THFALHWDVYVAHLTKKPDLSGD----WIDIDMCLEAGLPTVGRKALK 356 >UniRef50_B1Y3P3 A/G-specific adenine glycosylase n=8 Tax=Burkholderiales RepID=B1Y3P3_LEPCP Length = 384 Score = 329 bits (843), Expect = 1e-88, Method: Composition-based stats. Identities = 143/373 (38%), Positives = 195/373 (52%), Gaps = 33/373 (8%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +A ++ W ++GR LPWQ + PY+VWLSEVMLQQTQVATV+ Y+ERF+ RFP Sbjct: 8 ELPSIAADLVRWQRQHGRHGLPWQASRDPYRVWLSEVMLQQTQVATVLGYYERFLQRFPD 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA APLD+VL LW+GLGYY+RARNLH+ AQ V HG P E++ LPG+G ST Sbjct: 68 IAALAAAPLDDVLALWSGLGYYSRARNLHRCAQVVVAEHGAALPRRAEQLVELPGIGPST 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE--R 179 A AI S G+ I DGNVKRVLAR A G + LW+ ++++ P + Sbjct: 128 AAAIASFCHGERVSIFDGNVKRVLARLLAFEGDLAQAGAAKVLWAQADRLVPTDAADMPA 187 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 + Q +MDLGA +CT P+C CPLQ C A A S YP K K R + L L Sbjct: 188 YTQGLMDLGATVCTPRDPQCPACPLQRACRAHAAGSVLTYPIKSRKLKRSRRENWCLWLS 247 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD------NLTQLTAFRHT 293 D V L QRP +G+W GL+ +P F D +++ + AQ A +L F H Sbjct: 248 LGDAVWLQQRPDTGIWAGLWAWPLFDDTDAVERLTAQLLAALGGHAERLQANRLPGFVHV 307 Query: 294 FSHFHLDIVPMWLP-----------------------VSSFTGCMDE--GNALWYNLAQP 328 +H + P L + +++ G W+ A Sbjct: 308 LTHLDWTLHPCRLHLPAPVVLQAREPGGGDGERGDGPLERLDAVLNQQLGAGRWHAAADL 367 Query: 329 PSVGLAAPVERLL 341 +GL AP+ ++L Sbjct: 368 HRLGLPAPLRKVL 380 >UniRef50_D1U9Y5 A/G-specific adenine glycosylase n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U9Y5_9DELT Length = 369 Score = 328 bits (842), Expect = 1e-88, Method: Composition-based stats. Identities = 115/353 (32%), Positives = 182/353 (51%), Gaps = 13/353 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + F++ +LDWYD GR LPW+ + PY+VW+SE+M QQTQ+ V+ YF+R+MAR+P Sbjct: 1 MDGNLFTSHLLDWYDANGR-DLPWRREPNPYRVWISEIMAQQTQLDRVVGYFDRWMARYP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA A ++VL LW GLGYY+RARN+ K+A +A HG FP + ALPGVG Sbjct: 60 DLQSLALAREEDVLKLWEGLGYYSRARNILKSASVLAHAHGCVFPSDPIAIRALPGVGAY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ S++ G P +D NV RV AR + + V + + P F Sbjct: 120 TAGAVASIAFGLCEPAVDANVLRVFARLLDLDAPVAETGVRQTVERTVRALIPEDRPGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LGA++C +P+C CP++ C+A A P P + +L Sbjct: 180 NQALMELGALVC-AKRPRCGECPVRAHCLAHARGVADSRPVLPPSRKAIRIEMATGVLVR 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQ--FADEESLRQWLAQ-----RQIAADNLTQLTAFRHT 293 VL+ +R P +W GL+ FP E+ RQ LA+ ++ + + ++T ++ Sbjct: 239 HGRVLIQKRRPGDVWPGLWEFPGGCVEPGETPRQALAREFREEVELVVEPVEKITVVAYS 298 Query: 294 FSHFHLDIVPMWLPVSSFTGCM----DEGNALWYNLAQPPSVGLAAPVERLLQ 342 ++ + + + ++ + + + AQ A RL++ Sbjct: 299 YTRYRVTMHCYRCRLTGDAEPVPVFNEAAEGGFVLPAQLAEYAFPAGHRRLIE 351 >UniRef50_C9LTK5 A/G-specific adenine glycosylase n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTK5_9FIRM Length = 404 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 115/311 (36%), Positives = 165/311 (53%), Gaps = 6/311 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + +L W+ R LPW+ + TPY+VW+SE+MLQQT+V V PYFERF+A P V Sbjct: 17 DEIAEPLLRWFHSEKR-ALPWREEPTPYRVWVSEIMLQQTRVEAVKPYFERFVAALPDVR 75 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A + ++ LW GLGYY+RAR+L AA+ + + HGG+ P F+ + ALPG+GR TAG Sbjct: 76 ALARADENTLMKLWEGLGYYSRARHLQSAARLICSDHGGEIPAHFDGLLALPGIGRYTAG 135 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ S++ G+ P +DGNV RV+ R A K+ + + P FNQA Sbjct: 136 AVASIAFGERRPAVDGNVLRVIMRLLACPADILKESTKRAVEEALIARLP-EDAGNFNQA 194 Query: 184 MMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M+LGA+IC C CPL+ C+A N A P K P + LL D Sbjct: 195 LMELGALICLPRGAAHCPSCPLERLCLAKEANLQAELPQKTPPKRRRTEKLTIFLLAKND 254 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 ++ L +RP GL GL+ FP + + L + L L A +H FSH Sbjct: 255 KIALEKRPAQGLLAGLWGFPAMEGHLKKKEAEEALQAIGLIPAKLHALPAAQHIFSHITW 314 Query: 300 DIVPMWLPVSS 310 +V + ++ Sbjct: 315 QMVGWRVELAE 325 >UniRef50_UPI0001BC63FA A/G-specific adenine glycosylase n=3 Tax=Fusobacterium RepID=UPI0001BC63FA Length = 365 Score = 328 bits (840), Expect = 2e-88, Method: Composition-based stats. Identities = 121/352 (34%), Positives = 182/352 (51%), Gaps = 7/352 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + + ++L++YDK+ R L W+ + Y W+SE+MLQQT+V V PYF RF+ P Sbjct: 10 EMGKIAKKLLEYYDKHKR-DLAWRGEVPAYYTWISEIMLQQTRVEAVKPYFARFIEELPN 68 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LAN ++++ LW GLGYY+RARNL KAA Q+ +GG+ P+ +E+ L G+G T Sbjct: 69 IESLANCEEEKLMKLWQGLGYYSRARNLKKAACQIVEFYGGELPKEKKELLHLAGIGPYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ GK +DGNV RV++R +AV G + + K+ L+ Q P FN Sbjct: 129 AGAISSIAYGKKETAVDGNVIRVMSRLFAVDGNVLEGKGRQKIEELTYQELPEDRAGDFN 188 Query: 182 QAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA IC C LCPL C A YP KK K+ LLL Sbjct: 189 QALMDLGATICIPNGAALCHLCPLHLECQANLKKEVEKYPEKKKKKERKLERQTILLLSD 248 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + L +R GL GL+ FP + +R++L ++ I+ + + H FSH Sbjct: 249 GQKFALEKRKEKGLLAGLWQFPMLEGRLSLQEVREYLKEKGISYSGIEEYEPAIHIFSHV 308 Query: 298 HLDIVPMWLPVSSFTGCM-DEGNALWYNLAQ-PPSVGLAAPVERLLQQLRTG 347 +V + V + E N +W + + + + + L L+ G Sbjct: 309 EWHMVSYIIEVEKWEIQEKREENFVWLSKEEILTEYSVPSAFKVYLDYLKQG 360 >UniRef50_Q0P4V1 Putative uncharacterized protein MGC145569 n=1 Tax=Xenopus (Silurana) tropicalis RepID=Q0P4V1_XENTR Length = 520 Score = 328 bits (840), Expect = 3e-88, Method: Composition-based stats. Identities = 131/423 (30%), Positives = 192/423 (45%), Gaps = 83/423 (19%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI--------DKTPYKVWLSEVMLQQTQVATVIPYFE 53 + ++L WYDK R LPW+ D+ Y VW+SEVMLQQTQVATVI Y+ Sbjct: 47 ETEIIRDKLLAWYDKSKR-DLPWRTMACTEPDLDRKAYAVWVSEVMLQQTQVATVIDYYN 105 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 ++M +PT+ DLA + L+EV +W+GLGYY+R R L + A++V GG P + +E+ Sbjct: 106 KWMKVWPTMEDLARSSLEEVNEMWSGLGYYSRGRRLQEGAKKVVLELGGSMPRSADELQK 165 Query: 114 L-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 L PGVGR TAGAI S+S G+ ++DGNV RVL+R + V +KLW+L+ + Sbjct: 166 LLPGVGRYTAGAIASISYGQVTGVVDGNVIRVLSRLRCIGADSSTLAVSDKLWNLANALV 225 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAA--------------------- 211 FNQ MM+LGA +CT KP C+ CPLQ C A Sbjct: 226 DPDRPGDFNQGMMELGATVCTPKKPLCTACPLQGQCKAYLKVIAEKESAVKTLIKKQASP 285 Query: 212 -----------------------------ANNSWALYPGKKPKQTLP-ERTGYFLLLQHE 241 ++ A +P K K+ E+T + + Sbjct: 286 IAKDVGDIEDCDLGPGLCALCVPTSDPWDSSLGVANFPRKSAKKPSRMEQTAICVWEKCG 345 Query: 242 D----EVLLAQRPPSGLWGGLYCFPQFADEE------------SLRQWLAQRQIAADNLT 285 D E L+ QRP SGL GL+ FP +E L Q L+ + L Sbjct: 346 DHGELEYLIVQRPSSGLLAGLWEFPSILLDEKFTEQNRQHSLLGLLQDLSGHAVPLQKLQ 405 Query: 286 QLTAFRHTFSHFHLDIVPMWLPVSSFTGC------MDEGNALWYNLAQPPSVGLAAPVER 339 H FSH H V +L +++ C + W + + + +++ Sbjct: 406 YKGEVVHIFSHIHQTYVVYFLSLNTTENCSVKTEETERPLTRWVTKKEFLNSAVPTAMKK 465 Query: 340 LLQ 342 +++ Sbjct: 466 IMK 468 >UniRef50_C7PL91 A/G-specific adenine glycosylase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PL91_CHIPD Length = 354 Score = 327 bits (839), Expect = 4e-88, Method: Composition-based stats. Identities = 109/347 (31%), Positives = 175/347 (50%), Gaps = 12/347 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L+W D +++PW+ +K PY++WLSE++LQQT+V Y+E+F+ +PTV +L Sbjct: 5 FTNALLEWNDNENTRSMPWKGEKDPYRIWLSEIILQQTRVEQGWAYYEKFILNYPTVQEL 64 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP + V LW GLGYYAR +N+ AA+Q+A+ + G FP T+E + +L GVG T+ AI Sbjct: 65 AAAPEEAVFRLWQGLGYYARCKNMLAAAKQIASQYHGHFPNTYETIQSLKGVGPYTSAAI 124 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + +LDGNV RVL+R + + + + L++++ P +NQ++M Sbjct: 125 ASFAFNLPHAVLDGNVFRVLSRFFDIDTPIDTTAGKKQFTDLAQELLPHGKSASYNQSIM 184 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA++C +P C CPL C AL P K K + +R Y+L+LQH++ V Sbjct: 185 DFGAVVCKPQQPACKSCPLAAKCKGYQQGLTALLPVKSKKLVIKKRYFYYLVLQHKENVY 244 Query: 246 LAQRPPSGLWGGLYCF-----PQFADEESLRQWLAQRQIAADNLTQL----TAFRHTFSH 296 + +R + +W L+ F P D SL A + + D + F+ +H Sbjct: 245 IRKRTENDIWQNLHEFILIETPGPEDPGSLLSSAAFKAVMKDIRYNMDGASATFKQQLTH 304 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + L VS Q + L+ Sbjct: 305 QTIHSQFLLLSVSKKPEI---PGYTAVPRDQLDLYAFPKTITDFLRN 348 >UniRef50_D2R2B9 HhH-GPD family protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2B9_9PLAN Length = 398 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 128/363 (35%), Positives = 186/363 (51%), Gaps = 23/363 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F ++L W+ ++ R LPW+ + Y++W+SEVMLQQTQVATVI YF+RF+ FPTV Sbjct: 27 ARFQQRMLAWFAEHAR-DLPWRRSRDLYRIWISEVMLQQTQVATVIDYFQRFLVEFPTVV 85 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VL W GLGYY RAR+LH AA+++ G+FPET +EV +LPG+GR TAG Sbjct: 86 HLAEADEQQVLRQWEGLGYYRRARSLHAAAKKIVHDFRGQFPETLDEVMSLPGIGRYTAG 145 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS+ L PIL+ N RV AR + + LW+++EQ+ P V FNQA Sbjct: 146 AILSIGLDAKLPILEANTIRVYARLAGYTREATSTAGQKFLWNIAEQILPDKKVGFFNQA 205 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 MM+LG+ +CT P C CP C+A + P + T +L+ + Sbjct: 206 MMELGSALCTPRTPSCDQCPASTWCVARREQAVEQIPLLSKRMVYEYLTEVAVLVVDRQQ 265 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEES-----------LRQWLAQR-----QIAADNLTQL 287 VLL Q P W GL+ FP+F+ E++ L +L +R IA+ Q Sbjct: 266 VLLRQCQPGERWAGLWDFPRFSLEKTMLGAKSTTETDLENYLIERVESLTGIASMIHDQW 325 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNA------LWYNLAQPPSVGLAAPVERLL 341 + +H + + + + PM + W + L+ +L Sbjct: 326 PSLKHGVTKYRITLRPMVASLREAGSKSSRAKQKLAEHCKWTRIDALDDEPLSVTGRKLA 385 Query: 342 QQL 344 Q L Sbjct: 386 QLL 388 >UniRef50_C9KJM4 A/G-specific adenine glycosylase n=3 Tax=Clostridiales RepID=C9KJM4_9FIRM Length = 370 Score = 325 bits (834), Expect = 1e-87, Method: Composition-based stats. Identities = 114/348 (32%), Positives = 161/348 (46%), Gaps = 9/348 (2%) Query: 4 SQFSAQVLDWYD-KYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 S +L WY+ + + LPW+ + TPY WLSE+MLQQT+ + VIPY+ RF+ P + Sbjct: 19 SSIVGPLLTWYEGQSTLRCLPWREEVTPYHTWLSEIMLQQTRASAVIPYYTRFLEALPDI 78 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA + ++ LW GLGYY+RARNL KAA + +GGK P F + LPG+GR TA Sbjct: 79 ESLARCDDELLMKLWQGLGYYSRARNLKKAAGVICQEYGGKLPADFAALLELPGIGRYTA 138 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFN 181 AI S++ G P +DGN+ RV R + V + P N Sbjct: 139 SAIGSIAFGLPLPAVDGNILRVTMRVLQCGEDIAQPAVRRAVEEALAPYYPSGQAAGALN 198 Query: 182 QAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA MDLGA IC P C+ CPL C+A S P K K LLL Sbjct: 199 QAFMDLGATICLPHGAPHCAACPLARLCLAHDAGSEQQLPVKSKKAKRRVEQKTVLLLHQ 258 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + L +RP GL GL+ P + + WLA R + + +L RH FSH Sbjct: 259 GRALALTKRPNKGLLAGLWELPNLPGHLTQAEVEAWLAARGLRTLTIARLPDARHIFSHV 318 Query: 298 HLDIVPMWLPVSSFTGCMDE--GNALWYNLAQPPS-VGLAAPVERLLQ 342 + + ++S + E +W + + + + + L Sbjct: 319 EWQMQGWAVELASPDAMVAEDASPLVWASPEELADTYSIPSAFQYFLH 366 >UniRef50_C8WEE4 A/G-specific adenine glycosylase n=3 Tax=Zymomonas mobilis RepID=C8WEE4_ZYMMN Length = 373 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 123/380 (32%), Positives = 201/380 (52%), Gaps = 44/380 (11%) Query: 1 MQAS----QFSAQVLDWYDKYGRKTLPWQIDK-----TPYKVWLSEVMLQQTQVATVIPY 51 MQ S + + +L+WY ++ R LPW+ + PY+VWLSE+MLQQT A PY Sbjct: 1 MQKSAVYREITTDLLNWYQRHAR-ILPWRTEAQQNKVDPYRVWLSEIMLQQTTTAHAAPY 59 Query: 52 FERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV 111 + +F+ R+PTV LA A +V+ W GLGYY+RARNL K A++V GGKFP+ + + Sbjct: 60 YLKFVERWPTVEALAAAQEADVMAEWAGLGYYSRARNLIKCAKEVVAS-GGKFPDNEQGL 118 Query: 112 AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV 171 ALPG+GR TA AI++++ GK ++D NV+RV++R +A+ + ++++ Sbjct: 119 LALPGIGRYTAAAIVAIAFGKRAVVVDANVERVVSRLFAIETPLPASRP--IIAEETDKL 176 Query: 172 TPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER 231 TP + F QAMMD+GA IC +P C++CP+ C + A++P K PK+ P R Sbjct: 177 TPDLAAGDFAQAMMDIGATICVNRQPTCAICPMMPHCEGQKTGNPAVFPIKAPKKIRPTR 236 Query: 232 TGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFA----------------DEESLRQWL- 274 GY ++ +D++LL +RP GL GG+ P +++++++ Sbjct: 237 IGYAFIVTSKDQILLIRRPDKGLLGGMRALPSSEWLDQSKITTTASDAETEQQAIKEAFP 296 Query: 275 -----AQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPP 329 AQ+ + + H F+HF L++ ++S D ++ + + Sbjct: 297 SFPAQAQKNLRTLAWEKQGHIEHIFTHFALELRIFSAEMTS-----DMIEGEYWPIDEID 351 Query: 330 SVGLAAP----VERLLQQLR 345 GL V++ L+Q R Sbjct: 352 LAGLPTVFKKAVKKYLKQTR 371 >UniRef50_Q6AKD3 Related to A/G-specific adenine glycosylase n=1 Tax=Desulfotalea psychrophila RepID=Q6AKD3_DESPS Length = 366 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 104/349 (29%), Positives = 170/349 (48%), Gaps = 10/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +F +Q+L W+ R LPW+ PY VW+SE+MLQQTQ+ + YF R++ RFP Sbjct: 9 DLDRFQSQLLTWFRLQDR-FLPWRQTYDPYHVWISEIMLQQTQMDRGVSYFNRWIERFPQ 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V +A A E+ +W GLGYYARARNLH+AA+++ G+ P + + +LPG+G T Sbjct: 68 VEAVAEADEQEIFKMWEGLGYYARARNLHRAAKKIVEEFAGELPCDIDLLRSLPGIGPYT 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S++ P +D NV R+ +R + + + +V + ++ P+ +N Sbjct: 128 AAAIGSVACNIDIPTIDANVARIFSRLFDIDRPVRETQVARAIEKVACDCLPSGRARHWN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG ++C P+C+LCP+Q C+A A + L P PKQ L + E Sbjct: 188 QALMDLGGLVCLPRAPRCTLCPIQEMCLAFARQTVQLRPVPLPKQKLIHIRRVAIFQVSE 247 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEES------LRQWLAQRQIAADNLTQLTAFRHTFS 295 +L+ LW GL+ FP +E + + L L + +H ++ Sbjct: 248 GRLLIQPSRHPTLWQGLWEFPHTDLQEEEAVEVGITRLLGLLPSENRVLEPMVTVKHAYT 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + L + ++ W G +A ++L+ + Sbjct: 308 KYRVQFHCYLLRGEAGCQGEEQRLCDW---QALARHGFSAGPRKVLEHI 353 >UniRef50_C0BLH9 A/G-specific adenine glycosylase n=2 Tax=Flavobacteria RepID=C0BLH9_9BACT Length = 350 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 112/317 (35%), Positives = 158/317 (49%), Gaps = 13/317 (4%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 FS +++ WY R +LPW+ K PY +WLSE++LQQT+VA PY+ +F FPTV D Sbjct: 2 SFSQEIIHWYQANKR-SLPWRETKDPYHIWLSEIILQQTRVAQGTPYYVKFTDAFPTVFD 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP ++VL LW GLGYY+RARNLH AAQ V G FP ++ + L GVG TA A Sbjct: 61 LAAAPEEQVLKLWQGLGYYSRARNLHAAAQYVVNELNGVFPANYKALLQLKGVGDYTASA 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S+ +LDGNV RVL+R + + E +++ +NQA+ Sbjct: 121 IASICFNSPEAVLDGNVYRVLSRYFGIDVPVNTTEGARLFKAIALDQLDLANPSDYNQAI 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE-DE 243 M+ GA+ CT C CPLQ+ C+A + P K PKQ + Y+ + Sbjct: 181 MEFGALQCTPKLTDCPSCPLQSTCVAYRTQKVTVLPVKLPKQKPVKVYHYYFVFVDPFGN 240 Query: 244 VLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIAADNLTQLTA---------FRH 292 L +R G+W GLY FP F+ ++ L + A D A H Sbjct: 241 TLYEKRTGKGIWEGLYQFPLATFSAPRTVEGLLTSMETAIDFSLYENAKHFVYHDTYIVH 300 Query: 293 TFSHFHLDIVPMWLPVS 309 +H H+ + + + Sbjct: 301 KLTHRHIYATFIVIETA 317 >UniRef50_Q2KBH8 A/G-specific adenine glycosylase protein n=11 Tax=Alphaproteobacteria RepID=Q2KBH8_RHIEC Length = 367 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 130/347 (37%), Positives = 185/347 (53%), Gaps = 15/347 (4%) Query: 7 SAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 + +LDWYD++ R LPW+I PY+VWLSEVMLQQT V V PYFE+F+ Sbjct: 11 AKPLLDWYDRHHR-DLPWRISPGMAARGVKPDPYRVWLSEVMLQQTTVQAVKPYFEKFLE 69 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 R+P VTDLA A D V+ W GLGYYARARNL K A+ VA HGG FP+T E + +LPG+ Sbjct: 70 RWPEVTDLAAAESDAVMATWAGLGYYARARNLKKCAEAVAKEHGGVFPDTEEGLKSLPGI 129 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G TA A+ +++ + ++DGNV+RV++R YA+ + + + +TP+ Sbjct: 130 GDYTAAAVAAIAFNRQAAVMDGNVERVISRLYAIEAPLPAAKP--AMKAKVALLTPSGRP 187 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 F QAMMDLGA ICT +P CSLCP +N C A + L+P K K+ P R G + Sbjct: 188 GDFAQAMMDLGATICTPKRPACSLCPFRNACQALKLSDPELFPVKAAKKEKPVRHGAAFI 247 Query: 238 LQH-EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD-NLTQLTAFRHTFS 295 + E+LL +R SGL GG+ P A L + + H F+ Sbjct: 248 AVSGDGEILLRRRAESGLLGGMTEVPTTAWTARLDGETSSAAAPFEAAWQACGTVIHVFT 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNL-AQPPSVGLAAPVERLL 341 HF L + + +++ D N W+ + L +++ + Sbjct: 308 HFELRLSIWRVAIAAKPKSRDGVNDEWWEPVTNLEAQALPTIMKKAI 354 >UniRef50_Q02CA8 A/G-specific DNA-adenine glycosylase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02CA8_SOLUE Length = 337 Score = 324 bits (830), Expect = 3e-87, Method: Composition-based stats. Identities = 112/336 (33%), Positives = 169/336 (50%), Gaps = 23/336 (6%) Query: 12 DWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD 71 WY + LPW+ + PY+VW+SE+MLQQT+ VIPY+E+F+ARFP+V LA AP + Sbjct: 12 AWYRA-AHRDLPWRSSRDPYRVWISEIMLQQTRAQAVIPYYEKFLARFPSVESLAAAPEE 70 Query: 72 EVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLG 131 EVL LW+GLGYY+RARNL AA+Q+ G FP ++ + ALPG+G TA AI S++ G Sbjct: 71 EVLALWSGLGYYSRARNLRNAARQITAAAG--FPREYDAIRALPGIGDYTAAAIASIAFG 128 Query: 132 KHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV----GVERFNQAMMDL 187 +LDGNV RV+AR + G + ++++Q A FNQA+M+L Sbjct: 129 LPHAVLDGNVLRVVARVENDAADIGSSRTRERFRAIAQQWLEAKEVKHDPGHFNQALMEL 188 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA +C +P C +CP+ C A + A P K KQ +G L+++ +LL Sbjct: 189 GATVCVPRRPLCLVCPISACCRARQEGTAAQLPVKLRKQEPVRISGVLLVVRQRGRILLR 248 Query: 248 QRPP-SGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWL 306 +R G + P + ++ ++ RHT +H Sbjct: 249 RREDSQSRMAGFWELP-----------TPEELPGSEVGERIGEIRHTITHHRYTF----E 293 Query: 307 PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 ++ G W+ +AQ + + + LQ Sbjct: 294 VRAATAAVPATGGFAWFTIAQLAEIPFSTTARKALQ 329 >UniRef50_Q7MUV2 A/G-specific adenine glycosylase n=2 Tax=Porphyromonas gingivalis RepID=Q7MUV2_PORGI Length = 407 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 108/349 (30%), Positives = 167/349 (47%), Gaps = 24/349 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + +WYD R LPW+ PY++W+SEV+LQQT+V Y+ RF+ FP V Sbjct: 50 PELRKLLAEWYDANKR-DLPWRQTDDPYRIWISEVILQQTRVEQGRDYYHRFIECFPDVH 108 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L+ A DEVL W GLGYY+RARNLH+AA+ + + GG P T +E+ LPG+G TA Sbjct: 109 SLSLASEDEVLKQWEGLGYYSRARNLHRAARMIVSDFGGCIPRTRQEILRLPGIGDYTAA 168 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+LS + F +DGN+ RV++R + + ++ + R NQA Sbjct: 169 AVLSFAYDLPFAAVDGNIFRVISRLMNLDTPIDTPAGKKLFSFWADALLDREAPARHNQA 228 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH--- 240 +M+ GA+ CT + P C LCP++ C+A P KK + R YF+ ++ Sbjct: 229 IMEFGALHCTPTSPSCLLCPVRRFCMADTAGCVDALPVKKGGLRITNRYLYFIYIRVITP 288 Query: 241 -EDEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLAQRQIA------ADNLTQLT--A 289 + +RP +W GLY FP + +D L L ++ + ++ L Sbjct: 289 TGVYTYIRRRPSGDIWQGLYEFPCVELSDHAVLETLLLSPELGNLLRSISGSMDSLPFKT 348 Query: 290 FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNAL----WYNLAQPPSVGLA 334 F+H +H +L W+ + T +D+ L Q Sbjct: 349 FKHQLTHRNL-----WIHGYTLTARLDKAPDLDGYRCIREEQLDDFAFP 392 >UniRef50_B6BVY5 A/G-specific adenine glycosylase n=1 Tax=beta proteobacterium KB13 RepID=B6BVY5_9PROT Length = 294 Score = 323 bits (828), Expect = 6e-87, Method: Composition-based stats. Identities = 122/308 (39%), Positives = 179/308 (58%), Gaps = 14/308 (4%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQTQV+TVIP++ +F+ RF + LA A DEV+ W+GLG+Y+RARNLHK A+ +A Sbjct: 1 MLQQTQVSTVIPFYIKFIDRFQNIKHLAEASEDEVMSYWSGLGFYSRARNLHKTARIIAE 60 Query: 99 LHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK 158 + KFP+TFE + LPG+GRSTAGAILS K F ILDGNVKRVL R + + Sbjct: 61 QYSCKFPDTFESLIQLPGIGRSTAGAILSFCFKKKFAILDGNVKRVLTRFFGIQESISLA 120 Query: 159 EVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 + E LW LSEQ+ P ++ + Q +MD GA +CT P+C CP+ CIA NN Sbjct: 121 KTEKDLWDLSEQLLPDGDIDIYTQGIMDFGATLCTPKNPQCHSCPMNQTCIAKQNNLTEA 180 Query: 219 YPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQ 278 P K +T +R+ F + + ++LL + +G+W GL+ PQ + +L ++ Sbjct: 181 IPVKNKTKTKEDRSTEFYIYECNKQILLVKN-RTGVWSGLWIPPQ-------KNYLGKKI 232 Query: 279 IAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVE 338 NL + + FSH+ L + +++ + M +G LW++ + P +GL AP++ Sbjct: 233 ----NLIKQGERQCVFSHYRLKYKYFLIKITNKSDLMVDG--LWFDWKEIPELGLPAPIK 286 Query: 339 RLLQQLRT 346 LL L Sbjct: 287 SLLISLTN 294 >UniRef50_B7CBH3 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CBH3_9FIRM Length = 328 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 100/308 (32%), Positives = 162/308 (52%), Gaps = 17/308 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F ++DWY K R L +++ K PY++W+SE+M QQT++ ++PYF+R++ + P + Sbjct: 2 KEFQKDLIDWYIKNHR-PLEFRLKKDPYEIWISEIMAQQTRIEAMLPYFKRWIQQLPDIE 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A +++ LW GLGYY+R +N+ K A + + GK P T EE+ LPG+G TAG Sbjct: 61 SVAKCDDEKLNKLWQGLGYYSRCKNIKKCAIECVEKYSGKLPCTKEELLKLPGIGPYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFNQ 182 AI S++ G+ +DGNV RV +R Y + K V+ ++ L ++ P + +NQ Sbjct: 121 AIASIANGQRVSAVDGNVIRVFSRLYNIFEDVTKTSVKKQIEELVDESLPSKEEISYYNQ 180 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA+IC P+C LCP++ C A P K K+ + +L H+D Sbjct: 181 AIMELGALICIPKNPRCELCPVKKYCDA---KDPGSLPYKAKKKMRKIEYKHLYILVHKD 237 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +V + +R +GL GLY F + E + +L + H FSH + Sbjct: 238 KVHVVKRADTGLLAGLYGFDETKPEHIIES------------IELEPYIHIFSHVEWHMD 285 Query: 303 PMWLPVSS 310 V Sbjct: 286 ATLCIVDR 293 >UniRef50_B4D777 HhH-GPD family protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D777_9BACT Length = 341 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 118/341 (34%), Positives = 174/341 (51%), Gaps = 22/341 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++ W+ ++GR LPW+ PY + +SE MLQQTQV TV Y+ R++ RFP Sbjct: 18 KSLRQKLARWFRQHGR-DLPWRRTHDPYAIMVSEFMLQQTQVVTVRDYYARWLERFPDFN 76 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VLH+W GLGYYARARNLH+AA+QVA LH G+ P ++ALPGVGR TAG Sbjct: 77 ALAAASEADVLHVWQGLGYYARARNLHRAAKQVADLHSGQLPNDLVAISALPGVGRYTAG 136 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ + + + PI+D N+ RV+AR + K LW +E++ PA N A Sbjct: 137 AVATFAFDQATPIIDANIARVIARLLDLQEPIDTKRGSEILWLTAEELLPAKSGRVHNSA 196 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++CT P+C +CP++ C S P KKP+ + ++ Sbjct: 197 LMELGALLCTPRAPQCPICPIREHC---RTKSPESLPRKKPRPKTIALAENCAWIVNDGN 253 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +LL Q+ S W GL+ P + + L AF + F++ + + Sbjct: 254 LLLEQQTGS-RWRGLWKLPVIGESHPRNKL-------------LLAFDYPFTNHRVTLSV 299 Query: 304 MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 +S N W L S+ LAAP R +++L Sbjct: 300 Y----ASRVPMESRPNQQWVPLTSIDSIALAAPHRRAIKRL 336 >UniRef50_C9PXZ8 A/G-specific adenine glycosylase n=2 Tax=Prevotella RepID=C9PXZ8_9BACT Length = 403 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 123/390 (31%), Positives = 177/390 (45%), Gaps = 66/390 (16%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +L W+ +GR +LPW+ K PY +WLSEV+LQQT+V+ + Y++RFM +PTV L Sbjct: 7 FSTTLLHWFQHHGR-SLPWRETKDPYAIWLSEVILQQTRVSQGLAYWQRFMHNYPTVNAL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A DEVL LW GLGYY+RARNLH+AA+Q+ L G FP T + ++ L GVG TA AI Sbjct: 66 AAATEDEVLRLWQGLGYYSRARNLHQAAKQIVEL--GYFPNTHDAISKLKGVGPYTAAAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ ++DGNV RVLAR + + E + + +L++ + P RFN+A+M Sbjct: 124 ASIAFNLPVAVVDGNVYRVLARFFGIDTPINSTEGKKQFATLAQSLLPHHAPARFNEAIM 183 Query: 186 DLGAMICTRSK----------------PK-CSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 D GA+ C K P C+ CPL C+A A P K Q Sbjct: 184 DFGALQCLPVKGETTKTNNNIATTNESPAFCNDCPLNGQCVAYAQGLVRSLPVKTKSQAP 243 Query: 229 PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYC---------------------------- 260 +R ++ L+ EV + +RP +W GL+ Sbjct: 244 KQRRMGYIYLRCRGEVAIRKRPAGDIWQGLWEPLLYEDEVLSSAQTPKTSKQESTNYIED 303 Query: 261 FPQFADEESLRQWLAQRQIAADNLTQL----------------TAFRHTFSHFHLDIVPM 304 FP L L Q A T L FRH +H I+ Sbjct: 304 FPSPQLLAYLDSLLKGEQCATSTATSLHTHPHPTNSSPTIKGPYTFRHVLTHR--IIMAQ 361 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLA 334 + V + + +W + ++ Sbjct: 362 FAIVETEVKPILPSGYIWVTEGELNKYAIS 391 >UniRef50_B8FVN1 A/G-specific adenine glycosylase n=2 Tax=Desulfitobacterium hafniense RepID=B8FVN1_DESHD Length = 401 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 122/391 (31%), Positives = 180/391 (46%), Gaps = 58/391 (14%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 S++++ W+++ R LPW+ K PY +W+SEVMLQQTQV T IPY+ RFM RFPT++ LA Sbjct: 6 SSKLVQWFNQVKR-DLPWRRTKDPYAIWVSEVMLQQTQVVTAIPYYLRFMGRFPTLSHLA 64 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A +EVL LW GLGYY+RAR L + A+ V G+ P+ ++ + + GVG TA AI Sbjct: 65 EAEQEEVLELWRGLGYYSRARRLWEGARYVVETAEGRMPKDYQSLLHIKGVGEYTAAAIA 124 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S++ + P++DGNVKRVL+R K +V P FNQ MM+ Sbjct: 125 SIAYEEQVPVMDGNVKRVLSRILRWEEDVEKARSRRFFLEYLGEVIPGDCPGDFNQGMME 184 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA +CT P+C CPLQ C A +YP KK K+ L+L H+ +VLL Sbjct: 185 LGATVCTPKHPRCEQCPLQADCEGFALGDPQVYPVKKSKEKPGSAWRPTLILLHQGQVLL 244 Query: 247 AQRPPSGLWGGLYCFPQFA--------------------------DEESLRQ-------- 272 +RP +GL L+ FP EE+ R Sbjct: 245 KKRPSTGLLADLWEFPGEEMMVPPWENYRGGKEFSAEAVKVAERVGEETSRYRYNLDFAW 304 Query: 273 -------------WLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW----------LPVS 309 A +++ + ++ HTFSH I + + V Sbjct: 305 YELYKNQIGDAACDEAVQELLSRRPSRQGPLVHTFSHRRWQIYWLVLNLDEVMDKTVRVR 364 Query: 310 SFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + W ++ + + L +++ Sbjct: 365 ENEDSLKGDGLCWLDVKELDKIALPVAFQKV 395 >UniRef50_Q2QW21 Os12g0211400 protein n=6 Tax=Poaceae RepID=Q2QW21_ORYSJ Length = 474 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 124/401 (30%), Positives = 184/401 (45%), Gaps = 61/401 (15%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTP--------------YKVWLSEVMLQQTQVATV 48 A+ A++L WYD R+ LPW+ P Y VW+SEVMLQQT+V V Sbjct: 54 AAAVRAELLRWYDAN-RRDLPWRRAAEPPAGSGSGRGEEQRAYAVWVSEVMLQQTRVPVV 112 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 + Y+ R+MAR+PTV LA A +EV +W GLGYY RAR L + A+Q+ G+FP T Sbjct: 113 VDYYSRWMARWPTVDSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQIVEK--GEFPCTA 170 Query: 109 EEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS 168 + + G+G TAGAI S++ + P++DGNV RV++R YA+ P + + W L+ Sbjct: 171 STLREVRGIGDYTAGAIASIAFNEVVPVVDGNVVRVISRFYAIPDNPKESSTVKRFWQLT 230 Query: 169 EQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSW------ALYPGK 222 ++ FNQAMM+LGA +C+++KP CS CP+ + C A A +S +P Sbjct: 231 GELVDPSRPGDFNQAMMELGATLCSKTKPGCSQCPVSSHCQALALSSQNASVKVTDFPRV 290 Query: 223 KPKQTLPERTGYFLLLQ--------------HEDEVLLAQRPPSGLWGGLYCFPQFADEE 268 PK ++Q ++ LL +RP GL GL+ FP E Sbjct: 291 VPKAKPRSDFAAVCVVQISQGFGEGIAEAEGKDNLFLLIKRPEEGLLAGLWEFPSVLVNE 350 Query: 269 SLRQWLAQRQIAADNLTQL-----------------TAFRHTFSHFHLDIVPMWLPV--- 308 L +R+ L QL H FSH L + + + Sbjct: 351 GKTDTLNRRKEMDKYLKQLLSIDVTRRSSVILREDVGQHVHIFSHIRLTMFVELMILNLK 410 Query: 309 ----SSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 D N S+GL + + ++ ++ Sbjct: 411 DDGEELCKEGQDSTKLRLINENAVQSMGLTSGIRKVYNMVK 451 >UniRef50_A9NG45 A/G-specific adenine DNA glycosylase n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NG45_ACHLI Length = 334 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 117/340 (34%), Positives = 184/340 (54%), Gaps = 14/340 (4%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++ DWY R LP++ PY +W+SE+MLQQTQV T++P+++RF+ +PT+ DLA A Sbjct: 5 RLFDWYQTNKR-DLPFRKTDNPYHIWVSEIMLQQTQVDTMLPFYDRFLTIYPTIQDLARA 63 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 +DE+L + G+GYY R R LHK AQ V H GK PE + ++ +PG+G TAGAI+S+ Sbjct: 64 DIDEILKVVQGIGYYRRFRMLHKGAQYVIDHHDGKLPEDYFKILKIPGIGAYTAGAIMSI 123 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFNQAMMDL 187 + K +P DGNV RVL+R + + + KL +++++ + + Q+MM+L Sbjct: 124 AFHKPYPATDGNVIRVLSRVKMLEDDFRLDKNKKKLNEMNKELIENSNNPYLYTQSMMEL 183 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA +C S P C CPLQ C+A NN YP P +T E + +L++++ L+ Sbjct: 184 GATVCKVSNPLCDTCPLQEVCLANINNVQQNYPKMSPLKTKKEIQYFTFILEYKEGFLMR 243 Query: 248 QRPPSGLWGGLYCFPQFADEESLRQWLAQRQ---IAADNLTQLTAFRHTFSHFHLDIVPM 304 +R L G Y F Q +SL L Q Q + L +L +H F+H I Sbjct: 244 KRTE-DLLHGFYEFVQIE-SDSLNGALTQAQDLGLEIQILDELQPIKHVFTHMIWQINLY 301 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 +S+ T + + V +A V++ ++Q+ Sbjct: 302 RGTISNITSPYEIVH-------DFSQVPIATVVKKQIKQI 334 >UniRef50_B3EUI2 A/G-specific adenine glycosylase n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3EUI2_AMOA5 Length = 373 Score = 321 bits (823), Expect = 2e-86, Method: Composition-based stats. Identities = 106/304 (34%), Positives = 177/304 (58%), Gaps = 13/304 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ ++++WY ++ + LPW+ K PYK+WLSE++LQQT+VA +PY++RF+ +PT+ DL Sbjct: 20 FAIKLIEWY-QHHHRALPWRETKDPYKIWLSEIILQQTRVAQGLPYYQRFIENYPTIHDL 78 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +L +W GLGYY RARNLH A+ + T GKFP ++ + +LPG+G TA AI Sbjct: 79 ASASETAILRVWQGLGYYTRARNLHACARTIVTQFQGKFPNNYKALLSLPGIGVYTAAAI 138 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ + P++DGNV RVLAR + + + ++ L++ + + +NQA+M Sbjct: 139 ASIAFKEPIPVIDGNVYRVLARIFDIETAINSTKGKHIFNQLAQTLISKTAPDIYNQAIM 198 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ-HEDEV 244 + GA+ CT KP C+ C + C A N L P K+ K + +R ++L +Q +D++ Sbjct: 199 EFGAIQCTPLKPLCNTCIFKMDCSAFLANKQHLLPVKEAKVKIKQRFFHYLCIQLDDDQL 258 Query: 245 LLAQRPPSGLWGGLYCFPQFAD---------EESLRQWLAQRQIAADNLTQLTAFRHTFS 295 + R P +W GLY F + E+ L Q + + Q+ + + + ++H + Sbjct: 259 FMKSRKPGDIWTGLYDFYLVEESERKEFDQLEDELVQLIKKHQLYIEKVPTV--YKHILT 316 Query: 296 HFHL 299 H L Sbjct: 317 HRVL 320 >UniRef50_C6X0N8 A/G-specific adenine glycosylase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0N8_FLAB3 Length = 372 Score = 319 bits (818), Expect = 9e-86, Method: Composition-based stats. Identities = 109/340 (32%), Positives = 172/340 (50%), Gaps = 15/340 (4%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 +L WY+ +GR LP++ PY++W+ E++ QQT++A + +++ F+ARFP V LAN Sbjct: 15 ENLLAWYNTHGR-DLPFRKTGDPYRIWVCEIIFQQTRIAQGLNHYQNFIARFPDVHTLAN 73 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A DEVL W GLGYY+RA NLHKAA Q+ +GG FP+ ++++ L GVG+ TA AI S Sbjct: 74 AETDEVLLYWKGLGYYSRALNLHKAAMQIIHEYGGIFPKNYDDILKLRGVGKYTAAAIAS 133 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 + G H P +DGN RVL+R +A + N L+ ++ P FN+AMMDL Sbjct: 134 ICFGAHIPAVDGNFYRVLSRVFAEDFDVSNSKAFNYFSGLALRMMPQNKAGHFNEAMMDL 193 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 G+ +C PKC +CPL C+A +P K K + + ++ + L+ Sbjct: 194 GSEVCRPRNPKCEICPLNADCLAFNLGLIPKFPVKTKKTKPTDLELTYYFVEFGERFLIR 253 Query: 248 QRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLP 307 QR +W L+ FP E+ +Q ++T H +H +L I + Sbjct: 254 QRNEDHIWKKLFEFP-IRIPENFKQ----------DITHHKTVSHQLTHKNLTIHIYKVT 302 Query: 308 VSSFTGCMD---EGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + ++ + N L P++ L + Sbjct: 303 LIAENAFLEFALKENLQIITLEDSHHKSFPKPLQNYLARF 342 >UniRef50_C6XY88 Helix-hairpin-helix motif protein n=4 Tax=Sphingobacteriaceae RepID=C6XY88_PEDHD Length = 349 Score = 317 bits (813), Expect = 3e-85, Method: Composition-based stats. Identities = 107/343 (31%), Positives = 165/343 (48%), Gaps = 9/343 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F +++++WY + R LPW+ Y +WLSEV+LQQT+V +PYF F+ +PTV D Sbjct: 2 SFQSEIVNWYLNHKR-DLPWRGTTDAYIIWLSEVILQQTRVDQGLPYFNNFLQNYPTVLD 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 A+A +VL LW GLGYY+R RN+ A+QV LHGG FP ++++ L G+G TA A Sbjct: 61 FASASETQVLKLWQGLGYYSRGRNMLFTARQVRDLHGGVFPVRYDQLIKLKGIGEYTAAA 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S S + +LDGNV RVL+R + + + + L++ + +NQA+ Sbjct: 121 IASFSSNESKAVLDGNVFRVLSRYFGIESPINSSTGKKQFADLAQSLISGQQPSVYNQAI 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+ GA+ C P C +CP+Q+ C A ++ P K K R + L D + Sbjct: 181 MEFGALQCKPKSPNCGICPVQDSCFAQKHHLVGTLPVKLNKLKKRTRYFNYFLCMEGDNI 240 Query: 245 LLAQRPPSGLWGGLYCFPQ-------FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ +R P +W LY FP D E L + AA + L+ +H +H Sbjct: 241 LVKKRSPGDIWQELYDFPLIETDRPFLEDPEKFAPLLQESFGAACKVRTLSHQKHLLTHQ 300 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + + L W +L + + + Sbjct: 301 TIYVQFFGLDNYIINFN-QNAEIKWVSLPEFDELPQPKVITNF 342 >UniRef50_C2KZ90 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KZ90_9FIRM Length = 391 Score = 317 bits (813), Expect = 4e-85, Method: Composition-based stats. Identities = 107/305 (35%), Positives = 168/305 (55%), Gaps = 5/305 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L WY++ R+ LPW+ D PY WLSE+MLQQT+V V Y+ RF++ P + DL Sbjct: 19 FAKALLSWYNEN-RRILPWREDPGPYHTWLSEIMLQQTRVEAVKGYYSRFLSALPEIADL 77 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA ++VL LW GLGYY+RARNL KAA+ + T + G+ P+TF+E+ LPG+G TA AI Sbjct: 78 ANAEEEKVLKLWEGLGYYSRARNLQKAAKTIMTEYAGEMPKTFQELKKLPGIGEYTAAAI 137 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S+ + P LDGN+ R+ AR + + E + +S ++ + FNQA+M Sbjct: 138 ASIVYKEEIPALDGNLLRIFARLTSYPKVVLEPEGKKLAFSYFQEKMQGIDPGDFNQALM 197 Query: 186 DLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 DLG+ +C + C CPL+ C +A + YP K+ L++H +E+ Sbjct: 198 DLGSGVCLPKGEIGCKTCPLKAFCSSAKAGNPMDYPKLPEKKKRKVEQYSIFLIRHREEL 257 Query: 245 LLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +L +R GL GLY F + E+ + + + + + L +H F+H ++ Sbjct: 258 MLKKRENKGLLAGLYEFYKLEGHCSEKEALEAVENLGLRSLRIKALGEAKHIFTHKEWEM 317 Query: 302 VPMWL 306 + + Sbjct: 318 IGYEV 322 >UniRef50_A0L468 A/G-specific DNA-adenine glycosylase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L468_MAGSM Length = 365 Score = 315 bits (808), Expect = 1e-84, Method: Composition-based stats. Identities = 136/347 (39%), Positives = 195/347 (56%), Gaps = 13/347 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + ++L +YD+YGR LPW+ + Y++WLSE+MLQQT V TV+PY+E+F++ FP++T Sbjct: 8 ADLAQRLLAYYDEYGR-DLPWRQQQDLYRIWLSEIMLQQTGVKTVMPYYEKFLSHFPSIT 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A ++VL W GLGYY RAR LH+AAQQV HGG FPE +V ALPG+G STA Sbjct: 67 QLAAASQEQVLAQWQGLGYYRRARMLHQAAQQVVQQHGGLFPEEITQVQALPGIGPSTAA 126 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AIL++ + ILDGNV RVLAR + + +LW ++ Q+T + QA Sbjct: 127 AILAIGRNQAHTILDGNVMRVLARLLTLELPVDSTPGKQRLWQVARQLTSQQRPGDYAQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQHED 242 +MDLGA +CTRS+P CS CP C A + SWA YP K+ K+ P ++LL + Sbjct: 187 IMDLGATLCTRSQPACSRCPWGGACAARQHGSWAEYPKKREKKPKPHHYQCMWVLLDTQQ 246 Query: 243 EVLLAQRPPSGLWGGLYC---------FPQFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 + L +RP GL GGL+ P + L I L + H Sbjct: 247 RIFLRKRPLEGLLGGLWEPLGEPLLETPPLGNLVQRASHHLTALGIQGQPLLEAQPVDHI 306 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 F+HF L + P+ + +S +++ N W+ LAQ +A ++ Sbjct: 307 FTHFRLTVYPILVVAASGAPILNDAN--WWPLAQLDQRPIATLHRKV 351 >UniRef50_Q6MF36 Putative A/G-specific adenine glycosylase, mutY n=2 Tax=Parachlamydiaceae RepID=Q6MF36_PARUW Length = 352 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 109/351 (31%), Positives = 166/351 (47%), Gaps = 12/351 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M +S + ++ W+ R LPW+ + TPY VW+SEVMLQQTQVA VIPYF R+M FP Sbjct: 1 MSSSFPNEKLKKWFLSSKR-DLPWRQNPTPYAVWISEVMLQQTQVAVVIPYFNRWMQAFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA A L+EV+ W GLGYY+RAR+LH+ AQ + G+ P +E+ + G+G Sbjct: 60 SIEALAIASLEEVIKAWEGLGYYSRARHLHQGAQYLVEHCQGELPADEKELKKIKGLGPY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GAILS + + +DGNV RVLAR + + K KL +E + P Sbjct: 120 TIGAILSFAFHQKKAAVDGNVLRVLARYFQIEEDIAKTSTIKKLRDFAESILPDEESWIS 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 ++A+++LGA IC K C CPL++ C + + P K +++ Sbjct: 180 SEALIELGATIC-SRKAICQECPLKSNCQSYLHGLSDKLPIKTKAIKTQYLYRAVAIVKC 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA-DEESLRQWLAQRQIAADN------LTQLTAFRHT 293 D L+ + + LY FP F +E + Q I L L++ H Sbjct: 239 GDYFLVKKGKKGEIMSDLYEFPFFEVSQEGITFKELQENIKLHWNLTVHLLQSLSSIEHG 298 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 F+ + + P + + W + ++ R+ Q L Sbjct: 299 FTRYQARLDPCVFNCLEMSPV---KDLEWLDTKSLKQKAFSSGHRRIFQSL 346 >UniRef50_D2QT13 A/G-specific adenine glycosylase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QT13_9SPHI Length = 362 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 112/341 (32%), Positives = 173/341 (50%), Gaps = 12/341 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ + WY + R LPW+ PY +WLSE++LQQT+VA PY+ERF+ +PT++D+ Sbjct: 14 FAPTLEQWYSVHKR-DLPWRHTLDPYYIWLSEIILQQTRVAQGKPYYERFVDAYPTISDM 72 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA ++L LW GLGYY+RARNLH+ A+ V GKFP T+ ++ + G+G TA A+ Sbjct: 73 ANADERKLLRLWQGLGYYSRARNLHQTARYVTEKLDGKFPNTYHDLLKMKGIGAYTAAAV 132 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-PAVGVERFNQAM 184 S + G+ P++DGNV RVLAR + ++ + +L+ ++ AV +NQA+ Sbjct: 133 ASFAFGERVPVVDGNVYRVLARVFGITEDITTTTAKKTFAALATRLMQAAVDPATYNQAI 192 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+ GA+ CT P C LCP+Q C+A P K K + ER +++ Q + Sbjct: 193 MEFGAIHCTPVAPDCLLCPVQQQCVAYLTGRQHQLPVKSKKAPVRERFFSYVVFQQGGRL 252 Query: 245 LLAQRPPSGLWGGLYCFPQFADEE---SLRQWLAQRQIAA----DNLTQLT-AFRHTFSH 296 L +R +W LY F +E +LR +++ L + SH Sbjct: 253 ALQERVARDIWQNLYDFYLVETDEPKTALRDIPLSESVSSLVNRGTLVEAPVEAMQLLSH 312 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 + + + WY+LA+ L PV Sbjct: 313 QRIRAYFFLVDLPETETGSLPIGLQWYSLAEISI--LPKPV 351 >UniRef50_C0YGU7 Possible A/G-specific adenine glycosylase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YGU7_9FLAO Length = 359 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 109/342 (31%), Positives = 160/342 (46%), Gaps = 16/342 (4%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 ++L+WY R LP++ K PYK+W+ E++ QQT++ + ++ F+ RFP V Sbjct: 27 HIGNRLLEWYRNNAR-DLPFRQTKDPYKIWICEIVFQQTRINQGLNHYNNFIKRFPDVKT 85 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A +EVL W GLGYY+RA N+HKAAQQ+ + G FP +EE+ L GVG+ TA A Sbjct: 86 LAEADENEVLLYWKGLGYYSRAINIHKAAQQIMNDYQGVFPHQYEEILKLKGVGKYTAAA 145 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S+ G P +DGN RVL+R +A L+ V P V FNQAM Sbjct: 146 VSSICFGGRMPAVDGNFYRVLSRIFADDFDISNSRAFTYFSELAALVMP-ENVGDFNQAM 204 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+ IC P C CPL C+A + + YP K K + + + + Sbjct: 205 MDLGSEICKPKNPLCGECPLNEDCLAFSLQKISDYPVKTKKVKAGDLALTYYFVHRNGKF 264 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 L+ QR +W L+ FP + + H +H +L I Sbjct: 265 LIRQRADDFIWKKLFEFPAAIPSDMKPFITGSK-----------TITHKLTHKNLSIEIW 313 Query: 305 WLPVSS---FTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + V+S + + E L +L P+E +Q Sbjct: 314 NVEVTSEKVWNDFIAENQYLITDLEGSHEKSFPKPLEIYIQN 355 >UniRef50_Q2N933 Putative uncharacterized protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2N933_ERYLH Length = 347 Score = 314 bits (806), Expect = 2e-84, Method: Composition-based stats. Identities = 124/341 (36%), Positives = 176/341 (51%), Gaps = 17/341 (4%) Query: 10 VLDWYDKYGRKTLPWQ-----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +LDWYD++ R LPW+ PY+VWLSE+MLQQT VA V PYF RF +PTV Sbjct: 16 LLDWYDRHAR-DLPWRAQPGEPAADPYRVWLSEIMLQQTTVAAVKPYFARFTEVWPTVEA 74 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A ++V+ W GLGYY+RARNL AA+ V L G FPET E+ LPGVG TA A Sbjct: 75 LAEASEEDVMAAWAGLGYYSRARNLLMAAKAVGDL--GGFPETEAELRRLPGVGEYTAAA 132 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I +++ G+ ++D NV+RV++R +A+ + + + ++ +TP F QAM Sbjct: 133 IAAIAFGRRAVVVDANVERVVSRLFAIEEP--LPKARRAIRAHADAITPDRRAGDFAQAM 190 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+ +CT P+C LCPL C A +P K P++ P R G ++ E V Sbjct: 191 MDLGSQVCTTRAPQCLLCPLARFCEA-RTQEPERFPVKPPRKAKPLRKGRAFWIEREGTV 249 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 L +R SG+ GG+ P D S R + A + H F+HF L++ + Sbjct: 250 WLVRREGSGMLGGMRSLP--DDGWSARSDGSGELPLAGEWENAGSVFHVFTHFALELTVL 307 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + E W+ L GL + ++ R Sbjct: 308 RHGGAEHPTGEGE----WWPLDDLGKAGLPTLFAKAARRAR 344 >UniRef50_Q2G9R8 A/G-specific DNA-adenine glycosylase n=7 Tax=Alphaproteobacteria RepID=Q2G9R8_NOVAD Length = 359 Score = 314 bits (806), Expect = 3e-84, Method: Composition-based stats. Identities = 124/347 (35%), Positives = 179/347 (51%), Gaps = 15/347 (4%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQI-----DKTPYKVWLSEVMLQQTQVATVIPYFERF 55 + +LDWYD + RK LPW+ + PY+VWLSEVMLQQT VA V PYFE+F Sbjct: 11 FDPQAIAPALLDWYDAHARK-LPWRRLPGEARQDPYRVWLSEVMLQQTTVAAVGPYFEKF 69 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 +PTV DLA A +V+ W GLGYYARARNL A+ VA GG FP++ + + ALP Sbjct: 70 TRLWPTVGDLAAADDGDVMAAWAGLGYYARARNLLACARAVAA-MGGTFPDSEDGLRALP 128 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TA A+ +++ G+ ++D NV+RV+AR +A+ + + + QVTP Sbjct: 129 GLGEYTAAAVAAIAFGRRAVVVDANVERVIARLFAIDEP--LPAGKAAIRLAAGQVTPEE 186 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 F QAMMDLGA +CT P+C LCPL+ C A A + P K ++ P R G Sbjct: 187 RAGDFAQAMMDLGATVCTARSPRCMLCPLREHCRALAEGAPERLPVKAARKAKPVRQGRA 246 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 ++ E VLL +RP G+ GG+ P ++ RH F+ Sbjct: 247 YWIEREGRVLLVRRPGRGMLGGMRALP----DDGWSARGDGADAIGGEWRGGGVVRHGFT 302 Query: 296 HFHLDIVPMWLPVSSFT--GCMDEGNALWYNLAQPPSVGLAAPVERL 340 HF L++ M + +++ W+ + + + GL + Sbjct: 303 HFDLELQLMLCVQAEAASLPGLNDIEGEWWPVDEIEAAGLPTVFAKA 349 >UniRef50_C1XGJ6 A/G-specific DNA-adenine glycosylase n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XGJ6_MEIRU Length = 336 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 121/351 (34%), Positives = 178/351 (50%), Gaps = 26/351 (7%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + +S + +L WY ++ R+ LPW+ + PY+V LSEV+LQQT+V IPY+ RF+ RFP Sbjct: 6 LMSSDLHSALLTWYQRHQRR-LPWRGEPDPYRVLLSEVLLQQTRVEQAIPYYHRFLQRFP 64 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A + VL +W G GYY RARNLH+ AQQV GG P++ + ALPG+G Sbjct: 65 TLEALAQAEQEAVLQVWQGCGYYTRARNLHRLAQQVVAA-GGVLPQSARGLRALPGIGPY 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT--PAVGVE 178 TA A+ S++ G+ +DGNV+RVL+R A P K+V+ +L + Sbjct: 124 TAAAVASIAFGEPAAAVDGNVRRVLSRLLAWEH-PTPKQVQEAADALLSALVQQKDARPG 182 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA+M+LGA +CT P C CP+ C S A YP + ++ L+L Sbjct: 183 DWNQALMELGATVCTPQNPGCGGCPVAAFCQ--GKASPAHYPASRVRK-QKSLELVALVL 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 Q + V L QR + GGL+ P +L + LA+ + + L + RH F+H Sbjct: 240 QGPEGVYLEQR-QGPVLGGLWGVPMEEGPGALERLLARFGLDSAEL--VGPVRHEFTHRK 296 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 L I P W P L+ +++L L +P Sbjct: 297 LHIRVYRAP--------------WVAGENPVRRPLSRLDQKIL-ALAGASP 332 >UniRef50_Q3KMR2 A/G-specific adenine DNA glycosylase n=9 Tax=Chlamydia RepID=Q3KMR2_CHLTA Length = 368 Score = 313 bits (803), Expect = 5e-84, Method: Composition-based stats. Identities = 99/341 (29%), Positives = 173/341 (50%), Gaps = 17/341 (4%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 + W+ + R + PW+ TPY+VW+SEVMLQQT+ V+PYF ++M RFPT+ DLA Sbjct: 16 EALRSWFLESKR-SFPWRDSPTPYRVWVSEVMLQQTRAEVVVPYFLKWMERFPTLQDLAQ 74 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A +V+ LW GLGYY+RARNL A+ + + GG+ P ++++ G+G TA AIL+ Sbjct: 75 ARESDVVQLWEGLGYYSRARNLLAGARVITEIFGGEIPNDLALLSSIKGIGSYTANAILA 134 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 + + P +DGNV RV++R +A+ + ++ L E + P + ++ ++L Sbjct: 135 FAFKQKNPAVDGNVLRVMSRLFAIEESIDRMNTRREITGLCESLLPDQDPQVIAESFIEL 194 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA IC +P C CPL++ C A + YP + + + +++ ++D+VL+ Sbjct: 195 GARICK-KQPLCEQCPLRSFCTAYRQGTMEQYPVRNTRAAISRLFRAVVIVLYKDQVLMT 253 Query: 248 QRPPSGLWGGLYCFPQFA-------DEESLRQWLAQ-RQIAADNLTQLTAFRHTFSHFHL 299 +R + GLY FP + D E + + + ++ L + + F+ + + Sbjct: 254 KREEKEIMAGLYEFPYYQLPKEDCCDIEKITHLVQKDYGETLYFVSSLPSQKQAFTRYRV 313 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + P Y LA+ S+ ++ R+ Sbjct: 314 TLFPHVFYTKYSLPNS-------YTLAELSSLPSSSGHRRI 347 >UniRef50_Q254A0 Adenine/guanine glycosylase n=5 Tax=Chlamydophila RepID=Q254A0_CHLFF Length = 369 Score = 313 bits (803), Expect = 5e-84, Method: Composition-based stats. Identities = 100/353 (28%), Positives = 173/353 (49%), Gaps = 17/353 (4%) Query: 2 QASQF-SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 +A +F ++ W+ R + PW+ D +PY VW+SEVMLQQT+ V+ YF +M RFP Sbjct: 9 RAKKFPVEKLKQWFIDNKR-SFPWRDDPSPYNVWVSEVMLQQTRAEVVVKYFLEWMKRFP 67 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A +EV+ W GLGYY R RNL A+ V GGK P+ ++ + G+G Sbjct: 68 TIESLATANEEEVIKAWEGLGYYTRVRNLLLGARMVMKDFGGKLPDDPLDLMQIKGLGPY 127 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T AIL+ + + +DGNV RV++R + + + + ++ ++ + P + Sbjct: 128 TVHAILAFAFKRRTAAVDGNVLRVISRVFLIDASIDLESTKTWIFRITLSILPVKDPQVV 187 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 +A+++LGA +C PKC +CPL + C A P + ++ + + ++ + Sbjct: 188 AEALIELGACVCK-RSPKCEICPLNSVCGAFKEKKQKSLPIRHARKKVVSLFRWVAIILY 246 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA-----DEESLRQWLAQRQIAADNL----TQLTAFR 291 ED ++L QR P + GLY FP D + + + + + +L R Sbjct: 247 EDFIVLEQRKPEEMMAGLYEFPYIEVGSLDDLADINALVQEMECYVGSSLIFCGELEEQR 306 Query: 292 HTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 H+F+H+ + ++P F + L Y L S+ ++ ++ L Sbjct: 307 HSFTHYKVRLIPKI-----FYARSKPKSELLYPLNTIDSLPFSSGHRKIKDWL 354 >UniRef50_B1LXG3 A/G-specific adenine glycosylase n=74 Tax=Bacteria RepID=B1LXG3_METRJ Length = 464 Score = 312 bits (800), Expect = 1e-83, Method: Composition-based stats. Identities = 129/343 (37%), Positives = 181/343 (52%), Gaps = 14/343 (4%) Query: 7 SAQVLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + +L WYD++ R+ LPW+ PY++WLSEVMLQQT +A V PYFERF+ RFP + Sbjct: 34 ADDLLAWYDRH-RRVLPWRALAGAAPDPYRIWLSEVMLQQTTIAAVRPYFERFLTRFPDI 92 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA AP + V+ W GLGYY+RARNLH A+ VA GG+FP+T E + LPG+G TA Sbjct: 93 FALAEAPEEAVMSAWAGLGYYSRARNLHACARTVAAA-GGRFPDTAEGLRKLPGIGAYTA 151 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI +++ + +DGNV+RVL+R YAV E + L++ + P F Q Sbjct: 152 GAIAAIAFDRQEAAVDGNVERVLSRAYAVEAPLPGSRPE--IRRLTQALVPTDRPGDFAQ 209 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGK-KPKQTLPERTGYFLLLQHE 241 A+MDLGA ICT +P C+LCP C A + + +P K K + R F+ ++ Sbjct: 210 ALMDLGATICTPKRPACALCPWMRPCRARSLGTQESFPRKIKVAKGALRRGAAFVAIRSG 269 Query: 242 DE-VLLAQRPPSGLWGGLYCFPQFADEE--SLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 DE VLL RPP GL G + P A E + L + A RH F+HF Sbjct: 270 DEAVLLRTRPPEGLLGNMAEPPGSAWEPDYDVAAALLDAPLDARWKRLPGLVRHGFTHFP 329 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 L++ V+ T + + L + ++L Sbjct: 330 LELTVFVARVALATPA--PAGTRFTPRSALDDEPLPGLMRKVL 370 >UniRef50_C8P1Q1 A/G-specific adenine glycosylase n=1 Tax=Erysipelothrix rhusiopathiae ATCC 19414 RepID=C8P1Q1_ERYRH Length = 327 Score = 311 bits (797), Expect = 2e-83, Method: Composition-based stats. Identities = 102/334 (30%), Positives = 156/334 (46%), Gaps = 21/334 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +++ WY + R LP++ K PYK+W+SE+M QQTQ+AT+IPY++R++ R+P V L Sbjct: 3 FTERLMGWYHENKR-VLPFRSQKDPYKIWVSEIMAQQTQIATMIPYYDRWIKRYPDVETL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA +DEVL +W GLGYY RARNLH AQ V + G P +E+ +PG+G T+ AI Sbjct: 62 ANAEIDEVLKMWEGLGYYRRARNLHAGAQYVMEHYEGTLPADKKELMKIPGIGDYTSSAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS--EQVTPAVGVERFNQA 183 S++ +DGNVKRV+AR + + + E + F QA Sbjct: 122 ASIAFSLPEIAIDGNVKRVMARYLNYTENVNTRACHKYFETFLKKELLLNGADPSDFTQA 181 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++CT S C CP + C P + +P L+ Q + Sbjct: 182 LMELGALVCTPSNTNCEGCPFKEMCACYRGECIGTIPFIPKAKPVPIYEKSVLIYQKDGR 241 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +L++ GL GL PQ + + +H FSH +I Sbjct: 242 ILISDDHEDGLMEGLLRLPQIDGIYAT--------------DPILTLKHKFSHLQWNISV 287 Query: 304 MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 + T N + + + + Sbjct: 288 FEVD----TIQDLNHNWYFIPREELKRLAIITAH 317 >UniRef50_B2UNR9 HhH-GPD family protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNR9_AKKM8 Length = 341 Score = 311 bits (796), Expect = 3e-83, Method: Composition-based stats. Identities = 113/346 (32%), Positives = 170/346 (49%), Gaps = 13/346 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 F +++W+ + GR PW+ P+ + +SE+MLQQT + TV+ ++R+M +FP Sbjct: 8 FNIHAFRNALVEWFRREGR-DYPWRRTTDPWHILVSELMLQQTTIPTVLGRYDRWMRQFP 66 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T LA L W GLGYY R R+L A+++ GG+FP+ E + LPG+G Sbjct: 67 TPAHLAAVDEQTALRSWEGLGYYRRVRSLQAIAREIVNEFGGRFPDNAEGLKRLPGIGPY 126 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T+GA+LS + K PI+D NV RVLAR S E + LWS +E + F Sbjct: 127 TSGALLSFAFNKAAPIVDANVARVLARIDNYSVPVDSTEGQKYLWSRAESLVDPEHAREF 186 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 N A+M+LG C+ S P C LCP++ C A P K PK + + +L Sbjct: 187 NSAIMELGQTCCSISSPDCLLCPVRPFCSAER---PETLPVKNPKPQVTRVEHHDILYIR 243 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 VLLA+ P G+Y FPQ DE +L +Q + ++T + H H+ Sbjct: 244 GKSVLLAKCPEGKRHAGMYRFPQREDEHTLSLPHVLKQTYSITRYRVTRYIH-----HVT 298 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAP-VERLLQQLR 345 P+ F +D+ + L +A P L +P + +LL +R Sbjct: 299 DTPLLREGEEFVP-LDKIHGL--PMASPDRKALNSPTLGKLLDHIR 341 >UniRef50_A9BD72 A/G-specific DNA glycosylase n=3 Tax=Cyanobacteria RepID=A9BD72_PROM4 Length = 399 Score = 311 bits (796), Expect = 3e-83, Method: Composition-based stats. Identities = 122/371 (32%), Positives = 181/371 (48%), Gaps = 29/371 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-----------PYKVWLSEVMLQQTQVATVIP 50 + + +L W+ GR +PW++ K Y + ++EVMLQQTQ+ V+P Sbjct: 20 KIDALRSTLLQWFKSNGRHYIPWKLTKDGTLPNENQYLAVYPILVAEVMLQQTQLKVVLP 79 Query: 51 YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG-------GK 103 Y+E++M PT+ DLA A D+VL LW GLGYY+RAR LH ++ + L G Sbjct: 80 YWEKWMLALPTLVDLAKAEEDKVLLLWQGLGYYSRARRLHVTSRILLNLIGIPNSLNPAN 139 Query: 104 FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK 163 +P+ E LPG+GR+TAG+I+S + P+LDGNVKRVL R + P K + Sbjct: 140 WPKDLESWMNLPGIGRNTAGSIISSAFNLPSPLLDGNVKRVLTRLIGSTKTPNKDLA--R 197 Query: 164 LWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 LW LS+ + FNQA+MDLGA ICT+ P C+ CP QN C A + + P K Sbjct: 198 LWKLSDLLLDKNLPRTFNQALMDLGATICTKYNPICTNCPWQNYCSAYNSGNPENLPVKG 257 Query: 224 PKQTL-PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLA 275 K L G L+L +VL+ QR G GG++ FP E+ R+ Sbjct: 258 QKLILSKAVIGVGLILNKNQDVLIDQRLDEGSMGGMWEFPGGKKEKDESIEMTIARELRE 317 Query: 276 QRQIAADNLTQLTAFRHTFSHFHLD-IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLA 334 + + +L F H+++H L IV + +S + W L+ + Sbjct: 318 ELGVEVKVGKKLIEFDHSYTHKKLHFIVHLCELISGKPKPLSSQEVRWVKLSDLQNYPFP 377 Query: 335 APVERLLQQLR 345 ++ L+ Sbjct: 378 KANSYMISALK 388 >UniRef50_C1F8W0 A/G-specific adenine glycosylase, putative n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F8W0_ACIC5 Length = 354 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 113/352 (32%), Positives = 177/352 (50%), Gaps = 27/352 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F + WY + R LPW+ + PY +W+SE+MLQQT+VA V+ Y++RFM +FPT Sbjct: 4 EIASFQRDISAWYRQNAR-DLPWRRTRDPYAIWVSEIMLQQTRVAAVMEYYQRFMGQFPT 62 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA+AP + VL LW+GLGYY RAR +H AA V HGGK P T ++ LPG+G T Sbjct: 63 IEALASAPEESVLALWSGLGYYRRARMMHHAAHIVVAEHGGKMPATAAQLRKLPGIGDYT 122 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE--NKLWSLSEQVTPAVGVER 179 + A+ S+S + P++DGNV+RVL R G + + L + ++++ Sbjct: 123 SAAVASISFDEPVPVIDGNVERVLLRLRGEPAVKGHPDAPGLSDLKAAAQELLDTEQPGD 182 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 FNQAMM+LGA +C P C+ CP++ C + K+ + Y L+ Q Sbjct: 183 FNQAMMELGATVCLPRAPLCAECPVRAYCRTQGEHETG-----PAKKMRSVQVSYALIRQ 237 Query: 240 HED---EVLLAQRPPSGL-WGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF- 294 +LLAQR GL+ P LA+ I L RH+ Sbjct: 238 APGPRTAILLAQRGAQESQMPGLWELPLVDP-----AGLAEDGI-------LLTVRHSIT 285 Query: 295 -SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 +++++ I LP + + W + + S+ L ++L++L+ Sbjct: 286 NTNYYVTIY-SLLPDQRDRLQVPARSTRWISARELRSIPLTGLARKVLKRLK 336 >UniRef50_A3EPM8 Putative A/G-specific DNA glycosylase n=2 Tax=Leptospirillum sp. Group II RepID=A3EPM8_9BACT Length = 355 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 127/340 (37%), Positives = 180/340 (52%), Gaps = 20/340 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WY + R +LPW+I++TPY+VW+SE+MLQQT V V+ YFERF+ RFP V Sbjct: 24 ERLRDDLLVWYKEVSR-SLPWRINRTPYRVWVSEIMLQQTTVRAVLGYFERFLERFPDVD 82 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA AP+++VL LW GLGYY RARNLH+AA+ +A+ G FPET E LPGVGRSTAG Sbjct: 83 SLAEAPVEDVLKLWEGLGYYQRARNLHRAARIIAS---GGFPETVEGWMNLPGVGRSTAG 139 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQ-VTPAVGVERFNQ 182 A+ S++LG+ PILD NV+RVL R +S +E + LW LS+ VT A NQ Sbjct: 140 AVCSIALGQETPILDVNVRRVLGRLRGISPGDAVRESPD-LWELSKAFVTEASDPGEVNQ 198 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M++GA++C KP C++CP C + L P +K K+ R L+ Sbjct: 199 ALMEIGAVVCLPRKPLCTVCPWSLDCASCGAPEEILNPPRKKKEKQ-IRIRTALIPSDGS 257 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L Q L GL+ + + H +SHF ++ Sbjct: 258 GYFLVQ-GRDRLLEGLW------------DVFSVSGPPGEGQMPFGKVLHEYSHFREEVF 304 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + S + E ++LW + V L ++L+ Sbjct: 305 LVREERSLLEAALGETSSLWLAKGEESPVALTGVARKILR 344 >UniRef50_Q3SPD3 A/G-specific DNA-adenine glycosylase n=2 Tax=Nitrobacter RepID=Q3SPD3_NITWN Length = 415 Score = 307 bits (788), Expect = 3e-82, Method: Composition-based stats. Identities = 125/369 (33%), Positives = 179/369 (48%), Gaps = 40/369 (10%) Query: 10 VLDWYDKYGRKTLPWQ----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +L+WYD+ R+ LPW+ PY+VWLSE+MLQQT V TV PYFE+F+AR+P V + Sbjct: 25 LLEWYDRN-RRLLPWRALPGEPVDPYRVWLSEIMLQQTTVKTVGPYFEKFLARWPDVAAM 83 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A LD++L +W GLGYY+RARNLH A +V HGG+FP+T E++ ALPG+G TA AI Sbjct: 84 ARASLDDILRMWAGLGYYSRARNLHACAVKVLRDHGGRFPDTEEDLRALPGIGPYTAAAI 143 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGW----------------------------PGK 157 +++ + +DGN++RV++R +AV G Sbjct: 144 AAIAFNRRTMPVDGNIERVVSRLFAVDEPLPKAKPRIHTLAATLLGPSRSGRDGKSRAGD 203 Query: 158 KEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWA 217 + S QA+MDLGA ICT KP C LCPL + C A A Sbjct: 204 VKTRAGRDGKSRASDVKTRAGDIAQALMDLGAAICTPKKPSCVLCPLSDDCAARARGDQE 263 Query: 218 LYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQR 277 +P K PK+ R G +++ EVL+ RP GL GG+ P + A + Sbjct: 264 TFPRKTPKKAGELRRGAAFVVRRGSEVLVRTRPAKGLLGGMTEVPTSTWLAAQDDTAALK 323 Query: 278 QI-----AADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVG 332 Q A + A H F+HF L++ + W +A+ Sbjct: 324 QAPCLESAPRWRRKAGAVTHVFTHFPLELAVY--TAAVARRTAAPEGMRWVPIARLNDEA 381 Query: 333 LAAPVERLL 341 L + +++ Sbjct: 382 LPNLMRKVI 390 >UniRef50_Q3ANF7 Mutator mutT protein n=19 Tax=Cyanobacteria RepID=Q3ANF7_SYNSC Length = 396 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. Identities = 131/363 (36%), Positives = 180/363 (49%), Gaps = 28/363 (7%) Query: 5 QFSAQVLDWYDKYGRKTL---PWQIDK-----------TPYKVWLSEVMLQQTQVATVIP 50 + SA +L W+ +GR+ PW PY +W++EVMLQQTQ+A +P Sbjct: 29 ELSACLLSWWQAHGRRDPVQKPWMFKPAGTWPEAVHQLDPYGIWIAEVMLQQTQLAVALP 88 Query: 51 YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEE 110 Y+ R+M FPTV LA A LDEV W GLGYY+R R LH+AAQ++ G +P + EE Sbjct: 89 YWMRWMEAFPTVETLAAASLDEVRLQWQGLGYYSRVRRLHEAAQRLV---GRPWPRSLEE 145 Query: 111 VAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQ 170 ALPG+GR+TAG+ILS + PILDGNVKRVLAR A + P + + W SE Sbjct: 146 WMALPGIGRTTAGSILSSAFNLRLPILDGNVKRVLARLTAHARPPARDDA--LFWCWSEA 203 Query: 171 VTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP- 229 + V NQA+MDLGA +CT +P C CP + C A A+ +P + LP Sbjct: 204 LLDPVRARDTNQALMDLGATLCTPRQPACHRCPWHSQCAAYASGDPCRWPVTNAPKPLPF 263 Query: 230 ERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAAD 282 + G ++L EVL+ QR GL GG++ FP E+ R+ + IA Sbjct: 264 QVIGVGVVLNAAGEVLIDQRLEEGLLGGMWEFPGGKQEQGETIETCIARELKEELGIAVT 323 Query: 283 NLTQLTAFRHTFSHFHLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 +L H +SH L V +S + W A R++ Sbjct: 324 VGAELITVDHAYSHKKLRFVVHLCDWMSGEPQPLASQQVRWVRPDDLVDYAFPAANARII 383 Query: 342 QQL 344 + L Sbjct: 384 EAL 386 >UniRef50_C5YTP8 Putative uncharacterized protein Sb08g006760 n=1 Tax=Sorghum bicolor RepID=C5YTP8_SORBI Length = 494 Score = 307 bits (786), Expect = 4e-82, Method: Composition-based stats. Identities = 118/360 (32%), Positives = 178/360 (49%), Gaps = 51/360 (14%) Query: 6 FSAQVLDWYDKYGRKTLPWQ-----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 AQ+L WYD + R+ LPW+ ++ Y VW+SEVMLQQT+V V+ Y+ER+MAR+P Sbjct: 64 LRAQLLRWYDAH-RRDLPWRCASGGEEERAYAVWVSEVMLQQTRVPVVVGYYERWMARWP 122 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA A +EV +W GLGYY RAR L + A+Q+ G FP T + + G+G Sbjct: 123 TVRSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQIIEK--GVFPRTASALREVRGIGDY 180 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ + P++DGNV RV++R YA++ P + + W L+ Q+ + F Sbjct: 181 TAGAIASIAFNEVVPVVDGNVVRVISRLYAIADNPKESSTVKRFWELAGQMVDPLRPGDF 240 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIA------AANNSWALYPGKKPKQTLPERTGY 234 NQAMM+LGA +C+++KP CS C + + C A A+ +P PK Sbjct: 241 NQAMMELGATLCSKTKPGCSECAVSSHCQALALSHENASVQVTDFPRVVPKAKPRRDFAA 300 Query: 235 FLLLQ--------------HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA 280 ++Q + + LL +RP GL GL+ FP +E + +R+ Sbjct: 301 VCVVQIAQGLEEEAPDAKDNSNLFLLIKRPEEGLLAGLWEFPLILVDEGKTDFQNRRKAM 360 Query: 281 ADNLTQL-----------------TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWY 323 L++L H FSH L + + ++ +G W Sbjct: 361 DKYLSKLLSFDVGQRSDVIFREDVGEHVHIFSHIRLTMHVELMIINL------KGRVCWV 414 >UniRef50_Q04T89 A/G-specific DNA glycosylase n=4 Tax=Leptospira RepID=Q04T89_LEPBJ Length = 372 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 106/352 (30%), Positives = 172/352 (48%), Gaps = 12/352 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L W+ K RK LP++I+K Y++W+SE+MLQQT+VA ++P +E F+ RFP Sbjct: 12 ELRENLLSWFQKNKRK-LPFRINKNAYRIWVSEIMLQQTRVAAMLPIYETFLKRFPDPKA 70 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 L +A +EV+ W GLGYY+RARNL K A+ + + G+FPE +EE ++PGVG TA A Sbjct: 71 LQDASEEEVMKYWKGLGYYSRARNLKKGAELLVEKYEGRFPEDYEEALSIPGVGSYTASA 130 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS++ GK + +LDGNVKRVL+R + + P L L+++ N+AM Sbjct: 131 VLSIAYGKPYAVLDGNVKRVLSRLFLIESDPNSNSTNQVLADLAQKFLTPGDPGNHNEAM 190 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA++C P CS CP +N C A P K + + FL L+ + Sbjct: 191 MELGALVCIPV-PNCSACPFENHCEAKGAGKEKEIPVTKSVENWVDLDLNFLFLKSSGRI 249 Query: 245 LLAQRPPSGLWGGLYCFP---QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 LL + + +Y P + +W + + + RH+ ++ + + Sbjct: 250 LLVKYTTRRFFKTIYSLPFRLEGKHPYEKDEWAEELFADSGVIQNSLRSRHSITNHRIRL 309 Query: 302 V------PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + + W N + + + L +LR Sbjct: 310 KFSELDEKNVVRIEKSLQKRKDIQFKWVNESDLKEE-FPSSISGKLIRLRNK 360 >UniRef50_C2MB00 A/G-specific adenine glycosylase n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2MB00_9PORP Length = 363 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 123/355 (34%), Positives = 182/355 (51%), Gaps = 15/355 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S ++ WY + RK LPW+ PY++WLSEV+LQQT++ Y+ RF+ +PTV+ Sbjct: 2 STLLTKLHSWYQENHRK-LPWRDIHDPYRIWLSEVILQQTRIDQGTSYYLRFVEHYPTVS 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA APLDEVL LW GLGYY+RARNLH+AAQ + GG FP ++ V ALPG+G TAG Sbjct: 61 DLAIAPLDEVLKLWEGLGYYSRARNLHRAAQLIVHELGGTFPADYQSVRALPGIGDYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-PAVGVERFNQ 182 A+LS + + +P +DGNV RVL+R YA + + +L+ Q+ A NQ Sbjct: 121 AVLSFAYDQPYPAVDGNVLRVLSRLYASEEPIDTTQGKKYYTALARQLVEKAPHPGLHNQ 180 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AM++LGA+ICT C+ CP+++ C +A + A P K K ++ R +L + Sbjct: 181 AMIELGALICTPQLCDCTRCPVRSECPSADSPERASLPRKALKLSVQPRHLGYLFVLKRG 240 Query: 243 ----EVLLAQRPPSGLWGGLYCFPQFADE--------ESLRQWLAQRQIAADNLTQLTAF 290 + +L QRP +W LY D+ +A L + Sbjct: 241 LDQWQTILYQRPTGDIWAKLYQPTLLYDQPVAPQEELLLQPLPEPPAALAHCPLRPFRRY 300 Query: 291 RHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQL 344 +H +H L I + + + +W L + L +++ LQQL Sbjct: 301 KHRLTHRQLYIDCYYTILPDSSTPRLPDGGVWVPLHDESEWLALPITLKKALQQL 355 >UniRef50_Q1GD89 A/G-specific DNA-adenine glycosylase n=11 Tax=Rhodobacterales RepID=Q1GD89_SILST Length = 353 Score = 303 bits (777), Expect = 6e-81, Method: Composition-based stats. Identities = 125/353 (35%), Positives = 177/353 (50%), Gaps = 20/353 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPY 51 + A Q S +L WYD + R+ +PW++ PY++WLSEVMLQQT VA V Y Sbjct: 6 VSAEQLSQDLLVWYDTHARE-MPWRVGPAARASGVRPDPYRIWLSEVMLQQTTVAAVKDY 64 Query: 52 FERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV 111 FERF R+P V DLA A +V+ W GLGYYARARNL K A+ VA G FP+ +E + Sbjct: 65 FERFTRRWPRVGDLAAAEDGDVMAEWAGLGYYARARNLLKCARVVAEEFEGVFPDAYEGL 124 Query: 112 AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV 171 ALPG+G TA AI +++ + +LDGNV+RV+AR + V+ L + + + Sbjct: 125 IALPGIGPYTAAAISAIAFDRPETVLDGNVERVMARLHDEHEP--LPAVKPVLKAHAAHL 182 Query: 172 TPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER 231 TP+ + QA+MDLGA ICT P C +CP ++ C A + P K PK+ P R Sbjct: 183 TPSARPGDYAQAVMDLGATICTPKSPACGICPWRDPCRARVKGTAPELPKKTPKKPKPTR 242 Query: 232 TG-YFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAF 290 G +L E + LL +RP GL GG+ +P ++ + AD Sbjct: 243 YGFVYLARSAEGDWLLERRPDKGLLGGMLGWPGSEWNDAPTETPP---FDADWQDLGAEV 299 Query: 291 RHTFSHFHLDIVPMWLPVSSFTGCMDEGNAL-WYNLAQPPSVGLAAPVERLLQ 342 RHTF+HFHL + + + + L + + Sbjct: 300 RHTFTHFHLILQ---VRSAELPADFEPRAGQELVRRHDFRPSSLPTVMRKAFD 349 >UniRef50_D1PLU8 A/G-specific adenine glycosylase n=2 Tax=Ruminococcaceae RepID=D1PLU8_9FIRM Length = 342 Score = 303 bits (777), Expect = 6e-81, Method: Composition-based stats. Identities = 101/329 (30%), Positives = 157/329 (47%), Gaps = 7/329 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +L W+ R LP++ + + Y +W+SE+MLQQT+VA IPY+ RF+A P L Sbjct: 5 IAPPLLQWFQANKR-LLPFRQEPSAYHIWVSEIMLQQTRVAAAIPYYNRFIAALPDPAAL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+ D + LW GLGYY R N+ KAA+ V +GG P ++ + +LPG+G TAGAI Sbjct: 64 ASCEPDALRKLWQGLGYYNRVNNMQKAARIVCEQYGGDLPSDYDALRSLPGIGDYTAGAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ G P +DGNV RV AR Y + + + P +N+A+M Sbjct: 124 ASIAFGIPAPAVDGNVLRVFARLYNDDADIMQPATKRLFTGRVLEQMPKETPGPYNEALM 183 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 +LGA++C P+C CPL C+ A A P K + L++ +L Sbjct: 184 ELGALVCVPGMPRCEACPLAALCLGYAAGRQADLPVKPAPKVKTPVAVTVALVESPAGLL 243 Query: 246 LAQRPPSGLWGGLYC---FPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L +RP GL GL+ + + L + L++ + L A RH F+H ++ Sbjct: 244 LQRRPARGLLAGLWQPAAWETVMSQAELAEELSKIGVQVTWGDALPAARHVFTHKIWNLG 303 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSV 331 ++ C +W + V Sbjct: 304 GWH---AAAPACALPEGWVWAAPEELEQV 329 >UniRef50_Q1ITJ1 A/G-specific DNA glycosylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1ITJ1_ACIBL Length = 324 Score = 300 bits (768), Expect = 7e-80, Method: Composition-based stats. Identities = 113/343 (32%), Positives = 173/343 (50%), Gaps = 36/343 (10%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 AS +L WY ++ R+ LPW+ + PY +W+SE+MLQQT+VA V+ + +F+A+FP Sbjct: 11 DASDLQKSLLSWY-RHSRRNLPWRRTRDPYAIWISEIMLQQTRVAAVLDKYAQFLAQFPN 69 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A LDEVL +W+GLGYY RAR LH+AAQ V GKFP+T LPG+GR T Sbjct: 70 VKALADASLDEVLTVWSGLGYYRRARALHQAAQMVVHHLHGKFPDTAAGWRQLPGIGRYT 129 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 + AI S++ + ++DGNV+RVL R G++ +LW +EQ+ +N Sbjct: 130 SAAIASIAFNEPAAVVDGNVERVLERL------DGERHEGERLWERAEQLLAKRAPGDWN 183 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QAMM+LGA IC P+C +CP+ C P K K+ ++ L + Sbjct: 184 QAMMELGATICLPQNPQCLVCPVNGPCKTRG--PLQSRPQPKRKRA----ELWYALYARK 237 Query: 242 DEVLLAQRPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + VLL QRP L G++ P I A+ + L RH+ + Sbjct: 238 NSVLLVQRPADHSLMAGMWELP---------------AIRANGVEPLHKLRHSITDTDYA 282 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + G + + W + + + ++L++ Sbjct: 283 VFVVR-------GRTAKKHGKWVTHEEAHRMAITGLTRKILRK 318 >UniRef50_B9SF01 A/G-specific adenine glycosylase muty, putative n=3 Tax=core eudicotyledons RepID=B9SF01_RICCO Length = 775 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 116/396 (29%), Positives = 177/396 (44%), Gaps = 59/396 (14%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT--------------PYKVWLSEVMLQQTQVAT 47 + + +L+WYD+ R+ LPW+ KT Y +W+SEVMLQQT+V T Sbjct: 35 ETQKIRESLLEWYDQNQRQ-LPWRRQKTTNPSQESEEEKEKRAYGIWVSEVMLQQTRVQT 93 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 VI Y+ R+M ++PT+ LA A L+EV +W GLGYY RAR L + A+ + GG FP T Sbjct: 94 VIDYYNRWMLKWPTIHHLAQASLEEVNEIWAGLGYYRRARFLLEGAKMIVA--GGGFPNT 151 Query: 108 FEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 + +PG+G TAGAI S++ + P++DGNV RVL R A+S P KLW L Sbjct: 152 VSSLRKVPGIGDYTAGAIASIAFKEVVPVVDGNVVRVLTRLRAISANPKDSMTVKKLWKL 211 Query: 168 SEQVTPAVGVERFNQAMMDLGAMICTRSK------PKCSLCPLQNGCIAAANNSWALYPG 221 + Q+ FNQ++M+LGA +C S P S C + + + YP Sbjct: 212 AAQLVDPCRPGDFNQSLMELGATVCAPSNPSCSSCPVSSQCRVLSISNQDKSILVTDYPT 271 Query: 222 KKPKQTLPERTGYFLLLQ------------HEDEVLLAQRPPSGLWGGLYCFPQFADEES 269 K K +++ + + LL +RP GL GL+ FP ++ Sbjct: 272 KVVKVKPKHEFSAVCVVEILGSCGPVDNQKTDSKFLLVKRPDDGLLAGLWEFPTCRLDKE 331 Query: 270 LRQWLAQRQI----------------AADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTG 313 + +I + + F H F+H L + L + G Sbjct: 332 ADLITRRNEIDHFMKKSFRLDPEKTYSMVLREDIGEFVHIFTHIRLKVYVDLLVIRLKGG 391 Query: 314 CM--------DEGNALWYNLAQPPSVGLAAPVERLL 341 + N P++GL + V +++ Sbjct: 392 MSQLFRKQEKEATNWKCVEKKALPNLGLTSGVRKVI 427 >UniRef50_C7JG78 DNA glycosylase A/G-specific MutY n=12 Tax=Alphaproteobacteria RepID=C7JG78_ACEP3 Length = 357 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 116/346 (33%), Positives = 170/346 (49%), Gaps = 13/346 (3%) Query: 7 SAQVLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + +L WYD++ R+TLPW++ PY+VWLSE+MLQQT V V PY+ RF +FPTV Sbjct: 6 AHALLHWYDRH-RRTLPWRVVGQSHPDPYRVWLSEIMLQQTTVKAVAPYYLRFTEKFPTV 64 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+A ++VL W GLGYY+RARNLH AQ V L G FP+ + + LPG+G TA Sbjct: 65 QALASADREDVLAAWAGLGYYSRARNLHACAQAVVAL--GGFPQDVQGLRVLPGIGPYTA 122 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSG--WPGKKEVENKLWSLSEQVTPAVGVERF 180 A+ +++ G +DGNV+RV AR +A++ P +K++ +L+ F Sbjct: 123 AAVAAIAFGVPVVPVDGNVERVTARLFAITEPLPPARKKLAQLAITLNADREAQERPSDF 182 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+ DLG+ +C+ P C LCP Q C A P K PK P R G L Q Sbjct: 183 AQALFDLGSSLCSPRAPACGLCPWQGECAGHKQGIAAELPRKLPKAERPVRYGAAFLAQD 242 Query: 241 -EDEVLLAQRPPSGLWGGLYCFPQFAD--EESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 ++LL +R GL + P E + Q A +H F+HF Sbjct: 243 AAGQILLRKRAEKGLLAAMTELPGTEWRLENWSEAEILQVAPFAAGWKLAGRVKHVFTHF 302 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 L + + F+ L + L + +++ ++ Sbjct: 303 TLYLDVYVAQIKHFSNQAVSDGFL-VPVECVKDTALPSVMQKCFEK 347 >UniRef50_UPI000034F1ED adenine-DNA glycosylase-related / MYH-related n=1 Tax=Arabidopsis thaliana RepID=UPI000034F1ED Length = 630 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 119/414 (28%), Positives = 175/414 (42%), Gaps = 72/414 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 + + +LDWYD R LPW+ ++ Y+VW+SE+MLQQT+V TV+ Y++R Sbjct: 127 ETQKIRMGLLDWYDVNKR-DLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKR 185 Query: 55 FMARFPTVTDLANAPLD-------------------EVLHLWTGLGYYARARNLHKAAQQ 95 +M ++PT+ DL A L+ EV +W GLGYY RAR L + A+ Sbjct: 186 WMQKWPTIYDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKM 245 Query: 96 VATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 V G FP + + G+G+ TAGAI S++ + P++DGNV RVLAR A+S P Sbjct: 246 VVAGTEG-FPNQASSLMKVKGIGQYTAGAIASIAFNEAVPVVDGNVIRVLARLKAISANP 304 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSK------PKCSLCPLQNGCI 209 + W L+ Q+ FNQ++M+LGA +CT SK P S C + Sbjct: 305 KDRLTARNFWKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSE 364 Query: 210 AAANNSWALYPGKKPKQTLPERTGYFLLL--------QHEDEVLLAQRPPSGLWGGLYCF 261 S YP K K +L Q +L +RP GL GL+ F Sbjct: 365 ENRTISVTDYPTKVIKAKPRHDFCCVCVLEIHNLERNQSGGRFVLVKRPEQGLLAGLWEF 424 Query: 262 PQFADEESLRQWLAQRQI------------------AADNLTQLTAFRHTFSHFHLDIVP 303 P E + I + +L F H F+H + Sbjct: 425 PSVILNEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYV 484 Query: 304 MWLPVSSFTGCMDEGNAL------W--YNLAQPPSVGLAAPVERL----LQQLR 345 L V G D W + ++GL + V ++ LQ ++ Sbjct: 485 ELLVVQLTGGTEDLFKGQAKDTLTWKCVSSDVLSTLGLTSAVRKVPPFRLQHIK 538 >UniRef50_Q0EVU8 Adenine glycosylase n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EVU8_9PROT Length = 307 Score = 297 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 109/337 (32%), Positives = 164/337 (48%), Gaps = 36/337 (10%) Query: 11 LDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPL 70 + WY ++ R LPW+ PY++W+SE+MLQQTQV TV+P + ++ FPT+ LA A Sbjct: 1 MAWYARHAR-DLPWRHTTDPYRIWISEIMLQQTQVKTVLPRYLAWLDTFPTIESLAAASA 59 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 D++L W GLGYY RAR +H+AA + G+FP F+++ +LPG+GRSTAGAI S Sbjct: 60 DDLLKAWEGLGYYRRARFIHQAAATIMEGFEGRFPRDFDDIVSLPGIGRSTAGAIASFCY 119 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFNQAMMDLGA 189 G P+LDGNVKRVL R + + LW L++Q + +NQAMM+LGA Sbjct: 120 GASTPVLDGNVKRVLKRWHG-----QPDASDKALWLLAQQAINISGKPGIWNQAMMELGA 174 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH---EDEVLL 246 C+ P C CP+ C +A + + ++ +P ++ + H + L Sbjct: 175 SACSAKSPDCGACPVNAFCASAFQVEVSQ----ERRKAVPVLDVHWRIDLHICPIKGIWL 230 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWL 306 QRP +G+WGGL+ P + + + H +H L + Sbjct: 231 TQRPDAGIWGGLWTPPITELDAPPVEA--------------PSHIHQLTHRRLHLYAYQC 276 Query: 307 PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 G W L + RLL++ Sbjct: 277 ESEPV------GAGHWVP--SIEGYALPTGIHRLLEK 305 >UniRef50_B0SFS5 A/G-specific DNA glycosylase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SFS5_LEPBA Length = 353 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 112/349 (32%), Positives = 172/349 (49%), Gaps = 15/349 (4%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 ++ DWY + R LP++ K Y +W+SEVMLQQT+VA ++P FE F+ RFP LA Sbjct: 4 QKKLRDWYLLHKR-DLPFRKKKQAYPIWISEVMLQQTRVAAMLPLFENFVNRFPNPESLA 62 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A +EVL W GLGYY+RARN+ KAA Q+ + G FP+ + V LPG+G TA AIL Sbjct: 63 KATEEEVLSFWKGLGYYSRARNIRKAAIQIVQQYNGSFPKDLDSVLKLPGIGNYTARAIL 122 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S+S +LDGNVKRVL+R Y + + E +L ++ NQA+M+ Sbjct: 123 SISYDLPLAVLDGNVKRVLSRYYGYTKNILGPQAEKELQLKADGFLNLDFPGDHNQAVME 182 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA IC PKC +CPL +GC A + P ++ KQ TG ++Q ++ +LL Sbjct: 183 LGATICLPESPKCLVCPLMDGCYARIHGKTKEIPLREKKQKQVLLTGEIWVIQQKNLILL 242 Query: 247 AQRPPSGLWGGLYCFPQF------ADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + + G++ P + + + Q F+HT ++ L+ Sbjct: 243 IKEKKNRFLKGMFHLPSGFLGEIPNSDYAPSPFFHSVQSEYKETPSKGKFKHTITYHKLE 302 Query: 301 IVPMWLPVSSFTGCM-----DEGNALWYNLAQPP---SVGLAAPVERLL 341 + + ++ + W ++ LA V+++L Sbjct: 303 YSVHLVNLKEPNQIQSLLAGNDLESKWVEVSDLESEFPSSLAKKVKKIL 351 >UniRef50_D0J8Q9 A/G-specific adenine glycosylase n=2 Tax=Blattabacterium RepID=D0J8Q9_BLASP Length = 351 Score = 296 bits (757), Expect = 1e-78, Method: Composition-based stats. Identities = 102/346 (29%), Positives = 160/346 (46%), Gaps = 9/346 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++++WY RK LPW+ K PY V +SE MLQQT+V+ + Y+ F+ +FP + L Sbjct: 3 FSKKIINWYKNNYRK-LPWRETKNPYYVLVSEFMLQQTRVSKTLKYYLDFVKKFPNLEKL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVA-TLHGGKFPETFEEVAALPGVGRSTAGA 124 A A EVL W GLGYY RAR LH A+Q+ G FP+T++E+ G+G T A Sbjct: 62 AYAEEKEVLKEWEGLGYYTRARYLHLFAKQLVKEKPSGFFPKTYKELIKYKGIGPYTGAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S+ + P LDGN RV +R + +N ++ FNQA+ Sbjct: 122 VASICFDEVIPALDGNAYRVFSRYLGIYHNITSFRAKNIFRFFILKIMDHNDPGIFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK--PKQTLPERTGYFLLLQHED 242 MDLG+ +CT K KC CP+++ C + N + P K K L Y + ++ Sbjct: 182 MDLGSTLCTPRKAKCFSCPVKSSCFSLKNGTVYDLPVKNIIKKSVLHRFFYYVFIWDKKN 241 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI----AADNLTQLTAFRHTFSHFH 298 + + +R +W GLY FP E++L + +I + + + H +H Sbjct: 242 HLCIQKRNNQDIWKGLYDFPLIESEKNLSISEIKDKIWTRFRIISDSIIYQVIHKLTHQT 301 Query: 299 LDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 L I + + + N + + P+ L+ Sbjct: 302 LSIQFLNCRTIQNVQKEKYFDNFFFIPYEKIVEYPFPRPITLFLKH 347 >UniRef50_B3DVQ4 A/G-specific DNA glycosylase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVQ4_METI4 Length = 355 Score = 294 bits (754), Expect = 2e-78, Method: Composition-based stats. Identities = 107/344 (31%), Positives = 161/344 (46%), Gaps = 19/344 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +F + WY + G K PW+ TPY V +SE MLQQTQ TV+PYF +M +FP Sbjct: 19 KKKFWKFLSRWYGENGFK-YPWRKSPTPYSVVVSEFMLQQTQAETVVPYFLAWMEKFPDW 77 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A EVL W GLGYY+RARNLH+ A + G+ P EE+ PG+G TA Sbjct: 78 ESLAKAQEKEVLRAWEGLGYYSRARNLHRIAVMLYHERKGELPSDPEELVKFPGIGPYTA 137 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFN 181 A+ SL+ G+ P LDGNV RV+AR ++ KKE ++ + + + FN Sbjct: 138 NAVASLAFGRKIPALDGNVIRVMARLMNINQPIHKKETIRTIFQVVDSLMQGEAEASLFN 197 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 A+MD G +C P+CS C L+ C A L P + + ++ ++ + + Sbjct: 198 SALMDFGRAVCKPKYPRCSACVLKEMCKAKQ---PELLPLRTKIEIEEKKEKIAIIRE-D 253 Query: 242 DEVLLAQ-RPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 D L Q + GL+ FP F + + + L + RH+F+ + + Sbjct: 254 DRFWLQQANSNQNRYRGLWLFPYFDPD------------VMEEMAPLFSLRHSFTRYKIL 301 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + M W + S+ L +P ++ QL Sbjct: 302 LEVCEASWNKERLKMMPMEGQWVKKKEIFSLPLPSPHRKIWLQL 345 >UniRef50_C6XF00 A/G-specific adenine glycosylase n=1 Tax=Candidatus Liberibacter asiaticus str. psy62 RepID=C6XF00_LIBAP Length = 356 Score = 294 bits (754), Expect = 2e-78, Method: Composition-based stats. Identities = 104/353 (29%), Positives = 171/353 (48%), Gaps = 14/353 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDK--------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 +++LDWYD R LPW+ +PYKVW+SE+MLQQT V TV PYF++FM Sbjct: 8 IQSKILDWYDTNHR-VLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQ 66 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 ++PT+ L++A +E+L W GLGYY RARNL K A + + G FP E + LPG+ Sbjct: 67 KWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGI 126 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G TA AI++++ ++D N++R+++R + + + + + ++T Sbjct: 127 GDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA--PLYHKTIKNYARKITSTSRP 184 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 F QAMMDLGA+ICT +KP C LCP+Q C+ + L K+ P RTG + Sbjct: 185 GDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGAVFI 244 Query: 238 -LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + +++ +LL +R + L G+ P A + + L Sbjct: 245 AITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFT 304 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 + +W + + ++ W++ + L +++ L P Sbjct: 305 HFTLTLFVWKTI--VPQIVIIPDSTWHDAQNLANAALPTVMKKALSAGGIKVP 355 >UniRef50_UPI0001926FFC PREDICTED: similar to mutY homolog, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926FFC Length = 424 Score = 293 bits (751), Expect = 6e-78, Method: Composition-based stats. Identities = 94/296 (31%), Positives = 147/296 (49%), Gaps = 31/296 (10%) Query: 33 VWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKA 92 VW+SE+MLQQTQVATVI Y+ +M ++P++ LA+A L+EV W+GLGYY+R R LH+A Sbjct: 88 VWVSEIMLQQTQVATVINYYNNWMKKWPSIASLASATLEEVNEAWSGLGYYSRGRRLHEA 147 Query: 93 AQQVATLHGGKFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAV 151 A+++ G P++ + + LPG+G TA AI S++ + ++DGNV RVL R + Sbjct: 148 AKKLIES-GQPMPKSSKSLVKELPGIGPYTASAIASIAFNEVCGVVDGNVIRVLTRIRMI 206 Query: 152 SGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAA 211 K V + +W L+ + FNQ MM+LGA +CT P+CS CPL C++ Sbjct: 207 GADSSSKAVNDFIWELANTIVDEERPGDFNQGMMELGATVCTPKSPQCSQCPLSTLCMSY 266 Query: 212 ANNS-----------------------------WALYPGKKPKQTLPERTGYFLLLQHED 242 +P K ++ ++Q+ + Sbjct: 267 KKGIFLFVIPLVSWINQICIRQQYLVEEGTGLGVMNFPRKNKNKSPLAEEYDVAVIQNNN 326 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + LL QRP GL GL+ FP F E+ + + Q+ ++ + + F Sbjct: 327 KFLLVQRPNKGLLAGLWEFPLFLRTENSKNEILSLQVEKRFSIKIEKIKEINTIFK 382 >UniRef50_B0D6Q9 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0D6Q9_LACBS Length = 562 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. Identities = 114/402 (28%), Positives = 171/402 (42%), Gaps = 95/402 (23%) Query: 6 FSAQVLDWYDK-YGRKTLPWQIDKTP-----------YKVWLSEVMLQQTQVATVIPYFE 53 +L WY + + +PW+ P Y+VW+SE+MLQQTQVATVIPY+ Sbjct: 88 MRVALLRWYRTVHDTRRMPWRKPYNPSCGAEERAQRAYEVWVSEIMLQQTQVATVIPYYT 147 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 R+M ++PT+ LA+A +D+V LW GLGYY+RA L AQ+ +GG+ P+ +E+ A Sbjct: 148 RWMEKYPTIRHLASANVDQVNALWKGLGYYSRASRLLAGAQKAIQKYGGRLPDNAKEMEA 207 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL----- 167 +PG+GR +AGAI S++ G+ P+LDGNV R+L+R A+ P K + LW Sbjct: 208 NIPGIGRYSAGAICSIAYGEKVPVLDGNVHRLLSRVLALHAPPKAKSTLDILWDAATVMV 267 Query: 168 ----SEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL----- 218 ++ +P NQA+++LG+ +C P+C CP+QN C A N+ Sbjct: 268 QIEEADTTSPPQYAGDINQALIELGSTVCKVRDPECGTCPIQNWCSAYQNSEPKESAKED 327 Query: 219 ------------------------YPGKKPKQTLPERTGYFLL-----LQHEDE--VLLA 247 YP K ++ E + +Q+ DE L Sbjct: 328 NTHMVDIEDICGLCEPLTGAGVMSYPMKTERKKAREELDIVNVVEWRPIQNSDERWFLFR 387 Query: 248 QRPPSGLWGGLYCFPQFA-------------------------------------DEESL 270 +RP +GL GLY FP E L Sbjct: 388 RRPSTGLLAGLYEFPTSPAVSKPISHDALKHIPYNLISQVVQDARVFPLSTHAQKSGEDL 447 Query: 271 RQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFT 312 + + H FSH W+ + + Sbjct: 448 EDDCEDLHLRISKIKVAGDVVHLFSHVKKTYRAQWVLLEGGS 489 >UniRef50_D2VUC5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VUC5_NAEGR Length = 616 Score = 292 bits (747), Expect = 2e-77, Method: Composition-based stats. Identities = 112/470 (23%), Positives = 179/470 (38%), Gaps = 127/470 (27%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID------------------KTPYKVWLSEVMLQQT 43 + +++WYD +K LPW+ + Y+V +SE+MLQQT Sbjct: 36 ERGHIWRNLINWYD-NNKKDLPWRKTIKSADDEDDDVKATEELHQRAYEVLVSEIMLQQT 94 Query: 44 QVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGG- 102 QV TV YF ++M RFPTV DLA+A +D+ + +W+GLGYY RAR L+ AA+ + Sbjct: 95 QVETVKTYFLKWMKRFPTVKDLADATVDDAVGIWSGLGYYRRARYLYNAAKYIVEKINKP 154 Query: 103 ----------------------KFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGN 140 K P+ +++ LPGVGR TAGAI S++ + ++DGN Sbjct: 155 IKNEEIISDDEDDDEEKKRIIYKLPDNVKDLMQLPGVGRYTAGAIASIAFSRTASVVDGN 214 Query: 141 VKRVLARCYAVSGWPGKKEVENK-LWSLSEQVTPAVG--------------------VER 179 V RV AR + + N W +++ + Sbjct: 215 VFRVFARLKRIEEDIAVNKTANTVFWPMADNLIQHYRDIEDTVLEKEDLELKHRYNVTGD 274 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN------NSWALYPGKKPKQTLPERTG 233 FNQA+M+LG +C PKC+ CPL C A + +YP K K + Sbjct: 275 FNQAVMELGRTVCIPRNPKCTECPLAEVCEANKALEKEEIATVEIYPVKNKKTKKRTQVV 334 Query: 234 YFLLLQHEDEV---LLAQRPPSGLWGGLYCFPQFADEES--------------------- 269 + E +V L +RP L GG + FP +++ Sbjct: 335 SCCIFYKEGDVDSCLFEKRPKETLLGGTWHFPFVIIQDTKAEEEENEEDSEDEEGSKKKK 394 Query: 270 --------------------------LRQWLAQRQIAADNLTQLT---AFRHTFSHFHLD 300 L+++L + D + H F+H + Sbjct: 395 PKKQAKKTKATTKAAASSSESEHFIKLKEFLVKEGFVQDKSIPFEYHASVDHVFTHINQT 454 Query: 301 IVPM-----WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 +L +E W+ +Q S G++ ++ +Q + Sbjct: 455 NHIYSCSDKYLIKDLEDSNSEEDQWKWFANSQRKSQGISTVASKIFKQFK 504 >UniRef50_C6HWU0 Putative A/G-specific DNA glycosylase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HWU0_9BACT Length = 359 Score = 291 bits (746), Expect = 2e-77, Method: Composition-based stats. Identities = 133/343 (38%), Positives = 185/343 (53%), Gaps = 19/343 (5%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 A +L WYD G + LPW+ PY++W+SE+MLQQT VATV+ +++RF++RFP V LA Sbjct: 2 APLLAWYDA-GHRDLPWRQGCDPYRIWVSEIMLQQTTVATVLKFYDRFLSRFPDVGALAG 60 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A L EVL W+GLGYY RARNLH+AA+ VA G FPE+ A LPG+GRSTAGAI S Sbjct: 61 ADLTEVLRFWSGLGYYQRARNLHRAARIVAER--GSFPESLAGWAELPGIGRSTAGAIFS 118 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV-TPAVGVERFNQAMMD 186 +S PILD NV+RV+ R +AV +K+ E +LW LS+ NQA+M+ Sbjct: 119 ISRNLWAPILDANVRRVVERFFAVG--KEEKKREARLWELSDSFGRENPRPGDTNQALME 176 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALY---PGKKPKQTLPERTGYFLLLQHEDE 243 LGA +C + P+CS+CPL++ C + + ++ ++ P R +LL E Sbjct: 177 LGATVCLPASPRCSICPLRSSCRSRDPEPERVILLPERRRERRPAPPRRDRLVLLPAEGP 236 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD--NLTQLTAFRHTFSHFHLDI 301 +L R L GL+ A L A +L HT+SHF ++ Sbjct: 237 LLFVPRGEGRLLEGLFDLYGIAASSIPPGALLDEGPYAGFRVAEELFTVTHTYSHFR-EV 295 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 V + V TG EG A +G+ P+ R L+ L Sbjct: 296 VHVVGLVGRGTGSAGEG-------AMAGDLGVVLPLSRALEGL 331 >UniRef50_Q10159 A/G-specific adenine DNA glycosylase n=1 Tax=Schizosaccharomyces pombe RepID=MYH1_SCHPO Length = 461 Score = 291 bits (745), Expect = 3e-77, Method: Composition-based stats. Identities = 113/406 (27%), Positives = 179/406 (44%), Gaps = 77/406 (18%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQI-------DKTP-----------YKVWLSEVMLQQ 42 ++ +F ++ +YDK R LPW+ + +P Y+V +SE+MLQQ Sbjct: 16 LEVERFRESLIQFYDKTKR-ILPWRKKECIPPSEDSPLEDWEQPVQRLYEVLVSEIMLQQ 74 Query: 43 TQVATVIPYFERFMARFPTVTDLANAPLD-EVLHLWTGLGYYARARNLHKAAQQVATLHG 101 T+V TV Y+ ++M PT+ A A + +V+ LW+G+G+Y R + LH+A Q +A LH Sbjct: 75 TRVETVKRYYTKWMETLPTLKSCAEAEYNTQVMPLWSGMGFYTRCKRLHQACQHLAKLHP 134 Query: 102 GKFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEV 160 + P T +E A +PGVG TAGA+LS++ + I+DGNV RVL+R A+ K + Sbjct: 135 SEIPRTGDEWAKGIPGVGPYTAGAVLSIAWKQPTGIVDGNVIRVLSRALAIHSDCSKGKA 194 Query: 161 ENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS----- 215 +W L+ ++ V FNQA+M+LGA+ CT P+CS+CP+ C A + Sbjct: 195 NALIWKLANELVDPVRPGDFNQALMELGAITCTPQSPRCSVCPISEICKAYQEQNVIRDG 254 Query: 216 -----------------------------WALYPGKKPKQTLPERTGYFLLLQHEDE--- 243 A YP K E ++ Q D Sbjct: 255 NTIKYDIEDVPCNICITDIPSKEDLQNWVVARYPVHPAKTKQREERALVVIFQKTDPSTK 314 Query: 244 ---VLLAQRPPSGLWGGLYCFPQF-------------ADEESLRQWLAQRQIA-ADNLTQ 286 L+ +RP +GL GL+ FP ++S+ QW++ + Sbjct: 315 EKFFLIRKRPSAGLLAGLWDFPTIEFGQESWPKDMDAEFQKSIAQWISNDSRSLIKKYQS 374 Query: 287 LTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVG 332 + H FSH + S + + W + + VG Sbjct: 375 RGRYLHIFSHIRKTSHVFYAIAS--PDIVTNEDFFWISQSDLEHVG 418 >UniRef50_A6FJZ0 A/G-specific adenine glycosylase n=7 Tax=Alphaproteobacteria RepID=A6FJZ0_9RHOB Length = 352 Score = 288 bits (738), Expect = 2e-76, Method: Composition-based stats. Identities = 130/345 (37%), Positives = 172/345 (49%), Gaps = 18/345 (5%) Query: 8 AQVLDWYDKYGRKTLPWQ---------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 A +L WYD + R LPW+ PY+VWLSEVMLQQT VA V YFERF A Sbjct: 8 ADLLAWYDAHAR-DLPWRVAPAARLAGTRPDPYRVWLSEVMLQQTTVAAVRAYFERFTAL 66 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 +PTV LA AP +V+ W GLGYYARARNL K A+ VA HGG+FPET E + ALPG+G Sbjct: 67 WPTVEALAAAPDAQVMGEWAGLGYYARARNLLKCARVVAEDHGGRFPETREALMALPGIG 126 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T AI +++ ++DGNV+RV+AR + + + E L + +TP Sbjct: 127 PYTGAAIAAIAFDAPEVVVDGNVERVMARVHDIRTPLPAAKPE--LVRAAAALTPKRRPG 184 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTG-YFLL 237 QA+MDLGA +CT P C +CP C A A P K PK+ P R G ++ Sbjct: 185 CHAQAVMDLGATVCTPRAPACGICPWWGACAARKAGVQAELPHKTPKKPKPLRFGIAYVA 244 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + + LL RP GL GG+ +P ES + I AD RHTF+HF Sbjct: 245 RRADGAWLLETRPDKGLLGGMLGWPGSDWTESAP--IPAPPIEADWTELGEEARHTFTHF 302 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 HL + + V+ + L + ++ Sbjct: 303 HLRLA---IKVAHVPQNAAPLRGAFTPATAFRPSDLPTVMRKVFD 344 >UniRef50_C1EFR0 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EFR0_9CHLO Length = 574 Score = 288 bits (737), Expect = 2e-76, Method: Composition-based stats. Identities = 116/432 (26%), Positives = 165/432 (38%), Gaps = 90/432 (20%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI----------------------------------- 26 A + ++L WYDK R LPW+ Sbjct: 117 DAPRLRDELLAWYDKSHR-ILPWRRNFHSHHAPPPPGPQGGDGCEPKEYWAGCDETGAPT 175 Query: 27 ----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGY 82 D+ Y VW+SE+M QQTQ+ V Y+ R+ R+P V+ LA+A +EV LW GLGY Sbjct: 176 NVPRDQYAYGVWVSEIMSQQTQIERVAQYWLRWTERWPDVSSLASATREEVNELWAGLGY 235 Query: 83 YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVK 142 Y RA L A+ V FP + +A +PGVG TA AI S++ + +DGNV Sbjct: 236 YRRAGFLLDGARHVVDNCDTIFPNDAKGLANVPGVGPYTAAAIASIAFDEPVAAVDGNVI 295 Query: 143 RVLARCYAVSG-----WPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKP 197 RV R AV+G P + + + FNQAMM+LGA +CT P Sbjct: 296 RVCTRLAAVTGGGDAAKPSSDAAKAVRLCADWLIDDSTRPGDFNQAMMELGATVCTPKAP 355 Query: 198 KCSLCPLQNGCIAAANNS-------WALYPGKKPKQTLPERTGYFLLLQHED-------- 242 C CPL+ GC AA S P K+ K E +++ + Sbjct: 356 ACGQCPLRVGCAGAALESAGSSGFKVTDLPEKEKKPEKREERVAVRVVERKGGRRDGVPG 415 Query: 243 ----EVLLAQRPPSGLWGGLYCFPQFADEESLR--------------QWLAQRQIAADNL 284 LL +RP GL GGL+ FP ++L + + Sbjct: 416 PADSSFLLVRRPEGGLLGGLWEFPSANVPDNLEPGDPDFYASCADVVHSCGSSGKFSKSR 475 Query: 285 TQLTAFRHTFSHFHLDIVPMWLPVS------------SFTGCMDEGNALWYNLAQPPSVG 332 L HTFSH V + W + ++G Sbjct: 476 VGLGEVTHTFSHVRHTYVAQHEVWETSADDDGDEAPPTRGDTAGGREWRWVTATELQTLG 535 Query: 333 LAAPVERLLQQL 344 L++ V ++ + L Sbjct: 536 LSSGVRKIYELL 547 >UniRef50_B7P4H9 A/G-specific adenine glycosylase muty, putative n=1 Tax=Ixodes scapularis RepID=B7P4H9_IXOSC Length = 345 Score = 288 bits (737), Expect = 3e-76, Method: Composition-based stats. Identities = 114/355 (32%), Positives = 169/355 (47%), Gaps = 46/355 (12%) Query: 14 YDKYGRKTLPWQI----DKTP----YKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +D + LPW+ D P Y VW+SE+MLQQT+V TVI Y++R+M ++PTV DL Sbjct: 2 FDFGDARQLPWRDIAKADGDPNQKAYAVWVSEIMLQQTRVTTVIEYYKRWMKKWPTVVDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV-AALPGVGRSTAGA 124 A A ++EVL +W GLGYY RAR LHK AQ+V GG FP T + + +PGVG TA A Sbjct: 62 ARASIEEVLQVWAGLGYYQRARRLHKGAQKVVRDLGGLFPNTPKHLAREIPGVGCYTAAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S++ G +DGNV RV +R + G E LW+ + + V FNQA+ Sbjct: 122 VASIAFGHRAGAVDGNVARVYSRMRLLGATLGSSPSERALWAAANEAVCPVRPGDFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA +CT + ++ LYP K K+ T +L+ D Sbjct: 182 MELGARVCT----------------LMSWDATQLYPHKAAKKEPRRETHAVSVLRSGDRY 225 Query: 245 LLAQRPPSG-LWGGLYCFP--QFADEESLRQWLAQRQIAADNL--------------TQL 287 LL ++ G L GL+ FP Q + S ++ + + + +L Sbjct: 226 LLLRKTGQGRLLEGLWEFPNRQVDPQGSAKKSQDEERRVVLQIGAQLLNSAGGKKRARRL 285 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 H FSH H + E W + A+ S G++ ++++L Sbjct: 286 GEVTHLFSHIHATYCVYETTLDE----ECETEGSWMSAAEVESSGVSTAMKKVLS 336 >UniRef50_Q9ZMT8 A/G-SPECIFIC ADENINE GLYCOSYLASE n=21 Tax=Helicobacter RepID=Q9ZMT8_HELPJ Length = 328 Score = 287 bits (735), Expect = 4e-76, Method: Composition-based stats. Identities = 105/348 (30%), Positives = 172/348 (49%), Gaps = 30/348 (8%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK---TPYKVWLSEVMLQQTQVATVI-PYFERFMARF 59 +L WY+++GRK LP++ K PY+V++SEVM QQTQ++TVI ++ F+ F Sbjct: 2 ETLHNALLKWYEEFGRKDLPFRNLKGINAPYEVYISEVMSQQTQISTVIERFYPPFLKAF 61 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT+ DLANAPL+EVL LW GLGYY+RA+NL K+A+ H + P ++ + LPG+G Sbjct: 62 PTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLKLPGIGA 121 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA AIL + +D NVKRVL R + + K L + Sbjct: 122 YTANAILCFGFREKSACVDANVKRVLLRLFGLDPNIHAK----DLQIKANDFLNLNESFN 177 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 NQA++DLGA+IC+ KPKC++CP C+ + K KQ + + Y ++ Sbjct: 178 HNQALIDLGALICSP-KPKCAICPFNPYCLGKNHLERHTL---KKKQEIIQEERYLGVVI 233 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +++ L + L+ G++ FP + + L L +H+ + F L Sbjct: 234 QNNQIALEK-IEQKLYLGMHHFPNLKENLEFK------------LPFLGTIKHSHTKFKL 280 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 ++ + +Y+L ++ +++ ++L L+ Sbjct: 281 NLNLYLATTKDL-----KNPIRFYSLKDLETLPISSMTLKILNFLKQK 323 >UniRef50_A1W1M2 A/G-specific adenine glycosylase n=13 Tax=Campylobacter jejuni RepID=A1W1M2_CAMJJ Length = 339 Score = 286 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 109/357 (30%), Positives = 180/357 (50%), Gaps = 36/357 (10%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT------------PYKVWLSEVMLQQTQVATVI 49 + + +L WY+K GRKTLPW+ ++ Y V++SE+MLQQTQV +V+ Sbjct: 5 ELEKLQENLLLWYEKNGRKTLPWRNLQSQNCDESLKHIDRAYGVYISEIMLQQTQVKSVL 64 Query: 50 -PYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 ++ F+ +FPT+ LANA DE+L W GLGYY RARNL KAA + K P+ Sbjct: 65 ERFYFPFLQKFPTLESLANANEDELLKAWQGLGYYTRARNLKKAALECVDKFEAKLPKEV 124 Query: 109 EEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS 168 E++ L G+G TAGAI + +DGN++RVL+R +A+ P KE+E + + Sbjct: 125 EDLKKLSGIGAYTAGAIACFGYDQKVSFVDGNIRRVLSRLFALE-NPSMKELEKR----A 179 Query: 169 EQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 +++ NQA++D+GA++C KC +CPL + C LYP + K+ L Sbjct: 180 KELLNLNHAFDHNQALLDIGALVCVSKNAKCGICPLYDFCQ--GKFHTELYP--RAKKIL 235 Query: 229 PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLT 288 E L L ++ + L+ G+Y FP F + E ++ + Sbjct: 236 YESVNLNLFLFEFNKKFAIGQSQDKLYKGMYNFPFFKEGEYKLS---------KDMGFVG 286 Query: 289 AFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 F+H+++ + L+I +++ + N + L + S L+A + L+ ++ Sbjct: 287 EFKHSYTKYKLNIKVYHQILNN-----ENKNYEFKTLKELESTALSALSLKALKLIK 338 >UniRef50_Q5SH32 A/G-specific adenine glycosylase (MutY) n=2 Tax=Thermus thermophilus RepID=Q5SH32_THET8 Length = 325 Score = 286 bits (731), Expect = 1e-75, Method: Composition-based stats. Identities = 113/341 (33%), Positives = 171/341 (50%), Gaps = 34/341 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WY + R LPW+ +K PY+V +SEV+LQQT+V +PY+ RF+ RFPT+ Sbjct: 2 EAWRKALLAWYRENAR-PLPWRGEKDPYRVLVSEVLLQQTRVEQALPYYRRFLERFPTLK 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A L+EVL +W G GYY RA +LH+ A+ V P +F E+ LPG+G TA Sbjct: 61 ALAAASLEEVLRVWQGAGYYRRAEHLHRLARSVEE-----LPPSFAELRGLPGLGPYTAA 115 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQ 182 A+ S++ G+ +DGNV+RVL+R +A E +L++L++ + P V +NQ Sbjct: 116 AVASIAFGERVAAVDGNVRRVLSRLFARESPK-----EKELFALAQGLLPEGVDPGVWNQ 170 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C +P+C CPL G + YP + ++ ER +LL + Sbjct: 171 ALMELGATVCLPKRPRCGACPL--GAFCRGKEAPGRYPAPRKRRAKEERLVALVLLGRKG 228 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 V L + G + GLY P F EE + L RH +H L + Sbjct: 229 -VHLER--LEGRFQGLYGVPLFPPEE----LPGREAAFGVRSRPLGEVRHALTHRRLRV- 280 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 G + EG P L +E++L++ Sbjct: 281 -------EVRGALWEGEG-----EDPWKRPLPKLMEKVLRK 309 >UniRef50_C5ZWA7 A/G-specific adenine glycosylase n=3 Tax=Helicobacter RepID=C5ZWA7_9HELI Length = 332 Score = 285 bits (729), Expect = 2e-75, Method: Composition-based stats. Identities = 101/352 (28%), Positives = 168/352 (47%), Gaps = 26/352 (7%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKT---PYKVWLSEVMLQQTQVATVI-PYFERFM 56 M + +L WY+K GRK+LPW+ + Y+V +SE+MLQQTQV T++ ++ F+ Sbjct: 1 MNLPLLHSSILLWYEKKGRKSLPWRDKTSKNRAYRVLVSEIMLQQTQVKTILERFYFPFL 60 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 +FPT+ L+ A +EVL W GLGYY RARNL K A+ H G P+ E + LPG Sbjct: 61 EKFPTLETLSKAKEEEVLLQWRGLGYYTRARNLLKCAKICCESHKGILPKDIESLQKLPG 120 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +GR TAGAI + +D N+KR+L R +A+ L S ++ + Sbjct: 121 IGRYTAGAIACFGYDRAVSFVDSNIKRILTRFFALQSPSP-----KLLESKAKTILNTQE 175 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 NQA++D+GA +CT PKC+ CPLQ C N ++ K L ++T Sbjct: 176 PFNHNQALLDIGATLCTPKNPKCTQCPLQPFCQGKENPTFY---TPTKKTALLKQTLEIG 232 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + + L + + L+ LY FP + +S +L +H ++ Sbjct: 233 IFIQDSKFALTK-SQTKLYYNLYNFPLISPPKS------------HQTNKLGILKHAYTK 279 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 ++L + L + ++ + ++ +++ ++L L+ Sbjct: 280 YNLTLHLYSLQSQELLQQTQQ-KIEFFTQEELKNLPISSMTLKILDFLKKST 330 >UniRef50_UPI0000E49729 PREDICTED: hypothetical protein, partial n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49729 Length = 425 Score = 284 bits (726), Expect = 5e-75, Method: Composition-based stats. Identities = 108/396 (27%), Positives = 162/396 (40%), Gaps = 74/396 (18%) Query: 27 DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARA 86 + Y VW+SE+M QQTQVATVI Y+ ++M ++PT+ L+ A L+EV +W GLGYY+R Sbjct: 12 NHKAYAVWVSEIMCQQTQVATVIDYYNKWMKKWPTLESLSKASLEEVREVWAGLGYYSRG 71 Query: 87 RNLHKAAQQVATLHGGKFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVL 145 + L + A +V G+ P T E++ LPGVGR TAGAI S+S + ++DGNV RVL Sbjct: 72 QRLFEGACKVQNELDGQIPGTAEQLRKELPGVGRYTAGAIASISFSEATGVVDGNVIRVL 131 Query: 146 ARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQ 205 +R + + V +W L+ + FNQ+MM+LGA +C P+C CP+Q Sbjct: 132 SRLRMIGADFTTQNVMTAIWDLANAIVDPDRPGDFNQSMMELGATVCHPKSPQCPSCPVQ 191 Query: 206 NGCIA--AANNSWALYPGKKPK---QTLPERTGYFLL----------------------- 237 + C A + K K Q+LPE + Sbjct: 192 SHCRAIQQMDQLTRDLATKLTKSNGQSLPEEKDIVDIECAADGCSLCMDGPVDANLGVMN 251 Query: 238 -----------------------LQHEDEVLLAQRPPSGLWGGLYCFPQFADEES----- 269 E+E L+ QRP +GL G++ FP E Sbjct: 252 YPRKPKKKPLKQQVIAVCIVERETNDEEEYLIVQRPDTGLLAGMWEFPSIEIAEETSRQK 311 Query: 270 --------LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPV---------SSFT 312 L+ L + + F H FSH H + V Sbjct: 312 SRNKMDSYLKDTLNMTLKNVKDRKHIIQFVHMFSHIHQTYELETMKVEEEEEEDVSDKSQ 371 Query: 313 GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 D + W + + ++A + + Sbjct: 372 ESEDIPHHQWVSRSAFEGQAVSAGMRKAFDAYNGKT 407 >UniRef50_Q7VGS8 A/G-specific adenine glycosylase n=537 Tax=Helicobacter RepID=Q7VGS8_HELHP Length = 350 Score = 282 bits (723), Expect = 1e-74, Method: Composition-based stats. Identities = 110/350 (31%), Positives = 172/350 (49%), Gaps = 30/350 (8%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQ---IDKTPYKVWLSEVMLQQTQVATVIP-YFERFMAR 58 A + +L+WY K+GR LPW+ PY V++SE+MLQQTQV V+ Y+ FM Sbjct: 14 AQTWRENILNWYKKHGRVYLPWRNLSEKNAPYGVYVSEIMLQQTQVKRVLEHYYMPFMTA 73 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+T LANA + +L LW GLGYY R RN+ K+A+ P T+ E+ +LPG+G Sbjct: 74 FPTLTSLANAQEESLLKLWEGLGYYTRVRNMQKSARICCQRFNNTLPHTYAELISLPGIG 133 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 +AGAIL L ++ +DGN++RV R +A+S + L L+ + Sbjct: 134 AYSAGAILCFGLRQNVAFVDGNIRRVFCRIFALSSP-----TQKSLEELAWILLEPKYSF 188 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL- 237 +NQA++D+GAMICT P C +CPLQ C + LYP K Q P L Sbjct: 189 DYNQALLDIGAMICTPKSPSCLICPLQQLCE--GQITPELYPTPKKSQLQPLHLHLALCI 246 Query: 238 ---LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 + ++ E+ L R L+ GLY P+ ++EE A T+ ++H + Sbjct: 247 SQNINNKTEIGLL-RSTQNLYRGLYNLPRLSEEE-----------AKTYGTKCGEYKHHY 294 Query: 295 SHFHLDIVPMWLPVSSF---TGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + + + + + + ++ L + L++ ++ L Sbjct: 295 TKYAITATIYHISFEDLIHISTYQKKEEITFFTLQDLQNKPLSSLCKKAL 344 >UniRef50_Q6IEF2 A/G-specific adenine glycosylase, putative n=3 Tax=Trypanosoma RepID=Q6IEF2_9TRYP Length = 510 Score = 281 bits (720), Expect = 2e-74, Method: Composition-based stats. Identities = 101/358 (28%), Positives = 158/358 (44%), Gaps = 52/358 (14%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID-----------------KTPYKVWLSEVMLQQTQVA 46 + ++W+ R+ LPW+ PY VW+SEVM QQTQ+ Sbjct: 65 ADIRQATIEWFHTNQRRDLPWRQTFSDTDACHPVVPVVTPLTDPYHVWVSEVMSQQTQMD 124 Query: 47 TVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPE 106 TVIPYF+R++A FP + LA A V+ +W+G+GYY RA L K A+ V GG P Sbjct: 125 TVIPYFKRWIAVFPDIATLARATEASVMAVWSGMGYYRRALYLKKGAEHVMKHFGGSLPT 184 Query: 107 TFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLW 165 T ++ A+PG+G T+ AI S+ + +DGNV RVL+R + P + +++ Sbjct: 185 TAAQLRAIPGIGLYTSAAIASICFRERIISVDGNVVRVLSRLRCERNFDPKSAKSIKEVF 244 Query: 166 SLSEQVT---PAVGVERFNQAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYP- 220 +++ P FNQ +M++GA +C S +P C CPL C A A P Sbjct: 245 HWGQEIMGEGPCDRPGDFNQGLMEIGARVCKPSGRPLCEECPLHRYCGAYAAMQSGEIPS 304 Query: 221 ----GKKPKQTLPERTGYFLLLQHE------------DEVLLAQRPPSGLWGGLYCFPQ- 263 +TL ++ + L HE ++ +RP GL GG+ FP Sbjct: 305 IEGVIPLRAKTLKKKKEHVFCLVHEFREKTAGDSALKRRFIVVRRPEEGLLGGMLEFPSR 364 Query: 264 -----------FADEESLRQWLAQRQIAA-DNLTQLTAFRHTFSHFHLDIVPMWLPVS 309 + L ++ A N+ ++ +H FSH + ++ Sbjct: 365 TYVSSPDGGTDIKGSRDVTDELRKKLAAGHKNVVEVGHVQHIFSHIDMRVLVYHAVWD 422 >UniRef50_Q1V0L8 A/G-specific adenine glycosylase n=3 Tax=Candidatus Pelagibacter RepID=Q1V0L8_PELUB Length = 326 Score = 281 bits (719), Expect = 3e-74, Method: Composition-based stats. Identities = 96/349 (27%), Positives = 164/349 (46%), Gaps = 29/349 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFM 56 M S + ++L WYD R +LPW+ + K Y +SE MLQQTQVATVIPYF F+ Sbjct: 1 MTKSILTKKILYWYDNNKR-SLPWRKELSQVKREYFTLVSEFMLQQTQVATVIPYFNNFI 59 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 P + L+ ++L W GLGYY+R +NL K AQ + + P T +E+ LPG Sbjct: 60 KDIPDIKSLSKIKEHKLLKYWEGLGYYSRVKNLKKTAQILEKNFNRRLPNTIDELKLLPG 119 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T+ AI++++ K F LDGN++RV+ R + K+ ++ L + + + Sbjct: 120 IGDYTSNAIMAIAFNKPFIPLDGNIERVIKRLLNLKLP--KEITKDNLVKNKKILGNSTR 177 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 + QA+M+LGA+IC P C CPL C + + + K ++ ++ Sbjct: 178 ASDYAQALMELGALICRPKNPLCHQCPLIKNCKSFKKKDFEII---KERKKKIDKFYLLE 234 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + ++ LL + L FP + DNL + F+ S+ Sbjct: 235 VYKKNNKFLLIKNTKFNFLKNLQIFPMKQISKP------------DNLAKALNFK--ISN 280 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 ++++ + S G + N+ W + ++ L +++ L+ Sbjct: 281 MNMNV---IIKFSKLKGEI--SNSSWIDSSKLEDYTLPTFTKKIFNYLK 324 >UniRef50_Q5KQ30 A/G-specific adenine DNA glycosylase, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KQ30_CRYNE Length = 568 Score = 281 bits (718), Expect = 3e-74, Method: Composition-based stats. Identities = 108/414 (26%), Positives = 180/414 (43%), Gaps = 86/414 (20%) Query: 8 AQVLDWYD-KYGRKTLPWQIDKTP-----------YKVWLSEVMLQQTQVATVIPYFERF 55 +L+W++ ++ +PW+ P Y++W+SEVMLQQTQV TVI Y++R+ Sbjct: 94 ESLLEWFECVREKRGMPWRKKYDPSLSFEEKGQRAYEIWVSEVMLQQTQVTTVIAYWQRW 153 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVA--TLHGGKFPETFEEVAA 113 M R+PT++DLA A ++ GLGYY RAR+L A+ V + + G+ P+ + Sbjct: 154 MERWPTISDLAKADVE-----VRGLGYYRRARSLLAGAKTVMGNSKYNGRLPDDPAVLEK 208 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV- 171 + GVGR TAGAI S++ G PI+DGN+ R+L R AV LW +++++ Sbjct: 209 EIDGVGRYTAGAICSMAYGVRTPIVDGNIHRLLTRLLAVHAPQTGPATIKFLWRVADELI 268 Query: 172 --TPAVGVE-----RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN----------- 213 P+ +NQA+M+LG+ +C + P+C +CPL+ C A Sbjct: 269 KHLPSGDKHNNVVGDWNQALMELGSQVCKPANPECGVCPLKKACKGYAELSNSPSQPSTT 328 Query: 214 -------------------NSWALYPGKKPKQTLPERTGYFLLLQHED-----EVLLAQR 249 + ++P KK K+ ++Q L +R Sbjct: 329 KSDCKLCAPIPCDIETDRIPTVMVFPMKKEKKASRVEEETVCIVQWRGDGDQRRWLFTKR 388 Query: 250 PPSGLWGGLYCFPQFADE-------------ESLRQWLAQRQIAADNLTQ----LTAFRH 292 P GL GL+ P E+L ++ + A+ L + + H Sbjct: 389 PEKGLLAGLFEPPTTPVSAGLSSSERLGASLEALSDYIEITEGEAEGLQKSSRDVGNIPH 448 Query: 293 TFSHFHLDIVPMWLPVSSFTG------CMDEGNALWYNLAQPPSVGLAAPVERL 340 FSH ++ L ++S A+W N + + V+++ Sbjct: 449 IFSHINMTYHIHLLTLTSPGNEPPSVKPKAPRPAVWLNEEEVEKANVGTGVKKV 502 >UniRef50_B7A8Y1 HhH-GPD family protein n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A8Y1_THEAQ Length = 333 Score = 281 bits (718), Expect = 4e-74, Method: Composition-based stats. Identities = 116/341 (34%), Positives = 170/341 (49%), Gaps = 34/341 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WY K+ R LPW+ +K PY++ +SEV+LQQTQV IPY+ RF+ RFPT+ Sbjct: 2 KAWQEALLAWYRKHAR-PLPWRGEKDPYRILVSEVLLQQTQVRQAIPYYRRFLQRFPTLK 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L APL+EVL +W G GYY RA +LH+ AQQV P++F ++ LPG+G TA Sbjct: 61 ALGEAPLEEVLRVWQGAGYYRRAVHLHRLAQQVEA-----LPQSFAQLKGLPGLGPYTAA 115 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFNQ 182 A+ S++ G+ +DGNV+RVLAR +A+ G L L++ + P +NQ Sbjct: 116 AVASMAFGERVAAVDGNVRRVLARLFALEG-----ASPKALQGLAQSLMPEEAHPGEWNQ 170 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C KP C CPL + YP + ++ ER +LL + Sbjct: 171 ALMELGATVCLPRKPLCGACPLA--SRCRGKEAPERYPLPQRRKVREERLAALVLLGRKG 228 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L G + GLY P F EE + + L RH +H L Sbjct: 229 VYLERL---EGRFQGLYGVPLFPAEE----LPVRAEALGVAPRLLGEVRHALTHRRL--- 278 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + + + G EG W L +E++L+Q Sbjct: 279 LVEVHGAFWEG---EGEDPW-------KRPLPKLMEKVLRQ 309 >UniRef50_UPI0001746831 A/G-specific adenine glycosylase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746831 Length = 343 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. Identities = 113/347 (32%), Positives = 164/347 (47%), Gaps = 25/347 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP--YFERFMARF 59 ++ + + W+ GR PW+ PY + +SE+MLQQTQ++TV+ Y+ R+M RF Sbjct: 8 DPAKIAHALEAWFAVSGR-DYPWRRTTDPYAILVSEMMLQQTQISTVLDRGYYARWMERF 66 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA A EVL +W GLGYY RAR L + AQ+V + HGG FP T E V ALPGVG Sbjct: 67 PNTATLAVADEAEVLRVWEGLGYYRRARFLQQMAQRVESEHGGIFPATLEGVRALPGVGD 126 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG-VE 178 TAGA+ S + PI+DGNV RVL+R + + +E +LW+ S + A Sbjct: 127 YTAGAVCSFAHDAAAPIVDGNVARVLSRVWDDATPVDSREGMARLWTRSRALVEAAQSPR 186 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FN A+M+LG IC S P C CP+Q C+ A P K + + T Sbjct: 187 VFNSALMELGQTICRVSAPNCGSCPVQPHCLTR---DPASLPVKSKQTVITAVTERVYFQ 243 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 E+ VLL Q GL+ P+ L + ++T + + Sbjct: 244 IGEEGVLLQQ-ESGKRRTGLWKLPELPALPDPPAVLHRS-------------KYTITRYK 289 Query: 299 LDIVPMWLPV-SSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 +D+ W+ G W LA ++ + +P + L + Sbjct: 290 VDL---WVHAPPDKAGVAPPEGLQWIPLAGLEALPMPSPYRKALNAV 333 >UniRef50_B3T7B3 Putative HhH-GPD superfamily base excision DNA repair protein n=1 Tax=uncultured marine microorganism HF4000_APKG3D20 RepID=B3T7B3_9ZZZZ Length = 349 Score = 278 bits (711), Expect = 2e-73, Method: Composition-based stats. Identities = 109/343 (31%), Positives = 163/343 (47%), Gaps = 14/343 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F +L WYD + R+ LPW+ + YK +SE MLQQT+VATV+PYFER++ +FP Sbjct: 16 SAFGENLLAWYDDH-RRDLPWRKQPSLYKTVVSEFMLQQTRVATVLPYFERWLKQFPDFA 74 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A + VL W GLGYY+RARNL K AQQ+ L K P PGVG A Sbjct: 75 ALADASEEAVLKAWEGLGYYSRARNLRKLAQQIEALE--KIPADSAAWEGFPGVGPYVAA 132 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEV-ENKLWSLSEQVTPAVGVERFNQ 182 A+ S+S G ++DGNV RVLAR ++ + KL +++++ +NQ Sbjct: 133 AVTSISFGTKAAVVDGNVVRVLARMLSIDEQFRDNATAQRKLRPIAQELLHYNRPGDYNQ 192 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C R P CS CP C + YP + K+++ T L + + Sbjct: 193 AVMELGATVCHRRSPLCSDCPALYVCQSGQRGDPESYP-RTAKKSIERITVDRLWITCNN 251 Query: 243 EVLLAQRP-PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +LL + P S GL P+ D ++ + +A + S+ ++ Sbjct: 252 SLLLREAPADSKRLAGLMELPKLEDLPTVPKPGKTALLAIKKR--------SISNQSIEE 303 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + D + L+AP R ++ L Sbjct: 304 RIYRAELPHGFNLKDCPELRLTRQRDLGEITLSAPHRRWIESL 346 >UniRef50_B6K1Z7 A/G-specific adenine DNA glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1Z7_SCHJY Length = 470 Score = 277 bits (708), Expect = 5e-73, Method: Composition-based stats. Identities = 108/418 (25%), Positives = 167/418 (39%), Gaps = 90/418 (21%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID-------KTP-----------YKVWLSEVMLQQT 43 + F Q+LD+Y+ R LPW+ D P Y+V +SE+MLQQT Sbjct: 13 EVKSFRKQLLDFYENTKR-NLPWRKDPYKPPQKDDPDDRKKLAVQRFYEVLVSEIMLQQT 71 Query: 44 QVATVIPYFERFMARFPTVTDLANAPLD-EVLHLWTGLGYYARARNLHKAAQQVATLHGG 102 +V TV Y++++M PT+ A A D EV+ LW+G+G+Y R + LH A + +A L Sbjct: 72 RVETVKRYYDKWMKTLPTILHCAQADYDTEVMPLWSGMGFYGRCKRLHSACKYLAVLPAE 131 Query: 103 KFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE 161 + P + E +A +PGVG TAGA+LS++ G ++DGNV+RVL+R A+ K + Sbjct: 132 EIPTSPERLAKNVPGVGPYTAGAVLSIAWGIPTGVVDGNVQRVLSRLLALHCNVTKGKPN 191 Query: 162 NKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANN------- 214 +W ++ + FNQA+M+LGA+ CT C CP+ N C A Sbjct: 192 AFVWQMANLLVDPNFPGNFNQALMELGAVTCTPQTFNCPGCPVSNICKAYQEKIAIEQTD 251 Query: 215 ----------------------------------------SWALYPGKKPKQTLPERTGY 234 + YP + PK Sbjct: 252 NSVRDVEDIICQSHQSISKNTCEICSLCEPDPPYTSAEIWIQSRYPLRPPKVKQRIERAI 311 Query: 235 FLLLQHEDE------VLLAQRPPSGLWGGLYCFPQFADEE-------------SLRQWLA 275 ++ Q ++ L+ +RP GL GL+ P E +L W++ Sbjct: 312 VVVFQSQNSETDEKYFLIRKRPTGGLLAGLWDLPTVEIGEDPWPKDMQLAFDNALSSWIS 371 Query: 276 QRQIA-ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVG 332 H F+H H ++ E W + + VG Sbjct: 372 SENRQYIKKYQSCGRSTHIFTHIHKTSHVFYVLADQKLQVQKE--FYWISQSDLEHVG 427 >UniRef50_Q6MHU9 A/G-specific adenine glycosylase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MHU9_BDEBA Length = 333 Score = 276 bits (705), Expect = 1e-72, Method: Composition-based stats. Identities = 119/339 (35%), Positives = 176/339 (51%), Gaps = 23/339 (6%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 Q+ WY+K R +LPW+ +K PY++WLSEVMLQQT V VIPYFE+F+ +FPTV DLA Sbjct: 17 HKQLTQWYNKNKR-SLPWRENKNPYRIWLSEVMLQQTTVVAVIPYFEKFLQKFPTVQDLA 75 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 NAP +V+ W GLGYY+RARNLHKAA+ +A FP+T E+ LPG G T+ A+ Sbjct: 76 NAPEADVMEAWAGLGYYSRARNLHKAAKALAAG---GFPKTAAELLELPGFGPYTSRAVA 132 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S++ G+ +LDGNV RVL+R Y + + + L +S++++ + NQ +M+ Sbjct: 133 SIAFGEKVGVLDGNVIRVLSRRYGLKLEWWNGKGRDHLQKISDELSLLGQADVVNQGLME 192 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA +CT K C LCP C++ N P KKP++ L+ +V L Sbjct: 193 LGATVCTPQKVMCMLCPWAATCVSREKNLVEKLPLKKPRKESEVWVWKPLVAIKNQKVAL 252 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWL 306 Q + G FP N + +H +H DI Sbjct: 253 VQNDYAPFLKGQMIFPG-------------EISMEKNKPKAYDAKHNITHH--DIFIQIT 297 Query: 307 PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 S TG N W +L + V ++ ++++L +++ Sbjct: 298 EKKSLTG----KNLQWVDLKELKKVNPSSLLQKVLHKVK 332 >UniRef50_C3XHP3 A/G-specific adenine glycosylase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XHP3_9HELI Length = 356 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 108/348 (31%), Positives = 164/348 (47%), Gaps = 25/348 (7%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQI---DKTPYKVWLSEVMLQQTQVATVIP-YFERFMAR 58 A Q+ Q+L+WY GR +LPW+ + PY V++SE+MLQQTQV V YF F+ Sbjct: 22 AKQWQIQLLEWYAIQGRISLPWRNLKGENAPYGVYVSEIMLQQTQVKRVQEHYFAPFLNA 81 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+ LA A LD +L W GLGYY+RARN+ K A H P T +++ LPG+G Sbjct: 82 FPTLESLAKANLDSILKQWEGLGYYSRARNMQKTAIICCEKHNATLPNTRQDLLTLPGIG 141 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T+GAIL + +DGN++RVL R +A+ + L L+ + Sbjct: 142 AYTSGAILCFGFHQSVSFVDGNIRRVLCRIFALREP-----NQKLLDELAFLLLDTKHSF 196 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA++DLGAMICT P C +CP+QN C N + LYP K P L Sbjct: 197 DYNQALLDLGAMICTPKSPSCLICPVQNLCNGKVNPT--LYPTPKTSSLTPLTLHLLLYK 254 Query: 239 QHEDEVLL--AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + + GL+ GLY PQ + + +F+H ++ Sbjct: 255 DSQGRIAFVYEKGDKGGLYQGLYNLPQLETGAVTNK---------QGFYKCGSFKHHYTK 305 Query: 297 FHLDIVPMWL---PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + + L + T + + + + + L++ ++ L Sbjct: 306 YAITANVYKLDSKHLEILTQSLPHIKLYFLSQKELETKPLSSLCKKAL 353 >UniRef50_A6Q1G1 A/G-specific adenine glycosylase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q1G1_NITSB Length = 310 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 94/347 (27%), Positives = 163/347 (46%), Gaps = 43/347 (12%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVI-PYFERFMARFPTVTD 64 +L+W++K+GR LPW+ + YK++LSE+MLQQTQV+ V Y+ +F+ RFPT+ Sbjct: 1 MHKVLLEWFEKHGRHELPWRQTQDVYKIYLSEIMLQQTQVSRVEGEYYPKFLKRFPTLKA 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A +EVL LW+GLGYY+RARNL + A+ P+ +E+ LPG+G TA A Sbjct: 61 LAQASENEVLALWSGLGYYSRARNLLQCAKICKD----TLPKEPKELMKLPGIGTYTANA 116 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I + + + ++D N+KRV+ R +A+ E ++ ++ + ++ N A+ Sbjct: 117 ICAFAYNQPVAVVDTNIKRVIMRFFALQD-------EKEVQQKAQMILNTNEPKKHNLAL 169 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+++CT P C CP+Q C S + GK K E+ +F + + Sbjct: 170 MDLGSLLCTPKNPLCDQCPIQQWCA--GKESPHSF-GKTKKTKREEKVIHFGIFLQSGHI 226 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 + R L+ ++ P F+H+++ +++ Sbjct: 227 AME-RSNKNLYKNMFILPAIES----------------APKPFATFKHSYTKYNIQATLT 269 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLA----APVERLLQQLRTG 347 L N L + +A +++ ++ L+ Sbjct: 270 LLNK-------RPPNTHLIPLQALSAYPVASIALKGIKKAIKHLQED 309 >UniRef50_B1ZMV1 HhH-GPD family protein n=2 Tax=Verrucomicrobia RepID=B1ZMV1_OPITP Length = 344 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 108/343 (31%), Positives = 161/343 (46%), Gaps = 17/343 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F + +L WY + R+ LPW+ + + YK +SE MLQQTQV TV+PY++R++A P Sbjct: 12 AEFQSALLGWYRAHQRR-LPWRTEPSLYKTVVSEFMLQQTQVKTVLPYYDRWLAVLPNFA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VL LW GLGYY+RARNLH+ +Q + L K P T E LPGVG TA Sbjct: 71 ALAGASEAQVLKLWEGLGYYSRARNLHQLSQAIHALP--KPPRTPEAWRELPGVGPYTAA 128 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWP-GKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI S++ G +DGNV R+LAR A L++ + NQ Sbjct: 129 AITSIAFGAPAACVDGNVVRILARLTANRTLFRDSASAAKTFTPLAQALLSPSAPGDHNQ 188 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMM+LGA +C R P C CP++ C AA YP PK+ + +RT + Q + Sbjct: 189 AMMELGATVCVRQNPLCLTCPVRPFCAAARAGDPESYPRLAPKK-MEKRTVTRVWCQRDG 247 Query: 243 EVLLAQRPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +LL + + ++ P D + L R + F + Sbjct: 248 ALLLHRAADDARRFARMHELPTAED-------AGFDPARFEKRAPLVVKRRGITRFQITE 300 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 P +W +A+ ++ L+ P R + ++ Sbjct: 301 SIHLAP----PPRKLGAGFVWVPVAELHAITLSGPHRRWITEI 339 >UniRef50_Q1J2G1 A/G-specific adenine glycosylase n=3 Tax=Deinococcus RepID=Q1J2G1_DEIGD Length = 343 Score = 266 bits (681), Expect = 7e-70, Method: Composition-based stats. Identities = 107/315 (33%), Positives = 159/315 (50%), Gaps = 30/315 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 S +L W+D+ GR LPW++ + PY+VW+SEV+LQQTQVA YFERF+ F Sbjct: 7 SSLRTVLLAWFDRAGR-ALPWRVGPEGRRDPYRVWVSEVLLQQTQVARGRVYFERFLEAF 65 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PTV LA+AP++ VL W G GYYARARNLH+AA +A P T++ ALPGVG Sbjct: 66 PTVEALADAPIEAVLKAWEGCGYYARARNLHRAAGVMARE---GVPTTYDGWRALPGVGP 122 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ SL+ G+ + DGNV+RVLAR + ++ + ++++ Sbjct: 123 YTAAAVASLACGEARAVNDGNVRRVLARLHG-----ERQPTAAWAQARADELLDPARPGA 177 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 N+A+MDLGA +CT P+C CPL+ C A A +P K + + E LLL Sbjct: 178 CNEALMDLGATVCTPKAPQCGECPLRRWCAAFQLGDPAAFPAPKVRAAVREVRAVALLLG 237 Query: 240 HEDEVLLAQRP----------PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA 289 E +L +R P+ + + + + ++ A +L Sbjct: 238 DAQEAVLERRAGSLLGGLMGLPTEML-------EAGESKVDALTRLAARLGATVAGELGT 290 Query: 290 FRHTFSHFHLDIVPM 304 H+ +H + + Sbjct: 291 VTHSMTHRRVTLTVY 305 >UniRef50_B9KEL2 A/G-specific adenine glycosylase n=3 Tax=Campylobacter RepID=B9KEL2_CAMLR Length = 342 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 95/362 (26%), Positives = 169/362 (46%), Gaps = 47/362 (12%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYK-------------------VWLSEVMLQQTQ 44 + +L WY++ GRK+LPW+I Y+ V++SE+MLQQTQ Sbjct: 2 QKIHKSILKWYEQNGRKSLPWRILHEEYRKYGSEDDLKKLKNIDIAYAVYVSEIMLQQTQ 61 Query: 45 VATVI-PYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGK 103 V +V+ Y+ +F+A+FP++ L+ A DEVL W GLGYY RARN+HK A+ + Sbjct: 62 VKSVLQNYYFQFLAQFPSLKALSMASEDEVLKAWQGLGYYTRARNMHKCAKICVQEFDAR 121 Query: 104 FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK 163 P +E+ LPG+G TAGAI + +D N+KRVL+R Y + K V+ Sbjct: 122 LPLEIKELQKLPGIGEYTAGAIACFGFLQAKSFVDANIKRVLSRFYGLQNPNSKILVQK- 180 Query: 164 LWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 +++ NQA++D+GA++C KC +CPL+ C + + + K Sbjct: 181 ----AKEFLNYNNAFEHNQALLDIGALVCLPKNAKCEICPLK--CFCTGKDEYEKFHVSK 234 Query: 224 PKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADN 283 K L++Q ++ LL + L+ +Y F ++ +++ + Sbjct: 235 -KTQYENIVLKILIVQKNEQFLLVK-SKEKLYFNMYNFLEYKNQKKAK------------ 280 Query: 284 LTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + F+H+++ + ++ +L F + ++ + L+ + L+ Sbjct: 281 --FIGEFKHSYTKYKINAKVYFLKDDDFDDLKVKA----FSYKDLEYLALSKLALKTLEL 334 Query: 344 LR 345 + Sbjct: 335 FK 336 >UniRef50_Q4Q851 A/G-specific adenine glycosylase, putative n=3 Tax=Leishmania RepID=Q4Q851_LEIMA Length = 501 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 100/372 (26%), Positives = 155/372 (41%), Gaps = 70/372 (18%) Query: 7 SAQVLDWYDKYGRKTLPWQID-------------------------KTPYKVWLSEVMLQ 41 +V+ W+ ++ R+ LPW+ PY+VW+SEVM Q Sbjct: 32 QEEVIAWWRQHQRQDLPWRQTLPRGSGDLSGAVDGEKHRTSATTARYDPYQVWVSEVMSQ 91 Query: 42 QTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG 101 QT++ TVIPY+ +M RFP++ LA + DEV +W G+GYY RA L K A+ + Sbjct: 92 QTRMETVIPYYVAWMKRFPSIEALAASTEDEVKSVWAGMGYYRRAMYLRKGAKYLLERSK 151 Query: 102 GK-------FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSG- 153 K P + EE+ +PG+G T+ AI S+ GK +DGNV RVL+R Sbjct: 152 EKEGTAAVCMPSSQEELLKVPGIGPYTSAAITSMCFGKPVCSVDGNVIRVLSRLRGARDF 211 Query: 154 WPGKKEVENKLWSLSEQVTPAV---------GVERFNQAMMDLGAMICTRS-KPKCSLCP 203 P + + +++ NQ +M+LGA +C S P C+ CP Sbjct: 212 DPKVPANVKEAAAWGQRLMGNSPTTSAVVCQDPSALNQGLMELGASVCRPSGAPLCTSCP 271 Query: 204 LQNGCIAAA-------NNSWALYPGKKPKQTLPERTGYFLL-------LQHEDEVLLAQR 249 LQ C A+A + P + K ++ + ++ +R Sbjct: 272 LQRFCRASALLRGGDIEAIEGVIPVRAAKMAKRSARKLCVVHEMSASRIGDARRFVVVRR 331 Query: 250 PPSGLWGGLYCFPQF----ADEESLRQWLAQRQIAADNLTQ---------LTAFRHTFSH 296 P +GL GG+ FP AD+E L + ++ RH FSH Sbjct: 332 PANGLLGGMLEFPTVNASAADDEEAVVPLLKHPLSMLAWKARARPTFARLCGNVRHIFSH 391 Query: 297 FHLDIVPMWLPV 308 +D+ + Sbjct: 392 IVMDVGVAHVQW 403 >UniRef50_Q1ZXP7 Putative uncharacterized protein MYH n=1 Tax=Dictyostelium discoideum RepID=Q1ZXP7_DICDI Length = 574 Score = 265 bits (678), Expect = 2e-69, Method: Composition-based stats. Identities = 84/230 (36%), Positives = 128/230 (55%), Gaps = 9/230 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 + + +L WY+K R LPW+ Y+VW+SE+MLQQT+VATVI YF + Sbjct: 95 EIQEIRESMLGWYEKNKR-DLPWRKHDNSLDENVIAYRVWVSEIMLQQTRVATVIEYFNK 153 Query: 55 FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAAL 114 ++ ++PT+ DLA+ ++EV +W+GLGYY RA+NL ++ V K P + + + Sbjct: 154 WIEKWPTINDLASTTIEEVNKVWSGLGYYRRAKNLWLGSKYVVDNFNSKIPSDVKSLLEI 213 Query: 115 PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA 174 G+G TAGAI S++ K P++DGNV RVL+R ++ P W L + + Sbjct: 214 NGIGPYTAGAISSIAFNKPVPLVDGNVIRVLSRVRSIGANPKLSSTVKLFWELGNDLVES 273 Query: 175 -VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 FNQ++M+LGA IC+ P+C CP+Q+ C A + P K Sbjct: 274 VENPCNFNQSLMELGATICSVQSPQCKQCPIQSNCQAYQQEKQFIKPEPK 323 Score = 44.9 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 3/71 (4%) Query: 199 CSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ--HEDEVLLAQRPPSGLWG 256 C +C S YP K K + L+ + LL QRP +GL Sbjct: 360 CKICQ-SFEDSDGPTESVCRYPKKVLKTKARDENVNVFLIHQTKSNLFLLTQRPDTGLLA 418 Query: 257 GLYCFPQFADE 267 L+ P +E Sbjct: 419 SLFEAPSIIEE 429 >UniRef50_Q7M9R1 A/G-SPECIFIC ADENINE GLYCOSYLASE EC 3.2.2 n=1 Tax=Wolinella succinogenes RepID=Q7M9R1_WOLSU Length = 320 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 99/336 (29%), Positives = 153/336 (45%), Gaps = 34/336 (10%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVI-PYFERFMARFPTVTDL 65 +L WY GR LPW+ PY++ +SE+MLQQTQV TV+ ++ F+ RFPT+ + Sbjct: 6 QTSLLLWYQANGRHHLPWRHPLHPYEILISEMMLQQTQVNTVLERFYYPFLERFPTLESI 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A E+L W GLGYY+RARNLH A+ P + E+ LPG+G TA AI Sbjct: 66 ARAEESEILLAWRGLGYYSRARNLHALAKTCQQ----GLPRSVSELEGLPGIGAYTARAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 + ILDGN+KR+L+R +A+ G E +LW +E+ + NQA++ Sbjct: 122 ACFGFRESVAILDGNIKRILSRFFALLG-----VGERELWRRAEEFLNPLAAFDHNQALL 176 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GA++C P C CPL C + P K + E ++ + E V Sbjct: 177 DVGALLCKPKNPLCQECPLSPWCKG-KEDPLRYTPSKTRRY--EELELFYGICIQEGRVA 233 Query: 246 LAQRPPSGLWGGLYCF-PQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 + Q L+ GL+ F P A + + L +H ++ + + Sbjct: 234 MVQ-SQERLYKGLWGFVPLLG--------------APLDSSSLGMIKHGYTKYKITAHLY 278 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + + W L + + L+ +L Sbjct: 279 EIR-----SLPGDSKPSWIPLGELEHLPLSILAHKL 309 >UniRef50_A6Q673 A/G-specific adenine glycosylase n=2 Tax=Epsilonproteobacteria RepID=A6Q673_SULNB Length = 326 Score = 263 bits (673), Expect = 6e-69, Method: Composition-based stats. Identities = 107/343 (31%), Positives = 170/343 (49%), Gaps = 34/343 (9%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVI-PYFERFMARF 59 + + ++ WY+ YGR LPW+ PY ++LSEVMLQQTQV TV+ Y+ F+ F Sbjct: 9 VSLKEIHQKIRTWYEAYGRHDLPWRSTNNPYHIYLSEVMLQQTQVKTVLERYYFPFLQAF 68 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 ++ L NAPLD+VL W GLGYY RA+NLH+ A V P +E+ LPG+G+ Sbjct: 69 SSLEALGNAPLDDVLKQWEGLGYYNRAKNLHRTAGLVNE-----LPPEIDELVKLPGIGK 123 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 +TA AI + + + P+++ NVKR+L R + + + E KLW + Sbjct: 124 NTAHAIAAFAFHQPVPVMEANVKRILCRMHRL-----RTPNEKKLWKFAYASVDKEDPFN 178 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 +NQAMMD+GA +C P+C+ CPL+N C N+ YP KK K+T+P R ++ Sbjct: 179 YNQAMMDIGATLCLPKNPQCNRCPLENICK--GKNNPEYYPLKK-KKTVPAREENIVIYL 235 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 ++D + L QR GL+ F +I + H ++HF L Sbjct: 236 YDDRLSLQQR-SGKFLHGLWGF-------------ESVEIPPCAAEYIGEVTHAYTHFKL 281 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + + + E ++ + + ++ E+++ Sbjct: 282 K---CRVYLYFESSPEQE---YYFTPEKIGKLAISKVDEKIVN 318 >UniRef50_Q2IJ90 A/G-specific DNA-adenine glycosylase n=5 Tax=Cystobacterineae RepID=Q2IJ90_ANADE Length = 403 Score = 262 bits (671), Expect = 1e-68, Method: Composition-based stats. Identities = 113/341 (33%), Positives = 170/341 (49%), Gaps = 16/341 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK---TPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + ++L W+D G + LPW+ + PY+VWL+EVMLQQTQVAT PY+ RF+ R+P Sbjct: 25 AALRRRLLAWWDA-GHRPLPWRQPQRGADPYRVWLAEVMLQQTQVATATPYWLRFVERWP 83 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A ++VL W+GLGYYAR RNL AA++ HGG P + E + ALPG G Sbjct: 84 TLEALAAARDEDVLAAWSGLGYYARCRNLLAAAREALRRHGG-LPSSHEALRALPGFGPY 142 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ S++ P +DGNV RVL+R + V G P + ++ +L+ + Sbjct: 143 TAGAVASIAFAAPVPAVDGNVTRVLSRLFLVEGDPAARGARARVAALAAALVDRERPGDL 202 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LGA +C P C+ CP+ C A A A P + + + ++ Sbjct: 203 NQALMELGATVCRPR-PDCARCPVAARCAARAAGRAAELPPPRRRPEKVRQVLACAVVVR 261 Query: 241 EDEVLLAQRPPSGLWGGLYCFP--QFADEESLRQWLA-----QRQIAADNLTQLTAFRHT 293 E V L +RP +GL+ GL FP + A ++ L + + +L Sbjct: 262 EGRVALVRRPAAGLFAGLAAFPAAEVAPGDAAGAALERYARTRHGLRLRAGEELARVERV 321 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLA 334 +H L++ + ++ E W GL Sbjct: 322 LTHRLLELRALRCSLARPPP---EEGIRWVPADALDGAGLP 359 >UniRef50_C0ACJ1 HhH-GPD family protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ACJ1_9BACT Length = 355 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 96/279 (34%), Positives = 135/279 (48%), Gaps = 10/279 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F +L WY R+TLPW+ + YK +SE MLQQTQV TV+PYF R++ FP Sbjct: 16 ASFQHALLAWYHA-SRRTLPWREAPSLYKTVVSEFMLQQTQVKTVLPYFARWLDAFPDFA 74 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA AP + VL W GLGYY RARNLHK A+ + L + P ++ ALPG+G T+ Sbjct: 75 ALAAAPEERVLKHWEGLGYYTRARNLHKLARALVALP--EIPRRPDDWLALPGIGPYTSA 132 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWP--GKKEVENKLWSLSEQVTPAVGVERFN 181 AI S++ G+ ++DGNV R+L R A G P L+ + N Sbjct: 133 AITSIAFGEPVAVVDGNVVRILTRLTA-DGTPFRDSATASKHFVPLANAIVNPAHPGDHN 191 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QAMM+LGA +C R P C++CP+ C A YP + + + + Sbjct: 192 QAMMELGATMCHRQNPLCTVCPVIRFCAARREGDPETYP-RLAAKVITAINVVRAWCVRD 250 Query: 242 DEVLLAQRPPSG--LWGGLYCFPQFADEESLRQWLAQRQ 278 +LL R +G ++ P + LA Sbjct: 251 GRLLLH-RAAAGSRRLANMHELPSAEHLGLTPEQLAAAG 288 >UniRef50_C4PWX5 A/g-specific adenine dna glycosylase (Ec 3.2.2.-) (Muty homolog)(Mmyh)-related n=1 Tax=Schistosoma mansoni RepID=C4PWX5_SCHMA Length = 302 Score = 260 bits (666), Expect = 3e-68, Method: Composition-based stats. Identities = 100/295 (33%), Positives = 144/295 (48%), Gaps = 53/295 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI---DKTP----------------YK------VWLS 36 Q + +L WYD+ R LPW+ + P + VW+S Sbjct: 11 QIEKLRESLLLWYDRSKR-DLPWRRMALNPDPNLRGYAGMFDHSLDFVFHILYMCSVWVS 69 Query: 37 EVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQV 96 EVMLQQTQV TVI Y++R+M ++P+V LA+A LD+V LW+GLGYY+RAR LHK AQ++ Sbjct: 70 EVMLQQTQVKTVIDYYDRWMKKWPSVDQLASASLDDVNSLWSGLGYYSRARLLHKGAQKI 129 Query: 97 ATLHGGKFPETFEEVA-ALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 G FP++ E + ++PGVGR TAGAI S++ + P+LDGNV RVL R + Sbjct: 130 VDEFNGIFPQSAEVLKHSIPGVGRYTAGAIASIAFNQCTPVLDGNVIRVLTRLRQIGSPV 189 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC------- 208 LW+L+ ++ FNQA+M+LGA+ CT P C CPL Sbjct: 190 QLPTTMEYLWNLTTKLVDPNRPGDFNQALMELGAVCCTPKNPDCIKCPLNKANVNKNDYI 249 Query: 209 --------------IAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 + + YP K K+ ++ + +L +R Sbjct: 250 STDLEDCRLCISSSVYQKSLGVMNYPVKLNKREPRKQNTVI-----QSHILRERR 299 >UniRef50_A8I815 DNA repair glycosylase (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8I815_CHLRE Length = 793 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 101/282 (35%), Positives = 140/282 (49%), Gaps = 47/282 (16%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT----------------------------PYKV 33 + QF ++L WYD R LPW+ Y V Sbjct: 65 ERVQFRRELLSWYDGNHR-VLPWRRTPHTQRGAAADGEGGEDGVGPAPAELPPQQFAYWV 123 Query: 34 WLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAA 93 W+SEVMLQQTQVATVIPYF R+++R+PTV+DLA A + V +W GLGYY RAR L + A Sbjct: 124 WVSEVMLQQTQVATVIPYFRRWVSRWPTVSDLAAADTEAVNSMWAGLGYYRRARYLLEGA 183 Query: 94 QQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSG 153 + VA GG FP + +E+ +PGVG T+ A+ S++ G +DGNV RVL+R A+ G Sbjct: 184 KFVAGQLGGTFPTSAQELLKIPGVGPYTSAAVASIAFGSPAAAVDGNVIRVLSRLRALPG 243 Query: 154 WPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 P K + + +V +NQA+M+LGA +C P CS CP + C AAA Sbjct: 244 DPTKLGAAHTAM--AGEVLDGGRPGCYNQALMELGATVCRPVNPDCSACPARPVCRAAAE 301 Query: 214 ----------------NSWALYPGKKPKQTLPERTGYFLLLQ 239 +PG+K + E+ +L+ Sbjct: 302 WTAYVEGGGDEGAEDAPRVTRFPGRKAVKEKREQAVAVTVLE 343 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 13/39 (33%), Positives = 17/39 (43%) Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQW 273 LL + E LL RP GL GL+ FP ++ Sbjct: 431 ALLAKGERRYLLVMRPEGGLLAGLWEFPGAEVQDYTPNA 469 >UniRef50_C9M6M4 A/G-specific adenine glycosylase n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6M4_9BACT Length = 354 Score = 254 bits (650), Expect = 3e-66, Method: Composition-based stats. Identities = 101/343 (29%), Positives = 161/343 (46%), Gaps = 6/343 (1%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 S + +WY + GR LPW++ PY VW+SE MLQQTQV+ V+P + +++ FP LA Sbjct: 13 SRALAEWYRENGRH-LPWRLTAGPYAVWVSETMLQQTQVSRVVPLYLAWLSAFPNCRSLA 71 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A VL LW GLGYY+RARN+ +A+ + + P +A LPG G TA A+ Sbjct: 72 EADESAVLSLWRGLGYYSRARNMLASARLIVSAGYDGPPNDQTFLAKLPGFGPYTAAAVR 131 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 +L+ + LDGN++RV +R + P E E + A+MD Sbjct: 132 ALAYDEPTAALDGNLRRVSSRLTDLDKDPALSEGNKVCQRAVESLMKFQSPRLLTNALMD 191 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LG+ C P+C LCPL+ C+A + AL P ++ + + R G LL + ++ + Sbjct: 192 LGSGPCAPR-PRCLLCPLEPYCLARKRGTTALRPVRRAARPIRRRFGAALLFSTQTKLAV 250 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS-HFHLDIVPMW 305 QRP GLW + P + + R++AA + +F + + Sbjct: 251 RQRPKGGLWSEFWEIPWEVGSDGEDSGVTARRLAASLGEE--SFPEPLTPSVTMRFTSWQ 308 Query: 306 LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ-LRTG 347 + F + S+ + + RLL++ L++G Sbjct: 309 VETKLFRAASQPTGTELLPVDDALSLPMPLGILRLLREALQSG 351 >UniRef50_D1Y8G9 A/G-specific DNA-adenine glycosylase n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y8G9_9BACT Length = 365 Score = 253 bits (646), Expect = 9e-66, Method: Composition-based stats. Identities = 106/354 (29%), Positives = 170/354 (48%), Gaps = 17/354 (4%) Query: 1 MQASQF----SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFM 56 M A +F + + WY+ + R LPW++D+ PY + +SE MLQQTQV V ++ R+M Sbjct: 1 MSAEKFHLEAAEALTAWYNSHKR-DLPWRLDRDPYHILVSEAMLQQTQVERVKSFYARWM 59 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 RFPT+T LA+A D+VL W GLGYY+RARNL +AA+ V+ P E + +LPG Sbjct: 60 ERFPTLTSLASASEDDVLACWQGLGYYSRARNLRRAARLVSGAGLKTLPADEEFLRSLPG 119 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T GA+ S++ P +DGNV+RV +R + P + + + + + + Sbjct: 120 LGPYTVGAVCSIAFDLPVPAIDGNVRRVFSRLLDMPDDPARAKGTALIAAHAAAILKLGS 179 Query: 177 VERFNQAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 QA M+LGA +CT + +C CP+ C A A + A P + + R G Sbjct: 180 PHILTQAFMELGATVCTPGTTCQCGQCPVSRLCAAQAAGTQAQRPVSTRRNVVERRRGAA 239 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES------LRQWLAQRQIAADNLTQLTA 289 LL+ + +RP GLW Y P E L + A+ ++ A L Sbjct: 240 LLILVPHGCAVRRRPAGGLWSRFYEIPWLRGEADESAESCLSRLTAELRLTAPCLDLELE 299 Query: 290 FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 F+ + + + + D ++ + + A ++RL+Q+ Sbjct: 300 ETLKFTCWQVRLHLWSCRMLQPPAGCDA-----FSADELADFPMPAGLKRLVQK 348 >UniRef50_A6DT67 Adenine glycosylase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DT67_9BACT Length = 357 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 102/358 (28%), Positives = 169/358 (47%), Gaps = 21/358 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID-KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 F ++ W LPW+ + + Y+V +SEVMLQQT VATV+P +E F +FP + Sbjct: 7 KDFQ-DLVSWSKDNYSH-LPWRSEVRDLYRVLVSEVMLQQTTVATVLPRYESFFEKFPDL 64 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LANA +++ W GLGYY RA+NL+KA + GG+FP+ EE+ +PGVG TA Sbjct: 65 ASLANADENDLALAWKGLGYYRRAQNLYKAVTMI-HQSGGEFPDGEEELQKVPGVGPYTA 123 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSE-----QVTPAVGV 177 A+ ++ + +DGN++RVL+R + + G K ++ + SL + + G Sbjct: 124 AALTAIGRNQLALAVDGNLQRVLSRYFFIEVEQGPK-LQKAVHSLIQNKTFAKTLELCGP 182 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL---PERTGY 234 +FN+A+MDLG IC PKC CPLQN C A + L P KK KQ + + Sbjct: 183 RKFNEALMDLGRAICKPRNPKCGECPLQNSCEARLKHKVELVPVKKEKQKVAISELKLVR 242 Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 FL+ + E+L ++ + P E ++ N + Sbjct: 243 FLVKNSQGEMLFYKKRQGEWLKDQWELPTAIVESEDQKIKQYPFANVANYSD-EQITTAI 301 Query: 295 SHFHLDIVPMWLPVSSFT----GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 + + I W + G +++ + + + +++ E++L + + Sbjct: 302 TRYR--IRNYWKEMEELDFRDLGILNDRDFAFLPWGKTQEH-ISSASEKVLTLINGRS 356 >UniRef50_Q9SU12 Adenine DNA glycosylase like protein n=1 Tax=Arabidopsis thaliana RepID=Q9SU12_ARATH Length = 608 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. Identities = 105/414 (25%), Positives = 157/414 (37%), Gaps = 94/414 (22%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 + + +LDWYD R LPW+ ++ Y+VW+SE+MLQQT+V TV+ Y++R Sbjct: 127 ETQKIRMGLLDWYDVNKR-DLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKR 185 Query: 55 FMARFPTVTDLANAPLD-------------------EVLHLWTGLGYYARARNLHKAAQQ 95 +M ++PT+ DL A L+ EV +W GLGYY RAR L + A+ Sbjct: 186 WMQKWPTIYDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKM 245 Query: 96 VATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 V G FP + + G+G+ TAGAI S++ + Sbjct: 246 VVAGTEG-FPNQASSLMKVKGIGQYTAGAIASIAFNEVSQF------------------- 285 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSK------PKCSLCPLQNGCI 209 L+ Q+ FNQ++M+LGA +CT SK P S C + Sbjct: 286 ---HFFQSSMKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSE 342 Query: 210 AAANNSWALYPGKKPKQTLPERTGYFLLL--------QHEDEVLLAQRPPSGLWGGLYCF 261 S YP K K +L Q +L +RP GL GL+ F Sbjct: 343 ENRTISVTDYPTKVIKAKPRHDFCCVCVLEIHNLERNQSGGRFVLVKRPEQGLLAGLWEF 402 Query: 262 PQFADEESLRQWLAQRQI------------------AADNLTQLTAFRHTFSHFHLDIVP 303 P E + I + +L F H F+H + Sbjct: 403 PSVILNEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYV 462 Query: 304 MWLPVSSFTGCMDEGNAL------W--YNLAQPPSVGLAAPVERL----LQQLR 345 L V G D W + ++GL + V ++ LQ ++ Sbjct: 463 ELLVVQLTGGTEDLFKGQAKDTLTWKCVSSDVLSTLGLTSAVRKVPPFRLQHIK 516 >UniRef50_A7NN47 HhH-GPD family protein n=2 Tax=Bacteria RepID=A7NN47_ROSCS Length = 317 Score = 247 bits (631), Expect = 4e-64, Method: Composition-based stats. Identities = 83/239 (34%), Positives = 129/239 (53%), Gaps = 6/239 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + + +L W+ + R LPW+ + PY++ ++EVMLQQTQV V+P + F+ RFP Sbjct: 15 LSLNDLHQALLKWFSEAAR-DLPWRRTRDPYRILVAEVMLQQTQVDRVLPKYAAFLERFP 73 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA AP EV+ +W GLGY RA NL +AA+ + +GG FP + LPG+G Sbjct: 74 TLHTLAEAPTAEVIRMWAGLGYNRRAVNLQRAARAICARYGGVFPRDVATLVTLPGIGSY 133 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ + + +D N++RV+ R + P + E L +L+ P + Sbjct: 134 TAGAVACFAFEQDVAFMDTNIRRVIRRVF---TDPTETVNERALLALARAALPVGRSWMW 190 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGC--IAAANNSWALYPGKKPKQTLPERTGYFLL 237 NQA+M+LG+++CT P C CPL++ C AA S + ++ L ER + Sbjct: 191 NQALMELGSLVCTADAPACWRCPLRDQCRDYAARRESDERFASAPVRKRLAERRERPFI 249 >UniRef50_C9RK91 HhH-GPD family protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK91_FIBSS Length = 384 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 107/321 (33%), Positives = 152/321 (47%), Gaps = 22/321 (6%) Query: 8 AQVLDWYDKYGRKTLPWQID-----KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 ++ +W+ LPW+ + PY VW+SE MLQQTQV+TV YF R+M RFP V Sbjct: 20 KRLREWFRANA-AELPWRPAGLDAPRDPYAVWISETMLQQTQVSTVRDYFTRWMKRFPDV 78 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGG----KFPETFEEVAALPGVG 118 LA A EV W GLGYY+RARN+ K A+ VA L K PET +E+ ALPG+G Sbjct: 79 EALAKAEEAEVFKYWQGLGYYSRARNILKTAKIVAALRQAQEPCKMPETRKELEALPGIG 138 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK-----LWSLSEQVTP 173 TAGAILSL+ + ILDGN+ R+ +R Y + P K NK W + +V Sbjct: 139 AYTAGAILSLAYHQREAILDGNLVRIFSRLYELDFLPTDKGSANKNCTEIYWEYAREVAD 198 Query: 174 AVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTG 233 + N+A+M+LG +C P C CPL+ C A +P K+T G Sbjct: 199 SPKAYMHNEALMELGRTVCKTKSPLCETCPLRGECRAFQECRTTEFP-PAKKRTEKSWHG 257 Query: 234 YFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQR------QIAADNLTQL 287 L+++ DE +LA P F L + ++ Sbjct: 258 TVLVVESADEKILAVNGGQKFLDKQLALPHFESARHATVALPAKAEDYINADEVKSVEYC 317 Query: 288 TAFRHTFSHFHLDIVPMWLPV 308 FRH+ + ++ + + + Sbjct: 318 GTFRHSITVHKIECDVLHVQL 338 >UniRef50_B6K934 Helix-hairpin-helix motif-containing protein n=3 Tax=Toxoplasma gondii RepID=B6K934_TOXGO Length = 833 Score = 246 bits (629), Expect = 8e-64, Method: Composition-based stats. Identities = 96/332 (28%), Positives = 137/332 (41%), Gaps = 79/332 (23%) Query: 29 TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARN 88 +PY VW+SEVMLQQTQV TVI Y++R+M+R+PTV DL A +EV +W+GLGYY RAR Sbjct: 263 SPYGVWVSEVMLQQTQVCTVIDYWQRWMSRWPTVGDLVKATEEEVSQMWSGLGYYRRARQ 322 Query: 89 LHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARC 148 L K AQ V G+ P E++ ++PG+G T GAI +++ G +DGNV RVLAR Sbjct: 323 LLKGAQTVVQEFDGELPGDVEKLLSIPGIGPYTGGAISAIAFGNRAAAVDGNVLRVLARL 382 Query: 149 YAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 ++ + + +A+++LGA ICT P C CP++ C Sbjct: 383 LGLAAPADSRALAMFCSRWMPPFLDPRRPGASTEALIELGATICTPRAPSCLSCPVRQFC 442 Query: 209 ----------------------------------IAAANNSWALYPGKKPKQTLPERTGY 234 A A YP K ++ PE + Sbjct: 443 LVNREAKSGTSACEARREIHSADCKLCIPFAEAQAAVRERQQAAYPVAKATKSRPEESYV 502 Query: 235 FLLL-----QHED----------------------------------------EVLLAQR 249 L + + ED E+LL +R Sbjct: 503 VLCVTRVPGEREDADAHSTGRRGHSSNSTKSQIACGEDRHTSGQVREREFEQWEILLRRR 562 Query: 250 PPSGLWGGLYCFPQFADEESLRQWLAQRQIAA 281 P +GL G P + + + AA Sbjct: 563 PSAGLLAGQLEVPSCLRSRTQGAKTPEEKKAA 594 >UniRef50_B2AMG7 Predicted CDS Pa_5_7310 n=1 Tax=Podospora anserina RepID=B2AMG7_PODAN Length = 582 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 92/236 (38%), Positives = 139/236 (58%), Gaps = 24/236 (10%) Query: 1 MQASQFSAQVLDWYDKYGR-KTLPWQI-----------DKTPYKVWLSEVMLQQTQVATV 48 + + A +L W+ K + +PW+ + Y+VW+SE+MLQQT+VATV Sbjct: 116 LSSPSAQASLLRWFQKEQTTRLMPWRKPFLTNPSRADLSRRAYEVWISEIMLQQTRVATV 175 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 I Y+ ++M ++PT+ DLA A +EV+++WTGLGYY+RAR +H AQ+V T G P+T Sbjct: 176 IAYWNKWMTKWPTIEDLAQATEEEVVNMWTGLGYYSRARRIHAGAQKVVTEMQGLLPDTV 235 Query: 109 EEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP-GKKEVENKLWS 166 E + +PGVGR TAGAI ++ G+ P++DGNV RVL+R + G G K V + LW Sbjct: 236 EGLMKHVPGVGRYTAGAISAIVFGEAEPMVDGNVMRVLSRQMGLMGDVKGDKRVVDVLWE 295 Query: 167 LSEQVTP---------AVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 ++++ + QA+M+LG+ ICT KP+C CP+ C+A A Sbjct: 296 AADRLVKVVAEADGEEGEKPGLWGQALMELGSTICTP-KPQCGKCPVTESCMAYAE 350 Score = 77.6 bits (190), Expect = 7e-13, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 49/141 (34%), Gaps = 18/141 (12%) Query: 217 ALYPGKKPKQTLPERTGYFLLLQH--EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL 274 +P KKPK+ + E ++ + + L+++RP GL GL+ FP + S Sbjct: 432 QRFPLKKPKKQVREEEALVCAIRRVSDGQYLISRRPGKGLLAGLWEFPSYILPASNDSTT 491 Query: 275 AQR------------QIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMD----EG 318 R + +L FSH L + + + E Sbjct: 492 KSRKKQALDYASGLVESTDGYRGELGTVPWLFSHLRLAMHVQLFELDDSDRSLRTLPLES 551 Query: 319 NALWYNLAQPPSVGLAAPVER 339 N W + + S + +++ Sbjct: 552 NYRWVSSDEIESESMGTGMKK 572 >UniRef50_A9AYZ2 HhH-GPD family protein n=5 Tax=Chloroflexi (class) RepID=A9AYZ2_HERA2 Length = 323 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. Identities = 87/244 (35%), Positives = 126/244 (51%), Gaps = 12/244 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + S +L W+ GR LPW+ + PY + +SE MLQQTQV VIP +E F+A FPT Sbjct: 3 ELSTLQIDLLAWFQANGR-DLPWRRTRNPYYILVSETMLQQTQVDRVIPKYEAFLALFPT 61 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATL----HGGKFPETFEEVAALPGV 117 V LA+A +V+ W GLGY RA NL +AAQ + FP T E + LPG+ Sbjct: 62 VEALASASTADVIRSWQGLGYNRRAVNLQRAAQAIVAAGYPADPAGFPATPEGLRNLPGI 121 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G T+GA+ + + LD N++RV+ R + E L ++Q+ P Sbjct: 122 GAYTSGAVACFAFERDVAFLDTNIRRVVRRLLVGPEDAPPETNEQTLIDYAQQLIPQGQG 181 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA-------NNSWALYPGKKPKQTLPE 230 +NQA+M+LGA+IC+ +KP+C CP+ C A A P KP++ E Sbjct: 182 WAWNQAIMELGALICSAAKPQCWRCPVNQHCRAYAIWREANTQLDMWQPPVIKPRKKAAE 241 Query: 231 RTGY 234 + + Sbjct: 242 QPFH 245 >UniRef50_Q4P7F9 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P7F9_USTMA Length = 539 Score = 245 bits (627), Expect = 1e-63, Method: Composition-based stats. Identities = 94/275 (34%), Positives = 141/275 (51%), Gaps = 35/275 (12%) Query: 9 QVLDWYDK-YGRKTLPWQID------------------KTPYKVWLSEVMLQQTQVATVI 49 +L W++ ++ +PW+ D K Y+VW+SE+MLQQT+V TV Sbjct: 90 ALLTWFETVSSKRDMPWRADFIDASHYSDAAKLREARKKRAYQVWISEIMLQQTRVETVK 149 Query: 50 PYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG--GKFPET 107 Y+ +M ++PT+ LA A + VL W GLGYY+RA +H AA++V G PET Sbjct: 150 TYWLNWMNKWPTIEALAAADPEAVLAAWRGLGYYSRATRIHTAAKRVVADPHMMGLLPET 209 Query: 108 FEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS 166 +E+ +PGVG TAGAI S+ G+ PI+DGNV RVL+R A+ P K + +W+ Sbjct: 210 PQELEKNVPGVGPYTAGAISSIVFGQAVPIIDGNVVRVLSRQLALYANPKTKLTSHLMWA 269 Query: 167 LSEQVTPAVG---------VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWA 217 + ++ ++NQA+M+LG+ +CT +KP+C CP+Q C A Sbjct: 270 TATKLVHKASALRGGKATVPGQWNQALMELGSTVCTPTKPRCDECPIQATCSAHQEAQAD 329 Query: 218 LYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPS 252 P + +L ED L +R PS Sbjct: 330 TIKANNK----PNQAAGAEVLDIEDICTLCERFPS 360 Score = 53.8 bits (128), Expect = 9e-06, Method: Composition-based stats. Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 1/54 (1%) Query: 213 NNSWALYPGKKPKQTLPERTGYFLLLQHE-DEVLLAQRPPSGLWGGLYCFPQFA 265 +P K K+T+ + ++Q E LL QRP SGL + FP Sbjct: 418 ERHICKFPMKLEKKTIRQEECLVCIIQRSSGEFLLEQRPASGLLASTWQFPSLT 471 >UniRef50_B9L504 Catalytic Domain Of MutyY n=4 Tax=Bacteria RepID=B9L504_THERP Length = 358 Score = 243 bits (621), Expect = 6e-63, Method: Composition-based stats. Identities = 90/209 (43%), Positives = 127/209 (60%), Gaps = 4/209 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q +++DWY + R LPW+ + PY++ +SEVMLQQTQV VIPY+E F+ARFPT Sbjct: 63 QIRAVQRRLVDWYRREAR-DLPWRRTRDPYRILVSEVMLQQTQVERVIPYYEVFLARFPT 121 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A L EV+ +W GLGY RA L +AA+++ GG+FP + LPGVGR T Sbjct: 122 VEALASAALAEVIAVWGGLGYNRRAVYLWRAAREIVERWGGRFPGERRLLERLPGVGRYT 181 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGA+ + G+ D N+ RVL R + P + +L L+E+V P +N Sbjct: 182 AGAVACFAFGERVAFWDTNIARVLRRVFLG---PEARPGRRELDELAERVLPLDRAYEWN 238 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 QA+M+LGA IC+ +P+C +CPL C + Sbjct: 239 QALMELGARICSARRPRCEICPLCGLCRS 267 >UniRef50_D0GU15 A/G-specific adenine glycosylase n=1 Tax=Vibrio mimicus MB-451 RepID=D0GU15_VIBMI Length = 231 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. Identities = 120/224 (53%), Positives = 157/224 (70%), Gaps = 5/224 (2%) Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS K ILDGNVKR LARC+AV GWPG+K VEN+LW +E TP+V V+++NQAMM Sbjct: 2 LSSVYKKPHAILDGNVKRTLARCFAVEGWPGQKSVENQLWHYAEMHTPSVDVDKYNQAMM 61 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAMICTRSKPKCSLCP+++ C+A + YPGKKPK P + +F++L H + V Sbjct: 62 DMGAMICTRSKPKCSLCPVESFCLAKQQGNPQEYPGKKPKTDKPVKATWFVMLYHNNAVW 121 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI---AADNLTQLTAFRHTFSHFHLDIV 302 L QRP +G+WGGLYCFPQ ++ +++ + QR I A + L AFRHTFSH+HLDI Sbjct: 122 LEQRPQTGIWGGLYCFPQ-SEIANIQTTIDQRAIGDNAIKSQKTLIAFRHTFSHYHLDIT 180 Query: 303 PMWLPVSSFTGCMDEGN-ALWYNLAQPPSVGLAAPVERLLQQLR 345 P+ L +S + EG+ LWYNL+QP +GLAAPV++LL L Sbjct: 181 PILLELSRKPDIIMEGSKGLWYNLSQPDEIGLAAPVKQLLHSLP 224 >UniRef50_C8NK18 A/G-specific adenine glycosylase n=8 Tax=Corynebacterium RepID=C8NK18_COREF Length = 308 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 85/253 (33%), Positives = 123/253 (48%), Gaps = 4/253 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTP-YKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + + VL W+ R L W+ TP + + LSEVM QQT VA V P + +MA + Sbjct: 16 MSYNALQSSVLRWFRLNAR-DLAWRDPDTPAWGILLSEVMSQQTPVARVEPIWLEWMATW 74 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT A A DEVL W LGY RA L + A+ + HGG+ P+T +E+ ALPG+G Sbjct: 75 PTPQAFAEASTDEVLRAWGKLGYPRRALRLLECARVIVDKHGGRVPDTVDELLALPGIGD 134 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ + + G++ P++D NV+RV R AV G + K + PA Sbjct: 135 YTARAVAAFAFGQNVPVVDTNVRRVYRR--AVEGRFLQGTASKKELVDVAAILPADSGPE 192 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 F+ +M+LGA+ICT + PKC+ CPL C + A + + Q Sbjct: 193 FSAGIMELGALICTATSPKCASCPLLELCEWQRLDCPAPSEEELASAKKRVQKFTGTDRQ 252 Query: 240 HEDEVLLAQRPPS 252 ++ R Sbjct: 253 VRGLIMDVLRGAE 265 >UniRef50_B3DTE4 A/G-specific DNA glycosylase n=6 Tax=Bifidobacterium RepID=B3DTE4_BIFLD Length = 328 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 94/323 (29%), Positives = 147/323 (45%), Gaps = 59/323 (18%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + S ++ W++ R LPW+ + TP+ V +SEVM QQTQ++ V+PY+ +M R+ Sbjct: 1 MNDTAISLRLGAWWEANAR-DLPWRFGRATPWGVLVSEVMSQQTQMSRVVPYWTDWMERW 59 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA AP EV+ W LGY RA L + A+ VA + K P T++E+ ALPG+G Sbjct: 60 PDARALAEAPKAEVITAWGRLGYPRRALRLQECARVVAEDYADKLPRTYDELTALPGIGD 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 TA A++S + G+ ++D N++RVL+R + V G + +L+ ++ P + Sbjct: 120 YTASAVMSFAFGERIAVIDTNIRRVLSRVFLGVESRGGATSPAER--ALANRMLPKDEIL 177 Query: 179 -------------------------------------RFNQAMMDLGAMICTRSKPKCSL 201 +NQ++M+LGA+ICT P C Sbjct: 178 GCDADVADNAGSAEHVVNSTIRGGKRSRLHRGERPSVTWNQSVMELGAVICTAKSPLCDT 237 Query: 202 CPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCF 261 CP+ + C PG ++T P + Q VL A R Sbjct: 238 CPIADDCAFLKAGR----PGLGERRTRPRQRFQGTDRQVRGLVLAALR----------EL 283 Query: 262 P---QFADEESLRQWLAQRQIAA 281 P E++ + W Q Q+AA Sbjct: 284 PAGATLPREDADKLWKDQIQLAA 306 >UniRef50_UPI0000510111 A/G-specific adenine glycosylase n=1 Tax=Brevibacterium linens BL2 RepID=UPI0000510111 Length = 345 Score = 237 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 76/226 (33%), Positives = 116/226 (51%), Gaps = 3/226 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI-DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + ++ W++ R LPW+ D T + V +SE+M QQT V+ V P + +M ++P Sbjct: 25 DIHRVRETIITWFESAAR-PLPWRDADTTAWAVLVSEIMSQQTPVSRVEPRWREWMQKWP 83 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T DLA AP EVLH W LGY RA L +AA+ + G P+T +E+ LPG+G Sbjct: 84 TPADLAQAPTAEVLHRWDRLGYPRRALRLQEAARVITEELDGHVPQTAKELERLPGIGSY 143 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA A+ S + G+ +LD NV+RVL R +A P + + Q P +++ Sbjct: 144 TAAAVTSFAHGERTTVLDTNVRRVLIRLFAGRDRPSPSPGRAE-TEWAGQFVPETEHKQW 202 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQ 226 N +M+ GA++CT P+C CPL + C K ++ Sbjct: 203 NAGVMEFGALVCTARNPQCETCPLNDICAWQKAGRPTSAVKPKTQK 248 >UniRef50_B8P2D0 Predicted protein n=1 Tax=Postia placenta Mad-698-R RepID=B8P2D0_POSPM Length = 562 Score = 237 bits (604), Expect = 5e-61, Method: Composition-based stats. Identities = 96/401 (23%), Positives = 152/401 (37%), Gaps = 108/401 (26%) Query: 2 QASQFSAQVLDWYDK-YGRKTLPWQIDKTP-----------YKV--WLSEVMLQQTQVAT 47 + S +L+WY+ + + +PW+ P Y+V W+SE+MLQQTQVAT Sbjct: 77 EPSPLREALLEWYEGVHAARGMPWRKPYDPSMNNDQRAQRAYEVLVWVSEIMLQQTQVAT 136 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 VIPY+ R+M ++PT+ DLA + ++ V +W GLGYY+RA L AQ+ GG+ P+ Sbjct: 137 VIPYYNRWMKKYPTIRDLAASDIETVNSIWRGLGYYSRAARLLAGAQKAVRDFGGRLPDN 196 Query: 108 FEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS 166 +++ A +PG+GR +AGAI S++ R AV P K+ + LW Sbjct: 197 AKDMEANIPGIGRYSAGAICSIAYND--------------RILAVHAPPKSKQTLDILWQ 242 Query: 167 LSEQVTP-AVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN------------ 213 + + NQA+++LG+ P C L +G Sbjct: 243 GATAMVEGGTRPGDLNQALIELGST------PWCQAVKLADGKEPEGAASKSGAEVLDIE 296 Query: 214 ------------NSWALYPGKKPKQTLPERTGYFLLLQHEDE-------VLLAQRPPSGL 254 +P + K+ E +++ + LL +RP GL Sbjct: 297 EICTLCEPLPVGRPVTSFPMRAEKKKAREELDIVNVIEWRSQADGGGRWFLLVRRPEGGL 356 Query: 255 WGGLYCFP-----------------------------------------QFADEESLRQW 273 GGL+ FP Q E S + Sbjct: 357 LGGLHEFPTSPAVPVTISAAAQKKVPSALLRDLLTSPPAGEHGARTSGDQARGEISGAEL 416 Query: 274 LAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGC 314 + ++ H FSH W+ + G Sbjct: 417 EPSTALRIVHVEPAGDVIHIFSHIRKTYRVQWVVLEGGGGT 457 >UniRef50_A1A5M6 Mutyh protein n=4 Tax=Eukaryota RepID=A1A5M6_RAT Length = 184 Score = 235 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 85/175 (48%), Positives = 113/175 (64%), Gaps = 1/175 (0%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQTQVATVI Y+ R+M ++PT+ DLA+A L+EV LW+GLGYY+R R L + A++V Sbjct: 1 MLQQTQVATVIDYYTRWMQKWPTLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVE 60 Query: 99 LHGGKFPETFEEVAAL-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGK 157 GG P T E + L PGVGR TAGAI S++ + ++DGNV RVL R A+ P Sbjct: 61 ELGGHVPRTAETLQQLLPGVGRYTAGAIASIAFDQVTGVVDGNVIRVLCRVRAIGADPTS 120 Query: 158 KEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA 212 V + LW L++Q+ FNQA M+LGA +CT +P CS CP+Q+ C A Sbjct: 121 SFVSHHLWDLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAHQ 175 >UniRef50_A9UV22 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UV22_MONBE Length = 221 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 83/222 (37%), Positives = 118/222 (53%), Gaps = 18/222 (8%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQI----------------DKTPYKVWLSEVMLQQTQVAT 47 S ++ WY ++ R LPW+ + Y VW+SEVM QQTQ++ Sbjct: 1 SAIQHALIQWYHQHQR-VLPWRQPSVRRSAESMTHGQDSNTMAYAVWISEVMCQQTQISV 59 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 V YFER++A++PTV LA A L +V W GLGYY+RA LH+AAQ + G FP Sbjct: 60 VTDYFERWIAKWPTVQALAAAQLSDVHQAWAGLGYYSRATRLHEAAQYIVNQLDGSFPTV 119 Query: 108 FEEVAALP-GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS 166 + P GVG TA A+ S+ K ++DGNV RVL+R ++ + + +W Sbjct: 120 ARLIFCWPPGVGPYTASAVASIVFAKRVGVVDGNVNRVLSRLGGIAVPLTQDAAKKWMWR 179 Query: 167 LSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 ++ + + NQAMM+LGA+ICT P+C CPL C Sbjct: 180 QADTLADHAAPGQINQAMMELGALICTPKSPQCHACPLAEHC 221 >UniRef50_A4AHC9 Adenine glycosylase n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AHC9_9ACTN Length = 297 Score = 233 bits (595), Expect = 7e-60, Method: Composition-based stats. Identities = 86/254 (33%), Positives = 129/254 (50%), Gaps = 9/254 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++FS +W++++GR LPW+ D + + +SE+MLQQT VA VIP E+++ R+P Sbjct: 9 NLAEFSLVTREWFNEHGR-DLPWRHDGFGAWGILVSEIMLQQTPVARVIPRLEQWLDRWP 67 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T LA E + W LGY RA NLH AA + H G PE + + ALPG+G Sbjct: 68 TPAALAADSPGEAVRAWERLGYPRRALNLHAAATAITKNHDGTVPEDVQSLLALPGIGDY 127 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVE 178 TA A+ + + G P++D NV+RV+AR A G G L + E + P + Sbjct: 128 TARAVAAFAYGHRHPVVDTNVRRVIARAVAGQGEAGPPSTRRDL-AAMELLLPEDPVAAQ 186 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 N A+M+LGA++CT +P C CP++ C A YP + K+ ++ Sbjct: 187 LTNAAVMELGAIVCTAKRPLCDECPVRELCQWRARG----YPPFEGKRQAVQKRFEGSDR 242 Query: 239 QHEDEVLLAQRPPS 252 Q +L R Sbjct: 243 QVRGLILAELRASD 256 >UniRef50_D1BVG2 HhH-GPD family protein n=39 Tax=Actinomycetales RepID=D1BVG2_XYLCX Length = 581 Score = 233 bits (594), Expect = 9e-60, Method: Composition-based stats. Identities = 82/253 (32%), Positives = 122/253 (48%), Gaps = 9/253 (3%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQI-DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +V+ W+D R+ LPW+ D+TP+ V +SEVMLQQT V V P + +M R+PT Sbjct: 293 SRALVERVVRWFDG-ARRDLPWRAADRTPWGVLVSEVMLQQTPVVRVEPAWRAWMERWPT 351 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 +DLA A +VL W LGY RA L + A+ + H G P+ + ALPGVG T Sbjct: 352 PSDLAAASTADVLRAWDRLGYPRRALRLQECARVLVERHDGAVPDDEAALRALPGVGEYT 411 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A A+ + + G+ ++D NV+RVLAR P + + V P Sbjct: 412 AAAVRAFAFGRRAVVVDTNVRRVLARAVGGVALPAP-SPTAAERATATAVVPHDDDAAAA 470 Query: 182 QAM--MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 A M+LGA++CT P+C+ CP+++ C A PG + + Q Sbjct: 471 WAAASMELGALVCTARSPRCAECPVRDLCAWRAAGC----PGDAHAHRRKTQAWHGTDRQ 526 Query: 240 HEDEVLLAQRPPS 252 ++ A R Sbjct: 527 CRGRIMAALRAAD 539 >UniRef50_B7GUG9 HhH-GPD family protein n=2 Tax=Bifidobacterium RepID=B7GUG9_BIFLI Length = 318 Score = 232 bits (593), Expect = 1e-59, Method: Composition-based stats. Identities = 94/312 (30%), Positives = 149/312 (47%), Gaps = 47/312 (15%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + S ++ W++ R LPW+ + TP+ V +SEVM QQTQ++ V+PY+ +MAR+ Sbjct: 1 MNDTAISLRLGAWWEANAR-DLPWRFGRATPWGVLVSEVMSQQTQMSRVVPYWNDWMARW 59 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA AP +V+ W LGY RA L + A+ VA +G + P T++E+ ALPG+G Sbjct: 60 PDARALAAAPKADVITAWGRLGYPRRALRLRECARVVAEEYGDELPRTYDELVALPGIGD 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV-- 177 TA A+LS + G+ ++D N++RVL+R + + G +L+ ++ P V Sbjct: 120 YTASAVLSFAFGERIAVIDTNIRRVLSRVFLGTESRG-GAASPAERALANRMLPQDRVCG 178 Query: 178 -------------------------ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA 212 +NQ++M+LGA++CT P C +CP+ + C Sbjct: 179 DGADCTDHAYRSGEHTFLQRSEPPSVTWNQSVMELGAVVCTAKTPLCEICPIADDCAFLK 238 Query: 213 NNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFP---QFADEES 269 PG ++T P + Q VL A R P E++ Sbjct: 239 AGR----PGLGERRTRPRQRFQGTDRQVRGLVLAALR----------ELPAGVTLPREDA 284 Query: 270 LRQWLAQRQIAA 281 + W Q Q+AA Sbjct: 285 DKLWKDQIQLAA 296 >UniRef50_C6A996 A/G-specific adenine glycosylase n=9 Tax=Bifidobacteriaceae RepID=C6A996_BIFLB Length = 329 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 87/252 (34%), Positives = 131/252 (51%), Gaps = 13/252 (5%) Query: 8 AQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 ++ W+ R LPW+ +T P+ V + EVM QQTQ++ V+PY+E +M+++P LA Sbjct: 43 RELATWWQTSAR-DLPWRYGRTTPWGVLVCEVMSQQTQMSRVVPYWEAWMSQWPDAASLA 101 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A EV+ W LGY RA L + A+ VA + + P ++E+ ALP VG TA A+L Sbjct: 102 AAEKSEVIRAWGRLGYPRRALRLQECAEVVARDYDDRLPREYDELMALPSVGDYTASAVL 161 Query: 127 SLSLGKHFPILDGNVKRVLAR-CYAVSGWPGKKEVENKLWSLSEQVTPA--VGVERFNQA 183 S + G+ ++D N++R L+R V G + +L+ V P +NQA Sbjct: 162 SFAYGERVAVIDTNIRRALSRAFLGVESLGGSCTPLER--ALAWVVLPKAAEQSVLWNQA 219 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++CT P+C CPLQ C A PG K+T P + + Q Sbjct: 220 VMELGALVCTAKAPQCEQCPLQPQC----EFVKAGMPGLGEKRTRPRQRFHGTDRQVRGS 275 Query: 244 VLLAQR--PPSG 253 +L A R P G Sbjct: 276 ILQALRALPAGG 287 >UniRef50_B2W3B1 A/G-specific adenine glycosylase n=2 Tax=Pleosporineae RepID=B2W3B1_PYRTR Length = 441 Score = 231 bits (590), Expect = 3e-59, Method: Composition-based stats. Identities = 83/201 (41%), Positives = 124/201 (61%), Gaps = 13/201 (6%) Query: 27 DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARA 86 K Y+VW+SEVMLQQT+V+TVIPYF ++A++PTV DLA A D+VL +W GLGYY+RA Sbjct: 25 SKRAYEVWVSEVMLQQTRVSTVIPYFNNWIAKWPTVQDLAAANHDDVLSVWKGLGYYSRA 84 Query: 87 RNLHKAAQQVA--TLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRV 144 LH+ A+++ + G P E+ PG+GR TAGA+ S++ G+ P+LDGNV RV Sbjct: 85 TRLHEGAKKMMSASEDAGTIPSGAAELQEFPGIGRYTAGAVSSIAFGEAEPVLDGNVARV 144 Query: 145 LARCYAVSGWPGKKEVENKLWSLSEQVTP----------AVGVERFNQAMMDLGAMICTR 194 L+R + K+ + LW +++Q+ + +NQA+M+LG+ +CT Sbjct: 145 LSRQLGLYVDVKDKKATDILWDMADQLIKYASDFPKTKTSATPGLWNQALMELGSTVCTP 204 Query: 195 SKPKCSLCPLQNGCIAAANNS 215 P+C+ CP+Q C A + Sbjct: 205 R-PRCNDCPIQATCRAYSEGK 224 Score = 41.0 bits (95), Expect = 0.064, Method: Composition-based stats. Identities = 9/66 (13%), Positives = 18/66 (27%), Gaps = 7/66 (10%) Query: 281 ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCM-------DEGNALWYNLAQPPSVGL 333 A + H F+H L + V + + W + A L Sbjct: 370 ARYVAHFPPLVHVFTHLKLTMHAYQFRVEADAAEAVDLECGDEPCTRKWVDAAAMGDETL 429 Query: 334 AAPVER 339 + + + Sbjct: 430 STGMRK 435 >UniRef50_A1RYX1 HhH-GPD family protein n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1RYX1_THEPD Length = 253 Score = 230 bits (587), Expect = 6e-59, Method: Composition-based stats. Identities = 69/213 (32%), Positives = 111/213 (52%), Gaps = 9/213 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + ++ +WY + GR PW+ + PY + +E +LQ+T+ TV FE F +R+P+ Sbjct: 40 EEVARRLAEWYRRRGR-DFPWRHTRDPYVILATEFLLQRTRAETVAKVFEEFFSRYPSPE 98 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LANA +E+ ++ LG RA L +AA+++ HGG P + EE+ L GVG A Sbjct: 99 SLANADPEELRKFFSRLGLVRRADALREAAREIVERHGGSVPRSKEELLKLKGVGPYIAS 158 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+L + P++D NV+RVL R S + + E++ + A Sbjct: 159 AVLCFAYSAPVPVVDTNVERVLGRAAGASS-------REEAEAFLERLLRHGNPREISLA 211 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSW 216 ++DLGA++CT KPKC CPL + C + Sbjct: 212 LIDLGALVCT-RKPKCPECPLSDLCSYRGSGRP 243 >UniRef50_C7YS47 Putative uncharacterized protein n=2 Tax=Nectriaceae RepID=C7YS47_NECH7 Length = 618 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 84/249 (33%), Positives = 134/249 (53%), Gaps = 40/249 (16%) Query: 7 SAQVLDWYDKYG-RKTLPWQI-----------------DKTPYKVWLSEVMLQQTQVATV 48 +LDW+D ++++PW+ ++ Y+VW+SE+MLQQT+VA V Sbjct: 126 RKALLDWFDGVSTKRSMPWRKAWINPKDHLPDDLRNLLERRAYEVWISEIMLQQTRVAVV 185 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATL--HGGKFPE 106 I Y+ ++MA++P++ DLA A D+VL W GLGYY+RA +H+A++ V G P Sbjct: 186 IDYWNKWMAKWPSIHDLAAASADDVLSAWRGLGYYSRATRIHEASKLVVKDPAMKGLLPS 245 Query: 107 TFEEVA-ALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE-NKL 164 +++ +PGVGR TAGAI ++ G+ P++DGNV RVL+R + G +V + L Sbjct: 246 CTQDLEAKVPGVGRYTAGAISAIVFGQAAPMVDGNVLRVLSRQLGLLGNVKTTKVVIDTL 305 Query: 165 WSLSEQVTPA-----------------VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNG 207 W+ ++ + A R+ QA+M+LG+ +C KP C CP+ + Sbjct: 306 WAAADALVKAVARDETGTPDGEEAETSDRPGRWGQALMELGSTVCVP-KPNCGECPITST 364 Query: 208 CIAAANNSW 216 C A A Sbjct: 365 CRAYAEGRT 373 Score = 74.6 bits (182), Expect = 5e-12, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 56/155 (36%), Gaps = 25/155 (16%) Query: 218 LYPGKKPKQTLPERTGYFLLLQHED-EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQ 276 +P K K+ + E +++ D + L+ +RP GL GL+ FP + ++ + Sbjct: 462 KFPLKVIKKAVREEETLVCVIRRGDGQYLVQKRPEKGLLAGLWEFPSYILQDPAEGSTSA 521 Query: 277 R----------QIAADNLTQLTAFRHT---------FSHFHLDIVPMWLPVS-----SFT 312 + +I ++ + + +H FSH L + + S + Sbjct: 522 KRKSKALVYLSKIGGEHDHKTSKVKHGGELGSVPWLFSHLKLTMHVHLFSLQDDDWLSDS 581 Query: 313 GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + W + + + + + ++ G Sbjct: 582 ESLKKSRLRWASSDEVDDESMGTGMRKCWSLVKDG 616 >UniRef50_Q18HE6 A/G-specific adenine glycosylase n=12 Tax=Halobacteriaceae RepID=Q18HE6_HALWD Length = 337 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 68/215 (31%), Positives = 104/215 (48%), Gaps = 10/215 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F +++ WY+ R+ PW+ PY + +SEVM QTQ+ V+ ++ F+ R+PT Sbjct: 21 DIDVFRNRLISWYEAEHRE-FPWRETDDPYAILVSEVMSHQTQLDRVVEAWKDFIQRWPT 79 Query: 62 VTDLANAPLDEVLHLWT--GLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 V LA V+ W+ LGY RA LH+AA QV + G P +E+ +L GVG Sbjct: 80 VKALAGDSQSAVVTFWSEHALGYNNRASYLHEAANQVVDEYDGTVPADPDELLSLMGVGP 139 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ S + ++D NV+RVL R + E ++ + P Sbjct: 140 YTANAVASFAFNNGDAVVDTNVERVLYRVFKQIRQADDPPYEQ----IASALLPVERSRT 195 Query: 180 FNQAMMDLGAMICTRSKPKCSL--CPLQNGCIAAA 212 +N A+M+LG + C P+C CP + C A Sbjct: 196 WNNAIMELGGVACK-KTPRCDEANCPWRQWCHAYQ 229 >UniRef50_A1A319 Probable A/G-specific adenine glycosylase n=3 Tax=Bifidobacterium RepID=A1A319_BIFAA Length = 340 Score = 228 bits (581), Expect = 3e-58, Method: Composition-based stats. Identities = 83/256 (32%), Positives = 131/256 (51%), Gaps = 19/256 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A + + + W+ R LPW+ +T P+ V +SEVM QQTQ+ V+PY+ +M R+P Sbjct: 39 DADEIATTLAQWWQTNAR-DLPWRFGRTTPWGVLVSEVMSQQTQMGRVVPYWTAWMDRWP 97 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 LA+AP +V+ W LGY RA L + A +A + + P T++E+ ALPG+G Sbjct: 98 DAAALADAPKSDVITAWGRLGYPRRALRLQECAHVIAYDYADELPHTYDELLALPGIGDY 157 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+LS + G+ ++D N++RVL+R + V G + +L++++ P + Sbjct: 158 TASAVLSFAFGERIAVVDTNIRRVLSRAFVGVESLGGSASPAER--ALAKRLLPDDDSAK 215 Query: 180 ----------FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 +NQA+M+LGA +CT P C CP+ C N PG ++T P Sbjct: 216 CRRFDRPSVVWNQAVMELGATVCTAKSPLCEACPIAGKCAFLRNGR----PGLGQRRTRP 271 Query: 230 ERTGYFLLLQHEDEVL 245 + Q VL Sbjct: 272 RQRFQGTDRQVRGLVL 287 >UniRef50_C1MWM2 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MWM2_9CHLO Length = 335 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 90/333 (27%), Positives = 123/333 (36%), Gaps = 85/333 (25%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 M QQTQ+ V Y+ R+ R+PT LA A +EV LW GLGYY RAR L A+ V Sbjct: 1 MSQQTQIDRVATYWLRWTERWPTAAHLAEATQEEVNELWAGLGYYRRARFLLDGARWVVA 60 Query: 99 LH--GGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVS--GW 154 GGK P + +PGVG TA A+ S++ + +DGNV RV++R V G Sbjct: 61 EDGGGGKMPSDAASLGKIPGVGPYTASAVASIAFDEPIAAVDGNVLRVVSRLACVRGGGD 120 Query: 155 PGKKEVE--NKLWSLSEQVTPAVGVERFN------------------------------- 181 K ++++ + A N Sbjct: 121 VTKPGTSAGKACKAVADALLCAERPGDHNQARSHEGSRASFFTHRSGSFTTLYLIACVAF 180 Query: 182 -----------------QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL------ 218 QAMM+LGA +CT PKC CP+ + C + A Sbjct: 181 RLTDDGLFFSYEIARRIQAMMELGATVCTPRNPKCGECPVASMCASRALELEEEANVITA 240 Query: 219 ---------YPGKKPKQTLPERTGYFLLLQ--HEDE--------------VLLAQRPPSG 253 P K K E T +++ D LL +RP G Sbjct: 241 GKEPFRVTDLPEKDKKADKREETVSVRVVRGTRGDTETGADGASKKSAAFFLLVKRPEGG 300 Query: 254 LWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ 286 L GL+ FP E++ A R A L + Sbjct: 301 LLSGLWEFPSVVMEDTDADEDALRAAADRLLAE 333 >UniRef50_C8RS99 A/G-specific DNA glycosylase n=1 Tax=Corynebacterium jeikeium ATCC 43734 RepID=C8RS99_CORJE Length = 385 Score = 225 bits (573), Expect = 2e-57, Method: Composition-based stats. Identities = 89/334 (26%), Positives = 132/334 (39%), Gaps = 51/334 (15%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 + + A++ WY + R LPW+ D TP+ + LSEVM QQT VA VIP + ++ R+ Sbjct: 50 VDSPDLPAKLNAWYARNAR-DLPWRHPDTTPWAILLSEVMSQQTPVARVIPLWRAWLERW 108 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT DLA AP E+L +W LGY RA L + A H G P E+ ALPG+G Sbjct: 109 PTPADLAAAPRSEILRMWANLGYPRRALRLKECAIACVERHDGAVPHDIAELEALPGIGH 168 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVEN--KLWSLSEQVTPAVGV 177 TA A+ + + G+ P++D NV+RV R G + + SL V P + Sbjct: 169 YTARAVAAFAFGQAVPVVDTNVRRVYRRLVDGRYLQGPARARDLADVASLLPHVDPDPRL 228 Query: 178 ER---------------------------------------FNQAMMDLGAMICTRSKPK 198 + A+M+LGA+ICT P Sbjct: 229 DGRQLLPPPQPQPTDTGTGTDSCAGGSTATARDADRDAANLMCAALMELGALICTAKSPA 288 Query: 199 CSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV--LLAQRPPSGLWG 256 C CP+ + C + + Q ++ LL + G+ Sbjct: 289 CDQCPVIDDCAWVQAGKPQPTEAEAQAAAQRVQKFVGTDRQVRGKIMGLLREAGEGGVTK 348 Query: 257 G----LYCFPQFADEESLRQWLAQRQIAADNLTQ 286 L+ P E Q L + A + + Sbjct: 349 KEIDLLW--PATEQLERALQSLLDDGLVARDGAK 380 >UniRef50_A8Q4W7 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8Q4W7_MALGO Length = 625 Score = 223 bits (569), Expect = 7e-57, Method: Composition-based stats. Identities = 87/273 (31%), Positives = 133/273 (48%), Gaps = 49/273 (17%) Query: 1 MQASQFSAQVLDWYDKYGRKTL-PWQID-------------------------------- 27 + ++L W+D L PW+ Sbjct: 52 LDQPDAVQRLLAWFDHKQHDRLMPWRQAWIDPNEDAPESTRVKRARAHETGTHRSAHNIV 111 Query: 28 -----KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGY 82 + Y+VW+SE+MLQQT+V T Y+ +++ +P++ LA+A +D+VL W GLGY Sbjct: 112 QEQIQRRAYEVWISEIMLQQTRVETARSYWLKWIEAWPSIDALADASVDDVLAAWRGLGY 171 Query: 83 YARARNLHKAAQQVATL--HGGKFPETFEEV-AALPGVGRSTAGAILSLSLGKHFPILDG 139 Y RAR +H+AAQ++ G++PE E+ +PGVG TAGAI S+ G PILDG Sbjct: 172 YGRARRIHEAAQKIMHDPSMKGQWPEYAHELCEKIPGVGPYTAGAISSIVFGHAVPILDG 231 Query: 140 NVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG-------VERFNQAMMDLGAMIC 192 NV RVL+R + P K + LW ++ + V R+NQ +M+LG+ +C Sbjct: 232 NVARVLSRQTGLYADPRSKSTNDLLWYMARMLVEHVAPNHRSDVPGRWNQGLMELGSTLC 291 Query: 193 TRSKPKCSLCPLQNGCIAAANNS-WALYPGKKP 224 T ++P C CP+Q+ C A + + P P Sbjct: 292 TPTRPACDTCPIQSTCAVYAESILYERAPVSSP 324 Score = 56.5 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 53/206 (25%), Gaps = 70/206 (33%) Query: 204 LQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE------VLLAQRPPSGLWGG 257 L + L+P + KQ + +++ LL QRP +GL Sbjct: 412 LSSSSALLRAKYIQLFPMRAAKQAPRVESRVVCIVRTTSLSSATPVFLLHQRPDTGLLAK 471 Query: 258 LYCFPQF----ADEESLRQWLAQRQIAADN------------------------------ 283 L+ FP ++S DN Sbjct: 472 LWEFPTVVVSPPGDQSTDNAKEGEGDYCDNRVNHTQGDIEVYKQQALSFVASLRPPAAKS 531 Query: 284 --------------------LTQLTAFRHTFSHFHLDIVPMWLPV---------SSFTGC 314 + L RH FSH H + + + V S+ G Sbjct: 532 APDPSDNKHVTITARLQVDHIQPLGFVRHEFSHLHWHMHVLLVDVTVEGQGTCSSTNNGA 591 Query: 315 MDEGN-ALWYNLAQPPSVGLAAPVER 339 +G+ W + + + + R Sbjct: 592 EMDGHTQQWRTAHEVEAASMGTGLRR 617 >UniRef50_B8DWV0 Endonuclease III n=11 Tax=Bifidobacteriaceae RepID=B8DWV0_BIFA0 Length = 247 Score = 222 bits (566), Expect = 2e-56, Method: Composition-based stats. Identities = 54/213 (25%), Positives = 92/213 (43%), Gaps = 7/213 (3%) Query: 28 KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA-RA 86 TP+++ ++ ++ QT V +PT LA+A L++V + LG+Y +A Sbjct: 34 HTPFELLVATILSAQTTDKRVNSITPELFGTYPTAAALADARLEDVESIIRPLGFYHVKA 93 Query: 87 RNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLA 146 ++ A+Q+ GG+ P+T EE+ +LPGVGR TA +L + +D +V RV Sbjct: 94 EHIIAVARQIVERFGGQIPQTMEELTSLPGVGRKTANVVLGNAFRVPGFPVDTHVIRVTG 153 Query: 147 RCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQN 206 R + W ++ P + ++ G ICT P+C CPL Sbjct: 154 RLHWRDDWMKTSTTPERIEQEITGCFPESEWTDLSHRLIIFGRNICTSRSPECENCPLLP 213 Query: 207 GCIAAAN------NSWALYPGKKPKQTLPERTG 233 C +A A + P++ +T Sbjct: 214 TCPSAGEFLEIAAERKARATRRAPRKPRSRKTA 246 >UniRef50_D0RRM3 A/G-specific adenine glycosylase n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RRM3_9RICK Length = 337 Score = 221 bits (564), Expect = 3e-56, Method: Composition-based stats. Identities = 93/346 (26%), Positives = 163/346 (47%), Gaps = 18/346 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTP----YKVWLSEVMLQQTQVATVIPYFERFMARF 59 S S ++L WY KY R LPW+ K YKV LSE MLQQTQV T +P+F F + Sbjct: 3 SSISKKILAWYQKYQR-DLPWRKYKNQKDRHYKVLLSEFMLQQTQVKTALPFFNNFYKKI 61 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 ++ L+ +V LW GLGYY RA+ L + ++ + + K P +E++ ALPG+G Sbjct: 62 NSLEKLSKTSQAKVNKLWQGLGYYRRAKFLLETSKIIKKKYNYKLPSQYEDLIALPGIGD 121 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA AILS++ K+ +DGNV+RV+ R + +SG + K+ + E++ Sbjct: 122 YTAKAILSIAFDKNEIGIDGNVERVVTRIFNISG-------KKKILNYVERLKVEKKASF 174 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 Q +M++GA+IC P C+ C L C ++N + + +K K + ++ + Sbjct: 175 LMQGLMEVGALICKPKLPLCNDCFLNKNCKYSSNKKFNI--DEKVKAEKKIKKFIAIIYR 232 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 ++LL G P + + + N L + S+F Sbjct: 233 KNKKILLRYEKNLGPLKDFLNVPLVLEISQEGRNYKKILKKNKNYQNLGILNISISNFKA 292 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 +I+ + + + N + S ++ ++++L++++ Sbjct: 293 EILC----LETKKMIKVNRPYYFMNATELKSKFISGFLKKILKKVK 334 >UniRef50_D1Z9Z8 Whole genome shotgun sequence assembly, scaffold_13 n=1 Tax=Sordaria macrospora RepID=D1Z9Z8_SORMA Length = 693 Score = 221 bits (564), Expect = 3e-56, Method: Composition-based stats. Identities = 94/277 (33%), Positives = 134/277 (48%), Gaps = 61/277 (22%) Query: 7 SAQVLDWYD--KYGRKTLPWQI-------------------------DKTPYKVWLSEVM 39 +L W+D Y R+ +PW+ + Y+V LSE M Sbjct: 164 RDALLTWFDSVTYDRR-MPWRKPWIDTTALLSSSSSSVCPDDLREAVAQRAYEVLLSETM 222 Query: 40 LQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATL 99 LQQT+VA VI Y+ +++A PT+ LA + VL LW+GLGYY+RA L AQQV Sbjct: 223 LQQTRVAPVIAYYSKWLAILPTMQSLAASNPATVLSLWSGLGYYSRATRLQTLAQQVCPP 282 Query: 100 -----HGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGW 154 + G P T E++ LPGVGR TAGA+ + G+ P++DGNV RVLAR + G Sbjct: 283 LEQGGYDGLLPHTVEKLMQLPGVGRYTAGAVACIVYGRAEPMVDGNVIRVLARQLGIRGD 342 Query: 155 PGKKEVENKLWSLSEQVTPAVGVE---------------------RFNQAMMDLGAMICT 193 KEV +W ++++ AV + ++ Q +M+LGA +C Sbjct: 343 VKSKEVLKVMWEAAKRLVEAVAWDGTDATDEERKGKEPPVSDRPGKWGQGLMELGATVCL 402 Query: 194 R--SKPKCSLCPLQNGCIAAANN-----SWALYPGKK 223 +KPKC LCP++ C A L PGK+ Sbjct: 403 GGLAKPKCGLCPIKGTCRAYQEGVEIAAQKGLLPGKE 439 Score = 71.1 bits (173), Expect = 6e-11, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 37/163 (22%) Query: 214 NSWALYPGKKPK-QTLPERTGYFLLLQH--EDEVLLAQRPPSGLWGGLYCFPQFA----- 265 + + YP KPK + L E + ++ + L+ +RP GL G++ P Sbjct: 521 SHCSKYPYIKPKTKKLREEECLVIAIRRPSDGRYLIHKRPAKGLLAGMWELPSHTLPADS 580 Query: 266 ---------DEESLRQWLAQ--RQIAADNL---------TQLTAFRHTFSHFHLDIVPMW 305 EE +R+ Q R++ ++L +L TFSHF L + Sbjct: 581 AWSNETVAMREEKVRKVSEQGLREVGVESLGVRYWGKSQEELGVVPWTFSHFKLTMHVWA 640 Query: 306 LPVSSFTGCMDEGNA---------LWYNLAQPPSVGLAAPVER 339 + ++ G G+ W + + + +++ Sbjct: 641 VDLAEGDGQSVSGDYEQEGDAKVKRWATMEEIEGENMGTGMKK 683 >UniRef50_A0LRD5 HhH-GPD family protein n=5 Tax=Actinomycetales RepID=A0LRD5_ACIC1 Length = 335 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 84/239 (35%), Positives = 122/239 (51%), Gaps = 4/239 (1%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A + +VL WY ++GR+ LPW+ TP++V +SEVMLQQT V+ V+P + + AR+PT Sbjct: 42 AQRVVHRVLAWYRRHGRRDLPWRRSDVTPWQVLVSEVMLQQTPVSRVLPVYAVWTARWPT 101 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA A + + W LGY RA LH+AA+ + GG P+ +A LPG+GR T Sbjct: 102 PQSLAAATPADAVRAWGRLGYPRRALWLHQAARAIVDRFGGIVPDEPGVLATLPGIGRYT 161 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA--VGVER 179 A A+ + + + +LD NV+RVLAR P + + P + Sbjct: 162 AAAVAAFAYRRRVAVLDTNVRRVLARFLTGVPHP-TGTPRAAEHRSLDALLPKNADRAAQ 220 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 F+ A+M+LGA+ICT P C+ CPL C A + T +R LL Sbjct: 221 FSVALMELGALICTSRSPGCARCPLTTDCAWHRAGRPAGTRRPRAPYTGSDRQARGALL 279 >UniRef50_B7KLS5 HhH-GPD family protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KLS5_CYAP7 Length = 230 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 69/208 (33%), Positives = 115/208 (55%), Gaps = 2/208 (0%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++ W ++ R PW+ + PY + ++E++LQ+T TV P + F+ R+PT+ L Sbjct: 21 LRKKLRKWGERNFR-DFPWRHTRDPYAILIAEILLQRTSARTVAPVYLEFLRRYPTLMTL 79 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 + A E+ L LG +RA NL + A L+GG+ P++ EE+ LPGVG+ TA A+ Sbjct: 80 SQAREHELSDLMRPLGLRSRAANLKRLAITAIALYGGELPDSEEELLKLPGVGKYTARAV 139 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 + + G +LD NV R+L R + G ++ + LWS+++ V +R+N ++ Sbjct: 140 CANAYGHPLAVLDVNVARILRRFFGFDGTKIERR-DAFLWSVAQAVALKRETDRWNLTLI 198 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAAN 213 D GA +C +KP C CPL+ C + N Sbjct: 199 DFGAEVCRATKPNCRDCPLRGKCQFSLN 226 >UniRef50_Q03K95 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=5 Tax=Lactobacillales RepID=Q03K95_STRTD Length = 214 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 59/202 (29%), Positives = 95/202 (47%), Gaps = 10/202 (4%) Query: 15 DKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVL 74 D +G L W +TP+++ ++ ++ QT V AR+P + DLA+A LD+V Sbjct: 20 DAHG--ELEW---ETPFQLLVAVILSAQTTDKAVNKVTPGLWARYPEIEDLASANLDDVE 74 Query: 75 HLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKH 133 +G Y +A+N+ K A+ + G+ P+T +E+ +LPGVGR TA +L+ G Sbjct: 75 MCLRTIGLYKNKAKNIIKTARAILMNFDGQVPKTHKELESLPGVGRKTANVVLAEVYGIP 134 Query: 134 FPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICT 193 +D +V RV R +E+E +L + P + M+ G C Sbjct: 135 SIAVDTHVSRVSKRLNIAPENASVEEIEAELM----KKIPKKDWIISHHRMIFFGRYHCL 190 Query: 194 RSKPKCSLCPLQNGCIAAANNS 215 PKC CPLQ+ C + Sbjct: 191 AKNPKCQTCPLQSYCKYYRETT 212 >UniRef50_Q5M3Y9 Endonuclease III, DNA repair n=21 Tax=Firmicutes RepID=Q5M3Y9_STRT2 Length = 219 Score = 218 bits (556), Expect = 3e-55, Method: Composition-based stats. Identities = 59/202 (29%), Positives = 95/202 (47%), Gaps = 10/202 (4%) Query: 15 DKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVL 74 D +G L W +TP+++ ++ ++ QT V AR+P + DLA+A LD+V Sbjct: 25 DAHG--ELEW---ETPFQLLVAVILSAQTTDKAVNKVTPGLWARYPEIEDLASANLDDVE 79 Query: 75 HLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKH 133 +G Y +A+N+ K A+ + G+ P+T +E+ +LPGVGR TA +L+ G Sbjct: 80 MCLRTIGLYKNKAKNIIKTARAILMNFDGQVPKTHKELESLPGVGRKTANVVLAEVYGIP 139 Query: 134 FPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICT 193 +D +V RV R +E+E +L + P + M+ G C Sbjct: 140 SIAVDTHVSRVSKRLNIAPENASVEEIEAELM----KKIPKKDWIISHHRMIFFGRYHCL 195 Query: 194 RSKPKCSLCPLQNGCIAAANNS 215 PKC CPLQ+ C + Sbjct: 196 AKNPKCQTCPLQSYCKYYRETT 217 >UniRef50_C8WS61 Endonuclease III n=10 Tax=Firmicutes RepID=C8WS61_ALIAD Length = 220 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 47/191 (24%), Positives = 89/191 (46%), Gaps = 6/191 (3%) Query: 29 TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLG-YYARAR 87 TP+++ ++ ++ Q V R A++ A A DEV +G + ++++ Sbjct: 31 TPFELLVATILSAQCTDERVNMVTPRLFAKYRGPEGFAKASPDEVAEDIREVGLFRSKSK 90 Query: 88 NLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLAR 147 ++ + A+ + +GG+ P++ + + LPGVGR TA ++S + G +D +V+RV R Sbjct: 91 HIVETARILVDEYGGEVPKSRDRLMELPGVGRKTANVVVSNAYGVPAFAVDTHVQRVTNR 150 Query: 148 CYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNG 207 K + L P + + A++ G +CT KPKC +CP+ + Sbjct: 151 IGLAKSNDPLKTEQQVCAKL-----PPELWTKAHHALILHGRRVCTARKPKCHICPVADL 205 Query: 208 CIAAANNSWAL 218 C A N + Sbjct: 206 CQCARNQAEQK 216 >UniRef50_D1V787 HhH-GPD family protein n=4 Tax=Actinomycetales RepID=D1V787_9ACTO Length = 332 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 86/252 (34%), Positives = 131/252 (51%), Gaps = 12/252 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+ VL+W+ R+ LPW+ + P+ V +SEVMLQQT V V+P +E ++AR+P Sbjct: 42 FAEPVLEWFGLVARE-LPWRRPEAGPWAVLVSEVMLQQTPVNRVLPVYEAWLARWPDPAA 100 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA P E + +W LGY RA LH+AA + HGG P+ +++ ALPG+G TA A Sbjct: 101 LAAEPSGEAVRMWGRLGYPRRALRLHQAATAIVERHGGAVPDNLDDLLALPGIGTYTARA 160 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVERFNQ 182 + + + + P++D NV+R++AR G V K +L E + PA R + Sbjct: 161 VTAFAFRQRQPVIDVNVRRLVARAIEGRAE-GPVAVSRKDLALVEDLLPADAETAARASA 219 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIA-AANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 A M+LGA++C P+C+ CP++ C A + A P +KP + Q Sbjct: 220 AFMELGALVCVARAPRCAGCPVRERCAWLLAGSPPADGPARKP------QGYAGTDRQVR 273 Query: 242 DEVLLAQRPPSG 253 +L R G Sbjct: 274 GRLLAVLREADG 285 >UniRef50_O26858 Endonuclease III n=4 Tax=Euryarchaeota RepID=O26858_METTH Length = 233 Score = 217 bits (554), Expect = 4e-55, Method: Composition-based stats. Identities = 55/200 (27%), Positives = 90/200 (45%), Gaps = 9/200 (4%) Query: 26 IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA- 84 D+ PY+V + ++ Q+T+ R+P++ D+A APL+E+ L G+Y Sbjct: 41 EDRDPYRVLIRTILSQRTRDENTDEATASLFERYPSIEDVAYAPLEEIEALIRKAGFYHV 100 Query: 85 RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRV 144 +AR + + ++ + + GK P+ E+ LPGVGR TA +L + G+ +D +V R+ Sbjct: 101 KARRVREVSRIILEEYDGKVPDDINELLKLPGVGRKTANCVLVYAFGRPAIPVDTHVHRI 160 Query: 145 LARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPL 204 R V +E E L +V P N M+ G IC P+ CP+ Sbjct: 161 SNRIGLVDTR-TPEETERALM----KVIPREYWIELNDLMVQFGQDICRPLGPRHEECPI 215 Query: 205 QNGCIAAANNSWALYPGKKP 224 + C GKK Sbjct: 216 ADHCDYYFRIKEE---GKKK 232 >UniRef50_C6WGJ6 HhH-GPD family protein n=48 Tax=Actinomycetales RepID=C6WGJ6_ACTMD Length = 329 Score = 216 bits (551), Expect = 9e-55, Method: Composition-based stats. Identities = 82/230 (35%), Positives = 124/230 (53%), Gaps = 7/230 (3%) Query: 2 QASQFSAQVLD-WYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 +++ VL+ W+ R LPW+ + T + V +SE+MLQQT VA V P + ++ ++ Sbjct: 36 ESAHLPPSVLNTWFAATAR-DLPWRDPECTAWGVLVSEIMLQQTPVARVEPIWRVWLDKW 94 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P +D+A A EVL +W LGY RA LH AAQ VA H P+ E + ALPG+G Sbjct: 95 PRPSDMAAASQGEVLRMWGKLGYPRRALRLHAAAQAVAAEHDDVVPDDVETLLALPGIGA 154 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ + + G+ P++D NV+RV+AR +G G L + E + P Sbjct: 155 YTARAVAAFAYGRRCPVVDTNVRRVVARAVHGAGDAGPPSTTRDLRDV-EALLPEDEASA 213 Query: 180 --FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWA-LYPGKKPKQ 226 ++ A+M+LGA++CT P+CS CP+ C N A P K ++ Sbjct: 214 ATYSAALMELGALVCTARTPRCSACPVLGSCQWQRNGRPAYDGPAKAVQK 263 >UniRef50_Q0W1Z4 Putative DNA glycosylase n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W1Z4_UNCMA Length = 212 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 67/209 (32%), Positives = 114/209 (54%), Gaps = 2/209 (0%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L+W+ K R PW+ +PY V ++E MLQ+T+ V+P + +F+ ++P V Sbjct: 3 EIIKNLLNWHKKNYR-PYPWRNVSSPYMVMIAEFMLQRTRADQVVPVYNQFLKKYPDVDQ 61 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A ++++ LG Y RA + AA+ + G P+ E++ +PGVG AGA Sbjct: 62 LAEADIEDIKATLKPLGLYWRANHFKMAAEYIQRTFSGNIPDNKEDLKNIPGVGDYAAGA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 IL+++ K I+D N+ RVL R Y + G G+ + ++ L+ Q+ + A+ Sbjct: 122 ILAVAFRKKSCIVDSNIARVLNRYYGL-GLNGEIRRKKEIVELACQLFNHKEPNKILFAI 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 +D A++CT PK +CPL+N C+ N Sbjct: 181 IDFSAIVCTPVNPKHGICPLKNNCLYYKN 209 >UniRef50_Q11NT5 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=28 Tax=Bacteroidetes RepID=Q11NT5_CYTH3 Length = 218 Score = 216 bits (550), Expect = 1e-54, Method: Composition-based stats. Identities = 57/220 (25%), Positives = 93/220 (42%), Gaps = 6/220 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 MQ + + LD++ + + + PY++ ++ + Q V R+P Sbjct: 1 MQRKERFEKFLDYFSTHSPEPETELVYSNPYELLVAVSLSAQCTDKRVNLTTPALFNRYP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGY-YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 LA A DEV H + Y +A++L AQ + G+ P+T E++ LPGVGR Sbjct: 61 DAASLAKATSDEVFHYIRSISYPNNKAKHLVGMAQMLMKDFNGEVPDTVEDLVKLPGVGR 120 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK-EVENKLWSLSEQVTPAVGVE 178 TA I S+ + +D +V RV R V VE +L + P V Sbjct: 121 KTANVIASVIWQQPTMAVDTHVYRVSRRIGLVPQTATTPLAVEKQLM----KYIPTALVH 176 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 + + ++ G C PKC +CP+ C+ + N L Sbjct: 177 KAHHWLILHGRYTCLARTPKCEVCPVTEICMWYSKNKKKL 216 >UniRef50_Q9P9L6 U/G & T/G mismatch specific glycosylase (Pa-MIG) n=2 Tax=Thermoprotei RepID=Q9P9L6_PYRAE Length = 230 Score = 215 bits (549), Expect = 2e-54, Method: Composition-based stats. Identities = 62/218 (28%), Positives = 113/218 (51%), Gaps = 3/218 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F ++ WY ++G K LPW+ P+ V ++ ++L++T V V+ + F+ R+P+ Sbjct: 13 TKFRDAIIKWYREFGEKDLPWRKAGDPWAVLVAALLLRKTTVKQVVDIYREFLRRYPSPA 72 Query: 64 DLANAPLDEVLHLWTGLGY-YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+A ++E+ + LG + RA L K ++++ G+ P + + +LPGVG A Sbjct: 73 RLADASVEEIKAIIQPLGMEHVRATLLKKLSEELVRRFNGQIPCDRDALKSLPGVGDYAA 132 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA--VGVERF 180 +L + GK P+LD N+ RV+ R + + + + +LW+ + + P + F Sbjct: 133 SEVLLTACGKPEPLLDRNMIRVIERVFGIKSKKRRPHTDRELWNFARSLVPRDPELAKEF 192 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 N ++D +CT PKCSLCPL N + Sbjct: 193 NFGVLDFARKVCTAKSPKCSLCPLANNVCVFYQKRERV 230 >UniRef50_B7G7W7 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B7G7W7_PHATR Length = 645 Score = 213 bits (542), Expect = 8e-54, Method: Composition-based stats. Identities = 77/170 (45%), Positives = 102/170 (60%), Gaps = 4/170 (2%) Query: 29 TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARN 88 T Y VW+SE+MLQQT+V V+PY+ R+M RFPTV DLA A D V W GLG+Y RAR Sbjct: 190 TAYGVWVSEIMLQQTRVEAVVPYWVRWMIRFPTVHDLALADEDAVNAHWAGLGFYRRARL 249 Query: 89 LHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARC 148 LH AA+ + G PE +E+ LPGVGR TA AI S++ + P++DGNV RVLAR Sbjct: 250 LHSAAKYIVNDCNGALPENVQELLHLPGVGRYTASAIASIAFNVNVPVVDGNVCRVLARL 309 Query: 149 YAVSGWPGKKEVEN-KLWSLSEQVTPAVG---VERFNQAMMDLGAMICTR 194 ++ +++ W L+ Q+ A NQA+M+LGA C Sbjct: 310 RGIANNIKAPALKDNHGWDLAAQIVSAGDGSAPGEVNQAIMELGATYCAP 359 >UniRef50_Q1ASN5 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=4 Tax=Bacteria RepID=Q1ASN5_RUBXD Length = 214 Score = 213 bits (542), Expect = 8e-54, Method: Composition-based stats. Identities = 56/214 (26%), Positives = 97/214 (45%), Gaps = 12/214 (5%) Query: 1 MQASQFSAQVLDWYDKY--GRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 M A+ + +Y + L W P ++ ++ ++ Q V ER + Sbjct: 1 MSAAPVGEVIARLKREYPDAKTALNW---SNPLELLVAVILSAQCTDERVNRVTERLFRK 57 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 + T D A+APL+E+ G+Y +AR + A+ + HGG+ P+T EE+ ALPGV Sbjct: 58 YRTAEDYASAPLEELEQDIRPTGFYRNKARAIQGMARALLERHGGEVPKTMEELVALPGV 117 Query: 118 GRSTAGAILSLSLGKHF-PILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 GR TA +L + G + ++D +V+RV R ++ +++E L P Sbjct: 118 GRKTANVVLGNAFGVNEGVVVDTHVRRVSRRL-GLTSSDDPEKIERDLLPQ----VPEEE 172 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 F ++ G +C +P C C L + C + Sbjct: 173 RTLFAHLLIFHGRRVCKARRPDCPGCVLNDICPS 206 >UniRef50_A4E9V4 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E9V4_9ACTN Length = 221 Score = 213 bits (542), Expect = 9e-54, Method: Composition-based stats. Identities = 56/205 (27%), Positives = 98/205 (47%), Gaps = 11/205 (5%) Query: 15 DKYGRKTLP---WQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD 71 ++ R+ P + + P+++ ++ ++ QT A V + A++PT +A A + Sbjct: 18 ERLNRRYGPVECFLDHENPFRLLIAVLLSAQTTDAQVNKVTPKLFAQWPTPEAMAGASVA 77 Query: 72 EVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 +V LG+Y ++A++ +AAQ + +GG+ P +E+ LPGVGR TA +L++ Sbjct: 78 DVADTIKSLGFYKSKAKHAVEAAQMIVADYGGEVPADMKELVKLPGVGRKTANIVLNVGY 137 Query: 131 GKHFPI-LDGNVKRVLARCYAVSGWPGKK--EVENKLWSLSEQVTPAVGVERFNQAMMDL 187 G I +D +V R+ R K+ + E L ++ P E N + Sbjct: 138 GIVEGIAVDTHVNRIAHRLMLSPKTHAKEPLKTEQDLL----KILPHEYWESVNHQWITF 193 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAA 212 G IC KPKC CPL + C + Sbjct: 194 GREICDARKPKCDECPLADLCPSVR 218 >UniRef50_C5WHA6 Endonuclease III n=7 Tax=Firmicutes RepID=C5WHA6_STRDG Length = 222 Score = 212 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 54/198 (27%), Positives = 94/198 (47%), Gaps = 8/198 (4%) Query: 18 GRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLW 77 + L W +TP+++ ++ ++ QT V A++P + DLA+A + +V + Sbjct: 26 AKGELDW---ETPFQLLIAVILSAQTTDKAVNKVTPALWAKYPEIEDLASANVTDVENCL 82 Query: 78 TGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPI 136 +G Y +A+N+ K AQ + T GG+ P+T +E+ +LPGVGR TA +L+ Sbjct: 83 RTIGLYKNKAKNIIKTAQAMLTDFGGQVPKTHKELESLPGVGRKTANVVLAEVYAVPAIA 142 Query: 137 LDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSK 196 +D +V RV R + E+E L + P + ++ G C Sbjct: 143 VDTHVSRVAKRLNVSAPDADVTEIEQDLMAK----IPKKDWIITHHRLIFFGRYHCLAKN 198 Query: 197 PKCSLCPLQNGCIAAANN 214 PKC++CP+Q C + Sbjct: 199 PKCAICPVQTYCKYYKDT 216 >UniRef50_C0QG99 Putative endonuclease III n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QG99_DESAH Length = 212 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 60/190 (31%), Positives = 89/190 (46%), Gaps = 6/190 (3%) Query: 27 DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA-R 85 KTP+++ ++ ++ Q V RFPT LA A LDE+ + G+Y + Sbjct: 28 HKTPFQLLVATILSAQCTDVQVNRVTPVLFDRFPTPDKLAGASLDEIKPIVFSTGFYNNK 87 Query: 86 ARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVL 145 A+N+ AQ + T+HGG P + + LPGVGR TA + S++ G ++D +V RV Sbjct: 88 AKNIKACAQSIMTVHGGIVPTSMTALTGLPGVGRKTANLVRSVAFGMDTIVVDTHVYRVS 147 Query: 146 ARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQ 205 R G +VE+ L + + P M+ LG C KP C CPLQ Sbjct: 148 RRLGLSKGL-NPAKVESDLMA----IIPQKSWNDLCLQMIYLGREFCDARKPLCRKCPLQ 202 Query: 206 NGCIAAANNS 215 C + NS Sbjct: 203 EICPSNMKNS 212 >UniRef50_Q9WYK0 Endonuclease III n=12 Tax=Thermotogaceae RepID=END3_THEMA Length = 213 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 99/224 (44%), Gaps = 13/224 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + + +++ + + ++T P++V +S V+ Q+T+ ++ + T Sbjct: 2 IEELAREIVKRFPRNHKET-------DPFRVLISTVLSQRTRDENTEKASKKLFEVYRTP 54 Query: 63 TDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 +LA A +++ L G Y +A + + ++ + +GG+ P++ EE+ LPGVGR T Sbjct: 55 QELAKAKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGGRVPDSLEELLKLPGVGRKT 114 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A +L + K +D +V R+ R GW + E +L +++ P N Sbjct: 115 ANIVLWVGFKKPALAVDTHVHRISNRL----GWVKTRTPEETEEAL-KKLLPEDLWGPIN 169 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK 225 +M++ G IC P C C L+N C + + Sbjct: 170 GSMVEFGRRICKPQNPLCEECFLKNHCEFYRRRGKGEVRNRTER 213 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_Q67NN1 A/G-specific adenine glycosylase n=1 Tax=Symbiob... 413 e-114 UniRef50_A0Z8D4 A/G-specific adenine glycosylase MutY n=2 Tax=Ga... 402 e-111 UniRef50_Q2S0Q6 A/G-specific adenine glycosylase n=1 Tax=Salinib... 401 e-110 UniRef50_D1RGZ1 A/G-specific adenine glycosylase n=1 Tax=Legione... 397 e-109 UniRef50_B8KUN1 A/G-specific adenine glycosylase n=2 Tax=Gammapr... 396 e-109 UniRef50_C6N1N5 A/G specific adenine glycosylase n=1 Tax=Legione... 395 e-108 UniRef50_Q2SMB7 A/G-specific adenine glycosylase n=10 Tax=Gammap... 394 e-108 UniRef50_C7R6W3 A/G-specific adenine glycosylase n=1 Tax=Kangiel... 394 e-108 UniRef50_C9R2J3 A/G-specific adenine glycosylase n=1 Tax=Aggrega... 392 e-108 UniRef50_D0MK94 A/G-specific adenine glycosylase n=1 Tax=Rhodoth... 391 e-107 UniRef50_B8FCI1 A/G-specific adenine glycosylase n=1 Tax=Desulfa... 391 e-107 UniRef50_B7GII9 A/G-specific adenine glycosylase n=76 Tax=Bacill... 391 e-107 UniRef50_Q3A7R2 A/G-specific DNA-adenine glycosylase n=2 Tax=Des... 390 e-107 UniRef50_Q3KJJ2 A/G-specific adenine glycosylase n=5 Tax=Gammapr... 389 e-107 UniRef50_A9KCK4 A/G-specific adenine DNA glycosylase n=6 Tax=Cox... 387 e-106 UniRef50_Q2LY55 A/g-specific DNA glycosylase n=1 Tax=Syntrophus ... 386 e-106 UniRef50_A8MJ42 A/G-specific adenine glycosylase n=2 Tax=Firmicu... 384 e-105 UniRef50_C9PAA7 A/G-specific adenine glycosylase n=2 Tax=Vibrio ... 384 e-105 UniRef50_Q8CV61 A/G-specific adenine glycosylase n=1 Tax=Oceanob... 383 e-105 UniRef50_UPI0000E105D2 A/G-specific adenine glycosylase n=1 Tax=... 383 e-105 UniRef50_Q3J8X1 A/G-specific DNA-adenine glycosylase n=13 Tax=Ga... 382 e-104 UniRef50_B3QVZ4 A/G-specific adenine glycosylase n=1 Tax=Chloroh... 382 e-104 UniRef50_B8HXE9 A/G-specific adenine glycosylase n=31 Tax=Cyanob... 381 e-104 UniRef50_A7RIG3 Predicted protein n=1 Tax=Nematostella vectensis... 381 e-104 UniRef50_A6TW33 A/G-specific adenine glycosylase n=1 Tax=Alkalip... 381 e-104 UniRef50_A6LDG5 A/G-specific adenine glycosylase n=33 Tax=Bacter... 380 e-104 UniRef50_D2RMB6 A/G-specific adenine glycosylase n=2 Tax=Acidami... 379 e-104 UniRef50_A2RLW5 A/G-specific adenine glycosylase n=4 Tax=Lactoco... 379 e-104 UniRef50_D1N1B7 A/G-specific adenine glycosylase n=2 Tax=Bacteri... 379 e-103 UniRef50_UPI0001BC503C adenine glycosylase n=1 Tax=Neisseria muc... 379 e-103 UniRef50_C4L361 A/G-specific adenine glycosylase n=1 Tax=Exiguob... 378 e-103 UniRef50_C0QGE4 MutY n=1 Tax=Desulfobacterium autotrophicum HRM2... 377 e-103 UniRef50_C0EDR1 Putative uncharacterized protein n=2 Tax=Clostri... 377 e-103 UniRef50_Q2YCY4 A/G-specific DNA-adenine glycosylase n=4 Tax=Nit... 377 e-103 UniRef50_C5VJP5 A/G-specific adenine glycosylase n=1 Tax=Prevote... 376 e-103 UniRef50_P44320 A/G-specific adenine glycosylase n=308 Tax=Gamma... 376 e-103 UniRef50_A8PLA9 A/G-specific adenine glycosylase n=1 Tax=Rickett... 375 e-102 UniRef50_C6LA88 A/G-specific adenine glycosylase n=20 Tax=Clostr... 374 e-102 UniRef50_Q5WIF8 A/G-specific adenine DNA glycosylase n=11 Tax=Ba... 374 e-102 UniRef50_A1KUL5 Adenine glycosylase n=32 Tax=Neisseriaceae RepID... 374 e-102 UniRef50_A0LLV9 A/G-specific DNA-adenine glycosylase n=1 Tax=Syn... 373 e-102 UniRef50_B0RRL4 A/G-specific adenine glycosylase n=15 Tax=Gammap... 373 e-102 UniRef50_A4VV79 A/G-specific DNA glycosylase n=98 Tax=Lactobacil... 372 e-101 UniRef50_C2H8F6 A/G-specific adenine glycosylase n=16 Tax=Entero... 372 e-101 UniRef50_Q9UIF7 A/G-specific adenine DNA glycosylase n=51 Tax=Ch... 371 e-101 UniRef50_O31584 Probable A/G-specific adenine glycosylase yfhQ n... 370 e-101 UniRef50_Q1DAW2 A/G-specific adenine glycosylase n=1 Tax=Myxococ... 370 e-101 UniRef50_A3ZT46 A/G-specific adenine glycosylase n=1 Tax=Blastop... 370 e-101 UniRef50_B9XFP7 A/G-specific adenine glycosylase n=1 Tax=bacteri... 369 e-101 UniRef50_D1BPM8 A/G-specific adenine glycosylase n=3 Tax=Veillon... 369 e-100 UniRef50_Q03V36 A/G-specific DNA-adenine glycosylase n=2 Tax=Leu... 369 e-100 UniRef50_UPI0000E87CC6 A/G-specific adenine glycosylase n=1 Tax=... 368 e-100 UniRef50_A8R947 Putative uncharacterized protein n=1 Tax=Eubacte... 367 e-100 UniRef50_C5EQF4 Putative uncharacterized protein n=2 Tax=Clostri... 367 e-100 UniRef50_B9E7Y8 A/G-specific adenine glycosylase homolog n=4 Tax... 367 e-100 UniRef50_D1W5W3 A/G-specific adenine glycosylase n=1 Tax=Prevote... 367 e-100 UniRef50_A6NQR4 Putative uncharacterized protein n=1 Tax=Bactero... 366 e-100 UniRef50_B8JJK1 Novel protein similar to H.sapiens MUTYH, mutY h... 366 e-100 UniRef50_C9LNH1 A/G-specific adenine glycosylase n=1 Tax=Dialist... 366 e-100 UniRef50_C3X6B5 A/G-specific adenine glycosylase n=1 Tax=Oxaloba... 365 1e-99 UniRef50_B5JTP2 A/G-specific adenine glycosylase n=1 Tax=gamma p... 365 1e-99 UniRef50_A5IU34 A/G-specific DNA-adenine glycosylase n=56 Tax=St... 365 2e-99 UniRef50_Q89A45 A/G-specific adenine glycosylase n=6 Tax=Buchner... 364 4e-99 UniRef50_C8US26 A/G-specific DNA glycosylase n=18 Tax=Lactobacil... 364 4e-99 UniRef50_B8DFN3 A/G-specific adenine glycosylase n=14 Tax=Lister... 364 4e-99 UniRef50_A5EV94 A-G-specific adenine glycosylase n=1 Tax=Dichelo... 363 4e-99 UniRef50_C1ZJ43 A/G-specific DNA-adenine glycosylase n=1 Tax=Pla... 363 5e-99 UniRef50_C6D1C3 A/G-specific adenine glycosylase n=4 Tax=Bacilla... 363 6e-99 UniRef50_C4FU23 Putative uncharacterized protein n=1 Tax=Catonel... 362 8e-99 UniRef50_A1VKP4 A/G-specific DNA-adenine glycosylase n=6 Tax=Com... 362 9e-99 UniRef50_Q11NU4 A/G-specific adenine glycosylase n=5 Tax=Bactero... 362 2e-98 UniRef50_Q8A7M6 A/G-specific adenine glycosylase n=2 Tax=Bactero... 361 2e-98 UniRef50_Q5HN66 A/G-specific adenine glycosylase n=5 Tax=Staphyl... 361 2e-98 UniRef50_C9LHH9 A/G-specific adenine glycosylase n=4 Tax=Bactero... 360 4e-98 UniRef50_C6ACA0 A/G-specific adenine glycosylase MutY n=6 Tax=Rh... 359 6e-98 UniRef50_A8ZZH0 A/G-specific adenine glycosylase n=1 Tax=Desulfo... 359 7e-98 UniRef50_B0MYW3 Putative uncharacterized protein n=1 Tax=Alistip... 359 7e-98 UniRef50_C9MR28 A/G-specific adenine glycosylase n=4 Tax=Prevote... 359 8e-98 UniRef50_Q1MS05 A/G-specific DNA glycosylase n=1 Tax=Lawsonia in... 359 1e-97 UniRef50_A3MA77 A/G specific adenine glycosylase n=18 Tax=Acinet... 359 1e-97 UniRef50_Q03P68 A/G-specific DNA-adenine glycosylase n=2 Tax=Lac... 358 2e-97 UniRef50_A4T064 A/G-specific adenine glycosylase n=2 Tax=Polynuc... 357 2e-97 UniRef50_C8WS48 A/G-specific adenine glycosylase n=2 Tax=Alicycl... 357 3e-97 UniRef50_A9KT85 A/G-specific adenine glycosylase n=3 Tax=Clostri... 357 3e-97 UniRef50_B8J1Y9 A/G-specific adenine glycosylase n=12 Tax=Desulf... 357 4e-97 UniRef50_Q7NMA0 A/G-specific adenine glycosylase n=1 Tax=Gloeoba... 356 6e-97 UniRef50_A9EQ53 A/G-specific adenine glycosylase n=1 Tax=Sorangi... 356 8e-97 UniRef50_C7G751 A/G-specific adenine glycosylase n=1 Tax=Rosebur... 356 8e-97 UniRef50_D0L1V0 A/G-specific adenine glycosylase n=1 Tax=Halothi... 356 9e-97 UniRef50_B9Y8G7 Putative uncharacterized protein n=1 Tax=Holdema... 356 1e-96 UniRef50_A6CCE7 A/G-specific adenine glycosylase n=1 Tax=Plancto... 354 2e-96 UniRef50_A1WW55 A/G-specific DNA-adenine glycosylase n=1 Tax=Hal... 354 2e-96 UniRef50_Q0P4V1 Putative uncharacterized protein MGC145569 n=1 T... 354 3e-96 UniRef50_A7VHH6 Putative uncharacterized protein n=1 Tax=Clostri... 354 4e-96 UniRef50_A6GM64 Probable a/g-specific adenine glycosylase protei... 352 9e-96 UniRef50_UPI000196B020 hypothetical protein CATMIT_00928 n=1 Tax... 352 1e-95 UniRef50_C3J8R7 A/G-specific adenine glycosylase n=2 Tax=Bacteri... 351 2e-95 UniRef50_A6G959 A/G-specific adenine glycosylase n=1 Tax=Plesioc... 350 3e-95 UniRef50_C4G9U8 Putative uncharacterized protein n=1 Tax=Shuttle... 350 3e-95 UniRef50_A9T1M0 Predicted protein n=1 Tax=Physcomitrella patens ... 350 5e-95 UniRef50_D1U9Y5 A/G-specific adenine glycosylase n=1 Tax=Desulfo... 350 6e-95 UniRef50_A0M7B9 A/G-specific adenine glycosylase n=16 Tax=Bacter... 349 8e-95 UniRef50_UPI00016C4021 A/G-specific adenine glycosylase n=1 Tax=... 349 1e-94 UniRef50_Q7UYZ0 A/G-specific adenine glycosylase n=1 Tax=Rhodopi... 347 3e-94 UniRef50_UPI0001BC63FA A/G-specific adenine glycosylase n=3 Tax=... 347 5e-94 UniRef50_Q6AKD3 Related to A/G-specific adenine glycosylase n=1 ... 344 3e-93 UniRef50_Q7MUV2 A/G-specific adenine glycosylase n=2 Tax=Porphyr... 344 3e-93 UniRef50_C9LTK5 A/G-specific adenine glycosylase n=1 Tax=Selenom... 344 4e-93 UniRef50_C9KJM4 A/G-specific adenine glycosylase n=3 Tax=Clostri... 343 6e-93 UniRef50_C4V5S7 Adenine glycosylase n=1 Tax=Selenomonas flueggei... 343 7e-93 UniRef50_C6XRC5 HhH-GPD family protein n=1 Tax=Hirschia baltica ... 343 7e-93 UniRef50_C9PXZ8 A/G-specific adenine glycosylase n=2 Tax=Prevote... 342 2e-92 UniRef50_C7PL91 A/G-specific adenine glycosylase n=1 Tax=Chitino... 341 2e-92 UniRef50_B8FVN1 A/G-specific adenine glycosylase n=2 Tax=Desulfi... 341 2e-92 UniRef50_Q1Q8E2 A/G-specific DNA-adenine glycosylase n=4 Tax=Mor... 339 8e-92 UniRef50_Q02CA8 A/G-specific DNA-adenine glycosylase n=1 Tax=Can... 339 1e-91 UniRef50_C6X0N8 A/G-specific adenine glycosylase n=1 Tax=Flavoba... 338 2e-91 UniRef50_A9NG45 A/G-specific adenine DNA glycosylase n=1 Tax=Ach... 338 2e-91 UniRef50_A7HQZ1 A/G-specific adenine glycosylase n=45 Tax=Alphap... 337 2e-91 UniRef50_B7CBH3 Putative uncharacterized protein n=1 Tax=Eubacte... 336 7e-91 UniRef50_C1XGJ6 A/G-specific DNA-adenine glycosylase n=1 Tax=Mei... 336 7e-91 UniRef50_D2MN98 Putative A/G-specific adenine glycosylase n=1 Ta... 336 8e-91 UniRef50_D2QT13 A/G-specific adenine glycosylase n=1 Tax=Spiroso... 336 9e-91 UniRef50_B3EUI2 A/G-specific adenine glycosylase n=1 Tax=Candida... 335 1e-90 UniRef50_C1F8W0 A/G-specific adenine glycosylase, putative n=1 T... 335 2e-90 UniRef50_Q2KBH8 A/G-specific adenine glycosylase protein n=11 Ta... 335 2e-90 UniRef50_Q04T89 A/G-specific DNA glycosylase n=4 Tax=Leptospira ... 334 2e-90 UniRef50_B1LXG3 A/G-specific adenine glycosylase n=74 Tax=Bacter... 334 3e-90 UniRef50_C8P1Q1 A/G-specific adenine glycosylase n=1 Tax=Erysipe... 333 5e-90 UniRef50_C0BLH9 A/G-specific adenine glycosylase n=2 Tax=Flavoba... 333 6e-90 UniRef50_C6XY88 Helix-hairpin-helix motif protein n=4 Tax=Sphing... 332 9e-90 UniRef50_A9BD72 A/G-specific DNA glycosylase n=3 Tax=Cyanobacter... 332 9e-90 UniRef50_C0YGU7 Possible A/G-specific adenine glycosylase n=1 Ta... 332 1e-89 UniRef50_C8WEE4 A/G-specific adenine glycosylase n=3 Tax=Zymomon... 332 1e-89 UniRef50_D2R2B9 HhH-GPD family protein n=1 Tax=Pirellula staleyi... 332 1e-89 UniRef50_Q3ANF7 Mutator mutT protein n=19 Tax=Cyanobacteria RepI... 332 1e-89 UniRef50_B5EQC0 A/G-specific adenine glycosylase n=3 Tax=Acidith... 331 2e-89 UniRef50_Q6MF36 Putative A/G-specific adenine glycosylase, mutY ... 329 7e-89 UniRef50_Q3KMR2 A/G-specific adenine DNA glycosylase n=9 Tax=Chl... 329 9e-89 UniRef50_C2MB00 A/G-specific adenine glycosylase n=1 Tax=Porphyr... 327 4e-88 UniRef50_B1Y3P3 A/G-specific adenine glycosylase n=8 Tax=Burkhol... 327 4e-88 UniRef50_D1PLU8 A/G-specific adenine glycosylase n=2 Tax=Ruminoc... 326 8e-88 UniRef50_B4D777 HhH-GPD family protein n=1 Tax=Chthoniobacter fl... 324 4e-87 UniRef50_Q7VRG8 A/G-specific adenine glycosylase n=2 Tax=Candida... 324 4e-87 UniRef50_Q254A0 Adenine/guanine glycosylase n=5 Tax=Chlamydophil... 323 6e-87 UniRef50_Q2QW21 Os12g0211400 protein n=6 Tax=Poaceae RepID=Q2QW2... 323 7e-87 UniRef50_Q3SPD3 A/G-specific DNA-adenine glycosylase n=2 Tax=Nit... 322 1e-86 UniRef50_Q2G9R8 A/G-specific DNA-adenine glycosylase n=7 Tax=Alp... 322 1e-86 UniRef50_Q2N933 Putative uncharacterized protein n=1 Tax=Erythro... 321 2e-86 UniRef50_D0J8Q9 A/G-specific adenine glycosylase n=2 Tax=Blattab... 321 2e-86 UniRef50_C2KZ90 Putative uncharacterized protein n=1 Tax=Oribact... 321 2e-86 UniRef50_B3DVQ4 A/G-specific DNA glycosylase n=1 Tax=Methylacidi... 321 2e-86 UniRef50_B0SFS5 A/G-specific DNA glycosylase n=2 Tax=Leptospira ... 320 5e-86 UniRef50_UPI000034F1ED adenine-DNA glycosylase-related / MYH-rel... 319 1e-85 UniRef50_Q1GD89 A/G-specific DNA-adenine glycosylase n=11 Tax=Rh... 318 2e-85 UniRef50_C6XF00 A/G-specific adenine glycosylase n=1 Tax=Candida... 318 2e-85 UniRef50_B6BVY5 A/G-specific adenine glycosylase n=1 Tax=beta pr... 317 4e-85 UniRef50_A0L468 A/G-specific DNA-adenine glycosylase n=1 Tax=Mag... 315 1e-84 UniRef50_C7JG78 DNA glycosylase A/G-specific MutY n=12 Tax=Alpha... 315 2e-84 UniRef50_Q1ITJ1 A/G-specific DNA glycosylase n=1 Tax=Candidatus ... 314 3e-84 UniRef50_Q0EVU8 Adenine glycosylase n=1 Tax=Mariprofundus ferroo... 314 5e-84 UniRef50_B2UNR9 HhH-GPD family protein n=1 Tax=Akkermansia mucin... 313 5e-84 UniRef50_Q10159 A/G-specific adenine DNA glycosylase n=1 Tax=Sch... 313 8e-84 UniRef50_C5YTP8 Putative uncharacterized protein Sb08g006760 n=1... 312 1e-83 UniRef50_A3EPM8 Putative A/G-specific DNA glycosylase n=2 Tax=Le... 311 3e-83 UniRef50_C1EFR0 Predicted protein n=1 Tax=Micromonas sp. RCC299 ... 310 5e-83 UniRef50_UPI0000E49729 PREDICTED: hypothetical protein, partial ... 310 6e-83 UniRef50_C5ZWA7 A/G-specific adenine glycosylase n=3 Tax=Helicob... 309 1e-82 UniRef50_Q5SH32 A/G-specific adenine glycosylase (MutY) n=2 Tax=... 308 2e-82 UniRef50_B0D6Q9 Predicted protein n=1 Tax=Laccaria bicolor S238N... 307 4e-82 UniRef50_B9SF01 A/G-specific adenine glycosylase muty, putative ... 306 9e-82 UniRef50_A6FJZ0 A/G-specific adenine glycosylase n=7 Tax=Alphapr... 305 2e-81 UniRef50_A1W1M2 A/G-specific adenine glycosylase n=13 Tax=Campyl... 305 2e-81 UniRef50_Q9ZMT8 A/G-SPECIFIC ADENINE GLYCOSYLASE n=21 Tax=Helico... 304 2e-81 UniRef50_UPI0001926FFC PREDICTED: similar to mutY homolog, parti... 304 3e-81 UniRef50_Q5KQ30 A/G-specific adenine DNA glycosylase, putative n... 304 4e-81 UniRef50_B7P4H9 A/G-specific adenine glycosylase muty, putative ... 302 1e-80 UniRef50_B7A8Y1 HhH-GPD family protein n=1 Tax=Thermus aquaticus... 301 2e-80 UniRef50_D2VUC5 Predicted protein n=1 Tax=Naegleria gruberi RepI... 301 3e-80 UniRef50_Q7VGS8 A/G-specific adenine glycosylase n=537 Tax=Helic... 300 5e-80 UniRef50_B6K1Z7 A/G-specific adenine DNA glycosylase n=1 Tax=Sch... 298 1e-79 UniRef50_C3XHP3 A/G-specific adenine glycosylase n=1 Tax=Helicob... 296 7e-79 UniRef50_A6Q1G1 A/G-specific adenine glycosylase n=1 Tax=Nitrati... 296 9e-79 UniRef50_UPI0001746831 A/G-specific adenine glycosylase n=1 Tax=... 295 2e-78 UniRef50_B3T7B3 Putative HhH-GPD superfamily base excision DNA r... 295 2e-78 UniRef50_Q6IEF2 A/G-specific adenine glycosylase, putative n=3 T... 293 5e-78 UniRef50_B1ZMV1 HhH-GPD family protein n=2 Tax=Verrucomicrobia R... 290 4e-77 UniRef50_Q7M9R1 A/G-SPECIFIC ADENINE GLYCOSYLASE EC 3.2.2 n=1 Ta... 287 6e-76 UniRef50_C6HWU0 Putative A/G-specific DNA glycosylase n=1 Tax=Le... 283 6e-75 UniRef50_Q6MHU9 A/G-specific adenine glycosylase n=1 Tax=Bdellov... 283 7e-75 UniRef50_Q1ZXP7 Putative uncharacterized protein MYH n=1 Tax=Dic... 283 7e-75 UniRef50_Q1V0L8 A/G-specific adenine glycosylase n=3 Tax=Candida... 282 1e-74 UniRef50_A6DT67 Adenine glycosylase n=1 Tax=Lentisphaera araneos... 280 4e-74 UniRef50_Q2IJ90 A/G-specific DNA-adenine glycosylase n=5 Tax=Cys... 280 6e-74 UniRef50_A6Q673 A/G-specific adenine glycosylase n=2 Tax=Epsilon... 280 7e-74 UniRef50_B9KEL2 A/G-specific adenine glycosylase n=3 Tax=Campylo... 278 2e-73 UniRef50_Q4Q851 A/G-specific adenine glycosylase, putative n=3 T... 275 2e-72 UniRef50_Q1J2G1 A/G-specific adenine glycosylase n=3 Tax=Deinoco... 274 4e-72 UniRef50_C4PWX5 A/g-specific adenine dna glycosylase (Ec 3.2.2.-... 273 6e-72 UniRef50_C0ACJ1 HhH-GPD family protein n=1 Tax=Opitutaceae bacte... 272 1e-71 UniRef50_D1Y8G9 A/G-specific DNA-adenine glycosylase n=1 Tax=Pyr... 272 1e-71 UniRef50_A8I815 DNA repair glycosylase (Fragment) n=1 Tax=Chlamy... 271 3e-71 UniRef50_Q9SU12 Adenine DNA glycosylase like protein n=1 Tax=Ara... 267 4e-70 UniRef50_C9M6M4 A/G-specific adenine glycosylase n=1 Tax=Jonquet... 264 3e-69 UniRef50_C8NK18 A/G-specific adenine glycosylase n=8 Tax=Coryneb... 262 2e-68 UniRef50_C9RK91 HhH-GPD family protein n=1 Tax=Fibrobacter succi... 261 3e-68 UniRef50_A9AYZ2 HhH-GPD family protein n=5 Tax=Chloroflexi (clas... 257 6e-67 UniRef50_B6K934 Helix-hairpin-helix motif-containing protein n=3... 255 1e-66 UniRef50_UPI0000510111 A/G-specific adenine glycosylase n=1 Tax=... 255 2e-66 UniRef50_Q4P7F9 Putative uncharacterized protein n=1 Tax=Ustilag... 253 6e-66 UniRef50_B3DTE4 A/G-specific DNA glycosylase n=6 Tax=Bifidobacte... 252 1e-65 UniRef50_D0RRM3 A/G-specific adenine glycosylase n=1 Tax=alpha p... 251 2e-65 UniRef50_A7NN47 HhH-GPD family protein n=2 Tax=Bacteria RepID=A7... 250 6e-65 UniRef50_D1BVG2 HhH-GPD family protein n=39 Tax=Actinomycetales ... 248 3e-64 UniRef50_A1A5M6 Mutyh protein n=4 Tax=Eukaryota RepID=A1A5M6_RAT 247 4e-64 UniRef50_B9L504 Catalytic Domain Of MutyY n=4 Tax=Bacteria RepID... 247 5e-64 UniRef50_C7HUA0 Endonuclease III n=4 Tax=Anaerococcus RepID=C7HU... 247 5e-64 UniRef50_C8RS99 A/G-specific DNA glycosylase n=1 Tax=Corynebacte... 247 5e-64 UniRef50_B2AMG7 Predicted CDS Pa_5_7310 n=1 Tax=Podospora anseri... 247 5e-64 UniRef50_B8DWV0 Endonuclease III n=11 Tax=Bifidobacteriaceae Rep... 246 6e-64 UniRef50_O26858 Endonuclease III n=4 Tax=Euryarchaeota RepID=O26... 245 1e-63 UniRef50_A4AHC9 Adenine glycosylase n=1 Tax=marine actinobacteri... 245 1e-63 UniRef50_Q9WYK0 Endonuclease III n=12 Tax=Thermotogaceae RepID=E... 245 2e-63 UniRef50_B2A677 DNA-(Apurinic or apyrimidinic site) lyase, endon... 244 3e-63 UniRef50_Q5M3Y9 Endonuclease III, DNA repair n=21 Tax=Firmicutes... 244 4e-63 UniRef50_Q03K95 DNA-(Apurinic or apyrimidinic site) lyase / endo... 244 4e-63 UniRef50_B8P2D0 Predicted protein n=1 Tax=Postia placenta Mad-69... 243 5e-63 UniRef50_A1RYX1 HhH-GPD family protein n=1 Tax=Thermofilum pende... 243 6e-63 UniRef50_B7GUG9 HhH-GPD family protein n=2 Tax=Bifidobacterium R... 243 7e-63 UniRef50_C8WS61 Endonuclease III n=10 Tax=Firmicutes RepID=C8WS6... 243 8e-63 UniRef50_Q1ASN5 DNA-(Apurinic or apyrimidinic site) lyase / endo... 242 1e-62 UniRef50_C2KXT3 DNA-(Apurinic or apyrimidinic site) lyase n=1 Ta... 242 2e-62 UniRef50_C1MWM2 Predicted protein n=1 Tax=Micromonas pusilla CCM... 241 2e-62 UniRef50_C1SIS5 DNA-(Apurinic or apyrimidinic site) lyase; endon... 241 3e-62 UniRef50_C6A996 A/G-specific adenine glycosylase n=9 Tax=Bifidob... 241 3e-62 UniRef50_B2W3B1 A/G-specific adenine glycosylase n=2 Tax=Pleospo... 241 3e-62 UniRef50_A9UV22 Predicted protein (Fragment) n=1 Tax=Monosiga br... 241 4e-62 UniRef50_C0QU82 Endonuclease III n=4 Tax=Hydrogenothermaceae Rep... 240 5e-62 UniRef50_Q18HE6 A/G-specific adenine glycosylase n=12 Tax=Haloba... 240 5e-62 UniRef50_C0QG99 Putative endonuclease III n=1 Tax=Desulfobacteri... 240 7e-62 UniRef50_Q11NT5 DNA-(Apurinic or apyrimidinic site) lyase / endo... 239 9e-62 UniRef50_Q2FR30 DNA-(Apurinic or apyrimidinic site) lyase / endo... 239 1e-61 UniRef50_A1A319 Probable A/G-specific adenine glycosylase n=3 Ta... 239 1e-61 UniRef50_Q2R7H4 Endonuclease III-like protein 1, putative, expre... 238 2e-61 UniRef50_A5UJU9 Endonuclease III n=5 Tax=Archaea RepID=A5UJU9_METS3 238 2e-61 UniRef50_Q2RH98 DNA-(Apurinic or apyrimidinic site) lyase / endo... 238 3e-61 UniRef50_C5WHA6 Endonuclease III n=7 Tax=Firmicutes RepID=C5WHA6... 238 3e-61 UniRef50_A4E9V4 Putative uncharacterized protein n=1 Tax=Collins... 237 4e-61 UniRef50_B2J6L0 Endonuclease III n=10 Tax=Bacteria RepID=B2J6L0_... 237 5e-61 UniRef50_C0Z686 Endonuclease III n=134 Tax=Firmicutes RepID=C0Z6... 237 5e-61 Sequences not found previously or not previously below threshold: >UniRef50_Q67NN1 A/G-specific adenine glycosylase n=1 Tax=Symbiobacterium thermophilum RepID=Q67NN1_SYMTH Length = 365 Score = 413 bits (1061), Expect = e-114, Method: Composition-based stats. Identities = 120/346 (34%), Positives = 177/346 (51%), Gaps = 8/346 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + ++LDWY R LPW+ + PY +W+SE+MLQQT+V TVIPY+ER+MARFPT+ L Sbjct: 1 MADRLLDWYSASAR-DLPWRRTRDPYHIWVSEIMLQQTRVETVIPYYERWMARFPTLEAL 59 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+AP ++VL W GLGYY+RARNLH AA++V +GG P+ + VA+L G+G TAGAI Sbjct: 60 ADAPEEQVLKAWEGLGYYSRARNLHAAAREVVARYGGTVPDDPDAVASLKGIGPYTAGAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS++ + P +DGNV RV+AR YA+ + + L + P FNQA+M Sbjct: 120 LSIAFNRPVPAVDGNVLRVIARLYAIVDDIAQLATRRTIEELVRAMIPQDRPGDFNQALM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA ICT +P+C LCP+++ C P K + +++ E +L Sbjct: 180 DLGATICTPRRPRCLLCPVRDLCEGRRAGRETELPVKGRAKAPRLVERAAAVIEREGRLL 239 Query: 246 LAQRPPSGLWGGLYCFPQ-------FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + +RP GL GL+ P +E L H FSH Sbjct: 240 IVRRPQEGLLAGLWELPGGDVPPGVGPEEALQTLLRNALGAEVAVGEHLADVTHVFSHLV 299 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + V T + + W + + A ++ +++ Sbjct: 300 WHLRCFRAEVVPGTEVAERADVRWVKVDALGAYPFPAIYHKVFERV 345 >UniRef50_A0Z8D4 A/G-specific adenine glycosylase MutY n=2 Tax=Gammaproteobacteria RepID=A0Z8D4_9GAMM Length = 349 Score = 402 bits (1035), Expect = e-111, Method: Composition-based stats. Identities = 182/343 (53%), Positives = 238/343 (69%), Gaps = 3/343 (0%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F++++LDW+D +GR LPWQ KT Y+VWLSE+MLQQTQV TVIPYF+RF+ RFP+V D Sbjct: 3 SFASRLLDWFDLHGRHDLPWQKSKTAYRVWLSEIMLQQTQVQTVIPYFDRFLTRFPSVED 62 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP D+VLHLWTGLGYYARARNLH+AA+ V + GG+FP+ + + LPGVGRSTAGA Sbjct: 63 LAAAPEDDVLHLWTGLGYYARARNLHRAAKLVVSDFGGQFPQDLDGLLQLPGVGRSTAGA 122 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 ILSL++G PILDGNVKRVLAR VSGWPG NKLW+L+E+ TP V + Q + Sbjct: 123 ILSLAMGIRAPILDGNVKRVLARHDVVSGWPGTTTTLNKLWALAEEYTPTARVADYTQGI 182 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-EDE 243 MDLGA +CTRS P C CPL N C A + A +PGKKP++TLPER YF++ Q + Sbjct: 183 MDLGATLCTRSAPGCQHCPLTNTCQARSRCEQAHFPGKKPRKTLPERETYFVIFQRPDGT 242 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT-QLTAFRHTFSHFHLDIV 302 VL+ +RP G+WGGL+CFP+ D ++ LA + +A T L +H+FSH+ L I Sbjct: 243 VLMEKRPEQGIWGGLWCFPESQDGDASPAHLATMRASAQEPTIYLDPIKHSFSHYRLTIR 302 Query: 303 PMWLPVSSFTGCMDEGN-ALWYNLAQPPSVGLAAPVERLLQQL 344 P+ L + + + E N + W + P S+GL PV ++L+ L Sbjct: 303 PILLKIIRASEGVAEVNSSQWIDPQTPGSIGLPVPVTQVLKTL 345 >UniRef50_Q2S0Q6 A/G-specific adenine glycosylase n=1 Tax=Salinibacter ruber DSM 13855 RepID=Q2S0Q6_SALRD Length = 354 Score = 401 bits (1031), Expect = e-110, Method: Composition-based stats. Identities = 121/344 (35%), Positives = 173/344 (50%), Gaps = 10/344 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +LDWYD + R ++PW+ PY++W+SE+MLQQT+V TV Y+ RF+ FPTV LA+A Sbjct: 1 MLDWYDTHKR-SMPWRETDDPYRIWVSEIMLQQTRVDTVRDYYHRFLEAFPTVEALADAD 59 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 D VL W GLG+YARAR+LH AAQ V H G P T + + L GVG TA A+LS++ Sbjct: 60 RDTVLKHWEGLGFYARARHLHTAAQHVVDEHDGTVPSTMDAIKDLKGVGPYTAAAVLSIA 119 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 K +LDGNV RVL+R +AV E L L+ ++ FNQAMM+LGA Sbjct: 120 YRKPHAVLDGNVTRVLSRVFAVDEDATTSAAEGHLRDLANELLDPDRPGDFNQAMMELGA 179 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQHEDEVLLAQ 248 ++CT P C CPL C A + YP + +P L+ D +L+ + Sbjct: 180 LVCTPRTPHCDRCPLNAVCRAHDAGTEEDYPITPESEPVPHEDIAVGLVFDDNDRLLIQR 239 Query: 249 RPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 RP GL GGL+ FP E R+ + + ++ H +SHF + + Sbjct: 240 RPDEGLLGGLWEFPGGKQEGDESMEAACRREVREELGVGMTDVEPFYTLSHAYSHFKITL 299 Query: 302 VPMWLPVSSFTGCMDEG-NALWYNLAQPPSVGLAAPVERLLQQL 344 ++ E W + + RL+++L Sbjct: 300 HAFRGRLADGPPEAREDQPFRWVTVDELDDYAFPRANRRLIEEL 343 >UniRef50_D1RGZ1 A/G-specific adenine glycosylase n=1 Tax=Legionella longbeachae D-4968 RepID=D1RGZ1_LEGLO Length = 357 Score = 397 bits (1021), Expect = e-109, Method: Composition-based stats. Identities = 152/342 (44%), Positives = 215/342 (62%), Gaps = 2/342 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QFS +L W+D +GRK LPWQ +TPY+VW+SE+MLQQTQV TVIPYFERFM RFP ++ Sbjct: 8 EQFSKPLLLWFDLHGRKNLPWQHPRTPYRVWVSEIMLQQTQVQTVIPYFERFMQRFPNIS 67 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA+A D+VL LW+GLGYY+RARNLH+ A+ + H G FP + + LPG+G ST+ Sbjct: 68 DLAHAQEDDVLSLWSGLGYYSRARNLHQTAKLILQHHQGIFPNDSKLLNELPGIGPSTSA 127 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS + K ILDGNVKRVL R + ++G+P + +V+ KLW L++ P + QA Sbjct: 128 AILSQAFNKPAAILDGNVKRVLTRFFRITGYPEQSQVKKKLWELADLCMPQENCADYTQA 187 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE-D 242 +MDLGA C P CS CPL C+A + L+P KK K+++P + L+L +E Sbjct: 188 IMDLGATCCITKNPHCSSCPLHINCLAFKHKEQHLFPTKKIKKSIPIQYQQLLVLHNEQG 247 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLA-QRQIAADNLTQLTAFRHTFSHFHLDI 301 + L +RP GLWGGL+C P + ++ ++ ++ ++ L AF+H FSHFHL+I Sbjct: 248 YIYLEKRPSPGLWGGLWCLPNISTDKCPLDFIRVNYDLSGEHPQPLLAFKHRFSHFHLEI 307 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + + F + E W+ Q S+GLA P ++L Sbjct: 308 NALSIKTTFFGRQLSENQGQWFAKEQLNSLGLAQPTRKILSY 349 >UniRef50_B8KUN1 A/G-specific adenine glycosylase n=2 Tax=Gammaproteobacteria RepID=B8KUN1_9GAMM Length = 363 Score = 396 bits (1017), Expect = e-109, Method: Composition-based stats. Identities = 172/340 (50%), Positives = 225/340 (66%), Gaps = 2/340 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++++L+W+D +GRK LPWQ D TPY+VW+SE+MLQQTQVATVIPY+ERFM FPTV Sbjct: 9 RTLASRLLNWFDDHGRKDLPWQTDTTPYRVWISEIMLQQTQVATVIPYYERFMGEFPTVE 68 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L+ AP D+VL LW+GLGYYARARNLH+ A+ V GG+FP+T + + LPG+GRSTAG Sbjct: 69 ALSAAPEDDVLKLWSGLGYYARARNLHRGAKMVTGDLGGEFPDTVDGLCTLPGIGRSTAG 128 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI+S+++G PILDGNVKRVLAR +AV GWPGK V +LW +E TP V + QA Sbjct: 129 AIISIAMGGRAPILDGNVKRVLARHHAVDGWPGKSGVAAELWGHAEAHTPNTRVADYTQA 188 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLLQHED 242 +MDLGA +CTR +P+C +CPL + C A YPGKKPK+T P R + L++ H Sbjct: 189 IMDLGATLCTRRRPQCLVCPLVDTCHAGRAGDPEQYPGKKPKRTTPTRSAFFALVVDHTG 248 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 VLL +RPPSG+WGGL+ PQ D L + + + QL HTFSHF L I Sbjct: 249 AVLLQKRPPSGIWGGLWSLPQSPDRGELENLAQRFGVVDPEVEQLPMIEHTFSHFRLAIT 308 Query: 303 PMWLPVSSF-TGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 P+ + +S+ T + W+ Q G+ APV +LL Sbjct: 309 PLRIRLSTAHTSVSEPTEYKWHQFTQSLPGGIPAPVAKLL 348 >UniRef50_C6N1N5 A/G specific adenine glycosylase n=1 Tax=Legionella drancourtii LLAP12 RepID=C6N1N5_9GAMM Length = 348 Score = 395 bits (1016), Expect = e-108, Method: Composition-based stats. Identities = 155/343 (45%), Positives = 211/343 (61%), Gaps = 2/343 (0%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 QFS +L+W+ +GRK LPWQ+ ++PY+VW+SE+MLQQTQV TVIPYFERFM RFP++ Sbjct: 6 EQFSKPLLNWFSLHGRKNLPWQLPRSPYRVWVSEIMLQQTQVQTVIPYFERFMQRFPSIQ 65 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A DEVL LW+GLGYY+RARNLHK AQ +AT + G FP+ V LPG+G STA Sbjct: 66 DLAQANEDEVLSLWSGLGYYSRARNLHKTAQIIATEYQGIFPDELALVHELPGIGASTAA 125 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS + + ILDGNVKRVL R + + G P + V+ KLW L++ P + QA Sbjct: 126 AILSQAFNQPTAILDGNVKRVLTRFFMIQGHPEQALVKKKLWELAQACMPQEDCADYTQA 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL-QHED 242 +MDLGA CT C CPLQ+ C+A + LYP KK K+ +P + L+L + Sbjct: 186 IMDLGATCCTTKNFNCLNCPLQDNCLALKHQEQHLYPTKKIKKPVPVQQQQLLVLCNKQG 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFHLDI 301 + L +RPP+GLWGGL+C P + ++ + + +N L AF+H FSHF L+I Sbjct: 246 HIYLEKRPPTGLWGGLWCLPSIDEGNCPLDFIRTEYDLVGENPQPLIAFKHRFSHFLLEI 305 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + + E W+ S+GLA P ++L L Sbjct: 306 NALKIITLALGTKLGEAKGQWFAKENLGSLGLATPTSKILSLL 348 >UniRef50_Q2SMB7 A/G-specific adenine glycosylase n=10 Tax=Gammaproteobacteria RepID=Q2SMB7_HAHCH Length = 388 Score = 394 bits (1013), Expect = e-108, Method: Composition-based stats. Identities = 162/350 (46%), Positives = 221/350 (63%), Gaps = 12/350 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+A++L W+D++GR LPWQ +TPY VW+SE+MLQQTQV+TVIPYF +FM FPTV Sbjct: 30 SFAARLLTWFDQHGRHDLPWQDPRTPYHVWISEIMLQQTQVSTVIPYFIKFMESFPTVAA 89 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A D VL W GLGYYARARNLHKAA+ + G+FP T E + LPG+GRSTAGA Sbjct: 90 LAEADQDTVLSHWAGLGYYARARNLHKAAKTIVEKFNGEFPNTLETIQELPGIGRSTAGA 149 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 ILS+ G PILDGNVKRVL R A+ GWPGK+E+E +LW L++ TP V + QA+ Sbjct: 150 ILSMGFGIRAPILDGNVKRVLCRHDAIEGWPGKREIETRLWELADAYTPEERVTDYTQAI 209 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE-DE 243 MDLGA +CTRSKP C+ CP++ C A + P K + +P + L+LQ+E Sbjct: 210 MDLGATLCTRSKPACARCPMETTCQGLAQDMTDQLPTPKAGKKIPTHERFVLVLQNEDGH 269 Query: 244 VLLAQRPPSGLWGGLYCFPQFA----DEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +LL +RPP+G+WGGL+ P+ ++L+ W +++ + FRHTFSH+HL Sbjct: 270 ILLERRPPTGIWGGLWSLPESEKNMDLPDALQSWKEHKRLQLTQYELASPFRHTFSHYHL 329 Query: 300 DIVPMWLPVSSFTGC------MDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + P+ + + E ++ W++ Q GL AP+E+ L Q Sbjct: 330 ILKPVIASTADIERPDDLAFIVGESDSQWFDPDQ-ELPGLPAPIEKWLSQ 378 >UniRef50_C7R6W3 A/G-specific adenine glycosylase n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R6W3_KANKD Length = 355 Score = 394 bits (1012), Expect = e-108, Method: Composition-based stats. Identities = 172/360 (47%), Positives = 228/360 (63%), Gaps = 23/360 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A+ F+ QV+D+Y +GRK LPWQ + PY++WLSE+MLQQTQV TVIPYFERF+ FPT Sbjct: 6 SATNFARQVIDYYHAFGRKHLPWQNTQDPYRIWLSEIMLQQTQVTTVIPYFERFVESFPT 65 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA+A D+V+ W+GLGYY+RARNLHKAA+ + T GG FP+ E + LPG+GRST Sbjct: 66 VLDLAHATDDQVMQHWSGLGYYSRARNLHKAAKIIETEFGGDFPQDPEVIETLPGIGRST 125 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S + + ILDGNVKRVLARCYA+ GWPG +V LW +E TP +N Sbjct: 126 AGAIASFAFDQPTAILDGNVKRVLARCYAIEGWPGNGKVLKALWERAEANTPTQETAAYN 185 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL---- 237 QAMMDLGA++CTR+KP C CPL C+A NNS YPGKKPK+ P + Y+L+ Sbjct: 186 QAMMDLGAVVCTRTKPNCPDCPLSKHCLAYHNNSIEQYPGKKPKKARPSKAVYWLISINE 245 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + +VLL +RPPSG+WGGL+ P+ ++ I+ D ++QL F H FSH+ Sbjct: 246 TSDQTQVLLHKRPPSGIWGGLWALPEVEQQD----------ISPDKISQLDPFVHKFSHY 295 Query: 298 HLDIVPM---------WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 L + P+ + S M+ A W++ Q +GL PV +LL +L + Sbjct: 296 DLQVQPLLLSSKADKALVEKKSHPSIMEPAQADWFSQNQLSEIGLPTPVSKLLSKLLGDS 355 >UniRef50_C9R2J3 A/G-specific adenine glycosylase n=1 Tax=Aggregatibacter actinomycetemcomitans D11S-1 RepID=C9R2J3_AGGAD Length = 396 Score = 392 bits (1008), Expect = e-108, Method: Composition-based stats. Identities = 212/360 (58%), Positives = 255/360 (70%), Gaps = 23/360 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ VL WY K+GRK LPWQ +KT Y VWLSEVMLQQTQV TVIPYFERF+ FP +T L Sbjct: 28 FAHAVLTWYAKFGRKNLPWQQNKTLYGVWLSEVMLQQTQVTTVIPYFERFVKTFPNLTAL 87 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANAPLDEVLHLWTGLGYYARARNLHKAAQ + HGG+FP FE+V ALPGVGRSTAGA+ Sbjct: 88 ANAPLDEVLHLWTGLGYYARARNLHKAAQIMRDQHGGEFPTEFEQVWALPGVGRSTAGAV 147 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS L +PILDGNVKRVL+R +AVSGWPG+K+ E+ LW L+ QVTP V FNQAMM Sbjct: 148 LSSCLNAPYPILDGNVKRVLSRYFAVSGWPGEKKTEDHLWHLTAQVTPTEQVADFNQAMM 207 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAM+CTRSKPKC LCPL++ C A A +W +YPG+KPK+ LPER YFLLL+ + ++ Sbjct: 208 DIGAMVCTRSKPKCELCPLKSDCKANAEQNWQVYPGRKPKKVLPERESYFLLLEKDGKIA 267 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR +GLWGGLYCFPQF++++ L +LA I + TAFRHTFSHFHLDI P++ Sbjct: 268 LEQRDNAGLWGGLYCFPQFSEKQELLAYLAANGI--RQYQEWTAFRHTFSHFHLDIYPIY 325 Query: 306 LPVSSFTGCMDEGNA---------------------LWYNLAQPPSVGLAAPVERLLQQL 344 V + +E + WY+ P +GLAAPV+ LL Q Sbjct: 326 AQVDTPKALAEENRSHRQKVAETAGEYQSDLLSAVKYWYDPQNPEPIGLAAPVKNLLTQY 385 >UniRef50_D0MK94 A/G-specific adenine glycosylase n=1 Tax=Rhodothermus marinus DSM 4252 RepID=D0MK94_RHOM4 Length = 383 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 126/347 (36%), Positives = 183/347 (52%), Gaps = 10/347 (2%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 ++DWY ++ R LPW+ + PY++W++EVMLQQT+V PY+ERF+ FPTV LA A Sbjct: 25 LIDWYRRHAR-DLPWRRTRDPYRIWVAEVMLQQTRVDQAGPYYERFLRAFPTVEALAAAS 83 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 LD+VL W GLGYYARARNLH+AA+Q+ HGG+ P T+E + LPGVG TA A+ S++ Sbjct: 84 LDDVLRCWEGLGYYARARNLHRAARQLVAEHGGRLPTTYEALRRLPGVGPYTAAAVASIA 143 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGA 189 G+ +LDGNV RVL R AV+ L +++ + FNQA+M+LGA Sbjct: 144 FGEPRAVLDGNVIRVLTRVLAVADDARASATRRALQEVADALISDEEPGTFNQALMELGA 203 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE-RTGYFLLLQHEDEVLLAQ 248 +CT +P+C+ CPL+ C A A +P + P+ ++P LL E VL+ + Sbjct: 204 TVCTPVQPRCNDCPLREVCRARAMGDPTAFPVQTPRASVPHYEVALGLLFNEEGAVLIQR 263 Query: 249 RPPSGLWGGLYCFPQFADE-------ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 RP GL GGL+ FP E R+ + + L RH ++HF + + Sbjct: 264 RPEDGLLGGLWEFPGGKREPGESLEAACARELHEELGVRVAVGPCLATVRHAYTHFRVTL 323 Query: 302 VPMWLPVSSFTGCMDEG-NALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + G W L + RL++ L+ Sbjct: 324 YAFPCTLLEGVPRSRAGLPLRWVPLNELDHYAFPRANRRLIELLKQR 370 >UniRef50_B8FCI1 A/G-specific adenine glycosylase n=1 Tax=Desulfatibacillum alkenivorans AK-01 RepID=B8FCI1_DESAA Length = 369 Score = 391 bits (1006), Expect = e-107, Method: Composition-based stats. Identities = 118/357 (33%), Positives = 192/357 (53%), Gaps = 12/357 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 FS +L WY++ R LPW+ PY +W+SE+MLQQTQV TVIPYF R+M FP ++ Sbjct: 13 KDFSRSLLRWYEENAR-DLPWRRTSDPYAIWVSEIMLQQTQVKTVIPYFLRWMDAFPNIS 71 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA APLD+VL +W GLGYY+RARN+HKAA+++ G+ P T++ + LPG+G TAG Sbjct: 72 SLAEAPLDDVLKMWEGLGYYSRARNMHKAAKEIMDRLDGRMPRTYKGLLELPGIGAYTAG 131 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ S++ + P++D NVKRV AR + + ++ L+E + P+ FNQA Sbjct: 132 AVCSIAYNQDVPLVDANVKRVFARILDMEKPVEQTAATREIRGLAESLIPSGKAGLFNQA 191 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++CT P C CP+ C+A + P PK+ + + + Sbjct: 192 LMELGALVCTPKNPDCKGCPVSVHCLALKEQTVDSRPVLPPKKKTQALEVSAGVCVRDRK 251 Query: 244 VLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 +L+ +R P GL GL+ FP + ++ +R++ + +I + ++T +H ++ Sbjct: 252 ILIQKRLPKGLMAGLWEFPGGKLNPGESPEQALVREFAEELEIDIECGEKITVIQHAYTR 311 Query: 297 FHLDIVPMWLPV---SSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL-RTGAP 349 F + + W + + W + + + + RL+Q L + G P Sbjct: 312 FRVRLHVFWCSMKKPAQTPALHAAEEIRWVSPKELDGLAFPSADRRLIQMLMKDGIP 368 >UniRef50_B7GII9 A/G-specific adenine glycosylase n=76 Tax=Bacillaceae RepID=B7GII9_ANOFW Length = 373 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 117/351 (33%), Positives = 182/351 (51%), Gaps = 10/351 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 QF ++DW+ R LPW+ DK PYK+W+SEVMLQQT+V TVIPYF +F+ +FP Sbjct: 25 FHIEQFQHDLIDWFRTEQR-DLPWRKDKDPYKIWVSEVMLQQTRVDTVIPYFYQFIEKFP 83 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA+A +EVL W GLGYY+R RNLH A ++V +GG+ P + E+ ++L GVG Sbjct: 84 TLDALADAKEEEVLKAWEGLGYYSRVRNLHAAVKEVKEKYGGRVPASKEQFSSLKGVGPY 143 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GA+LS++ G P +DGNV RVL+R + ++ + K + + F Sbjct: 144 TTGAVLSIAYGIPEPAVDGNVMRVLSRIFYITDDIARASTRKKFEQIVSCIISHDDPSDF 203 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL-Q 239 NQA+M+LGA++CT P C LCP+Q C A A A P K + ++L Sbjct: 204 NQALMELGALVCTPKNPSCFLCPVQRHCRAFAEGVEAELPVKTKGKAPKHVAFRAIVLTN 263 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFA------DEESLRQWLAQRQIAADNLTQLTAFRHT 293 E ++ + +RP SGL L+ FP ++ +++ + + ++ ++ F H Sbjct: 264 EEGKIRIEKRPSSGLLANLWQFPNDEYAPTSNEQAFIKEISERYHVVLRSIQRIGTFEHV 323 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 FSH I ++ M + W + + +++ Q Sbjct: 324 FSHIVWHIEAY--KGNALELQMGDETNKWVTVDELNQYAFPVIYQKIWQAY 372 >UniRef50_Q3A7R2 A/G-specific DNA-adenine glycosylase n=2 Tax=Desulfuromonadales RepID=Q3A7R2_PELCD Length = 352 Score = 390 bits (1003), Expect = e-107, Method: Composition-based stats. Identities = 142/347 (40%), Positives = 201/347 (57%), Gaps = 7/347 (2%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A + +LDWY + GR LPW+ + PY++WLSE+MLQQT V VIPY+ERF+A FP+V Sbjct: 8 ADTMNRCLLDWYGRCGR-DLPWRRTRDPYRIWLSEIMLQQTGVTAVIPYYERFLAAFPSV 66 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA APL++VL LW GLGYY RAR LH+AA +V + HGG+FPET E + ALPG+GRSTA Sbjct: 67 AALAAAPLEQVLELWAGLGYYRRARFLHEAACKVVSEHGGQFPETPEAIQALPGIGRSTA 126 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI+S++ + PILDGNV+RVL R A+SG P +VE +LW ++ +TP + Q Sbjct: 127 GAIVSIAFDRKAPILDGNVRRVLCRLLAISGDPRSSKVEKRLWQCADALTPEDRPHDYAQ 186 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+MDLGA +C +P C CPL C A P + ++T+P +L++ E Sbjct: 187 AIMDLGATVCKPRRPDCQACPLSGLCQAFWQGIQEQLPQRATRKTVPLVQQVAVLIEREG 246 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADN-----LTQLTAFRHTFSHF 297 L+ +RP G+ GGL+ FP A E A R + A L+ + RH +SHF Sbjct: 247 RYLVRRRPLDGMLGGLWEFPSVAVPEGKTAETAARTLLAGEGLSAGLSPVGTVRHAYSHF 306 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 +++ + G + + W + + L ++ L L Sbjct: 307 RVELHVFACR-ENRAGMVADEEHRWLSPQELADWPLHGSHKKALVLL 352 >UniRef50_Q3KJJ2 A/G-specific adenine glycosylase n=5 Tax=Gammaproteobacteria RepID=Q3KJJ2_PSEPF Length = 355 Score = 389 bits (1001), Expect = e-107, Method: Composition-based stats. Identities = 182/344 (52%), Positives = 233/344 (67%), Gaps = 1/344 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M+A QFS VL+W+D++GR LPWQ + PY+VW+SE+MLQQTQV+TV+ YF+RFMA P Sbjct: 1 MRAEQFSTAVLEWFDRHGRHDLPWQQNINPYRVWVSEIMLQQTQVSTVLNYFDRFMAALP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA AP DEVLHLWTGLGYY RARNL K A+ V + +GG+FP E++ LPG+G S Sbjct: 61 TVEALAAAPEDEVLHLWTGLGYYTRARNLQKTAKIVVSQYGGEFPRDVEKLTELPGIGLS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S+S+G PILDGNVKRVLAR A G+PG+ +V +LW+ +E+ TP V + Sbjct: 121 TAGAIGSISMGLRAPILDGNVKRVLARFTAQEGYPGEPKVAKQLWANAERFTPQDRVNAY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP-ERTGYFLLLQ 239 QAMMDLGA +CTRSKP C LCPL+ GC A YP KP++ +P +RT LL Sbjct: 181 TQAMMDLGATLCTRSKPSCLLCPLEKGCEAHMLGLETRYPIPKPRKAIPQKRTLMPLLAN 240 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + +LL +RP SGLWGGL+ P+ D + L+ AQ + L + HTFSHF L Sbjct: 241 EDGAILLYRRPSSGLWGGLWSLPELDDLDDLQHLAAQHSLELGEQQALPSLVHTFSHFQL 300 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 I P + V G + E + LWYNLA PP +GLAAPV+ LL++ Sbjct: 301 SIEPWLVQVQEAGGHVAEADWLWYNLATPPRLGLAAPVKTLLER 344 >UniRef50_A9KCK4 A/G-specific adenine DNA glycosylase n=6 Tax=Coxiella burnetii RepID=A9KCK4_COXBN Length = 354 Score = 387 bits (994), Expect = e-106, Method: Composition-based stats. Identities = 177/353 (50%), Positives = 234/353 (66%), Gaps = 3/353 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + QF+ VL W+D+YGR LPWQ TPY+VW+SE+MLQQTQV+TVIPYFERF+ RFP Sbjct: 1 MDSKQFAQGVLRWFDRYGRHDLPWQKKLTPYRVWVSEIMLQQTQVSTVIPYFERFIKRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA APLDE+L W+GLGYYARARNLH+AAQ + + G+FP T E +++LPG+GRS Sbjct: 61 TVGALALAPLDEILAHWSGLGYYARARNLHRAAQIIHVTYHGRFPSTVETLSSLPGIGRS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+LSL + ++ ILDGNVKRVLAR A+ ++ N LW+L+E+ TP + Sbjct: 121 TAGAVLSLGMHQYAVILDGNVKRVLARYNALDVPINQQVGINILWNLAEKYTPKNRCWDY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMMD+GAMICTR+KPKCSLCPL++ C A + +P KK K ++ Y LLL++ Sbjct: 181 NQAMMDIGAMICTRTKPKCSLCPLKSSCKAHRLSQQMNFPIKKAKTARAQKAAYLLLLRN 240 Query: 241 -EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLA-QRQIAADNLTQLTAFRHTFSHFH 298 E+LL +RPP+G+WGGL+ FPQ EE + +W + A + + H FSHF Sbjct: 241 SRGEILLEKRPPTGIWGGLWSFPQCPIEEDIDKWCQTKLGFEAVICERWNSIFHQFSHFE 300 Query: 299 LDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQQLRTGAPV 350 +I P+ L + + M E +WY G+AAPV RLL+QL V Sbjct: 301 FEIKPVLLQIETRQPRMMECPPQIWYKEHSALPGGIAAPVARLLKQLTQQTAV 353 >UniRef50_Q2LY55 A/g-specific DNA glycosylase n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LY55_SYNAS Length = 373 Score = 386 bits (992), Expect = e-106, Method: Composition-based stats. Identities = 128/352 (36%), Positives = 192/352 (54%), Gaps = 10/352 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A + + + WY+K R LPW+ PY +WLSE+MLQQTQV VIPY+ RF+ +FPT Sbjct: 21 NAQRIATLLTAWYEKNKRL-LPWRSTSDPYAIWLSEIMLQQTQVEAVIPYYRRFLEQFPT 79 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +LA APL+ VL +W +GYY+RAR+LH A+ + HGG+FP ++ ALPG+G T Sbjct: 80 IEELARAPLEAVLKVWEKMGYYSRARHLHATARLILESHGGRFPANPVDLMALPGIGSYT 139 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 +GAILS++ GK P +DGNVKRVL+R + V L +L+E++ PA RFN Sbjct: 140 SGAILSIAFGKSVPAVDGNVKRVLSRLFFVDSPVDLTSTRRLLSALAEKLVPARQPGRFN 199 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH- 240 QA+M+LGA++C P CS CPLQ+ C+A A + P + + P R +++ Sbjct: 200 QALMELGAVLCRPKTPLCSDCPLQSICLAYAGSGQHFLPVSRKRTKRPHREAVAAVIRDS 259 Query: 241 EDEVLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 E +L+ +RP +G GGL+ FP + E R+ I L + T Sbjct: 260 EQRLLVIRRPAAGFLGGLWTFPGGMLNPGEIVTEAVERRCREGLNITVAAGDSLMTLQQT 319 Query: 294 FSHFHLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 ++HFHL + + + N W + ++ + R+L+ L Sbjct: 320 YTHFHLTLHVFAGTILDGVPDSPQKDNWRWVSPGDIRNLPFSRAELRILETL 371 >UniRef50_A8MJ42 A/G-specific adenine glycosylase n=2 Tax=Firmicutes RepID=A8MJ42_ALKOO Length = 544 Score = 384 bits (988), Expect = e-105, Method: Composition-based stats. Identities = 114/344 (33%), Positives = 181/344 (52%), Gaps = 11/344 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + ++L+WY K R LPW+ ++ PY+VWLSE+MLQQT+V TVI Y+ RF+ FPT+ Sbjct: 195 KRLPEKLLNWYQKNAR-DLPWRKNQDPYRVWLSEIMLQQTRVDTVIDYYNRFLQAFPTID 253 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A + VL LW GLGYY+RARNLHK A+ + + G FPET EE+ LPG+G TAG Sbjct: 254 ALALADEERVLKLWEGLGYYSRARNLHKTAKIIVAQYEGNFPETHEELLKLPGIGSYTAG 313 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S+S +DGNV RV++R +++V+ ++ + +V P F Q+ Sbjct: 314 AIASISFNLPVAAVDGNVLRVVSRITEDYRCIDEEKVKKEMGNQLAEVYPENQCGDFTQS 373 Query: 184 MMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M+LGA IC + P C+ CP C+A N++ P +K K R + + Sbjct: 374 LMELGATICLPNGAPLCNECPAIEICMANKNDTQIFLPVRKEKTARKTRELTVFVFVSQG 433 Query: 243 EVLLAQRPPSGLWGGLYCFPQ---FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +V L +R G+ GL+ P ++E+++ +L + + + H F+H Sbjct: 434 KVALQKRTEKGVLEGLWELPNRDGMSEEDAVSLYLEELGTLEYRIEKQKKAHHIFTHIRW 493 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPP-SVGLAAPVERLLQ 342 D+V ++ + G + +W + + ++ L Sbjct: 494 DMVCYYVHCTHPFG-----SYIWAGSEELEKEYSVPTAFKKFLS 532 >UniRef50_C9PAA7 A/G-specific adenine glycosylase n=2 Tax=Vibrio RepID=C9PAA7_VIBFU Length = 341 Score = 384 bits (987), Expect = e-105, Method: Composition-based stats. Identities = 133/343 (38%), Positives = 190/343 (55%), Gaps = 13/343 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + QF +L W +GR LPWQ + +PY+V +SEVMLQQTQV TVIPYFER+MA FP Sbjct: 6 LTPKQFQEHLLTWQRHHGRHDLPWQQNPSPYRVLVSEVMLQQTQVVTVIPYFERWMASFP 65 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LANA D V++ W GLGYY+RARNL KAA + G+FP + +PGVGR Sbjct: 66 TIEALANATEDAVMNHWQGLGYYSRARNLRKAALYIQDTWNGEFPADVNALQNIPGVGRY 125 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI + + + PI+DGNVKR+ R + + G PG ++ +LWS +E TP ++ Sbjct: 126 TAGAIAAFAFNTYGPIVDGNVKRLFCRYFGIEGVPGTSAMDKQLWSTAEAYTPTHNNRQY 185 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 Q ++D+GA +C P C C C A N + P KPK+ P + G+FL L+ Sbjct: 186 AQGLLDMGATLCKPKNPTCDACSFTTTCKAYQTNRVSELPTPKPKKVTPTKPGHFLWLES 245 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + ++LL +R G+WG L+C PQ E +++ AF+HTF+H+ LD Sbjct: 246 DGKLLLEKRADDGIWGALWCLPQIYLEPEQL---------GEHIKLKGAFKHTFTHYKLD 296 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + ++ W L Q VGL P+ +++ Sbjct: 297 GKVWQID----KLGVENPQQQWMPLHQLDDVGLPKPIRTFIEK 335 >UniRef50_Q8CV61 A/G-specific adenine glycosylase n=1 Tax=Oceanobacillus iheyensis RepID=Q8CV61_OCEIH Length = 354 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 115/349 (32%), Positives = 181/349 (51%), Gaps = 9/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F +L+WY R LPW+ + PYK W+SE+MLQQT+V TVIPYF RFM ++PT Sbjct: 9 NIPVFQTDLLEWYYLNKR-DLPWRREPNPYKTWVSEIMLQQTKVDTVIPYFNRFMEKYPT 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A +VL W GLGYY+RARNL A ++V + G+ P +E+A+L G+G T Sbjct: 68 VYDLAKADEQDVLKSWEGLGYYSRARNLQTAVREVVDTYNGEIPNNEKELASLKGIGPYT 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GAILS++ + P +DGNV RV +R + K+ + ++ ++ FN Sbjct: 128 KGAILSIAFNQPVPAVDGNVLRVFSRILQIEDDIAKQSTKKEIEQYVGEIISHQDPSSFN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLGA ICT KP C CP+ C A + P KK + + LL++++ Sbjct: 188 QAIMDLGATICTPKKPTCMFCPVMEHCQAFQHGIQEQLPIKKKAKKQKIKQYVVLLIRND 247 Query: 242 -DEVLLAQRPPSGLWGGLYCFPQFADEE----SLRQWL-AQRQIAADNLTQLTAFRHTFS 295 E ++ +R GL L+ FP +E + QW+ + I + +HTFS Sbjct: 248 QGEYVIEKRSDQGLLANLWQFPMVPLDEVGIDQIEQWMYDKCGIDLQMESDCDNLKHTFS 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 H ++ S+ +++ + ++ + + +++++ L Sbjct: 308 HIIWEMDI--KKASTSQNFLNKDSLMFIAKEEIDNFPFPVSHQKMMKFL 354 >UniRef50_UPI0000E105D2 A/G-specific adenine glycosylase n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E105D2 Length = 375 Score = 383 bits (985), Expect = e-105, Method: Composition-based stats. Identities = 178/372 (47%), Positives = 230/372 (61%), Gaps = 29/372 (7%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F+ L W+D +GRK LPWQ + + Y VW+SE+MLQQTQV TVIPYFERFM FPTV D Sbjct: 4 DFAQTALAWFDLHGRKHLPWQQNISAYSVWISEIMLQQTQVTTVIPYFERFMRSFPTVND 63 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LANAP ++VLH WTGLGYYARARNLHKAAQQ+A H G FP FEEV ALPG+GRSTAGA Sbjct: 64 LANAPQEDVLHHWTGLGYYARARNLHKAAQQIAEHHNGVFPTDFEEVLALPGIGRSTAGA 123 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 IL+++ ++ PILDGNVKRVLAR +AV GWPG K+VE++LW + ++TP+ + + Q M Sbjct: 124 ILAIAEHQNHPILDGNVKRVLARFFAVEGWPGSKKVEDELWHFAGELTPSERIADYTQVM 183 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLGA +CTRSKPKC +CPLQ+ C+A A+ + P KKPK+T+P + ++ D V Sbjct: 184 MDLGATLCTRSKPKCEVCPLQSRCLAFASGRQSELPHKKPKKTIPSKYTTVIIPMLFDRV 243 Query: 245 LLAQRPPSGLWGGLYCF-----PQFA--DEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ +RP G+WGGL+ F P DE Q + L F+HTFSHF Sbjct: 244 LMTKRPEEGIWGGLWWFGGEFTPTAQKVDEHPTIYTALGEQFDVVSSDMLPEFKHTFSHF 303 Query: 298 HLDIVPMWL----------------------PVSSFTGCMDEGNALWYNLAQPPSVGLAA 335 HL I P+ L ++ + + W + QP +GL Sbjct: 304 HLHIQPVILYLTTDKVSEAVLSNSMPQMTLEDLTGAGIRDGQVHQRWVDYRQPADIGLCK 363 Query: 336 PVERLLQQLRTG 347 P + QL+ Sbjct: 364 PALTIFNQLQQR 375 >UniRef50_Q3J8X1 A/G-specific DNA-adenine glycosylase n=13 Tax=Gammaproteobacteria RepID=Q3J8X1_NITOC Length = 354 Score = 382 bits (982), Expect = e-104, Method: Composition-based stats. Identities = 162/348 (46%), Positives = 220/348 (63%), Gaps = 3/348 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M S FS ++L W+D YGRK LPWQ + T Y+VW+SE+MLQQTQVATVIPY++RF+ RFP Sbjct: 1 MNKSDFSQRLLTWFDAYGRKDLPWQQNPTLYRVWVSEIMLQQTQVATVIPYYQRFIERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA+A +DE+L LW GLGYYARAR LH+AA+ HGG+ P T E + LPG+GRS Sbjct: 61 SLPALAHASVDEILGLWAGLGYYARARRLHQAARIAWETHGGELPATLEALMELPGIGRS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GAIL+L+LG+ +PILDGNVKRVL R A+ WPG+ +VE +LW + + P + + Sbjct: 121 TGGAILALALGQRYPILDGNVKRVLTRQEAIEHWPGQPKVEKQLWQRAATLLPRTRLADY 180 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL-LQ 239 QA+MDLGA +CTR +P C CP++ C A + YP +P++ LP R L+ L Sbjct: 181 TQAIMDLGATVCTRHRPHCPSCPVKKTCQAHLQENPEAYPRSRPRKRLPLRATCMLILLN 240 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL-AQRQIAADNLTQLTAFRHTFSHFH 298 + EVLL +RPP G+WGGL+ FP+ + W Q + RH F+HF Sbjct: 241 DQGEVLLERRPPVGIWGGLWSFPECPPQTEAALWCQEQFGWPIGEVQHWPPLRHHFTHFT 300 Query: 299 LDIVPMWLPVS-SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 LDI P+ + M+ + +WY + GL AP RLL++LR Sbjct: 301 LDIQPVIARIRGEARQVMEPNSQVWYKMEPMYKRGLPAPTLRLLKRLR 348 >UniRef50_B3QVZ4 A/G-specific adenine glycosylase n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QVZ4_CHLT3 Length = 360 Score = 382 bits (981), Expect = e-104, Method: Composition-based stats. Identities = 106/349 (30%), Positives = 180/349 (51%), Gaps = 9/349 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +L W+ R LPW+ +TPYK+W+SE+MLQQTQVATVIPY+ERF+ FP + Sbjct: 2 SIEKDLLSWFLLNKR-DLPWRKKRTPYKIWVSEIMLQQTQVATVIPYYERFLNAFPNLES 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LANA +++++ +W GLGYY R +N+ +AA+ + H G FP E+ L G+G TA Sbjct: 61 LANADINKLMKIWEGLGYYTRVKNMQEAAKTILQKHNGVFPSKKTELLQLKGIGDYTAAI 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ +H +DGNV RV++R A++ + + +++++ FN+AM Sbjct: 121 IASIAFKEHCAAVDGNVLRVISRLNAINAPIQLNTTKQTIRIVAQELLSLEHPGEFNEAM 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M++GA+IC P C +CP+ C A P K + +P ++ +D V Sbjct: 181 MEVGALICKPKNPTCDICPISLHCQAYKKGLEHKLPVKLKRAEIPHYHIAAGVIYKDDFV 240 Query: 245 LLAQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+A RP +GL G L+ FP ++ R+ + + + L +L + +H ++HF Sbjct: 241 LIALRPANGLLGNLWEFPGGKQQQGESLEDCCKREIFEETGLHVNVLEKLISVKHAYTHF 300 Query: 298 HLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + +S W + S ++++++L+ Sbjct: 301 KITLHAYRCNYISGSPEPRASQALKWVRIEDLTSYAFPKANKKIIEKLQ 349 >UniRef50_B8HXE9 A/G-specific adenine glycosylase n=31 Tax=Cyanobacteria RepID=B8HXE9_CYAP4 Length = 386 Score = 381 bits (980), Expect = e-104, Method: Composition-based stats. Identities = 133/353 (37%), Positives = 185/353 (52%), Gaps = 12/353 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q +L WY ++GR +LPW+ PY +W+SE+MLQQTQV TVIPY++R++A PT Sbjct: 33 QILDLQRSLLHWYRQHGR-SLPWRETSDPYAIWVSEIMLQQTQVQTVIPYYQRWLAALPT 91 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +A A +VL LW GLGYY+RARNLH+AAQ + G+FP E V LPG+GR+T Sbjct: 92 IATVAAAEQQQVLKLWQGLGYYSRARNLHQAAQLIQQEFAGQFPSQLEAVLKLPGIGRTT 151 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AG ILS + + ILDGNVKRVLAR A+ P K + LW S+++ FN Sbjct: 152 AGGILSSAFAQPVAILDGNVKRVLARLLALPVPPRKAKG--FLWQWSDRLLDRTQPREFN 209 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQH 240 QA+MDLGA +C KP C LCP N C A N + P + LP ++ G ++ Sbjct: 210 QALMDLGATVCVPKKPDCPLCPWSNHCQALQLNLQSELPVTETAAPLPHKSIGVAVIWND 269 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHT 293 E+L+ +RP GL GGL+ FP E R+ + I + L H Sbjct: 270 RGEILIDRRPQKGLLGGLWEFPGGKIEPGETVMACIQREIREELAIEIEVGEPLITIDHA 329 Query: 294 FSHFHLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 ++HF + + VS + W L + E+++ LR Sbjct: 330 YTHFKVTLNVHHCRYVSGEPQPLGCDEVRWVTLEEIDQYPFPKANEQIIAALR 382 >UniRef50_A7RIG3 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7RIG3_NEMVE Length = 470 Score = 381 bits (979), Expect = e-104, Method: Composition-based stats. Identities = 122/406 (30%), Positives = 177/406 (43%), Gaps = 63/406 (15%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 +L WYD R +LPW+ T Y VW+SE+MLQQTQVATV+ Y+ R Sbjct: 30 DIQLIRENLLRWYDINKR-SLPWRAYATEQDANIRAYAVWVSEIMLQQTQVATVVDYYNR 88 Query: 55 FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA- 113 +M +P++ LA A L+EV W+GLGYY+RAR LH+AA +V GK P ++ Sbjct: 89 WMKNWPSLEALARASLEEVNECWSGLGYYSRARRLHEAAIKVVNELDGKIPTNAAKLQKE 148 Query: 114 LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP 173 LPGVG TAGAI S++ G+ ++DGNV RVL+R + + WSL+ ++ P Sbjct: 149 LPGVGLYTAGAIASIAFGEATGVVDGNVIRVLSRLRRIGADMTSNTTMDHFWSLAHRLVP 208 Query: 174 AVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS------------------ 215 FNQAMM+ GA +CT P+CS C L++ C A + Sbjct: 209 NDRPGDFNQAMMEFGATLCTPKTPQCSKCVLRSSCQAHSQVEDFKDKFTKRITGERISAC 268 Query: 216 ----------------------WALYPGKKPKQTLPERTGYFLLLQHE------DEVLLA 247 YP K K+ E L++Q E LL Sbjct: 269 DQDIEDCSLCLPPSEPWNPEIGVCNYPMKPKKKEAREEVFTVLIVQEECNEDDSGNFLLV 328 Query: 248 QRPPSGLWGGLYCFPQFA----DEESLRQWLAQRQIA-ADNLTQLTAFRHTFSHFHLDIV 302 QRP SGL GL+ FP +E+ + + + + H FSH H V Sbjct: 329 QRPESGLLAGLWEFPNLEKEKINEDDVSALAKEYGLENVKKRNNVGELIHKFSHRHHKYV 388 Query: 303 PMWLPVSSFTGCMDEGNAL---WYNLAQPPSVGLAAPVERLLQQLR 345 + W N + S ++ ++++ + + Sbjct: 389 VELFSCKKVNTTETNASGQPMKWVNPEELDSSAISTAMKKVFKLYQ 434 >UniRef50_A6TW33 A/G-specific adenine glycosylase n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TW33_ALKMQ Length = 352 Score = 381 bits (978), Expect = e-104, Method: Composition-based stats. Identities = 110/351 (31%), Positives = 180/351 (51%), Gaps = 10/351 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS Q+++W+ + R +PW+ K PY +W+SE+MLQQT+V TVI Y++ FM +FP Sbjct: 1 MTEESFSYQLIEWFREEKRW-MPWRETKDPYCIWVSEIMLQQTRVETVISYYQNFMKKFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A +EVL W GLGYY+R RNLH+AA ++ +H G P+ + + LPG+G Sbjct: 60 TIETLARASQEEVLKSWEGLGYYSRGRNLHRAANEIVLIHEGNVPKDKKILLKLPGIGPY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAILS++ + P +DGNV RV++R + + +K+V N++ L Q+ P F Sbjct: 120 TAGAILSIAYNQKEPAVDGNVLRVMSRLFNIQEDIMEKKVVNEVTDLVFQLMPQDNGGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 +A+M+LGA +C KP+C LCP+ N C A + P + K + L + + Sbjct: 180 TEALMELGATVCVPQKPRCRLCPVHNQCKAHHLDIQETLPIRIKKTKVKNYHKGILWMVY 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQ-------WLAQRQIAADNLTQLTAFRHT 293 +L+ Q P GL GGL+ P + + + + L + +H Sbjct: 240 NGTILVKQNPQKGLLGGLWALPTIDLMHKVDEKAVIQEDFQEEVGQVVVELEYIGKEKHV 299 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 F+H + + + W + Q ++ +++ ++ Sbjct: 300 FTHQRWQMSIFKGRSNDHLRVKEP--YQWVPIGQLETLTFPIVYRKVIDKI 348 >UniRef50_A6LDG5 A/G-specific adenine glycosylase n=33 Tax=Bacteroidetes RepID=A6LDG5_PARD8 Length = 359 Score = 380 bits (976), Expect = e-104, Method: Composition-based stats. Identities = 114/352 (32%), Positives = 181/352 (51%), Gaps = 13/352 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + S +++WY+ Y R+ LPW+ + PY +W+SE++LQQT+V + YF RF RFP V Sbjct: 9 EISRILVEWYETYKRE-LPWRETRDPYIIWISEIILQQTRVVQGLEYFLRFTERFPDVAS 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A DEVL W GLGYY+RARNLH AA+ + G FPE ++EV +L G+G TA A Sbjct: 68 LAVAEEDEVLKYWQGLGYYSRARNLHAAAKSIMERFNGVFPENYKEVLSLKGIGEYTAAA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I+S + + P++DGNV RVL+R +AV + + + L+ + NQA+ Sbjct: 128 IVSFAWNQPCPVVDGNVYRVLSRLFAVDTPIDTTKGKKQFAELAGMILDPKNAGTHNQAI 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA+ C P C +CPL++ C+A A+ + YP K+ K +R ++L + ++++ Sbjct: 188 MELGALQCVPQNPDCGVCPLKDKCVAFASGNVQAYPVKQNKTKTRDRYFHYLYIIYKEQT 247 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLR----------QWLAQRQIAADNLTQLTAFRHTF 294 + +R +W GLY FP + ++ + L L+ +HT Sbjct: 248 WMNRRTGKDIWTGLYEFPLIETDHAMDFSGLCETQAFRNLLGDAGKLSITQGLSNVKHTL 307 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 SH L + + + GN L + + L++L Sbjct: 308 SHQILYASFYQIEIEQVPESL--GNYLSLPCRDIEKYAVPRLIHIYLEKLNG 357 >UniRef50_D2RMB6 A/G-specific adenine glycosylase n=2 Tax=Acidaminococcus RepID=D2RMB6_ACIFE Length = 352 Score = 379 bits (975), Expect = e-104, Method: Composition-based stats. Identities = 121/348 (34%), Positives = 182/348 (52%), Gaps = 8/348 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID--KTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 M+ + + +L W+D R+ LPW+ + + PY VW+SE+MLQQT+ TV YF+R+M + Sbjct: 1 MKRNDWPETLLAWFDGS-RRALPWREEHPRNPYHVWVSEIMLQQTRTETVKGYFQRWMEQ 59 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+ DLA AP ++VL W GLGYY+RARNLHKAA+QV GG+ P + + +L G+G Sbjct: 60 FPTIRDLAQAPEEQVLRAWQGLGYYSRARNLHKAARQVMAEWGGQLPRERKALGSLAGIG 119 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T GAILS++ G+ P +DGN+ RVL+R Y V + + + +L+E+ P Sbjct: 120 AYTVGAILSMAFGEKIPAVDGNLLRVLSRLYGVEEDISGTQGKKTITALAEEAIPGDRPG 179 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FN+A+MDLGA +C P+C CPL C A P KKPK E L+ Sbjct: 180 DFNEALMDLGAEVCIPRHPRCEACPLTAFCQAWKEGKTETLPVKKPKAPQKELAAACGLV 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQ-FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 E L +RP G+ ++ FP + + + + + RH F+H Sbjct: 240 VREGRYLFHKRPAKGMLASMWEFPMVLEPGAEKARSSLVKLLETEAGPAVWQHRHVFTHQ 299 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 ++ + + G G W++ + V LA P +L L+ Sbjct: 300 IWNMTAYVMERAEVPG----GEWSWFSPEEWEKVPLAGPHAKLADWLK 343 >UniRef50_A2RLW5 A/G-specific adenine glycosylase n=4 Tax=Lactococcus lactis RepID=A2RLW5_LACLM Length = 386 Score = 379 bits (975), Expect = e-104, Method: Composition-based stats. Identities = 122/380 (32%), Positives = 174/380 (45%), Gaps = 35/380 (9%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F +L WYD + LPW+ PYK+W+SE+M QQTQV TVIPYFERFM ++PT Sbjct: 7 KIKEFQKDLLSWYDANKK-PLPWRQTTEPYKIWISEIMSQQTQVETVIPYFERFMKKYPT 65 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A E+L LW GLGYY+RARNL AAQ+V + GKFP+ +E+ +L G+G T Sbjct: 66 VESLAQADDTELLKLWEGLGYYSRARNLKIAAQEVVNEYNGKFPDNLKEILSLRGIGPYT 125 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S+S P +DGN+ RV +R + + K + FN Sbjct: 126 AAAIASISFDLAEPAIDGNLMRVTSRIFELECDISKSSSRKIFDEHLRTLVSKKRPGDFN 185 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 Q +MDLG+++C+ KPKC CPL C A A+ YP K K E L++ Sbjct: 186 QGLMDLGSLVCSPKKPKCETCPLNKYCGAVADGKQLNYPVKTKKLKQKELYYTAFALENS 245 Query: 242 -DEVLLAQRPPSGLWGGLYCFPQFADEE------------SLRQWLAQRQIAADNLTQLT 288 E L +RP GL ++ FP S + + + + Sbjct: 246 LGEYYLEKRPSKGLLADMWTFPLTELSSVDFEKIITDGAISKNITMPELPESISKMEYFG 305 Query: 289 AFRHTFSHFHLDIVPMWLPVSSFTGCMDE-----GNALWYNLAQPPS------------- 330 F H FSH IV + + DE ++Y+ + Sbjct: 306 NFTHIFSHQKWHIVLIKVKAEEKFEIADELLTADKKWVYYDSKKLGENVVNLNQDLSKIS 365 Query: 331 ---VGLAAPVERLLQQLRTG 347 + L+ P ++ + L Sbjct: 366 RLKLPLSGPQVKMFELLDKK 385 >UniRef50_D1N1B7 A/G-specific adenine glycosylase n=2 Tax=Bacteria RepID=D1N1B7_9BACT Length = 356 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 115/353 (32%), Positives = 174/353 (49%), Gaps = 11/353 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 A +L+WYD++ R LPW+ PY+VW+SE+MLQQT+V V PY++RF+ P Sbjct: 5 SLQAIVAPLLEWYDRHAR-ILPWRESPEPYRVWISEIMLQQTRVEAVKPYYDRFLKELPD 63 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA A ++L LW GLGYY R RNL KAA+ + + +GG+FP E + +LPG+G T Sbjct: 64 LHALAEASEPQLLKLWEGLGYYNRVRNLQKAARVIVSEYGGEFPRDVETLRSLPGIGEYT 123 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S+S + P +DGNV RV++R A +V+ + QV PA F Sbjct: 124 AGAIASISFNRPEPAVDGNVLRVVSRLAASREDISSPKVKGAISDALRQVYPAGRCGDFT 183 Query: 182 QAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 Q++M+LGA +C + P+C+ CPL C A P K ++ LL+ Sbjct: 184 QSLMELGATVCLPNGAPRCAECPLAELCAGRREGIAAGLPVKPGRKPRRIEPRTVFLLRC 243 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 D + L +RP +GL GL+ FP A +W+A + I A+ + +H F+H Sbjct: 244 GDRIALLRRPGNGLLAGLWEFPNTAGSLTVAEAAEWMAGQGIFAERIIATGGAKHVFTHL 303 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPP-SVGLAAPVERLLQQLRTGAP 349 + + G W + + L ++ L + Sbjct: 304 EWHMSGFLADCK-----AENGGFHWVTRDELEREISLPTAFRPFVEALDAESA 351 >UniRef50_UPI0001BC503C adenine glycosylase n=1 Tax=Neisseria mucosa ATCC 25996 RepID=UPI0001BC503C Length = 359 Score = 379 bits (973), Expect = e-103, Method: Composition-based stats. Identities = 147/348 (42%), Positives = 196/348 (56%), Gaps = 11/348 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W ++GR LPWQ + PY VWLSE+MLQQTQVATV+ Y+ RF+ +FPTV L Sbjct: 7 FSERLIRWQKQHGRHHLPWQ-VQNPYCVWLSEIMLQQTQVATVLDYYPRFLEKFPTVQTL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP DEVL LW GLGYY+RARNLHKAA+QV G FP +++ L GVGRSTA AI Sbjct: 66 AAAPQDEVLSLWAGLGYYSRARNLHKAARQVVEEFDGTFPSERKDLETLCGVGRSTAAAI 125 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER--FNQA 183 + + + ILDGNVKRVL R +A G P K+ EN LW+L+E + P + + Q Sbjct: 126 CAFAFNRRETILDGNVKRVLCRVFARDGNPQDKKFENSLWTLAESLLPPENADMPAYTQG 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-ED 242 +MDLGA +C R+KP C CP+ + C A N A P KK + Y+L+++ + Sbjct: 186 LMDLGATVCKRTKPLCHQCPMADICEAKKQNRIAELPRKKTATEVQTLPLYWLIIRDQDG 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI-------AADNLTQLTAFRHTFS 295 +LL +RP G+WGGLYC P F + + I D+L++ H + Sbjct: 246 AILLEKRPAKGIWGGLYCVPCFEKLDEAYACAEKLGIFSECAASPWDDLSEQPTLTHRLT 305 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 H L I P SS + N LW GL P+ L+Q Sbjct: 306 HRLLMITPFEAQTSSSENTVLPPNCLWVKPENLTDYGLPKPLADYLKQ 353 >UniRef50_C4L361 A/G-specific adenine glycosylase n=1 Tax=Exiguobacterium sp. AT1b RepID=C4L361_EXISA Length = 344 Score = 378 bits (972), Expect = e-103, Method: Composition-based stats. Identities = 115/346 (33%), Positives = 179/346 (51%), Gaps = 18/346 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F+ +++ W+++ R LPW+ K PY+VW+SEVMLQQT+V TVIPY+ RFM RFPT Sbjct: 12 NIHHFNEELVTWFNREKR-DLPWRHAKNPYRVWVSEVMLQQTRVDTVIPYYNRFMERFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + DLA A DEV+ W GLGYY+R RNLH+A ++VA+++ G PE E L GVG T Sbjct: 71 LEDLAAADTDEVVKYWEGLGYYSRVRNLHEAVKEVASVYEGIVPEEKERFEKLKGVGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GA+LS++ + P +DGNV RV++R + + + + ++ FN Sbjct: 131 TGAVLSIAYNQPEPAVDGNVMRVMSRQFGIYDDIAMPKTRKIFEQVVRRLMDPAHASDFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 + +M+LGA +CT P CSLCP+Q+ C A A++ P K K L ++ + Sbjct: 191 EGVMELGATVCTPKNPMCSLCPVQDTCYAYAHHVQDELPVKTKKGAARIEMYDALCIEKD 250 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +V QR GL G++ +P + N T L +H FSH I Sbjct: 251 GKVAYEQRADKGLLAGMWQYPLMDRGTGEQL----------NGTYLGQVKHVFSHIVWYI 300 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + + +W + ++ ++ ++ L++L G Sbjct: 301 DLYRVE------TLPTDDVVWLDASEREVKTVSVAQQK-LERLAGG 339 >UniRef50_C0QGE4 MutY n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QGE4_DESAH Length = 364 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 117/350 (33%), Positives = 175/350 (50%), Gaps = 9/350 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L WY + R+ LPW+ PY +W+SEVMLQQTQV TV+PY+ RFM FPTV Sbjct: 3 EIQTALLKWYHTHHRE-LPWRKTANPYSIWVSEVMLQQTQVKTVVPYYGRFMEAFPTVAS 61 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A L++VL LW GLGYYARARN HKA Q V T G P + ALPGVG A A Sbjct: 62 LARADLEQVLKLWEGLGYYARARNFHKACQTVTTDLKGIIPRDLKGFKALPGVGDYIAAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS++ ++DGNVKRVLAR + + K + ++ + FNQA+ Sbjct: 122 VLSIAFNIPLAVVDGNVKRVLARVFTMDDPVNHGPSHKKFQAKADLILDRSCPGAFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA++C+ P C++CPL C A ++S YP + + +P +++ D++ Sbjct: 182 MELGALVCSPRNPGCTICPLGQYCRALESDSVDKYPLRNKTKPVPTVHIAAGVVKKGDKL 241 Query: 245 LLAQRPPSGLWGGLYCFPQFA-------DEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ R P GL GGL+ FP +R+ + + + L +H ++HF Sbjct: 242 LITLRKPEGLLGGLWEFPGGKVKTGEQASSACVRELAEETGLRVAVTSHLARVKHAYTHF 301 Query: 298 HLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 +++ +S + W + + + L Sbjct: 302 KIEMDIFNCQYISGNVRLNGPVDHRWIFPHEIRQYPFPKANLKFIPLLEN 351 >UniRef50_C0EDR1 Putative uncharacterized protein n=2 Tax=Clostridium RepID=C0EDR1_9CLOT Length = 365 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 131/345 (37%), Positives = 191/345 (55%), Gaps = 6/345 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +Q +L WYD R LPW+ TPY+VW+SE+MLQQT+V+ V+PY+ERF++ PTV Sbjct: 19 AQIPPLLLTWYDHSAR-ILPWRQQPTPYRVWVSEIMLQQTRVSAVMPYYERFLSALPTVG 77 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+AP + +L LW GLGYY R RN+ KAA+ V HGG+ P +FEE+ LPG+G TAG Sbjct: 78 ALADAPEEVLLKLWEGLGYYNRVRNMQKAARAVMEQHGGELPASFEELVKLPGIGEYTAG 137 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ S++ G P +DGNV R+L+R V+ +L +Q+ PA V FNQA Sbjct: 138 AVASIAYGLRVPAVDGNVLRILSRWLLSRADVTMPPVKRAYQALVQQMLPAERVGDFNQA 197 Query: 184 MMDLGAMICTRSK-PKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M+LGA +C + P C CP+ C A A P K PK+ E LL+ E Sbjct: 198 LMELGATVCLPNGDPLCESCPVAGLCRARERGCAAELPVKAPKKPRREEKRTILLVVSEG 257 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +VLL +RP +GL GL+ + S ++ A + ++ +L A +H FSH + Sbjct: 258 KVLLTKRPANGLLAGLWEYLNLEGHLSAQEAAAHCGVPVSSVRRLGAAKHIFSHIEWKMR 317 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQLRT 346 + VS FT +G +W + Q + + ++ + LR Sbjct: 318 GYLIQVSGFTP---QGECVWADQLQFEEQYSVPSAFQQYTKLLRQ 359 >UniRef50_Q2YCY4 A/G-specific DNA-adenine glycosylase n=4 Tax=Nitrosomonadaceae RepID=Q2YCY4_NITMU Length = 383 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 149/361 (41%), Positives = 216/361 (59%), Gaps = 15/361 (4%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 A+ F+ +++ W ++GR LPWQ + Y +WLSE+MLQQTQV TVIPY+ RF+ FP + Sbjct: 12 ANSFATRLIRWQREHGRHHLPWQNTRDAYSIWLSEIMLQQTQVGTVIPYYRRFLQCFPDI 71 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+APLDEV+ W+GLGYY+RARNLHKAAQ++ HGG FPE + LPG+GRSTA Sbjct: 72 QSLASAPLDEVMVQWSGLGYYSRARNLHKAAQRIVGEHGGIFPEEVAIIRQLPGIGRSTA 131 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV------- 175 AI + GK ILDGNVKR+L+RC+ + G+PG+K+VE +LW +E + P Sbjct: 132 AAIAVFAFGKRAAILDGNVKRILSRCFGIEGYPGEKQVEAQLWQKAEALLPKGDESPIER 191 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 +E + QA+MDLGA IC R++P C CPL+ C+A +N P +P++ LPER Sbjct: 192 DIEGYTQALMDLGATICIRARPMCGSCPLRLECVAFRDNRAGSLPTPRPRKILPEREAVL 251 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 LL + ++LL +RP +G+WG L+ P+ E++ ++ + I ++Q+ A HTF+ Sbjct: 252 LLAVAQGKILLEKRPSTGIWGALWSLPEMGMNENVIEYCLRFGINVRPMSQMEALTHTFT 311 Query: 296 HFHLDIVPMWLPVSSFTG--------CMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 HF L I P+ L V S +W + +PV ++L Q + Sbjct: 312 HFRLRIYPLILQVISRPPDHLTPEVLSQPRRPCVWRMPEDALKAAIPSPVRKVLLQYASQ 371 Query: 348 A 348 Sbjct: 372 T 372 >UniRef50_C5VJP5 A/G-specific adenine glycosylase n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VJP5_9BACT Length = 334 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 122/343 (35%), Positives = 176/343 (51%), Gaps = 13/343 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L W+ GR +LPW+ K Y +WLSEV+LQQT++ + Y+ERFMA++PTV DL Sbjct: 3 FAVTLLQWFKNNGR-SLPWRETKDAYAIWLSEVILQQTRIVQGMSYWERFMAKWPTVNDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A +EVL W GLGYY+RARNLH AAQQV L G FP+TF+E+ L GVG TA AI Sbjct: 62 AAATENEVLKAWQGLGYYSRARNLHTAAQQVMEL--GGFPQTFKELKTLKGVGDYTAAAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ G+ ++DGNV RVL+R Y + E + + +L++ + P +N+A+M Sbjct: 120 ASIAFGEPVAVVDGNVYRVLSRYYGIETPIDSTEGKKEFQTLAQSLLPINEPADYNEAIM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA CT + P CS CPL C+A P K K ER +L +++E ++ Sbjct: 180 DFGATQCTPNSPHCSACPLCETCVAFREQRIDELPVKSKKVKQRERHFTYLCIEYEGKIA 239 Query: 246 LAQRPPSGLWGGLYCFPQFA--DEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 + QR +W GL+ FPQ W + Q+ +H +H L Sbjct: 240 IHQRGAGDIWQGLWEFPQEEHLTSSEDSAWKTEAQLLQKG------VKHILTHQILLADI 293 Query: 304 MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 +W + L +E LL+ + Sbjct: 294 YL--WQPTRRPQLPSEFIWIEKQDLENYALPRLIEILLKAVPA 334 >UniRef50_P44320 A/G-specific adenine glycosylase n=308 Tax=Gammaproteobacteria RepID=MUTY_HAEIN Length = 378 Score = 376 bits (967), Expect = e-103, Method: Composition-based stats. Identities = 210/360 (58%), Positives = 252/360 (70%), Gaps = 24/360 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ VL WYDK+GRK LPWQ +KT Y VWLSEVMLQQTQVATVIPYFERF+ FP +T L Sbjct: 11 FAKSVLAWYDKFGRKHLPWQQNKTLYGVWLSEVMLQQTQVATVIPYFERFIKTFPNITAL 70 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA DEVLHLWTGLGYYARARNLHKAAQ+V G FP FE+V AL GVGRSTAGAI Sbjct: 71 ANASQDEVLHLWTGLGYYARARNLHKAAQKVRDEFNGNFPTNFEQVWALSGVGRSTAGAI 130 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS L + +PILDGNVKRVLAR +AV GW G+K+VEN+LW+L+EQVTP V FNQAMM Sbjct: 131 LSSVLNQPYPILDGNVKRVLARYFAVEGWSGEKKVENRLWALTEQVTPTTRVADFNQAMM 190 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GAM+C R+KPKC LCPL C+A N +W +P KKPK+ +PE+T YFL+L +V Sbjct: 191 DIGAMVCMRTKPKCDLCPLNIDCLAYKNTNWEKFPAKKPKKAMPEKTTYFLILSKNGKVC 250 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR SGLWGGL+CFPQF D+ SL +LAQ + + + +FRHTFSHFHLDI P++ Sbjct: 251 LEQRENSGLWGGLFCFPQFEDKSSLLHFLAQEK--VTHYQEWPSFRHTFSHFHLDIHPIY 308 Query: 306 LPVSSFTGCMDEGNA---------------------LWYNLAQPPSVGLAAPVERLLQQL 344 + S T C+++ N WY+ P +GLA PV+ LL Q Sbjct: 309 AEMES-TLCVEQANLDWRKVMESTKEYQSNLSSAVKYWYDPQNPEPIGLAQPVKNLLIQF 367 >UniRef50_A8PLA9 A/G-specific adenine glycosylase n=1 Tax=Rickettsiella grylli RepID=A8PLA9_9COXI Length = 354 Score = 375 bits (964), Expect = e-102, Method: Composition-based stats. Identities = 162/349 (46%), Positives = 222/349 (63%), Gaps = 5/349 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + +QF ++L W+ + GRK LPWQ ++PY WLSE+MLQQTQV TVIPYF+RF FP Sbjct: 3 LTPAQFQKKILTWFHQSGRKHLPWQQVQSPYFTWLSEIMLQQTQVTTVIPYFQRFTHHFP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T++ LANA LDEV+ LW+GLGYYARAR+LH+ AQ + + GKFP+ + LPG+GRS Sbjct: 63 TLSSLANASLDEVIRLWSGLGYYARARHLHRCAQIIEEKYKGKFPQALILLQNLPGIGRS 122 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVER 179 TAGAI +L+ + ILDGNVKRV +R + +SGWPG V +LW+L+E+ TP V Sbjct: 123 TAGAIRALAFNQPAAILDGNVKRVFSRFHTLSGWPGLTHVNKQLWTLAERYTPHNKHVRH 182 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY-FLLL 238 + QAMMDLGA+ICT +C+ CPLQ C A + YP KP + P R +L+ Sbjct: 183 YTQAMMDLGALICTPKHAQCTECPLQRHCKAYKEDCVIHYPSSKPVKMNPSRAITMVILI 242 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQ--IAADNLTQLTAFRHTFSH 296 E+LL +RPP G+WGGL+ P+ E ++ + + ++ +HTFSH Sbjct: 243 NARQEILLEKRPPRGIWGGLWSLPECTITEKIKNFCKKNYHCETEKSIKNYPFLKHTFSH 302 Query: 297 FHLDIVPMWLPVSSFTGCMDEGN-ALWYNLAQPPSVGLAAPVERLLQQL 344 F L+I P+ + V ++ E N +WYN Q S+GLAAPV++LL QL Sbjct: 303 FQLEIRPVAIRVKKWSPPQMESNPIVWYNKQQLASIGLAAPVKKLLNQL 351 >UniRef50_C6LA88 A/G-specific adenine glycosylase n=20 Tax=Clostridiales RepID=C6LA88_9FIRM Length = 365 Score = 374 bits (962), Expect = e-102, Method: Composition-based stats. Identities = 112/351 (31%), Positives = 172/351 (49%), Gaps = 9/351 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +L W+D R LPW+ TPY+VW+SE+MLQQT+V V P+F+RF P Sbjct: 6 ELESIVQPLLAWFDANAR-VLPWRDSPTPYRVWVSEIMLQQTRVEAVKPFFQRFTEALPD 64 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA +++L LW GLGYY R RN+ KAAQ V +GG+ P +E++ L G+G T Sbjct: 65 VAALAACEEEKLLKLWEGLGYYNRVRNMQKAAQTVMEEYGGELPADYEKLLKLKGIGSYT 124 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ P +DGNV RV++R A K V+ ++ ++ P FN Sbjct: 125 AGAIASIAFQIPVPAVDGNVLRVISRITASEKDILKASVKKEVEDEIREIIPPERAGAFN 184 Query: 182 QAMMDLGAMICTRSKP-KCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+M+LGA++C + P KC CPL C+A + P K + + L+++ Sbjct: 185 QALMELGAVVCVPNGPAKCDACPLYGQCLARERGIVSSLPKKSAAKPRRVQERTVLIIRD 244 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + V + +RPP GL GLY P +E + + L ++ + + L +H FSH Sbjct: 245 GERVAIHKRPPKGLLAGLYELPNVEGHLGQEEVLKLLKKKHFSPIRIQPLEEAKHIFSHV 304 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQLRTG 347 + + V E L+ + + A + L+ Sbjct: 305 EWHMTGYVITVEE---PERETEYLFVEPRETEENYPVPAAFSAYTKYLQIK 352 >UniRef50_Q5WIF8 A/G-specific adenine DNA glycosylase n=11 Tax=Bacillales RepID=Q5WIF8_BACSK Length = 385 Score = 374 bits (961), Expect = e-102, Method: Composition-based stats. Identities = 118/345 (34%), Positives = 177/345 (51%), Gaps = 8/345 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F Q+++WY + R+ LPW+ PY +W+SE+MLQQT+V TVIPY+E+FM +FP + Sbjct: 37 SDFRRQLIEWYQAHKRE-LPWRESNDPYHIWVSEIMLQQTRVDTVIPYYEQFMRKFPEME 95 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A +E+L +W GLGYY+R RNL A ++V +G P+T +E+ L GVG TAG Sbjct: 96 DLAYAEEEEILKVWEGLGYYSRVRNLQAAVREVVEHYGSVVPDTRKEIEQLKGVGPYTAG 155 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS++ K P +DGNV RVL+R + + GK + K ++ ++ FNQ Sbjct: 156 AILSIAYAKAEPAVDGNVMRVLSRVFCMEDDIGKPQTRKKHEAILYELIDKSDPSSFNQG 215 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY-FLLLQHED 242 +M+LGA++CT + P C LCP++ C+A P K K+ FLL + Sbjct: 216 LMELGALVCTPTSPGCLLCPVRTQCLAYERGQQERLPIKMKKKKAKSIELESFLLKTEKG 275 Query: 243 EVLLAQRPPSGLWGGLYCFPQFA----DEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 E+L+ +RP GL GL+ P E ++ + I A+ +H FSH Sbjct: 276 ELLIEKRPDKGLLAGLWQLPVLEGRFDPGERQQKLAEKYHIEAEPSAANFQVKHIFSHLI 335 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 +I + N + ++LL Sbjct: 336 WEIELYVGRANR--NGELPANCRIIKAEELEQYAFPVSQQKLLNH 378 >UniRef50_A1KUL5 Adenine glycosylase n=32 Tax=Neisseriaceae RepID=A1KUL5_NEIMF Length = 349 Score = 374 bits (960), Expect = e-102, Method: Composition-based stats. Identities = 149/340 (43%), Positives = 198/340 (58%), Gaps = 8/340 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +++ W ++GR LPWQ K PY VWLSE+MLQQTQVATV+ Y+ RF+ +FPTV L Sbjct: 10 FSERLIRWQKQHGRHHLPWQ-VKNPYCVWLSEIMLQQTQVATVLDYYPRFLEKFPTVQTL 68 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP DEVL LW GLGYY+RARNLHKAAQQV GG FP +++ L GVGRSTA AI Sbjct: 69 AAAPQDEVLSLWAGLGYYSRARNLHKAAQQVVRQFGGTFPSERKDLETLCGVGRSTAAAI 128 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER--FNQA 183 + S + ILDGNVKRVL R +A G P K+ EN LW+L+E + P+ + + Q Sbjct: 129 CAFSFNRRETILDGNVKRVLCRVFARDGNPQDKKFENSLWTLAESLLPSENADMPAYTQG 188 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE-D 242 +MDLGA +C R+KP C CP+ + C A N A P KK +P Y+L++++ Sbjct: 189 LMDLGATVCKRTKPLCHQCPMADICEAKKQNRTAELPRKKTAAEVPTLPLYWLIIRNRNG 248 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +LL +RP G+WGGLYC P F L + A+ + ++ + TA H +H L I Sbjct: 249 AILLEKRPAKGIWGGLYCVPCFESLNGLSDFAAKFSLTMADMDEQTALTHRLTHRLLMIT 308 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 P + + +W GL P+E L Sbjct: 309 PFEGQM----PSESPSDGIWIKPTHLKDYGLPKPLEIYLN 344 >UniRef50_A0LLV9 A/G-specific DNA-adenine glycosylase n=1 Tax=Syntrophobacter fumaroxidans MPOB RepID=A0LLV9_SYNFM Length = 388 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 113/355 (31%), Positives = 184/355 (51%), Gaps = 11/355 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 Q +L W+D+ R LPW+ PY++W+SE+MLQQTQV T++PYF R+M RFP V Sbjct: 9 QIQTLLLSWFDENQR-PLPWREKYRPYEIWISEIMLQQTQVKTMLPYFRRWMERFPDVQS 67 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 +A+A DEVL W GLGYY+RA N+ + A+ + HGG FP+ + +PG+G TAGA Sbjct: 68 IADAREDEVLKHWEGLGYYSRAVNIRRTAEIIVRHHGGTFPKAHSTILGMPGIGPYTAGA 127 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ + P++DGNV+R+LAR + + +K +W+ +E++ PA +FNQA+ Sbjct: 128 ISSIAFNEDRPLVDGNVERILARLFNLDTPVEEKNTRKFIWNTAEELIPAGRARQFNQAL 187 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLGA +C +P C CPL C + + P ++ + +L H V Sbjct: 188 MDLGATVCLPRRPACEKCPLNGLCESRRMGTADRRPVTNRRKDIASIEVAVGILHHRGRV 247 Query: 245 LLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ +RP SGL L+ FP + ++ +R++ + ++ +L + RH ++ F Sbjct: 248 LIQKRPASGLMPNLWEFPGGKIHPGESPEQALIREFREELELEVRCRERLASIRHNYTSF 307 Query: 298 HLDIVPMWLPVSSFTGC---MDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 + + + A W + + A +L+ + P Sbjct: 308 RVLLHAFLCRPADSRPRPVLRSAVEARWVVVEELDQYAFPAANRKLIDLVSGRKP 362 >UniRef50_B0RRL4 A/G-specific adenine glycosylase n=15 Tax=Gammaproteobacteria RepID=B0RRL4_XANCB Length = 430 Score = 373 bits (958), Expect = e-102, Method: Composition-based stats. Identities = 160/350 (45%), Positives = 215/350 (61%), Gaps = 6/350 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + F ++L W+D +GR LPWQ + PY+VWLSE+MLQQTQVA VIPYF++F+A FP Sbjct: 81 LTTDAFVDRLLHWFDGHGRHDLPWQHPRAPYRVWLSEIMLQQTQVAVVIPYFQKFVASFP 140 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ DLA A D V+ W GLGYYARARNLH AA+Q LH G+ P F+ + ALPG+GRS Sbjct: 141 TLADLAAADNDTVMAHWAGLGYYARARNLHAAAKQCVALHAGELPRDFDALLALPGIGRS 200 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSE---QVTPAVGV 177 TAGAILS + FPI+DGNVKRVL R + ++G+PG VE +LW L+ PA + Sbjct: 201 TAGAILSQAWNDRFPIMDGNVKRVLTRIHGIAGYPGLPVVEKQLWQLAANHVAHVPAGRL 260 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 + QA MD GA +CTR++P C +CPLQ C+A P KP + LPER LL Sbjct: 261 ADYTQAQMDFGATLCTRARPACMVCPLQENCVARREGLVEALPTPKPGKQLPEREATALL 320 Query: 238 LQH-EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQR-QIAADNLTQLTAFRHTFS 295 L++ +E+LL +RPP+G+W L+ PQ + LR+W A D ++ HTFS Sbjct: 321 LENAHNEILLQRRPPTGIWASLWTLPQAETDSDLREWFAAHIDGDYDRADEMPMIVHTFS 380 Query: 296 HFHLDIVPMWLP-VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 H+ L + P+ L V+ D + W A ++GL AP+ +LL L Sbjct: 381 HYRLHLQPLRLRKVALRQVLRDNDDLRWVARADLATLGLPAPIRKLLDAL 430 >UniRef50_A4VV79 A/G-specific DNA glycosylase n=98 Tax=Lactobacillales RepID=A4VV79_STRSY Length = 410 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 23/364 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + F +LDWYD R LPW+ K PY +W+SE+MLQQT+V TVIPY+ERF+ PT Sbjct: 40 KIQAFRKALLDWYDANKR-DLPWRRTKDPYAIWVSEIMLQQTRVDTVIPYYERFLHHLPT 98 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 ++DLA AP + +L LW GLGYY+R RN+ KAAQQ+ G+FP T +++L G+G T Sbjct: 99 ISDLAQAPEEVILKLWEGLGYYSRVRNMQKAAQQMVEDFDGQFPTTHAAISSLKGIGPYT 158 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ P +DGNV RVL+R + V G ++ E + + FN Sbjct: 159 AGAISSIAFNLPEPAVDGNVMRVLSRLFEVDYDIGLPANRKIFQAMMELLIDSERPGDFN 218 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL-PERTGYFLLLQH 240 QA+MDLG+ I + P+ P++ A N + YP K PK+ P FL+ Sbjct: 219 QALMDLGSDIESPVNPRPQDSPVKAFSAAYLNGTMDKYPIKAPKKKPIPVAYQGFLIRNK 278 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA-------------------DEESLRQWLAQRQIAA 281 +++ LL + +GL G + FP + + + + Sbjct: 279 DNQFLLEKNNEAGLLSGFWSFPLLEKGAIVDKQVSLFEVAEEVVQPDIRQSFTELYGLTV 338 Query: 282 DNL-TQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 D + +H FSH I + + + W ++ P+ A P +++ Sbjct: 339 DWQEQEFGIVQHIFSHRKWQIEMVEGVAETLNLP-ESKELKWVSVEDFPTYPFAKPQQKM 397 Query: 341 LQQL 344 + Sbjct: 398 WENF 401 >UniRef50_C2H8F6 A/G-specific adenine glycosylase n=16 Tax=Enterococcus RepID=C2H8F6_ENTFC Length = 392 Score = 372 bits (956), Expect = e-101, Method: Composition-based stats. Identities = 115/372 (30%), Positives = 174/372 (46%), Gaps = 30/372 (8%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F Q + WY++ R LPW+ ++ PY++W+SE+MLQQT+V TVI YF RFM FPT Sbjct: 12 ETKEFQDQFIQWYEQEKR-NLPWRYNRDPYRIWISEIMLQQTRVDTVIDYFYRFMEWFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + +LA AP +++L W GLGYY+RARN+ AA+Q+ + GK P+T EE+++L G+G T Sbjct: 71 IEELATAPEEKLLKAWEGLGYYSRARNIQAAAKQIMSEFDGKMPQTPEEISSLKGIGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GAI S++ G P +DGNV RV++R + + K ++ FN Sbjct: 131 TGAIASIAFGLPEPAVDGNVMRVVSRLFCIEADIAKASSRKIFDEAMRKIIDETYPGEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ-H 240 QAMMDLG+ ICT + PKC CP+Q C+A +P K K LQ H Sbjct: 191 QAMMDLGSAICTPTSPKCEACPIQAFCLANKRGIQTSFPVKTKKAKPKNVYYISAALQNH 250 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFAD--------------------------EESLRQWL 274 +R L ++ FP E+ Sbjct: 251 SGAYYFEERDSQKLLANMWTFPMVEVTQEEYERLKKEWEAKQEVDLFDDLVAEDGKELPF 310 Query: 275 AQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCM--DEGNALWYNLAQPPSVG 332 ++++ L H FSH ++ + + + + W A SV Sbjct: 311 EKQELFIWQTRHLGEVTHVFSHLKWHVLLFYGRATEEAEQEFTENKTSKWLKPAAFDSVV 370 Query: 333 LAAPVERLLQQL 344 +L++QL Sbjct: 371 FPKVQMKLVEQL 382 >UniRef50_Q9UIF7 A/G-specific adenine DNA glycosylase n=51 Tax=Chordata RepID=MUTYH_HUMAN Length = 546 Score = 371 bits (954), Expect = e-101, Method: Composition-based stats. Identities = 134/412 (32%), Positives = 193/412 (46%), Gaps = 65/412 (15%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI--------DKTPYKVWLSEVMLQQTQVATVIPYFE 53 + + F +L WYD+ R LPW+ D+ Y VW+SEVMLQQTQVATVI Y+ Sbjct: 89 EVTAFRGSLLSWYDQEKR-DLPWRRRAEDEMDLDRRAYAVWVSEVMLQQTQVATVINYYT 147 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 +M ++PT+ DLA+A L+EV LW GLGYY+R R L + A++V GG P T E + Sbjct: 148 GWMQKWPTLQDLASASLEEVNQLWAGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQ 207 Query: 114 L-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 L PGVGR TAGAI S++ G+ ++DGNV RVL R A+ P V +LW L++Q+ Sbjct: 208 LLPGVGRYTAGAIASIAFGQATGVVDGNVARVLCRVRAIGADPSSTLVSQQLWGLAQQLV 267 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL-------------- 218 FNQA M+LGA +CT +P CS CP+++ C A Sbjct: 268 DPARPGDFNQAAMELGATVCTPQRPLCSQCPVESLCRARQRVEQEQLLASGSLSGSPDVE 327 Query: 219 ---------------------------YPGKKPKQTLPERTGYFLLLQHED----EVLLA 247 +P K ++ E + +L+ ++LL Sbjct: 328 ECAPNTGQCHLCLPPSEPWDQTLGVVNFPRKASRKPPREESSATCVLEQPGALGAQILLV 387 Query: 248 QRPPSGLWGGLYCFPQFADEESL----RQWLAQRQ-----IAADNLTQLTAFRHTFSHFH 298 QRP SGL GL+ FP E S + L + Q + A +L L HTFSH Sbjct: 388 QRPNSGLLAGLWEFPSVTWEPSEQLQRKALLQELQRWAGPLPATHLRHLGEVVHTFSHIK 447 Query: 299 LDIVPMWLPVSSFTGCMD-EGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 L L + T A W + + ++ ++++ + + P Sbjct: 448 LTYQVYGLALEGQTPVTTVPPGARWLTQEEFHTAAVSTAMKKVFRVYQGQQP 499 >UniRef50_O31584 Probable A/G-specific adenine glycosylase yfhQ n=8 Tax=Bacillaceae RepID=YFHQ_BACSU Length = 369 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 114/351 (32%), Positives = 169/351 (48%), Gaps = 13/351 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 QF ++ W+++ R LPW+ D+ PYKVW+SEVMLQQT+V TVIPYF RF+ +FPT Sbjct: 12 DIQQFRDDLISWFEREQR-VLPWREDQDPYKVWVSEVMLQQTRVETVIPYFLRFVEQFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A ++VL W GLGYY+R RNL A ++V +GG P ++ L GVG T Sbjct: 71 VEALADADEEKVLKAWEGLGYYSRVRNLQSAVKEVKQEYGGIVPPDEKDFGGLKGVGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GA+LS++ K P +DGNV RV++R ++ K + FN Sbjct: 131 KGAVLSIAYNKPIPAVDGNVMRVMSRILSIWDDIAKPKTRTIFEDAIRAFISKEKPSEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 Q +M+LGA+ICT P C LCP+Q C A + P K K+ +T ++L E Sbjct: 191 QGLMELGALICTPKSPSCLLCPVQQHCSAFEEGTERELPVKSKKKKPGIKTMAAIVLTDE 250 Query: 242 -DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQ--------RQIAADNLTQLTAFRH 292 +V + +RP GL L+ FP ++ ++ Q I AD H Sbjct: 251 DGQVYIHKRPSKGLLANLWEFPNLETQKGIKTEREQLIAFLENEYGIQADISDLQGVVEH 310 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 F+H +I + V + + +++ + Sbjct: 311 VFTHLVWNISVFFGKVKQVSDTSKLKK---VTKEELEQFAFPVSHQKIWKM 358 >UniRef50_Q1DAW2 A/G-specific adenine glycosylase n=1 Tax=Myxococcus xanthus DK 1622 RepID=Q1DAW2_MYXXD Length = 390 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 128/346 (36%), Positives = 182/346 (52%), Gaps = 8/346 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +LDWYD+ R LPW+ + Y +WLSEVMLQQTQV+TVIPY+ERF+ARFPT Sbjct: 35 ASVRGPLLDWYDRNKR-DLPWRRTRDSYAIWLSEVMLQQTQVSTVIPYWERFLARFPTAR 93 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+APLD+VL W GLGYY+RARNLH+AAQ+V GG P T E+ LPG GR TAG Sbjct: 94 ALASAPLDDVLAGWKGLGYYSRARNLHRAAQEVVARFGGTLPSTAAELLELPGFGRYTAG 153 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+ S++ G+ P++DGNV RV +R + V G PG ++ E LW+L+ + FNQA Sbjct: 154 AVASIAFGEEAPLVDGNVARVFSRIFEVEGLPGDRQREATLWALATALVKGERPGDFNQA 213 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+ GA C P C LCP++ C+A P K + T + T + H Sbjct: 214 LMEHGATTCRPENPLCLLCPVRGACVAFRKGRVDELPPAKVRATPKKLTLAVAVWPHAGT 273 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQ------RQIAADNLTQLTAFRHTFSHF 297 +L A+R +GL+GGL+ P + A+ + L + +H Sbjct: 274 LLFARRADAGLFGGLWELPAAEIADDAPDSEARARLSAALGVDVTLEGALGTVKRQLTHR 333 Query: 298 HLDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQ 342 L + + + W +G++ ++R L Sbjct: 334 DLSLRLLRVSGPRRPASTPAFQELRWCAPDDAEKLGMSTAMQRALD 379 >UniRef50_A3ZT46 A/G-specific adenine glycosylase n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZT46_9PLAN Length = 358 Score = 370 bits (950), Expect = e-101, Method: Composition-based stats. Identities = 138/350 (39%), Positives = 187/350 (53%), Gaps = 10/350 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F Q+L WY R LPW+ D+ PY+VW+SE+MLQQTQVATV YFERF A FPTVT Sbjct: 8 RKFQRQILAWYGGAAR-DLPWRADRDPYRVWISEIMLQQTQVATVRAYFERFSAAFPTVT 66 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A EVL LW GLGYY RAR LH AAQ +A H GKFP F + ALPGVGR TAG Sbjct: 67 DLAAADEAEVLRLWEGLGYYRRARQLHAAAQVIADEHRGKFPREFAAILALPGVGRYTAG 126 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ + PIL+ N R+ AR A P K + LW +E + P V FNQA Sbjct: 127 AICSIAYDQPAPILEANTIRLHARLLAYREDPTKTAGQRLLWQFAEHILPTEDVSSFNQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LG+ ICT P+C +CP+ C A N+ A P K K +RT ++++ + E Sbjct: 187 LMELGSEICTPRNPQCGVCPVATLCQAKRENAVAEIPVPKKKMQYEDRTEVAIIVRRKKE 246 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLR--------QWLAQRQIAADNLTQLTAFRHTFS 295 VLL Q W GL+ FP+F Q + ++A ++LT +H + Sbjct: 247 VLLRQCGADERWAGLWDFPRFQVASEAELLSGELASQLEERTGLSAAIGSRLTTIKHGVT 306 Query: 296 HFHLDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + + D +W + + L+ ++ Q L Sbjct: 307 KYRITLHCHEAEGMTGRLRKDATAPLVWAAVDKLADFALSTTGRKIAQLL 356 >UniRef50_B9XFP7 A/G-specific adenine glycosylase n=1 Tax=bacterium Ellin514 RepID=B9XFP7_9BACT Length = 392 Score = 369 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 115/360 (31%), Positives = 185/360 (51%), Gaps = 18/360 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++ ++ +L W+ + R LPW+ + PY +W+SE+MLQQTQV TVIPY+E +M P Sbjct: 34 LKIHALASALLHWFSQSAR-DLPWRRTRDPYAIWVSEIMLQQTQVKTVIPYWEHWMQNLP 92 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A + + LW GLGYY R RN+ +AAQ++ + H G FP F+ + AL G+GR Sbjct: 93 TIQSLAEAAPERIHKLWEGLGYYTRVRNMQQAAQEIMSRHDGSFPSDFDSILALKGIGRY 152 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-------- 172 TAGAI S++ + P+LDGNV RVL R + ++ P K +LWSL+E + Sbjct: 153 TAGAIASIAFNQPKPLLDGNVIRVLTRLFGIAENPRDKITNEQLWSLAEALVVSASNSNN 212 Query: 173 ---PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 NQ++M+LGA+ICT +P+C +CP++ CIA P + Sbjct: 213 NKSHPSACSHLNQSLMELGALICTPRQPQCLICPVRQDCIAHHTGKVESLPNLGERTAAT 272 Query: 230 ERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL----- 284 +R + +++H+ L+ QRP + L+ FP L QI L Sbjct: 273 QRKFFAFVIEHQHRFLVTQRPAGVVNAHLWEFPNLEVNPDLPHPTPATQIDPLRLTLHSP 332 Query: 285 TQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 T L +HT + + + + +++ W LA+ + + +++L++L Sbjct: 333 TPLCTIKHTITRYRITLQVFRATLAN-PDTQTHLPTQWKTLAELHQLPFPSAHKKILKKL 391 >UniRef50_D1BPM8 A/G-specific adenine glycosylase n=3 Tax=Veillonella RepID=D1BPM8_VEIPT Length = 365 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 107/355 (30%), Positives = 166/355 (46%), Gaps = 19/355 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++ Q+L WYD + R+ LPW+ PYK+W+SEVM QQT++ + PY++ +M FPT+ Sbjct: 7 PKWVPQLLAWYDVHKRE-LPWRDCGDPYKIWVSEVMSQQTRIEAMKPYYDNWMRLFPTLE 65 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA A DEV+H W GLGYY+RARNL + V +GG P + + +L GVG TAG Sbjct: 66 DLAKASEDEVVHAWQGLGYYSRARNLRLGVKDVVENYGGIVPHDRKTMESLKGVGSYTAG 125 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+LS++ + +DGNV R+ AR Y + + + + ++ E+ P FNQA Sbjct: 126 AVLSMAYNEPEAAVDGNVLRIYARLYRIFDDILSTKGKKTITAIVEETLPHDRPGDFNQA 185 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MD G+ +C P+C CP+ N C A + P + K + E + +L ++ Sbjct: 186 LMDFGSAVCIPKTPRCGECPIVNMCAAYQHEDTDKLPVRIKKTKVVEVPLFVGILNYKGY 245 Query: 244 VLLAQRPPSGLWGGLYCFPQFA-------DEESLRQWLAQRQIAADNLTQL-TAFRHTFS 295 LL +RP GL ++ FP E L + + T L H FS Sbjct: 246 YLLHKRPNRGLLRSMWEFPSVEMVSAFEDGERGLEELVQPLGFELSLQTVLVKEITHIFS 305 Query: 296 HFHLDIVPMWLPVSSFT----------GCMDEGNALWYNLAQPPSVGLAAPVERL 340 H + ++ + + + A P +L Sbjct: 306 HRKWFMKAFRGDLTYTGNANNISIADIQQQLPKDWMLIKRDEFSDYAWAGPHGKL 360 >UniRef50_Q03V36 A/G-specific DNA-adenine glycosylase n=2 Tax=Leuconostoc mesenteroides RepID=Q03V36_LEUMM Length = 342 Score = 369 bits (947), Expect = e-100, Method: Composition-based stats. Identities = 119/343 (34%), Positives = 174/343 (50%), Gaps = 13/343 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +F +L+WYDK GR LPW+++ PY+V +SE+MLQQT+V TVIPYF+RFM PT Sbjct: 8 TIKEFRKTLLNWYDKEGRAHLPWRVNHDPYRVLVSEIMLQQTRVDTVIPYFDRFMTSLPT 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA AP D VL LW GLGYY+RARNL KAAQ V G++PE+ +++ +LPGVG T Sbjct: 68 VGDLATAPEDVVLKLWEGLGYYSRARNLQKAAQFVVNELHGQWPESSDDLQSLPGVGPYT 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S+S + P +DGN RV +R + + + + + FN Sbjct: 128 AAAIASISFNEVVPAVDGNQYRVFSRLLKIDADIADTKSRKIFYDIIAPIVDPERPGDFN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ T P P++ A + YP K KQ ++ + + + Sbjct: 188 QAIMDLGSSYMTTKNPDSLYSPVREFNAAFRDGVEDQYPVKTTKQRPVKQLFMASVFEKD 247 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 ++L +RP SGL G + FP E Q I H F+H +I Sbjct: 248 GKLLFEKRPDSGLLAGFWTFPLTQIESIESITGQQLNIK--------PVTHIFTHRRWEI 299 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + M N +++ + ++ L +LL++L Sbjct: 300 --WLVKQET---AMLNDNQQYFSSDEWSALSLPKVQHKLLEKL 337 >UniRef50_UPI0000E87CC6 A/G-specific adenine glycosylase n=1 Tax=Methylophilales bacterium HTCC2181 RepID=UPI0000E87CC6 Length = 343 Score = 368 bits (945), Expect = e-100, Method: Composition-based stats. Identities = 128/340 (37%), Positives = 204/340 (60%), Gaps = 1/340 (0%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +V+ W+ GR LPWQ PY +W+SE+MLQQTQV++VIPY++RF+ FPTV L Sbjct: 4 IAEKVITWHKSSGRHDLPWQKINDPYLIWISEIMLQQTQVSSVIPYYQRFIKTFPTVEKL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A D V+ W+GLGYY RA+ + + A+ + + KFP++ E++ +LPG+G+STAGAI Sbjct: 64 AFADHDVVMKHWSGLGYYRRAKFIMQTAKIIVQQYQSKFPDSVEKLLSLPGIGKSTAGAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 + + G PI+D NVKRV R Y + WPGK + + LWSL+EQ P+ ++ + QA+M Sbjct: 124 CAFAFGGIEPIMDANVKRVFCRFYGIMEWPGKAQTQKYLWSLAEQNLPSNNIQIYTQALM 183 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 DLGA +C S+P CS CPLQ C++ +N + P KKPK+TL + L+++ +++L Sbjct: 184 DLGATLCKGSQPVCSQCPLQLKCVSFKSNLCHVIPAKKPKKTLITKQVNVLIVEANNQIL 243 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQ-RQIAADNLTQLTAFRHTFSHFHLDIVPM 304 +R S +W GL+ P+ D ++W+ + N+ + A FSH+ L + Sbjct: 244 FQKRSDSSVWEGLWSTPEIQDFSYTKKWIKDVLGLTKFNVVKQGAHLAIFSHYKLQLNYN 303 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + ++ + LW + + G+ P+++LLQ + Sbjct: 304 HILLTETFSHQQLKDFLWIDRPKIGGEGIPTPIKKLLQNI 343 >UniRef50_A8R947 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8R947_9FIRM Length = 348 Score = 367 bits (944), Expect = e-100, Method: Composition-based stats. Identities = 116/340 (34%), Positives = 171/340 (50%), Gaps = 9/340 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + Q+LDWY + R LPW+ TPY+VW+SE+MLQQT+V V PYFERF+A P Sbjct: 6 DKEQLTKQLLDWYKQNAR-VLPWREKATPYRVWVSEIMLQQTRVEAVKPYFERFIATIPD 64 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LAN + + LW GLGYY R N+ K AQ HGGK P T+ E+ LPG+G T Sbjct: 65 IPTLANTEDEVLAKLWEGLGYYRRVYNMKKCAQVCVEKHGGKLPNTYAELLELPGIGAYT 124 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ G+ +DGNV RV +R + K+ V+ + + + P FN Sbjct: 125 AGAIASIAFGECVAAVDGNVLRVFSRVLVLEEDILKESVKRQYAKIVQMYIPKHESSAFN 184 Query: 182 QAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+M+LGA IC + P+C++CP+ + C P K K+T +L++ Sbjct: 185 QALMELGATICVPNAAPRCNICPIADNCKGYQCGKAHYLPNKAAKKTRRIEKKTICVLRY 244 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA-ADNLTQLTAFRHTFSHFHL 299 +D+ + QR GL GLY F +S + ++ N+ +L +H FSH Sbjct: 245 KDKFHVRQRASQGLLAGLYEFDWLEGYQSKKAVSEYYKMYVVKNIRKLPDAKHVFSHIEW 304 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQP-PSVGLAAPVE 338 + + V M +W L + S + + Sbjct: 305 HMKGYLVEVE-----MPCEEGVWCTLEELKASYAIPTAYK 339 >UniRef50_C5EQF4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C5EQF4_9FIRM Length = 391 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 121/358 (33%), Positives = 177/358 (49%), Gaps = 22/358 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +L WY R +LPW+ D PY+VW+SE+MLQQT+V V PYFERFM FP Sbjct: 28 RLKAMEKPLLAWYGSRAR-SLPWRDDPKPYRVWISEIMLQQTRVEAVKPYFERFMKAFPE 86 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A D ++ +W GLGYY RARNL AA+ V +GG P +FEE+ LPG+G T Sbjct: 87 VRDLAGAEDDYLMKMWEGLGYYNRARNLKAAARMVMEEYGGCLPASFEELIKLPGIGSYT 146 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ G P +DGNV RV++R G K V+ + + V P+ R+N Sbjct: 147 AGAIASIAFGIPMPAVDGNVLRVISRVLGDRGDIRKASVKAGMEQELKAVMPSGDASRYN 206 Query: 182 QAMMDLGAMICTRSK-PKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 Q ++++GA++C P+C CPL + C+ N W P K P + ++++ Sbjct: 207 QGLIEIGALVCIPGGEPRCGECPLASVCLTRRNGWWKEIPYKSPNKARKIEERTVFIIEY 266 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQF---ADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 +D+V + +RP +GL GLY FP D + + L +T L A +H FSH Sbjct: 267 QDKVAIHKRPMNGLLAGLYEFPNACGHPDADEIPAVLGMNPGCIKAVTPLPAAKHVFSHV 326 Query: 298 HLDIVPMWLPVSSF----------------TGCMDEGNALWYNLAQPP-SVGLAAPVE 338 + + V G+ + + Q S + + Sbjct: 327 EWHMTGYRVVVDEAAADPVFPGGREEGKGVADEAAAGDWFFVSREQLASSYAIPNAFQ 384 >UniRef50_B9E7Y8 A/G-specific adenine glycosylase homolog n=4 Tax=Staphylococcaceae RepID=B9E7Y8_MACCJ Length = 344 Score = 367 bits (943), Expect = e-100, Method: Composition-based stats. Identities = 109/345 (31%), Positives = 164/345 (47%), Gaps = 8/345 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + QFS +LDW+ K R+ +PW+ K PYK+WLSEVMLQQTQV TV PY+ +F RFP Sbjct: 2 LNKQQFSQHLLDWFYKNKRE-MPWRETKDPYKIWLSEVMLQQTQVNTVKPYYLKFTERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA+A +DEV W GLGYY+R RN H A ++V + G P E+ L GVG Sbjct: 61 DIRTLASAEIDEVTKYWEGLGYYSRVRNFHSAVKEVQESYNGVVPNNPEDFLKLKGVGPY 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GA++S++ P +DGNV RV +R V P F Sbjct: 121 TQGAVMSIAFNHQIPAVDGNVYRVFSRLDNDDFDISSSSARRHFEDKVMDVIPKA-AGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 N+A+M+LGA +CT P C CP+Q C + + L P K K L++ H Sbjct: 180 NEALMELGATVCTPKSPLCMFCPVQQHCESYEAGTVQLRPVKLKKIKKKTEHWKVLVIMH 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA-DNLTQLTAFRHTFSHFHL 299 +D++ + QRP +GL ++ FP F + Q ++ + +H F+H Sbjct: 240 DDKIYIEQRPSTGLLQSMWQFPMFNIDTHQDTIEEQLNMSLYVEADPFMSLKHQFTHVTW 299 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 D+ A + ++ + + ++ ++ Sbjct: 300 DMEVYRAVALGRVD-----KARFIDIEEKERFNFPVSMTKIFKKY 339 >UniRef50_D1W5W3 A/G-specific adenine glycosylase n=1 Tax=Prevotella buccalis ATCC 35310 RepID=D1W5W3_9BACT Length = 369 Score = 367 bits (942), Expect = e-100, Method: Composition-based stats. Identities = 122/345 (35%), Positives = 181/345 (52%), Gaps = 12/345 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M +S F+ +L W+ + GR ++PW+ PY +W+SEV+LQQT++ Y+ERFMARFP Sbjct: 33 MGSSPFTLALLRWFSENGR-SMPWRETTDPYAIWISEVILQQTRIQQGWAYWERFMARFP 91 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 V DLA A DEVL LW GLGYY+RARNLH AA+QV L G FP T E + AL GVG Sbjct: 92 KVEDLAAASEDEVLRLWQGLGYYSRARNLHHAAKQVVEL--GHFPNTMEGLKALKGVGDY 149 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI S++ G ++DGNV RVLAR Y + E + + +L++ + PA + Sbjct: 150 TAAAIGSIAFGLPVAVVDGNVYRVLARHYGIYTPINTTEGKKEFAALAQSLLPATEPSAY 209 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MD GA+ CT + P+C +CPL + C+A P K + R ++ ++ Sbjct: 210 NQAIMDFGAIQCTPTSPRCLICPLTDSCMALRTGKVDELPIKLKTLKIKTRKLVYVYVRW 269 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + + + +R +W GL+ P + E L + Q + +H +H L Sbjct: 270 QGKTAIHRRAAGDIWQGLWE-PLLYENEELPDFPGQLLLLKKG------VKHVLTHRILQ 322 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 L + + +W ++ L +E LL+ L Sbjct: 323 ADFYLLKANEKPAL--PDDYIWVEESELDQYALPRLIELLLESLP 365 >UniRef50_A6NQR4 Putative uncharacterized protein n=1 Tax=Bacteroides capillosus ATCC 29799 RepID=A6NQR4_9BACE Length = 355 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 115/337 (34%), Positives = 160/337 (47%), Gaps = 8/337 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q +L WY + R LPW+ D TPY VW+SE+MLQQT+VA V+ Y+ RFM P V Sbjct: 2 EQLPIPLLAWYHENAR-VLPWRSDPTPYHVWVSEIMLQQTRVAAVMGYYSRFMEALPDVA 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA D ++ LW GLGYY+RARNL KAA QV +GG P ++EE+ L GVG TAG Sbjct: 61 ALAAVEDDTLMKLWQGLGYYSRARNLKKAAGQVMERYGGAIPASYEELLTLAGVGEYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ G P +DGNV RV+AR G + ++ +++ P FNQA Sbjct: 121 AISSIAFGIPVPAVDGNVLRVVARIAGDEGDITLPATKKRMGQALQEIIPTAMPGAFNQA 180 Query: 184 MMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 MM+LGA +C + P C CP C A + P K PK+ L+ HE Sbjct: 181 MMELGATVCLPNGAPLCDRCPAAGFCAALIQDKINSLPVKAPKKARRVEERTVYLIFHEG 240 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 V L +RP GL GL+ +P + L I + +H F+H + Sbjct: 241 RVALRRRPDKGLLAGLWEYPN--ELAEKTGILESWGIVPASQRWGGTGKHIFTHIEWHMT 298 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVE 338 + + +W + + + Sbjct: 299 ARMVQAETEE---LPEGWVWADGPALRQIYAVPNAFQ 332 >UniRef50_B8JJK1 Novel protein similar to H.sapiens MUTYH, mutY homolog (E. coli) (MUTYH) n=2 Tax=Clupeocephala RepID=B8JJK1_DANRE Length = 526 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 126/420 (30%), Positives = 189/420 (45%), Gaps = 74/420 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTP--------YKVWLSEVMLQQTQVATVIPYFE 53 + S F + ++ WYD+ R+ LPW+ T Y VW+SE+MLQQTQVATVI Y+ Sbjct: 55 EISVFRSDLMKWYDENKRE-LPWRTLATTEQDDNIRTYAVWVSEIMLQQTQVATVIDYYN 113 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 R+M R+PTV LA A L+EV +W+GLGYY+R R LH+ AQ+V + G+ P+T + Sbjct: 114 RWMKRWPTVEKLAAATLEEVNQMWSGLGYYSRGRRLHEGAQKVVSELDGQMPKTTAGLLK 173 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 LPGVGR TAGAI S++LG+ +DGNV RVL R A+ V + LW +++ + Sbjct: 174 QLPGVGRYTAGAIGSIALGQVTGAVDGNVIRVLCRVRAIGADSSSPAVTDALWRIADALV 233 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS----------------- 215 FNQAMM+LGA +CT P CS CP+Q C A S Sbjct: 234 DPERPGDFNQAMMELGARVCTPKSPVCSQCPIQTHCHAFKKISMKQEMDCKRLLNKLATN 293 Query: 216 ------------------------------WALYPGKKPKQTLPERTGYFLLLQH----- 240 YP K K+ +++H Sbjct: 294 PKNPVPDIENCMSAGSCNLCLSEDWDPQLGVQNYPRKPVKKAPRVEQTLTCIVEHQRAGE 353 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL------------AQRQIAADNLTQLT 288 E E LL QRP GL G++ P + + + + + ++ + Sbjct: 354 ESEYLLTQRPSKGLLAGMWELPSVLLQADISENKYKELICDMMQKWLETPLDTHSVQFVG 413 Query: 289 AFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 H FSH H + + VS + + W + ++ V+++++ + + Sbjct: 414 EVVHIFSHIHQTYIVFSVHVSDCSDREQKQKTCWLTKSALQKAAVSTGVKKIMKLYESSS 473 >UniRef50_C9LNH1 A/G-specific adenine glycosylase n=1 Tax=Dialister invisus DSM 15470 RepID=C9LNH1_9FIRM Length = 351 Score = 366 bits (940), Expect = e-100, Method: Composition-based stats. Identities = 120/340 (35%), Positives = 180/340 (52%), Gaps = 9/340 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK--TPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 ++ ++L W+D+ R+ LPW+ + PY VW+SE+MLQQT+ V PYFE + RFPT Sbjct: 8 QEWPHKLLAWFDQN-RRDLPWREGRPRNPYYVWVSEIMLQQTRTEAVKPYFESWKRRFPT 66 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA A +VLH W GLGYY+RARNLHKAA+++A +GG PE ++V ALPG+G T Sbjct: 67 IEALAEAKEADVLHAWQGLGYYSRARNLHKAAREIAEKYGGAIPEDKKDVRALPGIGEYT 126 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAILS++ GKH +DGNV RV AR Y + K ++ +L E+ PA FN Sbjct: 127 AGAILSMAYGKHEAAVDGNVLRVYARLYGIESDILKSAGRKEITTLVEKTLPA-RAGDFN 185 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 +A+MDLG+ +C PKC CPL C A + P + PK+ E ++ E Sbjct: 186 EALMDLGSEVCVPKHPKCEKCPLHGECAALRLGLENVLPIRTPKKKQKESIVCCAVMIKE 245 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQW-LAQRQIAADNLTQLTAFRHTFSHFHLD 300 + L RP G+ ++ P + + ++ + ++ + H F+H Sbjct: 246 GKALFHLRPSKGMLSSMWELPMVLSDSEKDGVKMLEKLLGGKAGERIWTYTHVFTHRIWH 305 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + + EG+ WY+ + + LA P +L Sbjct: 306 MKAYLFHGAKEP----EGDYRWYSCEEYREIPLAGPHAKL 341 >UniRef50_C3X6B5 A/G-specific adenine glycosylase n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X6B5_OXAFO Length = 373 Score = 365 bits (938), Expect = 1e-99, Method: Composition-based stats. Identities = 142/348 (40%), Positives = 205/348 (58%), Gaps = 9/348 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + F++ ++ W KYGR +LPWQ + Y++WLSE+MLQQTQV+TVIPY+ RF+ FP V Sbjct: 14 ASFASILVAWQKKYGRHSLPWQNTRDAYRIWLSEIMLQQTQVSTVIPYYLRFLEFFPDVY 73 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A DEV+ W+GLGYY+RARNLHK A+ V +GG FP E + LPG+G+STA Sbjct: 74 SLAAASNDEVMKYWSGLGYYSRARNLHKCARIVVNEYGGIFPSDPESLEKLPGIGKSTAA 133 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI + S G ILDGNV RV +R + + +K +N+ W L+ ++ P +E + Q Sbjct: 134 AIAAFSAGAKAAILDGNVVRVFSRVFGIRDSITEKAGKNRFWELAYELLPETDIEAYTQG 193 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLGA IC RSKP C CP + C+A + N + P +K K+ P + L+L+ + Sbjct: 194 LMDLGATICVRSKPDCVKCPFSHCCVALSENRISELPVRKIKKASPLKKIIMLVLESRGQ 253 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI---------AADNLTQLTAFRHTF 294 +LL +RP SG+WGGLY P++ + +LR+ + A ++T L F H F Sbjct: 254 ILLEKRPESGIWGGLYSLPEWETDVNLRKHKDEMASLKTHVSSFGEAVSVTFLEPFVHVF 313 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 SHF L + P + +S + +WY++ L APV +LL Sbjct: 314 SHFRLQVTPCVVTLSRRNRAIAPDAYVWYDIMSVNEAPLPAPVRKLLN 361 >UniRef50_B5JTP2 A/G-specific adenine glycosylase n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JTP2_9GAMM Length = 354 Score = 365 bits (937), Expect = 1e-99, Method: Composition-based stats. Identities = 175/353 (49%), Positives = 226/353 (64%), Gaps = 7/353 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS ++L WY GR LPWQ DKTPY+VW+SE+MLQQTQVATVIPY++RFMA FP Sbjct: 1 MSERSFSERLLAWYRVSGRHDLPWQQDKTPYRVWVSEIMLQQTQVATVIPYYQRFMASFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ DLA+A D+VL W+GLGYYAR RNLHKAA+Q G P+ E + ALPG+G S Sbjct: 61 TLEDLASATQDQVLQHWSGLGYYARGRNLHKAARQAHEAW-GDLPDDVEALEALPGIGHS 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVER 179 TAGAI S++ G+ ILDGNVKRVLAR AVSGWPG+ V +LW ++ TP+ Sbjct: 120 TAGAISSIAFGRRALILDGNVKRVLARHRAVSGWPGRTAVSRELWQWADLYTPSAVQAGE 179 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER-TGYFLLL 238 + QA+MDLGA +CTR++P+C CP+ C+ + +PGKKPK+ P + +L+ Sbjct: 180 YAQAIMDLGATLCTRTQPRCGECPVAEDCLGRGSGEPTAFPGKKPKKDKPVKSVRMWLVT 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + EVLL QRP GLWGGL+ P+ +E L WL++R L FRH+FSHFH Sbjct: 240 NKKGEVLLQQRPAQGLWGGLWVLPEQRVDEPLDAWLSRRGWLGAEAVPLPKFRHSFSHFH 299 Query: 299 LDIVPMWLP---VSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 LD+ P+ + + T C D WYN P G+AAPV RLL+ L T + Sbjct: 300 LDVEPLLVQDIDSARVTECADHSPEQWYNPRSPQG-GIAAPVARLLKSLPTDS 351 >UniRef50_A5IU34 A/G-specific DNA-adenine glycosylase n=56 Tax=Staphylococcus RepID=A5IU34_STAA9 Length = 345 Score = 365 bits (937), Expect = 2e-99, Method: Composition-based stats. Identities = 111/347 (31%), Positives = 167/347 (48%), Gaps = 10/347 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F ++ W+D+ R+ +PW+ PY +WLSEVMLQQTQV TVI Y+ RF+ RFPTV Sbjct: 5 SSFKENLIHWFDENQRE-MPWRQTTNPYYIWLSEVMLQQTQVKTVIDYYHRFVERFPTVE 63 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L+ A DEVL W GLGYY+RARN H A ++V + G P+ ++ AL GVG T Sbjct: 64 VLSQASEDEVLKYWEGLGYYSRARNFHTAIKEVYDKYEGLVPKDPDQFKALKGVGPYTQA 123 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVERFN 181 A++S++ +DGNV RV +R + +++ P FN Sbjct: 124 AVMSIAYNVPLATVDGNVFRVWSRLNDDYRDIKLQSTRKSYE---QELLPYVTTEAGTFN 180 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK-QTLPERTGYFLLLQH 240 QAMM+LGA+ICT P C CP+Q C A ++ P K FL+ + Sbjct: 181 QAMMELGALICTPKNPLCLFCPVQENCEAFDKGTFEKLPVKSKNVSKKVIEQSVFLIRNN 240 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLT-QLTAFRHTFSHFHL 299 + + LL +R L G++ FP F E + R+ + + + +H F+H Sbjct: 241 QGQYLLQKRSEK-LLHGMWQFPMFESEHARRKMTEKIGHDIQPVETPIFELKHQFTHLTW 299 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 I + + +W+NL+ P+ ++ Q + Sbjct: 300 KIKVYAASGA-INIETLPDDMIWFNLSDRDQYTFPVPMSKIYQFING 345 >UniRef50_Q89A45 A/G-specific adenine glycosylase n=6 Tax=Buchnera aphidicola RepID=MUTY_BUCBP Length = 351 Score = 364 bits (934), Expect = 4e-99, Method: Composition-based stats. Identities = 145/346 (41%), Positives = 217/346 (62%), Gaps = 3/346 (0%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M F +L+WY +GRKTLPWQI K PYK W+SE+MLQQTQV TVIPY+ +F+ RFP Sbjct: 1 MTTLVFYQTILNWYHHFGRKTLPWQIKKNPYKTWISEIMLQQTQVKTVIPYYCKFIKRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + L+++PLD +L+LW+GLGYY RARN++K A+ + G FP ++ E+ LPG+G+S Sbjct: 61 NIDTLSDSPLDSILNLWSGLGYYTRARNIYKTAKILKQKFNGIFPNSYAEIIKLPGIGKS 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAILS + ILDGN+KRVL R Y++ K +E LW E +TP +F Sbjct: 121 TAGAILSFGFNLYSCILDGNIKRVLIRYYSI--NINNKYIEKLLWKTIESITPIYHTNKF 178 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK-PKQTLPERTGYFLLLQ 239 NQA++D+GA+IC +S PKC++CPL++ C + NN KK K +P+ +FL+LQ Sbjct: 179 NQALIDIGALICLKSNPKCNICPLKSTCKSYLNNKLFQINCKKNKKHIIPKTKYWFLILQ 238 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +++ + L +R G+W L+CFPQF + + W+ + + L F+H SH L Sbjct: 239 YKNYIFLEKRQNLGIWKKLFCFPQFIRQNDILSWIQKNNTKIKKINILNEFKHKLSHLTL 298 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 I P+W+ ++ + + +WYNL P +GL PV +++ +++ Sbjct: 299 YINPIWIIINKISIFSNNNKTIWYNLNNPQCIGLPTPVTKIITKIK 344 >UniRef50_C8US26 A/G-specific DNA glycosylase n=18 Tax=Lactobacillales RepID=C8US26_LACRG Length = 372 Score = 364 bits (934), Expect = 4e-99, Method: Composition-based stats. Identities = 120/362 (33%), Positives = 174/362 (48%), Gaps = 18/362 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F +LDWYD + R TLPW+ D PY V +SE+MLQQTQV TVIPY+ RFMA+FPT Sbjct: 14 KVAAFQHALLDWYDHHAR-TLPWRQDHDPYHVMVSELMLQQTQVQTVIPYYNRFMAQFPT 72 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA AP VL W GLGYY+RAR L +AA+Q+ +GGK+P+T E+ L G+G T Sbjct: 73 VEALAAAPEATVLKAWEGLGYYSRARRLQQAAKQIVNDYGGKWPQTAAELQTLAGIGPYT 132 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S+S G+ P +DGN RV AR + V + + L + P FN Sbjct: 133 AGAIASISFGEVVPAIDGNAFRVFARLFKVDADIARPQTRKVFDDLIRPLMPKERPGDFN 192 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ + S P + P++ + + YP K K + L+++ + Sbjct: 193 QAVMDLGSSYMSASHPDPAHSPVRAFDASFRDGVVQDYPVKTKKPRPVIHRYFALVIRSK 252 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL-------------- 287 LL QRP G+ L+ FP L + Q+ + Sbjct: 253 AGYLLEQRPGKGMLADLWMFPLIDM-AELEATMESEQLDEISARFADLSGMTLTFADLGR 311 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 +HTF+H + + ++ W + + L ++L + L Sbjct: 312 PQVQHTFTHQRWQLTLIGADAAAAELKFMPA--RWVPVEDFGKMALPTVQKKLNRALGLT 369 Query: 348 AP 349 P Sbjct: 370 EP 371 >UniRef50_B8DFN3 A/G-specific adenine glycosylase n=14 Tax=Listeria RepID=B8DFN3_LISMH Length = 362 Score = 364 bits (934), Expect = 4e-99, Method: Composition-based stats. Identities = 111/353 (31%), Positives = 173/353 (49%), Gaps = 12/353 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F ++ WY+ R LPW+ + PY++W+SE+MLQQT+V TVIPYF RFM +FPT Sbjct: 10 KITAFQEALVSWYEANKR-VLPWRENTEPYRIWVSEIMLQQTKVDTVIPYFNRFMTQFPT 68 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + D A ++L W GLGYY+R RNL A +QV G+ P + +L GVG T Sbjct: 69 MEDFVQADEADILKAWEGLGYYSRVRNLQTAMKQVMADFSGEVPTDLTTILSLKGVGPYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAILS++ + P +DGNV RV+AR +S K + Q+ FN Sbjct: 129 AGAILSIAYNQAEPAVDGNVMRVIARVLEISEDIMKASTRKIFEEVLYQLIDKENPAAFN 188 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY-FLLLQH 240 Q +M++GA++CT +KP C LCPLQ C A N YP K K + +++ Sbjct: 189 QGLMEIGALVCTPTKPMCMLCPLQPFCDAHKNGVETNYPVKIKKVKTKTKELLSIIVISE 248 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-------TQLTAFRHT 293 + +V + +RP +GL ++ FP + +A+ Q + + +H Sbjct: 249 DGKVAIEKRPENGLLANMWQFPTIEISKKENDEVAKLQFLHNYGLDVLLEDEPIAHIKHV 308 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 FSH + + V+ N + + + P +++ Q + Sbjct: 309 FSHLVWKMD---IRVAKLQSANPNENWCFETEEEMKRLAFPVPYQKMWQAWKD 358 >UniRef50_A5EV94 A-G-specific adenine glycosylase n=1 Tax=Dichelobacter nodosus VCS1703A RepID=A5EV94_DICNV Length = 347 Score = 363 bits (933), Expect = 4e-99, Method: Composition-based stats. Identities = 143/346 (41%), Positives = 199/346 (57%), Gaps = 12/346 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M AS FSA+++ W +GR LPWQ + PY VWLSE+MLQQTQV TVIPYF+ F+ARFP Sbjct: 1 MNAS-FSARLISWQRTFGRHELPWQ-NCNPYHVWLSEIMLQQTQVVTVIPYFQCFIARFP 58 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA AP +EVL LW GLGYYARARNLH AAQ V G+ P T ++ L GVGRS Sbjct: 59 DIDTLAAAPEEEVLFLWRGLGYYARARNLHAAAQYVVHQLNGQLPNTRAQLEQLKGVGRS 118 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVE 178 TA AI + GK I DGNV+RVL R + + + + + +LW+L+E + P A + Sbjct: 119 TAAAICVFAFGKKEAICDGNVRRVLTRHHGILDFIEAPKTQQQLWTLAEALLPDAADDLR 178 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLL 237 + Q +MDLG++ICTR++PKC+ CP++ C A N P +KPK+ P R + L Sbjct: 179 SYTQGLMDLGSLICTRARPKCADCPVKTDCYALKNQQQMQLPRRKPKKAPPARNGYFVLC 238 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + E+ L +R G+WGGL+ P F E + W+A + + + + H F+HF Sbjct: 239 RNDDGEIYLQKRARRGIWGGLWSLPWFETEHEAKMWIAAQGFSWQSSSDSPVMMHRFTHF 298 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 HL + + T + + +Y + L AP+ LL Sbjct: 299 HLSMYLL-------TAQTTQKSTDFYAPDAVAQMALPAPLMTLLNA 337 >UniRef50_C1ZJ43 A/G-specific DNA-adenine glycosylase n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZJ43_PLALI Length = 381 Score = 363 bits (933), Expect = 5e-99, Method: Composition-based stats. Identities = 120/373 (32%), Positives = 187/373 (50%), Gaps = 31/373 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 MQ ++F Q+L WY K+GR LPW+ PY +W+SE+MLQQT V VIPYFERFMA+FP Sbjct: 1 MQKTKFQKQLLAWYAKHGR-PLPWRASHDPYSIWISEIMLQQTTVTAVIPYFERFMAKFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 +V LA+AP +EVL LW GLGYY+RARNLH++A+ + + G FP++ E++ LPG+GR Sbjct: 60 SVQALASAPEEEVLKLWEGLGYYSRARNLHQSARVLMERYQGVFPQSVEQLLELPGIGRY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S + PI++ N +R+ AR G + LW +E + + Sbjct: 120 TAGAISSFAFRLPAPIVEANTQRLYARILGYDGDLKNAAGQKALWGFAESIVSGKEPDLI 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LG+++C P C CP+Q C A A P + + + LL+++ Sbjct: 180 NQALMELGSLVCKPIDPLCDQCPVQQHCRAFQEARQAEIPRAQARPVITPLVDATLLIEY 239 Query: 241 EDEVLLAQRPPSGLWGGLYCFP---QFADEESLRQW-------------------LAQRQ 278 + E+ L QR W GL+ FP F E + ++ Q Sbjct: 240 QGELFLRQREKPERWAGLWDFPRYTLFDPENTSEEFQKEKDVSTSALALSLKARVQEQLA 299 Query: 279 IAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMD--------EGNALWYNLAQPPS 330 + +T+ + H + + + + +S + + W+ S Sbjct: 300 VHPGEVTEFSRLTHGVTRYRITLHAFGCDLSDGVASRQSKALYEQLKSHGGWFGCESLDS 359 Query: 331 VGLAAPVERLLQQ 343 + + +L++Q Sbjct: 360 LAVPVTTRKLVKQ 372 >UniRef50_C6D1C3 A/G-specific adenine glycosylase n=4 Tax=Bacillales RepID=C6D1C3_PAESJ Length = 398 Score = 363 bits (932), Expect = 6e-99, Method: Composition-based stats. Identities = 117/385 (30%), Positives = 173/385 (44%), Gaps = 47/385 (12%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++L WY + R LPW++++ PY+VW+SE+MLQQT+V TVIPY+ERFM +FPTV L Sbjct: 13 FSRELLTWYRRIKR-DLPWRMNQDPYRVWVSEIMLQQTRVDTVIPYYERFMNKFPTVRAL 71 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP EVL W GLGYY+RARNL A++V +GG P+ VA L GVG T GAI Sbjct: 72 AEAPEPEVLKCWEGLGYYSRARNLQAGAREVVERYGGIVPDDKVAVAGLKGVGPYTTGAI 131 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S++ + P +DGNV RVL+R + + K + L+ + P FNQA+M Sbjct: 132 MSIAFNRPEPAVDGNVMRVLSRYFCLEDDIAKPATRVGIEKLAVSLIPEGAAGDFNQALM 191 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL----QHE 241 +LGA++CT P C CP+ C A P K + ++ +E Sbjct: 192 ELGALVCTPKSPSCLPCPVMEHCEARLAGRETELPIKTKAKPPRPEKRVAAIIAGTGVYE 251 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEES----------------------LRQWLAQRQI 279 ++L+ QRP +GL ++ P + R + + Sbjct: 252 GKILVRQRPETGLLAQMWELPHLLVPKEQEQIVDAAPETDGSTAAAGEWLTRSMEEETGL 311 Query: 280 AADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCM--------------------DEGN 319 H FSH H + + + G +G Sbjct: 312 LVRPERWFMDADHVFSHIHWKMRIYLADLGALAGAGEAAAGASLAAETAAAYEAGGKQGE 371 Query: 320 ALWYNLAQPPSVGLAAPVERLLQQL 344 W ++ R+L++ Sbjct: 372 YRWIGPEDMETLAFPNLFIRILREY 396 >UniRef50_C4FU23 Putative uncharacterized protein n=1 Tax=Catonella morbi ATCC 51271 RepID=C4FU23_9FIRM Length = 410 Score = 362 bits (931), Expect = 8e-99, Method: Composition-based stats. Identities = 119/364 (32%), Positives = 174/364 (47%), Gaps = 24/364 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + F +L WYDK R LPW+ + PY +W+SE+MLQQTQVATVIPY++RFMA PT Sbjct: 42 RVHAFRKDLLAWYDKNKR-DLPWRKSQDPYAIWVSEIMLQQTQVATVIPYYQRFMAALPT 100 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA AP + +L+LW GLGYY+R RN+ AAQQV GG+ P+ + + +L G+G T Sbjct: 101 VADLAQAPEETLLNLWQGLGYYSRVRNMQAAAQQVMADFGGRMPDQVDSLLSLKGIGPYT 160 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S++ G+ P LDGN+ R++AR + + + + + + FN Sbjct: 161 AAAIASMAFGRVAPALDGNLFRIVARLFRLKDDIALPKSRKVFMEILDILIDPDRPGDFN 220 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK-QTLPERTGYFLLLQH 240 QAMMDLGA + T S P+ PL + YP K K + + +L Sbjct: 221 QAMMDLGACVMTPSNPRPDNHPLAAYDASYQVGDSDQYPVKSKKVKQTRHDLTAYFVLDS 280 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADE------------ESLRQWLAQRQIAADN----- 283 + LL + + L GL+ FP E L WL + + + Sbjct: 281 QGNWLLRRHGEAELLTGLWHFPMLEQSMVYEEVTAAELTEPLLDWLREEALVEGDSPLSS 340 Query: 284 ---LTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 L +H FSH + + L + + + W + L+ E+L Sbjct: 341 QVVSPALGQVKHVFSHRIWQVQLVGLRLDTSGPLQE--GWCWVAPEDVGQLPLSTLQEKL 398 Query: 341 LQQL 344 +Q L Sbjct: 399 MQVL 402 >UniRef50_A1VKP4 A/G-specific DNA-adenine glycosylase n=6 Tax=Comamonadaceae RepID=A1VKP4_POLNA Length = 384 Score = 362 bits (931), Expect = 9e-99, Method: Composition-based stats. Identities = 147/346 (42%), Positives = 194/346 (56%), Gaps = 12/346 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +++ W +GR +LPWQ + PY+VWLSE+MLQQTQVATV+ Y++RF+ RFPTV+DL Sbjct: 27 FANEIVRWQQSHGRNSLPWQNTRDPYRVWLSEIMLQQTQVATVLAYYDRFVQRFPTVSDL 86 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A DEV+ LW GLGYY+RARNLH+ AQ V LH G+FP + E++ LPG+GRSTA AI Sbjct: 87 AAATQDEVMALWGGLGYYSRARNLHRCAQDVMALHAGQFPRSAEQLQTLPGIGRSTAAAI 146 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE----RFN 181 S + ILDGNVKRVL R S + E LW+ + + P + R+ Sbjct: 147 ASFCFAERVAILDGNVKRVLTRVLGFSDDLAQSANERALWNQATDLLPHDNLARAMPRYT 206 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH- 240 Q +MDLGA ICT +PKC LCP+Q C A YP K K + L Q Sbjct: 207 QGLMDLGATICTGRQPKCLLCPVQALCRGRAEGQPEKYPVKTRKLKRSTQALSLLWAQKP 266 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + V L +RP +G+W GLYC P F + LR + Q + L AF H +H L Sbjct: 267 DGSVWLEKRPATGIWSGLYCLPVFESDAELRALVPQ--SLEGRMEALPAFVHVLTHKDLC 324 Query: 301 IVPMWL-----PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + P T + A W+ A P +GL AP+ +LL Sbjct: 325 LSPWIAGFQADQAMPETMVSEGRPAAWFGPAAWPELGLPAPIRKLL 370 >UniRef50_Q11NU4 A/G-specific adenine glycosylase n=5 Tax=Bacteroidetes RepID=Q11NU4_CYTH3 Length = 355 Score = 362 bits (929), Expect = 2e-98, Method: Composition-based stats. Identities = 105/347 (30%), Positives = 175/347 (50%), Gaps = 7/347 (2%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +F +++ WY K R+ LPW+ + PY +WLSE++LQQT+V +PY+ F+ FPTV Sbjct: 7 KKKFPKELIKWYLKNKRE-LPWRDTRDPYPIWLSEIILQQTRVQQGMPYYFSFLKAFPTV 65 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 +A A ++L LW GLGYY+RARNLHK A QV + GG FP +++E+ L GVG TA Sbjct: 66 KHMAKASEKDILSLWQGLGYYSRARNLHKTALQVMSQFGGSFPGSYKELLDLKGVGPYTA 125 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI S + + ++DGNV RVL+R + + + + +L++Q+ P + +NQ Sbjct: 126 AAIASFAYKEQVAVVDGNVYRVLSRVFGIYEDITQNSSKKTFAALAQQLIPQKDPDIYNQ 185 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+ GA+ CT ++PKC C C+A + P K ++ ER + ++ + + Sbjct: 186 AIMEFGALHCTPAEPKCGDCCFAEICVAHLTGDQRVLPVKAKQKAKQERYMTYFMITNNN 245 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQL----TAFRHTFSHFH 298 VL+ +R + +W GL+ F ++ A + + + + ++H SH Sbjct: 246 TVLMKERGSNDIWQGLFEFLLVETPKAASLTEALKLVPDLDKKNISGESITYKHILSHQI 305 Query: 299 LDIVPMWLPVSSFTGCMDEG--NALWYNLAQPPSVGLAAPVERLLQQ 343 L V + N + P + + L+ Sbjct: 306 LHAKIYMAEVEKKVYNCIKKTYNLEEIAFNELPVTPKPVLITKYLKA 352 >UniRef50_Q8A7M6 A/G-specific adenine glycosylase n=2 Tax=Bacteroides RepID=Q8A7M6_BACTN Length = 357 Score = 361 bits (928), Expect = 2e-98, Method: Composition-based stats. Identities = 111/357 (31%), Positives = 169/357 (47%), Gaps = 19/357 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS +++WY +Y R LPW+ PY++W+SE++LQQT+VA YF RF+ RFP Sbjct: 10 MNI--FSEAIVEWYKEYKR-DLPWRESSDPYRIWISEIILQQTRVAQGYDYFLRFIKRFP 66 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 V LA+A DEV+ W GLGYY+RARNLH AA+ + G FPET+ EV AL GVG Sbjct: 67 DVKALADADEDEVMKYWQGLGYYSRARNLHAAAKSM----NGVFPETYPEVLALKGVGEY 122 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI S + G + ++DGNV RVL+R + + E + +L++++ + Sbjct: 123 TAAAICSFAYGMPYAVVDGNVYRVLSRYFGIDTPIDSTEGKKLFAALADEMLDKKQPALY 182 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQ +MD GA+ CT P C CPL + C A + P K+ K R ++ ++ Sbjct: 183 NQGIMDFGAIQCTPQSPDCLFCPLADSCSALSTGRVTQLPVKQHKTKTTNRYFNYIYVRA 242 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA----------F 290 + +R + +W L+ P +L + +L Sbjct: 243 GAHTYINKRTANDIWKNLFELPLIETPVALSEEEFLALPEFRSLIASGENPVIRLVCRGV 302 Query: 291 RHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 +H SH + V+ N L + ++ V+ L+Q + Sbjct: 303 KHVLSHRVIYANLY--EVALPEDTKSFANFQRIRLEELDKYAISKLVQTLIQVISNK 357 >UniRef50_Q5HN66 A/G-specific adenine glycosylase n=5 Tax=Staphylococcus epidermidis RepID=Q5HN66_STAEQ Length = 347 Score = 361 bits (927), Expect = 2e-98, Method: Composition-based stats. Identities = 104/345 (30%), Positives = 152/345 (44%), Gaps = 6/345 (1%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F + DW+ K R +PW+ PY +WLSEVMLQQTQV TVI Y+ RF+ RFPT+ Sbjct: 6 SFKKDIEDWFHKNQR-DMPWRETTNPYYIWLSEVMLQQTQVNTVIDYYYRFIHRFPTIQS 64 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 L+ A DEVL W GLGYY+RARN H A ++V + G+ P E L GVG T A Sbjct: 65 LSEANEDEVLKYWEGLGYYSRARNFHTAVKEVNNNYDGEVPYDPESFKKLKGVGPYTQAA 124 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 ++S++ +DGNV RV +R + FNQAM Sbjct: 125 VMSIAFNHPLATVDGNVFRVWSRLNNDYRDIKLQSTRKAFEQELHPYV-LKDAGTFNQAM 183 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK-QTLPERTGYFLLLQHEDE 243 M+LGA++CT P C CP+Q C A + P K FL+ + Sbjct: 184 MELGALVCTPKSPLCLFCPIQEHCEAFHMGTTQELPVKTKSLNKKTIEQKVFLIRNDNGQ 243 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNL-TQLTAFRHTFSHFHLDIV 302 LL +R L G++ FP + + + + + +H F+H +I Sbjct: 244 YLLEKRKEK-LLNGMWQFPMREQTNANDVISDDLGKSIETINEPVFKLKHQFTHLTWEIK 302 Query: 303 PMWLPVS-SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 + + W+NL P++++ + + Sbjct: 303 VYNVTAPLNIKENDLPKQMTWFNLDDREQYTFPVPMDKIYKFIEG 347 >UniRef50_C9LHH9 A/G-specific adenine glycosylase n=4 Tax=Bacteroidales RepID=C9LHH9_9BACT Length = 356 Score = 360 bits (925), Expect = 4e-98, Method: Composition-based stats. Identities = 114/349 (32%), Positives = 183/349 (52%), Gaps = 10/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +A F+ Q+ +WY R+ LPW+ K PY++W+SE++LQQT+V Y+ RF+ RFP+ Sbjct: 12 EAHYFAEQIEEWYLNNKRE-LPWRDIKDPYRIWISEIILQQTRVVQGYDYYLRFIDRFPS 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA A DEVL LW GLGYY+RARNL+ AA+ + G+FP ++++ L GVG T Sbjct: 71 VEDLAKADEDEVLKLWQGLGYYSRARNLYSAAKSIVER--GEFPTNYKDIRMLKGVGDYT 128 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S + + ++DGNV RVLAR + ++ E + +L++ + ++N Sbjct: 129 AAAIASFAYNLPYAVVDGNVYRVLARYWGITTPIDTTEGKKLFAALAQNLLDKKNPAQYN 188 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MD GA+ C + P C+ CPLQ CIA S + P K K + R +L ++++ Sbjct: 189 QALMDFGALQCVPNNPNCNECPLQANCIAFLETSVSSLPIKAKKTAIRTRFLVYLYIRYK 248 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEE--SLRQWLAQRQIAADNLTQLT--AFRHTFSHF 297 D +LL +R +W GLY E L + + + +L +H +H Sbjct: 249 DRILLHRRERGDIWQGLYEPLLIEFSEHAELPSIIQKANLPKQAAFRLAVKEIKHVLTHQ 308 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 L + + ++ FT + + + + VE+L + + Sbjct: 309 VLIVDCYEVDLTKFTTI---EKYSFIPEKERANYPASRLVEQLYEAIDN 354 >UniRef50_C6ACA0 A/G-specific adenine glycosylase MutY n=6 Tax=Rhizobiales RepID=C6ACA0_BARGA Length = 352 Score = 359 bits (923), Expect = 6e-98, Method: Composition-based stats. Identities = 121/348 (34%), Positives = 182/348 (52%), Gaps = 18/348 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERF 55 + S+ +L WYD+ R LPW+I PY+VWLSE+MLQQT V TV PYF++F Sbjct: 3 EISSHLLAWYDQNHRH-LPWRITPEKQKQGIRPDPYQVWLSEIMLQQTTVETVKPYFKKF 61 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 + +P ++ LA A D+++ W GLGYY+RARNL K AQQ+ + G+FP++ +E+ L Sbjct: 62 LKLWPDLSSLAKASQDDIMKAWAGLGYYSRARNLKKCAQQLVENYAGQFPQSVKELRTLA 121 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TA AI S++ ++DGNV+RV+AR +A++ K +VE + ++++T Sbjct: 122 GIGDYTAAAIASIAFNHPVAVVDGNVERVVARLFAITSILQKAKVE--IKEKTQKITALN 179 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 F QAMMDLGA +CT KP C CPLQ C AA +P K PK+ P +TG Sbjct: 180 RPGDFAQAMMDLGATVCTPRKPSCYTCPLQCLCKAAKTQQQESFPVKAPKKERPSKTGIA 239 Query: 236 LL-LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 + L + ++ L +R L GG+ P + Q N H F Sbjct: 240 FVALNKKRQIYLEKRQTQKLLGGMTQIPNNIGINNENGL--QNAPFTANWQFKGQITHVF 297 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 +HF L + + + M+ N W N+ L +++ + Sbjct: 298 THFSLKLDVYY---TEDIHEMNCENGWWCNIQHLAEEALPTVMKKAIS 342 >UniRef50_A8ZZH0 A/G-specific adenine glycosylase n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZZH0_DESOH Length = 360 Score = 359 bits (923), Expect = 7e-98, Method: Composition-based stats. Identities = 115/359 (32%), Positives = 167/359 (46%), Gaps = 13/359 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A F ++L WY + R LPW+ K PY +W+SEVMLQQTQVATV+ Y+ RF+ FP Sbjct: 4 FSAGPFQRRLLRWYTAHQR-DLPWRRSKNPYHIWVSEVMLQQTQVATVVDYYRRFLQAFP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA A L +VL LW GLGYYARA NLHKAA+Q+ + P T E LPGVG Sbjct: 63 DIGTLAVAELQDVLKLWEGLGYYARAANLHKAARQIVAGGKKRVPRTPETFGRLPGVGDY 122 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 A+ S++ G P++DGNVKRVLAR + + + + + F Sbjct: 123 INAAVSSIAFGHPLPVVDGNVKRVLARLFLLDEPVNRPSNHRVFLEKARLLLAFKDPGTF 182 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMM+LGA++C +P C CP+ + C A +P + + P L++ Sbjct: 183 NQAMMELGALVCKPGRPLCDQCPVASFCGAHQAGRVTDFPRRLAARKNPHHHLAVGLVKK 242 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHT 293 + L+ +RP +GL GL+ P E+ R L + +L H Sbjct: 243 GNRFLIVRRPATGLLAGLWEMPGGRVEKPENPADACCRAVLESVGLTVFPGPRLARVAHA 302 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNAL---WYNLAQPPSVGLAAPVERLLQQLRTGAP 349 ++HF + + V +G + W + + + L P Sbjct: 303 YTHFKITMDLFACDV--VSGRVKRNGYQAHHWIRMKDIGQYPFHRAMHKAFAALAGALP 359 >UniRef50_B0MYW3 Putative uncharacterized protein n=1 Tax=Alistipes putredinis DSM 17216 RepID=B0MYW3_9BACT Length = 358 Score = 359 bits (923), Expect = 7e-98, Method: Composition-based stats. Identities = 116/350 (33%), Positives = 183/350 (52%), Gaps = 13/350 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + ++++L+WY + GR LPW+ + PY++W+SEV+LQQT+VA + Y+ RF+ FP V Sbjct: 12 PEVASRLLEWYGREGR-DLPWRRTRDPYRIWISEVILQQTRVAQGMSYYHRFLELFPDVA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+AP D VL W GLGYY+RARNL AA+++ HGG FP + +V ALPGVG TA Sbjct: 71 ALASAPEDLVLKCWQGLGYYSRARNLLAAARRIVETHGGVFPTAYADVRALPGVGDYTAA 130 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ + LDGNV RVL+R Y + +L++ + +NQA Sbjct: 131 AICSIAYEEPCAALDGNVFRVLSRLYDLDTPIDTTSGRRTFAALADSLIDRQRPGLYNQA 190 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MD GA+ C ++P+C+ CPL++ C+A A + + P K+ + + R +L ++ DE Sbjct: 191 IMDFGALCCLPAQPRCTECPLRDRCLAFAARTVDVRPVKQGRTAVEPRYFNYLHVECGDE 250 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ-------LTAFR---HT 293 ++L +R +W GLY FP E + + L + + H Sbjct: 251 LVLRRRGAGDIWQGLYEFPLIETPEPVEYTVLAAMPEFCALFKDAGTVCLAGTVKMPVHQ 310 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 SH + V + V +T + L + ++ ER L + Sbjct: 311 LSHRAIHAVFYRIVVERWTPSL--REMLVISRETLGDYAVSRLTERYLSR 358 >UniRef50_C9MR28 A/G-specific adenine glycosylase n=4 Tax=Prevotella RepID=C9MR28_9BACT Length = 343 Score = 359 bits (923), Expect = 8e-98, Method: Composition-based stats. Identities = 123/340 (36%), Positives = 176/340 (51%), Gaps = 9/340 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F++ +L WYD R LPW+ K Y +WLSEV+LQQT++ + Y++ FM ++PTV DL Sbjct: 3 FASTLLLWYDNNER-PLPWRETKDAYAIWLSEVILQQTRIVQGMSYWQHFMRQWPTVYDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA DEVL W GLGYY+RARNLHKAAQQV L G FP T +E+ L GVG TA AI Sbjct: 62 ANASEDEVLKAWQGLGYYSRARNLHKAAQQVVELKG--FPHTAKELKKLKGVGEYTAAAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + + ++DGNV RVL+R + + + SLS+ + P R+NQA+M Sbjct: 120 ASFAFDEKIAVVDGNVYRVLSRYKGIDTPIDTTSGKKEFQSLSQTLLPTHDSARYNQAIM 179 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA+ CT + P C +CPL C+A P K K ER ++ +Q+ DE++ Sbjct: 180 DFGAIQCTPTSPHCDICPLCESCVAFREQRINQLPVKSKKIHQQERHLTYIYIQYNDEIV 239 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 L QR +W GL+ PQ + + + +H +H L + Sbjct: 240 LHQRGAGDIWQGLWELPQLEH----LSKEDGEEWKREAIEVCKDVKHVLTHQLLFANLYY 295 Query: 306 LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + + +W + L +E LL +LR Sbjct: 296 --WAPQQRPILSADFIWIPTEKIADYALPRLMEILLDRLR 333 >UniRef50_Q1MS05 A/G-specific DNA glycosylase n=1 Tax=Lawsonia intracellularis PHE/MN1-00 RepID=Q1MS05_LAWIP Length = 363 Score = 359 bits (921), Expect = 1e-97, Method: Composition-based stats. Identities = 108/355 (30%), Positives = 182/355 (51%), Gaps = 20/355 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L+W+ K R LPW+ PY VW++EVM+QQTQ+ + YF R++ +FP + Sbjct: 2 KEIQQALLNWFYKNKR-ALPWRETYLPYHVWIAEVMMQQTQMDRGVEYFLRWIKQFPDIA 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A AP +++L W GLGYY R R+L AAQ + + G FPE +E++ LPG+G TAG Sbjct: 61 SVAYAPEEKLLSAWEGLGYYRRVRHLQSAAQVIMHKYNGTFPERYEDILELPGIGPYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S + + FP +DGNV+RVL+R + ++ K+ ++KL+ L +Q+ P FNQ+ Sbjct: 121 AIASTAFNQDFPCIDGNVERVLSRIFDINTHIKKEPTKSKLYDLVKQLMPKKNARDFNQS 180 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++C KP C +CP+ + C + N + P K +LQ + Sbjct: 181 VMELGALVCK-KKPMCLICPVYSMCNSRINGTQLSRPVLAKKTPTTRLKMVTGILQCNQK 239 Query: 244 VLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + QR + +WG L+ FP + + +R W Q + +T H ++H Sbjct: 240 IFIQQRLDNNIWGRLWEFPNGCIETGETPESAIIRNWNEQLGFSIQIENIITTIIHNYTH 299 Query: 297 FHLDIVPM-----------WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 +H+ + V + + W + + ++ L +P +L Sbjct: 300 YHITLYCFDICFSQNITNTLSVVLPNPTRLSASSYRWVSQKELQTIPLPSPHRKL 354 >UniRef50_A3MA77 A/G specific adenine glycosylase n=18 Tax=Acinetobacter RepID=A3MA77_ACIBT Length = 355 Score = 359 bits (921), Expect = 1e-97, Method: Composition-based stats. Identities = 155/349 (44%), Positives = 202/349 (57%), Gaps = 6/349 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M FS +L+W+D++GR LPWQ+ PYKVW+SE+MLQQTQV TV+ YF+RFM RFP Sbjct: 1 MPEFSFSDALLNWFDQHGRHDLPWQVADDPYKVWVSEIMLQQTQVKTVLQYFDRFMERFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV L A DEV W GLGYYARARNLHKAA VA GKFPET EE ALPG+GRS Sbjct: 61 TVEALGYATWDEVAPYWAGLGYYARARNLHKAAGLVAQQ--GKFPETLEEWIALPGIGRS 118 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA++SL L ++ I+DGNVKRVLAR +A+ K + E ++W L+E++ P + Sbjct: 119 TAGALMSLGLRQYGVIMDGNVKRVLARFFAIEDDLSKPQHEREMWKLAEELCPTQRNHDY 178 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA ICT KP C CP+Q C A P KKPK+T P +T L++Q Sbjct: 179 TQAIMDLGATICTPKKPLCLYCPMQAHCQAYQQGLEQELPFKKPKKTPPVKTADVLIIQC 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 EDE QR GLWGGL+C P +E + Q ++ T H+F+HF Sbjct: 239 EDEWFWQQRQAHGLWGGLFCLPILENEHERLKLSQQFKLQPQPQT--FQISHSFTHFTWL 296 Query: 301 I--VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + E W + Q + G+ +++L+ R Sbjct: 297 LNAHVFHVEPDQKEHLAIELEGQWLSPEQAIAKGVPTAMKKLISTSRHD 345 >UniRef50_Q03P68 A/G-specific DNA-adenine glycosylase n=2 Tax=Lactobacillales RepID=Q03P68_LACBA Length = 379 Score = 358 bits (919), Expect = 2e-97, Method: Composition-based stats. Identities = 114/368 (30%), Positives = 172/368 (46%), Gaps = 22/368 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +F +L WYD+ GR LPW+ D+ PY VW+SE+MLQQTQV TVIPY+ FMA FPT Sbjct: 8 TIREFRQTLLAWYDREGR-DLPWRHDQDPYHVWVSEIMLQQTQVQTVIPYYLNFMAMFPT 66 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V DLA AP +++L W GLGYY+RARNL +AA+Q+ + GK+P+T E+ L G+G T Sbjct: 67 VADLAQAPEEQLLKAWQGLGYYSRARNLQRAARQLVDDYRGKWPQTAAELLDLTGIGPYT 126 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ G+ P +DGN RV +R + K + + + + FN Sbjct: 127 AGAIASIAFGEVVPAVDGNAFRVFSRLLLIDADIAKPQTRKLFEQVIQPIVDPQRPGAFN 186 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+ T + P++ + N A +P K Y L++ Sbjct: 187 QAIMDLGSSYMTAKNSDPAHSPVRAFDASYRENCVADFPVKTKAPRPRPVPYYGLIVTSP 246 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA------------ 289 L+ QR + + G + FP + Q ++ L Sbjct: 247 AGYLMVQRDATQMLTGYWTFPLIQQADLQDATDQQPATLQTDIAHLEQRFMTDYQLPITL 306 Query: 290 -------FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 HT++H + + +SS W + ++ L E+L+ Sbjct: 307 TPLSGQPVSHTYTHQRWQVKLLVAELSSAADLTFYP-GKWVAASTIKTLPLPKVQEKLMT 365 Query: 343 QLRT-GAP 349 + + G P Sbjct: 366 RYQNQGHP 373 >UniRef50_A4T064 A/G-specific adenine glycosylase n=2 Tax=Polynucleobacter necessarius RepID=A4T064_POLSQ Length = 381 Score = 357 bits (918), Expect = 2e-97, Method: Composition-based stats. Identities = 138/367 (37%), Positives = 191/367 (52%), Gaps = 26/367 (7%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 FS +++ W+ + GR LPWQ ++ PY VW+SE+MLQQTQVATV+ + RFM RFPTV Sbjct: 6 IDSFSKKLIAWHAQSGRSGLPWQGNRDPYAVWVSEIMLQQTQVATVLERYPRFMKRFPTV 65 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A +D+VL W GLGYY+RARNLH AQQ+ GKFP+ + L G+GRSTA Sbjct: 66 KKLAAADVDDVLAEWAGLGYYSRARNLHACAQQIVREFAGKFPQDPALLEQLKGIGRSTA 125 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP--AVGVERF 180 GAI + + + PILD NVKR+LAR +A+ G K V + LW L+ ++ P + + Sbjct: 126 GAIAAFAFHERAPILDANVKRILARLFAIEGAIQDKAVNDSLWKLATELLPLKPQDMPTY 185 Query: 181 NQAMMDLGAMICTRSKPKC----SLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 QA+MD GA CT KP C CP C A ++ P K K PE + L Sbjct: 186 TQALMDFGATWCTSRKPVCLSGEKKCPFAKDCQANLSDQVLALPKKVIKSKSPEFDCHML 245 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA------- 289 LL+ + VLL +RP +WGGL+ P+ +L + +++ L+ Sbjct: 246 LLRSGNSVLLQKRPDKAIWGGLWSLPESPWAPRDLSFLKEELSSSNLLSLTLPEEKSVLF 305 Query: 290 ------------FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 +H F+H L + + + T N W L Q GL P+ Sbjct: 306 LRNCTPPNRGFYIKHVFTHRCLWMQIWEVN-AMKTIPFTNPNLKWVPLKQLGRHGLPQPI 364 Query: 338 ERLLQQL 344 + LLQ L Sbjct: 365 KVLLQGL 371 >UniRef50_C8WS48 A/G-specific adenine glycosylase n=2 Tax=Alicyclobacillus acidocaldarius RepID=C8WS48_ALIAD Length = 382 Score = 357 bits (918), Expect = 3e-97, Method: Composition-based stats. Identities = 119/367 (32%), Positives = 179/367 (48%), Gaps = 25/367 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + F+ + WY + R LPW+ PY + +SE MLQQT+V TVIPY+ RFM RFPT Sbjct: 4 SLAAFAHTLEAWYTQTSR-DLPWRRTADPYAILVSETMLQQTRVETVIPYYHRFMERFPT 62 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA+A +D+VL +W GLGYY RARNL A + V HGG+ P+ +E+ ALPG+G T Sbjct: 63 PLHLADADIDDVLKMWEGLGYYRRARNLKAAMEVVRDRHGGRIPDHPDELKALPGIGPYT 122 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GA+LS++ + +P +DGNV RV++R A+ +V+ ++ + Sbjct: 123 LGAVLSIAFNRPYPAVDGNVLRVMSRYRAIEEPVDLPKVKRQIEQDVAETLERGTPRVLT 182 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+M+LGA++CT KP+CS CP+ +GC A A+ P + PK+ +T L L Sbjct: 183 QALMELGALVCTPKKPRCSACPVVSGCAARAHGLTDALPKRLPKRARRRQTVVALWLTRG 242 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQW-------------------LAQRQIAAD 282 D QRP GL GG++ P ++ A R+ A Sbjct: 243 DSFWAEQRPEGGLLGGMWQLPSVELDDPSTSLEWHARARFAELRFGHAQRGEAVREPEAM 302 Query: 283 NLTQLTAFRHTFSHFHLDIVPMW-LPVSSFTGCMDEGN----ALWYNLAQPPSVGLAAPV 337 ++ H F+H +V + G E + W + + Sbjct: 303 AFERVCEVSHAFTHLDWTVVVFAPVEYDQAGGQPLEIHPGHEGAWVGFEEISNFVWPKVY 362 Query: 338 ERLLQQL 344 + +L+QL Sbjct: 363 QDVLRQL 369 >UniRef50_A9KT85 A/G-specific adenine glycosylase n=3 Tax=Clostridiales RepID=A9KT85_CLOPH Length = 350 Score = 357 bits (917), Expect = 3e-97, Method: Composition-based stats. Identities = 106/340 (31%), Positives = 161/340 (47%), Gaps = 7/340 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WYD R L W+ + PY VW+SE+MLQQT+V V YF+RF+ PT+ Sbjct: 6 RETVEYLLHWYDLNAR-ILEWRSNPKPYYVWISEIMLQQTRVEAVKSYFDRFIKELPTIK 64 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 DLA D ++ LW GLGYY RARNL KAA V + G+ P EE+ LPG+G T+G Sbjct: 65 DLAAVEEDRLMKLWEGLGYYNRARNLKKAAIIVMEQYNGELPANREELKKLPGIGSYTSG 124 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S++ +DGNV RV+ R K++V+ +L E + P +NQ+ Sbjct: 125 AIGSIAFQLPVAAVDGNVLRVMKRIAGSFDDITKEKVKKELEEDIEAIIPKDRPGDYNQS 184 Query: 184 MMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M+LGA +C + KP C+ CP+ + C A N P K K+ L+++ Sbjct: 185 LMELGATVCLPNGKPLCNQCPVMHLCKAFHNGDELKIPVKPKKKERVIEEKTVLVMECAG 244 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + +R GL GL+ P + + + + A +L +H FSH Sbjct: 245 YYAIRKRTEKGLLHGLWELPNIEGKLSINQMEETIKTLAPYAKVERKLGEAKHIFSHKEW 304 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQP-PSVGLAAPVE 338 ++ ++ V + +W + L E Sbjct: 305 HMIGYYIKVERIED-VPIPELIWVDNQSIRKDYSLPTAFE 343 >UniRef50_B8J1Y9 A/G-specific adenine glycosylase n=12 Tax=Desulfovibrionales RepID=B8J1Y9_DESDA Length = 435 Score = 357 bits (917), Expect = 4e-97, Method: Composition-based stats. Identities = 108/361 (29%), Positives = 179/361 (49%), Gaps = 21/361 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +LDW+ + R LPW+ TPY+VW+SEVMLQQTQ+ + YF R+M RFP + Sbjct: 51 IPALQNALLDWFAVHQR-ALPWRNSYTPYEVWISEVMLQQTQMERGVRYFIRWMERFPDI 109 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A ++VL +W GLGYY+RAR++ AA+++ H G FP + ALPGVG TA Sbjct: 110 AALAAAHEEDVLRMWEGLGYYSRARHILAAARKIMAEHNGIFPRDPAAIRALPGVGPYTA 169 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 GAI S++ K P +D NV+RV++R + V ++ + + ++ P NQ Sbjct: 170 GAIASIAFEKKLPCVDANVERVVSRIFDVDSPVKQEPAAGVIHQWALRLVPEGKARAHNQ 229 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMM+LGA++C KP C+ CPL CI+ P + T+ +L+ + Sbjct: 230 AMMELGALVCR-KKPCCASCPLGAFCISHHLGIEDQRPVPGKRATVTPVNAVTGVLRRGE 288 Query: 243 EVLLAQRPPSGLWGGLYCFP-------QFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 ++ + +RP SG+WG L+ FP + ++ +R+++ + + + RH ++ Sbjct: 289 KIFVQKRPASGVWGNLWEFPGGRVEEGESPEQAVVREFMEETGFTVNVAARHGIIRHGYT 348 Query: 296 HFHLDIVPMWLPVS------------SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + L + L ++ + W + + + + A +L Sbjct: 349 TYRLTLHCFGLDLAAPGKDCTPETCPAPPVLTAATQYRWASPQELEDLAMPAAHRKLADS 408 Query: 344 L 344 L Sbjct: 409 L 409 >UniRef50_Q7NMA0 A/G-specific adenine glycosylase n=1 Tax=Gloeobacter violaceus RepID=Q7NMA0_GLOVI Length = 375 Score = 356 bits (915), Expect = 6e-97, Method: Composition-based stats. Identities = 135/356 (37%), Positives = 190/356 (53%), Gaps = 11/356 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + AQ+L+WY + GR LPW+ + PY +W+SE+MLQQTQV TV+PY++R++A PT Sbjct: 18 QVVRLRAQLLEWYGRMGR-DLPWRRTRDPYAIWISEIMLQQTQVKTVLPYYQRWLAALPT 76 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV-AALPGVGRS 120 V LA A L+ VL LW GLGYY RARNLHKAAQ + HGG FPET +++ ALPG+GRS Sbjct: 77 VAALAAAELEAVLKLWEGLGYYTRARNLHKAAQVIVKEHGGVFPETAQQLQQALPGIGRS 136 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S + G+ ILD N +RVL R +AV P E KLW +S+++ F Sbjct: 137 TAGAIASSAFGRCEAILDANARRVLGRLFAVGDPPA--RAEAKLWEISQRLVDPQAPHNF 194 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MDLGA +CT P C LCP Q C+ + +P + + E G + ++ Sbjct: 195 NQALMDLGATVCTARSPLCLLCPWQVDCLGRRSGDPTHFPVRPARAVRSEIAGVSVAIEC 254 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA---DEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + + LL +RP GL GL+ FP E ++L +L H F+H Sbjct: 255 QGKFLLVRRPERGLLAGLWEFPFVESVGGGEPEETVRVAFGNRLESLERLGQVEHEFTHR 314 Query: 298 HLDIVPMWLPVSSFTGCMDE----GNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 HL + + + + W + V ++ + L+ P Sbjct: 315 HLTAQVLRAQWIAAPAALPKVFDCREHTWQPPECWLKFPMPGYVHKICKLLKEALP 370 >UniRef50_A9EQ53 A/G-specific adenine glycosylase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EQ53_SORC5 Length = 396 Score = 356 bits (914), Expect = 8e-97, Method: Composition-based stats. Identities = 131/352 (37%), Positives = 187/352 (53%), Gaps = 14/352 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +A + W+ + R LPW+ + PY +WLSEVMLQQT+V TVIPY+ERF+AR+PTV Sbjct: 34 REIAAALEAWFGRVAR-DLPWRRTRDPYAIWLSEVMLQQTRVETVIPYYERFLARYPTVF 92 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A +D+VL LW+GLGYY RAR LH AA++V H G P + ALPGVG TAG Sbjct: 93 ALASAEIDDVLSLWSGLGYYRRARVLHLAAREVTARHDGALPRDVSALLALPGVGAYTAG 152 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT----PAVGVER 179 AI S++ + P++DGNV RVL+R + KLWS +E++ +V R Sbjct: 153 AIASIAYDQPVPLVDGNVARVLSRIEGIDDDIRSASGTRKLWSTAERLVRGSADSVHPGR 212 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 FNQA+M+LGA +CT P+C CP+ C+A A A P PK+ +P +++ Sbjct: 213 FNQALMELGATVCTPRNPRCDACPVDGACVARATGRQAELPVIAPKRDVPAVAMVAAVVR 272 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADE--ESLRQWLAQRQIAAD-NLTQLTAFRHTFSH 296 D VL +R GL+GGL+ P R L +A D L ++ RH +H Sbjct: 273 SGDRVLFVRRAEGGLFGGLWEPPMVEARSLADARALLELVGVARDAALREVGRVRHILTH 332 Query: 297 FHLDIVPMWLPVS-----SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 LD+ V+ A W + +G++ ++L Sbjct: 333 RRLDVTVARAEVAAPWRFRAKLASPYEKAAWLDPGALD-IGVSKLARKVLSA 383 >UniRef50_C7G751 A/G-specific adenine glycosylase n=1 Tax=Roseburia intestinalis L1-82 RepID=C7G751_9FIRM Length = 395 Score = 356 bits (914), Expect = 8e-97, Method: Composition-based stats. Identities = 113/374 (30%), Positives = 165/374 (44%), Gaps = 39/374 (10%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q + +L W+ + R LPW+ TPY+VW+SE+MLQQT+V V PYFERF P Sbjct: 6 DLKQLVSPLLQWFLNHAR-VLPWREKPTPYRVWVSEIMLQQTRVEAVKPYFERFTTALPD 64 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA P DE+L LW GLGYY R RN+ KAA +V +GG+ P +E++ L G+G T Sbjct: 65 AKALAVCPEDELLKLWEGLGYYNRVRNMQKAAVEVVENYGGQLPADYEKLLKLKGIGHYT 124 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT--------- 172 AGA+ S++ G P +DGNV RVL R A K+ N++ L E++ Sbjct: 125 AGAVASIAYGIPVPAVDGNVLRVLTRVSADDTDIMKQSFRNEMEKLLEKLMHGADGRNEK 184 Query: 173 -----------------PAVGVERFNQAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANN 214 FNQA+M+LGA IC + P C+ CP ++ C A Sbjct: 185 NVFPWMEDADDLRAKVYHQNLAGAFNQALMELGATICVPNGAPLCTECPWKDFCEAKKQE 244 Query: 215 SWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFA---DEESLR 271 P K + L+L+ +++V + +RP GL GLY P +E + Sbjct: 245 LIGQIPVKSKAKPRKIEKKTVLILRDDEKVAIRKRPAKGLLAGLYELPNVEGNMGQEEVL 304 Query: 272 QWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFT-------GCMDEGNALWYN 324 + + A + L +H FSH + + V L+ Sbjct: 305 SMVKEMGYAPIRIQPLGEAKHIFSHIEWHMTGYAVRVEEPEMHPQVQCEKNRADGLLFVE 364 Query: 325 LAQP-PSVGLAAPV 337 + + Sbjct: 365 AEDAKEKYAIPSAF 378 >UniRef50_D0L1V0 A/G-specific adenine glycosylase n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0L1V0_HALNC Length = 381 Score = 356 bits (913), Expect = 9e-97, Method: Composition-based stats. Identities = 153/374 (40%), Positives = 214/374 (57%), Gaps = 31/374 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M AS F +++LDW+D++GR LPWQ +TPY+VW+SE+MLQQTQVATVI YF RFM RFP Sbjct: 1 MSASDFHSRLLDWFDRHGRHDLPWQHPRTPYRVWISEIMLQQTQVATVIGYFNRFMQRFP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA AP+DEVL LW+GLGYYARARNLH AAQ +A PET ALP VG S Sbjct: 61 SLDVLAAAPVDEVLALWSGLGYYARARNLHAAAQIMAQQ---GVPETRAGWQALPSVGPS 117 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA AI++ + ILDGNVKRVLAR + + L+ +++ TP V + Sbjct: 118 TAAAIMAQAFDVPETILDGNVKRVLARHAGIDRPIEQASTIQALYEVAKLHTPQTRVADY 177 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+MDLGA +CTR P CS CP+ C+A A+N P ++ +Q +P R F+ ++ Sbjct: 178 TQAIMDLGATLCTRHSPGCSACPVSADCVAFASNRVESLPVRRRRQPVPTRRAVFMAIED 237 Query: 241 E-DEVLLAQRPPSGLWGGLYCFPQ-------FADEESLRQWLAQRQIAADNLT------- 285 E + ++L +RPP+G+WGGL+C P+ + E +L ++ +Q ++ Sbjct: 238 EAERLMLIRRPPTGIWGGLWCLPEYIPADDCVSQEHTLDSAVSGKQAKDNSGQQSMLSIH 297 Query: 286 ---------QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPS----VG 332 LT F H F+H+ LD + VS + D + LW L + + +G Sbjct: 298 RLKQRGPTSPLTTFEHRFTHYLLDARIDHMVVSRTSSVEDNPDVLWLPLIELAARAPLLG 357 Query: 333 LAAPVERLLQQLRT 346 L P+ R L Sbjct: 358 LPKPMSRFLSDYPA 371 >UniRef50_B9Y8G7 Putative uncharacterized protein n=1 Tax=Holdemania filiformis DSM 12042 RepID=B9Y8G7_9FIRM Length = 397 Score = 356 bits (913), Expect = 1e-96, Method: Composition-based stats. Identities = 111/343 (32%), Positives = 169/343 (49%), Gaps = 12/343 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++ F ++ WY + R+ LP++ + PY +W+SE+M QQT++ +++PY++R+M R+P Sbjct: 13 VKKMNFQEELCAWYKIHHRE-LPFRQSRDPYAIWVSEIMAQQTRIDSMLPYYKRWMERWP 71 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA AP++++LH+W GLGYY RAR LH A+ V +GG P E++ +PG+G Sbjct: 72 TVEALAEAPIEDILHVWQGLGYYNRARKLHAGAKVVVERYGGLLPADVEQLRTIPGIGFY 131 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ G P +DGNV RV R KK + +W F Sbjct: 132 TAGAIGSIAYGLRAPAVDGNVLRVTTRVLQYGEDITKKTTADYVWRQVYDWMEGSNPAVF 191 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QAMM++GA++CT P+C LCPL C A + SW+ YP KK + E Y +++ Sbjct: 192 TQAMMEIGALVCTPKNPQCLLCPLAPFCGAGQDGSWSQYPVKKKAKPPLELQLYTYWIEN 251 Query: 241 EDE-VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + +LL+ GL GLY PQ A D L +H FSH Sbjct: 252 DRRQILLSDDWSDGLMEGLYRLPQAAAPLE----------QTDQLDFAGQRKHVFSHRVW 301 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + + W Q ++ + + L Sbjct: 302 KMECYRGQWDDQVQETLPEHCFWIAKDQLDTLPIVGAHRKWLD 344 >UniRef50_A6CCE7 A/G-specific adenine glycosylase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCE7_9PLAN Length = 408 Score = 354 bits (910), Expect = 2e-96, Method: Composition-based stats. Identities = 120/397 (30%), Positives = 176/397 (44%), Gaps = 52/397 (13%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +F Q+ WY + R LPW+ P+ VW+SE+MLQQT VA VIPYF+RFM+RFP Sbjct: 12 RRQKFRRQLQSWYVSHQR-DLPWRRQHDPHAVWISEIMLQQTVVAAVIPYFKRFMSRFPD 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A EVL W GLGYY+RARN+HKAA+++A G+FP E + LPG+GR T Sbjct: 71 VETLAAADESEVLQHWEGLGYYSRARNIHKAAKRIAGELEGRFPRDVESLQKLPGIGRYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S + PI++ N R+ +R + P K +N+LW +E + P FN Sbjct: 131 AGAICSFAYDTRAPIVEANTLRLYSRLIGLEEDPRSKSGQNQLWEFAELILPRKSPGEFN 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG+++CT P C CP+ GC A L P K + + T + + Sbjct: 191 QALMDLGSLVCTPQNPGCEDCPVNAGCEAFLRQRQHLIPVPKVRPEITPLTDVSIAVFSG 250 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEES-------------------------------- 269 V++ QR W GL+ FP+ EE Sbjct: 251 SHVMIRQRTAGERWAGLWDFPRLTLEEMNGSPHPTIVRKKTGRQKELFSAQQSAGVEIPE 310 Query: 270 ----------LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVS---------S 310 + I A +T RH+ + + + ++ + Sbjct: 311 GLSPAVISRLENYLKEEAGIEASMQQLMTEIRHSVTRYKIRLLCFVAQLDPRAAKKKTTQ 370 Query: 311 FTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 W ++ + S L+ + + L Sbjct: 371 LNSPPGNSEYQWVSVNELDSYPLSVTGRKFAKLLAER 407 >UniRef50_A1WW55 A/G-specific DNA-adenine glycosylase n=1 Tax=Halorhodospira halophila SL1 RepID=A1WW55_HALHL Length = 358 Score = 354 bits (910), Expect = 2e-96, Method: Composition-based stats. Identities = 159/349 (45%), Positives = 207/349 (59%), Gaps = 4/349 (1%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q++ W ++GR LPWQ TPY+VW+SE+MLQQT+V TV+PYFERFM R+P V Sbjct: 11 QALQEQLIAWQRQHGRNDLPWQQPATPYRVWISEIMLQQTRVETVVPYFERFMERYPDVA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A LD+VL LW GLGYYARARNLH AAQ++ T GG+ P + LPG+G STAG Sbjct: 71 ALAAAELDDVLALWAGLGYYARARNLHAAAQRIQTDWGGQLPAELSALQTLPGIGPSTAG 130 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI SL G+ PILDGNVKRVLAR V GWPG+ V +LW+LS +TP RFNQ Sbjct: 131 AIRSLGHGQPAPILDGNVKRVLARLAGVEGWPGRSPVAKQLWALSAALTPEAECRRFNQG 190 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +MDLGA++CT P C+ CPL C A A + YP +P + P R LL++H D Sbjct: 191 LMDLGALVCTPRDPACNACPLAASCTARAAGNPETYPAPRPARQRPRREVRLLLIEHADA 250 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +LL +RP +G+WGGL+ P+ E + + L A H +HF L + P Sbjct: 251 LLLERRPATGIWGGLWSLPECPPSEDPVTRALRLGARCEPAGDLPARHHALTHFELIMQP 310 Query: 304 MWLPVSSFTGCMDEGNAL--WYNLAQPPSVGLAAPVERLLQQLRTGAPV 350 L ++ T + E + W+ Q GL AP+ R+L+ G PV Sbjct: 311 TRLRWNAATPDIGEPDPQRIWFRPGQDTLPGLPAPILRILRD--AGYPV 357 >UniRef50_Q0P4V1 Putative uncharacterized protein MGC145569 n=1 Tax=Xenopus (Silurana) tropicalis RepID=Q0P4V1_XENTR Length = 520 Score = 354 bits (909), Expect = 3e-96, Method: Composition-based stats. Identities = 128/424 (30%), Positives = 188/424 (44%), Gaps = 83/424 (19%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI--------DKTPYKVWLSEVMLQQTQVATVIPYFE 53 + ++L WYDK R LPW+ D+ Y VW+SEVMLQQTQVATVI Y+ Sbjct: 47 ETEIIRDKLLAWYDKSKR-DLPWRTMACTEPDLDRKAYAVWVSEVMLQQTQVATVIDYYN 105 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 ++M +PT+ DLA + L+EV +W+GLGYY+R R L + A++V GG P + +E+ Sbjct: 106 KWMKVWPTMEDLARSSLEEVNEMWSGLGYYSRGRRLQEGAKKVVLELGGSMPRSADELQK 165 Query: 114 L-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 L PGVGR TAGAI S+S G+ ++DGNV RVL+R + V +KLW+L+ + Sbjct: 166 LLPGVGRYTAGAIASISYGQVTGVVDGNVIRVLSRLRCIGADSSTLAVSDKLWNLANALV 225 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAA--------------------- 211 FNQ MM+LGA +CT KP C+ CPLQ C A Sbjct: 226 DPDRPGDFNQGMMELGATVCTPKKPLCTACPLQGQCKAYLKVIAEKESAVKTLIKKQASP 285 Query: 212 -----------------------------ANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 ++ A +P K K+ + + Sbjct: 286 IAKDVGDIEDCDLGPGLCALCVPTSDPWDSSLGVANFPRKSAKKPSRMEQTAICVWEKCG 345 Query: 243 -----EVLLAQRPPSGLWGGLYCFPQFADEE------------SLRQWLAQRQIAADNLT 285 E L+ QRP SGL GL+ FP +E L Q L+ + L Sbjct: 346 DHGELEYLIVQRPSSGLLAGLWEFPSILLDEKFTEQNRQHSLLGLLQDLSGHAVPLQKLQ 405 Query: 286 QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGN------ALWYNLAQPPSVGLAAPVER 339 H FSH H V +L +++ C + W + + + +++ Sbjct: 406 YKGEVVHIFSHIHQTYVVYFLSLNTTENCSVKTEETERPLTRWVTKKEFLNSAVPTAMKK 465 Query: 340 LLQQ 343 +++ Sbjct: 466 IMKL 469 >UniRef50_A7VHH6 Putative uncharacterized protein n=1 Tax=Clostridium sp. L2-50 RepID=A7VHH6_9CLOT Length = 365 Score = 354 bits (908), Expect = 4e-96, Method: Composition-based stats. Identities = 116/349 (33%), Positives = 174/349 (49%), Gaps = 10/349 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 Q + ++ WY ++ R LPW+ DK PY VWLSE+MLQQT+V V Y+ F+ PT+ Sbjct: 21 KQIADNLIVWYPEHAR-DLPWRKDKEPYHVWLSEIMLQQTRVEAVKEYYRTFLQELPTIE 79 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA D ++ LW GLGYY RARNL KAA +V G FP + E+ +LPG+G TAG Sbjct: 80 ALAEVDDDRLMKLWEGLGYYNRARNLKKAAGEVVIEWQGNFPAEYNEILSLPGIGEYTAG 139 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AI S+ P +DGNV RV R K+ V+ K+ QV + Q+ Sbjct: 140 AIGSICFDLPTPAVDGNVLRVYTRVMEDPSNIDKQAVKKKIREELLQVYRYGHCDMLTQS 199 Query: 184 MMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 +M++GA IC + PKC +CPLQ C A ++SW YP ++ K+ L+L++ED Sbjct: 200 LMEVGATICLPNGAPKCEVCPLQELCKAHKHDSWQQYPVREAKKKRKVEEKAVLMLRYED 259 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLA---QRQIAADNLTQLTAFRHTFSHFHL 299 +V + +R GL GL+ FP S ++ L+ + + + T + H FSH Sbjct: 260 KVAIRKRTEKGLLHGLWEFPNLPGSYSTQEILSYVTSKNLHPKEIWMETTYTHIFSHVEW 319 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQP-PSVGLAAPVERLLQQLRTG 347 + ++ + W L + + + + L +G Sbjct: 320 HMKAFYMECME----QQAKDLRWVTLEELKQEIAIPSAFAPFKDLLNSG 364 >UniRef50_A6GM64 Probable a/g-specific adenine glycosylase protein n=1 Tax=Limnobacter sp. MED105 RepID=A6GM64_9BURK Length = 377 Score = 352 bits (905), Expect = 9e-96, Method: Composition-based stats. Identities = 138/369 (37%), Positives = 187/369 (50%), Gaps = 33/369 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + F ++ W ++GR+TLPWQ YKVWLSEVMLQQTQV TV+ Y+ RF+ +P Sbjct: 4 LTGEPFGEVLVRWQKQHGRQTLPWQHTGDAYKVWLSEVMLQQTQVTTVLAYYARFLQAYP 63 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV DLA AP +V+ LW GLGYY RARNLH A+QVA GG+FP T E+ +LPG+G+S Sbjct: 64 TVADLAGAPEQDVMQLWAGLGYYTRARNLHACAKQVAARFGGQFPRTVAELESLPGIGQS 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI SL+ G PILDGNVKRV R Y + G+P + ++ LW+++E P + Sbjct: 124 TAGAIASLAYGVQAPILDGNVKRVFCRYYGIEGYPEQTTIKKTLWAIAEANVPEQQPGVY 183 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+MDLGA C P CS CPL C+A L P KPK+ PE ++++ Sbjct: 184 NQALMDLGATCCVPRNPACSACPLMQSCVALQKGMVGLLPTPKPKKARPELHFVSMIVED 243 Query: 241 E-DEVLLAQRPPSGLWGGLYCFPQFA-------------DEESLRQWLAQRQIAA----- 281 E VLL + G+W GL+ P A W+ + + A Sbjct: 244 ERGAVLLELQTGKGVWQGLWTTPFEACGSQGGFADDFDNLTRIAAVWVERYGLQAHRVEL 303 Query: 282 ----DNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 L H +H + + + V +W + V Sbjct: 304 EHQLRALQGQPWLVHELTHRKMHFKLLRVKV----------PGVWVGCHVVSEKPVPKIV 353 Query: 338 ERLLQQLRT 346 +LL+Q R Sbjct: 354 HKLLEQSRA 362 >UniRef50_UPI000196B020 hypothetical protein CATMIT_00928 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B020 Length = 333 Score = 352 bits (904), Expect = 1e-95, Method: Composition-based stats. Identities = 105/344 (30%), Positives = 171/344 (49%), Gaps = 16/344 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++++WY + R LP++ PYK+W+SE+MLQQT V VIPY+ RF+ RFP V L Sbjct: 3 IQEELINWYKENKR-VLPFRDIDDPYKIWVSEIMLQQTTVTAVIPYYNRFIERFPDVKTL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ++A ++EV W GLGYY RA +LHK+AQ + FP+ +E++ +L GVG TA AI Sbjct: 62 SDASIEEVYKYWEGLGYYRRALHLHKSAQMIENH----FPKDYEDILSLVGVGPYTASAI 117 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 +S + K + +DGN RVL+R YA+ + K+ + ++ + NQ +M Sbjct: 118 MSFAYHKPYIAVDGNALRVLSRLYAIEDNIASNKTVKKITEIGNKLVVGYDSAQINQGIM 177 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D IC P C CPLQ+ C A + N + P K + ++ +EDEV+ Sbjct: 178 DFANAICLPVHPHCEECPLQSHCQAYSTNKQEVLPINIKKVNKKSISYITGIVVYEDEVM 237 Query: 246 LAQRPPSGLWGGLYCFPQFADEES---LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L + +GL +Y PQ+ E + + I + L+ L F+H FSH + Sbjct: 238 LIKN-KNGLLENMYLLPQYEVESPYSFIEAFKENYGIELEALSHLKDFKHVFSHRTWYMH 296 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 + + +Y + + + +++L+ + Sbjct: 297 VYIFKANKHSIN-------FYTNKEQEEITIPTAHKKILKYYKD 333 >UniRef50_C3J8R7 A/G-specific adenine glycosylase n=2 Tax=Bacteria RepID=C3J8R7_9PORP Length = 357 Score = 351 bits (902), Expect = 2e-95, Method: Composition-based stats. Identities = 126/351 (35%), Positives = 179/351 (50%), Gaps = 12/351 (3%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 Q + ++DW+++ R+ LPW+ + PY +WLSEV+LQQT+V YF RF+ FP+V Sbjct: 9 PQQLARALVDWFEENARE-LPWRTTRDPYHIWLSEVILQQTRVNQGHDYFLRFVEAFPSV 67 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 TDLA A D+VL LW GLGYY+RA NLH+AAQ +AT H G FP+ F + ALPGVG TA Sbjct: 68 TDLAQASEDQVLSLWQGLGYYSRAHNLHRAAQVIATEHQGIFPKDFALIRALPGVGDYTA 127 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG-VERFN 181 GAI S + +P +DGNV RV++R A + E++ + Sbjct: 128 GAIASFAFDMPYPAVDGNVLRVVSRLLASELPIDTLSGKQLCTQAVEELLATQLPPSKLG 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+++LGA++CT P+CS CP + C A L P K K TL +R ++ Sbjct: 188 QALIELGALVCTPQSPQCSACPASSWCRVAELPLARLLPIKGKKLTLRDRYFSYIYTTCH 247 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADE-----ESLRQWLAQRQIAADNLTQLT---AFRHT 293 +VLL +R + +W GLY FP + + + WL + L H Sbjct: 248 QQVLLEKRGRNDIWRGLYQFPLLESDSPQTPDEVAHWLEHELSYSGELRLAPTTFTLEHR 307 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 SH L I + S + W N ++ L P+ R L+QL Sbjct: 308 LSHQILHITIYTAELES--DAVGSTKYQWVNQSELGDYALPTPLVRFLEQL 356 >UniRef50_A6G959 A/G-specific adenine glycosylase n=1 Tax=Plesiocystis pacifica SIR-1 RepID=A6G959_9DELT Length = 378 Score = 350 bits (900), Expect = 3e-95, Method: Composition-based stats. Identities = 117/361 (32%), Positives = 171/361 (47%), Gaps = 22/361 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + + W+ + R LPW+ + PY +W+SE+MLQQT+V TV Y++ F+ RFPT Sbjct: 12 TVRRVAKALGAWFKRDAR-DLPWRRTRDPYAIWVSEIMLQQTRVDTVENYWQPFLDRFPT 70 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A VL W+GLGYY RA+ LH+ AQ V GG+ P T + + A+PG+GR T Sbjct: 71 VESLAAAEQQAVLEAWSGLGYYRRAKLLHRGAQYVHEELGGEVPGTADALRAIPGIGRYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI S++ + P++DGNV RV +R A+ + W V Sbjct: 131 AGAIASIAFDQPAPLVDGNVARVHSRLAAIEAPAEQDAKAEAHWRFVAGVLEHGEPRVLA 190 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+M+LGA +CT P C CP++ C A A P K K+ E L L+ Sbjct: 191 QALMELGATVCTPRSPTCLTCPVREHCRARARGLQDQIPAPKVKKKATEHHLLALALRRG 250 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADE------------ESLRQWLAQRQIAADNLT---- 285 D++L+ +RP GL GGL+C P F + LR L ++ A + Sbjct: 251 DKLLIERRPDEGLLGGLWCLPVFEAPSPVSPAPKGAKLDDLRASLERQAGEALGVKLRLE 310 Query: 286 --QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGN---ALWYNLAQPPSVGLAAPVERL 340 +H FSH ++P + G+ W P+ G+ +L Sbjct: 311 PAPAPRVKHVFSHRIWHLLPWRARANPAPRTRSRGDDLRLAWLARGSDPAGGIPTLTRKL 370 Query: 341 L 341 L Sbjct: 371 L 371 >UniRef50_C4G9U8 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9U8_9FIRM Length = 374 Score = 350 bits (900), Expect = 3e-95, Method: Composition-based stats. Identities = 110/357 (30%), Positives = 174/357 (48%), Gaps = 13/357 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 S +DW+ R LPW+ D+ PY +W+SE+MLQQT+V V PY++ F+A P Sbjct: 5 FDLSPLVTPAMDWFQVNKR-DLPWRRDRDPYHIWVSEIMLQQTRVEAVKPYYQGFLAALP 63 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ DLA P D++ LW GLGYY+R RN+ KAA+ + GG+ P ++ + +LPG+G Sbjct: 64 SIADLAACPQDQLNKLWEGLGYYSRVRNMQKAAEVIMEKFGGQMPSDYDAILSLPGIGPY 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ P +DGNV R+LAR + + + E V PA G F Sbjct: 124 TAGAISSIAFDLPAPAVDGNVLRILARVSEDDIDIKSDLAKKRAQAALEGVMPASGSGIF 183 Query: 181 NQAMMDLGAMICTRSK-PKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 NQAMM++GA C + P C CP + C+A + SW P + + +++ Sbjct: 184 NQAMMEIGATACLPNGDPLCGQCPWKKYCLANRHGSWDHLPVRSKGKPRRIEERTVFIVR 243 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + V++ +RP GL GLY FP + EE+ + Q + +L RH FSH Sbjct: 244 NGSRVVIDRRPDRGLLAGLYEFPNVREHLSEEAALAYAEQMGYEPLRIRRLEEARHIFSH 303 Query: 297 FHLDIVPMWLPVSSFT------GCMDEGNALW-YNLAQPP-SVGLAAPVERLLQQLR 345 ++ + VS + W ++ + + + + ++ Sbjct: 304 VEWRMIGYEIRVSDAELAKEMVENVPADKISWPVDIGEISRKYAIPSAFAAYAKAIQ 360 >UniRef50_A9T1M0 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T1M0_PHYPA Length = 627 Score = 350 bits (899), Expect = 5e-95, Method: Composition-based stats. Identities = 124/453 (27%), Positives = 185/453 (40%), Gaps = 109/453 (24%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ------------------------------------ 25 + A +L WYD R LPW+ Sbjct: 156 EVMTIRASLLSWYDSNHR-VLPWRINLYSCLENPSYSADGAGEICKEESGVGVMNDASLS 214 Query: 26 --------------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLD 71 + Y VW+SE+MLQQT+VATVI Y++R+M +PTV LA A + Sbjct: 215 GVLCRDSMHNATETEGERAYAVWVSEMMLQQTRVATVISYYQRWMRTWPTVYALAQATQE 274 Query: 72 ----------------EVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 EV +W GLGYY RAR L + A+++ GG+FP T EE+ +P Sbjct: 275 VSSSAFIPNSSVQSCLEVNSVWAGLGYYRRARFLLEGAKKIVEEFGGEFPRTAEELQKVP 334 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TAGAI S++ + P++DGNV RVL R A+S P W+L+ Q+ Sbjct: 335 GIGTYTAGAIASIAFKQVVPVVDGNVIRVLCRLRAISLNPKASTTVKLFWALASQLVEEY 394 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS--------WALYPGKKPKQT 227 FNQA+M+LGA ICT + P C+LC + + C A +P K PK Sbjct: 395 RPGDFNQALMELGATICTPTSPSCALCSVSSQCAALKLVHRVEKNVTSVTCFPAKVPKSA 454 Query: 228 LPERTGYFLLLQHEDE---------VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQR- 277 E +++ E + +LL QRPP GL GL+ FP + ++ Sbjct: 455 PREEYVAVCIIELESQTKMSKAASSLLLVQRPPKGLLAGLWEFPSAPLDNPNSTEKSRLM 514 Query: 278 -----------------QIAADNLTQLTAFRHTFSHFHLDIVPMWL-----PVSSFTGCM 315 + + H FSH L + W+ + + Sbjct: 515 EMDNYLCDTLGFKIGSGGYTVMKREEFGTYVHVFSHIRLHMFIQWMLLSTGEIEDKSTPK 574 Query: 316 DEGN--ALWYNLAQPPSVGLAAPVERLLQQLRT 346 + + W +L S+GL + V+++ + Sbjct: 575 MQRSVVTKWVDLDSMNSLGLTSGVKKVYEMFSA 607 >UniRef50_D1U9Y5 A/G-specific adenine glycosylase n=1 Tax=Desulfovibrio aespoeensis Aspo-2 RepID=D1U9Y5_9DELT Length = 369 Score = 350 bits (898), Expect = 6e-95, Method: Composition-based stats. Identities = 115/353 (32%), Positives = 182/353 (51%), Gaps = 13/353 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + F++ +LDWYD GR LPW+ + PY+VW+SE+M QQTQ+ V+ YF+R+MAR+P Sbjct: 1 MDGNLFTSHLLDWYDANGR-DLPWRREPNPYRVWISEIMAQQTQLDRVVGYFDRWMARYP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LA A ++VL LW GLGYY+RARN+ K+A +A HG FP + ALPGVG Sbjct: 60 DLQSLALAREEDVLKLWEGLGYYSRARNILKSASVLAHAHGCVFPSDPIAIRALPGVGAY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ S++ G P +D NV RV AR + + V + + P F Sbjct: 120 TAGAVASIAFGLCEPAVDANVLRVFARLLDLDAPVAETGVRQTVERTVRALIPEDRPGDF 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LGA++C +P+C CP++ C+A A P P + +L Sbjct: 180 NQALMELGALVC-AKRPRCGECPVRAHCLAHARGVADSRPVLPPSRKAIRIEMATGVLVR 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQ--FADEESLRQWLAQ-----RQIAADNLTQLTAFRHT 293 VL+ +R P +W GL+ FP E+ RQ LA+ ++ + + ++T ++ Sbjct: 239 HGRVLIQKRRPGDVWPGLWEFPGGCVEPGETPRQALAREFREEVELVVEPVEKITVVAYS 298 Query: 294 FSHFHLDIVPMWLPVSSFTGCM----DEGNALWYNLAQPPSVGLAAPVERLLQ 342 ++ + + + ++ + + + AQ A RL++ Sbjct: 299 YTRYRVTMHCYRCRLTGDAEPVPVFNEAAEGGFVLPAQLAEYAFPAGHRRLIE 351 >UniRef50_A0M7B9 A/G-specific adenine glycosylase n=16 Tax=Bacteroidetes RepID=A0M7B9_GRAFK Length = 350 Score = 349 bits (897), Expect = 8e-95, Method: Composition-based stats. Identities = 120/355 (33%), Positives = 178/355 (50%), Gaps = 20/355 (5%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 S ++ WY + R LPW+ PY +WLSE+MLQQT++ +PY+ +F+ +P+V DL Sbjct: 3 LSNRLTHWYLQNKR-DLPWRKTHEPYHIWLSEIMLQQTRIEQGLPYYNKFIQAYPSVFDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA +EVL LW GLGYY+RARNLH+ A+ VA GKFP T++ + L GVG TA AI Sbjct: 62 ANATPEEVLKLWQGLGYYSRARNLHETAKYVAFELNGKFPGTYKGLLKLKGVGDYTASAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S+ + ++DGNV RVL+R + + + L++++ FNQA+M Sbjct: 122 ASICYNEPVAVVDGNVYRVLSRIFGIDTPINSAAGIKEFKLLAQELLDKNDPATFNQAIM 181 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE-V 244 + GA+ C KPKC +CP + C+A N+ P K K + +R +L+ HE+E Sbjct: 182 EFGALHCKPQKPKCEICPFNDSCLALKNDKIKELPVKLKKNKVKKRYFNYLVFNHENETT 241 Query: 245 LLAQRPPSGLWGGLYCFPQFA-----DEESLRQWLAQRQIAADNLTQLT-----AFRHTF 294 +L QR G+W GLY FP DE+ L + A + + +L H Sbjct: 242 ILEQRVGKGIWNGLYQFPLVESERLIDEKELVTFEAFKNSITGSGVELELFNEIPVIHKL 301 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL-QQLRTGA 348 SH HL + + + L + +E L + L G Sbjct: 302 SHQHLITRFWVVNSGTASENS-------VKLKNVTDYPVPVLIENFLNEYLPDGY 349 >UniRef50_UPI00016C4021 A/G-specific adenine glycosylase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4021 Length = 375 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 120/352 (34%), Positives = 182/352 (51%), Gaps = 14/352 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 ++L+W+DK+ R LPW+ + Y+VW+SEVMLQQT VA V+PYFERF+A FP Sbjct: 15 VRTKLLNWFDKHQR-DLPWRRTVNGARDAYQVWVSEVMLQQTTVAAVVPYFERFLAAFPD 73 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A VL LW GLGYY RAR+LH AA+ + H G P+ + LPGVGR Sbjct: 74 VRALAAADEQRVLKLWEGLGYYRRARHLHAAAKLLVEAHNGDLPDDPDVWEPLPGVGRYI 133 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 GA+LS + + PI++ N RVLAR +A G P + E + +W+ +E V PA FN Sbjct: 134 LGAVLSQAFDRPLPIVEANSLRVLARLFAYPGDPREGEGKVWVWAAAETVLPAKRAGDFN 193 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 Q++M+LGA++CT + P C CP+++ C A P KK + + E ++++ Sbjct: 194 QSLMELGALVCTPTAPACDRCPVRDNCEAKRLGRQDQIPPKKKQPAITEVAEVGVVIRDG 253 Query: 242 DEVLLAQRPPS-GLWGGLYCFPQFADEE-------SLRQWLAQRQIAADNLTQLTAFRHT 293 VLL QRP + G W ++ P E ++R + ++ RH Sbjct: 254 ANVLLCQRPANAGRWQNMWEIPHAPRTEGEDVSAAAVRVARELTGFDVEPGAEIRTVRHG 313 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + +V + + NA W + +++P +L+ L Sbjct: 314 VTRYAITLVCVGSVLRGGAFAAGHYANAKWVAPQELGDYPVSSPQRKLMTAL 365 >UniRef50_Q7UYZ0 A/G-specific adenine glycosylase n=1 Tax=Rhodopirellula baltica RepID=Q7UYZ0_RHOBA Length = 367 Score = 347 bits (892), Expect = 3e-94, Method: Composition-based stats. Identities = 119/357 (33%), Positives = 189/357 (52%), Gaps = 19/357 (5%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 ++L W+ R LPW+ D +PY+VW+SE+M QQTQVATV+PYFERF++ +PT+ DL Sbjct: 1 MRKRLLSWFSDNAR-DLPWRRDHSPYRVWISEIMCQQTQVATVLPYFERFLSTYPTIRDL 59 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +++ +W GLGYY RAR+LH AA+++ H G+FPE+F++V ALPG+GR TAGAI Sbjct: 60 ADADESQLMRMWEGLGYYRRARSLHAAAKKMVEEHNGEFPESFDDVLALPGIGRYTAGAI 119 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER------ 179 S+S K FPIL+GN +RV +R ++ P +K + +LW LS+++ P + Sbjct: 120 QSISRNKAFPILEGNTQRVFSRWIGLTVPPTEKVAQARLWELSDKMLPPRKADDRSNGPA 179 Query: 180 -FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FNQA M+LGA+IC+ PKC CP+ C A PGK K RT +++ Sbjct: 180 GFNQAAMELGALICSPRSPKCDECPVATMCHANQMGLQDEIPGKISKVQYESRTELAVVI 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQ------WLAQRQIAADNLTQLTAFRH 292 +D L+ P + G+ FP+ E+ Q L +H Sbjct: 240 SRDDRYLVRTIPEGVRFAGMIDFPRAGPPEAADVVGMESWLADQLGGDVRIGMCLKTIKH 299 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMD-----EGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + + + + + + W + + + ++ ++++ L Sbjct: 300 AVTRYRMTLHVHIGEWIADGDRVAESGPIPDSWQWATVDELADMPMSVTGRKIVRLL 356 >UniRef50_UPI0001BC63FA A/G-specific adenine glycosylase n=3 Tax=Fusobacterium RepID=UPI0001BC63FA Length = 365 Score = 347 bits (890), Expect = 5e-94, Method: Composition-based stats. Identities = 121/350 (34%), Positives = 182/350 (52%), Gaps = 7/350 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + + ++L++YDK+ R L W+ + Y W+SE+MLQQT+V V PYF RF+ P + Sbjct: 13 KIAKKLLEYYDKHKR-DLAWRGEVPAYYTWISEIMLQQTRVEAVKPYFARFIEELPNIES 71 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LAN ++++ LW GLGYY+RARNL KAA Q+ +GG+ P+ +E+ L G+G TAGA Sbjct: 72 LANCEEEKLMKLWQGLGYYSRARNLKKAACQIVEFYGGELPKEKKELLHLAGIGPYTAGA 131 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S++ GK +DGNV RV++R +AV G + + K+ L+ Q P FNQA+ Sbjct: 132 ISSIAYGKKETAVDGNVIRVMSRLFAVDGNVLEGKGRQKIEELTYQELPEDRAGDFNQAL 191 Query: 185 MDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 MDLGA IC + C LCPL C A YP KK K+ LLL + Sbjct: 192 MDLGATICIPNGAALCHLCPLHLECQANLKKEVEKYPEKKKKKERKLERQTILLLSDGQK 251 Query: 244 VLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 L +R GL GL+ FP + +R++L ++ I+ + + H FSH Sbjct: 252 FALEKRKEKGLLAGLWQFPMLEGRLSLQEVREYLKEKGISYSGIEEYEPAIHIFSHVEWH 311 Query: 301 IVPMWLPVSSFT-GCMDEGNALWYNLAQ-PPSVGLAAPVERLLQQLRTGA 348 +V + V + E N +W + + + + + L L+ G Sbjct: 312 MVSYIIEVEKWEIQEKREENFVWLSKEEILTEYSVPSAFKVYLDYLKQGQ 361 >UniRef50_Q6AKD3 Related to A/G-specific adenine glycosylase n=1 Tax=Desulfotalea psychrophila RepID=Q6AKD3_DESPS Length = 366 Score = 344 bits (884), Expect = 3e-93, Method: Composition-based stats. Identities = 104/349 (29%), Positives = 170/349 (48%), Gaps = 10/349 (2%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 +F +Q+L W+ R LPW+ PY VW+SE+MLQQTQ+ + YF R++ RFP Sbjct: 9 DLDRFQSQLLTWFRLQDRF-LPWRQTYDPYHVWISEIMLQQTQMDRGVSYFNRWIERFPQ 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V +A A E+ +W GLGYYARARNLH+AA+++ G+ P + + +LPG+G T Sbjct: 68 VEAVAEADEQEIFKMWEGLGYYARARNLHRAAKKIVEEFAGELPCDIDLLRSLPGIGPYT 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A AI S++ P +D NV R+ +R + + + +V + ++ P+ +N Sbjct: 128 AAAIGSVACNIDIPTIDANVARIFSRLFDIDRPVRETQVARAIEKVACDCLPSGRARHWN 187 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLG ++C P+C+LCP+Q C+A A + L P PKQ L + E Sbjct: 188 QALMDLGGLVCLPRAPRCTLCPIQEMCLAFARQTVQLRPVPLPKQKLIHIRRVAIFQVSE 247 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEES------LRQWLAQRQIAADNLTQLTAFRHTFS 295 +L+ LW GL+ FP +E + + L L + +H ++ Sbjct: 248 GRLLIQPSRHPTLWQGLWEFPHTDLQEEEAVEVGITRLLGLLPSENRVLEPMVTVKHAYT 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + L + ++ W G +A ++L+ + Sbjct: 308 KYRVQFHCYLLRGEAGCQGEEQRLCDW---QALARHGFSAGPRKVLEHI 353 >UniRef50_Q7MUV2 A/G-specific adenine glycosylase n=2 Tax=Porphyromonas gingivalis RepID=Q7MUV2_PORGI Length = 407 Score = 344 bits (883), Expect = 3e-93, Method: Composition-based stats. Identities = 103/346 (29%), Positives = 157/346 (45%), Gaps = 16/346 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + +WYD R LPW+ PY++W+SEV+LQQT+V Y+ RF+ FP V Sbjct: 50 PELRKLLAEWYDANKR-DLPWRQTDDPYRIWISEVILQQTRVEQGRDYYHRFIECFPDVH 108 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L+ A DEVL W GLGYY+RARNLH+AA+ + + GG P T +E+ LPG+G TA Sbjct: 109 SLSLASEDEVLKQWEGLGYYSRARNLHRAARMIVSDFGGCIPRTRQEILRLPGIGDYTAA 168 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+LS + F +DGN+ RV++R + + ++ + R NQA Sbjct: 169 AVLSFAYDLPFAAVDGNIFRVISRLMNLDTPIDTPAGKKLFSFWADALLDREAPARHNQA 228 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH--- 240 +M+ GA+ CT + P C LCP++ C+A P KK + R YF+ ++ Sbjct: 229 IMEFGALHCTPTSPSCLLCPVRRFCMADTAGCVDALPVKKGGLRITNRYLYFIYIRVITP 288 Query: 241 -EDEVLLAQRPPSGLWGGLYCFPQFADEE----SLRQWLAQRQIAADNLT------QLTA 289 + +RP +W GLY FP + + +++ Sbjct: 289 TGVYTYIRRRPSGDIWQGLYEFPCVELSDHAVLETLLLSPELGNLLRSISGSMDSLPFKT 348 Query: 290 FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAA 335 F+H +H +L I L + D Q Sbjct: 349 FKHQLTHRNLWIHGYTLT-ARLDKAPDLDGYRCIREEQLDDFAFPR 393 >UniRef50_C9LTK5 A/G-specific adenine glycosylase n=1 Tax=Selenomonas sputigena ATCC 35185 RepID=C9LTK5_9FIRM Length = 404 Score = 344 bits (882), Expect = 4e-93, Method: Composition-based stats. Identities = 115/314 (36%), Positives = 165/314 (52%), Gaps = 6/314 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + +L W+ R LPW+ + TPY+VW+SE+MLQQT+V V PYFERF+A P Sbjct: 15 TLDEIAEPLLRWFHSEKR-ALPWREEPTPYRVWVSEIMLQQTRVEAVKPYFERFVAALPD 73 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA A + ++ LW GLGYY+RAR+L AA+ + + HGG+ P F+ + ALPG+GR T Sbjct: 74 VRALARADENTLMKLWEGLGYYSRARHLQSAARLICSDHGGEIPAHFDGLLALPGIGRYT 133 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGA+ S++ G+ P +DGNV RV+ R A K+ + + P FN Sbjct: 134 AGAVASIAFGERRPAVDGNVLRVIMRLLACPADILKESTKRAVEEALIARLP-EDAGNFN 192 Query: 182 QAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+M+LGA+IC C CPL+ C+A N A P K P + LL Sbjct: 193 QALMELGALICLPRGAAHCPSCPLERLCLAKEANLQAELPQKTPPKRRRTEKLTIFLLAK 252 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADE---ESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 D++ L +RP GL GL+ FP + + L + L L A +H FSH Sbjct: 253 NDKIALEKRPAQGLLAGLWGFPAMEGHLKKKEAEEALQAIGLIPAKLHALPAAQHIFSHI 312 Query: 298 HLDIVPMWLPVSSF 311 +V + ++ Sbjct: 313 TWQMVGWRVELAEP 326 >UniRef50_C9KJM4 A/G-specific adenine glycosylase n=3 Tax=Clostridiales RepID=C9KJM4_9FIRM Length = 370 Score = 343 bits (880), Expect = 6e-93, Method: Composition-based stats. Identities = 114/348 (32%), Positives = 162/348 (46%), Gaps = 9/348 (2%) Query: 4 SQFSAQVLDWYD-KYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 S +L WY+ + + LPW+ + TPY WLSE+MLQQT+ + VIPY+ RF+ P + Sbjct: 19 SSIVGPLLTWYEGQSTLRCLPWREEVTPYHTWLSEIMLQQTRASAVIPYYTRFLEALPDI 78 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA + ++ LW GLGYY+RARNL KAA + +GGK P F + LPG+GR TA Sbjct: 79 ESLARCDDELLMKLWQGLGYYSRARNLKKAAGVICQEYGGKLPADFAALLELPGIGRYTA 138 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG-VERFN 181 AI S++ G P +DGN+ RV R + V + P+ N Sbjct: 139 SAIGSIAFGLPLPAVDGNILRVTMRVLQCGEDIAQPAVRRAVEEALAPYYPSGQAAGALN 198 Query: 182 QAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA MDLGA IC P C+ CPL C+A S P K K LLL Sbjct: 199 QAFMDLGATICLPHGAPHCAACPLARLCLAHDAGSEQQLPVKSKKAKRRVEQKTVLLLHQ 258 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + L +RP GL GL+ P + + WLA R + + +L RH FSH Sbjct: 259 GRALALTKRPNKGLLAGLWELPNLPGHLTQAEVEAWLAARGLRTLTIARLPDARHIFSHV 318 Query: 298 HLDIVPMWLPVSSFTGCMDEG--NALWYNLAQPPS-VGLAAPVERLLQ 342 + + ++S + E +W + + + + + L Sbjct: 319 EWQMQGWAVELASPDAMVAEDASPLVWASPEELADTYSIPSAFQYFLH 366 >UniRef50_C4V5S7 Adenine glycosylase n=1 Tax=Selenomonas flueggei ATCC 43531 RepID=C4V5S7_9FIRM Length = 369 Score = 343 bits (880), Expect = 7e-93, Method: Composition-based stats. Identities = 113/346 (32%), Positives = 160/346 (46%), Gaps = 16/346 (4%) Query: 8 AQVLDWYDKYGR-KTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 +L W+ + LPW+ + TPY VW+SE+MLQQT+ A V Y+ RF+ P++ DLA Sbjct: 17 RALLTWHKSAPDTRDLPWRDEPTPYHVWISEIMLQQTRAAVVRAYYLRFLTALPSIHDLA 76 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 D ++ LW GLGYY+RARNL +AAQ + HGG P F + LPG+GR TA AI Sbjct: 77 AVNDDALMKLWQGLGYYSRARNLKRAAQVIVKEHGGDLPNDFNALLTLPGIGRYTASAIA 136 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG-VERFNQAMM 185 S + G+ P +DGN RV AR A GK + L + P N+A M Sbjct: 137 SFAYGQPHPAVDGNFLRVAARITANPIDIGKDSTKRSLEAALSVSYPEGRDAGLLNEAFM 196 Query: 186 DLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 DLGA IC P C CP C+A + YP K + + L+L D + Sbjct: 197 DLGATICLPHGAPLCHSCPAAQLCLAHDRGTEQDYPVKSAAKARRKEKHTVLILSCGDRI 256 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQW---LAQRQIAADNLTQLTAFRHTFSHFHLDI 301 + +RP GL GL+ +P ++S R L ++ L RH FSH ++ Sbjct: 257 AIRKRPTKGLLAGLWEYPHLDGKQSKRAVRAHLEDEGFRILSIDPLPPARHIFSHIEWEL 316 Query: 302 VPMWLPVSSFTGCM---------DEGNALWYNLAQP-PSVGLAAPV 337 + V+ + LW A+ + + A Sbjct: 317 TGWAVTVAEKNEPPLMAAETASDAPSSLLWVRRAELAETYSIPAAF 362 >UniRef50_C6XRC5 HhH-GPD family protein n=1 Tax=Hirschia baltica ATCC 49814 RepID=C6XRC5_HIRBI Length = 361 Score = 343 bits (880), Expect = 7e-93, Method: Composition-based stats. Identities = 122/349 (34%), Positives = 176/349 (50%), Gaps = 20/349 (5%) Query: 7 SAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 + ++L WYD+ R+ LPW++ PY+VWLSE+MLQQT V PYF++F Sbjct: 17 AHELLQWYDQMARR-LPWRLSPQARKNGDMPDPYRVWLSEIMLQQTTVPHAAPYFDKFTR 75 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 +P V DLA A D+V+ W GLGYYARARNLHK A+ +A L FP T E + ALPGV Sbjct: 76 LWPKVEDLAAAHNDDVMREWAGLGYYARARNLHKCAKVLAGL--DAFPNTIEALLALPGV 133 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK---EVENKLWSLSEQVTPA 174 G TA A+ S++ +DGN++RV++R A++G + + ++ + + P Sbjct: 134 GPYTAAAVGSIAFDLPVAPVDGNIERVISRLMAIAGDGSAAGWAQDKKEITQRVQTLVPQ 193 Query: 175 VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGY 234 F QAMMDLGA +CT P C +CP + C+A + YP K ++ P R G Sbjct: 194 -RSGDFAQAMMDLGASVCTPKSPNCMICPWMDICLARQEGNQESYPAKPKRKPQPIRNGL 252 Query: 235 FLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 ++ HE +VLL +RP SGL GG+ EE L Q + RH F Sbjct: 253 AFVIFHEGKVLLQRRPDSGLLGGMLMPLSSPWEEGGEYTLEDYQPVKADWHYRGQARHVF 312 Query: 295 SHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 +HF L ++ + W ++ GL + L+ Sbjct: 313 THFALHWDVYVAHLTKKPDLSGD----WIDIDMCLEAGLPTVGRKALKL 357 >UniRef50_C9PXZ8 A/G-specific adenine glycosylase n=2 Tax=Prevotella RepID=C9PXZ8_9BACT Length = 403 Score = 342 bits (877), Expect = 2e-92, Method: Composition-based stats. Identities = 123/402 (30%), Positives = 180/402 (44%), Gaps = 66/402 (16%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS +L W+ +GR +LPW+ K PY +WLSEV+LQQT+V+ + Y++RFM +PTV L Sbjct: 7 FSTTLLHWFQHHGR-SLPWRETKDPYAIWLSEVILQQTRVSQGLAYWQRFMHNYPTVNAL 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A DEVL LW GLGYY+RARNLH+AA+Q+ L G FP T + ++ L GVG TA AI Sbjct: 66 AAATEDEVLRLWQGLGYYSRARNLHQAAKQIVEL--GYFPNTHDAISKLKGVGPYTAAAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ ++DGNV RVLAR + + E + + +L++ + P RFN+A+M Sbjct: 124 ASIAFNLPVAVVDGNVYRVLARFFGIDTPINSTEGKKQFATLAQSLLPHHAPARFNEAIM 183 Query: 186 DLGAMICTRSK----------------PK-CSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 D GA+ C K P C+ CPL C+A A P K Q Sbjct: 184 DFGALQCLPVKGETTKTNNNIATTNESPAFCNDCPLNGQCVAYAQGLVRSLPVKTKSQAP 243 Query: 229 PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYC---------------------------- 260 +R ++ L+ EV + +RP +W GL+ Sbjct: 244 KQRRMGYIYLRCRGEVAIRKRPAGDIWQGLWEPLLYEDEVLSSAQTPKTSKQESTNYIED 303 Query: 261 FPQFADEESLRQWLAQRQIAADNLTQL----------------TAFRHTFSHFHLDIVPM 304 FP L L Q A T L FRH +H + Sbjct: 304 FPSPQLLAYLDSLLKGEQCATSTATSLHTHPHPTNSSPTIKGPYTFRHVLTHRIIMAQF- 362 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 V + + +W + ++ E ++++ Sbjct: 363 -AIVETEVKPILPSGYIWVTEGELNKYAISRLFELFQEKVKG 403 >UniRef50_C7PL91 A/G-specific adenine glycosylase n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PL91_CHIPD Length = 354 Score = 341 bits (876), Expect = 2e-92, Method: Composition-based stats. Identities = 109/346 (31%), Positives = 175/346 (50%), Gaps = 12/346 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L+W D +++PW+ +K PY++WLSE++LQQT+V Y+E+F+ +PTV +L Sbjct: 5 FTNALLEWNDNENTRSMPWKGEKDPYRIWLSEIILQQTRVEQGWAYYEKFILNYPTVQEL 64 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP + V LW GLGYYAR +N+ AA+Q+A+ + G FP T+E + +L GVG T+ AI Sbjct: 65 AAAPEEAVFRLWQGLGYYARCKNMLAAAKQIASQYHGHFPNTYETIQSLKGVGPYTSAAI 124 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S + +LDGNV RVL+R + + + + L++++ P +NQ++M Sbjct: 125 ASFAFNLPHAVLDGNVFRVLSRFFDIDTPIDTTAGKKQFTDLAQELLPHGKSASYNQSIM 184 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D GA++C +P C CPL C AL P K K + +R Y+L+LQH++ V Sbjct: 185 DFGAVVCKPQQPACKSCPLAAKCKGYQQGLTALLPVKSKKLVIKKRYFYYLVLQHKENVY 244 Query: 246 LAQRPPSGLWGGLYCF-----PQFADEESLRQWLAQRQIAADNLTQL----TAFRHTFSH 296 + +R + +W L+ F P D SL A + + D + F+ +H Sbjct: 245 IRKRTENDIWQNLHEFILIETPGPEDPGSLLSSAAFKAVMKDIRYNMDGASATFKQQLTH 304 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + + L VS Q + L+ Sbjct: 305 QTIHSQFLLLSVSKKPEI---PGYTAVPRDQLDLYAFPKTITDFLR 347 >UniRef50_B8FVN1 A/G-specific adenine glycosylase n=2 Tax=Desulfitobacterium hafniense RepID=B8FVN1_DESHD Length = 401 Score = 341 bits (876), Expect = 2e-92, Method: Composition-based stats. Identities = 118/395 (29%), Positives = 177/395 (44%), Gaps = 58/395 (14%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 S++++ W+++ R LPW+ K PY +W+SEVMLQQTQV T IPY+ RFM RFPT++ LA Sbjct: 6 SSKLVQWFNQVKR-DLPWRRTKDPYAIWVSEVMLQQTQVVTAIPYYLRFMGRFPTLSHLA 64 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A +EVL LW GLGYY+RAR L + A+ V G+ P+ ++ + + GVG TA AI Sbjct: 65 EAEQEEVLELWRGLGYYSRARRLWEGARYVVETAEGRMPKDYQSLLHIKGVGEYTAAAIA 124 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S++ + P++DGNVKRVL+R K +V P FNQ MM+ Sbjct: 125 SIAYEEQVPVMDGNVKRVLSRILRWEEDVEKARSRRFFLEYLGEVIPGDCPGDFNQGMME 184 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA +CT P+C CPLQ C A +YP KK K+ L+L H+ +VLL Sbjct: 185 LGATVCTPKHPRCEQCPLQADCEGFALGDPQVYPVKKSKEKPGSAWRPTLILLHQGQVLL 244 Query: 247 AQRPPSGLWGGLYCFPQFAD--------------EESLRQWLAQRQ-------------- 278 +RP +GL L+ FP + + Sbjct: 245 KKRPSTGLLADLWEFPGEEMMVPPWENYRGGKEFSAEAVKVAERVGEETSRYRYNLDFAW 304 Query: 279 -------------------IAADNLTQLTAFRHTFSHFHLDIVPMW----------LPVS 309 + + ++ HTFSH I + + V Sbjct: 305 YELYKNQIGDAACDEAVQELLSRRPSRQGPLVHTFSHRRWQIYWLVLNLDEVMDKTVRVR 364 Query: 310 SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + W ++ + + L +++ ++ Sbjct: 365 ENEDSLKGDGLCWLDVKELDKIALPVAFQKVWAKV 399 >UniRef50_Q1Q8E2 A/G-specific DNA-adenine glycosylase n=4 Tax=Moraxellaceae RepID=Q1Q8E2_PSYCK Length = 453 Score = 339 bits (871), Expect = 8e-92, Method: Composition-based stats. Identities = 151/405 (37%), Positives = 206/405 (50%), Gaps = 64/405 (15%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMA 57 F+ ++L W+ GR LPWQ PY VWLSEVMLQQTQV TV+PYF RFM Sbjct: 49 SLDAFAPRLLAWFADNGRHDLPWQQHQTDTPNPYIVWLSEVMLQQTQVTTVLPYFARFMD 108 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG--GKFPETFEEVAALP 115 FPTV DLA A D + W GLGYYARARNLHK A+Q+ + G FP+T A+ Sbjct: 109 SFPTVQDLAAAEWDTIAEHWAGLGYYARARNLHKGAKQLVAVIDETGDFPQTLAGWEAIS 168 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 GVG STAGAI+S+ L ++ I DGNVKRVL R A+ G K LW+L+E++TP Sbjct: 169 GVGPSTAGAIMSMGLHRYGVICDGNVKRVLTRWAAIDGDITKSATTKDLWALAERLTPRE 228 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 F QAMMD+GA +CTRSKP C LCPLQ+ C+A A YP K KQ P + Sbjct: 229 QSGLFAQAMMDMGATLCTRSKPACLLCPLQDDCLAHAQGRETEYPVKAKKQPKPSKFSNA 288 Query: 236 LLLQHE-DEVLLAQRPPSGLWGGLYCFPQ------------------------------- 263 LL+++E E+L QRP +G+WGGL+ P Sbjct: 289 LLIENENGEILWLQRPDNGIWGGLWSLPLAFIEKISGKVAVKDADKNKNKGPIAANDNQL 348 Query: 264 --------------FADEESLRQWLAQRQIAADNLTQL----TAFRHTFSHFHLDIVPMW 305 E+ + +WL + +++A +++ +H+ +HFH + P Sbjct: 349 LEVASNEKIYETEFTTAEQIINEWLDKNKLSAKSVSNTLLDDAPIKHSLTHFHWYLTPQS 408 Query: 306 LPVSSFT-------GCMDEGNALWYNLAQPP-SVGLAAPVERLLQ 342 + ++ E N W N ++GL + ++L+ Sbjct: 409 VTLNDKQVTEITKALQAAEININWLNTDDAQATLGLPRAMVKILE 453 >UniRef50_Q02CA8 A/G-specific DNA-adenine glycosylase n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q02CA8_SOLUE Length = 337 Score = 339 bits (869), Expect = 1e-91, Method: Composition-based stats. Identities = 116/348 (33%), Positives = 174/348 (50%), Gaps = 23/348 (6%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M S+ +L WY R LPW+ + PY+VW+SE+MLQQT+ VIPY+E+F+ARFP Sbjct: 1 MSGPSLSSLLLAWYRAAHR-DLPWRSSRDPYRVWISEIMLQQTRAQAVIPYYEKFLARFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 +V LA AP +EVL LW+GLGYY+RARNL AA+Q+ G FP ++ + ALPG+G Sbjct: 60 SVESLAAAPEEEVLALWSGLGYYSRARNLRNAARQITAAAG--FPREYDAIRALPGIGDY 117 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV----G 176 TA AI S++ G +LDGNV RV+AR + G + ++++Q A Sbjct: 118 TAAAIASIAFGLPHAVLDGNVLRVVARVENDAADIGSSRTRERFRAIAQQWLEAKEVKHD 177 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 FNQA+M+LGA +C +P C +CP+ C A + A P K KQ +G L Sbjct: 178 PGHFNQALMELGATVCVPRRPLCLVCPISACCRARQEGTAAQLPVKLRKQEPVRISGVLL 237 Query: 237 LLQHEDEVLLAQRPP-SGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 +++ +LL +R G + P + ++ ++ RHT + Sbjct: 238 VVRQRGRILLRRREDSQSRMAGFWELPT-----------PEELPGSEVGERIGEIRHTIT 286 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 H ++ G W+ +AQ + + + LQ Sbjct: 287 HHRYTFEV----RAATAAVPATGGFAWFTIAQLAEIPFSTTARKALQL 330 >UniRef50_C6X0N8 A/G-specific adenine glycosylase n=1 Tax=Flavobacteriaceae bacterium 3519-10 RepID=C6X0N8_FLAB3 Length = 372 Score = 338 bits (868), Expect = 2e-91, Method: Composition-based stats. Identities = 108/344 (31%), Positives = 170/344 (49%), Gaps = 15/344 (4%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 +L WY+ +GR LP++ PY++W+ E++ QQT++A + +++ F+ARFP V LAN Sbjct: 15 ENLLAWYNTHGR-DLPFRKTGDPYRIWVCEIIFQQTRIAQGLNHYQNFIARFPDVHTLAN 73 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A DEVL W GLGYY+RA NLHKAA Q+ +GG FP+ ++++ L GVG+ TA AI S Sbjct: 74 AETDEVLLYWKGLGYYSRALNLHKAAMQIIHEYGGIFPKNYDDILKLRGVGKYTAAAIAS 133 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 + G H P +DGN RVL+R +A + N L+ ++ P FN+AMMDL Sbjct: 134 ICFGAHIPAVDGNFYRVLSRVFAEDFDVSNSKAFNYFSGLALRMMPQNKAGHFNEAMMDL 193 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 G+ +C PKC +CPL C+A +P K K + + ++ + L+ Sbjct: 194 GSEVCRPRNPKCEICPLNADCLAFNLGLIPKFPVKTKKTKPTDLELTYYFVEFGERFLIR 253 Query: 248 QRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLP 307 QR +W L+ FP E ++T H +H +L I + Sbjct: 254 QRNEDHIWKKLFEFPIRIPEN-----------FKQDITHHKTVSHQLTHKNLTIHIYKVT 302 Query: 308 VSSFTGCMD---EGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 + + ++ + N L P++ L + + Sbjct: 303 LIAENAFLEFALKENLQIITLEDSHHKSFPKPLQNYLARFNLAS 346 >UniRef50_A9NG45 A/G-specific adenine DNA glycosylase n=1 Tax=Acholeplasma laidlawii PG-8A RepID=A9NG45_ACHLI Length = 334 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 112/339 (33%), Positives = 179/339 (52%), Gaps = 12/339 (3%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++ DWY R LP++ PY +W+SE+MLQQTQV T++P+++RF+ +PT+ DLA A Sbjct: 5 RLFDWYQTNKR-DLPFRKTDNPYHIWVSEIMLQQTQVDTMLPFYDRFLTIYPTIQDLARA 63 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 +DE+L + G+GYY R R LHK AQ V H GK PE + ++ +PG+G TAGAI+S+ Sbjct: 64 DIDEILKVVQGIGYYRRFRMLHKGAQYVIDHHDGKLPEDYFKILKIPGIGAYTAGAIMSI 123 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT-PAVGVERFNQAMMDL 187 + K +P DGNV RVL+R + + + KL +++++ + + Q+MM+L Sbjct: 124 AFHKPYPATDGNVIRVLSRVKMLEDDFRLDKNKKKLNEMNKELIENSNNPYLYTQSMMEL 183 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA +C S P C CPLQ C+A NN YP P +T E + +L++++ L+ Sbjct: 184 GATVCKVSNPLCDTCPLQEVCLANINNVQQNYPKMSPLKTKKEIQYFTFILEYKEGFLMR 243 Query: 248 QRPPSGLWGGLYCFPQFADEE--SLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMW 305 +R L G Y F Q + + L +L +H F+H I Sbjct: 244 KRTED-LLHGFYEFVQIESDSLNGALTQAQDLGLEIQILDELQPIKHVFTHMIWQINLYR 302 Query: 306 LPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 +S+ T + + V +A V++ ++Q+ Sbjct: 303 GTISNITSPYEI-------VHDFSQVPIATVVKKQIKQI 334 >UniRef50_A7HQZ1 A/G-specific adenine glycosylase n=45 Tax=Alphaproteobacteria RepID=A7HQZ1_PARL1 Length = 615 Score = 337 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 132/361 (36%), Positives = 187/361 (51%), Gaps = 29/361 (8%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK----TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 +A +L WYDK+ R LPW+ K PY VWLSE+MLQQT VATV PYF F+ R+ Sbjct: 245 KAAAAPLLAWYDKHAR-VLPWRARKGERADPYAVWLSEIMLQQTTVATVGPYFTGFLKRW 303 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P V LA AP +EV+ W GLGYY+RARNLH A++V++ +GGKFP+T E + +LPG+G Sbjct: 304 PNVEALAAAPQEEVMKAWAGLGYYSRARNLHACAKEVSSEYGGKFPDTVEGLESLPGIGP 363 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA AI +++ G+ ++DGNV+RV+AR + + + + + + +TP Sbjct: 364 YTAAAIAAIAFGRAATVVDGNVERVVARLFEIETPLPAAKPD--IREKARTLTPEQRAGD 421 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 F QAMMDLGA ICT P C+ CP+ + C A A + L P + PK+ P R G L Sbjct: 422 FAQAMMDLGATICTPRSPACNRCPINDLCDARAAGTQNLLPARAPKKARPTRRGACFWLV 481 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLR--------------------QWLAQRQI 279 E V L +RP GL GG+ P +ES R + L + Sbjct: 482 REGHVWLRRRPDKGLLGGMLEVPGTPWDESDRHRTVIELSHDENGGRRGVAGEVLDHAPM 541 Query: 280 AADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVER 339 A+ HTF+HFHL++ + +W L + L + + Sbjct: 542 EAEWRLVPGLVEHTFTHFHLELEVFTATTRKK--IVPGREGMWVPLEEVAGEALPTVMRK 599 Query: 340 L 340 + Sbjct: 600 V 600 >UniRef50_B7CBH3 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CBH3_9FIRM Length = 328 Score = 336 bits (863), Expect = 7e-91, Method: Composition-based stats. Identities = 102/344 (29%), Positives = 175/344 (50%), Gaps = 25/344 (7%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +F ++DWY K R L +++ K PY++W+SE+M QQT++ ++PYF+R++ + P + Sbjct: 2 KEFQKDLIDWYIKNHR-PLEFRLKKDPYEIWISEIMAQQTRIEAMLPYFKRWIQQLPDIE 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 +A +++ LW GLGYY+R +N+ K A + + GK P T EE+ LPG+G TAG Sbjct: 61 SVAKCDDEKLNKLWQGLGYYSRCKNIKKCAIECVEKYSGKLPCTKEELLKLPGIGPYTAG 120 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFNQ 182 AI S++ G+ +DGNV RV +R Y + K V+ ++ L ++ P + +NQ Sbjct: 121 AIASIANGQRVSAVDGNVIRVFSRLYNIFEDVTKTSVKKQIEELVDESLPSKEEISYYNQ 180 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA+IC P+C LCP++ C A P K K+ + +L H+D Sbjct: 181 AIMELGALICIPKNPRCELCPVKKYCDAK---DPGSLPYKAKKKMRKIEYKHLYILVHKD 237 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 +V + +R +GL GLY F + E + +L + H FSH + Sbjct: 238 KVHVVKRADTGLLAGLYGFDETKPEHIIESI------------ELEPYIHIFSHVEWHMD 285 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQLR 345 V ++ N + + + + + ++Q++ Sbjct: 286 ATLCIVDR-----EDSNYK--TVEEIEEHIAMPSAFMPFMKQVK 322 >UniRef50_C1XGJ6 A/G-specific DNA-adenine glycosylase n=1 Tax=Meiothermus ruber DSM 1279 RepID=C1XGJ6_MEIRU Length = 336 Score = 336 bits (862), Expect = 7e-91, Method: Composition-based stats. Identities = 121/351 (34%), Positives = 178/351 (50%), Gaps = 26/351 (7%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + +S + +L WY ++ R+ LPW+ + PY+V LSEV+LQQT+V IPY+ RF+ RFP Sbjct: 6 LMSSDLHSALLTWYQRHQRR-LPWRGEPDPYRVLLSEVLLQQTRVEQAIPYYHRFLQRFP 64 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A + VL +W G GYY RARNLH+ AQQV GG P++ + ALPG+G Sbjct: 65 TLEALAQAEQEAVLQVWQGCGYYTRARNLHRLAQQVVAA-GGVLPQSARGLRALPGIGPY 123 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVE 178 TA A+ S++ G+ +DGNV+RVL+R A P K+V+ +L + Sbjct: 124 TAAAVASIAFGEPAAAVDGNVRRVLSRLLA-WEHPTPKQVQEAADALLSALVQQKDARPG 182 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA+M+LGA +CT P C CP+ C S A YP + ++ L+L Sbjct: 183 DWNQALMELGATVCTPQNPGCGGCPVAAFCQ--GKASPAHYPASRVRK-QKSLELVALVL 239 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 Q + V L QR + GGL+ P +L + LA+ + + L + RH F+H Sbjct: 240 QGPEGVYLEQR-QGPVLGGLWGVPMEEGPGALERLLARFGLDSAEL--VGPVRHEFTHRK 296 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 L I P W P L+ +++L L +P Sbjct: 297 LHIRVYRAP--------------WVAGENPVRRPLSRLDQKIL-ALAGASP 332 >UniRef50_D2MN98 Putative A/G-specific adenine glycosylase n=1 Tax=Bulleidia extructa W1219 RepID=D2MN98_9FIRM Length = 346 Score = 336 bits (862), Expect = 8e-91, Method: Composition-based stats. Identities = 112/340 (32%), Positives = 166/340 (48%), Gaps = 8/340 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 Q +F ++ WY + + PW++ K PY VW+SE+MLQQT++ TV+P +ERFM P Sbjct: 3 FQKKEFQQ-LIQWYQAHH-QNYPWRLTKDPYAVWISEIMLQQTRIETVLPKYERFMKELP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ DLA D ++H W GLGYY+RARNL KAA Q+ H G FP EE+ AL G+G Sbjct: 61 SIHDLATVSEDHLMHFWEGLGYYSRARNLQKAALQIEDRHHGIFPHQLEEIQALTGIGDY 120 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV---GV 177 TAGAI S S G P +DGNV RV +R + V++ + + Sbjct: 121 TAGAIASFSFGVGVPAIDGNVLRVYSRHEGLYQNVLDPSVKSLVKQQMLPLYTKEKHSDN 180 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 FNQA+M+LG IC P C CP+Q GC + ++ P + K + FL+ Sbjct: 181 GDFNQAIMELGEQICLPKNPDCQNCPIQKGCFSFLHHKQGELPIRISKTKKKKEEHSFLI 240 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 ++ +L+ +RP +GL LY + +L Q QI + L++ +H FSH Sbjct: 241 FYTKEAILVHKRPVNGLLANLYEPVNLDFFIDIEDFLQQHQIPVLSYQTLSSHKHIFSHR 300 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 I + V W + + ++ Sbjct: 301 IWQIQAYAIEVKELFPF---PEYQWIEKKKSQHLAFSSAF 337 >UniRef50_D2QT13 A/G-specific adenine glycosylase n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QT13_9SPHI Length = 362 Score = 336 bits (862), Expect = 9e-91, Method: Composition-based stats. Identities = 107/345 (31%), Positives = 167/345 (48%), Gaps = 10/345 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ + WY + R LPW+ PY +WLSE++LQQT+VA PY+ERF+ +PT++D+ Sbjct: 14 FAPTLEQWYSVHKR-DLPWRHTLDPYYIWLSEIILQQTRVAQGKPYYERFVDAYPTISDM 72 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA ++L LW GLGYY+RARNLH+ A+ V GKFP T+ ++ + G+G TA A+ Sbjct: 73 ANADERKLLRLWQGLGYYSRARNLHQTARYVTEKLDGKFPNTYHDLLKMKGIGAYTAAAV 132 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFNQAM 184 S + G+ P++DGNV RVLAR + ++ + +L+ ++ A +NQA+ Sbjct: 133 ASFAFGERVPVVDGNVYRVLARVFGITEDITTTTAKKTFAALATRLMQAAVDPATYNQAI 192 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+ GA+ CT P C LCP+Q C+A P K K + ER +++ Q + Sbjct: 193 MEFGAIHCTPVAPDCLLCPVQQQCVAYLTGRQHQLPVKSKKAPVRERFFSYVVFQQGGRL 252 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLA-QRQIAADNLTQLTAFRHT-------FSH 296 L +R +W LY F +E + +L SH Sbjct: 253 ALQERVARDIWQNLYDFYLVETDEPKTALRDIPLSESVSSLVNRGTLVEAPVEAMQLLSH 312 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + + + WY+LA+ + + L Sbjct: 313 QRIRAYFFLVDLPETETGSLPIGLQWYSLAEISILPKPVLITNYL 357 >UniRef50_B3EUI2 A/G-specific adenine glycosylase n=1 Tax=Candidatus Amoebophilus asiaticus 5a2 RepID=B3EUI2_AMOA5 Length = 373 Score = 335 bits (861), Expect = 1e-90, Method: Composition-based stats. Identities = 105/314 (33%), Positives = 174/314 (55%), Gaps = 9/314 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ ++++WY + R LPW+ K PYK+WLSE++LQQT+VA +PY++RF+ +PT+ DL Sbjct: 20 FAIKLIEWYQHHHR-ALPWRETKDPYKIWLSEIILQQTRVAQGLPYYQRFIENYPTIHDL 78 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+A +L +W GLGYY RARNLH A+ + T GKFP ++ + +LPG+G TA AI Sbjct: 79 ASASETAILRVWQGLGYYTRARNLHACARTIVTQFQGKFPNNYKALLSLPGIGVYTAAAI 138 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ + P++DGNV RVLAR + + + ++ L++ + + +NQA+M Sbjct: 139 ASIAFKEPIPVIDGNVYRVLARIFDIETAINSTKGKHIFNQLAQTLISKTAPDIYNQAIM 198 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ-HEDEV 244 + GA+ CT KP C+ C + C A N L P K+ K + +R ++L +Q +D++ Sbjct: 199 EFGAIQCTPLKPLCNTCIFKMDCSAFLANKQHLLPVKEAKVKIKQRFFHYLCIQLDDDQL 258 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLR------QWLAQRQIAADNLTQLTAF-RHTFSHF 297 + R P +W GLY F + E + + + + ++ +H +H Sbjct: 259 FMKSRKPGDIWTGLYDFYLVEESERKEFDQLEDELVQLIKKHQLYIEKVPTVYKHILTHR 318 Query: 298 HLDIVPMWLPVSSF 311 L + V+ Sbjct: 319 VLYASFFKIIVTKA 332 >UniRef50_C1F8W0 A/G-specific adenine glycosylase, putative n=1 Tax=Acidobacterium capsulatum ATCC 51196 RepID=C1F8W0_ACIC5 Length = 354 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. Identities = 112/351 (31%), Positives = 175/351 (49%), Gaps = 25/351 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + + F + WY + R LPW+ + PY +W+SE+MLQQT+VA V+ Y++RFM +FPT Sbjct: 4 EIASFQRDISAWYRQNAR-DLPWRRTRDPYAIWVSEIMLQQTRVAAVMEYYQRFMGQFPT 62 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA+AP + VL LW+GLGYY RAR +H AA V HGGK P T ++ LPG+G T Sbjct: 63 IEALASAPEESVLALWSGLGYYRRARMMHHAAHIVVAEHGGKMPATAAQLRKLPGIGDYT 122 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE--NKLWSLSEQVTPAVGVER 179 + A+ S+S + P++DGNV+RVL R G + + L + ++++ Sbjct: 123 SAAVASISFDEPVPVIDGNVERVLLRLRGEPAVKGHPDAPGLSDLKAAAQELLDTEQPGD 182 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 FNQAMM+LGA +C P C+ CP++ C + K+ + Y L+ Q Sbjct: 183 FNQAMMELGATVCLPRAPLCAECPVRAYCRTQGEHETG-----PAKKMRSVQVSYALIRQ 237 Query: 240 HED---EVLLAQRPPSGL-WGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 +LLAQR GL+ P LA+ I L RH+ + Sbjct: 238 APGPRTAILLAQRGAQESQMPGLWELPLVD-----PAGLAEDGI-------LLTVRHSIT 285 Query: 296 HFHLDIVPM-WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + + LP + + W + + S+ L ++L++L+ Sbjct: 286 NTNYYVTIYSLLPDQRDRLQVPARSTRWISARELRSIPLTGLARKVLKRLK 336 >UniRef50_Q2KBH8 A/G-specific adenine glycosylase protein n=11 Tax=Alphaproteobacteria RepID=Q2KBH8_RHIEC Length = 367 Score = 335 bits (859), Expect = 2e-90, Method: Composition-based stats. Identities = 130/347 (37%), Positives = 187/347 (53%), Gaps = 15/347 (4%) Query: 7 SAQVLDWYDKYGRKTLPWQIDK---------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 + +LDWYD++ R LPW+I PY+VWLSEVMLQQT V V PYFE+F+ Sbjct: 11 AKPLLDWYDRHHR-DLPWRISPGMAARGVKPDPYRVWLSEVMLQQTTVQAVKPYFEKFLE 69 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 R+P VTDLA A D V+ W GLGYYARARNL K A+ VA HGG FP+T E + +LPG+ Sbjct: 70 RWPEVTDLAAAESDAVMATWAGLGYYARARNLKKCAEAVAKEHGGVFPDTEEGLKSLPGI 129 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G TA A+ +++ + ++DGNV+RV++R YA+ + + + +TP+ Sbjct: 130 GDYTAAAVAAIAFNRQAAVMDGNVERVISRLYAIEAPLPAAKP--AMKAKVALLTPSGRP 187 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 F QAMMDLGA ICT +P CSLCP +N C A + L+P K K+ P R G + Sbjct: 188 GDFAQAMMDLGATICTPKRPACSLCPFRNACQALKLSDPELFPVKAAKKEKPVRHGAAFI 247 Query: 238 -LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD-NLTQLTAFRHTFS 295 + + E+LL +R SGL GG+ P A L + + H F+ Sbjct: 248 AVSGDGEILLRRRAESGLLGGMTEVPTTAWTARLDGETSSAAAPFEAAWQACGTVIHVFT 307 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWY-NLAQPPSVGLAAPVERLL 341 HF L + + +++ D N W+ + + L +++ + Sbjct: 308 HFELRLSIWRVAIAAKPKSRDGVNDEWWEPVTNLEAQALPTIMKKAI 354 >UniRef50_Q04T89 A/G-specific DNA glycosylase n=4 Tax=Leptospira RepID=Q04T89_LEPBJ Length = 372 Score = 334 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 106/352 (30%), Positives = 172/352 (48%), Gaps = 12/352 (3%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + +L W+ K RK LP++I+K Y++W+SE+MLQQT+VA ++P +E F+ RFP Sbjct: 12 ELRENLLSWFQKNKRK-LPFRINKNAYRIWVSEIMLQQTRVAAMLPIYETFLKRFPDPKA 70 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 L +A +EV+ W GLGYY+RARNL K A+ + + G+FPE +EE ++PGVG TA A Sbjct: 71 LQDASEEEVMKYWKGLGYYSRARNLKKGAELLVEKYEGRFPEDYEEALSIPGVGSYTASA 130 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 +LS++ GK + +LDGNVKRVL+R + + P L L+++ N+AM Sbjct: 131 VLSIAYGKPYAVLDGNVKRVLSRLFLIESDPNSNSTNQVLADLAQKFLTPGDPGNHNEAM 190 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA++C P CS CP +N C A P K + + FL L+ + Sbjct: 191 MELGALVCIPV-PNCSACPFENHCEAKGAGKEKEIPVTKSVENWVDLDLNFLFLKSSGRI 249 Query: 245 LLAQRPPSGLWGGLYCFP---QFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 LL + + +Y P + +W + + + RH+ ++ + + Sbjct: 250 LLVKYTTRRFFKTIYSLPFRLEGKHPYEKDEWAEELFADSGVIQNSLRSRHSITNHRIRL 309 Query: 302 VP------MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 + + + W N + + + L +LR Sbjct: 310 KFSELDEKNVVRIEKSLQKRKDIQFKWVNESDLKEE-FPSSISGKLIRLRNK 360 >UniRef50_B1LXG3 A/G-specific adenine glycosylase n=74 Tax=Bacteria RepID=B1LXG3_METRJ Length = 464 Score = 334 bits (858), Expect = 3e-90, Method: Composition-based stats. Identities = 124/344 (36%), Positives = 174/344 (50%), Gaps = 14/344 (4%) Query: 6 FSAQVLDWYDKYGRKTLPWQI----DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +L WYD++ R+ LPW+ PY++WLSEVMLQQT +A V PYFERF+ RFP Sbjct: 33 RADDLLAWYDRH-RRVLPWRALAGAAPDPYRIWLSEVMLQQTTIAAVRPYFERFLTRFPD 91 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA AP + V+ W GLGYY+RARNLH A+ VA GG+FP+T E + LPG+G T Sbjct: 92 IFALAEAPEEAVMSAWAGLGYYSRARNLHACARTVAAA-GGRFPDTAEGLRKLPGIGAYT 150 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGAI +++ + +DGNV+RVL+R YAV E + L++ + P F Sbjct: 151 AGAIAAIAFDRQEAAVDGNVERVLSRAYAVEAPLPGSRPE--IRRLTQALVPTDRPGDFA 208 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+MDLGA ICT +P C+LCP C A + + +P K R G + Sbjct: 209 QALMDLGATICTPKRPACALCPWMRPCRARSLGTQESFPRKIKVAKGALRRGAAFVAIRS 268 Query: 242 D--EVLLAQRPPSGLWGGLYCFPQF--ADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 VLL RPP GL G + P + + L + A RH F+HF Sbjct: 269 GDEAVLLRTRPPEGLLGNMAEPPGSAWEPDYDVAAALLDAPLDARWKRLPGLVRHGFTHF 328 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 L++ V+ T + + L + ++L Sbjct: 329 PLELTVFVARVALATPA--PAGTRFTPRSALDDEPLPGLMRKVL 370 >UniRef50_C8P1Q1 A/G-specific adenine glycosylase n=1 Tax=Erysipelothrix rhusiopathiae ATCC 19414 RepID=C8P1Q1_ERYRH Length = 327 Score = 333 bits (855), Expect = 5e-90, Method: Composition-based stats. Identities = 102/334 (30%), Positives = 156/334 (46%), Gaps = 21/334 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +++ WY + R LP++ K PYK+W+SE+M QQTQ+AT+IPY++R++ R+P V L Sbjct: 3 FTERLMGWYHENKR-VLPFRSQKDPYKIWVSEIMAQQTQIATMIPYYDRWIKRYPDVETL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA +DEVL +W GLGYY RARNLH AQ V + G P +E+ +PG+G T+ AI Sbjct: 62 ANAEIDEVLKMWEGLGYYRRARNLHAGAQYVMEHYEGTLPADKKELMKIPGIGDYTSSAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS--EQVTPAVGVERFNQA 183 S++ +DGNVKRV+AR + + + E + F QA Sbjct: 122 ASIAFSLPEIAIDGNVKRVMARYLNYTENVNTRACHKYFETFLKKELLLNGADPSDFTQA 181 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 +M+LGA++CT S C CP + C P + +P L+ Q + Sbjct: 182 LMELGALVCTPSNTNCEGCPFKEMCACYRGECIGTIPFIPKAKPVPIYEKSVLIYQKDGR 241 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVP 303 +L++ GL GL PQ + + +H FSH +I Sbjct: 242 ILISDDHEDGLMEGLLRLPQIDGIYATD--------------PILTLKHKFSHLQWNISV 287 Query: 304 MWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 + T N + + + + Sbjct: 288 FEVD----TIQDLNHNWYFIPREELKRLAIITAH 317 >UniRef50_C0BLH9 A/G-specific adenine glycosylase n=2 Tax=Flavobacteria RepID=C0BLH9_9BACT Length = 350 Score = 333 bits (854), Expect = 6e-90, Method: Composition-based stats. Identities = 110/356 (30%), Positives = 156/356 (43%), Gaps = 20/356 (5%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 FS +++ WY R +LPW+ K PY +WLSE++LQQT+VA PY+ +F FPTV D Sbjct: 2 SFSQEIIHWYQANKR-SLPWRETKDPYHIWLSEIILQQTRVAQGTPYYVKFTDAFPTVFD 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA AP ++VL LW GLGYY+RARNLH AAQ V G FP ++ + L GVG TA A Sbjct: 61 LAAAPEEQVLKLWQGLGYYSRARNLHAAAQYVVNELNGVFPANYKALLQLKGVGDYTASA 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S+ +LDGNV RVL+R + + E +++ +NQA+ Sbjct: 121 IASICFNSPEAVLDGNVYRVLSRYFGIDVPVNTTEGARLFKAIALDQLDLANPSDYNQAI 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE-DE 243 M+ GA+ CT C CPLQ+ C+A + P K PKQ + Y+ + Sbjct: 181 MEFGALQCTPKLTDCPSCPLQSTCVAYRTQKVTVLPVKLPKQKPVKVYHYYFVFVDPFGN 240 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA-----------FRH 292 L +R G+W GLY FP + L H Sbjct: 241 TLYEKRTGKGIWEGLYQFPLATFSAPRTVEGLLTSMETAIDFSLYENAKHFVYHDTYIVH 300 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 +H H+ + + T + A + + + Sbjct: 301 KLTHRHIYATFIVIE----TAVTLKKG---MPFGSIDQFPTAVLISDFINAFKNSY 349 >UniRef50_C6XY88 Helix-hairpin-helix motif protein n=4 Tax=Sphingobacteriaceae RepID=C6XY88_PEDHD Length = 349 Score = 332 bits (853), Expect = 9e-90, Method: Composition-based stats. Identities = 107/340 (31%), Positives = 166/340 (48%), Gaps = 9/340 (2%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 F +++++WY + R LPW+ Y +WLSEV+LQQT+V +PYF F+ +PTV D Sbjct: 2 SFQSEIVNWYLNHKR-DLPWRGTTDAYIIWLSEVILQQTRVDQGLPYFNNFLQNYPTVLD 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 A+A +VL LW GLGYY+R RN+ A+QV LHGG FP ++++ L G+G TA A Sbjct: 61 FASASETQVLKLWQGLGYYSRGRNMLFTARQVRDLHGGVFPVRYDQLIKLKGIGEYTAAA 120 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I S S + +LDGNV RVL+R + + + + L++ + +NQA+ Sbjct: 121 IASFSSNESKAVLDGNVFRVLSRYFGIESPINSSTGKKQFADLAQSLISGQQPSVYNQAI 180 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+ GA+ C P C +CP+Q+ C A ++ P K K R + L D + Sbjct: 181 MEFGALQCKPKSPNCGICPVQDSCFAQKHHLVGTLPVKLNKLKKRTRYFNYFLCMEGDNI 240 Query: 245 LLAQRPPSGLWGGLYCFPQF-------ADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 L+ +R P +W LY FP D E L + AA + L+ +H +H Sbjct: 241 LVKKRSPGDIWQELYDFPLIETDRPFLEDPEKFAPLLQESFGAACKVRTLSHQKHLLTHQ 300 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPV 337 + + L + W +L + + + Sbjct: 301 TIYVQFFGLD-NYIINFNQNAEIKWVSLPEFDELPQPKVI 339 >UniRef50_A9BD72 A/G-specific DNA glycosylase n=3 Tax=Cyanobacteria RepID=A9BD72_PROM4 Length = 399 Score = 332 bits (853), Expect = 9e-90, Method: Composition-based stats. Identities = 120/371 (32%), Positives = 178/371 (47%), Gaps = 29/371 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-----------PYKVWLSEVMLQQTQVATVIP 50 + + +L W+ GR +PW++ K Y + ++EVMLQQTQ+ V+P Sbjct: 20 KIDALRSTLLQWFKSNGRHYIPWKLTKDGTLPNENQYLAVYPILVAEVMLQQTQLKVVLP 79 Query: 51 YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG-------GK 103 Y+E++M PT+ DLA A D+VL LW GLGYY+RAR LH ++ + L G Sbjct: 80 YWEKWMLALPTLVDLAKAEEDKVLLLWQGLGYYSRARRLHVTSRILLNLIGIPNSLNPAN 139 Query: 104 FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK 163 +P+ E LPG+GR+TAG+I+S + P+LDGNVKRVL R + P K Sbjct: 140 WPKDLESWMNLPGIGRNTAGSIISSAFNLPSPLLDGNVKRVLTRLIGSTKTPNKDLAR-- 197 Query: 164 LWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 LW LS+ + FNQA+MDLGA ICT+ P C+ CP QN C A + + P K Sbjct: 198 LWKLSDLLLDKNLPRTFNQALMDLGATICTKYNPICTNCPWQNYCSAYNSGNPENLPVKG 257 Query: 224 PKQTL-PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLA 275 K L G L+L +VL+ QR G GG++ FP E+ R+ Sbjct: 258 QKLILSKAVIGVGLILNKNQDVLIDQRLDEGSMGGMWEFPGGKKEKDESIEMTIARELRE 317 Query: 276 QRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPV-SSFTGCMDEGNALWYNLAQPPSVGLA 334 + + +L F H+++H L + + S + W L+ + Sbjct: 318 ELGVEVKVGKKLIEFDHSYTHKKLHFIVHLCELISGKPKPLSSQEVRWVKLSDLQNYPFP 377 Query: 335 APVERLLQQLR 345 ++ L+ Sbjct: 378 KANSYMISALK 388 >UniRef50_C0YGU7 Possible A/G-specific adenine glycosylase n=1 Tax=Chryseobacterium gleum ATCC 35910 RepID=C0YGU7_9FLAO Length = 359 Score = 332 bits (853), Expect = 1e-89, Method: Composition-based stats. Identities = 109/341 (31%), Positives = 160/341 (46%), Gaps = 16/341 (4%) Query: 5 QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 ++L+WY R LP++ K PYK+W+ E++ QQT++ + ++ F+ RFP V Sbjct: 27 HIGNRLLEWYRNNAR-DLPFRQTKDPYKIWICEIVFQQTRINQGLNHYNNFIKRFPDVKT 85 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A +EVL W GLGYY+RA N+HKAAQQ+ + G FP +EE+ L GVG+ TA A Sbjct: 86 LAEADENEVLLYWKGLGYYSRAINIHKAAQQIMNDYQGVFPHQYEEILKLKGVGKYTAAA 145 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S+ G P +DGN RVL+R +A L+ V P V FNQAM Sbjct: 146 VSSICFGGRMPAVDGNFYRVLSRIFADDFDISNSRAFTYFSELAALVMP-ENVGDFNQAM 204 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+ IC P C CPL C+A + + YP K K + + + + Sbjct: 205 MDLGSEICKPKNPLCGECPLNEDCLAFSLQKISDYPVKTKKVKAGDLALTYYFVHRNGKF 264 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 L+ QR +W L+ FP + + H +H +L I Sbjct: 265 LIRQRADDFIWKKLFEFPAAIPSDMKPFITGSK-----------TITHKLTHKNLSIEIW 313 Query: 305 WLPVSS---FTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + V+S + + E L +L P+E +Q Sbjct: 314 NVEVTSEKVWNDFIAENQYLITDLEGSHEKSFPKPLEIYIQ 354 >UniRef50_C8WEE4 A/G-specific adenine glycosylase n=3 Tax=Zymomonas mobilis RepID=C8WEE4_ZYMMN Length = 373 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 116/368 (31%), Positives = 191/368 (51%), Gaps = 36/368 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK-----TPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 + + +L+WY ++ R LPW+ + PY+VWLSE+MLQQT A PY+ +F+ R Sbjct: 8 REITTDLLNWYQRHAR-ILPWRTEAQQNKVDPYRVWLSEIMLQQTTTAHAAPYYLKFVER 66 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 +PTV LA A +V+ W GLGYY+RARNL K A++V GGKFP+ + + ALPG+G Sbjct: 67 WPTVEALAAAQEADVMAEWAGLGYYSRARNLIKCAKEVVAS-GGKFPDNEQGLLALPGIG 125 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 R TA AI++++ GK ++D NV+RV++R +A+ + ++++TP + Sbjct: 126 RYTAAAIVAIAFGKRAVVVDANVERVVSRLFAIETPLPASRP--IIAEETDKLTPDLAAG 183 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 F QAMMD+GA IC +P C++CP+ C + A++P K PK+ P R GY ++ Sbjct: 184 DFAQAMMDIGATICVNRQPTCAICPMMPHCEGQKTGNPAVFPIKAPKKIRPTRIGYAFIV 243 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWL----------------------AQ 276 +D++LL +RP GL GG+ P + + AQ Sbjct: 244 TSKDQILLIRRPDKGLLGGMRALPSSEWLDQSKITTTASDAETEQQAIKEAFPSFPAQAQ 303 Query: 277 RQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAP 336 + + + H F+HF L++ ++S D ++ + + GL Sbjct: 304 KNLRTLAWEKQGHIEHIFTHFALELRIFSAEMTS-----DMIEGEYWPIDEIDLAGLPTV 358 Query: 337 VERLLQQL 344 ++ +++ Sbjct: 359 FKKAVKKY 366 >UniRef50_D2R2B9 HhH-GPD family protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2R2B9_9PLAN Length = 398 Score = 332 bits (852), Expect = 1e-89, Method: Composition-based stats. Identities = 125/363 (34%), Positives = 185/363 (50%), Gaps = 23/363 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F ++L W+ ++ R LPW+ + Y++W+SEVMLQQTQVATVI YF+RF+ FPTV Sbjct: 27 ARFQQRMLAWFAEHAR-DLPWRRSRDLYRIWISEVMLQQTQVATVIDYFQRFLVEFPTVV 85 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VL W GLGYY RAR+LH AA+++ G+FPET +EV +LPG+GR TAG Sbjct: 86 HLAEADEQQVLRQWEGLGYYRRARSLHAAAKKIVHDFRGQFPETLDEVMSLPGIGRYTAG 145 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 AILS+ L PIL+ N RV AR + + LW+++EQ+ P V FNQA Sbjct: 146 AILSIGLDAKLPILEANTIRVYARLAGYTREATSTAGQKFLWNIAEQILPDKKVGFFNQA 205 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE 243 MM+LG+ +CT P C CP C+A + P + T +L+ + Sbjct: 206 MMELGSALCTPRTPSCDQCPASTWCVARREQAVEQIPLLSKRMVYEYLTEVAVLVVDRQQ 265 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEES----------------LRQWLAQRQIAADNLTQL 287 VLL Q P W GL+ FP+F+ E++ + + + IA+ Q Sbjct: 266 VLLRQCQPGERWAGLWDFPRFSLEKTMLGAKSTTETDLENYLIERVESLTGIASMIHDQW 325 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTG------CMDEGNALWYNLAQPPSVGLAAPVERLL 341 + +H + + + + PM + + W + L+ +L Sbjct: 326 PSLKHGVTKYRITLRPMVASLREAGSKSSRAKQKLAEHCKWTRIDALDDEPLSVTGRKLA 385 Query: 342 QQL 344 Q L Sbjct: 386 QLL 388 >UniRef50_Q3ANF7 Mutator mutT protein n=19 Tax=Cyanobacteria RepID=Q3ANF7_SYNSC Length = 396 Score = 332 bits (851), Expect = 1e-89, Method: Composition-based stats. Identities = 131/363 (36%), Positives = 179/363 (49%), Gaps = 28/363 (7%) Query: 5 QFSAQVLDWYDKYGRKTL---PWQIDK-----------TPYKVWLSEVMLQQTQVATVIP 50 + SA +L W+ +GR+ PW PY +W++EVMLQQTQ+A +P Sbjct: 29 ELSACLLSWWQAHGRRDPVQKPWMFKPAGTWPEAVHQLDPYGIWIAEVMLQQTQLAVALP 88 Query: 51 YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEE 110 Y+ R+M FPTV LA A LDEV W GLGYY+R R LH+AAQ++ G +P + EE Sbjct: 89 YWMRWMEAFPTVETLAAASLDEVRLQWQGLGYYSRVRRLHEAAQRLV---GRPWPRSLEE 145 Query: 111 VAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQ 170 ALPG+GR+TAG+ILS + PILDGNVKRVLAR A + P + + W SE Sbjct: 146 WMALPGIGRTTAGSILSSAFNLRLPILDGNVKRVLARLTAHARPPARD--DALFWCWSEA 203 Query: 171 VTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPE 230 + V NQA+MDLGA +CT +P C CP + C A A+ +P + LP Sbjct: 204 LLDPVRARDTNQALMDLGATLCTPRQPACHRCPWHSQCAAYASGDPCRWPVTNAPKPLPF 263 Query: 231 RT-GYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAAD 282 + G ++L EVL+ QR GL GG++ FP E+ R+ + IA Sbjct: 264 QVIGVGVVLNAAGEVLIDQRLEEGLLGGMWEFPGGKQEQGETIETCIARELKEELGIAVT 323 Query: 283 NLTQLTAFRHTFSHFHLDIVPMWLPV-SSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 +L H +SH L V S + W A R++ Sbjct: 324 VGAELITVDHAYSHKKLRFVVHLCDWMSGEPQPLASQQVRWVRPDDLVDYAFPAANARII 383 Query: 342 QQL 344 + L Sbjct: 384 EAL 386 >UniRef50_B5EQC0 A/G-specific adenine glycosylase n=3 Tax=Acidithiobacillus RepID=B5EQC0_ACIF5 Length = 369 Score = 331 bits (849), Expect = 2e-89, Method: Composition-based stats. Identities = 141/366 (38%), Positives = 186/366 (50%), Gaps = 25/366 (6%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +L WY ++GR LPW+ + PY++WL+E+MLQQTQV + PY+ RF+ P L Sbjct: 4 IAEPLLAWYTRHGRHDLPWRQTRDPYRLWLAEIMLQQTQVESAKPYYVRFLQALPNWQAL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A AP D VL LW+GLGYYARARN +AAQ V T G FP+T E+ LPGVGRSTA A+ Sbjct: 64 AAAPQDRVLALWSGLGYYARARNAQRAAQTVMTGFAGHFPDTLEKAITLPGVGRSTAAAV 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 L+ + G ILD N +RVL R +A+ P + LW+L+ +TP +NQA+ Sbjct: 124 LASAFGHRQAILDANARRVLIRSHAIDADPKAAATQQWLWTLASALTP-EDAHGYNQAIQ 182 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH-EDEV 244 DLGAMICT +P+C CPL + C A A P + T P+R +FLL E + Sbjct: 183 DLGAMICTPRQPRCPDCPLASRCKAHAQGRSHALPVTVHRPTKPQRRAFFLLAADMEGRI 242 Query: 245 LLAQRPPSGLWGGLYCFPQFADEE--------SLRQWLAQRQIAADNLTQL--------- 287 LL +RP SG+WGGL+C PQ + L +L + Sbjct: 243 LLEKRPDSGIWGGLWCLPQATPDSGELSILCNEPPSPLEDTPAIRQSLQEAWSRRLRLDL 302 Query: 288 ------TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 RH F+HF L V WY A GL PV R+L Sbjct: 303 RLTALGEEQRHVFTHFQLRFRCAHARVLGGAVADSGATLRWYRPADALKQGLPTPVRRIL 362 Query: 342 QQLRTG 347 + L G Sbjct: 363 RNLGGG 368 >UniRef50_Q6MF36 Putative A/G-specific adenine glycosylase, mutY n=2 Tax=Parachlamydiaceae RepID=Q6MF36_PARUW Length = 352 Score = 329 bits (845), Expect = 7e-89, Method: Composition-based stats. Identities = 106/351 (30%), Positives = 163/351 (46%), Gaps = 12/351 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M +S + ++ W+ R LPW+ + TPY VW+SEVMLQQTQVA VIPYF R+M FP Sbjct: 1 MSSSFPNEKLKKWFLSSKR-DLPWRQNPTPYAVWISEVMLQQTQVAVVIPYFNRWMQAFP 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 ++ LA A L+EV+ W GLGYY+RAR+LH+ AQ + G+ P +E+ + G+G Sbjct: 60 SIEALAIASLEEVIKAWEGLGYYSRARHLHQGAQYLVEHCQGELPADEKELKKIKGLGPY 119 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T GAILS + + +DGNV RVLAR + + K KL +E + P Sbjct: 120 TIGAILSFAFHQKKAAVDGNVLRVLARYFQIEEDIAKTSTIKKLRDFAESILPDEESWIS 179 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 ++A+++LGA IC K C CPL++ C + + P K +++ Sbjct: 180 SEALIELGATIC-SRKAICQECPLKSNCQSYLHGLSDKLPIKTKAIKTQYLYRAVAIVKC 238 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEES-------LRQWLAQRQIAADNLTQLTAFRHT 293 D L+ + + LY FP F + + L L++ H Sbjct: 239 GDYFLVKKGKKGEIMSDLYEFPFFEVSQEGITFKELQENIKLHWNLTVHLLQSLSSIEHG 298 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 F+ + + P + + W + ++ R+ Q L Sbjct: 299 FTRYQARLDPCVFNCLEMSPV---KDLEWLDTKSLKQKAFSSGHRRIFQSL 346 >UniRef50_Q3KMR2 A/G-specific adenine DNA glycosylase n=9 Tax=Chlamydia RepID=Q3KMR2_CHLTA Length = 368 Score = 329 bits (844), Expect = 9e-89, Method: Composition-based stats. Identities = 97/341 (28%), Positives = 171/341 (50%), Gaps = 17/341 (4%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 + W+ + R + PW+ TPY+VW+SEVMLQQT+ V+PYF ++M RFPT+ DLA Sbjct: 16 EALRSWFLESKR-SFPWRDSPTPYRVWVSEVMLQQTRAEVVVPYFLKWMERFPTLQDLAQ 74 Query: 68 APLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 A +V+ LW GLGYY+RARNL A+ + + GG+ P ++++ G+G TA AIL+ Sbjct: 75 ARESDVVQLWEGLGYYSRARNLLAGARVITEIFGGEIPNDLALLSSIKGIGSYTANAILA 134 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 + + P +DGNV RV++R +A+ + ++ L E + P + ++ ++L Sbjct: 135 FAFKQKNPAVDGNVLRVMSRLFAIEESIDRMNTRREITGLCESLLPDQDPQVIAESFIEL 194 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLA 247 GA IC +P C CPL++ C A + YP + + + +++ ++D+VL+ Sbjct: 195 GARICK-KQPLCEQCPLRSFCTAYRQGTMEQYPVRNTRAAISRLFRAVVIVLYKDQVLMT 253 Query: 248 QRPPSGLWGGLYCFPQFADEESLRQWLAQ--------RQIAADNLTQLTAFRHTFSHFHL 299 +R + GLY FP + + + + ++ L + + F+ + + Sbjct: 254 KREEKEIMAGLYEFPYYQLPKEDCCDIEKITHLVQKDYGETLYFVSSLPSQKQAFTRYRV 313 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 + P Y LA+ S+ ++ R+ Sbjct: 314 TLFPHVFYTKYSLPNS-------YTLAELSSLPSSSGHRRI 347 >UniRef50_C2MB00 A/G-specific adenine glycosylase n=1 Tax=Porphyromonas uenonis 60-3 RepID=C2MB00_9PORP Length = 363 Score = 327 bits (839), Expect = 4e-88, Method: Composition-based stats. Identities = 121/350 (34%), Positives = 180/350 (51%), Gaps = 15/350 (4%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 ++ WY + RK LPW+ PY++WLSEV+LQQT++ Y+ RF+ +PTV+DLA A Sbjct: 7 KLHSWYQENHRK-LPWRDIHDPYRIWLSEVILQQTRIDQGTSYYLRFVEHYPTVSDLAIA 65 Query: 69 PLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSL 128 PLDEVL LW GLGYY+RARNLH+AAQ + GG FP ++ V ALPG+G TAGA+LS Sbjct: 66 PLDEVLKLWEGLGYYSRARNLHRAAQLIVHELGGTFPADYQSVRALPGIGDYTAGAVLSF 125 Query: 129 SLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA-VGVERFNQAMMDL 187 + + +P +DGNV RVL+R YA + + +L+ Q+ NQAM++L Sbjct: 126 AYDQPYPAVDGNVLRVLSRLYASEEPIDTTQGKKYYTALARQLVEKAPHPGLHNQAMIEL 185 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED----E 243 GA+ICT C+ CP+++ C +A + A P K K ++ R +L + + Sbjct: 186 GALICTPQLCDCTRCPVRSECPSADSPERASLPRKALKLSVQPRHLGYLFVLKRGLDQWQ 245 Query: 244 VLLAQRPPSGLWGGLYCFPQFADEE--------SLRQWLAQRQIAADNLTQLTAFRHTFS 295 +L QRP +W LY D+ +A L ++H + Sbjct: 246 TILYQRPTGDIWAKLYQPTLLYDQPVAPQEELLLQPLPEPPAALAHCPLRPFRRYKHRLT 305 Query: 296 HFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQL 344 H L I + + + +W L + L +++ LQQL Sbjct: 306 HRQLYIDCYYTILPDSSTPRLPDGGVWVPLHDESEWLALPITLKKALQQL 355 >UniRef50_B1Y3P3 A/G-specific adenine glycosylase n=8 Tax=Burkholderiales RepID=B1Y3P3_LEPCP Length = 384 Score = 327 bits (838), Expect = 4e-88, Method: Composition-based stats. Identities = 143/373 (38%), Positives = 192/373 (51%), Gaps = 33/373 (8%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + +A ++ W ++GR LPWQ + PY+VWLSEVMLQQTQVATV+ Y+ERF+ RFP Sbjct: 8 ELPSIAADLVRWQRQHGRHGLPWQASRDPYRVWLSEVMLQQTQVATVLGYYERFLQRFPD 67 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 + LA APLD+VL LW+GLGYY+RARNLH+ AQ V HG P E++ LPG+G ST Sbjct: 68 IAALAAAPLDDVLALWSGLGYYSRARNLHRCAQVVVAEHGAALPRRAEQLVELPGIGPST 127 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER-- 179 A AI S G+ I DGNVKRVLAR A G + LW+ ++++ P + Sbjct: 128 AAAIASFCHGERVSIFDGNVKRVLARLLAFEGDLAQAGAAKVLWAQADRLVPTDAADMPA 187 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 + Q +MDLGA +CT P+C CPLQ C A A S YP K K R + L L Sbjct: 188 YTQGLMDLGATVCTPRDPQCPACPLQRACRAHAAGSVLTYPIKSRKLKRSRRENWCLWLS 247 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAAD------NLTQLTAFRHT 293 D V L QRP +G+W GL+ +P F D +++ + AQ A +L F H Sbjct: 248 LGDAVWLQQRPDTGIWAGLWAWPLFDDTDAVERLTAQLLAALGGHAERLQANRLPGFVHV 307 Query: 294 FSHFHLDIVPMWLPVS-------------------------SFTGCMDEGNALWYNLAQP 328 +H + P L + G W+ A Sbjct: 308 LTHLDWTLHPCRLHLPAPVVLQAREPGGGDGERGDGPLERLDAVLNQQLGAGRWHAAADL 367 Query: 329 PSVGLAAPVERLL 341 +GL AP+ ++L Sbjct: 368 HRLGLPAPLRKVL 380 >UniRef50_D1PLU8 A/G-specific adenine glycosylase n=2 Tax=Ruminococcaceae RepID=D1PLU8_9FIRM Length = 342 Score = 326 bits (836), Expect = 8e-88, Method: Composition-based stats. Identities = 100/342 (29%), Positives = 159/342 (46%), Gaps = 8/342 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 + +L W+ R LP++ + + Y +W+SE+MLQQT+VA IPY+ RF+A P L Sbjct: 5 IAPPLLQWFQANKRL-LPFRQEPSAYHIWVSEIMLQQTRVAAAIPYYNRFIAALPDPAAL 63 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A+ D + LW GLGYY R N+ KAA+ V +GG P ++ + +LPG+G TAGAI Sbjct: 64 ASCEPDALRKLWQGLGYYNRVNNMQKAARIVCEQYGGDLPSDYDALRSLPGIGDYTAGAI 123 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S++ G P +DGNV RV AR Y + + + P +N+A+M Sbjct: 124 ASIAFGIPAPAVDGNVLRVFARLYNDDADIMQPATKRLFTGRVLEQMPKETPGPYNEALM 183 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 +LGA++C P+C CPL C+ A A P K + L++ +L Sbjct: 184 ELGALVCVPGMPRCEACPLAALCLGYAAGRQADLPVKPAPKVKTPVAVTVALVESPAGLL 243 Query: 246 LAQRPPSGLWGGLYC---FPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 L +RP GL GL+ + + L + L++ + L A RH F+H ++ Sbjct: 244 LQRRPARGLLAGLWQPAAWETVMSQAELAEELSKIGVQVTWGDALPAARHVFTHKIWNLG 303 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPS-VGLAAPVERLLQQ 343 + C +W + + +++ Sbjct: 304 GWHA---AAPACALPEGWVWAAPEELEQVYAVPNAYAAYMKK 342 >UniRef50_B4D777 HhH-GPD family protein n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4D777_9BACT Length = 341 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 118/347 (34%), Positives = 174/347 (50%), Gaps = 22/347 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 ++ W+ ++GR LPW+ PY + +SE MLQQTQV TV Y+ R++ RFP Sbjct: 16 NPKSLRQKLARWFRQHGR-DLPWRRTHDPYAIMVSEFMLQQTQVVTVRDYYARWLERFPD 74 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA A +VLH+W GLGYYARARNLH+AA+QVA LH G+ P ++ALPGVGR T Sbjct: 75 FNALAAASEADVLHVWQGLGYYARARNLHRAAKQVADLHSGQLPNDLVAISALPGVGRYT 134 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGA+ + + + PI+D N+ RV+AR + K LW +E++ PA N Sbjct: 135 AGAVATFAFDQATPIIDANIARVIARLLDLQEPIDTKRGSEILWLTAEELLPAKSGRVHN 194 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 A+M+LGA++CT P+C +CP++ C S P KKP+ + ++ Sbjct: 195 SALMELGALLCTPRAPQCPICPIREHC---RTKSPESLPRKKPRPKTIALAENCAWIVND 251 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +LL Q+ S W GL+ P + + L AF + F++ + + Sbjct: 252 GNLLLEQQTGS-RWRGLWKLPVIGESHPRNKL-------------LLAFDYPFTNHRVTL 297 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 +S N W L S+ LAAP R +++L Sbjct: 298 SVY----ASRVPMESRPNQQWVPLTSIDSIALAAPHRRAIKRLAASH 340 >UniRef50_Q7VRG8 A/G-specific adenine glycosylase n=2 Tax=Candidatus Blochmannia RepID=Q7VRG8_BLOFL Length = 355 Score = 324 bits (830), Expect = 4e-87, Method: Composition-based stats. Identities = 145/349 (41%), Positives = 214/349 (61%), Gaps = 7/349 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++L W LPWQI+KT YK+W+SEVMLQQTQVATVIPY+++F+ +FPT++ L Sbjct: 7 FSHKILSWSKNAHILKLPWQINKTIYKIWISEVMLQQTQVATVIPYYQKFIKKFPTISKL 66 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A ++E+L++W+GLGYY RA NLHK A + H G FP F + + PG+GRSTAGAI Sbjct: 67 AEANINEILYIWSGLGYYKRALNLHKTATIIIHHHNGVFPNNFNILLSFPGIGRSTAGAI 126 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSG-WPGKKEVENKLWSLSEQVTPAV-GVERFNQA 183 LSL+L K FPILDGN+KR+L R Y+++ ++ NKLWSL + + P FNQA Sbjct: 127 LSLALNKRFPILDGNIKRILIRYYSLNNQQTSPTKINNKLWSLIDSLLPLDSNYAIFNQA 186 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGK----KPKQTLPERTGYFLLLQ 239 MMDLG +ICT S P+C++CPL + C + N+ L K + LL++ Sbjct: 187 MMDLGRLICTHSNPQCNICPLNSHCQSFLTNNINLLTQKYATIHNPKQKKIIYWLILLVK 246 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 H + + L QR +W L+CFP+F + +L WL++ + + + +H S+ L Sbjct: 247 HRNIIYLTQRLQETIWNKLFCFPEFYNRTTLNTWLSKYNLHNNLKINMPIIKHNISNIAL 306 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 +I P + +++ +E N +WYNL +P +GL PV +L+ ++ Sbjct: 307 EIQPTLININNTIFNTNEKN-IWYNLNKPAIIGLPKPVNTILKIIQNKT 354 >UniRef50_Q254A0 Adenine/guanine glycosylase n=5 Tax=Chlamydophila RepID=Q254A0_CHLFF Length = 369 Score = 323 bits (829), Expect = 6e-87, Method: Composition-based stats. Identities = 100/353 (28%), Positives = 173/353 (49%), Gaps = 17/353 (4%) Query: 2 QASQF-SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 +A +F ++ W+ R + PW+ D +PY VW+SEVMLQQT+ V+ YF +M RFP Sbjct: 9 RAKKFPVEKLKQWFIDNKR-SFPWRDDPSPYNVWVSEVMLQQTRAEVVVKYFLEWMKRFP 67 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A +EV+ W GLGYY R RNL A+ V GGK P+ ++ + G+G Sbjct: 68 TIESLATANEEEVIKAWEGLGYYTRVRNLLLGARMVMKDFGGKLPDDPLDLMQIKGLGPY 127 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T AIL+ + + +DGNV RV++R + + + + ++ ++ + P + Sbjct: 128 TVHAILAFAFKRRTAAVDGNVLRVISRVFLIDASIDLESTKTWIFRITLSILPVKDPQVV 187 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 +A+++LGA +C PKC +CPL + C A P + ++ + + ++ + Sbjct: 188 AEALIELGACVCK-RSPKCEICPLNSVCGAFKEKKQKSLPIRHARKKVVSLFRWVAIILY 246 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFA-----DEESLRQWLAQRQIAADNL----TQLTAFR 291 ED ++L QR P + GLY FP D + + + + + +L R Sbjct: 247 EDFIVLEQRKPEEMMAGLYEFPYIEVGSLDDLADINALVQEMECYVGSSLIFCGELEEQR 306 Query: 292 HTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 H+F+H+ + ++P F + L Y L S+ ++ ++ L Sbjct: 307 HSFTHYKVRLIP-----KIFYARSKPKSELLYPLNTIDSLPFSSGHRKIKDWL 354 >UniRef50_Q2QW21 Os12g0211400 protein n=6 Tax=Poaceae RepID=Q2QW21_ORYSJ Length = 474 Score = 323 bits (828), Expect = 7e-87, Method: Composition-based stats. Identities = 122/401 (30%), Positives = 184/401 (45%), Gaps = 61/401 (15%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTP--------------YKVWLSEVMLQQTQVATV 48 A+ A++L WYD R+ LPW+ P Y VW+SEVMLQQT+V V Sbjct: 54 AAAVRAELLRWYDAN-RRDLPWRRAAEPPAGSGSGRGEEQRAYAVWVSEVMLQQTRVPVV 112 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 + Y+ R+MAR+PTV LA A +EV +W GLGYY RAR L + A+Q+ G+FP T Sbjct: 113 VDYYSRWMARWPTVDSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQIVEK--GEFPCTA 170 Query: 109 EEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS 168 + + G+G TAGAI S++ + P++DGNV RV++R YA+ P + + W L+ Sbjct: 171 STLREVRGIGDYTAGAIASIAFNEVVPVVDGNVVRVISRFYAIPDNPKESSTVKRFWQLT 230 Query: 169 EQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIA------AANNSWALYPGK 222 ++ FNQAMM+LGA +C+++KP CS CP+ + C A A+ +P Sbjct: 231 GELVDPSRPGDFNQAMMELGATLCSKTKPGCSQCPVSSHCQALALSSQNASVKVTDFPRV 290 Query: 223 KPKQTLPERTGYFLLLQ--------------HEDEVLLAQRPPSGLWGGLYCFPQFADEE 268 PK ++Q ++ LL +RP GL GL+ FP E Sbjct: 291 VPKAKPRSDFAAVCVVQISQGFGEGIAEAEGKDNLFLLIKRPEEGLLAGLWEFPSVLVNE 350 Query: 269 SLRQWLAQRQIAADNLTQL-----------------TAFRHTFSHFHLDIVPMWLPVSSF 311 L +R+ L QL H FSH L + + ++ Sbjct: 351 GKTDTLNRRKEMDKYLKQLLSIDVTRRSSVILREDVGQHVHIFSHIRLTMFVELMILNLK 410 Query: 312 TGCMD-------EGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + N S+GL + + ++ ++ Sbjct: 411 DDGEELCKEGQDSTKLRLINENAVQSMGLTSGIRKVYNMVK 451 >UniRef50_Q3SPD3 A/G-specific DNA-adenine glycosylase n=2 Tax=Nitrobacter RepID=Q3SPD3_NITWN Length = 415 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 124/371 (33%), Positives = 182/371 (49%), Gaps = 44/371 (11%) Query: 10 VLDWYDKYGRKTLPWQ----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +L+WYD+ R+ LPW+ PY+VWLSE+MLQQT V TV PYFE+F+AR+P V + Sbjct: 25 LLEWYDRN-RRLLPWRALPGEPVDPYRVWLSEIMLQQTTVKTVGPYFEKFLARWPDVAAM 83 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A LD++L +W GLGYY+RARNLH A +V HGG+FP+T E++ ALPG+G TA AI Sbjct: 84 ARASLDDILRMWAGLGYYSRARNLHACAVKVLRDHGGRFPDTEEDLRALPGIGPYTAAAI 143 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT------------- 172 +++ + +DGN++RV++R +AV K + + +L+ + Sbjct: 144 AAIAFNRRTMPVDGNIERVVSRLFAVDEPLPKAKPR--IHTLAATLLGPSRSGRDGKSRA 201 Query: 173 -----------------PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 QA+MDLGA ICT KP C LCPL + C A A Sbjct: 202 GDVKTRAGRDGKSRASDVKTRAGDIAQALMDLGAAICTPKKPSCVLCPLSDDCAARARGD 261 Query: 216 WALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLA 275 +P K PK+ R G +++ EVL+ RP GL GG+ P + A Sbjct: 262 QETFPRKTPKKAGELRRGAAFVVRRGSEVLVRTRPAKGLLGGMTEVPTSTWLAAQDDTAA 321 Query: 276 QRQIA-----ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPS 330 +Q + A H F+HF L++ + W +A+ Sbjct: 322 LKQAPCLESAPRWRRKAGAVTHVFTHFPLELAVYT--AAVARRTAAPEGMRWVPIARLND 379 Query: 331 VGLAAPVERLL 341 L + +++ Sbjct: 380 EALPNLMRKVI 390 >UniRef50_Q2G9R8 A/G-specific DNA-adenine glycosylase n=7 Tax=Alphaproteobacteria RepID=Q2G9R8_NOVAD Length = 359 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 124/350 (35%), Positives = 180/350 (51%), Gaps = 15/350 (4%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQI-----DKTPYKVWLSEVMLQQTQVATVIPYFERF 55 + +LDWYD + RK LPW+ + PY+VWLSEVMLQQT VA V PYFE+F Sbjct: 11 FDPQAIAPALLDWYDAHARK-LPWRRLPGEARQDPYRVWLSEVMLQQTTVAAVGPYFEKF 69 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALP 115 +PTV DLA A +V+ W GLGYYARARNL A+ VA GG FP++ + + ALP Sbjct: 70 TRLWPTVGDLAAADDGDVMAAWAGLGYYARARNLLACARAVAA-MGGTFPDSEDGLRALP 128 Query: 116 GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV 175 G+G TA A+ +++ G+ ++D NV+RV+AR +A+ + + + QVTP Sbjct: 129 GLGEYTAAAVAAIAFGRRAVVVDANVERVIARLFAIDEPL--PAGKAAIRLAAGQVTPEE 186 Query: 176 GVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 F QAMMDLGA +CT P+C LCPL+ C A A + P K ++ P R G Sbjct: 187 RAGDFAQAMMDLGATVCTARSPRCMLCPLREHCRALAEGAPERLPVKAARKAKPVRQGRA 246 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFS 295 ++ E VLL +RP G+ GG+ P ++ RH F+ Sbjct: 247 YWIEREGRVLLVRRPGRGMLGGMRALP----DDGWSARGDGADAIGGEWRGGGVVRHGFT 302 Query: 296 HFHLDIVPMWLPVSSFT--GCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 HF L++ M + +++ W+ + + + GL + + Sbjct: 303 HFDLELQLMLCVQAEAASLPGLNDIEGEWWPVDEIEAAGLPTVFAKAARL 352 >UniRef50_Q2N933 Putative uncharacterized protein n=1 Tax=Erythrobacter litoralis HTCC2594 RepID=Q2N933_ERYLH Length = 347 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 124/342 (36%), Positives = 176/342 (51%), Gaps = 17/342 (4%) Query: 10 VLDWYDKYGRKTLPWQ-----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 +LDWYD++ R LPW+ PY+VWLSE+MLQQT VA V PYF RF +PTV Sbjct: 16 LLDWYDRHAR-DLPWRAQPGEPAADPYRVWLSEIMLQQTTVAAVKPYFARFTEVWPTVEA 74 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A ++V+ W GLGYY+RARNL AA+ V L G FPET E+ LPGVG TA A Sbjct: 75 LAEASEEDVMAAWAGLGYYSRARNLLMAAKAVGDL--GGFPETEAELRRLPGVGEYTAAA 132 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I +++ G+ ++D NV+RV++R +A+ + + + ++ +TP F QAM Sbjct: 133 IAAIAFGRRAVVVDANVERVVSRLFAIEEPL--PKARRAIRAHADAITPDRRAGDFAQAM 190 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+ +CT P+C LCPL C A +P K P++ P R G ++ E V Sbjct: 191 MDLGSQVCTTRAPQCLLCPLARFCEA-RTQEPERFPVKPPRKAKPLRKGRAFWIEREGTV 249 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 L +R SG+ GG+ P D S R + A + H F+HF L++ + Sbjct: 250 WLVRREGSGMLGGMRSLP--DDGWSARSDGSGELPLAGEWENAGSVFHVFTHFALELTVL 307 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 + G W+ L GL + ++ R Sbjct: 308 RHGGAEHPT----GEGEWWPLDDLGKAGLPTLFAKAARRARQ 345 >UniRef50_D0J8Q9 A/G-specific adenine glycosylase n=2 Tax=Blattabacterium RepID=D0J8Q9_BLASP Length = 351 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 102/346 (29%), Positives = 158/346 (45%), Gaps = 9/346 (2%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 FS ++++WY RK LPW+ K PY V +SE MLQQT+V+ + Y+ F+ +FP + L Sbjct: 3 FSKKIINWYKNNYRK-LPWRETKNPYYVLVSEFMLQQTRVSKTLKYYLDFVKKFPNLEKL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVA-TLHGGKFPETFEEVAALPGVGRSTAGA 124 A A EVL W GLGYY RAR LH A+Q+ G FP+T++E+ G+G T A Sbjct: 62 AYAEEKEVLKEWEGLGYYTRARYLHLFAKQLVKEKPSGFFPKTYKELIKYKGIGPYTGAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S+ + P LDGN RV +R + +N ++ FNQA+ Sbjct: 122 VASICFDEVIPALDGNAYRVFSRYLGIYHNITSFRAKNIFRFFILKIMDHNDPGIFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK--PKQTLPERTGYFLLLQHED 242 MDLG+ +CT K KC CP+++ C + N + P K K L Y + ++ Sbjct: 182 MDLGSTLCTPRKAKCFSCPVKSSCFSLKNGTVYDLPVKNIIKKSVLHRFFYYVFIWDKKN 241 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI----AADNLTQLTAFRHTFSHFH 298 + + +R +W GLY FP E++L + +I + + + H +H Sbjct: 242 HLCIQKRNNQDIWKGLYDFPLIESEKNLSISEIKDKIWTRFRIISDSIIYQVIHKLTHQT 301 Query: 299 LDIVPMWLPVSSFTGCMDE-GNALWYNLAQPPSVGLAAPVERLLQQ 343 L I + N + + P+ L+ Sbjct: 302 LSIQFLNCRTIQNVQKEKYFDNFFFIPYEKIVEYPFPRPITLFLKH 347 >UniRef50_C2KZ90 Putative uncharacterized protein n=1 Tax=Oribacterium sinus F0268 RepID=C2KZ90_9FIRM Length = 391 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 107/305 (35%), Positives = 167/305 (54%), Gaps = 5/305 (1%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 F+ +L WY++ R+ LPW+ D PY WLSE+MLQQT+V V Y+ RF++ P + DL Sbjct: 19 FAKALLSWYNEN-RRILPWREDPGPYHTWLSEIMLQQTRVEAVKGYYSRFLSALPEIADL 77 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 ANA ++VL LW GLGYY+RARNL KAA+ + T + G+ P+TF+E+ LPG+G TA AI Sbjct: 78 ANAEEEKVLKLWEGLGYYSRARNLQKAAKTIMTEYAGEMPKTFQELKKLPGIGEYTAAAI 137 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 S+ + P LDGN+ R+ AR + + E + +S ++ + FNQA+M Sbjct: 138 ASIVYKEEIPALDGNLLRIFARLTSYPKVVLEPEGKKLAFSYFQEKMQGIDPGDFNQALM 197 Query: 186 DLGAMICTRSK-PKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 DLG+ +C C CPL+ C +A + YP K+ L++H +E+ Sbjct: 198 DLGSGVCLPKGEIGCKTCPLKAFCSSAKAGNPMDYPKLPEKKKRKVEQYSIFLIRHREEL 257 Query: 245 LLAQRPPSGLWGGLYCFPQFAD---EESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +L +R GL GLY F + E+ + + + + + L +H F+H ++ Sbjct: 258 MLKKRENKGLLAGLYEFYKLEGHCSEKEALEAVENLGLRSLRIKALGEAKHIFTHKEWEM 317 Query: 302 VPMWL 306 + + Sbjct: 318 IGYEV 322 >UniRef50_B3DVQ4 A/G-specific DNA glycosylase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DVQ4_METI4 Length = 355 Score = 321 bits (824), Expect = 2e-86, Method: Composition-based stats. Identities = 106/344 (30%), Positives = 160/344 (46%), Gaps = 19/344 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +F + WY + G PW+ TPY V +SE MLQQTQ TV+PYF +M +FP Sbjct: 19 KKKFWKFLSRWYGENG-FKYPWRKSPTPYSVVVSEFMLQQTQAETVVPYFLAWMEKFPDW 77 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A EVL W GLGYY+RARNLH+ A + G+ P EE+ PG+G TA Sbjct: 78 ESLAKAQEKEVLRAWEGLGYYSRARNLHRIAVMLYHERKGELPSDPEELVKFPGIGPYTA 137 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFN 181 A+ SL+ G+ P LDGNV RV+AR ++ KKE ++ + + + FN Sbjct: 138 NAVASLAFGRKIPALDGNVIRVMARLMNINQPIHKKETIRTIFQVVDSLMQGEAEASLFN 197 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 A+MD G +C P+CS C L+ C A L P + + ++ ++ + + Sbjct: 198 SALMDFGRAVCKPKYPRCSACVLKEMCKAKQ---PELLPLRTKIEIEEKKEKIAIIRE-D 253 Query: 242 DEVLLAQ-RPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 D L Q + GL+ FP F + + + L + RH+F+ + + Sbjct: 254 DRFWLQQANSNQNRYRGLWLFPYFDPD------------VMEEMAPLFSLRHSFTRYKIL 301 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + M W + S+ L +P ++ QL Sbjct: 302 LEVCEASWNKERLKMMPMEGQWVKKKEIFSLPLPSPHRKIWLQL 345 >UniRef50_B0SFS5 A/G-specific DNA glycosylase n=2 Tax=Leptospira biflexa serovar Patoc RepID=B0SFS5_LEPBA Length = 353 Score = 320 bits (821), Expect = 5e-86, Method: Composition-based stats. Identities = 110/349 (31%), Positives = 173/349 (49%), Gaps = 12/349 (3%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 ++ DWY + R LP++ K Y +W+SEVMLQQT+VA ++P FE F+ RFP LA Sbjct: 4 QKKLRDWYLLHKR-DLPFRKKKQAYPIWISEVMLQQTRVAAMLPLFENFVNRFPNPESLA 62 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A +EVL W GLGYY+RARN+ KAA Q+ + G FP+ + V LPG+G TA AIL Sbjct: 63 KATEEEVLSFWKGLGYYSRARNIRKAAIQIVQQYNGSFPKDLDSVLKLPGIGNYTARAIL 122 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S+S +LDGNVKRVL+R Y + + E +L ++ NQA+M+ Sbjct: 123 SISYDLPLAVLDGNVKRVLSRYYGYTKNILGPQAEKELQLKADGFLNLDFPGDHNQAVME 182 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA IC PKC +CPL +GC A + P ++ KQ TG ++Q ++ +LL Sbjct: 183 LGATICLPESPKCLVCPLMDGCYARIHGKTKEIPLREKKQKQVLLTGEIWVIQQKNLILL 242 Query: 247 AQRPPSGLWGGLYCFPQF------ADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + + G++ P + + + Q F+HT ++ L+ Sbjct: 243 IKEKKNRFLKGMFHLPSGFLGEIPNSDYAPSPFFHSVQSEYKETPSKGKFKHTITYHKLE 302 Query: 301 IVPMWLPVSSFTGCM-----DEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + + ++ + W ++ S ++ +++ + L Sbjct: 303 YSVHLVNLKEPNQIQSLLAGNDLESKWVEVSDLESEFPSSLAKKVKKIL 351 >UniRef50_UPI000034F1ED adenine-DNA glycosylase-related / MYH-related n=1 Tax=Arabidopsis thaliana RepID=UPI000034F1ED Length = 630 Score = 319 bits (817), Expect = 1e-85, Method: Composition-based stats. Identities = 119/414 (28%), Positives = 175/414 (42%), Gaps = 72/414 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 + + +LDWYD R LPW+ ++ Y+VW+SE+MLQQT+V TV+ Y++R Sbjct: 127 ETQKIRMGLLDWYDVNKR-DLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKR 185 Query: 55 FMARFPTVTDLANAPLD-------------------EVLHLWTGLGYYARARNLHKAAQQ 95 +M ++PT+ DL A L+ EV +W GLGYY RAR L + A+ Sbjct: 186 WMQKWPTIYDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKM 245 Query: 96 VATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 V G FP + + G+G+ TAGAI S++ + P++DGNV RVLAR A+S P Sbjct: 246 VVAGTEG-FPNQASSLMKVKGIGQYTAGAIASIAFNEAVPVVDGNVIRVLARLKAISANP 304 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSK------PKCSLCPLQNGCI 209 + W L+ Q+ FNQ++M+LGA +CT SK P S C + Sbjct: 305 KDRLTARNFWKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSE 364 Query: 210 AAANNSWALYPGKKPKQTLPERTGYFLLL--------QHEDEVLLAQRPPSGLWGGLYCF 261 S YP K K +L Q +L +RP GL GL+ F Sbjct: 365 ENRTISVTDYPTKVIKAKPRHDFCCVCVLEIHNLERNQSGGRFVLVKRPEQGLLAGLWEF 424 Query: 262 PQFADEESLRQWLAQRQI------------------AADNLTQLTAFRHTFSHFHLDIVP 303 P E + I + +L F H F+H + Sbjct: 425 PSVILNEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYV 484 Query: 304 MWLPVSSFTGCMDEGNAL------W--YNLAQPPSVGLAAPVERL----LQQLR 345 L V G D W + ++GL + V ++ LQ ++ Sbjct: 485 ELLVVQLTGGTEDLFKGQAKDTLTWKCVSSDVLSTLGLTSAVRKVPPFRLQHIK 538 >UniRef50_Q1GD89 A/G-specific DNA-adenine glycosylase n=11 Tax=Rhodobacterales RepID=Q1GD89_SILST Length = 353 Score = 318 bits (816), Expect = 2e-85, Method: Composition-based stats. Identities = 124/353 (35%), Positives = 175/353 (49%), Gaps = 18/353 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQI---------DKTPYKVWLSEVMLQQTQVATVIPY 51 + A Q S +L WYD + R+ +PW++ PY++WLSEVMLQQT VA V Y Sbjct: 6 VSAEQLSQDLLVWYDTHARE-MPWRVGPAARASGVRPDPYRIWLSEVMLQQTTVAAVKDY 64 Query: 52 FERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV 111 FERF R+P V DLA A +V+ W GLGYYARARNL K A+ VA G FP+ +E + Sbjct: 65 FERFTRRWPRVGDLAAAEDGDVMAEWAGLGYYARARNLLKCARVVAEEFEGVFPDAYEGL 124 Query: 112 AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV 171 ALPG+G TA AI +++ + +LDGNV+RV+AR + V+ L + + + Sbjct: 125 IALPGIGPYTAAAISAIAFDRPETVLDGNVERVMARLHDEHEPL--PAVKPVLKAHAAHL 182 Query: 172 TPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPER 231 TP+ + QA+MDLGA ICT P C +CP ++ C A + P K PK+ P R Sbjct: 183 TPSARPGDYAQAVMDLGATICTPKSPACGICPWRDPCRARVKGTAPELPKKTPKKPKPTR 242 Query: 232 T-GYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAF 290 +L E + LL +RP GL GG+ +P ++ + AD Sbjct: 243 YGFVYLARSAEGDWLLERRPDKGLLGGMLGWPGSEWNDAPTETPP---FDADWQDLGAEV 299 Query: 291 RHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 RHTF+HFHL + + + L + + Sbjct: 300 RHTFTHFHLILQVRSAELPADFEPRAGQEL--VRRHDFRPSSLPTVMRKAFDL 350 >UniRef50_C6XF00 A/G-specific adenine glycosylase n=1 Tax=Candidatus Liberibacter asiaticus str. psy62 RepID=C6XF00_LIBAP Length = 356 Score = 318 bits (815), Expect = 2e-85, Method: Composition-based stats. Identities = 104/353 (29%), Positives = 171/353 (48%), Gaps = 14/353 (3%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDK--------TPYKVWLSEVMLQQTQVATVIPYFERFMA 57 +++LDWYD R LPW+ +PYKVW+SE+MLQQT V TV PYF++FM Sbjct: 8 IQSKILDWYDTNHR-VLPWRTSPKTEKSSLPSPYKVWISEIMLQQTTVKTVEPYFKKFMQ 66 Query: 58 RFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGV 117 ++PT+ L++A +E+L W GLGYY RARNL K A + + G FP E + LPG+ Sbjct: 67 KWPTIFCLSSAKDEEILSAWAGLGYYTRARNLKKCADIIVKKYEGNFPHKVEILKKLPGI 126 Query: 118 GRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGV 177 G TA AI++++ ++D N++R+++R + + + + + ++T Sbjct: 127 GDYTASAIVAIAFNHFAVVVDTNIERIISRYFDIIKPA--PLYHKTIKNYARKITSTSRP 184 Query: 178 ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL 237 F QAMMDLGA+ICT +KP C LCP+Q C+ + L K+ P RTG + Sbjct: 185 GDFVQAMMDLGALICTSNKPLCPLCPIQKNCLTFSEGKSHLLGINTIKKKRPMRTGAVFI 244 Query: 238 -LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + +++ +LL +R + L G+ P A + + L Sbjct: 245 AITNDNRILLRKRTNTRLLEGMDELPGSAWSSTKDGNIDTHSAPFTANWILCNTITHTFT 304 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGAP 349 + +W + + ++ W++ + L +++ L P Sbjct: 305 HFTLTLFVWKTI--VPQIVIIPDSTWHDAQNLANAALPTVMKKALSAGGIKVP 355 >UniRef50_B6BVY5 A/G-specific adenine glycosylase n=1 Tax=beta proteobacterium KB13 RepID=B6BVY5_9PROT Length = 294 Score = 317 bits (813), Expect = 4e-85, Method: Composition-based stats. Identities = 120/308 (38%), Positives = 173/308 (56%), Gaps = 14/308 (4%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQTQV+TVIP++ +F+ RF + LA A DEV+ W+GLG+Y+RARNLHK A+ +A Sbjct: 1 MLQQTQVSTVIPFYIKFIDRFQNIKHLAEASEDEVMSYWSGLGFYSRARNLHKTARIIAE 60 Query: 99 LHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK 158 + KFP+TFE + LPG+GRSTAGAILS K F ILDGNVKRVL R + + Sbjct: 61 QYSCKFPDTFESLIQLPGIGRSTAGAILSFCFKKKFAILDGNVKRVLTRFFGIQESISLA 120 Query: 159 EVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 + E LW LSEQ+ P ++ + Q +MD GA +CT P+C CP+ CIA NN Sbjct: 121 KTEKDLWDLSEQLLPDGDIDIYTQGIMDFGATLCTPKNPQCHSCPMNQTCIAKQNNLTEA 180 Query: 219 YPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQ 278 P K +T +R+ F + + ++LL + +G+W GL+ PQ Sbjct: 181 IPVKNKTKTKEDRSTEFYIYECNKQILLVKN-RTGVWSGLWIPPQ-----------KNYL 228 Query: 279 IAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVE 338 NL + + FSH+ L + +++ + M + LW++ + P +GL AP++ Sbjct: 229 GKKINLIKQGERQCVFSHYRLKYKYFLIKITNKSDLMV--DGLWFDWKEIPELGLPAPIK 286 Query: 339 RLLQQLRT 346 LL L Sbjct: 287 SLLISLTN 294 >UniRef50_A0L468 A/G-specific DNA-adenine glycosylase n=1 Tax=Magnetococcus sp. MC-1 RepID=A0L468_MAGSM Length = 365 Score = 315 bits (809), Expect = 1e-84, Method: Composition-based stats. Identities = 136/348 (39%), Positives = 195/348 (56%), Gaps = 13/348 (3%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + + ++L +YD+YGR LPW+ + Y++WLSE+MLQQT V TV+PY+E+F++ FP++ Sbjct: 7 PADLAQRLLAYYDEYGR-DLPWRQQQDLYRIWLSEIMLQQTGVKTVMPYYEKFLSHFPSI 65 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 T LA A ++VL W GLGYY RAR LH+AAQQV HGG FPE +V ALPG+G STA Sbjct: 66 TQLAAASQEQVLAQWQGLGYYRRARMLHQAAQQVVQQHGGLFPEEITQVQALPGIGPSTA 125 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 AIL++ + ILDGNV RVLAR + + +LW ++ Q+T + Q Sbjct: 126 AAILAIGRNQAHTILDGNVMRVLARLLTLELPVDSTPGKQRLWQVARQLTSQQRPGDYAQ 185 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT-GYFLLLQHE 241 A+MDLGA +CTRS+P CS CP C A + SWA YP K+ K+ P ++LL + Sbjct: 186 AIMDLGATLCTRSQPACSRCPWGGACAARQHGSWAEYPKKREKKPKPHHYQCMWVLLDTQ 245 Query: 242 DEVLLAQRPPSGLWGGLYCF---PQFADE------ESLRQWLAQRQIAADNLTQLTAFRH 292 + L +RP GL GGL+ P + L I L + H Sbjct: 246 QRIFLRKRPLEGLLGGLWEPLGEPLLETPPLGNLVQRASHHLTALGIQGQPLLEAQPVDH 305 Query: 293 TFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERL 340 F+HF L + P+ + +S +++ N W+ LAQ +A ++ Sbjct: 306 IFTHFRLTVYPILVVAASGAPILNDAN--WWPLAQLDQRPIATLHRKV 351 >UniRef50_C7JG78 DNA glycosylase A/G-specific MutY n=12 Tax=Alphaproteobacteria RepID=C7JG78_ACEP3 Length = 357 Score = 315 bits (808), Expect = 2e-84, Method: Composition-based stats. Identities = 116/346 (33%), Positives = 169/346 (48%), Gaps = 13/346 (3%) Query: 7 SAQVLDWYDKYGRKTLPWQ----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + +L WYD++ R+TLPW+ PY+VWLSE+MLQQT V V PY+ RF +FPTV Sbjct: 6 AHALLHWYDRH-RRTLPWRVVGQSHPDPYRVWLSEIMLQQTTVKAVAPYYLRFTEKFPTV 64 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA+A ++VL W GLGYY+RARNLH AQ V L G FP+ + + LPG+G TA Sbjct: 65 QALASADREDVLAAWAGLGYYSRARNLHACAQAVVAL--GGFPQDVQGLRVLPGIGPYTA 122 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSG--WPGKKEVENKLWSLSEQVTPAVGVERF 180 A+ +++ G +DGNV+RV AR +A++ P +K++ +L+ F Sbjct: 123 AAVAAIAFGVPVVPVDGNVERVTARLFAITEPLPPARKKLAQLAITLNADREAQERPSDF 182 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 QA+ DLG+ +C+ P C LCP Q C A P K PK P R G L Q Sbjct: 183 AQALFDLGSSLCSPRAPACGLCPWQGECAGHKQGIAAELPRKLPKAERPVRYGAAFLAQD 242 Query: 241 -EDEVLLAQRPPSGLWGGLYCFPQFAD--EESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 ++LL +R GL + P E + Q A +H F+HF Sbjct: 243 AAGQILLRKRAEKGLLAAMTELPGTEWRLENWSEAEILQVAPFAAGWKLAGRVKHVFTHF 302 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 L + + F+ L + L + +++ ++ Sbjct: 303 TLYLDVYVAQIKHFSNQAVSDGFL-VPVECVKDTALPSVMQKCFEK 347 >UniRef50_Q1ITJ1 A/G-specific DNA glycosylase n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1ITJ1_ACIBL Length = 324 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 111/343 (32%), Positives = 172/343 (50%), Gaps = 36/343 (10%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 AS +L WY ++ R+ LPW+ + PY +W+SE+MLQQT+VA V+ + +F+A+FP Sbjct: 11 DASDLQKSLLSWY-RHSRRNLPWRRTRDPYAIWISEIMLQQTRVAAVLDKYAQFLAQFPN 69 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A LDEVL +W+GLGYY RAR LH+AAQ V GKFP+T LPG+GR T Sbjct: 70 VKALADASLDEVLTVWSGLGYYRRARALHQAAQMVVHHLHGKFPDTAAGWRQLPGIGRYT 129 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 + AI S++ + ++DGNV+RVL R G++ +LW +EQ+ +N Sbjct: 130 SAAIASIAFNEPAAVVDGNVERVLERL------DGERHEGERLWERAEQLLAKRAPGDWN 183 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QAMM+LGA IC P+C +CP+ C P +K ++ L + Sbjct: 184 QAMMELGATICLPQNPQCLVCPVNGPCKTRGPLQSRPQPKRKRA------ELWYALYARK 237 Query: 242 DEVLLAQRPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 + VLL QRP L G++ P I A+ + L RH+ + Sbjct: 238 NSVLLVQRPADHSLMAGMWELP---------------AIRANGVEPLHKLRHSITDTDYA 282 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + + + + + W + + + ++L++ Sbjct: 283 VFVVRGRTA-------KKHGKWVTHEEAHRMAITGLTRKILRK 318 >UniRef50_Q0EVU8 Adenine glycosylase n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EVU8_9PROT Length = 307 Score = 314 bits (804), Expect = 5e-84, Method: Composition-based stats. Identities = 107/334 (32%), Positives = 157/334 (47%), Gaps = 30/334 (8%) Query: 11 LDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPL 70 + WY ++ R LPW+ PY++W+SE+MLQQTQV TV+P + ++ FPT+ LA A Sbjct: 1 MAWYARHAR-DLPWRHTTDPYRIWISEIMLQQTQVKTVLPRYLAWLDTFPTIESLAAASA 59 Query: 71 DEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSL 130 D++L W GLGYY RAR +H+AA + G+FP F+++ +LPG+GRSTAGAI S Sbjct: 60 DDLLKAWEGLGYYRRARFIHQAAATIMEGFEGRFPRDFDDIVSLPGIGRSTAGAIASFCY 119 Query: 131 GKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTP-AVGVERFNQAMMDLGA 189 G P+LDGNVKRVL R + + LW L++Q + +NQAMM+LGA Sbjct: 120 GASTPVLDGNVKRVLKRWHG-----QPDASDKALWLLAQQAINISGKPGIWNQAMMELGA 174 Query: 190 MICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 C+ P C CP+ C +A + K + + + L QR Sbjct: 175 SACSAKSPDCGACPVNAFCASAFQVEVSQERRKAVPVLDVHWRIDLHICPIKG-IWLTQR 233 Query: 250 PPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVS 309 P +G+WGGL+ P + + + H +H L + Sbjct: 234 PDAGIWGGLWTPPITELDAPPVE--------------APSHIHQLTHRRLHLYAYQCESE 279 Query: 310 SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 W L + RLL++ Sbjct: 280 PVGA------GHWVP--SIEGYALPTGIHRLLEK 305 >UniRef50_B2UNR9 HhH-GPD family protein n=1 Tax=Akkermansia muciniphila ATCC BAA-835 RepID=B2UNR9_AKKM8 Length = 341 Score = 313 bits (803), Expect = 5e-84, Method: Composition-based stats. Identities = 103/342 (30%), Positives = 160/342 (46%), Gaps = 20/342 (5%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 F +++W+ + GR PW+ P+ + +SE+MLQQT + TV+ ++R+M +FP Sbjct: 8 FNIHAFRNALVEWFRREGR-DYPWRRTTDPWHILVSELMLQQTTIPTVLGRYDRWMRQFP 66 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T LA L W GLGYY R R+L A+++ GG+FP+ E + LPG+G Sbjct: 67 TPAHLAAVDEQTALRSWEGLGYYRRVRSLQAIAREIVNEFGGRFPDNAEGLKRLPGIGPY 126 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 T+GA+LS + K PI+D NV RVLAR S E + LWS +E + F Sbjct: 127 TSGALLSFAFNKAAPIVDANVARVLARIDNYSVPVDSTEGQKYLWSRAESLVDPEHAREF 186 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 N A+M+LG C+ S P C LCP++ C A P K PK + + +L Sbjct: 187 NSAIMELGQTCCSISSPDCLLCPVRPFCSA---ERPETLPVKNPKPQVTRVEHHDILYIR 243 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLD 300 VLLA+ P G+Y FPQ DE +L +L + ++ + + + Sbjct: 244 GKSVLLAKCPEGKRHAGMYRFPQREDEHTL------------SLPHVLKQTYSITRYRVT 291 Query: 301 IVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + + + L + + +A+P + L Sbjct: 292 RYIHHVT----DTPLLREGEEFVPLDKIHGLPMASPDRKALN 329 >UniRef50_Q10159 A/G-specific adenine DNA glycosylase n=1 Tax=Schizosaccharomyces pombe RepID=MYH1_SCHPO Length = 461 Score = 313 bits (802), Expect = 8e-84, Method: Composition-based stats. Identities = 109/407 (26%), Positives = 173/407 (42%), Gaps = 77/407 (18%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQI-------DKTP-----------YKVWLSEVMLQQ 42 ++ +F ++ +YDK R LPW+ + +P Y+V +SE+MLQQ Sbjct: 16 LEVERFRESLIQFYDKTKR-ILPWRKKECIPPSEDSPLEDWEQPVQRLYEVLVSEIMLQQ 74 Query: 43 TQVATVIPYFERFMARFPTVTDLANAPLD-EVLHLWTGLGYYARARNLHKAAQQVATLHG 101 T+V TV Y+ ++M PT+ A A + +V+ LW+G+G+Y R + LH+A Q +A LH Sbjct: 75 TRVETVKRYYTKWMETLPTLKSCAEAEYNTQVMPLWSGMGFYTRCKRLHQACQHLAKLHP 134 Query: 102 GKFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEV 160 + P T +E A +PGVG TAGA+LS++ + I+DGNV RVL+R A+ K + Sbjct: 135 SEIPRTGDEWAKGIPGVGPYTAGAVLSIAWKQPTGIVDGNVIRVLSRALAIHSDCSKGKA 194 Query: 161 ENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL-- 218 +W L+ ++ V FNQA+M+LGA+ CT P+CS+CP+ C A + Sbjct: 195 NALIWKLANELVDPVRPGDFNQALMELGAITCTPQSPRCSVCPISEICKAYQEQNVIRDG 254 Query: 219 --------------------------------YPGKKPKQTLPERTGYFLLLQHEDE--- 243 YP K E ++ Q D Sbjct: 255 NTIKYDIEDVPCNICITDIPSKEDLQNWVVARYPVHPAKTKQREERALVVIFQKTDPSTK 314 Query: 244 ---VLLAQRPPSGLWGGLYCFPQFA-----DEESLRQWLAQRQIA---------ADNLTQ 286 L+ +RP +GL GL+ FP + + + Sbjct: 315 EKFFLIRKRPSAGLLAGLWDFPTIEFGQESWPKDMDAEFQKSIAQWISNDSRSLIKKYQS 374 Query: 287 LTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGL 333 + H FSH + S + + W + + VG+ Sbjct: 375 RGRYLHIFSHIRKTSHVFYAIAS--PDIVTNEDFFWISQSDLEHVGM 419 >UniRef50_C5YTP8 Putative uncharacterized protein Sb08g006760 n=1 Tax=Sorghum bicolor RepID=C5YTP8_SORBI Length = 494 Score = 312 bits (801), Expect = 1e-83, Method: Composition-based stats. Identities = 116/346 (33%), Positives = 175/346 (50%), Gaps = 45/346 (13%) Query: 6 FSAQVLDWYDKYGRKTLPWQ-----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 AQ+L WYD + R+ LPW+ ++ Y VW+SEVMLQQT+V V+ Y+ER+MAR+P Sbjct: 64 LRAQLLRWYDAH-RRDLPWRCASGGEEERAYAVWVSEVMLQQTRVPVVVGYYERWMARWP 122 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 TV LA A +EV +W GLGYY RAR L + A+Q+ G FP T + + G+G Sbjct: 123 TVRSLAAATQEEVNEMWAGLGYYRRARFLLEGAKQIIEK--GVFPRTASALREVRGIGDY 180 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGAI S++ + P++DGNV RV++R YA++ P + + W L+ Q+ + F Sbjct: 181 TAGAIASIAFNEVVPVVDGNVVRVISRLYAIADNPKESSTVKRFWELAGQMVDPLRPGDF 240 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIA------AANNSWALYPGKKPKQTLPERTGY 234 NQAMM+LGA +C+++KP CS C + + C A A+ +P PK Sbjct: 241 NQAMMELGATLCSKTKPGCSECAVSSHCQALALSHENASVQVTDFPRVVPKAKPRRDFAA 300 Query: 235 FLLLQ--------------HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIA 280 ++Q + + LL +RP GL GL+ FP +E + +R+ Sbjct: 301 VCVVQIAQGLEEEAPDAKDNSNLFLLIKRPEEGLLAGLWEFPLILVDEGKTDFQNRRKAM 360 Query: 281 ADNLTQL-----------------TAFRHTFSHFHLDIVPMWLPVS 309 L++L H FSH L + + ++ Sbjct: 361 DKYLSKLLSFDVGQRSDVIFREDVGEHVHIFSHIRLTMHVELMIIN 406 >UniRef50_A3EPM8 Putative A/G-specific DNA glycosylase n=2 Tax=Leptospirillum sp. Group II RepID=A3EPM8_9BACT Length = 355 Score = 311 bits (797), Expect = 3e-83, Method: Composition-based stats. Identities = 124/341 (36%), Positives = 178/341 (52%), Gaps = 22/341 (6%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WY + R +LPW+I++TPY+VW+SE+MLQQT V V+ YFERF+ RFP V Sbjct: 24 ERLRDDLLVWYKEVSR-SLPWRINRTPYRVWVSEIMLQQTTVRAVLGYFERFLERFPDVD 82 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA AP+++VL LW GLGYY RARNLH+AA+ +A+ G FPET E LPGVGRSTAG Sbjct: 83 SLAEAPVEDVLKLWEGLGYYQRARNLHRAARIIAS---GGFPETVEGWMNLPGVGRSTAG 139 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAV-SGWPGKKEVENKLWSLSEQVT-PAVGVERFN 181 A+ S++LG+ PILD NV+RVL R + G ++ + LW LS+ A N Sbjct: 140 AVCSIALGQETPILDVNVRRVLGRLRGISPGDAVRESPD--LWELSKAFVTEASDPGEVN 197 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHE 241 QA+M++GA++C KP C++CP C + L P +K K+ R L+ Sbjct: 198 QALMEIGAVVCLPRKPLCTVCPWSLDCASCGAPEEILNPPRKKKEKQ-IRIRTALIPSDG 256 Query: 242 DEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 L Q L GL+ + + H +SHF ++ Sbjct: 257 SGYFLVQ-GRDRLLEGLW------------DVFSVSGPPGEGQMPFGKVLHEYSHFREEV 303 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + S + E ++LW + V L ++L+ Sbjct: 304 FLVREERSLLEAALGETSSLWLAKGEESPVALTGVARKILR 344 >UniRef50_C1EFR0 Predicted protein n=1 Tax=Micromonas sp. RCC299 RepID=C1EFR0_9CHLO Length = 574 Score = 310 bits (794), Expect = 5e-83, Method: Composition-based stats. Identities = 114/432 (26%), Positives = 164/432 (37%), Gaps = 90/432 (20%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID---------------------------------- 27 A + ++L WYDK R LPW+ + Sbjct: 117 DAPRLRDELLAWYDKSHR-ILPWRRNFHSHHAPPPPGPQGGDGCEPKEYWAGCDETGAPT 175 Query: 28 ---KT--PYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGY 82 + Y VW+SE+M QQTQ+ V Y+ R+ R+P V+ LA+A +EV LW GLGY Sbjct: 176 NVPRDQYAYGVWVSEIMSQQTQIERVAQYWLRWTERWPDVSSLASATREEVNELWAGLGY 235 Query: 83 YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVK 142 Y RA L A+ V FP + +A +PGVG TA AI S++ + +DGNV Sbjct: 236 YRRAGFLLDGARHVVDNCDTIFPNDAKGLANVPGVGPYTAAAIASIAFDEPVAAVDGNVI 295 Query: 143 RVLARCYAVSG-----WPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKP 197 RV R AV+G P + + + FNQAMM+LGA +CT P Sbjct: 296 RVCTRLAAVTGGGDAAKPSSDAAKAVRLCADWLIDDSTRPGDFNQAMMELGATVCTPKAP 355 Query: 198 KCSLCPLQNGCIAAAN-------NSWALYPGKKPKQTLPERTGYFLLLQHED-------- 242 C CPL+ GC AA P K+ K E +++ + Sbjct: 356 ACGQCPLRVGCAGAALESAGSSGFKVTDLPEKEKKPEKREERVAVRVVERKGGRRDGVPG 415 Query: 243 ----EVLLAQRPPSGLWGGLYCFPQFADEESLR--------------QWLAQRQIAADNL 284 LL +RP GL GGL+ FP ++L + + Sbjct: 416 PADSSFLLVRRPEGGLLGGLWEFPSANVPDNLEPGDPDFYASCADVVHSCGSSGKFSKSR 475 Query: 285 TQLTAFRHTFSHFHLDIVPMWLPVS------------SFTGCMDEGNALWYNLAQPPSVG 332 L HTFSH V + W + ++G Sbjct: 476 VGLGEVTHTFSHVRHTYVAQHEVWETSADDDGDEAPPTRGDTAGGREWRWVTATELQTLG 535 Query: 333 LAAPVERLLQQL 344 L++ V ++ + L Sbjct: 536 LSSGVRKIYELL 547 >UniRef50_UPI0000E49729 PREDICTED: hypothetical protein, partial n=2 Tax=Strongylocentrotus purpuratus RepID=UPI0000E49729 Length = 425 Score = 310 bits (794), Expect = 6e-83, Method: Composition-based stats. Identities = 103/396 (26%), Positives = 159/396 (40%), Gaps = 74/396 (18%) Query: 27 DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARA 86 + Y VW+SE+M QQTQVATVI Y+ ++M ++PT+ L+ A L+EV +W GLGYY+R Sbjct: 12 NHKAYAVWVSEIMCQQTQVATVIDYYNKWMKKWPTLESLSKASLEEVREVWAGLGYYSRG 71 Query: 87 RNLHKAAQQVATLHGGKFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVL 145 + L + A +V G+ P T E++ LPGVGR TAGAI S+S + ++DGNV RVL Sbjct: 72 QRLFEGACKVQNELDGQIPGTAEQLRKELPGVGRYTAGAIASISFSEATGVVDGNVIRVL 131 Query: 146 ARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQ 205 +R + + V +W L+ + FNQ+MM+LGA +C P+C CP+Q Sbjct: 132 SRLRMIGADFTTQNVMTAIWDLANAIVDPDRPGDFNQSMMELGATVCHPKSPQCPSCPVQ 191 Query: 206 NGCIAAANNSWALYPGKKPKQT-------------------------------------- 227 + C A Sbjct: 192 SHCRAIQQMDQLTRDLATKLTKSNGQSLPEEKDIVDIECAADGCSLCMDGPVDANLGVMN 251 Query: 228 ---------LPERTGYFLLLQ----HEDEVLLAQRPPSGLWGGLYCFPQFADEES----- 269 L ++ +++ E+E L+ QRP +GL G++ FP E Sbjct: 252 YPRKPKKKPLKQQVIAVCIVERETNDEEEYLIVQRPDTGLLAGMWEFPSIEIAEETSRQK 311 Query: 270 --------LRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPV---------SSFT 312 L+ L + + F H FSH H + V Sbjct: 312 SRNKMDSYLKDTLNMTLKNVKDRKHIIQFVHMFSHIHQTYELETMKVEEEEEEDVSDKSQ 371 Query: 313 GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 D + W + + ++A + + Sbjct: 372 ESEDIPHHQWVSRSAFEGQAVSAGMRKAFDAYNGKT 407 >UniRef50_C5ZWA7 A/G-specific adenine glycosylase n=3 Tax=Helicobacter RepID=C5ZWA7_9HELI Length = 332 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 101/352 (28%), Positives = 167/352 (47%), Gaps = 26/352 (7%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKT---PYKVWLSEVMLQQTQVATV-IPYFERFM 56 M + +L WY+K GRK+LPW+ + Y+V +SE+MLQQTQV T+ ++ F+ Sbjct: 1 MNLPLLHSSILLWYEKKGRKSLPWRDKTSKNRAYRVLVSEIMLQQTQVKTILERFYFPFL 60 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 +FPT+ L+ A +EVL W GLGYY RARNL K A+ H G P+ E + LPG Sbjct: 61 EKFPTLETLSKAKEEEVLLQWRGLGYYTRARNLLKCAKICCESHKGILPKDIESLQKLPG 120 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +GR TAGAI + +D N+KR+L R +A+ L S ++ + Sbjct: 121 IGRYTAGAIACFGYDRAVSFVDSNIKRILTRFFALQSP-----SPKLLESKAKTILNTQE 175 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 NQA++D+GA +CT PKC+ CPLQ C N ++ K L ++T Sbjct: 176 PFNHNQALLDIGATLCTPKNPKCTQCPLQPFCQGKENPTFYT---PTKKTALLKQTLEIG 232 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + + L + + L+ LY FP + +S +L +H ++ Sbjct: 233 IFIQDSKFALTK-SQTKLYYNLYNFPLISPPKS------------HQTNKLGILKHAYTK 279 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 ++L + L + ++ + ++ +++ ++L L+ Sbjct: 280 YNLTLHLYSLQSQELLQQTQQ-KIEFFTQEELKNLPISSMTLKILDFLKKST 330 >UniRef50_Q5SH32 A/G-specific adenine glycosylase (MutY) n=2 Tax=Thermus thermophilus RepID=Q5SH32_THET8 Length = 325 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 109/341 (31%), Positives = 165/341 (48%), Gaps = 34/341 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WY + R LPW+ +K PY+V +SEV+LQQT+V +PY+ RF+ RFPT+ Sbjct: 2 EAWRKALLAWYRENAR-PLPWRGEKDPYRVLVSEVLLQQTRVEQALPYYRRFLERFPTLK 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A L+EVL +W G GYY RA +LH+ A+ V P +F E+ LPG+G TA Sbjct: 61 ALAAASLEEVLRVWQGAGYYRRAEHLHRLARSVEE-----LPPSFAELRGLPGLGPYTAA 115 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFNQ 182 A+ S++ G+ +DGNV+RVL+R +A E +L++L++ + P +NQ Sbjct: 116 AVASIAFGERVAAVDGNVRRVLSRLFARESPK-----EKELFALAQGLLPEGVDPGVWNQ 170 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C +P+C CPL C + YP + ++ ER L+L Sbjct: 171 ALMELGATVCLPKRPRCGACPLGAFC--RGKEAPGRYPAPRKRRAKEER-LVALVLLGRK 227 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 V L + G + GLY P F EE + L RH +H L + Sbjct: 228 GVHLER--LEGRFQGLYGVPLFPPEE----LPGREAAFGVRSRPLGEVRHALTHRRLRVE 281 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 P L +E++L++ Sbjct: 282 VRGALWEGEG-------------EDPWKRPLPKLMEKVLRK 309 >UniRef50_B0D6Q9 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0D6Q9_LACBS Length = 562 Score = 307 bits (787), Expect = 4e-82, Method: Composition-based stats. Identities = 117/466 (25%), Positives = 178/466 (38%), Gaps = 125/466 (26%) Query: 6 FSAQVLDWYDK-YGRKTLPWQIDKTP-----------YKVWLSEVMLQQTQVATVIPYFE 53 +L WY + + +PW+ P Y+VW+SE+MLQQTQVATVIPY+ Sbjct: 88 MRVALLRWYRTVHDTRRMPWRKPYNPSCGAEERAQRAYEVWVSEIMLQQTQVATVIPYYT 147 Query: 54 RFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAA 113 R+M ++PT+ LA+A +D+V LW GLGYY+RA L AQ+ +GG+ P+ +E+ A Sbjct: 148 RWMEKYPTIRHLASANVDQVNALWKGLGYYSRASRLLAGAQKAIQKYGGRLPDNAKEMEA 207 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT 172 +PG+GR +AGAI S++ G+ P+LDGNV R+L+R A+ P K + LW + + Sbjct: 208 NIPGIGRYSAGAICSIAYGEKVPVLDGNVHRLLSRVLALHAPPKAKSTLDILWDAATVMV 267 Query: 173 ---------PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS-------- 215 P NQA+++LG+ +C P+C CP+QN C A N+ Sbjct: 268 QIEEADTTSPPQYAGDINQALIELGSTVCKVRDPECGTCPIQNWCSAYQNSEPKESAKED 327 Query: 216 ---------------------WALYPGKKPKQTLPERTGYFLLLQ-------HEDEVLLA 247 YP K ++ E +++ E L Sbjct: 328 NTHMVDIEDICGLCEPLTGAGVMSYPMKTERKKAREELDIVNVVEWRPIQNSDERWFLFR 387 Query: 248 QRPPSGLWGGLYCFPQFA-------------------------------------DEESL 270 +RP +GL GLY FP E L Sbjct: 388 RRPSTGLLAGLYEFPTSPAVSKPISHDALKHIPYNLISQVVQDARVFPLSTHAQKSGEDL 447 Query: 271 RQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVS--------------------- 309 + + H FSH W+ + Sbjct: 448 EDDCEDLHLRISKIKVAGDVVHLFSHVKKTYRAQWVLLEGGSTPPTLKSNLEVEIPIKKS 507 Query: 310 ---------SFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 +A+W L V + V ++ + Sbjct: 508 GKRKTSPATGGPDTESPAHAMWVLLDDVADVNMGTGVAKVWNLTQA 553 >UniRef50_B9SF01 A/G-specific adenine glycosylase muty, putative n=3 Tax=core eudicotyledons RepID=B9SF01_RICCO Length = 775 Score = 306 bits (784), Expect = 9e-82, Method: Composition-based stats. Identities = 115/396 (29%), Positives = 175/396 (44%), Gaps = 59/396 (14%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT--------------PYKVWLSEVMLQQTQVAT 47 + + +L+WYD+ R+ LPW+ KT Y +W+SEVMLQQT+V T Sbjct: 35 ETQKIRESLLEWYDQNQRQ-LPWRRQKTTNPSQESEEEKEKRAYGIWVSEVMLQQTRVQT 93 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 VI Y+ R+M ++PT+ LA A L+EV +W GLGYY RAR L + A+ + GG FP T Sbjct: 94 VIDYYNRWMLKWPTIHHLAQASLEEVNEIWAGLGYYRRARFLLEGAKMIVA--GGGFPNT 151 Query: 108 FEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSL 167 + +PG+G TAGAI S++ + P++DGNV RVL R A+S P KLW L Sbjct: 152 VSSLRKVPGIGDYTAGAIASIAFKEVVPVVDGNVVRVLTRLRAISANPKDSMTVKKLWKL 211 Query: 168 SEQVTPAVGVERFNQAMMDLGAMICTRSK------PKCSLCPLQNGCIAAANNSWALYPG 221 + Q+ FNQ++M+LGA +C S P S C + + + YP Sbjct: 212 AAQLVDPCRPGDFNQSLMELGATVCAPSNPSCSSCPVSSQCRVLSISNQDKSILVTDYPT 271 Query: 222 KKPKQTLPERTGYFLLLQ------------HEDEVLLAQRPPSGLWGGLYCFPQFADEES 269 K K +++ + + LL +RP GL GL+ FP ++ Sbjct: 272 KVVKVKPKHEFSAVCVVEILGSCGPVDNQKTDSKFLLVKRPDDGLLAGLWEFPTCRLDKE 331 Query: 270 LRQWLAQRQIA----------------ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFT- 312 + +I + F H F+H L + L + Sbjct: 332 ADLITRRNEIDHFMKKSFRLDPEKTYSMVLREDIGEFVHIFTHIRLKVYVDLLVIRLKGG 391 Query: 313 -------GCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + N P++GL + V +++ Sbjct: 392 MSQLFRKQEKEATNWKCVEKKALPNLGLTSGVRKVI 427 >UniRef50_A6FJZ0 A/G-specific adenine glycosylase n=7 Tax=Alphaproteobacteria RepID=A6FJZ0_9RHOB Length = 352 Score = 305 bits (782), Expect = 2e-81, Method: Composition-based stats. Identities = 131/348 (37%), Positives = 172/348 (49%), Gaps = 18/348 (5%) Query: 8 AQVLDWYDKYGRKTLPWQ---------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 A +L WYD + R LPW+ PY+VWLSEVMLQQT VA V YFERF A Sbjct: 8 ADLLAWYDAHAR-DLPWRVAPAARLAGTRPDPYRVWLSEVMLQQTTVAAVRAYFERFTAL 66 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 +PTV LA AP +V+ W GLGYYARARNL K A+ VA HGG+FPET E + ALPG+G Sbjct: 67 WPTVEALAAAPDAQVMGEWAGLGYYARARNLLKCARVVAEDHGGRFPETREALMALPGIG 126 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T AI +++ ++DGNV+RV+AR + + + E L + +TP Sbjct: 127 PYTGAAIAAIAFDAPEVVVDGNVERVMARVHDIRTPLPAAKPE--LVRAAAALTPKRRPG 184 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL- 237 QA+MDLGA +CT P C +CP C A A P K PK+ P R G + Sbjct: 185 CHAQAVMDLGATVCTPRAPACGICPWWGACAARKAGVQAELPHKTPKKPKPLRFGIAYVA 244 Query: 238 LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHF 297 + + LL RP GL GG+ +P ES + I AD RHTF+HF Sbjct: 245 RRADGAWLLETRPDKGLLGGMLGWPGSDWTESAP--IPAPPIEADWTELGEEARHTFTHF 302 Query: 298 HLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 HL + + V+ + L + ++ R Sbjct: 303 HLRL---AIKVAHVPQNAAPLRGAFTPATAFRPSDLPTVMRKVFDLAR 347 >UniRef50_A1W1M2 A/G-specific adenine glycosylase n=13 Tax=Campylobacter jejuni RepID=A1W1M2_CAMJJ Length = 339 Score = 305 bits (781), Expect = 2e-81, Method: Composition-based stats. Identities = 104/357 (29%), Positives = 170/357 (47%), Gaps = 36/357 (10%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ------------IDKTPYKVWLSEVMLQQTQVATV- 48 + + +L WY+K GRKTLPW+ Y V++SE+MLQQTQV +V Sbjct: 5 ELEKLQENLLLWYEKNGRKTLPWRNLQSQNCDESLKHIDRAYGVYISEIMLQQTQVKSVL 64 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 ++ F+ +FPT+ LANA DE+L W GLGYY RARNL KAA + K P+ Sbjct: 65 ERFYFPFLQKFPTLESLANANEDELLKAWQGLGYYTRARNLKKAALECVDKFEAKLPKEV 124 Query: 109 EEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLS 168 E++ L G+G TAGAI + +DGN++RVL+R +A+ +L + Sbjct: 125 EDLKKLSGIGAYTAGAIACFGYDQKVSFVDGNIRRVLSRLFALENPSM-----KELEKRA 179 Query: 169 EQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL 228 +++ NQA++D+GA++C KC +CPL + C LYP + K Sbjct: 180 KELLNLNHAFDHNQALLDIGALVCVSKNAKCGICPLYDFCQ--GKFHTELYP-RAKKILY 236 Query: 229 PERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLT 288 L + + + Q L+ G+Y FP F + E ++ + Sbjct: 237 ESVNLNLFLFEFNKKFAIGQ-SQDKLYKGMYNFPFFKEGEYKLS---------KDMGFVG 286 Query: 289 AFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 F+H+++ + L+I +++ + N + L + S L+A + L+ ++ Sbjct: 287 EFKHSYTKYKLNIKVYHQILNN-----ENKNYEFKTLKELESTALSALSLKALKLIK 338 >UniRef50_Q9ZMT8 A/G-SPECIFIC ADENINE GLYCOSYLASE n=21 Tax=Helicobacter RepID=Q9ZMT8_HELPJ Length = 328 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 103/348 (29%), Positives = 170/348 (48%), Gaps = 30/348 (8%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK---TPYKVWLSEVMLQQTQVATV-IPYFERFMARF 59 +L WY+++GRK LP++ K PY+V++SEVM QQTQ++TV ++ F+ F Sbjct: 2 ETLHNALLKWYEEFGRKDLPFRNLKGINAPYEVYISEVMSQQTQISTVIERFYPPFLKAF 61 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT+ DLANAPL+EVL LW GLGYY+RA+NL K+A+ H + P ++ + LPG+G Sbjct: 62 PTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLKLPGIGA 121 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA AIL + +D NVKRVL R + + L + Sbjct: 122 YTANAILCFGFREKSACVDANVKRVLLRLFGLDPNIHA----KDLQIKANDFLNLNESFN 177 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 NQA++DLGA+IC+ KPKC++CP C+ + K KQ + + Y ++ Sbjct: 178 HNQALIDLGALICSP-KPKCAICPFNPYCLGKNHLERHTL---KKKQEIIQEERYLGVVI 233 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 +++ L + L+ G++ FP + + L L +H+ + F L Sbjct: 234 QNNQIALEK-IEQKLYLGMHHFPNLKENLEFK------------LPFLGTIKHSHTKFKL 280 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 ++ + +Y+L ++ +++ ++L L+ Sbjct: 281 NLNLYLATTKDL-----KNPIRFYSLKDLETLPISSMTLKILNFLKQK 323 >UniRef50_UPI0001926FFC PREDICTED: similar to mutY homolog, partial n=1 Tax=Hydra magnipapillata RepID=UPI0001926FFC Length = 424 Score = 304 bits (779), Expect = 3e-81, Method: Composition-based stats. Identities = 94/296 (31%), Positives = 147/296 (49%), Gaps = 31/296 (10%) Query: 33 VWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKA 92 VW+SE+MLQQTQVATVI Y+ +M ++P++ LA+A L+EV W+GLGYY+R R LH+A Sbjct: 88 VWVSEIMLQQTQVATVINYYNNWMKKWPSIASLASATLEEVNEAWSGLGYYSRGRRLHEA 147 Query: 93 AQQVATLHGGKFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAV 151 A+++ G P++ + + LPG+G TA AI S++ + ++DGNV RVL R + Sbjct: 148 AKKLIES-GQPMPKSSKSLVKELPGIGPYTASAIASIAFNEVCGVVDGNVIRVLTRIRMI 206 Query: 152 SGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAA 211 K V + +W L+ + FNQ MM+LGA +CT P+CS CPL C++ Sbjct: 207 GADSSSKAVNDFIWELANTIVDEERPGDFNQGMMELGATVCTPKSPQCSQCPLSTLCMSY 266 Query: 212 ANN-----------------------------SWALYPGKKPKQTLPERTGYFLLLQHED 242 +P K ++ ++Q+ + Sbjct: 267 KKGIFLFVIPLVSWINQICIRQQYLVEEGTGLGVMNFPRKNKNKSPLAEEYDVAVIQNNN 326 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 + LL QRP GL GL+ FP F E+ + + Q+ ++ + + F Sbjct: 327 KFLLVQRPNKGLLAGLWEFPLFLRTENSKNEILSLQVEKRFSIKIEKIKEINTIFK 382 >UniRef50_Q5KQ30 A/G-specific adenine DNA glycosylase, putative n=2 Tax=Filobasidiella neoformans RepID=Q5KQ30_CRYNE Length = 568 Score = 304 bits (779), Expect = 4e-81, Method: Composition-based stats. Identities = 105/418 (25%), Positives = 176/418 (42%), Gaps = 86/418 (20%) Query: 8 AQVLDWYD-KYGRKTLPWQIDKTP-----------YKVWLSEVMLQQTQVATVIPYFERF 55 +L+W++ ++ +PW+ P Y++W+SEVMLQQTQV TVI Y++R+ Sbjct: 94 ESLLEWFECVREKRGMPWRKKYDPSLSFEEKGQRAYEIWVSEVMLQQTQVTTVIAYWQRW 153 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVA--TLHGGKFPETFEEVAA 113 M R+PT++DLA A ++ GLGYY RAR+L A+ V + + G+ P+ + Sbjct: 154 MERWPTISDLAKADVE-----VRGLGYYRRARSLLAGAKTVMGNSKYNGRLPDDPAVLEK 208 Query: 114 -LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQV- 171 + GVGR TAGAI S++ G PI+DGN+ R+L R AV LW +++++ Sbjct: 209 EIDGVGRYTAGAICSMAYGVRTPIVDGNIHRLLTRLLAVHAPQTGPATIKFLWRVADELI 268 Query: 172 --TPAVGV-----ERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN----------- 213 P+ +NQA+M+LG+ +C + P+C +CPL+ C A Sbjct: 269 KHLPSGDKHNNVVGDWNQALMELGSQVCKPANPECGVCPLKKACKGYAELSNSPSQPSTT 328 Query: 214 -------------------NSWALYPGKKPKQTLPERTGYFLLLQHED-----EVLLAQR 249 + ++P KK K+ ++Q L +R Sbjct: 329 KSDCKLCAPIPCDIETDRIPTVMVFPMKKEKKASRVEEETVCIVQWRGDGDQRRWLFTKR 388 Query: 250 PPSGLWGGLYCFPQFADEESLRQ-----------------WLAQRQIAADNLTQLTAFRH 292 P GL GL+ P L + + + + H Sbjct: 389 PEKGLLAGLFEPPTTPVSAGLSSSERLGASLEALSDYIEITEGEAEGLQKSSRDVGNIPH 448 Query: 293 TFSHFHLDIVPMWLPVSSFTG------CMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 FSH ++ L ++S A+W N + + V+++ ++ Sbjct: 449 IFSHINMTYHIHLLTLTSPGNEPPSVKPKAPRPAVWLNEEEVEKANVGTGVKKVWAEI 506 >UniRef50_B7P4H9 A/G-specific adenine glycosylase muty, putative n=1 Tax=Ixodes scapularis RepID=B7P4H9_IXOSC Length = 345 Score = 302 bits (774), Expect = 1e-80, Method: Composition-based stats. Identities = 112/360 (31%), Positives = 167/360 (46%), Gaps = 46/360 (12%) Query: 14 YDKYGRKTLPWQ--------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDL 65 +D + LPW+ ++ Y VW+SE+MLQQT+V TVI Y++R+M ++PTV DL Sbjct: 2 FDFGDARQLPWRDIAKADGDPNQKAYAVWVSEIMLQQTRVTTVIEYYKRWMKKWPTVVDL 61 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEV-AALPGVGRSTAGA 124 A A ++EVL +W GLGYY RAR LHK AQ+V GG FP T + + +PGVG TA A Sbjct: 62 ARASIEEVLQVWAGLGYYQRARRLHKGAQKVVRDLGGLFPNTPKHLAREIPGVGCYTAAA 121 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 + S++ G +DGNV RV +R + G E LW+ + + V FNQA+ Sbjct: 122 VASIAFGHRAGAVDGNVARVYSRMRLLGATLGSSPSERALWAAANEAVCPVRPGDFNQAV 181 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 M+LGA +CT + ++ LYP K K+ T +L+ D Sbjct: 182 MELGARVCT----------------LMSWDATQLYPHKAAKKEPRRETHAVSVLRSGDRY 225 Query: 245 LLAQRPPSG-LWGGLYCFPQ---------FADEESLRQWLAQRQIA-------ADNLTQL 287 LL ++ G L GL+ FP ++ R+ + Q +L Sbjct: 226 LLLRKTGQGRLLEGLWEFPNRQVDPQGSAKKSQDEERRVVLQIGAQLLNSAGGKKRARRL 285 Query: 288 TAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTG 347 H FSH H + E W + A+ S G++ ++++L Sbjct: 286 GEVTHLFSHIHATYCVYETTLDE----ECETEGSWMSAAEVESSGVSTAMKKVLSLAAGR 341 >UniRef50_B7A8Y1 HhH-GPD family protein n=1 Tax=Thermus aquaticus Y51MC23 RepID=B7A8Y1_THEAQ Length = 333 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 113/341 (33%), Positives = 164/341 (48%), Gaps = 34/341 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + +L WY K+ R LPW+ +K PY++ +SEV+LQQTQV IPY+ RF+ RFPT+ Sbjct: 2 KAWQEALLAWYRKHAR-PLPWRGEKDPYRILVSEVLLQQTQVRQAIPYYRRFLQRFPTLK 60 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 L APL+EVL +W G GYY RA +LH+ AQQV P++F ++ LPG+G TA Sbjct: 61 ALGEAPLEEVLRVWQGAGYYRRAVHLHRLAQQVEA-----LPQSFAQLKGLPGLGPYTAA 115 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFNQ 182 A+ S++ G+ +DGNV+RVLAR +A+ G L L++ + P +NQ Sbjct: 116 AVASMAFGERVAAVDGNVRRVLARLFALEG-----ASPKALQGLAQSLMPEEAHPGEWNQ 170 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C KP C CPL + C P ++ + E L+L Sbjct: 171 ALMELGATVCLPRKPLCGACPLASRCRGKEAPERYPLP---QRRKVREERLAALVLLGRK 227 Query: 243 EVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIV 302 V L + G + GLY P F EE + + L RH +H L + Sbjct: 228 GVYLER--LEGRFQGLYGVPLFPAEE----LPVRAEALGVAPRLLGEVRHALTHRRLLVE 281 Query: 303 PMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 P L +E++L+Q Sbjct: 282 VHGAFWEGEG-------------EDPWKRPLPKLMEKVLRQ 309 >UniRef50_D2VUC5 Predicted protein n=1 Tax=Naegleria gruberi RepID=D2VUC5_NAEGR Length = 616 Score = 301 bits (771), Expect = 3e-80, Method: Composition-based stats. Identities = 111/470 (23%), Positives = 179/470 (38%), Gaps = 127/470 (27%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID------------------KTPYKVWLSEVMLQQT 43 + +++WYD + LPW+ + Y+V +SE+MLQQT Sbjct: 36 ERGHIWRNLINWYDNNKK-DLPWRKTIKSADDEDDDVKATEELHQRAYEVLVSEIMLQQT 94 Query: 44 QVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGG- 102 QV TV YF ++M RFPTV DLA+A +D+ + +W+GLGYY RAR L+ AA+ + Sbjct: 95 QVETVKTYFLKWMKRFPTVKDLADATVDDAVGIWSGLGYYRRARYLYNAAKYIVEKINKP 154 Query: 103 ----------------------KFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGN 140 K P+ +++ LPGVGR TAGAI S++ + ++DGN Sbjct: 155 IKNEEIISDDEDDDEEKKRIIYKLPDNVKDLMQLPGVGRYTAGAIASIAFSRTASVVDGN 214 Query: 141 VKRVLARCYAVSGWPGKKEVENK-LWSLSEQV-----------TPAVGV---------ER 179 V RV AR + + N W +++ + + Sbjct: 215 VFRVFARLKRIEEDIAVNKTANTVFWPMADNLIQHYRDIEDTVLEKEDLELKHRYNVTGD 274 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN------NSWALYPGKKPKQTLPERTG 233 FNQA+M+LG +C PKC+ CPL C A + +YP K K + Sbjct: 275 FNQAVMELGRTVCIPRNPKCTECPLAEVCEANKALEKEEIATVEIYPVKNKKTKKRTQVV 334 Query: 234 YFLLLQHEDEV---LLAQRPPSGLWGGLYCFPQFADEES--------------------- 269 + E +V L +RP L GG + FP +++ Sbjct: 335 SCCIFYKEGDVDSCLFEKRPKETLLGGTWHFPFVIIQDTKAEEEENEEDSEDEEGSKKKK 394 Query: 270 --------------------------LRQWLAQRQIAADNLTQLT---AFRHTFSHFHLD 300 L+++L + D + H F+H + Sbjct: 395 PKKQAKKTKATTKAAASSSESEHFIKLKEFLVKEGFVQDKSIPFEYHASVDHVFTHINQT 454 Query: 301 IVPM-----WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 +L +E W+ +Q S G++ ++ +Q + Sbjct: 455 NHIYSCSDKYLIKDLEDSNSEEDQWKWFANSQRKSQGISTVASKIFKQFK 504 >UniRef50_Q7VGS8 A/G-specific adenine glycosylase n=537 Tax=Helicobacter RepID=Q7VGS8_HELHP Length = 350 Score = 300 bits (769), Expect = 5e-80, Method: Composition-based stats. Identities = 109/350 (31%), Positives = 172/350 (49%), Gaps = 30/350 (8%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKT---PYKVWLSEVMLQQTQVATVIP-YFERFMAR 58 A + +L+WY K+GR LPW+ PY V++SE+MLQQTQV V+ Y+ FM Sbjct: 14 AQTWRENILNWYKKHGRVYLPWRNLSEKNAPYGVYVSEIMLQQTQVKRVLEHYYMPFMTA 73 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+T LANA + +L LW GLGYY R RN+ K+A+ P T+ E+ +LPG+G Sbjct: 74 FPTLTSLANAQEESLLKLWEGLGYYTRVRNMQKSARICCQRFNNTLPHTYAELISLPGIG 133 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 +AGAIL L ++ +DGN++RV R +A+S + L L+ + Sbjct: 134 AYSAGAILCFGLRQNVAFVDGNIRRVFCRIFALSSP-----TQKSLEELAWILLEPKYSF 188 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLL- 237 +NQA++D+GAMICT P C +CPLQ C + LYP K Q P L Sbjct: 189 DYNQALLDIGAMICTPKSPSCLICPLQQLCE--GQITPELYPTPKKSQLQPLHLHLALCI 246 Query: 238 ---LQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTF 294 + ++ E+ L R L+ GLY P+ ++EE+ T+ ++H + Sbjct: 247 SQNINNKTEIGL-LRSTQNLYRGLYNLPRLSEEEAK-----------TYGTKCGEYKHHY 294 Query: 295 SHFHLDIVPMWLPVSSF---TGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + + + + + + ++ L + L++ ++ L Sbjct: 295 TKYAITATIYHISFEDLIHISTYQKKEEITFFTLQDLQNKPLSSLCKKAL 344 >UniRef50_B6K1Z7 A/G-specific adenine DNA glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1Z7_SCHJY Length = 470 Score = 298 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 107/419 (25%), Positives = 166/419 (39%), Gaps = 90/419 (21%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQID-------KTP-----------YKVWLSEVMLQQT 43 + F Q+LD+Y+ R LPW+ D P Y+V +SE+MLQQT Sbjct: 13 EVKSFRKQLLDFYENTKR-NLPWRKDPYKPPQKDDPDDRKKLAVQRFYEVLVSEIMLQQT 71 Query: 44 QVATVIPYFERFMARFPTVTDLANAPLD-EVLHLWTGLGYYARARNLHKAAQQVATLHGG 102 +V TV Y++++M PT+ A A D EV+ LW+G+G+Y R + LH A + +A L Sbjct: 72 RVETVKRYYDKWMKTLPTILHCAQADYDTEVMPLWSGMGFYGRCKRLHSACKYLAVLPAE 131 Query: 103 KFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE 161 + P + E +A +PGVG TAGA+LS++ G ++DGNV+RVL+R A+ K + Sbjct: 132 EIPTSPERLAKNVPGVGPYTAGAVLSIAWGIPTGVVDGNVQRVLSRLLALHCNVTKGKPN 191 Query: 162 NKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANN------- 214 +W ++ + FNQA+M+LGA+ CT C CP+ N C A Sbjct: 192 AFVWQMANLLVDPNFPGNFNQALMELGAVTCTPQTFNCPGCPVSNICKAYQEKIAIEQTD 251 Query: 215 ----------------------------------------SWALYPGKKPKQTLPERTGY 234 + YP + PK Sbjct: 252 NSVRDVEDIICQSHQSISKNTCEICSLCEPDPPYTSAEIWIQSRYPLRPPKVKQRIERAI 311 Query: 235 FLLLQHED------EVLLAQRPPSGLWGGLYCFPQFADEES-------------LRQWLA 275 ++ Q ++ L+ +RP GL GL+ P E L W++ Sbjct: 312 VVVFQSQNSETDEKYFLIRKRPTGGLLAGLWDLPTVEIGEDPWPKDMQLAFDNALSSWIS 371 Query: 276 QRQIA-ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGL 333 H F+H H ++ W + + VG+ Sbjct: 372 SENRQYIKKYQSCGRSTHIFTHIHKTSHVFYVLADQKLQVQ--KEFYWISQSDLEHVGM 428 >UniRef50_C3XHP3 A/G-specific adenine glycosylase n=1 Tax=Helicobacter bilis ATCC 43879 RepID=C3XHP3_9HELI Length = 356 Score = 296 bits (759), Expect = 7e-79, Method: Composition-based stats. Identities = 107/348 (30%), Positives = 163/348 (46%), Gaps = 25/348 (7%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQI---DKTPYKVWLSEVMLQQTQVATVIP-YFERFMAR 58 A Q+ Q+L+WY GR +LPW+ + PY V++SE+MLQQTQV V YF F+ Sbjct: 22 AKQWQIQLLEWYAIQGRISLPWRNLKGENAPYGVYVSEIMLQQTQVKRVQEHYFAPFLNA 81 Query: 59 FPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 FPT+ LA A LD +L W GLGYY+RARN+ K A H P T +++ LPG+G Sbjct: 82 FPTLESLAKANLDSILKQWEGLGYYSRARNMQKTAIICCEKHNATLPNTRQDLLTLPGIG 141 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 T+GAIL + +DGN++RVL R +A+ + L L+ + Sbjct: 142 AYTSGAILCFGFHQSVSFVDGNIRRVLCRIFALREP-----NQKLLDELAFLLLDTKHSF 196 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA++DLGAMICT P C +CP+QN C + LYP K P L Sbjct: 197 DYNQALLDLGAMICTPKSPSCLICPVQNLC--NGKVNPTLYPTPKTSSLTPLTLHLLLYK 254 Query: 239 QHEDEVLL--AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + + GL+ GLY PQ + + +F+H ++ Sbjct: 255 DSQGRIAFVYEKGDKGGLYQGLYNLPQLETGAVTNK---------QGFYKCGSFKHHYTK 305 Query: 297 FHLDIVPMWL---PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLL 341 + + L + T + + + + + L++ ++ L Sbjct: 306 YAITANVYKLDSKHLEILTQSLPHIKLYFLSQKELETKPLSSLCKKAL 353 >UniRef50_A6Q1G1 A/G-specific adenine glycosylase n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q1G1_NITSB Length = 310 Score = 296 bits (758), Expect = 9e-79, Method: Composition-based stats. Identities = 93/347 (26%), Positives = 162/347 (46%), Gaps = 43/347 (12%) Query: 6 FSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVI-PYFERFMARFPTVTD 64 +L+W++K+GR LPW+ + YK++LSE+MLQQTQV+ V Y+ +F+ RFPT+ Sbjct: 1 MHKVLLEWFEKHGRHELPWRQTQDVYKIYLSEIMLQQTQVSRVEGEYYPKFLKRFPTLKA 60 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGA 124 LA A +EVL LW+GLGYY+RARNL + A+ P+ +E+ LPG+G TA A Sbjct: 61 LAQASENEVLALWSGLGYYSRARNLLQCAKICKD----TLPKEPKELMKLPGIGTYTANA 116 Query: 125 ILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAM 184 I + + + ++D N+KRV+ R +A+ E ++ ++ + ++ N A+ Sbjct: 117 ICAFAYNQPVAVVDTNIKRVIMRFFALQD-------EKEVQQKAQMILNTNEPKKHNLAL 169 Query: 185 MDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV 244 MDLG+++CT P C CP+Q C + GK K E+ +F + + Sbjct: 170 MDLGSLLCTPKNPLCDQCPIQQWCAGKESPHSF---GKTKKTKREEKVIHFGIFLQSGHI 226 Query: 245 LLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 + R L+ ++ P F+H+++ +++ Sbjct: 227 AME-RSNKNLYKNMFILPAIES----------------APKPFATFKHSYTKYNIQATLT 269 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLA----APVERLLQQLRTG 347 L N L + +A +++ ++ L+ Sbjct: 270 LL-------NKRPPNTHLIPLQALSAYPVASIALKGIKKAIKHLQED 309 >UniRef50_UPI0001746831 A/G-specific adenine glycosylase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746831 Length = 343 Score = 295 bits (756), Expect = 2e-78, Method: Composition-based stats. Identities = 112/346 (32%), Positives = 162/346 (46%), Gaps = 23/346 (6%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIP--YFERFMARF 59 ++ + + W+ GR PW+ PY + +SE+MLQQTQ++TV+ Y+ R+M RF Sbjct: 8 DPAKIAHALEAWFAVSGR-DYPWRRTTDPYAILVSEMMLQQTQISTVLDRGYYARWMERF 66 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA A EVL +W GLGYY RAR L + AQ+V + HGG FP T E V ALPGVG Sbjct: 67 PNTATLAVADEAEVLRVWEGLGYYRRARFLQQMAQRVESEHGGIFPATLEGVRALPGVGD 126 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG-VE 178 TAGA+ S + PI+DGNV RVL+R + + +E +LW+ S + A Sbjct: 127 YTAGAVCSFAHDAAAPIVDGNVARVLSRVWDDATPVDSREGMARLWTRSRALVEAAQSPR 186 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 FN A+M+LG IC S P C CP+Q C+ A P K + + T Sbjct: 187 VFNSALMELGQTICRVSAPNCGSCPVQPHCLTR---DPASLPVKSKQTVITAVTERVYFQ 243 Query: 239 QHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFH 298 E+ VLL Q GL+ P+ L + ++T + + Sbjct: 244 IGEEGVLLQQESGKRR-TGLWKLPELPALPDPPAVLHRS-------------KYTITRYK 289 Query: 299 LDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 +D+ G W LA ++ + +P + L + Sbjct: 290 VDLWVHA--PPDKAGVAPPEGLQWIPLAGLEALPMPSPYRKALNAV 333 >UniRef50_B3T7B3 Putative HhH-GPD superfamily base excision DNA repair protein n=1 Tax=uncultured marine microorganism HF4000_APKG3D20 RepID=B3T7B3_9ZZZZ Length = 349 Score = 295 bits (755), Expect = 2e-78, Method: Composition-based stats. Identities = 108/343 (31%), Positives = 163/343 (47%), Gaps = 14/343 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 S F +L WYD + R+ LPW+ + YK +SE MLQQT+VATV+PYFER++ +FP Sbjct: 16 SAFGENLLAWYDDH-RRDLPWRKQPSLYKTVVSEFMLQQTRVATVLPYFERWLKQFPDFA 74 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA+A + VL W GLGYY+RARNL K AQQ+ L K P PGVG A Sbjct: 75 ALADASEEAVLKAWEGLGYYSRARNLRKLAQQIEALE--KIPADSAAWEGFPGVGPYVAA 132 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEV-ENKLWSLSEQVTPAVGVERFNQ 182 A+ S+S G ++DGNV RVLAR ++ + KL +++++ +NQ Sbjct: 133 AVTSISFGTKAAVVDGNVVRVLARMLSIDEQFRDNATAQRKLRPIAQELLHYNRPGDYNQ 192 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 A+M+LGA +C R P CS CP C + YP + K+++ T L + + Sbjct: 193 AVMELGATVCHRRSPLCSDCPALYVCQSGQRGDPESYP-RTAKKSIERITVDRLWITCNN 251 Query: 243 EVLLAQRPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +LL + P GL P+ D ++ + +A + + S+ ++ Sbjct: 252 SLLLREAPADSKRLAGLMELPKLEDLPTVPKPGKTALLAIK--------KRSISNQSIEE 303 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 + D + L+AP R ++ L Sbjct: 304 RIYRAELPHGFNLKDCPELRLTRQRDLGEITLSAPHRRWIESL 346 >UniRef50_Q6IEF2 A/G-specific adenine glycosylase, putative n=3 Tax=Trypanosoma RepID=Q6IEF2_9TRYP Length = 510 Score = 293 bits (752), Expect = 5e-78, Method: Composition-based stats. Identities = 99/361 (27%), Positives = 157/361 (43%), Gaps = 52/361 (14%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID-----------------KTPYKVWLSEVMLQQTQVA 46 + ++W+ R+ LPW+ PY VW+SEVM QQTQ+ Sbjct: 65 ADIRQATIEWFHTNQRRDLPWRQTFSDTDACHPVVPVVTPLTDPYHVWVSEVMSQQTQMD 124 Query: 47 TVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPE 106 TVIPYF+R++A FP + LA A V+ +W+G+GYY RA L K A+ V GG P Sbjct: 125 TVIPYFKRWIAVFPDIATLARATEASVMAVWSGMGYYRRALYLKKGAEHVMKHFGGSLPT 184 Query: 107 TFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLW 165 T ++ A+PG+G T+ AI S+ + +DGNV RVL+R + P + +++ Sbjct: 185 TAAQLRAIPGIGLYTSAAIASICFRERIISVDGNVVRVLSRLRCERNFDPKSAKSIKEVF 244 Query: 166 SLSEQVT---PAVGVERFNQAMMDLGAMICTRS-KPKCSLCPLQNGCIAAANNSWALYP- 220 +++ P FNQ +M++GA +C S +P C CPL C A A P Sbjct: 245 HWGQEIMGEGPCDRPGDFNQGLMEIGARVCKPSGRPLCEECPLHRYCGAYAAMQSGEIPS 304 Query: 221 ----GKKPKQTLPERTGYFLLLQHE------------DEVLLAQRPPSGLWGGLYCFPQF 264 +TL ++ + L HE ++ +RP GL GG+ FP Sbjct: 305 IEGVIPLRAKTLKKKKEHVFCLVHEFREKTAGDSALKRRFIVVRRPEEGLLGGMLEFPSR 364 Query: 265 A-------------DEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSF 311 + + + N+ ++ +H FSH + ++ Sbjct: 365 TYVSSPDGGTDIKGSRDVTDELRKKLAAGHKNVVEVGHVQHIFSHIDMRVLVYHAVWDGP 424 Query: 312 T 312 + Sbjct: 425 S 425 >UniRef50_B1ZMV1 HhH-GPD family protein n=2 Tax=Verrucomicrobia RepID=B1ZMV1_OPITP Length = 344 Score = 290 bits (744), Expect = 4e-77, Method: Composition-based stats. Identities = 106/347 (30%), Positives = 159/347 (45%), Gaps = 17/347 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 ++F + +L WY + R+ LPW+ + + YK +SE MLQQTQV TV+PY++R++A P Sbjct: 12 AEFQSALLGWYRAHQRR-LPWRTEPSLYKTVVSEFMLQQTQVKTVLPYYDRWLAVLPNFA 70 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LA A +VL LW GLGYY+RARNLH+ +Q + L K P T E LPGVG TA Sbjct: 71 ALAGASEAQVLKLWEGLGYYSRARNLHQLSQAIHALP--KPPRTPEAWRELPGVGPYTAA 128 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWP-GKKEVENKLWSLSEQVTPAVGVERFNQ 182 AI S++ G +DGNV R+LAR A L++ + NQ Sbjct: 129 AITSIAFGAPAACVDGNVVRILARLTANRTLFRDSASAAKTFTPLAQALLSPSAPGDHNQ 188 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHED 242 AMM+LGA +C R P C CP++ C AA YP PK+ +RT + Q + Sbjct: 189 AMMELGATVCVRQNPLCLTCPVRPFCAAARAGDPESYPRLAPKKM-EKRTVTRVWCQRDG 247 Query: 243 EVLLAQRPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDI 301 +LL + + ++ P D +R + F + Sbjct: 248 ALLLHRAADDARRFARMHELPTAEDAGFDPARFEKRAPLVVKRR-------GITRFQITE 300 Query: 302 VPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 +W +A+ ++ L+ P R + ++ + Sbjct: 301 SIHLA----PPPRKLGAGFVWVPVAELHAITLSGPHRRWITEILARS 343 >UniRef50_Q7M9R1 A/G-SPECIFIC ADENINE GLYCOSYLASE EC 3.2.2 n=1 Tax=Wolinella succinogenes RepID=Q7M9R1_WOLSU Length = 320 Score = 287 bits (734), Expect = 6e-76, Method: Composition-based stats. Identities = 98/338 (28%), Positives = 149/338 (44%), Gaps = 34/338 (10%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATV-IPYFERFMARFPTVTDL 65 +L WY GR LPW+ PY++ +SE+MLQQTQV TV ++ F+ RFPT+ + Sbjct: 6 QTSLLLWYQANGRHHLPWRHPLHPYEILISEMMLQQTQVNTVLERFYYPFLERFPTLESI 65 Query: 66 ANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 A A E+L W GLGYY+RARNLH A+ P + E+ LPG+G TA AI Sbjct: 66 ARAEESEILLAWRGLGYYSRARNLHALAKTCQQ----GLPRSVSELEGLPGIGAYTARAI 121 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 + ILDGN+KR+L+R +A+ G E +LW +E+ + NQA++ Sbjct: 122 ACFGFRESVAILDGNIKRILSRFFALLG-----VGERELWRRAEEFLNPLAAFDHNQALL 176 Query: 186 DLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 D+GA++C P C CPL C + P K + E ++ + E V Sbjct: 177 DVGALLCKPKNPLCQECPLSPWCKG-KEDPLRYTPSKTRR--YEELELFYGICIQEGRVA 233 Query: 246 LAQRPPSGLWGGLYCF-PQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPM 304 + Q L+ GL+ F P L +H ++ + + Sbjct: 234 MVQ-SQERLYKGLWGFVPLLGAPLDSSS--------------LGMIKHGYTKYKITAHLY 278 Query: 305 WLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 + + W L + + L+ +L Sbjct: 279 EIR-----SLPGDSKPSWIPLGELEHLPLSILAHKLWS 311 >UniRef50_C6HWU0 Putative A/G-specific DNA glycosylase n=1 Tax=Leptospirillum ferrodiazotrophum RepID=C6HWU0_9BACT Length = 359 Score = 283 bits (725), Expect = 6e-75, Method: Composition-based stats. Identities = 128/347 (36%), Positives = 184/347 (53%), Gaps = 17/347 (4%) Query: 10 VLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAP 69 +L WYD G + LPW+ PY++W+SE+MLQQT VATV+ +++RF++RFP V LA A Sbjct: 4 LLAWYDA-GHRDLPWRQGCDPYRIWVSEIMLQQTTVATVLKFYDRFLSRFPDVGALAGAD 62 Query: 70 LDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLS 129 L EVL W+GLGYY RARNLH+AA+ VA G FPE+ A LPG+GRSTAGAI S+S Sbjct: 63 LTEVLRFWSGLGYYQRARNLHRAARIVAER--GSFPESLAGWAELPGIGRSTAGAIFSIS 120 Query: 130 LGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV-GVERFNQAMMDLG 188 PILD NV+RV+ R +AV +K+ E +LW LS+ NQA+M+LG Sbjct: 121 RNLWAPILDANVRRVVERFFAVG--KEEKKREARLWELSDSFGRENPRPGDTNQALMELG 178 Query: 189 AMICTRSKPKCSLCPLQNGCIAAA---NNSWALYPGKKPKQTLPERTGYFLLLQHEDEVL 245 A +C + P+CS+CPL++ C + L ++ ++ P R +LL E +L Sbjct: 179 ATVCLPASPRCSICPLRSSCRSRDPEPERVILLPERRRERRPAPPRRDRLVLLPAEGPLL 238 Query: 246 LAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAA--DNLTQLTAFRHTFSHFHLDIVP 303 R L GL+ A L A +L HT+SHF + Sbjct: 239 FVPRGEGRLLEGLFDLYGIAASSIPPGALLDEGPYAGFRVAEELFTVTHTYSHFREVVHV 298 Query: 304 M----WLPVSSFTGCMDEGNALWYNLAQPPS--VGLAAPVERLLQQL 344 + S+ G M + L++ + L +++L++L Sbjct: 299 VGLVGRGTGSAGEGAMAGDLGVVLPLSRALEGLIPLTGVAKKILREL 345 >UniRef50_Q6MHU9 A/G-specific adenine glycosylase n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MHU9_BDEBA Length = 333 Score = 283 bits (725), Expect = 7e-75, Method: Composition-based stats. Identities = 116/339 (34%), Positives = 176/339 (51%), Gaps = 23/339 (6%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 Q+ WY+K R +LPW+ +K PY++WLSEVMLQQT V VIPYFE+F+ +FPTV DLA Sbjct: 17 HKQLTQWYNKNKR-SLPWRENKNPYRIWLSEVMLQQTTVVAVIPYFEKFLQKFPTVQDLA 75 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 NAP +V+ W GLGYY+RARNLHKAA+ +A FP+T E+ LPG G T+ A+ Sbjct: 76 NAPEADVMEAWAGLGYYSRARNLHKAAKALAAG---GFPKTAAELLELPGFGPYTSRAVA 132 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 S++ G+ +LDGNV RVL+R Y + + + L +S++++ + NQ +M+ Sbjct: 133 SIAFGEKVGVLDGNVIRVLSRRYGLKLEWWNGKGRDHLQKISDELSLLGQADVVNQGLME 192 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LGA +CT K C LCP C++ N P KKP++ L+ +V L Sbjct: 193 LGATVCTPQKVMCMLCPWAATCVSREKNLVEKLPLKKPRKESEVWVWKPLVAIKNQKVAL 252 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWL 306 Q + G FP E N + +H +H + I Sbjct: 253 VQNDYAPFLKGQMIFPGEISME-------------KNKPKAYDAKHNITHHDIFIQI--- 296 Query: 307 PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 + + N W +L + V ++ ++++L +++ Sbjct: 297 ---TEKKSLTGKNLQWVDLKELKKVNPSSLLQKVLHKVK 332 >UniRef50_Q1ZXP7 Putative uncharacterized protein MYH n=1 Tax=Dictyostelium discoideum RepID=Q1ZXP7_DICDI Length = 574 Score = 283 bits (724), Expect = 7e-75, Method: Composition-based stats. Identities = 84/230 (36%), Positives = 128/230 (55%), Gaps = 9/230 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 + + +L WY+K R LPW+ Y+VW+SE+MLQQT+VATVI YF + Sbjct: 95 EIQEIRESMLGWYEKNKR-DLPWRKHDNSLDENVIAYRVWVSEIMLQQTRVATVIEYFNK 153 Query: 55 FMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAAL 114 ++ ++PT+ DLA+ ++EV +W+GLGYY RA+NL ++ V K P + + + Sbjct: 154 WIEKWPTINDLASTTIEEVNKVWSGLGYYRRAKNLWLGSKYVVDNFNSKIPSDVKSLLEI 213 Query: 115 PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA 174 G+G TAGAI S++ K P++DGNV RVL+R ++ P W L + + Sbjct: 214 NGIGPYTAGAISSIAFNKPVPLVDGNVIRVLSRVRSIGANPKLSSTVKLFWELGNDLVES 273 Query: 175 -VGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 FNQ++M+LGA IC+ P+C CP+Q+ C A + P K Sbjct: 274 VENPCNFNQSLMELGATICSVQSPQCKQCPIQSNCQAYQQEKQFIKPEPK 323 Score = 47.1 bits (111), Expect = 0.001, Method: Composition-based stats. Identities = 13/88 (14%), Positives = 29/88 (32%), Gaps = 6/88 (6%) Query: 264 FADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDE-----G 318 E + + + I ++ + H FSH + + P + Sbjct: 487 IEKEILKKLFKSSTDITIKSVKSIGTVLHKFSHINQTLTVYDCPCDFKKEPIPNDKSKTK 546 Query: 319 NALWYNLAQPP-SVGLAAPVERLLQQLR 345 N W L + SVG++ + + ++ Sbjct: 547 NICWLKLDEIQSSVGVSKQMLKCFDLIK 574 >UniRef50_Q1V0L8 A/G-specific adenine glycosylase n=3 Tax=Candidatus Pelagibacter RepID=Q1V0L8_PELUB Length = 326 Score = 282 bits (722), Expect = 1e-74, Method: Composition-based stats. Identities = 92/349 (26%), Positives = 153/349 (43%), Gaps = 29/349 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID----KTPYKVWLSEVMLQQTQVATVIPYFERFM 56 M S + ++L WYD R +LPW+ + K Y +SE MLQQTQVATVIPYF F+ Sbjct: 1 MTKSILTKKILYWYDNNKR-SLPWRKELSQVKREYFTLVSEFMLQQTQVATVIPYFNNFI 59 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 P + L+ ++L W GLGYY+R +NL K AQ + + P T +E+ LPG Sbjct: 60 KDIPDIKSLSKIKEHKLLKYWEGLGYYSRVKNLKKTAQILEKNFNRRLPNTIDELKLLPG 119 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T+ AI++++ K F LDGN++RV+ R + K+ ++ L + + + Sbjct: 120 IGDYTSNAIMAIAFNKPFIPLDGNIERVIKRLLNLKLP--KEITKDNLVKNKKILGNSTR 177 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFL 236 + QA+M+LGA+IC P C CPL C + + K ++ ++ Sbjct: 178 ASDYAQALMELGALICRPKNPLCHQCPLIKNCKSFKKKD---FEIIKERKKKIDKFYLLE 234 Query: 237 LLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSH 296 + + ++ LL + L FP + + + FS Sbjct: 235 VYKKNNKFLLIKNTKFNFLKNLQIFPMKQISKPDN---LAKALNFKISNMNMNVIIKFSK 291 Query: 297 FHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 +I N+ W + ++ L +++ L+ Sbjct: 292 LKGEI----------------SNSSWIDSSKLEDYTLPTFTKKIFNYLK 324 >UniRef50_A6DT67 Adenine glycosylase n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DT67_9BACT Length = 357 Score = 280 bits (718), Expect = 4e-74, Method: Composition-based stats. Identities = 100/359 (27%), Positives = 167/359 (46%), Gaps = 18/359 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + ++ W LPW+ + Y+V +SEVMLQQT VATV+P +E F +FP Sbjct: 4 NIEKDFQDLVSWSKDNY-SHLPWRSEVRDLYRVLVSEVMLQQTTVATVLPRYESFFEKFP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 + LANA +++ W GLGYY RA+NL+KA + GG+FP+ EE+ +PGVG Sbjct: 63 DLASLANADENDLALAWKGLGYYRRAQNLYKAVTMI-HQSGGEFPDGEEELQKVPGVGPY 121 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVS---GWPGKKEVENKLWSL-SEQVTPAVG 176 TA A+ ++ + +DGN++RVL+R + + G +K V + + + + G Sbjct: 122 TAAALTAIGRNQLALAVDGNLQRVLSRYFFIEVEQGPKLQKAVHSLIQNKTFAKTLELCG 181 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTL---PERTG 233 +FN+A+MDLG IC PKC CPLQN C A + L P KK KQ + + Sbjct: 182 PRKFNEALMDLGRAICKPRNPKCGECPLQNSCEARLKHKVELVPVKKEKQKVAISELKLV 241 Query: 234 YFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHT 293 FL+ + E+L ++ + P E ++ N + Sbjct: 242 RFLVKNSQGEMLFYKKRQGEWLKDQWELPTAIVESEDQKIKQYPFANVANYSD-EQITTA 300 Query: 294 FSHFHLDIVPMWLPVSSFT----GCMDEGNALWYNLAQPPSVGLAAPVERLLQQLRTGA 348 + + I W + G +++ + + + +++ E++L + + Sbjct: 301 ITRYR--IRNYWKEMEELDFRDLGILNDRDFAFLPWGKTQEH-ISSASEKVLTLINGRS 356 >UniRef50_Q2IJ90 A/G-specific DNA-adenine glycosylase n=5 Tax=Cystobacterineae RepID=Q2IJ90_ANADE Length = 403 Score = 280 bits (717), Expect = 6e-74, Method: Composition-based stats. Identities = 112/341 (32%), Positives = 166/341 (48%), Gaps = 16/341 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDK---TPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + ++L W+D G + LPW+ + PY+VWL+EVMLQQTQVAT PY+ RF+ R+P Sbjct: 25 AALRRRLLAWWDA-GHRPLPWRQPQRGADPYRVWLAEVMLQQTQVATATPYWLRFVERWP 83 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA A ++VL W+GLGYYAR RNL AA++ HGG P + E + ALPG G Sbjct: 84 TLEALAAARDEDVLAAWSGLGYYARCRNLLAAAREALRRHGG-LPSSHEALRALPGFGPY 142 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ S++ P +DGNV RVL+R + V G P + ++ +L+ + Sbjct: 143 TAGAVASIAFAAPVPAVDGNVTRVLSRLFLVEGDPAARGARARVAALAAALVDRERPGDL 202 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQA+M+LGA +C P C+ CP+ C A A A P + + + ++ Sbjct: 203 NQALMELGATVCRPR-PDCARCPVAARCAARAAGRAAELPPPRRRPEKVRQVLACAVVVR 261 Query: 241 EDEVLLAQRPPSGLWGGLYCFPQFADEESL-------RQWLAQRQIAADNLTQLTAFRHT 293 E V L +RP +GL+ GL FP R + + +L Sbjct: 262 EGRVALVRRPAAGLFAGLAAFPAAEVAPGDAAGAALERYARTRHGLRLRAGEELARVERV 321 Query: 294 FSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLA 334 +H L++ + ++ E W GL Sbjct: 322 LTHRLLELRALRCSLARPPP---EEGIRWVPADALDGAGLP 359 >UniRef50_A6Q673 A/G-specific adenine glycosylase n=2 Tax=Epsilonproteobacteria RepID=A6Q673_SULNB Length = 326 Score = 280 bits (716), Expect = 7e-74, Method: Composition-based stats. Identities = 106/345 (30%), Positives = 164/345 (47%), Gaps = 34/345 (9%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATV-IPYFERFMARF 59 + + ++ WY+ YGR LPW+ PY ++LSEVMLQQTQV TV Y+ F+ F Sbjct: 9 VSLKEIHQKIRTWYEAYGRHDLPWRSTNNPYHIYLSEVMLQQTQVKTVLERYYFPFLQAF 68 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 ++ L NAPLD+VL W GLGYY RA+NLH+ A V P +E+ LPG+G+ Sbjct: 69 SSLEALGNAPLDDVLKQWEGLGYYNRAKNLHRTAGLVNE-----LPPEIDELVKLPGIGK 123 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 +TA AI + + + P+++ NVKR+L R + + E KLW + Sbjct: 124 NTAHAIAAFAFHQPVPVMEANVKRILCRMHRLRTP-----NEKKLWKFAYASVDKEDPFN 178 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 +NQAMMD+GA +C P+C+ CPL+N C N+ YP KK K+T+P R ++ Sbjct: 179 YNQAMMDIGATLCLPKNPQCNRCPLENIC--KGKNNPEYYPLKK-KKTVPAREENIVIYL 235 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 ++D + L QR GL+ F +I + H ++HF L Sbjct: 236 YDDRLSLQQR-SGKFLHGLWGF-------------ESVEIPPCAAEYIGEVTHAYTHFKL 281 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQL 344 ++ + + ++ E+++ Sbjct: 282 KCRVYLY------FESSPEQEYYFTPEKIGKLAISKVDEKIVNLY 320 >UniRef50_B9KEL2 A/G-specific adenine glycosylase n=3 Tax=Campylobacter RepID=B9KEL2_CAMLR Length = 342 Score = 278 bits (712), Expect = 2e-73, Method: Composition-based stats. Identities = 93/362 (25%), Positives = 162/362 (44%), Gaps = 47/362 (12%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKT-------------------PYKVWLSEVMLQQTQ 44 + +L WY++ GRK+LPW+I Y V++SE+MLQQTQ Sbjct: 2 QKIHKSILKWYEQNGRKSLPWRILHEEYRKYGSEDDLKKLKNIDIAYAVYVSEIMLQQTQ 61 Query: 45 VATVIP-YFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGK 103 V +V+ Y+ +F+A+FP++ L+ A DEVL W GLGYY RARN+HK A+ + Sbjct: 62 VKSVLQNYYFQFLAQFPSLKALSMASEDEVLKAWQGLGYYTRARNMHKCAKICVQEFDAR 121 Query: 104 FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENK 163 P +E+ LPG+G TAGAI + +D N+KRVL+R Y + Sbjct: 122 LPLEIKELQKLPGIGEYTAGAIACFGFLQAKSFVDANIKRVLSRFYGLQNPNS-----KI 176 Query: 164 LWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKK 223 L +++ NQA++D+GA++C KC +CPL+ C Sbjct: 177 LVQKAKEFLNYNNAFEHNQALLDIGALVCLPKNAKCEICPLKCFCTGKDEYEKFHV---S 233 Query: 224 PKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADN 283 K L++Q ++ LL + L+ +Y F ++ +++ + Sbjct: 234 KKTQYENIVLKILIVQKNEQFLLVK-SKEKLYFNMYNFLEYKNQKKAK------------ 280 Query: 284 LTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 + F+H+++ + ++ +L F + ++ + L+ + L+ Sbjct: 281 --FIGEFKHSYTKYKINAKVYFLKDDDFDDLKVKA----FSYKDLEYLALSKLALKTLEL 334 Query: 344 LR 345 + Sbjct: 335 FK 336 >UniRef50_Q4Q851 A/G-specific adenine glycosylase, putative n=3 Tax=Leishmania RepID=Q4Q851_LEIMA Length = 501 Score = 275 bits (703), Expect = 2e-72, Method: Composition-based stats. Identities = 101/380 (26%), Positives = 157/380 (41%), Gaps = 70/380 (18%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID-------------------------KTPYKVWLSEV 38 +V+ W+ ++ R+ LPW+ PY+VW+SEV Sbjct: 29 RSVQEEVIAWWRQHQRQDLPWRQTLPRGSGDLSGAVDGEKHRTSATTARYDPYQVWVSEV 88 Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 M QQT++ TVIPY+ +M RFP++ LA + DEV +W G+GYY RA L K A+ + Sbjct: 89 MSQQTRMETVIPYYVAWMKRFPSIEALAASTEDEVKSVWAGMGYYRRAMYLRKGAKYLLE 148 Query: 99 LHGGK-------FPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAV 151 K P + EE+ +PG+G T+ AI S+ GK +DGNV RVL+R Sbjct: 149 RSKEKEGTAAVCMPSSQEELLKVPGIGPYTSAAITSMCFGKPVCSVDGNVIRVLSRLRGA 208 Query: 152 SG-WPGKKEVENKLWSLSEQVT---PA------VGVERFNQAMMDLGAMICTRS-KPKCS 200 P + + +++ P NQ +M+LGA +C S P C+ Sbjct: 209 RDFDPKVPANVKEAAAWGQRLMGNSPTTSAVVCQDPSALNQGLMELGASVCRPSGAPLCT 268 Query: 201 LCPLQNGCIAAA-------NNSWALYPGKKPKQTLPERTGYFLL-------LQHEDEVLL 246 CPLQ C A+A + P + K ++ + ++ Sbjct: 269 SCPLQRFCRASALLRGGDIEAIEGVIPVRAAKMAKRSARKLCVVHEMSASRIGDARRFVV 328 Query: 247 AQRPPSGLWGGLYCFPQF----ADEESLRQWLAQRQIAADNL---------TQLTAFRHT 293 +RP +GL GG+ FP AD+E L + ++ RH Sbjct: 329 VRRPANGLLGGMLEFPTVNASAADDEEAVVPLLKHPLSMLAWKARARPTFARLCGNVRHI 388 Query: 294 FSHFHLDIVPMWLPVSSFTG 313 FSH +D+ + + Sbjct: 389 FSHIVMDVGVAHVQWPADAD 408 >UniRef50_Q1J2G1 A/G-specific adenine glycosylase n=3 Tax=Deinococcus RepID=Q1J2G1_DEIGD Length = 343 Score = 274 bits (701), Expect = 4e-72, Method: Composition-based stats. Identities = 107/315 (33%), Positives = 156/315 (49%), Gaps = 30/315 (9%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQ----IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 S +L W+D+ GR LPW+ + PY+VW+SEV+LQQTQVA YFERF+ F Sbjct: 7 SSLRTVLLAWFDRAGR-ALPWRVGPEGRRDPYRVWVSEVLLQQTQVARGRVYFERFLEAF 65 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PTV LA+AP++ VL W G GYYARARNLH+AA +A P T++ ALPGVG Sbjct: 66 PTVEALADAPIEAVLKAWEGCGYYARARNLHRAAGVMARE---GVPTTYDGWRALPGVGP 122 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ SL+ G+ + DGNV+RVLAR + + ++++ Sbjct: 123 YTAAAVASLACGEARAVNDGNVRRVLARLHGERQP-----TAAWAQARADELLDPARPGA 177 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 N+A+MDLGA +CT P+C CPL+ C A A +P K + + E LLL Sbjct: 178 CNEALMDLGATVCTPKAPQCGECPLRRWCAAFQLGDPAAFPAPKVRAAVREVRAVALLLG 237 Query: 240 HEDEVLLAQRP----------PSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTA 289 E +L +R P+ + + + + ++ A +L Sbjct: 238 DAQEAVLERRAGSLLGGLMGLPTEML-------EAGESKVDALTRLAARLGATVAGELGT 290 Query: 290 FRHTFSHFHLDIVPM 304 H+ +H + + Sbjct: 291 VTHSMTHRRVTLTVY 305 >UniRef50_C4PWX5 A/g-specific adenine dna glycosylase (Ec 3.2.2.-) (Muty homolog)(Mmyh)-related n=1 Tax=Schistosoma mansoni RepID=C4PWX5_SCHMA Length = 302 Score = 273 bits (699), Expect = 6e-72, Method: Composition-based stats. Identities = 101/295 (34%), Positives = 144/295 (48%), Gaps = 53/295 (17%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQI---DKTP----------------YK------VWLS 36 Q + +L WYD+ R LPW+ + P + VW+S Sbjct: 11 QIEKLRESLLLWYDRSKR-DLPWRRMALNPDPNLRGYAGMFDHSLDFVFHILYMCSVWVS 69 Query: 37 EVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQV 96 EVMLQQTQV TVI Y++R+M ++P+V LA+A LD+V LW+GLGYY+RAR LHK AQ++ Sbjct: 70 EVMLQQTQVKTVIDYYDRWMKKWPSVDQLASASLDDVNSLWSGLGYYSRARLLHKGAQKI 129 Query: 97 ATLHGGKFPETFEEVA-ALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 G FP++ E + ++PGVGR TAGAI S++ + P+LDGNV RVL R + Sbjct: 130 VDEFNGIFPQSAEVLKHSIPGVGRYTAGAIASIAFNQCTPVLDGNVIRVLTRLRQIGSPV 189 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQ---------- 205 LW+L+ ++ FNQA+M+LGA+ CT P C CPL Sbjct: 190 QLPTTMEYLWNLTTKLVDPNRPGDFNQALMELGAVCCTPKNPDCIKCPLNKANVNKNDYI 249 Query: 206 ----NGCI-------AAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQR 249 C + YP K K+ ++ + +L +R Sbjct: 250 STDLEDCRLCISSSVYQKSLGVMNYPVKLNKREPRKQNTVI-----QSHILRERR 299 >UniRef50_C0ACJ1 HhH-GPD family protein n=1 Tax=Opitutaceae bacterium TAV2 RepID=C0ACJ1_9BACT Length = 355 Score = 272 bits (697), Expect = 1e-71, Method: Composition-based stats. Identities = 94/305 (30%), Positives = 135/305 (44%), Gaps = 12/305 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + F +L WY R+TLPW+ + YK +SE MLQQTQV TV+PYF R++ FP Sbjct: 14 TRASFQHALLAWYHAS-RRTLPWREAPSLYKTVVSEFMLQQTQVKTVLPYFARWLDAFPD 72 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 LA AP + VL W GLGYY RARNLHK A+ + L + P ++ ALPG+G T Sbjct: 73 FAALAAAPEERVLKHWEGLGYYTRARNLHKLARALVALP--EIPRRPDDWLALPGIGPYT 130 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP-GKKEVENKLWSLSEQVTPAVGVERF 180 + AI S++ G+ ++DGNV R+L R A L+ + Sbjct: 131 SAAITSIAFGEPVAVVDGNVVRILTRLTADGTPFRDSATASKHFVPLANAIVNPAHPGDH 190 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 NQAMM+LGA +C R P C++CP+ C A YP K + Sbjct: 191 NQAMMELGATMCHRQNPLCTVCPVIRFCAARREGDPETYPRLAAK-VITAINVVRAWCVR 249 Query: 241 EDEVLLAQRPPS-GLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 + +LL + ++ P + LA L + + + + Sbjct: 250 DGRLLLHRAAAGSRRLANMHELPSAEHLGLTPEQLAAAGPL------LAKKKRGITRYAI 303 Query: 300 DIVPM 304 Sbjct: 304 TENIH 308 >UniRef50_D1Y8G9 A/G-specific DNA-adenine glycosylase n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y8G9_9BACT Length = 365 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 106/354 (29%), Positives = 170/354 (48%), Gaps = 17/354 (4%) Query: 1 MQASQF----SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFM 56 M A +F + + WY+ + R LPW++D+ PY + +SE MLQQTQV V ++ R+M Sbjct: 1 MSAEKFHLEAAEALTAWYNSHKR-DLPWRLDRDPYHILVSEAMLQQTQVERVKSFYARWM 59 Query: 57 ARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 RFPT+T LA+A D+VL W GLGYY+RARNL +AA+ V+ P E + +LPG Sbjct: 60 ERFPTLTSLASASEDDVLACWQGLGYYSRARNLRRAARLVSGAGLKTLPADEEFLRSLPG 119 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T GA+ S++ P +DGNV+RV +R + P + + + + + + Sbjct: 120 LGPYTVGAVCSIAFDLPVPAIDGNVRRVFSRLLDMPDDPARAKGTALIAAHAAAILKLGS 179 Query: 177 VERFNQAMMDLGAMICTR-SKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYF 235 QA M+LGA +CT + +C CP+ C A A + A P + + R G Sbjct: 180 PHILTQAFMELGATVCTPGTTCQCGQCPVSRLCAAQAAGTQAQRPVSTRRNVVERRRGAA 239 Query: 236 LLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEES------LRQWLAQRQIAADNLTQLTA 289 LL+ + +RP GLW Y P E L + A+ ++ A L Sbjct: 240 LLILVPHGCAVRRRPAGGLWSRFYEIPWLRGEADESAESCLSRLTAELRLTAPCLDLELE 299 Query: 290 FRHTFSHFHLDIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQ 343 F+ + + + + D ++ + + A ++RL+Q+ Sbjct: 300 ETLKFTCWQVRLHLWSCRMLQPPAGCDA-----FSADELADFPMPAGLKRLVQK 348 >UniRef50_A8I815 DNA repair glycosylase (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8I815_CHLRE Length = 793 Score = 271 bits (694), Expect = 3e-71, Method: Composition-based stats. Identities = 101/282 (35%), Positives = 142/282 (50%), Gaps = 47/282 (16%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT----------------------------PYKV 33 + QF ++L WYD R LPW+ Y V Sbjct: 65 ERVQFRRELLSWYDGNHR-VLPWRRTPHTQRGAAADGEGGEDGVGPAPAELPPQQFAYWV 123 Query: 34 WLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAA 93 W+SEVMLQQTQVATVIPYF R+++R+PTV+DLA A + V +W GLGYY RAR L + A Sbjct: 124 WVSEVMLQQTQVATVIPYFRRWVSRWPTVSDLAAADTEAVNSMWAGLGYYRRARYLLEGA 183 Query: 94 QQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSG 153 + VA GG FP + +E+ +PGVG T+ A+ S++ G +DGNV RVL+R A+ G Sbjct: 184 KFVAGQLGGTFPTSAQELLKIPGVGPYTSAAVASIAFGSPAAAVDGNVIRVLSRLRALPG 243 Query: 154 WPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 P K + +++ +V +NQA+M+LGA +C P CS CP + C AAA Sbjct: 244 DPTKLGAAHT--AMAGEVLDGGRPGCYNQALMELGATVCRPVNPDCSACPARPVCRAAAE 301 Query: 214 ----------------NSWALYPGKKPKQTLPERTGYFLLLQ 239 +PG+K + E+ +L+ Sbjct: 302 WTAYVEGGGDEGAEDAPRVTRFPGRKAVKEKREQAVAVTVLE 343 Score = 47.8 bits (113), Expect = 5e-04, Method: Composition-based stats. Identities = 13/46 (28%), Positives = 18/46 (39%) Query: 228 LPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQW 273 + LL + E LL RP GL GL+ FP ++ Sbjct: 424 KSDVDVSALLAKGERRYLLVMRPEGGLLAGLWEFPGAEVQDYTPNA 469 >UniRef50_Q9SU12 Adenine DNA glycosylase like protein n=1 Tax=Arabidopsis thaliana RepID=Q9SU12_ARATH Length = 608 Score = 267 bits (684), Expect = 4e-70, Method: Composition-based stats. Identities = 105/414 (25%), Positives = 158/414 (38%), Gaps = 94/414 (22%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-------PYKVWLSEVMLQQTQVATVIPYFER 54 + + +LDWYD R LPW+ ++ Y+VW+SE+MLQQT+V TV+ Y++R Sbjct: 127 ETQKIRMGLLDWYDVNKR-DLPWRNRRSESEKERRAYEVWVSEIMLQQTRVQTVMKYYKR 185 Query: 55 FMARFPTVTDLANAPLD-------------------EVLHLWTGLGYYARARNLHKAAQQ 95 +M ++PT+ DL A L+ EV +W GLGYY RAR L + A+ Sbjct: 186 WMQKWPTIYDLGQASLENLIVSRSRELSFLRGNEKKEVNEMWAGLGYYRRARFLLEGAKM 245 Query: 96 VATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWP 155 V G FP + + G+G+ TAGAI S++ + Sbjct: 246 VVAGTEG-FPNQASSLMKVKGIGQYTAGAIASIAFNE----------------------V 282 Query: 156 GKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSK------PKCSLCPLQNGCI 209 + L+ Q+ FNQ++M+LGA +CT SK P S C + Sbjct: 283 SQFHFFQSSMKLAAQLVDPSRPGDFNQSLMELGATLCTVSKPSCSSCPVSSQCRAFSLSE 342 Query: 210 AAANNSWALYPGKKPKQTLPERTGYFLLL--------QHEDEVLLAQRPPSGLWGGLYCF 261 S YP K K +L Q +L +RP GL GL+ F Sbjct: 343 ENRTISVTDYPTKVIKAKPRHDFCCVCVLEIHNLERNQSGGRFVLVKRPEQGLLAGLWEF 402 Query: 262 PQFADEESLRQWLAQRQI------------------AADNLTQLTAFRHTFSHFHLDIVP 303 P E + I + +L F H F+H + Sbjct: 403 PSVILNEEADSATRRNAINVYLKEAFRFHVELKKACTIVSREELGEFVHIFTHIRRKVYV 462 Query: 304 MWLPVSSFTGCMDEGNAL------W--YNLAQPPSVGLAAPVERL----LQQLR 345 L V G D W + ++GL + V ++ LQ ++ Sbjct: 463 ELLVVQLTGGTEDLFKGQAKDTLTWKCVSSDVLSTLGLTSAVRKVPPFRLQHIK 516 >UniRef50_C9M6M4 A/G-specific adenine glycosylase n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6M4_9BACT Length = 354 Score = 264 bits (676), Expect = 3e-69, Method: Composition-based stats. Identities = 100/336 (29%), Positives = 155/336 (46%), Gaps = 3/336 (0%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 S + +WY + GR LPW++ PY VW+SE MLQQTQV+ V+P + +++ FP LA Sbjct: 13 SRALAEWYRENGRH-LPWRLTAGPYAVWVSETMLQQTQVSRVVPLYLAWLSAFPNCRSLA 71 Query: 67 NAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 A VL LW GLGYY+RARN+ +A+ + + P +A LPG G TA A+ Sbjct: 72 EADESAVLSLWRGLGYYSRARNMLASARLIVSAGYDGPPNDQTFLAKLPGFGPYTAAAVR 131 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 +L+ + LDGN++RV +R + P E E + A+MD Sbjct: 132 ALAYDEPTAALDGNLRRVSSRLTDLDKDPALSEGNKVCQRAVESLMKFQSPRLLTNALMD 191 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLL 246 LG+ C P+C LCPL+ C+A + AL P ++ + + R G LL + ++ + Sbjct: 192 LGSGPCAPR-PRCLLCPLEPYCLARKRGTTALRPVRRAARPIRRRFGAALLFSTQTKLAV 250 Query: 247 AQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHLDIVPMWL 306 QRP GLW + P + + R++AA + T S + + Sbjct: 251 RQRPKGGLWSEFWEIPWEVGSDGEDSGVTARRLAASLGEESFPEPLTPS-VTMRFTSWQV 309 Query: 307 PVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQ 342 F + S+ + + RLL+ Sbjct: 310 ETKLFRAASQPTGTELLPVDDALSLPMPLGILRLLR 345 >UniRef50_C8NK18 A/G-specific adenine glycosylase n=8 Tax=Corynebacterium RepID=C8NK18_COREF Length = 308 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 85/253 (33%), Positives = 123/253 (48%), Gaps = 4/253 (1%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTP-YKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + + VL W+ R L W+ TP + + LSEVM QQT VA V P + +MA + Sbjct: 16 MSYNALQSSVLRWFRLNAR-DLAWRDPDTPAWGILLSEVMSQQTPVARVEPIWLEWMATW 74 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT A A DEVL W LGY RA L + A+ + HGG+ P+T +E+ ALPG+G Sbjct: 75 PTPQAFAEASTDEVLRAWGKLGYPRRALRLLECARVIVDKHGGRVPDTVDELLALPGIGD 134 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ + + G++ P++D NV+RV R AV G + K + PA Sbjct: 135 YTARAVAAFAFGQNVPVVDTNVRRVYRR--AVEGRFLQGTASKKELVDVAAILPADSGPE 192 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 F+ +M+LGA+ICT + PKC+ CPL C + A + + Q Sbjct: 193 FSAGIMELGALICTATSPKCASCPLLELCEWQRLDCPAPSEEELASAKKRVQKFTGTDRQ 252 Query: 240 HEDEVLLAQRPPS 252 ++ R Sbjct: 253 VRGLIMDVLRGAE 265 >UniRef50_C9RK91 HhH-GPD family protein n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RK91_FIBSS Length = 384 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 109/372 (29%), Positives = 161/372 (43%), Gaps = 31/372 (8%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQID-----KTPYKVWLSEVMLQQTQVATVIPYFERF 55 M ++ +W+ LPW+ + PY VW+SE MLQQTQV+TV YF R+ Sbjct: 14 MTPDSL-KRLREWFRANA-AELPWRPAGLDAPRDPYAVWISETMLQQTQVSTVRDYFTRW 71 Query: 56 MARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGG----KFPETFEEV 111 M RFP V LA A EV W GLGYY+RARN+ K A+ VA L K PET +E+ Sbjct: 72 MKRFPDVEALAKAEEAEVFKYWQGLGYYSRARNILKTAKIVAALRQAQEPCKMPETRKEL 131 Query: 112 AALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVE-----NKLWS 166 ALPG+G TAGAILSL+ + ILDGN+ R+ +R Y + P K W Sbjct: 132 EALPGIGAYTAGAILSLAYHQREAILDGNLVRIFSRLYELDFLPTDKGSANKNCTEIYWE 191 Query: 167 LSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQ 226 + +V + N+A+M+LG +C P C CPL+ C A +P K+ Sbjct: 192 YAREVADSPKAYMHNEALMELGRTVCKTKSPLCETCPLRGECRAFQECRTTEFP-PAKKR 250 Query: 227 TLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQI------A 280 T G L+++ DE +LA P F L + Sbjct: 251 TEKSWHGTVLVVESADEKILAVNGGQKFLDKQLALPHFESARHATVALPAKAEDYINADE 310 Query: 281 ADNLTQLTAFRHTFSHFHLDIVPMWLPVSSF--------TGCMDEGNALWYNLAQPPSVG 332 ++ FRH+ + ++ + + + + W A+ Sbjct: 311 VKSVEYCGTFRHSITVHKIECDVLHVQLLTKAKASHLPTENSDKPTAFEWIEKAKAFETF 370 Query: 333 LAAPVERLLQQL 344 + + L+++ Sbjct: 371 ANSFSLKALKKV 382 >UniRef50_A9AYZ2 HhH-GPD family protein n=5 Tax=Chloroflexi (class) RepID=A9AYZ2_HERA2 Length = 323 Score = 257 bits (656), Expect = 6e-67, Method: Composition-based stats. Identities = 90/260 (34%), Positives = 131/260 (50%), Gaps = 14/260 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 + S +L W+ GR LPW+ + PY + +SE MLQQTQV VIP +E F+A FPT Sbjct: 3 ELSTLQIDLLAWFQANGR-DLPWRRTRNPYYILVSETMLQQTQVDRVIPKYEAFLALFPT 61 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT-----LHGGKFPETFEEVAALPG 116 V LA+A +V+ W GLGY RA NL +AAQ + G FP T E + LPG Sbjct: 62 VEALASASTADVIRSWQGLGYNRRAVNLQRAAQAIVAAGYPADPAG-FPATPEGLRNLPG 120 Query: 117 VGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG 176 +G T+GA+ + + LD N++RV+ R + E L ++Q+ P Sbjct: 121 IGAYTSGAVACFAFERDVAFLDTNIRRVVRRLLVGPEDAPPETNEQTLIDYAQQLIPQGQ 180 Query: 177 VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA-------NNSWALYPGKKPKQTLP 229 +NQA+M+LGA+IC+ +KP+C CP+ C A A P KP++ Sbjct: 181 GWAWNQAIMELGALICSAAKPQCWRCPVNQHCRAYAIWREANTQLDMWQPPVIKPRKKAA 240 Query: 230 ERTGYFLLLQHEDEVLLAQR 249 E+ + ++ A R Sbjct: 241 EQPFHTSNRYFRGRIIDALR 260 >UniRef50_B6K934 Helix-hairpin-helix motif-containing protein n=3 Tax=Toxoplasma gondii RepID=B6K934_TOXGO Length = 833 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 96/332 (28%), Positives = 137/332 (41%), Gaps = 79/332 (23%) Query: 29 TPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARN 88 +PY VW+SEVMLQQTQV TVI Y++R+M+R+PTV DL A +EV +W+GLGYY RAR Sbjct: 263 SPYGVWVSEVMLQQTQVCTVIDYWQRWMSRWPTVGDLVKATEEEVSQMWSGLGYYRRARQ 322 Query: 89 LHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARC 148 L K AQ V G+ P E++ ++PG+G T GAI +++ G +DGNV RVLAR Sbjct: 323 LLKGAQTVVQEFDGELPGDVEKLLSIPGIGPYTGGAISAIAFGNRAAAVDGNVLRVLARL 382 Query: 149 YAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 ++ + + +A+++LGA ICT P C CP++ C Sbjct: 383 LGLAAPADSRALAMFCSRWMPPFLDPRRPGASTEALIELGATICTPRAPSCLSCPVRQFC 442 Query: 209 ----------------------------------IAAANNSWALYPGKKPKQTLPERTGY 234 A A YP K ++ PE + Sbjct: 443 LVNREAKSGTSACEARREIHSADCKLCIPFAEAQAAVRERQQAAYPVAKATKSRPEESYV 502 Query: 235 FLLL-----QHED----------------------------------------EVLLAQR 249 L + + ED E+LL +R Sbjct: 503 VLCVTRVPGEREDADAHSTGRRGHSSNSTKSQIACGEDRHTSGQVREREFEQWEILLRRR 562 Query: 250 PPSGLWGGLYCFPQFADEESLRQWLAQRQIAA 281 P +GL G P + + + AA Sbjct: 563 PSAGLLAGQLEVPSCLRSRTQGAKTPEEKKAA 594 >UniRef50_UPI0000510111 A/G-specific adenine glycosylase n=1 Tax=Brevibacterium linens BL2 RepID=UPI0000510111 Length = 345 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. Identities = 75/245 (30%), Positives = 115/245 (46%), Gaps = 9/245 (3%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + ++ W++ R LPW+ T + V +SE+M QQT V+ V P + +M ++P Sbjct: 25 DIHRVRETIITWFESAAR-PLPWRDADTTAWAVLVSEIMSQQTPVSRVEPRWREWMQKWP 83 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T DLA AP EVLH W LGY RA L +AA+ + G P+T +E+ LPG+G Sbjct: 84 TPADLAQAPTAEVLHRWDRLGYPRRALRLQEAARVITEELDGHVPQTAKELERLPGIGSY 143 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TA A+ S + G+ +LD NV+RVL R +A P + Q P +++ Sbjct: 144 TAAAVTSFAHGERTTVLDTNVRRVLIRLFAGRDRPSP-SPGRAETEWAGQFVPETEHKQW 202 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 N +M+ GA++CT P+C CPL + C + Q Sbjct: 203 NAGVMEFGALVCTARNPQCETCPLNDICAWQKAGRP------TSAVKPKTQKWAGTDRQL 256 Query: 241 EDEVL 245 ++ Sbjct: 257 RGAIM 261 >UniRef50_Q4P7F9 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P7F9_USTMA Length = 539 Score = 253 bits (647), Expect = 6e-66, Method: Composition-based stats. Identities = 93/275 (33%), Positives = 140/275 (50%), Gaps = 35/275 (12%) Query: 9 QVLDWYDKYG-RKTLPWQ------------------IDKTPYKVWLSEVMLQQTQVATVI 49 +L W++ ++ +PW+ K Y+VW+SE+MLQQT+V TV Sbjct: 90 ALLTWFETVSSKRDMPWRADFIDASHYSDAAKLREARKKRAYQVWISEIMLQQTRVETVK 149 Query: 50 PYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHG--GKFPET 107 Y+ +M ++PT+ LA A + VL W GLGYY+RA +H AA++V G PET Sbjct: 150 TYWLNWMNKWPTIEALAAADPEAVLAAWRGLGYYSRATRIHTAAKRVVADPHMMGLLPET 209 Query: 108 FEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS 166 +E+ +PGVG TAGAI S+ G+ PI+DGNV RVL+R A+ P K + +W+ Sbjct: 210 PQELEKNVPGVGPYTAGAISSIVFGQAVPIIDGNVVRVLSRQLALYANPKTKLTSHLMWA 269 Query: 167 LSEQVTPAVG---------VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWA 217 + ++ ++NQA+M+LG+ +CT +KP+C CP+Q C A Sbjct: 270 TATKLVHKASALRGGKATVPGQWNQALMELGSTVCTPTKPRCDECPIQATCSAHQEAQAD 329 Query: 218 LYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPS 252 P + +L ED L +R PS Sbjct: 330 TIKANNK----PNQAAGAEVLDIEDICTLCERFPS 360 Score = 57.9 bits (139), Expect = 6e-07, Method: Composition-based stats. Identities = 17/84 (20%), Positives = 29/84 (34%), Gaps = 1/84 (1%) Query: 213 NNSWALYPGKKPKQTLPERTGYFLLLQHE-DEVLLAQRPPSGLWGGLYCFPQFADEESLR 271 +P K K+T+ + ++Q E LL QRP SGL + FP + Sbjct: 418 ERHICKFPMKLEKKTIRQEECLVCIIQRSSGEFLLEQRPASGLLASTWQFPSLTLSTTAA 477 Query: 272 QWLAQRQIAADNLTQLTAFRHTFS 295 + ++ F+ Sbjct: 478 GSPSDQRQPVFGSKPTAKLVDPFT 501 >UniRef50_B3DTE4 A/G-specific DNA glycosylase n=6 Tax=Bifidobacterium RepID=B3DTE4_BIFLD Length = 328 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 93/323 (28%), Positives = 144/323 (44%), Gaps = 57/323 (17%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + S ++ W++ R LPW+ + TP+ V +SEVM QQTQ++ V+PY+ +M R+ Sbjct: 1 MNDTAISLRLGAWWEANAR-DLPWRFGRATPWGVLVSEVMSQQTQMSRVVPYWTDWMERW 59 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA AP EV+ W LGY RA L + A+ VA + K P T++E+ ALPG+G Sbjct: 60 PDARALAEAPKAEVITAWGRLGYPRRALRLQECARVVAEDYADKLPRTYDELTALPGIGD 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVG--- 176 TA A++S + G+ ++D N++RVL+R + G +L+ ++ P Sbjct: 120 YTASAVMSFAFGERIAVIDTNIRRVLSRVFLGVESRG-GATSPAERALANRMLPKDEILG 178 Query: 177 ----------------------------------VERFNQAMMDLGAMICTRSKPKCSLC 202 +NQ++M+LGA+ICT P C C Sbjct: 179 CDADVADNAGSAEHVVNSTIRGGKRSRLHRGERPSVTWNQSVMELGAVICTAKSPLCDTC 238 Query: 203 PLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFP 262 P+ + C PG ++T P + Q VL A R P Sbjct: 239 PIADDCAFLKAGR----PGLGERRTRPRQRFQGTDRQVRGLVLAALR----------ELP 284 Query: 263 ---QFADEESLRQWLAQRQIAAD 282 E++ + W Q Q+AA Sbjct: 285 AGATLPREDADKLWKDQIQLAAC 307 >UniRef50_D0RRM3 A/G-specific adenine glycosylase n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RRM3_9RICK Length = 337 Score = 251 bits (642), Expect = 2e-65, Method: Composition-based stats. Identities = 93/346 (26%), Positives = 163/346 (47%), Gaps = 18/346 (5%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTP----YKVWLSEVMLQQTQVATVIPYFERFMARF 59 S S ++L WY KY R LPW+ K YKV LSE MLQQTQV T +P+F F + Sbjct: 3 SSISKKILAWYQKYQR-DLPWRKYKNQKDRHYKVLLSEFMLQQTQVKTALPFFNNFYKKI 61 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 ++ L+ +V LW GLGYY RA+ L + ++ + + K P +E++ ALPG+G Sbjct: 62 NSLEKLSKTSQAKVNKLWQGLGYYRRAKFLLETSKIIKKKYNYKLPSQYEDLIALPGIGD 121 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA AILS++ K+ +DGNV+RV+ R + +SG + K+ + E++ Sbjct: 122 YTAKAILSIAFDKNEIGIDGNVERVVTRIFNISG-------KKKILNYVERLKVEKKASF 174 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQ 239 Q +M++GA+IC P C+ C L C ++N + + +K K + ++ + Sbjct: 175 LMQGLMEVGALICKPKLPLCNDCFLNKNCKYSSNKKFNI--DEKVKAEKKIKKFIAIIYR 232 Query: 240 HEDEVLLAQRPPSGLWGGLYCFPQFADEESLRQWLAQRQIAADNLTQLTAFRHTFSHFHL 299 ++LL G P + + + N L + S+F Sbjct: 233 KNKKILLRYEKNLGPLKDFLNVPLVLEISQEGRNYKKILKKNKNYQNLGILNISISNFKA 292 Query: 300 DIVPMWLPVSSFTGCMDEGNALWYNLAQPPSVGLAAPVERLLQQLR 345 +I+ + + + N + S ++ ++++L++++ Sbjct: 293 EILC----LETKKMIKVNRPYYFMNATELKSKFISGFLKKILKKVK 334 >UniRef50_A7NN47 HhH-GPD family protein n=2 Tax=Bacteria RepID=A7NN47_ROSCS Length = 317 Score = 250 bits (639), Expect = 6e-65, Method: Composition-based stats. Identities = 84/254 (33%), Positives = 130/254 (51%), Gaps = 9/254 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + + +L W+ + R LPW+ + PY++ ++EVMLQQTQV V+P + F+ RFP Sbjct: 15 LSLNDLHQALLKWFSEAAR-DLPWRRTRDPYRILVAEVMLQQTQVDRVLPKYAAFLERFP 73 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T+ LA AP EV+ +W GLGY RA NL +AA+ + +GG FP + LPG+G Sbjct: 74 TLHTLAEAPTAEVIRMWAGLGYNRRAVNLQRAARAICARYGGVFPRDVATLVTLPGIGSY 133 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 TAGA+ + + +D N++RV+ R + P + E L +L+ P + Sbjct: 134 TAGAVACFAFEQDVAFMDTNIRRVIRRVF---TDPTETVNERALLALARAALPVGRSWMW 190 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL-----YPGKKPKQTLPERTGYF 235 NQA+M+LG+++CT P C CPL++ C A + P +K ER Sbjct: 191 NQALMELGSLVCTADAPACWRCPLRDQCRDYAARRESDERFASAPVRKRLAERRERPFIG 250 Query: 236 LLLQHEDEVLLAQR 249 ++ A R Sbjct: 251 SNRYFRGRIIEALR 264 >UniRef50_D1BVG2 HhH-GPD family protein n=39 Tax=Actinomycetales RepID=D1BVG2_XYLCX Length = 581 Score = 248 bits (633), Expect = 3e-64, Method: Composition-based stats. Identities = 81/252 (32%), Positives = 121/252 (48%), Gaps = 9/252 (3%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQID-KTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 +V+ W+D R+ LPW+ +TP+ V +SEVMLQQT V V P + +M R+PT Sbjct: 294 RALVERVVRWFDG-ARRDLPWRAADRTPWGVLVSEVMLQQTPVVRVEPAWRAWMERWPTP 352 Query: 63 TDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 +DLA A +VL W LGY RA L + A+ + H G P+ + ALPGVG TA Sbjct: 353 SDLAAASTADVLRAWDRLGYPRRALRLQECARVLVERHDGAVPDDEAALRALPGVGEYTA 412 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA--VGVERF 180 A+ + + G+ ++D NV+RVLAR P + + V P + Sbjct: 413 AAVRAFAFGRRAVVVDTNVRRVLARAVGGVALPAP-SPTAAERATATAVVPHDDDAAAAW 471 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQH 240 A M+LGA++CT P+C+ CP+++ C A PG + + Q Sbjct: 472 AAASMELGALVCTARSPRCAECPVRDLCAWRAAGC----PGDAHAHRRKTQAWHGTDRQC 527 Query: 241 EDEVLLAQRPPS 252 ++ A R Sbjct: 528 RGRIMAALRAAD 539 >UniRef50_A1A5M6 Mutyh protein n=4 Tax=Eukaryota RepID=A1A5M6_RAT Length = 184 Score = 247 bits (632), Expect = 4e-64, Method: Composition-based stats. Identities = 86/178 (48%), Positives = 114/178 (64%), Gaps = 1/178 (0%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 MLQQTQVATVI Y+ R+M ++PT+ DLA+A L+EV LW+GLGYY+R R L + A++V Sbjct: 1 MLQQTQVATVIDYYTRWMQKWPTLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVE 60 Query: 99 LHGGKFPETFEEVAAL-PGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGK 157 GG P T E + L PGVGR TAGAI S++ + ++DGNV RVL R A+ P Sbjct: 61 ELGGHVPRTAETLQQLLPGVGRYTAGAIASIAFDQVTGVVDGNVIRVLCRVRAIGADPTS 120 Query: 158 KEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 V + LW L++Q+ FNQA M+LGA +CT +P CS CP+Q+ C A S Sbjct: 121 SFVSHHLWDLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAHQRVS 178 >UniRef50_B9L504 Catalytic Domain Of MutyY n=4 Tax=Bacteria RepID=B9L504_THERP Length = 358 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 89/209 (42%), Positives = 126/209 (60%), Gaps = 4/209 (1%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 Q +++DWY + R LPW+ + PY++ +SEVMLQQTQV VIPY+E F+ARFPT Sbjct: 63 QIRAVQRRLVDWYRREAR-DLPWRRTRDPYRILVSEVMLQQTQVERVIPYYEVFLARFPT 121 Query: 62 VTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 V LA+A L EV+ +W GLGY RA L +AA+++ GG+FP + LPGVGR T Sbjct: 122 VEALASAALAEVIAVWGGLGYNRRAVYLWRAAREIVERWGGRFPGERRLLERLPGVGRYT 181 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 AGA+ + G+ D N+ RVL R + + +L L+E+V P +N Sbjct: 182 AGAVACFAFGERVAFWDTNIARVLRRVFLGPE---ARPGRRELDELAERVLPLDRAYEWN 238 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 QA+M+LGA IC+ +P+C +CPL C + Sbjct: 239 QALMELGARICSARRPRCEICPLCGLCRS 267 >UniRef50_C7HUA0 Endonuclease III n=4 Tax=Anaerococcus RepID=C7HUA0_9FIRM Length = 215 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 45/217 (20%), Positives = 85/217 (39%), Gaps = 8/217 (3%) Query: 1 MQASQFSAQVLDWYDKYGRK-TLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 + S+ +V+D D+ + TP+++ ++ ++ Q V Sbjct: 5 LNKSEI-NEVVDRLDQMYPNLDKSFLDFTTPFELLIATILSAQCTDVRVNKVTSNMFKFA 63 Query: 60 PTVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 T D +N + E+ G Y +A+N+ A+ + G P+ +++ LPGVG Sbjct: 64 NTPEDFSNMDIKEIESYIKTCGLYKNKAKNIKNASIMLIREFDGIVPDNMKDLTKLPGVG 123 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 R TA ++S + G +D +V+RV R ++ E L + P Sbjct: 124 RKTANVVMSNAFGIDAIAVDTHVQRVSNRI-GLAASKDVLNTEKDL----RKNLPKEKWS 178 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 + + ++ G IC P C C L++ C Sbjct: 179 KLHHQIIAHGRKICKARNPLCEECDLKDLCEDYKERR 215 >UniRef50_C8RS99 A/G-specific DNA glycosylase n=1 Tax=Corynebacterium jeikeium ATCC 43734 RepID=C8RS99_CORJE Length = 385 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 87/336 (25%), Positives = 130/336 (38%), Gaps = 55/336 (16%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQ-IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 + + A++ WY + R LPW+ D TP+ + LSEVM QQT VA VIP + ++ R+ Sbjct: 50 VDSPDLPAKLNAWYARNAR-DLPWRHPDTTPWAILLSEVMSQQTPVARVIPLWRAWLERW 108 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 PT DLA AP E+L +W LGY RA L + A H G P E+ ALPG+G Sbjct: 109 PTPADLAAAPRSEILRMWANLGYPRRALRLKECAIACVERHDGAVPHDIAELEALPGIGH 168 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ + + G+ P++D NV+RV R G + + + P V + Sbjct: 169 YTARAVAAFAFGQAVPVVDTNVRRVYRRLVDGRYLQGPARARD--LADVASLLPHVDPDP 226 Query: 180 -------------------------------------------FNQAMMDLGAMICTRSK 196 A+M+LGA+ICT Sbjct: 227 RLDGRQLLPPPQPQPTDTGTGTDSCAGGSTATARDADRDAANLMCAALMELGALICTAKS 286 Query: 197 PKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDEV--LLAQRPPSGL 254 P C CP+ + C + + Q ++ LL + G+ Sbjct: 287 PACDQCPVIDDCAWVQAGKPQPTEAEAQAAAQRVQKFVGTDRQVRGKIMGLLREAGEGGV 346 Query: 255 WGG----LYCFPQFADEESLRQWLAQRQIAADNLTQ 286 L+ P E Q L + A + + Sbjct: 347 TKKEIDLLW--PATEQLERALQSLLDDGLVARDGAK 380 >UniRef50_B2AMG7 Predicted CDS Pa_5_7310 n=1 Tax=Podospora anserina RepID=B2AMG7_PODAN Length = 582 Score = 247 bits (631), Expect = 5e-64, Method: Composition-based stats. Identities = 96/264 (36%), Positives = 144/264 (54%), Gaps = 27/264 (10%) Query: 1 MQASQFSAQVLDWYDKYGRKTL-PWQI-----------DKTPYKVWLSEVMLQQTQVATV 48 + + A +L W+ K L PW+ + Y+VW+SE+MLQQT+VATV Sbjct: 116 LSSPSAQASLLRWFQKEQTTRLMPWRKPFLTNPSRADLSRRAYEVWISEIMLQQTRVATV 175 Query: 49 IPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETF 108 I Y+ ++M ++PT+ DLA A +EV+++WTGLGYY+RAR +H AQ+V T G P+T Sbjct: 176 IAYWNKWMTKWPTIEDLAQATEEEVVNMWTGLGYYSRARRIHAGAQKVVTEMQGLLPDTV 235 Query: 109 EEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPG-KKEVENKLWS 166 E + +PGVGR TAGAI ++ G+ P++DGNV RVL+R + G K V + LW Sbjct: 236 EGLMKHVPGVGRYTAGAISAIVFGEAEPMVDGNVMRVLSRQMGLMGDVKGDKRVVDVLWE 295 Query: 167 LSEQVT---------PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWA 217 ++++ + QA+M+LG+ ICT KP+C CP+ C+A A Sbjct: 296 AADRLVKVVAEADGEEGEKPGLWGQALMELGSTICTP-KPQCGKCPVTESCMAYAEGLAL 354 Query: 218 LYPGKKPKQTLPERTGYFLLLQHE 241 KQ +P+ + E Sbjct: 355 ---ASGLKQAVPDIEDGIACVLCE 375 Score = 99.8 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 53/150 (35%), Gaps = 18/150 (12%) Query: 215 SWALYPGKKPKQTLPERTGYFLLLQH--EDEVLLAQRPPSGLWGGLYCFPQFADEESLRQ 272 +P KKPK+ + E ++ + + L+++RP GL GL+ FP + S Sbjct: 430 HAQRFPLKKPKKQVREEEALVCAIRRVSDGQYLISRRPGKGLLAGLWEFPSYILPASNDS 489 Query: 273 WLAQRQIAADNLT------------QLTAFRHTFSHFHLDIVPMWLPVSSFTGCMD---- 316 R+ A + +L FSH L + + + Sbjct: 490 TTKSRKKQALDYASGLVESTDGYRGELGTVPWLFSHLRLAMHVQLFELDDSDRSLRTLPL 549 Query: 317 EGNALWYNLAQPPSVGLAAPVERLLQQLRT 346 E N W + + S + +++ ++ Sbjct: 550 ESNYRWVSSDEIESESMGTGMKKCWSLIKA 579 >UniRef50_B8DWV0 Endonuclease III n=11 Tax=Bifidobacteriaceae RepID=B8DWV0_BIFA0 Length = 247 Score = 246 bits (630), Expect = 6e-64, Method: Composition-based stats. Identities = 53/209 (25%), Positives = 87/209 (41%), Gaps = 1/209 (0%) Query: 25 QIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA 84 TP+++ ++ ++ QT V +PT LA+A L++V + LG+Y Sbjct: 31 LDFHTPFELLVATILSAQTTDKRVNSITPELFGTYPTAAALADARLEDVESIIRPLGFYH 90 Query: 85 -RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKR 143 +A ++ A+Q+ GG+ P+T EE+ +LPGVGR TA +L + +D +V R Sbjct: 91 VKAEHIIAVARQIVERFGGQIPQTMEELTSLPGVGRKTANVVLGNAFRVPGFPVDTHVIR 150 Query: 144 VLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCP 203 V R + W ++ P + ++ G ICT P+C CP Sbjct: 151 VTGRLHWRDDWMKTSTTPERIEQEITGCFPESEWTDLSHRLIIFGRNICTSRSPECENCP 210 Query: 204 LQNGCIAAANNSWALYPGKKPKQTLPERT 232 L C +A K R Sbjct: 211 LLPTCPSAGEFLEIAAERKARATRRAPRK 239 >UniRef50_O26858 Endonuclease III n=4 Tax=Euryarchaeota RepID=O26858_METTH Length = 233 Score = 245 bits (627), Expect = 1e-63, Method: Composition-based stats. Identities = 51/191 (26%), Positives = 87/191 (45%), Gaps = 6/191 (3%) Query: 26 IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA- 84 D+ PY+V + ++ Q+T+ R+P++ D+A APL+E+ L G+Y Sbjct: 41 EDRDPYRVLIRTILSQRTRDENTDEATASLFERYPSIEDVAYAPLEEIEALIRKAGFYHV 100 Query: 85 RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRV 144 +AR + + ++ + + GK P+ E+ LPGVGR TA +L + G+ +D +V R+ Sbjct: 101 KARRVREVSRIILEEYDGKVPDDINELLKLPGVGRKTANCVLVYAFGRPAIPVDTHVHRI 160 Query: 145 LARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPL 204 R V + E +L +V P N M+ G IC P+ CP+ Sbjct: 161 SNRIGLV----DTRTPEETERALM-KVIPREYWIELNDLMVQFGQDICRPLGPRHEECPI 215 Query: 205 QNGCIAAANNS 215 + C Sbjct: 216 ADHCDYYFRIK 226 >UniRef50_A4AHC9 Adenine glycosylase n=1 Tax=marine actinobacterium PHSC20C1 RepID=A4AHC9_9ACTN Length = 297 Score = 245 bits (627), Expect = 1e-63, Method: Composition-based stats. Identities = 92/290 (31%), Positives = 138/290 (47%), Gaps = 16/290 (5%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 ++FS +W++++GR LPW+ D + + +SE+MLQQT VA VIP E+++ R+P Sbjct: 9 NLAEFSLVTREWFNEHGR-DLPWRHDGFGAWGILVSEIMLQQTPVARVIPRLEQWLDRWP 67 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 T LA E + W LGY RA NLH AA + H G PE + + ALPG+G Sbjct: 68 TPAALAADSPGEAVRAWERLGYPRRALNLHAAATAITKNHDGTVPEDVQSLLALPGIGDY 127 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAV--GVE 178 TA A+ + + G P++D NV+RV+AR A G G L + E + P + Sbjct: 128 TARAVAAFAYGHRHPVVDTNVRRVIARAVAGQGEAGPPSTRRDL-AAMELLLPEDPVAAQ 186 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 N A+M+LGA++CT +P C CP++ C A YP + K+ ++ Sbjct: 187 LTNAAVMELGAIVCTAKRPLCDECPVRELCQWRARG----YPPFEGKRQAVQKRFEGSDR 242 Query: 239 QHEDEVLLAQRPPS-----GLWGGLYCFPQFADEESLRQWLAQRQIAADN 283 Q +L R G ++ P E L +A N Sbjct: 243 QVRGLILAELRASDVPVTAGEIAQVW--PDAEQRERALAGLLADGLAVQN 290 >UniRef50_Q9WYK0 Endonuclease III n=12 Tax=Thermotogaceae RepID=END3_THEMA Length = 213 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 99/224 (44%), Gaps = 13/224 (5%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + + +++ + + ++T P++V +S V+ Q+T+ ++ + T Sbjct: 2 IEELAREIVKRFPRNHKET-------DPFRVLISTVLSQRTRDENTEKASKKLFEVYRTP 54 Query: 63 TDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 +LA A +++ L G Y +A + + ++ + +GG+ P++ EE+ LPGVGR T Sbjct: 55 QELAKAKPEDLYDLIKESGMYRQKAERIVEISRILVEKYGGRVPDSLEELLKLPGVGRKT 114 Query: 122 AGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFN 181 A +L + K +D +V R+ R GW + E +L +++ P N Sbjct: 115 ANIVLWVGFKKPALAVDTHVHRISNRL----GWVKTRTPEETEEAL-KKLLPEDLWGPIN 169 Query: 182 QAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPK 225 +M++ G IC P C C L+N C + + Sbjct: 170 GSMVEFGRRICKPQNPLCEECFLKNHCEFYRRRGKGEVRNRTER 213 >UniRef50_B2A677 DNA-(Apurinic or apyrimidinic site) lyase, endonuclease III n=11 Tax=Clostridia RepID=B2A677_NATTJ Length = 218 Score = 244 bits (624), Expect = 3e-63, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 84/219 (38%), Gaps = 7/219 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M+ + + Q+L +P+++ ++ ++ Q V + Sbjct: 1 MEKKE-AEQILSKLKANYPSARTALKFNSPFELLIATILSAQCTDIRVNEITKELFELAN 59 Query: 61 TVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 T D+ ++ + G G Y +++N+ + + + + G+ P EE+ LPGVGR Sbjct: 60 TPQDILKLGRPRLIQIIKGAGLYKNKSKNILETCEILVDEYEGEVPAKREELEKLPGVGR 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA +L+ + +D +V RV R ++ + VE L V Sbjct: 120 KTANVVLANAFNIPAFAVDTHVLRVSKRL-GLTDKEDPRGVEQDLM----SVFDRDDWNV 174 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWAL 218 + ++ G +C KP+C C + C +N Sbjct: 175 GHHLLIYHGRAVCKARKPQCENCSIIEHCKYYHDNYQYP 213 >UniRef50_Q5M3Y9 Endonuclease III, DNA repair n=21 Tax=Firmicutes RepID=Q5M3Y9_STRT2 Length = 219 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 58/216 (26%), Positives = 95/216 (43%), Gaps = 5/216 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + + L K +TP+++ ++ ++ QT V AR+P Sbjct: 6 MLGRKRVNEALALMGKMFPDAHGELEWETPFQLLVAVILSAQTTDKAVNKVTPGLWARYP 65 Query: 61 TVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 + DLA+A LD+V +G Y +A+N+ K A+ + G+ P+T +E+ +LPGVGR Sbjct: 66 EIEDLASANLDDVEMCLRTIGLYKNKAKNIIKTARAILMNFDGQVPKTHKELESLPGVGR 125 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA +L+ G +D +V RV R +E+E +L + P Sbjct: 126 KTANVVLAEVYGIPSIAVDTHVSRVSKRLNIAPENASVEEIEAELM----KKIPKKDWII 181 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 + M+ G C PKC CPLQ+ C + Sbjct: 182 SHHRMIFFGRYHCLAKNPKCQTCPLQSYCKYYRETT 217 >UniRef50_Q03K95 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=5 Tax=Lactobacillales RepID=Q03K95_STRTD Length = 214 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 58/216 (26%), Positives = 95/216 (43%), Gaps = 5/216 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + + L K +TP+++ ++ ++ QT V AR+P Sbjct: 1 MLGRKRVNEALALMGKMFPDAHGELEWETPFQLLVAVILSAQTTDKAVNKVTPGLWARYP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 + DLA+A LD+V +G Y +A+N+ K A+ + G+ P+T +E+ +LPGVGR Sbjct: 61 EIEDLASANLDDVEMCLRTIGLYKNKAKNIIKTARAILMNFDGQVPKTHKELESLPGVGR 120 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA +L+ G +D +V RV R +E+E +L + P Sbjct: 121 KTANVVLAEVYGIPSIAVDTHVSRVSKRLNIAPENASVEEIEAELM----KKIPKKDWII 176 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 + M+ G C PKC CPLQ+ C + Sbjct: 177 SHHRMIFFGRYHCLAKNPKCQTCPLQSYCKYYRETT 212 >UniRef50_B8P2D0 Predicted protein n=1 Tax=Postia placenta Mad-698-R RepID=B8P2D0_POSPM Length = 562 Score = 243 bits (622), Expect = 5e-63, Method: Composition-based stats. Identities = 98/402 (24%), Positives = 155/402 (38%), Gaps = 106/402 (26%) Query: 2 QASQFSAQVLDWYD-KYGRKTLPWQIDKTP-----------YKV--WLSEVMLQQTQVAT 47 + S +L+WY+ + + +PW+ P Y+V W+SE+MLQQTQVAT Sbjct: 77 EPSPLREALLEWYEGVHAARGMPWRKPYDPSMNNDQRAQRAYEVLVWVSEIMLQQTQVAT 136 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 VIPY+ R+M ++PT+ DLA + ++ V +W GLGYY+RA L AQ+ GG+ P+ Sbjct: 137 VIPYYNRWMKKYPTIRDLAASDIETVNSIWRGLGYYSRAARLLAGAQKAVRDFGGRLPDN 196 Query: 108 FEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS 166 +++ A +PG+GR +AGAI S++ D R AV P K+ + LW Sbjct: 197 AKDMEANIPGIGRYSAGAICSIAYN------D--------RILAVHAPPKSKQTLDILWQ 242 Query: 167 LSEQVTPAV-GVERFNQAMMDLGAM-ICTRSK------PK----------------CSLC 202 + + NQA+++LG+ C K P+ C+LC Sbjct: 243 GATAMVEGGTRPGDLNQALIELGSTPWCQAVKLADGKEPEGAASKSGAEVLDIEEICTLC 302 Query: 203 PLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLLQHEDE-------VLLAQRPPSGLW 255 +P + K+ E +++ + LL +RP GL Sbjct: 303 E-----PLPVGRPVTSFPMRAEKKKAREELDIVNVIEWRSQADGGGRWFLLVRRPEGGLL 357 Query: 256 GGLYCFPQFA-----------------------------------------DEESLRQWL 274 GGL+ FP E S + Sbjct: 358 GGLHEFPTSPAVPVTISAAAQKKVPSALLRDLLTSPPAGEHGARTSGDQARGEISGAELE 417 Query: 275 AQRQIAADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMD 316 + ++ H FSH W+ + G Sbjct: 418 PSTALRIVHVEPAGDVIHIFSHIRKTYRVQWVVLEGGGGTPP 459 >UniRef50_A1RYX1 HhH-GPD family protein n=1 Tax=Thermofilum pendens Hrk 5 RepID=A1RYX1_THEPD Length = 253 Score = 243 bits (622), Expect = 6e-63, Method: Composition-based stats. Identities = 69/213 (32%), Positives = 111/213 (52%), Gaps = 9/213 (4%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 + + ++ +WY + GR PW+ + PY + +E +LQ+T+ TV FE F +R+P+ Sbjct: 40 EEVARRLAEWYRRRGR-DFPWRHTRDPYVILATEFLLQRTRAETVAKVFEEFFSRYPSPE 98 Query: 64 DLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAG 123 LANA +E+ ++ LG RA L +AA+++ HGG P + EE+ L GVG A Sbjct: 99 SLANADPEELRKFFSRLGLVRRADALREAAREIVERHGGSVPRSKEELLKLKGVGPYIAS 158 Query: 124 AILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQA 183 A+L + P++D NV+RVL R S + + E++ + A Sbjct: 159 AVLCFAYSAPVPVVDTNVERVLGRAAGASS-------REEAEAFLERLLRHGNPREISLA 211 Query: 184 MMDLGAMICTRSKPKCSLCPLQNGCIAAANNSW 216 ++DLGA++CT KPKC CPL + C + Sbjct: 212 LIDLGALVCT-RKPKCPECPLSDLCSYRGSGRP 243 >UniRef50_B7GUG9 HhH-GPD family protein n=2 Tax=Bifidobacterium RepID=B7GUG9_BIFLI Length = 318 Score = 243 bits (621), Expect = 7e-63, Method: Composition-based stats. Identities = 94/313 (30%), Positives = 150/313 (47%), Gaps = 47/313 (15%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDK-TPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M + S ++ W++ R LPW+ + TP+ V +SEVM QQTQ++ V+PY+ +MAR+ Sbjct: 1 MNDTAISLRLGAWWEANAR-DLPWRFGRATPWGVLVSEVMSQQTQMSRVVPYWNDWMARW 59 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 P LA AP +V+ W LGY RA L + A+ VA +G + P T++E+ ALPG+G Sbjct: 60 PDARALAAAPKADVITAWGRLGYPRRALRLRECARVVAEEYGDELPRTYDELVALPGIGD 119 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE- 178 TA A+LS + G+ ++D N++RVL+R + + G + +L+ ++ P V Sbjct: 120 YTASAVLSFAFGERIAVIDTNIRRVLSRVFLGTESRGGAASPAER-ALANRMLPQDRVCG 178 Query: 179 --------------------------RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA 212 +NQ++M+LGA++CT P C +CP+ + C Sbjct: 179 DGADCTDHAYRSGEHTFLQRSEPPSVTWNQSVMELGAVVCTAKTPLCEICPIADDCAFLK 238 Query: 213 NNSWALYPGKKPKQTLPERTGYFLLLQHEDEVLLAQRPPSGLWGGLYCFP---QFADEES 269 PG ++T P + Q VL A R P E++ Sbjct: 239 AGR----PGLGERRTRPRQRFQGTDRQVRGLVLAALR----------ELPAGVTLPREDA 284 Query: 270 LRQWLAQRQIAAD 282 + W Q Q+AA Sbjct: 285 DKLWKDQIQLAAC 297 >UniRef50_C8WS61 Endonuclease III n=10 Tax=Firmicutes RepID=C8WS61_ALIAD Length = 220 Score = 243 bits (620), Expect = 8e-63, Method: Composition-based stats. Identities = 47/207 (22%), Positives = 94/207 (45%), Gaps = 6/207 (2%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 +V++ + TP+++ ++ ++ Q V R A++ A A Sbjct: 11 RVVERLLEAYPDARCQLHFTTPFELLVATILSAQCTDERVNMVTPRLFAKYRGPEGFAKA 70 Query: 69 PLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 DEV +G + +++++ + A+ + +GG+ P++ + + LPGVGR TA ++S Sbjct: 71 SPDEVAEDIREVGLFRSKSKHIVETARILVDEYGGEVPKSRDRLMELPGVGRKTANVVVS 130 Query: 128 LSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDL 187 + G +D +V+RV R + E ++ + P + + A++ Sbjct: 131 NAYGVPAFAVDTHVQRVTNRIGLAKSN-DPLKTEQQVCAK----LPPELWTKAHHALILH 185 Query: 188 GAMICTRSKPKCSLCPLQNGCIAAANN 214 G +CT KPKC +CP+ + C A N Sbjct: 186 GRRVCTARKPKCHICPVADLCQCARNQ 212 >UniRef50_Q1ASN5 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=4 Tax=Bacteria RepID=Q1ASN5_RUBXD Length = 214 Score = 242 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 53/204 (25%), Positives = 89/204 (43%), Gaps = 7/204 (3%) Query: 9 QVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANA 68 +V+ + P ++ ++ ++ Q V ER ++ T D A+A Sbjct: 8 EVIARLKREYPDAKTALNWSNPLELLVAVILSAQCTDERVNRVTERLFRKYRTAEDYASA 67 Query: 69 PLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILS 127 PL+E+ G+Y +AR + A+ + HGG+ P+T EE+ ALPGVGR TA +L Sbjct: 68 PLEELEQDIRPTGFYRNKARAIQGMARALLERHGGEVPKTMEELVALPGVGRKTANVVLG 127 Query: 128 LSLGKHFP-ILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 + G + ++D +V+RV R + E L Q P F ++ Sbjct: 128 NAFGVNEGVVVDTHVRRVSRRLGL----TSSDDPEKIERDLLPQ-VPEEERTLFAHLLIF 182 Query: 187 LGAMICTRSKPKCSLCPLQNGCIA 210 G +C +P C C L + C + Sbjct: 183 HGRRVCKARRPDCPGCVLNDICPS 206 >UniRef50_C2KXT3 DNA-(Apurinic or apyrimidinic site) lyase n=1 Tax=Oribacterium sinus F0268 RepID=C2KXT3_9FIRM Length = 250 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 52/227 (22%), Positives = 96/227 (42%), Gaps = 9/227 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M+ + ++ YG + + + + +++ + ++ Q A V E+ ++ Sbjct: 13 MRVREIMKRLDA---HYGDRPMIFLEAENAWQLLFATILSAQCTDARVNMVTEKLFVKYK 69 Query: 61 TVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 + + L E+ G+Y +A+N+ A+ + +GG+ P E + LPGVGR Sbjct: 70 DLQAFVDCDLKELEEDIHSTGFYHNKAKNMKACAKALVEEYGGEVPRNIEALTGLPGVGR 129 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 T IL ++D +VKR+ R P +VE +L P R Sbjct: 130 KTGNLILGNIYHIPSIVVDTHVKRISNRLGLADS-PDPTKVEFQLME----HLPEEFWIR 184 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQ 226 +N ++ LG +CT PKC C LQ+ C ++ + P + K Sbjct: 185 WNTHIIALGRTLCTSQNPKCGECYLQDLCPSSKKDPETWRPVGERKT 231 >UniRef50_C1MWM2 Predicted protein n=1 Tax=Micromonas pusilla CCMP1545 RepID=C1MWM2_9CHLO Length = 335 Score = 241 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 90/333 (27%), Positives = 123/333 (36%), Gaps = 85/333 (25%) Query: 39 MLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVAT 98 M QQTQ+ V Y+ R+ R+PT LA A +EV LW GLGYY RAR L A+ V Sbjct: 1 MSQQTQIDRVATYWLRWTERWPTAAHLAEATQEEVNELWAGLGYYRRARFLLDGARWVVA 60 Query: 99 L--HGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVS--GW 154 GGK P + +PGVG TA A+ S++ + +DGNV RV++R V G Sbjct: 61 EDGGGGKMPSDAASLGKIPGVGPYTASAVASIAFDEPIAAVDGNVLRVVSRLACVRGGGD 120 Query: 155 PGKKEVE--NKLWSLSEQVTPAVGVERFN------------------------------- 181 K ++++ + A N Sbjct: 121 VTKPGTSAGKACKAVADALLCAERPGDHNQARSHEGSRASFFTHRSGSFTTLYLIACVAF 180 Query: 182 -----------------QAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN----------- 213 QAMM+LGA +CT PKC CP+ + C + A Sbjct: 181 RLTDDGLFFSYEIARRIQAMMELGATVCTPRNPKCGECPVASMCASRALELEEEANVITA 240 Query: 214 ----NSWALYPGKKPKQTLPERTGYFLLLQ--HEDE--------------VLLAQRPPSG 253 P K K E T +++ D LL +RP G Sbjct: 241 GKEPFRVTDLPEKDKKADKREETVSVRVVRGTRGDTETGADGASKKSAAFFLLVKRPEGG 300 Query: 254 LWGGLYCFPQFADEESLRQWLAQRQIAADNLTQ 286 L GL+ FP E++ A R A L + Sbjct: 301 LLSGLWEFPSVVMEDTDADEDALRAAADRLLAE 333 >UniRef50_C1SIS5 DNA-(Apurinic or apyrimidinic site) lyase; endonuclease III n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SIS5_9BACT Length = 210 Score = 241 bits (616), Expect = 3e-62, Method: Composition-based stats. Identities = 49/214 (22%), Positives = 97/214 (45%), Gaps = 6/214 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 M + + + ++ + +TP+++ + ++ Q A V + A +P Sbjct: 1 MTKQERAEAFEKYLEEKYPVVVCSLNYQTPFQLLTATILSAQCTDARVNIVTKDLFAAYP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 LA+A +++V + G Y +++N+ A+ + HGG+ P+ +E+ AL GVGR Sbjct: 61 DPFSLADADIEDVAKIIKSTGMYKMKSKNIIGMAKALVENHGGEVPQDMDELLALSGVGR 120 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA + K ++D +VKR+ R ++ ++VE L E++ Sbjct: 121 KTANVVRGNFWQKPGVVVDTHVKRISGRV-GLTDNTTPEKVEKDL----EKLIKGEKQCD 175 Query: 180 FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAAN 213 + ++ G ICT PKC +C + + C A+ Sbjct: 176 WCHRVIYFGREICTARSPKCGICGVSHVCKYYAS 209 >UniRef50_C6A996 A/G-specific adenine glycosylase n=9 Tax=Bifidobacteriaceae RepID=C6A996_BIFLB Length = 329 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 87/259 (33%), Positives = 131/259 (50%), Gaps = 11/259 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A Q ++ W+ R LPW+ +T P+ V + EVM QQTQ++ V+PY+E +M+++P Sbjct: 37 SARQALRELATWWQTSAR-DLPWRYGRTTPWGVLVCEVMSQQTQMSRVVPYWEAWMSQWP 95 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 LA A EV+ W LGY RA L + A+ VA + + P ++E+ ALP VG Sbjct: 96 DAASLAAAEKSEVIRAWGRLGYPRRALRLQECAEVVARDYDDRLPREYDELMALPSVGDY 155 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPA--VGVE 178 TA A+LS + G+ ++D N++R L+R + G +L+ V P Sbjct: 156 TASAVLSFAYGERVAVIDTNIRRALSRAFLGVESLGGSCTP-LERALAWVVLPKAAEQSV 214 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERTGYFLLL 238 +NQA+M+LGA++CT P+C CPLQ C PG K+T P + + Sbjct: 215 LWNQAVMELGALVCTAKAPQCEQCPLQPQCEFVKAG----MPGLGEKRTRPRQRFHGTDR 270 Query: 239 QHEDEVLLAQR--PPSGLW 255 Q +L A R P G Sbjct: 271 QVRGSILQALRALPAGGAL 289 >UniRef50_B2W3B1 A/G-specific adenine glycosylase n=2 Tax=Pleosporineae RepID=B2W3B1_PYRTR Length = 441 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 88/245 (35%), Positives = 137/245 (55%), Gaps = 14/245 (5%) Query: 6 FSAQVLDWYDKYGRKTLPWQI-DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTD 64 + + ++ + GR+ +I K Y+VW+SEVMLQQT+V+TVIPYF ++A++PTV D Sbjct: 3 WRKKWINPAEYEGREEELGRILSKRAYEVWVSEVMLQQTRVSTVIPYFNNWIAKWPTVQD 62 Query: 65 LANAPLDEVLHLWTGLGYYARARNLHKAAQQVA--TLHGGKFPETFEEVAALPGVGRSTA 122 LA A D+VL +W GLGYY+RA LH+ A+++ + G P E+ PG+GR TA Sbjct: 63 LAAANHDDVLSVWKGLGYYSRATRLHEGAKKMMSASEDAGTIPSGAAELQEFPGIGRYTA 122 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVT------PAVG 176 GA+ S++ G+ P+LDGNV RVL+R + K+ + LW +++Q+ P Sbjct: 123 GAVSSIAFGEAEPVLDGNVARVLSRQLGLYVDVKDKKATDILWDMADQLIKYASDFPKTK 182 Query: 177 ----VERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLPERT 232 +NQA+M+LG+ +CT P+C+ CP+Q C A + + P Sbjct: 183 TSATPGLWNQALMELGSTVCTPR-PRCNDCPIQATCRAYSEGKALSDKRQSPTVVPDIED 241 Query: 233 GYFLL 237 + Sbjct: 242 ACSIC 246 Score = 52.1 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 10/72 (13%), Positives = 20/72 (27%), Gaps = 7/72 (9%) Query: 280 AADNLTQLTAFRHTFSHFHLDIVPMWLPVSSFTGCMDEGNA-------LWYNLAQPPSVG 332 A + H F+H L + V + + W + A Sbjct: 369 KARYVAHFPPLVHVFTHLKLTMHAYQFRVEADAAEAVDLECGDEPCTRKWVDAAAMGDET 428 Query: 333 LAAPVERLLQQL 344 L+ + + + L Sbjct: 429 LSTGMRKCWELL 440 >UniRef50_A9UV22 Predicted protein (Fragment) n=1 Tax=Monosiga brevicollis RepID=A9UV22_MONBE Length = 221 Score = 241 bits (615), Expect = 4e-62, Method: Composition-based stats. Identities = 83/222 (37%), Positives = 118/222 (53%), Gaps = 18/222 (8%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQI----------------DKTPYKVWLSEVMLQQTQVAT 47 S ++ WY ++ R LPW+ + Y VW+SEVM QQTQ++ Sbjct: 1 SAIQHALIQWYHQHQR-VLPWRQPSVRRSAESMTHGQDSNTMAYAVWISEVMCQQTQISV 59 Query: 48 VIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPET 107 V YFER++A++PTV LA A L +V W GLGYY+RA LH+AAQ + G FP Sbjct: 60 VTDYFERWIAKWPTVQALAAAQLSDVHQAWAGLGYYSRATRLHEAAQYIVNQLDGSFPTV 119 Query: 108 FEEVAALP-GVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWS 166 + P GVG TA A+ S+ K ++DGNV RVL+R ++ + + +W Sbjct: 120 ARLIFCWPPGVGPYTASAVASIVFAKRVGVVDGNVNRVLSRLGGIAVPLTQDAAKKWMWR 179 Query: 167 LSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGC 208 ++ + + NQAMM+LGA+ICT P+C CPL C Sbjct: 180 QADTLADHAAPGQINQAMMELGALICTPKSPQCHACPLAEHC 221 >UniRef50_C0QU82 Endonuclease III n=4 Tax=Hydrogenothermaceae RepID=C0QU82_PERMH Length = 215 Score = 240 bits (614), Expect = 5e-62, Method: Composition-based stats. Identities = 49/208 (23%), Positives = 88/208 (42%), Gaps = 6/208 (2%) Query: 8 AQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLAN 67 V++ K+ + P+++ ++ ++ Q V F ++P +A Sbjct: 12 QAVIEGLKKHFPEPWIDLKFSNPFQLLVATILAAQATDKKVNEVTAVFFKKYPDPESIAK 71 Query: 68 APLDEVLHLWTGLGYYAR-ARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAIL 126 APL+++ + + +Y R A+ L + + + GK P+ +++ LPGVGR TA IL Sbjct: 72 APLEQIENDIKQINFYRRKAKLLKECCEAIVKEFNGKIPDNIDDLTKLPGVGRKTASVIL 131 Query: 127 SLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMD 186 + K ++D +VKRV R ++ +E L ++AM+ Sbjct: 132 VNAFNKPAIVVDTHVKRVSQRL-GITESNNPDRIEKDLAEF----FSKENWIFISKAMVL 186 Query: 187 LGAMICTRSKPKCSLCPLQNGCIAAANN 214 G IC PKC C L + C N Sbjct: 187 FGRYICKAKNPKCKECALLDICPYDKKN 214 >UniRef50_Q18HE6 A/G-specific adenine glycosylase n=12 Tax=Halobacteriaceae RepID=Q18HE6_HALWD Length = 337 Score = 240 bits (613), Expect = 5e-62, Method: Composition-based stats. Identities = 71/247 (28%), Positives = 111/247 (44%), Gaps = 11/247 (4%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPT 61 F +++ WY+ R+ PW+ PY + +SEVM QTQ+ V+ ++ F+ R+PT Sbjct: 21 DIDVFRNRLISWYEAEHRE-FPWRETDDPYAILVSEVMSHQTQLDRVVEAWKDFIQRWPT 79 Query: 62 VTDLANAPLDEVLHLWTG--LGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 V LA V+ W+ LGY RA LH+AA QV + G P +E+ +L GVG Sbjct: 80 VKALAGDSQSAVVTFWSEHALGYNNRASYLHEAANQVVDEYDGTVPADPDELLSLMGVGP 139 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+ S + ++D NV+RVL R + E ++ + P Sbjct: 140 YTANAVASFAFNNGDAVVDTNVERVLYRVFKQIRQADDPPYEQ----IASALLPVERSRT 195 Query: 180 FNQAMMDLGAMICTRSKPKCSL--CPLQNGCIAAANNSWALYPGKK-PKQTLPERTGYFL 236 +N A+M+LG + C P+C CP + C A + P R Sbjct: 196 WNNAIMELGGVACK-KTPRCDEANCPWRQWCHAYQTGDFTAPDVPTQPSFEGSRRQFRGR 254 Query: 237 LLQHEDE 243 +++ E Sbjct: 255 IVRTLGE 261 >UniRef50_C0QG99 Putative endonuclease III n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QG99_DESAH Length = 212 Score = 240 bits (612), Expect = 7e-62, Method: Composition-based stats. Identities = 63/217 (29%), Positives = 96/217 (44%), Gaps = 7/217 (3%) Query: 1 MQAS-QFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M+ S + +L + KTP+++ ++ ++ Q V RF Sbjct: 1 MEISTAYINTILRILKRQYPTVKTQLAHKTPFQLLVATILSAQCTDVQVNRVTPVLFDRF 60 Query: 60 PTVTDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 PT LA A LDE+ + G+Y +A+N+ AQ + T+HGG P + + LPGVG Sbjct: 61 PTPDKLAGASLDEIKPIVFSTGFYNNKAKNIKACAQSIMTVHGGIVPTSMTALTGLPGVG 120 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 R TA + S++ G ++D +V RV R G +VE+ L + + P Sbjct: 121 RKTANLVRSVAFGMDTIVVDTHVYRVSRRLGLSKGL-NPAKVESDLMA----IIPQKSWN 175 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 M+ LG C KP C CPLQ C + NS Sbjct: 176 DLCLQMIYLGREFCDARKPLCRKCPLQEICPSNMKNS 212 >UniRef50_Q11NT5 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=28 Tax=Bacteroidetes RepID=Q11NT5_CYTH3 Length = 218 Score = 239 bits (611), Expect = 9e-62, Method: Composition-based stats. Identities = 56/217 (25%), Positives = 92/217 (42%), Gaps = 6/217 (2%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 MQ + + LD++ + + + PY++ ++ + Q V R+P Sbjct: 1 MQRKERFEKFLDYFSTHSPEPETELVYSNPYELLVAVSLSAQCTDKRVNLTTPALFNRYP 60 Query: 61 TVTDLANAPLDEVLHLWTGLGY-YARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 LA A DEV H + Y +A++L AQ + G+ P+T E++ LPGVGR Sbjct: 61 DAASLAKATSDEVFHYIRSISYPNNKAKHLVGMAQMLMKDFNGEVPDTVEDLVKLPGVGR 120 Query: 120 STAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKK-EVENKLWSLSEQVTPAVGVE 178 TA I S+ + +D +V RV R V VE +L + P V Sbjct: 121 KTANVIASVIWQQPTMAVDTHVYRVSRRIGLVPQTATTPLAVEKQLM----KYIPTALVH 176 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 + + ++ G C PKC +CP+ C+ + N Sbjct: 177 KAHHWLILHGRYTCLARTPKCEVCPVTEICMWYSKNK 213 >UniRef50_Q2FR30 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FR30_METHJ Length = 215 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. Identities = 53/217 (24%), Positives = 94/217 (43%), Gaps = 8/217 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + + +++ YD + + K PY+ ++ ++ QT V ++P Sbjct: 4 LDPNTILSRLRSRYDLLNSRD-EFLHFKNPYETLIATILSAQTTDRCVNMVTRELFMKYP 62 Query: 61 TVTDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 V L+ AP+ +V L G++ +ARN+ A+Q V G+ P+ +++ LPGVGR Sbjct: 63 DVAALSEAPVQDVEKLIHPTGFFRTKARNIIAASQMVMKEFDGRVPDEMDDLVRLPGVGR 122 Query: 120 STAGAILSLSLGK-HFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 TA +L + K +D +V+RV R + + + L +V P Sbjct: 123 KTANIVLDHAFSKTVGIAVDTHVRRVSMRLGL----TDESDPDRIEMDLV-RVFPKEFWA 177 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNS 215 N + G +CT P C C L + C A++N Sbjct: 178 EINGLFILHGRRVCTARHPACDNCNLADLCRYASSNK 214 >UniRef50_A1A319 Probable A/G-specific adenine glycosylase n=3 Tax=Bifidobacterium RepID=A1A319_BIFAA Length = 340 Score = 239 bits (610), Expect = 1e-61, Method: Composition-based stats. Identities = 85/260 (32%), Positives = 132/260 (50%), Gaps = 19/260 (7%) Query: 2 QASQFSAQVLDWYDKYGRKTLPWQIDKT-PYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 A + + + W+ R LPW+ +T P+ V +SEVM QQTQ+ V+PY+ +M R+P Sbjct: 39 DADEIATTLAQWWQTNAR-DLPWRFGRTTPWGVLVSEVMSQQTQMGRVVPYWTAWMDRWP 97 Query: 61 TVTDLANAPLDEVLHLWTGLGYYARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRS 120 LA+AP +V+ W LGY RA L + A +A + + P T++E+ ALPG+G Sbjct: 98 DAAALADAPKSDVITAWGRLGYPRRALRLQECAHVIAYDYADELPHTYDELLALPGIGDY 157 Query: 121 TAGAILSLSLGKHFPILDGNVKRVLARCY-AVSGWPGKKEVENKLWSLSEQVTPAVGVER 179 TA A+LS + G+ ++D N++RVL+R + V G +L++++ P + Sbjct: 158 TASAVLSFAFGERIAVVDTNIRRVLSRAFVGVESLGGSASPAE--RALAKRLLPDDDSAK 215 Query: 180 ----------FNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQTLP 229 +NQA+M+LGA +CT P C CP+ C N PG ++T P Sbjct: 216 CRRFDRPSVVWNQAVMELGATVCTAKSPLCEACPIAGKCAFLRNGR----PGLGQRRTRP 271 Query: 230 ERTGYFLLLQHEDEVLLAQR 249 + Q VL A R Sbjct: 272 RQRFQGTDRQVRGLVLNALR 291 >UniRef50_Q2R7H4 Endonuclease III-like protein 1, putative, expressed n=7 Tax=Poaceae RepID=Q2R7H4_ORYSJ Length = 362 Score = 238 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 53/234 (22%), Positives = 91/234 (38%), Gaps = 13/234 (5%) Query: 3 ASQFSAQVLDWYDKYGRKT----LPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMAR 58 D G + LP + + V +S +M QT+ ER + Sbjct: 130 IKNMRLSGEAPVDTKGCEKAGSLLP--PKERRFAVLISTMMSSQTKDEVTHAAVERLSEK 187 Query: 59 -FPTVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPG 116 + + +L +G+Y +A+ + +A++ GG P++ E+ AL G Sbjct: 188 GLLDPDAIVRTDEATLANLIKPVGFYQRKAKFIKEASKICLERFGGDIPDSLNELLALKG 247 Query: 117 VGRSTAGAILSLSL-GKHFPILDGNVKRVLARCYAV---SGWPGKKEVENKLWSLSEQVT 172 VG A ++S++ +D +V R+ R V E SL E+ Sbjct: 248 VGPKMAHLVMSIAWKNTQGICVDTHVHRISNRLGWVFREGTKQKTTTPEQTRMSL-EKWL 306 Query: 173 PAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSWALYPGKKPKQ 226 P E N ++ G ICT +PKC +C + N C +A S + P +K + Sbjct: 307 PKDEWEPINPLLVGFGQTICTPLRPKCDMCGINNICPSAFKESSSPNPKQKKMR 360 >UniRef50_A5UJU9 Endonuclease III n=5 Tax=Archaea RepID=A5UJU9_METS3 Length = 210 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 47/185 (25%), Positives = 87/185 (47%), Gaps = 6/185 (3%) Query: 27 DKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLWTGLGYYA-R 85 +K PYKV + ++ Q+T+ + ++ + ++ +AP D+V L G+Y + Sbjct: 30 NKDPYKVLVRTILSQRTRDENTDQATKNLFGKYKNIYEIVDAPTDDVEELIRCSGFYRVK 89 Query: 86 ARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAILSLSLGKHFPILDGNVKRVL 145 A + + ++ + +GG+ P+ +E+ LPGVGR TA +L + +D +V R+ Sbjct: 90 AARIKEVSRILIDQYGGEVPDNLKELVELPGVGRKTANCVLVYAFELPAIPVDTHVHRIS 149 Query: 146 ARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMMDLGAMICTRSKPKCSLCPLQ 205 R V K E L+ ++ P + N M+ G IC P+C +CP+ Sbjct: 150 NRIGLV----NTKTPEQTEVELA-KIAPKELWIKLNDLMVQFGQTICKPMSPQCEMCPIS 204 Query: 206 NGCIA 210 + C Sbjct: 205 DICDY 209 >UniRef50_Q2RH98 DNA-(Apurinic or apyrimidinic site) lyase / endonuclease III n=1 Tax=Moorella thermoacetica ATCC 39073 RepID=Q2RH98_MOOTA Length = 233 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 48/207 (23%), Positives = 83/207 (40%), Gaps = 6/207 (2%) Query: 4 SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVT 63 +L + P+++ ++ ++ QT V R+PT Sbjct: 25 RDRVEAILHLLRAAYPGARSRLNFRNPFELLVAAILSAQTTDDQVNKVTGELFRRYPTPE 84 Query: 64 DLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTA 122 LA A +EV LG Y +A +L A + + +GG+ P+ E++ L GVGR A Sbjct: 85 VLAAADPEEVAACIKSLGLYRTKAAHLVAACRTLVREYGGRVPDKLEDLLRLHGVGRKVA 144 Query: 123 GAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQ 182 +LS + G+ +D +V RV R +E E +L + P + Sbjct: 145 NVVLSNAFGRDVIAVDTHVFRVANRLGLARAG-DVRETERQLMAA----LPPGSRGEAHH 199 Query: 183 AMMDLGAMICTRSKPKCSLCPLQNGCI 209 ++ G +C P+C C L++ C Sbjct: 200 LLIYHGREVCRARNPRCRDCTLRSYCR 226 >UniRef50_C5WHA6 Endonuclease III n=7 Tax=Firmicutes RepID=C5WHA6_STRDG Length = 222 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. Identities = 55/214 (25%), Positives = 98/214 (45%), Gaps = 6/214 (2%) Query: 1 MQA-SQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARF 59 M+ + +VL + + +TP+++ ++ ++ QT V A++ Sbjct: 5 MRIGKERLRKVLALIAEMFPEAKGELDWETPFQLLIAVILSAQTTDKAVNKVTPALWAKY 64 Query: 60 PTVTDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVG 118 P + DLA+A + +V + +G Y +A+N+ K AQ + T GG+ P+T +E+ +LPGVG Sbjct: 65 PEIEDLASANVTDVENCLRTIGLYKNKAKNIIKTAQAMLTDFGGQVPKTHKELESLPGVG 124 Query: 119 RSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 R TA +L+ +D +V RV R + E+E L + P Sbjct: 125 RKTANVVLAEVYAVPAIAVDTHVSRVAKRLNVSAPDADVTEIEQDLMAK----IPKKDWI 180 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAA 212 + ++ G C PKC++CP+Q C Sbjct: 181 ITHHRLIFFGRYHCLAKNPKCAICPVQTYCKYYK 214 >UniRef50_A4E9V4 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E9V4_9ACTN Length = 221 Score = 237 bits (606), Expect = 4e-61, Method: Composition-based stats. Identities = 54/210 (25%), Positives = 97/210 (46%), Gaps = 4/210 (1%) Query: 3 ASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTV 62 + + +V + ++ + + P+++ ++ ++ QT A V + A++PT Sbjct: 9 KRERAVEVCERLNRRYGPVECFLDHENPFRLLIAVLLSAQTTDAQVNKVTPKLFAQWPTP 68 Query: 63 TDLANAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRST 121 +A A + +V LG+Y ++A++ +AAQ + +GG+ P +E+ LPGVGR T Sbjct: 69 EAMAGASVADVADTIKSLGFYKSKAKHAVEAAQMIVADYGGEVPADMKELVKLPGVGRKT 128 Query: 122 AGAILSLSLGKHF-PILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERF 180 A +L++ G +D +V R+ R KE L ++ P E Sbjct: 129 ANIVLNVGYGIVEGIAVDTHVNRIAHRLMLSPKTHA-KEPLKTEQDLL-KILPHEYWESV 186 Query: 181 NQAMMDLGAMICTRSKPKCSLCPLQNGCIA 210 N + G IC KPKC CPL + C + Sbjct: 187 NHQWITFGREICDARKPKCDECPLADLCPS 216 >UniRef50_B2J6L0 Endonuclease III n=10 Tax=Bacteria RepID=B2J6L0_NOSP7 Length = 229 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. Identities = 50/218 (22%), Positives = 87/218 (39%), Gaps = 7/218 (3%) Query: 1 MQASQFSAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFP 60 + + S ++L + TP ++ ++ ++ Q V +FP Sbjct: 15 LSKKKRSLEILARLKRLYPDATCSLNYSTPVQLLVATILSAQCTDERVNKVTPALFGKFP 74 Query: 61 TVTDLANAPLDEVLHLWTGLGYYA-RARNLHKAAQQVATLHGGKFPETFEEVAALPGVGR 119 LA A L E+ L G+Y +A+N+ A + + T P E++ LPGV R Sbjct: 75 DAQSLAIADLVELESLVRSTGFYRNKAKNIQAACRMIVTEFNSVVPNQMEQLLKLPGVAR 134 Query: 120 STAGAILSLSLGKHFPI-LDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVE 178 TA +L+ + G + + +D +VKR+ R ++ +E L L P E Sbjct: 135 KTANVVLAHAYGINAGVTVDTHVKRLCQRL-GLTEAKDPVRIEQDLMGL----LPQSDWE 189 Query: 179 RFNQAMMDLGAMICTRSKPKCSLCPLQNGCIAAANNSW 216 ++ ++ G IC P C C L + C A Sbjct: 190 NWSIRLIYHGRAICKARSPVCVACELADLCPVANKPMV 227 >UniRef50_C0Z686 Endonuclease III n=134 Tax=Firmicutes RepID=C0Z686_BREBN Length = 227 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. Identities = 46/203 (22%), Positives = 84/203 (41%), Gaps = 6/203 (2%) Query: 7 SAQVLDWYDKYGRKTLPWQIDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLA 66 A +LD + TP+++ ++ ++ Q V + Sbjct: 8 VADILDNLQQLYPDAHCELNYTTPFELLIATILSAQCTDKRVNEITAPMFQQLNQPEHYL 67 Query: 67 NAPLDEVLHLWTGLGYY-ARARNLHKAAQQVATLHGGKFPETFEEVAALPGVGRSTAGAI 125 + +E+ GLG Y +++N+ + + + + + P+T E+ ALPGVGR TA + Sbjct: 68 HLTQEEMEEHIKGLGLYKNKSKNILETCRILYEKYNSEVPQTHAELEALPGVGRKTANVV 127 Query: 126 LSLSLGKHFPILDGNVKRVLARCYAVSGWPGKKEVENKLWSLSEQVTPAVGVERFNQAMM 185 LS + G +D +V RV R + EVE +L + P + ++ Sbjct: 128 LSNAFGIPAIAVDTHVFRVGNRLGLANSD-NVDEVERQLM----KRIPKEKWTDAHHWLI 182 Query: 186 DLGAMICTRSKPKCSLCPLQNGC 208 G +C+ P+C C LQ+ C Sbjct: 183 WHGRRVCSSRNPQCGSCTLQSMC 205 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.164 0.514 Lambda K H 0.267 0.0503 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,535,373,433 Number of Sequences: 3077464 Number of extensions: 120136176 Number of successful extensions: 325551 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 1832 Number of HSP's successfully gapped in prelim test: 1031 Number of HSP's that attempted gapping in prelim test: 318056 Number of HSP's gapped (non-prelim): 3288 length of query: 350 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 221 effective length of database: 643,403,500 effective search space: 142192173500 effective search space used: 142192173500 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.0 bits) S2: 94 (40.5 bits)