BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (282 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P04395 DNA-3-methyladenine glycosylase 2 n=122 Tax=Ente... 577 e-163 UniRef50_A8GHQ3 Transcriptional regulator, AraC family n=3 Tax=P... 249 1e-64 UniRef50_A3ETF2 Putative Ada DNA repair protein and transcriptio... 194 2e-48 UniRef50_D1R7A9 Putative uncharacterized protein n=1 Tax=Parachl... 183 5e-45 UniRef50_UPI0001BC59A0 AraC family transcriptional regulator n=1... 181 3e-44 UniRef50_Q6MA41 Putative DNA-3-methyladenine glycosidase II n=3 ... 178 2e-43 UniRef50_A8MFS4 Transcriptional regulator, AraC family n=2 Tax=C... 177 3e-43 UniRef50_D2L7I2 Transcriptional regulator, AraC family n=1 Tax=D... 176 9e-43 UniRef50_Q2SDC7 Adenosine deaminase n=3 Tax=Bacteria RepID=Q2SDC... 170 5e-41 UniRef50_Q46QC3 Transcriptional regulator Ada n=25 Tax=cellular ... 164 3e-39 UniRef50_A6SU78 Methylated-DNA-[protein]-cysteine S-methyltransf... 163 5e-39 UniRef50_C8QGF9 Transcriptional regulator, AraC family n=1 Tax=P... 163 6e-39 UniRef50_C0WE04 Transcriptional regulator n=1 Tax=Acidaminococcu... 157 4e-37 UniRef50_Q2RNZ4 Transcriptional regulator Ada / DNA-3-methyladen... 156 8e-37 UniRef50_A7HP34 Ada metal-binding domain protein n=3 Tax=Bacteri... 155 1e-36 UniRef50_Q1IT49 DNA-3-methyladenine glycosylase II / Transcripti... 153 6e-36 UniRef50_Q0AGQ6 DNA-3-methyladenine glycosylase II / DNA-O6-meth... 152 2e-35 UniRef50_Q02KH7 DNA-3-methyladenine glycosidase II n=8 Tax=Pseud... 149 7e-35 UniRef50_B9DJS2 Putative uncharacterized protein n=1 Tax=Staphyl... 148 2e-34 UniRef50_Q2T2N2 DNA-3-methyladenine glycosylase II n=65 Tax=Burk... 148 2e-34 UniRef50_A5KSU6 Transcriptional regulator, AraC family n=1 Tax=c... 144 3e-33 UniRef50_B0SWZ0 Transcriptional regulator, AraC family n=7 Tax=B... 143 5e-33 UniRef50_B1ZFN9 AlkA domain protein n=6 Tax=Methylobacterium Rep... 143 6e-33 UniRef50_Q12D18 Transcriptional regulator Ada / DNA-O6-methylgua... 142 2e-32 UniRef50_C7R5W7 Transcriptional regulator, AraC family n=1 Tax=K... 139 1e-31 UniRef50_Q2IPL2 Transcriptional regulator Ada / DNA-O6-methylgua... 137 4e-31 UniRef50_B4S0Y6 Ada regulatory protein n=3 Tax=Alteromonas macle... 137 6e-31 UniRef50_A7HG85 AlkA domain protein n=2 Tax=Myxococcales RepID=A... 134 3e-30 UniRef50_A6EY17 Transcriptional Regulator, AraC family protein n... 130 5e-29 UniRef50_B0KRT0 AlkA domain protein n=1 Tax=Pseudomonas putida G... 129 8e-29 UniRef50_B0RQX4 DNA methylation and regulatory protein (Methylat... 129 9e-29 UniRef50_A1TR03 DNA-O6-methylguanine--protein-cysteine S-methylt... 129 9e-29 UniRef50_A1WKZ8 DNA-3-methyladenine glycosylase II / Transcripti... 123 6e-27 UniRef50_Q15P13 DNA-O6-methylguanine--protein-cysteine S-methylt... 122 1e-26 UniRef50_D1BI44 DNA-3-methyladenine glycosylase II /DNA-O6-methy... 119 1e-25 UniRef50_C5C5F4 HhH-GPD family protein n=1 Tax=Beutenbergia cave... 118 2e-25 UniRef50_A4SQS2 DNA methylation and regulatory protein n=2 Tax=A... 116 8e-25 UniRef50_Q1QTR7 Transcriptional regulator Ada / DNA-3-methyladen... 116 1e-24 UniRef50_UPI0001901D5D methylated-DNA--protein-cysteine methyltr... 115 2e-24 UniRef50_Q10630 Methylated-DNA--protein-cysteine methyltransfera... 115 2e-24 UniRef50_B7RWC6 AlkA N-terminal domain family protein n=1 Tax=ma... 114 3e-24 UniRef50_Q6MR46 DNA methylation and regulatory protein Ada n=1 T... 114 4e-24 UniRef50_C4DFD0 DNA-3-methyladenine glycosylase II; Transcriptio... 108 2e-22 UniRef50_UPI0000E0EED3 Ada family regulatory protein n=1 Tax=Gla... 108 3e-22 UniRef50_Q3IBU8 Putative ADA regulatory protein (Regulatory prot... 107 3e-22 UniRef50_A8LHD8 Transcriptional regulator, AraC family n=4 Tax=A... 107 5e-22 UniRef50_C0Q970 AlkA n=1 Tax=Desulfobacterium autotrophicum HRM2... 107 6e-22 UniRef50_A3XSB2 Ada regulatory protein n=1 Tax=Vibrio sp. MED222... 105 2e-21 UniRef50_Q1ZAD8 Hypothetical ada regulatory protein n=2 Tax=Phot... 105 3e-21 UniRef50_Q7MGD3 Adenosine deaminase n=51 Tax=Vibrionales RepID=Q... 103 5e-21 UniRef50_D0LE01 Ada metal-binding domain protein n=1 Tax=Gordoni... 102 1e-20 UniRef50_C7MYM6 DNA-3-methyladenine glycosylase II /DNA-O6-methy... 102 2e-20 UniRef50_D1C0H7 Transcriptional regulator, AraC family n=1 Tax=X... 102 2e-20 UniRef50_A1S7Q4 DNA-3-methyladenine glycosylase II / DNA-O6-meth... 99 1e-19 UniRef50_C1YI07 DNA-O6-methylguanine--protein-cysteine S-methylt... 99 1e-19 UniRef50_Q12L65 DNA-O6-methylguanine--protein-cysteine S-methylt... 99 2e-19 UniRef50_C8XKJ9 AlkA domain protein n=1 Tax=Nakamurella multipar... 99 2e-19 UniRef50_B2GIR9 Putative methylated-DNA--protein-cysteine methyl... 99 2e-19 UniRef50_A0JV31 DNA-O6-methylguanine--protein-cysteine S-methylt... 98 4e-19 UniRef50_A5CSR4 Putative DNA glycosylase n=2 Tax=Clavibacter mic... 96 2e-18 UniRef50_C1RNZ7 DNA-3-methyladenine glycosylase II n=1 Tax=Cellu... 95 2e-18 UniRef50_A3D6C4 Transcriptional regulator Ada / DNA-3-methyladen... 94 7e-18 UniRef50_A6WG49 HhH-GPD family protein n=5 Tax=Actinomycetales R... 93 8e-18 UniRef50_C0ZIT0 DNA-3-methyladenine glycosylase II n=75 Tax=Baci... 92 2e-17 UniRef50_A4BNP3 3-methyladenine DNA glycosylase/8-oxoguanineDNA ... 91 6e-17 UniRef50_Q1YTX8 Putative DNA-3-methyladenine glycosylase II n=1 ... 90 8e-17 UniRef50_C6D2P4 DNA-3-methyladenine glycosylase II n=1 Tax=Paeni... 90 9e-17 UniRef50_C8SYC6 DNA-3-methyladenine glycosylase 2 (Fragment) n=1... 89 1e-16 UniRef50_C2AV46 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 88 3e-16 UniRef50_D0J4I7 HhH-GPD n=2 Tax=Comamonas testosteroni RepID=D0J... 85 2e-15 UniRef50_P37878 DNA-3-methyladenine glycosylase n=4 Tax=Bacillac... 84 5e-15 UniRef50_C7NLP9 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 84 6e-15 UniRef50_A1ZCF3 HhH-GPD n=1 Tax=Microscilla marina ATCC 23134 Re... 83 9e-15 UniRef50_D2PPK3 Transcriptional regulator, AraC family n=1 Tax=K... 82 3e-14 UniRef50_Q7N9Z6 Similarities with the C-terminal region of 3-met... 81 3e-14 UniRef50_Q5NXL1 DNA-3-methyladenine glycosidase II n=3 Tax=Betap... 80 7e-14 UniRef50_C6MGP3 HhH-GPD family protein n=1 Tax=Nitrosomonas sp. ... 79 2e-13 UniRef50_C7QDZ2 Transcriptional regulator, AraC family n=2 Tax=A... 79 2e-13 UniRef50_D1CD20 DNA-3-methyladenine glycosylase II n=1 Tax=Therm... 78 3e-13 UniRef50_C0Z5U6 Putative DNA-3-methyladenine glycosylase II n=1 ... 77 5e-13 UniRef50_D1Z1B8 Putative DNA glycosidase n=1 Tax=Methanocella pa... 77 8e-13 UniRef50_C6XZ60 HhH-GPD family protein n=1 Tax=Pedobacter hepari... 75 3e-12 UniRef50_C4L050 DNA-3-methyladenine glycosylase II n=4 Tax=Bacil... 73 1e-11 UniRef50_C6W476 HhH-GPD family protein n=1 Tax=Dyadobacter ferme... 73 1e-11 UniRef50_B9XBY0 HhH-GPD family protein n=1 Tax=bacterium Ellin51... 73 1e-11 UniRef50_C7PMW8 8-oxoguanine DNA glycosylase domain protein n=1 ... 72 2e-11 UniRef50_Q82VT3 HhH-GPD n=2 Tax=Betaproteobacteria RepID=Q82VT3_... 71 4e-11 UniRef50_Q9ZET9 DNA-3-methyladenine glycosidase (Fragment) n=1 T... 71 5e-11 UniRef50_D1P0X5 DNA-3-methyladenine glycosylase II n=4 Tax=Enter... 71 5e-11 UniRef50_D1C1F2 HhH-GPD family protein n=1 Tax=Sphaerobacter the... 70 7e-11 UniRef50_A9B7A8 Transcriptional regulator, AraC family n=1 Tax=H... 70 1e-10 UniRef50_B8GAB8 DNA-3-methyladenine glycosylase II n=3 Tax=Chlor... 69 2e-10 UniRef50_C7MAP3 Adenosine deaminase n=1 Tax=Brachybacterium faec... 69 2e-10 UniRef50_B4X1U6 Base excision DNA repair protein, HhH-GPD family... 69 2e-10 UniRef50_Q2BC23 DNA-3-methyladenine glycosylase II n=1 Tax=Bacil... 68 4e-10 UniRef50_A5KST9 DNA-3-methyladenine glycosylase II n=1 Tax=candi... 67 9e-10 UniRef50_O31544 Putative DNA-3-methyladenine glycosylase yfjP n=... 67 1e-09 UniRef50_Q2FMK1 HhH-GPD n=1 Tax=Methanospirillum hungatei JF-1 R... 66 1e-09 UniRef50_A9BVD9 HhH-GPD family protein n=1 Tax=Delftia acidovora... 66 1e-09 UniRef50_D1VDS6 HhH-GPD family protein n=3 Tax=Actinomycetales R... 66 2e-09 UniRef50_C6A294 AlkA 3-methyladenine DNA glycosylase n=9 Tax=The... 65 2e-09 UniRef50_Q0VPN7 Putative uncharacterized protein n=1 Tax=Alcaniv... 65 3e-09 UniRef50_B6EMH3 DNA repair protein n=2 Tax=Gammaproteobacteria R... 65 3e-09 UniRef50_Q1AWP7 DNA-3-methyladenine glycosylase II n=1 Tax=Rubro... 64 5e-09 UniRef50_C6WJ98 Transcriptional regulator, AraC family n=5 Tax=A... 63 1e-08 UniRef50_B3T536 Putative HhH-GPD superfamily base excision DNA r... 63 1e-08 UniRef50_A5WCQ9 HhH-GPD family protein n=2 Tax=Psychrobacter Rep... 63 1e-08 UniRef50_B5IDT4 Base excision DNA repair protein, HhH-GPD family... 62 2e-08 UniRef50_B1ZV80 Transcriptional regulator, AraC family n=2 Tax=O... 62 3e-08 UniRef50_D1RHI7 HhH-GPD family base excision repair protein n=1 ... 60 6e-08 UniRef50_B8IZY6 HhH-GPD family protein n=8 Tax=Bacteria RepID=B8... 60 7e-08 UniRef50_UPI00016C4C1A DNA-3-methyladenine glycosylase II n=1 Ta... 60 7e-08 UniRef50_B9LPN6 HhH-GPD family protein n=4 Tax=Halobacteriaceae ... 60 7e-08 UniRef50_Q1ITU3 DNA-3-methyladenine glycosylase II n=2 Tax=Bacte... 60 8e-08 UniRef50_Q92383 DNA-3-methyladenine glycosylase 1 n=1 Tax=Schizo... 60 1e-07 UniRef50_Q81IC3 DNA-3-methyladenine glycosylase II n=75 Tax=Baci... 59 1e-07 UniRef50_A8TVS7 HhH-GPD n=1 Tax=alpha proteobacterium BAL199 Rep... 59 1e-07 UniRef50_B4CYJ1 DNA-3-methyladenine glycosylase II n=1 Tax=Chtho... 59 2e-07 UniRef50_Q01SY7 DNA-3-methyladenine glycosylase II n=1 Tax=Candi... 59 2e-07 UniRef50_Q1J274 Endonuclease III, DNA-3-methyladenine glycosidas... 59 2e-07 UniRef50_Q3INX6 DNA N-glycosylase / DNA lyase n=6 Tax=Halobacter... 59 2e-07 UniRef50_C1DYL3 Predicted protein n=2 Tax=Micromonas RepID=C1DYL... 59 3e-07 UniRef50_C1D8D7 HhH-GPD family protein n=1 Tax=Laribacter hongko... 58 3e-07 UniRef50_Q1H1S0 DNA-3-methyladenine glycosylase II n=1 Tax=Methy... 58 3e-07 UniRef50_Q5FSB3 DNA-3-methyladenine glycosylase n=1 Tax=Gluconob... 58 3e-07 UniRef50_A5V920 HhH-GPD family protein n=7 Tax=Sphingomonadales ... 58 3e-07 UniRef50_Q6CEP5 YALI0B14080p n=1 Tax=Yarrowia lipolytica RepID=Q... 58 4e-07 UniRef50_Q0BSG3 DNA-3-methyladenine glycosylase II n=12 Tax=Prot... 58 4e-07 UniRef50_Q0USE2 Putative uncharacterized protein n=1 Tax=Phaeosp... 58 4e-07 UniRef50_Q0BWS7 Putative DNA-3-methyladenine glycosylase n=1 Tax... 57 5e-07 UniRef50_B7K2N0 DNA-3-methyladenine glycosylase II n=5 Tax=Chroo... 57 5e-07 UniRef50_C7MP98 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 57 9e-07 UniRef50_Q2SX77 DNA-3-methyladenine glycosylase n=60 Tax=Betapro... 57 9e-07 UniRef50_A6GQ39 3-methyladenine DNA glycosylase II n=1 Tax=Limno... 57 9e-07 UniRef50_UPI00018509D2 YfjP n=1 Tax=Bacillus coahuilensis m4-4 R... 57 1e-06 UniRef50_B2SXP8 HhH-GPD family protein n=39 Tax=Betaproteobacter... 56 1e-06 UniRef50_B2B817 Predicted CDS Pa_2_12990 n=8 Tax=Leotiomyceta Re... 56 1e-06 UniRef50_B1YMD5 HhH-GPD family protein n=1 Tax=Exiguobacterium s... 56 1e-06 UniRef50_D0XPK8 HhH-GPD family protein n=1 Tax=Brevundimonas sub... 56 1e-06 UniRef50_O28163 3-methyladenine DNA glycosylase (AlkA) n=1 Tax=A... 56 2e-06 UniRef50_B6G8M1 Putative uncharacterized protein n=1 Tax=Collins... 55 2e-06 UniRef50_A8IJX2 HhH-GPD protein n=1 Tax=Azorhizobium caulinodans... 55 3e-06 UniRef50_C6CD76 DNA-3-methyladenine glycosylase II n=1 Tax=Dicke... 55 4e-06 UniRef50_B3EJD3 8-oxoguanine DNA glycosylase domain protein n=1 ... 54 4e-06 UniRef50_A0RYQ2 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 54 5e-06 UniRef50_A9EU33 Methylated-DNA--protein-cysteine methyltransfera... 54 6e-06 UniRef50_A6CCG3 Probable DNA-3-methyladenine glycosylase n=1 Tax... 54 6e-06 UniRef50_C8W0S2 HhH-GPD family protein n=6 Tax=Bacteria RepID=C8... 54 6e-06 UniRef50_C7PK12 HhH-GPD family protein n=1 Tax=Chitinophaga pine... 54 6e-06 UniRef50_Q5K8T8 DNA-3-methyladenine glycosidase, putative n=1 Ta... 54 7e-06 UniRef50_A2QHV8 Contig An04c0070, complete genome n=10 Tax=Eurot... 54 7e-06 UniRef50_A9RKT9 Predicted protein (Fragment) n=1 Tax=Physcomitre... 54 8e-06 UniRef50_B0U6C0 DNA-3-methyladenine glycosidase n=16 Tax=Xanthom... 54 8e-06 UniRef50_Q972N8 Putative uncharacterized protein ST1094 n=1 Tax=... 53 9e-06 UniRef50_Q9KC25 DNA-3-methyladenine glycosidase n=1 Tax=Bacillus... 53 1e-05 UniRef50_O94468 Probable DNA-3-methyladenine glycosylase 2 n=1 T... 53 1e-05 UniRef50_B5ES79 HhH-GPD family protein n=4 Tax=Acidithiobacillus... 53 1e-05 UniRef50_Q4ZR24 DNA-3-methyladenine glycosylase II n=4 Tax=Pseud... 53 1e-05 UniRef50_Q6BZL7 DEHA2A00418p n=2 Tax=Debaryomyces hansenii RepID... 53 1e-05 UniRef50_A1K6J5 DNA-3-methyladenine glycosylase II n=21 Tax=Prot... 52 2e-05 UniRef50_D1HE56 Whole genome shotgun sequence of line PN40024, s... 52 2e-05 UniRef50_Q3B3Y2 HhH-GPD n=1 Tax=Chlorobium luteolum DSM 273 RepI... 52 3e-05 UniRef50_C8WLI9 HhH-GPD family protein n=4 Tax=Bacteria RepID=C8... 51 5e-05 UniRef50_UPI0001B54083 YfjP n=1 Tax=Streptomyces sp. AA4 RepID=U... 50 6e-05 UniRef50_Q5SLG4 DNA-3-methyladenine glycosidase n=6 Tax=Bacteria... 50 6e-05 UniRef50_A9FBN7 Putative DNA-3-methyladenine glycosidase n=1 Tax... 50 7e-05 UniRef50_B3E6X3 HhH-GPD family protein n=2 Tax=Bacteria RepID=B3... 50 7e-05 UniRef50_B4S806 8-oxoguanine DNA glycosylase domain protein n=1 ... 50 7e-05 UniRef50_B0D0G2 Predicted protein n=1 Tax=Laccaria bicolor S238N... 50 7e-05 UniRef50_UPI0000D54B32 HhH-GPD n=1 Tax=Psychroflexus torquis ATC... 50 8e-05 UniRef50_A9M750 HhH-GPD family protein n=55 Tax=Rhizobiales RepI... 50 8e-05 UniRef50_A7EZ08 Putative uncharacterized protein n=1 Tax=Sclerot... 50 9e-05 UniRef50_A6TTX3 Methylated-DNA--protein-cysteine methyltransfera... 50 1e-04 UniRef50_C5G8B3 DNA-3-methyladenine glycosylase n=8 Tax=Onygenal... 49 1e-04 UniRef50_A9T041 Predicted protein n=1 Tax=Physcomitrella patens ... 49 1e-04 UniRef50_C1A5A1 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatim... 49 2e-04 UniRef50_B2J3A5 HhH-GPD family protein n=1 Tax=Nostoc punctiform... 49 2e-04 UniRef50_Q04UT1 DNA-3-methyladenine glycosylase II n=4 Tax=Lepto... 49 2e-04 UniRef50_D2QEN8 HhH-GPD family protein n=1 Tax=Spirosoma lingual... 49 2e-04 UniRef50_Q9LN45 F18O14.25 n=22 Tax=Magnoliophyta RepID=Q9LN45_ARATH 49 3e-04 UniRef50_D1VAP6 HhH-GPD family protein n=1 Tax=Frankia sp. EuI1c... 49 3e-04 UniRef50_B3QN63 8-oxoguanine DNA glycosylase domain protein n=2 ... 48 4e-04 UniRef50_Q754R1 AFR011Wp n=1 Tax=Eremothecium gossypii RepID=Q75... 48 4e-04 UniRef50_D2LH30 HhH-GPD family protein n=1 Tax=Rhodomicrobium va... 48 4e-04 UniRef50_A9I9J6 DNA-3-methyladenine glycosidase II n=1 Tax=Borde... 48 4e-04 UniRef50_D0J2I3 HhH-GPD n=6 Tax=Comamonadaceae RepID=D0J2I3_COMTE 48 4e-04 UniRef50_A3JFL8 3-methyladenine DNA glycosylase n=1 Tax=Marinoba... 48 4e-04 UniRef50_C0NIP1 Putative uncharacterized protein n=1 Tax=Ajellom... 48 5e-04 UniRef50_B3QVZ3 8-oxoguanine DNA glycosylase domain protein n=1 ... 47 6e-04 UniRef50_A6EE77 3-methyladenine DNA glycosylase n=1 Tax=Pedobact... 47 7e-04 UniRef50_B6K1P6 DNA-3-methyladenine glycosylase n=1 Tax=Schizosa... 47 7e-04 UniRef50_B8GY42 DNA-3-methyladenine glycosylase II n=4 Tax=Caulo... 47 7e-04 UniRef50_A0Z859 HhH-GPD protein n=1 Tax=marine gamma proteobacte... 47 8e-04 UniRef50_D0LW65 DNA-3-methyladenine glycosylase II n=1 Tax=Halia... 47 0.001 UniRef50_C0D4Q9 Putative uncharacterized protein n=2 Tax=Clostri... 47 0.001 UniRef50_B4B851 DNA-3-methyladenine glycosylase II n=2 Tax=Cyano... 46 0.001 UniRef50_Q8TL35 DNA-3-methyladenine glycosylase II n=1 Tax=Metha... 46 0.001 UniRef50_Q55703 Slr0231 protein n=1 Tax=Synechocystis sp. PCC 68... 46 0.001 UniRef50_Q9YFG9 Putative uncharacterized protein n=1 Tax=Aeropyr... 46 0.001 UniRef50_C6IXS6 DNA-3-methyladenine glycosidase n=1 Tax=Paenibac... 46 0.001 UniRef50_B8EL05 HhH-GPD family protein n=5 Tax=Alphaproteobacter... 45 0.002 UniRef50_C0KTC3 8-oxoguanine DNA glycosylase n=1 Tax=Clostridium... 45 0.002 UniRef50_Q11DX8 HhH-GPD n=3 Tax=Rhizobiales RepID=Q11DX8_MESSB 45 0.004 UniRef50_Q7UGU9 DNA-3-methyladenine glycosidase n=1 Tax=Rhodopir... 45 0.004 UniRef50_B6JZD7 DNA-3-methyladenine glycosylase n=1 Tax=Schizosa... 44 0.004 UniRef50_C7MW75 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 44 0.004 UniRef50_C1XHZ0 DNA-3-methyladenine glycosylase II n=2 Tax=Meiot... 44 0.005 UniRef50_D1ZEJ1 Whole genome shotgun sequence assembly, scaffold... 44 0.005 UniRef50_C4DGP2 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 44 0.006 UniRef50_B6BWA6 DNA-3-methyladenine glycosylase 1 n=1 Tax=beta p... 44 0.007 UniRef50_A3LTR9 3-methyladenine DNA glycosylase (Fragment) n=5 T... 44 0.007 UniRef50_B7K9B1 HhH-GPD family protein n=3 Tax=Cyanobacteria Rep... 44 0.009 UniRef50_A8N5M3 Putative uncharacterized protein n=2 Tax=Agarica... 44 0.009 UniRef50_B4SHB1 8-oxoguanine DNA glycosylase domain protein n=3 ... 43 0.010 UniRef50_C7RDZ5 8-oxoguanine DNA glycosylase domain protein n=3 ... 43 0.010 UniRef50_B8PBS7 Predicted protein n=2 Tax=Postia placenta Mad-69... 43 0.011 UniRef50_B4WC67 AlkA N-terminal domain family n=1 Tax=Brevundimo... 43 0.012 UniRef50_Q4A0G9 Putative DNA-3-methyladenine glycosidase n=1 Tax... 43 0.012 UniRef50_Q688W2 Os05g0567500 protein n=2 Tax=Oryza sativa RepID=... 43 0.015 UniRef50_D0JW87 Glycosidase n=1 Tax=Yersinia pestis D182038 RepI... 42 0.016 UniRef50_B2W1R2 DNA-3-methyladenine glycosylase n=1 Tax=Pyrenoph... 42 0.023 UniRef50_C7MNR2 Endonuclease III n=3 Tax=Coriobacteriaceae RepID... 41 0.039 UniRef50_C8WCH4 HhH-GPD family protein n=3 Tax=Zymomonas mobilis... 41 0.040 UniRef50_Q0AQT9 HhH-GPD family protein n=2 Tax=Hyphomonadaceae R... 41 0.041 UniRef50_Q8DCC1 3-methyladenine DNA glycosylase n=6 Tax=Vibrio R... 41 0.052 UniRef50_A4ABI1 DNA-3-methyladenine glycosidase II n=2 Tax=uncla... 40 0.089 >UniRef50_P04395 DNA-3-methyladenine glycosylase 2 n=122 Tax=Enterobacteriaceae RepID=3MG2_ECOLI Length = 282 Score = 577 bits (1488), Expect = e-163, Method: Compositional matrix adjust. Identities = 282/282 (100%), Positives = 282/282 (100%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH Sbjct: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA Sbjct: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE Sbjct: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 Query: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL Sbjct: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 Query: 241 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA Sbjct: 241 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 >UniRef50_A8GHQ3 Transcriptional regulator, AraC family n=3 Tax=Proteobacteria RepID=A8GHQ3_SERP5 Length = 512 Score = 249 bits (635), Expect = 1e-64, Method: Compositional matrix adjust. Identities = 136/286 (47%), Positives = 179/286 (62%), Gaps = 6/286 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIAR 56 ++ L ++PPYDW ML FL RAVS VE V Y R++A+ + Y G V+ P+ + Sbjct: 222 VFHLGYRPPYDWPRMLSFLQTRAVSGVEKVEGQQYLRAIAITQGGIDYHGWVSVQPEESH 281 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 + + + ++ L V E L ++ +LFDL P ++ ALG+L A PGLRLPGCV+ FEQ Sbjct: 282 NRVRVEIAPALSRVTTEVLRRIRQLFDLDAAPDLIVQALGQLAADAPGLRLPGCVNGFEQ 341 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQALKALGM 174 RA+LGQLVSV MAA +A+ +G L+ I FP +++A P+ L+ LG+ Sbjct: 342 ATRAVLGQLVSVKMAATFAGCMAERWGTPLEQPYAGITHVFPNAEQVARLQPEELRPLGV 401 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 LKRA ALI +A A EG L + D+EQ +K L PGIG WTA Y A+R W DVF Sbjct: 402 QLKRAAALIAIARAVTEGRLQLENVLDIEQGIKALTALPGIGSWTACYIAMRAWSWPDVF 461 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 L DYLIKQRFPGMTP QI YAE W+PWRSYA LH+W+ +GW P Sbjct: 462 LTGDYLIKQRFPGMTPRQIENYAECWRPWRSYATLHLWHNQGWVPS 507 >UniRef50_A3ETF2 Putative Ada DNA repair protein and transcriptional regulator, AraC family n=2 Tax=Leptospirillum sp. Group II RepID=A3ETF2_9BACT Length = 480 Score = 194 bits (494), Expect = 2e-48, Method: Compositional matrix adjust. Identities = 106/278 (38%), Positives = 161/278 (57%), Gaps = 6/278 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIAR 56 +++L ++PPY W M FL R V+ VE V++ Y R++ + + ++G ++A D Sbjct: 196 VFSLGFRPPYAWEAMFDFLGNRTVAGVEEVSEKVYRRAVRIRKGGTTFQGWLSAEADNTG 255 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L + LS L PVA LA++ RLFDL+C+P+++ LG LG PG+R+PG D FE Sbjct: 256 KALRLTLSTSLAPVATTVLARVRRLFDLECHPELIADILGPLGMREPGIRVPGAFDGFET 315 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPEYI-CFPTPQRLAAADPQALKALGM 174 VR ILGQ VSV A L R+ +G+ +D +P+ FP+P+R+A D AL LG+ Sbjct: 316 AVRIILGQQVSVQGARTLAGRLVSAHGDPIDTPWPDITRAFPSPERIAGMDASALSGLGI 375 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 R +A++ LA A EG + + D E M+ L++ PGIG WTA A+R D F Sbjct: 376 FGFRIKAILGLAAAVAEGRITLAPGPDPEPQMEALRSIPGIGEWTAQAIAMRVLSWPDAF 435 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 DY I++ +P ++ A++W+PWR+YA L +W Sbjct: 436 PHTDYGIQKALKEKSPRRVLEVAQQWRPWRAYAALALW 473 >UniRef50_D1R7A9 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A9_9CHLA Length = 532 Score = 183 bits (465), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 99/277 (35%), Positives = 154/277 (55%), Gaps = 8/277 (2%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPYDW ++ FL R + VE + Y R++ +G+ +G + + +L L Sbjct: 251 LTYRPPYDWKGVINFLRVRLMKGVEHIEGDRYLRTIQLGKTKGWIQISHAEEKQSLIFEL 310 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR---LGAA---RPGLRLPGCVDAFEQG 117 S L PV L ++ +FDL P +++ L + L A PGLR+PG D FE Sbjct: 311 SHSLLPVLPALLGRIRSVFDLNARPDVISTHLRQDKWLTEAVNVNPGLRIPGAFDGFELA 370 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPEYICF-PTPQRLAAADPQALKALGMP 175 VRAILGQ ++V A L R+ Q +GE++ +PE PTPQRL A L +LG+ Sbjct: 371 VRAILGQQITVKAATTLAGRLVQAFGEKIQTPYPELKHLSPTPQRLTIATVDELASLGII 430 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R++++IHLA + G L + E+ M+ L PGIG+WTA+Y A+R + D F Sbjct: 431 QSRSKSIIHLAEEVVSGRLQLDADVYPEKTMQKLVQIPGIGKWTAHYIAMRALRWPDAFP 490 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +D ++++ +T +Q ++ W+PWRSYA+LH+W Sbjct: 491 KEDIVLRKCLGNVTASQAEILSQSWRPWRSYAVLHLW 527 >UniRef50_UPI0001BC59A0 AraC family transcriptional regulator n=1 Tax=Fusobacterium ulcerans ATCC 49185 RepID=UPI0001BC59A0 Length = 486 Score = 181 bits (458), Expect = 3e-44, Method: Compositional matrix adjust. Identities = 101/283 (35%), Positives = 154/283 (54%), Gaps = 14/283 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAV--GEYR--GVVTAIPDIARHTL 59 L ++PPY W +L FLA RA+ VETV + Y R++ GE + A ++T+ Sbjct: 201 LGYRPPYQWEHILNFLALRAIPGVETVKEGKYYRTVHFLNGEKHIYSWIQAENQPEKNTI 260 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP-----GLRLPGCVDAF 114 + + A L PV ++ LAK+ LFDL C+P V L ++ +P G+R+PGC D F Sbjct: 261 AVTMPAELLPVLSQVLAKVRNLFDLSCDPYAVYEGLMKMNNIQPNICTLGIRVPGCFDPF 320 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQ---RLAAADPQAL 169 E VRA+LGQ +++ A L AR+ + +G ++ E + FP P+ +L +L Sbjct: 321 EMSVRAVLGQQITIKAAKTLAARITEKFGVTIETGIEGLTHIFPEPEDIYKLKDKITDSL 380 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 LG+ RA+ ++ LA+A + + E+ +K L GIG WTA Y A+R Sbjct: 381 GELGIIKTRAKTILELASAFVNKEIDFNFCIHPEEEIKKLMKISGIGNWTAQYIAMRAMG 440 Query: 230 AKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D FL DY +K+ P TP +I AE W+PWRSYA++++W Sbjct: 441 LTDAFLETDYGVKKALPSYTPKEILTLAEAWRPWRSYAVVNLW 483 >UniRef50_Q6MA41 Putative DNA-3-methyladenine glycosidase II n=3 Tax=Bacteria RepID=Q6MA41_PARUW Length = 476 Score = 178 bits (451), Expect = 2e-43, Method: Compositional matrix adjust. Identities = 94/278 (33%), Positives = 152/278 (54%), Gaps = 10/278 (3%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 LN++PPYDW L FL+ R++ +E V ++ Y R++ + EY+G + +H L + + Sbjct: 196 LNYRPPYDWIGFLNFLSIRSLKGIELVKNNCYLRTVQIREYKGWIHVSHVEDKHCLRVKI 255 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGAL------GRLGAARPGLRLPGCVDAFEQG 117 ++ L PV A L ++ FDL P ++ L A PGLR+PG D FE Sbjct: 256 ASSLVPVLAILLERIRNFFDLNARPDKISVQLEQDPFLAEEVAKNPGLRVPGTFDGFELA 315 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPE--YICFPTPQRLAAADPQALKALGM 174 RAILGQ ++V A L +R + +GE F E Y+C P+ QR+++ + + +G+ Sbjct: 316 FRAILGQQITVKAATTLASRFVKAFGEEFKTPFAELHYLC-PSSQRISSLKWEEIATIGI 374 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 RA+ +I LA TL + ++ +K L + GIG+WTA+Y ALR Q D F Sbjct: 375 IRARAQTIIELAKQMSSNTLKLEAGVNLRLTIKQLTSIAGIGQWTAHYIALRALQWPDAF 434 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +D ++++ +T Q + ++ W+PWRSYA L++W Sbjct: 435 PKEDVALRKKLGKVTAKQAEKLSQVWRPWRSYATLYLW 472 >UniRef50_A8MFS4 Transcriptional regulator, AraC family n=2 Tax=Clostridiaceae RepID=A8MFS4_ALKOO Length = 485 Score = 177 bits (450), Expect = 3e-43, Method: Compositional matrix adjust. Identities = 102/286 (35%), Positives = 153/286 (53%), Gaps = 15/286 (5%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGV-----VTAIPDIARHT 58 L+++PPY W ML FLA RA++ +E V ++ Y R++ + G + R+ Sbjct: 199 LSYRPPYHWEDMLRFLAGRAITGIEVVKNNEYMRTVHLENSEGKPVYGWIRVGHQSKRNA 258 Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP-----GLRLPGCVDA 113 L + +S L V + LA++ LFDL C+P V L + RP G R+PGC +A Sbjct: 259 LSVTVSQALLSVLPQVLARIRHLFDLYCDPDAVYETLQVMNDIRPNLCTLGTRVPGCFNA 318 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAAD---PQA 168 FE VRA+LGQ ++V A+ L AR+ Q YG + E + FP+P+ + A + Sbjct: 319 FEMVVRAVLGQQITVKAASTLAARIVQTYGTPIQTGFEGLTHVFPSPEDILALNGPIENH 378 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 L LG+ RA+ + LA A ++G + +P E+ MK L GIG WTA Y A+R Sbjct: 379 LGPLGVIAARAKTIYELAQAFVQGEIDFDLPAQPEEEMKRLMAIRGIGSWTAQYIAMRAM 438 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 + D FL D +K+ P T ++ AE W+PWRSYA +++W T Sbjct: 439 EWPDAFLETDAGVKKALPPYTAKELLEIAEAWRPWRSYATVNLWNT 484 >UniRef50_D2L7I2 Transcriptional regulator, AraC family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L7I2_9DELT Length = 482 Score = 176 bits (446), Expect = 9e-43, Method: Compositional matrix adjust. Identities = 111/279 (39%), Positives = 152/279 (54%), Gaps = 8/279 (2%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTA-----IPDIARH 57 TL ++PPYDW +LGFL R++ VE VAD Y R+LA+ GVV A A++ Sbjct: 199 TLGYRPPYDWDGLLGFLCLRSIGGVEAVADGVYRRTLAISR-NGVVHAGWLAVAHAPAKN 257 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 + + ++AGL PV L ++S LFDL C+P + L L GLRLPG D FE Sbjct: 258 AVRVTVAAGLLPVLPAVLTRVSHLFDLACDPAAIAAGLAGLADGHEGLRLPGAADGFEVA 317 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPEYI-CFPTPQRLAAADPQALKALGMP 175 VRAILGQ V+VA A L R A +GE + F + FP P R+A A+ + G+ Sbjct: 318 VRAILGQQVTVAGARTLARRFAAAFGEPVSTPFADLTTVFPGPARVAGLTVDAIASQGIL 377 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA A+I LA A EG L ++ DV L PGIG WTA+Y A+R D F Sbjct: 378 AARARAIIGLARAMAEGGLVLSPAADVAATRAALLALPGIGAWTADYIAMRALAWPDAFP 437 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 D+ +K+ P ++ A W+PWR+YA++H+W + Sbjct: 438 HTDFGVKKALGETDPKRVLERAAGWRPWRAYAVMHLWRS 476 >UniRef50_Q2SDC7 Adenosine deaminase n=3 Tax=Bacteria RepID=Q2SDC7_HAHCH Length = 484 Score = 170 bits (430), Expect = 5e-41, Method: Compositional matrix adjust. Identities = 99/281 (35%), Positives = 148/281 (52%), Gaps = 10/281 (3%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAV--GEYRGV----VTAIPDIA 55 + + ++PPY W +L FL+AR ++ V+ V D Y R + V GE V + D A Sbjct: 200 FEMPYRPPYAWDALLSFLSARTIAGVDAVVDGRYHRIVRVEAGEDSAVGWFEASHEADAA 259 Query: 56 RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFE 115 R + I L + L + ++ FD+ C PQ +N LG L PG+R+P +D FE Sbjct: 260 R--IRIRLDSTLSRHIGYLINRLRAFFDVSCVPQEINKVLGTLAQNEPGMRIPSGMDGFE 317 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPEYI-CFPTPQRLAAADPQALKALG 173 VRAILGQ ++VA A L AR+ +G+ ++ FPE FP+ L + L +LG Sbjct: 318 IAVRAILGQQITVAAARTLLARLVDKFGDPIETPFPEINRTFPSAATLVNLPVEELASLG 377 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + R A+ +A A L L ++ +VEQ ++ L PGIG WTA Y A+R D Sbjct: 378 VIRTRVRAIQEIAAAMLRSELTLSPAANVEQEIQRLHAIPGIGDWTAQYIAMRAMSWPDA 437 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 F D +++ G+ Q R AE W+PWR YA++H+W + Sbjct: 438 FPASDVGVRKALGGVDAKQSARAAEEWRPWRGYAVMHLWRS 478 >UniRef50_Q46QC3 Transcriptional regulator Ada n=25 Tax=cellular organisms RepID=Q46QC3_RALEJ Length = 509 Score = 164 bits (415), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 106/285 (37%), Positives = 146/285 (51%), Gaps = 12/285 (4%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIAR 56 ++ L ++PP W +LGFLA RAV +E V D YAR+L+V +RG V R Sbjct: 195 VFELGYRPPLAWEALLGFLAVRAVDGIEQVRDGAYARTLSVESGGTTHRGWVRLDHVPGR 254 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L + LSA L V + L K+ RL DL C P IV+ LG L + PG+RLPG VD FE Sbjct: 255 LVLRVTLSASLARVIPQALGKVRRLCDLGCRPDIVDRHLGELASDVPGMRLPGSVDGFEI 314 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERL--DDFPEYIC------FPTPQRLAAADPQA 168 VRA++GQ++SV A ++ AR+ Q G+ L P C FP+ LAA Sbjct: 315 AVRAVIGQVISVVQARRILARLGQTAGDALPAPAMPIDGCAPLQHGFPSAAALAALPDAD 374 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + A G+ + L LA G LP+ EQ + L GIG WTA Y A+R Sbjct: 375 MVAAGVSPGKLRTLRALAQRVASGALPLEQHMPPEQTVAALCEIDGIGDWTAQYVAMRAL 434 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 D F DY +++ T ++ +W PWR+YA +H+W+ Sbjct: 435 GWPDAFPGTDYALRKVLGVNTVRAMQARTAQWAPWRAYAAIHLWH 479 >UniRef50_A6SU78 Methylated-DNA-[protein]-cysteine S-methyltransferase n=2 Tax=Oxalobacteraceae RepID=A6SU78_JANMA Length = 499 Score = 163 bits (413), Expect = 5e-39, Method: Compositional matrix adjust. Identities = 108/294 (36%), Positives = 159/294 (54%), Gaps = 27/294 (9%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRG--VVTAIPDIAR 56 L ++PPY W ML +LA RA+ VE V + Y RS+ + G VT +P AR Sbjct: 209 LAYRPPYAWEPMLAYLAGRAIPGVEGVVEDAPGTLSYVRSVMLNNTAGWLRVTHLP--AR 266 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGC 110 L ++L A L PV LA++ + FDL NP+I+ + L + PGLR+PG Sbjct: 267 RQLELSLPATLAPVLMPLLARVRKQFDLDANPEIIAAHLSADALLAQQIRLTPGLRVPGT 326 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC----FPTPQRLAAADP 166 D FE +RA+LGQ VSVA A ++ R+ + +GE D +I FPT +RLAAAD Sbjct: 327 FDTFELAIRAVLGQQVSVAGATTVSGRLVKAFGEPADT--PFIGINRHFPTAERLAAADI 384 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 + ALGMP RA+ + ++A A++G L M +++ + +L+T GIG WTA Y A+R Sbjct: 385 GEIAALGMPGSRAQTIQNVARFAVQGGLQMKPGASLDECVSSLKTVRGIGEWTAQYVAMR 444 Query: 227 GWQAKDVFLPDDYLIKQ---RFPG---MTPAQIRRYAERWKPWRSYALLHIWYT 274 + D F D +++ G +T Q+ A W PWR+Y L +W++ Sbjct: 445 ALRFPDAFPTGDLGLQKAAVEVAGGTRLTEKQLLLRAAGWSPWRAYTALLLWHS 498 >UniRef50_C8QGF9 Transcriptional regulator, AraC family n=1 Tax=Pantoea sp. At-9b RepID=C8QGF9_9ENTR Length = 495 Score = 163 bits (412), Expect = 6e-39, Method: Compositional matrix adjust. Identities = 101/279 (36%), Positives = 149/279 (53%), Gaps = 8/279 (2%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L+++PPYDW +L FL R + VE VA+ Y R++A+G +G V + L + Sbjct: 202 LSYRPPYDWEAILDFLQQRVMKEVEWVAEGIYHRTVALGGCQGWVRVSHYPEKQALKVQF 261 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR--LGAAR----PGLRLPGCVDAFEQG 117 + L PV L ++ LFDL PQ + L + L A PGLR+PG D FE G Sbjct: 262 TTSLTPVLPALLRRLRDLFDLDAQPQRIADQLAQDPLLAPSLVRYPGLRVPGAFDGFELG 321 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERL-DDFPEYICF-PTPQRLAAADPQALKALGMP 175 VRAILGQ V+V A L++RVAQ +G + +PE P+ + LA A + +LG+ Sbjct: 322 VRAILGQQVTVKAATTLSSRVAQRFGAPMATPWPELSRLSPSAETLATATQDDIASLGIV 381 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R++A++ LA A G L + + + L GIG WTA+Y A+R + D F Sbjct: 382 SARSQAILALAQACASGALRFNGAVNPDVVQQQLLALKGIGPWTASYIAMRALRWPDAFP 441 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 +D I+ G++ ++ W+PWRSYA+LHIW + Sbjct: 442 KEDIAIRNNLGGVSAKDAEVRSQVWRPWRSYAVLHIWKS 480 >UniRef50_C0WE04 Transcriptional regulator n=1 Tax=Acidaminococcus sp. D21 RepID=C0WE04_9FIRM Length = 483 Score = 157 bits (397), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 98/285 (34%), Positives = 155/285 (54%), Gaps = 15/285 (5%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAV-GE----YRGVVTAIPDIARH 57 TL ++PPY S + FL RA+ +ETV++ Y R++ + GE Y G+++ P+ + Sbjct: 198 TLTYRPPYLASPLFDFLKGRAMKGIETVSEGIYKRTVTLAGEKGARYHGIISVSPNKKCN 257 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGAL-----GRLGAARPGLRLPGCVD 112 L + LS L PV ++ + ++SR FDL P+ + L G G G+R+PG D Sbjct: 258 ALTLTLSDSLLPVLSDVIFRVSRQFDLAAFPETIAAVLYAMNDGVPGTFAEGIRIPGAFD 317 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAA---ADPQ 167 FE VRAILGQ ++V A+ L AR + G ++ + FPTP+++ + + Sbjct: 318 GFETAVRAILGQQITVKAASTLAARFVAVLGTPIETGHPGLTHLFPTPEKILSYGESLSD 377 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 L LG+ ++ ++ LA A ++G+L + E+ K L GIGRWT++Y A+R Sbjct: 378 ELGKLGIISSKSASIRALAQALMDGSLRLDGTRSREETKKALLALKGIGRWTSDYIAMRV 437 Query: 228 WQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 + D+FL D IK PG TP + AE W+P+RSYA + +W Sbjct: 438 LKDPDIFLETDAGIKHALPGTTPKERLTLAEAWRPFRSYATVSLW 482 >UniRef50_Q2RNZ4 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=2 Tax=Bacteria RepID=Q2RNZ4_RHORT Length = 486 Score = 156 bits (394), Expect = 8e-37, Method: Compositional matrix adjust. Identities = 113/278 (40%), Positives = 146/278 (52%), Gaps = 20/278 (7%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE 68 P+DW +L F RAV +E V Y R++A+G RGVVT + L + S Sbjct: 212 PFDWPGLLAFFRQRAVPGLERVEGDTYVRAIAIGAARGVVTI--RGSAEGLVVTPSLDRP 269 Query: 69 PVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQGVRAIL 122 A +A++ R+FDL + + LG L AARPGLR+PG D FE VRAIL Sbjct: 270 EGLAALVARLRRVFDLDADIGAIGAHLGADPLLAPLVAARPGLRVPGAWDGFELAVRAIL 329 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFP---EYICFPTPQRLAAADPQALKALGMPLKRA 179 GQ VSVA A L R+ +GE L + P FPT RLA AD L LG+ RA Sbjct: 330 GQQVSVAAATTLAGRLVGAFGEPLTNAPPAGPSRLFPTAARLAEAD---LGGLGLTTARA 386 Query: 180 EALIHLANAALEGTLPMTIPG-DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 +A+ LA A +E T + PG D++ A+ L PGIG WTA Y ALR D D Sbjct: 387 KAISGLARAVVE-TPGLLDPGPDLDSAVARLCRLPGIGPWTAQYIALRALGEADALPVGD 445 Query: 239 YLIKQRFP--GM--TPAQIRRYAERWKPWRSYALLHIW 272 + + G+ TPA + AE W+PWRSYA+LH+W Sbjct: 446 IGVLRALAEDGVRPTPAALLARAEDWRPWRSYAVLHLW 483 >UniRef50_A7HP34 Ada metal-binding domain protein n=3 Tax=Bacteria RepID=A7HP34_PARL1 Length = 513 Score = 155 bits (393), Expect = 1e-36, Method: Compositional matrix adjust. Identities = 101/289 (34%), Positives = 143/289 (49%), Gaps = 20/289 (6%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH--- 60 L ++PP+D+ +L +L++RA+ VE +++ YARS +G +G+VT P L Sbjct: 228 LGYRPPFDFDRILAYLSSRALPGVERISEGRYARSFHLGGVKGLVTVTPAATGSALDARI 287 Query: 61 --INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVD 112 ++ G PV A A++ RLFDL P + + A+G A PGLR+ G D Sbjct: 288 AVLDAKGGTVPVRA-IAARLRRLFDLDAEPGAIAAAFAGDPAIGPRFARVPGLRVAGAFD 346 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALK 170 FE VRA+LGQ +SV A + R+ GE + I FP P+ LA AD L Sbjct: 347 GFELAVRAVLGQQISVKGATTIAGRIVARLGEEVTTEEPGITHFFPAPRALARAD---LS 403 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 LG+ R L LA A G L T ++ + L PGIG WTA+Y ALR Sbjct: 404 GLGLTGGRIATLTSLAQAVASGALDFTPRESLDAKLAELTALPGIGEWTAHYVALRALGE 463 Query: 231 KDVFLPDDYLIKQRFPGMTPA---QIRRYAERWKPWRSYALLHIWYTEG 276 D F D +++ P ++ R AE W+PWR YA L +W +G Sbjct: 464 PDAFPASDLGLRKAVGKGEPVSTKELERMAESWRPWRGYAALALWTIDG 512 >UniRef50_Q1IT49 DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=3 Tax=Bacteria RepID=Q1IT49_ACIBL Length = 477 Score = 153 bits (386), Expect = 6e-36, Method: Compositional matrix adjust. Identities = 98/281 (34%), Positives = 146/281 (51%), Gaps = 16/281 (5%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 ++ L ++PPY W ML FL RA VE V + YARS+++ G +H+L Sbjct: 198 VFRLRYRPPYHWLGMLDFLRPRATPGVECVTEDAYARSISLHGKEGSFEVTHAPEQHSLV 257 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGAL-------GRLGAARPGLRLPGCVDA 113 + ++ + + ++ +FDL + + G L G L PG RLPG D Sbjct: 258 LRVNFEDSSALFQIVERVRAMFDLNADWGSIAGVLENDRLLRGHL-KGDPGRRLPGAWDG 316 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGE--RLDDFPEYICFPTPQRLAAADPQALKA 171 FE VRA+LGQ +SVA A L ++A+ +G R + ++ FPTP+ LA A + Sbjct: 317 FELAVRAVLGQQISVAAATNLAGQIARKFGRPLRKSNGISHL-FPTPEILADA-----AS 370 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 L +P+KRAE + LA A + L DV Q + L+T PGIG WTA Y ALR + Sbjct: 371 LPLPMKRAETIRALACAVRDCELQFDAITDVPQFCEQLKTIPGIGDWTAQYVALRALREP 430 Query: 232 DVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D F D +++ + A++ R AE W+PWR YA +++W Sbjct: 431 DAFPAGDLGLQKSLGVKSSAELERRAENWRPWRGYAAIYMW 471 >UniRef50_Q0AGQ6 DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AGQ6_NITEC Length = 477 Score = 152 bits (383), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 92/279 (32%), Positives = 144/279 (51%), Gaps = 11/279 (3%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L+++PP W+ ++ FL +R+ + + + Y +++ + +G VTA D RH +++ Sbjct: 196 LSYRPPLAWNALIRFLCSRSNLRLSQIQNGNYLQTVNLDGCQGWVTAKHDTKRHQIYVQA 255 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQG 117 S L P + RLFDL NP I+ LG L A PGLR+PG +D FE G Sbjct: 256 SRSLLPCLIRLQMYLRRLFDLDANPAIIEAHLGNDDILKPLIANHPGLRIPGTLDIFELG 315 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPEYI-CFPTPQRLAAADPQALKALGMP 175 +RAILGQ ++V A L R +G+ +D FP P + +A QAL +G+ Sbjct: 316 LRAILGQQITVKAATTLFGRFVATFGKPVDTPFPGLDRTSPPAELIADTSLQALIDIGLT 375 Query: 176 LKRAEALIHLANAALEGTL-PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 +RA + A + G L P +I D + ++ L PGIG WTA Y A+R + F Sbjct: 376 GRRALTIQRFAQTIVNGALKPESI--DRNKIIEQLLELPGIGPWTAQYIAIRALGDSNAF 433 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 D + + PA++ R E+W+PWR+Y +H+W+ Sbjct: 434 PASDLGLLRGLRMEKPAELLRRTEKWQPWRAYGAIHLWH 472 >UniRef50_Q02KH7 DNA-3-methyladenine glycosidase II n=8 Tax=Pseudomonas RepID=Q02KH7_PSEAB Length = 297 Score = 149 bits (377), Expect = 7e-35, Method: Compositional matrix adjust. Identities = 104/283 (36%), Positives = 143/283 (50%), Gaps = 16/283 (5%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L +Q P++W A R ++ VE++ D +YARS P R L ++L Sbjct: 12 LPYQSPWEWRQFHQHFALRLLAGVESLGDDHYARSFRANGRPAWFEVRPLAERQVLALSL 71 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQG 117 S +AAE A++ R+FDL +P + + LG L AA PGLRLP D FEQ Sbjct: 72 SPSAHALAAELEARVRRMFDLDSDPAAIARHFAGDPLLGPLVAANPGLRLPVAFDPFEQA 131 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE---YICFPTPQRLAAADPQALKALGM 174 VRAI+GQ V+V A +T R+ Q GE L++ FPTP LA A+ L +GM Sbjct: 132 VRAIVGQQVTVKAAVTITGRLIQRLGEPLENLGYDGISHLFPTPAALAQAN---LDGIGM 188 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 P KR + L A A G L + + E ++ L PGIG WTA Y ALR D F Sbjct: 189 PGKRVQTLQRFAAAIASGELSLDLADGPEALVERLCALPGIGPWTAEYIALRAMGEADAF 248 Query: 235 LPDDY-LIKQRF---PGMTPAQIRRYAERWKPWRSYALLHIWY 273 D L+K G+ ++ AE W+PWR+YA +H+W+ Sbjct: 249 PAADLGLLKSTVWGPQGIDARSLKARAEAWRPWRAYAAIHLWH 291 >UniRef50_B9DJS2 Putative uncharacterized protein n=1 Tax=Staphylococcus carnosus subsp. carnosus TM300 RepID=B9DJS2_STACT Length = 341 Score = 148 bits (373), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 98/308 (31%), Positives = 149/308 (48%), Gaps = 39/308 (12%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE------YRGVVTAIPDIA 55 + L +Q PY W+ M+ +L+ RA+ VE V D+YYAR++ + + +G + + Sbjct: 36 FNLYYQTPYIWTAMIDYLSKRAIPRVEIVQDNYYARTVLLKDTATKRAVKGWLKVKNNTK 95 Query: 56 RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFE 115 + L + +SA L + + K+ FDL+ NP+I+N L + GLR+PG + FE Sbjct: 96 NNALLVEMSASLIHEWNKIIQKLRHFFDLEVNPEIINKTLNEDWITK-GLRVPGAFNGFE 154 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC-----FPTPQR---LAAAD-- 165 GVRAILGQ ++V A ++ R+ G F I FP P++ LA D Sbjct: 155 LGVRAILGQQITVKAATTISGRLVHALGT---PFKTKIAGLDTLFPIPEKFVYLAHCDTP 211 Query: 166 -PQALKALGMPLKRAEALIHLANAALEGTLPM---------TIPGD--------VEQAMK 207 L LG+ ++R+ + LA A + G + + +IP + E M Sbjct: 212 ISDLLGPLGVTVRRSNTIAALAEAIVNGEVQLNPVVHGVESSIPSNRYNTQMETAESEMN 271 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-MTPAQIRRYAERWKPWRSY 266 L GIG+WTA Y +R D FL D IK P TP AE+W P RSY Sbjct: 272 RLLAIKGIGKWTAQYIGMRALGYTDSFLETDIGIKNAMPNDTTPKSRLAVAEKWHPLRSY 331 Query: 267 ALLHIWYT 274 A++++W T Sbjct: 332 AVVNLWNT 339 >UniRef50_Q2T2N2 DNA-3-methyladenine glycosylase II n=65 Tax=Burkholderia RepID=Q2T2N2_BURTA Length = 343 Score = 148 bits (373), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 110/295 (37%), Positives = 151/295 (51%), Gaps = 33/295 (11%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 ++ L ++PPYDW +L F A RA+ VE V Y R++ +YRG V A+ + +H Sbjct: 46 VFELPFKPPYDWPRVLRFFAGRAIPGVEAVEGGAYRRTV---DYRGAVGAL-TVRKHPRK 101 Query: 61 INLSAGLE-----PVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPG 109 L A +E A A+++ +FDL +P + L R L A PGLR+PG Sbjct: 102 RCLVATVEGDAARHADAAFAARLATMFDLHADPAAIGAHLARDAWLAPLVDAAPGLRVPG 161 Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC---FPTPQRLAAADP 166 +FE VRAI+GQ VSV A + R+ + GERL FP P LAA D Sbjct: 162 AWSSFELIVRAIVGQQVSVKAATTIVGRLVERAGERLVGHAPGATGWRFPEPAALAACD- 220 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPM----TIPGDVEQAMKTLQTFPGIGRWTANY 222 L +GMP KRA AL +A A G +P+ T P V A+ L PGIG WT Y Sbjct: 221 --LSRIGMPGKRAAALQGVARAVAAGDVPLDAYATDPAGVRAALLAL---PGIGPWTVEY 275 Query: 223 FALRGWQAKDVFLPDDYLIKQ----RFPGMT-PAQIRRYAERWKPWRSYALLHIW 272 A+R W+ D + D ++ Q R P + PA R A+ W+PWR+YA +H+W Sbjct: 276 VAMRAWRDADAWPATDLVLMQAIVARDPALDRPASQRLRADAWRPWRAYAAMHLW 330 >UniRef50_A5KSU6 Transcriptional regulator, AraC family n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KSU6_9BACT Length = 464 Score = 144 bits (363), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 97/288 (33%), Positives = 141/288 (48%), Gaps = 29/288 (10%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 M +++PP+DW +LGF+ RA S E D+ Y R + E VV +P A++ L Sbjct: 194 MLRTDYRPPFDWDLLLGFIKKRATPS-EWATDTTYHRLIGSDEI--VVRNVP--AKNYLT 248 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAF 114 I + L A L K+ RLFDL NP ++ L A PG+R+PGC D F Sbjct: 249 IEVPQKLSRHAHAILMKVRRLFDLDANPSVITTVLTNDPYLKPFLADNPGVRVPGCWDNF 308 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 E +RA++GQ VSV+ A + R+ + G TP LAA+ + ++GM Sbjct: 309 EMLIRAVVGQQVSVSAATTVMRRLVERIGS------------TPDTLAASSADEIASIGM 356 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 PLKRA + LA+ + + D ++ Q GIG WT Y LR D Sbjct: 357 PLKRATTIHTLAHKVKNSDIDLN-ECDPQRFADQFQHISGIGPWTIAYLQLRILHWPDA- 414 Query: 235 LPDDYLIKQR----FPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 LP + + QR + +T A++ +YAE W+PWRSYA+ +W Q Sbjct: 415 LPAEDIGLQRALIPYKRITKAELSKYAEAWRPWRSYAVFLLWNASSNQ 462 >UniRef50_B0SWZ0 Transcriptional regulator, AraC family n=7 Tax=Bacteria RepID=B0SWZ0_CAUSK Length = 505 Score = 143 bits (361), Expect = 5e-33, Method: Compositional matrix adjust. Identities = 98/295 (33%), Positives = 144/295 (48%), Gaps = 22/295 (7%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHIN 62 TL ++PPYDW ML FLA RA+ VE + + Y R +A+ G + P I L + Sbjct: 207 TLRYRPPYDWDAMLAFLALRAIPGVEVIESNTYRRVIALDGAAGTIAVSP-IDGDRLSVA 265 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQ 116 + LA++ +FDL +P + L R + RPGLR+PG D FE Sbjct: 266 VRFPKLSALPRILARVRGVFDLSADPVGIAAVLSRDPDLARMVGLRPGLRVPGAWDGFEL 325 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERL----DDFPEYICFPTPQRLAAADPQALKAL 172 VRAILGQ ++V A KL + +GE L + FP+ +RLAA + L + Sbjct: 326 AVRAILGQQITVVQARKLAGDLVAAHGEPLAQPWTEPGLTHAFPSAERLAATN---LSGM 382 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R L +A A + ++ +++ ++ L+ PGIG WTA Y A+R + D Sbjct: 383 KMPGARIRCLSAMAQAIADAPNLLSPTAGLDEMVRRLRALPGIGEWTAQYIAMRQLREPD 442 Query: 233 VFLPDDYLIKQRFPGM-----TPAQIRRYAERWKPWRSYALLHIWYT---EGWQP 279 F D + + + T Q+ AE W+PWR+YA LH+W + EG P Sbjct: 443 AFPAADVALMRALADVDGVRPTAEQLLTRAEAWRPWRAYAALHLWASLADEGAPP 497 >UniRef50_B1ZFN9 AlkA domain protein n=6 Tax=Methylobacterium RepID=B1ZFN9_METPB Length = 376 Score = 143 bits (361), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 102/292 (34%), Positives = 141/292 (48%), Gaps = 23/292 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 + L +PP+DW + F A A VETV YAR+ + G ++ + ++ I Sbjct: 16 FRLALRPPFDWGHLERFFADHASPGVETVTPGRYARTFLLAGRPGTLSVTCERGSLSVRI 75 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFE 115 EP A L ++ +FDL +P + LGR L A RPGLR+PG D FE Sbjct: 76 RGPEADEPFEA-ILTRLRAMFDLGADPDAIAAGLGRDPTMAALVARRPGLRMPGAFDGFE 134 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERL-----DDFPEYI-CFPTPQRLAAADPQAL 169 VRAILGQ VSVA A +L R+ +G L D P FPTP++L A+ + Sbjct: 135 LAVRAILGQQVSVAAATRLAGRLVAAFGTPLGPKVGGDEPGLTHLFPTPEQLLEAEISLV 194 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 L MP R A+ LA A L GD++ + L+ PGIG WTA+Y A+R Sbjct: 195 --LNMPRARGRAIQGLAAAVLATPDLFAPGGDLDATVARLKALPGIGDWTAHYIAMRALA 252 Query: 230 AKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D F D + + PG A + R A W+PWR+YA +H+W + Sbjct: 253 QADAFPAGDVGLMRALDDGAGRPGRV-ALLDRAAA-WRPWRAYAAIHLWAED 302 >UniRef50_Q12D18 Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II n=3 Tax=Proteobacteria RepID=Q12D18_POLSJ Length = 504 Score = 142 bits (357), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 97/283 (34%), Positives = 143/283 (50%), Gaps = 14/283 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVA-DSYY---ARSLAVGE----YRGVVTAIPDIA 55 L ++PPYD + MLGF + R +S++E VA D+ + R+ V + G + A D Sbjct: 214 LGYRPPYDVAAMLGFFSKRTISAIEFVAADAQHPSIGRTFRVESGGKVHAGWLLAAFDET 273 Query: 56 RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFE 115 R L +N+S L V + ++ FDL +P +N L GLR+PG +D +E Sbjct: 274 RSRLVLNVSDSLREVLPLVIRRVRATFDLDADPAAINSVLHAGFPQGDGLRVPGALDGYE 333 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD-FPEYI-CFPTPQRLAAADPQALKALG 173 VRA+LGQ ++VA A L R+ +GE + +P+ FP P LAAA AL LG Sbjct: 334 LAVRAVLGQQITVAAARTLAQRMVDRFGEPVQTPWPQLTRLFPAPAMLAAASGDALGQLG 393 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + +R A++ +A A + L + DV ++ L+ PGIG WTA Y A+R + D Sbjct: 394 IVRQRQAAIVGIAQAVADKRLQLHSGADVHATLEALKALPGIGDWTAQYIAMRALRWPDA 453 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAE----RWKPWRSYALLHIW 272 F D + + R AE WKPWRSYA++ W Sbjct: 454 FPAGDVALHKAMGVQGLKNPAREAELASHAWKPWRSYAVIRAW 496 >UniRef50_C7R5W7 Transcriptional regulator, AraC family n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R5W7_KANKD Length = 461 Score = 139 bits (350), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 86/275 (31%), Positives = 134/275 (48%), Gaps = 22/275 (8%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L+++PPYDWS M FL R +S++ETV D+ Y R+ ++ +G +A D +R + ++ + Sbjct: 204 LHYRPPYDWSLMQDFLKQRELSAIETVTDNCYGRTFSIDSSKGHFSAEIDPSRSSFNVTI 263 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP----GLRLPGCVDAFEQGVR 119 + R+ DL + +++ +L + +P GLRLP D FE GV+ Sbjct: 264 EMDDMSKLLTATHHIRRVLDLNSDLEVIENSLAQDVNIKPVLKSGLRLPATWDTFEAGVK 323 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 AILGQ VSV A TA V + G + +D +Y FPT +++ D L L MP R Sbjct: 324 AILGQQVSVKAAYTHTASVIEQLGSKYND--QYKLFPTAKQIVNGD---LTFLKMPNSRK 378 Query: 180 EALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + L A L + + + ++ GIG WT Y LR D F D Sbjct: 379 QTLHDFAQWYLSTS---------GEDLASILDIKGIGPWTYEYIKLRSGMDSDAFPEKDL 429 Query: 240 LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 + M + +E+W+PWRSYA L +W++ Sbjct: 430 GV---IKAMEQYNLTN-SEQWQPWRSYATLQLWHS 460 >UniRef50_Q2IPL2 Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II n=12 Tax=Proteobacteria RepID=Q2IPL2_ANADE Length = 514 Score = 137 bits (345), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 104/280 (37%), Positives = 137/280 (48%), Gaps = 17/280 (6%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARH----TLHINLS 64 PYDW +L FLAARA+ VE VAD Y R++A+ G V PD TL + Sbjct: 220 PYDWPALLEFLAARAIPGVEQVADGAYRRTVALDGAAGTVEVRPDPRGRGLLATLRLPRV 279 Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIVNGA--LGRLGAARPGLRLPGCVDAFEQGVRAIL 122 A + P + D ++G L L AARPGLR+PG + FE VRA+L Sbjct: 280 AAIAPAVERLRRLLDLDADAAAIGAHLSGDPLLAPLLAARPGLRVPGAWEPFELVVRAVL 339 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALGMPLKRAE 180 GQ VSVA A L R+A G +D + FP P+ LA AD L+ LG+ RA Sbjct: 340 GQQVSVAAARTLAGRLAARLGAPVDSGDPALSRLFPGPEALAGAD---LEGLGLTRARAA 396 Query: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD-- 238 L + A + + G++E A+ L PGIGRWTA Y A+R D F D Sbjct: 397 TLAAIGGAVRDDPSLLAPGGELEDAVARLDALPGIGRWTAQYVAMRALHQPDAFPEGDLG 456 Query: 239 ----YLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 + P ++ R AERW+PWR+YA LH+W + Sbjct: 457 LLAALGGLRGRGRAAPGELLRRAERWRPWRAYAALHLWMS 496 >UniRef50_B4S0Y6 Ada regulatory protein n=3 Tax=Alteromonas macleodii RepID=B4S0Y6_ALTMD Length = 475 Score = 137 bits (344), Expect = 6e-31, Method: Compositional matrix adjust. Identities = 96/291 (32%), Positives = 142/291 (48%), Gaps = 34/291 (11%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L+++PPY+W ++ FLAARA+S +E V+D+ Y R + GE G A+ + ARH +++ Sbjct: 204 LSYRPPYNWPYVREFLAARAISGMEVVSDNSYGRYFSCGESIGYFNAVHNEARHGFELHI 263 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG----AARPGLRLPGCVDAFEQGVR 119 + + + L DL +P ++ +L + G A GLRLP FE G R Sbjct: 264 DMPDLRNLHKTIENIKLLLDLHADPLLIEESLKQAGLPDNALTAGLRLPSAWSVFESGCR 323 Query: 120 AILGQLVSVAMA-AKLTARVAQL--YGERLDDFPE----YICFPTPQRLAAADPQALKAL 172 AI+GQ VSV A ++T V QL G D + Y CFPTP+ +A + L L Sbjct: 324 AIVGQQVSVKAAIGQVTLLVHQLGKKGAVSDKYNTNSTAYYCFPTPEAVAGNN---LAFL 380 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R EA+ A L +P K + G+G WT +Y +RG + D Sbjct: 381 RMPQARKEAVRQFACLFLNDKVP---------NHKEILAIKGVGPWTLDYLKMRGERNPD 431 Query: 233 VFLPDDYLIK---QRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 V+L D +++ Q +P + PAQ PWRSY L +W Q + Sbjct: 432 VYLEGDLIVRKMAQLYP-VEPAQA-------APWRSYLTLQLWQLSNQQKE 474 >UniRef50_A7HG85 AlkA domain protein n=2 Tax=Myxococcales RepID=A7HG85_ANADF Length = 485 Score = 134 bits (338), Expect = 3e-30, Method: Compositional matrix adjust. Identities = 100/290 (34%), Positives = 138/290 (47%), Gaps = 27/290 (9%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTL- 59 + L+++PP DW +L FLAAR + VE V Y R++ +G G V+ D AR T Sbjct: 196 VLRLDFRPPLDWEALLAFLAARCTAGVEQVEGGAYRRTVRLGGRTGWVSVTRDPARPTAL 255 Query: 60 ----HINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR---LGAA---RPGLRLPG 109 ++L+ L P+AA A++ DL P V L R L A PGLR+PG Sbjct: 256 RAEASLSLAGALMPLAARLRAQL----DLDARPDAVASRLRRDPLLARALRRHPGLRVPG 311 Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADP--- 166 D + VR I+GQ VSVA A ++ R+A GE P FP RLA + Sbjct: 312 AFDGLDAAVRVIVGQQVSVAAATTVSGRLAAALGE-----PVATPFPGLDRLAPSAEAIA 366 Query: 167 ----QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANY 222 A+ +GMP RA ++ LA A G L + GD E L G+G WTA Sbjct: 367 AAGVDAIARVGMPGARARTILELARAVAGGGLALHRGGDGEAVRAGLLELSGVGPWTAEV 426 Query: 223 FALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 A+R D F D + + + + AE W+PWRSYA++H+W Sbjct: 427 VAMRALGEPDAFPASDLGVLRALGASSALEAEARAEAWRPWRSYAVMHLW 476 >UniRef50_A6EY17 Transcriptional Regulator, AraC family protein n=1 Tax=Marinobacter algicola DG893 RepID=A6EY17_9ALTE Length = 504 Score = 130 bits (327), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 98/291 (33%), Positives = 132/291 (45%), Gaps = 21/291 (7%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L +PP+D +L F ARA+ +E V +YARSL + G+V P + + L Sbjct: 213 LRARPPFDSEQLLAFFRARAIPGLEAVGAHHYARSLCIAGQPGLVICRPSDHPPGVQVIL 272 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQG 117 E A++ RL DL + ++ L R L PGLR+PG + FE Sbjct: 273 RGPARQSILEVSARIRRLLDLDADLPGISEHLARDPLMEPLVTQHPGLRVPGSWERFEFS 332 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERL-DDFPEYIC----FPTPQRLAAADPQALKAL 172 VRAILGQ VS++ A L R+ YG+ L DD FP P L Q L L Sbjct: 333 VRAILGQQVSISAARTLAGRLVARYGQPLPDDLARGTGITHRFPEPAALVG---QPLNTL 389 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 GMP RA+ L + E P D + L + GIG WT Y ALRG D Sbjct: 390 GMPGSRADTLARITARFAE---PGFAEQDGNDLLAQLASMRGIGPWTLQYLALRGLGDPD 446 Query: 233 VFLPDDYLIKQRFPGMTPAQ----IRRYAERWKPWRSYALLHIWYTEGWQP 279 F D I + + Q + R+AERW+PWR+YA ++W + P Sbjct: 447 AFPASDLGILKAASHLGGPQDAKALTRHAERWRPWRAYAAQYLWTSLNAHP 497 >UniRef50_B0KRT0 AlkA domain protein n=1 Tax=Pseudomonas putida GB-1 RepID=B0KRT0_PSEPG Length = 325 Score = 129 bits (325), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 103/283 (36%), Positives = 141/283 (49%), Gaps = 14/283 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++ PY W LGFLAAR + +ET D Y+R+L V + V+ A P +H L + L Sbjct: 40 LRYRAPYHWPSTLGFLAARCIPGIETCHDGTYSRTLIVAGHHAVLHATPMTNQH-LRVRL 98 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALG------RLGAARPGLRLPGCVDAFEQG 117 +A++ R+FDL +P ++ L L ARPGLR+P DA EQ Sbjct: 99 EGAPSNALPGLIARLRRVFDLDADPARISAELSCDPLMASLLKARPGLRVPQGWDACEQA 158 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGE--RLDDFPEYICFPTPQRLAAADPQALKALGMP 175 +R +LGQ +SVA A L R+ Q +G RL FP LA A + +GMP Sbjct: 159 MRTVLGQQISVAGAMTLAGRLVQRHGAPLRLSAPGLSHVFPALPTLANAQ---FENMGMP 215 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF- 234 RA L LANA L + +++ ++ L GIG W+A+Y ALR A D Sbjct: 216 SARATTLATLANALLADPGLLRRGQVLDELLRNLCRLKGIGPWSAHYLALRQAGAADALP 275 Query: 235 LPDDYLIKQ-RFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 L D LIK R AQ+ A W+PWR+YA H+W + G Sbjct: 276 LGDVALIKALRLLEGDEAQLAERALDWRPWRAYAAQHLWASLG 318 >UniRef50_B0RQX4 DNA methylation and regulatory protein (Methylated-DNA--[protein]-cysteine S-methyltransferase) n=40 Tax=cellular organisms RepID=B0RQX4_XANCB Length = 521 Score = 129 bits (325), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 94/281 (33%), Positives = 139/281 (49%), Gaps = 14/281 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PP D ML FL RA+ +E V Y R + ++ R L + + Sbjct: 234 LGYRPPLDLPAMLTFLQRRAIPGIEQVDADGYRRVIGAPGQATLIHVSAAPTRDELLLRI 293 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR---LGAA---RPGLRLPGCVDAFEQG 117 A + + ++ R+FDL + V+ L + L A RPGLR+PG D FE Sbjct: 294 GATDPRQIPQIVRRVRRIFDLDADLHAVHATLAQDPLLEQAITRRPGLRVPGGWDGFEVA 353 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQALKALGMP 175 VRA+LGQ +SVA AA L AR+ +G L D P + FPTP ++A A L+ LG+P Sbjct: 354 VRAVLGQQISVAGAATLAARLVDRHGGHLPDMPPGLDRSFPTPAQMADA---PLEQLGLP 410 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA L LA+A +G L + + PGIG WTA+Y A+R D F Sbjct: 411 RARAATLRALASACAQGRLHFGAGQRLPDFVAACTALPGIGPWTAHYIAMRALSHPDAFP 470 Query: 236 PDDYLIKQRFPG---MTPAQIRRYAERWKPWRSYALLHIWY 273 D +++Q ++ ++ W+PWR+YA+LH+W+ Sbjct: 471 AGDLILQQVLGAPERLSERATEARSQAWRPWRAYAVLHLWH 511 >UniRef50_A1TR03 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada n=9 Tax=Comamonadaceae RepID=A1TR03_ACIAC Length = 534 Score = 129 bits (325), Expect = 9e-29, Method: Compositional matrix adjust. Identities = 97/295 (32%), Positives = 141/295 (47%), Gaps = 22/295 (7%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVA----DSYYARSLAVGEYR---GVVTAIPDIA- 55 L ++PP D + +LGF R + +ETV + L E R G + A D Sbjct: 228 LAYRPPLDIAALLGFFGQRRIHGMETVDVPGLELRRTARLQDAEGRECTGWLAARFDGGA 287 Query: 56 --------RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRL 107 + + + +S+ L P +A++ L DL +P+ +N L GLR+ Sbjct: 288 AAARGGPPKPHVVLRVSSSLLPALPGVIARVRGLLDLDADPEAINAVLHGDFPRGDGLRV 347 Query: 108 PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAAD 165 PG D FE VRA+LGQ V+VA A L RV + +G+ + +C FPTP LAA D Sbjct: 348 PGAWDGFELAVRAVLGQQVTVAAARTLAQRVVERWGDPVATPWPDLCRLFPTPAVLAACD 407 Query: 166 PQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFAL 225 AL LG+ +R A++ L+ A EG L + DV + L+ PGIG WTA Y A+ Sbjct: 408 GDALGQLGIVRQRQAAIVALSRAVAEGRLLLHAAADVAGTIAALRALPGIGDWTAQYIAM 467 Query: 226 RGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAER----WKPWRSYALLHIWYTEG 276 R + D F D + + + + R AE W+PWRSYA++ W G Sbjct: 468 RALRWPDAFPSGDVALHKALAVQSAPRPARAAEEASQAWRPWRSYAVVRAWAGTG 522 >UniRef50_A1WKZ8 DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=4 Tax=Bacteria RepID=A1WKZ8_VEREI Length = 581 Score = 123 bits (309), Expect = 6e-27, Method: Compositional matrix adjust. Identities = 99/288 (34%), Positives = 135/288 (46%), Gaps = 24/288 (8%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETV----ADSYYARSLAVG--------EYRGVVTAI 51 L W+PP D + +L F A R + VE V A R++ + E G ++A Sbjct: 288 LAWRPPLDVAALLAFFARRQLHGVEWVLPDGAGPILRRTVRLAPGCTGQPREIIGWISAR 347 Query: 52 PDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCV 111 D +RH L + S L PV + ++ L DL +P +N L GLRLPG Sbjct: 348 FDGSRHLLLLQASDSLYPVLPLVIRRVRALLDLDADPAAINAVLHPHFPQGDGLRLPGAF 407 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL-DDFPEYI-CFPTPQRLAAADPQAL 169 D FE VRA+LGQ V++A A L R+ + G+ + +PE FP P LAA D L Sbjct: 408 DGFELAVRAVLGQQVTLAAARTLGQRLVERLGQTIATPWPELQRLFPAPATLAATDGAVL 467 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 +G+ +R A++ LA A G L + D E+ L PGIG WTA Y A+R + Sbjct: 468 GQMGIVRQRQAAIVALARAVDGGQLALHDGADPEKTTAALCALPGIGDWTAQYIAMRVLR 527 Query: 230 AKDVFLPDDY-------LIKQRFPGMTPAQIRRYAERWKPWRSYALLH 270 D F D L Q+ P A+ W+PWRSYALL Sbjct: 528 WPDAFPSGDVALHKALGLQGQKNPARAATA---AAQAWRPWRSYALLR 572 >UniRef50_Q15P13 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15P13_PSEA6 Length = 457 Score = 122 bits (306), Expect = 1e-26, Method: Compositional matrix adjust. Identities = 90/278 (32%), Positives = 129/278 (46%), Gaps = 26/278 (9%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPY+W + FLA RA+S E V+ YAR+ G +G A ++ + L Sbjct: 194 LAYRPPYNWPHLRDFLARRAISGSEWVSQDSYARNFTFGTSKGYFQAQHQPDKYRFLVTL 253 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAAR----PGLRLPGCVDAFEQGVR 119 + CL+ + R+ D+ + ++ + G ++ PG+R+PG + FE G R Sbjct: 254 AIDDLRQLKHCLSNVRRILDVDADSATIDNRIELSGLSKQTITPGIRIPGIWNTFEAGCR 313 Query: 120 AILGQLVSVAMAAKLTARVAQLYGE-RLDD---FPEYI-CFPTPQRLAAADPQALKALGM 174 AILGQ +SV A L ++ GE LDD PE FP P +A +D L LGM Sbjct: 314 AILGQQISVTAAINLVTKLVATIGEPVLDDQAPVPELNRYFPAPDAVANSD---LSFLGM 370 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 P R E L AA P T P D + GIG WT Y LRG D++ Sbjct: 371 PNSRRETLRRF--AAFYAQHPDTPPDD-------WLSIKGIGPWTVAYANLRGLSQADIW 421 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 L D +IK++ A++ PWRSY +W Sbjct: 422 LNSDLVIKKQLLLHDID-----ADKVSPWRSYLTFTLW 454 >UniRef50_D1BI44 DNA-3-methyladenine glycosylase II /DNA-O6-methylguanine--protein-cysteine S-methyltransferase /Transcriptional regulator Ada n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BI44_SANKS Length = 517 Score = 119 bits (298), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 106/310 (34%), Positives = 147/310 (47%), Gaps = 42/310 (13%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRGVVTAIPDIA 55 + L + P+D + +LGFLA RAV+ VET YAR+L + G V + Sbjct: 207 VVDLPVRQPFDAAGVLGFLADRAVAGVETATTEDDGTMRYARTLDLPHGPGAVEVV--AV 264 Query: 56 RHTLHINLSAGLEPVA----AECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGL 105 R + A LE A A +A++ RL DL +P V+ AL + L RPG Sbjct: 265 RRQGRWEMRARLELAALGDVAPAVARVRRLLDLDADPVAVDSALAQDPALRPLVEERPGT 324 Query: 106 RLPGCVDAFEQGVRAILGQLVSVAMA----AKLTARVAQLYGERLDDFPEYICFPTPQRL 161 R+PG VD E VRA++GQ +SVA A +LTAR+ Y FPT ++ Sbjct: 325 RVPGAVDPHELVVRAVVGQQISVAAARTHLGRLTARLGTPYRSAFAGLDRL--FPTAAQV 382 Query: 162 AA-----ADPQAL---KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFP 213 AA AD + L + L +P + A++ +A A +G L + + D L P Sbjct: 383 AAGVPVPADDEVLDPDRPLRLPRRSVRAVVSVARALADGDLVVDVGADAAALRAELVDRP 442 Query: 214 GIGRWTANYFALRGWQAKDVFLPDDYLI--KQRFPGM------TPAQIRRYAER---WKP 262 GIG WTA Y A+R D +LP D + R G+ T A R AE W P Sbjct: 443 GIGPWTAAYVAMRVLGDPDAWLPGDVALVAGARAVGLLGTEKTTSAAHRALAEGASVWAP 502 Query: 263 WRSYALLHIW 272 WRSYA++H+W Sbjct: 503 WRSYAVVHLW 512 >UniRef50_C5C5F4 HhH-GPD family protein n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C5F4_BEUC1 Length = 330 Score = 118 bits (296), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 72/187 (38%), Positives = 97/187 (51%), Gaps = 11/187 (5%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL------DD 148 L L A PG+R+PG VD FE +LGQ VS+A A T+R YG L Sbjct: 132 LAPLVAGAPGMRVPGFVDPFEAAATTVLGQQVSLAAARTFTSRFVAAYGTPLRAAGAPST 191 Query: 149 FPEYICFPTPQRLAAADPQALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMK 207 P + FPTP+ +A ADP L+A+ G+ RA +L LA A +G T PG E+ Sbjct: 192 APHWFAFPTPEAIARADPDELRAVVGLTRARASSLTSLAAAFADGLALDTGPGSRER--- 248 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYA 267 L PGIG WTA+Y LR + D F D ++++ + P + AE W+PWR Y Sbjct: 249 -LLALPGIGPWTADYLELRLLRDPDAFPAGDLVLRRGLGVVDPDEATALAESWRPWRGYG 307 Query: 268 LLHIWYT 274 + HIW + Sbjct: 308 VFHIWSS 314 >UniRef50_A4SQS2 DNA methylation and regulatory protein n=2 Tax=Aeromonas RepID=A4SQS2_AERS4 Length = 522 Score = 116 bits (291), Expect = 8e-25, Method: Compositional matrix adjust. Identities = 95/290 (32%), Positives = 136/290 (46%), Gaps = 32/290 (11%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPYD + ML F RA+ +E V + Y R VG+ G + I H++ + + Sbjct: 215 LPYRPPYDVAAMLAFYRLRAIPGLERVDGNVYERRHRVGDQSGWIR-IEQGKGHSIRLTV 273 Query: 64 SAGLEPVA-AECLAKMSRLFDLQCNPQIVNGALG------RLGAARPGLRLPGCVDAFEQ 116 L P A + L ++ R++DL + Q + LG RL + PG+RLP D +E Sbjct: 274 H-DLPPAALPDLLYRVRRMWDLDADMQRIGERLGQDPLLARLQSRWPGVRLPAGWDEYEV 332 Query: 117 GVRAILGQLVSVAMA----AKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 +RAI+GQ VSV A +L AR +G PTP +L A D L + Sbjct: 333 MLRAIVGQQVSVKGAITIMGRLLARTEAQFG--------VAQLPTPAQLCALD---LDGI 381 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 GMP R L LA A GTL + D + L PGIG WT Y+ LR Q D Sbjct: 382 GMPGSRIRTLQGLAAALASGTLSLNTASD-----EQLLALPGIGPWTVAYWRLRCGQDPD 436 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRY---AERWKPWRSYALLHIWYTEGWQP 279 F D ++++ G ++ +E W+PWR YA +W+ QP Sbjct: 437 AFPASDLVLQKALGGGDKLPVKEVLVQSEAWQPWRGYAASWLWHAMSEQP 486 >UniRef50_Q1QTR7 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=2 Tax=Gammaproteobacteria RepID=Q1QTR7_CHRSD Length = 453 Score = 116 bits (290), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 89/273 (32%), Positives = 126/273 (46%), Gaps = 21/273 (7%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPY W W+ FLAAR + +E + Y R + G G TA+ RH + L Sbjct: 196 LAYRPPYAWEWLRDFLAARRIDRLEWGDEHRYGRHIQWGSASGHFTAVHVPERHGFRVTL 255 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR-LGAARP---GLRLPGCVDAFEQGVR 119 S + + R+ DL + ++ L + L P GLRLPG FE GVR Sbjct: 256 SLDDLGALLPVVRHIRRVLDLDADTALIEAQLRQTLPDTFPLVEGLRLPGVWTPFEAGVR 315 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 A+LGQ VS++ A R+ + GE D + FPT R+AA+D L L MP R Sbjct: 316 AVLGQQVSISAARGHVTRLVEALGEPTGD--DGRQFPTAARIAASD---LAFLRMPQARR 370 Query: 180 EALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + L LA AA + L + + GIG W+A+Y ALRG D++L D Sbjct: 371 DCLRGLAQAACDRRL--------DDDPRQWTALKGIGPWSADYAALRGTSHPDIWLGGDL 422 Query: 240 LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +K+ + + PWRSY L +W Sbjct: 423 GVKRALSALGTVE----PAHATPWRSYLTLQLW 451 >UniRef50_UPI0001901D5D methylated-DNA--protein-cysteine methyltransferase n=1 Tax=Mycobacterium tuberculosis T85 RepID=UPI0001901D5D Length = 361 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 96/293 (32%), Positives = 134/293 (45%), Gaps = 28/293 (9%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV--TAIPDIARHTLHINLSAG 66 P+ + + G LAA AV E V D Y R+L + G+V T PD R L ++ Sbjct: 74 PFAFEGVFGHLAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLLVLDDFRD 133 Query: 67 LEPVAAECLAKMSRLFDLQCNPQIVNGALGR-------LGAARPGLRLPGCVDAFEQGVR 119 L A C RL DL +P+ + ALG +G A PG R+P VD E VR Sbjct: 134 LMTATARC----RRLLDLDADPEAIVEALGADPDLRAVVGKA-PGQRIPRTVDEAEFAVR 188 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQALKALGMPLK 177 A+L Q VS A+ R+ YG + D + FP+ ++LA DP L +P Sbjct: 189 AVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDP---GHLAVPKA 245 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R + L + + +L + D ++A L PG+G WTA A+RG D F Sbjct: 246 RQRTINALVASLADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDAFPAS 305 Query: 238 DYLIKQRFPGM-TPAQIRR---YAERWKPWRSYALLHIWYT-----EGWQPDE 281 D ++ + PAQ R ++ RW+PWRSYA H+W T W P E Sbjct: 306 DLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQWPPQE 358 >UniRef50_Q10630 Methylated-DNA--protein-cysteine methyltransferase n=52 Tax=Actinomycetales RepID=ADA_MYCTU Length = 496 Score = 115 bits (288), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 96/293 (32%), Positives = 134/293 (45%), Gaps = 28/293 (9%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV--TAIPDIARHTLHINLSAG 66 P+ + + G LAA AV E V D Y R+L + G+V T PD R L ++ Sbjct: 209 PFAFEGVFGHLAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLLVLDDFRD 268 Query: 67 LEPVAAECLAKMSRLFDLQCNPQIVNGALGR-------LGAARPGLRLPGCVDAFEQGVR 119 L A C RL DL +P+ + ALG +G A PG R+P VD E VR Sbjct: 269 LMTATARC----RRLLDLDADPEAIVEALGADPDLRAVVGKA-PGQRIPRTVDEAEFAVR 323 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQALKALGMPLK 177 A+L Q VS A+ R+ YG + D + FP+ ++LA DP L +P Sbjct: 324 AVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDP---GHLAVPKA 380 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R + L + + +L + D ++A L PG+G WTA A+RG D F Sbjct: 381 RQRTINALVASLADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDAFPAS 440 Query: 238 DYLIKQRFPGM-TPAQIRR---YAERWKPWRSYALLHIWYT-----EGWQPDE 281 D ++ + PAQ R ++ RW+PWRSYA H+W T W P E Sbjct: 441 DLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQWPPQE 493 >UniRef50_B7RWC6 AlkA N-terminal domain family protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RWC6_9GAMM Length = 471 Score = 114 bits (286), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 86/283 (30%), Positives = 131/283 (46%), Gaps = 32/283 (11%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPYDW+ ++ FL+ A++ VE + DS Y R+ + P ++ L + L Sbjct: 206 LQYRPPYDWNGVVDFLSHHAIAGVEEINDSRYRRNFRTTAGVAQLEIKPHKNKNALELRL 265 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNG------ALGRLGAARPGLRLPGCVDAFEQG 117 + ++ R+FDL NP+ ++ ALG L PG R PG FE Sbjct: 266 QLPDNSRLMSTVGQVRRMFDLDANPEQISALLQQDTALGPLSKRSPGARSPGHWSLFESA 325 Query: 118 VRAILGQLVSVAMAAKLTARVAQ-LYGERLDDFPEYICFPTPQRLAAADPQAL--KALGM 174 VRAI+GQ VS A + AR+A+ E + FP+ AAD AL + M Sbjct: 326 VRAIVGQQVSTVAARTVLARLAKACTKEGIVTFPD-----------AADIAALTDEHFPM 374 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 P +R E L L + +T ++ L F G+G WT A+RG DVF Sbjct: 375 PSRRRETLRSLCQTYSDREDELT--------LEALADFKGVGPWTVGMVAVRGAGDPDVF 426 Query: 235 LPDDYLIKQ---RFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 D +++ PG + ++ A +W+PWRSYA +W + Sbjct: 427 PTGDLGLERTWATLPG-SEGKLNDAAAQWRPWRSYAANLLWRS 468 >UniRef50_Q6MR46 DNA methylation and regulatory protein Ada n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MR46_BDEBA Length = 479 Score = 114 bits (285), Expect = 4e-24, Method: Compositional matrix adjust. Identities = 85/282 (30%), Positives = 131/282 (46%), Gaps = 24/282 (8%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVT--AIPDIARHTLHI 61 L+++PP+D++ +L F + AV +E + R + V G +T +PD + L I Sbjct: 195 LSYRPPFDFTGLLHFYRSHAVGQLEWFEEGLMHRIIEVNGKVGQITLSDLPDESCIKLEI 254 Query: 62 NL--SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDA 113 + + L + ++++ L DL +P I+ L L PG+RLP D Sbjct: 255 DFPDTTALHTI----ISRVRSLLDLDSDPVIIANVLETDKDMKALLKKHPGIRLPSSWDP 310 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGER---LDDFPEYICFPTPQRLAAADPQALK 170 FE V AILGQ+VSV L + L G L D FPTP ++ AD LK Sbjct: 311 FEVVVAAILGQVVSVERGRALVNDLIDLAGSDSGLLRDGKSVRLFPTPAQVIKAD---LK 367 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 +L +R E L+ L+ A + G L + DV+ ++ + PGIG WTA+Y AL+ + Sbjct: 368 SLKTTTRRKETLVALSKALINGDLSLEPAQDVDSFVEKILGIPGIGPWTASYMALKALRH 427 Query: 231 KDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D F D +I + + E + PWR Y +W Sbjct: 428 TDAFPATDLIIARAIAEHPKTKF----ESFSPWRGYVAALLW 465 >UniRef50_C4DFD0 DNA-3-methyladenine glycosylase II; Transcriptional regulator Ada; DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DFD0_9ACTO Length = 413 Score = 108 bits (270), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 92/272 (33%), Positives = 128/272 (47%), Gaps = 17/272 (6%) Query: 15 MLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAEC 74 + G LAA AV VE D Y R+L + GVV+ P H + + L ++ Sbjct: 129 VFGHLAATAVPGVEEWRDGAYRRTLRLPHGPGVVSLRPG-PDHVGCVLWLSDLRDLSI-A 186 Query: 75 LAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 +A+ L DL +P V+ L R L A PG R+P VD E VRA+LGQ VS Sbjct: 187 IARCRWLLDLDADPVAVDELLSRDEVLAPLVAKAPGRRVPRTVDPGEFAVRAVLGQQVST 246 Query: 129 AMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALGMPLKRAEALIHLA 186 A A AR+ YG+R++D + FP+P LA DP L MP+ R L+ L Sbjct: 247 AAARTHAARLVARYGQRVEDPGGGLTHLFPSPGELAGLDPDGLA---MPVSRKNTLLGLV 303 Query: 187 NAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP 246 A ++G + + + D A + L PG G WT A+R D F+ D I+ Sbjct: 304 RALVDGDVELGVGVDWRSAKEALSALPGFGPWTVESIAMRALGDPDAFVASDLGIRLAAE 363 Query: 247 GMT-PAQIRRYAER---WKPWRSYALLHIWYT 274 + P R ER W PWR+YA+ ++W T Sbjct: 364 QLGLPTGARALVERSRAWMPWRAYAVQYLWAT 395 >UniRef50_UPI0000E0EED3 Ada family regulatory protein n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0EED3 Length = 280 Score = 108 bits (269), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 86/296 (29%), Positives = 131/296 (44%), Gaps = 46/296 (15%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGV------------- 47 M L+ PY+WS + FL RA++ +E + +YAR ++ V Sbjct: 2 MIYLSVTQPYNWSMVHAFLTRRAIAGIEECGEFHYARYFDETDFYAVSGLSHVSNEGLTS 61 Query: 48 ----VTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAA-- 101 T P+ R + ++L E LA ++R+ D Q +P + AL + G Sbjct: 62 SWFCATYEPEAQRFAVQLSLHN--EACREAVLANIARVLDAQQDPNTIAQALTKAGFTPE 119 Query: 102 --RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQ 159 GLRLP FE +RAI+GQ +SV A K+ + Q G + Y FP+ Sbjct: 120 HMTSGLRLPATWSPFEALIRAIVGQQISVNGAVKI---LNQWIGNLRAEANGYRHFPSAT 176 Query: 160 RLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWT 219 +A D L MP R +A ++LA ++ P + ++ L GIG WT Sbjct: 177 EIACCDTSKLP---MPKAR-QATLNLAAETVQAK-----PLHDSETIQDLLKIKGIGPWT 227 Query: 220 ANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYA---ERWKPWRSYALLHIW 272 NY +RG D+FL +D ++K Q+ R+A E KPWRSY + +W Sbjct: 228 VNYVLMRGISHPDIFLDNDLVVKN--------QLARFALTPELAKPWRSYVCIQLW 275 >UniRef50_Q3IBU8 Putative ADA regulatory protein (Regulatory protein of adaptative response) n=3 Tax=Alteromonadales RepID=Q3IBU8_PSEHT Length = 454 Score = 107 bits (268), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 79/274 (28%), Positives = 123/274 (44%), Gaps = 22/274 (8%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHIN 62 TL ++PPY+W M FLA R ++ +E + + Y R+ + +G A ++ + Sbjct: 196 TLPFRPPYNWPAMQQFLAKRLIAPMEWITATSYGRTFSDEHCKGSFNAEFIAQKNHFKVA 255 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVN----GALGRLGAARPGLRLPGCVDAFEQGV 118 ++ + + + R+ DL + ++ + A GLRLPG +FE G+ Sbjct: 256 ITINNTHCLQQVITNIRRVLDLDADINLITMHIQDNINNAFAVSEGLRLPGIWSSFEAGI 315 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 RA+LGQ VSV A L ++ GE+ + + FPTPQ+L +D K MP R Sbjct: 316 RAVLGQQVSVTAAHNLVTKLVSELGEQCNG---AVYFPTPQQLVNSDFAFFK---MPQAR 369 Query: 179 AEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 AL +LA P D+ +K GIG WT NY LRG D+ L D Sbjct: 370 KNALYNLAQFCTLN--PQCDDLDLWLNLK------GIGPWTVNYAKLRGQSQPDILLDGD 421 Query: 239 YLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +K+ A ++ P+RSY +W Sbjct: 422 LGVKK----AQAAVAVFSSDNCAPFRSYLTFQLW 451 >UniRef50_A8LHD8 Transcriptional regulator, AraC family n=4 Tax=Actinomycetales RepID=A8LHD8_FRASN Length = 540 Score = 107 bits (267), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 96/303 (31%), Positives = 130/303 (42%), Gaps = 38/303 (12%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVT--AIPDIARHTLHI 61 L ++ P + G L A AV VE D Y R++ +V +PD L + Sbjct: 215 LPFRAPLYPDNLFGHLVATAVPGVEEWRDGAYRRTMRTLHGHAIVALRPLPDHIGCRLAL 274 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNG------ALGRLGAARPGLRLPGCVDAFE 115 L PV C RL DL +P V+G AL L A PG R+P VD E Sbjct: 275 TDVRDLAPVIGRC----RRLLDLDADPIAVDGQLAADPALAPLVARAPGRRVPRTVDPAE 330 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGE--------------RLDDFPEYIC---FPTP 158 VRA+LGQ VSVA A AR+ G ++ D E+I + Sbjct: 331 LAVRAVLGQQVSVAAARTHAARLVTAVGTPIHDPEGGLTHLWPQIADLAEHIERTEYAEC 390 Query: 159 QRLAAADPQALKA-----LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFP 213 LA A P +A L +P R L + G + + GD E+A L P Sbjct: 391 TDLADAVPAGRRAGAPRGLALPAARRRTFAALVGGLVSGMIELGAGGDWERARAALAALP 450 Query: 214 GIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALL 269 GIG WT A+R D FLP D +++ + TPA + R+A W+PWR+YA+ Sbjct: 451 GIGPWTLETIAMRALGDPDAFLPGDLGVRRGAERLGLPATPAALSRHAAAWRPWRAYAVQ 510 Query: 270 HIW 272 H+W Sbjct: 511 HLW 513 >UniRef50_C0Q970 AlkA n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0Q970_DESAH Length = 353 Score = 107 bits (266), Expect = 6e-22, Method: Compositional matrix adjust. Identities = 75/283 (26%), Positives = 129/283 (45%), Gaps = 18/283 (6%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L + P D+S ++ F+ RA+ VE + D Y+R+ +G + + + + + Sbjct: 73 LPYARPLDFSQVIEFMKFRAIQGVEDIEDQRYSRTFRTNRSKGYFIVRDNPGKSAIELTI 132 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGL------RLPGCVDAFEQG 117 E ++ +FDL + +N + G+ RLP ++FE Sbjct: 133 YCDDIRCYMEIYNRVRLMFDLNTDFFPINKKFIKDKLLSKGMSDGHVPRLPIAFNSFEFC 192 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD-DFP---EYICFPTPQRLAAADPQALKALG 173 +RA+LGQ +SV A+ L +R+A+ G + + +FP +Y FP P+ L +L+ +G Sbjct: 193 IRAVLGQQISVQAASTLASRIAKKAGPQTEKNFPPGLDYF-FPGPEELVKT---SLEGIG 248 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + R + ++A L+ + E K GIG WT NY A+R D Sbjct: 249 ITGVRQATITNIAQGLLDNVFSLNPNQPFETFQKDFSAIRGIGEWTVNYVAMRSLGMVDS 308 Query: 234 FLPDDYLIKQRFP--GMTPA--QIRRYAERWKPWRSYALLHIW 272 F D I + G P +I + AE+W+P+R+YA L +W Sbjct: 309 FPAADLGIIKALEKNGKRPGRKEILKQAEKWRPYRAYAALCLW 351 >UniRef50_A3XSB2 Ada regulatory protein n=1 Tax=Vibrio sp. MED222 RepID=A3XSB2_9VIBR Length = 482 Score = 105 bits (261), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 90/292 (30%), Positives = 122/292 (41%), Gaps = 40/292 (13%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L++ P DW +LGF R + +E V D YY R++ V +G A R +L I Sbjct: 205 LSFHGPLDWDHLLGFYRRRMIEGLEEVGDGYYQRTVNVNGSKGWFKATLAKER-SLDIEF 263 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG---AARPGLRLPGCVDAFEQGVRA 120 +A + R+FDL + V + A+ G+R+PG A+E GVRA Sbjct: 264 ELDDMSQLRSLIANIRRMFDLDVDISKVEDFFSTIDPNLVAKSGIRIPGVWSAWEAGVRA 323 Query: 121 ILGQLVSVAMA-AKLTARVAQLYG-------------ERLDDFP------EYICFPTPQR 160 ILGQ VSV A +L V +L G + D P E FPTP++ Sbjct: 324 ILGQQVSVTAAIGQLNLLVRKLSGSYQVFDSQEQANSQECSDLPQIADASEKAYFPTPKQ 383 Query: 161 LAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTA 220 +A AD L+ MP R E L A + + E K + GIG WT Sbjct: 384 IADADVSFLR---MPGSRKETLKRFAQ--------YMVDNEAEHPSKWID-LKGIGPWTI 431 Query: 221 NYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 Y LRG + L D ++K +F P E PW SYA H W Sbjct: 432 QYALLRGLSEPNHLLVGDLVVK-KFIEHRPTI---NTESVSPWGSYATFHCW 479 >UniRef50_Q1ZAD8 Hypothetical ada regulatory protein n=2 Tax=Photobacterium profundum RepID=Q1ZAD8_PHOPR Length = 514 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 92/302 (30%), Positives = 129/302 (42%), Gaps = 52/302 (17%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLA-------------------- 40 + TL+++PPY+W + F A+R + +E ++ Y R+ + Sbjct: 212 VITLSYRPPYNWQHLQQFYASRIIEGLEWCDENSYGRTFSFDSDDCSHSVLNTGQNINHS 271 Query: 41 ------VGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGA 94 +GE+ IP+ + + I LS + + R DL + + + Sbjct: 272 EDAFDCIGEFTAF--HIPEKSVFLVRIQLSD--LRYLNRVIRNIRRCLDLDADIEHIEAR 327 Query: 95 LGR-LGA---ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 L R L A GLRLPG FE G+RAILGQ VSV A L +V G + Sbjct: 328 LKRALNTDILAISGLRLPGTWSPFEAGIRAILGQQVSVQAARNLVTKVV---GNNPINTD 384 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQ 210 E FP PQ+L A + L L MP KR E + LAN A L + K L Sbjct: 385 ERCYFPLPQQLIADE---LTYLKMPGKRKETIRLLANYACNKPLDDS---------KALL 432 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLH 270 GIG WT +Y +RG D+FL D IK+ M A + PWRSY L+ Sbjct: 433 AIAGIGPWTVHYLRMRGLSDPDIFLIGDLGIKKALAKMNEA---FSPDAAAPWRSYLTLY 489 Query: 271 IW 272 +W Sbjct: 490 LW 491 >UniRef50_Q7MGD3 Adenosine deaminase n=51 Tax=Vibrionales RepID=Q7MGD3_VIBVY Length = 481 Score = 103 bits (258), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 87/269 (32%), Positives = 122/269 (45%), Gaps = 35/269 (13%) Query: 15 MLGFLAARAVSSVETVADSYYARSLAV-GEYRGVVTAIPDI---ARHTLHINLSAGLEPV 70 ML F RA+ S E V ++ Y R + + G+ G P + L + S + Sbjct: 235 MLDFYRQRAIESEEVVTETSYQRQVVINGKTVGFRAEFPATFPAEKRQLVVYFSMDDLTL 294 Query: 71 AAECLAKMSRLFDLQCNPQIVNGALGR--LGAARP-GLRLPGCVDAFEQGVRAILGQLVS 127 +A + R+FDL C+ +++ L LG + G+R+PG + +E GVRAILGQ VS Sbjct: 295 LRPMVAGIRRMFDLDCDTRVIEAHLNTVALGLVKSVGIRIPGVWNVWEAGVRAILGQQVS 354 Query: 128 VAMA-AKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLA 186 V A +L VA L+ + E FP+PQ++ AD L L MP R E L A Sbjct: 355 VKAAIGQLNLLVATLHHD-----SEVRTFPSPQQVVDAD---LHFLRMPQSRKETLRRFA 406 Query: 187 NAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ--- 243 LE D Q + GIG WT +Y LRG D L D ++K+ Sbjct: 407 VMMLENE-----HADPNQWL----ALKGIGPWTVSYAQLRGLSQPDRLLEKDLVVKKALA 457 Query: 244 RFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +FP + E PW SYA H+W Sbjct: 458 QFPTLN-------QESASPWGSYATFHLW 479 >UniRef50_D0LE01 Ada metal-binding domain protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LE01_GORB4 Length = 526 Score = 102 bits (255), Expect = 1e-20, Method: Compositional matrix adjust. Identities = 102/305 (33%), Positives = 138/305 (45%), Gaps = 47/305 (15%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADS-----------YYAR--------SLAVGEY 44 L ++PPY WSWM FL + A + VE+V D Y R +LAV E Sbjct: 222 LVYRPPYRWSWMRWFLGSHAAAGVESVIDDDPDAITPATRWRYRRVLDLPHGPALAVVEP 281 Query: 45 RGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAAR-- 102 T P R TLH L + ++ R DL + AL A R Sbjct: 282 STEETG-PPFVRLTLHHMDMRDL----GVAVNRIRRHLDLDADVATAEDALRHDPALRPL 336 Query: 103 ----PGLRLPGCVDAFEQGVRAILGQLVSVAMA-AKLTARVAQLYGERLD-----DFPEY 152 PGLRLPG +D E +R ++GQ +SVA A + A VA+L G R+ D P Sbjct: 337 IDAAPGLRLPGSLDPAETILRTMIGQQISVAAARTHIDALVARL-GTRVPWPDEADLPPS 395 Query: 153 ICFPT---PQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKT- 208 FP+ P A A+ + L P +R E+++ +A A + T+ PG ++ Sbjct: 396 AVFPSATFPSATAIAE-HGHQVLRGPRRRIESIVAVAAALADKTV-EPHPGLAASDLRAQ 453 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-RYAERWKPWRSYA 267 L PGIG WTA A+R D+ L DD ++KQ MT I R W PWRSYA Sbjct: 454 LLELPGIGPWTAALVAMRVTGDPDIALTDDLVVKQ---AMTELGIDIRSVPSWSPWRSYA 510 Query: 268 LLHIW 272 +H+W Sbjct: 511 SMHLW 515 >UniRef50_C7MYM6 DNA-3-methyladenine glycosylase II /DNA-O6-methylguanine--protein-cysteine S-methyltransferase /Transcriptional regulator Ada n=20 Tax=Actinobacteria (class) RepID=C7MYM6_SACVD Length = 510 Score = 102 bits (254), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 92/295 (31%), Positives = 142/295 (48%), Gaps = 29/295 (9%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++ P+D + +L FL ARAV VE+ + Y R+L + VV P + HI Sbjct: 222 LPFRRPFDTTGVLDFLTARAVPGVEST-EGDYRRTLRLPHGAAVVRLSP----RSTHIEC 276 Query: 64 SAGLEPVA--AECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L + + ++++ RL+DL +PQ V + AL +A PG+R+PG VD E Sbjct: 277 LLRLTDIRDLSGAVSRIRRLWDLDADPQAVLDCLSADPALAPWLSAAPGIRVPGAVDGPE 336 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYG-----ERLD--DFPEYICFPTPQRLAAADPQA 168 +RA+ Q +S A R+ G E LD D P + FP P +A A Sbjct: 337 LVLRALFEQGMSTRRAHIALGRLVTELGTPIAPELLDATDDPTLL-FPGPTAVAE---HA 392 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 L P R + + +A A +G L + + D E + L PGI W A+Y +R Sbjct: 393 ASILPGPQDRVDTIRTIAAALAQGDLDVHVGRDAEDLRRDLLAVPGISSWAADYILMRLL 452 Query: 229 QAKDVFLPDDYLIKQ--RFPGM--TPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D+ L D ++++ R G+ T + + +A RW+PWRSYA +++W G QP Sbjct: 453 GHPDILLGTDLVLRRGARSLGIDATYSGLTTHARRWRPWRSYAGMYLWRA-GDQP 506 >UniRef50_D1C0H7 Transcriptional regulator, AraC family n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1C0H7_XYLCX Length = 543 Score = 102 bits (253), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 98/316 (31%), Positives = 139/316 (43%), Gaps = 49/316 (15%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETV-ADS----YYARSLAV--GEYRGVVTAIP---- 52 L + P+D + GFLAARAV+ VET AD YAR++A+ G V+A P Sbjct: 212 LPVREPFDAPGVFGFLAARAVTGVETASADDDGTLRYARTVALPHGPAAFEVSATPRAVS 271 Query: 53 --DIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPG 104 D + + + A +A++ RL DL +P V+ ALG L A PG Sbjct: 272 GRDARGWDVQVRVELTSLADVATVVARVRRLLDLDADPVAVDTALGTDPALALLVTATPG 331 Query: 105 LRLPGCVDAFEQGVRAILGQLVSVAMA----AKLTARVAQLYGERLDD----FP------ 150 +R+PG VD E VRAI+GQ +SVA A +L AR+ Y D FP Sbjct: 332 IRVPGAVDPHELLVRAIVGQQISVAAARTHLGRLAARLGTPYASSFDGLTTVFPSAAAIV 391 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQ 210 + + P A+DP + L +P + A++ A G L + + D + L Sbjct: 392 DGVPVTAPGTPEASDPD--RPLRLPARGVAAVVGATRALAAGDLAVDVGADPDTLRTALL 449 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDD--------------YLIKQRFPGMTPAQIRRY 256 PG+G WTA Y A+R D + D +R P + + Sbjct: 450 ALPGVGAWTAAYVAMRVLGDPDAWPEGDVALVAGAAAAGIAAASAAERRPTQRHRDLAAH 509 Query: 257 AERWKPWRSYALLHIW 272 A W PWRSYA +H+W Sbjct: 510 AAAWAPWRSYAAMHLW 525 >UniRef50_A1S7Q4 DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada n=1 Tax=Shewanella amazonensis SB2B RepID=A1S7Q4_SHEAM Length = 483 Score = 99.4 bits (246), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 89/288 (30%), Positives = 135/288 (46%), Gaps = 34/288 (11%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVE----TVADSYYARSLAVGEYRGVVTAIPDIARHTL 59 L+++PPYD+ + F ARA+ E + Y R+L V G A ++ L Sbjct: 213 LSFRPPYDFMRLRAFFMARAIPGAEWFFNDAGEPCYGRTLMVAGDAGWFEACLLAGKNAL 272 Query: 60 HINLSAGLEPVA-AECLAKMSRLFDLQCNPQIVNGAL-GRL--GAARPGLRLPGCVDAFE 115 +++ G A ++ LA++ R+ D+ N +++ + G + G + LPG FE Sbjct: 273 AVSIFPGGRVSALSQWLAEIKRVLDIDANLSLIHEHIQGHMPEGVVLNTMTLPGAGSFFE 332 Query: 116 QGVRAILGQLVSVAMAAKL-------TARVAQLYGERLDDFPEYICFPTPQRLAAADPQA 168 RA+LGQ VS+ A +L T +L G R FPT +++A+A Sbjct: 333 AACRAVLGQQVSLVQATRLLGLLTAETTPEVELGGRRCR------VFPTAEQVASA---T 383 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 L++L MP R AL +A AL P +P D TL GIG WT +Y +RG Sbjct: 384 LESLKMPGSRKNALRDMA--ALFSRDP--VPDDA-----TLLAVKGIGPWTVSYARMRGL 434 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQI-RRYAERWKPWRSYALLHIWYTE 275 DV L D ++KQ+ M A++ R PW SY L +W+TE Sbjct: 435 SDPDVLLVGDLVVKQKLTAMGWAKVPDRLKSDVSPWGSYLTLALWHTE 482 >UniRef50_C1YI07 DNA-O6-methylguanine--protein-cysteine S-methyltransferase; DNA-3-methyladenine glycosylase II; Transcriptional regulator Ada n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YI07_NOCDA Length = 561 Score = 99.4 bits (246), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 103/317 (32%), Positives = 131/317 (41%), Gaps = 49/317 (15%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVT----AIPDIARHTL 59 L ++ P D + ML FL RAV VE D Y R+L + VV + A T Sbjct: 213 LPYREPIDLARMLRFLGDRAVPGVEEYRDGVYRRTLMLAHGPAVVELSEGSGTGRAGRTG 272 Query: 60 HINLSAGLEPVAAE-------------CLAKMS-------------RLFDLQCNPQIVNG 93 + G+ P A C ++S RL DL +P V Sbjct: 273 RAGATGGVRPADAVDGGVSVSGGGHVLCRLRLSEARDLTSAVRRCRRLLDLDADPGAVAE 332 Query: 94 ALGR---LG---AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 ALG LG AA PGLR PG VD E VRA+LGQ VSV A L R+ + +GE L Sbjct: 333 ALGGDPLLGPIVAAHPGLRSPGHVDPAELAVRAVLGQQVSVRAARTLAGRLVERFGEPLA 392 Query: 148 DFPE------YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD 201 E FP+ A +P+ R AL L A G + + D Sbjct: 393 PGLEAPGGGLTHVFPS---PDALAAADPAGFSVPVARGRALAGLCEAIASGWIDLGPGCD 449 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG----MTPAQIRRYA 257 ++A + L GIG WTA Y +RG DVFL D ++ TPA R A Sbjct: 450 RDEAERRLVELRGIGPWTAGYVRMRGLGDPDVFLHGDLGVRMALEAGGRRATPAAAAREA 509 Query: 258 ERWKPWRSYALLHIWYT 274 W PWRSYA +W + Sbjct: 510 REWSPWRSYANHALWAS 526 >UniRef50_Q12L65 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II n=8 Tax=Shewanella RepID=Q12L65_SHEDO Length = 545 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 95/304 (31%), Positives = 131/304 (43%), Gaps = 52/304 (17%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSY-YARSLAVGEYRGVVTAIPDIAR----- 56 +L ++PP +W M F R VS +E + + Y+RS +GV + A+ Sbjct: 227 SLAFRPPLNWHKMWAFYQFRQVSGMEILDEEQGYSRSFCFDGVKGVFRVRLNEAKSQFDT 286 Query: 57 --HTLHINLSAGLEPVAAECLAKMSRLFDLQCN---------PQIVNGALGRLGAARPGL 105 + LH + L PV + ++ RL DL + P + GA +L A GL Sbjct: 287 QIYLLHSHDVKQLHPV----VLRIRRLLDLDTDMATIAQIFVPLVAMGA--KLDA---GL 337 Query: 106 RLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAA 164 R+P FE RAILGQ VSV A KL + + YGE + + + FPTP+ +A A Sbjct: 338 RIPATASVFEAACRAILGQQVSVQQATKLLNTLVEHYGETFELNGQVWRLFPTPEAVATA 397 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFA 224 +L L MP R AL L E P + P D + GIG WT Y Sbjct: 398 ---SLDELKMPGARRLALNALGAYVQEH--PHSTPDDWLEV-------KGIGPWTVAYAK 445 Query: 225 LRGWQAKDVFLPDDYLIKQRFPGM---------TP----AQIRRYAERWKPWRSYALLHI 271 +RG +VFL D +IK R G+ TP A A + PW SY + Sbjct: 446 MRGLSESNVFLSSDLVIKHRIHGLYAKAGGIIETPKAYLALAADIANKVSPWGSYLTFGL 505 Query: 272 WYTE 275 W E Sbjct: 506 WDDE 509 >UniRef50_C8XKJ9 AlkA domain protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XKJ9_NAKMY Length = 300 Score = 98.6 bits (244), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 93/294 (31%), Positives = 129/294 (43%), Gaps = 41/294 (13%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE 68 P+ +L FL+ +V VE V + YARSL +G D T+ ++L + Sbjct: 22 PFAADRLLAFLSRESVPGVEYVREREYARSLRLGSGD-------DAEVGTIRLHLPGPGD 74 Query: 69 PVAA-----------ECLAKMSRLFDLQCNPQIVNGAL----GRLGAAR--PGLRLPGCV 111 P E +A+ L DL + V+ L G + + PGLR+PG Sbjct: 75 PPTVRAVVRFAARIDEAVARCRHLLDLDTDGSAVDRVLRADPGLAASVQRCPGLRVPGPA 134 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI-----CFPTPQRLAAADP 166 + E VR ILGQ VSVA A R+ L +RL PE + FP P R+AA P Sbjct: 135 EPAETVVRTILGQQVSVAGARTAATRLVALADDRL---PEPVDGLTHLFPEPARIAALGP 191 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 A R A + AA + + A L PGIG WTA+Y ++R Sbjct: 192 TAFVG-----PRIRAQAVVTAAAAIAGGTLRLDRTDAHARGVLLAMPGIGPWTADYLSMR 246 Query: 227 GWQAKDVFLPDDYLIKQRFPGM-TPAQIRRYAER---WKPWRSYALLHIWYTEG 276 + DV L DD I++ + P Q R A R W+P+RSYA +H+W G Sbjct: 247 VFGDPDVLLVDDLAIRRGAGALGLPDQPRELAARGLDWRPFRSYAGMHLWAASG 300 >UniRef50_B2GIR9 Putative methylated-DNA--protein-cysteine methyltransferase/3-methyladenine-DNA glycosylase II n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GIR9_KOCRD Length = 532 Score = 98.6 bits (244), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 90/289 (31%), Positives = 134/289 (46%), Gaps = 32/289 (11%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE-------YRGVVTAIPDIAR 56 L+++ P D + + A AV VE + Y+R+L + YR + AR Sbjct: 241 LSYRAPLDLHGLFVWFAVHAVEGVEVGTATSYSRTLRLPGGPAWLRVYRRGADELRMRAR 300 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGC 110 T +L A +A++ RLFDL +P V+ AL L AARPGLR+ G Sbjct: 301 LTDLADLPA--------LIARVRRLFDLDADPLAVDEALSHVPALRPLVAARPGLRVVGS 352 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQA 168 D E +R ++GQ +S+A A + + GE +F + FPT +A + Sbjct: 353 ADPEETLIRTLIGQQISLAAARTVLGARTREMGEPAPEFAPGLSHMFPTAAAIAEHGERF 412 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 L+ P R A++ A+A G L ++ D Q L PG+G WTA++ +R Sbjct: 413 LRG---PAARVRAVLGAASAVASGELSLSPGDDAAQQRAALLALPGVGPWTADHVRMRVT 469 Query: 229 QAKDVFLPDDYLIK---QR--FPGMTPAQIRRYAERWKPWRSYALLHIW 272 DVFL DD ++ QR PG A + +A+ PWRSYA H+W Sbjct: 470 GDPDVFLVDDGALRAGAQRIGLPGDKKA-LTAWAQSAAPWRSYATTHLW 517 >UniRef50_A0JV31 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada n=5 Tax=Actinobacteria (class) RepID=A0JV31_ARTS2 Length = 504 Score = 97.8 bits (242), Expect = 4e-19, Method: Compositional matrix adjust. Identities = 90/301 (29%), Positives = 138/301 (45%), Gaps = 34/301 (11%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAV--GEYRGVVTAIPDIARHTLH 60 L ++ P+D + FLA R++ +ET + YAR+L + + R V D L Sbjct: 199 NLPYREPFD-PGIFQFLAVRSIPGIETGTGTSYARTLRLPHADARFSVEYDADAPGRPLV 257 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALG---RLG---AARPGLRLPGCVDAF 114 + + A L+++ RL DL +P ++ AL RL A PG+R+PG VD Sbjct: 258 LTIGAVDLRDLPSLLSRVRRLLDLDADPVAIDNALEADPRLAPAVKAFPGMRMPGAVDPQ 317 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERL---DDFPEYICFPTPQRLAAADPQALKA 171 E +RA++GQ ++VA A +++ E L D + FPT ++A DP Sbjct: 318 ELLIRAMIGQQITVAAARTALTQLSACGSESLVPADGL--HRLFPTAAQIA--DP-GFGL 372 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 L P +R +++ A A G L D+ L PG+G WT Y A+R A Sbjct: 373 LRGPQRRIDSVRAAAGAMAAGNLDFGYGDDLAGLQSKLLPLPGVGPWTVGYVAMRVIGAP 432 Query: 232 DVFLPDDYLIK-------------QRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 DVFL +D ++ +R PG+ PA + PWRSYA +H+W + Sbjct: 433 DVFLANDAAVRNGILALDTGPQAGERPPGVQPADFTDVS----PWRSYATMHLWRAAAMR 488 Query: 279 P 279 P Sbjct: 489 P 489 >UniRef50_A5CSR4 Putative DNA glycosylase n=2 Tax=Clavibacter michiganensis RepID=A5CSR4_CLAM3 Length = 311 Score = 95.5 bits (236), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 85/288 (29%), Positives = 125/288 (43%), Gaps = 31/288 (10%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVT-----AIP-DIARHTLHIN 62 P+D ++ FL+ AV+ E + + +S + G VT A P D+ + + Sbjct: 20 PFDGGGVIRFLSWHAVTGAEEGDATSFTQSARLAHGAGTVTVRLLEAEPGDVGGARVEVT 79 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA------ARPGLRLPGCVDAFEQ 116 AAE LA RL L + ++ L R A A PGLR+PG +D Sbjct: 80 TRVEHAADAAELLAGTRRLLGLDVDAARIDADLARDPALAAVVRATPGLRIPGTLDPRST 139 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICF-------PTPQRLAAADPQAL 169 R I+GQ +SVA A R+ GE D P + PT R+A + L Sbjct: 140 LFRTIVGQQISVASARATHGRMTADLGE---DLPASVAHGSVTRLPPTAARIARDGGELL 196 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMK-TLQTFPGIGRWTANYFALRGW 228 + P +R LI +A A G L + PG ++ L F G+G WTA+Y A+R Sbjct: 197 RG---PARRTATLIRIAEALETGELVIE-PGVPRAELRAALVAFHGVGPWTADYVAMRAL 252 Query: 229 QAKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIW 272 D+ L D ++++ + + A W PWRSYA LH+W Sbjct: 253 GEPDILLSGDLIVRRGGAALGLPDEARALDARAAAWSPWRSYATLHLW 300 >UniRef50_C1RNZ7 DNA-3-methyladenine glycosylase II n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RNZ7_9CELL Length = 302 Score = 95.1 bits (235), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 69/186 (37%), Positives = 92/186 (49%), Gaps = 8/186 (4%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 LG L AARP LR+PG D FE V+ +L Q VS+ AR+A YG + Sbjct: 94 LGPLVAARPHLRVPGHPDGFEAAVQVVLTQQVSLGAGRTTGARLASAYGR--PGPGGLLA 151 Query: 155 FPTPQRLAAADPQALKA-LGMPLKRAEALIHLANAALEGT--LPMTIPGDVEQAMKTLQT 211 +P P+ LAAAD AL+A L +P RA A+ LA A G +P DV A L Sbjct: 152 YPRPEDLAAADSVALQAVLRVPHARARAVHALAVACAGGLRLVPGAPAADVRAA---LLA 208 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 PGIG WTA+ ALR +D F D ++++ + W PWR++A H+ Sbjct: 209 IPGIGPWTADVVALRALGDRDAFPAGDLVLRRALGVPDVRDVATAGRAWSPWRAFAATHL 268 Query: 272 WYTEGW 277 W G+ Sbjct: 269 WAAVGY 274 >UniRef50_A3D6C4 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=11 Tax=Shewanella RepID=A3D6C4_SHEB5 Length = 565 Score = 93.6 bits (231), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 94/324 (29%), Positives = 132/324 (40%), Gaps = 71/324 (21%) Query: 6 WQPPYDWSWMLGFLAARAVSSVE---------------TVADSY--------------YA 36 ++PP DW+ L F RAV+ +E VAD Y Sbjct: 256 YRPPLDWASQLAFYRLRAVTGMEWFTPQMSHPQASDAVQVADEANLAAEANADDNGLEYG 315 Query: 37 RSLAVGEYRGVVTAI--PDIARHTLHINLSAGLEPVAAECL----AKMSRLFDLQCNPQI 90 R A+G+ RG V I P + R L I L+ E A + L ++ R+ DL + Q Sbjct: 316 RCFAIGKMRGTVQIIHEPKLNRFKLAIALT---EDSAVDELQLLVTEVRRILDLDADMQQ 372 Query: 91 VNGALGRLGAAR----PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + L L + GLR+PG FE RAILGQ V+V A KL + + YGE Sbjct: 373 IEQGLSTLPSLGLMPFSGLRIPGAGSLFEAVCRAILGQQVTVVQATKLLNILVEAYGECF 432 Query: 147 D-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQA 205 + EY FPTP+ + A +L L MP R AL LA E E + Sbjct: 433 SLNGREYRLFPTPEAIREA---SLTELKMPGARKLALNALAAFICE---------HPEAS 480 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR----------- 254 + + GIG WT Y LRG +VFL D ++K+ + R Sbjct: 481 VDDWLSVKGIGPWTIAYAKLRGLGDPNVFLHLDLIVKKHLLALYIKNNRLDETAAAAVIY 540 Query: 255 -----RYAERWKPWRSYALLHIWY 273 + +++ PW SY +W+ Sbjct: 541 SQLCEQLSQQIAPWGSYLTFQLWH 564 >UniRef50_A6WG49 HhH-GPD family protein n=5 Tax=Actinomycetales RepID=A6WG49_KINRD Length = 295 Score = 93.2 bits (230), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 72/218 (33%), Positives = 100/218 (45%), Gaps = 11/218 (5%) Query: 73 ECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQGVRAILGQLV 126 E + R L +P L R L AARPGLR+P V E V +LGQ V Sbjct: 78 EVEGTVRRWLGLDADPAEAEAWLARDPLLAPLVAARPGLRVPRAVAGVETAVLTVLGQQV 137 Query: 127 SVAMAAKLTARVAQLYGERLDDFPEYI-CFPTPQRLAAADPQALKA-LGMPLKRAEAL-- 182 S+A A R+ +G + P + FP LA A +A++A G+ RA + Sbjct: 138 SLAAARTFGGRLVAAFGTPVSSAPSSLTAFPAAAVLADAGAEAIRAATGVTGARARTVHA 197 Query: 183 -IHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 L+ P A L PGIG WTA+Y ALR +D FLP D ++ Sbjct: 198 LAAALAGGLDLDAAAGDPERAGAARARLLALPGIGPWTADYVALRVLGDRDAFLPGDLVL 257 Query: 242 KQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 ++ G++P + AE W+PWR +ALLH+W + P Sbjct: 258 RRALGGLSPKEAAARAEPWRPWRGHALLHLWTAAVFVP 295 >UniRef50_C0ZIT0 DNA-3-methyladenine glycosylase II n=75 Tax=Bacillales RepID=C0ZIT0_BREBN Length = 310 Score = 91.7 bits (226), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 61/182 (33%), Positives = 89/182 (48%), Gaps = 15/182 (8%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLA 162 GLR G D FE I+GQ +++ A L R+ + +G R++ + Y FPT +++A Sbjct: 127 GLRTMGIPDLFEALSWGIIGQQINLTYAYTLKRRLVEAFGRRVEFEGETYWLFPTAEKIA 186 Query: 163 AADPQALKALGMPLKRAEALIHLANAALEGTLPMTI---PGDVEQAMKTLQTFPGIGRWT 219 L L M K+ E LI +A +EG L + GD + A K L + GIG WT Sbjct: 187 GLSVTDLDGLRMTTKKCEYLIDVAQLIVEGKLSKELLWDGGDYQTAEKRLTSIRGIGPWT 246 Query: 220 ANYFALRGWQAKDVFLPDDY---------LIKQRFPGMTPAQIRRYAERWKPWRSYALLH 270 ANY +R + F DD L K++ P T A+IR ++ W W SYA + Sbjct: 247 ANYVLMRCLRMPSAFPIDDVGLHNAIKFLLGKEKKP--TKAEIRELSKTWTNWESYATFY 304 Query: 271 IW 272 +W Sbjct: 305 LW 306 >UniRef50_A4BNP3 3-methyladenine DNA glycosylase/8-oxoguanineDNA glycosylase n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNP3_9GAMM Length = 263 Score = 90.5 bits (223), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 83/275 (30%), Positives = 121/275 (44%), Gaps = 37/275 (13%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH-----INL 63 P+ W+ +LG+L AR + E + D Y R + G R T H + + Sbjct: 12 PFPWAALLGYLDARLIPGAERIVDDGYER-----RHNGATV------RVTYHAGGKCLRI 60 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP------GLRLPGCVDAFEQG 117 +A E ++ RLFD + + V+ L RP GLR GC FE Sbjct: 61 TADDAVCGDEITVRVIRLFDTGQDTRAVDRQLRACPLLRPRVDRMPGLRPLGCWCPFELC 120 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 VR ++GQ VSVA AA L R+A+ GE +P L A L A+GMP + Sbjct: 121 VRTVVGQQVSVAAAATLMRRLAERCGEL-----------SPAALCA---ADLDAIGMPGR 166 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R L LA A G L + D L PGIG WT Y A+R + D+ Sbjct: 167 RVATLRRLAEAVATGELALE-HADWAAIDAGLSRLPGIGPWTRAYLAIRLGRQPDILPET 225 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D + + +P +R ++RW+P+R++A ++W Sbjct: 226 DLGLLRAAGAASPTVLRALSQRWRPYRAHAATYLW 260 >UniRef50_Q1YTX8 Putative DNA-3-methyladenine glycosylase II n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YTX8_9GAMM Length = 257 Score = 90.1 bits (222), Expect = 8e-17, Method: Compositional matrix adjust. Identities = 88/272 (32%), Positives = 128/272 (47%), Gaps = 28/272 (10%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYR-GVVTAIPDIARHTLHINLSAGL 67 P+ W +L +L+ R V E++AD++Y R YR G+V D L I S Sbjct: 9 PFPWQQLLEYLSFRLVPEFESIADNHYQRI-----YRDGLVRVSYDEPNGLLQIK-SDLP 62 Query: 68 EPVAAECLAKMSRLFDLQ-CNPQIVNGALGRLG--AARPGLRLPGCVDAFEQGVRAILGQ 124 + + +SR+F Q C I L L A PG R GC D FE +R I+GQ Sbjct: 63 QDQLDNLIVPVSRIFRPQLCTQAIYQQLLPHLPILAKSPGFRPLGCWDPFELCLRTIIGQ 122 Query: 125 LVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIH 184 V+VA A + R+ + G+ TP+ L AAD L +GMP R ALI Sbjct: 123 QVTVAAANTIMRRLVERCGQL-----------TPEALLAAD---LSNMGMPGARVAALIA 168 Query: 185 LANAALEGTLPMTIP-GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ 243 LA A G L ++ P ++++A+ L+ GIG WT Y A+R D F D + + Sbjct: 169 LATALANGDLDLSRPWPELKEALLKLR---GIGPWTCGYLAIRLGMDDDAFPETDVGLIR 225 Query: 244 RFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + + AE W+P+R+YA + +W E Sbjct: 226 AAKSESAMALLASAELWRPYRAYAAVGLWALE 257 >UniRef50_C6D2P4 DNA-3-methyladenine glycosylase II n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D2P4_PAESJ Length = 304 Score = 89.7 bits (221), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 64/199 (32%), Positives = 99/199 (49%), Gaps = 16/199 (8%) Query: 85 QCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 + +P +V+ A+GR GLR G D FE I+GQ +++A A L R + YG+ Sbjct: 107 EKDPLLVH-AIGRF----HGLRSVGISDLFEALCWGIIGQQINLAFAYTLKRRFVEAYGQ 161 Query: 145 RLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL---PMTIPG 200 ++ + + FP P+ +A P+ + ++ M K++E LI +A EG+L + G Sbjct: 162 SVEREGRTFWQFPVPETIATLKPEDMASMQMTSKKSEYLIGVAKLMAEGSLDKQSLLALG 221 Query: 201 DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-----PA--QI 253 D K L GIG WTANY +R + + F D + +T PA +I Sbjct: 222 DFAAIEKQLTGIRGIGPWTANYVLMRCLRLPNAFPIADVGLHNSIKALTGSEAKPAISEI 281 Query: 254 RRYAERWKPWRSYALLHIW 272 R+ AE WK W SYA ++W Sbjct: 282 RQMAEGWKGWESYATFYLW 300 >UniRef50_C8SYC6 DNA-3-methyladenine glycosylase 2 (Fragment) n=1 Tax=Klebsiella pneumoniae subsp. rhinoscleromatis ATCC 13884 RepID=C8SYC6_KLEPR Length = 95 Score = 89.0 bits (219), Expect = 1e-16, Method: Compositional matrix adjust. Identities = 41/84 (48%), Positives = 54/84 (64%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 M L W PPYDW+WM+GFL ARAV+ VE D Y+R+ V +RG++ PD L Sbjct: 11 MVLLPWTPPYDWAWMVGFLQARAVAGVERFHDGGYSRNFGVEGHRGLIHLTPDEEAQGLR 70 Query: 61 INLSAGLEPVAAECLAKMSRLFDL 84 + LS GL+PVA C A++ +LFDL Sbjct: 71 VTLSPGLQPVAEICYARIGQLFDL 94 >UniRef50_C2AV46 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2AV46_TSUPA Length = 216 Score = 88.2 bits (217), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 63/197 (31%), Positives = 91/197 (46%), Gaps = 14/197 (7%) Query: 84 LQCNPQIVNGALGR------LGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTAR 137 L +P V+ AL L AA PG+RL GCVD E +R ++GQ +S+A A AR Sbjct: 14 LDADPLTVDEALSTDPRLAPLVAATPGIRLFGCVDPAELLLRTMIGQQISIAAATTHQAR 73 Query: 138 VAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP 195 + + GE +DD + FP+P +A + + L P R A+ +A EG L Sbjct: 74 LVEALGEPVDDPTGRVSRAFPSPAVVAE---RGHEVLTGPRARVTAIRSVAVEIAEGRLT 130 Query: 196 MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRR 255 + +A L G+G WTA+Y A+R D L D ++ + G + Sbjct: 131 LHPGMTRAEARDVLLRLSGVGPWTADYVAMRLLADPDTLLSSDLVVAK---GAAALDLDI 187 Query: 256 YAERWKPWRSYALLHIW 272 W PW SY LH+W Sbjct: 188 ATNHWSPWGSYVSLHLW 204 >UniRef50_D0J4I7 HhH-GPD n=2 Tax=Comamonas testosteroni RepID=D0J4I7_COMTE Length = 329 Score = 85.1 bits (209), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 73/243 (30%), Positives = 108/243 (44%), Gaps = 26/243 (10%) Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNP---QIVNG---ALGRLGAARPGLRLPGCV 111 +LH + E A A + R+ L P ++ +G LG L + + GL +PG Sbjct: 81 SLHAAHTEPAEADRAALQAMVKRMLGLIYAPDQLELAHGDHPELGVLLSRQAGLHVPGSP 140 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL---DDFPEYICFPTPQRLAAADPQA 168 FE AI GQ ++VA+A L ++ L GE L D P +P QR+AA A Sbjct: 141 TPFEALTWAITGQQITVAVAVSLRRKLIALAGEPLAQDGDMPALHAYPDAQRVAALGLDA 200 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMT-----------IPGDVEQAMKTLQTFPGIGR 217 L+ G +A+ L+ +A A EG LP+ DV A L GIG Sbjct: 201 LRGAGFSQAKAQTLLAVAQAVAEGQLPLDDWAARSAVGRWSEEDVAAASAQLLAVKGIGP 260 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ------IRRYAERWKPWRSYALLHI 271 WT NY LRG+ D L D +++ +T + + E++KPWR+ H+ Sbjct: 261 WTVNYTLLRGYGWPDGSLHGDVAVRRAIGLLTGSDKPDARAASDWLEQFKPWRALVAAHL 320 Query: 272 WYT 274 W + Sbjct: 321 WAS 323 >UniRef50_P37878 DNA-3-methyladenine glycosylase n=4 Tax=Bacillaceae RepID=3MGA_BACSU Length = 303 Score = 84.0 bits (206), Expect = 5e-15, Method: Compositional matrix adjust. Identities = 72/286 (25%), Positives = 125/286 (43%), Gaps = 27/286 (9%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV--TAIPDIARHTLHINLSAGL 67 +D + LG+L + + ++ + +A+GE R +V + I + +N S + Sbjct: 18 FDMNANLGYLTREKNECMYEIENNIITKVIAIGEIRSLVQVSVINNKQMIVQFLNDSRPV 77 Query: 68 EPVAAECLAK-MSRLFDLQCNPQIVNGALGRLGAARP----------GLRLPGCVDAFEQ 116 E E + K + FDL + + A P GLR+ G D FE Sbjct: 78 EQWKREEIVKYIHEWFDLDNDLT----PFYEMAKADPLLKMPARKFYGLRVIGIPDLFEA 133 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMP 175 +LGQ +++A A L + + +G+ ++ + +Y FP +R+A P L + M Sbjct: 134 LCWGVLGQQINLAFAYSLKKQFVEAFGDSIEWNGKKYWVFPPYERIARLTPTDLADIKMT 193 Query: 176 LKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +K++E +I +A G L + + + A K L GIG WTANY +R + Sbjct: 194 VKKSEYIIGIARLMASGELSREKLMKMNFKDAEKNLIKIRGIGPWTANYVLMRCLRFPTA 253 Query: 234 FLPDDY-------LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 F DD +++ T +I + WK W+SYA ++W Sbjct: 254 FPIDDVGLIHSIKILRNMNRKPTKDEILEISVPWKEWQSYATFYLW 299 >UniRef50_C7NLP9 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NLP9_KYTSD Length = 286 Score = 83.6 bits (205), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 67/220 (30%), Positives = 97/220 (44%), Gaps = 29/220 (13%) Query: 70 VAAECLAKMSRLFDLQCNPQIVNGALGRLGA---------ARPGLRLPGCVDAFEQGVRA 120 V+ E A++ FDL + VN RLGA +RPG+R+ FE + Sbjct: 75 VSDEIAARVQHWFDLDTDLTPVNA---RLGADPVLAGQVRSRPGIRITRFHAPFEAVILT 131 Query: 121 ILGQLVSVAMAAKLTARVAQLYGER---LDDFPEYICFPTPQRLAAADPQALKA-LGMPL 176 +LGQ VS+A AR+ YG+ + P FPTP L A + L+A +G+ Sbjct: 132 VLGQQVSLAAGRLFAARLIAAYGDDAAPVRQEPGLRVFPTPVALTAVPVEELRAVIGLTG 191 Query: 177 KRAEAL----IHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 RA + H A A + +LP A L GIG WT +Y A+R D Sbjct: 192 TRARTVHAVAAHFAETARDASLP---------ARAELHAVHGIGPWTLDYLAIRASTDAD 242 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 F D ++++ ++P A W P+RSYA +W Sbjct: 243 AFPATDAVLRRTLAAISPDTGPERAASWSPYRSYAASRLW 282 >UniRef50_A1ZCF3 HhH-GPD n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZCF3_9SPHI Length = 207 Score = 83.2 bits (204), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 54/166 (32%), Positives = 87/166 (52%), Gaps = 20/166 (12%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 VR+I+GQ +SV AA + R +L FPE +PTP+ + AA+ LKA G+ + Sbjct: 36 VRSIVGQQLSVKAAATIYQRFREL-------FPE--NYPTPKLVVAAELDTLKAAGLSKQ 86 Query: 178 RAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +A + ++A A+EG L + + E+ ++ L T G+GRWT + +Q DVF Sbjct: 87 KATYIKNVAAFAIEGGLDFEVLNNQTDEEIIQVLITIKGVGRWTVEMLLMFAFQRPDVFS 146 Query: 236 PDDYLIKQRFPGM---------TPAQIRRYAERWKPWRSYALLHIW 272 DD I+Q + A+++ A WKP+R+ A L++W Sbjct: 147 VDDLGIQQAVKKLYQLDEEGKALKAKMKTIANAWKPYRTLACLYLW 192 >UniRef50_D2PPK3 Transcriptional regulator, AraC family n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PPK3_9ACTO Length = 435 Score = 81.6 bits (200), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 79/279 (28%), Positives = 129/279 (46%), Gaps = 37/279 (13%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP---DIARH 57 + L +QPPYDW M+ LAARAV VE+V+D Y R++ + GV+ P D+ Sbjct: 183 LMRLPYQPPYDWDAMVDHLAARAVPGVESVSDRVYRRTIGLDGGAGVLEIGPGEGDVLML 242 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQ---IVNGALGRLGAARPGLRLPGCVDAF 114 H+ GL V + + +RL + P + + LG L ARPGLR+PG A Sbjct: 243 RAHLPYWEGLIHV----VERAARLVGVASEPADRLLRDPLLGPLVVARPGLRVPGAWGAL 298 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKAL 172 E V+A+ Q S+ R+ + G+ + + + FP+ + LA++ +++L Sbjct: 299 EIAVQAVTAQDHSLKETRAQLGRLVKECGQPVPGLTDRLTHLFPSAEVLASSSTGIVQSL 358 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 A + +LEG G E + L T PG+ TA++ ALR +D Sbjct: 359 A-------AAVADGRVSLEG-------GSSEVLLAQLTTVPGLMPDTADWIALR-LGHQD 403 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 VF + A++ ++RW+P R+ A ++ Sbjct: 404 VF-------PRSLHAEVAAEV---SDRWRPHRAVAATYL 432 >UniRef50_Q7N9Z6 Similarities with the C-terminal region of 3-methyladenine DNA glycosylase n=2 Tax=Enterobacteriaceae RepID=Q7N9Z6_PHOLL Length = 299 Score = 81.3 bits (199), Expect = 3e-14, Method: Compositional matrix adjust. Identities = 67/239 (28%), Positives = 101/239 (42%), Gaps = 26/239 (10%) Query: 49 TAIPDIARHTLHINLSAGLEPVAA-ECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRL 107 T PD TL ++ L+PV ECL K + +G L A + GLR+ Sbjct: 72 TTGPDERLSTLASHMPGLLQPVHLFECLYKR-------------HPVIGSLIARQSGLRI 118 Query: 108 PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQ 167 FE A++GQ +SV+ A + R Q G + CFPT +++ Sbjct: 119 YQSATPFEALSWAVIGQQISVSAAISIRRRFIQAMG--VQHSSGLWCFPTARQIINHSED 176 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPG---DVEQAMKTLQTFPGIGRWTANYFA 224 L+ G + +A+AL+ L+ G L + I D++Q + L GIG WT NY Sbjct: 177 ELRQCGFSVSKAKALLRLSQLIESGELTLAISNSETDIQQLIDNLLAIKGIGMWTINYSL 236 Query: 225 LRGWQAKDVFLPDDYLIK---QRFPGMTPAQIRRYAERW----KPWRSYALLHIWYTEG 276 LRG+ + L D ++ QR AE+W PW++ H+W E Sbjct: 237 LRGFNYLNGSLHGDVAVRRNIQRLFNQNEKVSAEQAEKWLADFAPWKALLAAHLWQQES 295 >UniRef50_Q5NXL1 DNA-3-methyladenine glycosidase II n=3 Tax=Betaproteobacteria RepID=Q5NXL1_AZOSE Length = 300 Score = 80.1 bits (196), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 59/190 (31%), Positives = 89/190 (46%), Gaps = 16/190 (8%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 LG L A PGLR+P FE AI GQ +SV A L R+ ++ G L C Sbjct: 107 LGPLIARHPGLRVPLSASPFEALSWAITGQQISVRAAISLRRRLIEVAG--LRHSVGLAC 164 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM-----TIPGDVEQAMKTL 209 +P +R+A + L++ G +A+ LI L E LP+ T+P V++ + L Sbjct: 165 YPDAERVAGLNEADLRSAGFSQAKAQTLIRLGRLVAEDELPLNTWIATLP--VDEIRERL 222 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKP 262 GIG WT +Y LRG+ D L D ++++ +T Q +R+ + P Sbjct: 223 MRVRGIGPWTIDYALLRGFGWLDGSLHGDVVVRRSLQAVLDCPDSVTEGQAKRWLAEFSP 282 Query: 263 WRSYALLHIW 272 WR+ H+W Sbjct: 283 WRALIAAHLW 292 >UniRef50_C6MGP3 HhH-GPD family protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MGP3_9PROT Length = 316 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 52/193 (26%), Positives = 87/193 (45%), Gaps = 11/193 (5%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + LG L A + GLR+P AFE + AI GQ +S++ A + ++ QL G R Sbjct: 105 HAQLGSLIAKQSGLRVPVSATAFEALIWAIAGQKISISAALAIRRKLIQLIGLRHSG--G 162 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD---VEQAMKT 208 C P Q L+ L+ +G +A+ ++ ++ + L ++ +E + Sbjct: 163 LYCHPNAQHLSHLSISDLRQIGFSHSKAQTILTVSQRVICNELELSSASAEPPIEHIRQQ 222 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ------RFPGMTPAQIRRYAERWKP 262 L GIG WT +Y LRG+ D L D +++ + Q R++ E + P Sbjct: 223 LLQIRGIGLWTVDYTLLRGYGWLDGSLHGDVAVRRGLQILLNCESINENQTRQWLENFSP 282 Query: 263 WRSYALLHIWYTE 275 WR+ H+W E Sbjct: 283 WRALVAAHLWNIE 295 >UniRef50_C7QDZ2 Transcriptional regulator, AraC family n=2 Tax=Actinomycetales RepID=C7QDZ2_CATAD Length = 564 Score = 78.6 bits (192), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 48/136 (35%), Positives = 70/136 (51%), Gaps = 9/136 (6%) Query: 146 LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQA 205 +D + + FP P+ LAA D + LG+ + L LA A G L + D +A Sbjct: 421 VDKKADLLPFPRPETLAAGD---YEGLGLTRRTVATLRALATAVASGDLALDRGVDRTEA 477 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQR-----FPGMTPAQIRRYAERW 260 L PGIG WTA+Y ALR + D F D +++++ PG A + +AE W Sbjct: 478 RAKLLAVPGIGPWTADYVALRVFGDPDAFPVGDLIVRRQAERLGLPGAEKALL-AHAESW 536 Query: 261 KPWRSYALLHIWYTEG 276 +PWR+YA LH+W + G Sbjct: 537 RPWRAYAALHLWASSG 552 Score = 77.8 bits (190), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 54/149 (36%), Positives = 77/149 (51%), Gaps = 12/149 (8%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVA----DSYYARSLAVGEYRGVVTAIPDIARHTL 59 L ++ P+D++ +LG+ RA+ V+ V D Y R+L + G V D + + Sbjct: 216 LTYRTPFDFAALLGWFGDRAIPGVDEVVGTGRDLVYRRALRLPHGTGQVELRDD--KGVV 273 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNG------ALGRLGAARPGLRLPGCVDA 113 H L A + + L DL +P V+ AL L AARPGLR+PG VD Sbjct: 274 HARLVVDDLRDVAVAVRRCRDLLDLDADPAQVDAVLAGDPALAPLVAARPGLRVPGAVDG 333 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLY 142 FE VRA+LGQ +SVA A +TAR+ Q + Sbjct: 334 FEIAVRAVLGQQISVAAARTMTARLVQRF 362 >UniRef50_D1CD20 DNA-3-methyladenine glycosylase II n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CD20_THET1 Length = 301 Score = 78.2 bits (191), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 64/190 (33%), Positives = 91/190 (47%), Gaps = 18/190 (9%) Query: 101 ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQ 159 ARP L D E + AI+GQ V+VA A KL AR+ +L G L+ D Y FP Sbjct: 115 ARPVL----IADPLEALMWAIIGQQVNVAFARKLKARLVELCGSVLEVDGERYWVFPPAW 170 Query: 160 RLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGR 217 R+A L+ ++A ++ LA A G L + G VE+A+ L F G+GR Sbjct: 171 RIADLPEDLLRGNQFSRQKARYILGLARAVASGELDLRALGVLPVEEAIAELVRFLGVGR 230 Query: 218 WTANYFALRGWQAKDVFLPDDY----LIKQRFPG---MTPAQIRRYAERWKPWRSYA--- 267 WTA Y +RG DV D ++ + + G T A++R + W PWR++ Sbjct: 231 WTAEYVLMRGLGRADVIPAADLGLRAVMGRHYLGGRVATEAEVREISAAWSPWRAWGAWL 290 Query: 268 -LLHIWYTEG 276 LH+ T G Sbjct: 291 WWLHLQVTRG 300 >UniRef50_C0Z5U6 Putative DNA-3-methyladenine glycosylase II n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5U6_BREBN Length = 309 Score = 77.4 bits (189), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 57/196 (29%), Positives = 90/196 (45%), Gaps = 11/196 (5%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF-- 149 G L L GLR D F+ V+ I+GQ +++ AA LT R+ L G+ +++ Sbjct: 104 EGELAILTERFRGLRPMLDADLFQCMVKTIIGQQINLTFAANLTERLVTLAGDPVENQNG 163 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMK 207 I FPTP +A + L++L ++AE +I A A + T+ + + E+ + Sbjct: 164 EGIIAFPTPDSVARLTVEDLRSLQFSQRKAEYIIDFARAIVNETVDLERLWTMEDEEIIT 223 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---RFPGM----TPAQIRRYAERW 260 L + GIGRWT + G D+ D ++ GM IR+ E+W Sbjct: 224 YLTSLRGIGRWTVECLLMFGMGRPDLLPAADIGLRNGIVHLYGMETKPNENDIRKLGEKW 283 Query: 261 KPWRSYALLHIWYTEG 276 PWRS L++W G Sbjct: 284 APWRSIYCLYVWEAVG 299 >UniRef50_D1Z1B8 Putative DNA glycosidase n=1 Tax=Methanocella paludicola SANAE RepID=D1Z1B8_METPS Length = 303 Score = 76.6 bits (187), Expect = 8e-13, Method: Compositional matrix adjust. Identities = 61/210 (29%), Positives = 91/210 (43%), Gaps = 28/210 (13%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + L N ++ G + + +P P FE + AI Q VS+ + L R+A+ Sbjct: 100 FYALTKNDVVIGGLVRQFCGVKP----PRFPTIFEALLNAIACQQVSLDVGIILLDRLAE 155 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLA------NAALEGTL 194 YG DD FP P+ LA+ + +K LG ++A A+ LA NA+LE Sbjct: 156 RYGRAFDD---EAAFPAPEGLASIPVEEIKKLGFSYQKARAIKELAAAIASGNASLERVY 212 Query: 195 PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF--------- 245 M+ ++A+K L T GIGRW+A Y LRG D F DD + Sbjct: 213 RMS----DQEAIKYLSTLRGIGRWSAEYVLLRGLGRLDSFPADDIGARNNLQRLFHLDHK 268 Query: 246 PGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 PG +I+ RW P+ H+ + Sbjct: 269 PGY--GEIKELTSRWHPYEGLVYFHLLLNK 296 >UniRef50_C6XZ60 HhH-GPD family protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XZ60_PEDHD Length = 301 Score = 75.1 bits (183), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 75/278 (26%), Positives = 122/278 (43%), Gaps = 25/278 (8%) Query: 16 LGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH-INLSAGLEPVAAEC 74 L FL+ + +V + R G +V P + L +N+S E + A Sbjct: 20 LWFLSRDFDDCMYSVFEDRVRRGFRQGSGIMIVDIYPMSDKLILEWLNISPSAEDITA-V 78 Query: 75 LAKMSRLFDLQCN--PQIVNGALGR----LGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 + +S FDL + P A R + GLR G D FE I+GQ +++ Sbjct: 79 VQFVSEWFDLNTDLIPFYKTIAADRRISYMAEDFAGLRFIGMPDFFEALAWCIIGQQINL 138 Query: 129 AMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLAN 187 + A K+ R+ + YG D +Y FP P+ +A A L+ L K+AE +I +A Sbjct: 139 SFAYKVKRRLVERYGTCTQFDGQKYYLFPGPEIIAKASISDLRELQFSEKKAEYIIAIAE 198 Query: 188 AALEGTLPMTIPG---DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF---------- 234 A L G L + D+E +K L GIG+WTANY ++ + Sbjct: 199 AFLNGMLNKELLQRLPDLESRIKFLTNIRGIGQWTANYALMKSLKEPACIPYGDAGLLNA 258 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 L + +IK + PA I ++ + ++ W+SY + ++W Sbjct: 259 LLNHGIIKSK--DNKPA-IAKFFKAFEGWQSYIVFYLW 293 >UniRef50_C4L050 DNA-3-methyladenine glycosylase II n=4 Tax=Bacillales RepID=C4L050_EXISA Length = 297 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 72/287 (25%), Positives = 126/287 (43%), Gaps = 26/287 (9%) Query: 7 QPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAG 66 Q P+D+ L FL+ + + + +GE R ++ + + H +H+ Sbjct: 9 QQPFDFQECLVFLSRSEQEVLHVTTPDMVRKLMRIGE-RLILIELREEVNH-IHVRFPFD 66 Query: 67 -LEPVAAECLAKMSRL-FDLQ--CNPQIVNGALGRLGA----ARPGLRLPGCVDAFEQGV 118 + E +A+ R DL+ P GA L A GLR+ G D FE Sbjct: 67 EVSETEKEHVAREVRNWLDLERDLKPFETMGAKDELLAPLIETHRGLRMIGFPDLFEALT 126 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGE-RLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 AI+GQ ++++ A + R + YG+ R+ + Y FP +R+A +P+ L+ L + Sbjct: 127 WAIIGQQITLSFAYTIKRRFVERYGDHRVIEGRAYWTFPRAERIALLEPEELRELQFSRR 186 Query: 178 RAEALIHLANAALEGTLPMTI-----PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +AE +I +A G L T D+ Q + ++ G+G WTA+Y ++ +Q Sbjct: 187 KAEYVIDIAREITNGDLSKTALQSHSSADIRQRLLAIR---GVGAWTADYVLMKCFQDAS 243 Query: 233 VFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 F D + Q T +++RY E W+ + YA ++W Sbjct: 244 AFPIADVGLHQAIQHQLGTAKKPTIEEVKRYGESWQGFEGYATFYLW 290 >UniRef50_C6W476 HhH-GPD family protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W476_DYAFD Length = 300 Score = 73.2 bits (178), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 4/142 (2%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFP 150 + L + A GLRL G D FE +I+GQ +++ A KL R+ + YG ++ + Sbjct: 102 DSRLAYMTDAFRGLRLVGISDMFEAICWSIIGQQINLTFAYKLKRRMVERYGTHVEWNGE 161 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG---DVEQAMK 207 + FPTP+ LA A L+A+ K+AE ++ +A A +G L + D K Sbjct: 162 VFPVFPTPEALANAGIDELRAMQFSQKKAEYVVGIAQAFADGKLNAEVISALPDFASRQK 221 Query: 208 TLQTFPGIGRWTANYFALRGWQ 229 L + G+G WTANY ++ ++ Sbjct: 222 VLVAYKGVGIWTANYVLMKTFR 243 >UniRef50_B9XBY0 HhH-GPD family protein n=1 Tax=bacterium Ellin514 RepID=B9XBY0_9BACT Length = 294 Score = 72.8 bits (177), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 66/238 (27%), Positives = 100/238 (42%), Gaps = 34/238 (14%) Query: 49 TAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLP 108 T + IAR+ L + + +P E +AK L L L + GLR+P Sbjct: 74 TTLQQIARNLLALRI----DPEPFEAMAKEDNL-------------LASLVQKQTGLRIP 116 Query: 109 GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQA 168 FE AI+GQ ++++ A L QL G + C P +A +P Sbjct: 117 HTTTPFEALAWAIIGQQINLSFAITLRRSFIQLAGTKHSS--GLWCHPDASAVARLNPDH 174 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMT-----IPGDVEQAMKTLQTFPGIGRWTANYF 223 L L +AE L+ +A G LP+ P +++ A+ ++ GIG WT NY Sbjct: 175 LGQLKFSRAKAETLVRMAQLVDSGKLPLDEWQNHSPEEIQAALLAIK---GIGPWTVNYT 231 Query: 224 ALRGWQAKDVFLPDDYLIK---QRFPGM----TPAQIRRYAERWKPWRSYALLHIWYT 274 LRG+ D L D I+ R G T +I +R++P RS H+W + Sbjct: 232 LLRGFAFADCSLHGDAAIRNALNRLSGSATKPTIKEIETLLQRYRPHRSMTAAHLWKS 289 >UniRef50_C7PMW8 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PMW8_CHIPD Length = 302 Score = 72.0 bits (175), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 54/193 (27%), Positives = 90/193 (46%), Gaps = 14/193 (7%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER-LDDFP 150 + L L GLRL G D FE +I GQ +++ A L R Q +G + + Sbjct: 107 DAVLKPLADRYKGLRLIGIPDLFEALTWSITGQQITLGFAYTLRQRFIQAFGHHAVINGK 166 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL--PMTIPGDVEQAMKT 208 +Y +P P +A+ +P +L A+ +A+ +I LA A G L D +QA Sbjct: 167 DYYVYPHPAVVASLEPASLIAMQFSRSKADYIIGLAKAMTGGLLTDKQLWEMDYQQARAH 226 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDD----YLIKQRF-----PGMTPAQIRRYAER 259 L +F GIG W+ANY ++ + + +D +KQ+ P + A ++ Y Sbjct: 227 LISFRGIGNWSANYVLMKYHRHHEALPLEDAGLHNALKQQLQLTAKPSL--ADVKAYTGH 284 Query: 260 WKPWRSYALLHIW 272 W+ + +YA ++W Sbjct: 285 WREYAAYATFYLW 297 >UniRef50_Q82VT3 HhH-GPD n=2 Tax=Betaproteobacteria RepID=Q82VT3_NITEU Length = 205 Score = 70.9 bits (172), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 55/170 (32%), Positives = 79/170 (46%), Gaps = 20/170 (11%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 +AF RAI+GQ +SV AA + +V L PE TP+ L A + L+ Sbjct: 37 NAFATLARAIVGQQISVKAAASVWQKVTTL-------IPEI----TPEALIATEIDLLRT 85 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQ 229 G+ ++ + L L+ LEGTL D+ E ++ L GIGRWTA F + Sbjct: 86 CGLSARKVDYLRDLSRHFLEGTLVTVNWHDLDDETLIRKLVEVKGIGRWTAEMFLIFHLH 145 Query: 230 AKDVFLPDDYLIKQ----RFPGMTPAQ---IRRYAERWKPWRSYALLHIW 272 DV DD +++ + P IR AE W+PWRS A ++W Sbjct: 146 RPDVLPLDDIGLQRAVSLHYNASQPVAKQAIRTIAESWQPWRSVATWYLW 195 >UniRef50_Q9ZET9 DNA-3-methyladenine glycosidase (Fragment) n=1 Tax=Mycobacterium avium subsp. paratuberculosis RepID=Q9ZET9_MYCPA Length = 185 Score = 70.9 bits (172), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 54/164 (32%), Positives = 77/164 (46%), Gaps = 13/164 (7%) Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI---CFPTPQRLAAADPQALKALGMP 175 RA+LGQ VS+ A R+ YG + D PE FP+ Q+LA DP L +P Sbjct: 1 RAVLGQQVSIRAARTHAGRLVAAYGRAVHD-PEGTLTHTFPSVQQLADVDP---IHLAVP 56 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R L L + ++ + D + A L PG+G WTA A+RG D F Sbjct: 57 KARQRTLAALVAGLADRSIVLDTGCDWQSARTQLLALPGVGPWTAEVIAMRGLGDPDAFP 116 Query: 236 PDDYLIK---QRFPGMTPAQ--IRRYAERWKPWRSYALLHIWYT 274 D ++ +R G+ Q + + RW+PWRSYA ++W T Sbjct: 117 AADLGLRVAAKRL-GLPSGQRSLTAASARWRPWRSYATQYLWTT 159 >UniRef50_D1P0X5 DNA-3-methyladenine glycosylase II n=4 Tax=Enterobacteriaceae RepID=D1P0X5_9ENTR Length = 303 Score = 70.9 bits (172), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 54/189 (28%), Positives = 84/189 (44%), Gaps = 14/189 (7%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 LG+L + G+R+ FE V AI+GQ +SV A + R Q G + C Sbjct: 110 LGKLITPQRGVRVYQSASTFEALVWAIIGQQISVLAAIAIRRRFIQAVG--MQHSSGIWC 167 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEAL----IHLANAALEGTLPMTIPGDVEQAMKTLQ 210 FPT Q++A D L+ G + AL + N L+ L +T P +VE L Sbjct: 168 FPTVQQVAQVDDNILRKTGFSTGKIIALRGVCEAIENQRLDLDLTVT-PDNVEDVTAQLL 226 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPW 263 GIG WT +Y LRG+ D L D +++ + + + + + ++ PW Sbjct: 227 AIKGIGPWTISYALLRGFNYLDGSLHGDVAVRRNLQTLLNHTEQPSTKETQHWLVQFAPW 286 Query: 264 RSYALLHIW 272 R+ H+W Sbjct: 287 RALVAAHLW 295 >UniRef50_D1C1F2 HhH-GPD family protein n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C1F2_SPHTD Length = 319 Score = 70.5 bits (171), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 58/193 (30%), Positives = 96/193 (49%), Gaps = 17/193 (8%) Query: 49 TAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLP 108 T +P++ R TL L G++ L+ SRL +P + A RL A+P R P Sbjct: 81 TVVPEL-RRTLMRTLGTGVD------LSGFSRLA--AGDPALAELA-DRLRGAKP-TRYP 129 Query: 109 GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQ 167 +A V AI Q +++ ++ +R+AQ G ++ D ++ FP P+ + P Sbjct: 130 TVYEAL---VNAIACQQITLTFGLRILSRLAQECGMTIERDGETHVAFPRPEDVLTVSPD 186 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFAL 225 L+ LG ++A A++ L+ ++G+L + D+ + AM+ L G+GRWTA Y L Sbjct: 187 RLRELGFSRQKARAVLELSERLVDGSLDLEPLEDLPDDAAMERLLALRGVGRWTAEYVLL 246 Query: 226 RGWQAKDVFLPDD 238 RG +F DD Sbjct: 247 RGLGRVHIFPGDD 259 >UniRef50_A9B7A8 Transcriptional regulator, AraC family n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B7A8_HERA2 Length = 489 Score = 69.7 bits (169), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 53/180 (29%), Positives = 83/180 (46%), Gaps = 15/180 (8%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAA 163 GLR+P + F+ V AILGQ +S+A+A +L R+ +L G+RL+ ++ PTP +A Sbjct: 308 GLRMPLVHNPFDALVWAILGQQISLAVAYRLRQRLTELVGQRLNQ--DFYLAPTPNTIAQ 365 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTAN 221 + L LG +A LI A A + +LP+ + + L GIG WTA Sbjct: 366 LTVEQLLPLGFSNAKARYLIDTAQAIIAESLPLASYHRKSATRIERELLALRGIGPWTAQ 425 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAER---------WKPWRSYALLHIW 272 Y +R + D D + Q+ + +R + P+RS A H+W Sbjct: 426 YVLMRSFGFSDCVPVGDSGLTSSLQAF--FQLEQRPDRSTTLALMAAFSPYRSLATFHLW 483 >UniRef50_B8GAB8 DNA-3-methyladenine glycosylase II n=3 Tax=Chloroflexus RepID=B8GAB8_CHLAD Length = 199 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 49/169 (28%), Positives = 81/169 (47%), Gaps = 20/169 (11%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 +F AI+ Q +S+ A + R+ L GE TP+++ AAD AL+A Sbjct: 33 SFATLAYAIISQQLSLNAARAIRDRLTTLLGE-----------LTPEQILAADTTALRAA 81 Query: 173 GMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++++ L LA + G + + + D E A+ L GIGRWTA + + Sbjct: 82 GLSMQKSGYLRDLAERIVYGQINLELLPTLDDETAIAMLTNVRGIGRWTAEIYLMFALNR 141 Query: 231 KDVFLPDDY-------LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D+ DD L+ Q ++P ++R ERW+P+RS A ++W Sbjct: 142 LDILPADDLGLRDGARLVYQLPQILSPRELRALGERWRPYRSIACWYLW 190 >UniRef50_C7MAP3 Adenosine deaminase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MAP3_BRAFD Length = 515 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 80/294 (27%), Positives = 119/294 (40%), Gaps = 43/294 (14%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTA-IPDIARHTLHINLSAGL 67 P+D + + + A RAV VE V + R++ + GV+ + A H L + L Sbjct: 211 PFDGAGLAAWFAHRAVPGVEEVDGLRWTRAVHLPHGPGVLQVDLGGPAPHPLPLTLRLAD 270 Query: 68 EPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAA-------RPGLRLPGCVDAFEQGVRA 120 A ++ RL DL +P ++ L R A RPG+RLPG E + A Sbjct: 271 LRDHAVAVSLTRRLLDLDADPVGIDDGLRRTLPALAPLLAARPGVRLPGTPTLAEALLWA 330 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRL----AAADPQALKALGMPL 176 + GQ ++ A A R L L PE + + +RL A A +A P Sbjct: 331 VTGQQITTAQARDQITRATDLLATAL---PEALRTGSVERLPVLPANAAARAEDWFRGPR 387 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA------ 230 R+ L A LP P +++ + G+G WTA+Y LRG +A Sbjct: 388 ARSRTLQEAVPAIAADDLPARWP--LDELRSRVLALRGVGPWTADYVLLRGLRAIDAAPA 445 Query: 231 ---------KDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 +D+ L +D+ QR E PWRSYA LH+W + Sbjct: 446 GDAALLGAARDLGLAEDHTALQRV-----------LEAASPWRSYAALHLWQHQ 488 >UniRef50_B4X1U6 Base excision DNA repair protein, HhH-GPD family n=1 Tax=Alcanivorax sp. DG881 RepID=B4X1U6_9GAMM Length = 292 Score = 68.9 bits (167), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 52/190 (27%), Positives = 83/190 (43%), Gaps = 12/190 (6%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 LG+L + GLR+P FE AI+GQ +SV+ A + R QL G+ C Sbjct: 100 LGQLVDRQRGLRVPQSATPFEALSWAIIGQQISVSAATAIRRRFIQLAGQ--TRISGLHC 157 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL---PMTIPGDVEQAMKTLQT 211 +P + D AL+++G +AE L+ ++ E L + D + + L Sbjct: 158 YPDAAAVNQLDASALRSVGFSASKAETLLTVSLCCCEHALLPDALHSVADAQSTEQALLG 217 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGMTPAQIRRYAERW----KPWR 264 G+G W+ NY LRG+ D L D ++ Q+ M + + W PWR Sbjct: 218 IRGLGPWSVNYTLLRGYGYLDGSLHGDVAVQKALQQLLAMKARPTAKATQDWLAAFTPWR 277 Query: 265 SYALLHIWYT 274 + H+W + Sbjct: 278 ALVAAHLWQS 287 >UniRef50_Q2BC23 DNA-3-methyladenine glycosylase II n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BC23_9BACI Length = 299 Score = 67.8 bits (164), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 54/199 (27%), Positives = 85/199 (42%), Gaps = 22/199 (11%) Query: 94 ALGRLGAARP----------GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 GR+ A P GLR+ G D FE V A++GQ +++ A KL + YG Sbjct: 98 GFGRMAAGDPLLKGLAERYAGLRIIGIPDLFEALVWAVIGQQINLTFAYKLKKAFTEKYG 157 Query: 144 ERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV 202 + + FP P +AA +P+ LK L ++AE +I +A E L Sbjct: 158 TCFSYEGRCFWLFPEPGMIAALEPEELKQLQFTGRKAEYIIGIAKLMAEKKLKKDDLLGQ 217 Query: 203 EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---------RFPGMTPAQI 253 A L + G+G WTA+Y ++ F D + R P + ++ Sbjct: 218 PGARDVLMSLKGVGAWTADYVRMKCLLDPAAFPIGDAGFQNALKLQMGLDRKPSIE--EV 275 Query: 254 RRYAERWKPWRSYALLHIW 272 + A RW W++YA+ + W Sbjct: 276 EKAASRWAGWQAYAVFYFW 294 >UniRef50_A5KST9 DNA-3-methyladenine glycosylase II n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KST9_9BACT Length = 239 Score = 66.6 bits (161), Expect = 9e-10, Method: Compositional matrix adjust. Identities = 58/199 (29%), Positives = 90/199 (45%), Gaps = 21/199 (10%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 I + LG L AA+ L D F VR+I+ Q VSVA + + ARV G Sbjct: 49 IQDTKLGALIAAQAPLNRLRKGDYFANLVRSIISQQVSVAASRAILARVQAATGLE---- 104 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE--GTLPMTIPGDVEQAMK 207 P+R+ A +P+ L+ALG+ +A + LA + G ++ + Sbjct: 105 --------PKRILALNPEELRALGLSRPKAGYISDLAEHFVREPGIFDHLERLADDEVIT 156 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---RFPGM----TPAQIRRYAERW 260 L GIG WTA F + D+F PDD +++ R G+ + Q+ AE W Sbjct: 157 ELTRIKGIGAWTAQMFLMFTLGRLDIFAPDDVGLQRAITRLYGLKEVPSRTQLEALAEAW 216 Query: 261 KPWRSYALLHIWYTEGWQP 279 +P+R+ A H+W + +P Sbjct: 217 RPYRTVASWHLWESLTHEP 235 >UniRef50_O31544 Putative DNA-3-methyladenine glycosylase yfjP n=17 Tax=Bacillaceae RepID=YFJP_BACSU Length = 287 Score = 66.6 bits (161), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 60/219 (27%), Positives = 101/219 (46%), Gaps = 17/219 (7%) Query: 68 EPVAAECLAKMSRLFDLQCNPQIV-----NGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 E E + ++ R+F + + Q V +L + G L + ++ I+ Sbjct: 68 ETDQGEMMKEIKRIFQWENHLQHVLDHFSKTSLSAIFEEHAGTPLVLDYSVYNCMMKCII 127 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q ++++ A LT R +GE+ D C+P P+ +A D Q L+ L +++AE Sbjct: 128 HQQLNLSFAYTLTERFVHAFGEQKDGV---WCYPKPETIAELDYQDLRDLQFSMRKAEYT 184 Query: 183 IHLANAALEGTLPMT-IPGDV-EQAMKTLQTFPGIGRWTANYFALRGWQAKDVF-LPDDY 239 I + EGTL ++ +P E MK L GIG WT + G ++F L D Sbjct: 185 IDTSRMIAEGTLSLSELPHMADEDIMKKLIKIRGIGPWTVQNVLMFGLGRPNLFPLADIG 244 Query: 240 L---IKQRFP-GMTPAQ--IRRYAERWKPWRSYALLHIW 272 L IK+ F PA+ + ++ W+P+ SYA L++W Sbjct: 245 LQNAIKRHFQLDDKPAKDVMLAMSKEWEPYLSYASLYLW 283 >UniRef50_Q2FMK1 HhH-GPD n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FMK1_METHJ Length = 309 Score = 65.9 bits (159), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 47/206 (22%), Positives = 89/206 (43%), Gaps = 15/206 (7%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 D C+ ++ RL GLR P FE + +++ Q +S+++A L R + Sbjct: 97 FLDAICSDPVMKSLAHRLD----GLRSPATPTVFEALIDSVIEQQISLSVARSLEYRFIR 152 Query: 141 LYGER-LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP 199 +G + + C+P P+ LA +P + G ++ E + ++ + +G L + Sbjct: 153 QFGRTCFVNGDLHYCYPLPEDLAGLEPSDFRRCGFTSRKGEYIRDISRSIEKGNLDLESF 212 Query: 200 GDVEQA---MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMT 249 V ++ L GIGRWTA LRG D F DD +++ ++ Sbjct: 213 KKVRDNADIVEALCQIRGIGRWTAELTMLRGLHRMDAFPADDIALRRMISRWYHNGKKIS 272 Query: 250 PAQIRRYAERWKPWRSYALLHIWYTE 275 ++ + AE+W ++ A ++ E Sbjct: 273 ASEAVKTAEQWGEYKGLASFYLEVAE 298 >UniRef50_A9BVD9 HhH-GPD family protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BVD9_DELAS Length = 328 Score = 65.9 bits (159), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 65/233 (27%), Positives = 97/233 (41%), Gaps = 25/233 (10%) Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG---AARPGLRLPGCVDAFEQGVRAI 121 AG++ V A L +M L + G RLG A + GL +P +E A+ Sbjct: 92 AGMDDVLAAMLRRMFGLSQDVGEFERRFGRHARLGPLLARQRGLHVPAACTPWEALSWAV 151 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLD------DFPEYI-CFPTPQRLAAADPQALKALGM 174 GQ +SVA A L R+ G+ + D P+ + C P +LA + L+A G Sbjct: 152 TGQQISVAAAVSLRRRLIAAAGQPVALHDGHADAPQQLWCMPEAAQLAQLGEEDLRAAGF 211 Query: 175 PLKRAEALIHLANAALEGTLPM-------TIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 + L LA A G LP+ +P V + + L GIG WT NY LRG Sbjct: 212 SRSKTHTLRLLAQAVQSGELPLDDWAALPELP--VAEIRERLLALKGIGPWTVNYMLLRG 269 Query: 228 WQAKDVFLPDDYLIKQ------RFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 + D L D +++ + M Q + + PWR+ H+W + Sbjct: 270 YGHLDGPLHGDVAVRRALALLLKTDAMDAVQTELWLRDFAPWRALVAAHLWAS 322 >UniRef50_D1VDS6 HhH-GPD family protein n=3 Tax=Actinomycetales RepID=D1VDS6_9ACTO Length = 292 Score = 65.9 bits (159), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 70/230 (30%), Positives = 106/230 (46%), Gaps = 27/230 (11%) Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCN----PQIVNGALGRLGAARPGLRL-PGCVD 112 T+H +L+ G +P A A+++R+ L + P V G A L L P C Sbjct: 61 TVHADLTGGADPAAVR--AQVARILSLDVDGAAFPDCVAADAVAAGLAARHLGLRPVCFP 118 Query: 113 A-FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-YICFPTPQRLAAADP-QAL 169 + +E I+G + + AA+L A +A+ +GE + + FPTP L D L Sbjct: 119 SPYEAACWTIIGHRIRLTQAARLKATIAREHGETITVAGQPTAAFPTPSTLRTVDDLPGL 178 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMT----IPGDVEQAMKTLQTFPGIGRWTANYFAL 225 L M RA +A AAL+G L +P E+A++ LQ PGIG ++A + Sbjct: 179 SELKMGRLRA-----VAQAALDGELDAATLRALP--TEEALRHLQALPGIGPFSAELILI 231 Query: 226 RGWQAKDVFLPDDYLIKQRFPGM----TP--AQIRRYAERWKPWRSYALL 269 RG DVF + + Q +P Q+ R A+RW P+RS+ L Sbjct: 232 RGAGHPDVFPGHERRLHQAMAKAYHLDSPELGQLSRLAQRWAPFRSWVTL 281 >UniRef50_C6A294 AlkA 3-methyladenine DNA glycosylase n=9 Tax=Thermococcaceae RepID=C6A294_THESM Length = 279 Score = 65.1 bits (157), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 52/177 (29%), Positives = 83/177 (46%), Gaps = 14/177 (7%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAA 163 GL +P D ++ V I Q VS A + + +L G++L++ YI FPTPQ + Sbjct: 92 GLTIPKAPDKYQALVETIAQQQVSFEFAMQTIRNLVKLAGKKLENL--YI-FPTPQSILN 148 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG-DVEQAMKTLQTFPGIGRWTANY 222 + + + RA + HL LEG L + + D ++A+K L F GIGRW+A Sbjct: 149 LSEEKFREAKLGY-RAGYIRHLTKEYLEGNLNLDLEELDEKEAIKYLTKFKGIGRWSAEL 207 Query: 223 FALRGWQAKDVFLPDDYLIKQ---RFPGMTPAQ-----IRRYAERWKPWRSYALLHI 271 F G K+V+ D +K+ + G P + +R E + W+S +I Sbjct: 208 FLAYGL-GKNVYPAGDLGMKRGIAKIFGKNPKEVKEKDVREIIEPYGKWKSLLAFYI 263 >UniRef50_Q0VPN7 Putative uncharacterized protein n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VPN7_ALCBS Length = 291 Score = 65.1 bits (157), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 54/194 (27%), Positives = 86/194 (44%), Gaps = 12/194 (6%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 L L + GLR+P FE AI+GQ +SV+ A + R QL D C Sbjct: 99 LNTLVQRQRGLRVPQSATPFEALTWAIIGQQISVSAATAIRRRFIQLASPVRHD--GLHC 156 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT-LPMTIPGD--VEQAMKTLQT 211 +P + P AL+++G +A+ L+ ++ + LP T+ D EQ + L Sbjct: 157 YPDAATVCLLTPDALRSVGFSATKADTLLAVSRLCRDQQLLPETLHLDAYAEQLERNLLE 216 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRR-YAERWKPWR 264 G+G W+ NY LRG+ D L D +++ P A++ R + + PWR Sbjct: 217 IRGLGPWSVNYTLLRGYGFLDGSLHADVAVQKALQMLLGQPERPTARVTRDWLADFTPWR 276 Query: 265 SYALLHIWYTEGWQ 278 + H+W + Q Sbjct: 277 ALVAAHLWQSLSTQ 290 >UniRef50_B6EMH3 DNA repair protein n=2 Tax=Gammaproteobacteria RepID=B6EMH3_ALISL Length = 202 Score = 64.7 bits (156), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 47/169 (27%), Positives = 75/169 (44%), Gaps = 20/169 (11%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 FE + I+ Q +S +AA + R+ L E TP+RL + + Q L+ + Sbjct: 38 GFEAFLSIIVSQQLSTKVAAVIMGRLVALLKE-----------VTPERLLSIEEQNLRDV 86 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++ E LA A G L + + E A+ + + G GRW+A + + Sbjct: 87 GLSWRKIEYAKGLALAVQSGNLDIDGLESLSDEDAISAITSLKGFGRWSAEIYLMFSLGR 146 Query: 231 KDVFLPDDY---LIKQRFPGMT----PAQIRRYAERWKPWRSYALLHIW 272 +D+F DD + R G+T P Q R W+PWRS L +W Sbjct: 147 QDIFPADDLGVLIALGRLKGLTDKPTPKQAREMVGHWQPWRSVGSLFLW 195 >UniRef50_Q1AWP7 DNA-3-methyladenine glycosylase II n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWP7_RUBXD Length = 163 Score = 64.3 bits (155), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 53/170 (31%), Positives = 83/170 (48%), Gaps = 26/170 (15%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R ++GQ +SV A + AR+ +G R P P L A + L+A G+ Sbjct: 1 MRTVVGQQLSVGAARSIYARLCARFGGR---------PPLPGELEAVPDEELRACGVSGA 51 Query: 178 RAEALIHLANAALEGTLPMT----IP-GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +A L LA LEG LP+ +P G+V A+ ++ GIGRW+A F + + D Sbjct: 52 KARCLRELARRVLEGGLPLEELRGLPDGEVISALTAVR---GIGRWSAQMFLIFHLRRPD 108 Query: 233 VFLPDDYLIKQR------FPGMTPAQ--IRRYAERWKPWRSYALLHIWYT 274 V D I++ P + PA+ + R A W+PWR+ A L++W + Sbjct: 109 VLPAADLGIRRAAALLYGLPEL-PAEELLERLAAPWRPWRTTACLYLWRS 157 >UniRef50_C6WJ98 Transcriptional regulator, AraC family n=5 Tax=Actinomycetales RepID=C6WJ98_ACTMD Length = 584 Score = 63.2 bits (152), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 84/288 (29%), Positives = 118/288 (40%), Gaps = 32/288 (11%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++ P + G L A AV VE D Y R+L + GVV+ P + L Sbjct: 303 LPFRGPLHAPSLFGPLVANAVPGVEEWRDGAYRRTLRLPRGHGVVSLRPRADHVECDLTL 362 Query: 64 SAGLE-PVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP------GLRLPGCVDAFEQ 116 + + PVA +++ R DL +P V+GAL A RP G R+PG VD E Sbjct: 363 TDSRDLPVA---ISRCRRALDLDADPAEVDGALRADPALRPLVDAAPGTRVPGVVDGAEC 419 Query: 117 GVRAILGQLVSVAMAAKLTA------RVAQLYGERLDDFPE---YICFPTPQRLAAADPQ 167 VRA+LG+ A A RV + GE + D FPTPQ L DP Sbjct: 420 AVRALLGEGTGTGAAMGAGANAGWAHRVVREAGEAVPDPAGGGLTHLFPTPQALLDLDPA 479 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 L A A + AL G + + D +A L+ G + R Sbjct: 480 LLPPP------ARAPLTALLTALVGGVDLGAGADRAEARSALRC---AGERVLDAVLTRS 530 Query: 228 WQAKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHI 271 D F PDD ++ G+ T A + + W+PWR+YA ++ Sbjct: 531 LGDPDGFCPDDPAVRAAAGGIGLPVTAAALADRSRAWRPWRAYATRYL 578 >UniRef50_B3T536 Putative HhH-GPD superfamily base excision DNA repair protein n=1 Tax=uncultured marine microorganism HF4000_ANIW137P11 RepID=B3T536_9ZZZZ Length = 209 Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 51/191 (26%), Positives = 84/191 (43%), Gaps = 19/191 (9%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 +++ AL + A+ L L D F V+AI+GQ +S+ AA + RV L GE Sbjct: 20 LIDPALAAVINAKGELGLSSRGDLFATLVKAIVGQQISIKAAATVWGRVVDLIGE----- 74 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG-DVEQAMKT 208 P+ + A + L++ G+ ++AE + +A A G D E+A++ Sbjct: 75 ------VKPESVLAHTHEELRSCGLSNRKAEYVAGIAEAWQGGYAEYDWDSMDDERALEL 128 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQIRRYAERWK 261 L G+GRWTA + DVF DD + + + A++ A W Sbjct: 129 LVALRGVGRWTAEMVLIFTLLRPDVFPIDDLGVVRGMEKVYNEGEVLDKAELNDIASNWS 188 Query: 262 PWRSYALLHIW 272 PWR+ ++W Sbjct: 189 PWRTVGSWYMW 199 >UniRef50_A5WCQ9 HhH-GPD family protein n=2 Tax=Psychrobacter RepID=A5WCQ9_PSYWF Length = 231 Score = 62.8 bits (151), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 47/175 (26%), Positives = 76/175 (43%), Gaps = 21/175 (12%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F + +RA++GQ +SVA A+ + +++ E TP + AD L++ G Sbjct: 66 FRELMRAMVGQQLSVAAASSIWSKL------------ENAALITPDAIMKADDDTLRSHG 113 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + ++ + L ++ +P E + L GIG+WTA + L D+ Sbjct: 114 LSRQKIRYIRSLVEHDIDFEALAHLPD--EAVISELTAVTGIGKWTAQMYLLFSLGRADI 171 Query: 234 FLPDDYLIK---QRFPGM----TPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 DD IK G+ TP Q+ R + W P RS A L +W GW D+ Sbjct: 172 LAVDDLAIKVGAMEVLGLDERPTPKQLERLTQSWSPHRSAASLLLWAHYGWLKDQ 226 >UniRef50_B5IDT4 Base excision DNA repair protein, HhH-GPD family n=3 Tax=Aciduliprofundum boonei T469 RepID=B5IDT4_9EURY Length = 289 Score = 62.0 bits (149), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 44/178 (24%), Positives = 81/178 (45%), Gaps = 9/178 (5%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG--ERLDDFPEY 152 L ++ GLR ++ +E ++ +L Q +S+ A TA++ + +G E+ + + Y Sbjct: 102 LYKMAITYSGLRPARNLNLYEALIKIVLQQRISLKYALNTTAKLIEKWGIREKWNGY-SY 160 Query: 153 ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQ 210 FP P++L +KALG +A++L+ +A G LP + + E+ +K L Sbjct: 161 YSFPPPEKLMRISTSEIKALGTTTVKAKSLLEIAKMEYNGDLPSIYEVNKNPEEYVKFLT 220 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ----IRRYAERWKPWR 264 G+G WTA + D +++ F Q IR Y E++ W+ Sbjct: 221 GIYGVGMWTAELSVATVIHDYSIAPAGDLNVRKAFSKFLGLQGEKEIREYTEKFGKWK 278 >UniRef50_B1ZV80 Transcriptional regulator, AraC family n=2 Tax=Opitutaceae RepID=B1ZV80_OPITP Length = 523 Score = 61.6 bits (148), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 54/187 (28%), Positives = 79/187 (42%), Gaps = 11/187 (5%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 L RL A R LR+ F+ V AI+GQ ++ + A L R+ +L G RL + + Sbjct: 331 LARLVAGRSELRISRIPSVFDGLVWAIIGQQINFSFACVLKRRLTELAGTRLSN--GLMA 388 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT-IPG-DVEQAMKTLQTF 212 PTP +A +P L L ++A LI A A G L + +P +A +TL Sbjct: 389 PPTPTAIARLEPDELVPLQFSRQKAGYLITTARAITAGELDLAQLPSMSASRAERTLLAL 448 Query: 213 PGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ-------IRRYAERWKPWRS 265 G G W+ NY +R D D + + + RR + P RS Sbjct: 449 HGFGPWSVNYVMMRALGFADCVPLGDTGVTSGLQSLLHLEQRPDVDATRRLMAVFSPHRS 508 Query: 266 YALLHIW 272 A H+W Sbjct: 509 LATAHLW 515 >UniRef50_D1RHI7 HhH-GPD family base excision repair protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RHI7_LEGLO Length = 263 Score = 60.5 bits (145), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 41/138 (29%), Positives = 65/138 (47%), Gaps = 3/138 (2%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-YICFPTPQRLA 162 G++ P FE + AI Q +S+ + R+ Q G +++ + Y FPT + + Sbjct: 75 GVKPPCFPSFFEALINAISCQQISLDAGLHIQNRLVQHIGMKMNHENQVYYAFPTAEDVG 134 Query: 163 AADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQAMKTLQTFPGIGRWTA 220 LK +G ++E ++ LA+ E + D E+ ++ L F GIGRWTA Sbjct: 135 HCSVAELKKIGYSTHKSETIVSLASMLKEEHSFLNRLEDKPTEEVIQLLCQFKGIGRWTA 194 Query: 221 NYFALRGWQAKDVFLPDD 238 Y LRG +VF DD Sbjct: 195 EYVLLRGLGRIEVFPGDD 212 >UniRef50_B8IZY6 HhH-GPD family protein n=8 Tax=Bacteria RepID=B8IZY6_DESDA Length = 235 Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 46/174 (26%), Positives = 79/174 (45%), Gaps = 21/174 (12%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D F + +I+GQ +S A + R+ E C TP+ + ++L+ Sbjct: 41 DIFNALLNSIVGQQISTKAQATIWKRMR-----------EQFCPITPENIGTISAESLQT 89 Query: 172 LGMPLKRAEALIHLANAALEGTLPMT-IPG--DVEQAMKTLQTFPGIGRWTANYFALRGW 228 G+ +++A + + A L+G+L + +P D E + +Q GIG WTA + Sbjct: 90 CGISMRKAAYIKSITEAVLDGSLDLARLPSLTDKEICAQLVQ-LKGIGVWTAEMIMIFSM 148 Query: 229 QAKDVFLPDDYLIKQ------RFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 Q D+ DD I++ R +TPA RY +R+ P + A L++W G Sbjct: 149 QRPDILSWDDLAIQRGLRMLYRHRQITPALFARYRKRYSPHATTASLYLWAIAG 202 >UniRef50_UPI00016C4C1A DNA-3-methyladenine glycosylase II n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4C1A Length = 227 Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 59/206 (28%), Positives = 90/206 (43%), Gaps = 26/206 (12%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY------- 142 ++NG +GR+G P L +P D F VR ++GQ +S A + R+A+ Sbjct: 19 VMNGLIGRVG---PCLLMPRGEDPFTLLVRCVIGQQISTKAAESIYNRLARAVNPPPEGP 75 Query: 143 ----GERLDDFPEYICFPTPQRLAAADPQALKALGM--PLKRA-EALIHLANAALEGTLP 195 G L + P +LAA K G+ P +R A++ A A + LP Sbjct: 76 HPADGTSLAMWQREGIMPM-DKLAALSEAEFKECGVSGPKQRTLRAVVEHARANPD-LLP 133 Query: 196 MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM------- 248 D + + L GIG WT + + L G DV DY IK + Sbjct: 134 SIAGLDDDTIRERLTVIKGIGPWTVDMYLLFGLGRPDVLSVGDYGIKVAVKNLFRLRKLP 193 Query: 249 TPAQIRRYAERWKPWRSYALLHIWYT 274 PA++ R A+ W+P+RS AL ++W + Sbjct: 194 DPAKLTRVAKPWQPYRSVALWYLWRS 219 >UniRef50_B9LPN6 HhH-GPD family protein n=4 Tax=Halobacteriaceae RepID=B9LPN6_HALLT Length = 198 Score = 60.5 bits (145), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 48/192 (25%), Positives = 79/192 (41%), Gaps = 21/192 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + + RL L + D F + +I+ Q +S A AA + R + G Sbjct: 12 DSTMARLIDRHGRLTIEPAADEFARLCTSIVNQQLSTASAAAIHERFVDVLG-------- 63 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG----DVEQAMK 207 PTP + AAD AL+ G+ + E L A A +G +T G E + Sbjct: 64 --GAPTPDDVLAADEVALREAGLSGTKVEYLREAAAAFRDGDRDLTREGFGDASDEAVVA 121 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERW 260 L G+G WTA + + +DV D +++ +T A++R + W Sbjct: 122 ALTEIRGVGEWTARMYLIFALGREDVLPLGDLAVRKGIEQVYNDGAELTRAEMRNIGDAW 181 Query: 261 KPWRSYALLHIW 272 +P+RSY ++W Sbjct: 182 RPYRSYGTRYVW 193 >UniRef50_Q1ITU3 DNA-3-methyladenine glycosylase II n=2 Tax=Bacteria RepID=Q1ITU3_ACIBL Length = 251 Score = 60.1 bits (144), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 49/178 (27%), Positives = 85/178 (47%), Gaps = 20/178 (11%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY-------ICFPTPQRLAAA 164 + FE + +I+ Q +S AA + RV LY D P + + FPTP++L A Sbjct: 38 NVFEALMESIVYQQLSGKAAATILNRVKALYFP--PDTPTHDTRHGKALPFPTPEQLLAT 95 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLP----MTIPGDVEQAMKTLQTFPGIGRWTA 220 + L++ G+ + +++ LA ++GT+P M D ++ + L GIGRWT Sbjct: 96 PDETLRSAGLSGNKTKSVKDLAAKTIDGTVPDIATMKKMSD-DEIINHLTQVRGIGRWTV 154 Query: 221 NYFALRGWQAKDVFLPDDYLIKQRFPGM------TPAQIRRYAERWKPWRSYALLHIW 272 L KDV+ DD +++ + + P ++ E +KP+RS A ++W Sbjct: 155 EMILLFNLFRKDVWPVDDLGVRKGYGYLHGIEMPKPKELMALGEVYKPYRSVAAWYMW 212 >UniRef50_Q92383 DNA-3-methyladenine glycosylase 1 n=1 Tax=Schizosaccharomyces pombe RepID=MAG1_SCHPO Length = 228 Score = 59.7 bits (143), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 81/187 (43%), Gaps = 19/187 (10%) Query: 98 LGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPT 157 +G RP + + +E+ +RA+ Q +L ++ A R FPT Sbjct: 36 VGNYRPNRSMEK-KEPYEELIRAVASQ--------QLHSKAANAIFNRFKSISNNGQFPT 86 Query: 158 PQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD---VEQAMKTLQTFPG 214 P+ + D + ++A G ++ ++L +A A + G +P + E+ ++ L G Sbjct: 87 PEEIRDMDFEIMRACGFSARKIDSLKSIAEATISGLIPTKEEAERLSNEELIERLTQIKG 146 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIK------QRFPGM-TPAQIRRYAERWKPWRSYA 267 IGRWT + DV DD I+ R P + T + +++E P+R+ A Sbjct: 147 IGRWTVEMLLIFSLNRDDVMPADDLSIRNGYRYLHRLPKIPTKMYVLKHSEICAPFRTAA 206 Query: 268 LLHIWYT 274 ++W T Sbjct: 207 AWYLWKT 213 >UniRef50_Q81IC3 DNA-3-methyladenine glycosylase II n=75 Tax=Bacillus RepID=Q81IC3_BACCR Length = 287 Score = 59.3 bits (142), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 45/171 (26%), Positives = 81/171 (47%), Gaps = 14/171 (8%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D F +R I+ Q +++ A LT + + YG + FPTP+ +A + L+ Sbjct: 117 DYFACLLRCIIHQQINLKFATVLTEQFVKRYGTEKNGV---FFFPTPEIVANISIEELRE 173 Query: 172 LGMPLKRAEALIHLANAALEGTLPM-TIPGDVEQAMKTLQTFP--GIGRWTANYFALRGW 228 ++AE ++ L + + GTL + +I E+ + + Q P GIG WT F + G Sbjct: 174 QKFSQRKAEYIVGLGRSIVSGTLNLASIENGTEEDI-SAQLLPIRGIGAWTVQNFLMFGL 232 Query: 229 QAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 K++F D I++ G+ A + + + +P+ SYA L++W Sbjct: 233 GRKNMFPKADIGIQRAVQGIFQLDDKPDDAFLEKVKQECEPYCSYAALYLW 283 >UniRef50_A8TVS7 HhH-GPD n=1 Tax=alpha proteobacterium BAL199 RepID=A8TVS7_9PROT Length = 229 Score = 59.3 bits (142), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 59/201 (29%), Positives = 85/201 (42%), Gaps = 34/201 (16%) Query: 85 QCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 Q +P + L +G P + PG F +R ++GQ VS A AA + R+ + G+ Sbjct: 22 QAHPAL-GAVLVEIGPPEPRILQPG----FGSLLRIMVGQQVSTASAAAIWGRLVEASGD 76 Query: 145 RLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP----MTIPG 200 +D F D +AL +G + LA+A L+GTL +PG Sbjct: 77 TVDGFNSL------------DDEALGRVGFSRAKMRYGRALADAVLDGTLNPDDLEKLPG 124 Query: 201 DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---------RFPGMTPA 251 EQ L PGIGRWTA + + DVF D +++ P + A Sbjct: 125 --EQVSAQLMALPGIGRWTAEIYRMFALGDPDVFPIGDLALREGVRMALDLPERPDLGAA 182 Query: 252 QIRRYAERWKPWRSYALLHIW 272 + R WKP RS A L +W Sbjct: 183 E--RLTAAWKPERSAAALLLW 201 >UniRef50_B4CYJ1 DNA-3-methyladenine glycosylase II n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYJ1_9BACT Length = 214 Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 50/171 (29%), Positives = 78/171 (45%), Gaps = 22/171 (12%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLY-GERLDDFPEYICFPTPQRLAAADPQALKAL 172 F VRA+ Q ++ A + R L+ G++ FPT + LA+ +AL+ Sbjct: 37 FRALVRAVAHQQLNGTAAETILRRFCALFPGKK---------FPTAKDLASVTDEALRGS 87 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIP----GDVEQAMKTLQTFPGIGRWTANYFALRGW 228 G + AL +A L+GT+P T D E + +Q G+GRWT + Sbjct: 88 GFSWAKIAALRDIAAKTLDGTIPSTRAIQKMNDAEIIERLVQV-RGVGRWTVEMMLIFKL 146 Query: 229 QAKDVFLPDDYLIKQRFP---GM----TPAQIRRYAERWKPWRSYALLHIW 272 DVF DD+ I+ F G+ P +I +AERW+P+ + A + W Sbjct: 147 GRPDVFPADDFGIRDGFRVAYGLDEMPKPKEILAHAERWRPYATTAAWYFW 197 >UniRef50_Q01SY7 DNA-3-methyladenine glycosylase II n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01SY7_SOLUE Length = 200 Score = 58.9 bits (141), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 52/206 (25%), Positives = 90/206 (43%), Gaps = 23/206 (11%) Query: 84 LQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 L+ + +++ + R+GA R P FE VR+I+ Q +S +A + R+ G Sbjct: 8 LRKSDPVLSAIIERVGAYGIQFREPD----FETLVRSIVYQQLSGRVAKVILDRLVAAVG 63 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT-IPGDV 202 + TP+++ A P ++ LG+ ++ + LA +G L T +P Sbjct: 64 REV----------TPEKILALRPGRMRKLGLSTQKTAYIRDLARHTRDGRLVFTELPALT 113 Query: 203 -EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIR 254 E+ ++ L GIG WTA F + + DV D ++ TPA++ Sbjct: 114 DEEVIERLTQVKGIGVWTAQMFLMFALRRHDVLPTGDLGVRNAIRKAYDLAELPTPAEME 173 Query: 255 RYAERWKPWRSYALLHIWYTEGWQPD 280 A W+PW S A ++W + Q D Sbjct: 174 ELARNWRPWCSVASWYLWRSLEGQAD 199 >UniRef50_Q1J274 Endonuclease III, DNA-3-methyladenine glycosidase II n=3 Tax=Deinococcus RepID=Q1J274_DEIGD Length = 216 Score = 58.5 bits (140), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 47/166 (28%), Positives = 72/166 (43%), Gaps = 18/166 (10%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F VR+++GQ +S AA + AR+ G P+ L P L+ALG Sbjct: 47 FGTLVRSVVGQQLSTQAAASIAARLEDALGGV-----------EPEALLRTPPDKLRALG 95 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQA--MKTLQTFPGIGRWTANYFALRGWQAK 231 + + + LA+AAL G + + A + L PGIGRWT F + G Sbjct: 96 LSWAKVRTVRALADAALSGQVDFAHLSSLPDAAVIDALTPLPGIGRWTVEMFLMFGLARP 155 Query: 232 DVFLPDDYLIKQ----RFPGMTPAQIR-RYAERWKPWRSYALLHIW 272 DVF D +++Q +P + P + W P+R+ A +W Sbjct: 156 DVFSFGDLVLRQGLSRLYPHVAPGSAQAAVVAAWSPYRTLAARVLW 201 >UniRef50_Q3INX6 DNA N-glycosylase / DNA lyase n=6 Tax=Halobacteriaceae RepID=Q3INX6_NATPD Length = 203 Score = 58.5 bits (140), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 57/196 (29%), Positives = 88/196 (44%), Gaps = 24/196 (12%) Query: 84 LQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 L+ +P ++ + R GA L + D F + + ++L Q VS+A A + Sbjct: 8 LRADP-VLEPLIERHGA----LTIEPADDLFRRLLVSVLRQQVSMASA--------EATK 54 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGD 201 +RL D E PTP + AAD + + G+ ++A L ++A A + P D Sbjct: 55 KRLFDAVE----PTPTAVLAADTETFREAGLSRQKATYLHNIAAAFEDHGYDRAYFEPMD 110 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYA---- 257 E L G+G WTAN L +DVF D I++ + + R A Sbjct: 111 DEAVRAELTDITGVGEWTANMQLLFSLGREDVFPVGDLGIRKGMRALLDEDLDRAAMTEA 170 Query: 258 -ERWKPWRSYALLHIW 272 ERW P+RSYA L++W Sbjct: 171 AERWAPYRSYASLYLW 186 >UniRef50_C1DYL3 Predicted protein n=2 Tax=Micromonas RepID=C1DYL3_9CHLO Length = 291 Score = 58.5 bits (140), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 48/173 (27%), Positives = 79/173 (45%), Gaps = 18/173 (10%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 AF RAI+ Q ++ AA + RV + G + D + TP + AD A++A Sbjct: 83 AFRSLARAIVFQQLNGTAAATIFGRVLRCVGAQDD-----VLALTPDAIIDADEAAMRAC 137 Query: 173 GMPLKRAEALIHLANA--ALEGTLPMTIPG----DVEQAMKTLQTFPGIGRWTANYFALR 226 G+ ++ E L+ LA A P++ D M L GIG W+ + F + Sbjct: 138 GLSQRKHEYLVALARAFHPAHSDFPLSDESLEAMDDTAVMSALVALRGIGPWSVHMFQMF 197 Query: 227 GWQAKDVFLPDDYLIKQ---RFPGM----TPAQIRRYAERWKPWRSYALLHIW 272 DV D+ +++ R G+ + A++ AERWKP R+ A +++W Sbjct: 198 YLNRPDVLPTKDFGVRKGVMRLYGLRDMPSEAKVEEIAERWKPHRTLASMYMW 250 >UniRef50_C1D8D7 HhH-GPD family protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D8D7_LARHH Length = 208 Score = 58.2 bits (139), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 55/193 (28%), Positives = 89/193 (46%), Gaps = 26/193 (13%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARV-AQLYGERLDDFPEYI 153 + RL A+ P L + FE +RAI+GQ +SV A + R+ A L G+ Sbjct: 21 MARLIASWPDAELVSRGEPFETLLRAIVGQQISVRAADAVWKRLSAVLSGQ--------- 71 Query: 154 CFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL-PMTIPG-DVEQAMKTLQT 211 P+P+R+ A + L++ G+ ++ LA +G + P G D E + L Sbjct: 72 --PSPERVLALPEEVLRSAGLSARKVLYARDLAECFTDGRVNPAAHAGLDDEALIAELVA 129 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDD----------YLIKQRFPGMTPAQIRRYAERWK 261 GIGRWTA + + DV+ DD Y ++ + +T Q+R ER+ Sbjct: 130 VRGIGRWTAEMYLIFNQLRPDVWPVDDIGLQRAMARHYALEDQKASLT--QLRVMGERFA 187 Query: 262 PWRSYALLHIWYT 274 PWR+ A ++W + Sbjct: 188 PWRTVATWYLWRS 200 >UniRef50_Q1H1S0 DNA-3-methyladenine glycosylase II n=1 Tax=Methylobacillus flagellatus KT RepID=Q1H1S0_METFK Length = 142 Score = 58.2 bits (139), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 39/130 (30%), Positives = 58/130 (44%), Gaps = 10/130 (7%) Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQT 211 FP P L A D L+A G ++ + LA AAL G +P + + E + L Sbjct: 6 FPAPAALLATDVAQLRACGFSGRKITYITGLAQAALAGNIPDHATALAMEDEALITQLTA 65 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---RFPGM----TPAQIRRYAERWKPWR 264 PGIGRWT + + D+ DD +++ R G+ TP +R W P R Sbjct: 66 LPGIGRWTVEMMLMHTLRRADILPVDDLGVREGFRRLKGLSTAPTPRLLRDIGLAWSPHR 125 Query: 265 SYALLHIWYT 274 S A ++W+ Sbjct: 126 SSAAWYLWHV 135 >UniRef50_Q5FSB3 DNA-3-methyladenine glycosylase n=1 Tax=Gluconobacter oxydans RepID=Q5FSB3_GLUOX Length = 219 Score = 58.2 bits (139), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 48/171 (28%), Positives = 76/171 (44%), Gaps = 14/171 (8%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 + ++ +RAI GQ + A A K+ R+ L + D P P P R+ + + L+A Sbjct: 37 EPYDALLRAIAGQQLHGAAARKIFGRLCLLGAQESVDGPP----PAPGRILSLSEERLRA 92 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDV---EQAMKTLQTFPGIGRWTANYFALRGW 228 G+ + A+ LA A L+G +P V E+ + L T GIGRWT + Sbjct: 93 CGLSGNKILAMKGLAQARLDGLVPSRAEASVMTDEELIARLVTLRGIGRWTVEMLLMFTL 152 Query: 229 QAKDVFLPDDYLIKQ---RFPGM----TPAQIRRYAERWKPWRSYALLHIW 272 DV DD+ +++ R M P ++ ER+ P RS + W Sbjct: 153 NRPDVMPVDDFGVREGWRRIRKMDLPPKPKALKEETERFAPHRSTLAWYCW 203 >UniRef50_A5V920 HhH-GPD family protein n=7 Tax=Sphingomonadales RepID=A5V920_SPHWW Length = 368 Score = 58.2 bits (139), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 47/165 (28%), Positives = 75/165 (45%), Gaps = 18/165 (10%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R I+GQ VSVA A + A++ + G L P+ +AAA L+A G+ + Sbjct: 199 LRTIVGQQVSVAAANAIWAKMETMVGAGL----------APEAVAAAPDDLLRATGLSRQ 248 Query: 178 RAEALIHLANAALEGTLPMT-IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 + LA GT+ +P D E+A+ + GIGRW+A + L DV+ Sbjct: 249 KIAYARSLAEHVASGTIDFDRLPADDEEAIAQMTAIKGIGRWSAEIYLLFAEGRGDVWPA 308 Query: 237 DDYLIK---QRFPGM----TPAQIRRYAERWKPWRSYALLHIWYT 274 D ++ R G+ + + RR A W P R A + W++ Sbjct: 309 GDLAVQIEVGRLLGLPERPSERETRRLAHGWSPHRGAAAIFAWHS 353 >UniRef50_Q6CEP5 YALI0B14080p n=1 Tax=Yarrowia lipolytica RepID=Q6CEP5_YARLI Length = 360 Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 7/137 (5%) Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQAL 169 C + FE R I+GQ VS A A + + +L+ + E FP+PQ + +AL Sbjct: 135 CNNCFEHLTRGIIGQQVSGAAAESILKKFKKLFPV---EGSEDGKFPSPQEILDTPTEAL 191 Query: 170 KALGMPLKRAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 ++ G+ ++AE + L+ A +GTL ++ D + + L GIG W+A+ F L Sbjct: 192 RSAGLSGRKAEYITCLSTAFKDGTLSDDWLSTASD-DDVVDALVAIKGIGPWSADMFLLF 250 Query: 227 GWQAKDVFLPDDYLIKQ 243 + DVF D I++ Sbjct: 251 ALKRMDVFTLGDLGIQR 267 >UniRef50_Q0BSG3 DNA-3-methyladenine glycosylase II n=12 Tax=Proteobacteria RepID=Q0BSG3_GRABC Length = 255 Score = 58.2 bits (139), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 49/169 (28%), Positives = 68/169 (40%), Gaps = 17/169 (10%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 FE +RAI Q + A + AR L FP FP+P + A D + L+ G Sbjct: 65 FEALIRAIAHQQLHARAAEAILARFLAL-------FPVNTDFPSPLEIMALDTETLRQCG 117 Query: 174 MPLKRAEALIHLANAALEGTLPM---TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 + AL + AA G +P D E ++ L T GIGRWT + Sbjct: 118 FSGTKIIALRGVCEAAQGGIIPDRSGCTALDDETLIQRLTTLRGIGRWTVEMLMIFTLGR 177 Query: 231 KDVFLPDDY-------LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D+ DD+ LIK P + + W PWRS A ++W Sbjct: 178 TDILPVDDFGVREGWRLIKGLESQPRPKILADIGQSWSPWRSLAAWYLW 226 >UniRef50_Q0USE2 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0USE2_PHANO Length = 240 Score = 57.8 bits (138), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 41/151 (27%), Positives = 72/151 (47%), Gaps = 13/151 (8%) Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPE-YICFPTPQRLAAADPQALKALGMPLKRAE 180 +GQ VS A AA + + L FPE + FP+P ++ D L+ G+ ++AE Sbjct: 1 MGQQVSGAAAASIRKKFTSL-------FPETHPSFPSPSQILEKDLPTLRTAGLSQRKAE 53 Query: 181 ALIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 + LA G L PM + E+ ++ L G+GRW+ FA G + DVF D Sbjct: 54 YISGLAEKFASGELSAPMLVTASDEELIEKLVAVRGLGRWSVEMFACFGLKRMDVFSTGD 113 Query: 239 YLIKQ---RFPGMTPAQIRRYAERWKPWRSY 266 +++ + G ++++ +WK +++ Sbjct: 114 LGVQRGMAAYMGRDTSKLKAKGGKWKYVKTH 144 >UniRef50_Q0BWS7 Putative DNA-3-methyladenine glycosylase n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWS7_HYPNA Length = 213 Score = 57.4 bits (137), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 52/168 (30%), Positives = 73/168 (43%), Gaps = 23/168 (13%) Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 R I Q +S AA + RV GE TP+ L AADP AL+A G+ + Sbjct: 47 RMISHQQLSTKAAATIWGRVEVFLGE-----------VTPETLLAADPDALRACGLSRPK 95 Query: 179 AEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 L +A A + G L + D++ A L + GIG WTA F L A D F Sbjct: 96 VAHLTSIAEAMVTGELNLARVCAADLDSARAELVSVRGIGPWTAELFLLYAVGAMDAFPI 155 Query: 237 DDYLIKQ------RFPGMTPAQI-RRYAERWKPWRSYALLHIWYTEGW 277 D + + R+ ++I ++AE W+P R A +W GW Sbjct: 156 ADVGLMEAHKQLGRYETRMESKIFTQHAEIWRPHRGVAAHLLW---GW 200 >UniRef50_B7K2N0 DNA-3-methyladenine glycosylase II n=5 Tax=Chroococcales RepID=B7K2N0_CYAP8 Length = 206 Score = 57.4 bits (137), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 20/190 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L L + P + + F V+AI+GQ +SV A ++ R+ L G Sbjct: 18 DKILAYLISLYPDETIINYHNPFYTLVKAIIGQQISVNAANAISKRLESLLGT------- 70 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL-PMTIPGDVEQ-AMKTL 209 + + A D +AL+ G+ + + ++A A +G L P P +Q + L Sbjct: 71 ----ISIETYLAMDSEALRQCGLSRPKISYITNIAQAFEQGILTPQIWPMMSDQEVISQL 126 Query: 210 QTFPGIGRWTANYFALRGWQAKDVF-LPDDYLIK--QRFPG----MTPAQIRRYAERWKP 262 + GIG WTA F + D+ L D LI QR G +T +I+ ++ WKP Sbjct: 127 ISIKGIGLWTAQMFLIFHLHRSDILPLADLGLINAIQRHYGQSQRLTKGEIQELSQVWKP 186 Query: 263 WRSYALLHIW 272 +R+ A ++W Sbjct: 187 YRTVATWYLW 196 >UniRef50_C7MP98 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=2 Tax=Bacteria RepID=C7MP98_CRYCD Length = 234 Score = 56.6 bits (135), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 21/170 (12%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D F V I+GQ +S A + R+ L GE + Q + A P+ L++ Sbjct: 38 DLFSAVVHHIIGQQISTAAQQTVWLRMCDLLGE-----------VSAQSITATSPEQLQS 86 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTI---PGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 G+ ++ + + A ++G+ + D E A+ L + GIG WTA L Sbjct: 87 CGISFRKVDYIQDFAEKVMDGSFDLDAIEQASDAE-AIAALSSLRGIGTWTAEMLLLFCL 145 Query: 229 QAKDVFLPDDYLIKQ------RFPGMTPAQIRRYAERWKPWRSYALLHIW 272 DV DD I++ +T +Y R+ P+ S A L++W Sbjct: 146 GRPDVLSFDDLAIQRGLRMVYHHRKITRPLFEKYRRRYSPYGSVASLYLW 195 >UniRef50_Q2SX77 DNA-3-methyladenine glycosylase n=60 Tax=Betaproteobacteria RepID=Q2SX77_BURTA Length = 312 Score = 56.6 bits (135), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 48/173 (27%), Positives = 78/173 (45%), Gaps = 22/173 (12%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFP-TPQRLAAADPQALK 170 D+F R+++GQ +SVA A + ++ E C P ++ + L Sbjct: 144 DSFVTLARSVVGQQISVAAAQSVWVKI------------ETACPKLAPPQIIKLGQEKLI 191 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 A G+ +++E ++ LA + G L + D E + L GIGRWTA F + Sbjct: 192 ACGLSKRKSEYILDLAQHFVSGALHVDKWASMDDEDVIAELTQIRGIGRWTAEMFLIFNL 251 Query: 229 QAKDVFLPDDY-LIK----QRFPG--MTPAQIRRYAERWKPWRSYALLHIWYT 274 DV DD LI+ F G +T ++ R A W+PWR+ A ++W + Sbjct: 252 SRPDVLPLDDLGLIRAISVNYFSGEPVTRSEAREVAANWEPWRTVATWYMWRS 304 >UniRef50_A6GQ39 3-methyladenine DNA glycosylase II n=1 Tax=Limnobacter sp. MED105 RepID=A6GQ39_9BURK Length = 217 Score = 56.6 bits (135), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 45/170 (26%), Positives = 78/170 (45%), Gaps = 19/170 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 +E +R+++GQ +SV A + ARV ++ T + L A LKA G Sbjct: 50 YETMLRSLVGQQISVKAADAVWARVVDALNGKI----------TSRALLALSDDTLKATG 99 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + ++ L+ +G L + + D E + L G+GRWTA F + + Sbjct: 100 LSRQKIAYSRALSEFEQQGGLELAVLEGMDDEACTRHLCAIKGVGRWTAQMFLMFCLRRP 159 Query: 232 DVFLPDDY-----LIKQRFPG--MTPAQIRRYAERWKPWRSYALLHIWYT 274 DV+ DD + +Q F G + P + ++ E+ KPWR+ A ++W + Sbjct: 160 DVWPVDDIGVQRGISRQFFEGEPIGPKEALQFGEKLKPWRTVAAWYLWRS 209 >UniRef50_UPI00018509D2 YfjP n=1 Tax=Bacillus coahuilensis m4-4 RepID=UPI00018509D2 Length = 301 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 64/288 (22%), Positives = 115/288 (39%), Gaps = 43/288 (14%) Query: 7 QPPYDWSWMLGFLAARAVSSVET----VADSYYARSLAVGEYRGVV----TAIPDIA--R 56 + PYD +L + + VE + RS+ +Y G + + I D+ Sbjct: 8 EQPYDVESVLSYFTGHPLVVVEQSGLRFGLDHGVRSIIDVKYEGEIAIIHSEIDDMKFIE 67 Query: 57 HTLHI-NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFE 115 T+HI +L L+P+ + R +LQ + G L +D + Sbjct: 68 KTMHILHLDRPLKPID-----EFYRKSELQ-----------EIFQKYEGYPLLLELDDYM 111 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 +R I+ Q V++ +A + + YGE +D FP P L + L+ + Sbjct: 112 SIIRCIISQQVNLTLARNIFTSLTHTYGEEVDSV---WFFPRPHVLKEVSIEELRTHKLS 168 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 ++AE + A+ +G + M + ++ + L GIG+WT + L +++ Sbjct: 169 QRKAEYIQGFASLVADGAIDMDELDKLSNDEIIDRLLPIRGIGKWTVENYLLFTLGRENL 228 Query: 234 FLPDD---------YLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 F D +L R P M I Y+ W P+ SYA +++W Sbjct: 229 FPKGDIGIQNALKKFLQLDRKPTMDEMDI--YSRDWAPYLSYASIYLW 274 >UniRef50_B2SXP8 HhH-GPD family protein n=39 Tax=Betaproteobacteria RepID=B2SXP8_BURPP Length = 349 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 50/173 (28%), Positives = 77/173 (44%), Gaps = 22/173 (12%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFP-TPQRLAAADPQALK 170 D F R+++GQ +SVA A + A+V E C PQ+ + L Sbjct: 181 DPFVTLARSVVGQQISVASAQAVWAKV------------EAACPKLVPQQFIKLGLEKLT 228 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 G+ ++AE ++ LA + G L + + E + L GIGRWTA F + Sbjct: 229 TCGLSKRKAEYVLDLAQHFVSGALHVGKWTSMEDEAVIAELTQIRGIGRWTAEMFLIFNL 288 Query: 229 QAKDVFLPDDY-LIK----QRFPG--MTPAQIRRYAERWKPWRSYALLHIWYT 274 DV DD LI+ F G +T ++ R A W+PWR+ A ++W + Sbjct: 289 SRPDVLPLDDLGLIRAISVNYFSGEPVTRSEAREVAANWEPWRTVATWYMWRS 341 >UniRef50_B2B817 Predicted CDS Pa_2_12990 n=8 Tax=Leotiomyceta RepID=B2B817_PODAN Length = 428 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 49/187 (26%), Positives = 79/187 (42%), Gaps = 28/187 (14%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLY---GERLDDFPEYICFPTPQRLAAADPQ 167 +D FE I+ Q VS A A + R L+ + E FPTP + + Sbjct: 245 IDPFESLASGIISQQVSGAAAKAIKNRFISLFYPGNDTTTTTHEKKKFPTPADVIGKSIE 304 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFAL 225 L+ G+ ++AE L+ LA + G L + D E+ ++ L G+GRW+ FA Sbjct: 305 TLRTAGLSQRKAEYLLGLAQKFVSGELTAQMLADAPYEEVLEKLIAVRGLGRWSVEMFAC 364 Query: 226 RGWQAKDVFLPDDYLIKQ---RFPG-----------------MTPAQIRRYAERWKPWRS 265 G + DVF D +++ F G M+ ++ AE ++P+RS Sbjct: 365 FGLKRMDVFSTGDLGVQRGMAAFVGRDVGKLKAKGGGNKWKYMSEREMEEIAEGFRPYRS 424 Query: 266 YALLHIW 272 L +W Sbjct: 425 ---LFMW 428 >UniRef50_B1YMD5 HhH-GPD family protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YMD5_EXIS2 Length = 273 Score = 56.2 bits (134), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 46/176 (26%), Positives = 83/176 (47%), Gaps = 14/176 (7%) Query: 106 RLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAA 163 RL ++ F +R+I+ Q +++A A L R + +G + FP PT ++L Sbjct: 103 RLVLDINPFTALIRSIIHQQINLAFAQVLMERFCRTFGTEQNGVIFP-----PTAEQLVN 157 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYF 223 +P+ L+AL + ++ + L+ A AA++ P +TL G+G WT Sbjct: 158 VEPEQLRALQLSGRKVDYLLGAARAAIDFERLTEAPDAT--IAETLIALKGVGPWTVQNV 215 Query: 224 ALRGWQAKDVFLPDDYLIK---QRFPGMTPA--QIRRYAERWKPWRSYALLHIWYT 274 + G+ +D+F D I +R G P+ + AE + P+RS+A +W + Sbjct: 216 LMFGYGREDLFPASDIGILRAFERLHGTRPSVEEAVLLAEEFAPYRSHAAYLLWRS 271 >UniRef50_D0XPK8 HhH-GPD family protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XPK8_9CAUL Length = 230 Score = 55.8 bits (133), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 55/181 (30%), Positives = 80/181 (44%), Gaps = 26/181 (14%) Query: 105 LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA 164 +R G V F R I+ Q VSVA AA + AR+ GE TP L A Sbjct: 52 VRQGGFVGLF----RMIVEQQVSVASAASVWARLQAGLGE-----------ITPAGLLAH 96 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANY 222 D +L+ +G+ ++A +A A +EGT+ + ++ A++ L G+G WTA Sbjct: 97 DLDSLRGMGLSRQKATYGQGMARAQIEGTIDLEHLATLDDAAAIEALVRLKGVGLWTAEA 156 Query: 223 FALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-------RYAERWKPWRSYA--LLHIWY 273 + L DVF D +++ + R AE W+PWR A LL WY Sbjct: 157 YLLLCEGRTDVFPGGDVALQEAIKWADGTETRPDTKGAYARAEIWRPWRGVATHLLWAWY 216 Query: 274 T 274 T Sbjct: 217 T 217 >UniRef50_O28163 3-methyladenine DNA glycosylase (AlkA) n=1 Tax=Archaeoglobus fulgidus RepID=O28163_ARCFU Length = 295 Score = 55.8 bits (133), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 42/146 (28%), Positives = 73/146 (50%), Gaps = 8/146 (5%) Query: 99 GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPT 157 G R GL + FE +AI+ Q +S +A KL A++ +G+ ++ + ++ FPT Sbjct: 107 GFGRAGLM---SMSVFEGIAKAIIQQQISFVVAEKLAAKIVGRFGDEVEWNGLKFYGFPT 163 Query: 158 PQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGR 217 + + A + L+ G+ ++AE ++ +A L E+A + L +F GIGR Sbjct: 164 QEAILKAGVEGLRECGLSRRKAELIVEIAKEE---NLEELKEWGEEEAYEYLTSFKGIGR 220 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQ 243 WTA L K+VF DD +++ Sbjct: 221 WTAE-LVLSMALGKNVFPADDLGVRR 245 >UniRef50_B6G8M1 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6G8M1_9ACTN Length = 189 Score = 55.5 bits (132), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 46/184 (25%), Positives = 80/184 (43%), Gaps = 23/184 (12%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 +G L +RP AF +I+ Q++S+ + +R+ +L D+ Sbjct: 4 IGDLEYSRPE-------SAFHSLAHSIIEQMLSMKAGRAIESRLRELCD---GDY----- 48 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPG 214 TP+ +A + +K+ GM ++ ++L LA AL L E KTL PG Sbjct: 49 --TPECIAGIPAENIKSCGMSFRKVQSLKTLAEYALANDLESLAELPDEDVYKTLVQLPG 106 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRR------YAERWKPWRSYAL 268 IG+WT + F L D+ +D ++Q F + A I W+P+ S A+ Sbjct: 107 IGKWTCDMFLLFYLGRPDILPVEDGALRQAFEWLYGAPIVSKEVQAVVCSLWRPYSSTAV 166 Query: 269 LHIW 272 +++ Sbjct: 167 RYLY 170 >UniRef50_A8IJX2 HhH-GPD protein n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8IJX2_AZOC5 Length = 217 Score = 54.7 bits (130), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 49/168 (29%), Positives = 73/168 (43%), Gaps = 19/168 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F V I+ Q +SVA A ++AR Q+ G PT + L A P+ LKA G Sbjct: 46 FAGLVNIIIAQQLSVAAARAISARTEQVLGGP----------PTVEALLNASPETLKAGG 95 Query: 174 MPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + + L +A A +G + + + + A L PGIGRWTA+ + L Sbjct: 96 LSAPKIRTLTRIARALADGVVDLAHVEAMEADAAADYLTRLPGIGRWTADIYLLFCLGRS 155 Query: 232 DVFLPDDYLIKQR------FPGMTPA-QIRRYAERWKPWRSYALLHIW 272 D F D ++ PG A ++ AE W+P+R A +W Sbjct: 156 DAFPEGDLALQVAAADAFGLPGRASALGLKAIAEDWRPYRGVAAHLLW 203 >UniRef50_C6CD76 DNA-3-methyladenine glycosylase II n=1 Tax=Dickeya dadantii Ech703 RepID=C6CD76_DICDC Length = 225 Score = 54.7 bits (130), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 39/149 (26%), Positives = 68/149 (45%), Gaps = 15/149 (10%) Query: 100 AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQ 159 A+RPG + +E +RA+ Q +S AA + A++ + + E FP+P Sbjct: 37 ASRPGQQ------PYEALIRAVASQQLSNRAAAAIIAKLQKQFAM------EETGFPSPS 84 Query: 160 RLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG---DVEQAMKTLQTFPGIG 216 +LA P+ L+ G ++ + + +A A+ G +P + + + L T GIG Sbjct: 85 QLAECPPEHLRQCGFSSRKIDTVQAIARGAISGLVPDRASAALMEDDTLITQLCTLHGIG 144 Query: 217 RWTANYFALRGWQAKDVFLPDDYLIKQRF 245 RWT + + D+ DD I+Q F Sbjct: 145 RWTVEMLLINTLERMDIMPVDDLGIRQGF 173 >UniRef50_B3EJD3 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chlorobium phaeobacteroides BS1 RepID=B3EJD3_CHLPB Length = 313 Score = 54.3 bits (129), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 45/142 (31%), Positives = 65/142 (45%), Gaps = 11/142 (7%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER----LDDFP-EYICFPTPQRLAAADP 166 D FE + + Q + + + K +AQ YG R L+D P Y FPTP+ LA+ P Sbjct: 116 DPFETLITFMCAQGLGMHLIRKQVTYLAQEYGTRHTIRLNDVPYTYFSFPTPEALASTSP 175 Query: 167 QALK-ALGMPLKRAEALIHLANAALEGTLPMTIPGD----VEQAMKTLQTFPGIGRWTAN 221 ++L+ RA+ +I A A + G L + D +E KTL + PGIG A+ Sbjct: 176 ESLRLCTNNNCIRADNIIQAAQAVVSGKLDLQALKDPAMPLENVRKTLCSQPGIGFKIAD 235 Query: 222 YFALRGWQAKDVFLPDDYLIKQ 243 L G F P D + Q Sbjct: 236 CVMLFGLHRFAAF-PIDRHVHQ 256 >UniRef50_A0RYQ2 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Cenarchaeum symbiosum RepID=A0RYQ2_CENSY Length = 187 Score = 54.3 bits (129), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 47/172 (27%), Positives = 82/172 (47%), Gaps = 21/172 (12%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 VR+I+ Q +S + A+ + AR LYG FP P +A + L+ G+ Sbjct: 24 VRSIITQQLSGSAASSILARFRALYGG---------GFPRPADVARTPARKLQQAGISAM 74 Query: 178 RAEALIHLANAALEGTLPM---TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 +A+ + L+ L + + GD E+ + L G+GRWTA F + +DV Sbjct: 75 KADYIRGLSGMIDRRELKLAGFSRMGD-EEVVAELVRVRGVGRWTAEMFLIFALGRQDVL 133 Query: 235 LPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY-TEGWQ 278 D +++ + T A+I + AERW+P+R+ A ++W T+G++ Sbjct: 134 PLGDLGLRKGVMKLCSMDSLPTDAEIVKTAERWRPYRTAATWYLWKGTQGFR 185 >UniRef50_A9EU33 Methylated-DNA--protein-cysteine methyltransferase n=31 Tax=Bacteria RepID=A9EU33_SORC5 Length = 395 Score = 53.9 bits (128), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 48/163 (29%), Positives = 73/163 (44%), Gaps = 17/163 (10%) Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 +I+ Q ++ AA + ARV L FP PTP +L A + L+ G+ + Sbjct: 231 SIVYQQLTGKAAATIFARVRAL-------FPRAHEGPTPAQLLRASDEKLRGAGLSQAKL 283 Query: 180 EALIHLANAALEGTLPM--TIPGDVEQAM-KTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 AL LA +G LP + G ++A+ + L GIGRWT + DV Sbjct: 284 LALRDLARKTEDGELPTLAEVHGMEDEAIIERLTRVRGIGRWTVEMLLMFRLGRPDVLPV 343 Query: 237 DDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 DDY I++ F A + + RWKP+R+ A ++W Sbjct: 344 DDYGIRKGFALAFKRPEPPARADLEKRGARWKPYRTVASWYLW 386 >UniRef50_A6CCG3 Probable DNA-3-methyladenine glycosylase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCG3_9PLAN Length = 211 Score = 53.9 bits (128), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 44/168 (26%), Positives = 76/168 (45%), Gaps = 26/168 (15%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R+I+ Q +S + A + R+ L G+ PT +++ + L+++G+ + Sbjct: 45 LRSIVSQQISTSAARTIYLRLHALTGKGQ---------PTAEKVMQLSHEQLRSVGLSNQ 95 Query: 178 RAEALIHLANAALEGTL---PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 +A + HLA ++ + M + D E L GIG WTA F + G D+F Sbjct: 96 KATYVRHLAEMVMQNKVRLHKMHLLSD-EDVTSELIQVKGIGVWTAQMFLMFGLCRPDIF 154 Query: 235 LPDD----------YLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 DD Y +K R T +I A+RW+P+R+ A + W Sbjct: 155 PHDDLGIQNGIQKIYELKTRPDKQTCIEI---AQRWQPYRTVASWYCW 199 >UniRef50_C8W0S2 HhH-GPD family protein n=6 Tax=Bacteria RepID=C8W0S2_DESAS Length = 201 Score = 53.9 bits (128), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 37/161 (22%), Positives = 74/161 (45%), Gaps = 15/161 (9%) Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 A++ ++S +++K A V + ER D+ T Q++A + ++ G+ +K+A Sbjct: 42 ALVHSIISQQISSKAAATVWNRFLERFDEI-------TSQKIAYTTAEEIQQCGITMKKA 94 Query: 180 EALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 + +A+A ++G + ++ E+ K L GIG WTA Q +V Sbjct: 95 IYIKSIADAVMQGEFNIDELSELPDEEVCKRLSALNGIGVWTAEMLMTFSMQRPNVMSWG 154 Query: 238 DYLIKQ------RFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D I++ + A+ +Y R+ P+ + A L++W Sbjct: 155 DLAIRRGIMMLYHHRKLDKAKFEKYKRRYSPYCTIASLYLW 195 >UniRef50_C7PK12 HhH-GPD family protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PK12_CHIPD Length = 206 Score = 53.9 bits (128), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 40/172 (23%), Positives = 78/172 (45%), Gaps = 21/172 (12%) Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 +I+ Q +SV +A + R LY D E P Q++ P+ L+++G+ + Sbjct: 42 SIMSQQLSVKVATVIYTRFLALY-----DGKE----PNAQQILDTPPETLRSIGLSNAKV 92 Query: 180 EALIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 + ++A +E L + D E+ +K L G+GRWT + +DVF D Sbjct: 93 SYVHNVARFTVEEKLTDKKLLQMDDEEVIKYLTQIKGVGRWTVEMLLMFYLCREDVFAID 152 Query: 238 DYLIKQRFPGMTP----------AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D ++Q + ++ + +++W P+R+YA ++W + +P Sbjct: 153 DLGLQQAMIKLYKLDNTDKKAFREKLLKISKKWSPYRTYASRYLWAWKDMKP 204 >UniRef50_Q5K8T8 DNA-3-methyladenine glycosidase, putative n=1 Tax=Filobasidiella neoformans RepID=Q5K8T8_CRYNE Length = 461 Score = 53.5 bits (127), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 39/136 (28%), Positives = 64/136 (47%), Gaps = 6/136 (4%) Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQAL 169 +D F V +I+GQ VS A + R L+G E FP+PQ + D +L Sbjct: 126 AIDPFRTLVTSIIGQQVSWMAARAINTRFRALFGFTH----EKEGFPSPQMVLMQDVTSL 181 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRG 227 K +G+ ++AE ++ LA+ G L + G E+ K L GIG+WT + F + Sbjct: 182 KGVGLSGRKAEYVLSLADHFASGQLSTQLLQSGTDEEISKALIAVRGIGQWTVDMFMIFS 241 Query: 228 WQAKDVFLPDDYLIKQ 243 + D+ D +++ Sbjct: 242 LRRPDILAVGDLGVQK 257 >UniRef50_A2QHV8 Contig An04c0070, complete genome n=10 Tax=Eurotiomycetidae RepID=A2QHV8_ASPNC Length = 412 Score = 53.5 bits (127), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 36/137 (26%), Positives = 63/137 (45%), Gaps = 4/137 (2%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLY--GERLDDFPEYICFPTPQRLAAADPQA 168 +D F V +I+GQ VS A A + + L+ + +D FPTP+ + D Sbjct: 216 IDPFRSLVSSIIGQQVSGAAAKSIKDKFVALFKTNNKDEDGTRPSFFPTPEEIIKMDIST 275 Query: 169 LKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 L+ G+ ++AE + L+ G L M + E+ ++ L G+G+W+ FA Sbjct: 276 LRTAGLSQRKAEYIHGLSEKFANGELSARMLLNASDEELVEKLTAVRGLGKWSVEMFACF 335 Query: 227 GWQAKDVFLPDDYLIKQ 243 + DVF D +++ Sbjct: 336 ALKRIDVFSTGDLGVQR 352 >UniRef50_A9RKT9 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RKT9_PHYPA Length = 205 Score = 53.5 bits (127), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 44/171 (25%), Positives = 78/171 (45%), Gaps = 18/171 (10%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 ++F VR+I+ Q ++V AA + AR+ L G P+ + TP +AA L+ Sbjct: 38 NSFAALVRSIVSQQLAVKAAATIHARLVALCGG-----PQKV---TPAAIAALTAGELRG 89 Query: 172 LGMPLKRAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 G+ ++ L LA+ + G L + D + + L GIG W+A+ F + Sbjct: 90 AGISGRKEVYLHDLADKLVSGALSDEKLMAMEDEDDLVTALTAVKGIGVWSAHMFMIFHL 149 Query: 229 QAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 DV D I++ F + ++ + A+ W+P+RS A ++W Sbjct: 150 HRPDVLPVGDLGIRKGFQKLFHLKHLPCAEEMHKLADSWRPYRSLASWYLW 200 >UniRef50_B0U6C0 DNA-3-methyladenine glycosidase n=16 Tax=Xanthomonadaceae RepID=B0U6C0_XYLFM Length = 226 Score = 53.5 bits (127), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 61/211 (28%), Positives = 87/211 (41%), Gaps = 26/211 (12%) Query: 85 QCNPQIVNGALGRLGA--ARPGLRLP-GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQL 141 C+P + +G + RLG A G R P VDA RAIL Q +S A+ + AR+ + Sbjct: 19 HCDPGL-SGWMQRLGPLPALRGWRQPFNVVDAL---ARAILFQQLSGKAASTIVARIEAV 74 Query: 142 YGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP-G 200 G C + LA D L+A G+ + AL L + G LP G Sbjct: 75 IGS--------TCL-YAETLACIDDACLRACGVSSNKILALRDLTRREVAGELPSVRQMG 125 Query: 201 DVEQAMKTLQTFP--GIGRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGM----TPA 251 + + P GIGRWT + DV DD ++ QR + TP Sbjct: 126 AMHHNTIVEKLIPIRGIGRWTVEMMLMFRLGRPDVLPVDDLGVRKGIQRVDTLAFVPTPK 185 Query: 252 QIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 + E W P+R+YA L++W + E Sbjct: 186 ALCTRGECWAPYRTYAGLYLWRIADFHEGEG 216 >UniRef50_Q972N8 Putative uncharacterized protein ST1094 n=1 Tax=Sulfolobus tokodaii RepID=Q972N8_SULTO Length = 282 Score = 53.1 bits (126), Expect = 9e-06, Method: Compositional matrix adjust. Identities = 29/122 (23%), Positives = 59/122 (48%), Gaps = 12/122 (9%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGER-LDDFPEYICFPTPQRLAAADPQALKAL 172 E I Q +S+ + L ++A+ +G + + D + FPT + L + P+ L++L Sbjct: 123 LETVTNVISCQQISLNVCLTLVNKLAEKFGGKVIVDDKQINVFPTVEDLINSKPEELRSL 182 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G ++ E +++ NA + + + ++++ G+G+W+ NY LRG D Sbjct: 183 GYSSRKVEYILNAVNALKD-----------KDSFESIKNLKGLGKWSINYILLRGLGRID 231 Query: 233 VF 234 V Sbjct: 232 VI 233 >UniRef50_Q9KC25 DNA-3-methyladenine glycosidase n=1 Tax=Bacillus halodurans RepID=Q9KC25_BACHD Length = 221 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 44/186 (23%), Positives = 81/186 (43%), Gaps = 18/186 (9%) Query: 105 LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA 164 ++LP + F+ V +I+ Q +S+ A+ + RV QL G L+ P++L Sbjct: 31 VQLPTKPNPFQSLVSSIVEQQLSIKAASAIYGRVEQLVGGALEK---------PEQLYRV 81 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLPMT-IPG-DVEQAMKTLQTFPGIGRWTANY 222 +AL+ G+ ++ E + H+ G L T + G + ++ L GIG+WTA Sbjct: 82 SDEALRQAGVSKRKIEYIRHVCEHVESGRLDFTELEGAEATTVIEKLTAIKGIGQWTAEM 141 Query: 223 FALRGWQAKDVFLPDDYLIKQ-------RFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 F + DV D +++ G + + + W P+ + A L++W Sbjct: 142 FMMFSLGRLDVLSVGDVGLQRGAKWLYGNGEGDGKKLLIYHGKAWAPYETVACLYLWKAA 201 Query: 276 GWQPDE 281 G +E Sbjct: 202 GTFAEE 207 >UniRef50_O94468 Probable DNA-3-methyladenine glycosylase 2 n=1 Tax=Schizosaccharomyces pombe RepID=MAG2_SCHPO Length = 213 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 40/169 (23%), Positives = 77/169 (45%), Gaps = 17/169 (10%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 +E +RAI Q +S A T + + + D E FPTP+++ D + L G Sbjct: 42 YEGIIRAITSQKLSDAA----TNSIINKFCTQCSDNDE---FPTPKQIMETDVETLHECG 94 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDV---EQAMKTLQTFPGIGRWTANYFALRGWQA 230 +++ + +A AAL +P + E+ M++L G+ RWT +++ Sbjct: 95 FSKLKSQEIHIVAEAALNKQIPSKSEIEKMSEEELMESLSKIKGVKRWTIEMYSIFTLGR 154 Query: 231 KDVFLPDDYLIK---QRFPGMTPA----QIRRYAERWKPWRSYALLHIW 272 D+ DD +K + F G++ ++ + + KP+R+ A ++W Sbjct: 155 LDIMPADDSTLKNEAKEFFGLSSKPQTEEVEKLTKPCKPYRTIAAWYLW 203 >UniRef50_B5ES79 HhH-GPD family protein n=4 Tax=Acidithiobacillus RepID=B5ES79_ACIF5 Length = 322 Score = 53.1 bits (126), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 57/184 (30%), Positives = 81/184 (44%), Gaps = 16/184 (8%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 L L A GLR P FE V A+ Q +S+ + L R+++L E + + + Sbjct: 108 LASLEARYRGLRPPRFPSLFEGMVNAVACQQLSLHLGITLLNRLSELCREGVGEMDQVYP 167 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP----MTIPGDVEQAMKTLQ 210 FP P L + AL+ LG ++ AL LA A G L +P A++ L Sbjct: 168 FPDPGSLLRQEVTALRGLGFSGQKVTALRALAEEAAVGGLEREDWQHLPNAA--AVQRLL 225 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPW 263 GIGRW+A Y LR DVF DD ++ + AQ+ A R +PW Sbjct: 226 RLRGIGRWSAEYVLLRTLGRLDVFPGDDVGARKALARWLEENGSLDYAQV---AHRLRPW 282 Query: 264 RSYA 267 + YA Sbjct: 283 QPYA 286 >UniRef50_Q4ZR24 DNA-3-methyladenine glycosylase II n=4 Tax=Pseudomonas syringae group RepID=Q4ZR24_PSEU2 Length = 221 Score = 52.8 bits (125), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 45/159 (28%), Positives = 72/159 (45%), Gaps = 12/159 (7%) Query: 125 LVSVAMAAKLTARVAQLYGERLDD-FPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 LV +L AR RL FPE + FP+ L D QAL++ G + A+ Sbjct: 50 LVEAVAYQQLHARAGDAMVMRLRSLFPE-VSFPSAPALVELDDQALRSCGFSAAKCRAIK 108 Query: 184 HLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 +A A L+G +P + E ++ L PG+GRWT + G DV D+ Sbjct: 109 AIAAARLDGLVPEVSAALAMGNEALVERLIQLPGVGRWTVEMMLIYGLGQLDVMPASDFG 168 Query: 241 IKQRFPGMTPAQIR-------RYAERWKPWRSYALLHIW 272 + + + + Q++ R AER+ P+R+ A ++W Sbjct: 169 VCEGYRRLYALQLKPSHRQMARLAERFAPYRTIAAWYLW 207 >UniRef50_Q6BZL7 DEHA2A00418p n=2 Tax=Debaryomyces hansenii RepID=Q6BZL7_DEBHA Length = 300 Score = 52.8 bits (125), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 42/197 (21%), Positives = 85/197 (43%), Gaps = 29/197 (14%) Query: 103 PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY---GERLD---DFPEYICFP 156 P + ++A++ V+ I+ Q +S + A + + +L+ GE + F + FP Sbjct: 89 PNTLMDVKMNAYQTLVKIIISQQLSTSAARSIMTKFIKLFLKEGESTEPDHQFKAHPHFP 148 Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV-----EQAMKTLQT 211 TP+ + P+ L++ G+ ++A L+ ++ + + + E + L Sbjct: 149 TPEIVKETSPERLRSAGISFRKAGYLLIISEKFSDKNYLLNDDKKLNDMSNEDIARLLID 208 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---------------RFPGMTPAQIRRY 256 GIG W + F L + D+F D I++ + ++ ++ +Y Sbjct: 209 LKGIGPWAVDIFLLLYMKRSDIFPISDAGIRKGLSMLIQNTSGKKGKKLNYLSIEEMEKY 268 Query: 257 AERWKPWRSYALLHIWY 273 +E WKP+RS A WY Sbjct: 269 SENWKPYRSVA---SWY 282 >UniRef50_A1K6J5 DNA-3-methyladenine glycosylase II n=21 Tax=Proteobacteria RepID=A1K6J5_AZOSB Length = 229 Score = 52.4 bits (124), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 41/175 (23%), Positives = 76/175 (43%), Gaps = 22/175 (12%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 + +E VRA+ Q ++ ++ ++ R+ LY + FP P++L A AL+ Sbjct: 48 EPYEALVRAVAYQQLATSVGDRIIGRLLALYPDS--------AFPQPEQLLATGFDALRG 99 Query: 172 LGMPLKRAEALIHLANAALEGTLPM---TIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 G ++ E + +A L G +P + D E + L GIGRWT + Sbjct: 100 CGFSARKIETIHGIAQGTLSGLVPSRADAVSMDDEALIARLVELRGIGRWTVEMLLIFTL 159 Query: 229 QAKDVFLPDDYLIKQRF---------PGMTPAQIRRYAERWKPWRSYALLHIWYT 274 + DV DD+ +++ + PG ++ R P+R+ A ++W + Sbjct: 160 ERIDVLPVDDFGVREGYRHLKSLDEMPGRK--EMARAGLVCSPYRTVAAWYLWRS 212 >UniRef50_D1HE56 Whole genome shotgun sequence of line PN40024, scaffold_1.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HE56_VITVI Length = 351 Score = 52.0 bits (123), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 48/186 (25%), Positives = 76/186 (40%), Gaps = 22/186 (11%) Query: 105 LRLPGCVDAFEQG----VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQR 160 L P D+F ++IL Q ++ + R L G PE + Sbjct: 138 LHPPPTFDSFHTPFLALTKSILYQQLAYKAGTSIYTRFVGLCGGEAGVLPETVL------ 191 Query: 161 LAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRW 218 A P L+ +G+ ++A L LA G L T I D + L GIG W Sbjct: 192 --ALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGIITMDDKSLFTMLTMVNGIGSW 249 Query: 219 TANYFALRGWQAKDVFLPDDYLIK---QRFPGMT----PAQIRRYAERWKPWRSYALLHI 271 + + F + DV +D ++ Q G+ P+Q+ + E+W+P+RS A +I Sbjct: 250 SVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPSQMEQLCEKWRPYRSVASWYI 309 Query: 272 W-YTEG 276 W + EG Sbjct: 310 WRFVEG 315 >UniRef50_Q3B3Y2 HhH-GPD n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B3Y2_PELLD Length = 311 Score = 51.6 bits (122), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 55/195 (28%), Positives = 80/195 (41%), Gaps = 35/195 (17%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--C------FPTPQRLAA 163 D +E V + Q + +A+ + + +A+ YGE + P + C FPTP RL A Sbjct: 114 DPYETMVTFMCAQGIGMALIRRQVSMLARRYGEHV---PLSLNGCTINLYRFPTPSRLGA 170 Query: 164 ADPQALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQT----FPGIGRW 218 ADP L+A L RA +I + EG + + + +Q GIG Sbjct: 171 ADPMELRACTNNNLMRARNIISASQKVTEGCIDFKALASKKNTQEDIQAALSRCGGIGLK 230 Query: 219 TANYFALRGWQAKDVFLPDDYLIKQ------RFPG----MTPAQIRRYAERWKP------ 262 A+ AL G D F P D ++Q FP +T R AER + Sbjct: 231 IADCIALFGLGRFDAF-PIDTHVRQFLGLWFGFPEASAPLTDKNYRILAERARELLGEKL 289 Query: 263 --WRSYALLHIWYTE 275 +R + L H W TE Sbjct: 290 AGYRGHHLFHCWRTE 304 >UniRef50_C8WLI9 HhH-GPD family protein n=4 Tax=Bacteria RepID=C8WLI9_EGGLE Length = 219 Score = 50.8 bits (120), Expect = 5e-05, Method: Compositional matrix adjust. Identities = 40/173 (23%), Positives = 71/173 (41%), Gaps = 19/173 (10%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D F V I+GQ ++ + R+ + +GE TP+ +AA L+ Sbjct: 42 DLFAALVNCIVGQQIATKAQTTIWNRMLERFGE-----------VTPEAMAACSDDELQQ 90 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQ 229 +G+ ++ + A L G + + ++ ++ +TL PGIG WTA Q Sbjct: 91 VGISFRKVGYIKGAAARVLSGEVDLEGLAELSDDEVCRTLSALPGIGVWTAEMLMTFSMQ 150 Query: 230 AKDVFLPDDYLIKQ------RFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 ++ D I + +TP +Y R+ P+ S A L++W G Sbjct: 151 RPNILSWGDLAIHRGLRMVHHHRRITPELFAKYRRRYTPYGSVASLYLWEVAG 203 >UniRef50_UPI0001B54083 YfjP n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B54083 Length = 308 Score = 50.4 bits (119), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 19/223 (8%) Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNP----QIVN--GALGRLGAARPGLRLPGCV 111 T+ + ++A +E +AA +A++SR+ L + ++ N + RL A G R Sbjct: 71 TVEVGVAAPVE-IAARVVAQVSRILSLDVDESGLAEVANRDSVVRRLHARHSGARPVLHS 129 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL-DDFPEYICFPTPQRLAAADPQALK 170 +E A+L Q + VA A +L ++A +G ++ +D P FP+PQ +A + Sbjct: 130 SPYEAACWAVLTQGMRVAQARRLREQLAVRHGRQVGEDGP--FSFPSPQIVAELGDEPSV 187 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 + + L + +P + A+ L PGIG + A L G Sbjct: 188 GPFRLARLRAVARAAVDGRLNADELLALP--IPDALNRLCAIPGIGAFAAEQILLHGAGH 245 Query: 231 KDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSY 266 D+F D + Q P P ++ AE W+P+RS+ Sbjct: 246 PDLFPRLDTQLHQVLCAEYALPPDTPPDELEPLAEDWRPYRSW 288 >UniRef50_Q5SLG4 DNA-3-methyladenine glycosidase n=6 Tax=Bacteria RepID=Q5SLG4_THET8 Length = 185 Score = 50.4 bits (119), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 43/153 (28%), Positives = 70/153 (45%), Gaps = 7/153 (4%) Query: 125 LVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIH 184 L +A +L+ R A ERL + PTP+ A L+ G+ +A AL Sbjct: 33 LAESVVAQQLSTRAAARLAERLFR----LVPPTPEAFLEAPLDLLRQAGLSRAKALALKD 88 Query: 185 LANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ- 243 LA A EG L + E ++ L G+G WTA F + G DV+ D +++ Sbjct: 89 LAAKAEEGLLDGLDRLEDEAVVERLTRVRGVGLWTAEMFLMFGLGRPDVWPVRDLGLRRA 148 Query: 244 --RFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 R G+ P + + E ++P+RS+ ++W + Sbjct: 149 AARLFGVAPEALPAFGEAFRPYRSHLAWYLWRS 181 >UniRef50_A9FBN7 Putative DNA-3-methyladenine glycosidase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FBN7_SORC5 Length = 292 Score = 50.4 bits (119), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 45/178 (25%), Positives = 79/178 (44%), Gaps = 9/178 (5%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC-FPTPQRLA 162 G++ P +E V +++ Q +S+A A R+ + +D + + FP P+ +A Sbjct: 105 GVKPPRFASLWEAIVNSVVFQQLSLAAAMAAVRRLVLRFASPVDVAGQRLFPFPPPEVVA 164 Query: 163 AADPQALKALGMPLKRAEALIHLAN-----AALEGTLPMTIPGDVEQAMKTLQTFPGIGR 217 ++ P L+ LG+ +A+AL A E L G++E+ ++ L PGIG Sbjct: 165 SSTPHDLRTLGLSGAKADALRTCARMIAAGELREEELEALANGEIERRLREL---PGIGP 221 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 WTA+ LRG++ DVF D + + E P+R H+ + Sbjct: 222 WTASVILLRGFRRLDVFPGGDVAAARGLGAIAGEHGGELVEALGPYRGMLYFHLLLSS 279 >UniRef50_B3E6X3 HhH-GPD family protein n=2 Tax=Bacteria RepID=B3E6X3_GEOLS Length = 201 Score = 50.4 bits (119), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 45/196 (22%), Positives = 83/196 (42%), Gaps = 24/196 (12%) Query: 86 CNPQIVNGALGRLGAARPG-LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 CN ++ + P +R PG F R IL Q VS+ A+ + E Sbjct: 14 CNKHLIFRIINDKYGIPPNWMREPG----FISLSRIILEQQVSI--------ESAKAHFE 61 Query: 145 RLDDF-PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--D 201 +++ + PE+ TP + Q ++ + ++A+ L L+ A L+ L + + + Sbjct: 62 KINSYIPEF----TPNEIIKLSDQEMRDCQISRQKAKYLRSLSEAILKNELNLEVMDTFN 117 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI----KQRFPGMTPAQIRRYA 257 + + L GIG WT + + + Q KDVF D + + T ++ + Sbjct: 118 DHEIREKLTKINGIGNWTVDIYLMFCLQRKDVFPSGDIAVINAAMELLEYETKDEVLNES 177 Query: 258 ERWKPWRSYALLHIWY 273 ++W P RS A +W+ Sbjct: 178 KKWAPLRSLAAYFLWH 193 >UniRef50_B4S806 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S806_PROA2 Length = 312 Score = 50.4 bits (119), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 13/144 (9%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER----LDDFP--EYICFPTPQRLAAA 164 ++AFE + + Q + + + K + +GER +D P +Y FP+P+ LAAA Sbjct: 114 LNAFETLITFMCAQAIGMNLIRKQIRTICNRFGERHMTEIDGNPLIQY-SFPSPETLAAA 172 Query: 165 DPQALK-ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAM----KTLQTFPGIGRWT 219 PQ L+ +RA +I A A EG L M + E ++ +L + GIG Sbjct: 173 SPQDLRICTNNNCERASNIISAARAVAEGRLCMDELINNELSLGSIRNSLTAYRGIGLKI 232 Query: 220 ANYFALRGWQAKDVFLPDDYLIKQ 243 A+ L G D F P D ++Q Sbjct: 233 ADCVMLFGLHRHDAF-PIDTHVRQ 255 >UniRef50_B0D0G2 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0D0G2_LACBS Length = 415 Score = 50.4 bits (119), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 34/121 (28%), Positives = 56/121 (46%), Gaps = 9/121 (7%) Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEY-------ICFPTPQRLAAADPQALKALGMP 175 GQ +S A +T R +LY + + P++ FPTPQ + D L+ G+ Sbjct: 129 GQQISWLAARSITHRFIRLYHPSIPEKPDHQMMKSYLHLFPTPQDIVDTDIATLRTAGLS 188 Query: 176 LKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 ++AE + LA+ ++G L + D + L GIGRWT + FA+ + D+ Sbjct: 189 ARKAEYVKDLASRFVDGRLSTEKLLNADDDDLYSILIEVRGIGRWTVDMFAIFSLRRPDI 248 Query: 234 F 234 Sbjct: 249 L 249 >UniRef50_UPI0000D54B32 HhH-GPD n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=UPI0000D54B32 Length = 197 Score = 50.1 bits (118), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 41/171 (23%), Positives = 76/171 (44%), Gaps = 20/171 (11%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 FE VR I Q +SVA A + R+A++ F P++ L+ Sbjct: 36 GFEGLVRLICEQQLSVASAKAIFERLAKIVSP----FEAKNFLKVPKK-------DLQKT 84 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQA--MKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++ + LANA +EG L T + + K L GIG+WTA+ + L + Sbjct: 85 GLSRQKIDYCTGLANACIEGDLDFTTLHKMNDSDLRKELCKIKGIGKWTADCYMLASLKR 144 Query: 231 KDVFLPDDYLIK---QRFPGMTP----AQIRRYAERWKPWRSYALLHIWYT 274 +D++ D ++ Q+ ++ ++ + +WKP+R+ +W + Sbjct: 145 EDIWPAGDLGLQISVQKLKKLSSRPSEMELEEISVKWKPYRTLVANMLWNS 195 >UniRef50_A9M750 HhH-GPD family protein n=55 Tax=Rhizobiales RepID=A9M750_BRUC2 Length = 232 Score = 50.1 bits (118), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 49/168 (29%), Positives = 70/168 (41%), Gaps = 20/168 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 FE ++ Q VS A AA + AR+ Q+ I TP+ A +A + G Sbjct: 61 FESLASIVVAQQVSTASAAAIWARLKQV-----------INPLTPEAYIAGGEEAWRLAG 109 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + + L+ L+ A G L + D+ +A+ TL GIG WTA + L Sbjct: 110 LSRPKQRTLLALSEALAGGALDLHGLCDLPAGEAIATLTAIKGIGPWTAEVYLLFAAGHP 169 Query: 232 DVFLPDDYLIK----QRFPGMT---PAQIRRYAERWKPWRSYALLHIW 272 DVF D ++ F T A +R+ AE W PWR A W Sbjct: 170 DVFPAGDVALQTAVGHAFAHETRPDAAALRQLAENWAPWRGVAARLFW 217 >UniRef50_A7EZ08 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EZ08_SCLS1 Length = 418 Score = 50.1 bits (118), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 37/135 (27%), Positives = 60/135 (44%), Gaps = 3/135 (2%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 ++ F V I+ Q VS A A + A+ L+ D P FPTP + A D L+ Sbjct: 239 IEPFRALVSGIISQQVSGAAAKSIKAKFVALFNPP-DSDPSTHTFPTPSAIVATDLARLR 297 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 G+ ++AE + LA +G L + E+ +L G+G+W+ FA Sbjct: 298 TAGLSQRKAEYISGLALKFTDGELTTQFLLSASYEEVFASLIQVRGLGKWSVEMFACFAL 357 Query: 229 QAKDVFLPDDYLIKQ 243 + DVF D +++ Sbjct: 358 KRLDVFSTGDLGVQR 372 >UniRef50_A6TTX3 Methylated-DNA--protein-cysteine methyltransferase n=67 Tax=Bacteria RepID=A6TTX3_ALKMQ Length = 355 Score = 49.7 bits (117), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/159 (23%), Positives = 73/159 (45%), Gaps = 17/159 (10%) Query: 125 LVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIH 184 L+S ++ +++ + A+ RLD+ E + TP+ + + ++ GM K+AE + Sbjct: 198 LISSIVSQQISNKAAETVWNRLDELLESM---TPESITKTELSQIQGCGMTNKKAEYIKG 254 Query: 185 LANAALEG-----TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 +A+ AL G TL M ++ Q + +L G+G WT + +V D Sbjct: 255 IADVALCGKINFKTLHMLSDQEIIQKLSSLH---GVGIWTVEMLLIFSLNRPNVVSYGDL 311 Query: 240 LIKQ------RFPGMTPAQIRRYAERWKPWRSYALLHIW 272 I++ ++ Q +Y ++ P+ S A L++W Sbjct: 312 AIRRGMMNLYGLKELSKEQFNQYRAKYAPYGSVASLYLW 350 >UniRef50_C5G8B3 DNA-3-methyladenine glycosylase n=8 Tax=Onygenales RepID=C5G8B3_AJEDR Length = 438 Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 40/167 (23%), Positives = 71/167 (42%), Gaps = 27/167 (16%) Query: 130 MAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 M A A++ +R DDFP TP ++A D L+ G+ ++AE + LA Sbjct: 270 MDAGEKIETAEMRYDRDDDFP------TPAQVAKCDIATLRTAGLSQRKAEYIQGLAEKF 323 Query: 190 LEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---- 243 G L M + E+ ++ L G+G+W+ F+ G + DVF D +++ Sbjct: 324 ASGELSARMLLQASDEEVLEKLIAVRGLGKWSVEMFSCFGLKRMDVFSTGDLGVQRGMAA 383 Query: 244 ---------------RFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 +F M+ ++ A + P+RS + ++W E Sbjct: 384 FVGRDVSKLKAKGGGKFKYMSEKEMVEVAAPFSPYRSLFMWYMWRIE 430 >UniRef50_A9T041 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T041_PHYPA Length = 178 Score = 49.3 bits (116), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 46/178 (25%), Positives = 80/178 (44%), Gaps = 18/178 (10%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 F R+I+ Q +S A + R+ + G L+ TP +AA + L+A+ Sbjct: 8 CFTALARSIVYQQISGKAACAIYCRLISICGG-LESV-------TPPVIAALTVEELRAV 59 Query: 173 GMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++ L LA G L I + + +K L GIG W+A+ F + + Sbjct: 60 GISGRKGLYLHDLAEKFTSGLLSEAKLIIMNEDDLVKALTAVKGIGVWSAHMFMIFYLRK 119 Query: 231 KDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW-YTEGWQPD 280 DV D I++ F + +PA+++ A W+P+R+ A ++W T+ PD Sbjct: 120 PDVLPVGDLAIRKAFQKLYHLNQLPSPAEMQELAFPWRPYRTLASWYLWRMTDNMLPD 177 >UniRef50_C1A5A1 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A5A1_GEMAT Length = 243 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 42/169 (24%), Positives = 73/169 (43%), Gaps = 14/169 (8%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F R I+ Q +S + A + R + L E+ PTP+ + D AL+ G Sbjct: 68 FGHLARNIVYQQLSGSAATTIHGRFLKHVSAHLGVETEH---PTPESVLGIDDDALRGCG 124 Query: 174 MPLKRAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 + + + A+ LA ++G LP + + D ++ + L GIG WTA F + Sbjct: 125 LSVAKVRAIKDLAQHVIDGRLPLDRLDVMSD-QEIIDALVPVRGIGPWTAQMFLMFRLGR 183 Query: 231 KDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 DV D +++ + A++ + A+ W+PW S A + W Sbjct: 184 PDVLPVLDLGVRKGAQRIYRTRALPDAARLEKIAKTWRPWASVASWYCW 232 >UniRef50_B2J3A5 HhH-GPD family protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J3A5_NOSP7 Length = 212 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 45/162 (27%), Positives = 69/162 (42%), Gaps = 20/162 (12%) Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 IL Q VSVA A + R+ + I TP+ D L+ +G ++ Sbjct: 55 ILEQQVSVAAARAVFNRLCGV-----------IVPLTPENFLTLDDVQLRGIGFSRQKIL 103 Query: 181 ALIHLANAALEGTLPMT-IPGDVEQAMKT-LQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 LANA L ++ + E ++T L+ GIG WT + + L Q DVF D Sbjct: 104 YSRGLANAIASDQLDLSKLERMDETTIRTELKRLKGIGDWTVDIYLLMALQRPDVFPKGD 163 Query: 239 YLIK---QRFPGM----TPAQIRRYAERWKPWRSYALLHIWY 273 I Q+ + TP Q+ + W+PWR+ A +W+ Sbjct: 164 LAIAIALQKLKNLATRPTPVQLEGMTQHWRPWRAVAARLLWH 205 >UniRef50_Q04UT1 DNA-3-methyladenine glycosylase II n=4 Tax=Leptospira RepID=Q04UT1_LEPBJ Length = 228 Score = 48.9 bits (115), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 42/176 (23%), Positives = 80/176 (45%), Gaps = 26/176 (14%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 ++++LGQ +SV +A R+ L G + P P R+ + LK +G+ Sbjct: 64 IKSVLGQQLSVKVALTFERRLISLAGSK--------KIPPPDRILMIPNEELKKIGVSQA 115 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQA--MKTLQTFPGIGRWTANY---FALRGWQAKD 232 + E + +A A L + + +E + + L +F G+G WTA FAL W D Sbjct: 116 KIETIQRIAEAYLNRDITDSKLRKLEDSDVLNLLCSFKGVGPWTAEMVLIFALDRW---D 172 Query: 233 VFLPDDYLIK---QRFPGMT---PAQIRRYAERWKPWRSYALLHIWYT----EGWQ 278 F +D +++ ++ G++ +I+ + + P+R+ ++W EGW Sbjct: 173 HFSINDLILRKSVEKHYGISKDNKKEIQHFLMSYSPFRTILSWYLWADMDGGEGWN 228 >UniRef50_D2QEN8 HhH-GPD family protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QEN8_9SPHI Length = 207 Score = 48.5 bits (114), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 44/200 (22%), Positives = 83/200 (41%), Gaps = 31/200 (15%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGV-----RAILGQLVSVAMAAKLTARVAQLYGERL 146 + + R+ A P +P V+ + V +I+ Q +SV A + +R L+ ++ Sbjct: 15 DPVMARIIAETP---VPKLVNDYADDVYLALLESIVSQQISVKAADAIFSRFRALFPDK- 70 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQ 204 +P L L++ G+ ++ + L +A +LE + + E+ Sbjct: 71 --------YPQADALLLKTTDELRSAGLSFQKIKYLQSVAEFSLEKPIDRVHLDALTDEE 122 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ------------ 252 ++ L G+GRWT + D+F DD +I+QR P Q Sbjct: 123 IVQYLLPIKGVGRWTVEMLLMFVLDRPDIFPIDDLVIRQRMLRAYPEQTNGLTGKALYKV 182 Query: 253 IRRYAERWKPWRSYALLHIW 272 + AE W+P+R+ A ++W Sbjct: 183 LLSIAEPWRPYRTTASRYLW 202 >UniRef50_Q9LN45 F18O14.25 n=22 Tax=Magnoliophyta RepID=Q9LN45_ARATH Length = 1314 Score = 48.5 bits (114), Expect = 3e-04, Method: Composition-based stats. Identities = 39/165 (23%), Positives = 71/165 (43%), Gaps = 19/165 (11%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYG-ERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 +R IL Q +++ + R L G E L P+ + + +PQ L+ +G+ Sbjct: 176 IRNILYQQLAMKAGNSIYTRFVSLCGGENL---------VVPETVLSLNPQQLRQIGVSG 226 Query: 177 KRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++A L LA G L + D + L GIG W+ + F + DV Sbjct: 227 RKASYLHDLARKYQNGILSDSAILNMDEKSLFTMLTMVNGIGSWSVHMFMINSLHRPDVL 286 Query: 235 LPDDYLIK---QRFPGMT----PAQIRRYAERWKPWRSYALLHIW 272 +D ++ Q G+ P+Q+ ++ +W+P+RS ++W Sbjct: 287 PVNDLGVRKGVQLLYGLDDLPRPSQMEQHCAKWRPYRSVGSWYMW 331 >UniRef50_D1VAP6 HhH-GPD family protein n=1 Tax=Frankia sp. EuI1c RepID=D1VAP6_9ACTO Length = 310 Score = 48.5 bits (114), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 44/166 (26%), Positives = 69/166 (41%), Gaps = 10/166 (6%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 D R + Q V+ AA L +R+ L G + P LA DP L Sbjct: 122 TDVLHVLARCVTAQQVTGRFAATLRSRLVGLVGRPVTAGPHTAYALDADLLADTDPTRLT 181 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPG-DVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 LG+ ++A AL+ +A A ++ D E + TL PGIGRW+A +F +R Sbjct: 182 ELGLSGRKAMALLGVARAVTGSLSLSSLHELDDEGVIATLTALPGIGRWSAEWFLIRAL- 240 Query: 230 AKDVFLPDDYLIKQRF-----PGMTP---AQIRRYAERWKPWRSYA 267 + + D +++ PG+ P ++R W P + A Sbjct: 241 GRPLVAAGDLAVRKAVGHLYRPGLPPPAEEEVRLLTAHWGPAAALA 286 >UniRef50_B3QN63 8-oxoguanine DNA glycosylase domain protein n=2 Tax=Chlorobaculum RepID=B3QN63_CHLP8 Length = 317 Score = 48.1 bits (113), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 43/140 (30%), Positives = 66/140 (47%), Gaps = 11/140 (7%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPE----YICFPTPQRLAAADPQA 168 FE V + Q + + + + + +A+ YG+++ + PE + FPTP LA+ADP Sbjct: 122 FEIMVTFMCAQGIGMHLIRRQVSMIAERYGQKIVLETPEGEMVFYGFPTPSALASADPSE 181 Query: 169 LKALGMPLK-RAEALIHLANAALEGTLPMTIPG----DVEQAMKTLQTFPGIGRWTANYF 223 L + RA +I +A + G L + G D+E +TL GIG A+ Sbjct: 182 LALCTNNNRIRAANIIAMARSFESGKLALACVGSGECDLETLRETLCVHSGIGLKIADCI 241 Query: 224 ALRGWQAKDVFLPDDYLIKQ 243 AL G D F P D +KQ Sbjct: 242 ALFGLGRFDAF-PIDTHVKQ 260 >UniRef50_Q754R1 AFR011Wp n=1 Tax=Eremothecium gossypii RepID=Q754R1_ASHGO Length = 285 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 33/129 (25%), Positives = 56/129 (43%), Gaps = 9/129 (6%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F+ I+ Q +S A A + +RV QL+G FP Y+ + + AA D +L+ G Sbjct: 63 FKHLASGIITQQISGAAARSIKSRVEQLFG---GTFPNYVELQS--KFAAGDSASLRKCG 117 Query: 174 MPLKRAEALIHLANAALEGTLPM----TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + ++ + L + + + D E ++ GIG W+A F + Sbjct: 118 LSARKVSYVESLTAYFNQNEMRLKHLFNSGSDAEIVDDLVRNVKGIGPWSAKMFLVTSLH 177 Query: 230 AKDVFLPDD 238 +DVF DD Sbjct: 178 RQDVFAADD 186 >UniRef50_D2LH30 HhH-GPD family protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LH30_RHOVA Length = 214 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 46/168 (27%), Positives = 67/168 (39%), Gaps = 20/168 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F+ V I GQ +SVA + R+ G+ T + LAAAD L+ G Sbjct: 43 FKGLVFVITGQQISVAAGRAIFGRLEGALGD-----------ITAETLAAADDTILREAG 91 Query: 174 MPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + L L AAL L + D E+A+ L GIG WTA + L Sbjct: 92 YSRPKMRTLRALQEAALADGLDLVAIEAMDAERAIIKLSAIKGIGPWTAEVYLLFAAGHP 151 Query: 232 DVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIW 272 D+F D +++ + +R ++ W PWRS A +W Sbjct: 152 DIFPAADVALQESMRLAFDLDARPSTQALREISDAWTPWRSAAARLLW 199 >UniRef50_A9I9J6 DNA-3-methyladenine glycosidase II n=1 Tax=Bordetella petrii DSM 12804 RepID=A9I9J6_BORPD Length = 218 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 59/207 (28%), Positives = 83/207 (40%), Gaps = 33/207 (15%) Query: 78 MSRLFDLQCNPQIVNGALGRLG-AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTA 136 + RL L + V A G + ARPG F R + GQ+VSVA A + Sbjct: 19 LRRLVKLDPRLRAVRDAAGAVPLRARPG--------GFAGLARIVCGQMVSVASADAIW- 69 Query: 137 RVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM 196 RL+ P+ TP A L+ +G+ + AL LA A G L + Sbjct: 70 -------RRLEALPQAT---TPGGFLALGEHGLQGVGLSQGKFRALTQLARALSAGELDL 119 Query: 197 ----TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-----PG 247 +P D A+ L GIG WTA + + D+F D +++ PG Sbjct: 120 PAIEAMPADA--AIAELTRHKGIGPWTAEIYLMFCAGHPDIFPAGDIALQKAVGDALTPG 177 Query: 248 MTPAQIR--RYAERWKPWRSYALLHIW 272 P + R AE W P+R+ A L W Sbjct: 178 QYPDRKRLIGIAEAWAPYRASAALLFW 204 >UniRef50_D0J2I3 HhH-GPD n=6 Tax=Comamonadaceae RepID=D0J2I3_COMTE Length = 274 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 59/230 (25%), Positives = 98/230 (42%), Gaps = 28/230 (12%) Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGV 118 +H++L G+ AE ++ R +++ + +LG+ L G AF Sbjct: 62 IHLDLPDGVPAYWAEACRQLMR------RDRVLKRLIPQLGSQ--ALLPCGQEQAFATLA 113 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 R+I+GQ +S A L + +L PE + RL D ++A+G+ ++ Sbjct: 114 RSIIGQQISAKSAKTLWNKFVRLPAAMQ---PEQVL-----RLKVDD---MRAVGLSARK 162 Query: 179 AEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 + L+ LA E L M + E + L + G+ RWTA F + +V Sbjct: 163 VDYLVDLALHFTENRLHMDEWAQMSDEVIIAELMSIRGLSRWTAENFLIYCLGRPNVLPL 222 Query: 237 DDYLIKQ-----RFPG--MTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DD + Q F G ++ + R AE WKPW + A +IW + QP Sbjct: 223 DDAGLIQGISLNHFSGDPVSRSDAREVAEAWKPWCTVATWYIWRSLEAQP 272 >UniRef50_A3JFL8 3-methyladenine DNA glycosylase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JFL8_9ALTE Length = 207 Score = 47.8 bits (112), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 55/186 (29%), Positives = 75/186 (40%), Gaps = 26/186 (13%) Query: 97 RLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFP 156 RLGA R PG F V IL Q +S+ A + R+ GE Sbjct: 31 RLGAPPLWAREPG----FASLVHIILEQQISIKAAQTVFERLCAHLGEM----------- 75 Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT---IPGDVEQAMKTLQTFP 213 +PQR+ +A + LKA G+ ++A LA G L + D E L P Sbjct: 76 SPQRMVSAGEEELKAFGLTRQKARYCFGLAERIHTGKLNLAQLDALSDTE-GRDALLAIP 134 Query: 214 GIGRWTANYFALRGWQAKDVF-LPDDYL------IKQRFPGMTPAQIRRYAERWKPWRSY 266 G+G W+ + + L + DV+ L D L IKQ T Q A W PWR+ Sbjct: 135 GLGPWSVDVYYLMALRRPDVWPLGDLALAAAMQEIKQLDAPATRQQQVDIANAWSPWRAV 194 Query: 267 ALLHIW 272 A +W Sbjct: 195 AARLLW 200 >UniRef50_C0NIP1 Putative uncharacterized protein n=1 Tax=Ajellomyces capsulatus G186AR RepID=C0NIP1_AJECG Length = 406 Score = 47.8 bits (112), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 32/109 (29%), Positives = 53/109 (48%), Gaps = 8/109 (7%) Query: 139 AQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--M 196 A++ +R DDFP TP ++A D L++ G+ ++AE + LA G L M Sbjct: 286 AEMRYDRDDDFP------TPAQVAKCDIATLRSAGLSQRKAEYIQGLAEKFASGELSAQM 339 Query: 197 TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF 245 + E+ ++ L G+GRW+ F+ G + DVF D ++ F Sbjct: 340 LLQASDEEVLEKLIAVRGLGRWSVEMFSCFGLKRMDVFSTGDLGVQSLF 388 >UniRef50_B3QVZ3 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QVZ3_CHLT3 Length = 323 Score = 47.4 bits (111), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 36/146 (24%), Positives = 65/146 (44%), Gaps = 19/146 (13%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY-----ICFPTPQRLAAADPQA 168 FE + + Q + + + + ++A+ +GE+++D P + FP+PQ LA AD + Sbjct: 119 FEALISFMCAQGMGIQIIRRQIEQLARQFGEKINDSPPFDSEHCYSFPSPQALANADIEL 178 Query: 169 L-KALGMPLKRAEALIHLANAALEG-------------TLPMTIPGDVEQAMKTLQTFPG 214 L K L RA+ + H++ A + G L + ++A L FPG Sbjct: 179 LKKCTNNNLVRAKNIKHISEAVVNGELVLEHIHAIHNENLGLCSKCSYKKAKAALLRFPG 238 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYL 240 IG A+ L G + + D ++ Sbjct: 239 IGDKIADCICLFGLEHGEAIPIDRHV 264 >UniRef50_A6EE77 3-methyladenine DNA glycosylase n=1 Tax=Pedobacter sp. BAL39 RepID=A6EE77_9SPHI Length = 222 Score = 47.0 bits (110), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 44/175 (25%), Positives = 76/175 (43%), Gaps = 28/175 (16%) Query: 112 DAFEQGVRAILGQLVSVAMA----AKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQ 167 + FE V IL Q VS+A A KL R+ ++ TP L + Sbjct: 42 NTFESLVHIILEQQVSLASALAALNKLRDRLKEV---------------TPGVLLQLTDE 86 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTI-PGDVEQAMKTL-QTFPGIGRWTANYFAL 225 LKA + +++ + HLA + L G++ + + P ++ ++ L G+G WT + + + Sbjct: 87 ELKACYLSRQKSIYVRHLATSILHGSIDLDLMPRLPDREIRILLNQLKGVGNWTIDVYLM 146 Query: 226 RGWQAKDVFLPDDYL-------IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 Q DVF D +K G + R A W+P+R+ A + +W+ Sbjct: 147 FVLQRADVFPSGDLAAVNALKQLKDLPVGTHKEVLERIAMNWQPYRTVATMILWH 201 >UniRef50_B6K1P6 DNA-3-methyladenine glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1P6_SCHJY Length = 224 Score = 47.0 bits (110), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 39/172 (22%), Positives = 73/172 (42%), Gaps = 21/172 (12%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 + +E + A+ Q +S + + R+ Q + ++ FP+ Q L + D + L++ Sbjct: 49 EPYEGLIHALTYQRLSDSAGDAILGRLCQHFHKK--------SFPSVQELLSLDTEDLRS 100 Query: 172 LGMPLKRAEALIHLANAALEGTLP-----MTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 G ++ E ++ LAN A +G+LP +P D + + GIG WT +A+ Sbjct: 101 FGFSHRKGETILELANMAADGSLPSREEISHMPLD--KMIGIFTKVKGIGAWTVEKYAIF 158 Query: 227 GWQAKDVFLPDDYLIKQRFP-----GMTPAQIRRYAERWKPWRSYALLHIWY 273 +V D I++ TP + ER + + Y + WY Sbjct: 159 TLGRPNVMPTMDREIRENVQLLYHLDHTPTDV-EMEERSRAYVPYKTVASWY 209 >UniRef50_B8GY42 DNA-3-methyladenine glycosylase II n=4 Tax=Caulobacteraceae RepID=B8GY42_CAUCN Length = 213 Score = 47.0 bits (110), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 47/168 (27%), Positives = 72/168 (42%), Gaps = 20/168 (11%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 ++ I+ Q VS+A AA + ARV PE TP+ +A D L+ LG+ Sbjct: 47 LKMIVQQQVSLASAAAIWARVEA-------GLPEM----TPEIVADHDEAYLRTLGLSQP 95 Query: 178 RAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +A +A A L G + E+A+ L G+GRWTA F + D+F Sbjct: 96 KARYARAIAEAHLSGVCDFDALRALSDEEAIAALTAIKGVGRWTAEVFLMFTQGRLDLFP 155 Query: 236 PDDYLIKQRFPGMTPAQIR-------RYAERWKPWRSYALLHIWYTEG 276 D +++ + A+ R AE W+P+R A +W G Sbjct: 156 GGDVALQEAMRWVDRAETRPTEKQAYARAELWRPYRGVAAHLLWACYG 203 >UniRef50_A0Z859 HhH-GPD protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z859_9GAMM Length = 215 Score = 47.0 bits (110), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 19/169 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F+ + ++GQ VS A +T R+ + +L+ P+ L + D +L+A G Sbjct: 47 FDSLAKIVVGQQVSTRAAEAITQRLLESLNGQLE----------PEILLSRDDDSLRAAG 96 Query: 174 MPLKRAEALIHLANAALEGTLP-MTIPG-DVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + ++ L LA A +EG LP + +P ++ ++ + G G W+A + + Sbjct: 97 LSRQKISYLRSLATAVVEGALPLLDLPKMSDDEVLQRITAIRGFGAWSAQMYLMFSLGRT 156 Query: 232 DVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 D++ D ++ F + T + A+ + P+RS L W+ Sbjct: 157 DIWPSGDLAVRVGFGRLLGLVERPTAKKTEELAKDFTPYRSALALLCWH 205 >UniRef50_D0LW65 DNA-3-methyladenine glycosylase II n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LW65_HALO1 Length = 220 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 37/127 (29%), Positives = 59/127 (46%), Gaps = 9/127 (7%) Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTF 212 FPTP L A L++ G+ +A AL LA +G++ D ++ TL Sbjct: 84 FPTPAALLAVSEDTLRSAGLSRAKATALRDLAAKFADGSVRSRQFSRMDADELRATLTQV 143 Query: 213 PGIGRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGM----TPAQIRRYAERWKPWRS 265 GIG W+ + F + G DV D ++ QR+ + PA+++ A W P+RS Sbjct: 144 RGIGPWSVDMFLIFGLMRPDVLPVGDLGVRKGMQRYFELEELPKPAEMQELAAPWAPFRS 203 Query: 266 YALLHIW 272 A ++W Sbjct: 204 VASWYMW 210 >UniRef50_C0D4Q9 Putative uncharacterized protein n=2 Tax=Clostridium RepID=C0D4Q9_9CLOT Length = 293 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 38/137 (27%), Positives = 66/137 (48%), Gaps = 7/137 (5%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE----YICFPTPQRLAAADPQ 167 D +E + ++ Q ++ +L +++ +G++++D E FPTP+ LA A + Sbjct: 102 DPWEMIITFVISQQKTIPCIRRLVEDISRRWGQKIEDGDEKNFAVYSFPTPKELARASLE 161 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFAL 225 L L + RA+ + L+ A G L + +E QAM+ L F GIG+ AN L Sbjct: 162 ELLDLKLGY-RAKYIHRLSQDAAAGILDLKNLETMEYGQAMEYLTGFYGIGKKVANCVCL 220 Query: 226 RGWQAKDVFLPDDYLIK 242 G + F D ++ K Sbjct: 221 FGLHHIEAFPVDTWIEK 237 >UniRef50_B4B851 DNA-3-methyladenine glycosylase II n=2 Tax=Cyanobacteria RepID=B4B851_9CHRO Length = 215 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 40/165 (24%), Positives = 67/165 (40%), Gaps = 23/165 (13%) Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTP---QRLAAADPQALKALGMPL 176 AI+ Q +S +A K+ R LY E TP + L + L+++G+ Sbjct: 50 AIMAQQISTEVANKIYQRFLSLYNE-----------STPLNARNLLQTSDEDLRSIGISR 98 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQA--MKTLQTFPGIGRWTANYFALRGWQAKDVF 234 + L +LA A E P++ +E +K L GIG WT + Q D+ Sbjct: 99 YKIGYLKNLARAVEEYLPPLSELATMEDETIIKLLTQIKGIGTWTVQMLLIFRLQRLDIL 158 Query: 235 LPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 D I+ + +P + +WKP+R+ A ++W Sbjct: 159 PSGDLGIRMAIKNLYQLPELPSPEIVEAIGHKWKPYRTIAAWYLW 203 >UniRef50_Q8TL35 DNA-3-methyladenine glycosylase II n=1 Tax=Methanosarcina acetivorans RepID=Q8TL35_METAC Length = 299 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 39/169 (23%), Positives = 80/169 (47%), Gaps = 6/169 (3%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLA 162 GL + FE A+L Q +S+ +A K+ ++ + G + + Y FP+ +++ Sbjct: 122 GLHQVKFLTPFEAAAWAVLSQRISMKVAHKIKNKLTEAIGNSIQIEGIVYRTFPSARQVK 181 Query: 163 AADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANY 222 + L ++ +++E LI +A+A G+++ + L GIG W+A+ Sbjct: 182 NLGVENLASIIKNERKSEYLIAVADAFDRVDENFLRQGNIKDVREWLMNIWGIGEWSAHL 241 Query: 223 FALRGW-QAKDVFLPDDYLIK--QRF--PGMTPAQIRRYAERWKPWRSY 266 +RG + +++ + L+ +RF P T Q RR A+ + ++ Y Sbjct: 242 ILIRGLGRMEELSEHEKTLLNCFKRFYGPEATEDQFRRVADSYGDFKGY 290 >UniRef50_Q55703 Slr0231 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q55703_SYNY3 Length = 152 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 38/128 (29%), Positives = 58/128 (45%), Gaps = 15/128 (11%) Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP-----GDVEQAMKTLQT 211 T Q LA DP+ L+ LG+ + L A AL+ P ++P GD ++ L Sbjct: 16 TAQTLANVDPELLRELGISRYKTRYLKTWA-IALQNNFP-SLPELETWGD-RAIVEQLTA 72 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWR 264 GIG WTA F L + +D+ D I+ + P Q+ Y + W+P+R Sbjct: 73 IKGIGPWTAQLFLLFRLRRQDILPNQDLGIRIAIQKLYQLPDRPNPKQVSEYGKNWQPYR 132 Query: 265 SYALLHIW 272 S A ++W Sbjct: 133 SLASWYLW 140 >UniRef50_Q9YFG9 Putative uncharacterized protein n=1 Tax=Aeropyrum pernix RepID=Q9YFG9_AERPE Length = 133 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 9/98 (9%) Query: 131 AAKLTARVAQLYGERLDDFPE-YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 A ++ +R+ YGE + E + FP P+RL + L+ +G+ +A+A+ ++A Sbjct: 4 ALRIQSRLITSYGEHVSIGREVFYSFPDPERLISLSLDRLRRIGLTRMKAQAIKNIAMTE 63 Query: 190 LEGTLPMTIPGDVEQA------MKTLQTFPGIGRWTAN 221 EG LP ++EQA +K L G+G WTA Sbjct: 64 YEGRLPSV--EEIEQAEELSSIVKELTRLKGVGPWTAE 99 >UniRef50_C6IXS6 DNA-3-methyladenine glycosidase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IXS6_9BACL Length = 228 Score = 46.2 bits (108), Expect = 0.001, Method: Compositional matrix adjust. Identities = 44/168 (26%), Positives = 72/168 (42%), Gaps = 20/168 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F + R+I+ Q +SV A+ + RV +L GE +P L A L+A G Sbjct: 46 FTELARSIISQQISVKAASTIRGRVIELAGEL-----------SPAALLAQSDADLRAAG 94 Query: 174 MPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + + L L++ G L + D E+ +K L + GIGRW+A F + + Sbjct: 95 LSASKVAYLKDLSDKVQSGQLDLDRLQELDDEEVIKQLVSVKGIGRWSAEMFLIFALGRE 154 Query: 232 DVFLPDDYLIKQRFP---GMTPAQIRRY----AERWKPWRSYALLHIW 272 V D +++ M R+Y A +W + S A L++W Sbjct: 155 HVVSYGDAGLQRAAKWVYDMEERPDRKYLQQAAAQWPSYGSIASLYLW 202 >UniRef50_B8EL05 HhH-GPD family protein n=5 Tax=Alphaproteobacteria RepID=B8EL05_METSB Length = 249 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 47/161 (29%), Positives = 70/161 (43%), Gaps = 20/161 (12%) Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 I+GQ +SVA A + R+ G TP + AA +ALKA G+ + Sbjct: 53 IMGQQLSVASADAIWRRLIDRLGPL-----------TPSVIEAATDEALKACGLSAPKIR 101 Query: 181 ALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 L +A A G LP+ G + + A L GIG WTA+ + + D F D Sbjct: 102 TLRAIAEAITSGALPLDELGVMPADAAHAALTAVKGIGPWTADIYLMFCLGHSDAFAAGD 161 Query: 239 YLIKQRFP---GMT----PAQIRRYAERWKPWRSYALLHIW 272 ++ GMT ++ AE+W+PWR+ A +W Sbjct: 162 LALQAAARLAYGMTARPGAPELVALAEQWRPWRAVAAKVLW 202 >UniRef50_C0KTC3 8-oxoguanine DNA glycosylase n=1 Tax=Clostridium sp. enrichment culture clone 7-14 RepID=C0KTC3_9CLOT Length = 269 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 9/144 (6%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D +E V I+ Q ++ + YGE + P Y FPTP LA AD +AL+A Sbjct: 102 DPWEILVTFIISQRKNIPAIRACVETLCSRYGEPIG--PTY-AFPTPAALAGADEEALRA 158 Query: 172 LGMPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + RA ++ A A GTL + + + +Q ++L T PG+G AN +L G+ Sbjct: 159 CALGY-RAGYVLAAAQMADAGTLDLFALVSLEDDQLAESLMTVPGVGVKVANCVSLFGYH 217 Query: 230 AKDVFLPD---DYLIKQRFPGMTP 250 F D + +I + + G P Sbjct: 218 RIAAFPRDVWMNRVIHEHYRGRFP 241 >UniRef50_Q11DX8 HhH-GPD n=3 Tax=Rhizobiales RepID=Q11DX8_MESSB Length = 214 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 48/169 (28%), Positives = 65/169 (38%), Gaps = 22/169 (13%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F I+ Q VS A A + AR++QL TP AA + G Sbjct: 43 FHSLASVIVSQQVSRASADAIFARLSQLVKPL-----------TPDGFLAAGESVMVKAG 91 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + + L L+ A + L + G++ AM+ L PGIG WTA + L Sbjct: 92 LSRAKQRTLTALSAALRDKALDLDDLGNLPPADAMEALTVIPGIGPWTAQVYLLVAAGHP 151 Query: 232 DVFLPDDYLIKQRFPGMTPA--------QIRRYAERWKPWRSYALLHIW 272 D+F D + Q G A Q+ AE W PWRS A W Sbjct: 152 DIFPAGDIAL-QAAVGHALALDARPKANQLALIAEPWSPWRSVAARLFW 199 >UniRef50_Q7UGU9 DNA-3-methyladenine glycosidase n=1 Tax=Rhodopirellula baltica RepID=Q7UGU9_RHOBA Length = 207 Score = 44.7 bits (104), Expect = 0.004, Method: Compositional matrix adjust. Identities = 43/168 (25%), Positives = 70/168 (41%), Gaps = 18/168 (10%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 FE R +L Q VS+ A ++ QL L TP+ + Q +A Sbjct: 32 GFETLARIVLEQQVSLRSAESTLHKLQQLLEGPL----------TPRGIVRLSAQQTRAC 81 Query: 173 GMPLKRAEALIHLANAALEGTLPMT-IPGDVEQ-AMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++ L LA ++G + +PG +Q A L GIGRW+A + + Sbjct: 82 GVSRQKHRYLNQLAADIVDGRFVLDRLPGMSDQEARDQLTARLGIGRWSAEVYLMSALNR 141 Query: 231 KDVFLPDDYLIKQRFPGMTPAQ------IRRYAERWKPWRSYALLHIW 272 D+ D + + + Q I + A+RW+P+RS A +W Sbjct: 142 PDILPFGDLGLLKGVEELDGGQYDDFDAIIQRADRWRPYRSMATRLVW 189 >UniRef50_B6JZD7 DNA-3-methyladenine glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JZD7_SCHJY Length = 268 Score = 44.3 bits (103), Expect = 0.004, Method: Compositional matrix adjust. Identities = 26/94 (27%), Positives = 42/94 (44%), Gaps = 3/94 (3%) Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV---EQAMKTLQT 211 FPTP+ + A + + LK+ G ++ + + +A G +P E+ ++ L Sbjct: 123 FPTPEEILALEQEQLKSCGFSRRKTDTIREIARGVETGLIPSLDAAHEMVNEELIERLSQ 182 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF 245 GIGRWTA + G DV D I+ F Sbjct: 183 IHGIGRWTAEMLLIFGMGRLDVLPAGDLKIRDGF 216 >UniRef50_C7MW75 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MW75_SACVD Length = 324 Score = 44.3 bits (103), Expect = 0.004, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 77/182 (42%), Gaps = 16/182 (8%) Query: 101 ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI-CFPTPQ 159 A PGLR +E A++ VA A + R+A G I FP PQ Sbjct: 146 ANPGLRPVLFCSPYEAACWAVVCHRFRVAQADAVIRRIAMNRGRVFHVGGREIPSFPVPQ 205 Query: 160 RLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM----TIPGDVEQAMKTLQTFPGI 215 L D + G+ ++ +L +A+AAL G L +P V A++ ++ PG+ Sbjct: 206 ELGVLD----SSYGVSERKRRSLSAIADAALSGELDADRLRALP--VFAAVEAVRRLPGL 259 Query: 216 GRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 G ++A RG D+F + + +R G+ + ERW+P+R + + Sbjct: 260 GPFSAELVVGRGAGHPDLFPASEAGLATTLRRCYGVDDVGV--VTERWRPYRGWGAFFLR 317 Query: 273 YT 274 T Sbjct: 318 ET 319 >UniRef50_C1XHZ0 DNA-3-methyladenine glycosylase II n=2 Tax=Meiothermus RepID=C1XHZ0_MEIRU Length = 178 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 35/119 (29%), Positives = 57/119 (47%), Gaps = 4/119 (3%) Query: 158 PQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGR 217 P+ L A + L+A+G+ +A + L+ ALEG L E + L GIG Sbjct: 56 PEVLYRAALEDLRAVGLSSAKARYVQDLSRFALEGGLQGLEHHSDEALIAHLTQVKGIGV 115 Query: 218 WTANYFALRGWQAKDVFLPDDYLIK---QRFPGMTPA-QIRRYAERWKPWRSYALLHIW 272 WT F + G DV+ D I+ Q+ G+ ++ ER++P+RS+A ++W Sbjct: 116 WTVQMFLMFGLGRPDVWPVLDLGIRKGAQKLYGVIERDELEALGERFRPYRSHAAWYLW 174 >UniRef50_D1ZEJ1 Whole genome shotgun sequence assembly, scaffold_22 n=6 Tax=Leotiomyceta RepID=D1ZEJ1_SORMA Length = 415 Score = 44.3 bits (103), Expect = 0.005, Method: Compositional matrix adjust. Identities = 48/215 (22%), Positives = 83/215 (38%), Gaps = 50/215 (23%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLY-------------------GERLDDFPE 151 +D FE V +I+ Q VS A A + + L+ G +D P Sbjct: 193 IDPFESLVSSIISQQVSGAAAKSIKGKFVALFDDPSLDQDQDDEDGKDTPPGHPAEDQPS 252 Query: 152 YIC----FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQA 205 FPTP + D L+ G+ ++AE + LA+ G L ++ ++ Sbjct: 253 SKRRKRRFPTPSLVLQKDLPTLRTAGLSQRKAEYIHGLASKFASGELSASLLASAPYDEL 312 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ---RFPG--------------- 247 + L G+G WT FA + DVF D +++ F G Sbjct: 313 VSKLVAVRGLGLWTVEMFACFALKRMDVFSLGDLGVQRGMAAFVGRDVKKLKNGNGKGNG 372 Query: 248 -------MTPAQIRRYAERWKPWRSYALLHIWYTE 275 M+ +++ +ER++P+RS + ++W E Sbjct: 373 KDKKWKYMSEGEMKEISERFRPYRSLFMWYMWRVE 407 >UniRef50_C4DGP2 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DGP2_9ACTO Length = 300 Score = 43.9 bits (102), Expect = 0.006, Method: Compositional matrix adjust. Identities = 70/281 (24%), Positives = 119/281 (42%), Gaps = 20/281 (7%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV-TAIPDIARHTL 59 M T+N P++ + FL A ++ E D + + + V A+ T+ Sbjct: 1 MTTINPAGPFNLATSTRFLEGFAPAAYEGAGDEVLRLAFPADDGKAVAGAALRQETDGTV 60 Query: 60 HINLS--AGLEPVAAECLAKMSRLFDLQCNPQIV--NGALGRLGAARPGLRLPGCVDA-F 114 + ++ A E V A+ MS D P +V + + L PGLR P C + + Sbjct: 61 RVEITGAADAEAVGAQVRRIMSLDIDGTGYPAVVASDPIVKGLSEQYPGLR-PVCFHSPY 119 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI-CFPTPQRLAAADPQALKALG 173 E A++G + + AA + A +A GE + + FPTP+ LA + + G Sbjct: 120 EAAAWAVIGHRIRITQAAGIKAAMAARLGETVTVAGRPVAAFPTPEVLA----EVGEFPG 175 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQAK 231 + + L +A AAL G L D+ A++ LQ GIG ++A +RG Sbjct: 176 LTDVKIARLRGIAEAALAGELDAKRLRDMASADALEQLQGIAGIGPFSAELILIRGAGHP 235 Query: 232 DVFLPDDYLIKQRFPGM------TPAQIRRYAERWKPWRSY 266 DVF + + + + + A++ A W P+RS+ Sbjct: 236 DVFPRTETRLHRTMTQLYRREEPSAAELADIAADWAPFRSW 276 >UniRef50_B6BWA6 DNA-3-methyladenine glycosylase 1 n=1 Tax=beta proteobacterium KB13 RepID=B6BWA6_9PROT Length = 201 Score = 43.9 bits (102), Expect = 0.007, Method: Compositional matrix adjust. Identities = 34/146 (23%), Positives = 65/146 (44%), Gaps = 17/146 (11%) Query: 139 AQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--- 195 + +Y LD F FPT Q + + L ++G+ ++A+ ++ +A+ L+ +P Sbjct: 58 STIYQRFLDLFNN--VFPTEQDIIK-NKDLLSSIGLSNQKAQTILSIADGYLKQFIPNEK 114 Query: 196 --MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI-------KQRFP 246 M + G ++ ++ G+G WT + D+ D I KQR Sbjct: 115 KIMLLNG--QEIIEQFTQIKGVGVWTVQMMLIFNQGQPDIMPSSDLAIRKKYSFFKQRDC 172 Query: 247 GMTPAQIRRYAERWKPWRSYALLHIW 272 +TP Q+ + E P+R+ A ++W Sbjct: 173 LITPTQLIKETEYLSPYRTIAAWYLW 198 >UniRef50_A3LTR9 3-methyladenine DNA glycosylase (Fragment) n=5 Tax=Saccharomycetales RepID=A3LTR9_PICST Length = 288 Score = 43.5 bits (101), Expect = 0.007, Method: Compositional matrix adjust. Identities = 33/142 (23%), Positives = 65/142 (45%), Gaps = 16/142 (11%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 + +++GQ +S A + + +G+ D+ P+ L+A+G+ Sbjct: 84 ISSVIGQQISGHAARAVEKKFKDSFGD--DEM-------NPENTLKKSFDELRAVGLSNM 134 Query: 178 RAEALIHLANAALEGTLPMTIP----GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + + +I ++ A + +T P G +E+ ++ L + GIG W+A FA+ + DV Sbjct: 135 KTKYVISISEAFSDPKNKLTDPKFYEGPLEEIVEELVSLKGIGVWSAKMFAIFTLKEMDV 194 Query: 234 FLPDDYLIKQRFPGMTPAQIRR 255 F DD + + GM +RR Sbjct: 195 FAEDDLGVAR---GMAKYVVRR 213 >UniRef50_B7K9B1 HhH-GPD family protein n=3 Tax=Cyanobacteria RepID=B7K9B1_CYAP7 Length = 210 Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust. Identities = 44/198 (22%), Positives = 78/198 (39%), Gaps = 18/198 (9%) Query: 84 LQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 LQ I+ + ++G + G + P E AI+ Q +S +A K+ R Y Sbjct: 10 LQEADSILGEIIVQIGECKLG-KTPSNSSLLEALAWAIISQQISTKVANKIYQRFLNFYN 68 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVE 203 D P T + L + L++LG+ + L +LA A + P+ +E Sbjct: 69 ---DATP-----LTAKNLLNTPEEDLRSLGISRNKIRYLKNLAKAVEDNLPPLYQLELME 120 Query: 204 --QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIR 254 + + L G+G WTA + D+ D I+ + +P + Sbjct: 121 DWEIIHLLTQIKGVGIWTAQMLLIFRLNRLDILPSADLGIRTAIKNLYQLPELPSPEIVE 180 Query: 255 RYAERWKPWRSYALLHIW 272 +WKP+R+ A ++W Sbjct: 181 AIGYKWKPYRTIASWYLW 198 >UniRef50_A8N5M3 Putative uncharacterized protein n=2 Tax=Agaricales RepID=A8N5M3_COPC7 Length = 822 Score = 43.5 bits (101), Expect = 0.009, Method: Compositional matrix adjust. Identities = 32/125 (25%), Positives = 57/125 (45%), Gaps = 15/125 (12%) Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC----------FPTPQRLAAADPQALKA 171 LGQ +S A +T + +LY + PE + FPTP++++ + L+ Sbjct: 567 LGQQISWKAARSITHKFIRLYSPSI---PEEVTDESRAAAMQVFPTPEQVSKTEVSLLRT 623 Query: 172 LGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 G+ ++A+ + LA +G L + E+ + L GIGRWT + FA+ + Sbjct: 624 AGLSERKAQYIQDLAARFADGRLSTDKLLNASDEELAEMLIEVKGIGRWTVDMFAIFSLR 683 Query: 230 AKDVF 234 D+ Sbjct: 684 RPDIL 688 >UniRef50_B4SHB1 8-oxoguanine DNA glycosylase domain protein n=3 Tax=Chlorobium/Pelodictyon group RepID=B4SHB1_PELPB Length = 312 Score = 43.1 bits (100), Expect = 0.010, Method: Compositional matrix adjust. Identities = 40/142 (28%), Positives = 65/142 (45%), Gaps = 11/142 (7%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER--LDDFPEYIC---FPTPQRLAAADP 166 D FE + + Q + + + K + + Q YGE+ + + I FP+P+RLAAA+P Sbjct: 115 DPFETMISFMCAQGIGMPLIRKQVSMLLQNYGEKRTISYSGKEITLHHFPSPERLAAANP 174 Query: 167 QALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGD----VEQAMKTLQTFPGIGRWTAN 221 AL RA ++ +A +G + + D + + +TL G+G A+ Sbjct: 175 IALSTCTNNNHPRARNIVRIAKGVADGKIDLDALSDPLLPLSELRRTLCQNEGVGYKIAD 234 Query: 222 YFALRGWQAKDVFLPDDYLIKQ 243 AL G D F P D +KQ Sbjct: 235 CIALFGLGRFDAF-PIDTHVKQ 255 >UniRef50_C7RDZ5 8-oxoguanine DNA glycosylase domain protein n=3 Tax=Clostridiales Family XI. Incertae Sedis RepID=C7RDZ5_ANAPD Length = 300 Score = 43.1 bits (100), Expect = 0.010, Method: Compositional matrix adjust. Identities = 31/134 (23%), Positives = 56/134 (41%), Gaps = 3/134 (2%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP--EYICFPTPQRLAAADPQAL 169 + FE + I+ + K +++ YG+ + ++ +Y FP P+ L P+ L Sbjct: 122 EVFETTISFIISANNQIPRIKKAVRIISERYGDYIGEYKGRKYYSFPRPEVLMKVKPEDL 181 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + R + ++ + EG L D E K L PG+G A+ L + Sbjct: 182 REYARVGFRDKRIVEASRMIYEGQLDGASKLDTEDLRKKLMELPGVGPKVADCILLFAYH 241 Query: 230 AKDVFLPDDYLIKQ 243 ++ F P D IK+ Sbjct: 242 RRETF-PVDVWIKR 254 >UniRef50_B8PBS7 Predicted protein n=2 Tax=Postia placenta Mad-698-R RepID=B8PBS7_POSPM Length = 336 Score = 43.1 bits (100), Expect = 0.011, Method: Compositional matrix adjust. Identities = 33/121 (27%), Positives = 57/121 (47%), Gaps = 7/121 (5%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC-----FPTPQRLAAAD 165 +D F +I+GQ +S A + R +L+ L + P+ FPT +++ + D Sbjct: 16 LDPFRTLANSIIGQQISWKAARAIYHRFIRLFDPSLPEKPQDYTQPSEFFPTARQVVSTD 75 Query: 166 PQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYF 223 LK+ G+ ++AE + LA+ +G L + D E+ L GIGR T + F Sbjct: 76 LAILKSAGLSGRKAEYVYDLASRFADGRLSTEKLLQADDEELYSMLIEVKGIGRLTCSLF 135 Query: 224 A 224 + Sbjct: 136 S 136 >UniRef50_B4WC67 AlkA N-terminal domain family n=1 Tax=Brevundimonas sp. BAL3 RepID=B4WC67_9CAUL Length = 360 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 37/109 (33%), Positives = 49/109 (44%), Gaps = 16/109 (14%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPYDWS M+ LAARAV E VAD + R L R L Sbjct: 201 LAYRPPYDWSAMMSALAARAVPD-EAVADGVWRRRL-----RTATDGTDGEVSVRLGSEG 254 Query: 64 SAGLEPVAAE------CLAKMSRLFDLQCNPQIVNGALGRLGAARPGLR 106 A +E E LA++ R+FDL +P+ + L +A P LR Sbjct: 255 KAAVEARVDELKALPGVLARVRRVFDLAADPEAIRRDL----SADPDLR 299 >UniRef50_Q4A0G9 Putative DNA-3-methyladenine glycosidase n=1 Tax=Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 RepID=Q4A0G9_STAS1 Length = 221 Score = 43.1 bits (100), Expect = 0.012, Method: Compositional matrix adjust. Identities = 35/145 (24%), Positives = 62/145 (42%), Gaps = 12/145 (8%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L +L L++ D + +R+I+GQ ++VA+A + +++ +DD Sbjct: 20 DATLAQLINQIGDLQIQTRADPLKSLIRSIIGQQITVAVAQSIFQKLS----IAIDDHW- 74 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTL 209 T +L+ +KALG+ + + ++ A G L D + L Sbjct: 75 -----TVNQLSQLRESEMKALGLSQSKINYIQNVLFAVRNGQLNFEQLYKMDDNSVINAL 129 Query: 210 QTFPGIGRWTANYFALRGWQAKDVF 234 GIGRWTA F L Q K++ Sbjct: 130 TQIKGIGRWTAEVFLLFTLQRKNIL 154 >UniRef50_Q688W2 Os05g0567500 protein n=2 Tax=Oryza sativa RepID=Q688W2_ORYSJ Length = 290 Score = 42.7 bits (99), Expect = 0.015, Method: Compositional matrix adjust. Identities = 33/121 (27%), Positives = 56/121 (46%), Gaps = 12/121 (9%) Query: 161 LAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQA--MKTLQTFPGIGRW 218 L+AAD L+A+G+ ++A L LA G L + +++A + L G+G W Sbjct: 148 LSAAD---LRAIGVSARKAAYLHDLAGRFAAGELSESAVAAMDEAALLAELTKVKGVGEW 204 Query: 219 TANYFALRGWQAKDVFLPDDYLIK---QRFPGM----TPAQIRRYAERWKPWRSYALLHI 271 T + F + DV D ++ Q G+ P ++ ERW+P+RS ++ Sbjct: 205 TVHMFMIFSLHRPDVLPSGDLGVRKGVQELYGLPALPKPEEMAALCERWRPYRSVGAWYM 264 Query: 272 W 272 W Sbjct: 265 W 265 >UniRef50_D0JW87 Glycosidase n=1 Tax=Yersinia pestis D182038 RepID=D0JW87_YERP1 Length = 243 Score = 42.4 bits (98), Expect = 0.016, Method: Compositional matrix adjust. Identities = 40/153 (26%), Positives = 68/153 (44%), Gaps = 17/153 (11%) Query: 93 GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY 152 A+ RLG L P D F +R I+ Q +SV A + R+ L G Sbjct: 22 AAIERLGM----LERPLSPDLFAALIRNIVDQQISVKAAQTVNTRLTLLLGS-------- 69 Query: 153 ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQ 210 TP +AAA +A++ GM +++A + A+AA+ G+L +++ + + + L Sbjct: 70 ---ITPATVAAASAEAIQRCGMTMRKAGYIKGAADAAINGSLDLSVIAQLPDNEVITQLS 126 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ 243 G+G WTA + D+ D I++ Sbjct: 127 RLDGVGVWTAEMLLISSLSRPDIVSWGDLAIRR 159 >UniRef50_B2W1R2 DNA-3-methyladenine glycosylase n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2W1R2_PYRTR Length = 439 Score = 42.0 bits (97), Expect = 0.023, Method: Compositional matrix adjust. Identities = 35/145 (24%), Positives = 62/145 (42%), Gaps = 21/145 (14%) Query: 149 FPE-YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQA 205 FP + FPTP ++ L+ G+ ++AE + LA G L M + E+ Sbjct: 282 FPTTHPAFPTPTQVLQLPIPTLRTAGLSQRKAEYITGLAEKFCSGELTAQMLVSASDEEL 341 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ------------------RFPG 247 ++ L G+GRW+ FA G + DVF D +++ ++ Sbjct: 342 IEKLVAVRGLGRWSVEMFACFGLKRMDVFSTGDLGVQRGMAVYAGRDVNKLKSKGGKWKY 401 Query: 248 MTPAQIRRYAERWKPWRSYALLHIW 272 MT ++ A + P+RS + ++W Sbjct: 402 MTEREMLDTAANFSPYRSLFMWYMW 426 >UniRef50_C7MNR2 Endonuclease III n=3 Tax=Coriobacteriaceae RepID=C7MNR2_CRYCD Length = 222 Score = 41.2 bits (95), Expect = 0.039, Method: Compositional matrix adjust. Identities = 42/166 (25%), Positives = 68/166 (40%), Gaps = 22/166 (13%) Query: 125 LVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAAD----PQALKALGMPLKRAE 180 LV+V ++A+ T L + +PTP LA A+ + +LG +A Sbjct: 41 LVAVVLSAQCTDAAVNKVTPSL-----FAAYPTPAALAQANVTDVATIIHSLGFFRAKAT 95 Query: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 L+HL+ L G+V + LQT PG+GR TAN ++ D ++ Sbjct: 96 HLVHLSQ-----VLMTDFGGEVPNDIDALQTLPGVGRKTANVVMCEAFKNPQGIAVDTHV 150 Query: 241 I----KQRFPG---MTPAQIRRYAERWKPWRSYALL-HIWYTEGWQ 278 K +F G TPA+ + P + + + H W G + Sbjct: 151 FRIAHKLKFAGPSADTPAKTEAALLKTYPQKDWLYINHQWVHFGRE 196 >UniRef50_C8WCH4 HhH-GPD family protein n=3 Tax=Zymomonas mobilis RepID=C8WCH4_ZYMMN Length = 206 Score = 41.2 bits (95), Expect = 0.040, Method: Compositional matrix adjust. Identities = 46/174 (26%), Positives = 73/174 (41%), Gaps = 26/174 (14%) Query: 113 AFEQGV----RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQA 168 A E+GV R I+GQ + +A + ++ G+ T RL + D Sbjct: 38 AVERGVPSLARVIVGQQLHTKVADGIWQKLVCSIGD-----------ITADRLLSVDEAI 86 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTA-NYFA--- 224 L+ G+ + L LA ++ G +P + A+ L + GIGRWTA NY Sbjct: 87 LRQCGLSPSKIAYLKDLAMRSVSGLDLFALPEGDDDAVDLLMSVHGIGRWTAENYLIFAE 146 Query: 225 --LRGWQAKD--VFLPDDYLIK-QRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 L W A D + + YL + P M + R ++P+RS L +W+ Sbjct: 147 GRLDIWPAADLGIRIATGYLYQLSHRPDMK--ETRGLGAIFRPYRSIMALFLWH 198 >UniRef50_Q0AQT9 HhH-GPD family protein n=2 Tax=Hyphomonadaceae RepID=Q0AQT9_MARMM Length = 202 Score = 41.2 bits (95), Expect = 0.041, Method: Compositional matrix adjust. Identities = 45/174 (25%), Positives = 73/174 (41%), Gaps = 28/174 (16%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGER--LDDF-PEYICFPTPQRLAAADPQALKALGM 174 RAI GQ +SV A + RV G + LD F PE + + L+A G+ Sbjct: 46 CRAISGQQISVKAAQSIWGRVEASAGRQPLLDHFCPE-------------NTETLRACGL 92 Query: 175 PLKRAEALIHLANAA------LEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + +I +A E M+I G E+ L G+G+WTA+ + + Sbjct: 93 SGAKTRTIITIAETHRSVGLDTEALKNMSIAGRTER----LTEIWGVGQWTADMMNIFYF 148 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQIR--RYAERWKPWRSYALLHIWYTEGWQPD 280 DV+ D ++ +T + + R A +KP+RS+ ++W PD Sbjct: 149 GEPDVWPDGDVAARKTLERLTSKRRKTVRTAAMFKPYRSWLAYYMWAHVDAPPD 202 >UniRef50_Q8DCC1 3-methyladenine DNA glycosylase n=6 Tax=Vibrio RepID=Q8DCC1_VIBVU Length = 208 Score = 40.8 bits (94), Expect = 0.052, Method: Compositional matrix adjust. Identities = 44/162 (27%), Positives = 63/162 (38%), Gaps = 20/162 (12%) Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 RA+ GQ +SV A + RV L E+ Q QAL+ G+ + Sbjct: 49 RAVAGQQLSVNAAKTIWRRVESLSAEK---------GSLQQTFVDEHYQALRDCGLSNAK 99 Query: 179 AEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTAN-----YFALRG-WQA 230 + ++ + A +EG L D + +K L GIG WTA +F + W A Sbjct: 100 VKTILGINQALMEGALDSAFLASNDPQTIVKQLTGLWGIGPWTAEMALMFFFGMPDVWSA 159 Query: 231 KDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D L G+ I A P+R+Y LHIW Sbjct: 160 GDAALMRGLSSLAEKEGVDAEAILSAA---TPYRTYLALHIW 198 >UniRef50_A4ABI1 DNA-3-methyladenine glycosidase II n=2 Tax=unclassified Gammaproteobacteria RepID=A4ABI1_9GAMM Length = 204 Score = 40.0 bits (92), Expect = 0.089, Method: Compositional matrix adjust. Identities = 46/168 (27%), Positives = 72/168 (42%), Gaps = 20/168 (11%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F V I Q VS+A A +VA L PE+ T + +AL+A G Sbjct: 42 FPTLVYIIFEQQVSLASAKSTYDKVAAL-------LPEF----TAEAYLQLSDEALRAAG 90 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPG-DVEQAMKTLQT-FPGIGRWTANYFALRGWQAK 231 + ++A +A A + G LP+ G ++ ++TL T GIG WTA+ + + + Sbjct: 91 VSRQKARYTRLVAEATIAGDLPIHALGRKPDEEVRTLLTAITGIGNWTADVYLMLALRRP 150 Query: 232 DVFLPDDY-------LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D++ D IK R + ER++P+RS A W Sbjct: 151 DLWPVGDLALVKAATAIKSRPHKPDKLWLENLGERYRPYRSVATGIFW 198 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P04395 DNA-3-methyladenine glycosylase 2 n=122 Tax=Ente... 366 e-100 UniRef50_A8MFS4 Transcriptional regulator, AraC family n=2 Tax=C... 313 4e-84 UniRef50_D1R7A9 Putative uncharacterized protein n=1 Tax=Parachl... 311 2e-83 UniRef50_UPI0001BC59A0 AraC family transcriptional regulator n=1... 306 5e-82 UniRef50_A8GHQ3 Transcriptional regulator, AraC family n=3 Tax=P... 306 6e-82 UniRef50_A3ETF2 Putative Ada DNA repair protein and transcriptio... 305 1e-81 UniRef50_Q6MA41 Putative DNA-3-methyladenine glycosidase II n=3 ... 304 2e-81 UniRef50_Q2SDC7 Adenosine deaminase n=3 Tax=Bacteria RepID=Q2SDC... 301 2e-80 UniRef50_C8QGF9 Transcriptional regulator, AraC family n=1 Tax=P... 297 2e-79 UniRef50_A6SU78 Methylated-DNA-[protein]-cysteine S-methyltransf... 289 1e-76 UniRef50_Q0AGQ6 DNA-3-methyladenine glycosylase II / DNA-O6-meth... 287 3e-76 UniRef50_Q1IT49 DNA-3-methyladenine glycosylase II / Transcripti... 285 9e-76 UniRef50_Q46QC3 Transcriptional regulator Ada n=25 Tax=cellular ... 280 4e-74 UniRef50_C0WE04 Transcriptional regulator n=1 Tax=Acidaminococcu... 280 5e-74 UniRef50_D2L7I2 Transcriptional regulator, AraC family n=1 Tax=D... 274 2e-72 UniRef50_A7HP34 Ada metal-binding domain protein n=3 Tax=Bacteri... 274 3e-72 UniRef50_Q2RNZ4 Transcriptional regulator Ada / DNA-3-methyladen... 273 5e-72 UniRef50_Q02KH7 DNA-3-methyladenine glycosidase II n=8 Tax=Pseud... 272 7e-72 UniRef50_Q12D18 Transcriptional regulator Ada / DNA-O6-methylgua... 268 1e-70 UniRef50_B0RQX4 DNA methylation and regulatory protein (Methylat... 267 3e-70 UniRef50_B0SWZ0 Transcriptional regulator, AraC family n=7 Tax=B... 267 4e-70 UniRef50_B9DJS2 Putative uncharacterized protein n=1 Tax=Staphyl... 264 2e-69 UniRef50_A6EY17 Transcriptional Regulator, AraC family protein n... 262 1e-68 UniRef50_Q1ZAD8 Hypothetical ada regulatory protein n=2 Tax=Phot... 261 2e-68 UniRef50_A5KSU6 Transcriptional regulator, AraC family n=1 Tax=c... 260 4e-68 UniRef50_C7R5W7 Transcriptional regulator, AraC family n=1 Tax=K... 258 2e-67 UniRef50_Q10630 Methylated-DNA--protein-cysteine methyltransfera... 258 2e-67 UniRef50_UPI0001901D5D methylated-DNA--protein-cysteine methyltr... 257 3e-67 UniRef50_A4SQS2 DNA methylation and regulatory protein n=2 Tax=A... 255 1e-66 UniRef50_C0Q970 AlkA n=1 Tax=Desulfobacterium autotrophicum HRM2... 252 8e-66 UniRef50_B4S0Y6 Ada regulatory protein n=3 Tax=Alteromonas macle... 252 1e-65 UniRef50_Q2T2N2 DNA-3-methyladenine glycosylase II n=65 Tax=Burk... 251 2e-65 UniRef50_A1WKZ8 DNA-3-methyladenine glycosylase II / Transcripti... 250 3e-65 UniRef50_A1TR03 DNA-O6-methylguanine--protein-cysteine S-methylt... 250 3e-65 UniRef50_B7RWC6 AlkA N-terminal domain family protein n=1 Tax=ma... 250 4e-65 UniRef50_B1ZFN9 AlkA domain protein n=6 Tax=Methylobacterium Rep... 250 5e-65 UniRef50_A7HG85 AlkA domain protein n=2 Tax=Myxococcales RepID=A... 250 6e-65 UniRef50_Q6MR46 DNA methylation and regulatory protein Ada n=1 T... 248 1e-64 UniRef50_Q3IBU8 Putative ADA regulatory protein (Regulatory prot... 244 3e-63 UniRef50_Q15P13 DNA-O6-methylguanine--protein-cysteine S-methylt... 243 5e-63 UniRef50_Q1QTR7 Transcriptional regulator Ada / DNA-3-methyladen... 240 5e-62 UniRef50_C4DFD0 DNA-3-methyladenine glycosylase II; Transcriptio... 239 7e-62 UniRef50_C7MYM6 DNA-3-methyladenine glycosylase II /DNA-O6-methy... 233 7e-60 UniRef50_A3XSB2 Ada regulatory protein n=1 Tax=Vibrio sp. MED222... 232 7e-60 UniRef50_C1YI07 DNA-O6-methylguanine--protein-cysteine S-methylt... 232 1e-59 UniRef50_D1BI44 DNA-3-methyladenine glycosylase II /DNA-O6-methy... 231 3e-59 UniRef50_Q2IPL2 Transcriptional regulator Ada / DNA-O6-methylgua... 230 5e-59 UniRef50_B2GIR9 Putative methylated-DNA--protein-cysteine methyl... 228 2e-58 UniRef50_B0KRT0 AlkA domain protein n=1 Tax=Pseudomonas putida G... 227 2e-58 UniRef50_A0JV31 DNA-O6-methylguanine--protein-cysteine S-methylt... 227 3e-58 UniRef50_A8LHD8 Transcriptional regulator, AraC family n=4 Tax=A... 227 4e-58 UniRef50_P37878 DNA-3-methyladenine glycosylase n=4 Tax=Bacillac... 224 3e-57 UniRef50_D0LE01 Ada metal-binding domain protein n=1 Tax=Gordoni... 223 7e-57 UniRef50_Q12L65 DNA-O6-methylguanine--protein-cysteine S-methylt... 221 2e-56 UniRef50_UPI0000E0EED3 Ada family regulatory protein n=1 Tax=Gla... 220 4e-56 UniRef50_C7QDZ2 Transcriptional regulator, AraC family n=2 Tax=A... 219 8e-56 UniRef50_A1S7Q4 DNA-3-methyladenine glycosylase II / DNA-O6-meth... 218 1e-55 UniRef50_A3D6C4 Transcriptional regulator Ada / DNA-3-methyladen... 217 4e-55 UniRef50_Q7MGD3 Adenosine deaminase n=51 Tax=Vibrionales RepID=Q... 213 4e-54 UniRef50_C0ZIT0 DNA-3-methyladenine glycosylase II n=75 Tax=Baci... 213 5e-54 UniRef50_C4L050 DNA-3-methyladenine glycosylase II n=4 Tax=Bacil... 213 6e-54 UniRef50_C6XZ60 HhH-GPD family protein n=1 Tax=Pedobacter hepari... 213 6e-54 UniRef50_D1C0H7 Transcriptional regulator, AraC family n=1 Tax=X... 213 6e-54 UniRef50_C6D2P4 DNA-3-methyladenine glycosylase II n=1 Tax=Paeni... 211 2e-53 UniRef50_A5CSR4 Putative DNA glycosylase n=2 Tax=Clavibacter mic... 205 1e-51 UniRef50_C0Z5U6 Putative DNA-3-methyladenine glycosylase II n=1 ... 205 2e-51 UniRef50_Q1YTX8 Putative DNA-3-methyladenine glycosylase II n=1 ... 204 4e-51 UniRef50_C5C5F4 HhH-GPD family protein n=1 Tax=Beutenbergia cave... 202 1e-50 UniRef50_A4BNP3 3-methyladenine DNA glycosylase/8-oxoguanineDNA ... 198 2e-49 UniRef50_O31544 Putative DNA-3-methyladenine glycosylase yfjP n=... 197 3e-49 UniRef50_UPI00018509D2 YfjP n=1 Tax=Bacillus coahuilensis m4-4 R... 197 4e-49 UniRef50_C8XKJ9 AlkA domain protein n=1 Tax=Nakamurella multipar... 196 7e-49 UniRef50_C7PMW8 8-oxoguanine DNA glycosylase domain protein n=1 ... 195 2e-48 UniRef50_C6W476 HhH-GPD family protein n=1 Tax=Dyadobacter ferme... 194 3e-48 UniRef50_D2PPK3 Transcriptional regulator, AraC family n=1 Tax=K... 193 4e-48 UniRef50_D1CD20 DNA-3-methyladenine glycosylase II n=1 Tax=Therm... 192 9e-48 UniRef50_Q7N9Z6 Similarities with the C-terminal region of 3-met... 188 2e-46 UniRef50_D1P0X5 DNA-3-methyladenine glycosylase II n=4 Tax=Enter... 186 5e-46 UniRef50_A9B7A8 Transcriptional regulator, AraC family n=1 Tax=H... 185 1e-45 UniRef50_B9XBY0 HhH-GPD family protein n=1 Tax=bacterium Ellin51... 185 1e-45 UniRef50_C6MGP3 HhH-GPD family protein n=1 Tax=Nitrosomonas sp. ... 184 4e-45 UniRef50_C7MAP3 Adenosine deaminase n=1 Tax=Brachybacterium faec... 183 6e-45 UniRef50_C1RNZ7 DNA-3-methyladenine glycosylase II n=1 Tax=Cellu... 183 7e-45 UniRef50_Q81IC3 DNA-3-methyladenine glycosylase II n=75 Tax=Baci... 182 1e-44 UniRef50_A1ZCF3 HhH-GPD n=1 Tax=Microscilla marina ATCC 23134 Re... 181 2e-44 UniRef50_Q2BC23 DNA-3-methyladenine glycosylase II n=1 Tax=Bacil... 181 3e-44 UniRef50_Q9KC25 DNA-3-methyladenine glycosidase n=1 Tax=Bacillus... 181 3e-44 UniRef50_B3T536 Putative HhH-GPD superfamily base excision DNA r... 180 4e-44 UniRef50_B7K2N0 DNA-3-methyladenine glycosylase II n=5 Tax=Chroo... 180 4e-44 UniRef50_Q1ITU3 DNA-3-methyladenine glycosylase II n=2 Tax=Bacte... 180 5e-44 UniRef50_Q2FMK1 HhH-GPD n=1 Tax=Methanospirillum hungatei JF-1 R... 180 6e-44 UniRef50_D0J4I7 HhH-GPD n=2 Tax=Comamonas testosteroni RepID=D0J... 180 7e-44 UniRef50_A6CCG3 Probable DNA-3-methyladenine glycosylase n=1 Tax... 179 1e-43 UniRef50_Q82VT3 HhH-GPD n=2 Tax=Betaproteobacteria RepID=Q82VT3_... 178 1e-43 UniRef50_A6WG49 HhH-GPD family protein n=5 Tax=Actinomycetales R... 178 1e-43 UniRef50_C7NLP9 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 178 2e-43 UniRef50_Q5NXL1 DNA-3-methyladenine glycosidase II n=3 Tax=Betap... 178 2e-43 UniRef50_Q01SY7 DNA-3-methyladenine glycosylase II n=1 Tax=Candi... 176 6e-43 UniRef50_A5KST9 DNA-3-methyladenine glycosylase II n=1 Tax=candi... 176 7e-43 UniRef50_C1A5A1 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatim... 175 2e-42 UniRef50_A9RKT9 Predicted protein (Fragment) n=1 Tax=Physcomitre... 175 2e-42 UniRef50_B2SXP8 HhH-GPD family protein n=39 Tax=Betaproteobacter... 174 3e-42 UniRef50_A9BVD9 HhH-GPD family protein n=1 Tax=Delftia acidovora... 173 4e-42 UniRef50_D1Z1B8 Putative DNA glycosidase n=1 Tax=Methanocella pa... 173 5e-42 UniRef50_A9T041 Predicted protein n=1 Tax=Physcomitrella patens ... 172 1e-41 UniRef50_UPI00016C4C1A DNA-3-methyladenine glycosylase II n=1 Ta... 172 2e-41 UniRef50_Q2SX77 DNA-3-methyladenine glycosylase n=60 Tax=Betapro... 171 2e-41 UniRef50_Q9LN45 F18O14.25 n=22 Tax=Magnoliophyta RepID=Q9LN45_ARATH 171 3e-41 UniRef50_D1HE56 Whole genome shotgun sequence of line PN40024, s... 171 3e-41 UniRef50_B4X1U6 Base excision DNA repair protein, HhH-GPD family... 170 5e-41 UniRef50_C2AV46 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 170 6e-41 UniRef50_C1D8D7 HhH-GPD family protein n=1 Tax=Laribacter hongko... 170 8e-41 UniRef50_A2QHV8 Contig An04c0070, complete genome n=10 Tax=Eurot... 169 9e-41 UniRef50_D1C1F2 HhH-GPD family protein n=1 Tax=Sphaerobacter the... 168 1e-40 UniRef50_Q0BSG3 DNA-3-methyladenine glycosylase II n=12 Tax=Prot... 168 1e-40 UniRef50_Q92383 DNA-3-methyladenine glycosylase 1 n=1 Tax=Schizo... 168 2e-40 UniRef50_B4CYJ1 DNA-3-methyladenine glycosylase II n=1 Tax=Chtho... 167 4e-40 UniRef50_B8GAB8 DNA-3-methyladenine glycosylase II n=3 Tax=Chlor... 167 4e-40 UniRef50_C7PK12 HhH-GPD family protein n=1 Tax=Chitinophaga pine... 167 4e-40 UniRef50_B8IZY6 HhH-GPD family protein n=8 Tax=Bacteria RepID=B8... 166 6e-40 UniRef50_A9EU33 Methylated-DNA--protein-cysteine methyltransfera... 166 1e-39 UniRef50_C6WJ98 Transcriptional regulator, AraC family n=5 Tax=A... 165 1e-39 UniRef50_D1RHI7 HhH-GPD family base excision repair protein n=1 ... 165 2e-39 UniRef50_B0U6C0 DNA-3-methyladenine glycosidase n=16 Tax=Xanthom... 165 2e-39 UniRef50_B1YMD5 HhH-GPD family protein n=1 Tax=Exiguobacterium s... 164 3e-39 UniRef50_B2B817 Predicted CDS Pa_2_12990 n=8 Tax=Leotiomyceta Re... 164 4e-39 UniRef50_C7MP98 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 164 4e-39 UniRef50_B9LPN6 HhH-GPD family protein n=4 Tax=Halobacteriaceae ... 163 7e-39 UniRef50_A5WCQ9 HhH-GPD family protein n=2 Tax=Psychrobacter Rep... 163 8e-39 UniRef50_C5G8B3 DNA-3-methyladenine glycosylase n=8 Tax=Onygenal... 163 9e-39 UniRef50_Q3INX6 DNA N-glycosylase / DNA lyase n=6 Tax=Halobacter... 162 9e-39 UniRef50_C6CD76 DNA-3-methyladenine glycosylase II n=1 Tax=Dicke... 162 1e-38 UniRef50_A6GQ39 3-methyladenine DNA glycosylase II n=1 Tax=Limno... 161 2e-38 UniRef50_Q5FSB3 DNA-3-methyladenine glycosylase n=1 Tax=Gluconob... 161 2e-38 UniRef50_Q0VPN7 Putative uncharacterized protein n=1 Tax=Alcaniv... 161 3e-38 UniRef50_B4B851 DNA-3-methyladenine glycosylase II n=2 Tax=Cyano... 160 8e-38 UniRef50_A5V920 HhH-GPD family protein n=7 Tax=Sphingomonadales ... 159 1e-37 UniRef50_B1ZV80 Transcriptional regulator, AraC family n=2 Tax=O... 158 2e-37 UniRef50_B6EMH3 DNA repair protein n=2 Tax=Gammaproteobacteria R... 157 3e-37 UniRef50_D0J2I3 HhH-GPD n=6 Tax=Comamonadaceae RepID=D0J2I3_COMTE 157 5e-37 UniRef50_Q6BZL7 DEHA2A00418p n=2 Tax=Debaryomyces hansenii RepID... 156 6e-37 UniRef50_Q9ZET9 DNA-3-methyladenine glycosidase (Fragment) n=1 T... 156 7e-37 UniRef50_C8WLI9 HhH-GPD family protein n=4 Tax=Bacteria RepID=C8... 156 7e-37 UniRef50_A1K6J5 DNA-3-methyladenine glycosylase II n=21 Tax=Prot... 156 9e-37 UniRef50_D2QEN8 HhH-GPD family protein n=1 Tax=Spirosoma lingual... 156 1e-36 UniRef50_B5ES79 HhH-GPD family protein n=4 Tax=Acidithiobacillus... 155 1e-36 UniRef50_A0RYQ2 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 155 1e-36 UniRef50_B2J3A5 HhH-GPD family protein n=1 Tax=Nostoc punctiform... 155 1e-36 UniRef50_C1DYL3 Predicted protein n=2 Tax=Micromonas RepID=C1DYL... 155 2e-36 UniRef50_C8W0S2 HhH-GPD family protein n=6 Tax=Bacteria RepID=C8... 154 3e-36 UniRef50_A7EZ08 Putative uncharacterized protein n=1 Tax=Sclerot... 153 5e-36 UniRef50_Q1AWP7 DNA-3-methyladenine glycosylase II n=1 Tax=Rubro... 153 6e-36 UniRef50_C6IXS6 DNA-3-methyladenine glycosidase n=1 Tax=Paenibac... 153 7e-36 UniRef50_A6TTX3 Methylated-DNA--protein-cysteine methyltransfera... 152 1e-35 UniRef50_B3E6X3 HhH-GPD family protein n=2 Tax=Bacteria RepID=B3... 151 2e-35 UniRef50_UPI0000D54B32 HhH-GPD n=1 Tax=Psychroflexus torquis ATC... 151 2e-35 UniRef50_D0LW65 DNA-3-methyladenine glycosylase II n=1 Tax=Halia... 151 3e-35 UniRef50_Q6CEP5 YALI0B14080p n=1 Tax=Yarrowia lipolytica RepID=Q... 151 4e-35 UniRef50_Q5K8T8 DNA-3-methyladenine glycosidase, putative n=1 Ta... 149 7e-35 UniRef50_A9FBN7 Putative DNA-3-methyladenine glycosidase n=1 Tax... 149 1e-34 UniRef50_B8GY42 DNA-3-methyladenine glycosylase II n=4 Tax=Caulo... 148 1e-34 UniRef50_Q04UT1 DNA-3-methyladenine glycosylase II n=4 Tax=Lepto... 148 2e-34 UniRef50_A9M750 HhH-GPD family protein n=55 Tax=Rhizobiales RepI... 148 2e-34 UniRef50_O94468 Probable DNA-3-methyladenine glycosylase 2 n=1 T... 148 3e-34 UniRef50_A0Z859 HhH-GPD protein n=1 Tax=marine gamma proteobacte... 147 4e-34 UniRef50_Q5SLG4 DNA-3-methyladenine glycosidase n=6 Tax=Bacteria... 146 8e-34 UniRef50_B6K1P6 DNA-3-methyladenine glycosylase n=1 Tax=Schizosa... 146 1e-33 UniRef50_C6A294 AlkA 3-methyladenine DNA glycosylase n=9 Tax=The... 145 2e-33 UniRef50_Q1J274 Endonuclease III, DNA-3-methyladenine glycosidas... 144 3e-33 UniRef50_D2LH30 HhH-GPD family protein n=1 Tax=Rhodomicrobium va... 143 5e-33 UniRef50_Q0BWS7 Putative DNA-3-methyladenine glycosylase n=1 Tax... 143 8e-33 UniRef50_D0XPK8 HhH-GPD family protein n=1 Tax=Brevundimonas sub... 143 9e-33 UniRef50_A6EE77 3-methyladenine DNA glycosylase n=1 Tax=Pedobact... 143 1e-32 UniRef50_B0D0G2 Predicted protein n=1 Tax=Laccaria bicolor S238N... 142 1e-32 UniRef50_B6G8M1 Putative uncharacterized protein n=1 Tax=Collins... 142 1e-32 UniRef50_A8TVS7 HhH-GPD n=1 Tax=alpha proteobacterium BAL199 Rep... 142 1e-32 UniRef50_A3JFL8 3-methyladenine DNA glycosylase n=1 Tax=Marinoba... 141 3e-32 UniRef50_A8IJX2 HhH-GPD protein n=1 Tax=Azorhizobium caulinodans... 139 8e-32 UniRef50_Q0USE2 Putative uncharacterized protein n=1 Tax=Phaeosp... 139 1e-31 UniRef50_B8EL05 HhH-GPD family protein n=5 Tax=Alphaproteobacter... 138 1e-31 UniRef50_Q8TL35 DNA-3-methyladenine glycosylase II n=1 Tax=Metha... 138 3e-31 UniRef50_Q1H1S0 DNA-3-methyladenine glycosylase II n=1 Tax=Methy... 136 7e-31 UniRef50_A9I9J6 DNA-3-methyladenine glycosidase II n=1 Tax=Borde... 134 2e-30 UniRef50_Q3B3Y2 HhH-GPD n=1 Tax=Chlorobium luteolum DSM 273 RepI... 134 3e-30 UniRef50_Q4ZR24 DNA-3-methyladenine glycosylase II n=4 Tax=Pseud... 133 5e-30 UniRef50_D1VDS6 HhH-GPD family protein n=3 Tax=Actinomycetales R... 133 1e-29 UniRef50_Q972N8 Putative uncharacterized protein ST1094 n=1 Tax=... 131 2e-29 UniRef50_B5IDT4 Base excision DNA repair protein, HhH-GPD family... 131 2e-29 UniRef50_D1VAP6 HhH-GPD family protein n=1 Tax=Frankia sp. EuI1c... 130 5e-29 UniRef50_Q55703 Slr0231 protein n=1 Tax=Synechocystis sp. PCC 68... 129 7e-29 UniRef50_O28163 3-methyladenine DNA glycosylase (AlkA) n=1 Tax=A... 128 2e-28 UniRef50_B4S806 8-oxoguanine DNA glycosylase domain protein n=1 ... 128 3e-28 UniRef50_B3QN63 8-oxoguanine DNA glycosylase domain protein n=2 ... 125 1e-27 UniRef50_UPI0001B54083 YfjP n=1 Tax=Streptomyces sp. AA4 RepID=U... 120 6e-26 UniRef50_B3EJD3 8-oxoguanine DNA glycosylase domain protein n=1 ... 119 1e-25 UniRef50_Q754R1 AFR011Wp n=1 Tax=Eremothecium gossypii RepID=Q75... 117 5e-25 UniRef50_C0NIP1 Putative uncharacterized protein n=1 Tax=Ajellom... 116 8e-25 Sequences not found previously or not previously below threshold: UniRef50_B6JZD7 DNA-3-methyladenine glycosylase n=1 Tax=Schizosa... 168 2e-40 UniRef50_D1ZEJ1 Whole genome shotgun sequence assembly, scaffold... 168 2e-40 UniRef50_C1XHZ0 DNA-3-methyladenine glycosylase II n=2 Tax=Meiot... 161 2e-38 UniRef50_B7K9B1 HhH-GPD family protein n=3 Tax=Cyanobacteria Rep... 159 1e-37 UniRef50_B2W1R2 DNA-3-methyladenine glycosylase n=1 Tax=Pyrenoph... 156 8e-37 UniRef50_Q7NJ14 Gll2018 protein n=1 Tax=Gloeobacter violaceus Re... 149 1e-34 UniRef50_Q688W2 Os05g0567500 protein n=2 Tax=Oryza sativa RepID=... 146 9e-34 UniRef50_A8QA43 Putative uncharacterized protein n=1 Tax=Malasse... 145 2e-33 UniRef50_C5SF64 HhH-GPD family protein n=1 Tax=Asticcacaulis exc... 143 5e-33 UniRef50_Q8F6D8 DNA-3-methyladenine glycosylase n=2 Tax=Leptospi... 143 8e-33 UniRef50_A8N5M3 Putative uncharacterized protein n=2 Tax=Agarica... 143 1e-32 UniRef50_A8J6X9 Predicted protein (Fragment) n=1 Tax=Chlamydomon... 142 1e-32 UniRef50_B6U6Y8 DNA-3-methyladenine glycosylase 1 n=8 Tax=Magnol... 141 2e-32 UniRef50_D1IGU3 Whole genome shotgun sequence of line PN40024, s... 141 3e-32 UniRef50_Q4A0G9 Putative DNA-3-methyladenine glycosidase n=1 Tax... 138 2e-31 UniRef50_Q7CSU9 DNA-3-methyladenine glycosidase II n=1 Tax=Agrob... 136 6e-31 UniRef50_C8WCH4 HhH-GPD family protein n=3 Tax=Zymomonas mobilis... 135 2e-30 UniRef50_Q8DCC1 3-methyladenine DNA glycosylase n=6 Tax=Vibrio R... 134 3e-30 UniRef50_Q1D1V1 HhH-GPD domain protein n=15 Tax=cellular organis... 134 3e-30 UniRef50_Q4PHN3 Putative uncharacterized protein n=1 Tax=Ustilag... 133 6e-30 UniRef50_Q0AQT9 HhH-GPD family protein n=2 Tax=Hyphomonadaceae R... 133 7e-30 UniRef50_Q5CV50 DNA-3-methyladenine glycosidase (Fragment) n=2 T... 132 2e-29 UniRef50_B6R6H8 DNA-3-methyladenine glycosidase II protein n=3 T... 131 2e-29 UniRef50_A4ABI1 DNA-3-methyladenine glycosidase II n=2 Tax=uncla... 131 3e-29 UniRef50_Q7UGU9 DNA-3-methyladenine glycosidase n=1 Tax=Rhodopir... 131 4e-29 UniRef50_B6BS24 DNA-3-methyladenine glycosylase I n=4 Tax=SAR11 ... 129 1e-28 UniRef50_Q1QHT1 HhH-GPD n=1 Tax=Nitrobacter hamburgensis X14 Rep... 128 2e-28 UniRef50_Q0G7C6 Putative dna-3-methyladenine glycosidase ii prot... 128 3e-28 UniRef50_C4DGP2 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 127 4e-28 UniRef50_Q11DX8 HhH-GPD n=3 Tax=Rhizobiales RepID=Q11DX8_MESSB 127 5e-28 UniRef50_B6BWA6 DNA-3-methyladenine glycosylase 1 n=1 Tax=beta p... 126 7e-28 UniRef50_B6JDH5 DNA-3-methyladenine glycosidase II n=14 Tax=Rhiz... 126 1e-27 UniRef50_C0E8I7 Putative uncharacterized protein n=2 Tax=Clostri... 125 2e-27 UniRef50_A4SEG0 8-oxoguanine DNA glycosylase domain protein n=1 ... 124 3e-27 UniRef50_B9JTI1 DNA-3-methyladenine glycosidase II n=1 Tax=Agrob... 124 5e-27 UniRef50_B3ECK0 8-oxoguanine DNA glycosylase domain protein n=1 ... 123 6e-27 UniRef50_B5Y412 Predicted protein n=1 Tax=Phaeodactylum tricornu... 122 1e-26 UniRef50_A3TMT2 Base-excision DNA repair protein n=1 Tax=Janibac... 122 1e-26 UniRef50_Q3ARU6 HhH-GPD n=1 Tax=Chlorobium chlorochromatii CaD3 ... 121 2e-26 UniRef50_C5E2U4 KLTH0H07766p n=2 Tax=Saccharomycetaceae RepID=C5... 121 3e-26 UniRef50_D0JW87 Glycosidase n=1 Tax=Yersinia pestis D182038 RepI... 120 4e-26 UniRef50_Q28QY1 HhH-GPD n=83 Tax=Bacteria RepID=Q28QY1_JANSC 120 6e-26 UniRef50_P22134 DNA-3-methyladenine glycosylase n=7 Tax=Saccharo... 120 6e-26 UniRef50_B4SHB1 8-oxoguanine DNA glycosylase domain protein n=3 ... 118 3e-25 UniRef50_B8KXG2 HhH-GPD family protein n=1 Tax=gamma proteobacte... 117 4e-25 UniRef50_A6SCZ1 Putative uncharacterized protein n=1 Tax=Botryot... 116 1e-24 UniRef50_A5DE48 Putative uncharacterized protein n=2 Tax=Pichia ... 114 2e-24 UniRef50_C1ABY5 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatim... 113 6e-24 UniRef50_C7MW75 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 112 1e-23 UniRef50_A5DYG6 Putative uncharacterized protein n=1 Tax=Loddero... 111 2e-23 UniRef50_A3LTR9 3-methyladenine DNA glycosylase (Fragment) n=5 T... 111 3e-23 UniRef50_A4XJM3 8-oxoguanine DNA glycosylase domain protein n=2 ... 111 4e-23 UniRef50_Q5R0A4 3-methyladenine DNA glycosylase n=2 Tax=Idiomari... 110 4e-23 >UniRef50_P04395 DNA-3-methyladenine glycosylase 2 n=122 Tax=Enterobacteriaceae RepID=3MG2_ECOLI Length = 282 Score = 366 bits (940), Expect = e-100, Method: Composition-based stats. Identities = 282/282 (100%), Positives = 282/282 (100%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH Sbjct: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA Sbjct: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE Sbjct: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 Query: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL Sbjct: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 Query: 241 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA Sbjct: 241 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 >UniRef50_A8MFS4 Transcriptional regulator, AraC family n=2 Tax=Clostridiaceae RepID=A8MFS4_ALKOO Length = 485 Score = 313 bits (803), Expect = 4e-84, Method: Composition-based stats. Identities = 102/287 (35%), Positives = 152/287 (52%), Gaps = 15/287 (5%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRG-----VVTAIPDIARHT 58 L+++PPY W ML FLA RA++ +E V ++ Y R++ + G + R+ Sbjct: 199 LSYRPPYHWEDMLRFLAGRAITGIEVVKNNEYMRTVHLENSEGKPVYGWIRVGHQSKRNA 258 Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP-----GLRLPGCVDA 113 L + +S L V + LA++ LFDL C+P V L + RP G R+PGC +A Sbjct: 259 LSVTVSQALLSVLPQVLARIRHLFDLYCDPDAVYETLQVMNDIRPNLCTLGTRVPGCFNA 318 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAAD---PQA 168 FE VRA+LGQ ++V A+ L AR+ Q YG + E FP+P+ + A + Sbjct: 319 FEMVVRAVLGQQITVKAASTLAARIVQTYGTPIQTGFEGLTHVFPSPEDILALNGPIENH 378 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 L LG+ RA+ + LA A ++G + +P E+ MK L GIG WTA Y A+R Sbjct: 379 LGPLGVIAARAKTIYELAQAFVQGEIDFDLPAQPEEEMKRLMAIRGIGSWTAQYIAMRAM 438 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + D FL D +K+ P T ++ AE W+PWRSYA +++W T Sbjct: 439 EWPDAFLETDAGVKKALPPYTAKELLEIAEAWRPWRSYATVNLWNTL 485 >UniRef50_D1R7A9 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A9_9CHLA Length = 532 Score = 311 bits (797), Expect = 2e-83, Method: Composition-based stats. Identities = 95/284 (33%), Positives = 150/284 (52%), Gaps = 8/284 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L ++PPYDW ++ FL R + VE + Y R++ +G+ +G + + +L Sbjct: 248 ILQLTYRPPYDWKGVINFLRVRLMKGVEHIEGDRYLRTIQLGKTKGWIQISHAEEKQSLI 307 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRL------GAARPGLRLPGCVDAF 114 LS L PV L ++ +FDL P +++ L + PGLR+PG D F Sbjct: 308 FELSHSLLPVLPALLGRIRSVFDLNARPDVISTHLRQDKWLTEAVNVNPGLRIPGAFDGF 367 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKAL 172 E VRAILGQ ++V A L R+ Q +GE++ PTPQRL A L +L Sbjct: 368 ELAVRAILGQQITVKAATTLAGRLVQAFGEKIQTPYPELKHLSPTPQRLTIATVDELASL 427 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ R++++IHLA + G L + E+ M+ L PGIG+WTA+Y A+R + D Sbjct: 428 GIIQSRSKSIIHLAEEVVSGRLQLDADVYPEKTMQKLVQIPGIGKWTAHYIAMRALRWPD 487 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 F +D ++++ +T +Q ++ W+PWRSYA+LH+W Sbjct: 488 AFPKEDIVLRKCLGNVTASQAEILSQSWRPWRSYAVLHLWQNSS 531 >UniRef50_UPI0001BC59A0 AraC family transcriptional regulator n=1 Tax=Fusobacterium ulcerans ATCC 49185 RepID=UPI0001BC59A0 Length = 486 Score = 306 bits (785), Expect = 5e-82, Method: Composition-based stats. Identities = 98/288 (34%), Positives = 152/288 (52%), Gaps = 14/288 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAV----GEYRGVVTAIPDIARH 57 L ++PPY W +L FLA RA+ VETV + Y R++ + A ++ Sbjct: 199 LALGYRPPYQWEHILNFLALRAIPGVETVKEGKYYRTVHFLNGEKHIYSWIQAENQPEKN 258 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP-----GLRLPGCVD 112 T+ + + A L PV ++ LAK+ LFDL C+P V L ++ +P G+R+PGC D Sbjct: 259 TIAVTMPAELLPVLSQVLAKVRNLFDLSCDPYAVYEGLMKMNNIQPNICTLGIRVPGCFD 318 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADP---Q 167 FE VRA+LGQ +++ A L AR+ + +G ++ E + FP P+ + Sbjct: 319 PFEMSVRAVLGQQITIKAAKTLAARITEKFGVTIETGIEGLTHIFPEPEDIYKLKDKITD 378 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 +L LG+ RA+ ++ LA+A + + E+ +K L GIG WTA Y A+R Sbjct: 379 SLGELGIIKTRAKTILELASAFVNKEIDFNFCIHPEEEIKKLMKISGIGNWTAQYIAMRA 438 Query: 228 WQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D FL DY +K+ P TP +I AE W+PWRSYA++++W + Sbjct: 439 MGLTDAFLETDYGVKKALPSYTPKEILTLAEAWRPWRSYAVVNLWNSL 486 >UniRef50_A8GHQ3 Transcriptional regulator, AraC family n=3 Tax=Proteobacteria RepID=A8GHQ3_SERP5 Length = 512 Score = 306 bits (784), Expect = 6e-82, Method: Composition-based stats. Identities = 135/286 (47%), Positives = 178/286 (62%), Gaps = 6/286 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVG----EYRGVVTAIPDIAR 56 ++ L ++PPYDW ML FL RAVS VE V Y R++A+ +Y G V+ P+ + Sbjct: 222 VFHLGYRPPYDWPRMLSFLQTRAVSGVEKVEGQQYLRAIAITQGGIDYHGWVSVQPEESH 281 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 + + + ++ L V E L ++ +LFDL P ++ ALG+L A PGLRLPGCV+ FEQ Sbjct: 282 NRVRVEIAPALSRVTTEVLRRIRQLFDLDAAPDLIVQALGQLAADAPGLRLPGCVNGFEQ 341 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALGM 174 RA+LGQLVSV MAA +A+ +G L+ FP +++A P+ L+ LG+ Sbjct: 342 ATRAVLGQLVSVKMAATFAGCMAERWGTPLEQPYAGITHVFPNAEQVARLQPEELRPLGV 401 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 LKRA ALI +A A EG L + D+EQ +K L PGIG WTA Y A+R W DVF Sbjct: 402 QLKRAAALIAIARAVTEGRLQLENVLDIEQGIKALTALPGIGSWTACYIAMRAWSWPDVF 461 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 L DYLIKQRFPGMTP QI YAE W+PWRSYA LH+W+ +GW P Sbjct: 462 LTGDYLIKQRFPGMTPRQIENYAECWRPWRSYATLHLWHNQGWVPS 507 >UniRef50_A3ETF2 Putative Ada DNA repair protein and transcriptional regulator, AraC family n=2 Tax=Leptospirillum sp. Group II RepID=A3ETF2_9BACT Length = 480 Score = 305 bits (782), Expect = 1e-81, Method: Composition-based stats. Identities = 106/285 (37%), Positives = 158/285 (55%), Gaps = 6/285 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIAR 56 +++L ++PPY W M FL R V+ VE V++ Y R++ + + +G ++A D Sbjct: 196 VFSLGFRPPYAWEAMFDFLGNRTVAGVEEVSEKVYRRAVRIRKGGTTFQGWLSAEADNTG 255 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L + LS L PVA LA++ RLFDL+C+P+++ LG LG PG+R+PG D FE Sbjct: 256 KALRLTLSTSLAPVATTVLARVRRLFDLECHPELIADILGPLGMREPGIRVPGAFDGFET 315 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALGM 174 VR ILGQ VSV A L R+ +G+ +D FP+P+R+A D AL LG+ Sbjct: 316 AVRIILGQQVSVQGARTLAGRLVSAHGDPIDTPWPDITRAFPSPERIAGMDASALSGLGI 375 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 R +A++ LA A EG + + D E M+ L++ PGIG WTA A+R D F Sbjct: 376 FGFRIKAILGLAAAVAEGRITLAPGPDPEPQMEALRSIPGIGEWTAQAIAMRVLSWPDAF 435 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DY I++ +P ++ A++W+PWR+YA L +W P Sbjct: 436 PHTDYGIQKALKEKSPRRVLEVAQQWRPWRAYAALALWRALVESP 480 >UniRef50_Q6MA41 Putative DNA-3-methyladenine glycosidase II n=3 Tax=Bacteria RepID=Q6MA41_PARUW Length = 476 Score = 304 bits (780), Expect = 2e-81, Method: Composition-based stats. Identities = 90/280 (32%), Positives = 149/280 (53%), Gaps = 8/280 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 LN++PPYDW L FL+ R++ +E V ++ Y R++ + EY+G + +H L + Sbjct: 194 LQLNYRPPYDWIGFLNFLSIRSLKGIELVKNNCYLRTVQIREYKGWIHVSHVEDKHCLRV 253 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 +++ L PV A L ++ FDL P + + L A PGLR+PG D FE Sbjct: 254 KIASSLVPVLAILLERIRNFFDLNARPDKISVQLEQDPFLAEEVAKNPGLRVPGTFDGFE 313 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICF--PTPQRLAAADPQALKALG 173 RAILGQ ++V A L +R + +GE + + P+ QR+++ + + +G Sbjct: 314 LAFRAILGQQITVKAATTLASRFVKAFGEEFKTPFAELHYLCPSSQRISSLKWEEIATIG 373 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + RA+ +I LA TL + ++ +K L + GIG+WTA+Y ALR Q D Sbjct: 374 IIRARAQTIIELAKQMSSNTLKLEAGVNLRLTIKQLTSIAGIGQWTAHYIALRALQWPDA 433 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 F +D ++++ +T Q + ++ W+PWRSYA L++W Sbjct: 434 FPKEDVALRKKLGKVTAKQAEKLSQVWRPWRSYATLYLWQ 473 >UniRef50_Q2SDC7 Adenosine deaminase n=3 Tax=Bacteria RepID=Q2SDC7_HAHCH Length = 484 Score = 301 bits (772), Expect = 2e-80, Method: Composition-based stats. Identities = 93/285 (32%), Positives = 142/285 (49%), Gaps = 6/285 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYR----GVVTAIPDIARH 57 + + ++PPY W +L FL+AR ++ V+ V D Y R + V G A + Sbjct: 200 FEMPYRPPYAWDALLSFLSARTIAGVDAVVDGRYHRIVRVEAGEDSAVGWFEASHEADAA 259 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 + I L + L + ++ FD+ C PQ +N LG L PG+R+P +D FE Sbjct: 260 RIRIRLDSTLSRHIGYLINRLRAFFDVSCVPQEINKVLGTLAQNEPGMRIPSGMDGFEIA 319 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALGMP 175 VRAILGQ ++VA A L AR+ +G+ ++ FP+ L + L +LG+ Sbjct: 320 VRAILGQQITVAAARTLLARLVDKFGDPIETPFPEINRTFPSAATLVNLPVEELASLGVI 379 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R A+ +A A L L ++ +VEQ ++ L PGIG WTA Y A+R D F Sbjct: 380 RTRVRAIQEIAAAMLRSELTLSPAANVEQEIQRLHAIPGIGDWTAQYIAMRAMSWPDAFP 439 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 D +++ G+ Q R AE W+PWR YA++H+W + D Sbjct: 440 ASDVGVRKALGGVDAKQSARAAEEWRPWRGYAVMHLWRSLELPHD 484 >UniRef50_C8QGF9 Transcriptional regulator, AraC family n=1 Tax=Pantoea sp. At-9b RepID=C8QGF9_9ENTR Length = 495 Score = 297 bits (762), Expect = 2e-79, Method: Composition-based stats. Identities = 98/289 (33%), Positives = 148/289 (51%), Gaps = 10/289 (3%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PPYDW +L FL R + VE VA+ Y R++A+G +G V + L + Sbjct: 200 LRLSYRPPYDWEAILDFLQQRVMKEVEWVAEGIYHRTVALGGCQGWVRVSHYPEKQALKV 259 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + L PV L ++ LFDL PQ + + L PGLR+PG D FE Sbjct: 260 QFTTSLTPVLPALLRRLRDLFDLDAQPQRIADQLAQDPLLAPSLVRYPGLRVPGAFDGFE 319 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALG 173 GVRAILGQ V+V A L++RVAQ +G + + P+ + LA A + +LG Sbjct: 320 LGVRAILGQQVTVKAATTLSSRVAQRFGAPMATPWPELSRLSPSAETLATATQDDIASLG 379 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + R++A++ LA A G L + + + L GIG WTA+Y A+R + D Sbjct: 380 IVSARSQAILALAQACASGALRFNGAVNPDVVQQQLLALKGIGPWTASYIAMRALRWPDA 439 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 F +D I+ G++ ++ W+PWRSYA+LHIW + P++ Sbjct: 440 FPKEDIAIRNNLGGVSAKDAEVRSQVWRPWRSYAVLHIWKSLT--PEKG 486 >UniRef50_A6SU78 Methylated-DNA-[protein]-cysteine S-methyltransferase n=2 Tax=Oxalobacteraceae RepID=A6SU78_JANMA Length = 499 Score = 289 bits (739), Expect = 1e-76, Method: Composition-based stats. Identities = 103/293 (35%), Positives = 153/293 (52%), Gaps = 19/293 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRGVVTAIPDIAR 56 L ++PPY W ML +LA RA+ VE V + Y RS+ + G + AR Sbjct: 207 LRLAYRPPYAWEPMLAYLAGRAIPGVEGVVEDAPGTLSYVRSVMLNNTAGWLRVTHLPAR 266 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGC 110 L ++L A L PV LA++ + FDL NP+I+ + L + PGLR+PG Sbjct: 267 RQLELSLPATLAPVLMPLLARVRKQFDLDANPEIIAAHLSADALLAQQIRLTPGLRVPGT 326 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF--PEYICFPTPQRLAAADPQA 168 D FE +RA+LGQ VSVA A ++ R+ + +GE D FPT +RLAAAD Sbjct: 327 FDTFELAIRAVLGQQVSVAGATTVSGRLVKAFGEPADTPFIGINRHFPTAERLAAADIGE 386 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + ALGMP RA+ + ++A A++G L M +++ + +L+T GIG WTA Y A+R Sbjct: 387 IAALGMPGSRAQTIQNVARFAVQGGLQMKPGASLDECVSSLKTVRGIGEWTAQYVAMRAL 446 Query: 229 QAKDVFLPDDYLIKQRFPG------MTPAQIRRYAERWKPWRSYALLHIWYTE 275 + D F D +++ +T Q+ A W PWR+Y L +W++ Sbjct: 447 RFPDAFPTGDLGLQKAAVEVAGGTRLTEKQLLLRAAGWSPWRAYTALLLWHSL 499 >UniRef50_Q0AGQ6 DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AGQ6_NITEC Length = 477 Score = 287 bits (735), Expect = 3e-76, Method: Composition-based stats. Identities = 88/281 (31%), Positives = 141/281 (50%), Gaps = 9/281 (3%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L+++PP W+ ++ FL +R+ + + + Y +++ + +G VTA D RH ++ Sbjct: 193 LLRLSYRPPLAWNALIRFLCSRSNLRLSQIQNGNYLQTVNLDGCQGWVTAKHDTKRHQIY 252 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALG------RLGAARPGLRLPGCVDAF 114 + S L P + RLFDL NP I+ LG L A PGLR+PG +D F Sbjct: 253 VQASRSLLPCLIRLQMYLRRLFDLDANPAIIEAHLGNDDILKPLIANHPGLRIPGTLDIF 312 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQALKAL 172 E G+RAILGQ ++V A L R +G+ +D + P + +A QAL + Sbjct: 313 ELGLRAILGQQITVKAATTLFGRFVATFGKPVDTPFPGLDRTSPPAELIADTSLQALIDI 372 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ +RA + A + G L D + ++ L PGIG WTA Y A+R + Sbjct: 373 GLTGRRALTIQRFAQTIVNGALKPES-IDRNKIIEQLLELPGIGPWTAQYIAIRALGDSN 431 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 F D + + PA++ R E+W+PWR+Y +H+W+ Sbjct: 432 AFPASDLGLLRGLRMEKPAELLRRTEKWQPWRAYGAIHLWH 472 >UniRef50_Q1IT49 DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=3 Tax=Bacteria RepID=Q1IT49_ACIBL Length = 477 Score = 285 bits (731), Expect = 9e-76, Method: Composition-based stats. Identities = 97/281 (34%), Positives = 142/281 (50%), Gaps = 12/281 (4%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 ++ L ++PPY W ML FL RA VE V + YARS+++ G +H+L Sbjct: 198 VFRLRYRPPYHWLGMLDFLRPRATPGVECVTEDAYARSISLHGKEGSFEVTHAPEQHSLV 257 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAAR------PGLRLPGCVDAF 114 + ++ + + ++ +FDL + + G L R PG RLPG D F Sbjct: 258 LRVNFEDSSALFQIVERVRAMFDLNADWGSIAGVLENDRLLRGHLKGDPGRRLPGAWDGF 317 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-YICFPTPQRLAAADPQALKALG 173 E VRA+LGQ +SVA A L ++A+ +G L FPTP+ LA A +L Sbjct: 318 ELAVRAVLGQQISVAAATNLAGQIARKFGRPLRKSNGISHLFPTPEILADA-----ASLP 372 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P+KRAE + LA A + L DV Q + L+T PGIG WTA Y ALR + D Sbjct: 373 LPMKRAETIRALACAVRDCELQFDAITDVPQFCEQLKTIPGIGDWTAQYVALRALREPDA 432 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 F D +++ + A++ R AE W+PWR YA +++W Sbjct: 433 FPAGDLGLQKSLGVKSSAELERRAENWRPWRGYAAIYMWSA 473 >UniRef50_Q46QC3 Transcriptional regulator Ada n=25 Tax=cellular organisms RepID=Q46QC3_RALEJ Length = 509 Score = 280 bits (716), Expect = 4e-74, Method: Composition-based stats. Identities = 105/294 (35%), Positives = 145/294 (49%), Gaps = 12/294 (4%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIAR 56 ++ L ++PP W +LGFLA RAV +E V D YAR+L+V RG V R Sbjct: 195 VFELGYRPPLAWEALLGFLAVRAVDGIEQVRDGAYARTLSVESGGTTHRGWVRLDHVPGR 254 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L + LSA L V + L K+ RL DL C P IV+ LG L + PG+RLPG VD FE Sbjct: 255 LVLRVTLSASLARVIPQALGKVRRLCDLGCRPDIVDRHLGELASDVPGMRLPGSVDGFEI 314 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDF--------PEYICFPTPQRLAAADPQA 168 VRA++GQ++SV A ++ AR+ Q G+ L P FP+ LAA Sbjct: 315 AVRAVIGQVISVVQARRILARLGQTAGDALPAPAMPIDGCAPLQHGFPSAAALAALPDAD 374 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + A G+ + L LA G LP+ EQ + L GIG WTA Y A+R Sbjct: 375 MVAAGVSPGKLRTLRALAQRVASGALPLEQHMPPEQTVAALCEIDGIGDWTAQYVAMRAL 434 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 D F DY +++ T ++ +W PWR+YA +H+W+ + Sbjct: 435 GWPDAFPGTDYALRKVLGVNTVRAMQARTAQWAPWRAYAAIHLWHRYEAMKTQG 488 >UniRef50_C0WE04 Transcriptional regulator n=1 Tax=Acidaminococcus sp. D21 RepID=C0WE04_9FIRM Length = 483 Score = 280 bits (716), Expect = 5e-74, Method: Composition-based stats. Identities = 95/287 (33%), Positives = 151/287 (52%), Gaps = 15/287 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE-----YRGVVTAIPDIAR 56 TL ++PPY S + FL RA+ +ETV++ Y R++ + Y G+++ P+ Sbjct: 197 LTLTYRPPYLASPLFDFLKGRAMKGIETVSEGIYKRTVTLAGEKGARYHGIISVSPNKKC 256 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPG-----LRLPGCV 111 + L + LS L PV ++ + ++SR FDL P+ + L + PG +R+PG Sbjct: 257 NALTLTLSDSLLPVLSDVIFRVSRQFDLAAFPETIAAVLYAMNDGVPGTFAEGIRIPGAF 316 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADP--- 166 D FE VRAILGQ ++V A+ L AR + G ++ FPTP+++ + Sbjct: 317 DGFETAVRAILGQQITVKAASTLAARFVAVLGTPIETGHPGLTHLFPTPEKILSYGESLS 376 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 L LG+ ++ ++ LA A ++G+L + E+ K L GIGRWT++Y A+R Sbjct: 377 DELGKLGIISSKSASIRALAQALMDGSLRLDGTRSREETKKALLALKGIGRWTSDYIAMR 436 Query: 227 GWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 + D+FL D IK PG TP + AE W+P+RSYA + +W Sbjct: 437 VLKDPDIFLETDAGIKHALPGTTPKERLTLAEAWRPFRSYATVSLWR 483 >UniRef50_D2L7I2 Transcriptional regulator, AraC family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L7I2_9DELT Length = 482 Score = 274 bits (701), Expect = 2e-72, Method: Composition-based stats. Identities = 107/280 (38%), Positives = 149/280 (53%), Gaps = 6/280 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIARH 57 TL ++PPYDW +LGFL R++ VE VAD Y R+LA+ + G + A++ Sbjct: 198 LTLGYRPPYDWDGLLGFLCLRSIGGVEAVADGVYRRTLAISRNGVVHAGWLAVAHAPAKN 257 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 + + ++AGL PV L ++S LFDL C+P + L L GLRLPG D FE Sbjct: 258 AVRVTVAAGLLPVLPAVLTRVSHLFDLACDPAAIAAGLAGLADGHEGLRLPGAADGFEVA 317 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDF--PEYICFPTPQRLAAADPQALKALGMP 175 VRAILGQ V+VA A L R A +GE + FP P R+A A+ + G+ Sbjct: 318 VRAILGQQVTVAGARTLARRFAAAFGEPVSTPFADLTTVFPGPARVAGLTVDAIASQGIL 377 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA A+I LA A EG L ++ DV L PGIG WTA+Y A+R D F Sbjct: 378 AARARAIIGLARAMAEGGLVLSPAADVAATRAALLALPGIGAWTADYIAMRALAWPDAFP 437 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D+ +K+ P ++ A W+PWR+YA++H+W + Sbjct: 438 HTDFGVKKALGETDPKRVLERAAGWRPWRAYAVMHLWRSL 477 >UniRef50_A7HP34 Ada metal-binding domain protein n=3 Tax=Bacteria RepID=A7HP34_PARL1 Length = 513 Score = 274 bits (701), Expect = 3e-72, Method: Composition-based stats. Identities = 96/290 (33%), Positives = 140/290 (48%), Gaps = 18/290 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PP+D+ +L +L++RA+ VE +++ YARS +G +G+VT P L Sbjct: 226 IRLGYRPPFDFDRILAYLSSRALPGVERISEGRYARSFHLGGVKGLVTVTPAATGSALDA 285 Query: 62 NLSAGLE----PVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCV 111 ++ A++ RLFDL P + + A+G A PGLR+ G Sbjct: 286 RIAVLDAKGGTVPVRAIAARLRRLFDLDAEPGAIAAAFAGDPAIGPRFARVPGLRVAGAF 345 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL--DDFPEYICFPTPQRLAAADPQAL 169 D FE VRA+LGQ +SV A + R+ GE + ++ FP P+ LA AD L Sbjct: 346 DGFELAVRAVLGQQISVKGATTIAGRIVARLGEEVTTEEPGITHFFPAPRALARAD---L 402 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 LG+ R L LA A G L T ++ + L PGIG WTA+Y ALR Sbjct: 403 SGLGLTGGRIATLTSLAQAVASGALDFTPRESLDAKLAELTALPGIGEWTAHYVALRALG 462 Query: 230 AKDVFLPDDYLIKQRFPGMTP---AQIRRYAERWKPWRSYALLHIWYTEG 276 D F D +++ P ++ R AE W+PWR YA L +W +G Sbjct: 463 EPDAFPASDLGLRKAVGKGEPVSTKELERMAESWRPWRGYAALALWTIDG 512 >UniRef50_Q2RNZ4 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=2 Tax=Bacteria RepID=Q2RNZ4_RHORT Length = 486 Score = 273 bits (698), Expect = 5e-72, Method: Composition-based stats. Identities = 109/285 (38%), Positives = 143/285 (50%), Gaps = 18/285 (6%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L + P+DW +L F RAV +E V Y R++A+G RGVVT L Sbjct: 204 VLRLAARQPFDWPGLLAFFRQRAVPGLERVEGDTYVRAIAIGAARGVVTIRGSAEG--LV 261 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + S A +A++ R+FDL + + + L L AARPGLR+PG D F Sbjct: 262 VTPSLDRPEGLAALVARLRRVFDLDADIGAIGAHLGADPLLAPLVAARPGLRVPGAWDGF 321 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE---YICFPTPQRLAAADPQALKA 171 E VRAILGQ VSVA A L R+ +GE L + P FPT RLA AD L Sbjct: 322 ELAVRAILGQQVSVAAATTLAGRLVGAFGEPLTNAPPAGPSRLFPTAARLAEAD---LGG 378 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 LG+ RA+A+ LA A +E + D++ A+ L PGIG WTA Y ALR Sbjct: 379 LGLTTARAKAISGLARAVVETPGLLDPGPDLDSAVARLCRLPGIGPWTAQYIALRALGEA 438 Query: 232 DVFLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIW 272 D D + + TPA + AE W+PWRSYA+LH+W Sbjct: 439 DALPVGDIGVLRALAEDGVRPTPAALLARAEDWRPWRSYAVLHLW 483 >UniRef50_Q02KH7 DNA-3-methyladenine glycosidase II n=8 Tax=Pseudomonas RepID=Q02KH7_PSEAB Length = 297 Score = 272 bits (697), Expect = 7e-72, Method: Composition-based stats. Identities = 102/290 (35%), Positives = 143/290 (49%), Gaps = 16/290 (5%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L +Q P++W A R ++ VE++ D +YARS P R L Sbjct: 9 VLHLPYQSPWEWRQFHQHFALRLLAGVESLGDDHYARSFRANGRPAWFEVRPLAERQVLA 68 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 ++LS +AAE A++ R+FDL +P + + LG L AA PGLRLP D F Sbjct: 69 LSLSPSAHALAAELEARVRRMFDLDSDPAAIARHFAGDPLLGPLVAANPGLRLPVAFDPF 128 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE---YICFPTPQRLAAADPQALKA 171 EQ VRAI+GQ V+V A +T R+ Q GE L++ FPTP LA A+ L Sbjct: 129 EQAVRAIVGQQVTVKAAVTITGRLIQRLGEPLENLGYDGISHLFPTPAALAQAN---LDG 185 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 +GMP KR + L A A G L + + E ++ L PGIG WTA Y ALR Sbjct: 186 IGMPGKRVQTLQRFAAAIASGELSLDLADGPEALVERLCALPGIGPWTAEYIALRAMGEA 245 Query: 232 DVFLPDDYLIKQ----RFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 D F D + + G+ ++ AE W+PWR+YA +H+W+ Sbjct: 246 DAFPAADLGLLKSTVWGPQGIDARSLKARAEAWRPWRAYAAIHLWHHYAA 295 >UniRef50_Q12D18 Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II n=3 Tax=Proteobacteria RepID=Q12D18_POLSJ Length = 504 Score = 268 bits (686), Expect = 1e-70, Method: Composition-based stats. Identities = 93/293 (31%), Positives = 137/293 (46%), Gaps = 14/293 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS----YYARSLAVGEY----RGVVTAIPD 53 L ++PPYD + MLGF + R +S++E VA R+ V G + A D Sbjct: 212 IRLGYRPPYDVAAMLGFFSKRTISAIEFVAADAQHPSIGRTFRVESGGKVHAGWLLAAFD 271 Query: 54 IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDA 113 R L +N+S L V + ++ FDL +P +N L GLR+PG +D Sbjct: 272 ETRSRLVLNVSDSLREVLPLVIRRVRATFDLDADPAAINSVLHAGFPQGDGLRVPGALDG 331 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKA 171 +E VRA+LGQ ++VA A L R+ +GE + FP P LAAA AL Sbjct: 332 YELAVRAVLGQQITVAAARTLAQRMVDRFGEPVQTPWPQLTRLFPAPAMLAAASGDALGQ 391 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 LG+ +R A++ +A A + L + DV ++ L+ PGIG WTA Y A+R + Sbjct: 392 LGIVRQRQAAIVGIAQAVADKRLQLHSGADVHATLEALKALPGIGDWTAQYIAMRALRWP 451 Query: 232 DVFLPDDYLIKQRFP----GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 D F D + + + + WKPWRSYA++ W +P Sbjct: 452 DAFPAGDVALHKAMGVQGLKNPAREAELASHAWKPWRSYAVIRAWSGTLERPG 504 >UniRef50_B0RQX4 DNA methylation and regulatory protein (Methylated-DNA--[protein]-cysteine S-methyltransferase) n=40 Tax=cellular organisms RepID=B0RQX4_XANCB Length = 521 Score = 267 bits (683), Expect = 3e-70, Method: Composition-based stats. Identities = 93/290 (32%), Positives = 139/290 (47%), Gaps = 14/290 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PP D ML FL RA+ +E V Y R + ++ R L + Sbjct: 232 LRLGYRPPLDLPAMLTFLQRRAIPGIEQVDADGYRRVIGAPGQATLIHVSAAPTRDELLL 291 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + A + + ++ R+FDL + V + L + RPGLR+PG D FE Sbjct: 292 RIGATDPRQIPQIVRRVRRIFDLDADLHAVHATLAQDPLLEQAITRRPGLRVPGGWDGFE 351 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQALKALG 173 VRA+LGQ +SVA AA L AR+ +G L D P + FPTP ++A A L+ LG Sbjct: 352 VAVRAVLGQQISVAGAATLAARLVDRHGGHLPDMPPGLDRSFPTPAQMADAP---LEQLG 408 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P RA L LA+A +G L + + PGIG WTA+Y A+R D Sbjct: 409 LPRARAATLRALASACAQGRLHFGAGQRLPDFVAACTALPGIGPWTAHYIAMRALSHPDA 468 Query: 234 FLPDDYLIKQRFPG---MTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 F D +++Q ++ ++ W+PWR+YA+LH+W+ + D Sbjct: 469 FPAGDLILQQVLGAPERLSERATEARSQAWRPWRAYAVLHLWHLAVDRKD 518 >UniRef50_B0SWZ0 Transcriptional regulator, AraC family n=7 Tax=Bacteria RepID=B0SWZ0_CAUSK Length = 505 Score = 267 bits (682), Expect = 4e-70, Method: Composition-based stats. Identities = 98/296 (33%), Positives = 145/296 (48%), Gaps = 22/296 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 TL ++PPYDW ML FLA RA+ VE + + Y R +A+ G + P I L + Sbjct: 206 LTLRYRPPYDWDAMLAFLALRAIPGVEVIESNTYRRVIALDGAAGTIAVSP-IDGDRLSV 264 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + LA++ +FDL +P + + L R+ RPGLR+PG D FE Sbjct: 265 AVRFPKLSALPRILARVRGVFDLSADPVGIAAVLSRDPDLARMVGLRPGLRVPGAWDGFE 324 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERL----DDFPEYICFPTPQRLAAADPQALKA 171 VRAILGQ ++V A KL + +GE L + FP+ +RLAA + L Sbjct: 325 LAVRAILGQQITVVQARKLAGDLVAAHGEPLAQPWTEPGLTHAFPSAERLAATN---LSG 381 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + MP R L +A A + ++ +++ ++ L+ PGIG WTA Y A+R + Sbjct: 382 MKMPGARIRCLSAMAQAIADAPNLLSPTAGLDEMVRRLRALPGIGEWTAQYIAMRQLREP 441 Query: 232 DVFLPDDYLIKQRFPGM-----TPAQIRRYAERWKPWRSYALLHIWYT---EGWQP 279 D F D + + + T Q+ AE W+PWR+YA LH+W + EG P Sbjct: 442 DAFPAADVALMRALADVDGVRPTAEQLLTRAEAWRPWRAYAALHLWASLADEGAPP 497 >UniRef50_B9DJS2 Putative uncharacterized protein n=1 Tax=Staphylococcus carnosus subsp. carnosus TM300 RepID=B9DJS2_STACT Length = 341 Score = 264 bits (676), Expect = 2e-69, Method: Composition-based stats. Identities = 94/307 (30%), Positives = 144/307 (46%), Gaps = 33/307 (10%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY------RGVVTAIPDIA 55 + L +Q PY W+ M+ +L+ RA+ VE V D+YYAR++ + + +G + + Sbjct: 36 FNLYYQTPYIWTAMIDYLSKRAIPRVEIVQDNYYARTVLLKDTATKRAVKGWLKVKNNTK 95 Query: 56 RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFE 115 + L + +SA L + + K+ FDL+ NP+I+N L + GLR+PG + FE Sbjct: 96 NNALLVEMSASLIHEWNKIIQKLRHFFDLEVNPEIINKTLNEDWITK-GLRVPGAFNGFE 154 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQR---LAAADP---Q 167 GVRAILGQ ++V A ++ R+ G + FP P++ LA D Sbjct: 155 LGVRAILGQQITVKAATTISGRLVHALGTPFKTKIAGLDTLFPIPEKFVYLAHCDTPISD 214 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTI-----------------PGDVEQAMKTLQ 210 L LG+ ++R+ + LA A + G + + E M L Sbjct: 215 LLGPLGVTVRRSNTIAALAEAIVNGEVQLNPVVHGVESSIPSNRYNTQMETAESEMNRLL 274 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-GMTPAQIRRYAERWKPWRSYALL 269 GIG+WTA Y +R D FL D IK P TP AE+W P RSYA++ Sbjct: 275 AIKGIGKWTAQYIGMRALGYTDSFLETDIGIKNAMPNDTTPKSRLAVAEKWHPLRSYAVV 334 Query: 270 HIWYTEG 276 ++W T Sbjct: 335 NLWNTLN 341 >UniRef50_A6EY17 Transcriptional Regulator, AraC family protein n=1 Tax=Marinobacter algicola DG893 RepID=A6EY17_9ALTE Length = 504 Score = 262 bits (669), Expect = 1e-68, Method: Composition-based stats. Identities = 96/294 (32%), Positives = 131/294 (44%), Gaps = 21/294 (7%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L +PP+D +L F ARA+ +E V +YARSL + G+V P + Sbjct: 210 VLFLRARPPFDSEQLLAFFRARAIPGLEAVGAHHYARSLCIAGQPGLVICRPSDHPPGVQ 269 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAF 114 + L E A++ RL DL + ++ L R L PGLR+PG + F Sbjct: 270 VILRGPARQSILEVSARIRRLLDLDADLPGISEHLARDPLMEPLVTQHPGLRVPGSWERF 329 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-----YICFPTPQRLAAADPQAL 169 E VRAILGQ VS++ A L R+ YG+ L D FP P L Q L Sbjct: 330 EFSVRAILGQQVSISAARTLAGRLVARYGQPLPDDLARGTGITHRFPEPAALVG---QPL 386 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 LGMP RA+ L + E P D + L + GIG WT Y ALRG Sbjct: 387 NTLGMPGSRADTLARITARFAE---PGFAEQDGNDLLAQLASMRGIGPWTLQYLALRGLG 443 Query: 230 AKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D F D I + + + R+AERW+PWR+YA ++W + P Sbjct: 444 DPDAFPASDLGILKAASHLGGPQDAKALTRHAERWRPWRAYAAQYLWTSLNAHP 497 >UniRef50_Q1ZAD8 Hypothetical ada regulatory protein n=2 Tax=Photobacterium profundum RepID=Q1ZAD8_PHOPR Length = 514 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 88/301 (29%), Positives = 123/301 (40%), Gaps = 44/301 (14%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVG------------------ 42 + TL+++PPY+W + F A+R + +E ++ Y R+ + Sbjct: 212 VITLSYRPPYNWQHLQQFYASRIIEGLEWCDENSYGRTFSFDSDDCSHSVLNTGQNINHS 271 Query: 43 ----EYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR- 97 + G TA + + + + + R DL + + + L R Sbjct: 272 EDAFDCIGEFTAFHIPEKSVFLVRIQLSDLRYLNRVIRNIRRCLDLDADIEHIEARLKRA 331 Query: 98 ---LGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 A GLRLPG FE G+RAILGQ VSV A L +V G + E Sbjct: 332 LNTDILAISGLRLPGTWSPFEAGIRAILGQQVSVQAARNLVTKVV---GNNPINTDERCY 388 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPG 214 FP PQ+L A L L MP KR E + LAN A L + K L G Sbjct: 389 FPLPQQLIA---DELTYLKMPGKRKETIRLLANYACNKPLDDS---------KALLAIAG 436 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 IG WT +Y +RG D+FL D IK+ M A A PWRSY L++W Sbjct: 437 IGPWTVHYLRMRGLSDPDIFLIGDLGIKKALAKMNEAFSPDAA---APWRSYLTLYLWSA 493 Query: 275 E 275 + Sbjct: 494 D 494 >UniRef50_A5KSU6 Transcriptional regulator, AraC family n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KSU6_9BACT Length = 464 Score = 260 bits (665), Expect = 4e-68, Method: Composition-based stats. Identities = 91/287 (31%), Positives = 137/287 (47%), Gaps = 27/287 (9%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 M +++PP+DW +LGF+ RA S E D+ Y R + E + A++ L Sbjct: 194 MLRTDYRPPFDWDLLLGFIKKRATPS-EWATDTTYHRLIGSDE----IVVRNVPAKNYLT 248 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 I + L A L K+ RLFDL NP ++ + L A PG+R+PGC D F Sbjct: 249 IEVPQKLSRHAHAILMKVRRLFDLDANPSVITTVLTNDPYLKPFLADNPGVRVPGCWDNF 308 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 E +RA++GQ VSV+ A + R+ + G TP LAA+ + ++GM Sbjct: 309 EMLIRAVVGQQVSVSAATTVMRRLVERIGS------------TPDTLAASSADEIASIGM 356 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 PLKRA + LA+ + + D ++ Q GIG WT Y LR D Sbjct: 357 PLKRATTIHTLAHKVKNSDIDLN-ECDPQRFADQFQHISGIGPWTIAYLQLRILHWPDAL 415 Query: 235 LPDDYLIKQRF---PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 +D +++ +T A++ +YAE W+PWRSYA+ +W Q Sbjct: 416 PAEDIGLQRALIPYKRITKAELSKYAEAWRPWRSYAVFLLWNASSNQ 462 >UniRef50_C7R5W7 Transcriptional regulator, AraC family n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R5W7_KANKD Length = 461 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 22/278 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PPYDWS M FL R +S++ETV D+ Y R+ ++ +G +A D +R + ++ Sbjct: 202 LKLHYRPPYDWSLMQDFLKQRELSAIETVTDNCYGRTFSIDSSKGHFSAEIDPSRSSFNV 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP----GLRLPGCVDAFEQG 117 + + R+ DL + +++ +L + +P GLRLP D FE G Sbjct: 262 TIEMDDMSKLLTATHHIRRVLDLNSDLEVIENSLAQDVNIKPVLKSGLRLPATWDTFEAG 321 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 V+AILGQ VSV A TA V + G + +D +Y FPT +++ D L L MP Sbjct: 322 VKAILGQQVSVKAAYTHTASVIEQLGSKYND--QYKLFPTAKQIVNGD---LTFLKMPNS 376 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R + L A L + + ++ GIG WT Y LR D F Sbjct: 377 RKQTLHDFAQWYLS---------TSGEDLASILDIKGIGPWTYEYIKLRSGMDSDAFPEK 427 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D + + +E+W+PWRSYA L +W++ Sbjct: 428 DLGVIKAMEQYN----LTNSEQWQPWRSYATLQLWHSL 461 >UniRef50_Q10630 Methylated-DNA--protein-cysteine methyltransferase n=52 Tax=Actinomycetales RepID=ADA_MYCTU Length = 496 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 88/297 (29%), Positives = 128/297 (43%), Gaps = 22/297 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + P+ + + G LAA AV E V D Y R+L + G+V+ P + Sbjct: 202 LRLPVRAPFAFEGVFGHLAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLL 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L A+ RL DL +P+ + + L + PG R+P VD E Sbjct: 262 VL--DDFRDLMTATARCRRLLDLDADPEAIVEALGADPDLRAVVGKAPGQRIPRTVDEAE 319 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKALG 173 VRA+L Q VS A+ R+ YG + D FP+ ++LA DP L Sbjct: 320 FAVRAVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHLA--- 376 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P R + L + + +L + D ++A L PG+G WTA A+RG D Sbjct: 377 VPKARQRTINALVASLADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDA 436 Query: 234 FLPDDYLIKQRFPGMT-PAQIRR---YAERWKPWRSYALLHIWYTEG-----WQPDE 281 F D ++ + PAQ R ++ RW+PWRSYA H+W T W P E Sbjct: 437 FPASDLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQWPPQE 493 >UniRef50_UPI0001901D5D methylated-DNA--protein-cysteine methyltransferase n=1 Tax=Mycobacterium tuberculosis T85 RepID=UPI0001901D5D Length = 361 Score = 257 bits (657), Expect = 3e-67, Method: Composition-based stats. Identities = 88/297 (29%), Positives = 128/297 (43%), Gaps = 22/297 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + P+ + + G LAA AV E V D Y R+L + G+V+ P + Sbjct: 67 LRLPVRAPFAFEGVFGHLAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLL 126 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L A+ RL DL +P+ + + L + PG R+P VD E Sbjct: 127 VL--DDFRDLMTATARCRRLLDLDADPEAIVEALGADPDLRAVVGKAPGQRIPRTVDEAE 184 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKALG 173 VRA+L Q VS A+ R+ YG + D FP+ ++LA DP L Sbjct: 185 FAVRAVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHLA--- 241 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P R + L + + +L + D ++A L PG+G WTA A+RG D Sbjct: 242 VPKARQRTINALVASLADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDA 301 Query: 234 FLPDDYLIKQRFPGMT-PAQIRR---YAERWKPWRSYALLHIWYTEG-----WQPDE 281 F D ++ + PAQ R ++ RW+PWRSYA H+W T W P E Sbjct: 302 FPASDLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQWPPQE 358 >UniRef50_A4SQS2 DNA methylation and regulatory protein n=2 Tax=Aeromonas RepID=A4SQS2_AERS4 Length = 522 Score = 255 bits (653), Expect = 1e-66, Method: Composition-based stats. Identities = 88/287 (30%), Positives = 131/287 (45%), Gaps = 22/287 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PPYD + ML F RA+ +E V + Y R VG+ G + H++ + Sbjct: 213 LQLPYRPPYDVAAMLAFYRLRAIPGLERVDGNVYERRHRVGDQSGWIRIE-QGKGHSIRL 271 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + + L ++ R++DL + Q + + L RL + PG+RLP D +E Sbjct: 272 TVHDLPPAALPDLLYRVRRMWDLDADMQRIGERLGQDPLLARLQSRWPGVRLPAGWDEYE 331 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 +RAI+GQ VSV A + R+ R + PTP +L A D L +GMP Sbjct: 332 VMLRAIVGQQVSVKGAITIMGRLLA----RTEAQFGVAQLPTPAQLCALD---LDGIGMP 384 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R L LA A GTL + D + L PGIG WT Y+ LR Q D F Sbjct: 385 GSRIRTLQGLAAALASGTLSLNTASD-----EQLLALPGIGPWTVAYWRLRCGQDPDAFP 439 Query: 236 PDDYLIKQRFPGMTP---AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D ++++ G ++ +E W+PWR YA +W+ QP Sbjct: 440 ASDLVLQKALGGGDKLPVKEVLVQSEAWQPWRGYAASWLWHAMSEQP 486 >UniRef50_C0Q970 AlkA n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0Q970_DESAH Length = 353 Score = 252 bits (645), Expect = 8e-66, Method: Composition-based stats. Identities = 72/283 (25%), Positives = 124/283 (43%), Gaps = 16/283 (5%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L + P D+S ++ F+ RA+ VE + D Y+R+ +G + + + + + Sbjct: 73 LPYARPLDFSQVIEFMKFRAIQGVEDIEDQRYSRTFRTNRSKGYFIVRDNPGKSAIELTI 132 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGL------RLPGCVDAFEQG 117 E ++ +FDL + +N + G+ RLP ++FE Sbjct: 133 YCDDIRCYMEIYNRVRLMFDLNTDFFPINKKFIKDKLLSKGMSDGHVPRLPIAFNSFEFC 192 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEY--ICFPTPQRLAAADPQALKALGM 174 +RA+LGQ +SV A+ L +R+A+ G + + +FP FP P+ L L+ +G+ Sbjct: 193 IRAVLGQQISVQAASTLASRIAKKAGPQTEKNFPPGLDYFFPGPEELVKTS---LEGIGI 249 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 R + ++A L+ + E K GIG WT NY A+R D F Sbjct: 250 TGVRQATITNIAQGLLDNVFSLNPNQPFETFQKDFSAIRGIGEWTVNYVAMRSLGMVDSF 309 Query: 235 LPDDYLIKQRFPGMTP----AQIRRYAERWKPWRSYALLHIWY 273 D I + +I + AE+W+P+R+YA L +W Sbjct: 310 PAADLGIIKALEKNGKRPGRKEILKQAEKWRPYRAYAALCLWN 352 >UniRef50_B4S0Y6 Ada regulatory protein n=3 Tax=Alteromonas macleodii RepID=B4S0Y6_ALTMD Length = 475 Score = 252 bits (643), Expect = 1e-65, Method: Composition-based stats. Identities = 87/288 (30%), Positives = 137/288 (47%), Gaps = 28/288 (9%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L+++PPY+W ++ FLAARA+S +E V+D+ Y R + GE G A+ + ARH +++ Sbjct: 204 LSYRPPYNWPYVREFLAARAISGMEVVSDNSYGRYFSCGESIGYFNAVHNEARHGFELHI 263 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG----AARPGLRLPGCVDAFEQGVR 119 + + + L DL +P ++ +L + G A GLRLP FE G R Sbjct: 264 DMPDLRNLHKTIENIKLLLDLHADPLLIEESLKQAGLPDNALTAGLRLPSAWSVFESGCR 323 Query: 120 AILGQLVSVAMAAKLTARVAQLYGER-------LDDFPEYICFPTPQRLAAADPQALKAL 172 AI+GQ VSV A + G++ + Y CFPTP+ +A + L+ Sbjct: 324 AIVGQQVSVKAAIGQVTLLVHQLGKKGAVSDKYNTNSTAYYCFPTPEAVAGNNLAFLR-- 381 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R EA+ A L +P K + G+G WT +Y +RG + D Sbjct: 382 -MPQARKEAVRQFACLFLNDKVPNH---------KEILAIKGVGPWTLDYLKMRGERNPD 431 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 V+L D ++++ + P + + A PWRSY L +W Q + Sbjct: 432 VYLEGDLIVRK-MAQLYPVEPAQAA----PWRSYLTLQLWQLSNQQKE 474 >UniRef50_Q2T2N2 DNA-3-methyladenine glycosylase II n=65 Tax=Burkholderia RepID=Q2T2N2_BURTA Length = 343 Score = 251 bits (641), Expect = 2e-65, Method: Composition-based stats. Identities = 99/298 (33%), Positives = 135/298 (45%), Gaps = 19/298 (6%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 ++ L ++PPYDW +L F A RA+ VE V Y R++ G +T + L Sbjct: 46 VFELPFKPPYDWPRVLRFFAGRAIPGVEAVEGGAYRRTVDYRGAVGALTVRKHPRKRCLV 105 Query: 61 INLSAGLEPVAAECLAKMSR-LFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDA 113 + A A +FDL +P + + L L A PGLR+PG + Sbjct: 106 ATVEGDAARHADAAFAARLATMFDLHADPAAIGAHLARDAWLAPLVDAAPGLRVPGAWSS 165 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD---FPEYICFPTPQRLAAADPQALK 170 FE VRAI+GQ VSV A + R+ + GERL FP P LAA D L Sbjct: 166 FELIVRAIVGQQVSVKAATTIVGRLVERAGERLVGHAPGATGWRFPEPAALAACD---LS 222 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTI-PGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 +GMP KRA AL +A A G +P+ D L PGIG WT Y A+R W+ Sbjct: 223 RIGMPGKRAAALQGVARAVAAGDVPLDAYATDPAGVRAALLALPGIGPWTVEYVAMRAWR 282 Query: 230 AKDVFLPDDYLIKQRFPGMT-----PAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 D + D ++ Q PA R A+ W+PWR+YA +H+W + A Sbjct: 283 DADAWPATDLVLMQAIVARDPALDRPASQRLRADAWRPWRAYAAMHLWNEIADRAGSA 340 >UniRef50_A1WKZ8 DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=4 Tax=Bacteria RepID=A1WKZ8_VEREI Length = 581 Score = 250 bits (640), Expect = 3e-65, Method: Composition-based stats. Identities = 94/286 (32%), Positives = 131/286 (45%), Gaps = 18/286 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETV----ADSYYARSLAVG--------EYRGVVT 49 L W+PP D + +L F A R + VE V A R++ + E G ++ Sbjct: 286 LRLAWRPPLDVAALLAFFARRQLHGVEWVLPDGAGPILRRTVRLAPGCTGQPREIIGWIS 345 Query: 50 AIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPG 109 A D +RH L + S L PV + ++ L DL +P +N L GLRLPG Sbjct: 346 ARFDGSRHLLLLQASDSLYPVLPLVIRRVRALLDLDADPAAINAVLHPHFPQGDGLRLPG 405 Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAAADPQ 167 D FE VRA+LGQ V++A A L R+ + G+ + + FP P LAA D Sbjct: 406 AFDGFELAVRAVLGQQVTLAAARTLGQRLVERLGQTIATPWPELQRLFPAPATLAATDGA 465 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 L +G+ +R A++ LA A G L + D E+ L PGIG WTA Y A+R Sbjct: 466 VLGQMGIVRQRQAAIVALARAVDGGQLALHDGADPEKTTAALCALPGIGDWTAQYIAMRV 525 Query: 228 WQAKDVFLPDDYLIKQRF----PGMTPAQIRRYAERWKPWRSYALL 269 + D F D + + A+ W+PWRSYALL Sbjct: 526 LRWPDAFPSGDVALHKALGLQGQKNPARAATAAAQAWRPWRSYALL 571 >UniRef50_A1TR03 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada n=9 Tax=Comamonadaceae RepID=A1TR03_ACIAC Length = 534 Score = 250 bits (640), Expect = 3e-65, Method: Composition-based stats. Identities = 93/299 (31%), Positives = 139/299 (46%), Gaps = 22/299 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVA--DSYYARSLAV-----GEYRGVVTAIPD- 53 L ++PP D + +LGF R + +ETV R+ + E G + A D Sbjct: 226 VRLAYRPPLDIAALLGFFGQRRIHGMETVDVPGLELRRTARLQDAEGRECTGWLAARFDG 285 Query: 54 --------IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGL 105 + + + +S+ L P +A++ L DL +P+ +N L GL Sbjct: 286 GAAAARGGPPKPHVVLRVSSSLLPALPGVIARVRGLLDLDADPEAINAVLHGDFPRGDGL 345 Query: 106 RLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAA 163 R+PG D FE VRA+LGQ V+VA A L RV + +G+ + +C FPTP LAA Sbjct: 346 RVPGAWDGFELAVRAVLGQQVTVAAARTLAQRVVERWGDPVATPWPDLCRLFPTPAVLAA 405 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYF 223 D AL LG+ +R A++ L+ A EG L + DV + L+ PGIG WTA Y Sbjct: 406 CDGDALGQLGIVRQRQAAIVALSRAVAEGRLLLHAAADVAGTIAALRALPGIGDWTAQYI 465 Query: 224 ALRGWQAKDVFLPDDYLIKQRFPGMT----PAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 A+R + D F D + + + ++ W+PWRSYA++ W G Sbjct: 466 AMRALRWPDAFPSGDVALHKALAVQSAPRPARAAEEASQAWRPWRSYAVVRAWAGTGTP 524 >UniRef50_B7RWC6 AlkA N-terminal domain family protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RWC6_9GAMM Length = 471 Score = 250 bits (639), Expect = 4e-65, Method: Composition-based stats. Identities = 80/282 (28%), Positives = 127/282 (45%), Gaps = 24/282 (8%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PPYDW+ ++ FL+ A++ VE + DS Y R+ + P ++ L + Sbjct: 204 LQLQYRPPYDWNGVVDFLSHHAIAGVEEINDSRYRRNFRTTAGVAQLEIKPHKNKNALEL 263 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L + ++ R+FDL NP+ + + ALG L PG R PG FE Sbjct: 264 RLQLPDNSRLMSTVGQVRRMFDLDANPEQISALLQQDTALGPLSKRSPGARSPGHWSLFE 323 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 VRAI+GQ VS A + AR+A+ + FP +AA + MP Sbjct: 324 SAVRAIVGQQVSTVAARTVLARLAKAC-----TKEGIVTFPDAADIAALTDEHFP---MP 375 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +R E L L + + E ++ L F G+G WT A+RG DVF Sbjct: 376 SRRRETLRSLCQTYSD--------REDELTLEALADFKGVGPWTVGMVAVRGAGDPDVFP 427 Query: 236 PDDYLIKQRFPGM--TPAQIRRYAERWKPWRSYALLHIWYTE 275 D +++ + + + ++ A +W+PWRSYA +W + Sbjct: 428 TGDLGLERTWATLPGSEGKLNDAAAQWRPWRSYAANLLWRSY 469 >UniRef50_B1ZFN9 AlkA domain protein n=6 Tax=Methylobacterium RepID=B1ZFN9_METPB Length = 376 Score = 250 bits (638), Expect = 5e-65, Method: Composition-based stats. Identities = 97/293 (33%), Positives = 138/293 (47%), Gaps = 19/293 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 + L +PP+DW + F A A VETV YAR+ + G ++ + ++ I Sbjct: 16 FRLALRPPFDWGHLERFFADHASPGVETVTPGRYARTFLLAGRPGTLSVTCERGSLSVRI 75 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAFE 115 EP A L ++ +FDL +P + LGR L A RPGLR+PG D FE Sbjct: 76 RGPEADEPFEA-ILTRLRAMFDLGADPDAIAAGLGRDPTMAALVARRPGLRMPGAFDGFE 134 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERL------DDFPEYICFPTPQRLAAADPQAL 169 VRAILGQ VSVA A +L R+ +G L D+ FPTP++L A+ + Sbjct: 135 LAVRAILGQQVSVAAATRLAGRLVAAFGTPLGPKVGGDEPGLTHLFPTPEQLLEAEISLV 194 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 L MP R A+ LA A L GD++ + L+ PGIG WTA+Y A+R Sbjct: 195 --LNMPRARGRAIQGLAAAVLATPDLFAPGGDLDATVARLKALPGIGDWTAHYIAMRALA 252 Query: 230 AKDVFLPDDYLIKQRF----PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D F D + + + A W+PWR+YA +H+W + + Sbjct: 253 QADAFPAGDVGLMRALDDGAGRPGRVALLDRAAAWRPWRAYAAIHLWAEDAAR 305 >UniRef50_A7HG85 AlkA domain protein n=2 Tax=Myxococcales RepID=A7HG85_ANADF Length = 485 Score = 250 bits (638), Expect = 6e-65, Method: Composition-based stats. Identities = 95/286 (33%), Positives = 131/286 (45%), Gaps = 19/286 (6%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHT-L 59 + L+++PP DW +L FLAAR + VE V Y R++ +G G V+ D AR T L Sbjct: 196 VLRLDFRPPLDWEALLAFLAARCTAGVEQVEGGAYRRTVRLGGRTGWVSVTRDPARPTAL 255 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDA 113 S L A++ DL P V + L R PGLR+PG D Sbjct: 256 RAEASLSLAGALMPLAARLRAQLDLDARPDAVASRLRRDPLLARALRRHPGLRVPGAFDG 315 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA-------DP 166 + VR I+GQ VSVA A ++ R+A GE + FP RLA + Sbjct: 316 LDAAVRVIVGQQVSVAAATTVSGRLAAALGEPVATP-----FPGLDRLAPSAEAIAAAGV 370 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 A+ +GMP RA ++ LA A G L + GD E L G+G WTA A+R Sbjct: 371 DAIARVGMPGARARTILELARAVAGGGLALHRGGDGEAVRAGLLELSGVGPWTAEVVAMR 430 Query: 227 GWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D F D + + + + AE W+PWRSYA++H+W Sbjct: 431 ALGEPDAFPASDLGVLRALGASSALEAEARAEAWRPWRSYAVMHLW 476 >UniRef50_Q6MR46 DNA methylation and regulatory protein Ada n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MR46_BDEBA Length = 479 Score = 248 bits (634), Expect = 1e-64, Method: Composition-based stats. Identities = 80/281 (28%), Positives = 124/281 (44%), Gaps = 16/281 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PP+D++ +L F + AV +E + R + V G +T + + Sbjct: 193 IRLSYRPPFDFTGLLHFYRSHAVGQLEWFEEGLMHRIIEVNGKVGQITLSDLPDESCIKL 252 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALG------RLGAARPGLRLPGCVDAFE 115 + ++++ L DL +P I+ L L PG+RLP D FE Sbjct: 253 EIDFPDTTALHTIISRVRSLLDLDSDPVIIANVLETDKDMKALLKKHPGIRLPSSWDPFE 312 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGER---LDDFPEYICFPTPQRLAAADPQALKAL 172 V AILGQ+VSV L + L G L D FPTP ++ AD LK+L Sbjct: 313 VVVAAILGQVVSVERGRALVNDLIDLAGSDSGLLRDGKSVRLFPTPAQVIKAD---LKSL 369 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +R E L+ L+ A + G L + DV+ ++ + PGIG WTA+Y AL+ + D Sbjct: 370 KTTTRRKETLVALSKALINGDLSLEPAQDVDSFVEKILGIPGIGPWTASYMALKALRHTD 429 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 F D +I + + E + PWR Y +W Sbjct: 430 AFPATDLIIARAIAEHPKTKF----ESFSPWRGYVAALLWR 466 >UniRef50_Q3IBU8 Putative ADA regulatory protein (Regulatory protein of adaptative response) n=3 Tax=Alteromonadales RepID=Q3IBU8_PSEHT Length = 454 Score = 244 bits (623), Expect = 3e-63, Method: Composition-based stats. Identities = 76/276 (27%), Positives = 120/276 (43%), Gaps = 22/276 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 TL ++PPY+W M FLA R ++ +E + + Y R+ + +G A ++ + Sbjct: 195 LTLPFRPPYNWPAMQQFLAKRLIAPMEWITATSYGRTFSDEHCKGSFNAEFIAQKNHFKV 254 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVN----GALGRLGAARPGLRLPGCVDAFEQG 117 ++ + + + R+ DL + ++ + A GLRLPG +FE G Sbjct: 255 AITINNTHCLQQVITNIRRVLDLDADINLITMHIQDNINNAFAVSEGLRLPGIWSSFEAG 314 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +RA+LGQ VSV A L ++ GE+ + + FPTPQ+L +D MP Sbjct: 315 IRAVLGQQVSVTAAHNLVTKLVSELGEQCN---GAVYFPTPQQLVNSD---FAFFKMPQA 368 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R AL +LA + D + GIG WT NY LRG D+ L Sbjct: 369 RKNALYNLAQFCT-----LNPQCDD---LDLWLNLKGIGPWTVNYAKLRGQSQPDILLDG 420 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 D +K+ A ++ P+RSY +W Sbjct: 421 DLGVKKA----QAAVAVFSSDNCAPFRSYLTFQLWQ 452 >UniRef50_Q15P13 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15P13_PSEA6 Length = 457 Score = 243 bits (621), Expect = 5e-63, Method: Composition-based stats. Identities = 85/281 (30%), Positives = 127/281 (45%), Gaps = 26/281 (9%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L ++PPY+W + FLA RA+S E V+ YAR+ G +G A ++ Sbjct: 191 VIPLAYRPPYNWPHLRDFLARRAISGSEWVSQDSYARNFTFGTSKGYFQAQHQPDKYRFL 250 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAAR----PGLRLPGCVDAFEQ 116 + L+ CL+ + R+ D+ + ++ + G ++ PG+R+PG + FE Sbjct: 251 VTLAIDDLRQLKHCLSNVRRILDVDADSATIDNRIELSGLSKQTITPGIRIPGIWNTFEA 310 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDD-----FPEYICFPTPQRLAAADPQALKA 171 G RAILGQ +SV A L ++ GE + D FP P +A +D L Sbjct: 311 GCRAILGQQISVTAAINLVTKLVATIGEPVLDDQAPVPELNRYFPAPDAVANSD---LSF 367 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 LGMP R E L A + P T P D + GIG WT Y LRG Sbjct: 368 LGMPNSRRETLRRFAAFYAQH--PDTPPDDW-------LSIKGIGPWTVAYANLRGLSQA 418 Query: 232 DVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D++L D +IK++ A++ PWRSY +W Sbjct: 419 DIWLNSDLVIKKQLLLHDID-----ADKVSPWRSYLTFTLW 454 >UniRef50_Q1QTR7 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=2 Tax=Gammaproteobacteria RepID=Q1QTR7_CHRSD Length = 453 Score = 240 bits (612), Expect = 5e-62, Method: Composition-based stats. Identities = 87/273 (31%), Positives = 124/273 (45%), Gaps = 21/273 (7%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++PPY W W+ FLAAR + +E + Y R + G G TA+ RH + L Sbjct: 196 LAYRPPYAWEWLRDFLAARRIDRLEWGDEHRYGRHIQWGSASGHFTAVHVPERHGFRVTL 255 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR-LGAARP---GLRLPGCVDAFEQGVR 119 S + + R+ DL + ++ L + L P GLRLPG FE GVR Sbjct: 256 SLDDLGALLPVVRHIRRVLDLDADTALIEAQLRQTLPDTFPLVEGLRLPGVWTPFEAGVR 315 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 A+LGQ VS++ A R+ + GE + FPT R+AA+D L+ MP R Sbjct: 316 AVLGQQVSISAARGHVTRLVEALGEP--TGDDGRQFPTAARIAASDLAFLR---MPQARR 370 Query: 180 EALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + L LA AA + L + GIG W+A+Y ALRG D++L D Sbjct: 371 DCLRGLAQAACDRRLDDDP--------RQWTALKGIGPWSADYAALRGTSHPDIWLGGDL 422 Query: 240 LIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +K+ + + PWRSY L +W Sbjct: 423 GVKRALSALGTVEPAHAT----PWRSYLTLQLW 451 >UniRef50_C4DFD0 DNA-3-methyladenine glycosylase II; Transcriptional regulator Ada; DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DFD0_9ACTO Length = 413 Score = 239 bits (611), Expect = 7e-62, Method: Composition-based stats. Identities = 87/285 (30%), Positives = 124/285 (43%), Gaps = 17/285 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++ P + G LAA AV VE D Y R+L + GVV+ P + Sbjct: 116 LRLPFRQPLCPDNVFGHLAATAVPGVEEWRDGAYRRTLRLPHGPGVVSLRPGPDHVGCVL 175 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNG------ALGRLGAARPGLRLPGCVDAFE 115 + +A+ L DL +P V+ L L A PG R+P VD E Sbjct: 176 --WLSDLRDLSIAIARCRWLLDLDADPVAVDELLSRDEVLAPLVAKAPGRRVPRTVDPGE 233 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALG 173 VRA+LGQ VS A A AR+ YG+R++D FP+P LA DP L Sbjct: 234 FAVRAVLGQQVSTAAARTHAARLVARYGQRVEDPGGGLTHLFPSPGELAGLDPDGLA--- 290 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 MP+ R L+ L A ++G + + + D A + L PG G WT A+R D Sbjct: 291 MPVSRKNTLLGLVRALVDGDVELGVGVDWRSAKEALSALPGFGPWTVESIAMRALGDPDA 350 Query: 234 FLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIWYT 274 F+ D I+ + + + W PWR+YA+ ++W T Sbjct: 351 FVASDLGIRLAAEQLGLPTGARALVERSRAWMPWRAYAVQYLWAT 395 >UniRef50_C7MYM6 DNA-3-methyladenine glycosylase II /DNA-O6-methylguanine--protein-cysteine S-methyltransferase /Transcriptional regulator Ada n=20 Tax=Actinobacteria (class) RepID=C7MYM6_SACVD Length = 510 Score = 233 bits (594), Expect = 7e-60, Method: Composition-based stats. Identities = 82/294 (27%), Positives = 133/294 (45%), Gaps = 23/294 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++ P+D + +L FL ARAV VE+ + Y R+L + VV P + Sbjct: 220 LRLPFRRPFDTTGVLDFLTARAVPGVES-TEGDYRRTLRLPHGAAVVRLSPRSTHIECLL 278 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L+ + ++++ RL+DL +PQ V + AL +A PG+R+PG VD E Sbjct: 279 RLT--DIRDLSGAVSRIRRLWDLDADPQAVLDCLSADPALAPWLSAAPGIRVPGAVDGPE 336 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLD------DFPEYICFPTPQRLAAADPQAL 169 +RA+ Q +S A R+ G + + FP P +A L Sbjct: 337 LVLRALFEQGMSTRRAHIALGRLVTELGTPIAPELLDATDDPTLLFPGPTAVAEHAASIL 396 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 P R + + +A A +G L + + D E + L PGI W A+Y +R Sbjct: 397 PG---PQDRVDTIRTIAAALAQGDLDVHVGRDAEDLRRDLLAVPGISSWAADYILMRLLG 453 Query: 230 AKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D+ L D ++++ + T + + +A RW+PWRSYA +++W G QP Sbjct: 454 HPDILLGTDLVLRRGARSLGIDATYSGLTTHARRWRPWRSYAGMYLWRA-GDQP 506 >UniRef50_A3XSB2 Ada regulatory protein n=1 Tax=Vibrio sp. MED222 RepID=A3XSB2_9VIBR Length = 482 Score = 232 bits (593), Expect = 7e-60, Method: Composition-based stats. Identities = 81/295 (27%), Positives = 114/295 (38%), Gaps = 40/295 (13%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L++ P DW +LGF R + +E V D YY R++ V +G A R +L I Sbjct: 203 IQLSFHGPLDWDHLLGFYRRRMIEGLEEVGDGYYQRTVNVNGSKGWFKATLAKER-SLDI 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA---ARPGLRLPGCVDAFEQGV 118 +A + R+FDL + V + A+ G+R+PG A+E GV Sbjct: 262 EFELDDMSQLRSLIANIRRMFDLDVDISKVEDFFSTIDPNLVAKSGIRIPGVWSAWEAGV 321 Query: 119 RAILGQLVSVAMAAKLTARVAQLYG--------------------ERLDDFPEYICFPTP 158 RAILGQ VSV A + + ++ D E FPTP Sbjct: 322 RAILGQQVSVTAAIGQLNLLVRKLSGSYQVFDSQEQANSQECSDLPQIADASEKAYFPTP 381 Query: 159 QRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRW 218 +++A AD L+ MP R E L A ++ + GIG W Sbjct: 382 KQIADADVSFLR---MPGSRKETLKRFAQYMVDNE---------AEHPSKWIDLKGIGPW 429 Query: 219 TANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 T Y LRG + L D ++K+ E PW SYA H W Sbjct: 430 TIQYALLRGLSEPNHLLVGDLVVKKFIEHRP----TINTESVSPWGSYATFHCWN 480 >UniRef50_C1YI07 DNA-O6-methylguanine--protein-cysteine S-methyltransferase; DNA-3-methyladenine glycosylase II; Transcriptional regulator Ada n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YI07_NOCDA Length = 561 Score = 232 bits (591), Expect = 1e-59, Method: Composition-based stats. Identities = 105/327 (32%), Positives = 136/327 (41%), Gaps = 49/327 (14%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAI----PDIARH 57 L ++ P D + ML FL RAV VE D Y R+L + VV A Sbjct: 211 LRLPYREPIDLARMLRFLGDRAVPGVEEYRDGVYRRTLMLAHGPAVVELSEGSGTGRAGR 270 Query: 58 TLHINLSAGLEPVAA--------------------------ECLAKMSRLFDLQCNPQIV 91 T + G+ P A + + RL DL +P V Sbjct: 271 TGRAGATGGVRPADAVDGGVSVSGGGHVLCRLRLSEARDLTSAVRRCRRLLDLDADPGAV 330 Query: 92 ------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + LG + AA PGLR PG VD E VRA+LGQ VSV A L R+ + +GE Sbjct: 331 AEALGGDPLLGPIVAAHPGLRSPGHVDPAELAVRAVLGQQVSVRAARTLAGRLVERFGEP 390 Query: 146 LDDFPE------YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP 199 L E FP+P LAAADP +P+ R AL L A G + + Sbjct: 391 LAPGLEAPGGGLTHVFPSPDALAAADPAGFS---VPVARGRALAGLCEAIASGWIDLGPG 447 Query: 200 GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG----MTPAQIRR 255 D ++A + L GIG WTA Y +RG DVFL D ++ TPA R Sbjct: 448 CDRDEAERRLVELRGIGPWTAGYVRMRGLGDPDVFLHGDLGVRMALEAGGRRATPAAAAR 507 Query: 256 YAERWKPWRSYALLHIWYTEGWQPDEA 282 A W PWRSYA +W + + E+ Sbjct: 508 EAREWSPWRSYANHALWASLADRERES 534 >UniRef50_D1BI44 DNA-3-methyladenine glycosylase II /DNA-O6-methylguanine--protein-cysteine S-methyltransferase /Transcriptional regulator Ada n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BI44_SANKS Length = 517 Score = 231 bits (589), Expect = 3e-59, Method: Composition-based stats. Identities = 98/310 (31%), Positives = 138/310 (44%), Gaps = 34/310 (10%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRGVVTAI--PD 53 + L + P+D + +LGFLA RAV+ VET YAR+L + G V + Sbjct: 207 VVDLPVRQPFDAAGVLGFLADRAVAGVETATTEDDGTMRYARTLDLPHGPGAVEVVAVRR 266 Query: 54 IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRL 107 R + L A +A++ RL DL +P V+ AL L RPG R+ Sbjct: 267 QGRWEMRARLELAALGDVAPAVARVRRLLDLDADPVAVDSALAQDPALRPLVEERPGTRV 326 Query: 108 PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQRLAA-- 163 PG VD E VRA++GQ +SVA A R+ G + FPT ++AA Sbjct: 327 PGAVDPHELVVRAVVGQQISVAAARTHLGRLTARLGTPYRSAFAGLDRLFPTAAQVAAGV 386 Query: 164 ---ADPQAL---KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGR 217 AD + L + L +P + A++ +A A +G L + + D L PGIG Sbjct: 387 PVPADDEVLDPDRPLRLPRRSVRAVVSVARALADGDLVVDVGADAAALRAELVDRPGIGP 446 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQRFPGM--------TPAQIRRYAER---WKPWRSY 266 WTA Y A+R D +LP D + + T A R AE W PWRSY Sbjct: 447 WTAAYVAMRVLGDPDAWLPGDVALVAGARAVGLLGTEKTTSAAHRALAEGASVWAPWRSY 506 Query: 267 ALLHIWYTEG 276 A++H+W Sbjct: 507 AVVHLWRAAS 516 >UniRef50_Q2IPL2 Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II n=12 Tax=Proteobacteria RepID=Q2IPL2_ANADE Length = 514 Score = 230 bits (587), Expect = 5e-59, Method: Composition-based stats. Identities = 106/288 (36%), Positives = 140/288 (48%), Gaps = 17/288 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L PYDW +L FLAARA+ VE VAD Y R++A+ G V PD L Sbjct: 213 IALPHTAPYDWPALLEFLAARAIPGVEQVADGAYRRTVALDGAAGTVEVRPDPRGRGLLA 272 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L A + ++ RL DL + + + L L AARPGLR+PG + FE Sbjct: 273 TLRLPRVAAIAPAVERLRRLLDLDADAAAIGAHLSGDPLLAPLLAARPGLRVPGAWEPFE 332 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALG 173 VRA+LGQ VSVA A L R+A G +D + FP P+ LA AD L+ LG Sbjct: 333 LVVRAVLGQQVSVAAARTLAGRLAARLGAPVDSGDPALSRLFPGPEALAGAD---LEGLG 389 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + RA L + A + + G++E A+ L PGIGRWTA Y A+R D Sbjct: 390 LTRARAATLAAIGGAVRDDPSLLAPGGELEDAVARLDALPGIGRWTAQYVAMRALHQPDA 449 Query: 234 FLPDD------YLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 F D + P ++ R AERW+PWR+YA LH+W + Sbjct: 450 FPEGDLGLLAALGGLRGRGRAAPGELLRRAERWRPWRAYAALHLWMSL 497 >UniRef50_B2GIR9 Putative methylated-DNA--protein-cysteine methyltransferase/3-methyladenine-DNA glycosylase II n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GIR9_KOCRD Length = 532 Score = 228 bits (581), Expect = 2e-58, Method: Composition-based stats. Identities = 80/281 (28%), Positives = 125/281 (44%), Gaps = 16/281 (5%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L+++ P D + + A AV VE + Y+R+L + + A L + Sbjct: 241 LSYRAPLDLHGLFVWFAVHAVEGVEVGTATSYSRTLRLPGGPAWLRVYRRGA-DELRMRA 299 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFEQG 117 +A++ RLFDL +P V+ AL L AARPGLR+ G D E Sbjct: 300 RLTDLADLPALIARVRRLFDLDADPLAVDEALSHVPALRPLVAARPGLRVVGSADPEETL 359 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALGMP 175 +R ++GQ +S+A A + + GE +F + FPT +A + L+ P Sbjct: 360 IRTLIGQQISLAAARTVLGARTREMGEPAPEFAPGLSHMFPTAAAIAEHGERFLRG---P 416 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R A++ A+A G L ++ D Q L PG+G WTA++ +R DVFL Sbjct: 417 AARVRAVLGAASAVASGELSLSPGDDAAQQRAALLALPGVGPWTADHVRMRVTGDPDVFL 476 Query: 236 PDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIW 272 DD ++ + + +A+ PWRSYA H+W Sbjct: 477 VDDGALRAGAQRIGLPGDKKALTAWAQSAAPWRSYATTHLW 517 >UniRef50_B0KRT0 AlkA domain protein n=1 Tax=Pseudomonas putida GB-1 RepID=B0KRT0_PSEPG Length = 325 Score = 227 bits (580), Expect = 2e-58, Method: Composition-based stats. Identities = 96/283 (33%), Positives = 138/283 (48%), Gaps = 14/283 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++ PY W LGFLAAR + +ET D Y+R+L V + V+ A P +H L + L Sbjct: 40 LRYRAPYHWPSTLGFLAARCIPGIETCHDGTYSRTLIVAGHHAVLHATPMTNQH-LRVRL 98 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQG 117 +A++ R+FDL +P + + + L ARPGLR+P DA EQ Sbjct: 99 EGAPSNALPGLIARLRRVFDLDADPARISAELSCDPLMASLLKARPGLRVPQGWDACEQA 158 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERL--DDFPEYICFPTPQRLAAADPQALKALGMP 175 +R +LGQ +SVA A L R+ Q +G L FP LA A + + GMP Sbjct: 159 MRTVLGQQISVAGAMTLAGRLVQRHGAPLRLSAPGLSHVFPALPTLANAQFENM---GMP 215 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA L LANA L + +++ ++ L GIG W+A+Y ALR A D Sbjct: 216 SARATTLATLANALLADPGLLRRGQVLDELLRNLCRLKGIGPWSAHYLALRQAGAADALP 275 Query: 236 PDDYLIKQRFPGM--TPAQIRRYAERWKPWRSYALLHIWYTEG 276 D + + + AQ+ A W+PWR+YA H+W + G Sbjct: 276 LGDVALIKALRLLEGDEAQLAERALDWRPWRAYAAQHLWASLG 318 >UniRef50_A0JV31 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada n=5 Tax=Actinobacteria (class) RepID=A0JV31_ARTS2 Length = 504 Score = 227 bits (580), Expect = 3e-58, Method: Composition-based stats. Identities = 81/300 (27%), Positives = 127/300 (42%), Gaps = 30/300 (10%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDI--ARHTLH 60 L ++ P+D + FLA R++ +ET + YAR+L + + D L Sbjct: 199 NLPYREPFD-PGIFQFLAVRSIPGIETGTGTSYARTLRLPHADARFSVEYDADAPGRPLV 257 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAF 114 + + A L+++ RL DL +P ++ L A PG+R+PG VD Sbjct: 258 LTIGAVDLRDLPSLLSRVRRLLDLDADPVAIDNALEADPRLAPAVKAFPGMRMPGAVDPQ 317 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI-CFPTPQRLAAADPQALKALG 173 E +RA++GQ ++VA A +++ E L FPT ++A L+ Sbjct: 318 ELLIRAMIGQQITVAAARTALTQLSACGSESLVPADGLHRLFPTAAQIADPGFGLLRG-- 375 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 P +R +++ A A G L D+ L PG+G WT Y A+R A DV Sbjct: 376 -PQRRIDSVRAAAGAMAAGNLDFGYGDDLAGLQSKLLPLPGVGPWTVGYVAMRVIGAPDV 434 Query: 234 FLPDDYLIKQRF-------------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 FL +D ++ PG+ PA PWRSYA +H+W +P Sbjct: 435 FLANDAAVRNGILALDTGPQAGERPPGVQPADFTDV----SPWRSYATMHLWRAAAMRPQ 490 >UniRef50_A8LHD8 Transcriptional regulator, AraC family n=4 Tax=Actinomycetales RepID=A8LHD8_FRASN Length = 540 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 91/309 (29%), Positives = 125/309 (40%), Gaps = 34/309 (11%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++ P + G L A AV VE D Y R++ +V P + Sbjct: 213 VRLPFRAPLYPDNLFGHLVATAVPGVEEWRDGAYRRTMRTLHGHAIVALRPLPDHIGCRL 272 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFE 115 L+ A + + RL DL +P V+ AL L A PG R+P VD E Sbjct: 273 ALT--DVRDLAPVIGRCRRLLDLDADPIAVDGQLAADPALAPLVARAPGRRVPRTVDPAE 330 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQ---------------- 159 VRA+LGQ VSVA A AR+ G + D + PQ Sbjct: 331 LAVRAVLGQQVSVAAARTHAARLVTAVGTPIHDPEGGLTHLWPQIADLAEHIERTEYAEC 390 Query: 160 -RLAAADPQ-----ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFP 213 LA A P A + L +P R L + G + + GD E+A L P Sbjct: 391 TDLADAVPAGRRAGAPRGLALPAARRRTFAALVGGLVSGMIELGAGGDWERARAALAALP 450 Query: 214 GIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALL 269 GIG WT A+R D FLP D +++ + TPA + R+A W+PWR+YA+ Sbjct: 451 GIGPWTLETIAMRALGDPDAFLPGDLGVRRGAERLGLPATPAALSRHAAAWRPWRAYAVQ 510 Query: 270 HIWYTEGWQ 278 H+W Sbjct: 511 HLWAVLDHP 519 >UniRef50_P37878 DNA-3-methyladenine glycosylase n=4 Tax=Bacillaceae RepID=3MGA_BACSU Length = 303 Score = 224 bits (571), Expect = 3e-57, Method: Composition-based stats. Identities = 73/292 (25%), Positives = 125/292 (42%), Gaps = 21/292 (7%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV--TAIPDIARHT 58 + TL +D + LG+L + + ++ + +A+GE R +V + I + Sbjct: 11 VITLP--EIFDMNANLGYLTREKNECMYEIENNIITKVIAIGEIRSLVQVSVINNKQMIV 68 Query: 59 LHINLSAGLEPVAAECLAK-MSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCV 111 +N S +E E + K + FDL + + L GLR+ G Sbjct: 69 QFLNDSRPVEQWKREEIVKYIHEWFDLDNDLTPFYEMAKADPLLKMPARKFYGLRVIGIP 128 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALK 170 D FE +LGQ +++A A L + + +G+ ++ + +Y FP +R+A P L Sbjct: 129 DLFEALCWGVLGQQINLAFAYSLKKQFVEAFGDSIEWNGKKYWVFPPYERIARLTPTDLA 188 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGW 228 + M +K++E +I +A G L + + A K L GIG WTANY +R Sbjct: 189 DIKMTVKKSEYIIGIARLMASGELSREKLMKMNFKDAEKNLIKIRGIGPWTANYVLMRCL 248 Query: 229 QAKDVFLPDDYL-------IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 + F DD ++ T +I + WK W+SYA ++W Sbjct: 249 RFPTAFPIDDVGLIHSIKILRNMNRKPTKDEILEISVPWKEWQSYATFYLWR 300 >UniRef50_D0LE01 Ada metal-binding domain protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LE01_GORB4 Length = 526 Score = 223 bits (568), Expect = 7e-57, Method: Composition-based stats. Identities = 83/303 (27%), Positives = 122/303 (40%), Gaps = 35/303 (11%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----------YYARSLAVGEYRGVVTA 50 L ++PPY WSWM FL + A + VE+V D Y R L + + Sbjct: 220 LRLVYRPPYRWSWMRWFLGSHAAAGVESVIDDDPDAITPATRWRYRRVLDLPHGPALAVV 279 Query: 51 IP---DIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAA 101 P + + + L + ++ R DL + + AL L A Sbjct: 280 EPSTEETGPPFVRLTLHHMDMRDLGVAVNRIRRHLDLDADVATAEDALRHDPALRPLIDA 339 Query: 102 RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE---------- 151 PGLRLPG +D E +R ++GQ +SVA A + G R+ E Sbjct: 340 APGLRLPGSLDPAETILRTMIGQQISVAAARTHIDALVARLGTRVPWPDEADLPPSAVFP 399 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQT 211 FP+ +A Q L+ P +R E+++ +A A + T+ L Sbjct: 400 SATFPSATAIAEHGHQVLRG---PRRRIESIVAVAAALADKTVEPHPGLAASDLRAQLLE 456 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 PGIG WTA A+R D+ L DD ++KQ + R W PWRSYA +H+ Sbjct: 457 LPGIGPWTAALVAMRVTGDPDIALTDDLVVKQAMTELGID--IRSVPSWSPWRSYASMHL 514 Query: 272 WYT 274 W Sbjct: 515 WRH 517 >UniRef50_Q12L65 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II n=8 Tax=Shewanella RepID=Q12L65_SHEDO Length = 545 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 84/297 (28%), Positives = 116/297 (39%), Gaps = 34/297 (11%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSY-YARSLAVGEYRGVVTAIPDIARHTLH 60 +L ++PP +W M F R VS +E + + Y+RS +GV + A+ Sbjct: 226 LSLAFRPPLNWHKMWAFYQFRQVSGMEILDEEQGYSRSFCFDGVKGVFRVRLNEAKSQFD 285 Query: 61 INLSAGLEPV---AAECLAKMSRLFDLQCNPQIVNGALGRLGA----ARPGLRLPGCVDA 113 + + ++ RL DL + + L A GLR+P Sbjct: 286 TQIYLLHSHDVKQLHPVVLRIRRLLDLDTDMATIAQIFVPLVAMGAKLDAGLRIPATASV 345 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKAL 172 FE RAILGQ VSV A KL + + YGE + + + FPTP+ +A A LK Sbjct: 346 FEAACRAILGQQVSVQQATKLLNTLVEHYGETFELNGQVWRLFPTPEAVATASLDELK-- 403 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R AL L E P + P D GIG WT Y +RG + Sbjct: 404 -MPGARRLALNALGAYVQEH--PHSTPDDW-------LEVKGIGPWTVAYAKMRGLSESN 453 Query: 233 VFLPDDYLIKQRFPGMTPAQ-------------IRRYAERWKPWRSYALLHIWYTEG 276 VFL D +IK R G+ A + PW SY +W E Sbjct: 454 VFLSSDLVIKHRIHGLYAKAGGIIETPKAYLALAADIANKVSPWGSYLTFGLWDDED 510 >UniRef50_UPI0000E0EED3 Ada family regulatory protein n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0EED3 Length = 280 Score = 220 bits (561), Expect = 4e-56, Method: Composition-based stats. Identities = 81/296 (27%), Positives = 119/296 (40%), Gaps = 36/296 (12%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRG-------------- 46 M L+ PY+WS + FL RA++ +E + +YAR ++ Sbjct: 2 MIYLSVTQPYNWSMVHAFLTRRAIAGIEECGEFHYARYFDETDFYAVSGLSHVSNEGLTS 61 Query: 47 -VVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAA---- 101 A + + LS E LA ++R+ D Q +P + AL + G Sbjct: 62 SWFCATYEPEAQRFAVQLSLHNEACREAVLANIARVLDAQQDPNTIAQALTKAGFTPEHM 121 Query: 102 RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRL 161 GLRLP FE +RAI+GQ +SV A K+ + Q G + Y FP+ + Sbjct: 122 TSGLRLPATWSPFEALIRAIVGQQISVNGAVKI---LNQWIGNLRAEANGYRHFPSATEI 178 Query: 162 AAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTAN 221 A D L MP R L A L + ++ L GIG WT N Sbjct: 179 ACCDTSKLP---MPKARQATLNLAAETVQAKPLH------DSETIQDLLKIKGIGPWTVN 229 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 Y +RG D+FL +D ++K + A+ E KPWRSY + +W Sbjct: 230 YVLMRGISHPDIFLDNDLVVKNQL-----ARFALTPELAKPWRSYVCIQLWEHANT 280 >UniRef50_C7QDZ2 Transcriptional regulator, AraC family n=2 Tax=Actinomycetales RepID=C7QDZ2_CATAD Length = 564 Score = 219 bits (559), Expect = 8e-56, Method: Composition-based stats. Identities = 97/348 (27%), Positives = 144/348 (41%), Gaps = 70/348 (20%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETV----ADSYYARSLAVGEYRGVVTAIPDIARH 57 L ++ P+D++ +LG+ RA+ V+ V D Y R+L + G V D + Sbjct: 214 LRLTYRTPFDFAALLGWFGDRAIPGVDEVVGTGRDLVYRRALRLPHGTGQVELRDD--KG 271 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCV 111 +H L A + + L DL +P V+ AL L AARPGLR+PG V Sbjct: 272 VVHARLVVDDLRDVAVAVRRCRDLLDLDADPAQVDAVLAGDPALAPLVAARPGLRVPGAV 331 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLA--------- 162 D FE VRA+LGQ +SVA A +TAR+ Q + ++ E P A Sbjct: 332 DGFEIAVRAVLGQQISVAAARTMTARLVQRF-SAVELAAEAALVPNAALPAVSPAVSGSP 390 Query: 163 ------------------AADPQALKALGMPLKRA-----------------------EA 181 +DP+A A K+A Sbjct: 391 TATSALAAASGATRDPDKDSDPEAAPASHFVDKKADLLPFPRPETLAAGDYEGLGLTRRT 450 Query: 182 LIHL-ANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 + L A A + + + D +A L PGIG WTA+Y ALR + D F D Sbjct: 451 VATLRALATAVASGDLALDRGVDRTEARAKLLAVPGIGPWTADYVALRVFGDPDAFPVGD 510 Query: 239 YLIKQRFPGMT----PAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 +++++ + + +AE W+PWR+YA LH+W + G EA Sbjct: 511 LIVRRQAERLGLPGAEKALLAHAESWRPWRAYAALHLWASSGDPVIEA 558 >UniRef50_A1S7Q4 DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada n=1 Tax=Shewanella amazonensis SB2B RepID=A1S7Q4_SHEAM Length = 483 Score = 218 bits (557), Expect = 1e-55, Method: Composition-based stats. Identities = 79/284 (27%), Positives = 126/284 (44%), Gaps = 22/284 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS----YYARSLAVGEYRGVVTAIPDIARH 57 L+++PPYD+ + F ARA+ E + Y R+L V G A ++ Sbjct: 211 LQLSFRPPYDFMRLRAFFMARAIPGAEWFFNDAGEPCYGRTLMVAGDAGWFEACLLAGKN 270 Query: 58 TLHINL-SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALG---RLGAARPGLRLPGCVDA 113 L +++ G ++ LA++ R+ D+ N +++ + G + LPG Sbjct: 271 ALAVSIFPGGRVSALSQWLAEIKRVLDIDANLSLIHEHIQGHMPEGVVLNTMTLPGAGSF 330 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKAL 172 FE RA+LGQ VS+ A +L + ++ FPT +++A+A L++L Sbjct: 331 FEAACRAVLGQQVSLVQATRLLGLLTAETTPEVELGGRRCRVFPTAEQVASAT---LESL 387 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R AL +A +P TL GIG WT +Y +RG D Sbjct: 388 KMPGSRKNALRDMAALFSRDPVPDD---------ATLLAVKGIGPWTVSYARMRGLSDPD 438 Query: 233 VFLPDDYLIKQRFPGMTPAQI-RRYAERWKPWRSYALLHIWYTE 275 V L D ++KQ+ M A++ R PW SY L +W+TE Sbjct: 439 VLLVGDLVVKQKLTAMGWAKVPDRLKSDVSPWGSYLTLALWHTE 482 >UniRef50_A3D6C4 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=11 Tax=Shewanella RepID=A3D6C4_SHEB5 Length = 565 Score = 217 bits (553), Expect = 4e-55, Method: Composition-based stats. Identities = 84/321 (26%), Positives = 125/321 (38%), Gaps = 65/321 (20%) Query: 6 WQPPYDWSWMLGFLAARAVSSVETVADSY-----------------------------YA 36 ++PP DW+ L F RAV+ +E Y Sbjct: 256 YRPPLDWASQLAFYRLRAVTGMEWFTPQMSHPQASDAVQVADEANLAAEANADDNGLEYG 315 Query: 37 RSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAEC---LAKMSRLFDLQCNPQIVNG 93 R A+G+ RG V I + + + ++ + E + ++ R+ DL + Q + Sbjct: 316 RCFAIGKMRGTVQIIHEPKLNRFKLAIALTEDSAVDELQLLVTEVRRILDLDADMQQIEQ 375 Query: 94 ALGRLGA----ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-D 148 L L + GLR+PG FE RAILGQ V+V A KL + + YGE + Sbjct: 376 GLSTLPSLGLMPFSGLRIPGAGSLFEAVCRAILGQQVTVVQATKLLNILVEAYGECFSLN 435 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKT 208 EY FPTP+ + A L L MP R AL LA E E ++ Sbjct: 436 GREYRLFPTPEAIREAS---LTELKMPGARKLALNALAAFICE---------HPEASVDD 483 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-------------- 254 + GIG WT Y LRG +VFL D ++K+ + R Sbjct: 484 WLSVKGIGPWTIAYAKLRGLGDPNVFLHLDLIVKKHLLALYIKNNRLDETAAAAVIYSQL 543 Query: 255 --RYAERWKPWRSYALLHIWY 273 + +++ PW SY +W+ Sbjct: 544 CEQLSQQIAPWGSYLTFQLWH 564 >UniRef50_Q7MGD3 Adenosine deaminase n=51 Tax=Vibrionales RepID=Q7MGD3_VIBVY Length = 481 Score = 213 bits (544), Expect = 4e-54, Method: Composition-based stats. Identities = 78/277 (28%), Positives = 112/277 (40%), Gaps = 27/277 (9%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIARHTL 59 L ++ + ML F RA+ S E V ++ Y R + + +R A + L Sbjct: 224 LAFRGDLNVKHMLDFYRQRAIESEEVVTETSYQRQVVINGKTVGFRAEFPATFPAEKRQL 283 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG---AARPGLRLPGCVDAFEQ 116 + S + +A + R+FDL C+ +++ L + G+R+PG + +E Sbjct: 284 VVYFSMDDLTLLRPMVAGIRRMFDLDCDTRVIEAHLNTVALGLVKSVGIRIPGVWNVWEA 343 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 GVRAILGQ VSV A + L E FP+PQ++ AD L+ MP Sbjct: 344 GVRAILGQQVSVKAAIGQLNLLVA----TLHHDSEVRTFPSPQQVVDADLHFLR---MPQ 396 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 R E L A LE D GIG WT +Y LRG D L Sbjct: 397 SRKETLRRFAVMMLENE-----HADP----NQWLALKGIGPWTVSYAQLRGLSQPDRLLE 447 Query: 237 DDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 D ++K+ E PW SYA H+W Sbjct: 448 KDLVVKKALAQFPTLNQ----ESASPWGSYATFHLWN 480 >UniRef50_C0ZIT0 DNA-3-methyladenine glycosylase II n=75 Tax=Bacillales RepID=C0ZIT0_BREBN Length = 310 Score = 213 bits (543), Expect = 5e-54, Method: Composition-based stats. Identities = 72/284 (25%), Positives = 118/284 (41%), Gaps = 21/284 (7%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 + +S L +L+ + + + + +++ +G+ VV L + + P Sbjct: 25 FSFSQNLHYLSRASNECMFHIQNGRLYKAIPIGQDSQVVEI-HAKNDQGLTVRFLSPSLP 83 Query: 70 ---VAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 V E + FDL + + L + GLR G D FE Sbjct: 84 NEKVRTEVARYVRDWFDLDRDLVPFYELAAGDALLKQAVEKFYGLRTMGIPDLFEALSWG 143 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 I+GQ +++ A L R+ + +G R++ + Y FPT +++A L L M K+ Sbjct: 144 IIGQQINLTYAYTLKRRLVEAFGRRVEFEGETYWLFPTAEKIAGLSVTDLDGLRMTTKKC 203 Query: 180 EALIHLANAALEGTLPMTI---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 E LI +A +EG L + GD + A K L + GIG WTANY +R + F Sbjct: 204 EYLIDVAQLIVEGKLSKELLWDGGDYQTAEKRLTSIRGIGPWTANYVLMRCLRMPSAFPI 263 Query: 237 DDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWY 273 DD + T A+IR ++ W W SYA ++W Sbjct: 264 DDVGLHNAIKFLLGKEKKPTKAEIRELSKTWTNWESYATFYLWR 307 >UniRef50_C4L050 DNA-3-methyladenine glycosylase II n=4 Tax=Bacillales RepID=C4L050_EXISA Length = 297 Score = 213 bits (543), Expect = 6e-54, Method: Composition-based stats. Identities = 64/290 (22%), Positives = 116/290 (40%), Gaps = 16/290 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L Q P+D+ L FL+ + + + +GE ++ ++ + Sbjct: 4 MLLAVQQPFDFQECLVFLSRSEQEVLHVTTPDMVRKLMRIGERLILIELREEVNHIHVRF 63 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFE 115 E ++ DL+ + + L L GLR+ G D FE Sbjct: 64 PFDEVSETEKEHVAREVRNWLDLERDLKPFETMGAKDELLAPLIETHRGLRMIGFPDLFE 123 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGE-RLDDFPEYICFPTPQRLAAADPQALKALGM 174 AI+GQ ++++ A + R + YG+ R+ + Y FP +R+A +P+ L+ L Sbjct: 124 ALTWAIIGQQITLSFAYTIKRRFVERYGDHRVIEGRAYWTFPRAERIALLEPEELRELQF 183 Query: 175 PLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 ++AE +I +A G L T + L G+G WTA+Y ++ +Q Sbjct: 184 SRRKAEYVIDIAREITNGDLSKTALQSHSSADIRQRLLAIRGVGAWTADYVLMKCFQDAS 243 Query: 233 VFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 F D + Q T +++RY E W+ + YA ++W + Sbjct: 244 AFPIADVGLHQAIQHQLGTAKKPTIEEVKRYGESWQGFEGYATFYLWRSL 293 >UniRef50_C6XZ60 HhH-GPD family protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XZ60_PEDHD Length = 301 Score = 213 bits (543), Expect = 6e-54, Method: Composition-based stats. Identities = 68/286 (23%), Positives = 117/286 (40%), Gaps = 19/286 (6%) Query: 13 SWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH-INLSAGLEPVA 71 + L FL+ + +V + R G +V P + L +N+S E + Sbjct: 17 AECLWFLSRDFDDCMYSVFEDRVRRGFRQGSGIMIVDIYPMSDKLILEWLNISPSAEDIT 76 Query: 72 AECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQL 125 A + +S FDL + + + + GLR G D FE I+GQ Sbjct: 77 A-VVQFVSEWFDLNTDLIPFYKTIAADRRISYMAEDFAGLRFIGMPDFFEALAWCIIGQQ 135 Query: 126 VSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIH 184 ++++ A K+ R+ + YG D +Y FP P+ +A A L+ L K+AE +I Sbjct: 136 INLSFAYKVKRRLVERYGTCTQFDGQKYYLFPGPEIIAKASISDLRELQFSEKKAEYIIA 195 Query: 185 LANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 +A A L G L + D+E +K L GIG+WTANY ++ + D + Sbjct: 196 IAEAFLNGMLNKELLQRLPDLESRIKFLTNIRGIGQWTANYALMKSLKEPACIPYGDAGL 255 Query: 242 KQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 I ++ + ++ W+SY + ++W + Sbjct: 256 LNALLNHGIIKSKDNKPAIAKFFKAFEGWQSYIVFYLWRALSKPKE 301 >UniRef50_D1C0H7 Transcriptional regulator, AraC family n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1C0H7_XYLCX Length = 543 Score = 213 bits (542), Expect = 6e-54, Method: Composition-based stats. Identities = 89/320 (27%), Positives = 131/320 (40%), Gaps = 49/320 (15%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRGVVTAI----- 51 L + P+D + GFLAARAV+ VET + YAR++A+ Sbjct: 210 LRLPVREPFDAPGVFGFLAARAVTGVETASADDDGTLRYARTVALPHGPAAFEVSATPRA 269 Query: 52 ---PDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR------LGAAR 102 D + + + A +A++ RL DL +P V+ ALG L A Sbjct: 270 VSGRDARGWDVQVRVELTSLADVATVVARVRRLLDLDADPVAVDTALGTDPALALLVTAT 329 Query: 103 PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQR 160 PG+R+PG VD E VRAI+GQ +SVA A R+A G + + FP+ Sbjct: 330 PGIRVPGAVDPHELLVRAIVGQQISVAAARTHLGRLAARLGTPYASSFDGLTTVFPSAAA 389 Query: 161 LA------------AADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKT 208 + A+DP + L +P + A++ A G L + + D + Sbjct: 390 IVDGVPVTAPGTPEASDPD--RPLRLPARGVAAVVGATRALAAGDLAVDVGADPDTLRTA 447 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDD--------------YLIKQRFPGMTPAQIR 254 L PG+G WTA Y A+R D + D +R P + Sbjct: 448 LLALPGVGAWTAAYVAMRVLGDPDAWPEGDVALVAGAAAAGIAAASAAERRPTQRHRDLA 507 Query: 255 RYAERWKPWRSYALLHIWYT 274 +A W PWRSYA +H+W Sbjct: 508 AHAAAWAPWRSYAAMHLWAA 527 >UniRef50_C6D2P4 DNA-3-methyladenine glycosylase II n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D2P4_PAESJ Length = 304 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 71/288 (24%), Positives = 116/288 (40%), Gaps = 21/288 (7%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE 68 P+ + + +L A + V R + G + + H + G + Sbjct: 16 PFSYKETVNYLRRSANEPLYQVEGDAVYRLIPTGSGEEPAAVVIRESGHGGLLVRVIGEK 75 Query: 69 PVAAE----CLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFEQGV 118 V+ E A + FD + + L GLR G D FE Sbjct: 76 QVSDERQREIEAFIREWFDFDTDLLPFYEMAEKDPLLVHAIGRFHGLRSVGISDLFEALC 135 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLK 177 I+GQ +++A A L R + YG+ ++ + + FP P+ +A P+ + ++ M K Sbjct: 136 WGIIGQQINLAFAYTLKRRFVEAYGQSVEREGRTFWQFPVPETIATLKPEDMASMQMTSK 195 Query: 178 RAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++E LI +A EG+L + GD K L GIG WTANY +R + + F Sbjct: 196 KSEYLIGVAKLMAEGSLDKQSLLALGDFAAIEKQLTGIRGIGPWTANYVLMRCLRLPNAF 255 Query: 235 LPDDYLIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHIWYTE 275 D + +T ++IR+ AE WK W SYA ++W Sbjct: 256 PIADVGLHNSIKALTGSEAKPAISEIRQMAEGWKGWESYATFYLWRIL 303 >UniRef50_A5CSR4 Putative DNA glycosylase n=2 Tax=Clavibacter michiganensis RepID=A5CSR4_CLAM3 Length = 311 Score = 205 bits (522), Expect = 1e-51, Method: Composition-based stats. Identities = 79/285 (27%), Positives = 118/285 (41%), Gaps = 23/285 (8%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP------DIARHTLHIN 62 P+D ++ FL+ AV+ E + + +S + G VT D+ + + Sbjct: 20 PFDGGGVIRFLSWHAVTGAEEGDATSFTQSARLAHGAGTVTVRLLEAEPGDVGGARVEVT 79 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA------ARPGLRLPGCVDAFEQ 116 AAE LA RL L + ++ L R A A PGLR+PG +D Sbjct: 80 TRVEHAADAAELLAGTRRLLGLDVDAARIDADLARDPALAAVVRATPGLRIPGTLDPRST 139 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGE----RLDDFPEYICFPTPQRLAAADPQALKAL 172 R I+GQ +SVA A R+ GE + PT R+A + L+ Sbjct: 140 LFRTIVGQQISVASARATHGRMTADLGEDLPASVAHGSVTRLPPTAARIARDGGELLRG- 198 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 P +R LI +A A G L + + L F G+G WTA+Y A+R D Sbjct: 199 --PARRTATLIRIAEALETGELVIEPGVPRAELRAALVAFHGVGPWTADYVAMRALGEPD 256 Query: 233 VFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIWY 273 + L D ++++ + + A W PWRSYA LH+W Sbjct: 257 ILLSGDLIVRRGGAALGLPDEARALDARAAAWSPWRSYATLHLWR 301 >UniRef50_C0Z5U6 Putative DNA-3-methyladenine glycosylase II n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5U6_BREBN Length = 309 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 68/294 (23%), Positives = 119/294 (40%), Gaps = 27/294 (9%) Query: 8 PPYDWSWMLGFLAARAVSSVETVAD-SYYARSLAVGEYRGVVTAIPDIA------RHTLH 60 PPY + +L L + + + + R +G +V + R+ Sbjct: 10 PPYSFDRLLRRLETHPDTQIRVNQEKNSLQRVFRIGLRPVLVHMQFMGSLEEPALRYGTQ 69 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 LS + + + + R F ++ G L L GLR D F Sbjct: 70 AILSTSDQQLLEKM---IRRTFSADLELSVIYEQMREEGELAILTERFRGLRPMLDADLF 126 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKAL 172 + V+ I+GQ +++ AA LT R+ L G+ +++ I FPTP +A + L++L Sbjct: 127 QCMVKTIIGQQINLTFAANLTERLVTLAGDPVENQNGEGIIAFPTPDSVARLTVEDLRSL 186 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 ++AE +I A A + T+ + + E+ + L + GIGRWT + G Sbjct: 187 QFSQRKAEYIIDFARAIVNETVDLERLWTMEDEEIITYLTSLRGIGRWTVECLLMFGMGR 246 Query: 231 KDVFLPDDYLIKQRFPGMTPAQ-------IRRYAERWKPWRSYALLHIWYTEGW 277 D+ D ++ + + IR+ E+W PWRS L++W G Sbjct: 247 PDLLPAADIGLRNGIVHLYGMETKPNENDIRKLGEKWAPWRSIYCLYVWEAVGA 300 >UniRef50_Q1YTX8 Putative DNA-3-methyladenine glycosylase II n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YTX8_9GAMM Length = 257 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 85/278 (30%), Positives = 123/278 (44%), Gaps = 24/278 (8%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 M L P+ W +L +L+ R V E++AD++Y R G+V D L Sbjct: 1 MIELPVVKPFPWQQLLEYLSFRLVPEFESIADNHYQRIYRD----GLVRVSYDEPNGLLQ 56 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR---LGAARPGLRLPGCVDAFEQG 117 I S + + +SR+F Q Q + L + A PG R GC D FE Sbjct: 57 IK-SDLPQDQLDNLIVPVSRIFRPQLCTQAIYQQLLPHLPILAKSPGFRPLGCWDPFELC 115 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R I+GQ V+VA A + R+ + G+ TP+ L AAD L +GMP Sbjct: 116 LRTIIGQQVTVAAANTIMRRLVERCGQL-----------TPEALLAAD---LSNMGMPGA 161 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R ALI LA A G L ++ P + + L GIG WT Y A+R D F Sbjct: 162 RVAALIALATALANGDLDLSRP--WPELKEALLKLRGIGPWTCGYLAIRLGMDDDAFPET 219 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D + + + + AE W+P+R+YA + +W E Sbjct: 220 DVGLIRAAKSESAMALLASAELWRPYRAYAAVGLWALE 257 >UniRef50_C5C5F4 HhH-GPD family protein n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C5F4_BEUC1 Length = 330 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 85/308 (27%), Positives = 130/308 (42%), Gaps = 31/308 (10%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVA--DSYYARSLAVGEYRGVVTAIPDIARHTL 59 L + PP D +L L ++ V R++ + +VT + Sbjct: 19 LRLAYTPPLDADALLAALGRHETVGLDRVDPLGRTVTRTVPTPDGPVLVTVHLAADEPVV 78 Query: 60 HINL------------SAGLEPVAAECLAKMSRLFDLQCNPQIVNGA------LGRLGAA 101 +++ +G + ++++ DL +P + L L A Sbjct: 79 VLDVEPLVAVAGVGAVWSGADGALEALVSRVRGWLDLDHDPLAADAVLAADPALAPLVAG 138 Query: 102 RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL------DDFPEYICF 155 PG+R+PG VD FE +LGQ VS+A A T+R YG L P + F Sbjct: 139 APGMRVPGFVDPFEAAATTVLGQQVSLAAARTFTSRFVAAYGTPLRAAGAPSTAPHWFAF 198 Query: 156 PTPQRLAAADPQALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPG 214 PTP+ +A ADP L+A+ G+ RA +L LA A +G T PG + L PG Sbjct: 199 PTPEAIARADPDELRAVVGLTRARASSLTSLAAAFADGLALDTGPGS----RERLLALPG 254 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 IG WTA+Y LR + D F D ++++ + P + AE W+PWR Y + HIW + Sbjct: 255 IGPWTADYLELRLLRDPDAFPAGDLVLRRGLGVVDPDEATALAESWRPWRGYGVFHIWSS 314 Query: 275 EGWQPDEA 282 Sbjct: 315 ATAPQGRG 322 >UniRef50_A4BNP3 3-methyladenine DNA glycosylase/8-oxoguanineDNA glycosylase n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNP3_9GAMM Length = 263 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 78/270 (28%), Positives = 111/270 (41%), Gaps = 27/270 (10%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE 68 P+ W+ +LG+L AR + E + D Y R V L I A Sbjct: 12 PFPWAALLGYLDARLIPGAERIVDDGYERR----HNGATVRVTYHAGGKCLRIT--ADDA 65 Query: 69 PVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP------GLRLPGCVDAFEQGVRAIL 122 E ++ RLFD + + V+ L RP GLR GC FE VR ++ Sbjct: 66 VCGDEITVRVIRLFDTGQDTRAVDRQLRACPLLRPRVDRMPGLRPLGCWCPFELCVRTVV 125 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ VSVA AA L R+A+ GE A+GMP +R L Sbjct: 126 GQQVSVAAAATLMRRLAERCGELSPAALCAADL--------------DAIGMPGRRVATL 171 Query: 183 IHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK 242 LA A G L + D L PGIG WT Y A+R + D+ D + Sbjct: 172 RRLAEAVATGELALE-HADWAAIDAGLSRLPGIGPWTRAYLAIRLGRQPDILPETDLGLL 230 Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 + +P +R ++RW+P+R++A ++W Sbjct: 231 RAAGAASPTVLRALSQRWRPYRAHAATYLW 260 >UniRef50_O31544 Putative DNA-3-methyladenine glycosylase yfjP n=17 Tax=Bacillaceae RepID=YFJP_BACSU Length = 287 Score = 197 bits (502), Expect = 3e-49, Method: Composition-based stats. Identities = 64/287 (22%), Positives = 113/287 (39%), Gaps = 25/287 (8%) Query: 5 NWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVG--EYRGVV-TAIPDIARHTLHI 61 + PPY + +L L+ +++V AR + V G V H Sbjct: 7 SVTPPYHFDRVLDRLSLDPLNAV-----DREAREVRVPIRNQAGDVCIVKVQALGHAGEP 61 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV-----NGALGRLGAARPGLRLPGCVDAFEQ 116 E E + ++ R+F + + Q V +L + G L + Sbjct: 62 EFLVSGETDQGEMMKEIKRIFQWENHLQHVLDHFSKTSLSAIFEEHAGTPLVLDYSVYNC 121 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 ++ I+ Q ++++ A LT R +GE+ D C+P P+ +A D Q L+ L + Sbjct: 122 MMKCIIHQQLNLSFAYTLTERFVHAFGEQKDGV---WCYPKPETIAELDYQDLRDLQFSM 178 Query: 177 KRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++AE I + EGTL ++ E MK L GIG WT + G ++F Sbjct: 179 RKAEYTIDTSRMIAEGTLSLSELPHMADEDIMKKLIKIRGIGPWTVQNVLMFGLGRPNLF 238 Query: 235 LPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYT 274 D ++ + ++ W+P+ SYA L++W + Sbjct: 239 PLADIGLQNAIKRHFQLDDKPAKDVMLAMSKEWEPYLSYASLYLWRS 285 >UniRef50_UPI00018509D2 YfjP n=1 Tax=Bacillus coahuilensis m4-4 RepID=UPI00018509D2 Length = 301 Score = 197 bits (501), Expect = 4e-49, Method: Composition-based stats. Identities = 57/300 (19%), Positives = 113/300 (37%), Gaps = 39/300 (13%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVET----VADSYYARSLAVGEYRGVVTAIPDIA-- 55 ++ + PYD +L + + VE + RS+ +Y G + I Sbjct: 3 VRVSVEQPYDVESVLSYFTGHPLVVVEQSGLRFGLDHGVRSIIDVKYEGEIAIIHSEIDD 62 Query: 56 ----RHTLH-INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGC 110 T+H ++L L+P+ + L + G L Sbjct: 63 MKFIEKTMHILHLDRPLKPI----------------DEFYRKSELQEIFQKYEGYPLLLE 106 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 +D + +R I+ Q V++ +A + + YGE +D FP P L + L+ Sbjct: 107 LDDYMSIIRCIISQQVNLTLARNIFTSLTHTYGEEVDSV---WFFPRPHVLKEVSIEELR 163 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGW 228 + ++AE + A+ +G + M + ++ + L GIG+WT + L Sbjct: 164 THKLSQRKAEYIQGFASLVADGAIDMDELDKLSNDEIIDRLLPIRGIGKWTVENYLLFTL 223 Query: 229 QAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 +++F D I+ T ++ Y+ W P+ SYA +++W + ++ Sbjct: 224 GRENLFPKGDIGIQNALKKFLQLDRKPTMDEMDIYSRDWAPYLSYASIYLWRSLENGSEQ 283 >UniRef50_C8XKJ9 AlkA domain protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XKJ9_NAKMY Length = 300 Score = 196 bits (499), Expect = 7e-49, Method: Composition-based stats. Identities = 87/299 (29%), Positives = 125/299 (41%), Gaps = 35/299 (11%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L P+ +L FL+ +V VE V + YARSL +G D T+ Sbjct: 14 VVDLPVAGPFAADRLLAFLSRESVPGVEYVREREYARSLRLGSG-------DDAEVGTIR 66 Query: 61 INLSAGLEP-----------VAAECLAKMSRLFDLQCNPQIVNGALGRLGAAR------P 103 ++L +P E +A+ L DL + V+ L P Sbjct: 67 LHLPGPGDPPTVRAVVRFAARIDEAVARCRHLLDLDTDGSAVDRVLRADPGLAASVQRCP 126 Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRL 161 GLR+PG + E VR ILGQ VSVA A R+ L +RL + + FP P R+ Sbjct: 127 GLRVPGPAEPAETVVRTILGQQVSVAGARTAATRLVALADDRLPEPVDGLTHLFPEPARI 186 Query: 162 AAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTAN 221 AA P A R A + AA + + A L PGIG WTA+ Sbjct: 187 AALGPTA-----FVGPRIRAQAVVTAAAAIAGGTLRLDRTDAHARGVLLAMPGIGPWTAD 241 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGMT----PAQIRRYAERWKPWRSYALLHIWYTEG 276 Y ++R + DV L DD I++ + P ++ W+P+RSYA +H+W G Sbjct: 242 YLSMRVFGDPDVLLVDDLAIRRGAGALGLPDQPRELAARGLDWRPFRSYAGMHLWAASG 300 >UniRef50_C7PMW8 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PMW8_CHIPD Length = 302 Score = 195 bits (495), Expect = 2e-48, Method: Composition-based stats. Identities = 63/284 (22%), Positives = 116/284 (40%), Gaps = 18/284 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGL-- 67 + ++ L FL+ + V D+ + + + ++ D A L + G Sbjct: 17 FSFAECLVFLSRSEKECLHYVQDNAVQKMVISNGHPVLLEISDDPAAKALKARVLDGPGE 76 Query: 68 EPVAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 + A + +S L+ + + L L GLRL G D FE +I Sbjct: 77 DIDDAHIIKYISHWLHLEADLRPFYKFAKKDAVLKPLADRYKGLRLIGIPDLFEALTWSI 136 Query: 122 LGQLVSVAMAAKLTARVAQLYGER-LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 GQ +++ A L R Q +G + + +Y +P P +A+ +P +L A+ +A+ Sbjct: 137 TGQQITLGFAYTLRQRFIQAFGHHAVINGKDYYVYPHPAVVASLEPASLIAMQFSRSKAD 196 Query: 181 ALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 +I LA A G L D +QA L +F GIG W+ANY ++ + + +D Sbjct: 197 YIIGLAKAMTGGLLTDKQLWEMDYQQARAHLISFRGIGNWSANYVLMKYHRHHEALPLED 256 Query: 239 YLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + + A ++ Y W+ + +YA ++W + Sbjct: 257 AGLHNALKQQLQLTAKPSLADVKAYTGHWREYAAYATFYLWRSL 300 >UniRef50_C6W476 HhH-GPD family protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W476_DYAFD Length = 300 Score = 194 bits (493), Expect = 3e-48, Method: Composition-based stats. Identities = 61/289 (21%), Positives = 112/289 (38%), Gaps = 18/289 (6%) Query: 8 PP-YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAG 66 PP + + FL + T+ + +++ + + I A Sbjct: 11 PPLFSFRECHWFLDRDFDDCMHTIRGNAVLKAIRTSFGDILFRVSEEANFLKTEILYGAA 70 Query: 67 LEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 + ++ FDL + + + L + A GLRL G D FE + Sbjct: 71 APEARDLVVGYVANWFDLNRDIEPFYDLLAADSRLAYMTDAFRGLRLVGISDMFEAICWS 130 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 I+GQ +++ A KL R+ + YG ++ + + FPTP+ LA A L+A+ K+A Sbjct: 131 IIGQQINLTFAYKLKRRMVERYGTHVEWNGEVFPVFPTPEALANAGIDELRAMQFSQKKA 190 Query: 180 EALIHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 E ++ +A A +G L D K L + G+G WTANY ++ ++ + Sbjct: 191 EYVVGIAQAFADGKLNAEVISALPDFASRQKVLVAYKGVGIWTANYVLMKTFRMPEGIPH 250 Query: 237 DDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D + G +I + W +Y ++W + + Sbjct: 251 GDVGLLNALAGHGIIGDRSEKEKIEALFHAFPGWETYLTFYLWRSLAMK 299 >UniRef50_D2PPK3 Transcriptional regulator, AraC family n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PPK3_9ACTO Length = 435 Score = 193 bits (492), Expect = 4e-48, Method: Composition-based stats. Identities = 75/276 (27%), Positives = 121/276 (43%), Gaps = 31/276 (11%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP---DIARH 57 + L +QPPYDW M+ LAARAV VE+V+D Y R++ + GV+ P D+ Sbjct: 183 LMRLPYQPPYDWDAMVDHLAARAVPGVESVSDRVYRRTIGLDGGAGVLEIGPGEGDVLML 242 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 H+ GL V A++ + + + + LG L ARPGLR+PG A E Sbjct: 243 RAHLPYWEGLIHVVERA-ARLVGVASEPADRLLRDPLLGPLVVARPGLRVPGAWGALEIA 301 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALGMP 175 V+A+ Q S+ R+ + G+ + + FP+ + LA++ Sbjct: 302 VQAVTAQDHSLKETRAQLGRLVKECGQPVPGLTDRLTHLFPSAEVLASSSTGI------- 354 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 + LA A +G + + G E + L T PG+ TA++ ALR +DVF Sbjct: 355 ------VQSLAAAVADGRVSLE-GGSSEVLLAQLTTVPGLMPDTADWIALR-LGHQDVFP 406 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 + ++RW+P R+ A ++ Sbjct: 407 ----------RSLHAEVAAEVSDRWRPHRAVAATYL 432 >UniRef50_D1CD20 DNA-3-methyladenine glycosylase II n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CD20_THET1 Length = 301 Score = 192 bits (489), Expect = 9e-48, Method: Composition-based stats. Identities = 75/281 (26%), Positives = 119/281 (42%), Gaps = 17/281 (6%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE 68 P + L +L + +E V + R+L + E ++ + + + A + Sbjct: 12 PLRLAHALLYLRTSPSAVLEKVTEDACRRALRINERAVLLQVRQEGQGVRVTLWGDALDD 71 Query: 69 PVAAECLAKMSRLFDLQCNP-------QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 A A++ R+F L +P + + LGR+ R D E + AI Sbjct: 72 ATVAAAEAEVRRIFLLDEDPGAFYREVPLRDRVLGRVMEDYLWARPVLIADPLEALMWAI 131 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 +GQ V+VA A KL AR+ +L G L+ D Y FP R+A L+ ++A Sbjct: 132 IGQQVNVAFARKLKARLVELCGSVLEVDGERYWVFPPAWRIADLPEDLLRGNQFSRQKAR 191 Query: 181 ALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 ++ LA A G L + G VE+A+ L F G+GRWTA Y +RG DV D Sbjct: 192 YILGLARAVASGELDLRALGVLPVEEAIAELVRFLGVGRWTAEYVLMRGLGRADVIPAAD 251 Query: 239 YLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIW 272 ++ T A++R + W PWR++ W Sbjct: 252 LGLRAVMGRHYLGGRVATEAEVREISAAWSPWRAWGAWLWW 292 >UniRef50_Q7N9Z6 Similarities with the C-terminal region of 3-methyladenine DNA glycosylase n=2 Tax=Enterobacteriaceae RepID=Q7N9Z6_PHOLL Length = 299 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 62/239 (25%), Positives = 101/239 (42%), Gaps = 26/239 (10%) Query: 49 TAIPDIARHTLHINLSAGLEPV-AAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRL 107 T PD TL ++ L+PV ECL K + +G L A + GLR+ Sbjct: 72 TTGPDERLSTLASHMPGLLQPVHLFECLYK-------------RHPVIGSLIARQSGLRI 118 Query: 108 PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQ 167 FE A++GQ +SV+ A + R Q G + CFPT +++ Sbjct: 119 YQSATPFEALSWAVIGQQISVSAAISIRRRFIQAMG--VQHSSGLWCFPTARQIINHSED 176 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFA 224 L+ G + +A+AL+ L+ G L + D++Q + L GIG WT NY Sbjct: 177 ELRQCGFSVSKAKALLRLSQLIESGELTLAISNSETDIQQLIDNLLAIKGIGMWTINYSL 236 Query: 225 LRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEG 276 LRG+ + L D +++ + + Q ++ + PW++ H+W E Sbjct: 237 LRGFNYLNGSLHGDVAVRRNIQRLFNQNEKVSAEQAEKWLADFAPWKALLAAHLWQQES 295 >UniRef50_D1P0X5 DNA-3-methyladenine glycosylase II n=4 Tax=Enterobacteriaceae RepID=D1P0X5_9ENTR Length = 303 Score = 186 bits (474), Expect = 5e-46, Method: Composition-based stats. Identities = 66/288 (22%), Positives = 113/288 (39%), Gaps = 20/288 (6%) Query: 8 PP-YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSA- 65 PP Y F + E V ++ + + +Y +T + T +++ A Sbjct: 15 PPHYRVDDFFAFHLRDPQNIAEIVTENTLRKGIIWQQYPAQITLSIENHNATFSLDIDAL 74 Query: 66 GLEPVAAECLAKMSRLFDLQC------NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVR 119 + E L + L L + + + LG+L + G+R+ FE V Sbjct: 75 QVSATQHEKLTLATHLLGLNQPVELFEDIYLSHPILGKLITPQRGVRVYQSASTFEALVW 134 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 AI+GQ +SV A + R Q G + CFPT Q++A D L+ G + Sbjct: 135 AIIGQQISVLAAIAIRRRFIQAVGMQHSSG--IWCFPTVQQVAQVDDNILRKTGFSTGKI 192 Query: 180 EALIHLANAALEGTLPMTI---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 AL + A L + + P +VE L GIG WT +Y LRG+ D L Sbjct: 193 IALRGVCEAIENQRLDLDLTVTPDNVEDVTAQLLAIKGIGPWTISYALLRGFNYLDGSLH 252 Query: 237 DDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGW 277 D +++ + + + + + ++ PWR+ H+W + Sbjct: 253 GDVAVRRNLQTLLNHTEQPSTKETQHWLVQFAPWRALVAAHLWRYQSA 300 >UniRef50_A9B7A8 Transcriptional regulator, AraC family n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B7A8_HERA2 Length = 489 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 72/282 (25%), Positives = 113/282 (40%), Gaps = 21/282 (7%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGV---VTAIPDIARHTLHINLSAG 66 Y +LG L VS + V + + + + GV VT P A ++ + SA Sbjct: 206 YPSRQILGQLGRDPVSLTDQVVEQTWYSTCRLNGQTGVLLAVTITPTTAECSI-VEQSAV 264 Query: 67 LEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 A + L +P + AL L + GLR+P + F+ V A Sbjct: 265 TPSDVATIHRHVIAGLGLSNDPSRFEAHVAKSPALLPLIEHQRGLRMPLVHNPFDALVWA 324 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 ILGQ +S+A+A +L R+ +L G+RL+ ++ PTP +A + L LG +A Sbjct: 325 ILGQQISLAVAYRLRQRLTELVGQRLNQ--DFYLAPTPNTIAQLTVEQLLPLGFSNAKAR 382 Query: 181 ALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 LI A A + +LP+ + + L GIG WTA Y +R + D D Sbjct: 383 YLIDTAQAIIAESLPLASYHRKSATRIERELLALRGIGPWTAQYVLMRSFGFSDCVPVGD 442 Query: 239 YLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 + + + P+RS A H+W Sbjct: 443 SGLTSSLQAFFQLEQRPDRSTTLALMAAFSPYRSLATFHLWQ 484 >UniRef50_B9XBY0 HhH-GPD family protein n=1 Tax=bacterium Ellin514 RepID=B9XBY0_9BACT Length = 294 Score = 185 bits (470), Expect = 1e-45, Method: Composition-based stats. Identities = 65/274 (23%), Positives = 103/274 (37%), Gaps = 19/274 (6%) Query: 19 LAARAVSSV-ETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAK 77 AR + E + + ++ V+ I L L + Sbjct: 19 FQARDPEGLAERLEPNRIRKAFVFEAIPLVLDISLAKNMANCRIEADRPLPHATRTTLQQ 78 Query: 78 MSR-LFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAM 130 ++R L L+ +P+ + L L + GLR+P FE AI+GQ ++++ Sbjct: 79 IARNLLALRIDPEPFEAMAKEDNLLASLVQKQTGLRIPHTTTPFEALAWAIIGQQINLSF 138 Query: 131 AAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAAL 190 A L QL G + C P +A +P L L +AE L+ +A Sbjct: 139 AITLRRSFIQLAGTKHSSG--LWCHPDASAVARLNPDHLGQLKFSRAKAETLVRMAQLVD 196 Query: 191 EGTLPMTIPGD--VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 G LP+ + E+ L GIG WT NY LRG+ D L D I+ + Sbjct: 197 SGKLPLDEWQNHSPEEIQAALLAIKGIGPWTVNYTLLRGFAFADCSLHGDAAIRNALNRL 256 Query: 249 -------TPAQIRRYAERWKPWRSYALLHIWYTE 275 T +I +R++P RS H+W + Sbjct: 257 SGSATKPTIKEIETLLQRYRPHRSMTAAHLWKSL 290 >UniRef50_C6MGP3 HhH-GPD family protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MGP3_9PROT Length = 316 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 65/289 (22%), Positives = 115/289 (39%), Gaps = 18/289 (6%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHIN 62 L Y S +L F ETV D+Y+++ + + ++ + + Sbjct: 9 RLPLPENYRISDILSFHGRDKWDVAETVYDNYFSKGIIWHDTPACLSIRFQQLYVEIELC 68 Query: 63 LSAGLEP-VAAECLAKMSRLFDLQC------NPQIVNGALGRLGAARPGLRLPGCVDAFE 115 + L+ + R+ L + LG L A + GLR+P AFE Sbjct: 69 IDKQLKTFCPDTFHSMAIRMLGLNQSVNTFEEEFREHAQLGSLIAKQSGLRVPVSATAFE 128 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 + AI GQ +S++ A + ++ QL G R C P Q L+ L+ +G Sbjct: 129 ALIWAIAGQKISISAALAIRRKLIQLIGLRHSGG--LYCHPNAQHLSHLSISDLRQIGFS 186 Query: 176 LKRAEALIHLANAALEGTLPMTIPG---DVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +A+ ++ ++ + L ++ +E + L GIG WT +Y LRG+ D Sbjct: 187 HSKAQTILTVSQRVICNELELSSASAEPPIEHIRQQLLQIRGIGLWTVDYTLLRGYGWLD 246 Query: 233 VFLPDDYLIKQRFP------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 L D +++ + Q R++ E + PWR+ H+W E Sbjct: 247 GSLHGDVAVRRGLQILLNCESINENQTRQWLENFSPWRALVAAHLWNIE 295 >UniRef50_C7MAP3 Adenosine deaminase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MAP3_BRAFD Length = 515 Score = 183 bits (465), Expect = 6e-45, Method: Composition-based stats. Identities = 79/296 (26%), Positives = 114/296 (38%), Gaps = 31/296 (10%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTA-IPDIARHTLHIN 62 L + P+D + + + A RAV VE V + R++ + GV+ + A H L + Sbjct: 206 LAVRRPFDGAGLAAWFAHRAVPGVEEVDGLRWTRAVHLPHGPGVLQVDLGGPAPHPLPLT 265 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR-------LGAARPGLRLPGCVDAFE 115 L A ++ RL DL +P ++ L R L AARPG+RLPG E Sbjct: 266 LRLADLRDHAVAVSLTRRLLDLDADPVGIDDGLRRTLPALAPLLAARPGVRLPGTPTLAE 325 Query: 116 QGVRAILGQLVSVAMAAKLTAR----VAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 + A+ GQ ++ A A R +A E L P AA + Sbjct: 326 ALLWAVTGQQITTAQARDQITRATDLLATALPEALRTGSVERLPVLPANAAARAEDWFRG 385 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 P R+ L A LP P D + + G+G WTA+Y LRG +A Sbjct: 386 ---PRARSRTLQEAVPAIAADDLPARWPLD--ELRSRVLALRGVGPWTADYVLLRGLRAI 440 Query: 232 DVFLPD---------DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D D + ++R E PWRSYA LH+W + Sbjct: 441 DAAPAGDAALLGAARDLGL-----AEDHTALQRVLEAASPWRSYAALHLWQHQATP 491 >UniRef50_C1RNZ7 DNA-3-methyladenine glycosylase II n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RNZ7_9CELL Length = 302 Score = 183 bits (464), Expect = 7e-45, Method: Composition-based stats. Identities = 87/282 (30%), Positives = 120/282 (42%), Gaps = 16/282 (5%) Query: 7 QPPYDWSWMLGFLAARAVSSVETVA--DSYYARSLAVGEYRGVVTAIPDIARHTLHINLS 64 + D L LAAR+V VE V +R + +G VTA + + Sbjct: 2 RTALDHRAALSSLAARSVPGVERVDVDAGTVSRLVELGAGPVHVTAHVAATG----VRVD 57 Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGV 118 A A +R F L + V + LG L AARP LR+PG D FE V Sbjct: 58 ADGPAAPAALDDLATRWFGLADDLAPVHAALGGDPVLGPLVAARPHLRVPGHPDGFEAAV 117 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA-LGMPLK 177 + +L Q VS+ AR+A YG + +P P+ LAAAD AL+A L +P Sbjct: 118 QVVLTQQVSLGAGRTTGARLASAYGRP--GPGGLLAYPRPEDLAAADSVALQAVLRVPHA 175 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 RA A+ LA A G L + L PGIG WTA+ ALR +D F Sbjct: 176 RARAVHALAVACA-GGLRLVPGAPAADVRAALLAIPGIGPWTADVVALRALGDRDAFPAG 234 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D ++++ + W PWR++A H+W G+ Sbjct: 235 DLVLRRALGVPDVRDVATAGRAWSPWRAFAATHLWAAVGYPA 276 >UniRef50_Q81IC3 DNA-3-methyladenine glycosylase II n=75 Tax=Bacillus RepID=Q81IC3_BACCR Length = 287 Score = 182 bits (462), Expect = 1e-44, Method: Composition-based stats. Identities = 57/297 (19%), Positives = 111/297 (37%), Gaps = 41/297 (13%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 TL + PY + +L L+ ++ + + E + D + + + Sbjct: 6 VTLEY--PYHFEEVLKRLSFDPLN------------VIQLDEKVIYIPLCIDEEQVVVRL 51 Query: 62 NLSAGLEP----------VAAECLAKMSRLFDL-----QCNPQIVNGALGRLGAARPGLR 106 ++ + + +M +F +N +L L Sbjct: 52 QGIGTVQNPQFWISSQTGDPEKVMKRMRAIFHWNEPFQDIQNHFLNTSLRPLFETYAYTP 111 Query: 107 LPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADP 166 + D F +R I+ Q +++ A LT + + YG + FPTP+ +A Sbjct: 112 IILEFDYFACLLRCIIHQQINLKFATVLTEQFVKRYGTEKNGV---FFFPTPEIVANISI 168 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFA 224 + L+ ++AE ++ L + + GTL + G E L GIG WT F Sbjct: 169 EELREQKFSQRKAEYIVGLGRSIVSGTLNLASIENGTEEDISAQLLPIRGIGAWTVQNFL 228 Query: 225 LRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYT 274 + G K++F D I++ G+ A + + + +P+ SYA L++W + Sbjct: 229 MFGLGRKNMFPKADIGIQRAVQGIFQLDDKPDDAFLEKVKQECEPYCSYAALYLWKS 285 >UniRef50_A1ZCF3 HhH-GPD n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZCF3_9SPHI Length = 207 Score = 181 bits (461), Expect = 2e-44, Method: Composition-based stats. Identities = 54/179 (30%), Positives = 90/179 (50%), Gaps = 20/179 (11%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D + VR+I+GQ +SV AA + R +L+ E +PTP+ + AA+ LKA Sbjct: 30 DIYLALVRSIVGQQLSVKAAATIYQRFRELFPEN---------YPTPKLVVAAELDTLKA 80 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQAMKTLQTFPGIGRWTANYFALRGWQ 229 G+ ++A + ++A A+EG L + + E+ ++ L T G+GRWT + +Q Sbjct: 81 AGLSKQKATYIKNVAAFAIEGGLDFEVLNNQTDEEIIQVLITIKGVGRWTVEMLLMFAFQ 140 Query: 230 AKDVFLPDDYLIKQRFPGMT---------PAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DVF DD I+Q + A+++ A WKP+R+ A L++W + P Sbjct: 141 RPDVFSVDDLGIQQAVKKLYQLDEEGKALKAKMKTIANAWKPYRTLACLYLWQWKDNTP 199 >UniRef50_Q2BC23 DNA-3-methyladenine glycosylase II n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BC23_9BACI Length = 299 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 60/280 (21%), Positives = 105/280 (37%), Gaps = 14/280 (5%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 + + L FL + + V S + + ++ L A + Sbjct: 18 FHFREALVFLDRSSYEILHYVEGSAVFKGIITDGEVILLKISSTETHLHASFLLGAPSDN 77 Query: 70 VAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILG 123 + A + DL+ + + L L GLR+ G D FE V A++G Sbjct: 78 GRKQAAAFIEEWLDLKRDASGFGRMAAGDPLLKGLAERYAGLRIIGIPDLFEALVWAVIG 137 Query: 124 QLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q +++ A KL + YG + + FP P +AA +P+ LK L ++AE + Sbjct: 138 QQINLTFAYKLKKAFTEKYGTCFSYEGRCFWLFPEPGMIAALEPEELKQLQFTGRKAEYI 197 Query: 183 IHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK 242 I +A E L A L + G+G WTA+Y ++ F D + Sbjct: 198 IGIAKLMAEKKLKKDDLLGQPGARDVLMSLKGVGAWTADYVRMKCLLDPAAFPIGDAGFQ 257 Query: 243 QRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + ++ + A RW W++YA+ + W + Sbjct: 258 NALKLQMGLDRKPSIEEVEKAASRWAGWQAYAVFYFWRSL 297 >UniRef50_Q9KC25 DNA-3-methyladenine glycosidase n=1 Tax=Bacillus halodurans RepID=Q9KC25_BACHD Length = 221 Score = 181 bits (459), Expect = 3e-44, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 82/201 (40%), Gaps = 18/201 (8%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 + L + ++LP + F+ V +I+ Q +S+ A+ + RV QL G L+ Sbjct: 16 AQDSRLFQFIEIAGEVQLPTKPNPFQSLVSSIVEQQLSIKAASAIYGRVEQLVGGALEK- 74 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMK 207 P++L +AL+ G+ ++ E + H+ G L T E ++ Sbjct: 75 --------PEQLYRVSDEALRQAGVSKRKIEYIRHVCEHVESGRLDFTELEGAEATTVIE 126 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERW 260 L GIG+WTA F + DV D +++ + + + + W Sbjct: 127 KLTAIKGIGQWTAEMFMMFSLGRLDVLSVGDVGLQRGAKWLYGNGEGDGKKLLIYHGKAW 186 Query: 261 KPWRSYALLHIWYTEGWQPDE 281 P+ + A L++W G +E Sbjct: 187 APYETVACLYLWKAAGTFAEE 207 >UniRef50_B3T536 Putative HhH-GPD superfamily base excision DNA repair protein n=1 Tax=uncultured marine microorganism HF4000_ANIW137P11 RepID=B3T536_9ZZZZ Length = 209 Score = 180 bits (458), Expect = 4e-44, Method: Composition-based stats. Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 19/200 (9%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 +++ AL + A+ L L D F V+AI+GQ +S+ AA + RV L GE Sbjct: 20 LIDPALAAVINAKGELGLSSRGDLFATLVKAIVGQQISIKAAATVWGRVVDLIGEVK--- 76 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP-GDVEQAMKT 208 P+ + A + L++ G+ ++AE + +A A G D E+A++ Sbjct: 77 --------PESVLAHTHEELRSCGLSNRKAEYVAGIAEAWQGGYAEYDWDSMDDERALEL 128 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQIRRYAERWK 261 L G+GRWTA + DVF DD + + + A++ A W Sbjct: 129 LVALRGVGRWTAEMVLIFTLLRPDVFPIDDLGVVRGMEKVYNEGEVLDKAELNDIASNWS 188 Query: 262 PWRSYALLHIWYTEGWQPDE 281 PWR+ ++W +P E Sbjct: 189 PWRTVGSWYMWRAIDPEPIE 208 >UniRef50_B7K2N0 DNA-3-methyladenine glycosylase II n=5 Tax=Chroococcales RepID=B7K2N0_CYAP8 Length = 206 Score = 180 bits (458), Expect = 4e-44, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 81/199 (40%), Gaps = 20/199 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L L + P + + F V+AI+GQ +SV A ++ R+ L G Sbjct: 18 DKILAYLISLYPDETIINYHNPFYTLVKAIIGQQISVNAANAISKRLESLLGT------- 70 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTL 209 + + A D +AL+ G+ + + ++A A +G L ++ + L Sbjct: 71 ----ISIETYLAMDSEALRQCGLSRPKISYITNIAQAFEQGILTPQIWPMMSDQEVISQL 126 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKP 262 + GIG WTA F + D+ D + +T +I+ ++ WKP Sbjct: 127 ISIKGIGLWTAQMFLIFHLHRSDILPLADLGLINAIQRHYGQSQRLTKGEIQELSQVWKP 186 Query: 263 WRSYALLHIWYTEGWQPDE 281 +R+ A ++W + P + Sbjct: 187 YRTVATWYLWRSLDPIPVQ 205 >UniRef50_Q1ITU3 DNA-3-methyladenine glycosylase II n=2 Tax=Bacteria RepID=Q1ITU3_ACIBL Length = 251 Score = 180 bits (457), Expect = 5e-44, Method: Composition-based stats. Identities = 51/212 (24%), Positives = 92/212 (43%), Gaps = 16/212 (7%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARP--GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTA 136 R + V+ L +L A P ++ + FE + +I+ Q +S AA + Sbjct: 3 DRHEKAIAHLSKVDKKLAKLIAKCPPCAIKPNYMQNVFEALMESIVYQQLSGKAAATILN 62 Query: 137 RVAQLYGERLDDFPEY-----ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE 191 RV LY + + FPTP++L A + L++ G+ + +++ LA ++ Sbjct: 63 RVKALYFPPDTPTHDTRHGKALPFPTPEQLLATPDETLRSAGLSGNKTKSVKDLAAKTID 122 Query: 192 GTLPMTIPG---DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 GT+P ++ + L GIGRWT L KDV+ DD +++ + + Sbjct: 123 GTVPDIATMKKMSDDEIINHLTQVRGIGRWTVEMILLFNLFRKDVWPVDDLGVRKGYGYL 182 Query: 249 ------TPAQIRRYAERWKPWRSYALLHIWYT 274 P ++ E +KP+RS A ++W Sbjct: 183 HGIEMPKPKELMALGEVYKPYRSVAAWYMWRA 214 >UniRef50_Q2FMK1 HhH-GPD n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FMK1_METHJ Length = 309 Score = 180 bits (456), Expect = 6e-44, Method: Composition-based stats. Identities = 47/217 (21%), Positives = 90/217 (41%), Gaps = 15/217 (6%) Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQL 125 L + D C+ ++ RL GLR P FE + +++ Q Sbjct: 82 DLITWYFALNDNLMDFLDAICSDPVMKSLAHRLD----GLRSPATPTVFEALIDSVIEQQ 137 Query: 126 VSVAMAAKLTARVAQLYG-ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIH 184 +S+++A L R + +G + + C+P P+ LA +P + G ++ E + Sbjct: 138 ISLSVARSLEYRFIRQFGRTCFVNGDLHYCYPLPEDLAGLEPSDFRRCGFTSRKGEYIRD 197 Query: 185 LANAALEGTLPMTI---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 ++ + +G L + D ++ L GIGRWTA LRG D F DD + Sbjct: 198 ISRSIEKGNLDLESFKKVRDNADIVEALCQIRGIGRWTAELTMLRGLHRMDAFPADDIAL 257 Query: 242 KQRF-------PGMTPAQIRRYAERWKPWRSYALLHI 271 ++ ++ ++ + AE+W ++ A ++ Sbjct: 258 RRMISRWYHNGKKISASEAVKTAEQWGEYKGLASFYL 294 >UniRef50_D0J4I7 HhH-GPD n=2 Tax=Comamonas testosteroni RepID=D0J4I7_COMTE Length = 329 Score = 180 bits (456), Expect = 7e-44, Method: Composition-based stats. Identities = 72/244 (29%), Positives = 105/244 (43%), Gaps = 26/244 (10%) Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCV 111 +LH + E A A + R+ L P + + LG L + + GL +PG Sbjct: 81 SLHAAHTEPAEADRAALQAMVKRMLGLIYAPDQLELAHGDHPELGVLLSRQAGLHVPGSP 140 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD---DFPEYICFPTPQRLAAADPQA 168 FE AI GQ ++VA+A L ++ L GE L D P +P QR+AA A Sbjct: 141 TPFEALTWAITGQQITVAVAVSLRRKLIALAGEPLAQDGDMPALHAYPDAQRVAALGLDA 200 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMT-----------IPGDVEQAMKTLQTFPGIGR 217 L+ G +A+ L+ +A A EG LP+ DV A L GIG Sbjct: 201 LRGAGFSQAKAQTLLAVAQAVAEGQLPLDDWAARSAVGRWSEEDVAAASAQLLAVKGIGP 260 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQRFPGMT------PAQIRRYAERWKPWRSYALLHI 271 WT NY LRG+ D L D +++ +T + E++KPWR+ H+ Sbjct: 261 WTVNYTLLRGYGWPDGSLHGDVAVRRAIGLLTGSDKPDARAASDWLEQFKPWRALVAAHL 320 Query: 272 WYTE 275 W + Sbjct: 321 WASL 324 >UniRef50_A6CCG3 Probable DNA-3-methyladenine glycosylase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCG3_9PLAN Length = 211 Score = 179 bits (454), Expect = 1e-43, Method: Composition-based stats. Identities = 40/178 (22%), Positives = 75/178 (42%), Gaps = 18/178 (10%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F +R+I+ Q +S + A + R+ L G+ PT +++ + L+++G Sbjct: 41 FALLLRSIVSQQISTSAARTIYLRLHALTGKGQ---------PTAEKVMQLSHEQLRSVG 91 Query: 174 MPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + ++A + HLA ++ + + E L GIG WTA F + G Sbjct: 92 LSNQKATYVRHLAEMVMQNKVRLHKMHLLSDEDVTSELIQVKGIGVWTAQMFLMFGLCRP 151 Query: 232 DVFLPDDYLIKQRFPGMTPAQIR-------RYAERWKPWRSYALLHIWYTEGWQPDEA 282 D+F DD I+ + + R A+RW+P+R+ A + W + + Sbjct: 152 DIFPHDDLGIQNGIQKIYELKTRPDKQTCIEIAQRWQPYRTVASWYCWRILEMETPDG 209 >UniRef50_Q82VT3 HhH-GPD n=2 Tax=Betaproteobacteria RepID=Q82VT3_NITEU Length = 205 Score = 178 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 55/202 (27%), Positives = 83/202 (41%), Gaps = 20/202 (9%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + + R+ +AF RAI+GQ +SV AA + +V L E Sbjct: 12 DLSARDPVMHRIIQCYSDSMPEERGNAFATLARAIVGQQISVKAAASVWQKVTTLIPE-- 69 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQ 204 TP+ L A + L+ G+ ++ + L L+ LEGTL D E Sbjct: 70 ---------ITPEALIATEIDLLRTCGLSARKVDYLRDLSRHFLEGTLVTVNWHDLDDET 120 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYA 257 ++ L GIGRWTA F + DV DD +++ + IR A Sbjct: 121 LIRKLVEVKGIGRWTAEMFLIFHLHRPDVLPLDDIGLQRAVSLHYNASQPVAKQAIRTIA 180 Query: 258 ERWKPWRSYALLHIWYTEGWQP 279 E W+PWRS A ++W + P Sbjct: 181 ESWQPWRSVATWYLWRSLDPIP 202 >UniRef50_A6WG49 HhH-GPD family protein n=5 Tax=Actinomycetales RepID=A6WG49_KINRD Length = 295 Score = 178 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 86/287 (29%), Positives = 124/287 (43%), Gaps = 12/287 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L + + +L A A+ E D + R + G V+ + + L Sbjct: 10 LAVRGELAADPLRRWLRAHALPGAERHLDGVHERVVPTGAGPVEVSVDLGTSPGCEQVVL 69 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQG 117 A E + R L +P + L L AARPGLR+P V E Sbjct: 70 HAP-AAALDEVEGTVRRWLGLDADPAEAEAWLARDPLLAPLVAARPGLRVPRAVAGVETA 128 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC-FPTPQRLAAADPQALKAL-GMP 175 V +LGQ VS+A A R+ +G + P + FP LA A +A++A G+ Sbjct: 129 VLTVLGQQVSLAAARTFGGRLVAAFGTPVSSAPSSLTAFPAAAVLADAGAEAIRAATGVT 188 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTF---PGIGRWTANYFALRGWQAKD 232 RA + LA A G GD E+A PGIG WTA+Y ALR +D Sbjct: 189 GARARTVHALAAALAGGLDLDAAAGDPERAGAARARLLALPGIGPWTADYVALRVLGDRD 248 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 FLP D ++++ G++P + AE W+PWR +ALLH+W + P Sbjct: 249 AFLPGDLVLRRALGGLSPKEAAARAEPWRPWRGHALLHLWTAAVFVP 295 >UniRef50_C7NLP9 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NLP9_KYTSD Length = 286 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 77/278 (27%), Positives = 116/278 (41%), Gaps = 19/278 (6%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSY--YARSLAVGEYRGVVTAIPDIARHTLHINLSAG 66 P+D LG L A AV + + R + V +VT D T ++ + Sbjct: 14 PFDRVAALGTLTAHAVDGLHRLDPDTTELTRWVDVHGDPQLVTVCLDPGGAT--VSTATR 71 Query: 67 LEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA------ARPGLRLPGCVDAFEQGVRA 120 V+ E A++ FDL + VN LG +RPG+R+ FE + Sbjct: 72 DAGVSDEIAARVQHWFDLDTDLTPVNARLGADPVLAGQVRSRPGIRITRFHAPFEAVILT 131 Query: 121 ILGQLVSVAMAAKLTARVAQLYGE---RLDDFPEYICFPTPQRLAAADPQALKA-LGMPL 176 +LGQ VS+A AR+ YG+ + P FPTP L A + L+A +G+ Sbjct: 132 VLGQQVSLAAGRLFAARLIAAYGDDAAPVRQEPGLRVFPTPVALTAVPVEELRAVIGLTG 191 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 RA + +A E ++P A L GIG WT +Y A+R D F Sbjct: 192 TRARTVHAVAAHFAETARDASLP-----ARAELHAVHGIGPWTLDYLAIRASTDADAFPA 246 Query: 237 DDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 D ++++ ++P A W P+RSYA +W Sbjct: 247 TDAVLRRTLAAISPDTGPERAASWSPYRSYAASRLWAH 284 >UniRef50_Q5NXL1 DNA-3-methyladenine glycosidase II n=3 Tax=Betaproteobacteria RepID=Q5NXL1_AZOSE Length = 300 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 69/281 (24%), Positives = 115/281 (40%), Gaps = 19/281 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 + +L F ++ E V + + +T D A+ + + + Sbjct: 15 FRTDDVLAFHRRDPLAVAERVEGQTLQKGVVWEGRPACLTIRFDSAQASAELAIDGAPGA 74 Query: 70 VAAECLAKMS-RLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 A LA++ R+ L ++ + LG L A PGLR+P FE AI Sbjct: 75 AAPAALAQLLPRMLGLTQQVEVFERTYRDHPQLGPLIARHPGLRVPLSASPFEALSWAIT 134 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ +SV A L R+ ++ G L C+P +R+A + L++ G +A+ L Sbjct: 135 GQQISVRAAISLRRRLIEVAG--LRHSVGLACYPDAERVAGLNEADLRSAGFSQAKAQTL 192 Query: 183 IHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 I L E LP+ V++ + L GIG WT +Y LRG+ D L D Sbjct: 193 IRLGRLVAEDELPLNTWIATLPVDEIRERLMRVRGIGPWTIDYALLRGFGWLDGSLHGDV 252 Query: 240 LIKQRFPG-------MTPAQIRRYAERWKPWRSYALLHIWY 273 ++++ +T Q +R+ + PWR+ H+W Sbjct: 253 VVRRSLQAVLDCPDSVTEGQAKRWLAEFSPWRALIAAHLWA 293 >UniRef50_Q01SY7 DNA-3-methyladenine glycosylase II n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01SY7_SOLUE Length = 200 Score = 176 bits (448), Expect = 6e-43, Method: Composition-based stats. Identities = 50/202 (24%), Positives = 85/202 (42%), Gaps = 23/202 (11%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 +++ + R+GA R P FE VR+I+ Q +S +A + R+ G + Sbjct: 12 DPVLSAIIERVGAYGIQFREP----DFETLVRSIVYQQLSGRVAKVILDRLVAAVGREV- 66 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQA 205 TP+++ A P ++ LG+ ++ + LA +G L T E+ Sbjct: 67 ---------TPEKILALRPGRMRKLGLSTQKTAYIRDLARHTRDGRLVFTELPALTDEEV 117 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAE 258 ++ L GIG WTA F + + DV D ++ TPA++ A Sbjct: 118 IERLTQVKGIGVWTAQMFLMFALRRHDVLPTGDLGVRNAIRKAYDLAELPTPAEMEELAR 177 Query: 259 RWKPWRSYALLHIWYTEGWQPD 280 W+PW S A ++W + Q D Sbjct: 178 NWRPWCSVASWYLWRSLEGQAD 199 >UniRef50_A5KST9 DNA-3-methyladenine glycosylase II n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KST9_9BACT Length = 239 Score = 176 bits (447), Expect = 7e-43, Method: Composition-based stats. Identities = 56/209 (26%), Positives = 91/209 (43%), Gaps = 21/209 (10%) Query: 80 RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVA 139 ++ + I + LG L AA+ L D F VR+I+ Q VSVA + + ARV Sbjct: 39 QIAAAEVTLAIQDTKLGALIAAQAPLNRLRKGDYFANLVRSIISQQVSVAASRAILARVQ 98 Query: 140 QLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE--GTLPMT 197 G P+R+ A +P+ L+ALG+ +A + LA + G Sbjct: 99 AATGLE------------PKRILALNPEELRALGLSRPKAGYISDLAEHFVREPGIFDHL 146 Query: 198 IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TP 250 ++ + L GIG WTA F + D+F PDD +++ + + Sbjct: 147 ERLADDEVITELTRIKGIGAWTAQMFLMFTLGRLDIFAPDDVGLQRAITRLYGLKEVPSR 206 Query: 251 AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 Q+ AE W+P+R+ A H+W + +P Sbjct: 207 TQLEALAEAWRPYRTVASWHLWESLTHEP 235 >UniRef50_C1A5A1 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A5A1_GEMAT Length = 243 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 50/206 (24%), Positives = 81/206 (39%), Gaps = 14/206 (6%) Query: 79 SRLFDLQCNPQIVNGALGRLGAA-RPGLRLPGCVDA-FEQGVRAILGQLVSVAMAAKLTA 136 RL + LG AA P LP F R I+ Q +S + A + Sbjct: 31 RRLTQAIAELSERDTRLGAAIAAVGPCTLLPRTEGTHFGHLARNIVYQQLSGSAATTIHG 90 Query: 137 RVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM 196 R + L E+ PTP+ + D AL+ G+ + + A+ LA ++G LP+ Sbjct: 91 RFLKHVSAHLGVETEH---PTPESVLGIDDDALRGCGLSVAKVRAIKDLAQHVIDGRLPL 147 Query: 197 TIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM------ 248 ++ + L GIG WTA F + DV D +++ + Sbjct: 148 DRLDVMSDQEIIDALVPVRGIGPWTAQMFLMFRLGRPDVLPVLDLGVRKGAQRIYRTRAL 207 Query: 249 -TPAQIRRYAERWKPWRSYALLHIWY 273 A++ + A+ W+PW S A + W Sbjct: 208 PDAARLEKIAKTWRPWASVASWYCWR 233 >UniRef50_A9RKT9 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RKT9_PHYPA Length = 205 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 44/203 (21%), Positives = 81/203 (39%), Gaps = 18/203 (8%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + + + + L + ++F VR+I+ Q ++V AA + AR+ Sbjct: 7 IAEATKHLLAADANLACVIQKSNSPPFENDGNSFAALVRSIVSQQLAVKAAATIHARLVA 66 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP---MT 197 L G TP +AA L+ G+ ++ L LA+ + G L + Sbjct: 67 LCGGPQKV--------TPAAIAALTAGELRGAGISGRKEVYLHDLADKLVSGALSDEKLM 118 Query: 198 IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TP 250 D + + L GIG W+A+ F + DV D I++ F + Sbjct: 119 AMEDEDDLVTALTAVKGIGVWSAHMFMIFHLHRPDVLPVGDLGIRKGFQKLFHLKHLPCA 178 Query: 251 AQIRRYAERWKPWRSYALLHIWY 273 ++ + A+ W+P+RS A ++W Sbjct: 179 EEMHKLADSWRPYRSLASWYLWQ 201 >UniRef50_B2SXP8 HhH-GPD family protein n=39 Tax=Betaproteobacteria RepID=B2SXP8_BURPP Length = 349 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 50/226 (22%), Positives = 88/226 (38%), Gaps = 23/226 (10%) Query: 66 GLEPVAAECLAKMSRLFDLQ---CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 + A +++R + + L +L + L D F R+++ Sbjct: 132 AVPVQIAGLTPEVTRPAYWDKACADLVKRDRILKKLIPKFGPVHLLSRGDPFVTLARSVV 191 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ +SVA A + A+V + + PQ+ + L G+ ++AE + Sbjct: 192 GQQISVASAQAVWAKVEAACPKLV-----------PQQFIKLGLEKLTTCGLSKRKAEYV 240 Query: 183 IHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 + LA + G L + + E + L GIGRWTA F + DV DD Sbjct: 241 LDLAQHFVSGALHVGKWTSMEDEAVIAELTQIRGIGRWTAEMFLIFNLSRPDVLPLDDLG 300 Query: 241 IKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 + + +T ++ R A W+PWR+ A ++W + P Sbjct: 301 LIRAISVNYFSGEPVTRSEAREVAANWEPWRTVATWYMWRSLDPLP 346 >UniRef50_A9BVD9 HhH-GPD family protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BVD9_DELAS Length = 328 Score = 173 bits (440), Expect = 4e-42, Method: Composition-based stats. Identities = 61/233 (26%), Positives = 92/233 (39%), Gaps = 21/233 (9%) Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIV---NGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 AG++ V A L +M L + + LG L A + GL +P +E A+ Sbjct: 92 AGMDDVLAAMLRRMFGLSQDVGEFERRFGRHARLGPLLARQRGLHVPAACTPWEALSWAV 151 Query: 122 LGQLVSVAMAAKLTARVAQLYGERL-------DDFPEYICFPTPQRLAAADPQALKALGM 174 GQ +SVA A L R+ G+ + D + C P +LA + L+A G Sbjct: 152 TGQQISVAAAVSLRRRLIAAAGQPVALHDGHADAPQQLWCMPEAAQLAQLGEEDLRAAGF 211 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDV-----EQAMKTLQTFPGIGRWTANYFALRGWQ 229 + L LA A G LP+ + + + L GIG WT NY LRG+ Sbjct: 212 SRSKTHTLRLLAQAVQSGELPLDDWAALPELPVAEIRERLLALKGIGPWTVNYMLLRGYG 271 Query: 230 AKDVFLPDDYLIKQRFP------GMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 D L D +++ M Q + + PWR+ H+W + Sbjct: 272 HLDGPLHGDVAVRRALALLLKTDAMDAVQTELWLRDFAPWRALVAAHLWASLS 324 >UniRef50_D1Z1B8 Putative DNA glycosidase n=1 Tax=Methanocella paludicola SANAE RepID=D1Z1B8_METPS Length = 303 Score = 173 bits (440), Expect = 5e-42, Method: Composition-based stats. Identities = 64/283 (22%), Positives = 108/283 (38%), Gaps = 23/283 (8%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLS---- 64 P+ + L R + ++ Y R L ++ + D +++S Sbjct: 13 PFRLDLTVWALRRRKSNIIDRWDGRRYTRILLFKNAPVRISIVQDSPEKAPELSMSLEGD 72 Query: 65 -AGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQG 117 + + + + L + + + +G L G++ P FE Sbjct: 73 KESADRAREPMIRLVKEMLGLDLDLRPFYALTKNDVVIGGLVRQFCGVKPPRFPTIFEAL 132 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 + AI Q VS+ + L R+A+ YG DD FP P+ LA+ + +K LG + Sbjct: 133 LNAIACQQVSLDVGIILLDRLAERYGRAFDDEA---AFPAPEGLASIPVEEIKKLGFSYQ 189 Query: 178 RAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +A A+ LA A G + ++A+K L T GIGRW+A Y LRG D F Sbjct: 190 KARAIKELAAAIASGNASLERVYRMSDQEAIKYLSTLRGIGRWSAEYVLLRGLGRLDSFP 249 Query: 236 PDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHI 271 DD + + +I+ RW P+ H+ Sbjct: 250 ADDIGARNNLQRLFHLDHKPGYGEIKELTSRWHPYEGLVYFHL 292 >UniRef50_A9T041 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T041_PHYPA Length = 178 Score = 172 bits (437), Expect = 1e-41, Method: Composition-based stats. Identities = 44/177 (24%), Positives = 78/177 (44%), Gaps = 18/177 (10%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F R+I+ Q +S A + R+ + G + TP +AA + L+A+G Sbjct: 9 FTALARSIVYQQISGKAACAIYCRLISICG--------GLESVTPPVIAALTVEELRAVG 60 Query: 174 MPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + ++ L LA G L + + +K L GIG W+A+ F + + Sbjct: 61 ISGRKGLYLHDLAEKFTSGLLSEAKLIIMNEDDLVKALTAVKGIGVWSAHMFMIFYLRKP 120 Query: 232 DVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY-TEGWQPD 280 DV D I++ F + +PA+++ A W+P+R+ A ++W T+ PD Sbjct: 121 DVLPVGDLAIRKAFQKLYHLNQLPSPAEMQELAFPWRPYRTLASWYLWRMTDNMLPD 177 >UniRef50_UPI00016C4C1A DNA-3-methyladenine glycosylase II n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4C1A Length = 227 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 56/208 (26%), Positives = 85/208 (40%), Gaps = 24/208 (11%) Query: 89 QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY------ 142 ++NG +GR+G P L +P D F VR ++GQ +S A + R+A+ Sbjct: 18 PVMNGLIGRVG---PCLLMPRGEDPFTLLVRCVIGQQISTKAAESIYNRLARAVNPPPEG 74 Query: 143 -----GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT--LP 195 G L + P +LAA K G+ + L + A LP Sbjct: 75 PHPADGTSLAMWQREGIMPM-DKLAALSEAEFKECGVSGPKQRTLRAVVEHARANPDLLP 133 Query: 196 MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM------- 248 D + + L GIG WT + + L G DV DY IK + Sbjct: 134 SIAGLDDDTIRERLTVIKGIGPWTVDMYLLFGLGRPDVLSVGDYGIKVAVKNLFRLRKLP 193 Query: 249 TPAQIRRYAERWKPWRSYALLHIWYTEG 276 PA++ R A+ W+P+RS AL ++W + Sbjct: 194 DPAKLTRVAKPWQPYRSVALWYLWRSLD 221 >UniRef50_Q2SX77 DNA-3-methyladenine glycosylase n=60 Tax=Betaproteobacteria RepID=Q2SX77_BURTA Length = 312 Score = 171 bits (434), Expect = 2e-41, Method: Composition-based stats. Identities = 48/220 (21%), Positives = 87/220 (39%), Gaps = 20/220 (9%) Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQL 125 ++ AE +A+ + + L +L L D+F R+++GQ Sbjct: 98 PVQLSDAETVARPPYWDKACADLVKRDRILKKLIPKFGPAHLVKRGDSFVTLARSVVGQQ 157 Query: 126 VSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHL 185 +SVA A + ++ + P ++ + L A G+ +++E ++ L Sbjct: 158 ISVAAAQSVWVKIETACPKL-----------APPQIIKLGQEKLIACGLSKRKSEYILDL 206 Query: 186 ANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ 243 A + G L + D E + L GIGRWTA F + DV DD + + Sbjct: 207 AQHFVSGALHVDKWASMDDEDVIAELTQIRGIGRWTAEMFLIFNLSRPDVLPLDDLGLIR 266 Query: 244 RF-------PGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 +T ++ R A W+PWR+ A ++W + Sbjct: 267 AISVNYFSGEPVTRSEAREVAANWEPWRTVATWYMWRSLD 306 >UniRef50_Q9LN45 F18O14.25 n=22 Tax=Magnoliophyta RepID=Q9LN45_ARATH Length = 1314 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 73/192 (38%), Gaps = 17/192 (8%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + L L P F +R IL Q +++ + R L G Sbjct: 149 ADPLLAALIDVHPPPTFESFKTPFLALIRNILYQQLAMKAGNSIYTRFVSLCGGE----- 203 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKT 208 P+ + + +PQ L+ +G+ ++A L LA G L + D + Sbjct: 204 ---NLVVPETVLSLNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAILNMDEKSLFTM 260 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWK 261 L GIG W+ + F + DV +D +++ + P+Q+ ++ +W+ Sbjct: 261 LTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLLYGLDDLPRPSQMEQHCAKWR 320 Query: 262 PWRSYALLHIWY 273 P+RS ++W Sbjct: 321 PYRSVGSWYMWR 332 >UniRef50_D1HE56 Whole genome shotgun sequence of line PN40024, scaffold_1.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HE56_VITVI Length = 351 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 43/196 (21%), Positives = 74/196 (37%), Gaps = 17/196 (8%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + + L L P F ++IL Q ++ + R L G Sbjct: 124 HLRNADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKAGTSIYTRFVGLCGGEA 183 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQ 204 P+ + A P L+ +G+ ++A L LA G L T I D + Sbjct: 184 GVL--------PETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGIITMDDKS 235 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYA 257 L GIG W+ + F + DV +D +++ + P+Q+ + Sbjct: 236 LFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPSQMEQLC 295 Query: 258 ERWKPWRSYALLHIWY 273 E+W+P+RS A +IW Sbjct: 296 EKWRPYRSVASWYIWR 311 >UniRef50_B4X1U6 Base excision DNA repair protein, HhH-GPD family n=1 Tax=Alcanivorax sp. DG881 RepID=B4X1U6_9GAMM Length = 292 Score = 170 bits (431), Expect = 5e-41, Method: Composition-based stats. Identities = 65/293 (22%), Positives = 109/293 (37%), Gaps = 19/293 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAA--RAVSSVETVADSYYARSLAVGEYRGVVTA---IPDIAR 56 TL P + L F A+S E V + +++ + ++ P R Sbjct: 4 LTLALPPHFSVPAFLDFHGRDQHAIS--EQVESNVLRKAVTLDGRPCLLALDFNQPGQVR 61 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 T L L + + LG+L + GLR+P FE Sbjct: 62 ATARTLTRTALSRQTRAMLGLDQAVHTFEQAVTGQATPLGQLVDRQRGLRVPQSATPFEA 121 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 AI+GQ +SV+ A + R QL G+ C+P + D AL+++G Sbjct: 122 LSWAIIGQQISVSAATAIRRRFIQLAGQ--TRISGLHCYPDAAAVNQLDASALRSVGFSA 179 Query: 177 KRAEALIHLANAALEGTL---PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +AE L+ ++ E L + D + + L G+G W+ NY LRG+ D Sbjct: 180 SKAETLLTVSLCCCEHALLPDALHSVADAQSTEQALLGIRGLGPWSVNYTLLRGYGYLDG 239 Query: 234 FLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 L D +++ + T + + + PWR+ H+W + Q Sbjct: 240 SLHGDVAVQKALQQLLAMKARPTAKATQDWLAAFTPWRALVAAHLWQSLQVQA 292 >UniRef50_C2AV46 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2AV46_TSUPA Length = 216 Score = 170 bits (430), Expect = 6e-41, Method: Composition-based stats. Identities = 62/209 (29%), Positives = 95/209 (45%), Gaps = 15/209 (7%) Query: 76 AKMSRLFDL-QCNPQIVNGAL------GRLGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 A++ + ++ +P V+ AL L AA PG+RL GCVD E +R ++GQ +S+ Sbjct: 5 ARVDNVIEILDADPLTVDEALSTDPRLAPLVAATPGIRLFGCVDPAELLLRTMIGQQISI 64 Query: 129 AMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALGMPLKRAEALIHLA 186 A A AR+ + GE +DD + FP+P +A + L P R A+ +A Sbjct: 65 AAATTHQARLVEALGEPVDDPTGRVSRAFPSPAVVAERGHEVLTG---PRARVTAIRSVA 121 Query: 187 NAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP 246 EG L + +A L G+G WTA+Y A+R D L D ++ + Sbjct: 122 VEIAEGRLTLHPGMTRAEARDVLLRLSGVGPWTADYVAMRLLADPDTLLSSDLVVAKGAA 181 Query: 247 GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + + W PW SY LH+W Sbjct: 182 ALD---LDIATNHWSPWGSYVSLHLWNHS 207 >UniRef50_C1D8D7 HhH-GPD family protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D8D7_LARHH Length = 208 Score = 170 bits (430), Expect = 8e-41, Method: Composition-based stats. Identities = 50/214 (23%), Positives = 90/214 (42%), Gaps = 20/214 (9%) Query: 75 LAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKL 134 +A+ + + + + RL A+ P L + FE +RAI+GQ +SV A + Sbjct: 1 MARPAWWDNACAGLAAADPVMARLIASWPDAELVSRGEPFETLLRAIVGQQISVRAADAV 60 Query: 135 TARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL 194 R++ + + P+P+R+ A + L++ G+ ++ LA +G + Sbjct: 61 WKRLSAVLSGQ----------PSPERVLALPEEVLRSAGLSARKVLYARDLAECFTDGRV 110 Query: 195 P--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP-- 250 D E + L GIGRWTA + + DV+ DD +++ Sbjct: 111 NPAAHAGLDDEALIAELVAVRGIGRWTAEMYLIFNQLRPDVWPVDDIGLQRAMARHYALE 170 Query: 251 ------AQIRRYAERWKPWRSYALLHIWYTEGWQ 278 Q+R ER+ PWR+ A ++W + Q Sbjct: 171 DQKASLTQLRVMGERFAPWRTVATWYLWRSLDPQ 204 >UniRef50_A2QHV8 Contig An04c0070, complete genome n=10 Tax=Eurotiomycetidae RepID=A2QHV8_ASPNC Length = 412 Score = 169 bits (429), Expect = 9e-41, Method: Composition-based stats. Identities = 47/219 (21%), Positives = 83/219 (37%), Gaps = 28/219 (12%) Query: 83 DLQCNPQIVNGALGRLGAARPG--LRLPGCV---DAFEQGVRAILGQLVSVAMAAKLTAR 137 + + L L +P G D F V +I+GQ VS A A + + Sbjct: 183 KAAAHLIATDPRLESLIREQPCPLFTPEGLAEEIDPFRSLVSSIIGQQVSGAAAKSIKDK 242 Query: 138 VAQLY--GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP 195 L+ + +D FPTP+ + D L+ G+ ++AE + L+ G L Sbjct: 243 FVALFKTNNKDEDGTRPSFFPTPEEIIKMDISTLRTAGLSQRKAEYIHGLSEKFANGELS 302 Query: 196 --MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQR--------- 244 M + E+ ++ L G+G+W+ FA + DVF D +++ Sbjct: 303 ARMLLNASDEELVEKLTAVRGLGKWSVEMFACFALKRIDVFSTGDLGVQRGCAVFVGKDV 362 Query: 245 ----------FPGMTPAQIRRYAERWKPWRSYALLHIWY 273 F M + A ++ P+RS + ++W Sbjct: 363 NKLKGKGGGKFKYMPEKDMLELAAKFAPYRSLFMWYMWR 401 >UniRef50_D1C1F2 HhH-GPD family protein n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C1F2_SPHTD Length = 319 Score = 168 bits (427), Expect = 1e-40, Method: Composition-based stats. Identities = 60/274 (21%), Positives = 103/274 (37%), Gaps = 20/274 (7%) Query: 8 PPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP----DIARHTLHINL 63 PP+ + L R + V+ Y R L V V D + + Sbjct: 16 PPFRLDLTVWVLRRRPDNVVDRWDGRTYRRVLPVNGQPIEVAVTQTGPVDSPCLHVVASG 75 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFEQG 117 E V E + R + + AL L G + +E Sbjct: 76 PGADETVVPELRRTLMRTLGTGVDLSGFSRLAAGDPALAELADRLRGAKPTRYPTVYEAL 135 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPL 176 V AI Q +++ ++ +R+AQ G ++ D ++ FP P+ + P L+ LG Sbjct: 136 VNAIACQQITLTFGLRILSRLAQECGMTIERDGETHVAFPRPEDVLTVSPDRLRELGFSR 195 Query: 177 KRAEALIHLANAALEGTLPMTIPGD--VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++A A++ L+ ++G+L + D + AM+ L G+GRWTA Y LRG +F Sbjct: 196 QKARAVLELSERLVDGSLDLEPLEDLPDDAAMERLLALRGVGRWTAEYVLLRGLGRVHIF 255 Query: 235 LPDDYLIKQRF-------PGMTPAQIRRYAERWK 261 DD + + ++R W+ Sbjct: 256 PGDDVGGRNNLRRWLGIEEALDYDGVQRVLGAWR 289 >UniRef50_Q0BSG3 DNA-3-methyladenine glycosylase II n=12 Tax=Proteobacteria RepID=Q0BSG3_GRABC Length = 255 Score = 168 bits (427), Expect = 1e-40, Method: Composition-based stats. Identities = 51/203 (25%), Positives = 78/203 (38%), Gaps = 19/203 (9%) Query: 86 CNPQIVNGALGRLGAA--RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 + + AL L P L + FE +RAI Q + A + AR L+ Sbjct: 35 AHLARQDKALSALITRVGPPRLTISLEQSPFEALIRAIAHQQLHARAAEAILARFLALF- 93 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--- 200 P FP+P + A D + L+ G + AL + AA G +P Sbjct: 94 ------PVNTDFPSPLEIMALDTETLRQCGFSGTKIIALRGVCEAAQGGIIPDRSGCTAL 147 Query: 201 DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGMT----PAQI 253 D E ++ L T GIGRWT + D+ DD+ ++ + G+ P + Sbjct: 148 DDETLIQRLTTLRGIGRWTVEMLMIFTLGRTDILPVDDFGVREGWRLIKGLESQPRPKIL 207 Query: 254 RRYAERWKPWRSYALLHIWYTEG 276 + W PWRS A ++W Sbjct: 208 ADIGQSWSPWRSLAAWYLWRAAD 230 >UniRef50_Q92383 DNA-3-methyladenine glycosylase 1 n=1 Tax=Schizosaccharomyces pombe RepID=MAG1_SCHPO Length = 228 Score = 168 bits (427), Expect = 2e-40, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 77/191 (40%), Gaps = 18/191 (9%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 L +L R + +E+ +RA+ Q + A + R Sbjct: 32 LVKLVGNYRPNRSMEKKEPYEELIRAVASQQLHSKAANAIFNRF--------KSISNNGQ 83 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV---EQAMKTLQT 211 FPTP+ + D + ++A G ++ ++L +A A + G +P + E+ ++ L Sbjct: 84 FPTPEEIRDMDFEIMRACGFSARKIDSLKSIAEATISGLIPTKEEAERLSNEELIERLTQ 143 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWR 264 GIGRWT + DV DD I+ + + T + +++E P+R Sbjct: 144 IKGIGRWTVEMLLIFSLNRDDVMPADDLSIRNGYRYLHRLPKIPTKMYVLKHSEICAPFR 203 Query: 265 SYALLHIWYTE 275 + A ++W T Sbjct: 204 TAAAWYLWKTS 214 >UniRef50_B6JZD7 DNA-3-methyladenine glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JZD7_SCHJY Length = 268 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 43/222 (19%), Positives = 81/222 (36%), Gaps = 22/222 (9%) Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 N++ +P A + L + + + V A+G R+ +E +RA+ Sbjct: 42 NITNPGQPTADQLAKAEEHLASIDEHWKRVVEAIG-----HTSFRVEKVRQPYEALIRAV 96 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEA 181 Q ++ + R+ FPTP+ + A + + LK+ G ++ + Sbjct: 97 AYQQLTTKAGKAIINRLVAK-------ASATGGFPTPEEILALEQEQLKSCGFSRRKTDT 149 Query: 182 LIHLANAALEG---TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 + +A G +L E+ ++ L GIGRWTA + G DV D Sbjct: 150 IREIARGVETGLIPSLDAAHEMVNEELIERLSQIHGIGRWTAEMLLIFGMGRLDVLPAGD 209 Query: 239 YLIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHIWY 273 I+ F + + P+RS A +++ Sbjct: 210 LKIRDGFRYLYAMDKLPSLRETNELGCACAPYRSIATWYLYR 251 >UniRef50_D1ZEJ1 Whole genome shotgun sequence assembly, scaffold_22 n=6 Tax=Leotiomyceta RepID=D1ZEJ1_SORMA Length = 415 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 50/247 (20%), Positives = 88/247 (35%), Gaps = 55/247 (22%) Query: 86 CNPQIVNGALGRLGAARPGL-----RLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + V+ + L P L +D FE V +I+ Q VS A A + + Sbjct: 163 AHLIAVDPRMKPLIDKHPCRIFSPEGLAEQIDPFESLVSSIISQQVSGAAAKSIKGKFVA 222 Query: 141 LY-------------------GERLDDFPEY----ICFPTPQRLAAADPQALKALGMPLK 177 L+ G +D P FPTP + D L+ G+ + Sbjct: 223 LFDDPSLDQDQDDEDGKDTPPGHPAEDQPSSKRRKRRFPTPSLVLQKDLPTLRTAGLSQR 282 Query: 178 RAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +AE + LA+ G L ++ ++ + L G+G WT FA + DVF Sbjct: 283 KAEYIHGLASKFASGELSASLLASAPYDELVSKLVAVRGLGLWTVEMFACFALKRMDVFS 342 Query: 236 PDDYLIKQRFPG-------------------------MTPAQIRRYAERWKPWRSYALLH 270 D +++ M+ +++ +ER++P+RS + + Sbjct: 343 LGDLGVQRGMAAFVGRDVKKLKNGNGKGNGKDKKWKYMSEGEMKEISERFRPYRSLFMWY 402 Query: 271 IWYTEGW 277 +W E Sbjct: 403 MWRVEET 409 >UniRef50_B4CYJ1 DNA-3-methyladenine glycosylase II n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYJ1_9BACT Length = 214 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 47/179 (26%), Positives = 75/179 (41%), Gaps = 18/179 (10%) Query: 105 LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA 164 L+ F VRA+ Q ++ A + R L+ FPT + LA+ Sbjct: 28 LQPEKDHSPFRALVRAVAHQQLNGTAAETILRRFCALF--------PGKKFPTAKDLASV 79 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLPMTIP---GDVEQAMKTLQTFPGIGRWTAN 221 +AL+ G + AL +A L+GT+P T + + ++ L G+GRWT Sbjct: 80 TDEALRGSGFSWAKIAALRDIAAKTLDGTIPSTRAIQKMNDAEIIERLVQVRGVGRWTVE 139 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 + DVF DD+ I+ F P +I +AERW+P+ + A + W Sbjct: 140 MMLIFKLGRPDVFPADDFGIRDGFRVAYGLDEMPKPKEILAHAERWRPYATTAAWYFWR 198 >UniRef50_B8GAB8 DNA-3-methyladenine glycosylase II n=3 Tax=Chloroflexus RepID=B8GAB8_CHLAD Length = 199 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 51/192 (26%), Positives = 85/192 (44%), Gaps = 20/192 (10%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 V+ L L +F AI+ Q +S+ A + R+ L GE Sbjct: 11 VDPVLAPWIDQIGSFALQRQPHSFATLAYAIISQQLSLNAARAIRDRLTTLLGEL----- 65 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKT 208 TP+++ AAD AL+A G+ ++++ L LA + G + + + D E A+ Sbjct: 66 ------TPEQILAADTTALRAAGLSMQKSGYLRDLAERIVYGQINLELLPTLDDETAIAM 119 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK-------QRFPGMTPAQIRRYAERWK 261 L GIGRWTA + + D+ DD ++ Q ++P ++R ERW+ Sbjct: 120 LTNVRGIGRWTAEIYLMFALNRLDILPADDLGLRDGARLVYQLPQILSPRELRALGERWR 179 Query: 262 PWRSYALLHIWY 273 P+RS A ++W Sbjct: 180 PYRSIACWYLWQ 191 >UniRef50_C7PK12 HhH-GPD family protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PK12_CHIPD Length = 206 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 40/175 (22%), Positives = 79/175 (45%), Gaps = 21/175 (12%) Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 + +I+ Q +SV +A + R LY D E P Q++ P+ L+++G+ Sbjct: 39 LIGSIMSQQLSVKVATVIYTRFLALY-----DGKE----PNAQQILDTPPETLRSIGLSN 89 Query: 177 KRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 + + ++A +E L + D E+ +K L G+GRWT + +DVF Sbjct: 90 AKVSYVHNVARFTVEEKLTDKKLLQMDDEEVIKYLTQIKGVGRWTVEMLLMFYLCREDVF 149 Query: 235 LPDDYLIKQRFPGMTP----------AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DD ++Q + ++ + +++W P+R+YA ++W + +P Sbjct: 150 AIDDLGLQQAMIKLYKLDNTDKKAFREKLLKISKKWSPYRTYASRYLWAWKDMKP 204 >UniRef50_B8IZY6 HhH-GPD family protein n=8 Tax=Bacteria RepID=B8IZY6_DESDA Length = 235 Score = 166 bits (422), Expect = 6e-40, Method: Composition-based stats. Identities = 43/193 (22%), Positives = 79/193 (40%), Gaps = 19/193 (9%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L +R D F + +I+GQ +S A + R+ + + Sbjct: 21 DPVLAAAMEEIGHIRREVTPDIFNALLNSIVGQQISTKAQATIWKRMREQF--------- 71 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQAMKTL 209 C TP+ + ++L+ G+ +++A + + A L+G+L + ++ L Sbjct: 72 --CPITPENIGTISAESLQTCGISMRKAAYIKSITEAVLDGSLDLARLPSLTDKEICAQL 129 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPW 263 GIG WTA + Q D+ DD I++ +TPA RY +R+ P Sbjct: 130 VQLKGIGVWTAEMIMIFSMQRPDILSWDDLAIQRGLRMLYRHRQITPALFARYRKRYSPH 189 Query: 264 RSYALLHIWYTEG 276 + A L++W G Sbjct: 190 ATTASLYLWAIAG 202 >UniRef50_A9EU33 Methylated-DNA--protein-cysteine methyltransferase n=31 Tax=Bacteria RepID=A9EU33_SORC5 Length = 395 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 63/254 (24%), Positives = 97/254 (38%), Gaps = 22/254 (8%) Query: 36 ARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQC---NPQIVN 92 R LA G G +A + + + G A E FD + + + Sbjct: 142 HRVLAAGGKAGGFSANGGVTTKLRLLAIEGGQARGAPEAAPGGDLGFDPGAAVEHLRASD 201 Query: 93 GALGRLGAARP--GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 AL R+ A +R+ F +I+ Q ++ AA + ARV L+ P Sbjct: 202 AALARVIDAVGPFAMRIDRTSSLFLALAESIVYQQLTGKAAATIFARVRALF-------P 254 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI---PGDVEQAMK 207 PTP +L A + L+ G+ + AL LA +G LP + E ++ Sbjct: 255 RAHEGPTPAQLLRASDEKLRGAGLSQAKLLALRDLARKTEDGELPTLAEVHGMEDEAIIE 314 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERW 260 L GIGRWT + DV DDY I++ F A + + RW Sbjct: 315 RLTRVRGIGRWTVEMLLMFRLGRPDVLPVDDYGIRKGFALAFKRPEPPARADLEKRGARW 374 Query: 261 KPWRSYALLHIWYT 274 KP+R+ A ++W Sbjct: 375 KPYRTVASWYLWRA 388 >UniRef50_C6WJ98 Transcriptional regulator, AraC family n=5 Tax=Actinomycetales RepID=C6WJ98_ACTMD Length = 584 Score = 165 bits (419), Expect = 1e-39, Method: Composition-based stats. Identities = 80/290 (27%), Positives = 114/290 (39%), Gaps = 30/290 (10%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L ++ P + G L A AV VE D Y R+L + GVV+ P Sbjct: 300 VLRLPFRGPLHAPSLFGPLVANAVPGVEEWRDGAYRRTLRLPRGHGVVSLRPRADHVECD 359 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + L+ +++ R DL +P V + AL L A PG R+PG VD Sbjct: 360 LTLT--DSRDLPVAISRCRRALDLDADPAEVDGALRADPALRPLVDAAPGTRVPGVVDGA 417 Query: 115 EQGVRAILGQLVSVAMAAKLTA------RVAQLYGERLDDFPE---YICFPTPQRLAAAD 165 E VRA+LG+ A A RV + GE + D FPTPQ L D Sbjct: 418 ECAVRALLGEGTGTGAAMGAGANAGWAHRVVREAGEAVPDPAGGGLTHLFPTPQALLDLD 477 Query: 166 PQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFAL 225 P L A A + AL G + + D +A L+ G + Sbjct: 478 PALLPPP------ARAPLTALLTALVGGVDLGAGADRAEARSALRC---AGERVLDAVLT 528 Query: 226 RGWQAKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHI 271 R D F PDD ++ G+ T A + + W+PWR+YA ++ Sbjct: 529 RSLGDPDGFCPDDPAVRAAAGGIGLPVTAAALADRSRAWRPWRAYATRYL 578 >UniRef50_D1RHI7 HhH-GPD family base excision repair protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RHI7_LEGLO Length = 263 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 53/242 (21%), Positives = 92/242 (38%), Gaps = 17/242 (7%) Query: 47 VVTAIPDIARHTLHINLSAGLEP-------VAAECLAKMSRLFDLQCNPQIVNGALGRLG 99 VV + L ++L+ + P E + ++R + L L Sbjct: 11 VVEQTNSLNNPELLVSLNEPVHPLVQEKIKHLIEMMLGLNRDLTGFYKMAKDDVRLDPLV 70 Query: 100 AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-YICFPTP 158 G++ P FE + AI Q +S+ + R+ Q G +++ + Y FPT Sbjct: 71 FQFMGVKPPCFPSFFEALINAISCQQISLDAGLHIQNRLVQHIGMKMNHENQVYYAFPTA 130 Query: 159 QRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQAMKTLQTFPGIG 216 + + LK +G ++E ++ LA+ E + D E+ ++ L F GIG Sbjct: 131 EDVGHCSVAELKKIGYSTHKSETIVSLASMLKEEHSFLNRLEDKPTEEVIQLLCQFKGIG 190 Query: 217 RWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALL 269 RWTA Y LRG +VF DD + + + + W P+ Sbjct: 191 RWTAEYVLLRGLGRIEVFPGDDIGAQNNLQKLLHLEDKLDYKKTSKITALWHPYAGLVYF 250 Query: 270 HI 271 H+ Sbjct: 251 HL 252 >UniRef50_B0U6C0 DNA-3-methyladenine glycosidase n=16 Tax=Xanthomonadaceae RepID=B0U6C0_XYLFM Length = 226 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 54/210 (25%), Positives = 81/210 (38%), Gaps = 24/210 (11%) Query: 85 QCNPQIVN--GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY 142 C+P + LG L A R G R P + + RAIL Q +S A+ + AR+ + Sbjct: 19 HCDPGLSGWMQRLGPLPALR-GWRQP--FNVVDALARAILFQQLSGKAASTIVARIEAVI 75 Query: 143 GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV 202 G + LA D L+A G+ + AL L + G LP Sbjct: 76 GSTCLY---------AETLACIDDACLRACGVSSNKILALRDLTRREVAGELPSVRQMGA 126 Query: 203 ---EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQ 252 ++ L GIGRWT + DV DD +++ + TP Sbjct: 127 MHHNTIVEKLIPIRGIGRWTVEMMLMFRLGRPDVLPVDDLGVRKGIQRVDTLAFVPTPKA 186 Query: 253 IRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 + E W P+R+YA L++W + E Sbjct: 187 LCTRGECWAPYRTYAGLYLWRIADFHEGEG 216 >UniRef50_B1YMD5 HhH-GPD family protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YMD5_EXIS2 Length = 273 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 58/271 (21%), Positives = 110/271 (40%), Gaps = 17/271 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 YD++ + L + VE D + + E + +R L LS Sbjct: 12 YDFAGIRKRLRGDRLQ-VEQ--DGRLFVPVMLPEGNFIGQVEAVGSRELL---LSGEGPQ 65 Query: 70 VAAECLAKMS-RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 L + RL + + +L + + RL ++ F +R+I+ Q +++ Sbjct: 66 EPMVQLLRCRFRLDTTNPSQHLSETSLSEVVSTFGAERLVLDINPFTALIRSIIHQQINL 125 Query: 129 AMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA 188 A A L R + +G + I PT ++L +P+ L+AL + ++ + L+ A A Sbjct: 126 AFAQVLMERFCRTFGTEQNGV---IFPPTAEQLVNVEPEQLRALQLSGRKVDYLLGAARA 182 Query: 189 ALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 A++ P +TL G+G WT + G+ +D+F D I + F + Sbjct: 183 AIDFERLTEAP--DATIAETLIALKGVGPWTVQNVLMFGYGREDLFPASDIGILRAFERL 240 Query: 249 -----TPAQIRRYAERWKPWRSYALLHIWYT 274 + + AE + P+RS+A +W + Sbjct: 241 HGTRPSVEEAVLLAEEFAPYRSHAAYLLWRS 271 >UniRef50_B2B817 Predicted CDS Pa_2_12990 n=8 Tax=Leotiomyceta RepID=B2B817_PODAN Length = 428 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. Identities = 47/186 (25%), Positives = 76/186 (40%), Gaps = 28/186 (15%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLY---GERLDDFPEYICFPTPQRLAAADPQA 168 D FE I+ Q VS A A + R L+ + E FPTP + + Sbjct: 246 DPFESLASGIISQQVSGAAAKAIKNRFISLFYPGNDTTTTTHEKKKFPTPADVIGKSIET 305 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALR 226 L+ G+ ++AE L+ LA + G L + D E+ ++ L G+GRW+ FA Sbjct: 306 LRTAGLSQRKAEYLLGLAQKFVSGELTAQMLADAPYEEVLEKLIAVRGLGRWSVEMFACF 365 Query: 227 GWQAKDVFLPDDYLIKQRFPG--------------------MTPAQIRRYAERWKPWRSY 266 G + DVF D +++ M+ ++ AE ++P+RS Sbjct: 366 GLKRMDVFSTGDLGVQRGMAAFVGRDVGKLKAKGGGNKWKYMSEREMEEIAEGFRPYRS- 424 Query: 267 ALLHIW 272 L +W Sbjct: 425 --LFMW 428 >UniRef50_C7MP98 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=2 Tax=Bacteria RepID=C7MP98_CRYCD Length = 234 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. Identities = 41/175 (23%), Positives = 70/175 (40%), Gaps = 19/175 (10%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 D F V I+GQ +S A + R+ L GE + Q + A P+ L+ Sbjct: 37 ADLFSAVVHHIIGQQISTAAQQTVWLRMCDLLGEV-----------SAQSITATSPEQLQ 85 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + G+ ++ + + A ++G+ + +A+ L + GIG WTA L Sbjct: 86 SCGISFRKVDYIQDFAEKVMDGSFDLDAIEQASDAEAIAALSSLRGIGTWTAEMLLLFCL 145 Query: 229 QAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 DV DD I++ +T +Y R+ P+ S A L++W Sbjct: 146 GRPDVLSFDDLAIQRGLRMVYHHRKITRPLFEKYRRRYSPYGSVASLYLWAISSM 200 >UniRef50_B9LPN6 HhH-GPD family protein n=4 Tax=Halobacteriaceae RepID=B9LPN6_HALLT Length = 198 Score = 163 bits (413), Expect = 7e-39, Method: Composition-based stats. Identities = 48/193 (24%), Positives = 79/193 (40%), Gaps = 21/193 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + + RL L + D F + +I+ Q +S A AA + R + G Sbjct: 12 DSTMARLIDRHGRLTIEPAADEFARLCTSIVNQQLSTASAAAIHERFVDVLGGA------ 65 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG----DVEQAMK 207 PTP + AAD AL+ G+ + E L A A +G +T G E + Sbjct: 66 ----PTPDDVLAADEVALREAGLSGTKVEYLREAAAAFRDGDRDLTREGFGDASDEAVVA 121 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERW 260 L G+G WTA + + +DV D +++ +T A++R + W Sbjct: 122 ALTEIRGVGEWTARMYLIFALGREDVLPLGDLAVRKGIEQVYNDGAELTRAEMRNIGDAW 181 Query: 261 KPWRSYALLHIWY 273 +P+RSY ++W Sbjct: 182 RPYRSYGTRYVWA 194 >UniRef50_A5WCQ9 HhH-GPD family protein n=2 Tax=Psychrobacter RepID=A5WCQ9_PSYWF Length = 231 Score = 163 bits (412), Expect = 8e-39, Method: Composition-based stats. Identities = 54/219 (24%), Positives = 89/219 (40%), Gaps = 28/219 (12%) Query: 70 VAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVA 129 +E + +L D++ + LG P LR F + +RA++GQ +SVA Sbjct: 29 DLSELEGHIKQLIDIEPRFAPIYQQLG-----VPSLR--RNRGGFRELMRAMVGQQLSVA 81 Query: 130 MAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 A+ + +++ E TP + AD L++ G+ ++ + L Sbjct: 82 AASSIWSKL------------ENAALITPDAIMKADDDTLRSHGLSRQKIRYIRSLVEHD 129 Query: 190 LEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM- 248 ++ +P E + L GIG+WTA + L D+ DD IK + Sbjct: 130 IDFEALAHLP--DEAVISELTAVTGIGKWTAQMYLLFSLGRADILAVDDLAIKVGAMEVL 187 Query: 249 ------TPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 TP Q+ R + W P RS A L +W GW D+ Sbjct: 188 GLDERPTPKQLERLTQSWSPHRSAASLLLWAHYGWLKDQ 226 >UniRef50_C5G8B3 DNA-3-methyladenine glycosylase n=8 Tax=Onygenales RepID=C5G8B3_AJEDR Length = 438 Score = 163 bits (412), Expect = 9e-39, Method: Composition-based stats. Identities = 49/261 (18%), Positives = 86/261 (32%), Gaps = 70/261 (26%) Query: 86 CNPQIVNGALGRLGAARPGLRLPGC-----VDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + V L + P +D F V I+GQ VS A A + + Sbjct: 171 AHLIKVAPQLRPVIEKHPCPLFSPAGLAEEIDPFNSLVSGIIGQQVSGAAAKSIKKKFMA 230 Query: 141 LY----------------------------------------GERLDDFPEYIC----FP 156 L+ GE+++ FP Sbjct: 231 LFRSDGGGGGDCNSINNATATIATTVINDGGAATKNNNNMDAGEKIETAEMRYDRDDDFP 290 Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPG 214 TP ++A D L+ G+ ++AE + LA G L M + E+ ++ L G Sbjct: 291 TPAQVAKCDIATLRTAGLSQRKAEYIQGLAEKFASGELSARMLLQASDEEVLEKLIAVRG 350 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------------------MTPAQIRR 255 +G+W+ F+ G + DVF D +++ M+ ++ Sbjct: 351 LGKWSVEMFSCFGLKRMDVFSTGDLGVQRGMAAFVGRDVSKLKAKGGGKFKYMSEKEMVE 410 Query: 256 YAERWKPWRSYALLHIWYTEG 276 A + P+RS + ++W E Sbjct: 411 VAAPFSPYRSLFMWYMWRIED 431 >UniRef50_Q3INX6 DNA N-glycosylase / DNA lyase n=6 Tax=Halobacteriaceae RepID=Q3INX6_NATPD Length = 203 Score = 162 bits (411), Expect = 9e-39, Method: Composition-based stats. Identities = 51/194 (26%), Positives = 78/194 (40%), Gaps = 19/194 (9%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + L L L + D F + + ++L Q VS+A A R+ Sbjct: 6 DTLRADPVLEPLIERHGALTIEPADDLFRRLLVSVLRQQVSMASAEATKKRLFDA----- 60 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQ 204 PTP + AAD + + G+ ++A L ++A A + P D E Sbjct: 61 -------VEPTPTAVLAADTETFREAGLSRQKATYLHNIAAAFEDHGYDRAYFEPMDDEA 113 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-----MTPAQIRRYAER 259 L G+G WTAN L +DVF D I++ + A + AER Sbjct: 114 VRAELTDITGVGEWTANMQLLFSLGREDVFPVGDLGIRKGMRALLDEDLDRAAMTEAAER 173 Query: 260 WKPWRSYALLHIWY 273 W P+RSYA L++W Sbjct: 174 WAPYRSYASLYLWR 187 >UniRef50_C6CD76 DNA-3-methyladenine glycosylase II n=1 Tax=Dickeya dadantii Ech703 RepID=C6CD76_DICDC Length = 225 Score = 162 bits (410), Expect = 1e-38, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 85/202 (42%), Gaps = 18/202 (8%) Query: 84 LQCNPQIVNGALGRLGAARPGLRLPGCVD--AFEQGVRAILGQLVSVAMAAKLTARVAQL 141 + + ++ RL A +R +E +RA+ Q +S AA + A++ + Sbjct: 13 ARSHLAAIDDRWERLIAGVGHIRFASRPGQQPYEALIRAVASQQLSNRAAAAIIAKLQKQ 72 Query: 142 YGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP---MTI 198 + E FP+P +LA P+ L+ G ++ + + +A A+ G +P Sbjct: 73 FAM------EETGFPSPSQLAECPPEHLRQCGFSSRKIDTVQAIARGAISGLVPDRASAA 126 Query: 199 PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPA 251 + + + L T GIGRWT + + D+ DD I+Q F + + Sbjct: 127 LMEDDTLITQLCTLHGIGRWTVEMLLINTLERMDIMPVDDLGIRQGFRYLYQLPSDPSRK 186 Query: 252 QIRRYAERWKPWRSYALLHIWY 273 ++ + +P+R+ A ++W Sbjct: 187 EMLALSAPCQPYRTLAAWYLWR 208 >UniRef50_A6GQ39 3-methyladenine DNA glycosylase II n=1 Tax=Limnobacter sp. MED105 RepID=A6GQ39_9BURK Length = 217 Score = 161 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 45/198 (22%), Positives = 81/198 (40%), Gaps = 19/198 (9%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + + L A L +E +R+++GQ +SV A + ARV ++ Sbjct: 24 LALQSPVWVELLARHSDRALRSRGAPYETMLRSLVGQQISVKAADAVWARVVDALNGKI- 82 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQA 205 T + L A LKA G+ ++ L+ +G L + + D E Sbjct: 83 ---------TSRALLALSDDTLKATGLSRQKIAYSRALSEFEQQGGLELAVLEGMDDEAC 133 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQIRRYAE 258 + L G+GRWTA F + + DV+ DD +++ + P + ++ E Sbjct: 134 TRHLCAIKGVGRWTAQMFLMFCLRRPDVWPVDDIGVQRGISRQFFEGEPIGPKEALQFGE 193 Query: 259 RWKPWRSYALLHIWYTEG 276 + KPWR+ A ++W + Sbjct: 194 KLKPWRTVAAWYLWRSLD 211 >UniRef50_C1XHZ0 DNA-3-methyladenine glycosylase II n=2 Tax=Meiothermus RepID=C1XHZ0_MEIRU Length = 178 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 40/167 (23%), Positives = 71/167 (42%), Gaps = 16/167 (9%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 +E + +I+GQ +S A + R++ + P+ L A + L+A+ Sbjct: 23 PYEVLLSSIVGQQLSGKAADTIWRRLSSRFALE------------PEVLYRAALEDLRAV 70 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ +A + L+ ALEG L E + L GIG WT F + G D Sbjct: 71 GLSSAKARYVQDLSRFALEGGLQGLEHHSDEALIAHLTQVKGIGVWTVQMFLMFGLGRPD 130 Query: 233 VFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIWYTE 275 V+ D I++ + ++ ER++P+RS+A ++W Sbjct: 131 VWPVLDLGIRKGAQKLYGVIERDELEALGERFRPYRSHAAWYLWRAL 177 >UniRef50_Q5FSB3 DNA-3-methyladenine glycosylase n=1 Tax=Gluconobacter oxydans RepID=Q5FSB3_GLUOX Length = 219 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 51/205 (24%), Positives = 81/205 (39%), Gaps = 19/205 (9%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + V +G LR + ++ +RAI GQ + A A K+ R+ Sbjct: 11 FLGADPDLAAVIARIGPCT-----LRGDNGQEPYDALLRAIAGQQLHGAAARKIFGRLCL 65 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG 200 L + D P P P R+ + + L+A G+ + A+ LA A L+G +P Sbjct: 66 LGAQESVDGPP----PAPGRILSLSEERLRACGLSGNKILAMKGLAQARLDGLVPSRAEA 121 Query: 201 D---VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGMT----P 250 E+ + L T GIGRWT + DV DD+ ++ +R M P Sbjct: 122 SVMTDEELIARLVTLRGIGRWTVEMLLMFTLNRPDVMPVDDFGVREGWRRIRKMDLPPKP 181 Query: 251 AQIRRYAERWKPWRSYALLHIWYTE 275 ++ ER+ P RS + W Sbjct: 182 KALKEETERFAPHRSTLAWYCWRVA 206 >UniRef50_Q0VPN7 Putative uncharacterized protein n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VPN7_ALCBS Length = 291 Score = 161 bits (407), Expect = 3e-38, Method: Composition-based stats. Identities = 50/197 (25%), Positives = 82/197 (41%), Gaps = 12/197 (6%) Query: 93 GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY 152 L L + GLR+P FE AI+GQ +SV+ A + R QL D Sbjct: 97 PPLNTLVQRQRGLRVPQSATPFEALTWAIIGQQISVSAATAIRRRFIQLASPVRHDG--L 154 Query: 153 ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL---PMTIPGDVEQAMKTL 209 C+P + P AL+++G +A+ L+ ++ + L + + EQ + L Sbjct: 155 HCYPDAATVCLLTPDALRSVGFSATKADTLLAVSRLCRDQQLLPETLHLDAYAEQLERNL 214 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKP 262 G+G W+ NY LRG+ D L D +++ + T R + + P Sbjct: 215 LEIRGLGPWSVNYTLLRGYGFLDGSLHADVAVQKALQMLLGQPERPTARVTRDWLADFTP 274 Query: 263 WRSYALLHIWYTEGWQP 279 WR+ H+W + Q Sbjct: 275 WRALVAAHLWQSLSTQA 291 >UniRef50_B4B851 DNA-3-methyladenine glycosylase II n=2 Tax=Cyanobacteria RepID=B4B851_9CHRO Length = 215 Score = 160 bits (404), Expect = 8e-38, Method: Composition-based stats. Identities = 39/174 (22%), Positives = 67/174 (38%), Gaps = 17/174 (9%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 E AI+ Q +S +A K+ R LY E + L + L++ Sbjct: 42 SLLEALAWAIMAQQISTEVANKIYQRFLSLYNESTPLN--------ARNLLQTSDEDLRS 93 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 +G+ + L +LA A E P++ + E +K L GIG WT + Q Sbjct: 94 IGISRYKIGYLKNLARAVEEYLPPLSELATMEDETIIKLLTQIKGIGTWTVQMLLIFRLQ 153 Query: 230 AKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEG 276 D+ D I+ + +P + +WKP+R+ A ++W + Sbjct: 154 RLDILPSGDLGIRMAIKNLYQLPELPSPEIVEAIGHKWKPYRTIAAWYLWRSLS 207 >UniRef50_A5V920 HhH-GPD family protein n=7 Tax=Sphingomonadales RepID=A5V920_SPHWW Length = 368 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 52/204 (25%), Positives = 86/204 (42%), Gaps = 20/204 (9%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVD-AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + A+ R+ A G P D ++ +R I+GQ VSVA A + A++ + G L Sbjct: 169 LAGIEPAIARMLEAI-GYPPPRIRDRGYQTLLRTIVGQQVSVAAANAIWAKMETMVGAGL 227 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT-IPGDVEQA 205 P+ +AAA L+A G+ ++ LA GT+ +P D E+A Sbjct: 228 A----------PEAVAAAPDDLLRATGLSRQKIAYARSLAEHVASGTIDFDRLPADDEEA 277 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAE 258 + + GIGRW+A + L DV+ D ++ + + + RR A Sbjct: 278 IAQMTAIKGIGRWSAEIYLLFAEGRGDVWPAGDLAVQIEVGRLLGLPERPSERETRRLAH 337 Query: 259 RWKPWRSYALLHIWYTEGWQPDEA 282 W P R A + W++ + A Sbjct: 338 GWSPHRGAAAIFAWHSYNARASTA 361 >UniRef50_B7K9B1 HhH-GPD family protein n=3 Tax=Cyanobacteria RepID=B7K9B1_CYAP7 Length = 210 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 66/178 (37%), Gaps = 17/178 (9%) Query: 108 PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQ 167 P E AI+ Q +S +A K+ R Y + T + L + Sbjct: 33 PSNSSLLEALAWAIISQQISTKVANKIYQRFLNFYNDATP--------LTAKNLLNTPEE 84 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFAL 225 L++LG+ + L +LA A + P+ + + + L G+G WTA + Sbjct: 85 DLRSLGISRNKIRYLKNLAKAVEDNLPPLYQLELMEDWEIIHLLTQIKGVGIWTAQMLLI 144 Query: 226 RGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEG 276 D+ D I+ + +P + +WKP+R+ A ++W + Sbjct: 145 FRLNRLDILPSADLGIRTAIKNLYQLPELPSPEIVEAIGYKWKPYRTIASWYLWRSLS 202 >UniRef50_B1ZV80 Transcriptional regulator, AraC family n=2 Tax=Opitutaceae RepID=B1ZV80_OPITP Length = 523 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 64/293 (21%), Positives = 105/293 (35%), Gaps = 40/293 (13%) Query: 19 LAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI----------------- 61 L+ A S E + Y ++ + ++T + + + + Sbjct: 226 LSRDAQSVSERLEGDVYTAAVNLSGGPALLTLRLNPSPVRVEVIPAAGGTGVPPVGSRPP 285 Query: 62 -------NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGA-----LGRLGAARPGLRLPG 109 ++++G+ E A L L+ + L RL A R LR+ Sbjct: 286 TKGERVPDVASGVPTNTLEAHAIAIGLLGLEDDASSFARLARKLGLARLVAGRSELRISR 345 Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQAL 169 F+ V AI+GQ ++ + A L R+ +L G RL + + PTP +A +P L Sbjct: 346 IPSVFDGLVWAIIGQQINFSFACVLKRRLTELAGTRLSNG--LMAPPTPTAIARLEPDEL 403 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 L ++A LI A A G L + +A +TL G G W+ NY +R Sbjct: 404 VPLQFSRQKAGYLITTARAITAGELDLAQLPSMSASRAERTLLALHGFGPWSVNYVMMRA 463 Query: 228 WQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 D D + + RR + P RS A H+W Sbjct: 464 LGFADCVPLGDTGVTSGLQSLLHLEQRPDVDATRRLMAVFSPHRSLATAHLWQ 516 >UniRef50_B6EMH3 DNA repair protein n=2 Tax=Gammaproteobacteria RepID=B6EMH3_ALISL Length = 202 Score = 157 bits (398), Expect = 3e-37, Method: Composition-based stats. Identities = 48/194 (24%), Positives = 79/194 (40%), Gaps = 22/194 (11%) Query: 90 IVNGALGRLGAARPGLRLPGC-VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD 148 +++ L + G P FE + I+ Q +S +AA + R+ L E Sbjct: 15 LIDKDLEAAVKTQ-GYPSPRVNPHGFEAFLSIIVSQQLSTKVAAVIMGRLVALLKEV--- 70 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQAM 206 TP+RL + + Q L+ +G+ ++ E LA A G L + E A+ Sbjct: 71 --------TPERLLSIEEQNLRDVGLSWRKIEYAKGLALAVQSGNLDIDGLESLSDEDAI 122 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAER 259 + + G GRW+A + + +D+F DD + + TP Q R Sbjct: 123 SAITSLKGFGRWSAEIYLMFSLGRQDIFPADDLGVLIALGRLKGLTDKPTPKQAREMVGH 182 Query: 260 WKPWRSYALLHIWY 273 W+PWRS L +W Sbjct: 183 WQPWRSVGSLFLWQ 196 >UniRef50_D0J2I3 HhH-GPD n=6 Tax=Comamonadaceae RepID=D0J2I3_COMTE Length = 274 Score = 157 bits (397), Expect = 5e-37, Method: Composition-based stats. Identities = 53/230 (23%), Positives = 93/230 (40%), Gaps = 28/230 (12%) Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGV 118 +H++L G+ AE ++ R +++ + +LG+ L G AF Sbjct: 62 IHLDLPDGVPAYWAEACRQLMR------RDRVLKRLIPQLGSQA--LLPCGQEQAFATLA 113 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 R+I+GQ +S A L + +L P+++ ++A+G+ ++ Sbjct: 114 RSIIGQQISAKSAKTLWNKFVRLPAAMQ-----------PEQVLRLKVDDMRAVGLSARK 162 Query: 179 AEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 + L+ LA E L M E + L + G+ RWTA F + +V Sbjct: 163 VDYLVDLALHFTENRLHMDEWAQMSDEVIIAELMSIRGLSRWTAENFLIYCLGRPNVLPL 222 Query: 237 DDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DD + Q ++ + R AE WKPW + A +IW + QP Sbjct: 223 DDAGLIQGISLNHFSGDPVSRSDAREVAEAWKPWCTVATWYIWRSLEAQP 272 >UniRef50_Q6BZL7 DEHA2A00418p n=2 Tax=Debaryomyces hansenii RepID=Q6BZL7_DEBHA Length = 300 Score = 156 bits (396), Expect = 6e-37, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 86/208 (41%), Gaps = 28/208 (13%) Query: 92 NGALGRLGAA--RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY---GE-- 144 + AL + P + ++A++ V+ I+ Q +S + A + + +L+ GE Sbjct: 76 DPALSDFIKSCDDPNTLMDVKMNAYQTLVKIIISQQLSTSAARSIMTKFIKLFLKEGEST 135 Query: 145 -RLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP---- 199 F + FPTP+ + P+ L++ G+ ++A L+ ++ + + Sbjct: 136 EPDHQFKAHPHFPTPEIVKETSPERLRSAGISFRKAGYLLIISEKFSDKNYLLNDDKKLN 195 Query: 200 -GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP------------ 246 E + L GIG W + F L + D+F D I++ Sbjct: 196 DMSNEDIARLLIDLKGIGPWAVDIFLLLYMKRSDIFPISDAGIRKGLSMLIQNTSGKKGK 255 Query: 247 ---GMTPAQIRRYAERWKPWRSYALLHI 271 ++ ++ +Y+E WKP+RS A ++ Sbjct: 256 KLNYLSIEEMEKYSENWKPYRSVASWYL 283 >UniRef50_Q9ZET9 DNA-3-methyladenine glycosidase (Fragment) n=1 Tax=Mycobacterium avium subsp. paratuberculosis RepID=Q9ZET9_MYCPA Length = 185 Score = 156 bits (395), Expect = 7e-37, Method: Composition-based stats. Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 14/174 (8%) Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKALGMPL 176 RA+LGQ VS+ A R+ YG + D FP+ Q+LA DP L +P Sbjct: 1 RAVLGQQVSIRAARTHAGRLVAAYGRAVHDPEGTLTHTFPSVQQLADVDPIHLA---VPK 57 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 R L L + ++ + D + A L PG+G WTA A+RG D F Sbjct: 58 ARQRTLAALVAGLADRSIVLDTGCDWQSARTQLLALPGVGPWTAEVIAMRGLGDPDAFPA 117 Query: 237 DDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIWYTEG-----WQPDE 281 D ++ + + + RW+PWRSYA ++W T W P + Sbjct: 118 ADLGLRVAAKRLGLPSGQRSLTAASARWRPWRSYATQYLWTTLEHPVNHWPPQQ 171 >UniRef50_C8WLI9 HhH-GPD family protein n=4 Tax=Bacteria RepID=C8WLI9_EGGLE Length = 219 Score = 156 bits (395), Expect = 7e-37, Method: Composition-based stats. Identities = 43/197 (21%), Positives = 75/197 (38%), Gaps = 19/197 (9%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + LG A + D F V I+GQ ++ + R+ + +GE Sbjct: 18 LAARDPRLGEAMAVIGRIEREVHPDLFAALVNCIVGQQIATKAQTTIWNRMLERFGEV-- 75 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQA 205 TP+ +AA L+ +G+ ++ + A L G + + + ++ Sbjct: 76 ---------TPEAMAACSDDELQQVGISFRKVGYIKGAAARVLSGEVDLEGLAELSDDEV 126 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAER 259 +TL PGIG WTA Q ++ D I + +TP +Y R Sbjct: 127 CRTLSALPGIGVWTAEMLMTFSMQRPNILSWGDLAIHRGLRMVHHHRRITPELFAKYRRR 186 Query: 260 WKPWRSYALLHIWYTEG 276 + P+ S A L++W G Sbjct: 187 YTPYGSVASLYLWEVAG 203 >UniRef50_B2W1R2 DNA-3-methyladenine glycosylase n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2W1R2_PYRTR Length = 439 Score = 156 bits (395), Expect = 8e-37, Method: Composition-based stats. Identities = 56/287 (19%), Positives = 90/287 (31%), Gaps = 70/287 (24%) Query: 52 PDIARHTLHINLSAGLEPVAAECLAKMSRLF-DLQCNPQIVNGALGRLGAAR-------P 103 P A+ L + RL D + V+ L L Sbjct: 152 PSPAKKRKAKELVPPDVGAIPNASTDVERLLKDAEEFLVKVDPKLEELVKKHHCKIFSPE 211 Query: 104 GLRLPGCVDAFEQGVRAILGQLV----------------------------------SVA 129 GLR VD F I+GQ V S Sbjct: 212 GLRE--VVDPFTALSSGIIGQQVLDEWHGLKQRAHWKASPSNPVHQNCKNNTSPPQVSGQ 269 Query: 130 MAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 A+ + A+ L+ + FPTP ++ L+ G+ ++AE + LA Sbjct: 270 AASSIRAKFTALF------PTTHPAFPTPTQVLQLPIPTLRTAGLSQRKAEYITGLAEKF 323 Query: 190 LEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-- 245 G L M + E+ ++ L G+GRW+ FA G + DVF D +++ Sbjct: 324 CSGELTAQMLVSASDEELIEKLVAVRGLGRWSVEMFACFGLKRMDVFSTGDLGVQRGMAV 383 Query: 246 ----------------PGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 MT ++ A + P+RS + ++W Sbjct: 384 YAGRDVNKLKSKGGKWKYMTEREMLDTAANFSPYRSLFMWYMWRIAD 430 >UniRef50_A1K6J5 DNA-3-methyladenine glycosylase II n=21 Tax=Proteobacteria RepID=A1K6J5_AZOSB Length = 229 Score = 156 bits (395), Expect = 9e-37, Method: Composition-based stats. Identities = 47/223 (21%), Positives = 88/223 (39%), Gaps = 22/223 (9%) Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG-AARPGLRLPG-CVDAFEQGVRAIL 122 A++ +L + ++ RL A P L P + +E VRA+ Sbjct: 1 MDTPDTLPPAQAELYQL--ATAHLAGIDADWARLVTAVGPCLLQPKPAREPYEALVRAVA 58 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q ++ ++ ++ R+ LY + FP P++L A AL+ G ++ E + Sbjct: 59 YQQLATSVGDRIIGRLLALYPDS--------AFPQPEQLLATGFDALRGCGFSARKIETI 110 Query: 183 IHLANAALEGTLPMTIP---GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 +A L G +P D E + L GIGRWT + + DV DD+ Sbjct: 111 HGIAQGTLSGLVPSRADAVSMDDEALIARLVELRGIGRWTVEMLLIFTLERIDVLPVDDF 170 Query: 240 LIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTE 275 +++ + + ++ R P+R+ A ++W + Sbjct: 171 GVREGYRHLKSLDEMPGRKEMARAGLVCSPYRTVAAWYLWRSL 213 >UniRef50_D2QEN8 HhH-GPD family protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QEN8_9SPHI Length = 207 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 82/202 (40%), Gaps = 25/202 (12%) Query: 90 IVNGALGRLGAARPGLRLPG--CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + + R+ A P +L D + + +I+ Q +SV A + +R L+ ++ Sbjct: 13 AQDPVMARIIAETPVPKLVNDYADDVYLALLESIVSQQISVKAADAIFSRFRALFPDK-- 70 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQA 205 +P L L++ G+ ++ + L +A +LE + E+ Sbjct: 71 -------YPQADALLLKTTDELRSAGLSFQKIKYLQSVAEFSLEKPIDRVHLDALTDEEI 123 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ------------I 253 ++ L G+GRWT + D+F DD +I+QR P Q + Sbjct: 124 VQYLLPIKGVGRWTVEMLLMFVLDRPDIFPIDDLVIRQRMLRAYPEQTNGLTGKALYKVL 183 Query: 254 RRYAERWKPWRSYALLHIWYTE 275 AE W+P+R+ A ++W + Sbjct: 184 LSIAEPWRPYRTTASRYLWRWQ 205 >UniRef50_B5ES79 HhH-GPD family protein n=4 Tax=Acidithiobacillus RepID=B5ES79_ACIF5 Length = 322 Score = 155 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 66/289 (22%), Positives = 108/289 (37%), Gaps = 19/289 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIARH 57 + L+ +PP+ + L +A + ++ Y R G+ R T Sbjct: 5 FRLSPRPPFRLDLTVWALRRQAHNRMDGWESETYRRVWRYGDDWLKVRLWQTKGDPDPFL 64 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCV 111 I E + A+++ + L + + L L A GLR P Sbjct: 65 EGEIYEGPQDERTVSWVRAQLTWMLSLDRDLGPFYVVAAGDPRLASLEARYRGLRPPRFP 124 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 FE V A+ Q +S+ + L R+++L E + + + FP P L + AL+ Sbjct: 125 SLFEGMVNAVACQQLSLHLGITLLNRLSELCREGVGEMDQVYPFPDPGSLLRQEVTALRG 184 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQ 229 LG ++ AL LA A G L + A++ L GIGRW+A Y LR Sbjct: 185 LGFSGQKVTALRALAEEAAVGGLEREDWQHLPNAAAVQRLLRLRGIGRWSAEYVLLRTLG 244 Query: 230 AKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHI 271 DVF DD ++ + AQ+ W+P+ + Sbjct: 245 RLDVFPGDDVGARKALARWLEENGSLDYAQVAHRLRPWQPYAGMVYFLL 293 >UniRef50_A0RYQ2 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Cenarchaeum symbiosum RepID=A0RYQ2_CENSY Length = 187 Score = 155 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 47/184 (25%), Positives = 80/184 (43%), Gaps = 24/184 (13%) Query: 105 LRLPGCVDA------FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTP 158 +RL G + E VR+I+ Q +S + A+ + AR LYG FP P Sbjct: 5 IRLVGEYNPRRTRNRHEALVRSIITQQLSGSAASSILARFRALYGG---------GFPRP 55 Query: 159 QRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIG 216 +A + L+ G+ +A+ + L+ L + E+ + L G+G Sbjct: 56 ADVARTPARKLQQAGISAMKADYIRGLSGMIDRRELKLAGFSRMGDEEVVAELVRVRGVG 115 Query: 217 RWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALL 269 RWTA F + +DV D +++ + T A+I + AERW+P+R+ A Sbjct: 116 RWTAEMFLIFALGRQDVLPLGDLGLRKGVMKLCSMDSLPTDAEIVKTAERWRPYRTAATW 175 Query: 270 HIWY 273 ++W Sbjct: 176 YLWK 179 >UniRef50_B2J3A5 HhH-GPD family protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J3A5_NOSP7 Length = 212 Score = 155 bits (393), Expect = 1e-36, Method: Composition-based stats. Identities = 50/209 (23%), Positives = 78/209 (37%), Gaps = 27/209 (12%) Query: 74 CLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAK 133 + L ++ + + L LG R PG F + IL Q VSVA A Sbjct: 15 LTRGLMVLANIDSDLARI---LETLGPPPIWSREPG----FATLLCIILEQQVSVAAARA 67 Query: 134 LTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT 193 + R+ I TP+ D L+ +G ++ LANA Sbjct: 68 VFNRLC-----------GVIVPLTPENFLTLDDVQLRGIGFSRQKILYSRGLANAIASDQ 116 Query: 194 LPMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM--- 248 L ++ +++ L+ GIG WT + + L Q DVF D I + Sbjct: 117 LDLSKLERMDETTIRTELKRLKGIGDWTVDIYLLMALQRPDVFPKGDLAIAIALQKLKNL 176 Query: 249 ----TPAQIRRYAERWKPWRSYALLHIWY 273 TP Q+ + W+PWR+ A +W+ Sbjct: 177 ATRPTPVQLEGMTQHWRPWRAVAARLLWH 205 >UniRef50_C1DYL3 Predicted protein n=2 Tax=Micromonas RepID=C1DYL3_9CHLO Length = 291 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 57/260 (21%), Positives = 97/260 (37%), Gaps = 33/260 (12%) Query: 37 RSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALG 96 R++ V V + + +P E +A+ G L Sbjct: 20 RTVRVEAAVPDVDGTFGDVVVADALRELSTRDPKLGELIARC--------------GELP 65 Query: 97 RLGAARPGLRLPGCVD-AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICF 155 R+ A + R + AF RAI+ Q ++ AA + RV + G + D Sbjct: 66 RIFACQEARRAKHEPNRAFRSLARAIVFQQLNGTAAATIFGRVLRCVGAQDDVLAL---- 121 Query: 156 PTPQRLAAADPQALKALGMPLKRAEALIHLANAA--LEGTLPMTIP----GDVEQAMKTL 209 TP + AD A++A G+ ++ E L+ LA A P++ D M L Sbjct: 122 -TPDAIIDADEAAMRACGLSQRKHEYLVALARAFHPAHSDFPLSDESLEAMDDTAVMSAL 180 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKP 262 GIG W+ + F + DV D+ +++ + + A++ AERWKP Sbjct: 181 VALRGIGPWSVHMFQMFYLNRPDVLPTKDFGVRKGVMRLYGLRDMPSEAKVEEIAERWKP 240 Query: 263 WRSYALLHIWYTEGWQPDEA 282 R+ A +++W Sbjct: 241 HRTLASMYMWQAADEGKSSG 260 >UniRef50_C8W0S2 HhH-GPD family protein n=6 Tax=Bacteria RepID=C8W0S2_DESAS Length = 201 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 39/173 (22%), Positives = 72/173 (41%), Gaps = 19/173 (10%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 D F V +I+ Q +S AA + R + + E T Q++A + ++ Sbjct: 37 PDLFAALVHSIISQQISSKAAATVWNRFLERFDE-----------ITSQKIAYTTAEEIQ 85 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGD--VEQAMKTLQTFPGIGRWTANYFALRGW 228 G+ +K+A + +A+A ++G + + E+ K L GIG WTA Sbjct: 86 QCGITMKKAIYIKSIADAVMQGEFNIDELSELPDEEVCKRLSALNGIGVWTAEMLMTFSM 145 Query: 229 QAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 Q +V D I++ + A+ +Y R+ P+ + A L++W Sbjct: 146 QRPNVMSWGDLAIRRGIMMLYHHRKLDKAKFEKYKRRYSPYCTIASLYLWEIA 198 >UniRef50_A7EZ08 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EZ08_SCLS1 Length = 418 Score = 153 bits (388), Expect = 5e-36, Method: Composition-based stats. Identities = 41/174 (23%), Positives = 69/174 (39%), Gaps = 10/174 (5%) Query: 91 VNGALGRLGAARPGLRLPGC------VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 V L + P R+ ++ F V I+ Q VS A A + A+ L+ Sbjct: 214 VEPKLKPIIEKHPC-RIFSAEGLAEEIEPFRALVSGIISQQVSGAAAKSIKAKFVALFNP 272 Query: 145 RLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDV 202 D P FPTP + A D L+ G+ ++AE + LA +G L + Sbjct: 273 PDSD-PSTHTFPTPSAIVATDLARLRTAGLSQRKAEYISGLALKFTDGELTTQFLLSASY 331 Query: 203 EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRY 256 E+ +L G+G+W+ FA + DVF D +++ + + + Sbjct: 332 EEVFASLIQVRGLGKWSVEMFACFALKRLDVFSTGDLGVQRGMAALLGKDVEKL 385 >UniRef50_Q1AWP7 DNA-3-methyladenine glycosylase II n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWP7_RUBXD Length = 163 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. Identities = 49/171 (28%), Positives = 76/171 (44%), Gaps = 18/171 (10%) Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R ++GQ +SV A + AR+ +G R P P L A + L+A G+ Sbjct: 1 MRTVVGQQLSVGAARSIYARLCARFGGR---------PPLPGELEAVPDEELRACGVSGA 51 Query: 178 RAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +A L LA LEG LP+ + + L GIGRW+A F + + DV Sbjct: 52 KARCLRELARRVLEGGLPLEELRGLPDGEVISALTAVRGIGRWSAQMFLIFHLRRPDVLP 111 Query: 236 PDDYLIKQR------FPGMTPAQ-IRRYAERWKPWRSYALLHIWYTEGWQP 279 D I++ P + + + R A W+PWR+ A L++W + P Sbjct: 112 AADLGIRRAAALLYGLPELPAEELLERLAAPWRPWRTTACLYLWRSLDALP 162 >UniRef50_C6IXS6 DNA-3-methyladenine glycosidase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IXS6_9BACL Length = 228 Score = 153 bits (387), Expect = 7e-36, Method: Composition-based stats. Identities = 46/196 (23%), Positives = 80/196 (40%), Gaps = 20/196 (10%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + +GRL A L F + R+I+ Q +SV A+ + RV +L GE Sbjct: 20 LASADPRMGRLIALIGSLATKPQGPLFTELARSIISQQISVKAASTIRGRVIELAGEL-- 77 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQA 205 +P L A L+A G+ + L L++ G L + D E+ Sbjct: 78 ---------SPAALLAQSDADLRAAGLSASKVAYLKDLSDKVQSGQLDLDRLQELDDEEV 128 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ-------IRRYAE 258 +K L + GIGRW+A F + + V D +++ + + +++ A Sbjct: 129 IKQLVSVKGIGRWSAEMFLIFALGREHVVSYGDAGLQRAAKWVYDMEERPDRKYLQQAAA 188 Query: 259 RWKPWRSYALLHIWYT 274 +W + S A L++W Sbjct: 189 QWPSYGSIASLYLWEA 204 >UniRef50_A6TTX3 Methylated-DNA--protein-cysteine methyltransferase n=67 Tax=Bacteria RepID=A6TTX3_ALKMQ Length = 355 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 38/190 (20%), Positives = 76/190 (40%), Gaps = 23/190 (12%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + A+ R+G G D F + +I+ Q +S A + R+ +L Sbjct: 176 LGAAIERIGKIERGT----IADPFTALISSIVSQQISNKAAETVWNRLDELLESM----- 226 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKT 208 TP+ + + ++ GM K+AE + +A+ AL G + ++ ++ Sbjct: 227 ------TPESITKTELSQIQGCGMTNKKAEYIKGIADVALCGKINFKTLHMLSDQEIIQK 280 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ------RFPGMTPAQIRRYAERWKP 262 L + G+G WT + +V D I++ ++ Q +Y ++ P Sbjct: 281 LSSLHGVGIWTVEMLLIFSLNRPNVVSYGDLAIRRGMMNLYGLKELSKEQFNQYRAKYAP 340 Query: 263 WRSYALLHIW 272 + S A L++W Sbjct: 341 YGSVASLYLW 350 >UniRef50_B3E6X3 HhH-GPD family protein n=2 Tax=Bacteria RepID=B3E6X3_GEOLS Length = 201 Score = 151 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 78/195 (40%), Gaps = 22/195 (11%) Query: 86 CNPQIVNGALGRLGAARPG-LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 CN ++ + P +R PG F R IL Q VS+ A ++ Sbjct: 14 CNKHLIFRIINDKYGIPPNWMREPG----FISLSRIILEQQVSIESAKAHFEKINSYI-- 67 Query: 145 RLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DV 202 PE+ TP + Q ++ + ++A+ L L+ A L+ L + + + Sbjct: 68 -----PEF----TPNEIIKLSDQEMRDCQISRQKAKYLRSLSEAILKNELNLEVMDTFND 118 Query: 203 EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM----TPAQIRRYAE 258 + + L GIG WT + + + Q KDVF D + + T ++ ++ Sbjct: 119 HEIREKLTKINGIGNWTVDIYLMFCLQRKDVFPSGDIAVINAAMELLEYETKDEVLNESK 178 Query: 259 RWKPWRSYALLHIWY 273 +W P RS A +W+ Sbjct: 179 KWAPLRSLAAYFLWH 193 >UniRef50_UPI0000D54B32 HhH-GPD n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=UPI0000D54B32 Length = 197 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 44/194 (22%), Positives = 80/194 (41%), Gaps = 20/194 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L L + P + FE VR I Q +SVA A + R+A ++ E Sbjct: 15 DKDLEDLIKSIPKIVPFRREKGFEGLVRLICEQQLSVASAKAIFERLA-----KIVSPFE 69 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTL 209 F + L+ G+ ++ + LANA +EG L T + K L Sbjct: 70 AKNF------LKVPKKDLQKTGLSRQKIDYCTGLANACIEGDLDFTTLHKMNDSDLRKEL 123 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK---QRFPGM----TPAQIRRYAERWKP 262 GIG+WTA+ + L + +D++ D ++ Q+ + + ++ + +WKP Sbjct: 124 CKIKGIGKWTADCYMLASLKREDIWPAGDLGLQISVQKLKKLSSRPSEMELEEISVKWKP 183 Query: 263 WRSYALLHIWYTEG 276 +R+ +W + Sbjct: 184 YRTLVANMLWNSYD 197 >UniRef50_D0LW65 DNA-3-methyladenine glycosylase II n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LW65_HALO1 Length = 220 Score = 151 bits (381), Expect = 3e-35, Method: Composition-based stats. Identities = 48/192 (25%), Positives = 77/192 (40%), Gaps = 17/192 (8%) Query: 93 GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY 152 + L A L ++F RAI+ Q ++ AA + AR L+ Sbjct: 30 AHMPALIAVHGPPDLARTRNSFASLGRAIVYQQLATRAAAAIYARFLALF--------PR 81 Query: 153 ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQ 210 FPTP L A L++ G+ +A AL LA +G++ D ++ TL Sbjct: 82 GRFPTPAALLAVSEDTLRSAGLSRAKATALRDLAAKFADGSVRSRQFSRMDADELRATLT 141 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPW 263 GIG W+ + F + G DV D +++ PA+++ A W P+ Sbjct: 142 QVRGIGPWSVDMFLIFGLMRPDVLPVGDLGVRKGMQRYFELEELPKPAEMQELAAPWAPF 201 Query: 264 RSYALLHIWYTE 275 RS A ++W Sbjct: 202 RSVASWYMWRVA 213 >UniRef50_Q6CEP5 YALI0B14080p n=1 Tax=Yarrowia lipolytica RepID=Q6CEP5_YARLI Length = 360 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 48/221 (21%), Positives = 79/221 (35%), Gaps = 56/221 (25%) Query: 109 GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQA 168 C + FE R I+GQ VS A A + + +L+ + E FP+PQ + +A Sbjct: 134 KCNNCFEHLTRGIIGQQVSGAAAESILKKFKKLF---PVEGSEDGKFPSPQEILDTPTEA 190 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALR 226 L++ G+ ++AE + L+ A +GTL + + L GIG W+A+ F L Sbjct: 191 LRSAGLSGRKAEYITCLSTAFKDGTLSDDWLSTASDDDVVDALVAIKGIGPWSADMFLLF 250 Query: 227 GWQAKDVFLPDDYLIKQRF----------------------------------------- 245 + DVF D I++ Sbjct: 251 ALKRMDVFTLGDLGIQRGVSVYLKERPHLAEIIKQVDFSLPINGVHSPGKSKKAGARKAA 310 Query: 246 ----------PGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 T ++ A R+ P+RS +L +W Sbjct: 311 KSKPDTKGKWRVPTADEMTWVAHRFAPYRSVMMLILWKISD 351 >UniRef50_Q5K8T8 DNA-3-methyladenine glycosidase, putative n=1 Tax=Filobasidiella neoformans RepID=Q5K8T8_CRYNE Length = 461 Score = 149 bits (378), Expect = 7e-35, Method: Composition-based stats. Identities = 39/140 (27%), Positives = 64/140 (45%), Gaps = 6/140 (4%) Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQAL 169 +D F V +I+GQ VS A + R L+G E FP+PQ + D +L Sbjct: 126 AIDPFRTLVTSIIGQQVSWMAARAINTRFRALFGFT----HEKEGFPSPQMVLMQDVTSL 181 Query: 170 KALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 K +G+ ++AE ++ LA+ G L + G E+ K L GIG+WT + F + Sbjct: 182 KGVGLSGRKAEYVLSLADHFASGQLSTQLLQSGTDEEISKALIAVRGIGQWTVDMFMIFS 241 Query: 228 WQAKDVFLPDDYLIKQRFPG 247 + D+ D +++ Sbjct: 242 LRRPDILAVGDLGVQKGLLK 261 Score = 47.5 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 10/34 (29%), Positives = 19/34 (55%) Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 + +TP ++ E W+P+RS A+ ++W G Sbjct: 427 KGGAYLTPKEMEALTEGWRPYRSLAVFYMWPVAG 460 >UniRef50_A9FBN7 Putative DNA-3-methyladenine glycosidase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FBN7_SORC5 Length = 292 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 104/275 (37%), Gaps = 11/275 (4%) Query: 7 QPPYDWSWMLGFLAARAVSSVETV-ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSA 65 + P+ + + L R + V+T AD Y R+ V + L + + Sbjct: 2 RAPFRLALTVAALQRRPENPVDTWSADRRYLRAFDTARGPVVWAVTEEPGGTQLRVEVFG 61 Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP------GLRLPGCVDAFEQGVR 119 ++ ++RL + RL A G++ P +E V Sbjct: 62 DVDDPRM-WRGLVTRLLGTDIDLAPFYARAERLPAFAALAARFRGVKPPRFASLWEAIVN 120 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKR 178 +++ Q +S+A A R+ + +D FP P+ +A++ P L+ LG+ + Sbjct: 121 SVVFQQLSLAAAMAAVRRLVLRFASPVDVAGQRLFPFPPPEVVASSTPHDLRTLGLSGAK 180 Query: 179 AEALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 A+AL A G L + + + L+ PGIG WTA+ LRG++ DVF Sbjct: 181 ADALRTCARMIAAGELREEELEALANGEIERRLRELPGIGPWTASVILLRGFRRLDVFPG 240 Query: 237 DDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 D + + E P+R H+ Sbjct: 241 GDVAAARGLGAIAGEHGGELVEALGPYRGMLYFHL 275 >UniRef50_Q7NJ14 Gll2018 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ14_GLOVI Length = 206 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 43/171 (25%), Positives = 71/171 (41%), Gaps = 18/171 (10%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F+ VRAI+ Q +S AA + R+ L+ R P P L A + AL+ +G Sbjct: 44 FDAVVRAIVYQQLSGKAAATIHKRLCDLFDGR---------PPLPAELLAVEAAALRGVG 94 Query: 174 MPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + ++ L LA G L + + + + L GIGRWTA F + Sbjct: 95 LSRQKLNYLKSLAAQVESGALAIETLHILEDQAILAELMRLKGIGRWTAQMFLMFRLGRP 154 Query: 232 DVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTE 275 +V D I++ +P Q+ +E W P+ + A ++W + Sbjct: 155 NVLPEGDLGIQKAIQLAYSLKALPSPKQMAAVSEPWHPYCTIACWYLWRSL 205 >UniRef50_B8GY42 DNA-3-methyladenine glycosylase II n=4 Tax=Caulobacteraceae RepID=B8GY42_CAUCN Length = 213 Score = 148 bits (375), Expect = 1e-34, Method: Composition-based stats. Identities = 51/197 (25%), Positives = 78/197 (39%), Gaps = 20/197 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + AL + A P + F ++ I+ Q VS+A AA + ARV E Sbjct: 21 DPALATVEAVTPPFAWRVGLGGFPGLLKMIVQQQVSLASAAAIWARVEAGLPEM------ 74 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTL 209 TP+ +A D L+ LG+ +A +A A L G E+A+ L Sbjct: 75 -----TPEIVADHDEAYLRTLGLSQPKARYARAIAEAHLSGVCDFDALRALSDEEAIAAL 129 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-------RYAERWKP 262 G+GRWTA F + D+F D +++ + A+ R AE W+P Sbjct: 130 TAIKGVGRWTAEVFLMFTQGRLDLFPGGDVALQEAMRWVDRAETRPTEKQAYARAELWRP 189 Query: 263 WRSYALLHIWYTEGWQP 279 +R A +W G Sbjct: 190 YRGVAAHLLWACYGAVK 206 >UniRef50_Q04UT1 DNA-3-methyladenine glycosylase II n=4 Tax=Leptospira RepID=Q04UT1_LEPBJ Length = 228 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 37/179 (20%), Positives = 75/179 (41%), Gaps = 20/179 (11%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 + ++ ++++LGQ +SV +A R+ L G + P P R+ + LK Sbjct: 58 NPYQVLIKSVLGQQLSVKVALTFERRLISLAGSKKI--------PPPDRILMIPNEELKK 109 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQ 229 +G+ + E + +A A L + + +E + L +F G+G WTA + Sbjct: 110 IGVSQAKIETIQRIAEAYLNRDITDSKLRKLEDSDVLNLLCSFKGVGPWTAEMVLIFALD 169 Query: 230 AKDVFLPDDYLIKQRFPGM------TPAQIRRYAERWKPWRSYALLHIWYT----EGWQ 278 D F +D ++++ +I+ + + P+R+ ++W EGW Sbjct: 170 RWDHFSINDLILRKSVEKHYGISKDNKKEIQHFLMSYSPFRTILSWYLWADMDGGEGWN 228 >UniRef50_A9M750 HhH-GPD family protein n=55 Tax=Rhizobiales RepID=A9M750_BRUC2 Length = 232 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 50/214 (23%), Positives = 82/214 (38%), Gaps = 23/214 (10%) Query: 72 AECLAKMSRLFDLQCNPQIV---NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 + + ++ L D++ + + + L + + L FE ++ Q VS Sbjct: 16 GQAMRRIDTLSDIEAGLEALVLADRRLADIRNRSHAVPLRRSEPGFESLASIVVAQQVST 75 Query: 129 AMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA 188 A AA + AR+ Q+ TP+ A +A + G+ + L+ L+ A Sbjct: 76 ASAAAIWARLKQVINPL-----------TPEAYIAGGEEAWRLAGLSRPKQRTLLALSEA 124 Query: 189 ALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP 246 G L + D+ +A+ TL GIG WTA + L DVF D ++ Sbjct: 125 LAGGALDLHGLCDLPAGEAIATLTAIKGIGPWTAEVYLLFAAGHPDVFPAGDVALQTAVG 184 Query: 247 GM-------TPAQIRRYAERWKPWRSYALLHIWY 273 A +R+ AE W PWR A W Sbjct: 185 HAFAHETRPDAAALRQLAENWAPWRGVAARLFWA 218 >UniRef50_O94468 Probable DNA-3-methyladenine glycosylase 2 n=1 Tax=Schizosaccharomyces pombe RepID=MAG2_SCHPO Length = 213 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 37/171 (21%), Positives = 74/171 (43%), Gaps = 17/171 (9%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 +E +RAI Q +S A + + + + FPTP+++ D + L Sbjct: 41 PYEGIIRAITSQKLSDAATNSIINKFCTQCSDNDE-------FPTPKQIMETDVETLHEC 93 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIP---GDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 G +++ + +A AAL +P E+ M++L G+ RWT +++ Sbjct: 94 GFSKLKSQEIHIVAEAALNKQIPSKSEIEKMSEEELMESLSKIKGVKRWTIEMYSIFTLG 153 Query: 230 AKDVFLPDDYLIK---QRFPGMTPA----QIRRYAERWKPWRSYALLHIWY 273 D+ DD +K + F G++ ++ + + KP+R+ A ++W Sbjct: 154 RLDIMPADDSTLKNEAKEFFGLSSKPQTEEVEKLTKPCKPYRTIAAWYLWQ 204 >UniRef50_A0Z859 HhH-GPD protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z859_9GAMM Length = 215 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 46/214 (21%), Positives = 85/214 (39%), Gaps = 24/214 (11%) Query: 76 AKMSRLFDLQCNPQI-VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKL 134 A + R + V ALG +G R F+ + ++GQ VS A + Sbjct: 12 AHIKRALSAAAEIDVRVATALGLIGLPAARKR----DHGFDSLAKIVVGQQVSTRAAEAI 67 Query: 135 TARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL 194 T R+ + +L+ P+ L + D +L+A G+ ++ L LA A +EG L Sbjct: 68 TQRLLESLNGQLE----------PEILLSRDDDSLRAAGLSRQKISYLRSLATAVVEGAL 117 Query: 195 PMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM---- 248 P+ ++ ++ + G G W+A + + D++ D ++ F + Sbjct: 118 PLLDLPKMSDDEVLQRITAIRGFGAWSAQMYLMFSLGRTDIWPSGDLAVRVGFGRLLGLV 177 Query: 249 ---TPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 T + A+ + P+RS L W+ P Sbjct: 178 ERPTAKKTEELAKDFTPYRSALALLCWHYYSNAP 211 >UniRef50_Q5SLG4 DNA-3-methyladenine glycosidase n=6 Tax=Bacteria RepID=Q5SLG4_THET8 Length = 185 Score = 146 bits (369), Expect = 8e-34, Method: Composition-based stats. Identities = 43/169 (25%), Positives = 73/169 (43%), Gaps = 15/169 (8%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 F +++ Q +S AA+L R+ +L PTP+ A L+ Sbjct: 29 PFRVLAESVVAQQLSTRAAARLAERLFRL------------VPPTPEAFLEAPLDLLRQA 76 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ +A AL LA A EG L + E ++ L G+G WTA F + G D Sbjct: 77 GLSRAKALALKDLAAKAEEGLLDGLDRLEDEAVVERLTRVRGVGLWTAEMFLMFGLGRPD 136 Query: 233 VFLPDDYLIKQ---RFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 V+ D +++ R G+ P + + E ++P+RS+ ++W + Sbjct: 137 VWPVRDLGLRRAAARLFGVAPEALPAFGEAFRPYRSHLAWYLWRSLSSP 185 >UniRef50_Q688W2 Os05g0567500 protein n=2 Tax=Oryza sativa RepID=Q688W2_ORYSJ Length = 290 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 40/170 (23%), Positives = 67/170 (39%), Gaps = 12/170 (7%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 AF +IL Q ++ + AA + AR L D + P + A L+A+ Sbjct: 100 AFHSLAHSILHQQLAPSAAAAIYARFLALIPAAADPDAAVVN---PAAVLALSAADLRAI 156 Query: 173 GMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++A L LA G L + D + L G+G WT + F + Sbjct: 157 GVSARKAAYLHDLAGRFAAGELSESAVAAMDEAALLAELTKVKGVGEWTVHMFMIFSLHR 216 Query: 231 KDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 DV D +++ + P ++ ERW+P+RS ++W Sbjct: 217 PDVLPSGDLGVRKGVQELYGLPALPKPEEMAALCERWRPYRSVGAWYMWR 266 >UniRef50_B6K1P6 DNA-3-methyladenine glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1P6_SCHJY Length = 224 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 77/179 (43%), Gaps = 18/179 (10%) Query: 105 LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA 164 L+ + +E + A+ Q +S + + R+ Q + ++ FP+ Q L + Sbjct: 42 LKPQTAREPYEGLIHALTYQRLSDSAGDAILGRLCQHFHKK--------SFPSVQELLSL 93 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTAN 221 D + L++ G ++ E ++ LAN A +G+LP +++ + GIG WT Sbjct: 94 DTEDLRSFGFSHRKGETILELANMAADGSLPSREEISHMPLDKMIGIFTKVKGIGAWTVE 153 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 +A+ +V D I++ + T ++ + + P+++ A ++W Sbjct: 154 KYAIFTLGRPNVMPTMDREIRENVQLLYHLDHTPTDVEMEERSRAYVPYKTVASWYLWR 212 >UniRef50_A8QA43 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8QA43_MALGO Length = 339 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 31/144 (21%), Positives = 61/144 (42%), Gaps = 9/144 (6%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP-------EYICFPTPQRLAA 163 ++ F +ILGQ +S A + + +L+ L P + + FPTP ++ Sbjct: 109 LNLFRVLTTSILGQQISWLAARSIMYKFCRLFAPDLPLQPNLDAVNKDELPFPTPLQVLK 168 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTAN 221 A L+ G+ + + + +A +G L + I + E + L G+GRWTA Sbjct: 169 ATDDELRRAGLSTAKIKYVRDVARRFSDGRLDVRKIIHMNPEACITELTQVKGVGRWTAE 228 Query: 222 YFALRGWQAKDVFLPDDYLIKQRF 245 + ++ D+ D +++ Sbjct: 229 MLLMFALRSPDILPVGDLGVQRGI 252 Score = 43.3 bits (101), Expect = 0.009, Method: Composition-based stats. Identities = 10/34 (29%), Positives = 18/34 (52%) Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 ++ + P ++ A W P+RS A + +W EG Sbjct: 306 KKHMYLDPEEMSVLAAPWAPYRSVACMFMWSIEG 339 >UniRef50_C6A294 AlkA 3-methyladenine DNA glycosylase n=9 Tax=Thermococcaceae RepID=C6A294_THESM Length = 279 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 48/188 (25%), Positives = 79/188 (42%), Gaps = 14/188 (7%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L GL +P D ++ V I Q VS A + + +L G++L++ Sbjct: 80 DSKFAFLIKEFYGLTIPKAPDKYQALVETIAQQQVSFEFAMQTIRNLVKLAGKKLEN--- 136 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG-DVEQAMKTLQ 210 FPTPQ + + + + RA + HL LEG L + + D ++A+K L Sbjct: 137 LYIFPTPQSILNLSEEKFREAKL-GYRAGYIRHLTKEYLEGNLNLDLEELDEKEAIKYLT 195 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM--------TPAQIRRYAERWKP 262 F GIGRW+A F G K+V+ D +K+ + +R E + Sbjct: 196 KFKGIGRWSAELFLAYGLG-KNVYPAGDLGMKRGIAKIFGKNPKEVKEKDVREIIEPYGK 254 Query: 263 WRSYALLH 270 W+S + Sbjct: 255 WKSLLAFY 262 >UniRef50_Q1J274 Endonuclease III, DNA-3-methyladenine glycosidase II n=3 Tax=Deinococcus RepID=Q1J274_DEIGD Length = 216 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 46/177 (25%), Positives = 71/177 (40%), Gaps = 18/177 (10%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 F VR+++GQ +S AA + AR+ G P+ L P L+AL Sbjct: 46 PFGTLVRSVVGQQLSTQAAASIAARLEDALGGV-----------EPEALLRTPPDKLRAL 94 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQA--MKTLQTFPGIGRWTANYFALRGWQA 230 G+ + + LA+AAL G + + A + L PGIGRWT F + G Sbjct: 95 GLSWAKVRTVRALADAALSGQVDFAHLSSLPDAAVIDALTPLPGIGRWTVEMFLMFGLAR 154 Query: 231 KDVFLPDDYLIKQRFPGMTP-----AQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 DVF D +++Q + P + W P+R+ A +W + Sbjct: 155 PDVFSFGDLVLRQGLSRLYPHVAPGSAQAAVVAAWSPYRTLAARVLWAERRTDKERG 211 >UniRef50_C5SF64 HhH-GPD family protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SF64_9CAUL Length = 214 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 46/201 (22%), Positives = 71/201 (35%), Gaps = 20/201 (9%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + AL L AA L D FE +R I+ Q +SV A + ++ E Sbjct: 17 LAEADPALAPLFAAVGDLNFRHRADGFEGLLRLIVEQQLSVRAADAIWQKLRGGLSEM-- 74 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQA 205 +P L + L+ G+ + LA A L + E A Sbjct: 75 ---------SPAHLLTLSDETLRGHGLSRPKVRYARILAEAVHARALDFAHVRSLEAEDA 125 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-----PAQI--RRYAE 258 ++ L GIGRWTA + + D+F D +++ + P ++ R A Sbjct: 126 IEHLTALKGIGRWTAEVYLMFCEGRLDIFPTGDIALREALGWLDGLDARPDEVYCRERAL 185 Query: 259 RWKPWRSYALLHIWYTEGWQP 279 W P RS +W G Sbjct: 186 CWAPHRSVVSHALWGWYGAVK 206 >UniRef50_D2LH30 HhH-GPD family protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LH30_RHOVA Length = 214 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 50/202 (24%), Positives = 75/202 (37%), Gaps = 20/202 (9%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 L V+ + L A + + F+ V I GQ +SVA + R+ Sbjct: 10 LAQHIAELVRVHPSFMPLKEAAGPVPVRWLDRGFKGLVFVITGQQISVAAGRAIFGRLEG 69 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI-- 198 G+ T + LAAAD L+ G + L L AAL L + Sbjct: 70 ALGD-----------ITAETLAAADDTILREAGYSRPKMRTLRALQEAALADGLDLVAIE 118 Query: 199 PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPA 251 D E+A+ L GIG WTA + L D+F D +++ + Sbjct: 119 AMDAERAIIKLSAIKGIGPWTAEVYLLFAAGHPDIFPAADVALQESMRLAFDLDARPSTQ 178 Query: 252 QIRRYAERWKPWRSYALLHIWY 273 +R ++ W PWRS A +W Sbjct: 179 ALREISDAWTPWRSAAARLLWA 200 >UniRef50_Q8F6D8 DNA-3-methyladenine glycosylase n=2 Tax=Leptospira interrogans RepID=Q8F6D8_LEPIN Length = 213 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 60/180 (33%), Gaps = 23/180 (12%) Query: 108 PGCVD---AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA 164 P FE V IL Q VS+A A ++ G T +++ Sbjct: 37 PPFWSRKPNFETLVHIILEQQVSLASARAALVKLKNKIGSV-----------TARKILLL 85 Query: 165 DPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANY 222 L+ ++ + LA + + L T GIG WT + Sbjct: 86 SDIELRECYFSRQKTSYVRDLAEFVFSKRIILGDLASKSDQMIRGDLITVKGIGNWTVDI 145 Query: 223 FALRGWQAKDVFLPDDYLIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHIWYTE 275 F + D+F D + + +I ++ W+P+RS A + +W++ Sbjct: 146 FLIMALHRADIFPLGDLAAVKSLKKIKKLPVDTSNDKILSVSKSWRPFRSIATMLLWHSY 205 >UniRef50_Q0BWS7 Putative DNA-3-methyladenine glycosylase n=1 Tax=Hyphomonas neptunium ATCC 15444 RepID=Q0BWS7_HYPNA Length = 213 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. Identities = 58/212 (27%), Positives = 82/212 (38%), Gaps = 27/212 (12%) Query: 74 CLAKMSRLFDLQC-NPQIVNGALGRLGAARPGLRLP---GCVDAFEQGVRAILGQLVSVA 129 A R C + + AL R A L +P + R I Q +S Sbjct: 1 MTAPSRRRLKTACERLALADPALAR---AYDSLGVPEWRTSEPGYNMLGRMISHQQLSTK 57 Query: 130 MAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 AA + RV GE TP+ L AADP AL+A G+ + L +A A Sbjct: 58 AAATIWGRVEVFLGEV-----------TPETLLAADPDALRACGLSRPKVAHLTSIAEAM 106 Query: 190 LEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG 247 + G L + D++ A L + GIG WTA F L A D F D + + Sbjct: 107 VTGELNLARVCAADLDSARAELVSVRGIGPWTAELFLLYAVGAMDAFPIADVGLMEAHKQ 166 Query: 248 MTPAQIR-------RYAERWKPWRSYALLHIW 272 + + R ++AE W+P R A +W Sbjct: 167 LGRYETRMESKIFTQHAEIWRPHRGVAAHLLW 198 >UniRef50_D0XPK8 HhH-GPD family protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XPK8_9CAUL Length = 230 Score = 143 bits (360), Expect = 9e-33, Method: Composition-based stats. Identities = 59/220 (26%), Positives = 86/220 (39%), Gaps = 26/220 (11%) Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 + L P+ E LA ++ AL R A P F R I+ Sbjct: 12 VDKDLMPLTPEDLAAARETL------ARLDPALARAHAQTPPFEWRVRQGGFVGLFRMIV 65 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q VSVA AA + AR+ GE TP L A D +L+ +G+ ++A Sbjct: 66 EQQVSVASAASVWARLQAGLGE-----------ITPAGLLAHDLDSLRGMGLSRQKATYG 114 Query: 183 IHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 +A A +EGT+ + D A++ L G+G WTA + L DVF D Sbjct: 115 QGMARAQIEGTIDLEHLATLDDAAAIEALVRLKGVGLWTAEAYLLLCEGRTDVFPGGDVA 174 Query: 241 IKQRFPGMTPAQIR-------RYAERWKPWRSYALLHIWY 273 +++ + R AE W+PWR A +W Sbjct: 175 LQEAIKWADGTETRPDTKGAYARAEIWRPWRGVATHLLWA 214 >UniRef50_A6EE77 3-methyladenine DNA glycosylase n=1 Tax=Pedobacter sp. BAL39 RepID=A6EE77_9SPHI Length = 222 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 46/211 (21%), Positives = 75/211 (35%), Gaps = 26/211 (12%) Query: 78 MSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCV------DAFEQGVRAILGQLVSVAMA 131 + +F QI + R +R G + FE V IL Q VS+A A Sbjct: 2 IVNIFSHDNFHQICDELAATDADLRSVIRTYGYPPMWKRSNTFESLVHIILEQQVSLASA 61 Query: 132 AKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE 191 ++ E TP L + LKA + +++ + HLA + L Sbjct: 62 LAALNKLRDRLKEV-----------TPGVLLQLTDEELKACYLSRQKSIYVRHLATSILH 110 Query: 192 G--TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT 249 G L + + L G+G WT + + + Q DVF D + Sbjct: 111 GSIDLDLMPRLPDREIRILLNQLKGVGNWTIDVYLMFVLQRADVFPSGDLAAVNALKQLK 170 Query: 250 -------PAQIRRYAERWKPWRSYALLHIWY 273 + R A W+P+R+ A + +W+ Sbjct: 171 DLPVGTHKEVLERIAMNWQPYRTVATMILWH 201 >UniRef50_A8N5M3 Putative uncharacterized protein n=2 Tax=Agaricales RepID=A8N5M3_COPC7 Length = 822 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 31/135 (22%), Positives = 59/135 (43%), Gaps = 9/135 (6%) Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDD-------FPEYICFPTPQRLAAADPQALKALGM 174 LGQ +S A +T + +LY + + FPTP++++ + L+ G+ Sbjct: 567 LGQQISWKAARSITHKFIRLYSPSIPEEVTDESRAAAMQVFPTPEQVSKTEVSLLRTAGL 626 Query: 175 PLKRAEALIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 ++A+ + LA +G L + E+ + L GIGRWT + FA+ + D Sbjct: 627 SERKAQYIQDLAARFADGRLSTDKLLNASDEELAEMLIEVKGIGRWTVDMFAIFSLRRPD 686 Query: 233 VFLPDDYLIKQRFPG 247 + D +++ Sbjct: 687 ILPVGDLGVQRGLAR 701 >UniRef50_B0D0G2 Predicted protein n=1 Tax=Laccaria bicolor S238N-H82 RepID=B0D0G2_LACBS Length = 415 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 35/132 (26%), Positives = 60/132 (45%), Gaps = 9/132 (6%) Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEY-------ICFPTPQRLAAADPQALKALGMP 175 GQ +S A +T R +LY + + P++ FPTPQ + D L+ G+ Sbjct: 129 GQQISWLAARSITHRFIRLYHPSIPEKPDHQMMKSYLHLFPTPQDIVDTDIATLRTAGLS 188 Query: 176 LKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 ++AE + LA+ ++G L + D + L GIGRWT + FA+ + D+ Sbjct: 189 ARKAEYVKDLASRFVDGRLSTEKLLNADDDDLYSILIEVRGIGRWTVDMFAIFSLRRPDI 248 Query: 234 FLPDDYLIKQRF 245 D +++ Sbjct: 249 LPVGDLGVQRGL 260 Score = 40.6 bits (94), Expect = 0.068, Method: Composition-based stats. Identities = 10/25 (40%), Positives = 17/25 (68%) Query: 248 MTPAQIRRYAERWKPWRSYALLHIW 272 +TP ++ ERWKP+RS + ++W Sbjct: 385 LTPQEMAALTERWKPYRSLGVYYMW 409 >UniRef50_B6G8M1 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6G8M1_9ACTN Length = 189 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 16/172 (9%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 AF +I+ Q++S+ + +R+ +L TP+ +A + +K+ Sbjct: 14 SAFHSLAHSIIEQMLSMKAGRAIESRLRELCDGDY----------TPECIAGIPAENIKS 63 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 GM ++ ++L LA AL L E KTL PGIG+WT + F L Sbjct: 64 CGMSFRKVQSLKTLAEYALANDLESLAELPDEDVYKTLVQLPGIGKWTCDMFLLFYLGRP 123 Query: 232 DVFLPDDYLIKQRFPGMTPAQI------RRYAERWKPWRSYALLHIWYTEGW 277 D+ +D ++Q F + A I W+P+ S A+ +++ Sbjct: 124 DILPVEDGALRQAFEWLYGAPIVSKEVQAVVCSLWRPYSSTAVRYLYRALNT 175 >UniRef50_A8J6X9 Predicted protein (Fragment) n=1 Tax=Chlamydomonas reinhardtii RepID=A8J6X9_CHLRE Length = 179 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 65/175 (37%), Gaps = 16/175 (9%) Query: 109 GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQA 168 G F R++ Q ++ A+ + RV + TP + AA P+A Sbjct: 5 GAGGVFSALARSVAYQQLATKAASTIWGRVLGVC------QVGSTAALTPAHILAAPPEA 58 Query: 169 LKALGMPLKRAEALIHLANAALEG---TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFAL 225 L+ G+ ++ E L+ LA A + DV+ + L GIG WT + A+ Sbjct: 59 LRGAGLSGRKLEYLVGLAQAFSGRPGWEQELEALTDVDALVAQLTPLRGIGEWTVHMIAM 118 Query: 226 RGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWY 273 DV D +++ + Q+ W P+RS ++W Sbjct: 119 MHLGLPDVLPTGDLGVRRGLQLLYGLRQLPDVRQVEEITAGWAPYRSVGSWYMWR 173 >UniRef50_A8TVS7 HhH-GPD n=1 Tax=alpha proteobacterium BAL199 RepID=A8TVS7_9PROT Length = 229 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 55/198 (27%), Positives = 80/198 (40%), Gaps = 26/198 (13%) Query: 85 QCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 Q +P + L +G P + PG F +R ++GQ VS A AA + R+ + G+ Sbjct: 22 QAHPAL-GAVLVEIGPPEPRILQPG----FGSLLRIMVGQQVSTASAAAIWGRLVEASGD 76 Query: 145 RLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV-- 202 +D F + D +AL +G + LA+A L+GTL + Sbjct: 77 TVDGFN------------SLDDEALGRVGFSRAKMRYGRALADAVLDGTLNPDDLEKLPG 124 Query: 203 EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRR 255 EQ L PGIGRWTA + + DVF D +++ R Sbjct: 125 EQVSAQLMALPGIGRWTAEIYRMFALGDPDVFPIGDLALREGVRMALDLPERPDLGAAER 184 Query: 256 YAERWKPWRSYALLHIWY 273 WKP RS A L +W Sbjct: 185 LTAAWKPERSAAALLLWR 202 >UniRef50_B6U6Y8 DNA-3-methyladenine glycosylase 1 n=8 Tax=Magnoliophyta RepID=B6U6Y8_MAIZE Length = 291 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 40/199 (20%), Positives = 70/199 (35%), Gaps = 12/199 (6%) Query: 87 NPQIVNGALGRLGA--ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 + Q + L + A P + AF R+IL Q ++ + A + AR L Sbjct: 74 HLQAADPLLTAVIANTEAPTFTATPSLPAFHSLARSILYQQLATSAADAIYARFLALLPS 133 Query: 145 RLDDFPEYICFPTPQRLAAADPQAL-KALGMPLKRAEALIHLANAALEGTLPMTI--PGD 201 A A + +G+ ++A L LA G L + D Sbjct: 134 ASAAAAAVAADAVTPAAVLALAAADLRTIGVSGRKASYLHDLAARFAAGELSDSAVAAMD 193 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIR 254 + L G+G WT + F + D+ D +++ + P ++ Sbjct: 194 EAALLAELTKVRGVGEWTVHMFMIFSLHRPDILPCGDLGVRKGVQELYKLKSLPNPEEMA 253 Query: 255 RYAERWKPWRSYALLHIWY 273 ERW+P+RS ++W Sbjct: 254 ALCERWRPYRSVGAWYMWR 272 >UniRef50_D1IGU3 Whole genome shotgun sequence of line PN40024, scaffold_63.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IGU3_VITVI Length = 364 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 47/243 (19%), Positives = 82/243 (33%), Gaps = 27/243 (11%) Query: 49 TAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLP 108 + P A L I + V A L + + L + A + Sbjct: 114 HSSPRTAAILLPITRLLSSDDVVAAALRHLRSS----------DPVLAPVIDAYEPPKFE 163 Query: 109 GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQA 168 F ++IL Q ++ + R L G P + TP +L Sbjct: 164 NSDTPFLALAKSILYQQITHKAGTTIYNRFVSLCGGETRVCPISVLALTPPQLL------ 217 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ-AMKTLQT-FPGIGRWTANYFALR 226 +G+ ++ L LAN G L + +E A+ +L G G + + F + Sbjct: 218 --QIGVSARKVSFLHDLANKYRTGILSDSKILTMEDRALVSLIAMVKGFGVLSVHMFMIF 275 Query: 227 GWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DV D +++ + P+Q+ + ERW+P+RS A +IW Sbjct: 276 SLHRPDVLPVGDANLRKGVQMLYGLEELPRPSQMEKLCERWRPYRSVASWYIWRLSEANG 335 Query: 280 DEA 282 + Sbjct: 336 VQG 338 >UniRef50_A3JFL8 3-methyladenine DNA glycosylase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JFL8_9ALTE Length = 207 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 49/205 (23%), Positives = 76/205 (37%), Gaps = 27/205 (13%) Query: 80 RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVA 139 +L + + + RLGA R PG F V IL Q +S+ A + R+ Sbjct: 17 QLAAVDADLGRIYT---RLGAPPLWAREPG----FASLVHIILEQQISIKAAQTVFERLC 69 Query: 140 QLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP 199 GE +PQR+ +A + LKA G+ ++A LA G L + Sbjct: 70 AHLGEM-----------SPQRMVSAGEEELKAFGLTRQKARYCFGLAERIHTGKLNLAQL 118 Query: 200 G--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TP 250 + L PG+G W+ + + L + DV+ D + + T Sbjct: 119 DALSDTEGRDALLAIPGLGPWSVDVYYLMALRRPDVWPLGDLALAAAMQEIKQLDAPATR 178 Query: 251 AQIRRYAERWKPWRSYALLHIWYTE 275 Q A W PWR+ A +W Sbjct: 179 QQQVDIANAWSPWRAVAARLLWMHY 203 >UniRef50_A8IJX2 HhH-GPD protein n=1 Tax=Azorhizobium caulinodans ORS 571 RepID=A8IJX2_AZOC5 Length = 217 Score = 139 bits (352), Expect = 8e-32, Method: Composition-based stats. Identities = 51/197 (25%), Positives = 79/197 (40%), Gaps = 19/197 (9%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 ++ L L A L F V I+ Q +SVA A ++AR Q+ G Sbjct: 23 LDDRLHALVAEAGRPPLRRRAPDFAGLVNIIIAQQLSVAAARAISARTEQVLGGP----- 77 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKT 208 PT + L A P+ LKA G+ + L +A A +G + + + + A Sbjct: 78 -----PTVEALLNASPETLKAGGLSAPKIRTLTRIARALADGVVDLAHVEAMEADAAADY 132 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWK 261 L PGIGRWTA+ + L D F D ++ + ++ AE W+ Sbjct: 133 LTRLPGIGRWTADIYLLFCLGRSDAFPEGDLALQVAAADAFGLPGRASALGLKAIAEDWR 192 Query: 262 PWRSYALLHIWYTEGWQ 278 P+R A +W G + Sbjct: 193 PYRGVAAHLLWAYYGAR 209 >UniRef50_Q0USE2 Putative uncharacterized protein n=1 Tax=Phaeosphaeria nodorum RepID=Q0USE2_PHANO Length = 240 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 39/152 (25%), Positives = 65/152 (42%), Gaps = 9/152 (5%) Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEA 181 +GQ VS A AA + + L+ E FP+P ++ D L+ G+ ++AE Sbjct: 1 MGQQVSGAAAASIRKKFTSLFPETHP------SFPSPSQILEKDLPTLRTAGLSQRKAEY 54 Query: 182 LIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + LA G L PM + E+ ++ L G+GRW+ FA G + DVF D Sbjct: 55 ISGLAEKFASGELSAPMLVTASDEELIEKLVAVRGLGRWSVEMFACFGLKRMDVFSTGDL 114 Query: 240 LIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 +++ + + W+ Y H+ Sbjct: 115 GVQRGMAAYMGRDTSKLKAKGGKWK-YVKTHL 145 >UniRef50_B8EL05 HhH-GPD family protein n=5 Tax=Alphaproteobacteria RepID=B8EL05_METSB Length = 249 Score = 138 bits (349), Expect = 1e-31, Method: Composition-based stats. Identities = 54/201 (26%), Positives = 79/201 (39%), Gaps = 24/201 (11%) Query: 86 CNPQIVNGALGRLGA--ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 ++ + RL A A+P LR F I+GQ +SVA A + R+ G Sbjct: 18 AELGRLDPVMARLVAEGAQPVLR--KREPGFHGLAWIIMGQQLSVASADAIWRRLIDRLG 75 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GD 201 TP + AA +ALKA G+ + L +A A G LP+ Sbjct: 76 PL-----------TPSVIEAATDEALKACGLSAPKIRTLRAIAEAITSGALPLDELGVMP 124 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD---YLIKQRFPGMTPA----QIR 254 + A L GIG WTA+ + + D F D + GMT ++ Sbjct: 125 ADAAHAALTAVKGIGPWTADIYLMFCLGHSDAFAAGDLALQAAARLAYGMTARPGAPELV 184 Query: 255 RYAERWKPWRSYALLHIWYTE 275 AE+W+PWR+ A +W Sbjct: 185 ALAEQWRPWRAVAAKVLWAHY 205 >UniRef50_Q4A0G9 Putative DNA-3-methyladenine glycosidase n=1 Tax=Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 RepID=Q4A0G9_STAS1 Length = 221 Score = 138 bits (348), Expect = 2e-31, Method: Composition-based stats. Identities = 39/192 (20%), Positives = 75/192 (39%), Gaps = 19/192 (9%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L +L L++ D + +R+I+GQ ++VA+A + +++ + Sbjct: 20 DATLAQLINQIGDLQIQTRADPLKSLIRSIIGQQITVAVAQSIFQKLSIAIDDHW----- 74 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTL 209 T +L+ +KALG+ + + ++ A G L D + L Sbjct: 75 -----TVNQLSQLRESEMKALGLSQSKINYIQNVLFAVRNGQLNFEQLYKMDDNSVINAL 129 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWKP 262 GIGRWTA F L Q K++ D +++ + Q+ E+W+ Sbjct: 130 TQIKGIGRWTAEVFLLFTLQRKNILPIYDVGLQRAAQWLYQTTKAERKKQLTICKEQWQG 189 Query: 263 WRSYALLHIWYT 274 S ++W Sbjct: 190 CASIGAFYLWEA 201 >UniRef50_Q8TL35 DNA-3-methyladenine glycosylase II n=1 Tax=Methanosarcina acetivorans RepID=Q8TL35_METAC Length = 299 Score = 138 bits (347), Expect = 3e-31, Method: Composition-based stats. Identities = 49/281 (17%), Positives = 107/281 (38%), Gaps = 18/281 (6%) Query: 9 PYDWSWMLGFLAARA-VSSVETVADSYYARSLAVGEYRGVVTAIPD-----IARHTLHIN 62 P+D+S L F+ A +TV + +++ + + + + Sbjct: 15 PFDFSKSLNFMGMFAPAEGEQTVTGFSFTKAVYLENKILAFRLKNEGTVSEPGLSYIFYS 74 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQI------VNGALGRLGAARPGLRLPGCVDAFEQ 116 E + + L ++ L + Q + + GL + FE Sbjct: 75 CEEISEEIKSALLDRIKFFLSLDDDLQPFYVLGSKDPQFVPVLEELYGLHQVKFLTPFEA 134 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMP 175 A+L Q +S+ +A K+ ++ + G + + Y FP+ +++ + L ++ Sbjct: 135 AAWAVLSQRISMKVAHKIKNKLTEAIGNSIQIEGIVYRTFPSARQVKNLGVENLASIIKN 194 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +++E LI +A+A G+++ + L GIG W+A+ +RG + Sbjct: 195 ERKSEYLIAVADAFDRVDENFLRQGNIKDVREWLMNIWGIGEWSAHLILIRGLGRMEELS 254 Query: 236 PDDYLIKQRF-----PGMTPAQIRRYAERWKPWRSYALLHI 271 + + F P T Q RR A+ + ++ Y ++ Sbjct: 255 EHEKTLLNCFKRFYGPEATEDQFRRVADSYGDFKGYWAYYL 295 >UniRef50_Q7CSU9 DNA-3-methyladenine glycosidase II n=1 Tax=Agrobacterium tumefaciens str. C58 RepID=Q7CSU9_AGRT5 Length = 215 Score = 136 bits (344), Expect = 6e-31, Method: Composition-based stats. Identities = 44/195 (22%), Positives = 68/195 (34%), Gaps = 19/195 (9%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + ++ AL + L L F ++ Q+VS A A + AR+ G ++ Sbjct: 16 HLARLDPALSPVIEKAGPLELRIHEPGFAGLAHIVVSQMVSRASANAIWARILAGTGGKV 75 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ-- 204 T + A + G+ +A L LA A EG + + E Sbjct: 76 ----------TAENYLAVSEELRATFGLSRAKATTLEGLARAVTEGQVDLDGVVRKEAGA 125 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-------RYA 257 A L GIG WTA + + D+F D ++ ++R A Sbjct: 126 AFSELVALRGIGPWTAEVYLMFCGGHPDIFPVGDVALRSAVAHALDLEVRPDAKWLAERA 185 Query: 258 ERWKPWRSYALLHIW 272 W PWRS A W Sbjct: 186 TLWSPWRSVAARLFW 200 >UniRef50_Q1H1S0 DNA-3-methyladenine glycosylase II n=1 Tax=Methylobacillus flagellatus KT RepID=Q1H1S0_METFK Length = 142 Score = 136 bits (344), Expect = 7e-31, Method: Composition-based stats. Identities = 38/129 (29%), Positives = 56/129 (43%), Gaps = 10/129 (7%) Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI---PGDVEQAMKTLQT 211 FP P L A D L+A G ++ + LA AAL G +P + E + L Sbjct: 6 FPAPAALLATDVAQLRACGFSGRKITYITGLAQAALAGNIPDHATALAMEDEALITQLTA 65 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWR 264 PGIGRWT + + D+ DD +++ F + TP +R W P R Sbjct: 66 LPGIGRWTVEMMLMHTLRRADILPVDDLGVREGFRRLKGLSTAPTPRLLRDIGLAWSPHR 125 Query: 265 SYALLHIWY 273 S A ++W+ Sbjct: 126 SSAAWYLWH 134 >UniRef50_C8WCH4 HhH-GPD family protein n=3 Tax=Zymomonas mobilis RepID=C8WCH4_ZYMMN Length = 206 Score = 135 bits (341), Expect = 2e-30, Method: Composition-based stats. Identities = 36/165 (21%), Positives = 64/165 (38%), Gaps = 18/165 (10%) Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 R I+GQ + +A + ++ G+ T RL + D L+ G+ Sbjct: 45 SLARVIVGQQLHTKVADGIWQKLVCSIGD-----------ITADRLLSVDEAILRQCGLS 93 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 + L LA ++ G +P + A+ L + GIGRWTA + + D++ Sbjct: 94 PSKIAYLKDLAMRSVSGLDLFALPEGDDDAVDLLMSVHGIGRWTAENYLIFAEGRLDIWP 153 Query: 236 PDDYLIK-------QRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 D I+ Q + R ++P+RS L +W+ Sbjct: 154 AADLGIRIATGYLYQLSHRPDMKETRGLGAIFRPYRSIMALFLWH 198 >UniRef50_A9I9J6 DNA-3-methyladenine glycosidase II n=1 Tax=Bordetella petrii DSM 12804 RepID=A9I9J6_BORPD Length = 218 Score = 134 bits (339), Expect = 2e-30, Method: Composition-based stats. Identities = 55/207 (26%), Positives = 79/207 (38%), Gaps = 29/207 (14%) Query: 77 KMSRLFDLQCNPQIVNGALGRLG-AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLT 135 + RL L + V A G + ARPG F R + GQ+VSVA A + Sbjct: 18 HLRRLVKLDPRLRAVRDAAGAVPLRARPG--------GFAGLARIVCGQMVSVASADAIW 69 Query: 136 ARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP 195 R+ L TP A L+ +G+ + AL LA A G L Sbjct: 70 RRLEAL-----------PQATTPGGFLALGEHGLQGVGLSQGKFRALTQLARALSAGELD 118 Query: 196 MTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-GMTPAQ 252 + + A+ L GIG WTA + + D+F D +++ +TP Q Sbjct: 119 LPAIEAMPADAAIAELTRHKGIGPWTAEIYLMFCAGHPDIFPAGDIALQKAVGDALTPGQ 178 Query: 253 ------IRRYAERWKPWRSYALLHIWY 273 + AE W P+R+ A L W Sbjct: 179 YPDRKRLIGIAEAWAPYRASAALLFWR 205 >UniRef50_Q3B3Y2 HhH-GPD n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B3Y2_PELLD Length = 311 Score = 134 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 58/234 (24%), Positives = 84/234 (35%), Gaps = 38/234 (16%) Query: 78 MSRLFDLQCNPQIV--------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVA 129 R F L + + + + L GLR+ D +E V + Q + +A Sbjct: 73 FRRYFSLDVDTETLFSEPFRNAHPELALQLERYRGLRVLRQ-DPYETMVTFMCAQGIGMA 131 Query: 130 MAAKLTARVAQLYGERLDDF-----PEYICFPTPQRLAAADPQALKAL-GMPLKRAEALI 183 + + + +A+ YGE + FPTP RL AADP L+A L RA +I Sbjct: 132 LIRRQVSMLARRYGEHVPLSLNGCTINLYRFPTPSRLGAADPMELRACTNNNLMRARNII 191 Query: 184 HLANAALEGTLPMTIPGDV----EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + EG + E L GIG A+ AL G D F D Sbjct: 192 SASQKVTEGCIDFKALASKKNTQEDIQAALSRCGGIGLKIADCIALFGLGRFDAFPI-DT 250 Query: 240 LIKQRFP----------GMTPAQIRRYAERWKP--------WRSYALLHIWYTE 275 ++Q +T R AER + +R + L H W TE Sbjct: 251 HVRQFLGLWFGFPEASAPLTDKNYRILAERARELLGEKLAGYRGHHLFHCWRTE 304 >UniRef50_Q8DCC1 3-methyladenine DNA glycosylase n=6 Tax=Vibrio RepID=Q8DCC1_VIBVU Length = 208 Score = 134 bits (339), Expect = 3e-30, Method: Composition-based stats. Identities = 39/161 (24%), Positives = 62/161 (38%), Gaps = 14/161 (8%) Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 RA+ GQ +SV A + RV L E+ Q QAL+ G+ Sbjct: 47 LSRAVAGQQLSVNAAKTIWRRVESLSAEKGSLQ---------QTFVDEHYQALRDCGLSN 97 Query: 177 KRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 + + ++ + A +EG L D + +K L GIG WTA + + DV+ Sbjct: 98 AKVKTILGINQALMEGALDSAFLASNDPQTIVKQLTGLWGIGPWTAEMALMFFFGMPDVW 157 Query: 235 LPDDYLIKQRFPGMTPAQ---IRRYAERWKPWRSYALLHIW 272 D + + + + P+R+Y LHIW Sbjct: 158 SAGDAALMRGLSSLAEKEGVDAEAILSAATPYRTYLALHIW 198 >UniRef50_Q1D1V1 HhH-GPD domain protein n=15 Tax=cellular organisms RepID=Q1D1V1_MYXXD Length = 499 Score = 134 bits (338), Expect = 3e-30, Method: Composition-based stats. Identities = 67/284 (23%), Positives = 107/284 (37%), Gaps = 17/284 (5%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP----DIARHT 58 +L+ + P+ + L R + V+ Y R+L V + +V D + Sbjct: 184 SLDTRAPFHLEATVRVLQRRPTNLVDVWEGGRYLRALTVSDGFVLVEVSNQGTVDEPQVR 243 Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGAL------GRLGAARPGLRLPGCVD 112 + AE + R L +P+ ++ L G + A G+R P Sbjct: 244 FRVLDGDDSRGAHAEISRVLRRGLGLDVDPEPLDRLLQSERKLGPIVRALRGMRPPRFPS 303 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKA 171 FE I Q VS+ + R+ +G L + FPT LA A A+++ Sbjct: 304 LFETFANVIPFQQVSLDAGVAVVRRLVARFGRFLPHEGQVRYAFPTAAALAEARLDAIRS 363 Query: 172 LGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 G+ ++AEAL A A G + M +AM+ L GIG W+A LRG Sbjct: 364 CGLSARKAEALRAAAAAIQAGDVTEAMLSQMSSAEAMRMLTGLHGIGPWSAALVLLRGLG 423 Query: 230 AKDVFLPDDYLIKQRFPGMTPAQ----IRRYAERWKPWRSYALL 269 DVF D + + G+ + + R R+ R Y Sbjct: 424 RLDVFPEGDVGVIRGLSGLMHVEPGPALERLIRRFGEQRGYLYF 467 >UniRef50_Q4ZR24 DNA-3-methyladenine glycosylase II n=4 Tax=Pseudomonas syringae group RepID=Q4ZR24_PSEU2 Length = 221 Score = 133 bits (336), Expect = 5e-30, Method: Composition-based stats. Identities = 53/221 (23%), Positives = 90/221 (40%), Gaps = 28/221 (12%) Query: 64 SAGLEPVAAECLAKMSRL-FDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 S + AE +A + L Q +V L + AA+ D F+ V A+ Sbjct: 5 SDSTDREHAEAVAALRSLDPQWQALIDLVGPCLHPVSAAQ---------DPFQALVEAVA 55 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q + + R+ L+ E + FP+ L D QAL++ G + A+ Sbjct: 56 YQQLHARAGDAMVMRLRSLFPE--------VSFPSAPALVELDDQALRSCGFSAAKCRAI 107 Query: 183 IHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 +A A L+G +P + E ++ L PG+GRWT + G DV D+ Sbjct: 108 KAIAAARLDGLVPEVSAALAMGNEALVERLIQLPGVGRWTVEMMLIYGLGQLDVMPASDF 167 Query: 240 LIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHIWY 273 + + + + Q+ R AER+ P+R+ A ++W Sbjct: 168 GVCEGYRRLYALQLKPSHRQMARLAERFAPYRTIAAWYLWR 208 >UniRef50_Q4PHN3 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PHN3_USTMA Length = 447 Score = 133 bits (335), Expect = 6e-30, Method: Composition-based stats. Identities = 35/177 (19%), Positives = 63/177 (35%), Gaps = 21/177 (11%) Query: 90 IVNGALGRLGAARPGLRLPGCVDA----------FEQGVRAILGQLVSVAMAAKLTARVA 139 V+ RL P +D F +ILGQ +S A + + Sbjct: 129 AVDRRFERLFQKVPVRCYEEALDPSNSETKNLNLFRTVTTSILGQQISWLAARSVLYKFC 188 Query: 140 QLYG--------ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE 191 +L+ + + E FPTP + L+A G+ + + + LA ++ Sbjct: 189 RLFSPDSMPEKPDFVGFPREQWPFPTPLMVLRTPDAELRAAGLSFAKIKYVKDLAARFVD 248 Query: 192 GTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF 245 G L + D E + L G+GRWT+ + + D+ D +++ Sbjct: 249 GRLDIRQILELDDEEACVAELSKVKGVGRWTSEMILMFAMRKPDILPCADLGVQKGM 305 Score = 43.3 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 8/31 (25%), Positives = 18/31 (58%) Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 + ++P +++ A +W P+RS A + +W Sbjct: 413 KGGQYLSPDEMKALAAKWSPYRSVACMFMWA 443 >UniRef50_Q0AQT9 HhH-GPD family protein n=2 Tax=Hyphomonadaceae RepID=Q0AQT9_MARMM Length = 202 Score = 133 bits (335), Expect = 7e-30, Method: Composition-based stats. Identities = 51/209 (24%), Positives = 86/209 (41%), Gaps = 21/209 (10%) Query: 79 SRLFDLQCNP-QIVNGALGRLGAARPGLRLPGCVD-AF-EQGVRAILGQLVSVAMAAKLT 135 +L D + AL RLG L+LP D F + RAI GQ +SV A + Sbjct: 8 RQLLDCAAPLFPELEAALRRLGP----LQLPRRNDLPFPDYLCRAISGQQISVKAAQSIW 63 Query: 136 ARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP 195 RV G + ++ C P+ + + L+A G+ + +I +A L Sbjct: 64 GRVEASAGRQ--PLLDHFC---PE-----NTETLRACGLSGAKTRTIITIAETHRSVGLD 113 Query: 196 MTIPGDVEQA--MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPA-- 251 ++ A + L G+G+WTA+ + + DV+ D ++ +T Sbjct: 114 TEALKNMSIAGRTERLTEIWGVGQWTADMMNIFYFGEPDVWPDGDVAARKTLERLTSKRR 173 Query: 252 QIRRYAERWKPWRSYALLHIWYTEGWQPD 280 + R A +KP+RS+ ++W PD Sbjct: 174 KTVRTAAMFKPYRSWLAYYMWAHVDAPPD 202 >UniRef50_D1VDS6 HhH-GPD family protein n=3 Tax=Actinomycetales RepID=D1VDS6_9ACTO Length = 292 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 69/281 (24%), Positives = 113/281 (40%), Gaps = 22/281 (7%) Query: 7 QPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV-TAIPDIARHTLHINLSA 65 + P+ + FL +S + D + + AI T+H +L+ Sbjct: 9 RGPFSLAAGTRFLEGFTPASYDAAPDDVLRLAFPTDDGHTSAGAAIRQAPDGTVHADLTG 68 Query: 66 GLEPVAAECLAKMSRLFDLQCNPQI------VNGALGRLGAARPGLRLPGCVDAFEQGVR 119 G +P A A+++R+ L + + L A GLR +E Sbjct: 69 GADP--AAVRAQVARILSLDVDGAAFPDCVAADAVAAGLAARHLGLRPVCFPSPYEAACW 126 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKR 178 I+G + + AA+L A +A+ +GE + FPTP L D G+ + Sbjct: 127 TIIGHRIRLTQAARLKATIAREHGETITVAGQPTAAFPTPSTLRTVDD----LPGLSELK 182 Query: 179 AEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 L +A AAL+G L + E+A++ LQ PGIG ++A +RG DVF Sbjct: 183 MGRLRAVAQAALDGELDAATLRALPTEEALRHLQALPGIGPFSAELILIRGAGHPDVFPG 242 Query: 237 DDYLIKQRFPGMTP------AQIRRYAERWKPWRSYALLHI 271 + + Q Q+ R A+RW P+RS+ L + Sbjct: 243 HERRLHQAMAKAYHLDSPELGQLSRLAQRWAPFRSWVTLLL 283 >UniRef50_Q5CV50 DNA-3-methyladenine glycosidase (Fragment) n=2 Tax=Cryptosporidium RepID=Q5CV50_CRYPV Length = 217 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 34/173 (19%), Positives = 65/173 (37%), Gaps = 19/173 (10%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D F+ I+GQ VS + + + P+ ++ + ++ Sbjct: 41 DLFQGLFYIIIGQQVSQKAQVSIWNK-----------AKTTLKSIDPETISNYSLEEIRK 89 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 +G+ LK+A + +A + + + + D E+ + L GIG W+A + Sbjct: 90 VGVSLKKATFIKGIAEKIINKEIDLNLLHEKDDEEVCEELTKLNGIGVWSAEMAMIFCMN 149 Query: 230 AKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 K+VF D IK+ +T Y E + P+ S L++W Sbjct: 150 RKNVFSFSDIAIKRALKMIYGHKEITKEIFEHYRELFSPYCSIVSLYLWEISN 202 >UniRef50_B6R6H8 DNA-3-methyladenine glycosidase II protein n=3 Tax=Rhodobacteraceae RepID=B6R6H8_9RHOB Length = 213 Score = 131 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 46/196 (23%), Positives = 70/196 (35%), Gaps = 21/196 (10%) Query: 86 CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + ++ L R+ + L F ++ QL+SVA AA + AR+ L Sbjct: 15 ASLVQLDERLVRVVDIAGEVPLRRRSADFAGLCNIVVAQLLSVAAAASIWARLEALV--- 71 Query: 146 LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--E 203 + F P L + + L +G+ + L +A G L ++ D E Sbjct: 72 -------VPF-EPDVLLSKSDEELLGVGLSNAKLRTLKAIAEELKAG-LCLSEAVDWPGE 122 Query: 204 QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRY 256 A K L GIG W+A+ F L DVF D ++ + Sbjct: 123 VAHKRLCEIKGIGPWSADIFLLFCAGHPDVFPVGDVALQAAVQHAFDLEERPKGKVLAEI 182 Query: 257 AERWKPWRSYALLHIW 272 AE W P R A W Sbjct: 183 AEAWSPHRGTAARLFW 198 >UniRef50_Q972N8 Putative uncharacterized protein ST1094 n=1 Tax=Sulfolobus tokodaii RepID=Q972N8_SULTO Length = 282 Score = 131 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 45/244 (18%), Positives = 91/244 (37%), Gaps = 20/244 (8%) Query: 28 ETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCN 87 E + D Y R + V I I L + + + + + + R+ L + Sbjct: 32 EILPDKSYVRVFN-RDVIVKVKQIGGIYNPKLRVEIFSSKDDDEEGIIKTLRRVLGLDVD 90 Query: 88 P------QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQL 141 + N L G + E I Q +S+ + L ++A+ Sbjct: 91 LTNFYNRALDNPYFSSLARKFIGAKPVVFPTLLETVTNVISCQQISLNVCLTLVNKLAEK 150 Query: 142 YGERL-DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG 200 +G ++ D + FPT + L + P+ L++LG ++ E +++ NA + Sbjct: 151 FGGKVIVDDKQINVFPTVEDLINSKPEELRSLGYSSRKVEYILNAVNALKD--------- 201 Query: 201 DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-MTPAQIRRYAER 259 + + ++++ G+G+W+ NY LRG DV D + + + I Sbjct: 202 --KDSFESIKNLKGLGKWSINYILLRGLGRIDVIPTGDVGFRNKAKRFLGIDNIEEILSS 259 Query: 260 WKPW 263 K + Sbjct: 260 VKEY 263 >UniRef50_B5IDT4 Base excision DNA repair protein, HhH-GPD family n=3 Tax=Aciduliprofundum boonei T469 RepID=B5IDT4_9EURY Length = 289 Score = 131 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 53/274 (19%), Positives = 113/274 (41%), Gaps = 20/274 (7%) Query: 13 SWMLGF-LAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE-PV 70 + F L+ R + + V ++ + R + + + + +I + I++++ E Sbjct: 16 PHLHRFSLSDRPLPCI--VENNLFWRLIPIEDI--FIPVKAEIHEDIVKIDVASECENDY 71 Query: 71 AAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQ 124 E L+ + L + + L ++ GLR ++ +E ++ +L Q Sbjct: 72 CKEVLSTVRHLLAVDISYSNFLKTLEDFPRLYKMAITYSGLRPARNLNLYEALIKIVLQQ 131 Query: 125 LVSVAMAAKLTARVAQLYG-ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 +S+ A TA++ + +G + Y FP P++L +KALG +A++L+ Sbjct: 132 RISLKYALNTTAKLIEKWGIREKWNGYSYYSFPPPEKLMRISTSEIKALGTTTVKAKSLL 191 Query: 184 HLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 +A G LP + + E+ +K L G+G WTA + D + Sbjct: 192 EIAKMEYNGDLPSIYEVNKNPEEYVKFLTGIYGVGMWTAELSVATVIHDYSIAPAGDLNV 251 Query: 242 KQRFPG----MTPAQIRRYAERWKPWRSYALLHI 271 ++ F +IR Y E++ W+ +++ Sbjct: 252 RKAFSKFLGLQGEKEIREYTEKFGKWKGLI-MYL 284 >UniRef50_A4ABI1 DNA-3-methyladenine glycosidase II n=2 Tax=unclassified Gammaproteobacteria RepID=A4ABI1_9GAMM Length = 204 Score = 131 bits (330), Expect = 3e-29, Method: Composition-based stats. Identities = 45/193 (23%), Positives = 73/193 (37%), Gaps = 20/193 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L R+ A +L F V I Q VS+A A +VA L E Sbjct: 20 DRDLARVVARHGPPKLLSRPPGFPTLVYIIFEQQVSLASAKSTYDKVAALLPEF------ 73 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTL 209 T + +AL+A G+ ++A +A A + G LP+ G E+ L Sbjct: 74 -----TAEAYLQLSDEALRAAGVSRQKARYTRLVAEATIAGDLPIHALGRKPDEEVRTLL 128 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ-------IRRYAERWKP 262 GIG WTA+ + + + D++ D + + + + ER++P Sbjct: 129 TAITGIGNWTADVYLMLALRRPDLWPVGDLALVKAATAIKSRPHKPDKLWLENLGERYRP 188 Query: 263 WRSYALLHIWYTE 275 +RS A W Sbjct: 189 YRSVATGIFWRHY 201 >UniRef50_Q7UGU9 DNA-3-methyladenine glycosidase n=1 Tax=Rhodopirellula baltica RepID=Q7UGU9_RHOBA Length = 207 Score = 131 bits (329), Expect = 4e-29, Method: Composition-based stats. Identities = 47/198 (23%), Positives = 77/198 (38%), Gaps = 23/198 (11%) Query: 84 LQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 + +P + + RLG R G FE R +L Q VS+ A ++ QL Sbjct: 8 AETDPSL-HLVWRRLGDPPSWRRPAG----FETLARIVLEQQVSLRSAESTLHKLQQLLE 62 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT--LPMTIPGD 201 L TP+ + Q +A G+ ++ L LA ++G L Sbjct: 63 GPL----------TPRGIVRLSAQQTRACGVSRQKHRYLNQLAADIVDGRFVLDRLPGMS 112 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ------IRR 255 ++A L GIGRW+A + + D+ D + + + Q I + Sbjct: 113 DQEARDQLTARLGIGRWSAEVYLMSALNRPDILPFGDLGLLKGVEELDGGQYDDFDAIIQ 172 Query: 256 YAERWKPWRSYALLHIWY 273 A+RW+P+RS A +W Sbjct: 173 RADRWRPYRSMATRLVWA 190 >UniRef50_D1VAP6 HhH-GPD family protein n=1 Tax=Frankia sp. EuI1c RepID=D1VAP6_9ACTO Length = 310 Score = 130 bits (327), Expect = 5e-29, Method: Composition-based stats. Identities = 59/272 (21%), Positives = 97/272 (35%), Gaps = 14/272 (5%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPD--IARHTLHINLSAG 66 P D L ++ R++ G R + P + R L++ G Sbjct: 16 PVDIVGSLAAYGRHGDDLIDRWDGRVLIRTVPHGAGRAALATRPAGPVERPRLYVTGPPG 75 Query: 67 LEPVAAECLAKMSRLFDLQC--NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQ 124 L+ LA+ L D + + + + A R L D R + Q Sbjct: 76 LDTATLAGLARAQFLRDQPALDDLTARDPVIAAIPAHRRRLAQLTQTDVLHVLARCVTAQ 135 Query: 125 LVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIH 184 V+ AA L +R+ L G + P LA DP L LG+ ++A AL+ Sbjct: 136 QVTGRFAATLRSRLVGLVGRPVTAGPHTAYALDADLLADTDPTRLTELGLSGRKAMALLG 195 Query: 185 LANAALEGTLPMTIPG-DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ 243 +A A ++ D E + TL PGIGRW+A +F +R + D +++ Sbjct: 196 VARAVTGSLSLSSLHELDDEGVIATLTALPGIGRWSAEWFLIRALGRP-LVAAGDLAVRK 254 Query: 244 RFPGMT--------PAQIRRYAERWKPWRSYA 267 + ++R W P + A Sbjct: 255 AVGHLYRPGLPPPAEEEVRLLTAHWGPAAALA 286 >UniRef50_Q55703 Slr0231 protein n=1 Tax=Synechocystis sp. PCC 6803 RepID=Q55703_SYNY3 Length = 152 Score = 129 bits (326), Expect = 7e-29, Method: Composition-based stats. Identities = 35/130 (26%), Positives = 54/130 (41%), Gaps = 9/130 (6%) Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAALEG--TLPMTIPGDVEQAMKTLQTFPG 214 T Q LA DP+ L+ LG+ + L A A +LP ++ L G Sbjct: 16 TAQTLANVDPELLRELGISRYKTRYLKTWAIALQNNFPSLPELETWGDRAIVEQLTAIKG 75 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYA 267 IG WTA F L + +D+ D I+ + P Q+ Y + W+P+RS A Sbjct: 76 IGPWTAQLFLLFRLRRQDILPNQDLGIRIAIQKLYQLPDRPNPKQVSEYGKNWQPYRSLA 135 Query: 268 LLHIWYTEGW 277 ++W + Sbjct: 136 SWYLWRSLSA 145 >UniRef50_B6BS24 DNA-3-methyladenine glycosylase I n=4 Tax=SAR11 cluster RepID=B6BS24_9RICK Length = 211 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 67/177 (37%), Gaps = 19/177 (10%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 D F ++I+GQ +SVA A + L + + + LK+ Sbjct: 42 DIFYSLCKSIIGQQISVAAANSVF----------LKFKKKCKNKINAKTVYKLTVTQLKS 91 Query: 172 LGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 G+ ++A+ + LA L T + E+A+ L IGRW+A L + Sbjct: 92 CGLSRQKAKGIKSLAKQTLNKTFDSKLIPKMSDEEAIIYLSKLRQIGRWSAEMILLFTYN 151 Query: 230 AKDVFLPDDYLIKQRFPGMTPAQ-------IRRYAERWKPWRSYALLHIWYTEGWQP 279 +++ D + + + + +R+ P+ S A ++W + +P Sbjct: 152 RSNIWPIQDIGLLRAISKNYKKEYLPPEKYVNLLYKRFSPYCSVATWYLWRSIDPEP 208 >UniRef50_O28163 3-methyladenine DNA glycosylase (AlkA) n=1 Tax=Archaeoglobus fulgidus RepID=O28163_ARCFU Length = 295 Score = 128 bits (323), Expect = 2e-28, Method: Composition-based stats. Identities = 55/266 (20%), Positives = 106/266 (39%), Gaps = 27/266 (10%) Query: 11 DWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIAR-----HTLHINLSA 65 +W + F + + + V + R++ + V A P+ R Sbjct: 11 NWELKMKFFVLPELPTPDVVESGVWRRAIVLDGRAVAVMAYPESERTIVVEGNFENREWE 70 Query: 66 GLEPVAAECL-----AKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 + E L ++ R D +++ G R GL + FE +A Sbjct: 71 AVRRKLVEYLGLQNPEELYRFMDGDEKLRMLKNRFY--GFGRAGLM---SMSVFEGIAKA 125 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 I+ Q +S +A KL A++ +G+ ++ + ++ FPT + + A + L+ G+ ++A Sbjct: 126 IIQQQISFVVAEKLAAKIVGRFGDEVEWNGLKFYGFPTQEAILKAGVEGLRECGLSRRKA 185 Query: 180 EALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 E ++ +A L E+A + L +F GIGRWTA K+VF DD Sbjct: 186 ELIVEIAKEE---NLEELKEWGEEEAYEYLTSFKGIGRWTAELVLSMALG-KNVFPADDL 241 Query: 240 LIKQRFPGM-------TPAQIRRYAE 258 +++ + + ++R A Sbjct: 242 GVRRAVSRLYFNGEIQSAEKVREIAR 267 >UniRef50_Q1QHT1 HhH-GPD n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QHT1_NITHX Length = 217 Score = 128 bits (322), Expect = 2e-28, Method: Composition-based stats. Identities = 49/224 (21%), Positives = 80/224 (35%), Gaps = 35/224 (15%) Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 T+H+ + L+ + + D + P + L L +PG FE Sbjct: 2 TVHLKTQSDLDDAVNALVKQ-----DPRLRPIVDRTGLPALRQRQPG---------FEGL 47 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 + GQ +S A A + R++ + D + A L LG+ Sbjct: 48 AAIVCGQQLSTASAGAIWGRLSAAFAPFHHD-----------AIGKARADRLGRLGLSAA 96 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMK--TLQTFPGIGRWTANYFALRGWQAKDVFL 235 + L LA G L + + + + TL GIG WTA+ + L D + Sbjct: 97 KITTLKLLAREIAAGRLNLDVLAEEDADAAHATLVRHRGIGPWTADVYLLFCLGHGDAWP 156 Query: 236 PDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIW 272 D +++ TP Q+ AE W+P R A H+W Sbjct: 157 AGDVALQEAIKVGLGLTTRPTPKQMVPLAEPWRPLRG-AAAHLW 199 >UniRef50_B4S806 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S806_PROA2 Length = 312 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 49/215 (22%), Positives = 79/215 (36%), Gaps = 30/215 (13%) Query: 89 QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD 148 + V + +L G+R+ ++AFE + + Q + + + K + +GER Sbjct: 93 RRVYPVVSQLAEPYMGVRVLR-LNAFETLITFMCAQAIGMNLIRKQIRTICNRFGERHMT 151 Query: 149 FPEY-----ICFPTPQRLAAADPQALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGDV 202 + FP+P+ LAAA PQ L+ +RA +I A A EG L M + Sbjct: 152 EIDGNPLIQYSFPSPETLAAASPQDLRICTNNNCERASNIISAARAVAEGRLCMDELINN 211 Query: 203 E----QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG----------M 248 E +L + GIG A+ L G D F D ++Q + Sbjct: 212 ELSLGSIRNSLTAYRGIGLKIADCVMLFGLHRHDAFPI-DTHVRQYLGKWFGLEKTQKAL 270 Query: 249 TPAQIRRYAERWKP--------WRSYALLHIWYTE 275 TP + + + L H W E Sbjct: 271 TPKTYIELQHQASEILNPENAGYAGHILFHCWRNE 305 >UniRef50_Q0G7C6 Putative dna-3-methyladenine glycosidase ii protein n=1 Tax=Fulvimarina pelagi HTCC2506 RepID=Q0G7C6_9RHIZ Length = 219 Score = 128 bits (321), Expect = 3e-28, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 68/195 (34%), Gaps = 20/195 (10%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 ++ L + + + + ++ Q VS A A + R+ E Sbjct: 22 LAALDPRLLPVIETAGSIPIRRQAEGLRGLCAIVVAQQVSKASADAIFRRL-----EAEA 76 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV--EQA 205 D ++ + A D AL+ G+ + + LA A EGTL A Sbjct: 77 DLDDH------DAILALDEGALRRAGLSRPKQATIRELARARAEGTLDFARLASASGRDA 130 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAE 258 + L GIG WTA + L +DVF D +++ P ++ AE Sbjct: 131 IADLVKLRGIGVWTAECYLLFAVGHRDVFPVGDLALQEAVRMVFGLEDRPKPEELSAIAE 190 Query: 259 RWKPWRSYALLHIWY 273 W P RS A W Sbjct: 191 SWAPHRSTAARLFWA 205 >UniRef50_C4DGP2 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DGP2_9ACTO Length = 300 Score = 127 bits (320), Expect = 4e-28, Method: Composition-based stats. Identities = 62/279 (22%), Positives = 112/279 (40%), Gaps = 22/279 (7%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV-TAIPDIARHTLHINLSAGL 67 P++ + FL A ++ E D + + + V A+ T+ + ++ Sbjct: 9 PFNLATSTRFLEGFAPAAYEGAGDEVLRLAFPADDGKAVAGAALRQETDGTVRVEITGAA 68 Query: 68 EPVAAECLAKMSRLFDLQCN----PQIV--NGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 + A A++ R+ L + P +V + + L PGLR +E A+ Sbjct: 69 D--AEAVGAQVRRIMSLDIDGTGYPAVVASDPIVKGLSEQYPGLRPVCFHSPYEAAAWAV 126 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 +G + + AA + A +A GE + FPTP+ LA G+ + Sbjct: 127 IGHRIRITQAAGIKAAMAARLGETVTVAGRPVAAFPTPEVLAEVGE----FPGLTDVKIA 182 Query: 181 ALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 L +A AAL G L D+ A++ LQ GIG ++A +RG DVF + Sbjct: 183 RLRGIAEAALAGELDAKRLRDMASADALEQLQGIAGIGPFSAELILIRGAGHPDVFPRTE 242 Query: 239 YLIKQRFPGM------TPAQIRRYAERWKPWRSYALLHI 271 + + + + A++ A W P+RS+ + + Sbjct: 243 TRLHRTMTQLYRREEPSAAELADIAADWAPFRSWVGVLL 281 >UniRef50_Q11DX8 HhH-GPD n=3 Tax=Rhizobiales RepID=Q11DX8_MESSB Length = 214 Score = 127 bits (319), Expect = 5e-28, Method: Composition-based stats. Identities = 53/210 (25%), Positives = 77/210 (36%), Gaps = 23/210 (10%) Query: 73 ECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAA 132 E A + R D +++ LG + A + L F I+ Q VS A A Sbjct: 5 ETAADIRRGLDA---LVLLDRRLGPVRAVAGAVPLRRGPPGFHSLASVIVSQQVSRASAD 61 Query: 133 KLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG 192 + AR++QL + TP AA + G+ + L L+ A + Sbjct: 62 AIFARLSQL-----------VKPLTPDGFLAAGESVMVKAGLSRAKQRTLTALSAALRDK 110 Query: 193 TLPMTIPGD--VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-- 248 L + G+ AM+ L PGIG WTA + L D+F D ++ Sbjct: 111 ALDLDDLGNLPPADAMEALTVIPGIGPWTAQVYLLVAAGHPDIFPAGDIALQAAVGHALA 170 Query: 249 -----TPAQIRRYAERWKPWRSYALLHIWY 273 Q+ AE W PWRS A W Sbjct: 171 LDARPKANQLALIAEPWSPWRSVAARLFWA 200 >UniRef50_B6BWA6 DNA-3-methyladenine glycosylase 1 n=1 Tax=beta proteobacterium KB13 RepID=B6BWA6_9PROT Length = 201 Score = 126 bits (318), Expect = 7e-28, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 73/171 (42%), Gaps = 20/171 (11%) Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 + ++++ Q + A+ + R L+ FPT Q + + L ++G+ Sbjct: 41 QSIIKSVFFQQLHPKAASTIYQRFLDLFNN---------VFPTEQDIIK-NKDLLSSIGL 90 Query: 175 PLKRAEALIHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 ++A+ ++ +A+ L+ +P + + ++ ++ G+G WT + Sbjct: 91 SNQKAQTILSIADGYLKQFIPNEKKIMLLNGQEIIEQFTQIKGVGVWTVQMMLIFNQGQP 150 Query: 232 DVFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D+ D I++++ +TP Q+ + E P+R+ A ++W + Sbjct: 151 DIMPSSDLAIRKKYSFFKQRDCLITPTQLIKETEYLSPYRTIAAWYLWQIK 201 >UniRef50_B6JDH5 DNA-3-methyladenine glycosidase II n=14 Tax=Rhizobiales RepID=B6JDH5_OLICO Length = 290 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 46/194 (23%), Positives = 72/194 (37%), Gaps = 25/194 (12%) Query: 90 IVNGALGRL--GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + + AL + A P LR F I GQ +S AA + RV++ + Sbjct: 26 VADPALAAIAEIAGPPALR--RRTPGFAGLAAIICGQQLSTHAAAAIWKRVSEAFDPFHH 83 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMK 207 +R+ A L LG+ + + L LA L + + G E + Sbjct: 84 -----------ERVRLARADRLARLGLSAAKIKTLKALAREIAAERLDLDVLGQQEADLA 132 Query: 208 --TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAE 258 L GIG WTA+ + L D + D +++ + + AE Sbjct: 133 HTALTALHGIGPWTADIYLLFCLGHGDAWPNGDLAVQEAMRIGLDLEARPSAREAATIAE 192 Query: 259 RWKPWRSYALLHIW 272 RW+PWR A H+W Sbjct: 193 RWRPWRG-AAAHLW 205 >UniRef50_B3QN63 8-oxoguanine DNA glycosylase domain protein n=2 Tax=Chlorobaculum RepID=B3QN63_CHLP8 Length = 317 Score = 125 bits (315), Expect = 1e-27, Method: Composition-based stats. Identities = 50/218 (22%), Positives = 83/218 (38%), Gaps = 30/218 (13%) Query: 86 CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + Q L R+ +R+ + FE V + Q + + + + + +A+ YG++ Sbjct: 95 SDFQRDYPELWRMVKPYRSVRVMRQ-EPFEIMVTFMCAQGIGMHLIRRQVSMIAERYGQK 153 Query: 146 L-----DDFPEYICFPTPQRLAAADPQALKALGMPLK-RAEALIHLANAALEGTLPMTI- 198 + + + FPTP LA+ADP L + RA +I +A + G L + Sbjct: 154 IVLETPEGEMVFYGFPTPSALASADPSELALCTNNNRIRAANIIAMARSFESGKLALACV 213 Query: 199 ---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF---------- 245 D+E +TL GIG A+ AL G D F D +KQ Sbjct: 214 GSGECDLETLRETLCVHSGIGLKIADCIALFGLGRFDAFPI-DTHVKQYLWEWFGIEEAR 272 Query: 246 PGMTPAQIRRYAERWK--------PWRSYALLHIWYTE 275 +T R E+ + + + L H W E Sbjct: 273 RSLTEKNYRILQEKARAILGTECAGYAGHILFHCWRKE 310 >UniRef50_C0E8I7 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C0E8I7_9CLOT Length = 281 Score = 125 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 44/200 (22%), Positives = 81/200 (40%), Gaps = 17/200 (8%) Query: 82 FDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLT 135 FD + Q + + L + G+R+ D +E I+ Q ++ + Sbjct: 75 FDFDTDYQAIKQGFLEDEVLKKSCDYAGGIRILRQ-DPWETLCSFIISQNNNIPRIKGII 133 Query: 136 ARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP 195 R+ +L GE++ P FPTP+ LAA L + RA L+ A+ G + Sbjct: 134 DRLCKLCGEQV---PGGYAFPTPEALAAKSLDDLSIMR-AGFRARYLLDAAHKVSTGKID 189 Query: 196 MTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQI 253 + ++++A KTL + G+G A L G+ + F D IK+ Sbjct: 190 LPSLYTMEIDEARKTLTSICGVGPKVAECVLLFGFHRLEAFPV-DVWIKRAITYFYQDGF 248 Query: 254 RRYAERWKPWRSYALLHIWY 273 +A +P+ A ++++ Sbjct: 249 PEFA---RPYGGIAQQYLFH 265 >UniRef50_A4SEG0 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chlorobium phaeovibrioides DSM 265 RepID=A4SEG0_PROVI Length = 313 Score = 124 bits (313), Expect = 3e-27, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 65/199 (32%), Gaps = 30/199 (15%) Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-----YICFPTP 158 GLR+ D +E V + Q + +++ + + + +G + FP P Sbjct: 109 GLRVLRQ-DPYETLVTFMCAQGIGMSIIRRQVNMLCRYFGNEVRVADGGRDTPLYSFPAP 167 Query: 159 QRLAAADPQALK-ALGMPLKRAEALIHLANAALEGTLPMTIPGDVE----QAMKTLQTFP 213 LA ADP L+ RA + + G L + D + L Sbjct: 168 SVLADADPALLRRCCNNNSMRAGNIGEASRLLALGRLDLQALSDPSLPLSEIRTELTALK 227 Query: 214 GIGRWTANYFALRGWQAKDVFLPDDYLI------------------KQRFPGMTPAQIRR 255 GIG A+ AL G D F D + ++R+ + + Sbjct: 228 GIGFKIADCIALFGLGRFDAFPI-DTHVEQFLSSWFSIGAHQKGLSQKRYLHLQEKAVEL 286 Query: 256 YAERWKPWRSYALLHIWYT 274 E + + L H W T Sbjct: 287 LGESLSGYAGHHLFHCWRT 305 >UniRef50_B9JTI1 DNA-3-methyladenine glycosidase II n=1 Tax=Agrobacterium vitis S4 RepID=B9JTI1_AGRVS Length = 217 Score = 124 bits (311), Expect = 5e-27, Method: Composition-based stats. Identities = 49/214 (22%), Positives = 74/214 (34%), Gaps = 28/214 (13%) Query: 69 PVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 A+ + L D+ + G L L PG FE I+ Q+VS Sbjct: 8 RNQADLATGLEALLDIDPRLHAIAREAGPLPLR---LTEPG----FEGLAFIIVSQMVSR 60 Query: 129 AMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA 188 A A + R+ + G P A AL +LG+ + L LA+A Sbjct: 61 ASADAIWRRLCEAMGTA-----------EPAAYLALGDAAL-SLGLSRAKHACLSGLASA 108 Query: 189 ALEGTLPMTIPGD--VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP 246 G + + D VE + L GIG WT+ + + DVF D ++ Sbjct: 109 VQAGDIDLFAILDQPVETGIAELTRLRGIGLWTSEVYLMFCGGHADVFPAGDVALRAAVG 168 Query: 247 G-------MTPAQIRRYAERWKPWRSYALLHIWY 273 + R A W+P+R+ A W Sbjct: 169 DGLCLATRPDIRETTRIALTWRPYRAIAARLFWA 202 >UniRef50_B3ECK0 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chlorobium limicola DSM 245 RepID=B3ECK0_CHLL2 Length = 312 Score = 123 bits (310), Expect = 6e-27, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 77/208 (37%), Gaps = 28/208 (13%) Query: 93 GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD----- 147 A+ L GL+L D FE + + Q + +A+ + + + YG Sbjct: 97 PAIATLLQEYMGLKLLRQ-DPFETTITFMCAQGIGMALIRRQIGMLCEKYGTPCTIELMG 155 Query: 148 DFPEYICFPTPQRLAAADPQALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGD----V 202 FP P+ LA +L+A +RA + +A AA EGTL TI G + Sbjct: 156 QKHRIFRFPKPEMLAETSVLSLQACTNNNYRRALNIRRVAAAAAEGTLDFTISGSQSLSL 215 Query: 203 EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD---------YLIKQRFPGMTPAQI 253 ++ L + GIG A+ AL D F D + I++ +T Sbjct: 216 DRIRAMLCEYDGIGPKIADCIALFSLGRFDAFPVDTHVRQYLAEWFGIRRASMSLTEKNY 275 Query: 254 RRYAERW----KP----WRSYALLHIWY 273 R + +P + + L H W Sbjct: 276 LRLQDEVRTILRPEVAGYAGHLLFHCWR 303 >UniRef50_B5Y412 Predicted protein n=1 Tax=Phaeodactylum tricornutum CCAP 1055/1 RepID=B5Y412_PHATR Length = 394 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 44/201 (21%), Positives = 71/201 (35%), Gaps = 33/201 (16%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 FE R I GQ VS A + R+ + L PQ + K +G Sbjct: 154 FESLCRIIAGQFVSGKSAQAVWKRLLEHARHDLTPTRILQLVSQPQG-EDIEFGLQKPVG 212 Query: 174 MPLKRAEALIHLANAALEGTLP-------MTIPGDVEQAM----KTLQTFPGIGRWTANY 222 + +A++++ LA +G L + GD + + K L GIG W+ + Sbjct: 213 LTKNKAKSIVDLARHFEDGRLSEGFLTSSTSPSGDTDSTITSIQKALLKVQGIGPWSVDM 272 Query: 223 FALRGWQAKDVFLPDDYLIKQRF-------------------PGMTPAQIRRYAERWKPW 263 F L + +V D +++ P QIRR E + P+ Sbjct: 273 FLLFYLEQPNVLPLGDLGVRKGIAIHFAMRGSVKKGKQAQLCPKQDAPQIRRRLEAYAPY 332 Query: 264 RSYALLHIWYTEGWQ--PDEA 282 +S ++W PD A Sbjct: 333 QSLLTYYMWRAADTPSAPDSA 353 >UniRef50_A3TMT2 Base-excision DNA repair protein n=1 Tax=Janibacter sp. HTCC2649 RepID=A3TMT2_9MICO Length = 213 Score = 122 bits (307), Expect = 1e-26, Method: Composition-based stats. Identities = 50/204 (24%), Positives = 73/204 (35%), Gaps = 24/204 (11%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARV 138 + ++ + L G R PG V IL Q VS+A AA ARV Sbjct: 8 RHVAEVAAGHPHLAAELDDHGVPERWSRPPGFPS----LVLLILEQQVSLASAAAAYARV 63 Query: 139 AQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI 198 G TP L A + L+ G+ ++ L L+ A G L + Sbjct: 64 RARTGAM-----------TPAALLATTSEQLREDGVSRQKDRYLRALSAAVGSGDLDLAG 112 Query: 199 PGD--VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------T 249 E A+K L GIG WTA + L DV+ D ++ Sbjct: 113 LATLPDEVAIKRLTQLSGIGPWTAQAYLLACLDRPDVWPVGDRALQVAVAERLGLAHVPN 172 Query: 250 PAQIRRYAERWKPWRSYALLHIWY 273 ++ E+W+P RS A +W+ Sbjct: 173 GRELEVLGEQWRPHRSTAARLLWH 196 >UniRef50_Q3ARU6 HhH-GPD n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ARU6_CHLCH Length = 312 Score = 121 bits (305), Expect = 2e-26, Method: Composition-based stats. Identities = 42/191 (21%), Positives = 70/191 (36%), Gaps = 29/191 (15%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-----YICFPTPQRLAAADPQ 167 FE + + Q + + + + R+ + YGE + E + FP P++LA + + Sbjct: 116 PFETLISFMCAQGIGMRLIRQQINRLCERYGEFYEAEMEGEMLCFSGFPAPEQLACLNAE 175 Query: 168 ALKALGMPLK-RAEALIHLANAALEGTLPMTIP----GDVEQAMKTLQTFPGIGRWTANY 222 L + RA +I +A +EG L ++ E+ L GIG A+ Sbjct: 176 ELSYCTNNNRERAANIIAVARKVVEGRLDLSSLSYPNMAFEEVQARLTQERGIGLKIADC 235 Query: 223 FALRGWQAKDVFLPDDYLIKQ----------RFPGMTPAQIRRYAERWKP-----WRSYA 267 AL G + F D + Q +TPA R+ + + YA Sbjct: 236 VALFGLGYFEAFPI-DTHVHQFMAQWFKVPAASRSLTPATYRQLTLEAREILGSHYTGYA 294 Query: 268 L---LHIWYTE 275 H W E Sbjct: 295 AHLLFHCWRCE 305 >UniRef50_C5E2U4 KLTH0H07766p n=2 Tax=Saccharomycetaceae RepID=C5E2U4_LACTC Length = 274 Score = 121 bits (304), Expect = 3e-26, Method: Composition-based stats. Identities = 46/240 (19%), Positives = 76/240 (31%), Gaps = 55/240 (22%) Query: 86 CNPQIVNGALGRLGAARPGLRLPGCVDA---FEQGVRAILGQLVSVAMAAKLTARVAQLY 142 +P + + L + P +D F + +IL Q +S A A + ++ Y Sbjct: 33 VDPSLYDALLSKDFQLFLKREQPP-MDLNHHFIKLASSILAQQISGAAANSVKNKLMAYY 91 Query: 143 GERLDDFPEYICFPTPQRLAAA----DPQALKALGMPLKRAEALIHLANAALEGT--LPM 196 G FPT Q + D Q L+ G+ ++ + L LAN + + Sbjct: 92 GGH---------FPTYQEVVETFHKKDVQELRDCGLSARKVQYLESLANYFNDNEKAIEA 142 Query: 197 TIPGDVEQAMKTLQT-FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------- 247 D E+ L GIG W+A F + + DVF DD I + Sbjct: 143 LFEQDNEEIASQLVANIKGIGPWSAKMFLVTSLERMDVFAADDLGIARGCSKYLDCRPEI 202 Query: 248 ---------------------------MTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 + + E + P+RS + +W D Sbjct: 203 LKELMSKRVQGKTKRSKIVHKKKNWKIYDEDIVEKCGEMFAPYRSIFMFILWRLSSTNID 262 >UniRef50_D0JW87 Glycosidase n=1 Tax=Yersinia pestis D182038 RepID=D0JW87_YERP1 Length = 243 Score = 120 bits (302), Expect = 4e-26, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 66/164 (40%), Gaps = 13/164 (7%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + + + L P D F +R I+ Q +SV A + R+ L G Sbjct: 12 HLKRRDKKMAAAIERLGMLERPLSPDLFAALIRNIVDQQISVKAAQTVNTRLTLLLGS-- 69 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQ 204 TP +AAA +A++ GM +++A + A+AA+ G+L ++ + Sbjct: 70 ---------ITPATVAAASAEAIQRCGMTMRKAGYIKGAADAAINGSLDLSVIAQLPDNE 120 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 + L G+G WTA + D+ D I++ + Sbjct: 121 VITQLSRLDGVGVWTAEMLLISSLSRPDIVSWGDLAIRRGMMNL 164 >UniRef50_UPI0001B54083 YfjP n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B54083 Length = 308 Score = 120 bits (301), Expect = 6e-26, Method: Composition-based stats. Identities = 54/232 (23%), Positives = 96/232 (41%), Gaps = 17/232 (7%) Query: 50 AIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARP 103 AI + T+ + ++A +E +AA +A++SR+ L + + + + RL A Sbjct: 63 AIRQRSPGTVEVGVAAPVE-IAARVVAQVSRILSLDVDESGLAEVANRDSVVRRLHARHS 121 Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAA 163 G R +E A+L Q + VA A +L ++A +G ++ + FP+PQ +A Sbjct: 122 GARPVLHSSPYEAACWAVLTQGMRVAQARRLREQLAVRHGRQVGE-DGPFSFPSPQIVAE 180 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYF 223 + + + L + +P + A+ L PGIG + A Sbjct: 181 LGDEPSVGPFRLARLRAVARAAVDGRLNADELLALP--IPDALNRLCAIPGIGAFAAEQI 238 Query: 224 ALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYAL 268 L G D+F D + Q P P ++ AE W+P+RS+ Sbjct: 239 LLHGAGHPDLFPRLDTQLHQVLCAEYALPPDTPPDELEPLAEDWRPYRSWVA 290 >UniRef50_Q28QY1 HhH-GPD n=83 Tax=Bacteria RepID=Q28QY1_JANSC Length = 225 Score = 120 bits (301), Expect = 6e-26, Method: Composition-based stats. Identities = 46/194 (23%), Positives = 70/194 (36%), Gaps = 24/194 (12%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + + P R P D F + AI+GQ VS A AA + RV + Sbjct: 37 RLRKIEPKFAVISGPLPLRRRP---DGFGALLHAIVGQQVSTASAAAIWGRVQSAGLHQE 93 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAM 206 +AL+ G+ + L+ A ++ P D + Sbjct: 94 TAVAAATE------------EALRGCGLSRPKVRYAKALSEARIDYAALRDAPLD--DVL 139 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAER 259 TL PGIGRWTA +A DVF D +++ + ++R AE Sbjct: 140 ATLIAVPGIGRWTAEIYAKFALGHADVFAAGDLALQEGARLLFDLPERPAEKEMRAMAEA 199 Query: 260 WKPWRSYALLHIWY 273 W P R+ A +W Sbjct: 200 WTPVRAIAARALWA 213 >UniRef50_P22134 DNA-3-methyladenine glycosylase n=7 Tax=Saccharomycetaceae RepID=MAG_YEAST Length = 296 Score = 120 bits (301), Expect = 6e-26, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 73/208 (35%), Gaps = 44/208 (21%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC-FPTPQRLAAADPQALK 170 D F + IL Q +S A + ARV LYG D+ F P + A + Sbjct: 82 DYFIRLASTILSQQISGQAAESIKARVVSLYGGAFPDYKILFEDFKDPAKCA-----EIA 136 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIP----GDVEQAMKTL-QTFPGIGRWTANYFAL 225 G+ ++ L LA E + + E+ +++L GIG W+A F + Sbjct: 137 KCGLSKRKMIYLESLAVYFTEKYKDIEKLFGQKDNDEEVIESLVTNVKGIGPWSAKMFLI 196 Query: 226 RGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRY---------------------- 256 G + DVF P+D I + F P + +R Sbjct: 197 SGLKRMDVFAPEDLGIARGFSKYLSDKPELEKELMRERKVVKKSKIKHKKYNWKIYDDDI 256 Query: 257 ----AERWKPWRSYALLHIWYTEGWQPD 280 +E + P+RS + +W D Sbjct: 257 MEKCSETFSPYRSVFMFILWRLASTNTD 284 >UniRef50_B3EJD3 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chlorobium phaeobacteroides BS1 RepID=B3EJD3_CHLPB Length = 313 Score = 119 bits (298), Expect = 1e-25, Method: Composition-based stats. Identities = 44/145 (30%), Positives = 63/145 (43%), Gaps = 11/145 (7%) Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER----LDDFP-EYICFPTPQRLAAADP 166 D FE + + Q + + + K +AQ YG R L+D P Y FPTP+ LA+ P Sbjct: 116 DPFETLITFMCAQGLGMHLIRKQVTYLAQEYGTRHTIRLNDVPYTYFSFPTPEALASTSP 175 Query: 167 QALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGDV----EQAMKTLQTFPGIGRWTAN 221 ++L+ RA+ +I A A + G L + D E KTL + PGIG A+ Sbjct: 176 ESLRLCTNNNCIRADNIIQAAQAVVSGKLDLQALKDPAMPLENVRKTLCSQPGIGFKIAD 235 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFP 246 L G F D + Q Sbjct: 236 CVMLFGLHRFAAFPI-DRHVHQYLA 259 >UniRef50_B4SHB1 8-oxoguanine DNA glycosylase domain protein n=3 Tax=Chlorobium/Pelodictyon group RepID=B4SHB1_PELPB Length = 312 Score = 118 bits (295), Expect = 3e-25, Method: Composition-based stats. Identities = 53/228 (23%), Positives = 85/228 (37%), Gaps = 36/228 (15%) Query: 82 FDLQC------NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLT 135 FD+ N + L +L +R+ D FE + + Q + + + K Sbjct: 80 FDIDTEQIFPDNFSHLYPTLWQLLTDYFPVRIMRQ-DPFETMISFMCAQGIGMPLIRKQV 138 Query: 136 ARVAQLYGERLD-----DFPEYICFPTPQRLAAADPQALKAL-GMPLKRAEALIHLANAA 189 + + Q YGE+ FP+P+RLAAA+P AL RA ++ +A Sbjct: 139 SMLLQNYGEKRTISYSGKEITLHHFPSPERLAAANPIALSTCTNNNHPRARNIVRIAKGV 198 Query: 190 LEGTLPMTIPGDV----EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ-- 243 +G + + D + +TL G+G A+ AL G D F D +KQ Sbjct: 199 ADGKIDLDALSDPLLPLSELRRTLCQNEGVGYKIADCIALFGLGRFDAFPI-DTHVKQYL 257 Query: 244 --------RFPGMTPAQIRRY-AER-------WKPWRSYALLHIWYTE 275 +TPA+ AE + + + L H W E Sbjct: 258 GQWFNSTTALQSLTPARYLALDAEARTILKPDFAGYAGHLLFHCWRKE 305 >UniRef50_B8KXG2 HhH-GPD family protein n=1 Tax=gamma proteobacterium NOR51-B RepID=B8KXG2_9GAMM Length = 209 Score = 117 bits (294), Expect = 4e-25, Method: Composition-based stats. Identities = 34/176 (19%), Positives = 64/176 (36%), Gaps = 19/176 (10%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 F R I+GQ VSVA A+ + ARV G + + +++ + L++ Sbjct: 43 GFAALARIIIGQQVSVAAASSIAARVEAALGGEI----------SADQVSRVGDEKLRSA 92 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++ L LA E + + ++ + G G W+A + + Sbjct: 93 GLSRQKVNYLRELARHCREEGFAPERLVREEDDAVLEAITAIKGFGVWSAQMYLMFSLGR 152 Query: 231 KDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D++ D ++ F + + R E + P RS + W P Sbjct: 153 TDIWPVGDLAVRAGFGRIMALDERPDEHETARLGEVFSPHRSALAMLCWKFYSEAP 208 >UniRef50_Q754R1 AFR011Wp n=1 Tax=Eremothecium gossypii RepID=Q754R1_ASHGO Length = 285 Score = 117 bits (293), Expect = 5e-25, Method: Composition-based stats. Identities = 39/204 (19%), Positives = 72/204 (35%), Gaps = 42/204 (20%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F+ I+ Q +S A A + +RV QL+G ++ E + AA D +L+ G Sbjct: 63 FKHLASGIITQQISGAAARSIKSRVEQLFGGTFPNYVELQS-----KFAAGDSASLRKCG 117 Query: 174 MPLKRAEALIHLANAALEGTLPM----TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + ++ + L + + + D E ++ GIG W+A F + Sbjct: 118 LSARKVSYVESLTAYFNQNEMRLKHLFNSGSDAEIVDDLVRNVKGIGPWSAKMFLVTSLH 177 Query: 230 AKDVFLPDDYLIKQ---RFPGMTPAQIRRY------------------------------ 256 +DVF DD I + R+ P +++ Sbjct: 178 RQDVFAADDLGIARGCSRYLTARPEVLKKLMVSRTTVKRSKIKHKNSNWRIYDEDIVESC 237 Query: 257 AERWKPWRSYALLHIWYTEGWQPD 280 + +KP R+ + +W D Sbjct: 238 GQLFKPHRTLFMFILWRLSSTNID 261 >UniRef50_C0NIP1 Putative uncharacterized protein n=1 Tax=Ajellomyces capsulatus G186AR RepID=C0NIP1_AJECG Length = 406 Score = 116 bits (291), Expect = 8e-25, Method: Composition-based stats. Identities = 30/124 (24%), Positives = 51/124 (41%), Gaps = 24/124 (19%) Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTF 212 FPTP ++A D L++ G+ ++AE + LA G L M + E+ ++ L Sbjct: 296 FPTPAQVAKCDIATLRSAGLSQRKAEYIQGLAEKFASGELSAQMLLQASDEEVLEKLIAV 355 Query: 213 PGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 G+GRW+ F+ G + DVF D ++ S + ++W Sbjct: 356 RGLGRWSVEMFSCFGLKRMDVFSTGDLGVQ----------------------SLFMWYMW 393 Query: 273 YTEG 276 E Sbjct: 394 RIED 397 Score = 43.3 bits (101), Expect = 0.010, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 23/65 (35%), Gaps = 5/65 (7%) Query: 83 DLQCNPQIVNGALGRLGAARPG-----LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTAR 137 + + V L + P L +D F V I+GQ VS A A + + Sbjct: 158 EATAHLLNVAPQLRPVVEKHPCPLFSPAGLAEEIDPFNSLVSGIIGQQVSGAAARSIKKK 217 Query: 138 VAQLY 142 L+ Sbjct: 218 FMALF 222 >UniRef50_A6SCZ1 Putative uncharacterized protein n=1 Tax=Botryotinia fuckeliana B05.10 RepID=A6SCZ1_BOTFB Length = 338 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 54/142 (38%), Gaps = 26/142 (18%) Query: 162 AAADPQALKALGMPLKRAEALIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWT 219 + + ++ G+ ++AE + LA G L P + E+ +L G+G+W+ Sbjct: 188 ISIETTCIRTAGLSQRKAEYISGLALKFTNGDLTTPFLLTASYEEVFDSLIQVRGLGKWS 247 Query: 220 ANYFALRGWQAKDVFLPDDYLIKQRFPGM------------------------TPAQIRR 255 FA + DVF D +++ + + ++ Sbjct: 248 VEMFACFALKRLDVFSTGDLGVQRGMAALVGRDVEKLKKAGKGAKGGGKWKYMSEKEMEE 307 Query: 256 YAERWKPWRSYALLHIWYTEGW 277 AE++ P+R+ + ++W E Sbjct: 308 IAEKFSPYRTIFMWYMWRVEDT 329 >UniRef50_A5DE48 Putative uncharacterized protein n=2 Tax=Pichia guilliermondii RepID=A5DE48_PICGU Length = 333 Score = 114 bits (287), Expect = 2e-24, Method: Composition-based stats. Identities = 28/143 (19%), Positives = 56/143 (39%), Gaps = 17/143 (11%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLY-GERLDDFPEYICFPTPQRLAAADPQAL 169 + + + +++ Q +S + A + + L+ GE PTP + L Sbjct: 118 LSYWHSLISSVVSQQISGSAARSIMNKFEALFDGE-----------PTPSKTLTFTFDEL 166 Query: 170 KALGMPLKRAEALIHLANAALE-----GTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFA 224 + +G+ + + ++ A + + +E+ +K L + GIG W+A FA Sbjct: 167 REVGLSRMKISYVQSISEAFSDPNSNLCKVSFYRDAPLEEVVKELVSLKGIGEWSAKMFA 226 Query: 225 LRGWQAKDVFLPDDYLIKQRFPG 247 L DVF DD + + Sbjct: 227 LFTLNEWDVFAHDDLGVARGMAR 249 >UniRef50_C1ABY5 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1ABY5_GEMAT Length = 219 Score = 113 bits (284), Expect = 6e-24, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 70/187 (37%), Gaps = 13/187 (6%) Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILG 123 S+ P A+ L + S +Q + +L + A L F R +L Sbjct: 3 SSTESPAPAQRLTRASLAAAVQA-LSAQDPSLAAIVARHGTPPLWARPAGFATLGRIVLE 61 Query: 124 QLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 Q VS+ AA L R+ P +A AL+ALG+ +++ L Sbjct: 62 QQVSLEAAATLWRRLDAQIPGGFHAAP----------VAEIGVDALRALGLTRQKSAYLH 111 Query: 184 HLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 LA A + TL + + + M L GIG WTA+ + L + DV+ P D + Sbjct: 112 GLATAVADRTLDLALLARASDAEVMSRLTALHGIGPWTASVYLLFALRRPDVWPPGDLAL 171 Query: 242 KQRFPGM 248 Sbjct: 172 HLAMRDA 178 >UniRef50_C7MW75 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Saccharomonospora viridis DSM 43017 RepID=C7MW75_SACVD Length = 324 Score = 112 bits (281), Expect = 1e-23, Method: Composition-based stats. Identities = 51/232 (21%), Positives = 87/232 (37%), Gaps = 15/232 (6%) Query: 50 AIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCN---PQIV---NGALGRLGAARP 103 + A + I + A E V + ++ R L + V + L A P Sbjct: 90 LVRQRAPGVVRIEVDAPPESV-ETVVEQVCRALSLDVDGGGFAAVADGDPVLRHRLRANP 148 Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLA 162 GLR +E A++ VA A + R+A G E FP PQ L Sbjct: 149 GLRPVLFCSPYEAACWAVVCHRFRVAQADAVIRRIAMNRGRVFHVGGREIPSFPVPQELG 208 Query: 163 AADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTA 220 D + G+ ++ +L +A+AAL G L + A++ ++ PG+G ++A Sbjct: 209 VLD----SSYGVSERKRRSLSAIADAALSGELDADRLRALPVFAAVEAVRRLPGLGPFSA 264 Query: 221 NYFALRGWQAKDVFLPDDYLIKQRFPGMTP-AQIRRYAERWKPWRSYALLHI 271 RG D+F + + + ERW+P+R + + Sbjct: 265 ELVVGRGAGHPDLFPASEAGLATTLRRCYGVDDVGVVTERWRPYRGWGAFFL 316 >UniRef50_A5DYG6 Putative uncharacterized protein n=1 Tax=Lodderomyces elongisporus RepID=A5DYG6_LODEL Length = 362 Score = 111 bits (279), Expect = 2e-23, Method: Composition-based stats. Identities = 39/216 (18%), Positives = 72/216 (33%), Gaps = 57/216 (26%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 + +R+++ Q VS A A + + L+ E PT +R + LK+ G Sbjct: 151 WYSLIRSVIAQQVSGAAAKSIENKFKNLFNE--------GEVPTAKRTREMSLEQLKSAG 202 Query: 174 MPLKRAEALIHLANAALE-----GTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + + + + H++ + L VE+ + + GIG W+A FAL Sbjct: 203 ISGPKVKYVEHISQVFTDPSSKLCDLSFYANATVEEVYEEVCKLKGIGIWSAKMFALFTL 262 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQ------------------------------------ 252 + DVF DD + + + Sbjct: 263 EEMDVFAEDDLGVARGMAKYLEQRPEMLLQVKSAAKGDEDKQKRLKKRSKFFNKDNSKRT 322 Query: 253 --------IRRYAERWKPWRSYALLHIWYTEGWQPD 280 + AE++KP+RS ++ +W D Sbjct: 323 WNPVHDVYVLDIAEKFKPYRSAFMMILWRLSSTNID 358 >UniRef50_A3LTR9 3-methyladenine DNA glycosylase (Fragment) n=5 Tax=Saccharomycetales RepID=A3LTR9_PICST Length = 288 Score = 111 bits (278), Expect = 3e-23, Method: Composition-based stats. Identities = 28/141 (19%), Positives = 61/141 (43%), Gaps = 13/141 (9%) Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 + + + +++GQ +S A + + +G+ + P+ L+ Sbjct: 77 LSYWYALISSVIGQQISGHAARAVEKKFKDSFGDDEMN---------PENTLKKSFDELR 127 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIP----GDVEQAMKTLQTFPGIGRWTANYFALR 226 A+G+ + + +I ++ A + +T P G +E+ ++ L + GIG W+A FA+ Sbjct: 128 AVGLSNMKTKYVISISEAFSDPKNKLTDPKFYEGPLEEIVEELVSLKGIGVWSAKMFAIF 187 Query: 227 GWQAKDVFLPDDYLIKQRFPG 247 + DVF DD + + Sbjct: 188 TLKEMDVFAEDDLGVARGMAK 208 >UniRef50_A4XJM3 8-oxoguanine DNA glycosylase domain protein n=2 Tax=Clostridia RepID=A4XJM3_CALS8 Length = 286 Score = 111 bits (277), Expect = 4e-23, Method: Composition-based stats. Identities = 49/210 (23%), Positives = 82/210 (39%), Gaps = 17/210 (8%) Query: 78 MSRLFDLQCNPQIV-------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAM 130 FDL + + + L G+RL + FE + I+ Q ++ Sbjct: 72 FYWYFDLDKDYDEILEKLSGHDSILKEAVEKYRGMRLLNQ-EPFECMISFIISQNNNIKR 130 Query: 131 AAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 L R+ Q +G+++ FPT + L ++ LK LG+ RAE + Sbjct: 131 IQLLIERLCQAFGKKITYKGFVSWSFPTLESLWSSSIDDLKLLGL-GYRAEYIKDAVEKV 189 Query: 190 LEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG 247 G + D+E +A + L+T GIG A+ L Q +VF D +K+ Sbjct: 190 KNGLINFDELTDLEVQKAKQVLKTIKGIGDKVADCILLYSLQKYNVFPI-DVWVKRVLKE 248 Query: 248 M----TPAQIRRYAERWKPWRSYALLHIWY 273 T QIR + + YA L +++ Sbjct: 249 YYGFKTKDQIRDFINSFGDLAGYAQLFLFH 278 >UniRef50_Q5R0A4 3-methyladenine DNA glycosylase n=2 Tax=Idiomarina RepID=Q5R0A4_IDILO Length = 197 Score = 110 bits (276), Expect = 4e-23, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 64/160 (40%), Gaps = 13/160 (8%) Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 E R I+ Q++S+ +A + R Q R+A + L G+ Sbjct: 41 EALPRIIIRQMLSLKASATIIERAEQKAKALGVG-----------RIAYIPAEELTGCGV 89 Query: 175 PLKRAEALIHLANAALE--GTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +A A+ +A+ + + +K + GIG WTA+ A+ + +D Sbjct: 90 SRSKAAAINSVAHCHQTQPERIDAWSQMNSADLIKDVTQLKGIGPWTASILAMFHFAHED 149 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 +F D +++ + I ER +P+RSY ++W Sbjct: 150 LFPIQDSSLRKAMSLLKDQDILIIPERAQPYRSYLACYLW 189 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P04395 DNA-3-methyladenine glycosylase 2 n=122 Tax=Ente... 281 2e-74 UniRef50_Q2SDC7 Adenosine deaminase n=3 Tax=Bacteria RepID=Q2SDC... 277 4e-73 UniRef50_D1R7A9 Putative uncharacterized protein n=1 Tax=Parachl... 275 9e-73 UniRef50_A8MFS4 Transcriptional regulator, AraC family n=2 Tax=C... 273 6e-72 UniRef50_Q6MA41 Putative DNA-3-methyladenine glycosidase II n=3 ... 272 9e-72 UniRef50_UPI0001BC59A0 AraC family transcriptional regulator n=1... 272 1e-71 UniRef50_C8QGF9 Transcriptional regulator, AraC family n=1 Tax=P... 270 3e-71 UniRef50_A3ETF2 Putative Ada DNA repair protein and transcriptio... 267 3e-70 UniRef50_A8GHQ3 Transcriptional regulator, AraC family n=3 Tax=P... 258 2e-67 UniRef50_Q0AGQ6 DNA-3-methyladenine glycosylase II / DNA-O6-meth... 258 2e-67 UniRef50_Q1IT49 DNA-3-methyladenine glycosylase II / Transcripti... 254 2e-66 UniRef50_A6SU78 Methylated-DNA-[protein]-cysteine S-methyltransf... 253 4e-66 UniRef50_Q2RNZ4 Transcriptional regulator Ada / DNA-3-methyladen... 252 1e-65 UniRef50_A7HP34 Ada metal-binding domain protein n=3 Tax=Bacteri... 249 7e-65 UniRef50_Q46QC3 Transcriptional regulator Ada n=25 Tax=cellular ... 247 3e-64 UniRef50_Q02KH7 DNA-3-methyladenine glycosidase II n=8 Tax=Pseud... 245 1e-63 UniRef50_B0RQX4 DNA methylation and regulatory protein (Methylat... 245 1e-63 UniRef50_D2L7I2 Transcriptional regulator, AraC family n=1 Tax=D... 241 2e-62 UniRef50_UPI0001901D5D methylated-DNA--protein-cysteine methyltr... 237 2e-61 UniRef50_Q6MR46 DNA methylation and regulatory protein Ada n=1 T... 237 2e-61 UniRef50_Q10630 Methylated-DNA--protein-cysteine methyltransfera... 237 3e-61 UniRef50_C0WE04 Transcriptional regulator n=1 Tax=Acidaminococcu... 237 3e-61 UniRef50_B0SWZ0 Transcriptional regulator, AraC family n=7 Tax=B... 237 4e-61 UniRef50_Q12D18 Transcriptional regulator Ada / DNA-O6-methylgua... 235 2e-60 UniRef50_A6EY17 Transcriptional Regulator, AraC family protein n... 234 2e-60 UniRef50_C0Q970 AlkA n=1 Tax=Desulfobacterium autotrophicum HRM2... 234 3e-60 UniRef50_A4SQS2 DNA methylation and regulatory protein n=2 Tax=A... 234 3e-60 UniRef50_A7HG85 AlkA domain protein n=2 Tax=Myxococcales RepID=A... 231 2e-59 UniRef50_A5KSU6 Transcriptional regulator, AraC family n=1 Tax=c... 231 2e-59 UniRef50_B1ZFN9 AlkA domain protein n=6 Tax=Methylobacterium Rep... 225 1e-57 UniRef50_C4L050 DNA-3-methyladenine glycosylase II n=4 Tax=Bacil... 221 2e-56 UniRef50_Q1ZAD8 Hypothetical ada regulatory protein n=2 Tax=Phot... 221 2e-56 UniRef50_A1WKZ8 DNA-3-methyladenine glycosylase II / Transcripti... 221 3e-56 UniRef50_B7RWC6 AlkA N-terminal domain family protein n=1 Tax=ma... 221 3e-56 UniRef50_C7R5W7 Transcriptional regulator, AraC family n=1 Tax=K... 220 5e-56 UniRef50_C4DFD0 DNA-3-methyladenine glycosylase II; Transcriptio... 219 6e-56 UniRef50_Q2T2N2 DNA-3-methyladenine glycosylase II n=65 Tax=Burk... 219 8e-56 UniRef50_B9DJS2 Putative uncharacterized protein n=1 Tax=Staphyl... 219 8e-56 UniRef50_A1TR03 DNA-O6-methylguanine--protein-cysteine S-methylt... 219 1e-55 UniRef50_P37878 DNA-3-methyladenine glycosylase n=4 Tax=Bacillac... 217 2e-55 UniRef50_C7QDZ2 Transcriptional regulator, AraC family n=2 Tax=A... 217 4e-55 UniRef50_C0ZIT0 DNA-3-methyladenine glycosylase II n=75 Tax=Baci... 216 6e-55 UniRef50_C6D2P4 DNA-3-methyladenine glycosylase II n=1 Tax=Paeni... 216 6e-55 UniRef50_C0Z5U6 Putative DNA-3-methyladenine glycosylase II n=1 ... 215 9e-55 UniRef50_C6XZ60 HhH-GPD family protein n=1 Tax=Pedobacter hepari... 214 3e-54 UniRef50_C1YI07 DNA-O6-methylguanine--protein-cysteine S-methylt... 210 4e-53 UniRef50_D0LE01 Ada metal-binding domain protein n=1 Tax=Gordoni... 210 5e-53 UniRef50_B4S0Y6 Ada regulatory protein n=3 Tax=Alteromonas macle... 209 8e-53 UniRef50_B2GIR9 Putative methylated-DNA--protein-cysteine methyl... 207 3e-52 UniRef50_A3XSB2 Ada regulatory protein n=1 Tax=Vibrio sp. MED222... 206 6e-52 UniRef50_C6W476 HhH-GPD family protein n=1 Tax=Dyadobacter ferme... 206 9e-52 UniRef50_Q2IPL2 Transcriptional regulator Ada / DNA-O6-methylgua... 205 1e-51 UniRef50_Q15P13 DNA-O6-methylguanine--protein-cysteine S-methylt... 205 1e-51 UniRef50_Q2BC23 DNA-3-methyladenine glycosylase II n=1 Tax=Bacil... 205 2e-51 UniRef50_A8LHD8 Transcriptional regulator, AraC family n=4 Tax=A... 204 3e-51 UniRef50_C5C5F4 HhH-GPD family protein n=1 Tax=Beutenbergia cave... 203 4e-51 UniRef50_D1BI44 DNA-3-methyladenine glycosylase II /DNA-O6-methy... 203 4e-51 UniRef50_Q1QTR7 Transcriptional regulator Ada / DNA-3-methyladen... 203 6e-51 UniRef50_C7MYM6 DNA-3-methyladenine glycosylase II /DNA-O6-methy... 203 7e-51 UniRef50_C7PMW8 8-oxoguanine DNA glycosylase domain protein n=1 ... 201 2e-50 UniRef50_D1CD20 DNA-3-methyladenine glycosylase II n=1 Tax=Therm... 201 3e-50 UniRef50_A0JV31 DNA-O6-methylguanine--protein-cysteine S-methylt... 200 3e-50 UniRef50_O31544 Putative DNA-3-methyladenine glycosylase yfjP n=... 200 4e-50 UniRef50_Q12L65 DNA-O6-methylguanine--protein-cysteine S-methylt... 199 1e-49 UniRef50_A9B7A8 Transcriptional regulator, AraC family n=1 Tax=H... 198 1e-49 UniRef50_B0KRT0 AlkA domain protein n=1 Tax=Pseudomonas putida G... 198 2e-49 UniRef50_Q3IBU8 Putative ADA regulatory protein (Regulatory prot... 197 3e-49 UniRef50_C6MGP3 HhH-GPD family protein n=1 Tax=Nitrosomonas sp. ... 195 2e-48 UniRef50_D1P0X5 DNA-3-methyladenine glycosylase II n=4 Tax=Enter... 194 2e-48 UniRef50_Q81IC3 DNA-3-methyladenine glycosylase II n=75 Tax=Baci... 194 3e-48 UniRef50_UPI00018509D2 YfjP n=1 Tax=Bacillus coahuilensis m4-4 R... 193 7e-48 UniRef50_Q1YTX8 Putative DNA-3-methyladenine glycosylase II n=1 ... 191 2e-47 UniRef50_A3D6C4 Transcriptional regulator Ada / DNA-3-methyladen... 191 2e-47 UniRef50_D1Z1B8 Putative DNA glycosidase n=1 Tax=Methanocella pa... 191 3e-47 UniRef50_A5CSR4 Putative DNA glycosylase n=2 Tax=Clavibacter mic... 189 1e-46 UniRef50_D1C0H7 Transcriptional regulator, AraC family n=1 Tax=X... 188 2e-46 UniRef50_UPI0000E0EED3 Ada family regulatory protein n=1 Tax=Gla... 188 2e-46 UniRef50_Q7N9Z6 Similarities with the C-terminal region of 3-met... 188 3e-46 UniRef50_Q5NXL1 DNA-3-methyladenine glycosidase II n=3 Tax=Betap... 186 6e-46 UniRef50_Q7MGD3 Adenosine deaminase n=51 Tax=Vibrionales RepID=Q... 186 6e-46 UniRef50_Q1ITU3 DNA-3-methyladenine glycosylase II n=2 Tax=Bacte... 186 8e-46 UniRef50_C0E8I7 Putative uncharacterized protein n=2 Tax=Clostri... 185 1e-45 UniRef50_D1C1F2 HhH-GPD family protein n=1 Tax=Sphaerobacter the... 184 2e-45 UniRef50_C8XKJ9 AlkA domain protein n=1 Tax=Nakamurella multipar... 184 3e-45 UniRef50_A1S7Q4 DNA-3-methyladenine glycosylase II / DNA-O6-meth... 183 4e-45 UniRef50_B4X1U6 Base excision DNA repair protein, HhH-GPD family... 183 7e-45 UniRef50_B9XBY0 HhH-GPD family protein n=1 Tax=bacterium Ellin51... 182 1e-44 UniRef50_A6CCG3 Probable DNA-3-methyladenine glycosylase n=1 Tax... 180 5e-44 UniRef50_D1ZEJ1 Whole genome shotgun sequence assembly, scaffold... 179 8e-44 UniRef50_A1ZCF3 HhH-GPD n=1 Tax=Microscilla marina ATCC 23134 Re... 178 1e-43 UniRef50_A4BNP3 3-methyladenine DNA glycosylase/8-oxoguanineDNA ... 178 2e-43 UniRef50_C1RNZ7 DNA-3-methyladenine glycosylase II n=1 Tax=Cellu... 178 2e-43 UniRef50_B7K2N0 DNA-3-methyladenine glycosylase II n=5 Tax=Chroo... 177 3e-43 UniRef50_Q2FMK1 HhH-GPD n=1 Tax=Methanospirillum hungatei JF-1 R... 177 4e-43 UniRef50_A2QHV8 Contig An04c0070, complete genome n=10 Tax=Eurot... 176 8e-43 UniRef50_Q9KC25 DNA-3-methyladenine glycosidase n=1 Tax=Bacillus... 175 2e-42 UniRef50_C7RDZ5 8-oxoguanine DNA glycosylase domain protein n=3 ... 175 2e-42 UniRef50_Q82VT3 HhH-GPD n=2 Tax=Betaproteobacteria RepID=Q82VT3_... 175 2e-42 UniRef50_D0J4I7 HhH-GPD n=2 Tax=Comamonas testosteroni RepID=D0J... 174 2e-42 UniRef50_C7NLP9 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 174 3e-42 UniRef50_B5ES79 HhH-GPD family protein n=4 Tax=Acidithiobacillus... 173 5e-42 UniRef50_B3T536 Putative HhH-GPD superfamily base excision DNA r... 173 7e-42 UniRef50_B1YMD5 HhH-GPD family protein n=1 Tax=Exiguobacterium s... 173 7e-42 UniRef50_Q0VPN7 Putative uncharacterized protein n=1 Tax=Alcaniv... 172 9e-42 UniRef50_B2SXP8 HhH-GPD family protein n=39 Tax=Betaproteobacter... 172 1e-41 UniRef50_A6WG49 HhH-GPD family protein n=5 Tax=Actinomycetales R... 171 3e-41 UniRef50_A9RKT9 Predicted protein (Fragment) n=1 Tax=Physcomitre... 171 3e-41 UniRef50_A4XJM3 8-oxoguanine DNA glycosylase domain protein n=2 ... 170 5e-41 UniRef50_D1HE56 Whole genome shotgun sequence of line PN40024, s... 170 5e-41 UniRef50_C5G8B3 DNA-3-methyladenine glycosylase n=8 Tax=Onygenal... 170 6e-41 UniRef50_A9FBN7 Putative DNA-3-methyladenine glycosidase n=1 Tax... 170 6e-41 UniRef50_Q9LN45 F18O14.25 n=22 Tax=Magnoliophyta RepID=Q9LN45_ARATH 170 6e-41 UniRef50_C1D8D7 HhH-GPD family protein n=1 Tax=Laribacter hongko... 170 7e-41 UniRef50_D2PPK3 Transcriptional regulator, AraC family n=1 Tax=K... 169 8e-41 UniRef50_B6JZD7 DNA-3-methyladenine glycosylase n=1 Tax=Schizosa... 169 9e-41 UniRef50_A5KJK2 Putative uncharacterized protein n=3 Tax=Clostri... 169 1e-40 UniRef50_B2B817 Predicted CDS Pa_2_12990 n=8 Tax=Leotiomyceta Re... 168 1e-40 UniRef50_Q2SX77 DNA-3-methyladenine glycosylase n=60 Tax=Betapro... 168 2e-40 UniRef50_B1ZV80 Transcriptional regulator, AraC family n=2 Tax=O... 167 3e-40 UniRef50_A6TLI8 8-oxoguanine DNA glycosylase domain protein n=9 ... 167 4e-40 UniRef50_D1RHI7 HhH-GPD family base excision repair protein n=1 ... 167 4e-40 UniRef50_C7MAP3 Adenosine deaminase n=1 Tax=Brachybacterium faec... 167 4e-40 UniRef50_C1A5A1 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatim... 166 9e-40 UniRef50_C7DHT2 3-Methyladenine DNA glycosylase n=1 Tax=Candidat... 165 1e-39 UniRef50_Q01SY7 DNA-3-methyladenine glycosylase II n=1 Tax=Candi... 165 2e-39 UniRef50_B4CYJ1 DNA-3-methyladenine glycosylase II n=1 Tax=Chtho... 164 2e-39 UniRef50_A5KST9 DNA-3-methyladenine glycosylase II n=1 Tax=candi... 164 3e-39 UniRef50_A9BVD9 HhH-GPD family protein n=1 Tax=Delftia acidovora... 163 4e-39 UniRef50_B7K9B1 HhH-GPD family protein n=3 Tax=Cyanobacteria Rep... 163 8e-39 UniRef50_UPI00016C4C1A DNA-3-methyladenine glycosylase II n=1 Ta... 162 1e-38 UniRef50_B8IZY6 HhH-GPD family protein n=8 Tax=Bacteria RepID=B8... 162 1e-38 UniRef50_C7PK12 HhH-GPD family protein n=1 Tax=Chitinophaga pine... 161 2e-38 UniRef50_Q92383 DNA-3-methyladenine glycosylase 1 n=1 Tax=Schizo... 161 2e-38 UniRef50_B8GAB8 DNA-3-methyladenine glycosylase II n=3 Tax=Chlor... 161 2e-38 UniRef50_A9EU33 Methylated-DNA--protein-cysteine methyltransfera... 160 4e-38 UniRef50_Q3INX6 DNA N-glycosylase / DNA lyase n=6 Tax=Halobacter... 160 5e-38 UniRef50_Q04UT1 DNA-3-methyladenine glycosylase II n=4 Tax=Lepto... 159 8e-38 UniRef50_D2QEN8 HhH-GPD family protein n=1 Tax=Spirosoma lingual... 159 9e-38 UniRef50_B4B851 DNA-3-methyladenine glycosylase II n=2 Tax=Cyano... 159 1e-37 UniRef50_A5WCQ9 HhH-GPD family protein n=2 Tax=Psychrobacter Rep... 159 1e-37 UniRef50_Q0BSG3 DNA-3-methyladenine glycosylase II n=12 Tax=Prot... 158 1e-37 UniRef50_C1XHZ0 DNA-3-methyladenine glycosylase II n=2 Tax=Meiot... 158 1e-37 UniRef50_A9T041 Predicted protein n=1 Tax=Physcomitrella patens ... 158 1e-37 UniRef50_B2W1R2 DNA-3-methyladenine glycosylase n=1 Tax=Pyrenoph... 158 2e-37 UniRef50_C2AV46 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 158 2e-37 UniRef50_A6GQ39 3-methyladenine DNA glycosylase II n=1 Tax=Limno... 158 2e-37 UniRef50_Q5FSB3 DNA-3-methyladenine glycosylase n=1 Tax=Gluconob... 157 3e-37 UniRef50_C6IXS6 DNA-3-methyladenine glycosidase n=1 Tax=Paenibac... 156 5e-37 UniRef50_C7H057 8-oxoguanine DNA glycosylase n=1 Tax=Eubacterium... 156 5e-37 UniRef50_Q8TL35 DNA-3-methyladenine glycosylase II n=1 Tax=Metha... 156 5e-37 UniRef50_C7MP98 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 156 6e-37 UniRef50_C6CD76 DNA-3-methyladenine glycosylase II n=1 Tax=Dicke... 156 6e-37 UniRef50_B2J3A5 HhH-GPD family protein n=1 Tax=Nostoc punctiform... 156 7e-37 UniRef50_A7EZ08 Putative uncharacterized protein n=1 Tax=Sclerot... 156 8e-37 UniRef50_UPI0000D54B32 HhH-GPD n=1 Tax=Psychroflexus torquis ATC... 156 1e-36 UniRef50_B0K3N9 8-oxoguanine DNA glycosylase domain protein n=15... 155 2e-36 UniRef50_Q1D1V1 HhH-GPD domain protein n=15 Tax=cellular organis... 155 2e-36 UniRef50_D1IGU3 Whole genome shotgun sequence of line PN40024, s... 154 3e-36 UniRef50_B0MMM9 Putative uncharacterized protein n=1 Tax=Eubacte... 154 3e-36 UniRef50_B0P8C4 Putative uncharacterized protein n=2 Tax=Clostri... 154 4e-36 UniRef50_Q5K8T8 DNA-3-methyladenine glycosidase, putative n=1 Ta... 154 4e-36 UniRef50_B5IDT4 Base excision DNA repair protein, HhH-GPD family... 153 5e-36 UniRef50_C7HRV7 3-methyladenine DNA glycosylase n=6 Tax=Clostrid... 153 6e-36 UniRef50_A8SY14 Putative uncharacterized protein n=1 Tax=Coproco... 153 6e-36 UniRef50_C8WLI9 HhH-GPD family protein n=4 Tax=Bacteria RepID=C8... 153 6e-36 UniRef50_Q6CEP5 YALI0B14080p n=1 Tax=Yarrowia lipolytica RepID=Q... 153 7e-36 UniRef50_B6EMH3 DNA repair protein n=2 Tax=Gammaproteobacteria R... 153 7e-36 UniRef50_A6TTX3 Methylated-DNA--protein-cysteine methyltransfera... 153 8e-36 UniRef50_A5V920 HhH-GPD family protein n=7 Tax=Sphingomonadales ... 153 9e-36 UniRef50_B9XGL2 8-oxoguanine DNA glycosylase domain protein n=1 ... 152 1e-35 UniRef50_B9LPN6 HhH-GPD family protein n=4 Tax=Halobacteriaceae ... 152 1e-35 UniRef50_A0Q2T4 8-oxoguanine-DNA-glycosylase, putative n=5 Tax=C... 151 2e-35 UniRef50_A0RYQ2 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 151 2e-35 UniRef50_B0U6C0 DNA-3-methyladenine glycosidase n=16 Tax=Xanthom... 151 2e-35 UniRef50_C4R2A8 Mitochondrial glycosylase/lyase n=1 Tax=Pichia p... 151 3e-35 UniRef50_C4Y1S0 Putative uncharacterized protein n=1 Tax=Clavisp... 151 3e-35 UniRef50_C8W0S2 HhH-GPD family protein n=6 Tax=Bacteria RepID=C8... 151 4e-35 UniRef50_A9KID8 8-oxoguanine DNA glycosylase domain protein n=2 ... 150 4e-35 UniRef50_Q6BZL7 DEHA2A00418p n=2 Tax=Debaryomyces hansenii RepID... 150 4e-35 UniRef50_C0KTC3 8-oxoguanine DNA glycosylase n=1 Tax=Clostridium... 150 7e-35 UniRef50_B3T522 Putative HhH-GPD superfamily base excision DNA r... 150 7e-35 UniRef50_UPI00016C0B45 8-oxoguanine DNA glycosylase domain prote... 149 1e-34 UniRef50_A5E623 Putative uncharacterized protein n=1 Tax=Loddero... 149 1e-34 UniRef50_A1K6J5 DNA-3-methyladenine glycosylase II n=21 Tax=Prot... 149 1e-34 UniRef50_C6PBG5 8-oxoguanine DNA glycosylase domain protein n=1 ... 149 1e-34 UniRef50_Q4A0G9 Putative DNA-3-methyladenine glycosidase n=1 Tax... 149 1e-34 UniRef50_C4G1E5 Putative uncharacterized protein n=1 Tax=Abiotro... 149 1e-34 UniRef50_A9M750 HhH-GPD family protein n=55 Tax=Rhizobiales RepI... 148 2e-34 UniRef50_UPI0001C41B95 8-oxoguanine DNA glycosylase Ogg n=1 Tax=... 148 2e-34 UniRef50_C2L088 8-oxoguanine DNA glycosylase n=2 Tax=Clostridial... 148 2e-34 UniRef50_Q7NJ14 Gll2018 protein n=1 Tax=Gloeobacter violaceus Re... 148 2e-34 UniRef50_A3JFL8 3-methyladenine DNA glycosylase n=1 Tax=Marinoba... 148 2e-34 UniRef50_D0J2I3 HhH-GPD n=6 Tax=Comamonadaceae RepID=D0J2I3_COMTE 148 2e-34 UniRef50_Q3B3Y2 HhH-GPD n=1 Tax=Chlorobium luteolum DSM 273 RepI... 148 2e-34 UniRef50_C6WJ98 Transcriptional regulator, AraC family n=5 Tax=A... 148 3e-34 UniRef50_Q6CEU4 YALI0B12870p n=1 Tax=Yarrowia lipolytica RepID=Q... 148 3e-34 UniRef50_A6EE77 3-methyladenine DNA glycosylase n=1 Tax=Pedobact... 147 4e-34 UniRef50_O94468 Probable DNA-3-methyladenine glycosylase 2 n=1 T... 147 5e-34 UniRef50_C4Z3R2 N-glycosylase/DNA lyase n=4 Tax=Bacteria RepID=C... 147 5e-34 UniRef50_UPI00006DC22F hypothetical protein CdifQ_04000214 n=1 T... 146 6e-34 UniRef50_B8I162 8-oxoguanine DNA glycosylase domain protein n=2 ... 146 6e-34 UniRef50_D0XPK8 HhH-GPD family protein n=1 Tax=Brevundimonas sub... 146 6e-34 UniRef50_B6BS24 DNA-3-methyladenine glycosylase I n=4 Tax=SAR11 ... 146 7e-34 UniRef50_D0LW65 DNA-3-methyladenine glycosylase II n=1 Tax=Halia... 146 9e-34 UniRef50_C6A294 AlkA 3-methyladenine DNA glycosylase n=9 Tax=The... 146 1e-33 UniRef50_A8QA43 Putative uncharacterized protein n=1 Tax=Malasse... 146 1e-33 UniRef50_B4S806 8-oxoguanine DNA glycosylase domain protein n=1 ... 145 1e-33 UniRef50_C1DYL3 Predicted protein n=2 Tax=Micromonas RepID=C1DYL... 145 1e-33 UniRef50_Q9ZET9 DNA-3-methyladenine glycosidase (Fragment) n=1 T... 145 2e-33 UniRef50_Q46B02 8-oxoguanine DNA glycosylase n=4 Tax=Methanosarc... 145 2e-33 UniRef50_Q2FPI5 8-oxoguanine DNA glycosylase-like n=1 Tax=Methan... 145 2e-33 UniRef50_C4ZBV7 8-oxoguanine DNA glycosylase domain protein n=1 ... 145 2e-33 UniRef50_Q756Y7 AER127Cp n=1 Tax=Eremothecium gossypii RepID=Q75... 145 2e-33 UniRef50_Q4PHN3 Putative uncharacterized protein n=1 Tax=Ustilag... 145 2e-33 UniRef50_P53397 DNA-(apurinic or apyrimidinic site) lyase n=8 Ta... 144 3e-33 UniRef50_A8N5M3 Putative uncharacterized protein n=2 Tax=Agarica... 144 3e-33 UniRef50_Q18H56 DNA N-glycosylase / DNA lyase n=12 Tax=Halobacte... 144 3e-33 UniRef50_A1SKK8 Putative uncharacterized protein n=2 Tax=Actinom... 144 4e-33 UniRef50_B3ECK0 8-oxoguanine DNA glycosylase domain protein n=1 ... 144 4e-33 UniRef50_D2LH30 HhH-GPD family protein n=1 Tax=Rhodomicrobium va... 143 4e-33 UniRef50_B1C9X9 Putative uncharacterized protein n=1 Tax=Anaerof... 143 5e-33 UniRef50_B4SHB1 8-oxoguanine DNA glycosylase domain protein n=3 ... 143 5e-33 UniRef50_B6K1P6 DNA-3-methyladenine glycosylase n=1 Tax=Schizosa... 143 5e-33 UniRef50_B8GY42 DNA-3-methyladenine glycosylase II n=4 Tax=Caulo... 143 6e-33 UniRef50_Q8F6D8 DNA-3-methyladenine glycosylase n=2 Tax=Leptospi... 143 6e-33 UniRef50_Q5CV50 DNA-3-methyladenine glycosidase (Fragment) n=2 T... 143 7e-33 UniRef50_Q9XI06 F8K7.14 protein n=3 Tax=rosids RepID=Q9XI06_ARATH 143 8e-33 UniRef50_O28163 3-methyladenine DNA glycosylase (AlkA) n=1 Tax=A... 143 8e-33 UniRef50_C4DGP2 3-methyladenine DNA glycosylase/8-oxoguanine DNA... 142 1e-32 UniRef50_C8V1L7 Mitochondrial glycosylase/lyase (Eurofung) n=25 ... 142 1e-32 UniRef50_B3E6X3 HhH-GPD family protein n=2 Tax=Bacteria RepID=B3... 142 1e-32 UniRef50_A2STI1 8-oxoguanine DNA glycosylase domain protein n=1 ... 142 1e-32 UniRef50_B9WJV0 Mitochondrial N-glycosylase/DNA lyase [includes:... 142 1e-32 UniRef50_O27397 DNA-(apurinic or apyrimidinic site) lyase n=1 Ta... 142 2e-32 UniRef50_D1VDS6 HhH-GPD family protein n=3 Tax=Actinomycetales R... 141 2e-32 UniRef50_Q5SLG4 DNA-3-methyladenine glycosidase n=6 Tax=Bacteria... 141 2e-32 UniRef50_A4SEG0 8-oxoguanine DNA glycosylase domain protein n=1 ... 141 2e-32 UniRef50_Q64A66 8-oxoguanine DNA glycosylase n=2 Tax=environment... 141 2e-32 UniRef50_Q7CSU9 DNA-3-methyladenine glycosidase II n=1 Tax=Agrob... 141 3e-32 UniRef50_B6G8M1 Putative uncharacterized protein n=1 Tax=Collins... 141 3e-32 UniRef50_Q1AWP7 DNA-3-methyladenine glycosylase II n=1 Tax=Rubro... 141 3e-32 UniRef50_Q3ARU6 HhH-GPD n=1 Tax=Chlorobium chlorochromatii CaD3 ... 141 4e-32 UniRef50_D1VAP6 HhH-GPD family protein n=1 Tax=Frankia sp. EuI1c... 141 4e-32 UniRef50_C3Y2J9 Putative uncharacterized protein n=1 Tax=Branchi... 140 4e-32 UniRef50_Q688W2 Os05g0567500 protein n=2 Tax=Oryza sativa RepID=... 140 4e-32 UniRef50_C0D4Q9 Putative uncharacterized protein n=2 Tax=Clostri... 140 5e-32 UniRef50_C5DFL6 KLTH0D16126p n=2 Tax=Saccharomycetaceae RepID=C5... 139 8e-32 UniRef50_C5SF64 HhH-GPD family protein n=1 Tax=Asticcacaulis exc... 139 9e-32 UniRef50_UPI00015B5D00 PREDICTED: similar to ENSANGP00000022197 ... 139 1e-31 UniRef50_C9L550 8-oxoguanine DNA glycosylase n=3 Tax=Clostridial... 139 1e-31 Sequences not found previously or not previously below threshold: >UniRef50_P04395 DNA-3-methyladenine glycosylase 2 n=122 Tax=Enterobacteriaceae RepID=3MG2_ECOLI Length = 282 Score = 281 bits (719), Expect = 2e-74, Method: Composition-based stats. Identities = 282/282 (100%), Positives = 282/282 (100%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH Sbjct: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA Sbjct: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE Sbjct: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 Query: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL Sbjct: 181 ALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 Query: 241 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA Sbjct: 241 IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 >UniRef50_Q2SDC7 Adenosine deaminase n=3 Tax=Bacteria RepID=Q2SDC7_HAHCH Length = 484 Score = 277 bits (708), Expect = 4e-73, Method: Composition-based stats. Identities = 93/285 (32%), Positives = 142/285 (49%), Gaps = 6/285 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIARH 57 + + ++PPY W +L FL+AR ++ V+ V D Y R + V G A + Sbjct: 200 FEMPYRPPYAWDALLSFLSARTIAGVDAVVDGRYHRIVRVEAGEDSAVGWFEASHEADAA 259 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 + I L + L + ++ FD+ C PQ +N LG L PG+R+P +D FE Sbjct: 260 RIRIRLDSTLSRHIGYLINRLRAFFDVSCVPQEINKVLGTLAQNEPGMRIPSGMDGFEIA 319 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKALGMP 175 VRAILGQ ++VA A L AR+ +G+ ++ FP+ L + L +LG+ Sbjct: 320 VRAILGQQITVAAARTLLARLVDKFGDPIETPFPEINRTFPSAATLVNLPVEELASLGVI 379 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R A+ +A A L L ++ +VEQ ++ L PGIG WTA Y A+R D F Sbjct: 380 RTRVRAIQEIAAAMLRSELTLSPAANVEQEIQRLHAIPGIGDWTAQYIAMRAMSWPDAFP 439 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 D +++ G+ Q R AE W+PWR YA++H+W + D Sbjct: 440 ASDVGVRKALGGVDAKQSARAAEEWRPWRGYAVMHLWRSLELPHD 484 >UniRef50_D1R7A9 Putative uncharacterized protein n=1 Tax=Parachlamydia acanthamoebae str. Hall's coccus RepID=D1R7A9_9CHLA Length = 532 Score = 275 bits (705), Expect = 9e-73, Method: Composition-based stats. Identities = 95/284 (33%), Positives = 149/284 (52%), Gaps = 8/284 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L ++PPYDW ++ FL R + VE + Y R++ +G+ +G + + +L Sbjct: 248 ILQLTYRPPYDWKGVINFLRVRLMKGVEHIEGDRYLRTIQLGKTKGWIQISHAEEKQSLI 307 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 LS L PV L ++ +FDL P ++ + L PGLR+PG D F Sbjct: 308 FELSHSLLPVLPALLGRIRSVFDLNARPDVISTHLRQDKWLTEAVNVNPGLRIPGAFDGF 367 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKAL 172 E VRAILGQ ++V A L R+ Q +GE++ PTPQRL A L +L Sbjct: 368 ELAVRAILGQQITVKAATTLAGRLVQAFGEKIQTPYPELKHLSPTPQRLTIATVDELASL 427 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ R++++IHLA + G L + E+ M+ L PGIG+WTA+Y A+R + D Sbjct: 428 GIIQSRSKSIIHLAEEVVSGRLQLDADVYPEKTMQKLVQIPGIGKWTAHYIAMRALRWPD 487 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 F +D ++++ +T +Q ++ W+PWRSYA+LH+W Sbjct: 488 AFPKEDIVLRKCLGNVTASQAEILSQSWRPWRSYAVLHLWQNSS 531 >UniRef50_A8MFS4 Transcriptional regulator, AraC family n=2 Tax=Clostridiaceae RepID=A8MFS4_ALKOO Length = 485 Score = 273 bits (698), Expect = 6e-72, Method: Composition-based stats. Identities = 102/289 (35%), Positives = 152/289 (52%), Gaps = 15/289 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRG-----VVTAIPDIAR 56 L+++PPY W ML FLA RA++ +E V ++ Y R++ + G + R Sbjct: 197 LVLSYRPPYHWEDMLRFLAGRAITGIEVVKNNEYMRTVHLENSEGKPVYGWIRVGHQSKR 256 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP-----GLRLPGCV 111 + L + +S L V + LA++ LFDL C+P V L + RP G R+PGC Sbjct: 257 NALSVTVSQALLSVLPQVLARIRHLFDLYCDPDAVYETLQVMNDIRPNLCTLGTRVPGCF 316 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAAD---P 166 +AFE VRA+LGQ ++V A+ L AR+ Q YG + E FP+P+ + A + Sbjct: 317 NAFEMVVRAVLGQQITVKAASTLAARIVQTYGTPIQTGFEGLTHVFPSPEDILALNGPIE 376 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 L LG+ RA+ + LA A ++G + +P E+ MK L GIG WTA Y A+R Sbjct: 377 NHLGPLGVIAARAKTIYELAQAFVQGEIDFDLPAQPEEEMKRLMAIRGIGSWTAQYIAMR 436 Query: 227 GWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + D FL D +K+ P T ++ AE W+PWRSYA +++W T Sbjct: 437 AMEWPDAFLETDAGVKKALPPYTAKELLEIAEAWRPWRSYATVNLWNTL 485 >UniRef50_Q6MA41 Putative DNA-3-methyladenine glycosidase II n=3 Tax=Bacteria RepID=Q6MA41_PARUW Length = 476 Score = 272 bits (696), Expect = 9e-72, Method: Composition-based stats. Identities = 90/283 (31%), Positives = 150/283 (53%), Gaps = 8/283 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 LN++PPYDW L FL+ R++ +E V ++ Y R++ + EY+G + +H L + Sbjct: 194 LQLNYRPPYDWIGFLNFLSIRSLKGIELVKNNCYLRTVQIREYKGWIHVSHVEDKHCLRV 253 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 +++ L PV A L ++ FDL P + + L A PGLR+PG D FE Sbjct: 254 KIASSLVPVLAILLERIRNFFDLNARPDKISVQLEQDPFLAEEVAKNPGLRVPGTFDGFE 313 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICF--PTPQRLAAADPQALKALG 173 RAILGQ ++V A L +R + +GE + + P+ QR+++ + + +G Sbjct: 314 LAFRAILGQQITVKAATTLASRFVKAFGEEFKTPFAELHYLCPSSQRISSLKWEEIATIG 373 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + RA+ +I LA TL + ++ +K L + GIG+WTA+Y ALR Q D Sbjct: 374 IIRARAQTIIELAKQMSSNTLKLEAGVNLRLTIKQLTSIAGIGQWTAHYIALRALQWPDA 433 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 F +D ++++ +T Q + ++ W+PWRSYA L++W + Sbjct: 434 FPKEDVALRKKLGKVTAKQAEKLSQVWRPWRSYATLYLWQKKD 476 >UniRef50_UPI0001BC59A0 AraC family transcriptional regulator n=1 Tax=Fusobacterium ulcerans ATCC 49185 RepID=UPI0001BC59A0 Length = 486 Score = 272 bits (695), Expect = 1e-71, Method: Composition-based stats. Identities = 98/288 (34%), Positives = 151/288 (52%), Gaps = 14/288 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAV----GEYRGVVTAIPDIARH 57 L ++PPY W +L FLA RA+ VETV + Y R++ + A ++ Sbjct: 199 LALGYRPPYQWEHILNFLALRAIPGVETVKEGKYYRTVHFLNGEKHIYSWIQAENQPEKN 258 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP-----GLRLPGCVD 112 T+ + + A L PV ++ LAK+ LFDL C+P V L ++ +P G+R+PGC D Sbjct: 259 TIAVTMPAELLPVLSQVLAKVRNLFDLSCDPYAVYEGLMKMNNIQPNICTLGIRVPGCFD 318 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADP---Q 167 FE VRA+LGQ +++ A L AR+ + +G ++ E FP P+ + Sbjct: 319 PFEMSVRAVLGQQITIKAAKTLAARITEKFGVTIETGIEGLTHIFPEPEDIYKLKDKITD 378 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 +L LG+ RA+ ++ LA+A + + E+ +K L GIG WTA Y A+R Sbjct: 379 SLGELGIIKTRAKTILELASAFVNKEIDFNFCIHPEEEIKKLMKISGIGNWTAQYIAMRA 438 Query: 228 WQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D FL DY +K+ P TP +I AE W+PWRSYA++++W + Sbjct: 439 MGLTDAFLETDYGVKKALPSYTPKEILTLAEAWRPWRSYAVVNLWNSL 486 >UniRef50_C8QGF9 Transcriptional regulator, AraC family n=1 Tax=Pantoea sp. At-9b RepID=C8QGF9_9ENTR Length = 495 Score = 270 bits (692), Expect = 3e-71, Method: Composition-based stats. Identities = 97/288 (33%), Positives = 146/288 (50%), Gaps = 8/288 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PPYDW +L FL R + VE VA+ Y R++A+G +G V + L + Sbjct: 200 LRLSYRPPYDWEAILDFLQQRVMKEVEWVAEGIYHRTVALGGCQGWVRVSHYPEKQALKV 259 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + L PV L ++ LFDL PQ + + L PGLR+PG D FE Sbjct: 260 QFTTSLTPVLPALLRRLRDLFDLDAQPQRIADQLAQDPLLAPSLVRYPGLRVPGAFDGFE 319 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKALG 173 GVRAILGQ V+V A L++RVAQ +G + P+ + LA A + +LG Sbjct: 320 LGVRAILGQQVTVKAATTLSSRVAQRFGAPMATPWPELSRLSPSAETLATATQDDIASLG 379 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + R++A++ LA A G L + + + L GIG WTA+Y A+R + D Sbjct: 380 IVSARSQAILALAQACASGALRFNGAVNPDVVQQQLLALKGIGPWTASYIAMRALRWPDA 439 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 F +D I+ G++ ++ W+PWRSYA+LHIW + + + Sbjct: 440 FPKEDIAIRNNLGGVSAKDAEVRSQVWRPWRSYAVLHIWKSLTPEKGD 487 >UniRef50_A3ETF2 Putative Ada DNA repair protein and transcriptional regulator, AraC family n=2 Tax=Leptospirillum sp. Group II RepID=A3ETF2_9BACT Length = 480 Score = 267 bits (684), Expect = 3e-70, Method: Composition-based stats. Identities = 106/285 (37%), Positives = 158/285 (55%), Gaps = 6/285 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIAR 56 +++L ++PPY W M FL R V+ VE V++ Y R++ + + +G ++A D Sbjct: 196 VFSLGFRPPYAWEAMFDFLGNRTVAGVEEVSEKVYRRAVRIRKGGTTFQGWLSAEADNTG 255 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L + LS L PVA LA++ RLFDL+C+P+++ LG LG PG+R+PG D FE Sbjct: 256 KALRLTLSTSLAPVATTVLARVRRLFDLECHPELIADILGPLGMREPGIRVPGAFDGFET 315 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALGM 174 VR ILGQ VSV A L R+ +G+ +D FP+P+R+A D AL LG+ Sbjct: 316 AVRIILGQQVSVQGARTLAGRLVSAHGDPIDTPWPDITRAFPSPERIAGMDASALSGLGI 375 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 R +A++ LA A EG + + D E M+ L++ PGIG WTA A+R D F Sbjct: 376 FGFRIKAILGLAAAVAEGRITLAPGPDPEPQMEALRSIPGIGEWTAQAIAMRVLSWPDAF 435 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DY I++ +P ++ A++W+PWR+YA L +W P Sbjct: 436 PHTDYGIQKALKEKSPRRVLEVAQQWRPWRAYAALALWRALVESP 480 >UniRef50_A8GHQ3 Transcriptional regulator, AraC family n=3 Tax=Proteobacteria RepID=A8GHQ3_SERP5 Length = 512 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 134/286 (46%), Positives = 177/286 (61%), Gaps = 6/286 (2%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIAR 56 ++ L ++PPYDW ML FL RAVS VE V Y R++A+ + G V+ P+ + Sbjct: 222 VFHLGYRPPYDWPRMLSFLQTRAVSGVEKVEGQQYLRAIAITQGGIDYHGWVSVQPEESH 281 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 + + + ++ L V E L ++ +LFDL P ++ ALG+L A PGLRLPGCV+ FEQ Sbjct: 282 NRVRVEIAPALSRVTTEVLRRIRQLFDLDAAPDLIVQALGQLAADAPGLRLPGCVNGFEQ 341 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALGM 174 RA+LGQLVSV MAA +A+ +G L+ FP +++A P+ L+ LG+ Sbjct: 342 ATRAVLGQLVSVKMAATFAGCMAERWGTPLEQPYAGITHVFPNAEQVARLQPEELRPLGV 401 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 LKRA ALI +A A EG L + D+EQ +K L PGIG WTA Y A+R W DVF Sbjct: 402 QLKRAAALIAIARAVTEGRLQLENVLDIEQGIKALTALPGIGSWTACYIAMRAWSWPDVF 461 Query: 235 LPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 L DYLIKQRFPGMTP QI YAE W+PWRSYA LH+W+ +GW P Sbjct: 462 LTGDYLIKQRFPGMTPRQIENYAECWRPWRSYATLHLWHNQGWVPS 507 >UniRef50_Q0AGQ6 DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AGQ6_NITEC Length = 477 Score = 258 bits (659), Expect = 2e-67, Method: Composition-based stats. Identities = 87/281 (30%), Positives = 140/281 (49%), Gaps = 9/281 (3%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L+++PP W+ ++ FL +R+ + + + Y +++ + +G VTA D RH ++ Sbjct: 193 LLRLSYRPPLAWNALIRFLCSRSNLRLSQIQNGNYLQTVNLDGCQGWVTAKHDTKRHQIY 252 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + S L P + RLFDL NP I+ + L L A PGLR+PG +D F Sbjct: 253 VQASRSLLPCLIRLQMYLRRLFDLDANPAIIEAHLGNDDILKPLIANHPGLRIPGTLDIF 312 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKAL 172 E G+RAILGQ ++V A L R +G+ +D P + +A QAL + Sbjct: 313 ELGLRAILGQQITVKAATTLFGRFVATFGKPVDTPFPGLDRTSPPAELIADTSLQALIDI 372 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ +RA + A + G L D + ++ L PGIG WTA Y A+R + Sbjct: 373 GLTGRRALTIQRFAQTIVNGALKPES-IDRNKIIEQLLELPGIGPWTAQYIAIRALGDSN 431 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 F D + + PA++ R E+W+PWR+Y +H+W+ Sbjct: 432 AFPASDLGLLRGLRMEKPAELLRRTEKWQPWRAYGAIHLWH 472 >UniRef50_Q1IT49 DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=3 Tax=Bacteria RepID=Q1IT49_ACIBL Length = 477 Score = 254 bits (650), Expect = 2e-66, Method: Composition-based stats. Identities = 95/281 (33%), Positives = 141/281 (50%), Gaps = 12/281 (4%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 ++ L ++PPY W ML FL RA VE V + YARS+++ G +H+L Sbjct: 198 VFRLRYRPPYHWLGMLDFLRPRATPGVECVTEDAYARSISLHGKEGSFEVTHAPEQHSLV 257 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + ++ + + ++ +FDL + + + L PG RLPG D F Sbjct: 258 LRVNFEDSSALFQIVERVRAMFDLNADWGSIAGVLENDRLLRGHLKGDPGRRLPGAWDGF 317 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY-ICFPTPQRLAAADPQALKALG 173 E VRA+LGQ +SVA A L ++A+ +G L FPTP+ LA A +L Sbjct: 318 ELAVRAVLGQQISVAAATNLAGQIARKFGRPLRKSNGISHLFPTPEILADA-----ASLP 372 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P+KRAE + LA A + L DV Q + L+T PGIG WTA Y ALR + D Sbjct: 373 LPMKRAETIRALACAVRDCELQFDAITDVPQFCEQLKTIPGIGDWTAQYVALRALREPDA 432 Query: 234 FLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 F D +++ + A++ R AE W+PWR YA +++W Sbjct: 433 FPAGDLGLQKSLGVKSSAELERRAENWRPWRGYAAIYMWSA 473 >UniRef50_A6SU78 Methylated-DNA-[protein]-cysteine S-methyltransferase n=2 Tax=Oxalobacteraceae RepID=A6SU78_JANMA Length = 499 Score = 253 bits (647), Expect = 4e-66, Method: Composition-based stats. Identities = 103/293 (35%), Positives = 153/293 (52%), Gaps = 19/293 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRGVVTAIPDIAR 56 L ++PPY W ML +LA RA+ VE V + Y RS+ + G + AR Sbjct: 207 LRLAYRPPYAWEPMLAYLAGRAIPGVEGVVEDAPGTLSYVRSVMLNNTAGWLRVTHLPAR 266 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGC 110 L ++L A L PV LA++ + FDL NP+I+ + L + PGLR+PG Sbjct: 267 RQLELSLPATLAPVLMPLLARVRKQFDLDANPEIIAAHLSADALLAQQIRLTPGLRVPGT 326 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQA 168 D FE +RA+LGQ VSVA A ++ R+ + +GE D FPT +RLAAAD Sbjct: 327 FDTFELAIRAVLGQQVSVAGATTVSGRLVKAFGEPADTPFIGINRHFPTAERLAAADIGE 386 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + ALGMP RA+ + ++A A++G L M +++ + +L+T GIG WTA Y A+R Sbjct: 387 IAALGMPGSRAQTIQNVARFAVQGGLQMKPGASLDECVSSLKTVRGIGEWTAQYVAMRAL 446 Query: 229 QAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + D F D +++ +T Q+ A W PWR+Y L +W++ Sbjct: 447 RFPDAFPTGDLGLQKAAVEVAGGTRLTEKQLLLRAAGWSPWRAYTALLLWHSL 499 >UniRef50_Q2RNZ4 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=2 Tax=Bacteria RepID=Q2RNZ4_RHORT Length = 486 Score = 252 bits (643), Expect = 1e-65, Method: Composition-based stats. Identities = 109/285 (38%), Positives = 143/285 (50%), Gaps = 18/285 (6%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L + P+DW +L F RAV +E V Y R++A+G RGVVT L Sbjct: 204 VLRLAARQPFDWPGLLAFFRQRAVPGLERVEGDTYVRAIAIGAARGVVTIRGSAEG--LV 261 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + S A +A++ R+FDL + + + L L AARPGLR+PG D F Sbjct: 262 VTPSLDRPEGLAALVARLRRVFDLDADIGAIGAHLGADPLLAPLVAARPGLRVPGAWDGF 321 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY---ICFPTPQRLAAADPQALKA 171 E VRAILGQ VSVA A L R+ +GE L + P FPT RLA AD L Sbjct: 322 ELAVRAILGQQVSVAAATTLAGRLVGAFGEPLTNAPPAGPSRLFPTAARLAEAD---LGG 378 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 LG+ RA+A+ LA A +E + D++ A+ L PGIG WTA Y ALR Sbjct: 379 LGLTTARAKAISGLARAVVETPGLLDPGPDLDSAVARLCRLPGIGPWTAQYIALRALGEA 438 Query: 232 DVFLPDDYLIKQRFP----GMTPAQIRRYAERWKPWRSYALLHIW 272 D D + + TPA + AE W+PWRSYA+LH+W Sbjct: 439 DALPVGDIGVLRALAEDGVRPTPAALLARAEDWRPWRSYAVLHLW 483 >UniRef50_A7HP34 Ada metal-binding domain protein n=3 Tax=Bacteria RepID=A7HP34_PARL1 Length = 513 Score = 249 bits (637), Expect = 7e-65, Method: Composition-based stats. Identities = 95/286 (33%), Positives = 138/286 (48%), Gaps = 18/286 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PP+D+ +L +L++RA+ VE +++ YARS +G +G+VT P L Sbjct: 226 IRLGYRPPFDFDRILAYLSSRALPGVERISEGRYARSFHLGGVKGLVTVTPAATGSALDA 285 Query: 62 NLSAGLE----PVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCV 111 ++ A++ RLFDL P + + A+G A PGLR+ G Sbjct: 286 RIAVLDAKGGTVPVRAIAARLRRLFDLDAEPGAIAAAFAGDPAIGPRFARVPGLRVAGAF 345 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL--DDFPEYICFPTPQRLAAADPQAL 169 D FE VRA+LGQ +SV A + R+ GE + ++ FP P+ LA AD L Sbjct: 346 DGFELAVRAVLGQQISVKGATTIAGRIVARLGEEVTTEEPGITHFFPAPRALARAD---L 402 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 LG+ R L LA A G L T ++ + L PGIG WTA+Y ALR Sbjct: 403 SGLGLTGGRIATLTSLAQAVASGALDFTPRESLDAKLAELTALPGIGEWTAHYVALRALG 462 Query: 230 AKDVFLPDDYLIKQRFPGMTP---AQIRRYAERWKPWRSYALLHIW 272 D F D +++ P ++ R AE W+PWR YA L +W Sbjct: 463 EPDAFPASDLGLRKAVGKGEPVSTKELERMAESWRPWRGYAALALW 508 >UniRef50_Q46QC3 Transcriptional regulator Ada n=25 Tax=cellular organisms RepID=Q46QC3_RALEJ Length = 509 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 105/294 (35%), Positives = 145/294 (49%), Gaps = 12/294 (4%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIAR 56 ++ L ++PP W +LGFLA RAV +E V D YAR+L+V RG V R Sbjct: 195 VFELGYRPPLAWEALLGFLAVRAVDGIEQVRDGAYARTLSVESGGTTHRGWVRLDHVPGR 254 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L + LSA L V + L K+ RL DL C P IV+ LG L + PG+RLPG VD FE Sbjct: 255 LVLRVTLSASLARVIPQALGKVRRLCDLGCRPDIVDRHLGELASDVPGMRLPGSVDGFEI 314 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDF--------PEYICFPTPQRLAAADPQA 168 VRA++GQ++SV A ++ AR+ Q G+ L P FP+ LAA Sbjct: 315 AVRAVIGQVISVVQARRILARLGQTAGDALPAPAMPIDGCAPLQHGFPSAAALAALPDAD 374 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 + A G+ + L LA G LP+ EQ + L GIG WTA Y A+R Sbjct: 375 MVAAGVSPGKLRTLRALAQRVASGALPLEQHMPPEQTVAALCEIDGIGDWTAQYVAMRAL 434 Query: 229 QAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 D F DY +++ T ++ +W PWR+YA +H+W+ + Sbjct: 435 GWPDAFPGTDYALRKVLGVNTVRAMQARTAQWAPWRAYAAIHLWHRYEAMKTQG 488 >UniRef50_Q02KH7 DNA-3-methyladenine glycosidase II n=8 Tax=Pseudomonas RepID=Q02KH7_PSEAB Length = 297 Score = 245 bits (626), Expect = 1e-63, Method: Composition-based stats. Identities = 102/290 (35%), Positives = 143/290 (49%), Gaps = 16/290 (5%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L +Q P++W A R ++ VE++ D +YARS P R L Sbjct: 9 VLHLPYQSPWEWRQFHQHFALRLLAGVESLGDDHYARSFRANGRPAWFEVRPLAERQVLA 68 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 ++LS +AAE A++ R+FDL +P + + LG L AA PGLRLP D F Sbjct: 69 LSLSPSAHALAAELEARVRRMFDLDSDPAAIARHFAGDPLLGPLVAANPGLRLPVAFDPF 128 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD---FPEYICFPTPQRLAAADPQALKA 171 EQ VRAI+GQ V+V A +T R+ Q GE L++ FPTP LA A+ L Sbjct: 129 EQAVRAIVGQQVTVKAAVTITGRLIQRLGEPLENLGYDGISHLFPTPAALAQAN---LDG 185 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 +GMP KR + L A A G L + + E ++ L PGIG WTA Y ALR Sbjct: 186 IGMPGKRVQTLQRFAAAIASGELSLDLADGPEALVERLCALPGIGPWTAEYIALRAMGEA 245 Query: 232 DVFLPDDYLIKQ----RFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 D F D + + G+ ++ AE W+PWR+YA +H+W+ Sbjct: 246 DAFPAADLGLLKSTVWGPQGIDARSLKARAEAWRPWRAYAAIHLWHHYAA 295 >UniRef50_B0RQX4 DNA methylation and regulatory protein (Methylated-DNA--[protein]-cysteine S-methyltransferase) n=40 Tax=cellular organisms RepID=B0RQX4_XANCB Length = 521 Score = 245 bits (625), Expect = 1e-63, Method: Composition-based stats. Identities = 92/290 (31%), Positives = 137/290 (47%), Gaps = 14/290 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PP D ML FL RA+ +E V Y R + ++ R L + Sbjct: 232 LRLGYRPPLDLPAMLTFLQRRAIPGIEQVDADGYRRVIGAPGQATLIHVSAAPTRDELLL 291 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + A + + ++ R+FDL + V + L + RPGLR+PG D FE Sbjct: 292 RIGATDPRQIPQIVRRVRRIFDLDADLHAVHATLAQDPLLEQAITRRPGLRVPGGWDGFE 351 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKALG 173 VRA+LGQ +SVA AA L AR+ +G L D P FPTP ++A A + L G Sbjct: 352 VAVRAVLGQQISVAGAATLAARLVDRHGGHLPDMPPGLDRSFPTPAQMADAPLEQL---G 408 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P RA L LA+A +G L + + PGIG WTA+Y A+R D Sbjct: 409 LPRARAATLRALASACAQGRLHFGAGQRLPDFVAACTALPGIGPWTAHYIAMRALSHPDA 468 Query: 234 FLPDDYLIKQRFP---GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 F D +++Q ++ ++ W+PWR+YA+LH+W+ + D Sbjct: 469 FPAGDLILQQVLGAPERLSERATEARSQAWRPWRAYAVLHLWHLAVDRKD 518 >UniRef50_D2L7I2 Transcriptional regulator, AraC family n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2L7I2_9DELT Length = 482 Score = 241 bits (616), Expect = 2e-62, Method: Composition-based stats. Identities = 107/281 (38%), Positives = 150/281 (53%), Gaps = 6/281 (2%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE----YRGVVTAIPDIARH 57 TL ++PPYDW +LGFL R++ VE VAD Y R+LA+ + G + A++ Sbjct: 198 LTLGYRPPYDWDGLLGFLCLRSIGGVEAVADGVYRRTLAISRNGVVHAGWLAVAHAPAKN 257 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 + + ++AGL PV L ++S LFDL C+P + L L GLRLPG D FE Sbjct: 258 AVRVTVAAGLLPVLPAVLTRVSHLFDLACDPAAIAAGLAGLADGHEGLRLPGAADGFEVA 317 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQRLAAADPQALKALGMP 175 VRAILGQ V+VA A L R A +GE + + FP P R+A A+ + G+ Sbjct: 318 VRAILGQQVTVAGARTLARRFAAAFGEPVSTPFADLTTVFPGPARVAGLTVDAIASQGIL 377 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA A+I LA A EG L ++ DV L PGIG WTA+Y A+R D F Sbjct: 378 AARARAIIGLARAMAEGGLVLSPAADVAATRAALLALPGIGAWTADYIAMRALAWPDAFP 437 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 D+ +K+ P ++ A W+PWR+YA++H+W + Sbjct: 438 HTDFGVKKALGETDPKRVLERAAGWRPWRAYAVMHLWRSLQ 478 >UniRef50_UPI0001901D5D methylated-DNA--protein-cysteine methyltransferase n=1 Tax=Mycobacterium tuberculosis T85 RepID=UPI0001901D5D Length = 361 Score = 237 bits (606), Expect = 2e-61, Method: Composition-based stats. Identities = 81/292 (27%), Positives = 123/292 (42%), Gaps = 17/292 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + P+ + + G LAA AV E V D Y R+L + G+V+ P + Sbjct: 67 LRLPVRAPFAFEGVFGHLAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLL 126 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L A+ RL DL +P+ + + L + PG R+P VD E Sbjct: 127 VL--DDFRDLMTATARCRRLLDLDADPEAIVEALGADPDLRAVVGKAPGQRIPRTVDEAE 184 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKALG 173 VRA+L Q VS A+ R+ YG + D FP+ ++LA DP L Sbjct: 185 FAVRAVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHLA--- 241 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P R + L + + +L + D ++A L PG+G WTA A+RG D Sbjct: 242 VPKARQRTINALVASLADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDA 301 Query: 234 FLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 F D ++ + ++ RW+PWRSYA H+W T ++ Sbjct: 302 FPASDLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQ 353 >UniRef50_Q6MR46 DNA methylation and regulatory protein Ada n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MR46_BDEBA Length = 479 Score = 237 bits (606), Expect = 2e-61, Method: Composition-based stats. Identities = 78/284 (27%), Positives = 125/284 (44%), Gaps = 16/284 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PP+D++ +L F + AV +E + R + V G +T + + Sbjct: 193 IRLSYRPPFDFTGLLHFYRSHAVGQLEWFEEGLMHRIIEVNGKVGQITLSDLPDESCIKL 252 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + ++++ L DL +P I+ + + L PG+RLP D FE Sbjct: 253 EIDFPDTTALHTIISRVRSLLDLDSDPVIIANVLETDKDMKALLKKHPGIRLPSSWDPFE 312 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGER---LDDFPEYICFPTPQRLAAADPQALKAL 172 V AILGQ+VSV L + L G L D FPTP ++ AD ++LK Sbjct: 313 VVVAAILGQVVSVERGRALVNDLIDLAGSDSGLLRDGKSVRLFPTPAQVIKADLKSLKT- 371 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +R E L+ L+ A + G L + DV+ ++ + PGIG WTA+Y AL+ + D Sbjct: 372 --TTRRKETLVALSKALINGDLSLEPAQDVDSFVEKILGIPGIGPWTASYMALKALRHTD 429 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 F D +I + + E + PWR Y +W Sbjct: 430 AFPATDLIIARAIAEHPKTKF----ESFSPWRGYVAALLWREYS 469 >UniRef50_Q10630 Methylated-DNA--protein-cysteine methyltransferase n=52 Tax=Actinomycetales RepID=ADA_MYCTU Length = 496 Score = 237 bits (606), Expect = 3e-61, Method: Composition-based stats. Identities = 81/292 (27%), Positives = 123/292 (42%), Gaps = 17/292 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + P+ + + G LAA AV E V D Y R+L + G+V+ P + Sbjct: 202 LRLPVRAPFAFEGVFGHLAATAVPGCEEVRDGAYRRTLRLPWGNGIVSLTPAPDHVRCLL 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L A+ RL DL +P+ + + L + PG R+P VD E Sbjct: 262 VL--DDFRDLMTATARCRRLLDLDADPEAIVEALGADPDLRAVVGKAPGQRIPRTVDEAE 319 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKALG 173 VRA+L Q VS A+ R+ YG + D FP+ ++LA DP L Sbjct: 320 FAVRAVLAQQVSTKAASTHAGRLVAAYGRPVHDRHGALTHTFPSIEQLAEIDPGHLA--- 376 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +P R + L + + +L + D ++A L PG+G WTA A+RG D Sbjct: 377 VPKARQRTINALVASLADKSLVLDAGCDWQRARGQLLALPGVGPWTAEVIAMRGLGDPDA 436 Query: 234 FLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 F D ++ + ++ RW+PWRSYA H+W T ++ Sbjct: 437 FPASDLGLRLAAKKLGLPAQRRALTVHSARWRPWRSYATQHLWTTLEHPVNQ 488 >UniRef50_C0WE04 Transcriptional regulator n=1 Tax=Acidaminococcus sp. D21 RepID=C0WE04_9FIRM Length = 483 Score = 237 bits (605), Expect = 3e-61, Method: Composition-based stats. Identities = 95/287 (33%), Positives = 151/287 (52%), Gaps = 15/287 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGE-----YRGVVTAIPDIAR 56 TL ++PPY S + FL RA+ +ETV++ Y R++ + Y G+++ P+ Sbjct: 197 LTLTYRPPYLASPLFDFLKGRAMKGIETVSEGIYKRTVTLAGEKGARYHGIISVSPNKKC 256 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPG-----LRLPGCV 111 + L + LS L PV ++ + ++SR FDL P+ + L + PG +R+PG Sbjct: 257 NALTLTLSDSLLPVLSDVIFRVSRQFDLAAFPETIAAVLYAMNDGVPGTFAEGIRIPGAF 316 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADP--- 166 D FE VRAILGQ ++V A+ L AR + G ++ FPTP+++ + Sbjct: 317 DGFETAVRAILGQQITVKAASTLAARFVAVLGTPIETGHPGLTHLFPTPEKILSYGESLS 376 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 L LG+ ++ ++ LA A ++G+L + E+ K L GIGRWT++Y A+R Sbjct: 377 DELGKLGIISSKSASIRALAQALMDGSLRLDGTRSREETKKALLALKGIGRWTSDYIAMR 436 Query: 227 GWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 + D+FL D IK PG TP + AE W+P+RSYA + +W Sbjct: 437 VLKDPDIFLETDAGIKHALPGTTPKERLTLAEAWRPFRSYATVSLWR 483 >UniRef50_B0SWZ0 Transcriptional regulator, AraC family n=7 Tax=Bacteria RepID=B0SWZ0_CAUSK Length = 505 Score = 237 bits (605), Expect = 4e-61, Method: Composition-based stats. Identities = 95/292 (32%), Positives = 142/292 (48%), Gaps = 19/292 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 TL ++PPYDW ML FLA RA+ VE + + Y R +A+ G + P I L + Sbjct: 206 LTLRYRPPYDWDAMLAFLALRAIPGVEVIESNTYRRVIALDGAAGTIAVSP-IDGDRLSV 264 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + LA++ +FDL +P + + L R+ RPGLR+PG D FE Sbjct: 265 AVRFPKLSALPRILARVRGVFDLSADPVGIAAVLSRDPDLARMVGLRPGLRVPGAWDGFE 324 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERL----DDFPEYICFPTPQRLAAADPQALKA 171 VRAILGQ ++V A KL + +GE L + FP+ +RLAA + L Sbjct: 325 LAVRAILGQQITVVQARKLAGDLVAAHGEPLAQPWTEPGLTHAFPSAERLAATN---LSG 381 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + MP R L +A A + ++ +++ ++ L+ PGIG WTA Y A+R + Sbjct: 382 MKMPGARIRCLSAMAQAIADAPNLLSPTAGLDEMVRRLRALPGIGEWTAQYIAMRQLREP 441 Query: 232 DVFLPDDYLIKQRFP-----GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D F D + + T Q+ AE W+PWR+YA LH+W + + Sbjct: 442 DAFPAADVALMRALADVDGVRPTAEQLLTRAEAWRPWRAYAALHLWASLADE 493 >UniRef50_Q12D18 Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II n=3 Tax=Proteobacteria RepID=Q12D18_POLSJ Length = 504 Score = 235 bits (599), Expect = 2e-60, Method: Composition-based stats. Identities = 93/293 (31%), Positives = 137/293 (46%), Gaps = 14/293 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS----YYARSLAVGEY----RGVVTAIPD 53 L ++PPYD + MLGF + R +S++E VA R+ V G + A D Sbjct: 212 IRLGYRPPYDVAAMLGFFSKRTISAIEFVAADAQHPSIGRTFRVESGGKVHAGWLLAAFD 271 Query: 54 IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDA 113 R L +N+S L V + ++ FDL +P +N L GLR+PG +D Sbjct: 272 ETRSRLVLNVSDSLREVLPLVIRRVRATFDLDADPAAINSVLHAGFPQGDGLRVPGALDG 331 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKA 171 +E VRA+LGQ ++VA A L R+ +GE + FP P LAAA AL Sbjct: 332 YELAVRAVLGQQITVAAARTLAQRMVDRFGEPVQTPWPQLTRLFPAPAMLAAASGDALGQ 391 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 LG+ +R A++ +A A + L + DV ++ L+ PGIG WTA Y A+R + Sbjct: 392 LGIVRQRQAAIVGIAQAVADKRLQLHSGADVHATLEALKALPGIGDWTAQYIAMRALRWP 451 Query: 232 DVFLPDDYLIKQRFP----GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 D F D + + + + WKPWRSYA++ W +P Sbjct: 452 DAFPAGDVALHKAMGVQGLKNPAREAELASHAWKPWRSYAVIRAWSGTLERPG 504 >UniRef50_A6EY17 Transcriptional Regulator, AraC family protein n=1 Tax=Marinobacter algicola DG893 RepID=A6EY17_9ALTE Length = 504 Score = 234 bits (598), Expect = 2e-60, Method: Composition-based stats. Identities = 93/296 (31%), Positives = 128/296 (43%), Gaps = 21/296 (7%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L +PP+D +L F ARA+ +E V +YARSL + G+V P + Sbjct: 210 VLFLRARPPFDSEQLLAFFRARAIPGLEAVGAHHYARSLCIAGQPGLVICRPSDHPPGVQ 269 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + L E A++ RL DL + + + + L PGLR+PG + F Sbjct: 270 VILRGPARQSILEVSARIRRLLDLDADLPGISEHLARDPLMEPLVTQHPGLRVPGSWERF 329 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-----YICFPTPQRLAAADPQAL 169 E VRAILGQ VS++ A L R+ YG+ L D FP P L L Sbjct: 330 EFSVRAILGQQVSISAARTLAGRLVARYGQPLPDDLARGTGITHRFPEPAALVGQP---L 386 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 LGMP RA+ L + E P D + L + GIG WT Y ALRG Sbjct: 387 NTLGMPGSRADTLARITARFAE---PGFAEQDGNDLLAQLASMRGIGPWTLQYLALRGLG 443 Query: 230 AKDVFLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 D F D I + + R+AERW+PWR+YA ++W + P Sbjct: 444 DPDAFPASDLGILKAASHLGGPQDAKALTRHAERWRPWRAYAAQYLWTSLNAHPPR 499 >UniRef50_C0Q970 AlkA n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0Q970_DESAH Length = 353 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 68/285 (23%), Positives = 120/285 (42%), Gaps = 16/285 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + P D+S ++ F+ RA+ VE + D Y+R+ +G + + + + Sbjct: 71 LLLPYARPLDFSQVIEFMKFRAIQGVEDIEDQRYSRTFRTNRSKGYFIVRDNPGKSAIEL 130 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + E ++ +FDL + + + L + + RLP ++FE Sbjct: 131 TIYCDDIRCYMEIYNRVRLMFDLNTDFFPINKKFIKDKLLSKGMSDGHVPRLPIAFNSFE 190 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD---FPEYICFPTPQRLAAADPQALKAL 172 +RA+LGQ +SV A+ L +R+A+ G + + FP P+ L + + Sbjct: 191 FCIRAVLGQQISVQAASTLASRIAKKAGPQTEKNFPPGLDYFFPGPEELVKTSLEGI--- 247 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ R + ++A L+ + E K GIG WT NY A+R D Sbjct: 248 GITGVRQATITNIAQGLLDNVFSLNPNQPFETFQKDFSAIRGIGEWTVNYVAMRSLGMVD 307 Query: 233 VFLPDDYLIKQRF----PGMTPAQIRRYAERWKPWRSYALLHIWY 273 F D I + +I + AE+W+P+R+YA L +W Sbjct: 308 SFPAADLGIIKALEKNGKRPGRKEILKQAEKWRPYRAYAALCLWN 352 >UniRef50_A4SQS2 DNA methylation and regulatory protein n=2 Tax=Aeromonas RepID=A4SQS2_AERS4 Length = 522 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 87/287 (30%), Positives = 130/287 (45%), Gaps = 22/287 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PPYD + ML F RA+ +E V + Y R VG+ G + H++ + Sbjct: 213 LQLPYRPPYDVAAMLAFYRLRAIPGLERVDGNVYERRHRVGDQSGWIRIEQGK-GHSIRL 271 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 + + L ++ R++DL + Q + + L RL + PG+RLP D +E Sbjct: 272 TVHDLPPAALPDLLYRVRRMWDLDADMQRIGERLGQDPLLARLQSRWPGVRLPAGWDEYE 331 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 +RAI+GQ VSV A + R+ + PTP +L A D L +GMP Sbjct: 332 VMLRAIVGQQVSVKGAITIMGRLLAR----TEAQFGVAQLPTPAQLCALD---LDGIGMP 384 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 R L LA A GTL + D + L PGIG WT Y+ LR Q D F Sbjct: 385 GSRIRTLQGLAAALASGTLSLNTASD-----EQLLALPGIGPWTVAYWRLRCGQDPDAFP 439 Query: 236 PDDYLIKQRFPGMTP---AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D ++++ G ++ +E W+PWR YA +W+ QP Sbjct: 440 ASDLVLQKALGGGDKLPVKEVLVQSEAWQPWRGYAASWLWHAMSEQP 486 >UniRef50_A7HG85 AlkA domain protein n=2 Tax=Myxococcales RepID=A7HG85_ANADF Length = 485 Score = 231 bits (590), Expect = 2e-59, Method: Composition-based stats. Identities = 95/286 (33%), Positives = 131/286 (45%), Gaps = 19/286 (6%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHT-L 59 + L+++PP DW +L FLAAR + VE V Y R++ +G G V+ D AR T L Sbjct: 196 VLRLDFRPPLDWEALLAFLAARCTAGVEQVEGGAYRRTVRLGGRTGWVSVTRDPARPTAL 255 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDA 113 S L A++ DL P V + L R PGLR+PG D Sbjct: 256 RAEASLSLAGALMPLAARLRAQLDLDARPDAVASRLRRDPLLARALRRHPGLRVPGAFDG 315 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAA-------DP 166 + VR I+GQ VSVA A ++ R+A GE + FP RLA + Sbjct: 316 LDAAVRVIVGQQVSVAAATTVSGRLAAALGEPVATP-----FPGLDRLAPSAEAIAAAGV 370 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALR 226 A+ +GMP RA ++ LA A G L + GD E L G+G WTA A+R Sbjct: 371 DAIARVGMPGARARTILELARAVAGGGLALHRGGDGEAVRAGLLELSGVGPWTAEVVAMR 430 Query: 227 GWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D F D + + + + AE W+PWRSYA++H+W Sbjct: 431 ALGEPDAFPASDLGVLRALGASSALEAEARAEAWRPWRSYAVMHLW 476 >UniRef50_A5KSU6 Transcriptional regulator, AraC family n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KSU6_9BACT Length = 464 Score = 231 bits (589), Expect = 2e-59, Method: Composition-based stats. Identities = 91/287 (31%), Positives = 137/287 (47%), Gaps = 27/287 (9%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 M +++PP+DW +LGF+ RA S E D+ Y R + E + A++ L Sbjct: 194 MLRTDYRPPFDWDLLLGFIKKRATPS-EWATDTTYHRLIGSDE----IVVRNVPAKNYLT 248 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 I + L A L K+ RLFDL NP ++ + L A PG+R+PGC D F Sbjct: 249 IEVPQKLSRHAHAILMKVRRLFDLDANPSVITTVLTNDPYLKPFLADNPGVRVPGCWDNF 308 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 E +RA++GQ VSV+ A + R+ + G TP LAA+ + ++GM Sbjct: 309 EMLIRAVVGQQVSVSAATTVMRRLVERIGS------------TPDTLAASSADEIASIGM 356 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 PLKRA + LA+ + + D ++ Q GIG WT Y LR D Sbjct: 357 PLKRATTIHTLAHKVKNSDIDLN-ECDPQRFADQFQHISGIGPWTIAYLQLRILHWPDAL 415 Query: 235 LPDDYLIKQRF---PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 +D +++ +T A++ +YAE W+PWRSYA+ +W Q Sbjct: 416 PAEDIGLQRALIPYKRITKAELSKYAEAWRPWRSYAVFLLWNASSNQ 462 >UniRef50_B1ZFN9 AlkA domain protein n=6 Tax=Methylobacterium RepID=B1ZFN9_METPB Length = 376 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 93/293 (31%), Positives = 136/293 (46%), Gaps = 19/293 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 + L +PP+DW + F A A VETV YAR+ + G ++ + ++ I Sbjct: 16 FRLALRPPFDWGHLERFFADHASPGVETVTPGRYARTFLLAGRPGTLSVTCERGSLSVRI 75 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 EP L ++ +FDL +P + + + L A RPGLR+PG D FE Sbjct: 76 RGPEADEPF-EAILTRLRAMFDLGADPDAIAAGLGRDPTMAALVARRPGLRMPGAFDGFE 134 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERL------DDFPEYICFPTPQRLAAADPQAL 169 VRAILGQ VSVA A +L R+ +G L D+ FPTP++L A+ + Sbjct: 135 LAVRAILGQQVSVAAATRLAGRLVAAFGTPLGPKVGGDEPGLTHLFPTPEQLLEAEISLV 194 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 L MP R A+ LA A L GD++ + L+ PGIG WTA+Y A+R Sbjct: 195 --LNMPRARGRAIQGLAAAVLATPDLFAPGGDLDATVARLKALPGIGDWTAHYIAMRALA 252 Query: 230 AKDVFLPDDYLIKQRF----PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D F D + + + A W+PWR+YA +H+W + + Sbjct: 253 QADAFPAGDVGLMRALDDGAGRPGRVALLDRAAAWRPWRAYAAIHLWAEDAAR 305 >UniRef50_C4L050 DNA-3-methyladenine glycosylase II n=4 Tax=Bacillales RepID=C4L050_EXISA Length = 297 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 63/290 (21%), Positives = 115/290 (39%), Gaps = 16/290 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L Q P+D+ L FL+ + + + +GE ++ ++ + Sbjct: 4 MLLAVQQPFDFQECLVFLSRSEQEVLHVTTPDMVRKLMRIGERLILIELREEVNHIHVRF 63 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFE 115 E ++ DL+ + + L L GLR+ G D FE Sbjct: 64 PFDEVSETEKEHVAREVRNWLDLERDLKPFETMGAKDELLAPLIETHRGLRMIGFPDLFE 123 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGER-LDDFPEYICFPTPQRLAAADPQALKALGM 174 AI+GQ ++++ A + R + YG+ + + Y FP +R+A +P+ L+ L Sbjct: 124 ALTWAIIGQQITLSFAYTIKRRFVERYGDHRVIEGRAYWTFPRAERIALLEPEELRELQF 183 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDV--EQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 ++AE +I +A G L T + L G+G WTA+Y ++ +Q Sbjct: 184 SRRKAEYVIDIAREITNGDLSKTALQSHSSADIRQRLLAIRGVGAWTADYVLMKCFQDAS 243 Query: 233 VFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 F D + Q T +++RY E W+ + YA ++W + Sbjct: 244 AFPIADVGLHQAIQHQLGTAKKPTIEEVKRYGESWQGFEGYATFYLWRSL 293 >UniRef50_Q1ZAD8 Hypothetical ada regulatory protein n=2 Tax=Photobacterium profundum RepID=Q1ZAD8_PHOPR Length = 514 Score = 221 bits (563), Expect = 2e-56, Method: Composition-based stats. Identities = 88/300 (29%), Positives = 122/300 (40%), Gaps = 44/300 (14%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVG------------------ 42 + TL+++PPY+W + F A+R + +E ++ Y R+ + Sbjct: 212 VITLSYRPPYNWQHLQQFYASRIIEGLEWCDENSYGRTFSFDSDDCSHSVLNTGQNINHS 271 Query: 43 ----EYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRL 98 + G TA + + + + + R DL + + + L R Sbjct: 272 EDAFDCIGEFTAFHIPEKSVFLVRIQLSDLRYLNRVIRNIRRCLDLDADIEHIEARLKRA 331 Query: 99 ----GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 A GLRLPG FE G+RAILGQ VSV A L +V G + E Sbjct: 332 LNTDILAISGLRLPGTWSPFEAGIRAILGQQVSVQAARNLVTKVV---GNNPINTDERCY 388 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPG 214 FP PQ+L A L L MP KR E + LAN A L + K L G Sbjct: 389 FPLPQQLIA---DELTYLKMPGKRKETIRLLANYACNKPLDDS---------KALLAIAG 436 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 IG WT +Y +RG D+FL D IK+ M A A PWRSY L++W Sbjct: 437 IGPWTVHYLRMRGLSDPDIFLIGDLGIKKALAKMNEAFSPDAA---APWRSYLTLYLWSA 493 >UniRef50_A1WKZ8 DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=4 Tax=Bacteria RepID=A1WKZ8_VEREI Length = 581 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. Identities = 92/286 (32%), Positives = 128/286 (44%), Gaps = 18/286 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS----YYARSLAVGEY--------RGVVT 49 L W+PP D + +L F A R + VE V R++ + G ++ Sbjct: 286 LRLAWRPPLDVAALLAFFARRQLHGVEWVLPDGAGPILRRTVRLAPGCTGQPREIIGWIS 345 Query: 50 AIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPG 109 A D +RH L + S L PV + ++ L DL +P +N L GLRLPG Sbjct: 346 ARFDGSRHLLLLQASDSLYPVLPLVIRRVRALLDLDADPAAINAVLHPHFPQGDGLRLPG 405 Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQ 167 D FE VRA+LGQ V++A A L R+ + G+ + FP P LAA D Sbjct: 406 AFDGFELAVRAVLGQQVTLAAARTLGQRLVERLGQTIATPWPELQRLFPAPATLAATDGA 465 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 L +G+ +R A++ LA A G L + D E+ L PGIG WTA Y A+R Sbjct: 466 VLGQMGIVRQRQAAIVALARAVDGGQLALHDGADPEKTTAALCALPGIGDWTAQYIAMRV 525 Query: 228 WQAKDVFLPDDYLIKQRFP----GMTPAQIRRYAERWKPWRSYALL 269 + D F D + + A+ W+PWRSYALL Sbjct: 526 LRWPDAFPSGDVALHKALGLQGQKNPARAATAAAQAWRPWRSYALL 571 >UniRef50_B7RWC6 AlkA N-terminal domain family protein n=1 Tax=marine gamma proteobacterium HTCC2148 RepID=B7RWC6_9GAMM Length = 471 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. Identities = 80/282 (28%), Positives = 127/282 (45%), Gaps = 24/282 (8%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PPYDW+ ++ FL+ A++ VE + DS Y R+ + P ++ L + Sbjct: 204 LQLQYRPPYDWNGVVDFLSHHAIAGVEEINDSRYRRNFRTTAGVAQLEIKPHKNKNALEL 263 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L + ++ R+FDL NP+ + + ALG L PG R PG FE Sbjct: 264 RLQLPDNSRLMSTVGQVRRMFDLDANPEQISALLQQDTALGPLSKRSPGARSPGHWSLFE 323 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 VRAI+GQ VS A + AR+A+ + FP +AA + MP Sbjct: 324 SAVRAIVGQQVSTVAARTVLARLAKAC-----TKEGIVTFPDAADIAALTDEHFP---MP 375 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +R E L L + + E ++ L F G+G WT A+RG DVF Sbjct: 376 SRRRETLRSLCQTYSD--------REDELTLEALADFKGVGPWTVGMVAVRGAGDPDVFP 427 Query: 236 PDDYLIKQRFPGMTPAQ--IRRYAERWKPWRSYALLHIWYTE 275 D +++ + + ++ + A +W+PWRSYA +W + Sbjct: 428 TGDLGLERTWATLPGSEGKLNDAAAQWRPWRSYAANLLWRSY 469 >UniRef50_C7R5W7 Transcriptional regulator, AraC family n=1 Tax=Kangiella koreensis DSM 16069 RepID=C7R5W7_KANKD Length = 461 Score = 220 bits (561), Expect = 5e-56, Method: Composition-based stats. Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 22/278 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PPYDWS M FL R +S++ETV D+ Y R+ ++ +G +A D +R + ++ Sbjct: 202 LKLHYRPPYDWSLMQDFLKQRELSAIETVTDNCYGRTFSIDSSKGHFSAEIDPSRSSFNV 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP----GLRLPGCVDAFEQG 117 + + R+ DL + +++ +L + +P GLRLP D FE G Sbjct: 262 TIEMDDMSKLLTATHHIRRVLDLNSDLEVIENSLAQDVNIKPVLKSGLRLPATWDTFEAG 321 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 V+AILGQ VSV A TA V + G + +D +Y FPT +++ D LK MP Sbjct: 322 VKAILGQQVSVKAAYTHTASVIEQLGSKYND--QYKLFPTAKQIVNGDLTFLK---MPNS 376 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R + L A L + + ++ GIG WT Y LR D F Sbjct: 377 RKQTLHDFAQWYLS---------TSGEDLASILDIKGIGPWTYEYIKLRSGMDSDAFPEK 427 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D + + +E+W+PWRSYA L +W++ Sbjct: 428 DLGVIKAMEQYN----LTNSEQWQPWRSYATLQLWHSL 461 >UniRef50_C4DFD0 DNA-3-methyladenine glycosylase II; Transcriptional regulator Ada; DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DFD0_9ACTO Length = 413 Score = 219 bits (559), Expect = 6e-56, Method: Composition-based stats. Identities = 87/285 (30%), Positives = 123/285 (43%), Gaps = 17/285 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++ P + G LAA AV VE D Y R+L + GVV+ P + Sbjct: 116 LRLPFRQPLCPDNVFGHLAATAVPGVEEWRDGAYRRTLRLPHGPGVVSLRPGPDHVGCVL 175 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNG------ALGRLGAARPGLRLPGCVDAFE 115 + +A+ L DL +P V+ L L A PG R+P VD E Sbjct: 176 --WLSDLRDLSIAIARCRWLLDLDADPVAVDELLSRDEVLAPLVAKAPGRRVPRTVDPGE 233 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQALKALG 173 VRA+LGQ VS A A AR+ YG+R++D FP+P LA DP L Sbjct: 234 FAVRAVLGQQVSTAAARTHAARLVARYGQRVEDPGGGLTHLFPSPGELAGLDPDGLA--- 290 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 MP+ R L+ L A ++G + + + D A + L PG G WT A+R D Sbjct: 291 MPVSRKNTLLGLVRALVDGDVELGVGVDWRSAKEALSALPGFGPWTVESIAMRALGDPDA 350 Query: 234 FLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYT 274 F+ D I+ + + W PWR+YA+ ++W T Sbjct: 351 FVASDLGIRLAAEQLGLPTGARALVERSRAWMPWRAYAVQYLWAT 395 >UniRef50_Q2T2N2 DNA-3-methyladenine glycosylase II n=65 Tax=Burkholderia RepID=Q2T2N2_BURTA Length = 343 Score = 219 bits (559), Expect = 8e-56, Method: Composition-based stats. Identities = 98/298 (32%), Positives = 134/298 (44%), Gaps = 19/298 (6%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 ++ L ++PPYDW +L F A RA+ VE V Y R++ G +T + L Sbjct: 46 VFELPFKPPYDWPRVLRFFAGRAIPGVEAVEGGAYRRTVDYRGAVGALTVRKHPRKRCLV 105 Query: 61 INLSAGLEPVAAECLAKMSR-LFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDA 113 + A A +FDL +P + + L L A PGLR+PG + Sbjct: 106 ATVEGDAARHADAAFAARLATMFDLHADPAAIGAHLARDAWLAPLVDAAPGLRVPGAWSS 165 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD---FPEYICFPTPQRLAAADPQALK 170 FE VRAI+GQ VSV A + R+ + GERL FP P LAA D L Sbjct: 166 FELIVRAIVGQQVSVKAATTIVGRLVERAGERLVGHAPGATGWRFPEPAALAACD---LS 222 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ-AMKTLQTFPGIGRWTANYFALRGWQ 229 +GMP KRA AL +A A G +P+ L PGIG WT Y A+R W+ Sbjct: 223 RIGMPGKRAAALQGVARAVAAGDVPLDAYATDPAGVRAALLALPGIGPWTVEYVAMRAWR 282 Query: 230 AKDVFLPDDYLIKQRFPGMT-----PAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 D + D ++ Q PA R A+ W+PWR+YA +H+W + A Sbjct: 283 DADAWPATDLVLMQAIVARDPALDRPASQRLRADAWRPWRAYAAMHLWNEIADRAGSA 340 >UniRef50_B9DJS2 Putative uncharacterized protein n=1 Tax=Staphylococcus carnosus subsp. carnosus TM300 RepID=B9DJS2_STACT Length = 341 Score = 219 bits (559), Expect = 8e-56, Method: Composition-based stats. Identities = 94/307 (30%), Positives = 142/307 (46%), Gaps = 33/307 (10%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYR------GVVTAIPDIA 55 + L +Q PY W+ M+ +L+ RA+ VE V D+YYAR++ + + G + + Sbjct: 36 FNLYYQTPYIWTAMIDYLSKRAIPRVEIVQDNYYARTVLLKDTATKRAVKGWLKVKNNTK 95 Query: 56 RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFE 115 + L + +SA L + + K+ FDL+ NP+I+N L GLR+PG + FE Sbjct: 96 NNALLVEMSASLIHEWNKIIQKLRHFFDLEVNPEIINKTLNEDW-ITKGLRVPGAFNGFE 154 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI--CFPTPQR---LAAAD---PQ 167 GVRAILGQ ++V A ++ R+ G + FP P++ LA D Sbjct: 155 LGVRAILGQQITVKAATTISGRLVHALGTPFKTKIAGLDTLFPIPEKFVYLAHCDTPISD 214 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTI-----------------PGDVEQAMKTLQ 210 L LG+ ++R+ + LA A + G + + E M L Sbjct: 215 LLGPLGVTVRRSNTIAALAEAIVNGEVQLNPVVHGVESSIPSNRYNTQMETAESEMNRLL 274 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-GMTPAQIRRYAERWKPWRSYALL 269 GIG+WTA Y +R D FL D IK P TP AE+W P RSYA++ Sbjct: 275 AIKGIGKWTAQYIGMRALGYTDSFLETDIGIKNAMPNDTTPKSRLAVAEKWHPLRSYAVV 334 Query: 270 HIWYTEG 276 ++W T Sbjct: 335 NLWNTLN 341 >UniRef50_A1TR03 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada n=9 Tax=Comamonadaceae RepID=A1TR03_ACIAC Length = 534 Score = 219 bits (558), Expect = 1e-55, Method: Composition-based stats. Identities = 92/299 (30%), Positives = 137/299 (45%), Gaps = 22/299 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVA--DSYYARSLAV-----GEYRGVVTAIPD- 53 L ++PP D + +LGF R + +ETV R+ + E G + A D Sbjct: 226 VRLAYRPPLDIAALLGFFGQRRIHGMETVDVPGLELRRTARLQDAEGRECTGWLAARFDG 285 Query: 54 --------IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGL 105 + + + +S+ L P +A++ L DL +P+ +N L GL Sbjct: 286 GAAAARGGPPKPHVVLRVSSSLLPALPGVIARVRGLLDLDADPEAINAVLHGDFPRGDGL 345 Query: 106 RLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAA 163 R+PG D FE VRA+LGQ V+VA A L RV + +G+ + FPTP LAA Sbjct: 346 RVPGAWDGFELAVRAVLGQQVTVAAARTLAQRVVERWGDPVATPWPDLCRLFPTPAVLAA 405 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYF 223 D AL LG+ +R A++ L+ A EG L + DV + L+ PGIG WTA Y Sbjct: 406 CDGDALGQLGIVRQRQAAIVALSRAVAEGRLLLHAAADVAGTIAALRALPGIGDWTAQYI 465 Query: 224 ALRGWQAKDVFLPDDYLIKQRFPGMT----PAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 A+R + D F D + + + ++ W+PWRSYA++ W G Sbjct: 466 AMRALRWPDAFPSGDVALHKALAVQSAPRPARAAEEASQAWRPWRSYAVVRAWAGTGTP 524 >UniRef50_P37878 DNA-3-methyladenine glycosylase n=4 Tax=Bacillaceae RepID=3MGA_BACSU Length = 303 Score = 217 bits (554), Expect = 2e-55, Method: Composition-based stats. Identities = 72/294 (24%), Positives = 123/294 (41%), Gaps = 21/294 (7%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYR--GVVTAIPDIARHT 58 + TL +D + LG+L + + ++ + +A+GE R V+ I + Sbjct: 11 VITLP--EIFDMNANLGYLTREKNECMYEIENNIITKVIAIGEIRSLVQVSVINNKQMIV 68 Query: 59 LHINLSAGLEPV-AAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCV 111 +N S +E E + + FDL + + L GLR+ G Sbjct: 69 QFLNDSRPVEQWKREEIVKYIHEWFDLDNDLTPFYEMAKADPLLKMPARKFYGLRVIGIP 128 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALK 170 D FE +LGQ +++A A L + + +G+ ++ + +Y FP +R+A P L Sbjct: 129 DLFEALCWGVLGQQINLAFAYSLKKQFVEAFGDSIEWNGKKYWVFPPYERIARLTPTDLA 188 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGW 228 + M +K++E +I +A G L + + A K L GIG WTANY +R Sbjct: 189 DIKMTVKKSEYIIGIARLMASGELSREKLMKMNFKDAEKNLIKIRGIGPWTANYVLMRCL 248 Query: 229 QAKDVFLPDDYL-------IKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + F DD ++ T +I + WK W+SYA ++W Sbjct: 249 RFPTAFPIDDVGLIHSIKILRNMNRKPTKDEILEISVPWKEWQSYATFYLWRVL 302 >UniRef50_C7QDZ2 Transcriptional regulator, AraC family n=2 Tax=Actinomycetales RepID=C7QDZ2_CATAD Length = 564 Score = 217 bits (553), Expect = 4e-55, Method: Composition-based stats. Identities = 99/350 (28%), Positives = 145/350 (41%), Gaps = 74/350 (21%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETV----ADSYYARSLAVGEYRGVVTAIPDIARH 57 L ++ P+D++ +LG+ RA+ V+ V D Y R+L + G V D + Sbjct: 214 LRLTYRTPFDFAALLGWFGDRAIPGVDEVVGTGRDLVYRRALRLPHGTGQVELRDD--KG 271 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCV 111 +H L A + + L DL +P V+ AL L AARPGLR+PG V Sbjct: 272 VVHARLVVDDLRDVAVAVRRCRDLLDLDADPAQVDAVLAGDPALAPLVAARPGLRVPGAV 331 Query: 112 DAFEQGVRAILGQLVSVAMAA----KLTARVAQL-------------------------- 141 D FE VRA+LGQ +SVA A +L R + + Sbjct: 332 DGFEIAVRAVLGQQISVAAARTMTARLVQRFSAVELAAEAALVPNAALPAVSPAVSGSPT 391 Query: 142 -------------------------YGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 +D + + FP P+ LAA D + L G+ Sbjct: 392 ATSALAAASGATRDPDKDSDPEAAPASHFVDKKADLLPFPRPETLAAGDYEGL---GLTR 448 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 + L LA A G L + D +A L PGIG WTA+Y ALR + D F Sbjct: 449 RTVATLRALATAVASGDLALDRGVDRTEARAKLLAVPGIGPWTADYVALRVFGDPDAFPV 508 Query: 237 DDYLIKQRFPGMT----PAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 D +++++ + + +AE W+PWR+YA LH+W + G EA Sbjct: 509 GDLIVRRQAERLGLPGAEKALLAHAESWRPWRAYAALHLWASSGDPVIEA 558 >UniRef50_C0ZIT0 DNA-3-methyladenine glycosylase II n=75 Tax=Bacillales RepID=C0ZIT0_BREBN Length = 310 Score = 216 bits (551), Expect = 6e-55, Method: Composition-based stats. Identities = 73/289 (25%), Positives = 120/289 (41%), Gaps = 19/289 (6%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVG--EYRGVVTAIPDIARHTLHI 61 L+ + +S L +L+ + + + + +++ +G + A D + Sbjct: 19 LSVPTEFSFSQNLHYLSRASNECMFHIQNGRLYKAIPIGQDSQVVEIHAKNDQGLTVRFL 78 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFE 115 + S E V E + FDL + + L + GLR G D FE Sbjct: 79 SPSLPNEKVRTEVARYVRDWFDLDRDLVPFYELAAGDALLKQAVEKFYGLRTMGIPDLFE 138 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGM 174 I+GQ +++ A L R+ + +G R++ + Y FPT +++A L L M Sbjct: 139 ALSWGIIGQQINLTYAYTLKRRLVEAFGRRVEFEGETYWLFPTAEKIAGLSVTDLDGLRM 198 Query: 175 PLKRAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 K+ E LI +A +EG L + GD + A K L + GIG WTANY +R + Sbjct: 199 TTKKCEYLIDVAQLIVEGKLSKELLWDGGDYQTAEKRLTSIRGIGPWTANYVLMRCLRMP 258 Query: 232 DVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWY 273 F DD + T A+IR ++ W W SYA ++W Sbjct: 259 SAFPIDDVGLHNAIKFLLGKEKKPTKAEIRELSKTWTNWESYATFYLWR 307 >UniRef50_C6D2P4 DNA-3-methyladenine glycosylase II n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6D2P4_PAESJ Length = 304 Score = 216 bits (551), Expect = 6e-55, Method: Composition-based stats. Identities = 71/289 (24%), Positives = 115/289 (39%), Gaps = 23/289 (7%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY--RGVVTAIPDIARHTLHINLSAG 66 P+ + + +L A + V R + G V I + L + + Sbjct: 16 PFSYKETVNYLRRSANEPLYQVEGDAVYRLIPTGSGEEPAAV-VIRESGHGGLLVRVIGE 74 Query: 67 ---LEPVAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 + E A + FD + + L GLR G D FE Sbjct: 75 KQVSDERQREIEAFIREWFDFDTDLLPFYEMAEKDPLLVHAIGRFHGLRSVGISDLFEAL 134 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPL 176 I+GQ +++A A L R + YG+ ++ + + FP P+ +A P+ + ++ M Sbjct: 135 CWGIIGQQINLAFAYTLKRRFVEAYGQSVEREGRTFWQFPVPETIATLKPEDMASMQMTS 194 Query: 177 KRAEALIHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 K++E LI +A EG+L + GD K L GIG WTANY +R + + Sbjct: 195 KKSEYLIGVAKLMAEGSLDKQSLLALGDFAAIEKQLTGIRGIGPWTANYVLMRCLRLPNA 254 Query: 234 FLPDDYLIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHIWYTE 275 F D + +T ++IR+ AE WK W SYA ++W Sbjct: 255 FPIADVGLHNSIKALTGSEAKPAISEIRQMAEGWKGWESYATFYLWRIL 303 >UniRef50_C0Z5U6 Putative DNA-3-methyladenine glycosylase II n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0Z5U6_BREBN Length = 309 Score = 215 bits (549), Expect = 9e-55, Method: Composition-based stats. Identities = 68/298 (22%), Positives = 118/298 (39%), Gaps = 21/298 (7%) Query: 5 NWQPPYDWSWMLGFLAARAVSSVETVAD-SYYARSLAVGEYRGVVTAIPDI--ARHTLHI 61 + PPY + +L L + + + + R +G +V L Sbjct: 7 SLTPPYSFDRLLRRLETHPDTQIRVNQEKNSLQRVFRIGLRPVLVHMQFMGSLEEPALRY 66 Query: 62 NLSAGLEPVAAECLAK-MSRLFDLQCNPQIVNGALGR------LGAARPGLRLPGCVDAF 114 A L + L K + R F ++ + L GLR D F Sbjct: 67 GTQAILSTSDQQLLEKMIRRTFSADLELSVIYEQMREEGELAILTERFRGLRPMLDADLF 126 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKAL 172 + V+ I+GQ +++ AA LT R+ L G+ +++ I FPTP +A + L++L Sbjct: 127 QCMVKTIIGQQINLTFAANLTERLVTLAGDPVENQNGEGIIAFPTPDSVARLTVEDLRSL 186 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 ++AE +I A A + T+ + + E+ + L + GIGRWT + G Sbjct: 187 QFSQRKAEYIIDFARAIVNETVDLERLWTMEDEEIITYLTSLRGIGRWTVECLLMFGMGR 246 Query: 231 KDVFLPDDYLIKQRFPGMTPAQ-------IRRYAERWKPWRSYALLHIWYTEGWQPDE 281 D+ D ++ + + IR+ E+W PWRS L++W G + Sbjct: 247 PDLLPAADIGLRNGIVHLYGMETKPNENDIRKLGEKWAPWRSIYCLYVWEAVGAIKRK 304 >UniRef50_C6XZ60 HhH-GPD family protein n=1 Tax=Pedobacter heparinus DSM 2366 RepID=C6XZ60_PEDHD Length = 301 Score = 214 bits (545), Expect = 3e-54, Method: Composition-based stats. Identities = 64/288 (22%), Positives = 112/288 (38%), Gaps = 17/288 (5%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 + + L FL+ + +V + R G +V P + L + Sbjct: 14 FSKAECLWFLSRDFDDCMYSVFEDRVRRGFRQGSGIMIVDIYPMSDKLILEWLNISPSAE 73 Query: 70 VAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILG 123 + +S FDL + + + + GLR G D FE I+G Sbjct: 74 DITAVVQFVSEWFDLNTDLIPFYKTIAADRRISYMAEDFAGLRFIGMPDFFEALAWCIIG 133 Query: 124 QLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q ++++ A K+ R+ + YG D +Y FP P+ +A A L+ L K+AE + Sbjct: 134 QQINLSFAYKVKRRLVERYGTCTQFDGQKYYLFPGPEIIAKASISDLRELQFSEKKAEYI 193 Query: 183 IHLANAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 I +A A L G L + D+E +K L GIG+WTANY ++ + D Sbjct: 194 IAIAEAFLNGMLNKELLQRLPDLESRIKFLTNIRGIGQWTANYALMKSLKEPACIPYGDA 253 Query: 240 LIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 + I ++ + ++ W+SY + ++W + Sbjct: 254 GLLNALLNHGIIKSKDNKPAIAKFFKAFEGWQSYIVFYLWRALSKPKE 301 >UniRef50_C1YI07 DNA-O6-methylguanine--protein-cysteine S-methyltransferase; DNA-3-methyladenine glycosylase II; Transcriptional regulator Ada n=1 Tax=Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 RepID=C1YI07_NOCDA Length = 561 Score = 210 bits (535), Expect = 4e-53, Method: Composition-based stats. Identities = 100/327 (30%), Positives = 130/327 (39%), Gaps = 49/327 (14%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIA------ 55 L ++ P D + ML FL RAV VE D Y R+L + VV Sbjct: 211 LRLPYREPIDLARMLRFLGDRAVPGVEEYRDGVYRRTLMLAHGPAVVELSEGSGTGRAGR 270 Query: 56 ------------------------RHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV 91 + L + + RL DL +P V Sbjct: 271 TGRAGATGGVRPADAVDGGVSVSGGGHVLCRLRLSEARDLTSAVRRCRRLLDLDADPGAV 330 Query: 92 ------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + LG + AA PGLR PG VD E VRA+LGQ VSV A L R+ + +GE Sbjct: 331 AEALGGDPLLGPIVAAHPGLRSPGHVDPAELAVRAVLGQQVSVRAARTLAGRLVERFGEP 390 Query: 146 L------DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP 199 L FP+P LAAADP +P+ R AL L A G + + Sbjct: 391 LAPGLEAPGGGLTHVFPSPDALAAADP---AGFSVPVARGRALAGLCEAIASGWIDLGPG 447 Query: 200 GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG----MTPAQIRR 255 D ++A + L GIG WTA Y +RG DVFL D ++ TPA R Sbjct: 448 CDRDEAERRLVELRGIGPWTAGYVRMRGLGDPDVFLHGDLGVRMALEAGGRRATPAAAAR 507 Query: 256 YAERWKPWRSYALLHIWYTEGWQPDEA 282 A W PWRSYA +W + + E+ Sbjct: 508 EAREWSPWRSYANHALWASLADRERES 534 >UniRef50_D0LE01 Ada metal-binding domain protein n=1 Tax=Gordonia bronchialis DSM 43247 RepID=D0LE01_GORB4 Length = 526 Score = 210 bits (535), Expect = 5e-53, Method: Composition-based stats. Identities = 83/303 (27%), Positives = 122/303 (40%), Gaps = 35/303 (11%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----------YYARSLAVGEYRGVVTA 50 L ++PPY WSWM FL + A + VE+V D Y R L + + Sbjct: 220 LRLVYRPPYRWSWMRWFLGSHAAAGVESVIDDDPDAITPATRWRYRRVLDLPHGPALAVV 279 Query: 51 IP---DIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAA 101 P + + + L + ++ R DL + + AL L A Sbjct: 280 EPSTEETGPPFVRLTLHHMDMRDLGVAVNRIRRHLDLDADVATAEDALRHDPALRPLIDA 339 Query: 102 RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--------- 152 PGLRLPG +D E +R ++GQ +SVA A + G R+ E Sbjct: 340 APGLRLPGSLDPAETILRTMIGQQISVAAARTHIDALVARLGTRVPWPDEADLPPSAVFP 399 Query: 153 -ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQT 211 FP+ +A Q L+ P +R E+++ +A A + T+ L Sbjct: 400 SATFPSATAIAEHGHQVLRG---PRRRIESIVAVAAALADKTVEPHPGLAASDLRAQLLE 456 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 PGIG WTA A+R D+ L DD ++KQ + R W PWRSYA +H+ Sbjct: 457 LPGIGPWTAALVAMRVTGDPDIALTDDLVVKQAMTELGID--IRSVPSWSPWRSYASMHL 514 Query: 272 WYT 274 W Sbjct: 515 WRH 517 >UniRef50_B4S0Y6 Ada regulatory protein n=3 Tax=Alteromonas macleodii RepID=B4S0Y6_ALTMD Length = 475 Score = 209 bits (533), Expect = 8e-53, Method: Composition-based stats. Identities = 85/290 (29%), Positives = 133/290 (45%), Gaps = 28/290 (9%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++PPY+W ++ FLAARA+S +E V+D+ Y R + GE G A+ + ARH + Sbjct: 202 LFLSYRPPYNWPYVREFLAARAISGMEVVSDNSYGRYFSCGESIGYFNAVHNEARHGFEL 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG----AARPGLRLPGCVDAFEQG 117 ++ + + + L DL +P ++ +L + G A GLRLP FE G Sbjct: 262 HIDMPDLRNLHKTIENIKLLLDLHADPLLIEESLKQAGLPDNALTAGLRLPSAWSVFESG 321 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGER-------LDDFPEYICFPTPQRLAAADPQALK 170 RAI+GQ VSV A + G++ + Y CFPTP+ +A + L+ Sbjct: 322 CRAIVGQQVSVKAAIGQVTLLVHQLGKKGAVSDKYNTNSTAYYCFPTPEAVAGNNLAFLR 381 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 MP R EA+ A L +P K + G+G WT +Y +RG + Sbjct: 382 ---MPQARKEAVRQFACLFLNDKVPNH---------KEILAIKGVGPWTLDYLKMRGERN 429 Query: 231 KDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 DV+L D ++++ + PWRSY L +W Q + Sbjct: 430 PDVYLEGDLIVRKMAQLYPVEPAQA-----APWRSYLTLQLWQLSNQQKE 474 >UniRef50_B2GIR9 Putative methylated-DNA--protein-cysteine methyltransferase/3-methyladenine-DNA glycosylase II n=1 Tax=Kocuria rhizophila DC2201 RepID=B2GIR9_KOCRD Length = 532 Score = 207 bits (528), Expect = 3e-52, Method: Composition-based stats. Identities = 79/283 (27%), Positives = 123/283 (43%), Gaps = 16/283 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L+++ P D + + A AV VE + Y+R+L + + A L + Sbjct: 239 LALSYRAPLDLHGLFVWFAVHAVEGVEVGTATSYSRTLRLPGGPAWLRVYRRGAD-ELRM 297 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFE 115 +A++ RLFDL +P V+ AL L AARPGLR+ G D E Sbjct: 298 RARLTDLADLPALIARVRRLFDLDADPLAVDEALSHVPALRPLVAARPGLRVVGSADPEE 357 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKALG 173 +R ++GQ +S+A A + + GE + FPT +A + L+ Sbjct: 358 TLIRTLIGQQISLAAARTVLGARTREMGEPAPEFAPGLSHMFPTAAAIAEHGERFLRG-- 415 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 P R A++ A+A G L ++ D Q L PG+G WTA++ +R DV Sbjct: 416 -PAARVRAVLGAASAVASGELSLSPGDDAAQQRAALLALPGVGPWTADHVRMRVTGDPDV 474 Query: 234 FLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHIW 272 FL DD ++ + + +A+ PWRSYA H+W Sbjct: 475 FLVDDGALRAGAQRIGLPGDKKALTAWAQSAAPWRSYATTHLW 517 >UniRef50_A3XSB2 Ada regulatory protein n=1 Tax=Vibrio sp. MED222 RepID=A3XSB2_9VIBR Length = 482 Score = 206 bits (525), Expect = 6e-52, Method: Composition-based stats. Identities = 81/295 (27%), Positives = 114/295 (38%), Gaps = 40/295 (13%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L++ P DW +LGF R + +E V D YY R++ V +G A R +L I Sbjct: 203 IQLSFHGPLDWDHLLGFYRRRMIEGLEEVGDGYYQRTVNVNGSKGWFKATLAKER-SLDI 261 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRL---GAARPGLRLPGCVDAFEQGV 118 +A + R+FDL + V + A+ G+R+PG A+E GV Sbjct: 262 EFELDDMSQLRSLIANIRRMFDLDVDISKVEDFFSTIDPNLVAKSGIRIPGVWSAWEAGV 321 Query: 119 RAILGQLVSVAMAAKLTARVAQLYG--------------------ERLDDFPEYICFPTP 158 RAILGQ VSV A + + ++ D E FPTP Sbjct: 322 RAILGQQVSVTAAIGQLNLLVRKLSGSYQVFDSQEQANSQECSDLPQIADASEKAYFPTP 381 Query: 159 QRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRW 218 +++A AD L+ MP R E L A ++ + GIG W Sbjct: 382 KQIADADVSFLR---MPGSRKETLKRFAQYMVDNE---------AEHPSKWIDLKGIGPW 429 Query: 219 TANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 T Y LRG + L D ++K+ E PW SYA H W Sbjct: 430 TIQYALLRGLSEPNHLLVGDLVVKKFIEHRP----TINTESVSPWGSYATFHCWN 480 >UniRef50_C6W476 HhH-GPD family protein n=1 Tax=Dyadobacter fermentans DSM 18053 RepID=C6W476_DYAFD Length = 300 Score = 206 bits (524), Expect = 9e-52, Method: Composition-based stats. Identities = 61/288 (21%), Positives = 111/288 (38%), Gaps = 18/288 (6%) Query: 8 PP-YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAG 66 PP + + FL + T+ + +++ + + I A Sbjct: 11 PPLFSFRECHWFLDRDFDDCMHTIRGNAVLKAIRTSFGDILFRVSEEANFLKTEILYGAA 70 Query: 67 LEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 + ++ FDL + + + L + A GLRL G D FE + Sbjct: 71 APEARDLVVGYVANWFDLNRDIEPFYDLLAADSRLAYMTDAFRGLRLVGISDMFEAICWS 130 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 I+GQ +++ A KL R+ + YG ++ + + FPTP+ LA A L+A+ K+A Sbjct: 131 IIGQQINLTFAYKLKRRMVERYGTHVEWNGEVFPVFPTPEALANAGIDELRAMQFSQKKA 190 Query: 180 EALIHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 E ++ +A A +G L D K L + G+G WTANY ++ ++ + Sbjct: 191 EYVVGIAQAFADGKLNAEVISALPDFASRQKVLVAYKGVGIWTANYVLMKTFRMPEGIPH 250 Query: 237 DDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGW 277 D + G +I + W +Y ++W + Sbjct: 251 GDVGLLNALAGHGIIGDRSEKEKIEALFHAFPGWETYLTFYLWRSLAM 298 >UniRef50_Q2IPL2 Transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II n=12 Tax=Proteobacteria RepID=Q2IPL2_ANADE Length = 514 Score = 205 bits (523), Expect = 1e-51, Method: Composition-based stats. Identities = 106/288 (36%), Positives = 139/288 (48%), Gaps = 17/288 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L PYDW +L FLAARA+ VE VAD Y R++A+ G V PD L Sbjct: 213 IALPHTAPYDWPALLEFLAARAIPGVEQVADGAYRRTVALDGAAGTVEVRPDPRGRGLLA 272 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L A + ++ RL DL + + + L L AARPGLR+PG + FE Sbjct: 273 TLRLPRVAAIAPAVERLRRLLDLDADAAAIGAHLSGDPLLAPLLAARPGLRVPGAWEPFE 332 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLD--DFPEYICFPTPQRLAAADPQALKALG 173 VRA+LGQ VSVA A L R+A G +D D FP P+ LA AD + L G Sbjct: 333 LVVRAVLGQQVSVAAARTLAGRLAARLGAPVDSGDPALSRLFPGPEALAGADLEGL---G 389 Query: 174 MPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 + RA L + A + + G++E A+ L PGIGRWTA Y A+R D Sbjct: 390 LTRARAATLAAIGGAVRDDPSLLAPGGELEDAVARLDALPGIGRWTAQYVAMRALHQPDA 449 Query: 234 FLPDD------YLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 F D + P ++ R AERW+PWR+YA LH+W + Sbjct: 450 FPEGDLGLLAALGGLRGRGRAAPGELLRRAERWRPWRAYAALHLWMSL 497 >UniRef50_Q15P13 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II n=1 Tax=Pseudoalteromonas atlantica T6c RepID=Q15P13_PSEA6 Length = 457 Score = 205 bits (523), Expect = 1e-51, Method: Composition-based stats. Identities = 81/281 (28%), Positives = 121/281 (43%), Gaps = 26/281 (9%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L ++PPY+W + FLA RA+S E V+ YAR+ G +G A ++ Sbjct: 191 VIPLAYRPPYNWPHLRDFLARRAISGSEWVSQDSYARNFTFGTSKGYFQAQHQPDKYRFL 250 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGA--LGRLGAA--RPGLRLPGCVDAFEQ 116 + L+ CL+ + R+ D+ + ++ L L PG+R+PG + FE Sbjct: 251 VTLAIDDLRQLKHCLSNVRRILDVDADSATIDNRIELSGLSKQTITPGIRIPGIWNTFEA 310 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDD-----FPEYICFPTPQRLAAADPQALKA 171 G RAILGQ +SV A L ++ GE + D FP P +A +D L Sbjct: 311 GCRAILGQQISVTAAINLVTKLVATIGEPVLDDQAPVPELNRYFPAPDAVANSDLSFL-- 368 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 GMP R E L A + + + GIG WT Y LRG Sbjct: 369 -GMPNSRRETLRRFAAFYAQH---------PDTPPDDWLSIKGIGPWTVAYANLRGLSQA 418 Query: 232 DVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D++L D +IK++ A++ PWRSY +W Sbjct: 419 DIWLNSDLVIKKQLLLHDID-----ADKVSPWRSYLTFTLW 454 >UniRef50_Q2BC23 DNA-3-methyladenine glycosylase II n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2BC23_9BACI Length = 299 Score = 205 bits (521), Expect = 2e-51, Method: Composition-based stats. Identities = 61/288 (21%), Positives = 106/288 (36%), Gaps = 14/288 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + + L FL + + V S + + ++ Sbjct: 10 MELELPAHFHFREALVFLDRSSYEILHYVEGSAVFKGIITDGEVILLKISSTETHLHASF 69 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFE 115 L A + + A + DL+ + + L L GLR+ G D FE Sbjct: 70 LLGAPSDNGRKQAAAFIEEWLDLKRDASGFGRMAAGDPLLKGLAERYAGLRIIGIPDLFE 129 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGM 174 V A++GQ +++ A KL + YG + + FP P +AA +P+ LK L Sbjct: 130 ALVWAVIGQQINLTFAYKLKKAFTEKYGTCFSYEGRCFWLFPEPGMIAALEPEELKQLQF 189 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++AE +I +A E L A L + G+G WTA+Y ++ F Sbjct: 190 TGRKAEYIIGIAKLMAEKKLKKDDLLGQPGARDVLMSLKGVGAWTADYVRMKCLLDPAAF 249 Query: 235 LPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D + + ++ + A RW W++YA+ + W + Sbjct: 250 PIGDAGFQNALKLQMGLDRKPSIEEVEKAASRWAGWQAYAVFYFWRSL 297 >UniRef50_A8LHD8 Transcriptional regulator, AraC family n=4 Tax=Actinomycetales RepID=A8LHD8_FRASN Length = 540 Score = 204 bits (519), Expect = 3e-51, Method: Composition-based stats. Identities = 91/309 (29%), Positives = 124/309 (40%), Gaps = 34/309 (11%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++ P + G L A AV VE D Y R++ +V P + Sbjct: 213 VRLPFRAPLYPDNLFGHLVATAVPGVEEWRDGAYRRTMRTLHGHAIVALRPLPDHIGCRL 272 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFE 115 L+ A + + RL DL +P V+ AL L A PG R+P VD E Sbjct: 273 ALT--DVRDLAPVIGRCRRLLDLDADPIAVDGQLAADPALAPLVARAPGRRVPRTVDPAE 330 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQ---------------- 159 VRA+LGQ VSVA A AR+ G + D + PQ Sbjct: 331 LAVRAVLGQQVSVAAARTHAARLVTAVGTPIHDPEGGLTHLWPQIADLAEHIERTEYAEC 390 Query: 160 -RLAAADPQ-----ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFP 213 LA A P A + L +P R L + G + + GD E+A L P Sbjct: 391 TDLADAVPAGRRAGAPRGLALPAARRRTFAALVGGLVSGMIELGAGGDWERARAALAALP 450 Query: 214 GIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALL 269 GIG WT A+R D FLP D +++ TPA + R+A W+PWR+YA+ Sbjct: 451 GIGPWTLETIAMRALGDPDAFLPGDLGVRRGAERLGLPATPAALSRHAAAWRPWRAYAVQ 510 Query: 270 HIWYTEGWQ 278 H+W Sbjct: 511 HLWAVLDHP 519 >UniRef50_C5C5F4 HhH-GPD family protein n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C5F4_BEUC1 Length = 330 Score = 203 bits (518), Expect = 4e-51, Method: Composition-based stats. Identities = 82/308 (26%), Positives = 128/308 (41%), Gaps = 31/308 (10%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVA--DSYYARSLAVGEYRGVVTAIPDIARHTL 59 L + PP D +L L ++ V R++ + +VT + Sbjct: 19 LRLAYTPPLDADALLAALGRHETVGLDRVDPLGRTVTRTVPTPDGPVLVTVHLAADEPVV 78 Query: 60 HINL------------SAGLEPVAAECLAKMSRLFDLQCNPQIVNGA------LGRLGAA 101 +++ +G + ++++ DL +P + L L A Sbjct: 79 VLDVEPLVAVAGVGAVWSGADGALEALVSRVRGWLDLDHDPLAADAVLAADPALAPLVAG 138 Query: 102 RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD------DFPEYICF 155 PG+R+PG VD FE +LGQ VS+A A T+R YG L P + F Sbjct: 139 APGMRVPGFVDPFEAAATTVLGQQVSLAAARTFTSRFVAAYGTPLRAAGAPSTAPHWFAF 198 Query: 156 PTPQRLAAADPQALKAL-GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPG 214 PTP+ +A ADP L+A+ G+ RA +L LA A +G + + L PG Sbjct: 199 PTPEAIARADPDELRAVVGLTRARASSLTSLAAAFADG----LALDTGPGSRERLLALPG 254 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 IG WTA+Y LR + D F D ++++ + P + AE W+PWR Y + HIW + Sbjct: 255 IGPWTADYLELRLLRDPDAFPAGDLVLRRGLGVVDPDEATALAESWRPWRGYGVFHIWSS 314 Query: 275 EGWQPDEA 282 Sbjct: 315 ATAPQGRG 322 >UniRef50_D1BI44 DNA-3-methyladenine glycosylase II /DNA-O6-methylguanine--protein-cysteine S-methyltransferase /Transcriptional regulator Ada n=1 Tax=Sanguibacter keddieii DSM 10542 RepID=D1BI44_SANKS Length = 517 Score = 203 bits (518), Expect = 4e-51, Method: Composition-based stats. Identities = 95/310 (30%), Positives = 135/310 (43%), Gaps = 34/310 (10%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETV---ADS--YYARSLAVGEYRGVVTAI--PD 53 + L + P+D + +LGFLA RAV+ VET D YAR+L + G V + Sbjct: 207 VVDLPVRQPFDAAGVLGFLADRAVAGVETATTEDDGTMRYARTLDLPHGPGAVEVVAVRR 266 Query: 54 IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRL 107 R + L A +A++ RL DL +P V+ AL L RPG R+ Sbjct: 267 QGRWEMRARLELAALGDVAPAVARVRRLLDLDADPVAVDSALAQDPALRPLVEERPGTRV 326 Query: 108 PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAA-- 163 PG VD E VRA++GQ +SVA A R+ G FPT ++AA Sbjct: 327 PGAVDPHELVVRAVVGQQISVAAARTHLGRLTARLGTPYRSAFAGLDRLFPTAAQVAAGV 386 Query: 164 ---ADPQAL---KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGR 217 AD + L + L +P + A++ +A A +G L + + D L PGIG Sbjct: 387 PVPADDEVLDPDRPLRLPRRSVRAVVSVARALADGDLVVDVGADAAALRAELVDRPGIGP 446 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-----------PAQIRRYAERWKPWRSY 266 WTA Y A+R D +LP D + + + A W PWRSY Sbjct: 447 WTAAYVAMRVLGDPDAWLPGDVALVAGARAVGLLGTEKTTSAAHRALAEGASVWAPWRSY 506 Query: 267 ALLHIWYTEG 276 A++H+W Sbjct: 507 AVVHLWRAAS 516 >UniRef50_Q1QTR7 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=2 Tax=Gammaproteobacteria RepID=Q1QTR7_CHRSD Length = 453 Score = 203 bits (516), Expect = 6e-51, Method: Composition-based stats. Identities = 87/275 (31%), Positives = 124/275 (45%), Gaps = 21/275 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++PPY W W+ FLAAR + +E + Y R + G G TA+ RH + Sbjct: 194 LMLAYRPPYAWEWLRDFLAARRIDRLEWGDEHRYGRHIQWGSASGHFTAVHVPERHGFRV 253 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR-LGAARP---GLRLPGCVDAFEQG 117 LS + + R+ DL + ++ L + L P GLRLPG FE G Sbjct: 254 TLSLDDLGALLPVVRHIRRVLDLDADTALIEAQLRQTLPDTFPLVEGLRLPGVWTPFEAG 313 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 VRA+LGQ VS++ A R+ + GE + FPT R+AA+D L+ MP Sbjct: 314 VRAVLGQQVSISAARGHVTRLVEALGEP--TGDDGRQFPTAARIAASDLAFLR---MPQA 368 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R + L LA AA + L + GIG W+A+Y ALRG D++L Sbjct: 369 RRDCLRGLAQAACDRRLDDDP--------RQWTALKGIGPWSADYAALRGTSHPDIWLGG 420 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 D +K+ + + PWRSY L +W Sbjct: 421 DLGVKRALSALGTVEPAHAT----PWRSYLTLQLW 451 >UniRef50_C7MYM6 DNA-3-methyladenine glycosylase II /DNA-O6-methylguanine--protein-cysteine S-methyltransferase /Transcriptional regulator Ada n=20 Tax=Actinobacteria (class) RepID=C7MYM6_SACVD Length = 510 Score = 203 bits (516), Expect = 7e-51, Method: Composition-based stats. Identities = 79/293 (26%), Positives = 128/293 (43%), Gaps = 22/293 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L ++ P+D + +L FL ARAV VE+ Y R+L + VV P + Sbjct: 220 LRLPFRRPFDTTGVLDFLTARAVPGVESTEGD-YRRTLRLPHGAAVVRLSPRSTHIECLL 278 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFE 115 L+ + ++++ RL+DL +PQ V + AL +A PG+R+PG VD E Sbjct: 279 RLT--DIRDLSGAVSRIRRLWDLDADPQAVLDCLSADPALAPWLSAAPGIRVPGAVDGPE 336 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLD------DFPEYICFPTPQRLAAADPQAL 169 +RA+ Q +S A R+ G + + FP P +A L Sbjct: 337 LVLRALFEQGMSTRRAHIALGRLVTELGTPIAPELLDATDDPTLLFPGPTAVAEHAASIL 396 Query: 170 KALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 P R + + +A A +G L + + D E + L PGI W A+Y +R Sbjct: 397 PG---PQDRVDTIRTIAAALAQGDLDVHVGRDAEDLRRDLLAVPGISSWAADYILMRLLG 453 Query: 230 AKDVFLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D+ L D ++++ T + + +A RW+PWRSYA +++W Sbjct: 454 HPDILLGTDLVLRRGARSLGIDATYSGLTTHARRWRPWRSYAGMYLWRAGDQP 506 >UniRef50_C7PMW8 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PMW8_CHIPD Length = 302 Score = 201 bits (512), Expect = 2e-50, Method: Composition-based stats. Identities = 63/284 (22%), Positives = 115/284 (40%), Gaps = 18/284 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 + ++ L FL+ + V D+ + + + ++ D A L + G Sbjct: 17 FSFAECLVFLSRSEKECLHYVQDNAVQKMVISNGHPVLLEISDDPAAKALKARVLDGPGE 76 Query: 70 VA--AECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 A + +S L+ + + L L GLRL G D FE +I Sbjct: 77 DIDDAHIIKYISHWLHLEADLRPFYKFAKKDAVLKPLADRYKGLRLIGIPDLFEALTWSI 136 Query: 122 LGQLVSVAMAAKLTARVAQLYGER-LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 GQ +++ A L R Q +G + + +Y +P P +A+ +P +L A+ +A+ Sbjct: 137 TGQQITLGFAYTLRQRFIQAFGHHAVINGKDYYVYPHPAVVASLEPASLIAMQFSRSKAD 196 Query: 181 ALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 +I LA A G L D +QA L +F GIG W+ANY ++ + + +D Sbjct: 197 YIIGLAKAMTGGLLTDKQLWEMDYQQARAHLISFRGIGNWSANYVLMKYHRHHEALPLED 256 Query: 239 YLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 + + A ++ Y W+ + +YA ++W + Sbjct: 257 AGLHNALKQQLQLTAKPSLADVKAYTGHWREYAAYATFYLWRSL 300 >UniRef50_D1CD20 DNA-3-methyladenine glycosylase II n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CD20_THET1 Length = 301 Score = 201 bits (511), Expect = 3e-50, Method: Composition-based stats. Identities = 74/286 (25%), Positives = 117/286 (40%), Gaps = 17/286 (5%) Query: 8 PPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGL 67 P + L +L + +E V + R+L + E ++ + + + A Sbjct: 11 GPLRLAHALLYLRTSPSAVLEKVTEDACRRALRINERAVLLQVRQEGQGVRVTLWGDALD 70 Query: 68 EPVAAECLAKMSRLFDLQCNPQI-------VNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 + A A++ R+F L +P + LGR+ R D E + A Sbjct: 71 DATVAAAEAEVRRIFLLDEDPGAFYREVPLRDRVLGRVMEDYLWARPVLIADPLEALMWA 130 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 I+GQ V+VA A KL AR+ +L G L+ D Y FP R+A L+ ++A Sbjct: 131 IIGQQVNVAFARKLKARLVELCGSVLEVDGERYWVFPPAWRIADLPEDLLRGNQFSRQKA 190 Query: 180 EALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 ++ LA A G L + G VE+A+ L F G+GRWTA Y +RG DV Sbjct: 191 RYILGLARAVASGELDLRALGVLPVEEAIAELVRFLGVGRWTAEYVLMRGLGRADVIPAA 250 Query: 238 DYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEG 276 D ++ A++R + W PWR++ W Sbjct: 251 DLGLRAVMGRHYLGGRVATEAEVREISAAWSPWRAWGAWLWWLHLQ 296 >UniRef50_A0JV31 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / DNA-3-methyladenine glycosylase II / Transcriptional regulator Ada n=5 Tax=Actinobacteria (class) RepID=A0JV31_ARTS2 Length = 504 Score = 200 bits (510), Expect = 3e-50, Method: Composition-based stats. Identities = 78/295 (26%), Positives = 123/295 (41%), Gaps = 22/295 (7%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIA--RHTLHI 61 L ++ P+D + FLA R++ +ET + YAR+L + + D L + Sbjct: 200 LPYREPFD-PGIFQFLAVRSIPGIETGTGTSYARTLRLPHADARFSVEYDADAPGRPLVL 258 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFE 115 + A L+++ RL DL +P ++ L A PG+R+PG VD E Sbjct: 259 TIGAVDLRDLPSLLSRVRRLLDLDADPVAIDNALEADPRLAPAVKAFPGMRMPGAVDPQE 318 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYI-CFPTPQRLAAADPQALKALGM 174 +RA++GQ ++VA A +++ E L FPT ++A L+ Sbjct: 319 LLIRAMIGQQITVAAARTALTQLSACGSESLVPADGLHRLFPTAAQIADPGFGLLRG--- 375 Query: 175 PLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 P +R +++ A A G L D+ L PG+G WT Y A+R A DVF Sbjct: 376 PQRRIDSVRAAAGAMAAGNLDFGYGDDLAGLQSKLLPLPGVGPWTVGYVAMRVIGAPDVF 435 Query: 235 LPDDYLIKQRF---------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 L +D ++ P PWRSYA +H+W +P Sbjct: 436 LANDAAVRNGILALDTGPQAGERPPGVQPADFTDVSPWRSYATMHLWRAAAMRPQ 490 >UniRef50_O31544 Putative DNA-3-methyladenine glycosylase yfjP n=17 Tax=Bacillaceae RepID=YFJP_BACSU Length = 287 Score = 200 bits (509), Expect = 4e-50, Method: Composition-based stats. Identities = 58/286 (20%), Positives = 108/286 (37%), Gaps = 19/286 (6%) Query: 5 NWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLS 64 + PPY + +L L+ +++V+ + + H Sbjct: 7 SVTPPYHFDRVLDRLSLDPLNAVDR-EAREVRVPIRNQAGDVCI-VKVQALGHAGEPEFL 64 Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIV-----NGALGRLGAARPGLRLPGCVDAFEQGVR 119 E E + ++ R+F + + Q V +L + G L + ++ Sbjct: 65 VSGETDQGEMMKEIKRIFQWENHLQHVLDHFSKTSLSAIFEEHAGTPLVLDYSVYNCMMK 124 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 I+ Q ++++ A LT R +GE+ C+P P+ +A D Q L+ L +++A Sbjct: 125 CIIHQQLNLSFAYTLTERFVHAFGEQ---KDGVWCYPKPETIAELDYQDLRDLQFSMRKA 181 Query: 180 EALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 E I + EGTL + E MK L GIG WT + G ++F Sbjct: 182 EYTIDTSRMIAEGTLSLSELPHMADEDIMKKLIKIRGIGPWTVQNVLMFGLGRPNLFPLA 241 Query: 238 DYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEG 276 D ++ + ++ W+P+ SYA L++W + Sbjct: 242 DIGLQNAIKRHFQLDDKPAKDVMLAMSKEWEPYLSYASLYLWRSIE 287 >UniRef50_Q12L65 DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II n=8 Tax=Shewanella RepID=Q12L65_SHEDO Length = 545 Score = 199 bits (506), Expect = 1e-49, Method: Composition-based stats. Identities = 81/297 (27%), Positives = 113/297 (38%), Gaps = 34/297 (11%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSY-YARSLAVGEYRGVVTAIPDIARHTLH 60 +L ++PP +W M F R VS +E + + Y+RS +GV + A+ Sbjct: 226 LSLAFRPPLNWHKMWAFYQFRQVSGMEILDEEQGYSRSFCFDGVKGVFRVRLNEAKSQFD 285 Query: 61 INLS---AGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP----GLRLPGCVDA 113 + + + ++ RL DL + + L A GLR+P Sbjct: 286 TQIYLLHSHDVKQLHPVVLRIRRLLDLDTDMATIAQIFVPLVAMGAKLDAGLRIPATASV 345 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKAL 172 FE RAILGQ VSV A KL + + YGE + + + FPTP+ +A A LK Sbjct: 346 FEAACRAILGQQVSVQQATKLLNTLVEHYGETFELNGQVWRLFPTPEAVATASLDELK-- 403 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R AL L E GIG WT Y +RG + Sbjct: 404 -MPGARRLALNALGAYVQEH---------PHSTPDDWLEVKGIGPWTVAYAKMRGLSESN 453 Query: 233 VFLPDDYLIKQRFPGMTPAQ-------------IRRYAERWKPWRSYALLHIWYTEG 276 VFL D +IK R G+ A + PW SY +W E Sbjct: 454 VFLSSDLVIKHRIHGLYAKAGGIIETPKAYLALAADIANKVSPWGSYLTFGLWDDED 510 >UniRef50_A9B7A8 Transcriptional regulator, AraC family n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9B7A8_HERA2 Length = 489 Score = 198 bits (505), Expect = 1e-49, Method: Composition-based stats. Identities = 73/290 (25%), Positives = 115/290 (39%), Gaps = 21/290 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGV---VTAIPDIARHT 58 ++L Y +LG L VS + V + + + + GV VT P A + Sbjct: 198 FSLALPNDYPSRQILGQLGRDPVSLTDQVVEQTWYSTCRLNGQTGVLLAVTITPTTAECS 257 Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVD 112 + + SA A + L +P AL L + GLR+P + Sbjct: 258 I-VEQSAVTPSDVATIHRHVIAGLGLSNDPSRFEAHVAKSPALLPLIEHQRGLRMPLVHN 316 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 F+ V AILGQ +S+A+A +L R+ +L G+RL+ ++ PTP +A + L L Sbjct: 317 PFDALVWAILGQQISLAVAYRLRQRLTELVGQRLN--QDFYLAPTPNTIAQLTVEQLLPL 374 Query: 173 GMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G +A LI A A + +LP+ + + L GIG WTA Y +R + Sbjct: 375 GFSNAKARYLIDTAQAIIAESLPLASYHRKSATRIERELLALRGIGPWTAQYVLMRSFGF 434 Query: 231 KDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHIWY 273 D D + + + P+RS A H+W Sbjct: 435 SDCVPVGDSGLTSSLQAFFQLEQRPDRSTTLALMAAFSPYRSLATFHLWQ 484 >UniRef50_B0KRT0 AlkA domain protein n=1 Tax=Pseudomonas putida GB-1 RepID=B0KRT0_PSEPG Length = 325 Score = 198 bits (503), Expect = 2e-49, Method: Composition-based stats. Identities = 96/285 (33%), Positives = 137/285 (48%), Gaps = 14/285 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L ++ PY W LGFLAAR + +ET D Y+R+L V + V+ A P +H L + L Sbjct: 40 LRYRAPYHWPSTLGFLAARCIPGIETCHDGTYSRTLIVAGHHAVLHATPMTNQH-LRVRL 98 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQG 117 +A++ R+FDL +P + + + L ARPGLR+P DA EQ Sbjct: 99 EGAPSNALPGLIARLRRVFDLDADPARISAELSCDPLMASLLKARPGLRVPQGWDACEQA 158 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD--DFPEYICFPTPQRLAAADPQALKALGMP 175 +R +LGQ +SVA A L R+ Q +G L FP LA A + + GMP Sbjct: 159 MRTVLGQQISVAGAMTLAGRLVQRHGAPLRLSAPGLSHVFPALPTLANAQFENM---GMP 215 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA L LANA L + +++ ++ L GIG W+A+Y ALR A D Sbjct: 216 SARATTLATLANALLADPGLLRRGQVLDELLRNLCRLKGIGPWSAHYLALRQAGAADALP 275 Query: 236 PDDYLIKQRFP--GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D + + AQ+ A W+PWR+YA H+W + G Sbjct: 276 LGDVALIKALRLLEGDEAQLAERALDWRPWRAYAAQHLWASLGPA 320 >UniRef50_Q3IBU8 Putative ADA regulatory protein (Regulatory protein of adaptative response) n=3 Tax=Alteromonadales RepID=Q3IBU8_PSEHT Length = 454 Score = 197 bits (502), Expect = 3e-49, Method: Composition-based stats. Identities = 77/278 (27%), Positives = 121/278 (43%), Gaps = 22/278 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 TL ++PPY+W M FLA R ++ +E + + Y R+ + +G A ++ + Sbjct: 195 LTLPFRPPYNWPAMQQFLAKRLIAPMEWITATSYGRTFSDEHCKGSFNAEFIAQKNHFKV 254 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQI----VNGALGRLGAARPGLRLPGCVDAFEQG 117 ++ + + + R+ DL + + + + A GLRLPG +FE G Sbjct: 255 AITINNTHCLQQVITNIRRVLDLDADINLITMHIQDNINNAFAVSEGLRLPGIWSSFEAG 314 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +RA+LGQ VSV A L ++ GE+ + + FPTPQ+L +D K MP Sbjct: 315 IRAVLGQQVSVTAAHNLVTKLVSELGEQCN---GAVYFPTPQQLVNSDFAFFK---MPQA 368 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R AL +LA + D + GIG WT NY LRG D+ L Sbjct: 369 RKNALYNLAQFC-----TLNPQCDD---LDLWLNLKGIGPWTVNYAKLRGQSQPDILLDG 420 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D +K+ A ++ P+RSY +W Sbjct: 421 DLGVKKA----QAAVAVFSSDNCAPFRSYLTFQLWQQL 454 >UniRef50_C6MGP3 HhH-GPD family protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MGP3_9PROT Length = 316 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 65/289 (22%), Positives = 116/289 (40%), Gaps = 18/289 (6%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHIN 62 L Y S +L F ETV D+Y+++ + + ++ + + Sbjct: 9 RLPLPENYRISDILSFHGRDKWDVAETVYDNYFSKGIIWHDTPACLSIRFQQLYVEIELC 68 Query: 63 LSAGLEP-VAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFE 115 + L+ + R+ L + + LG L A + GLR+P AFE Sbjct: 69 IDKQLKTFCPDTFHSMAIRMLGLNQSVNTFEEEFREHAQLGSLIAKQSGLRVPVSATAFE 128 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 + AI GQ +S++ A + ++ QL G R C P Q L+ L+ +G Sbjct: 129 ALIWAIAGQKISISAALAIRRKLIQLIGLRHSGG--LYCHPNAQHLSHLSISDLRQIGFS 186 Query: 176 LKRAEALIHLANAALEGTLPMTIPG---DVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +A+ ++ ++ + L ++ +E + L GIG WT +Y LRG+ D Sbjct: 187 HSKAQTILTVSQRVICNELELSSASAEPPIEHIRQQLLQIRGIGLWTVDYTLLRGYGWLD 246 Query: 233 VFLPDDYLIKQRFP------GMTPAQIRRYAERWKPWRSYALLHIWYTE 275 L D +++ + Q R++ E + PWR+ H+W E Sbjct: 247 GSLHGDVAVRRGLQILLNCESINENQTRQWLENFSPWRALVAAHLWNIE 295 >UniRef50_D1P0X5 DNA-3-methyladenine glycosylase II n=4 Tax=Enterobacteriaceae RepID=D1P0X5_9ENTR Length = 303 Score = 194 bits (494), Expect = 2e-48, Method: Composition-based stats. Identities = 65/294 (22%), Positives = 113/294 (38%), Gaps = 19/294 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L P Y F + E V ++ + + +Y +T + T + Sbjct: 10 IQLLLPPHYRVDDFFAFHLRDPQNIAEIVTENTLRKGIIWQQYPAQITLSIENHNATFSL 69 Query: 62 NLSAGLEPV-------AAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAF 114 ++ A A L +++ +L + + + LG+L + G+R+ F Sbjct: 70 DIDALQVSATQHEKLTLATHLLGLNQPVELFEDIYLSHPILGKLITPQRGVRVYQSASTF 129 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 E V AI+GQ +SV A + R Q G + CFPT Q++A D L+ G Sbjct: 130 EALVWAIIGQQISVLAAIAIRRRFIQAVGMQHSSG--IWCFPTVQQVAQVDDNILRKTGF 187 Query: 175 PLKRAEALIHLANAALEGTLPMTI---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + AL + A L + + P +VE L GIG WT +Y LRG+ Sbjct: 188 STGKIIALRGVCEAIENQRLDLDLTVTPDNVEDVTAQLLAIKGIGPWTISYALLRGFNYL 247 Query: 232 DVFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D L D +++ + + + + ++ PWR+ H+W + Sbjct: 248 DGSLHGDVAVRRNLQTLLNHTEQPSTKETQHWLVQFAPWRALVAAHLWRYQSAA 301 >UniRef50_Q81IC3 DNA-3-methyladenine glycosylase II n=75 Tax=Bacillus RepID=Q81IC3_BACCR Length = 287 Score = 194 bits (493), Expect = 3e-48, Method: Composition-based stats. Identities = 56/299 (18%), Positives = 109/299 (36%), Gaps = 41/299 (13%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 TL + PY + +L L+ ++ + + E + D + + + Sbjct: 6 VTLEY--PYHFEEVLKRLSFDPLN------------VIQLDEKVIYIPLCIDEEQVVVRL 51 Query: 62 NLSAGLEP----------VAAECLAKMSRLFDL-----QCNPQIVNGALGRLGAARPGLR 106 ++ + + +M +F +N +L L Sbjct: 52 QGIGTVQNPQFWISSQTGDPEKVMKRMRAIFHWNEPFQDIQNHFLNTSLRPLFETYAYTP 111 Query: 107 LPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADP 166 + D F +R I+ Q +++ A LT + + YG + FPTP+ +A Sbjct: 112 IILEFDYFACLLRCIIHQQINLKFATVLTEQFVKRYGT---EKNGVFFFPTPEIVANISI 168 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFA 224 + L+ ++AE ++ L + + GTL + G E L GIG WT F Sbjct: 169 EELREQKFSQRKAEYIVGLGRSIVSGTLNLASIENGTEEDISAQLLPIRGIGAWTVQNFL 228 Query: 225 LRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 + G K++F D I++ A + + + +P+ SYA L++W + Sbjct: 229 MFGLGRKNMFPKADIGIQRAVQGIFQLDDKPDDAFLEKVKQECEPYCSYAALYLWKSIE 287 >UniRef50_UPI00018509D2 YfjP n=1 Tax=Bacillus coahuilensis m4-4 RepID=UPI00018509D2 Length = 301 Score = 193 bits (490), Expect = 7e-48, Method: Composition-based stats. Identities = 52/294 (17%), Positives = 108/294 (36%), Gaps = 27/294 (9%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 ++ + PYD +L + + VE R R ++ + +H Sbjct: 3 VRVSVEQPYDVESVLSYFTGHPLVVVEQ----SGLRFGLDHGVRSIIDVKYEGEIAIIHS 58 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNG-----ALGRLGAARPGLRLPGCVDAFEQ 116 + + + K + L + ++ L + G L +D + Sbjct: 59 EIDD------MKFIEKTMHILHLDRPLKPIDEFYRKSELQEIFQKYEGYPLLLELDDYMS 112 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 +R I+ Q V++ +A + + YGE +D FP P L + L+ + Sbjct: 113 IIRCIISQQVNLTLARNIFTSLTHTYGEEVDS---VWFFPRPHVLKEVSIEELRTHKLSQ 169 Query: 177 KRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++AE + A+ +G + M ++ + L GIG+WT + L +++F Sbjct: 170 RKAEYIQGFASLVADGAIDMDELDKLSNDEIIDRLLPIRGIGKWTVENYLLFTLGRENLF 229 Query: 235 LPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 D I+ T ++ Y+ W P+ SYA +++W + ++ Sbjct: 230 PKGDIGIQNALKKFLQLDRKPTMDEMDIYSRDWAPYLSYASIYLWRSLENGSEQ 283 >UniRef50_Q1YTX8 Putative DNA-3-methyladenine glycosylase II n=1 Tax=gamma proteobacterium HTCC2207 RepID=Q1YTX8_9GAMM Length = 257 Score = 191 bits (487), Expect = 2e-47, Method: Composition-based stats. Identities = 83/278 (29%), Positives = 121/278 (43%), Gaps = 24/278 (8%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 M L P+ W +L +L+ R V E++AD++Y R G+V D L Sbjct: 1 MIELPVVKPFPWQQLLEYLSFRLVPEFESIADNHYQRIYR----DGLVRVSYDEPNGLLQ 56 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR---LGAARPGLRLPGCVDAFEQG 117 I + + +SR+F Q Q + L + A PG R GC D FE Sbjct: 57 IKSDLP-QDQLDNLIVPVSRIFRPQLCTQAIYQQLLPHLPILAKSPGFRPLGCWDPFELC 115 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R I+GQ V+VA A + R+ + G+ TP+ L AAD L +GMP Sbjct: 116 LRTIIGQQVTVAAANTIMRRLVERCGQL-----------TPEALLAAD---LSNMGMPGA 161 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 R ALI LA A G L ++ + + L GIG WT Y A+R D F Sbjct: 162 RVAALIALATALANGDLDLSR--PWPELKEALLKLRGIGPWTCGYLAIRLGMDDDAFPET 219 Query: 238 DYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 D + + + + AE W+P+R+YA + +W E Sbjct: 220 DVGLIRAAKSESAMALLASAELWRPYRAYAAVGLWALE 257 >UniRef50_A3D6C4 Transcriptional regulator Ada / DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase n=11 Tax=Shewanella RepID=A3D6C4_SHEB5 Length = 565 Score = 191 bits (486), Expect = 2e-47, Method: Composition-based stats. Identities = 83/321 (25%), Positives = 124/321 (38%), Gaps = 65/321 (20%) Query: 6 WQPPYDWSWMLGFLAARAVSSVETVADSY-----------------------------YA 36 ++PP DW+ L F RAV+ +E Y Sbjct: 256 YRPPLDWASQLAFYRLRAVTGMEWFTPQMSHPQASDAVQVADEANLAAEANADDNGLEYG 315 Query: 37 RSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAEC---LAKMSRLFDLQCNPQIVNG 93 R A+G+ RG V I + + + ++ + E + ++ R+ DL + Q + Sbjct: 316 RCFAIGKMRGTVQIIHEPKLNRFKLAIALTEDSAVDELQLLVTEVRRILDLDADMQQIEQ 375 Query: 94 ALGRLGA----ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-D 148 L L + GLR+PG FE RAILGQ V+V A KL + + YGE + Sbjct: 376 GLSTLPSLGLMPFSGLRIPGAGSLFEAVCRAILGQQVTVVQATKLLNILVEAYGECFSLN 435 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKT 208 EY FPTP+ + A LK MP R AL LA E E ++ Sbjct: 436 GREYRLFPTPEAIREASLTELK---MPGARKLALNALAAFICEH---------PEASVDD 483 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ---------------- 252 + GIG WT Y LRG +VFL D ++K+ + Sbjct: 484 WLSVKGIGPWTIAYAKLRGLGDPNVFLHLDLIVKKHLLALYIKNNRLDETAAAAVIYSQL 543 Query: 253 IRRYAERWKPWRSYALLHIWY 273 + +++ PW SY +W+ Sbjct: 544 CEQLSQQIAPWGSYLTFQLWH 564 >UniRef50_D1Z1B8 Putative DNA glycosidase n=1 Tax=Methanocella paludicola SANAE RepID=D1Z1B8_METPS Length = 303 Score = 191 bits (485), Expect = 3e-47, Method: Composition-based stats. Identities = 65/290 (22%), Positives = 109/290 (37%), Gaps = 23/290 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L P+ + L R + ++ Y R L ++ + D + Sbjct: 6 IPLPAAFPFRLDLTVWALRRRKSNIIDRWDGRRYTRILLFKNAPVRISIVQDSPEKAPEL 65 Query: 62 NLS-----AGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGC 110 ++S + + + + L + + + +G L G++ P Sbjct: 66 SMSLEGDKESADRAREPMIRLVKEMLGLDLDLRPFYALTKNDVVIGGLVRQFCGVKPPRF 125 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 FE + AI Q VS+ + L R+A+ YG DD FP P+ LA+ + +K Sbjct: 126 PTIFEALLNAIACQQVSLDVGIILLDRLAERYGRAFDD---EAAFPAPEGLASIPVEEIK 182 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 LG ++A A+ LA A G + ++A+K L T GIGRW+A Y LRG Sbjct: 183 KLGFSYQKARAIKELAAAIASGNASLERVYRMSDQEAIKYLSTLRGIGRWSAEYVLLRGL 242 Query: 229 QAKDVFLPDDYLIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHI 271 D F DD + + +I+ RW P+ H+ Sbjct: 243 GRLDSFPADDIGARNNLQRLFHLDHKPGYGEIKELTSRWHPYEGLVYFHL 292 >UniRef50_A5CSR4 Putative DNA glycosylase n=2 Tax=Clavibacter michiganensis RepID=A5CSR4_CLAM3 Length = 311 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 80/290 (27%), Positives = 118/290 (40%), Gaps = 23/290 (7%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP------DIARH 57 L P+D ++ FL+ AV+ E + + +S + G VT D+ Sbjct: 15 LEVPGPFDGGGVIRFLSWHAVTGAEEGDATSFTQSARLAHGAGTVTVRLLEAEPGDVGGA 74 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCV 111 + + AAE LA RL L + ++ AL + A PGLR+PG + Sbjct: 75 RVEVTTRVEHAADAAELLAGTRRLLGLDVDAARIDADLARDPALAAVVRATPGLRIPGTL 134 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD----DFPEYICFPTPQRLAAADPQ 167 D R I+GQ +SVA A R+ GE L PT R+A + Sbjct: 135 DPRSTLFRTIVGQQISVASARATHGRMTADLGEDLPASVAHGSVTRLPPTAARIARDGGE 194 Query: 168 ALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRG 227 L+ P +R LI +A A G L + + L F G+G WTA+Y A+R Sbjct: 195 LLRG---PARRTATLIRIAEALETGELVIEPGVPRAELRAALVAFHGVGPWTADYVAMRA 251 Query: 228 WQAKDVFLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWY 273 D+ L D ++++ + A W PWRSYA LH+W Sbjct: 252 LGEPDILLSGDLIVRRGGAALGLPDEARALDARAAAWSPWRSYATLHLWR 301 >UniRef50_D1C0H7 Transcriptional regulator, AraC family n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1C0H7_XYLCX Length = 543 Score = 188 bits (478), Expect = 2e-46, Method: Composition-based stats. Identities = 86/318 (27%), Positives = 127/318 (39%), Gaps = 45/318 (14%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADS-----YYARSLAVGEYRGVVTAI----- 51 L + P+D + GFLAARAV+ VET + YAR++A+ Sbjct: 210 LRLPVREPFDAPGVFGFLAARAVTGVETASADDDGTLRYARTVALPHGPAAFEVSATPRA 269 Query: 52 ---PDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAAR 102 D + + + A +A++ RL DL +P V+ AL L A Sbjct: 270 VSGRDARGWDVQVRVELTSLADVATVVARVRRLLDLDADPVAVDTALGTDPALALLVTAT 329 Query: 103 PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC--FPTPQR 160 PG+R+PG VD E VRAI+GQ +SVA A R+A G + + FP+ Sbjct: 330 PGIRVPGAVDPHELLVRAIVGQQISVAAARTHLGRLAARLGTPYASSFDGLTTVFPSAAA 389 Query: 161 LAAADP----------QALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQ 210 + P + L +P + A++ A G L + + D + L Sbjct: 390 IVDGVPVTAPGTPEASDPDRPLRLPARGVAAVVGATRALAAGDLAVDVGADPDTLRTALL 449 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDD--------------YLIKQRFPGMTPAQIRRY 256 PG+G WTA Y A+R D + D +R P + + Sbjct: 450 ALPGVGAWTAAYVAMRVLGDPDAWPEGDVALVAGAAAAGIAAASAAERRPTQRHRDLAAH 509 Query: 257 AERWKPWRSYALLHIWYT 274 A W PWRSYA +H+W Sbjct: 510 AAAWAPWRSYAAMHLWAA 527 >UniRef50_UPI0000E0EED3 Ada family regulatory protein n=1 Tax=Glaciecola sp. HTCC2999 RepID=UPI0000E0EED3 Length = 280 Score = 188 bits (477), Expect = 2e-46, Method: Composition-based stats. Identities = 80/296 (27%), Positives = 117/296 (39%), Gaps = 36/296 (12%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRG-------------- 46 M L+ PY+WS + FL RA++ +E + +YAR ++ Sbjct: 2 MIYLSVTQPYNWSMVHAFLTRRAIAGIEECGEFHYARYFDETDFYAVSGLSHVSNEGLTS 61 Query: 47 -VVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAAR--- 102 A + + LS E LA ++R+ D Q +P + AL + G Sbjct: 62 SWFCATYEPEAQRFAVQLSLHNEACREAVLANIARVLDAQQDPNTIAQALTKAGFTPEHM 121 Query: 103 -PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRL 161 GLRLP FE +RAI+GQ +SV A K+ + Q G + Y FP+ + Sbjct: 122 TSGLRLPATWSPFEALIRAIVGQQISVNGAVKI---LNQWIGNLRAEANGYRHFPSATEI 178 Query: 162 AAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTAN 221 A D L MP R L A L + ++ L GIG WT N Sbjct: 179 ACCDTSKLP---MPKARQATLNLAAETVQAKPLH------DSETIQDLLKIKGIGPWTVN 229 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 Y +RG D+FL +D ++K + E KPWRSY + +W Sbjct: 230 YVLMRGISHPDIFLDNDLVVKNQLAR-----FALTPELAKPWRSYVCIQLWEHANT 280 >UniRef50_Q7N9Z6 Similarities with the C-terminal region of 3-methyladenine DNA glycosylase n=2 Tax=Enterobacteriaceae RepID=Q7N9Z6_PHOLL Length = 299 Score = 188 bits (477), Expect = 3e-46, Method: Composition-based stats. Identities = 56/283 (19%), Positives = 105/283 (37%), Gaps = 18/283 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGL-- 67 Y + E V + + + + +T + + + + Sbjct: 15 YHTHDFFALHQRDKQNIAEIVEQNKVQKGMMWDDKPAELTIAINAKTAHIQLKIDGDTTG 74 Query: 68 ----EPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILG 123 A + + + L + +G L A + GLR+ FE A++G Sbjct: 75 PDERLSTLASHMPGLLQPVHLFECLYKRHPVIGSLIARQSGLRIYQSATPFEALSWAVIG 134 Query: 124 QLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 Q +SV+ A + R Q G + CFPT +++ L+ G + +A+AL+ Sbjct: 135 QQISVSAAISIRRRFIQAMG--VQHSSGLWCFPTARQIINHSEDELRQCGFSVSKAKALL 192 Query: 184 HLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 L+ G L + D++Q + L GIG WT NY LRG+ + L D Sbjct: 193 RLSQLIESGELTLAISNSETDIQQLIDNLLAIKGIGMWTINYSLLRGFNYLNGSLHGDVA 252 Query: 241 IKQRFPG-------MTPAQIRRYAERWKPWRSYALLHIWYTEG 276 +++ ++ Q ++ + PW++ H+W E Sbjct: 253 VRRNIQRLFNQNEKVSAEQAEKWLADFAPWKALLAAHLWQQES 295 >UniRef50_Q5NXL1 DNA-3-methyladenine glycosidase II n=3 Tax=Betaproteobacteria RepID=Q5NXL1_AZOSE Length = 300 Score = 186 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 67/281 (23%), Positives = 114/281 (40%), Gaps = 19/281 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 + +L F ++ E V + + +T D A+ + + + Sbjct: 15 FRTDDVLAFHRRDPLAVAERVEGQTLQKGVVWEGRPACLTIRFDSAQASAELAIDGAPGA 74 Query: 70 VAAECLAK-------MSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 A LA+ +++ ++ + LG L A PGLR+P FE AI Sbjct: 75 AAPAALAQLLPRMLGLTQQVEVFERTYRDHPQLGPLIARHPGLRVPLSASPFEALSWAIT 134 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ +SV A L R+ ++ G L C+P +R+A + L++ G +A+ L Sbjct: 135 GQQISVRAAISLRRRLIEVAG--LRHSVGLACYPDAERVAGLNEADLRSAGFSQAKAQTL 192 Query: 183 IHLANAALEGTLPMT---IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 I L E LP+ V++ + L GIG WT +Y LRG+ D L D Sbjct: 193 IRLGRLVAEDELPLNTWIATLPVDEIRERLMRVRGIGPWTIDYALLRGFGWLDGSLHGDV 252 Query: 240 LIKQRFPG-------MTPAQIRRYAERWKPWRSYALLHIWY 273 ++++ +T Q +R+ + PWR+ H+W Sbjct: 253 VVRRSLQAVLDCPDSVTEGQAKRWLAEFSPWRALIAAHLWA 293 >UniRef50_Q7MGD3 Adenosine deaminase n=51 Tax=Vibrionales RepID=Q7MGD3_VIBVY Length = 481 Score = 186 bits (473), Expect = 6e-46, Method: Composition-based stats. Identities = 77/277 (27%), Positives = 110/277 (39%), Gaps = 27/277 (9%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIARHTL 59 L ++ + ML F RA+ S E V ++ Y R + + R A + L Sbjct: 224 LAFRGDLNVKHMLDFYRQRAIESEEVVTETSYQRQVVINGKTVGFRAEFPATFPAEKRQL 283 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLG---AARPGLRLPGCVDAFEQ 116 + S + +A + R+FDL C+ +++ L + G+R+PG + +E Sbjct: 284 VVYFSMDDLTLLRPMVAGIRRMFDLDCDTRVIEAHLNTVALGLVKSVGIRIPGVWNVWEA 343 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 GVRAILGQ VSV A + L E FP+PQ++ AD L+ MP Sbjct: 344 GVRAILGQQVSVKAAIGQLNLLVA----TLHHDSEVRTFPSPQQVVDADLHFLR---MPQ 396 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 R E L A LE GIG WT +Y LRG D L Sbjct: 397 SRKETLRRFAVMMLENE---------HADPNQWLALKGIGPWTVSYAQLRGLSQPDRLLE 447 Query: 237 DDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWY 273 D ++K+ E PW SYA H+W Sbjct: 448 KDLVVKKALAQFP----TLNQESASPWGSYATFHLWN 480 >UniRef50_Q1ITU3 DNA-3-methyladenine glycosylase II n=2 Tax=Bacteria RepID=Q1ITU3_ACIBL Length = 251 Score = 186 bits (473), Expect = 8e-46, Method: Composition-based stats. Identities = 51/215 (23%), Positives = 90/215 (41%), Gaps = 16/215 (7%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPG--LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTA 136 R + V+ L +L A P ++ + FE + +I+ Q +S AA + Sbjct: 3 DRHEKAIAHLSKVDKKLAKLIAKCPPCAIKPNYMQNVFEALMESIVYQQLSGKAAATILN 62 Query: 137 RVAQLYGERLDDFPEYIC-----FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE 191 RV LY + FPTP++L A + L++ G+ + +++ LA ++ Sbjct: 63 RVKALYFPPDTPTHDTRHGKALPFPTPEQLLATPDETLRSAGLSGNKTKSVKDLAAKTID 122 Query: 192 GTLPM---TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG- 247 GT+P ++ + L GIGRWT L KDV+ DD +++ + Sbjct: 123 GTVPDIATMKKMSDDEIINHLTQVRGIGRWTVEMILLFNLFRKDVWPVDDLGVRKGYGYL 182 Query: 248 -----MTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 P ++ E +KP+RS A ++W Sbjct: 183 HGIEMPKPKELMALGEVYKPYRSVAAWYMWRACET 217 >UniRef50_C0E8I7 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C0E8I7_9CLOT Length = 281 Score = 185 bits (471), Expect = 1e-45, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 90/245 (36%), Gaps = 24/245 (9%) Query: 44 YRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGR 97 + D + T + + A + FD + Q + + L + Sbjct: 44 GGRYLAMRQDGNQLTFY-------DTSAEDFERHWKLYFDFDTDYQAIKQGFLEDEVLKK 96 Query: 98 LGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPT 157 G+R+ D +E I+ Q ++ + R+ +L GE++ FPT Sbjct: 97 SCDYAGGIRILRQ-DPWETLCSFIISQNNNIPRIKGIIDRLCKLCGEQVPGG---YAFPT 152 Query: 158 PQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGI 215 P+ LAA L + RA L+ A+ G + + ++++A KTL + G+ Sbjct: 153 PEALAAKSLDDLSIMR-AGFRARYLLDAAHKVSTGKIDLPSLYTMEIDEARKTLTSICGV 211 Query: 216 GRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTE 275 G A L G+ + F D IK+ +A +P+ A ++++ Sbjct: 212 GPKVAECVLLFGFHRLEAFPV-DVWIKRAITYFYQDGFPEFA---RPYGGIAQQYLFHYI 267 Query: 276 GWQPD 280 P+ Sbjct: 268 RNCPE 272 >UniRef50_D1C1F2 HhH-GPD family protein n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C1F2_SPHTD Length = 319 Score = 184 bits (468), Expect = 2e-45, Method: Composition-based stats. Identities = 60/274 (21%), Positives = 102/274 (37%), Gaps = 20/274 (7%) Query: 8 PPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP----DIARHTLHINL 63 PP+ + L R + V+ Y R L V V D + + Sbjct: 16 PPFRLDLTVWVLRRRPDNVVDRWDGRTYRRVLPVNGQPIEVAVTQTGPVDSPCLHVVASG 75 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNP------QIVNGALGRLGAARPGLRLPGCVDAFEQG 117 E V E + R + + AL L G + +E Sbjct: 76 PGADETVVPELRRTLMRTLGTGVDLSGFSRLAAGDPALAELADRLRGAKPTRYPTVYEAL 135 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPL 176 V AI Q +++ ++ +R+AQ G ++ D ++ FP P+ + P L+ LG Sbjct: 136 VNAIACQQITLTFGLRILSRLAQECGMTIERDGETHVAFPRPEDVLTVSPDRLRELGFSR 195 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 ++A A++ L+ ++G+L + D+ AM+ L G+GRWTA Y LRG +F Sbjct: 196 QKARAVLELSERLVDGSLDLEPLEDLPDDAAMERLLALRGVGRWTAEYVLLRGLGRVHIF 255 Query: 235 LPDDYLIKQRFPGM-------TPAQIRRYAERWK 261 DD + ++R W+ Sbjct: 256 PGDDVGGRNNLRRWLGIEEALDYDGVQRVLGAWR 289 >UniRef50_C8XKJ9 AlkA domain protein n=1 Tax=Nakamurella multipartita DSM 44233 RepID=C8XKJ9_NAKMY Length = 300 Score = 184 bits (468), Expect = 3e-45, Method: Composition-based stats. Identities = 84/292 (28%), Positives = 119/292 (40%), Gaps = 21/292 (7%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIAR 56 + L P+ +L FL+ +V VE V + YARSL +G G + Sbjct: 14 VVDLPVAGPFAADRLLAFLSRESVPGVEYVREREYARSLRLGSGDDAEVGTIRLHLPGPG 73 Query: 57 HTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRL------GAARPGLRLPGC 110 + E +A+ L DL + V+ L PGLR+PG Sbjct: 74 DPPTVRAVVRFAARIDEAVARCRHLLDLDTDGSAVDRVLRADPGLAASVQRCPGLRVPGP 133 Query: 111 VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YICFPTPQRLAAADPQA 168 + E VR ILGQ VSVA A R+ L +RL + + FP P R+AA P A Sbjct: 134 AEPAETVVRTILGQQVSVAGARTAATRLVALADDRLPEPVDGLTHLFPEPARIAALGPTA 193 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGW 228 R A + AA + + A L PGIG WTA+Y ++R + Sbjct: 194 -----FVGPRIRAQAVVTAAAAIAGGTLRLDRTDAHARGVLLAMPGIGPWTADYLSMRVF 248 Query: 229 QAKDVFLPDDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYTEG 276 DV L DD I++ P ++ W+P+RSYA +H+W G Sbjct: 249 GDPDVLLVDDLAIRRGAGALGLPDQPRELAARGLDWRPFRSYAGMHLWAASG 300 >UniRef50_A1S7Q4 DNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / Transcriptional regulator Ada n=1 Tax=Shewanella amazonensis SB2B RepID=A1S7Q4_SHEAM Length = 483 Score = 183 bits (466), Expect = 4e-45, Method: Composition-based stats. Identities = 78/284 (27%), Positives = 125/284 (44%), Gaps = 22/284 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSY----YARSLAVGEYRGVVTAIPDIARH 57 L+++PPYD+ + F ARA+ E + Y R+L V G A ++ Sbjct: 211 LQLSFRPPYDFMRLRAFFMARAIPGAEWFFNDAGEPCYGRTLMVAGDAGWFEACLLAGKN 270 Query: 58 TLHINL-SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGL---RLPGCVDA 113 L +++ G ++ LA++ R+ D+ N +++ + L LPG Sbjct: 271 ALAVSIFPGGRVSALSQWLAEIKRVLDIDANLSLIHEHIQGHMPEGVVLNTMTLPGAGSF 330 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKAL 172 FE RA+LGQ VS+ A +L + ++ FPT +++A+A ++LK Sbjct: 331 FEAACRAVLGQQVSLVQATRLLGLLTAETTPEVELGGRRCRVFPTAEQVASATLESLK-- 388 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 MP R AL +A +P TL GIG WT +Y +RG D Sbjct: 389 -MPGSRKNALRDMAALFSRDPVPDD---------ATLLAVKGIGPWTVSYARMRGLSDPD 438 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAER-WKPWRSYALLHIWYTE 275 V L D ++KQ+ M A++ + PW SY L +W+TE Sbjct: 439 VLLVGDLVVKQKLTAMGWAKVPDRLKSDVSPWGSYLTLALWHTE 482 >UniRef50_B4X1U6 Base excision DNA repair protein, HhH-GPD family n=1 Tax=Alcanivorax sp. DG881 RepID=B4X1U6_9GAMM Length = 292 Score = 183 bits (464), Expect = 7e-45, Method: Composition-based stats. Identities = 64/291 (21%), Positives = 105/291 (36%), Gaps = 15/291 (5%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTA---IPDIARHT 58 TL P + L F + E V + +++ + ++ P R T Sbjct: 4 LTLALPPHFSVPAFLDFHGRDQHAISEQVESNVLRKAVTLDGRPCLLALDFNQPGQVRAT 63 Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGV 118 L L + + LG+L + GLR+P FE Sbjct: 64 ARTLTRTALSRQTRAMLGLDQAVHTFEQAVTGQATPLGQLVDRQRGLRVPQSATPFEALS 123 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 AI+GQ +SV+ A + R QL G+ C+P + D AL+++G + Sbjct: 124 WAIIGQQISVSAATAIRRRFIQLAGQTRISG--LHCYPDAAAVNQLDASALRSVGFSASK 181 Query: 179 AEALIHLANAALEGTLPMTIPGDVEQAM---KTLQTFPGIGRWTANYFALRGWQAKDVFL 235 AE L+ ++ E L V A + L G+G W+ NY LRG+ D L Sbjct: 182 AETLLTVSLCCCEHALLPDALHSVADAQSTEQALLGIRGLGPWSVNYTLLRGYGYLDGSL 241 Query: 236 PDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D +++ T + + + PWR+ H+W + Q Sbjct: 242 HGDVAVQKALQQLLAMKARPTAKATQDWLAAFTPWRALVAAHLWQSLQVQA 292 >UniRef50_B9XBY0 HhH-GPD family protein n=1 Tax=bacterium Ellin514 RepID=B9XBY0_9BACT Length = 294 Score = 182 bits (463), Expect = 1e-44, Method: Composition-based stats. Identities = 64/291 (21%), Positives = 101/291 (34%), Gaps = 18/291 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 ++L Y ++ F A E + + ++ V+ I Sbjct: 3 FSLKLPSNYSSREVMAFQARDPEGLAERLEPNRIRKAFVFEAIPLVLDISLAKNMANCRI 62 Query: 62 NLSAGLEPVAAECLAKMSRLF-------DLQCNPQIVNGALGRLGAARPGLRLPGCVDAF 114 L L +++R + + L L + GLR+P F Sbjct: 63 EADRPLPHATRTTLQQIARNLLALRIDPEPFEAMAKEDNLLASLVQKQTGLRIPHTTTPF 122 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGM 174 E AI+GQ ++++ A L QL G + C P +A +P L L Sbjct: 123 EALAWAIIGQQINLSFAITLRRSFIQLAGTKHSSG--LWCHPDASAVARLNPDHLGQLKF 180 Query: 175 PLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 +AE L+ +A G LP+ E+ L GIG WT NY LRG+ D Sbjct: 181 SRAKAETLVRMAQLVDSGKLPLDEWQNHSPEEIQAALLAIKGIGPWTVNYTLLRGFAFAD 240 Query: 233 VFLPDDYLIKQRFPG-------MTPAQIRRYAERWKPWRSYALLHIWYTEG 276 L D I+ T +I +R++P RS H+W + Sbjct: 241 CSLHGDAAIRNALNRLSGSATKPTIKEIETLLQRYRPHRSMTAAHLWKSLH 291 >UniRef50_A6CCG3 Probable DNA-3-methyladenine glycosylase n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CCG3_9PLAN Length = 211 Score = 180 bits (457), Expect = 5e-44, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 81/211 (38%), Gaps = 18/211 (8%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + + L + + L F +R+I+ Q +S + A + R+ Sbjct: 8 FLKASKHLSKADPLLKPVINSIGPCPLKPYRYRFALLLRSIVSQQISTSAARTIYLRLHA 67 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--I 198 L G+ PT +++ + L+++G+ ++A + HLA ++ + + Sbjct: 68 LTGKGQ---------PTAEKVMQLSHEQLRSVGLSNQKATYVRHLAEMVMQNKVRLHKMH 118 Query: 199 PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR---- 254 E L GIG WTA F + G D+F DD I+ + + R Sbjct: 119 LLSDEDVTSELIQVKGIGVWTAQMFLMFGLCRPDIFPHDDLGIQNGIQKIYELKTRPDKQ 178 Query: 255 ---RYAERWKPWRSYALLHIWYTEGWQPDEA 282 A+RW+P+R+ A + W + + Sbjct: 179 TCIEIAQRWQPYRTVASWYCWRILEMETPDG 209 >UniRef50_D1ZEJ1 Whole genome shotgun sequence assembly, scaffold_22 n=6 Tax=Leotiomyceta RepID=D1ZEJ1_SORMA Length = 415 Score = 179 bits (455), Expect = 8e-44, Method: Composition-based stats. Identities = 51/253 (20%), Positives = 88/253 (34%), Gaps = 55/253 (21%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGL-----RLPGCVDAFEQGVRAILGQLVSVAMAAKLT 135 L + V+ + L P L +D FE V +I+ Q VS A A + Sbjct: 158 LEKACAHLIAVDPRMKPLIDKHPCRIFSPEGLAEQIDPFESLVSSIISQQVSGAAAKSIK 217 Query: 136 ARVAQLY-------------------GERLDDFP----EYICFPTPQRLAAADPQALKAL 172 + L+ G +D P FPTP + D L+ Sbjct: 218 GKFVALFDDPSLDQDQDDEDGKDTPPGHPAEDQPSSKRRKRRFPTPSLVLQKDLPTLRTA 277 Query: 173 GMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++AE + LA+ G L + ++ + L G+G WT FA + Sbjct: 278 GLSQRKAEYIHGLASKFASGELSASLLASAPYDELVSKLVAVRGLGLWTVEMFACFALKR 337 Query: 231 KDVFLPDDYLIKQRFPG-------------------------MTPAQIRRYAERWKPWRS 265 DVF D +++ M+ +++ +ER++P+RS Sbjct: 338 MDVFSLGDLGVQRGMAAFVGRDVKKLKNGNGKGNGKDKKWKYMSEGEMKEISERFRPYRS 397 Query: 266 YALLHIWYTEGWQ 278 + ++W E Sbjct: 398 LFMWYMWRVEETD 410 >UniRef50_A1ZCF3 HhH-GPD n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZCF3_9SPHI Length = 207 Score = 178 bits (453), Expect = 1e-43, Method: Composition-based stats. Identities = 58/202 (28%), Positives = 96/202 (47%), Gaps = 24/202 (11%) Query: 92 NGALGRLGAARP---GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD 148 + L ++ L LP D + VR+I+GQ +SV AA + R +L+ E Sbjct: 8 DPLLKKVIEQASQTLSLALPK-KDIYLALVRSIVGQQLSVKAAATIYQRFRELFPE---- 62 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAM 206 +PTP+ + AA+ LKA G+ ++A + ++A A+EG L + E+ + Sbjct: 63 -----NYPTPKLVVAAELDTLKAAGLSKQKATYIKNVAAFAIEGGLDFEVLNNQTDEEII 117 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT---------PAQIRRYA 257 + L T G+GRWT + +Q DVF DD I+Q + A+++ A Sbjct: 118 QVLITIKGVGRWTVEMLLMFAFQRPDVFSVDDLGIQQAVKKLYQLDEEGKALKAKMKTIA 177 Query: 258 ERWKPWRSYALLHIWYTEGWQP 279 WKP+R+ A L++W + P Sbjct: 178 NAWKPYRTLACLYLWQWKDNTP 199 >UniRef50_A4BNP3 3-methyladenine DNA glycosylase/8-oxoguanineDNA glycosylase n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BNP3_9GAMM Length = 263 Score = 178 bits (452), Expect = 2e-43, Method: Composition-based stats. Identities = 76/270 (28%), Positives = 111/270 (41%), Gaps = 27/270 (10%) Query: 9 PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLE 68 P+ W+ +LG+L AR + E + D Y R V L ++A Sbjct: 12 PFPWAALLGYLDARLIPGAERIVDDGYER----RHNGATVRVTYHAGGKCLR--ITADDA 65 Query: 69 PVAAECLAKMSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 E ++ RLFD + + V+ L PGLR GC FE VR ++ Sbjct: 66 VCGDEITVRVIRLFDTGQDTRAVDRQLRACPLLRPRVDRMPGLRPLGCWCPFELCVRTVV 125 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ VSVA AA L R+A+ GE A+GMP +R L Sbjct: 126 GQQVSVAAAATLMRRLAERCGELSPAALCAADL--------------DAIGMPGRRVATL 171 Query: 183 IHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK 242 LA A G L + D L PGIG WT Y A+R + D+ D + Sbjct: 172 RRLAEAVATGELALEH-ADWAAIDAGLSRLPGIGPWTRAYLAIRLGRQPDILPETDLGLL 230 Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 + +P +R ++RW+P+R++A ++W Sbjct: 231 RAAGAASPTVLRALSQRWRPYRAHAATYLW 260 >UniRef50_C1RNZ7 DNA-3-methyladenine glycosylase II n=1 Tax=Cellulomonas flavigena DSM 20109 RepID=C1RNZ7_9CELL Length = 302 Score = 178 bits (451), Expect = 2e-43, Method: Composition-based stats. Identities = 83/278 (29%), Positives = 117/278 (42%), Gaps = 8/278 (2%) Query: 7 QPPYDWSWMLGFLAARAVSSVETVA--DSYYARSLAVGEYRGVVTAIPDIARHTLHINLS 64 + D L LAAR+V VE V +R + +G VTA + + Sbjct: 2 RTALDHRAALSSLAARSVPGVERVDVDAGTVSRLVELGAGPVHVTAHVAATGVRVDADGP 61 Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIV--NGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 A + DL + + LG L AARP LR+PG D FE V+ +L Sbjct: 62 AAPAALDDLATRWFGLADDLAPVHAALGGDPVLGPLVAARPHLRVPGHPDGFEAAVQVVL 121 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL-GMPLKRAEA 181 Q VS+ AR+A YG + +P P+ LAAAD AL+A+ +P RA A Sbjct: 122 TQQVSLGAGRTTGARLASAYGRP--GPGGLLAYPRPEDLAAADSVALQAVLRVPHARARA 179 Query: 182 LIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 + LA A G L + L PGIG WTA+ ALR +D F D ++ Sbjct: 180 VHALAVACA-GGLRLVPGAPAADVRAALLAIPGIGPWTADVVALRALGDRDAFPAGDLVL 238 Query: 242 KQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 ++ + W PWR++A H+W G+ Sbjct: 239 RRALGVPDVRDVATAGRAWSPWRAFAATHLWAAVGYPA 276 >UniRef50_B7K2N0 DNA-3-methyladenine glycosylase II n=5 Tax=Chroococcales RepID=B7K2N0_CYAP8 Length = 206 Score = 177 bits (450), Expect = 3e-43, Method: Composition-based stats. Identities = 42/199 (21%), Positives = 80/199 (40%), Gaps = 20/199 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L L + P + + F V+AI+GQ +SV A ++ R+ L G Sbjct: 18 DKILAYLISLYPDETIINYHNPFYTLVKAIIGQQISVNAANAISKRLESLLGT------- 70 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTL 209 + + A D +AL+ G+ + + ++A A +G L + ++ + L Sbjct: 71 ----ISIETYLAMDSEALRQCGLSRPKISYITNIAQAFEQGILTPQIWPMMSDQEVISQL 126 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWKP 262 + GIG WTA F + D+ D + +I+ ++ WKP Sbjct: 127 ISIKGIGLWTAQMFLIFHLHRSDILPLADLGLINAIQRHYGQSQRLTKGEIQELSQVWKP 186 Query: 263 WRSYALLHIWYTEGWQPDE 281 +R+ A ++W + P + Sbjct: 187 YRTVATWYLWRSLDPIPVQ 205 >UniRef50_Q2FMK1 HhH-GPD n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FMK1_METHJ Length = 309 Score = 177 bits (449), Expect = 4e-43, Method: Composition-based stats. Identities = 54/288 (18%), Positives = 107/288 (37%), Gaps = 33/288 (11%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLS----- 64 WS ++ + V T ++ ++ + + V+T + LS Sbjct: 14 LHWSAII--FSNH-DPMVRTYSNHAFSFTALLSNGPFVLTVNQTDPFEKTPLTLSIWSDK 70 Query: 65 ----------AGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAF 114 + L + D C+ + + L GLR P F Sbjct: 71 AVRAQDTCEASDLITWYFALNDNLMDFLDAICS----DPVMKSLAHRLDGLRSPATPTVF 126 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGER-LDDFPEYICFPTPQRLAAADPQALKALG 173 E + +++ Q +S+++A L R + +G + + C+P P+ LA +P + G Sbjct: 127 EALIDSVIEQQISLSVARSLEYRFIRQFGRTCFVNGDLHYCYPLPEDLAGLEPSDFRRCG 186 Query: 174 MPLKRAEALIHLANAALEGTLPMTIP---GDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 ++ E + ++ + +G L + D ++ L GIGRWTA LRG Sbjct: 187 FTSRKGEYIRDISRSIEKGNLDLESFKKVRDNADIVEALCQIRGIGRWTAELTMLRGLHR 246 Query: 231 KDVFLPDDYLIKQRFPGMTP-------AQIRRYAERWKPWRSYALLHI 271 D F DD +++ ++ + AE+W ++ A ++ Sbjct: 247 MDAFPADDIALRRMISRWYHNGKKISASEAVKTAEQWGEYKGLASFYL 294 >UniRef50_A2QHV8 Contig An04c0070, complete genome n=10 Tax=Eurotiomycetidae RepID=A2QHV8_ASPNC Length = 412 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 52/286 (18%), Positives = 95/286 (33%), Gaps = 34/286 (11%) Query: 20 AARAVSSVET-VADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKM 78 + ++ V + +L +V P L Sbjct: 124 SRTTPPPLDRPVEPHHTNATLLTPHGSSLVAYPPGTDADASPSKTGRPRPTATTGTL--- 180 Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLR-----LPGCVDAFEQGVRAILGQLVSVAMAAK 133 L + + L L +P L +D F V +I+GQ VS A A Sbjct: 181 --LEKAAAHLIATDPRLESLIREQPCPLFTPEGLAEEIDPFRSLVSSIIGQQVSGAAAKS 238 Query: 134 LTARVAQLY--GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE 191 + + L+ + +D FPTP+ + D L+ G+ ++AE + L+ Sbjct: 239 IKDKFVALFKTNNKDEDGTRPSFFPTPEEIIKMDISTLRTAGLSQRKAEYIHGLSEKFAN 298 Query: 192 GTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP--- 246 G L M + E+ ++ L G+G+W+ FA + DVF D +++ Sbjct: 299 GELSARMLLNASDEELVEKLTAVRGLGKWSVEMFACFALKRIDVFSTGDLGVQRGCAVFV 358 Query: 247 ----------------GMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 M + A ++ P+RS + ++W Sbjct: 359 GKDVNKLKGKGGGKFKYMPEKDMLELAAKFAPYRSLFMWYMWRVTD 404 >UniRef50_Q9KC25 DNA-3-methyladenine glycosidase n=1 Tax=Bacillus halodurans RepID=Q9KC25_BACHD Length = 221 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 46/216 (21%), Positives = 84/216 (38%), Gaps = 22/216 (10%) Query: 79 SRLFDLQCNPQ----IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKL 134 R F + L + ++LP + F+ V +I+ Q +S+ A+ + Sbjct: 1 MRYFSTDSPEVKTIVAQDSRLFQFIEIAGEVQLPTKPNPFQSLVSSIVEQQLSIKAASAI 60 Query: 135 TARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL 194 RV QL G L+ P++L +AL+ G+ ++ E + H+ G L Sbjct: 61 YGRVEQLVGGALEK---------PEQLYRVSDEALRQAGVSKRKIEYIRHVCEHVESGRL 111 Query: 195 PMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT--- 249 T E ++ L GIG+WTA F + DV D +++ + Sbjct: 112 DFTELEGAEATTVIEKLTAIKGIGQWTAEMFMMFSLGRLDVLSVGDVGLQRGAKWLYGNG 171 Query: 250 ----PAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 + + + W P+ + A L++W G +E Sbjct: 172 EGDGKKLLIYHGKAWAPYETVACLYLWKAAGTFAEE 207 >UniRef50_C7RDZ5 8-oxoguanine DNA glycosylase domain protein n=3 Tax=Clostridiales Family XI. Incertae Sedis RepID=C7RDZ5_ANAPD Length = 300 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 45/251 (17%), Positives = 91/251 (36%), Gaps = 15/251 (5%) Query: 31 ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQI 90 D Y +G+ ++ + +L N+S + +D Sbjct: 46 EDGSYTAVF-LGK---IINVLKLDEGVSLIRNISLEDFNEIFYDYFDLGLDYDAIKKEVA 101 Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD-- 148 ++ + + A G+R+ + FE + I+ + K +++ YG+ + + Sbjct: 102 IDPVMEKATAYGYGIRILNQ-EVFETTISFIISANNQIPRIKKAVRIISERYGDYIGEYK 160 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKT 208 +Y FP P+ L P+ L+ R + ++ + EG L D E K Sbjct: 161 GRKYYSFPRPEVLMKVKPEDLREYARVGFRDKRIVEASRMIYEGQLDGASKLDTEDLRKK 220 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAER-WK 261 L PG+G A+ L + ++ F D IK+ + Q+ YA + + Sbjct: 221 LMELPGVGPKVADCILLFAYHRRETFPV-DVWIKRVMETLFIKKEVPKKQVDDYARKYFG 279 Query: 262 PWRSYALLHIW 272 YA +++ Sbjct: 280 KNAGYAQQYLF 290 >UniRef50_Q82VT3 HhH-GPD n=2 Tax=Betaproteobacteria RepID=Q82VT3_NITEU Length = 205 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 55/208 (26%), Positives = 82/208 (39%), Gaps = 20/208 (9%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + + + R+ +AF RAI+GQ +SV AA + +V Sbjct: 6 WEQAVNDLSARDPVMHRIIQCYSDSMPEERGNAFATLARAIVGQQISVKAAASVWQKVTT 65 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--I 198 L E TP+ L A + L+ G+ ++ + L L+ LEGTL Sbjct: 66 LIPE-----------ITPEALIATEIDLLRTCGLSARKVDYLRDLSRHFLEGTLVTVNWH 114 Query: 199 PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PA 251 D E ++ L GIGRWTA F + DV DD +++ Sbjct: 115 DLDDETLIRKLVEVKGIGRWTAEMFLIFHLHRPDVLPLDDIGLQRAVSLHYNASQPVAKQ 174 Query: 252 QIRRYAERWKPWRSYALLHIWYTEGWQP 279 IR AE W+PWRS A ++W + P Sbjct: 175 AIRTIAESWQPWRSVATWYLWRSLDPIP 202 >UniRef50_D0J4I7 HhH-GPD n=2 Tax=Comamonas testosteroni RepID=D0J4I7_COMTE Length = 329 Score = 174 bits (442), Expect = 2e-42, Method: Composition-based stats. Identities = 76/304 (25%), Positives = 116/304 (38%), Gaps = 32/304 (10%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVAD--SYYARSLAVGEYRGVVTAIPDIARHT- 58 L + Y + + F E V +++ ++ + Sbjct: 23 IALPER--YRFDEFVHFHDRDEQQLAERVDAAARSLHKAIMWKRSPALLQLQWQTGQVQA 80 Query: 59 -LHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCV 111 LH + E A A + R+ L P + + LG L + + GL +PG Sbjct: 81 SLHAAHTEPAEADRAALQAMVKRMLGLIYAPDQLELAHGDHPELGVLLSRQAGLHVPGSP 140 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD---DFPEYICFPTPQRLAAADPQA 168 FE AI GQ ++VA+A L ++ L GE L D P +P QR+AA A Sbjct: 141 TPFEALTWAITGQQITVAVAVSLRRKLIALAGEPLAQDGDMPALHAYPDAQRVAALGLDA 200 Query: 169 LKALGMPLKRAEALIHLANAALEGTLPMT-----------IPGDVEQAMKTLQTFPGIGR 217 L+ G +A+ L+ +A A EG LP+ DV A L GIG Sbjct: 201 LRGAGFSQAKAQTLLAVAQAVAEGQLPLDDWAARSAVGRWSEEDVAAASAQLLAVKGIGP 260 Query: 218 WTANYFALRGWQAKDVFLPDDYLIKQRFP------GMTPAQIRRYAERWKPWRSYALLHI 271 WT NY LRG+ D L D +++ + E++KPWR+ H+ Sbjct: 261 WTVNYTLLRGYGWPDGSLHGDVAVRRAIGLLTGSDKPDARAASDWLEQFKPWRALVAAHL 320 Query: 272 WYTE 275 W + Sbjct: 321 WASL 324 >UniRef50_C7NLP9 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NLP9_KYTSD Length = 286 Score = 174 bits (442), Expect = 3e-42, Method: Composition-based stats. Identities = 75/279 (26%), Positives = 113/279 (40%), Gaps = 19/279 (6%) Query: 8 PPYDWSWMLGFLAARAVSSVETVADSY--YARSLAVGEYRGVVTAIPDIARHTLHINLSA 65 P+D LG L A AV + + R + V +VT D T+ Sbjct: 13 GPFDRVAALGTLTAHAVDGLHRLDPDTTELTRWVDVHGDPQLVTVCLDPGGATVSTATR- 71 Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVR 119 V+ E A++ FDL + V + L +RPG+R+ FE + Sbjct: 72 -DAGVSDEIAARVQHWFDLDTDLTPVNARLGADPVLAGQVRSRPGIRITRFHAPFEAVIL 130 Query: 120 AILGQLVSVAMAAKLTARVAQLYGE---RLDDFPEYICFPTPQRLAAADPQALKA-LGMP 175 +LGQ VS+A AR+ YG+ + P FPTP L A + L+A +G+ Sbjct: 131 TVLGQQVSLAAGRLFAARLIAAYGDDAAPVRQEPGLRVFPTPVALTAVPVEELRAVIGLT 190 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 RA + +A E ++P A L GIG WT +Y A+R D F Sbjct: 191 GTRARTVHAVAAHFAETARDASLP-----ARAELHAVHGIGPWTLDYLAIRASTDADAFP 245 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYT 274 D ++++ ++P A W P+RSYA +W Sbjct: 246 ATDAVLRRTLAAISPDTGPERAASWSPYRSYAASRLWAH 284 >UniRef50_B5ES79 HhH-GPD family protein n=4 Tax=Acidithiobacillus RepID=B5ES79_ACIF5 Length = 322 Score = 173 bits (440), Expect = 5e-42, Method: Composition-based stats. Identities = 66/289 (22%), Positives = 107/289 (37%), Gaps = 19/289 (6%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEY----RGVVTAIPDIARH 57 + L+ +PP+ + L +A + ++ Y R G+ R T Sbjct: 5 FRLSPRPPFRLDLTVWALRRQAHNRMDGWESETYRRVWRYGDDWLKVRLWQTKGDPDPFL 64 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCV 111 I E + A+++ + L + + L L A GLR P Sbjct: 65 EGEIYEGPQDERTVSWVRAQLTWMLSLDRDLGPFYVVAAGDPRLASLEARYRGLRPPRFP 124 Query: 112 DAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKA 171 FE V A+ Q +S+ + L R+++L E + + + FP P L + AL+ Sbjct: 125 SLFEGMVNAVACQQLSLHLGITLLNRLSELCREGVGEMDQVYPFPDPGSLLRQEVTALRG 184 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQ 229 LG ++ AL LA A G L + A++ L GIGRW+A Y LR Sbjct: 185 LGFSGQKVTALRALAEEAAVGGLEREDWQHLPNAAAVQRLLRLRGIGRWSAEYVLLRTLG 244 Query: 230 AKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPWRSYALLHI 271 DVF DD ++ AQ+ W+P+ + Sbjct: 245 RLDVFPGDDVGARKALARWLEENGSLDYAQVAHRLRPWQPYAGMVYFLL 293 >UniRef50_B3T536 Putative HhH-GPD superfamily base excision DNA repair protein n=1 Tax=uncultured marine microorganism HF4000_ANIW137P11 RepID=B3T536_9ZZZZ Length = 209 Score = 173 bits (439), Expect = 7e-42, Method: Composition-based stats. Identities = 53/200 (26%), Positives = 87/200 (43%), Gaps = 19/200 (9%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 +++ AL + A+ L L D F V+AI+GQ +S+ AA + RV L GE Sbjct: 20 LIDPALAAVINAKGELGLSSRGDLFATLVKAIVGQQISIKAAATVWGRVVDLIGEV---- 75 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP-GDVEQAMKT 208 P+ + A + L++ G+ ++AE + +A A G D E+A++ Sbjct: 76 -------KPESVLAHTHEELRSCGLSNRKAEYVAGIAEAWQGGYAEYDWDSMDDERALEL 128 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWK 261 L G+GRWTA + DVF DD + + + A++ A W Sbjct: 129 LVALRGVGRWTAEMVLIFTLLRPDVFPIDDLGVVRGMEKVYNEGEVLDKAELNDIASNWS 188 Query: 262 PWRSYALLHIWYTEGWQPDE 281 PWR+ ++W +P E Sbjct: 189 PWRTVGSWYMWRAIDPEPIE 208 >UniRef50_B1YMD5 HhH-GPD family protein n=1 Tax=Exiguobacterium sibiricum 255-15 RepID=B1YMD5_EXIS2 Length = 273 Score = 173 bits (438), Expect = 7e-42, Method: Composition-based stats. Identities = 56/273 (20%), Positives = 107/273 (39%), Gaps = 17/273 (6%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEP 69 YD++ + L + VE D + + E A + + LS Sbjct: 12 YDFAGIRKRLRGDRLQ-VEQ--DGRLFVPVMLPEGNF---IGQVEAVGSRELLLSGEGPQ 65 Query: 70 VAAECLAKMS-RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 L + RL + + +L + + RL ++ F +R+I+ Q +++ Sbjct: 66 EPMVQLLRCRFRLDTTNPSQHLSETSLSEVVSTFGAERLVLDINPFTALIRSIIHQQINL 125 Query: 129 AMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA 188 A A L R + +G + I PT ++L +P+ L+AL + ++ + L+ A A Sbjct: 126 AFAQVLMERFCRTFGT---EQNGVIFPPTAEQLVNVEPEQLRALQLSGRKVDYLLGAARA 182 Query: 189 ALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 A++ +TL G+G WT + G+ +D+F D I + F + Sbjct: 183 AID--FERLTEAPDATIAETLIALKGVGPWTVQNVLMFGYGREDLFPASDIGILRAFERL 240 Query: 249 TP-----AQIRRYAERWKPWRSYALLHIWYTEG 276 + AE + P+RS+A +W + Sbjct: 241 HGTRPSVEEAVLLAEEFAPYRSHAAYLLWRSIE 273 >UniRef50_Q0VPN7 Putative uncharacterized protein n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VPN7_ALCBS Length = 291 Score = 172 bits (437), Expect = 9e-42, Method: Composition-based stats. Identities = 58/290 (20%), Positives = 105/290 (36%), Gaps = 14/290 (4%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI 61 L + S L F + + E V +++ ++ + H + Sbjct: 4 VKLPLPAHFSVSDFLHFHGRDSHAVSERVDALTLHKAIIFNLRPCILKMDFNQVGHVITN 63 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIV--NGALGRLGAARPGLRLPGCVDAFEQGVR 119 + + A + ++ + L L + GLR+P FE Sbjct: 64 ATDTTAPALTRQASAMLGLNQPVEVFETAIGTKPPLNTLVQRQRGLRVPQSATPFEALTW 123 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 AI+GQ +SV+ A + R QL D C+P + P AL+++G +A Sbjct: 124 AIIGQQISVSAATAIRRRFIQLASPVRHDG--LHCYPDAATVCLLTPDALRSVGFSATKA 181 Query: 180 EALIHLANAALEGTL---PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 + L+ ++ + L + + EQ + L G+G W+ NY LRG+ D L Sbjct: 182 DTLLAVSRLCRDQQLLPETLHLDAYAEQLERNLLEIRGLGPWSVNYTLLRGYGFLDGSLH 241 Query: 237 DDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D +++ T R + + PWR+ H+W + Q Sbjct: 242 ADVAVQKALQMLLGQPERPTARVTRDWLADFTPWRALVAAHLWQSLSTQA 291 >UniRef50_B2SXP8 HhH-GPD family protein n=39 Tax=Betaproteobacteria RepID=B2SXP8_BURPP Length = 349 Score = 172 bits (436), Expect = 1e-41, Method: Composition-based stats. Identities = 49/226 (21%), Positives = 86/226 (38%), Gaps = 23/226 (10%) Query: 66 GLEPVAAECLAKMSRLFDLQ---CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 + A +++R + + L +L + L D F R+++ Sbjct: 132 AVPVQIAGLTPEVTRPAYWDKACADLVKRDRILKKLIPKFGPVHLLSRGDPFVTLARSVV 191 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ +SVA A + A+V + + PQ+ + L G+ ++AE + Sbjct: 192 GQQISVASAQAVWAKVEAACPKLV-----------PQQFIKLGLEKLTTCGLSKRKAEYV 240 Query: 183 IHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 + LA + G L + + E + L GIGRWTA F + DV DD Sbjct: 241 LDLAQHFVSGALHVGKWTSMEDEAVIAELTQIRGIGRWTAEMFLIFNLSRPDVLPLDDLG 300 Query: 241 IKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 + + ++ R A W+PWR+ A ++W + P Sbjct: 301 LIRAISVNYFSGEPVTRSEAREVAANWEPWRTVATWYMWRSLDPLP 346 >UniRef50_A6WG49 HhH-GPD family protein n=5 Tax=Actinomycetales RepID=A6WG49_KINRD Length = 295 Score = 171 bits (434), Expect = 3e-41, Method: Composition-based stats. Identities = 85/287 (29%), Positives = 122/287 (42%), Gaps = 12/287 (4%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 L + + +L A A+ E D + R + G V+ + + L Sbjct: 10 LAVRGELAADPLRRWLRAHALPGAERHLDGVHERVVPTGAGPVEVSVDLGTSPGCEQVVL 69 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQG 117 A E + R L +P + L L AARPGLR+P V E Sbjct: 70 HAP-AAALDEVEGTVRRWLGLDADPAEAEAWLARDPLLAPLVAARPGLRVPRAVAGVETA 128 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKA-LGMP 175 V +LGQ VS+A A R+ +G + FP LA A +A++A G+ Sbjct: 129 VLTVLGQQVSLAAARTFGGRLVAAFGTPVSSAPSSLTAFPAAAVLADAGAEAIRAATGVT 188 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTF---PGIGRWTANYFALRGWQAKD 232 RA + LA A G GD E+A PGIG WTA+Y ALR +D Sbjct: 189 GARARTVHALAAALAGGLDLDAAAGDPERAGAARARLLALPGIGPWTADYVALRVLGDRD 248 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 FLP D ++++ G++P + AE W+PWR +ALLH+W + P Sbjct: 249 AFLPGDLVLRRALGGLSPKEAAARAEPWRPWRGHALLHLWTAAVFVP 295 >UniRef50_A9RKT9 Predicted protein (Fragment) n=1 Tax=Physcomitrella patens subsp. patens RepID=A9RKT9_PHYPA Length = 205 Score = 171 bits (433), Expect = 3e-41, Method: Composition-based stats. Identities = 44/209 (21%), Positives = 80/209 (38%), Gaps = 18/209 (8%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARV 138 + + + + L + ++F VR+I+ Q ++V AA + AR+ Sbjct: 5 DAIAEATKHLLAADANLACVIQKSNSPPFENDGNSFAALVRSIVSQQLAVKAAATIHARL 64 Query: 139 AQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI 198 L G TP +AA L+ G+ ++ L LA+ + G L Sbjct: 65 VALCGGPQKV--------TPAAIAALTAGELRGAGISGRKEVYLHDLADKLVSGALSDEK 116 Query: 199 PG---DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------M 248 D + + L GIG W+A+ F + DV D I++ F Sbjct: 117 LMAMEDEDDLVTALTAVKGIGVWSAHMFMIFHLHRPDVLPVGDLGIRKGFQKLFHLKHLP 176 Query: 249 TPAQIRRYAERWKPWRSYALLHIWYTEGW 277 ++ + A+ W+P+RS A ++W + Sbjct: 177 CAEEMHKLADSWRPYRSLASWYLWQLKDA 205 >UniRef50_A4XJM3 8-oxoguanine DNA glycosylase domain protein n=2 Tax=Clostridia RepID=A4XJM3_CALS8 Length = 286 Score = 170 bits (431), Expect = 5e-41, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 82/216 (37%), Gaps = 17/216 (7%) Query: 72 AECLAKMSRLFDLQCNPQIV-------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQ 124 E FDL + + + L G+RL + FE + I+ Q Sbjct: 66 DEFKKSFYWYFDLDKDYDEILEKLSGHDSILKEAVEKYRGMRLLNQ-EPFECMISFIISQ 124 Query: 125 LVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 ++ L R+ Q +G+++ FPT + L ++ LK LG+ RAE + Sbjct: 125 NNNIKRIQLLIERLCQAFGKKITYKGFVSWSFPTLESLWSSSIDDLKLLGL-GYRAEYIK 183 Query: 184 HLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 G + D+E +A + L+T GIG A+ L Q +VF D + Sbjct: 184 DAVEKVKNGLINFDELTDLEVQKAKQVLKTIKGIGDKVADCILLYSLQKYNVFPI-DVWV 242 Query: 242 KQRFPGMT----PAQIRRYAERWKPWRSYALLHIWY 273 K+ QIR + + YA L +++ Sbjct: 243 KRVLKEYYGFKTKDQIRDFINSFGDLAGYAQLFLFH 278 >UniRef50_D1HE56 Whole genome shotgun sequence of line PN40024, scaffold_1.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1HE56_VITVI Length = 351 Score = 170 bits (431), Expect = 5e-41, Method: Composition-based stats. Identities = 43/192 (22%), Positives = 72/192 (37%), Gaps = 17/192 (8%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + L L P F ++IL Q ++ + R L G Sbjct: 128 ADPHLAPLIDLHPPPTFDSFHTPFLALTKSILYQQLAYKAGTSIYTRFVGLCGGEAGVL- 186 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKT 208 P+ + A P L+ +G+ ++A L LA G L T I D + Sbjct: 187 -------PETVLALTPHQLRQIGVSGRKASYLHDLARKYQNGILSDTGIITMDDKSLFTM 239 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWK 261 L GIG W+ + F + DV +D +++ + P+Q+ + E+W+ Sbjct: 240 LTMVNGIGSWSVHMFMIFSLHRPDVLPVNDLGVRKGVQLLYGLEELPRPSQMEQLCEKWR 299 Query: 262 PWRSYALLHIWY 273 P+RS A +IW Sbjct: 300 PYRSVASWYIWR 311 >UniRef50_C5G8B3 DNA-3-methyladenine glycosylase n=8 Tax=Onygenales RepID=C5G8B3_AJEDR Length = 438 Score = 170 bits (431), Expect = 6e-41, Method: Composition-based stats. Identities = 50/266 (18%), Positives = 88/266 (33%), Gaps = 70/266 (26%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGC-----VDAFEQGVRAILGQLVSVAMAAKLT 135 L + + V L + P +D F V I+GQ VS A A + Sbjct: 166 LEEAVAHLIKVAPQLRPVIEKHPCPLFSPAGLAEEIDPFNSLVSGIIGQQVSGAAAKSIK 225 Query: 136 ARVAQLY----------------------------------------GERLDDFPEYIC- 154 + L+ GE+++ Sbjct: 226 KKFMALFRSDGGGGGDCNSINNATATIATTVINDGGAATKNNNNMDAGEKIETAEMRYDR 285 Query: 155 ---FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTL 209 FPTP ++A D L+ G+ ++AE + LA G L M + E+ ++ L Sbjct: 286 DDDFPTPAQVAKCDIATLRTAGLSQRKAEYIQGLAEKFASGELSARMLLQASDEEVLEKL 345 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-------------------PGMTP 250 G+G+W+ F+ G + DVF D +++ M+ Sbjct: 346 IAVRGLGKWSVEMFSCFGLKRMDVFSTGDLGVQRGMAAFVGRDVSKLKAKGGGKFKYMSE 405 Query: 251 AQIRRYAERWKPWRSYALLHIWYTEG 276 ++ A + P+RS + ++W E Sbjct: 406 KEMVEVAAPFSPYRSLFMWYMWRIED 431 >UniRef50_A9FBN7 Putative DNA-3-methyladenine glycosidase n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FBN7_SORC5 Length = 292 Score = 170 bits (430), Expect = 6e-41, Method: Composition-based stats. Identities = 59/275 (21%), Positives = 104/275 (37%), Gaps = 11/275 (4%) Query: 7 QPPYDWSWMLGFLAARAVSSVETV-ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSA 65 + P+ + + L R + V+T AD Y R+ V + L + + Sbjct: 2 RAPFRLALTVAALQRRPENPVDTWSADRRYLRAFDTARGPVVWAVTEEPGGTQLRVEVFG 61 Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP------GLRLPGCVDAFEQGVR 119 ++ ++RL + RL A G++ P +E V Sbjct: 62 DVDDPRMW-RGLVTRLLGTDIDLAPFYARAERLPAFAALAARFRGVKPPRFASLWEAIVN 120 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKR 178 +++ Q +S+A A R+ + +D FP P+ +A++ P L+ LG+ + Sbjct: 121 SVVFQQLSLAAAMAAVRRLVLRFASPVDVAGQRLFPFPPPEVVASSTPHDLRTLGLSGAK 180 Query: 179 AEALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 A+AL A G L + + + L+ PGIG WTA+ LRG++ DVF Sbjct: 181 ADALRTCARMIAAGELREEELEALANGEIERRLRELPGIGPWTASVILLRGFRRLDVFPG 240 Query: 237 DDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 D + + E P+R H+ Sbjct: 241 GDVAAARGLGAIAGEHGGELVEALGPYRGMLYFHL 275 >UniRef50_Q9LN45 F18O14.25 n=22 Tax=Magnoliophyta RepID=Q9LN45_ARATH Length = 1314 Score = 170 bits (430), Expect = 6e-41, Method: Composition-based stats. Identities = 38/196 (19%), Positives = 75/196 (38%), Gaps = 17/196 (8%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + L L P F +R IL Q +++ + R L G Sbjct: 149 ADPLLAALIDVHPPPTFESFKTPFLALIRNILYQQLAMKAGNSIYTRFVSLCGGENLVV- 207 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKT 208 P+ + + +PQ L+ +G+ ++A L LA G L + ++++ Sbjct: 208 -------PETVLSLNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAILNMDEKSLFTM 260 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWK 261 L GIG W+ + F + DV +D +++ + P+Q+ ++ +W+ Sbjct: 261 LTMVNGIGSWSVHMFMINSLHRPDVLPVNDLGVRKGVQLLYGLDDLPRPSQMEQHCAKWR 320 Query: 262 PWRSYALLHIWYTEGW 277 P+RS ++W Sbjct: 321 PYRSVGSWYMWRLIEA 336 >UniRef50_C1D8D7 HhH-GPD family protein n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D8D7_LARHH Length = 208 Score = 170 bits (430), Expect = 7e-41, Method: Composition-based stats. Identities = 50/214 (23%), Positives = 90/214 (42%), Gaps = 20/214 (9%) Query: 75 LAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKL 134 +A+ + + + + RL A+ P L + FE +RAI+GQ +SV A + Sbjct: 1 MARPAWWDNACAGLAAADPVMARLIASWPDAELVSRGEPFETLLRAIVGQQISVRAADAV 60 Query: 135 TARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTL 194 R++ + + P+P+R+ A + L++ G+ ++ LA +G + Sbjct: 61 WKRLSAVLSGQ----------PSPERVLALPEEVLRSAGLSARKVLYARDLAECFTDGRV 110 Query: 195 P--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP-- 250 D E + L GIGRWTA + + DV+ DD +++ Sbjct: 111 NPAAHAGLDDEALIAELVAVRGIGRWTAEMYLIFNQLRPDVWPVDDIGLQRAMARHYALE 170 Query: 251 ------AQIRRYAERWKPWRSYALLHIWYTEGWQ 278 Q+R ER+ PWR+ A ++W + Q Sbjct: 171 DQKASLTQLRVMGERFAPWRTVATWYLWRSLDPQ 204 >UniRef50_D2PPK3 Transcriptional regulator, AraC family n=1 Tax=Kribbella flavida DSM 17836 RepID=D2PPK3_9ACTO Length = 435 Score = 169 bits (429), Expect = 8e-41, Method: Composition-based stats. Identities = 74/276 (26%), Positives = 118/276 (42%), Gaps = 31/276 (11%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARH--- 57 + L +QPPYDW M+ LAARAV VE+V+D Y R++ + GV+ P Sbjct: 183 LMRLPYQPPYDWDAMVDHLAARAVPGVESVSDRVYRRTIGLDGGAGVLEIGPGEGDVLML 242 Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 H+ GL V A++ + + + + LG L ARPGLR+PG A E Sbjct: 243 RAHLPYWEGLIHVVERA-ARLVGVASEPADRLLRDPLLGPLVVARPGLRVPGAWGALEIA 301 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD--DFPEYICFPTPQRLAAADPQALKALGMP 175 V+A+ Q S+ R+ + G+ + FP+ + LA++ Sbjct: 302 VQAVTAQDHSLKETRAQLGRLVKECGQPVPGLTDRLTHLFPSAEVLASSSTGI------- 354 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 + LA A +G + + G E + L T PG+ TA++ ALR +DVF Sbjct: 355 ------VQSLAAAVADGRVSLE-GGSSEVLLAQLTTVPGLMPDTADWIALR-LGHQDVFP 406 Query: 236 PDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 + ++RW+P R+ A ++ Sbjct: 407 ----------RSLHAEVAAEVSDRWRPHRAVAATYL 432 >UniRef50_B6JZD7 DNA-3-methyladenine glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6JZD7_SCHJY Length = 268 Score = 169 bits (429), Expect = 9e-41, Method: Composition-based stats. Identities = 39/212 (18%), Positives = 75/212 (35%), Gaps = 19/212 (8%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPG--LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTA 136 +L + + ++ R+ A R+ +E +RA+ Q ++ + Sbjct: 52 DQLAKAEEHLASIDEHWKRVVEAIGHTSFRVEKVRQPYEALIRAVAYQQLTTKAGKAIIN 111 Query: 137 RVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG---T 193 R+ FPTP+ + A + + LK+ G ++ + + +A G + Sbjct: 112 RLVAK-------ASATGGFPTPEEILALEQEQLKSCGFSRRKTDTIREIARGVETGLIPS 164 Query: 194 LPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP--- 250 L E+ ++ L GIGRWTA + G DV D I+ F + Sbjct: 165 LDAAHEMVNEELIERLSQIHGIGRWTAEMLLIFGMGRLDVLPAGDLKIRDGFRYLYAMDK 224 Query: 251 ----AQIRRYAERWKPWRSYALLHIWYTEGWQ 278 + P+RS A +++ Sbjct: 225 LPSLRETNELGCACAPYRSIATWYLYRVTSLP 256 >UniRef50_A5KJK2 Putative uncharacterized protein n=3 Tax=Clostridiales RepID=A5KJK2_9FIRM Length = 274 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 41/241 (17%), Positives = 86/241 (35%), Gaps = 9/241 (3%) Query: 36 ARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGAL 95 + + + + + + + + + + + L Sbjct: 30 GKVYELTAGDRYLKIEVAEDQKKVRFYCTQEEFEQFWKTYFDLDTCYADYLSLIKDDEYL 89 Query: 96 GRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE--YI 153 + G+R+ D +E V IL Q ++ + R+++ YG++ + Y Sbjct: 90 KKAAGFGSGIRIL-QQDLWEMIVTFILSQQSNIPRIKSMIQRISERYGDKKETGEGHVYY 148 Query: 154 CFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQT 211 FP ++L A + L+AL + R++ L + A EGT+ + ++A K L Sbjct: 149 AFPRAEQLVQASEEELRALKL-GYRSKYLCNTAEMVAEGTVNLEKIKEMSYDEAKKELLK 207 Query: 212 FPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHI 271 GIG A+ L D F D I++ + + E++K +I Sbjct: 208 LSGIGSKVADCICLFALHKLDAFPV-DTHIQKVIDTVYAGKFP--FEKYKGCAGVMQQYI 264 Query: 272 W 272 + Sbjct: 265 F 265 >UniRef50_B2B817 Predicted CDS Pa_2_12990 n=8 Tax=Leotiomyceta RepID=B2B817_PODAN Length = 428 Score = 168 bits (427), Expect = 1e-40, Method: Composition-based stats. Identities = 49/219 (22%), Positives = 82/219 (37%), Gaps = 30/219 (13%) Query: 81 LFDLQCNPQIVNGALGRLGAAR-----PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLT 135 L + + V+ + L L +D FE I+ Q VS A A + Sbjct: 210 LEEACAHLIKVDPRMKPLIEKHHCHIFSPEGLSEKIDPFESLASGIISQQVSGAAAKAIK 269 Query: 136 ARVAQLY---GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG 192 R L+ + E FPTP + + L+ G+ ++AE L+ LA + G Sbjct: 270 NRFISLFYPGNDTTTTTHEKKKFPTPADVIGKSIETLRTAGLSQRKAEYLLGLAQKFVSG 329 Query: 193 TLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF----- 245 L M E+ ++ L G+GRW+ FA G + DVF D +++ Sbjct: 330 ELTAQMLADAPYEEVLEKLIAVRGLGRWSVEMFACFGLKRMDVFSTGDLGVQRGMAAFVG 389 Query: 246 ---------------PGMTPAQIRRYAERWKPWRSYALL 269 M+ ++ AE ++P+RS + Sbjct: 390 RDVGKLKAKGGGNKWKYMSEREMEEIAEGFRPYRSLFMW 428 >UniRef50_Q2SX77 DNA-3-methyladenine glycosylase n=60 Tax=Betaproteobacteria RepID=Q2SX77_BURTA Length = 312 Score = 168 bits (425), Expect = 2e-40, Method: Composition-based stats. Identities = 47/225 (20%), Positives = 86/225 (38%), Gaps = 20/225 (8%) Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 + ++ AE +A+ + + L +L L D+F R++ Sbjct: 94 EAAVPVQLSDAETVARPPYWDKACADLVKRDRILKKLIPKFGPAHLVKRGDSFVTLARSV 153 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEA 181 +GQ +SVA A + ++ + P ++ + L A G+ +++E Sbjct: 154 VGQQISVAAAQSVWVKIETACPKLA-----------PPQIIKLGQEKLIACGLSKRKSEY 202 Query: 182 LIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 ++ LA + G L + D E + L GIGRWTA F + DV DD Sbjct: 203 ILDLAQHFVSGALHVDKWASMDDEDVIAELTQIRGIGRWTAEMFLIFNLSRPDVLPLDDL 262 Query: 240 LIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGW 277 + + ++ R A W+PWR+ A ++W + Sbjct: 263 GLIRAISVNYFSGEPVTRSEAREVAANWEPWRTVATWYMWRSLDP 307 >UniRef50_B1ZV80 Transcriptional regulator, AraC family n=2 Tax=Opitutaceae RepID=B1ZV80_OPITP Length = 523 Score = 167 bits (424), Expect = 3e-40, Method: Composition-based stats. Identities = 66/308 (21%), Positives = 109/308 (35%), Gaps = 40/308 (12%) Query: 10 YDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHI-------- 61 Y ++ L+ A S E + Y ++ + ++T + + + + Sbjct: 217 YLLGYLRRALSRDAQSVSERLEGDVYTAAVNLSGGPALLTLRLNPSPVRVEVIPAAGGTG 276 Query: 62 ----------------NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGA-----LGRLGA 100 ++++G+ E A L L+ + L RL A Sbjct: 277 VPPVGSRPPTKGERVPDVASGVPTNTLEAHAIAIGLLGLEDDASSFARLARKLGLARLVA 336 Query: 101 ARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQR 160 R LR+ F+ V AI+GQ ++ + A L R+ +L G RL + + PTP Sbjct: 337 GRSELRISRIPSVFDGLVWAIIGQQINFSFACVLKRRLTELAGTRLSNG--LMAPPTPTA 394 Query: 161 LAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRW 218 +A +P L L ++A LI A A G L + +A +TL G G W Sbjct: 395 IARLEPDELVPLQFSRQKAGYLITTARAITAGELDLAQLPSMSASRAERTLLALHGFGPW 454 Query: 219 TANYFALRGWQAKDVFLPDDYLIKQRF-------PGMTPAQIRRYAERWKPWRSYALLHI 271 + NY +R D D + RR + P RS A H+ Sbjct: 455 SVNYVMMRALGFADCVPLGDTGVTSGLQSLLHLEQRPDVDATRRLMAVFSPHRSLATAHL 514 Query: 272 WYTEGWQP 279 W +P Sbjct: 515 WQLNLPRP 522 >UniRef50_A6TLI8 8-oxoguanine DNA glycosylase domain protein n=9 Tax=Clostridiaceae RepID=A6TLI8_ALKMQ Length = 296 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 45/256 (17%), Positives = 88/256 (34%), Gaps = 18/256 (7%) Query: 28 ETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCN 87 E D Y + Y V+ L+ E + E + +Q + Sbjct: 35 EREEDGSYTGVV----YDKVLNVKKIGNDVILYPTNQLDFENIWIEYFDLHTDYQMIQEH 90 Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 Q ++ + + G+R+ D +E + I+ ++ + +++ YG+ ++ Sbjct: 91 LQAIDPVMKKAIGFGRGIRILRQ-DPWETIISFIISANNNIPRIKRAIDLMSRGYGQPVE 149 Query: 148 D--FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVE 203 D FP L+ + L A RA ++ A + D E Sbjct: 150 DFRGGANYTFPDAATLSKRTVEELLACN-TGYRAPYILKTAQQVSTANIEFQNLKKLDRE 208 Query: 204 QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYA 257 + L TF GIG AN D F D +K+ + +I+ +A Sbjct: 209 SCQRQLMTFNGIGPKVANCVLFFSMGKFDAFPV-DVWVKRVMEALYFEQKTSHEKIQAFA 267 Query: 258 ER-WKPWRSYALLHIW 272 E+ + + YA +++ Sbjct: 268 EKSFGEYAGYAQQYLF 283 >UniRef50_D1RHI7 HhH-GPD family base excision repair protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1RHI7_LEGLO Length = 263 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 53/244 (21%), Positives = 91/244 (37%), Gaps = 17/244 (6%) Query: 45 RGVVTAIPDIARHTLHINLSAGLEP-------VAAECLAKMSRLFDLQCNPQIVNGALGR 97 + VV + L ++L+ + P E + ++R + L Sbjct: 9 KVVVEQTNSLNNPELLVSLNEPVHPLVQEKIKHLIEMMLGLNRDLTGFYKMAKDDVRLDP 68 Query: 98 LGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL-DDFPEYICFP 156 L G++ P FE + AI Q +S+ + R+ Q G ++ + Y FP Sbjct: 69 LVFQFMGVKPPCFPSFFEALINAISCQQISLDAGLHIQNRLVQHIGMKMNHENQVYYAFP 128 Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT--LPMTIPGDVEQAMKTLQTFPG 214 T + + LK +G ++E ++ LA+ E L E+ ++ L F G Sbjct: 129 TAEDVGHCSVAELKKIGYSTHKSETIVSLASMLKEEHSFLNRLEDKPTEEVIQLLCQFKG 188 Query: 215 IGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERWKPWRSYA 267 IGRWTA Y LRG +VF DD + + + + W P+ Sbjct: 189 IGRWTAEYVLLRGLGRIEVFPGDDIGAQNNLQKLLHLEDKLDYKKTSKITALWHPYAGLV 248 Query: 268 LLHI 271 H+ Sbjct: 249 YFHL 252 >UniRef50_C7MAP3 Adenosine deaminase n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MAP3_BRAFD Length = 515 Score = 167 bits (423), Expect = 4e-40, Method: Composition-based stats. Identities = 77/292 (26%), Positives = 113/292 (38%), Gaps = 21/292 (7%) Query: 3 TLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPD-IARHTLHI 61 L + P+D + + + A RAV VE V + R++ + GV+ A H L + Sbjct: 205 QLAVRRPFDGAGLAAWFAHRAVPGVEEVDGLRWTRAVHLPHGPGVLQVDLGGPAPHPLPL 264 Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR-------LGAARPGLRLPGCVDAF 114 L A ++ RL DL +P ++ L R L AARPG+RLPG Sbjct: 265 TLRLADLRDHAVAVSLTRRLLDLDADPVGIDDGLRRTLPALAPLLAARPGVRLPGTPTLA 324 Query: 115 EQGVRAILGQLVSVAMAAKLTAR----VAQLYGERLDDFPEYICFPTPQRLAAADPQALK 170 E + A+ GQ ++ A A R +A E L P AA + Sbjct: 325 EALLWAVTGQQITTAQARDQITRATDLLATALPEALRTGSVERLPVLPANAAARAEDWFR 384 Query: 171 ALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 P R+ L A LP +++ + G+G WTA+Y LRG +A Sbjct: 385 G---PRARSRTLQEAVPAIAADDLP--ARWPLDELRSRVLALRGVGPWTADYVLLRGLRA 439 Query: 231 KDVFLPDDYLIKQRFP----GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D D + ++R E PWRSYA LH+W + Sbjct: 440 IDAAPAGDAALLGAARDLGLAEDHTALQRVLEAASPWRSYAALHLWQHQATP 491 >UniRef50_C1A5A1 DNA-3-methyladenine glycosylase n=1 Tax=Gemmatimonas aurantiaca T-27 RepID=C1A5A1_GEMAT Length = 243 Score = 166 bits (420), Expect = 9e-40, Method: Composition-based stats. Identities = 48/209 (22%), Positives = 80/209 (38%), Gaps = 14/209 (6%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDA--FEQGVRAILGQLVSVAMAAKLTA 136 RL + LG AA L + F R I+ Q +S + A + Sbjct: 31 RRLTQAIAELSERDTRLGAAIAAVGPCTLLPRTEGTHFGHLARNIVYQQLSGSAATTIHG 90 Query: 137 RVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM 196 R + L E+ PTP+ + D AL+ G+ + + A+ LA ++G LP+ Sbjct: 91 RFLKHVSAHLGVETEH---PTPESVLGIDDDALRGCGLSVAKVRAIKDLAQHVIDGRLPL 147 Query: 197 TIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT----- 249 ++ + L GIG WTA F + DV D +++ + Sbjct: 148 DRLDVMSDQEIIDALVPVRGIGPWTAQMFLMFRLGRPDVLPVLDLGVRKGAQRIYRTRAL 207 Query: 250 --PAQIRRYAERWKPWRSYALLHIWYTEG 276 A++ + A+ W+PW S A + W Sbjct: 208 PDAARLEKIAKTWRPWASVASWYCWRVLD 236 >UniRef50_C7DHT2 3-Methyladenine DNA glycosylase n=1 Tax=Candidatus Micrarchaeum acidiphilum ARMAN-2 RepID=C7DHT2_9EURY Length = 303 Score = 165 bits (419), Expect = 1e-39, Method: Composition-based stats. Identities = 53/273 (19%), Positives = 98/273 (35%), Gaps = 20/273 (7%) Query: 16 LGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECL 75 LGF A +R+L ++ + I +++ A + Sbjct: 20 LGFTIRSAQPLTFYADYDRVSRTLVYPSDGKIINLRELESGRGRRIGIASKSIDYAVYDV 79 Query: 76 AKMSRLFDLQCNPQIV---NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAA 132 + RL D + + + L G+R+ D +E + IL Q ++ Sbjct: 80 KRRFRLGDRLSSIYGAISTDATMEGLIQNFSGMRITL-NDPWETTMCYILSQYNNIPRIR 138 Query: 133 KLTARVAQLYGERL---DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 +T R+ +G + D FP +AAA +++ G RA+ L+ A+ Sbjct: 139 GITKRMIARFGSDIFGDHDSVVGKAFPKSHEIAAASEKSIVECG-AGFRAKYLVEAADYC 197 Query: 190 LEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG 247 + M G D + L G+G A+ AL G+ + F D IK+ Sbjct: 198 TNN-IDMARLGKLDYPELKDELLQIKGVGDKVADCIALFGYGKLEAFPV-DVWIKRIVER 255 Query: 248 MT-------PAQIRRYAE-RWKPWRSYALLHIW 272 + +I R+AE +W + A +++ Sbjct: 256 LYFRGRKKSIKEIHRFAEDKWGRYAGVAQQYLF 288 >UniRef50_Q01SY7 DNA-3-methyladenine glycosylase II n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01SY7_SOLUE Length = 200 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 45/204 (22%), Positives = 80/204 (39%), Gaps = 19/204 (9%) Query: 86 CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + + + L + + FE VR+I+ Q +S +A + R+ G Sbjct: 6 QHLRKSDPVLSAIIERVGAYGIQFREPDFETLVRSIVYQQLSGRVAKVILDRLVAAVGRE 65 Query: 146 LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVE 203 + TP+++ A P ++ LG+ ++ + LA +G L E Sbjct: 66 V----------TPEKILALRPGRMRKLGLSTQKTAYIRDLARHTRDGRLVFTELPALTDE 115 Query: 204 QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQIRRY 256 + ++ L GIG WTA F + + DV D ++ TPA++ Sbjct: 116 EVIERLTQVKGIGVWTAQMFLMFALRRHDVLPTGDLGVRNAIRKAYDLAELPTPAEMEEL 175 Query: 257 AERWKPWRSYALLHIWYTEGWQPD 280 A W+PW S A ++W + Q D Sbjct: 176 ARNWRPWCSVASWYLWRSLEGQAD 199 >UniRef50_B4CYJ1 DNA-3-methyladenine glycosylase II n=1 Tax=Chthoniobacter flavus Ellin428 RepID=B4CYJ1_9BACT Length = 214 Score = 164 bits (416), Expect = 2e-39, Method: Composition-based stats. Identities = 49/193 (25%), Positives = 79/193 (40%), Gaps = 20/193 (10%) Query: 93 GALGRLGAARPG--LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + RL L+ F VRA+ Q ++ A + R L+ + Sbjct: 14 KVMRRLIRTHGPCTLQPEKDHSPFRALVRAVAHQQLNGTAAETILRRFCALFPGK----- 68 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP---GDVEQAMK 207 FPT + LA+ +AL+ G + AL +A L+GT+P T + + ++ Sbjct: 69 ---KFPTAKDLASVTDEALRGSGFSWAKIAALRDIAAKTLDGTIPSTRAIQKMNDAEIIE 125 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERW 260 L G+GRWT + DVF DD+ I+ F P +I +AERW Sbjct: 126 RLVQVRGVGRWTVEMMLIFKLGRPDVFPADDFGIRDGFRVAYGLDEMPKPKEILAHAERW 185 Query: 261 KPWRSYALLHIWY 273 +P+ + A + W Sbjct: 186 RPYATTAAWYFWR 198 >UniRef50_A5KST9 DNA-3-methyladenine glycosylase II n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KST9_9BACT Length = 239 Score = 164 bits (415), Expect = 3e-39, Method: Composition-based stats. Identities = 57/213 (26%), Positives = 92/213 (43%), Gaps = 21/213 (9%) Query: 76 AKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLT 135 A +++ + I + LG L AA+ L D F VR+I+ Q VSVA + + Sbjct: 35 ALATQIAAAEVTLAIQDTKLGALIAAQAPLNRLRKGDYFANLVRSIISQQVSVAASRAIL 94 Query: 136 ARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE--GT 193 ARV G P+R+ A +P+ L+ALG+ +A + LA + G Sbjct: 95 ARVQAATG------------LEPKRILALNPEELRALGLSRPKAGYISDLAEHFVREPGI 142 Query: 194 LPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP--- 250 ++ + L GIG WTA F + D+F PDD +++ + Sbjct: 143 FDHLERLADDEVITELTRIKGIGAWTAQMFLMFTLGRLDIFAPDDVGLQRAITRLYGLKE 202 Query: 251 ----AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 Q+ AE W+P+R+ A H+W + +P Sbjct: 203 VPSRTQLEALAEAWRPYRTVASWHLWESLTHEP 235 >UniRef50_A9BVD9 HhH-GPD family protein n=1 Tax=Delftia acidovorans SPH-1 RepID=A9BVD9_DELAS Length = 328 Score = 163 bits (414), Expect = 4e-39, Method: Composition-based stats. Identities = 62/233 (26%), Positives = 92/233 (39%), Gaps = 21/233 (9%) Query: 65 AGLEPVAAECLAKMSRLFDLQCNPQIV---NGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 AG++ V A L +M L + + LG L A + GL +P +E A+ Sbjct: 92 AGMDDVLAAMLRRMFGLSQDVGEFERRFGRHARLGPLLARQRGLHVPAACTPWEALSWAV 151 Query: 122 LGQLVSVAMAAKLTARVAQLYGERL-------DDFPEYICFPTPQRLAAADPQALKALGM 174 GQ +SVA A L R+ G+ + D + C P +LA + L+A G Sbjct: 152 TGQQISVAAAVSLRRRLIAAAGQPVALHDGHADAPQQLWCMPEAAQLAQLGEEDLRAAGF 211 Query: 175 PLKRAEALIHLANAALEGTLPMT-----IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + L LA A G LP+ V + + L GIG WT NY LRG+ Sbjct: 212 SRSKTHTLRLLAQAVQSGELPLDDWAALPELPVAEIRERLLALKGIGPWTVNYMLLRGYG 271 Query: 230 AKDVFLPDDYLIKQRFP------GMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 D L D +++ M Q + + PWR+ H+W + Sbjct: 272 HLDGPLHGDVAVRRALALLLKTDAMDAVQTELWLRDFAPWRALVAAHLWASLS 324 >UniRef50_B7K9B1 HhH-GPD family protein n=3 Tax=Cyanobacteria RepID=B7K9B1_CYAP7 Length = 210 Score = 163 bits (412), Expect = 8e-39, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 73/204 (35%), Gaps = 20/204 (9%) Query: 89 QIVNGALGRLGAARPGLRL---PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 Q + LG + +L P E AI+ Q +S +A K+ R Y + Sbjct: 11 QEADSILGEIIVQIGECKLGKTPSNSSLLEALAWAIISQQISTKVANKIYQRFLNFYND- 69 Query: 146 LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVE 203 T + L + L++LG+ + L +LA A + P+ + Sbjct: 70 -------ATPLTAKNLLNTPEEDLRSLGISRNKIRYLKNLAKAVEDNLPPLYQLELMEDW 122 Query: 204 QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRY 256 + + L G+G WTA + D+ D I+ + P + Sbjct: 123 EIIHLLTQIKGVGIWTAQMLLIFRLNRLDILPSADLGIRTAIKNLYQLPELPSPEIVEAI 182 Query: 257 AERWKPWRSYALLHIWYTEGWQPD 280 +WKP+R+ A ++W + D Sbjct: 183 GYKWKPYRTIASWYLWRSLSDSID 206 >UniRef50_UPI00016C4C1A DNA-3-methyladenine glycosylase II n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4C1A Length = 227 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 49/204 (24%), Positives = 74/204 (36%), Gaps = 20/204 (9%) Query: 93 GALGRLGAA-RPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L P L +P D F VR ++GQ +S A + R+A+ + Sbjct: 18 PVMNGLIGRVGPCLLMPRGEDPFTLLVRCVIGQQISTKAAESIYNRLARAVNPPPEGPHP 77 Query: 152 YICFPTP----------QRLAAADPQALKALGMPLKRAEALIHLANAALEGT--LPMTIP 199 +LAA K G+ + L + A LP Sbjct: 78 ADGTSLAMWQREGIMPMDKLAALSEAEFKECGVSGPKQRTLRAVVEHARANPDLLPSIAG 137 Query: 200 GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQ 252 D + + L GIG WT + + L G DV DY IK PA+ Sbjct: 138 LDDDTIRERLTVIKGIGPWTVDMYLLFGLGRPDVLSVGDYGIKVAVKNLFRLRKLPDPAK 197 Query: 253 IRRYAERWKPWRSYALLHIWYTEG 276 + R A+ W+P+RS AL ++W + Sbjct: 198 LTRVAKPWQPYRSVALWYLWRSLD 221 >UniRef50_B8IZY6 HhH-GPD family protein n=8 Tax=Bacteria RepID=B8IZY6_DESDA Length = 235 Score = 162 bits (411), Expect = 1e-38, Method: Composition-based stats. Identities = 43/194 (22%), Positives = 79/194 (40%), Gaps = 19/194 (9%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + L +R D F + +I+GQ +S A + R+ + + Sbjct: 20 RDPVLAAAMEEIGHIRREVTPDIFNALLNSIVGQQISTKAQATIWKRMREQF-------- 71 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKT 208 C TP+ + ++L+ G+ +++A + + A L+G+L + ++ Sbjct: 72 ---CPITPENIGTISAESLQTCGISMRKAAYIKSITEAVLDGSLDLARLPSLTDKEICAQ 128 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKP 262 L GIG WTA + Q D+ DD I++ +TPA RY +R+ P Sbjct: 129 LVQLKGIGVWTAEMIMIFSMQRPDILSWDDLAIQRGLRMLYRHRQITPALFARYRKRYSP 188 Query: 263 WRSYALLHIWYTEG 276 + A L++W G Sbjct: 189 HATTASLYLWAIAG 202 >UniRef50_C7PK12 HhH-GPD family protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PK12_CHIPD Length = 206 Score = 161 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 38/175 (21%), Positives = 76/175 (43%), Gaps = 21/175 (12%) Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 + +I+ Q +SV +A + R LY + P Q++ P+ L+++G+ Sbjct: 39 LIGSIMSQQLSVKVATVIYTRFLALYDGKE---------PNAQQILDTPPETLRSIGLSN 89 Query: 177 KRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVF 234 + + ++A +E L D E+ +K L G+GRWT + +DVF Sbjct: 90 AKVSYVHNVARFTVEEKLTDKKLLQMDDEEVIKYLTQIKGVGRWTVEMLLMFYLCREDVF 149 Query: 235 LPDDYLIKQRF----------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 DD ++Q ++ + +++W P+R+YA ++W + +P Sbjct: 150 AIDDLGLQQAMIKLYKLDNTDKKAFREKLLKISKKWSPYRTYASRYLWAWKDMKP 204 >UniRef50_Q92383 DNA-3-methyladenine glycosylase 1 n=1 Tax=Schizosaccharomyces pombe RepID=MAG1_SCHPO Length = 228 Score = 161 bits (409), Expect = 2e-38, Method: Composition-based stats. Identities = 43/219 (19%), Positives = 83/219 (37%), Gaps = 21/219 (9%) Query: 70 VAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVD---AFEQGVRAILGQLV 126 E ++ L + + ++ RL R ++ +E+ +RA+ Q + Sbjct: 4 DIEEKEEIVTSLTKAEIHLSGLDENWKRLVKLVGNYRPNRSMEKKEPYEELIRAVASQQL 63 Query: 127 SVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLA 186 A + R FPTP+ + D + ++A G ++ ++L +A Sbjct: 64 HSKAANAIFNRFKS--------ISNNGQFPTPEEIRDMDFEIMRACGFSARKIDSLKSIA 115 Query: 187 NAALEGTLP---MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQ 243 A + G +P E+ ++ L GIGRWT + DV DD I+ Sbjct: 116 EATISGLIPTKEEAERLSNEELIERLTQIKGIGRWTVEMLLIFSLNRDDVMPADDLSIRN 175 Query: 244 RFPG-------MTPAQIRRYAERWKPWRSYALLHIWYTE 275 + T + +++E P+R+ A ++W T Sbjct: 176 GYRYLHRLPKIPTKMYVLKHSEICAPFRTAAAWYLWKTS 214 >UniRef50_B8GAB8 DNA-3-methyladenine glycosylase II n=3 Tax=Chloroflexus RepID=B8GAB8_CHLAD Length = 199 Score = 161 bits (408), Expect = 2e-38, Method: Composition-based stats. Identities = 50/193 (25%), Positives = 83/193 (43%), Gaps = 20/193 (10%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 V+ L L +F AI+ Q +S+ A + R+ L GE Sbjct: 11 VDPVLAPWIDQIGSFALQRQPHSFATLAYAIISQQLSLNAARAIRDRLTTLLGEL----- 65 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKT 208 TP+++ AAD AL+A G+ ++++ L LA + G + + + D E A+ Sbjct: 66 ------TPEQILAADTTALRAAGLSMQKSGYLRDLAERIVYGQINLELLPTLDDETAIAM 119 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWK 261 L GIGRWTA + + D+ DD ++ + P ++R ERW+ Sbjct: 120 LTNVRGIGRWTAEIYLMFALNRLDILPADDLGLRDGARLVYQLPQILSPRELRALGERWR 179 Query: 262 PWRSYALLHIWYT 274 P+RS A ++W Sbjct: 180 PYRSIACWYLWQI 192 >UniRef50_A9EU33 Methylated-DNA--protein-cysteine methyltransferase n=31 Tax=Bacteria RepID=A9EU33_SORC5 Length = 395 Score = 160 bits (406), Expect = 4e-38, Method: Composition-based stats. Identities = 62/256 (24%), Positives = 97/256 (37%), Gaps = 22/256 (8%) Query: 36 ARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQC---NPQIVN 92 R LA G G +A + + + G A E FD + + + Sbjct: 142 HRVLAAGGKAGGFSANGGVTTKLRLLAIEGGQARGAPEAAPGGDLGFDPGAAVEHLRASD 201 Query: 93 GALGRLGAARPG--LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 AL R+ A +R+ F +I+ Q ++ AA + ARV L+ + Sbjct: 202 AALARVIDAVGPFAMRIDRTSSLFLALAESIVYQQLTGKAAATIFARVRALFPRAHEG-- 259 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI---PGDVEQAMK 207 PTP +L A + L+ G+ + AL LA +G LP + E ++ Sbjct: 260 -----PTPAQLLRASDEKLRGAGLSQAKLLALRDLARKTEDGELPTLAEVHGMEDEAIIE 314 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP-------AQIRRYAERW 260 L GIGRWT + DV DDY I++ F A + + RW Sbjct: 315 RLTRVRGIGRWTVEMLLMFRLGRPDVLPVDDYGIRKGFALAFKRPEPPARADLEKRGARW 374 Query: 261 KPWRSYALLHIWYTEG 276 KP+R+ A ++W Sbjct: 375 KPYRTVASWYLWRAVD 390 >UniRef50_Q3INX6 DNA N-glycosylase / DNA lyase n=6 Tax=Halobacteriaceae RepID=Q3INX6_NATPD Length = 203 Score = 160 bits (405), Expect = 5e-38, Method: Composition-based stats. Identities = 51/194 (26%), Positives = 78/194 (40%), Gaps = 19/194 (9%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + L L L + D F + + ++L Q VS+A A R+ Sbjct: 6 DTLRADPVLEPLIERHGALTIEPADDLFRRLLVSVLRQQVSMASAEATKKRLFDA----- 60 Query: 147 DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQ 204 PTP + AAD + + G+ ++A L ++A A + P D E Sbjct: 61 -------VEPTPTAVLAADTETFREAGLSRQKATYLHNIAAAFEDHGYDRAYFEPMDDEA 113 Query: 205 AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-----MTPAQIRRYAER 259 L G+G WTAN L +DVF D I++ + A + AER Sbjct: 114 VRAELTDITGVGEWTANMQLLFSLGREDVFPVGDLGIRKGMRALLDEDLDRAAMTEAAER 173 Query: 260 WKPWRSYALLHIWY 273 W P+RSYA L++W Sbjct: 174 WAPYRSYASLYLWR 187 >UniRef50_Q04UT1 DNA-3-methyladenine glycosylase II n=4 Tax=Leptospira RepID=Q04UT1_LEPBJ Length = 228 Score = 159 bits (403), Expect = 8e-38, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 84/223 (37%), Gaps = 16/223 (7%) Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 L + + RL + + +L + +L + ++ ++++ Sbjct: 8 RLPNPKNRSSFVLEDRKIRLQKASDWLRKKDPITKKLVDSIGPCKLQTIGNPYQVLIKSV 67 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEA 181 LGQ +SV +A R+ L G + P P R+ + LK +G+ + E Sbjct: 68 LGQQLSVKVALTFERRLISLAGSKKI--------PPPDRILMIPNEELKKIGVSQAKIET 119 Query: 182 LIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + +A A L + + +E + L +F G+G WTA + D F +D Sbjct: 120 IQRIAEAYLNRDITDSKLRKLEDSDVLNLLCSFKGVGPWTAEMVLIFALDRWDHFSINDL 179 Query: 240 LIKQRFPGMT------PAQIRRYAERWKPWRSYALLHIWYTEG 276 ++++ +I+ + + P+R+ ++W Sbjct: 180 ILRKSVEKHYGISKDNKKEIQHFLMSYSPFRTILSWYLWADMD 222 >UniRef50_D2QEN8 HhH-GPD family protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QEN8_9SPHI Length = 207 Score = 159 bits (403), Expect = 9e-38, Method: Composition-based stats. Identities = 43/206 (20%), Positives = 82/206 (39%), Gaps = 25/206 (12%) Query: 88 PQIVNGALGRLGAARPGLRLPG--CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + + R+ A P +L D + + +I+ Q +SV A + +R L+ ++ Sbjct: 11 HLAQDPVMARIIAETPVPKLVNDYADDVYLALLESIVSQQISVKAADAIFSRFRALFPDK 70 Query: 146 LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVE 203 +P L L++ G+ ++ + L +A +LE + E Sbjct: 71 ---------YPQADALLLKTTDELRSAGLSFQKIKYLQSVAEFSLEKPIDRVHLDALTDE 121 Query: 204 QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ----------- 252 + ++ L G+GRWT + D+F DD +I+QR P Q Sbjct: 122 EIVQYLLPIKGVGRWTVEMLLMFVLDRPDIFPIDDLVIRQRMLRAYPEQTNGLTGKALYK 181 Query: 253 -IRRYAERWKPWRSYALLHIWYTEGW 277 + AE W+P+R+ A ++W + Sbjct: 182 VLLSIAEPWRPYRTTASRYLWRWQAT 207 >UniRef50_B4B851 DNA-3-methyladenine glycosylase II n=2 Tax=Cyanobacteria RepID=B4B851_9CHRO Length = 215 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 75/220 (34%), Gaps = 20/220 (9%) Query: 74 CLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVD---AFEQGVRAILGQLVSVAM 130 + Q + + ++ + +L E AI+ Q +S + Sbjct: 1 MTKSLINYDQALYYLQEADIIMAQIISEIGDYQLAEFKSNSSLLEALAWAIMAQQISTEV 60 Query: 131 AAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAAL 190 A K+ R LY E + L + L+++G+ + L +LA A Sbjct: 61 ANKIYQRFLSLYNESTPL--------NARNLLQTSDEDLRSIGISRYKIGYLKNLARAVE 112 Query: 191 EG--TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 E L + E +K L GIG WT + Q D+ D I+ + Sbjct: 113 EYLPPLSELATMEDETIIKLLTQIKGIGTWTVQMLLIFRLQRLDILPSGDLGIRMAIKNL 172 Query: 249 T-------PAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 P + +WKP+R+ A ++W + ++ Sbjct: 173 YQLPELPSPEIVEAIGHKWKPYRTIAAWYLWRSLSDTIEK 212 >UniRef50_A5WCQ9 HhH-GPD family protein n=2 Tax=Psychrobacter RepID=A5WCQ9_PSYWF Length = 231 Score = 159 bits (402), Expect = 1e-37, Method: Composition-based stats. Identities = 48/226 (21%), Positives = 80/226 (35%), Gaps = 21/226 (9%) Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 L + + + +S L + + L F + +RA++ Sbjct: 15 LPSEPACNLIQTMNDLSELEGHIKQLIDIEPRFAPIYQQLGVPSLRRNRGGFRELMRAMV 74 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 GQ +SVA A+ + +++ TP + AD L++ G+ ++ + Sbjct: 75 GQQLSVAAASSIWSKLENA------------ALITPDAIMKADDDTLRSHGLSRQKIRYI 122 Query: 183 IHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK 242 L + E + L GIG+WTA + L D+ DD IK Sbjct: 123 RSLVEH--DIDFEALAHLPDEAVISELTAVTGIGKWTAQMYLLFSLGRADILAVDDLAIK 180 Query: 243 QRF-------PGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 TP Q+ R + W P RS A L +W GW D+ Sbjct: 181 VGAMEVLGLDERPTPKQLERLTQSWSPHRSAASLLLWAHYGWLKDQ 226 >UniRef50_Q0BSG3 DNA-3-methyladenine glycosylase II n=12 Tax=Proteobacteria RepID=Q0BSG3_GRABC Length = 255 Score = 158 bits (401), Expect = 1e-37, Method: Composition-based stats. Identities = 50/209 (23%), Positives = 77/209 (36%), Gaps = 19/209 (9%) Query: 83 DLQCNPQIVNGALGRLGAARPGLRLP--GCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + + + AL L RL FE +RAI Q + A + AR Sbjct: 32 EACAHLARQDKALSALITRVGPPRLTISLEQSPFEALIRAIAHQQLHARAAEAILARFLA 91 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP- 199 L+ P FP+P + A D + L+ G + AL + AA G +P Sbjct: 92 LF-------PVNTDFPSPLEIMALDTETLRQCGFSGTKIIALRGVCEAAQGGIIPDRSGC 144 Query: 200 --GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTP 250 D E ++ L T GIGRWT + D+ DD+ +++ + P Sbjct: 145 TALDDETLIQRLTTLRGIGRWTVEMLMIFTLGRTDILPVDDFGVREGWRLIKGLESQPRP 204 Query: 251 AQIRRYAERWKPWRSYALLHIWYTEGWQP 279 + + W PWRS A ++W Sbjct: 205 KILADIGQSWSPWRSLAAWYLWRAADEAK 233 >UniRef50_C1XHZ0 DNA-3-methyladenine glycosylase II n=2 Tax=Meiothermus RepID=C1XHZ0_MEIRU Length = 178 Score = 158 bits (401), Expect = 1e-37, Method: Composition-based stats. Identities = 40/168 (23%), Positives = 71/168 (42%), Gaps = 16/168 (9%) Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 +E + +I+GQ +S A + R++ + P+ L A + L+A+ Sbjct: 23 PYEVLLSSIVGQQLSGKAADTIWRRLSSRFA------------LEPEVLYRAALEDLRAV 70 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 G+ +A + L+ ALEG L E + L GIG WT F + G D Sbjct: 71 GLSSAKARYVQDLSRFALEGGLQGLEHHSDEALIAHLTQVKGIGVWTVQMFLMFGLGRPD 130 Query: 233 VFLPDDYLIKQRFPGMTP----AQIRRYAERWKPWRSYALLHIWYTEG 276 V+ D I++ + ++ ER++P+RS+A ++W Sbjct: 131 VWPVLDLGIRKGAQKLYGVIERDELEALGERFRPYRSHAAWYLWRALE 178 >UniRef50_A9T041 Predicted protein n=1 Tax=Physcomitrella patens subsp. patens RepID=A9T041_PHYPA Length = 178 Score = 158 bits (401), Expect = 1e-37, Method: Composition-based stats. Identities = 41/173 (23%), Positives = 73/173 (42%), Gaps = 17/173 (9%) Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALG 173 F R+I+ Q +S A + R+ + G + TP +AA + L+A+G Sbjct: 9 FTALARSIVYQQISGKAACAIYCRLISICGG--------LESVTPPVIAALTVEELRAVG 60 Query: 174 MPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + ++ L LA G L + + +K L GIG W+A+ F + + Sbjct: 61 ISGRKGLYLHDLAEKFTSGLLSEAKLIIMNEDDLVKALTAVKGIGVWSAHMFMIFYLRKP 120 Query: 232 DVFLPDDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGW 277 DV D I++ F + PA+++ A W+P+R+ A ++W Sbjct: 121 DVLPVGDLAIRKAFQKLYHLNQLPSPAEMQELAFPWRPYRTLASWYLWRMTDN 173 >UniRef50_B2W1R2 DNA-3-methyladenine glycosylase n=1 Tax=Pyrenophora tritici-repentis Pt-1C-BFP RepID=B2W1R2_PYRTR Length = 439 Score = 158 bits (401), Expect = 2e-37, Method: Composition-based stats. Identities = 56/287 (19%), Positives = 89/287 (31%), Gaps = 70/287 (24%) Query: 52 PDIARHTLHINLSAGLEPVAAECLAKMSRLF-DLQCNPQIVNGALGRLGAARP------- 103 P A+ L + RL D + V+ L L Sbjct: 152 PSPAKKRKAKELVPPDVGAIPNASTDVERLLKDAEEFLVKVDPKLEELVKKHHCKIFSPE 211 Query: 104 GLRLPGCVDAFEQGVRAILGQLV----------------------------------SVA 129 GLR VD F I+GQ V S Sbjct: 212 GLREV--VDPFTALSSGIIGQQVLDEWHGLKQRAHWKASPSNPVHQNCKNNTSPPQVSGQ 269 Query: 130 MAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 A+ + A+ L+ FPTP ++ L+ G+ ++AE + LA Sbjct: 270 AASSIRAKFTALFPTTHP------AFPTPTQVLQLPIPTLRTAGLSQRKAEYITGLAEKF 323 Query: 190 LEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF-- 245 G L M + E+ ++ L G+GRW+ FA G + DVF D +++ Sbjct: 324 CSGELTAQMLVSASDEELIEKLVAVRGLGRWSVEMFACFGLKRMDVFSTGDLGVQRGMAV 383 Query: 246 ----------------PGMTPAQIRRYAERWKPWRSYALLHIWYTEG 276 MT ++ A + P+RS + ++W Sbjct: 384 YAGRDVNKLKSKGGKWKYMTEREMLDTAANFSPYRSLFMWYMWRIAD 430 >UniRef50_C2AV46 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Tsukamurella paurometabola DSM 20162 RepID=C2AV46_TSUPA Length = 216 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 58/186 (31%), Positives = 84/186 (45%), Gaps = 8/186 (4%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L L AA PG+RL GCVD E +R ++GQ +S+A A AR+ + GE +DD Sbjct: 28 DPRLAPLVAATPGIRLFGCVDPAELLLRTMIGQQISIAAATTHQARLVEALGEPVDDPTG 87 Query: 152 --YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTL 209 FP+P +A + L P R A+ +A EG L + +A L Sbjct: 88 RVSRAFPSPAVVAERGHEVLTG---PRARVTAIRSVAVEIAEGRLTLHPGMTRAEARDVL 144 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALL 269 G+G WTA+Y A+R D L D ++ + + + W PW SY L Sbjct: 145 LRLSGVGPWTADYVAMRLLADPDTLLSSDLVVAKGAAALD---LDIATNHWSPWGSYVSL 201 Query: 270 HIWYTE 275 H+W Sbjct: 202 HLWNHS 207 >UniRef50_A6GQ39 3-methyladenine DNA glycosylase II n=1 Tax=Limnobacter sp. MED105 RepID=A6GQ39_9BURK Length = 217 Score = 158 bits (400), Expect = 2e-37, Method: Composition-based stats. Identities = 48/230 (20%), Positives = 85/230 (36%), Gaps = 29/230 (12%) Query: 58 TLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQG 117 +L +N E + + + L A L +E Sbjct: 4 SLRLNHDPDAPAYWQEACEAL----------ALQSPVWVELLARHSDRALRSRGAPYETM 53 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLK 177 +R+++GQ +SV A + ARV ++ T + L A LKA G+ + Sbjct: 54 LRSLVGQQISVKAADAVWARVVDALNGKI----------TSRALLALSDDTLKATGLSRQ 103 Query: 178 RAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 + L+ +G L + D E + L G+GRWTA F + + DV+ Sbjct: 104 KIAYSRALSEFEQQGGLELAVLEGMDDEACTRHLCAIKGVGRWTAQMFLMFCLRRPDVWP 163 Query: 236 PDDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 DD +++ P + ++ E+ KPWR+ A ++W + Sbjct: 164 VDDIGVQRGISRQFFEGEPIGPKEALQFGEKLKPWRTVAAWYLWRSLDPA 213 >UniRef50_Q5FSB3 DNA-3-methyladenine glycosylase n=1 Tax=Gluconobacter oxydans RepID=Q5FSB3_GLUOX Length = 219 Score = 157 bits (398), Expect = 3e-37, Method: Composition-based stats. Identities = 51/214 (23%), Positives = 83/214 (38%), Gaps = 16/214 (7%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGC--VDAFEQGVRAILGQLVSVAMAAKLTARV 138 + D + L + A L G + ++ +RAI GQ + A A K+ R+ Sbjct: 4 MADSATLFLGADPDLAAVIARIGPCTLRGDNGQEPYDALLRAIAGQQLHGAAARKIFGRL 63 Query: 139 AQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI 198 L + D P P P R+ + + L+A G+ + A+ LA A L+G +P Sbjct: 64 CLLGAQESVDGPP----PAPGRILSLSEERLRACGLSGNKILAMKGLAQARLDGLVPSRA 119 Query: 199 P---GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------M 248 E+ + L T GIGRWT + DV DD+ +++ + Sbjct: 120 EASVMTDEELIARLVTLRGIGRWTVEMLLMFTLNRPDVMPVDDFGVREGWRRIRKMDLPP 179 Query: 249 TPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 P ++ ER+ P RS + W A Sbjct: 180 KPKALKEETERFAPHRSTLAWYCWRVAEEGKKTA 213 >UniRef50_C6IXS6 DNA-3-methyladenine glycosidase n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6IXS6_9BACL Length = 228 Score = 156 bits (396), Expect = 5e-37, Method: Composition-based stats. Identities = 46/197 (23%), Positives = 80/197 (40%), Gaps = 20/197 (10%) Query: 89 QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD 148 + +GRL A L F + R+I+ Q +SV A+ + RV +L GE Sbjct: 21 ASADPRMGRLIALIGSLATKPQGPLFTELARSIISQQISVKAASTIRGRVIELAGEL--- 77 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAM 206 +P L A L+A G+ + L L++ G L + D E+ + Sbjct: 78 --------SPAALLAQSDADLRAAGLSASKVAYLKDLSDKVQSGQLDLDRLQELDDEEVI 129 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ-------IRRYAER 259 K L + GIGRW+A F + + V D +++ + + +++ A + Sbjct: 130 KQLVSVKGIGRWSAEMFLIFALGREHVVSYGDAGLQRAAKWVYDMEERPDRKYLQQAAAQ 189 Query: 260 WKPWRSYALLHIWYTEG 276 W + S A L++W Sbjct: 190 WPSYGSIASLYLWEAIN 206 >UniRef50_C7H057 8-oxoguanine DNA glycosylase n=1 Tax=Eubacterium saphenum ATCC 49989 RepID=C7H057_9FIRM Length = 294 Score = 156 bits (396), Expect = 5e-37, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 86/268 (32%), Gaps = 30/268 (11%) Query: 31 ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMS---RLFDLQCN 87 D Y+ R + + + + G E ++ Sbjct: 31 EDGSYSGVFR----RSFINVSLQEQGNIMKVKRILGESIDEEELYRFFDLGSSYERIKEK 86 Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + + + + G+R+ D FE + I+ Q +++ K + YG ++ Sbjct: 87 FKAKDEVMKKAILKGEGIRILRQ-DFFETLITFIISQNNNISRIRKNIESICAAYGSEVE 145 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA----------ALEGTLPMT 197 FPT + LA A + LKAL + RA ++ +E Sbjct: 146 AGSGIYAFPTAEELAGAKEKDLKALKL-GYRAGYIVKSVEHYIKKKDRIQRCIEKLAKEY 204 Query: 198 IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYA 257 + E L FPG+G A+ L K+ F D IK+ ++ Sbjct: 205 EKQEEENLFNELMKFPGVGAKVADCIMLFATPCKNRFPI-DTWIKKIMEEKYAMKVESKK 263 Query: 258 E----------RWKPWRSYALLHIWYTE 275 + ++ P+ A +++Y+ Sbjct: 264 DKDKVQKFVEVKFSPYAGIAQQYLFYSA 291 >UniRef50_Q8TL35 DNA-3-methyladenine glycosylase II n=1 Tax=Methanosarcina acetivorans RepID=Q8TL35_METAC Length = 299 Score = 156 bits (396), Expect = 5e-37, Method: Composition-based stats. Identities = 47/281 (16%), Positives = 105/281 (37%), Gaps = 18/281 (6%) Query: 9 PYDWSWMLGFLAARAV-SSVETVADSYYARSLAVGEYRGVVTAIPD-----IARHTLHIN 62 P+D+S L F+ A +TV + +++ + + + + Sbjct: 15 PFDFSKSLNFMGMFAPAEGEQTVTGFSFTKAVYLENKILAFRLKNEGTVSEPGLSYIFYS 74 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQ 116 E + + L ++ L + Q + + GL + FE Sbjct: 75 CEEISEEIKSALLDRIKFFLSLDDDLQPFYVLGSKDPQFVPVLEELYGLHQVKFLTPFEA 134 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMP 175 A+L Q +S+ +A K+ ++ + G + + Y FP+ +++ + L ++ Sbjct: 135 AAWAVLSQRISMKVAHKIKNKLTEAIGNSIQIEGIVYRTFPSARQVKNLGVENLASIIKN 194 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 +++E LI +A+A G+++ + L GIG W+A+ +RG + Sbjct: 195 ERKSEYLIAVADAFDRVDENFLRQGNIKDVREWLMNIWGIGEWSAHLILIRGLGRMEELS 254 Query: 236 PDDYLIKQRFPGMT-----PAQIRRYAERWKPWRSYALLHI 271 + + F Q RR A+ + ++ Y ++ Sbjct: 255 EHEKTLLNCFKRFYGPEATEDQFRRVADSYGDFKGYWAYYL 295 >UniRef50_C7MP98 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=2 Tax=Bacteria RepID=C7MP98_CRYCD Length = 234 Score = 156 bits (396), Expect = 6e-37, Method: Composition-based stats. Identities = 42/195 (21%), Positives = 74/195 (37%), Gaps = 19/195 (9%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + L A + D F V I+GQ +S A + R+ L GE Sbjct: 17 RDPQLACAIDAIGHVYREMDADLFSAVVHHIIGQQISTAAQQTVWLRMCDLLGEV----- 71 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKT 208 + Q + A P+ L++ G+ ++ + + A ++G+ + +A+ Sbjct: 72 ------SAQSITATSPEQLQSCGISFRKVDYIQDFAEKVMDGSFDLDAIEQASDAEAIAA 125 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ------IRRYAERWKP 262 L + GIG WTA L DV DD I++ + + +Y R+ P Sbjct: 126 LSSLRGIGTWTAEMLLLFCLGRPDVLSFDDLAIQRGLRMVYHHRKITRPLFEKYRRRYSP 185 Query: 263 WRSYALLHIWYTEGW 277 + S A L++W Sbjct: 186 YGSVASLYLWAISSM 200 >UniRef50_C6CD76 DNA-3-methyladenine glycosylase II n=1 Tax=Dickeya dadantii Ech703 RepID=C6CD76_DICDC Length = 225 Score = 156 bits (396), Expect = 6e-37, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 83/204 (40%), Gaps = 18/204 (8%) Query: 87 NPQIVNGALGRLGAARPGLRL--PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 + ++ RL A +R +E +RA+ Q +S AA + A++ + + Sbjct: 16 HLAAIDDRWERLIAGVGHIRFASRPGQQPYEALIRAVASQQLSNRAAAAIIAKLQKQF-- 73 Query: 145 RLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM---TIPGD 201 E FP+P +LA P+ L+ G ++ + + +A A+ G +P + Sbjct: 74 ----AMEETGFPSPSQLAECPPEHLRQCGFSSRKIDTVQAIARGAISGLVPDRASAALME 129 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIR 254 + + L T GIGRWT + + D+ DD I+Q F + ++ Sbjct: 130 DDTLITQLCTLHGIGRWTVEMLLINTLERMDIMPVDDLGIRQGFRYLYQLPSDPSRKEML 189 Query: 255 RYAERWKPWRSYALLHIWYTEGWQ 278 + +P+R+ A ++W Sbjct: 190 ALSAPCQPYRTLAAWYLWRIPHMP 213 >UniRef50_B2J3A5 HhH-GPD family protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J3A5_NOSP7 Length = 212 Score = 156 bits (395), Expect = 7e-37, Method: Composition-based stats. Identities = 46/198 (23%), Positives = 72/198 (36%), Gaps = 20/198 (10%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 ++ L R+ + F + IL Q VSVA A + R+ Sbjct: 25 IDSDLARILETLGPPPIWSREPGFATLLCIILEQQVSVAAARAVFNRLC----------- 73 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKT 208 I TP+ D L+ +G ++ LANA L ++ +++ Sbjct: 74 GVIVPLTPENFLTLDDVQLRGIGFSRQKILYSRGLANAIASDQLDLSKLERMDETTIRTE 133 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAERWK 261 L+ GIG WT + + L Q DVF D I TP Q+ + W+ Sbjct: 134 LKRLKGIGDWTVDIYLLMALQRPDVFPKGDLAIAIALQKLKNLATRPTPVQLEGMTQHWR 193 Query: 262 PWRSYALLHIWYTEGWQP 279 PWR+ A +W+ P Sbjct: 194 PWRAVAARLLWHYYLSNP 211 >UniRef50_A7EZ08 Putative uncharacterized protein n=1 Tax=Sclerotinia sclerotiorum 1980 UF-70 RepID=A7EZ08_SCLS1 Length = 418 Score = 156 bits (395), Expect = 8e-37, Method: Composition-based stats. Identities = 40/175 (22%), Positives = 68/175 (38%), Gaps = 10/175 (5%) Query: 90 IVNGALGRLGAARPGLRLPGC------VDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 V L + P R+ ++ F V I+ Q VS A A + A+ L+ Sbjct: 213 KVEPKLKPIIEKHPC-RIFSAEGLAEEIEPFRALVSGIISQQVSGAAAKSIKAKFVALFN 271 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGD 201 D FPTP + A D L+ G+ ++AE + LA +G L + Sbjct: 272 PPDSDPST-HTFPTPSAIVATDLARLRTAGLSQRKAEYISGLALKFTDGELTTQFLLSAS 330 Query: 202 VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRY 256 E+ +L G+G+W+ FA + DVF D +++ + + + Sbjct: 331 YEEVFASLIQVRGLGKWSVEMFACFALKRLDVFSTGDLGVQRGMAALLGKDVEKL 385 >UniRef50_UPI0000D54B32 HhH-GPD n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=UPI0000D54B32 Length = 197 Score = 156 bits (394), Expect = 1e-36, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 77/194 (39%), Gaps = 20/194 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L L + P + FE VR I Q +SVA A + R+A ++ E Sbjct: 15 DKDLEDLIKSIPKIVPFRREKGFEGLVRLICEQQLSVASAKAIFERLA-----KIVSPFE 69 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTL 209 F + L+ G+ ++ + LANA +EG L + K L Sbjct: 70 AKNF------LKVPKKDLQKTGLSRQKIDYCTGLANACIEGDLDFTTLHKMNDSDLRKEL 123 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK-------QRFPGMTPAQIRRYAERWKP 262 GIG+WTA+ + L + +D++ D ++ + + ++ + +WKP Sbjct: 124 CKIKGIGKWTADCYMLASLKREDIWPAGDLGLQISVQKLKKLSSRPSEMELEEISVKWKP 183 Query: 263 WRSYALLHIWYTEG 276 +R+ +W + Sbjct: 184 YRTLVANMLWNSYD 197 >UniRef50_B0K3N9 8-oxoguanine DNA glycosylase domain protein n=15 Tax=Clostridia RepID=B0K3N9_THEPX Length = 297 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 43/258 (16%), Positives = 84/258 (32%), Gaps = 30/258 (11%) Query: 31 ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQI 90 D Y Y V+ + + + FDL + + Sbjct: 38 EDGSYTGV----AYDRVINVKLEEDMLII-------DNTDLNDFYDIWFDYFDLGRDYKQ 86 Query: 91 V------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGE 144 + + L G+R+ D +E V I+ Q + K+ +A +GE Sbjct: 87 IKENLSRDPILKEAIQYGQGIRILRQ-DTWETLVSFIISQNNRIPQIKKVIENLASSFGE 145 Query: 145 RLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDV- 202 ++ Y FP + L D + + RA+ ++ A+ G + + + Sbjct: 146 PIEYKGKIYYTFPKAEELVMFDVETIAKTK-CGFRAKYILDAASKVFSGEIDLLKLFEYS 204 Query: 203 -EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRR 255 L G+G A+ L D F D IK+ TP +I+ Sbjct: 205 TNDIRDILMNINGVGPKVADCVILYSIGRYDTFP-TDVWIKRIVEYLYLKREGTPLEIQL 263 Query: 256 YA-ERWKPWRSYALLHIW 272 +A +++ +A +++ Sbjct: 264 FAIDKFGDLSGFAQQYLF 281 >UniRef50_Q1D1V1 HhH-GPD domain protein n=15 Tax=cellular organisms RepID=Q1D1V1_MYXXD Length = 499 Score = 155 bits (391), Expect = 2e-36, Method: Composition-based stats. Identities = 67/283 (23%), Positives = 106/283 (37%), Gaps = 17/283 (6%) Query: 4 LNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIP----DIARHTL 59 L+ + P+ + L R + V+ Y R+L V + +V D + Sbjct: 185 LDTRAPFHLEATVRVLQRRPTNLVDVWEGGRYLRALTVSDGFVLVEVSNQGTVDEPQVRF 244 Query: 60 HINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGAL------GRLGAARPGLRLPGCVDA 113 + AE + R L +P+ ++ L G + A G+R P Sbjct: 245 RVLDGDDSRGAHAEISRVLRRGLGLDVDPEPLDRLLQSERKLGPIVRALRGMRPPRFPSL 304 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKAL 172 FE I Q VS+ + R+ +G L + FPT LA A A+++ Sbjct: 305 FETFANVIPFQQVSLDAGVAVVRRLVARFGRFLPHEGQVRYAFPTAAALAEARLDAIRSC 364 Query: 173 GMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 G+ ++AEAL A A G + M +AM+ L GIG W+A LRG Sbjct: 365 GLSARKAEALRAAAAAIQAGDVTEAMLSQMSSAEAMRMLTGLHGIGPWSAALVLLRGLGR 424 Query: 231 KDVFLPDDYLIKQRFPGMTPAQ----IRRYAERWKPWRSYALL 269 DVF D + + G+ + + R R+ R Y Sbjct: 425 LDVFPEGDVGVIRGLSGLMHVEPGPALERLIRRFGEQRGYLYF 467 >UniRef50_D1IGU3 Whole genome shotgun sequence of line PN40024, scaffold_63.assembly12x (Fragment) n=1 Tax=Vitis vinifera RepID=D1IGU3_VITVI Length = 364 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 47/236 (19%), Positives = 81/236 (34%), Gaps = 24/236 (10%) Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIV-------NGALGRLGAARPGLRLPGCVDAFE 115 LS+ P A L ++RL + L + A + F Sbjct: 111 LSSHSSPRTAAILLPITRLLSSDDVVAAALRHLRSSDPVLAPVIDAYEPPKFENSDTPFL 170 Query: 116 QGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 ++IL Q ++ + R L G P + TP +L G+ Sbjct: 171 ALAKSILYQQITHKAGTTIYNRFVSLCGGETRVCPISVLALTPPQLLQI--------GVS 222 Query: 176 LKRAEALIHLANAALEGTLPMTIPGDVEQ-AMKTLQT-FPGIGRWTANYFALRGWQAKDV 233 ++ L LAN G L + +E A+ +L G G + + F + DV Sbjct: 223 ARKVSFLHDLANKYRTGILSDSKILTMEDRALVSLIAMVKGFGVLSVHMFMIFSLHRPDV 282 Query: 234 FLPDDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 D +++ + P+Q+ + ERW+P+RS A +IW + Sbjct: 283 LPVGDANLRKGVQMLYGLEELPRPSQMEKLCERWRPYRSVASWYIWRLSEANGVQG 338 >UniRef50_B0MMM9 Putative uncharacterized protein n=1 Tax=Eubacterium siraeum DSM 15702 RepID=B0MMM9_9FIRM Length = 264 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 39/206 (18%), Positives = 74/206 (35%), Gaps = 17/206 (8%) Query: 75 LAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSV 128 ++ FDL + + + + PG+R+ + FE + I+ Q ++ Sbjct: 60 ISYWQSYFDLDTDYDALIKQFSEDEHMRLACKENPGIRVLRQ-EPFETLISFIISQNNNI 118 Query: 129 AMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA 188 + R+ + +GE+ D FPT ++L + L L RA ++ Sbjct: 119 KRITGIIDRLCESFGEKTDRG---YMFPTLEKLVGVTAEDLAPLR-AGFRARYIVDAVEK 174 Query: 189 ALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP 246 + + D A + L+ G+G A+ L G+ D F D IK+ Sbjct: 175 LHSTEVSLDGIKAMDTTAAREELKKIKGVGDKVADCVLLFGYHKTDAFPR-DVWIKRIEQ 233 Query: 247 GMTPAQIRRYAERWKPWRSYALLHIW 272 + P + E K A ++ Sbjct: 234 KLYPDGLP---ECIKGNEGIAQQFLF 256 >UniRef50_B0P8C4 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=B0P8C4_9FIRM Length = 300 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 79/230 (34%), Gaps = 10/230 (4%) Query: 53 DIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVD 112 + SAG + R + N + AL A PG+R+ Sbjct: 60 QTNGKIVVSCQSAGDFDAVWRGYFDLDRDYGALKNRFRADPALAEAVAKAPGIRVLRQQ- 118 Query: 113 AFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKAL 172 +E I+ Q ++ + R+ +G L + FPTP+ +AA P AL L Sbjct: 119 PWEALCTFIISQNNNIPRITGIVGRMCAAFGRPL--GGGWHAFPTPEAIAALAPDALAPL 176 Query: 173 GMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQA 230 RA L+ A + G + + G +++A L G+G A L G Sbjct: 177 R-AGFRARYLVDAARRVVSGEVDLEALGTLPLDEAQAMLTRITGVGVKVAACTLLYGCGR 235 Query: 231 KDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 + D +++ + P + + A ++++ P Sbjct: 236 VECVPV-DVWMRRVLDRLYPGGMPACT---YGYEGIAQQYLFHCARTDPG 281 >UniRef50_Q5K8T8 DNA-3-methyladenine glycosidase, putative n=1 Tax=Filobasidiella neoformans RepID=Q5K8T8_CRYNE Length = 461 Score = 154 bits (389), Expect = 4e-36, Method: Composition-based stats. Identities = 40/174 (22%), Positives = 69/174 (39%), Gaps = 8/174 (4%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLRLPG--CVDAFEQGVRAILGQLVSVAMAAKLTA 136 L + ++ P +D F V +I+GQ VS A + Sbjct: 93 FNLPSAISHLSALDPRFSLFFEHLPCRPFVNLEAIDPFRTLVTSIIGQQVSWMAARAINT 152 Query: 137 RVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP- 195 R L+G + FP+PQ + D +LK +G+ ++AE ++ LA+ G L Sbjct: 153 RFRALFGFTHEKEG----FPSPQMVLMQDVTSLKGVGLSGRKAEYVLSLADHFASGQLST 208 Query: 196 -MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 + G E+ K L GIG+WT + F + + D+ D +++ Sbjct: 209 QLLQSGTDEEISKALIAVRGIGQWTVDMFMIFSLRRPDILAVGDLGVQKGLLKW 262 Score = 44.1 bits (103), Expect = 0.005, Method: Composition-based stats. Identities = 9/30 (30%), Positives = 18/30 (60%) Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 + +TP ++ E W+P+RS A+ ++W Sbjct: 427 KGGAYLTPKEMEALTEGWRPYRSLAVFYMW 456 >UniRef50_B5IDT4 Base excision DNA repair protein, HhH-GPD family n=3 Tax=Aciduliprofundum boonei T469 RepID=B5IDT4_9EURY Length = 289 Score = 153 bits (388), Expect = 5e-36, Method: Composition-based stats. Identities = 54/282 (19%), Positives = 110/282 (39%), Gaps = 18/282 (6%) Query: 5 NWQPPYDW-SWMLGF-LAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHIN 62 + + PY + F L+ R + + V ++ + R + + + V A + + Sbjct: 7 SVEKPYSIIPHLHRFSLSDRPLPCI--VENNLFWRLIPIEDIFIPVKAEIHEDIVKIDV- 63 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQ------IVNGALGRLGAARPGLRLPGCVDAFEQ 116 S E L+ + L + + L ++ GLR ++ +E Sbjct: 64 ASECENDYCKEVLSTVRHLLAVDISYSNFLKTLEDFPRLYKMAITYSGLRPARNLNLYEA 123 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYG-ERLDDFPEYICFPTPQRLAAADPQALKALGMP 175 ++ +L Q +S+ A TA++ + +G + Y FP P++L +KALG Sbjct: 124 LIKIVLQQRISLKYALNTTAKLIEKWGIREKWNGYSYYSFPPPEKLMRISTSEIKALGTT 183 Query: 176 LKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDV 233 +A++L+ +A G LP + + E+ +K L G+G WTA + Sbjct: 184 TVKAKSLLEIAKMEYNGDLPSIYEVNKNPEEYVKFLTGIYGVGMWTAELSVATVIHDYSI 243 Query: 234 FLPDDYLIKQR----FPGMTPAQIRRYAERWKPWRSYALLHI 271 D +++ +IR Y E++ W+ + + Sbjct: 244 APAGDLNVRKAFSKFLGLQGEKEIREYTEKFGKWKGLIMYLM 285 >UniRef50_C7HRV7 3-methyladenine DNA glycosylase n=6 Tax=Clostridiales Family XI. Incertae Sedis RepID=C7HRV7_9FIRM Length = 301 Score = 153 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 43/218 (19%), Positives = 82/218 (37%), Gaps = 19/218 (8%) Query: 72 AECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQL 125 E FDL N + + + L G+R+ + FE + I+ Sbjct: 76 EEFYDIFYDYFDLGVNYEDIKEKISLDKTLKEATEYGSGIRILNQ-EFFETLISFIISAN 134 Query: 126 VSVAMAAKLTARVAQLYGERLDD--FPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 + K ++++YG+ + + +Y FP P++L+ A P+ L+ R + ++ Sbjct: 135 NQIPRIKKAVRIISEMYGDYIGEYRGRKYYSFPNPEQLSKARPEDLREKARVGFRDKRIV 194 Query: 184 HLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI 241 A +G E K LQ PG+G A+ L + ++ F D I Sbjct: 195 QTAKIINDGFFDFEKDIKMPTEDLRKKLQELPGVGPKVADCILLFAFHKRETFPV-DVWI 253 Query: 242 KQRF------PGMTPAQIRRYAER-WKPWRSYALLHIW 272 K+ + QI YA++ + Y +++ Sbjct: 254 KRVMEFLFIKEEVPKKQISAYADKYFGENAGYVQQYLF 291 >UniRef50_A8SY14 Putative uncharacterized protein n=1 Tax=Coprococcus eutactus ATCC 27759 RepID=A8SY14_9FIRM Length = 297 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. Identities = 42/231 (18%), Positives = 82/231 (35%), Gaps = 25/231 (10%) Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV-------NGALGRLGAARPGLRLPGCVDA 113 + + D+ + ++ +GAL + G+ + D Sbjct: 64 LRIYGSSMEDYEGIWKL---YLDMDNDYGLIKQSVIKADGALKTAVDEKSGIHILNQ-DF 119 Query: 114 FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD--DFPEYICFPTPQRLAAADPQALKA 171 FE + I+ Q S+ + ++ +G+ + + + FP QRL A + L+ Sbjct: 120 FETLISFIVSQNKSIPQIKQCVKNISHRFGDEVIGYNGEAFYVFPDVQRLHDATEEELRE 179 Query: 172 LGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + RA + + A G + D+ QA + L T G+G AN L G Sbjct: 180 CKV-GFRAPYIKNATEAVYSGAVTKEKLDELDIAQARELLMTIKGVGEKVANCVLLFGLG 238 Query: 230 AKDVFLPDDYLIKQRFPGMT-------PAQIRRYA-ERWKPWRSYALLHIW 272 ++ F D +K+ M I +A ++ YA +++ Sbjct: 239 RREAFPV-DVWMKRIMEQMYFDGKDTKKQDIEAFAVNKFGDLGGYAQQYLF 288 >UniRef50_C8WLI9 HhH-GPD family protein n=4 Tax=Bacteria RepID=C8WLI9_EGGLE Length = 219 Score = 153 bits (387), Expect = 6e-36, Method: Composition-based stats. Identities = 43/199 (21%), Positives = 74/199 (37%), Gaps = 19/199 (9%) Query: 86 CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 + LG A + D F V I+GQ ++ + R+ + +GE Sbjct: 16 AYLAARDPRLGEAMAVIGRIEREVHPDLFAALVNCIVGQQIATKAQTTIWNRMLERFGEV 75 Query: 146 LDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVE 203 TP+ +AA L+ +G+ ++ + A L G + + + Sbjct: 76 -----------TPEAMAACSDDELQQVGISFRKVGYIKGAAARVLSGEVDLEGLAELSDD 124 Query: 204 QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYA 257 + +TL PGIG WTA Q ++ D I + +TP +Y Sbjct: 125 EVCRTLSALPGIGVWTAEMLMTFSMQRPNILSWGDLAIHRGLRMVHHHRRITPELFAKYR 184 Query: 258 ERWKPWRSYALLHIWYTEG 276 R+ P+ S A L++W G Sbjct: 185 RRYTPYGSVASLYLWEVAG 203 >UniRef50_Q6CEP5 YALI0B14080p n=1 Tax=Yarrowia lipolytica RepID=Q6CEP5_YARLI Length = 360 Score = 153 bits (387), Expect = 7e-36, Method: Composition-based stats. Identities = 48/268 (17%), Positives = 87/268 (32%), Gaps = 56/268 (20%) Query: 62 NLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 + + + + + + + + C + FE R I Sbjct: 87 EIYESDFSKGLKYILDVDPSLEDIVHSSEFTSFVKEAATRQESRTNRKCNNCFEHLTRGI 146 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEA 181 +GQ VS A A + + +L+ + E FP+PQ + +AL++ G+ ++AE Sbjct: 147 IGQQVSGAAAESILKKFKKLFP---VEGSEDGKFPSPQEILDTPTEALRSAGLSGRKAEY 203 Query: 182 LIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDY 239 + L+ A +GTL + + L GIG W+A+ F L + DVF D Sbjct: 204 ITCLSTAFKDGTLSDDWLSTASDDDVVDALVAIKGIGPWSADMFLLFALKRMDVFTLGDL 263 Query: 240 LIKQRF---------------------------------------------------PGM 248 I++ Sbjct: 264 GIQRGVSVYLKERPHLAEIIKQVDFSLPINGVHSPGKSKKAGARKAAKSKPDTKGKWRVP 323 Query: 249 TPAQIRRYAERWKPWRSYALLHIWYTEG 276 T ++ A R+ P+RS +L +W Sbjct: 324 TADEMTWVAHRFAPYRSVMMLILWKISD 351 >UniRef50_B6EMH3 DNA repair protein n=2 Tax=Gammaproteobacteria RepID=B6EMH3_ALISL Length = 202 Score = 153 bits (387), Expect = 7e-36, Method: Composition-based stats. Identities = 46/199 (23%), Positives = 76/199 (38%), Gaps = 20/199 (10%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 +++ L + FE + I+ Q +S +AA + R+ L E Sbjct: 15 LIDKDLEAAVKTQGYPSPRVNPHGFEAFLSIIVSQQLSTKVAAVIMGRLVALLKEV---- 70 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMK 207 TP+RL + + Q L+ +G+ ++ E LA A G L + E A+ Sbjct: 71 -------TPERLLSIEEQNLRDVGLSWRKIEYAKGLALAVQSGNLDIDGLESLSDEDAIS 123 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQIRRYAERW 260 + + G GRW+A + + +D+F DD + TP Q R W Sbjct: 124 AITSLKGFGRWSAEIYLMFSLGRQDIFPADDLGVLIALGRLKGLTDKPTPKQAREMVGHW 183 Query: 261 KPWRSYALLHIWYTEGWQP 279 +PWRS L +W Sbjct: 184 QPWRSVGSLFLWQYYHQDA 202 >UniRef50_A6TTX3 Methylated-DNA--protein-cysteine methyltransferase n=67 Tax=Bacteria RepID=A6TTX3_ALKMQ Length = 355 Score = 153 bits (386), Expect = 8e-36, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 73/189 (38%), Gaps = 19/189 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + LG + D F + +I+ Q +S A + R+ +L Sbjct: 173 DKKLGAAIERIGKIERGTIADPFTALISSIVSQQISNKAAETVWNRLDELLESM------ 226 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTL 209 TP+ + + ++ GM K+AE + +A+ AL G + ++ ++ L Sbjct: 227 -----TPESITKTELSQIQGCGMTNKKAEYIKGIADVALCGKINFKTLHMLSDQEIIQKL 281 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPW 263 + G+G WT + +V D I++ ++ Q +Y ++ P+ Sbjct: 282 SSLHGVGIWTVEMLLIFSLNRPNVVSYGDLAIRRGMMNLYGLKELSKEQFNQYRAKYAPY 341 Query: 264 RSYALLHIW 272 S A L++W Sbjct: 342 GSVASLYLW 350 >UniRef50_A5V920 HhH-GPD family protein n=7 Tax=Sphingomonadales RepID=A5V920_SPHWW Length = 368 Score = 153 bits (386), Expect = 9e-36, Method: Composition-based stats. Identities = 49/211 (23%), Positives = 82/211 (38%), Gaps = 18/211 (8%) Query: 80 RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVA 139 +L + A+ R+ A ++ +R I+GQ VSVA A + A++ Sbjct: 161 QLHRSIDALAGIEPAIARMLEAIGYPPPRIRDRGYQTLLRTIVGQQVSVAAANAIWAKME 220 Query: 140 QLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP 199 + G L P+ +AAA L+A G+ ++ LA GT+ Sbjct: 221 TMVGAGLA----------PEAVAAAPDDLLRATGLSRQKIAYARSLAEHVASGTIDFDRL 270 Query: 200 -GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTPA 251 D E+A+ + GIGRW+A + L DV+ D ++ + Sbjct: 271 PADDEEAIAQMTAIKGIGRWSAEIYLLFAEGRGDVWPAGDLAVQIEVGRLLGLPERPSER 330 Query: 252 QIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 + RR A W P R A + W++ + A Sbjct: 331 ETRRLAHGWSPHRGAAAIFAWHSYNARASTA 361 >UniRef50_B9XGL2 8-oxoguanine DNA glycosylase domain protein n=1 Tax=bacterium Ellin514 RepID=B9XGL2_9BACT Length = 293 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 46/244 (18%), Positives = 79/244 (32%), Gaps = 17/244 (6%) Query: 47 VVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLR 106 V + + L L + + LG A GLR Sbjct: 44 WVRLRSSSNSIIAEVAEPVTDWSWLVDFLQTHLELKSVLATFPK-DEPLGNAIRACHGLR 102 Query: 107 LPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY---ICFPTPQRLAA 163 L + +E IL + ++ + +GE + P + FP+ RLAA Sbjct: 103 LLRQ-NPWECLASFILSSTKQIVQIQQIVELLCIRFGEPVPVPPGHSPAYAFPSAMRLAA 161 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGT--LPMTIPGDVEQAMKTLQTFPGIGRWTAN 221 A L+ M RA L A G L DV+ A L PG+GR A+ Sbjct: 162 ATEAELRDCKM-GFRAPYLRETARMIHSGEVILERLYGMDVDDARAELLKLPGVGRKIAD 220 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFP-------GMTPAQIRRYAER-WKPWRSYALLHIWY 273 L + + F D + + + ++ ++ + + P YA ++++ Sbjct: 221 CVLLFAYGFQAAFPV-DVWVMKALQHLYFPKRRPSRKRLEKFTQTYFGPNAGYAQQYLFH 279 Query: 274 TEGW 277 Sbjct: 280 YMRT 283 >UniRef50_B9LPN6 HhH-GPD family protein n=4 Tax=Halobacteriaceae RepID=B9LPN6_HALLT Length = 198 Score = 152 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 47/196 (23%), Positives = 78/196 (39%), Gaps = 21/196 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + + RL L + D F + +I+ Q +S A AA + R + G Sbjct: 12 DSTMARLIDRHGRLTIEPAADEFARLCTSIVNQQLSTASAAAIHERFVDVLGGA------ 65 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG----DVEQAMK 207 PTP + AAD AL+ G+ + E L A A +G +T G E + Sbjct: 66 ----PTPDDVLAADEVALREAGLSGTKVEYLREAAAAFRDGDRDLTREGFGDASDEAVVA 121 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERW 260 L G+G WTA + + +DV D +++ + A++R + W Sbjct: 122 ALTEIRGVGEWTARMYLIFALGREDVLPLGDLAVRKGIEQVYNDGAELTRAEMRNIGDAW 181 Query: 261 KPWRSYALLHIWYTEG 276 +P+RSY ++W Sbjct: 182 RPYRSYGTRYVWAEYE 197 >UniRef50_A0Q2T4 8-oxoguanine-DNA-glycosylase, putative n=5 Tax=Clostridium RepID=A0Q2T4_CLONN Length = 292 Score = 151 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 79/205 (38%), Gaps = 13/205 (6%) Query: 78 MSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTAR 137 + R + + + L + G+RL D FE + I+ + M + Sbjct: 81 LYRDYSTIKDILKKDPLLKKSVEFGHGIRLLKQ-DPFELVISFIISANNRIPMIKRAILN 139 Query: 138 VAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM 196 +++ +G L+ Y FP Q+L + + L G+ RA+ + +E T+ + Sbjct: 140 ISKKWGNELEYKGKTYYSFPNVQQLKDSTIEQLSECGV-GFRAKYIYKTIQDIIEETIDL 198 Query: 197 T--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT----- 249 + ++ K LQ G+G A+ L + F D +K+ Sbjct: 199 DYIKSLNDDECHKELQKISGVGPKVADCIMLFSMEKYTAFPV-DVWVKRAMQHFYLAPDV 257 Query: 250 -PAQIRRYA-ERWKPWRSYALLHIW 272 +IR + +++ P+ +A +++ Sbjct: 258 SLKKIRDFGRDKFDPFCGFAQQYLF 282 >UniRef50_A0RYQ2 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Cenarchaeum symbiosum RepID=A0RYQ2_CENSY Length = 187 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 46/188 (24%), Positives = 78/188 (41%), Gaps = 18/188 (9%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 + RL + E VR+I+ Q +S + A+ + AR LYG Sbjct: 1 MARLIRLVGEYNPRRTRNRHEALVRSIITQQLSGSAASSILARFRALYGG---------G 51 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTF 212 FP P +A + L+ G+ +A+ + L+ L + E+ + L Sbjct: 52 FPRPADVARTPARKLQQAGISAMKADYIRGLSGMIDRRELKLAGFSRMGDEEVVAELVRV 111 Query: 213 PGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG-------MTPAQIRRYAERWKPWRS 265 G+GRWTA F + +DV D +++ T A+I + AERW+P+R+ Sbjct: 112 RGVGRWTAEMFLIFALGRQDVLPLGDLGLRKGVMKLCSMDSLPTDAEIVKTAERWRPYRT 171 Query: 266 YALLHIWY 273 A ++W Sbjct: 172 AATWYLWK 179 >UniRef50_B0U6C0 DNA-3-methyladenine glycosidase n=16 Tax=Xanthomonadaceae RepID=B0U6C0_XYLFM Length = 226 Score = 151 bits (382), Expect = 2e-35, Method: Composition-based stats. Identities = 53/219 (24%), Positives = 78/219 (35%), Gaps = 22/219 (10%) Query: 74 CLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAK 133 +A L+ LG L A R + VD RAIL Q +S A+ Sbjct: 10 VVAAYDHLYHCDPGLSGWMQRLGPLPALRGWRQPFNVVD---ALARAILFQQLSGKAAST 66 Query: 134 LTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT 193 + AR+ + G + LA D L+A G+ + AL L + G Sbjct: 67 IVARIEAVIGSTCLY---------AETLACIDDACLRACGVSSNKILALRDLTRREVAGE 117 Query: 194 LPMTI---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG--- 247 LP ++ L GIGRWT + DV DD +++ Sbjct: 118 LPSVRQMGAMHHNTIVEKLIPIRGIGRWTVEMMLMFRLGRPDVLPVDDLGVRKGIQRVDT 177 Query: 248 ----MTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDEA 282 TP + E W P+R+YA L++W + E Sbjct: 178 LAFVPTPKALCTRGECWAPYRTYAGLYLWRIADFHEGEG 216 >UniRef50_C4R2A8 Mitochondrial glycosylase/lyase n=1 Tax=Pichia pastoris GS115 RepID=C4R2A8_PICPG Length = 320 Score = 151 bits (381), Expect = 3e-35, Method: Composition-based stats. Identities = 39/208 (18%), Positives = 71/208 (34%), Gaps = 30/208 (14%) Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + ++ AA G+R+ D +E + I +V +K+ + YG+ + Sbjct: 104 DWASKDAHFAKVSAAFAGIRML-QQDPWETLISFICSSNNNVKRISKMCHALCLEYGDFI 162 Query: 147 DD--FPEYICFPTPQRLA-AADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGD-- 201 + +Y FPTP +LA A +L+ LG RA + A + Sbjct: 163 VEYAGTKYYSFPTPVQLASRASEASLRELGF-GYRARYVYETAQMLADDKALFMQLHSMR 221 Query: 202 -----VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRY 256 EQ + L F G+G A+ AL + D + Q + R Sbjct: 222 SSSFTDEQVHEFLLQFKGVGPKVADCVALMSLNRHSLVPI-DTHVLQFARRDYSYKFRGR 280 Query: 257 A-----------------ERWKPWRSYA 267 + ++W + +A Sbjct: 281 SNATLSSAMYVDMKRFFVDKWGSYAGWA 308 >UniRef50_C4Y1S0 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y1S0_CLAL4 Length = 494 Score = 151 bits (381), Expect = 3e-35, Method: Composition-based stats. Identities = 39/199 (19%), Positives = 70/199 (35%), Gaps = 25/199 (12%) Query: 99 GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL--DDFPEYICFP 156 A PG+R+ D +E V I +V +K+ + YG L D +Y FP Sbjct: 146 FAQFPGIRILRQ-DPWETVVSFICSSNNNVKRISKMCDALCAEYGRFLARHDGIDYFSFP 204 Query: 157 TPQRLAAADPQ-ALKALGMPLKRAEALIHLANAALE--------GTLPMTIPGDVEQAMK 207 PQ L++ + + L+ LG RA+ + A + L +A + Sbjct: 205 GPQVLSSPEVEGRLRELGF-GYRAKYIASTAKMFADSKWPHISLERLESLRTKPFAEAHE 263 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIK--------QRFPGMTPAQIRR---- 255 L G+G A+ L DV D ++++ + M Sbjct: 264 FLLQLTGVGPKVADCICLMALDKHDVVPVDTHVLQIAVRDYKYRGPRTMNKKTYEAVRGH 323 Query: 256 YAERWKPWRSYALLHIWYT 274 A+ + + +A ++ Sbjct: 324 LADLFGEYAGWAQSVMFAA 342 >UniRef50_C8W0S2 HhH-GPD family protein n=6 Tax=Bacteria RepID=C8W0S2_DESAS Length = 201 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 40/189 (21%), Positives = 73/189 (38%), Gaps = 19/189 (10%) Query: 95 LGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYIC 154 L + D F V +I+ Q +S AA + R + + E Sbjct: 21 LAEAIERIGIIEREIIPDLFAALVHSIISQQISSKAAATVWNRFLERFDE---------- 70 Query: 155 FPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT--LPMTIPGDVEQAMKTLQTF 212 T Q++A + ++ G+ +K+A + +A+A ++G + E+ K L Sbjct: 71 -ITSQKIAYTTAEEIQQCGITMKKAIYIKSIADAVMQGEFNIDELSELPDEEVCKRLSAL 129 Query: 213 PGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWKPWRSY 266 GIG WTA Q +V D I++ + A+ +Y R+ P+ + Sbjct: 130 NGIGVWTAEMLMTFSMQRPNVMSWGDLAIRRGIMMLYHHRKLDKAKFEKYKRRYSPYCTI 189 Query: 267 ALLHIWYTE 275 A L++W Sbjct: 190 ASLYLWEIA 198 >UniRef50_A9KID8 8-oxoguanine DNA glycosylase domain protein n=2 Tax=Clostridiales RepID=A9KID8_CLOPH Length = 272 Score = 150 bits (380), Expect = 4e-35, Method: Composition-based stats. Identities = 38/244 (15%), Positives = 79/244 (32%), Gaps = 17/244 (6%) Query: 27 VETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQC 86 ++ ++++ Y +A G + T + E L + Sbjct: 23 MKEISETSYE-VVAYGH---YLRISQQDNELTFSCT-EGEFHSIWNEYLGLNVDYDTITG 77 Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + + G+R+ + +E V ++ Q ++ K + + YG+ Sbjct: 78 LVNEDDRYMKSAMSFGWGIRILKQ-ELWETIVSFLISQQNNIPRIKKSIQMLCERYGDEK 136 Query: 147 --DDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDV 202 ++ Y FP P+ LK + R + ++ A A ++G+ + Sbjct: 137 LNENGDVYYTFPKPEAFLNLKDSELKECNL-GYRTKYILRTAKAVVDGSFDLEGLPNLSY 195 Query: 203 EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKP 262 E A K L G+G + L D F D I++ P + Sbjct: 196 EDAKKELMKLYGVGIKVSECICLYALHHFDAFPI-DTHIQKVLELNYPGGF-----PFDK 249 Query: 263 WRSY 266 +R Y Sbjct: 250 YRGY 253 >UniRef50_Q6BZL7 DEHA2A00418p n=2 Tax=Debaryomyces hansenii RepID=Q6BZL7_DEBHA Length = 300 Score = 150 bits (380), Expect = 4e-35, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 86/212 (40%), Gaps = 28/212 (13%) Query: 88 PQIVNGALGRLGAAR--PGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY--- 142 + + AL + P + ++A++ V+ I+ Q +S + A + + +L+ Sbjct: 72 FEKTDPALSDFIKSCDDPNTLMDVKMNAYQTLVKIIISQQLSTSAARSIMTKFIKLFLKE 131 Query: 143 ---GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--- 196 E F + FPTP+ + P+ L++ G+ ++A L+ ++ + + Sbjct: 132 GESTEPDHQFKAHPHFPTPEIVKETSPERLRSAGISFRKAGYLLIISEKFSDKNYLLNDD 191 Query: 197 --TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------- 246 E + L GIG W + F L + D+F D I++ Sbjct: 192 KKLNDMSNEDIARLLIDLKGIGPWAVDIFLLLYMKRSDIFPISDAGIRKGLSMLIQNTSG 251 Query: 247 -------GMTPAQIRRYAERWKPWRSYALLHI 271 ++ ++ +Y+E WKP+RS A ++ Sbjct: 252 KKGKKLNYLSIEEMEKYSENWKPYRSVASWYL 283 >UniRef50_C0KTC3 8-oxoguanine DNA glycosylase n=1 Tax=Clostridium sp. enrichment culture clone 7-14 RepID=C0KTC3_9CLOT Length = 269 Score = 150 bits (378), Expect = 7e-35, Method: Composition-based stats. Identities = 55/251 (21%), Positives = 89/251 (35%), Gaps = 18/251 (7%) Query: 31 ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQI 90 R + R +TA+ T + A AE FDL + Sbjct: 16 DSGQCFRLNQLEAGRFQLTALDRCVELTERADDWALDCSA-AELDKLWRGYFDLDTDYGA 74 Query: 91 -------VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 + L G+R+ D +E V I+ Q ++ + YG Sbjct: 75 YRAAVPEKDVYLTAAADFGRGIRILRQ-DPWEILVTFIISQRKNIPAIRACVETLCSRYG 133 Query: 144 ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVE 203 E + FPTP LA AD +AL+A + RA ++ A A GTL + +E Sbjct: 134 EPI---GPTYAFPTPAALAGADEEALRACAL-GYRAGYVLAAAQMADAGTLDLFALVSLE 189 Query: 204 Q--AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWK 261 ++L T PG+G AN +L G+ F D + + + Y R+K Sbjct: 190 DDQLAESLMTVPGVGVKVANCVSLFGYHRIAAFPR-DVWMNRVIHEHYRGRFPLY--RYK 246 Query: 262 PWRSYALLHIW 272 + +++ Sbjct: 247 GFAGVMQQYLF 257 >UniRef50_B3T522 Putative HhH-GPD superfamily base excision DNA repair protein n=3 Tax=environmental samples RepID=B3T522_9ARCH Length = 282 Score = 150 bits (378), Expect = 7e-35, Method: Composition-based stats. Identities = 34/223 (15%), Positives = 78/223 (34%), Gaps = 28/223 (12%) Query: 76 AKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVA 129 + + F N + + + + + PGLR+ D F+ + I+ ++ Sbjct: 52 KRAKKFFREDDNYEKILKNITKDKIVKKATKHYPGLRVTRQ-DPFQCCISFIVSSNSNIP 110 Query: 130 MAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANA 188 ++ + +G ++ + E+ FP P+ LA A Q L+ + R++ ++ + A Sbjct: 111 NIRMRLQKLCRKFGTKVRFEQREFFLFPRPKILAKATLQDLQECKL-GYRSKYVLDTSRA 169 Query: 189 ALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP 246 G + D ++ + L PGIG A+ L + + F D I + Sbjct: 170 VASGEIDFDELKKVDYQECKELLLKLPGIGDKVADCVMLFSLEKLEAFPL-DTWIVKILQ 228 Query: 247 GMTPAQI----------------RRYAERWKPWRSYALLHIWY 273 + + + + Y+ ++ Sbjct: 229 KYYSDNFCMDKKTISKKRYENIHQNVLDHFGKYAGYSQQFLYK 271 >UniRef50_UPI00016C0B45 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Epulopiscium sp. 'N.t. morphotype B' RepID=UPI00016C0B45 Length = 289 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 40/195 (20%), Positives = 72/195 (36%), Gaps = 16/195 (8%) Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL---D 147 ++ + G+R+ D FE + I+ Q ++ + +A+ +G+ + Sbjct: 82 IDIHMNNAIRFGGGIRILKQ-DPFEMLISFIISQNKAIPHIKQCINNIAERFGQPIFQEI 140 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVE--QA 205 Y FPT +L AA L + RA + + G + +T +E A Sbjct: 141 SSETYYAFPTLAQLQAATIDDLSECKV-GFRAAYIKDAIDKLSSGEVDLTSIASLETADA 199 Query: 206 MKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAER------ 259 K L G+G+ A+ L + DVF D IK+ G Q E Sbjct: 200 RKQLMKIKGVGKKIADCVLLFAYYRTDVFP-TDVWIKRVVEGFYFNQEETKLEAIDTFAK 258 Query: 260 --WKPWRSYALLHIW 272 +K +A +++ Sbjct: 259 NTFKDLAGFAQQYLF 273 >UniRef50_A5E623 Putative uncharacterized protein n=1 Tax=Lodderomyces elongisporus RepID=A5E623_LODEL Length = 438 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 73/202 (36%), Gaps = 28/202 (13%) Query: 99 GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFP 156 + G+R D +E + I +V +K+ + + +G+ ++++ Y FP Sbjct: 167 FDSHTGIRTLRQ-DPWECLISFICSSNNNVKRISKMCDNLCEHFGDLVNEYEGYKHYSFP 225 Query: 157 TPQRLAAADPQA-LKALGMPLKRAEALIHLANAALE--------GTLPMTIPGDVEQAMK 207 TP++L+A++ ++ L+ LG RA+ + A L D EQA + Sbjct: 226 TPEQLSASNTESKLRELGF-GYRAKYIYQTAKKFTSPEYPDITIEKLMSMRDMDYEQAHE 284 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRY----------- 256 L G+G A+ L DV D ++ + R Sbjct: 285 FLLQLSGVGPKVADCICLMSLDKHDVVPIDTHVYQIAVRDFKYKGKRDLKTLNKEMHRNI 344 Query: 257 ----AERWKPWRSYALLHIWYT 274 + + + +A ++ Sbjct: 345 RKFFRDIFGDYAGWAQSVLFAA 366 >UniRef50_A1K6J5 DNA-3-methyladenine glycosylase II n=21 Tax=Proteobacteria RepID=A1K6J5_AZOSB Length = 229 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 45/225 (20%), Positives = 86/225 (38%), Gaps = 22/225 (9%) Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPG--LRLPGCVDAFEQGVRAILG 123 A++ +L + ++ RL A L+ + +E VRA+ Sbjct: 2 DTPDTLPPAQAELYQLAT--AHLAGIDADWARLVTAVGPCLLQPKPAREPYEALVRAVAY 59 Query: 124 QLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALI 183 Q ++ ++ ++ R+ LY + FP P++L A AL+ G ++ E + Sbjct: 60 QQLATSVGDRIIGRLLALYPDS--------AFPQPEQLLATGFDALRGCGFSARKIETIH 111 Query: 184 HLANAALEGTLPMTIP---GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 +A L G +P D E + L GIGRWT + + DV DD+ Sbjct: 112 GIAQGTLSGLVPSRADAVSMDDEALIARLVELRGIGRWTVEMLLIFTLERIDVLPVDDFG 171 Query: 241 IKQRFPG-------MTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 +++ + ++ R P+R+ A ++W + Sbjct: 172 VREGYRHLKSLDEMPGRKEMARAGLVCSPYRTVAAWYLWRSLALP 216 >UniRef50_C6PBG5 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PBG5_CLOTS Length = 303 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 40/255 (15%), Positives = 82/255 (32%), Gaps = 18/255 (7%) Query: 31 ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQI 90 D Y + V+ D T+ N + + R + Sbjct: 43 DDGSYTGV----AFDRVINVKLDGDILTID-NTTLADFNDIWYDYFDLGRDYGKIKEALS 97 Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DF 149 + L G+R+ D +E + I+ Q + K+ +++ +G + Sbjct: 98 QDEILRAAIKYGEGIRILRQ-DTWETLISFIISQNNRIPQIKKVIENLSRAFGHPIVYKN 156 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMK 207 Y FP Q + AD ++L R++ +I A + + D + Sbjct: 157 KTYYTFPKVQDIIMADEESLNNSK-CGFRSKYIIDAALKVFNDEINLFELQLYDTHEVRN 215 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYA-ERW 260 L + G+G A+ L + F D IK+ A ++ +A E++ Sbjct: 216 ILMSIRGVGPKVADCVILYSIGRYEAFP-TDVWIKRVVEFLYLKRKTNNADVQSFAKEKF 274 Query: 261 KPWRSYALLHIWYTE 275 +A +++ Sbjct: 275 GDLSGFAQQYLFNYA 289 >UniRef50_Q4A0G9 Putative DNA-3-methyladenine glycosidase n=1 Tax=Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 RepID=Q4A0G9_STAS1 Length = 221 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 39/198 (19%), Positives = 75/198 (37%), Gaps = 19/198 (9%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 + L +L L++ D + +R+I+GQ ++VA+A + +++ + Sbjct: 18 KQDATLAQLINQIGDLQIQTRADPLKSLIRSIIGQQITVAVAQSIFQKLSIAIDDHW--- 74 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMK 207 T +L+ +KALG+ + + ++ A G L D + Sbjct: 75 -------TVNQLSQLRESEMKALGLSQSKINYIQNVLFAVRNGQLNFEQLYKMDDNSVIN 127 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERW 260 L GIGRWTA F L Q K++ D +++ + Q+ E+W Sbjct: 128 ALTQIKGIGRWTAEVFLLFTLQRKNILPIYDVGLQRAAQWLYQTTKAERKKQLTICKEQW 187 Query: 261 KPWRSYALLHIWYTEGWQ 278 + S ++W Sbjct: 188 QGCASIGAFYLWEAIHQD 205 >UniRef50_C4G1E5 Putative uncharacterized protein n=1 Tax=Abiotrophia defectiva ATCC 49176 RepID=C4G1E5_ABIDE Length = 268 Score = 149 bits (376), Expect = 1e-34, Method: Composition-based stats. Identities = 41/204 (20%), Positives = 72/204 (35%), Gaps = 22/204 (10%) Query: 78 MSRLFDLQCNPQIV-------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAM 130 FDL + + + L A G+R+ D +E + I+ Q ++ Sbjct: 65 WFDYFDLGTDYSAIKALADEKDEYLKNAIANGFGIRILRQ-DLWEMIISFIVSQNNNIPR 123 Query: 131 AAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAAL 190 A++ L E FP+ + L+ + L + G+ R E + +A Sbjct: 124 IKNSIAKLCAL--------TEDGSFPSAEVLSKVSVEELHSYGL-GYRDEYIHRMAVKTA 174 Query: 191 EGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 G I E+A K L GIG+ A+ + G D F D +KQ Sbjct: 175 NGEFVPESLIGLPYEEAKKLLMAEHGIGKKVADCICVFGLHMLDAFPI-DTHVKQILAAH 233 Query: 249 TPAQIRRYAERWKPWRSYALLHIW 272 ER+K + + +++ Sbjct: 234 YEDGFP--FERYKGYAAVMQQYMF 255 >UniRef50_A9M750 HhH-GPD family protein n=55 Tax=Rhizobiales RepID=A9M750_BRUC2 Length = 232 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 50/221 (22%), Positives = 82/221 (37%), Gaps = 23/221 (10%) Query: 73 ECLAKMSRLFDLQCNPQIV---NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVA 129 + + ++ L D++ + + + L + + L FE ++ Q VS A Sbjct: 17 QAMRRIDTLSDIEAGLEALVLADRRLADIRNRSHAVPLRRSEPGFESLASIVVAQQVSTA 76 Query: 130 MAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAA 189 AA + AR+ Q+ TP+ A +A + G+ + L+ L+ A Sbjct: 77 SAAAIWARLKQVI-----------NPLTPEAYIAGGEEAWRLAGLSRPKQRTLLALSEAL 125 Query: 190 LEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP- 246 G L + D+ +A+ TL GIG WTA + L DVF D ++ Sbjct: 126 AGGALDLHGLCDLPAGEAIATLTAIKGIGPWTAEVYLLFAAGHPDVFPAGDVALQTAVGH 185 Query: 247 ------GMTPAQIRRYAERWKPWRSYALLHIWYTEGWQPDE 281 A +R+ AE W PWR A W Sbjct: 186 AFAHETRPDAAALRQLAENWAPWRGVAARLFWAYYAAIKGR 226 >UniRef50_UPI0001C41B95 8-oxoguanine DNA glycosylase Ogg n=1 Tax=Methanobrevibacter ruminantium M1 RepID=UPI0001C41B95 Length = 358 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 45/294 (15%), Positives = 101/294 (34%), Gaps = 57/294 (19%) Query: 37 RSLAVGEYRGVVTAIPDIAR-HTLHINLSAG-------------------LEPVAAECLA 76 +++ + ++ D + S L+ + E + Sbjct: 55 KTINLNGLPVLLNLSQDKDNINGFEYTYSLPQKVEKQAFSIGQDNLSKKELQSIEREIDS 114 Query: 77 KMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAM 130 ++ ++DL+ + + + L GLRL D FE + +I S+A Sbjct: 115 NLNNIYDLEFDLEKFYEFLLEDEKLAPSVDFCKGLRLFIAKDPFECIISSICSANNSIAR 174 Query: 131 AAKLTARVAQLYGERLD-DFPEYICFPTPQRLAA---ADPQA----------------LK 170 ++ +GE+++ D + FP+P+ + LK Sbjct: 175 WTASIDKIKLNWGEKVEFDEGMFYGFPSPKDFLDFYETPIEQSEADGRRYEVDCYTKNLK 234 Query: 171 ALGMPLKRAEALIHLANAALEG-TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQ 229 + G+ RA + + ++ + ++A + PG+G A+ L G+ Sbjct: 235 SCGV-GYRAPYMKKASQMLIDEIDMNDVSKMAYDEAFDLILRLPGVGPKVADCILLYGFG 293 Query: 230 AKDVFLPDDYLIKQRFPGMT-------PAQIRRYA-ERWKPWRSYALLHIWYTE 275 ++ F D IK+ + + R + ER+ + YA L++++ Sbjct: 294 FQEAFP-SDVWIKRIVSHLYFDGEDISADKTREFGIERFGDYAGYAQLYLFHYA 346 >UniRef50_C2L088 8-oxoguanine DNA glycosylase n=2 Tax=Clostridiales RepID=C2L088_9FIRM Length = 275 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 69/223 (30%), Gaps = 8/223 (3%) Query: 31 ADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQI 90 D Y GE + A E ++ Sbjct: 27 EDGSYR--FISGESVIYLRPEDREAGVYTVSCDRESWETTWFPFFDLERCYSEIAVLESG 84 Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFP 150 + + + A G+RL D +E + I+ Q S+ K +++ YG + Sbjct: 85 KHEFVDQAIAHGRGVRLLRQ-DPWEMLLTFIISQRKSIPAIIKSVEALSEKYGHDIVTEQ 143 Query: 151 E-YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMK 207 E FP+P+ + A + L A G+ R + ++ G L + + ++ Sbjct: 144 ERLKAFPSPEEMKEATAEELAACGL-GYRVKYILDAIQKVNSGELNLKAIAKLPDDVLLE 202 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP 250 LQ G+G AN AL + D I + Sbjct: 203 KLQAVMGVGIKVANCIALFAYGRTACVPV-DVWIFRAIEKECG 244 >UniRef50_Q7NJ14 Gll2018 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ14_GLOVI Length = 206 Score = 148 bits (375), Expect = 2e-34, Method: Composition-based stats. Identities = 44/196 (22%), Positives = 74/196 (37%), Gaps = 20/196 (10%) Query: 92 NGALGRLGAARPGL--RLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 + L + + F+ VRAI+ Q +S AA + R+ L+ R Sbjct: 20 DPILAAIIERVGDCSYQTSAAGTHFDAVVRAIVYQQLSGKAAATIHKRLCDLFDGR---- 75 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMK 207 P P L A + AL+ +G+ ++ L LA G L + + + + Sbjct: 76 -----PPLPAELLAVEAAALRGVGLSRQKLNYLKSLAAQVESGALAIETLHILEDQAILA 130 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERW 260 L GIGRWTA F + +V D I++ P Q+ +E W Sbjct: 131 ELMRLKGIGRWTAQMFLMFRLGRPNVLPEGDLGIQKAIQLAYSLKALPSPKQMAAVSEPW 190 Query: 261 KPWRSYALLHIWYTEG 276 P+ + A ++W + Sbjct: 191 HPYCTIACWYLWRSLE 206 >UniRef50_A3JFL8 3-methyladenine DNA glycosylase n=1 Tax=Marinobacter sp. ELB17 RepID=A3JFL8_9ALTE Length = 207 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 47/204 (23%), Positives = 72/204 (35%), Gaps = 20/204 (9%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 L V+ LGR+ L F V IL Q +S+ A + R+ Sbjct: 11 LDYGAQQLAAVDADLGRIYTRLGAPPLWAREPGFASLVHIILEQQISIKAAQTVFERLCA 70 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP- 199 GE +PQR+ +A + LKA G+ ++A LA G L + Sbjct: 71 HLGEM-----------SPQRMVSAGEEELKAFGLTRQKARYCFGLAERIHTGKLNLAQLD 119 Query: 200 -GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPA 251 + L PG+G W+ + + L + DV+ D + + T Sbjct: 120 ALSDTEGRDALLAIPGLGPWSVDVYYLMALRRPDVWPLGDLALAAAMQEIKQLDAPATRQ 179 Query: 252 QIRRYAERWKPWRSYALLHIWYTE 275 Q A W PWR+ A +W Sbjct: 180 QQVDIANAWSPWRAVAARLLWMHY 203 >UniRef50_D0J2I3 HhH-GPD n=6 Tax=Comamonadaceae RepID=D0J2I3_COMTE Length = 274 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 55/242 (22%), Positives = 88/242 (36%), Gaps = 32/242 (13%) Query: 49 TAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP--GLR 106 A +H++L G+ AE ++ R + L RL L Sbjct: 52 AVASKAAAPKIHLDLPDGVPAYWAEACRQLMR----------RDRVLKRLIPQLGSQALL 101 Query: 107 LPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADP 166 G AF R+I+GQ +S A L + +L P+++ Sbjct: 102 PCGQEQAFATLARSIIGQQISAKSAKTLWNKFVRL-----------PAAMQPEQVLRLKV 150 Query: 167 QALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFA 224 ++A+G+ ++ + L+ LA E L M E + L + G+ RWTA F Sbjct: 151 DDMRAVGLSARKVDYLVDLALHFTENRLHMDEWAQMSDEVIIAELMSIRGLSRWTAENFL 210 Query: 225 LRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGW 277 + +V DD + Q + R AE WKPW + A +IW + Sbjct: 211 IYCLGRPNVLPLDDAGLIQGISLNHFSGDPVSRSDAREVAEAWKPWCTVATWYIWRSLEA 270 Query: 278 QP 279 QP Sbjct: 271 QP 272 >UniRef50_Q3B3Y2 HhH-GPD n=1 Tax=Chlorobium luteolum DSM 273 RepID=Q3B3Y2_PELLD Length = 311 Score = 148 bits (374), Expect = 2e-34, Method: Composition-based stats. Identities = 62/280 (22%), Positives = 95/280 (33%), Gaps = 43/280 (15%) Query: 32 DSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV 91 D Y ++ G I + + ++ V + + R F L + + + Sbjct: 32 DGRYVSAII----NGSAVVIENTNDGGVVLHTDGNTIGVESPQV-WFRRYFSLDVDTETL 86 Query: 92 --------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 + L GLR+ D +E V + Q + +A+ + + +A+ YG Sbjct: 87 FSEPFRNAHPELALQLERYRGLRVLRQ-DPYETMVTFMCAQGIGMALIRRQVSMLARRYG 145 Query: 144 ERLDD-----FPEYICFPTPQRLAAADPQALKALGMPL-KRAEALIHLANAALEGTLPMT 197 E + FPTP RL AADP L+A RA +I + EG + Sbjct: 146 EHVPLSLNGCTINLYRFPTPSRLGAADPMELRACTNNNLMRARNIISASQKVTEGCIDFK 205 Query: 198 IPGD----VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM----- 248 E L GIG A+ AL G D F D ++Q Sbjct: 206 ALASKKNTQEDIQAALSRCGGIGLKIADCIALFGLGRFDAFPI-DTHVRQFLGLWFGFPE 264 Query: 249 -----TPAQIRRYAERWKP--------WRSYALLHIWYTE 275 T R AER + +R + L H W TE Sbjct: 265 ASAPLTDKNYRILAERARELLGEKLAGYRGHHLFHCWRTE 304 >UniRef50_C6WJ98 Transcriptional regulator, AraC family n=5 Tax=Actinomycetales RepID=C6WJ98_ACTMD Length = 584 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 79/290 (27%), Positives = 114/290 (39%), Gaps = 30/290 (10%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLH 60 + L ++ P + G L A AV VE D Y R+L + GVV+ P Sbjct: 300 VLRLPFRGPLHAPSLFGPLVANAVPGVEEWRDGAYRRTLRLPRGHGVVSLRPRADHVECD 359 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIV------NGALGRLGAARPGLRLPGCVDAF 114 + L+ +++ R DL +P V + AL L A PG R+PG VD Sbjct: 360 LTLT--DSRDLPVAISRCRRALDLDADPAEVDGALRADPALRPLVDAAPGTRVPGVVDGA 417 Query: 115 EQGVRAILGQ------LVSVAMAAKLTARVAQLYGERLDDFPE---YICFPTPQRLAAAD 165 E VRA+LG+ + A RV + GE + D FPTPQ L D Sbjct: 418 ECAVRALLGEGTGTGAAMGAGANAGWAHRVVREAGEAVPDPAGGGLTHLFPTPQALLDLD 477 Query: 166 PQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFAL 225 P L A A + AL G + + D +A L+ G + Sbjct: 478 PALLPPP------ARAPLTALLTALVGGVDLGAGADRAEARSALRC---AGERVLDAVLT 528 Query: 226 RGWQAKDVFLPDDYLIKQRFPGM----TPAQIRRYAERWKPWRSYALLHI 271 R D F PDD ++ G+ T A + + W+PWR+YA ++ Sbjct: 529 RSLGDPDGFCPDDPAVRAAAGGIGLPVTAAALADRSRAWRPWRAYATRYL 578 >UniRef50_Q6CEU4 YALI0B12870p n=1 Tax=Yarrowia lipolytica RepID=Q6CEU4_YARLI Length = 403 Score = 148 bits (373), Expect = 3e-34, Method: Composition-based stats. Identities = 44/268 (16%), Positives = 88/268 (32%), Gaps = 29/268 (10%) Query: 32 DSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGL-EPVAAECLAKMSRLFDL----QC 86 ++ Y + + V+ D + + A L + Sbjct: 44 NNNYW-IIGMEGRGIVLNQKDDDTMWAEVSDKGKPVKSRDTAAILNDYFNISTDTIKLYE 102 Query: 87 NPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL 146 + + G+R+ D +E I +V ++L ++ +G+ + Sbjct: 103 DWSSRDDHFKNKSIKYLGIRVLRQ-DPWENLCSFICSSNNNVKRISQLVQKMTITFGDHV 161 Query: 147 DDFPEY--ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT-----LPMTIP 199 + FP+P +LA +P L+ LG+ RA+ + A L L Sbjct: 162 ATLDDLKIHSFPSPDKLADTEP-ILRELGL-GYRAKYISKTAEMLLTKPGGEQFLHELRD 219 Query: 200 GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD---------DYLIKQRFPGMTP 250 ++A ++ F G+G A+ L D D DY + + MTP Sbjct: 220 ASFDEAKSSIMEFLGVGPKVADCVCLFSLDKHDTVPVDTHVWQIAQKDYGVARSAKTMTP 279 Query: 251 AQIRR----YAERWKPWRSYALLHIWYT 274 + + E+W P+ +A ++ Sbjct: 280 KAYAQVQEFFREKWGPYAGWAHCVLFAA 307 >UniRef50_A6EE77 3-methyladenine DNA glycosylase n=1 Tax=Pedobacter sp. BAL39 RepID=A6EE77_9SPHI Length = 222 Score = 147 bits (371), Expect = 4e-34, Method: Composition-based stats. Identities = 42/206 (20%), Positives = 70/206 (33%), Gaps = 20/206 (9%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARV 138 + + L + + + FE V IL Q VS+A A ++ Sbjct: 9 DNFHQICDELAATDADLRSVIRTYGYPPMWKRSNTFESLVHIILEQQVSLASALAALNKL 68 Query: 139 AQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG--TLPM 196 E TP L + LKA + +++ + HLA + L G L + Sbjct: 69 RDRLKEV-----------TPGVLLQLTDEELKACYLSRQKSIYVRHLATSILHGSIDLDL 117 Query: 197 TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMT 249 + L G+G WT + + + Q DVF D G Sbjct: 118 MPRLPDREIRILLNQLKGVGNWTIDVYLMFVLQRADVFPSGDLAAVNALKQLKDLPVGTH 177 Query: 250 PAQIRRYAERWKPWRSYALLHIWYTE 275 + R A W+P+R+ A + +W+ Sbjct: 178 KEVLERIAMNWQPYRTVATMILWHYY 203 >UniRef50_O94468 Probable DNA-3-methyladenine glycosylase 2 n=1 Tax=Schizosaccharomyces pombe RepID=MAG2_SCHPO Length = 213 Score = 147 bits (371), Expect = 5e-34, Method: Composition-based stats. Identities = 37/202 (18%), Positives = 75/202 (37%), Gaps = 19/202 (9%) Query: 85 QCNPQIVNGALGRLGAARPG--LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY 142 + + ++ L L +E +RAI Q +S A + + Sbjct: 11 EKHLSSIDNKWSSLVKKVGPCTLTPHPEHAPYEGIIRAITSQKLSDAATNSIINKFCTQC 70 Query: 143 GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--- 199 + + FPTP+++ D + L G +++ + +A AAL +P Sbjct: 71 SDNDE-------FPTPKQIMETDVETLHECGFSKLKSQEIHIVAEAALNKQIPSKSEIEK 123 Query: 200 GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP-------AQ 252 E+ M++L G+ RWT +++ D+ DD +K + Sbjct: 124 MSEEELMESLSKIKGVKRWTIEMYSIFTLGRLDIMPADDSTLKNEAKEFFGLSSKPQTEE 183 Query: 253 IRRYAERWKPWRSYALLHIWYT 274 + + + KP+R+ A ++W Sbjct: 184 VEKLTKPCKPYRTIAAWYLWQI 205 >UniRef50_C4Z3R2 N-glycosylase/DNA lyase n=4 Tax=Bacteria RepID=C4Z3R2_EUBE2 Length = 287 Score = 147 bits (371), Expect = 5e-34, Method: Composition-based stats. Identities = 47/245 (19%), Positives = 87/245 (35%), Gaps = 29/245 (11%) Query: 47 VVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGR-------LG 99 V+ + T + + E + FD+ + + L Sbjct: 42 VLHVSQEADTVTFY-------DTDKDEYVNVWKDYFDMDRDYSAIKKKLLEKDDKLKDAI 94 Query: 100 AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFPT 157 + G+R+ D FE + I+ Q + K+ A ++ +G + FPT Sbjct: 95 ESMWGVRILNQ-DFFETLISFIISQNKQIPHIKKIVADISAKFGTYKGTYGGADMYTFPT 153 Query: 158 PQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP--GDVEQAMKTLQTFPGI 215 ++LA A + K L RA ++ + G + D + +K L T G+ Sbjct: 154 LEQLANASEEDFKELK-TGFRAPYIMDAIRRNMAGQFDINELKSMDYDSCIKELMTIKGV 212 Query: 216 GRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYA-ERWKPWRSYA 267 G AN +L G K+ F D IK+ M +I +A E++ +A Sbjct: 213 GEKVANCVSLFGLGKKEAFPV-DVWIKRIMETMYFDGVDTPKDKIAAFAKEQFGELGGFA 271 Query: 268 LLHIW 272 +++ Sbjct: 272 QQYLF 276 >UniRef50_UPI00006DC22F hypothetical protein CdifQ_04000214 n=1 Tax=Clostridium difficile QCD-32g58 RepID=UPI00006DC22F Length = 200 Score = 146 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 38/192 (19%), Positives = 75/192 (39%), Gaps = 14/192 (7%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--F 149 + L + G+R+ D +E + I+ + M + +++ +G+ + + Sbjct: 2 DEYLNKATEFGWGIRILRQ-DGWEMLISFIISSNNRIPMIQRAIENLSRKFGKYIGEYEG 60 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG--TLPMTIPGDVEQAMK 207 EY FPTP+ L A + ++A R + + A +E + E K Sbjct: 61 NEYYAFPTPEELNKASQEEIRACQ-TGFRDKYIKSTTQAVIENNDEVSEYTNLSTEDCRK 119 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ------IRRYA-ERW 260 L F G+G + AL G Q D F D +K+ + +R Y +++ Sbjct: 120 ELLKFNGVGPKVCDCIALFGMQKYDSFPV-DVWVKRVMQEFYIDEDMSLPKMRTYGIDKF 178 Query: 261 KPWRSYALLHIW 272 K +A +++ Sbjct: 179 KEMSGFAQQYLF 190 >UniRef50_B8I162 8-oxoguanine DNA glycosylase domain protein n=2 Tax=Clostridium RepID=B8I162_CLOCE Length = 295 Score = 146 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 42/267 (15%), Positives = 84/267 (31%), Gaps = 18/267 (6%) Query: 28 ETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCN 87 R + + A + + + + + FDL + Sbjct: 27 HIFDCGQCFRWIRQEDGSYRGIARGRMVNVSYDNEVFCITNSSEQDFIDIWYEYFDLGTD 86 Query: 88 PQIV------NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQL 141 + + + G+RL D +E + I+ + K ++ L Sbjct: 87 YSKIKSVLEQDEIMREAIKTGWGIRLLKQ-DFWEMLISFIISANNMIPRIMKTVDALSVL 145 Query: 142 YGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG--TLPMTIP 199 G+ +D FP + LA + +K R + + + +G T + Sbjct: 146 RGKCIDSGRNAYSFPEIKALAETSLEDIKQCK-AGFRCKYIHKTSALMAQGIVTEEILRS 204 Query: 200 GDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT------PAQI 253 D A K L PG+G A+ L DVF D +K+ + +I Sbjct: 205 MDTAMARKELMILPGVGPKVADCILLFSGLKYDVFPI-DVWVKRVMEELYLKKESSHKEI 263 Query: 254 RRYA-ERWKPWRSYALLHIWYTEGWQP 279 + +A +++ YA +++Y Sbjct: 264 QEFATKQFGGLTGYAQQYLFYHARLNK 290 >UniRef50_D0XPK8 HhH-GPD family protein n=1 Tax=Brevundimonas subvibrioides ATCC 15264 RepID=D0XPK8_9CAUL Length = 230 Score = 146 bits (370), Expect = 6e-34, Method: Composition-based stats. Identities = 59/222 (26%), Positives = 86/222 (38%), Gaps = 26/222 (11%) Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAIL 122 + L P+ E LA ++ AL R A P F R I+ Sbjct: 12 VDKDLMPLTPEDLAAARETL------ARLDPALARAHAQTPPFEWRVRQGGFVGLFRMIV 65 Query: 123 GQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEAL 182 Q VSVA AA + AR+ GE TP L A D +L+ +G+ ++A Sbjct: 66 EQQVSVASAASVWARLQAGLGE-----------ITPAGLLAHDLDSLRGMGLSRQKATYG 114 Query: 183 IHLANAALEGTLPMTIPG--DVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYL 240 +A A +EGT+ + D A++ L G+G WTA + L DVF D Sbjct: 115 QGMARAQIEGTIDLEHLATLDDAAAIEALVRLKGVGLWTAEAYLLLCEGRTDVFPGGDVA 174 Query: 241 IKQRFPGMTPAQIR-------RYAERWKPWRSYALLHIWYTE 275 +++ + R AE W+PWR A +W Sbjct: 175 LQEAIKWADGTETRPDTKGAYARAEIWRPWRGVATHLLWAWY 216 >UniRef50_B6BS24 DNA-3-methyladenine glycosylase I n=4 Tax=SAR11 cluster RepID=B6BS24_9RICK Length = 211 Score = 146 bits (370), Expect = 7e-34, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 72/202 (35%), Gaps = 22/202 (10%) Query: 92 NGALGRLGAARPG---LRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD 148 + + L + L D F ++I+GQ +SVA A + L Sbjct: 19 DKVMKYLIQKYSEPSEVTLTSRKDIFYSLCKSIIGQQISVAAANSVF----------LKF 68 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLP--MTIPGDVEQAM 206 + + + LK+ G+ ++A+ + LA L T + E+A+ Sbjct: 69 KKKCKNKINAKTVYKLTVTQLKSCGLSRQKAKGIKSLAKQTLNKTFDSKLIPKMSDEEAI 128 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-------RYAER 259 L IGRW+A L + +++ D + + + +R Sbjct: 129 IYLSKLRQIGRWSAEMILLFTYNRSNIWPIQDIGLLRAISKNYKKEYLPPEKYVNLLYKR 188 Query: 260 WKPWRSYALLHIWYTEGWQPDE 281 + P+ S A ++W + +P + Sbjct: 189 FSPYCSVATWYLWRSIDPEPIQ 210 >UniRef50_D0LW65 DNA-3-methyladenine glycosylase II n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LW65_HALO1 Length = 220 Score = 146 bits (368), Expect = 9e-34, Method: Composition-based stats. Identities = 48/194 (24%), Positives = 77/194 (39%), Gaps = 17/194 (8%) Query: 93 GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY 152 + L A L ++F RAI+ Q ++ AA + AR L+ Sbjct: 30 AHMPALIAVHGPPDLARTRNSFASLGRAIVYQQLATRAAAAIYARFLALFPRG------- 82 Query: 153 ICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTLQ 210 FPTP L A L++ G+ +A AL LA +G++ D ++ TL Sbjct: 83 -RFPTPAALLAVSEDTLRSAGLSRAKATALRDLAAKFADGSVRSRQFSRMDADELRATLT 141 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM-------TPAQIRRYAERWKPW 263 GIG W+ + F + G DV D +++ PA+++ A W P+ Sbjct: 142 QVRGIGPWSVDMFLIFGLMRPDVLPVGDLGVRKGMQRYFELEELPKPAEMQELAAPWAPF 201 Query: 264 RSYALLHIWYTEGW 277 RS A ++W Sbjct: 202 RSVASWYMWRVAEN 215 >UniRef50_C6A294 AlkA 3-methyladenine DNA glycosylase n=9 Tax=Thermococcaceae RepID=C6A294_THESM Length = 279 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 47/188 (25%), Positives = 77/188 (40%), Gaps = 14/188 (7%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + L GL +P D ++ V I Q VS A + + +L G++L++ Sbjct: 80 DSKFAFLIKEFYGLTIPKAPDKYQALVETIAQQQVSFEFAMQTIRNLVKLAGKKLEN--- 136 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT-IPGDVEQAMKTLQ 210 FPTPQ + + + + RA + HL LEG L + D ++A+K L Sbjct: 137 LYIFPTPQSILNLSEEKFREAKL-GYRAGYIRHLTKEYLEGNLNLDLEELDEKEAIKYLT 195 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF--------PGMTPAQIRRYAERWKP 262 F GIGRW+A F G +V+ D +K+ + +R E + Sbjct: 196 KFKGIGRWSAELFLAYGLGK-NVYPAGDLGMKRGIAKIFGKNPKEVKEKDVREIIEPYGK 254 Query: 263 WRSYALLH 270 W+S + Sbjct: 255 WKSLLAFY 262 >UniRef50_A8QA43 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8QA43_MALGO Length = 339 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 66/175 (37%), Gaps = 16/175 (9%) Query: 91 VNGALGRLGAA-----RPGLRL--PGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 ++ L A LR ++ F +ILGQ +S A + + +L+ Sbjct: 82 IDSRFASLFAQLDLKVYDELRSGKVKELNLFRVLTTSILGQQISWLAARSIMYKFCRLFA 141 Query: 144 ERLDDFP-------EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM 196 L P + + FPTP ++ A L+ G+ + + + +A +G L + Sbjct: 142 PDLPLQPNLDAVNKDELPFPTPLQVLKATDDELRRAGLSTAKIKYVRDVARRFSDGRLDV 201 Query: 197 TI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT 249 + E + L G+GRWTA + ++ D+ D +++ Sbjct: 202 RKIIHMNPEACITELTQVKGVGRWTAEMLLMFALRSPDILPVGDLGVQRGIVKFY 256 >UniRef50_B4S806 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Prosthecochloris aestuarii DSM 271 RepID=B4S806_PROA2 Length = 312 Score = 145 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 78/216 (36%), Gaps = 30/216 (13%) Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERL- 146 + V + +L G+R+ ++AFE + + Q + + + K + +GER Sbjct: 92 FRRVYPVVSQLAEPYMGVRVLR-LNAFETLITFMCAQAIGMNLIRKQIRTICNRFGERHM 150 Query: 147 ----DDFPEYICFPTPQRLAAADPQALK-ALGMPLKRAEALIHLANAALEGTLPMTIPGD 201 + FP+P+ LAAA PQ L+ +RA +I A A EG L M + Sbjct: 151 TEIDGNPLIQYSFPSPETLAAASPQDLRICTNNNCERASNIISAARAVAEGRLCMDELIN 210 Query: 202 VE----QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM--------- 248 E +L + GIG A+ L G D F D ++Q Sbjct: 211 NELSLGSIRNSLTAYRGIGLKIADCVMLFGLHRHDAFPI-DTHVRQYLGKWFGLEKTQKA 269 Query: 249 -TPAQIRRYAERWKP--------WRSYALLHIWYTE 275 TP + + + L H W E Sbjct: 270 LTPKTYIELQHQASEILNPENAGYAGHILFHCWRNE 305 >UniRef50_C1DYL3 Predicted protein n=2 Tax=Micromonas RepID=C1DYL3_9CHLO Length = 291 Score = 145 bits (367), Expect = 1e-33, Method: Composition-based stats. Identities = 53/253 (20%), Positives = 89/253 (35%), Gaps = 33/253 (13%) Query: 54 IARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGL-RLPGCVD 112 T+ + + + + D + LG L A L R+ C + Sbjct: 17 PGPRTVRVEAAVPDVDGTFGDVV----VADALRELSTRDPKLGELIARCGELPRIFACQE 72 Query: 113 A----------FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLA 162 A F RAI+ Q ++ AA + RV + G + + TP + Sbjct: 73 ARRAKHEPNRAFRSLARAIVFQQLNGTAAATIFGRVLRCVG-----AQDDVLALTPDAII 127 Query: 163 AADPQALKALGMPLKRAEALIHLANAALEGTLPM------TIPGDVEQAMKTLQTFPGIG 216 AD A++A G+ ++ E L+ LA A D M L GIG Sbjct: 128 DADEAAMRACGLSQRKHEYLVALARAFHPAHSDFPLSDESLEAMDDTAVMSALVALRGIG 187 Query: 217 RWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALL 269 W+ + F + DV D+ +++ + A++ AERWKP R+ A + Sbjct: 188 PWSVHMFQMFYLNRPDVLPTKDFGVRKGVMRLYGLRDMPSEAKVEEIAERWKPHRTLASM 247 Query: 270 HIWYTEGWQPDEA 282 ++W Sbjct: 248 YMWQAADEGKSSG 260 >UniRef50_Q9ZET9 DNA-3-methyladenine glycosidase (Fragment) n=1 Tax=Mycobacterium avium subsp. paratuberculosis RepID=Q9ZET9_MYCPA Length = 185 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 49/166 (29%), Positives = 70/166 (42%), Gaps = 9/166 (5%) Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEY--ICFPTPQRLAAADPQALKALGMPL 176 RA+LGQ VS+ A R+ YG + D FP+ Q+LA DP L +P Sbjct: 1 RAVLGQQVSIRAARTHAGRLVAAYGRAVHDPEGTLTHTFPSVQQLADVDPIHLA---VPK 57 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 R L L + ++ + D + A L PG+G WTA A+RG D F Sbjct: 58 ARQRTLAALVAGLADRSIVLDTGCDWQSARTQLLALPGVGPWTAEVIAMRGLGDPDAFPA 117 Query: 237 DDYLIKQRFPG----MTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 D ++ + + RW+PWRSYA ++W T Sbjct: 118 ADLGLRVAAKRLGLPSGQRSLTAASARWRPWRSYATQYLWTTLEHP 163 >UniRef50_Q46B02 8-oxoguanine DNA glycosylase n=4 Tax=Methanosarcinaceae RepID=Q46B02_METBF Length = 283 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 45/243 (18%), Positives = 85/243 (34%), Gaps = 23/243 (9%) Query: 44 YRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP 103 V+ D + + NL + +++ +++ R Sbjct: 38 GDQVIRLSQDQGQLIVDSNLQPEFLTRYFRLDDNLPSIYESINRDLLID----RAIRKYR 93 Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAA 163 GLRL D +E + +L S+ K +++ +G+ ++ P Y FP P+ LA Sbjct: 94 GLRLIRQ-DPWECLISYMLSTASSIPTIQKRIYLLSRSFGQEIE--PGYFSFPDPETLAN 150 Query: 164 ADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQ--AMKTLQTFPGIGRWTAN 221 DP L + R + +I A G L + + +E A + L GIG A+ Sbjct: 151 VDPAELDKCKL-GFRTQNIIAAAEEVASGELDLDVLFRLEYRYARERLMRLRGIGEKVAD 209 Query: 222 YFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ-----------IRRYA-ERWKPWRSYALL 269 L + + F D I+Q + + E + + YA Sbjct: 210 CVLLFAFDKMEAFPV-DTHIRQIIQHYHIDDSYFETCKNMSCMGDWGREYFGHYCGYAQE 268 Query: 270 HIW 272 +++ Sbjct: 269 YLY 271 >UniRef50_Q2FPI5 8-oxoguanine DNA glycosylase-like n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FPI5_METHJ Length = 299 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 40/206 (19%), Positives = 65/206 (31%), Gaps = 21/206 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + + GLR+ +E V I +V+ K + YG F Sbjct: 91 DRFFETRYSQSRGLRILRQH-PWECLVSFICSANSNVSTIGKRINLILGRYGTSCTLFEN 149 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAMKTL 209 FP P L+ L+ + RA LI A E + QA TL Sbjct: 150 --TFPDPAVLSQCQEPELREC-LTGYRAPYLIKTAQYIHEHPDFFHQVRKMEYHQAKDTL 206 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRR-------------- 255 PG+G A+ L ++ + D I++ + I Sbjct: 207 MNLPGVGPKVADCVLLFAFEHLNAVPV-DIRIRKIIEDKYASLIHTDKSGKLPYDTIGGF 265 Query: 256 YAERWKPWRSYALLHIWYTEGWQPDE 281 E + P+ YA +++ T + E Sbjct: 266 CREYFGPYAGYAQQYLFATRDLEKGE 291 >UniRef50_C4ZBV7 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Eubacterium rectale ATCC 33656 RepID=C4ZBV7_EUBR3 Length = 267 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 41/220 (18%), Positives = 74/220 (33%), Gaps = 19/220 (8%) Query: 66 GLEPVAAECLAKMSRLFDLQCNPQIV--------NGALGRLGAARPGLRLPGCVDAFEQG 117 L+ A+ D+ + + + L A G+R+ D +E Sbjct: 50 ELDCDEADWNNIWKSYLDMDTDYAGIAKLIADGDDAHLKEAYAYGSGVRILRQ-DLWEMI 108 Query: 118 VRAILGQLVSVAMAAKLTARVAQLYGERLD---DFPEYICFPTPQRLAAADPQALKALGM 174 V ++ Q ++ + + G ++D + E FP P + +++G Sbjct: 109 VTFMISQNNNIKRITNSVDLLCRRCGHKIDGSAEGGELYTFPKPLEVPDEVFDD-RSMGF 167 Query: 175 PLKRAEALIHLANAALEG--TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 RA L + L ++AM++L T GIG+ AN L G D Sbjct: 168 -GYRAPYLKEIYGYGANNPDWLDNLRKMSYDEAMESLLTRKGIGKKVANCICLFGLHHVD 226 Query: 233 VFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 F D +KQ ER+K +++ Sbjct: 227 AFPI-DTHVKQLLDKYYSDGFD--FERYKGVAGIIQQYLF 263 >UniRef50_Q756Y7 AER127Cp n=1 Tax=Eremothecium gossypii RepID=Q756Y7_ASHGO Length = 391 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 49/300 (16%), Positives = 98/300 (32%), Gaps = 57/300 (19%) Query: 35 YARSLAVGEYRGV-VTAIPDIARHTLHINLSAG-LEPVAAECLAKMSRLFDLQ------- 85 Y+ S+ + + G + + + ++ +++ + + + R ++ Sbjct: 36 YSASMLLNDKLGYRIIVLKQPDQCSIEFSVAGNKDDDCSGAARQWLMRYLRMEVNLEALL 95 Query: 86 CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER 145 Q + +G G+R+ + +E I ++ K+ + YG Sbjct: 96 AEWQKADTRF--IGKTHRGVRILRQ-EPWETLCSFICSSNNNIGRITKMCHALCSQYGSF 152 Query: 146 LD--DFPEYICFPTPQRLAA-ADPQALKALGMPLKRAEALIHLAN-------AALEGTLP 195 L D Y FPT ++L A AL+ LG RA+ ++ A A + T Sbjct: 153 LGELDGTPYYSFPTSKQLMEGASEDALRDLGF-GYRAKYIMAAAEWMDSSKPAHMSDTEH 211 Query: 196 MTIPGDV---EQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP-- 250 + D+ E+ + PG+G A+ L G Q D D I + Sbjct: 212 LESWLDMISREEIRQRFMEVPGVGPKVADCVCLMGMQMDDHVPV-DVHINRIAQRDYKFN 270 Query: 251 ------------------------AQIRRYAE----RWKPWRSYALLHIWYTEGWQPDEA 282 ++ E +W P+ +A ++ E + D A Sbjct: 271 ASAAKIAALKARYKDLPSVRKKLNMELELVREMFIQKWGPYAGWAQGVLFAQEINKSDTA 330 >UniRef50_Q4PHN3 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4PHN3_USTMA Length = 447 Score = 145 bits (365), Expect = 2e-33, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 66/188 (35%), Gaps = 21/188 (11%) Query: 79 SRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDA----------FEQGVRAILGQLVSV 128 L Q + V+ RL P +D F +ILGQ +S Sbjct: 118 FDLNRAQEHLCAVDRRFERLFQKVPVRCYEEALDPSNSETKNLNLFRTVTTSILGQQISW 177 Query: 129 AMAAKLTARVAQLY--------GERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 A + + +L+ + + E FPTP + L+A G+ + + Sbjct: 178 LAARSVLYKFCRLFSPDSMPEKPDFVGFPREQWPFPTPLMVLRTPDAELRAAGLSFAKIK 237 Query: 181 ALIHLANAALEGTLPM---TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 + LA ++G L + D E + L G+GRWT+ + + D+ Sbjct: 238 YVKDLAARFVDGRLDIRQILELDDEEACVAELSKVKGVGRWTSEMILMFAMRKPDILPCA 297 Query: 238 DYLIKQRF 245 D +++ Sbjct: 298 DLGVQKGM 305 Score = 42.5 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 8/35 (22%), Positives = 18/35 (51%) Query: 243 QRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGW 277 + ++P +++ A +W P+RS A + +W Sbjct: 413 KGGQYLSPDEMKALAAKWSPYRSVACMFMWALVDT 447 >UniRef50_P53397 DNA-(apurinic or apyrimidinic site) lyase n=8 Tax=Saccharomycetaceae RepID=OGG1_YEAST Length = 376 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 47/289 (16%), Positives = 89/289 (30%), Gaps = 58/289 (20%) Query: 33 SYYARSLAVG--EYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCN--- 87 + Y+ ++ +G E VV D L ++ G + + F L + Sbjct: 36 NQYSTTMKIGQQEKYSVVILRQDEENEILEF-VAVGDCGNQDALKTHLMKYFRLDVSLKH 94 Query: 88 -----PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLY 142 + A +L G+R+ + +E + I +++ ++ + + Sbjct: 95 LFDNVWIPSDKAFAKLSPQ--GIRIL-AQEPWETLISFICSSNNNISRITRMCNSLCSNF 151 Query: 143 GERLD--DFPEYICFPTPQRL-AAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIP 199 G + D Y FPT + L + A L+ LG RA+ +I A + I Sbjct: 152 GNLITTIDGVAYHSFPTSEELTSRATEAKLRELGF-GYRAKYIIETARKLVNDKAEANIT 210 Query: 200 GD------------VEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPG 247 D E + L ++ G+G A+ L G + D + + Sbjct: 211 SDTTYLQSICKDAQYEDVREHLMSYNGVGPKVADCVCLMGLHMDGIVPV-DVHVSRIAKR 269 Query: 248 MTP--------AQIRR-------------------YAERWKPWRSYALL 269 ++R +K W SYA Sbjct: 270 DYQISANKNHLKELRTKYNALPISRKKINLELDHIRLMLFKKWGSYAGW 318 >UniRef50_A8N5M3 Putative uncharacterized protein n=2 Tax=Agaricales RepID=A8N5M3_COPC7 Length = 822 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 31/136 (22%), Positives = 59/136 (43%), Gaps = 9/136 (6%) Query: 122 LGQLVSVAMAAKLTARVAQLYGERLDD-------FPEYICFPTPQRLAAADPQALKALGM 174 LGQ +S A +T + +LY + + FPTP++++ + L+ G+ Sbjct: 567 LGQQISWKAARSITHKFIRLYSPSIPEEVTDESRAAAMQVFPTPEQVSKTEVSLLRTAGL 626 Query: 175 PLKRAEALIHLANAALEGTL--PMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKD 232 ++A+ + LA +G L + E+ + L GIGRWT + FA+ + D Sbjct: 627 SERKAQYIQDLAARFADGRLSTDKLLNASDEELAEMLIEVKGIGRWTVDMFAIFSLRRPD 686 Query: 233 VFLPDDYLIKQRFPGM 248 + D +++ Sbjct: 687 ILPVGDLGVQRGLARW 702 >UniRef50_Q18H56 DNA N-glycosylase / DNA lyase n=12 Tax=Halobacteriaceae RepID=Q18H56_HALWD Length = 337 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 53/288 (18%), Positives = 91/288 (31%), Gaps = 31/288 (10%) Query: 8 PPYDWSWML----GFLAARAVSSVE---TVADSY-YARSLA-----VGEYRGVVTAIPDI 54 P+D L +L RA S + +V+ + ++A + + V+ Sbjct: 37 GPFDLQATLESGQSYLWNRADSGMYDSMSVSGGSNWYETVAPPIDGISDTPIVIRVRQRE 96 Query: 55 ARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAF 114 L + V L ++ + + + A GLRL F Sbjct: 97 D--QLEWEATGDPVEVLTHLL-RLDDDLTSIYDSFPDDELVTTAVGAFHGLRLVRDP-PF 152 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALG 173 + I + V+ ++ +A +G + FPTP +LAA AL+ L Sbjct: 153 STLISFICSAQMRVSRIHQMQLSMADAFGTTHTVGGESFAAFPTPSQLAAQSESALRDLS 212 Query: 174 MPLKRAEALIHLANAALEGTLP--MTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + RA + A G + E A + L F G+G A+ L Sbjct: 213 L-GYRAPYVQRTAEMVSTGEAHPAIAAERAYENAREYLTQFVGVGNKVADCVLLFSLDYT 271 Query: 232 DVFLPDDYLIKQRFPGMTPA--------QIRRYAERWKP-WRSYALLH 270 + D I+ PA R +R + YA + Sbjct: 272 EAVPL-DTWIQTAIAEHFPACDRDGYMQTSRAIRDRLGGQYAGYAQTY 318 >UniRef50_A1SKK8 Putative uncharacterized protein n=2 Tax=Actinomycetales RepID=A1SKK8_NOCSJ Length = 313 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 53/275 (19%), Positives = 82/275 (29%), Gaps = 20/275 (7%) Query: 6 WQP--PYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINL 63 W+P P +LG A + + R + + + +H Sbjct: 8 WRPEWPCPVGSVLGVHRRGAGDPTYRIEGDTHLRGIRTHDGPATLAVRTRPRDGVVHAEA 67 Query: 64 SAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA----ARPGLRLPGCVDAFEQGVR 119 L + L + + L L A RL E V Sbjct: 68 WGPGAD---RALDSLPALLGADDDVSGFDPDLHPLVAEGWRRYAHWRLGRTGLVMESLVP 124 Query: 120 AILGQLVSVAMAAKLTARVAQLYGERLDDF---PEYICFPTPQRLAAADPQALKALGMPL 176 AI+ Q V+ A + YGER + P P L L + Sbjct: 125 AIIEQKVTGQEAFAGFRMLVHRYGERAPGPGHDRKLWVQPDPATLRRIPSWEWLRLHVDP 184 Query: 177 KRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 R+ AL+ A + L T E+ + L + PGIG WTA R D Sbjct: 185 ARSRALVSAAR--VADALERTAGLPGEEVERRLTSLPGIGVWTAAEVRQRAHGDPDAVSF 242 Query: 237 DDYLIKQRFPG------MTPAQIRRYAERWKPWRS 265 DY + + + ++ + E W+P R Sbjct: 243 GDYHVAKDLGWAVAGRPFSDDEMAEFLEPWRPHRG 277 >UniRef50_B3ECK0 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chlorobium limicola DSM 245 RepID=B3ECK0_CHLL2 Length = 312 Score = 144 bits (363), Expect = 4e-33, Method: Composition-based stats. Identities = 49/239 (20%), Positives = 82/239 (34%), Gaps = 32/239 (13%) Query: 66 GLEPVAAECLAKMSRLFD----LQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAI 121 G PV+ A + D + A+ L GL+L D FE + + Sbjct: 66 GGIPVSEALSAYFTLDIDNGKLFDDHFIKRFPAIATLLQEYMGLKLLRQ-DPFETTITFM 124 Query: 122 LGQLVSVAMAAKLTARVAQLYGERLD-----DFPEYICFPTPQRLAAADPQALKAL-GMP 175 Q + +A+ + + + YG FP P+ LA +L+A Sbjct: 125 CAQGIGMALIRRQIGMLCEKYGTPCTIELMGQKHRIFRFPKPEMLAETSVLSLQACTNNN 184 Query: 176 LKRAEALIHLANAALEGTLPM----TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 +RA + +A AA EGTL + +++ L + GIG A+ AL Sbjct: 185 YRRALNIRRVAAAAAEGTLDFTISGSQSLSLDRIRAMLCEYDGIGPKIADCIALFSLGRF 244 Query: 232 DVFLPDDY---------LIKQRFPGMTPAQIRRYAERWK--------PWRSYALLHIWY 273 D F D + I++ +T R + + + + L H W Sbjct: 245 DAFPVDTHVRQYLAEWFGIRRASMSLTEKNYLRLQDEVRTILRPEVAGYAGHLLFHCWR 303 >UniRef50_D2LH30 HhH-GPD family protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LH30_RHOVA Length = 214 Score = 143 bits (362), Expect = 4e-33, Method: Composition-based stats. Identities = 50/210 (23%), Positives = 76/210 (36%), Gaps = 20/210 (9%) Query: 80 RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVA 139 L V+ + L A + + F+ V I GQ +SVA + R+ Sbjct: 9 DLAQHIAELVRVHPSFMPLKEAAGPVPVRWLDRGFKGLVFVITGQQISVAAGRAIFGRLE 68 Query: 140 QLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI- 198 G+ T + LAAAD L+ G + L L AAL L + Sbjct: 69 GALGD-----------ITAETLAAADDTILREAGYSRPKMRTLRALQEAALADGLDLVAI 117 Query: 199 -PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP-------GMTP 250 D E+A+ L GIG WTA + L D+F D +++ + Sbjct: 118 EAMDAERAIIKLSAIKGIGPWTAEVYLLFAAGHPDIFPAADVALQESMRLAFDLDARPST 177 Query: 251 AQIRRYAERWKPWRSYALLHIWYTEGWQPD 280 +R ++ W PWRS A +W + Sbjct: 178 QALREISDAWTPWRSAAARLLWAYYKVRKG 207 >UniRef50_B1C9X9 Putative uncharacterized protein n=1 Tax=Anaerofustis stercorihominis DSM 17244 RepID=B1C9X9_9FIRM Length = 289 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 73/211 (34%), Gaps = 19/211 (9%) Query: 78 MSRLFDLQCNPQIVN------GALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMA 131 + FD + ++ + + G+R+ D FE + I+ ++ Sbjct: 72 LVDYFDFNLDYNEIDKKISTDEHIKKCIEYGNGIRILKQ-DLFETIISFIISANNNIPRI 130 Query: 132 AKLTARVAQLYG-ERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAAL 190 K+ + + + YG E + Y FP+ + L + L M R + L+ + Sbjct: 131 KKIISDLCERYGKEIIYKGKSYYSFPSYEDLKDITQEEFHELKM-GFRDKYLVDAISKIN 189 Query: 191 EGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM 248 G + + + +A K L G+G ++ L + ++ D +K+ F Sbjct: 190 SGEIDLDTITNMETAEAKKELMKIKGVGEKVSDCILLFSLKRLELCP-MDVWVKRIFETR 248 Query: 249 TP------AQIRRYAE-RWKPWRSYALLHIW 272 A+ W + A +++ Sbjct: 249 YGLTDLTFENGFNLAKNNWGEYAGIAQQYLF 279 >UniRef50_B4SHB1 8-oxoguanine DNA glycosylase domain protein n=3 Tax=Chlorobium/Pelodictyon group RepID=B4SHB1_PELPB Length = 312 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 60/307 (19%), Positives = 105/307 (34%), Gaps = 53/307 (17%) Query: 7 QPPYDWSWML----GFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHIN 62 + P+D L FL + +Y + + ++ IA + L + Sbjct: 10 RAPFDIRQTLFSGQSFLWN-----INKDEPDFY--AAIIKSKPLIIK---QIADNELEVY 59 Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIV--------NGALGRLGAARPGLRLPGCVDAF 114 + + +S F + + + L +L +R+ D F Sbjct: 60 AEDKVINGVP-LVDFISHYFTFDIDTEQIFPDNFSHLYPTLWQLLTDYFPVRIMRQ-DPF 117 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-----DFPEYICFPTPQRLAAADPQAL 169 E + + Q + + + K + + Q YGE+ FP+P+RLAAA+P AL Sbjct: 118 ETMISFMCAQGIGMPLIRKQVSMLLQNYGEKRTISYSGKEITLHHFPSPERLAAANPIAL 177 Query: 170 KAL-GMPLKRAEALIHLANAALEGTLPMTIPGDV----EQAMKTLQTFPGIGRWTANYFA 224 RA ++ +A +G + + D + +TL G+G A+ A Sbjct: 178 STCTNNNHPRARNIVRIAKGVADGKIDLDALSDPLLPLSELRRTLCQNEGVGYKIADCIA 237 Query: 225 LRGWQAKDVFLPDDYLIKQRFPGM----------TPAQIRRY-AER-------WKPWRSY 266 L G D F D +KQ TPA+ AE + + + Sbjct: 238 LFGLGRFDAFPI-DTHVKQYLGQWFNSTTALQSLTPARYLALDAEARTILKPDFAGYAGH 296 Query: 267 ALLHIWY 273 L H W Sbjct: 297 LLFHCWR 303 >UniRef50_B6K1P6 DNA-3-methyladenine glycosylase n=1 Tax=Schizosaccharomyces japonicus yFS275 RepID=B6K1P6_SCHJY Length = 224 Score = 143 bits (362), Expect = 5e-33, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 81/206 (39%), Gaps = 20/206 (9%) Query: 80 RLFDLQCNPQIVNGALGRLGAARP--GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTAR 137 L + ++ L L+ + +E + A+ Q +S + + R Sbjct: 15 DLIKGEKQIASLDEQHQELVRCVGQCTLKPQTAREPYEGLIHALTYQRLSDSAGDAILGR 74 Query: 138 VAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT 197 + Q + ++ FP+ Q L + D + L++ G ++ E ++ LAN A +G+LP Sbjct: 75 LCQHFHKK--------SFPSVQELLSLDTEDLRSFGFSHRKGETILELANMAADGSLPSR 126 Query: 198 I---PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP---- 250 +++ + GIG WT +A+ +V D I++ + Sbjct: 127 EEISHMPLDKMIGIFTKVKGIGAWTVEKYAIFTLGRPNVMPTMDREIRENVQLLYHLDHT 186 Query: 251 ---AQIRRYAERWKPWRSYALLHIWY 273 ++ + + P+++ A ++W Sbjct: 187 PTDVEMEERSRAYVPYKTVASWYLWR 212 >UniRef50_B8GY42 DNA-3-methyladenine glycosylase II n=4 Tax=Caulobacteraceae RepID=B8GY42_CAUCN Length = 213 Score = 143 bits (361), Expect = 6e-33, Method: Composition-based stats. Identities = 51/199 (25%), Positives = 78/199 (39%), Gaps = 20/199 (10%) Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE 151 + AL + A P + F ++ I+ Q VS+A AA + ARV E Sbjct: 21 DPALATVEAVTPPFAWRVGLGGFPGLLKMIVQQQVSLASAAAIWARVEAGLPEM------ 74 Query: 152 YICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEG--TLPMTIPGDVEQAMKTL 209 TP+ +A D L+ LG+ +A +A A L G E+A+ L Sbjct: 75 -----TPEIVADHDEAYLRTLGLSQPKARYARAIAEAHLSGVCDFDALRALSDEEAIAAL 129 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIR-------RYAERWKP 262 G+GRWTA F + D+F D +++ + A+ R AE W+P Sbjct: 130 TAIKGVGRWTAEVFLMFTQGRLDLFPGGDVALQEAMRWVDRAETRPTEKQAYARAELWRP 189 Query: 263 WRSYALLHIWYTEGWQPDE 281 +R A +W G Sbjct: 190 YRGVAAHLLWACYGAVKRR 208 >UniRef50_Q8F6D8 DNA-3-methyladenine glycosylase n=2 Tax=Leptospira interrogans RepID=Q8F6D8_LEPIN Length = 213 Score = 143 bits (361), Expect = 6e-33, Method: Composition-based stats. Identities = 34/204 (16%), Positives = 65/204 (31%), Gaps = 20/204 (9%) Query: 81 LFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQ 140 + + + L + FE V IL Q VS+A A ++ Sbjct: 13 FYSICDQLSRKDRGLHSILLKHGYPPFWSRKPNFETLVHIILEQQVSLASARAALVKLKN 72 Query: 141 LYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TI 198 G T +++ L+ ++ + LA + + Sbjct: 73 KIGSV-----------TARKILLLSDIELRECYFSRQKTSYVRDLAEFVFSKRIILGDLA 121 Query: 199 PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP-------A 251 + L T GIG WT + F + D+F D + + Sbjct: 122 SKSDQMIRGDLITVKGIGNWTVDIFLIMALHRADIFPLGDLAAVKSLKKIKKLPVDTSND 181 Query: 252 QIRRYAERWKPWRSYALLHIWYTE 275 +I ++ W+P+RS A + +W++ Sbjct: 182 KILSVSKSWRPFRSIATMLLWHSY 205 >UniRef50_Q5CV50 DNA-3-methyladenine glycosidase (Fragment) n=2 Tax=Cryptosporidium RepID=Q5CV50_CRYPV Length = 217 Score = 143 bits (361), Expect = 7e-33, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 70/195 (35%), Gaps = 19/195 (9%) Query: 90 IVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDF 149 + + + ++ + D F+ I+GQ VS + + Sbjct: 19 LRDEKMKKIIEEIGYIDREYIDDLFQGLFYIIIGQQVSQKAQVSIWNK-----------A 67 Query: 150 PEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMK 207 + P+ ++ + ++ +G+ LK+A + +A + + + D E+ + Sbjct: 68 KTTLKSIDPETISNYSLEEIRKVGVSLKKATFIKGIAEKIINKEIDLNLLHEKDDEEVCE 127 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRF------PGMTPAQIRRYAERWK 261 L GIG W+A + K+VF D IK+ +T Y E + Sbjct: 128 ELTKLNGIGVWSAEMAMIFCMNRKNVFSFSDIAIKRALKMIYGHKEITKEIFEHYRELFS 187 Query: 262 PWRSYALLHIWYTEG 276 P+ S L++W Sbjct: 188 PYCSIVSLYLWEISN 202 >UniRef50_Q9XI06 F8K7.14 protein n=3 Tax=rosids RepID=Q9XI06_ARATH Length = 391 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 79/255 (30%), Gaps = 29/255 (11%) Query: 44 YRGVVTAIPDIARHTLHINLSAGLEPVAAEC-----LAKMSRLFDLQCNPQIVNGALGRL 98 +V+ + + P +AE L L +L + + G L Sbjct: 77 GPHLVSLRQRPGDDAVSYCVHCSTSPKSAELALLDFLNAEISLAELWSDFSKKDPRFGEL 136 Query: 99 GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFP 156 G R+ D E ++ + ++A K+ G L D E+ FP Sbjct: 137 ARHLRGARVLRQ-DPLECLIQFLCSSNNNIARITKMVD-FVSSLGLHLGDIDGFEFHQFP 194 Query: 157 TPQRLAAADPQALKALGMPLKRAEALIHLANAAL------EGTLPMTIPGDVEQAMKTLQ 210 + RL+ + + G RA+ + NA L ++++A+ L Sbjct: 195 SLDRLSRVSEEEFRKAGF-GYRAKYITGTVNALQAKPGGGNEWLLSLRKVELQEAVAALC 253 Query: 211 TFPGIGRWTANYFALRGWQAKDVFLPD-------------DYLIKQRFPGMTPAQIRRYA 257 T PG+G A AL D D + P + + Sbjct: 254 TLPGVGPKVAACIALFSLDQHSAIPVDTHVWQIATNYLLPDLAGAKLTPKLHGRVAEAFV 313 Query: 258 ERWKPWRSYALLHIW 272 ++ + +A ++ Sbjct: 314 SKYGEYAGWAQTLLF 328 >UniRef50_O28163 3-methyladenine DNA glycosylase (AlkA) n=1 Tax=Archaeoglobus fulgidus RepID=O28163_ARCFU Length = 295 Score = 143 bits (360), Expect = 8e-33, Method: Composition-based stats. Identities = 56/283 (19%), Positives = 110/283 (38%), Gaps = 32/283 (11%) Query: 11 DWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTL-----HINLSA 65 +W + F + + + V + R++ + V A P+ R + Sbjct: 11 NWELKMKFFVLPELPTPDVVESGVWRRAIVLDGRAVAVMAYPESERTIVVEGNFENREWE 70 Query: 66 GLEPVAAECL-----AKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPG--CVDAFEQGV 118 + E L ++ R D + L L G G + FE Sbjct: 71 AVRRKLVEYLGLQNPEELYRFMDG-------DEKLRMLKNRFYGFGRAGLMSMSVFEGIA 123 Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLK 177 +AI+ Q +S +A KL A++ +G+ ++ + ++ FPT + + A + L+ G+ + Sbjct: 124 KAIIQQQISFVVAEKLAAKIVGRFGDEVEWNGLKFYGFPTQEAILKAGVEGLRECGLSRR 183 Query: 178 RAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 +AE ++ +A L +A + L +F GIGRWTA +VF D Sbjct: 184 KAELIVEIAKEENLEELKEWGEE---EAYEYLTSFKGIGRWTAELVLSMALGK-NVFPAD 239 Query: 238 DYLIKQRFPGMT-------PAQIRRYA-ERWKPWRSYALLHIW 272 D +++ + ++R A ER+ + L +++ Sbjct: 240 DLGVRRAVSRLYFNGEIQSAEKVREIARERFGRFARDILFYLF 282 >UniRef50_C4DGP2 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase n=1 Tax=Stackebrandtia nassauensis DSM 44728 RepID=C4DGP2_9ACTO Length = 300 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 60/280 (21%), Positives = 107/280 (38%), Gaps = 22/280 (7%) Query: 8 PPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV-TAIPDIARHTLHINLSAG 66 P++ + FL A ++ E D + + + V A+ T+ + ++ Sbjct: 8 GPFNLATSTRFLEGFAPAAYEGAGDEVLRLAFPADDGKAVAGAALRQETDGTVRVEITGA 67 Query: 67 LEPVAAECLAKMSRLFDLQCN------PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRA 120 A A++ R+ L + + + L PGLR +E A Sbjct: 68 --ADAEAVGAQVRRIMSLDIDGTGYPAVVASDPIVKGLSEQYPGLRPVCFHSPYEAAAWA 125 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALGMPLKRA 179 ++G + + AA + A +A GE + FPTP+ LA G+ + Sbjct: 126 VIGHRIRITQAAGIKAAMAARLGETVTVAGRPVAAFPTPEVLAEVGE----FPGLTDVKI 181 Query: 180 EALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPD 237 L +A AAL G L D+ A++ LQ GIG ++A +RG DVF Sbjct: 182 ARLRGIAEAALAGELDAKRLRDMASADALEQLQGIAGIGPFSAELILIRGAGHPDVFPRT 241 Query: 238 DYLIKQRF------PGMTPAQIRRYAERWKPWRSYALLHI 271 + + + + A++ A W P+RS+ + + Sbjct: 242 ETRLHRTMTQLYRREEPSAAELADIAADWAPFRSWVGVLL 281 >UniRef50_C8V1L7 Mitochondrial glycosylase/lyase (Eurofung) n=25 Tax=Leotiomyceta RepID=C8V1L7_EMENI Length = 414 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 42/270 (15%), Positives = 81/270 (30%), Gaps = 42/270 (15%) Query: 19 LAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKM 78 + Y R +++ + + + + + LA + Sbjct: 31 FRWHKDPDNDEWRCVLYGRLISLKQDPSHLYYRTYVNSKP-SGSCNGSDSGSEDTTLAII 89 Query: 79 SRLFDLQ-------CNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMA 131 F+L + + + G+R+ DA+E + I ++A Sbjct: 90 KHYFNLNSNLTTLYAQWSSSDPNFRKKASQFTGIRILRQ-DAWEALISFICSSNNNIARI 148 Query: 132 AKLTARVAQLYGERLDD--FPEYICFPTPQRLAAAD--PQALKALGMPLKRAEALIHLAN 187 +++ ++ YG + D Y FP P+RLA + L++LG RA+ + A Sbjct: 149 SQMVEKLCANYGLHIADVDGRSYHDFPPPERLAEDEGVEARLRSLGF-GYRAKYIYQTAV 207 Query: 188 AAL---------------------------EGTLPMTIPGDVEQAMKTLQTFPGIGRWTA 220 EG +P +A + L G+G A Sbjct: 208 IIAKQKENGWLNSLRNPEAPAFGLEVVAGQEGEMPPEGRSGYREAHEKLLELQGVGPKVA 267 Query: 221 NYFALRGWQAKDVFLPDDYLIKQRFPGMTP 250 + AL G + D + Q Sbjct: 268 DCVALMGLGWGESVPV-DTHVWQIAQRDYK 296 >UniRef50_B3E6X3 HhH-GPD family protein n=2 Tax=Bacteria RepID=B3E6X3_GEOLS Length = 201 Score = 142 bits (359), Expect = 1e-32, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 71/186 (38%), Gaps = 18/186 (9%) Query: 97 RLGAARPGLRLPGCVDA-FEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICF 155 R+ + G+ + F R IL Q VS+ A ++ E Sbjct: 21 RIINDKYGIPPNWMREPGFISLSRIILEQQVSIESAKAHFEKINSYIPEF---------- 70 Query: 156 PTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPG--DVEQAMKTLQTFP 213 TP + Q ++ + ++A+ L L+ A L+ L + + + + + L Sbjct: 71 -TPNEIIKLSDQEMRDCQISRQKAKYLRSLSEAILKNELNLEVMDTFNDHEIREKLTKIN 129 Query: 214 GIGRWTANYFALRGWQAKDVFLPDDYLIKQR----FPGMTPAQIRRYAERWKPWRSYALL 269 GIG WT + + + Q KDVF D + T ++ +++W P RS A Sbjct: 130 GIGNWTVDIYLMFCLQRKDVFPSGDIAVINAAMELLEYETKDEVLNESKKWAPLRSLAAY 189 Query: 270 HIWYTE 275 +W+ Sbjct: 190 FLWHYY 195 >UniRef50_A2STI1 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Methanocorpusculum labreanum Z RepID=A2STI1_METLZ Length = 299 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 39/253 (15%), Positives = 82/253 (32%), Gaps = 23/253 (9%) Query: 38 SLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV------ 91 + +Y G A + I L ++ R F L + Sbjct: 41 AFRWRQYNGFWYA--PVGDKIWKIRQERDLLLYDGPTEKELIRYFGLDIPLDDILTDIDR 98 Query: 92 NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFP 150 + + GLR+ D +E + I ++ +++ YG +++ D Sbjct: 99 DPLIHAAIRRCLGLRIIRQ-DPWECLISYICATCANIPGIMMRIENLSERYGNKIEMDEM 157 Query: 151 EYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE--GTLPMTIPGDVEQAMKT 208 + FP +RLA + +++ + R + A A +A + Sbjct: 158 TFHTFPDAKRLAEEEMCSIRTCKV-GYRDAYICGAAEMAASDANWAEKIAGMPYPEAHEK 216 Query: 209 LQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYA---------ER 259 L G+G A+ L G++ + F D I++ R+ + E Sbjct: 217 LMRLNGVGPKVADCVLLFGFEKYEAFPV-DVWIERILRTKYLGGTRKLSYKRAGSFAREH 275 Query: 260 WKPWRSYALLHIW 272 + + YA +++ Sbjct: 276 FGQYAGYAQEYLF 288 >UniRef50_B9WJV0 Mitochondrial N-glycosylase/DNA lyase [includes: 8-oxoguanine DNA glycosylase; DNA-(Apurinic or apyrimidinic site) lyase], putative n=9 Tax=Fungi/Metazoa group RepID=B9WJV0_CANDC Length = 346 Score = 142 bits (358), Expect = 1e-32, Method: Composition-based stats. Identities = 34/203 (16%), Positives = 67/203 (33%), Gaps = 29/203 (14%) Query: 99 GAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD--FPEYICFP 156 + G+R+ +E + I +V +K+ + +GE +++ EY FP Sbjct: 109 FDSFAGIRILRQG-PWECLISFICSSNNNVKRISKMCENLCINFGEYINEFEGHEYYTFP 167 Query: 157 TPQRLAAADPQ-ALKALGMPLKRAEALIHLANAALEGT---------LPMTIPGDVEQAM 206 TP+ L+ D + L+ LG RA+ + A + L A Sbjct: 168 TPEALSQPDVEPKLRDLGF-GYRAKYIYQTACKFTDNNKYPDITIENLNSLREAKYTTAH 226 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGM---------------TPA 251 + L G+G A+ L DV D ++ + A Sbjct: 227 EFLLQLTGVGPKVADCICLMSLDKHDVVPIDTHVYQIAVRDYKFKGNKSMKTLSKETYAA 286 Query: 252 QIRRYAERWKPWRSYALLHIWYT 274 + + + + +A ++ Sbjct: 287 IRLFFKDIFGDYAGWAQSVLFAA 309 >UniRef50_O27397 DNA-(apurinic or apyrimidinic site) lyase n=1 Tax=Methanothermobacter thermautotrophicus str. Delta H RepID=OGG1_METTH Length = 312 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 42/273 (15%), Positives = 87/273 (31%), Gaps = 31/273 (11%) Query: 32 DSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFD-LQCNPQI 90 + + L + V + + + + + + FD + Sbjct: 28 EGAFRELLIIEGVPCPVEVRNEAGVLRVRPYVDVPQKTLREKIEYIFDLKFDIEDFYTFL 87 Query: 91 VNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DF 149 + L + GLRL D FE + +I SV + + +L+G+ + Sbjct: 88 EDKNLSYTLDSSRGLRLFLAKDPFECVISSIASANCSVVRWTRSIEDIRRLWGQANTFNG 147 Query: 150 PEYICFPTPQRL-------------------AAADPQALKALGMPLKRAEALIHLANAAL 190 + FP+P L + L++ G+ RA + + Sbjct: 148 ETFHTFPSPHVLTGVAEGSLEDLQRAEDNLPSDFSFNDLRSCGV-GYRAPYIRETSRILA 206 Query: 191 EG-TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT 249 E + D + A + L G+G A+ L G++ + F D I++ + Sbjct: 207 EEMDIRRIDGMDYDDARELLLELSGVGPKVADCILLYGFRKTEAFPV-DVWIRRIMNHIH 265 Query: 250 P------AQIRRYAER-WKPWRSYALLHIWYTE 275 P + +A R + Y L+++ Sbjct: 266 PGRNFNDRSMVEFARREYGEMADYVQLYLFNHA 298 >UniRef50_D1VDS6 HhH-GPD family protein n=3 Tax=Actinomycetales RepID=D1VDS6_9ACTO Length = 292 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 68/286 (23%), Positives = 112/286 (39%), Gaps = 22/286 (7%) Query: 2 YTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVV-TAIPDIARHTLH 60 T+ + P+ + FL +S + D + + AI T+H Sbjct: 4 ITITPRGPFSLAAGTRFLEGFTPASYDAAPDDVLRLAFPTDDGHTSAGAAIRQAPDGTVH 63 Query: 61 INLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGA------ARPGLRLPGCVDAF 114 +L+ G +P A A+++R+ L + + GLR + Sbjct: 64 ADLTGGADP--AAVRAQVARILSLDVDGAAFPDCVAADAVAAGLAARHLGLRPVCFPSPY 121 Query: 115 EQGVRAILGQLVSVAMAAKLTARVAQLYGERLD-DFPEYICFPTPQRLAAADPQALKALG 173 E I+G + + AA+L A +A+ +GE + FPTP L D G Sbjct: 122 EAACWTIIGHRIRLTQAARLKATIAREHGETITVAGQPTAAFPTPSTLRTVDD----LPG 177 Query: 174 MPLKRAEALIHLANAALEGTLPM--TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAK 231 + + L +A AAL+G L E+A++ LQ PGIG ++A +RG Sbjct: 178 LSELKMGRLRAVAQAALDGELDAATLRALPTEEALRHLQALPGIGPFSAELILIRGAGHP 237 Query: 232 DVFLPDDYLIKQRFPGMTP------AQIRRYAERWKPWRSYALLHI 271 DVF + + Q Q+ R A+RW P+RS+ L + Sbjct: 238 DVFPGHERRLHQAMAKAYHLDSPELGQLSRLAQRWAPFRSWVTLLL 283 >UniRef50_Q5SLG4 DNA-3-methyladenine glycosidase n=6 Tax=Bacteria RepID=Q5SLG4_THET8 Length = 185 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 42/194 (21%), Positives = 73/194 (37%), Gaps = 19/194 (9%) Query: 92 NGALGRLGAARPGLRLPG----CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + L F +++ Q +S AA+L R+ +L Sbjct: 4 HPVLAAFYRRHGPAPFAPNPFPQRPPFRVLAESVVAQQLSTRAAARLAERLFRL------ 57 Query: 148 DFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMK 207 PTP+ A L+ G+ +A AL LA A EG L + E ++ Sbjct: 58 ------VPPTPEAFLEAPLDLLRQAGLSRAKALALKDLAAKAEEGLLDGLDRLEDEAVVE 111 Query: 208 TLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMT---PAQIRRYAERWKPWR 264 L G+G WTA F + G DV+ D +++ + P + + E ++P+R Sbjct: 112 RLTRVRGVGLWTAEMFLMFGLGRPDVWPVRDLGLRRAAARLFGVAPEALPAFGEAFRPYR 171 Query: 265 SYALLHIWYTEGWQ 278 S+ ++W + Sbjct: 172 SHLAWYLWRSLSSP 185 >UniRef50_A4SEG0 8-oxoguanine DNA glycosylase domain protein n=1 Tax=Chlorobium phaeovibrioides DSM 265 RepID=A4SEG0_PROVI Length = 313 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 40/223 (17%), Positives = 66/223 (29%), Gaps = 30/223 (13%) Query: 84 LQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 L GLR+ D +E V + Q + +++ + + + +G Sbjct: 89 FDDEFAARYPDLAFRLLKLQGLRVLRQ-DPYETLVTFMCAQGIGMSIIRRQVNMLCRYFG 147 Query: 144 ERLDDFP-----EYICFPTPQRLAAADPQALKALGMPLK-RAEALIHLANAALEGTLPMT 197 + FP P LA ADP L+ RA + + G L + Sbjct: 148 NEVRVADGGRDTPLYSFPAPSVLADADPALLRRCCNNNSMRAGNIGEASRLLALGRLDLQ 207 Query: 198 IPGDVE----QAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLI------------ 241 D + L GIG A+ AL G D F D + Sbjct: 208 ALSDPSLPLSEIRTELTALKGIGFKIADCIALFGLGRFDAFPI-DTHVEQFLSSWFSIGA 266 Query: 242 ------KQRFPGMTPAQIRRYAERWKPWRSYALLHIWYTEGWQ 278 ++R+ + + E + + L H W T Sbjct: 267 HQKGLSQKRYLHLQEKAVELLGESLSGYAGHHLFHCWRTTEKN 309 >UniRef50_Q64A66 8-oxoguanine DNA glycosylase n=2 Tax=environmental samples RepID=Q64A66_9ARCH Length = 288 Score = 141 bits (357), Expect = 2e-32, Method: Composition-based stats. Identities = 46/266 (17%), Positives = 94/266 (35%), Gaps = 34/266 (12%) Query: 38 SLAVGEYRGVVTAIPDIARHTLH-----INLSAGLEPVAAECLAKMSRLFDLQCNPQIV- 91 G+ G I D + + ++ SA + V E + RL D N + Sbjct: 18 VFRWGKANGWWYGIVDGSILKVRQEGDELHFSAYPKDVEEEFIKSYFRLNDDLSNILRII 77 Query: 92 --NGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG-ERLDD 148 + + + GLR+ ++ + I S+A+ + +++ +G E + + Sbjct: 78 NKDVEINKAIRQFNGLRIVRQ-SVWDCMISYICATNASIAVIESMLRNLSEKFGDEIVVN 136 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGT--LPMTIPGDVEQAM 206 + FP ++LA A +K + RA+ L +A L D + Sbjct: 137 GKAFFSFPKVKKLAKASVNKIKLCKV-GYRAKFLSEIAKQVKNNPNLLEELRNSDYLELW 195 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRR----------- 255 L++ PG+G A+ +L + + F D I++ + A+I R Sbjct: 196 DELRSLPGVGPKVADCVSLFAFDKLEAFPI-DVWIRRVIYQIRGAEIPRTKDGTEKPLTV 254 Query: 256 ---------YAERWKPWRSYALLHIW 272 + + YA +++ Sbjct: 255 SEYTELSSFARRHYGMFAGYAQEYLF 280 >UniRef50_Q7CSU9 DNA-3-methyladenine glycosidase II n=1 Tax=Agrobacterium tumefaciens str. C58 RepID=Q7CSU9_AGRT5 Length = 215 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 44/210 (20%), Positives = 71/210 (33%), Gaps = 19/210 (9%) Query: 72 AECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMA 131 + + M + + + ++ AL + L L F ++ Q+VS A A Sbjct: 1 MKIITGMDDISEGLDHLARLDPALSPVIEKAGPLELRIHEPGFAGLAHIVVSQMVSRASA 60 Query: 132 AKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALE 191 + AR+ G ++ T + A + G+ +A L LA A E Sbjct: 61 NAIWARILAGTGGKV----------TAENYLAVSEELRATFGLSRAKATTLEGLARAVTE 110 Query: 192 GTLPMTIPGDVEQ--AMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFP--- 246 G + + E A L GIG WTA + + D+F D ++ Sbjct: 111 GQVDLDGVVRKEAGAAFSELVALRGIGPWTAEVYLMFCGGHPDIFPVGDVALRSAVAHAL 170 Query: 247 ----GMTPAQIRRYAERWKPWRSYALLHIW 272 + A W PWRS A W Sbjct: 171 DLEVRPDAKWLAERATLWSPWRSVAARLFW 200 >UniRef50_B6G8M1 Putative uncharacterized protein n=1 Tax=Collinsella stercoris DSM 13279 RepID=B6G8M1_9ACTN Length = 189 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 42/184 (22%), Positives = 73/184 (39%), Gaps = 16/184 (8%) Query: 100 AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQ 159 L AF +I+ Q++S+ + +R+ +L TP+ Sbjct: 2 KYIGDLEYSRPESAFHSLAHSIIEQMLSMKAGRAIESRLRELCDGDY----------TPE 51 Query: 160 RLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVEQAMKTLQTFPGIGRWT 219 +A + +K+ GM ++ ++L LA AL L E KTL PGIG+WT Sbjct: 52 CIAGIPAENIKSCGMSFRKVQSLKTLAEYALANDLESLAELPDEDVYKTLVQLPGIGKWT 111 Query: 220 ANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQI------RRYAERWKPWRSYALLHIWY 273 + F L D+ +D ++Q F + A I W+P+ S A+ +++ Sbjct: 112 CDMFLLFYLGRPDILPVEDGALRQAFEWLYGAPIVSKEVQAVVCSLWRPYSSTAVRYLYR 171 Query: 274 TEGW 277 Sbjct: 172 ALNT 175 >UniRef50_Q1AWP7 DNA-3-methyladenine glycosylase II n=1 Tax=Rubrobacter xylanophilus DSM 9941 RepID=Q1AWP7_RUBXD Length = 163 Score = 141 bits (355), Expect = 3e-32, Method: Composition-based stats. Identities = 48/170 (28%), Positives = 73/170 (42%), Gaps = 18/170 (10%) Query: 119 RAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKR 178 R ++GQ +SV A + AR+ +G R P P L A + L+A G+ + Sbjct: 2 RTVVGQQLSVGAARSIYARLCARFGGR---------PPLPGELEAVPDEELRACGVSGAK 52 Query: 179 AEALIHLANAALEGTLPMT--IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLP 236 A L LA LEG LP+ + + L GIGRW+A F + + DV Sbjct: 53 ARCLRELARRVLEGGLPLEELRGLPDGEVISALTAVRGIGRWSAQMFLIFHLRRPDVLPA 112 Query: 237 DDYLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWYTEGWQP 279 D I++ + + R A W+PWR+ A L++W + P Sbjct: 113 ADLGIRRAAALLYGLPELPAEELLERLAAPWRPWRTTACLYLWRSLDALP 162 >UniRef50_Q3ARU6 HhH-GPD n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3ARU6_CHLCH Length = 312 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 47/264 (17%), Positives = 89/264 (33%), Gaps = 39/264 (14%) Query: 48 VTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIV--------NGALGRLG 99 + I ++ +T+ ++ + + + A +S F L Q + + RL Sbjct: 45 LVIISQLSPYTIRVHCDSEVL-YGQKISAFISHYFTLDVPFQKIFSSSFKSNYSEVWRLL 103 Query: 100 AARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE-----YIC 154 + L FE + + Q + + + + R+ + YGE + E + Sbjct: 104 DGYKSIALLRQH-PFETLISFMCAQGIGMRLIRQQINRLCERYGEFYEAEMEGEMLCFSG 162 Query: 155 FPTPQRLAAADPQALKALGMPLK-RAEALIHLANAALEGTLPMTIP----GDVEQAMKTL 209 FP P++LA + + L + RA +I +A +EG L ++ E+ L Sbjct: 163 FPAPEQLACLNAEELSYCTNNNRERAANIIAVARKVVEGRLDLSSLSYPNMAFEEVQARL 222 Query: 210 QTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP------------------A 251 GIG A+ AL G + F D + Q Sbjct: 223 TQERGIGLKIADCVALFGLGYFEAFPI-DTHVHQFMAQWFKVPAASRSLTPATYRQLTLE 281 Query: 252 QIRRYAERWKPWRSYALLHIWYTE 275 + + ++ L H W E Sbjct: 282 AREILGSHYTGYAAHLLFHCWRCE 305 >UniRef50_D1VAP6 HhH-GPD family protein n=1 Tax=Frankia sp. EuI1c RepID=D1VAP6_9ACTO Length = 310 Score = 141 bits (355), Expect = 4e-32, Method: Composition-based stats. Identities = 61/281 (21%), Positives = 99/281 (35%), Gaps = 14/281 (4%) Query: 1 MYTLNWQPPYDWSWMLGFLAARAVSSVETVADSYYARSLAVGEYRGVVTAIPDIA--RHT 58 + T+ P D L ++ R++ G R + P R Sbjct: 8 VATVELAEPVDIVGSLAAYGRHGDDLIDRWDGRVLIRTVPHGAGRAALATRPAGPVERPR 67 Query: 59 LHINLSAGLEPVAAECLAKMSRLFDLQC--NPQIVNGALGRLGAARPGLRLPGCVDAFEQ 116 L++ GL+ LA+ L D + + + + A R L D Sbjct: 68 LYVTGPPGLDTATLAGLARAQFLRDQPALDDLTARDPVIAAIPAHRRRLAQLTQTDVLHV 127 Query: 117 GVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPL 176 R + Q V+ AA L +R+ L G + P LA DP L LG+ Sbjct: 128 LARCVTAQQVTGRFAATLRSRLVGLVGRPVTAGPHTAYALDADLLADTDPTRLTELGLSG 187 Query: 177 KRAEALIHLANAAL-EGTLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFL 235 ++A AL+ +A A +L D E + TL PGIGRW+A +F +R + Sbjct: 188 RKAMALLGVARAVTGSLSLSSLHELDDEGVIATLTALPGIGRWSAEWFLIRALGRP-LVA 246 Query: 236 PDDYLIKQRFPGMT--------PAQIRRYAERWKPWRSYAL 268 D +++ + ++R W P + A Sbjct: 247 AGDLAVRKAVGHLYRPGLPPPAEEEVRLLTAHWGPAAALAQ 287 >UniRef50_C3Y2J9 Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y2J9_BRAFL Length = 401 Score = 140 bits (354), Expect = 4e-32, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 67/217 (30%), Gaps = 25/217 (11%) Query: 80 RLFDLQCNPQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVA 139 +L DL + ++ G+R+ D E I ++ + + R+ Sbjct: 160 KLTDLYEQWCKDDPHFKQVSPNFTGIRMLRQ-DPVENLFSFICSSNNHISRISGMVERLC 218 Query: 140 QLYGERLDD--FPEYICFPTPQRLAAADPQA-LKALGMPLKRAEALIHLANAALE----G 192 + Y RL + Y FPT LA + L+ LG RA + A +E Sbjct: 219 EAYSARLCEVCGVTYYAFPTVSALAGRGVEERLRKLGF-GYRARYISETAQYIMEQGGES 277 Query: 193 TLPMTIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQ 252 L E+A L G+G A+ L D + Q + Sbjct: 278 WLYNLKTLPYEEAKAELIKLSGVGAKVADCVCLMSMDKTGAIPV-DTHVWQIVNRDYKHK 336 Query: 253 IRR---------------YAERWKPWRSYALLHIWYT 274 + + + W P+ +A ++ Sbjct: 337 LGTTKTLTDKTYKEIGDFFRQLWGPYAGWAHSVLFAA 373 >UniRef50_Q688W2 Os05g0567500 protein n=2 Tax=Oryza sativa RepID=Q688W2_ORYSJ Length = 290 Score = 140 bits (354), Expect = 4e-32, Method: Composition-based stats. Identities = 47/222 (21%), Positives = 80/222 (36%), Gaps = 21/222 (9%) Query: 63 LSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPGCVD--AFEQGVRA 120 LS E A +S L + + L + ++ AF + Sbjct: 55 LSPPPLSSPGELAAALSHL-------RTADPLLSEVISSTGAPAFISSPSRPAFHSLAHS 107 Query: 121 ILGQLVSVAMAAKLTARVAQLYGERLDDFPEYICFPTPQRLAAADPQALKALGMPLKRAE 180 IL Q ++ + AA + AR L D + P + A L+A+G+ ++A Sbjct: 108 ILHQQLAPSAAAAIYARFLALIPAAADPDAAVV---NPAAVLALSAADLRAIGVSARKAA 164 Query: 181 ALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDD 238 L LA G L + D + L G+G WT + F + DV D Sbjct: 165 YLHDLAGRFAAGELSESAVAAMDEAALLAELTKVKGVGEWTVHMFMIFSLHRPDVLPSGD 224 Query: 239 YLIKQRFPGMT-------PAQIRRYAERWKPWRSYALLHIWY 273 +++ + P ++ ERW+P+RS ++W Sbjct: 225 LGVRKGVQELYGLPALPKPEEMAALCERWRPYRSVGAWYMWR 266 >UniRef50_C0D4Q9 Putative uncharacterized protein n=2 Tax=Clostridium RepID=C0D4Q9_9CLOT Length = 293 Score = 140 bits (353), Expect = 5e-32, Method: Composition-based stats. Identities = 48/236 (20%), Positives = 90/236 (38%), Gaps = 21/236 (8%) Query: 50 AIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARPGLRLPG 109 + ++ + + + + + L AA G+R+ Sbjct: 41 LRVSQEGNRINFDCPQEDLSAWLTYFDCATDYAAILASVDPDDNYLKAAAAAGRGIRILR 100 Query: 110 CVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDDFPE----YICFPTPQRLAAAD 165 D +E + ++ Q ++ +L +++ +G++++D E FPTP+ LA A Sbjct: 101 Q-DPWEMIITFVISQQKTIPCIRRLVEDISRRWGQKIEDGDEKNFAVYSFPTPKELARAS 159 Query: 166 PQALKALGMPLKRAEALIHLANAALEGTLPMTIPGDVE--QAMKTLQTFPGIGRWTANYF 223 + L L + RA+ + L+ A G L + +E QAM+ L F GIG+ AN Sbjct: 160 LEELLDLKL-GYRAKYIHRLSQDAAAGILDLKNLETMEYGQAMEYLTGFYGIGKKVANCV 218 Query: 224 ALRGWQAKDVFLPDDYLIKQRFPGMT--PAQIRR----------YAERWKPWRSYA 267 L G + F D I++ + RR A+ + + YA Sbjct: 219 CLFGLHHIEAFPV-DTWIEKILLREYFSAKKYRRTPKNRLYDTILADHFGKYGGYA 273 >UniRef50_C5DFL6 KLTH0D16126p n=2 Tax=Saccharomycetaceae RepID=C5DFL6_LACTC Length = 386 Score = 139 bits (352), Expect = 8e-32, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 84/235 (35%), Gaps = 22/235 (9%) Query: 33 SYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDL-----QCN 87 Y+ ++ + + +V + + +L A L + RL + Sbjct: 35 GQYSTTMRIDDRFRIVVLRQLEDNYIEYASLGAEECSSLGSFLKRYFRLEVPLSELYENQ 94 Query: 88 PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLD 147 + + G+R+ D +E + I +++ K+ + +G + Sbjct: 95 WLPRDSRFEK--KRPHGIRILSQ-DPWETLLSYICSSNNNISRITKMCHALCIEFGNPVG 151 Query: 148 --DFPEYICFPTPQRLA-AADPQALKALGMPLKRAEALIHLANAALEGTLPMT------- 197 D +Y FPT + L A + L+ALG RA+ L+ A+ L+ L M+ Sbjct: 152 QYDKVDYYSFPTSKELVERASEEKLRALGF-GYRAKFLMKTADKMLKERLDMSDTQYLES 210 Query: 198 --IPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP 250 + EQ + + F G+G A+ L G + +V D I + Sbjct: 211 WKDHLEYEQVRERVMGFDGVGPKVADCVCLSGLEMDEVVPV-DVHIARIAQRDYR 264 >UniRef50_C5SF64 HhH-GPD family protein n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SF64_9CAUL Length = 214 Score = 139 bits (351), Expect = 9e-32, Method: Composition-based stats. Identities = 45/201 (22%), Positives = 68/201 (33%), Gaps = 20/201 (9%) Query: 89 QIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGERLDD 148 + AL L AA L D FE +R I+ Q +SV A + ++ E Sbjct: 18 AEADPALAPLFAAVGDLNFRHRADGFEGLLRLIVEQQLSVRAADAIWQKLRGGLSEM--- 74 Query: 149 FPEYICFPTPQRLAAADPQALKALGMPLKRAEALIHLANAALEGTLPMT--IPGDVEQAM 206 +P L + L+ G+ + LA A L + E A+ Sbjct: 75 --------SPAHLLTLSDETLRGHGLSRPKVRYARILAEAVHARALDFAHVRSLEAEDAI 126 Query: 207 KTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQI-------RRYAER 259 + L GIGRWTA + + D+F D +++ + R A Sbjct: 127 EHLTALKGIGRWTAEVYLMFCEGRLDIFPTGDIALREALGWLDGLDARPDEVYCRERALC 186 Query: 260 WKPWRSYALLHIWYTEGWQPD 280 W P RS +W G Sbjct: 187 WAPHRSVVSHALWGWYGAVKR 207 >UniRef50_UPI00015B5D00 PREDICTED: similar to ENSANGP00000022197 n=3 Tax=Nasonia vitripennis RepID=UPI00015B5D00 Length = 844 Score = 139 bits (351), Expect = 1e-31, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 78/268 (29%), Gaps = 32/268 (11%) Query: 27 VETVADSYYARSLAVGEYRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQC 86 VE + Y + T + + S L+ Sbjct: 542 VECSDGNSYRGVFNS----ALWTLSQSDNKLFYTRHGSKKGFQCDKALSNYFRLDVSLKE 597 Query: 87 N---PQIVNGALGRLGAARPGLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYG 143 N + +R+ D E I ++ + + ++ +L+G Sbjct: 598 NLKSWAGTDSHFKTTYDKIGAVRILNQ-DVVENLFSFICSSNNNITRISGMVDKLCRLFG 656 Query: 144 ERL--DDFPEYICFPTPQRLAAADPQA-LKALGMPLKRAEALIHLANAALEGTLPM---- 196 E + + +Y FP ++L++++ + L+ G RA ++ A L Sbjct: 657 EYICTVEGQDYYDFPKIEKLSSSELENILRKEGF-GYRAGYIVKSAKKLLSLGEDWLLGL 715 Query: 197 --TIPGDVEQAMKTLQTFPGIGRWTANYFALRGWQAKDVFLPDDYLIKQRFPGMTP---- 250 E A ++L + PGIG A+ L + D ++ + TP Sbjct: 716 KKENGATYEHARESLMSLPGIGPKVADCICLMSLGHLESIPVDTHIFQVACSNYTPHLSK 775 Query: 251 ----------AQIRRYAERWKPWRSYAL 268 E W P +A Sbjct: 776 QKTVTPKIHQEVSSHLRELWGPLAGWAQ 803 >UniRef50_C9L550 8-oxoguanine DNA glycosylase n=3 Tax=Clostridiales RepID=C9L550_RUMHA Length = 272 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 73/233 (31%), Gaps = 10/233 (4%) Query: 44 YRGVVTAIPDIARHTLHINLSAGLEPVAAECLAKMSRLFDLQCNPQIVNGALGRLGAARP 103 + + +L E V E + L Sbjct: 36 GARYLDVYQKDGESIFYCSLEEFTE-VWEEYFDLKRAYKLYIEKINPKDTYLVNAAKLGG 94 Query: 104 GLRLPGCVDAFEQGVRAILGQLVSVAMAAKLTARVAQLYGER--LDDFPEYICFPTPQRL 161 G+R+ D +E V ++ Q ++ + + + YGE Y FP + Sbjct: 95 GIRILKQ-DLWEMIVSFLISQQNNIVRIRRCIENICREYGEEKITASGEHYYAFPKAEAF 153 Query: 162 AAADPQALKALGMPLKRAEALIHLANAALEGTLPMTI--PGDVEQAMKTLQTFPGIGRWT 219 A + LKA + R++ ++ A A + G + + +A + L G+G Sbjct: 154 ACLEEDDLKACNL-GYRSKYVVRAAKAVVAGEIDLQAIEKMKYAKAKEELLKIFGVGVKV 212 Query: 220 ANYFALRGWQAKDVFLPDDYLIKQRFPGMTPAQIRRYAERWKPWRSYALLHIW 272 A+ L G + F D I Q + R+K + +I+ Sbjct: 213 ADCICLFGLHHLEAFPV-DTHINQALEKHYKRGFPKR--RYKGFEGVLQQYIF 262 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.314 0.154 0.449 Lambda K H 0.267 0.0471 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,874,915,957 Number of Sequences: 3077464 Number of extensions: 82738503 Number of successful extensions: 292753 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 1045 Number of HSP's successfully gapped in prelim test: 1243 Number of HSP's that attempted gapping in prelim test: 287463 Number of HSP's gapped (non-prelim): 2564 length of query: 282 length of database: 1,040,396,356 effective HSP length: 127 effective length of query: 155 effective length of database: 649,558,428 effective search space: 100681556340 effective search space used: 100681556340 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 93 (40.2 bits)