BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (168 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_B7MB04 G/U mismatch-specific DNA glycosylase n=70 Tax=E... 346 2e-94 UniRef50_B5REH0 G/U mismatch-specific DNA glycosylase n=51 Tax=E... 290 1e-77 UniRef50_Q46SQ2 G/U mismatch-specific uracil-DNA glycosylase n=4... 150 2e-35 UniRef50_C6B6I6 Uracil-DNA glycosylase superfamily n=2 Tax=Rhizo... 147 1e-34 UniRef50_A8LC62 Uracil-DNA glycosylase superfamily n=16 Tax=Bact... 147 2e-34 UniRef50_Q39CW6 G/U mismatch-specific uracil-DNA glycosylase n=1... 145 3e-34 UniRef50_D2B5F3 G:T/U mismatch-specific DNA glycosylase-like pro... 144 2e-33 UniRef50_Q47NL8 G/U mismatch-specific uracil-DNA glycosylase n=1... 143 3e-33 UniRef50_Q0RRS7 DNA glycosylase, G/U mismatch-specific n=2 Tax=F... 141 6e-33 UniRef50_Q1D9T2 G/U mismatch-specific DNA glycosylase n=2 Tax=Ba... 139 5e-32 UniRef50_B9J7K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=... 138 6e-32 UniRef50_C7PGY3 Uracil-DNA glycosylase superfamily n=1 Tax=Chiti... 138 9e-32 UniRef50_C6WCQ8 Uracil-DNA glycosylase superfamily n=1 Tax=Actin... 136 3e-31 UniRef50_Q9K3I5 G/U mismatch-specific DNA glycosylase n=3 Tax=St... 136 3e-31 UniRef50_B7GT25 Uracil-DNA glycosylase superfamily n=6 Tax=Actin... 117 1e-25 UniRef50_A6CPG7 G/U mismatch-specific DNA glycosylase n=2 Tax=Ba... 114 1e-24 UniRef50_C8XAI4 Uracil-DNA glycosylase superfamily n=3 Tax=Actin... 113 2e-24 UniRef50_B2GKB6 G/U mismatch-specific DNA glycosylase n=6 Tax=Ba... 112 5e-24 UniRef50_Q67QW8 G/U mismatch-specific DNA glycosylase n=1 Tax=Sy... 111 8e-24 UniRef50_UPI0001926888 PREDICTED: similar to GL18173 n=1 Tax=Hyd... 109 3e-23 UniRef50_C0ZBM4 Putative uncharacterized protein n=1 Tax=Breviba... 106 3e-22 UniRef50_C8NV44 G/U mismatch-specific DNA glycosylase n=2 Tax=Co... 105 5e-22 UniRef50_UPI000178957E Uracil-DNA glycosylase superfamily n=1 Ta... 105 5e-22 UniRef50_Q1J3D2 Uracil-DNA glycosylase superfamily n=1 Tax=Deino... 105 7e-22 UniRef50_B2W4Q2 G/U mismatch-specific DNA glycosylase n=2 Tax=Pl... 103 2e-21 UniRef50_A6SED0 Putative uncharacterized protein n=2 Tax=Sclerot... 100 2e-20 UniRef50_C9SR06 G/U mismatch-specific uracil DNA glycosylase n=4... 99 4e-20 UniRef50_B2A9Z4 Predicted CDS Pa_1_2290 n=2 Tax=Sordariales RepI... 98 1e-19 UniRef50_Q4T9X5 Chromosome undetermined SCAF7485, whole genome s... 96 3e-19 UniRef50_B3MY42 GF22816 n=1 Tax=Drosophila ananassae RepID=B3MY4... 94 2e-18 UniRef50_B8I864 Uracil-DNA glycosylase superfamily n=1 Tax=Clost... 94 2e-18 UniRef50_C4QAV8 G/t mismatch-specific thymine DNA glycosylase, p... 92 5e-18 UniRef50_A8P541 Putative uncharacterized protein n=1 Tax=Coprino... 92 5e-18 UniRef50_A7HGS2 Uracil-DNA glycosylase superfamily n=1 Tax=Anaer... 92 7e-18 UniRef50_B8N438 Mismatch-specific thymine-DNA glycosylase, putat... 92 7e-18 UniRef50_A6GLD3 G/U mismatch-specific DNA glycosylase n=1 Tax=Li... 92 7e-18 UniRef50_UPI0001926FCB PREDICTED: similar to predicted protein n... 92 8e-18 UniRef50_Q9V4D8 Thd1 n=1 Tax=Drosophila melanogaster RepID=Q9V4D... 92 9e-18 UniRef50_Q9XYZ3 G/T mismatch-specific thymine DNA glycosylase n=... 91 1e-17 UniRef50_B0E0R1 Predicted protein n=7 Tax=Laccaria bicolor S238N... 91 1e-17 UniRef50_B4NHL6 GK13687 n=2 Tax=Coelomata RepID=B4NHL6_DROWI 91 2e-17 UniRef50_D1CA47 Uracil-DNA glycosylase superfamily n=1 Tax=Sphae... 90 3e-17 UniRef50_Q5KFR6 Expressed protein n=2 Tax=Filobasidiella neoform... 90 4e-17 UniRef50_B4L7B8 GI14092 n=1 Tax=Drosophila mojavensis RepID=B4L7... 90 4e-17 UniRef50_UPI0000DB71A4 PREDICTED: similar to Thd1 CG1981-PA, par... 89 4e-17 UniRef50_A7S194 Predicted protein n=1 Tax=Nematostella vectensis... 89 4e-17 UniRef50_B4JZW0 GH23925 n=1 Tax=Drosophila grimshawi RepID=B4JZW... 89 5e-17 UniRef50_C7IMH6 Uracil-DNA glycosylase superfamily n=1 Tax=Clost... 89 5e-17 UniRef50_B2IIR8 Uracil-DNA glycosylase superfamily n=2 Tax=Bacte... 89 6e-17 UniRef50_B4MF03 GJ13949 n=2 Tax=Drosophila virilis RepID=B4MF03_... 89 7e-17 UniRef50_A2QSA7 Function: In the DNA of higher eukaryotes n=48 T... 88 9e-17 UniRef50_UPI0000D578C1 PREDICTED: similar to Thd1 n=1 Tax=Tribol... 87 2e-16 UniRef50_A8Q5A6 Putative uncharacterized protein n=1 Tax=Malasse... 87 3e-16 UniRef50_C8RZ13 Uracil-DNA glycosylase superfamily n=1 Tax=Rhodo... 87 3e-16 UniRef50_C5SKW3 Uracil-DNA glycosylase superfamily n=1 Tax=Astic... 86 3e-16 UniRef50_Q7SDB2 Predicted protein n=2 Tax=Sordariaceae RepID=Q7S... 86 6e-16 UniRef50_Q13569 G/T mismatch-specific thymine DNA glycosylase n=... 84 1e-15 UniRef50_UPI000186DD28 G/t mismatch-specific thymine DNA glycosy... 83 3e-15 UniRef50_UPI0001792C18 PREDICTED: similar to GA15170-PA n=3 Tax=... 83 3e-15 UniRef50_UPI0000DB6DF5 PREDICTED: similar to Thd1 CG1981-PA n=2 ... 82 5e-15 UniRef50_Q6C2V5 YALI0F04796p n=1 Tax=Yarrowia lipolytica RepID=Q... 81 1e-14 UniRef50_UPI0000E46F0A PREDICTED: similar to G/T mismatch-specif... 81 1e-14 UniRef50_C6H6H6 Uracil DNA glycosylase superfamily protein n=1 T... 80 2e-14 UniRef50_B3QI54 Uracil-DNA glycosylase superfamily n=3 Tax=Rhodo... 79 8e-14 UniRef50_A7HV32 Uracil-DNA glycosylase superfamily n=3 Tax=Rhizo... 77 3e-13 UniRef50_Q4P6C9 Putative uncharacterized protein n=1 Tax=Ustilag... 75 1e-12 UniRef50_O59825 G/U mismatch-specific uracil DNA glycosylase n=1... 69 4e-11 UniRef50_Q1IXM1 Uracil-DNA glycosylase superfamily n=3 Tax=Deino... 69 5e-11 UniRef50_A9VA78 Predicted protein n=1 Tax=Monosiga brevicollis R... 67 2e-10 UniRef50_UPI0000D8867B hypothetical protein CIMG_10236 n=1 Tax=C... 67 2e-10 UniRef50_C4Y7L8 Putative uncharacterized protein n=1 Tax=Clavisp... 61 1e-08 UniRef50_C7DIS9 Putative uncharacterized protein n=2 Tax=unclass... 60 3e-08 UniRef50_A5EPR0 G/U mismatch-specific uracil-DNA glycosylase n=1... 57 3e-07 UniRef50_Q1Q0C1 Similar to G/U-mismatch-specific DNA glycolase n... 55 1e-06 UniRef50_A3LQ36 Predicted protein n=1 Tax=Pichia stipitis RepID=... 52 6e-06 UniRef50_C5BH73 Putative uncharacterized protein n=1 Tax=Edwards... 51 2e-05 UniRef50_B7Q2I4 G/T mismatch-specific thymine DNA glycosylase, p... 50 2e-05 UniRef50_C5MI19 Predicted protein n=1 Tax=Candida tropicalis MYA... 49 8e-05 UniRef50_B9WEN4 G/U mismatch-specific uracil DNA glycosylase, pu... 49 9e-05 UniRef50_Q7VJA2 Putative uncharacterized protein n=1 Tax=Helicob... 44 0.003 UniRef50_D0N7I8 Putative uncharacterized protein n=1 Tax=Phytoph... 43 0.004 UniRef50_Q4T9X4 Chromosome undetermined SCAF7485, whole genome s... 42 0.010 UniRef50_C1EDR1 Predicted protein n=2 Tax=Micromonas RepID=C1EDR... 41 0.018 UniRef50_UPI00017F04BF PREDICTED: similar to thymine-DNA glycosy... 40 0.031 >UniRef50_B7MB04 G/U mismatch-specific DNA glycosylase n=70 Tax=Enterobacteriaceae RepID=MUG_ECO45 Length = 168 Score = 346 bits (888), Expect = 2e-94, Method: Compositional matrix adjust. Identities = 168/168 (100%), Positives = 168/168 (100%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL Sbjct: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG Sbjct: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR 168 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR Sbjct: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR 168 >UniRef50_B5REH0 G/U mismatch-specific DNA glycosylase n=51 Tax=Enterobacteriaceae RepID=MUG_SALG2 Length = 168 Score = 290 bits (742), Expect = 1e-77, Method: Compositional matrix adjust. Identities = 134/167 (80%), Positives = 150/167 (89%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MV+DILAPGLRVVFCGINPGLSSA TGFPFAHPANRFWKVI+ AGFTDRQLKP+EA+ LL Sbjct: 1 MVKDILAPGLRVVFCGINPGLSSANTGFPFAHPANRFWKVIHLAGFTDRQLKPEEAEKLL 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 D+RCGVTKLVDRPTVQA EV EL +GGR LIEKIEDYQP ALA+LGKQA+EQGFSQRG Sbjct: 61 DFRCGVTKLVDRPTVQATEVKLHELRSGGRNLIEKIEDYQPAALAVLGKQAFEQGFSQRG 120 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 WGK+ + IG+T +WVLPNPSGL+R+ EKLVEAYRELDQAL++RG Sbjct: 121 IAWGKKKIAIGATMVWVLPNPSGLNRIKTEKLVEAYRELDQALIMRG 167 >UniRef50_Q46SQ2 G/U mismatch-specific uracil-DNA glycosylase n=4 Tax=Proteobacteria RepID=Q46SQ2_RALEJ Length = 197 Score = 150 bits (379), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 76/159 (47%), Positives = 98/159 (61%), Gaps = 1/159 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DI+ PGL VFCG+NPGL +A +G FA NRFW+VI+ AG TDR+L+P + + LL Sbjct: 16 LPDIMKPGLSAVFCGLNPGLRAAASGHHFAGHGNRFWRVIHLAGITDRELEPVDDRQLLS 75 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T V RPT A +S E +LI K+E + P +A LGK AY SQR Sbjct: 76 YGYGLTTAVSRPTASAAALSHNEYVDASVRLIIKLERFHPACVAFLGKAAYAAMTSQRVV 135 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYREL 159 WG Q +G ++WVLPNPSG +R SL LV+AY E Sbjct: 136 PWGPQRRRMGGARVWVLPNPSGRNRAFSLAGLVDAYGEF 174 >UniRef50_C6B6I6 Uracil-DNA glycosylase superfamily n=2 Tax=Rhizobium leguminosarum RepID=C6B6I6_RHILS Length = 163 Score = 147 bits (371), Expect = 1e-34, Method: Compositional matrix adjust. Identities = 73/163 (44%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + +ILAPGL VVFCG+NP LS+ G F++P+NRFW+V++ AGFT R L+ E + LL Sbjct: 1 MSEILAPGLSVVFCGLNPALSAVRDGHNFSNPSNRFWRVLHLAGFTPRLLRADEERELLQ 60 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y CG+T V RPT A+E+ KQ+ + L KI P LA LGK AY + Sbjct: 61 YGCGLTSAVSRPTKSASELKKQDYRSAAPVLENKIRKIAPANLAFLGKAAYAAISLRADV 120 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYRELDQAL 163 +WG+Q +W+LPNPSGL+R +L L E Y +L L Sbjct: 121 EWGRQPEVFAGADVWLLPNPSGLNRAFTLTTLTEHYLDLRLTL 163 >UniRef50_A8LC62 Uracil-DNA glycosylase superfamily n=16 Tax=Bacteria RepID=A8LC62_FRASN Length = 297 Score = 147 bits (370), Expect = 2e-34, Method: Compositional matrix adjust. Identities = 75/162 (46%), Positives = 100/162 (61%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D++ P RV+ CGINP L S TGF F P+NR W V++ AGFT R+L P E +HL Sbjct: 109 VPDLVGPETRVLLCGINPSLESGATGFHFGTPSNRLWPVLHFAGFTGRRLHPSETEHLRA 168 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T LV R T +A+E++ E+ AG LIE +E +P+ +A LG AY GF +R A Sbjct: 169 RGIGITNLVHRSTARADEIADDEIRAGVPVLIELVERIRPEWVAFLGLAAYRIGFGRRTA 228 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 + G+Q IG +W+LPNPSGL + L LV Y EL +A Sbjct: 229 KVGRQPERIGPAGVWLLPNPSGLNAHYQLPDLVRVYGELREA 270 >UniRef50_Q39CW6 G/U mismatch-specific uracil-DNA glycosylase n=10 Tax=Bacteria RepID=Q39CW6_BURS3 Length = 210 Score = 145 bits (367), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 73/159 (45%), Positives = 104/159 (65%), Gaps = 1/159 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP L V+FCGINPGL++A TG FA +NRFW+VI+ AGFT ++ PQ+ +L Sbjct: 33 LPDLIAPQLDVLFCGINPGLTAAATGHHFAGRSNRFWRVIHLAGFTPAEISPQDDHAILQ 92 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y CG+T +V RPT A+++S+ E A + K+ Y P+ +A LGK AY QR Sbjct: 93 YGCGLTAVVKRPTASADQLSRAEFVAAAAEFEHKVARYGPRFVAFLGKAAYSALSGQREI 152 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYREL 159 +WG Q + + +WVLPNPSG + SL+ LV+AYR+L Sbjct: 153 EWGLQPARMQGSSVWVLPNPSGRNLAFSLDDLVDAYRQL 191 >UniRef50_D2B5F3 G:T/U mismatch-specific DNA glycosylase-like protein n=9 Tax=Actinomycetales RepID=D2B5F3_STRRD Length = 208 Score = 144 bits (362), Expect = 2e-33, Method: Compositional matrix adjust. Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++D+L P L V+FCGINPGL SA TG FA P NRFW ++++GFT R L P E L Sbjct: 45 LKDVLGPSLDVLFCGINPGLYSAATGHHFARPGNRFWPALHRSGFTPRLLDPAEQDLLPS 104 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T +V+R + QA E++ +E GG +L + +P+ LA+ G AY F Q A Sbjct: 105 FGLGITNVVERASAQAAELTPEEFREGGARLARLVAGTRPRVLAVAGVTAYRTAFGQPKA 164 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q IG T +WVLPNPSGL + +LE + R L ++L Sbjct: 165 AVGPQATVIGDTAVWVLPNPSGLNAHWTLEGIAGEMRRLRESL 207 >UniRef50_Q47NL8 G/U mismatch-specific uracil-DNA glycosylase n=11 Tax=Bacteria RepID=Q47NL8_THEFY Length = 212 Score = 143 bits (360), Expect = 3e-33, Method: Compositional matrix adjust. Identities = 71/146 (48%), Positives = 94/146 (64%), Gaps = 2/146 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D++ PGL ++FCGINPGL SA +G FA P NRFW ++ A T R L+P E LLD Sbjct: 21 VPDVIGPGLTILFCGINPGLYSAWSGHHFARPGNRFWPALHAASLTPRLLRPAEQHLLLD 80 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 GVT LVDR T +A+E++ EL AGG+ L K+ Y+P+ LA+LG AY + F++ A Sbjct: 81 LGLGVTNLVDRATARADELTAAELQAGGQALAAKLRAYRPRLLAVLGVTAYRKAFARPRA 140 Query: 122 QWGKQTL--TIGSTQIWVLPNPSGLS 145 + G Q I +WVLPNPSGL+ Sbjct: 141 RIGPQPPEDAIEGVPVWVLPNPSGLN 166 >UniRef50_Q0RRS7 DNA glycosylase, G/U mismatch-specific n=2 Tax=Frankia RepID=Q0RRS7_FRAAA Length = 196 Score = 141 bits (356), Expect = 6e-33, Method: Compositional matrix adjust. Identities = 73/162 (45%), Positives = 95/162 (58%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D+LAP V+ CGINP L S TGF F P+NR W V++ AGFT R+L P E L Sbjct: 30 VPDLLAPRTTVLLCGINPSLESGATGFHFGTPSNRLWPVLHLAGFTPRRLHPSETAELTS 89 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 +T LV+R T +A+E+ E+ AG +L +E P +A LG AY GF +R A Sbjct: 90 RGISITNLVNRSTARADEIGDDEVRAGVDRLTGLVERISPHWVAFLGLAAYRVGFGRRRA 149 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 Q G+Q T+G +W+LPNPSGL + L LV Y EL +A Sbjct: 150 QVGRQPETLGQAGVWLLPNPSGLNAHYQLPDLVRVYGELREA 191 >UniRef50_Q1D9T2 G/U mismatch-specific DNA glycosylase n=2 Tax=Bacteria RepID=Q1D9T2_MYXXD Length = 166 Score = 139 bits (349), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP LRV+FCGINP L SA G+ FA P NRFW ++ AG T +P E + LL Sbjct: 1 MPDVIAPRLRVLFCGINPSLYSAVVGYHFARPGNRFWPTLHAAGITPHTFQPTEQEALLA 60 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T +VDR T A+ ++ EL G + L K+ YQP+ LA+LG AY F + A Sbjct: 61 LGYGITNVVDRATATADLLNPAELARGAKALEAKVARYQPRFLAVLGVSAYRIAFGRPKA 120 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q +G T++WVLPNPSGL + L+ L + EL +A+ Sbjct: 121 ALGTQPERLGDTRLWVLPNPSGLNAHYQLDALGRLFAELRRAV 163 >UniRef50_B9J7K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=Agrobacterium radiobacter K84 RepID=B9J7K0_AGRRK Length = 198 Score = 138 bits (348), Expect = 6e-32, Method: Compositional matrix adjust. Identities = 69/163 (42%), Positives = 100/163 (61%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + +ILAP L VVFCG+NP L++ G F++ +NRFW+V++ AGFT R L+ +E + +L Sbjct: 28 LAEILAPRLSVVFCGLNPALTAHRDGHNFSNRSNRFWRVLHLAGFTPRLLRAEEEREMLR 87 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T V RPT A+E++K++ L +KI P+ LA LGK AY Q Sbjct: 88 YGLGLTSAVSRPTKSASELTKRDYITAAPALEDKITKLAPRNLAFLGKAAYAAISLQAHV 147 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYRELDQAL 163 +WG+Q +WVLPNPSGL+R L +L E YR++ A+ Sbjct: 148 EWGRQPDKFADASVWVLPNPSGLNRAFDLTRLTEHYRKVKLAV 190 >UniRef50_C7PGY3 Uracil-DNA glycosylase superfamily n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PGY3_CHIPD Length = 171 Score = 138 bits (347), Expect = 9e-32, Method: Compositional matrix adjust. Identities = 69/163 (42%), Positives = 99/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++DI+ L V+FCGINPGL SA G F+ +NRFWKV++QAGFT ++ P +LD Sbjct: 5 LQDIITENLTVLFCGINPGLKSAVQGHHFSGRSNRFWKVLHQAGFTPHEIDPTLDSTILD 64 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T V+R TV+A+E+SK E KIE Y+P LA LGK AY ++ Sbjct: 65 FGYGLTTAVERATVRADELSKAEFDQSFETFKRKIEQYKPTYLAFLGKAAYMAFSEKKNV 124 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 QWG Q + + +WVLPN SGL+R +L L++ ++ L + Sbjct: 125 QWGLQEEPLCNAIVWVLPNTSGLNRGFTLSMLIDHFQALHNHI 167 >UniRef50_C6WCQ8 Uracil-DNA glycosylase superfamily n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WCQ8_ACTMD Length = 200 Score = 136 bits (343), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 71/163 (43%), Positives = 97/163 (59%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+LAPGL + CGINPG SA TG +A P NRFW+V++ AGFT R L P E LL Sbjct: 21 LPDLLAPGLSALLCGINPGGRSAVTGHHYAGPGNRFWRVLHLAGFTPRLLAPSEQALLLG 80 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T LV RPT +A E+ EL AG +L +E ++P A+ ++G AY F + A Sbjct: 81 FGIGLTNLVARPTARAAELGSAELVAGAGRLAGLVERHRPGAVGVVGVSAYRVAFRRPRA 140 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 G Q + + +WVLPNPSGL+ L+ LV +R+ L Sbjct: 141 VVGPQDVELAGAPLWVLPNPSGLNAGYRLDDLVAIFRDFRSEL 183 >UniRef50_Q9K3I5 G/U mismatch-specific DNA glycosylase n=3 Tax=Streptomyces RepID=Q9K3I5_STRCO Length = 160 Score = 136 bits (342), Expect = 3e-31, Method: Compositional matrix adjust. Identities = 67/152 (44%), Positives = 95/152 (62%), Gaps = 1/152 (0%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 ++FCGINPGL +A TG FA P NRFW V++ +GFT R LKP E L Y G+T +V Sbjct: 1 MLFCGINPGLMTAATGHHFARPGNRFWPVLHLSGFTPRLLKPSEQDELPSYGLGITNVVA 60 Query: 72 RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIG 131 R + +A+E++ +E GGR L K+ +P LA++G AY F + A+ G Q T G Sbjct: 61 RASARADELTAEEYREGGRLLARKVARLRPGWLAVVGVTAYRAAFDEPKARVGPQERTFG 120 Query: 132 STQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 ST++WVLPNPSGL + + + + + Y L +A Sbjct: 121 STRVWVLPNPSGLNAHWTAQTMAQEYARLREA 152 >UniRef50_B7GT25 Uracil-DNA glycosylase superfamily n=6 Tax=Actinobacteridae RepID=B7GT25_BIFLI Length = 212 Score = 117 bits (294), Expect = 1e-25, Method: Compositional matrix adjust. Identities = 64/171 (37%), Positives = 97/171 (56%), Gaps = 13/171 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ------LKPQE 55 V+DI P +++ GINPGL +A PFA+P NRFW ++ +G T Q L+ + Sbjct: 30 VDDIRCPHPKLLIVGINPGLWTAAVNAPFAYPGNRFWPSLFASGLTPWQVDASYGLRLHD 89 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 + ++D G+T LV+R T +A+E++ EL G +L E P+A+AILG AY Q Sbjct: 90 ERMMIDRGIGLTNLVNRATAKASELNTAELRHGADRLRELCARVHPRAIAILGITAYRQA 149 Query: 116 FSQRGAQWGKQTLT------IGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 F+ + A G Q LT G Q+WV+P PSGL + ++ LV+ +R + Sbjct: 150 FADKSAALGLQDLTAEETDRFGGAQLWVIPQPSGLNAHATMPVLVDWWRRV 200 >UniRef50_A6CPG7 G/U mismatch-specific DNA glycosylase n=2 Tax=Bacillus RepID=A6CPG7_9BACI Length = 170 Score = 114 bits (285), Expect = 1e-24, Method: Compositional matrix adjust. Identities = 58/159 (36%), Positives = 93/159 (58%), Gaps = 1/159 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D L L +VF G NP L S G +A+ +NRFWK+++ +G T RQ P+E LL Sbjct: 6 DHLKENLDIVFIGFNPSLKSGEVGHNYANKSNRFWKILHLSGLTPRQYAPEEDYTLLQLG 65 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G+T +V+RPT +A +++K+E G L K++ Y P+ +GK Y+Q +R +W Sbjct: 66 YGLTNIVERPTKEAADITKEEYSEGRVILKNKLKKYSPKIACYVGKGVYQQFSKKRKIEW 125 Query: 124 GKQTLTIGSTQI-WVLPNPSGLSRVSLEKLVEAYRELDQ 161 G Q ++ I +V P+ SGL R+ ++++V Y+EL + Sbjct: 126 GIQEESVVEGVIDFVAPSSSGLVRMKIDEIVSIYKELTE 164 >UniRef50_C8XAI4 Uracil-DNA glycosylase superfamily n=3 Tax=Actinomycetales RepID=C8XAI4_NAKMY Length = 193 Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust. Identities = 60/164 (36%), Positives = 92/164 (56%), Gaps = 7/164 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------PQE 55 V+D++ PG+R++F GINPGL +A FA P NRFW + AG T ++ P + Sbjct: 19 VDDLIEPGVRLLFVGINPGLWTAAANAHFARPGNRFWPALQLAGITRERMDVSGGMAPAD 78 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 L+ G+T LV T +A+E++ +++ L ++ +P +A+LG AY Q Sbjct: 79 RAELVGAGIGITNLVPFATARADELTGEQIRQAAGTLTATVQRVRPAVVAVLGLTAYRQA 138 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 F + A+ G+Q +G Q+WVLPNPSGL + + L AYRE Sbjct: 139 FGRPKARPGRQPEDLGGAQLWVLPNPSGLNAHETAASLGVAYRE 182 >UniRef50_B2GKB6 G/U mismatch-specific DNA glycosylase n=6 Tax=Bacteria RepID=B2GKB6_KOCRD Length = 190 Score = 112 bits (280), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 59/159 (37%), Positives = 91/159 (57%), Gaps = 7/159 (4%) Query: 14 FCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------PQEAQHLLDYRCGVT 67 F GINPGL +A TG FA P NRF+ +++AG TD ++ P + L G++ Sbjct: 28 FVGINPGLWTAATGAHFARPGNRFYPALHRAGITDTRIDAAAGYDPADLAQLTGRGIGIS 87 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 + R T +A+E++++EL G R+L + ++ P +A+LG AY + F + GA G+Q Sbjct: 88 NVCPRATARADELTREELREGARRLDRLVREHHPAVVAVLGITAYREAFERPGAATGRQD 147 Query: 128 LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 T ++V PNPSGL + SL+ L AY E+ +A V Sbjct: 148 APWPGTALFVAPNPSGLNAHSSLDDLAHAYAEIARAAGV 186 >UniRef50_Q67QW8 G/U mismatch-specific DNA glycosylase n=1 Tax=Symbiobacterium thermophilum RepID=Q67QW8_SYMTH Length = 177 Score = 111 bits (278), Expect = 8e-24, Method: Compositional matrix adjust. Identities = 66/165 (40%), Positives = 95/165 (57%), Gaps = 2/165 (1%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D LAPGLRVVF G NPG +SA G +A+P NRF+ +++QAG T+R+ P E LL+ Sbjct: 7 DYLAPGLRVVFVGFNPGETSAREGHYYAYPGNRFYWLLWQAGLTERRYAPHEDARLLEIG 66 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G+T LV R T + ++S EL +G +L+ K+ Y+P+ GK Y +R + Sbjct: 67 YGLTDLVGRSTRSSGDLSAAELRSGREELLAKLARYRPRVACYNGKGIYAALTGRRRVAY 126 Query: 124 GKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 G Q + I +V +PSG SR L + + YREL +AL+ G Sbjct: 127 GLQPESVIPGVLDFVAASPSGRSREPLAEKLRLYREL-RALIAEG 170 >UniRef50_UPI0001926888 PREDICTED: similar to GL18173 n=1 Tax=Hydra magnipapillata RepID=UPI0001926888 Length = 282 Score = 109 bits (273), Expect = 3e-23, Method: Compositional matrix adjust. Identities = 55/142 (38%), Positives = 83/142 (58%), Gaps = 2/142 (1%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D + GL +VF GINPGL++A +G + P N FWK +Y +G D+ + Q LLDY Sbjct: 51 DYMRNGLDIVFVGINPGLAAAFSGKYYTGPGNHFWKCLYLSGLIDKPFTSNDDQKLLDYN 110 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T +V R T +N++ K+EL AG + L EKI+ Y+P + GK +Y+ ++ + Sbjct: 111 IGFTNMVSRTTKGSNDLCKEELIAGSKVLKEKIQLYRPLIVVFNGKISYQVFSGKKDFFF 170 Query: 124 GKQTLTI--GSTQIWVLPNPSG 143 GKQ+ + T IWV+P+ S Sbjct: 171 GKQSELVEGTDTHIWVMPSSSA 192 >UniRef50_C0ZBM4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZBM4_BREBN Length = 192 Score = 106 bits (264), Expect = 3e-22, Method: Compositional matrix adjust. Identities = 58/163 (35%), Positives = 88/163 (53%), Gaps = 9/163 (5%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKL 69 L V+FCGINPG SA TGF +A+PAN FW+ +Y+ G T +L P+E LLDY G+T L Sbjct: 17 LAVLFCGINPGRISATTGFHYANPANLFWRGLYEGGLTPHKLLPEETATLLDYNYGITDL 76 Query: 70 VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR--GAQWGKQT 127 V RPT A ++ ++ AG ++ I+ Y+P+ + G A+ Q+ G Q Sbjct: 77 VSRPTRSATDLRNEDYAAGTELFVQMIQTYRPRVICFNGITAFRHATGQKKEAISLGLQP 136 Query: 128 LTIGSTQIW------VLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + T+ W V+P+ SG + + E+ + + EL L Sbjct: 137 IRYFGTEEWSGSYVFVVPSTSGANASFTREERIAKFTELSHFL 179 >UniRef50_C8NV44 G/U mismatch-specific DNA glycosylase n=2 Tax=Corynebacterium RepID=C8NV44_9CORY Length = 170 Score = 105 bits (263), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 60/159 (37%), Positives = 89/159 (55%), Gaps = 7/159 (4%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFT------DRQLKPQEAQHLLDYRC 64 R++ GINPG S PFAHP NRFW + +AG T L Q+A+ L + Sbjct: 10 RLLIVGINPGTLSEEVDAPFAHPGNRFWPALDKAGITPYRVYASEGLSEQDAKMLAERGI 69 Query: 65 GVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 G T LV R T +A++++ +EL GG +L ++ ++P A+ G AY F +R AQ G Sbjct: 70 GFTNLVARMTPKASDLATEELVEGGERLEAVVDKHRPGAVMFAGIGAYRGAFGRRKAQRG 129 Query: 125 KQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 Q+ I +WV+ NPSGL + ++E L +YRE+ +A Sbjct: 130 LQSGMIAGVPVWVVGNPSGLNAHETVESLAASYREVWEA 168 >UniRef50_UPI000178957E Uracil-DNA glycosylase superfamily n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178957E Length = 171 Score = 105 bits (263), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 54/159 (33%), Positives = 89/159 (55%), Gaps = 1/159 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D L GL +VF G NP L S G +A+P N FW++++++G T R + E LL Sbjct: 4 VSDHLDYGLTLVFIGFNPSLRSGEVGHHYANPRNNFWRILHKSGLTPRLYEAAEDGELLK 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V RPT A +++++E G L +K+E+Y+P+ +GK Y + + A Sbjct: 64 LGYGFTNIVARPTRGAEDITREEYRLGRETLRKKLEEYRPEVACFVGKGVYTEYSGKAKA 123 Query: 122 QWGKQ-TLTIGSTQIWVLPNPSGLSRVSLEKLVEAYREL 159 WG Q + + +V P+ SGL R+ +++++ YR+L Sbjct: 124 DWGIQPDSVVDGVREFVAPSSSGLVRMPMDEIIGIYRQL 162 >UniRef50_Q1J3D2 Uracil-DNA glycosylase superfamily n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J3D2_DEIGD Length = 177 Score = 105 bits (261), Expect = 7e-22, Method: Compositional matrix adjust. Identities = 60/162 (37%), Positives = 91/162 (56%), Gaps = 1/162 (0%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +++D++A G++V+F G NP S G +A N+FW+++ +G T R L+P+E L Sbjct: 10 VLQDLVAEGVQVLFVGFNPSPRSVQQGHHYAGRGNQFWRLLAASGLTPRLLRPEEDHLLP 69 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G T LV RPTV A E+S+ EL AG L + YQ +A GK Y ++ Sbjct: 70 RWGLGSTNLVARPTVSAAELSRSELRAGVPHLRRLVARYQVPLVAYTGKGVYLAATGKQQ 129 Query: 121 AQWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQ 161 A WG Q + IG +VLP+PSGL+R+ + + YR L + Sbjct: 130 AHWGAQEDSLIGGVTDFVLPSPSGLARLPFAEKLGYYRALAE 171 >UniRef50_B2W4Q2 G/U mismatch-specific DNA glycosylase n=2 Tax=Pleosporineae RepID=B2W4Q2_PYRTR Length = 328 Score = 103 bits (258), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 50/111 (45%), Positives = 72/111 (64%), Gaps = 1/111 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 DIL P L VF G NPG+ +A G +AHP+N FWK+++ +G T+R+LKP+E + L Sbjct: 113 DILEPNLICVFVGTNPGVQTAIAGHAYAHPSNHFWKLLHSSGLTNRRLKPEEDRSLPAQF 172 Query: 64 C-GVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 C G T +VDRP+ A E+SK+E AG L K ++P+A+ I+GK +E Sbjct: 173 CMGNTNIVDRPSKDAAELSKEETAAGTASLEAKFRKFKPEAVCIVGKGIWE 223 >UniRef50_A6SED0 Putative uncharacterized protein n=2 Tax=Sclerotiniaceae RepID=A6SED0_BOTFB Length = 425 Score = 100 bits (248), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 48/111 (43%), Positives = 73/111 (65%), Gaps = 1/111 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D+++ L +F G+NPGL +A G +AHP+NRFWK++Y +G T R LK ++ L + Y Sbjct: 224 DVISSDLLCLFVGLNPGLLTAQVGHAYAHPSNRFWKILYSSGCTTRLLKAEDDVKLPELY 283 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 G T +V+RPT E+SK+E+ G L EKI Y+P+A+A++GK +E Sbjct: 284 SLGNTNIVERPTRNGGELSKKEMDEGVSSLEEKIRIYRPEAVALVGKGIWE 334 >UniRef50_C9SR06 G/U mismatch-specific uracil DNA glycosylase n=4 Tax=Sordariomycetes RepID=C9SR06_VERA1 Length = 320 Score = 99.4 bits (246), Expect = 4e-20, Method: Compositional matrix adjust. Identities = 49/117 (41%), Positives = 70/117 (59%), Gaps = 1/117 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D +AP L ++F G+NPG+ +A TG +AHP+N FWK++Y +G T R L E + L + Y Sbjct: 110 DAIAPNLLILFIGLNPGIQTASTGHAYAHPSNLFWKLLYHSGITSRLLPASEDRTLPERY 169 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 CG T +V RP+ E+ EL AG L KI ++P+A I+GK +E F R Sbjct: 170 ACGFTNIVSRPSRNGAELRNAELDAGVAVLEAKIAKWRPEAACIVGKGIWEAVFRVR 226 >UniRef50_B2A9Z4 Predicted CDS Pa_1_2290 n=2 Tax=Sordariales RepID=B2A9Z4_PODAN Length = 327 Score = 97.8 bits (242), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 46/111 (41%), Positives = 71/111 (63%), Gaps = 1/111 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D +AP L ++F G+NPGL ++ TG +AHP N FW++++ +G T R PQE + L + Sbjct: 108 DAIAPNLILLFVGLNPGLLTSSTGHAYAHPTNLFWRLLHSSGITPRLCAPQEDRLLPSLF 167 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 G+T +V RPT E+SK+E+ AG L EKI ++P+ + I+GK +E Sbjct: 168 SLGLTNIVPRPTRNGAELSKKEMDAGVAVLEEKIRKFKPEVVCIVGKSIWE 218 >UniRef50_Q4T9X5 Chromosome undetermined SCAF7485, whole genome shotgun sequence. (Fragment) n=3 Tax=Metazoa RepID=Q4T9X5_TETNG Length = 333 Score = 96.3 bits (238), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 53/152 (34%), Positives = 82/152 (53%), Gaps = 7/152 (4%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D++AP L ++ GINPGL SA G + +P N FWK ++ +GFT++QL Q L + Y Sbjct: 45 DVIAPNLDILIIGINPGLLSAYKGHHYPNPGNHFWKCLFLSGFTEKQLNYSHDQSLPEIY 104 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ------GF 116 G T +V+R T + ++S +E+ GGR+L+ K++ Y+P A GK YE G Sbjct: 105 GIGFTNMVERTTPGSKDLSNKEIREGGRQLLAKLQKYRPLIAAFNGKGIYEIFCRETFGV 164 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVS 148 + ++G Q + TQ PS R + Sbjct: 165 KAKNLEFGLQPYRVPETQTVCYLMPSSSPRCA 196 >UniRef50_B3MY42 GF22816 n=1 Tax=Drosophila ananassae RepID=B3MY42_DROAN Length = 1192 Score = 94.0 bits (232), Expect = 2e-18, Method: Composition-based stats. Identities = 49/144 (34%), Positives = 77/144 (53%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E LL Sbjct: 657 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMNADEDHKLLR 716 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 717 QGIGFTNMVSRATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 776 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSG 143 +G+Q + T IWV+P+ S Sbjct: 777 HFGRQPGRVDGTDTYIWVMPSSSA 800 >UniRef50_B8I864 Uracil-DNA glycosylase superfamily n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I864_CLOCE Length = 183 Score = 93.6 bits (231), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 53/169 (31%), Positives = 92/169 (54%), Gaps = 9/169 (5%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 DILA GL+VVF G NPGL SA ++H +NRFW+ +Y++G T + +P++ + +L++ Sbjct: 12 DILATGLKVVFIGYNPGLLSARNRHHYSHKSNRFWRFLYESGLTPSRFEPEDDRKILEFG 71 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILG---KQAYEQGF---- 116 G T +VDRP+ ANE+ E+ G + +++++P+ +G +AY Sbjct: 72 YGSTNIVDRPSKAANEIGTDEVVEGSANIYRLLDNFKPRIACYVGIGVYRAYASSILRVP 131 Query: 117 -SQRGAQWGKQTLTI-GSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 S + G Q+ +I S +V + SGL+ V + + + L + L Sbjct: 132 ASGINVKPGLQSASILESISDFVCYSTSGLNTVPFAEQRKCFENLKKLL 180 >UniRef50_C4QAV8 G/t mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Schistosoma mansoni RepID=C4QAV8_SCHMA Length = 869 Score = 92.4 bits (228), Expect = 5e-18, Method: Composition-based stats. Identities = 50/145 (34%), Positives = 74/145 (51%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D L GL +V GINP L+SA G +A P N FW + QAG + + +LDY Sbjct: 98 DHLKEGLDIVIVGINPSLASAHVGHHYAGPGNHFWTCLSQAGLVPMAVSCYDDSKMLDYG 157 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T + RPT A E++++E+ AG ++EK+ Y+P+ GK YE + Sbjct: 158 IGFTNVCTRPTKGAAELTRKEMKAGAAIMLEKMRKYKPKIAVFNGKGIYEAYVGHKNFCM 217 Query: 124 GKQTLTIGSTQIWVLPNPSGLSRVS 148 G+Q T+ T I + PS +R + Sbjct: 218 GRQPTTLDGTDIVIFVMPSSSARCA 242 >UniRef50_A8P541 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P541_COPC7 Length = 365 Score = 92.4 bits (228), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 46/108 (42%), Positives = 64/108 (59%), Gaps = 1/108 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D L P L ++FCGINPG SA G FA P N FW ++++G T ++KPQE L D Sbjct: 151 VNDNLKPDLDIIFCGINPGQVSATLGHHFAGPTNHFWSCLHESGLTSTRMKPQEDFTLPD 210 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILG 108 + G+T LV RPT +A+E+SK E A K+ Y+P+ + +G Sbjct: 211 RHSIGLTNLVGRPTAEASELSKAESLACVAPFFVKMHKYRPKIVCFVG 258 >UniRef50_A7HGS2 Uracil-DNA glycosylase superfamily n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HGS2_ANADF Length = 181 Score = 92.0 bits (227), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 49/143 (34%), Positives = 77/143 (53%), Gaps = 1/143 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D +AP L ++F GINP L SA G FA N FW++++ +G T P++ L Sbjct: 13 DRVAPDLDILFVGINPSLRSAEVGHHFASKGNPFWRLLHASGLTAELHAPEDDVALPARG 72 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF-SQRGAQ 122 G+T + R T A E+ + EL GGR L +++ +P+ +A++G Y F R Sbjct: 73 LGITNVCSRATRSAAELGRDELREGGRALPDRVAALRPRIVALVGVSLYPHVFPGGRETG 132 Query: 123 WGKQTLTIGSTQIWVLPNPSGLS 145 G + + +++VLPNPSGL+ Sbjct: 133 PGAKRARLAGARVFVLPNPSGLN 155 >UniRef50_B8N438 Mismatch-specific thymine-DNA glycosylase, putative n=6 Tax=Eurotiomycetidae RepID=B8N438_ASPFN Length = 373 Score = 92.0 bits (227), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 45/147 (30%), Positives = 82/147 (55%), Gaps = 11/147 (7%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + P L ++ G+NPG+ + TG+ +AHP+N +WK+++ +G T + P + L Sbjct: 180 LLRDTIPPNLTLLLVGVNPGIMTGATGYVYAHPSNLYWKLLHWSGITAIRHPPSDTYRLP 239 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + Y G T +V+RPT A+ +SK E+ AG L EK+ QP+A+ ++GK +E + R Sbjct: 240 ELYNIGNTNIVERPTRDASMLSKAEMDAGVPVLEEKVAKQQPEAVCLVGKSIWEAVWRVR 299 Query: 120 GAQ----------WGKQTLTIGSTQIW 136 + W ++ +G ++ W Sbjct: 300 KGRAIRKEEFRYGWQDESENMGRSERW 326 >UniRef50_A6GLD3 G/U mismatch-specific DNA glycosylase n=1 Tax=Limnobacter sp. MED105 RepID=A6GLD3_9BURK Length = 161 Score = 91.7 bits (226), Expect = 7e-18, Method: Compositional matrix adjust. Identities = 56/159 (35%), Positives = 88/159 (55%), Gaps = 5/159 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D+L L +VFCG PG SA G +A+ N FW+ +++ G T Q+ P++ Q + Sbjct: 3 ILPDLLQANLNIVFCGTAPGTVSAQRGAYYANAGNAFWRTLFEVGLTPHQVSPKDFQGIT 62 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y+ G+T L + V + S ++ G +L EKI + P+ LA K+A Q F Sbjct: 63 QYKLGLTDLAKQ--VYGADSSLKKSDFGPSQLREKILHFNPKILAFTSKRA-GQEFLGCK 119 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYREL 159 Q G Q IG+T ++VLP+PSGL+R +E ++EL Sbjct: 120 VQCGLQHEKIGNTVLFVLPSPSGLARSHWR--IEPWQEL 156 >UniRef50_UPI0001926FCB PREDICTED: similar to predicted protein n=2 Tax=Hydra magnipapillata RepID=UPI0001926FCB Length = 226 Score = 91.7 bits (226), Expect = 8e-18, Method: Compositional matrix adjust. Identities = 47/144 (32%), Positives = 78/144 (54%), Gaps = 1/144 (0%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MV D+L L ++F GINPGL SA G ++ N FW +++++G L Q L Sbjct: 53 MVPDLLEEHLDIIFVGINPGLVSAYRGHHYSGNNNHFWPLLFESGLISEPLTWQNDYKCL 112 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y+ G+T +V R T A E++ E+ +G ++E I+ P + GK YE ++ Sbjct: 113 QYKIGMTNIVSRTTRSAAELTNSEIRSGKSSIVEMIKKLNPLVVCFNGKGIYEVFSGKKK 172 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL 144 + G+Q I ++ ++V+P+ SGL Sbjct: 173 CEVGQQE-NIFNSAVYVMPSTSGL 195 >UniRef50_Q9V4D8 Thd1 n=1 Tax=Drosophila melanogaster RepID=Q9V4D8_DROME Length = 1738 Score = 91.7 bits (226), Expect = 9e-18, Method: Composition-based stats. Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 779 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 838 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 839 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 898 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSG 143 +G+Q + T IWV+P+ S Sbjct: 899 HFGRQPDRVDGTDTFIWVMPSSSA 922 >UniRef50_Q9XYZ3 G/T mismatch-specific thymine DNA glycosylase n=7 Tax=Sophophora RepID=Q9XYZ3_DROME Length = 1095 Score = 91.3 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 136 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 195 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 196 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 255 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSG 143 +G+Q + T IWV+P+ S Sbjct: 256 HFGRQPDRVDGTDTFIWVMPSSSA 279 >UniRef50_B0E0R1 Predicted protein n=7 Tax=Laccaria bicolor S238N-H82 RepID=B0E0R1_LACBS Length = 436 Score = 90.9 bits (224), Expect = 1e-17, Method: Compositional matrix adjust. Identities = 45/97 (46%), Positives = 62/97 (63%), Gaps = 1/97 (1%) Query: 13 VFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKLVD 71 VFCGINPG +SA G F P N FW +Y++GFT R L+PQE L + + G+T LVD Sbjct: 224 VFCGINPGQTSAEIGHHFGGPTNHFWSCLYESGFTTRLLRPQEDFILPEQFSIGMTNLVD 283 Query: 72 RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILG 108 RPT + +E+SK E AG L+ KI ++P+ + +G Sbjct: 284 RPTAEQSELSKAERLAGVPALLAKIATHRPKIVCFVG 320 >UniRef50_B4NHL6 GK13687 n=2 Tax=Coelomata RepID=B4NHL6_DROWI Length = 1761 Score = 90.5 bits (223), Expect = 2e-17, Method: Composition-based stats. Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 765 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 824 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 825 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 884 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSG 143 +G+Q + T IWV+P+ S Sbjct: 885 HFGRQPDRVEGTDTYIWVMPSSSA 908 >UniRef50_D1CA47 Uracil-DNA glycosylase superfamily n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CA47_SPHTD Length = 194 Score = 89.7 bits (221), Expect = 3e-17, Method: Compositional matrix adjust. Identities = 51/156 (32%), Positives = 80/156 (51%), Gaps = 14/156 (8%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--------RQLK 52 ++ D LAP + +VF GINP SA G FA NRFW + Q+ ++ +L+ Sbjct: 9 VLPDYLAPKMPLVFIGINPSTYSAAQGHYFARRTNRFWPALSQSRLSEPIRAALGRDRLE 68 Query: 53 PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAY 112 P + LL + G T LV RP+ +A E+S+ + KL +++E YQP+ G Y Sbjct: 69 PTDDARLLAFGIGFTDLVKRPSARAAELSRADYRIWAPKLRQRLEHYQPRVACFHGFTGY 128 Query: 113 EQ------GFSQRGAQWGKQTLTIGSTQIWVLPNPS 142 G ++ G Q ++G T+++V+PNPS Sbjct: 129 RAFVRDALGLTEVPRNLGPQPHSLGPTRLYVVPNPS 164 >UniRef50_Q5KFR6 Expressed protein n=2 Tax=Filobasidiella neoformans RepID=Q5KFR6_CRYNE Length = 469 Score = 89.7 bits (221), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 45/114 (39%), Positives = 67/114 (58%), Gaps = 2/114 (1%) Query: 2 VEDILAPGLRVVFCGINP-GLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 + D+L G +VFCGI G S+ G FAHP N+FWK +YQ+G T R + P E ++ Sbjct: 158 LNDLLVKGQDLVFCGIKRIGKMSSTLGHHFAHPTNKFWKALYQSGLTSRLMSPTEDYKVV 217 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 D Y G+T LV RPT + +E+S E+ L++K YQP + +GK+ ++ Sbjct: 218 DEYNYGLTNLVGRPTSEQSELSTLEMKLNTINLLQKFIKYQPSVVCFVGKKIWD 271 >UniRef50_B4L7B8 GI14092 n=1 Tax=Drosophila mojavensis RepID=B4L7B8_DROMO Length = 1737 Score = 89.7 bits (221), Expect = 4e-17, Method: Composition-based stats. Identities = 46/144 (31%), Positives = 80/144 (55%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T Q+ ++ L+ Sbjct: 796 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQMSAEDDYTLVK 855 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 856 HGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 915 Query: 122 QWGKQTLTI--GSTQIWVLPNPSG 143 +G+Q + +T IWV+P+ S Sbjct: 916 HFGRQPDRVEGTNTYIWVMPSSSA 939 >UniRef50_UPI0000DB71A4 PREDICTED: similar to Thd1 CG1981-PA, partial n=2 Tax=Apocrita RepID=UPI0000DB71A4 Length = 263 Score = 89.4 bits (220), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 50/141 (35%), Positives = 77/141 (54%), Gaps = 1/141 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D L L +VF GINP L +A G +A P N F+K+++++G T R L +E LL Y Sbjct: 27 DYLKMNLDLVFVGINPSLMAAHRGRYYAGPGNHFYKLLHESGLTPRCLIYEEDYKLLQYG 86 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA-Q 122 G+T +V R T + ++ + E+ G + + EK++ Y+P+ GK YE ++ G Sbjct: 87 IGLTNIVTRATKSSADLKRTEIKEGSKIIEEKLKLYKPKIAVFNGKCIYEVFANKMGNFN 146 Query: 123 WGKQTLTIGSTQIWVLPNPSG 143 +G Q I T IWV P+ S Sbjct: 147 FGLQLERINETAIWVTPSSSA 167 >UniRef50_A7S194 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7S194_NEMVE Length = 250 Score = 89.4 bits (220), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 49/145 (33%), Positives = 82/145 (56%), Gaps = 3/145 (2%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D +A L ++F GINPGL+SA G +A P N FW +Y +G +L ++ + Sbjct: 55 LLPDRVAENLDILFIGINPGLTSAYKGHHYAGPNNHFWPCLYDSGLVPEKLTFRDDEKCP 114 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y G+T +V+R T ++++S++E+ G LI K++ +P GK YE FS+ Sbjct: 115 AYGIGLTNIVERTTRGSSDLSRKEIKDGVDALIVKVKRLKPLVACFNGKGIYEI-FSKSK 173 Query: 121 AQWGKQTLTIGSTQ--IWVLPNPSG 143 + G+QT I T ++V+P+ SG Sbjct: 174 CEIGRQTKCIPGTNVVVYVMPSSSG 198 >UniRef50_B4JZW0 GH23925 n=1 Tax=Drosophila grimshawi RepID=B4JZW0_DROGR Length = 1557 Score = 89.4 bits (220), Expect = 5e-17, Method: Composition-based stats. Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T Q+ + LL Sbjct: 611 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQMSADDDYTLLK 670 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ +QP+ GK +E ++ Sbjct: 671 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFQPKIAVFNGKLIFEVFSGKKEF 730 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSG 143 +G+Q + T IWV+P+ S Sbjct: 731 HFGRQPDRVDGTNTYIWVMPSSSA 754 >UniRef50_C7IMH6 Uracil-DNA glycosylase superfamily n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IMH6_9CLOT Length = 184 Score = 89.0 bits (219), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 50/170 (29%), Positives = 88/170 (51%), Gaps = 11/170 (6%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 DILAPGL+VVF G NPGL SA ++H +NRFW+ +++AG T + +P++ + +L+ Sbjct: 12 DILAPGLKVVFIGYNPGLLSAKNRHHYSHKSNRFWRFLFEAGLTPVRFQPEDDRKILELG 71 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T +VDR T AN++ E+ G L ++ +P+ +G Y + F+ + Sbjct: 72 YGSTNIVDRQTKAANDIGADEVIEGSANLYRLLDTLKPRVACYVGIGVY-RAFASCILKM 130 Query: 124 GKQTLTIG----------STQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 + + +T +V + SGL+ V + + + +L + L Sbjct: 131 SASRINVTPGLQLDSLQEATLDFVCYSTSGLNTVPFAEQCKCFADLKKLL 180 >UniRef50_B2IIR8 Uracil-DNA glycosylase superfamily n=2 Tax=Bacteria RepID=B2IIR8_BEII9 Length = 191 Score = 89.0 bits (219), Expect = 6e-17, Method: Compositional matrix adjust. Identities = 56/154 (36%), Positives = 75/154 (48%), Gaps = 13/154 (8%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA--GFTDRQ------LKP 53 + D+L GL V+F GINP L S G FA NRFW ++ R+ L+P Sbjct: 9 LPDLLDKGLSVLFVGINPSLYSVAQGHYFARKTNRFWPAFSRSVLSLAARENLGLEALEP 68 Query: 54 QEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 L G T V RPT +A++VS E A L+EKIE Y+P+ G AY Sbjct: 69 MHDHLLPAQGFGFTDTVKRPTARASDVSPAEFAAAVTALVEKIEYYRPRIACFHGTMAYR 128 Query: 114 QGF-----SQRGAQWGKQTLTIGSTQIWVLPNPS 142 ++ G QTL+IG T ++V+PNPS Sbjct: 129 PVHRALTETKTDPALGMQTLSIGPTHLYVVPNPS 162 >UniRef50_B4MF03 GJ13949 n=2 Tax=Drosophila virilis RepID=B4MF03_DROVI Length = 1038 Score = 88.6 bits (218), Expect = 7e-17, Method: Composition-based stats. Identities = 47/144 (32%), Positives = 77/144 (53%), Gaps = 2/144 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T QL + L+ Sbjct: 815 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQLSADDDYTLVK 874 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 875 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 934 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSG 143 +G+Q + T IWV+P+ S Sbjct: 935 HFGRQPDRVDGTNTYIWVMPSSSA 958 >UniRef50_A2QSA7 Function: In the DNA of higher eukaryotes n=48 Tax=Leotiomyceta RepID=A2QSA7_ASPNC Length = 345 Score = 88.2 bits (217), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 40/114 (35%), Positives = 72/114 (63%), Gaps = 1/114 (0%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + P L ++ G+NPG+ + TG+ +AHP+N FWK+++ +G T + P + L Sbjct: 138 LLVDTIPPNLTLLLVGVNPGVLTGVTGYAYAHPSNLFWKLLHWSGITPIRHPPSDTYKLP 197 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 + Y G T +V+RPT A+ +SK+E+ AG L EK+ + +P+ + ++GK +E Sbjct: 198 ELYNIGNTNIVERPTRDASMLSKKEMDAGVPVLEEKVREKRPEVVCLVGKSIWE 251 >UniRef50_UPI0000D578C1 PREDICTED: similar to Thd1 n=1 Tax=Tribolium castaneum RepID=UPI0000D578C1 Length = 702 Score = 87.0 bits (214), Expect = 2e-16, Method: Composition-based stats. Identities = 46/142 (32%), Positives = 77/142 (54%), Gaps = 2/142 (1%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D L L ++ GINPGL +A G +A P N FWK +Y +G T +Q+ E LL Sbjct: 172 DHLTENLDIIIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQQQVSADEDYKLLQVG 231 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T +V R T + +++++E+ G + L+EK+ ++P+ GK YE ++ ++ Sbjct: 232 IGFTNMVQRATKGSADLTRKEIKEGSQILLEKLRKFRPKIAVFNGKLIYEVFSGKKDFKF 291 Query: 124 GKQTLTIGSTQ--IWVLPNPSG 143 G+Q + T +WV+P+ S Sbjct: 292 GRQPDLVDGTNTYMWVMPSSSA 313 >UniRef50_A8Q5A6 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8Q5A6_MALGO Length = 272 Score = 86.7 bits (213), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 43/112 (38%), Positives = 64/112 (57%), Gaps = 3/112 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ---H 58 + D + P L VVFCGINPG S +AH +N F+ I+ AG T +L P + Sbjct: 28 LPDYIKPNLDVVFCGINPGTESGAQRRHYAHRSNHFYACIHMAGLTSERLTPSQDMTFPT 87 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQ 110 L YR G+T L RPT ++ ++++ EL G +L+ KI Y+P+ + +GKQ Sbjct: 88 LQPYRLGLTNLAHRPTARSEQLTRIELERGVPELLMKIARYRPRVICFVGKQ 139 >UniRef50_C8RZ13 Uracil-DNA glycosylase superfamily n=1 Tax=Rhodobacter sp. SW2 RepID=C8RZ13_9RHOB Length = 165 Score = 86.7 bits (213), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 52/148 (35%), Positives = 74/148 (50%), Gaps = 4/148 (2%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 M+ DI+APGL V+FCG PG SA G +A P NRFW ++ + G T R+L P E L Sbjct: 1 MLPDIVAPGLDVLFCGTAPGAMSAARGHYYARPGNRFWPLLAEIGLTPRRLHPDEDHLLP 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA--YEQGFSQ 118 + G+T L + Q E+ + +L + +P+ALA A G + Sbjct: 61 GFGLGLTDLAQGVSGQDAEIPRHAY--APDRLQALVAAIRPRALAFTSLTAARLALGVPR 118 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLSR 146 A G T++W LP+PSGL+R Sbjct: 119 LAAGALPADPRYGHTRLWALPSPSGLAR 146 >UniRef50_C5SKW3 Uracil-DNA glycosylase superfamily n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SKW3_9CAUL Length = 203 Score = 86.3 bits (212), Expect = 3e-16, Method: Compositional matrix adjust. Identities = 61/166 (36%), Positives = 80/166 (48%), Gaps = 8/166 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D+L PGL++VFCG G SA +AHP N FW+ +Y G T +L P + LLD Sbjct: 35 VPDLLRPGLKLVFCGTALGRVSAQKRAYYAHPGNFFWRTLYGVGLTPERLAPTDYPRLLD 94 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T L N+V L KI Y PQ LA K A F +R Sbjct: 95 YGIGLTDLCK--AHYGNDVDLPAEAWDAEALTAKIAAYAPQHLAFTSKTAASV-FLERPT 151 Query: 122 ---QWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALV 164 G Q G T + VLP+PSG +R ++ A+R+L QA+ Sbjct: 152 GKIALGLQAGKAGETSLRVLPSPSGQARRFWDE--AAWRDLAQAVT 195 >UniRef50_Q7SDB2 Predicted protein n=2 Tax=Sordariaceae RepID=Q7SDB2_NEUCR Length = 396 Score = 85.5 bits (210), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 41/112 (36%), Positives = 68/112 (60%), Gaps = 1/112 (0%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D L+P L V+F G+NPG+S+A +AHP N FW++++ +G T P E + L + + Sbjct: 145 DALSPNLLVLFIGLNPGISTAIQQHAYAHPTNLFWRLLFSSGITPVLCAPSEDRSLPERF 204 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ 114 G+T +V RPT E++K E+ G L EK+ ++P+A+ ++GK +E Sbjct: 205 SLGLTNIVGRPTRNGGELTKGEMDEGVAVLEEKVRVWKPEAVCVVGKGIWES 256 >UniRef50_Q13569 G/T mismatch-specific thymine DNA glycosylase n=47 Tax=Euteleostomi RepID=TDG_HUMAN Length = 410 Score = 84.3 bits (207), Expect = 1e-15, Method: Compositional matrix adjust. Identities = 50/152 (32%), Positives = 76/152 (50%), Gaps = 7/152 (4%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL-DY 62 DIL L +V GINPGL +A G + P N FWK ++ +G ++ QL + L Y Sbjct: 126 DILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQLNHMDDHTLPGKY 185 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ------GF 116 G T +V+R T + ++S +E GGR L++K++ YQP+ GK YE G Sbjct: 186 GIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFSKEVFGV 245 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVS 148 + ++G Q I T+ PS +R + Sbjct: 246 KVKNLEFGLQPHKIPDTETLCYVMPSSSARCA 277 >UniRef50_UPI000186DD28 G/t mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186DD28 Length = 1239 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 45/142 (31%), Positives = 74/142 (52%), Gaps = 2/142 (1%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D L L +V GINPGL +A G +A P N FWK +Y +G T + + LL Sbjct: 533 DHLTTNLDIVIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTPEPMTADDDYKLLSVG 592 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T +V R T + +++++E+ G + L+EK++ ++P+ GK YE + + Sbjct: 593 IGFTNMVARATKGSADLTRKEIKEGSQILLEKLQKFKPKIAVFNGKLIYEVFSGNKDFNF 652 Query: 124 GKQTLTIGSTQ--IWVLPNPSG 143 G+Q + T +WV+P+ S Sbjct: 653 GRQPDFVDGTNTYMWVMPSSSA 674 >UniRef50_UPI0001792C18 PREDICTED: similar to GA15170-PA n=3 Tax=Coelomata RepID=UPI0001792C18 Length = 1013 Score = 83.2 bits (204), Expect = 3e-15, Method: Composition-based stats. Identities = 44/142 (30%), Positives = 76/142 (53%), Gaps = 2/142 (1%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D L L +V GINPGL +A G +A P N FWK +Y +G T + + LL Sbjct: 263 DHLTNNLDIVIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTPEPMTADDDYKLLQNG 322 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T +V+R T + +++++E+ G + L+EK++ ++P+ GK +E ++ + Sbjct: 323 IGFTNMVERATKGSADLTRKEIKEGSQILLEKLQRFKPKIAVFNGKLIFEVFSGKKDFTF 382 Query: 124 GKQTLTIGSTQ--IWVLPNPSG 143 G+Q + T +WV+P+ S Sbjct: 383 GRQPELVDGTNTYMWVMPSSSA 404 >UniRef50_UPI0000DB6DF5 PREDICTED: similar to Thd1 CG1981-PA n=2 Tax=Apocrita RepID=UPI0000DB6DF5 Length = 973 Score = 82.4 bits (202), Expect = 5e-15, Method: Composition-based stats. Identities = 47/145 (32%), Positives = 78/145 (53%), Gaps = 2/145 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D L L ++ GINPGL +A G +A P N FWK ++ +G T L + LL Sbjct: 206 VLPDHLTNNLDIIIIGINPGLFAAYKGHHYAGPGNHFWKCLHLSGLTPEPLTADDDYKLL 265 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G T +V R T + +++++E+ G L+EK+ Y+P+ GK YE ++ Sbjct: 266 RFGIGFTNMVARATKGSADLTRKEIKEGSHILLEKLRKYKPKIAVFNGKLIYEVFSGKKD 325 Query: 121 AQWGKQ-TLTIGS-TQIWVLPNPSG 143 +G+Q L G+ T +WV+P+ S Sbjct: 326 FGFGRQPNLVEGTNTYMWVMPSSSA 350 >UniRef50_Q6C2V5 YALI0F04796p n=1 Tax=Yarrowia lipolytica RepID=Q6C2V5_YARLI Length = 302 Score = 81.3 bits (199), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 50/162 (30%), Positives = 89/162 (54%), Gaps = 14/162 (8%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKL 69 R+VF G NPG+ S+ G +AHP+N FWK + + L + L+D + G T L Sbjct: 97 RLVFVGFNPGIMSSEKGHYYAHPSNSFWKNLNKYVVIGGGLTCYDDSSLMDSHGIGFTDL 156 Query: 70 VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGK----QAYEQGFSQRGAQ--- 122 V RPT ++S++E+ G + L+++I +P + I+G+ + F +G + Sbjct: 157 VARPTRGCEDLSREEMALGAQILVDQISANKPALVCIVGRGIWATVFRAVFKHQGIKKSE 216 Query: 123 ---WGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQ 161 +G + +G+++++V+P+ SGL R +K +E +REL Q Sbjct: 217 MPHFGLMPVHMGTSRVFVVPSTSGLVR---DKRLELWRELGQ 255 >UniRef50_UPI0000E46F0A PREDICTED: similar to G/T mismatch-specific thymine DNA glycosylase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46F0A Length = 588 Score = 81.3 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 48/145 (33%), Positives = 73/145 (50%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 DIL L +V GINPGL +A G +A P N FWK +Y +G + + L D+ Sbjct: 70 DILVHNLDIVIIGINPGLMAAYKGHHYAGPGNHFWKCLYLSGLVPEPMTCMDDVKLPDFG 129 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 G T +V R T + ++ ++E+ G + ++EKI+ Y P GK YE ++ + Sbjct: 130 VGFTNIVGRTTRGSADLKRKEIKEGAKVVVEKIQKYLPLIACFNGKGIYEIYSGKKDFEV 189 Query: 124 GKQTLTIGSTQIWVLPNPSGLSRVS 148 G+Q I T+ V PS +R S Sbjct: 190 GRQPENIPGTETVVYVMPSSSARCS 214 >UniRef50_C6H6H6 Uracil DNA glycosylase superfamily protein n=1 Tax=Ajellomyces capsulatus H143 RepID=C6H6H6_AJECH Length = 301 Score = 80.5 bits (197), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 36/99 (36%), Positives = 64/99 (64%), Gaps = 1/99 (1%) Query: 16 GINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKLVDRPT 74 G+NPG+ + TG +AHP+N +WK+++ +G TD + P + L + Y G T +V+RPT Sbjct: 115 GVNPGIMTGQTGHVYAHPSNLYWKLLHSSGITDYRHPPSDTYQLPELYSVGNTNIVERPT 174 Query: 75 VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 A+++S++E+ G L KI + +P+A+ ++GK +E Sbjct: 175 RDASQLSRKEMDDGVPVLERKIREKRPEAVCLVGKSIWE 213 >UniRef50_B3QI54 Uracil-DNA glycosylase superfamily n=3 Tax=Rhodopseudomonas palustris RepID=B3QI54_RHOPT Length = 177 Score = 78.6 bits (192), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 47/145 (32%), Positives = 78/145 (53%), Gaps = 3/145 (2%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D+L GL+VVFCG G +S +AH N+FW V+ + G T R+L P E + + Sbjct: 15 ILPDVLQRGLKVVFCGTAAGRASEKAQSYYAHGRNKFWSVLSETGLTTRKLAPHEFREVT 74 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 ++ G+T L ++V + H L KIE+++P LA +A ++ F R Sbjct: 75 NFGIGLTDLCKDVAGSDHDVRPRPEHR--VVLRRKIEEHRPDFLAFTSLEAAKR-FVGRR 131 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLS 145 G+ +IG+T I+VLP+ S ++ Sbjct: 132 VGLGRYEESIGTTSIYVLPSTSPMA 156 >UniRef50_A7HV32 Uracil-DNA glycosylase superfamily n=3 Tax=Rhizobiales RepID=A7HV32_PARL1 Length = 176 Score = 76.6 bits (187), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 47/149 (31%), Positives = 71/149 (47%), Gaps = 4/149 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+L PGL VVF G P SA G +AHP NRFW +++ G T R +P E L+ Sbjct: 12 LPDVLVPGLDVVFVGTAPSRRSAALGAYYAHPGNRFWPALFETGITPRLYQPHEYAELIH 71 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE--QGFSQR 119 G T + R + +E+ + ++I +P+A+A K+A G Sbjct: 72 LGIGFTDMAKRASGMDHEIDPALFEV--ERFEKEILRCRPRAVAFTSKKAASVWMGVPTG 129 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVS 148 G Q G +++VL +PSG +R S Sbjct: 130 RLLHGHQPRRPGFPEVFVLSSPSGAARRS 158 >UniRef50_Q4P6C9 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P6C9_USTMA Length = 649 Score = 74.7 bits (182), Expect = 1e-12, Method: Composition-based stats. Identities = 40/98 (40%), Positives = 57/98 (58%), Gaps = 3/98 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ---EAQH 58 V D+ A ++ CGINPG+ S+ +G FAH +N F+ ++ AG T R++KP+ E Sbjct: 296 VPDLFAERNDIMLCGINPGVKSSSSGHHFAHRSNHFYPSLHLAGITSRRMKPEQDVEFPF 355 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKI 96 L Y G+T L RPT + NE+ EL G L+EKI Sbjct: 356 LRPYSLGLTNLAPRPTAEGNELLPSELIDGVPVLLEKI 393 >UniRef50_O59825 G/U mismatch-specific uracil DNA glycosylase n=1 Tax=Schizosaccharomyces pombe RepID=TDG_SCHPO Length = 325 Score = 69.3 bits (168), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 37/118 (31%), Positives = 64/118 (54%), Gaps = 3/118 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFT--DRQLKPQEAQHL 59 V D + + G+NPG++S+ G FA P+NRFWK++ ++ + + + L Sbjct: 142 VPDYICENPYAIIVGLNPGITSSLKGHAFASPSNRFWKMLNKSKLLEGNAEFTYLNDKDL 201 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQ-ALAILGKQAYEQGF 116 + G+T L RP+ ++ K+E+ G R L EK++ Y+PQ L I GK +E+ + Sbjct: 202 PAHGLGITNLCARPSSSGADLRKEEMQDGARILYEKVKRYRPQVGLFISGKGIWEEMY 259 >UniRef50_Q1IXM1 Uracil-DNA glycosylase superfamily n=3 Tax=Deinococcus RepID=Q1IXM1_DEIGD Length = 195 Score = 69.3 bits (168), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 48/154 (31%), Positives = 72/154 (46%), Gaps = 8/154 (5%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +V D+L GL +V G P SA +A+P N+FW+V+ + G T RQL P+E L Sbjct: 13 LVPDVLQEGLTLVLVGTAPSRISARAKAYYANPENKFWRVLAETGLTPRQLAPREYPILP 72 Query: 61 DYRCGVTKLVDRPT--VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ--GF 116 Y G+T + R + A V H +L EKI Y+P +A K+ + G Sbjct: 73 QYGIGLTDVAKRHSGVDAALPVEAWAPH----ELREKIRRYRPAIVAFTSKRGASETLGV 128 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLE 150 +G Q + ++WVLP+ S L + Sbjct: 129 PTGRLPYGPQVGRLEGAEVWVLPSTSPLGHTHFQ 162 >UniRef50_A9VA78 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VA78_MONBE Length = 239 Score = 67.4 bits (163), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 49/162 (30%), Positives = 76/162 (46%), Gaps = 21/162 (12%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 D+L P LRV+ GINPG+ SA G +A W+ G Y Sbjct: 52 DLLLPNLRVLIIGINPGVWSAARGHHYAAGVELGWQTDMSCG---------------QYG 96 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA----YEQGFSQR 119 G T V R T ++ K E+ G +L + I++YQP A GK +++ FS++ Sbjct: 97 IGFTNSVARTTRGQMDLDKSEIEDGVARLRQLIQEYQPAIAAFNGKGVSEIFFKRQFSRK 156 Query: 120 GAQWGKQTLTIGSTQIWVLP--NPSGLSRVSLEKLVEAYREL 159 G Q TIG T+++++P +P G +R + + +REL Sbjct: 157 VTTLGLQQETIGETRLYIMPSTSPRGATRPRWQDKLPFFREL 198 >UniRef50_UPI0000D8867B hypothetical protein CIMG_10236 n=1 Tax=Coccidioides immitis RS RepID=UPI0000D8867B Length = 218 Score = 67.0 bits (162), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 31/79 (39%), Positives = 50/79 (63%), Gaps = 1/79 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + GL ++F G+NPG+ + TGF +AHP+N +WK+++ +G T RQ P + L Sbjct: 123 LLHDTIPQGLILLFIGVNPGIMTGQTGFVYAHPSNLYWKLLHWSGITMRQHPPSDTYRLP 182 Query: 61 D-YRCGVTKLVDRPTVQAN 78 + Y G T +V RPT A+ Sbjct: 183 ELYGVGNTNIVARPTRDAS 201 >UniRef50_C4Y7L8 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y7L8_CLAL4 Length = 427 Score = 61.2 bits (147), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 56/201 (27%), Positives = 88/201 (43%), Gaps = 42/201 (20%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA----------GFTDRQLKPQ 54 +++ LR++F G NPG+ SA +AHP+N FWK+ Q+ G +D L + Sbjct: 226 MISDSLRLLFIGFNPGVQSAQMQHHYAHPSNLFWKLFNQSALLPQVAQAQGISDHPLVLE 285 Query: 55 EAQ------------HLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQ 102 Q L+D G T LV R T A+ +S E A +++ +I Sbjct: 286 LFQDGKGLASFRDDGRLVDVGIGFTDLVLRCTRAAHHLSPSEKTANVPRVMREIRSSNAS 345 Query: 103 ALAILGKQAYE------QGFSQRGAQWGKQTLTIGST---------QIWVLPNPSGL--S 145 + ++GK +E G R +WG QT + Q++++P+ SGL S Sbjct: 346 HVVVVGKGVWETFVRAFTGDKIRHFEWGLQTDATANAFYRQCGRRLQVYLVPSTSGLVAS 405 Query: 146 RVSLEKL---VEAYRELDQAL 163 EKL VE R + + L Sbjct: 406 MTFGEKLALWVEIARSVSEDL 426 >UniRef50_C7DIS9 Putative uncharacterized protein n=2 Tax=unclassified Euryarchaeota RepID=C7DIS9_9EURY Length = 189 Score = 60.1 bits (144), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 45/168 (26%), Positives = 80/168 (47%), Gaps = 14/168 (8%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG---------FTDRQL-KPQE 55 L+ ++F GINP S PF++ FW ++ +A D++L + E Sbjct: 18 LSKNALIMFVGINPHPGSYKRKVPFSN-NKMFWYLLSRAHIIEEPTEYLMNDKKLVQMYE 76 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 + + +YR LVDRPT +++ K E G ++ + I Y+P+ + +GK YE+ Sbjct: 77 GKFVQEYRLNFINLVDRPTSDVSKLRKGEERPGAARVFDSIRIYKPRLVCFIGKITYEKF 136 Query: 116 FSQRGAQWG-KQTLTIGSTQIWVLP-NPSGLSRVS-LEKLVEAYRELD 160 + +G KQ + ST + P + R+ L+ L++A R +D Sbjct: 137 SGSKHTDYGYKQPIYGISTYVCRFPIRGPAIDRIKELKALIKAARGID 184 >UniRef50_A5EPR0 G/U mismatch-specific uracil-DNA glycosylase n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EPR0_BRASB Length = 166 Score = 57.0 bits (136), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 47/144 (32%), Positives = 67/144 (46%), Gaps = 4/144 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L PGLR+VF G SA TG +AHP NRFW ++ G T R +P E LL Sbjct: 4 LPDQLKPGLRLVFVGTAASARSAATGHYYAHPGNRFWPTLFAVGLTPRLYRPHEFPALLS 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-RG 120 G T L + + K + G KI ++P+ +A K+A + + G Sbjct: 64 LGIGFTDLCKQGAGMDHVALKAGVDVAG--FAAKIRRHRPRTIAFTSKKAASLFYRKPTG 121 Query: 121 A-QWGKQTLTIGSTQIWVLPNPSG 143 A G+Q+ +VL +PSG Sbjct: 122 AIALGRQSEQDDFPTAFVLASPSG 145 >UniRef50_Q1Q0C1 Similar to G/U-mismatch-specific DNA glycolase n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q0C1_9BACT Length = 170 Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 39/159 (24%), Positives = 73/159 (45%), Gaps = 13/159 (8%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD---------- 61 ++F GINP S PF++ FW ++ +AG + + L+ Sbjct: 11 ILFVGINPHPGSFNRHVPFSN-NKLFWYLLSEAGLIKEKREELRDDKTLERVYKEKFNTV 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G ++DRPT + K E G +K+ I+ P+ + +GK +YE+ + Sbjct: 70 YGLGFVNIIDRPTRDITGIKKHEELPGRKKISRIIKAEMPKVVCFIGKVSYEKYAGSKDF 129 Query: 122 QWGKQTLTIGSTQIWVLPNP-SGLSRVSLEKLVEAYREL 159 +G Q IG+++++V+ P G + + + +L E R + Sbjct: 130 SFGWQE-NIGASKVFVMHFPLRGEALIRVRELQEIKRTI 167 >UniRef50_A3LQ36 Predicted protein n=1 Tax=Pichia stipitis RepID=A3LQ36_PICST Length = 270 Score = 52.4 bits (124), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 46/198 (23%), Positives = 72/198 (36%), Gaps = 59/198 (29%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGF------------------T 47 L L ++F G NPG+ S+ +AH +N FWK+ Q+G Sbjct: 50 LVENLTLLFIGFNPGMESSLQQHHYAHFSNLFWKLFNQSGLLLQCLGVLDPQYLSHNYDN 109 Query: 48 DRQL-----------KPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKI 96 D L KP+ L+ Y+ G T L+ R T A E+ E A +LI++ Sbjct: 110 DELLQVLVKDGTTYAKPEHDYELIKYKIGFTDLILRCTRTAQELPMAEKLANVPRLIDEF 169 Query: 97 EDYQPQALAILGKQAYE--------------QGFSQRGAQWGK----------------Q 126 + + +GK +E S+ WG Q Sbjct: 170 NMSSSKHIVFIGKGIWEVIVKYVEVELGIKKVKLSKETFMWGLQDSQNSRLYALVLKKFQ 229 Query: 127 TLTIGSTQIWVLPNPSGL 144 + ++++V PN SGL Sbjct: 230 SKISADSKVFVFPNTSGL 247 >UniRef50_C5BH73 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BH73_EDWI9 Length = 63 Score = 50.8 bits (120), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 24/47 (51%), Positives = 30/47 (63%) Query: 22 SSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTK 68 + G G FA+P NRFW VI+ AGFTDR L PQ HLL G+++ Sbjct: 7 TEPGPGDHFANPRNRFWSVIHLAGFTDRPLAPQAEPHLLIPAVGLSR 53 >UniRef50_B7Q2I4 G/T mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Ixodes scapularis RepID=B7Q2I4_IXOSC Length = 236 Score = 50.4 bits (119), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 28/110 (25%), Positives = 53/110 (48%) Query: 39 KVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIED 98 + ++ AG + + L+ Y G T +V R T + ++S++E+ G L++K++ Sbjct: 24 RCLFLAGLIPEPMTAVDDFKLIQYGIGFTNIVARTTRSSADLSRKEIKEGAEILMDKLKI 83 Query: 99 YQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVS 148 +QP+ GK YE + +G+Q I T + V PS +R + Sbjct: 84 FQPRIAVFNGKGIYEIFSGNKNFVFGRQPDPIEGTNVSVFVMPSSSARCA 133 >UniRef50_C5MI19 Predicted protein n=1 Tax=Candida tropicalis MYA-3404 RepID=C5MI19_CANTT Length = 267 Score = 48.5 bits (114), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 32/127 (25%), Positives = 54/127 (42%), Gaps = 19/127 (14%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ-------- 57 L+ L V+F G N G SA +AH N+FWK+ ++ ++ Q Sbjct: 60 LSNKLHVLFIGFNAGEKSALKQHRYAHHTNQFWKLFNESNLLEKVTTSQNITKDDTLEKL 119 Query: 58 -----------HLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAI 106 HL++Y G T LV+R T +E+ E +L+++ + + I Sbjct: 120 LSHGCKAVHDYHLVNYGIGFTSLVERSTKNTSELGITEKLESVPRLLKEWNESNAVEIVI 179 Query: 107 LGKQAYE 113 LGK ++ Sbjct: 180 LGKGIWD 186 >UniRef50_B9WEN4 G/U mismatch-specific uracil DNA glycosylase, putative (Uracil mismatch repair protein, putative) (Dna repair protein, putative) n=2 Tax=Candida RepID=B9WEN4_CANDC Length = 292 Score = 48.5 bits (114), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 34/137 (24%), Positives = 58/137 (42%), Gaps = 29/137 (21%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ-------- 57 L L+V+F G NPG SA +AH +N FWK+ Q+ + + Q Sbjct: 60 LNDHLQVLFVGFNPGTESAIQQHHYAHHSNLFWKLFNQSQLLHKVISKNNQQEEQQIVLD 119 Query: 58 --------HLLD-------------YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKI 96 HLL+ Y G + LV R T +A++++ E +L+ + Sbjct: 120 DSNDQFLNHLLENGCNATHDFKLIKYNIGFSDLVLRSTARADQLTISEKLENVPRLLSEF 179 Query: 97 EDYQPQALAILGKQAYE 113 + + + + I+GK +E Sbjct: 180 KSSRVETIVIVGKGIWE 196 >UniRef50_Q7VJA2 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VJA2_HELHP Length = 188 Score = 43.5 bits (101), Expect = 0.003, Method: Compositional matrix adjust. Identities = 38/152 (25%), Positives = 67/152 (44%), Gaps = 10/152 (6%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE----AQHLLDYRCGV 66 RV+ G P + S F +AH NRFW ++ + ++ Q+ Q LL + + Sbjct: 36 RVLILGSFPSVISRDEQFYYAHSHNRFWHILATLFTPEIDIRAQDRDAKKQFLLTHHIAL 95 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 +V R N +A + + + A+ G++AYE +G Q Sbjct: 96 WDIV-RECDICNSSDSTLSNAKPNDISVILSSAKINAIFCNGRKAYELFIR----FFGSQ 150 Query: 127 TLTIGSTQIWVLPNPS-GLSRVSLEKLVEAYR 157 ++++LP+ S +R SLEKLV+++R Sbjct: 151 MCETQEIKVFLLPSSSPANARYSLEKLVDSWR 182 >UniRef50_D0N7I8 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7I8_PHYIN Length = 240 Score = 43.1 bits (100), Expect = 0.004, Method: Compositional matrix adjust. Identities = 41/177 (23%), Positives = 77/177 (43%), Gaps = 23/177 (12%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-------GFTDRQLKPQEAQHLLD- 61 LR++ G NP + TG+ +++P NR W ++ G + E L D Sbjct: 53 LRLLIVGHNPSDHAWKTGYSYSNPTNRMWMLLTGTLSPHSWEGIVPSTARIDEQNILPDV 112 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIE----DYQPQALAILGKQAYEQGFS 117 + G T + P A++ K + A +++ + P +A GK+ + FS Sbjct: 113 HGVGFTSIGLEPGNDASKYGKATMEAWRTSFFQRLRRHALPHAPILIAFSGKRQFSYLFS 172 Query: 118 ---QRGAQWGKQT-LTIG-------STQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 + +GKQ+ L G +++WVLP+ SG + ++ + Y++L + L Sbjct: 173 PALSKIESYGKQSKLPPGWPHELRADSEVWVLPSSSGRAAMTTAQRTLPYQQLAERL 229 >UniRef50_Q4T9X4 Chromosome undetermined SCAF7485, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4T9X4_TETNG Length = 482 Score = 41.6 bits (96), Expect = 0.010, Method: Compositional matrix adjust. Identities = 20/52 (38%), Positives = 31/52 (59%) Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 YR G T +V R T + ++S +EL GG+ L+EK++ ++P GK YE Sbjct: 241 YRMGFTNMVARATPGSKDLSSKELREGGKILVEKLKKFKPLIAVFNGKCIYE 292 >UniRef50_C1EDR1 Predicted protein n=2 Tax=Micromonas RepID=C1EDR1_9CHLO Length = 302 Score = 40.8 bits (94), Expect = 0.018, Method: Compositional matrix adjust. Identities = 42/195 (21%), Positives = 76/195 (38%), Gaps = 39/195 (20%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA---------------------GFTD 48 L+++ G NP S G+G+ +++P+N FWK++ + GFTD Sbjct: 93 LKLLIIGHNPSEHSWGSGWSYSNPSNNFWKLLAKGRIVPEDWTKEDCPRLPGDLGIGFTD 152 Query: 49 RQLKP------QEAQHLLDYRCGV-TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP 101 P + +R + +L A+ ++ G R + + P Sbjct: 153 AGTVPGNDAGAYNRATMKKWRSDLYARLRGHVARCADASGDEDARVGHRSSMSIASRHGP 212 Query: 102 QALAILGKQAYEQGFSQRGAQ--WGKQ---------TLTIGSTQIWVLPNPSGLSRVSLE 150 +A GK+ Y Q F ++ G+Q L T++WV+P+ SG + ++ E Sbjct: 213 DVVAFSGKRQYAQLFDAVPSRVDAGRQPPESLPPGWPLCPERTEVWVMPSSSGRAAMTRE 272 Query: 151 KLVEAYRELDQALVV 165 + EL L Sbjct: 273 AREGPWVELGTRLAT 287 >UniRef50_UPI00017F04BF PREDICTED: similar to thymine-DNA glycosylase n=1 Tax=Sus scrofa RepID=UPI00017F04BF Length = 300 Score = 40.0 bits (92), Expect = 0.031, Method: Compositional matrix adjust. Identities = 38/147 (25%), Positives = 52/147 (35%), Gaps = 10/147 (6%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 DIL L +V GINPGL +A G + P N FW + R + AQ + Y Sbjct: 124 DILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWLCYVMPSSSARCAQFPRAQDKVHYY 183 Query: 64 CGVTKLVDRPTVQANEVSKQEL---------HAGGRKLIEKIEDYQPQALAILGKQAYEQ 114 + L D+ QE+ +K+ K E Y P A G E Sbjct: 184 IKLKDLRDQLKGIERNTDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEAAYGGAYGEN 243 Query: 115 GFSQRGAQWGKQTLT-IGSTQIWVLPN 140 S + L G + +PN Sbjct: 244 PCSSEPCSFASNGLNDSGDSTFSDIPN 270 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7MB04 G/U mismatch-specific DNA glycosylase n=70 Tax=E... 252 3e-66 UniRef50_B5REH0 G/U mismatch-specific DNA glycosylase n=51 Tax=E... 238 5e-62 UniRef50_UPI0000E46F0A PREDICTED: similar to G/T mismatch-specif... 229 2e-59 UniRef50_UPI0001792C18 PREDICTED: similar to GA15170-PA n=3 Tax=... 227 1e-58 UniRef50_B4L7B8 GI14092 n=1 Tax=Drosophila mojavensis RepID=B4L7... 227 1e-58 UniRef50_UPI0000DB6DF5 PREDICTED: similar to Thd1 CG1981-PA n=2 ... 226 2e-58 UniRef50_Q9XYZ3 G/T mismatch-specific thymine DNA glycosylase n=... 226 2e-58 UniRef50_Q9V4D8 Thd1 n=1 Tax=Drosophila melanogaster RepID=Q9V4D... 226 2e-58 UniRef50_B4MF03 GJ13949 n=2 Tax=Drosophila virilis RepID=B4MF03_... 226 2e-58 UniRef50_B4JZW0 GH23925 n=1 Tax=Drosophila grimshawi RepID=B4JZW... 226 3e-58 UniRef50_A6CPG7 G/U mismatch-specific DNA glycosylase n=2 Tax=Ba... 225 3e-58 UniRef50_B4NHL6 GK13687 n=2 Tax=Coelomata RepID=B4NHL6_DROWI 225 4e-58 UniRef50_B3MY42 GF22816 n=1 Tax=Drosophila ananassae RepID=B3MY4... 225 4e-58 UniRef50_UPI000186DD28 G/t mismatch-specific thymine DNA glycosy... 224 1e-57 UniRef50_C4QAV8 G/t mismatch-specific thymine DNA glycosylase, p... 221 5e-57 UniRef50_UPI0000D578C1 PREDICTED: similar to Thd1 n=1 Tax=Tribol... 221 7e-57 UniRef50_C7PGY3 Uracil-DNA glycosylase superfamily n=1 Tax=Chiti... 218 8e-56 UniRef50_A7S194 Predicted protein n=1 Tax=Nematostella vectensis... 217 9e-56 UniRef50_UPI000178957E Uracil-DNA glycosylase superfamily n=1 Ta... 216 2e-55 UniRef50_UPI0001926FCB PREDICTED: similar to predicted protein n... 214 1e-54 UniRef50_Q1D9T2 G/U mismatch-specific DNA glycosylase n=2 Tax=Ba... 212 3e-54 UniRef50_Q39CW6 G/U mismatch-specific uracil-DNA glycosylase n=1... 211 6e-54 UniRef50_D2B5F3 G:T/U mismatch-specific DNA glycosylase-like pro... 211 1e-53 UniRef50_Q67QW8 G/U mismatch-specific DNA glycosylase n=1 Tax=Sy... 210 2e-53 UniRef50_Q46SQ2 G/U mismatch-specific uracil-DNA glycosylase n=4... 209 3e-53 UniRef50_Q4T9X5 Chromosome undetermined SCAF7485, whole genome s... 208 6e-53 UniRef50_UPI0000DB71A4 PREDICTED: similar to Thd1 CG1981-PA, par... 208 8e-53 UniRef50_A8LC62 Uracil-DNA glycosylase superfamily n=16 Tax=Bact... 206 2e-52 UniRef50_UPI0001926888 PREDICTED: similar to GL18173 n=1 Tax=Hyd... 206 3e-52 UniRef50_Q0RRS7 DNA glycosylase, G/U mismatch-specific n=2 Tax=F... 205 4e-52 UniRef50_Q9K3I5 G/U mismatch-specific DNA glycosylase n=3 Tax=St... 203 2e-51 UniRef50_Q13569 G/T mismatch-specific thymine DNA glycosylase n=... 203 2e-51 UniRef50_C6WCQ8 Uracil-DNA glycosylase superfamily n=1 Tax=Actin... 203 3e-51 UniRef50_Q47NL8 G/U mismatch-specific uracil-DNA glycosylase n=1... 202 4e-51 UniRef50_C6B6I6 Uracil-DNA glycosylase superfamily n=2 Tax=Rhizo... 202 4e-51 UniRef50_Q1J3D2 Uracil-DNA glycosylase superfamily n=1 Tax=Deino... 201 5e-51 UniRef50_B9J7K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=... 197 1e-49 UniRef50_C7IMH6 Uracil-DNA glycosylase superfamily n=1 Tax=Clost... 194 1e-48 UniRef50_C0ZBM4 Putative uncharacterized protein n=1 Tax=Breviba... 191 6e-48 UniRef50_A6SED0 Putative uncharacterized protein n=2 Tax=Sclerot... 191 7e-48 UniRef50_B8I864 Uracil-DNA glycosylase superfamily n=1 Tax=Clost... 191 8e-48 UniRef50_B2A9Z4 Predicted CDS Pa_1_2290 n=2 Tax=Sordariales RepI... 186 2e-46 UniRef50_B8N438 Mismatch-specific thymine-DNA glycosylase, putat... 185 6e-46 UniRef50_A2QSA7 Function: In the DNA of higher eukaryotes n=48 T... 185 7e-46 UniRef50_C8XAI4 Uracil-DNA glycosylase superfamily n=3 Tax=Actin... 185 8e-46 UniRef50_D1CA47 Uracil-DNA glycosylase superfamily n=1 Tax=Sphae... 183 2e-45 UniRef50_C9SR06 G/U mismatch-specific uracil DNA glycosylase n=4... 183 2e-45 UniRef50_A9VA78 Predicted protein n=1 Tax=Monosiga brevicollis R... 182 4e-45 UniRef50_B2W4Q2 G/U mismatch-specific DNA glycosylase n=2 Tax=Pl... 181 6e-45 UniRef50_Q6C2V5 YALI0F04796p n=1 Tax=Yarrowia lipolytica RepID=Q... 181 1e-44 UniRef50_C8NV44 G/U mismatch-specific DNA glycosylase n=2 Tax=Co... 180 1e-44 UniRef50_B7GT25 Uracil-DNA glycosylase superfamily n=6 Tax=Actin... 180 2e-44 UniRef50_B2GKB6 G/U mismatch-specific DNA glycosylase n=6 Tax=Ba... 179 4e-44 UniRef50_A6GLD3 G/U mismatch-specific DNA glycosylase n=1 Tax=Li... 176 2e-43 UniRef50_A7HGS2 Uracil-DNA glycosylase superfamily n=1 Tax=Anaer... 176 3e-43 UniRef50_B2IIR8 Uracil-DNA glycosylase superfamily n=2 Tax=Bacte... 176 3e-43 UniRef50_C5SKW3 Uracil-DNA glycosylase superfamily n=1 Tax=Astic... 173 3e-42 UniRef50_Q7SDB2 Predicted protein n=2 Tax=Sordariaceae RepID=Q7S... 169 4e-41 UniRef50_C6H6H6 Uracil DNA glycosylase superfamily protein n=1 T... 168 7e-41 UniRef50_A7HV32 Uracil-DNA glycosylase superfamily n=3 Tax=Rhizo... 168 7e-41 UniRef50_A8P541 Putative uncharacterized protein n=1 Tax=Coprino... 168 8e-41 UniRef50_O59825 G/U mismatch-specific uracil DNA glycosylase n=1... 167 1e-40 UniRef50_C8RZ13 Uracil-DNA glycosylase superfamily n=1 Tax=Rhodo... 164 7e-40 UniRef50_A8Q5A6 Putative uncharacterized protein n=1 Tax=Malasse... 164 1e-39 UniRef50_C4Y7L8 Putative uncharacterized protein n=1 Tax=Clavisp... 160 2e-38 UniRef50_B0E0R1 Predicted protein n=7 Tax=Laccaria bicolor S238N... 157 1e-37 UniRef50_Q1IXM1 Uracil-DNA glycosylase superfamily n=3 Tax=Deino... 155 6e-37 UniRef50_B3QI54 Uracil-DNA glycosylase superfamily n=3 Tax=Rhodo... 154 1e-36 UniRef50_Q5KFR6 Expressed protein n=2 Tax=Filobasidiella neoform... 151 1e-35 UniRef50_B7Q2I4 G/T mismatch-specific thymine DNA glycosylase, p... 149 2e-35 UniRef50_A3LQ36 Predicted protein n=1 Tax=Pichia stipitis RepID=... 149 2e-35 UniRef50_Q1Q0C1 Similar to G/U-mismatch-specific DNA glycolase n... 145 7e-34 UniRef50_C7DIS9 Putative uncharacterized protein n=2 Tax=unclass... 142 4e-33 UniRef50_A5EPR0 G/U mismatch-specific uracil-DNA glycosylase n=1... 141 1e-32 UniRef50_B9WEN4 G/U mismatch-specific uracil DNA glycosylase, pu... 139 3e-32 UniRef50_Q4P6C9 Putative uncharacterized protein n=1 Tax=Ustilag... 132 5e-30 UniRef50_C5MI19 Predicted protein n=1 Tax=Candida tropicalis MYA... 127 2e-28 UniRef50_UPI0000D8867B hypothetical protein CIMG_10236 n=1 Tax=C... 112 5e-24 UniRef50_C5BH73 Putative uncharacterized protein n=1 Tax=Edwards... 58 1e-07 Sequences not found previously or not previously below threshold: UniRef50_Q6BL09 DEHA2F17270p n=2 Tax=Saccharomycetaceae RepID=Q6... 133 3e-30 UniRef50_A5DZA6 Putative uncharacterized protein n=2 Tax=Sacchar... 129 3e-29 UniRef50_Q4T9X4 Chromosome undetermined SCAF7485, whole genome s... 91 1e-17 UniRef50_D0N7I8 Putative uncharacterized protein n=1 Tax=Phytoph... 91 1e-17 UniRef50_C1EDR1 Predicted protein n=2 Tax=Micromonas RepID=C1EDR... 89 5e-17 UniRef50_A8IDF6 Predicted protein n=1 Tax=Chlamydomonas reinhard... 88 1e-16 UniRef50_UPI00017F04BF PREDICTED: similar to thymine-DNA glycosy... 76 5e-13 UniRef50_Q2FQM0 Putative uncharacterized protein n=1 Tax=Methano... 70 3e-11 UniRef50_C3X2Y9 Mug G:T/U mismatch-specific DNA glycosylase n=1 ... 70 3e-11 UniRef50_Q1VS76 G/U mismatch-specific DNA glycosylase, putative ... 68 1e-10 UniRef50_C6X5D8 G:T/U mismatch-specific uracil/thymine DNA-glyco... 65 8e-10 UniRef50_C9RN39 Mug G:T/U mismatch-specific DNA glycosylase n=1 ... 64 1e-09 UniRef50_Q7MVW9 G/U mismatch-specific DNA glycosylase, putative ... 63 4e-09 UniRef50_A6SVT3 G:T/U mismatch-specific DNA glycosylase n=21 Tax... 63 6e-09 UniRef50_B9TDK4 Putative uncharacterized protein n=1 Tax=Ricinus... 62 6e-09 UniRef50_B2KDM2 G:T/U mismatch-specific DNA glycosylase-like pro... 62 7e-09 UniRef50_Q7VJA2 Putative uncharacterized protein n=1 Tax=Helicob... 62 8e-09 UniRef50_D0WFV1 Putative G/U mismatch-specific DNA glycosylase n... 62 9e-09 UniRef50_C6QE22 G:T/U mismatch-specific DNA glycosylase-like pro... 61 2e-08 UniRef50_B0T5Y6 Uracil-DNA glycosylase superfamily n=10 Tax=Prot... 60 3e-08 UniRef50_C5AAU4 Uracil-DNA glycosylase superfamily protein n=8 T... 60 3e-08 UniRef50_C8WEM8 Uracil-DNA glycosylase superfamily n=4 Tax=Zymom... 60 4e-08 UniRef50_Q82UD2 Putative uncharacterized protein n=1 Tax=Nitroso... 59 5e-08 UniRef50_C4FAG7 Putative uncharacterized protein n=2 Tax=Collins... 59 8e-08 UniRef50_A6Q7V9 Putative uncharacterized protein n=1 Tax=Sulfuro... 58 8e-08 UniRef50_Q1GYD6 G/U mismatch-specific uracil-DNA glycosylase n=1... 58 9e-08 UniRef50_C8WJG8 Putative uncharacterized protein n=8 Tax=Bacteri... 58 9e-08 UniRef50_C9KQM1 Putative G/U mismatch-specific DNA glycosylase n... 58 1e-07 UniRef50_Q8CUP2 Hypothetical conserved protein n=1 Tax=Oceanobac... 58 1e-07 UniRef50_A0R287 T/U mismatch-specific DNA glycosylase n=5 Tax=My... 58 2e-07 UniRef50_A6TMT4 Putative uncharacterized protein n=1 Tax=Alkalip... 57 3e-07 UniRef50_Q8PWD8 Conserved protein n=3 Tax=Methanosarcina RepID=Q... 57 3e-07 UniRef50_A5VDD8 Uracil-DNA glycosylase superfamily n=8 Tax=Prote... 57 3e-07 UniRef50_A9KMN2 G:T/U mismatch-specific DNA glycosylase n=2 Tax=... 56 5e-07 UniRef50_B1HZQ2 Putative uncharacterized protein n=2 Tax=Bacilla... 56 5e-07 UniRef50_Q31G15 Uracil DNA glycosylase superfamily protein n=1 T... 56 7e-07 UniRef50_UPI00016AE856 G:T/U mismatch-specific DNA glycosylase-l... 55 7e-07 UniRef50_C6CSS2 Putative uncharacterized protein n=1 Tax=Paeniba... 54 1e-06 UniRef50_C2FY74 G/U mismatch-specific DNA glycosylase n=2 Tax=Sp... 54 2e-06 UniRef50_C9M6L9 Putative G/U mismatch-specific DNA glycosylase n... 53 3e-06 UniRef50_C7N6G7 YjeF-like protein n=6 Tax=Bacteria RepID=C7N6G7_... 53 3e-06 UniRef50_Q0F2S7 Putative uncharacterized protein n=1 Tax=Maripro... 53 3e-06 UniRef50_A6DAL5 Putative uncharacterized protein n=1 Tax=Caminib... 53 4e-06 UniRef50_C3XK88 T/U mismatch-specific DNA glycosylase n=1 Tax=He... 52 5e-06 UniRef50_A1WBA7 Putative uncharacterized protein n=17 Tax=Proteo... 52 6e-06 UniRef50_Q1NE08 Uracil-DNA glycosylase superfamily protein n=1 T... 52 7e-06 UniRef50_B7ATH8 Putative uncharacterized protein n=1 Tax=Bactero... 52 8e-06 UniRef50_B7CDR6 Putative uncharacterized protein n=1 Tax=Eubacte... 52 9e-06 UniRef50_A7ZBI7 Mug G:T/U mismatch-specific DNA glycosylase n=2 ... 52 9e-06 UniRef50_D1PNX0 T/U mismatch-specific DNA glycosylase n=1 Tax=Su... 51 1e-05 UniRef50_B9KXL9 Phage DNA polymerase-related protein n=17 Tax=ce... 51 1e-05 UniRef50_B8GKM7 Putative uncharacterized protein n=1 Tax=Methano... 51 1e-05 UniRef50_C4LCA1 T/U mismatch-specific DNA glycosylase n=10 Tax=B... 51 2e-05 UniRef50_C8W9T7 Uracil-DNA glycosylase superfamily n=1 Tax=Atopo... 51 2e-05 UniRef50_B2UPH3 Uracil-DNA glycosylase superfamily n=2 Tax=Chlam... 50 2e-05 UniRef50_C6XAZ6 G/U mismatch-specific uracil-DNA glycosylase n=4... 50 3e-05 UniRef50_B9LA93 Putative uncharacterized protein n=1 Tax=Nautili... 50 4e-05 UniRef50_A8U688 Putative uncharacterized protein n=1 Tax=Carnoba... 49 5e-05 UniRef50_B2V9S9 Phage SPO1 DNA polymerase-related protein n=18 T... 49 6e-05 UniRef50_A6Q419 Putative uncharacterized protein n=1 Tax=Nitrati... 49 8e-05 UniRef50_Q5P2M8 T/U mismatch-specific DNA glycosylase n=30 Tax=B... 49 8e-05 UniRef50_A1KBL3 Putative uncharacterized protein n=1 Tax=Azoarcu... 48 1e-04 UniRef50_C4ZBZ4 G:T/U mismatch-specific DNA glycosylase n=20 Tax... 47 1e-04 UniRef50_C5K9X8 Putative uncharacterized protein n=2 Tax=Perkins... 47 2e-04 UniRef50_A2U3I4 Putative uncharacterized protein n=2 Tax=Polarib... 47 2e-04 UniRef50_B1C3U7 Putative uncharacterized protein n=1 Tax=Clostri... 47 2e-04 UniRef50_D0J0R4 Putative uncharacterized protein n=6 Tax=Burkhol... 47 2e-04 UniRef50_A9AVD1 G:T/U mismatch-specific DNA glycosylase-like pro... 47 3e-04 UniRef50_B0EFS1 Putative uncharacterized protein n=4 Tax=Entamoe... 46 4e-04 UniRef50_B7KM64 Phage SPO1 DNA polymerase-related protein n=1 Ta... 46 4e-04 UniRef50_A8MCZ3 Uracil-DNA glycosylase superfamily n=4 Tax=Therm... 46 5e-04 UniRef50_A8A982 Phage SPO1 DNA polymerase-related protein n=1 Ta... 45 0.001 UniRef50_C4Z504 DNA glycosylase n=4 Tax=Clostridiales RepID=C4Z5... 45 0.001 UniRef50_C6MEU9 Putative uncharacterized protein n=1 Tax=Nitroso... 44 0.002 UniRef50_A7I1K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=... 44 0.002 UniRef50_B6R445 Bacteriophage-type DNA polymerase N-terminal dom... 44 0.002 UniRef50_A3ZBY1 T/U mismatch-specific DNA glycosylase n=15 Tax=C... 44 0.002 UniRef50_B0VFE1 Uracil-DNA glycosylase n=1 Tax=Candidatus Cloaca... 44 0.003 UniRef50_C5EZE8 Putative uncharacterized protein n=2 Tax=Helicob... 43 0.003 UniRef50_B9CP28 T/U mismatch-specific DNA glycosylase n=1 Tax=At... 43 0.003 UniRef50_Q6L0J8 DNA polymerase, bacteriophage-type n=1 Tax=Picro... 43 0.003 UniRef50_A0RQD6 Mug G:T/U mismatch-specific DNA glycosylase n=6 ... 43 0.003 UniRef50_C0GI50 Phage SPO1 DNA polymerase-related protein n=1 Ta... 43 0.003 UniRef50_A7IAF7 Putative uncharacterized protein n=1 Tax=Candida... 43 0.003 UniRef50_C4G9N7 Putative uncharacterized protein n=1 Tax=Shuttle... 43 0.004 UniRef50_A8VW39 Putative uncharacterized protein n=1 Tax=Bacillu... 43 0.005 UniRef50_A0B978 Phage SPO1 DNA polymerase-related protein n=1 Ta... 42 0.005 UniRef50_UPI0001BC5AE8 DNA glycosylase n=1 Tax=Fusobacterium sp.... 42 0.005 UniRef50_D1PL52 Uracil-DNA glycosylase n=5 Tax=Bacteria RepID=D1... 42 0.006 UniRef50_A4B987 Putative uncharacterized protein n=1 Tax=Reineke... 42 0.007 UniRef50_Q04GE7 G:T/U mismatch-specific DNA glycosylase n=2 Tax=... 42 0.007 UniRef50_D1CIM8 Uracil-DNA glycosylase superfamily n=1 Tax=Therm... 42 0.008 UniRef50_A8RDC8 Putative uncharacterized protein n=1 Tax=Eubacte... 42 0.008 UniRef50_A9H0D0 Uracil-DNA glycosylase n=3 Tax=Bacteria RepID=A9... 42 0.009 UniRef50_Q3SAA1 Uracyl DNA glycosylase n=4 Tax=Euryarchaeota Rep... 42 0.009 UniRef50_B3DV47 Uracil-DNA glycosylase n=1 Tax=Methylacidiphilum... 41 0.011 UniRef50_A5FBU1 G:T/U mismatch-specific DNA glycosylase-like pro... 41 0.016 UniRef50_A1RYC0 Phage SPO1 DNA polymerase-related protein n=23 T... 41 0.016 UniRef50_B2IF85 Phage SPO1 DNA polymerase-related protein n=1 Ta... 41 0.018 UniRef50_A0LFG4 Phage SPO1 DNA polymerase-related protein n=2 Ta... 41 0.018 UniRef50_A0RV64 Uracil-DNA glycosylase n=2 Tax=Thaumarchaeota Re... 41 0.022 UniRef50_A8UU74 N-terminus of phage SPO1 DNA polymerase n=3 Tax=... 40 0.027 UniRef50_Q2S6J8 Uracil-DNA glycosylase, family 4 n=2 Tax=Rhodoth... 40 0.028 UniRef50_Q2LTM2 Uracil DNA glycosylase superfamily protein n=1 T... 40 0.028 UniRef50_C1DCC6 Probable DNA-directed DNA polymerase, bacterioph... 40 0.031 UniRef50_A1WUV9 Phage SPO1 DNA polymerase-related protein n=3 Ta... 40 0.039 UniRef50_Q0W079 Predicted uracil-DNA glycosylase superfamily pro... 39 0.059 UniRef50_Q0EZR7 Phage SPO1 DNA polymerase domain protein n=1 Tax... 39 0.062 UniRef50_D2LEU1 Phage SPO1 DNA polymerase-related protein n=1 Ta... 39 0.064 UniRef50_A1B575 Putative uncharacterized protein n=1 Tax=Paracoc... 39 0.068 UniRef50_D1Y5H9 Uracil-DNA glycosylase n=1 Tax=Pyramidobacter pi... 39 0.074 UniRef50_B6IW73 Uracil-DNA glycosylase, family 4 protein, putati... 38 0.089 UniRef50_C1FF23 Predicted protein n=2 Tax=cellular organisms Rep... 38 0.096 >UniRef50_B7MB04 G/U mismatch-specific DNA glycosylase n=70 Tax=Enterobacteriaceae RepID=MUG_ECO45 Length = 168 Score = 252 bits (645), Expect = 3e-66, Method: Composition-based stats. Identities = 168/168 (100%), Positives = 168/168 (100%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL Sbjct: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG Sbjct: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR 168 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR Sbjct: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR 168 >UniRef50_B5REH0 G/U mismatch-specific DNA glycosylase n=51 Tax=Enterobacteriaceae RepID=MUG_SALG2 Length = 168 Score = 238 bits (608), Expect = 5e-62, Method: Composition-based stats. Identities = 134/167 (80%), Positives = 150/167 (89%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MV+DILAPGLRVVFCGINPGLSSA TGFPFAHPANRFWKVI+ AGFTDRQLKP+EA+ LL Sbjct: 1 MVKDILAPGLRVVFCGINPGLSSANTGFPFAHPANRFWKVIHLAGFTDRQLKPEEAEKLL 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 D+RCGVTKLVDRPTVQA EV EL +GGR LIEKIEDYQP ALA+LGKQA+EQGFSQRG Sbjct: 61 DFRCGVTKLVDRPTVQATEVKLHELRSGGRNLIEKIEDYQPAALAVLGKQAFEQGFSQRG 120 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 WGK+ + IG+T +WVLPNPSGL+R+ EKLVEAYRELDQAL++RG Sbjct: 121 IAWGKKKIAIGATMVWVLPNPSGLNRIKTEKLVEAYRELDQALIMRG 167 >UniRef50_UPI0000E46F0A PREDICTED: similar to G/T mismatch-specific thymine DNA glycosylase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46F0A Length = 588 Score = 229 bits (585), Expect = 2e-59, Method: Composition-based stats. Identities = 50/165 (30%), Positives = 78/165 (47%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DIL L +V GINPGL +A G +A P N FWK +Y +G + + L D Sbjct: 68 LPDILVHNLDIVIIGINPGLMAAYKGHHYAGPGNHFWKCLYLSGLVPEPMTCMDDVKLPD 127 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G T +V R T + ++ ++E+ G + ++EKI+ Y P GK YE ++ Sbjct: 128 FGVGFTNIVGRTTRGSADLKRKEIKEGAKVVVEKIQKYLPLIACFNGKGIYEIYSGKKDF 187 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 + G+Q I T+ V PS +R + V Y L + Sbjct: 188 EVGRQPENIPGTETVVYVMPSSSARCSQFPRAQDKVPFYITLKEL 232 >UniRef50_UPI0001792C18 PREDICTED: similar to GA15170-PA n=3 Tax=Coelomata RepID=UPI0001792C18 Length = 1013 Score = 227 bits (580), Expect = 1e-58, Method: Composition-based stats. Identities = 48/167 (28%), Positives = 83/167 (49%), Gaps = 8/167 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T + + LL Sbjct: 261 LPDHLTNNLDIVIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTPEPMTADDDYKLLQ 320 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V+R T + +++++E+ G + L+EK++ ++P+ GK +E ++ Sbjct: 321 NGIGFTNMVERATKGSADLTRKEIKEGSQILLEKLQRFKPKIAVFNGKLIFEVFSGKKDF 380 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T +WV+P+ S +R V Y L + Sbjct: 381 TFGRQPELVDGTNTYMWVMPSSS--ARCAQLPRAADKVPFYAALKKF 425 >UniRef50_B4L7B8 GI14092 n=1 Tax=Drosophila mojavensis RepID=B4L7B8_DROMO Length = 1737 Score = 227 bits (579), Expect = 1e-58, Method: Composition-based stats. Identities = 47/165 (28%), Positives = 82/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T Q+ ++ L+ Sbjct: 796 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQMSAEDDYTLVK 855 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 856 HGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 915 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 916 HFGRQPDRVEGTNTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 960 >UniRef50_UPI0000DB6DF5 PREDICTED: similar to Thd1 CG1981-PA n=2 Tax=Apocrita RepID=UPI0000DB6DF5 Length = 973 Score = 226 bits (578), Expect = 2e-58, Method: Composition-based stats. Identities = 49/168 (29%), Positives = 82/168 (48%), Gaps = 8/168 (4%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D L L ++ GINPGL +A G +A P N FWK ++ +G T L + LL Sbjct: 206 VLPDHLTNNLDIIIIGINPGLFAAYKGHHYAGPGNHFWKCLHLSGLTPEPLTADDDYKLL 265 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G T +V R T + +++++E+ G L+EK+ Y+P+ GK YE ++ Sbjct: 266 RFGIGFTNMVARATKGSADLTRKEIKEGSHILLEKLRKYKPKIAVFNGKLIYEVFSGKKD 325 Query: 121 AQWGKQTLTIGS--TQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T +WV+P+ S +R V Y L + Sbjct: 326 FGFGRQPNLVEGTNTYMWVMPSSS--ARCAQLPRAADKVPYYAALKKF 371 >UniRef50_Q9XYZ3 G/T mismatch-specific thymine DNA glycosylase n=7 Tax=Sophophora RepID=Q9XYZ3_DROME Length = 1095 Score = 226 bits (577), Expect = 2e-58, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 136 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 195 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 196 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 255 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 256 HFGRQPDRVDGTDTFIWVMPSSSARCAQLPRAADKVPFYAALKKF 300 >UniRef50_Q9V4D8 Thd1 n=1 Tax=Drosophila melanogaster RepID=Q9V4D8_DROME Length = 1738 Score = 226 bits (577), Expect = 2e-58, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 779 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 838 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 839 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 898 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 899 HFGRQPDRVDGTDTFIWVMPSSSARCAQLPRAADKVPFYAALKKF 943 >UniRef50_B4MF03 GJ13949 n=2 Tax=Drosophila virilis RepID=B4MF03_DROVI Length = 1038 Score = 226 bits (576), Expect = 2e-58, Method: Composition-based stats. Identities = 48/165 (29%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T QL + L+ Sbjct: 815 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQLSADDDYTLVK 874 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 875 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 934 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 935 HFGRQPDRVDGTNTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 979 >UniRef50_B4JZW0 GH23925 n=1 Tax=Drosophila grimshawi RepID=B4JZW0_DROGR Length = 1557 Score = 226 bits (576), Expect = 3e-58, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T Q+ + LL Sbjct: 611 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQMSADDDYTLLK 670 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ +QP+ GK +E ++ Sbjct: 671 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFQPKIAVFNGKLIFEVFSGKKEF 730 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 731 HFGRQPDRVDGTNTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 775 >UniRef50_A6CPG7 G/U mismatch-specific DNA glycosylase n=2 Tax=Bacillus RepID=A6CPG7_9BACI Length = 170 Score = 225 bits (575), Expect = 3e-58, Method: Composition-based stats. Identities = 57/162 (35%), Positives = 93/162 (57%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +VF G NP L S G +A+ +NRFWK+++ +G T RQ P+E LL Sbjct: 4 LPDHLKENLDIVFIGFNPSLKSGEVGHNYANKSNRFWKILHLSGLTPRQYAPEEDYTLLQ 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T +V+RPT +A +++K+E G L K++ Y P+ +GK Y+Q +R Sbjct: 64 LGYGLTNIVERPTKEAADITKEEYSEGRVILKNKLKKYSPKIACYVGKGVYQQFSKKRKI 123 Query: 122 QWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 +WG Q + + +V P+ SGL R+ ++++V Y+EL + Sbjct: 124 EWGIQEESVVEGVIDFVAPSSSGLVRMKIDEIVSIYKELTEL 165 >UniRef50_B4NHL6 GK13687 n=2 Tax=Coelomata RepID=B4NHL6_DROWI Length = 1761 Score = 225 bits (575), Expect = 4e-58, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 765 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 824 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 825 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 884 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 885 HFGRQPDRVEGTDTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 929 >UniRef50_B3MY42 GF22816 n=1 Tax=Drosophila ananassae RepID=B3MY42_DROAN Length = 1192 Score = 225 bits (575), Expect = 4e-58, Method: Composition-based stats. Identities = 50/165 (30%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E LL Sbjct: 657 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMNADEDHKLLR 716 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 717 QGIGFTNMVSRATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 776 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T ++ PS +R V Y L + Sbjct: 777 HFGRQPGRVDGTDTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 821 >UniRef50_UPI000186DD28 G/t mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186DD28 Length = 1239 Score = 224 bits (571), Expect = 1e-57, Method: Composition-based stats. Identities = 49/167 (29%), Positives = 81/167 (48%), Gaps = 8/167 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T + + LL Sbjct: 531 LPDHLTTNLDIVIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTPEPMTADDDYKLLS 590 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G + L+EK++ ++P+ GK YE + Sbjct: 591 VGIGFTNMVARATKGSADLTRKEIKEGSQILLEKLQKFKPKIAVFNGKLIYEVFSGNKDF 650 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +G+Q + T +WV+P+ S +R V Y L + Sbjct: 651 NFGRQPDFVDGTNTYMWVMPSSS--ARCAQLPRAADKVPFYAALKKF 695 >UniRef50_C4QAV8 G/t mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Schistosoma mansoni RepID=C4QAV8_SCHMA Length = 869 Score = 221 bits (565), Expect = 5e-57, Method: Composition-based stats. Identities = 52/165 (31%), Positives = 79/165 (47%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L GL +V GINP L+SA G +A P N FW + QAG + + +LD Sbjct: 96 LPDHLKEGLDIVIVGINPSLASAHVGHHYAGPGNHFWTCLSQAGLVPMAVSCYDDSKMLD 155 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G T + RPT A E++++E+ AG ++EK+ Y+P+ GK YE + Sbjct: 156 YGIGFTNVCTRPTKGAAELTRKEMKAGAAIMLEKMRKYKPKIAVFNGKGIYEAYVGHKNF 215 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 G+Q T+ T I + PS +R E + + L + Sbjct: 216 CMGRQPTTLDGTDIVIFVMPSSSARCAQLPRAEDKLPFFLALRKL 260 >UniRef50_UPI0000D578C1 PREDICTED: similar to Thd1 n=1 Tax=Tribolium castaneum RepID=UPI0000D578C1 Length = 702 Score = 221 bits (564), Expect = 7e-57, Method: Composition-based stats. Identities = 50/167 (29%), Positives = 84/167 (50%), Gaps = 8/167 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L ++ GINPGL +A G +A P N FWK +Y +G T +Q+ E LL Sbjct: 170 LPDHLTENLDIIIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQQQVSADEDYKLLQ 229 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G + L+EK+ ++P+ GK YE ++ Sbjct: 230 VGIGFTNMVQRATKGSADLTRKEIKEGSQILLEKLRKFRPKIAVFNGKLIYEVFSGKKDF 289 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 ++G+Q + T +WV+P+ S +R V Y L + Sbjct: 290 KFGRQPDLVDGTNTYMWVMPSSS--ARCAQLPRAADKVPFYAALKKF 334 >UniRef50_C7PGY3 Uracil-DNA glycosylase superfamily n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PGY3_CHIPD Length = 171 Score = 218 bits (555), Expect = 8e-56, Method: Composition-based stats. Identities = 69/164 (42%), Positives = 99/164 (60%), Gaps = 1/164 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++DI+ L V+FCGINPGL SA G F+ +NRFWKV++QAGFT ++ P +LD Sbjct: 5 LQDIITENLTVLFCGINPGLKSAVQGHHFSGRSNRFWKVLHQAGFTPHEIDPTLDSTILD 64 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T V+R TV+A+E+SK E KIE Y+P LA LGK AY ++ Sbjct: 65 FGYGLTTAVERATVRADELSKAEFDQSFETFKRKIEQYKPTYLAFLGKAAYMAFSEKKNV 124 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALV 164 QWG Q + + +WVLPN SGL+R +L L++ ++ L + Sbjct: 125 QWGLQEEPLCNAIVWVLPNTSGLNRGFTLSMLIDHFQALHNHIG 168 >UniRef50_A7S194 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7S194_NEMVE Length = 250 Score = 217 bits (554), Expect = 9e-56, Method: Composition-based stats. Identities = 51/164 (31%), Positives = 88/164 (53%), Gaps = 5/164 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D +A L ++F GINPGL+SA G +A P N FW +Y +G +L ++ + Sbjct: 55 LLPDRVAENLDILFIGINPGLTSAYKGHHYAGPNNHFWPCLYDSGLVPEKLTFRDDEKCP 114 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y G+T +V+R T ++++S++E+ G LI K++ +P GK YE FS+ Sbjct: 115 AYGIGLTNIVERTTRGSSDLSRKEIKDGVDALIVKVKRLKPLVACFNGKGIYEI-FSKSK 173 Query: 121 AQWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELD 160 + G+QT I T ++V+P+ SG + ++ + EL Sbjct: 174 CEIGRQTKCIPGTNVVVYVMPSSSGRTMTYPRASDKLKFFTELK 217 >UniRef50_UPI000178957E Uracil-DNA glycosylase superfamily n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178957E Length = 171 Score = 216 bits (551), Expect = 2e-55, Method: Composition-based stats. Identities = 54/160 (33%), Positives = 90/160 (56%), Gaps = 1/160 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D L GL +VF G NP L S G +A+P N FW++++++G T R + E LL Sbjct: 4 VSDHLDYGLTLVFIGFNPSLRSGEVGHHYANPRNNFWRILHKSGLTPRLYEAAEDGELLK 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V RPT A +++++E G L +K+E+Y+P+ +GK Y + + A Sbjct: 64 LGYGFTNIVARPTRGAEDITREEYRLGRETLRKKLEEYRPEVACFVGKGVYTEYSGKAKA 123 Query: 122 QWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELD 160 WG Q + + + +V P+ SGL R+ +++++ YR+L Sbjct: 124 DWGIQPDSVVDGVREFVAPSSSGLVRMPMDEIIGIYRQLH 163 >UniRef50_UPI0001926FCB PREDICTED: similar to predicted protein n=2 Tax=Hydra magnipapillata RepID=UPI0001926FCB Length = 226 Score = 214 bits (545), Expect = 1e-54, Method: Composition-based stats. Identities = 49/164 (29%), Positives = 84/164 (51%), Gaps = 3/164 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MV D+L L ++F GINPGL SA G ++ N FW +++++G L Q L Sbjct: 53 MVPDLLEEHLDIIFVGINPGLVSAYRGHHYSGNNNHFWPLLFESGLISEPLTWQNDYKCL 112 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y+ G+T +V R T A E++ E+ +G ++E I+ P + GK YE ++ Sbjct: 113 QYKIGMTNIVSRTTRSAAELTNSEIRSGKSSIVEMIKKLNPLVVCFNGKGIYEVFSGKKK 172 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSR--VSLEKLVEAYRELDQA 162 + G Q I ++ ++V+P+ SGL E + +++L + Sbjct: 173 CEVG-QQENIFNSAVYVMPSTSGLVTQYPRKEDKLVFFQQLKEL 215 >UniRef50_Q1D9T2 G/U mismatch-specific DNA glycosylase n=2 Tax=Bacteria RepID=Q1D9T2_MYXXD Length = 166 Score = 212 bits (541), Expect = 3e-54, Method: Composition-based stats. Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP LRV+FCGINP L SA G+ FA P NRFW ++ AG T +P E + LL Sbjct: 1 MPDVIAPRLRVLFCGINPSLYSAVVGYHFARPGNRFWPTLHAAGITPHTFQPTEQEALLA 60 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T +VDR T A+ ++ EL G + L K+ YQP+ LA+LG AY F + A Sbjct: 61 LGYGITNVVDRATATADLLNPAELARGAKALEAKVARYQPRFLAVLGVSAYRIAFGRPKA 120 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q +G T++WVLPNPSGL + L+ L + EL +A+ Sbjct: 121 ALGTQPERLGDTRLWVLPNPSGLNAHYQLDALGRLFAELRRAV 163 >UniRef50_Q39CW6 G/U mismatch-specific uracil-DNA glycosylase n=10 Tax=Bacteria RepID=Q39CW6_BURS3 Length = 210 Score = 211 bits (539), Expect = 6e-54, Method: Composition-based stats. Identities = 73/160 (45%), Positives = 104/160 (65%), Gaps = 1/160 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP L V+FCGINPGL++A TG FA +NRFW+VI+ AGFT ++ PQ+ +L Sbjct: 33 LPDLIAPQLDVLFCGINPGLTAAATGHHFAGRSNRFWRVIHLAGFTPAEISPQDDHAILQ 92 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y CG+T +V RPT A+++S+ E A + K+ Y P+ +A LGK AY QR Sbjct: 93 YGCGLTAVVKRPTASADQLSRAEFVAAAAEFEHKVARYGPRFVAFLGKAAYSALSGQREI 152 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRELD 160 +WG Q + + +WVLPNPSG + SL+ LV+AYR+L Sbjct: 153 EWGLQPARMQGSSVWVLPNPSGRNLAFSLDDLVDAYRQLR 192 >UniRef50_D2B5F3 G:T/U mismatch-specific DNA glycosylase-like protein n=9 Tax=Actinomycetales RepID=D2B5F3_STRRD Length = 208 Score = 211 bits (537), Expect = 1e-53, Method: Composition-based stats. Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++D+L P L V+FCGINPGL SA TG FA P NRFW ++++GFT R L P E L Sbjct: 45 LKDVLGPSLDVLFCGINPGLYSAATGHHFARPGNRFWPALHRSGFTPRLLDPAEQDLLPS 104 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T +V+R + QA E++ +E GG +L + +P+ LA+ G AY F Q A Sbjct: 105 FGLGITNVVERASAQAAELTPEEFREGGARLARLVAGTRPRVLAVAGVTAYRTAFGQPKA 164 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q IG T +WVLPNPSGL + +LE + R L ++L Sbjct: 165 AVGPQATVIGDTAVWVLPNPSGLNAHWTLEGIAGEMRRLRESL 207 >UniRef50_Q67QW8 G/U mismatch-specific DNA glycosylase n=1 Tax=Symbiobacterium thermophilum RepID=Q67QW8_SYMTH Length = 177 Score = 210 bits (534), Expect = 2e-53, Method: Composition-based stats. Identities = 66/167 (39%), Positives = 95/167 (56%), Gaps = 2/167 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D LAPGLRVVF G NPG +SA G +A+P NRF+ +++QAG T+R+ P E LL+ Sbjct: 5 LPDYLAPGLRVVFVGFNPGETSAREGHYYAYPGNRFYWLLWQAGLTERRYAPHEDARLLE 64 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T LV R T + ++S EL +G +L+ K+ Y+P+ GK Y +R Sbjct: 65 IGYGLTDLVGRSTRSSGDLSAAELRSGREELLAKLARYRPRVACYNGKGIYAALTGRRRV 124 Query: 122 QWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 +G Q + I +V +PSG SR L + + YREL AL+ G Sbjct: 125 AYGLQPESVIPGVLDFVAASPSGRSREPLAEKLRLYRELR-ALIAEG 170 >UniRef50_Q46SQ2 G/U mismatch-specific uracil-DNA glycosylase n=4 Tax=Proteobacteria RepID=Q46SQ2_RALEJ Length = 197 Score = 209 bits (532), Expect = 3e-53, Method: Composition-based stats. Identities = 77/168 (45%), Positives = 102/168 (60%), Gaps = 1/168 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DI+ PGL VFCG+NPGL +A +G FA NRFW+VI+ AG TDR+L+P + + LL Sbjct: 16 LPDIMKPGLSAVFCGLNPGLRAAASGHHFAGHGNRFWRVIHLAGITDRELEPVDDRQLLS 75 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T V RPT A +S E +LI K+E + P +A LGK AY SQR Sbjct: 76 YGYGLTTAVSRPTASAAALSHNEYVDASVRLIIKLERFHPACVAFLGKAAYAAMTSQRVV 135 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVRGR 168 WG Q +G ++WVLPNPSG +R SL LV+AY E ++ + R Sbjct: 136 PWGPQRRRMGGARVWVLPNPSGRNRAFSLAGLVDAYGEFRASVESQIR 183 >UniRef50_Q4T9X5 Chromosome undetermined SCAF7485, whole genome shotgun sequence. (Fragment) n=3 Tax=Metazoa RepID=Q4T9X5_TETNG Length = 333 Score = 208 bits (530), Expect = 6e-53, Method: Composition-based stats. Identities = 55/172 (31%), Positives = 87/172 (50%), Gaps = 11/172 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP L ++ GINPGL SA G + +P N FWK ++ +GFT++QL Q L + Sbjct: 43 LPDVIAPNLDILIIGINPGLLSAYKGHHYPNPGNHFWKCLFLSGFTEKQLNYSHDQSLPE 102 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG----- 115 Y G T +V+R T + ++S +E+ GGR+L+ K++ Y+P A GK YE Sbjct: 103 IYGIGFTNMVERTTPGSKDLSNKEIREGGRQLLAKLQKYRPLIAAFNGKGIYEIFCRETF 162 Query: 116 -FSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 + ++G Q + TQ PS R + V Y +L + Sbjct: 163 GVKAKNLEFGLQPYRVPETQTVCYLMPSSSPRCAQFPRAQDKVHFYIKLKEL 214 >UniRef50_UPI0000DB71A4 PREDICTED: similar to Thd1 CG1981-PA, partial n=2 Tax=Apocrita RepID=UPI0000DB71A4 Length = 263 Score = 208 bits (529), Expect = 8e-53, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 84/165 (50%), Gaps = 3/165 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +VF GINP L +A G +A P N F+K+++++G T R L +E LL Sbjct: 25 LPDYLKMNLDLVFVGINPSLMAAHRGRYYAGPGNHFYKLLHESGLTPRCLIYEEDYKLLQ 84 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-RG 120 Y G+T +V R T + ++ + E+ G + + EK++ Y+P+ GK YE ++ Sbjct: 85 YGIGLTNIVTRATKSSADLKRTEIKEGSKIIEEKLKLYKPKIAVFNGKCIYEVFANKMGN 144 Query: 121 AQWGKQTLTIGSTQIWVLPNPSG--LSRVSLEKLVEAYRELDQAL 163 +G Q I T IWV P+ S + + + + L + L Sbjct: 145 FNFGLQLERINETAIWVTPSSSARCANFPRMTDKLHFFTALKKYL 189 >UniRef50_A8LC62 Uracil-DNA glycosylase superfamily n=16 Tax=Bacteria RepID=A8LC62_FRASN Length = 297 Score = 206 bits (525), Expect = 2e-52, Method: Composition-based stats. Identities = 75/162 (46%), Positives = 100/162 (61%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D++ P RV+ CGINP L S TGF F P+NR W V++ AGFT R+L P E +HL Sbjct: 109 VPDLVGPETRVLLCGINPSLESGATGFHFGTPSNRLWPVLHFAGFTGRRLHPSETEHLRA 168 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T LV R T +A+E++ E+ AG LIE +E +P+ +A LG AY GF +R A Sbjct: 169 RGIGITNLVHRSTARADEIADDEIRAGVPVLIELVERIRPEWVAFLGLAAYRIGFGRRTA 228 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 + G+Q IG +W+LPNPSGL + L LV Y EL +A Sbjct: 229 KVGRQPERIGPAGVWLLPNPSGLNAHYQLPDLVRVYGELREA 270 >UniRef50_UPI0001926888 PREDICTED: similar to GL18173 n=1 Tax=Hydra magnipapillata RepID=UPI0001926888 Length = 282 Score = 206 bits (524), Expect = 3e-52, Method: Composition-based stats. Identities = 59/167 (35%), Positives = 91/167 (54%), Gaps = 8/167 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D + GL +VF GINPGL++A +G + P N FWK +Y +G D+ + Q LLD Sbjct: 49 LPDYMRNGLDIVFVGINPGLAAAFSGKYYTGPGNHFWKCLYLSGLIDKPFTSNDDQKLLD 108 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G T +V R T +N++ K+EL AG + L EKI+ Y+P + GK +Y+ ++ Sbjct: 109 YNIGFTNMVSRTTKGSNDLCKEELIAGSKVLKEKIQLYRPLIVVFNGKISYQVFSGKKDF 168 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 +GKQ+ + T IWV+P+ S +R E V + L + Sbjct: 169 FFGKQSELVEGTDTHIWVMPSSS--ARCAQLPRAEDKVPFFTGLKRF 213 >UniRef50_Q0RRS7 DNA glycosylase, G/U mismatch-specific n=2 Tax=Frankia RepID=Q0RRS7_FRAAA Length = 196 Score = 205 bits (523), Expect = 4e-52, Method: Composition-based stats. Identities = 73/162 (45%), Positives = 95/162 (58%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D+LAP V+ CGINP L S TGF F P+NR W V++ AGFT R+L P E L Sbjct: 30 VPDLLAPRTTVLLCGINPSLESGATGFHFGTPSNRLWPVLHLAGFTPRRLHPSETAELTS 89 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 +T LV+R T +A+E+ E+ AG +L +E P +A LG AY GF +R A Sbjct: 90 RGISITNLVNRSTARADEIGDDEVRAGVDRLTGLVERISPHWVAFLGLAAYRVGFGRRRA 149 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 Q G+Q T+G +W+LPNPSGL + L LV Y EL +A Sbjct: 150 QVGRQPETLGQAGVWLLPNPSGLNAHYQLPDLVRVYGELREA 191 >UniRef50_Q9K3I5 G/U mismatch-specific DNA glycosylase n=3 Tax=Streptomyces RepID=Q9K3I5_STRCO Length = 160 Score = 203 bits (517), Expect = 2e-51, Method: Composition-based stats. Identities = 67/152 (44%), Positives = 95/152 (62%), Gaps = 1/152 (0%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 ++FCGINPGL +A TG FA P NRFW V++ +GFT R LKP E L Y G+T +V Sbjct: 1 MLFCGINPGLMTAATGHHFARPGNRFWPVLHLSGFTPRLLKPSEQDELPSYGLGITNVVA 60 Query: 72 RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIG 131 R + +A+E++ +E GGR L K+ +P LA++G AY F + A+ G Q T G Sbjct: 61 RASARADELTAEEYREGGRLLARKVARLRPGWLAVVGVTAYRAAFDEPKARVGPQERTFG 120 Query: 132 STQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 ST++WVLPNPSGL + + + + + Y L +A Sbjct: 121 STRVWVLPNPSGLNAHWTAQTMAQEYARLREA 152 >UniRef50_Q13569 G/T mismatch-specific thymine DNA glycosylase n=47 Tax=Euteleostomi RepID=TDG_HUMAN Length = 410 Score = 203 bits (517), Expect = 2e-51, Method: Composition-based stats. Identities = 52/170 (30%), Positives = 81/170 (47%), Gaps = 11/170 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL- 60 + DIL L +V GINPGL +A G + P N FWK ++ +G ++ QL + L Sbjct: 124 LPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQLNHMDDHTLPG 183 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-- 118 Y G T +V+R T + ++S +E GGR L++K++ YQP+ GK YE + Sbjct: 184 KYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFSKEVF 243 Query: 119 ----RGAQWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELD 160 + ++G Q I T+ PS +R + V Y +L Sbjct: 244 GVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLK 293 >UniRef50_C6WCQ8 Uracil-DNA glycosylase superfamily n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WCQ8_ACTMD Length = 200 Score = 203 bits (516), Expect = 3e-51, Method: Composition-based stats. Identities = 71/163 (43%), Positives = 97/163 (59%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+LAPGL + CGINPG SA TG +A P NRFW+V++ AGFT R L P E LL Sbjct: 21 LPDLLAPGLSALLCGINPGGRSAVTGHHYAGPGNRFWRVLHLAGFTPRLLAPSEQALLLG 80 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T LV RPT +A E+ EL AG +L +E ++P A+ ++G AY F + A Sbjct: 81 FGIGLTNLVARPTARAAELGSAELVAGAGRLAGLVERHRPGAVGVVGVSAYRVAFRRPRA 140 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 G Q + + +WVLPNPSGL+ L+ LV +R+ L Sbjct: 141 VVGPQDVELAGAPLWVLPNPSGLNAGYRLDDLVAIFRDFRSEL 183 >UniRef50_Q47NL8 G/U mismatch-specific uracil-DNA glycosylase n=11 Tax=Bacteria RepID=Q47NL8_THEFY Length = 212 Score = 202 bits (514), Expect = 4e-51, Method: Composition-based stats. Identities = 74/169 (43%), Positives = 102/169 (60%), Gaps = 3/169 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D++ PGL ++FCGINPGL SA +G FA P NRFW ++ A T R L+P E LLD Sbjct: 21 VPDVIGPGLTILFCGINPGLYSAWSGHHFARPGNRFWPALHAASLTPRLLRPAEQHLLLD 80 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 GVT LVDR T +A+E++ EL AGG+ L K+ Y+P+ LA+LG AY + F++ A Sbjct: 81 LGLGVTNLVDRATARADELTAAELQAGGQALAAKLRAYRPRLLAVLGVTAYRKAFARPRA 140 Query: 122 QWGKQT--LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVRG 167 + G Q I +WVLPNPSGL + + + + E R + A + Sbjct: 141 RIGPQPPEDAIEGVPVWVLPNPSGLNAHWTTQGIAEELRRMCAAHGIEA 189 >UniRef50_C6B6I6 Uracil-DNA glycosylase superfamily n=2 Tax=Rhizobium leguminosarum RepID=C6B6I6_RHILS Length = 163 Score = 202 bits (514), Expect = 4e-51, Method: Composition-based stats. Identities = 73/163 (44%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + +ILAPGL VVFCG+NP LS+ G F++P+NRFW+V++ AGFT R L+ E + LL Sbjct: 1 MSEILAPGLSVVFCGLNPALSAVRDGHNFSNPSNRFWRVLHLAGFTPRLLRADEERELLQ 60 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y CG+T V RPT A+E+ KQ+ + L KI P LA LGK AY + Sbjct: 61 YGCGLTSAVSRPTKSASELKKQDYRSAAPVLENKIRKIAPANLAFLGKAAYAAISLRADV 120 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 +WG+Q +W+LPNPSGL+R +L L E Y +L L Sbjct: 121 EWGRQPEVFAGADVWLLPNPSGLNRAFTLTTLTEHYLDLRLTL 163 >UniRef50_Q1J3D2 Uracil-DNA glycosylase superfamily n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J3D2_DEIGD Length = 177 Score = 201 bits (513), Expect = 5e-51, Method: Composition-based stats. Identities = 61/168 (36%), Positives = 92/168 (54%), Gaps = 1/168 (0%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +++D++A G++V+F G NP S G +A N+FW+++ +G T R L+P+E L Sbjct: 10 VLQDLVAEGVQVLFVGFNPSPRSVQQGHHYAGRGNQFWRLLAASGLTPRLLRPEEDHLLP 69 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G T LV RPTV A E+S+ EL AG L + YQ +A GK Y ++ Sbjct: 70 RWGLGSTNLVARPTVSAAELSRSELRAGVPHLRRLVARYQVPLVAYTGKGVYLAATGKQQ 129 Query: 121 AQWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 A WG Q + IG +VLP+PSGL+R+ + + YR L + G Sbjct: 130 AHWGAQEDSLIGGVTDFVLPSPSGLARLPFAEKLGYYRALAELAQQPG 177 >UniRef50_B9J7K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=Agrobacterium radiobacter K84 RepID=B9J7K0_AGRRK Length = 198 Score = 197 bits (501), Expect = 1e-49, Method: Composition-based stats. Identities = 69/163 (42%), Positives = 100/163 (61%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + +ILAP L VVFCG+NP L++ G F++ +NRFW+V++ AGFT R L+ +E + +L Sbjct: 28 LAEILAPRLSVVFCGLNPALTAHRDGHNFSNRSNRFWRVLHLAGFTPRLLRAEEEREMLR 87 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T V RPT A+E++K++ L +KI P+ LA LGK AY Q Sbjct: 88 YGLGLTSAVSRPTKSASELTKRDYITAAPALEDKITKLAPRNLAFLGKAAYAAISLQAHV 147 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQAL 163 +WG+Q +WVLPNPSGL+R L +L E YR++ A+ Sbjct: 148 EWGRQPDKFADASVWVLPNPSGLNRAFDLTRLTEHYRKVKLAV 190 >UniRef50_C7IMH6 Uracil-DNA glycosylase superfamily n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IMH6_9CLOT Length = 184 Score = 194 bits (493), Expect = 1e-48, Method: Composition-based stats. Identities = 51/171 (29%), Positives = 88/171 (51%), Gaps = 9/171 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DILAPGL+VVF G NPGL SA ++H +NRFW+ +++AG T + +P++ + +L+ Sbjct: 10 LPDILAPGLKVVFIGYNPGLLSAKNRHHYSHKSNRFWRFLFEAGLTPVRFQPEDDRKILE 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG------ 115 G T +VDR T AN++ E+ G L ++ +P+ +G Y Sbjct: 70 LGYGSTNIVDRQTKAANDIGADEVIEGSANLYRLLDTLKPRVACYVGIGVYRAFASCILK 129 Query: 116 --FSQRGAQWGKQTLTIGSTQI-WVLPNPSGLSRVSLEKLVEAYRELDQAL 163 S+ G Q ++ + +V + SGL+ V + + + +L + L Sbjct: 130 MSASRINVTPGLQLDSLQEATLDFVCYSTSGLNTVPFAEQCKCFADLKKLL 180 >UniRef50_C0ZBM4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZBM4_BREBN Length = 192 Score = 191 bits (487), Expect = 6e-48, Method: Composition-based stats. Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 9/171 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + + L V+FCGINPG SA TGF +A+PAN FW+ +Y+ G T +L P+E LLD Sbjct: 9 LPTYIRRDLAVLFCGINPGRISATTGFHYANPANLFWRGLYEGGLTPHKLLPEETATLLD 68 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG- 120 Y G+T LV RPT A ++ ++ AG ++ I+ Y+P+ + G A+ Q+ Sbjct: 69 YNYGITDLVSRPTRSATDLRNEDYAAGTELFVQMIQTYRPRVICFNGITAFRHATGQKKE 128 Query: 121 -AQWGKQTLTIGSTQ------IWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q + T+ ++V+P+ SG + + E+ + + EL L Sbjct: 129 AISLGLQPIRYFGTEEWSGSYVFVVPSTSGANASFTREERIAKFTELSHFL 179 >UniRef50_A6SED0 Putative uncharacterized protein n=2 Tax=Sclerotiniaceae RepID=A6SED0_BOTFB Length = 425 Score = 191 bits (486), Expect = 7e-48, Method: Composition-based stats. Identities = 59/180 (32%), Positives = 98/180 (54%), Gaps = 21/180 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+++ L +F G+NPGL +A G +AHP+NRFWK++Y +G T R LK ++ L + Sbjct: 222 LPDVISSDLLCLFVGLNPGLLTAQVGHAYAHPSNRFWKILYSSGCTTRLLKAEDDVKLPE 281 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y G T +V+RPT E+SK+E+ G L EKI Y+P+A+A++GK +E + + Sbjct: 282 LYSLGNTNIVERPTRNGGELSKKEMDEGVSSLEEKIRIYRPEAVALVGKGIWESVWRVKK 341 Query: 121 --------AQWGKQTL-----------TIGSTQIWVLPNPSGLSRV-SLEKLVEAYRELD 160 ++G Q+ + G +++V + SGL+ S + +REL Sbjct: 342 GKAMAKDEFRYGWQSEEENLGVVEGEDSWGGARLFVASSTSGLAASLSPAEKESIWRELG 401 >UniRef50_B8I864 Uracil-DNA glycosylase superfamily n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I864_CLOCE Length = 183 Score = 191 bits (486), Expect = 8e-48, Method: Composition-based stats. Identities = 52/171 (30%), Positives = 91/171 (53%), Gaps = 9/171 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DILA GL+VVF G NPGL SA ++H +NRFW+ +Y++G T + +P++ + +L+ Sbjct: 10 LPDILATGLKVVFIGYNPGLLSARNRHHYSHKSNRFWRFLYESGLTPSRFEPEDDRKILE 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG------ 115 + G T +VDRP+ ANE+ E+ G + +++++P+ +G Y Sbjct: 70 FGYGSTNIVDRPSKAANEIGTDEVVEGSANIYRLLDNFKPRIACYVGIGVYRAYASSILR 129 Query: 116 --FSQRGAQWGKQTLTI-GSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 S + G Q+ +I S +V + SGL+ V + + + L + L Sbjct: 130 VPASGINVKPGLQSASILESISDFVCYSTSGLNTVPFAEQRKCFENLKKLL 180 >UniRef50_B2A9Z4 Predicted CDS Pa_1_2290 n=2 Tax=Sordariales RepID=B2A9Z4_PODAN Length = 327 Score = 186 bits (474), Expect = 2e-46, Method: Composition-based stats. Identities = 57/182 (31%), Positives = 95/182 (52%), Gaps = 21/182 (11%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D +AP L ++F G+NPGL ++ TG +AHP N FW++++ +G T R PQE + L + Sbjct: 108 DAIAPNLILLFVGLNPGLLTSSTGHAYAHPTNLFWRLLHSSGITPRLCAPQEDRLLPSLF 167 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG-- 120 G+T +V RPT E+SK+E+ AG L EKI ++P+ + I+GK +E + R Sbjct: 168 SLGLTNIVPRPTRNGAELSKKEMDAGVAVLEEKIRKFKPEVVCIVGKSIWESVWRVRHGR 227 Query: 121 ------AQWGKQTL-----------TIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 ++G Q +++V + SGL + + + E + EL + Sbjct: 228 GIKKEEFRYGWQERGEDMGKVEGGGEWNGARVFVACSTSGLAATLKPREKEEIFGELGEW 287 Query: 163 LV 164 +V Sbjct: 288 VV 289 >UniRef50_B8N438 Mismatch-specific thymine-DNA glycosylase, putative n=6 Tax=Eurotiomycetidae RepID=B8N438_ASPFN Length = 373 Score = 185 bits (469), Expect = 6e-46, Method: Composition-based stats. Identities = 52/184 (28%), Positives = 95/184 (51%), Gaps = 18/184 (9%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + P L ++ G+NPG+ + TG+ +AHP+N +WK+++ +G T + P + L Sbjct: 180 LLRDTIPPNLTLLLVGVNPGIMTGATGYVYAHPSNLYWKLLHWSGITAIRHPPSDTYRLP 239 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + Y G T +V+RPT A+ +SK E+ AG L EK+ QP+A+ ++GK +E + R Sbjct: 240 ELYNIGNTNIVERPTRDASMLSKAEMDAGVPVLEEKVAKQQPEAVCLVGKSIWEAVWRVR 299 Query: 120 G--------AQWGKQ--------TLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQA 162 ++G Q + ++V SGL+ +S+ + + EL + Sbjct: 300 KGRAIRKEEFRYGWQDESENMGRSERWNGAPVFVATTTSGLAAGMSMAEKQAVWNELGKW 359 Query: 163 LVVR 166 + R Sbjct: 360 VNSR 363 >UniRef50_A2QSA7 Function: In the DNA of higher eukaryotes n=48 Tax=Leotiomyceta RepID=A2QSA7_ASPNC Length = 345 Score = 185 bits (469), Expect = 7e-46, Method: Composition-based stats. Identities = 53/187 (28%), Positives = 100/187 (53%), Gaps = 21/187 (11%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + P L ++ G+NPG+ + TG+ +AHP+N FWK+++ +G T + P + L Sbjct: 138 LLVDTIPPNLTLLLVGVNPGVLTGVTGYAYAHPSNLFWKLLHWSGITPIRHPPSDTYKLP 197 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF--- 116 + Y G T +V+RPT A+ +SK+E+ AG L EK+ + +P+ + ++GK +E + Sbjct: 198 ELYNIGNTNIVERPTRDASMLSKKEMDAGVPVLEEKVREKRPEVVCLVGKSIWEAVWRVK 257 Query: 117 -----SQRGAQWGKQTLTI-----------GSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 + ++G Q L + +++V SGL+ +SL++ E + EL Sbjct: 258 VGRGIRKEEFRYGWQDLEMNMGRVGGEGGWEGARVFVATTTSGLAAGMSLQEKREVWDEL 317 Query: 160 DQALVVR 166 + + R Sbjct: 318 GRWVCER 324 >UniRef50_C8XAI4 Uracil-DNA glycosylase superfamily n=3 Tax=Actinomycetales RepID=C8XAI4_NAKMY Length = 193 Score = 185 bits (469), Expect = 8e-46, Method: Composition-based stats. Identities = 62/171 (36%), Positives = 94/171 (54%), Gaps = 7/171 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL------KPQE 55 V+D++ PG+R++F GINPGL +A FA P NRFW + AG T ++ P + Sbjct: 19 VDDLIEPGVRLLFVGINPGLWTAAANAHFARPGNRFWPALQLAGITRERMDVSGGMAPAD 78 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 L+ G+T LV T +A+E++ +++ L ++ +P +A+LG AY Q Sbjct: 79 RAELVGAGIGITNLVPFATARADELTGEQIRQAAGTLTATVQRVRPAVVAVLGLTAYRQA 138 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 F + A+ G+Q +G Q+WVLPNPSGL + + L AYRE A V Sbjct: 139 FGRPKARPGRQPEDLGGAQLWVLPNPSGLNAHETAASLGVAYREPALAAGV 189 >UniRef50_D1CA47 Uracil-DNA glycosylase superfamily n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CA47_SPHTD Length = 194 Score = 183 bits (466), Expect = 2e-45, Method: Composition-based stats. Identities = 55/182 (30%), Positives = 86/182 (47%), Gaps = 15/182 (8%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--------RQLK 52 ++ D LAP + +VF GINP SA G FA NRFW + Q+ ++ +L+ Sbjct: 9 VLPDYLAPKMPLVFIGINPSTYSAAQGHYFARRTNRFWPALSQSRLSEPIRAALGRDRLE 68 Query: 53 PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAY 112 P + LL + G T LV RP+ +A E+S+ + KL +++E YQP+ G Y Sbjct: 69 PTDDARLLAFGIGFTDLVKRPSARAAELSRADYRIWAPKLRQRLEHYQPRVACFHGFTGY 128 Query: 113 EQGFS------QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 + G Q ++G T+++V+PNPS + +L + Y L L Sbjct: 129 RAFVRDALGLTEVPRNLGPQPHSLGPTRLYVVPNPSPANAHFTLADQIAWYDRLADFLAE 188 Query: 166 RG 167 G Sbjct: 189 IG 190 >UniRef50_C9SR06 G/U mismatch-specific uracil DNA glycosylase n=4 Tax=Sordariomycetes RepID=C9SR06_VERA1 Length = 320 Score = 183 bits (466), Expect = 2e-45, Method: Composition-based stats. Identities = 58/182 (31%), Positives = 90/182 (49%), Gaps = 23/182 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D +AP L ++F G+NPG+ +A TG +AHP+N FWK++Y +G T R L E + L + Sbjct: 108 LPDAIAPNLLILFIGLNPGIQTASTGHAYAHPSNLFWKLLYHSGITSRLLPASEDRTLPE 167 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR- 119 Y CG T +V RP+ E+ EL AG L KI ++P+A I+GK +E F R Sbjct: 168 RYACGFTNIVSRPSRNGAELRNAELDAGVAVLEAKIAKWRPEAACIVGKGIWEAVFRVRK 227 Query: 120 -------GAQWGKQT-LTIG------------STQIWVLPNPSGLSRVSLE-KLVEAYRE 158 ++G Q +G +++V + SGL+ + + E Sbjct: 228 GRGMKKGKFEYGWQEGENMGVVRDGEEGGPWEGARVFVATSTSGLAATLGPREKEVIWGE 287 Query: 159 LD 160 L Sbjct: 288 LG 289 >UniRef50_A9VA78 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VA78_MONBE Length = 239 Score = 182 bits (463), Expect = 4e-45, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 77/165 (46%), Gaps = 21/165 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+L P LRV+ GINPG+ SA G +A W+ G Sbjct: 50 LPDLLLPNLRVLIIGINPGVWSAARGHHYAAGVELGWQTDMSCG---------------Q 94 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA----YEQGFS 117 Y G T V R T ++ K E+ G +L + I++YQP A GK +++ FS Sbjct: 95 YGIGFTNSVARTTRGQMDLDKSEIEDGVARLRQLIQEYQPAIAAFNGKGVSEIFFKRQFS 154 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPS--GLSRVSLEKLVEAYRELD 160 ++ G Q TIG T+++++P+ S G +R + + +REL Sbjct: 155 RKVTTLGLQQETIGETRLYIMPSTSPRGATRPRWQDKLPFFRELA 199 >UniRef50_B2W4Q2 G/U mismatch-specific DNA glycosylase n=2 Tax=Pleosporineae RepID=B2W4Q2_PYRTR Length = 328 Score = 181 bits (461), Expect = 6e-45, Method: Composition-based stats. Identities = 57/195 (29%), Positives = 97/195 (49%), Gaps = 30/195 (15%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL- 60 + DIL P L VF G NPG+ +A G +AHP+N FWK+++ +G T+R+LKP+E + L Sbjct: 111 LVDILEPNLICVFVGTNPGVQTAIAGHAYAHPSNHFWKLLHSSGLTNRRLKPEEDRSLPA 170 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF---- 116 + G T +VDRP+ A E+SK+E AG L K ++P+A+ I+GK +E + Sbjct: 171 QFCMGNTNIVDRPSKDAAELSKEETAAGTASLEAKFRKFKPEAVCIVGKGIWEAIWCYRH 230 Query: 117 ----SQRGAQWGKQTLT--------------------IGSTQIWVLPNPSGLSRV-SLEK 151 S++ ++G Q ++++V + SGL+ + Sbjct: 231 GKNPSKKDFKYGWQDEKHNMGKTPDDAGEELDGNGNVWKGSRVFVTTSTSGLAASLKPAE 290 Query: 152 LVEAYRELDQALVVR 166 ++ + + R Sbjct: 291 KEAIWKPFGEWVQKR 305 >UniRef50_Q6C2V5 YALI0F04796p n=1 Tax=Yarrowia lipolytica RepID=Q6C2V5_YARLI Length = 302 Score = 181 bits (459), Expect = 1e-44, Method: Composition-based stats. Identities = 51/168 (30%), Positives = 91/168 (54%), Gaps = 14/168 (8%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKL 69 R+VF G NPG+ S+ G +AHP+N FWK + + L + L+D + G T L Sbjct: 97 RLVFVGFNPGIMSSEKGHYYAHPSNSFWKNLNKYVVIGGGLTCYDDSSLMDSHGIGFTDL 156 Query: 70 VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAY----EQGFSQRGAQ--- 122 V RPT ++S++E+ G + L+++I +P + I+G+ + F +G + Sbjct: 157 VARPTRGCEDLSREEMALGAQILVDQISANKPALVCIVGRGIWATVFRAVFKHQGIKKSE 216 Query: 123 ---WGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 +G + +G+++++V+P+ SGL R +K +E +REL Q +G Sbjct: 217 MPHFGLMPVHMGTSRVFVVPSTSGLVR---DKRLELWRELGQLYRGKG 261 >UniRef50_C8NV44 G/U mismatch-specific DNA glycosylase n=2 Tax=Corynebacterium RepID=C8NV44_9CORY Length = 170 Score = 180 bits (458), Expect = 1e-44, Method: Composition-based stats. Identities = 61/168 (36%), Positives = 93/168 (55%), Gaps = 7/168 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ------LKPQE 55 + D + R++ GINPG S PFAHP NRFW + +AG T + L Q+ Sbjct: 1 MHDRIPNPCRLLIVGINPGTLSEEVDAPFAHPGNRFWPALDKAGITPYRVYASEGLSEQD 60 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 A+ L + G T LV R T +A++++ +EL GG +L ++ ++P A+ G AY Sbjct: 61 AKMLAERGIGFTNLVARMTPKASDLATEELVEGGERLEAVVDKHRPGAVMFAGIGAYRGA 120 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 F +R AQ G Q+ I +WV+ NPSGL + ++E L +YRE+ +A Sbjct: 121 FGRRKAQRGLQSGMIAGVPVWVVGNPSGLNAHETVESLAASYREVWEA 168 >UniRef50_B7GT25 Uracil-DNA glycosylase superfamily n=6 Tax=Actinobacteridae RepID=B7GT25_BIFLI Length = 212 Score = 180 bits (457), Expect = 2e-44, Method: Composition-based stats. Identities = 62/171 (36%), Positives = 95/171 (55%), Gaps = 13/171 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ------LKPQE 55 V+DI P +++ GINPGL +A PFA+P NRFW ++ +G T Q L+ + Sbjct: 30 VDDIRCPHPKLLIVGINPGLWTAAVNAPFAYPGNRFWPSLFASGLTPWQVDASYGLRLHD 89 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 + ++D G+T LV+R T +A+E++ EL G +L E P+A+AILG AY Q Sbjct: 90 ERMMIDRGIGLTNLVNRATAKASELNTAELRHGADRLRELCARVHPRAIAILGITAYRQA 149 Query: 116 FSQRGAQWGKQT------LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 F+ + A G Q G Q+WV+P PSGL + ++ LV+ +R + Sbjct: 150 FADKSAALGLQDLTAEETDRFGGAQLWVIPQPSGLNAHATMPVLVDWWRRV 200 >UniRef50_B2GKB6 G/U mismatch-specific DNA glycosylase n=6 Tax=Bacteria RepID=B2GKB6_KOCRD Length = 190 Score = 179 bits (454), Expect = 4e-44, Method: Composition-based stats. Identities = 59/159 (37%), Positives = 91/159 (57%), Gaps = 7/159 (4%) Query: 14 FCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------PQEAQHLLDYRCGVT 67 F GINPGL +A TG FA P NRF+ +++AG TD ++ P + L G++ Sbjct: 28 FVGINPGLWTAATGAHFARPGNRFYPALHRAGITDTRIDAAAGYDPADLAQLTGRGIGIS 87 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 + R T +A+E++++EL G R+L + ++ P +A+LG AY + F + GA G+Q Sbjct: 88 NVCPRATARADELTREELREGARRLDRLVREHHPAVVAVLGITAYREAFERPGAATGRQD 147 Query: 128 LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 T ++V PNPSGL + SL+ L AY E+ +A V Sbjct: 148 APWPGTALFVAPNPSGLNAHSSLDDLAHAYAEIARAAGV 186 >UniRef50_A6GLD3 G/U mismatch-specific DNA glycosylase n=1 Tax=Limnobacter sp. MED105 RepID=A6GLD3_9BURK Length = 161 Score = 176 bits (448), Expect = 2e-43, Method: Composition-based stats. Identities = 56/160 (35%), Positives = 88/160 (55%), Gaps = 5/160 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D+L L +VFCG PG SA G +A+ N FW+ +++ G T Q+ P++ Q + Sbjct: 3 ILPDLLQANLNIVFCGTAPGTVSAQRGAYYANAGNAFWRTLFEVGLTPHQVSPKDFQGIT 62 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y+ G+T L + V + S ++ G +L EKI + P+ LA K+A Q F Sbjct: 63 QYKLGLTDLAKQ--VYGADSSLKKSDFGPSQLREKILHFNPKILAFTSKRA-GQEFLGCK 119 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELD 160 Q G Q IG+T ++VLP+PSGL+R +E ++EL Sbjct: 120 VQCGLQHEKIGNTVLFVLPSPSGLARSHWR--IEPWQELA 157 >UniRef50_A7HGS2 Uracil-DNA glycosylase superfamily n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HGS2_ANADF Length = 181 Score = 176 bits (446), Expect = 3e-43, Method: Composition-based stats. Identities = 51/166 (30%), Positives = 85/166 (51%), Gaps = 3/166 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D +AP L ++F GINP L SA G FA N FW++++ +G T P++ L Sbjct: 11 LCDRVAPDLDILFVGINPSLRSAEVGHHFASKGNPFWRLLHASGLTAELHAPEDDVALPA 70 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF-SQRG 120 G+T + R T A E+ + EL GGR L +++ +P+ +A++G Y F R Sbjct: 71 RGLGITNVCSRATRSAAELGRDELREGGRALPDRVAALRPRIVALVGVSLYPHVFPGGRE 130 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVS--LEKLVEAYRELDQALV 164 G + + +++VLPNPSGL++ + + +R L + Sbjct: 131 TGPGAKRARLAGARVFVLPNPSGLNQAYPGFQVKLVWFRRLARLAG 176 >UniRef50_B2IIR8 Uracil-DNA glycosylase superfamily n=2 Tax=Bacteria RepID=B2IIR8_BEII9 Length = 191 Score = 176 bits (446), Expect = 3e-43, Method: Composition-based stats. Identities = 58/177 (32%), Positives = 79/177 (44%), Gaps = 14/177 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--------RQLKP 53 + D+L GL V+F GINP L S G FA NRFW ++ + L+P Sbjct: 9 LPDLLDKGLSVLFVGINPSLYSVAQGHYFARKTNRFWPAFSRSVLSLAARENLGLEALEP 68 Query: 54 QEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 L G T V RPT +A++VS E A L+EKIE Y+P+ G AY Sbjct: 69 MHDHLLPAQGFGFTDTVKRPTARASDVSPAEFAAAVTALVEKIEYYRPRIACFHGTMAYR 128 Query: 114 QGFS-----QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 + G QTL+IG T ++V+PNPS + + Y L L+ Sbjct: 129 PVHRALTETKTDPALGMQTLSIGPTHLYVVPNPSPANAHFTPADQTAWYDRLAGTLL 185 >UniRef50_C5SKW3 Uracil-DNA glycosylase superfamily n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SKW3_9CAUL Length = 203 Score = 173 bits (438), Expect = 3e-42, Method: Composition-based stats. Identities = 59/165 (35%), Positives = 78/165 (47%), Gaps = 6/165 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D+L PGL++VFCG G SA +AHP N FW+ +Y G T +L P + LLD Sbjct: 35 VPDLLRPGLKLVFCGTALGRVSAQKRAYYAHPGNFFWRTLYGVGLTPERLAPTDYPRLLD 94 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR-- 119 Y G+T L N+V L KI Y PQ LA K A + Sbjct: 95 YGIGLTDLCK--AHYGNDVDLPAEAWDAEALTAKIAAYAPQHLAFTSKTAASVFLERPTG 152 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALV 164 G Q G T + VLP+PSG +R ++ A+R+L QA+ Sbjct: 153 KIALGLQAGKAGETSLRVLPSPSGQARRFWDE--AAWRDLAQAVT 195 >UniRef50_Q7SDB2 Predicted protein n=2 Tax=Sordariaceae RepID=Q7SDB2_NEUCR Length = 396 Score = 169 bits (428), Expect = 4e-41, Method: Composition-based stats. Identities = 51/222 (22%), Positives = 92/222 (41%), Gaps = 62/222 (27%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L+P L V+F G+NPG+S+A +AHP N FW++++ +G T P E + L + Sbjct: 143 LPDALSPNLLVLFIGLNPGISTAIQQHAYAHPTNLFWRLLFSSGITPVLCAPSEDRSLPE 202 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-- 118 + G+T +V RPT E++K E+ G L EK+ ++P+A+ ++GK +E + Sbjct: 203 RFSLGLTNIVGRPTRNGGELTKGEMDEGVAVLEEKVRVWKPEAVCVVGKGIWESVYRVRE 262 Query: 119 ---------------------------------RGAQWGKQTLT---------------- 129 + ++G Q + Sbjct: 263 RQRQQRHGQGQIGIEKDKVGLNKGRGKAVGLLPKDWKYGWQDESENMGVPGGDSGDEEEE 322 Query: 130 ---------IGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQ 161 +++V + SGL + V + +REL + Sbjct: 323 MDTTTAGGGWKGARVFVATSTSGLAATVGRAEKERIWRELGE 364 >UniRef50_C6H6H6 Uracil DNA glycosylase superfamily protein n=1 Tax=Ajellomyces capsulatus H143 RepID=C6H6H6_AJECH Length = 301 Score = 168 bits (426), Expect = 7e-41, Method: Composition-based stats. Identities = 46/164 (28%), Positives = 88/164 (53%), Gaps = 18/164 (10%) Query: 16 GINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKLVDRPT 74 G+NPG+ + TG +AHP+N +WK+++ +G TD + P + L + Y G T +V+RPT Sbjct: 115 GVNPGIMTGQTGHVYAHPSNLYWKLLHSSGITDYRHPPSDTYQLPELYSVGNTNIVERPT 174 Query: 75 VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG--------AQWGKQ 126 A+++S++E+ G L KI + +P+A+ ++GK +E + R ++G Q Sbjct: 175 RDASQLSRKEMDDGVPVLERKIREKRPEAVCLVGKSIWEAVWRVRRGRAIKKEEFRYGWQ 234 Query: 127 TL--TIG------STQIWVLPNPSGLSR-VSLEKLVEAYRELDQ 161 +G +++V SGL+ +++ + + EL + Sbjct: 235 DEGDRMGRTKEWKGARVFVATTTSGLAAGMTMAEKQAVWSELGE 278 >UniRef50_A7HV32 Uracil-DNA glycosylase superfamily n=3 Tax=Rhizobiales RepID=A7HV32_PARL1 Length = 176 Score = 168 bits (426), Expect = 7e-41, Method: Composition-based stats. Identities = 50/165 (30%), Positives = 77/165 (46%), Gaps = 6/165 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+L PGL VVF G P SA G +AHP NRFW +++ G T R +P E L+ Sbjct: 12 LPDVLVPGLDVVFVGTAPSRRSAALGAYYAHPGNRFWPALFETGITPRLYQPHEYAELIH 71 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ--GFSQR 119 G T + R + +E+ + ++I +P+A+A K+A G Sbjct: 72 LGIGFTDMAKRASGMDHEIDPALFE--VERFEKEILRCRPRAVAFTSKKAASVWMGVPTG 129 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALV 164 G Q G +++VL +PSG +R S +E + L + L Sbjct: 130 RLLHGHQPRRPGFPEVFVLSSPSGAARRSWS--IEPWAGLARFLG 172 >UniRef50_A8P541 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P541_COPC7 Length = 365 Score = 168 bits (425), Expect = 8e-41, Method: Composition-based stats. Identities = 50/129 (38%), Positives = 69/129 (53%), Gaps = 4/129 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D L P L ++FCGINPG SA G FA P N FW ++++G T ++KPQE L D Sbjct: 151 VNDNLKPDLDIIFCGINPGQVSATLGHHFAGPTNHFWSCLHESGLTSTRMKPQEDFTLPD 210 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS--- 117 + G+T LV RPT +A+E+SK E A K+ Y+P+ + +G E S Sbjct: 211 RHSIGLTNLVGRPTAEASELSKAESLACVAPFFVKMHKYRPKIVCFVGLSIAETVSSYVA 270 Query: 118 QRGAQWGKQ 126 + G Q Sbjct: 271 KNFLDPGLQ 279 >UniRef50_O59825 G/U mismatch-specific uracil DNA glycosylase n=1 Tax=Schizosaccharomyces pombe RepID=TDG_SCHPO Length = 325 Score = 167 bits (423), Expect = 1e-40, Method: Composition-based stats. Identities = 41/154 (26%), Positives = 75/154 (48%), Gaps = 10/154 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--RQLKPQEAQHL 59 V D + + G+NPG++S+ G FA P+NRFWK++ ++ + + + L Sbjct: 142 VPDYICENPYAIIVGLNPGITSSLKGHAFASPSNRFWKMLNKSKLLEGNAEFTYLNDKDL 201 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAIL-GKQAYEQGFSQ 118 + G+T L RP+ ++ K+E+ G R L EK++ Y+PQ + GK +E+ + Sbjct: 202 PAHGLGITNLCARPSSSGADLRKEEMQDGARILYEKVKRYRPQVGLFISGKGIWEEMYKM 261 Query: 119 -------RGAQWGKQTLTIGSTQIWVLPNPSGLS 145 + +G Q G ++V + SG + Sbjct: 262 LTGKKLPKTFVFGWQPEKFGDANVFVGISSSGRA 295 >UniRef50_C8RZ13 Uracil-DNA glycosylase superfamily n=1 Tax=Rhodobacter sp. SW2 RepID=C8RZ13_9RHOB Length = 165 Score = 164 bits (417), Expect = 7e-40, Method: Composition-based stats. Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 4/153 (2%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 M+ DI+APGL V+FCG PG SA G +A P NRFW ++ + G T R+L P E L Sbjct: 1 MLPDIVAPGLDVLFCGTAPGAMSAARGHYYARPGNRFWPLLAEIGLTPRRLHPDEDHLLP 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G+T L + Q E+ + +L + +P+ALA A Sbjct: 61 GFGLGLTDLAQGVSGQDAEIPRHAY--APDRLQALVAAIRPRALAFTSLTAARLALGVPR 118 Query: 121 AQWG--KQTLTIGSTQIWVLPNPSGLSRVSLEK 151 G G T++W LP+PSGL+R Sbjct: 119 LAAGALPADPRYGHTRLWALPSPSGLARSHFSA 151 >UniRef50_A8Q5A6 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8Q5A6_MALGO Length = 272 Score = 164 bits (415), Expect = 1e-39, Method: Composition-based stats. Identities = 44/124 (35%), Positives = 66/124 (53%), Gaps = 3/124 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA---QH 58 + D + P L VVFCGINPG S +AH +N F+ I+ AG T +L P + Sbjct: 28 LPDYIKPNLDVVFCGINPGTESGAQRRHYAHRSNHFYACIHMAGLTSERLTPSQDMTFPT 87 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 L YR G+T L RPT ++ ++++ EL G +L+ KI Y+P+ + +GKQ + Sbjct: 88 LQPYRLGLTNLAHRPTARSEQLTRIELERGVPELLMKIARYRPRVICFVGKQIADVFMRV 147 Query: 119 RGAQ 122 A Sbjct: 148 VRAS 151 >UniRef50_C4Y7L8 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y7L8_CLAL4 Length = 427 Score = 160 bits (405), Expect = 2e-38, Method: Composition-based stats. Identities = 48/202 (23%), Positives = 89/202 (44%), Gaps = 41/202 (20%) Query: 3 EDI---LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA----------GFTDR 49 D+ ++ LR++F G NPG+ SA +AHP+N FWK+ Q+ G +D Sbjct: 221 PDLAPMISDSLRLLFIGFNPGVQSAQMQHHYAHPSNLFWKLFNQSALLPQVAQAQGISDH 280 Query: 50 QL------------KPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIE 97 L ++ L+D G T LV R T A+ +S E A +++ +I Sbjct: 281 PLVLELFQDGKGLASFRDDGRLVDVGIGFTDLVLRCTRAAHHLSPSEKTANVPRVMREIR 340 Query: 98 DYQPQALAILGKQAYEQGFSQ------RGAQWGKQTLTIGST---------QIWVLPNPS 142 + ++GK +E R +WG QT + Q++++P+ S Sbjct: 341 SSNASHVVVVGKGVWETFVRAFTGDKIRHFEWGLQTDATANAFYRQCGRRLQVYLVPSTS 400 Query: 143 GL-SRVSLEKLVEAYRELDQAL 163 GL + ++ + + + E+ +++ Sbjct: 401 GLVASMTFGEKLALWVEIARSV 422 >UniRef50_B0E0R1 Predicted protein n=7 Tax=Laccaria bicolor S238N-H82 RepID=B0E0R1_LACBS Length = 436 Score = 157 bits (398), Expect = 1e-37, Method: Composition-based stats. Identities = 58/199 (29%), Positives = 86/199 (43%), Gaps = 38/199 (19%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + + L GL VFCGINPG +SA G F P N FW +Y++GFT R L+PQE L + Sbjct: 214 LRNHLRSGLE-VFCGINPGQTSAEIGHHFGGPTNHFWSCLYESGFTTRLLRPQEDFILPE 272 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS--- 117 + G+T LVDRPT + +E+SK E AG L+ KI ++P+ + +G + S Sbjct: 273 QFSIGMTNLVDRPTAEQSELSKAERLAGVPALLAKIATHRPKIVCFVGLGIADIVKSKVM 332 Query: 118 -----------------------QRGAQWGKQTLTIG---------STQIWVLPNPSGL- 144 + A G Q + T + + + SG Sbjct: 333 PVIVRLSSALPPLLTPRQVGKGRKTKAAIGIQPYKLQYQGEQACPKETLFYAVSSTSGRV 392 Query: 145 SRVSLEKLVEAYRELDQAL 163 + V ++ L L Sbjct: 393 VTYQKKDKVNQFKALKSVL 411 >UniRef50_Q1IXM1 Uracil-DNA glycosylase superfamily n=3 Tax=Deinococcus RepID=Q1IXM1_DEIGD Length = 195 Score = 155 bits (392), Expect = 6e-37, Method: Composition-based stats. Identities = 46/162 (28%), Positives = 75/162 (46%), Gaps = 6/162 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +V D+L GL +V G P SA +A+P N+FW+V+ + G T RQL P+E L Sbjct: 13 LVPDVLQEGLTLVLVGTAPSRISARAKAYYANPENKFWRVLAETGLTPRQLAPREYPILP 72 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ--GFSQ 118 Y G+T + R + + + +L EKI Y+P +A K+ + G Sbjct: 73 QYGIGLTDVAKRHSGVDAALPVE--AWAPHELREKIRRYRPAIVAFTSKRGASETLGVPT 130 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELD 160 +G Q + ++WVLP+ S L + + ++ L Sbjct: 131 GRLPYGPQVGRLEGAEVWVLPSTSPLGHTHFQ--LGPWQALA 170 >UniRef50_B3QI54 Uracil-DNA glycosylase superfamily n=3 Tax=Rhodopseudomonas palustris RepID=B3QI54_RHOPT Length = 177 Score = 154 bits (390), Expect = 1e-36, Method: Composition-based stats. Identities = 47/163 (28%), Positives = 83/163 (50%), Gaps = 3/163 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D+L GL+VVFCG G +S +AH N+FW V+ + G T R+L P E + + Sbjct: 15 ILPDVLQRGLKVVFCGTAAGRASEKAQSYYAHGRNKFWSVLSETGLTTRKLAPHEFREVT 74 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 ++ G+T L ++V + H L KIE+++P LA +A ++ +R Sbjct: 75 NFGIGLTDLCKDVAGSDHDVRPRPEH--RVVLRRKIEEHRPDFLAFTSLEAAKRFVGRR- 131 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 G+ +IG+T I+VLP+ S ++ + + E + + Sbjct: 132 VGLGRYEESIGTTSIYVLPSTSPMADWNWMSNKIHWWEFAELV 174 >UniRef50_Q5KFR6 Expressed protein n=2 Tax=Filobasidiella neoformans RepID=Q5KFR6_CRYNE Length = 469 Score = 151 bits (381), Expect = 1e-35, Method: Composition-based stats. Identities = 44/116 (37%), Positives = 67/116 (57%), Gaps = 2/116 (1%) Query: 2 VEDILAPGLRVVFCGINP-GLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 + D+L G +VFCGI G S+ G FAHP N+FWK +YQ+G T R + P E ++ Sbjct: 158 LNDLLVKGQDLVFCGIKRIGKMSSTLGHHFAHPTNKFWKALYQSGLTSRLMSPTEDYKVV 217 Query: 61 -DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 +Y G+T LV RPT + +E+S E+ L++K YQP + +GK+ ++ Sbjct: 218 DEYNYGLTNLVGRPTSEQSELSTLEMKLNTINLLQKFIKYQPSVVCFVGKKIWDVF 273 >UniRef50_B7Q2I4 G/T mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Ixodes scapularis RepID=B7Q2I4_IXOSC Length = 236 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 33/135 (24%), Positives = 60/135 (44%), Gaps = 5/135 (3%) Query: 39 KVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIED 98 + ++ AG + + L+ Y G T +V R T + ++S++E+ G L++K++ Sbjct: 24 RCLFLAGLIPEPMTAVDDFKLIQYGIGFTNIVARTTRSSADLSRKEIKEGAEILMDKLKI 83 Query: 99 YQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVE 154 +QP+ GK YE + +G+Q I T + V PS +R + Sbjct: 84 FQPRIAVFNGKGIYEIFSGNKNFVFGRQPDPIEGTNVSVFVMPSSSARCAQLPRAVDKLP 143 Query: 155 AYRELDQALV-VRGR 168 Y L + +RGR Sbjct: 144 FYVALKKLRDHLRGR 158 >UniRef50_A3LQ36 Predicted protein n=1 Tax=Pichia stipitis RepID=A3LQ36_PICST Length = 270 Score = 149 bits (378), Expect = 2e-35, Method: Composition-based stats. Identities = 47/218 (21%), Positives = 81/218 (37%), Gaps = 60/218 (27%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGF------------------T 47 L L ++F G NPG+ S+ +AH +N FWK+ Q+G Sbjct: 50 LVENLTLLFIGFNPGMESSLQQHHYAHFSNLFWKLFNQSGLLLQCLGVLDPQYLSHNYDN 109 Query: 48 DRQLK-----------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKI 96 D L+ P+ L+ Y+ G T L+ R T A E+ E A +LI++ Sbjct: 110 DELLQVLVKDGTTYAKPEHDYELIKYKIGFTDLILRCTRTAQELPMAEKLANVPRLIDEF 169 Query: 97 EDYQPQALAILGKQAYEQGF--------------SQRGAQWGKQTLT------------- 129 + + +GK +E S+ WG Q Sbjct: 170 NMSSSKHIVFIGKGIWEVIVKYVEVELGIKKVKLSKETFMWGLQDSQNSRLYALVLKKFQ 229 Query: 130 ---IGSTQIWVLPNPSGLS-RVSLEKLVEAYRELDQAL 163 ++++V PN SGL + E+ ++ +++L + Sbjct: 230 SKISADSKVFVFPNTSGLVGSLKYEEKLKLWQDLADVI 267 >UniRef50_Q1Q0C1 Similar to G/U-mismatch-specific DNA glycolase n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q0C1_9BACT Length = 170 Score = 145 bits (366), Expect = 7e-34, Method: Composition-based stats. Identities = 39/159 (24%), Positives = 73/159 (45%), Gaps = 13/159 (8%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD---------- 61 ++F GINP S PF++ FW ++ +AG + + L+ Sbjct: 11 ILFVGINPHPGSFNRHVPFSN-NKLFWYLLSEAGLIKEKREELRDDKTLERVYKEKFNTV 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G ++DRPT + K E G +K+ I+ P+ + +GK +YE+ + Sbjct: 70 YGLGFVNIIDRPTRDITGIKKHEELPGRKKISRIIKAEMPKVVCFIGKVSYEKYAGSKDF 129 Query: 122 QWGKQTLTIGSTQIWVLPNP-SGLSRVSLEKLVEAYREL 159 +G Q IG+++++V+ P G + + + +L E R + Sbjct: 130 SFGWQ-ENIGASKVFVMHFPLRGEALIRVRELQEIKRTI 167 >UniRef50_C7DIS9 Putative uncharacterized protein n=2 Tax=unclassified Euryarchaeota RepID=C7DIS9_9EURY Length = 189 Score = 142 bits (359), Expect = 4e-33, Method: Composition-based stats. Identities = 43/168 (25%), Positives = 78/168 (46%), Gaps = 14/168 (8%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ----------LKPQE 55 L+ ++F GINP S PF++ FW ++ +A + ++ E Sbjct: 18 LSKNALIMFVGINPHPGSYKRKVPFSN-NKMFWYLLSRAHIIEEPTEYLMNDKKLVQMYE 76 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 + + +YR LVDRPT +++ K E G ++ + I Y+P+ + +GK YE+ Sbjct: 77 GKFVQEYRLNFINLVDRPTSDVSKLRKGEERPGAARVFDSIRIYKPRLVCFIGKITYEKF 136 Query: 116 FSQRGAQWG-KQTLTIGSTQIWVLP-NPSGLSRVS-LEKLVEAYRELD 160 + +G KQ + ST + P + R+ L+ L++A R +D Sbjct: 137 SGSKHTDYGYKQPIYGISTYVCRFPIRGPAIDRIKELKALIKAARGID 184 >UniRef50_A5EPR0 G/U mismatch-specific uracil-DNA glycosylase n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EPR0_BRASB Length = 166 Score = 141 bits (355), Expect = 1e-32, Method: Composition-based stats. Identities = 46/164 (28%), Positives = 75/164 (45%), Gaps = 6/164 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L PGLR+VF G SA TG +AHP NRFW ++ G T R +P E LL Sbjct: 4 LPDQLKPGLRLVFVGTAASARSAATGHYYAHPGNRFWPTLFAVGLTPRLYRPHEFPALLS 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG- 120 G T L + + K + G KI ++P+ +A K+A + + Sbjct: 64 LGIGFTDLCKQGAGMDHVALKAGVDVAG--FAAKIRRHRPRTIAFTSKKAASLFYRKPTG 121 Query: 121 -AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 G+Q+ +VL +PSG + + ++ +++L + + Sbjct: 122 AIALGRQSEQDDFPTAFVLASPSGAASGAWT--LQPWQDLAEHV 163 >UniRef50_B9WEN4 G/U mismatch-specific uracil DNA glycosylase, putative (Uracil mismatch repair protein, putative) (Dna repair protein, putative) n=2 Tax=Candida RepID=B9WEN4_CANDC Length = 292 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 45/240 (18%), Positives = 88/240 (36%), Gaps = 78/240 (32%) Query: 2 VEDI---LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------ 52 + D+ L L+V+F G NPG SA +AH +N FWK+ Q+ + + Sbjct: 53 LPDLKPSLNDHLQVLFVGFNPGTESAIQQHHYAHHSNLFWKLFNQSQLLHKVISKNNQQE 112 Query: 53 -----------------------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGG 89 L+ Y G + LV R T +A++++ E Sbjct: 113 EQQIVLDDSNDQFLNHLLENGCNATHDFKLIKYNIGFSDLVLRSTARADQLTISEKLENV 172 Query: 90 RKLIEKIEDYQPQALAILGKQAYEQ-----------------------------GFSQRG 120 +L+ + + + + + I+GK +E G + + Sbjct: 173 PRLLSEFKSSRVETIVIVGKGIWEIIVKYFMKELNISSTNFKLTSSNSNRKSDGGSAPKN 232 Query: 121 AQWGK----------------QTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 +WG S++I++LPN SGL + +S ++ ++ ++++ L Sbjct: 233 FEWGLLQKGSDLTYNLIIDSIYKHIDESSKIYILPNTSGLVASLSFDEKLKLWQDMVNDL 292 >UniRef50_Q6BL09 DEHA2F17270p n=2 Tax=Saccharomycetaceae RepID=Q6BL09_DEBHA Length = 275 Score = 133 bits (334), Expect = 3e-30, Method: Composition-based stats. Identities = 45/222 (20%), Positives = 74/222 (33%), Gaps = 63/222 (28%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------------- 52 L L+V+F G NPGL S+ +AH N FWK+ ++ L+ Sbjct: 52 LDDRLKVLFIGFNPGLQSSIQQHHYAHFTNLFWKLFNESKLLINVLQSLNISVGDAIKED 111 Query: 53 --------------------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKL 92 P ++ Y+ G T LV R T A E++ E +L Sbjct: 112 PLLKELIQVDSDDKYKTFVKPVNDFEIVKYKIGFTDLVLRCTKTAQELNLSEKLENVPRL 171 Query: 93 IEKIEDYQPQALAILGKQAYEQGFS-------------QRGAQWGKQ------------- 126 + + + +GK +E + QWGKQ Sbjct: 172 FNEFKQSGSTFIVFVGKGIWEIIVKYITNLLKIKVKLTKGNFQWGKQILGEDEHYNMILE 231 Query: 127 ---TLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYRELDQALV 164 + I+V PN SGL ++ + + +L ++ Sbjct: 232 KLNEILGCKHNIYVFPNTSGLVTSLKYDEKLHLWNDLSHEIL 273 >UniRef50_Q4P6C9 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P6C9_USTMA Length = 649 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 3/102 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA---QH 58 V D+ A ++ CGINPG+ S+ +G FAH +N F+ ++ AG T R++KP++ Sbjct: 296 VPDLFAERNDIMLCGINPGVKSSSSGHHFAHRSNHFYPSLHLAGITSRRMKPEQDVEFPF 355 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ 100 L Y G+T L RPT + NE+ EL G L+EKI ++ Sbjct: 356 LRPYSLGLTNLAPRPTAEGNELLPSELIDGVPVLLEKILKWK 397 >UniRef50_A5DZA6 Putative uncharacterized protein n=2 Tax=Saccharomycetales RepID=A5DZA6_LODEL Length = 312 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 49/228 (21%), Positives = 86/228 (37%), Gaps = 66/228 (28%) Query: 2 VEDI---LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ---- 54 + D+ L L+++F G NPG+ S+ +AH +N FWK+ QA R ++ + Sbjct: 85 LPDLRHSLNSNLKLLFVGYNPGVQSSIQQHHYAHHSNLFWKLFNQANVLQRVVQKRKKKE 144 Query: 55 -----------------------------EAQHLLDYRCGVTKLVDRPTVQANEVSKQEL 85 L+ Y G + L R TV+A E++ +E Sbjct: 145 NGMHQKVFDDDDDDDKYLKTLLSTGCSAIHDFDLIKYDIGFSDLSLRCTVRAEELTSEEK 204 Query: 86 HAGGRKLIEKIEDYQPQALAILGKQAYEQGFS-------------QRGAQWG-------K 125 A +LI +I + + + ++GK +E + QWG K Sbjct: 205 IANIPRLIREINESNAENVVLIGKGIWEVIVKFYKAQLGIKFQLNKDNFQWGQIAESKDK 264 Query: 126 QTLTIGS---------TQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 Q I T I+ P+ SGL + + + + ++E+ L Sbjct: 265 QYNQIIEHVYAQLPKHTTIYAFPSTSGLVASMKFSEKLRLWQEMANNL 312 >UniRef50_C5MI19 Predicted protein n=1 Tax=Candida tropicalis MYA-3404 RepID=C5MI19_CANTT Length = 267 Score = 127 bits (319), Expect = 2e-28, Method: Composition-based stats. Identities = 46/208 (22%), Positives = 80/208 (38%), Gaps = 50/208 (24%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE---------- 55 L+ L V+F G N G SA +AH N+FWK+ ++ ++ Q Sbjct: 60 LSNKLHVLFIGFNAGEKSALKQHRYAHHTNQFWKLFNESNLLEKVTTSQNITKDDTLEKL 119 Query: 56 ---------AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAI 106 HL++Y G T LV+R T +E+ E +L+++ + + I Sbjct: 120 LSHGCKAVHDYHLVNYGIGFTSLVERSTKNTSELGITEKLESVPRLLKEWNESNAVEIVI 179 Query: 107 LGKQAYEQ---GFSQR----------GAQWG-----KQTLTIG------------STQIW 136 LGK ++ FS++ +WG + S++I Sbjct: 180 LGKGIWDVIVTFFSRKLNVKFKLNAIEFKWGEVGKTHEHEIYNLILDSIHDCIPKSSKIH 239 Query: 137 VLPNPSGLSR-VSLEKLVEAYRELDQAL 163 V P+ SGL R E ++ +++ L Sbjct: 240 VFPDTSGLVRHFKYEDKLDLWKDFVHRL 267 >UniRef50_UPI0000D8867B hypothetical protein CIMG_10236 n=1 Tax=Coccidioides immitis RS RepID=UPI0000D8867B Length = 218 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 31/79 (39%), Positives = 50/79 (63%), Gaps = 1/79 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + GL ++F G+NPG+ + TGF +AHP+N +WK+++ +G T RQ P + L Sbjct: 123 LLHDTIPQGLILLFIGVNPGIMTGQTGFVYAHPSNLYWKLLHWSGITMRQHPPSDTYRLP 182 Query: 61 D-YRCGVTKLVDRPTVQAN 78 + Y G T +V RPT A+ Sbjct: 183 ELYGVGNTNIVARPTRDAS 201 >UniRef50_Q4T9X4 Chromosome undetermined SCAF7485, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4T9X4_TETNG Length = 482 Score = 91.4 bits (226), Expect = 1e-17, Method: Composition-based stats. Identities = 35/147 (23%), Positives = 58/147 (39%), Gaps = 29/147 (19%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ---AGFTDRQLKPQEA-- 56 + D+L NPGL P +++ +G + L Sbjct: 190 LPDLLD---------YNPGLR-----HHRNQPGPDG--CLHRTLVSGPRESFLSSSTTCM 233 Query: 57 -QHLL-DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ 114 HL YR G T +V R T + ++S +EL GG+ L+EK++ ++P GK YE Sbjct: 234 TPHLPGKYRMGFTNMVARATPGSKDLSSKELREGGKILVEKLKKFKPLIAVFNGKCIYEM 293 Query: 115 G----FSQR--GAQWGKQTLTIGSTQI 135 F ++ ++G Q I + Sbjct: 294 FCRELFGKKPQKLEFGLQPHKIPDCDV 320 >UniRef50_D0N7I8 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7I8_PHYIN Length = 240 Score = 91.0 bits (225), Expect = 1e-17, Method: Composition-based stats. Identities = 39/178 (21%), Positives = 75/178 (42%), Gaps = 23/178 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-------YQAGFTDRQLKPQEAQHLLD 61 LR++ G NP + TG+ +++P NR W ++ G + E L D Sbjct: 52 RLRLLIVGHNPSDHAWKTGYSYSNPTNRMWMLLTGTLSPHSWEGIVPSTARIDEQNILPD 111 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIED----YQPQALAILGKQAYEQGF 116 + G T + P A++ K + A +++ + P +A GK+ + F Sbjct: 112 VHGVGFTSIGLEPGNDASKYGKATMEAWRTSFFQRLRRHALPHAPILIAFSGKRQFSYLF 171 Query: 117 S---QRGAQWGKQTLTIGS--------TQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 S + +GKQ+ +++WVLP+ SG + ++ + Y++L + L Sbjct: 172 SPALSKIESYGKQSKLPPGWPHELRADSEVWVLPSSSGRAAMTTAQRTLPYQQLAERL 229 >UniRef50_C1EDR1 Predicted protein n=2 Tax=Micromonas RepID=C1EDR1_9CHLO Length = 302 Score = 89.1 bits (220), Expect = 5e-17, Method: Composition-based stats. Identities = 38/193 (19%), Positives = 69/193 (35%), Gaps = 39/193 (20%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL-DYRCGVTK 68 L+++ G NP S G+G+ +++P+N FWK++ + ++ L D G T Sbjct: 93 LKLLIIGHNPSEHSWGSGWSYSNPSNNFWKLLAKGRIVPEDWTKEDCPRLPGDLGIGFTD 152 Query: 69 LVDRPTVQANEVSKQELHAGGRKLIEKIE---------------------------DYQP 101 P A ++ + L ++ + P Sbjct: 153 AGTVPGNDAGAYNRATMKKWRSDLYARLRGHVARCADASGDEDARVGHRSSMSIASRHGP 212 Query: 102 QALAILGKQAYEQGFSQ--RGAQWGKQTLT---------IGSTQIWVLPNPSGLSRVSLE 150 +A GK+ Y Q F G+Q T++WV+P+ SG + ++ E Sbjct: 213 DVVAFSGKRQYAQLFDAVPSRVDAGRQPPESLPPGWPLCPERTEVWVMPSSSGRAAMTRE 272 Query: 151 KLVEAYRELDQAL 163 + EL L Sbjct: 273 AREGPWVELGTRL 285 >UniRef50_A8IDF6 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IDF6_CHLRE Length = 334 Score = 87.9 bits (217), Expect = 1e-16, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 72/198 (36%), Gaps = 36/198 (18%) Query: 2 VEDILAPG-LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--RQLKPQEAQH 58 V + L LR+V G NP + +G +++P+N W ++ + G R PQ+ Sbjct: 119 VPEKLGDDPLRLVIVGHNPSEHAWRSGHYYSNPSNHLWPLLRRTGIAPPDRIRGPQDDGL 178 Query: 59 LLDY-RCGVTKL-VDRPTVQANEVSKQELHAGGRKLIEKIEDYQ---------------- 100 + G + P ++ + R ++ + Sbjct: 179 MPAVAGVGFLDVGCGVPGTDSSRFKSEVFEGWSRGFYSRLRAHMRRAATSIGCRCGLCGA 238 Query: 101 PQALAILGKQAYEQGF---------SQRGAQWGKQTLTIGS------TQIWVLPNPSGLS 145 P+ +A GK+ Y + + ++G+Q T++WV + SG + Sbjct: 239 PRLVAFTGKRQYLELMNVGAGPARNKVKTVEFGRQAALPPGWPLPPSTEVWVCTSTSGAA 298 Query: 146 RVSLEKLVEAYRELDQAL 163 ++ E + Y +L + L Sbjct: 299 PMTREAREQPYVQLAERL 316 >UniRef50_UPI00017F04BF PREDICTED: similar to thymine-DNA glycosylase n=1 Tax=Sus scrofa RepID=UPI00017F04BF Length = 300 Score = 76.0 bits (186), Expect = 5e-13, Method: Composition-based stats. Identities = 17/39 (43%), Positives = 21/39 (53%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKV 40 + DIL L +V GINPGL +A G + P N FW Sbjct: 122 LPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWLC 160 >UniRef50_Q2FQM0 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FQM0_METHJ Length = 159 Score = 69.8 bits (170), Expect = 3e-11, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 69/164 (42%), Gaps = 12/164 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKP-QEAQHLL 60 + +L+ RV+ G P + S G +AHP N FWK++ LL Sbjct: 6 LSPLLSSDCRVLILGSYPSVLSIKAGEYYAHPRNMFWKIMEIILNIPHNQTYTTRISLLL 65 Query: 61 DYRCGVTKLVDRPTVQ-ANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 ++ G+ + + + + + ++ +L I ++Q + L +A E+GF + Sbjct: 66 NHGIGLWDVYAACSREKSADAHIKDPVPN--ELKTLILNHQDIRIIFLNGRAAEKGFRKF 123 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 QT+ LP+ S V LE+ ++ +RE+ + L Sbjct: 124 FPDISLQTIY--------LPSSSPAHAVHLEEKIKRWREITKFL 159 >UniRef50_C3X2Y9 Mug G:T/U mismatch-specific DNA glycosylase n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X2Y9_OXAFO Length = 161 Score = 69.8 bits (170), Expect = 3e-11, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 65/167 (38%), Gaps = 21/167 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ--HL 59 + + +P + + G PG S G +AHP N+FW+++ + L P EA+ L Sbjct: 9 LPPLFSPHIHTMILGSFPGRESLAAGQYYAHPRNQFWRLLSAVLGEELVLLPYEARLERL 68 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFS 117 L + G+ ++ + S+ + ++++ P + GK A Sbjct: 69 LRHGIGLWDVITMCERIGSLDSR--IKNAVPNDFKRLKKDCPNLEKICFNGKVA------ 120 Query: 118 QRGAQWGKQTLTIGST--QIWVLPNPSGL-SRVSLEKLVEAYRELDQ 161 G + + + VLP+ S + S E+ + ++ + Sbjct: 121 ------GTYAERLAAAGYKTIVLPSSSPAYASRSFEEKRVLWEKILE 161 >UniRef50_Q1VS76 G/U mismatch-specific DNA glycosylase, putative n=2 Tax=Bacteria RepID=Q1VS76_9FLAO Length = 254 Score = 67.9 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 63/154 (40%), Gaps = 18/154 (11%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH-LLDYRCGVTKLV 70 ++ G PG S G + HP NRFWK+I + L + + LL ++ G+ + Sbjct: 110 ILILGTMPGDKSLELGEYYGHPRNRFWKIISTITNNELPLTYSDKKELLLKFKIGIWDIA 169 Query: 71 DRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGK---QAYEQGFSQRGAQWGKQ 126 + + + + L + I ++ + + G +++ F ++ G + Sbjct: 170 HKANRKGS-LDSAIEDEEPNDLDDFIARHKNLKVIGFNGTKSETLFDKYFDRKS---GLK 225 Query: 127 TLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 ++ LP+ S + + + + + +R++ Sbjct: 226 -------YVY-LPSTSPANTGIDFDNICKLWRQI 251 >UniRef50_C6X5D8 G:T/U mismatch-specific uracil/thymine DNA-glycosylase n=2 Tax=Flavobacteriaceae RepID=C6X5D8_FLAB3 Length = 162 Score = 65.2 bits (158), Expect = 8e-10, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 64/167 (38%), Gaps = 19/167 (11%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA---QHL 59 I++ R++ G PG+ S +AHP N+FWK+++ D + L Sbjct: 9 PPIISDTSRILILGSAPGIKSLQMQQYYAHPQNQFWKILF--HLFDEEYAVDYDIRKALL 66 Query: 60 LDYRCGV---TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF 116 + +R ++ +E + +++ + D+ P AI ++ Sbjct: 67 FKNHIALWDVIDSCEREGSSDAKIRNEEHND----VLQLVTDF-PNITAIFSNG--QKSS 119 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 + G ++ VLP+ S L + K +EA++ L L Sbjct: 120 KEARKMLGAESQIPFH----VLPSTSPLHTIPFNKKLEAWKILKHYL 162 >UniRef50_C9RN39 Mug G:T/U mismatch-specific DNA glycosylase n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RN39_FIBSS Length = 179 Score = 64.4 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 37/167 (22%), Positives = 66/167 (39%), Gaps = 17/167 (10%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---L 59 + V+ G P S GF + HP NRFWKV+ A + + L Sbjct: 23 PALFDEKSEVLLLGSIPSPKSREQGFYYGHPQNRFWKVL--ASVLNEPTPETIDEKKAML 80 Query: 60 LDYRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 L++ + ++D T + A++ S +++ + I + + + G A++ Sbjct: 81 LEHHIALWDVLDSCTIIGASDTSIEDVVPN--DIASLIAKTKIKRIFCTGATAHKLYEKY 138 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRELDQALV 164 G + + LP+ S + V EKLVEAY E+ + Sbjct: 139 CEKATGIKAVK--------LPSTSPANCAVKFEKLVEAYGEIRRETT 177 >UniRef50_Q7MVW9 G/U mismatch-specific DNA glycosylase, putative n=2 Tax=Porphyromonas gingivalis RepID=Q7MVW9_PORGI Length = 163 Score = 62.9 bits (152), Expect = 4e-09, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 63/158 (39%), Gaps = 20/158 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK---PQEAQHLLDYRCG 65 L ++ G PG S G +AHP NRFW + A + L + + LL G Sbjct: 16 HLEILILGSLPGDESIRRGQYYAHPRNRFWPL--MAKLLGKPLPDDYAERTEILLSAHIG 73 Query: 66 VTKLVDRPTV-QANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRGAQ 122 + + + ++ + + IE P+ +A G++A E F ++ Sbjct: 74 LWDVAHSAIRKGSADIQICDEEPN--DIRSLIER-NPRLHTIAFNGRKA-EAMF-RKHF- 127 Query: 123 WGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 T+ + ++P+ S + +L+ LV+ + + Sbjct: 128 ----PETLA-IRCLLMPSTSPANAGKTLDLLVKDWNRI 160 >UniRef50_A6SVT3 G:T/U mismatch-specific DNA glycosylase n=21 Tax=Proteobacteria RepID=A6SVT3_JANMA Length = 165 Score = 62.5 bits (151), Expect = 6e-09, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 61/165 (36%), Gaps = 23/165 (13%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL-- 60 ++ ++ G PG +S +AHP N+FWK + A + + + LL Sbjct: 14 PPVIDQHTEILILGSFPGEASLSAQQYYAHPRNQFWK-LTGALINEDLVTLPYTERLLRL 72 Query: 61 -DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFS 117 +R G+ ++ + + + ++ PQ + GK + Sbjct: 73 SAHRIGLWDVIAACAREGS--LDSAIRQAQHNDFALLKQQCPQLRRVCFNGKTS------ 124 Query: 118 QRGAQWGKQTLTIGST--QIWVLPNPSGL-SRVSLEKLVEAYREL 159 GKQ + +LP+ S + ++ E+ + A+R + Sbjct: 125 ------GKQEALFAQAGFETIILPSSSPAYALLTFEQKLAAWRGI 163 >UniRef50_B9TDK4 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TDK4_RICCO Length = 198 Score = 62.1 bits (150), Expect = 6e-09, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 64/159 (40%), Gaps = 12/159 (7%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ-EAQHLLDYRCGV 66 P R++ G PG++S +AHP N FW+++ Q Q Q D + Sbjct: 37 PDARILILGTLPGIASLRQQQYYAHPQNAFWRILEQLFNIPAQAAYHVRCQSAQDQGLAI 96 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRGAQWG 124 + + S + ++ I ++ P+ +A+ G+ A + ++ + Sbjct: 97 WDVCHSAQRMGSLDSNIQSNSIVANHIHQLLADCPRIRLIALNGQAAAQLF--KKHIRPA 154 Query: 125 KQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 + T+ +LP+ S +R+ E+ + + L QA Sbjct: 155 PEIETV------ILPSTSPANARLRFEEKLAQWAILKQA 187 >UniRef50_B2KDM2 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDM2_ELUMP Length = 162 Score = 62.1 bits (150), Expect = 7e-09, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 63/162 (38%), Gaps = 22/162 (13%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--LLDY 62 I + +++ G PG S G +AHP N FW V+++ F + + +++ Sbjct: 8 ITSKEPKILILGTMPGEESLLRGEYYAHPRNAFWPVLFKC-FGAEISAQYKDKKNLIINN 66 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ--PQALAILGKQA---YEQGFS 117 + +++ Q + E+ I+ + +A+ GK A Y++ F+ Sbjct: 67 NLALWDVLNECERQGS--LDSEIKNFTPNKIDVFLRSRKSVRAIFFNGKGAGRFYKRFFA 124 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + + LP+ S + SLE+ +++ Sbjct: 125 -----------PFEEVKYFYLPSTSPAYAAKSLEEKTALWKQ 155 >UniRef50_Q7VJA2 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VJA2_HELHP Length = 188 Score = 61.7 bits (149), Expect = 8e-09, Method: Composition-based stats. Identities = 41/157 (26%), Positives = 68/157 (43%), Gaps = 20/157 (12%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ-LKPQEA----QHLLDYRCG 65 RV+ G P + S F +AH NRFW ++ FT ++ Q+ Q LL + Sbjct: 36 RVLILGSFPSVISRDEQFYYAHSHNRFWHIL-ATLFTPEIDIRAQDRDAKKQFLLTHHIA 94 Query: 66 VTKLV---DRPTVQANEVSKQELHAGGRKL-IEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + +V D + +S + + L KI G++AYE Sbjct: 95 LWDIVRECDICNSSDSTLSNAKPNDISVILSSAKIN----AIFC-NGRKAYELFIR---- 145 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYR 157 +G Q ++++LP+ S +R SLEKLV+++R Sbjct: 146 FFGSQMCETQEIKVFLLPSSSPANARYSLEKLVDSWR 182 >UniRef50_D0WFV1 Putative G/U mismatch-specific DNA glycosylase n=2 Tax=Bacteria RepID=D0WFV1_9ACTN Length = 562 Score = 61.7 bits (149), Expect = 9e-09, Method: Composition-based stats. Identities = 36/153 (23%), Positives = 58/153 (37%), Gaps = 15/153 (9%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LLDYRCGVT 67 +V+ G P S GF +AHP NRFW+VI A D L + LL + + Sbjct: 20 QVLVLGTMPSPRSRKEGFYYAHPRNRFWRVI--AQLFDEPLARTNEERIDQLLRHHIALW 77 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 ++ + + + + +I D P A A Q + + + Sbjct: 78 DVL--ASCRIAGAKDASIRDAVPNDLARIMDAAPVAAVFCTGTAAAQAYRRF-------S 128 Query: 128 LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 LP+ S + + L+ LV+AYR L Sbjct: 129 EPKTGIACVQLPSTSPANAAMGLDALVDAYRPL 161 >UniRef50_C6QE22 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QE22_9RHIZ Length = 178 Score = 60.6 bits (146), Expect = 2e-08, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 63/173 (36%), Gaps = 15/173 (8%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA----QH 58 I+ ++ G PG S + HP N FW +I A + P + Sbjct: 11 PPIVTADAHLLILGTLPGEESLRLQQYYGHPRNHFWPLI--AAIFGEAV-PSDYSARIAL 67 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGF 116 L + C + +++ + S + P + +A G++A F Sbjct: 68 LKRHGCALWDVLESAERIGS--SDAAIRNATVNPFAAFLADYPNIRTIAFNGQKA-RDLF 124 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYR-ELDQALVVRGR 168 + + G + + LP+ S L SLE+ + +R +L ++L + R Sbjct: 125 HRHVVKPGHVPEQVCTMID--LPSSSPLYTKSLEEKLGIWRAKLAESLAAQKR 175 >UniRef50_B0T5Y6 Uracil-DNA glycosylase superfamily n=10 Tax=Proteobacteria RepID=B0T5Y6_CAUSK Length = 166 Score = 60.2 bits (145), Expect = 3e-08, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 58/171 (33%), Gaps = 19/171 (11%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKP--QEAQ--H 58 + RV+ G PG +S + +P N FW+ + ++ D L P E + Sbjct: 8 PPVTDANTRVLILGSLPGEASLAVQQYYGNPRNAFWR-LMESVL-DAPLVPLAYEDRLAA 65 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFS 117 LL G+ ++ + + + L+ IE + +A G A + G Sbjct: 66 LLSRGVGLWDVIAQAERPGS-LDAAIRDPAANDLLALIETLPALRLVAFNGGAAAKLGSR 124 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVRG 167 L LP+ S + S + E + L + +G Sbjct: 125 ----------LIGDRVAALALPSSSPAHAARSFAQKAEIWGGLKTFVFPQG 165 >UniRef50_C5AAU4 Uracil-DNA glycosylase superfamily protein n=8 Tax=Proteobacteria RepID=C5AAU4_BURGB Length = 163 Score = 59.8 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 31/155 (20%), Positives = 56/155 (36%), Gaps = 16/155 (10%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ----HLLDYRCGV 66 RV+ G PG S +AHP NRFW+++ +L + LL +R G+ Sbjct: 16 RVLILGSLPGEVSLAQSQYYAHPQNRFWRLLGD--VLGTELVALDYASRLDTLLRHRIGL 73 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRGAQWGK 125 +V T + + + L ++ +A G A + G Sbjct: 74 WDVVAEATRIGS-LDSRIRDHASNDLSALVDTLPNLDTIAFNGGTAAKLGTRALA----- 127 Query: 126 QTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 ++ LP+ S + + + A+R+L Sbjct: 128 --ERGARYRLLRLPSSSPAYAALPYADKLLAWRQL 160 >UniRef50_C8WEM8 Uracil-DNA glycosylase superfamily n=4 Tax=Zymomonas mobilis RepID=C8WEM8_ZYMMN Length = 163 Score = 59.8 bits (144), Expect = 4e-08, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 70/163 (42%), Gaps = 21/163 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK----PQEAQHLLDYRC 64 R++ G PG++S + HP N+FW++ + ++ P+ + LL + Sbjct: 12 NSRLLILGSLPGVASLEKAQYYGHPRNQFWRL--MSDIIGEDIESAAYPERLEGLLRHHI 69 Query: 65 GVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRGAQ 122 G+ ++ Q + + + +E+ L + I +ALA G Q+ AQ Sbjct: 70 GLWDVIGTAKRQGSLDSNIKEVSPNP--LGDLIGKLPNLKALAFNG---------QKAAQ 118 Query: 123 WGKQTLTIGSTQI--WVLPNPSGLSRVSLEKLVEAYRELDQAL 163 G + L ++ ++LP+ S V+ + A+ +L + + Sbjct: 119 LGIKELQKIGAKLPYYILPSSSPAHAVAYDVKKAAWIKLQEII 161 >UniRef50_Q82UD2 Putative uncharacterized protein n=1 Tax=Nitrosomonas europaea RepID=Q82UD2_NITEU Length = 167 Score = 59.0 bits (142), Expect = 5e-08, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 58/166 (34%), Gaps = 14/166 (8%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-GFTDRQLKPQEAQHLLD 61 I +PG V+ G PG S +AHP N FW++ + GF P L D Sbjct: 7 PPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILGFDATSAYPLRVSSLKD 66 Query: 62 YRCGVTKLVDRPTVQA---NEVSKQELHAG-GRKLIEKIEDYQPQALAILGKQAYEQGFS 117 + + ++ T ++ ++ + + D + + G +A Sbjct: 67 HGVALWDVLQSCTRESSLDADIVAHTIVPNDFGRFFTACPDIR--RVCFNGAKAAALYA- 123 Query: 118 QRGAQWGKQTLTIGSTQIWV-LPNPSGL-SRVSLEKLVEAYRELDQ 161 R + Q T +V LP+ S + + + A+ + Sbjct: 124 -RHVKPFLQD---APTVEYVQLPSTSPANAAIPRADKLRAWSVIKH 165 >UniRef50_C4FAG7 Putative uncharacterized protein n=2 Tax=Collinsella RepID=C4FAG7_9ACTN Length = 185 Score = 58.7 bits (141), Expect = 8e-08, Method: Composition-based stats. Identities = 38/160 (23%), Positives = 63/160 (39%), Gaps = 16/160 (10%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKV---IYQAGFTDRQLKPQEAQH--L 59 I RV+ G P S GF + HP NRFW+V ++ G D +L +A+ L Sbjct: 26 IFDERSRVLVLGTMPSPRSREVGFYYGHPQNRFWRVMERLH--GLRDHELIDNDARRAFL 83 Query: 60 LDYRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 L +R + ++ A++ S L ++ + + G Q + Sbjct: 84 LAHRIALWDVLAACNIAGASDASIANPEPN--DLARILDAAPIETVCTTG-GKAAQLCRR 140 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYR 157 Q ++ LP+ S +R+ L+ LV AYR Sbjct: 141 FDGPVLTQR----GIELVELPSTSPANARMRLDDLVAAYR 176 >UniRef50_A6Q7V9 Putative uncharacterized protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6Q7V9_SULNB Length = 175 Score = 58.3 bits (140), Expect = 8e-08, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 67/162 (41%), Gaps = 24/162 (14%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ--HLLDY 62 I+ + + G P + S F +AHP N+FWK++ + T + ++ + L + Sbjct: 13 IVFNDTKTLILGSFPSIKSFENNFYYAHPRNQFWKILEK--LTGYPVNNRDQKIWLLKEC 70 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQA---YEQGFS 117 + G+ ++ + + + + + Y P +A GK+A +E FS Sbjct: 71 KLGLWDMIGSCQRENS-LDSSLEDEVVNDIPALLSAY-PSIDKIAFTGKKAQALFETHFS 128 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + +++ LP+PS + ++ E+ V Y E Sbjct: 129 YLEIE-----------RVY-LPSPSPAYAAMTFEEKVVQYAE 158 >UniRef50_Q1GYD6 G/U mismatch-specific uracil-DNA glycosylase n=1 Tax=Methylobacillus flagellatus KT RepID=Q1GYD6_METFK Length = 174 Score = 58.3 bits (140), Expect = 9e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 60/167 (35%), Gaps = 12/167 (7%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLD 61 + PG RV+ G PG S +AHP N FW I + + + L Sbjct: 12 PPAITPGCRVLILGSMPGKRSLEMQAYYAHPQNLFWPFITHLFNIPPESGYQERLRLLNQ 71 Query: 62 YRCGVTKL---VDRPTVQANEVSKQELHAG-GRKLIEKIEDYQPQALAILGKQAYEQGFS 117 G+ + R + ++ + + A L + + + G +A EQ F Sbjct: 72 QHIGIWDVLKECVRTSSLDGDIEETSIVANDFSALFMQFPHIR--HIFFNGSKA-EQAFR 128 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + + + + LP+ S + + E + ++++ A+ Sbjct: 129 RHVLKRQIIPAHLHYHK---LPSTSPANASIKKEHKLAQWQQIISAI 172 >UniRef50_C8WJG8 Putative uncharacterized protein n=8 Tax=Bacteria RepID=C8WJG8_EGGLE Length = 173 Score = 58.3 bits (140), Expect = 9e-08, Method: Composition-based stats. Identities = 37/159 (23%), Positives = 61/159 (38%), Gaps = 12/159 (7%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ-EAQHLLDYRCGVTKL 69 RV+ G P S GF + HP NRFWKV+ L + L + + + Sbjct: 19 RVLVLGTMPSPKSREIGFYYGHPQNRFWKVMGALFDEPEPLGIEGRTAFLYAHGIALWDV 78 Query: 70 VDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTL 128 + V A++ S + + +I + P + + GA Q Sbjct: 79 LSSCAIVGASDAS---IVDPVANNLSRIAEAAPLEAVFTTGSKATALYRRFGA---PQLP 132 Query: 129 TIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVR 166 + T LP+ S +R+ L+ LV+AY + + L R Sbjct: 133 GLAHTG---LPSTSPANARMRLDDLVKAYLPIRETLERR 168 >UniRef50_C5BH73 Putative uncharacterized protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BH73_EDWI9 Length = 63 Score = 58.3 bits (140), Expect = 1e-07, Method: Composition-based stats. Identities = 26/52 (50%), Positives = 30/52 (57%), Gaps = 5/52 (9%) Query: 16 GINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 G PG G FA+P NRFW VI+ AGFTDR L PQ HLL G++ Sbjct: 6 GTEPGP-----GDHFANPRNRFWSVIHLAGFTDRPLAPQAEPHLLIPAVGLS 52 >UniRef50_C9KQM1 Putative G/U mismatch-specific DNA glycosylase n=2 Tax=Veillonellaceae RepID=C9KQM1_9FIRM Length = 174 Score = 57.5 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 36/173 (20%), Positives = 60/173 (34%), Gaps = 24/173 (13%) Query: 6 LAPGLR----VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK---PQEAQH 58 L P + + G PG+ S +AHP NRFW A L + Sbjct: 6 LRPNIDKDCRALILGSMPGVKSLEAQQYYAHPQNRFWP--MMAHLLGEALPSAYEERLAM 63 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAIL-GKQAYEQG 115 LL + + + + + + + G +++Y P +A+ GK A Q Sbjct: 64 LLRHHIALWDSIGSCEREGS-LDSAIRNEQGNDFAALLQEY-PGIRAIYFNGGKSA--QA 119 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVRG 167 F + + +P+ S +R L L A+R +AL G Sbjct: 120 FKRYNKALLARPDIAFHA----MPSTSPANARWRLPMLEAAWR---KALTESG 165 >UniRef50_Q8CUP2 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8CUP2_OCEIH Length = 167 Score = 57.5 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 66/160 (41%), Gaps = 16/160 (10%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ--HLLDY 62 ++ RV+ G PG+ S + +P N FW +++ ++++ + + L + Sbjct: 10 VIPDDPRVLIVGSMPGVKSLAEQQYYGNPRNHFWSIMFTL-LEEKRVDNYQDKLSMLKRH 68 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQRG 120 + + + + + A +I ++DY P + + G +AY+ ++ Sbjct: 69 GIALWDTIGSCYREGS-LDVNIQDAEPNDIIGLVQDY-PTIKLIGCNGTKAYDIF--RKN 124 Query: 121 AQWGKQTLTIGSTQIWVLPNPS---GLSRVSLEKLVEAYR 157 + + + LP+ S G +LE+ +EA++ Sbjct: 125 FSL----ESFPNVDVVKLPSTSPIPGKYNKNLEEKIEAWK 160 >UniRef50_A0R287 T/U mismatch-specific DNA glycosylase n=5 Tax=Mycobacterium RepID=A0R287_MYCS2 Length = 162 Score = 57.5 bits (138), Expect = 2e-07, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 50/163 (30%), Gaps = 20/163 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLL 60 + I+ RV+ G P + S + + +P N FW+++ GF + + Sbjct: 8 LPPIVGEAPRVLILGNMPSVMSLASSEYYGNPRNAFWRIMGSLYGFAPDAAYSERVSAVT 67 Query: 61 DYRCGVTKLVDRPTVQANEVSKQ-ELHAGGRKLIEKIEDYQPQA--LAILGKQA---YEQ 114 + V V R + + E + P + G A Y + Sbjct: 68 SHGIAVWD-VLRSCRRVGSLDSAVERDSMVPNDFAGFFAAHPSLERVVFNGAAAEANYRR 126 Query: 115 GFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYR 157 + + +P+ S + E + A+R Sbjct: 127 LVGEVPLPY------------VRMPSTSPAQTMRFEDKLSAWR 157 >UniRef50_A6TMT4 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TMT4_ALKMQ Length = 164 Score = 56.7 bits (136), Expect = 3e-07, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 60/161 (37%), Gaps = 18/161 (11%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL---D 61 I+ ++ G P + S G + +P N+FWK+IY+ + + + + + Sbjct: 10 IIDANSHILILGSIPSVQSLQEGKYYGNPRNQFWKIIYE--VLGQPFQEDYGKKVQFIKE 67 Query: 62 YRCGVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQ 118 + + +++ + + + + +E P+ + G +A++ + Sbjct: 68 HGIAIWDVIESCHREGSLDSNIKEERPND---FNTFLKSYPRIKYVLFNGTKAHDTFRRK 124 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYREL 159 G + I LP+ S + + +E ++ L Sbjct: 125 VGFSF---EDKIFHK----LPSSSPAHTMKIGNKIEEWKVL 158 >UniRef50_Q8PWD8 Conserved protein n=3 Tax=Methanosarcina RepID=Q8PWD8_METMA Length = 158 Score = 56.7 bits (136), Expect = 3e-07, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 66/167 (39%), Gaps = 30/167 (17%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ----H 58 + ++ G PG S + HP N FW+++ A L+ + + Sbjct: 7 PPVFDNNTEILILGSLPGDESLRRRQYYGHPGNDFWRLVGNA--IGENLQDMDYETRLTT 64 Query: 59 LLDYRCGVTKLV---DRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQA-- 111 L +R G+ + R Q + + ++E++ ++++ P+ + GK + Sbjct: 65 LKRHRIGLWDVFKAGKRKGSQDSNIREEEVNQ-----FSELKEMAPELKKVFFNGKASGR 119 Query: 112 YEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRE 158 YE S G +T VLP+ SG++R +++K + Sbjct: 120 YEPVLS----AMGYETK--------VLPSSSGINRRNVKKRESEWEN 154 >UniRef50_A5VDD8 Uracil-DNA glycosylase superfamily n=8 Tax=Proteobacteria RepID=A5VDD8_SPHWW Length = 190 Score = 56.7 bits (136), Expect = 3e-07, Method: Composition-based stats. Identities = 34/171 (19%), Positives = 57/171 (33%), Gaps = 25/171 (14%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK----PQEAQH 58 +++ R++ G PG +S +AHP N+FW+ + A D L P Sbjct: 26 PPVVSADTRLLLLGSLPGEASLRAARYYAHPQNQFWR-LTGAVIGDEGLALLDYPARLDR 84 Query: 59 LLDYRCGVTKLVDRPTVQAN------EVSKQELHAGGRKLIEKIEDYQPQALAILGKQAY 112 L G+ +V + + +L A L +A+ G A Sbjct: 85 LRAAGVGLWDVVADALRDGSLDGAIRDHRPNDLAALVATLPAL------RAIGFNGGTAA 138 Query: 113 EQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 G GA G + LP+ S + + E + L + L Sbjct: 139 RLGRRMIGAPAGL--------ALVDLPSSSPAFTRPIAEKREKWLHLREFL 181 >UniRef50_A9KMN2 G:T/U mismatch-specific DNA glycosylase n=2 Tax=Clostridiales RepID=A9KMN2_CLOPH Length = 173 Score = 56.0 bits (134), Expect = 5e-07, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 66/165 (40%), Gaps = 17/165 (10%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LLD 61 + +++ G P + S F + HP NRFW+++ + +++ + LL+ Sbjct: 14 VFDKNSKILILGSFPSVKSREQEFYYGHPMNRFWRML--SSVLKKEIPSDSLRKREFLLE 71 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRG 120 + ++D + S + + +I P + + GK A + Sbjct: 72 NHIALWDVID--SCDIIGSSDSSIKNVVPTDLSRILAISPISHIYVNGKTAKKLYD---- 125 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 + + ++ + LP+ S + SLEKL+E ++ + + L Sbjct: 126 ----RYSESVTGLKAICLPSTSPANAMFSLEKLIEEWKVIIEPLG 166 >UniRef50_B1HZQ2 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HZQ2_LYSSC Length = 173 Score = 56.0 bits (134), Expect = 5e-07, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 57/164 (34%), Gaps = 17/164 (10%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK---PQEAQ 57 ++ I+ +V+ G PG S + +P N FW +I Q ++ + Sbjct: 12 LLP-IVDSSTKVLIVGSMPGKQSLVKQQYYGNPRNHFWPIIGQ--LLQVEVPEDYCERIA 68 Query: 58 HLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGF 116 L G+ ++ + + + + E Y + + G +A++ Sbjct: 69 LLKKREIGLWDTIEACEREGS-LDAAIRNEKPNDFQTLFETYPNIKLVLFNGAKAFDVF- 126 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPS---GLSRVSLEKLVEAYR 157 ++ + +P+ S G + S ++ VE +R Sbjct: 127 -KKHIGL----ELLAGRAYKKMPSTSPIPGKNIKSFDEKVEEWR 165 >UniRef50_Q31G15 Uracil DNA glycosylase superfamily protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31G15_THICR Length = 168 Score = 55.6 bits (133), Expect = 7e-07, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 58/164 (35%), Gaps = 10/164 (6%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRC 64 I+ +V+ G P + S F +AHP N FW ++ + L+ Sbjct: 10 IVPTQPKVLILGTAPSVVSLEKSFYYAHPRNAFWPILADYFNQPVDSVEAKKALCLENDI 69 Query: 65 GVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQRGAQ 122 + ++ Q + + + Y P +A+ GK A + F ++ + Sbjct: 70 LLWDVLSACERQGS-LDSAIKQPEANDFEALFQTY-PDLKAVLFNGKAA-QNLFEKQVIK 126 Query: 123 WGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAY-RELDQALV 164 + + LP+ S +R++ E + + L Q L Sbjct: 127 KQSLPEDL---MFYSLPSTSPANARLTFENKRVFWHKTLQQTLT 167 >UniRef50_UPI00016AE856 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Burkholderia pseudomallei B7210 RepID=UPI00016AE856 Length = 170 Score = 55.2 bits (132), Expect = 7e-07, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 60/162 (37%), Gaps = 13/162 (8%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ-AGFTDRQLKPQEAQHLLDYR 63 I+ + G PG S +AH AN+FW+++ G T + + LL + Sbjct: 8 IVDANSTHLILGTMPGRESLKFQRYYAHGANQFWQILETVYGETIPDHYERRVEFLLRHG 67 Query: 64 CGVTKLVDRPTVQANEVS---KQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + V R + + + EL L E+ + + + G A + Sbjct: 68 IALWD-VLRSADREGSLDSAIRNELANDFAGLFERFPRLR--TIGLHGTTAAKLFRRHIA 124 Query: 121 AQWGKQTLTIGSTQIWVLPNPS---GLSRVSLEKLVEAYREL 159 Q G + I Q VLP+ S G S E+ + +R+ Sbjct: 125 FQRG---VPIAELQCTVLPSTSSTPGRCVKSPEEKMACWRDF 163 >UniRef50_C6CSS2 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CSS2_PAESJ Length = 177 Score = 54.4 bits (130), Expect = 1e-06, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 61/169 (36%), Gaps = 24/169 (14%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFW---KVIYQAGFTDR----QLKPQE 55 ++ RV+ G P + S + HP N FW ++ AG D +L + Sbjct: 13 PPVIDERARVLVLGTAPSVKSLEHRQFYGHPRNYFWGMVYGLFGAGVPDEDYGKRLSFLQ 72 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALA--ILGKQAYE 113 H+ + K +RP + +E + L +Y P+ G +AY+ Sbjct: 73 DHHMAVFDV--IKSCERPGSLDVNIKNEEPND----LPALAGEY-PELKCFAFNGSKAYD 125 Query: 114 QGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLS---RVSLEKLVEAYREL 159 Q L + +P+ S + +LE +EA++ + Sbjct: 126 TFRKYYRDHPALQHLALLK-----MPSTSPIPTQKMRNLEDRIEAWKAI 169 >UniRef50_C2FY74 G/U mismatch-specific DNA glycosylase n=2 Tax=Sphingobacterium spiritivorum RepID=C2FY74_9SPHI Length = 160 Score = 54.0 bits (129), Expect = 2e-06, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 59/155 (38%), Gaps = 20/155 (12%) Query: 13 VFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ---EAQHLLDYRCGVTKL 69 + G PG S G +AHP NRFWK+++ +++ + Q L D+ + Sbjct: 17 LILGSLPGDKSLGQQQYYAHPQNRFWKLLF--HLMEKEYESDYSKRIQFLEDHHIALWDT 74 Query: 70 VD---RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 RP + EL L++ + Q + G ++ + + Sbjct: 75 CASAIRPGSMDTAI-LSELPNDIISLLQHVPSIQ--HVIFNG--------NKAKQLYDRY 123 Query: 127 TLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELD 160 + LP+ S +R S +KLV+++ L Sbjct: 124 HKRLAQISYHSLPSTSPANARFSFDKLVDSWSILK 158 >UniRef50_C9M6L9 Putative G/U mismatch-specific DNA glycosylase n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6L9_9BACT Length = 158 Score = 53.3 bits (127), Expect = 3e-06, Method: Composition-based stats. Identities = 32/148 (21%), Positives = 54/148 (36%), Gaps = 13/148 (8%) Query: 19 PGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRPT-VQ 76 P S GF +AHP NRFW ++ G L + +V T + Sbjct: 6 PSPKSREAGFYYAHPRNRFWPLLCRLLGCPTPPDTAGRKALCLSVGLALWDVVGSCTIIG 65 Query: 77 ANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIW 136 A++ + +++ A L + + + G A G+ + Sbjct: 66 ASDSTIEDVRAN--DLSPLLSGAPIRKIFTTGAAAARLYRQFIEPISGRPAES------- 116 Query: 137 VLPNPSGL-SRVSLEKLVEAYRELDQAL 163 LP+ S + + +L+EAYR L AL Sbjct: 117 -LPSTSPANAAWTFPRLLEAYRPLADAL 143 >UniRef50_C7N6G7 YjeF-like protein n=6 Tax=Bacteria RepID=C7N6G7_SLAHD Length = 447 Score = 53.3 bits (127), Expect = 3e-06, Method: Composition-based stats. Identities = 34/158 (21%), Positives = 56/158 (35%), Gaps = 17/158 (10%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LLDYRCGVT 67 RV+ G P S GF + HP NRFW V+ A + + LL + + Sbjct: 21 RVLILGTMPSPKSREEGFFYGHPRNRFWTVL--ASLFHEPVPATNQRKRDLLLRHGIALW 78 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAI-LGKQAYEQGFSQRGAQWGKQ 126 ++ + S + + +I P G ++ E +Q G Sbjct: 79 DVL--ASCDIEGASDASIKNAKPNDLTRITSAAPIEAVFCTGAKSAELYRRYCESQTG-- 134 Query: 127 TLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 I + Q LP+ S + + E L AY+ + L Sbjct: 135 ---IPAVQ---LPSTSPANAAMDTETLTAAYQVILDHL 166 >UniRef50_Q0F2S7 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F2S7_9PROT Length = 168 Score = 53.3 bits (127), Expect = 3e-06, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 54/167 (32%), Gaps = 29/167 (17%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH-----LLDY 62 P +V+ G PG S +AHP N FW I + F + L Sbjct: 12 PDAKVLILGSMPGRKSLDEQQYYAHPRNAFWP-IMASLF---EFSADIDYEKRLGMLRSN 67 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIE-------DYQP--QALAILGKQAYE 113 + + + + +L I D P +A+ G++A + Sbjct: 68 GIALWDVARQCIRPGS-------LDAAIRLPSVITNDFTAFFDTHPQIRAVCFNGRKAAD 120 Query: 114 QGFSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 + + + ++ LP+ S + +S + + +R + Sbjct: 121 LYQRRVIPELALNQTQLP---LYPLPSTSPTHAAISFDSKLAQWRTI 164 >UniRef50_A6DAL5 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DAL5_9PROT Length = 171 Score = 52.9 bits (126), Expect = 4e-06, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 71/160 (44%), Gaps = 16/160 (10%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ--AGFTDRQLKPQEAQHLLDYRCGVTKL 69 ++ G P + S F ++HP N+FWK++ T + ++ ++ L ++ + Sbjct: 20 ILILGTFPSIKSFENNFYYSHPQNQFWKILSTIFNEKTPKTIE-EKINFLKKHKIALWDT 78 Query: 70 ---VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRGAQWG 124 R + + + + + + ++ Y P +A+ + A + ++ + Sbjct: 79 ICECKRKEKNSKDSNLEIIKPC--DIENLLKQY-PNIKKVAVTSRTA--EKILKKHFK-- 131 Query: 125 KQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 +I ++Q+ LP+PS L + + + V+ ++EL Q + Sbjct: 132 NSQFSILNSQLIYLPSPSPLYASKNFQTKVKEWKELLQLM 171 >UniRef50_C3XK88 T/U mismatch-specific DNA glycosylase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XK88_9HELI Length = 162 Score = 52.5 bits (125), Expect = 5e-06, Method: Composition-based stats. Identities = 29/107 (27%), Positives = 45/107 (42%), Gaps = 8/107 (7%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--LLDYRCGVTKL 69 V+ G P S GF + +P NRFWK++ F +L EA+ LL YR + + Sbjct: 16 VLILGSFPSKMSFEIGFYYGNPRNRFWKMLQD--FFKVELVSVEAKKDFLLHYRIALWDI 73 Query: 70 VDRPTVQANE----VSKQELHAGGRKLIEKIEDYQPQALAILGKQAY 112 V + + + K L L + Q + + GK+AY Sbjct: 74 VCLGSNMQGDKESSMDKTLLATEISDLEALLAKTQIKTILCNGKKAY 120 >UniRef50_A1WBA7 Putative uncharacterized protein n=17 Tax=Proteobacteria RepID=A1WBA7_ACISJ Length = 207 Score = 52.5 bits (125), Expect = 6e-06, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 22/169 (13%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--- 58 + ++ G RV+ G PG++S + HP N FW+++ A + P Q+ Sbjct: 41 LPPMVGSGTRVLVLGSFPGVASLAAQQYYGHPQNHFWRIL--AALWPQHPLPGRDQYAAR 98 Query: 59 ---LLDYRCGVTKL---VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAI--LGKQ 110 LL G+ + +R + +++ + P AI G + Sbjct: 99 CDWLLARGLGLWDVYAACEREGSLDTSIRNAQVND-----FAGLHARAPGLAAIAHNGGE 153 Query: 111 AYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 ++ A G + + LP+ S + S EK + A+ E Sbjct: 154 SFRHA-KAVRASLGLKADAPMPSL--RLPSTSPANASWSFEKKLHAWGE 199 >UniRef50_Q1NE08 Uracil-DNA glycosylase superfamily protein n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NE08_9SPHN Length = 164 Score = 52.1 bits (124), Expect = 7e-06, Method: Composition-based stats. Identities = 35/159 (22%), Positives = 64/159 (40%), Gaps = 15/159 (9%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ--EAQHLLDYRCGVTK 68 RV+ G PG +S G +AH N FW +I + + +P + L + G+ Sbjct: 16 RVLVLGSLPGDASIRQGEYYAHRGNAFWHLIGDVVGENLRAQPYAMRLKRLRAHHVGLWD 75 Query: 69 LVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQRGAQWGKQ 126 +++ + + S+ + + + ++ P +A+A GK A +R Q Sbjct: 76 VMESAQREGSLDSR--IRQADLRNLADFVEHLPDLRAIAFNGKTA--ALHGRR------Q 125 Query: 127 TLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 + LP+ SG + +S E E + L LV Sbjct: 126 LTDRPGITLLDLPSTSGAYASLSREAKRERWMALRDHLV 164 >UniRef50_B7ATH8 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ATH8_9BACE Length = 180 Score = 52.1 bits (124), Expect = 8e-06, Method: Composition-based stats. Identities = 35/172 (20%), Positives = 67/172 (38%), Gaps = 16/172 (9%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHL-LDYR 63 ++ R++ G P + S F + HP NRFW+++ D ++ ++L L Sbjct: 16 VVDKDSRILILGSFPSVKSRENMFYYGHPQNRFWRMLADIVKADVPQTIEDKKNLILSNG 75 Query: 64 CGVTKLVDRPT--VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + + A+ + E+ L ++ Y +A+ G AY F + Sbjct: 76 FALWDTLAMCEIHASADSSIRHEVPNDIPGL---VKQYGIEAIMFNGNAAYR-YFEK--- 128 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYR----ELDQALVVRGR 168 +GK + LP+ S + S E+LV + EL + ++ R Sbjct: 129 YYGK-DDGLEGIIKKALPSTSPANAACSYERLVGEWGSAVNELKEKILKEKR 179 >UniRef50_B7CDR6 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CDR6_9FIRM Length = 153 Score = 51.7 bits (123), Expect = 9e-06, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 29/155 (18%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR-CGVTKL 69 RV+ G P + S +A+ NRFWKV+ + + + LL + + Sbjct: 17 RVLILGSMPSVISLKEQMYYANKTNRFWKVL------EAIYQENDKIKLLKISHIALWDI 70 Query: 70 VD---RPTVQANEVSKQELHAGGRKLIEKIEDYQ--PQALAILGKQAYEQGFSQRGAQWG 124 R T +E+ E + + E + Y+ + + G+ + Sbjct: 71 CHSCIRKTSADSEIKDIEPND----IQELLNRYKNIEKVIC-NGRTSQNTM--------- 116 Query: 125 KQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + + P+ S ++ SLE+L+E Y++ Sbjct: 117 --QKYFPNIETVYCPSTSSANAQFSLEQLIEIYKK 149 >UniRef50_A7ZBI7 Mug G:T/U mismatch-specific DNA glycosylase n=2 Tax=Campylobacter RepID=A7ZBI7_CAMC1 Length = 168 Score = 51.7 bits (123), Expect = 9e-06, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 68/166 (40%), Gaps = 17/166 (10%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LLD 61 I +++ G P + S GF + +P NRFW+V+ AG + L + LL Sbjct: 10 IFDKNSKILILGSFPSVISRKFGFYYTNPQNRFWRVL--AGILNAPLPASTDEKIKFLLS 67 Query: 62 YRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + + +++ ++ L ++ + + + G +AYE Sbjct: 68 HHIAIYDAAISCEIEGSSDAKMSKIVP--VNLKPLFKEAKIKQVYANGGKAYEI------ 119 Query: 121 AQWGKQTLTIGSTQIWV--LPNPSGL-SRVSLEKLVEAYRELDQAL 163 + + I +T+ V LP+ S ++ SLE+L+ ++ + L Sbjct: 120 CKKYLENEIIKATKNEVIKLPSTSPANAKFSLERLMCEWQVVADTL 165 >UniRef50_D1PNX0 T/U mismatch-specific DNA glycosylase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNX0_9FIRM Length = 176 Score = 51.3 bits (122), Expect = 1e-05, Method: Composition-based stats. Identities = 35/162 (21%), Positives = 58/162 (35%), Gaps = 19/162 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA----- 56 + + + G P S GF + HP NRFW ++ A T + Sbjct: 15 LPPLHGERAGALILGSFPSPKSREQGFFYGHPQNRFWPLL--ASLTGEPVPDWADIEAKK 72 Query: 57 QHLLDYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 Q +L + V + A++ S + + + I QA+ G Sbjct: 73 QIILRHGLAVWDTIGACDIRGASDASIRNVQPN--DVAALIRRLGVQAVFCNG-----AA 125 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAY 156 + A++ Q LT S VLP+ S + S+EKL + Sbjct: 126 SGRVYARY-AQPLTGLSAT--VLPSTSPANAAWSMEKLRAVW 164 >UniRef50_B9KXL9 Phage DNA polymerase-related protein n=17 Tax=cellular organisms RepID=B9KXL9_THERP Length = 230 Score = 51.0 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 65/166 (39%), Gaps = 20/166 (12%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P V+F G PG G PF A +F +++ +AG Q+ + Sbjct: 58 PHAEVMFIGEAPGYHEDRQGRPFVGAAGQFLNELLGRAGLRREQVY-------------I 104 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQ--W 123 T +V RP + + E+ A L E+I +P + LG+ + + F Sbjct: 105 TNVVKCRPPGNRDPL-PDEIAACAPYLDEQIAVIRPSVIVTLGRYSMARWFPNEKISRIH 163 Query: 124 GKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQALVVRGR 168 GK G + + +P+ L++LVE + + ++ + R Sbjct: 164 GK-PRRFGDVVVVPMYHPAAALHQPALKELVEKDFDALREIIAQAR 208 >UniRef50_B8GKM7 Putative uncharacterized protein n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GKM7_METPE Length = 185 Score = 51.0 bits (121), Expect = 1e-05, Method: Composition-based stats. Identities = 35/169 (20%), Positives = 64/169 (37%), Gaps = 14/169 (8%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ I+ V+ G P + G +A+P N FW +I + R L E LL Sbjct: 24 ILP-IIGSAPVVLVLGSFPSARARTVGEYYANPLNHFWPIIEELFGISRSLSYSERVRLL 82 Query: 61 -DYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGF 116 D + + +VD + + + E I P + +A G+ A E+ Sbjct: 83 ADQQVALWDMVDSCYQEGSMDSTIYEPALND---IRGFLTTHPTIRLVAANGRTA-ERFL 138 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 + + G + LP+ S +R S + V+ +R + + V Sbjct: 139 KRSLGKAGP----PAGVTVISLPSTSPANARESFHQKVQRWRVIRDSAV 183 >UniRef50_C4LCA1 T/U mismatch-specific DNA glycosylase n=10 Tax=Bacteria RepID=C4LCA1_TOLAT Length = 167 Score = 51.0 bits (121), Expect = 2e-05, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 54/156 (34%), Gaps = 9/156 (5%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ-AGFTDRQLKPQEAQHLLDYRCGV 66 R++ G PG +S G +AH N FW V+ + F + L + Sbjct: 13 ENARILILGSMPGQASLDAGQYYAHKRNLFWPVMAELLQFDPSAGYEERTAALRAAGIAL 72 Query: 67 TKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRGAQWG 124 ++ + + + + + E ++ + + G++A R Sbjct: 73 WDVLQSCKREGSLDANIEHATLTVNNFAEFFRQHRQIRHVFFNGEKAASCF--HRYVAKA 130 Query: 125 KQTLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYREL 159 + + LP+ S + SL ++A++ + Sbjct: 131 DYPADLICQR---LPSTSPANASQSLAFKLQAWQRI 163 >UniRef50_C8W9T7 Uracil-DNA glycosylase superfamily n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W9T7_ATOPD Length = 185 Score = 50.6 bits (120), Expect = 2e-05, Method: Composition-based stats. Identities = 38/167 (22%), Positives = 68/167 (40%), Gaps = 19/167 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIY-QAGFTDRQLKPQEAQHLL 60 ++ I+ P RV+ G P S GF F +P NRFW+V+ G Q P++ Sbjct: 24 LDPIIFPDSRVLLLGSMPSPKSREIGFYFGNPQNRFWRVVATLYGEKPLQSIPEKIDFCK 83 Query: 61 DYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA---YEQGF 116 + + ++ A++ S + I+D A+ LG A Y++ Sbjct: 84 RHHLALCDVLKACDIKGASDASIANPVP--HNIGALIQDSSICAIFTLGGTATKFYKRYT 141 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 Q+ +Q LP+ S +R+S++ LVE + + + Sbjct: 142 EQQTGILARQ-----------LPSTSPANARMSVDDLVEEFSIIKKF 177 >UniRef50_B2UPH3 Uracil-DNA glycosylase superfamily n=2 Tax=Chlamydiae/Verrucomicrobia group RepID=B2UPH3_AKKM8 Length = 171 Score = 50.2 bits (119), Expect = 2e-05, Method: Composition-based stats. Identities = 33/171 (19%), Positives = 61/171 (35%), Gaps = 13/171 (7%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLD 61 + P V+ G PG S +AHP N FWK++ GF L E LL+ Sbjct: 7 PPSVTPDCSVLVLGSLPGDESLRQAQYYAHPRNAFWKIMGSLLGFDPS-LPYGERLSLLN 65 Query: 62 Y-RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 G+ +V + + + + ++ + P + + +R Sbjct: 66 RGGVGLWDVVASGVRPGS-LDQHISQERPNDIAALLDRF-PGIGVVCCNGTASHKYLKRY 123 Query: 121 AQWGKQTLTIGSTQIWV--LPNPS-GLSRVSLEKLVEAYRELDQALVVRGR 168 + V +P+ S +R++ E+ AY E+ L+ + R Sbjct: 124 F-----PELFLRETLSVIQMPSTSPAAARLTYEQKFRAYEEIMAPLLRKER 169 >UniRef50_C6XAZ6 G/U mismatch-specific uracil-DNA glycosylase n=4 Tax=Bacteria RepID=C6XAZ6_METSD Length = 179 Score = 49.8 bits (118), Expect = 3e-05, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 57/165 (34%), Gaps = 4/165 (2%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ-AGFTDRQLKPQEAQHLLD 61 + RV+ G PG++S + HP N FW ++ + G + L Sbjct: 8 PAVAETDARVLILGSMPGIASLTQQQYYGHPHNLFWPIMGELVGAHPHLPYAERLAQLCR 67 Query: 62 YRCGVTKLVDRPTV-QANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQR 119 + + ++ + + + + E + Q + + G++A + Sbjct: 68 HHIALWDVMHQCFRPGSLDSAIDEGSIVANDFAGFFATHAQIRHVFFNGQKAAQAFQRHV 127 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + S + LP+ S + + + + ++ + QAL Sbjct: 128 IKAEKLNKDALASLRFSTLPSTSPANASIPQKTKRQQWQSILQAL 172 >UniRef50_B9LA93 Putative uncharacterized protein n=1 Tax=Nautilia profundicola AmH RepID=B9LA93_NAUPA Length = 172 Score = 49.8 bits (118), Expect = 4e-05, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 74/160 (46%), Gaps = 17/160 (10%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA--QHLLDYRCGVTK 68 +++ G P + S F ++HP N+FW+++ F D++ + E + L ++ + Sbjct: 21 KLLILGTFPSIKSFENEFYYSHPRNQFWEILSIV-FKDKKPETIEEKIKFLKKHKIALWD 79 Query: 69 LV---DRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRGAQW 123 V R + + + + L + ++ Y P +A+ + A E+ + Sbjct: 80 AVCECKRKKGNSRDDNLEILKPC--DIESLLKRY-PNIKKVAVTSRTA-EKVIKKYLTN- 134 Query: 124 GKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 +I ++ ++ LP+PS L +R+ +++ E +++L + Sbjct: 135 --SQFSILNSLLY-LPSPSPLNARMKIKEKAETWKKLLEL 171 >UniRef50_A8U688 Putative uncharacterized protein n=1 Tax=Carnobacterium sp. AT7 RepID=A8U688_9LACT Length = 161 Score = 49.0 bits (116), Expect = 5e-05, Method: Composition-based stats. Identities = 13/63 (20%), Positives = 27/63 (42%), Gaps = 1/63 (1%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR-CGV 66 +++ G P S +A+ N+FWK+I ++ T + ++ LL G+ Sbjct: 11 ENTKILILGSAPSEKSLELQQYYANKGNQFWKIISESLNTTDPIDYEKRLELLSQNQIGL 70 Query: 67 TKL 69 + Sbjct: 71 WDI 73 >UniRef50_B2V9S9 Phage SPO1 DNA polymerase-related protein n=18 Tax=Bacteria RepID=B2V9S9_SULSY Length = 217 Score = 49.0 bits (116), Expect = 6e-05, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 37/111 (33%), Gaps = 14/111 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 +++F G PG G PF K I AG ++ Sbjct: 69 NAKIMFIGEAPGEEEDKQGRPFVGRAGKLLTKCIENAGHIREEVY-----------IANI 117 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 P + + +E A L+ +IE P+ L +LG AY F + Sbjct: 118 NKCRPPNNRTP--TIEEQEACIPFLLRQIEIINPKVLCLLGATAYRGIFKK 166 >UniRef50_A6Q419 Putative uncharacterized protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q419_NITSB Length = 182 Score = 48.6 bits (115), Expect = 8e-05, Method: Composition-based stats. Identities = 34/159 (21%), Positives = 67/159 (42%), Gaps = 27/159 (16%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH-LLDYRCGVTKL 69 +V+ G P +S GF + HP N+FWK++ +E + L + + + Sbjct: 16 KVLILGSFPSFASLEIGFYYGHPQNQFWKIVADVFKQKVPDTLEEKKEFLKRHHIALWDM 75 Query: 70 VD---RPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQA---YEQGFSQRGA 121 V R + +S E++ IEK+ P + + G+++ +E+ FS Sbjct: 76 VKGCIRENSLDSSLSSIEVND-----IEKLLKTYPNIEVIFFTGRKSQQLFEKQFSHVNL 130 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 + LP+PS R +S E+ V+ ++++ Sbjct: 131 ------------PRYYLPSPSPAYRAMSFEEKVQQWKKI 157 >UniRef50_Q5P2M8 T/U mismatch-specific DNA glycosylase n=30 Tax=Bacteria RepID=Q5P2M8_AZOSE Length = 166 Score = 48.6 bits (115), Expect = 8e-05, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 53/161 (32%), Gaps = 8/161 (4%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-GFTDRQLKPQEAQHLLD 61 + APG + G PG +S +AH N FW ++ + G + Q L D Sbjct: 8 PPVAAPGAHTLILGSMPGAASLAALEYYAHRRNLFWPILGEVLGIDPALSYAERTQRLAD 67 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQR 119 + ++ + S E + P + G A EQ F + Sbjct: 68 AGYALWDVLGACRRHGSLDSNIENDSIEVNDFAIFFATHPSIDRVFCNGATA-EQCFRR- 125 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 ++ LP+ S + + + A+RE+ Sbjct: 126 --HVLPALEGRFEFEVRRLPSTSPANASIPHAAKLAAWREI 164 >UniRef50_A1KBL3 Putative uncharacterized protein n=1 Tax=Azoarcus sp. BH72 RepID=A1KBL3_AZOSB Length = 186 Score = 47.9 bits (113), Expect = 1e-04, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 61/164 (37%), Gaps = 15/164 (9%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWK---VIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 RV+ G PG++S +AHP N FW ++ AG + Q LLD + Sbjct: 25 RVLVLGSMPGVASLTAAQYYAHPRNAFWTIMGALFGAG--PALPYAERLQRLLDAGVALW 82 Query: 68 KL---VDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRGAQW 123 + RP + ++ + L I + L G A E F +R Sbjct: 83 DVIASCRRPGSLDSAIAPDSVTPN--DLPGLITACPALEHLFFNG-TAAETAF-RRHFAR 138 Query: 124 GKQTLTIGST-QIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 G + + T ++ LP+ S + L + + A+ + A Sbjct: 139 GDRLRVLRPTLKLQRLPSTSPAHAARPLAEKLSAWEAVRAAASA 182 >UniRef50_C4ZBZ4 G:T/U mismatch-specific DNA glycosylase n=20 Tax=Bacteria RepID=C4ZBZ4_EUBR3 Length = 164 Score = 47.5 bits (112), Expect = 1e-04, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 62/157 (39%), Gaps = 17/157 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LLDYRC 64 R++ G P ++S F + H NRFWK++ A D + + L Sbjct: 16 ENSRILILGTLPSVASRENNFYYGHKQNRFWKLL--AYLLDEPVPETIDEKKYMLFKNHI 73 Query: 65 GVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 + ++ +++ S + + I KI + KQ Y G ++ GA + Sbjct: 74 AIWDVIQSCDIKGSSDSSIKNVE---PTDIRKILETADI------KQVYANG-NKAGALY 123 Query: 124 GKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 K L + + LP+ S + S E+L EA+ + Sbjct: 124 KKYQLPLTGMEAVTLPSTSPANAAWSFERLCEAWTRI 160 >UniRef50_C5K9X8 Putative uncharacterized protein n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5K9X8_9ALVE Length = 267 Score = 47.5 bits (112), Expect = 2e-04, Method: Composition-based stats. Identities = 36/193 (18%), Positives = 59/193 (30%), Gaps = 40/193 (20%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---- 58 + RV+ G P S G + +P N FW V+ A D P Sbjct: 77 PPSIGDSPRVLILGSAPCEMSLEKGQFYGNPRNHFWAVMAYA--MDDPTFPMLDFEARCS 134 Query: 59 -LLDYRCGVTKLVDRPTV---QANEVSKQELHAGGRKLIEKIEDYQPQ---ALAILGKQA 111 L + + D T + +E L+E P L G + Sbjct: 135 VLHSRGIALWNICDVFTRYKSSDATLHCEEYTDIAGLLVE-----HPTIGYILC-NGSAS 188 Query: 112 YEQGFS-----QRGAQWGKQTLTI-------GSTQ------IWVLPNPSGLSRV--SLEK 151 Y+ + G Q + G+ Q I LP+ S + + + Sbjct: 189 YDCMLKYDLAARLEVSTGLQPTPLSLATQARGAAQAVDLPVILKLPSTSPANAMADPVGT 248 Query: 152 LVEAYRE-LDQAL 163 + ++E L +AL Sbjct: 249 KGKVWKEKLQEAL 261 >UniRef50_A2U3I4 Putative uncharacterized protein n=2 Tax=Polaribacter RepID=A2U3I4_9FLAO Length = 159 Score = 47.5 bits (112), Expect = 2e-04, Method: Composition-based stats. Identities = 29/166 (17%), Positives = 58/166 (34%), Gaps = 24/166 (14%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK--PQEAQHL 59 ++ +L + G PG +A+ N FWK+IY+ T+ E L Sbjct: 6 LDPVLPHNPTIFILGTKPGNLELRKQEYYANNGNSFWKIIYE--LTEEVFSKNYAERIDL 63 Query: 60 LDYR-CGVTKLV---DRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYE 113 L V ++ +R A V + + +I + P + G++A++ Sbjct: 64 LKRNHIAVWDIIESGERVKPGAQNVKNE-----IPNPVNEIVEMYPSVKQIIFNGQKAHD 118 Query: 114 QGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRE 158 + I V+ + S + R S + + ++E Sbjct: 119 LYL--KHFS------KIEGIIYEVVLSTSPANTRFSFREKLNNWKE 156 >UniRef50_B1C3U7 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C3U7_9FIRM Length = 160 Score = 47.5 bits (112), Expect = 2e-04, Method: Composition-based stats. Identities = 34/165 (20%), Positives = 68/165 (41%), Gaps = 31/165 (18%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVI-------YQAGFTDRQLKPQEAQHLLDYR 63 +++ G P + S F + HP NRFWK++ + T ++ Q LL+ + Sbjct: 15 KILILGSFPSIKSREAKFYYYHPQNRFWKILGNIFDNDNLSTITLKK------QFLLNNK 68 Query: 64 CGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA---YEQGFSQR 119 + ++ +++ S ++ LI I + + G A Y Q F + Sbjct: 69 IALWDVIASCNIKGSSDSSISDVK--VNDLIRIINGSAIKHIYTNGNLANKLYHQYFDDK 126 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 A + T+ LP+ S + SLE+L+E ++ + ++ Sbjct: 127 IA--------LPVTK---LPSSSPANASYSLERLIECWKVIKDSM 160 >UniRef50_D0J0R4 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=D0J0R4_COMTE Length = 178 Score = 47.1 bits (111), Expect = 2e-04, Method: Composition-based stats. Identities = 36/166 (21%), Positives = 66/166 (39%), Gaps = 26/166 (15%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH-----LLDYRCG 65 R + G PGL S +AHP N+FW ++ QA + P+ LL G Sbjct: 28 RALVLGSFPGLKSLQLAQYYAHPQNQFWPLL-QALWPQHLQPPRSDYAGRCEWLLARGLG 86 Query: 66 VTKL---VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQ 122 + + +R + + +++ L ++ D Q G ++Y+ Sbjct: 87 LWDVYASCEREGSLDSAIRAAQVNDFAG-LRQRCPDLQ--LALHNGGESYKHA------- 136 Query: 123 WGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVRG 167 +QT +G + LP+ S + S ++ ++A+R AL G Sbjct: 137 --RQTEALGLIAL-KLPSTSPANAGWSFDRKLQAWR---HALAAHG 176 >UniRef50_A9AVD1 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVD1_HERA2 Length = 168 Score = 46.7 bits (110), Expect = 3e-04, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 63/181 (34%), Gaps = 35/181 (19%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + I+ R + G PG S F + HP N+FW+ I QA F D Sbjct: 6 LPAIVHSTSRCLILGTLPGDQSLAAQFYYLHPRNQFWR-IMQAVFADEP----------- 53 Query: 62 YRCGVTKLVDRPTVQA-NEVSKQELHAGGRKL--------------IEKIEDYQPQA--L 104 G+T V R A ++++ ++ ++ + P + Sbjct: 54 ---GLTS-VTRAAFLARHQLALWDVLHAAERVGSLDTAITNPIPNDFQTFFTANPAIQKV 109 Query: 105 AILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALV 164 + G +A + R + ++ LP+ S + + K VEA+ L AL Sbjct: 110 GLNGMKAQQLFM--RLVWPNLTSELQSRLELVGLPSTSPANTQAFAKKVEAWMWLRNALG 167 Query: 165 V 165 Sbjct: 168 A 168 >UniRef50_B0EFS1 Putative uncharacterized protein n=4 Tax=Entamoeba RepID=B0EFS1_ENTDI Length = 176 Score = 46.3 bits (109), Expect = 4e-04, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 59/164 (35%), Gaps = 25/164 (15%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA---GFTDRQLKPQEAQHLLD 61 I+ ++ G PG S + HP N FW +I G + + Q + + Sbjct: 11 IIPSKPLILILGSMPGTESLKKQQYYGHPQNCFWSIISSIKSMGSVPPRYE-QRIELIKS 69 Query: 62 YRCGVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDYQP--QALAILGKQ---AYEQG 115 + + + + + + + +E+ I K+ P + + G+ Y + Sbjct: 70 CQIALWDVCCQCERKGSLDSDIKEVKPNK---INKLLLEHPTIKTVLFNGQGPSKLYHKF 126 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 F + + + + ++P S +R + +E +++ Sbjct: 127 FKE-----------LPNIKYVIMPGTSPANARTKKQTKIEIWKK 159 >UniRef50_B7KM64 Phage SPO1 DNA polymerase-related protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM64_CYAP7 Length = 209 Score = 46.3 bits (109), Expect = 4e-04, Method: Composition-based stats. Identities = 33/169 (19%), Positives = 62/169 (36%), Gaps = 29/169 (17%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGVTKL 69 ++ G PG + TG PF +AG D+ L + Y +T + Sbjct: 51 DIMLIGEAPGQTEDKTGKPFTG----------KAGKLLDKMLASINLERGQIY---LTNI 97 Query: 70 VD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG-------- 120 + RP +++E+ L+ +IE P+ + + G AY +R Sbjct: 98 IKCRPPNNRTP-TEKEIETCRPYLLAQIEFINPKFIVLAGGSAYRGLTGKRKPISQVRGK 156 Query: 121 -AQWGKQTLTIGSTQIWVLPNPS---GLSRV-SLEKLVEAYRELDQALV 164 +W ++L NPS G + + + L+ R+L++ Sbjct: 157 WFKWHNSKCISIFHPAYLLRNPSLEKGSPKWLTWQDLISLKRKLNKLTT 205 >UniRef50_A8MCZ3 Uracil-DNA glycosylase superfamily n=4 Tax=Thermoproteaceae RepID=A8MCZ3_CALMQ Length = 231 Score = 45.9 bits (108), Expect = 5e-04, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 60/168 (35%), Gaps = 32/168 (19%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANR-------------FWKVIYQAGFTDRQLKPQE 55 R++ G+ P AH NR +K ++ G ++ Sbjct: 54 NPRIMIVGLAPA----------AHGGNRTGRMFTGDASAQFLFKALHACGLSNNPYSLSR 103 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRK-LIEKIEDYQPQALAILGKQAYEQ 114 +T +V N+ + E+ A + I +I +P+A+ +LG+ A+ Sbjct: 104 IDGTKVNCVYITSVVK-CAPPDNKPTPGEVKACVGRWFINEINMVKPRAIVVLGEVAWRG 162 Query: 115 GFSQRGAQWGKQTL---TIGSTQIWVLPNPSGLS----RVSLEKLVEA 155 G + G + I T +++ +PS + R+ L LV Sbjct: 163 VAMALGLRGGFKHGGVVKINDTAVYMSYHPSPRNTNTGRLRLSDLVNI 210 >UniRef50_A8A982 Phage SPO1 DNA polymerase-related protein n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A982_IGNH4 Length = 191 Score = 45.2 bits (106), Expect = 0.001, Method: Composition-based stats. Identities = 37/161 (22%), Positives = 63/161 (39%), Gaps = 21/161 (13%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 G VVF G PG G PF A R + ++ + K +T Sbjct: 32 EGAEVVFVGEAPGKEEDEEGRPFVGRAGR----LLRSTISQIGFKKYY----------IT 77 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 +V RP +++E+ A LIE++ +P+ + LG+ A E +G G + Sbjct: 78 NVVKCRPPGNRTP-TEEEVEACKPYLIEELNCIRPKLVVALGRTAAEALLGLKGPLKGVR 136 Query: 127 TLTI----GSTQIWVLPNPSGLSRVSLEKLVEAY-RELDQA 162 + G ++ VL + + KL E + +L +A Sbjct: 137 GKVLVASAGGARVKVLVTYHPAAVLRNPKLKEQFVSDLKKA 177 >UniRef50_C4Z504 DNA glycosylase n=4 Tax=Clostridiales RepID=C4Z504_EUBE2 Length = 188 Score = 44.8 bits (105), Expect = 0.001, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 64/156 (41%), Gaps = 23/156 (14%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK---PQEAQHLLDYRCGVT 67 R++ G P + S F + HP NRFWK+I +G + + ++ +L + + Sbjct: 44 RILILGSLPSVKSREQMFFYGHPQNRFWKMI--SGVFEETVPQTIEEKKAFMLKHNIALW 101 Query: 68 KLVDRPTVQANEV---SKQELHAGGRK-LIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 T+ + ++ S + L I++ + + + GK + + + Sbjct: 102 D-----TIYSCDIIGSSDSSIKNVVPTDLKSIIDNSKIEKVICNGKTSGKYYEKYQMKYL 156 Query: 124 GKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 G + +LP+ S + SL KLVE +++ Sbjct: 157 GIKPD--------ILPSTSPANAAYSLGKLVEIWKK 184 >UniRef50_C6MEU9 Putative uncharacterized protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MEU9_9PROT Length = 194 Score = 44.4 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 58/166 (34%), Gaps = 31/166 (18%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-----------YQAGFTDRQLKPQEAQ 57 +++ G PG +S +AHP N FW +I Y+ + P Sbjct: 32 HAKILILGSMPGEASLAANQYYAHPRNAFWPIIAPLLQIQANSRYEEKIIALESSPIALW 91 Query: 58 HLLDYRCGVTKLVDRPTVQANEV-SKQELHAGGRKLIEKIEDYQPQ--ALAILGKQAYEQ 114 +L +R + + + ++ P + G++A E Sbjct: 92 DVLQ-------SCNRAGSLDSMIETGTQIVNNFPAFFAL----HPHLTHVFFNGEKA-EA 139 Query: 115 GFSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 F + + +GS + LP+ S +R+S ++ + + ++ Sbjct: 140 CFRRNVIP----DIKLGSIALLRLPSTSPANARLSSDEKCKTWHQM 181 >UniRef50_A7I1K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=Campylobacter hominis ATCC BAA-381 RepID=A7I1K0_CAMHC Length = 204 Score = 44.0 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 12/37 (32%), Positives = 19/37 (51%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI 41 I +++ G P ++S GF + H NRFWK + Sbjct: 9 IFDKNSKILILGSMPSIASIKFGFYYGHKNNRFWKTL 45 >UniRef50_B6R445 Bacteriophage-type DNA polymerase N-terminal domain protein n=2 Tax=Alphaproteobacteria RepID=B6R445_9RHOB Length = 331 Score = 44.0 bits (103), Expect = 0.002, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 53/163 (32%), Gaps = 28/163 (17%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 RV+F G PG G PF + DR LK L + +V Sbjct: 179 RVMFVGEAPGREEDIQGVPFVGRSGH---------LLDRMLKA---VGLDRSNVYIANVV 226 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR------GAQW 123 RP + E + +IE P+ + LG + + R +W Sbjct: 227 PWRPPGNRTP-TPHETEICRPFIQRQIELVDPEIVVFLGAASAKTLSGARDGIRKLRGKW 285 Query: 124 ------GKQTLTIGSTQ-IWVLPNPSGLSRVSLEKLVEAYREL 159 G Q I + ++L +P R + +++ EL Sbjct: 286 LTYNGEGTQRKAIATYHPAYLLRSP-AEKRYAWRDMLQIRAEL 327 >UniRef50_A3ZBY1 T/U mismatch-specific DNA glycosylase n=15 Tax=Campylobacter RepID=A3ZBY1_CAMJE Length = 160 Score = 43.6 bits (102), Expect = 0.002, Method: Composition-based stats. Identities = 11/31 (35%), Positives = 18/31 (58%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVI 41 +++ G P + S GF + HP NRFW ++ Sbjct: 18 KILILGSFPSIKSRQDGFYYQHPRNRFWPIL 48 >UniRef50_B0VFE1 Uracil-DNA glycosylase n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VFE1_9BACT Length = 217 Score = 43.6 bits (102), Expect = 0.003, Method: Composition-based stats. Identities = 42/169 (24%), Positives = 60/169 (35%), Gaps = 26/169 (15%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 P + G PG TG PF AG ++ L Q Y + Sbjct: 61 PNAIAMLIGEGPGEQEDITGRPFVG----------AAGQLLEKMLSAINLQRSDVY---I 107 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 T +V RP N S +E A L+E+I+ QP+ L ILG A + + G Sbjct: 108 TNIVKCRPPGNRNPES-EERKACLPYLLEQIQIIQPKILLILGLVAAQTLLNNNNT-LGW 165 Query: 126 QTLTIG---STQIWVLPNPSGLSR------VSLEKLVEAYRELDQALVV 165 I +V +PS L R + L E +E ++ L Sbjct: 166 HRGKIHYFMDIPAFVTYHPSALLRNQEWKKPAWIDLQEFQKEYEKLLAA 214 >UniRef50_C5EZE8 Putative uncharacterized protein n=2 Tax=Helicobacter RepID=C5EZE8_9HELI Length = 167 Score = 43.3 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 33/158 (20%), Positives = 67/158 (42%), Gaps = 19/158 (12%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL--KPQE--AQHLLDYRCGV 66 +++ G P ++S F +AH NRFWK++ G D L +P+E L + + Sbjct: 14 KILILGSFPSVASRQESFYYAHKQNRFWKILE--GLFDSPLHNQPKECKIAFLKSHHIAL 71 Query: 67 TKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA---YEQGFSQRGAQ 122 + +++ + + L + I++ Q Q + G++A Y Q F + + Sbjct: 72 FDITQECYIQNSSDSTLKILKPNP--IHSLIQNTQIQVIFTNGQKASKLYCQFFCHQDS- 128 Query: 123 WGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 I I LP+ S ++ +L L+ + ++ Sbjct: 129 ----PHFIPLPHI-PLPSSSPANAKCTLPLLLREWSKI 161 >UniRef50_B9CP28 T/U mismatch-specific DNA glycosylase n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CP28_9ACTN Length = 174 Score = 43.3 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 3/57 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGF-TDRQLKPQEAQ 57 + I+ RV+ G P S GF F +P NRFW+V AG ++ L+ E + Sbjct: 16 LPPIVDRDSRVLLLGSMPSPKSREIGFYFGNPQNRFWRV--MAGLWGEKPLESSEEK 70 >UniRef50_Q6L0J8 DNA polymerase, bacteriophage-type n=1 Tax=Picrophilus torridus RepID=Q6L0J8_PICTO Length = 326 Score = 43.3 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 43/116 (37%), Gaps = 14/116 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 V+F G +PG + T PF A F ++ AG ++ +T Sbjct: 175 NSEVIFIGQSPGFNEDETRRPFTGRAGLFLNSLLKMAGLERNRVY-------------LT 221 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 ++ T + +K E+ L+ +I+ +P+ + ++G A + Sbjct: 222 NVLKFMTYKNRPPNKHEIENAMPYLLREIDIIRPKLIVLMGSTAVNAFLPKHSVTL 277 >UniRef50_A0RQD6 Mug G:T/U mismatch-specific DNA glycosylase n=6 Tax=Bacteria RepID=A0RQD6_CAMFF Length = 157 Score = 43.3 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 34/157 (21%), Positives = 65/157 (41%), Gaps = 19/157 (12%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ--HLLDYRCGVTK 68 +++ G P + S GF +AH NRFWKV+ A F L+ + + LL ++ + Sbjct: 15 KILILGSFPSVKSREMGFYYAHSQNRFWKVL-IALFGSEHLRNVQDKIDFLLKHQVAIHD 73 Query: 69 LVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 + +++ S + LI + D + G +Y Sbjct: 74 AAYSCSVKGSSDNSISSVKPANLDLIFENADIKAVFA--NGNTSYAIC------------ 119 Query: 128 LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + ++ LP+ S ++ SLEKLV+ ++ + + L Sbjct: 120 KKFYNFEVIKLPSTSPANAKFSLEKLVDEWKIILKYL 156 >UniRef50_C0GI50 Phage SPO1 DNA polymerase-related protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GI50_9FIRM Length = 222 Score = 43.3 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 24/103 (23%), Positives = 37/103 (35%), Gaps = 16/103 (15%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 P ++F G PG G PF +AG ++ L E Y + Sbjct: 71 PDADLLFVGEAPGAEEDAQGRPFVG----------EAGKLLNKILAAAEISREEVY---I 117 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILG 108 T +V RP + E A L ++I +P+ + LG Sbjct: 118 TNVVKCRPPKNRTP-NPDEAAACSEHLQKQIALLKPRIVVCLG 159 >UniRef50_A7IAF7 Putative uncharacterized protein n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7IAF7_METB6 Length = 162 Score = 43.3 bits (101), Expect = 0.003, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIY-QAGFTDRQLKPQEAQHLL 60 + I+ ++ G P L S G + +P N+FW++I G P L Sbjct: 8 LPPIVGRDPEILILGSFPSLISLGKSEYYGNPKNQFWRIIEILFGIDHALSYPLRTHLLT 67 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQ 118 ++R + + + + + I P + +A+ G A + F Sbjct: 68 EHRIALWDALAFCRREGSM--DSAICDPVANDITGFVAAHPTIRCIALNGSTA-GRYFKS 124 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 G +LP+ S + +S + + + Sbjct: 125 MN------PDLNGH----ILPSTSPAYASMSFAEKTLEWARI 156 >UniRef50_C4G9N7 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9N7_9FIRM Length = 255 Score = 42.9 bits (100), Expect = 0.004, Method: Composition-based stats. Identities = 13/33 (39%), Positives = 19/33 (57%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ 43 R++ G P + S F + HP NRFW+V+Y Sbjct: 20 RILILGSVPSVKSRELAFYYGHPQNRFWRVLYL 52 >UniRef50_A8VW39 Putative uncharacterized protein n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VW39_9BACI Length = 59 Score = 42.9 bits (100), Expect = 0.005, Method: Composition-based stats. Identities = 11/37 (29%), Positives = 18/37 (48%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI 41 ++ RV+ G PG S +A+P N FW ++ Sbjct: 9 VVDREPRVLILGSMPGAKSLDAQMYYANPRNHFWPIL 45 >UniRef50_A0B978 Phage SPO1 DNA polymerase-related protein n=1 Tax=Methanosaeta thermophila PT RepID=A0B978_METTP Length = 184 Score = 42.5 bits (99), Expect = 0.005, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 42/120 (35%), Gaps = 22/120 (18%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPA----NRFWKVIYQAGFTDRQLKPQEAQHLLDY 62 R++ G PG + G PF A NR + AG + Sbjct: 34 PEDARIMMIGEAPGAAEDLAGRPFVGRAGAILNR---ALNLAGLDRDE------------ 78 Query: 63 RCGVTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 +T +V RP + S E+ A L ++IE +P A+ +LG +G Sbjct: 79 -IFITNIVKCRP-PGNRQPSADEIRACMPYLRKQIEMVRPSAICLLGNVPARAILGVQGV 136 >UniRef50_UPI0001BC5AE8 DNA glycosylase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5AE8 Length = 310 Score = 42.5 bits (99), Expect = 0.005, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 57/149 (38%), Gaps = 20/149 (13%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---L 59 ++ G P + S F + HP NRFWK++ A + + L Sbjct: 172 PAFYEKNSIILILGSFPSVKSREENFFYGHPQNRFWKML--AKIFEEAFPETREKKKELL 229 Query: 60 LDYRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGF 116 +R V ++ +++ S Q++ + +I +Y P + + G +Y+ Sbjct: 230 RKHRLAVWDVIHSCKIKGSSDSSIQDVIPND---LTEILNYSPIQKIIC-NGATSYKYYK 285 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLS 145 + + GK+ ++ LP+ S + Sbjct: 286 KYQEKELGKEAIS--------LPSTSPAN 306 >UniRef50_D1PL52 Uracil-DNA glycosylase n=5 Tax=Bacteria RepID=D1PL52_9FIRM Length = 188 Score = 42.5 bits (99), Expect = 0.006, Method: Composition-based stats. Identities = 22/108 (20%), Positives = 41/108 (37%), Gaps = 13/108 (12%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P V+F G PG + G PF + D L+ + + + Sbjct: 32 PNAEVLFVGEGPGANEDAQGQPFVGRS---------GQLLDHYLEAVDLNR--EKNVYIA 80 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ 114 +V RP + + +E A L ++ + +P+ + LG+ A +Q Sbjct: 81 NIVKCRPPQNRDPL-PEESAACLPWLRQQFQLLRPKIIVCLGRIAAQQ 127 >UniRef50_A4B987 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4B987_9GAMM Length = 171 Score = 42.1 bits (98), Expect = 0.007, Method: Composition-based stats. Identities = 35/159 (22%), Positives = 63/159 (39%), Gaps = 12/159 (7%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ--HLLDYRCGVTK 68 RV+ G PG++S +AHP N FW I F + P + + LL R + Sbjct: 16 RVLVLGSMPGVASLTAVQYYAHPRNAFWP-IMADFFGFDRDAPYDDRLAALLAKRVALWD 74 Query: 69 LVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQRGAQWGKQ 126 ++ + S E + I+ QP Q + + G +A + +R + Sbjct: 75 VLQSCVRPGSLDSAIERDSMVPNPIDSWMATQPSLQTVLLNGGRAATEF--RRYFKAILS 132 Query: 127 TLTIGSTQIWVLPNPSGL-SRVSLEKLVEAY-RELDQAL 163 I ++ LP+ S + + E+ + + L +AL Sbjct: 133 NPNI---EVVTLPSTSPAYAAMRFEEKRDRWHHALHRAL 168 >UniRef50_Q04GE7 G:T/U mismatch-specific DNA glycosylase n=2 Tax=Oenococcus oeni RepID=Q04GE7_OENOB Length = 194 Score = 42.1 bits (98), Expect = 0.007, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 56/153 (36%), Gaps = 13/153 (8%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ--HLL 60 I +V+ G P S +GF + P N FW + + D K +EA+ L Sbjct: 26 PPIYDENSKVLLLGSLPSPKSLASGFYYGSPQNIFWSTLAKTLGVDEPEKSKEARLSFAL 85 Query: 61 DYRCGV-TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 +R + L + + A + + + + +++ Q + + GK+A E Sbjct: 86 KHRVAIWITLKEADIIGAQDSTIKNPVPN--DFSDILKNSQIRTIFTEGKKATELFNK-- 141 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKL 152 IG ++ LP+ S +R Sbjct: 142 -----LAAPKIGQESVY-LPSTSPANRAHRPNR 168 >UniRef50_D1CIM8 Uracil-DNA glycosylase superfamily n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CIM8_THET1 Length = 198 Score = 42.1 bits (98), Expect = 0.008, Method: Composition-based stats. Identities = 47/176 (26%), Positives = 70/176 (39%), Gaps = 31/176 (17%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPAN---RFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 G RV+ G PG+ + G PF+ P R W I +AG P E Q D R Sbjct: 35 GARVMVIGQAPGVRTMIAGRPFSGPGGKLLRGW--IARAGI------PLEEQ---DRRVY 83 Query: 66 VTKLVDRP-------TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 T L R + S +E+ L +IE P + ++G A E+ + Sbjct: 84 FTSL-TRCYPGKAPKGKGDRKPSAREVELCSPFLQREIELIDPVLVLLVGSMAIERFLGR 142 Query: 119 RGAQWGKQTLTIGSTQIWV-LPNPSGLSRV--------SLEKLVEAYRELDQALVV 165 L + W+ LP+PSG+SR +E+ +E REL +L + Sbjct: 143 MPLAEAVGQLFSRDGRWWLPLPHPSGVSRWLNHPEHRALVEEALERMRELLSSLGL 198 >UniRef50_A8RDC8 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RDC8_9FIRM Length = 166 Score = 41.7 bits (97), Expect = 0.008, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 59/159 (37%), Gaps = 19/159 (11%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--LLDYRCGVTKL 69 V+ G P ++S + N FWK + A E + + + + Sbjct: 15 VLILGSMPSITSLEKQEYYGFAHNHFWK--FMANIFAMPTDTYEQKKAIIRKRHLALWDV 72 Query: 70 VDRPTVQANEVS--KQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 + + + S ++E I+K + + GK++YE Q+ Q Sbjct: 73 IHSCEREGSLDSAIRKERVNDIAGFIQKHTTIR-LVIC-NGKKSYELY--QKHFQ----- 123 Query: 128 LTIGSTQIWVLPNPSGLSRVSLEKLV--EAYRELDQALV 164 +G + ++ L + S +R E ++ + + L +AL Sbjct: 124 -NLGVSCVY-LGSTSNANRSIKENILYEQWKQTLWEALG 160 >UniRef50_A9H0D0 Uracil-DNA glycosylase n=3 Tax=Bacteria RepID=A9H0D0_GLUDA Length = 433 Score = 41.7 bits (97), Expect = 0.009, Method: Composition-based stats. Identities = 38/157 (24%), Positives = 55/157 (35%), Gaps = 28/157 (17%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 PG ++ G PG G PF P + +++AG T + V Sbjct: 265 PGAALMLVGEQPGDQEDRVGRPFVGPAGQLLDRTLHEAGITRDAVY-------------V 311 Query: 67 TKLVD------RPTVQANEVSKQE-LHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 T V R T + +E E + A L + P+ L +LG A R Sbjct: 312 TNAVKHFRFQRRGTRRLHEKPTVENVTACAPWLAAERRIVAPRVLVMLGATAAGAVLG-R 370 Query: 120 GAQWGKQ-----TLTIGSTQIWVLPNPSGLSRVSLEK 151 G++ L GST + V +PS L R E Sbjct: 371 SVTIGRERSRPIPLADGSTGL-VTVHPSFLLRQPDED 406 >UniRef50_Q3SAA1 Uracyl DNA glycosylase n=4 Tax=Euryarchaeota RepID=Q3SAA1_9EURY Length = 196 Score = 41.7 bits (97), Expect = 0.009, Method: Composition-based stats. Identities = 23/115 (20%), Positives = 45/115 (39%), Gaps = 18/115 (15%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVTKL 69 +V+F G PG G PF A ++ +++ G + ++ +T + Sbjct: 44 KVMFVGEAPGKWEDIKGRPFVGAAGKYLDELLSSIGLS------RDDVF-------ITNI 90 Query: 70 VD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 V RP + + +E+ A L +IE P+ + LG+ + F + Sbjct: 91 VKCRP-PNNRDPTSEEISACAPYLEWQIETMAPRIICPLGR--FSAKFILEKFGF 142 >UniRef50_B3DV47 Uracil-DNA glycosylase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DV47_METI4 Length = 276 Score = 41.3 bits (96), Expect = 0.011, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 65/174 (37%), Gaps = 21/174 (12%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHL-LDYRCG 65 P ++F G PG G PF P K+I G + Q+ D Sbjct: 106 PYAELMFVGEAPGAEEDLQGEPFVGPAGQLLTKMIRAMGLSREQVYIANVLKCRPD---- 161 Query: 66 VTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW-- 123 + + + + +E+ L+ +++ P+ + LG AYE + G + Sbjct: 162 ----IPKGQKGNRKPTTEEMSTCLPYLLSQLKLINPKVIVALGATAYEGLTGKVGIKISE 217 Query: 124 --GKQTLTIGSTQI------WVLPNPSGLS-RVSLEKLVEAYRELDQALVVRGR 168 G+ G+T I ++L NPS + R E L+ A +L + + R Sbjct: 218 VRGRCFSFNGATLIPTYHPSFLLHNPSLTNKRKVWEDLLTAMGKLGMPISEKQR 271 >UniRef50_A5FBU1 G:T/U mismatch-specific DNA glycosylase-like protein n=3 Tax=Bacteria RepID=A5FBU1_FLAJ1 Length = 160 Score = 40.9 bits (95), Expect = 0.016, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 59/162 (36%), Gaps = 22/162 (13%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--LLDYRC 64 +P +V+ G PG S + H N FWK ++ F + E + LL + Sbjct: 11 SPDSKVLILGTMPGTKSLELNQYYGHSQNNFWKFMFTI-FKEEFSTDYETKKALLLKNKI 69 Query: 65 GVTKL---VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQR 119 + + DR + + K E+ +++ P +A G++A F ++ Sbjct: 70 ALWDVLQYCDRVGSLDSAI-KNEIANDFETFLKQ----HPNIHTIAFNGQKA--AAFFKK 122 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELD 160 K I LP+ S + + ++ ++ + Sbjct: 123 YVHLKKDYTLIT------LPSTSPANASKAFQSKLDEWKVIA 158 >UniRef50_A1RYC0 Phage SPO1 DNA polymerase-related protein n=23 Tax=Archaea RepID=A1RYC0_THEPD Length = 217 Score = 40.9 bits (95), Expect = 0.016, Method: Composition-based stats. Identities = 24/100 (24%), Positives = 41/100 (41%), Gaps = 16/100 (16%) Query: 12 VVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 VVF G PG + G PF ++I AG + ++ +T +V Sbjct: 53 VVFIGEAPGRNEDLQGRPFVGAAGALLTELISLAGLSREEVF-------------ITNVV 99 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGK 109 RP E ++E+ A L+ ++ +P+ + LGK Sbjct: 100 KCRP-PNNREPREEEIAACLPYLLAQLRAIKPRLIVTLGK 138 >UniRef50_B2IF85 Phage SPO1 DNA polymerase-related protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IF85_BEII9 Length = 290 Score = 40.9 bits (95), Expect = 0.018, Method: Composition-based stats. Identities = 40/166 (24%), Positives = 59/166 (35%), Gaps = 24/166 (14%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 P V+ G PG G PF +AG DR L + Y + Sbjct: 129 PNSSVMIIGEAPGADEDRQGRPFVG----------RAGQLLDRMLAAVDLDRTQVY---I 175 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA---Q 122 +V RP + E+ A + +IE P+ + LG + + + A Sbjct: 176 ANIVPWRPPGNRTP-TPLEIAACLPFIRRQIELVSPRFIVCLGAPSAQTLLGTKEAITRL 234 Query: 123 WGKQTL-TIGSTQIWVLPN--PSGLSRVSLEKLVEA--YRELDQAL 163 G+ + G I VLP P+ L R EK +R L +AL Sbjct: 235 RGRWRDWSCGGKTIHVLPMLHPAYLLRQPAEKKRAWADWRLLAKAL 280 >UniRef50_A0LFG4 Phage SPO1 DNA polymerase-related protein n=2 Tax=Deltaproteobacteria RepID=A0LFG4_SYNFM Length = 264 Score = 40.6 bits (94), Expect = 0.018, Method: Composition-based stats. Identities = 28/107 (26%), Positives = 40/107 (37%), Gaps = 14/107 (13%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 +VF G PGL G PF A R D+ +K Y + +V Sbjct: 94 LVFVGEGPGLEEDRQGRPFVGRAGR---------LLDQMIKALGFHREAVY---ICNVVK 141 Query: 72 -RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS 117 RP S +E+ G LI +IE +P+ + LG A + Sbjct: 142 CRPPKNRTPES-EEIARCGPFLIRQIEAIRPRVICALGACAAQTLVG 187 >UniRef50_A0RV64 Uracil-DNA glycosylase n=2 Tax=Thaumarchaeota RepID=A0RV64_CENSY Length = 216 Score = 40.6 bits (94), Expect = 0.022, Method: Composition-based stats. Identities = 34/149 (22%), Positives = 50/149 (33%), Gaps = 15/149 (10%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 R + G PG G PF A R D++L+ Y +T +V Sbjct: 70 RAILVGEAPGREEDLQGKPFVGGAGR---------VLDKELERAGIPRSTVY---ITNVV 117 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLT 129 RP S +E A L E+I P+ + ILG A G G + Sbjct: 118 KCRPPKNRVP-SDEERDACIEYLKEEISVINPRIVCILGATALRSLLGLGGIT-GIRGTI 175 Query: 130 IGSTQIWVLPNPSGLSRVSLEKLVEAYRE 158 I L + + + L+ A+R Sbjct: 176 INQDNTDYLVSLHPAATMYRRDLLPAFRR 204 >UniRef50_A8UU74 N-terminus of phage SPO1 DNA polymerase n=3 Tax=Aquificales RepID=A8UU74_9AQUI Length = 220 Score = 40.2 bits (93), Expect = 0.027, Method: Composition-based stats. Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 13/109 (11%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 VVF G PG G PF A ++ + + K +E L +T + Sbjct: 71 VVFVGEAPGEDEDRQGRPFVGRAGKY---LNK--------KIEEVLGLRREEVYITNVCK 119 Query: 72 -RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 RP + + E+ A L ++I+ +P+ + LG A E ++ Sbjct: 120 CRP-PGNRKPTPAEISACFPYLKKEIDIIKPKVICCLGATAGEGILGKK 167 >UniRef50_Q2S6J8 Uracil-DNA glycosylase, family 4 n=2 Tax=Rhodothermaceae RepID=Q2S6J8_SALRD Length = 287 Score = 40.2 bits (93), Expect = 0.028, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 41/106 (38%), Gaps = 16/106 (15%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 P +V G PG + G PF +AG D+ L + Y + Sbjct: 133 PEADLVIIGEAPGRNEDEEGEPFVG----------RAGQLLDKILDAINFEREDVY---I 179 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQA 111 T ++ RP N +E+ A L ++I +P+ + +GK A Sbjct: 180 TNILKSRPPRNRNP-KSEEIDAHLPVLYKQIALLRPKIILTVGKVA 224 >UniRef50_Q2LTM2 Uracil DNA glycosylase superfamily protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LTM2_SYNAS Length = 263 Score = 40.2 bits (93), Expect = 0.028, Method: Composition-based stats. Identities = 26/115 (22%), Positives = 41/115 (35%), Gaps = 12/115 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P +VF G PG G PF A R + + Y C + Sbjct: 111 PHSELVFVGEAPGADEDLQGRPFVGRAGH---------LLTRIIDAMGLKREDVYICNI- 160 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQ 122 L RP N + +E+ LI ++E +P+ + LG A + ++G Sbjct: 161 -LKCRPPGNRNPL-PEEIEVCEPWLIRQLEAIRPRIICALGTFAAQTLLKKKGIP 213 >UniRef50_C1DCC6 Probable DNA-directed DNA polymerase, bacteriophage-type n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCC6_LARHH Length = 259 Score = 39.8 bits (92), Expect = 0.031, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 59/154 (38%), Gaps = 16/154 (10%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 ++ G PG G PF PA + + A T P++ ++ +V Sbjct: 104 LMLIGEGPGEQEDLQGRPFVGPAGK----LLDAMLTATGWSPEQDMYVC-------NVVK 152 Query: 72 -RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR--GAQWGKQTL 128 RP N E+ A L +I +P+ + LG+ A + + A+ QT Sbjct: 153 CRPPGNRNP-QPDEMAACQAYLQTQIRLVRPRLIVALGRVAAQSLLATELSVARLRGQTH 211 Query: 129 TIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 + V +P+ L R +L A+++L + Sbjct: 212 QYAGYPLVVTYHPAYLLR-NLPDKARAWQDLCRD 244 >UniRef50_A1WUV9 Phage SPO1 DNA polymerase-related protein n=3 Tax=Ectothiorhodospiraceae RepID=A1WUV9_HALHL Length = 294 Score = 39.8 bits (92), Expect = 0.039, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 58/161 (36%), Gaps = 27/161 (16%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 +V G PG G PF +AG + + + +V Sbjct: 144 DLVIIGEGPGAEEDRQGEPFVG----------RAGQLLDAMLAAIDRGRDRGGVYIGNIV 193 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA-------- 121 RP + E+ A L ++E +P+A+ LG+ A +Q + Sbjct: 194 KCRP-PGNRDPRPDEVAACTPYLRRQLELLEPKAIVALGRVAAQQLLETKSPLAKLRGRV 252 Query: 122 -QWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQ 161 +G Q++ +WV +P+ L R + + +++L + Sbjct: 253 HHYGAQSIP-----VWVTYHPAYLLRSPG-EKAKVWQDLKR 287 >UniRef50_Q0W079 Predicted uracil-DNA glycosylase superfamily protein n=2 Tax=Euryarchaeota RepID=Q0W079_UNCMA Length = 185 Score = 39.0 bits (90), Expect = 0.059, Method: Composition-based stats. Identities = 24/112 (21%), Positives = 42/112 (37%), Gaps = 14/112 (12%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 R++ G PG + G PF A KV+ ++ + + +V Sbjct: 39 RIMLIGEAPGRNEDKQGKPFVGAAG---KVLNLV-LESAGIRRDDVY--------IANVV 86 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 RP K E+ A L +++E P+ + +LGK A E ++ Sbjct: 87 KCRPPENRVP-KKDEIEACSVYLKKQLEVISPKVVVLLGKTAAESYLGRKVV 137 >UniRef50_Q0EZR7 Phage SPO1 DNA polymerase domain protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EZR7_9PROT Length = 252 Score = 39.0 bits (90), Expect = 0.062, Method: Composition-based stats. Identities = 36/169 (21%), Positives = 59/169 (34%), Gaps = 29/169 (17%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 P +VF G PG G PF +AG DR + Y + Sbjct: 99 PQADLVFIGEAPGRDEDLQGEPFVG----------RAGQLLDRMMAAIGLDRDAVY---I 145 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 + RP + E+ A ++++ QP+ + +LG+ A + A G Sbjct: 146 MNTIKCRP-PNNRDPRADEVKACNLWFEQQLDALQPKMICLLGRVAAQTVLGT-DATLGA 203 Query: 126 QT---LTIGSTQIWVLPNPSGLSRVSLEK---------LVEAYRELDQA 162 +WV +P+ L R +K L + YR+L Q+ Sbjct: 204 LRGAWHEYHGIPVWVTYHPAYLLRSPQQKQKGWQDLCLLADRYRDLTQS 252 >UniRef50_D2LEU1 Phage SPO1 DNA polymerase-related protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LEU1_RHOVA Length = 289 Score = 39.0 bits (90), Expect = 0.064, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 54/160 (33%), Gaps = 22/160 (13%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 +P ++ G PG G PF +++ G T+ Sbjct: 131 SPSAPLMLIGEAPGKDEDRLGQPFVGRAGQLLDRMLAAIGLTEADFY------------- 177 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW- 123 +T +V RP S +E+ + +IE P+ + LG A Q Sbjct: 178 ITNIVFWRPPGNRTP-SPEEVLVCQPFVERQIELLAPRIVVFLGNAAARQLTGATEGIMK 236 Query: 124 --GKQTLTIGSTQIWVLPN--PSGLSRVSLEKLVEAYREL 159 GK G + + P+ L R + K + A+R+L Sbjct: 237 LRGKWLELPGHPGVRAMATLHPAYLLRTPIAKRL-AWRDL 275 >UniRef50_A1B575 Putative uncharacterized protein n=1 Tax=Paracoccus denitrificans PD1222 RepID=A1B575_PARDP Length = 171 Score = 39.0 bits (90), Expect = 0.068, Method: Composition-based stats. Identities = 6/51 (11%), Positives = 21/51 (41%) Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS 117 T ++ + ++++++ + G + QP+ + + G+ A Sbjct: 79 TNVICFSSPMSSDLTRIQKQKGSAIFEWLLARLQPKVIIVHGEGAARALSK 129 >UniRef50_D1Y5H9 Uracil-DNA glycosylase n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y5H9_9BACT Length = 201 Score = 38.6 bits (89), Expect = 0.074, Method: Composition-based stats. Identities = 22/103 (21%), Positives = 35/103 (33%), Gaps = 14/103 (13%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 ++ G PG TG PF + R L E L + Sbjct: 40 PENASLMIVGEAPGAEEERTGRPFVGRSGR---------LLTEILAAAE---LKREEIFI 87 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILG 108 T + RP N + K E+ A L ++ E +P+ + +G Sbjct: 88 TSICKCRPPKNRNPL-KTEIAACLPWLTDQFEAIRPRLVLAVG 129 >UniRef50_B6IW73 Uracil-DNA glycosylase, family 4 protein, putative n=6 Tax=Alphaproteobacteria RepID=B6IW73_RHOCS Length = 298 Score = 38.2 bits (88), Expect = 0.089, Method: Composition-based stats. Identities = 25/103 (24%), Positives = 37/103 (35%), Gaps = 10/103 (9%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P RV+ G PG G PF PA +++ G R D + Sbjct: 132 PAARVMLVGEAPGEQEDRQGRPFVGPAGLLLDRMLAAIGLDRR-------ADTPDRAVYI 184 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILG 108 T +V+ RP + + E+ A + I+ P L LG Sbjct: 185 TNVVNWRPPGNRSP-TDGEIAACLPFVQRHIDLVNPAVLVFLG 226 >UniRef50_C1FF23 Predicted protein n=2 Tax=cellular organisms RepID=C1FF23_9CHLO Length = 248 Score = 38.2 bits (88), Expect = 0.096, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 55/172 (31%), Gaps = 25/172 (14%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTK 68 RV+ G PG G PF A + I+ + VT Sbjct: 85 NARVMIIGEAPGAEEDKLGVPFVGRAGKLLDEIFASVTLSTDRHCY-----------VTN 133 Query: 69 LVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTL 128 +V R + + + +E+ L E+I P + G+ + + K Sbjct: 134 VVKRRPLNNRDPTAREIEWYRGYLREEIRLLDPAIIVCTGRHSMRTLLGKNCEGITKIRG 193 Query: 129 TIGSTQ-IWVLPN--PSGLSR-----------VSLEKLVEAYRELDQALVVR 166 V+P PS L R ++ + + E R++D+ ++ Sbjct: 194 RWFDVDGRVVMPMFHPSYLLRNPQKTPGSPKALTWDDIREVKRKVDELSLIE 245 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B7MB04 G/U mismatch-specific DNA glycosylase n=70 Tax=E... 208 5e-53 UniRef50_UPI0001792C18 PREDICTED: similar to GA15170-PA n=3 Tax=... 207 1e-52 UniRef50_UPI0000DB6DF5 PREDICTED: similar to Thd1 CG1981-PA n=2 ... 206 2e-52 UniRef50_UPI0000E46F0A PREDICTED: similar to G/T mismatch-specif... 205 6e-52 UniRef50_B4L7B8 GI14092 n=1 Tax=Drosophila mojavensis RepID=B4L7... 205 6e-52 UniRef50_B4NHL6 GK13687 n=2 Tax=Coelomata RepID=B4NHL6_DROWI 203 3e-51 UniRef50_B4JZW0 GH23925 n=1 Tax=Drosophila grimshawi RepID=B4JZW... 202 3e-51 UniRef50_Q9XYZ3 G/T mismatch-specific thymine DNA glycosylase n=... 202 3e-51 UniRef50_Q9V4D8 Thd1 n=1 Tax=Drosophila melanogaster RepID=Q9V4D... 202 3e-51 UniRef50_B4MF03 GJ13949 n=2 Tax=Drosophila virilis RepID=B4MF03_... 202 5e-51 UniRef50_UPI000186DD28 G/t mismatch-specific thymine DNA glycosy... 201 6e-51 UniRef50_B3MY42 GF22816 n=1 Tax=Drosophila ananassae RepID=B3MY4... 201 7e-51 UniRef50_B5REH0 G/U mismatch-specific DNA glycosylase n=51 Tax=E... 200 1e-50 UniRef50_A6CPG7 G/U mismatch-specific DNA glycosylase n=2 Tax=Ba... 198 5e-50 UniRef50_C4QAV8 G/t mismatch-specific thymine DNA glycosylase, p... 198 7e-50 UniRef50_UPI0000D578C1 PREDICTED: similar to Thd1 n=1 Tax=Tribol... 197 1e-49 UniRef50_A7S194 Predicted protein n=1 Tax=Nematostella vectensis... 195 3e-49 UniRef50_UPI0001926FCB PREDICTED: similar to predicted protein n... 194 1e-48 UniRef50_Q4T9X5 Chromosome undetermined SCAF7485, whole genome s... 191 7e-48 UniRef50_UPI000178957E Uracil-DNA glycosylase superfamily n=1 Ta... 190 2e-47 UniRef50_C7PGY3 Uracil-DNA glycosylase superfamily n=1 Tax=Chiti... 189 4e-47 UniRef50_UPI0000DB71A4 PREDICTED: similar to Thd1 CG1981-PA, par... 188 4e-47 UniRef50_Q67QW8 G/U mismatch-specific DNA glycosylase n=1 Tax=Sy... 187 1e-46 UniRef50_UPI0001926888 PREDICTED: similar to GL18173 n=1 Tax=Hyd... 187 1e-46 UniRef50_Q39CW6 G/U mismatch-specific uracil-DNA glycosylase n=1... 183 1e-45 UniRef50_Q13569 G/T mismatch-specific thymine DNA glycosylase n=... 183 3e-45 UniRef50_Q46SQ2 G/U mismatch-specific uracil-DNA glycosylase n=4... 183 3e-45 UniRef50_Q1J3D2 Uracil-DNA glycosylase superfamily n=1 Tax=Deino... 182 4e-45 UniRef50_A8LC62 Uracil-DNA glycosylase superfamily n=16 Tax=Bact... 182 4e-45 UniRef50_C6WCQ8 Uracil-DNA glycosylase superfamily n=1 Tax=Actin... 180 1e-44 UniRef50_Q1D9T2 G/U mismatch-specific DNA glycosylase n=2 Tax=Ba... 178 5e-44 UniRef50_C0ZBM4 Putative uncharacterized protein n=1 Tax=Breviba... 178 6e-44 UniRef50_Q0RRS7 DNA glycosylase, G/U mismatch-specific n=2 Tax=F... 178 6e-44 UniRef50_D2B5F3 G:T/U mismatch-specific DNA glycosylase-like pro... 177 1e-43 UniRef50_C7IMH6 Uracil-DNA glycosylase superfamily n=1 Tax=Clost... 176 2e-43 UniRef50_A6SED0 Putative uncharacterized protein n=2 Tax=Sclerot... 175 6e-43 UniRef50_A2QSA7 Function: In the DNA of higher eukaryotes n=48 T... 175 7e-43 UniRef50_B8N438 Mismatch-specific thymine-DNA glycosylase, putat... 172 3e-42 UniRef50_C6B6I6 Uracil-DNA glycosylase superfamily n=2 Tax=Rhizo... 170 2e-41 UniRef50_B2W4Q2 G/U mismatch-specific DNA glycosylase n=2 Tax=Pl... 169 3e-41 UniRef50_Q9K3I5 G/U mismatch-specific DNA glycosylase n=3 Tax=St... 169 3e-41 UniRef50_B8I864 Uracil-DNA glycosylase superfamily n=1 Tax=Clost... 169 3e-41 UniRef50_Q47NL8 G/U mismatch-specific uracil-DNA glycosylase n=1... 166 2e-40 UniRef50_B2A9Z4 Predicted CDS Pa_1_2290 n=2 Tax=Sordariales RepI... 166 2e-40 UniRef50_D1CA47 Uracil-DNA glycosylase superfamily n=1 Tax=Sphae... 165 5e-40 UniRef50_B9J7K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=... 165 5e-40 UniRef50_C9SR06 G/U mismatch-specific uracil DNA glycosylase n=4... 163 2e-39 UniRef50_A9VA78 Predicted protein n=1 Tax=Monosiga brevicollis R... 161 7e-39 UniRef50_Q6C2V5 YALI0F04796p n=1 Tax=Yarrowia lipolytica RepID=Q... 161 9e-39 UniRef50_B2IIR8 Uracil-DNA glycosylase superfamily n=2 Tax=Bacte... 160 2e-38 UniRef50_A7HV32 Uracil-DNA glycosylase superfamily n=3 Tax=Rhizo... 159 3e-38 UniRef50_A6GLD3 G/U mismatch-specific DNA glycosylase n=1 Tax=Li... 159 4e-38 UniRef50_C6H6H6 Uracil DNA glycosylase superfamily protein n=1 T... 158 7e-38 UniRef50_C4Y7L8 Putative uncharacterized protein n=1 Tax=Clavisp... 157 1e-37 UniRef50_C8NV44 G/U mismatch-specific DNA glycosylase n=2 Tax=Co... 156 2e-37 UniRef50_C5SKW3 Uracil-DNA glycosylase superfamily n=1 Tax=Astic... 156 3e-37 UniRef50_O59825 G/U mismatch-specific uracil DNA glycosylase n=1... 155 6e-37 UniRef50_C8XAI4 Uracil-DNA glycosylase superfamily n=3 Tax=Actin... 155 7e-37 UniRef50_Q7SDB2 Predicted protein n=2 Tax=Sordariaceae RepID=Q7S... 154 1e-36 UniRef50_A7HGS2 Uracil-DNA glycosylase superfamily n=1 Tax=Anaer... 153 3e-36 UniRef50_B2GKB6 G/U mismatch-specific DNA glycosylase n=6 Tax=Ba... 153 3e-36 UniRef50_A8P541 Putative uncharacterized protein n=1 Tax=Coprino... 152 4e-36 UniRef50_B7GT25 Uracil-DNA glycosylase superfamily n=6 Tax=Actin... 151 7e-36 UniRef50_B3QI54 Uracil-DNA glycosylase superfamily n=3 Tax=Rhodo... 147 1e-34 UniRef50_Q1IXM1 Uracil-DNA glycosylase superfamily n=3 Tax=Deino... 147 2e-34 UniRef50_C8RZ13 Uracil-DNA glycosylase superfamily n=1 Tax=Rhodo... 145 4e-34 UniRef50_A8Q5A6 Putative uncharacterized protein n=1 Tax=Malasse... 144 1e-33 UniRef50_A3LQ36 Predicted protein n=1 Tax=Pichia stipitis RepID=... 143 3e-33 UniRef50_B0E0R1 Predicted protein n=7 Tax=Laccaria bicolor S238N... 142 5e-33 UniRef50_C6XAZ6 G/U mismatch-specific uracil-DNA glycosylase n=4... 139 3e-32 UniRef50_C9RN39 Mug G:T/U mismatch-specific DNA glycosylase n=1 ... 139 4e-32 UniRef50_Q82UD2 Putative uncharacterized protein n=1 Tax=Nitroso... 138 1e-31 UniRef50_C9KQM1 Putative G/U mismatch-specific DNA glycosylase n... 136 3e-31 UniRef50_A5EPR0 G/U mismatch-specific uracil-DNA glycosylase n=1... 136 3e-31 UniRef50_A9KMN2 G:T/U mismatch-specific DNA glycosylase n=2 Tax=... 135 5e-31 UniRef50_Q1GYD6 G/U mismatch-specific uracil-DNA glycosylase n=1... 135 6e-31 UniRef50_B7Q2I4 G/T mismatch-specific thymine DNA glycosylase, p... 134 8e-31 UniRef50_B7ATH8 Putative uncharacterized protein n=1 Tax=Bactero... 134 8e-31 UniRef50_C3X2Y9 Mug G:T/U mismatch-specific DNA glycosylase n=1 ... 134 1e-30 UniRef50_Q2FQM0 Putative uncharacterized protein n=1 Tax=Methano... 134 1e-30 UniRef50_A6TMT4 Putative uncharacterized protein n=1 Tax=Alkalip... 133 2e-30 UniRef50_B1HZQ2 Putative uncharacterized protein n=2 Tax=Bacilla... 132 4e-30 UniRef50_C4LCA1 T/U mismatch-specific DNA glycosylase n=10 Tax=B... 132 5e-30 UniRef50_B9TDK4 Putative uncharacterized protein n=1 Tax=Ricinus... 132 5e-30 UniRef50_Q31G15 Uracil DNA glycosylase superfamily protein n=1 T... 132 6e-30 UniRef50_A5DZA6 Putative uncharacterized protein n=2 Tax=Sacchar... 131 6e-30 UniRef50_Q8CUP2 Hypothetical conserved protein n=1 Tax=Oceanobac... 131 8e-30 UniRef50_Q6BL09 DEHA2F17270p n=2 Tax=Saccharomycetaceae RepID=Q6... 131 8e-30 UniRef50_C6QE22 G:T/U mismatch-specific DNA glycosylase-like pro... 131 1e-29 UniRef50_C6X5D8 G:T/U mismatch-specific uracil/thymine DNA-glyco... 129 3e-29 UniRef50_Q7VJA2 Putative uncharacterized protein n=1 Tax=Helicob... 129 3e-29 UniRef50_B9WEN4 G/U mismatch-specific uracil DNA glycosylase, pu... 129 3e-29 UniRef50_Q0F2S7 Putative uncharacterized protein n=1 Tax=Maripro... 129 4e-29 UniRef50_Q5KFR6 Expressed protein n=2 Tax=Filobasidiella neoform... 128 5e-29 UniRef50_D0WFV1 Putative G/U mismatch-specific DNA glycosylase n... 128 6e-29 UniRef50_A0R287 T/U mismatch-specific DNA glycosylase n=5 Tax=My... 128 8e-29 UniRef50_C7N6G7 YjeF-like protein n=6 Tax=Bacteria RepID=C7N6G7_... 128 9e-29 UniRef50_A6Q419 Putative uncharacterized protein n=1 Tax=Nitrati... 127 1e-28 UniRef50_C4FAG7 Putative uncharacterized protein n=2 Tax=Collins... 126 3e-28 UniRef50_C8W9T7 Uracil-DNA glycosylase superfamily n=1 Tax=Atopo... 124 9e-28 UniRef50_Q5P2M8 T/U mismatch-specific DNA glycosylase n=30 Tax=B... 124 9e-28 UniRef50_A6SVT3 G:T/U mismatch-specific DNA glycosylase n=21 Tax... 124 1e-27 UniRef50_C4ZBZ4 G:T/U mismatch-specific DNA glycosylase n=20 Tax... 124 1e-27 UniRef50_C8WJG8 Putative uncharacterized protein n=8 Tax=Bacteri... 124 2e-27 UniRef50_C4Z504 DNA glycosylase n=4 Tax=Clostridiales RepID=C4Z5... 123 3e-27 UniRef50_C2FY74 G/U mismatch-specific DNA glycosylase n=2 Tax=Sp... 123 3e-27 UniRef50_UPI00016AE856 G:T/U mismatch-specific DNA glycosylase-l... 122 3e-27 UniRef50_C5AAU4 Uracil-DNA glycosylase superfamily protein n=8 T... 122 4e-27 UniRef50_Q1VS76 G/U mismatch-specific DNA glycosylase, putative ... 122 4e-27 UniRef50_A9AVD1 G:T/U mismatch-specific DNA glycosylase-like pro... 122 4e-27 UniRef50_B2KDM2 G:T/U mismatch-specific DNA glycosylase-like pro... 121 7e-27 UniRef50_A7ZBI7 Mug G:T/U mismatch-specific DNA glycosylase n=2 ... 121 8e-27 UniRef50_B0T5Y6 Uracil-DNA glycosylase superfamily n=10 Tax=Prot... 121 1e-26 UniRef50_B2UPH3 Uracil-DNA glycosylase superfamily n=2 Tax=Chlam... 120 2e-26 UniRef50_C8WEM8 Uracil-DNA glycosylase superfamily n=4 Tax=Zymom... 120 2e-26 UniRef50_Q1NE08 Uracil-DNA glycosylase superfamily protein n=1 T... 119 2e-26 UniRef50_B1C3U7 Putative uncharacterized protein n=1 Tax=Clostri... 119 2e-26 UniRef50_A6DAL5 Putative uncharacterized protein n=1 Tax=Caminib... 119 4e-26 UniRef50_A6Q7V9 Putative uncharacterized protein n=1 Tax=Sulfuro... 119 4e-26 UniRef50_C5MI19 Predicted protein n=1 Tax=Candida tropicalis MYA... 119 4e-26 UniRef50_A1WBA7 Putative uncharacterized protein n=17 Tax=Proteo... 118 5e-26 UniRef50_A1KBL3 Putative uncharacterized protein n=1 Tax=Azoarcu... 118 9e-26 UniRef50_Q7MVW9 G/U mismatch-specific DNA glycosylase, putative ... 118 1e-25 UniRef50_C6CSS2 Putative uncharacterized protein n=1 Tax=Paeniba... 117 1e-25 UniRef50_B0EFS1 Putative uncharacterized protein n=4 Tax=Entamoe... 116 2e-25 UniRef50_D1PNX0 T/U mismatch-specific DNA glycosylase n=1 Tax=Su... 116 3e-25 UniRef50_D0N7I8 Putative uncharacterized protein n=1 Tax=Phytoph... 116 4e-25 UniRef50_Q1Q0C1 Similar to G/U-mismatch-specific DNA glycolase n... 116 4e-25 UniRef50_B8GKM7 Putative uncharacterized protein n=1 Tax=Methano... 115 6e-25 UniRef50_C9M6L9 Putative G/U mismatch-specific DNA glycosylase n... 114 8e-25 UniRef50_C6MEU9 Putative uncharacterized protein n=1 Tax=Nitroso... 113 2e-24 UniRef50_A5VDD8 Uracil-DNA glycosylase superfamily n=8 Tax=Prote... 113 2e-24 UniRef50_C1EDR1 Predicted protein n=2 Tax=Micromonas RepID=C1EDR... 113 3e-24 UniRef50_A8IDF6 Predicted protein n=1 Tax=Chlamydomonas reinhard... 112 4e-24 UniRef50_C7DIS9 Putative uncharacterized protein n=2 Tax=unclass... 112 5e-24 UniRef50_B9LA93 Putative uncharacterized protein n=1 Tax=Nautili... 112 5e-24 UniRef50_Q4P6C9 Putative uncharacterized protein n=1 Tax=Ustilag... 112 6e-24 UniRef50_Q8PWD8 Conserved protein n=3 Tax=Methanosarcina RepID=Q... 111 1e-23 UniRef50_A2U3I4 Putative uncharacterized protein n=2 Tax=Polarib... 109 3e-23 UniRef50_B7CDR6 Putative uncharacterized protein n=1 Tax=Eubacte... 109 3e-23 UniRef50_C5K9X8 Putative uncharacterized protein n=2 Tax=Perkins... 108 9e-23 UniRef50_C3XK88 T/U mismatch-specific DNA glycosylase n=1 Tax=He... 107 1e-22 UniRef50_D0J0R4 Putative uncharacterized protein n=6 Tax=Burkhol... 107 1e-22 UniRef50_A8U688 Putative uncharacterized protein n=1 Tax=Carnoba... 101 1e-20 UniRef50_UPI0000D8867B hypothetical protein CIMG_10236 n=1 Tax=C... 97 2e-19 UniRef50_Q4T9X4 Chromosome undetermined SCAF7485, whole genome s... 90 3e-17 UniRef50_A7I1K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=... 89 7e-17 UniRef50_A8A982 Phage SPO1 DNA polymerase-related protein n=1 Ta... 84 2e-15 UniRef50_B9KXL9 Phage DNA polymerase-related protein n=17 Tax=ce... 80 4e-14 UniRef50_B2V9S9 Phage SPO1 DNA polymerase-related protein n=18 T... 76 3e-13 UniRef50_A8MCZ3 Uracil-DNA glycosylase superfamily n=4 Tax=Therm... 73 4e-12 UniRef50_B7KM64 Phage SPO1 DNA polymerase-related protein n=1 Ta... 72 7e-12 UniRef50_UPI00017F04BF PREDICTED: similar to thymine-DNA glycosy... 70 4e-11 UniRef50_B6R445 Bacteriophage-type DNA polymerase N-terminal dom... 66 3e-10 Sequences not found previously or not previously below threshold: UniRef50_A5FBU1 G:T/U mismatch-specific DNA glycosylase-like pro... 116 2e-25 UniRef50_C3RJZ3 G:T/U mismatch-specific DNA glycosylase n=6 Tax=... 115 5e-25 UniRef50_A3ZBY1 T/U mismatch-specific DNA glycosylase n=15 Tax=C... 113 2e-24 UniRef50_A4B987 Putative uncharacterized protein n=1 Tax=Reineke... 113 3e-24 UniRef50_B9CP28 T/U mismatch-specific DNA glycosylase n=1 Tax=At... 111 7e-24 UniRef50_A7IAF7 Putative uncharacterized protein n=1 Tax=Candida... 110 2e-23 UniRef50_A0RQD6 Mug G:T/U mismatch-specific DNA glycosylase n=6 ... 110 2e-23 UniRef50_A8RDC8 Putative uncharacterized protein n=1 Tax=Eubacte... 107 1e-22 UniRef50_C5EZE8 Putative uncharacterized protein n=2 Tax=Helicob... 102 4e-21 UniRef50_C7MKT3 G:T/U mismatch-specific DNA glycosylase n=1 Tax=... 102 6e-21 UniRef50_UPI0001BC5AE8 DNA glycosylase n=1 Tax=Fusobacterium sp.... 101 8e-21 UniRef50_B8DUI0 T/U mismatch-specific DNA glycosylase n=10 Tax=B... 99 4e-20 UniRef50_Q04GE7 G:T/U mismatch-specific DNA glycosylase n=2 Tax=... 91 9e-18 UniRef50_Q12X97 Putative uncharacterized protein n=1 Tax=Methano... 81 1e-14 UniRef50_C4FFI1 Putative uncharacterized protein n=1 Tax=Bifidob... 80 2e-14 UniRef50_C9KTN5 Putative G/U mismatch-specific DNA glycosylase n... 79 6e-14 UniRef50_C9YCM9 Putative uncharacterized protein n=1 Tax=Curviba... 72 7e-12 UniRef50_A0RV64 Uracil-DNA glycosylase n=2 Tax=Thaumarchaeota Re... 70 2e-11 UniRef50_A4E9G3 Putative uncharacterized protein n=1 Tax=Collins... 70 4e-11 UniRef50_A7C715 Putative uncharacterized protein n=1 Tax=Beggiat... 69 7e-11 UniRef50_B6IW73 Uracil-DNA glycosylase, family 4 protein, putati... 68 1e-10 UniRef50_C1SMS3 Uracil-DNA glycosylase, family 4 n=1 Tax=Denitro... 68 1e-10 UniRef50_B5YEZ9 DNA polymerase, bacteriophage-type n=2 Tax=Dicty... 68 2e-10 UniRef50_B3U4U7 Uracil-DNA glycosylase family 4 n=1 Tax=Candidat... 66 4e-10 UniRef50_B2IF85 Phage SPO1 DNA polymerase-related protein n=1 Ta... 66 6e-10 UniRef50_Q2LTM2 Uracil DNA glycosylase superfamily protein n=1 T... 66 6e-10 UniRef50_B2A668 Phage SPO1 DNA polymerase-related protein n=1 Ta... 65 9e-10 UniRef50_Q2GCJ7 Uracil-DNA glycosylase, family 4 n=4 Tax=Anaplas... 65 1e-09 UniRef50_B3DV47 Uracil-DNA glycosylase n=1 Tax=Methylacidiphilum... 65 1e-09 UniRef50_D1QTK9 Mug protein n=1 Tax=Prevotella oris F0302 RepID=... 65 1e-09 UniRef50_B2SQC4 T/U mismatch-specific DNA glycosylase n=13 Tax=X... 65 1e-09 UniRef50_C8SCY8 Phage SPO1 DNA polymerase-related protein n=1 Ta... 65 1e-09 UniRef50_A0Q3G6 Phage SPO1 DNA polymerase-related protein n=5 Ta... 64 2e-09 UniRef50_B0VFE1 Uracil-DNA glycosylase n=1 Tax=Candidatus Cloaca... 64 2e-09 UniRef50_Q3SAA1 Uracyl DNA glycosylase n=4 Tax=Euryarchaeota Rep... 64 2e-09 UniRef50_A8UU74 N-terminus of phage SPO1 DNA polymerase n=3 Tax=... 64 2e-09 UniRef50_A8PQ90 Phage SPO1 DNA polymerase domain protein n=1 Tax... 63 3e-09 UniRef50_Q73JU4 DNA polymerase domain protein n=1 Tax=Treponema ... 63 5e-09 UniRef50_C4G9N7 Putative uncharacterized protein n=1 Tax=Shuttle... 63 5e-09 UniRef50_A6Q9E7 Uracil-DNA glycosylase n=4 Tax=Epsilonproteobact... 63 5e-09 UniRef50_D2RLY7 Putative uncharacterized protein n=1 Tax=Acidami... 62 6e-09 UniRef50_A5KS51 Phage SPO1 DNA polymerase-related protein n=1 Ta... 62 9e-09 UniRef50_A5WHQ7 G:T/U mismatch-specific DNA glycosylase-like pro... 62 9e-09 UniRef50_C6MFU1 Phage SPO1 DNA polymerase-related protein n=1 Ta... 61 1e-08 UniRef50_C8PQ75 DNA polymerase domain protein n=1 Tax=Treponema ... 61 1e-08 UniRef50_Q0AVT6 Uracil-DNA glycosylase-like protein n=4 Tax=Clos... 61 2e-08 UniRef50_D2LEU1 Phage SPO1 DNA polymerase-related protein n=1 Ta... 61 2e-08 UniRef50_Q1H4A3 Phage SPO1 DNA polymerase-related protein n=2 Ta... 61 2e-08 UniRef50_A2BJ05 Predicted Uracil-DNA glycosylase n=8 Tax=Archaea... 60 2e-08 UniRef50_D1PL52 Uracil-DNA glycosylase n=5 Tax=Bacteria RepID=D1... 60 3e-08 UniRef50_A8VW39 Putative uncharacterized protein n=1 Tax=Bacillu... 60 3e-08 UniRef50_Q899E0 DNA polymerase, bacteriophage-type n=4 Tax=Clost... 60 3e-08 UniRef50_Q0W079 Predicted uracil-DNA glycosylase superfamily pro... 59 5e-08 UniRef50_B9XNJ8 Phage SPO1 DNA polymerase-related protein n=1 Ta... 59 5e-08 UniRef50_C9MT19 Mug protein n=10 Tax=Prevotella RepID=C9MT19_9BACT 59 6e-08 UniRef50_Q119E8 Phage SPO1 DNA polymerase-related protein n=20 T... 59 6e-08 UniRef50_A4EBV6 Putative uncharacterized protein n=1 Tax=Collins... 59 7e-08 UniRef50_A9F5H8 DNA-directed DNA polymerase n=3 Tax=Bacteria Rep... 59 7e-08 UniRef50_B9KBE0 Phage SPO1 DNA polymerase-related protein n=1 Ta... 59 7e-08 UniRef50_C7RSP2 Phage SPO1 DNA polymerase-related protein n=1 Ta... 59 7e-08 UniRef50_B2S2H6 DNA polymerase, bacteriophage-type n=2 Tax=Trepo... 59 7e-08 UniRef50_D0SIJ1 G:T/U mismatch-specific DNA glycosylase n=42 Tax... 58 1e-07 UniRef50_B1KRY1 Uracil-DNA glycosylase n=12 Tax=Clostridia RepID... 58 1e-07 UniRef50_C1DCC6 Probable DNA-directed DNA polymerase, bacterioph... 58 1e-07 UniRef50_D0LVE3 Phage SPO1 DNA polymerase-related protein n=1 Ta... 58 1e-07 UniRef50_C0GI50 Phage SPO1 DNA polymerase-related protein n=1 Ta... 58 1e-07 UniRef50_C1TMP3 Uracil-DNA glycosylase, family 4 n=1 Tax=Dethios... 58 1e-07 UniRef50_A1RYC0 Phage SPO1 DNA polymerase-related protein n=23 T... 58 1e-07 UniRef50_Q6L0J8 DNA polymerase, bacteriophage-type n=1 Tax=Picro... 58 1e-07 UniRef50_A1WUV9 Phage SPO1 DNA polymerase-related protein n=3 Ta... 58 1e-07 UniRef50_D0RRM7 Uracil DNA glycosylase superfamily n=1 Tax=alpha... 58 1e-07 UniRef50_B4UK16 Phage SPO1 DNA polymerase-related protein n=7 Ta... 58 2e-07 UniRef50_C7N4J2 Uracil-DNA glycosylase, family 4 n=2 Tax=Slackia... 57 2e-07 UniRef50_A0LFG4 Phage SPO1 DNA polymerase-related protein n=2 Ta... 57 2e-07 UniRef50_Q2S6J8 Uracil-DNA glycosylase, family 4 n=2 Tax=Rhodoth... 57 2e-07 UniRef50_C3X2T7 DNA polymerase bacteriophage-type n=2 Tax=Oxalob... 57 2e-07 UniRef50_A3CY07 Phage SPO1 DNA polymerase-related protein n=1 Ta... 57 2e-07 UniRef50_C1FF23 Predicted protein n=2 Tax=cellular organisms Rep... 57 2e-07 UniRef50_B6AR05 Uracil-DNA glycosylase, family 4 n=1 Tax=Leptosp... 57 3e-07 UniRef50_B8GUB2 Phage SPO1 DNA polymerase-related protein n=2 Ta... 57 3e-07 UniRef50_C7Q262 Uracil-DNA glycosylase superfamily n=3 Tax=Actin... 56 3e-07 UniRef50_Q0BR04 Uracil DNA glycosylase superfamily protein n=2 T... 56 3e-07 UniRef50_Q5SKC5 Uracil-DNA glycosylase n=6 Tax=Thermaceae RepID=... 56 3e-07 UniRef50_Q0EZR7 Phage SPO1 DNA polymerase domain protein n=1 Tax... 56 3e-07 UniRef50_Q47EK2 Phage SPO1 DNA polymerase-related protein n=1 Ta... 56 4e-07 UniRef50_A9II61 Bacteriophage-related DNA polymerase n=4 Tax=Bor... 56 4e-07 UniRef50_D1Y5H9 Uracil-DNA glycosylase n=1 Tax=Pyramidobacter pi... 56 4e-07 UniRef50_Q2Y9I2 Phage SPO1 DNA polymerase-related protein n=1 Ta... 56 4e-07 UniRef50_A5N3Q6 Phage-related DNA polymerase n=7 Tax=Clostridium... 56 4e-07 UniRef50_A9A4H4 Phage SPO1 DNA polymerase-related protein n=3 Ta... 56 4e-07 UniRef50_D1CIM8 Uracil-DNA glycosylase superfamily n=1 Tax=Therm... 56 5e-07 UniRef50_C8W879 Phage SPO1 DNA polymerase-related protein n=1 Ta... 56 5e-07 UniRef50_B4U5Y1 Phage SPO1 DNA polymerase-related protein n=1 Ta... 56 5e-07 UniRef50_Q8ZYS2 Uracil DNA glycosylase (Pa-UDGa) n=1 Tax=Pyrobac... 56 6e-07 UniRef50_Q3J878 Phage SPO1 DNA polymerase-related protein n=3 Ta... 56 7e-07 UniRef50_A9AZ06 Phage SPO1 DNA polymerase-related protein n=1 Ta... 55 7e-07 >UniRef50_B7MB04 G/U mismatch-specific DNA glycosylase n=70 Tax=Enterobacteriaceae RepID=MUG_ECO45 Length = 168 Score = 208 bits (530), Expect = 5e-53, Method: Composition-based stats. Identities = 168/168 (100%), Positives = 168/168 (100%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL Sbjct: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG Sbjct: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR 168 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR Sbjct: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRGR 168 >UniRef50_UPI0001792C18 PREDICTED: similar to GA15170-PA n=3 Tax=Coelomata RepID=UPI0001792C18 Length = 1013 Score = 207 bits (526), Expect = 1e-52, Method: Composition-based stats. Identities = 47/165 (28%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T + + LL Sbjct: 261 LPDHLTNNLDIVIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTPEPMTADDDYKLLQ 320 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V+R T + +++++E+ G + L+EK++ ++P+ GK +E ++ Sbjct: 321 NGIGFTNMVERATKGSADLTRKEIKEGSQILLEKLQRFKPKIAVFNGKLIFEVFSGKKDF 380 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T +WV+P+ S V Y L + Sbjct: 381 TFGRQPELVDGTNTYMWVMPSSSARCAQLPRAADKVPFYAALKKF 425 >UniRef50_UPI0000DB6DF5 PREDICTED: similar to Thd1 CG1981-PA n=2 Tax=Apocrita RepID=UPI0000DB6DF5 Length = 973 Score = 206 bits (525), Expect = 2e-52, Method: Composition-based stats. Identities = 48/166 (28%), Positives = 80/166 (48%), Gaps = 4/166 (2%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D L L ++ GINPGL +A G +A P N FWK ++ +G T L + LL Sbjct: 206 VLPDHLTNNLDIIIIGINPGLFAAYKGHHYAGPGNHFWKCLHLSGLTPEPLTADDDYKLL 265 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G T +V R T + +++++E+ G L+EK+ Y+P+ GK YE ++ Sbjct: 266 RFGIGFTNMVARATKGSADLTRKEIKEGSHILLEKLRKYKPKIAVFNGKLIYEVFSGKKD 325 Query: 121 AQWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T +WV+P+ S V Y L + Sbjct: 326 FGFGRQPNLVEGTNTYMWVMPSSSARCAQLPRAADKVPYYAALKKF 371 >UniRef50_UPI0000E46F0A PREDICTED: similar to G/T mismatch-specific thymine DNA glycosylase n=1 Tax=Strongylocentrotus purpuratus RepID=UPI0000E46F0A Length = 588 Score = 205 bits (521), Expect = 6e-52, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DIL L +V GINPGL +A G +A P N FWK +Y +G + + L D Sbjct: 68 LPDILVHNLDIVIIGINPGLMAAYKGHHYAGPGNHFWKCLYLSGLVPEPMTCMDDVKLPD 127 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G T +V R T + ++ ++E+ G + ++EKI+ Y P GK YE ++ Sbjct: 128 FGVGFTNIVGRTTRGSADLKRKEIKEGAKVVVEKIQKYLPLIACFNGKGIYEIYSGKKDF 187 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 + G+Q I T+ ++V+P+ S + V Y L + Sbjct: 188 EVGRQPENIPGTETVVYVMPSSSARCSQFPRAQDKVPFYITLKEL 232 >UniRef50_B4L7B8 GI14092 n=1 Tax=Drosophila mojavensis RepID=B4L7B8_DROMO Length = 1737 Score = 205 bits (521), Expect = 6e-52, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 83/165 (50%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T Q+ ++ L+ Sbjct: 796 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQMSAEDDYTLVK 855 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 856 HGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 915 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 916 HFGRQPDRVEGTNTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 960 >UniRef50_B4NHL6 GK13687 n=2 Tax=Coelomata RepID=B4NHL6_DROWI Length = 1761 Score = 203 bits (516), Expect = 3e-51, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 765 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 824 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 825 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 884 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 885 HFGRQPDRVEGTDTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 929 >UniRef50_B4JZW0 GH23925 n=1 Tax=Drosophila grimshawi RepID=B4JZW0_DROGR Length = 1557 Score = 202 bits (515), Expect = 3e-51, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T Q+ + LL Sbjct: 611 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQMSADDDYTLLK 670 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ +QP+ GK +E ++ Sbjct: 671 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFQPKIAVFNGKLIFEVFSGKKEF 730 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 731 HFGRQPDRVDGTNTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 775 >UniRef50_Q9XYZ3 G/T mismatch-specific thymine DNA glycosylase n=7 Tax=Sophophora RepID=Q9XYZ3_DROME Length = 1095 Score = 202 bits (515), Expect = 3e-51, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 136 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 195 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 196 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 255 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 256 HFGRQPDRVDGTDTFIWVMPSSSARCAQLPRAADKVPFYAALKKF 300 >UniRef50_Q9V4D8 Thd1 n=1 Tax=Drosophila melanogaster RepID=Q9V4D8_DROME Length = 1738 Score = 202 bits (515), Expect = 3e-51, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E L+ Sbjct: 779 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMSADEDHKLIK 838 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 839 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 898 Query: 122 QWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 899 HFGRQPDRVDGTDTFIWVMPSSSARCAQLPRAADKVPFYAALKKF 943 >UniRef50_B4MF03 GJ13949 n=2 Tax=Drosophila virilis RepID=B4MF03_DROVI Length = 1038 Score = 202 bits (513), Expect = 5e-51, Method: Composition-based stats. Identities = 50/165 (30%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T QL + L+ Sbjct: 815 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQEQLSADDDYTLVK 874 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 875 QGIGFTNMVARATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 934 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 935 HFGRQPDRVDGTNTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 979 >UniRef50_UPI000186DD28 G/t mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Pediculus humanus corporis RepID=UPI000186DD28 Length = 1239 Score = 201 bits (512), Expect = 6e-51, Method: Composition-based stats. Identities = 48/165 (29%), Positives = 79/165 (47%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y +G T + + LL Sbjct: 531 LPDHLTTNLDIVIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTPEPMTADDDYKLLS 590 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G + L+EK++ ++P+ GK YE + Sbjct: 591 VGIGFTNMVARATKGSADLTRKEIKEGSQILLEKLQKFKPKIAVFNGKLIYEVFSGNKDF 650 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T +WV+P+ S V Y L + Sbjct: 651 NFGRQPDFVDGTNTYMWVMPSSSARCAQLPRAADKVPFYAALKKF 695 >UniRef50_B3MY42 GF22816 n=1 Tax=Drosophila ananassae RepID=B3MY42_DROAN Length = 1192 Score = 201 bits (512), Expect = 7e-51, Method: Composition-based stats. Identities = 52/165 (31%), Positives = 81/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +V GINPGL +A G +A P N FWK +Y AG T Q+ E LL Sbjct: 657 IPDHLCDNLDIVIVGINPGLFAAYKGHHYAGPGNHFWKCLYLAGLTQEQMNADEDHKLLR 716 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G R L+EK++ ++P+ GK +E ++ Sbjct: 717 QGIGFTNMVSRATKGSADLTRKEIKEGSRILLEKLQRFRPKVAVFNGKLIFEVFSGKKEF 776 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +G+Q + T IWV+P+ S V Y L + Sbjct: 777 HFGRQPGRVDGTDTYIWVMPSSSARCAQLPRAADKVPFYAALKKF 821 >UniRef50_B5REH0 G/U mismatch-specific DNA glycosylase n=51 Tax=Enterobacteriaceae RepID=MUG_SALG2 Length = 168 Score = 200 bits (510), Expect = 1e-50, Method: Composition-based stats. Identities = 134/167 (80%), Positives = 150/167 (89%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MV+DILAPGLRVVFCGINPGLSSA TGFPFAHPANRFWKVI+ AGFTDRQLKP+EA+ LL Sbjct: 1 MVKDILAPGLRVVFCGINPGLSSANTGFPFAHPANRFWKVIHLAGFTDRQLKPEEAEKLL 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 D+RCGVTKLVDRPTVQA EV EL +GGR LIEKIEDYQP ALA+LGKQA+EQGFSQRG Sbjct: 61 DFRCGVTKLVDRPTVQATEVKLHELRSGGRNLIEKIEDYQPAALAVLGKQAFEQGFSQRG 120 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 WGK+ + IG+T +WVLPNPSGL+R+ EKLVEAYRELDQAL++RG Sbjct: 121 IAWGKKKIAIGATMVWVLPNPSGLNRIKTEKLVEAYRELDQALIMRG 167 >UniRef50_A6CPG7 G/U mismatch-specific DNA glycosylase n=2 Tax=Bacillus RepID=A6CPG7_9BACI Length = 170 Score = 198 bits (504), Expect = 5e-50, Method: Composition-based stats. Identities = 57/162 (35%), Positives = 93/162 (57%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +VF G NP L S G +A+ +NRFWK+++ +G T RQ P+E LL Sbjct: 4 LPDHLKENLDIVFIGFNPSLKSGEVGHNYANKSNRFWKILHLSGLTPRQYAPEEDYTLLQ 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T +V+RPT +A +++K+E G L K++ Y P+ +GK Y+Q +R Sbjct: 64 LGYGLTNIVERPTKEAADITKEEYSEGRVILKNKLKKYSPKIACYVGKGVYQQFSKKRKI 123 Query: 122 QWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 +WG Q + + +V P+ SGL R+ ++++V Y+EL + Sbjct: 124 EWGIQEESVVEGVIDFVAPSSSGLVRMKIDEIVSIYKELTEL 165 >UniRef50_C4QAV8 G/t mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Schistosoma mansoni RepID=C4QAV8_SCHMA Length = 869 Score = 198 bits (503), Expect = 7e-50, Method: Composition-based stats. Identities = 52/165 (31%), Positives = 79/165 (47%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L GL +V GINP L+SA G +A P N FW + QAG + + +LD Sbjct: 96 LPDHLKEGLDIVIVGINPSLASAHVGHHYAGPGNHFWTCLSQAGLVPMAVSCYDDSKMLD 155 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G T + RPT A E++++E+ AG ++EK+ Y+P+ GK YE + Sbjct: 156 YGIGFTNVCTRPTKGAAELTRKEMKAGAAIMLEKMRKYKPKIAVFNGKGIYEAYVGHKNF 215 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQA 162 G+Q T+ T I + PS +R E + + L + Sbjct: 216 CMGRQPTTLDGTDIVIFVMPSSSARCAQLPRAEDKLPFFLALRKL 260 >UniRef50_UPI0000D578C1 PREDICTED: similar to Thd1 n=1 Tax=Tribolium castaneum RepID=UPI0000D578C1 Length = 702 Score = 197 bits (501), Expect = 1e-49, Method: Composition-based stats. Identities = 49/165 (29%), Positives = 82/165 (49%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L ++ GINPGL +A G +A P N FWK +Y +G T +Q+ E LL Sbjct: 170 LPDHLTENLDIIIIGINPGLFAAYKGHHYAGPGNHFWKCLYLSGLTQQQVSADEDYKLLQ 229 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V R T + +++++E+ G + L+EK+ ++P+ GK YE ++ Sbjct: 230 VGIGFTNMVQRATKGSADLTRKEIKEGSQILLEKLRKFRPKIAVFNGKLIYEVFSGKKDF 289 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 ++G+Q + T +WV+P+ S V Y L + Sbjct: 290 KFGRQPDLVDGTNTYMWVMPSSSARCAQLPRAADKVPFYAALKKF 334 >UniRef50_A7S194 Predicted protein n=1 Tax=Nematostella vectensis RepID=A7S194_NEMVE Length = 250 Score = 195 bits (497), Expect = 3e-49, Method: Composition-based stats. Identities = 49/166 (29%), Positives = 86/166 (51%), Gaps = 5/166 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D +A L ++F GINPGL+SA G +A P N FW +Y +G +L ++ + Sbjct: 55 LLPDRVAENLDILFIGINPGLTSAYKGHHYAGPNNHFWPCLYDSGLVPEKLTFRDDEKCP 114 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y G+T +V+R T ++++S++E+ G LI K++ +P GK YE + Sbjct: 115 AYGIGLTNIVERTTRGSSDLSRKEIKDGVDALIVKVKRLKPLVACFNGKGIYEIFSKSK- 173 Query: 121 AQWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 + G+QT I T ++V+P+ SG + ++ + EL Sbjct: 174 CEIGRQTKCIPGTNVVVYVMPSSSGRTMTYPRASDKLKFFTELKSL 219 >UniRef50_UPI0001926FCB PREDICTED: similar to predicted protein n=2 Tax=Hydra magnipapillata RepID=UPI0001926FCB Length = 226 Score = 194 bits (493), Expect = 1e-48, Method: Composition-based stats. Identities = 49/164 (29%), Positives = 84/164 (51%), Gaps = 3/164 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 MV D+L L ++F GINPGL SA G ++ N FW +++++G L Q L Sbjct: 53 MVPDLLEEHLDIIFVGINPGLVSAYRGHHYSGNNNHFWPLLFESGLISEPLTWQNDYKCL 112 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y+ G+T +V R T A E++ E+ +G ++E I+ P + GK YE ++ Sbjct: 113 QYKIGMTNIVSRTTRSAAELTNSEIRSGKSSIVEMIKKLNPLVVCFNGKGIYEVFSGKKK 172 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 + G Q I ++ ++V+P+ SGL E + +++L + Sbjct: 173 CEVG-QQENIFNSAVYVMPSTSGLVTQYPRKEDKLVFFQQLKEL 215 >UniRef50_Q4T9X5 Chromosome undetermined SCAF7485, whole genome shotgun sequence. (Fragment) n=3 Tax=Metazoa RepID=Q4T9X5_TETNG Length = 333 Score = 191 bits (486), Expect = 7e-48, Method: Composition-based stats. Identities = 54/172 (31%), Positives = 90/172 (52%), Gaps = 11/172 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP L ++ GINPGL SA G + +P N FWK ++ +GFT++QL Q L + Sbjct: 43 LPDVIAPNLDILIIGINPGLLSAYKGHHYPNPGNHFWKCLFLSGFTEKQLNYSHDQSLPE 102 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG----- 115 Y G T +V+R T + ++S +E+ GGR+L+ K++ Y+P A GK YE Sbjct: 103 IYGIGFTNMVERTTPGSKDLSNKEIREGGRQLLAKLQKYRPLIAAFNGKGIYEIFCRETF 162 Query: 116 -FSQRGAQWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 + ++G Q + TQ +++P+ S + V Y +L + Sbjct: 163 GVKAKNLEFGLQPYRVPETQTVCYLMPSSSPRCAQFPRAQDKVHFYIKLKEL 214 >UniRef50_UPI000178957E Uracil-DNA glycosylase superfamily n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178957E Length = 171 Score = 190 bits (483), Expect = 2e-47, Method: Composition-based stats. Identities = 54/165 (32%), Positives = 91/165 (55%), Gaps = 1/165 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D L GL +VF G NP L S G +A+P N FW++++++G T R + E LL Sbjct: 4 VSDHLDYGLTLVFIGFNPSLRSGEVGHHYANPRNNFWRILHKSGLTPRLYEAAEDGELLK 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T +V RPT A +++++E G L +K+E+Y+P+ +GK Y + + A Sbjct: 64 LGYGFTNIVARPTRGAEDITREEYRLGRETLRKKLEEYRPEVACFVGKGVYTEYSGKAKA 123 Query: 122 QWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 WG Q + + + +V P+ SGL R+ +++++ YR+L + Sbjct: 124 DWGIQPDSVVDGVREFVAPSSSGLVRMPMDEIIGIYRQLHDWISR 168 >UniRef50_C7PGY3 Uracil-DNA glycosylase superfamily n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PGY3_CHIPD Length = 171 Score = 189 bits (480), Expect = 4e-47, Method: Composition-based stats. Identities = 69/164 (42%), Positives = 99/164 (60%), Gaps = 1/164 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++DI+ L V+FCGINPGL SA G F+ +NRFWKV++QAGFT ++ P +LD Sbjct: 5 LQDIITENLTVLFCGINPGLKSAVQGHHFSGRSNRFWKVLHQAGFTPHEIDPTLDSTILD 64 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T V+R TV+A+E+SK E KIE Y+P LA LGK AY ++ Sbjct: 65 FGYGLTTAVERATVRADELSKAEFDQSFETFKRKIEQYKPTYLAFLGKAAYMAFSEKKNV 124 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALV 164 QWG Q + + +WVLPN SGL+R +L L++ ++ L + Sbjct: 125 QWGLQEEPLCNAIVWVLPNTSGLNRGFTLSMLIDHFQALHNHIG 168 >UniRef50_UPI0000DB71A4 PREDICTED: similar to Thd1 CG1981-PA, partial n=2 Tax=Apocrita RepID=UPI0000DB71A4 Length = 263 Score = 188 bits (479), Expect = 4e-47, Method: Composition-based stats. Identities = 51/165 (30%), Positives = 84/165 (50%), Gaps = 3/165 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L L +VF GINP L +A G +A P N F+K+++++G T R L +E LL Sbjct: 25 LPDYLKMNLDLVFVGINPSLMAAHRGRYYAGPGNHFYKLLHESGLTPRCLIYEEDYKLLQ 84 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-RG 120 Y G+T +V R T + ++ + E+ G + + EK++ Y+P+ GK YE ++ Sbjct: 85 YGIGLTNIVTRATKSSADLKRTEIKEGSKIIEEKLKLYKPKIAVFNGKCIYEVFANKMGN 144 Query: 121 AQWGKQTLTIGSTQIWVLPNPSG--LSRVSLEKLVEAYRELDQAL 163 +G Q I T IWV P+ S + + + + L + L Sbjct: 145 FNFGLQLERINETAIWVTPSSSARCANFPRMTDKLHFFTALKKYL 189 >UniRef50_Q67QW8 G/U mismatch-specific DNA glycosylase n=1 Tax=Symbiobacterium thermophilum RepID=Q67QW8_SYMTH Length = 177 Score = 187 bits (476), Expect = 1e-46, Method: Composition-based stats. Identities = 66/167 (39%), Positives = 95/167 (56%), Gaps = 2/167 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D LAPGLRVVF G NPG +SA G +A+P NRF+ +++QAG T+R+ P E LL+ Sbjct: 5 LPDYLAPGLRVVFVGFNPGETSAREGHYYAYPGNRFYWLLWQAGLTERRYAPHEDARLLE 64 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T LV R T + ++S EL +G +L+ K+ Y+P+ GK Y +R Sbjct: 65 IGYGLTDLVGRSTRSSGDLSAAELRSGREELLAKLARYRPRVACYNGKGIYAALTGRRRV 124 Query: 122 QWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 +G Q + I +V +PSG SR L + + YREL AL+ G Sbjct: 125 AYGLQPESVIPGVLDFVAASPSGRSREPLAEKLRLYRELR-ALIAEG 170 >UniRef50_UPI0001926888 PREDICTED: similar to GL18173 n=1 Tax=Hydra magnipapillata RepID=UPI0001926888 Length = 282 Score = 187 bits (475), Expect = 1e-46, Method: Composition-based stats. Identities = 58/165 (35%), Positives = 89/165 (53%), Gaps = 4/165 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D + GL +VF GINPGL++A +G + P N FWK +Y +G D+ + Q LLD Sbjct: 49 LPDYMRNGLDIVFVGINPGLAAAFSGKYYTGPGNHFWKCLYLSGLIDKPFTSNDDQKLLD 108 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G T +V R T +N++ K+EL AG + L EKI+ Y+P + GK +Y+ ++ Sbjct: 109 YNIGFTNMVSRTTKGSNDLCKEELIAGSKVLKEKIQLYRPLIVVFNGKISYQVFSGKKDF 168 Query: 122 QWGKQTLTIGS--TQIWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 +GKQ+ + T IWV+P+ S E V + L + Sbjct: 169 FFGKQSELVEGTDTHIWVMPSSSARCAQLPRAEDKVPFFTGLKRF 213 >UniRef50_Q39CW6 G/U mismatch-specific uracil-DNA glycosylase n=10 Tax=Bacteria RepID=Q39CW6_BURS3 Length = 210 Score = 183 bits (466), Expect = 1e-45, Method: Composition-based stats. Identities = 73/160 (45%), Positives = 104/160 (65%), Gaps = 1/160 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP L V+FCGINPGL++A TG FA +NRFW+VI+ AGFT ++ PQ+ +L Sbjct: 33 LPDLIAPQLDVLFCGINPGLTAAATGHHFAGRSNRFWRVIHLAGFTPAEISPQDDHAILQ 92 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y CG+T +V RPT A+++S+ E A + K+ Y P+ +A LGK AY QR Sbjct: 93 YGCGLTAVVKRPTASADQLSRAEFVAAAAEFEHKVARYGPRFVAFLGKAAYSALSGQREI 152 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRELD 160 +WG Q + + +WVLPNPSG + SL+ LV+AYR+L Sbjct: 153 EWGLQPARMQGSSVWVLPNPSGRNLAFSLDDLVDAYRQLR 192 >UniRef50_Q13569 G/T mismatch-specific thymine DNA glycosylase n=47 Tax=Euteleostomi RepID=TDG_HUMAN Length = 410 Score = 183 bits (464), Expect = 3e-45, Method: Composition-based stats. Identities = 52/172 (30%), Positives = 83/172 (48%), Gaps = 11/172 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL- 60 + DIL L +V GINPGL +A G + P N FWK ++ +G ++ QL + L Sbjct: 124 LPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSEVQLNHMDDHTLPG 183 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-- 118 Y G T +V+R T + ++S +E GGR L++K++ YQP+ GK YE + Sbjct: 184 KYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIAVFNGKCIYEIFSKEVF 243 Query: 119 ----RGAQWGKQTLTIGSTQ--IWVLPNPSGLSRV--SLEKLVEAYRELDQA 162 + ++G Q I T+ +V+P+ S + V Y +L Sbjct: 244 GVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDKVHYYIKLKDL 295 >UniRef50_Q46SQ2 G/U mismatch-specific uracil-DNA glycosylase n=4 Tax=Proteobacteria RepID=Q46SQ2_RALEJ Length = 197 Score = 183 bits (464), Expect = 3e-45, Method: Composition-based stats. Identities = 77/168 (45%), Positives = 102/168 (60%), Gaps = 1/168 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DI+ PGL VFCG+NPGL +A +G FA NRFW+VI+ AG TDR+L+P + + LL Sbjct: 16 LPDIMKPGLSAVFCGLNPGLRAAASGHHFAGHGNRFWRVIHLAGITDRELEPVDDRQLLS 75 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T V RPT A +S E +LI K+E + P +A LGK AY SQR Sbjct: 76 YGYGLTTAVSRPTASAAALSHNEYVDASVRLIIKLERFHPACVAFLGKAAYAAMTSQRVV 135 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVRGR 168 WG Q +G ++WVLPNPSG +R SL LV+AY E ++ + R Sbjct: 136 PWGPQRRRMGGARVWVLPNPSGRNRAFSLAGLVDAYGEFRASVESQIR 183 >UniRef50_Q1J3D2 Uracil-DNA glycosylase superfamily n=1 Tax=Deinococcus geothermalis DSM 11300 RepID=Q1J3D2_DEIGD Length = 177 Score = 182 bits (463), Expect = 4e-45, Method: Composition-based stats. Identities = 61/168 (36%), Positives = 92/168 (54%), Gaps = 1/168 (0%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +++D++A G++V+F G NP S G +A N+FW+++ +G T R L+P+E L Sbjct: 10 VLQDLVAEGVQVLFVGFNPSPRSVQQGHHYAGRGNQFWRLLAASGLTPRLLRPEEDHLLP 69 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G T LV RPTV A E+S+ EL AG L + YQ +A GK Y ++ Sbjct: 70 RWGLGSTNLVARPTVSAAELSRSELRAGVPHLRRLVARYQVPLVAYTGKGVYLAATGKQQ 129 Query: 121 AQWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 A WG Q + IG +VLP+PSGL+R+ + + YR L + G Sbjct: 130 AHWGAQEDSLIGGVTDFVLPSPSGLARLPFAEKLGYYRALAELAQQPG 177 >UniRef50_A8LC62 Uracil-DNA glycosylase superfamily n=16 Tax=Bacteria RepID=A8LC62_FRASN Length = 297 Score = 182 bits (463), Expect = 4e-45, Method: Composition-based stats. Identities = 75/162 (46%), Positives = 100/162 (61%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D++ P RV+ CGINP L S TGF F P+NR W V++ AGFT R+L P E +HL Sbjct: 109 VPDLVGPETRVLLCGINPSLESGATGFHFGTPSNRLWPVLHFAGFTGRRLHPSETEHLRA 168 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T LV R T +A+E++ E+ AG LIE +E +P+ +A LG AY GF +R A Sbjct: 169 RGIGITNLVHRSTARADEIADDEIRAGVPVLIELVERIRPEWVAFLGLAAYRIGFGRRTA 228 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 + G+Q IG +W+LPNPSGL + L LV Y EL +A Sbjct: 229 KVGRQPERIGPAGVWLLPNPSGLNAHYQLPDLVRVYGELREA 270 >UniRef50_C6WCQ8 Uracil-DNA glycosylase superfamily n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WCQ8_ACTMD Length = 200 Score = 180 bits (458), Expect = 1e-44, Method: Composition-based stats. Identities = 71/163 (43%), Positives = 97/163 (59%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+LAPGL + CGINPG SA TG +A P NRFW+V++ AGFT R L P E LL Sbjct: 21 LPDLLAPGLSALLCGINPGGRSAVTGHHYAGPGNRFWRVLHLAGFTPRLLAPSEQALLLG 80 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T LV RPT +A E+ EL AG +L +E ++P A+ ++G AY F + A Sbjct: 81 FGIGLTNLVARPTARAAELGSAELVAGAGRLAGLVERHRPGAVGVVGVSAYRVAFRRPRA 140 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 G Q + + +WVLPNPSGL+ L+ LV +R+ L Sbjct: 141 VVGPQDVELAGAPLWVLPNPSGLNAGYRLDDLVAIFRDFRSEL 183 >UniRef50_Q1D9T2 G/U mismatch-specific DNA glycosylase n=2 Tax=Bacteria RepID=Q1D9T2_MYXXD Length = 166 Score = 178 bits (453), Expect = 5e-44, Method: Composition-based stats. Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D++AP LRV+FCGINP L SA G+ FA P NRFW ++ AG T +P E + LL Sbjct: 1 MPDVIAPRLRVLFCGINPSLYSAVVGYHFARPGNRFWPTLHAAGITPHTFQPTEQEALLA 60 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G+T +VDR T A+ ++ EL G + L K+ YQP+ LA+LG AY F + A Sbjct: 61 LGYGITNVVDRATATADLLNPAELARGAKALEAKVARYQPRFLAVLGVSAYRIAFGRPKA 120 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q +G T++WVLPNPSGL + L+ L + EL +A+ Sbjct: 121 ALGTQPERLGDTRLWVLPNPSGLNAHYQLDALGRLFAELRRAV 163 >UniRef50_C0ZBM4 Putative uncharacterized protein n=1 Tax=Brevibacillus brevis NBRC 100599 RepID=C0ZBM4_BREBN Length = 192 Score = 178 bits (452), Expect = 6e-44, Method: Composition-based stats. Identities = 57/173 (32%), Positives = 91/173 (52%), Gaps = 9/173 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + + L V+FCGINPG SA TGF +A+PAN FW+ +Y+ G T +L P+E LLD Sbjct: 9 LPTYIRRDLAVLFCGINPGRISATTGFHYANPANLFWRGLYEGGLTPHKLLPEETATLLD 68 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG- 120 Y G+T LV RPT A ++ ++ AG ++ I+ Y+P+ + G A+ Q+ Sbjct: 69 YNYGITDLVSRPTRSATDLRNEDYAAGTELFVQMIQTYRPRVICFNGITAFRHATGQKKE 128 Query: 121 -AQWGKQTLTIGSTQ------IWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 G Q + T+ ++V+P+ SG + + E+ + + EL L Sbjct: 129 AISLGLQPIRYFGTEEWSGSYVFVVPSTSGANASFTREERIAKFTELSHFLRE 181 >UniRef50_Q0RRS7 DNA glycosylase, G/U mismatch-specific n=2 Tax=Frankia RepID=Q0RRS7_FRAAA Length = 196 Score = 178 bits (452), Expect = 6e-44, Method: Composition-based stats. Identities = 73/162 (45%), Positives = 95/162 (58%), Gaps = 1/162 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D+LAP V+ CGINP L S TGF F P+NR W V++ AGFT R+L P E L Sbjct: 30 VPDLLAPRTTVLLCGINPSLESGATGFHFGTPSNRLWPVLHLAGFTPRRLHPSETAELTS 89 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 +T LV+R T +A+E+ E+ AG +L +E P +A LG AY GF +R A Sbjct: 90 RGISITNLVNRSTARADEIGDDEVRAGVDRLTGLVERISPHWVAFLGLAAYRVGFGRRRA 149 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 Q G+Q T+G +W+LPNPSGL + L LV Y EL +A Sbjct: 150 QVGRQPETLGQAGVWLLPNPSGLNAHYQLPDLVRVYGELREA 191 >UniRef50_D2B5F3 G:T/U mismatch-specific DNA glycosylase-like protein n=9 Tax=Actinomycetales RepID=D2B5F3_STRRD Length = 208 Score = 177 bits (449), Expect = 1e-43, Method: Composition-based stats. Identities = 71/163 (43%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++D+L P L V+FCGINPGL SA TG FA P NRFW ++++GFT R L P E L Sbjct: 45 LKDVLGPSLDVLFCGINPGLYSAATGHHFARPGNRFWPALHRSGFTPRLLDPAEQDLLPS 104 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + G+T +V+R + QA E++ +E GG +L + +P+ LA+ G AY F Q A Sbjct: 105 FGLGITNVVERASAQAAELTPEEFREGGARLARLVAGTRPRVLAVAGVTAYRTAFGQPKA 164 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 G Q IG T +WVLPNPSGL + +LE + R L ++L Sbjct: 165 AVGPQATVIGDTAVWVLPNPSGLNAHWTLEGIAGEMRRLRESL 207 >UniRef50_C7IMH6 Uracil-DNA glycosylase superfamily n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IMH6_9CLOT Length = 184 Score = 176 bits (448), Expect = 2e-43, Method: Composition-based stats. Identities = 52/175 (29%), Positives = 89/175 (50%), Gaps = 9/175 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DILAPGL+VVF G NPGL SA ++H +NRFW+ +++AG T + +P++ + +L+ Sbjct: 10 LPDILAPGLKVVFIGYNPGLLSAKNRHHYSHKSNRFWRFLFEAGLTPVRFQPEDDRKILE 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG------ 115 G T +VDR T AN++ E+ G L ++ +P+ +G Y Sbjct: 70 LGYGSTNIVDRQTKAANDIGADEVIEGSANLYRLLDTLKPRVACYVGIGVYRAFASCILK 129 Query: 116 --FSQRGAQWGKQTLTIGSTQI-WVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 S+ G Q ++ + +V + SGL+ V + + + +L + L G Sbjct: 130 MSASRINVTPGLQLDSLQEATLDFVCYSTSGLNTVPFAEQCKCFADLKKLLDSIG 184 >UniRef50_A6SED0 Putative uncharacterized protein n=2 Tax=Sclerotiniaceae RepID=A6SED0_BOTFB Length = 425 Score = 175 bits (443), Expect = 6e-43, Method: Composition-based stats. Identities = 59/188 (31%), Positives = 101/188 (53%), Gaps = 21/188 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+++ L +F G+NPGL +A G +AHP+NRFWK++Y +G T R LK ++ L + Sbjct: 222 LPDVISSDLLCLFVGLNPGLLTAQVGHAYAHPSNRFWKILYSSGCTTRLLKAEDDVKLPE 281 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y G T +V+RPT E+SK+E+ G L EKI Y+P+A+A++GK +E + + Sbjct: 282 LYSLGNTNIVERPTRNGGELSKKEMDEGVSSLEEKIRIYRPEAVALVGKGIWESVWRVKK 341 Query: 121 --------AQWGKQTL-----------TIGSTQIWVLPNPSG-LSRVSLEKLVEAYRELD 160 ++G Q+ + G +++V + SG + +S + +REL Sbjct: 342 GKAMAKDEFRYGWQSEEENLGVVEGEDSWGGARLFVASSTSGLAASLSPAEKESIWRELG 401 Query: 161 QALVVRGR 168 + R + Sbjct: 402 SWIEQRRK 409 >UniRef50_A2QSA7 Function: In the DNA of higher eukaryotes n=48 Tax=Leotiomyceta RepID=A2QSA7_ASPNC Length = 345 Score = 175 bits (443), Expect = 7e-43, Method: Composition-based stats. Identities = 53/189 (28%), Positives = 100/189 (52%), Gaps = 21/189 (11%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + P L ++ G+NPG+ + TG+ +AHP+N FWK+++ +G T + P + L Sbjct: 138 LLVDTIPPNLTLLLVGVNPGVLTGVTGYAYAHPSNLFWKLLHWSGITPIRHPPSDTYKLP 197 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF--- 116 + Y G T +V+RPT A+ +SK+E+ AG L EK+ + +P+ + ++GK +E + Sbjct: 198 ELYNIGNTNIVERPTRDASMLSKKEMDAGVPVLEEKVREKRPEVVCLVGKSIWEAVWRVK 257 Query: 117 -----SQRGAQWGKQTLT-----------IGSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 + ++G Q L +++V SGL+ +SL++ E + EL Sbjct: 258 VGRGIRKEEFRYGWQDLEMNMGRVGGEGGWEGARVFVATTTSGLAAGMSLQEKREVWDEL 317 Query: 160 DQALVVRGR 168 + + R + Sbjct: 318 GRWVCERRK 326 >UniRef50_B8N438 Mismatch-specific thymine-DNA glycosylase, putative n=6 Tax=Eurotiomycetidae RepID=B8N438_ASPFN Length = 373 Score = 172 bits (437), Expect = 3e-42, Method: Composition-based stats. Identities = 52/184 (28%), Positives = 94/184 (51%), Gaps = 18/184 (9%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + P L ++ G+NPG+ + TG+ +AHP+N +WK+++ +G T + P + L Sbjct: 180 LLRDTIPPNLTLLLVGVNPGIMTGATGYVYAHPSNLYWKLLHWSGITAIRHPPSDTYRLP 239 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + Y G T +V+RPT A+ +SK E+ AG L EK+ QP+A+ ++GK +E + R Sbjct: 240 ELYNIGNTNIVERPTRDASMLSKAEMDAGVPVLEEKVAKQQPEAVCLVGKSIWEAVWRVR 299 Query: 120 G--------AQWGKQT--------LTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQA 162 ++G Q ++V SGL+ +S+ + + EL + Sbjct: 300 KGRAIRKEEFRYGWQDESENMGRSERWNGAPVFVATTTSGLAAGMSMAEKQAVWNELGKW 359 Query: 163 LVVR 166 + R Sbjct: 360 VNSR 363 >UniRef50_C6B6I6 Uracil-DNA glycosylase superfamily n=2 Tax=Rhizobium leguminosarum RepID=C6B6I6_RHILS Length = 163 Score = 170 bits (431), Expect = 2e-41, Method: Composition-based stats. Identities = 73/163 (44%), Positives = 98/163 (60%), Gaps = 1/163 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + +ILAPGL VVFCG+NP LS+ G F++P+NRFW+V++ AGFT R L+ E + LL Sbjct: 1 MSEILAPGLSVVFCGLNPALSAVRDGHNFSNPSNRFWRVLHLAGFTPRLLRADEERELLQ 60 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y CG+T V RPT A+E+ KQ+ + L KI P LA LGK AY + Sbjct: 61 YGCGLTSAVSRPTKSASELKKQDYRSAAPVLENKIRKIAPANLAFLGKAAYAAISLRADV 120 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 +WG+Q +W+LPNPSGL+R +L L E Y +L L Sbjct: 121 EWGRQPEVFAGADVWLLPNPSGLNRAFTLTTLTEHYLDLRLTL 163 >UniRef50_B2W4Q2 G/U mismatch-specific DNA glycosylase n=2 Tax=Pleosporineae RepID=B2W4Q2_PYRTR Length = 328 Score = 169 bits (429), Expect = 3e-41, Method: Composition-based stats. Identities = 56/195 (28%), Positives = 97/195 (49%), Gaps = 30/195 (15%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL- 60 + DIL P L VF G NPG+ +A G +AHP+N FWK+++ +G T+R+LKP+E + L Sbjct: 111 LVDILEPNLICVFVGTNPGVQTAIAGHAYAHPSNHFWKLLHSSGLTNRRLKPEEDRSLPA 170 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF---- 116 + G T +VDRP+ A E+SK+E AG L K ++P+A+ I+GK +E + Sbjct: 171 QFCMGNTNIVDRPSKDAAELSKEETAAGTASLEAKFRKFKPEAVCIVGKGIWEAIWCYRH 230 Query: 117 ----SQRGAQWGKQTLT--------------------IGSTQIWVLPNPSG-LSRVSLEK 151 S++ ++G Q ++++V + SG + + + Sbjct: 231 GKNPSKKDFKYGWQDEKHNMGKTPDDAGEELDGNGNVWKGSRVFVTTSTSGLAASLKPAE 290 Query: 152 LVEAYRELDQALVVR 166 ++ + + R Sbjct: 291 KEAIWKPFGEWVQKR 305 >UniRef50_Q9K3I5 G/U mismatch-specific DNA glycosylase n=3 Tax=Streptomyces RepID=Q9K3I5_STRCO Length = 160 Score = 169 bits (429), Expect = 3e-41, Method: Composition-based stats. Identities = 67/152 (44%), Positives = 95/152 (62%), Gaps = 1/152 (0%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 ++FCGINPGL +A TG FA P NRFW V++ +GFT R LKP E L Y G+T +V Sbjct: 1 MLFCGINPGLMTAATGHHFARPGNRFWPVLHLSGFTPRLLKPSEQDELPSYGLGITNVVA 60 Query: 72 RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIG 131 R + +A+E++ +E GGR L K+ +P LA++G AY F + A+ G Q T G Sbjct: 61 RASARADELTAEEYREGGRLLARKVARLRPGWLAVVGVTAYRAAFDEPKARVGPQERTFG 120 Query: 132 STQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 ST++WVLPNPSGL + + + + + Y L +A Sbjct: 121 STRVWVLPNPSGLNAHWTAQTMAQEYARLREA 152 >UniRef50_B8I864 Uracil-DNA glycosylase superfamily n=1 Tax=Clostridium cellulolyticum H10 RepID=B8I864_CLOCE Length = 183 Score = 169 bits (429), Expect = 3e-41, Method: Composition-based stats. Identities = 52/171 (30%), Positives = 91/171 (53%), Gaps = 9/171 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + DILA GL+VVF G NPGL SA ++H +NRFW+ +Y++G T + +P++ + +L+ Sbjct: 10 LPDILATGLKVVFIGYNPGLLSARNRHHYSHKSNRFWRFLYESGLTPSRFEPEDDRKILE 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ--- 118 + G T +VDRP+ ANE+ E+ G + +++++P+ +G Y S Sbjct: 70 FGYGSTNIVDRPSKAANEIGTDEVVEGSANIYRLLDNFKPRIACYVGIGVYRAYASSILR 129 Query: 119 -----RGAQWGKQTLTI-GSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 + G Q+ +I S +V + SGL+ V + + + L + L Sbjct: 130 VPASGINVKPGLQSASILESISDFVCYSTSGLNTVPFAEQRKCFENLKKLL 180 >UniRef50_Q47NL8 G/U mismatch-specific uracil-DNA glycosylase n=11 Tax=Bacteria RepID=Q47NL8_THEFY Length = 212 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. Identities = 74/169 (43%), Positives = 102/169 (60%), Gaps = 3/169 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D++ PGL ++FCGINPGL SA +G FA P NRFW ++ A T R L+P E LLD Sbjct: 21 VPDVIGPGLTILFCGINPGLYSAWSGHHFARPGNRFWPALHAASLTPRLLRPAEQHLLLD 80 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 GVT LVDR T +A+E++ EL AGG+ L K+ Y+P+ LA+LG AY + F++ A Sbjct: 81 LGLGVTNLVDRATARADELTAAELQAGGQALAAKLRAYRPRLLAVLGVTAYRKAFARPRA 140 Query: 122 QWGKQT--LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVRG 167 + G Q I +WVLPNPSGL + + + + E R + A + Sbjct: 141 RIGPQPPEDAIEGVPVWVLPNPSGLNAHWTTQGIAEELRRMCAAHGIEA 189 >UniRef50_B2A9Z4 Predicted CDS Pa_1_2290 n=2 Tax=Sordariales RepID=B2A9Z4_PODAN Length = 327 Score = 166 bits (421), Expect = 2e-40, Method: Composition-based stats. Identities = 56/183 (30%), Positives = 94/183 (51%), Gaps = 21/183 (11%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-Y 62 D +AP L ++F G+NPGL ++ TG +AHP N FW++++ +G T R PQE + L + Sbjct: 108 DAIAPNLILLFVGLNPGLLTSSTGHAYAHPTNLFWRLLHSSGITPRLCAPQEDRLLPSLF 167 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG-- 120 G+T +V RPT E+SK+E+ AG L EKI ++P+ + I+GK +E + R Sbjct: 168 SLGLTNIVPRPTRNGAELSKKEMDAGVAVLEEKIRKFKPEVVCIVGKSIWESVWRVRHGR 227 Query: 121 ------AQWGKQTL-----------TIGSTQIWVLPNPSG-LSRVSLEKLVEAYRELDQA 162 ++G Q +++V + SG + + + E + EL + Sbjct: 228 GIKKEEFRYGWQERGEDMGKVEGGGEWNGARVFVACSTSGLAATLKPREKEEIFGELGEW 287 Query: 163 LVV 165 +V Sbjct: 288 VVK 290 >UniRef50_D1CA47 Uracil-DNA glycosylase superfamily n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1CA47_SPHTD Length = 194 Score = 165 bits (418), Expect = 5e-40, Method: Composition-based stats. Identities = 55/182 (30%), Positives = 86/182 (47%), Gaps = 15/182 (8%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--------RQLK 52 ++ D LAP + +VF GINP SA G FA NRFW + Q+ ++ +L+ Sbjct: 9 VLPDYLAPKMPLVFIGINPSTYSAAQGHYFARRTNRFWPALSQSRLSEPIRAALGRDRLE 68 Query: 53 PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAY 112 P + LL + G T LV RP+ +A E+S+ + KL +++E YQP+ G Y Sbjct: 69 PTDDARLLAFGIGFTDLVKRPSARAAELSRADYRIWAPKLRQRLEHYQPRVACFHGFTGY 128 Query: 113 EQGFS------QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 + G Q ++G T+++V+PNPS + +L + Y L L Sbjct: 129 RAFVRDALGLTEVPRNLGPQPHSLGPTRLYVVPNPSPANAHFTLADQIAWYDRLADFLAE 188 Query: 166 RG 167 G Sbjct: 189 IG 190 >UniRef50_B9J7K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=Agrobacterium radiobacter K84 RepID=B9J7K0_AGRRK Length = 198 Score = 165 bits (418), Expect = 5e-40, Method: Composition-based stats. Identities = 69/165 (41%), Positives = 100/165 (60%), Gaps = 1/165 (0%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + +ILAP L VVFCG+NP L++ G F++ +NRFW+V++ AGFT R L+ +E + +L Sbjct: 28 LAEILAPRLSVVFCGLNPALTAHRDGHNFSNRSNRFWRVLHLAGFTPRLLRAEEEREMLR 87 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 Y G+T V RPT A+E++K++ L +KI P+ LA LGK AY Q Sbjct: 88 YGLGLTSAVSRPTKSASELTKRDYITAAPALEDKITKLAPRNLAFLGKAAYAAISLQAHV 147 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVV 165 +WG+Q +WVLPNPSGL+R L +L E YR++ A+ Sbjct: 148 EWGRQPDKFADASVWVLPNPSGLNRAFDLTRLTEHYRKVKLAVDR 192 >UniRef50_C9SR06 G/U mismatch-specific uracil DNA glycosylase n=4 Tax=Sordariomycetes RepID=C9SR06_VERA1 Length = 320 Score = 163 bits (414), Expect = 2e-39, Method: Composition-based stats. Identities = 57/188 (30%), Positives = 88/188 (46%), Gaps = 23/188 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL- 60 + D +AP L ++F G+NPG+ +A TG +AHP+N FWK++Y +G T R L E + L Sbjct: 108 LPDAIAPNLLILFIGLNPGIQTASTGHAYAHPSNLFWKLLYHSGITSRLLPASEDRTLPE 167 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR- 119 Y CG T +V RP+ E+ EL AG L KI ++P+A I+GK +E F R Sbjct: 168 RYACGFTNIVSRPSRNGAELRNAELDAGVAVLEAKIAKWRPEAACIVGKGIWEAVFRVRK 227 Query: 120 -------GAQWGKQT-------------LTIGSTQIWVLPNPSG-LSRVSLEKLVEAYRE 158 ++G Q +++V + SG + + + + E Sbjct: 228 GRGMKKGKFEYGWQEGENMGVVRDGEEGGPWEGARVFVATSTSGLAATLGPREKEVIWGE 287 Query: 159 LDQALVVR 166 L R Sbjct: 288 LGAWCTQR 295 >UniRef50_A9VA78 Predicted protein n=1 Tax=Monosiga brevicollis RepID=A9VA78_MONBE Length = 239 Score = 161 bits (408), Expect = 7e-39, Method: Composition-based stats. Identities = 48/167 (28%), Positives = 73/167 (43%), Gaps = 21/167 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+L P LRV+ GINPG+ SA G +A W+ G Sbjct: 50 LPDLLLPNLRVLIIGINPGVWSAARGHHYAAGVELGWQTDMSCG---------------Q 94 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS---- 117 Y G T V R T ++ K E+ G +L + I++YQP A GK E F Sbjct: 95 YGIGFTNSVARTTRGQMDLDKSEIEDGVARLRQLIQEYQPAIAAFNGKGVSEIFFKRQFS 154 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSG--LSRVSLEKLVEAYRELDQA 162 ++ G Q TIG T+++++P+ S +R + + +REL Sbjct: 155 RKVTTLGLQQETIGETRLYIMPSTSPRGATRPRWQDKLPFFRELAAL 201 >UniRef50_Q6C2V5 YALI0F04796p n=1 Tax=Yarrowia lipolytica RepID=Q6C2V5_YARLI Length = 302 Score = 161 bits (407), Expect = 9e-39, Method: Composition-based stats. Identities = 50/168 (29%), Positives = 88/168 (52%), Gaps = 14/168 (8%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKL 69 R+VF G NPG+ S+ G +AHP+N FWK + + L + L+D + G T L Sbjct: 97 RLVFVGFNPGIMSSEKGHYYAHPSNSFWKNLNKYVVIGGGLTCYDDSSLMDSHGIGFTDL 156 Query: 70 VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR---------- 119 V RPT ++S++E+ G + L+++I +P + I+G+ + F Sbjct: 157 VARPTRGCEDLSREEMALGAQILVDQISANKPALVCIVGRGIWATVFRAVFKHQGIKKSE 216 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVVRG 167 +G + +G+++++V+P+ SGL R +K +E +REL Q +G Sbjct: 217 MPHFGLMPVHMGTSRVFVVPSTSGLVR---DKRLELWRELGQLYRGKG 261 >UniRef50_B2IIR8 Uracil-DNA glycosylase superfamily n=2 Tax=Bacteria RepID=B2IIR8_BEII9 Length = 191 Score = 160 bits (405), Expect = 2e-38, Method: Composition-based stats. Identities = 58/178 (32%), Positives = 79/178 (44%), Gaps = 14/178 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--------RQLKP 53 + D+L GL V+F GINP L S G FA NRFW ++ + L+P Sbjct: 9 LPDLLDKGLSVLFVGINPSLYSVAQGHYFARKTNRFWPAFSRSVLSLAARENLGLEALEP 68 Query: 54 QEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE 113 L G T V RPT +A++VS E A L+EKIE Y+P+ G AY Sbjct: 69 MHDHLLPAQGFGFTDTVKRPTARASDVSPAEFAAAVTALVEKIEYYRPRIACFHGTMAYR 128 Query: 114 QGFS-----QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 + G QTL+IG T ++V+PNPS + + Y L L+ Sbjct: 129 PVHRALTETKTDPALGMQTLSIGPTHLYVVPNPSPANAHFTPADQTAWYDRLAGTLLE 186 >UniRef50_A7HV32 Uracil-DNA glycosylase superfamily n=3 Tax=Rhizobiales RepID=A7HV32_PARL1 Length = 176 Score = 159 bits (403), Expect = 3e-38, Method: Composition-based stats. Identities = 49/166 (29%), Positives = 77/166 (46%), Gaps = 6/166 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D+L PGL VVF G P SA G +AHP NRFW +++ G T R +P E L+ Sbjct: 12 LPDVLVPGLDVVFVGTAPSRRSAALGAYYAHPGNRFWPALFETGITPRLYQPHEYAELIH 71 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 G T + R + +E+ + ++I +P+A+A K+A Sbjct: 72 LGIGFTDMAKRASGMDHEIDPALFE--VERFEKEILRCRPRAVAFTSKKAASVWMGVPTG 129 Query: 122 QW--GKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 + G Q G +++VL +PSG +R S +E + L + L Sbjct: 130 RLLHGHQPRRPGFPEVFVLSSPSGAARRSWS--IEPWAGLARFLGR 173 >UniRef50_A6GLD3 G/U mismatch-specific DNA glycosylase n=1 Tax=Limnobacter sp. MED105 RepID=A6GLD3_9BURK Length = 161 Score = 159 bits (402), Expect = 4e-38, Method: Composition-based stats. Identities = 53/162 (32%), Positives = 87/162 (53%), Gaps = 5/162 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D+L L +VFCG PG SA G +A+ N FW+ +++ G T Q+ P++ Q + Sbjct: 3 ILPDLLQANLNIVFCGTAPGTVSAQRGAYYANAGNAFWRTLFEVGLTPHQVSPKDFQGIT 62 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 Y+ G+T L + + + K + G +L EKI + P+ LA K+A ++ Sbjct: 63 QYKLGLTDLAKQVYGADSSLKKSDF--GPSQLREKILHFNPKILAFTSKRAGQEFLG-CK 119 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 Q G Q IG+T ++VLP+PSGL+R +E ++EL Sbjct: 120 VQCGLQHEKIGNTVLFVLPSPSGLARSHW--RIEPWQELAAL 159 >UniRef50_C6H6H6 Uracil DNA glycosylase superfamily protein n=1 Tax=Ajellomyces capsulatus H143 RepID=C6H6H6_AJECH Length = 301 Score = 158 bits (400), Expect = 7e-38, Method: Composition-based stats. Identities = 46/169 (27%), Positives = 88/169 (52%), Gaps = 18/169 (10%) Query: 16 GINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD-YRCGVTKLVDRPT 74 G+NPG+ + TG +AHP+N +WK+++ +G TD + P + L + Y G T +V+RPT Sbjct: 115 GVNPGIMTGQTGHVYAHPSNLYWKLLHSSGITDYRHPPSDTYQLPELYSVGNTNIVERPT 174 Query: 75 VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG--------AQWGKQ 126 A+++S++E+ G L KI + +P+A+ ++GK +E + R ++G Q Sbjct: 175 RDASQLSRKEMDDGVPVLERKIREKRPEAVCLVGKSIWEAVWRVRRGRAIKKEEFRYGWQ 234 Query: 127 TL--------TIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVR 166 +++V SGL+ +++ + + EL + + R Sbjct: 235 DEGDRMGRTKEWKGARVFVATTTSGLAAGMTMAEKQAVWSELGEWVKRR 283 >UniRef50_C4Y7L8 Putative uncharacterized protein n=1 Tax=Clavispora lusitaniae ATCC 42720 RepID=C4Y7L8_CLAL4 Length = 427 Score = 157 bits (398), Expect = 1e-37, Method: Composition-based stats. Identities = 47/202 (23%), Positives = 89/202 (44%), Gaps = 38/202 (18%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA----------GFTDRQL 51 + +++ LR++F G NPG+ SA +AHP+N FWK+ Q+ G +D L Sbjct: 223 LAPMISDSLRLLFIGFNPGVQSAQMQHHYAHPSNLFWKLFNQSALLPQVAQAQGISDHPL 282 Query: 52 ------------KPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY 99 ++ L+D G T LV R T A+ +S E A +++ +I Sbjct: 283 VLELFQDGKGLASFRDDGRLVDVGIGFTDLVLRCTRAAHHLSPSEKTANVPRVMREIRSS 342 Query: 100 QPQALAILGKQAYEQGFSQ------RGAQWGKQTLTIGST---------QIWVLPNPSGL 144 + ++GK +E R +WG QT + Q++++P+ SGL Sbjct: 343 NASHVVVVGKGVWETFVRAFTGDKIRHFEWGLQTDATANAFYRQCGRRLQVYLVPSTSGL 402 Query: 145 -SRVSLEKLVEAYRELDQALVV 165 + ++ + + + E+ +++ Sbjct: 403 VASMTFGEKLALWVEIARSVSE 424 >UniRef50_C8NV44 G/U mismatch-specific DNA glycosylase n=2 Tax=Corynebacterium RepID=C8NV44_9CORY Length = 170 Score = 156 bits (396), Expect = 2e-37, Method: Composition-based stats. Identities = 61/168 (36%), Positives = 92/168 (54%), Gaps = 7/168 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD------RQLKPQE 55 + D + R++ GINPG S PFAHP NRFW + +AG T L Q+ Sbjct: 1 MHDRIPNPCRLLIVGINPGTLSEEVDAPFAHPGNRFWPALDKAGITPYRVYASEGLSEQD 60 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 A+ L + G T LV R T +A++++ +EL GG +L ++ ++P A+ G AY Sbjct: 61 AKMLAERGIGFTNLVARMTPKASDLATEELVEGGERLEAVVDKHRPGAVMFAGIGAYRGA 120 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 F +R AQ G Q+ I +WV+ NPSGL + ++E L +YRE+ +A Sbjct: 121 FGRRKAQRGLQSGMIAGVPVWVVGNPSGLNAHETVESLAASYREVWEA 168 >UniRef50_C5SKW3 Uracil-DNA glycosylase superfamily n=1 Tax=Asticcacaulis excentricus CB 48 RepID=C5SKW3_9CAUL Length = 203 Score = 156 bits (394), Expect = 3e-37, Method: Composition-based stats. Identities = 57/166 (34%), Positives = 78/166 (46%), Gaps = 6/166 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D+L PGL++VFCG G SA +AHP N FW+ +Y G T +L P + LLD Sbjct: 35 VPDLLRPGLKLVFCGTALGRVSAQKRAYYAHPGNFFWRTLYGVGLTPERLAPTDYPRLLD 94 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR-- 119 Y G+T L ++ + L KI Y PQ LA K A + Sbjct: 95 YGIGLTDLCKAHYGNDVDLPAE--AWDAEALTAKIAAYAPQHLAFTSKTAASVFLERPTG 152 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 G Q G T + VLP+PSG +R ++ A+R+L QA+ Sbjct: 153 KIALGLQAGKAGETSLRVLPSPSGQARRFWDE--AAWRDLAQAVTP 196 >UniRef50_O59825 G/U mismatch-specific uracil DNA glycosylase n=1 Tax=Schizosaccharomyces pombe RepID=TDG_SCHPO Length = 325 Score = 155 bits (391), Expect = 6e-37, Method: Composition-based stats. Identities = 44/176 (25%), Positives = 81/176 (46%), Gaps = 11/176 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--RQLKPQEAQHL 59 V D + + G+NPG++S+ G FA P+NRFWK++ ++ + + + L Sbjct: 142 VPDYICENPYAIIVGLNPGITSSLKGHAFASPSNRFWKMLNKSKLLEGNAEFTYLNDKDL 201 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAIL-GKQAYEQGFSQ 118 + G+T L RP+ ++ K+E+ G R L EK++ Y+PQ + GK +E+ + Sbjct: 202 PAHGLGITNLCARPSSSGADLRKEEMQDGARILYEKVKRYRPQVGLFISGKGIWEEMYKM 261 Query: 119 -------RGAQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVR 166 + +G Q G ++V + SG + S EK + + + Sbjct: 262 LTGKKLPKTFVFGWQPEKFGDANVFVGISSSGRAAGYSDEKKQNLWNLFAEEVNRH 317 >UniRef50_C8XAI4 Uracil-DNA glycosylase superfamily n=3 Tax=Actinomycetales RepID=C8XAI4_NAKMY Length = 193 Score = 155 bits (391), Expect = 7e-37, Method: Composition-based stats. Identities = 62/171 (36%), Positives = 94/171 (54%), Gaps = 7/171 (4%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL------KPQE 55 V+D++ PG+R++F GINPGL +A FA P NRFW + AG T ++ P + Sbjct: 19 VDDLIEPGVRLLFVGINPGLWTAAANAHFARPGNRFWPALQLAGITRERMDVSGGMAPAD 78 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 L+ G+T LV T +A+E++ +++ L ++ +P +A+LG AY Q Sbjct: 79 RAELVGAGIGITNLVPFATARADELTGEQIRQAAGTLTATVQRVRPAVVAVLGLTAYRQA 138 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 F + A+ G+Q +G Q+WVLPNPSGL + + L AYRE A V Sbjct: 139 FGRPKARPGRQPEDLGGAQLWVLPNPSGLNAHETAASLGVAYREPALAAGV 189 >UniRef50_Q7SDB2 Predicted protein n=2 Tax=Sordariaceae RepID=Q7SDB2_NEUCR Length = 396 Score = 154 bits (389), Expect = 1e-36, Method: Composition-based stats. Identities = 50/224 (22%), Positives = 91/224 (40%), Gaps = 62/224 (27%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL- 60 + D L+P L V+F G+NPG+S+A +AHP N FW++++ +G T P E + L Sbjct: 143 LPDALSPNLLVLFIGLNPGISTAIQQHAYAHPTNLFWRLLFSSGITPVLCAPSEDRSLPE 202 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ-- 118 + G+T +V RPT E++K E+ G L EK+ ++P+A+ ++GK +E + Sbjct: 203 RFSLGLTNIVGRPTRNGGELTKGEMDEGVAVLEEKVRVWKPEAVCVVGKGIWESVYRVRE 262 Query: 119 ---------------------------------RGAQWGKQTLT---------------- 129 + ++G Q + Sbjct: 263 RQRQQRHGQGQIGIEKDKVGLNKGRGKAVGLLPKDWKYGWQDESENMGVPGGDSGDEEEE 322 Query: 130 ---------IGSTQIWVLPNPSG-LSRVSLEKLVEAYRELDQAL 163 +++V + SG + V + +REL + + Sbjct: 323 MDTTTAGGGWKGARVFVATSTSGLAATVGRAEKERIWRELGEWV 366 >UniRef50_A7HGS2 Uracil-DNA glycosylase superfamily n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7HGS2_ANADF Length = 181 Score = 153 bits (386), Expect = 3e-36, Method: Composition-based stats. Identities = 51/167 (30%), Positives = 84/167 (50%), Gaps = 3/167 (1%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D +AP L ++F GINP L SA G FA N FW++++ +G T P++ L Sbjct: 11 LCDRVAPDLDILFVGINPSLRSAEVGHHFASKGNPFWRLLHASGLTAELHAPEDDVALPA 70 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF-SQRG 120 G+T + R T A E+ + EL GGR L +++ +P+ +A++G Y F R Sbjct: 71 RGLGITNVCSRATRSAAELGRDELREGGRALPDRVAALRPRIVALVGVSLYPHVFPGGRE 130 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLS-RVS-LEKLVEAYRELDQALVV 165 G + + +++VLPNPSGL+ + + +R L + Sbjct: 131 TGPGAKRARLAGARVFVLPNPSGLNQAYPGFQVKLVWFRRLARLAGR 177 >UniRef50_B2GKB6 G/U mismatch-specific DNA glycosylase n=6 Tax=Bacteria RepID=B2GKB6_KOCRD Length = 190 Score = 153 bits (386), Expect = 3e-36, Method: Composition-based stats. Identities = 59/159 (37%), Positives = 90/159 (56%), Gaps = 7/159 (4%) Query: 14 FCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ------LKPQEAQHLLDYRCGVT 67 F GINPGL +A TG FA P NRF+ +++AG TD + P + L G++ Sbjct: 28 FVGINPGLWTAATGAHFARPGNRFYPALHRAGITDTRIDAAAGYDPADLAQLTGRGIGIS 87 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 + R T +A+E++++EL G R+L + ++ P +A+LG AY + F + GA G+Q Sbjct: 88 NVCPRATARADELTREELREGARRLDRLVREHHPAVVAVLGITAYREAFERPGAATGRQD 147 Query: 128 LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 T ++V PNPSGL + SL+ L AY E+ +A V Sbjct: 148 APWPGTALFVAPNPSGLNAHSSLDDLAHAYAEIARAAGV 186 >UniRef50_A8P541 Putative uncharacterized protein n=1 Tax=Coprinopsis cinerea okayama7#130 RepID=A8P541_COPC7 Length = 365 Score = 152 bits (384), Expect = 4e-36, Method: Composition-based stats. Identities = 55/193 (28%), Positives = 85/193 (44%), Gaps = 29/193 (15%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V D L P L ++FCGINPG SA G FA P N FW ++++G T ++KPQE L D Sbjct: 151 VNDNLKPDLDIIFCGINPGQVSATLGHHFAGPTNHFWSCLHESGLTSTRMKPQEDFTLPD 210 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ---GFS 117 + G+T LV RPT +A+E+SK E A K+ Y+P+ + +G E + Sbjct: 211 RHSIGLTNLVGRPTAEASELSKAESLACVAPFFVKMHKYRPKIVCFVGLSIAETVSSYVA 270 Query: 118 QRGAQWGKQ------------------------TLTIGSTQIWVLPNPSGL-SRVSLEKL 152 + G Q + T + + + SG + Sbjct: 271 KNFLDPGLQKKGKAKVKAGPGLLSFKLTYESEGDKSRSETLFYAVSSTSGRVVKYKKSDK 330 Query: 153 VEAYRELDQALVV 165 V+ ++ L+ L+ Sbjct: 331 VQQFKGLNDLLLE 343 >UniRef50_B7GT25 Uracil-DNA glycosylase superfamily n=6 Tax=Actinobacteridae RepID=B7GT25_BIFLI Length = 212 Score = 151 bits (383), Expect = 7e-36, Method: Composition-based stats. Identities = 61/171 (35%), Positives = 94/171 (54%), Gaps = 13/171 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKP------QE 55 V+DI P +++ GINPGL +A PFA+P NRFW ++ +G T Q+ + Sbjct: 30 VDDIRCPHPKLLIVGINPGLWTAAVNAPFAYPGNRFWPSLFASGLTPWQVDASYGLRLHD 89 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 + ++D G+T LV+R T +A+E++ EL G +L E P+A+AILG AY Q Sbjct: 90 ERMMIDRGIGLTNLVNRATAKASELNTAELRHGADRLRELCARVHPRAIAILGITAYRQA 149 Query: 116 FSQRGAQWGKQT------LTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 F+ + A G Q G Q+WV+P PSGL + ++ LV+ +R + Sbjct: 150 FADKSAALGLQDLTAEETDRFGGAQLWVIPQPSGLNAHATMPVLVDWWRRV 200 >UniRef50_B3QI54 Uracil-DNA glycosylase superfamily n=3 Tax=Rhodopseudomonas palustris RepID=B3QI54_RHOPT Length = 177 Score = 147 bits (371), Expect = 1e-34, Method: Composition-based stats. Identities = 46/165 (27%), Positives = 82/165 (49%), Gaps = 3/165 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D+L GL+VVFCG G +S +AH N+FW V+ + G T R+L P E + + Sbjct: 15 ILPDVLQRGLKVVFCGTAAGRASEKAQSYYAHGRNKFWSVLSETGLTTRKLAPHEFREVT 74 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 ++ G+T L ++V + L KIE+++P LA +A ++ +R Sbjct: 75 NFGIGLTDLCKDVAGSDHDVRPR--PEHRVVLRRKIEEHRPDFLAFTSLEAAKRFVGRR- 131 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 G+ +IG+T I+VLP+ S ++ + + E + + Sbjct: 132 VGLGRYEESIGTTSIYVLPSTSPMADWNWMSNKIHWWEFAELVRA 176 >UniRef50_Q1IXM1 Uracil-DNA glycosylase superfamily n=3 Tax=Deinococcus RepID=Q1IXM1_DEIGD Length = 195 Score = 147 bits (371), Expect = 2e-34, Method: Composition-based stats. Identities = 45/167 (26%), Positives = 75/167 (44%), Gaps = 6/167 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +V D+L GL +V G P SA +A+P N+FW+V+ + G T RQL P+E L Sbjct: 13 LVPDVLQEGLTLVLVGTAPSRISARAKAYYANPENKFWRVLAETGLTPRQLAPREYPILP 72 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR- 119 Y G+T + R + + + +L EKI Y+P +A K+ + Sbjct: 73 QYGIGLTDVAKRHSGVDAALPVE--AWAPHELREKIRRYRPAIVAFTSKRGASETLGVPT 130 Query: 120 -GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 +G Q + ++WVLP+ S L + + ++ L + Sbjct: 131 GRLPYGPQVGRLEGAEVWVLPSTSPLGHTHFQ--LGPWQALADRVRA 175 >UniRef50_C8RZ13 Uracil-DNA glycosylase superfamily n=1 Tax=Rhodobacter sp. SW2 RepID=C8RZ13_9RHOB Length = 165 Score = 145 bits (367), Expect = 4e-34, Method: Composition-based stats. Identities = 52/164 (31%), Positives = 75/164 (45%), Gaps = 6/164 (3%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 M+ DI+APGL V+FCG PG SA G +A P NRFW ++ + G T R+L P E L Sbjct: 1 MLPDIVAPGLDVLFCGTAPGAMSAARGHYYARPGNRFWPLLAEIGLTPRRLHPDEDHLLP 60 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + G+T L + Q E+ + +L + +P+ALA A Sbjct: 61 GFGLGLTDLAQGVSGQDAEIPRHAYAPD--RLQALVAAIRPRALAFTSLTAARLALGVPR 118 Query: 121 AQWGKQTL--TIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 G G T++W LP+PSGL+R + + L Sbjct: 119 LAAGALPADPRYGHTRLWALPSPSGLARSHFSA--DPWLALAAW 160 >UniRef50_A8Q5A6 Putative uncharacterized protein n=1 Tax=Malassezia globosa CBS 7966 RepID=A8Q5A6_MALGO Length = 272 Score = 144 bits (363), Expect = 1e-33, Method: Composition-based stats. Identities = 44/124 (35%), Positives = 66/124 (53%), Gaps = 3/124 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA---QH 58 + D + P L VVFCGINPG S +AH +N F+ I+ AG T +L P + Sbjct: 28 LPDYIKPNLDVVFCGINPGTESGAQRRHYAHRSNHFYACIHMAGLTSERLTPSQDMTFPT 87 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 L YR G+T L RPT ++ ++++ EL G +L+ KI Y+P+ + +GKQ + Sbjct: 88 LQPYRLGLTNLAHRPTARSEQLTRIELERGVPELLMKIARYRPRVICFVGKQIADVFMRV 147 Query: 119 RGAQ 122 A Sbjct: 148 VRAS 151 >UniRef50_A3LQ36 Predicted protein n=1 Tax=Pichia stipitis RepID=A3LQ36_PICST Length = 270 Score = 143 bits (360), Expect = 3e-33, Method: Composition-based stats. Identities = 46/224 (20%), Positives = 81/224 (36%), Gaps = 60/224 (26%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFT-------------- 47 V L L ++F G NPG+ S+ +AH +N FWK+ Q+G Sbjct: 46 VRPSLVENLTLLFIGFNPGMESSLQQHHYAHFSNLFWKLFNQSGLLLQCLGVLDPQYLSH 105 Query: 48 ----DRQLK-----------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKL 92 D L+ P+ L+ Y+ G T L+ R T A E+ E A +L Sbjct: 106 NYDNDELLQVLVKDGTTYAKPEHDYELIKYKIGFTDLILRCTRTAQELPMAEKLANVPRL 165 Query: 93 IEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK--------------------------- 125 I++ + + +GK +E + G Sbjct: 166 IDEFNMSSSKHIVFIGKGIWEVIVKYVEVELGIKKVKLSKETFMWGLQDSQNSRLYALVL 225 Query: 126 ---QTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRELDQALVV 165 Q+ ++++V PN SGL + E+ ++ +++L + Sbjct: 226 KKFQSKISADSKVFVFPNTSGLVGSLKYEEKLKLWQDLADVISK 269 >UniRef50_B0E0R1 Predicted protein n=7 Tax=Laccaria bicolor S238N-H82 RepID=B0E0R1_LACBS Length = 436 Score = 142 bits (358), Expect = 5e-33, Method: Composition-based stats. Identities = 57/201 (28%), Positives = 85/201 (42%), Gaps = 38/201 (18%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + + L GL V FCGINPG +SA G F P N FW +Y++GFT R L+PQE L + Sbjct: 214 LRNHLRSGLEV-FCGINPGQTSAEIGHHFGGPTNHFWSCLYESGFTTRLLRPQEDFILPE 272 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF---- 116 + G+T LVDRPT + +E+SK E AG L+ KI ++P+ + +G + Sbjct: 273 QFSIGMTNLVDRPTAEQSELSKAERLAGVPALLAKIATHRPKIVCFVGLGIADIVKSKVM 332 Query: 117 ----------------------SQRGAQWGKQTLTIG---------STQIWVLPNPSGL- 144 + A G Q + T + + + SG Sbjct: 333 PVIVRLSSALPPLLTPRQVGKGRKTKAAIGIQPYKLQYQGEQACPKETLFYAVSSTSGRV 392 Query: 145 SRVSLEKLVEAYRELDQALVV 165 + V ++ L L Sbjct: 393 VTYQKKDKVNQFKALKSVLES 413 >UniRef50_C6XAZ6 G/U mismatch-specific uracil-DNA glycosylase n=4 Tax=Bacteria RepID=C6XAZ6_METSD Length = 179 Score = 139 bits (351), Expect = 3e-32, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 55/166 (33%), Gaps = 4/166 (2%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ-AGFTDRQLKPQEAQHLLD 61 + RV+ G PG++S + HP N FW ++ + G + L Sbjct: 8 PAVAETDARVLILGSMPGIASLTQQQYYGHPHNLFWPIMGELVGAHPHLPYAERLAQLCR 67 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ--PQALAILGKQAYEQGFSQR 119 + + ++ + + S + + + + G++A + Sbjct: 68 HHIALWDVMHQCFRPGSLDSAIDEGSIVANDFAGFFATHAQIRHVFFNGQKAAQAFQRHV 127 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 + S + LP+ S + + + + ++ + QAL Sbjct: 128 IKAEKLNKDALASLRFSTLPSTSPANASIPQKTKRQQWQSILQALA 173 >UniRef50_C9RN39 Mug G:T/U mismatch-specific DNA glycosylase n=1 Tax=Fibrobacter succinogenes subsp. succinogenes S85 RepID=C9RN39_FIBSS Length = 179 Score = 139 bits (350), Expect = 4e-32, Method: Composition-based stats. Identities = 37/165 (22%), Positives = 64/165 (38%), Gaps = 13/165 (7%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ-HLLD 61 + V+ G P S GF + HP NRFWKV+ E + LL+ Sbjct: 23 PALFDEKSEVLLLGSIPSPKSREQGFYYGHPQNRFWKVLASVLNEPTPETIDEKKAMLLE 82 Query: 62 YRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + + ++D T + A++ S +++ + I + + + G A++ Sbjct: 83 HHIALWDVLDSCTIIGASDTSIEDVVPN--DIASLIAKTKIKRIFCTGATAHKLYEKYCE 140 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRELDQALV 164 G + LP+ S + V EKLVEAY E+ + Sbjct: 141 KATG--------IKAVKLPSTSPANCAVKFEKLVEAYGEIRRETT 177 >UniRef50_Q82UD2 Putative uncharacterized protein n=1 Tax=Nitrosomonas europaea RepID=Q82UD2_NITEU Length = 167 Score = 138 bits (347), Expect = 1e-31, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 56/163 (34%), Gaps = 8/163 (4%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-GFTDRQLKPQEAQHLLD 61 I +PG V+ G PG S +AHP N FW++ + GF P L D Sbjct: 7 PPIASPGSWVLILGTMPGKVSLREQQYYAHPQNLFWRITAEILGFDATSAYPLRVSSLKD 66 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQR 119 + + ++ T +++ + H + P + + G +A Sbjct: 67 HGVALWDVLQSCTRESSLDADIVAHTIVPNDFGRFFTACPDIRRVCFNGAKAAALYAR-- 124 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQ 161 + Q + + LP+ S + + + A+ + Sbjct: 125 HVKPFLQDA--PTVEYVQLPSTSPANAAIPRADKLRAWSVIKH 165 >UniRef50_C9KQM1 Putative G/U mismatch-specific DNA glycosylase n=2 Tax=Veillonellaceae RepID=C9KQM1_9FIRM Length = 174 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 60/169 (35%), Gaps = 12/169 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLL 60 + + R + G PG+ S +AHP NRFW ++ + G + LL Sbjct: 6 LRPNIDKDCRALILGSMPGVKSLEAQQYYAHPQNRFWPMMAHLLGEALPSAYEERLAMLL 65 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQR 119 + + + + + + + G +++Y +A+ G ++ + Sbjct: 66 RHHIALWDSIGSCEREGS-LDSAIRNEQGNDFAALLQEYPGIRAIYFNGGKSAQAFKRYN 124 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVRG 167 A + +P+ S +R L L A+R +AL G Sbjct: 125 KALLAR-----PDIAFHAMPSTSPANARWRLPMLEAAWR---KALTESG 165 >UniRef50_A5EPR0 G/U mismatch-specific uracil-DNA glycosylase n=1 Tax=Bradyrhizobium sp. BTAi1 RepID=A5EPR0_BRASB Length = 166 Score = 136 bits (343), Expect = 3e-31, Method: Composition-based stats. Identities = 45/166 (27%), Positives = 74/166 (44%), Gaps = 6/166 (3%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 + D L PGLR+VF G SA TG +AHP NRFW ++ G T R +P E LL Sbjct: 4 LPDQLKPGLRLVFVGTAASARSAATGHYYAHPGNRFWPTLFAVGLTPRLYRPHEFPALLS 63 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG- 120 G T L + + K + KI ++P+ +A K+A + + Sbjct: 64 LGIGFTDLCKQGAGMDHVALKAGVD--VAGFAAKIRRHRPRTIAFTSKKAASLFYRKPTG 121 Query: 121 -AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 G+Q+ +VL +PSG + + ++ +++L + + Sbjct: 122 AIALGRQSEQDDFPTAFVLASPSGAASGAWT--LQPWQDLAEHVAS 165 >UniRef50_A9KMN2 G:T/U mismatch-specific DNA glycosylase n=2 Tax=Clostridiales RepID=A9KMN2_CLOPH Length = 173 Score = 135 bits (341), Expect = 5e-31, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 65/168 (38%), Gaps = 13/168 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ-HLLDY 62 + +++ G P + S F + HP NRFW+++ + + LL+ Sbjct: 13 PVFDKNSKILILGSFPSVKSREQEFYYGHPMNRFWRMLSSVLKKEIPSDSLRKREFLLEN 72 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRGA 121 + ++D + + S + + +I P + + GK A + Sbjct: 73 HIALWDVIDSCDIIGS--SDSSIKNVVPTDLSRILAISPISHIYVNGKTAKKLYDR---- 126 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYRELDQALVVRGR 168 + ++ + LP+ S + + SLEKL+E ++ + + L R Sbjct: 127 ----YSESVTGLKAICLPSTSPANAMFSLEKLIEEWKVIIEPLGDNYR 170 >UniRef50_Q1GYD6 G/U mismatch-specific uracil-DNA glycosylase n=1 Tax=Methylobacillus flagellatus KT RepID=Q1GYD6_METFK Length = 174 Score = 135 bits (340), Expect = 6e-31, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 55/165 (33%), Gaps = 8/165 (4%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLD 61 + PG RV+ G PG S +AHP N FW I + + + L Sbjct: 12 PPAITPGCRVLILGSMPGKRSLEMQAYYAHPQNLFWPFITHLFNIPPESGYQERLRLLNQ 71 Query: 62 YRCGVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQR 119 G+ ++ ++ + +E + + + G +A + Sbjct: 72 QHIGIWDVLKECVRTSSLDGDIEETSIVANDFSALFMQFPHIRHIFFNGSKAEQAFRRHV 131 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 K+ + LP+ S + + E + ++++ A+ Sbjct: 132 L----KRQIIPAHLHYHKLPSTSPANASIKKEHKLAQWQQIISAI 172 >UniRef50_B7Q2I4 G/T mismatch-specific thymine DNA glycosylase, putative n=1 Tax=Ixodes scapularis RepID=B7Q2I4_IXOSC Length = 236 Score = 134 bits (339), Expect = 8e-31, Method: Composition-based stats. Identities = 30/128 (23%), Positives = 56/128 (43%), Gaps = 4/128 (3%) Query: 39 KVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIED 98 + ++ AG + + L+ Y G T +V R T + ++S++E+ G L++K++ Sbjct: 24 RCLFLAGLIPEPMTAVDDFKLIQYGIGFTNIVARTTRSSADLSRKEIKEGAEILMDKLKI 83 Query: 99 YQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRV----SLEKLVE 154 +QP+ GK YE + +G+Q I T + V PS +R + Sbjct: 84 FQPRIAVFNGKGIYEIFSGNKNFVFGRQPDPIEGTNVSVFVMPSSSARCAQLPRAVDKLP 143 Query: 155 AYRELDQA 162 Y L + Sbjct: 144 FYVALKKL 151 >UniRef50_B7ATH8 Putative uncharacterized protein n=1 Tax=Bacteroides pectinophilus ATCC 43243 RepID=B7ATH8_9BACE Length = 180 Score = 134 bits (339), Expect = 8e-31, Method: Composition-based stats. Identities = 32/171 (18%), Positives = 63/171 (36%), Gaps = 12/171 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHL-LDY 62 ++ R++ G P + S F + HP NRFW+++ D ++ ++L L Sbjct: 15 PVVDKDSRILILGSFPSVKSRENMFYYGHPQNRFWRMLADIVKADVPQTIEDKKNLILSN 74 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQ 122 + + + A+ H + ++ Y +A+ G AY G Sbjct: 75 GFALWDTLAMCEIHAS-ADSSIRHEVPNDIPGLVKQYGIEAIMFNGNAAYRYFEKYYG-- 131 Query: 123 WGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYR----ELDQALVVRGR 168 + + LP+ S + S E+LV + EL + ++ R Sbjct: 132 ---KDDGLEGIIKKALPSTSPANAACSYERLVGEWGSAVNELKEKILKEKR 179 >UniRef50_C3X2Y9 Mug G:T/U mismatch-specific DNA glycosylase n=1 Tax=Oxalobacter formigenes HOxBLS RepID=C3X2Y9_OXAFO Length = 161 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 62/167 (37%), Gaps = 21/167 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ--EAQHL 59 + + +P + + G PG S G +AHP N+FW+++ + L P + L Sbjct: 9 LPPLFSPHIHTMILGSFPGRESLAAGQYYAHPRNQFWRLLSAVLGEELVLLPYEARLERL 68 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFS 117 L + G+ ++ + S+ + ++++ P + GK A Sbjct: 69 LRHGIGLWDVITMCERIGSLDSR--IKNAVPNDFKRLKKDCPNLEKICFNGKVA------ 120 Query: 118 QRGAQWGKQTLTI--GSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQ 161 G + + VLP+ S + S E+ + ++ + Sbjct: 121 ------GTYAERLAAAGYKTIVLPSSSPAYASRSFEEKRVLWEKILE 161 >UniRef50_Q2FQM0 Putative uncharacterized protein n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FQM0_METHJ Length = 159 Score = 134 bits (338), Expect = 1e-30, Method: Composition-based stats. Identities = 37/163 (22%), Positives = 66/163 (40%), Gaps = 10/163 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDR-QLKPQEAQHLL 60 + +L+ RV+ G P + S G +AHP N FWK++ Q LL Sbjct: 6 LSPLLSSDCRVLILGSYPSVLSIKAGEYYAHPRNMFWKIMEIILNIPHNQTYTTRISLLL 65 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 ++ G+ + + + + +L I ++Q + L +A E+GF + Sbjct: 66 NHGIGLWDVYAACSREKS-ADAHIKDPVPNELKTLILNHQDIRIIFLNGRAAEKGFRKFF 124 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 QT LP+ S V LE+ ++ +RE+ + L Sbjct: 125 PDISLQT--------IYLPSSSPAHAVHLEEKIKRWREITKFL 159 >UniRef50_A6TMT4 Putative uncharacterized protein n=1 Tax=Alkaliphilus metalliredigens QYMF RepID=A6TMT4_ALKMQ Length = 164 Score = 133 bits (335), Expect = 2e-30, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 60/162 (37%), Gaps = 12/162 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD-RQLKPQEAQHLLDY 62 I+ ++ G P + S G + +P N+FWK+IY+ ++ ++ Q + ++ Sbjct: 9 PIIDANSHILILGSIPSVQSLQEGKYYGNPRNQFWKIIYEVLGQPFQEDYGKKVQFIKEH 68 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRG 120 + +++ + + S + P+ + G +A++ + G Sbjct: 69 GIAIWDVIESCHREGSLDSN--IKEERPNDFNTFLKSYPRIKYVLFNGTKAHDTFRRKVG 126 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 + LP+ S + + +E ++ L + Sbjct: 127 FSF-------EDKIFHKLPSSSPAHTMKIGNKIEEWKVLLKY 161 >UniRef50_B1HZQ2 Putative uncharacterized protein n=2 Tax=Bacillaceae RepID=B1HZQ2_LYSSC Length = 173 Score = 132 bits (333), Expect = 4e-30, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 12/170 (7%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ-LKPQEAQHL 59 ++ I+ +V+ G PG S + +P N FW +I Q + + L Sbjct: 11 VLLPIVDSSTKVLIVGSMPGKQSLVKQQYYGNPRNHFWPIIGQLLQVEVPEDYCERIALL 70 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQ 118 G+ ++ + + + + E Y + + G +A++ Sbjct: 71 KKREIGLWDTIEACEREGS-LDAAIRNEKPNDFQTLFETYPNIKLVLFNGAKAFDVFKKH 129 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL---SRVSLEKLVEAYRELDQALVV 165 G + +P+ S + + S ++ VE +R + L Sbjct: 130 IGL------ELLAGRAYKKMPSTSPIPGKNIKSFDEKVEEWRIILSYLSQ 173 >UniRef50_C4LCA1 T/U mismatch-specific DNA glycosylase n=10 Tax=Bacteria RepID=C4LCA1_TOLAT Length = 167 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 53/162 (32%), Gaps = 9/162 (5%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ-AGFTDRQLKPQEAQHLLDY 62 I R++ G PG +S G +AH N FW V+ + F + L Sbjct: 9 PIADENARILILGSMPGQASLDAGQYYAHKRNLFWPVMAELLQFDPSAGYEERTAALRAA 68 Query: 63 RCGVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDYQ-PQALAILGKQAYEQGFSQRG 120 + ++ + + + + + E ++ + + G++A Sbjct: 69 GIALWDVLQSCKREGSLDANIEHATLTVNNFAEFFRQHRQIRHVFFNGEKAASCFHRYV- 127 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQ 161 + LP+ S + SL ++A++ + Sbjct: 128 ----AKADYPADLICQRLPSTSPANASQSLAFKLQAWQRIMD 165 >UniRef50_B9TDK4 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TDK4_RICCO Length = 198 Score = 132 bits (332), Expect = 5e-30, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 12/163 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ-EAQHLLDY 62 + P R++ G PG++S +AHP N FW+++ Q Q Q D Sbjct: 33 PVANPDARILILGTLPGIASLRQQQYYAHPQNAFWRILEQLFNIPAQAAYHVRCQSAQDQ 92 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRG 120 + + + S + ++ I ++ P+ +A+ G+ A + Sbjct: 93 GLAIWDVCHSAQRMGSLDSNIQSNSIVANHIHQLLADCPRIRLIALNGQAAAQLFKKHIR 152 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 + +LP+ S +R+ E+ + + L QA Sbjct: 153 PA--------PEIETVILPSTSPANARLRFEEKLAQWAILKQA 187 >UniRef50_Q31G15 Uracil DNA glycosylase superfamily protein n=1 Tax=Thiomicrospira crunogena XCL-2 RepID=Q31G15_THICR Length = 168 Score = 132 bits (332), Expect = 6e-30, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 57/165 (34%), Gaps = 8/165 (4%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 I+ +V+ G P + S F +AHP N FW ++ + L+ Sbjct: 9 PIVPTQPKVLILGTAPSVVSLEKSFYYAHPRNAFWPILADYFNQPVDSVEAKKALCLEND 68 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRGAQ 122 + ++ Q + + + Y +A+ GK A + F ++ + Sbjct: 69 ILLWDVLSACERQGS-LDSAIKQPEANDFEALFQTYPDLKAVLFNGK-AAQNLFEKQVIK 126 Query: 123 WGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAY-RELDQALVV 165 + + LP+ S +R++ E + + L Q L Sbjct: 127 KQSLPEDL---MFYSLPSTSPANARLTFENKRVFWHKTLQQTLTK 168 >UniRef50_A5DZA6 Putative uncharacterized protein n=2 Tax=Saccharomycetales RepID=A5DZA6_LODEL Length = 312 Score = 131 bits (331), Expect = 6e-30, Method: Composition-based stats. Identities = 46/228 (20%), Positives = 81/228 (35%), Gaps = 66/228 (28%) Query: 2 VEDI---LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------ 52 + D+ L L+++F G NPG+ S+ +AH +N FWK+ QA R ++ Sbjct: 85 LPDLRHSLNSNLKLLFVGYNPGVQSSIQQHHYAHHSNLFWKLFNQANVLQRVVQKRKKKE 144 Query: 53 ---------------------------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQEL 85 L+ Y G + L R TV+A E++ +E Sbjct: 145 NGMHQKVFDDDDDDDKYLKTLLSTGCSAIHDFDLIKYDIGFSDLSLRCTVRAEELTSEEK 204 Query: 86 HAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK-------------------- 125 A +LI +I + + + ++GK +E AQ G Sbjct: 205 IANIPRLIREINESNAENVVLIGKGIWEVIVKFYKAQLGIKFQLNKDNFQWGQIAESKDK 264 Query: 126 ---------QTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 T I+ P+ SGL + + + + ++E+ L Sbjct: 265 QYNQIIEHVYAQLPKHTTIYAFPSTSGLVASMKFSEKLRLWQEMANNL 312 >UniRef50_Q8CUP2 Hypothetical conserved protein n=1 Tax=Oceanobacillus iheyensis RepID=Q8CUP2_OCEIH Length = 167 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 67/165 (40%), Gaps = 14/165 (8%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--LLD 61 ++ RV+ G PG+ S + +P N FW +++ ++++ + + L Sbjct: 9 PVIPDDPRVLIVGSMPGVKSLAEQQYYGNPRNHFWSIMFTL-LEEKRVDNYQDKLSMLKR 67 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ-PQALAILGKQAYEQGFSQRG 120 + + + + + + A +I ++DY + + G +AY+ ++ Sbjct: 68 HGIALWDTIGSCYREGS-LDVNIQDAEPNDIIGLVQDYPTIKLIGCNGTKAYDIF--RKN 124 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL---SRVSLEKLVEAYRELDQA 162 + + + LP+ S + +LE+ +EA++ + + Sbjct: 125 FSL----ESFPNVDVVKLPSTSPIPGKYNKNLEEKIEAWKIIIDS 165 >UniRef50_Q6BL09 DEHA2F17270p n=2 Tax=Saccharomycetaceae RepID=Q6BL09_DEBHA Length = 275 Score = 131 bits (330), Expect = 8e-30, Method: Composition-based stats. Identities = 45/226 (19%), Positives = 77/226 (34%), Gaps = 63/226 (27%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK--------- 52 ++ L L+V+F G NPGL S+ +AH N FWK+ ++ L+ Sbjct: 48 LQPSLDDRLKVLFIGFNPGLQSSIQQHHYAHFTNLFWKLFNESKLLINVLQSLNISVGDA 107 Query: 53 ------------------------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAG 88 P ++ Y+ G T LV R T A E++ E Sbjct: 108 IKEDPLLKELIQVDSDDKYKTFVKPVNDFEIVKYKIGFTDLVLRCTKTAQELNLSEKLEN 167 Query: 89 GRKLIEKIEDYQPQALAILGKQAYEQGFS-------------QRGAQWGKQ--------- 126 +L + + + +GK +E + QWGKQ Sbjct: 168 VPRLFNEFKQSGSTFIVFVGKGIWEIIVKYITNLLKIKVKLTKGNFQWGKQILGEDEHYN 227 Query: 127 -------TLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALV 164 + I+V PN SGL + ++ + + +L ++ Sbjct: 228 MILEKLNEILGCKHNIYVFPNTSGLVTSLKYDEKLHLWNDLSHEIL 273 >UniRef50_C6QE22 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Hyphomicrobium denitrificans ATCC 51888 RepID=C6QE22_9RHIZ Length = 178 Score = 131 bits (329), Expect = 1e-29, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 61/169 (36%), Gaps = 7/169 (4%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ-EAQHLLD 61 I+ ++ G PG S + HP N FW +I L Sbjct: 11 PPIVTADAHLLILGTLPGEESLRLQQYYGHPRNHFWPLIAAIFGEAVPSDYSARIALLKR 70 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRG 120 + C + +++ + +A + DY + +A G++A F + Sbjct: 71 HGCALWDVLESAERIGS-SDAAIRNATVNPFAAFLADYPNIRTIAFNGQKA-RDLFHRHV 128 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYR-ELDQALVVRGR 168 + G + + LP+ S L SLE+ + +R +L ++L + R Sbjct: 129 VKPGHVPEQVC--TMIDLPSSSPLYTKSLEEKLGIWRAKLAESLAAQKR 175 >UniRef50_C6X5D8 G:T/U mismatch-specific uracil/thymine DNA-glycosylase n=2 Tax=Flavobacteriaceae RepID=C6X5D8_FLAB3 Length = 162 Score = 129 bits (326), Expect = 3e-29, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 66/165 (40%), Gaps = 15/165 (9%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA---QHL 59 I++ R++ G PG+ S +AHP N+FWK+++ D + L Sbjct: 9 PPIISDTSRILILGSAPGIKSLQMQQYYAHPQNQFWKILF--HLFDEEYAVDYDIRKALL 66 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQ 118 + ++D + + + + +++ + D+ A+ G+++ ++ Sbjct: 67 FKNHIALWDVIDSCEREGS-SDAKIRNEEHNDVLQLVTDFPNITAIFSNGQKSSKEARKM 125 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 GA+ VLP+ S L + K +EA++ L L Sbjct: 126 LGAE--------SQIPFHVLPSTSPLHTIPFNKKLEAWKILKHYL 162 >UniRef50_Q7VJA2 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VJA2_HELHP Length = 188 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 37/168 (22%), Positives = 70/168 (41%), Gaps = 16/168 (9%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE----AQHL 59 + RV+ G P + S F +AH NRFW ++ + ++ Q+ Q L Sbjct: 29 PVFDTYSRVLILGSFPSVISRDEQFYYAHSHNRFWHILATLFTPEIDIRAQDRDAKKQFL 88 Query: 60 LDYRCGVTKL---VDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF 116 L + + + D + +S + + L + A+ G++AYE Sbjct: 89 LTHHIALWDIVRECDICNSSDSTLSNAKPNDISVILSS----AKINAIFCNGRKAYELFI 144 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 +G Q ++++LP+ S +R SLEKLV+++R + + Sbjct: 145 R----FFGSQMCETQEIKVFLLPSSSPANARYSLEKLVDSWRVVLNYV 188 >UniRef50_B9WEN4 G/U mismatch-specific uracil DNA glycosylase, putative (Uracil mismatch repair protein, putative) (Dna repair protein, putative) n=2 Tax=Candida RepID=B9WEN4_CANDC Length = 292 Score = 129 bits (325), Expect = 3e-29, Method: Composition-based stats. Identities = 44/240 (18%), Positives = 86/240 (35%), Gaps = 78/240 (32%) Query: 2 VEDI---LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK------ 52 + D+ L L+V+F G NPG SA +AH +N FWK+ Q+ + + Sbjct: 53 LPDLKPSLNDHLQVLFVGFNPGTESAIQQHHYAHHSNLFWKLFNQSQLLHKVISKNNQQE 112 Query: 53 -----------------------PQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGG 89 L+ Y G + LV R T +A++++ E Sbjct: 113 EQQIVLDDSNDQFLNHLLENGCNATHDFKLIKYNIGFSDLVLRSTARADQLTISEKLENV 172 Query: 90 RKLIEKIEDYQPQALAILGKQAYEQGFSQ-----------------------------RG 120 +L+ + + + + + I+GK +E + Sbjct: 173 PRLLSEFKSSRVETIVIVGKGIWEIIVKYFMKELNISSTNFKLTSSNSNRKSDGGSAPKN 232 Query: 121 AQWGK----------------QTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 +WG S++I++LPN SGL + +S ++ ++ ++++ L Sbjct: 233 FEWGLLQKGSDLTYNLIIDSIYKHIDESSKIYILPNTSGLVASLSFDEKLKLWQDMVNDL 292 >UniRef50_Q0F2S7 Putative uncharacterized protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0F2S7_9PROT Length = 168 Score = 129 bits (324), Expect = 4e-29, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 56/160 (35%), Gaps = 7/160 (4%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 P +V+ G PG S +AHP N FW ++ F+ + L + Sbjct: 12 PDAKVLILGSMPGRKSLDEQQYYAHPRNAFWPIMASLFEFSADIDYEKRLGMLRSNGIAL 71 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQRGAQWG 124 + + + + L + D P +A+ G++A + + + Sbjct: 72 WDVARQCIRPGSLDAAIRLPSVITNDFTAFFDTHPQIRAVCFNGRKAADLYQRRVIPELA 131 Query: 125 KQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + ++ LP+ S + +S + + +R + L Sbjct: 132 LNQTQLP---LYPLPSTSPTHAAISFDSKLAQWRTILDIL 168 >UniRef50_Q5KFR6 Expressed protein n=2 Tax=Filobasidiella neoformans RepID=Q5KFR6_CRYNE Length = 469 Score = 128 bits (323), Expect = 5e-29, Method: Composition-based stats. Identities = 45/116 (38%), Positives = 67/116 (57%), Gaps = 2/116 (1%) Query: 2 VEDILAPGLRVVFCGINP-GLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 + D+L G +VFCGI G S+ G FAHP N+FWK +YQ+G T R + P E ++ Sbjct: 158 LNDLLVKGQDLVFCGIKRIGKMSSTLGHHFAHPTNKFWKALYQSGLTSRLMSPTEDYKVV 217 Query: 61 D-YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 D Y G+T LV RPT + +E+S E+ L++K YQP + +GK+ ++ Sbjct: 218 DEYNYGLTNLVGRPTSEQSELSTLEMKLNTINLLQKFIKYQPSVVCFVGKKIWDVF 273 >UniRef50_D0WFV1 Putative G/U mismatch-specific DNA glycosylase n=2 Tax=Bacteria RepID=D0WFV1_9ACTN Length = 562 Score = 128 bits (323), Expect = 6e-29, Method: Composition-based stats. Identities = 36/160 (22%), Positives = 58/160 (36%), Gaps = 15/160 (9%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK---PQEAQHLL 60 + +V+ G P S GF +AHP NRFW+VI Q D L + LL Sbjct: 13 PVFDEQSQVLVLGTMPSPRSRKEGFYYAHPRNRFWRVIAQ--LFDEPLARTNEERIDQLL 70 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + + ++ + + + +I D P A A Q + + Sbjct: 71 RHHIALWDVLASC--RIAGAKDASIRDAVPNDLARIMDAAPVAAVFCTGTAAAQAYRRF- 127 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 + LP+ S + + L+ LV+AYR L Sbjct: 128 ------SEPKTGIACVQLPSTSPANAAMGLDALVDAYRPL 161 >UniRef50_A0R287 T/U mismatch-specific DNA glycosylase n=5 Tax=Mycobacterium RepID=A0R287_MYCS2 Length = 162 Score = 128 bits (322), Expect = 8e-29, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 50/162 (30%), Gaps = 18/162 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLL 60 + I+ RV+ G P + S + + +P N FW+++ GF + + Sbjct: 8 LPPIVGEAPRVLILGNMPSVMSLASSEYYGNPRNAFWRIMGSLYGFAPDAAYSERVSAVT 67 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQA---YEQG 115 + V ++ + S E + P + G A Y + Sbjct: 68 SHGIAVWDVLRSCRRVGSLDSAVERDSMVPNDFAGFFAAHPSLERVVFNGAAAEANYRRL 127 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYR 157 + + +P+ S + E + A+R Sbjct: 128 VGEVPLPY------------VRMPSTSPAQTMRFEDKLSAWR 157 >UniRef50_C7N6G7 YjeF-like protein n=6 Tax=Bacteria RepID=C7N6G7_SLAHD Length = 447 Score = 128 bits (321), Expect = 9e-29, Method: Composition-based stats. Identities = 33/165 (20%), Positives = 57/165 (34%), Gaps = 13/165 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ-HLLDY 62 + RV+ G P S GF + HP NRFW V+ Q + LL + Sbjct: 14 PVFDRHSRVLILGTMPSPKSREEGFFYGHPRNRFWTVLASLFHEPVPATNQRKRDLLLRH 73 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRGA 121 + ++ ++ S + + +I P +A+ G ++ E + Sbjct: 74 GIALWDVLASCDIEGA--SDASIKNAKPNDLTRITSAAPIEAVFCTGAKSAELYRRYCES 131 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 Q G LP+ S + + E L AY+ + L Sbjct: 132 QTG--------IPAVQLPSTSPANAAMDTETLTAAYQVILDHLEA 168 >UniRef50_A6Q419 Putative uncharacterized protein n=1 Tax=Nitratiruptor sp. SB155-2 RepID=A6Q419_NITSB Length = 182 Score = 127 bits (319), Expect = 1e-28, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 66/166 (39%), Gaps = 15/166 (9%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQ-HLLDY 62 + +V+ G P +S GF + HP N+FWK++ +E + L + Sbjct: 9 PFIDKESKVLILGSFPSFASLEIGFYYGHPQNQFWKIVADVFKQKVPDTLEEKKEFLKRH 68 Query: 63 RCGVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRG 120 + +V + + + S + + + ++ Y + + G+++ + Q Sbjct: 69 HIALWDMVKGCIRENSLDSSLSSIE--VNDIEKLLKTYPNIEVIFFTGRKSQQLFEKQF- 125 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVV 165 + + + LP+PS R +S E+ V+ ++++ + Sbjct: 126 --------SHVNLPRYYLPSPSPAYRAMSFEEKVQQWKKILDYIDE 163 >UniRef50_C4FAG7 Putative uncharacterized protein n=2 Tax=Collinsella RepID=C4FAG7_9ACTN Length = 185 Score = 126 bits (317), Expect = 3e-28, Method: Composition-based stats. Identities = 37/162 (22%), Positives = 63/162 (38%), Gaps = 12/162 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-GFTDRQL--KPQEAQH 58 + I RV+ G P S GF + HP NRFW+V+ + G D +L Sbjct: 23 LGPIFDERSRVLVLGTMPSPRSREVGFYYGHPQNRFWRVMERLHGLRDHELIDNDARRAF 82 Query: 59 LLDYRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS 117 LL +R + ++ A++ S L ++ + + G +A + Sbjct: 83 LLAHRIALWDVLAACNIAGASDASIANPEPN--DLARILDAAPIETVCTTGGKAAQLC-R 139 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + Q ++ LP+ S +R+ L+ LV AYR Sbjct: 140 RFDGPVLTQR----GIELVELPSTSPANARMRLDDLVAAYRA 177 >UniRef50_C8W9T7 Uracil-DNA glycosylase superfamily n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W9T7_ATOPD Length = 185 Score = 124 bits (313), Expect = 9e-28, Method: Composition-based stats. Identities = 36/164 (21%), Positives = 65/164 (39%), Gaps = 13/164 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIY-QAGFTDRQLKPQEAQHLL 60 ++ I+ P RV+ G P S GF F +P NRFW+V+ G Q P++ Sbjct: 24 LDPIIFPDSRVLLLGSMPSPKSREIGFYFGNPQNRFWRVVATLYGEKPLQSIPEKIDFCK 83 Query: 61 DYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + + ++ A++ S + + I+D A+ LG A Sbjct: 84 RHHLALCDVLKACDIKGASDASIA--NPVPHNIGALIQDSSICAIFTLGGTA-------- 133 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 + + T LP+ S +R+S++ LVE + + + Sbjct: 134 TKFYKRYTEQQTGILARQLPSTSPANARMSVDDLVEEFSIIKKF 177 >UniRef50_Q5P2M8 T/U mismatch-specific DNA glycosylase n=30 Tax=Bacteria RepID=Q5P2M8_AZOSE Length = 166 Score = 124 bits (313), Expect = 9e-28, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 50/161 (31%), Gaps = 8/161 (4%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-GFTDRQLKPQEAQHLLD 61 + APG + G PG +S +AH N FW ++ + G + Q L D Sbjct: 8 PPVAAPGAHTLILGSMPGAASLAALEYYAHRRNLFWPILGEVLGIDPALSYAERTQRLAD 67 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQR 119 + ++ + S E + P + G A + Sbjct: 68 AGYALWDVLGACRRHGSLDSNIENDSIEVNDFAIFFATHPSIDRVFCNGATAEQCFRRHV 127 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 ++ LP+ S + + + A+RE+ Sbjct: 128 LPAL----EGRFEFEVRRLPSTSPANASIPHAAKLAAWREI 164 >UniRef50_A6SVT3 G:T/U mismatch-specific DNA glycosylase n=21 Tax=Proteobacteria RepID=A6SVT3_JANMA Length = 165 Score = 124 bits (312), Expect = 1e-27, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 62/165 (37%), Gaps = 23/165 (13%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD- 61 ++ ++ G PG +S +AHP N+FWK + A + + + LL Sbjct: 14 PPVIDQHTEILILGSFPGEASLSAQQYYAHPRNQFWK-LTGALINEDLVTLPYTERLLRL 72 Query: 62 --YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFS 117 +R G+ ++ + + S + ++ P + + GK + Sbjct: 73 SAHRIGLWDVIAACAREGSLDSA--IRQAQHNDFALLKQQCPQLRRVCFNGKTS------ 124 Query: 118 QRGAQWGKQTLTI--GSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 GKQ + +LP+ S + ++ E+ + A+R + Sbjct: 125 ------GKQEALFAQAGFETIILPSSSPAYALLTFEQKLAAWRGI 163 >UniRef50_C4ZBZ4 G:T/U mismatch-specific DNA glycosylase n=20 Tax=Bacteria RepID=C4ZBZ4_EUBR3 Length = 164 Score = 124 bits (311), Expect = 1e-27, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 63/165 (38%), Gaps = 17/165 (10%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LL 60 + R++ G P ++S F + H NRFWK++ A D + + L Sbjct: 12 PVYDENSRILILGTLPSVASRENNFYYGHKQNRFWKLL--AYLLDEPVPETIDEKKYMLF 69 Query: 61 DYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + ++ +++ S + + + + +E + + G +A Sbjct: 70 KNHIAIWDVIQSCDIKGSSDSSIKNVEP--TDIRKILETADIKQVYANGNKA-------- 119 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 GA + K L + + LP+ S + S E+L EA+ + L Sbjct: 120 GALYKKYQLPLTGMEAVTLPSTSPANAAWSFERLCEAWTRILANL 164 >UniRef50_C8WJG8 Putative uncharacterized protein n=8 Tax=Bacteria RepID=C8WJG8_EGGLE Length = 173 Score = 124 bits (311), Expect = 2e-27, Method: Composition-based stats. Identities = 36/167 (21%), Positives = 60/167 (35%), Gaps = 10/167 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ-EAQHLL 60 +E I A RV+ G P S GF + HP NRFWKV+ L + L Sbjct: 10 LEPIYAVDSRVLVLGTMPSPKSREIGFYYGHPQNRFWKVMGALFDEPEPLGIEGRTAFLY 69 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + + ++ + S + + +I + P + + G Sbjct: 70 AHGIALWDVLSSCAIVGA--SDASIVDPVANNLSRIAEAAPLEAVFTTGSKATALYRRFG 127 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVR 166 A + LP+ S +R+ L+ LV+AY + + L R Sbjct: 128 AP------QLPGLAHTGLPSTSPANARMRLDDLVKAYLPIRETLERR 168 >UniRef50_C4Z504 DNA glycosylase n=4 Tax=Clostridiales RepID=C4Z504_EUBE2 Length = 188 Score = 123 bits (309), Expect = 3e-27, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 66/161 (40%), Gaps = 17/161 (10%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK---PQEAQHL 59 + R++ G P + S F + HP NRFWK+I +G + + ++ + Sbjct: 36 PPLYNSESRILILGSLPSVKSREQMFFYGHPQNRFWKMI--SGVFEETVPQTIEEKKAFM 93 Query: 60 LDYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 L + + + + +++ S + + L I++ + + + GK + + Sbjct: 94 LKHNIALWDTIYSCDIIGSSDSSIKNVVP--TDLKSIIDNSKIEKVICNGKTSGKYYEKY 151 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + G + +LP+ S + SL KLVE +++ Sbjct: 152 QMKYLGIKPD--------ILPSTSPANAAYSLGKLVEIWKK 184 >UniRef50_C2FY74 G/U mismatch-specific DNA glycosylase n=2 Tax=Sphingobacterium spiritivorum RepID=C2FY74_9SPHI Length = 160 Score = 123 bits (308), Expect = 3e-27, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 58/163 (35%), Gaps = 16/163 (9%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ---EAQHL 59 + + + G PG S G +AHP NRFWK+++ +++ + Q L Sbjct: 7 PPVCGLSPKFLILGSLPGDKSLGQQQYYAHPQNRFWKLLF--HLMEKEYESDYSKRIQFL 64 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQ 118 D+ + + + L +I ++ Q + G +A + Sbjct: 65 EDHHIALWDTCASAIRPGSMDT-AILSELPNDIISLLQHVPSIQHVIFNGNKAKQLYDR- 122 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELD 160 + LP+ S +R S +KLV+++ L Sbjct: 123 -------YHKRLAQISYHSLPSTSPANARFSFDKLVDSWSILK 158 >UniRef50_UPI00016AE856 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Burkholderia pseudomallei B7210 RepID=UPI00016AE856 Length = 170 Score = 122 bits (307), Expect = 3e-27, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 60/163 (36%), Gaps = 9/163 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ-AGFTDRQLKPQEAQHLL 60 + I+ + G PG S +AH AN+FW+++ G T + + LL Sbjct: 5 LAPIVDANSTHLILGTMPGRESLKFQRYYAHGANQFWQILETVYGETIPDHYERRVEFLL 64 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQR 119 + + ++ + + + + E + + + + + G A + Sbjct: 65 RHGIALWDVLRSADREGS-LDSAIRNELANDFAGLFERFPRLRTIGLHGTTAAKLFRRHI 123 Query: 120 GAQWGKQTLTIGSTQIWVLPNPS---GLSRVSLEKLVEAYREL 159 Q G + I Q VLP+ S G S E+ + +R+ Sbjct: 124 AFQRG---VPIAELQCTVLPSTSSTPGRCVKSPEEKMACWRDF 163 >UniRef50_C5AAU4 Uracil-DNA glycosylase superfamily protein n=8 Tax=Proteobacteria RepID=C5AAU4_BURGB Length = 163 Score = 122 bits (307), Expect = 4e-27, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 57/160 (35%), Gaps = 12/160 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--RQLKPQEAQHLLD 61 ++ RV+ G PG S +AHP NRFW+++ T+ LL Sbjct: 9 PVVDAQTRVLILGSLPGEVSLAQSQYYAHPQNRFWRLLGDVLGTELVALDYASRLDTLLR 68 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRG 120 +R G+ +V T + + + L ++ +A G A + G Sbjct: 69 HRIGLWDVVAEATRIGS-LDSRIRDHASNDLSALVDTLPNLDTIAFNGGTAAKLGTRAL- 126 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 ++ LP+ S + + + A+R+L Sbjct: 127 ------AERGARYRLLRLPSSSPAYAALPYADKLLAWRQL 160 >UniRef50_Q1VS76 G/U mismatch-specific DNA glycosylase, putative n=2 Tax=Bacteria RepID=Q1VS76_9FLAO Length = 254 Score = 122 bits (307), Expect = 4e-27, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 60/160 (37%), Gaps = 12/160 (7%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH-LLD 61 + I ++ G PG S G + HP NRFWK+I + L + + LL Sbjct: 101 DPISNSDTAILILGTMPGDKSLELGEYYGHPRNRFWKIISTITNNELPLTYSDKKELLLK 160 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQRG 120 ++ G+ + + + + + L + I ++ + + G ++ Sbjct: 161 FKIGIWDIAHKANRKGS-LDSAIEDEEPNDLDDFIARHKNLKVIGFNGTKSETLFDKYFD 219 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 + G + LP+ S + + + + + +R++ Sbjct: 220 RKSGL--------KYVYLPSTSPANTGIDFDNICKLWRQI 251 >UniRef50_A9AVD1 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AVD1_HERA2 Length = 168 Score = 122 bits (307), Expect = 4e-27, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 59/168 (35%), Gaps = 9/168 (5%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ--LKPQEAQHL 59 + I+ R + G PG S F + HP N+FW+++ QA F D A L Sbjct: 6 LPAIVHSTSRCLILGTLPGDQSLAAQFYYLHPRNQFWRIM-QAVFADEPGLTSVTRAAFL 64 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFS 117 ++ + ++ + + + + P + + G +A + Sbjct: 65 ARHQLALWDVLHAAERVGSLDTA--ITNPIPNDFQTFFTANPAIQKVGLNGMKAQQLFMR 122 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQALVV 165 + ++ LP+ S + + K VEA+ L AL Sbjct: 123 --LVWPNLTSELQSRLELVGLPSTSPANTQAFAKKVEAWMWLRNALGA 168 >UniRef50_B2KDM2 G:T/U mismatch-specific DNA glycosylase-like protein n=1 Tax=Elusimicrobium minutum Pei191 RepID=B2KDM2_ELUMP Length = 162 Score = 121 bits (305), Expect = 7e-27, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 66/158 (41%), Gaps = 14/158 (8%) Query: 5 ILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHL-LDYR 63 I + +++ G PG S G +AHP N FW V+++ + + ++ ++L ++ Sbjct: 8 ITSKEPKILILGTMPGEESLLRGEYYAHPRNAFWPVLFKCFGAEISAQYKDKKNLIINNN 67 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ--PQALAILGKQAYEQGFSQRGA 121 + +++ Q + S E+ I+ + +A+ GK A + + A Sbjct: 68 LALWDVLNECERQGSLDS--EIKNFTPNKIDVFLRSRKSVRAIFFNGKGAGR-FYKRFFA 124 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + + + LP+ S + SLE+ +++ Sbjct: 125 PF-------EEVKYFYLPSTSPAYAAKSLEEKTALWKQ 155 >UniRef50_A7ZBI7 Mug G:T/U mismatch-specific DNA glycosylase n=2 Tax=Campylobacter RepID=A7ZBI7_CAMC1 Length = 168 Score = 121 bits (304), Expect = 8e-27, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 66/167 (39%), Gaps = 17/167 (10%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---LL 60 I +++ G P + S GF + +P NRFW+V+ AG + L + LL Sbjct: 9 PIFDKNSKILILGSFPSVISRKFGFYYTNPQNRFWRVL--AGILNAPLPASTDEKIKFLL 66 Query: 61 DYRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + + +++ ++ L ++ + + + G +AYE Sbjct: 67 SHHIAIYDAAISCEIEGSSDAKMSKIVP--VNLKPLFKEAKIKQVYANGGKAYEICKKY- 123 Query: 120 GAQWGKQTLTIGSTQ--IWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 + I +T+ + LP+ S + SLE+L+ ++ + L Sbjct: 124 -----LENEIIKATKNEVIKLPSTSPANAKFSLERLMCEWQVVADTL 165 >UniRef50_B0T5Y6 Uracil-DNA glycosylase superfamily n=10 Tax=Proteobacteria RepID=B0T5Y6_CAUSK Length = 166 Score = 121 bits (303), Expect = 1e-26, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 54/169 (31%), Gaps = 15/169 (8%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--LL 60 + RV+ G PG +S + +P N FW+++ E + LL Sbjct: 8 PPVTDANTRVLILGSLPGEASLAVQQYYGNPRNAFWRLMESVLDAPLVPLAYEDRLAALL 67 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQR 119 G+ ++ + + + L+ IE + +A G A + G Sbjct: 68 SRGVGLWDVIAQAERPGS-LDAAIRDPAANDLLALIETLPALRLVAFNGGAAAKLGSR-- 124 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVRG 167 L LP+ S + S + E + L + +G Sbjct: 125 --------LIGDRVAALALPSSSPAHAARSFAQKAEIWGGLKTFVFPQG 165 >UniRef50_B2UPH3 Uracil-DNA glycosylase superfamily n=2 Tax=Chlamydiae/Verrucomicrobia group RepID=B2UPH3_AKKM8 Length = 171 Score = 120 bits (302), Expect = 2e-26, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 63/169 (37%), Gaps = 9/169 (5%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLD 61 + P V+ G PG S +AHP N FWK++ GF + L Sbjct: 7 PPSVTPDCSVLVLGSLPGDESLRQAQYYAHPRNAFWKIMGSLLGFDPSLPYGERLSLLNR 66 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRG 120 G+ +V + + + + ++ + + G +++ Sbjct: 67 GGVGLWDVVASGVRPGS-LDQHISQERPNDIAALLDRFPGIGVVCCNGTASHKYLKRYFP 125 Query: 121 AQWGKQTLTIGSTQIWVLPNPSG-LSRVSLEKLVEAYRELDQALVVRGR 168 + ++TL+ + +P+ S +R++ E+ AY E+ L+ + R Sbjct: 126 ELFLRETLS-----VIQMPSTSPAAARLTYEQKFRAYEEIMAPLLRKER 169 >UniRef50_C8WEM8 Uracil-DNA glycosylase superfamily n=4 Tax=Zymomonas mobilis RepID=C8WEM8_ZYMMN Length = 163 Score = 120 bits (301), Expect = 2e-26, Method: Composition-based stats. Identities = 31/164 (18%), Positives = 69/164 (42%), Gaps = 13/164 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA--GFTDRQLKPQEAQHLLD 61 + R++ G PG++S + HP N+FW+++ + P+ + LL Sbjct: 7 PVTNINSRLLILGSLPGVASLEKAQYYGHPRNQFWRLMSDIIGEDIESAAYPERLEGLLR 66 Query: 62 YRCGVTKLVDRPTVQAN-EVSKQELHAGG-RKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + G+ ++ Q + + + +E+ LI K+ + +ALA G++A + G + Sbjct: 67 HHIGLWDVIGTAKRQGSLDSNIKEVSPNPLGDLIGKLPNL--KALAFNGQKAAQLGIKEL 124 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 ++LP+ S V+ + A+ +L + + Sbjct: 125 -------QKIGAKLPYYILPSSSPAHAVAYDVKKAAWIKLQEII 161 >UniRef50_Q1NE08 Uracil-DNA glycosylase superfamily protein n=1 Tax=Sphingomonas sp. SKA58 RepID=Q1NE08_9SPHN Length = 164 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 35/167 (20%), Positives = 64/167 (38%), Gaps = 15/167 (8%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ--EAQHLL 60 + RV+ G PG +S G +AH N FW +I + + +P + L Sbjct: 8 PPSVDAATRVLVLGSLPGDASIRQGEYYAHRGNAFWHLIGDVVGENLRAQPYAMRLKRLR 67 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQ 118 + G+ +++ + + S+ + + + ++ P +A+A GK A G Sbjct: 68 AHHVGLWDVMESAQREGSLDSR--IRQADLRNLADFVEHLPDLRAIAFNGKTAALHGRR- 124 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 Q + LP+ SG + +S E E + L LV Sbjct: 125 -------QLTDRPGITLLDLPSTSGAYASLSREAKRERWMALRDHLV 164 >UniRef50_B1C3U7 Putative uncharacterized protein n=1 Tax=Clostridium spiroforme DSM 1552 RepID=B1C3U7_9FIRM Length = 160 Score = 119 bits (300), Expect = 2e-26, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 62/163 (38%), Gaps = 13/163 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ-EAQHLLDY 62 I +++ G P + S F + HP NRFWK++ D + Q LL+ Sbjct: 8 PIYNKESKILILGSFPSIKSREAKFYYYHPQNRFWKILGNIFDNDNLSTITLKKQFLLNN 67 Query: 63 RCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + + ++ +++ S ++ LI I + + G A + Sbjct: 68 KIALWDVIASCNIKGSSDSSISDVK--VNDLIRIINGSAIKHIYTNGNLANKLYHQYFDD 125 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + + LP+ S + SLE+L+E ++ + ++ Sbjct: 126 KIAL--------PVTKLPSSSPANASYSLERLIECWKVIKDSM 160 >UniRef50_A6DAL5 Putative uncharacterized protein n=1 Tax=Caminibacter mediatlanticus TB-2 RepID=A6DAL5_9PROT Length = 171 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 69/166 (41%), Gaps = 12/166 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE-AQHLLDY 62 I+ ++ G P + S F ++HP N+FWK++ +E L + Sbjct: 12 PIVFDTSEILILGTFPSIKSFENNFYYSHPQNQFWKILSTIFNEKTPKTIEEKINFLKKH 71 Query: 63 RCGVTKL---VDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQ 118 + + R + + + + + + ++ Y + +A+ + A + + Sbjct: 72 KIALWDTICECKRKEKNSKDSNLEIIKPC--DIENLLKQYPNIKKVAVTSRTAEKIL--K 127 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 + + +I ++Q+ LP+PS L + + + V+ ++EL Q + Sbjct: 128 KHFK--NSQFSILNSQLIYLPSPSPLYASKNFQTKVKEWKELLQLM 171 >UniRef50_A6Q7V9 Putative uncharacterized protein n=1 Tax=Sulfurovum sp. NBC37-1 RepID=A6Q7V9_SULNB Length = 175 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 60/158 (37%), Gaps = 14/158 (8%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 I+ + + G P + S F +AHP N+FWK++ + + Q+ L + + Sbjct: 12 PIVFNDTKTLILGSFPSIKSFENNFYYAHPRNQFWKILEKLTGYPVNNRDQKIWLLKECK 71 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRGA 121 G+ ++ + + + + + Y P +A GK+A + Sbjct: 72 LGLWDMIGSCQRENS-LDSSLEDEVVNDIPALLSAY-PSIDKIAFTGKKAQALFETHF-- 127 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + + LP+PS + ++ E+ V Y E Sbjct: 128 -------SYLEIERVYLPSPSPAYAAMTFEEKVVQYAE 158 >UniRef50_C5MI19 Predicted protein n=1 Tax=Candida tropicalis MYA-3404 RepID=C5MI19_CANTT Length = 267 Score = 119 bits (298), Expect = 4e-26, Method: Composition-based stats. Identities = 46/212 (21%), Positives = 81/212 (38%), Gaps = 50/212 (23%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQ------- 54 ++ L+ L V+F G N G SA +AH N+FWK+ ++ ++ Q Sbjct: 56 LKPSLSNKLHVLFIGFNAGEKSALKQHRYAHHTNQFWKLFNESNLLEKVTTSQNITKDDT 115 Query: 55 ------------EAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQ 102 HL++Y G T LV+R T +E+ E +L+++ + Sbjct: 116 LEKLLSHGCKAVHDYHLVNYGIGFTSLVERSTKNTSELGITEKLESVPRLLKEWNESNAV 175 Query: 103 ALAILGKQAYEQ---GFSQR----------GAQWG-----------------KQTLTIGS 132 + ILGK ++ FS++ +WG S Sbjct: 176 EIVILGKGIWDVIVTFFSRKLNVKFKLNAIEFKWGEVGKTHEHEIYNLILDSIHDCIPKS 235 Query: 133 TQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 ++I V P+ SGL R E ++ +++ L Sbjct: 236 SKIHVFPDTSGLVRHFKYEDKLDLWKDFVHRL 267 >UniRef50_A1WBA7 Putative uncharacterized protein n=17 Tax=Proteobacteria RepID=A1WBA7_ACISJ Length = 207 Score = 118 bits (297), Expect = 5e-26, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 12/164 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA----GFTDRQLKPQEAQ 57 + ++ G RV+ G PG++S + HP N FW+++ R Sbjct: 41 LPPMVGSGTRVLVLGSFPGVASLAAQQYYGHPQNHFWRILAALWPQHPLPGRDQYAARCD 100 Query: 58 HLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQG 115 LL G+ + + + + + + P A+A G +++ Sbjct: 101 WLLARGLGLWDVYAACEREGSLDTS--IRNAQVNDFAGLHARAPGLAAIAHNGGESFRHA 158 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 A G + + LP+ S + S EK + A+ E Sbjct: 159 -KAVRASLGLKADAPMPS--LRLPSTSPANASWSFEKKLHAWGE 199 >UniRef50_A1KBL3 Putative uncharacterized protein n=1 Tax=Azoarcus sp. BH72 RepID=A1KBL3_AZOSB Length = 186 Score = 118 bits (295), Expect = 9e-26, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 60/167 (35%), Gaps = 5/167 (2%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLDY 62 + RV+ G PG++S +AHP N FW ++ G + Q LLD Sbjct: 18 PVWRSDARVLVLGSMPGVASLTAAQYYAHPRNAFWTIMGALFGAGPALPYAERLQRLLDA 77 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQGFSQRG 120 + ++ + S + + + P + L G A E F + Sbjct: 78 GVALWDVIASCRRPGSLDSAIAPDSVTPNDLPGLITACPALEHLFFNG-TAAETAFRRHF 136 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVVR 166 A+ + + + ++ LP+ S + L + + A+ + A Sbjct: 137 ARGDRLRVLRPTLKLQRLPSTSPAHAARPLAEKLSAWEAVRAAASAH 183 >UniRef50_Q7MVW9 G/U mismatch-specific DNA glycosylase, putative n=2 Tax=Porphyromonas gingivalis RepID=Q7MVW9_PORGI Length = 163 Score = 118 bits (295), Expect = 1e-25, Method: Composition-based stats. Identities = 32/161 (19%), Positives = 61/161 (37%), Gaps = 14/161 (8%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ-LKPQEAQHLLD 61 I L ++ G PG S G +AHP NRFW ++ + + + LL Sbjct: 10 PPIEDGHLEILILGSLPGDESIRRGQYYAHPRNRFWPLMAKLLGKPLPDDYAERTEILLS 69 Query: 62 YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQR 119 G+ + + + Q + IE P+ +A G++A E F + Sbjct: 70 AHIGLWDVAHSAIRKGS-ADIQICDEEPNDIRSLIER-NPRLHTIAFNGRKA-EAMFRKH 126 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYREL 159 T+ + ++P+ S + +L+ LV+ + + Sbjct: 127 F------PETLA-IRCLLMPSTSPANAGKTLDLLVKDWNRI 160 >UniRef50_C6CSS2 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CSS2_PAESJ Length = 177 Score = 117 bits (294), Expect = 1e-25, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 56/169 (33%), Gaps = 16/169 (9%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFW---KVIYQAGFTDRQLKPQEAQHL 59 ++ RV+ G P + S + HP N FW ++ AG D L Sbjct: 13 PPVIDERARVLVLGTAPSVKSLEHRQFYGHPRNYFWGMVYGLFGAGVPDEDYGK-RLSFL 71 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALA--ILGKQAYEQGFS 117 D+ V ++ + + + L +Y P+ G +AY+ Sbjct: 72 QDHHMAVFDVIKSCERPGS-LDVNIKNEEPNDLPALAGEY-PELKCFAFNGSKAYDTFRK 129 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGLSRVS---LEKLVEAYRELDQAL 163 Q L + +P+ S + LE +EA++ + L Sbjct: 130 YYRDHPALQHLAL-----LKMPSTSPIPTQKMRNLEDRIEAWKAILPYL 173 >UniRef50_A5FBU1 G:T/U mismatch-specific DNA glycosylase-like protein n=3 Tax=Bacteria RepID=A5FBU1_FLAJ1 Length = 160 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 51/162 (31%), Gaps = 12/162 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD-RQLKPQEAQHLLDY 62 I +P +V+ G PG S + H N FWK ++ + + LL Sbjct: 8 PISSPDSKVLILGTMPGTKSLELNQYYGHSQNNFWKFMFTIFKEEFSTDYETKKALLLKN 67 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRGA 121 + + ++ + + + ++ + +A G++A Sbjct: 68 KIALWDVLQYCDRVGS-LDSAIKNEIANDFETFLKQHPNIHTIAFNGQKAAAFFKKYVHL 126 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQA 162 + + LP+ S + + ++ ++ + Sbjct: 127 KK--------DYTLITLPSTSPANASKAFQSKLDEWKVIASL 160 >UniRef50_B0EFS1 Putative uncharacterized protein n=4 Tax=Entamoeba RepID=B0EFS1_ENTDI Length = 176 Score = 116 bits (292), Expect = 2e-25, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 58/165 (35%), Gaps = 23/165 (13%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDR--QLKPQEAQHLL 60 + I+ ++ G PG S + HP N FW +I Q + + Sbjct: 9 DPIIPSKPLILILGSMPGTESLKKQQYYGHPQNCFWSIISSIKSMGSVPPRYEQRIELIK 68 Query: 61 DYRCGVTKLVDRPTVQAN-EVSKQELHAGGRKLIEKIEDYQP--QALAILGK---QAYEQ 114 + + + + + + + +E+ I K+ P + + G+ + Y + Sbjct: 69 SCQIALWDVCCQCERKGSLDSDIKEVKPNK---INKLLLEHPTIKTVLFNGQGPSKLYHK 125 Query: 115 GFSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 F + + + + ++P S +R + +E +++ Sbjct: 126 FFKE-----------LPNIKYVIMPGTSPANARTKKQTKIEIWKK 159 >UniRef50_D1PNX0 T/U mismatch-specific DNA glycosylase n=1 Tax=Subdoligranulum variabile DSM 15176 RepID=D1PNX0_9FIRM Length = 176 Score = 116 bits (291), Expect = 3e-25, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 52/162 (32%), Gaps = 19/162 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA----- 56 + + + G P S GF + HP NRFW ++ A T + Sbjct: 15 LPPLHGERAGALILGSFPSPKSREQGFFYGHPQNRFWPLL--ASLTGEPVPDWADIEAKK 72 Query: 57 QHLLDYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 Q +L + V + A++ S + + + I QA+ G + Sbjct: 73 QIILRHGLAVWDTIGACDIRGASDASIRNVQPN--DVAALIRRLGVQAVFCNGAASGRVY 130 Query: 116 FSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAY 156 G VLP+ S + S+EKL + Sbjct: 131 ARYAQPLTGLSAT--------VLPSTSPANAAWSMEKLRAVW 164 >UniRef50_D0N7I8 Putative uncharacterized protein n=1 Tax=Phytophthora infestans T30-4 RepID=D0N7I8_PHYIN Length = 240 Score = 116 bits (290), Expect = 4e-25, Method: Composition-based stats. Identities = 37/178 (20%), Positives = 70/178 (39%), Gaps = 23/178 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-------YQAGFTDRQLKPQEAQHLLD 61 LR++ G NP + TG+ +++P NR W ++ G + E L D Sbjct: 52 RLRLLIVGHNPSDHAWKTGYSYSNPTNRMWMLLTGTLSPHSWEGIVPSTARIDEQNILPD 111 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIED----YQPQALAILGKQAYEQGF 116 + G T + P A++ K + A +++ + P +A GK+ + F Sbjct: 112 VHGVGFTSIGLEPGNDASKYGKATMEAWRTSFFQRLRRHALPHAPILIAFSGKRQFSYLF 171 Query: 117 -----------SQRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 Q G +++WVLP+ SG + ++ + Y++L + L Sbjct: 172 SPALSKIESYGKQSKLPPGWPHELRADSEVWVLPSSSGRAAMTTAQRTLPYQQLAERL 229 >UniRef50_Q1Q0C1 Similar to G/U-mismatch-specific DNA glycolase n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q0C1_9BACT Length = 170 Score = 116 bits (290), Expect = 4e-25, Method: Composition-based stats. Identities = 39/166 (23%), Positives = 73/166 (43%), Gaps = 13/166 (7%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD------ 61 ++F GINP S PF++ FW ++ +AG + + L+ Sbjct: 7 KKPVILFVGINPHPGSFNRHVPFSN-NKLFWYLLSEAGLIKEKREELRDDKTLERVYKEK 65 Query: 62 ----YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS 117 Y G ++DRPT + K E G +K+ I+ P+ + +GK +YE+ Sbjct: 66 FNTVYGLGFVNIIDRPTRDITGIKKHEELPGRKKISRIIKAEMPKVVCFIGKVSYEKYAG 125 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNP-SGLSRVSLEKLVEAYRELDQA 162 + +G Q IG+++++V+ P G + + + +L E R + Sbjct: 126 SKDFSFGWQ-ENIGASKVFVMHFPLRGEALIRVRELQEIKRTIQSL 170 >UniRef50_C3RJZ3 G:T/U mismatch-specific DNA glycosylase n=6 Tax=Bacteria RepID=C3RJZ3_9MOLU Length = 160 Score = 115 bits (289), Expect = 5e-25, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 62/165 (37%), Gaps = 13/165 (7%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK-PQEAQHL 59 ++E + +++ G P + S F + HP NRFWK++ T+ ++ + L Sbjct: 6 LIEPVYNKSSKILILGSFPSVKSREANFFYHHPQNRFWKILANIYNTELPETIIEKKEFL 65 Query: 60 LDYRCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 ++ V ++ +++ S + L + I+ Q + G A Sbjct: 66 INQHIAVWDVIASCEIKGSSDSSITNVE--VNDLKKIIDQSQITHIYTNGNLADRLYHRY 123 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 I + LP+ S + SL KL+ ++E+ Sbjct: 124 F--------DEIIDLPVTKLPSSSPANASYSLTKLISFWKEIKSL 160 >UniRef50_B8GKM7 Putative uncharacterized protein n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GKM7_METPE Length = 185 Score = 115 bits (288), Expect = 6e-25, Method: Composition-based stats. Identities = 32/165 (19%), Positives = 62/165 (37%), Gaps = 9/165 (5%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL-DY 62 I+ V+ G P + G +A+P N FW +I + R L E LL D Sbjct: 26 PIIGSAPVVLVLGSFPSARARTVGEYYANPLNHFWPIIEELFGISRSLSYSERVRLLADQ 85 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ-PQALAILGKQAYEQGFSQRGA 121 + + +VD + + + + + + + +A G+ A E+ + Sbjct: 86 QVALWDMVDSCYQEGS-MDSTIYEPALNDIRGFLTTHPTIRLVAANGRTA-ERFLKRSLG 143 Query: 122 QWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 + G + LP+ S +R S + V+ +R + + V Sbjct: 144 KAGP----PAGVTVISLPSTSPANARESFHQKVQRWRVIRDSAVP 184 >UniRef50_C9M6L9 Putative G/U mismatch-specific DNA glycosylase n=1 Tax=Jonquetella anthropi E3_33 E1 RepID=C9M6L9_9BACT Length = 158 Score = 114 bits (287), Expect = 8e-25, Method: Composition-based stats. Identities = 33/152 (21%), Positives = 55/152 (36%), Gaps = 13/152 (8%) Query: 15 CGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKPQEAQHLLDYRCGVTKLVDRP 73 G P S GF +AHP NRFW ++ G L + +V Sbjct: 2 LGSFPSPKSREAGFYYAHPRNRFWPLLCRLLGCPTPPDTAGRKALCLSVGLALWDVVGSC 61 Query: 74 T-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGS 132 T + A++ + +++ A L + + + G A G+ + Sbjct: 62 TIIGASDSTIEDVRAN--DLSPLLSGAPIRKIFTTGAAAARLYRQFIEPISGRPAES--- 116 Query: 133 TQIWVLPNPSGL-SRVSLEKLVEAYRELDQAL 163 LP+ S + + +L+EAYR L AL Sbjct: 117 -----LPSTSPANAAWTFPRLLEAYRPLADAL 143 >UniRef50_A3ZBY1 T/U mismatch-specific DNA glycosylase n=15 Tax=Campylobacter RepID=A3ZBY1_CAMJE Length = 160 Score = 113 bits (284), Expect = 2e-24, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 57/160 (35%), Gaps = 13/160 (8%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 +++ G P + S GF + HP NRFW ++ + ++ L Sbjct: 11 PFFDKDSKILILGSFPSIKSRQDGFYYQHPRNRFWPILETLFNAKLENITEQQAFLRKKH 70 Query: 64 CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 + ++ ++ ++ K +A L + + QA+ G+ AY Sbjct: 71 IALWDVLQSCKIKNSD-DKTISYAKANDLSLILSQTKIQAIFTTGQSAYRFFVKFH---- 125 Query: 124 GKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRELDQA 162 + LP+ S + S E+L++ Y + + Sbjct: 126 -------PHLEAIALPSTSPANLNFSFEQLLQNYEIIKKF 158 >UniRef50_C6MEU9 Putative uncharacterized protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MEU9_9PROT Length = 194 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 57/166 (34%), Gaps = 9/166 (5%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIY-QAGFTDRQLKPQEAQHLLDY 62 I +++ G PG +S +AHP N FW +I ++ L Sbjct: 27 PIAGRHAKILILGSMPGEASLAANQYYAHPRNAFWPIIAPLLQIQANSRYEEKIIALESS 86 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQ--ALAILGKQAYEQGFSQRG 120 + ++ + S E P + G++A E F + Sbjct: 87 PIALWDVLQSCNRAGSLDSMIETGTQIVNNFPAFFALHPHLTHVFFNGEKA-EACFRRNV 145 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 + +GS + LP+ S +R+S ++ + + ++ + + Sbjct: 146 IP----DIKLGSIALLRLPSTSPANARLSSDEKCKTWHQMIKLALE 187 >UniRef50_A5VDD8 Uracil-DNA glycosylase superfamily n=8 Tax=Proteobacteria RepID=A5VDD8_SPHWW Length = 190 Score = 113 bits (283), Expect = 2e-24, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 55/166 (33%), Gaps = 15/166 (9%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL----KPQEAQH 58 +++ R++ G PG +S +AHP N+FW+ + A D L P Sbjct: 26 PPVVSADTRLLLLGSLPGEASLRAARYYAHPQNQFWR-LTGAVIGDEGLALLDYPARLDR 84 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFS 117 L G+ +V + + L + +A+ G A G Sbjct: 85 LRAAGVGLWDVVADALRDGS-LDGAIRDHRPNDLAALVATLPALRAIGFNGGTAARLGRR 143 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 GA G + LP+ S + + E + L + L Sbjct: 144 MIGAPAGL--------ALVDLPSSSPAFTRPIAEKREKWLHLREFL 181 >UniRef50_C1EDR1 Predicted protein n=2 Tax=Micromonas RepID=C1EDR1_9CHLO Length = 302 Score = 113 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 38/194 (19%), Positives = 69/194 (35%), Gaps = 39/194 (20%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL-DYRCGVTK 68 L+++ G NP S G+G+ +++P+N FWK++ + ++ L D G T Sbjct: 93 LKLLIIGHNPSEHSWGSGWSYSNPSNNFWKLLAKGRIVPEDWTKEDCPRLPGDLGIGFTD 152 Query: 69 LVDRPTVQANEVSKQELHAGGRKLIEKIE---------------------------DYQP 101 P A ++ + L ++ + P Sbjct: 153 AGTVPGNDAGAYNRATMKKWRSDLYARLRGHVARCADASGDEDARVGHRSSMSIASRHGP 212 Query: 102 QALAILGKQAYEQGFSQR--GAQWGKQTLT---------IGSTQIWVLPNPSGLSRVSLE 150 +A GK+ Y Q F G+Q T++WV+P+ SG + ++ E Sbjct: 213 DVVAFSGKRQYAQLFDAVPSRVDAGRQPPESLPPGWPLCPERTEVWVMPSSSGRAAMTRE 272 Query: 151 KLVEAYRELDQALV 164 + EL L Sbjct: 273 AREGPWVELGTRLA 286 >UniRef50_A4B987 Putative uncharacterized protein n=1 Tax=Reinekea blandensis MED297 RepID=A4B987_9GAMM Length = 171 Score = 113 bits (282), Expect = 3e-24, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 60/162 (37%), Gaps = 10/162 (6%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE-AQHLLDYRCG 65 RV+ G PG++S +AHP N FW ++ DR + LL R Sbjct: 12 GEAPRVLVLGSMPGVASLTAVQYYAHPRNAFWPIMADFFGFDRDAPYDDRLAALLAKRVA 71 Query: 66 VTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQRGAQW 123 + ++ + S E + I+ QP + + G +A + A Sbjct: 72 LWDVLQSCVRPGSLDSAIERDSMVPNPIDSWMATQPSLQTVLLNGGRAATEFRRYFKAIL 131 Query: 124 GKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAY-RELDQAL 163 + + ++ LP+ S + + E+ + + L +AL Sbjct: 132 -----SNPNIEVVTLPSTSPAYAAMRFEEKRDRWHHALHRAL 168 >UniRef50_A8IDF6 Predicted protein n=1 Tax=Chlamydomonas reinhardtii RepID=A8IDF6_CHLRE Length = 334 Score = 112 bits (281), Expect = 4e-24, Method: Composition-based stats. Identities = 35/200 (17%), Positives = 72/200 (36%), Gaps = 36/200 (18%) Query: 2 VEDILAPG-LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTD--RQLKPQEAQH 58 V + L LR+V G NP + +G +++P+N W ++ + G R PQ+ Sbjct: 119 VPEKLGDDPLRLVIVGHNPSEHAWRSGHYYSNPSNHLWPLLRRTGIAPPDRIRGPQDDGL 178 Query: 59 LLDY-RCGVTKL-VDRPTVQANEVSKQELHAGGRKLIEKIEDY----------------Q 100 + G + P ++ + R ++ + Sbjct: 179 MPAVAGVGFLDVGCGVPGTDSSRFKSEVFEGWSRGFYSRLRAHMRRAATSIGCRCGLCGA 238 Query: 101 PQALAILGKQAYEQGF---------SQRGAQWGKQTLTIGS------TQIWVLPNPSGLS 145 P+ +A GK+ Y + + ++G+Q T++WV + SG + Sbjct: 239 PRLVAFTGKRQYLELMNVGAGPARNKVKTVEFGRQAALPPGWPLPPSTEVWVCTSTSGAA 298 Query: 146 RVSLEKLVEAYRELDQALVV 165 ++ E + Y +L + L Sbjct: 299 PMTREAREQPYVQLAERLAQ 318 >UniRef50_C7DIS9 Putative uncharacterized protein n=2 Tax=unclassified Euryarchaeota RepID=C7DIS9_9EURY Length = 189 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 37/145 (25%), Positives = 66/145 (45%), Gaps = 12/145 (8%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ----------LKPQE 55 L+ ++F GINP S PF++ FW ++ +A + ++ E Sbjct: 18 LSKNALIMFVGINPHPGSYKRKVPFSN-NKMFWYLLSRAHIIEEPTEYLMNDKKLVQMYE 76 Query: 56 AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 + + +YR LVDRPT +++ K E G ++ + I Y+P+ + +GK YE+ Sbjct: 77 GKFVQEYRLNFINLVDRPTSDVSKLRKGEERPGAARVFDSIRIYKPRLVCFIGKITYEKF 136 Query: 116 FSQRGAQWG-KQTLTIGSTQIWVLP 139 + +G KQ + ST + P Sbjct: 137 SGSKHTDYGYKQPIYGISTYVCRFP 161 >UniRef50_B9LA93 Putative uncharacterized protein n=1 Tax=Nautilia profundicola AmH RepID=B9LA93_NAUPA Length = 172 Score = 112 bits (280), Expect = 5e-24, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 68/165 (41%), Gaps = 13/165 (7%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE-AQHLLDY 62 ++ +++ G P + S F ++HP N+FW+++ + +E + L + Sbjct: 14 PVVFEDSKLLILGTFPSIKSFENEFYYSHPRNQFWEILSIVFKDKKPETIEEKIKFLKKH 73 Query: 63 RCGVTKLVDRPTV---QANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQ 118 + + V + + + + L + ++ Y + +A+ + A + Sbjct: 74 KIALWDAVCECKRKKGNSRDDNLEILKPC--DIESLLKRYPNIKKVAVTSRTA-----EK 126 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQA 162 ++ + + LP+PS L +R+ +++ E +++L + Sbjct: 127 VIKKYLTNSQFSILNSLLYLPSPSPLNARMKIKEKAETWKKLLEL 171 >UniRef50_Q4P6C9 Putative uncharacterized protein n=1 Tax=Ustilago maydis RepID=Q4P6C9_USTMA Length = 649 Score = 112 bits (280), Expect = 6e-24, Method: Composition-based stats. Identities = 39/102 (38%), Positives = 59/102 (57%), Gaps = 3/102 (2%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEA---QH 58 V D+ A ++ CGINPG+ S+ +G FAH +N F+ ++ AG T R++KP++ Sbjct: 296 VPDLFAERNDIMLCGINPGVKSSSSGHHFAHRSNHFYPSLHLAGITSRRMKPEQDVEFPF 355 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ 100 L Y G+T L RPT + NE+ EL G L+EKI ++ Sbjct: 356 LRPYSLGLTNLAPRPTAEGNELLPSELIDGVPVLLEKILKWK 397 >UniRef50_B9CP28 T/U mismatch-specific DNA glycosylase n=1 Tax=Atopobium rimae ATCC 49626 RepID=B9CP28_9ACTN Length = 174 Score = 111 bits (279), Expect = 7e-24, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 59/167 (35%), Gaps = 13/167 (7%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA-GFTDRQLKPQEAQHLL 60 + I+ RV+ G P S GF F +P NRFW+V+ G + ++ + Sbjct: 16 LPPIVDRDSRVLLLGSMPSPKSREIGFYFGNPQNRFWRVMAGLWGEKPLESSEEKIRFCH 75 Query: 61 DYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + + ++ + A++ S + + +E + LG A Sbjct: 76 LHHVAICDVLKACDIIGASDASIENPVPN--DIASLLEQANIVQIYTLGGMATRLYHRHI 133 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALVV 165 LP+ S +R++L+ L+ A+ + + Sbjct: 134 --------EPTTHLAATSLPSTSPANARMTLDDLISAFTVVRKTAEA 172 >UniRef50_Q8PWD8 Conserved protein n=3 Tax=Methanosarcina RepID=Q8PWD8_METMA Length = 158 Score = 111 bits (277), Expect = 1e-23, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 60/163 (36%), Gaps = 26/163 (15%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQA--GFTDRQLKPQEAQHLL 60 + ++ G PG S + HP N FW+++ A L Sbjct: 7 PPVFDNNTEILILGSLPGDESLRRRQYYGHPGNDFWRLVGNAIGENLQDMDYETRLTTLK 66 Query: 61 DYRCGVTKLV---DRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQG 115 +R G+ + R Q + + ++E++ ++++ P+ + GK + Sbjct: 67 RHRIGLWDVFKAGKRKGSQDSNIREEEVNQ-----FSELKEMAPELKKVFFNGKAS---- 117 Query: 116 FSQRGAQWGKQTLTIG--STQIWVLPNPSGLSRVSLEKLVEAY 156 G+ + + VLP+ SG++R +++K + Sbjct: 118 --------GRYEPVLSAMGYETKVLPSSSGINRRNVKKRESEW 152 >UniRef50_A7IAF7 Putative uncharacterized protein n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7IAF7_METB6 Length = 162 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL-KPQEAQHLL 60 + I+ ++ G P L S G + +P N+FW++I D L P L Sbjct: 8 LPPIVGRDPEILILGSFPSLISLGKSEYYGNPKNQFWRIIEILFGIDHALSYPLRTHLLT 67 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGFSQ 118 ++R + + + + S + I P +A+ G A S Sbjct: 68 EHRIALWDALAFCRREGSMDSA--ICDPVANDITGFVAAHPTIRCIALNGSTAGRYFKSM 125 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 G +LP+ S + +S + + + Sbjct: 126 NPDLNGH-----------ILPSTSPAYASMSFAEKTLEWARI 156 >UniRef50_A0RQD6 Mug G:T/U mismatch-specific DNA glycosylase n=6 Tax=Bacteria RepID=A0RQD6_CAMFF Length = 157 Score = 110 bits (276), Expect = 2e-23, Method: Composition-based stats. Identities = 33/163 (20%), Positives = 64/163 (39%), Gaps = 17/163 (10%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE-AQHLLDY 62 + +++ G P + S GF +AH NRFWKV+ ++ Q+ LL + Sbjct: 8 PVFDSRSKILILGSFPSVKSREMGFYYAHSQNRFWKVLIALFGSEHLRNVQDKIDFLLKH 67 Query: 63 RCGVTKLVDRPT-VQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA 121 + + + +++ S + LI E+ +A+ G +Y Sbjct: 68 QVAIHDAAYSCSVKGSSDNSISSVKPANLDLI--FENADIKAVFANGNTSYAICKKFYNF 125 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQAL 163 ++ LP+ S + SLEKLV+ ++ + + L Sbjct: 126 ------------EVIKLPSTSPANAKFSLEKLVDEWKIILKYL 156 >UniRef50_A2U3I4 Putative uncharacterized protein n=2 Tax=Polaribacter RepID=A2U3I4_9FLAO Length = 159 Score = 109 bits (274), Expect = 3e-23, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 55/163 (33%), Gaps = 18/163 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL---KPQEAQH 58 ++ +L + G PG +A+ N FWK+IY+ T+ + Sbjct: 6 LDPVLPHNPTIFILGTKPGNLELRKQEYYANNGNSFWKIIYE--LTEEVFSKNYAERIDL 63 Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAYEQGF 116 L V +++ + Q + + +I + P + G++A++ Sbjct: 64 LKRNHIAVWDIIESGER--VKPGAQNVKNEIPNPVNEIVEMYPSVKQIIFNGQKAHDLYL 121 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGLS-RVSLEKLVEAYRE 158 I V+ + S + R S + + ++E Sbjct: 122 KHFS--------KIEGIIYEVVLSTSPANTRFSFREKLNNWKE 156 >UniRef50_B7CDR6 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CDR6_9FIRM Length = 153 Score = 109 bits (273), Expect = 3e-23, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 56/160 (35%), Gaps = 21/160 (13%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 ++ I+ RV+ G P + S +A+ NRFWKV+ + + LL Sbjct: 8 LDPIVFKDSRVLILGSMPSVISLKEQMYYANKTNRFWKVLEAI------YQENDKIKLLK 61 Query: 62 -YRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQ-PQALAILGKQAYEQGFSQR 119 + + + + + + E + Y+ + + G+ + Sbjct: 62 ISHIALWDICHSCIRKTS-ADSEIKDIEPNDIQELLNRYKNIEKVICNGRTSQNTMQKY- 119 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRE 158 + + P+ S + SLE+L+E Y++ Sbjct: 120 ----------FPNIETVYCPSTSSANAQFSLEQLIEIYKK 149 >UniRef50_C5K9X8 Putative uncharacterized protein n=2 Tax=Perkinsus marinus ATCC 50983 RepID=C5K9X8_9ALVE Length = 267 Score = 108 bits (269), Expect = 9e-23, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 56/192 (29%), Gaps = 38/192 (19%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---- 58 + RV+ G P S G + +P N FW V+ A D P Sbjct: 77 PPSIGDSPRVLILGSAPCEMSLEKGQFYGNPRNHFWAVMAYA--MDDPTFPMLDFEARCS 134 Query: 59 -LLDYRCGVTKLVDRPTV---QANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQAY 112 L + + D T + +E L+E P + G +Y Sbjct: 135 VLHSRGIALWNICDVFTRYKSSDATLHCEEYTDIAGLLVE-----HPTIGYILCNGSASY 189 Query: 113 EQGFS-----QRGAQWGKQTLTIGSTQ-------------IWVLPNPSGLSRV--SLEKL 152 + + G Q + I LP+ S + + + Sbjct: 190 DCMLKYDLAARLEVSTGLQPTPLSLATQARGAAQAVDLPVILKLPSTSPANAMADPVGTK 249 Query: 153 VEAYRE-LDQAL 163 + ++E L +AL Sbjct: 250 GKVWKEKLQEAL 261 >UniRef50_A8RDC8 Putative uncharacterized protein n=1 Tax=Eubacterium dolichum DSM 3991 RepID=A8RDC8_9FIRM Length = 166 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 53/165 (32%), Gaps = 13/165 (7%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDY 62 + I + V+ G P ++S + N FWK + Q+ + Sbjct: 6 DPITSKEPYVLILGSMPSITSLEKQEYYGFAHNHFWKFMANIFAMPTDTYEQKKAIIRKR 65 Query: 63 RCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGKQAYEQGFSQRGA 121 + ++ + + + + I+ + + + GK++YE Sbjct: 66 HLALWDVIHSCEREGS-LDSAIRKERVNDIAGFIQKHTTIRLVICNGKKSYELYQKHF-- 122 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLV-EAYRE-LDQALV 164 L + S +R E ++ E +++ L +AL Sbjct: 123 -------QNLGVSCVYLGSTSNANRSIKENILYEQWKQTLWEALG 160 >UniRef50_C3XK88 T/U mismatch-specific DNA glycosylase n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XK88_9HELI Length = 162 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 31/153 (20%), Positives = 51/153 (33%), Gaps = 14/153 (9%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYR 63 I V+ G P S GF + +P NRFWK++ + + LL YR Sbjct: 8 PIFNNESEVLILGSFPSKMSFEIGFYYGNPRNRFWKMLQDFFKVELVSVEAKKDFLLHYR 67 Query: 64 CGVTKLVDRPTVQANE----VSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + +V + + + K L L + Q + + GK+AY Sbjct: 68 IALWDIVCLGSNMQGDKESSMDKTLLATEISDLEALLAKTQIKTILCNGKKAY------- 120 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKL 152 K + LP+ S + V + Sbjct: 121 -FLLNKYYPHLKDMA-KSLPSTSPAN-VRFDSK 150 >UniRef50_D0J0R4 Putative uncharacterized protein n=6 Tax=Burkholderiales RepID=D0J0R4_COMTE Length = 178 Score = 107 bits (268), Expect = 1e-22, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 64/174 (36%), Gaps = 24/174 (13%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH--- 58 + + R + G PGL S +AHP N+FW ++ QA + P+ Sbjct: 19 LAAVADGRTRALVLGSFPGLKSLQLAQYYAHPQNQFWPLL-QALWPQHLQPPRSDYAGRC 77 Query: 59 --LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQAYEQ 114 LL G+ + + + S + A + P Q G ++Y+ Sbjct: 78 EWLLARGLGLWDVYASCEREGSLDSA--IRAAQVNDFAGLRQRCPDLQLALHNGGESYKH 135 Query: 115 GFSQRGAQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVRG 167 +QT +G + LP+ S + S ++ ++A+R AL G Sbjct: 136 A---------RQTEALGLIAL-KLPSTSPANAGWSFDRKLQAWR---HALAAHG 176 >UniRef50_C5EZE8 Putative uncharacterized protein n=2 Tax=Helicobacter RepID=C5EZE8_9HELI Length = 167 Score = 102 bits (255), Expect = 4e-21, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 63/165 (38%), Gaps = 9/165 (5%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE--AQHLLD 61 I+ +++ G P ++S F +AH NRFWK++ + +P+E L Sbjct: 7 PIIFSDSKILILGSFPSVASRQESFYYAHKQNRFWKILEGLFDSPLHNQPKECKIAFLKS 66 Query: 62 YRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 + + + +++ + + L + I++ Q Q + G++A + Sbjct: 67 HHIALFDITQECYIQNSSDSTLKILKPNP--IHSLIQNTQIQVIFTNGQKASKLY---CQ 121 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALV 164 + + LP+ S + +L L+ + ++ L Sbjct: 122 FFCHQDSPHFIPLPHIPLPSSSPANAKCTLPLLLREWSKILPYLT 166 >UniRef50_C7MKT3 G:T/U mismatch-specific DNA glycosylase n=1 Tax=Cryptobacterium curtum DSM 15641 RepID=C7MKT3_CRYCD Length = 172 Score = 102 bits (254), Expect = 6e-21, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 56/163 (34%), Gaps = 17/163 (10%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLK-------PQEA 56 + V+ G P + S F + +P NRFW VI + Sbjct: 13 PVFDSRSHVLILGSFPSVLSRQNSFYYGNPQNRFWHVIAACVGAPTPPNTPLNTSIQAKR 72 Query: 57 QHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF 116 Q L D + +++ ++ + HA L + +A+ G+ A Sbjct: 73 QLLKDAGIALWDVIESCDIKGS-ADSTIRHAVPANLNIIFDAASIEAVYANGRTAETLYR 131 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 A + I LP+ S + S E+L++++++ Sbjct: 132 RLAEAHTKR--------PIIGLPSTSPANAAWSKERLIDSWKK 166 >UniRef50_UPI0001BC5AE8 DNA glycosylase n=1 Tax=Fusobacterium sp. D12 RepID=UPI0001BC5AE8 Length = 310 Score = 101 bits (252), Expect = 8e-21, Method: Composition-based stats. Identities = 27/148 (18%), Positives = 55/148 (37%), Gaps = 16/148 (10%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---L 59 ++ G P + S F + HP NRFWK++ A + + L Sbjct: 172 PAFYEKNSIILILGSFPSVKSREENFFYGHPQNRFWKML--AKIFEEAFPETREKKKELL 229 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP-QALAILGKQAYEQGFSQ 118 +R V ++ ++ + S + + +I +Y P Q + G +Y+ Sbjct: 230 RKHRLAVWDVIHSCKIKGS--SDSSIQDVIPNDLTEILNYSPIQKIICNGATSYKYYKKY 287 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLSR 146 + + GK+ ++ LP+ S + Sbjct: 288 QEKELGKEAIS--------LPSTSPANA 307 >UniRef50_A8U688 Putative uncharacterized protein n=1 Tax=Carnobacterium sp. AT7 RepID=A8U688_9LACT Length = 161 Score = 101 bits (252), Expect = 1e-20, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 51/158 (32%), Gaps = 11/158 (6%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 +E I +++ G P S +A+ N+FWK+I ++ T + ++ LL Sbjct: 5 LEPIYDENTKILILGSAPSEKSLELQQYYANKGNQFWKIISESLNTTDPIDYEKRLELLS 64 Query: 62 YR-CGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 G+ + + + + + G ++Y++ Sbjct: 65 QNQIGLWDIYSHFDRIGS-LDSNFKQTKLNNFDALFSTASIKLIIANGDKSYQEV----- 118 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRE 158 ++ + + SG + ++ + E Sbjct: 119 ----LKSTIFEHYTVIKCLSTSGANNGKAQERKMEWNE 152 >UniRef50_B8DUI0 T/U mismatch-specific DNA glycosylase n=10 Tax=Bacteria RepID=B8DUI0_BIFA0 Length = 175 Score = 99.2 bits (246), Expect = 4e-20, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 55/155 (35%), Gaps = 14/155 (9%) Query: 13 VFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQL-------KPQEAQHLLDYRCG 65 +F G S GF + HP NRFW ++ + L +R Sbjct: 22 LFLGTMASPKSREAGFFYMHPRNRFWPIMQALFADPQDASDVVGDTPDSRRAFALRHRFA 81 Query: 66 VTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 + V+ V A++ S + + L E I + Q + G++A + Sbjct: 82 LWDTVESCDIVGASDSSIRNVVP--SDLSEIIRNSQISHIYACGQKADQLFHKYSEPML- 138 Query: 125 KQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 LP+ S + +L++LV+ YRE Sbjct: 139 --ECNGIHVSETALPSTSPAYASKTLDQLVDIYRE 171 >UniRef50_UPI0000D8867B hypothetical protein CIMG_10236 n=1 Tax=Coccidioides immitis RS RepID=UPI0000D8867B Length = 218 Score = 96.9 bits (240), Expect = 2e-19, Method: Composition-based stats. Identities = 31/80 (38%), Positives = 50/80 (62%), Gaps = 1/80 (1%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 ++ D + GL ++F G+NPG+ + TGF +AHP+N +WK+++ +G T RQ P + L Sbjct: 123 LLHDTIPQGLILLFIGVNPGIMTGQTGFVYAHPSNLYWKLLHWSGITMRQHPPSDTYRLP 182 Query: 61 D-YRCGVTKLVDRPTVQANE 79 + Y G T +V RPT A+ Sbjct: 183 ELYGVGNTNIVARPTRDASP 202 >UniRef50_Q04GE7 G:T/U mismatch-specific DNA glycosylase n=2 Tax=Oenococcus oeni RepID=Q04GE7_OENOB Length = 194 Score = 91.5 bits (226), Expect = 9e-18, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 51/146 (34%), Gaps = 11/146 (7%) Query: 3 EDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQ--LKPQEAQHLL 60 I +V+ G P S +GF + P N FW + + D K L Sbjct: 26 PPIYDENSKVLLLGSLPSPKSLASGFYYGSPQNIFWSTLAKTLGVDEPEKSKEARLSFAL 85 Query: 61 DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 +R + + + + S + + +++ Q + + GK+A E Sbjct: 86 KHRVAIWITLKEADIIGAQDS-TIKNPVPNDFSDILKNSQIRTIFTEGKKATELFNKLAA 144 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSR 146 + G+++ LP+ S +R Sbjct: 145 PKIGQES--------VYLPSTSPANR 162 >UniRef50_Q4T9X4 Chromosome undetermined SCAF7485, whole genome shotgun sequence n=1 Tax=Tetraodon nigroviridis RepID=Q4T9X4_TETNG Length = 482 Score = 89.9 bits (222), Expect = 3e-17, Method: Composition-based stats. Identities = 42/183 (22%), Positives = 66/183 (36%), Gaps = 37/183 (20%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ---AGFTDRQLKPQEA-- 56 + D+L NPGL P +++ +G + L Sbjct: 190 LPDLLD---------YNPGLR-----HHRNQPGPDG--CLHRTLVSGPRESFLSSSTTCM 233 Query: 57 -QHLL-DYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ 114 HL YR G T +V R T + ++S +EL GG+ L+EK++ ++P GK YE Sbjct: 234 TPHLPGKYRMGFTNMVARATPGSKDLSSKELREGGKILVEKLKKFKPLIAVFNGKCIYEM 293 Query: 115 G----FSQR--GAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAY---RELDQALVV 165 F ++ ++G Q I + + V Y REL L Sbjct: 294 FCRELFGKKPQKLEFGLQPHKIPDCDVRPC-----AQFPRAQDKVHFYINVRELRDQLKG 348 Query: 166 RGR 168 R Sbjct: 349 VER 351 >UniRef50_A7I1K0 G:T/U mismatch-specific DNA glycosylase n=1 Tax=Campylobacter hominis ATCC BAA-381 RepID=A7I1K0_CAMHC Length = 204 Score = 88.8 bits (219), Expect = 7e-17, Method: Composition-based stats. Identities = 35/209 (16%), Positives = 65/209 (31%), Gaps = 61/209 (29%) Query: 4 DILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG--------------FTDR 49 I +++ G P ++S GF + H NRFWK + D Sbjct: 8 PIFDKNSKILILGSMPSIASIKFGFYYGHKNNRFWKTLAAIFDDKLLGSRNLEQIMILDL 67 Query: 50 QLKP--------------------------------QEAQHLLDYRCGVTKLVDRP-TVQ 76 Q + LL + + V Sbjct: 68 QSTEFSAQQNFKQTESKNSEIYAKFAINGKINVHIKYKIDFLLREKIALYDTVKSADIKG 127 Query: 77 ANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIW 136 + + S + A L E +E +A+ GK++YE ++ Sbjct: 128 SADASIKNYRAN--DLDEILEIADIKAIFANGKKSYEVAKKVLN----------NDKRLM 175 Query: 137 VLPNPSGL-SRVSLEKLVEAY-RELDQAL 163 +L + S +RVSL+ L+ + +++ + L Sbjct: 176 LLNSTSSANARVSLDDLINEWSQKIFKFL 204 >UniRef50_A8A982 Phage SPO1 DNA polymerase-related protein n=1 Tax=Ignicoccus hospitalis KIN4/I RepID=A8A982_IGNH4 Length = 191 Score = 83.8 bits (206), Expect = 2e-15, Method: Composition-based stats. Identities = 40/163 (24%), Positives = 64/163 (39%), Gaps = 23/163 (14%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWK-VIYQAGFTDRQLKPQEAQHLLDYRCG 65 G VVF G PG G PF A R + I Q GF + Sbjct: 31 DEGAEVVFVGEAPGKEEDEEGRPFVGRAGRLLRSTISQIGFK---------------KYY 75 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 +T +V RP +++E+ A LIE++ +P+ + LG+ A E +G G Sbjct: 76 ITNVVKCRPPGNRTP-TEEEVEACKPYLIEELNCIRPKLVVALGRTAAEALLGLKGPLKG 134 Query: 125 KQTLTI----GSTQIWVLPNPSGLSRVSLEKLVEAY-RELDQA 162 + + G ++ VL + + KL E + +L +A Sbjct: 135 VRGKVLVASAGGARVKVLVTYHPAAVLRNPKLKEQFVSDLKKA 177 >UniRef50_Q12X97 Putative uncharacterized protein n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12X97_METBU Length = 160 Score = 81.1 bits (199), Expect = 1e-14, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 47/143 (32%), Gaps = 13/143 (9%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVI-YQAGFTDRQLKP-QEAQHL 59 ++ I+ ++ G P S +A N+FW ++ + G ++++ L Sbjct: 9 LDQIVNEETEILITGTFPSEKSRNQKQYYADSRNQFWNLMSFVLGIDLKKIEYNNRVSIL 68 Query: 60 LDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 ++ G+ ++ + + K + Q + + G + E Sbjct: 69 KKHKIGLWDTIESCEIGGS-SDKNICNPQYNDFSHL---TQIKKIVCNGGKVEETCNKHC 124 Query: 120 GAQWGKQTLTIGSTQIWVLPNPS 142 G Q + VL + S Sbjct: 125 NVLDGVQ-------VVRVLSSSS 140 >UniRef50_C4FFI1 Putative uncharacterized protein n=1 Tax=Bifidobacterium angulatum DSM 20098 RepID=C4FFI1_9BIFI Length = 159 Score = 80.3 bits (197), Expect = 2e-14, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 46/140 (32%), Gaps = 15/140 (10%) Query: 28 FPFAHPANRFWKVIYQAGFTDRQLKPQEAQ---------HLLDYRCGVTKLVDRPTVQAN 78 F + HP NRFW V+ D L L + + ++ ++ Sbjct: 8 FNYGHPRNRFWPVMAAVFHDDSCLCENTDPIQTVRTCKGFALRHHMALWDVIVSCDIEGA 67 Query: 79 EVSKQELHAGGRKLIE-KIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWV 137 S + + Q + G +A + QR QT + + Sbjct: 68 --SDASIRNAVPNDFSDMLRQSQISHIFTTGAKAAQLY--QRLCIPLLQTHGSDNVPMTR 123 Query: 138 LPNPSGLSR-VSLEKLVEAY 156 LP+ + + L +LVEAY Sbjct: 124 LPSTNPTNAGAKLPELVEAY 143 >UniRef50_B9KXL9 Phage DNA polymerase-related protein n=17 Tax=cellular organisms RepID=B9KXL9_THERP Length = 230 Score = 79.5 bits (195), Expect = 4e-14, Method: Composition-based stats. Identities = 35/168 (20%), Positives = 66/168 (39%), Gaps = 22/168 (13%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCG 65 P V+F G PG G PF A +F +++ +AG Q+ Sbjct: 57 DPHAEVMFIGEAPGYHEDRQGRPFVGAAGQFLNELLGRAGLRREQVY------------- 103 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA-QW 123 +T +V RP + + E+ A L E+I +P + LG+ + + F + Sbjct: 104 ITNVVKCRPPGNRDPL-PDEIAACAPYLDEQIAVIRPSVIVTLGRYSMARWFPNEKISRI 162 Query: 124 GKQTLTIGSTQIWVLPNPSGLSRVS---LEKLVEAYRELDQALVVRGR 168 + G + V+P + + L++LVE + + ++ + R Sbjct: 163 HGKPRRFGD--VVVVPMYHPAAALHQPALKELVEKDFDALREIIAQAR 208 >UniRef50_C9KTN5 Putative G/U mismatch-specific DNA glycosylase n=1 Tax=Bacteroides finegoldii DSM 17565 RepID=C9KTN5_9BACE Length = 142 Score = 79.1 bits (194), Expect = 6e-14, Method: Composition-based stats. Identities = 17/128 (13%), Positives = 39/128 (30%), Gaps = 16/128 (12%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRF-----------WKVIYQAGFTDRQ 50 + I ++ G PG S + + N F W + Sbjct: 7 MPAIENSESEILILGTIPGGESLKYQQYYQNKDNLFGDIMFRICDPNWPFFEEVYTIP-- 64 Query: 51 LKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDY-QPQALAILGK 109 + Q LLD + + +++ N + + L ++ + + + + G+ Sbjct: 65 -YEKRKQLLLDNKISLWDILESCERDGNN-DRTIKNEQLNDLETYLKLHPKIRKICFNGQ 122 Query: 110 QAYEQGFS 117 +AY Sbjct: 123 KAYSYYKK 130 >UniRef50_B2V9S9 Phage SPO1 DNA polymerase-related protein n=18 Tax=Bacteria RepID=B2V9S9_SULSY Length = 217 Score = 76.4 bits (187), Expect = 3e-13, Method: Composition-based stats. Identities = 25/111 (22%), Positives = 37/111 (33%), Gaps = 14/111 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 +++F G PG G PF K I AG ++ Sbjct: 69 NAKIMFIGEAPGEEEDKQGRPFVGRAGKLLTKCIENAGHIREEVY-----------IANI 117 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 RP + +E A L+ +IE P+ L +LG AY F + Sbjct: 118 NKC-RPPNNRTP-TIEEQEACIPFLLRQIEIINPKVLCLLGATAYRGIFKK 166 >UniRef50_A8MCZ3 Uracil-DNA glycosylase superfamily n=4 Tax=Thermoproteaceae RepID=A8MCZ3_CALMQ Length = 231 Score = 73.0 bits (178), Expect = 4e-12, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 60/169 (35%), Gaps = 32/169 (18%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANR-------------FWKVIYQAGFTDRQLKPQ 54 R++ G+ P AH NR +K ++ G ++ Sbjct: 53 SNPRIMIVGLAPA----------AHGGNRTGRMFTGDASAQFLFKALHACGLSNNPYSLS 102 Query: 55 EAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRK-LIEKIEDYQPQALAILGKQAYE 113 +T +V N+ + E+ A + I +I +P+A+ +LG+ A+ Sbjct: 103 RIDGTKVNCVYITSVVK-CAPPDNKPTPGEVKACVGRWFINEINMVKPRAIVVLGEVAWR 161 Query: 114 QGFSQRGAQWGKQTL---TIGSTQIWVLPNPSGLS----RVSLEKLVEA 155 G + G + I T +++ +PS + R+ L LV Sbjct: 162 GVAMALGLRGGFKHGGVVKINDTAVYMSYHPSPRNTNTGRLRLSDLVNI 210 >UniRef50_B7KM64 Phage SPO1 DNA polymerase-related protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KM64_CYAP7 Length = 209 Score = 72.2 bits (176), Expect = 7e-12, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 61/170 (35%), Gaps = 27/170 (15%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKL 69 ++ G PG + TG PF A + D+ L L + +T + Sbjct: 50 ADIMLIGEAPGQTEDKTGKPFTGKAGK---------LLDKMLASIN---LERGQIYLTNI 97 Query: 70 VD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG-------- 120 + RP +++E+ L+ +IE P+ + + G AY +R Sbjct: 98 IKCRPPNNRTP-TEKEIETCRPYLLAQIEFINPKFIVLAGGSAYRGLTGKRKPISQVRGK 156 Query: 121 -AQWGKQTLTIGSTQIWVLPNPS---GLSRV-SLEKLVEAYRELDQALVV 165 +W ++L NPS G + + + L+ R+L++ Sbjct: 157 WFKWHNSKCISIFHPAYLLRNPSLEKGSPKWLTWQDLISLKRKLNKLTTK 206 >UniRef50_C9YCM9 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YCM9_9BURK Length = 323 Score = 72.2 bits (176), Expect = 7e-12, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 38/107 (35%), Gaps = 13/107 (12%) Query: 59 LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 LL+ GV + + + S + I + Q QA+ G ++++ Sbjct: 227 LLERGLGVWDVYASCEREGSLDSA--IRNAVPNDIAALHLPQLQAIGHNGGESFKHAR-- 282 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQALV 164 ++ LP+ S + S E+ + A+RE+ + Sbjct: 283 --------HTRTLGVPVYQLPSTSPANASWSFERKLAAWREVMERHG 321 >UniRef50_A0RV64 Uracil-DNA glycosylase n=2 Tax=Thaumarchaeota RepID=A0RV64_CENSY Length = 216 Score = 70.3 bits (171), Expect = 2e-11, Method: Composition-based stats. Identities = 35/158 (22%), Positives = 51/158 (32%), Gaps = 15/158 (9%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLD 61 V R + G PG G PF A R D++L E + Sbjct: 61 VPGTGPASARAILVGEAPGREEDLQGKPFVGGAGR---------VLDKEL---ERAGIPR 108 Query: 62 YRCGVTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG 120 +T +V RP S +E A L E+I P+ + ILG A G Sbjct: 109 STVYITNVVKCRPPKNRVP-SDEERDACIEYLKEEISVINPRIVCILGATALRSLLGLGG 167 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRE 158 G + I L + + + L+ A+R Sbjct: 168 IT-GIRGTIINQDNTDYLVSLHPAATMYRRDLLPAFRR 204 >UniRef50_UPI00017F04BF PREDICTED: similar to thymine-DNA glycosylase n=1 Tax=Sus scrofa RepID=UPI00017F04BF Length = 300 Score = 69.5 bits (169), Expect = 4e-11, Method: Composition-based stats. Identities = 17/39 (43%), Positives = 21/39 (53%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKV 40 + DIL L +V GINPGL +A G + P N FW Sbjct: 122 LPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWLC 160 >UniRef50_A4E9G3 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4E9G3_9ACTN Length = 134 Score = 69.5 bits (169), Expect = 4e-11, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 37/103 (35%), Gaps = 12/103 (11%) Query: 58 HLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQ-ALAILGKQAYEQGF 116 LL+ + +++ ++ + S + +E+I P + G A Sbjct: 35 MLLNGGVALWDVIESCDIKGS--SDASIKNVVPAHVERITGAAPIGQVVCNGGTAGRLYK 92 Query: 117 SQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRE 158 + G VLP+ S + LE+LVE +RE Sbjct: 93 RYLQERTGL--------SAIVLPSTSPANAAWRLERLVERWRE 127 >UniRef50_A7C715 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7C715_9GAMM Length = 131 Score = 68.7 bits (167), Expect = 7e-11, Method: Composition-based stats. Identities = 11/118 (9%), Positives = 33/118 (27%), Gaps = 6/118 (5%) Query: 45 GFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKI--EDYQPQ 102 G + L V ++ Q + S + + + Sbjct: 6 GIDSSLDYDERVILLQKKGISVWDVLQTCIRQGSLDSAIQESTMIANDFQSFYSNHSNIK 65 Query: 103 ALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYREL 159 ++ G +A + + + LP+ S + ++ E+ + ++ + Sbjct: 66 SVFFNGAKAEMVYKKHVIPSL---SEKWLKLEYFRLPSTSPALASLNREEKLLKWQII 120 >UniRef50_B6IW73 Uracil-DNA glycosylase, family 4 protein, putative n=6 Tax=Alphaproteobacteria RepID=B6IW73_RHOCS Length = 298 Score = 68.0 bits (165), Expect = 1e-10, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 52/162 (32%), Gaps = 17/162 (10%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCG 65 P RV+ G PG G PF PA +++ G D Sbjct: 131 DPAARVMLVGEAPGEQEDRQGRPFVGPAGLLLDRMLAAIGL-------DRRADTPDRAVY 183 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS------Q 118 +T +V+ RP + + E+ A + I+ P L LG + + + Sbjct: 184 ITNVVNWRPPGNRSP-TDGEIAACLPFVQRHIDLVNPAVLVFLGGISAKSLLQRTEGITR 242 Query: 119 RGAQWGKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYREL 159 +W S + LP + EA+++L Sbjct: 243 LRGRWFDYASPGLSRPLPALPMLHPAYLLRNPAAKREAWKDL 284 >UniRef50_C1SMS3 Uracil-DNA glycosylase, family 4 n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SMS3_9BACT Length = 217 Score = 67.6 bits (164), Expect = 1e-10, Method: Composition-based stats. Identities = 31/162 (19%), Positives = 55/162 (33%), Gaps = 17/162 (10%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCG 65 P ++F G PG G PF A + K+I GFT + Sbjct: 66 DPAADLMFVGEGPGAEEDKQGIPFVGRAGQLLTKMIEAMGFTRDTVY------------- 112 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 + +V RP N + E + L +IE +P+ + LG A + + Sbjct: 113 IANIVKCRPPENRNPF-EDESNTCIPYLYRQIEYVKPKVIVCLGSVAAQNFLKTKLGITK 171 Query: 125 KQTLTIGSTQIWVLPNPSGLSRVSLEK-LVEAYRELDQALVV 165 + + + V+P + K + +L + + V Sbjct: 172 IRGEFVELNGVKVMPTYHPAYLLRNPKMKKPCWEDLQKVMAV 213 >UniRef50_B5YEZ9 DNA polymerase, bacteriophage-type n=2 Tax=Dictyoglomus RepID=B5YEZ9_DICT6 Length = 210 Score = 67.6 bits (164), Expect = 2e-10, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 51/150 (34%), Gaps = 18/150 (12%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P ++F G PG G PF A R + ++ L + Sbjct: 39 PNSSIMFIGEAPGFHEDQQGRPFVGAAGRLLTELIES------------MGLKREDIYIA 86 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG--AQWG 124 ++ RP + +E+ A L +IE P+ + LG+ A + Sbjct: 87 NVLKCRPPNNREPL-PEEIKACFPFLRRQIEIINPKVICTLGRYAAYAILGHSVNISNSH 145 Query: 125 KQTLTIGSTQIWVLPNPSGLSRVSLEKLVE 154 + + +I+V + + + KL+E Sbjct: 146 GKDYVVDGKRIFVTYH--PAAALYHGKLLE 173 >UniRef50_B6R445 Bacteriophage-type DNA polymerase N-terminal domain protein n=2 Tax=Alphaproteobacteria RepID=B6R445_9RHOB Length = 331 Score = 66.4 bits (161), Expect = 3e-10, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 54/169 (31%), Gaps = 35/169 (20%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 + RV+F G PG G PF + DR LK L + Sbjct: 175 SSKARVMFVGEAPGREEDIQGVPFVGRSGH---------LLDRMLKA---VGLDRSNVYI 222 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF------SQR 119 +V RP + E + +IE P+ + LG + + + Sbjct: 223 ANVVPWRPPGNRTP-TPHETEICRPFIQRQIELVDPEIVVFLGAASAKTLSGARDGIRKL 281 Query: 120 GAQW------GKQTLTIGS-TQIWVLPNPSGLSRVSLEKLVEAYRELDQ 161 +W G Q I + ++L +P + A+R++ Q Sbjct: 282 RGKWLTYNGEGTQRKAIATYHPAYLLRSP--------AEKRYAWRDMLQ 322 >UniRef50_B3U4U7 Uracil-DNA glycosylase family 4 n=1 Tax=Candidatus Nitrospira defluvii RepID=B3U4U7_9BACT Length = 223 Score = 66.0 bits (160), Expect = 4e-10, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 18/164 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P V+F G PG G PF A + + ++ L + Sbjct: 38 PHASVMFVGEAPGFHEDQKGEPFVGAAGQLLNDLLES------------AGLSRADIYIA 85 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 ++ RP + QE+ L+++I +P+ + LG A + ++ + Sbjct: 86 NVIKCRPPNNR-DPEPQEVETCKPFLLQQIAMIRPKLVCSLGNWATQTLLERKVGITRVR 144 Query: 127 TLTIGSTQIWVLPNPSGLSRVS----LEKLVEAYRELDQALVVR 166 + + P + + L+ L E +++L + L Sbjct: 145 GQAFYLKEFVLFPLLHPAAALHQGTMLQPLREDFQKLKEFLDRH 188 >UniRef50_B2IF85 Phage SPO1 DNA polymerase-related protein n=1 Tax=Beijerinckia indica subsp. indica ATCC 9039 RepID=B2IF85_BEII9 Length = 290 Score = 65.7 bits (159), Expect = 6e-10, Method: Composition-based stats. Identities = 35/165 (21%), Positives = 54/165 (32%), Gaps = 22/165 (13%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P V+ G PG G PF A + DR L + Y + Sbjct: 129 PNSSVMIIGEAPGADEDRQGRPFVGRAGQ---------LLDRMLAAVDLDRTQVY---IA 176 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR----GAQ 122 +V RP + E+ A + +IE P+ + LG + + + + Sbjct: 177 NIVPWRPPGNRTP-TPLEIAACLPFIRRQIELVSPRFIVCLGAPSAQTLLGTKEAITRLR 235 Query: 123 WGKQTLTIGSTQIWVLPNPSGLS--RVSLEKLVEA--YRELDQAL 163 + + G I VLP R EK +R L +AL Sbjct: 236 GRWRDWSCGGKTIHVLPMLHPAYLLRQPAEKKRAWADWRLLAKAL 280 >UniRef50_Q2LTM2 Uracil DNA glycosylase superfamily protein n=1 Tax=Syntrophus aciditrophicus SB RepID=Q2LTM2_SYNAS Length = 263 Score = 65.7 bits (159), Expect = 6e-10, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 52/159 (32%), Gaps = 16/159 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P +VF G PG G PF A + +A L + Sbjct: 111 PHSELVFVGEAPGADEDLQGRPFVGRAGHL------------LTRIIDAMGLKREDVYIC 158 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 ++ RP N + +E+ LI ++E +P+ + LG A + ++G Sbjct: 159 NILKCRPPGNRNPL-PEEIEVCEPWLIRQLEAIRPRIICALGTFAAQTLLKKKGIPITVL 217 Query: 127 TLTIGSTQ-IWVLPNPSGLSRVSLE-KLVEAYRELDQAL 163 I ++P + + + ++ + + Sbjct: 218 RGQFHDYHGIKLMPTYHPAYLLRNPGAKRQVWEDVQKIM 256 >UniRef50_B2A668 Phage SPO1 DNA polymerase-related protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A668_NATTJ Length = 195 Score = 64.9 bits (157), Expect = 9e-10, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 51/159 (32%), Gaps = 17/159 (10%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 ++ G PG PF P K++ F+ + Sbjct: 38 DEQADIMLIGEGPGAKEDELQRPFVGPAGELLTKMLAAINFSRDDVY------------- 84 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 +T +V RP +++E+ L ++I Q + + ILG A + + Sbjct: 85 ITNIVKCRPPNNRTP-NQEEMDTCLPILRQQIRLIQSKIIVILGGAALKGLIDKNMKITK 143 Query: 125 KQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQA 162 + + I +LP + K E++ +L Sbjct: 144 ARGSWLERKGIHILPTYHPAYLLRNPAKKKESWEDLQNL 182 >UniRef50_Q2GCJ7 Uracil-DNA glycosylase, family 4 n=4 Tax=Anaplasmataceae RepID=Q2GCJ7_NEOSM Length = 255 Score = 64.9 bits (157), Expect = 1e-09, Method: Composition-based stats. Identities = 31/163 (19%), Positives = 51/163 (31%), Gaps = 25/163 (15%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P +++ G PG G PF N K++ G L + + Sbjct: 97 PDAKIMLIGEAPGEQEDKLGIPFCGRSGNLLDKMLQAIG-------------LDRTKVYI 143 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS-------- 117 T V RP + +E+ L I P+ + +LG A Sbjct: 144 TNAVFWRPPGNRRP-TPEEIEICKPFLEHHIRLVLPKLIVLLGSTAASSVLKSQDRISTL 202 Query: 118 QRGAQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELD 160 + K V+ +PS L R +K + A+ +L Sbjct: 203 RNKLHEYKNPFIREGITTTVMFHPSYLLRQPAQKKL-AWEDLK 244 >UniRef50_B3DV47 Uracil-DNA glycosylase n=1 Tax=Methylacidiphilum infernorum V4 RepID=B3DV47_METI4 Length = 276 Score = 64.9 bits (157), Expect = 1e-09, Method: Composition-based stats. Identities = 31/165 (18%), Positives = 58/165 (35%), Gaps = 12/165 (7%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P ++F G PG G PF PA + K+I G + Q+ Sbjct: 106 PYAELMFVGEAPGAEEDLQGEPFVGPAGQLLTKMIRAMGLSREQVYIANVLKCRP----- 160 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 + + + + +E+ L+ +++ P+ + LG AYE + G + + Sbjct: 161 -DI-PKGQKGNRKPTTEEMSTCLPYLLSQLKLINPKVIVALGATAYEGLTGKVGIKISEV 218 Query: 127 TLT---IGSTQIWVLPNPSGLSR-VSLEKLVEAYRELDQALVVRG 167 + +PS L SL + + +L A+ G Sbjct: 219 RGRCFSFNGATLIPTYHPSFLLHNPSLTNKRKVWEDLLTAMGKLG 263 >UniRef50_D1QTK9 Mug protein n=1 Tax=Prevotella oris F0302 RepID=D1QTK9_9BACT Length = 200 Score = 64.5 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 60/180 (33%), Gaps = 20/180 (11%) Query: 4 DILAPGLRVVFCGINP-GLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQH---- 58 L R++ G P + + + N W++ F D+ + Sbjct: 21 PWLPKNARLLMLGTFPPAPKRWCMEWYYPNFQNDMWRIFGYLFFGDKMHFVDQEHKTFNV 80 Query: 59 ------LLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQ 110 L + + R N S ++L ++ + P +++ G+ Sbjct: 81 DAIKALLKEKGVAIFDTALRIRRTKNNASDKDLEIVEPSDLDNMLRSLPACKSILAAGQL 140 Query: 111 AYEQGFSQRGAQ----WGKQTLTIGS---TQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 A E G + G+ T +++ +P+ S + +EK E YR++ + L Sbjct: 141 ATEVFTKHYGIKTIPKMGQYTEFTFEGRTLRLYRMPSSSRAYPMQIEKKAEFYRKMFEEL 200 >UniRef50_B2SQC4 T/U mismatch-specific DNA glycosylase n=13 Tax=Xanthomonadaceae RepID=B2SQC4_XANOP Length = 121 Score = 64.5 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 20/122 (16%), Positives = 43/122 (35%), Gaps = 6/122 (4%) Query: 45 GFTDRQLKPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--Q 102 G Q LLD+R GV ++ + + + + + + + P + Sbjct: 2 GLDAAASYASRLQALLDHRVGVWDVIGQCERRGSLDTSIVAASIVVNPLPALLVTLPQLR 61 Query: 103 ALAILGKQAYEQGFSQRGAQWGKQTLTIGSTQIWVLPNPSGL-SRVSLEKLVEAYRELDQ 161 +A G A + + + + LP+ S + SL +L A++ + Sbjct: 62 LVACNGAAAAQAWRRHVQPLLSAKLRALP---VVALPSTSPANAAWSLPRLATAWQPVCD 118 Query: 162 AL 163 A+ Sbjct: 119 AV 120 >UniRef50_C8SCY8 Phage SPO1 DNA polymerase-related protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8SCY8_FERPL Length = 211 Score = 64.5 bits (156), Expect = 1e-09, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 62/161 (38%), Gaps = 24/161 (14%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 ++VF G PG G PF A ++ K + + G ++ + Sbjct: 45 EKAKIVFVGEAPGREEDLQGRPFVGAAGQYLTKTLEKCGIKRSEVF-------------I 91 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ---GFSQR--- 119 T ++ RP + +E+ L++++ + +P + LG+ + + F ++ Sbjct: 92 TNILKCRPPNNR-DPKPEEVEKCFPYLVKQLHEIKPDVIVCLGRHSAREILNFFGKKFKG 150 Query: 120 -GAQWGKQTLT-IGSTQIWVLPNPSGLSRVSLEKLVEAYRE 158 A GK + ++ ++P + + +L E + E Sbjct: 151 ISADRGKIFEAELDGKKVKIIPTYHPAAVLYKPQLKEYFEE 191 >UniRef50_A0Q3G6 Phage SPO1 DNA polymerase-related protein n=5 Tax=Clostridium RepID=A0Q3G6_CLONN Length = 190 Score = 64.1 bits (155), Expect = 2e-09, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 53/156 (33%), Gaps = 14/156 (8%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P ++F G PG TG PF A + + G L +T Sbjct: 38 PKANLMFIGEAPGADEDRTGKPFVGKAGQ----LLTKGLMALDLYRDRDY-------YIT 86 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 + RP N + E +A + L ++ +P+ + LG + + + + Sbjct: 87 NICKCRPENNRNPMD-HEAYACIQYLRNQVALIKPKIIVCLGAVSGKYIIDKDLRITRDR 145 Query: 127 TLTIGSTQIWVLPNPSGLSRVSLEKLVE-AYRELDQ 161 I ++++P + + + + +L + Sbjct: 146 GKWIEKKGVYIMPIFHPAAILRDDNKKPLFWGDLKE 181 >UniRef50_B0VFE1 Uracil-DNA glycosylase n=1 Tax=Candidatus Cloacamonas acidaminovorans RepID=B0VFE1_9BACT Length = 217 Score = 63.7 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 39/159 (24%), Positives = 58/159 (36%), Gaps = 19/159 (11%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P + G PG TG PF A + ++ L L +T Sbjct: 61 PNAIAMLIGEGPGEQEDITGRPFVGAAGQ---------LLEKMLSAIN---LQRSDVYIT 108 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 +V RP N S +E A L+E+I+ QP+ L ILG A + + G Sbjct: 109 NIVKCRPPGNRNPES-EERKACLPYLLEQIQIIQPKILLILGLVAAQTLLNNNNT-LGWH 166 Query: 127 TLTIG---STQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 I +V +PS L R + E A+ +L + Sbjct: 167 RGKIHYFMDIPAFVTYHPSALLR-NQEWKKPAWIDLQEF 204 >UniRef50_Q3SAA1 Uracyl DNA glycosylase n=4 Tax=Euryarchaeota RepID=Q3SAA1_9EURY Length = 196 Score = 63.7 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 61/162 (37%), Gaps = 23/162 (14%) Query: 11 RVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVTKL 69 +V+F G PG G PF A ++ +++ G + + +T + Sbjct: 44 KVMFVGEAPGKWEDIKGRPFVGAAGKYLDELLSSIGLSRDDVF-------------ITNI 90 Query: 70 VD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTL 128 V RP + + +E+ A L +IE P+ + LG+ + + + G + Sbjct: 91 VKCRPPNNR-DPTSEEISACAPYLEWQIETMAPRIICPLGRFSAKFILEKFGFTMKSISA 149 Query: 129 TIGSTQ---IWVLPNPSGLSRVS----LEKLVEAYRELDQAL 163 G I ++P + + +L +++ L + L Sbjct: 150 AQGKIHRGTITIIPMYHPAAALYHGKWKAELEKSFTVLKEEL 191 >UniRef50_A8UU74 N-terminus of phage SPO1 DNA polymerase n=3 Tax=Aquificales RepID=A8UU74_9AQUI Length = 220 Score = 63.7 bits (154), Expect = 2e-09, Method: Composition-based stats. Identities = 26/113 (23%), Positives = 43/113 (38%), Gaps = 13/113 (11%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P VVF G PG G PF A ++ + + K +E L +T Sbjct: 67 PYSPVVFVGEAPGEDEDRQGRPFVGRAGKY---LNK--------KIEEVLGLRREEVYIT 115 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQR 119 + RP + E+ A L ++I+ +P+ + LG A E ++ Sbjct: 116 NVCKCRPPGNRKP-TPAEISACFPYLKKEIDIIKPKVICCLGATAGEGILGKK 167 >UniRef50_A8PQ90 Phage SPO1 DNA polymerase domain protein n=1 Tax=Rickettsiella grylli RepID=A8PQ90_9COXI Length = 240 Score = 63.3 bits (153), Expect = 3e-09, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 52/165 (31%), Gaps = 33/165 (20%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWK-VIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P ++ G PG G PF + ++ GF + Sbjct: 82 PHADLLVVGEAPGAHEDKQGEPFVGRGGQLLTNMLSSIGFKREDYY-------------I 128 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG---------FS 117 ++ + S +E+ A L+ +I +P+ + +G+ A Sbjct: 129 ANILKSRPPNNRDPSLEEVSACTPYLLRQINLIKPKLILAVGRIAAHYLLSTESTMARLR 188 Query: 118 QRGAQWG--KQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELD 160 +G K L + ++L +P + +A++++ Sbjct: 189 GNLFYFGERKIPLLVTYHPAYLLRSP--------REKRKAWQDMR 225 >UniRef50_Q73JU4 DNA polymerase domain protein n=1 Tax=Treponema denticola RepID=Q73JU4_TREDE Length = 264 Score = 62.6 bits (151), Expect = 5e-09, Method: Composition-based stats. Identities = 34/159 (21%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 + V+ G PG G PF A + D+ L E + C Sbjct: 106 FDNKIEVMVIGEGPGADEDAQGRPFVGKAGQ---------LLDKMLAAIELYRT--HNCY 154 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 +T +V RP + + E A L +I +P+A+ ++G+ A + + G Sbjct: 155 ITNVVKCRPPNNRDPL-PDETAACRNYLNAQIALIRPKAILVVGRVALQNLLETQEG-IG 212 Query: 125 KQTLTIGSTQIWVL---PNPSGLSRVSLEKLVEAYRELD 160 K Q L +PS L R + A+ +L Sbjct: 213 KMHGKFFEYQNIPLMATYHPSALLR-DVNLKRPAWEDLK 250 >UniRef50_C4G9N7 Putative uncharacterized protein n=1 Tax=Shuttleworthia satelles DSM 14600 RepID=C4G9N7_9FIRM Length = 255 Score = 62.6 bits (151), Expect = 5e-09, Method: Composition-based stats. Identities = 15/112 (13%), Positives = 37/112 (33%), Gaps = 7/112 (6%) Query: 52 KPQEAQHLLDYRCGVTKLVDRP-TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQ 110 + ++ L R + +++ S + + + + + + G Sbjct: 139 REEKRHFLHINRIAAWDTISECDIKGSSDASIKNVIP--SDIPMLLAACPIRVILCNGAS 196 Query: 111 AYEQGFSQRGAQW-GKQTLTIGSTQIWV--LPNPSGL-SRVSLEKLVEAYRE 158 +Y + GS ++ V +P+ S + SLE L A+ + Sbjct: 197 SYRYFQKYLTSLLDDFYRERPGSRRVPVRQVPSTSPANAAWSLEDLTGAWEQ 248 Score = 55.3 bits (132), Expect = 8e-07, Method: Composition-based stats. Identities = 13/37 (35%), Positives = 19/37 (51%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQ 43 R++ G P + S F + HP NRFW+V+Y Sbjct: 16 DEKSRILILGSVPSVKSRELAFYYGHPQNRFWRVLYL 52 >UniRef50_A6Q9E7 Uracil-DNA glycosylase n=4 Tax=Epsilonproteobacteria RepID=A6Q9E7_SULNB Length = 222 Score = 62.6 bits (151), Expect = 5e-09, Method: Composition-based stats. Identities = 34/165 (20%), Positives = 58/165 (35%), Gaps = 24/165 (14%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P ++ G PG + TG PF + + + L +T Sbjct: 71 PHADILIIGEGPGATEDSTGKPFVGRSGELLSKMLENVL-----------GLKRADVYIT 119 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGA----- 121 +V RP + E H L+++IE +P+ + LG AY Sbjct: 120 NIVKCRPPNNRAP-TPTEAHTCQPYLLKQIELIKPKLILTLGATAYHYLTGDDTGITKVR 178 Query: 122 ----QWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQA 162 + G TL ++L NPS + E L++ +EL ++ Sbjct: 179 GSIHKHGNATLIPTYHPSYLLRNPS-AKKEVFEDLLKV-KELMKS 221 >UniRef50_D2RLY7 Putative uncharacterized protein n=1 Tax=Acidaminococcus fermentans DSM 20731 RepID=D2RLY7_ACIFE Length = 219 Score = 62.2 bits (150), Expect = 6e-09, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 56/180 (31%), Gaps = 25/180 (13%) Query: 3 EDILAPGLRVVFCGINP-GLSSAGTGFPFAHPANRFWKVIYQAGFTD---------RQLK 52 L RV+ G P F + + N W+++ F+D + Sbjct: 22 PPFLPETARVLILGTFPPAAEKRAMDFYYPNFQNDMWRILGMVYFSDPDHFRKGMEKAFD 81 Query: 53 PQEAQ-HLLDYRCGVTKLVDRPTV---QANEVSKQELHAGGRKLIEKIEDY-QPQALAIL 107 P + L + + V R A++ + L + + L Sbjct: 82 PTRIRLFLAETGIALGPTVLRAQREKGNASDKFLHVVEEAS--LAAMLRQIPHCRLLITT 139 Query: 108 GKQAYEQGFSQRGAQWGKQ------TLTIGSTQIW--VLPNPSGLSRVSLEKLVEAYREL 159 G++A E Q +T+ + LP+ S + LEK E Y+E+ Sbjct: 140 GEKATEIVLHQCTNPPKLPRTGTSVPITLDGRDLLLSRLPSTSRAYPMKLEKKAEMYKEV 199 >UniRef50_A5KS51 Phage SPO1 DNA polymerase-related protein n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KS51_9BACT Length = 207 Score = 61.8 bits (149), Expect = 9e-09, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 51/143 (35%), Gaps = 17/143 (11%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTK 68 ++VF G PG G PF A RF D L L +T Sbjct: 50 DAKIVFIGEAPGKKEDEQGLPFVGAAGRF---------LDEMLASAN---LKREDVYITN 97 Query: 69 LVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW---G 124 +V RP + S +E A + +++ P+ + LG+ + E + Sbjct: 98 IVKYRPPNNR-DPSPEEKRAFWPYVARQLKVIGPEVIVTLGRHSMEFFLPGQKISVVHGD 156 Query: 125 KQTLTIGSTQIWVLPNPSGLSRV 147 + +T G +I V+P + + Sbjct: 157 PKRVTFGKEKIVVVPLFHPAAAL 179 >UniRef50_A5WHQ7 G:T/U mismatch-specific DNA glycosylase-like protein n=59 Tax=Bacteria RepID=A5WHQ7_PSYWF Length = 211 Score = 61.8 bits (149), Expect = 9e-09, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 65/190 (34%), Gaps = 28/190 (14%) Query: 4 DILAPGLRVVFCGINPGLSS-AGTGFPFAHPANRFWKVIYQAGFTD---------RQLKP 53 +L ++ G P F + + N W++ + F D ++ P Sbjct: 21 PVLPSNATIMMMGTFPPTQDKWAMPFHYPNFYNDMWRIYGRVFFDDADYFRVGDEKRFDP 80 Query: 54 QEAQ-HLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQA--LAILGKQ 110 + + L + V + + + ++L ++++ P+ L G + Sbjct: 81 ERIKDFLSQHAIASCPTVKKAIREQGNAADKDLTIVEAVNLKQVLSQVPKVDVLFTTGGK 140 Query: 111 AYEQGFSQRGAQWGK-----------QTLTIG----STQIWVLPNPSGLSRVSLEKLVEA 155 A E + GK Q + ++ LP+ S ++L+K V A Sbjct: 141 ATEILMDLLAQEQGKAIPKSKMPKTNQHIDYNYGGRQLTLYRLPSTSRAYPLALDKKVAA 200 Query: 156 YRELDQALVV 165 YR+ QA+ Sbjct: 201 YRDFFQAMGK 210 >UniRef50_C6MFU1 Phage SPO1 DNA polymerase-related protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MFU1_9PROT Length = 251 Score = 61.4 bits (148), Expect = 1e-08, Method: Composition-based stats. Identities = 37/158 (23%), Positives = 58/158 (36%), Gaps = 20/158 (12%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCG 65 +F G PG TG PF QAG D+ L + + Sbjct: 85 DKNASWLFVGEGPGAREDETGEPFVG----------QAGKLLDQMLFAIQLNR--RHMIY 132 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS---QRGA 121 +T +V RP N + E L ++I +P+ + LGK A + + Sbjct: 133 ITNIVKCRPPNNRNPST-YEAAQCEPFLTQQIALIKPKIIIALGKVAAQSLLKCDNSISS 191 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYREL 159 GK + V +P+ L R SL + +A+R+L Sbjct: 192 LRGKIHD-YSGIPLIVTYHPAYLLR-SLPEKAKAWRDL 227 >UniRef50_C8PQ75 DNA polymerase domain protein n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PQ75_9SPIO Length = 299 Score = 61.0 bits (147), Expect = 1e-08, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 14/159 (8%) Query: 6 LAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 L + V+ G PG G PF A + D+ L+ + C Sbjct: 142 LTNSVEVMVIGEGPGADEDRQGRPFVGRAGQ---------LLDKMLEAIQLSR--KTNCY 190 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 +T +V RP + + +E + L +I +P+ + ++G+ A + Sbjct: 191 ITNVVKCRPPQNR-DPAPEERNCCAAFLDAQIRLIRPKMILVMGRIAAQHLLQTSDGIGK 249 Query: 125 KQTLTIGSTQIWVLPNPSGLSRVSLEK-LVEAYRELDQA 162 + I ++P + + E A+ +L + Sbjct: 250 LRGRFFDYQGIPLMPTYHPSALLRDESLKRPAWEDLKRF 288 >UniRef50_Q0AVT6 Uracil-DNA glycosylase-like protein n=4 Tax=Clostridia RepID=Q0AVT6_SYNWW Length = 233 Score = 60.6 bits (146), Expect = 2e-08, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 49/153 (32%), Gaps = 15/153 (9%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKL 69 R++F G PG G PF A + D+ L E + + ++ + Sbjct: 80 SRIMFIGEGPGKDEDEQGRPFVGKAGQ---------LLDKILVAAE---IPRSQVYISNV 127 Query: 70 VD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTL 128 V RP + E+ L +I QP + LG A + + + + Sbjct: 128 VKCRPPGNRLP-NPDEVRECRNYLEAQIRIMQPAIIVCLGAMASQVVIDPKASVSKIRGK 186 Query: 129 TIGSTQIWVLPNPSGLSRVSLEK-LVEAYRELD 160 + ++ + + A+++ Sbjct: 187 WFNRQGVKIMATYHPAALLRNPSYKRPAWQDFQ 219 >UniRef50_D2LEU1 Phage SPO1 DNA polymerase-related protein n=1 Tax=Rhodomicrobium vannielii ATCC 17100 RepID=D2LEU1_RHOVA Length = 289 Score = 60.6 bits (146), Expect = 2e-08, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 46/158 (29%), Gaps = 18/158 (11%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCG 65 +P ++ G PG G PF A + +++ G T+ Sbjct: 131 SPSAPLMLIGEAPGKDEDRLGQPFVGRAGQLLDRMLAAIGLTEADFYIT--------NIV 182 Query: 66 VTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG---AQ 122 RP S +E+ + +IE P+ + LG A Q Sbjct: 183 FW----RPPGNRTP-SPEEVLVCQPFVERQIELLAPRIVVFLGNAAARQLTGATEGIMKL 237 Query: 123 WGKQTLTIGSTQIWVLPNPSGLSRVSLE-KLVEAYREL 159 GK G + + + A+R+L Sbjct: 238 RGKWLELPGHPGVRAMATLHPAYLLRTPIAKRLAWRDL 275 >UniRef50_Q1H4A3 Phage SPO1 DNA polymerase-related protein n=2 Tax=Betaproteobacteria RepID=Q1H4A3_METFK Length = 330 Score = 60.6 bits (146), Expect = 2e-08, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 51/155 (32%), Gaps = 14/155 (9%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P +F G PG G PF PA + D L + + Sbjct: 168 DPNAEWLFVGEAPGAEEDRWGEPFVGPAGQ---------LLDNMLAAIALKRGENVYIA- 217 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 ++ RP + +E+ L ++E +P+ + LGK A + + + G Sbjct: 218 -NVLKCRPPANR-DPQGEEVVKCDPYLKRQVELIKPKLIVALGKFAAQSLLASDKSIAGL 275 Query: 126 QTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYREL 159 + + V+ + L +A+ +L Sbjct: 276 RGALHDFHGVPVIVTYHPAYLLRNLADKAKAWDDL 310 >UniRef50_A2BJ05 Predicted Uracil-DNA glycosylase n=8 Tax=Archaea RepID=A2BJ05_HYPBU Length = 221 Score = 60.3 bits (145), Expect = 2e-08, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 56/159 (35%), Gaps = 27/159 (16%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 +V+ G PG + G PF A + K++ AG Q+ +T Sbjct: 37 NAKVMVIGEAPGRNEDLQGRPFVGAAGQLLNKLLELAGLKREQVY-------------IT 83 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW--- 123 +V RP + + E+ A L+ +I +P+ + +G+ + G QW Sbjct: 84 NIVKCRPPGNR-DPQEDEVKACLPYLLHQIRLVKPKLIIAVGRHSARTLLELAGLQWRSM 142 Query: 124 -----GKQTLTIGSTQIWVLPNPSGLSRV---SLEKLVE 154 IG ++ + + + L + +E Sbjct: 143 SEQHGKTYHGEIGGVKLMITVTYHPAAALYKPPLRQKLE 181 >UniRef50_D1PL52 Uracil-DNA glycosylase n=5 Tax=Bacteria RepID=D1PL52_9FIRM Length = 188 Score = 60.3 bits (145), Expect = 3e-08, Method: Composition-based stats. Identities = 22/109 (20%), Positives = 42/109 (38%), Gaps = 13/109 (11%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P V+F G PG + G PF + + D L+ + + + Sbjct: 32 PNAEVLFVGEGPGANEDAQGQPFVGRSGQ---------LLDHYLEAVDLNR--EKNVYIA 80 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQG 115 +V RP + + +E A L ++ + +P+ + LG+ A +Q Sbjct: 81 NIVKCRPPQNRDPL-PEESAACLPWLRQQFQLLRPKIIVCLGRIAAQQL 128 >UniRef50_A8VW39 Putative uncharacterized protein n=1 Tax=Bacillus selenitireducens MLS10 RepID=A8VW39_9BACI Length = 59 Score = 59.9 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 11/44 (25%), Positives = 19/44 (43%) Query: 2 VEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG 45 + ++ RV+ G PG S +A+P N FW ++ Sbjct: 6 MNPVVDREPRVLILGSMPGAKSLDAQMYYANPRNHFWPILGSYF 49 >UniRef50_Q899E0 DNA polymerase, bacteriophage-type n=4 Tax=Clostridiales RepID=Q899E0_CLOTE Length = 188 Score = 59.9 bits (144), Expect = 3e-08, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 26/162 (16%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 ++F G PG TG PF A +F + + + + + Sbjct: 33 SKADIMFIGEAPGGDEDKTGRPFVGRAGQFLTKMLNS-----------LDLIRERDYYIC 81 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS------QRG 120 + RP N + E+ A L + QP+ + LG A Q S + Sbjct: 82 NICKCRPENNRNP-KEDEIKACIPYLRNQFALVQPRIIVCLGAIATNQIISDQWRITRDR 140 Query: 121 AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVE-AYRELDQ 161 +W K+ + + + E + +R+L + Sbjct: 141 GKWIKRKGCFITATFH------PAAVLRDEGKKQYIFRDLQE 176 >UniRef50_Q0W079 Predicted uracil-DNA glycosylase superfamily protein n=2 Tax=Euryarchaeota RepID=Q0W079_UNCMA Length = 185 Score = 59.5 bits (143), Expect = 5e-08, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 53/163 (32%), Gaps = 20/163 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 R++ G PG + G PF V+ AG + + Sbjct: 37 DARIMLIGEAPGRNEDKQGKPFVGAAGKVLNLVLESAGIRRDDVY-------------IA 83 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 +V RP K E+ A L +++E P+ + +LGK A E ++ + Sbjct: 84 NVVKCRPPENRVP-KKDEIEACSVYLKKQLEVISPKVVVLLGKTAAESYLGRKVVMGEEH 142 Query: 127 TLTIGSTQIWVLPNPSGLS----RVSLEKLVEAYRELDQALVV 165 V+ + R + L ++ + +A + Sbjct: 143 GKPFKHEDRTVVITYHPAAIIYNRKLKDALESDFKVIRKAALE 185 >UniRef50_B9XNJ8 Phage SPO1 DNA polymerase-related protein n=1 Tax=bacterium Ellin514 RepID=B9XNJ8_9BACT Length = 201 Score = 59.5 bits (143), Expect = 5e-08, Method: Composition-based stats. Identities = 27/147 (18%), Positives = 51/147 (34%), Gaps = 14/147 (9%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTK 68 +V+ G PG + TG PF + +F D L+ E +T Sbjct: 52 NSKVMIIGEAPGRNEDLTGRPFVGSSGKF---------LDHVLEGSEFD---RSDFFITN 99 Query: 69 LVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 + RP + + L +IE P+ + +LG A ++ ++ + + Sbjct: 100 ICKCRPPSNRTPKTGEIETCTSLYLYRQIELINPRIIVLLGGVAVKKMLGKKTVEEARCG 159 Query: 128 LTIGSTQIWVLPNPSGLSRVSLEKLVE 154 + + + L R E+L E Sbjct: 160 IVEHEGRKF-LVTYHPAVRFYREELAE 185 >UniRef50_C9MT19 Mug protein n=10 Tax=Prevotella RepID=C9MT19_9BACT Length = 195 Score = 59.1 bits (142), Expect = 6e-08, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 55/182 (30%), Gaps = 21/182 (11%) Query: 4 DILAPGLRVVFCGINP-GLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQE------- 55 L +++ G P + + + N W++ F D+Q E Sbjct: 10 PWLPSNAKLLLLGTFPPAPKRWCMEWYYPNYTNDMWRIFGYIFFGDKQYFVDEENKTYKL 69 Query: 56 ---AQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILGKQ 110 L + + R + ++L ++ + P +A+ G+ Sbjct: 70 ELLKAFLKEKGIAIFDTALRIRRTTGTAADKDLEIVEPADLDGMLRVLPECKAVLAAGQL 129 Query: 111 AYEQGFSQ-----RGAQWGKQTLTIGSTQIWVL---PNPSGLSRVSLEKLVEAYRELDQA 162 A + R + G + L P+ S + +EK E Y ++ + Sbjct: 130 ATKVFTDHYQIDARKMKMGDYKEFSFEGRTLKLYREPSSSRAYPMKVEKKSEYYVQMLKD 189 Query: 163 LV 164 + Sbjct: 190 IG 191 >UniRef50_Q119E8 Phage SPO1 DNA polymerase-related protein n=20 Tax=Cyanobacteria RepID=Q119E8_TRIEI Length = 242 Score = 59.1 bits (142), Expect = 6e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 56/163 (34%), Gaps = 25/163 (15%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P ++ G PG + G PF + + D+ L+ E D ++ Sbjct: 77 PEAEIMVIGEAPGQNEDEQGLPFVGKSGQ---------LLDKILQSVELSS--DRHVYIS 125 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRG------ 120 + RP + +E+ A L+E+I P+ + + G A + + Sbjct: 126 NAIRCRPPENRTP-TLKEVGACKPYLLEQIRLVNPKIILLTGATAVKALTGDKRGITKIR 184 Query: 121 ---AQWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELD 160 +W + ++L NPS R + ++++ Sbjct: 185 GQWIEWEGRLCMPIFHPAYLLRNPS---REPGKPKWLMWQDIQ 224 >UniRef50_A4EBV6 Putative uncharacterized protein n=1 Tax=Collinsella aerofaciens ATCC 25986 RepID=A4EBV6_9ACTN Length = 206 Score = 58.7 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 29/155 (18%), Positives = 45/155 (29%), Gaps = 18/155 (11%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAH-PANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P RV+F G PG + G PF ++ AG + Sbjct: 48 PRARVMFIGEAPGRNEDLQGEPFVGAAGEDLNGILSLAGLKREDVYIANVLKC------- 100 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 RP N +E+ A L E+I P + LG A + Sbjct: 101 -----RPPGNRNP-RPEEVLACSPFLREQIRSIWPDIIVTLGNPATHFVLKTEIGITKLR 154 Query: 127 TLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQ 161 V+P + + A++EL + Sbjct: 155 GRFHQMGHFVVMPTFHPAAALRN----PAWQELLE 185 >UniRef50_A9F5H8 DNA-directed DNA polymerase n=3 Tax=Bacteria RepID=A9F5H8_SORC5 Length = 215 Score = 58.7 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 51/170 (30%), Gaps = 26/170 (15%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P VVF G PG G PF A + + A + + + Sbjct: 58 DPSSEVVFVGEGPGAEEDLQGQPFVGAAGQLLDKMIAA------MGYHRDE------VYI 105 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS------QR 119 +V RP E+ A L ++ +P+ + LG A + + Sbjct: 106 CNIVKCRPPKNRKP-EPAEMAACSPYLTAQLALIKPKVIVALGATAVQGLIGTSEGITKL 164 Query: 120 GAQWGKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQALVVRGR 168 W +I ++P + E + +L + + G+ Sbjct: 165 RGTWKLYKGSIP-----IMPTFHPAYLLRQPSAKREVWSDLKEVMRHLGK 209 >UniRef50_B9KBE0 Phage SPO1 DNA polymerase-related protein n=1 Tax=Thermotoga neapolitana DSM 4359 RepID=B9KBE0_THENN Length = 192 Score = 58.7 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 54/153 (35%), Gaps = 20/153 (13%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPAN-RFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 ++VF G PG TG PF A +++ + G + + Sbjct: 39 DTKIVFVGEGPGEEEDKTGRPFVGRAGMLLTELLNEVGIKREDVY-------------IC 85 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQ---- 122 +V RP + +E+ A G L+ +IE P + LG A + Sbjct: 86 NVVKCRPPNNRTP-TTEEMFACGHFLLAQIEIINPSVVVALGATALSFFMDGKKVSITKV 144 Query: 123 WGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEA 155 G +G +I +PS L R ++L + Sbjct: 145 RGNPMDWLGGKKIVPTFHPSYLLRNRSKELRKI 177 >UniRef50_C7RSP2 Phage SPO1 DNA polymerase-related protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RSP2_9PROT Length = 287 Score = 58.7 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 34/165 (20%), Positives = 53/165 (32%), Gaps = 21/165 (12%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 G +F G PG G PF PA + D L + + D Sbjct: 120 DHGADWLFIGEGPGAEEDARGEPFVGPAGK---------LLDAMLAAIDLRRGADVYIA- 169 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 V RP E+ A L +IE QP+ + +LG+ A G+ Sbjct: 170 -NAVKCRPPSNRTP-EAVEMVACAPYLRRQIELIQPRLIVLLGRAAAHALLGDVGSLASL 227 Query: 126 QTLTI-------GSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQA 162 + + G I V+ + L +A+ +L +A Sbjct: 228 RGRPLEYREGAGGRVAIPVVVTYHPAYLLRTLPDKAKAWEDLCRA 272 >UniRef50_B2S2H6 DNA polymerase, bacteriophage-type n=2 Tax=Treponema pallidum RepID=B2S2H6_TREPS Length = 277 Score = 58.7 bits (141), Expect = 7e-08, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 49/157 (31%), Gaps = 16/157 (10%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 V+ G PG +G PF + + ++ G + C +T Sbjct: 119 DADVLVVGEAPGAEEDRSGRPFVGRSGKLLDAMLAAIGLS------------RQQNCYIT 166 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 +V RP + E R L + ++P A+ +LG+ A + + Sbjct: 167 NVVKCRPPRNRTP-TPHETACCARFLHAHLTLHRPCAILVLGRCAAQHMLQTTDGIGKLR 225 Query: 127 TLTIGSTQIWVLPNPSGLSRVSLEK-LVEAYRELDQA 162 I +L + + E A+ +L Sbjct: 226 GRFFTYQGIPLLATYHPSALLRDEALKRPAWEDLKTF 262 >UniRef50_D0SIJ1 G:T/U mismatch-specific DNA glycosylase n=42 Tax=Bacteria RepID=D0SIJ1_ACIJU Length = 196 Score = 58.3 bits (140), Expect = 1e-07, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 63/184 (34%), Gaps = 21/184 (11%) Query: 2 VEDILAPGLRVVFCGINPGLSS-AGTGFPFAHPANRFWKVIYQAGFTDRQL--------- 51 ++ L +++ G P + F + + N W++ F D+ Sbjct: 11 LKPFLPYNAKLLMLGSFPPPQNRWRMNFYYPNYQNDMWRIFGLIFFKDKNYFLDLSNKNF 70 Query: 52 -KPQEAQHLLDYRCGVTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQP--QALAILG 108 + + + L++ G+ R S + L + ++ P + G Sbjct: 71 KEQKIREFLIEIGVGIFDTAFRIKRLQGNASDKFLEIVTPTDLSELLIQIPACHTIMTTG 130 Query: 109 KQAYEQGFSQRGAQWGK----QTLTIGSTQ----IWVLPNPSGLSRVSLEKLVEAYRELD 160 +A + K Q++ + + ++ LP+ S ++L+K EAY+ Sbjct: 131 DKATDTLMQHFPENPIKPMIGQSVQVNYMERELSLYRLPSSSRAYPLALDKKAEAYKLFF 190 Query: 161 QALV 164 + + Sbjct: 191 EEIG 194 >UniRef50_B1KRY1 Uracil-DNA glycosylase n=12 Tax=Clostridia RepID=B1KRY1_CLOBM Length = 190 Score = 58.3 bits (140), Expect = 1e-07, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 49/151 (32%), Gaps = 14/151 (9%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 ++F G PG TG PF A + + L + + D + + Sbjct: 37 IMFIGEAPGADEDRTGRPFVGKAGQ---------LLTKALLALDFKR--DEHYYICNVCK 85 Query: 72 -RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLTI 130 RP ++E L ++ +P+ + LG A + ++ + I Sbjct: 86 CRPENNRTP-KEEEAMTCLPYLRNQVALIKPKIMICLGATALKYIMGEQFRITRDRGKWI 144 Query: 131 GSTQIWVLPNPSGLSRVSLEKLVE-AYRELD 160 +++ + E + +++L Sbjct: 145 ERKGYYIMATFHPAALFRDESKKKLLWQDLK 175 >UniRef50_C1DCC6 Probable DNA-directed DNA polymerase, bacteriophage-type n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1DCC6_LARHH Length = 259 Score = 58.3 bits (140), Expect = 1e-07, Method: Composition-based stats. Identities = 34/151 (22%), Positives = 56/151 (37%), Gaps = 16/151 (10%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLVD 71 ++ G PG G PF PA + + A T P++ V +V Sbjct: 104 LMLIGEGPGEQEDLQGRPFVGPAGK----LLDAMLTATGWSPEQD-------MYVCNVVK 152 Query: 72 -RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYE--QGFSQRGAQWGKQTL 128 RP N E+ A L +I +P+ + LG+ A + A+ QT Sbjct: 153 CRPPGNRNP-QPDEMAACQAYLQTQIRLVRPRLIVALGRVAAQSLLATELSVARLRGQTH 211 Query: 129 TIGSTQIWVLPNPSGLSRVSLEKLVEAYREL 159 + V +P+ L R +L A+++L Sbjct: 212 QYAGYPLVVTYHPAYLLR-NLPDKARAWQDL 241 >UniRef50_D0LVE3 Phage SPO1 DNA polymerase-related protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LVE3_HALO1 Length = 380 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 50/164 (30%), Gaps = 15/164 (9%) Query: 7 APGLRVVFCGINPGLSSAGTGFPF-AHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 P ++F G PG G PF + K+I G+T + Sbjct: 219 DPRAALMFVGEAPGFHEDRRGEPFVGNAGELLDKMIAAMGWTRDTVYIANVLKC------ 272 Query: 66 VTKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 RP + E+ A L E+I+ P+ L LG+ A Sbjct: 273 ------RPPNNR-DPQPDEIAACQPFLAEQIDAVAPRILVTLGRPAAHLLLQTTAPMSAL 325 Query: 126 QTLTIGSTQIWVLPNPSGLSRVSL-EKLVEAYRELDQALVVRGR 168 + + V+P + E+ + + +L + R Sbjct: 326 RGRFQEYRGVRVMPTFHPAFLLRRPERKRDTWNDLKLVIAELER 369 >UniRef50_C0GI50 Phage SPO1 DNA polymerase-related protein n=1 Tax=Dethiobacter alkaliphilus AHT 1 RepID=C0GI50_9FIRM Length = 222 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 50/157 (31%), Gaps = 17/157 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAH-PANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P ++F G PG G PF K++ A + ++ + Sbjct: 71 PDADLLFVGEAPGAEEDAQGRPFVGEAGKLLNKILAAAEISREEVY-------------I 117 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 T +V RP + E A L ++I +P+ + LG A + + Sbjct: 118 TNVVKCRPPKNRTP-NPDEAAACSEHLQKQIALLKPRIVVCLGSLATQALIDPQARVSAV 176 Query: 126 QTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQ 161 + + ++ + + + + ++ + Sbjct: 177 RGRWFTKDGLQMMATFHPAALLRDPSRKRPVWEDIQK 213 >UniRef50_C1TMP3 Uracil-DNA glycosylase, family 4 n=1 Tax=Dethiosulfovibrio peptidovorans DSM 11002 RepID=C1TMP3_9BACT Length = 206 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 34/177 (19%), Positives = 59/177 (33%), Gaps = 30/177 (16%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P ++F G PG +G PF A R I +A DR+ + Sbjct: 43 PPDAALMFVGEGPGEDEDRSGIPFVGRAGRLLDKILEAAKIDRK------------GVYI 90 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 T +V RP + ++ + R L +I P+ + LG + Sbjct: 91 TNVVKCRPPKNRTPMVEETMV-CQRFLEAQIAVINPRIIVCLGNTPMKWFLGTSEGITKL 149 Query: 126 --QTLTIGSTQIWVL--PNP----------SGLSRVSLEKLVEAYRELDQALVVRGR 168 Q + + + P+ S + ++ ++E R LD+ L GR Sbjct: 150 RGQWFSWKGIDVMPMFHPSYLLRNESRKKNSPKA-LTWRDILEVRRRLDE-LKAGGR 204 >UniRef50_A1RYC0 Phage SPO1 DNA polymerase-related protein n=23 Tax=Archaea RepID=A1RYC0_THEPD Length = 217 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 37/171 (21%), Positives = 62/171 (36%), Gaps = 28/171 (16%) Query: 12 VVFCGINPGLSSAGTGFPFAHPAN-RFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 VVF G PG + G PF A ++I AG + ++ +T +V Sbjct: 53 VVFIGEAPGRNEDLQGRPFVGAAGALLTELISLAGLSREEVF-------------ITNVV 99 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQTLT 129 RP E ++E+ A L+ ++ +P+ + LGK + + RG Sbjct: 100 KCRPPNNR-EPREEEIAACLPYLLAQLRAIKPRLIVTLGKHSTKTILRMRGMSVDSIMSV 158 Query: 130 IGST--------QIWVLPNPSGLSRV--SLEKLV--EAYRELDQALVVRGR 168 G Q+ V P + + + V E +R L L G+ Sbjct: 159 RGKAYTVNAEWGQVTVFPTLHPAAALYNPAMRKVLEEDFRFLRGLLEGGGQ 209 >UniRef50_Q6L0J8 DNA polymerase, bacteriophage-type n=1 Tax=Picrophilus torridus RepID=Q6L0J8_PICTO Length = 326 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 23/116 (19%), Positives = 43/116 (37%), Gaps = 14/116 (12%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 V+F G +PG + T PF A F ++ AG L R +T Sbjct: 175 NSEVIFIGQSPGFNEDETRRPFTGRAGLFLNSLLKMAG-------------LERNRVYLT 221 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW 123 ++ T + +K E+ L+ +I+ +P+ + ++G A + Sbjct: 222 NVLKFMTYKNRPPNKHEIENAMPYLLREIDIIRPKLIVLMGSTAVNAFLPKHSVTL 277 >UniRef50_A1WUV9 Phage SPO1 DNA polymerase-related protein n=3 Tax=Ectothiorhodospiraceae RepID=A1WUV9_HALHL Length = 294 Score = 58.0 bits (139), Expect = 1e-07, Method: Composition-based stats. Identities = 31/160 (19%), Positives = 56/160 (35%), Gaps = 17/160 (10%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 +V G PG G PF A + D L + + Sbjct: 140 DRQADLVIIGEGPGAEEDRQGEPFVGRAGQ---------LLDAMLAAIDRGR-DRGGVYI 189 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 +V RP + E+ A L ++E +P+A+ LG+ A +Q + Sbjct: 190 GNIVKCRPPGNR-DPRPDEVAACTPYLRRQLELLEPKAIVALGRVAAQQLLETKSPLAKL 248 Query: 126 Q----TLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQ 161 + S +WV +P+ L R S + + +++L + Sbjct: 249 RGRVHHYGAQSIPVWVTYHPAYLLR-SPGEKAKVWQDLKR 287 >UniRef50_D0RRM7 Uracil DNA glycosylase superfamily n=1 Tax=alpha proteobacterium HIMB114 RepID=D0RRM7_9RICK Length = 217 Score = 57.6 bits (138), Expect = 1e-07, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 56/161 (34%), Gaps = 19/161 (11%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAG-FTDRQLKPQEAQHLLDYRCGV 66 +++ G PG+S G PF+ ++G D+ L L + + Sbjct: 60 SKSKIMIIGDAPGVSDEINGKPFSG----------ESGELLDKMLMAIN---LDRNKVYL 106 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ----RGAQ 122 T +++ ++ E+ + + I+ P+ + ILG + F ++ Sbjct: 107 TNIINFRLQDNKKLGPVEIKNYRSIIFDHIKIINPKIIFILGSVTFNFFFKNTQSLTKSR 166 Query: 123 WGKQTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQA 162 + I + + LP+ + + ++ +L Sbjct: 167 GHWNKIKISNNEYDCLPSYHPAFLLRQPSQKKASWEDLKNL 207 >UniRef50_B4UK16 Phage SPO1 DNA polymerase-related protein n=7 Tax=Deltaproteobacteria RepID=B4UK16_ANASK Length = 295 Score = 57.6 bits (138), Expect = 2e-07, Method: Composition-based stats. Identities = 32/155 (20%), Positives = 53/155 (34%), Gaps = 15/155 (9%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P +VF G PG G PF A + + QA + Q Q + Sbjct: 141 PQAELVFVGEGPGADEDREGEPFVGKAGQLLTKMIQA------MGYQREQ------VYIA 188 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 +V RP N E+ A L ++ +P+ + LGK A + + Sbjct: 189 NVVKCRPPGNRNP-EPDEIEACEPFLRAQLAAIRPRVVVALGKFAAQTLLRDTTPITRLR 247 Query: 127 TLTIGSTQIWVLPNPSGLSRV-SLEKLVEAYRELD 160 I ++P + S E+ +A+ +L Sbjct: 248 GRWREYEGIRLMPTFHPAYLLRSPEEKKKAWEDLQ 282 >UniRef50_C7N4J2 Uracil-DNA glycosylase, family 4 n=2 Tax=Slackia RepID=C7N4J2_SLAHD Length = 213 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 51/161 (31%), Gaps = 18/161 (11%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 G R++ G PG + G PF N +++ AG T ++ Sbjct: 53 GARIMLIGEAPGKNEDLQGKPFVGAAGNYLTELLGYAGLTREEVFIANILKC-------- 104 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 RP N E+ L +++ P+ + +G A + G + Sbjct: 105 ----RPPSNRNP-QPDEIFECTPYLRDQVRAIMPEYIVTMGNFATKFILRTDRGITGLRG 159 Query: 128 LTIGSTQIWVLPNPSGLS----RVSLEKLVEAYRELDQALV 164 + Q VLP + R E L + L + L Sbjct: 160 NLQKTGQFSVLPVFHPAAAIYDRTKREVLEADFVYLGRLLG 200 >UniRef50_A0LFG4 Phage SPO1 DNA polymerase-related protein n=2 Tax=Deltaproteobacteria RepID=A0LFG4_SYNFM Length = 264 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 28/113 (24%), Positives = 42/113 (37%), Gaps = 16/113 (14%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCG 65 + +VF G PGL G PF A R ++I GF + Sbjct: 89 SAHAPLVFVGEGPGLEEDRQGRPFVGRAGRLLDQMIKALGFHREAVY------------- 135 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS 117 + +V RP S +E+ G LI +IE +P+ + LG A + Sbjct: 136 ICNVVKCRPPKNRTPES-EEIARCGPFLIRQIEAIRPRVICALGACAAQTLVG 187 >UniRef50_Q2S6J8 Uracil-DNA glycosylase, family 4 n=2 Tax=Rhodothermaceae RepID=Q2S6J8_SALRD Length = 287 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 52/156 (33%), Gaps = 15/156 (9%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P +V G PG + G PF A + I A +R+ + Sbjct: 132 DPEADLVIIGEAPGRNEDEEGEPFVGRAGQLLDKILDAINFERE------------DVYI 179 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 T ++ RP N +E+ A L ++I +P+ + +GK A + + Sbjct: 180 TNILKSRPPRNRNP-KSEEIDAHLPVLYKQIALLRPKIILTVGKVAGNTLLDRSSSLSNL 238 Query: 126 QTLTIGSTQIWVLPNPSGLSRVSLEK-LVEAYRELD 160 + I ++ + + + + ++ Sbjct: 239 RDQEHDFYGIPLMATYHPAALLRNSQWKRPTWEDVK 274 >UniRef50_C3X2T7 DNA polymerase bacteriophage-type n=2 Tax=Oxalobacter formigenes RepID=C3X2T7_OXAFO Length = 317 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 33/175 (18%), Positives = 62/175 (35%), Gaps = 24/175 (13%) Query: 1 MVEDILAPGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLL 60 +V I +F G PG G PF + + D + + Sbjct: 144 VVPGIGDREADWLFIGEGPGFHEDRQGEPFVGRSGK---------LLDAMMAAMRMKRGE 194 Query: 61 DYRCGVTKLVD-RPTVQANEV---SKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF 116 + +V R + ++ + +++E L +I+ +P+ + LGK A Sbjct: 195 NVYIA--NIVKCRASNESGKDRQPTEEEAAICMPYLERQIDLIRPKIMVALGKTAAVALL 252 Query: 117 SQ-RGAQWG--KQTLTIGST-----QIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 + R G + T + V +PS L R K +A+++L +AL Sbjct: 253 NTGRDVSLGSLRNREHFFRTGGRKIPLVVTYHPSYLLRTPGGKK-QAWQDLCRAL 306 >UniRef50_A3CY07 Phage SPO1 DNA polymerase-related protein n=1 Tax=Methanoculleus marisnigri JR1 RepID=A3CY07_METMJ Length = 219 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 57/172 (33%), Gaps = 24/172 (13%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P ++ G PG TG PF A + + + +G L + Sbjct: 54 PRADLMLIGEAPGKQEDLTGRPFVGRAGKLLEGL-LSGI-----------GLSRDDVFIA 101 Query: 68 KLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK-- 125 +V + ++E+ A L E+I P+ + +LG+ + S ++ + Sbjct: 102 NIVKHRPPGNRDPGEEEIRACTPYLEEQIRIIGPKVIVMLGRHSSRYILSFLPVEFDRIT 161 Query: 126 ------QTLTIGSTQIWVLPNPSGLSRV----SLEKLVEAYRELDQALVVRG 167 + + ++P + + E L E + + + L G Sbjct: 162 EIRGTVYPGLLFGHPVRLIPTLHPAAALYNPRYRESLEEDFLIIGRELAEAG 213 >UniRef50_C1FF23 Predicted protein n=2 Tax=cellular organisms RepID=C1FF23_9CHLO Length = 248 Score = 57.2 bits (137), Expect = 2e-07, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 45/162 (27%), Gaps = 18/162 (11%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 RV+ G PG G PF A + D D C V Sbjct: 83 DTNARVMIIGEAPGAEEDKLGVPFVGRAGK---------LLDEIFASVTLST--DRHCYV 131 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 T +V R + + + +E+ L E+I P + G+ + + K Sbjct: 132 TNVVKRRPLNNRDPTAREIEWYRGYLREEIRLLDPAIIVCTGRHSMRTLLGKNCEGITKI 191 Query: 127 TLTIGSTQ-IWVLPNPSGLSRVSLEKLVE------AYRELDQ 161 V+P + + + ++ + Sbjct: 192 RGRWFDVDGRVVMPMFHPSYLLRNPQKTPGSPKALTWDDIRE 233 >UniRef50_B6AR05 Uracil-DNA glycosylase, family 4 n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AR05_9BACT Length = 245 Score = 56.8 bits (136), Expect = 3e-07, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 49/158 (31%), Gaps = 18/158 (11%) Query: 12 VVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 ++F G PG TG PF K+I G + ++ T Sbjct: 88 LMFVGEGPGADEDRTGRPFVGRAGELLTKMIQAMGLSREEVY-----------IANTVKC 136 Query: 71 DRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ---RGAQWGKQT 127 P + S +E A L +I +P+ + +LG+ A + Q G Sbjct: 137 RPPGNRVP--SPEERKACFPFLEAQIRLLKPKYMILLGQTAAQVVLRQSAGITRIRGTCC 194 Query: 128 LTIGSTQIWVLPNPSGLSRVS-LEKLVEAYRELDQALV 164 I ++P + + + +L + Sbjct: 195 ELPEYPGIKIMPTYHPAYLLRNPSAKRDVWSDLQTVMG 232 >UniRef50_B8GUB2 Phage SPO1 DNA polymerase-related protein n=2 Tax=Ectothiorhodospiraceae RepID=B8GUB2_THISH Length = 241 Score = 56.8 bits (136), Expect = 3e-07, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 47/157 (29%), Gaps = 18/157 (11%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCG 65 +F G PG G PF A + +++ G Sbjct: 90 DRQADWLFIGEAPGAEEDRQGEPFVGRAGQLLDSMLFALGLK------------RGRGVY 137 Query: 66 VTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWG 124 + ++ RP + +E A L +I+ QP+ + LG+ A + G Sbjct: 138 IANILKCRPPGNR-DPKPEEARACRAYLDRQIDLIQPRIIVALGRIAAQNLLGTDE-PLG 195 Query: 125 KQTLTIGSTQIWVLPNP-SGLSRVS-LEKLVEAYREL 159 + + + L + +A+ +L Sbjct: 196 RLRGKPLNYRDIPLVITYHPAYLLRQPADKRKAWEDL 232 >UniRef50_C7Q262 Uracil-DNA glycosylase superfamily n=3 Tax=Actinomycetales RepID=C7Q262_CATAD Length = 222 Score = 56.4 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 35/179 (19%), Positives = 56/179 (31%), Gaps = 24/179 (13%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P RVVF G PG G PF P F + + AG RQ + + Sbjct: 47 PAARVVFVGEQPGDMEDRQGVPFVGPAGQLFDRAMADAGIDPRQAYVTNSVKHFRF---- 102 Query: 67 TKLVDRPTVQANEVSKQE--LHAGGRKLIEKIEDYQPQALAILGKQAYEQGF-------- 116 L DRP + + + + A L ++ P+ + LG A Sbjct: 103 --LQDRPNGRRIHKTPETPHIEACKPWLRAELAIVDPEVIVALGATAGRALIGPQFRVTR 160 Query: 117 -SQRGAQWGKQTLTIGST-----QIWVLPNPS-GLSRVSLEKLVEAYRELDQALVVRGR 168 + W T + S + + + AY+E+ L++ R Sbjct: 161 ERGKVLMWPPSGDAGFGTDPSRRRAVSFVATSHPSAVLRADNQETAYKEMVSDLLLVAR 219 >UniRef50_Q0BR04 Uracil DNA glycosylase superfamily protein n=2 Tax=Acetobacteraceae RepID=Q0BR04_GRABC Length = 282 Score = 56.4 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 57/157 (36%), Gaps = 22/157 (14%) Query: 12 VVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVTKLV 70 ++ G PG G PF P+ +F +++ AG T + D+ L+ Sbjct: 130 LMLIGEAPGAEEDRAGKPFVGPSGKFLDRMLASAGMTRER----------DFVIA--NLI 177 Query: 71 D-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF------SQRGAQW 123 RP N ++ E+ L+ I +P+ L LG + + + +W Sbjct: 178 PWRPPGNRNP-TETEILQCLPFLLRHIALARPRMLVCLGGVSAKALTGSTVGIRRLRGKW 236 Query: 124 GKQTLTIGSTQIWVLPNPSGLSRVSLE-KLVEAYREL 159 T+ + + +LP + EA+ ++ Sbjct: 237 TDITIPGLDSPLPMLPMLHPAYLLRNPGAKREAWADI 273 >UniRef50_Q5SKC5 Uracil-DNA glycosylase n=6 Tax=Thermaceae RepID=Q5SKC5_THET8 Length = 205 Score = 56.4 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 27/139 (19%), Positives = 49/139 (35%), Gaps = 15/139 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P +++ G PG TG PF A + +++ AG ++ + Sbjct: 32 PDAKLMIVGEGPGEEEDKTGRPFVGKAGQLLNRILEAAGIPREEVY-------------I 78 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 T +V RP + + + L+++IE PQ + LG A E ++ + Sbjct: 79 TNIVKCRPPQNRAPLPDEAKICTDKWLLKQIELIAPQIIVPLGAVAAEFFLGEKVSITKV 138 Query: 126 QTLTIGSTQIWVLPNPSGL 144 + I V P Sbjct: 139 RGKWYEWHGIKVFPMFHPA 157 >UniRef50_Q0EZR7 Phage SPO1 DNA polymerase domain protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EZR7_9PROT Length = 252 Score = 56.4 bits (135), Expect = 3e-07, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 56/156 (35%), Gaps = 19/156 (12%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P +VF G PG G PF A + +++ G + Sbjct: 99 PQADLVFIGEAPGRDEDLQGEPFVGRAGQLLDRMMAAIGLDRDAVYIMNTIKC------- 151 Query: 67 TKLVDRPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS---QRGAQW 123 RP + E+ A ++++ QP+ + +LG+ A + GA Sbjct: 152 -----RPPNNR-DPRADEVKACNLWFEQQLDALQPKMICLLGRVAAQTVLGTDATLGALR 205 Query: 124 GKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYREL 159 G +WV +P+ L R S ++ + +++L Sbjct: 206 GAWHE-YHGIPVWVTYHPAYLLR-SPQQKQKGWQDL 239 >UniRef50_Q47EK2 Phage SPO1 DNA polymerase-related protein n=1 Tax=Dechloromonas aromatica RCB RepID=Q47EK2_DECAR Length = 260 Score = 56.4 bits (135), Expect = 4e-07, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 45/155 (29%), Gaps = 14/155 (9%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 +F G PG G PF A + D L + R + Sbjct: 101 DTNPEWLFIGEGPGAEEDVKGEPFVGQAGK---------LLDNMLASLDIAR--GNRVYI 149 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 V RP E+ A L +I +P+ + +LGK A + Sbjct: 150 ANAVKCRPPGNRTP-DAAEMAACWPYLERQIALLKPKVIVLLGKAAVHAVLHDDKSLASL 208 Query: 126 QTLTIGSTQIWVLPNPSGLSRVS-LEKLVEAYREL 159 + I V+ + L +A+ +L Sbjct: 209 RGKRFELAGIPVVVTYHPAYLLRNLPDKAKAWEDL 243 >UniRef50_A9II61 Bacteriophage-related DNA polymerase n=4 Tax=Bordetella RepID=A9II61_BORPD Length = 298 Score = 56.4 bits (135), Expect = 4e-07, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 60/162 (37%), Gaps = 19/162 (11%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTKL 69 R + G PG G PF + + + A L ++ + + Sbjct: 147 ARWMVVGEAPGEQEDRQGLPFVGRSGQ----LLDAMLASVGLSREKDVF-------IANV 195 Query: 70 VD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFS---QRGAQWGK 125 + RP N +E+ A L+ +IE +P+ + +LG+ A + G+ G+ Sbjct: 196 IKCRPPGNRNP-KPEEIAACSPYLMRQIELLKPERILVLGRFAAQTLLGTDATIGSLRGR 254 Query: 126 QTLTIG--STQIWVLPNPSGLSRV-SLEKLVEAYRELDQALV 164 + +I V+ + + S + A+++L A+ Sbjct: 255 RHELAAEDGRKIPVVVSYHPAYLLRSPAEKARAWQDLRLAIA 296 >UniRef50_D1Y5H9 Uracil-DNA glycosylase n=1 Tax=Pyramidobacter piscolens W5455 RepID=D1Y5H9_9BACT Length = 201 Score = 56.0 bits (134), Expect = 4e-07, Method: Composition-based stats. Identities = 30/169 (17%), Positives = 55/169 (32%), Gaps = 20/169 (11%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 ++ G PG TG PF + R L E L + Sbjct: 40 PENASLMIVGEAPGAEEERTGRPFVGRSGR---------LLTEILAAAE---LKREEIFI 87 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 T + RP N + K E+ A L ++ E +P+ + +G + + G Sbjct: 88 TSICKCRPPKNRNPL-KTEIAACLPWLTDQFEAIRPRLVLAVGNVPSQTLLGVKTGINGL 146 Query: 126 Q----TLTIGSTQIWVLPNPSGLSRVSLEKLVE--AYRELDQALVVRGR 168 + T ++ + P + + VE + ++ + LV R Sbjct: 147 RGRFHQFTWNGRRMTLRPLFHPAYLLRNPRRVEGSPWDKMLRDLVEARR 195 >UniRef50_Q2Y9I2 Phage SPO1 DNA polymerase-related protein n=1 Tax=Nitrosospira multiformis ATCC 25196 RepID=Q2Y9I2_NITMU Length = 209 Score = 56.0 bits (134), Expect = 4e-07, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 50/162 (30%), Gaps = 12/162 (7%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 ++ G PG TG PF P + + G + ++ A + Sbjct: 47 PSHASIMLVGEQPGDREDATGHPFVGPAGSILSHALTAVGISREEVYVTNAVKHFKFEL- 105 Query: 66 VTKLVDRPTVQANEVSKQ-ELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ---RGA 121 R + ++ E+ A L +IE P+ + LG A + Sbjct: 106 ------RGKRRMHKKPIDTEIVACHAWLQREIELITPRLIIALGATAARSLLGRATPVKV 159 Query: 122 QWGKQTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRELDQAL 163 GK + + +PS L R+ E E Y L Sbjct: 160 NRGKLLEYATGVSLLITVHPSFLLRMPEENKDEEYEHFVNDL 201 >UniRef50_A5N3Q6 Phage-related DNA polymerase n=7 Tax=Clostridium RepID=A5N3Q6_CLOK5 Length = 192 Score = 56.0 bits (134), Expect = 4e-07, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 48/157 (30%), Gaps = 16/157 (10%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P +VF G PG G PF A + + L+ + DY + Sbjct: 33 PKADLVFIGEAPGADEDRIGLPFVGKAGK---------LLTKALESLDLYRKKDY--YIC 81 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQ 126 + RP +E A L ++ +P+ + LG A + + + Sbjct: 82 NICKCRPKNNRTPYEDEE-KACLPYLRNQVALIKPKIIVCLGATAMGCILGK-DWRITRD 139 Query: 127 TLTIGSTQIWVLPNP-SGLSRVSLEKLVE-AYRELDQ 161 + + + + + E +++L Sbjct: 140 RGKWVERKGFYITATFHPSAILRDENKKPLFWQDLKN 176 >UniRef50_A9A4H4 Phage SPO1 DNA polymerase-related protein n=3 Tax=marine archaeal group 1 RepID=A9A4H4_NITMS Length = 196 Score = 56.0 bits (134), Expect = 4e-07, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 48/146 (32%), Gaps = 17/146 (11%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHP-ANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVTK 68 V+F G PG + G PF + + AG + + +T Sbjct: 35 SDVIFVGEAPGRNEDKNGEPFVGAAGKKLSAALEGAGVSREDVY-------------ITN 81 Query: 69 LVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGKQT 127 +V RP + E L +I +P+ + +LG A+ + Sbjct: 82 VVKCRPPNNRVPNTN-ERDTCKEYLKNEILIIKPKIICVLGNTAFNSVLGGSEITKFRGK 140 Query: 128 LTIGSTQIWVLPNPSGLSRVSLEKLV 153 L Q++ L + + ++L+ Sbjct: 141 LVRKDNQLYFL-TIHPAATIYNQELI 165 >UniRef50_D1CIM8 Uracil-DNA glycosylase superfamily n=1 Tax=Thermobaculum terrenum ATCC BAA-798 RepID=D1CIM8_THET1 Length = 198 Score = 56.0 bits (134), Expect = 5e-07, Method: Composition-based stats. Identities = 37/150 (24%), Positives = 55/150 (36%), Gaps = 23/150 (15%) Query: 9 GLRVVFCGINPGLSSAGTGFPFAHPAN---RFWKVIYQAGFTDRQLKPQEAQHLLDYRCG 65 G RV+ G PG+ + G PF+ P R W I +AG + D R Sbjct: 35 GARVMVIGQAPGVRTMIAGRPFSGPGGKLLRGW--IARAGIPLEE---------QDRRVY 83 Query: 66 VTKLVDRP-------TVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQ 118 T L R + S +E+ L +IE P + ++G A E+ + Sbjct: 84 FTSL-TRCYPGKAPKGKGDRKPSAREVELCSPFLQREIELIDPVLVLLVGSMAIERFLGR 142 Query: 119 RGAQWGK-QTLTIGSTQIWVLPNPSGLSRV 147 Q + LP+PSG+SR Sbjct: 143 MPLAEAVGQLFSRDGRWWLPLPHPSGVSRW 172 >UniRef50_C8W879 Phage SPO1 DNA polymerase-related protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W879_ATOPD Length = 221 Score = 56.0 bits (134), Expect = 5e-07, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 54/167 (32%), Gaps = 20/167 (11%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAH-PANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 P R++ G PG + +G PF + ++ + G + + + + Sbjct: 64 PTARIMIVGEAPGKNEDLSGQPFVGTSGKKLDALLEEVGLSHQDVF-------------I 110 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 + +V RP N E+ L ++I +P + LG A + Sbjct: 111 SNVVKCRPPSNRNP-KVAEVEECSPYLRDQIRAIKPDVIVCLGNFATQFVMHTDKGVTEL 169 Query: 126 QTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRE----LDQALVVRGR 168 + VLP S + ++ + ++ L L R Sbjct: 170 RGKFYQIGAFSVLPTMHPASTIYRKEWAQFLKDDLQLLADWLKKHPR 216 >UniRef50_B4U5Y1 Phage SPO1 DNA polymerase-related protein n=1 Tax=Hydrogenobaculum sp. Y04AAS1 RepID=B4U5Y1_HYDS0 Length = 245 Score = 56.0 bits (134), Expect = 5e-07, Method: Composition-based stats. Identities = 32/157 (20%), Positives = 52/157 (33%), Gaps = 24/157 (15%) Query: 8 PGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGVT 67 P ++F G PG G PF A + T + L +T Sbjct: 84 PDTGIMFVGEGPGADEDVEGKPFVGKAGKL--------LTQEL---ERVGLLRQRDYYIT 132 Query: 68 KLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQW--- 123 +V RP +QE+ L E+++ +P+ + LG+ A + Sbjct: 133 NVVKCRPPGNRTP-EEQEMITCSYILKEEMKLIKPKLVIALGRTAAVGLSGIKQISPTKV 191 Query: 124 --------GKQTLTIGSTQIWVLPNPSGLSRVSLEKL 152 K + T+I+V +PS L R E Sbjct: 192 RGSFISANLKSPVLAPGTKIFVTYHPSYLLRNPNELK 228 >UniRef50_Q8ZYS2 Uracil DNA glycosylase (Pa-UDGa) n=1 Tax=Pyrobaculum aerophilum RepID=Q8ZYS2_PYRAE Length = 196 Score = 55.6 bits (133), Expect = 6e-07, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 55/159 (34%), Gaps = 24/159 (15%) Query: 10 LRVVFCGINPGLSSAGTGFPFAHPANRFW-KVIYQAGFTDRQLKPQEAQHLLDYRCGVTK 68 L V+ G PG S G PF A + + + + G +T Sbjct: 34 LGVMIVGEAPGASEDEAGRPFVGAAGQLLTEALSRLGV-------------RRGDVFITN 80 Query: 69 LVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQ---GFSQRGAQWG 124 +V RP +++E+ A LI++I +P+ + LG + + +R + G Sbjct: 81 VVKCRPPNNRTP-NREEVEACLPYLIQQIGILKPRRIIALGLISAKALMELMGRRAEKLG 139 Query: 125 K-----QTLTIGSTQIWVLPNPSGLSRVSLEKLVEAYRE 158 I Q+ + + + L +++ Sbjct: 140 DVKGKCYQGRIAGVQVELCITYHPAAVLRKPALRGEFQK 178 >UniRef50_Q3J878 Phage SPO1 DNA polymerase-related protein n=3 Tax=Gammaproteobacteria RepID=Q3J878_NITOC Length = 231 Score = 55.6 bits (133), Expect = 7e-07, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 52/159 (32%), Gaps = 19/159 (11%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRFWKVIYQAGFTDRQLKPQEAQHLLDYRCGV 66 + + G PG+ G PF A + + +A L + + Sbjct: 78 DQAAKWLIVGEAPGVEEDRQGEPFVGRAGQLLNAMLEAV------------GLQRGQVYI 125 Query: 67 TKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGFSQRGAQWGK 125 ++ RP + + +E+ L ++ +P+ + +G+ A + + G+ Sbjct: 126 ANILKCRPPNNRDPL-PEEVAHCEPYLRRQVALLRPRIILAVGRVAGQNLL-KSSLPLGR 183 Query: 126 QT---LTIGSTQIWVLPNPSGLSRVSLE-KLVEAYRELD 160 T I ++ + + A+++L Sbjct: 184 LRGSVHQYPETTIPLVVTYHPAYLLRAPREKRRAWQDLQ 222 >UniRef50_A9AZ06 Phage SPO1 DNA polymerase-related protein n=1 Tax=Herpetosiphon aurantiacus ATCC 23779 RepID=A9AZ06_HERA2 Length = 220 Score = 55.3 bits (132), Expect = 7e-07, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 47/139 (33%), Gaps = 23/139 (16%) Query: 7 APGLRVVFCGINPGLSSAGTGFPFAHPANRF---WKVIYQAGFTDRQLKPQEAQHLLDYR 63 ++ G PG G PF P+ W + + G T Q+ Sbjct: 36 PANAEILLIGEGPGQREDALGRPFVGPSGDLLEQW--LAEIGLTREQVF----------- 82 Query: 64 CGVTKLVD-RPTVQANEVSKQELHAGGRKLIEKIEDYQPQALAILGKQAYEQGF--SQRG 120 + +V RP + E+ A L +I P+ +A LG+ + + F + Sbjct: 83 --IANVVKCRPPGNR-DPEPSEIAACAHFLDRQIAALAPKLIATLGRHSMNKFFPGGKIT 139 Query: 121 AQWGKQTLTIGSTQIWVLP 139 G + + ++ LP Sbjct: 140 KIHGIRGVKRQGQTVF-LP 157 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.154 0.449 Lambda K H 0.267 0.0470 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,147,358,819 Number of Sequences: 3077464 Number of extensions: 48687387 Number of successful extensions: 124883 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 410 Number of HSP's successfully gapped in prelim test: 307 Number of HSP's that attempted gapping in prelim test: 123712 Number of HSP's gapped (non-prelim): 752 length of query: 168 length of database: 1,040,396,356 effective HSP length: 119 effective length of query: 49 effective length of database: 674,178,140 effective search space: 33034728860 effective search space used: 33034728860 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.2 bits) S2: 88 (38.3 bits)