BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (229 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_A1JPE3 DNA mismatch repair protein mutH n=58 Tax=Proteo... 330 2e-89 UniRef50_B1KI41 DNA mismatch repair protein mutH n=236 Tax=Gamma... 271 9e-72 UniRef50_C0N2X3 DNA mismatch repair endonuclease MutH n=1 Tax=Me... 235 7e-61 UniRef50_A5IBX0 DNA mismatch repair protein MutH n=4 Tax=Legione... 218 1e-55 UniRef50_D1RIY1 DNA mismatch repair endonuclease MutH n=1 Tax=Le... 206 4e-52 UniRef50_C6MZP8 DNA mismatch repair protein n=1 Tax=Legionella d... 196 4e-49 UniRef50_Q89B33 DNA mismatch repair protein mutH n=1 Tax=Buchner... 181 1e-44 UniRef50_A9GFJ0 Methyl-directed mismatch repair n=1 Tax=Sorangiu... 179 4e-44 UniRef50_C2WIM6 Type II restriction endonuclease n=12 Tax=Firmic... 60 8e-08 UniRef50_B2TKK6 DNA mismatch repair enzyme MutH n=3 Tax=Clostrid... 55 2e-06 UniRef50_B3DNU4 DNA mismatch repair protein n=4 Tax=Bifidobacter... 51 3e-05 UniRef50_UPI000196BAC3 hypothetical protein CATMIT_00199 n=1 Tax... 48 2e-04 UniRef50_A8RH65 Putative uncharacterized protein n=1 Tax=Clostri... 47 5e-04 UniRef50_C2SK77 DNA mismatch repair enzyme MutH n=1 Tax=Bacillus... 45 0.002 UniRef50_P16667 Type-2 restriction enzyme Sau3AI n=2 Tax=Staphyl... 44 0.004 UniRef50_C3FCF3 DNA mismatch repair enzyme MutH n=2 Tax=Bacillus... 40 0.071 >UniRef50_A1JPE3 DNA mismatch repair protein mutH n=58 Tax=Proteobacteria RepID=MUTH_YERE8 Length = 228 Score = 330 bits (847), Expect = 2e-89, Method: Compositional matrix adjust. Identities = 162/220 (73%), Positives = 190/220 (86%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPE 68 +PP E QL +A+ LSG+TLGELA+ G P +LKR KGW+G+LLE +LGASAGSKPE Sbjct: 9 APPTDEHQLFQRAEALSGFTLGELASKAGWNIPADLKRIKGWVGMLLEFYLGASAGSKPE 68 Query: 69 QDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 128 QDFA +G+ELKTIP+ + G+PLETTFVCVAPLTGNSG+TWE+SHVRHKL RVLW+PVEGE Sbjct: 69 QDFADIGIELKTIPISAQGKPLETTFVCVAPLTGNSGITWESSHVRHKLARVLWVPVEGE 128 Query: 129 RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAA 188 R IPLAQRRVG+PLLWSPNEEE+ LR DWEELMD+IVLG+VE ITARHGE LQ+RPKAA Sbjct: 129 RQIPLAQRRVGAPLLWSPNEEEEELLRRDWEELMDLIVLGKVETITARHGEVLQLRPKAA 188 Query: 189 NAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 N++ALTEAIG G+ I+TLPRGFYLKK FT +LARHFL+ Sbjct: 189 NSRALTEAIGEHGQPIMTLPRGFYLKKTFTGPMLARHFLL 228 >UniRef50_B1KI41 DNA mismatch repair protein mutH n=236 Tax=Gammaproteobacteria RepID=MUTH_SHEWM Length = 231 Score = 271 bits (694), Expect = 9e-72, Method: Compositional matrix adjust. Identities = 127/218 (58%), Positives = 172/218 (78%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P+T ++L+++A ++G TLG+LA G +TP NLKRDKGW+G L+E LGA AGS+PE D Sbjct: 7 PKTIDELMSRANDMAGLTLGQLADSHGFITPANLKRDKGWVGQLIEHELGALAGSRPEPD 66 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 F LG+ELKTIPVD G+P+ETT+V VAPL G+TWETS V HKL+ VLWIP++G+R Sbjct: 67 FLHLGIELKTIPVDKNGKPIETTYVTVAPLIDIQGLTWETSVVCHKLQTVLWIPIQGDRD 126 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 IP++QR++GSP+LW PNEEE L++DWEE+M+ I LGQV+++TARHGE LQ+RPK AN+ Sbjct: 127 IPVSQRQIGSPILWRPNEEELALLKQDWEEIMEFIALGQVKQLTARHGEVLQLRPKGANS 186 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 +++T++IG G +T PRGFYLK FT ++L+R F I Sbjct: 187 RSVTQSIGPNGSTQMTNPRGFYLKIPFTQSILSRAFGI 224 >UniRef50_C0N2X3 DNA mismatch repair endonuclease MutH n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N2X3_9GAMM Length = 223 Score = 235 bits (600), Expect = 7e-61, Method: Compositional matrix adjust. Identities = 112/219 (51%), Positives = 156/219 (71%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 + PP+T+ +L+ + QQL+G TLG++AA +G+V P++L+R KGW+G LLE +LGA AG++ Sbjct: 4 IPPPDTQHELIRRCQQLAGKTLGQVAAELGVVVPDDLRRHKGWVGQLLENYLGADAGNQA 63 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 E DF LG+ELKT+P+++ G+P E+T+VC LT + + W+ S V+ KL VLW+PVE Sbjct: 64 EPDFTRLGIELKTLPLNANGQPKESTYVCTVTLTESGNMQWQQSWVKKKLAHVLWLPVEA 123 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 E + LAQR VG LW P E+++ LR+DWEELMD IVLG+ ITA+ G YLQIRPKA Sbjct: 124 ESATALAQRYVGQGWLWQPTVEQEKILRQDWEELMDRIVLGEQADITAKEGHYLQIRPKA 183 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 AN++ + + G GE L P+GFYL+ FT LLA F Sbjct: 184 ANSRVMAKTRGDEGEATLINPKGFYLRTAFTRELLATQF 222 >UniRef50_A5IBX0 DNA mismatch repair protein MutH n=4 Tax=Legionella pneumophila RepID=A5IBX0_LEGPC Length = 224 Score = 218 bits (556), Expect = 1e-55, Method: Compositional matrix adjust. Identities = 105/215 (48%), Positives = 140/215 (65%) Query: 10 PPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQ 69 PP+ E++LL Q Q++ G + +LA +GL TP N KGW+G LEI LG A ++ Sbjct: 9 PPKNEKELLEQCQRIEGLSFAQLALELGLFTPLNPVHRKGWVGQALEIALGTDAKNESLP 68 Query: 70 DFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGER 129 DF LG+ELKTIP++ G+P E+TF+ PL +WE+S KLKRVLWIP+EG+ Sbjct: 69 DFKQLGIELKTIPINKSGKPAESTFIVSIPLLTIHQQSWESSQCYAKLKRVLWIPIEGDT 128 Query: 130 SIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAAN 189 IP RR+G LWSPN+++ L EDW L + IVLGQ+E + + GEYLQ+RPKAAN Sbjct: 129 DIPFPHRRIGRGFLWSPNKDQKSVLAEDWNYLTNQIVLGQLETLNSSSGEYLQVRPKAAN 188 Query: 190 AKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 K+L G RI TLPRGFYL+ FTS +L++ Sbjct: 189 GKSLCYGYDPEGNRIRTLPRGFYLRSCFTSVILSQ 223 >UniRef50_D1RIY1 DNA mismatch repair endonuclease MutH n=1 Tax=Legionella longbeachae D-4968 RepID=D1RIY1_LEGLO Length = 230 Score = 206 bits (524), Expect = 4e-52, Method: Compositional matrix adjust. Identities = 95/217 (43%), Positives = 137/217 (63%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 + PP+TE +L+ + ++G + +L+ + L+ P + R KGW+G ++E+ LGA+A +K Sbjct: 8 IPPPQTESELIERCSSIAGLSFAQLSLSLNLLIPIDSNRRKGWVGQVIELALGANASNKS 67 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 DF LG+ELKT+P+ GRP E+TF+ PL W+TS KLKRVLWIP+EG Sbjct: 68 LPDFKNLGIELKTLPIGKSGRPTESTFITSIPLLTVHQQQWKTSQCYSKLKRVLWIPIEG 127 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 + IP QRR+G LWSPNE ++R L DW L I G +E + A+ G+YLQ+RPKA Sbjct: 128 DTDIPYPQRRIGQGFLWSPNEAQERALEADWNYLTLQISTGNLEVLDAKSGDYLQVRPKA 187 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 AN ++L + G +I TLPRGFYL+ +FT + Sbjct: 188 ANGRSLCYGFDSHGNKIKTLPRGFYLRSSFTEKIFGN 224 >UniRef50_C6MZP8 DNA mismatch repair protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZP8_9GAMM Length = 227 Score = 196 bits (499), Expect = 4e-49, Method: Compositional matrix adjust. Identities = 94/213 (44%), Positives = 137/213 (64%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P+TE +++ + +L G + +LA +GL P N + KGWIG +E+ LGA+A +K D Sbjct: 10 PKTETEVIERCAKLVGLSFAQLALGLGLSIPTNPDQRKGWIGQAIELALGANAYNKSLPD 69 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 F LG+ELKT+PV G+P E+TF+ PL W++S KL+RVLW+P+EG+ Sbjct: 70 FQYLGIELKTLPVAKSGKPTESTFITSIPLLTIHQQDWKSSQCYAKLQRVLWLPIEGDAD 129 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 IP QRR+G LWSPNE ++ L DW L I G +E + A+ G+YLQ+RPKAA+ Sbjct: 130 IPYMQRRIGQGFLWSPNEAQETILEADWNYLTLQISTGYLETLDAKSGDYLQVRPKAAHG 189 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 K+L ++G ++ TLPRGFYL+ +FT+ +LA Sbjct: 190 KSLCYGFDSKGNKVKTLPRGFYLRSSFTAKILA 222 >UniRef50_Q89B33 DNA mismatch repair protein mutH n=1 Tax=Buchnera aphidicola (Baizongia pistaciae) RepID=MUTH_BUCBP Length = 220 Score = 181 bits (460), Expect = 1e-44, Method: Compositional matrix adjust. Identities = 84/213 (39%), Positives = 134/213 (62%), Gaps = 4/213 (1%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASA--GSKPEQD 70 E++L A LSGY++ E+ + + +L R+KG++G +LE+ LG + G K D Sbjct: 5 CEKKLFMHAIGLSGYSIREIVSSLDQPVSNSLVRNKGFVGKILELILGVNVLHGYKC-ID 63 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNS-GVTWETSHVRHKLKRVLWIPVEGER 129 F +LG+ELK+IP++S G PLE TF+C PL NS +TW S+ K+K++LWIP+ G R Sbjct: 64 FPSLGIELKSIPINSSGYPLEPTFICNIPLKNNSLNITWNNSYFYRKIKKILWIPIIGNR 123 Query: 130 SIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAAN 189 + + VG +W+ + +++ L++DWEE MD+I++G+VE I+++HG+ LQ++ K N Sbjct: 124 VVSFLDKIVGEAFIWTMSSVQEKILKKDWEEFMDLIIIGKVEYISSKHGQVLQVKKKCKN 183 Query: 190 AKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 + I G T PR FY +K+FT +LL Sbjct: 184 KHVCIKFINYNGCVKFTNPRAFYFRKSFTWSLL 216 >UniRef50_A9GFJ0 Methyl-directed mismatch repair n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GFJ0_SORC5 Length = 227 Score = 179 bits (455), Expect = 4e-44, Method: Compositional matrix adjust. Identities = 102/215 (47%), Positives = 134/215 (62%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 L PP E +L +A L+G TL +LA +G P R KG G L+E LGA+AGS Sbjct: 5 LKPPRDEAELRCRAAALAGLTLADLARALGCAVPRGGVRTKGAAGALVERALGATAGSTS 64 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 DF LG+ELKTIPV+ G P E+TFVC L+ WETS R KL VLW+P+ Sbjct: 65 RPDFPHLGIELKTIPVEVTGAPQESTFVCSISLSDADRAEWETSRARAKLACVLWVPIIA 124 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 P +RR+G+P LW P + ++R LR D+E+LM +I +G VE +TA G +LQ+RPKA Sbjct: 125 APGRPSEERRLGAPRLWRPTDAQERVLRADFEDLMGLIGIGAVEALTAHAGRWLQVRPKA 184 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 A+ ++ T A G GE I LPRGFYL+ +FT A+L Sbjct: 185 ASGRSRTLAFGRDGEMIAALPRGFYLRPSFTGAIL 219 >UniRef50_C2WIM6 Type II restriction endonuclease n=12 Tax=Firmicutes RepID=C2WIM6_BACCE Length = 475 Score = 59.7 bits (143), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 56/221 (25%), Positives = 96/221 (43%), Gaps = 14/221 (6%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIG-VLLEIWLGASAGSKPEQDF 71 T E+L+ +A + G T GE+ L + KG +G V+ E + G S E DF Sbjct: 17 TIEELMKKAHEAEGKTFGEIDTTDRLANA----KSKGGLGQVIEESFFGYEVNSNAEADF 72 Query: 72 AALGVELKTIPVD-----SLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVE 126 LG+ELK SL E + + +ETS K +++L + E Sbjct: 73 KHLGIELKVTAFKQNKNGSLSAK-ERLVLNIINYMEEVHTHFETSSFWKKNEKLLLMFYE 131 Query: 127 GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + + LL++ E + +R+DWE +++ I +G+ ++ YL K Sbjct: 132 WVPGVDRKDFHITKSLLFTYPEADLEIIRQDWETIVNKIRVGKAHELSEGDTNYLGACTK 191 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 +N +L + I + R F LK ++ +AL+ R+ L Sbjct: 192 GSNKNSLRSQPYS---EIPAMQRAFSLKPSYMTALVRRYHL 229 >UniRef50_B2TKK6 DNA mismatch repair enzyme MutH n=3 Tax=Clostridium RepID=B2TKK6_CLOBB Length = 476 Score = 55.1 bits (131), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 46/198 (23%), Positives = 83/198 (41%), Gaps = 23/198 (11%) Query: 43 NLKRDKGWIG-VLLEIWLGASAGSKPEQDFAALGVELKTIPVDSL-GRP----------- 89 N K+ KG +G + E + G S+ E DF +GVELK P+ ++ +P Sbjct: 38 NNKKSKGRLGQTVEEEYFGYKVNSRQEADFNEVGVELKVCPIRTIKAKPKSNSIREQIGY 97 Query: 90 --LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPN 147 E + + + TWE + + K K +L + E++IP + LW+P+ Sbjct: 98 SAKERIVLSIIDYFKLNEETWENNSIMKKCKELLLMFYMNEKNIPKEELVFKIISLWTPS 157 Query: 148 EEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGE---RI 204 E++ + + DW + + G+ I+ YL A K T RG+ + Sbjct: 158 EQDLKIIENDWNTIALKVKKGKAHEISEGDTMYL-----GACTKGSTAEKSKRGQPNSDV 212 Query: 205 LTLPRGFYLKKNFTSALL 222 + R F K+ + ++ Sbjct: 213 MAPQRAFSYKRQYVDYII 230 Score = 51.2 bits (121), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 45/175 (25%), Positives = 77/175 (44%), Gaps = 12/175 (6%) Query: 32 LAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPE--QDFAALGVELKTIPVDSLGRP 89 ++ ++ L + + ++ K +I ++++ G K E ++F +ELKTI + G P Sbjct: 271 VSEIISLYSIDRERKAKNYIRLIVDDVCKKVFGDKLENFEEFKKANIELKTIFLKLNGMP 330 Query: 90 LETTFVCVAPLTGNSGVTWETSHVRHKL--KRVLWI------PVEGERSIPLAQRRVGSP 141 E+ WETS +R K K+ LWI E + + L + + Sbjct: 331 KESMSFEQIDYCEIVNEEWETSTIRDKFENKKHLWIIFKSKVNFEKQSELSLNDIVLDNV 390 Query: 142 LLWS-PNEEEDRQLREDWEELMDMIVLGQV-ERITARHGEYLQIRPKAANAKALT 194 + W+ P + D + + W + D I G + I GE IRPK N+K LT Sbjct: 391 MFWNMPISDLDSSMYKVWLDTTDKIKQGVYNDFIKISDGEIAHIRPKGQNSKDLT 445 >UniRef50_B3DNU4 DNA mismatch repair protein n=4 Tax=Bifidobacterium RepID=B3DNU4_BIFLD Length = 502 Score = 50.8 bits (120), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 46/197 (23%), Positives = 85/197 (43%), Gaps = 17/197 (8%) Query: 47 DKGWIGVLLE---IWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETT------FVCV 97 +KG +G ++E + ++ + P+ ELK P+ + + +T+ + Sbjct: 55 NKGSLGQIIEESVLKYAINSDAAPDIHIGDTSYELKVTPLKHIKKGKQTSAKERLVIDII 114 Query: 98 APLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWS---PNEEED-RQ 153 LT +E+S + K K ++ + +R+ + R+ +L S E +D Sbjct: 115 NYLTLADETDFESSKMWDKAKNIILVYYYDDRTDKKKELRIDCKVLASYLMKYEADDLAT 174 Query: 154 LREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEA---IGARGERILTLPRG 210 ++ DW + D + G + ++ YL K AN+K L EA GA I R Sbjct: 175 IKNDWHVIRDKVASGHADSLSESDTNYLAACTKGANSKQLREAPAPAGANTATIFAKQRA 234 Query: 211 FYLKKNFTSALLARHFL 227 F LK ++ +A +AR L Sbjct: 235 FSLKTSYMTA-IARKLL 250 >UniRef50_UPI000196BAC3 hypothetical protein CATMIT_00199 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196BAC3 Length = 499 Score = 48.1 bits (113), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 53/186 (28%), Positives = 79/186 (42%), Gaps = 12/186 (6%) Query: 47 DKGWIGVLLEIWLGASA-GSKPEQDFAALGVELKTIPVDSLGR------PLETTFVCVAP 99 +KG IG ++E L S+ E DF LGVELK P++ +G+ E V + Sbjct: 40 NKGKIGQVIEEGLFHKGIDSRQEADFPELGVELKVTPMNRVGKEKDMVSAKERLVVTMIN 99 Query: 100 LTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNE--EEDRQLRED 157 + V++E SHV KL +VL + E V L+ + EED+Q+ + Sbjct: 100 YMKDYNVSFEDSHVLDKLNKVLLMFYFNEPEKMNYDLCVDKIFLYQFKDIPEEDKQIIIN 159 Query: 158 WEELMDM-IVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKN 216 ++ I G E I+ YL K A A T + + R F LK Sbjct: 160 DYNIIIDKIKSGHAEDISESDTYYLSACTKGAT--AATSFVDQPFSSVKAKRRAFSLKPK 217 Query: 217 FTSALL 222 + +ALL Sbjct: 218 YMTALL 223 >UniRef50_A8RH65 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RH65_9CLOT Length = 217 Score = 47.0 bits (110), Expect = 5e-04, Method: Compositional matrix adjust. Identities = 49/192 (25%), Positives = 79/192 (41%), Gaps = 35/192 (18%) Query: 47 DKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGV 106 +KG G LLE+ LG S DF ELKT D+ G P ET F+ + + Sbjct: 33 NKGKTGQLLELALGMHLSS-TNLDFD--DGELKTNKCDNAGNPKETVFITQISSVIDELI 89 Query: 107 T---WETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLW-----------SPNEEEDR 152 +E +H+ K+ +L++PV + GSP W SP + R Sbjct: 90 QERPFEETHLYEKISNILYVPVCKD----------GSPKDWMFLPSIHIDLSSPRYAQLR 139 Query: 153 QL-REDWEELMDMIVLGQVE-----RITARHGEYLQIRPKAANAKALTEAIGARGERILT 206 + R D+ + + + +E RI +G+++Q+R K + + G G + Sbjct: 140 DIWRSDYYSICRQLRM-HIETSPDRRIHTSNGQHIQVRSKDSMPYHPIYS-GVYGRYVSN 197 Query: 207 LPRGFYLKKNFT 218 FY +K F Sbjct: 198 KNHAFYFRKEFV 209 >UniRef50_C2SK77 DNA mismatch repair enzyme MutH n=1 Tax=Bacillus cereus BDRD-ST196 RepID=C2SK77_BACCE Length = 461 Score = 45.4 bits (106), Expect = 0.002, Method: Compositional matrix adjust. Identities = 42/175 (24%), Positives = 82/175 (46%), Gaps = 9/175 (5%) Query: 7 LLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGS 65 + S TE+Q+ A++L G LG+L L K+DKG IG +++ + G A S Sbjct: 4 IYSESWTEQQIFDVAEELVGKRLGDLDKSGWLKK----KKDKGNIGNMIQSDFFGIPANS 59 Query: 66 KPEQDFAALGVELKTIPV---DSLGRPLETTFVC-VAPLTGNSGVTWETSHVRHKLKRVL 121 DF +ELK P+ ++G + V + + + +E S V K + +L Sbjct: 60 IKGADFEHHHIELKVTPILKKKTVGYSSKERLVLGMINYMEDYRIPFEESIVNKKAQNML 119 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITAR 176 + E + P+ + ++ + + ++ Q+R D++ ++D I G+ I+ + Sbjct: 120 LVFYLHEENKPVEEFKIIKTARFQLPKSDESQVRLDYQTIVDNIQQGKAHEISEK 174 >UniRef50_P16667 Type-2 restriction enzyme Sau3AI n=2 Tax=Staphylococcus RepID=T2S3_STAAU Length = 489 Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust. Identities = 45/192 (23%), Positives = 75/192 (39%), Gaps = 9/192 (4%) Query: 42 ENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGR----PLETTFVCV 97 E++K+ K +G E W G S + D A GVELK P L E + + Sbjct: 29 ESIKQSKSSVGDAFENWFGKKKDSDSKPDMAEAGVELKATPFKKLKNGKYSSKERLVLNI 88 Query: 98 APLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWS--PNEEEDRQLR 155 + +ETS K + E + P + +L+ N + ++ Sbjct: 89 INYEKVANENFETSSFLSKNNTIELAFYEYIKGTPSDNWIIKEAVLYEMHKNPIDYEIIK 148 Query: 156 EDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKK 215 +DWE + I G+ ++ YL K ANA +L + I R F LK Sbjct: 149 QDWEIINQYINEGKAHELSEGLTSYLAPCTKGANASSLRNQPYS---DIKAKQRAFSLKS 205 Query: 216 NFTSALLARHFL 227 + +++L ++ L Sbjct: 206 GYMTSILRKYVL 217 >UniRef50_C3FCF3 DNA mismatch repair enzyme MutH n=2 Tax=Bacillus thuringiensis RepID=C3FCF3_BACTU Length = 453 Score = 39.7 bits (91), Expect = 0.071, Method: Compositional matrix adjust. Identities = 51/219 (23%), Positives = 93/219 (42%), Gaps = 13/219 (5%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD 70 +TEEQLLA + + G + E+ GL+ + DKG +G ++E + + PE D Sbjct: 8 KTEEQLLAFTEGIIGKSFKEIDT-KGLLQGNS--NDKGILGKVVETGFYDYELNNCPEAD 64 Query: 71 FAALGVELKTIPVDSL--GRPLETTFVCVAPLTGNSGV--TWETSHVRHKLKRVLWIPVE 126 F LG+ELK + L G + ++ + + + +E S V K K++L I E Sbjct: 65 FGELGIELKVSGFNKLRNGSWSAKERISLSMINYKNIIHEEYEFSKVVSKNKKLLIIWYE 124 Query: 127 GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + +P + L+ +++E +R D+ + ++ G +++ L K Sbjct: 125 YMKGVPYGDFIIRDFQLYDMSQDE-AIIRNDFYTIKQKVLEGLAHKLSEGDTVILGAATK 183 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARH 225 + + I R F LK +F +L H Sbjct: 184 GQKGQKAEQP----NSEISAPTRAFSLKNSFFRGVLRSH 218 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_D1RIY1 DNA mismatch repair endonuclease MutH n=1 Tax=Le... 305 5e-82 UniRef50_A1JPE3 DNA mismatch repair protein mutH n=58 Tax=Proteo... 305 1e-81 UniRef50_B1KI41 DNA mismatch repair protein mutH n=236 Tax=Gamma... 302 5e-81 UniRef50_A5IBX0 DNA mismatch repair protein MutH n=4 Tax=Legione... 298 1e-79 UniRef50_C6MZP8 DNA mismatch repair protein n=1 Tax=Legionella d... 297 2e-79 UniRef50_C0N2X3 DNA mismatch repair endonuclease MutH n=1 Tax=Me... 295 8e-79 UniRef50_A9GFJ0 Methyl-directed mismatch repair n=1 Tax=Sorangiu... 262 6e-69 UniRef50_Q89B33 DNA mismatch repair protein mutH n=1 Tax=Buchner... 256 5e-67 UniRef50_C2WIM6 Type II restriction endonuclease n=12 Tax=Firmic... 235 8e-61 UniRef50_C2SK77 DNA mismatch repair enzyme MutH n=1 Tax=Bacillus... 199 9e-50 UniRef50_UPI000196BAC3 hypothetical protein CATMIT_00199 n=1 Tax... 197 3e-49 UniRef50_B3DNU4 DNA mismatch repair protein n=4 Tax=Bifidobacter... 192 5e-48 UniRef50_B2TKK6 DNA mismatch repair enzyme MutH n=3 Tax=Clostrid... 191 2e-47 UniRef50_A8RH65 Putative uncharacterized protein n=1 Tax=Clostri... 159 8e-38 Sequences not found previously or not previously below threshold: UniRef50_Q0TSC8 Type II site-specific deoxyribonuclease n=3 Tax=... 161 1e-38 UniRef50_B9E7L8 Type II restriction enzyme n=1 Tax=Macrococcus c... 158 1e-37 UniRef50_D2RCS3 DNA mismatch repair enzyme MutH n=1 Tax=Gardnere... 152 6e-36 UniRef50_B8ZN89 Putative restriction enzyme n=1 Tax=Streptococcu... 149 7e-35 UniRef50_P16667 Type-2 restriction enzyme Sau3AI n=2 Tax=Staphyl... 147 2e-34 UniRef50_C5WEK1 Restriction enzyme n=4 Tax=Bacteria RepID=C5WEK1... 145 1e-33 UniRef50_O87348 LlaKR2I restriction enzyme n=2 Tax=Lactococcus l... 141 1e-32 UniRef50_Q6SFK8 Type II restriction enzyme, putative n=1 Tax=unc... 137 4e-31 UniRef50_C3FCF3 DNA mismatch repair enzyme MutH n=2 Tax=Bacillus... 135 8e-31 UniRef50_Q30PF3 DNA mismatch repair enzyme MutH n=1 Tax=Sulfurim... 132 7e-30 UniRef50_C0ETP8 Putative uncharacterized protein n=2 Tax=Clostri... 131 1e-29 UniRef50_C8JWG6 Restriction endonuclease n=4 Tax=Bacillales RepI... 130 3e-29 UniRef50_D2NQK8 DNA mismatch repair protein n=1 Tax=Rothia mucil... 125 1e-27 UniRef50_Q6F1F9 Type II restriction enzyme Sau3AI, GATC site n=1... 122 7e-27 UniRef50_A1A2S7 Putative type II restriction enzyme Sau3AI n=1 T... 112 1e-23 UniRef50_Q3IRH4 DNA mismatch repair protein homolog n=1 Tax=Natr... 105 1e-21 UniRef50_C4Z3T8 Putative uncharacterized protein n=2 Tax=Clostri... 103 3e-21 UniRef50_D1NWA3 DNA mismatch repair enzyme MutH family protein n... 90 5e-17 UniRef50_Q03GQ2 Type II restriction endonuclease, MutH family n=... 89 1e-16 UniRef50_Q9ACM4 R.Sth368I endonuclease n=1 Tax=Streptococcus the... 81 3e-14 UniRef50_A6BKR2 Putative uncharacterized protein n=1 Tax=Dorea l... 80 6e-14 UniRef50_C0EPR0 Putative uncharacterized protein n=1 Tax=Neisser... 67 5e-10 UniRef50_C0EPR1 Putative uncharacterized protein n=1 Tax=Neisser... 63 8e-09 UniRef50_Q6SFL0 Putative uncharacterized protein n=1 Tax=uncultu... 62 1e-08 UniRef50_C5EHV9 Type II restriction endonuclease n=1 Tax=Clostri... 62 1e-08 UniRef50_A0Z3W2 Putative uncharacterized protein n=1 Tax=marine ... 41 0.040 >UniRef50_D1RIY1 DNA mismatch repair endonuclease MutH n=1 Tax=Legionella longbeachae D-4968 RepID=D1RIY1_LEGLO Length = 230 Score = 305 bits (782), Expect = 5e-82, Method: Composition-based stats. Identities = 96/221 (43%), Positives = 138/221 (62%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 + PP+TE +L+ + ++G + +L+ + L+ P + R KGW+G ++E+ LGA+A +K Sbjct: 8 IPPPQTESELIERCSSIAGLSFAQLSLSLNLLIPIDSNRRKGWVGQVIELALGANASNKS 67 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 DF LG+ELKT+P+ GRP E+TF+ PL W+TS KLKRVLWIP+EG Sbjct: 68 LPDFKNLGIELKTLPIGKSGRPTESTFITSIPLLTVHQQQWKTSQCYSKLKRVLWIPIEG 127 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 + IP QRR+G LWSPNE ++R L DW L I G +E + A+ G+YLQ+RPKA Sbjct: 128 DTDIPYPQRRIGQGFLWSPNEAQERALEADWNYLTLQISTGNLEVLDAKSGDYLQVRPKA 187 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 AN ++L + G +I TLPRGFYL+ +FT + I Sbjct: 188 ANGRSLCYGFDSHGNKIKTLPRGFYLRSSFTEKIFGNPHQI 228 >UniRef50_A1JPE3 DNA mismatch repair protein mutH n=58 Tax=Proteobacteria RepID=MUTH_YERE8 Length = 228 Score = 305 bits (780), Expect = 1e-81, Method: Composition-based stats. Identities = 162/220 (73%), Positives = 190/220 (86%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPE 68 +PP E QL +A+ LSG+TLGELA+ G P +LKR KGW+G+LLE +LGASAGSKPE Sbjct: 9 APPTDEHQLFQRAEALSGFTLGELASKAGWNIPADLKRIKGWVGMLLEFYLGASAGSKPE 68 Query: 69 QDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 128 QDFA +G+ELKTIP+ + G+PLETTFVCVAPLTGNSG+TWE+SHVRHKL RVLW+PVEGE Sbjct: 69 QDFADIGIELKTIPISAQGKPLETTFVCVAPLTGNSGITWESSHVRHKLARVLWVPVEGE 128 Query: 129 RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAA 188 R IPLAQRRVG+PLLWSPNEEE+ LR DWEELMD+IVLG+VE ITARHGE LQ+RPKAA Sbjct: 129 RQIPLAQRRVGAPLLWSPNEEEEELLRRDWEELMDLIVLGKVETITARHGEVLQLRPKAA 188 Query: 189 NAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 N++ALTEAIG G+ I+TLPRGFYLKK FT +LARHFL+ Sbjct: 189 NSRALTEAIGEHGQPIMTLPRGFYLKKTFTGPMLARHFLL 228 >UniRef50_B1KI41 DNA mismatch repair protein mutH n=236 Tax=Gammaproteobacteria RepID=MUTH_SHEWM Length = 231 Score = 302 bits (774), Expect = 5e-81, Method: Composition-based stats. Identities = 127/218 (58%), Positives = 172/218 (78%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P+T ++L+++A ++G TLG+LA G +TP NLKRDKGW+G L+E LGA AGS+PE D Sbjct: 7 PKTIDELMSRANDMAGLTLGQLADSHGFITPANLKRDKGWVGQLIEHELGALAGSRPEPD 66 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 F LG+ELKTIPVD G+P+ETT+V VAPL G+TWETS V HKL+ VLWIP++G+R Sbjct: 67 FLHLGIELKTIPVDKNGKPIETTYVTVAPLIDIQGLTWETSVVCHKLQTVLWIPIQGDRD 126 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 IP++QR++GSP+LW PNEEE L++DWEE+M+ I LGQV+++TARHGE LQ+RPK AN+ Sbjct: 127 IPVSQRQIGSPILWRPNEEELALLKQDWEEIMEFIALGQVKQLTARHGEVLQLRPKGANS 186 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 +++T++IG G +T PRGFYLK FT ++L+R F I Sbjct: 187 RSVTQSIGPNGSTQMTNPRGFYLKIPFTQSILSRAFGI 224 >UniRef50_A5IBX0 DNA mismatch repair protein MutH n=4 Tax=Legionella pneumophila RepID=A5IBX0_LEGPC Length = 224 Score = 298 bits (762), Expect = 1e-79, Method: Composition-based stats. Identities = 105/216 (48%), Positives = 140/216 (64%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPE 68 PP+ E++LL Q Q++ G + +LA +GL TP N KGW+G LEI LG A ++ Sbjct: 8 KPPKNEKELLEQCQRIEGLSFAQLALELGLFTPLNPVHRKGWVGQALEIALGTDAKNESL 67 Query: 69 QDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 128 DF LG+ELKTIP++ G+P E+TF+ PL +WE+S KLKRVLWIP+EG+ Sbjct: 68 PDFKQLGIELKTIPINKSGKPAESTFIVSIPLLTIHQQSWESSQCYAKLKRVLWIPIEGD 127 Query: 129 RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAA 188 IP RR+G LWSPN+++ L EDW L + IVLGQ+E + + GEYLQ+RPKAA Sbjct: 128 TDIPFPHRRIGRGFLWSPNKDQKSVLAEDWNYLTNQIVLGQLETLNSSSGEYLQVRPKAA 187 Query: 189 NAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 N K+L G RI TLPRGFYL+ FTS +L++ Sbjct: 188 NGKSLCYGYDPEGNRIRTLPRGFYLRSCFTSVILSQ 223 >UniRef50_C6MZP8 DNA mismatch repair protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZP8_9GAMM Length = 227 Score = 297 bits (760), Expect = 2e-79, Method: Composition-based stats. Identities = 94/213 (44%), Positives = 137/213 (64%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P+TE +++ + +L G + +LA +GL P N + KGWIG +E+ LGA+A +K D Sbjct: 10 PKTETEVIERCAKLVGLSFAQLALGLGLSIPTNPDQRKGWIGQAIELALGANAYNKSLPD 69 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 F LG+ELKT+PV G+P E+TF+ PL W++S KL+RVLW+P+EG+ Sbjct: 70 FQYLGIELKTLPVAKSGKPTESTFITSIPLLTIHQQDWKSSQCYAKLQRVLWLPIEGDAD 129 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 IP QRR+G LWSPNE ++ L DW L I G +E + A+ G+YLQ+RPKAA+ Sbjct: 130 IPYMQRRIGQGFLWSPNEAQETILEADWNYLTLQISTGYLETLDAKSGDYLQVRPKAAHG 189 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 K+L ++G ++ TLPRGFYL+ +FT+ +LA Sbjct: 190 KSLCYGFDSKGNKVKTLPRGFYLRSSFTAKILA 222 >UniRef50_C0N2X3 DNA mismatch repair endonuclease MutH n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N2X3_9GAMM Length = 223 Score = 295 bits (755), Expect = 8e-79, Method: Composition-based stats. Identities = 112/219 (51%), Positives = 156/219 (71%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 + PP+T+ +L+ + QQL+G TLG++AA +G+V P++L+R KGW+G LLE +LGA AG++ Sbjct: 4 IPPPDTQHELIRRCQQLAGKTLGQVAAELGVVVPDDLRRHKGWVGQLLENYLGADAGNQA 63 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 E DF LG+ELKT+P+++ G+P E+T+VC LT + + W+ S V+ KL VLW+PVE Sbjct: 64 EPDFTRLGIELKTLPLNANGQPKESTYVCTVTLTESGNMQWQQSWVKKKLAHVLWLPVEA 123 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 E + LAQR VG LW P E+++ LR+DWEELMD IVLG+ ITA+ G YLQIRPKA Sbjct: 124 ESATALAQRYVGQGWLWQPTVEQEKILRQDWEELMDRIVLGEQADITAKEGHYLQIRPKA 183 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 AN++ + + G GE L P+GFYL+ FT LLA F Sbjct: 184 ANSRVMAKTRGDEGEATLINPKGFYLRTAFTRELLATQF 222 >UniRef50_A9GFJ0 Methyl-directed mismatch repair n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GFJ0_SORC5 Length = 227 Score = 262 bits (670), Expect = 6e-69, Method: Composition-based stats. Identities = 102/216 (47%), Positives = 134/216 (62%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 L PP E +L +A L+G TL +LA +G P R KG G L+E LGA+AGS Sbjct: 5 LKPPRDEAELRCRAAALAGLTLADLARALGCAVPRGGVRTKGAAGALVERALGATAGSTS 64 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 DF LG+ELKTIPV+ G P E+TFVC L+ WETS R KL VLW+P+ Sbjct: 65 RPDFPHLGIELKTIPVEVTGAPQESTFVCSISLSDADRAEWETSRARAKLACVLWVPIIA 124 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 P +RR+G+P LW P + ++R LR D+E+LM +I +G VE +TA G +LQ+RPKA Sbjct: 125 APGRPSEERRLGAPRLWRPTDAQERVLRADFEDLMGLIGIGAVEALTAHAGRWLQVRPKA 184 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 A+ ++ T A G GE I LPRGFYL+ +FT A+L Sbjct: 185 ASGRSRTLAFGRDGEMIAALPRGFYLRPSFTGAILG 220 >UniRef50_Q89B33 DNA mismatch repair protein mutH n=1 Tax=Buchnera aphidicola (Baizongia pistaciae) RepID=MUTH_BUCBP Length = 220 Score = 256 bits (653), Expect = 5e-67, Method: Composition-based stats. Identities = 84/213 (39%), Positives = 134/213 (62%), Gaps = 4/213 (1%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASA--GSKPEQD 70 E++L A LSGY++ E+ + + +L R+KG++G +LE+ LG + G K D Sbjct: 5 CEKKLFMHAIGLSGYSIREIVSSLDQPVSNSLVRNKGFVGKILELILGVNVLHGYKC-ID 63 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNS-GVTWETSHVRHKLKRVLWIPVEGER 129 F +LG+ELK+IP++S G PLE TF+C PL NS +TW S+ K+K++LWIP+ G R Sbjct: 64 FPSLGIELKSIPINSSGYPLEPTFICNIPLKNNSLNITWNNSYFYRKIKKILWIPIIGNR 123 Query: 130 SIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAAN 189 + + VG +W+ + +++ L++DWEE MD+I++G+VE I+++HG+ LQ++ K N Sbjct: 124 VVSFLDKIVGEAFIWTMSSVQEKILKKDWEEFMDLIIIGKVEYISSKHGQVLQVKKKCKN 183 Query: 190 AKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 + I G T PR FY +K+FT +LL Sbjct: 184 KHVCIKFINYNGCVKFTNPRAFYFRKSFTWSLL 216 >UniRef50_C2WIM6 Type II restriction endonuclease n=12 Tax=Firmicutes RepID=C2WIM6_BACCE Length = 475 Score = 235 bits (600), Expect = 8e-61, Method: Composition-based stats. Identities = 53/221 (23%), Positives = 94/221 (42%), Gaps = 12/221 (5%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD 70 T E+L+ +A + G T GE+ L + KG +G ++E + G S E D Sbjct: 16 TTIEELMKKAHEAEGKTFGEIDTTDRLANA----KSKGGLGQVIEESFFGYEVNSNAEAD 71 Query: 71 FAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVE 126 F LG+ELK E + + +ETS K +++L + E Sbjct: 72 FKHLGIELKVTAFKQNKNGSLSAKERLVLNIINYMEEVHTHFETSSFWKKNEKLLLMFYE 131 Query: 127 GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + + LL++ E + +R+DWE +++ I +G+ ++ YL K Sbjct: 132 WVPGVDRKDFHITKSLLFTYPEADLEIIRQDWETIVNKIRVGKAHELSEGDTNYLGACTK 191 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 +N +L + I + R F LK ++ +AL+ R+ L Sbjct: 192 GSNKNSLRSQPYSE---IPAMQRAFSLKPSYMTALVRRYHL 229 Score = 56.2 bits (134), Expect = 9e-07, Method: Composition-based stats. Identities = 46/229 (20%), Positives = 93/229 (40%), Gaps = 22/229 (9%) Query: 14 EEQLLAQAQQLSGYTLGELAALVGL-VTPENLKRDKGWIGVLLEIWLGASAGSKPEQD-F 71 EE L ++ + G T E+A + + P K ++ +L+ LG + + F Sbjct: 248 EEILYSKFENYIGLTDQEIAQKLSIDYKPT----TKSFVPLLVSSLLGIKGTRLDKIEEF 303 Query: 72 AALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE--- 128 A +E KT+ ++ G+P ++ + +WE S +R + + ++ V E Sbjct: 304 AKANIEFKTVRLEPNGKPEQSMSFETIDFHQWTNESWEESEIRERFYQTKFLFVIFEFNQ 363 Query: 129 --RSIPLAQRRVGSPLLWSPNEEE-DRQLREDWEELMDMIVLGQVERITARHGEYLQIR- 184 + P + LW+ ++++RE WEE+ +I G R + ++ Sbjct: 364 TKKENPNRKLYFKGIKLWNMPAPTIEKEIRELWEEVNKVIHEGIQIEYKKRGDKVVEANN 423 Query: 185 -PK-AANAKALTEAIGARGERILTLPRG-------FYLKKNFTSALLAR 224 PK N A G +TLP G ++L ++ + ++ + Sbjct: 424 LPKINFNGVAHIRPKARNGADKVTLPDGQHITKQCYWLNNSYIAQIINK 472 >UniRef50_C2SK77 DNA mismatch repair enzyme MutH n=1 Tax=Bacillus cereus BDRD-ST196 RepID=C2SK77_BACCE Length = 461 Score = 199 bits (505), Expect = 9e-50, Method: Composition-based stats. Identities = 47/222 (21%), Positives = 93/222 (41%), Gaps = 9/222 (4%) Query: 7 LLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGS 65 + S TE+Q+ A++L G LG+L L K+DKG IG +++ + G A S Sbjct: 4 IYSESWTEQQIFDVAEELVGKRLGDLDKSGWLKK----KKDKGNIGNMIQSDFFGIPANS 59 Query: 66 KPEQDFAALGVELKTIPV---DSLGRP-LETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 DF +ELK P+ ++G E + + + + +E S V K + +L Sbjct: 60 IKGADFEHHHIELKVTPILKKKTVGYSSKERLVLGMINYMEDYRIPFEESIVNKKAQNML 119 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 + E + P+ + ++ + + ++ Q+R D++ ++D I G+ I+ + + + Sbjct: 120 LVFYLHEENKPVEEFKIIKTARFQLPKSDESQVRLDYQTIVDNIQQGKAHEISEKQQKIM 179 Query: 182 QIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 K + + R + K + SA Sbjct: 180 GACTKGQGKGKGKDGVDQPRSSEKAKSRAYSYKVGYMSAYFR 221 Score = 43.5 bits (101), Expect = 0.005, Method: Composition-based stats. Identities = 30/194 (15%), Positives = 60/194 (30%), Gaps = 23/194 (11%) Query: 48 KGWIGVLLEIWLGASAGS-KPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGV 106 K I L+ G + ++F G +KT+ + + T + Sbjct: 270 KSSIFNLIGFMFGTKGDNLNHTEEFLKEGYAIKTVRNRLNSKDNQDMSFSNIDYTEIAND 329 Query: 107 TWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIV 166 +E S + +I + R +W P++E Q E + ++ +M+ Sbjct: 330 EFEESTWYGWFAEIKYILTVWDEYEEGKNRFKDYT-IWIPDDELIEQASELFYQIKNMLN 388 Query: 167 LGQVE-RITA---RHGEY-------------LQIRPKAANAKALTEAIGARGERILTLPR 209 V I +HG + QIRPK + + + Sbjct: 389 TNSVRVEIDETVGKHGRWSDNLPGGKMDYPPFQIRPKGSGESVFVTL----THGLKIKKK 444 Query: 210 GFYLKKNFTSALLA 223 Y+ K + ++ Sbjct: 445 ALYINKEYIRKIVG 458 >UniRef50_UPI000196BAC3 hypothetical protein CATMIT_00199 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196BAC3 Length = 499 Score = 197 bits (500), Expect = 3e-49, Method: Composition-based stats. Identities = 61/224 (27%), Positives = 90/224 (40%), Gaps = 17/224 (7%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASA-GSKPEQD 70 ++ E+LL AQ G E L +KG IG ++E L S+ E D Sbjct: 10 QSIEELLETAQSSLGVPFKEYDVNNRL-----NGGNKGKIGQVIEEGLFHKGIDSRQEAD 64 Query: 71 FAALGVELKTIPVDSLGR------PLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIP 124 F LGVELK P++ +G+ E V + + V++E SHV KL +VL + Sbjct: 65 FPELGVELKVTPMNRVGKEKDMVSAKERLVVTMINYMKDYNVSFEDSHVLDKLNKVLLMF 124 Query: 125 VEGERSIPLAQRRVGSPLLWSP---NEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 E V L+ EE+ + + D+ ++D I G E I+ YL Sbjct: 125 YFNEPEKMNYDLCVDKIFLYQFKDIPEEDKQIIINDYNIIIDKIKSGHAEDISESDTYYL 184 Query: 182 QIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARH 225 K A A T + + R F LK + +ALL Sbjct: 185 SACTKGAT--AATSFVDQPFSSVKAKRRAFSLKPKYMTALLRSQ 226 Score = 41.6 bits (96), Expect = 0.021, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 33/92 (35%), Gaps = 2/92 (2%) Query: 86 LGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIP-VEGERSIPLAQRRVGSPLLW 144 G E+ WE S +R+K ++ V E + R LW Sbjct: 327 NGTIKESMSFPAFKFKDIVEENWEDSELRNKFINEKYLFCVFDEIDDSKYEYRFRGAFLW 386 Query: 145 SPNEEE-DRQLREDWEELMDMIVLGQVERITA 175 + E + D ++RE WE + + G I+ Sbjct: 387 AMPESDLDGKVREAWERTVYLAKHGIDFTISE 418 >UniRef50_B3DNU4 DNA mismatch repair protein n=4 Tax=Bifidobacterium RepID=B3DNU4_BIFLD Length = 502 Score = 192 bits (489), Expect = 5e-48, Method: Composition-based stats. Identities = 51/229 (22%), Positives = 95/229 (41%), Gaps = 21/229 (9%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE---IWLGASAGSKPEQDF 71 E+++ G + EL +T +KG +G ++E + ++ + P+ Sbjct: 27 EEVMNTLLTAKGKSFRELDQTGRALTGG----NKGSLGQIIEESVLKYAINSDAAPDIHI 82 Query: 72 AALGVELKTIPVDSLGRPLETT------FVCVAPLTGNSGVTWETSHVRHKLKRVLWIPV 125 ELK P+ + + +T+ + LT +E+S + K K ++ + Sbjct: 83 GDTSYELKVTPLKHIKKGKQTSAKERLVIDIINYLTLADETDFESSKMWDKAKNIILVYY 142 Query: 126 EGERSIPLAQRRVGSPLLWSP---NEEED-RQLREDWEELMDMIVLGQVERITARHGEYL 181 +R+ + R+ +L S E +D ++ DW + D + G + ++ YL Sbjct: 143 YDDRTDKKKELRIDCKVLASYLMKYEADDLATIKNDWHVIRDKVASGHADSLSESDTNYL 202 Query: 182 QIRPKAANAKALTEAI---GARGERILTLPRGFYLKKNFTSALLARHFL 227 K AN+K L EA GA I R F LK ++ +A+ AR L Sbjct: 203 AACTKGANSKQLREAPAPAGANTATIFAKQRAFSLKTSYMTAI-ARKLL 250 >UniRef50_B2TKK6 DNA mismatch repair enzyme MutH n=3 Tax=Clostridium RepID=B2TKK6_CLOBB Length = 476 Score = 191 bits (485), Expect = 2e-47, Method: Composition-based stats. Identities = 48/225 (21%), Positives = 91/225 (40%), Gaps = 18/225 (8%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGV-LLEIWLGASAGSKPEQDF 71 +E + A++L +L +L N K+ KG +G + E + G S+ E DF Sbjct: 9 NKESIENYARRLLNKSLKDLDENNKEHK-FNNKKSKGRLGQTVEEEYFGYKVNSRQEADF 67 Query: 72 AALGVELKTIPVDSLGRP--------------LETTFVCVAPLTGNSGVTWETSHVRHKL 117 +GVELK P+ ++ E + + + TWE + + K Sbjct: 68 NEVGVELKVCPIRTIKAKPKSNSIREQIGYSAKERIVLSIIDYFKLNEETWENNSIMKKC 127 Query: 118 KRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARH 177 K +L + E++IP + LW+P+E++ + + DW + + G+ I+ Sbjct: 128 KELLLMFYMNEKNIPKEELVFKIISLWTPSEQDLKIIENDWNTIALKVKKGKAHEISEGD 187 Query: 178 GEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 YL K + A+ G ++ R F K+ + ++ Sbjct: 188 TMYLGACTKGSTAE--KSKRGQPNSDVMAPQRAFSYKRQYVDYII 230 Score = 73.9 bits (180), Expect = 4e-12, Method: Composition-based stats. Identities = 45/207 (21%), Positives = 82/207 (39%), Gaps = 20/207 (9%) Query: 32 LAALVGLVTPENLKRDKGWIGVLLE----IWLGASAGSKPEQDFAALGVELKTIPVDSLG 87 ++ ++ L + + ++ K +I ++++ G + E F +ELKTI + G Sbjct: 271 VSEIISLYSIDRERKAKNYIRLIVDDVCKKVFGDKLENFEE--FKKANIELKTIFLKLNG 328 Query: 88 RPLETTFVCVAPLTGNSGVTWETSHVRHK--LKRVLWIP------VEGERSIPLAQRRVG 139 P E+ WETS +R K K+ LWI E + + L + Sbjct: 329 MPKESMSFEQIDYCEIVNEEWETSTIRDKFENKKHLWIIFKSKVNFEKQSELSLNDIVLD 388 Query: 140 SPLLWSPNEEE-DRQLREDWEELMDMIVLG-QVERITARHGEYLQIRPKAANAKALTEAI 197 + + W+ + D + + W + D I G + I GE IRPK N+K LT Sbjct: 389 NVMFWNMPISDLDSSMYKVWLDTTDKIKQGVYNDFIKISDGEIAHIRPKGQNSKDLTSTP 448 Query: 198 GARGERILTLPRGFYLKKNFTSALLAR 224 + + F+L + + + Sbjct: 449 ----QGTEERKKCFWLNAKYIKEQIEK 471 >UniRef50_Q0TSC8 Type II site-specific deoxyribonuclease n=3 Tax=Clostridiales RepID=Q0TSC8_CLOP1 Length = 495 Score = 161 bits (408), Expect = 1e-38, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 86/224 (38%), Gaps = 23/224 (10%) Query: 21 AQQLSGYTLGEL--------------AALVGLVTPENLKRDKGWIGVLLEI-WLGASAGS 65 AQ+L G T ++ K+ KG +G L+E + + Sbjct: 18 AQKLIGKTFNDVLNDYTKYESELLISEEKEEYAENHENKKRKGGLGDLIEECYFHYKCNN 77 Query: 66 KPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 DF GVELK P E + + ++E SH+ +K + +L Sbjct: 78 DSRPDFPDAGVELKVTPYKINKNKTLSAKERLIITMIDYFKVIEESFEDSHLWNKSQLIL 137 Query: 122 WIPVEGERSI-PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEY 180 I + I ++ L++P EE+ ++ D++ ++D I G+ ++ Y Sbjct: 138 LIYYLYSKDIGNRLDYKINYAKLFTPPEEDLEIIKNDFKIIVDKIKDGKAHELSEGDTMY 197 Query: 181 LQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 L KA+++ E + IL PR F K ++ + +L Sbjct: 198 LGAATKASSSSDRREQPFSN---ILAKPRAFSFKASYMTYVLNN 238 Score = 50.1 bits (118), Expect = 5e-05, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 66/161 (40%), Gaps = 19/161 (11%) Query: 14 EEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIW---LGASAGSKPEQD 70 EE ++ + +G T GEL + K K +++ LG + E Sbjct: 263 EEIIIDKINSYAGKTDGELCKIFD-------KEYKNNKAQWIDLAYRMLGIKSNKAEE-- 313 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKL--KRVLWIPVEGE 128 F + +K + ++S + +E++ + TWE S + + L ++ L++ + + Sbjct: 314 FEKANITVKALRIESKDKIVESSPLPTFKFKKLVEETWEESKLFNYLDQQKFLFVVYKKD 373 Query: 129 RSIPLAQRRVGSPLLWSPNEEED-RQLREDWEELMDMIVLG 168 + LW+ ++ +RE WE + ++I+ G Sbjct: 374 GDK----YVLKGAQLWNIPYDDLNTTVREGWENIRNVIIDG 410 >UniRef50_A8RH65 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RH65_9CLOT Length = 217 Score = 159 bits (401), Expect = 8e-38, Method: Composition-based stats. Identities = 53/227 (23%), Positives = 91/227 (40%), Gaps = 38/227 (16%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAAL 74 ++ + +L+G EL + + +KG G LLE+ LG S DF Sbjct: 4 QEACVRFNKLAGIRFEELFSQNDMNMIII---NKGKTGQLLELALGMHLSS-TNLDFDD- 58 Query: 75 GVELKTIPVDSLGRPLETTFVCVAPLTGNSGVT---WETSHVRHKLKRVLWIPVEGERSI 131 ELKT D+ G P ET F+ + + +E +H+ K+ +L++PV + Sbjct: 59 -GELKTNKCDNAGNPKETVFITQISSVIDELIQERPFEETHLYEKISNILYVPVCKD--- 114 Query: 132 PLAQRRVGSPLLW-----------SPNEEEDRQL-REDWEELMDMIVLGQVE-----RIT 174 GSP W SP + R + R D+ + + + +E RI Sbjct: 115 -------GSPKDWMFLPSIHIDLSSPRYAQLRDIWRSDYYSICRQLRM-HIETSPDRRIH 166 Query: 175 ARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSAL 221 +G+++Q+R K + + G G + FY +K F + Sbjct: 167 TSNGQHIQVRSKDSMPYHPIYS-GVYGRYVSNKNHAFYFRKEFVYDI 212 >UniRef50_B9E7L8 Type II restriction enzyme n=1 Tax=Macrococcus caseolyticus JCSC5402 RepID=B9E7L8_MACCJ Length = 462 Score = 158 bits (399), Expect = 1e-37, Method: Composition-based stats. Identities = 49/223 (21%), Positives = 94/223 (42%), Gaps = 17/223 (7%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPEQD 70 ET +L+ A++ G L E+ L KG +G ++E + G S E D Sbjct: 10 ETVNELIMYAKEAEGKYLYEIDKKDMLENTNV----KGSVGHIIEASYFGYEINSNAEPD 65 Query: 71 FAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVE 126 FA +G+ELK + + + E + + + V +E+S K K +L E Sbjct: 66 FADVGIELKATGIQKMKKGDLKAKERLVLNIINYCDEATVEFESSSFWKKNKHLLLFFYE 125 Query: 127 GERSIPLA-QRR---VGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQ 182 ++ RR + + EE+ + +++DW + + I+ G+ ++ L Sbjct: 126 YKKDKNNKMDRRNAQIIKVEDFQFPEEDLKTIKQDWNIIHNKIINGEAHLLSEGDTLILG 185 Query: 183 IRPKAAN-AKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 K + AK+L + + IL R + +K+ + S L+ + Sbjct: 186 ACTKGSTAAKSLRKQPNSD---ILAKQRAYSIKQGYMSNLVRK 225 Score = 44.7 bits (104), Expect = 0.002, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 75/209 (35%), Gaps = 21/209 (10%) Query: 26 GYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD-FAALGVELKTIPVD 84 G + E+A + + + K I L+ LG + + F ++ KTI ++ Sbjct: 259 GKSDVEIAEQLNITLSDG----KNRIAHLISNMLGIKKNDLSKIEEFEKYSIKFKTITLN 314 Query: 85 SLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLW 144 + + E + W+ SH++ + W+ V + + +G LW Sbjct: 315 NEDKLKEHMSFENIDFDEYATSEWDDSHLKTMFESTKWLFVVFKEDDKHNKIFLGYK-LW 373 Query: 145 SPNEEEDRQLREDWE----------ELMDMIVLGQVERI-TARHGEYLQIRPKAANAKAL 193 + + +++ ++ G + A+ IRPKA NA+ Sbjct: 374 NMPMTDIEGPLQEFFIKTQRILNEGVILSQTSRGIRNNLPGAKDNPICHIRPKARNAEDK 433 Query: 194 TEAIGARGERILTLPRGFYLKKNFTSALL 222 + G+ I + F+L K + +L Sbjct: 434 VKLPD--GQCIT--KQCFWLNKGYIKNIL 458 >UniRef50_D2RCS3 DNA mismatch repair enzyme MutH n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RCS3_GARVA Length = 482 Score = 152 bits (385), Expect = 6e-36, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 83/230 (36%), Gaps = 19/230 (8%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIW-LGASAGSKPEQDF 71 +++ A ++G T E+ +KG +G ++E G S S + D Sbjct: 13 DIDEITALLTGINGKTFREIDKTGRGEKAG----NKGSLGNIIEESVFGYSINSDKQPDI 68 Query: 72 --AALGVELKTIPVDSLGRPL----ETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPV 125 A ELK P+ S + E V + ++ S K ++++ I Sbjct: 69 MVADEFYELKVTPIRSNKKKKIVAKERLVVDIINYLTLCNEVFDNSSFWQKARQMIIIYY 128 Query: 126 EGERSIPLAQRRVGSP----LLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 R+ Q R+ + E +R+DWE + + + G + ++ YL Sbjct: 129 LDNRTDKTNQSRLDCTIYGSFVLKYEPGELETIRKDWEYIKNKVAAGHADALSESDTNYL 188 Query: 182 QIRPKA-ANAKALTEAIGARG---ERILTLPRGFYLKKNFTSALLARHFL 227 K + AK+ +A G + I R F K ++ + + R Sbjct: 189 AACTKGESAAKSFRDAPAPSGFESKYIRAKQRAFSYKPSYMTIVAQRVLG 238 >UniRef50_B8ZN89 Putative restriction enzyme n=1 Tax=Streptococcus pneumoniae ATCC 700669 RepID=B8ZN89_STRPJ Length = 496 Score = 149 bits (376), Expect = 7e-35, Method: Composition-based stats. Identities = 41/191 (21%), Positives = 75/191 (39%), Gaps = 10/191 (5%) Query: 46 RDKGWIGVLLEIWL-GASAGSKPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPL 100 + KG G +E + G S E DF +GVELK P E + + Sbjct: 62 KSKGQYGNYIEKYFYGYQPNSDSEADFEKIGVELKVTPFKINKNGTLSAKERLVLTILNY 121 Query: 101 TGNSGVTWETSHVRHKLKRVLWIPVEG-ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWE 159 + + ++H+ K ++L + G + + + L+ EE+ + ED++ Sbjct: 122 MEENLEDFYSTHLWKKCAKILLLFYNGLIPNQTMKDYVIEKIFLYEWFEEDMAVILEDYQ 181 Query: 160 ELMDMIVLGQVERITARHGEYLQIRPKAA-NAKALTEAIGARGERILTLPRGFYLKKNFT 218 ++ D I G+ ++ G YL K A K L + + L R + LK ++ Sbjct: 182 KITDKIKNGKAHELSESDGNYLSTCTKGAGKGKDLRQQPFSH---ELAKQRAWELKSSYM 238 Query: 219 SALLARHFLIQ 229 + L+ Q Sbjct: 239 TYLINHKIFNQ 249 Score = 52.7 bits (125), Expect = 8e-06, Method: Composition-based stats. Identities = 26/126 (20%), Positives = 51/126 (40%), Gaps = 2/126 (1%) Query: 44 LKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLE-TTFVCVAPLTG 102 + KG L+ LG + ++F + L+ I VD P E + F Sbjct: 292 NSKAKGKNSTLIRKILGLTGDLDKTKEFQKANMNLRVIRVDKNNLPKEDSPFKTYCFKEL 351 Query: 103 NSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELM 162 + +WE+SHV +++ ++ V + P + S W + + +++ W+E Sbjct: 352 AATDSWESSHVYNEIYNKRFLFVIFKEIEPKL-FVLDSIKFWGFQDRQLEEIQRVWQETR 410 Query: 163 DMIVLG 168 +I G Sbjct: 411 QIISDG 416 >UniRef50_P16667 Type-2 restriction enzyme Sau3AI n=2 Tax=Staphylococcus RepID=T2S3_STAAU Length = 489 Score = 147 bits (372), Expect = 2e-34, Method: Composition-based stats. Identities = 49/221 (22%), Positives = 89/221 (40%), Gaps = 15/221 (6%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFA 72 T++ + +A++ G ++ EL E++K+ K +G E W G S + D A Sbjct: 6 TKQAVHNRAKEAVGKSVLELNGG------ESIKQSKSSVGDAFENWFGKKKDSDSKPDMA 59 Query: 73 ALGVELKTIPVDSLGRPL----ETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 128 GVELK P L E + + + +ETS K + E Sbjct: 60 EAGVELKATPFKKLKNGKYSSKERLVLNIINYEKVANENFETSSFLSKNNTIELAFYEYI 119 Query: 129 RSIPLAQRRVGSPLLWSPNEE--EDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + P + +L+ ++ + +++DWE + I G+ ++ YL K Sbjct: 120 KGTPSDNWIIKEAVLYEMHKNPIDYEIIKQDWEIINQYINEGKAHELSEGLTSYLAPCTK 179 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 ANA +L + I R F LK + +++L ++ L Sbjct: 180 GANASSLRNQPYSD---IKAKQRAFSLKSGYMTSILRKYVL 217 >UniRef50_C5WEK1 Restriction enzyme n=4 Tax=Bacteria RepID=C5WEK1_STRDG Length = 496 Score = 145 bits (365), Expect = 1e-33, Method: Composition-based stats. Identities = 40/190 (21%), Positives = 70/190 (36%), Gaps = 8/190 (4%) Query: 46 RDKGWIGVLLEIWL-GASAGSKPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPL 100 + KG G +E + G S DF +GVELK P E + + Sbjct: 62 KSKGQYGNYIEKYFYGYQPNSDSSADFDKIGVELKVTPFKVNKNGSISAKERLVLTILNY 121 Query: 101 TGNSGVTWETSHVRHKLKRVLWIPVEG-ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWE 159 + + +H+ K ++L + G + LA + L+ EE+ + ED++ Sbjct: 122 MEENLDDFYQTHLWEKCSKILLLFYNGLIPNQTLADYIIEKVFLYEWFEEDMELILEDYK 181 Query: 160 ELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTS 219 + D I G+ ++ G YL K A + L R + LK ++ + Sbjct: 182 RITDKIKQGRAHELSESDGNYLSTCTKGAGKG--KDFRSQPFSDELAKQRAWELKSSYMT 239 Query: 220 ALLARHFLIQ 229 L+ Q Sbjct: 240 YLINHKIFSQ 249 >UniRef50_O87348 LlaKR2I restriction enzyme n=2 Tax=Lactococcus lactis RepID=O87348_9LACT Length = 496 Score = 141 bits (356), Expect = 1e-32, Method: Composition-based stats. Identities = 41/224 (18%), Positives = 80/224 (35%), Gaps = 12/224 (5%) Query: 15 EQLLAQAQQLSGYTLGELAALV--GLVTPENLKRDKGWIGVLLEIWL-GASAGSKPEQDF 71 EQ++ + Q T + + E + KG G +E + G + DF Sbjct: 29 EQVIEKYQSTPYKTYADFQSKTVSDFKDKEISLKSKGQYGNNIEKYFYGYQPNNDSVADF 88 Query: 72 AALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 +GVELK P E + + + + ++H+ K ++L + G Sbjct: 89 EKIGVELKVTPFKVNKNGTISAKERLVLTILKYMEENLDDFYSTHLWQKCSKILLLFYNG 148 Query: 128 E-RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + + L+ EE+ + +D++ + + I G+ ++ G YL K Sbjct: 149 LLDGQTIEDYVIKKVFLFEWFEEDMEVILDDYKRITEKIKSGKAHELSESDGNYLSTCTK 208 Query: 187 AA-NAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 A + L E R + LK ++ + L+ Q Sbjct: 209 GAGKGRDLRE---QPFSDEPAKQRAWELKSSYMTYLINNKIFNQ 249 Score = 55.8 bits (133), Expect = 1e-06, Method: Composition-based stats. Identities = 29/151 (19%), Positives = 53/151 (35%), Gaps = 7/151 (4%) Query: 19 AQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVEL 78 + + G + EL + + KG L+ LG + Q+F + L Sbjct: 272 ERILEFKGMSEAELYEKFSI-----NPKAKGKNSTLVRTILGLTGDLDKTQEFQKANMNL 326 Query: 79 KTIPVDSLGRPLETTFVCVAPLTG-NSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRR 137 + I VD G P E + + WE SH ++ ++ V + P Sbjct: 327 RVIRVDKNGLPKEDSPFKNYNFKELATNDDWEESHPYQEICNKRFLFVVFKEVKPKI-FI 385 Query: 138 VGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 + S W + + +++ W E +I G Sbjct: 386 LDSIKFWGFPDRQVEEVQRVWAETRRIITEG 416 >UniRef50_Q6SFK8 Type II restriction enzyme, putative n=1 Tax=uncultured marine bacterium 580 RepID=Q6SFK8_9BACT Length = 235 Score = 137 bits (344), Expect = 4e-31, Method: Composition-based stats. Identities = 49/216 (22%), Positives = 82/216 (37%), Gaps = 15/216 (6%) Query: 21 AQQLSGYTLGELAALVGLVTPENL---KRDKGWIGVLLE-IWLGASAGSKPEQDFAALGV 76 A++L+G TL ++ + + E L +KG G +E + G S E DF+ + Sbjct: 15 AKKLTGKTLRDVLSPQIIQESEKLFINNGNKGRFGHKIEKHYFGYDINSDKEADFSLCKL 74 Query: 77 ELKTIPV----DSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIP 132 ELK P+ D P E + TW TS + K+ +L I + Sbjct: 75 ELKVTPMKLKNDGSLTPKERLVNNIINFDEIVKETWHTSSLLKKINSILLIRYLDPMNKE 134 Query: 133 LAQRRVG----SPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAA 188 + Q + S N E+ Q +DW +++ I G+ ++ YL K Sbjct: 135 INQLDYKILDVQIINLSSNAEDFNQFEKDWNLIVNKIKDGKAHELSESDTMYLGACTKGE 194 Query: 189 NAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 N ++ + + R F K + LL R Sbjct: 195 NKESQRSQPYS---SEKAMQRAFSFKTQYMKILLDR 227 >UniRef50_C3FCF3 DNA mismatch repair enzyme MutH n=2 Tax=Bacillus thuringiensis RepID=C3FCF3_BACTU Length = 453 Score = 135 bits (341), Expect = 8e-31, Method: Composition-based stats. Identities = 49/219 (22%), Positives = 86/219 (39%), Gaps = 13/219 (5%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD 70 +TEEQLLA + + G + E+ L + DKG +G ++E + + PE D Sbjct: 8 KTEEQLLAFTEGIIGKSFKEIDTKGLL---QGNSNDKGILGKVVETGFYDYELNNCPEAD 64 Query: 71 FAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVE 126 F LG+ELK + L E + + +E S V K K++L I E Sbjct: 65 FGELGIELKVSGFNKLRNGSWSAKERISLSMINYKNIIHEEYEFSKVVSKNKKLLIIWYE 124 Query: 127 GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + +P + L+ + +++ +R D+ + ++ G +++ L K Sbjct: 125 YMKGVPYGDFIIRDFQLYDMS-QDEAIIRNDFYTIKQKVLEGLAHKLSEGDTVILGAATK 183 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARH 225 + I R F LK +F +L H Sbjct: 184 GQKG----QKAEQPNSEISAPTRAFSLKNSFFRGVLRSH 218 >UniRef50_Q30PF3 DNA mismatch repair enzyme MutH n=1 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30PF3_SULDN Length = 467 Score = 132 bits (333), Expect = 7e-30, Method: Composition-based stats. Identities = 36/181 (19%), Positives = 70/181 (38%), Gaps = 8/181 (4%) Query: 48 KGWIGVLLEIWLGASA-GSKPEQDFAALGVELKTIPVDSLG----RPLETTFVCVAPLTG 102 KG G +LE + A S E DF + +ELK+ P+ L R E + + Sbjct: 41 KGSFGQILEKFYFLYAPNSNAEADFPEVNLELKSSPLKQLKNNQFRAKERLSLNIINYQD 100 Query: 103 NSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELM 162 +ETS K + +L + E + + W+ + +++DW+ + Sbjct: 101 IVHQNFETSSFWKKNENLLLVFYLYENDTNVLDYVIKLVDEWTFPSIDLEIIKQDWKRIK 160 Query: 163 DMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 ++ G+ ++ YL PK E + + R + LK+ + + ++ Sbjct: 161 QKVLDGKAHELSEGDTFYLGAAPKGGKGGNPREQPNS---TLTAKQRAYSLKQGYVNHII 217 Query: 223 A 223 A Sbjct: 218 A 218 Score = 42.0 bits (97), Expect = 0.015, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 62/169 (36%), Gaps = 9/169 (5%) Query: 3 QPRPLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGAS 62 Q + EE ++++ + T+ ++ + ++ + K + L + LG Sbjct: 232 QSTEITKDKTLEEIVVSKFESYYDKTIEDI---LAMLNINLNLKAKNFYANLTKAILGIE 288 Query: 63 AGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKL-KRVL 121 + E +F + +KTI + P E TW+ S + + L + L Sbjct: 289 LNKEIE-EFEKAEIIVKTIRLKDNNLPKEDISFPNFKYENIVNETWDESEISNILGHKFL 347 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQV 170 ++ + E + W+ ++ ++ + W +++ G + Sbjct: 348 FVFFQFE----NKKLIFKKAQFWNMPYKDILEVEKVWARTKEIVQSGDI 392 >UniRef50_C0ETP8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C0ETP8_9FIRM Length = 507 Score = 131 bits (330), Expect = 1e-29, Method: Composition-based stats. Identities = 45/229 (19%), Positives = 77/229 (33%), Gaps = 13/229 (5%) Query: 9 SPPETEEQLLAQAQQLSGYTLGE---LAALVGLVTPENLKRDKGWIGVLLEI-WLGASAG 64 + E++ +A++ G E LA K KG IG E W + Sbjct: 9 KEYKNSEEVKKRAEEAIGRPFKEIFELAKKFQEKNGLKEKHGKGDIGQAYEEGWFNYACN 68 Query: 65 SKPEQDFAALGVELKTIPVDSLGRP---LETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 E DF +ELK P R E + + +E S K +L Sbjct: 69 KDAEPDFKEADIELKVTPFLKNSRGYRAKERLSLGKINYKDENWNEYEESRFWIKNHHLL 128 Query: 122 WIPVEGERSIPLAQR---RVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHG 178 + + + I ++ LL E + ++ DWE++ + G+ +T R Sbjct: 129 IMYYQYIKGIDRENFSVQKIDEILLNELPERDKSIIQHDWEKIARYVKEGRAHELTERDF 188 Query: 179 EYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 YL K + + + PR + KK++ + L L Sbjct: 189 MYLSPATKGSGGN---KKVKYDDRYPKAKPRAYSFKKSYMTKLFNERML 234 Score = 41.2 bits (95), Expect = 0.027, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 50/119 (42%), Gaps = 8/119 (6%) Query: 54 LLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGN-SGVTWETSH 112 +++ + + +F +L+TI V+ G PLE TW+ S Sbjct: 290 IIKQIYKSDCDLEKTDEFQKANYKLRTITVNKNGMPLEDMSFSTFDFEELLKEKTWKESI 349 Query: 113 VRHKL---KRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 V ++ K +L I + + + ++W ++++ +++E WEE ++I G Sbjct: 350 VYDEMIDSKFLLVIFSKNSEGKE----ILNNAMMWYIPKKDENKVKEVWEETRNVIKNG 404 >UniRef50_C8JWG6 Restriction endonuclease n=4 Tax=Bacillales RepID=C8JWG6_LISMO Length = 555 Score = 130 bits (328), Expect = 3e-29, Method: Composition-based stats. Identities = 53/256 (20%), Positives = 91/256 (35%), Gaps = 48/256 (18%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPEQD 70 +T+E++ +A + G TLGE+ +K + G ++E IW A + E D Sbjct: 6 DTKEKVHNRALEAIGKTLGEIDE-----NNLKNVNNKSYPGNVIEQIWYDHPADNISEPD 60 Query: 71 FAALGVELKTIPVDSLGRPL-----------ETTFVCVAPLTGNSGVTWETSHVRHKLKR 119 F GVELK P+D + E + ++E S HK K Sbjct: 61 FKEAGVELKVTPIDKQSKGNKDKKVTRLIAGERLVLNKIHYKNEYKKSFEESSFWHKNKV 120 Query: 120 VLWIPVE------------GERSIPLAQRRVGSPLLWSP----------------NEEED 151 + I E Q ++ L S ++++ Sbjct: 121 IELIQYYRRDTSDKKKFSRKELIEDKKQFKIAYATLLSMVDLTDFNLPKDTVLEISDKDF 180 Query: 152 RQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGF 211 +++DWE++ +I + E ++ YL KAA T + G I PR + Sbjct: 181 EIIKQDWEKISKLINESKAEELSEGMTNYLGACTKAATGAEFTTQV---GSEIKPKPRAY 237 Query: 212 YLKKNFTSALLARHFL 227 K F + L+ + Sbjct: 238 SFKTKFINELINTQII 253 >UniRef50_D2NQK8 DNA mismatch repair protein n=1 Tax=Rothia mucilaginosa DY-18 RepID=D2NQK8_9MICC Length = 499 Score = 125 bits (314), Expect = 1e-27, Method: Composition-based stats. Identities = 41/188 (21%), Positives = 73/188 (38%), Gaps = 7/188 (3%) Query: 47 DKGWIGVLLEIW-LGASAGSKPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPLT 101 KG G L+E G S PE D A VE+KT P E + + Sbjct: 62 SKGIYGSLMEKAGYGIEPNSSPEADIPAAKVEIKTTPYRVTANGAISAKERLVLSMFNFH 121 Query: 102 GNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEEL 161 + + +H+ HK + +L + + +++ ++ L+ EE+ + ED++ + Sbjct: 122 EENLDDFYQTHLWHKCQNILLLFYKYQKTRDISNNITDKFFLFDWPEEDMPTILEDYKRI 181 Query: 162 MDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSAL 221 ++ G+ ++ G YL K A K G L R + LK ++ + L Sbjct: 182 TQKVLEGRAHELSESDGMYLSTCRKGA-GKDKDRTTQPYGPE-LANRRAWSLKSSYMTTL 239 Query: 222 LARHFLIQ 229 L Q Sbjct: 240 LRTKVFSQ 247 Score = 51.6 bits (122), Expect = 2e-05, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 62/173 (35%), Gaps = 19/173 (10%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRD-----------KGWIGVLLEIWL 59 + E++ +A+A Q + ++ L ++ KG L+ L Sbjct: 246 SQEEQESIARAAQDTSKPFTQIIEEKLLQYRGQSEKKLCKEFDVNFNAKGRNSTLVRKIL 305 Query: 60 GASAGSKPEQDFAALGVELKTIPVDSLGRPLE-TTFVCVAPLTGNSGVTWETSHVRHKL- 117 G S +F + ++ I VD G P E + F + + WE SH+ ++ Sbjct: 306 GLSTDIDSTAEFKKANMNIRAIRVDKNGLPKEDSPFKNYRFIELANEKEWEESHIYTEIF 365 Query: 118 --KRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 K + + E E + W E + +L W+E ++I G Sbjct: 366 SRKFMFVVFRENESG----DFCLDQIKFWGFPEPLEPELMRVWQETKEIINRG 414 >UniRef50_Q6F1F9 Type II restriction enzyme Sau3AI, GATC site n=1 Tax=Mesoplasma florum RepID=Q6F1F9_MESFL Length = 452 Score = 122 bits (307), Expect = 7e-27, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 83/221 (37%), Gaps = 16/221 (7%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLG-ASAGSKPEQD 70 +T + + +A++ LGE+ + + +DKG IG L++ + ++ E D Sbjct: 3 KTIDDIFRRAKEAENKQLGEI---INFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESD 59 Query: 71 FAALGV--ELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIP 124 F LG+ ELK + + E + + N + S V +K K +L I Sbjct: 60 FKELGLDLELKVTGLKQNKKKQWIAKERVSLSMID-FNNIEKDFTDSKVLNKNKNILLIG 118 Query: 125 VEGERSIPLAQRRVGSPLLWSP---NEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 E + ++ +LW +E E + + D+ + I G ++ + L Sbjct: 119 HEYNKDKNRKNLKIIKSILWKYENISENEKKIIENDYSIIYQKIQNGIAHELSCSDTKIL 178 Query: 182 QIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 + K + R F LKK + + +L Sbjct: 179 EATTKGQGKN--KDLRNQPNSDFKAKSRAFALKKWYVNYIL 217 >UniRef50_A1A2S7 Putative type II restriction enzyme Sau3AI n=1 Tax=Bifidobacterium adolescentis ATCC 15703 RepID=A1A2S7_BIFAA Length = 643 Score = 112 bits (280), Expect = 1e-23, Method: Composition-based stats. Identities = 47/248 (18%), Positives = 86/248 (34%), Gaps = 41/248 (16%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKP 67 SP T+ + +A G TL E+ + +K + G ++E W A + Sbjct: 126 SPYTTKTAVQLRANDAVGRTLEEID-----FNNTEKQGNKSYPGNVIEHVWFDHPADNIS 180 Query: 68 EQDFAALGVELKTIPVDSLGRPL--------ETTFVCVAPLTGNSGVTWETSHVRHKLKR 119 DF VELK P+D + + E + S + TS K + Sbjct: 181 APDFPEAEVELKVSPID-IKKSKDGPSCIAGERLVLNTINYAKESTADFRTSSFWKKNRF 239 Query: 120 VLWIPVE----GERSIPLAQRRVG------------------SPLLWSPNEEEDRQLRED 157 + + E+ +R+ L +E ++ +D Sbjct: 240 IELMQYLRRIASEKGQSEDKRKYKIEYAHLLAMDCFAEPNFPQNSLLPLSEATMLRIEQD 299 Query: 158 WEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNF 217 W + IV + + +T + L K AN K ++ + G R R + K++F Sbjct: 300 WNTIHQFIVENRADELTEGMTDTLAACTKGANNKQ--KSRQSNGAR--AKSRAYCFKQSF 355 Query: 218 TSALLARH 225 ++LL + Sbjct: 356 MTSLLQNY 363 Score = 49.7 bits (117), Expect = 8e-05, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 61/175 (34%), Gaps = 23/175 (13%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQ--- 69 + +L G + E+A P+N + +L+E LG++ + Sbjct: 385 CDSIILDYFNPFKGKSFTEIAEQFHFTIPKNASPKQTNF-MLVEHMLGSNTSISKDPNDD 443 Query: 70 ----DFAAL-GVELKTIPVDSLGRPLETTFVCVAPLTGN-SGVTWETSHVRHKLKRVL-- 121 D L G+ +KT+P+ G+P E V P + W+T + L ++L Sbjct: 444 YVSFDAEELKGIRVKTLPLFH-GKPSEQFKVQSIPDFNDLINQKWDTDNC--ALHQLLET 500 Query: 122 ---WIPVE-----GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 + V E+ W + + + W+E ++ + G Sbjct: 501 TKFLVFVFDSQNPNEKDKNPENIYFKGAAFWYMPASDIDIVEQVWKEDVEKLRNG 555 >UniRef50_Q3IRH4 DNA mismatch repair protein homolog n=1 Tax=Natronomonas pharaonis DSM 2160 RepID=Q3IRH4_NATPD Length = 285 Score = 105 bits (261), Expect = 1e-21, Method: Composition-based stats. Identities = 48/224 (21%), Positives = 84/224 (37%), Gaps = 23/224 (10%) Query: 11 PETEEQLLAQA-QQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPE 68 P+ E++ + +A L G + GE+ + + + R K + ++E + S E Sbjct: 8 PDAEQEDILRATNALLGRSFGEIDEEIQGMADDERARSKHGVANVIESGYFNIETNSSAE 67 Query: 69 QDFAALGVELKTIPVDSLG-----RPLETTFVCVAPLTG-NSGVTWET-SHVRHKLKRVL 121 DF G+ELK P+ G RP E +C+ W +R KL++VL Sbjct: 68 PDFGKAGIELKVTPLRLTGNNELVRPKERLVLCMCDYMEVVDADHWTDVPALRKKLEQVL 127 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEED--RQLREDWEELMDMIVLGQVERITARHGE 179 I + LW P E + ++ D+E I+ G+V + +H + Sbjct: 128 IIWYIHIVGQDRRTYPIVWWTLWEPMTEPEWSEIIQSDFEICKKRILNGEVP--SEKHTQ 185 Query: 180 YLQIRPK----------AANAKALTEAIGARGERILTLPRGFYL 213 L PK + ++ A A RG+ + Sbjct: 186 LLGTCPKHGGGYDRENPEESPRSSRVAQDAHPTLDYAEKRGWSI 229 >UniRef50_C4Z3T8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C4Z3T8_EUBE2 Length = 216 Score = 103 bits (258), Expect = 3e-21, Method: Composition-based stats. Identities = 44/190 (23%), Positives = 77/190 (40%), Gaps = 13/190 (6%) Query: 42 ENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLT 101 N+ +KG G +LE+ +G + DF ELKT + LG P ET F+ Sbjct: 28 NNIVVNKGKTGQILELTIGLQLSNTT-LDFED--GELKTNKCNKLGIPAETMFITQTASI 84 Query: 102 GNSGVT---WETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWS-PNEEED-RQLRE 156 + ++ +E S + K+ R+L++ + E + + S P E QL+ Sbjct: 85 IDELLSFKLFEESKLYKKINRMLYVRINKEGAPENWMFLPPILVDLSLPKYAELAAQLKF 144 Query: 157 DWEELM----DMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFY 212 D+ ++ D + L + +G+Y+QIR K + + + R FY Sbjct: 145 DYIDICDILNDQLCLSPDAMLHTANGKYIQIRTKDSIPYHAIYSR-KYDRIVSDKNRAFY 203 Query: 213 LKKNFTSALL 222 K L+ Sbjct: 204 FKIPCLKYLI 213 >UniRef50_D1NWA3 DNA mismatch repair enzyme MutH family protein n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NWA3_9BIFI Length = 485 Score = 90.1 bits (222), Expect = 5e-17, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 80/227 (35%), Gaps = 23/227 (10%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPEQD- 70 T + + G TLGEL L +P +KG G ++E LG A S D Sbjct: 8 TRTDITRRLNACIGKTLGELDTKHVLDSPT----NKGVAGHVIEKSVLGYPADSARRPDL 63 Query: 71 -FAALGVELKTIPVDSLGR------PLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWI 123 L VELKT + + + E + +E+S HK++ +L++ Sbjct: 64 NIDGLDVELKTTGIRASKKEGISFEAKEPMSITALKPDEIIDEEFESSAFWHKVQNMLFV 123 Query: 124 PVEGE----RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGE 179 E + + V + + E+ QL+ DW + + + E + E Sbjct: 124 CYVYEGKPKKGRDYSNFPVHTAFFKTFEGEDKEQLQRDWTTIREFVRDLYEENPDSPAFE 183 Query: 180 YLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 K ++ + + P ++ F ++++ H Sbjct: 184 ------KISSGISKEVSPYLDTAPKWPHPPRIRFRRAFVNSIINEHL 224 Score = 46.6 bits (109), Expect = 6e-04, Method: Composition-based stats. Identities = 32/174 (18%), Positives = 69/174 (39%), Gaps = 15/174 (8%) Query: 14 EEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD-FA 72 + +L + +G T+ ++AA + P+ K K ++ G ++ F Sbjct: 245 DHKLQELTNKFAGKTIAQIAAYFDMEIPD--KVPKQLAEQIVVRMFGGTSRKMHNIALFQ 302 Query: 73 ALGVELKTIPVDSLGRPLETTFVCVAPLTG-NSGVTWETSHVRH--KLKRVLWIPVEGER 129 +G+ KT+ ++ G P E T + S +E S + K +L++ V E Sbjct: 303 KMGIVGKTVALNPQGMPTEDTKLMSIDFNELESNDVYEESDFYYFFKDNDLLYM-VFQEP 361 Query: 130 SIPLAQRRVGSPLL------WSPNEEEDRQLREDWEELMDMIVLGQVERITARH 177 + ++ R G L P E++ + W+ + D+I+ ++ + Sbjct: 362 EVGIS--RGGEVFLGFARVPLEPLAEKNGLVESVWDIMRDLIISRKLRLVPETD 413 >UniRef50_Q03GQ2 Type II restriction endonuclease, MutH family n=5 Tax=Bacilli RepID=Q03GQ2_PEDPA Length = 492 Score = 89.3 bits (220), Expect = 1e-16, Method: Composition-based stats. Identities = 43/231 (18%), Positives = 85/231 (36%), Gaps = 21/231 (9%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPEQDF 71 T+ QL+ ++ TLGE+ +N K+ G G ++E LG + S D Sbjct: 7 TKTQLIGLLERAVNKTLGEVDTAHVFDKTKNNKKITGIAGDVIENSVLGYPSDSYQSPDL 66 Query: 72 --AALGVELKTIPVDSLGRP--------LETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 + +ELKT + + E + + +E+S+ HKL+R+L Sbjct: 67 CVDGVDIELKTTGIKVAKKKSKSHLFEAKEPMSITAVSPEKITTEEFESSNFWHKLERML 126 Query: 122 WIPVEGERSI-----PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITAR 176 + + + A + ++E+ L+ DW+ + I Q Sbjct: 127 LVYYFYDSDVTVTAAEYANFLIKGYDFHEFSKEDQETLKNDWKIVQQFIQNLQNGYGEEA 186 Query: 177 HGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 +Y +I + + + + PR F LK+ + ++ +F Sbjct: 187 SSQYPRISSEIRSQLMMIDTAPKW----PHRPR-FRLKRATVTTMVQEYFG 232 >UniRef50_Q9ACM4 R.Sth368I endonuclease n=1 Tax=Streptococcus thermophilus RepID=Q9ACM4_STRTR Length = 491 Score = 80.9 bits (198), Expect = 3e-14, Method: Composition-based stats. Identities = 43/233 (18%), Positives = 85/233 (36%), Gaps = 25/233 (10%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQDF 71 TEE+++ ++ G TLG++ + + G G ++E G A +K E D Sbjct: 7 TEEEVVRILDKIVGKTLGQIDKNNVFDKTKEKPKITGIAGDVIEQCVFGYQADNKQEADL 66 Query: 72 AALG--VELKTIPV---------DSLGRPLETTFVCVAPLTGNSGVT-WETSHVRHKLKR 119 G VELKT + E + + + S + HKL+ Sbjct: 67 LIDGRPVELKTTGLKKSSPKSKSKHALEAKEPMSITAVSVDKIGSQDNFYESTLWHKLEE 126 Query: 120 VLWIPVEGERSI-----PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERIT 174 +L + + + + +EE+ + L DW+ + D + + + Sbjct: 127 LLIVYYLYDSDKIVKASEYSNFPIQGYQFLIFSEEDKKVLENDWKIVRDFVRNVEKNALD 186 Query: 175 ARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 + E+ +I +K TE P F L +++ + ++ +HF Sbjct: 187 -KSIEFPKI------SKLRTEMSYMDTAPKYPHPPRFRLTRSYVTTIVQQHFG 232 >UniRef50_A6BKR2 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BKR2_9FIRM Length = 517 Score = 80.1 bits (196), Expect = 6e-14, Method: Composition-based stats. Identities = 47/240 (19%), Positives = 78/240 (32%), Gaps = 31/240 (12%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD- 70 + L+ + + G T + ++ KG G ++E SK E D Sbjct: 7 DKNALIHRLEGYLGKTFEMIDNKEMFKHVQDFNLQKGIAGSVVEQCIFEYPPDSKQEADL 66 Query: 71 -----FAALGVELKTIPVDSLGRP------LETTFVCVAPLTGNSGVTWETSHVRHKLKR 119 + ELKT + +P E + + + +ETSH KL+ Sbjct: 67 IIVDGIENVKTELKTTGMLIREKPRKHYVAKEPMSITAVGVYDIAEQEFETSHFWEKLEH 126 Query: 120 VLWIPVE-----GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERIT 174 +L I + V +++E LR DWE + + Sbjct: 127 MLIIYYYYAATHAVSAYEYKDFPVVGYEFHEFSDDEIEVLRNDWEHVRTLCAKVVSHHSG 186 Query: 175 ARHGEYLQIRPKAANAKALTEAIGARGERI-------LTLPRGFYLKKNFTSALLARHFL 227 R + K A + E G + PR F LKK S+++A+HF Sbjct: 187 PRDKNW-----KEAVKQEYIEVHGELRRVLSYIDLAPKFPPR-FRLKKPTVSSIIAKHFG 240 >UniRef50_C0EPR0 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPR0_NEIFL Length = 109 Score = 67.0 bits (162), Expect = 5e-10, Method: Composition-based stats. Identities = 28/99 (28%), Positives = 46/99 (46%), Gaps = 10/99 (10%) Query: 25 SGYTLGELAALVGLVTPE---NLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTI 81 +G TLG+L P ++ +KG +G +E +G + + DFA ELKT Sbjct: 14 AGRTLGDLKLEYQTRFPSFTSEMRINKGGVGQFIEKLIGLN-NTNALTDFAD--GELKTN 70 Query: 82 PVDSLGRPLETTFVCVAPL----TGNSGVTWETSHVRHK 116 D+ G PLET F+ ++ +++E S + K Sbjct: 71 KADTGGAPLETMFISQISSNFDQLISNQISFEDSWIYQK 109 >UniRef50_C0EPR1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPR1_NEIFL Length = 128 Score = 63.1 bits (152), Expect = 8e-09, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 45/111 (40%), Gaps = 7/111 (6%) Query: 116 KLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEED--RQLREDWEELMDMIVL---GQV 170 K+K +L++P+ P Q + S NEE + RQL +D+ ++ ++ + Sbjct: 5 KIKNLLYVPICKVGDDP-DQWYIQSAYHVQINEEGELFRQLADDFTQIKSKLLSDINSEN 63 Query: 171 ERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSAL 221 I G ++QIR K + + I FY KK F + Sbjct: 64 GYIHTSSGVFIQIRSKDSKPYHPIFSA-QYDRDISNKNHAFYFKKEFMREV 113 >UniRef50_Q6SFL0 Putative uncharacterized protein n=1 Tax=uncultured marine bacterium 580 RepID=Q6SFL0_9BACT Length = 236 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 35/220 (15%), Positives = 73/220 (33%), Gaps = 42/220 (19%) Query: 39 VTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVA 98 + + R+KG G L+E G + DF ELKT + E+T + + Sbjct: 25 FKTDEIIRNKGKSGQLMEKLCGLKLSNTT-TDFED--GELKTSEL------KESTAITMI 75 Query: 99 -----PLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLL-WSPNEEEDR 152 + +++E S + K++ ++++P+ PL + + Sbjct: 76 TDWVDDIIHEEPISFEVSRLSKKIQHLIFMPLVKPSKDPLCWYFKNCIYINITKGTSLYN 135 Query: 153 QLREDWEELMD-----------------MIVLGQV--------ERITARHGEYLQIRPKA 187 +++D+E + I + + + G+Y+QIR K Sbjct: 136 DIKKDFENICLNSHELIYKKKITQLTDCKITKNKSLYSDKYGDKFLHTISGKYIQIRTKD 195 Query: 188 A--NAKALTEAIGARGERILTLPRGFYLKKNFTSALLARH 225 A + + L FY +F + + Sbjct: 196 AGKTKSKPIFSKLLKRNITLKSRMAFYFLASFKDYVDKNY 235 >UniRef50_C5EHV9 Type II restriction endonuclease n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EHV9_9FIRM Length = 507 Score = 62.4 bits (150), Expect = 1e-08, Method: Composition-based stats. Identities = 46/236 (19%), Positives = 85/236 (36%), Gaps = 39/236 (16%) Query: 13 TEEQLLAQAQQLSG-YTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPEQD 70 T E+++ + Q G TLGE+ T KG G ++E L S E D Sbjct: 9 TREEIIRKFQVFDGEKTLGEVDVRDVFHT---QVPAKGIAGCVIEQSVLSMKQNSTQEAD 65 Query: 71 FAAL-------GVELKTI---PVDSLG---RPLETTFVCVAPLTGNSGVTWETSHVRHKL 117 + L ELK P + G E + + + SH +K+ Sbjct: 66 LSILQADGSYKDTELKVTGVEPSNKQGEKYAAKEPMSLTAVSIGVIEHEKFLESHFYNKI 125 Query: 118 KRVLWIPVEGER-----SIPLAQR----RVGSPLL-WSPNEEEDRQLREDWEELMDMIVL 167 +L++ +R +P A+ +G L + +++E + DW ++ D ++ Sbjct: 126 AHMLFVFYVYDRKSGQKVVPYAEYDRFPVLGYKFLDIADDKDELAKFENDWTQVHDYLLS 185 Query: 168 GQ-----------VERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFY 212 + + + + Y+ I P+ T + + TL +GFY Sbjct: 186 IENREDRYDLYPLLHQKIKKDLFYIDIAPRFKQQPRQTPRFRFKKSYVNTLFQGFY 241 >UniRef50_A0Z3W2 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z3W2_9GAMM Length = 402 Score = 40.8 bits (94), Expect = 0.040, Method: Composition-based stats. Identities = 31/152 (20%), Positives = 54/152 (35%), Gaps = 15/152 (9%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P + ++ A +L L E++ + T +G LE LG A SK D Sbjct: 148 PISAREISDAANKLLEM-LSEVSRQGFIKTMRAGDT---GVGYTLETLLGIEANSKKTPD 203 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 + G+ELK+ G ++T P S + + + + Sbjct: 204 YN--GIELKSSRTSKAGN-KQSTVFSQVPNWKLSNLKGSKDILNKHGR------FNSSKG 254 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELM 162 + L S + + + L++DWEEL Sbjct: 255 RDQLYHEISCTKLNSYDLQLE--LKDDWEELR 284 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_B1KI41 DNA mismatch repair protein mutH n=236 Tax=Gamma... 266 4e-70 UniRef50_D1RIY1 DNA mismatch repair endonuclease MutH n=1 Tax=Le... 261 2e-68 UniRef50_C0N2X3 DNA mismatch repair endonuclease MutH n=1 Tax=Me... 257 2e-67 UniRef50_C6MZP8 DNA mismatch repair protein n=1 Tax=Legionella d... 254 2e-66 UniRef50_A1JPE3 DNA mismatch repair protein mutH n=58 Tax=Proteo... 253 3e-66 UniRef50_A5IBX0 DNA mismatch repair protein MutH n=4 Tax=Legione... 250 2e-65 UniRef50_C2WIM6 Type II restriction endonuclease n=12 Tax=Firmic... 249 4e-65 UniRef50_Q0TSC8 Type II site-specific deoxyribonuclease n=3 Tax=... 215 1e-54 UniRef50_O87348 LlaKR2I restriction enzyme n=2 Tax=Lactococcus l... 213 3e-54 UniRef50_B9E7L8 Type II restriction enzyme n=1 Tax=Macrococcus c... 211 2e-53 UniRef50_A9GFJ0 Methyl-directed mismatch repair n=1 Tax=Sorangiu... 210 2e-53 UniRef50_B8ZN89 Putative restriction enzyme n=1 Tax=Streptococcu... 209 5e-53 UniRef50_C5WEK1 Restriction enzyme n=4 Tax=Bacteria RepID=C5WEK1... 207 2e-52 UniRef50_P16667 Type-2 restriction enzyme Sau3AI n=2 Tax=Staphyl... 207 3e-52 UniRef50_B3DNU4 DNA mismatch repair protein n=4 Tax=Bifidobacter... 206 4e-52 UniRef50_Q89B33 DNA mismatch repair protein mutH n=1 Tax=Buchner... 203 4e-51 UniRef50_B2TKK6 DNA mismatch repair enzyme MutH n=3 Tax=Clostrid... 201 1e-50 UniRef50_C3FCF3 DNA mismatch repair enzyme MutH n=2 Tax=Bacillus... 198 1e-49 UniRef50_C0ETP8 Putative uncharacterized protein n=2 Tax=Clostri... 197 2e-49 UniRef50_C2SK77 DNA mismatch repair enzyme MutH n=1 Tax=Bacillus... 197 2e-49 UniRef50_UPI000196BAC3 hypothetical protein CATMIT_00199 n=1 Tax... 197 3e-49 UniRef50_D2RCS3 DNA mismatch repair enzyme MutH n=1 Tax=Gardnere... 196 4e-49 UniRef50_D2NQK8 DNA mismatch repair protein n=1 Tax=Rothia mucil... 193 5e-48 UniRef50_Q6SFK8 Type II restriction enzyme, putative n=1 Tax=unc... 187 3e-46 UniRef50_D1NWA3 DNA mismatch repair enzyme MutH family protein n... 186 6e-46 UniRef50_Q3IRH4 DNA mismatch repair protein homolog n=1 Tax=Natr... 184 2e-45 UniRef50_Q03GQ2 Type II restriction endonuclease, MutH family n=... 183 4e-45 UniRef50_C8JWG6 Restriction endonuclease n=4 Tax=Bacillales RepI... 182 6e-45 UniRef50_A1A2S7 Putative type II restriction enzyme Sau3AI n=1 T... 181 1e-44 UniRef50_Q30PF3 DNA mismatch repair enzyme MutH n=1 Tax=Sulfurim... 175 7e-43 UniRef50_A6BKR2 Putative uncharacterized protein n=1 Tax=Dorea l... 173 3e-42 UniRef50_Q9ACM4 R.Sth368I endonuclease n=1 Tax=Streptococcus the... 171 2e-41 UniRef50_Q6F1F9 Type II restriction enzyme Sau3AI, GATC site n=1... 168 1e-40 UniRef50_A8RH65 Putative uncharacterized protein n=1 Tax=Clostri... 154 2e-36 UniRef50_C4Z3T8 Putative uncharacterized protein n=2 Tax=Clostri... 138 1e-31 UniRef50_C5EHV9 Type II restriction endonuclease n=1 Tax=Clostri... 132 8e-30 UniRef50_Q6SFL0 Putative uncharacterized protein n=1 Tax=uncultu... 110 4e-23 UniRef50_C0EPR1 Putative uncharacterized protein n=1 Tax=Neisser... 83 5e-15 UniRef50_C0EPR0 Putative uncharacterized protein n=1 Tax=Neisser... 77 4e-13 Sequences not found previously or not previously below threshold: UniRef50_A0Z3W2 Putative uncharacterized protein n=1 Tax=marine ... 46 9e-04 >UniRef50_B1KI41 DNA mismatch repair protein mutH n=236 Tax=Gammaproteobacteria RepID=MUTH_SHEWM Length = 231 Score = 266 bits (680), Expect = 4e-70, Method: Composition-based stats. Identities = 127/218 (58%), Positives = 172/218 (78%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P+T ++L+++A ++G TLG+LA G +TP NLKRDKGW+G L+E LGA AGS+PE D Sbjct: 7 PKTIDELMSRANDMAGLTLGQLADSHGFITPANLKRDKGWVGQLIEHELGALAGSRPEPD 66 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 F LG+ELKTIPVD G+P+ETT+V VAPL G+TWETS V HKL+ VLWIP++G+R Sbjct: 67 FLHLGIELKTIPVDKNGKPIETTYVTVAPLIDIQGLTWETSVVCHKLQTVLWIPIQGDRD 126 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 IP++QR++GSP+LW PNEEE L++DWEE+M+ I LGQV+++TARHGE LQ+RPK AN+ Sbjct: 127 IPVSQRQIGSPILWRPNEEELALLKQDWEEIMEFIALGQVKQLTARHGEVLQLRPKGANS 186 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 +++T++IG G +T PRGFYLK FT ++L+R F I Sbjct: 187 RSVTQSIGPNGSTQMTNPRGFYLKIPFTQSILSRAFGI 224 >UniRef50_D1RIY1 DNA mismatch repair endonuclease MutH n=1 Tax=Legionella longbeachae D-4968 RepID=D1RIY1_LEGLO Length = 230 Score = 261 bits (667), Expect = 2e-68, Method: Composition-based stats. Identities = 96/221 (43%), Positives = 138/221 (62%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 + PP+TE +L+ + ++G + +L+ + L+ P + R KGW+G ++E+ LGA+A +K Sbjct: 8 IPPPQTESELIERCSSIAGLSFAQLSLSLNLLIPIDSNRRKGWVGQVIELALGANASNKS 67 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 DF LG+ELKT+P+ GRP E+TF+ PL W+TS KLKRVLWIP+EG Sbjct: 68 LPDFKNLGIELKTLPIGKSGRPTESTFITSIPLLTVHQQQWKTSQCYSKLKRVLWIPIEG 127 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 + IP QRR+G LWSPNE ++R L DW L I G +E + A+ G+YLQ+RPKA Sbjct: 128 DTDIPYPQRRIGQGFLWSPNEAQERALEADWNYLTLQISTGNLEVLDAKSGDYLQVRPKA 187 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 AN ++L + G +I TLPRGFYL+ +FT + I Sbjct: 188 ANGRSLCYGFDSHGNKIKTLPRGFYLRSSFTEKIFGNPHQI 228 >UniRef50_C0N2X3 DNA mismatch repair endonuclease MutH n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N2X3_9GAMM Length = 223 Score = 257 bits (657), Expect = 2e-67, Method: Composition-based stats. Identities = 112/219 (51%), Positives = 156/219 (71%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 + PP+T+ +L+ + QQL+G TLG++AA +G+V P++L+R KGW+G LLE +LGA AG++ Sbjct: 4 IPPPDTQHELIRRCQQLAGKTLGQVAAELGVVVPDDLRRHKGWVGQLLENYLGADAGNQA 63 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 E DF LG+ELKT+P+++ G+P E+T+VC LT + + W+ S V+ KL VLW+PVE Sbjct: 64 EPDFTRLGIELKTLPLNANGQPKESTYVCTVTLTESGNMQWQQSWVKKKLAHVLWLPVEA 123 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 E + LAQR VG LW P E+++ LR+DWEELMD IVLG+ ITA+ G YLQIRPKA Sbjct: 124 ESATALAQRYVGQGWLWQPTVEQEKILRQDWEELMDRIVLGEQADITAKEGHYLQIRPKA 183 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 AN++ + + G GE L P+GFYL+ FT LLA F Sbjct: 184 ANSRVMAKTRGDEGEATLINPKGFYLRTAFTRELLATQF 222 >UniRef50_C6MZP8 DNA mismatch repair protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZP8_9GAMM Length = 227 Score = 254 bits (648), Expect = 2e-66, Method: Composition-based stats. Identities = 94/213 (44%), Positives = 137/213 (64%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 P+TE +++ + +L G + +LA +GL P N + KGWIG +E+ LGA+A +K D Sbjct: 10 PKTETEVIERCAKLVGLSFAQLALGLGLSIPTNPDQRKGWIGQAIELALGANAYNKSLPD 69 Query: 71 FAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERS 130 F LG+ELKT+PV G+P E+TF+ PL W++S KL+RVLW+P+EG+ Sbjct: 70 FQYLGIELKTLPVAKSGKPTESTFITSIPLLTIHQQDWKSSQCYAKLQRVLWLPIEGDAD 129 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 IP QRR+G LWSPNE ++ L DW L I G +E + A+ G+YLQ+RPKAA+ Sbjct: 130 IPYMQRRIGQGFLWSPNEAQETILEADWNYLTLQISTGYLETLDAKSGDYLQVRPKAAHG 189 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 K+L ++G ++ TLPRGFYL+ +FT+ +LA Sbjct: 190 KSLCYGFDSKGNKVKTLPRGFYLRSSFTAKILA 222 >UniRef50_A1JPE3 DNA mismatch repair protein mutH n=58 Tax=Proteobacteria RepID=MUTH_YERE8 Length = 228 Score = 253 bits (647), Expect = 3e-66, Method: Composition-based stats. Identities = 162/220 (73%), Positives = 190/220 (86%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPE 68 +PP E QL +A+ LSG+TLGELA+ G P +LKR KGW+G+LLE +LGASAGSKPE Sbjct: 9 APPTDEHQLFQRAEALSGFTLGELASKAGWNIPADLKRIKGWVGMLLEFYLGASAGSKPE 68 Query: 69 QDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 128 QDFA +G+ELKTIP+ + G+PLETTFVCVAPLTGNSG+TWE+SHVRHKL RVLW+PVEGE Sbjct: 69 QDFADIGIELKTIPISAQGKPLETTFVCVAPLTGNSGITWESSHVRHKLARVLWVPVEGE 128 Query: 129 RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAA 188 R IPLAQRRVG+PLLWSPNEEE+ LR DWEELMD+IVLG+VE ITARHGE LQ+RPKAA Sbjct: 129 RQIPLAQRRVGAPLLWSPNEEEEELLRRDWEELMDLIVLGKVETITARHGEVLQLRPKAA 188 Query: 189 NAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 N++ALTEAIG G+ I+TLPRGFYLKK FT +LARHFL+ Sbjct: 189 NSRALTEAIGEHGQPIMTLPRGFYLKKTFTGPMLARHFLL 228 >UniRef50_A5IBX0 DNA mismatch repair protein MutH n=4 Tax=Legionella pneumophila RepID=A5IBX0_LEGPC Length = 224 Score = 250 bits (639), Expect = 2e-65, Method: Composition-based stats. Identities = 105/216 (48%), Positives = 140/216 (64%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPE 68 PP+ E++LL Q Q++ G + +LA +GL TP N KGW+G LEI LG A ++ Sbjct: 8 KPPKNEKELLEQCQRIEGLSFAQLALELGLFTPLNPVHRKGWVGQALEIALGTDAKNESL 67 Query: 69 QDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE 128 DF LG+ELKTIP++ G+P E+TF+ PL +WE+S KLKRVLWIP+EG+ Sbjct: 68 PDFKQLGIELKTIPINKSGKPAESTFIVSIPLLTIHQQSWESSQCYAKLKRVLWIPIEGD 127 Query: 129 RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAA 188 IP RR+G LWSPN+++ L EDW L + IVLGQ+E + + GEYLQ+RPKAA Sbjct: 128 TDIPFPHRRIGRGFLWSPNKDQKSVLAEDWNYLTNQIVLGQLETLNSSSGEYLQVRPKAA 187 Query: 189 NAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 N K+L G RI TLPRGFYL+ FTS +L++ Sbjct: 188 NGKSLCYGYDPEGNRIRTLPRGFYLRSCFTSVILSQ 223 >UniRef50_C2WIM6 Type II restriction endonuclease n=12 Tax=Firmicutes RepID=C2WIM6_BACCE Length = 475 Score = 249 bits (637), Expect = 4e-65, Method: Composition-based stats. Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 12/230 (5%) Query: 5 RPLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASA 63 + + T E+L+ +A + G T GE+ L + KG +G ++E + G Sbjct: 9 KEFIMDYTTIEELMKKAHEAEGKTFGEIDTTDRLA----NAKSKGGLGQVIEESFFGYEV 64 Query: 64 GSKPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKR 119 S E DF LG+ELK E + + +ETS K ++ Sbjct: 65 NSNAEADFKHLGIELKVTAFKQNKNGSLSAKERLVLNIINYMEEVHTHFETSSFWKKNEK 124 Query: 120 VLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGE 179 +L + E + + LL++ E + +R+DWE +++ I +G+ ++ Sbjct: 125 LLLMFYEWVPGVDRKDFHITKSLLFTYPEADLEIIRQDWETIVNKIRVGKAHELSEGDTN 184 Query: 180 YLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 YL K +N +L I + R F LK ++ +AL+ R+ L + Sbjct: 185 YLGACTKGSNKNSLRSQPY---SEIPAMQRAFSLKPSYMTALVRRYHLNE 231 Score = 58.4 bits (140), Expect = 2e-07, Method: Composition-based stats. Identities = 43/232 (18%), Positives = 92/232 (39%), Gaps = 28/232 (12%) Query: 14 EEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGS-KPEQDFA 72 EE L ++ + G T E+A + + + K ++ +L+ LG ++FA Sbjct: 248 EEILYSKFENYIGLTDQEIAQKL---SIDYKPTTKSFVPLLVSSLLGIKGTRLDKIEEFA 304 Query: 73 ALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGE---- 128 +E KT+ ++ G+P ++ + +WE S +R + + ++ V E Sbjct: 305 KANIEFKTVRLEPNGKPEQSMSFETIDFHQWTNESWEESEIRERFYQTKFLFVIFEFNQT 364 Query: 129 -RSIPLAQRRVGSPLLWSPNEEE-DRQLREDWEELMDMIVLG-------------QVERI 173 + P + LW+ ++++RE WEE+ +I G + + Sbjct: 365 KKENPNRKLYFKGIKLWNMPAPTIEKEIRELWEEVNKVIHEGIQIEYKKRGDKVVEANNL 424 Query: 174 TARHGE-YLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 + IRPKA N G+ I + ++L ++ + ++ + Sbjct: 425 PKINFNGVAHIRPKARNGADKVTLP--DGQHIT--KQCYWLNNSYIAQIINK 472 >UniRef50_Q0TSC8 Type II site-specific deoxyribonuclease n=3 Tax=Clostridiales RepID=Q0TSC8_CLOP1 Length = 495 Score = 215 bits (547), Expect = 1e-54, Method: Composition-based stats. Identities = 47/230 (20%), Positives = 87/230 (37%), Gaps = 23/230 (10%) Query: 17 LLAQAQQLSGYTLGEL--------------AALVGLVTPENLKRDKGWIGVLLEI-WLGA 61 + AQ+L G T ++ K+ KG +G L+E + Sbjct: 14 IENYAQKLIGKTFNDVLNDYTKYESELLISEEKEEYAENHENKKRKGGLGDLIEECYFHY 73 Query: 62 SAGSKPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKL 117 + DF GVELK P E + + ++E SH+ +K Sbjct: 74 KCNNDSRPDFPDAGVELKVTPYKINKNKTLSAKERLIITMIDYFKVIEESFEDSHLWNKS 133 Query: 118 KRVLWIPVEGERSI-PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITAR 176 + +L I + I ++ L++P EE+ ++ D++ ++D I G+ ++ Sbjct: 134 QLILLIYYLYSKDIGNRLDYKINYAKLFTPPEEDLEIIKNDFKIIVDKIKDGKAHELSEG 193 Query: 177 HGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 YL KA+++ E + IL PR F K ++ + +L Sbjct: 194 DTMYLGAATKASSSSDRREQPFSN---ILAKPRAFSFKASYMTYVLNNFI 240 Score = 60.0 bits (144), Expect = 6e-08, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 65/156 (41%), Gaps = 9/156 (5%) Query: 14 EEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAA 73 EE ++ + +G T GEL + + K +K L LG + E F Sbjct: 263 EEIIIDKINSYAGKTDGELCK----IFDKEYKNNKAQWIDLAYRMLGIKSNKAEE--FEK 316 Query: 74 LGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPL 133 + +K + ++S + +E++ + TWE S + + L + ++ V ++ Sbjct: 317 ANITVKALRIESKDKIVESSPLPTFKFKKLVEETWEESKLFNYLDQQKFLFVVYKKDGDK 376 Query: 134 AQRRVGSPLLWSPNEEEDR-QLREDWEELMDMIVLG 168 + LW+ ++ +RE WE + ++I+ G Sbjct: 377 --YVLKGAQLWNIPYDDLNTTVREGWENIRNVIIDG 410 >UniRef50_O87348 LlaKR2I restriction enzyme n=2 Tax=Lactococcus lactis RepID=O87348_9LACT Length = 496 Score = 213 bits (543), Expect = 3e-54, Method: Composition-based stats. Identities = 39/223 (17%), Positives = 78/223 (34%), Gaps = 10/223 (4%) Query: 15 EQLLAQAQQLSGYTLGELAALV--GLVTPENLKRDKGWIGVLLEIWL-GASAGSKPEQDF 71 EQ++ + Q T + + E + KG G +E + G + DF Sbjct: 29 EQVIEKYQSTPYKTYADFQSKTVSDFKDKEISLKSKGQYGNNIEKYFYGYQPNNDSVADF 88 Query: 72 AALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 +GVELK P E + + + + ++H+ K ++L + G Sbjct: 89 EKIGVELKVTPFKVNKNGTISAKERLVLTILKYMEENLDDFYSTHLWQKCSKILLLFYNG 148 Query: 128 E-RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + + L+ EE+ + +D++ + + I G+ ++ G YL K Sbjct: 149 LLDGQTIEDYVIKKVFLFEWFEEDMEVILDDYKRITEKIKSGKAHELSESDGNYLSTCTK 208 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 A + R + LK ++ + L+ Q Sbjct: 209 GAGKG--RDLREQPFSDEPAKQRAWELKSSYMTYLINNKIFNQ 249 Score = 57.3 bits (137), Expect = 3e-07, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 58/169 (34%), Gaps = 13/169 (7%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPE-----------NLKRDKGWIGVLLEIWLG 60 +T+++ + + + E+ + L + KG L+ LG Sbjct: 249 QTDQESVVAVARGEKKSFTEIISERILEFKGMSEAELYEKFSINPKAKGKNSTLVRTILG 308 Query: 61 ASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNS-GVTWETSHVRHKLKR 119 + Q+F + L+ I VD G P E + + WE SH ++ Sbjct: 309 LTGDLDKTQEFQKANMNLRVIRVDKNGLPKEDSPFKNYNFKELATNDDWEESHPYQEICN 368 Query: 120 VLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 ++ V + P + S W + + +++ W E +I G Sbjct: 369 KRFLFVVFKEVKP-KIFILDSIKFWGFPDRQVEEVQRVWAETRRIITEG 416 >UniRef50_B9E7L8 Type II restriction enzyme n=1 Tax=Macrococcus caseolyticus JCSC5402 RepID=B9E7L8_MACCJ Length = 462 Score = 211 bits (537), Expect = 2e-53, Method: Composition-based stats. Identities = 45/224 (20%), Positives = 88/224 (39%), Gaps = 15/224 (6%) Query: 10 PPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPE 68 ET +L+ A++ G L E+ L KG +G ++E + G S E Sbjct: 8 EYETVNELIMYAKEAEGKYLYEIDKKDMLENTNV----KGSVGHIIEASYFGYEINSNAE 63 Query: 69 QDFAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIP 124 DFA +G+ELK + + + E + + + V +E+S K K +L Sbjct: 64 PDFADVGIELKATGIQKMKKGDLKAKERLVLNIINYCDEATVEFESSSFWKKNKHLLLFF 123 Query: 125 VEGERSI----PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEY 180 E ++ ++ + EE+ + +++DW + + I+ G+ ++ Sbjct: 124 YEYKKDKNNKMDRRNAQIIKVEDFQFPEEDLKTIKQDWNIIHNKIINGEAHLLSEGDTLI 183 Query: 181 LQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 L K + A IL R + +K+ + S L+ + Sbjct: 184 LGACTKGSTA--AKSLRKQPNSDILAKQRAYSIKQGYMSNLVRK 225 Score = 58.1 bits (139), Expect = 2e-07, Method: Composition-based stats. Identities = 34/221 (15%), Positives = 74/221 (33%), Gaps = 21/221 (9%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD-FAA 73 + L + + G + E+A + + + K I L+ LG + + F Sbjct: 248 QILHNKFKNYYGKSDVEIAEQLNITLSDG----KNRIAHLISNMLGIKKNDLSKIEEFEK 303 Query: 74 LGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPL 133 ++ KTI +++ + E + W+ SH++ + W+ V + Sbjct: 304 YSIKFKTITLNNEDKLKEHMSFENIDFDEYATSEWDDSHLKTMFESTKWLFVVF-KEDDK 362 Query: 134 AQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVL-----------GQVERITARHGEYLQ 182 + LW+ + +++ I+ + A+ Sbjct: 363 HNKIFLGYKLWNMPMTDIEGPLQEFFIKTQRILNEGVILSQTSRGIRNNLPGAKDNPICH 422 Query: 183 IRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 IRPKA NA+ + + + F+L K + +L Sbjct: 423 IRPKARNAEDKVKLPDGQCITK----QCFWLNKGYIKNILQ 459 >UniRef50_A9GFJ0 Methyl-directed mismatch repair n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9GFJ0_SORC5 Length = 227 Score = 210 bits (536), Expect = 2e-53, Method: Composition-based stats. Identities = 102/216 (47%), Positives = 134/216 (62%) Query: 8 LSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKP 67 L PP E +L +A L+G TL +LA +G P R KG G L+E LGA+AGS Sbjct: 5 LKPPRDEAELRCRAAALAGLTLADLARALGCAVPRGGVRTKGAAGALVERALGATAGSTS 64 Query: 68 EQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG 127 DF LG+ELKTIPV+ G P E+TFVC L+ WETS R KL VLW+P+ Sbjct: 65 RPDFPHLGIELKTIPVEVTGAPQESTFVCSISLSDADRAEWETSRARAKLACVLWVPIIA 124 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 P +RR+G+P LW P + ++R LR D+E+LM +I +G VE +TA G +LQ+RPKA Sbjct: 125 APGRPSEERRLGAPRLWRPTDAQERVLRADFEDLMGLIGIGAVEALTAHAGRWLQVRPKA 184 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 A+ ++ T A G GE I LPRGFYL+ +FT A+L Sbjct: 185 ASGRSRTLAFGRDGEMIAALPRGFYLRPSFTGAILG 220 >UniRef50_B8ZN89 Putative restriction enzyme n=1 Tax=Streptococcus pneumoniae ATCC 700669 RepID=B8ZN89_STRPJ Length = 496 Score = 209 bits (533), Expect = 5e-53, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 80/222 (36%), Gaps = 10/222 (4%) Query: 16 QLLAQAQQLSGYTLGELAALVGLVTPEN--LKRDKGWIGVLLEIWL-GASAGSKPEQDFA 72 ++ + + + + V + + KG G +E + G S E DF Sbjct: 30 DVMEEYNRSLYKSYDDYNDRVVSEVQDKAISMKSKGQYGNYIEKYFYGYQPNSDSEADFE 89 Query: 73 ALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEG- 127 +GVELK P E + + + + ++H+ K ++L + G Sbjct: 90 KIGVELKVTPFKINKNGTLSAKERLVLTILNYMEENLEDFYSTHLWKKCAKILLLFYNGL 149 Query: 128 ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKA 187 + + + L+ EE+ + ED++++ D I G+ ++ G YL K Sbjct: 150 IPNQTMKDYVIEKIFLYEWFEEDMAVILEDYQKITDKIKNGKAHELSESDGNYLSTCTKG 209 Query: 188 ANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 A + L R + LK ++ + L+ Q Sbjct: 210 AGKG--KDLRQQPFSHELAKQRAWELKSSYMTYLINHKIFNQ 249 Score = 57.3 bits (137), Expect = 4e-07, Method: Composition-based stats. Identities = 27/143 (18%), Positives = 54/143 (37%), Gaps = 2/143 (1%) Query: 27 YTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIPVDSL 86 + + E + KG L+ LG + ++F + L+ I VD Sbjct: 275 LSYKGFSEQELYDRFEVNSKAKGKNSTLIRKILGLTGDLDKTKEFQKANMNLRVIRVDKN 334 Query: 87 GRPLE-TTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWS 145 P E + F + +WE+SHV +++ ++ V + I + S W Sbjct: 335 NLPKEDSPFKTYCFKELAATDSWESSHVYNEIYNKRFLFVIF-KEIEPKLFVLDSIKFWG 393 Query: 146 PNEEEDRQLREDWEELMDMIVLG 168 + + +++ W+E +I G Sbjct: 394 FQDRQLEEIQRVWQETRQIISDG 416 >UniRef50_C5WEK1 Restriction enzyme n=4 Tax=Bacteria RepID=C5WEK1_STRDG Length = 496 Score = 207 bits (527), Expect = 2e-52, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 80/238 (33%), Gaps = 25/238 (10%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLV-----------------TPENLKRDKGWIGVLLEI 57 E ++ +Q L ++ L E + KG G +E Sbjct: 14 ESVMNYSQNLLNKPFSQIVEEYTLSPYKTRQDFQEKVISEIIDKEISMKSKGQYGNYIEK 73 Query: 58 WL-GASAGSKPEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSH 112 + G S DF +GVELK P E + + + + +H Sbjct: 74 YFYGYQPNSDSSADFDKIGVELKVTPFKVNKNGSISAKERLVLTILNYMEENLDDFYQTH 133 Query: 113 VRHKLKRVLWIPVEG-ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVE 171 + K ++L + G + LA + L+ EE+ + ED++ + D I G+ Sbjct: 134 LWEKCSKILLLFYNGLIPNQTLADYIIEKVFLYEWFEEDMELILEDYKRITDKIKQGRAH 193 Query: 172 RITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 ++ G YL K A + L R + LK ++ + L+ Q Sbjct: 194 ELSESDGNYLSTCTKGAGKG--KDFRSQPFSDELAKQRAWELKSSYMTYLINHKIFSQ 249 Score = 47.3 bits (111), Expect = 4e-04, Method: Composition-based stats. Identities = 31/213 (14%), Positives = 66/213 (30%), Gaps = 18/213 (8%) Query: 11 PETEEQLLAQAQQLSGYTLGEL-----------AALVGLVTPENLKRDKGWIGVLLEIWL 59 ++E++ + + + E+ + + K L+ L Sbjct: 248 SQSEQESVLATVREDKKSFTEIVADKILAYKGFSEEELYNRFDINPNAKSKNSTLVRKIL 307 Query: 60 GASAGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTG-NSGVTWETSHVRHKLK 118 G + Q+F + L+ I V+ G P E + + WE SH ++ Sbjct: 308 GLTGDIDKTQEFQKANMNLRVIRVNKNGLPKEDSPFKNYKFQELANTEEWEESHPYLEIF 367 Query: 119 RVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQ--VERITAR 176 ++ V + P + S W + +++ W+E +I G + Sbjct: 368 SKKFLFVVFKEVEPKL-FVLDSIKFWGFPDRLLNEVQRVWKETQIIIQTGVELSQNGNKV 426 Query: 177 HGEYLQIRPKAANAKALTEAIGARGERILTLPR 209 + Q R AN T+ + + Sbjct: 427 STNFPQSR---ANKILFTKIHAQNTLYEIEPNK 456 >UniRef50_P16667 Type-2 restriction enzyme Sau3AI n=2 Tax=Staphylococcus RepID=T2S3_STAAU Length = 489 Score = 207 bits (526), Expect = 3e-52, Method: Composition-based stats. Identities = 49/223 (21%), Positives = 89/223 (39%), Gaps = 15/223 (6%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD 70 T++ + +A++ G ++ EL E++K+ K +G E W G S + D Sbjct: 4 YLTKQAVHNRAKEAVGKSVLELNG------GESIKQSKSSVGDAFENWFGKKKDSDSKPD 57 Query: 71 FAALGVELKTIPVDSLGRPL----ETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVE 126 A GVELK P L E + + + +ETS K + E Sbjct: 58 MAEAGVELKATPFKKLKNGKYSSKERLVLNIINYEKVANENFETSSFLSKNNTIELAFYE 117 Query: 127 GERSIPLAQRRVGSPLLWSPNEE--EDRQLREDWEELMDMIVLGQVERITARHGEYLQIR 184 + P + +L+ ++ + +++DWE + I G+ ++ YL Sbjct: 118 YIKGTPSDNWIIKEAVLYEMHKNPIDYEIIKQDWEIINQYINEGKAHELSEGLTSYLAPC 177 Query: 185 PKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 K ANA +L + I R F LK + +++L ++ L Sbjct: 178 TKGANASSLRNQPYSD---IKAKQRAFSLKSGYMTSILRKYVL 217 >UniRef50_B3DNU4 DNA mismatch repair protein n=4 Tax=Bifidobacterium RepID=B3DNU4_BIFLD Length = 502 Score = 206 bits (525), Expect = 4e-52, Method: Composition-based stats. Identities = 48/243 (19%), Positives = 89/243 (36%), Gaps = 23/243 (9%) Query: 1 MSQPRPLL---SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI 57 MS P + E+++ G + EL +T +KG +G ++E Sbjct: 10 MSSSEPEFNDAREYSSLEEVMNTLLTAKGKSFRELDQTGRALTGG----NKGSLGQIIEE 65 Query: 58 -WLGASAGSKPEQDF--AALGVELKTIPVDSLGRPL-----ETTFVCVAPLTG-NSGVTW 108 L + S D ELK P+ + + E + + + Sbjct: 66 SVLKYAINSDAAPDIHIGDTSYELKVTPLKHIKKGKQTSAKERLVIDIINYLTLADETDF 125 Query: 109 ETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSP----LLWSPNEEEDRQLREDWEELMDM 164 E+S + K K ++ + +R+ + R+ L ++ ++ DW + D Sbjct: 126 ESSKMWDKAKNIILVYYYDDRTDKKKELRIDCKVLASYLMKYEADDLATIKNDWHVIRDK 185 Query: 165 IVLGQVERITARHGEYLQIRPKAANAKALTEAI---GARGERILTLPRGFYLKKNFTSAL 221 + G + ++ YL K AN+K L EA GA I R F LK ++ +A+ Sbjct: 186 VASGHADSLSESDTNYLAACTKGANSKQLREAPAPAGANTATIFAKQRAFSLKTSYMTAI 245 Query: 222 LAR 224 + Sbjct: 246 ARK 248 Score = 43.8 bits (102), Expect = 0.005, Method: Composition-based stats. Identities = 39/247 (15%), Positives = 91/247 (36%), Gaps = 30/247 (12%) Query: 6 PLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGS 65 P+ +E + A+ +G + ++A+ + + K + + LGAS S Sbjct: 260 PIPQEQNLDEYVAAKFIPYTGKSSRDIASELQVSAAPTAKNYNSSLAFAM---LGASKSS 316 Query: 66 KPEQD-FAALGV-ELKTIPVDSLGRPLETTFVCVAPLTG-----NSGVTWETSHVRH--K 116 + + F+ + + K++ + G P E N TWE S VR + Sbjct: 317 ISKIEQFSKANISQFKSVTIYPDGLPREHMSFKAITDDQWEEWANPQTTWEQSFVRDFFE 376 Query: 117 LKRVLWI----PVEGERSIPLAQRRVGSPLLWSPNEEEDRQ-LREDWEELMDMIVL---- 167 + L + P+ + A+ LW+ E++ Q ++ WE + ++V Sbjct: 377 TSKFLIMVSKSPIPYQSGHDKAKDIFKGAFLWNMPEDDIEQYVKPVWETMHTLLVAHTPL 436 Query: 168 ---GQVERITARHG----EYLQI-RPKAANAKALTEAIGARGERILTLPRGFYLKKNFTS 219 + + + +L+ K + + + ++T F+L + + + Sbjct: 437 NYGIRGKNLIPGSSFNSVFHLRPHASKGKDNGSAKDRSILPNGEVITKQ-CFWLDRRYIA 495 Query: 220 ALLARHF 226 ++A + Sbjct: 496 RIIASNL 502 >UniRef50_Q89B33 DNA mismatch repair protein mutH n=1 Tax=Buchnera aphidicola (Baizongia pistaciae) RepID=MUTH_BUCBP Length = 220 Score = 203 bits (516), Expect = 4e-51, Method: Composition-based stats. Identities = 82/213 (38%), Positives = 132/213 (61%), Gaps = 2/213 (0%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASA-GSKPEQDF 71 E++L A LSGY++ E+ + + +L R+KG++G +LE+ LG + DF Sbjct: 5 CEKKLFMHAIGLSGYSIREIVSSLDQPVSNSLVRNKGFVGKILELILGVNVLHGYKCIDF 64 Query: 72 AALGVELKTIPVDSLGRPLETTFVCVAPLTGNS-GVTWETSHVRHKLKRVLWIPVEGERS 130 +LG+ELK+IP++S G PLE TF+C PL NS +TW S+ K+K++LWIP+ G R Sbjct: 65 PSLGIELKSIPINSSGYPLEPTFICNIPLKNNSLNITWNNSYFYRKIKKILWIPIIGNRV 124 Query: 131 IPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANA 190 + + VG +W+ + +++ L++DWEE MD+I++G+VE I+++HG+ LQ++ K N Sbjct: 125 VSFLDKIVGEAFIWTMSSVQEKILKKDWEEFMDLIIIGKVEYISSKHGQVLQVKKKCKNK 184 Query: 191 KALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 + I G T PR FY +K+FT +LL Sbjct: 185 HVCIKFINYNGCVKFTNPRAFYFRKSFTWSLLN 217 >UniRef50_B2TKK6 DNA mismatch repair enzyme MutH n=3 Tax=Clostridium RepID=B2TKK6_CLOBB Length = 476 Score = 201 bits (512), Expect = 1e-50, Method: Composition-based stats. Identities = 48/227 (21%), Positives = 91/227 (40%), Gaps = 18/227 (7%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGV-LLEIWLGASAGSKPEQDF 71 +E + A++L +L +L N K+ KG +G + E + G S+ E DF Sbjct: 9 NKESIENYARRLLNKSLKDLDENNKEHK-FNNKKSKGRLGQTVEEEYFGYKVNSRQEADF 67 Query: 72 AALGVELKTIPVDSLGRP--------------LETTFVCVAPLTGNSGVTWETSHVRHKL 117 +GVELK P+ ++ E + + + TWE + + K Sbjct: 68 NEVGVELKVCPIRTIKAKPKSNSIREQIGYSAKERIVLSIIDYFKLNEETWENNSIMKKC 127 Query: 118 KRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARH 177 K +L + E++IP + LW+P+E++ + + DW + + G+ I+ Sbjct: 128 KELLLMFYMNEKNIPKEELVFKIISLWTPSEQDLKIIENDWNTIALKVKKGKAHEISEGD 187 Query: 178 GEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 YL K + A+ G ++ R F K+ + ++ Sbjct: 188 TMYLGACTKGSTAE--KSKRGQPNSDVMAPQRAFSYKRQYVDYIIEE 232 Score = 77.3 bits (189), Expect = 3e-13, Method: Composition-based stats. Identities = 43/195 (22%), Positives = 74/195 (37%), Gaps = 13/195 (6%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKR-DKGWIGVLLEIWLGASAGSKPEQDFAA 73 E++ ++ ++ E+ +L + K + + + + G + E F Sbjct: 257 EKIYNVFNRILNKSVSEIISLYSIDRERKAKNYIRLIVDDVCKKVFGDKLENFEE--FKK 314 Query: 74 LGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHK--LKRVLWIP------V 125 +ELKTI + G P E+ WETS +R K K+ LWI Sbjct: 315 ANIELKTIFLKLNGMPKESMSFEQIDYCEIVNEEWETSTIRDKFENKKHLWIIFKSKVNF 374 Query: 126 EGERSIPLAQRRVGSPLLWSPNEEED-RQLREDWEELMDMIVLG-QVERITARHGEYLQI 183 E + + L + + + W+ + + + W + D I G + I GE I Sbjct: 375 EKQSELSLNDIVLDNVMFWNMPISDLDSSMYKVWLDTTDKIKQGVYNDFIKISDGEIAHI 434 Query: 184 RPKAANAKALTEAIG 198 RPK N+K LT Sbjct: 435 RPKGQNSKDLTSTPQ 449 >UniRef50_C3FCF3 DNA mismatch repair enzyme MutH n=2 Tax=Bacillus thuringiensis RepID=C3FCF3_BACTU Length = 453 Score = 198 bits (503), Expect = 1e-49, Method: Composition-based stats. Identities = 49/220 (22%), Positives = 85/220 (38%), Gaps = 13/220 (5%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD 70 +TEEQLLA + + G + E+ L DKG +G ++E + + PE D Sbjct: 8 KTEEQLLAFTEGIIGKSFKEIDTKGLLQ---GNSNDKGILGKVVETGFYDYELNNCPEAD 64 Query: 71 FAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPVE 126 F LG+ELK + L E + + +E S V K K++L I E Sbjct: 65 FGELGIELKVSGFNKLRNGSWSAKERISLSMINYKNIIHEEYEFSKVVSKNKKLLIIWYE 124 Query: 127 GERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRPK 186 + +P + L+ + +++ +R D+ + ++ G +++ L K Sbjct: 125 YMKGVPYGDFIIRDFQLYDMS-QDEAIIRNDFYTIKQKVLEGLAHKLSEGDTVILGAATK 183 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 + I R F LK +F +L H Sbjct: 184 GQKG----QKAEQPNSEISAPTRAFSLKNSFFRGVLRSHV 219 >UniRef50_C0ETP8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C0ETP8_9FIRM Length = 507 Score = 197 bits (502), Expect = 2e-49, Method: Composition-based stats. Identities = 44/231 (19%), Positives = 78/231 (33%), Gaps = 13/231 (5%) Query: 9 SPPETEEQLLAQAQQLSGYTLGEL---AALVGLVTPENLKRDKGWIGVLLEI-WLGASAG 64 + E++ +A++ G E+ A K KG IG E W + Sbjct: 9 KEYKNSEEVKKRAEEAIGRPFKEIFELAKKFQEKNGLKEKHGKGDIGQAYEEGWFNYACN 68 Query: 65 SKPEQDFAALGVELKTIPVDSLGRP---LETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 E DF +ELK P R E + + +E S K +L Sbjct: 69 KDAEPDFKEADIELKVTPFLKNSRGYRAKERLSLGKINYKDENWNEYEESRFWIKNHHLL 128 Query: 122 WIPVEGERSIPLAQR---RVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHG 178 + + + I ++ LL E + ++ DWE++ + G+ +T R Sbjct: 129 IMYYQYIKGIDRENFSVQKIDEILLNELPERDKSIIQHDWEKIARYVKEGRAHELTERDF 188 Query: 179 EYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 YL K + + + PR + KK++ + L L + Sbjct: 189 MYLSPATKGSGGN---KKVKYDDRYPKAKPRAYSFKKSYMTKLFNERMLDE 236 Score = 48.0 bits (113), Expect = 3e-04, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 63/166 (37%), Gaps = 13/166 (7%) Query: 7 LLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSK 66 L E ++ LL Q+ G T+ L + K +++ + + Sbjct: 248 LSKEKEFDDILLETLQKYLGRTVKSLKKE--FPDYRDGYSHKHG---IIKQIYKSDCDLE 302 Query: 67 PEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGV-TWETSHVRHKL---KRVLW 122 +F +L+TI V+ G PLE TW+ S V ++ K +L Sbjct: 303 KTDEFQKANYKLRTITVNKNGMPLEDMSFSTFDFEELLKEKTWKESIVYDEMIDSKFLLV 362 Query: 123 IPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 I + + ++W ++++ +++E WEE ++I G Sbjct: 363 IFSKNSEGKEILNN----AMMWYIPKKDENKVKEVWEETRNVIKNG 404 >UniRef50_C2SK77 DNA mismatch repair enzyme MutH n=1 Tax=Bacillus cereus BDRD-ST196 RepID=C2SK77_BACCE Length = 461 Score = 197 bits (501), Expect = 2e-49, Method: Composition-based stats. Identities = 46/223 (20%), Positives = 90/223 (40%), Gaps = 9/223 (4%) Query: 7 LLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGS 65 + S TE+Q+ A++L G LG+L L K+DKG IG +++ + G A S Sbjct: 4 IYSESWTEQQIFDVAEELVGKRLGDLDKSGWLKK----KKDKGNIGNMIQSDFFGIPANS 59 Query: 66 KPEQDFAALGVELKTIPVDSLGR----PLETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 DF +ELK P+ E + + + + +E S V K + +L Sbjct: 60 IKGADFEHHHIELKVTPILKKKTVGYSSKERLVLGMINYMEDYRIPFEESIVNKKAQNML 119 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 + E + P+ + ++ + + ++ Q+R D++ ++D I G+ I+ + + + Sbjct: 120 LVFYLHEENKPVEEFKIIKTARFQLPKSDESQVRLDYQTIVDNIQQGKAHEISEKQQKIM 179 Query: 182 QIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 K + + R + K + SA Sbjct: 180 GACTKGQGKGKGKDGVDQPRSSEKAKSRAYSYKVGYMSAYFRS 222 Score = 61.1 bits (147), Expect = 2e-08, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 72/228 (31%), Gaps = 34/228 (14%) Query: 17 LLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGS-KPEQDFAALG 75 + + G T E+ + K I L+ G + ++F G Sbjct: 244 VTETLDKYVGKTDEEIQLELQKEVSG-----KSSIFNLIGFMFGTKGDNLNHTEEFLKEG 298 Query: 76 VELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVR---HKLKRVLWIPVEGERSIP 132 +KT+ + + T + +E S ++K +L + E E Sbjct: 299 YAIKTVRNRLNSKDNQDMSFSNIDYTEIANDEFEESTWYGWFAEIKYILTVWDEYEEGKN 358 Query: 133 LAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG----QVERITARHGEY-------- 180 R +W P++E Q E + ++ +M+ +++ +HG + Sbjct: 359 ----RFKDYTIWIPDDELIEQASELFYQIKNMLNTNSVRVEIDETVGKHGRWSDNLPGGK 414 Query: 181 -----LQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLA 223 QIRPK + + + Y+ K + ++ Sbjct: 415 MDYPPFQIRPKGSGESVFVTL----THGLKIKKKALYINKEYIRKIVG 458 >UniRef50_UPI000196BAC3 hypothetical protein CATMIT_00199 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196BAC3 Length = 499 Score = 197 bits (501), Expect = 3e-49, Method: Composition-based stats. Identities = 61/228 (26%), Positives = 90/228 (39%), Gaps = 17/228 (7%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGAS-AGSKPEQ 69 ++ E+LL AQ G E L +KG IG ++E L S+ E Sbjct: 9 YQSIEELLETAQSSLGVPFKEYDVNNRL-----NGGNKGKIGQVIEEGLFHKGIDSRQEA 63 Query: 70 DFAALGVELKTIPVDSLGR------PLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWI 123 DF LGVELK P++ +G+ E V + + V++E SHV KL +VL + Sbjct: 64 DFPELGVELKVTPMNRVGKEKDMVSAKERLVVTMINYMKDYNVSFEDSHVLDKLNKVLLM 123 Query: 124 PVEGERSIPLAQRRVGSPLLWSP---NEEEDRQLREDWEELMDMIVLGQVERITARHGEY 180 E V L+ EE+ + + D+ ++D I G E I+ Y Sbjct: 124 FYFNEPEKMNYDLCVDKIFLYQFKDIPEEDKQIIINDYNIIIDKIKSGHAEDISESDTYY 183 Query: 181 LQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLI 228 L K A A T + + R F LK + +ALL Sbjct: 184 LSACTKGATA--ATSFVDQPFSSVKAKRRAFSLKPKYMTALLRSQVFN 229 Score = 49.2 bits (116), Expect = 1e-04, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 47/162 (29%), Gaps = 19/162 (11%) Query: 26 GYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQ---DFAALGVEL-KTI 81 G +L E+ ++ L ++ + PE D E K Sbjct: 264 GKSLSEIDGMIETPVNRKSL-------NYLRSYIARMMNASPETKSKDIDEYYDEFYKAN 316 Query: 82 PVDS------LGRPLETTFVCVAPLTGNSGVTWETSHVRHK-LKRVLWIPVEGERSIPLA 134 G E+ WE S +R+K + V E Sbjct: 317 IKIKTIRIKKNGTIKESMSFPAFKFKDIVEENWEDSELRNKFINEKYLFCVFDEIDDSKY 376 Query: 135 QRRVGSPLLWSPNEEEDR-QLREDWEELMDMIVLGQVERITA 175 + R LW+ E + ++RE WE + + G I+ Sbjct: 377 EYRFRGAFLWAMPESDLDGKVREAWERTVYLAKHGIDFTISE 418 >UniRef50_D2RCS3 DNA mismatch repair enzyme MutH n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RCS3_GARVA Length = 482 Score = 196 bits (499), Expect = 4e-49, Method: Composition-based stats. Identities = 44/230 (19%), Positives = 83/230 (36%), Gaps = 19/230 (8%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQDF 71 +++ A ++G T E+ +KG +G ++E G S S + D Sbjct: 13 DIDEITALLTGINGKTFREIDKTGR----GEKAGNKGSLGNIIEESVFGYSINSDKQPDI 68 Query: 72 --AALGVELKTIPVDSLGRPL----ETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIPV 125 A ELK P+ S + E V + ++ S K ++++ I Sbjct: 69 MVADEFYELKVTPIRSNKKKKIVAKERLVVDIINYLTLCNEVFDNSSFWQKARQMIIIYY 128 Query: 126 EGERSIPLAQRRVGSP----LLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 R+ Q R+ + E +R+DWE + + + G + ++ YL Sbjct: 129 LDNRTDKTNQSRLDCTIYGSFVLKYEPGELETIRKDWEYIKNKVAAGHADALSESDTNYL 188 Query: 182 QIRPKA-ANAKALTEAIGARG---ERILTLPRGFYLKKNFTSALLARHFL 227 K + AK+ +A G + I R F K ++ + + R Sbjct: 189 AACTKGESAAKSFRDAPAPSGFESKYIRAKQRAFSYKPSYMTIVAQRVLG 238 Score = 40.7 bits (94), Expect = 0.042, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 58/170 (34%), Gaps = 14/170 (8%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQD-FAA 73 E L Q+ G +G + + LK + L LG E + FAA Sbjct: 252 ENLNDYVQKKIGRYVGYKISDLANKFNVELKTHYSFNSQLALKMLGVKKNRIEEIEQFAA 311 Query: 74 LGV-ELKTIPVDSLGRPLETTFVCVAPLTG-----NSGVTWETSHVRHKLKRVLWIPVEG 127 V +LKT + G+P + + + W S + + + V Sbjct: 312 ASVTQLKTTVLYDDGKPEQNMSFPALKEDDWAELADPNMQWHDSRLYKFFENNKFYFVVF 371 Query: 128 ERSIP------LAQRRVGSPLLWSPNEEEDRQ-LREDWEELMDMIVLGQV 170 + + + LW+ EE+ + + W ++ ++++ G+ Sbjct: 372 KSNGKNRKSTCYKDDVLVGGFLWNMPEEDIEKYVAPAWNKVHEIMISGES 421 >UniRef50_D2NQK8 DNA mismatch repair protein n=1 Tax=Rothia mucilaginosa DY-18 RepID=D2NQK8_9MICC Length = 499 Score = 193 bits (490), Expect = 5e-48, Method: Composition-based stats. Identities = 45/227 (19%), Positives = 81/227 (35%), Gaps = 13/227 (5%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKR----DKGWIGVLLEIW-LGASAGSK 66 T +L + Q T G + K+ KG G L+E G S Sbjct: 25 RTLHNILEEFQNSEYKTYE--DKKKGTPSTTARKKISELSKGIYGSLMEKAGYGIEPNSS 82 Query: 67 PEQDFAALGVELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLW 122 PE D A VE+KT P E + + + + +H+ HK + +L Sbjct: 83 PEADIPAAKVEIKTTPYRVTANGAISAKERLVLSMFNFHEENLDDFYQTHLWHKCQNILL 142 Query: 123 IPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQ 182 + + +++ ++ L+ EE+ + ED++ + ++ G+ ++ G YL Sbjct: 143 LFYKYQKTRDISNNITDKFFLFDWPEEDMPTILEDYKRITQKVLEGRAHELSESDGMYLS 202 Query: 183 IRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFLIQ 229 K A L R + LK ++ + LL Q Sbjct: 203 TCRKGAGKDKDRTT--QPYGPELANRRAWSLKSSYMTTLLRTKVFSQ 247 Score = 63.8 bits (154), Expect = 4e-09, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 61/170 (35%), Gaps = 13/170 (7%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKR-----------DKGWIGVLLEIWL 59 + E++ +A+A Q + ++ L ++ KG L+ L Sbjct: 246 SQEEQESIARAAQDTSKPFTQIIEEKLLQYRGQSEKKLCKEFDVNFNAKGRNSTLVRKIL 305 Query: 60 GASAGSKPEQDFAALGVELKTIPVDSLGRPLE-TTFVCVAPLTGNSGVTWETSHVRHKLK 118 G S +F + ++ I VD G P E + F + + WE SH+ ++ Sbjct: 306 GLSTDIDSTAEFKKANMNIRAIRVDKNGLPKEDSPFKNYRFIELANEKEWEESHIYTEIF 365 Query: 119 RVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 ++ V R + W E + +L W+E ++I G Sbjct: 366 SRKFMFVVF-RENESGDFCLDQIKFWGFPEPLEPELMRVWQETKEIINRG 414 >UniRef50_Q6SFK8 Type II restriction enzyme, putative n=1 Tax=uncultured marine bacterium 580 RepID=Q6SFK8_9BACT Length = 235 Score = 187 bits (474), Expect = 3e-46, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 81/224 (36%), Gaps = 15/224 (6%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPEN---LKRDKGWIGVLLE-IWLGASAGSKPE 68 + + A++L+G TL ++ + + E +KG G +E + G S E Sbjct: 7 DKNSVYNFAKKLTGKTLRDVLSPQIIQESEKLFINNGNKGRFGHKIEKHYFGYDINSDKE 66 Query: 69 QDFAALGVELKTIPVDS----LGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIP 124 DF+ +ELK P+ P E + TW TS + K+ +L I Sbjct: 67 ADFSLCKLELKVTPMKLKNDGSLTPKERLVNNIINFDEIVKETWHTSSLLKKINSILLIR 126 Query: 125 VEGERSIPLAQRRVG----SPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEY 180 + + Q + S N E+ Q +DW +++ I G+ ++ Y Sbjct: 127 YLDPMNKEINQLDYKILDVQIINLSSNAEDFNQFEKDWNLIVNKIKDGKAHELSESDTMY 186 Query: 181 LQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLAR 224 L K N ++ + R F K + LL R Sbjct: 187 LGACTKGENKESQRSQPY---SSEKAMQRAFSFKTQYMKILLDR 227 >UniRef50_D1NWA3 DNA mismatch repair enzyme MutH family protein n=1 Tax=Bifidobacterium gallicum DSM 20093 RepID=D1NWA3_9BIFI Length = 485 Score = 186 bits (472), Expect = 6e-46, Method: Composition-based stats. Identities = 43/227 (18%), Positives = 80/227 (35%), Gaps = 23/227 (10%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD- 70 T + + G TLGEL L +P +KG G ++E LG A S D Sbjct: 8 TRTDITRRLNACIGKTLGELDTKHVLDSPT----NKGVAGHVIEKSVLGYPADSARRPDL 63 Query: 71 -FAALGVELKTIPVDSLGR------PLETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWI 123 L VELKT + + + E + +E+S HK++ +L++ Sbjct: 64 NIDGLDVELKTTGIRASKKEGISFEAKEPMSITALKPDEIIDEEFESSAFWHKVQNMLFV 123 Query: 124 PVEGE----RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGE 179 E + + V + + E+ QL+ DW + + + E + E Sbjct: 124 CYVYEGKPKKGRDYSNFPVHTAFFKTFEGEDKEQLQRDWTTIREFVRDLYEENPDSPAFE 183 Query: 180 YLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHF 226 K ++ + + P ++ F ++++ H Sbjct: 184 ------KISSGISKEVSPYLDTAPKWPHPPRIRFRRAFVNSIINEHL 224 Score = 41.1 bits (95), Expect = 0.028, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 64/206 (31%), Gaps = 24/206 (11%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLG---ASAGSKPEQDF 71 +L + +G T+ ++AA + P+ + K ++ G + F Sbjct: 246 HKLQELTNKFAGKTIAQIAAYFDMEIPDKVP--KQLAEQIVVRMFGGTSRKMHNIAL--F 301 Query: 72 AALGVELKTIPVDSLGRPLETTFVCVAPLTG-NSGVTWETSHVR---HKLKRVLWIPVEG 127 +G+ KT+ ++ G P E T + S +E S + + E Sbjct: 302 QKMGIVGKTVALNPQGMPTEDTKLMSIDFNELESNDVYEESDFYYFFKDNDLLYMVFQEP 361 Query: 128 ERSIPLAQRRVGSPLL------WSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 E I G L P E++ + W+ + D+I+ ++ + Sbjct: 362 EVGISRG----GEVFLGFARVPLEPLAEKNGLVESVWDIMRDLIISRKLRLVPETDK--- 414 Query: 182 QIRPKAANAKALTEAIGARGERILTL 207 P EA L Sbjct: 415 SGAPILNKNGVPREAPNWPKSTESAL 440 >UniRef50_Q3IRH4 DNA mismatch repair protein homolog n=1 Tax=Natronomonas pharaonis DSM 2160 RepID=Q3IRH4_NATPD Length = 285 Score = 184 bits (467), Expect = 2e-45, Method: Composition-based stats. Identities = 47/224 (20%), Positives = 83/224 (37%), Gaps = 23/224 (10%) Query: 11 PETEEQLLAQA-QQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPE 68 P+ E++ + +A L G + GE+ + + + R K + ++E + S E Sbjct: 8 PDAEQEDILRATNALLGRSFGEIDEEIQGMADDERARSKHGVANVIESGYFNIETNSSAE 67 Query: 69 QDFAALGVELKTIPVDSLGR-----PLETTFVCVAPLTG-NSGVTWET-SHVRHKLKRVL 121 DF G+ELK P+ G P E +C+ W +R KL++VL Sbjct: 68 PDFGKAGIELKVTPLRLTGNNELVRPKERLVLCMCDYMEVVDADHWTDVPALRKKLEQVL 127 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEED--RQLREDWEELMDMIVLGQVERITARHGE 179 I + LW P E + ++ D+E I+ G+V + +H + Sbjct: 128 IIWYIHIVGQDRRTYPIVWWTLWEPMTEPEWSEIIQSDFEICKKRILNGEVP--SEKHTQ 185 Query: 180 YLQIRPK----------AANAKALTEAIGARGERILTLPRGFYL 213 L PK + ++ A A RG+ + Sbjct: 186 LLGTCPKHGGGYDRENPEESPRSSRVAQDAHPTLDYAEKRGWSI 229 >UniRef50_Q03GQ2 Type II restriction endonuclease, MutH family n=5 Tax=Bacilli RepID=Q03GQ2_PEDPA Length = 492 Score = 183 bits (465), Expect = 4e-45, Method: Composition-based stats. Identities = 41/231 (17%), Positives = 82/231 (35%), Gaps = 21/231 (9%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPEQDF 71 T+ QL+ ++ TLGE+ +N K+ G G ++E LG + S D Sbjct: 7 TKTQLIGLLERAVNKTLGEVDTAHVFDKTKNNKKITGIAGDVIENSVLGYPSDSYQSPDL 66 Query: 72 --AALGVELKTIPVDSLGRP--------LETTFVCVAPLTGNSGVTWETSHVRHKLKRVL 121 + +ELKT + + E + + +E+S+ HKL+R+L Sbjct: 67 CVDGVDIELKTTGIKVAKKKSKSHLFEAKEPMSITAVSPEKITTEEFESSNFWHKLERML 126 Query: 122 WIPVEGERSI-----PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITAR 176 + + + A + ++E+ L+ DW+ + I Q Sbjct: 127 LVYYFYDSDVTVTAAEYANFLIKGYDFHEFSKEDQETLKNDWKIVQQFIQNLQNGYGEEA 186 Query: 177 HGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 +Y +I + + + F LK+ + ++ +F Sbjct: 187 SSQYPRISSEIRSQLMMI-----DTAPKWPHRPRFRLKRATVTTMVQEYFG 232 Score = 45.0 bits (105), Expect = 0.002, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 71/199 (35%), Gaps = 11/199 (5%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRD--KGWIGVLLEIWLGASAGS-KPEQDF 71 ++L +Q G T+ EL + + N + D K ++ GA + +DF Sbjct: 250 QELHKFTKQYHGMTIRELIKELSISIKLNKQDDAPKSVTEQIVTHMFGAKSKKINSIEDF 309 Query: 72 AALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVT--WETSHVRHKLKRVLWIPVEGE- 128 LG+ KTI + G E + +E S + + ++ + E Sbjct: 310 KKLGLIAKTITQTAKGTRTEDVKISRIDFEEFKPENDLFEESLIYNYFNEQQFLCILFEE 369 Query: 129 ---RSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERITARHGEYLQIRP 185 S L R +G L P+E ++ +++ W+++ +++ ++ + + Sbjct: 370 PNTSSKLLENRFLGFKRLSFPDEFINQDVKKTWQKIRYLVIHNKLVDVISYDKN--GHPR 427 Query: 186 KAANAKALTEAIGARGERI 204 K + A Sbjct: 428 KNKKTNTIVSAPNFPKSSE 446 >UniRef50_C8JWG6 Restriction endonuclease n=4 Tax=Bacillales RepID=C8JWG6_LISMO Length = 555 Score = 182 bits (463), Expect = 6e-45, Method: Composition-based stats. Identities = 52/257 (20%), Positives = 90/257 (35%), Gaps = 48/257 (18%) Query: 11 PETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQ 69 +T+E++ +A + G TLGE+ +K + G ++E W A + E Sbjct: 5 YDTKEKVHNRALEAIGKTLGEIDENNL-----KNVNNKSYPGNVIEQIWYDHPADNISEP 59 Query: 70 DFAALGVELKTIPVDSLGRPL-----------ETTFVCVAPLTGNSGVTWETSHVRHKLK 118 DF GVELK P+D + E + ++E S HK K Sbjct: 60 DFKEAGVELKVTPIDKQSKGNKDKKVTRLIAGERLVLNKIHYKNEYKKSFEESSFWHKNK 119 Query: 119 RVLWIPVE------------GERSIPLAQRRVGSPLLWSP----------------NEEE 150 + I E Q ++ L S ++++ Sbjct: 120 VIELIQYYRRDTSDKKKFSRKELIEDKKQFKIAYATLLSMVDLTDFNLPKDTVLEISDKD 179 Query: 151 DRQLREDWEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRG 210 +++DWE++ +I + E ++ YL KAA T + G I PR Sbjct: 180 FEIIKQDWEKISKLINESKAEELSEGMTNYLGACTKAATGAEFTTQV---GSEIKPKPRA 236 Query: 211 FYLKKNFTSALLARHFL 227 + K F + L+ + Sbjct: 237 YSFKTKFINELINTQII 253 >UniRef50_A1A2S7 Putative type II restriction enzyme Sau3AI n=1 Tax=Bifidobacterium adolescentis ATCC 15703 RepID=A1A2S7_BIFAA Length = 643 Score = 181 bits (460), Expect = 1e-44, Method: Composition-based stats. Identities = 46/248 (18%), Positives = 85/248 (34%), Gaps = 41/248 (16%) Query: 9 SPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKP 67 SP T+ + +A G TL E+ + +K + G ++E W A + Sbjct: 126 SPYTTKTAVQLRANDAVGRTLEEID-----FNNTEKQGNKSYPGNVIEHVWFDHPADNIS 180 Query: 68 EQDFAALGVELKTIPVDSLGRPL--------ETTFVCVAPLTGNSGVTWETSHVRHKLKR 119 DF VELK P+D + + E + S + TS K + Sbjct: 181 APDFPEAEVELKVSPID-IKKSKDGPSCIAGERLVLNTINYAKESTADFRTSSFWKKNRF 239 Query: 120 VLWIPVE----GERSIPLAQRRVG------------------SPLLWSPNEEEDRQLRED 157 + + E+ +R+ L +E ++ +D Sbjct: 240 IELMQYLRRIASEKGQSEDKRKYKIEYAHLLAMDCFAEPNFPQNSLLPLSEATMLRIEQD 299 Query: 158 WEELMDMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNF 217 W + IV + + +T + L K AN K ++ + G R + K++F Sbjct: 300 WNTIHQFIVENRADELTEGMTDTLAACTKGANNKQ--KSRQSNGA--RAKSRAYCFKQSF 355 Query: 218 TSALLARH 225 ++LL + Sbjct: 356 MTSLLQNY 363 Score = 53.0 bits (126), Expect = 7e-06, Method: Composition-based stats. Identities = 31/171 (18%), Positives = 59/171 (34%), Gaps = 23/171 (13%) Query: 17 LLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQ------- 69 +L G + E+A P+N + +L+E LG++ + Sbjct: 389 ILDYFNPFKGKSFTEIAEQFHFTIPKNASPKQTNF-MLVEHMLGSNTSISKDPNDDYVSF 447 Query: 70 DFAAL-GVELKTIPVDSLGRPLETTFVCVAP-LTGNSGVTWETSHVRHKLKRVL-----W 122 D L G+ +KT+P+ G+P E V P W+T + L ++L Sbjct: 448 DAEELKGIRVKTLPLFH-GKPSEQFKVQSIPDFNDLINQKWDTDNC--ALHQLLETTKFL 504 Query: 123 IPVEG-----ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 + V E+ W + + + W+E ++ + G Sbjct: 505 VFVFDSQNPNEKDKNPENIYFKGAAFWYMPASDIDIVEQVWKEDVEKLRNG 555 >UniRef50_Q30PF3 DNA mismatch repair enzyme MutH n=1 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30PF3_SULDN Length = 467 Score = 175 bits (445), Expect = 7e-43, Method: Composition-based stats. Identities = 34/186 (18%), Positives = 68/186 (36%), Gaps = 8/186 (4%) Query: 48 KGWIGVLLEIWLG-ASAGSKPEQDFAALGVELKTIPVDSLGR----PLETTFVCVAPLTG 102 KG G +LE + + S E DF + +ELK+ P+ L E + + Sbjct: 41 KGSFGQILEKFYFLYAPNSNAEADFPEVNLELKSSPLKQLKNNQFRAKERLSLNIINYQD 100 Query: 103 NSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELM 162 +ETS K + +L + E + + W+ + +++DW+ + Sbjct: 101 IVHQNFETSSFWKKNENLLLVFYLYENDTNVLDYVIKLVDEWTFPSIDLEIIKQDWKRIK 160 Query: 163 DMIVLGQVERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 ++ G+ ++ YL PK E + R + LK+ + + ++ Sbjct: 161 QKVLDGKAHELSEGDTFYLGAAPKGGKGGNPREQPN---STLTAKQRAYSLKQGYVNHII 217 Query: 223 ARHFLI 228 A Sbjct: 218 ASISGN 223 Score = 56.1 bits (134), Expect = 8e-07, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 60/167 (35%), Gaps = 9/167 (5%) Query: 3 QPRPLLSPPETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGAS 62 Q + EE ++++ + T+ ++ + ++ + K + L + LG Sbjct: 232 QSTEITKDKTLEEIVVSKFESYYDKTIEDI---LAMLNINLNLKAKNFYANLTKAILGIE 288 Query: 63 AGSKPEQDFAALGVELKTIPVDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKR-VL 121 + E +F + +KTI + P E TW+ S + + L L Sbjct: 289 LNKEIE-EFEKAEIIVKTIRLKDNNLPKEDISFPNFKYENIVNETWDESEISNILGHKFL 347 Query: 122 WIPVEGERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLG 168 ++ + E + W+ ++ ++ + W +++ G Sbjct: 348 FVFFQFE----NKKLIFKKAQFWNMPYKDILEVEKVWARTKEIVQSG 390 >UniRef50_A6BKR2 Putative uncharacterized protein n=1 Tax=Dorea longicatena DSM 13814 RepID=A6BKR2_9FIRM Length = 517 Score = 173 bits (440), Expect = 3e-42, Method: Composition-based stats. Identities = 46/239 (19%), Positives = 77/239 (32%), Gaps = 29/239 (12%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQD- 70 + L+ + + G T + ++ KG G ++E SK E D Sbjct: 7 DKNALIHRLEGYLGKTFEMIDNKEMFKHVQDFNLQKGIAGSVVEQCIFEYPPDSKQEADL 66 Query: 71 -----FAALGVELKTIPVDSLGRP------LETTFVCVAPLTGNSGVTWETSHVRHKLKR 119 + ELKT + +P E + + + +ETSH KL+ Sbjct: 67 IIVDGIENVKTELKTTGMLIREKPRKHYVAKEPMSITAVGVYDIAEQEFETSHFWEKLEH 126 Query: 120 VLWIPVEG-----ERSIPLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERIT 174 +L I + V +++E LR DWE + + Sbjct: 127 MLIIYYYYAATHAVSAYEYKDFPVVGYEFHEFSDDEIEVLRNDWEHVRTLCAKVVSHHSG 186 Query: 175 ARHGEYLQIRPKAANAKALTEAIGARGERI------LTLPRGFYLKKNFTSALLARHFL 227 R + K A + E G + P F LKK S+++A+HF Sbjct: 187 PRDKNW-----KEAVKQEYIEVHGELRRVLSYIDLAPKFPPRFRLKKPTVSSIIAKHFG 240 >UniRef50_Q9ACM4 R.Sth368I endonuclease n=1 Tax=Streptococcus thermophilus RepID=Q9ACM4_STRTR Length = 491 Score = 171 bits (434), Expect = 2e-41, Method: Composition-based stats. Identities = 43/233 (18%), Positives = 85/233 (36%), Gaps = 25/233 (10%) Query: 13 TEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEI-WLGASAGSKPEQDF 71 TEE+++ ++ G TLG++ + + G G ++E G A +K E D Sbjct: 7 TEEEVVRILDKIVGKTLGQIDKNNVFDKTKEKPKITGIAGDVIEQCVFGYQADNKQEADL 66 Query: 72 AALG--VELKTIPVD---------SLGRPLETTFVCVAPLTGNSGVT-WETSHVRHKLKR 119 G VELKT + E + + + S + HKL+ Sbjct: 67 LIDGRPVELKTTGLKKSSPKSKSKHALEAKEPMSITAVSVDKIGSQDNFYESTLWHKLEE 126 Query: 120 VLWIPVEGERSI-----PLAQRRVGSPLLWSPNEEEDRQLREDWEELMDMIVLGQVERIT 174 +L + + + + +EE+ + L DW+ + D + + + Sbjct: 127 LLIVYYLYDSDKIVKASEYSNFPIQGYQFLIFSEEDKKVLENDWKIVRDFVRNVEKNALD 186 Query: 175 ARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALLARHFL 227 + E+ +I +K TE P F L +++ + ++ +HF Sbjct: 187 -KSIEFPKI------SKLRTEMSYMDTAPKYPHPPRFRLTRSYVTTIVQQHFG 232 >UniRef50_Q6F1F9 Type II restriction enzyme Sau3AI, GATC site n=1 Tax=Mesoplasma florum RepID=Q6F1F9_MESFL Length = 452 Score = 168 bits (426), Expect = 1e-40, Method: Composition-based stats. Identities = 43/221 (19%), Positives = 83/221 (37%), Gaps = 16/221 (7%) Query: 12 ETEEQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIW-LGASAGSKPEQD 70 +T + + +A++ LGE+ + + +DKG IG L++ + ++ E D Sbjct: 3 KTIDDIFRRAKEAENKQLGEI---INFDEIDYKLKDKGRIGNLIQKYYFNIDINNRSESD 59 Query: 71 FAALGV--ELKTIPVDSLGRP----LETTFVCVAPLTGNSGVTWETSHVRHKLKRVLWIP 124 F LG+ ELK + + E + + N + S V +K K +L I Sbjct: 60 FKELGLDLELKVTGLKQNKKKQWIAKERVSLSMIDF-NNIEKDFTDSKVLNKNKNILLIG 118 Query: 125 VEGERSIPLAQRRVGSPLLWSP---NEEEDRQLREDWEELMDMIVLGQVERITARHGEYL 181 E + ++ +LW +E E + + D+ + I G ++ + L Sbjct: 119 HEYNKDKNRKNLKIIKSILWKYENISENEKKIIENDYSIIYQKIQNGIAHELSCSDTKIL 178 Query: 182 QIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 + K + R F LKK + + +L Sbjct: 179 EATTKGQGKN--KDLRNQPNSDFKAKSRAFALKKWYVNYIL 217 >UniRef50_A8RH65 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RH65_9CLOT Length = 217 Score = 154 bits (390), Expect = 2e-36, Method: Composition-based stats. Identities = 51/216 (23%), Positives = 87/216 (40%), Gaps = 16/216 (7%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAAL 74 ++ + +L+G EL + + + +KG G LLE+ LG S DF Sbjct: 4 QEACVRFNKLAGIRFEELFSQNDMNM---IIINKGKTGQLLELALGMHLSST-NLDFDD- 58 Query: 75 GVELKTIPVDSLGRPLETTFVCVAPLTGNSGVT---WETSHVRHKLKRVLWIPVEGERSI 131 ELKT D+ G P ET F+ + + +E +H+ K+ +L++PV + S Sbjct: 59 -GELKTNKCDNAGNPKETVFITQISSVIDELIQERPFEETHLYEKISNILYVPVCKDGSP 117 Query: 132 PLAQR-RVGSPLLWSPNEEEDRQL-REDWEELMDM----IVLGQVERITARHGEYLQIRP 185 L SP + R + R D+ + I RI +G+++Q+R Sbjct: 118 KDWMFLPSIHIDLSSPRYAQLRDIWRSDYYSICRQLRMHIETSPDRRIHTSNGQHIQVRS 177 Query: 186 KAANAKALTEAIGARGERILTLPRGFYLKKNFTSAL 221 K + + G G + FY +K F + Sbjct: 178 KDSMPYHPIYS-GVYGRYVSNKNHAFYFRKEFVYDI 212 >UniRef50_C4Z3T8 Putative uncharacterized protein n=2 Tax=Clostridiales RepID=C4Z3T8_EUBE2 Length = 216 Score = 138 bits (348), Expect = 1e-31, Method: Composition-based stats. Identities = 45/216 (20%), Positives = 84/216 (38%), Gaps = 16/216 (7%) Query: 16 QLLAQAQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALG 75 + + ++L+ E N+ +KG G +LE+ +G + DF Sbjct: 5 EAQKRIEKLAYIPFKEYLTEEQF---NNIVVNKGKTGQILELTIGLQLSNTTL-DFED-- 58 Query: 76 VELKTIPVDSLGRPLETTFVCVAPLTGNSGVTW---ETSHVRHKLKRVLWIPVEGERSIP 132 ELKT + LG P ET F+ + +++ E S + K+ R+L++ + E + Sbjct: 59 GELKTNKCNKLGIPAETMFITQTASIIDELLSFKLFEESKLYKKINRMLYVRINKEGAPE 118 Query: 133 LAQRRVGSPLLWSPNE--EEDRQLREDWEELM----DMIVLGQVERITARHGEYLQIRPK 186 + S + E QL+ D+ ++ D + L + +G+Y+QIR K Sbjct: 119 NWMFLPPILVDLSLPKYAELAAQLKFDYIDICDILNDQLCLSPDAMLHTANGKYIQIRTK 178 Query: 187 AANAKALTEAIGARGERILTLPRGFYLKKNFTSALL 222 + + + R FY K L+ Sbjct: 179 DSIPYHAIYSR-KYDRIVSDKNRAFYFKIPCLKYLI 213 >UniRef50_C5EHV9 Type II restriction endonuclease n=1 Tax=Clostridiales bacterium 1_7_47FAA RepID=C5EHV9_9FIRM Length = 507 Score = 132 bits (333), Expect = 8e-30, Method: Composition-based stats. Identities = 45/238 (18%), Positives = 84/238 (35%), Gaps = 39/238 (16%) Query: 11 PETEEQLLAQAQQLSG-YTLGELAALVGLVTPENLKRDKGWIGVLLE-IWLGASAGSKPE 68 T E+++ + Q G TLGE+ T KG G ++E L S E Sbjct: 7 YFTREEIIRKFQVFDGEKTLGEVDVRDVFHT---QVPAKGIAGCVIEQSVLSMKQNSTQE 63 Query: 69 QDFAAL-------GVELKTI---PVDSLG---RPLETTFVCVAPLTGNSGVTWETSHVRH 115 D + L ELK P + G E + + + SH + Sbjct: 64 ADLSILQADGSYKDTELKVTGVEPSNKQGEKYAAKEPMSLTAVSIGVIEHEKFLESHFYN 123 Query: 116 KLKRVLWIPVEGE-----RSIPLAQR----RVGSPLL-WSPNEEEDRQLREDWEELMDMI 165 K+ +L++ + + +P A+ +G L + +++E + DW ++ D + Sbjct: 124 KIAHMLFVFYVYDRKSGQKVVPYAEYDRFPVLGYKFLDIADDKDELAKFENDWTQVHDYL 183 Query: 166 VLGQV-----------ERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFY 212 + + + + Y+ I P+ T + + TL +GFY Sbjct: 184 LSIENREDRYDLYPLLHQKIKKDLFYIDIAPRFKQQPRQTPRFRFKKSYVNTLFQGFY 241 >UniRef50_Q6SFL0 Putative uncharacterized protein n=1 Tax=uncultured marine bacterium 580 RepID=Q6SFL0_9BACT Length = 236 Score = 110 bits (275), Expect = 4e-23, Method: Composition-based stats. Identities = 36/238 (15%), Positives = 78/238 (32%), Gaps = 45/238 (18%) Query: 21 AQQLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKT 80 ++ +L + + + R+KG G L+E G + DF ELKT Sbjct: 10 CKKSINIPFKDLFSN---FKTDEIIRNKGKSGQLMEKLCGLKLSNTT-TDFED--GELKT 63 Query: 81 IPVDSLGRPLETTFVCVA-----PLTGNSGVTWETSHVRHKLKRVLWIPVEGERSIPLAQ 135 + E+T + + + +++E S + K++ ++++P+ PL Sbjct: 64 SEL------KESTAITMITDWVDDIIHEEPISFEVSRLSKKIQHLIFMPLVKPSKDPLCW 117 Query: 136 RRVGSPLL-WSPNEEEDRQLREDWEELMD-----------------MIVLGQV------- 170 + + +++D+E + I + Sbjct: 118 YFKNCIYINITKGTSLYNDIKKDFENICLNSHELIYKKKITQLTDCKITKNKSLYSDKYG 177 Query: 171 -ERITARHGEYLQIRPKAA--NAKALTEAIGARGERILTLPRGFYLKKNFTSALLARH 225 + + G+Y+QIR K A + + L FY +F + + Sbjct: 178 DKFLHTISGKYIQIRTKDAGKTKSKPIFSKLLKRNITLKSRMAFYFLASFKDYVDKNY 235 >UniRef50_C0EPR1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPR1_NEIFL Length = 128 Score = 83.5 bits (205), Expect = 5e-15, Method: Composition-based stats. Identities = 26/111 (23%), Positives = 45/111 (40%), Gaps = 7/111 (6%) Query: 116 KLKRVLWIPVEGERSIPLAQRRVGSPLLWSPNEEED--RQLREDWEELMDMIV---LGQV 170 K+K +L++P+ P Q + S NEE + RQL +D+ ++ ++ + Sbjct: 5 KIKNLLYVPICKVGDDP-DQWYIQSAYHVQINEEGELFRQLADDFTQIKSKLLSDINSEN 63 Query: 171 ERITARHGEYLQIRPKAANAKALTEAIGARGERILTLPRGFYLKKNFTSAL 221 I G ++QIR K + + I FY KK F + Sbjct: 64 GYIHTSSGVFIQIRSKDSKPYHPIFSA-QYDRDISNKNHAFYFKKEFMREV 113 >UniRef50_C0EPR0 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPR0_NEIFL Length = 109 Score = 76.9 bits (188), Expect = 4e-13, Method: Composition-based stats. Identities = 28/109 (25%), Positives = 48/109 (44%), Gaps = 10/109 (9%) Query: 15 EQLLAQAQQLSGYTLGELAALVGLVTPE---NLKRDKGWIGVLLEIWLGASAGSKPEQDF 71 ++ +G TLG+L P ++ +KG +G +E +G + + DF Sbjct: 4 TDSYDFFRENAGRTLGDLKLEYQTRFPSFTSEMRINKGGVGQFIEKLIGLN-NTNALTDF 62 Query: 72 AALGVELKTIPVDSLGRPLETTFVCVAPL----TGNSGVTWETSHVRHK 116 A ELKT D+ G PLET F+ ++ +++E S + K Sbjct: 63 AD--GELKTNKADTGGAPLETMFISQISSNFDQLISNQISFEDSWIYQK 109 >UniRef50_A0Z3W2 Putative uncharacterized protein n=1 Tax=marine gamma proteobacterium HTCC2080 RepID=A0Z3W2_9GAMM Length = 402 Score = 46.1 bits (108), Expect = 9e-04, Method: Composition-based stats. Identities = 30/143 (20%), Positives = 53/143 (37%), Gaps = 15/143 (10%) Query: 23 QLSGYTLGELAALVGLVTPENLKRDKGWIGVLLEIWLGASAGSKPEQDFAALGVELKTIP 82 + L L+ + + ++ +G LE LG A SK D+ G+ELK+ Sbjct: 156 DAANKLLEMLSEVSRQGFIKTMRAGDTGVGYTLETLLGIEANSKKTPDYN--GIELKSSR 213 Query: 83 VDSLGRPLETTFVCVAPLTGNSGVTWETSHVRHKLKRVL--WIPVEGERSIPLAQRRVGS 140 G ++T P W+ S++ K +L + + Sbjct: 214 TSKAGN-KQSTVFSQVP-------NWKLSNL-KGSKDILNKHGRFNSSKGRDQLYHEISC 264 Query: 141 PLLWSPNEEEDRQLREDWEELMD 163 L S + + + L++DWEEL Sbjct: 265 TKLNSYDLQLE--LKDDWEELRQ 285 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.310 0.143 0.375 Lambda K H 0.267 0.0437 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,220,639,847 Number of Sequences: 3077464 Number of extensions: 44002328 Number of successful extensions: 125612 Number of sequences better than 1.0e-01: 40 Number of HSP's better than 0.1 without gapping: 71 Number of HSP's successfully gapped in prelim test: 27 Number of HSP's that attempted gapping in prelim test: 125247 Number of HSP's gapped (non-prelim): 142 length of query: 229 length of database: 1,040,396,356 effective HSP length: 124 effective length of query: 105 effective length of database: 658,790,820 effective search space: 69173036100 effective search space used: 69173036100 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.1 bits) S2: 91 (39.6 bits)