BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (348 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P15006 Protein mcrC n=9 Tax=Escherichia RepID=MCRC_ECOLI 721 0.0 UniRef50_A6BZ07 McrC protein n=1 Tax=Planctomyces maris DSM 8797... 194 4e-48 UniRef50_Q1IK47 McrC protein n=1 Tax=Candidatus Koribacter versa... 184 5e-45 UniRef50_C1QDS7 McrBC 5-methylcytosine restriction system compon... 175 2e-42 UniRef50_D1PB97 Putative uncharacterized protein n=1 Tax=Prevote... 174 3e-42 UniRef50_B2GJY9 Putative McrC protein n=3 Tax=Actinomycetales Re... 160 8e-38 UniRef50_C8W851 McrBC 5-methylcytosine restriction system compon... 153 1e-35 UniRef50_UPI000197B905 hypothetical protein BACCOPRO_00614 n=1 T... 152 1e-35 UniRef50_C1PCD8 McrBC 5-methylcytosine restriction system compon... 150 8e-35 UniRef50_UPI0001C37412 5-methylcytosine-specific restriction enz... 148 3e-34 UniRef50_C2G7A3 5-methylcytosine-specific restriction enzyme sub... 146 1e-33 UniRef50_A6EGF0 5-methylcytosine-specific restriction enzyme Mcr... 140 5e-32 UniRef50_B0AA68 Putative uncharacterized protein n=1 Tax=Clostri... 131 5e-29 UniRef50_Q1QFB3 Putative uncharacterized protein n=1 Tax=Nitroba... 128 3e-28 UniRef50_B2JTN9 Putative uncharacterized protein n=1 Tax=Burkhol... 128 3e-28 UniRef50_D0VFC8 McrBC 5-methylcytosine restriction system compon... 119 2e-25 UniRef50_A1UB52 Putative uncharacterized protein n=2 Tax=Mycobac... 105 3e-21 UniRef50_UPI0001BCC8FD 5-methylcytosine-specific restriction enz... 86 2e-15 UniRef50_A5WF57 McrBC 5-methylcytosine restriction system compon... 84 7e-15 UniRef50_D0D6B3 5-methylcytosine-specific restriction enzyme sub... 83 2e-14 UniRef50_B7D079 5-methylcytosine-specific restriction enzyme sub... 81 5e-14 UniRef50_C0WJ24 5-methylcytosine-specific restriction enzyme sub... 71 7e-11 UniRef50_C4T585 McrBC 5-methylcytosine restriction system compon... 70 1e-10 UniRef50_B2A7U4 McrBC 5-methylcytosine restriction system compon... 64 1e-08 UniRef50_B1BM79 Putative uncharacterized protein n=1 Tax=Clostri... 60 9e-08 UniRef50_C7NQX2 McrBC 5-methylcytosine restriction system compon... 60 1e-07 UniRef50_Q9RZI4 Putative uncharacterized protein n=1 Tax=Deinoco... 59 3e-07 UniRef50_UPI0001BCB4AE McrBC 5-methylcytosine restriction system... 58 6e-07 UniRef50_Q26DA3 Putative uncharacterized protein n=2 Tax=Flavoba... 53 2e-05 UniRef50_A6VF50 Putative uncharacterized protein n=2 Tax=Methano... 51 7e-05 UniRef50_A6EMU6 5-Methylcytosine-specific restriction enzyme C n... 51 8e-05 UniRef50_D2QXZ3 McrBC 5-methylcytosine restriction system compon... 50 9e-05 UniRef50_C5CG01 McrBC 5-methylcytosine restriction system compon... 50 1e-04 UniRef50_Q6LZ73 Putative uncharacterized protein n=1 Tax=Methano... 50 2e-04 UniRef50_C7PTE7 McrBC 5-methylcytosine restriction system compon... 50 2e-04 UniRef50_Q10YL1 McrBC 5-methylcytosine restriction system compon... 47 7e-04 UniRef50_B4V8L4 Putative uncharacterized protein n=1 Tax=Strepto... 47 8e-04 UniRef50_Q4C3F9 Similar to McrBC 5-methylcytosine restriction sy... 47 8e-04 UniRef50_C0GTU8 Putative uncharacterized protein n=1 Tax=Desulfo... 45 0.003 UniRef50_Q139N0 Putative uncharacterized protein n=1 Tax=Rhodops... 45 0.003 UniRef50_A4TAT5 McrBC 5-methylcytosine restriction system compon... 45 0.004 UniRef50_B5IHH3 McrBC 5-methylcytosine restriction system compon... 44 0.007 UniRef50_B7UQU6 McrC family protein, predicted McrBC 5-methylcyt... 44 0.007 UniRef50_A4J0H3 McrBC 5-methylcytosine restriction system compon... 41 0.072 UniRef50_D2B1B6 McrBC 5-methylcytosine restriction system compon... 41 0.081 UniRef50_C5EXA5 Putative uncharacterized protein n=1 Tax=Helicob... 40 0.086 >UniRef50_P15006 Protein mcrC n=9 Tax=Escherichia RepID=MCRC_ECOLI Length = 348 Score = 721 bits (1860), Expect = 0.0, Method: Compositional matrix adjust. Identities = 348/348 (100%), Positives = 348/348 (100%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL Sbjct: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE Sbjct: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ Sbjct: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR Sbjct: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL Sbjct: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 Query: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK Sbjct: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 >UniRef50_A6BZ07 McrC protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZ07_9PLAN Length = 362 Score = 194 bits (493), Expect = 4e-48, Method: Compositional matrix adjust. Identities = 113/345 (32%), Positives = 186/345 (53%), Gaps = 5/345 (1%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 IP++NIYY+L YAW LQE + ++ ++ VL+ GV L +RGL+ +Y Sbjct: 2 TIPIQNIYYLLCYAWDKLQEGQIVSVSPEDCQTTAELFARVLDSGVTHLLKRGLDRNYIS 61 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 ++G+ + TI+ L + D + D NRIIK+TL L++ +L+ Sbjct: 62 EEIETSSLRGKFDITTTIKQNLLRKSRVHCVVDSFSYDVPHNRIIKATLRNLLRCRELDR 121 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 RD LYR+L +S + LTP+ F+ + +N +Y F++ VC+ I +N + + G Sbjct: 122 DQRDRLLRLYRRLHDVSDIKLTPKDFNNVQLHRNNAWYGFLLQVCRLIYDNLLINEETGD 181 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 +F DF R+E++M+ L++ F++ F R+E + L W + + LPRM TD Sbjct: 182 SQFRDFLRDERQMARLFENFVFNFYRKEQSVFKVKSELLTWQGVDATPEDQQFLPRMRTD 241 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGEN-----IGG 299 +++ SS + +++DAKYYK G HS+NLYQL YL + +N +N I G Sbjct: 242 VSLDSSTRKIVLDAKYYKDSLQSFHGNSSVHSENLYQLFAYLKNFYLKNIQNGDSRPIEG 301 Query: 300 LLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 +L+YP ++ Y+I+G I + T+NL W I Q+LL+I + Sbjct: 302 ILLYPTTGQSLSLNYEIHGHSIRIVTLNLNTSWKEIRQQLLNILE 346 >UniRef50_Q1IK47 McrC protein n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IK47_ACIBL Length = 351 Score = 184 bits (466), Expect = 5e-45, Method: Compositional matrix adjust. Identities = 108/349 (30%), Positives = 183/349 (52%), Gaps = 11/349 (3%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IPV N+YY+L YAW L+E ++ +L+++ VL G+ L ++G++ Y + Sbjct: 3 IPVANVYYLLCYAWDKLEERDLVDIHPTEETDLVNLFARVLTNGIDHLLKKGIDRGYLLH 62 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 +E ++GRI+F ++I+ + FD L+ D L NRI+KST+ LI+ L+S Sbjct: 63 SEESCVLRGRIDFPQSIKHMLFQRAQAHCEFDELSFDVLHNRILKSTIMRLIRTRDLDSG 122 Query: 126 IRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 IRD YR + L L+ Q F + +N +Y F++ VC + N +P Q G++ Sbjct: 123 IRDRLLFQYRYFAEVGDLDLSVQIFGKVQLYRNNHFYDFLLRVCALLFENLLPTQEPGNW 182 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTS------ANTTRSYLKWDASSISDQSLNLLP 239 RF F +N ++M+ ++++F+ F +REL S R + W + D S LLP Sbjct: 183 RFRSFLQNREQMAYVFERFVRNFYKRELPSVRVDGRCKVKREDINWGMTPSDDLSSALLP 242 Query: 240 RMETDITIRSSEKILIVDAKYYKSIFSRRMG-TEKFHSQNLYQLMNYL--WSLKPENGEN 296 +M+TD+ I + K ++V+ KY +R K + +LYQ+ YL W P + Sbjct: 243 KMQTDVCITTEAKRILVECKYVDDPLEQREEMAPKLITTHLYQVNAYLDNWPDLPLYRSS 302 Query: 297 IGGLLIYPHVDTAVKHRY-KINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 +L+YP + + + +G + + T+NL Q+W IHQ+LL + D Sbjct: 303 -RAILLYPLATRPIAVEFTRADGQLLSVRTLNLAQQWSAIHQDLLRLVD 350 >UniRef50_C1QDS7 McrBC 5-methylcytosine restriction system component n=2 Tax=Brachyspira RepID=C1QDS7_9SPIR Length = 350 Score = 175 bits (443), Expect = 2e-42, Method: Compositional matrix adjust. Identities = 113/355 (31%), Positives = 194/355 (54%), Gaps = 24/355 (6%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQAN-LEAIPGN----NLLDILGYVLNKGVLQLSRRGLEL 60 IP++NIYYML+YAW I + N + I G+ N+ +++GY+LN + +L +RG Sbjct: 7 IPIKNIYYMLSYAWNIWNIINEDNDKKEIFGDEKFDNIYNVMGYILNIFLEKLIKRGFYR 66 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Y E + +KG+I F+++++ H K V ++++L+ D L N+IIK TL LI ++ Sbjct: 67 GYITLEEDLSVLKGKINFSESVK--RNTHKKLVCSYNILSNDILFNQIIKYTLNKLINYK 124 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 +++ I+++ L I +++ + F L KN +YK +I++CKF+ N + + Sbjct: 125 NIDNDIKEKLIKLNHYFIKIKNINVNNRTFKLLKYNKNNMHYKIIINICKFVHKNLLVNK 184 Query: 181 NKGHYRFYDFERNEKEMSLLYQKF------LYEFCRRELTSANTTRSYLKWDASSISDQS 234 N Y F DF EK M +LY+KF +Y F + + N T +KW+ I+D Sbjct: 185 NSSEYSFIDFNE-EKRMHMLYEKFVLNFYKIYFFHNKNIKVKNKT---IKWN---INDNE 237 Query: 235 LNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP--E 292 +P M+TDI I + EK LI+D K+YK+I + S +LYQ+ +Y+ ++ + Sbjct: 238 --YIPIMKTDIMIYNKEKCLIIDTKFYKNILIKNNDKVSLRSSHLYQIFSYMSNINNSYK 295 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 + I G+L+YP + + YKIN + T++L ++ I EL++I Y Sbjct: 296 RFKTIKGILLYPLCNDNLNKEYKINDKYFAVNTIDLNSDFNIIKSELINIIKNYF 350 >UniRef50_D1PB97 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PB97_9BACT Length = 364 Score = 174 bits (442), Expect = 3e-42, Method: Compositional matrix adjust. Identities = 108/355 (30%), Positives = 195/355 (54%), Gaps = 16/355 (4%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 Q IP+ N+YY+L YAWG ++ + ++ ++L ++L +L +L R+GL Y Sbjct: 2 QQKIPIENLYYLLCYAWGVSDQLDKVKVDGEKCHSLENLLSTILLNACDRLLRQGLLRAY 61 Query: 63 NPNTEIIPGIKGRIEFAKTIR-GFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 + + G++G++ A+T++ G HLN G+T+ D L +D + NR+I STL L++ E Sbjct: 62 RFEEQEVEGVRGKLNLAETLKSGKHLN-GRTICQVDELTQDVVINRVIFSTLKRLMRIEG 120 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 ++ IR R K P I + +T L + + +YK V+++C+ I ++++P ++ Sbjct: 121 IDENIRARLRKTLAKFPHIEEIRVTEGLLGRLRQHRLSGFYKLVLNICRLIWDSTLPCKD 180 Query: 182 K-GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA--NTTRSYLKWDASSIS---DQSL 235 K G F DF ++ M+ ++++FL FC++ R Y+ + S ++ Sbjct: 181 KDGRLEFLDFTEDDFRMNCIFERFLMNFCKQNCRDEYPEVHREYIDFQLSPFGMMFKEAG 240 Query: 236 NLLPRMETDITI--RSSEKILIVDAKYYK-SIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 LP METD+T+ ++++ LI+DAK+Y+ ++ S+ G EK +L Q+++Y+ + + Sbjct: 241 EALPVMETDVTLFNPNTQEKLILDAKFYREALVSKFGGREKVRRDHLSQILSYVMNQEDR 300 Query: 293 NGE---NIGGLLIYPHVDT--AVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDI 342 + N G L+YP VD +RYK G I + TVNLGQ W I + + +I Sbjct: 301 SKPHTMNAYGALVYPTVDEDFDFSYRYKETGHRIIVRTVNLGQPWRKIEERVKEI 355 >UniRef50_B2GJY9 Putative McrC protein n=3 Tax=Actinomycetales RepID=B2GJY9_KOCRD Length = 348 Score = 160 bits (404), Expect = 8e-38, Method: Compositional matrix adjust. Identities = 93/342 (27%), Positives = 166/342 (48%), Gaps = 4/342 (1%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 M+ +RNIY ML YA+ ++ +++ ++ D+ +L +GV +RG+ Sbjct: 1 MKDRTATIRNIYVMLAYAFRAIRTPDASDVGTEEFTHIHDLFAEILAQGVSAQVKRGVHH 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DY E + ++GRI+ T+ + G FD DT N+ +KS + +LI+H Sbjct: 61 DYLRRDEQLTTVRGRIDVTATMVARAVTPGSVSCIFDTYEPDTPFNQALKSVMVLLIRHG 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++ +D R L L ++ + + + Y+ ++ VC+ +V +P + Sbjct: 121 EVGQRRKDALRRLLPYLDAVTLVSPRSIRWEKFTCHRRNAAYRILLGVCQLVVEGLLPTE 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 N G + ++ +E+ MS LY++FL E+ + T ++ WD ++ + LP Sbjct: 181 NSGDTQLAEW-LSEEAMSALYERFLREYYAFHHPELSPTARHVAWDYDPVTAVGADQLPA 239 Query: 241 METDITIRSSEKILIVDAKYY-KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 M TD+T+ S + LI+DAKYY + + S G HS NLYQ+++Y+ + N + G Sbjct: 240 MRTDVTLTSGTRTLIIDAKYYSQPLTSGAYGKLTVHSANLYQMLSYIKNADVSNDGTVSG 299 Query: 300 LLIYPHVDTAVKHRYK--INGFDIGLCTVNLGQEWPCIHQEL 339 LL+Y D + I G +G T++L WP + EL Sbjct: 300 LLLYARTDAPAQPDVDVVIQGNRLGARTLDLAAPWPDLRHEL 341 >UniRef50_C8W851 McrBC 5-methylcytosine restriction system component-like protein n=21 Tax=cellular organisms RepID=C8W851_ATOPD Length = 351 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 97/343 (28%), Positives = 181/343 (52%), Gaps = 15/343 (4%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGV-LQLSRRGLELDYN 63 +I ++NIY+ML YA+ LQ ++ A N ++L +L +GV LQL +RGL +Y Sbjct: 1 MIRIQNIYHMLAYAFQTLQGQGYRDIAAEEFGNTTELLAEILARGVSLQL-KRGLGQEYI 59 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E + +G+IE +++++ + + V ++D + DT NRI+K+T+A+L++ + ++ Sbjct: 60 DREEALSSPRGKIELSESLKTRSILRRQLVCSYDEFSTDTRMNRILKATIALLVRSD-ID 118 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 + R L + + L + + ++ +N + Y+ +++VC +V + Q G Sbjct: 119 KVRKKALRRLLPYFVDVGDVDLEHEDW-HMRFDRNNQAYRMLMNVCWLVVKGLLQTQEDG 177 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 R D +E+ MS LY+KF+ E+ RRE + Y+ W ++ D ++LP M T Sbjct: 178 SIRMMDL-LDEQRMSHLYEKFILEYYRREHPKLSAGAPYIDW---ALDDGFDDMLPAMHT 233 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE-----NGENIG 298 DI + +LI+DAKYY ++ HS NLYQ+ Y+ + + E ++ Sbjct: 234 DIMLEQGRTVLIIDAKYYSRTMQQQFDKRSVHSSNLYQIFTYVKNKEVELSSTLKAHSVS 293 Query: 299 GLLIYPHVDTAVKHR--YKINGFDIGLCTVNLGQEWPCIHQEL 339 G+L+Y D ++ Y+++G I + T++L Q + I +L Sbjct: 294 GMLLYAKTDEEIQPDGVYQMSGNQISVRTLDLNQPFEEIRSQL 336 >UniRef50_UPI000197B905 hypothetical protein BACCOPRO_00614 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B905 Length = 346 Score = 152 bits (385), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 105/351 (29%), Positives = 180/351 (51%), Gaps = 20/351 (5%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGN---NLLDILGYVLNKGVLQLSRRGLELDY 62 I +RN+YYML YA+ +E+K+ N E I ++ D+ +L KGV +RGL +Y Sbjct: 7 IWIRNVYYMLAYAF---EELKKNNYEQIAHEEFEHIQDLFAEILYKGVSAQLKRGLHREY 63 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 E +P +KGR++ TI FD L+E+ L NR++K+TL++L + Sbjct: 64 INRVEDLPLLKGRLDIRGTIANQMRCRNVLCCEFDDLSENNLFNRVLKTTLSLLCHERNV 123 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 +S + E R+L G+ + + ++ +N + Y+ +++VC FI++ + Sbjct: 124 SSVRKAELRTLLPFFSGVDEIDVRNIRWNDFVYQRNNQMYRMLMNVCYFIIDGMLMTTET 183 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCR---RELTSANTTRSYLKWDASSISDQSLNLLP 239 G YR F +++ M L++KF+ E+ R REL S N R ++W+ S ++LLP Sbjct: 184 GKYRMATF--SDEHMCRLFEKFVLEYYRLHHREL-SPNPDR--IEWNIYSKDAMVIDLLP 238 Query: 240 RMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 M++DI + ++ L++D KYY HS NLYQ+ Y+ +L ++ N+ G Sbjct: 239 AMQSDIVLHRGDQSLVIDTKYYSHAMQYHFDKPTIHSANLYQIFTYVKNLDVKDTGNVSG 298 Query: 300 LLIYPHVDTAVKH--RYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 LL+Y D + I + T++L QE+ I +L +E+LK Sbjct: 299 LLLYAKTDEDITPDLSASFGKNHIRVRTLDLNQEFSGIASQL----EEFLK 345 >UniRef50_C1PCD8 McrBC 5-methylcytosine restriction system component-like protein n=6 Tax=Bacteria RepID=C1PCD8_BACCO Length = 355 Score = 150 bits (378), Expect = 8e-35, Method: Compositional matrix adjust. Identities = 95/354 (26%), Positives = 183/354 (51%), Gaps = 14/354 (3%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPG---NNLLDILGYVLNKGVLQLSRRG 57 M+ I +RNIYYML+YA+ L K++N + I ++ D+ +L KG+ + ++G Sbjct: 1 MKDKGILIRNIYYMLSYAFRVL---KRSNYDEIGSERFEHIQDLFAAILTKGIARQLKQG 57 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILI 117 L +Y + + + ++G+++ TI K +D L+E+ + N+I+K+T IL+ Sbjct: 58 LYKEYVSHCDDLSVLRGKLDIHGTIHHKLQRKQKLSCEYDELSENNVFNQILKTTSVILM 117 Query: 118 KHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSI 177 + +N R + + + + T ++ L KN + YK ++++C F+++ + Sbjct: 118 QQPSVNVKRRTALKKVMLHFDSVDMIEPTRIKWNILRFQKNNQSYKMLLNICYFVLDGLL 177 Query: 178 PGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNL 237 +KG ++ +F +E+ MS L++KF+ E+ R S + W+ I + + Sbjct: 178 LSTDKGKFKMANF-LDEQHMSRLFEKFVLEYYRYHYPSLRAAAPQIAWN---IDTGATDF 233 Query: 238 LPRMETDITIRSSEKILIVDAKYYKSIF--SRRMGTEKFHSQNLYQLMNYLWSLKPENGE 295 LP M+TDI ++S K+LI+D KYY R G+ FHS NLYQ+ Y+ + N Sbjct: 234 LPTMQTDIVLKSCSKVLIIDTKYYAHTMQVQSRYGSRTFHSNNLYQIFTYVKNQDVGNTG 293 Query: 296 NIGGLLIYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 N+ G+L+Y + + + ++G + + T++L + I ++L +I Y Sbjct: 294 NVAGMLLYARTEETIVPNADFMMSGNKMSVKTLDLNTAFGNIAEQLDNIATSYF 347 >UniRef50_UPI0001C37412 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37412 Length = 351 Score = 148 bits (373), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 98/349 (28%), Positives = 178/349 (51%), Gaps = 16/349 (4%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNN---LLDILGYVLNKGVLQLSRRGLELDY 62 I ++NIYYML+YA+ Q +KQ + + + G + D+ +L KGV + ++GL +Y Sbjct: 7 IFIQNIYYMLSYAF---QILKQEDYKQVAGEKFEKIHDLFAAILEKGVSRQVKQGLYREY 63 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 P E + ++G++ +T+R N K FD +ED N+I+K T+ LI+ E + Sbjct: 64 VPTQEDLSVMRGKLNMGETVRLKVQNKQKLGCEFDEFSEDNPYNQILKVTIHRLIRAEDV 123 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSY--LNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + R + I + P H ++ L ++ R Y+ ++++C ++N + Sbjct: 124 APERKQALRRVSVYFGNIRLIQ--PDHIAWNRLIYQRSNRNYELLLNICYLVLNGMLQTT 181 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSL-NLLP 239 G Y+ F +++ M LY+KF+ E+ ++ + + +KW+ + DQ + LP Sbjct: 182 EDGSYKLLAF--SDEHMERLYEKFILEYYKQHHPELDPKSAQVKWNLTEEPDQPMIQFLP 239 Query: 240 RMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 +M+TDIT++ +K LI+DAKYY ++ E S +LYQ+ Y+ ++ N N+ G Sbjct: 240 KMQTDITLQKGDKTLIIDAKYYGKSMAQSYSKETLRSAHLYQIFAYVKNMDTANKGNVSG 299 Query: 300 LLIYPHVDTAV---KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 LL+Y + V + I G IG T++L + + +L I E Sbjct: 300 LLLYAKTEDEVFPEGEPFVIGGNRIGARTLDLNVSFDTLRIQLDKIAKE 348 >UniRef50_C2G7A3 5-methylcytosine-specific restriction enzyme subunit McrC n=16 Tax=Staphylococcus RepID=C2G7A3_STAAU Length = 346 Score = 146 bits (368), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 96/346 (27%), Positives = 181/346 (52%), Gaps = 10/346 (2%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 +I ++NIYYML+YA+ L + L N+ D+ +L KG+ GL +Y Sbjct: 1 MINIKNIYYMLSYAFTVLNKKGYQKLATEQFENIFDLYSAILIKGISSQLNSGLHHEYIE 60 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 T+ + I+G+++ +I+G + + +D + +T N+I+K+T+ LIK + ++ Sbjct: 61 QTDSLKVIRGKVDVKNSIQGLGVLSQRINCIYDEFSLNTYMNKILKTTMKCLIKTD-ISR 119 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + + R L + TL + Y + +N + YK +IS+C I I ++KG Sbjct: 120 KNKIKLRKLLVHFNNVDTLDYRNIQW-YHSFDRNNQTYKMLISICYLIFQGVIQTESKGQ 178 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 F +E+++S LY+KF+ E+ ++E T S ++W +D ++N+LP M +D Sbjct: 179 NDLMVF-VDEQQISRLYEKFILEYYKKEFPELVVTSSNIQWSLD--NDDNVNMLPVMRSD 235 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK---PENGENIGGLL 301 I +R +K LI+DAK+YK+ T+K HS NLYQ+ Y+ + + + + G+L Sbjct: 236 IMLRYKDKCLIIDAKFYKNTLHNYYDTKKIHSTNLYQIFTYVKNQQLNLKKKAIQVSGML 295 Query: 302 IYPHVD--TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 +Y D + ++ ++G I + T++L + I ++L I ++ Sbjct: 296 LYAKTDENIVLNDKFHMSGSQIIIKTLDLNCNFTIIKKQLNGIVND 341 >UniRef50_A6EGF0 5-methylcytosine-specific restriction enzyme McrBC, subunit McrC n=1 Tax=Pedobacter sp. BAL39 RepID=A6EGF0_9SPHI Length = 232 Score = 140 bits (354), Expect = 5e-32, Method: Compositional matrix adjust. Identities = 77/231 (33%), Positives = 131/231 (56%), Gaps = 8/231 (3%) Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 L+ +++EA+ +Y +L GI + ++PQ FS + +N +YKF + V + I + + Sbjct: 2 LSKALKNEAKGIYNQLTGIKDILISPQKFSLVTIHRNNIHYKFPLQVGQLITAQTAIEER 61 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDAS-SISDQSLNLLPR 240 G+Y F DF+RN +M+ L++ F+ F RE +R ++W + S S +L L+P+ Sbjct: 62 NGNYFFQDFDRNHHQMARLFESFVRRFYMREQKRFKVSRENIEWRINESESTGNLALIPK 121 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG-- 298 M+TDI++ S E+ +I+D K+Y S ++ R + K HS +LYQL +YL +L+ ++ G Sbjct: 122 MQTDISLISPERKIIIDTKFYLSAYNSRYDSPKLHSSHLYQLYSYLCNLEEQSLSRNGGA 181 Query: 299 -----GLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 G+L+YP A+ YKI I + T+NL W IH L+ + D Sbjct: 182 NKIYEGILLYPKNGIALDESYKIGSHRIKIYTINLEGPWQDIHDRLISLLD 232 >UniRef50_B0AA68 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0AA68_9CLOT Length = 350 Score = 131 bits (329), Expect = 5e-29, Method: Compositional matrix adjust. Identities = 95/342 (27%), Positives = 172/342 (50%), Gaps = 16/342 (4%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 I ++NIYYML+Y + L + +++ +N+ DIL +L K V + +RGL +Y Sbjct: 7 IFIKNIYYMLSYVYTDLIQKDYKDIDVEEFDNVGDILAVILFKVVSKQVKRGLIKEYKSE 66 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 + + G+I K+I+ N K FD L+ D N IIK+ + +L+ + ++S Sbjct: 67 EGELSVLTGKINIEKSIKLKANNKNKLYCEFDKLSMDNYLNSIIKTAMYVLVLSKDISSQ 126 Query: 126 IRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 + + L ++TL + ++ + ++ Y +I++C I+N+ + G Y Sbjct: 127 NKKNLKKLVLLFSNVNTLKVNEIRWNDIKYNRHNSNYSGIINICYLILNDLLMTTEDGEY 186 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDI 245 + +F +EK+M +Y+KF+ + ++ S S +KW+ + D+ LP M++DI Sbjct: 187 KVAEF-LSEKKMYSIYEKFVLFYYQKHYPSLRPKASKIKWNLDNELDK---FLPEMKSDI 242 Query: 246 TIRSSEKILIVDAKYYKSIFSRRMGT------EKFHSQNLYQLMNYLWSLKPENGENIGG 299 T+ S E ILI+D KYY S+ M T + HS NLYQ+ Y+ + + G Sbjct: 243 TLTSGENILIIDTKYY----SQSMQTIELYNSKTIHSNNLYQIFTYVKNKDINKNGKVSG 298 Query: 300 LLIYPHV--DTAVKHRYKINGFDIGLCTVNLGQEWPCIHQEL 339 +L+Y D Y ++G I + T++L +++ I Q L Sbjct: 299 MLLYAKTNEDIIPNSEYIMSGNKIMVRTLDLNKDFKFIAQSL 340 >UniRef50_Q1QFB3 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QFB3_NITHX Length = 361 Score = 128 bits (322), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 89/336 (26%), Positives = 154/336 (45%), Gaps = 10/336 (2%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 Q IP+RN++ +L YA G + Q L DILG +L + + R L Y Sbjct: 9 QTGIPIRNLWLLLVYASGLAEFESQCGAGTDDDIELADILGRLLVRLAKRRLRTNLSRGY 68 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 +IP ++GR+++ +T+ HL G+ F+ L+ DT NR+++ L I I Sbjct: 69 QRRQAVIPRVRGRVDWLQTLSRQHLQRGRLACRFEELSFDTPRNRLVRCAL-IAIAGRVR 127 Query: 123 NSTIRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + + R L L GI+ T Q S G + +F+IS + + +P + Sbjct: 128 DHAVAADCRRLGDDLGRLGIAASRPTQQEMSADTIGSHQSEDRFMISAARLVFEMLLPNE 187 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN----TTRSYLKWDASSISDQSLN 236 G + +R+E + +++K + F R L +A+ ++Y W + Sbjct: 188 TPGDMKLSRLKRDEITLRKIFEKAVTGFYRHHLRAADGWSVREQNYQSWQLEPGRSGDVG 247 Query: 237 LLPRMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN-- 293 LLP M+ DI + R ++ +++D K+ + +K S N+YQ+ YL S + Sbjct: 248 LLPGMKPDIILDRKQDRRIVIDTKFTSILAKGIADRDKLKSANIYQIYAYLHSQRGRGRL 307 Query: 294 GENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLG 329 + G+L+YP +D V + I G DI TV+L Sbjct: 308 CDRAEGVLLYPALDHDVDETFTIQGHDIRFVTVDLA 343 >UniRef50_B2JTN9 Putative uncharacterized protein n=1 Tax=Burkholderia phymatum STM815 RepID=B2JTN9_BURP8 Length = 363 Score = 128 bits (322), Expect = 3e-28, Method: Compositional matrix adjust. Identities = 91/351 (25%), Positives = 167/351 (47%), Gaps = 25/351 (7%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IP++N++++L+YA + + +E +++ ++LG +L V + +R L Y P Sbjct: 10 IPIKNLWFLLSYAHNLARFADRLPVEIGEQDDIPELLGRLLAFLVERRIKRNLTRAYQPR 69 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 + ++GRI+ KT+ L G+ + F+ L+ DT N++++ LA + ST Sbjct: 70 EARLTRVRGRIDLVKTLSAGELQQGRIICRFEELDADTPRNQLVRYALA------HIAST 123 Query: 126 IRDEARSLYRKLP---------GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNS 176 +RD+A L R+ G+S + + G+N +I V ++ Sbjct: 124 VRDQA--LERRCGLLASELGRLGVSFRRPSRSEMAREQIGRNDADDAALIVVSNLALDPR 181 Query: 177 IPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN---TTRSYLKWDASSISDQ 233 +P + G R +R+E+ + +++K + F EL + + L W +S + Sbjct: 182 LPSEESGDSRVARLQRDERLLPYIFEKAIAGFYMHELPNKEWRVRPQKVLAWPVASPTPG 241 Query: 234 SLNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWS--- 288 +LLP M+ DI I R + + ++VD K+ + + GT++F S LYQ+ YL S Sbjct: 242 LHDLLPGMQADIVIDSRITNRRVVVDTKFTDILTRNQFGTQRFKSNYLYQMFAYLRSQTG 301 Query: 289 LKPENGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQEL 339 ++ E GLL++P V V + + G + TV+LG E IH L Sbjct: 302 FGDKHAEEAEGLLLHPSVGLHVDESFFVQGHRMRFATVDLGGEIHSIHSAL 352 >UniRef50_D0VFC8 McrBC 5-methylcytosine restriction system component (Fragment) n=1 Tax=Streptococcus suis RepID=D0VFC8_STRSU Length = 346 Score = 119 bits (297), Expect = 2e-25, Method: Compositional matrix adjust. Identities = 82/316 (25%), Positives = 163/316 (51%), Gaps = 12/316 (3%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 N D+L ++ + +RGL Y TE + +KG+I ++++ + + V + Sbjct: 33 NTADLLAEIMIISLSIQVKRGLGRGYRSQTESLSALKGKINISESLTPPNWRRKQLVCQY 92 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 D + D+ NRIIK+++ IL+K + ++ + + R L +S ++L +++ L Sbjct: 93 DDFSLDSTMNRIIKASIEILLKAD-ISRDRKKKLRKLLVFFGEVSKINLHSINWN-LQYN 150 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 +N + Y ++S+C +VN I + +G+ + +F +E+ SLLY+KF+ + ++ Sbjct: 151 RNNQSY-LLMSICYLVVNGLIHTEREGNKKLMNF-LDERRESLLYEKFILGYYKKHYPQI 208 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHS 276 T S + W ++ D +LP M++DI ++ + ILI+DAKYY S R HS Sbjct: 209 QVTASQIPW---ALDDGFGEMLPIMQSDIYLKYKDTILIIDAKYYSSNTQIRFDKRTLHS 265 Query: 277 QNLYQLMNYLWSLK---PENGENIGGLLIYPHVDTAVK--HRYKINGFDIGLCTVNLGQE 331 NLYQ+ Y+ + + + + G+L+Y D ++ Y+++G I + ++L + Sbjct: 266 NNLYQIFTYVKNQAYRLSDTNDTVAGMLLYAKTDIDIQPNQVYQMHGNQISVKNLDLNLQ 325 Query: 332 WPCIHQELLDIFDEYL 347 + I ++L DI + Sbjct: 326 FASIAEQLDDIITSHF 341 >UniRef50_A1UB52 Putative uncharacterized protein n=2 Tax=Mycobacterium RepID=A1UB52_MYCSK Length = 361 Score = 105 bits (261), Expect = 3e-21, Method: Compositional matrix adjust. Identities = 88/348 (25%), Positives = 156/348 (44%), Gaps = 12/348 (3%) Query: 5 VIPVRNIYYMLTYAWGYLQE---IKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 IPVRN++ ++ YA Q ++ ++E P LLD++ VL V + +R L Sbjct: 13 AIPVRNLWLLMLYASRLYQRNHLLRNMDVEQNP-ERLLDLVAQVLVYAVERRLQRNLGRQ 71 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y + ++G+I+ T L G+ FD L+ D L NRII+S L +L + Sbjct: 72 YRERRATLARVRGQIDVLTTESKALLAQGRIACRFDELSVDNLRNRIIRSAL-VLAARDA 130 Query: 122 LNSTIRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG 179 + T++ AR++ G+S ++ + L +N I + I+ +IP Sbjct: 131 RDRTLQRTARNMADVFTQYGVSPQLVSVRESRQLVLDRNAHDDVEAIGAAQLILEMAIPA 190 Query: 180 QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA---NTTRSYLKWDASSISDQSLN 236 ++ G+ D ER+ E+ LY+ + F R L S+ + ++ W ++ + Sbjct: 191 ESAGNSTNRDPERDAAEIRRLYEAAVRGFYRSALPSSWSVSPGETHYHWPLVEATEGLKS 250 Query: 237 LLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL--KPENG 294 +LP M+ D + + + +IV+ K+ ++ + G K +++QL Y+ S E Sbjct: 251 ILPIMKADTVLETVGRRIIVETKFADALKPNQYGLPKLARNHVFQLYAYVQSQHGSDELS 310 Query: 295 ENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDI 342 G+L+YP V V I G TV+LG I LL + Sbjct: 311 ATAEGVLLYPVVGEHVDESASIQGHRYRFLTVDLGGPAESIRSSLLRV 358 >UniRef50_UPI0001BCC8FD 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Aeromicrobium marinum DSM 15272 RepID=UPI0001BCC8FD Length = 357 Score = 85.9 bits (211), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 89/356 (25%), Positives = 150/356 (42%), Gaps = 29/356 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYV---LNKGVLQLSRRGLEL 60 P IP++N++ + YA + + Q + A +N D+ +V L V Q GL + Sbjct: 3 PKIPIKNVWLLQLYASSLYRAVGQRLVAA--EDNPEDLPAFVAGMLADAVTQRLHTGLSV 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANR---IIKSTLAILI 117 + + + ++GRI+ T R L+ G+ TFD + DT ANR A L+ Sbjct: 61 GFQRTSRPLTRVRGRIDVLPTARHQLLSRGQVHCTFDEVVADTPANRLARAALWRAATLV 120 Query: 118 KHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYY---KFVISVCKFIVN 174 HE RSL +L P S + G R + +I+ +++ Sbjct: 121 PHEP-------RFRSLALQLEAAGVRGPCPP-LSRVPGLHRERLLVRDRQMIATADLLLS 172 Query: 175 NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN---TTRSYLKWDASSIS 231 +IP ++G + +E+ + +L+++ F R L S LKWD S +S Sbjct: 173 LAIPTTDEGGKLLPAPDMDERYLRVLFERACVGFFRLRLEPQGWKVNHNSPLKWDTSFMS 232 Query: 232 DQSLNLLPRMETDITI------RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY 285 +LP ME DI + + +++D K+ + G K S +YQ+ Y Sbjct: 233 SGMGAILPGMELDIELVHHDLTGPGRRRVVIDTKFTTITKMNQYGNLKLRSGYIYQIYAY 292 Query: 286 LWSLKP-ENGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELL 340 L S + E GL+++P V V I G I TV+L + + ++LL Sbjct: 293 LMSQEASETDPKSEGLMLHPVVGERVDEEVVIQGHRIRFATVDLAADSATLAEQLL 348 >UniRef50_A5WF57 McrBC 5-methylcytosine restriction system component-like protein n=4 Tax=Proteobacteria RepID=A5WF57_PSYWF Length = 363 Score = 84.0 bits (206), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 87/355 (24%), Positives = 157/355 (44%), Gaps = 23/355 (6%) Query: 6 IPVRNIYYMLTYAWGYLQEI--KQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 IP+RN++ ++ YA +E+ + +E P +++ D++ +L + V +R L Y Sbjct: 12 IPIRNLWLLMLYASDIYRELNKDRVAVEENP-DDIPDLIAEMLCQRVEHRIQRNLSYGYQ 70 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTL---AILIKHE 120 ++ ++GRI+ T R L+ GK FD L DT NR ++ L A +++ + Sbjct: 71 SREAVVSRVRGRIDLLNTERNRLLDRGKVACRFDELTIDTARNRYVRGALERIAKIVQRK 130 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 +L R L R G+S + + S ++ K +++V N ++P + Sbjct: 131 ELAHRCRSLDIRLRRM--GVSQVLPSRAELSVDRLSRHDAEDKPMLTVAHLAFNLALPTE 188 Query: 181 NKGHYRFYDFERNEKE-MSLLYQKFLYEFCRRELTSAN-----TTRSYLKWDASSISDQS 234 G ER + + L++K + F E+T +N T + W + S Sbjct: 189 VTGSKYLSRPEREDLPWLRKLFEKGVAGFY--EITLSNHGYKVTAGKRINWPVTDSSQGI 246 Query: 235 LNLLPRMETDITIRSSE--KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 +LP M+TDI I + + + I+D K+ + E S +YQ+ YL S + + Sbjct: 247 DKILPSMKTDIIIDNLDLGQRTIIDTKFNAVLTRGWYRHETLRSSYIYQMYAYLRS-QED 305 Query: 293 NGE----NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 +G+ N GL I+P V + I I TV+L I ++LL + Sbjct: 306 SGDFLDRNACGLFIHPSVGEDINEYMVIQDHKIQFATVDLAASTKEIRRQLLGLI 360 >UniRef50_D0D6B3 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Citreicella sp. SE45 RepID=D0D6B3_9RHOB Length = 359 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 76/335 (22%), Positives = 144/335 (42%), Gaps = 15/335 (4%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IP+RNI+ + YA +Q + + +L D++G ++ V RR L Y Sbjct: 5 IPLRNIWILFLYAADLVQLRGRFERDVERARDLPDLVGRLMVNVVEDRLRRNLSRGYRAQ 64 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 T I+P ++GRI+ T G + G+ F+ DT NR++++ L L T Sbjct: 65 TAILPRVQGRIDMLATEAGQLMERGQIACRFEEHVMDTPRNRLVRAALERLAARVFTPET 124 Query: 126 ---IRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 R A R G+S + + G+N + ++++ + + +IP + Sbjct: 125 AYRCRSLAADFSRA--GVSARRPSRTELAIDQMGRNEGADRMMVALAGMVFDGTIPTEKH 182 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY---LKWDASSISDQSLNLLP 239 G E E + L+++ + R L T + + W ++ ++LP Sbjct: 183 GTALQPGDETTEHLIRRLFERAVGNALRIALEPEGWTIAQGHRIAWPVGGKTEGLPSILP 242 Query: 240 RMETDIT---IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLW---SLKPEN 293 M+TDI I++S ++ ++D K+ + + + S LYQ+ YL S++ + Sbjct: 243 GMQTDIELSHIKTSRRV-VIDTKFTRILTASNYRGGILRSGYLYQMYAYLRTQESMEHPS 301 Query: 294 GENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNL 328 G+L++P V +V + G I T++L Sbjct: 302 SLTSEGILLHPQVGGSVDETMILQGHPISFRTIDL 336 >UniRef50_B7D079 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Burkholderia pseudomallei 576 RepID=B7D079_BURPS Length = 294 Score = 81.3 bits (199), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 71/273 (26%), Positives = 123/273 (45%), Gaps = 23/273 (8%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQA-------NLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 IPVRN++ ++ YA L IK+ +LE IP D++ +L V Q RR + Sbjct: 19 IPVRNLWLLMLYA-SDLTRIKEVFNALVEDDLEDIP-----DLVAKLLAHTVEQRLRRNV 72 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKST---LAI 115 Y + + ++GRI+ +T L+ G+ F+ L +T NR++++ LA Sbjct: 73 TRGYQHRAQSLTRVRGRIDILRTEAQQLLSRGEVYCRFEELTANTPRNRLVRAALDLLAS 132 Query: 116 LIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNN 175 L++ L R A +L R GI + + + G+N + + K + Sbjct: 133 LVRDRDLARQCRSLAAALGRS--GIVGVRPSRAELAQDQIGRNDHDDWLMAELAKLAFDL 190 Query: 176 SIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN---TTRSYLKWDASSISD 232 ++P + G ER + + L++K + F R EL + + W S+ SD Sbjct: 191 ALPTEEAGPTTLVSPERGDVYVRRLFEKAVLGFARVELERIGWRVRGGTCMNWQVSAASD 250 Query: 233 QSLNLLPRMETDITIR--SSEKILIVDAKYYKS 263 + +LP M TDI I S+ + L++D K+ S Sbjct: 251 GAAEILPGMITDIIIDDLSAGRRLVIDTKFTLS 283 >UniRef50_C0WJ24 5-methylcytosine-specific restriction enzyme subunit McrC n=3 Tax=Corynebacterium RepID=C0WJ24_9CORY Length = 373 Score = 70.9 bits (172), Expect = 7e-11, Method: Compositional matrix adjust. Identities = 74/307 (24%), Positives = 136/307 (44%), Gaps = 15/307 (4%) Query: 6 IPVRNIYYMLTYAWGYLQE--IKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 +P+R+++ + YA + I +++E G L +++G +L V + RR L + + Sbjct: 15 VPIRSVWMLQLYASQTFIDGHISNSSVEEA-GVELPELIGTMLCDAVERRFRRELSIGFT 73 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 I ++GRI +T R L G F+ L ++ NR ++ L H N Sbjct: 74 LTERNITRVRGRINMYETARHQLLEKGLIACEFNELTINSEINRFLRYALE-YAGHILSN 132 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ-NK 182 D A ++ + + + + + K ++V K ++ ++P + N Sbjct: 133 VGSGDAAHRCKILGQRLAQMGVPEPKTAAFPRARLSPADKKPVAVAKLLLELAVPTRGND 192 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN---TTRSYLKWDASSISDQSLNLLP 239 RF + E+ L++K L+ L+ ++ L W+ Q+ + LP Sbjct: 193 ALPRFSRKHFTQDELRKLFEKALFGLFHYHLSPFGWKVSSGKRLNWNV----QQAPSYLP 248 Query: 240 RMETDITIRSSE-KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK--PENGEN 296 M+TDI +RS E +I ++DAK+ R+G E S +LYQL YL S + E E Sbjct: 249 SMQTDIILRSPEGEITVIDAKFTHLFTENRVGNESIKSSHLYQLYAYLRSQETFSEEWET 308 Query: 297 IGGLLIY 303 G+++Y Sbjct: 309 AQGIMLY 315 >UniRef50_C4T585 McrBC 5-methylcytosine restriction system component n=1 Tax=Yersinia intermedia ATCC 29909 RepID=C4T585_YERIN Length = 367 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 78/350 (22%), Positives = 152/350 (43%), Gaps = 19/350 (5%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEAIPGN--NLLDILGYVLNKGVLQLSRRGLELDYNPN 65 +RN++ ++ YA +++ + ++ A+ N + D++ +L + R L + Y Sbjct: 1 MRNLWLLMLYASDLFRQLGRRHI-AVEDNPAEIPDLVATILLHEIALRRHRNLSMGYQTR 59 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL---IKHEKL 122 + ++GRI+ T L G+ F + DT NR ++ L L I L Sbjct: 60 HAALNRVRGRIDVLYTTSHQLLERGRVACHFQDMTLDTPRNRYVRCALERLTPIIAKPSL 119 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHF-SYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + A SL R+ GI+ + + S G++ K ++ + +P ++ Sbjct: 120 AADCHAMALSLRRE--GINGGYPDNRELPSVRRFGRHDAADKPMVDAAQLAFELLMPTED 177 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN---TTRSYLKWDASSISDQSLNLL 238 +G + N M L++K + F R L + LKW S S S + Sbjct: 178 QGQHLLPAPSDNLYWMRKLFEKGIAGFYRVHLAKTTWRISAGKELKWALSEQSAGSAEIF 237 Query: 239 PRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEK-FHSQNLYQLMNYLWSLKPENGE 295 P M++DI + + +++ +I+D K + +I ++ EK + +YQL YL + + + Sbjct: 238 PTMKSDIILEHKMAQQRIIIDTK-FNAILTKGWHREKSLRNSYIYQLYTYLRTQESQADP 296 Query: 296 ---NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDI 342 N GLL++P V + G I TV++ + I ++LL++ Sbjct: 297 LSLNAAGLLLHPAVGYMLNEYVVTQGHKIHFATVDMAVDAKTIKRQLLEL 346 >UniRef50_B2A7U4 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A7U4_NATTJ Length = 383 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 79/330 (23%), Positives = 150/330 (45%), Gaps = 34/330 (10%) Query: 3 QPVIPVRNIYYMLTYAWGYL----QEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 +P I N++ ML+Y++ + ++ + AN++ LLD L V V +L ++GL Sbjct: 67 EPKIDTANVFKMLSYSYDLIFWHDEKAQFANIQE-----LLDYLVLVFCNQVNRLIKKGL 121 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 DY + + KGR+ + + H K +D D L N+IIK T+ +L + Sbjct: 122 HADYVLVNDKLSYAKGRMNVRELVEKPWEKH-KIDCYYDNYQVDILENQIIKFTIDLLKR 180 Query: 119 HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIP 178 + + N+ IR + R +S +T + + ++YK + + CK + Sbjct: 181 YIQ-NNWIRRSLLNTNRYFDSVSLRPITVEDIDQVQYTTLNKHYKHIHNFCKMFLELMGI 239 Query: 179 GQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLL 238 + G F F EM+ LY+K++ + + EL + + I L+L Sbjct: 240 NEQIGETLFNQFHL---EMNNLYEKYVGKLLKEELPN----------NYCVILQDKLHLD 286 Query: 239 PRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 + I+IR + ++ D K Y I ++ G++ + ++YQ+ Y+ K + Sbjct: 287 EYDQ--ISIR-PDIVIYNDVKPYLVIDTKYKGSKDITNNDIYQMAAYMSKTKTD------ 337 Query: 299 GLLIYPHVDTAVKHRYKINGFDIGLCTVNL 328 G+L+YP + A + Y ING + + T++L Sbjct: 338 GVLLYPAQEVA-ETEYIINGRSLNIKTIDL 366 >UniRef50_B1BM79 Putative uncharacterized protein n=1 Tax=Clostridium perfringens C str. JGS1495 RepID=B1BM79_CLOPE Length = 425 Score = 60.5 bits (145), Expect = 9e-08, Method: Compositional matrix adjust. Identities = 67/279 (24%), Positives = 120/279 (43%), Gaps = 23/279 (8%) Query: 33 IPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKT 92 + N L+DI+G + + + + +RG+ +Y I IKG++ K + N K Sbjct: 109 LKNNPLIDIMGEIFYRDLSRELQRGIYSEYVSVENSIGNIKGKLLVTKHSKVNRFNKNKA 168 Query: 93 VSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSY 152 +D ED NRI+K L+ L++ E N ++ R L R +S + + Sbjct: 169 YCAYDEFTEDNFFNRILKKALSYLLR-EVRNERLKSNLRVLDRSFEEVSDKFINKHALNR 227 Query: 153 LNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRE 212 + +K + K I+N S+ +KG + EM+ LY++++ R Sbjct: 228 YKLNRRNERFKNSFELAKMILNGSMGDNSKGKEFGFTLLF---EMNYLYEEYIGVVLREV 284 Query: 213 LTSANTTRS------YLKWDASSISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIF 265 ++ N S YL ++ ++ + L P DI I E +I+D K+ K Sbjct: 285 ISEENIFVSTQEKTKYLLYNKKRKREE-IALKP----DIVIYKDETPKIIIDTKWKK--- 336 Query: 266 SRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 R G E + ++YQ+ Y+ S K E +++YP Sbjct: 337 GSRNGKENYSQGDVYQMYAYITSYK----ECEKCVILYP 371 >UniRef50_C7NQX2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQX2_HALUD Length = 421 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 70/333 (21%), Positives = 131/333 (39%), Gaps = 46/333 (13%) Query: 4 PVIPVR------NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 P I VR N+ Y+L YA ++ G+ LD G + + ++ RG Sbjct: 79 PTIEVRPKAAGTNLLYLLQYAHDTTATTFESQAPYQAGHTFLDAFGALYEAELRRIVDRG 138 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL- 116 L DY ++GR++ + ++ T+D L D LANR I +L Sbjct: 139 LYTDYRRTDATESHLRGRLDIHRQLQRQPPVPTAFECTYDELTHDILANRAILHATTVLL 198 Query: 117 --IKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 + + ++R + L R+ +S +T Q + + +Y+ ++ + K ++ Sbjct: 199 GAVSDRSITQSLRQHQQLLRRQ---VSLTPVTIQDIERIELNRLADHYEDILRLTKLVIR 255 Query: 175 NSIPGQ-NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQ 233 NS + G + N M+ +++ + C+ L+ W+ D Sbjct: 256 NSFVSELQAGSSAAFAMLVN---MNTIFENAVERACKEVLSERE------DWEV-KFQDT 305 Query: 234 SLNLLP------RMETDITIRSSEKI--LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY 285 S NL+ ++ DITI E L+ DAK+ E+ + + YQ+ +Y Sbjct: 306 SQNLITGGKHTVTLQPDITIYDPENTVSLVADAKWKN---------ERPKNADFYQMTSY 356 Query: 286 LWSLKPENGENIGGLLIYPHVDTAVKHRYKING 318 + + N+ G+L YP + R + G Sbjct: 357 MLA------NNVPGILFYPDCGGLNESRSTVTG 383 >UniRef50_Q9RZI4 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RZI4_DEIRA Length = 442 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 69/317 (21%), Positives = 135/317 (42%), Gaps = 45/317 (14%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRG----FHLNHGKTVSTF 96 ++ Y L +GV RRG+ Y P E PG++GR++ + +R HL H T+ Sbjct: 144 VIRYAL-EGVRAAVRRGIPHAYVPVQEERPGLRGRLDLPRQVRQPPHRAHLLH----VTY 198 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 D D R+ + ++ L +L + R AR L L + F+ G Sbjct: 199 DEFLPDRPETRLTRLSVERLAALTRLPANQR-LARELLHALDEVPPSRNVNVDFAAWRLG 257 Query: 157 KNTRYYKFVISVCKFI---VNNSIPG-QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRE 212 + ++ + ++C+ + +N + G + + H +D M+++Y+ ++ + R Sbjct: 258 RGHTHFAPLEALCRMVLYELNPIVAGKKTQAHALLFD-------MNVVYEAYVAQLLR-- 308 Query: 213 LTSANTTRSYLKWD-ASSISDQSL---NLLP--RMETDITIRSSEKILIVDAKYYKSIFS 266 R Y W A+ ++ ++L + LP R+ D+ +R+ E +IV +K + + Sbjct: 309 -------RLYPTWTVATQVTQRALGDADGLPAFRLRPDLLLRTEEGQVIVADTKWKRLEA 361 Query: 267 RRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL-LIYPHVDTAVKHRYKI---NGFDIG 322 + T + + YQ++ Y + + L LIYP + I G + Sbjct: 362 DKAPTYDVANADAYQMLAYSEAFQHSAAYTHKALWLIYPRLPGLPPVSAPIRLGQGRTLS 421 Query: 323 LCTVNL-----GQEWPC 334 + T++L G +WP Sbjct: 422 IVTIDLNQADPGAQWPA 438 >UniRef50_UPI0001BCB4AE McrBC 5-methylcytosine restriction system component n=1 Tax=Fusobacterium periodonticum ATCC 33693 RepID=UPI0001BCB4AE Length = 444 Score = 57.8 bits (138), Expect = 6e-07, Method: Compositional matrix adjust. Identities = 77/341 (22%), Positives = 153/341 (44%), Gaps = 34/341 (9%) Query: 32 AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGK 91 AI ++L+I + K V ++ +GL Y E I KG+++ I+ + K Sbjct: 103 AIADTSILEIFINLFIKEVEEIIEKGLLYRYIGRNENISVFKGKLDINNHIKYNFSHKEK 162 Query: 92 TVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFS 151 FD + ++L N IIK T+ L K +N +++ + I L + ++ Sbjct: 163 FFMKFDEFSINSLENSIIKLTIQKL-KKISVNLKNKEKLNKISHHFENIIILPNSIENLK 221 Query: 152 YLNGGKNTRYYKFVISVCKFIVNNS---IPGQNKGHYR---------FYDFERNEKEMSL 199 Y+ + YYK I K +NN I G F ++ N K +++ Sbjct: 222 YITFDRTNDYYKNSIQWSKIFLNNQSSLIFSATNGEVATMLFPMETIFENYIAN-KLINI 280 Query: 200 LYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI--RSSEKILIVD 257 + +KF +L S + ++++D LN + ++ DI I ++S++I I+D Sbjct: 281 VKEKFY-----NQLIVKVQDDSCSAFSTATLNDTKLNNMFNVKPDIVIKNKNSKEIFILD 335 Query: 258 AKYYKSIFSRRMGTEKFHSQNLYQLMNY--LWSLKPENGENI-GGLLIYPHVD------- 307 K+ I + K + ++YQ+++Y +++ + +N LIYP + Sbjct: 336 TKW--KILDKLDNKFKISTDDIYQMLSYVKIYNDRYKNSYTCEKAYLIYPATNIRKNSFS 393 Query: 308 TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 + K ++K + F++ +C VNL E ++L++I +++K Sbjct: 394 SEDKIKFKTDNFELNICFVNLSSE-ETTEKDLVNILSKFIK 433 >UniRef50_Q26DA3 Putative uncharacterized protein n=2 Tax=Flavobacteria RepID=Q26DA3_9BACT Length = 416 Score = 52.8 bits (125), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 51/226 (22%), Positives = 104/226 (46%), Gaps = 15/226 (6%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 NLL++ + + + L R+GL Y T+ +KG++EFA I+ ++ + +T Sbjct: 110 NLLEVYFELYLQELESLVRKGLIKQYRKQTKNTKALKGKLEFAGHIKSNIVHKERFYTTH 169 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 + + + ++++ LAI+ + S ++D A + P + +T +H + ++ Sbjct: 170 QVYDSNHFLHQVLSKALAIVGQFTN-GSRLQDLASRVQLNFPEVDNKAITAKHLNEMSLN 228 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQ-KFLYE-FCRRELT 214 + T YK + + + I+ N P + G EK +SLL+ L+E + ++L Sbjct: 229 RKTLSYKNALELARLIILNYSPDISSG---------KEKMLSLLFDMNELWETYILKQLQ 279 Query: 215 SANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKY 260 A+ + + S +S + DI +R S K I+D K+ Sbjct: 280 KASIG---FEIEVSGQESKSFWANNSLRPDIVLRKSGKTYIIDTKW 322 >UniRef50_A6VF50 Putative uncharacterized protein n=2 Tax=Methanococcus RepID=A6VF50_METM7 Length = 416 Score = 50.8 bits (120), Expect = 7e-05, Method: Compositional matrix adjust. Identities = 58/259 (22%), Positives = 113/259 (43%), Gaps = 17/259 (6%) Query: 53 LSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKST 112 L +RGL+ DY + + +KG+++F + I+ ++ + FD +D NRIIKST Sbjct: 145 LIKRGLKSDYIETQKNLNVLKGKLKFKEHIKHNLIHKERFFVEFDEFIKDMAENRIIKST 204 Query: 113 LAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFI 172 L L K K +++ + + IS + F+ G+ Y+ ++ C+ Sbjct: 205 LKELSKRSKSGKNLKNISEYSFV-FDEISESKNIEKDFNACKSGRLMVDYENILLWCRVF 263 Query: 173 VNNSIPGQNKGHYRFYDFERNEKEMSLLY--QKFLYEFCRRELTSANTTRSYLKWDASS- 229 + N F +F+ + +LLY +K + +L + SY+K S Sbjct: 264 LKNE---------SFINFKGSNVAFALLYPMEKIFESYLTYKLKKSGKF-SYVKAQDSRF 313 Query: 230 -ISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWS 288 + + L + +++ DI + I ++DAK+ I ++YQL++Y Sbjct: 314 FLVKEDLKKMFKLKPDIYAEKDDTIYLIDAKW--KILDVNSPNYGISQGDMYQLLSYAKI 371 Query: 289 LKPENGENIGGLLIYPHVD 307 + +++ L+YP D Sbjct: 372 YENNCKKHVKMALVYPKTD 390 >UniRef50_A6EMU6 5-Methylcytosine-specific restriction enzyme C n=1 Tax=unidentified eubacterium SCB49 RepID=A6EMU6_9BACT Length = 414 Score = 50.8 bits (120), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 39/157 (24%), Positives = 75/157 (47%), Gaps = 4/157 (2%) Query: 27 QANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFH 86 Q N ++I +LLDI + V +L RRGL Y ++ + +KG++EFA+ + Sbjct: 109 QVNKQSI---HLLDIYFDWFLREVQELCRRGLIKKYYKESKNVKSLKGKLEFAQHLNKNL 165 Query: 87 LNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLT 146 ++ + ++ + N+D ++II L ++ K + + + + + P +ST+ Sbjct: 166 IHKERFYTSHQIYNKDHKLHQIINQALEVIELVSK-GTYLYSKCKEVRLNFPEVSTIKCN 224 Query: 147 PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 FS LN + YK I + + I+ N P + G Sbjct: 225 ESTFSKLNFNRKNSPYKTTIEIARLIILNFAPNVSTG 261 >UniRef50_D2QXZ3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QXZ3_9PLAN Length = 435 Score = 50.4 bits (119), Expect = 9e-05, Method: Compositional matrix adjust. Identities = 71/301 (23%), Positives = 123/301 (40%), Gaps = 41/301 (13%) Query: 32 AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRG------- 84 A G NLLD++ + + ++ R GL DY E +P ++GR+ F + IR Sbjct: 106 AAGGGNLLDLIMLMFVEECERILRGGLLSDYVEEEEELPVVRGRMLFDRQIRKRLGRLDL 165 Query: 85 FHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLH 144 H H D +D N+++ L + +H T+R +AR L L G Sbjct: 166 IHCRH-------DERKQDVPENQLLAYVLDVCARH-AFQPTLRRKARQLEHHLLGSCDPS 217 Query: 145 LTPQHFSYLNGG----KNTRYYKFVISVCKFIVNNSIPGQ--NKGHYRFYDFERNEKEMS 198 L GG + +Y+ +C IV+ + G R + F +M+ Sbjct: 218 LL--DLVTTRGGIYYDRMNEHYRDAHELCWLIVDALGISDIYSSGSSRIFAF---LLDMN 272 Query: 199 LLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLN-----LLPRMETDITIRSSEKI 253 L++ F+ + R+ ++ SY D S I D + N ++P + I S Sbjct: 273 RLFEVFVLQVLRQLTSTTALKVSYQSSDRSIIRDSATNQPYSSVIP--DFLIAAPSLSGK 330 Query: 254 LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL-KPENGENIGGLLIYPHVDTAVKH 312 L++DAKY + + ++YQ Y ++ + + G ++ G LIYP T Sbjct: 331 LVLDAKY------KLYDAAGVSNSDIYQSFFYAYAFGRHQLGGHVAG-LIYPSESTTASR 383 Query: 313 R 313 + Sbjct: 384 K 384 >UniRef50_C5CG01 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CG01_KOSOT Length = 397 Score = 50.1 bits (118), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 65/273 (23%), Positives = 119/273 (43%), Gaps = 39/273 (14%) Query: 4 PVIPVRNIYYMLTYAWG-----YLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 P + ++N+ +ML YA+ +++ I+ + E I L + L V K V+ ++RGL Sbjct: 74 PRVELKNLLHMLEYAYNLKSFQFIEGIQ--DCETI--EELYERLVKVFVKRVIDRTKRGL 129 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 +Y + +P ++G++ I+ K + D N+II TL+ L+ Sbjct: 130 YREYIQQNDRLPYVRGKLNVRSMIK--QPWKVKLDCVYQDHTNDIEENQIILWTLSKLVM 187 Query: 119 HEKLNSTIRDEARSLYRKLPGISTLHLTP--------QHFSYLNGGKNTRYYKFVISVCK 170 + ++ R+ R YR L G T+ + P + ++ LN Y + + K Sbjct: 188 SDSISEGTRNLVRKAYRSLAG--TIKVRPFKPSECIKRFYNRLNSD-----YLPIHVLAK 240 Query: 171 FIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSI 230 F + NS P G + F N M L++KF+ E+ ++ L R+ K + Sbjct: 241 FFLENSGPAVKSGTSKMIPFLVN---MPRLFEKFIAEWLKKNL-KGYIVRAQEKVNLDKE 296 Query: 231 SDQSLNLLPRMETDITIRS---SEKILIVDAKY 260 + S N+ D+ I S E + ++D KY Sbjct: 297 NSLSFNI------DLVIYSGLTGEAVAVLDTKY 323 >UniRef50_Q6LZ73 Putative uncharacterized protein n=1 Tax=Methanococcus maripaludis RepID=Q6LZ73_METMP Length = 426 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 56/238 (23%), Positives = 108/238 (45%), Gaps = 18/238 (7%) Query: 53 LSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKST 112 L + GL+ DY + + +KG+I+F + I +++ + +D + NRIIKST Sbjct: 144 LVKNGLKSDYISIEDNLNVLKGKIKFNEHISKNYIHKERFYVNYDEFIRNRPENRIIKST 203 Query: 113 LAILIKHEKLNSTIR--DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCK 170 + L+K+ LNS ++ +E + +P L + F+ + YK +I CK Sbjct: 204 IKYLLKNSSLNSNLKRINEFLFIMDAIPESKNLE---KDFAACVNNRLMTDYKKIIPWCK 260 Query: 171 FIVNNSIPGQNKGHYRFYDFERNEKEMSLLY--QKFLYEFCRRELTSANTTRSYL-KWDA 227 + N F +F+ +E +LLY +K + E + + + + + Sbjct: 261 VFLKNE---------SFTNFKGDEIAYALLYPMEKIFESYLTEEFKKSGKFETIVSQGNG 311 Query: 228 SSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY 285 ++ + R++ DI +S K I+DAK +K + S + ++YQL++Y Sbjct: 312 YFLAKHKNEGIFRLKPDIYAETSSKKYIMDAK-WKILNSDKNKNYGISQNDMYQLLSY 368 >UniRef50_C7PTE7 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTE7_CHIPD Length = 423 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 54/243 (22%), Positives = 112/243 (46%), Gaps = 15/243 (6%) Query: 22 LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKT 81 + +++ANL + N++LDI + + V L RRGL Y +T + +KG+++F K Sbjct: 105 IDHVEKANLN-LRSNSILDIYIRLFLEEVEVLLRRGLIKKYKRHTANLTTLKGKLDFGKH 163 Query: 82 IRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGIS 141 I ++ + + + + L N+II L LI N+++ D+ + P + Sbjct: 164 ISANLIHKERFFVEHTIYSHENLFNQIINEVLK-LIPLLVSNTSLNDKLGRIRLDFPELP 222 Query: 142 TLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYD--FERNEKEMSL 199 + +T F + + + Y+ + + + ++ N P G F+ NE Sbjct: 223 AIKVTAATFDKIQYDRKSSTYQPALEIARLLLLNYRPDITGGTNNVIAILFDMNE----- 277 Query: 200 LYQKFLYEFCRRELTSANTTRSYLKWDASS-ISDQSLNLLPR-METDITIRSSEKILIVD 257 L++++++ R+L N+ +K S ++ L P+ + DI +R EK +++D Sbjct: 278 LWEEYIF----RKLQRLNSEGIEVKRQQSQHFWKRNGALYPKSVRPDIVLRKGEKTIVLD 333 Query: 258 AKY 260 K+ Sbjct: 334 TKW 336 >UniRef50_Q10YL1 McrBC 5-methylcytosine restriction system component-like n=2 Tax=Oscillatoriales RepID=Q10YL1_TRIEI Length = 405 Score = 47.4 bits (111), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 60/274 (21%), Positives = 114/274 (41%), Gaps = 55/274 (20%) Query: 3 QPVIPVRNIYYMLTYAWG-----YLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 +P +P+ N++ ML YA+ +L + N N L++IL + +L+ R+G Sbjct: 76 KPKVPLHNLFGMLEYAYNLRSFCFLDGLVNCNSLQEFYNCLVNILA----QKILERGRKG 131 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVST-FDMLNEDTLANRIIKSTLAIL 116 Y P TE + I+GR+ + + H G ++ + + N+I+ TL I+ Sbjct: 132 FHRAYLPKTENLTYIRGRLNMRQVM---HKPWGVSLKCDYQEHTANIPDNQILAWTLFII 188 Query: 117 IKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRY------YKFVISVCK 170 + + + + L G+ TL Q F + N +Y Y+ + +C+ Sbjct: 189 SRSSFCSEKVAVTVTRAFHILQGLVTL----QPFKS-SDCLNIKYHRLNEDYQVLHGLCR 243 Query: 171 FIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSI 230 F ++N +G+Y F +M+ LY+KF+ ++ + L+S Sbjct: 244 FFLDNIGASHQQGNYSMLPFL---IDMAKLYEKFVAKWLKLHLSS--------------- 285 Query: 231 SDQSLNLLPRMETDITIRSSEKILIVDAKYYKSI 264 ++ ++ EK+ IVD K Y I Sbjct: 286 -------------NLRVKEQEKVEIVDDKIYCKI 306 >UniRef50_B4V8L4 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V8L4_9ACTO Length = 404 Score = 47.4 bits (111), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 64/314 (20%), Positives = 125/314 (39%), Gaps = 23/314 (7%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +P+ ++++L Y+ + ++ +L L + + + + R+GL Y Sbjct: 73 PKVPIARLFFLLGYSLDPKGGWRGGQVDVGEHREVLPALAHAVERQTDRALRQGLLQGYR 132 Query: 64 PNTEIIPGIKGRIEFAKTIR---GFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 E ++GRI A IR G L +D D NRI+++ + L++ Sbjct: 133 ATEESALVVRGRIREADQIRRRFGVVL---PVEVAYDEYTTDIAENRILRAAVERLLRLP 189 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + +R +L ++ L S+ NTR Y+ + + + +++ S P Sbjct: 190 GVPREVRRRLLHQRARLADVTPLVPGQPLPSWQPSRLNTR-YQPALHLARAVLDGSSPEH 248 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G R F +M+ L++ F+ R L ++ T + D + ++ R Sbjct: 249 VPGGLRIDGF---LFDMNRLFEDFVTVALREALRGSDLTGAL--QDPHHLDEEDAI---R 300 Query: 241 METDITIRSSEKI--LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 M+ D + + + DAKY +R G + +LYQ++ Y +L G + Sbjct: 301 MKPDFVLYGPDGAPRAVADAKYKA---EKRDG---YPDADLYQMLAYCTALGLPKGHLVY 354 Query: 299 GLLIYPHVDTAVKH 312 PH V+H Sbjct: 355 AKGNAPHAAHRVRH 368 >UniRef50_Q4C3F9 Similar to McrBC 5-methylcytosine restriction system component n=4 Tax=Chroococcales RepID=Q4C3F9_CROWT Length = 294 Score = 47.4 bits (111), Expect = 8e-04, Method: Compositional matrix adjust. Identities = 55/262 (20%), Positives = 108/262 (41%), Gaps = 20/262 (7%) Query: 42 LGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNE 101 L + +L ++ L Y ++ + +KG+I+ I+ H +D + Sbjct: 11 LATIFAHRILNRIQKELYSTYIKQSQELNYVKGKIDIKTMIK--HPWKPTLTCQYDNFTQ 68 Query: 102 DTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLH-LTPQHFSYLNGGKNTR 160 D N+I+ T+ I+ + + + R R +Y L G TL +T + Sbjct: 69 DIEDNQILLWTIYIISRQQICQANTRILIRKVYHALQGYVTLSPVTANDCINRKCNRLNE 128 Query: 161 YYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTR 220 Y + S+C+F + N+IP +KG+Y F +M+ LY+KF+ + + L + Sbjct: 129 DYHGLHSLCRFFLENTIPSHDKGNYNSLPFLV---DMNQLYEKFVAAWLIQHLPPHLGIK 185 Query: 221 SYLKWDASSISDQSLNLLPRMETDITI---RSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 + + + S S + D+ I + E + ++D KY I + E+ + Sbjct: 186 TQHRVEYDSFS---------FKIDLIIYNKETQENLYVLDTKYKTKI--KTSDIEQIIAY 234 Query: 278 NLYQLMNYLWSLKPENGENIGG 299 Q N+ ++P N + I Sbjct: 235 TFQQNCNHAIIIEPTNNKPINA 256 >UniRef50_C0GTU8 Putative uncharacterized protein n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTU8_9DELT Length = 424 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 40/146 (27%), Positives = 62/146 (42%), Gaps = 1/146 (0%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LLDI V QL+ GL Y N + + +KGR+ F K I + + + Sbjct: 113 LLDIFFRSFLSEVEQLAHHGLVRKYRKNQDNLTTLKGRLLFQKQITLNLVRRERFYTEHV 172 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + N+I+ + L ILI N + +AR+L I ++ FS LN + Sbjct: 173 HYERNNPFNQILGTALDILILTSS-NPHLSAQARNLALSFEDIDRINAAEVTFSRLNYTR 231 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKG 183 NT Y+ I + + I+ N P G Sbjct: 232 NTERYRRAIQLARLIILNYCPDVRSG 257 >UniRef50_Q139N0 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q139N0_RHOPS Length = 434 Score = 45.4 bits (106), Expect = 0.003, Method: Compositional matrix adjust. Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 7/122 (5%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQAN----LEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 +P PV N+ ++ + L I A+ + G ++L+ L L + ++ RGL Sbjct: 77 RPKFPVSNLARVIDTSKRQLNSIPGADRSYLANDLSGGSVLNFLAANLVDALRPIAARGL 136 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLN--HGKTVSTFDMLNEDTLANRIIKSTLAIL 116 +Y+ +E +GRIE A T+RG+ H FD D NRI+K+ L + Sbjct: 137 HKEYSCRSETTSHPRGRIEIAGTMRGWSRGQFHKVQAQRFDQ-TSDLPVNRILKAALESV 195 Query: 117 IK 118 +K Sbjct: 196 LK 197 >UniRef50_A4TAT5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4TAT5_MYCGI Length = 446 Score = 45.1 bits (105), Expect = 0.004, Method: Compositional matrix adjust. Identities = 76/313 (24%), Positives = 132/313 (42%), Gaps = 37/313 (11%) Query: 32 AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRI----EFAKTIRGFHL 87 A+ G++L +L VL L R GL DY P + + ++GR+ +F K H Sbjct: 104 AVSGDDLFQLLVRVLVGESKLLIRDGLLRDYRPTEDTLAVMRGRLRMRDQFLKRYGSLH- 162 Query: 88 NHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGIST-LHLT 146 + FD + D N+++ ++L H + S +R+E R L + I Sbjct: 163 ---RLECNFDEYDGDIAENQLLAASLTAAASHVR-ASALRNETRMLAGVIGDICQPPTFD 218 Query: 147 P----QHFSYLNGGKNTRY---YKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSL 199 P Q Y G +N+RY +K + V + + N + ++ + M++ Sbjct: 219 PDWYEQRIHY--GRRNSRYEGAHKAALLVLRGLALNDLHSASRQGVNAFMV-----NMNV 271 Query: 200 LYQKFLYEFCRRELTSANTTRSYLKWDASSI-SDQSLNLL---PRMETDITIRSSEKILI 255 ++++F+ + L S RS + +I D+S N R + IT +S + + Sbjct: 272 IFERFVSALVDQAL-SGTGLRSTPQLSIRAIVVDESTNRTYSNIRPDLVITEVNSARSVP 330 Query: 256 VDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYK 315 VD KY + T KF S ++YQL Y ++L E + G +IY T + Sbjct: 331 VDIKY------KLYDTVKFSSADVYQLFTYAYALGAGAEEKMAG-VIYASTTTTSGPALR 383 Query: 316 INGFDIGLCTVNL 328 I G + G+ L Sbjct: 384 IKG-NTGIAAARL 395 >UniRef50_B5IHH3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IHH3_9EURY Length = 460 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 49/188 (26%), Positives = 90/188 (47%), Gaps = 12/188 (6%) Query: 8 VRNIYYMLTYAWGY-LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 ++N+ ML AW ++++ ++L+ I N++ ++L + + +L + GL +Y + Sbjct: 109 IKNLVKMLQIAWNLPIRDVDISSLK-IGENSIFEVLLTIYSIKLLDAIKEGLYKEYIRVS 167 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIK--STLAILIKHEKLNS 124 + + +KG+I+FAK R + H V+ D N D L NR +K + LA L +N Sbjct: 168 DDLHYVKGQIDFAKYSRRWERRHIIPVNYNDR-NPDNLINRTLKYAAYLASLYTRNSMNF 226 Query: 125 TIRDEARSLYR--KLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + A +L L +S + F+ LN G YK +I++ + I+ N P Sbjct: 227 SNLKMAENLMDSVSLVPVSASEIDSITFTRLNEG-----YKPLINLARVIITNLSPEFTG 281 Query: 183 GHYRFYDF 190 G + F Sbjct: 282 GKKDVFAF 289 >UniRef50_B7UQU6 McrC family protein, predicted McrBC 5-methylcytosine restriction system component n=24 Tax=Enterobacteriaceae RepID=B7UQU6_ECO27 Length = 436 Score = 44.3 bits (103), Expect = 0.007, Method: Compositional matrix adjust. Identities = 59/278 (21%), Positives = 121/278 (43%), Gaps = 27/278 (9%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 L+DI + V ++ +RGL+ DY + +P +KGR+ + + + + +D Sbjct: 127 LMDIFIQQFIESVRKIVQRGLKRDYLRQEDNLPWMKGRLRISAQLSKNCIRRDRFQVEYD 186 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + + L NRI+K+ + I + N + + L I+++H F L+ + Sbjct: 187 EYSVNRLENRILKTAIN-KISRQTSNPQLLQQITQLQFHFENITSVHDAYIAFEQLHFDR 245 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 +Y+ ++ K I+ P G + M +++ F+ + R Sbjct: 246 QMHHYEQALAWAKMILLGDSPHCMYGDVNAFSLLF---PMEAVFESFVTTWMR------- 295 Query: 218 TTRSYLKWDASS-ISDQSL-----NLLPRMETDITIR----SSEKILIVDAKYYKSIFSR 267 R Y KW + +S ++L L ++ DI +R ++ ++ D K +K + R Sbjct: 296 -YRYYDKWRVDAQVSSKNLISYNGKALFKLRPDICLRPRKSTTGSVITCDVK-WKIVNGR 353 Query: 268 RMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 + E+ + +LYQ++ Y + + G+ I LIYP+ Sbjct: 354 KDSLEQSQA-DLYQMLAYGLNYQEGEGDMI---LIYPY 387 >UniRef50_A4J0H3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J0H3_DESRM Length = 334 Score = 40.8 bits (94), Expect = 0.072, Method: Compositional matrix adjust. Identities = 50/216 (23%), Positives = 92/216 (42%), Gaps = 21/216 (9%) Query: 4 PVIPVRNIYYMLTYAWGYLQ-EIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + + NI+ ML YA+ I +E + L +L +L +R+GL +Y Sbjct: 11 PRVKLSNIFTMLEYAYRLKSFRILDGMVECDSLQEFYERLAMILAGMILNRNRQGLYREY 70 Query: 63 NPNTEIIPGIKGRIEFAKTI-----RGFHLNHGKTVSTFDMLNEDTLA---NRIIKSTLA 114 + +P I+G++ + GF ++ + T D+ + L NRI+ S L Sbjct: 71 REQVDKLPYIRGQLNIRHQLVKPWEVGFSCHYQE--HTADIEDNQILTWTLNRILYSGLC 128 Query: 115 ILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 ++ I+ RSL L S + L P FS + + Y+ + ++C+F + Sbjct: 129 ----SDRGLPVIKKAYRSL---LSQTSLIPLDPGRFSSRVYSRLNQDYRPLHALCRFFLE 181 Query: 175 NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCR 210 PG G + F +M L++ F+ ++ R Sbjct: 182 QCGPGYEVGDHSMIPF---LVDMPRLFELFVAQWLR 214 >UniRef50_D2B1B6 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2B1B6_STRRD Length = 431 Score = 40.8 bits (94), Expect = 0.081, Method: Compositional matrix adjust. Identities = 29/107 (27%), Positives = 46/107 (42%), Gaps = 2/107 (1%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIP--GNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 + ML YA G L +P G++L D++ +L L R GL DY E Sbjct: 77 VLRMLDYASGLPALRHMDRLRNLPNQGHDLRDLICLLLTVECEALVRHGLRRDYIRRQET 136 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAI 115 +P I+GR+ + + + FD + D L NR+ + L + Sbjct: 137 LPAIRGRLLADQQVLRRFGRLDRLECRFDEFDSDILDNRLCAAALRV 183 >UniRef50_C5EXA5 Putative uncharacterized protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EXA5_9HELI Length = 552 Score = 40.4 bits (93), Expect = 0.086, Method: Compositional matrix adjust. Identities = 50/246 (20%), Positives = 105/246 (42%), Gaps = 24/246 (9%) Query: 25 IKQANLEAIPGNNL--LDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTI 82 KQ+++ ++ NL L++ + + +L RGL+ DY + +KG++ F + I Sbjct: 181 FKQSHIASLQSLNLPLLEVFIQMFLAELERLIHRGLKSDYREIAQNRVFLKGKLLFNEQI 240 Query: 83 RGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGIST 142 + ++ + + D + ++ NR+IK TL L + L+ R + SLY I+ Sbjct: 241 KHNLIHKERFFTQSDEYSLNSAPNRLIKCTLEFL-RTLSLSPKTRTKLDSLYFIFEEITP 299 Query: 143 LHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN----NSIPGQNKGHYRFYDFERNEKEMS 198 + F+ + + Y+ V+ C + ++ G + + ER Sbjct: 300 SSHIDRDFAKCKSMRRFKEYELVLLWCAIFLQQKSFSAYSGSERAFALLFPMER------ 353 Query: 199 LLYQKFLYEFCRRELT----SANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKIL 254 L++ F+ + R + R Y D + + +++ D+ +RS +IL Sbjct: 354 -LFESFVGHWLGRSIEHHEIKLQEQRYYFMQDFQKVD------IFQLKPDVIMRSESEIL 406 Query: 255 IVDAKY 260 I+D K+ Sbjct: 407 ILDTKW 412 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P15006 Protein mcrC n=9 Tax=Escherichia RepID=MCRC_ECOLI 403 e-111 UniRef50_C1PCD8 McrBC 5-methylcytosine restriction system compon... 366 e-100 UniRef50_UPI0001C37412 5-methylcytosine-specific restriction enz... 363 7e-99 UniRef50_B2GJY9 Putative McrC protein n=3 Tax=Actinomycetales Re... 358 1e-97 UniRef50_A6BZ07 McrC protein n=1 Tax=Planctomyces maris DSM 8797... 351 2e-95 UniRef50_C8W851 McrBC 5-methylcytosine restriction system compon... 347 4e-94 UniRef50_UPI000197B905 hypothetical protein BACCOPRO_00614 n=1 T... 345 1e-93 UniRef50_B2JTN9 Putative uncharacterized protein n=1 Tax=Burkhol... 333 5e-90 UniRef50_Q1QFB3 Putative uncharacterized protein n=1 Tax=Nitroba... 331 3e-89 UniRef50_D0D6B3 5-methylcytosine-specific restriction enzyme sub... 324 2e-87 UniRef50_Q1IK47 McrC protein n=1 Tax=Candidatus Koribacter versa... 323 8e-87 UniRef50_A1UB52 Putative uncharacterized protein n=2 Tax=Mycobac... 322 1e-86 UniRef50_A5WF57 McrBC 5-methylcytosine restriction system compon... 319 9e-86 UniRef50_C4T585 McrBC 5-methylcytosine restriction system compon... 318 2e-85 UniRef50_C2G7A3 5-methylcytosine-specific restriction enzyme sub... 315 2e-84 UniRef50_B0AA68 Putative uncharacterized protein n=1 Tax=Clostri... 313 6e-84 UniRef50_D1PB97 Putative uncharacterized protein n=1 Tax=Prevote... 309 8e-83 UniRef50_UPI0001BCC8FD 5-methylcytosine-specific restriction enz... 307 4e-82 UniRef50_C1QDS7 McrBC 5-methylcytosine restriction system compon... 296 8e-79 UniRef50_D0VFC8 McrBC 5-methylcytosine restriction system compon... 291 3e-77 UniRef50_C0WJ24 5-methylcytosine-specific restriction enzyme sub... 276 6e-73 UniRef50_B7D079 5-methylcytosine-specific restriction enzyme sub... 250 6e-65 UniRef50_B4V8L4 Putative uncharacterized protein n=1 Tax=Strepto... 227 4e-58 UniRef50_B2A7U4 McrBC 5-methylcytosine restriction system compon... 225 1e-57 UniRef50_C5CG01 McrBC 5-methylcytosine restriction system compon... 218 2e-55 UniRef50_C7NQX2 McrBC 5-methylcytosine restriction system compon... 214 3e-54 UniRef50_A6VF50 Putative uncharacterized protein n=2 Tax=Methano... 212 2e-53 UniRef50_A6EGF0 5-methylcytosine-specific restriction enzyme Mcr... 211 3e-53 UniRef50_UPI0001BCB4AE McrBC 5-methylcytosine restriction system... 210 5e-53 UniRef50_Q9RZI4 Putative uncharacterized protein n=1 Tax=Deinoco... 209 2e-52 UniRef50_B1BM79 Putative uncharacterized protein n=1 Tax=Clostri... 205 2e-51 UniRef50_Q10YL1 McrBC 5-methylcytosine restriction system compon... 200 5e-50 UniRef50_D2QXZ3 McrBC 5-methylcytosine restriction system compon... 197 5e-49 UniRef50_Q4C3F9 Similar to McrBC 5-methylcytosine restriction sy... 191 4e-47 UniRef50_C7PTE7 McrBC 5-methylcytosine restriction system compon... 190 5e-47 UniRef50_Q6LZ73 Putative uncharacterized protein n=1 Tax=Methano... 190 9e-47 UniRef50_Q26DA3 Putative uncharacterized protein n=2 Tax=Flavoba... 178 2e-43 UniRef50_A6EMU6 5-Methylcytosine-specific restriction enzyme C n... 177 5e-43 Sequences not found previously or not previously below threshold: UniRef50_C9ZD45 Putative uncharacterized protein n=3 Tax=Strepto... 210 8e-53 UniRef50_A3UV38 Putative uncharacterized protein n=1 Tax=Vibrio ... 186 1e-45 UniRef50_C3FCB5 McrBC 5-methylcytosine restriction system compon... 176 9e-43 UniRef50_C7NBA8 McrBC 5-methylcytosine restriction system compon... 176 1e-42 UniRef50_D1SH86 McrBC 5-methylcytosine restriction system compon... 176 1e-42 UniRef50_A6ALR1 5-Methylcytosine-specific restriction enzyme C n... 175 2e-42 UniRef50_Q0KFR0 5-Methylcytosine-specific restriction enzyme C n... 175 2e-42 UniRef50_C5EXA5 Putative uncharacterized protein n=1 Tax=Helicob... 172 2e-41 UniRef50_D1WRX1 McrBC 5-methylcytosine restriction system compon... 171 4e-41 UniRef50_UPI0001972FC4 McrBC 5-methylcytosine restriction system... 167 4e-40 UniRef50_A4J0H3 McrBC 5-methylcytosine restriction system compon... 167 8e-40 UniRef50_C4G4B6 Putative uncharacterized protein n=3 Tax=Bacteri... 165 2e-39 UniRef50_B9MJI1 Putative uncharacterized protein n=1 Tax=Diaphor... 165 2e-39 UniRef50_B5IHH3 McrBC 5-methylcytosine restriction system compon... 165 2e-39 UniRef50_B7KTT4 IQ calmodulin-binding-domain protein n=2 Tax=Rhi... 165 3e-39 UniRef50_UPI0001909ACF hypothetical protein RetlI_30663 n=1 Tax=... 163 8e-39 UniRef50_B0A6E5 Putative uncharacterized protein n=1 Tax=Clostri... 163 1e-38 UniRef50_A7ZRA6 Putative uncharacterized protein n=3 Tax=Enterob... 160 5e-38 UniRef50_B7UQU6 McrC family protein, predicted McrBC 5-methylcyt... 160 5e-38 UniRef50_B7LQZ4 Putative 5-methylcytosine restriction system com... 160 5e-38 UniRef50_Q39M33 McrBC 5-methylcytosine restriction system compon... 160 6e-38 UniRef50_C8S833 McrBC 5-methylcytosine restriction system compon... 158 3e-37 UniRef50_C0GTU8 Putative uncharacterized protein n=1 Tax=Desulfo... 157 6e-37 UniRef50_C2WWH8 Putative uncharacterized protein n=3 Tax=Bacillu... 155 2e-36 UniRef50_C5DAA2 McrBC 5-methylcytosine restriction system compon... 155 2e-36 UniRef50_C7MHK3 McrBC 5-methylcytosine restriction system compon... 155 3e-36 UniRef50_B2FKW7 Putative uncharacterized protein n=3 Tax=Xanthom... 151 4e-35 UniRef50_A6X8D9 Putative uncharacterized protein n=1 Tax=Ochroba... 150 5e-35 UniRef50_A8EUN6 McrBC catalytic subunit McrC, putative n=2 Tax=C... 150 5e-35 UniRef50_A7H1L9 Putative uncharacterized protein n=13 Tax=Campyl... 150 7e-35 UniRef50_Q7VG80 Putative uncharacterized protein n=1 Tax=Helicob... 149 1e-34 UniRef50_Q2SJR5 McrBC 5-methylcytosine restriction system compon... 145 2e-33 UniRef50_D2AT08 McrBC 5-methylcytosine restriction system compon... 145 2e-33 UniRef50_A4TAT5 McrBC 5-methylcytosine restriction system compon... 144 5e-33 UniRef50_D0C387 McrBC 5-methylcytosine restriction system compon... 144 5e-33 UniRef50_Q2FNZ4 McrBC 5-methylcytosine restriction system compon... 143 1e-32 UniRef50_B1BM86 ATP-dependent helicase priA n=8 Tax=Clostridium ... 143 1e-32 UniRef50_A0JT91 McrBC 5-methylcytosine restriction system compon... 142 2e-32 UniRef50_D2B1B6 McrBC 5-methylcytosine restriction system compon... 142 2e-32 UniRef50_Q2J945 McrBC 5-methylcytosine restriction system compon... 140 6e-32 UniRef50_D2QGI2 5-methylcytosine restriction system component-li... 140 1e-31 UniRef50_D0Z341 McrBC 5-methylcytosine restriction system compon... 138 2e-31 UniRef50_D0BXD9 McrBC 5-methylcytosine restriction system compon... 138 3e-31 UniRef50_A4Y9J5 Putative uncharacterized protein n=3 Tax=Gammapr... 138 4e-31 UniRef50_A2TPW2 Putative uncharacterized protein n=1 Tax=Dokdoni... 137 6e-31 UniRef50_D1SMG2 McrBC catalytic subunit McrC, putative n=1 Tax=M... 137 7e-31 UniRef50_B5HWV8 Putative uncharacterized protein n=1 Tax=Strepto... 136 1e-30 UniRef50_C9ZD40 Putative uncharacterized protein n=1 Tax=Strepto... 130 6e-29 UniRef50_C1EZU3 Putative uncharacterized protein n=1 Tax=Bacillu... 128 4e-28 UniRef50_C6NTT9 Putative uncharacterized protein n=1 Tax=Acidith... 126 1e-27 UniRef50_Q188G2 Putative uncharacterized protein n=9 Tax=Clostri... 125 2e-27 UniRef50_Q466P1 Putative uncharacterized protein n=2 Tax=Methano... 123 1e-26 UniRef50_C5A3Z2 McrBC 5-methylcytosine restriction system compon... 121 3e-26 UniRef50_A1ZU11 Putative uncharacterized protein n=1 Tax=Microsc... 120 7e-26 UniRef50_Q5JJA9 Putative 5-methylcytosine restriction system, ca... 118 2e-25 UniRef50_Q08S71 Putative uncharacterized protein n=1 Tax=Stigmat... 118 3e-25 UniRef50_Q5JH86 Putative 5-methylcytosine restriction system, ca... 114 6e-24 UniRef50_C9LKR7 Putative uncharacterized protein n=1 Tax=Prevote... 110 1e-22 UniRef50_A1SC19 McrBC 5-methylcytosine restriction system compon... 108 3e-22 UniRef50_B1I205 McrBC 5-methylcytosine restriction system compon... 105 2e-21 UniRef50_D1YX67 Putative uncharacterized protein n=1 Tax=Methano... 103 7e-21 UniRef50_D0LPM4 5-methylcytosine restriction system component-li... 101 3e-20 UniRef50_UPI00016C4D94 hypothetical protein GobsU_06730 n=1 Tax=... 100 7e-20 UniRef50_A9FMX9 Putative uncharacterized protein n=1 Tax=Sorangi... 100 1e-19 UniRef50_D1PAJ0 Putative uncharacterized protein n=1 Tax=Prevote... 100 1e-19 UniRef50_Q2GBH5 McrBC 5-methylcytosine restriction system compon... 100 1e-19 UniRef50_Q7MVS1 Putative uncharacterized protein n=1 Tax=Porphyr... 99 2e-19 UniRef50_Q97QG4 Conserved domain protein n=27 Tax=Streptococcus ... 98 3e-19 UniRef50_A7ZBW9 Putative uncharacterized protein n=1 Tax=Campylo... 96 2e-18 UniRef50_A7GF31 Putative uncharacterized protein n=1 Tax=Clostri... 95 4e-18 UniRef50_C7M8S6 McrBC 5-methylcytosine restriction system compon... 94 9e-18 UniRef50_C1QC92 McrBC 5-methylcytosine restriction system compon... 89 2e-16 UniRef50_C8PX06 Putative uncharacterized protein n=1 Tax=Enhydro... 87 1e-15 UniRef50_C2BVA3 Putative uncharacterized protein n=1 Tax=Mobilun... 87 1e-15 UniRef50_B9KCB6 Putative uncharacterized protein n=1 Tax=Campylo... 86 2e-15 UniRef50_B8DPA2 McrBC 5-methylcytosine restriction system compon... 85 5e-15 UniRef50_A8UWA1 Putative uncharacterized protein n=1 Tax=Hydroge... 84 6e-15 UniRef50_D2Q5W0 3-isopropylmalate dehydrogenase n=2 Tax=Bifidoba... 82 3e-14 UniRef50_D0YRU6 Putative ATPase family associated with various c... 81 8e-14 UniRef50_UPI0001B4EC67 McrBC 5-methylcytosine restriction system... 80 2e-13 UniRef50_C9PUK8 Putative uncharacterized protein n=2 Tax=Bactero... 79 2e-13 UniRef50_C2LL98 Putative uncharacterized protein n=2 Tax=Proteus... 79 2e-13 UniRef50_Q5UZU6 Putative uncharacterized protein n=1 Tax=Haloarc... 79 3e-13 UniRef50_Q139N0 Putative uncharacterized protein n=1 Tax=Rhodops... 78 4e-13 UniRef50_C8VZS1 Putative uncharacterized protein n=1 Tax=Desulfo... 77 9e-13 UniRef50_D1W5H2 Conserved domain protein n=3 Tax=Prevotella RepI... 77 1e-12 UniRef50_Q1LJY6 McrBC 5-methylcytosine restriction system compon... 77 1e-12 UniRef50_D1YRX5 Conserved domain protein n=1 Tax=Veillonella par... 76 2e-12 UniRef50_B9CT57 Putative uncharacterized protein n=1 Tax=Staphyl... 75 3e-12 UniRef50_UPI0001B550BC hypothetical protein StAA4_26514 n=1 Tax=... 75 3e-12 UniRef50_UPI000185C9B8 conserved hypothetical protein n=1 Tax=Ca... 75 3e-12 UniRef50_B5CQ36 Putative uncharacterized protein n=1 Tax=Ruminoc... 74 9e-12 UniRef50_A9DR64 Putative uncharacterized protein n=1 Tax=Kordia ... 72 3e-11 UniRef50_D0Z8V4 5-methylcytosine restriction system component n=... 70 9e-11 UniRef50_C9BWA4 Guanosine 5'-monophosphate oxidoreductase n=4 Ta... 70 1e-10 UniRef50_B3JJV2 Putative uncharacterized protein n=1 Tax=Bactero... 69 3e-10 UniRef50_UPI000197B531 hypothetical protein BACCOPRO_00002 n=1 T... 67 7e-10 UniRef50_C9PRC9 Putative uncharacterized protein n=1 Tax=Pasteur... 67 1e-09 UniRef50_B2UGA5 McrBC 5-methylcytosine restriction system compon... 65 5e-09 UniRef50_B0TG77 Putative uncharacterized protein n=1 Tax=Helioba... 63 1e-08 UniRef50_C1QFM5 McrBC 5-methylcytosine restriction system compon... 63 2e-08 UniRef50_D1PC49 Putative uncharacterized protein n=1 Tax=Prevote... 63 2e-08 UniRef50_A5WH29 Putative uncharacterized protein n=1 Tax=Psychro... 61 7e-08 UniRef50_Q6L339 Putative uncharacterized protein n=1 Tax=Picroph... 60 9e-08 UniRef50_D2MHZ4 Putative uncharacterized protein (Fragment) n=1 ... 60 1e-07 UniRef50_D2EQZ8 Putative uncharacterized protein n=1 Tax=Strepto... 59 3e-07 UniRef50_Q5UZU4 Putative uncharacterized protein n=1 Tax=Haloarc... 54 7e-06 UniRef50_C2WFQ0 Putative uncharacterized protein n=1 Tax=Bacillu... 54 1e-05 UniRef50_B9ZE69 Putative uncharacterized protein n=1 Tax=Natrial... 54 1e-05 UniRef50_Q4FV58 Putative uncharacterized protein n=2 Tax=Psychro... 53 1e-05 UniRef50_C3JNW8 Putative uncharacterized protein n=1 Tax=Rhodoco... 51 4e-05 UniRef50_B9D5W2 Putative uncharacterized protein n=1 Tax=Campylo... 49 3e-04 UniRef50_Q9ZMQ3 Putative n=6 Tax=Campylobacterales RepID=Q9ZMQ3_... 47 0.001 UniRef50_A6Y209 McrBC 5-methylcytosine restriction system compon... 43 0.013 >UniRef50_P15006 Protein mcrC n=9 Tax=Escherichia RepID=MCRC_ECOLI Length = 348 Score = 403 bits (1037), Expect = e-111, Method: Composition-based stats. Identities = 348/348 (100%), Positives = 348/348 (100%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL Sbjct: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE Sbjct: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ Sbjct: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR Sbjct: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL Sbjct: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 Query: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK Sbjct: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 >UniRef50_C1PCD8 McrBC 5-methylcytosine restriction system component-like protein n=6 Tax=Bacteria RepID=C1PCD8_BACCO Length = 355 Score = 366 bits (941), Expect = e-100, Method: Composition-based stats. Identities = 92/351 (26%), Positives = 180/351 (51%), Gaps = 8/351 (2%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 M+ I +RNIYYML+YA+ L+ + + ++ D+ +L KG+ + ++GL Sbjct: 1 MKDKGILIRNIYYMLSYAFRVLKRSNYDEIGSERFEHIQDLFAAILTKGIARQLKQGLYK 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y + + + ++G+++ TI K +D L+E+ + N+I+K+T IL++ Sbjct: 61 EYVSHCDDLSVLRGKLDIHGTIHHKLQRKQKLSCEYDELSENNVFNQILKTTSVILMQQP 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 +N R + + + + T ++ L KN + YK ++++C F+++ + Sbjct: 121 SVNVKRRTALKKVMLHFDSVDMIEPTRIKWNILRFQKNNQSYKMLLNICYFVLDGLLLST 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 +KG ++ +F +E+ MS L++KF+ E+ R S + W+ I + + LP Sbjct: 181 DKGKFKMANFL-DEQHMSRLFEKFVLEYYRYHYPSLRAAAPQIAWN---IDTGATDFLPT 236 Query: 241 METDITIRSSEKILIVDAKYYKSIF--SRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 M+TDI ++S K+LI+D KYY R G+ FHS NLYQ+ Y+ + N N+ Sbjct: 237 MQTDIVLKSCSKVLIIDTKYYAHTMQVQSRYGSRTFHSNNLYQIFTYVKNQDVGNTGNVA 296 Query: 299 GLLIYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 G+L+Y + + + ++G + + T++L + I ++L +I Y Sbjct: 297 GMLLYARTEETIVPNADFMMSGNKMSVKTLDLNTAFGNIAEQLDNIATSYF 347 >UniRef50_UPI0001C37412 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37412 Length = 351 Score = 363 bits (931), Expect = 7e-99, Method: Composition-based stats. Identities = 93/349 (26%), Positives = 172/349 (49%), Gaps = 6/349 (1%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 I ++NIYYML+YA+ L++ + + D+ +L KGV + ++GL +Y Sbjct: 4 DKGIFIQNIYYMLSYAFQILKQEDYKQVAGEKFEKIHDLFAAILEKGVSRQVKQGLYREY 63 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 P E + ++G++ +T+R N K FD +ED N+I+K T+ LI+ E + Sbjct: 64 VPTQEDLSVMRGKLNMGETVRLKVQNKQKLGCEFDEFSEDNPYNQILKVTIHRLIRAEDV 123 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + R + I + ++ L ++ R Y+ ++++C ++N + Sbjct: 124 APERKQALRRVSVYFGNIRLIQPDHIAWNRLIYQRSNRNYELLLNICYLVLNGMLQTTED 183 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQ-SLNLLPRM 241 G Y+ F +++ M LY+KF+ E+ ++ + + +KW+ + DQ + LP+M Sbjct: 184 GSYKLLAF--SDEHMERLYEKFILEYYKQHHPELDPKSAQVKWNLTEEPDQPMIQFLPKM 241 Query: 242 ETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 +TDIT++ +K LI+DAKYY ++ E S +LYQ+ Y+ ++ N N+ GLL Sbjct: 242 QTDITLQKGDKTLIIDAKYYGKSMAQSYSKETLRSAHLYQIFAYVKNMDTANKGNVSGLL 301 Query: 302 IYPHVDTAV---KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +Y + V + I G IG T++L + + +L I E Sbjct: 302 LYAKTEDEVFPEGEPFVIGGNRIGARTLDLNVSFDTLRIQLDKIAKECF 350 >UniRef50_B2GJY9 Putative McrC protein n=3 Tax=Actinomycetales RepID=B2GJY9_KOCRD Length = 348 Score = 358 bits (920), Expect = 1e-97, Method: Composition-based stats. Identities = 93/342 (27%), Positives = 164/342 (47%), Gaps = 4/342 (1%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 M+ +RNIY ML YA+ ++ +++ ++ D+ +L +GV +RG+ Sbjct: 1 MKDRTATIRNIYVMLAYAFRAIRTPDASDVGTEEFTHIHDLFAEILAQGVSAQVKRGVHH 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DY E + ++GRI+ T+ + G FD DT N+ +KS + +LI+H Sbjct: 61 DYLRRDEQLTTVRGRIDVTATMVARAVTPGSVSCIFDTYEPDTPFNQALKSVMVLLIRHG 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++ +D R L L ++ + + + Y+ ++ VC+ +V +P + Sbjct: 121 EVGQRRKDALRRLLPYLDAVTLVSPRSIRWEKFTCHRRNAAYRILLGVCQLVVEGLLPTE 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 N G + ++ +E+ MS LY++FL E+ + T ++ WD ++ + LP Sbjct: 181 NSGDTQLAEWL-SEEAMSALYERFLREYYAFHHPELSPTARHVAWDYDPVTAVGADQLPA 239 Query: 241 METDITIRSSEKILIVDAKYYK-SIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 M TD+T+ S + LI+DAKYY + S G HS NLYQ+++Y+ + N + G Sbjct: 240 MRTDVTLTSGTRTLIIDAKYYSQPLTSGAYGKLTVHSANLYQMLSYIKNADVSNDGTVSG 299 Query: 300 LLIYPHVDTA--VKHRYKINGFDIGLCTVNLGQEWPCIHQEL 339 LL+Y D I G +G T++L WP + EL Sbjct: 300 LLLYARTDAPAQPDVDVVIQGNRLGARTLDLAAPWPDLRHEL 341 >UniRef50_A6BZ07 McrC protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZ07_9PLAN Length = 362 Score = 351 bits (901), Expect = 2e-95, Method: Composition-based stats. Identities = 113/345 (32%), Positives = 186/345 (53%), Gaps = 5/345 (1%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 IP++NIYY+L YAW LQE + ++ ++ VL+ GV L +RGL+ +Y Sbjct: 2 TIPIQNIYYLLCYAWDKLQEGQIVSVSPEDCQTTAELFARVLDSGVTHLLKRGLDRNYIS 61 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 ++G+ + TI+ L + D + D NRIIK+TL L++ +L+ Sbjct: 62 EEIETSSLRGKFDITTTIKQNLLRKSRVHCVVDSFSYDVPHNRIIKATLRNLLRCRELDR 121 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 RD LYR+L +S + LTP+ F+ + +N +Y F++ VC+ I +N + + G Sbjct: 122 DQRDRLLRLYRRLHDVSDIKLTPKDFNNVQLHRNNAWYGFLLQVCRLIYDNLLINEETGD 181 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 +F DF R+E++M+ L++ F++ F R+E + L W + + LPRM TD Sbjct: 182 SQFRDFLRDERQMARLFENFVFNFYRKEQSVFKVKSELLTWQGVDATPEDQQFLPRMRTD 241 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGEN-----IGG 299 +++ SS + +++DAKYYK G HS+NLYQL YL + +N +N I G Sbjct: 242 VSLDSSTRKIVLDAKYYKDSLQSFHGNSSVHSENLYQLFAYLKNFYLKNIQNGDSRPIEG 301 Query: 300 LLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 +L+YP ++ Y+I+G I + T+NL W I Q+LL+I + Sbjct: 302 ILLYPTTGQSLSLNYEIHGHSIRIVTLNLNTSWKEIRQQLLNILE 346 >UniRef50_C8W851 McrBC 5-methylcytosine restriction system component-like protein n=21 Tax=cellular organisms RepID=C8W851_ATOPD Length = 351 Score = 347 bits (890), Expect = 4e-94, Method: Composition-based stats. Identities = 95/350 (27%), Positives = 180/350 (51%), Gaps = 13/350 (3%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 +I ++NIY+ML YA+ LQ ++ A N ++L +L +GV +RGL +Y Sbjct: 1 MIRIQNIYHMLAYAFQTLQGQGYRDIAAEEFGNTTELLAEILARGVSLQLKRGLGQEYID 60 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 E + +G+IE +++++ + + V ++D + DT NRI+K+T+A+L++ + ++ Sbjct: 61 REEALSSPRGKIELSESLKTRSILRRQLVCSYDEFSTDTRMNRILKATIALLVRSD-IDK 119 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + R L + + L + + ++ +N + Y+ +++VC +V + Q G Sbjct: 120 VRKKALRRLLPYFVDVGDVDLEHEDW-HMRFDRNNQAYRMLMNVCWLVVKGLLQTQEDGS 178 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 R D +E+ MS LY+KF+ E+ RRE + Y+ W ++ D ++LP M TD Sbjct: 179 IRMMD-LLDEQRMSHLYEKFILEYYRREHPKLSAGAPYIDW---ALDDGFDDMLPAMHTD 234 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE-----NGENIGG 299 I + +LI+DAKYY ++ HS NLYQ+ Y+ + + E ++ G Sbjct: 235 IMLEQGRTVLIIDAKYYSRTMQQQFDKRSVHSSNLYQIFTYVKNKEVELSSTLKAHSVSG 294 Query: 300 LLIYPHVDTAVKHR--YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +L+Y D ++ Y+++G I + T++L Q + I +L I + Sbjct: 295 MLLYAKTDEEIQPDGVYQMSGNQISVRTLDLNQPFEEIRSQLDGIAKAHF 344 >UniRef50_UPI000197B905 hypothetical protein BACCOPRO_00614 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B905 Length = 346 Score = 345 bits (885), Expect = 1e-93, Method: Composition-based stats. Identities = 92/346 (26%), Positives = 169/346 (48%), Gaps = 4/346 (1%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 ++ I +RN+YYML YA+ L++ + ++ D+ +L KGV +RGL Sbjct: 2 IKDRNIWIRNVYYMLAYAFEELKKNNYEQIAHEEFEHIQDLFAEILYKGVSAQLKRGLHR 61 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y E +P +KGR++ TI FD L+E+ L NR++K+TL++L Sbjct: 62 EYINRVEDLPLLKGRLDIRGTIANQMRCRNVLCCEFDDLSENNLFNRVLKTTLSLLCHER 121 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++S + E R+L G+ + + ++ +N + Y+ +++VC FI++ + Sbjct: 122 NVSSVRKAELRTLLPFFSGVDEIDVRNIRWNDFVYQRNNQMYRMLMNVCYFIIDGMLMTT 181 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G YR F +++ M L++KF+ E+ R + ++W+ S ++LLP Sbjct: 182 ETGKYRMATF--SDEHMCRLFEKFVLEYYRLHHRELSPNPDRIEWNIYSKDAMVIDLLPA 239 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 M++DI + ++ L++D KYY HS NLYQ+ Y+ +L ++ N+ GL Sbjct: 240 MQSDIVLHRGDQSLVIDTKYYSHAMQYHFDKPTIHSANLYQIFTYVKNLDVKDTGNVSGL 299 Query: 301 LIYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 L+Y D + I + T++L QE+ I +L + Sbjct: 300 LLYAKTDEDITPDLSASFGKNHIRVRTLDLNQEFSGIASQLEEFLK 345 >UniRef50_B2JTN9 Putative uncharacterized protein n=1 Tax=Burkholderia phymatum STM815 RepID=B2JTN9_BURP8 Length = 363 Score = 333 bits (855), Expect = 5e-90, Method: Composition-based stats. Identities = 88/350 (25%), Positives = 165/350 (47%), Gaps = 13/350 (3%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IP++N++++L+YA + + +E +++ ++LG +L V + +R L Y P Sbjct: 10 IPIKNLWFLLSYAHNLARFADRLPVEIGEQDDIPELLGRLLAFLVERRIKRNLTRAYQPR 69 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL---IKHEKL 122 + ++GRI+ KT+ L G+ + F+ L+ DT N++++ LA + ++ + L Sbjct: 70 EARLTRVRGRIDLVKTLSAGELQQGRIICRFEELDADTPRNQLVRYALAHIASTVRDQAL 129 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 A L R G+S + + G+N +I V ++ +P + Sbjct: 130 ERRCGLLASELGRL--GVSFRRPSRSEMAREQIGRNDADDAALIVVSNLALDPRLPSEES 187 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQSLNLLP 239 G R +R+E+ + +++K + F EL + + L W +S + +LLP Sbjct: 188 GDSRVARLQRDERLLPYIFEKAIAGFYMHELPNKEWRVRPQKVLAWPVASPTPGLHDLLP 247 Query: 240 RMETDITIRS--SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL---KPENG 294 M+ DI I S + + ++VD K+ + + GT++F S LYQ+ YL S ++ Sbjct: 248 GMQADIVIDSRITNRRVVVDTKFTDILTRNQFGTQRFKSNYLYQMFAYLRSQTGFGDKHA 307 Query: 295 ENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 E GLL++P V V + + G + TV+LG E IH L + Sbjct: 308 EEAEGLLLHPSVGLHVDESFFVQGHRMRFATVDLGGEIHSIHSALAALVS 357 >UniRef50_Q1QFB3 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QFB3_NITHX Length = 361 Score = 331 bits (848), Expect = 3e-89, Method: Composition-based stats. Identities = 92/352 (26%), Positives = 159/352 (45%), Gaps = 10/352 (2%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 Q IP+RN++ +L YA G + Q L DILG +L + + R L Y Sbjct: 9 QTGIPIRNLWLLLVYASGLAEFESQCGAGTDDDIELADILGRLLVRLAKRRLRTNLSRGY 68 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 +IP ++GR+++ +T+ HL G+ F+ L+ DT NR+++ L I I Sbjct: 69 QRRQAVIPRVRGRVDWLQTLSRQHLQRGRLACRFEELSFDTPRNRLVRCAL-IAIAGRVR 127 Query: 123 NSTIRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + + R L L GI+ T Q S G + +F+IS + + +P + Sbjct: 128 DHAVAADCRRLGDDLGRLGIAASRPTQQEMSADTIGSHQSEDRFMISAARLVFEMLLPNE 187 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN----TTRSYLKWDASSISDQSLN 236 G + +R+E + +++K + F R L +A+ ++Y W + Sbjct: 188 TPGDMKLSRLKRDEITLRKIFEKAVTGFYRHHLRAADGWSVREQNYQSWQLEPGRSGDVG 247 Query: 237 LLPRMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN-- 293 LLP M+ DI + R ++ +++D K+ + +K S N+YQ+ YL S + Sbjct: 248 LLPGMKPDIILDRKQDRRIVIDTKFTSILAKGIADRDKLKSANIYQIYAYLHSQRGRGRL 307 Query: 294 GENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 + G+L+YP +D V + I G DI TV+L + I L I + Sbjct: 308 CDRAEGVLLYPALDHDVDETFTIQGHDIRFVTVDLALKPSEILDRLHSIAQD 359 >UniRef50_D0D6B3 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Citreicella sp. SE45 RepID=D0D6B3_9RHOB Length = 359 Score = 324 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 75/350 (21%), Positives = 145/350 (41%), Gaps = 11/350 (3%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IP+RNI+ + YA +Q + + +L D++G ++ V RR L Y Sbjct: 5 IPLRNIWILFLYAADLVQLRGRFERDVERARDLPDLVGRLMVNVVEDRLRRNLSRGYRAQ 64 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 T I+P ++GRI+ T G + G+ F+ DT NR++++ L L T Sbjct: 65 TAILPRVQGRIDMLATEAGQLMERGQIACRFEEHVMDTPRNRLVRAALERLAARVFTPET 124 Query: 126 IRDEARSLYRKL--PGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 RSL G+S + + G+N + ++++ + + +IP + G Sbjct: 125 AY-RCRSLAADFSRAGVSARRPSRTELAIDQMGRNEGADRMMVALAGMVFDGTIPTEKHG 183 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY---LKWDASSISDQSLNLLPR 240 E E + L+++ + R L T + + W ++ ++LP Sbjct: 184 TALQPGDETTEHLIRRLFERAVGNALRIALEPEGWTIAQGHRIAWPVGGKTEGLPSILPG 243 Query: 241 METDITIR--SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLW---SLKPENGE 295 M+TDI + + + +++D K+ + + + S LYQ+ YL S++ + Sbjct: 244 MQTDIELSHIKTSRRVVIDTKFTRILTASNYRGGILRSGYLYQMYAYLRTQESMEHPSSL 303 Query: 296 NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 G+L++P V +V + G I T++L ++L Sbjct: 304 TSEGILLHPQVGGSVDETMILQGHPISFRTIDLTASSTEFVEQLHRTATN 353 >UniRef50_Q1IK47 McrC protein n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IK47_ACIBL Length = 351 Score = 323 bits (827), Expect = 8e-87, Method: Composition-based stats. Identities = 106/349 (30%), Positives = 181/349 (51%), Gaps = 9/349 (2%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IPV N+YY+L YAW L+E ++ +L+++ VL G+ L ++G++ Y + Sbjct: 3 IPVANVYYLLCYAWDKLEERDLVDIHPTEETDLVNLFARVLTNGIDHLLKKGIDRGYLLH 62 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 +E ++GRI+F ++I+ + FD L+ D L NRI+KST+ LI+ L+S Sbjct: 63 SEESCVLRGRIDFPQSIKHMLFQRAQAHCEFDELSFDVLHNRILKSTIMRLIRTRDLDSG 122 Query: 126 IRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 IRD YR + L L+ Q F + +N +Y F++ VC + N +P Q G++ Sbjct: 123 IRDRLLFQYRYFAEVGDLDLSVQIFGKVQLYRNNHFYDFLLRVCALLFENLLPTQEPGNW 182 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTS------ANTTRSYLKWDASSISDQSLNLLP 239 RF F +N ++M+ ++++F+ F +REL S R + W + D S LLP Sbjct: 183 RFRSFLQNREQMAYVFERFVRNFYKRELPSVRVDGRCKVKREDINWGMTPSDDLSSALLP 242 Query: 240 RMETDITIRSSEKILIVDAKYYKSIFSRRMG-TEKFHSQNLYQLMNYLWSL-KPENGENI 297 +M+TD+ I + K ++V+ KY +R K + +LYQ+ YL + + Sbjct: 243 KMQTDVCITTEAKRILVECKYVDDPLEQREEMAPKLITTHLYQVNAYLDNWPDLPLYRSS 302 Query: 298 GGLLIYPHVDTAVKHRYK-INGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 +L+YP + + +G + + T+NL Q+W IHQ+LL + D Sbjct: 303 RAILLYPLATRPIAVEFTRADGQLLSVRTLNLAQQWSAIHQDLLRLVDN 351 >UniRef50_A1UB52 Putative uncharacterized protein n=2 Tax=Mycobacterium RepID=A1UB52_MYCSK Length = 361 Score = 322 bits (825), Expect = 1e-86, Method: Composition-based stats. Identities = 88/351 (25%), Positives = 156/351 (44%), Gaps = 12/351 (3%) Query: 5 VIPVRNIYYMLTYAWGYLQE---IKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 IPVRN++ ++ YA Q ++ ++E P LLD++ VL V + +R L Sbjct: 13 AIPVRNLWLLMLYASRLYQRNHLLRNMDVEQNP-ERLLDLVAQVLVYAVERRLQRNLGRQ 71 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y + ++G+I+ T L G+ FD L+ D L NRII+S L +L + Sbjct: 72 YRERRATLARVRGQIDVLTTESKALLAQGRIACRFDELSVDNLRNRIIRSAL-VLAARDA 130 Query: 122 LNSTIRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG 179 + T++ AR++ G+S ++ + L +N I + I+ +IP Sbjct: 131 RDRTLQRTARNMADVFTQYGVSPQLVSVRESRQLVLDRNAHDDVEAIGAAQLILEMAIPA 190 Query: 180 QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA---NTTRSYLKWDASSISDQSLN 236 ++ G+ D ER+ E+ LY+ + F R L S+ + ++ W ++ + Sbjct: 191 ESAGNSTNRDPERDAAEIRRLYEAAVRGFYRSALPSSWSVSPGETHYHWPLVEATEGLKS 250 Query: 237 LLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL--KPENG 294 +LP M+ D + + + +IV+ K+ ++ + G K +++QL Y+ S E Sbjct: 251 ILPIMKADTVLETVGRRIIVETKFADALKPNQYGLPKLARNHVFQLYAYVQSQHGSDELS 310 Query: 295 ENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 G+L+YP V V I G TV+LG I LL + Sbjct: 311 ATAEGVLLYPVVGEHVDESASIQGHRYRFLTVDLGGPAESIRSSLLRVTSN 361 >UniRef50_A5WF57 McrBC 5-methylcytosine restriction system component-like protein n=4 Tax=Proteobacteria RepID=A5WF57_PSYWF Length = 363 Score = 319 bits (818), Expect = 9e-86, Method: Composition-based stats. Identities = 82/351 (23%), Positives = 147/351 (41%), Gaps = 15/351 (4%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDYNP 64 IP+RN++ ++ YA +E+ + + +++ D++ +L + V +R L Y Sbjct: 12 IPIRNLWLLMLYASDIYRELNKDRVAVEENPDDIPDLIAEMLCQRVEHRIQRNLSYGYQS 71 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK---HEK 121 ++ ++GRI+ T R L+ GK FD L DT NR ++ L + K ++ Sbjct: 72 REAVVSRVRGRIDLLNTERNRLLDRGKVACRFDELTIDTARNRYVRGALERIAKIVQRKE 131 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 L R L R G+S + + S ++ K +++V N ++P + Sbjct: 132 LAHRCRSLDIRLRRM--GVSQVLPSRAELSVDRLSRHDAEDKPMLTVAHLAFNLALPTEV 189 Query: 182 KGHYRFYDFER-NEKEMSLLYQKFLYEFCRRELTSAN---TTRSYLKWDASSISDQSLNL 237 G ER + + L++K + F L++ T + W + S + Sbjct: 190 TGSKYLSRPEREDLPWLRKLFEKGVAGFYEITLSNHGYKVTAGKRINWPVTDSSQGIDKI 249 Query: 238 LPRMETDITIRSSE--KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN-- 293 LP M+TDI I + + + I+D K+ + E S +YQ+ YL S + Sbjct: 250 LPSMKTDIIIDNLDLGQRTIIDTKFNAVLTRGWYRHETLRSSYIYQMYAYLRSQEDSGDF 309 Query: 294 -GENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 N GL I+P V + I I TV+L I ++LL + Sbjct: 310 LDRNACGLFIHPSVGEDINEYMVIQDHKIQFATVDLAASTKEIRRQLLGLI 360 >UniRef50_C4T585 McrBC 5-methylcytosine restriction system component n=1 Tax=Yersinia intermedia ATCC 29909 RepID=C4T585_YERIN Length = 367 Score = 318 bits (814), Expect = 2e-85, Method: Composition-based stats. Identities = 73/351 (20%), Positives = 144/351 (41%), Gaps = 15/351 (4%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 +RN++ ++ YA +++ + ++ + D++ +L + R L + Y Sbjct: 1 MRNLWLLMLYASDLFRQLGRRHIAVEDNPAEIPDLVATILLHEIALRRHRNLSMGYQTRH 60 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL---IKHEKLN 123 + ++GRI+ T L G+ F + DT NR ++ L L I L Sbjct: 61 AALNRVRGRIDVLYTTSHQLLERGRVACHFQDMTLDTPRNRYVRCALERLTPIIAKPSLA 120 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHF-SYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + A SL R GI+ + + S G++ K ++ + +P +++ Sbjct: 121 ADCHAMALSLRR--EGINGGYPDNRELPSVRRFGRHDAADKPMVDAAQLAFELLMPTEDQ 178 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQSLNLLP 239 G + N M L++K + F R L LKW S S S + P Sbjct: 179 GQHLLPAPSDNLYWMRKLFEKGIAGFYRVHLAKTTWRISAGKELKWALSEQSAGSAEIFP 238 Query: 240 RMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGE-- 295 M++DI + + +++ +I+D K+ + + + +YQL YL + + + Sbjct: 239 TMKSDIILEHKMAQQRIIIDTKFNAILTKGWHREKSLRNSYIYQLYTYLRTQESQADPLS 298 Query: 296 -NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 N GLL++P V + G I TV++ + I ++LL++ + Sbjct: 299 LNAAGLLLHPAVGYMLNEYVVTQGHKIHFATVDMAVDAKTIKRQLLELAHD 349 >UniRef50_C2G7A3 5-methylcytosine-specific restriction enzyme subunit McrC n=16 Tax=Staphylococcus RepID=C2G7A3_STAAU Length = 346 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 96/348 (27%), Positives = 181/348 (52%), Gaps = 10/348 (2%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 +I ++NIYYML+YA+ L + L N+ D+ +L KG+ GL +Y Sbjct: 1 MINIKNIYYMLSYAFTVLNKKGYQKLATEQFENIFDLYSAILIKGISSQLNSGLHHEYIE 60 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 T+ + I+G+++ +I+G + + +D + +T N+I+K+T+ LIK + ++ Sbjct: 61 QTDSLKVIRGKVDVKNSIQGLGVLSQRINCIYDEFSLNTYMNKILKTTMKCLIKTD-ISR 119 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + + R L + TL + Y + +N + YK +IS+C I I ++KG Sbjct: 120 KNKIKLRKLLVHFNNVDTLDYRNIQW-YHSFDRNNQTYKMLISICYLIFQGVIQTESKGQ 178 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 F +E+++S LY+KF+ E+ ++E T S ++W +D ++N+LP M +D Sbjct: 179 NDLMVF-VDEQQISRLYEKFILEYYKKEFPELVVTSSNIQWSLD--NDDNVNMLPVMRSD 235 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGEN---IGGLL 301 I +R +K LI+DAK+YK+ T+K HS NLYQ+ Y+ + + + + G+L Sbjct: 236 IMLRYKDKCLIIDAKFYKNTLHNYYDTKKIHSTNLYQIFTYVKNQQLNLKKKAIQVSGML 295 Query: 302 IYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +Y D + ++ ++G I + T++L + I ++L I ++ Sbjct: 296 LYAKTDENIVLNDKFHMSGSQIIIKTLDLNCNFTIIKKQLNGIVNDIF 343 >UniRef50_B0AA68 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0AA68_9CLOT Length = 350 Score = 313 bits (802), Expect = 6e-84, Method: Composition-based stats. Identities = 92/351 (26%), Positives = 174/351 (49%), Gaps = 8/351 (2%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 ++ I ++NIYYML+Y + L + +++ +N+ DIL +L K V + +RGL Sbjct: 2 IKDKSIFIKNIYYMLSYVYTDLIQKDYKDIDVEEFDNVGDILAVILFKVVSKQVKRGLIK 61 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y + + G+I K+I+ N K FD L+ D N IIK+ + +L+ + Sbjct: 62 EYKSEEGELSVLTGKINIEKSIKLKANNKNKLYCEFDKLSMDNYLNSIIKTAMYVLVLSK 121 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++S + + L ++TL + ++ + ++ Y +I++C I+N+ + Sbjct: 122 DISSQNKKNLKKLVLLFSNVNTLKVNEIRWNDIKYNRHNSNYSGIINICYLILNDLLMTT 181 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G Y+ +F +EK+M +Y+KF+ + ++ S S +KW+ + D LP Sbjct: 182 EDGEYKVAEFL-SEKKMYSIYEKFVLFYYQKHYPSLRPKASKIKWNLDNELD---KFLPE 237 Query: 241 METDITIRSSEKILIVDAKYYKSIFS--RRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 M++DIT+ S E ILI+D KYY ++ HS NLYQ+ Y+ + + Sbjct: 238 MKSDITLTSGENILIIDTKYYSQSMQTIELYNSKTIHSNNLYQIFTYVKNKDINKNGKVS 297 Query: 299 GLLIYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 G+L+Y + + Y ++G I + T++L +++ I Q L I + + Sbjct: 298 GMLLYAKTNEDIIPNSEYIMSGNKIMVRTLDLNKDFKFIAQSLNKIASDLI 348 >UniRef50_D1PB97 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PB97_9BACT Length = 364 Score = 309 bits (793), Expect = 8e-83, Method: Composition-based stats. Identities = 104/356 (29%), Positives = 192/356 (53%), Gaps = 14/356 (3%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 Q IP+ N+YY+L YAWG ++ + ++ ++L ++L +L +L R+GL Y Sbjct: 2 QQKIPIENLYYLLCYAWGVSDQLDKVKVDGEKCHSLENLLSTILLNACDRLLRQGLLRAY 61 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + G++G++ A+T++ +G+T+ D L +D + NR+I STL L++ E + Sbjct: 62 RFEEQEVEGVRGKLNLAETLKSGKHLNGRTICQVDELTQDVVINRVIFSTLKRLMRIEGI 121 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + IR R K P I + +T L + + +YK V+++C+ I ++++P ++K Sbjct: 122 DENIRARLRKTLAKFPHIEEIRVTEGLLGRLRQHRLSGFYKLVLNICRLIWDSTLPCKDK 181 Query: 183 -GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA--NTTRSYLKWDASSIS---DQSLN 236 G F DF ++ M+ ++++FL FC++ R Y+ + S ++ Sbjct: 182 DGRLEFLDFTEDDFRMNCIFERFLMNFCKQNCRDEYPEVHREYIDFQLSPFGMMFKEAGE 241 Query: 237 LLPRMETDITI--RSSEKILIVDAKYYK-SIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 LP METD+T+ ++++ LI+DAK+Y+ ++ S+ G EK +L Q+++Y+ + + + Sbjct: 242 ALPVMETDVTLFNPNTQEKLILDAKFYREALVSKFGGREKVRRDHLSQILSYVMNQEDRS 301 Query: 294 GE---NIGGLLIYPHVDTAVK--HRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 N G L+YP VD +RYK G I + TVNLGQ W I + + +I Sbjct: 302 KPHTMNAYGALVYPTVDEDFDFSYRYKETGHRIIVRTVNLGQPWRKIEERVKEIVK 357 >UniRef50_UPI0001BCC8FD 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Aeromicrobium marinum DSM 15272 RepID=UPI0001BCC8FD Length = 357 Score = 307 bits (786), Expect = 4e-82, Method: Composition-based stats. Identities = 87/361 (24%), Positives = 150/361 (41%), Gaps = 25/361 (6%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDY 62 P IP++N++ + YA + + Q + A +L + +L V Q GL + + Sbjct: 3 PKIPIKNVWLLQLYASSLYRAVGQRLVAAEDNPEDLPAFVAGMLADAVTQRLHTGLSVGF 62 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLAN---RIIKSTLAILIKH 119 + + ++GRI+ T R L+ G+ TFD + DT AN R A L+ H Sbjct: 63 QRTSRPLTRVRGRIDVLPTARHQLLSRGQVHCTFDEVVADTPANRLARAALWRAATLVPH 122 Query: 120 EKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTR---YYKFVISVCKFIVNNS 176 E RSL +L P S + G R + +I+ +++ + Sbjct: 123 EP-------RFRSLALQLEAAGVRGPCPP-LSRVPGLHRERLLVRDRQMIATADLLLSLA 174 Query: 177 IPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQ 233 IP ++G + +E+ + +L+++ F R L S LKWD S +S Sbjct: 175 IPTTDEGGKLLPAPDMDERYLRVLFERACVGFFRLRLEPQGWKVNHNSPLKWDTSFMSSG 234 Query: 234 SLNLLPRMETDITI------RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLW 287 +LP ME DI + + +++D K+ + G K S +YQ+ YL Sbjct: 235 MGAILPGMELDIELVHHDLTGPGRRRVVIDTKFTTITKMNQYGNLKLRSGYIYQIYAYLM 294 Query: 288 SLK-PENGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 S + E GL+++P V V I G I TV+L + + ++LL + Sbjct: 295 SQEASETDPKSEGLMLHPVVGERVDEEVVIQGHRIRFATVDLAADSATLAEQLLATITPH 354 Query: 347 L 347 + Sbjct: 355 V 355 >UniRef50_C1QDS7 McrBC 5-methylcytosine restriction system component n=2 Tax=Brachyspira RepID=C1QDS7_9SPIR Length = 350 Score = 296 bits (758), Expect = 8e-79, Method: Composition-based stats. Identities = 106/353 (30%), Positives = 186/353 (52%), Gaps = 18/353 (5%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQAN-----LEAIPGNNLLDILGYVLNKGVLQLSRRGLE 59 IP++NIYYML+YAW I + N +N+ +++GY+LN + +L +RG Sbjct: 6 KIPIKNIYYMLSYAWNIWNIINEDNDKKEIFGDEKFDNIYNVMGYILNIFLEKLIKRGFY 65 Query: 60 LDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKH 119 Y E + +KG+I F+++++ H K V ++++L+ D L N+IIK TL LI + Sbjct: 66 RGYITLEEDLSVLKGKINFSESVKRN--THKKLVCSYNILSNDILFNQIIKYTLNKLINY 123 Query: 120 EKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG 179 + +++ I+++ L I +++ + F L KN +YK +I++CKF+ N + Sbjct: 124 KNIDNDIKEKLIKLNHYFIKIKNINVNNRTFKLLKYNKNNMHYKIIINICKFVHKNLLVN 183 Query: 180 QNKGHYRFYDFERNEKEMSLLYQKFLYEFCR---RELTSANTTRSYLKWDASSISDQSLN 236 +N Y F DF EK M +LY+KF+ F + + +KW+ + Sbjct: 184 KNSSEYSFIDFN-EEKRMHMLYEKFVLNFYKIYFFHNKNIKVKNKTIKWNIN-----DNE 237 Query: 237 LLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP--ENG 294 +P M+TDI I + EK LI+D K+YK+I + S +LYQ+ +Y+ ++ + Sbjct: 238 YIPIMKTDIMIYNKEKCLIIDTKFYKNILIKNNDKVSLRSSHLYQIFSYMSNINNSYKRF 297 Query: 295 ENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 + I G+L+YP + + YKIN + T++L ++ I EL++I Y Sbjct: 298 KTIKGILLYPLCNDNLNKEYKINDKYFAVNTIDLNSDFNIIKSELINIIKNYF 350 >UniRef50_D0VFC8 McrBC 5-methylcytosine restriction system component (Fragment) n=1 Tax=Streptococcus suis RepID=D0VFC8_STRSU Length = 346 Score = 291 bits (745), Expect = 3e-77, Method: Composition-based stats. Identities = 85/338 (25%), Positives = 170/338 (50%), Gaps = 12/338 (3%) Query: 15 LTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKG 74 L YA + + + + N D+L ++ + +RGL Y TE + +KG Sbjct: 11 LLYAIYASKSHSRISTMLLRFKNTADLLAEIMIISLSIQVKRGLGRGYRSQTESLSALKG 70 Query: 75 RIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLY 134 +I ++++ + + V +D + D+ NRIIK+++ IL+K + ++ + + R L Sbjct: 71 KINISESLTPPNWRRKQLVCQYDDFSLDSTMNRIIKASIEILLKAD-ISRDRKKKLRKLL 129 Query: 135 RKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNE 194 +S ++L +++ L +N + Y ++S+C +VN I + +G+ + +F +E Sbjct: 130 VFFGEVSKINLHSINWN-LQYNRNNQSY-LLMSICYLVVNGLIHTEREGNKKLMNFL-DE 186 Query: 195 KEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKIL 254 + SLLY+KF+ + ++ T S + W ++ D +LP M++DI ++ + IL Sbjct: 187 RRESLLYEKFILGYYKKHYPQIQVTASQIPW---ALDDGFGEMLPIMQSDIYLKYKDTIL 243 Query: 255 IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK---PENGENIGGLLIYPHVDTAVK 311 I+DAKYY S R HS NLYQ+ Y+ + + + + G+L+Y D ++ Sbjct: 244 IIDAKYYSSNTQIRFDKRTLHSNNLYQIFTYVKNQAYRLSDTNDTVAGMLLYAKTDIDIQ 303 Query: 312 HR--YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 Y+++G I + ++L ++ I ++L DI + Sbjct: 304 PNQVYQMHGNQISVKNLDLNLQFASIAEQLDDIITSHF 341 >UniRef50_C0WJ24 5-methylcytosine-specific restriction enzyme subunit McrC n=3 Tax=Corynebacterium RepID=C0WJ24_9CORY Length = 373 Score = 276 bits (707), Expect = 6e-73, Method: Composition-based stats. Identities = 77/353 (21%), Positives = 143/353 (40%), Gaps = 16/353 (4%) Query: 2 EQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIP-GNNLLDILGYVLNKGVLQLSRRGLEL 60 E +P+R+++ + YA + +N G L +++G +L V + RR L + Sbjct: 11 EDIYVPIRSVWMLQLYASQTFIDGHISNSSVEEAGVELPELIGTMLCDAVERRFRRELSI 70 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 + I ++GRI +T R L G F+ L ++ NR ++ L H Sbjct: 71 GFTLTERNITRVRGRINMYETARHQLLEKGLIACEFNELTINSEINRFLRYALEY-AGHI 129 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 N D A ++ + + + + + K ++V K ++ ++P + Sbjct: 130 LSNVGSGDAAHRCKILGQRLAQMGVPEPKTAAFPRARLSPADKKPVAVAKLLLELAVPTR 189 Query: 181 -NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTT---RSYLKWDASSISDQSLN 236 N RF + E+ L++K L+ L+ L W+ Q+ + Sbjct: 190 GNDALPRFSRKHFTQDELRKLFEKALFGLFHYHLSPFGWKVSSGKRLNWNV----QQAPS 245 Query: 237 LLPRMETDITIRSSE-KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK--PEN 293 LP M+TDI +RS E +I ++DAK+ R+G E S +LYQL YL S + E Sbjct: 246 YLPSMQTDIILRSPEGEITVIDAKFTHLFTENRVGNESIKSSHLYQLYAYLRSQETFSEE 305 Query: 294 GENIGGLLIYPHVDTAVKHR---YKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 E G+++Y + ++G + + L ++ L + Sbjct: 306 WETAQGIMLYASTGQNQADEALSFFLDGHPVTFAGIGLETSIREFREKALMLV 358 >UniRef50_B7D079 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Burkholderia pseudomallei 576 RepID=B7D079_BURPS Length = 294 Score = 250 bits (638), Expect = 6e-65, Method: Composition-based stats. Identities = 64/271 (23%), Positives = 119/271 (43%), Gaps = 11/271 (4%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDYN 63 IPVRN++ ++ YA + + N ++ D++ +L V Q RR + Y Sbjct: 18 RIPVRNLWLLMLYASDLTRIKEVFNALVEDDLEDIPDLVAKLLAHTVEQRLRRNVTRGYQ 77 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAI---LIKHE 120 + + ++GRI+ +T L+ G+ F+ L +T NR++++ L + L++ Sbjct: 78 HRAQSLTRVRGRIDILRTEAQQLLSRGEVYCRFEELTANTPRNRLVRAALDLLASLVRDR 137 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 L R A +L R GI + + + G+N + + K + ++P + Sbjct: 138 DLARQCRSLAAALGRS--GIVGVRPSRAELAQDQIGRNDHDDWLMAELAKLAFDLALPTE 195 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQSLNL 237 G ER + + L++K + F R EL + + W S+ SD + + Sbjct: 196 EAGPTTLVSPERGDVYVRRLFEKAVLGFARVELERIGWRVRGGTCMNWQVSAASDGAAEI 255 Query: 238 LPRMETDITIR--SSEKILIVDAKYYKSIFS 266 LP M TDI I S+ + L++D K+ S Sbjct: 256 LPGMITDIIIDDLSAGRRLVIDTKFTLSTSD 286 >UniRef50_B4V8L4 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V8L4_9ACTO Length = 404 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 59/311 (18%), Positives = 121/311 (38%), Gaps = 17/311 (5%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +P+ ++++L Y+ + ++ +L L + + + + R+GL Y Sbjct: 73 PKVPIARLFFLLGYSLDPKGGWRGGQVDVGEHREVLPALAHAVERQTDRALRQGLLQGYR 132 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E ++GRI A IR +D D NRI+++ + L++ + Sbjct: 133 ATEESALVVRGRIREADQIRRRFGVVLPVEVAYDEYTTDIAENRILRAAVERLLRLPGVP 192 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 +R +L ++ L S+ + Y+ + + + +++ S P G Sbjct: 193 REVRRRLLHQRARLADVTPLVPGQPLPSWQPS-RLNTRYQPALHLARAVLDGSSPEHVPG 251 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 R F + M+ L++ F+ R L ++ T + D + ++ RM+ Sbjct: 252 GLRIDGFLFD---MNRLFEDFVTVALREALRGSDLTGALQ--DPHHLDEEDAI---RMKP 303 Query: 244 DITIRSSEK--ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D + + + DAKY +R G + +LYQ++ Y +L G + Sbjct: 304 DFVLYGPDGAPRAVADAKYKA---EKRDG---YPDADLYQMLAYCTALGLPKGHLVYAKG 357 Query: 302 IYPHVDTAVKH 312 PH V+H Sbjct: 358 NAPHAAHRVRH 368 >UniRef50_B2A7U4 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A7U4_NATTJ Length = 383 Score = 225 bits (575), Expect = 1e-57, Method: Composition-based stats. Identities = 76/339 (22%), Positives = 141/339 (41%), Gaps = 27/339 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P I N++ ML+Y++ + + LLD L V V +L ++GL DY Sbjct: 67 EPKIDTANVFKMLSYSYDLI-FWHDEKAQFANIQELLDYLVLVFCNQVNRLIKKGLHADY 125 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + KGR+ + + K +D D L N+IIK T+ +L ++ + Sbjct: 126 VLVNDKLSYAKGRMNVRELVEKPW-EKHKIDCYYDNYQVDILENQIIKFTIDLLKRYIQ- 183 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 N+ IR + R +S +T + + ++YK + + CK + + Sbjct: 184 NNWIRRSLLNTNRYFDSVSLRPITVEDIDQVQYTTLNKHYKHIHNFCKMFLELMGINEQI 243 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G F F EM+ LY+K++ + + EL + K S+ Sbjct: 244 GETLFNQFHL---EMNNLYEKYVGKLLKEELPNNYCVILQDKLHLDEYDQISI------R 294 Query: 243 TDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI 302 DI I + D K Y I ++ G++ + ++YQ+ Y+ K + G+L+ Sbjct: 295 PDIVIYN-------DVKPYLVIDTKYKGSKDITNNDIYQMAAYMSKTKTD------GVLL 341 Query: 303 YPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLD 341 YP + Y ING + + T++L Q ++L++ Sbjct: 342 YPAQ-EVAETEYIINGRSLNIKTIDL-QNLDDGAKDLIN 378 >UniRef50_C5CG01 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CG01_KOSOT Length = 397 Score = 218 bits (556), Expect = 2e-55, Method: Composition-based stats. Identities = 72/350 (20%), Positives = 132/350 (37%), Gaps = 31/350 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEA-IPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + ++N+ +ML YA+ ++ L + L V K V+ ++RGL +Y Sbjct: 74 PRVELKNLLHMLEYAYNLKSFQFIEGIQDCETIEELYERLVKVFVKRVIDRTKRGLYREY 133 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P ++G++ I+ K + D N+II TL+ L+ + + Sbjct: 134 IQQNDRLPYVRGKLNVRSMIKQ--PWKVKLDCVYQDHTNDIEENQIILWTLSKLVMSDSI 191 Query: 123 NSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + R+ R YR L G I P + Y + + KF + NS P Sbjct: 192 SEGTRNLVRKAYRSLAGTIKVRPFKPSECIKRFYNRLNSDYLPIHVLAKFFLENSGPAVK 251 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 G + F N M L++KF+ E+ ++ L R+ K + + S N+ Sbjct: 252 SGTSKMIPFLVN---MPRLFEKFIAEWLKKNL-KGYIVRAQEKVNLDKENSLSFNI---- 303 Query: 242 ETDITIRS---SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 D+ I S E + ++D KY E+ ++ Q+ +Y Sbjct: 304 --DLVIYSGLTGEAVAVLDTKYKI--------NERPSDNDISQIASYAM-----TKNCTK 348 Query: 299 GLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 LIYP +D + I T L + ++D +++ Sbjct: 349 AFLIYP-IDMNPPVNVTVGSVAIRCLTFQLSGDLEANGMRMVDELMRFIE 397 >UniRef50_C7NQX2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQX2_HALUD Length = 421 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. Identities = 70/333 (21%), Positives = 131/333 (39%), Gaps = 46/333 (13%) Query: 4 PVIPVR------NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 P I VR N+ Y+L YA ++ G+ LD G + + ++ RG Sbjct: 79 PTIEVRPKAAGTNLLYLLQYAHDTTATTFESQAPYQAGHTFLDAFGALYEAELRRIVDRG 138 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL- 116 L DY ++GR++ + ++ T+D L D LANR I +L Sbjct: 139 LYTDYRRTDATESHLRGRLDIHRQLQRQPPVPTAFECTYDELTHDILANRAILHATTVLL 198 Query: 117 --IKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 + + ++R + L R+ +S +T Q + + +Y+ ++ + K ++ Sbjct: 199 GAVSDRSITQSLRQHQQLLRRQ---VSLTPVTIQDIERIELNRLADHYEDILRLTKLVIR 255 Query: 175 NSIPGQ-NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQ 233 NS + G + N M+ +++ + C+ L+ W+ D Sbjct: 256 NSFVSELQAGSSAAFAMLVN---MNTIFENAVERACKEVLSERE------DWEV-KFQDT 305 Query: 234 SLNLLPR------METDITIRSSEKI--LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY 285 S NL+ ++ DITI E L+ DAK+ E+ + + YQ+ +Y Sbjct: 306 SQNLITGGKHTVTLQPDITIYDPENTVSLVADAKWK---------NERPKNADFYQMTSY 356 Query: 286 LWSLKPENGENIGGLLIYPHVDTAVKHRYKING 318 + + N+ G+L YP + R + G Sbjct: 357 MLA------NNVPGILFYPDCGGLNESRSTVTG 383 >UniRef50_A6VF50 Putative uncharacterized protein n=2 Tax=Methanococcus RepID=A6VF50_METM7 Length = 416 Score = 212 bits (539), Expect = 2e-53, Method: Composition-based stats. Identities = 60/274 (21%), Positives = 117/274 (42%), Gaps = 17/274 (6%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 L +I + + L +RGL+ DY + + +KG+++F + I+ ++ + FD Sbjct: 130 LNEIFITMFLDELSVLIKRGLKSDYIETQKNLNVLKGKLKFKEHIKHNLIHKERFFVEFD 189 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 +D NRIIKSTL L K K +++ + + IS + F+ G+ Sbjct: 190 EFIKDMAENRIIKSTLKELSKRSKSGKNLKNISEYSFV-FDEISESKNIEKDFNACKSGR 248 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLY--QKFLYEFCRRELTS 215 Y+ ++ C+ + N F +F+ + +LLY +K + +L Sbjct: 249 LMVDYENILLWCRVFLKNE---------SFINFKGSNVAFALLYPMEKIFESYLTYKLKK 299 Query: 216 ANTTRSYLKWDASSI--SDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEK 273 + SY+K S + L + +++ DI + I ++DAK+ I Sbjct: 300 SGKF-SYVKAQDSRFFLVKEDLKKMFKLKPDIYAEKDDTIYLIDAKW--KILDVNSPNYG 356 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 ++YQL++Y + +++ L+YP D Sbjct: 357 ISQGDMYQLLSYAKIYENNCKKHVKMALVYPKTD 390 >UniRef50_A6EGF0 5-methylcytosine-specific restriction enzyme McrBC, subunit McrC n=1 Tax=Pedobacter sp. BAL39 RepID=A6EGF0_9SPHI Length = 232 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 76/231 (32%), Positives = 130/231 (56%), Gaps = 8/231 (3%) Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 L+ +++EA+ +Y +L GI + ++PQ FS + +N +YKF + V + I + + Sbjct: 2 LSKALKNEAKGIYNQLTGIKDILISPQKFSLVTIHRNNIHYKFPLQVGQLITAQTAIEER 61 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDAS-SISDQSLNLLPR 240 G+Y F DF+RN +M+ L++ F+ F RE +R ++W + S S +L L+P+ Sbjct: 62 NGNYFFQDFDRNHHQMARLFESFVRRFYMREQKRFKVSRENIEWRINESESTGNLALIPK 121 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI--- 297 M+TDI++ S E+ +I+D K+Y S ++ R + K HS +LYQL +YL +L+ ++ Sbjct: 122 MQTDISLISPERKIIIDTKFYLSAYNSRYDSPKLHSSHLYQLYSYLCNLEEQSLSRNGGA 181 Query: 298 ----GGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 G+L+YP A+ YKI I + T+NL W IH L+ + D Sbjct: 182 NKIYEGILLYPKNGIALDESYKIGSHRIKIYTINLEGPWQDIHDRLISLLD 232 >UniRef50_UPI0001BCB4AE McrBC 5-methylcytosine restriction system component n=1 Tax=Fusobacterium periodonticum ATCC 33693 RepID=UPI0001BCB4AE Length = 444 Score = 210 bits (536), Expect = 5e-53, Method: Composition-based stats. Identities = 78/377 (20%), Positives = 158/377 (41%), Gaps = 41/377 (10%) Query: 4 PVIPV--------RNIYY-MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLS 54 P IP+ +N + +L + ++ + AI ++L+I + K V ++ Sbjct: 66 PKIPLVENDIVAEKNRFLEILQNISYFKEKFFNDSKIAIADTSILEIFINLFIKEVEEII 125 Query: 55 RRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLA 114 +GL Y E I KG+++ I+ + K FD + ++L N IIK T+ Sbjct: 126 EKGLLYRYIGRNENISVFKGKLDINNHIKYNFSHKEKFFMKFDEFSINSLENSIIKLTIQ 185 Query: 115 ILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 L K +N +++ + I L + ++ Y+ + YYK I K +N Sbjct: 186 KL-KKISVNLKNKEKLNKISHHFENIIILPNSIENLKYITFDRTNDYYKNSIQWSKIFLN 244 Query: 175 N---SIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCR--------RELTSANTTRSYL 223 N I G F M +++ ++ +L S Sbjct: 245 NQSSLIFSATNGEVATMLF-----PMETIFENYIANKLINIVKEKFYNQLIVKVQDDSCS 299 Query: 224 KWDASSISDQSLNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQ 281 + ++++D LN + ++ DI I ++S++I I+D K+ I + K + ++YQ Sbjct: 300 AFSTATLNDTKLNNMFNVKPDIVIKNKNSKEIFILDTKW--KILDKLDNKFKISTDDIYQ 357 Query: 282 LMNYLW--SLKPENGENI-GGLLIYPHVDTAVKH-------RYKINGFDIGLCTVNLGQE 331 +++Y+ + + +N LIYP + ++K + F++ +C VNL E Sbjct: 358 MLSYVKIYNDRYKNSYTCEKAYLIYPATNIRKNSFSSEDKIKFKTDNFELNICFVNLSSE 417 Query: 332 WPCIHQELLDIFDEYLK 348 ++L++I +++K Sbjct: 418 -ETTEKDLVNILSKFIK 433 >UniRef50_C9ZD45 Putative uncharacterized protein n=3 Tax=Streptomyces RepID=C9ZD45_STRSW Length = 415 Score = 210 bits (534), Expect = 8e-53, Method: Composition-based stats. Identities = 57/331 (17%), Positives = 122/331 (36%), Gaps = 20/331 (6%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +P+ ++++L + + ++ +L+ L + + + V + R GL Y Sbjct: 81 PKVPIARLFFLLGFGLDPKGSWRDGEVDVAEHRDLVPALAHAVERQVDRALRPGLLQGYR 140 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E ++GR+ A+ IR D D NRI+++ + L++ + Sbjct: 141 ATEETALVVRGRLREAEQIRRRFGAALPVEVVHDEFTTDIAENRILRTAVERLLRLPGVP 200 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 +R +L ++ + +L + Y + + + +++ + G Sbjct: 201 RDVRRRLSHQRGRLAEVTAIVRGQSVPDWLP-TRLNTRYHHALRLARAVLDGVSAEHSPG 259 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 R F + M+ L++ F+ R + + + D + + + RM Sbjct: 260 GLRIDGFLFD---MNKLFEDFVTVALREAFRTTGSGHTARLQDPHHLDEAATI---RMRP 313 Query: 244 DITIRSSEK---ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 D + + +VDAKY + + +LYQ++ Y +L G + Sbjct: 314 DFVLYGPDGGAPCAVVDAKY------KAERRGGYPDADLYQMLAYCTALGLREGHLVYAK 367 Query: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQE 331 PH V+H G I ++L QE Sbjct: 368 GNAPHAAHEVRHA----GILIHQHALDLDQE 394 >UniRef50_Q9RZI4 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RZI4_DEIRA Length = 442 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. Identities = 60/315 (19%), Positives = 124/315 (39%), Gaps = 22/315 (6%) Query: 32 AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGK 91 L +++ +GV RRG+ Y P E PG++GR++ + +R Sbjct: 134 QTAHMPLYEVVIRYALEGVRAAVRRGIPHAYVPVQEERPGLRGRLDLPRQVRQPPHRAHL 193 Query: 92 TVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFS 151 T+D D R+ + ++ L +L + R AR L L + F+ Sbjct: 194 LHVTYDEFLPDRPETRLTRLSVERLAALTRLPANQR-LARELLHALDEVPPSRNVNVDFA 252 Query: 152 YLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRR 211 G+ ++ + ++C+ ++ P + + + M+++Y+ ++ + RR Sbjct: 253 AWRLGRGHTHFAPLEALCRMVLYELNPIVAGKKTQAHALLFD---MNVVYEAYVAQLLRR 309 Query: 212 ELTSANTTRSYLKWDASSISDQSLNLLP--RMETDITIRSSEKILIV-DAKYYKSIFSRR 268 + + + + LP R+ D+ +R+ E +IV D K+ K + + + Sbjct: 310 LYPTWTVAT-----QVTQRALGDADGLPAFRLRPDLLLRTEEGQVIVADTKW-KRLEADK 363 Query: 269 MGTEKFHSQNLYQLMNYLWSLKPENGENIGGL-LIYPHVDTAVKHRYKI---NGFDIGLC 324 T + + YQ++ Y + + L LIYP + I G + + Sbjct: 364 APTYDVANADAYQMLAYSEAFQHSAAYTHKALWLIYPRLPGLPPVSAPIRLGQGRTLSIV 423 Query: 325 TVNLGQ-----EWPC 334 T++L Q +WP Sbjct: 424 TIDLNQADPGAQWPA 438 >UniRef50_B1BM79 Putative uncharacterized protein n=1 Tax=Clostridium perfringens C str. JGS1495 RepID=B1BM79_CLOPE Length = 425 Score = 205 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 71/322 (22%), Positives = 133/322 (41%), Gaps = 27/322 (8%) Query: 33 IPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKT 92 + N L+DI+G + + + + +RG+ +Y I IKG++ K + N K Sbjct: 109 LKNNPLIDIMGEIFYRDLSRELQRGIYSEYVSVENSIGNIKGKLLVTKHSKVNRFNKNKA 168 Query: 93 VSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSY 152 +D ED NRI+K L+ L++ E N ++ R L R +S + + Sbjct: 169 YCAYDEFTEDNFFNRILKKALSYLLR-EVRNERLKSNLRVLDRSFEEVSDKFINKHALNR 227 Query: 153 LNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRE 212 + +K + K I+N S+ +KG + EM+ LY++++ R Sbjct: 228 YKLNRRNERFKNSFELAKMILNGSMGDNSKGKEFGFTLLF---EMNYLYEEYIGVVLREV 284 Query: 213 LTSANTTRS------YLKWDASSISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIF 265 ++ N S YL ++ ++ ++ DI I E +I+D K+ K Sbjct: 285 ISEENIFVSTQEKTKYLLYNKKRKREEIA-----LKPDIVIYKDETPKIIIDTKWKK--- 336 Query: 266 SRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKI---NGFDIG 322 R G E + ++YQ+ Y+ S K E +++YP + + + I Sbjct: 337 GSRNGKENYSQGDVYQMYAYITSYK----ECEKCVILYPKEEDEGNIIWNLKGYQDKKIF 392 Query: 323 LCTVNLGQEWPCIHQELLDIFD 344 + +V+L + + L DI Sbjct: 393 MRSVDL-SSYERTKEILKDIVK 413 >UniRef50_Q10YL1 McrBC 5-methylcytosine restriction system component-like n=2 Tax=Oscillatoriales RepID=Q10YL1_TRIEI Length = 405 Score = 200 bits (509), Expect = 5e-50, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 123/312 (39%), Gaps = 38/312 (12%) Query: 3 QPVIPVRNIYYMLTYAWGY-----LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 +P +P+ N++ ML YA+ L + N N L++IL + +L+ R+G Sbjct: 76 KPKVPLHNLFGMLEYAYNLRSFCFLDGLVNCNSLQEFYNCLVNILA----QKILERGRKG 131 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILI 117 Y P TE + I+GR+ + + + + N+I+ TL I+ Sbjct: 132 FHRAYLPKTENLTYIRGRLNMRQVMHK--PWGVSLKCDYQEHTANIPDNQILAWTLFIIS 189 Query: 118 KHEKLNSTIRDEARSLYRKLPGISTLHL-TPQHFSYLNGGKNTRYYKFVISVCKFIVNNS 176 + + + + L G+ TL + + Y+ + +C+F ++N Sbjct: 190 RSSFCSEKVAVTVTRAFHILQGLVTLQPFKSSDCLNIKYHRLNEDYQVLHGLCRFFLDNI 249 Query: 177 IPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLN 236 +G+Y F + M+ LY+KF+ ++ + L+S + K + ++ Sbjct: 250 GASHQQGNYSMLPFLID---MAKLYEKFVAKWLKLHLSSNLRVKEQEKVEI-------VD 299 Query: 237 LLPRMETDITIR---SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 + D+ I + + + I+D KY + + ++ Q++ Y Sbjct: 300 DKIYCKIDLVIYEIKTCKVVYILDTKYKLDC--------RPSTDDINQVVAYAT-----Y 346 Query: 294 GENIGGLLIYPH 305 + +LIYP Sbjct: 347 KKCHEAILIYPQ 358 >UniRef50_D2QXZ3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QXZ3_9PLAN Length = 435 Score = 197 bits (501), Expect = 5e-49, Method: Composition-based stats. Identities = 67/304 (22%), Positives = 119/304 (39%), Gaps = 23/304 (7%) Query: 20 GYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFA 79 L+ A G NLLD++ + + ++ R GL DY E +P ++GR+ F Sbjct: 94 NTLKHYSSVLKLAAGGGNLLDLIMLMFVEECERILRGGLLSDYVEEEEELPVVRGRMLFD 153 Query: 80 KTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPG 139 + IR D +D N+++ L + +H T+R +AR L L G Sbjct: 154 RQIRKRLGRLDLIHCRHDERKQDVPENQLLAYVLDVCARH-AFQPTLRRKARQLEHHLLG 212 Query: 140 ISTLHLTPQHFSYLN----GGKNTRYYKFVISVCKFIVNNSIPGQ--NKGHYRFYDFERN 193 + + +Y+ +C IV+ + G R + F + Sbjct: 213 SCD--PSLLDLVTTRGGIYYDRMNEHYRDAHELCWLIVDALGISDIYSSGSSRIFAFLLD 270 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNL-LPRMETD--ITIRSS 250 M+ L++ F+ + R+ ++ SY D S I D + N + D I S Sbjct: 271 ---MNRLFEVFVLQVLRQLTSTTALKVSYQSSDRSIIRDSATNQPYSSVIPDFLIAAPSL 327 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL-KPENGENIGGLLIYPHVDTA 309 L++DAKY + + ++YQ Y ++ + + G ++ G LIYP T Sbjct: 328 SGKLVLDAKY------KLYDAAGVSNSDIYQSFFYAYAFGRHQLGGHVAG-LIYPSESTT 380 Query: 310 VKHR 313 + Sbjct: 381 ASRK 384 >UniRef50_Q4C3F9 Similar to McrBC 5-methylcytosine restriction system component n=4 Tax=Chroococcales RepID=Q4C3F9_CROWT Length = 294 Score = 191 bits (485), Expect = 4e-47, Method: Composition-based stats. Identities = 54/267 (20%), Positives = 110/267 (41%), Gaps = 20/267 (7%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 L + L + +L ++ L Y ++ + +KG+I+ I+ H + Sbjct: 6 TLYNRLATIFAHRILNRIQKELYSTYIKQSQELNYVKGKIDIKTMIK--HPWKPTLTCQY 63 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNG 155 D +D N+I+ T+ I+ + + + R R +Y L G ++ +T Sbjct: 64 DNFTQDIEDNQILLWTIYIISRQQICQANTRILIRKVYHALQGYVTLSPVTANDCINRKC 123 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTS 215 + Y + S+C+F + N+IP +KG+Y F + M+ LY+KF+ + + L Sbjct: 124 NRLNEDYHGLHSLCRFFLENTIPSHDKGNYNSLPFLVD---MNQLYEKFVAAWLIQHLPP 180 Query: 216 ANTTRSYLKWDASSISDQSLNLLPRMETDITIR---SSEKILIVDAKYYKSIFSRRMGTE 272 ++ + + S S + D+ I + E + ++D KY I + E Sbjct: 181 HLGIKTQHRVEYDSFS---------FKIDLIIYNKETQENLYVLDTKYKTKI--KTSDIE 229 Query: 273 KFHSQNLYQLMNYLWSLKPENGENIGG 299 + + Q N+ ++P N + I Sbjct: 230 QIIAYTFQQNCNHAIIIEPTNNKPINA 256 >UniRef50_C7PTE7 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTE7_CHIPD Length = 423 Score = 190 bits (484), Expect = 5e-47, Method: Composition-based stats. Identities = 62/313 (19%), Positives = 130/313 (41%), Gaps = 29/313 (9%) Query: 22 LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKT 81 + +++ANL + N++LDI + + V L RRGL Y +T + +KG+++F K Sbjct: 105 IDHVEKANLN-LRSNSILDIYIRLFLEEVEVLLRRGLIKKYKRHTANLTTLKGKLDFGKH 163 Query: 82 IRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGIS 141 I ++ + + + + L N+II L LI N+++ D+ + P + Sbjct: 164 ISANLIHKERFFVEHTIYSHENLFNQIINEVLK-LIPLLVSNTSLNDKLGRIRLDFPELP 222 Query: 142 TLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLY 201 + +T F + + + Y+ + + + ++ N P G + M+ L+ Sbjct: 223 AIKVTAATFDKIQYDRKSSTYQPALEIARLLLLNYRPDITGGTNNVIAILFD---MNELW 279 Query: 202 QKFLYEFCRRELTSANTTRSYLKWDASS-ISDQSLNLLPR-METDITIRSSEKILIVDAK 259 +++++ R+L N+ +K S ++ L P+ + DI +R EK +++D K Sbjct: 280 EEYIF----RKLQRLNSEGIEVKRQQSQHFWKRNGALYPKSVRPDIVLRKGEKTIVLDTK 335 Query: 260 YYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKIN-- 317 + K ++L Q+ Y E +L+YP ++ + Sbjct: 336 WKLI------PDYKPTDEDLKQMFVY-----NLYWECSHSVLLYPADRYHLETGAYFDFS 384 Query: 318 -----GFDIGLCT 325 G + T Sbjct: 385 RQTAAGNSCSVAT 397 >UniRef50_Q6LZ73 Putative uncharacterized protein n=1 Tax=Methanococcus maripaludis RepID=Q6LZ73_METMP Length = 426 Score = 190 bits (482), Expect = 9e-47, Method: Composition-based stats. Identities = 60/271 (22%), Positives = 113/271 (41%), Gaps = 19/271 (7%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 L +I + + L + GL+ DY + + +KG+I+F + I +++ + +D Sbjct: 129 LNEIFIKIFLDDLDVLVKNGLKSDYISIEDNLNVLKGKIKFNEHISKNYIHKERFYVNYD 188 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + NRIIKST+ L+K+ LNS ++ L+ I + F+ + Sbjct: 189 EFIRNRPENRIIKSTIKYLLKNSSLNSNLKRINEFLFIM-DAIPESKNLEKDFAACVNNR 247 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLY--QKFLYEFCRRELTS 215 YK +I CK + N F +F+ +E +LLY +K + E Sbjct: 248 LMTDYKKIIPWCKVFLKNE---------SFTNFKGDEIAYALLYPMEKIFESYLTEEFKK 298 Query: 216 ANTTRSYLKW-DASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKF 274 + + + + ++ + R++ DI +S K I+DAK+ K + S + Sbjct: 299 SGKFETIVSQGNGYFLAKHKNEGIFRLKPDIYAETSSKKYIMDAKW-KILNSDKNKNYGI 357 Query: 275 HSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 ++YQL++Y L+YP Sbjct: 358 SQNDMYQLLSYAVVY-----GCNELRLLYPK 383 >UniRef50_A3UV38 Putative uncharacterized protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UV38_VIBSP Length = 441 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 55/324 (16%), Positives = 116/324 (35%), Gaps = 11/324 (3%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + +++ ML G+ + L LL++ VL L ++GL+ DY Sbjct: 94 NPTLARKSLLVMLRALKGFSHIQTSSALIHEEKMPLLEVFIGQFINSVLNLVKKGLKSDY 153 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + KG++ A +R + K + D ANR++ S L I+ K Sbjct: 154 VKTVDNLVYQKGKLVSAGQLRNNLVTKHKFYCEYQEYLVDRPANRLLHSALNIVAKL-SR 212 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + + + + L+ + S L + +Y I+ I+N P K Sbjct: 213 SPKHKKQLQELFFIFEEVPLSRDYKSDLSRLRLDRGMSHYHTPIAWATLILNGFSPQTMK 272 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G + M +++ ++ + R+++ +S +K S+ + +++ Sbjct: 273 GSNQAISLLF---PMERVFEDYVAKVLRQQVPDDFVVKSQVK--RKSLVEHKQASWFKLQ 327 Query: 243 TDITIRSSEKIL-IVDAKYY--KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 D+ + ++ ++D K+ + YQ+ Y ++ + Sbjct: 328 PDLLLEKGGSVVSVLDTKWKLVDQTKDNGSDKYGLSQSDFYQMFAYGQHYFDDSSDEREM 387 Query: 300 LLIYPHVDTAVKHRYKINGFDIGL 323 LIYP D FD + Sbjct: 388 FLIYPAHDGF--ETAIEQSFDFNI 409 >UniRef50_Q26DA3 Putative uncharacterized protein n=2 Tax=Flavobacteria RepID=Q26DA3_9BACT Length = 416 Score = 178 bits (453), Expect = 2e-43, Method: Composition-based stats. Identities = 52/268 (19%), Positives = 111/268 (41%), Gaps = 22/268 (8%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 NLL++ + + + L R+GL Y T+ +KG++EFA I+ ++ + +T Sbjct: 110 NLLEVYFELYLQELESLVRKGLIKQYRKQTKNTKALKGKLEFAGHIKSNIVHKERFYTTH 169 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 + + + ++++ LAI+ + S ++D A + P + +T +H + ++ Sbjct: 170 QVYDSNHFLHQVLSKALAIVGQFTN-GSRLQDLASRVQLNFPEVDNKAITAKHLNEMSLN 228 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 + T YK + + + I+ N P + G + + M+ L++ ++ L Sbjct: 229 RKTLSYKNALELARLIILNYSPDISSGKEKMLSLLFD---MNELWETYI-------LKQL 278 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHS 276 + + S +S + DI +R S K I+D K+ +R Sbjct: 279 QKASIGFEIEVSGQESKSFWANNSLRPDIVLRKSGKTYIIDTKW------KRPNKSTASV 332 Query: 277 QNLYQLMNYLWSLKPENGENIGGLLIYP 304 +L Q+ Y + +L+YP Sbjct: 333 NDLRQMYTYCR-----FWDAEKAMLLYP 355 >UniRef50_A6EMU6 5-Methylcytosine-specific restriction enzyme C n=1 Tax=unidentified eubacterium SCB49 RepID=A6EMU6_9BACT Length = 414 Score = 177 bits (449), Expect = 5e-43, Method: Composition-based stats. Identities = 54/288 (18%), Positives = 116/288 (40%), Gaps = 23/288 (7%) Query: 22 LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKT 81 +Q++ A + +LLDI + V +L RRGL Y ++ + +KG++EFA+ Sbjct: 102 VQQVGNAQVNK-QSIHLLDIYFDWFLREVQELCRRGLIKKYYKESKNVKSLKGKLEFAQH 160 Query: 82 IRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGIS 141 + ++ + ++ + N+D ++II L ++ K + + + + + P +S Sbjct: 161 LNKNLIHKERFYTSHQIYNKDHKLHQIINQALEVIELVSK-GTYLYSKCKEVRLNFPEVS 219 Query: 142 TLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLY 201 T+ FS LN + YK I + + I+ N P + G + M+ L+ Sbjct: 220 TIKCNESTFSKLNFNRKNSPYKTTIEIARLIILNFAPNVSTGSENMLALLFD---MNNLW 276 Query: 202 QKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYY 261 ++++ + + + + + + DI + + I+D K+ Sbjct: 277 EEYVLLKLKEATRDLDI-------EVHGQNRKPFWNGITIRPDIVVSQGDTTCIIDTKW- 328 Query: 262 KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 + K ++ +L Q+ Y E + LL+YP Sbjct: 329 -----KNNRDNKPNTNDLRQMYVY-----NEYWQGKNALLLYPSASKD 366 >UniRef50_C3FCB5 McrBC 5-methylcytosine restriction system component n=3 Tax=Bacillus cereus group RepID=C3FCB5_BACTU Length = 439 Score = 176 bits (447), Expect = 9e-43, Method: Composition-based stats. Identities = 51/278 (18%), Positives = 100/278 (35%), Gaps = 14/278 (5%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 NL ++ + + V +L ++GL Y P + + KG++ + I+ ++ + + Sbjct: 128 NLYELFISMYIQEVRELIKKGLRSSYFPQVDNVNYFKGKLIIREQIKKNQVHKERFYVEY 187 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 D + NR+IKSTL L K + I+ R L I + FS + Sbjct: 188 DEYGINRPENRLIKSTLLKLQKLSNSAANIK-RIRQLLPNFEKIKPSINYKKDFSKVVID 246 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 +NT+ YK ++ K + N G M +++ ++ ++ L+ Sbjct: 247 RNTKDYKTLMQWSKVFLINQSFTTFSGETNARALLF---PMEKVFEAYVARNLKQVLSDL 303 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFH 275 S + + DI I R +I+D K+ K + Sbjct: 304 MWEVSIQDKGYYLFNSPKR---FALRPDIVIMREDGSRVILDTKW-KKLVDNPNRNYGIS 359 Query: 276 SQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHR 313 ++YQ+ Y + + L+YP + Sbjct: 360 QADMYQMYAYSKKYEAQEIW-----LLYPLNEGMKDTD 392 >UniRef50_C7NBA8 McrBC 5-methylcytosine restriction system component n=4 Tax=Bacteria RepID=C7NBA8_LEPBD Length = 432 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 59/336 (17%), Positives = 126/336 (37%), Gaps = 20/336 (5%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML + ++ + + NL +I + + + +L + G++ DY + + K Sbjct: 111 MLRSMRDFPSKVFNNSNIQVERMNLYEIFINMYLQEIRRLIKIGIKSDYIFKEDNLNYYK 170 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 G++ ++ + ++ + +D N + + N++IK+TL L K + E R L Sbjct: 171 GKLLTSQHFKINLVHKERFYVAYDEFNPNRVENKLIKATLLKLQKLTTSAENSK-EIRQL 229 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 I FS + ++ R Y+ ++ K + N G+ + Sbjct: 230 LVFFEIIDASMNYTADFSKVRINRSNRDYEMIMQWSKVFLLNKSFTTFSGNNNSRALLFS 289 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKI 253 M +Y+ ++ + ++ L S + L + DI + E+ Sbjct: 290 ---MEKVYESYVAKHLKKILGEDGWNVSSQDRGYYLFTK--PRLQFALIPDIVCKRGERT 344 Query: 254 LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHR 313 +I+D K+ K + + R+ ++YQ+ Y K L+YP D +H Sbjct: 345 IIMDTKWKKLVNNERI-NYGISQSDMYQMYAYSKKYKASEIW-----LLYPLNDEMKEHS 398 Query: 314 YKI----NGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 +G + + V+L I L + D+ Sbjct: 399 EISFNSGDGTTVNIYFVDL----ENIESSLEVLRDK 430 >UniRef50_D1SH86 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SH86_9ACTO Length = 419 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 56/344 (16%), Positives = 124/344 (36%), Gaps = 22/344 (6%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P I + + ++L YA + + + P ++L+ + Q GL Y Sbjct: 66 RPKIGIARLLWLLGYARDP-RGWRTEPVGLTPEHDLVPAMAVAFATATYQALAPGLLQGY 124 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 E +P ++GR+ A +R +D + D N+I+ S + L + + Sbjct: 125 RTVEEALPLVRGRLREADQLRTRPGLALPVEVRYDDYDTDIPENQILLSAVRRLHRLPGV 184 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + + L ++ L + + Y+ + + + I+ + Sbjct: 185 PPATCNALHRIAAALADVTPLTAGAPIPEA-SSNRFNNRYQPALRLARLILAGESIEHSH 243 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G F + ++ +++ +L R + + + D+ LP + Sbjct: 244 GSTLAAGFVFD---LNTVFEDWLTTALRHAVETRYGGTVTGQHQMHLDRDRR---LPLVR 297 Query: 243 TDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 DIT + L ++DAKY ++YQL+ Y +L G L Sbjct: 298 PDITWWHGRQCLAVIDAKYKA-------PGNTPPRDDIYQLLAYCTTLNLP-----RGHL 345 Query: 302 IYPHVD-TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 +Y +V ++ +G I + V+L +H+++ + + Sbjct: 346 VYASAGAESVTYQLTGSGVHIVVHRVDLAAPVTHLHEQIEKVAE 389 >UniRef50_A6ALR1 5-Methylcytosine-specific restriction enzyme C n=4 Tax=Vibrionaceae RepID=A6ALR1_VIBHA Length = 434 Score = 175 bits (444), Expect = 2e-42, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 100/269 (37%), Gaps = 12/269 (4%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LL++ + V +L +RGL+ DY + + KG++ + +R +N K +D Sbjct: 129 LLEVFIEQFLQSVNRLVKRGLKSDYVTQVDNLNYQKGKLLVGQQLRRNLINQHKFYVEYD 188 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + ANR+I + L L+ + + S R R L + Q S L + Sbjct: 189 EYLINRPANRLITTALTKLVSYTRSPSNQR-LLRELQFAFVDVPVSKSVKQDLSALKLDR 247 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 + Y I+ K I+ P KG M +++ ++ R L Sbjct: 248 SMLDYHVPIAWAKLILEGFSPLSMKGESSALSLMF---PMEAVFESYVASVLRSSLPENV 304 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHS 276 + + A + + ++ D+ + +K +++D K+ + Sbjct: 305 ELTTQAR--AKHLVKHNGKAQFQLMPDLLMTLPDKSQVVLDTKW--KLLDFEAHNYGISQ 360 Query: 277 QNLYQLMNYLWSLKPENGENIGGLLIYPH 305 ++YQ+ Y +GE LIYP Sbjct: 361 SDMYQMFAYGHKYLKGSGEL---YLIYPA 386 >UniRef50_Q0KFR0 5-Methylcytosine-specific restriction enzyme C n=3 Tax=Proteobacteria RepID=Q0KFR0_RALEH Length = 450 Score = 175 bits (443), Expect = 2e-42, Method: Composition-based stats. Identities = 47/272 (17%), Positives = 105/272 (38%), Gaps = 13/272 (4%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LL++ + V + +RGL DY+ + + ++G++ A ++ + + D Sbjct: 130 LLEVFVAEFLRAVDHIVKRGLRSDYSSRQDNLYALRGKLLIAPHLQQNLYRADRFFTDHD 189 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 D NR++ + L ++ E S + AR L + F + + Sbjct: 190 EFTIDRPENRLLHAALRRVL--ELSASQLNQLARELAFVFAEVPVSAQPQIDFQRVRLDR 247 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 YY ++ + I++ P G + M +++ F+ + ++L Sbjct: 248 GMGYYADALAWARLILDEESPLTGAGAHCAPSMLF---PMEAVFEAFVAKHLAKQLARPL 304 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSSEK-ILIVDAKYY--KSIFSRRMGTEKF 274 +S + + + R++ D+ IR++++ +L++D K+ + + Sbjct: 305 ILKSQAR--SHHLVRHREQNWFRLKPDLLIRNADRDLLVLDTKWKLLDGMKANGTDKYGL 362 Query: 275 HSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 + YQL Y S G+ + LIYP Sbjct: 363 SQSDFYQLQAYGQSYLSGRGDVV---LIYPKT 391 >UniRef50_C5EXA5 Putative uncharacterized protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EXA5_9HELI Length = 552 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 49/257 (19%), Positives = 102/257 (39%), Gaps = 6/257 (2%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LL++ + + +L RGL+ DY + +KG++ F + I+ ++ + + D Sbjct: 196 LLEVFIQMFLAELERLIHRGLKSDYREIAQNRVFLKGKLLFNEQIKHNLIHKERFFTQSD 255 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + ++ NR+IK TL L + L+ R + SLY I+ + F+ + Sbjct: 256 EYSLNSAPNRLIKCTLEFL-RTLSLSPKTRTKLDSLYFIFEEITPSSHIDRDFAKCKSMR 314 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 + Y+ V+ C + G R + M L++ F+ + R + Sbjct: 315 RFKEYELVLLWCAIFLQQKSFSAYSGSERAFALLF---PMERLFESFVGHWLGRSIEHHE 371 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 + + D + +++ D+ +RS +ILI+D K+ + Sbjct: 372 I--KLQEQRYYFMQDFQKVDIFQLKPDVIMRSESEILILDTKWKIPDSTNDEKRYGIAQS 429 Query: 278 NLYQLMNYLWSLKPENG 294 ++YQ+ Y E+ Sbjct: 430 DVYQMWAYASKYALEST 446 >UniRef50_D1WRX1 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptomyces sp. ACT-1 RepID=D1WRX1_9ACTO Length = 429 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. Identities = 51/327 (15%), Positives = 114/327 (34%), Gaps = 20/327 (6%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +PVR +++++ YA ++ ++ L + + + R+G+ Y Sbjct: 70 PKVPVRRLFFLIGYAADPRVHRD-GEVDVTEDEEIVPALAQGFERALERALRQGVLQGYR 128 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E P ++GR+ A + H +D D NR+++ + L+ ++ Sbjct: 129 HTEEASPVVRGRVREADQVNRHHGRSFPVEIAYDDYGTDIAENRLLRGAVERLLPLHRVP 188 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 +R R +L L ++ + Y + + + I+ + G Sbjct: 189 GDVRRRLRHHRARLLDAEPLGRGARYLPRWRPSRLNHRYLPALRLAETILRGASVEHGTG 248 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + + +++ F+ R L + ++ M Sbjct: 249 GASVDGYLIDTH---KVFEDFVCVALREALARYGGRAALQARGVYLDDAGEIS----MRP 301 Query: 244 DITIRSSEK--ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D+ ++ + DAKY + E F +LYQ++ Y +L +G + Sbjct: 302 DLVWYGEDRTPRAVADAKY------KAEKPEGFPDADLYQMLAYCTALGLRDGHLVYARG 355 Query: 302 IYPHVDTAVKHR-YKINGFDIGLCTVN 327 P V V+H +I+ + T++ Sbjct: 356 YEPTVTHQVRHSQIRIHQHAL---TLD 379 >UniRef50_UPI0001972FC4 McrBC 5-methylcytosine restriction system component n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001972FC4 Length = 431 Score = 167 bits (424), Expect = 4e-40, Method: Composition-based stats. Identities = 46/316 (14%), Positives = 108/316 (34%), Gaps = 23/316 (7%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 NL ++ + V + +RGL Y E KG++ F++ IR + + ++ + Sbjct: 129 NLFEVFIRMFVDEVFSIVKRGLRCSYELTEENTSFFKGKLLFSEQIRHNYSHRERSHVEY 188 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 + N+++K+TL L + + R + + L G+ Sbjct: 189 GDFTANRPENKLLKATLLRLYRQTS-SQKNRSDIKILLTAFSGVEASTDCKGDLVRYIPD 247 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 +N + Y+ + C+ + G + M +L++ + +++L + Sbjct: 248 RNMKGYRTALMWCRIFLTGKSFASFAGSEQAPALLF---PMEVLFESYTAALLKKKLDGS 304 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKF 274 + L + DI + +S I ++D K+ + Sbjct: 305 RFAVLVQDKTHYLFDEPGKKFL--LRPDIVVKRKSDGAIFVLDTKWKVLDLGKT--NYGI 360 Query: 275 HSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKI----NGFDIGLCTVNLGQ 330 ++YQ+ Y E+ L+YP + R +G + + ++L Sbjct: 361 SQADMYQMFAYQKKYGAEH-----MTLLYPETEKVPPDRRIEFRADDGAAVLVKFIDLFH 415 Query: 331 EWPCIHQELLDIFDEY 346 + L ++ + Sbjct: 416 P----EESLTNVIQSF 427 >UniRef50_A4J0H3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J0H3_DESRM Length = 334 Score = 167 bits (422), Expect = 8e-40, Method: Composition-based stats. Identities = 59/346 (17%), Positives = 125/346 (36%), Gaps = 36/346 (10%) Query: 4 PVIPVRNIYYMLTYAWGY--LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 P + + NI+ ML YA+ + + +E + L +L +L +R+GL + Sbjct: 11 PRVKLSNIFTMLEYAYRLKSFRILDGM-VECDSLQEFYERLAMILAGMILNRNRQGLYRE 69 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y + +P I+G++ + + D N+I+ TL ++ Sbjct: 70 YREQVDKLPYIRGQLNIRHQLVK--PWEVGFSCHYQEHTADIEDNQILTWTLNRILYSGL 127 Query: 122 LNSTIRDEARSLYRKL-PGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + YR L S + L P FS + + Y+ + ++C+F + PG Sbjct: 128 CSDRGLPVIKKAYRSLLSQTSLIPLDPGRFSSRVYSRLNQDYRPLHALCRFFLEQCGPGY 187 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G + F + M L++ F+ ++ R L + + + + ++ Sbjct: 188 EVGDHSMIPFLVD---MPRLFELFVAQWLRTYLPPEYEITPQERVEIGENGELTFSI--- 241 Query: 241 METDITIRSSEKIL---IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 D+ + ++D KY + ++ Q++ Y + Sbjct: 242 ---DMVLYRKRDETAMCVMDTKYKSAATP--------TQADINQVVTYAVA-----KGCR 285 Query: 298 GGLLIYPHVDTAVKHRYKINGFDIGLCTV--NLGQEWPCIHQELLD 341 +LIYP + +K DI + T+ L + L++ Sbjct: 286 DAVLIYPSSNIRP---FKETIGDITVKTLAFPLAGNLEEAGKRLVE 328 >UniRef50_C4G4B6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=C4G4B6_ABIDE Length = 451 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 48/276 (17%), Positives = 104/276 (37%), Gaps = 15/276 (5%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 NL +I + + V QL + G++ Y + + KG++ + I+ + + F Sbjct: 140 NLYEIFINMYIQEVRQLVKHGIKSSYVGQEDNLMVYKGKLIVNEHIKHNLTHKERFYVGF 199 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 D + N++IKSTL L K + E R L + + + FS + Sbjct: 200 DEYQVNRAENKLIKSTLLKLQKLTTSVENSK-EIRQLLTAFELVESSINYDKDFSKIVTD 258 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 ++T+ Y+ +I K + N G M +++ ++ +F ++ + Sbjct: 259 RSTKEYEMLIKWSKVFLKNKSFTTFSGTESARALMF---PMEKVFEAYVAKFMKKVFSRI 315 Query: 217 NTTRSYLKWDASSISD--QSLNLLPRMETDITIRSSEK---ILIVDAKYYKSIFSRRMGT 271 S + + + D+ + +++ ++I+D K+ KS+ + + Sbjct: 316 GWEVSAQDKGHYLFNSLNGENHKRFALRPDLVVTKNDENKSVIILDTKW-KSLVNDKGTN 374 Query: 272 EKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 ++YQ+ Y L+YP D Sbjct: 375 YGISQADMYQMYGYSKKY-----GTSEIWLLYPVND 405 >UniRef50_B9MJI1 Putative uncharacterized protein n=1 Tax=Diaphorobacter sp. TPSY RepID=B9MJI1_DIAST Length = 431 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 46/293 (15%), Positives = 100/293 (34%), Gaps = 22/293 (7%) Query: 34 PGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTV 93 L + + + +L ++GL DY E +P ++G++ +R Sbjct: 115 FDVPLTEWVMRQFLLSLQRLVQQGLRQDYVRVEEELPYLRGQLHTTAQMRQLPGRAHHFH 174 Query: 94 STFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYL 153 D+ D NR++K L ++H + A+ L +L + Q + Sbjct: 175 VRHDVFVPDRAENRLLKLALER-VRHATNQADNWRLAQELSARLHEVPASTQPQQDWRAW 233 Query: 154 NGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRREL 213 + + +Y+ + C+ ++ +P G + M L++ ++ + + L Sbjct: 234 SRTRLMSHYQPIYPWCQLVLGQGMPVALAGDQQGLSLLF---PMEKLFESYVARWLLKHL 290 Query: 214 TSA-----NTTRSYLKWDASSISDQSLNLLPRMETDITIR--SSEKILIVDAKYYKSIFS 266 YL W + ++ D+ +R + ++++D K+ + Sbjct: 291 PQHLCLTAQAASEYLCW-------HDGRRMFQLRPDLLLRNHNGAAVMVLDTKWKRLEAD 343 Query: 267 RRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH-VDTAVKHRYKING 318 R + YQ++ Y G+ LIYP V G Sbjct: 344 NRANNYGLAQGDFYQMLAYGQRYLTGQGKLA---LIYPAWTGFDVPLPMFEMG 393 >UniRef50_B5IHH3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IHH3_9EURY Length = 460 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 64/353 (18%), Positives = 134/353 (37%), Gaps = 29/353 (8%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTE 67 ++N+ ML AW + I N++ ++L + + +L + GL +Y ++ Sbjct: 109 IKNLVKMLQIAWNLPIRDVDISSLKIGENSIFEVLLTIYSIKLLDAIKEGLYKEYIRVSD 168 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 + +KG+I+FAK ++ N D L NR +K A L NS Sbjct: 169 DLHYVKGQIDFAKY-SRRWERRHIIPVNYNDRNPDNLINRTLKYA-AYLASLYTRNSMNF 226 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + + +S + ++ + + YK +I++ + I+ N P G Sbjct: 227 SNLKMAENLMDSVSLVPVSASEIDSITFTRLNEGYKPLINLARVIITNLSPEFTGGKKDV 286 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLL----PRMET 243 + F M ++++F+ + + +DQ +LL + Sbjct: 287 FAFL---IPMEKVFERFIANSIVQNKSKVLGNDCKSCEVYVQGADQKKHLLKGSRFMLIP 343 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 DI I+ + K I+D KY ++YQ++ Y ++ +L+Y Sbjct: 344 DIMIKINGKRYIIDTKYKLLDTED-EKKYGVSQSDVYQMLAYAYAYDTPKI-----MLLY 397 Query: 304 PHVDTAVKHR--------YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 P + K+ G + + T++L + +L+ +D++L+ Sbjct: 398 PKGVGDFDKKEWEFENINSKLAGKKLIIETIDL------MKYDLVKEYDKFLE 444 >UniRef50_B7KTT4 IQ calmodulin-binding-domain protein n=2 Tax=Rhizobiales RepID=B7KTT4_METC4 Length = 426 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 60/344 (17%), Positives = 130/344 (37%), Gaps = 19/344 (5%) Query: 9 RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 R + +ML + ++LL+IL + + + + RRG+ Y + + Sbjct: 86 RKLVHMLAVTHDLDVSAGALSELDWQRDDLLEILIRLFARMLAEAVRRGMPRRYVGHEDD 145 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 +P ++GR++ A+ + FD L+ D N+I+K+ + L + + R Sbjct: 146 LPVLRGRLDAARQFTRLAASPQSLACRFDALSADIALNQIMKAAVLRL-QSFARGAETRR 204 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 R L I + + F + + ++ + ++ ++ + G + Sbjct: 205 LLRELAFAYADIREVPIETLRFDLVIVDRTNARWRDLQALACLLLQGRFQTTSGGAATGF 264 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSAN-TTRSYLKWDASSISDQSLNLLPRMETDITI 247 M+ L++ ++ R L S+ + +S R + DI + Sbjct: 265 SLLF---AMNALFEAYVARMLARVLRSSGQRVVAQGGLLYCLEDPESGIRTFRTKPDILV 321 Query: 248 -RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH- 305 R SE L++D K+ + + ++YQ+M Y + LL+YPH Sbjct: 322 KRDSETTLVIDTKWKRLAPVIDDPKQGVSQADIYQMMAYGRLYR-----CARLLLLYPHH 376 Query: 306 ------VDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 +HR + ++ + T++L Q + +L D+ Sbjct: 377 ARLGAQPGLRSRHRVTSSDDELFVGTIDLEQ-LETVPGQLSDLA 419 >UniRef50_UPI0001909ACF hypothetical protein RetlI_30663 n=1 Tax=Rhizobium etli IE4771 RepID=UPI0001909ACF Length = 482 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. Identities = 54/313 (17%), Positives = 109/313 (34%), Gaps = 12/313 (3%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 ++ M++ A I + + L L+L+RRG Y P Sbjct: 160 LFKMVSEALNLTPRIGGGATIEAFDLPMTEWLAASFLAKALELARRGPRQAYRLVEAREP 219 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++GR++F + +R T D+ D NR+I+S + + + + R A Sbjct: 220 FLRGRLDFTRQLRAAAGGAHMFHITHDVYLLDRPENRLIRSAIEHIARRSMTSDNWR-LA 278 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 R L + + Y + +C+ ++ + +P G Y Sbjct: 279 RELSILFADVPESRNVHADLKRWGRDRQLADYADIRPLCELLLTHRLPFALAGDYHGMSM 338 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M L+++++ R + + + +M+ DI + S Sbjct: 339 LF---PMERLFERYVLGSLRMIAPEH--FEIHPQHGTMHLCSHEGEDWFQMKPDILVESG 393 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT-- 308 + I+DAK+ K + + R + + YQL Y G L+YP V Sbjct: 394 AQRWIIDAKW-KRLSTDRDKNYELSQVDFYQLFAYGQKY---LGGTGEMYLVYPAVSDFP 449 Query: 309 AVKHRYKINGFDI 321 ++ +K++ + Sbjct: 450 TMRAPFKLSDNLL 462 >UniRef50_B0A6E5 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A6E5_9CLOT Length = 426 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 51/299 (17%), Positives = 119/299 (39%), Gaps = 20/299 (6%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 N+ +I + VL + ++GL+ +Y KG+++F + IR + + + + Sbjct: 127 NIFEIFIRMFINEVLLIVKKGLKSNYETIESNERVFKGKMKFTQQIRYNYAHKEQCYVEY 186 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLN-G 155 D N + N+++KSTL L K+ + +++ + L + + F+ + Sbjct: 187 DEFNTNCPENKLLKSTLLYLYKNT-CSLKNKNDIKMLLNSFLEVDKSTNYEEDFNRIIAA 245 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTS 215 +N + Y + K + G + M L++ ++ E R+ L Sbjct: 246 DRNKKDYTTALLWSKIFLMGKSFTSFSGSKIAFALLF---PMEKLFESYVAEILRKNLNK 302 Query: 216 ANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEK--ILIVDAKYYKSIFSRRMGTEK 273 + + S + L ++ DI +++ + I I+D K+ + S + Sbjct: 303 SLYSISIQDKTYHLFDKPNKKFL--LKPDIVVKNKKNNDIFILDTKW--KLLSNQKSNYG 358 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKI----NGFDIGLCTVNL 328 ++YQ+ Y +N +L+YP+ + + ++ NG + + ++L Sbjct: 359 ISQSDMYQMYAYSKKYGSKNV-----ILLYPNAENTIINKTIEFESNNGTSVKVKFIDL 412 >UniRef50_A7ZRA6 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7ZRA6_ECO24 Length = 428 Score = 160 bits (406), Expect = 5e-38, Method: Composition-based stats. Identities = 56/275 (20%), Positives = 104/275 (37%), Gaps = 13/275 (4%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LL+I + V QL ++GL DY + +KG++ + +R +N K +D Sbjct: 122 LLEIFISQFLQSVSQLLKQGLRSDYVSEKGNLAFMKGKLMLSAQLRHNAVNRHKFCVDYD 181 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 D ANR++ STL L+ + + R + GI + L + Sbjct: 182 EYMPDCAANRLLHSTLDKLLSLKLSSENQRWLYELCF-AFDGIPLSRDIESDLNSLRIER 240 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 +Y I+ + I+ P +G+ + M +++ F+ + EL S Sbjct: 241 GMTHYSEPIAWAQLILRGMSPSALQGNTKAISLLF---PMEAVFESFVAQTLPYELPSHL 297 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSS---EKILIVDAKYYKSIFSRRMGT-EK 273 S S+ L ++ D+ I+S + +++D K+ + S++ + Sbjct: 298 KVFSQAA--TYSLVKHGLKDCFKLRPDLLIQSRQPIQTKMVMDTKWKQVNSSQQKKSLYG 355 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT 308 + YQ+ Y G LIYP D Sbjct: 356 LAQSDFYQMFAYGQKY---LGGTGEMYLIYPAHDD 387 >UniRef50_B7UQU6 McrC family protein, predicted McrBC 5-methylcytosine restriction system component n=24 Tax=Enterobacteriaceae RepID=B7UQU6_ECO27 Length = 436 Score = 160 bits (406), Expect = 5e-38, Method: Composition-based stats. Identities = 53/302 (17%), Positives = 122/302 (40%), Gaps = 16/302 (5%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEA-IPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 VR M+ + I + I L+DI + V ++ +RGL+ DY Sbjct: 96 VRQQLLMMLRTLKSFRHIASSESGVKISKMPLMDIFIQQFIESVRKIVQRGLKRDYLRQE 155 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTI 126 + +P +KGR+ + + + + +D + + L NRI+K+ + + + + Sbjct: 156 DNLPWMKGRLRISAQLSKNCIRRDRFQVEYDEYSVNRLENRILKTAINKISRQTSNPQLL 215 Query: 127 RDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 + + + I+++H F L+ + +Y+ ++ K I+ P G Sbjct: 216 QQITQLQFH-FENITSVHDAYIAFEQLHFDRQMHHYEQALAWAKMILLGDSPHCMYGDVN 274 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 + M +++ F+ + R + + + ++ + L ++ DI Sbjct: 275 AFSLLF---PMEAVFESFVTTWMRYRYYDKWRVDAQVS--SKNLISYNGKALFKLRPDIC 329 Query: 247 IR----SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI 302 +R ++ ++ D K+ I + R + + +LYQ++ Y + + G+ +LI Sbjct: 330 LRPRKSTTGSVITCDVKW--KIVNGRKDSLEQSQADLYQMLAYGLNYQEGEGD---MILI 384 Query: 303 YP 304 YP Sbjct: 385 YP 386 >UniRef50_B7LQZ4 Putative 5-methylcytosine restriction system component n=6 Tax=Enterobacteriaceae RepID=B7LQZ4_ESCF3 Length = 447 Score = 160 bits (406), Expect = 5e-38, Method: Composition-based stats. Identities = 54/275 (19%), Positives = 102/275 (37%), Gaps = 13/275 (4%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LL+I + V QL ++GL DY + +KG++ + +R +N K +D Sbjct: 122 LLEIFIHQFLHSVSQLLKQGLRSDYVSKQGNLAFMKGKLMLSAQLRHNAVNRHKFCVDYD 181 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 D ANR++ S L L+ + + R L GI + + L + Sbjct: 182 DYMPDCAANRLLHSALDKLLSLKLSSENQR-WLYELRFAFDGIPLSRDIERDINNLRLER 240 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 +Y ++ + I+ P +G+ + M +++ F+ + EL Sbjct: 241 GMAHYNEPMAWAQLILRGMSPSALQGNTKAISLLF---PMEAVFESFVAQTLPDELPPHL 297 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRS---SEKILIVDAKYYKSIFSRRMGT-EK 273 S+ L ++ D+ I+S + +++D K+ S++ + Sbjct: 298 KVLPQAA--TYSLVKHGLKDCFKLRPDLLIQSHKPVQTKMVMDTKWKLVNSSQQTKSLYG 355 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT 308 + YQ+ Y G N LIYP D Sbjct: 356 LAQADFYQMFAYGQKY---LGGNGEMYLIYPAHDD 387 >UniRef50_Q39M33 McrBC 5-methylcytosine restriction system component-like protein n=6 Tax=Proteobacteria RepID=Q39M33_BURS3 Length = 445 Score = 160 bits (406), Expect = 6e-38, Method: Composition-based stats. Identities = 53/318 (16%), Positives = 106/318 (33%), Gaps = 12/318 (3%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML A + + + + + + + +RG+ DY E ++ Sbjct: 102 MLEVAMEITPREGDEATLQCFDHPITEWMMRRFLQALEHVIKRGMRRDYLRIEEEQRYLR 161 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 G++ AK +R + +D+ + D NR++KS L + K ++ A L Sbjct: 162 GQLNIAKQMRASPAHADLLNIRYDVFSPDRAENRLLKSGLIRVSKST-RDADNWRVANEL 220 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 L + F + +Y C+ ++ + +P G + Sbjct: 221 LHILHEVPASRNASADFRAWRTDRLMAHYVQARPWCQIVLGDQVPLALTGETQGISLLF- 279 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKI 253 M L+++++ R L + + + +++ D I + Sbjct: 280 --PMERLFERYVGHALRILLPVHYRLTEQ--GSRHWLCNHDGTGIFKLKPDYLIEGPDAP 335 Query: 254 LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV---DTAV 310 I+DAK+ + R + YQL Y GE LIYP + A+ Sbjct: 336 RILDAKWKLIDGADRTNNYGLKQADFYQLYAYGQKYLGGVGELN---LIYPRTRQFEAAL 392 Query: 311 KHRYKINGFDIGLCTVNL 328 K Y + + +L Sbjct: 393 KPFYFTPELRLQVLPFDL 410 >UniRef50_C8S833 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8S833_FERPL Length = 426 Score = 158 bits (400), Expect = 3e-37, Method: Composition-based stats. Identities = 61/338 (18%), Positives = 134/338 (39%), Gaps = 24/338 (7%) Query: 2 EQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGN-NLLDILGYVLNKGVLQLSRRGLEL 60 ++ I ++N+ ML Y+ + IK+ +L I + +I ++ K + +L + + Sbjct: 90 KREKI-LQNLVRMLEYS--GWEGIKETDLTQIGTEKDFFEIYVFLFAKNLAELLKVNRDA 146 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Y + + ++G+IEF K L+ ++ N +T NR +K +L+K Sbjct: 147 SYVRTYDELRFVRGKIEFRKYWNPARLH--IIPCSYYERNMNTPINRTLKFVSYLLLKKV 204 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + T R +S+ L ++ +T + + + I C+ + +S+ Sbjct: 205 ESSET-RRLLKSVISVLDSVTLSPVTLAEVEKITFNRLNSRFIPFIDFCRAFLRDSVFSL 263 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 F+ F M L+++F+ + + + L + Sbjct: 264 QGSDVEFFSFL---IPMETLFERFVAKAVKELYKGTEWK---PHIQETFGYLVPKEKLFQ 317 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 ++ DI + + + +IVD KY I Q+LYQ+ Y L + + Sbjct: 318 LQPDIVLENGGERVIVDTKY--KILDPEDRKLGVSQQDLYQMYAYCKEL-----GSSKCV 370 Query: 301 LIYP-HVDTAVKHRYKINGFD---IGLCTVNLGQEWPC 334 LIYP ++ + +K+ + + + T++L + Sbjct: 371 LIYPESLNGKIDGEFKLGSKEKIDLKVKTISLENPFDN 408 >UniRef50_C0GTU8 Putative uncharacterized protein n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTU8_9DELT Length = 424 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. Identities = 59/285 (20%), Positives = 107/285 (37%), Gaps = 23/285 (8%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LLDI V QL+ GL Y N + + +KGR+ F K I + + + Sbjct: 113 LLDIFFRSFLSEVEQLAHHGLVRKYRKNQDNLTTLKGRLLFQKQITLNLVRRERFYTEHV 172 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + N+I+ + L ILI N + +AR+L I ++ FS LN + Sbjct: 173 HYERNNPFNQILGTALDILI-LTSSNPHLSAQARNLALSFEDIDRINAAEVTFSRLNYTR 231 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 NT Y+ I + + I+ N P G + M+ L+++++Y +R + Sbjct: 232 NTERYRRAIQLARLIILNYCPDVRSGGEDVLAILFD---MNNLFERYVYAQLKRAEAMNS 288 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSS----EKILIVDAKYYKSIFSRRMGTEK 273 + ++ + + DI +K +++D K+ + T Sbjct: 289 EQNVSFRAQVQQPFWRTERIRKHIRPDIIAEIGQGYDQKRVVIDTKWKIPRDGKPADT-- 346 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKING 318 +L+Q+ Y + LL+YP + I G Sbjct: 347 ----DLHQMYAYNVHFGAKQS-----LLLYPRTSSTCD----IQG 378 >UniRef50_C2WWH8 Putative uncharacterized protein n=3 Tax=Bacillus cereus group RepID=C2WWH8_BACCE Length = 399 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 60/322 (18%), Positives = 119/322 (36%), Gaps = 19/322 (5%) Query: 30 LEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNH 89 L LL +L R+G Y + E + ++G+IE +K I Sbjct: 94 LNGEDRGELLTAFLATFLTRLLNELRKGTYKTYERHEENLNTLRGKIELSKHIYKNVFQK 153 Query: 90 GKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQH 149 K FD E+ N++ K L I+ KH K+ T++ L + ++ T + Sbjct: 154 TKAYCAFDEYTENNSLNQLFKCALLIVKKHTKI-HTLKLYLERCLGYLEPVDVVYFTEKE 212 Query: 150 FSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFC 209 + + ++ K IV + + F +M++L++K++ Sbjct: 213 LKSITFNRQNERFRQAALFAKLIVERATIYSKGRGASSFSFLF---QMNMLFEKYIEVAL 269 Query: 210 RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRM 269 +E N S + +S ++ D I ++I+D K+ + Sbjct: 270 -QETIGNNKIISQHAEKRLLRNKKSGRQNILLKPDFVI---NNVIIMDTKWKSAT---NN 322 Query: 270 GTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKH---RYKINGFDIGLCTV 326 G + ++YQ+ Y+ + K +L+YP + H I +CT+ Sbjct: 323 GRSSYVQSDIYQMYAYVTAYKEVK----RCILLYPKQEIEAVHPVWEVIDTEKTIEMCTI 378 Query: 327 NLGQEWPCIHQELLDIFDEYLK 348 + E+ +EL +I + +K Sbjct: 379 RID-EFSKTVRELKEILQKQVK 399 >UniRef50_C5DAA2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Geobacillus sp. WCH70 RepID=C5DAA2_GEOSW Length = 411 Score = 155 bits (392), Expect = 2e-36, Method: Composition-based stats. Identities = 46/307 (14%), Positives = 127/307 (41%), Gaps = 17/307 (5%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML + + A + A ++L++++ + K V + +G+ +Y + + ++ Sbjct: 77 MLLFCEDLPLSYEHATMAAYDSHSLMEMIARLFVKEVEMILNKGIVKEYIVEEDNLTCLR 136 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 GR++ + +R + K +D L+ + L N++I+ L + +KH L + L Sbjct: 137 GRVDIRQHLRTNFMTPTKVYCRYDELDTNILENQVIRMALEV-VKHFSLTKQTMRQINRL 195 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 + I+ + + + + + ++Y+ + +I Q +++ Sbjct: 196 ADEFMMIADPYFSY-EWPNFSYHRLNQHYEKAHKLAYYIWKQIYVNQLY-QFQYRSHYSY 253 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWD--ASSISDQSLNLLPRMETDITIRSSE 251 +M+ L++KF+ + ++ L A + ++ + D +++ D+ + + Sbjct: 254 LIDMNELFEKFVAKLLKKYLPGAAKVHAQRRFKKAITKNGDGYHDII----LDLLVEFPD 309 Query: 252 KI-LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAV 310 K +++D KY ++ K + ++YQL Y ++ + ++++P Sbjct: 310 KDPIVLDTKY------KQYSKYKVENADIYQLAFYAQ-FVTKSSNHYKAIIVHPEYAGED 362 Query: 311 KHRYKIN 317 I+ Sbjct: 363 ACEEVID 369 >UniRef50_C7MHK3 McrBC 5-methylcytosine restriction system component n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MHK3_BRAFD Length = 393 Score = 155 bits (391), Expect = 3e-36, Method: Composition-based stats. Identities = 60/344 (17%), Positives = 120/344 (34%), Gaps = 26/344 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P IPV + +++ YA + ++ +L L + + +GL Y Sbjct: 63 RPKIPVERLVFLMGYASAPT-FWRDHSVRLDTDADLPQALARTFMRLATKAIEQGLLQGY 121 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P ++GRI I ++D D NR++ + L++ L Sbjct: 122 QRVDDSLPVLRGRIRVTDQISRRFGADLPLEVSYDDFTVDIAENRLLLAAATRLLRLPGL 181 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + R + L +L G+S + + Y+ + + + I+ Q + Sbjct: 182 DVRTRQGLQRLRLQLSGVSEVR-RGDELPRWQPTRLNARYQPSLRLAERILAGESFEQRR 240 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G R F + M +Y+ F+ + L S + + + Sbjct: 241 GRLRVDGFVFD---MWKIYEDFVGVALKEALASRGSATLQHRMHLDHAQRVD------LR 291 Query: 243 TDITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D S ++VDAKY + + +LYQL+ Y L G L Sbjct: 292 PDFLWTSHNGDQVVVDAKY------KAEKPAGYPQADLYQLLAYCTVLGLR-----EGHL 340 Query: 302 IYPHVDTAVKHRYKINGFDIGL--CTVNLGQEWPCIHQELLDIF 343 +Y + + + G +I + TV+L Q + ++ + Sbjct: 341 VYAK-GNESELTHDVRGTEIVIHCHTVDLDQAPSTLLGQVRSLA 383 >UniRef50_B2FKW7 Putative uncharacterized protein n=3 Tax=Xanthomonadaceae RepID=B2FKW7_STRMK Length = 448 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 50/295 (16%), Positives = 103/295 (34%), Gaps = 14/295 (4%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML A A+ L + + + +L +RGL DY E ++ Sbjct: 100 MLCTALDISPREGSPTDVALFDIPLNEWVMGRFIGALDELLKRGLRFDYTRVREEQLFLR 159 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 GR++ ++ +R D+ +ED NR+++ L + N+ A+ L Sbjct: 160 GRLDMSRQLRQSPTRAHVFNIEHDVFSEDRPENRLLRVALDRVCART-RNAGTWRLAQEL 218 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 ++ + + +Y+ V +C+ I++ P G +R Sbjct: 219 STRMASVPRSTQVANDLRAWGSDRLMAHYREVRPLCELILSGQSPLALAGDWRSPSMMF- 277 Query: 194 EKEMSLLYQKFLYEFCRREL-TSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEK 252 M L+++++ R+ + ++ S NL+P D +R ++ Sbjct: 278 --PMERLFERYVGACLARQFAPEWQVGGAASEYLCSHGEANWFNLIP----DFLLRRGDE 331 Query: 253 ILIVDAKYY--KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 + ++D K+ + S + YQ+ Y G L+YP Sbjct: 332 LRVLDTKWKVLDATASDAREKYGLKQSDFYQMFAYGQRY---LGGVGQMALVYPQ 383 >UniRef50_A6X8D9 Putative uncharacterized protein n=1 Tax=Ochrobactrum anthropi ATCC 49188 RepID=A6X8D9_OCHA4 Length = 422 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 54/308 (17%), Positives = 106/308 (34%), Gaps = 16/308 (5%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEAIPGN-NLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 +R++ + L+ + N + + + + L RRGL Y Sbjct: 87 IRSLLIKMITESLNLRPRRAGNAGVAAFHMPMTEWFASAFLEEASDLIRRGLRSGYATIG 146 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTI 126 P ++GR+E A+ I H + D + D NRI++S + + ++ + Sbjct: 147 TREPYLRGRLEVARQIGAAGGLH-QFSVQLDEYSLDRPENRILRSAVEHVARNTASSENW 205 Query: 127 RDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 R AR L L + G+ Y V + + ++ + +P G + Sbjct: 206 R-RARELSALLGEVPESGDVGSDLGKWERGRQLADYGRVKPLAELLLTHQLPFATLGDRK 264 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDAS--SISDQSLNLLPRMETD 244 M L++++++ L W S + R++ D Sbjct: 265 GMSMLF---PMERLFERYVFSSVSASLKP----GFESVWQPSRHHLCSLGAEEWFRLKPD 317 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 + + + I+DAK+ K + R + YQL Y +G+ L+YP Sbjct: 318 MLVHDEARRWIIDAKW-KRLNPDRSEKFGLAQADFYQLFAYGQKYLGGSGD---MFLMYP 373 Query: 305 HVDTAVKH 312 D + Sbjct: 374 GTDEFPEV 381 >UniRef50_A8EUN6 McrBC catalytic subunit McrC, putative n=2 Tax=Campylobacterales RepID=A8EUN6_ARCB4 Length = 387 Score = 150 bits (380), Expect = 5e-35, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 127/340 (37%), Gaps = 40/340 (11%) Query: 10 NIY-YMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 NI+ YML YA+ +Q A + +L++ + G+LQ ++GL +Y + Sbjct: 76 NIFIYMLMYAYDVKLSNEQIASCANQKHTILEVFIQMFANGLLQELKKGLYKEYLTKQDN 135 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 +P +KG+ + ++ K +D +E+ N+ T+ L K + Sbjct: 136 LPVLKGKYLINENLKYN-FTKNKIYCEYDEFSENNSLNQFFLYTVKYLQKF----VKDKK 190 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 + + + +N + +K + ++ SI ++ + + Sbjct: 191 LLKQCELIFDEVEYKQIDINRLETINFDRLNLRFKTSFEIAILLLKQSILLFSQ-DKKSF 249 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 F + M++L++KF+ + L +A + + L ++ DI + Sbjct: 250 AFLFD---MNVLFEKFIARMVKE-----------LDNNAKIQNQDNFGNL-TLKPDIILE 294 Query: 249 SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT 308 + I+D KY K E + Q +Y + K +N +L+YP Sbjct: 295 NQ----IIDTKYKKI-----KSIEDIKQSDKLQAFSYGINYKVDNV-----MLLYPKHLD 340 Query: 309 AVKHRYKINGF----DIGLCTVNLGQEWPCIHQELLDIFD 344 +K+ + + + T++L + + +I + Sbjct: 341 NIKYDLVLGKDDKKVKLKIRTIDLNFSGNNYKEYIDEIME 380 >UniRef50_A7H1L9 Putative uncharacterized protein n=13 Tax=Campylobacter jejuni RepID=A7H1L9_CAMJD Length = 445 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 47/279 (16%), Positives = 104/279 (37%), Gaps = 18/279 (6%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 L ++ + + ++GL Y E +KG++ F + I+ ++ + ++ D Sbjct: 151 LFEVFITMFLDEFDSVYKKGLMRSYLSCEENRAFLKGKLLFNEHIKQNLIHKERFFTSND 210 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 D NR+IKSTL L LN + + L + + FSY + Sbjct: 211 EFVLDIAPNRLIKSTLNFLKSKTSLN---KFRLIKAMQMLDEVEFSKNYEKDFSY-KISR 266 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 + Y+ ++ CK + N G + M +++ ++ ++ + + Sbjct: 267 HFDCYENLLLWCKIFLKNESFMPYHGKNEAFALLF---PMEKIFEDYVAYMLKKVNPAQD 323 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 + + ++ ++ D+ I E +I+D K+ S+ + Sbjct: 324 IKVQN---NGKYLISKNDENCFMLKPDLYI---ENKMILDTKWKIPNDSKDEKKQGIAQS 377 Query: 278 NLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKI 316 +LYQ+ Y K + + ++YP + + KI Sbjct: 378 DLYQMFAYACKFKIYDIK-----IVYPLCEKTQDLQRKI 411 >UniRef50_Q7VG80 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VG80_HELHP Length = 485 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 55/296 (18%), Positives = 109/296 (36%), Gaps = 32/296 (10%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LL+I + + + +L ++GL+ DY E +KG++ F + ++ + + ++ D Sbjct: 167 LLEIFILMFLQELEKLVKKGLKSDYIVCEENRNFLKGKLLFHQNLKLNFAHRERFFTSSD 226 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + + NRIIKSTL L+ + L++ + + I + S Sbjct: 227 EFSVNIAPNRIIKSTLE-LLNTQNLSTNTSAKLMQMRFIFLDIPPSQSIDKDLSKCQNLG 285 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLY----------- 206 R YK ++ C+ + + + Y + M+ L++ F+ Sbjct: 286 YFRNYKMILQWCEIFLKRKSFAPYQKDSKAYALLFD---MNKLFESFVASEMKKWLCDMK 342 Query: 207 -----EFCRRELTSANTTRSYLKWDASS--ISDQSLNLLPRMETDITIR---SSEKILIV 256 + ++ + SYLK S + + + DI + E I Sbjct: 343 LSYENKVFIEQIFRESKKDSYLKTQEKSKYLIVEGDKNRFLLNPDIVGYQKQTKETFFIA 402 Query: 257 DAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKH 312 D K+ I S+ ++YQ+ YL + G LIYP ++ Sbjct: 403 DTKW--KILSKEQQNYGVSQSDMYQIFAYLAKYQ-----CNQGFLIYPKIEDCNDE 451 >UniRef50_Q2SJR5 McrBC 5-methylcytosine restriction system component n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SJR5_HAHCH Length = 437 Score = 145 bits (367), Expect = 2e-33, Method: Composition-based stats. Identities = 50/276 (18%), Positives = 99/276 (35%), Gaps = 13/276 (4%) Query: 36 NNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVST 95 L + + + +L RGL DY E I+G++ ++ R + Sbjct: 117 QPLHEWIFRQFLTELQRLVGRGLRFDYQRVDEESRFIRGQLRLSQQQRQPLGRRHLFQIS 176 Query: 96 FDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG 155 D+ D L NR++K+TL+ ++ + K R A L ++ I + + Sbjct: 177 HDIYTPDRLENRLLKTTLSYVLANCKSGENWR-RANELTHRMADIPPEQEPLRAMNNWRS 235 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTS 215 K + Y + C+ I+ P +G +R M L++K++ R EL + Sbjct: 236 NKLMQDYDAIRPWCELILAKLNPNFQQGQHRGIALLF---PMEQLFEKYVEVSLRHELPA 292 Query: 216 ANTTRSYLKWDA----SSISDQSLNLLPRMETDITIRSSEKILIVDAKYY--KSIFSRRM 269 ++ + +++ D+ +R+ ++D K+ Sbjct: 293 GIQLKAQASSQYLLRHKPQGSDISTTMFQLKPDLLLRTPLGDQVLDTKWKLLDQCAWTSD 352 Query: 270 GTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 +LYQ+ Y + G +LIYP Sbjct: 353 KKYNIAQSDLYQMFAYGHKYQHGRGH---MMLIYPK 385 >UniRef50_D2AT08 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AT08_STRRD Length = 406 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 67/352 (19%), Positives = 132/352 (37%), Gaps = 26/352 (7%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P PV ++++L YA + +Q ++A LL L + + R+G+ Y Sbjct: 62 PKTPVDRVFFLLGYARRP-RGWRQGEVDAGDHPELLPALAHAYALAADRALRQGVLQGYL 120 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E +P ++GRI A +R + +D D NR++ + A L++ L Sbjct: 121 EMEEALPVVRGRIREADQLRRRYGLPLPVEVRYDDYTVDIAENRLLLAASARLLRLPGLA 180 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 R R + +L G++ L + + Y + + + ++ + + G Sbjct: 181 VQTRRTLRHVIARLAGVTALVPGRPLPVWRPS-RINTRYHTALGLAELVLRGASYELDDG 239 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + EM +++ F+ L RS L+ + LL + Sbjct: 240 TA--VRVDGLLVEMWRVFEDFVTVALTEALRPHG-GRSELQDKRHHLDHGRRVLL---KP 293 Query: 244 DI---TIRSSEKIL---IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 D+ I S + +VD+KY S HS +LYQ++ Y L ++G Sbjct: 294 DLVRYVIDSGGTEIPAAVVDSKYKISTGPEG------HSADLYQMLAYCTVLGLDHGH-- 345 Query: 298 GGLLIYPHVDTAV-KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 L+Y D +H + G +I ++L + + + + ++ Sbjct: 346 ---LVYAEGDAEPYRHVVRGAGIEIMQHAIDLTLPPADLLAAIERLAESIVR 394 >UniRef50_A4TAT5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4TAT5_MYCGI Length = 446 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 63/343 (18%), Positives = 126/343 (36%), Gaps = 26/343 (7%) Query: 10 NIYYMLTYA--WGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTE 67 + M+ Y+ L + A+ G++L +L VL L R GL DY P + Sbjct: 80 RVLQMIEYSEGVRLLAHLPPDQQLAVSGDDLFQLLVRVLVGESKLLIRDGLLRDYRPTED 139 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 + ++GR+ + + + FD + D N+++ ++L H S +R Sbjct: 140 TLAVMRGRLRMRDQFLKRYGSLHRLECNFDEYDGDIAENQLLAASLTAAASH-VRASALR 198 Query: 128 DEARSLYRKLPGISTLHLTPQHF--SYLNGGKNTRYYKFVISVCKFIVNNSIPG--QNKG 183 +E R L + I + ++ G+ Y+ ++ + Sbjct: 199 NETRMLAGVIGDICQPPTFDPDWYEQRIHYGRRNSRYEGAHKAALLVLRGLALNDLHSAS 258 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRS-YLKWDASSISDQSLNLLPRME 242 F N M++++++F+ + L+ + L A + + + + Sbjct: 259 RQGVNAFMVN---MNVIFERFVSALVDQALSGTGLRSTPQLSIRAIVVDESTNRTYSNIR 315 Query: 243 TDITIR--SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 D+ I +S + + VD KY + T KF S ++YQL Y ++L E + G+ Sbjct: 316 PDLVITEVNSARSVPVDIKY------KLYDTVKFSSADVYQLFTYAYALGAGAEEKMAGV 369 Query: 301 LIYPHVDTAVKHRYKINGF------DIGLCTVNLGQEWPCIHQ 337 IY T +I G + +++ I Sbjct: 370 -IYASTTTTSGPALRIKGNTGIAAARLRGAGLDVAAALDNIAS 411 >UniRef50_D0C387 McrBC 5-methylcytosine restriction system component n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0C387_9GAMM Length = 437 Score = 144 bits (363), Expect = 5e-33, Method: Composition-based stats. Identities = 51/310 (16%), Positives = 106/310 (34%), Gaps = 18/310 (5%) Query: 36 NNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVST 95 + + + +L + GL DY E ++G++ K +R Sbjct: 128 QPIHEWIIEQFLCNFEKLIQYGLRFDYQRVQEEQKYLRGQLLHVKHMRQSPARKHIFPIE 187 Query: 96 FDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG 155 D+ D NR+IK+ L ++ K + + + A+ L I Q F Sbjct: 188 HDIYEVDRPENRLIKTALDVVCKKTRSSKNWK-LAQELRLMTGEIPKSQNIVQDFKQWQS 246 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTS 215 G+ Y + + I++ +P G +R M L++ ++ EL Sbjct: 247 GRLLALYVDIRPWTELILSEYMPVSTHGGWRGMSLLF---PMEKLFEHYVAYHLHHELKE 303 Query: 216 ANTTRSYLKWDASSISDQSLNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEK 273 + + +++ L R++ DI I + S +I+D K+ + R Sbjct: 304 WDVKTQVSNQHICTFNEKP---LFRLKPDIYIQHKCSPYKIILDTKWKLLDQNDRNRRFG 360 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYP----HVDTAVKHRYKINGFD--IGLCTVN 327 ++ Q+ Y +LIYP + + ++ + + + N Sbjct: 361 LKDSDVQQMFAYSHYY---LDHASEVILIYPYHKNKFEDEICFKFNVQNDQAMLRVIPFN 417 Query: 328 LGQEWPCIHQ 337 L + + I Sbjct: 418 LDKPYDFIKS 427 >UniRef50_Q2FNZ4 McrBC 5-methylcytosine restriction system component-like n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNZ4_METHJ Length = 437 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 59/353 (16%), Positives = 122/353 (34%), Gaps = 29/353 (8%) Query: 9 RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 RN+ ML+Y ++L++ + ++ + L R Y E Sbjct: 81 RNLAVMLSYT-NLKPLSSDLTSMDQEDIDMLELFLRIFSEQLHHLLFRCQHRQYLNRDEH 139 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 + IKGRI+ K L + TF L +DTL NRI K ++ +H + N ++ Sbjct: 140 LKFIKGRIQVNKYWNPAQL--ERIPCTFKELTQDTLLNRIFKFCATLMSRHTQ-NEETKE 196 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 + + + L ++ H+T ++ + T + ++ C+ + +S + Sbjct: 197 HLKGILQILEPVTYTHVTSSETRFVILDRLTEQFAPLLRFCEIYLRHSTITLQASQVEIF 256 Query: 189 DFERNEKEMSLLYQKFLYEFCRR--ELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 M ++++F+ L T + + DI Sbjct: 257 SLL---IPMERVFEQFISGVLSEQSHLLPEGATVYSQYPGGHLAQTLDGRGIFELRPDIF 313 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG-LLIYPH 305 I +I+D KY + + ++YQ+ Y +N+ +L+YP Sbjct: 314 IDHPRIPVIIDTKY--KMPKKSSSNSGIKQSDIYQMFGY------GAKKNVPALMLLYPD 365 Query: 306 VDT--AVKHRYKINGFDIGL-------CTVNLG--QEWPCIHQELLDIFDEYL 347 + + + + + T +L +W +EL I + Sbjct: 366 IGEKIDIDLEFSYDNCRLSALLIRSITLTYDLADPVQWEMWLEELRGIMHDMY 418 >UniRef50_B1BM86 ATP-dependent helicase priA n=8 Tax=Clostridium RepID=B1BM86_CLOPE Length = 513 Score = 143 bits (360), Expect = 1e-32, Method: Composition-based stats. Identities = 58/323 (17%), Positives = 132/323 (40%), Gaps = 22/323 (6%) Query: 36 NNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVST 95 +L +IL Y+ +K + + R+G+ +Y E I +KG + + I+ + K Sbjct: 202 QSLNEILAYLFSKKLQKELRKGVYGEYVYIEENINSLKGSLRVQEQIKNMASHSSKAFCR 261 Query: 96 FDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG 155 F+ + D N+I+ + ++K+ K T+R R L + ++T + + Sbjct: 262 FEEFSRDNKLNKILSFFVKEVMKNVKNRETLR-LLRISEMILGDVDERNVTLNEVNNFSF 320 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTS 215 + + ++ ++ K IV N G + Y M+ +++ ++ + ++ L Sbjct: 321 NRLNKPFEDAFTLGKMIVLGESALGNLGGNKAYSILFK---MNEIFEIYIGKLLKQLLYK 377 Query: 216 ANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSE-KILIVDAKYYKSIFSRRMGTEKF 274 + K+ I ++S + ++ DI I + + +I+D K+ + Sbjct: 378 ETVHMQHSKYKL-LIKEESNRGVFKLIPDIVIEKNGIERIIIDTKWKSV--ESKFNRHGV 434 Query: 275 HSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRY---------KINGFDIGLCT 325 ++LYQ+ YL + +N + +L+YP+ + + + + Sbjct: 435 KREDLYQMYAYLT--RYKNVSTV--ILLYPYNERIEGEEGEYLESWYLDEEEHKRVRVYA 490 Query: 326 VNLGQEWPCIHQELLDIFDEYLK 348 VNL E + L I +Y++ Sbjct: 491 VNLENEKETLKS-LDKIVRKYVE 512 >UniRef50_A0JT91 McrBC 5-methylcytosine restriction system component-like protein n=3 Tax=Arthrobacter RepID=A0JT91_ARTS2 Length = 418 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 56/330 (16%), Positives = 116/330 (35%), Gaps = 18/330 (5%) Query: 17 YAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRI 76 YA Q ++ + A+ +L L L + + RG+ Y E + +KGRI Sbjct: 93 YAGN--QGFREDPVAAVEDPDLWSALAVSLVQLADRALSRGVLQGYLTVDESLRTVKGRI 150 Query: 77 EFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRK 136 + I ++D ED NRI+++ L + + ++ ++ R L K Sbjct: 151 RISDQISRRPGMLVPLEVSYDEFTEDIAENRILRAALERMARVPRVRPDVQSRLRLLLGK 210 Query: 137 LPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKE 196 L ++ L + Y V+ + + I+ N+ G + F + Sbjct: 211 LDAVTRLRPGAP-LPPWQATRMNTRYHAVLRLSEVILRNASAEAGDGKQQTASFVVD--- 266 Query: 197 MSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT-IRSSEKILI 255 M +++ F+ R +T A L+++A + + D + +++ Sbjct: 267 MGQVFEDFVGTALREAMT-AYPGEMRLQYNALLNEAVRDSDRLTVNPDAVHLLGGRPVVV 325 Query: 256 VDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYK 315 D KY + S + +Q++ Y +L+ L+Y R Sbjct: 326 YDTKYRAATDQGAS-----LSADHFQMLAYCTALRVPT-----AWLVYAGAGEMKLRRIL 375 Query: 316 INGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 D+ ++L I + D+ + Sbjct: 376 NTDIDVVEYPLDLSLPPSDILAAVADLAQQ 405 >UniRef50_D2B1B6 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2B1B6_STRRD Length = 431 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 59/312 (18%), Positives = 101/312 (32%), Gaps = 21/312 (6%) Query: 11 IYYMLTYAWGY--LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 + ML YA G L+ + + G++L D++ +L L R GL DY E Sbjct: 77 VLRMLDYASGLPALRHMDRLRNLPNQGHDLRDLICLLLTVECEALVRHGLRRDYIRRQET 136 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 +P I+GR+ + + + FD + D L NR+ + L + H + +R Sbjct: 137 LPAIRGRLLADQQVLRRFGRLDRLECRFDEFDSDILDNRLCAAAL-RVAAHSARDEALRA 195 Query: 129 EARSLYRKLPGISTLHLTPQHFSY--LNGGKNTRYYKFVISVCKFIVNNSIPGQ--NKGH 184 AR + + T + L + +Y+ ++ + G Sbjct: 196 RARRVATDFSEVCTTDGLDVRWVAQHLTYHRPNEHYRQAHRWALLLLQAPGFTDLLSTGG 255 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 F + M+ L++ F + R + + SIS + D Sbjct: 256 PSSRTFMLD---MNSLFEAFATQLLREATHRTGIAVRAQESLSRSISRPDGRSYTSITPD 312 Query: 245 ITIRSSEK----ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 I + VD KY + +LYQ Y L E + Sbjct: 313 IQLVHGHGPGAWRRSVDVKY------KLYADRTIKPSDLYQSFAYGQVLSSEETPTAY-I 365 Query: 301 LIYPHVDTAVKH 312 L D H Sbjct: 366 LFASDRDGEPDH 377 >UniRef50_Q2J945 McrBC 5-methylcytosine restriction system component-like n=1 Tax=Frankia sp. CcI3 RepID=Q2J945_FRASC Length = 416 Score = 140 bits (354), Expect = 6e-32, Method: Composition-based stats. Identities = 62/363 (17%), Positives = 123/363 (33%), Gaps = 40/363 (11%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P + +R + ++L YA + ++A +LL + + + G+ Y Sbjct: 68 RPKVTIRRLLFLLGYAQDR-GRWFEDEVQAAEEPDLLPAVAAAFARTASRALAHGVPRGY 126 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 +P ++GR+ + +R +D DT NR++ + L+ + Sbjct: 127 RQVDAALPVLRGRLRESAQLRQRSGVMFPLEVRYDERTVDTAENRLLLAATRSLLALAGV 186 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 E R + L G++ P + Y + + + ++ +S + Sbjct: 187 APATAQELRRIAAALDGVAEPAHGPVKPPDWVPTRVNAPYHAALRLAETVLRSSSFERED 246 Query: 183 GHY-RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 G R F M +++ F+ LT D++L M Sbjct: 247 GETLRVDGFVVK---MWEVFEDFVTHAVDEVLTHRGGEVRLQDRTHHLDEDRTLE----M 299 Query: 242 ETDITIRSSEK-------ILIVDAKYYKSIFSRRMGTEKFHSQNLY-QLMNYLWSLKPEN 293 D+ + E +++DAKY +I ++Y Q++ Y L Sbjct: 300 CPDLVLYRPEGPGGRMIPAVVLDAKYRLAIRQG-------ARAHVYHQMIAYCARLGAR- 351 Query: 294 GENIGGLLIYPHVDTAVKH--------RYKING-FDIGLCT--VNLGQEWPCIHQELLDI 342 G L+Y + A R +I G IGL T ++L + + I Sbjct: 352 ----QGWLVYAGSERADGQPGGRGDVIRSRIGGPTPIGLVTYVLDLRLPLAELRARIERI 407 Query: 343 FDE 345 D+ Sbjct: 408 ADD 410 >UniRef50_D2QGI2 5-methylcytosine restriction system component-like protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QGI2_9SPHI Length = 428 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 95/274 (34%), Gaps = 16/274 (5%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 L ++ V L+++G++ Y KGR + A+ R + + +D Sbjct: 121 LWEVFITAFLDTVDALAQQGIQRAYVTVEGNERFWKGRFQAARQQRDNACHAERLAVVYD 180 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHF-SYLNGG 156 L NRI+K+ L + I + + + L L ++ + Sbjct: 181 TLTASVPPNRILKTAL-VAIHAKTTDQANKRRIHQLLSVLEEVALSDDVRSDLMAVRRSN 239 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 + Y+ + + ++ PG +G M +++ ++ R SA Sbjct: 240 RLFMRYETALGWAEMLLMGQGPGVKRGDKESIALLF---PMERVFEDYVAHGIRAYWPSA 296 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSI-----FSRRMGT 271 + S + A + + ++ DI IR ++ ++D K+ + S R + Sbjct: 297 DRI-SVQESSAHLVDEHVGAPRFKLRPDIIIRHQDRTFVMDTKWKQVNGLSLDTSPRTAS 355 Query: 272 EKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 ++YQ+ Y + L+YP Sbjct: 356 YGIDQADMYQVYAYGKKYAANDL-----FLLYPA 384 >UniRef50_D0Z341 McrBC 5-methylcytosine restriction system component n=3 Tax=Gammaproteobacteria RepID=D0Z341_LISDA Length = 425 Score = 138 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 58/337 (17%), Positives = 124/337 (36%), Gaps = 22/337 (6%) Query: 2 EQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 + ++ + ML+ + + + L ++L K V + ++G+ Sbjct: 85 QDTQATMKVLLKMLSTVYKLNMHRFEHSSLQTLNRPLFEVLISYFLKEVSNIIQQGIRSR 144 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y + P +KG+++ +K I ++D + D NR+I+S L +IK K Sbjct: 145 YTRVQDCKPYLKGQLQTSKQINQRPGCLNSFHISYDEFSPDRAENRLIRSALNQVIKWSK 204 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + +R L L I F + ++ +Y+ V CK I+N P Sbjct: 205 NSDNLR-LGSELQCALDDIPCSKNYALDFRQWSKDRSLVHYRSVKPWCKLILNYQSPVSL 263 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYL---KWDASSISDQSLNLL 238 G ++ M L+++++ + L+ A T R+ + + Sbjct: 264 SGRHKGISMLF---PMESLFEQYVAIRLGKSLSHALTLRTQVSNCALVTHTPRSGKSQEW 320 Query: 239 PRMETDITIRSSEKIL-----IVDAKYYKSIFSR--RMGTEKFHSQNLYQLMNYLWSLKP 291 R++ DI + +K L + D K+ + + ++YQ+ Y + Sbjct: 321 FRLKPDIVVW--DKTLHKPLCVADTKWKRINEKQATAKHKYGISQSDMYQMFAYGQNC-- 376 Query: 292 ENGENIGGLLIYPHVDTAVK--HRYKINGFDIGLCTV 326 G + LIYP + + +K + + + + Sbjct: 377 -LGGSGVVYLIYPAYEDFNESLPPFKFD-NRLSVKAI 411 >UniRef50_D0BXD9 McrBC 5-methylcytosine restriction system component n=2 Tax=Acinetobacter RepID=D0BXD9_9GAMM Length = 441 Score = 138 bits (348), Expect = 3e-31, Method: Composition-based stats. Identities = 46/274 (16%), Positives = 98/274 (35%), Gaps = 15/274 (5%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 L D + +L R G+ DY ++G+++ A+ +R + D Sbjct: 126 LTDWFYAQFLDALQKLYRTGIRFDYQRVEAEENFLRGQLDTAQQMRKPLTRQHQLSIKHD 185 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + + NR+I+S + ++ K K + + + + + F + Sbjct: 186 IFTSNRAENRLIRSCIDVVCKRAKTA-DLWRTSHEFHLLFSEVPQSTNYREDFKKWKNDR 244 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 +Y + C+ I+ N IP KG + M L++K++ ++L Sbjct: 245 LMSHYSDIRYWCELILGNEIPFAVKGINQAKSILF---PMEKLFEKYVEIQLSKQLVKGA 301 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIR------SSEKILIVDAKYYKSIFSRRMGT 271 + ++ + + + D+ I+ S+K LI+D K+ + Sbjct: 302 KLETQKSSKY--LAQYNSKDIFNLIPDLAIQYYCEQSKSKKYLILDTKWKLINSNNIEEK 359 Query: 272 EKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 ++YQ+ Y + G +LIYP Sbjct: 360 FGIKQSDMYQMFAYNHMYQ---GHTSDIVLIYPK 390 >UniRef50_A4Y9J5 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=A4Y9J5_SHEPC Length = 435 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 52/296 (17%), Positives = 105/296 (35%), Gaps = 13/296 (4%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ++ A Q + L + L + + +RG+ DY E ++ Sbjct: 98 LILNALQLKQRETEYTDIERFDAPLTEWLMAQFLTELDSVIKRGMRFDYQRIEESQRFLR 157 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 G+++ K +R D+ D NR++K+ L + K + N R A L Sbjct: 158 GQLDVVKQLRQPAGREHIFNIRHDIFTADRAENRLLKTALLRVCKTTQDNDNWR-LAHEL 216 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 L + T F+ + +Y+ + C+F+++ +P +G ++ Sbjct: 217 QSLLHELPTSTDIQADFTTWRSDRLMAHYQAIKPWCEFVLSQHVPLAVQGLWQGISMLF- 275 Query: 194 EKEMSLLYQKFLYEFC----RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRS 249 M L++ ++ +R L R+ L +++ D+ ++ Sbjct: 276 --PMERLFESYVAAELDAAVQRMLGVKGEVRTQLASKYLCKHQGKD--FFQLQPDLQLKL 331 Query: 250 SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 I+D K+ S + Q+ YQL Y + +G + LIYP Sbjct: 332 GADHWILDTKWKLLDASDKENKYGLSQQDFYQLFAYGQTYLGGDGTLV---LIYPA 384 >UniRef50_A2TPW2 Putative uncharacterized protein n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TPW2_9FLAO Length = 448 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 57/308 (18%), Positives = 113/308 (36%), Gaps = 20/308 (6%) Query: 10 NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 NI++ L+Y L+ + N+ +IL Y+ +K +L + Y Sbjct: 105 NIFWWLSYC-RKLRFPNYKTGLSGEKNDFFEILIYLFSKYTKELLNNAMYQRYVEIHREE 163 Query: 70 PGIKGRIEFAKTIRGFHLNHG--KTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 +KGRI+F + K T+D D NR IK +L K + + Sbjct: 164 QFVKGRIDFTRYTNENLSRANFHKISCTYDSFEMDNQFNRCIKYVATLLFSVTK-DRQSK 222 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + R + L +S ++ + + ++ + C + N + K + Sbjct: 223 NNLREILFILDEVSDETMSASACRNIQFNPMFKEFETIRDYCVLFLENCVSYNYKDALQL 282 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI 247 + F M +++ F++ F +EL S + +++ D+ + Sbjct: 283 FAFL---IPMEYIFEDFIFGFIDKELHEVTAKAQ------SGQISLDQSKNFKLKPDLIL 333 Query: 248 RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 + K +I D KY S +LYQ++ Y L+ ++ +L+YP D Sbjct: 334 EVNGKRIIADTKYKMLNLSGNDPKNGISQNDLYQMVAYAIRLQCDHI-----ILLYP--D 386 Query: 308 TAVKHRYK 315 + Y+ Sbjct: 387 HILNPGYR 394 >UniRef50_D1SMG2 McrBC catalytic subunit McrC, putative n=1 Tax=Methanocaldococcus sp. FS406-22 RepID=D1SMG2_9EURY Length = 445 Score = 137 bits (345), Expect = 7e-31, Method: Composition-based stats. Identities = 63/295 (21%), Positives = 114/295 (38%), Gaps = 16/295 (5%) Query: 14 MLTYAWGY-LQEIKQANLEAIPGNN-LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPG 71 M+ +A+ ++E + A ++ I + +I Y+ +L +RG Y E Sbjct: 103 MMNFAYDLNIKEQELAKVKDIASTPVIYEIFIYLFAYSLLNEIKRGFYKSYIKVREEKKF 162 Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEAR 131 +KG++ K IR K + E+ L N+I T I +K K + Sbjct: 163 LKGKLLIDKQIRKLPHQRHKFSIEYHEFTENNLLNQIFYYTTYISLKKTKW-RENKKLLS 221 Query: 132 SLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFE 191 L GI+ +T F ++ + +K ++ K I+ S G+ G F Sbjct: 222 ELMLIFEGINLRKITIHDFKRVHFTRLNERFKKPFNLAKIIL--SAFGEIDGEDAIGFF- 278 Query: 192 RNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSE 251 +M+ L++KF+ + L +S K+ N+ + D + Sbjct: 279 ---VDMNDLFEKFICSILSKSLGFEIKYQS--KFKLFKEVKGIKNI--EQKPDYVVYKDN 331 Query: 252 K-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 K +L++DAKY + +R K S L Q+ Y + I +LI+P Sbjct: 332 KPVLVLDAKYTEI--NREYEKPKLPSDMLRQIYTYAKYYTLKCNYKIRSVLIFPK 384 >UniRef50_B5HWV8 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HWV8_9ACTO Length = 434 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 55/310 (17%), Positives = 109/310 (35%), Gaps = 21/310 (6%) Query: 11 IYYMLTY-AWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 + ML Y A + +L D++ +L + +L RG+ DY + + Sbjct: 85 VLRMLEYTAGRGFPPLDATRTVREGAPHLRDLVALLLTEECERLLSRGVRQDYVTTEDDL 144 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 P ++GRI ++ + + + FD + D + NR+ + + L + +R Sbjct: 145 PAVRGRILPSRQLLRHYGRLDRLACRFDEHDTDIVDNRLCAAAVD-LAARTARSPAVRAR 203 Query: 130 ARSLYRKLPGISTLHLT--PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ--NKGHY 185 AR ++ L + L+ ++ +Y+ +++ G Sbjct: 204 ARRAATSFARVAPTRLGDLRTALAGLDYHRHNTHYRSAHRWAALLLSGGGIADLLAPGPL 263 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNL-LPRMETD 244 F + M++L++ FL R T T + D+ + D Sbjct: 264 ASRAFLVD---MNVLFEVFLTHLLREAATGTGLTVRDQTRHRGVLYDERTERPYGEVRPD 320 Query: 245 I----TIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL-KPENGENIGG 299 + T+ VD KY + + K +LYQ Y +L + G Sbjct: 321 VLVTGTLDGEPLRRPVDLKY------KLYDSRKLSPSDLYQAFLYAHALARQPAGGPPTC 374 Query: 300 LLIYPHVDTA 309 +LI+P +A Sbjct: 375 VLIHPGSGSA 384 >UniRef50_C9ZD40 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9ZD40_STRSW Length = 415 Score = 130 bits (328), Expect = 6e-29, Method: Composition-based stats. Identities = 48/347 (13%), Positives = 103/347 (29%), Gaps = 27/347 (7%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 IP + L YA G P D++ L +L R GL DY Sbjct: 73 AIPGEQLMSWLAYALGTPVPATARRWATGPDG-YADLVAAALLDQCERLLREGLRRDYVR 131 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 + P ++GR++ A + + D NR++ S L + ++ Sbjct: 132 RRSVEPVLRGRLDIAAQATRRYGQLDQLHVRTFDREADIPDNRVLGSALKAALGMT-VSP 190 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG---QN 181 + P T + + + Y+ + + ++ + Sbjct: 191 DLARALHGAAGAFPHAPTPAAALRALDRTHYTRLNARYRPAHTWARLLLRGGGVTDLLTD 250 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 +G E M L++ + + Sbjct: 251 QGTTA----EGLLLAMPALWEAVVRRLGTEAVGPHGGHAVPGGSGVGITVHGDRGNASTF 306 Query: 242 ETDITI--------RSSEKILI-VDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 D+ + ++ + L+ VDAKY +R + +++QL+ Y Sbjct: 307 RPDLLLSLPALPGHDTAHRTLLPVDAKY------KRYDHHGVSAADVHQLLTYSSGYASA 360 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVN-LGQEWPCIHQE 338 + ++++P + + G + L T+ LG + ++ Sbjct: 361 DAPT--AVIVHPQTGRHDRRTLHVRGPNGLLGTIAVLGVDTRTTPEQ 405 >UniRef50_C1EZU3 Putative uncharacterized protein n=1 Tax=Bacillus cereus 03BB102 RepID=C1EZU3_BACC3 Length = 424 Score = 128 bits (321), Expect = 4e-28, Method: Composition-based stats. Identities = 60/333 (18%), Positives = 127/333 (38%), Gaps = 27/333 (8%) Query: 30 LEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNH 89 ++ G N + L + + ++ + G YN E + +KGR+ K I+ + Sbjct: 102 VDIEQGMNFTEQLISMFISELWKVKKIGFSKTYNSKEENLNYLKGRLFIGKQIKYNVVPK 161 Query: 90 GKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQH 149 K ++ LN T N I+ + LI H ++S I+ E + + + Sbjct: 162 -KFYCNYNELNYLTAENLILFEIINKLI-HLSIDSKIKSELIYFKNEFAQALNID-RRIN 218 Query: 150 FSYLNG--GKNTRYYKFVISVCKFIVNNSIPGQ-NKGHYRFYDFERNEKEMSLLYQKFLY 206 + + +Y+ ++ + + + G F +F +M L++K++ Sbjct: 219 LKGIKYKATRLNMHYETIMYLSEMFLQKRFFSTLESGENLFCNFL---IKMDDLFEKYIL 275 Query: 207 EFCRRELTSANTT---RSYLKWDASSISDQSL---NLLPRMETDITIRSSEK---ILIVD 257 + + + + + D+ L N M DI I + I+++D Sbjct: 276 LLVKEIIENFFPKYRVEEQVNLNFVRKYDEKLDRENGFLTMIPDIIIYNKTSNKPIVVID 335 Query: 258 AKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI-YPHVDTAVKHRYKI 316 KY K ++ YQ+++Y++SL +N + G+L+ + + Y+ Sbjct: 336 TKYVDITNKN-----KLNNNAYYQMLSYMFSLHLQNETLVTGILLSHGTSGHTYRINYR- 389 Query: 317 NGFDIGLCT--VNLGQEWPCIHQELLDIFDEYL 347 G + + T V+L I L + D+ L Sbjct: 390 EGQHMHIYTGSVDLLNTEEKIKDSLKLMLDKVL 422 >UniRef50_C6NTT9 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NTT9_9GAMM Length = 441 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 49/314 (15%), Positives = 99/314 (31%), Gaps = 23/314 (7%) Query: 39 LDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDM 98 D L + + RR + Y ++ +P +GRI F G+ S + Sbjct: 118 HDALMEMFCDELQLARRRQVIRRYASTSDSLPSPRGRISFPGQCYESIRRPGRFASAWVA 177 Query: 99 LNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKN 158 L ED NRI K L + ++ IR +L + ++ + + Sbjct: 178 LTEDVPENRIFKEVLLRY--RPRCSARIRGRIDLCLSELDSVDASGDHRLEWAKVRADRL 235 Query: 159 TRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT 218 Y ++ K +++ G G S L+++F+ + +A Sbjct: 236 PPIYHSLLRQSKALLDEEGAGVFAGDKLATA---EIVFTSRLFEQFVAKELSWISPAAGL 292 Query: 219 TRSYLKWDASSISDQSLNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKFHS 276 + S + + D+ + + LIVD K+ +R Sbjct: 293 VSKAQDRGTFTCSRGDGKGVFELIPDVRLIDDRGKTALIVDTKWKSLDMRKRH--LGISR 350 Query: 277 QNLYQLMNYLWSLKPENGENIGGLLIYP-------HVDTAVKHRYKINGFDIGLCTVN-- 327 +++YQ++ Y +L+YP K + + + Sbjct: 351 EDIYQVLTY-----GSRFNCADVVLLYPDVTNETGKTGYYQKFESILGARKYSVHVLKIP 405 Query: 328 LGQEWPCIHQELLD 341 L + ++LL Sbjct: 406 LLAPTLMVARDLLR 419 >UniRef50_Q188G2 Putative uncharacterized protein n=9 Tax=Clostridium difficile RepID=Q188G2_CLOD6 Length = 422 Score = 125 bits (314), Expect = 2e-27, Method: Composition-based stats. Identities = 49/319 (15%), Positives = 122/319 (38%), Gaps = 22/319 (6%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 NLL+ + + ++G+ +Y E + ++G+I + + ++ K + Sbjct: 115 NLLNFFVMYFIESMQTQMKKGIYFEYINKIENLNVMRGKILLSTYAKEKGISPMKIRCEY 174 Query: 97 DMLNEDTLANRIIKST-LAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG 155 D +E+ N+++K ++IL + +++I+ + + + + +H+ + Sbjct: 175 DEYSENNFLNQVLKKACISILCRIN--DNSIQGKIKKILSYFQNVDLIHIDRKKLLDYKF 232 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRREL-T 214 KN +K + + ++ N ++ + + E++ LY++++ + Sbjct: 233 YKNNDRFKDCYLLARLLLLNLSMDNSQNNQEAFSILF---EINTLYEEYIGILIKSIWDN 289 Query: 215 SANTTRSYLKWDASSISDQSLNLLPRMETDITIR--SSEKILIVDAKYYKSIFSRRMGTE 272 S T K ++Q+ + DI + +E +I+D K+ Sbjct: 290 SFRETYIQDKSKFLLKNEQTGKKNFNLRPDIVLYDLKNEYEIIIDTKWKAIEVDS---NV 346 Query: 273 KFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRY-----KINGFDIGLCTVN 327 + S ++YQ+ Y+ + + +L+YP + + G I TV Sbjct: 347 FYRSSDIYQMYAYITAYENAK----RCILLYPCIQKDKNYSSWKLSESFKGKFIEAKTVR 402 Query: 328 LGQEWPCIHQELLDIFDEY 346 L + +L I Y Sbjct: 403 LD-DIKNTKNDLKKIIFNY 420 >UniRef50_Q466P1 Putative uncharacterized protein n=2 Tax=Methanosarcina RepID=Q466P1_METBF Length = 453 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 55/329 (16%), Positives = 119/329 (36%), Gaps = 24/329 (7%) Query: 9 RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 R+++Y L+Y ++ L ++L Y+ + + ++ Y E Sbjct: 111 RHLFYWLSYC-KKVKFPFNQAFLDKFELELPELLIYLFARQIHEVISTRPFSAYEEVQEA 169 Query: 69 IPGIKGRIEFAKTI-RGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 + +GRI F + + R + N ++ D L NRIIK +L+ + T R Sbjct: 170 LFTPRGRINFDRYVTRISYGNCHLIDCDYEPFVFDNLLNRIIKYCTRLLLSKASIIETQR 229 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + L + Q L Y+ +I +C I+ N + Y Sbjct: 230 -ILNEIIFMLEDVDDQVCFAQQLQTLRIPSIYSDYEEIIQICGMILEN--QAYSCAEYEM 286 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI 247 ++ M +++ F+ + ++ + S + + ++ DI + Sbjct: 287 KNWSL-LLPMEYIFEDFIAGYVQKYFSGTFKVEP----QKSDLYLHTNPNTFNLQHDILL 341 Query: 248 --RSSEKILIVDAKYYKS-IFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 + + + +I+D KY + + ++YQ+++Y + LLIYP Sbjct: 342 TNKKTGEQIIIDTKYKPRWNLEKSDSKKGIAQSDMYQMISYAY-----RRGTNKVLLIYP 396 Query: 305 HVDTAV--KHRYKIN----GFDIGLCTVN 327 + + H + IN I + ++ Sbjct: 397 NTSNELAEDHTFLINKGTKDETINIKAID 425 >UniRef50_C5A3Z2 McrBC 5-methylcytosine restriction system component n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A3Z2_THEGJ Length = 458 Score = 121 bits (305), Expect = 3e-26, Method: Composition-based stats. Identities = 58/289 (20%), Positives = 107/289 (37%), Gaps = 28/289 (9%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNL----LDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 ML A+G +IK +L + G NL ++ Y+ K + +RG +Y Sbjct: 119 MLDMAYGL--KIKDHDLAYLQGRNLRPNLYEVFIYLFAKSLWSEVQRGYHREYVEVHREE 176 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 ++G++ ++ IR L ED L NRI +++ ++ R Sbjct: 177 KFLRGKLLMSRQIRKLPHQLNTFSVEVHELIEDNLLNRIFYASVREALRRTTWGLN-RKL 235 Query: 130 ARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR-FY 188 L GI+ +HL +HF ++ + ++ + K + +P KG R Sbjct: 236 LGELMLAFDGITPIHLRTEHFERVHFTRLNERFRRPFELAKLLF---MPASGKGRSREVS 292 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 F + M+ L+++F+ R L + + S + D +R Sbjct: 293 GFFVD---MNKLFERFIERVLVRNLPPEYKLFYQESYPFLKNQNGSSQ-----KPDYVVR 344 Query: 249 SSEK-ILIVDAKYYKSIFSRRMGTEKFHSQN-LYQLMNYLWSLKPENGE 295 ++++DAKY R E+ S + L QL Y + Sbjct: 345 KGNTPVVVLDAKY-------RELKERIPSSDMLRQLYVYSRIWGYKTSH 386 >UniRef50_A1ZU11 Putative uncharacterized protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU11_9SPHI Length = 438 Score = 120 bits (302), Expect = 7e-26, Method: Composition-based stats. Identities = 56/368 (15%), Positives = 128/368 (34%), Gaps = 42/368 (11%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P + ++ Y L+Y+ A P L +I ++ QL Y Sbjct: 88 PKVCFDHVLYYLSYSQRVRFPFALARTHTSPSLFLPEICIFLFASYAEQLLIEQPLHLYQ 147 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHL--NHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 TE + +KG+++ + ++ N K S L + N+++K T L+ + Sbjct: 148 ERTEELDFLKGQLDIDQYLKENIATGNWQKLHSRHTPLLYNNRFNQLVKYTARQLLLMTQ 207 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 ++ + + + L +S + +T + + + + V+ +C + N + Sbjct: 208 YAPSL-EHLQHMIALLQNVSDVPMTYKDCMKIRLPEQQIALQTVVDMCAMFLGNEMINYE 266 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRREL---TSANTTRSYLKWDASSISDQSLNLL 238 G + F M L+Y+ F+ +F + + YL + + + Sbjct: 267 VGQKYNFAFLL---PMELVYEDFIGQFVQTHFAQWQPRLQPKKYLGRNPTGKP------V 317 Query: 239 PRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 ++ D+ + S + +I D KY R ++YQ++ Y + Sbjct: 318 FSVQPDMLLGSPQ--VIADTKYKIREVPLRHSQTAIEESDIYQMIAYALGYR-----CSE 370 Query: 299 GLLIYPHV-----DTAVKHRYKINGF------DIGLCTVNLGQEWPC---------IHQE 338 +L+YP + + I I ++++ + ++ Sbjct: 371 MVLLYPASYRQPKTLSFSESFNIQSDLLTTPLRIRAESLDITTTAQTKFAEALEQKLKKQ 430 Query: 339 LLDIFDEY 346 L IF+ + Sbjct: 431 LQRIFESH 438 >UniRef50_Q5JJA9 Putative 5-methylcytosine restriction system, catalytic subunit n=1 Tax=Thermococcus kodakarensis RepID=Q5JJA9_PYRKO Length = 467 Score = 118 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 55/295 (18%), Positives = 107/295 (36%), Gaps = 18/295 (6%) Query: 15 LTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKG 74 L Y G + + L + +DIL + + +L R G+ ++ E ++G Sbjct: 104 LYYNLGLREYDIKTALATEGNSPFIDILLDIFSNRLLNELRFGIYGEFVSTEETSSSLRG 163 Query: 75 RIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLY 134 ++ + I K + D NR+ K TL + ++H + + Sbjct: 164 QLLVEREILKLPTQKHKFDIRYKKFTVDNFLNRVFKYTLYLGLQHTNR-RETKRTLSEAW 222 Query: 135 RKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNE 194 L +S ++ L ++ + + K I++ KG F Sbjct: 223 DMLKEVSLTPISVDSIEKLTLNSLNLRFELPLKLAKIIISGL--DYQKGLIS-PGFI--- 276 Query: 195 KEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM-ETDITIRSSEKI 253 M ++ F+Y+ R L R Y + + + + + DI I SS Sbjct: 277 IPMPDAFEFFVYKLLRAILGKDYRVR-YHPQNREFVLETPRKFIENPPQPDIIIESSSGQ 335 Query: 254 --LIVDAKYYKSIFSRRMGTEKF--HSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 ++VDAKY + +++ +S +LYQ+ +Y G+L+YP Sbjct: 336 PLVVVDAKYKTLYCPKCGEKQRYVKNSSDLYQIYSYTKLYNA-----HAGVLVYP 385 >UniRef50_Q08S71 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08S71_STIAU Length = 420 Score = 118 bits (296), Expect = 3e-25, Method: Composition-based stats. Identities = 56/348 (16%), Positives = 118/348 (33%), Gaps = 27/348 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P +P ++ + YA L + + LL +LG + + V +L+R G Y Sbjct: 62 RPQVPSLHLLALADYAAKGL-SWGEDLVGMAEAEELLPLLGALFLRRVERLARGGWVHGY 120 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 P ++GR + I + + FD D NR++ + L +L + Sbjct: 121 REEEAGQPVLRGRWLAGRDIAQPPTHRHRLTCRFDEFTRDVAPNRLLLAALRVLERARTF 180 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + AR+L L G+ + L + Y + + I+ ++ Sbjct: 181 GPPVAARARALAATLDGVQARAEVHAAETMLASDRRFAAYGPAAKLARLILESTGVQAAP 240 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G + F ++ L++ + + + +++ ++ L L+P Sbjct: 241 GPHPLSSFIL---RLAPLFEAAVTRALVQVAEARGL-ACHVQRPLVLDTEGRLLLVP--- 293 Query: 243 TDITIRSSEKILIVDAKYYKSIFSRRMGTEKF-HSQNLYQLMNYLWSLKPENGENIGGLL 301 D + L++DAKY + + + Q++ YL G L Sbjct: 294 -DAVVEQGGARLVIDAKYK-------FPAKGLPPADDFQQIVTYL-----ACVGTHRGAL 340 Query: 302 IYPHVDTAVKHR---YKINGF--DIGLCTVNLGQEWPCIHQELLDIFD 344 + P + + G + + V LG + + L + Sbjct: 341 VLPALGAVPEEETLRLMTFGRTSQVRVVKVPLGGPAATLGRALESTAE 388 >UniRef50_Q5JH86 Putative 5-methylcytosine restriction system, catalytic subunit n=1 Tax=Thermococcus kodakarensis RepID=Q5JH86_PYRKO Length = 476 Score = 114 bits (285), Expect = 6e-24, Method: Composition-based stats. Identities = 59/316 (18%), Positives = 110/316 (34%), Gaps = 25/316 (7%) Query: 10 NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 N+YY + G ++ + E L +I Y+ + + RGL +Y E Sbjct: 107 NMYYQMGLEPGEIRALVF---EYGRQKALDEIFKYLYVLMLSRALSRGLYYEYGEIEESS 163 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 ++GRI + R + +L ED NR++K L + +K +L+ T + Sbjct: 164 QTVRGRILVNELARRPA-WKADLPVRYSLLLEDNPLNRVLKGALEVAVKSARLSETRKVG 222 Query: 130 ARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYD 189 L + P F ++ ++ V + + + G G +F Sbjct: 223 GI-LLDLFRDVGD--PKPGDFGKVSFNHLNERFRTVFRLARVMYFGLAAG---GSRKFLP 276 Query: 190 FERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLN----LLPRME--- 242 M L++ +Y + L + R ++ + + L M Sbjct: 277 GVF--IRMDELFETLVYRTLKTVLDNEAEVRFQVQLPHVIKNAGEIEARFGALFMMGNPL 334 Query: 243 TDITIRSSEKILIVDAKY-----YKSIFSRRMGTEKFHSQNLYQLMNYLW-SLKPENGEN 296 DI + + E +V+ KY Y +R S LYQ Y + + Sbjct: 335 PDIVVSTDEGTCVVEVKYRNLYVYHRGENRAHRKLVRKSDELYQAYTYSRLVSEYLGAKR 394 Query: 297 IGGLLIYPHVDTAVKH 312 + LL+YP ++ H Sbjct: 395 VPVLLVYPRLEGIYNH 410 >UniRef50_C9LKR7 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LKR7_9BACT Length = 405 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 52/297 (17%), Positives = 95/297 (31%), Gaps = 42/297 (14%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG--KTVSTFDM 98 +L V ++ + L+ Y +E + +KGRI K R + +D Sbjct: 112 LLVLHFLGVVSRI--KELKKGYVSRSENLKKVKGRISILKNERQNIAIRRYDRVFCEYDE 169 Query: 99 LNEDTLANRIIKSTL---AILIK--HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYL 153 + D NR+IK L L++ +E+ + + +S + + L Sbjct: 170 YSADIPENRLIKKALLFSQRLLQGLNERSAAVAKLRLNKSLALFSEVSD-KVEIKQVKRL 228 Query: 154 NGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY---RFYDFERNEKEMSLLYQKFLYEFCR 210 K Y I + K I+ +K + F + MSLLY+ ++Y Sbjct: 229 RAHKLFTNYNEAIRLAKLILRLFDYNISKVGSHEGKVVPFWLD---MSLLYEHYVYGLLH 285 Query: 211 RELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMG 270 + D +S E I+D KY + + Sbjct: 286 EAYRER----------ITYQFKGKTGF-----PDFLYKSKEYKAILDTKYIPKYDEKSLD 330 Query: 271 TEKFHSQNLYQLMNY------LWSLKPENGENIGGLLIYPHVDTAVKHRYKINGFDI 321 + QL Y L L+ ++ I ++IYP + + N + Sbjct: 331 KDVVR-----QLSGYGRDLRILTHLEYKDVSPIPCIIIYPKEGKRKNNPFLGNNLRM 382 >UniRef50_A1SC19 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SC19_NOCSJ Length = 407 Score = 108 bits (271), Expect = 3e-22, Method: Composition-based stats. Identities = 50/343 (14%), Positives = 109/343 (31%), Gaps = 28/343 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQ-EIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + + +L A G ++ + P ++ +L + + G Y Sbjct: 65 PKVGAAKVLTLLARAQGVRGLKVDPELVGVAPHADISAVLAVLFAQEAATAMAAGPLRGY 124 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P ++GR+ + T D DT NR ++ L+ + Sbjct: 125 RSEDQTLPVLRGRVRLREQHLRRFGLPVPLEVTVDEWTLDTDDNRRTRAAATALLALPGV 184 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 R L R L L + ++ + ++ ++ Sbjct: 185 PEHSTQALRRLDRLLGEAKLLAPGAP-LEPWTPTRLNVKMHRLLHLADVVLAHTSVEHEA 243 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G + + F N M+ L++ + + ++ +S ++ Sbjct: 244 GATQTHGFVVN---MAWLFETLIARLLEEQTLGLVPQQTMPLDTLGRLS---------IK 291 Query: 243 TDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D+ ++ + D KY K + ++YQL+ Y L G L Sbjct: 292 PDLLFDGPGGVVAVADTKYKL-----LDDNGKVPNADVYQLVTYCARLGLSTGH-----L 341 Query: 302 IYPHVDTAVKHRYKINGFD--IGLCTVNLGQEWPCIHQELLDI 342 IY D + I G + + + V++ + I Q++ +I Sbjct: 342 IY-SSDEPGPDPFGIVGTNVLLVVHAVDVSRPVDVIEQQVREI 383 >UniRef50_B1I205 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I205_DESAP Length = 435 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. Identities = 57/303 (18%), Positives = 98/303 (32%), Gaps = 22/303 (7%) Query: 14 MLTYAWGYLQEIKQANLE-AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGI 72 +L YA+G +E D+L L+ +L RGL Y P E++ Sbjct: 87 LLRYAYGLRNLFLYGQVEMETTDRPFQDLLLSQLSAEAAELLSRGLHRAYRPRHELMASP 146 Query: 73 KGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARS 132 +GR+ F + R + + D L+N+++ + L R Sbjct: 147 RGRVNFQRLARTGGVRQSALPCYHHLRLADCLSNQVLVAGLRFGAGLTADLELRARLRRL 206 Query: 133 LYRKLPGISTLHLTPQHFSYLNG--GKNTRYYKFVISVCKFIVN--NSIPGQNKGHYRFY 188 ++ + L F+ L + TR Y+ + K + + G+ G Sbjct: 207 AAVCGENVTPIRLDYHVFARLEREANRLTRAYEPAFRLTKILYRDAGAGLGREAGGLPVP 266 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSL---NLLPRMETDI 245 F + M+ +Q L F L Y + P D Sbjct: 267 GFLFD---MNRFFQAVLSRFLHENLDGFRVQDEYRLQGMFAYVPGFNPQRRQAPAPRPDF 323 Query: 246 TIRSSEKI-LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 + ++ I+DAKY + LYQL Y S G + ++YP Sbjct: 324 VVFRGGRVAAILDAKYRDLWEN------ALPRDMLYQLALYALSQ----GGGMRAAILYP 373 Query: 305 HVD 307 +D Sbjct: 374 TLD 376 >UniRef50_D1YX67 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YX67_METPS Length = 433 Score = 103 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 61/321 (19%), Positives = 112/321 (34%), Gaps = 24/321 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGN-NLLDILGYVLNKGVLQLSRRGLELD 61 +P I ++ +L YA+ N E G N D+L Y L+ L RGL Sbjct: 73 KPKIEGDSLLRLLRYAYSLEDLDLYKNTEYSTGKMNFHDLLLYQLSIEANILISRGLHKK 132 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y P + +G + + R ++ + ++D NR++ + L I+ + Sbjct: 133 YKPTDSELKIPRGILNIQRIARRGGISKQSLPCKYYPRSDDNTMNRVLLAGL-IMGSNLT 191 Query: 122 LNSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNGG--KNTRYYKFVISVCKFIVNNSIP 178 + ++ R L L +S + L + S ++ + TR Y+ IS+ K ++++ Sbjct: 192 SSQKLKGRLRRLSFILRETVSPVKLNWEMMSVVDMDMSRLTRAYQPSISIIKMLMSSQGI 251 Query: 179 G-QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNL 237 + K F + M+ +Q + + L + + L Sbjct: 252 DLEQKDGMNLPGFLFD---MNHFFQVLISRYLHDYLHDYKVYDEPPLKGLMAYDNNYNPL 308 Query: 238 L---PRMETDITIRSSEKIL--IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 P + D +R + + I+D KY + LYQL Y S Sbjct: 309 KRRPPHLYPDFIVRDNMNKVVAILDTKYRDLW------KLPLPREMLYQLSIYAQSRNLG 362 Query: 293 NGENIGGLLIYPHVDTAVKHR 313 I +YP D Sbjct: 363 ENSTI----LYPTTDNVSSEN 379 >UniRef50_D0LPM4 5-methylcytosine restriction system component-like protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPM4_HALO1 Length = 404 Score = 101 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 48/329 (14%), Positives = 110/329 (33%), Gaps = 42/329 (12%) Query: 31 EAIPGNNLLDILGYVLNKGVLQLSRR-GLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNH 89 E +L ++ V + GL Y+ + I+GRI+F + + + Sbjct: 89 ETTREGDLGPLVARVFCAATWHAIQTSGLLRAYHRQSVRSSMIRGRIDFPRLV-HAGGDL 147 Query: 90 GKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQH 149 +T T NR++ + +A + + L ++ + L L +S Sbjct: 148 SRTPCIVFSRLPQTPLNRLLAAAVAQIRRDPVLRASAGADLPPLATALADVSPHLDRALL 207 Query: 150 FSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFC 209 + + + + + ++ I+ ++ + + F + ++ L+++ + Sbjct: 208 SARIPLSRLEQPFAASHALACLILRSAGLA-SGSEHEGAGFLVD---LANLFERAVARAF 263 Query: 210 RRELTSANTTRSYLKWDASSISD---QSLNLLPRMETDITIRSSE-KILIVDAKYYKSIF 265 R + KW + + ME D+ + + ++VDAKY Sbjct: 264 RDA-----PFAAEAKWRVQLLREAPSTPSTQGSSMELDVFLPDVRGQRVVVDAKYKT--- 315 Query: 266 SRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH-----------VDTAVKHRY 314 + + NL Q++ Y + +L++P V Y Sbjct: 316 -------RVTTGNLQQMITYCVA-----SGTHQAVLVFPAGHLTDRRAHVLVPHRGPPAY 363 Query: 315 KINGFDIGLCTVNLGQEWPCIHQELLDIF 343 +I+ + L +L W + L D Sbjct: 364 RIHLVEFELTQTDLAG-WRDAGRRLADAV 391 >UniRef50_UPI00016C4D94 hypothetical protein GobsU_06730 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4D94 Length = 385 Score = 100 bits (249), Expect = 7e-20, Method: Composition-based stats. Identities = 58/307 (18%), Positives = 104/307 (33%), Gaps = 39/307 (12%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P IP N+ ++L + IP LL L + ++R GL Y Sbjct: 66 PKIPWPNLQFLLGSGVHPTGGTTR-----IPEGGLLGTLATAFADQLEAVARAGLVAGYG 120 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDM--LNEDTLANRIIKSTLAILIKHEK 121 + P ++G++ A +R D + +T NRI +S L H Sbjct: 121 EVESVSPFLRGKLRTAAQMRDAASQAFPGHFHIDEPSFDLNTPWNRIARSAATTLGTHPD 180 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ- 180 + R+ + R L + +H T FS Y ++ +C I Sbjct: 181 VPRATRERIETAARPLAEVPNVHGTDADFSAARTEPRAVGYHALLDLCAIIQQGFSVADP 240 Query: 181 -NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLP 239 G F ++ ++++L+ RREL W + + L P Sbjct: 241 LRTGSDAFL------LDLGQAFERYLFRSLRREL------ADRPGWSVDAHP--AFALGP 286 Query: 240 -RMETDITIRSSE-KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 + D+ +R ++DAK+ + +L+Q++ Y + Sbjct: 287 VTLRPDVLVRKRAVARGVLDAKWKTTALDP---------ADLHQVLAYA---GLTGAPRV 334 Query: 298 GGLLIYP 304 G L+YP Sbjct: 335 G--LVYP 339 >UniRef50_A9FMX9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FMX9_SORC5 Length = 400 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 50/305 (16%), Positives = 101/305 (33%), Gaps = 34/305 (11%) Query: 42 LGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNE 101 +L + + + R DY + ++G I +A+ R + N Sbjct: 109 FAQILEEDLSRAGPR---RDYQRREDDASVLRGTIRWAELARRTSPVP--VPCRYWERNI 163 Query: 102 DTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRY 161 DT NR+ + + H+ L L+ + L + Sbjct: 164 DTPLNRLFAAAVHAASAHDSLREAGGMPLDRLHGIFGHVPRLPPAWILDRTRPLPRLEAD 223 Query: 162 YKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRS 221 ++ S+ I+ G R F + ++ L++ + R + Sbjct: 224 FEAARSLAITILQAFGISH-GGAQRALAFHVD---LARLFEMTVEAAARTQAWDGKVA-- 277 Query: 222 YLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQ 281 +++ +D + D+ +R+ + L++DAKY K+ F +LYQ Sbjct: 278 -IQYQPPYEADVAGE---ESRIDVLVRARGEALVIDAKYSKA----------FSKSHLYQ 323 Query: 282 LMNYLWSLKPENGENIGGLLIYPHVDTAVKHRY-KINGF---DIGLCTVNLGQEWPCIHQ 337 ++ Y+ L G L+YP R+ G ++ L V+L + Sbjct: 324 VLAYMKMLGAR-----RGALVYPKGAELRGERFWSAPGAPEWEVRLHEVDLVAVASNGRR 378 Query: 338 ELLDI 342 EL + Sbjct: 379 ELERL 383 >UniRef50_D1PAJ0 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAJ0_9BACT Length = 416 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 46/288 (15%), Positives = 91/288 (31%), Gaps = 39/288 (13%) Query: 49 GVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG--KTVSTFDMLNEDTLAN 106 GV+ + L Y + + +KG I+ K R + + + DT N Sbjct: 121 GVVNRIKS-LRKGYVLRQKNLKKVKGHIKMLKNERINIAVKRYDRIYCEYADYSVDTPEN 179 Query: 107 RIIKSTL-------AILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNT 159 R++K L A + ++ + S + K +S + + K Sbjct: 180 RLLKKALVFSQRFVAKINRNNVVYSKVNQMVTKALSKFDYVSD-DININSIGQIRSNKLY 238 Query: 160 RYYKFVISVCKFIV---NNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 R Y + + K I+ + S+ R F + MSLLY+ ++Y Sbjct: 239 REYAEAMRLAKVILKHFDYSLSNVEATENRVTPFVLD---MSLLYEHYVYGLLHEAYREK 295 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHS 276 + + ++ D +S I+D KY + Sbjct: 296 ------ISYQYPGVTGL---------PDFLYKSKHFNAILDTKYIPKYEKGTLDNYVIRQ 340 Query: 277 QNLY-------QLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKIN 317 + Y + + Y + ++ ++IYP + +K N Sbjct: 341 LSGYSRDLTILRKLGYEDIDEDSPAPSVPCIIIYPKEGGDTTNPFKSN 388 >UniRef50_Q2GBH5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2GBH5_NOVAD Length = 687 Score = 99.6 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 38/243 (15%), Positives = 87/243 (35%), Gaps = 18/243 (7%) Query: 108 IIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVIS 167 I+ + L +H + R L L I + +T + + + R ++ + Sbjct: 432 IMAAATVFLARHT-RSLATRRTLDELRHALADIPLMPITRLPWQAVRIDRTNRRWEALFR 490 Query: 168 VCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDA 227 + + ++ + H + D M+ L++K++ RR L + Sbjct: 491 LARLLLQRDWQATHH-HAKAPDGLTLLFPMNDLFEKYIAVLLRRALAGSGIEVIDQGGHR 549 Query: 228 SSISDQSLNLL-----PRMETDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQ 281 + + + L R + DI +R +I+ I+D K+ K ++YQ Sbjct: 550 ACLGSFTGGHLETGEVFRTKPDIMLRRGREIVAIIDTKWKKLSLDPLDRKHGVSQADVYQ 609 Query: 282 LMNYLWSLKPENGENIGGLLIYPHVDTAV---KHRYKING--FDIGLCTVNLGQEWPCIH 336 LM Y + +L+YP V + ++ + G + + ++ + + Sbjct: 610 LMAYARLYQ-----TAELMLLYPARPGQVCAERAQFGMAGGSERLRIAMADVSLDEKALA 664 Query: 337 QEL 339 + L Sbjct: 665 EAL 667 >UniRef50_Q7MVS1 Putative uncharacterized protein n=1 Tax=Porphyromonas gingivalis RepID=Q7MVS1_PORGI Length = 431 Score = 99.3 bits (246), Expect = 2e-19, Method: Composition-based stats. Identities = 47/293 (16%), Positives = 91/293 (31%), Gaps = 47/293 (16%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG--KTVST 95 L ++ V + +RGL+ DY + +KG I ++ R + K + Sbjct: 133 LSPLIVVHFLSVVRGIVKRGLKKDYVQRENNLNKVKGHIAISRNERTNVIRKRFDKVLCK 192 Query: 96 FDMLNEDTLANRIIKSTL----AILIKHEKLNS--TIRDEARSLYRKLPGISTLHLTPQH 149 + +E+ NR+IK L IL +S +R + + Sbjct: 193 YQEYSENIPENRLIKKALLFSREILENLAITSSLIPLRHAIHQYLSAFCNVDE-QIEVWE 251 Query: 150 FSYLNGGKNTRYYKFVISVCKFIVN-------NSIPGQNKGHYRFYDFERNEKEMSLLYQ 202 + K + Y I + + I+ N P + + F+ +M+LLY+ Sbjct: 252 VKNIKHHKIFKEYDEAIRLAQMILRRYDYSITNIRPAEEEYCPVFW------LDMALLYE 305 Query: 203 KFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYK 262 ++ + + + + A + D +++D KY Sbjct: 306 HYVLGLLKVAY------GNKIMYQAHGYTGY---------PDFICYDP--KIVMDTKYIP 348 Query: 263 SIFSRRMGTEKFHSQNLYQLMNYLWS---LKPENGENIGGLLIYPHVDTAVKH 312 + QL Y K ++I L+IYP Sbjct: 349 RFEKDGIDVYIVR-----QLCGYSRDRRLFKTCPDKSIPCLIIYPKEGEPQNP 396 >UniRef50_Q97QG4 Conserved domain protein n=27 Tax=Streptococcus RepID=Q97QG4_STRPN Length = 442 Score = 98.5 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 43/296 (14%), Positives = 107/296 (36%), Gaps = 32/296 (10%) Query: 29 NLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLN 88 ++ L +L Y+ K + R+GL +Y+ + +KG I+ ++ Sbjct: 118 DVALSREERLYQLLVYLFPKYLQAAIRKGLYKEYHRFSHNDSHVKGVIDVRNHLKKNLPF 177 Query: 89 HGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQ 148 G D ++++ T+ + + + + D + + I + + + Sbjct: 178 TGNIAYATREFTYDNPLMQLVRHTIEYIKNQKSIGQGVLDNLSTSRENVSEIVRVTPSYK 237 Query: 149 HFSYLNGGKNT----------RYYKFVISVCKFIVNNSIPGQNKG-HYRFYDFERNEKEM 197 + Y+ + +C I+N + G Y+ ++ Sbjct: 238 LADRAKIIRGNQSKPIRHAYFHEYRNLQELCLMILNQ----EKHGLGYQDQKIYGILFDV 293 Query: 198 SLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVD 257 + L+++++Y + S+ + ++ D E+ +++D Sbjct: 294 AWLWEEYVYTLLPKGFVHPRNKDKTDGISVFSVGKR------KVYPDF--YDRERKIVLD 345 Query: 258 AKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHR 313 AKY K + + + ++L+QL++Y + LK E LI+P ++ +V Sbjct: 346 AKYKKLELTEK----GINREDLFQLISYSYILKAEKAG-----LIFPSMEQSVNSE 392 >UniRef50_A7ZBW9 Putative uncharacterized protein n=1 Tax=Campylobacter concisus 13826 RepID=A7ZBW9_CAMC1 Length = 441 Score = 96.2 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 52/302 (17%), Positives = 107/302 (35%), Gaps = 25/302 (8%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGY-VLNKGVLQLSRRGLELDYNPNTEIIPGI 72 ML YA + NL I+ Y + + + + GL +Y + Sbjct: 98 MLNYANDIYIDDVSLGKSVDAKENLSKIIIYYLFIQTLERAFLLGLPKEYKDKNYHEAKV 157 Query: 73 KGRIEFAKTIRGFHLNHGKTVSTFDMLNE--DTLANRIIKSTLAILIKHEKLNSTIRDEA 130 G+++ AK I+ GK ST + D + ++ L I+ K + + Sbjct: 158 MGKVDVAKFIKSDIPFTGKISSTNRERQDMGDIVL--LLHKALKIVQKE---SKELIKPV 212 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRY------YKFVISVCKFIVNNSIPGQNKGH 184 + L I L + + + YK V+ K I+ N G Sbjct: 213 INTLSYLNEIREPRLVTPNVIHNALNSKALHNPIYTPYKKVLEYAKLIIENEDAGTKSNG 272 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 + F + ++ L++ ++ + ++E + T ++ + + D Sbjct: 273 KQNLGFLVD---VAELFEIYIRKLLQKEFKDWSVTSPKIELYKDKFFARKII------PD 323 Query: 245 ITIRSSEKILIVDAKYYKSIFS--RRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI 302 I + S +++L+ D KY + + G + +Q+ Y+ + + I G L+ Sbjct: 324 IVMSSGDQVLVFDTKYKRMNMQGKDQYGLGDVDRNDFFQINTYMSYYQNQGKNVIAGGLL 383 Query: 303 YP 304 YP Sbjct: 384 YP 385 >UniRef50_A7GF31 Putative uncharacterized protein n=1 Tax=Clostridium botulinum F str. Langeland RepID=A7GF31_CLOBL Length = 419 Score = 94.6 bits (234), Expect = 4e-18, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 106/313 (33%), Gaps = 20/313 (6%) Query: 11 IYYMLTYAWGYLQ-EIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 +Y +L YA+G + ++ I + D++ Y L L RRG++ Y E + Sbjct: 76 LYQLLRYAYGLRELKLFNVAEHTIDNFSFFDLIIYELYVEAEDLLRRGIQKSYIHREENL 135 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 +GRI+ + L + +E+ + N+ L L+S ++ + Sbjct: 136 SSPRGRIDMNRLCGQGGLIKDTLPCKYFNRDENNILNQ-TLLAGLKLGLKLVLDSGLKIK 194 Query: 130 ARSLYRKL-PGISTLHLTPQ--HFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 + + L IS + LT + + + T Y V + + + R Sbjct: 195 LQRICSNLKENISDITLTRGSLQLARNSINRLTGRYSAVFEIINILYESQGIQLENA-SR 253 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRR---ELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + + +M+ ++ + + + + + + + P Sbjct: 254 YINLRGYFFDMNAFFETLVGRLLENCSDRYSIKDQFSLHDMFIYTPGFNPCRRKSPTPRP 313 Query: 244 DITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI 302 D + K++ ++DAKY LYQL Y S + I + Sbjct: 314 DFALIRQGKVVKLLDAKYRNLWEKN------LPRDMLYQLAIYAVSGIGDKTATI----L 363 Query: 303 YPHVDTAVKHRYK 315 YP ++ + Sbjct: 364 YPSLNDVTTVQMI 376 >UniRef50_C7M8S6 McrBC 5-methylcytosine restriction system component-like protein n=2 Tax=Capnocytophaga RepID=C7M8S6_CAPOD Length = 437 Score = 93.9 bits (232), Expect = 9e-18, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 102/298 (34%), Gaps = 39/298 (13%) Query: 32 AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP-GIKGRIEFAKTIRGFHLNHG 90 + L L + + R+GL+ Y E + IKG+I+ K ++ Sbjct: 133 EQQKDTLTPFLMVQFLLLLKCIVRKGLKKSYYTVEENLNNRIKGKIQLDKHLKQNVF-KN 191 Query: 91 KT---VSTFDMLNEDTLANRIIKSTLAILIKHEKL--------NSTIRDEARSLYRKLPG 139 K V + D+L NR +K L +I + + +IR+ Sbjct: 192 KLTAHVCRYQEFGMDSLENRFLKKVLQFIISFKNTHSNYFAGNDESIRELITYCSPHFEL 251 Query: 140 ISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSL 199 IS L + L + Y+ I + K I+ + + +M Sbjct: 252 ISE-ELDVESLKKLTTNPFFKEYEEAIRIGKQILKRFSYNITETTQQKVAIPPFWIDMPK 310 Query: 200 LYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAK 259 L++ ++Y+ + + S + D + + D + + E +++DAK Sbjct: 311 LFELYVYKKLQEQFGSRGEVHYHFTGDYTEL-------------DFLLNTPEYKMVIDAK 357 Query: 260 YYKSIFSRRMGTEKFHSQNLYQLMNYL------WSLKPENGENIGGLLIYPHVDTAVK 311 Y R+ ++ Q+ Y +LK ++ I L+IYP + Sbjct: 358 YKTVYEDSRV------IDDIRQVSAYARLERVYKALKIDDNRLIDCLIIYPSFEQNTD 409 >UniRef50_C1QC92 McrBC 5-methylcytosine restriction system component n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC92_9SPIR Length = 409 Score = 89.2 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 51/276 (18%), Positives = 100/276 (36%), Gaps = 34/276 (12%) Query: 46 LNKGVLQLSRRGLELDYNPNTEIIPG-IKGRIEFAKTIRGFHL--NHGKTVSTFDMLNED 102 + + GL+ ++ E + IKG+I+F+ I+ + + + ++ + + Sbjct: 131 FLRMLELELHNGLKRNFIRKEENLNSKIKGKIDFSNHIKKNIMTARNDRVYCSYFDYDIN 190 Query: 103 TLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYY 162 L NRI+K L I + S +S L + + Y Sbjct: 191 CLENRILKKALKICYSNIGSIYNS----FSCMTFFSEVSD-ELHFYELHNIKLNPLYKKY 245 Query: 163 KFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY 222 K +I + I+ + F + MSLL++K++Y L + N Y Sbjct: 246 KLLIKLAINIIKLKRYKDSNKENYAPPFYID---MSLLFEKYVYALLDDSLKNKNAKILY 302 Query: 223 LKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQL 282 + + +++ D I+ + I D KY S + +++ QL Sbjct: 303 QEVYSRH----------KLKPDFIIKCNGYDYIADTKYKSSC------NNGINIEDIRQL 346 Query: 283 MNY--LWSLKPENGENIG-----GLLIYPHVDTAVK 311 Y + S+ E +I ++IYP D+ K Sbjct: 347 SGYGRVESIVKEFTNDIENYIPNCIIIYPSDDSNNK 382 >UniRef50_C8PX06 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PX06_9GAMM Length = 426 Score = 86.9 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 44/246 (17%), Positives = 92/246 (37%), Gaps = 30/246 (12%) Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN-----S 124 P ++G++ + ++ K + DT NR++K+T+ +I + Sbjct: 156 PYLQGKLLVKEQLQHNFHQPHKFYHQTENFAMDTAGNRLVKTTIERVIGSMAMPLPPQWQ 215 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + A+ LY L + Q + L R Y F IS C ++ ++G Sbjct: 216 VVDTVAKDLYDSLFSQAL-----QELTALPSLLAQRNYTF-ISFCYALLTLQQAS-SQGQ 268 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 + + N M ++K++ + + N K ++ ++ D Sbjct: 269 FLTPTWLVN---MPFAFEKWVGRKIHEQFAAQNFELVEQKRQPLTVQQGL-----TIKPD 320 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 I ++S++K++I D K+ K+ ++YQL+ Y + LI P Sbjct: 321 IWLKSADKLIIADVKWKKTPTFN-----DISLADMYQLLTYASEFDAD-----EAWLIVP 370 Query: 305 HVDTAV 310 + T + Sbjct: 371 TLGTQL 376 >UniRef50_C2BVA3 Putative uncharacterized protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVA3_9ACTO Length = 390 Score = 86.5 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 60/333 (18%), Positives = 110/333 (33%), Gaps = 52/333 (15%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDI----LGYVLNKGVL------QLSRRGLELDYN 63 ML Y ++ + + D L +L++ +L ++ RR Sbjct: 72 MLLYLHDSQSAKLMNDVPSEYSSGSHDFCLSSLAEMLSQELLSFAAKPKIFRR------K 125 Query: 64 PNTEIIPGIKGRIEFAKT-IRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 P E G+I + T +R + ++ D NRIIKS ++ Sbjct: 126 PTLEATSSAVGQINWPVTNLRARRGDAAPILTRRHRPTFDVPENRIIKSAAKRVLGLLSS 185 Query: 123 N----STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIP 178 + D A G + Q N G + YYK +S+ I+ S Sbjct: 186 DAPGRRVTHDWANWQAATFAGYDDIRKVSQMMRTTNIGGSHSYYKNALSLSLVILEASGI 245 Query: 179 GQNKGHYRFYDFERNEKEMSLLYQKFLY-EFCRRELTSANTTRSYLKWDASSISDQSLNL 237 + + F N M LY+ F+ R +A + + + +++ + L Sbjct: 246 DHGE-SWESDGFLFN---MPGLYEDFVRTSLMRAAQPTALSVQKGFASSSFLLANGEIEL 301 Query: 238 LPRMETDITIRSSEKI-LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGEN 296 +P D+TI I ++D KY +LYQ+ Y+ + + Sbjct: 302 IP----DLTIYRGGTIEAVLDVKYKAPDAK-----------DLYQIYTYM-----QFAQL 341 Query: 297 IGGLLIYP--HVDTAVKHRYKINGFDIGLCTVN 327 +I P V+ +G I ++ Sbjct: 342 NEAYIISPSVRTGDMVE---TFDGHRIRYLGLD 371 >UniRef50_B9KCB6 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KCB6_CAMLR Length = 459 Score = 86.2 bits (212), Expect = 2e-15, Method: Composition-based stats. Identities = 52/298 (17%), Positives = 106/298 (35%), Gaps = 17/298 (5%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLD-ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGI 72 ML +A + + NN+ + IL Y+ + + + GL +Y Sbjct: 111 MLNFANDVFVDDVSIYQDVKKENNISEFILFYIFVQKLEKSFLIGLPKNYQSKKYNDLRF 170 Query: 73 KGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKH-EKLNSTIRDEAR 131 KG I+ + I+ K + ED ++ L ++ K ++ Sbjct: 171 KGNIDMVEFIKHNIPLKAKVATKTREQIEDVYIINVLYKALEVIEKKNSGFLKNVKHIKT 230 Query: 132 SLYRKLPGISTLH--LTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG-QNKGHYRFY 188 L + + L S +N + YK ++ K I+ + +NK + Y Sbjct: 231 YLVQNKSEHCFIKESLNKAFSSKALRNQNYQNYKELLKYAKMIIESQNFTSKNKNDQKSY 290 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 F N ++ L++ ++ + R L+ +S + + DI + Sbjct: 291 GFIVN---IAELFEIYISKLLRNNFEEYMVDSPKLEIYKNSFYKRHII------PDIVLS 341 Query: 249 SSEKILIVDAKYYKSIFSRR--MGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 + ++ D KY + R G +L+Q+ Y+ + G+ + G L+YP Sbjct: 342 KDDTYMVFDTKYKRMKMEGRSQNGMGDLDRNDLFQIHTYM-GYYQKIGKVLLGGLLYP 398 >UniRef50_B8DPA2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPA2_DESVM Length = 397 Score = 84.6 bits (208), Expect = 5e-15, Method: Composition-based stats. Identities = 69/353 (19%), Positives = 118/353 (33%), Gaps = 48/353 (13%) Query: 4 PVIPVRNIYYMLTYA----WGYLQEIK-QANLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 P I N +++L A LQE+ A+ + I+ L V ++ R Sbjct: 61 PKIGDVNFFHLLFKAEGLQNNTLQELNSFASYFTNEDHTPPIIIAKNLLLSVNEILHRSP 120 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLN-HGKTVSTFDMLNEDTLANRIIKSTLAILI 117 + + G + KTI G H H T DT NRI+ + + I I Sbjct: 121 TAKRFKVKKNGNFVAGSLNIQKTIFGIHSRAHKPIHYTVKEKTLDTPENRILTAAINIAI 180 Query: 118 KHEKLNSTIRDE------ARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKF 171 + E + ++ L T Q+ + G YYK I + + Sbjct: 181 DLLPTEQRLSYEPIYLAWLQRFPHSTDILADLETTAQNIASNKYGGPRDYYKRSIILAQI 240 Query: 172 IVNNSIPGQNKGHYRFYDFERNEKEMSL--LYQKFLYEFCRRELTSANTTRS-YLKWDAS 228 + G F + ++ +++KF+ + E T+ S S Sbjct: 241 LFGYRG----YGLSGTTSFTGDAILLNTAAVFEKFVRKIISLEYTAKGIVVSKETNSPYS 296 Query: 229 SISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLW 287 ++ S ++ P DI I LI DAKY K + YQL YL Sbjct: 297 LYTNGSYSVCP----DIIISEGGNLRLIADAKYKKPTI-----------SDHYQLYTYLS 341 Query: 288 SLKPENGENIGGLLIYPH-VDTAVKHR-------YKINGFDIGLCTVNLGQEW 332 L + G LI P ++ + + I + + ++L + + Sbjct: 342 VLGAK-----RGALIAPSFTGFDIETKEFQTPSQHTITEIYLPMQNIDLAENF 389 >UniRef50_A8UWA1 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UWA1_9AQUI Length = 443 Score = 84.2 bits (207), Expect = 6e-15, Method: Composition-based stats. Identities = 44/239 (18%), Positives = 84/239 (35%), Gaps = 29/239 (12%) Query: 62 YNPNTEIIP-GIKGRIEFAKTIRGFHLN--HGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 + + + +KG+I KT + H KT F +L D L N+I+K+ L IK Sbjct: 176 FVFVEQDLNGRVKGKINIKKTYQRHMSKGIHTKTTCRFQILTHDFLDNQILKAALIQAIK 235 Query: 119 HEKLN----STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 +L S + + L +S + FS + YK + + + I+ Sbjct: 236 FVRLMKFEISGLNEIMNYLSYLFESVSLKRVLDTDFSKVRHSPFFPEYKEALELARMILK 295 Query: 175 NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQS 234 N ++ + M L++ +++ + + + Y Sbjct: 296 NLGNDPFSNVSKYTTIQPYIINMPKLFELYVWLKLKGKFSRGKVIYQYNA---------- 345 Query: 235 LNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 D + K LIVDAKY + + K ++++ QL Y + + Sbjct: 346 ----NGDIPDFIVE--GKNLIVDAKY------KYIDESKPSTEDIGQLSRYGRNKEVRK 392 >UniRef50_D2Q5W0 3-isopropylmalate dehydrogenase n=2 Tax=Bifidobacterium dentium RepID=D2Q5W0_9BIFI Length = 510 Score = 82.3 bits (202), Expect = 3e-14, Method: Composition-based stats. Identities = 35/313 (11%), Positives = 87/313 (27%), Gaps = 48/313 (15%) Query: 29 NLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLN 88 + + +L ++ + + +G+ Y +G I+ + I Sbjct: 154 DFDTERDGAWQKMLMLMIPFHLERAMSKGIYKQYMSRRYNDSRPRGVIDIPRHISRNVPF 213 Query: 89 HGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQ 148 G + D +I+ T+ + + + R++ R + Sbjct: 214 RGTVAYNTREFDVDNPVTELIRHTIEYIRAQGNFGTQL---LRNVNRDTDTYV-REIRQA 269 Query: 149 HFSYLNGG--------------KNT--RYYKFVISVCKFIVNNSIPGQNKGHYRFYDFER 192 + + + ++ Y+ + +C I+ G + Sbjct: 270 TWKHYDSNARAKIIHENRTHPVRHAFYSEYRDLQQLCLKILTKQGVDTGCGEDAVHGLLF 329 Query: 193 NEKEMSLLYQKFLYEFCRRELTSANTTRSY---------LKWDASSISDQSLNLLPRMET 243 + L++++L LKW + + Sbjct: 330 SCSW---LWEEYLNTLLSGHFKDYEVKHPRNLDQHKKDALKWPIFKVGGTDDQTENWLIP 386 Query: 244 DITIRS---SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 D ++ + +++DAKY + + +QL+ Y + GL Sbjct: 387 DFLLKPTADDRENIVMDAKYK--------PRKNISRDDRFQLLAYKLYFRS-----HKGL 433 Query: 301 LIYPHVDTAVKHR 313 +Y D A K + Sbjct: 434 FLYAAKDEAEKEK 446 >UniRef50_D0YRU6 Putative ATPase family associated with various cellular activities (AAA) n=1 Tax=Mobiluncus mulieris 28-1 RepID=D0YRU6_9ACTO Length = 461 Score = 80.8 bits (198), Expect = 8e-14, Method: Composition-based stats. Identities = 18/90 (20%), Positives = 41/90 (45%), Gaps = 7/90 (7%) Query: 266 SRRMGTEKFHSQNLYQLMNYLWSLKPENGEN-----IGGLLIYPHVDTAVKHR--YKING 318 + S NLYQ+ Y+ + + E ++ + G+L+Y D ++ Y+++G Sbjct: 368 PKNWDKHTIVSANLYQIFTYVKNKQAELNQSGGSRQVSGMLLYARTDEDIQPDGVYQMSG 427 Query: 319 FDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 I + T++L + + +L I + + Sbjct: 428 NQISVTTLDLNCPFEQLSAQLNSIAATHFE 457 >UniRef50_UPI0001B4EC67 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4EC67 Length = 424 Score = 79.6 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 41/338 (12%), Positives = 97/338 (28%), Gaps = 40/338 (11%) Query: 3 QPVIPVR--NIYYMLTYAWGYLQEIKQANLEAIPGNNLLD----------ILGYVLNKGV 50 +P P+ + L YA N + P L + ++ L Sbjct: 69 RPKFPIAGDRLIDWLCYA----------NKQEEPDETLRNWPLGSDGYAGLVPAALLHEC 118 Query: 51 LQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIK 110 +L R GL DY + + ++GR++ + + + N + Sbjct: 119 RRLLRHGLRRDYVRHHRVDTTLRGRLDVEAQATRCYGAVDRLHLQTFEYQDGGWENLVCG 178 Query: 111 STLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCK 170 + L + + + R + P + +Y+ + + Sbjct: 179 AALTVAARRSSDPAQTRWLL-DAAAQFPSPRQPLDAVSLLQRGQYTRLNTHYRAAHAWAR 237 Query: 171 FIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSI 230 ++ Y F +++L+++ + + + A Sbjct: 238 MVLGGGGVTNLLEPYGFGAKSL-MLNLNVLWERVVRRMAVDAAVDLGGRGARGEEKAIHT 296 Query: 231 SDQSLNLLPRMETDITIRSSEKI--------LIVDAKYYKSIFSRRMGTEKFHSQNLYQL 282 Q + P D+ + + L VDAKY + + + + +QL Sbjct: 297 HGQRNDKTPTFNPDVLLAFPPQTDSSADIRFLAVDAKY------KGYMEKNVSAADRHQL 350 Query: 283 MNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKINGFD 320 + Y+ L+++P + ++ G Sbjct: 351 LTYIAGYTAPEYPL--ALVVHPSAAAPTERELRVQGPR 386 >UniRef50_C9PUK8 Putative uncharacterized protein n=2 Tax=Bacteroidales RepID=C9PUK8_9BACT Length = 437 Score = 79.2 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 58/316 (18%), Positives = 106/316 (33%), Gaps = 49/316 (15%) Query: 29 NLEAIPGNNLLDILGYVLNKGVLQLSRR----GLELDYNPNTEIIPG-IKGRIEFAKTIR 83 + AIP + D+L L L + RR GL Y E + IKGR A+ ++ Sbjct: 119 HKPAIPISQQQDLLSIFLITEYLSVLRRIAAKGLRKSYYMVEENLNNKIKGRCLVARNVK 178 Query: 84 GFHLNHGKT---VSTFDMLNEDTLANRIIKSTLAILI------KHEKLNSTIRDEARSLY 134 L+ G+ + + D+ NRI+K L + +H ST+ + R + Sbjct: 179 QN-LSKGRVTNNFCRYQVYGIDSCENRILKRALRFCVKQLEVYRHAFDTSTLDNIVRFVN 237 Query: 135 RKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN-KGHYRFYDFERN 193 + +T + G + + + + + ++ G+++ Sbjct: 238 PHFDNVGE-EVTTKAIQTFKGNPIFKEHSTAVELAQLLLRRYSYDITLAGNHQITTPPF- 295 Query: 194 EKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 +MS L++ +++ R TS N + + + P Sbjct: 296 WIDMSKLFELYVFRHLRLVFTSKNEVCYHPKAHRQELDYLLKPCHWAEP----------- 344 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL--------KPENGENIGGLLI 302 +VDAKY R G + Q+ Y EN I L++ Sbjct: 345 ---YVVDAKYK----PRYKGMIGIDKDDARQVAGYARLQKIYDMLKLDAENALPIKCLIV 397 Query: 303 YPHVDTAVKHRYKING 318 YP D + R+ Sbjct: 398 YP--DQEQQERFTFTD 411 >UniRef50_C2LL98 Putative uncharacterized protein n=2 Tax=Proteus mirabilis RepID=C2LL98_PROMI Length = 475 Score = 79.2 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 45/272 (16%), Positives = 95/272 (34%), Gaps = 23/272 (8%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLN 100 +L Y+ N + R GL Y + I ++G I T + + GK + ++ + Sbjct: 161 LLAYLWNIKFKRAYRLGLPKTYITRNDRISRVRGTIN--ATDYFQNKSSGKYLCSYREHS 218 Query: 101 EDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTR 160 D+ A + + + R+ + GI S+ Sbjct: 219 YDSPATSLFIKAYEAVAHYSFC-HQTRNIYSAFLTANQGIKRSQQEILRTSHFTNPFYN- 276 Query: 161 YYKFVISVCKFIV--NNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRE-LTSAN 217 Y +I + K ++ S + F+ ++S+L++ F+ + +R+ L Sbjct: 277 DYNVLIDLSKQVIGRKGSDFDSQQDSSAFF------FDISMLFEYFIRKLIKRDGLRLLG 330 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 + + ++S + ++E D+ I E++ + D KY + Sbjct: 331 KFELCQEIASGALS----GYMRKLEPDLVIEIDERLFVFDVKYKA-----FDSQFGVKRE 381 Query: 278 NLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 +L+QL Y+ G IYP + Sbjct: 382 DLFQLHTYIGQYGNIAAIKGCG-FIYPISEER 412 >UniRef50_Q5UZU6 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5UZU6_HALMA Length = 186 Score = 78.8 bits (193), Expect = 3e-13, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 42/99 (42%), Gaps = 6/99 (6%) Query: 4 PVIPVR------NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 P I + N+ Y+L YA + G++ +D L + N+ + ++ RRG Sbjct: 83 PTIQISPKAAGNNLLYLLRYAQNVSPTTIEQQTGLGQGDSFVDALAALFNQELQEILRRG 142 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 L +Y + ++GR++ + ++ K T+ Sbjct: 143 LHSEYQTVSSEEKQLRGRLDVQRQLQRQGPVPTKFECTY 181 >UniRef50_Q139N0 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q139N0_RHOPS Length = 434 Score = 78.1 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 58/353 (16%), Positives = 127/353 (35%), Gaps = 48/353 (13%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGN----NLLDILGYVLNKGVLQLSRRGL 58 +P PV N+ ++ + L I A+ + + ++L+ L L + ++ RGL Sbjct: 77 RPKFPVSNLARVIDTSKRQLNSIPGADRSYLANDLSGGSVLNFLAANLVDALRPIAARGL 136 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVS----TFDMLNEDTLANRIIKSTLA 114 +Y+ +E +GRIE A T+RG + G+ FD D NRI+K+ L Sbjct: 137 HKEYSCRSETTSHPRGRIEIAGTMRG--WSRGQFHKVQAQRFDQ-TSDLPVNRILKAALE 193 Query: 115 ILIK----HEKLNSTIRDEARSLYRKLPGIS------TLHLTPQHFSYLNGGKNTRYYKF 164 ++K H + + A + + + P + L + + + + YY Sbjct: 194 SVLKLMWPHSTESRRLIVRANASFLEFPQLVGSCKPLDLAESQAILAARSLPADRIYYYR 253 Query: 165 VISVCKFIVNNSIPG--QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY 222 I + I+++ + F N + L++++L + + + Sbjct: 254 AIEIALLILSSRGISLQEEGVDVLLDSFIINFDD---LFEEYLRRVLQARAPNL-LSVKD 309 Query: 223 LKWDASSISDQSLNLLPRMETDITIR--SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLY 280 ++ + P + D+ + + ++ + KY + ++ Sbjct: 310 GNFEGKRQLFEDRKDQPA-QPDVVLTWQPTSVNVVGEIKYKD----------RPSRDDIN 358 Query: 281 QLMNYLWSLKPENGENIGGLLIYPHVDTA---VKHRYKINGFDIGLCTVNLGQ 330 Q + Y + +LI+ ++H I G + +LG Sbjct: 359 QAITYALCYNTKC-----AVLIHQCRSGESRGLRHHGTIRGIRLENYAFDLGA 406 >UniRef50_C8VZS1 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VZS1_DESAS Length = 439 Score = 76.9 bits (188), Expect = 9e-13, Method: Composition-based stats. Identities = 45/267 (16%), Positives = 99/267 (37%), Gaps = 14/267 (5%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLN 100 ++ ++ + ++ GL +KGRI K++ L + VS F Sbjct: 127 LISFIWLHKLANANKHGLPRHNVKKNYTGYNVKGRINVKKSV-ISLLTKEQVVSEFYEKE 185 Query: 101 EDTLANRIIKSTLAILIKHEKLN-STIRDEARSLYRKL--PGISTLHLTPQHFSYLNGGK 157 D RI+ IL+K +L ++ D AR + L S+ ++ + +N + Sbjct: 186 IDETIARILVQAYWILVKDYELGILSLPDNAREIINLLKSSRFSSQSVSQNEYDRINYKE 245 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 + ++ V+ I+NN P ++ F +M+ +++ ++ + L Sbjct: 246 IYQSFREVVDFSWDIINNKTPSKSVCTQSQNGFSF-FIDMAEIWELYIRTALSKHLKKDK 304 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 ++ + + DI ++ + + DAKY + Sbjct: 305 WKVML----DYAVVYEDTFFKRCLIPDIVVKRGADVAVFDAKYKAM----NYSSLDVDRN 356 Query: 278 NLYQLMNYLWSLKPENGENIGGLLIYP 304 + +Q+ Y+ + + + G LIYP Sbjct: 357 DFFQIHTYM-NYYAQGKRLLAGGLIYP 382 >UniRef50_D1W5H2 Conserved domain protein n=3 Tax=Prevotella RepID=D1W5H2_9BACT Length = 440 Score = 76.9 bits (188), Expect = 1e-12, Method: Composition-based stats. Identities = 45/295 (15%), Positives = 100/295 (33%), Gaps = 23/295 (7%) Query: 13 YMLTYAWGYLQEIKQANL-EAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPG 71 Y+L Y + +L ++ D + ++ + R+G+ +Y + Sbjct: 106 YLLHYMLQKVLSFNLFDLSHNNEEEDVFDFIMFMFPYFLKAAMRQGVYREYQNFSHNDAN 165 Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEAR 131 +KG I+ A+ I G + + D +I+ T+ + S + + Sbjct: 166 LKGTIDIAQHIAKNVPFVGNIAYSTREYSHDNNMTELIRHTIEFMKTKRYGQSVLNVDHE 225 Query: 132 SLYRKLPGISTLHLTPQHFSYLNGGKNTR--------YYKFVISVCKFIVNNSIPGQNKG 183 ++ + L ++ KN R Y+ + +C + + K Sbjct: 226 TIENVKAIVEHTPLYNKNERGCIINKNLRVKAHPYFTEYRPLQMLC---LQILRMDEVKY 282 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 D + + L+++++ R + + K D S P Sbjct: 283 GESNNDICGILFDGAWLWEEYVNTILR-DYDFKHPENKLHKGGIYLFDDHSGIRYPDFYK 341 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 D +++DAKY + K S +++Q+M Y+ +LK + G + Sbjct: 342 D--------DMVLDAKY--KLLGSYDKVSKVDSDDIHQVMAYMTALKVDQGGFVA 386 >UniRef50_Q1LJY6 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LJY6_RALME Length = 424 Score = 76.5 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 42/223 (18%), Positives = 74/223 (33%), Gaps = 18/223 (8%) Query: 3 QPVIPVRNIYYMLTYAWG----YLQEIKQA-NLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 +P +P+ N+ +L Y+ L+ ++A + L V ++ R G Sbjct: 71 RPKVPLVNLERIL-YSSNHKPYVLESFQRAYGHHTRASEPIETFLVDRFLDWVEEIHRFG 129 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILI 117 L Y E +GR+ FA TIR N +D D NR IK+ L L Sbjct: 130 LLKQYRVIHEDGFTPRGRLNFAATIRLRSRNQQSLAYQWDDRTADNGPNRFIKAVLLQLA 189 Query: 118 KHEKLNSTIRDEAR--SLYRKLPGIST-------LHLTPQHFSYLNGGKNTRYYKFVISV 168 ++ R +AR +S + L + YYK I + Sbjct: 190 DRDEFLHDRRRKARLSVCVDYFSSVSDVDVHSVLIDPLVDDVEQLPSTR--EYYKTAIVL 247 Query: 169 CKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRR 211 K ++ S + ++K++ ++ Sbjct: 248 AKMLIEKSGLAFLSDQHSVLLPTLLLDLDEA-FEKYVLTLLQQ 289 >UniRef50_D1YRX5 Conserved domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YRX5_9FIRM Length = 445 Score = 76.1 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 48/298 (16%), Positives = 93/298 (31%), Gaps = 34/298 (11%) Query: 29 NLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLN 88 + + LLD L ++ K + R+GL Y IKG+I+ + + Sbjct: 115 DADTSSDKRLLDFLLFIFPKYLGSAIRKGLYKQYIYKQYNDMKIKGKIDIPRHLIRNIPF 174 Query: 89 HGKTVSTFDMLNEDTLANRIIKSTLA---------ILIKHEKLNSTIRDEARSLYRKLPG 139 G + + + D +I+ T+ I++ K + A YR Sbjct: 175 IGSIAYSQRLFSYDNTLIELIRHTIEFIKSKSYGSIILSDIKEEVNLIVNATQSYRACDR 234 Query: 140 ISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSL 199 + ++ + Y + +C I+ + G Y + Sbjct: 235 QKIIEQNKKNIIRHAYFR---EYSVLQRLCILILKSEKHDIGGGIQNSYGILFDGAW--- 288 Query: 200 LYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAK 259 L+++++ SD + P D +S+ IVDAK Sbjct: 289 LWEEYINILLNSHFYHPKNKSKSGAQQL--FSDGKGLIYP----DFISKSTAPRSIVDAK 342 Query: 260 YYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKIN 317 Y + ++ Q++ Y++ G IYP + V K+N Sbjct: 343 YK--------PIDNIRGRDYLQVLAYMYRFDA-----YKGYYIYPESNEQVPEILKLN 387 >UniRef50_B9CT57 Putative uncharacterized protein n=1 Tax=Staphylococcus capitis SK14 RepID=B9CT57_STACP Length = 415 Score = 75.4 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 89/265 (33%), Gaps = 13/265 (4%) Query: 34 PGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTV 93 ++D+ ++ + + R+GL +Y N +KG I+ + I+ GK Sbjct: 93 RDEKVIDLFIFIFPYYLKKAMRKGLYKEYTRNEYNNHNVKGTIDIQRHIKNNTPFIGKIA 152 Query: 94 STFDMLNEDTLANRIIKSTLAILIKHE---KLNSTIRDEARSLYRKLPGISTLHLTPQHF 150 + + D ++I+ T+ + K + + S +R+E R + + L Sbjct: 153 YSQREFSYDNYILQLIRHTIEFIKKKKEGVNVLSNVRNEVREICEVTTSYNYLDRNKVLM 212 Query: 151 SYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCR 210 YYK + K + ++ + + L+++++ Sbjct: 213 FNNKQPIRHAYYKEYRELQKLCLTILQQHKHYIGSNAEKIHGIIFDGAWLWEEYIDTLIH 272 Query: 211 RELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMG 270 + S Q + + D R+ +I DAKY Sbjct: 273 EDYYHPKNKGGSGAQRL--FSTQMGAKIGLVYPDFIGRNQNYRIIGDAKYK--------P 322 Query: 271 TEKFHSQNLYQLMNYLWSLKPENGE 295 + +++ Q++ Y++ + G Sbjct: 323 IQNIGNRDYLQVLAYMYRFDAKTGY 347 >UniRef50_UPI0001B550BC hypothetical protein StAA4_26514 n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B550BC Length = 449 Score = 75.4 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 52/284 (18%), Positives = 95/284 (33%), Gaps = 24/284 (8%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 + ++ + + +R GL + ++G ++ T S Sbjct: 118 IAELTAALWRAALTAAARHGLPSFRTRRGHVGSAVRGSLDSPGTFALRAARSPFVASVER 177 Query: 98 MLNEDTLANRIIKST---LAILIKHEKLNSTIRDEARSLYRKL---PGISTLHLTPQHFS 151 +++I + L L+ H D + +L G + + Sbjct: 178 AKLLGNPVSQVIVAADQVLDTLLHHR--PGWRGDRVEEIVPRLRESVGARPRLPSLRDLR 235 Query: 152 YLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRR 211 + T YK V + IV + P + R + + ++ L++ FL RR Sbjct: 236 SVRYTPITLPYKRVADLSWQIVQRTAPQASPTDERTHGLLID---VAELWELFLLRCARR 292 Query: 212 E--LTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIFSRR 268 L + T+ ++ + +L R+ DI I +E IVDAKY Sbjct: 293 ATALPVTHGTQYHVPAPLLRSARHPTAVLGRLFPDILIGPAESPTAIVDAKYKP-----L 347 Query: 269 MGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKH 312 ++LYQL YL + E G L+YP +D Sbjct: 348 NDRRGVDREDLYQLNAYLTAHNAEL-----GALVYPTLDQHPSP 386 >UniRef50_UPI000185C9B8 conserved hypothetical protein n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185C9B8 Length = 411 Score = 75.4 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 44/303 (14%), Positives = 87/303 (28%), Gaps = 22/303 (7%) Query: 21 YLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAK 80 I + D Y+ + + +GL Y +KG I+ + Sbjct: 70 LATNIFNFEQSPNNEETIWDFWLYLFPYCLKKAYAQGLYKAYQRKQYNDANVKGSIDVKR 129 Query: 81 TIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGI 140 + GK T + D +++I+ T+ L H N+ + +A + Sbjct: 130 HLLKNLPFAGKIAYTTKEHSYDNPLSQLIRHTIEYLRTHPIGNALLNTDAEMRTMVSQFV 189 Query: 141 STLHLTPQHFSYLNGGKNTR---------YYKFVISVCKFIVNNSIPGQNKGHYRFYDFE 191 T + Y + +C I+N+ + + Y Sbjct: 190 FHTQNTYNKNARRKVIMANAKPFVHPYFTEYAPLQKICLNILNHEKLTFGEEKDKIYGLL 249 Query: 192 RNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSE 251 + L++++L F + + + P D +S Sbjct: 250 FDGAW---LWEEYLNTFLDEDFK--HPENLKGNGREYLFKKGKQPIYP----DFISKSGS 300 Query: 252 KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVK 311 L+ DAKY + G + + ++Y Y E G L+YP Sbjct: 301 PKLVGDAKYIPLDKHKSYGEDSERAISIY----YKTITYMYRFETNRGFLLYPCSKEDSD 356 Query: 312 HRY 314 + Sbjct: 357 KPF 359 >UniRef50_B5CQ36 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CQ36_9FIRM Length = 124 Score = 73.8 bits (180), Expect = 9e-12, Method: Composition-based stats. Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 5/73 (6%) Query: 280 YQLMNYLWSLKPENG---ENIGGLLIYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPC 334 YQ+ Y+ + + E + G+L+Y D AV + YK++G I + T++L ++ Sbjct: 46 YQIFTYVKNKEIELSAQPHEVFGMLLYAKTDEAVLPNNSYKMSGNTISVKTLDLDCDFSE 105 Query: 335 IHQELLDIFDEYL 347 I +L I + + Sbjct: 106 IANQLNKIVESHF 118 >UniRef50_A9DR64 Putative uncharacterized protein n=1 Tax=Kordia algicida OT-1 RepID=A9DR64_9FLAO Length = 474 Score = 71.9 bits (175), Expect = 3e-11, Method: Composition-based stats. Identities = 49/317 (15%), Positives = 107/317 (33%), Gaps = 26/317 (8%) Query: 14 MLTYAWGYLQEIKQANLE-AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGI 72 ML + + + + N +I+ ++ + + + + GL +Y TE + Sbjct: 124 MLNFVNDIYVDNQSTKADKTEETNEFQNIIAFLFIQSLEKATVLGLPKNYQSITERSNKV 183 Query: 73 KGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARS 132 +G+I+ ++ G STF ++ L K I + Sbjct: 184 RGKIDINAYLKREIPFTGNLTSTFREQIYVQEIIDVLYLACKALEKR--FGKEIHKKILG 241 Query: 133 LYRKLP-GISTLHLTPQHFSYLNG-----GKNTRYYKFVISVCKFIV--NNSIPGQNKGH 184 +Y+ L S + +K I + I+ N + Sbjct: 242 VYQLLKLNYSGVFPQNSVIEKAKNHFVLQNPMFSAFKKTIGYAEIILREQNLLVSNTDNQ 301 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 + + +S L++ +L + R T Y+ Q + +M D Sbjct: 302 LTTNGYLFD---VSQLFEVYLEKLLSRYF-----TDWYVTGQEELNVYQKMFYKRKMFPD 353 Query: 245 ITIRSS--EKILIVDAKYYKSIFSRRMGTE-KFHSQNLYQLMNYLWSLKPENGENIGGLL 301 + ++ ++++ DAK+ K + + + YQ+ +Y+ +P+ I G L Sbjct: 354 LVMKHKLTNQLIVFDAKFKKMRLHKTQSSYSDLDRSDFYQIHSYIHYYQPDV---IAGGL 410 Query: 302 IYPHVDT-AVKHRYKIN 317 +YP + + Y N Sbjct: 411 LYPLSNEININTTYSEN 427 >UniRef50_D0Z8V4 5-methylcytosine restriction system component n=2 Tax=Edwardsiella RepID=D0Z8V4_EDWTE Length = 185 Score = 70.4 bits (171), Expect = 9e-11, Method: Composition-based stats. Identities = 16/127 (12%), Positives = 45/127 (35%), Gaps = 8/127 (6%) Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 KG R + M +++ ++ + R + + + + S+ + R+ Sbjct: 2 KGDNRAFSLLF---PMEKVFEHYVAKTLREQYAPQVAVHAQV--QSKSLVTHADAQWFRL 56 Query: 242 ETDITIRSSEKIL-IVDAKYY--KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 + D+ + ++++ ++D K+ + + YQ+ Y + Sbjct: 57 KPDMVMIQGKQVIAVLDTKWKLLDPTLANGADKYALQQSDFYQMFAYGHHYFDQQITVRE 116 Query: 299 GLLIYPH 305 L+YP Sbjct: 117 MFLVYPA 123 >UniRef50_C9BWA4 Guanosine 5'-monophosphate oxidoreductase n=4 Tax=Enterococcus faecium RepID=C9BWA4_ENTFC Length = 430 Score = 70.0 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 37/280 (13%), Positives = 90/280 (32%), Gaps = 38/280 (13%) Query: 39 LDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDM 98 D+L ++ + + R+G+ +Y IKG ++ AK IR GK Sbjct: 124 YDLLVFLFPYYLNEAMRKGIYKEYVKREYNNANIKGAVDVAKHIRSNVPFVGKVAYRTRE 183 Query: 99 LNEDTLANRIIKSTLAI-------LIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFS 151 + + ++I+ T+ L+ ++ + + ++ ++ Sbjct: 184 FSYNNHLTQLIRHTIEKIQNEYDFLLSGDEDTKENVVLIKQTTPDYARLDQFNILQENIF 243 Query: 152 YLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG-HYRFYDFERNEKEMSLLYQKFLYEFCR 210 + Y + +C I++ + G ++S L+++++ Sbjct: 244 HPVKHSYYEEYSALQQICIQILS----EEKSGFGSDKNQIHGIIIDVSWLWEEYI----- 294 Query: 211 RELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSE---KILIVDAKYYKSIFSR 267 W + E + + R + + +D KY K + + Sbjct: 295 ---------GKVTGWKHYGRDKGLATMHLFQEPNRSPRYPDFTFNNIPIDTKYKKHLDT- 344 Query: 268 RMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 + QL+ Y+ + + + I G + P D Sbjct: 345 --------RNDYNQLVTYIHIMNLDQADTIKGGFLQPTSD 376 >UniRef50_B3JJV2 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JJV2_9BACE Length = 439 Score = 68.8 bits (167), Expect = 3e-10, Method: Composition-based stats. Identities = 46/277 (16%), Positives = 95/277 (34%), Gaps = 34/277 (12%) Query: 51 LQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIK 110 + +G+ +Y N ++G I+ + ++ +G+ + D +I+ Sbjct: 146 NEALTQGIYKEYQRNEYNDANVRGTIDINRHLKTNLPFNGRIAYRTREFSHDNHVTELIR 205 Query: 111 STLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK---------NTRY 161 T+ + K ++++A ++ + + H++ K Y Sbjct: 206 HTIDYIGKTSFGKMLLKNDA----DTHTSVAQIIHSTPHYNRQEREKIVKANLKVITHPY 261 Query: 162 YKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRS 221 Y + K + +NK + ++S L++ E+ L+ Sbjct: 262 YSSYTPLQKLCLRILRHEKNKYGAKDDKIHGVLFDVSYLWE----EYLATILSKQGFKHP 317 Query: 222 YLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQ 281 K I N PR D +I DAKY ++I + ++ Q Sbjct: 318 NNKRGTGRIYLALPNQFPRY-PDFYREKGS--VIADAKYKRNIDT---------RDDVNQ 365 Query: 282 LMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKING 318 ++ YL+ LK + G+ I P K RY + G Sbjct: 366 MITYLYRLKAQ-----KGVFILPTNKVRTKERYHLYG 397 >UniRef50_UPI000197B531 hypothetical protein BACCOPRO_00002 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B531 Length = 446 Score = 67.3 bits (163), Expect = 7e-10, Method: Composition-based stats. Identities = 44/265 (16%), Positives = 95/265 (35%), Gaps = 22/265 (8%) Query: 35 GNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVS 94 L D L Y+ + + + +G+ +Y N ++G I + +R +G+ Sbjct: 131 DEQLFDFLLYMFPRFLNEALSQGIYKEYKRNEYNDANVRGTININRHLRTNMPFNGRIAY 190 Query: 95 TFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD--EARSLYRKLPGISTLHLTPQHFSY 152 + + D +I+ T+ + K + + + + E R+ ++ + + + Sbjct: 191 STREFSHDNHVTELIRHTIDYISKSKFGRTLLENDSETRTSVTQIISATPSYCRQEREIV 250 Query: 153 LNGGK---NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFC 209 + N YY + K + + K + ++S L++++L Sbjct: 251 VKSNLKEINHPYYSRYTPLQKLCLRILRHEKIKYGEKKNKIHGILFDVSYLWEEYLATIL 310 Query: 210 RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRM 269 ++ K I N LPR D LIVDAKY K Sbjct: 311 TKQ----GFKHPNNKKGFGCIYLAKHNRLPRY-PDYYREYD--RLIVDAKYKKETN---- 359 Query: 270 GTEKFHSQNLYQLMNYLWSLKPENG 294 +++Q++ Y++ +K + G Sbjct: 360 ------RDDIHQMITYMYQMKGKRG 378 >UniRef50_C9PRC9 Putative uncharacterized protein n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PRC9_9PAST Length = 481 Score = 66.5 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 54/342 (15%), Positives = 116/342 (33%), Gaps = 39/342 (11%) Query: 13 YMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGI 72 Y+++ A G+L+ + ++ +L Y+ N + + R G+ Y+ ++ I + Sbjct: 126 YIISDADGFLEIKDFS--ATEKKDSYAWLLAYLWNIKLKRAYRLGIPKVYSSKSDRISTV 183 Query: 73 KGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARS 132 +G I+ + + GK + +F + + A + + H T R Sbjct: 184 RGSIDPLDYFKNH--SSGKCLCSFREHDYASTAISLFLMAYDTVKHHSFCQQT-----RY 236 Query: 133 LYRKLPGIST-LHLTPQHFSYLNGGKNT--RYYKFVISVCKFIVNNSIPGQNKGHYRFYD 189 +Y + T + N Y +I + K I++ + G + Sbjct: 237 IYNAFMMANQGKKKTKKEILETPYFSNPYYSDYNTLIDLSKRIISQK--SLDFGSSNASN 294 Query: 190 FERNEKEMSLLYQKFLYEFC-RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 + M L++ F+ + R + + S K S+ ++E D+ I Sbjct: 295 AYLFDISM--LFEYFIRKLLIRSGINVRSKFESLRKIQTCSLGKYER----KLEPDLIIE 348 Query: 249 SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP---- 304 + I D KY + ++L+QL Y+ GG I+P Sbjct: 349 GENGVYIFDVKYKH-----FDEKYGVNREDLFQLHTYI-GQWSNKETVCGGGFIFPIPEK 402 Query: 305 --------HVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQE 338 + ++ + G ++ + L I +E Sbjct: 403 KWEKLCLEKTQGVISNKIQQQGKEMDFYVIFLPIPKENIAKE 444 >UniRef50_B2UGA5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Ralstonia pickettii 12J RepID=B2UGA5_RALPJ Length = 164 Score = 64.6 bits (156), Expect = 5e-09, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 53/148 (35%), Gaps = 29/148 (19%) Query: 215 SANTTRSYLKWDASSISDQSLNLLPRMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEK 273 + + YL ++ S L M+ D+T R I+D K+ + S E Sbjct: 12 RLQSPQKYLAFEESQQRSAFL-----MKPDVTASRDGRVRWILDTKWKE--LSAGEAKEG 64 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA-----VKHRYKIN----------G 318 +LYQ+ Y +L+YPH V+ Y +N Sbjct: 65 VAQSDLYQMYAYA-----SCYNCSEVVLLYPHHGALGQSAGVRATYLLNPWAERASQEPA 119 Query: 319 FDIGLCTVNLGQEWPCIHQELLDIFDEY 346 + + T++L + + ++L I +Y Sbjct: 120 RRVRVATMDLA-DLKTVPRQLERIVLDY 146 >UniRef50_B0TG77 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TG77_HELMI Length = 485 Score = 63.4 bits (153), Expect = 1e-08, Method: Composition-based stats. Identities = 37/291 (12%), Positives = 91/291 (31%), Gaps = 40/291 (13%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTV--STFDM 98 +L ++ + + R L+ + + +G +++ + R F Sbjct: 153 VLSTMVLFRIRAMLER-LQRRFTIGEAELTAPRGTVDWGRYARTKVPTGRLLEVPCRFPD 211 Query: 99 LNEDTLANRIIKSTLAILIKHEKLNSTIR-------DEARSLYRKLPGISTLHLTPQH-F 150 L +D + TL + + + L ++ + T + Sbjct: 212 LRDDAALLGALHFTLRRQLASLESQRATGTVVLPLIALCQGLLDRVRHVPPRRPTDRDRL 271 Query: 151 SYLNGGKNTRYYKFVISVCKFIVN---NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYE 207 +L T ++ + ++ ++ + ++ G + + S + +++ Sbjct: 272 LWLRAPLQTDVFRDGLRAMEWTIDERGLAGLAEHTG----LPWVMSMDAFSEAWCEYVVT 327 Query: 208 FCRRELT------SANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYY 261 R T + L W+ Q + D+ + ++ +I+DAKY Sbjct: 328 ELARHYGGHVRAGRLRETVTPLLWEPPFTGSQRF-----LMPDLVLERDDETIIIDAKYK 382 Query: 262 KS----IFSRRMGTEKF----HSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 R E H +L Q++ Y S + + L+YP Sbjct: 383 SHWEDLSLERWFDLETTVRERHRNDLLQVLAYTTSY---ATKRVTACLLYP 430 >UniRef50_C1QFM5 McrBC 5-methylcytosine restriction system component n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QFM5_9SPIR Length = 430 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 51/315 (16%), Positives = 107/315 (33%), Gaps = 28/315 (8%) Query: 13 YMLTYAWGYLQEIKQANLE-AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPG 71 Y L Y + + NLE + +N D L Y+ + R+GL Y Sbjct: 106 YFLHYILMKVLNLNIINLEHSKDYDNSFDFLIYMFISFFKKALRQGLFKQYKLIKHNDCH 165 Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE----KLNSTIR 127 +KG I+ + I+ +GK + D ++I+ T+ + ++ I+ Sbjct: 166 VKGTIDINRYIKNNIPFNGKISYNTREYSYDNNMTQLIRHTIEYINTKNRYILGYDNEIK 225 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTR----YYKFVISVCKFIVNNSIPGQNKG 183 + + ++ P + N K + Y+ + +C I+ + Sbjct: 226 NYIQQIFYSTPSYE--KNKRESIINKNLKKLSHPYYYEYEPLRKICIQILRHEKLKYGSD 283 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 Y + +++++L F + + + S LN + Sbjct: 284 DNTVYGLLFDGAW---IFEEYLNTFL------SKINFIHAENRTSKNGINLLNNAWIVYP 334 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 D S +++DAKY + +E + +Q+++Y ++L + IY Sbjct: 335 DFYKLSENNNIVLDAKYKRL---DNYISENIDRNDKHQIVSYAYTLNAKKAG-----FIY 386 Query: 304 PHVDTAVKHRYKING 318 P + K I Sbjct: 387 PTENNNYKDYDYIGN 401 >UniRef50_D1PC49 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PC49_9BACT Length = 437 Score = 62.7 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 38/274 (13%), Positives = 84/274 (30%), Gaps = 33/274 (12%) Query: 35 GNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVS 94 N+L IL + + ++G+ +Y ++G I+ ++ IR G Sbjct: 123 EENILKILVLMFPTMLKTAMKQGIYKEYRKIQYNDSNVRGTIDISRHIRENIPFCGNISY 182 Query: 95 TFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLN 154 D D +I+ T+ I+ L I + + + + +H L Sbjct: 183 DTDEFCYDNAVMELIRHTIEY-IRTIPLGDMILSSNEVVEEYVSKVISYTPCYRHSDRLK 241 Query: 155 GGKNTRY---------YKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFL 205 Y + +C I+N G + L++++L Sbjct: 242 IIHENLTPCRHPYYTGYNALQKICIQILNQEDMKYGDGDGSVSGILFDGAW---LWEEYL 298 Query: 206 YEFC----RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYY 261 + T + ++ +K ++DAKY Sbjct: 299 NTLLCDYDFNHPQNKQGTGAIYLFEHGGKRYPDFW--------------KKDFVLDAKYK 344 Query: 262 KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGE 295 K +++ ++ Q++ Y++ LK + Sbjct: 345 K--YAQSGNKLDIAIDDINQIVTYMFRLKSQKSG 376 >UniRef50_A5WH29 Putative uncharacterized protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WH29_PSYWF Length = 542 Score = 60.7 bits (146), Expect = 7e-08, Method: Composition-based stats. Identities = 41/314 (13%), Positives = 94/314 (29%), Gaps = 65/314 (20%) Query: 47 NKGVLQLSRRG---LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDT 103 K R+G L Y ++ P +G+I + ++ S + + + Sbjct: 191 LKKAEMQLRQGADILPSRYQSKSQNQPKAQGKINMSAQLKNNWHRPHYLYSEQTVFDTNK 250 Query: 104 LANRIIKST---LAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLN------ 154 + + + L L+ +++S A S ++ S TP L Sbjct: 251 RLAQFLFTAWQQLRYLLHPSQVDSNYAVSAHSHRQQHLNHSPYSATPIALQQLKALAADQ 310 Query: 155 -------------------GGKNTRYYKFVISVCKFIVNNSIPGQNK-----GHYRFYDF 190 G + + + +++++S+ Q Sbjct: 311 WLPTYKALQSEALTWRATLGTRQAQLLTQALDWAWWLLSHSLQSQPPDPSARNSTSLLPT 370 Query: 191 ERNEKEMSLLYQKFLYE----FCRRELTSAN-TTRSYLKWDASSISDQSL---------- 235 M +++++ + ++ L + + +W + + D + Sbjct: 371 AALIINMQFAFERWVLGKLSVWVQQTLPGSRLIVQPSFEWLYAHLEDSNHLAFVCGKPIA 430 Query: 236 --NLLPRMETDITIRSSEKIL--IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP 291 +L+ R++ D I S + ++D KY ++ + QL Y Sbjct: 431 AYHLVQRLQPDACIYDSRAKITHVIDIKYKALSTVQQ-----VSGSDWQQLYVY---QHY 482 Query: 292 ENGENIGGLLIYPH 305 N LIYP Sbjct: 483 LNRP--QAWLIYPK 494 >UniRef50_Q6L339 Putative uncharacterized protein n=1 Tax=Picrophilus torridus RepID=Q6L339_PICTO Length = 217 Score = 60.3 bits (145), Expect = 9e-08, Method: Composition-based stats. Identities = 34/215 (15%), Positives = 73/215 (33%), Gaps = 26/215 (12%) Query: 150 FSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFC 209 F+ ++ + ++ + + I+ N +M++++Q F F Sbjct: 3 FNTVSFNRLNERFEIPYNYAEMIMKNMRLDIGNDKRTM----MMLFDMNMIFQNFFTIFI 58 Query: 210 ---RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR----SSEKILIVDAKYYK 262 RR++ T R ++ + + L + D+ I + + I I+D KY Sbjct: 59 IRNRRKIFQGKTVRIIPQYSRRNFIFSDSHALRITKPDLYIEVEDINKKNIFILDMKYKL 118 Query: 263 S--------IFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP-HVDTAVKHR 313 I +LYQ+ Y + G +L++P V Sbjct: 119 LQKADIEEYINDHIEDVYSVSQLDLYQMFTY-----SDLYGTDGTILVFPGRVGAISNPY 173 Query: 314 -YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +K NG + +C + L + L++ + Sbjct: 174 MFKENGRILWICIIPLDFTGDSWEERLVECVKGFF 208 >UniRef50_D2MHZ4 Putative uncharacterized protein (Fragment) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MHZ4_9BACT Length = 164 Score = 60.0 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 26/120 (21%), Positives = 42/120 (35%), Gaps = 11/120 (9%) Query: 195 KEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI--RSSEK 252 M +Y+ F+ RR T + S M+ DI++ SS Sbjct: 18 FPMEQVYEDFVTHGFRRYQTEFQVVAQGPRERMLQPSSGHNA----MKPDISLCKPSSNV 73 Query: 253 ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKH 312 I+DAK+ + H +LYQ+ Y K + L+YP T ++ Sbjct: 74 RFILDAKWIHLGENENKTISDVHGSDLYQIYAYGKRYKCQTV-----ALVYPRNSTFIRP 128 >UniRef50_D2EQZ8 Putative uncharacterized protein n=1 Tax=Streptococcus sp. M143 RepID=D2EQZ8_9STRE Length = 478 Score = 58.8 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 39/283 (13%), Positives = 90/283 (31%), Gaps = 16/283 (5%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNH-GKTVSTFDML 99 I+ Y+ + +++ L Y + I+G I+ + I ++ K + Sbjct: 161 IVQYLFLVSLRKVAGTNLPKKYVYKKDRDYSIRGNIDIERYITNDLVSSDKKISFRYPER 220 Query: 100 NEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNT 159 I+ + + + I L G T + KN+ Sbjct: 221 ENAQNIIDILYCAIKECSVEQAVLPDILSVRNYLAESFSGRRPSKYTVNNILKDKILKNS 280 Query: 160 --RYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 YK + ++I+N + + + S L++ ++Y + L Sbjct: 281 LYANYKKPLQYAQYILNLRELN-DGNTNKSNSVSGYLVDASFLWEMYIYNLMKIHLHDWE 339 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRS--SEKILIVDAKYYKSIFSRRMGTEKFH 275 + D +R + +I+++DAK+ Sbjct: 340 VDAQE---ELHFYEQTFYAKDNY--PDFVLRHRLTGEIVVLDAKFKNM----EFNGRDVD 390 Query: 276 SQNLYQLMNYLWSLKPENGENIGGL-LIYPHVDTAVKHRYKIN 317 + ++ QL Y + + G+ G LIYP + +++ ++ Sbjct: 391 NADIRQLHGYSYYYHLQYGDKFRGAGLIYPAKERIPQNKVNVD 433 >UniRef50_Q5UZU4 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5UZU4_HALMA Length = 157 Score = 54.2 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 61/154 (39%), Gaps = 37/154 (24%) Query: 197 MSLLYQKFLYEFCRRELTSANTTRSYLKWDASSI--SDQSLNLLPR--METDITIRS--- 249 M+ +Y++ + + + S+ +W+A+ + ++ P M D + + Sbjct: 1 MNTVYERVIERAVK------SVAESHDRWEATGQAHTTNLISGTPTVNMYPDFAVSNVET 54 Query: 250 -----SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 E +++ DAK+ T + ++YQL +Y+ + K G++ YP Sbjct: 55 ETDDGDETVVVGDAKWK---------TGSVSNDDIYQLTSYVLARK------APGIVFYP 99 Query: 305 HVDTAVKHRYKINGF-DIGLC---TVNLGQEWPC 334 D A + Y+I ++ + T + + Sbjct: 100 AQDGAAEREYQIKNEWELKIVELPTEDYSASFES 133 >UniRef50_C2WFQ0 Putative uncharacterized protein n=1 Tax=Bacillus cereus Rock3-44 RepID=C2WFQ0_BACCE Length = 439 Score = 53.8 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 47/306 (15%), Positives = 101/306 (33%), Gaps = 22/306 (7%) Query: 13 YMLTYAWGYLQEIKQA-NLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPG 71 YML +G I ++ + A +L ++ + + ++G Y Sbjct: 83 YMLKKVFGSKAFIFESMQVSASRDKAFEQMLLFIFVYLLEKAMKKGTFKQYTNFDFYDSN 142 Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL-IKHEKLNSTIRDEA 130 IKG I F ++ + G+ + N I + +L K+ + + Sbjct: 143 IKGTINFNVYMKQIMIQDGRLPYHVRERSAMNPVNIGILTAYDVLKSKNPTFVQQVFKKN 202 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGK------NTRYY---KFVISVCKFIVNNSIPGQN 181 L R + I + K Y+ + + VC I+ +S Sbjct: 203 PVLSRFIAQIKGELPNYRSIDKKQLLKQLTKSIRHPYFTEVESLRKVCIEIIRHSGSDIF 262 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 + + +++ L++ FL E+ + D+ + + ++ Sbjct: 263 QEENQMITGLL--IDVNKLWEYFLEHTIFAEMKKMAASYEVSTQDSYPVLYELKGFDMKI 320 Query: 242 ETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 + D + R+ + + DAK + T ++YQ+ Y+ +L G Sbjct: 321 KPDFVLSRNKQNQAVFDAK--HRPAWSKKDTSAVKK-DVYQISMYMSALNVSIGG----- 372 Query: 301 LIYPHV 306 +IYP Sbjct: 373 VIYPTT 378 >UniRef50_B9ZE69 Putative uncharacterized protein n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZE69_NATMA Length = 586 Score = 53.8 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 53/295 (17%), Positives = 108/295 (36%), Gaps = 45/295 (15%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANL-------EAIPGNNLLDILGYVLNKGVLQLSRR 56 P I I+ ML + + I + + I +++L ++ +G+ + R Sbjct: 100 PKIDWGPIFDMLLAVYDQNRSIDYHGIPLQDFLSDDIELDDVLVVMAINYLEGLETIQRN 159 Query: 57 GLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVST----FDMLNEDTLANRIIKST 112 G D +G I+ +T LNH + + D AN ++ Sbjct: 160 GYIRDLILRRTNSLEGRGEIDVEQT----LLNHARGTVEPNWIRNETEYDNAANSLLHYA 215 Query: 113 LAILIK-------------HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG--- 156 L++ ++++ S + E L G+ + + + G Sbjct: 216 GKTLLRLFRQNASEYDHPGYDRIFSEVHREIERLE--GMGVDSGLDRIDEYRRITLGDLP 273 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYR--FYDFERNEKEMSLLYQKFLYEFCRRELT 214 K +YY+ V K ++++S+ Q K R D+ N M L++++ REL+ Sbjct: 274 KQRQYYRKAFDVAKAVLSSSLGQQLKDGPRELVVDYVLN---MESLFEQYSQVVIERELS 330 Query: 215 SANTTRSYLKW-DASSISDQSLNLLPR-----METDITIRSSEKIL-IVDAKYYK 262 + + + S S+N E D ++ E+ + ++D+KYY Sbjct: 331 YIKSYDHFDSIANVKPASSPSVNPFEGENQIYHEPDHALQEGEETIAVLDSKYYA 385 >UniRef50_Q4FV58 Putative uncharacterized protein n=2 Tax=Psychrobacter RepID=Q4FV58_PSYA2 Length = 514 Score = 53.4 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 40/311 (12%), Positives = 94/311 (30%), Gaps = 60/311 (19%) Query: 45 VLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTL 104 ++ + + +L+ Y ++GR+ + +R + K V +L++ L Sbjct: 169 LITQFLQRLAHHQPITHYQTQIHNQTALQGRLLIKEQLRHNSMQPHKFVCERSVLSKGML 228 Query: 105 ANRIIKSTLAILIKHEK-------LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLN-GG 156 ANRIIKS L +L L + Y S + Sbjct: 229 ANRIIKSALKLLAPLLSQSNLLLYLQPWQQVSVLHQYEIRQLASIYFQAKHELAIQPLQA 288 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN--------EKEMSLLYQKF---- 204 + + + ++ +++ S F + +M+ ++++ Sbjct: 289 QQLQAAQQLVDFAYWLLCQSHAETGHSIDSQNPFHKKLTPQRLCLLIDMNQAFEQWASQR 348 Query: 205 LYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS-----EKIL----- 254 + F ++ ++ + + ++D + D+ I E Sbjct: 349 IALFFQQL---SDDYKPLFQTQRVWLNDAEGQACLSIRPDLLIYKQIHSSAENTAMYDNY 405 Query: 255 -----------------IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 ++D K+ + + + YQL +Y + + E Sbjct: 406 VSQAKDAREKHSRHYSHVIDIKWKHLAHASA-----ISASDAYQLSSYAQAYQAEQVW-- 458 Query: 298 GGLLIYPHVDT 308 L+YP D Sbjct: 459 ---LVYPVQDD 466 >UniRef50_C3JNW8 Putative uncharacterized protein n=1 Tax=Rhodococcus erythropolis SK121 RepID=C3JNW8_RHOER Length = 415 Score = 51.5 bits (122), Expect = 4e-05, Method: Composition-based stats. Identities = 41/286 (14%), Positives = 81/286 (28%), Gaps = 65/286 (22%) Query: 63 NPNTEIIPGIKGRIEFAKTIR---GFHLNHGKTVSTFDMLNEDTLANRIIKSTL---AIL 116 E + + R++ A+ IR DT NR+ L ++ Sbjct: 83 RRREETL---RSRLDVAQYIRDRGRPAARPRSFPLVTQERQLDTPENRLAAGVLGNIRLI 139 Query: 117 IKHEKLNSTIRD--EARSLYRKLPGISTLHLTPQ-----------HFSYLNGGK----NT 159 + +E + + ARS +R L IS + + + N Sbjct: 140 LANEIFPARTAESTLARSHFRALTKISREPVFSSLKRTSFARKDMTLTRFRVNRRMTGND 199 Query: 160 RYYKFVISVC--KFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 Y+ ++ + + +E +++ + E R L Sbjct: 200 HPYRQLLQWIDNWLSIAGLVGDAGGDQTVDLALPESESYWEKVFEVWCLEQTRSGLLRLG 259 Query: 218 TT----------RSYLKWDASSISDQSLNLLPRME--------------------TDITI 247 R+ S QS+++ +M+ DI I Sbjct: 260 WHTDSDFRLHSSRARSPIATFSKDGQSVDVFFQMQIPLGLGRWKSERTAAALVGIPDIAI 319 Query: 248 R-SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 LI+DAK+ S+ E+ Y+++ Y + Sbjct: 320 ACKGRAPLIIDAKWRFRSLSQGTSEEQ------YKMLGYAENFAHN 359 >UniRef50_B9D5W2 Putative uncharacterized protein n=1 Tax=Campylobacter rectus RM3267 RepID=B9D5W2_WOLRE Length = 427 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 36/241 (14%), Positives = 78/241 (32%), Gaps = 21/241 (8%) Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK----LNSTIR 127 + G ++ I+ GK S D + R++ IL++ + IR Sbjct: 149 LHGSLDVKNFIKKDQPFMGKISSRKSSRVPDEVVARVLLKAYDILVRKNPKFALYDKEIR 208 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIV--NNSIPGQNKGHY 185 + S + S + + YK + + + I+ ++ +N Sbjct: 209 NFLL-ANANGEMKSVKDINVALNSKSVMNELYKDYKIALQIARVIILQDSRYANENAVKN 267 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDI 245 + + L++ ++ L + L + R D Sbjct: 268 LNFGYLL---YAPNLFELYVKGLIEEAL-KILREKHNLDFMLLYQWPNDDE---RYRVDY 320 Query: 246 TIR-SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 ++ ++ +++DAKY E S NL Q+ Y+ + K + G+L+Y Sbjct: 321 LLKDKKDRTIVIDAKYRYFCERCGYCNETDMS-NLVQIGKYIAAYKAKM-----GILVYA 374 Query: 305 H 305 Sbjct: 375 R 375 >UniRef50_Q9ZMQ3 Putative n=6 Tax=Campylobacterales RepID=Q9ZMQ3_HELPJ Length = 406 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 42/256 (16%), Positives = 84/256 (32%), Gaps = 20/256 (7%) Query: 58 LELDYNPNTEI--IPGIKGRIEFAKTIRGFHL------NHGKT-VSTFDMLNEDTLANRI 108 L Y + KGRI F+KTI+ N + F + + N + Sbjct: 92 LSHGYYSENKSYYENNAKGRINFSKTIKKNRPIIQTFNNKNSFVYTRFQVKRKMINENEL 151 Query: 109 IKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISV 168 I + + HE + + + K + + K + Sbjct: 152 I-TAINKYCVHEAFSKFGFVFSSFMPPKFNLPTDKNYCIYLLENKLNNTFNDDKKILFQS 210 Query: 169 CKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDAS 228 K I+ Q+ DF+ +++++ + + + KW+ Sbjct: 211 MKNILL-----QDDNILDKTDFKFGTHHFYVVWERMIDRAFG--IKNKEVYFPKTKWNLR 263 Query: 229 SISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWS 288 + LL + D + +KI I+DAKYYK S + + Q++ ++ Sbjct: 264 CSNQNPDYLL---QPDSIMLFDDKIYILDAKYYKYGISGVASDLPNSASIIKQIVYGEYA 320 Query: 289 LKPENGENIGGLLIYP 304 K E + + + + P Sbjct: 321 AKLETKKEVYNIFLMP 336 >UniRef50_A6Y209 McrBC 5-methylcytosine restriction system component n=2 Tax=Gammaproteobacteria RepID=A6Y209_VIBCH Length = 434 Score = 43.4 bits (101), Expect = 0.013, Method: Composition-based stats. Identities = 28/217 (12%), Positives = 67/217 (30%), Gaps = 30/217 (13%) Query: 116 LIKHEKLNSTIRDEARSLYRKLPGISTLHLT------PQHFS--YLNGGKNT------RY 161 LI+ ++ N D + L +S + + S + + R Sbjct: 182 LIRFDRKNGWNGDIVAAAKELLSEVSDPSVASSLVRLIEDLSPQNVAANRRNPIPARHRA 241 Query: 162 YKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN--TT 219 +K + + ++ N+G + N +++ L R + Sbjct: 242 WKPLHELSIDVLGGLGLNYNQGQAHAPGYLVNTW---RVWEDLLTVAARLGFGRSAVVPQ 298 Query: 220 RSYLKWDASSISDQSLNLLPRMETD--ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 + Y +S +++ L + D I + + +++DAKY + G + Sbjct: 299 QGYPLGTKIRMSTGAVSKL-SVYPDCVIELDGTRPRILLDAKYKGHVEK---GQLRISEA 354 Query: 278 NLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRY 314 ++Y+ + + + L YP Sbjct: 355 DIYEALAFSKATGCNLV-----ALAYPAQPGDAPQPV 386 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_C1PCD8 McrBC 5-methylcytosine restriction system compon... 310 5e-83 UniRef50_UPI0001C37412 5-methylcytosine-specific restriction enz... 309 1e-82 UniRef50_P15006 Protein mcrC n=9 Tax=Escherichia RepID=MCRC_ECOLI 308 2e-82 UniRef50_A6BZ07 McrC protein n=1 Tax=Planctomyces maris DSM 8797... 299 1e-79 UniRef50_B2GJY9 Putative McrC protein n=3 Tax=Actinomycetales Re... 299 1e-79 UniRef50_C8W851 McrBC 5-methylcytosine restriction system compon... 292 1e-77 UniRef50_UPI000197B905 hypothetical protein BACCOPRO_00614 n=1 T... 292 1e-77 UniRef50_Q1QFB3 Putative uncharacterized protein n=1 Tax=Nitroba... 272 1e-71 UniRef50_A3UV38 Putative uncharacterized protein n=1 Tax=Vibrio ... 270 5e-71 UniRef50_B0AA68 Putative uncharacterized protein n=1 Tax=Clostri... 269 7e-71 UniRef50_B4V8L4 Putative uncharacterized protein n=1 Tax=Strepto... 267 6e-70 UniRef50_C9ZD45 Putative uncharacterized protein n=3 Tax=Strepto... 265 2e-69 UniRef50_Q1IK47 McrC protein n=1 Tax=Candidatus Koribacter versa... 264 4e-69 UniRef50_B2JTN9 Putative uncharacterized protein n=1 Tax=Burkhol... 262 2e-68 UniRef50_B9MJI1 Putative uncharacterized protein n=1 Tax=Diaphor... 257 4e-67 UniRef50_D0D6B3 5-methylcytosine-specific restriction enzyme sub... 257 4e-67 UniRef50_UPI0001909ACF hypothetical protein RetlI_30663 n=1 Tax=... 257 4e-67 UniRef50_D1PB97 Putative uncharacterized protein n=1 Tax=Prevote... 256 7e-67 UniRef50_A1UB52 Putative uncharacterized protein n=2 Tax=Mycobac... 256 7e-67 UniRef50_C2G7A3 5-methylcytosine-specific restriction enzyme sub... 256 8e-67 UniRef50_C1QDS7 McrBC 5-methylcytosine restriction system compon... 253 6e-66 UniRef50_B2FKW7 Putative uncharacterized protein n=3 Tax=Xanthom... 252 1e-65 UniRef50_Q39M33 McrBC 5-methylcytosine restriction system compon... 251 3e-65 UniRef50_D1SH86 McrBC 5-methylcytosine restriction system compon... 251 4e-65 UniRef50_A5WF57 McrBC 5-methylcytosine restriction system compon... 250 6e-65 UniRef50_Q0KFR0 5-Methylcytosine-specific restriction enzyme C n... 248 2e-64 UniRef50_C4T585 McrBC 5-methylcytosine restriction system compon... 247 3e-64 UniRef50_D0VFC8 McrBC 5-methylcytosine restriction system compon... 246 7e-64 UniRef50_C7NBA8 McrBC 5-methylcytosine restriction system compon... 245 2e-63 UniRef50_A6ALR1 5-Methylcytosine-specific restriction enzyme C n... 242 1e-62 UniRef50_D0Z341 McrBC 5-methylcytosine restriction system compon... 241 3e-62 UniRef50_Q9RZI4 Putative uncharacterized protein n=1 Tax=Deinoco... 241 4e-62 UniRef50_UPI0001972FC4 McrBC 5-methylcytosine restriction system... 240 5e-62 UniRef50_C3FCB5 McrBC 5-methylcytosine restriction system compon... 239 2e-61 UniRef50_C4G4B6 Putative uncharacterized protein n=3 Tax=Bacteri... 238 2e-61 UniRef50_B7LQZ4 Putative 5-methylcytosine restriction system com... 236 1e-60 UniRef50_A6VF50 Putative uncharacterized protein n=2 Tax=Methano... 236 1e-60 UniRef50_D1WRX1 McrBC 5-methylcytosine restriction system compon... 236 1e-60 UniRef50_A4Y9J5 Putative uncharacterized protein n=3 Tax=Gammapr... 235 1e-60 UniRef50_A6X8D9 Putative uncharacterized protein n=1 Tax=Ochroba... 235 2e-60 UniRef50_A7ZRA6 Putative uncharacterized protein n=3 Tax=Enterob... 234 3e-60 UniRef50_B7KTT4 IQ calmodulin-binding-domain protein n=2 Tax=Rhi... 234 3e-60 UniRef50_UPI0001BCC8FD 5-methylcytosine-specific restriction enz... 234 5e-60 UniRef50_B1BM79 Putative uncharacterized protein n=1 Tax=Clostri... 232 2e-59 UniRef50_Q2SJR5 McrBC 5-methylcytosine restriction system compon... 230 7e-59 UniRef50_C5EXA5 Putative uncharacterized protein n=1 Tax=Helicob... 229 1e-58 UniRef50_C5CG01 McrBC 5-methylcytosine restriction system compon... 228 2e-58 UniRef50_B2A7U4 McrBC 5-methylcytosine restriction system compon... 227 4e-58 UniRef50_B7UQU6 McrC family protein, predicted McrBC 5-methylcyt... 226 7e-58 UniRef50_D0C387 McrBC 5-methylcytosine restriction system compon... 226 1e-57 UniRef50_C7MHK3 McrBC 5-methylcytosine restriction system compon... 225 3e-57 UniRef50_Q6LZ73 Putative uncharacterized protein n=1 Tax=Methano... 224 3e-57 UniRef50_B0A6E5 Putative uncharacterized protein n=1 Tax=Clostri... 221 4e-56 UniRef50_B5IHH3 McrBC 5-methylcytosine restriction system compon... 220 5e-56 UniRef50_D2QGI2 5-methylcytosine restriction system component-li... 220 7e-56 UniRef50_D0BXD9 McrBC 5-methylcytosine restriction system compon... 219 2e-55 UniRef50_C7NQX2 McrBC 5-methylcytosine restriction system compon... 219 2e-55 UniRef50_C0WJ24 5-methylcytosine-specific restriction enzyme sub... 218 3e-55 UniRef50_A0JT91 McrBC 5-methylcytosine restriction system compon... 217 4e-55 UniRef50_C8S833 McrBC 5-methylcytosine restriction system compon... 216 7e-55 UniRef50_A7H1L9 Putative uncharacterized protein n=13 Tax=Campyl... 215 1e-54 UniRef50_UPI0001BCB4AE McrBC 5-methylcytosine restriction system... 215 2e-54 UniRef50_Q2FNZ4 McrBC 5-methylcytosine restriction system compon... 214 3e-54 UniRef50_Q2J945 McrBC 5-methylcytosine restriction system compon... 214 4e-54 UniRef50_C2WWH8 Putative uncharacterized protein n=3 Tax=Bacillu... 212 2e-53 UniRef50_Q7VG80 Putative uncharacterized protein n=1 Tax=Helicob... 211 2e-53 UniRef50_D2QXZ3 McrBC 5-methylcytosine restriction system compon... 211 3e-53 UniRef50_C9ZD40 Putative uncharacterized protein n=1 Tax=Strepto... 209 1e-52 UniRef50_Q10YL1 McrBC 5-methylcytosine restriction system compon... 207 3e-52 UniRef50_C7PTE7 McrBC 5-methylcytosine restriction system compon... 203 7e-51 UniRef50_B1BM86 ATP-dependent helicase priA n=8 Tax=Clostridium ... 202 1e-50 UniRef50_D2AT08 McrBC 5-methylcytosine restriction system compon... 202 1e-50 UniRef50_A4J0H3 McrBC 5-methylcytosine restriction system compon... 202 1e-50 UniRef50_C5DAA2 McrBC 5-methylcytosine restriction system compon... 202 2e-50 UniRef50_B7D079 5-methylcytosine-specific restriction enzyme sub... 200 8e-50 UniRef50_A2TPW2 Putative uncharacterized protein n=1 Tax=Dokdoni... 197 4e-49 UniRef50_D2B1B6 McrBC 5-methylcytosine restriction system compon... 196 9e-49 UniRef50_C0GTU8 Putative uncharacterized protein n=1 Tax=Desulfo... 195 1e-48 UniRef50_A8EUN6 McrBC catalytic subunit McrC, putative n=2 Tax=C... 195 2e-48 UniRef50_Q26DA3 Putative uncharacterized protein n=2 Tax=Flavoba... 195 2e-48 UniRef50_Q466P1 Putative uncharacterized protein n=2 Tax=Methano... 194 3e-48 UniRef50_C6NTT9 Putative uncharacterized protein n=1 Tax=Acidith... 193 8e-48 UniRef50_B5HWV8 Putative uncharacterized protein n=1 Tax=Strepto... 190 4e-47 UniRef50_A4TAT5 McrBC 5-methylcytosine restriction system compon... 189 1e-46 UniRef50_A1ZU11 Putative uncharacterized protein n=1 Tax=Microsc... 189 2e-46 UniRef50_D1SMG2 McrBC catalytic subunit McrC, putative n=1 Tax=M... 187 6e-46 UniRef50_A6EMU6 5-Methylcytosine-specific restriction enzyme C n... 186 1e-45 UniRef50_Q4C3F9 Similar to McrBC 5-methylcytosine restriction sy... 185 2e-45 UniRef50_Q188G2 Putative uncharacterized protein n=9 Tax=Clostri... 180 8e-44 UniRef50_Q97QG4 Conserved domain protein n=27 Tax=Streptococcus ... 178 3e-43 UniRef50_C5A3Z2 McrBC 5-methylcytosine restriction system compon... 177 4e-43 UniRef50_Q5JJA9 Putative 5-methylcytosine restriction system, ca... 176 8e-43 UniRef50_UPI000185C9B8 conserved hypothetical protein n=1 Tax=Ca... 176 1e-42 UniRef50_D1YX67 Putative uncharacterized protein n=1 Tax=Methano... 174 6e-42 UniRef50_B1I205 McrBC 5-methylcytosine restriction system compon... 171 4e-41 UniRef50_D1W5H2 Conserved domain protein n=3 Tax=Prevotella RepI... 170 6e-41 UniRef50_A1SC19 McrBC 5-methylcytosine restriction system compon... 168 3e-40 UniRef50_A6EGF0 5-methylcytosine-specific restriction enzyme Mcr... 166 1e-39 UniRef50_C1EZU3 Putative uncharacterized protein n=1 Tax=Bacillu... 165 2e-39 UniRef50_A7ZBW9 Putative uncharacterized protein n=1 Tax=Campylo... 163 7e-39 UniRef50_D2Q5W0 3-isopropylmalate dehydrogenase n=2 Tax=Bifidoba... 163 8e-39 UniRef50_C9LKR7 Putative uncharacterized protein n=1 Tax=Prevote... 163 8e-39 UniRef50_Q08S71 Putative uncharacterized protein n=1 Tax=Stigmat... 163 8e-39 UniRef50_B9CT57 Putative uncharacterized protein n=1 Tax=Staphyl... 163 1e-38 UniRef50_UPI00016C4D94 hypothetical protein GobsU_06730 n=1 Tax=... 161 3e-38 UniRef50_D1YRX5 Conserved domain protein n=1 Tax=Veillonella par... 161 3e-38 UniRef50_Q5JH86 Putative 5-methylcytosine restriction system, ca... 159 1e-37 UniRef50_D1PAJ0 Putative uncharacterized protein n=1 Tax=Prevote... 157 5e-37 UniRef50_UPI000197B531 hypothetical protein BACCOPRO_00002 n=1 T... 157 6e-37 UniRef50_C7M8S6 McrBC 5-methylcytosine restriction system compon... 157 7e-37 UniRef50_UPI0001B4EC67 McrBC 5-methylcytosine restriction system... 156 1e-36 UniRef50_A9FMX9 Putative uncharacterized protein n=1 Tax=Sorangi... 155 2e-36 UniRef50_Q7MVS1 Putative uncharacterized protein n=1 Tax=Porphyr... 155 2e-36 UniRef50_C1QFM5 McrBC 5-methylcytosine restriction system compon... 154 3e-36 UniRef50_C9BWA4 Guanosine 5'-monophosphate oxidoreductase n=4 Ta... 153 1e-35 UniRef50_B9KCB6 Putative uncharacterized protein n=1 Tax=Campylo... 152 2e-35 UniRef50_A7GF31 Putative uncharacterized protein n=1 Tax=Clostri... 152 2e-35 UniRef50_D1PC49 Putative uncharacterized protein n=1 Tax=Prevote... 152 3e-35 UniRef50_C1QC92 McrBC 5-methylcytosine restriction system compon... 151 4e-35 UniRef50_D0LPM4 5-methylcytosine restriction system component-li... 148 4e-34 UniRef50_C9PRC9 Putative uncharacterized protein n=1 Tax=Pasteur... 145 2e-33 UniRef50_B3JJV2 Putative uncharacterized protein n=1 Tax=Bactero... 144 6e-33 UniRef50_A9DR64 Putative uncharacterized protein n=1 Tax=Kordia ... 142 2e-32 UniRef50_Q139N0 Putative uncharacterized protein n=1 Tax=Rhodops... 142 2e-32 UniRef50_C8VZS1 Putative uncharacterized protein n=1 Tax=Desulfo... 141 4e-32 UniRef50_C2BVA3 Putative uncharacterized protein n=1 Tax=Mobilun... 141 5e-32 UniRef50_B8DPA2 McrBC 5-methylcytosine restriction system compon... 140 5e-32 UniRef50_UPI0001B550BC hypothetical protein StAA4_26514 n=1 Tax=... 140 7e-32 UniRef50_C2LL98 Putative uncharacterized protein n=2 Tax=Proteus... 140 1e-31 UniRef50_Q2GBH5 McrBC 5-methylcytosine restriction system compon... 139 1e-31 UniRef50_C9PUK8 Putative uncharacterized protein n=2 Tax=Bactero... 137 9e-31 UniRef50_C2WFQ0 Putative uncharacterized protein n=1 Tax=Bacillu... 134 4e-30 UniRef50_D2EQZ8 Putative uncharacterized protein n=1 Tax=Strepto... 134 6e-30 UniRef50_Q1LJY6 McrBC 5-methylcytosine restriction system compon... 132 2e-29 UniRef50_B0TG77 Putative uncharacterized protein n=1 Tax=Helioba... 127 6e-28 UniRef50_A8UWA1 Putative uncharacterized protein n=1 Tax=Hydroge... 124 5e-27 UniRef50_A5WH29 Putative uncharacterized protein n=1 Tax=Psychro... 122 2e-26 UniRef50_C8PX06 Putative uncharacterized protein n=1 Tax=Enhydro... 120 7e-26 UniRef50_Q4FV58 Putative uncharacterized protein n=2 Tax=Psychro... 115 2e-24 UniRef50_B9ZE69 Putative uncharacterized protein n=1 Tax=Natrial... 110 9e-23 UniRef50_D0Z8V4 5-methylcytosine restriction system component n=... 110 1e-22 UniRef50_B9D5W2 Putative uncharacterized protein n=1 Tax=Campylo... 105 3e-21 UniRef50_Q9ZMQ3 Putative n=6 Tax=Campylobacterales RepID=Q9ZMQ3_... 102 2e-20 UniRef50_Q6L339 Putative uncharacterized protein n=1 Tax=Picroph... 100 7e-20 UniRef50_C3JNW8 Putative uncharacterized protein n=1 Tax=Rhodoco... 100 1e-19 UniRef50_Q5UZU6 Putative uncharacterized protein n=1 Tax=Haloarc... 97 8e-19 UniRef50_D2MHZ4 Putative uncharacterized protein (Fragment) n=1 ... 89 2e-16 UniRef50_B2UGA5 McrBC 5-methylcytosine restriction system compon... 85 3e-15 UniRef50_D0YRU6 Putative ATPase family associated with various c... 72 3e-11 UniRef50_Q5UZU4 Putative uncharacterized protein n=1 Tax=Haloarc... 71 5e-11 UniRef50_B5CQ36 Putative uncharacterized protein n=1 Tax=Ruminoc... 68 6e-10 Sequences not found previously or not previously below threshold: UniRef50_B9LVU8 Putative uncharacterized protein n=1 Tax=Halorub... 103 1e-20 UniRef50_A6Y209 McrBC 5-methylcytosine restriction system compon... 77 1e-12 UniRef50_Q60CI5 Conserved domain protein n=1 Tax=Methylococcus c... 62 2e-08 UniRef50_A6T3N7 Uncharacterized conserved protein n=1 Tax=Janthi... 56 2e-06 UniRef50_B0MB39 Putative uncharacterized protein n=1 Tax=Anaeros... 56 2e-06 UniRef50_C0QR15 Putative uncharacterized protein n=1 Tax=Perseph... 55 4e-06 UniRef50_C4Z3V6 Putative uncharacterized protein n=1 Tax=Eubacte... 52 2e-05 UniRef50_A5G8Z2 Putative uncharacterized protein n=2 Tax=Proteob... 52 2e-05 UniRef50_D2RAE2 LlaJI restriction endonuclease n=1 Tax=Gardnerel... 52 3e-05 UniRef50_B5IWI4 Putative uncharacterized protein n=1 Tax=Thermoc... 52 5e-05 UniRef50_C7DCR7 Putative uncharacterized protein n=1 Tax=Thalass... 51 5e-05 UniRef50_Q30SQ1 Putative uncharacterized protein n=2 Tax=Sulfuri... 51 6e-05 UniRef50_A1RVV7 Putative uncharacterized protein n=1 Tax=Pyrobac... 50 9e-05 UniRef50_Q6QPY9 R2.LlaJI n=1 Tax=Lactococcus lactis RepID=Q6QPY9... 50 9e-05 UniRef50_Q5XBC0 Type II restriction-modification system restrict... 50 1e-04 UniRef50_Q03XF1 Putative uncharacterized protein n=1 Tax=Leucono... 50 1e-04 UniRef50_B7IJA5 Type II restriction-modification system restrict... 49 3e-04 UniRef50_O58602 Putative uncharacterized protein PH0872 n=1 Tax=... 47 0.001 UniRef50_B7HF08 Conserved domain protein n=29 Tax=Bacillus cereu... 47 0.001 UniRef50_O66886 Putative uncharacterized protein n=1 Tax=Aquifex... 47 0.001 UniRef50_Q6LVP0 Putative uncharacterized protein n=1 Tax=Photoba... 47 0.002 UniRef50_A6GXX4 Putative uncharacterized protein n=1 Tax=Flavoba... 46 0.002 UniRef50_D0C730 Restriction endonuclease n=1 Tax=Acinetobacter b... 45 0.003 UniRef50_B9KEV3 Putative uncharacterized protein n=1 Tax=Campylo... 45 0.005 UniRef50_UPI0001AF063B hypothetical protein SghaA1_36952 n=1 Tax... 43 0.012 UniRef50_A4YFI8 Putative uncharacterized protein n=1 Tax=Metallo... 43 0.017 UniRef50_C5UQY5 Putative restriction endonuclease n=1 Tax=Clostr... 43 0.018 UniRef50_Q81H81 Type II restriction-modification system restrict... 43 0.019 UniRef50_B9DJ21 Putative uncharacterized protein n=1 Tax=Staphyl... 43 0.023 UniRef50_C2EED9 LlaI.3 like protein n=1 Tax=Lactobacillus saliva... 42 0.024 UniRef50_C3WKP8 DNA helicase II n=1 Tax=Fusobacterium sp. 2_1_31... 42 0.027 UniRef50_A9EW27 Putative uncharacterized protein n=1 Tax=Sorangi... 42 0.027 UniRef50_O34303 Type-2 restriction enzyme BsuMI component ydjA n... 42 0.030 UniRef50_A1RQM2 Putative uncharacterized protein n=2 Tax=Thermop... 42 0.031 UniRef50_Q01P70 Putative uncharacterized protein n=1 Tax=Candida... 42 0.038 UniRef50_UPI0001B9ECEF Domain of unknown function DUF2357 n=1 Ta... 42 0.051 UniRef50_Q2FY99 Phi APSE P51-like protein n=34 Tax=root RepID=Q2... 40 0.090 UniRef50_C7QAD2 McrBC 5-methylcytosine restriction system compon... 40 0.091 >UniRef50_C1PCD8 McrBC 5-methylcytosine restriction system component-like protein n=6 Tax=Bacteria RepID=C1PCD8_BACCO Length = 355 Score = 310 bits (794), Expect = 5e-83, Method: Composition-based stats. Identities = 91/351 (25%), Positives = 178/351 (50%), Gaps = 8/351 (2%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 M+ I +RNIYYML+YA+ L+ + + ++ D+ +L KG+ + ++GL Sbjct: 1 MKDKGILIRNIYYMLSYAFRVLKRSNYDEIGSERFEHIQDLFAAILTKGIARQLKQGLYK 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y + + + ++G+++ TI K +D L+E+ + N+I+K+T IL++ Sbjct: 61 EYVSHCDDLSVLRGKLDIHGTIHHKLQRKQKLSCEYDELSENNVFNQILKTTSVILMQQP 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 +N R + + + + T ++ L KN + YK ++++C F+++ + Sbjct: 121 SVNVKRRTALKKVMLHFDSVDMIEPTRIKWNILRFQKNNQSYKMLLNICYFVLDGLLLST 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 +KG ++ +F +E+ MS L++KF+ E+ R S + W+ + + LP Sbjct: 181 DKGKFKMANFL-DEQHMSRLFEKFVLEYYRYHYPSLRAAAPQIAWNI---DTGATDFLPT 236 Query: 241 METDITIRSSEKILIVDAKYYKSIF--SRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 M+TDI ++S K+LI+D KYY R G+ FHS NLYQ+ Y+ + N N+ Sbjct: 237 MQTDIVLKSCSKVLIIDTKYYAHTMQVQSRYGSRTFHSNNLYQIFTYVKNQDVGNTGNVA 296 Query: 299 GLLIYPHVDTA--VKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 G+L+Y + + ++G + + T++L + I ++L +I Y Sbjct: 297 GMLLYARTEETIVPNADFMMSGNKMSVKTLDLNTAFGNIAEQLDNIATSYF 347 >UniRef50_UPI0001C37412 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Ruminococcus flavefaciens FD-1 RepID=UPI0001C37412 Length = 351 Score = 309 bits (791), Expect = 1e-82, Method: Composition-based stats. Identities = 91/349 (26%), Positives = 170/349 (48%), Gaps = 6/349 (1%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 I ++NIYYML+YA+ L++ + + D+ +L KGV + ++GL +Y Sbjct: 4 DKGIFIQNIYYMLSYAFQILKQEDYKQVAGEKFEKIHDLFAAILEKGVSRQVKQGLYREY 63 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 P E + ++G++ +T+R N K FD +ED N+I+K T+ LI+ E + Sbjct: 64 VPTQEDLSVMRGKLNMGETVRLKVQNKQKLGCEFDEFSEDNPYNQILKVTIHRLIRAEDV 123 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + R + I + ++ L ++ R Y+ ++++C ++N + Sbjct: 124 APERKQALRRVSVYFGNIRLIQPDHIAWNRLIYQRSNRNYELLLNICYLVLNGMLQTTED 183 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSI-SDQSLNLLPRM 241 G Y+ F +++ M LY+KF+ E+ ++ + + +KW+ + + LP+M Sbjct: 184 GSYKLLAF--SDEHMERLYEKFILEYYKQHHPELDPKSAQVKWNLTEEPDQPMIQFLPKM 241 Query: 242 ETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 +TDIT++ +K LI+DAKYY ++ E S +LYQ+ Y+ ++ N N+ GLL Sbjct: 242 QTDITLQKGDKTLIIDAKYYGKSMAQSYSKETLRSAHLYQIFAYVKNMDTANKGNVSGLL 301 Query: 302 IYPHVDTAV---KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +Y + V + I G IG T++L + + +L I E Sbjct: 302 LYAKTEDEVFPEGEPFVIGGNRIGARTLDLNVSFDTLRIQLDKIAKECF 350 >UniRef50_P15006 Protein mcrC n=9 Tax=Escherichia RepID=MCRC_ECOLI Length = 348 Score = 308 bits (790), Expect = 2e-82, Method: Composition-based stats. Identities = 348/348 (100%), Positives = 348/348 (100%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL Sbjct: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE Sbjct: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ Sbjct: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR Sbjct: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL Sbjct: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 Query: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK Sbjct: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 >UniRef50_A6BZ07 McrC protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6BZ07_9PLAN Length = 362 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 112/345 (32%), Positives = 184/345 (53%), Gaps = 5/345 (1%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 IP++NIYY+L YAW LQE + ++ ++ VL+ GV L +RGL+ +Y Sbjct: 2 TIPIQNIYYLLCYAWDKLQEGQIVSVSPEDCQTTAELFARVLDSGVTHLLKRGLDRNYIS 61 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 ++G+ + TI+ L + D + D NRIIK+TL L++ +L+ Sbjct: 62 EEIETSSLRGKFDITTTIKQNLLRKSRVHCVVDSFSYDVPHNRIIKATLRNLLRCRELDR 121 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 RD LYR+L +S + LTP+ F+ + +N +Y F++ VC+ I +N + + G Sbjct: 122 DQRDRLLRLYRRLHDVSDIKLTPKDFNNVQLHRNNAWYGFLLQVCRLIYDNLLINEETGD 181 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 +F DF R+E++M+ L++ F++ F R+E + L W + + LPRM TD Sbjct: 182 SQFRDFLRDERQMARLFENFVFNFYRKEQSVFKVKSELLTWQGVDATPEDQQFLPRMRTD 241 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN-----GENIGG 299 +++ SS + +++DAKYYK G HS+NLYQL YL + +N I G Sbjct: 242 VSLDSSTRKIVLDAKYYKDSLQSFHGNSSVHSENLYQLFAYLKNFYLKNIQNGDSRPIEG 301 Query: 300 LLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 +L+YP ++ Y+I+G I + T+NL W I Q+LL+I + Sbjct: 302 ILLYPTTGQSLSLNYEIHGHSIRIVTLNLNTSWKEIRQQLLNILE 346 >UniRef50_B2GJY9 Putative McrC protein n=3 Tax=Actinomycetales RepID=B2GJY9_KOCRD Length = 348 Score = 299 bits (765), Expect = 1e-79, Method: Composition-based stats. Identities = 93/343 (27%), Positives = 164/343 (47%), Gaps = 4/343 (1%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 M+ +RNIY ML YA+ ++ +++ ++ D+ +L +GV +RG+ Sbjct: 1 MKDRTATIRNIYVMLAYAFRAIRTPDASDVGTEEFTHIHDLFAEILAQGVSAQVKRGVHH 60 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DY E + ++GRI+ T+ + G FD DT N+ +KS + +LI+H Sbjct: 61 DYLRRDEQLTTVRGRIDVTATMVARAVTPGSVSCIFDTYEPDTPFNQALKSVMVLLIRHG 120 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++ +D R L L ++ + + + Y+ ++ VC+ +V +P + Sbjct: 121 EVGQRRKDALRRLLPYLDAVTLVSPRSIRWEKFTCHRRNAAYRILLGVCQLVVEGLLPTE 180 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 N G + ++ +E+ MS LY++FL E+ + T ++ WD ++ + LP Sbjct: 181 NSGDTQLAEWL-SEEAMSALYERFLREYYAFHHPELSPTARHVAWDYDPVTAVGADQLPA 239 Query: 241 METDITIRSSEKILIVDAKYYK-SIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 M TD+T+ S + LI+DAKYY + S G HS NLYQ+++Y+ + N + G Sbjct: 240 MRTDVTLTSGTRTLIIDAKYYSQPLTSGAYGKLTVHSANLYQMLSYIKNADVSNDGTVSG 299 Query: 300 LLIYPHVD--TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELL 340 LL+Y D I G +G T++L WP + EL Sbjct: 300 LLLYARTDAPAQPDVDVVIQGNRLGARTLDLAAPWPDLRHELE 342 >UniRef50_C8W851 McrBC 5-methylcytosine restriction system component-like protein n=21 Tax=cellular organisms RepID=C8W851_ATOPD Length = 351 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. Identities = 95/350 (27%), Positives = 180/350 (51%), Gaps = 13/350 (3%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 +I ++NIY+ML YA+ LQ ++ A N ++L +L +GV +RGL +Y Sbjct: 1 MIRIQNIYHMLAYAFQTLQGQGYRDIAAEEFGNTTELLAEILARGVSLQLKRGLGQEYID 60 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 E + +G+IE +++++ + + V ++D + DT NRI+K+T+A+L++ + ++ Sbjct: 61 REEALSSPRGKIELSESLKTRSILRRQLVCSYDEFSTDTRMNRILKATIALLVRSD-IDK 119 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + R L + + L + + ++ +N + Y+ +++VC +V + Q G Sbjct: 120 VRKKALRRLLPYFVDVGDVDLEHEDW-HMRFDRNNQAYRMLMNVCWLVVKGLLQTQEDGS 178 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 R D +E+ MS LY+KF+ E+ RRE + Y+ W ++ D ++LP M TD Sbjct: 179 IRMMDLL-DEQRMSHLYEKFILEYYRREHPKLSAGAPYIDW---ALDDGFDDMLPAMHTD 234 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE-----NGENIGG 299 I + +LI+DAKYY ++ HS NLYQ+ Y+ + + E ++ G Sbjct: 235 IMLEQGRTVLIIDAKYYSRTMQQQFDKRSVHSSNLYQIFTYVKNKEVELSSTLKAHSVSG 294 Query: 300 LLIYPHVDTAVKHR--YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +L+Y D ++ Y+++G I + T++L Q + I +L I + Sbjct: 295 MLLYAKTDEEIQPDGVYQMSGNQISVRTLDLNQPFEEIRSQLDGIAKAHF 344 >UniRef50_UPI000197B905 hypothetical protein BACCOPRO_00614 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B905 Length = 346 Score = 292 bits (748), Expect = 1e-77, Method: Composition-based stats. Identities = 92/346 (26%), Positives = 168/346 (48%), Gaps = 4/346 (1%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 ++ I +RN+YYML YA+ L++ + ++ D+ +L KGV +RGL Sbjct: 2 IKDRNIWIRNVYYMLAYAFEELKKNNYEQIAHEEFEHIQDLFAEILYKGVSAQLKRGLHR 61 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y E +P +KGR++ TI FD L+E+ L NR++K+TL++L Sbjct: 62 EYINRVEDLPLLKGRLDIRGTIANQMRCRNVLCCEFDDLSENNLFNRVLKTTLSLLCHER 121 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++S + E R+L G+ + + ++ +N + Y+ +++VC FI++ + Sbjct: 122 NVSSVRKAELRTLLPFFSGVDEIDVRNIRWNDFVYQRNNQMYRMLMNVCYFIIDGMLMTT 181 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G YR F +++ M L++KF+ E+ R + ++W+ S ++LLP Sbjct: 182 ETGKYRMATF--SDEHMCRLFEKFVLEYYRLHHRELSPNPDRIEWNIYSKDAMVIDLLPA 239 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 M++DI + ++ L++D KYY HS NLYQ+ Y+ +L ++ N+ GL Sbjct: 240 MQSDIVLHRGDQSLVIDTKYYSHAMQYHFDKPTIHSANLYQIFTYVKNLDVKDTGNVSGL 299 Query: 301 LIYPHVDTA--VKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 L+Y D I + T++L QE+ I +L + Sbjct: 300 LLYAKTDEDITPDLSASFGKNHIRVRTLDLNQEFSGIASQLEEFLK 345 >UniRef50_Q1QFB3 Putative uncharacterized protein n=1 Tax=Nitrobacter hamburgensis X14 RepID=Q1QFB3_NITHX Length = 361 Score = 272 bits (696), Expect = 1e-71, Method: Composition-based stats. Identities = 92/352 (26%), Positives = 159/352 (45%), Gaps = 10/352 (2%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 Q IP+RN++ +L YA G + Q L DILG +L + + R L Y Sbjct: 9 QTGIPIRNLWLLLVYASGLAEFESQCGAGTDDDIELADILGRLLVRLAKRRLRTNLSRGY 68 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 +IP ++GR+++ +T+ HL G+ F+ L+ DT NR+++ L I I Sbjct: 69 QRRQAVIPRVRGRVDWLQTLSRQHLQRGRLACRFEELSFDTPRNRLVRCAL-IAIAGRVR 127 Query: 123 NSTIRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + + R L L GI+ T Q S G + +F+IS + + +P + Sbjct: 128 DHAVAADCRRLGDDLGRLGIAASRPTQQEMSADTIGSHQSEDRFMISAARLVFEMLLPNE 187 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN----TTRSYLKWDASSISDQSLN 236 G + +R+E + +++K + F R L +A+ ++Y W + Sbjct: 188 TPGDMKLSRLKRDEITLRKIFEKAVTGFYRHHLRAADGWSVREQNYQSWQLEPGRSGDVG 247 Query: 237 LLPRMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENG- 294 LLP M+ DI + R ++ +++D K+ + +K S N+YQ+ YL S + Sbjct: 248 LLPGMKPDIILDRKQDRRIVIDTKFTSILAKGIADRDKLKSANIYQIYAYLHSQRGRGRL 307 Query: 295 -ENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 + G+L+YP +D V + I G DI TV+L + I L I + Sbjct: 308 CDRAEGVLLYPALDHDVDETFTIQGHDIRFVTVDLALKPSEILDRLHSIAQD 359 >UniRef50_A3UV38 Putative uncharacterized protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UV38_VIBSP Length = 441 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. Identities = 55/325 (16%), Positives = 117/325 (36%), Gaps = 14/325 (4%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + +++ ML G+ + L LL++ VL L ++GL+ DY Sbjct: 94 NPTLARKSLLVMLRALKGFSHIQTSSALIHEEKMPLLEVFIGQFINSVLNLVKKGLKSDY 153 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + KG++ A +R + K + D ANR++ S L I+ K + Sbjct: 154 VKTVDNLVYQKGKLVSAGQLRNNLVTKHKFYCEYQEYLVDRPANRLLHSALNIVAKLSRS 213 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + + + L+ + S L + +Y I+ I+N P K Sbjct: 214 PKH-KKQLQELFFIFEEVPLSRDYKSDLSRLRLDRGMSHYHTPIAWATLILNGFSPQTMK 272 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G + M +++ ++ + R+++ +S +K S+ + +++ Sbjct: 273 GSNQAISLLF---PMERVFEDYVAKVLRQQVPDDFVVKSQVKRK--SLVEHKQASWFKLQ 327 Query: 243 TDITIRSSEKIL-IVDAKYY--KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG 299 D+ + ++ ++D K+ + YQ+ Y ++ + Sbjct: 328 PDLLLEKGGSVVSVLDTKWKLVDQTKDNGSDKYGLSQSDFYQMFAYGQHYFDDSSDEREM 387 Query: 300 LLIYPHVDT-----AVKHRYKINGF 319 LIYP D + I+G Sbjct: 388 FLIYPAHDGFETAIEQSFDFNISGD 412 >UniRef50_B0AA68 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0AA68_9CLOT Length = 350 Score = 269 bits (689), Expect = 7e-71, Method: Composition-based stats. Identities = 91/351 (25%), Positives = 174/351 (49%), Gaps = 8/351 (2%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 ++ I ++NIYYML+Y + L + +++ +N+ DIL +L K V + +RGL Sbjct: 2 IKDKSIFIKNIYYMLSYVYTDLIQKDYKDIDVEEFDNVGDILAVILFKVVSKQVKRGLIK 61 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y + + G+I K+I+ N K FD L+ D N IIK+ + +L+ + Sbjct: 62 EYKSEEGELSVLTGKINIEKSIKLKANNKNKLYCEFDKLSMDNYLNSIIKTAMYVLVLSK 121 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 ++S + + L ++TL + ++ + ++ Y +I++C I+N+ + Sbjct: 122 DISSQNKKNLKKLVLLFSNVNTLKVNEIRWNDIKYNRHNSNYSGIINICYLILNDLLMTT 181 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G Y+ +F +EK+M +Y+KF+ + ++ S S +KW+ + ++ LP Sbjct: 182 EDGEYKVAEFL-SEKKMYSIYEKFVLFYYQKHYPSLRPKASKIKWN---LDNELDKFLPE 237 Query: 241 METDITIRSSEKILIVDAKYYKSIFS--RRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 M++DIT+ S E ILI+D KYY ++ HS NLYQ+ Y+ + + Sbjct: 238 MKSDITLTSGENILIIDTKYYSQSMQTIELYNSKTIHSNNLYQIFTYVKNKDINKNGKVS 297 Query: 299 GLLIYPHVDTA--VKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 G+L+Y + Y ++G I + T++L +++ I Q L I + + Sbjct: 298 GMLLYAKTNEDIIPNSEYIMSGNKIMVRTLDLNKDFKFIAQSLNKIASDLI 348 >UniRef50_B4V8L4 Putative uncharacterized protein n=1 Tax=Streptomyces sp. Mg1 RepID=B4V8L4_9ACTO Length = 404 Score = 267 bits (682), Expect = 6e-70, Method: Composition-based stats. Identities = 58/341 (17%), Positives = 127/341 (37%), Gaps = 21/341 (6%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +P+ ++++L Y+ + ++ +L L + + + + R+GL Y Sbjct: 73 PKVPIARLFFLLGYSLDPKGGWRGGQVDVGEHREVLPALAHAVERQTDRALRQGLLQGYR 132 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E ++GRI A IR +D D NRI+++ + L++ + Sbjct: 133 ATEESALVVRGRIREADQIRRRFGVVLPVEVAYDEYTTDIAENRILRAAVERLLRLPGVP 192 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 +R +L ++ L + Y+ + + + +++ S P G Sbjct: 193 REVRRRLLHQRARLADVTPLVPGQP-LPSWQPSRLNTRYQPALHLARAVLDGSSPEHVPG 251 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 R F + M+ L++ F+ R L ++ T + + ++ RM+ Sbjct: 252 GLRIDGFLFD---MNRLFEDFVTVALREALRGSDLTGALQD--PHHLDEEDAI---RMKP 303 Query: 244 DITIRSSEK--ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D + + + DAKY + + +LYQ++ Y +L G + Sbjct: 304 DFVLYGPDGAPRAVADAKYKA------EKRDGYPDADLYQMLAYCTALGLPKGHLVYAKG 357 Query: 302 IYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDI 342 PH V+H G + ++L +E + ++ + Sbjct: 358 NAPHAAHRVRHA----GIVLHQHALDLDREPATLLADIAAL 394 >UniRef50_C9ZD45 Putative uncharacterized protein n=3 Tax=Streptomyces RepID=C9ZD45_STRSW Length = 415 Score = 265 bits (677), Expect = 2e-69, Method: Composition-based stats. Identities = 57/347 (16%), Positives = 125/347 (36%), Gaps = 20/347 (5%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +P+ ++++L + + ++ +L+ L + + + V + R GL Y Sbjct: 81 PKVPIARLFFLLGFGLDPKGSWRDGEVDVAEHRDLVPALAHAVERQVDRALRPGLLQGYR 140 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E ++GR+ A+ IR D D NRI+++ + L++ + Sbjct: 141 ATEETALVVRGRLREAEQIRRRFGAALPVEVVHDEFTTDIAENRILRTAVERLLRLPGVP 200 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 +R +L ++ + +L + Y + + + +++ + G Sbjct: 201 RDVRRRLSHQRGRLAEVTAIVRGQSVPDWLP-TRLNTRYHHALRLARAVLDGVSAEHSPG 259 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 R F + M+ L++ F+ R + + + D + + RM Sbjct: 260 GLRIDGFLFD---MNKLFEDFVTVALREAFRTTGSGHTARLQDPHHLDE---AATIRMRP 313 Query: 244 DITIRSSEK---ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 D + + +VDAKY + +LYQ++ Y +L G + Sbjct: 314 DFVLYGPDGGAPCAVVDAKYKA------ERRGGYPDADLYQMLAYCTALGLREGHLVYAK 367 Query: 301 LIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 PH V+H G I ++L QE + ++ + + Sbjct: 368 GNAPHAAHEVRHA----GILIHQHALDLDQEPTGLLTDIEGMALRLV 410 >UniRef50_Q1IK47 McrC protein n=1 Tax=Candidatus Koribacter versatilis Ellin345 RepID=Q1IK47_ACIBL Length = 351 Score = 264 bits (674), Expect = 4e-69, Method: Composition-based stats. Identities = 105/349 (30%), Positives = 181/349 (51%), Gaps = 9/349 (2%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IPV N+YY+L YAW L+E ++ +L+++ VL G+ L ++G++ Y + Sbjct: 3 IPVANVYYLLCYAWDKLEERDLVDIHPTEETDLVNLFARVLTNGIDHLLKKGIDRGYLLH 62 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 +E ++GRI+F ++I+ + FD L+ D L NRI+KST+ LI+ L+S Sbjct: 63 SEESCVLRGRIDFPQSIKHMLFQRAQAHCEFDELSFDVLHNRILKSTIMRLIRTRDLDSG 122 Query: 126 IRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 IRD YR + L L+ Q F + +N +Y F++ VC + N +P Q G++ Sbjct: 123 IRDRLLFQYRYFAEVGDLDLSVQIFGKVQLYRNNHFYDFLLRVCALLFENLLPTQEPGNW 182 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELT------SANTTRSYLKWDASSISDQSLNLLP 239 RF F +N ++M+ ++++F+ F +REL R + W + D S LLP Sbjct: 183 RFRSFLQNREQMAYVFERFVRNFYKRELPSVRVDGRCKVKREDINWGMTPSDDLSSALLP 242 Query: 240 RMETDITIRSSEKILIVDAKYYKSIFSRRMGT-EKFHSQNLYQLMNYLWSL-KPENGENI 297 +M+TD+ I + K ++V+ KY +R K + +LYQ+ YL + + Sbjct: 243 KMQTDVCITTEAKRILVECKYVDDPLEQREEMAPKLITTHLYQVNAYLDNWPDLPLYRSS 302 Query: 298 GGLLIYPHVDTAVKHRY-KINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 +L+YP + + + +G + + T+NL Q+W IHQ+LL + D Sbjct: 303 RAILLYPLATRPIAVEFTRADGQLLSVRTLNLAQQWSAIHQDLLRLVDN 351 >UniRef50_B2JTN9 Putative uncharacterized protein n=1 Tax=Burkholderia phymatum STM815 RepID=B2JTN9_BURP8 Length = 363 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 86/349 (24%), Positives = 162/349 (46%), Gaps = 11/349 (3%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IP++N++++L+YA + + +E +++ ++LG +L V + +R L Y P Sbjct: 10 IPIKNLWFLLSYAHNLARFADRLPVEIGEQDDIPELLGRLLAFLVERRIKRNLTRAYQPR 69 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 + ++GRI+ KT+ L G+ + F+ L+ DT N++++ LA + + Sbjct: 70 EARLTRVRGRIDLVKTLSAGELQQGRIICRFEELDADTPRNQLVRYALAHIAS-TVRDQA 128 Query: 126 IRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 + L +L G+S + + G+N +I V ++ +P + G Sbjct: 129 LERRCGLLASELGRLGVSFRRPSRSEMAREQIGRNDADDAALIVVSNLALDPRLPSEESG 188 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLK---WDASSISDQSLNLLPR 240 R +R+E+ + +++K + F EL + K W +S + +LLP Sbjct: 189 DSRVARLQRDERLLPYIFEKAIAGFYMHELPNKEWRVRPQKVLAWPVASPTPGLHDLLPG 248 Query: 241 METDITIRS--SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWS---LKPENGE 295 M+ DI I S + + ++VD K+ + + GT++F S LYQ+ YL S ++ E Sbjct: 249 MQADIVIDSRITNRRVVVDTKFTDILTRNQFGTQRFKSNYLYQMFAYLRSQTGFGDKHAE 308 Query: 296 NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 GLL++P V V + + G + TV+LG E IH L + Sbjct: 309 EAEGLLLHPSVGLHVDESFFVQGHRMRFATVDLGGEIHSIHSALAALVS 357 >UniRef50_B9MJI1 Putative uncharacterized protein n=1 Tax=Diaphorobacter sp. TPSY RepID=B9MJI1_DIAST Length = 431 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 44/326 (13%), Positives = 109/326 (33%), Gaps = 14/326 (4%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + M+ + L + + + +L ++GL DY E +P Sbjct: 92 LRKMVASLLDLPAKEAGEAALEHFDVPLTEWVMRQFLLSLQRLVQQGLRQDYVRVEEELP 151 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G++ +R D+ D NR++K L + +H + A Sbjct: 152 YLRGQLHTTAQMRQLPGRAHHFHVRHDVFVPDRAENRLLKLALERV-RHATNQADNWRLA 210 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + L +L + Q + + + +Y+ + C+ ++ +P G + Sbjct: 211 QELSARLHEVPASTQPQQDWRAWSRTRLMSHYQPIYPWCQLVLGQGMPVALAGDQQGLSL 270 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR-- 248 M L++ ++ + + L + + + + ++ D+ +R Sbjct: 271 LF---PMEKLFESYVARWLLKHLPQHLCLTAQAASEY--LCWHDGRRMFQLRPDLLLRNH 325 Query: 249 SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH-VD 307 + ++++D K+ + R + YQ++ Y G+ LIYP Sbjct: 326 NGAAVMVLDTKWKRLEADNRANNYGLAQGDFYQMLAYGQRYLTGQGKLA---LIYPAWTG 382 Query: 308 TAVKHRYKINGF--DIGLCTVNLGQE 331 V G + + ++ + Sbjct: 383 FDVPLPMFEMGPGLRMEVLRFDVEND 408 >UniRef50_D0D6B3 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Citreicella sp. SE45 RepID=D0D6B3_9RHOB Length = 359 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 74/350 (21%), Positives = 144/350 (41%), Gaps = 11/350 (3%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 IP+RNI+ + YA +Q + + +L D++G ++ V RR L Y Sbjct: 5 IPLRNIWILFLYAADLVQLRGRFERDVERARDLPDLVGRLMVNVVEDRLRRNLSRGYRAQ 64 Query: 66 TEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST 125 T I+P ++GRI+ T G + G+ F+ DT NR++++ L L Sbjct: 65 TAILPRVQGRIDMLATEAGQLMERGQIACRFEEHVMDTPRNRLVRAALERLAARVFTPE- 123 Query: 126 IRDEARSLYRKL--PGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 RSL G+S + + G+N + ++++ + + +IP + G Sbjct: 124 TAYRCRSLAADFSRAGVSARRPSRTELAIDQMGRNEGADRMMVALAGMVFDGTIPTEKHG 183 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY---LKWDASSISDQSLNLLPR 240 E E + L+++ + R L T + + W ++ ++LP Sbjct: 184 TALQPGDETTEHLIRRLFERAVGNALRIALEPEGWTIAQGHRIAWPVGGKTEGLPSILPG 243 Query: 241 METDITIRS--SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLW---SLKPENGE 295 M+TDI + + + +++D K+ + + + S LYQ+ YL S++ + Sbjct: 244 MQTDIELSHIKTSRRVVIDTKFTRILTASNYRGGILRSGYLYQMYAYLRTQESMEHPSSL 303 Query: 296 NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 G+L++P V +V + G I T++L ++L Sbjct: 304 TSEGILLHPQVGGSVDETMILQGHPISFRTIDLTASSTEFVEQLHRTATN 353 >UniRef50_UPI0001909ACF hypothetical protein RetlI_30663 n=1 Tax=Rhizobium etli IE4771 RepID=UPI0001909ACF Length = 482 Score = 257 bits (657), Expect = 4e-67, Method: Composition-based stats. Identities = 54/322 (16%), Positives = 107/322 (33%), Gaps = 13/322 (4%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 ++ M++ A I + + L L+L+RRG Y P Sbjct: 160 LFKMVSEALNLTPRIGGGATIEAFDLPMTEWLAASFLAKALELARRGPRQAYRLVEAREP 219 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++GR++F + +R T D+ D NR+I+S + + + + R A Sbjct: 220 FLRGRLDFTRQLRAAAGGAHMFHITHDVYLLDRPENRLIRSAIEHIARRSMTSDNWR-LA 278 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 R L + + Y + +C+ ++ + +P G Y Sbjct: 279 RELSILFADVPESRNVHADLKRWGRDRQLADYADIRPLCELLLTHRLPFALAGDYHGMSM 338 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M L+++++ R + +M+ DI + S Sbjct: 339 LF---PMERLFERYVLGSLRMIAPEHFEIHPQH--GTMHLCSHEGEDWFQMKPDILVESG 393 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAV 310 + I+DAK+ + R + + YQL Y G L+YP V Sbjct: 394 AQRWIIDAKWKRLST-DRDKNYELSQVDFYQLFAYGQKY---LGGTGEMYLVYPAVSDFP 449 Query: 311 --KHRYKINGF-DIGLCTVNLG 329 + +K++ + + +L Sbjct: 450 TMRAPFKLSDNLLLHVLPFDLE 471 >UniRef50_D1PB97 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PB97_9BACT Length = 364 Score = 256 bits (655), Expect = 7e-67, Method: Composition-based stats. Identities = 104/357 (29%), Positives = 189/357 (52%), Gaps = 14/357 (3%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 Q IP+ N+YY+L YAWG ++ + ++ ++L ++L +L +L R+GL Y Sbjct: 2 QQKIPIENLYYLLCYAWGVSDQLDKVKVDGEKCHSLENLLSTILLNACDRLLRQGLLRAY 61 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + G++G++ A+T++ +G+T+ D L +D + NR+I STL L++ E + Sbjct: 62 RFEEQEVEGVRGKLNLAETLKSGKHLNGRTICQVDELTQDVVINRVIFSTLKRLMRIEGI 121 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + IR R K P I + +T L + + +YK V+++C+ I ++++P ++K Sbjct: 122 DENIRARLRKTLAKFPHIEEIRVTEGLLGRLRQHRLSGFYKLVLNICRLIWDSTLPCKDK 181 Query: 183 -GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN--TTRSYLKWDASSIS---DQSLN 236 G F DF ++ M+ ++++FL FC++ R Y+ + S ++ Sbjct: 182 DGRLEFLDFTEDDFRMNCIFERFLMNFCKQNCRDEYPEVHREYIDFQLSPFGMMFKEAGE 241 Query: 237 LLPRMETDITIRSSE--KILIVDAK-YYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 LP METD+T+ + + LI+DAK Y +++ S+ G EK +L Q+++Y+ + + + Sbjct: 242 ALPVMETDVTLFNPNTQEKLILDAKFYREALVSKFGGREKVRRDHLSQILSYVMNQEDRS 301 Query: 294 GE---NIGGLLIYPHVDTAVK--HRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 N G L+YP VD +RYK G I + TVNLGQ W I + + +I Sbjct: 302 KPHTMNAYGALVYPTVDEDFDFSYRYKETGHRIIVRTVNLGQPWRKIEERVKEIVKR 358 >UniRef50_A1UB52 Putative uncharacterized protein n=2 Tax=Mycobacterium RepID=A1UB52_MYCSK Length = 361 Score = 256 bits (655), Expect = 7e-67, Method: Composition-based stats. Identities = 87/349 (24%), Positives = 152/349 (43%), Gaps = 10/349 (2%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNN--LLDILGYVLNKGVLQLSRRGLELDYN 63 IPVRN++ ++ YA Q + N LLD++ VL V + +R L Y Sbjct: 14 IPVRNLWLLMLYASRLYQRNHLLRNMDVEQNPERLLDLVAQVLVYAVERRLQRNLGRQYR 73 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 + ++G+I+ T L G+ FD L+ D L NRII+S L +L + + Sbjct: 74 ERRATLARVRGQIDVLTTESKALLAQGRIACRFDELSVDNLRNRIIRSAL-VLAARDARD 132 Query: 124 STIRDEARSLYRKLP--GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 T++ AR++ G+S ++ + L +N I + I+ +IP ++ Sbjct: 133 RTLQRTARNMADVFTQYGVSPQLVSVRESRQLVLDRNAHDDVEAIGAAQLILEMAIPAES 192 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA---NTTRSYLKWDASSISDQSLNLL 238 G+ D ER+ E+ LY+ + F R L S+ + ++ W ++ ++L Sbjct: 193 AGNSTNRDPERDAAEIRRLYEAAVRGFYRSALPSSWSVSPGETHYHWPLVEATEGLKSIL 252 Query: 239 PRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK--PENGEN 296 P M+ D + + + +IV+ K+ ++ + G K +++QL Y+ S E Sbjct: 253 PIMKADTVLETVGRRIIVETKFADALKPNQYGLPKLARNHVFQLYAYVQSQHGSDELSAT 312 Query: 297 IGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 G+L+YP V V I G TV+LG I LL + Sbjct: 313 AEGVLLYPVVGEHVDESASIQGHRYRFLTVDLGGPAESIRSSLLRVTSN 361 >UniRef50_C2G7A3 5-methylcytosine-specific restriction enzyme subunit McrC n=16 Tax=Staphylococcus RepID=C2G7A3_STAAU Length = 346 Score = 256 bits (654), Expect = 8e-67, Method: Composition-based stats. Identities = 96/348 (27%), Positives = 180/348 (51%), Gaps = 10/348 (2%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNP 64 +I ++NIYYML+YA+ L + L N+ D+ +L KG+ GL +Y Sbjct: 1 MINIKNIYYMLSYAFTVLNKKGYQKLATEQFENIFDLYSAILIKGISSQLNSGLHHEYIE 60 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 T+ + I+G+++ +I+G + + +D + +T N+I+K+T+ LIK ++ Sbjct: 61 QTDSLKVIRGKVDVKNSIQGLGVLSQRINCIYDEFSLNTYMNKILKTTMKCLIK-TDISR 119 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + + R L + TL + Y + +N + YK +IS+C I I ++KG Sbjct: 120 KNKIKLRKLLVHFNNVDTLDYRNIQW-YHSFDRNNQTYKMLISICYLIFQGVIQTESKGQ 178 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 F +E+++S LY+KF+ E+ ++E T S ++W +D ++N+LP M +D Sbjct: 179 NDLMVF-VDEQQISRLYEKFILEYYKKEFPELVVTSSNIQWSLD--NDDNVNMLPVMRSD 235 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGEN---IGGLL 301 I +R +K LI+DAK+YK+ T+K HS NLYQ+ Y+ + + + + G+L Sbjct: 236 IMLRYKDKCLIIDAKFYKNTLHNYYDTKKIHSTNLYQIFTYVKNQQLNLKKKAIQVSGML 295 Query: 302 IYPHVDTAV--KHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 +Y D + ++ ++G I + T++L + I ++L I ++ Sbjct: 296 LYAKTDENIVLNDKFHMSGSQIIIKTLDLNCNFTIIKKQLNGIVNDIF 343 >UniRef50_C1QDS7 McrBC 5-methylcytosine restriction system component n=2 Tax=Brachyspira RepID=C1QDS7_9SPIR Length = 350 Score = 253 bits (647), Expect = 6e-66, Method: Composition-based stats. Identities = 106/353 (30%), Positives = 185/353 (52%), Gaps = 18/353 (5%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQAN-----LEAIPGNNLLDILGYVLNKGVLQLSRRGLE 59 IP++NIYYML+YAW I + N +N+ +++GY+LN + +L +RG Sbjct: 6 KIPIKNIYYMLSYAWNIWNIINEDNDKKEIFGDEKFDNIYNVMGYILNIFLEKLIKRGFY 65 Query: 60 LDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKH 119 Y E + +KG+I F+++++ H K V ++++L+ D L N+IIK TL LI + Sbjct: 66 RGYITLEEDLSVLKGKINFSESVKRN--THKKLVCSYNILSNDILFNQIIKYTLNKLINY 123 Query: 120 EKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG 179 + +++ I+++ L I +++ + F L KN +YK +I++CKF+ N + Sbjct: 124 KNIDNDIKEKLIKLNHYFIKIKNINVNNRTFKLLKYNKNNMHYKIIINICKFVHKNLLVN 183 Query: 180 QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRREL---TSANTTRSYLKWDASSISDQSLN 236 +N Y F DF EK M +LY+KF+ F + + +KW+ Sbjct: 184 KNSSEYSFIDFN-EEKRMHMLYEKFVLNFYKIYFFHNKNIKVKNKTIKWNI-----NDNE 237 Query: 237 LLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK--PENG 294 +P M+TDI I + EK LI+D K+YK+I + S +LYQ+ +Y+ ++ + Sbjct: 238 YIPIMKTDIMIYNKEKCLIIDTKFYKNILIKNNDKVSLRSSHLYQIFSYMSNINNSYKRF 297 Query: 295 ENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 + I G+L+YP + + YKIN + T++L ++ I EL++I Y Sbjct: 298 KTIKGILLYPLCNDNLNKEYKINDKYFAVNTIDLNSDFNIIKSELINIIKNYF 350 >UniRef50_B2FKW7 Putative uncharacterized protein n=3 Tax=Xanthomonadaceae RepID=B2FKW7_STRMK Length = 448 Score = 252 bits (644), Expect = 1e-65, Method: Composition-based stats. Identities = 48/328 (14%), Positives = 109/328 (33%), Gaps = 18/328 (5%) Query: 9 RNIYY-MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTE 67 R + ML A A+ L + + + +L +RGL DY E Sbjct: 94 RRLLRRMLCTALDISPREGSPTDVALFDIPLNEWVMGRFIGALDELLKRGLRFDYTRVRE 153 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 ++GR++ ++ +R D+ +ED NR+++ L + + N+ Sbjct: 154 EQLFLRGRLDMSRQLRQSPTRAHVFNIEHDVFSEDRPENRLLRVALDRVCARTR-NAGTW 212 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 A+ L ++ + + +Y+ V +C+ I++ P G +R Sbjct: 213 RLAQELSTRMASVPRSTQVANDLRAWGSDRLMAHYREVRPLCELILSGQSPLALAGDWRS 272 Query: 188 YDFERNEKEMSLLYQKFLYEFCRREL-TSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 M L+++++ R+ + ++ + + D Sbjct: 273 PSMMF---PMERLFERYVGACLARQFAPEWQVGGAASEY----LCSHGEANWFNLIPDFL 325 Query: 247 IRSSEKILIVDAKYY--KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 +R +++ ++D K+ + S + YQ+ Y G L+YP Sbjct: 326 LRRGDELRVLDTKWKVLDATASDAREKYGLKQSDFYQMFAYGQRY---LGGVGQMALVYP 382 Query: 305 HVDTAVKHRYKIN---GFDIGLCTVNLG 329 + + + + +L Sbjct: 383 QHTGFAQPLPAFDFSAELQLWVLPFDLE 410 >UniRef50_Q39M33 McrBC 5-methylcytosine restriction system component-like protein n=6 Tax=Proteobacteria RepID=Q39M33_BURS3 Length = 445 Score = 251 bits (641), Expect = 3e-65, Method: Composition-based stats. Identities = 54/323 (16%), Positives = 107/323 (33%), Gaps = 12/323 (3%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML A + + + + + + + +RG+ DY E Sbjct: 99 LRRMLEVAMEITPREGDEATLQCFDHPITEWMMRRFLQALEHVIKRGMRRDYLRIEEEQR 158 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G++ AK +R + +D+ + D NR++KS L + K + R A Sbjct: 159 YLRGQLNIAKQMRASPAHADLLNIRYDVFSPDRAENRLLKSGLIRVSKSTRDADNWRV-A 217 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 L L + F + +Y C+ ++ + +P G + Sbjct: 218 NELLHILHEVPASRNASADFRAWRTDRLMAHYVQARPWCQIVLGDQVPLALTGETQGISL 277 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M L+++++ R L + + + +++ D I Sbjct: 278 LF---PMERLFERYVGHALRILLPVHYRLTEQGSR--HWLCNHDGTGIFKLKPDYLIEGP 332 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV---D 307 + I+DAK+ + R + YQL Y GE LIYP + Sbjct: 333 DAPRILDAKWKLIDGADRTNNYGLKQADFYQLYAYGQKYLGGVGELN---LIYPRTRQFE 389 Query: 308 TAVKHRYKINGFDIGLCTVNLGQ 330 A+K Y + + +L Sbjct: 390 AALKPFYFTPELRLQVLPFDLDT 412 >UniRef50_D1SH86 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Micromonospora aurantiaca ATCC 27029 RepID=D1SH86_9ACTO Length = 419 Score = 251 bits (640), Expect = 4e-65, Method: Composition-based stats. Identities = 56/344 (16%), Positives = 122/344 (35%), Gaps = 22/344 (6%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P I + + ++L YA + + + P ++L+ + Q GL Y Sbjct: 66 RPKIGIARLLWLLGYARDP-RGWRTEPVGLTPEHDLVPAMAVAFATATYQALAPGLLQGY 124 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 E +P ++GR+ A +R +D + D N+I+ S + L + + Sbjct: 125 RTVEEALPLVRGRLREADQLRTRPGLALPVEVRYDDYDTDIPENQILLSAVRRLHRLPGV 184 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + + L ++ L + + Y+ + + + I+ + Sbjct: 185 PPATCNALHRIAAALADVTPLTAGAP-IPEASSNRFNNRYQPALRLARLILAGESIEHSH 243 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G F + + +++ +L R + + + D LP + Sbjct: 244 GSTLAAGFVFDL---NTVFEDWLTTALRHAVETRYGGTVTGQHQMHLDRD---RRLPLVR 297 Query: 243 TDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 DIT + L ++DAKY ++YQL+ Y +L G L Sbjct: 298 PDITWWHGRQCLAVIDAKYKAPGN-------TPPRDDIYQLLAYCTTLNLP-----RGHL 345 Query: 302 IYPHVD-TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 +Y +V ++ +G I + V+L +H+++ + + Sbjct: 346 VYASAGAESVTYQLTGSGVHIVVHRVDLAAPVTHLHEQIEKVAE 389 >UniRef50_A5WF57 McrBC 5-methylcytosine restriction system component-like protein n=4 Tax=Proteobacteria RepID=A5WF57_PSYWF Length = 363 Score = 250 bits (638), Expect = 6e-65, Method: Composition-based stats. Identities = 81/351 (23%), Positives = 146/351 (41%), Gaps = 15/351 (4%) Query: 6 IPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDYNP 64 IP+RN++ ++ YA +E+ + + +++ D++ +L + V +R L Y Sbjct: 12 IPIRNLWLLMLYASDIYRELNKDRVAVEENPDDIPDLIAEMLCQRVEHRIQRNLSYGYQS 71 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK---HEK 121 ++ ++GRI+ T R L+ GK FD L DT NR ++ L + K ++ Sbjct: 72 REAVVSRVRGRIDLLNTERNRLLDRGKVACRFDELTIDTARNRYVRGALERIAKIVQRKE 131 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 L R L R G+S + + S ++ K +++V N ++P + Sbjct: 132 LAHRCRSLDIRLRRM--GVSQVLPSRAELSVDRLSRHDAEDKPMLTVAHLAFNLALPTEV 189 Query: 182 KGHYRFYDFER-NEKEMSLLYQKFLYEFCRRELTSANTT---RSYLKWDASSISDQSLNL 237 G ER + + L++K + F L++ + W + S + Sbjct: 190 TGSKYLSRPEREDLPWLRKLFEKGVAGFYEITLSNHGYKVTAGKRINWPVTDSSQGIDKI 249 Query: 238 LPRMETDITIRSSE--KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE--- 292 LP M+TDI I + + + I+D K+ + E S +YQ+ YL S + Sbjct: 250 LPSMKTDIIIDNLDLGQRTIIDTKFNAVLTRGWYRHETLRSSYIYQMYAYLRSQEDSGDF 309 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 N GL I+P V + I I TV+L I ++LL + Sbjct: 310 LDRNACGLFIHPSVGEDINEYMVIQDHKIQFATVDLAASTKEIRRQLLGLI 360 >UniRef50_Q0KFR0 5-Methylcytosine-specific restriction enzyme C n=3 Tax=Proteobacteria RepID=Q0KFR0_RALEH Length = 450 Score = 248 bits (633), Expect = 2e-64, Method: Composition-based stats. Identities = 52/324 (16%), Positives = 117/324 (36%), Gaps = 18/324 (5%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML G+ + + LL++ + V + +RGL DY+ + + Sbjct: 103 LIEMLCCLQGFRHVKTDSANISAARMPLLEVFVAEFLRAVDHIVKRGLRSDYSSRQDNLY 162 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G++ A ++ + + D D NR++ + L ++ E S + A Sbjct: 163 ALRGKLLIAPHLQQNLYRADRFFTDHDEFTIDRPENRLLHAALRRVL--ELSASQLNQLA 220 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 R L + F + + YY ++ + I++ P G + Sbjct: 221 RELAFVFAEVPVSAQPQIDFQRVRLDRGMGYYADALAWARLILDEESPLTGAGAHCAPSM 280 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M +++ F+ + ++L +S + + R++ D+ IR++ Sbjct: 281 LF---PMEAVFEAFVAKHLAKQLARPLILKSQARS--HHLVRHREQNWFRLKPDLLIRNA 335 Query: 251 EKI-LIVDAKYYKSIFSRRM--GTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 ++ L++D K+ + + YQL Y S G+ + LIYP Sbjct: 336 DRDLLVLDTKWKLLDGMKANGTDKYGLSQSDFYQLQAYGQSYLSGRGDVV---LIYPKTA 392 Query: 308 TAVKH----RY-KINGFDIGLCTV 326 + + + K+ G + + Sbjct: 393 SFERPVPVFEFPKVEGLRLWVLPF 416 >UniRef50_C4T585 McrBC 5-methylcytosine restriction system component n=1 Tax=Yersinia intermedia ATCC 29909 RepID=C4T585_YERIN Length = 367 Score = 247 bits (632), Expect = 3e-64, Method: Composition-based stats. Identities = 72/352 (20%), Positives = 142/352 (40%), Gaps = 15/352 (4%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEAIPGN-NLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 +RN++ ++ YA +++ + ++ + D++ +L + R L + Y Sbjct: 1 MRNLWLLMLYASDLFRQLGRRHIAVEDNPAEIPDLVATILLHEIALRRHRNLSMGYQTRH 60 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL---IKHEKLN 123 + ++GRI+ T L G+ F + DT NR ++ L L I L Sbjct: 61 AALNRVRGRIDVLYTTSHQLLERGRVACHFQDMTLDTPRNRYVRCALERLTPIIAKPSLA 120 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYL-NGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + A SL R GI+ + + + G++ K ++ + +P +++ Sbjct: 121 ADCHAMALSLRR--EGINGGYPDNRELPSVRRFGRHDAADKPMVDAAQLAFELLMPTEDQ 178 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQSLNLLP 239 G + N M L++K + F R L LKW S S S + P Sbjct: 179 GQHLLPAPSDNLYWMRKLFEKGIAGFYRVHLAKTTWRISAGKELKWALSEQSAGSAEIFP 238 Query: 240 RMETDITIRSS--EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGE-- 295 M++DI + ++ +I+D K+ + + + +YQL YL + + + Sbjct: 239 TMKSDIILEHKMAQQRIIIDTKFNAILTKGWHREKSLRNSYIYQLYTYLRTQESQADPLS 298 Query: 296 -NIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 N GLL++P V + G I TV++ + I ++LL++ + Sbjct: 299 LNAAGLLLHPAVGYMLNEYVVTQGHKIHFATVDMAVDAKTIKRQLLELAHDC 350 >UniRef50_D0VFC8 McrBC 5-methylcytosine restriction system component (Fragment) n=1 Tax=Streptococcus suis RepID=D0VFC8_STRSU Length = 346 Score = 246 bits (629), Expect = 7e-64, Method: Composition-based stats. Identities = 85/338 (25%), Positives = 169/338 (50%), Gaps = 12/338 (3%) Query: 15 LTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKG 74 L YA + + + + N D+L ++ + +RGL Y TE + +KG Sbjct: 11 LLYAIYASKSHSRISTMLLRFKNTADLLAEIMIISLSIQVKRGLGRGYRSQTESLSALKG 70 Query: 75 RIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLY 134 +I ++++ + + V +D + D+ NRIIK+++ IL+K ++ + + R L Sbjct: 71 KINISESLTPPNWRRKQLVCQYDDFSLDSTMNRIIKASIEILLK-ADISRDRKKKLRKLL 129 Query: 135 RKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNE 194 +S ++L +++ L +N + Y ++S+C +VN I + +G+ + +F +E Sbjct: 130 VFFGEVSKINLHSINWN-LQYNRNNQSY-LLMSICYLVVNGLIHTEREGNKKLMNFL-DE 186 Query: 195 KEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKIL 254 + SLLY+KF+ + ++ T S + W ++ D +LP M++DI ++ + IL Sbjct: 187 RRESLLYEKFILGYYKKHYPQIQVTASQIPW---ALDDGFGEMLPIMQSDIYLKYKDTIL 243 Query: 255 IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP---ENGENIGGLLIYPHVDTAVK 311 I+DAKYY S R HS NLYQ+ Y+ + + + + G+L+Y D ++ Sbjct: 244 IIDAKYYSSNTQIRFDKRTLHSNNLYQIFTYVKNQAYRLSDTNDTVAGMLLYAKTDIDIQ 303 Query: 312 HR--YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 Y+++G I + ++L ++ I ++L DI + Sbjct: 304 PNQVYQMHGNQISVKNLDLNLQFASIAEQLDDIITSHF 341 >UniRef50_C7NBA8 McrBC 5-methylcytosine restriction system component n=4 Tax=Bacteria RepID=C7NBA8_LEPBD Length = 432 Score = 245 bits (626), Expect = 2e-63, Method: Composition-based stats. Identities = 59/337 (17%), Positives = 126/337 (37%), Gaps = 20/337 (5%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML + ++ + + NL +I + + + +L + G++ DY + + K Sbjct: 111 MLRSMRDFPSKVFNNSNIQVERMNLYEIFINMYLQEIRRLIKIGIKSDYIFKEDNLNYYK 170 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 G++ ++ + ++ + +D N + + N++IK+TL L K + E R L Sbjct: 171 GKLLTSQHFKINLVHKERFYVAYDEFNPNRVENKLIKATLLKLQKLTTSAENSK-EIRQL 229 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 I FS + ++ R Y+ ++ K + N G+ + Sbjct: 230 LVFFEIIDASMNYTADFSKVRINRSNRDYEMIMQWSKVFLLNKSFTTFSGNNNSRALLFS 289 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKI 253 M +Y+ ++ + ++ L S + L + DI + E+ Sbjct: 290 ---MEKVYESYVAKHLKKILGEDGWNVSSQDRGYYLFTKP--RLQFALIPDIVCKRGERT 344 Query: 254 LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHR 313 +I+D K+ K + + R+ ++YQ+ Y K L+YP D +H Sbjct: 345 IIMDTKWKKLVNNERI-NYGISQSDMYQMYAYSKKYKAS-----EIWLLYPLNDEMKEHS 398 Query: 314 YKI----NGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 +G + + V+L I L + D+ Sbjct: 399 EISFNSGDGTTVNIYFVDL----ENIESSLEVLRDKI 431 >UniRef50_A6ALR1 5-Methylcytosine-specific restriction enzyme C n=4 Tax=Vibrionaceae RepID=A6ALR1_VIBHA Length = 434 Score = 242 bits (618), Expect = 1e-62, Method: Composition-based stats. Identities = 53/287 (18%), Positives = 102/287 (35%), Gaps = 12/287 (4%) Query: 27 QANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFH 86 A LL++ + V +L +RGL+ DY + + KG++ + +R Sbjct: 118 DQASIASKKMPLLEVFIEQFLQSVNRLVKRGLKSDYVTQVDNLNYQKGKLLVGQQLRRNL 177 Query: 87 LNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLT 146 +N K +D + ANR+I + L L+ + + S R R L + Sbjct: 178 INQHKFYVEYDEYLINRPANRLITTALTKLVSYTRSPSNQR-LLRELQFAFVDVPVSKSV 236 Query: 147 PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLY 206 Q S L ++ Y I+ K I+ P KG M +++ ++ Sbjct: 237 KQDLSALKLDRSMLDYHVPIAWAKLILEGFSPLSMKGESSALSLMF---PMEAVFESYVA 293 Query: 207 EFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIF 265 R L + + A + + ++ D+ + +K +++D K+ F Sbjct: 294 SVLRSSLPENVELTTQAR--AKHLVKHNGKAQFQLMPDLLMTLPDKSQVVLDTKWKLLDF 351 Query: 266 SRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKH 312 ++YQ+ Y + LIYP ++ K Sbjct: 352 E--AHNYGISQSDMYQMFAYGHKY---LKGSGELYLIYPAHESFTKP 393 >UniRef50_D0Z341 McrBC 5-methylcytosine restriction system component n=3 Tax=Gammaproteobacteria RepID=D0Z341_LISDA Length = 425 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 57/348 (16%), Positives = 127/348 (36%), Gaps = 20/348 (5%) Query: 2 EQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 + ++ + ML+ + + + L ++L K V + ++G+ Sbjct: 85 QDTQATMKVLLKMLSTVYKLNMHRFEHSSLQTLNRPLFEVLISYFLKEVSNIIQQGIRSR 144 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y + P +KG+++ +K I ++D + D NR+I+S L +IK K Sbjct: 145 YTRVQDCKPYLKGQLQTSKQINQRPGCLNSFHISYDEFSPDRAENRLIRSALNQVIKWSK 204 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + +R L L I F + ++ +Y+ V CK I+N P Sbjct: 205 NSDNLR-LGSELQCALDDIPCSKNYALDFRQWSKDRSLVHYRSVKPWCKLILNYQSPVSL 263 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYL---KWDASSISDQSLNLL 238 G ++ M L+++++ + L+ A T R+ + + Sbjct: 264 SGRHKGISMLF---PMESLFEQYVAIRLGKSLSHALTLRTQVSNCALVTHTPRSGKSQEW 320 Query: 239 PRMETDITIRSS---EKILIVDAKYYKSIFSR--RMGTEKFHSQNLYQLMNYLWSLKPEN 293 R++ DI + + + + D K+ + + ++YQ+ Y + Sbjct: 321 FRLKPDIVVWDKTLHKPLCVADTKWKRINEKQATAKHKYGISQSDMYQMFAYGQN---CL 377 Query: 294 GENIGGLLIYPHVDTAVK--HRYKINGFDIGLCTV--NLGQEWPCIHQ 337 G + LIYP + + +K + + + + +L + + Sbjct: 378 GGSGVVYLIYPAYEDFNESLPPFKFD-NRLSVKAIPYDLTNDECELLS 424 >UniRef50_Q9RZI4 Putative uncharacterized protein n=1 Tax=Deinococcus radiodurans RepID=Q9RZI4_DEIRA Length = 442 Score = 241 bits (614), Expect = 4e-62, Method: Composition-based stats. Identities = 58/325 (17%), Positives = 122/325 (37%), Gaps = 14/325 (4%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML A + L +++ +GV RRG+ Y P E P Sbjct: 114 LLRMLA-ATDERFRVAPPAELQTAHMPLYEVVIRYALEGVRAAVRRGIPHAYVPVQEERP 172 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 G++GR++ + +R T+D D R+ + ++ L +L + R A Sbjct: 173 GLRGRLDLPRQVRQPPHRAHLLHVTYDEFLPDRPETRLTRLSVERLAALTRLPANQR-LA 231 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 R L L + F+ G+ ++ + ++C+ ++ P + + Sbjct: 232 RELLHALDEVPPSRNVNVDFAAWRLGRGHTHFAPLEALCRMVLYELNPIVAGKKTQAHAL 291 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 + M+++Y+ ++ + RR + + ++ D R+ D+ +R+ Sbjct: 292 LFD---MNVVYEAYVAQLLRRLYPTWTVATQVTQR---ALGDADGLPAFRLRPDLLLRTE 345 Query: 251 EKILIV-DAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGG-LLIYPHVDT 308 E +IV D K+ + + T + + YQ++ Y + + LIYP + Sbjct: 346 EGQVIVADTKWKRLEADK-APTYDVANADAYQMLAYSEAFQHSAAYTHKALWLIYPRLPG 404 Query: 309 AVKHRYKI---NGFDIGLCTVNLGQ 330 I G + + T++L Q Sbjct: 405 LPPVSAPIRLGQGRTLSIVTIDLNQ 429 >UniRef50_UPI0001972FC4 McrBC 5-methylcytosine restriction system component n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001972FC4 Length = 431 Score = 240 bits (613), Expect = 5e-62, Method: Composition-based stats. Identities = 48/342 (14%), Positives = 113/342 (33%), Gaps = 23/342 (6%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + Q + + NL ++ + V + +RGL Y E Sbjct: 103 LINMLKTLREAPYKSLQTSSVNVEKMNLFEVFIRMFVDEVFSIVKRGLRCSYELTEENTS 162 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 KG++ F++ IR + + ++ + + N+++K+TL L + R + Sbjct: 163 FFKGKLLFSEQIRHNYSHRERSHVEYGDFTANRPENKLLKATLLRLYRQTSSQKN-RSDI 221 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + L G+ +N + Y+ + C+ + G + Sbjct: 222 KILLTAFSGVEASTDCKGDLVRYIPDRNMKGYRTALMWCRIFLTGKSFASFAGSEQAPAL 281 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI--R 248 M +L++ + +++L + + + DI + + Sbjct: 282 LF---PMEVLFESYTAALLKKKLDGSRFAVLVQDKTHYLFDEPGKK--FLLRPDIVVKRK 336 Query: 249 SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT 308 S I ++D K+ + ++YQ+ Y E+ L+YP + Sbjct: 337 SDGAIFVLDTKWKVLDLGKT--NYGISQADMYQMFAYQKKYGAEH-----MTLLYPETEK 389 Query: 309 AVKHRYKI----NGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 R +G + + ++L + L ++ + Sbjct: 390 VPPDRRIEFRADDGAAVLVKFIDLFHP----EESLTNVIQSF 427 >UniRef50_C3FCB5 McrBC 5-methylcytosine restriction system component n=3 Tax=Bacillus cereus group RepID=C3FCB5_BACTU Length = 439 Score = 239 bits (609), Expect = 2e-61, Method: Composition-based stats. Identities = 53/306 (17%), Positives = 106/306 (34%), Gaps = 14/306 (4%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 M+ + +I + NL ++ + + V +L ++GL Y P + + Sbjct: 102 FLRMIRNMKDFTSKIFNDANLNVDKMNLYELFISMYIQEVRELIKKGLRSSYFPQVDNVN 161 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 KG++ + I+ ++ + +D + NR+IKSTL L K + I+ Sbjct: 162 YFKGKLIIREQIKKNQVHKERFYVEYDEYGINRPENRLIKSTLLKLQKLSNSAANIK-RI 220 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 R L I + FS + +NT+ YK ++ K + N G Sbjct: 221 RQLLPNFEKIKPSINYKKDFSKVVIDRNTKDYKTLMQWSKVFLINQSFTTFSGETNARAL 280 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI-RS 249 M +++ ++ ++ L+ S + + DI I R Sbjct: 281 LF---PMEKVFEAYVARNLKQVLSDLMWEVSIQDKGYYLFNSPKR---FALRPDIVIMRE 334 Query: 250 SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 +I+D K+ K + ++YQ+ Y + + L+YP + Sbjct: 335 DGSRVILDTKWKKLVD-NPNRNYGISQADMYQMYAYSKKYEAQ-----EIWLLYPLNEGM 388 Query: 310 VKHRYK 315 Sbjct: 389 KDTDIN 394 >UniRef50_C4G4B6 Putative uncharacterized protein n=3 Tax=Bacteria RepID=C4G4B6_ABIDE Length = 451 Score = 238 bits (608), Expect = 2e-61, Method: Composition-based stats. Identities = 54/344 (15%), Positives = 120/344 (34%), Gaps = 23/344 (6%) Query: 9 RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 R ML + ++ + NL +I + + V QL + G++ Y + Sbjct: 112 RVFMRMLRSMKDFPSKVFTNANLKMDRMNLYEIFINMYIQEVRQLVKHGIKSSYVGQEDN 171 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 + KG++ + I+ + + FD + N++IKSTL L K + Sbjct: 172 LMVYKGKLIVNEHIKHNLTHKERFYVGFDEYQVNRAENKLIKSTLLKLQKLTTSVENSK- 230 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 E R L + + + FS + ++T+ Y+ +I K + N G Sbjct: 231 EIRQLLTAFELVESSINYDKDFSKIVTDRSTKEYEMLIKWSKVFLKNKSFTTFSGTESAR 290 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISD--QSLNLLPRMETDIT 246 M +++ ++ +F ++ + S + + + D+ Sbjct: 291 ALMF---PMEKVFEAYVAKFMKKVFSRIGWEVSAQDKGHYLFNSLNGENHKRFALRPDLV 347 Query: 247 IRSSEK---ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 + +++ ++I+D K+ + + ++YQ+ Y L+Y Sbjct: 348 VTKNDENKSVIILDTKWKSLVNDKGT-NYGISQADMYQMYGYSKKY-----GTSEIWLLY 401 Query: 304 PHVDTAVKHRYK----INGFDIGLCTVNLGQEWPCIHQELLDIF 343 P D +G + L V++ I + + D+ Sbjct: 402 PVNDAMRDCGTIKFDSGDGVTVSLFFVDVA----NIEKSMEDLL 441 >UniRef50_B7LQZ4 Putative 5-methylcytosine restriction system component n=6 Tax=Enterobacteriaceae RepID=B7LQZ4_ESCF3 Length = 447 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 58/306 (18%), Positives = 111/306 (36%), Gaps = 13/306 (4%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML++ G+ Q LL+I + V QL ++GL DY + Sbjct: 95 LLTMLSHLPGFRHIQTQQATLQAQRIPLLEIFIHQFLHSVSQLLKQGLRSDYVSKQGNLA 154 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG++ + +R +N K +D D ANR++ S L L+ + + R Sbjct: 155 FMKGKLMLSAQLRHNAVNRHKFCVDYDDYMPDCAANRLLHSALDKLLSLKLSSENQRW-L 213 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 L GI + + L + +Y ++ + I+ P +G+ + Sbjct: 214 YELRFAFDGIPLSRDIERDINNLRLERGMAHYNEPMAWAQLILRGMSPSALQGNTKAISL 273 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M +++ F+ + EL S+ L ++ D+ I+S Sbjct: 274 LF---PMEAVFESFVAQTLPDELPPHLKVLPQAA--TYSLVKHGLKDCFKLRPDLLIQSH 328 Query: 251 ---EKILIVDAKYYKSIFSRRMGT-EKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 + +++D K+ S++ + + YQ+ Y G N LIYP Sbjct: 329 KPVQTKMVMDTKWKLVNSSQQTKSLYGLAQADFYQMFAYGQKY---LGGNGEMYLIYPAH 385 Query: 307 DTAVKH 312 D + Sbjct: 386 DDFSQP 391 >UniRef50_A6VF50 Putative uncharacterized protein n=2 Tax=Methanococcus RepID=A6VF50_METM7 Length = 416 Score = 236 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 55/322 (17%), Positives = 119/322 (36%), Gaps = 13/322 (4%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 ML + + L +I + + L +RGL+ DY + + Sbjct: 103 FLKMLKTLKDSPFKKFDLSNLKTDRMPLNEIFITMFLDELSVLIKRGLKSDYIETQKNLN 162 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG+++F + I+ ++ + FD +D NRIIKSTL L K K +++ Sbjct: 163 VLKGKLKFKEHIKHNLIHKERFFVEFDEFIKDMAENRIIKSTLKELSKRSKSGKNLKN-I 221 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 IS + F+ G+ Y+ ++ C+ + N KG + Sbjct: 222 SEYSFVFDEISESKNIEKDFNACKSGRLMVDYENILLWCRVFLKNESFINFKGSNVAFAL 281 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M +++ +L ++ + ++ + + L + +++ DI Sbjct: 282 L---YPMEKIFESYLTYKLKKSGKFSYVKAQDSRF---FLVKEDLKKMFKLKPDIYAEKD 335 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAV 310 + I ++DAK+ ++YQL++Y + +++ L+YP D + Sbjct: 336 DTIYLIDAKWKILDV--NSPNYGISQGDMYQLLSYAKIYENNCKKHVKMALVYPKTDKFL 393 Query: 311 K----HRYKINGFDIGLCTVNL 328 + + + + L Sbjct: 394 EKVNFEYFDEKNVLLKIWPFEL 415 >UniRef50_D1WRX1 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptomyces sp. ACT-1 RepID=D1WRX1_9ACTO Length = 429 Score = 236 bits (601), Expect = 1e-60, Method: Composition-based stats. Identities = 52/342 (15%), Positives = 113/342 (33%), Gaps = 20/342 (5%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P +PVR +++++ YA ++ ++ L + + + R+G+ Y Sbjct: 70 PKVPVRRLFFLIGYAADPRVHRD-GEVDVTEDEEIVPALAQGFERALERALRQGVLQGYR 128 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E P ++GR+ A + H +D D NR+++ + L+ ++ Sbjct: 129 HTEEASPVVRGRVREADQVNRHHGRSFPVEIAYDDYGTDIAENRLLRGAVERLLPLHRVP 188 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 +R R +L L ++ + Y + + + I+ + G Sbjct: 189 GDVRRRLRHHRARLLDAEPLGRGARYLPRWRPSRLNHRYLPALRLAETILRGASVEHGTG 248 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + + +++ F+ R L + + M Sbjct: 249 GASVDGYLIDTH---KVFEDFVCVALREALARYGGRAALQARGVYLDDAGEI----SMRP 301 Query: 244 DITIRSSEK--ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D+ ++ + DAKY E F +LYQ++ Y +L +G + Sbjct: 302 DLVWYGEDRTPRAVADAKYKA------EKPEGFPDADLYQMLAYCTALGLRDGHLVYARG 355 Query: 302 IYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 P V V+H I + L + + E+ + Sbjct: 356 YEPTVTHQVRHS----QIRIHQHALTLDRPPGELLAEIAALA 393 >UniRef50_A4Y9J5 Putative uncharacterized protein n=3 Tax=Gammaproteobacteria RepID=A4Y9J5_SHEPC Length = 435 Score = 235 bits (600), Expect = 1e-60, Method: Composition-based stats. Identities = 56/347 (16%), Positives = 122/347 (35%), Gaps = 20/347 (5%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ++ A Q + L + L + + +RG+ DY E Sbjct: 95 LRKLILNALQLKQRETEYTDIERFDAPLTEWLMAQFLTELDSVIKRGMRFDYQRIEESQR 154 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G+++ K +R D+ D NR++K+ L + K + N R A Sbjct: 155 FLRGQLDVVKQLRQPAGREHIFNIRHDIFTADRAENRLLKTALLRVCKTTQDNDNWR-LA 213 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 L L + T F+ + +Y+ + C+F+++ +P +G ++ Sbjct: 214 HELQSLLHELPTSTDIQADFTTWRSDRLMAHYQAIKPWCEFVLSQHVPLAVQGLWQGISM 273 Query: 191 ERNEKEMSLLYQKFLYEFC----RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 M L++ ++ +R L R+ L + +++ D+ Sbjct: 274 LF---PMERLFESYVAAELDAAVQRMLGVKGEVRTQLASKY--LCKHQGKDFFQLQPDLQ 328 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH- 305 ++ I+D K+ S + Q+ YQL Y + +G + LIYP Sbjct: 329 LKLGADHWILDTKWKLLDASDKENKYGLSQQDFYQLFAYGQTYLGGDGTLV---LIYPAW 385 Query: 306 ---VDTAVKHRYKINGF-DIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 ++ N + + +L + ++L+ + +K Sbjct: 386 HKFPAGTYLGEFRFNDKLTLKVLPFDLDAQ--NAAKDLISKLQQGVK 430 >UniRef50_A6X8D9 Putative uncharacterized protein n=1 Tax=Ochrobactrum anthropi ATCC 49188 RepID=A6X8D9_OCHA4 Length = 422 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. Identities = 51/325 (15%), Positives = 105/325 (32%), Gaps = 15/325 (4%) Query: 8 VRNIY-YMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 +R++ M+T + A + + + L RRGL Y Sbjct: 87 IRSLLIKMITESLNLRPRRAGNAGVAAFHMPMTEWFASAFLEEASDLIRRGLRSGYATIG 146 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTI 126 P ++GR+E A+ I + D + D NRI++S + + ++ + Sbjct: 147 TREPYLRGRLEVARQIGAAGGL-HQFSVQLDEYSLDRPENRILRSAVEHVARNTASSENW 205 Query: 127 RDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 R AR L L + G+ Y V + + ++ + +P G + Sbjct: 206 R-RARELSALLGEVPESGDVGSDLGKWERGRQLADYGRVKPLAELLLTHQLPFATLGDRK 264 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 M L++++++ L + + R++ D+ Sbjct: 265 GMSMLF---PMERLFERYVFSSVSASLKPGFESVWQPSR--HHLCSLGAEEWFRLKPDML 319 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 + + I+DAK+ + R + YQL Y G + L+YP Sbjct: 320 VHDEARRWIIDAKWKRLN-PDRSEKFGLAQADFYQLFAYGQKY---LGGSGDMFLMYPGT 375 Query: 307 DTAVKHRYKI---NGFDIGLCTVNL 328 D + + + ++ Sbjct: 376 DEFPEVPGPFEFSRDLRLHVLPFDM 400 >UniRef50_A7ZRA6 Putative uncharacterized protein n=3 Tax=Enterobacteriaceae RepID=A7ZRA6_ECO24 Length = 428 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 60/303 (19%), Positives = 111/303 (36%), Gaps = 13/303 (4%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML++ + Q LL+I + V QL ++GL DY + +K Sbjct: 98 MLSHLPRFRHIQTQQATLQAQRMPLLEIFISQFLQSVSQLLKQGLRSDYVSEKGNLAFMK 157 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 G++ + +R +N K +D D ANR++ STL L+ + + R L Sbjct: 158 GKLMLSAQLRHNAVNRHKFCVDYDEYMPDCAANRLLHSTLDKLLSLKLSSENQRW-LYEL 216 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 GI + L + +Y I+ + I+ P +G+ + Sbjct: 217 CFAFDGIPLSRDIESDLNSLRIERGMTHYSEPIAWAQLILRGMSPSALQGNTKAISLLF- 275 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS--- 250 M +++ F+ + EL S S S+ L ++ D+ I+S Sbjct: 276 --PMEAVFESFVAQTLPYELPSHLKVFSQAA--TYSLVKHGLKDCFKLRPDLLIQSRQPI 331 Query: 251 EKILIVDAKYYKSIFSRRMGT-EKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 + +++D K+ + S++ + + YQ+ Y G LIYP D Sbjct: 332 QTKMVMDTKWKQVNSSQQKKSLYGLAQSDFYQMFAYGQKY---LGGTGEMYLIYPAHDDF 388 Query: 310 VKH 312 + Sbjct: 389 SQP 391 >UniRef50_B7KTT4 IQ calmodulin-binding-domain protein n=2 Tax=Rhizobiales RepID=B7KTT4_METC4 Length = 426 Score = 234 bits (597), Expect = 3e-60, Method: Composition-based stats. Identities = 60/356 (16%), Positives = 130/356 (36%), Gaps = 26/356 (7%) Query: 4 PVIPV-------RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRR 56 P I R + +ML + ++LL+IL + + + + RR Sbjct: 74 PKIGDLCEGGVRRKLVHMLAVTHDLDVSAGALSELDWQRDDLLEILIRLFARMLAEAVRR 133 Query: 57 GLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL 116 G+ Y + + +P ++GR++ A+ + FD L+ D N+I+K+ + L Sbjct: 134 GMPRRYVGHEDDLPVLRGRLDAARQFTRLAASPQSLACRFDALSADIALNQIMKAAVLRL 193 Query: 117 IKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNS 176 + R R L I + + F + + ++ + ++ ++ Sbjct: 194 QSF-ARGAETRRLLRELAFAYADIREVPIETLRFDLVIVDRTNARWRDLQALACLLLQGR 252 Query: 177 IPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN-TTRSYLKWDASSISDQSL 235 + G + M+ L++ ++ R L S+ + +S Sbjct: 253 FQTTSGGAATGFSLLF---AMNALFEAYVARMLARVLRSSGQRVVAQGGLLYCLEDPESG 309 Query: 236 NLLPRMETDITIRSS-EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENG 294 R + DI ++ E L++D K+ + + ++YQ+M Y + Sbjct: 310 IRTFRTKPDILVKRDSETTLVIDTKWKRLAPVIDDPKQGVSQADIYQMMAYGRLYR---- 365 Query: 295 ENIGGLLIYPHV-------DTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 LL+YPH +HR + ++ + T++L Q + +L D+ Sbjct: 366 -CARLLLLYPHHARLGAQPGLRSRHRVTSSDDELFVGTIDLEQ-LETVPGQLSDLA 419 >UniRef50_UPI0001BCC8FD 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Aeromicrobium marinum DSM 15272 RepID=UPI0001BCC8FD Length = 357 Score = 234 bits (596), Expect = 5e-60, Method: Composition-based stats. Identities = 80/355 (22%), Positives = 148/355 (41%), Gaps = 13/355 (3%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDY 62 P IP++N++ + YA + + Q + A +L + +L V Q GL + + Sbjct: 3 PKIPIKNVWLLQLYASSLYRAVGQRLVAAEDNPEDLPAFVAGMLADAVTQRLHTGLSVGF 62 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + ++GRI+ T R L+ G+ TFD + DT ANR+ ++ L Sbjct: 63 QRTSRPLTRVRGRIDVLPTARHQLLSRGQVHCTFDEVVADTPANRLARAALWRAATLVPH 122 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 R A L G+ L+ + + +I+ +++ +IP ++ Sbjct: 123 EPRFRSLALQLEA--AGVRGPCPPLSRVPGLHRERLLVRDRQMIATADLLLSLAIPTTDE 180 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY---LKWDASSISDQSLNLLP 239 G + +E+ + +L+++ F R L ++ LKWD S +S +LP Sbjct: 181 GGKLLPAPDMDERYLRVLFERACVGFFRLRLEPQGWKVNHNSPLKWDTSFMSSGMGAILP 240 Query: 240 RMETDITIRS------SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP-E 292 ME DI + + +++D K+ + G K S +YQ+ YL S + E Sbjct: 241 GMELDIELVHHDLTGPGRRRVVIDTKFTTITKMNQYGNLKLRSGYIYQIYAYLMSQEASE 300 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYL 347 GL+++P V V I G I TV+L + + ++LL ++ Sbjct: 301 TDPKSEGLMLHPVVGERVDEEVVIQGHRIRFATVDLAADSATLAEQLLATITPHV 355 >UniRef50_B1BM79 Putative uncharacterized protein n=1 Tax=Clostridium perfringens C str. JGS1495 RepID=B1BM79_CLOPE Length = 425 Score = 232 bits (591), Expect = 2e-59, Method: Composition-based stats. Identities = 73/349 (20%), Positives = 140/349 (40%), Gaps = 21/349 (6%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 +E + + +ML+ + + N L+DI+G + + + + +RG+ Sbjct: 81 LEDRKV----LLFMLSKCRKINIKTLDFIGSNLKNNPLIDIMGEIFYRDLSRELQRGIYS 136 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 +Y I IKG++ K + N K +D ED NRI+K L+ L++ E Sbjct: 137 EYVSVENSIGNIKGKLLVTKHSKVNRFNKNKAYCAYDEFTEDNFFNRILKKALSYLLR-E 195 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 N ++ R L R +S + + + +K + K I+N S+ Sbjct: 196 VRNERLKSNLRVLDRSFEEVSDKFINKHALNRYKLNRRNERFKNSFELAKMILNGSMGDN 255 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT-TRSYLKWDASSISDQSLNLLP 239 +KG + EM+ LY++++ R ++ N + K + + Sbjct: 256 SKGKEFGFTLLF---EMNYLYEEYIGVVLREVISEENIFVSTQEKTKYLLYNKKRKREEI 312 Query: 240 RMETDITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 ++ DI I E +I+D K+ K R G E + ++YQ+ Y+ S K E Sbjct: 313 ALKPDIVIYKDETPKIIIDTKWKK---GSRNGKENYSQGDVYQMYAYITSYK----ECEK 365 Query: 299 GLLIYPHVDTAVKHRYKING---FDIGLCTVNLGQEWPCIHQELLDIFD 344 +++YP + + + G I + +V+L + + L DI Sbjct: 366 CVILYPKEEDEGNIIWNLKGYQDKKIFMRSVDLS-SYERTKEILKDIVK 413 >UniRef50_Q2SJR5 McrBC 5-methylcytosine restriction system component n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SJR5_HAHCH Length = 437 Score = 230 bits (586), Expect = 7e-59, Method: Composition-based stats. Identities = 55/317 (17%), Positives = 108/317 (34%), Gaps = 13/317 (4%) Query: 2 EQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 E P R + ML + L + + + +L RGL D Sbjct: 83 EDPQAARRILQRMLMTSLDVRPRTAGDAKLRRMRQPLHEWIFRQFLTELQRLVGRGLRFD 142 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y E I+G++ ++ R + D+ D L NR++K+TL+ ++ + K Sbjct: 143 YQRVDEESRFIRGQLRLSQQQRQPLGRRHLFQISHDIYTPDRLENRLLKTTLSYVLANCK 202 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 R A L ++ I + + K + Y + C+ I+ P Sbjct: 203 SGENWR-RANELTHRMADIPPEQEPLRAMNNWRSNKLMQDYDAIRPWCELILAKLNPNFQ 261 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLL--- 238 +G +R M L++K++ R EL + ++ + + Sbjct: 262 QGQHRGIALLF---PMEQLFEKYVEVSLRHELPAGIQLKAQASSQYLLRHKPQGSDISTT 318 Query: 239 -PRMETDITIRSSEKILIVDAKYYKS--IFSRRMGTEKFHSQNLYQLMNYLWSLKPENGE 295 +++ D+ +R+ ++D K+ +LYQ+ Y + G Sbjct: 319 MFQLKPDLLLRTPLGDQVLDTKWKLLDQCAWTSDKKYNIAQSDLYQMFAYGHKYQHGRGH 378 Query: 296 NIGGLLIYPHVDTAVKH 312 +LIYP + Sbjct: 379 ---MMLIYPKHPAFTEP 392 >UniRef50_C5EXA5 Putative uncharacterized protein n=1 Tax=Helicobacter pullorum MIT 98-5489 RepID=C5EXA5_9HELI Length = 552 Score = 229 bits (584), Expect = 1e-58, Method: Composition-based stats. Identities = 50/280 (17%), Positives = 104/280 (37%), Gaps = 6/280 (2%) Query: 15 LTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKG 74 L + LL++ + + +L RGL+ DY + +KG Sbjct: 173 LVTLKDSPFKQSHIASLQSLNLPLLEVFIQMFLAELERLIHRGLKSDYREIAQNRVFLKG 232 Query: 75 RIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLY 134 ++ F + I+ ++ + + D + ++ NR+IK TL L + L+ R + SLY Sbjct: 233 KLLFNEQIKHNLIHKERFFTQSDEYSLNSAPNRLIKCTLEFL-RTLSLSPKTRTKLDSLY 291 Query: 135 RKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNE 194 I+ + F+ + + Y+ V+ C + G R + Sbjct: 292 FIFEEITPSSHIDRDFAKCKSMRRFKEYELVLLWCAIFLQQKSFSAYSGSERAFALLF-- 349 Query: 195 KEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKIL 254 M L++ F+ + R + + + D + +++ D+ +RS +IL Sbjct: 350 -PMERLFESFVGHWLGRSIEHHEIK--LQEQRYYFMQDFQKVDIFQLKPDVIMRSESEIL 406 Query: 255 IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENG 294 I+D K+ + ++YQ+ Y E+ Sbjct: 407 ILDTKWKIPDSTNDEKRYGIAQSDVYQMWAYASKYALEST 446 >UniRef50_C5CG01 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Kosmotoga olearia TBF 19.5.1 RepID=C5CG01_KOSOT Length = 397 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. Identities = 72/350 (20%), Positives = 132/350 (37%), Gaps = 31/350 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANL-EAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + ++N+ +ML YA+ + + L + L V K V+ ++RGL +Y Sbjct: 74 PRVELKNLLHMLEYAYNLKSFQFIEGIQDCETIEELYERLVKVFVKRVIDRTKRGLYREY 133 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P ++G++ I+ K + D N+II TL+ L+ + + Sbjct: 134 IQQNDRLPYVRGKLNVRSMIKQ--PWKVKLDCVYQDHTNDIEENQIILWTLSKLVMSDSI 191 Query: 123 NSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + R+ R YR L G I P + Y + + KF + NS P Sbjct: 192 SEGTRNLVRKAYRSLAGTIKVRPFKPSECIKRFYNRLNSDYLPIHVLAKFFLENSGPAVK 251 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 G + F N M L++KF+ E+ ++ L R+ K + + S N+ Sbjct: 252 SGTSKMIPFLVN---MPRLFEKFIAEWLKKNLKG-YIVRAQEKVNLDKENSLSFNI---- 303 Query: 242 ETDITIRS---SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 D+ I S E + ++D KY E+ ++ Q+ +Y Sbjct: 304 --DLVIYSGLTGEAVAVLDTKYKI--------NERPSDNDISQIASYAM-----TKNCTK 348 Query: 299 GLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 LIYP +D + I T L + ++D +++ Sbjct: 349 AFLIYP-IDMNPPVNVTVGSVAIRCLTFQLSGDLEANGMRMVDELMRFIE 397 >UniRef50_B2A7U4 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A7U4_NATTJ Length = 383 Score = 227 bits (580), Expect = 4e-58, Method: Composition-based stats. Identities = 75/340 (22%), Positives = 141/340 (41%), Gaps = 29/340 (8%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P I N++ ML+Y++ + + LLD L V V +L ++GL DY Sbjct: 67 EPKIDTANVFKMLSYSYDLI-FWHDEKAQFANIQELLDYLVLVFCNQVNRLIKKGLHADY 125 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + + KGR+ + + K +D D L N+IIK T+ +L ++ + Sbjct: 126 VLVNDKLSYAKGRMNVRELVEKPW-EKHKIDCYYDNYQVDILENQIIKFTIDLLKRYIQ- 183 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 N+ IR + R +S +T + + ++YK + + CK + + Sbjct: 184 NNWIRRSLLNTNRYFDSVSLRPITVEDIDQVQYTTLNKHYKHIHNFCKMFLELMGINEQI 243 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G F F EM+ LY+K++ + + EL + K + Sbjct: 244 GETLFNQFHL---EMNNLYEKYVGKLLKEELPNNYCVILQDKLHLDEYDQ------ISIR 294 Query: 243 TDITIRSS-EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 DI I + + L++D KY G++ + ++YQ+ Y+ K + G+L Sbjct: 295 PDIVIYNDVKPYLVIDTKYK--------GSKDITNNDIYQMAAYMSKTKTD------GVL 340 Query: 302 IYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLD 341 +YP + Y ING + + T++L Q ++L++ Sbjct: 341 LYPAQ-EVAETEYIINGRSLNIKTIDL-QNLDDGAKDLIN 378 >UniRef50_B7UQU6 McrC family protein, predicted McrBC 5-methylcytosine restriction system component n=24 Tax=Enterobacteriaceae RepID=B7UQU6_ECO27 Length = 436 Score = 226 bits (577), Expect = 7e-58, Method: Composition-based stats. Identities = 55/311 (17%), Positives = 124/311 (39%), Gaps = 16/311 (5%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEA-IPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 VR M+ + I + I L+DI + V ++ +RGL+ DY Sbjct: 96 VRQQLLMMLRTLKSFRHIASSESGVKISKMPLMDIFIQQFIESVRKIVQRGLKRDYLRQE 155 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTI 126 + +P +KGR+ + + + + +D + + L NRI+K+ + + + N + Sbjct: 156 DNLPWMKGRLRISAQLSKNCIRRDRFQVEYDEYSVNRLENRILKTAINKISRQTS-NPQL 214 Query: 127 RDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 + L I+++H F L+ + +Y+ ++ K I+ P G Sbjct: 215 LQQITQLQFHFENITSVHDAYIAFEQLHFDRQMHHYEQALAWAKMILLGDSPHCMYGDVN 274 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 + M +++ F+ + R + + ++ + L ++ DI Sbjct: 275 AFSLLF---PMEAVFESFVTTWMRYRYYDKWRVDAQVSSK--NLISYNGKALFKLRPDIC 329 Query: 247 IR----SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI 302 +R ++ ++ D K+ I + R + + +LYQ++ Y + + G+ +LI Sbjct: 330 LRPRKSTTGSVITCDVKWK--IVNGRKDSLEQSQADLYQMLAYGLNYQEGEGD---MILI 384 Query: 303 YPHVDTAVKHR 313 YP+ + + Sbjct: 385 YPYHNGFNQPS 395 >UniRef50_D0C387 McrBC 5-methylcytosine restriction system component n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0C387_9GAMM Length = 437 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 52/335 (15%), Positives = 108/335 (32%), Gaps = 18/335 (5%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + + + + +L + GL DY E Sbjct: 103 LKKMLKVSLHLPYRDAGEASLNRFKQPIHEWIIEQFLCNFEKLIQYGLRFDYQRVQEEQK 162 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G++ K +R D+ D NR+IK+ L ++ K + + + A Sbjct: 163 YLRGQLLHVKHMRQSPARKHIFPIEHDIYEVDRPENRLIKTALDVVCKKTRSSKNWK-LA 221 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + L I Q F G+ Y + + I++ +P G +R Sbjct: 222 QELRLMTGEIPKSQNIVQDFKQWQSGRLLALYVDIRPWTELILSEYMPVSTHGGWRGMSL 281 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 M L++ ++ EL + + ++ L R++ DI I+ Sbjct: 282 LF---PMEKLFEHYVAYHLHHELKEWDVKTQVSNQHICTFNE---KPLFRLKPDIYIQHK 335 Query: 251 --EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP---- 304 +I+D K+ + R ++ Q+ Y +LIYP Sbjct: 336 CSPYKIILDTKWKLLDQNDRNRRFGLKDSDVQQMFAYSHYY---LDHASEVILIYPYHKN 392 Query: 305 HVDTAVKHRYKINGFD--IGLCTVNLGQEWPCIHQ 337 + + ++ + + + NL + + I Sbjct: 393 KFEDEICFKFNVQNDQAMLRVIPFNLDKPYDFIKS 427 >UniRef50_C7MHK3 McrBC 5-methylcytosine restriction system component n=1 Tax=Brachybacterium faecium DSM 4810 RepID=C7MHK3_BRAFD Length = 393 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. Identities = 60/346 (17%), Positives = 118/346 (34%), Gaps = 24/346 (6%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P IPV + +++ YA + ++ +L L + + +GL Y Sbjct: 63 RPKIPVERLVFLMGYASAPT-FWRDHSVRLDTDADLPQALARTFMRLATKAIEQGLLQGY 121 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P ++GRI I ++D D NR++ + L++ L Sbjct: 122 QRVDDSLPVLRGRIRVTDQISRRFGADLPLEVSYDDFTVDIAENRLLLAAATRLLRLPGL 181 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + R + L +L G+S + + Y+ + + + I+ Q + Sbjct: 182 DVRTRQGLQRLRLQLSGVSEVRRGD-ELPRWQPTRLNARYQPSLRLAERILAGESFEQRR 240 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G R F + M +Y+ F+ + L S + + + Sbjct: 241 GRLRVDGFVFD---MWKIYEDFVGVALKEALASRGSATLQHRMHLDHAQRVD------LR 291 Query: 243 TDITIRS-SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D S + ++VDAKY + +LYQL+ Y L G L Sbjct: 292 PDFLWTSHNGDQVVVDAKYKA------EKPAGYPQADLYQLLAYCTVLGLR-----EGHL 340 Query: 302 IYPHVDT-AVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 +Y + + H + I TV+L Q + ++ + Sbjct: 341 VYAKGNESELTHDVRGTEIVIHCHTVDLDQAPSTLLGQVRSLAHRI 386 >UniRef50_Q6LZ73 Putative uncharacterized protein n=1 Tax=Methanococcus maripaludis RepID=Q6LZ73_METMP Length = 426 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. Identities = 63/336 (18%), Positives = 120/336 (35%), Gaps = 22/336 (6%) Query: 8 VRNIYY-MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNT 66 VR I+ ML + L +I + + L + GL+ DY Sbjct: 98 VRRIFLKMLKSLKNAPFKEFGDASLKTHKMKLNEIFIKIFLDDLDVLVKNGLKSDYISIE 157 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTI 126 + + +KG+I+F + I +++ + +D + NRIIKST+ L+K+ LNS + Sbjct: 158 DNLNVLKGKIKFNEHISKNYIHKERFYVNYDEFIRNRPENRIIKSTIKYLLKNSSLNSNL 217 Query: 127 RDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 + + I + F+ + YK +I CK + N KG Sbjct: 218 K-RINEFLFIMDAIPESKNLEKDFAACVNNRLMTDYKKIIPWCKVFLKNESFTNFKGDEI 276 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 Y M +++ +L E ++ S + + ++ + R++ DI Sbjct: 277 AYALL---YPMEKIFESYLTEEFKK---SGKFETIVSQGNGYFLAKHKNEGIFRLKPDIY 330 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 +S K I+DAK+ + ++YQL++Y L+YP Sbjct: 331 AETSSKKYIMDAKWKILNSDKN-KNYGISQNDMYQLLSYAVVY-----GCNELRLLYPKS 384 Query: 307 DTAVKHRYKINGFDIG--------LCTVNLGQEWPC 334 + I + ++L + Sbjct: 385 KDFKRILEFEYNNSINYKEKISLKIIPIDLERNIAD 420 >UniRef50_B0A6E5 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A6E5_9CLOT Length = 426 Score = 221 bits (562), Expect = 4e-56, Method: Composition-based stats. Identities = 54/342 (15%), Positives = 127/342 (37%), Gaps = 24/342 (7%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + Q ++ N+ +I + VL + ++GL+ +Y Sbjct: 101 VMDMLKTLRKSPYKSLQVANVSVDKMNIFEIFIRMFINEVLLIVKKGLKSNYETIESNER 160 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 KG+++F + IR + + + +D N + N+++KSTL L K+ + +++ Sbjct: 161 VFKGKMKFTQQIRYNYAHKEQCYVEYDEFNTNCPENKLLKSTLLYLYKNT-CSLKNKNDI 219 Query: 131 RSLYRKLPGISTLHLTPQHFSYLN-GGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYD 189 + L + + F+ + +N + Y + K + G + Sbjct: 220 KMLLNSFLEVDKSTNYEEDFNRIIAADRNKKDYTTALLWSKIFLMGKSFTSFSGSKIAFA 279 Query: 190 FERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRS 249 M L++ ++ E R+ L + + S + ++ DI +++ Sbjct: 280 LLF---PMEKLFESYVAEILRKNLNKSLYSISIQDKTYHLFDKPNKK--FLLKPDIVVKN 334 Query: 250 SEK--ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 + I I+D K+ + ++YQ+ Y +N +L+YP+ + Sbjct: 335 KKNNDIFILDTKWKLLSNQK--SNYGISQSDMYQMYAYSKKYGSKN-----VILLYPNAE 387 Query: 308 TAVKHRYKI----NGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 + ++ NG + + ++L I + + E Sbjct: 388 NTIINKTIEFESNNGTSVKVKFIDLF----NIKYSINSLISE 425 >UniRef50_B5IHH3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Aciduliprofundum boonei T469 RepID=B5IHH3_9EURY Length = 460 Score = 220 bits (561), Expect = 5e-56, Method: Composition-based stats. Identities = 65/353 (18%), Positives = 135/353 (38%), Gaps = 29/353 (8%) Query: 8 VRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTE 67 ++N+ ML AW + I N++ ++L + + +L + GL +Y ++ Sbjct: 109 IKNLVKMLQIAWNLPIRDVDISSLKIGENSIFEVLLTIYSIKLLDAIKEGLYKEYIRVSD 168 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 + +KG+I+FAK R ++ N D L NR +K A L NS Sbjct: 169 DLHYVKGQIDFAKYSRR-WERRHIIPVNYNDRNPDNLINRTLKYA-AYLASLYTRNSMNF 226 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + + +S + ++ + + YK +I++ + I+ N P G Sbjct: 227 SNLKMAENLMDSVSLVPVSASEIDSITFTRLNEGYKPLINLARVIITNLSPEFTGGKKDV 286 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLL----PRMET 243 + F M ++++F+ + + +DQ +LL + Sbjct: 287 FAFL---IPMEKVFERFIANSIVQNKSKVLGNDCKSCEVYVQGADQKKHLLKGSRFMLIP 343 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 DI I+ + K I+D KY ++YQ++ Y ++ +L+Y Sbjct: 344 DIMIKINGKRYIIDTKYKLLDTED-EKKYGVSQSDVYQMLAYAYAYD-----TPKIMLLY 397 Query: 304 PHVDTAVKHR--------YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 P + K+ G + + T++L + +L+ +D++L+ Sbjct: 398 PKGVGDFDKKEWEFENINSKLAGKKLIIETIDL------MKYDLVKEYDKFLE 444 >UniRef50_D2QGI2 5-methylcytosine restriction system component-like protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QGI2_9SPHI Length = 428 Score = 220 bits (560), Expect = 7e-56, Method: Composition-based stats. Identities = 50/357 (14%), Positives = 113/357 (31%), Gaps = 24/357 (6%) Query: 4 PVIPVRN-----IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 P I ++ + ML + + L ++ V L+++G+ Sbjct: 82 PKIDQQSNTRPLLLSMLRHLRNSPFRTLRTAHSRAVRIPLWEVFITAFLDTVDALAQQGI 141 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 + Y KGR + A+ R + + +D L NRI+K+ L + I Sbjct: 142 QRAYVTVEGNERFWKGRFQAARQQRDNACHAERLAVVYDTLTASVPPNRILKTAL-VAIH 200 Query: 119 HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYL-NGGKNTRYYKFVISVCKFIVNNSI 177 + + + L L ++ + + Y+ + + ++ Sbjct: 201 AKTTDQANKRRIHQLLSVLEEVALSDDVRSDLMAVRRSNRLFMRYETALGWAEMLLMGQG 260 Query: 178 PGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNL 237 PG +G M +++ ++ R SA+ S + A + + Sbjct: 261 PGVKRGDKESIALLF---PMERVFEDYVAHGIRAYWPSADRI-SVQESSAHLVDEHVGAP 316 Query: 238 LPRMETDITIRSSEKILIVDAKYY-----KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 ++ DI IR ++ ++D K+ S R + ++YQ+ Y Sbjct: 317 RFKLRPDIIIRHQDRTFVMDTKWKQVNGLSLDTSPRTASYGIDQADMYQVYAYGKKYAAN 376 Query: 293 NGENIGGLLIYPHVDTAVKHRYKI---NGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 + L+YP T + + + ++ ++L + Sbjct: 377 DL-----FLLYPANSTFREPLAVFAYDATTRLHVVPFDVTNSLANEVEKLALYALSF 428 >UniRef50_D0BXD9 McrBC 5-methylcytosine restriction system component n=2 Tax=Acinetobacter RepID=D0BXD9_9GAMM Length = 441 Score = 219 bits (557), Expect = 2e-55, Method: Composition-based stats. Identities = 51/338 (15%), Positives = 112/338 (33%), Gaps = 26/338 (7%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + +L Q L D + +L R G+ DY Sbjct: 99 LIKILKQVLQLPQRDTGLASIEKFKVPLTDWFYAQFLDALQKLYRTGIRFDYQRVEAEEN 158 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G+++ A+ +R + D+ + NR+I+S + ++ K K + + + Sbjct: 159 FLRGQLDTAQQMRKPLTRQHQLSIKHDIFTSNRAENRLIRSCIDVVCKRAKT-ADLWRTS 217 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + + + F + +Y + C+ I+ N IP KG + Sbjct: 218 HEFHLLFSEVPQSTNYREDFKKWKNDRLMSHYSDIRYWCELILGNEIPFAVKGINQAKSI 277 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR-- 248 M L++K++ ++L + ++ + + + D+ I+ Sbjct: 278 LF---PMEKLFEKYVEIQLSKQLVKGAKLETQKSSKY--LAQYNSKDIFNLIPDLAIQYY 332 Query: 249 ----SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 S+K LI+D K+ + ++YQ+ Y + G +LIYP Sbjct: 333 CEQSKSKKYLILDTKWKLINSNNIEEKFGIKQSDMYQMFAYNHMYQ---GHTSDIVLIYP 389 Query: 305 HVDTAV----KHRYKING-------FDIGLCTVNLGQE 331 +K++ I + +L + Sbjct: 390 KHKNFQIALKPFEFKLHDLLKTGLQPKIWVIPFDLERS 427 >UniRef50_C7NQX2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Halorhabdus utahensis DSM 12940 RepID=C7NQX2_HALUD Length = 421 Score = 219 bits (557), Expect = 2e-55, Method: Composition-based stats. Identities = 60/320 (18%), Positives = 123/320 (38%), Gaps = 24/320 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P N+ Y+L YA ++ G+ LD G + + ++ RGL DY Sbjct: 84 RPKAAGTNLLYLLQYAHDTTATTFESQAPYQAGHTFLDAFGALYEAELRRIVDRGLYTDY 143 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 ++GR++ + ++ T+D L D LANR I +L+ Sbjct: 144 RRTDATESHLRGRLDIHRQLQRQPPVPTAFECTYDELTHDILANRAILHATTVLLGAVSD 203 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ-N 181 S + + +S +T Q + + +Y+ ++ + K ++ NS + Sbjct: 204 RSITQSLRQHQQLLRRQVSLTPVTIQDIERIELNRLADHYEDILRLTKLVIRNSFVSELQ 263 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA-NTTRSYLKWDASSISDQSLNLLPR 240 G + N M+ +++ + C+ L+ + + + I+ + Sbjct: 264 AGSSAAFAMLVN---MNTIFENAVERACKEVLSEREDWEVKFQDTSQNLITGGKHTV--T 318 Query: 241 METDITIRSSEKIL--IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 ++ DITI E + + DAK+ E+ + + YQ+ +Y+ + N+ Sbjct: 319 LQPDITIYDPENTVSLVADAKWK---------NERPKNADFYQMTSYMLA------NNVP 363 Query: 299 GLLIYPHVDTAVKHRYKING 318 G+L YP + R + G Sbjct: 364 GILFYPDCGGLNESRSTVTG 383 >UniRef50_C0WJ24 5-methylcytosine-specific restriction enzyme subunit McrC n=3 Tax=Corynebacterium RepID=C0WJ24_9CORY Length = 373 Score = 218 bits (555), Expect = 3e-55, Method: Composition-based stats. Identities = 74/350 (21%), Positives = 140/350 (40%), Gaps = 10/350 (2%) Query: 2 EQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGN-NLLDILGYVLNKGVLQLSRRGLEL 60 E +P+R+++ + YA + +N L +++G +L V + RR L + Sbjct: 11 EDIYVPIRSVWMLQLYASQTFIDGHISNSSVEEAGVELPELIGTMLCDAVERRFRRELSI 70 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 + I ++GRI +T R L G F+ L ++ NR ++ L H Sbjct: 71 GFTLTERNITRVRGRINMYETARHQLLEKGLIACEFNELTINSEINRFLRYALEY-AGHI 129 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 N D A ++ + + + + + K ++V K ++ ++P + Sbjct: 130 LSNVGSGDAAHRCKILGQRLAQMGVPEPKTAAFPRARLSPADKKPVAVAKLLLELAVPTR 189 Query: 181 NKGH-YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLP 239 RF + E+ L++K L+ L+ S K + Q+ + LP Sbjct: 190 GNDALPRFSRKHFTQDELRKLFEKALFGLFHYHLSPFGWKVSSGK-RLNWNVQQAPSYLP 248 Query: 240 RMETDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK--PENGEN 296 M+TDI +RS E + ++DAK+ R+G E S +LYQL YL S + E E Sbjct: 249 SMQTDIILRSPEGEITVIDAKFTHLFTENRVGNESIKSSHLYQLYAYLRSQETFSEEWET 308 Query: 297 IGGLLIYPHVDTAVKHR---YKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 G+++Y + ++G + + L ++ L + Sbjct: 309 AQGIMLYASTGQNQADEALSFFLDGHPVTFAGIGLETSIREFREKALMLV 358 >UniRef50_A0JT91 McrBC 5-methylcytosine restriction system component-like protein n=3 Tax=Arthrobacter RepID=A0JT91_ARTS2 Length = 418 Score = 217 bits (553), Expect = 4e-55, Method: Composition-based stats. Identities = 56/330 (16%), Positives = 116/330 (35%), Gaps = 18/330 (5%) Query: 17 YAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRI 76 YA Q ++ + A+ +L L L + + RG+ Y E + +KGRI Sbjct: 93 YAGN--QGFREDPVAAVEDPDLWSALAVSLVQLADRALSRGVLQGYLTVDESLRTVKGRI 150 Query: 77 EFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRK 136 + I ++D ED NRI+++ L + + ++ ++ R L K Sbjct: 151 RISDQISRRPGMLVPLEVSYDEFTEDIAENRILRAALERMARVPRVRPDVQSRLRLLLGK 210 Query: 137 LPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKE 196 L ++ L + Y V+ + + I+ N+ G + F + Sbjct: 211 LDAVTRLRPGAP-LPPWQATRMNTRYHAVLRLSEVILRNASAEAGDGKQQTASFVVD--- 266 Query: 197 MSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT-IRSSEKILI 255 M +++ F+ R +T A L+++A + + D + +++ Sbjct: 267 MGQVFEDFVGTALREAMT-AYPGEMRLQYNALLNEAVRDSDRLTVNPDAVHLLGGRPVVV 325 Query: 256 VDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYK 315 D KY + S + +Q++ Y +L+ L+Y R Sbjct: 326 YDTKYRAATDQGASL-----SADHFQMLAYCTALRVPT-----AWLVYAGAGEMKLRRIL 375 Query: 316 INGFDIGLCTVNLGQEWPCIHQELLDIFDE 345 D+ ++L I + D+ + Sbjct: 376 NTDIDVVEYPLDLSLPPSDILAAVADLAQQ 405 >UniRef50_C8S833 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Ferroglobus placidus DSM 10642 RepID=C8S833_FERPL Length = 426 Score = 216 bits (551), Expect = 7e-55, Method: Composition-based stats. Identities = 57/340 (16%), Positives = 128/340 (37%), Gaps = 24/340 (7%) Query: 2 EQPVIPVRNIYYMLTYA-WGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 ++ I ++N+ ML Y+ W ++E + + +I ++ K + +L + + Sbjct: 90 KREKI-LQNLVRMLEYSGWEGIKETDLTQIGTEK--DFFEIYVFLFAKNLAELLKVNRDA 146 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Y + + ++G+IEF K ++ N +T NR +K +L+K Sbjct: 147 SYVRTYDELRFVRGKIEFRKYW--NPARLHIIPCSYYERNMNTPINRTLKFVSYLLLKKV 204 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + R +S+ L ++ +T + + + I C+ + +S+ Sbjct: 205 ESSE-TRRLLKSVISVLDSVTLSPVTLAEVEKITFNRLNSRFIPFIDFCRAFLRDSVFSL 263 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 F+ F M L+++F+ + + + L + Sbjct: 264 QGSDVEFFSFL---IPMETLFERFVAKAVKELYKGTEWK---PHIQETFGYLVPKEKLFQ 317 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 ++ DI + + + +IVD KY R Q+LYQ+ Y L + + Sbjct: 318 LQPDIVLENGGERVIVDTKYKILDPEDR--KLGVSQQDLYQMYAYCKEL-----GSSKCV 370 Query: 301 LIYPHV-DTAVKHRYKINGFD---IGLCTVNLGQEWPCIH 336 LIYP + + +K+ + + + T++L + Sbjct: 371 LIYPESLNGKIDGEFKLGSKEKIDLKVKTISLENPFDNGK 410 >UniRef50_A7H1L9 Putative uncharacterized protein n=13 Tax=Campylobacter jejuni RepID=A7H1L9_CAMJD Length = 445 Score = 215 bits (549), Expect = 1e-54, Method: Composition-based stats. Identities = 50/325 (15%), Positives = 112/325 (34%), Gaps = 19/325 (5%) Query: 2 EQPVIPVRNIYY-MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 + +N+ ML + Q + L ++ + + ++GL Sbjct: 114 KDFKFNPKNLLINMLKTLKNSPFKKSQISSLQSSKIPLFEVFITMFLDEFDSVYKKGLMR 173 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 Y E +KG++ F + I+ ++ + ++ D D NR+IKSTL L Sbjct: 174 SYLSCEENRAFLKGKLLFNEHIKQNLIHKERFFTSNDEFVLDIAPNRLIKSTLNFLKSKT 233 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 LN + + L + + FS ++ Y+ ++ CK + N Sbjct: 234 SLN---KFRLIKAMQMLDEVEFSKNYEKDFS-YKISRHFDCYENLLLWCKIFLKNESFMP 289 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 G + M +++ ++ ++ + + + + ++ Sbjct: 290 YHGKNEAFALLF---PMEKIFEDYVAYMLKKVNPAQDIKVQN---NGKYLISKNDENCFM 343 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 ++ D+ I E +I+D K+ S+ + +LYQ+ Y K + + Sbjct: 344 LKPDLYI---ENKMILDTKWKIPNDSKDEKKQGIAQSDLYQMFAYACKFKIYDIK----- 395 Query: 301 LIYPHVDTAVKHRYKINGFDIGLCT 325 ++YP + + KI Sbjct: 396 IVYPLCEKTQDLQRKIAEKFFVFKA 420 >UniRef50_UPI0001BCB4AE McrBC 5-methylcytosine restriction system component n=1 Tax=Fusobacterium periodonticum ATCC 33693 RepID=UPI0001BCB4AE Length = 444 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. Identities = 70/374 (18%), Positives = 147/374 (39%), Gaps = 35/374 (9%) Query: 4 PVIPVR---------NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLS 54 P IP+ +L + ++ + AI ++L+I + K V ++ Sbjct: 66 PKIPLVENDIVAEKNRFLEILQNISYFKEKFFNDSKIAIADTSILEIFINLFIKEVEEII 125 Query: 55 RRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLA 114 +GL Y E I KG+++ I+ + K FD + ++L N IIK T+ Sbjct: 126 EKGLLYRYIGRNENISVFKGKLDINNHIKYNFSHKEKFFMKFDEFSINSLENSIIKLTIQ 185 Query: 115 ILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 L K +N +++ + I L + ++ Y+ + YYK I K +N Sbjct: 186 KL-KKISVNLKNKEKLNKISHHFENIIILPNSIENLKYITFDRTNDYYKNSIQWSKIFLN 244 Query: 175 NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFC----RRELTSANTTRSYLK----WD 226 N M +++ ++ + + + + + Sbjct: 245 NQSSLIFSATNGEVATM--LFPMETIFENYIANKLINIVKEKFYNQLIVKVQDDSCSAFS 302 Query: 227 ASSISDQSLNLLPRMETDITIRSSE--KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMN 284 ++++D LN + ++ DI I++ +I I+D K+ I + K + ++YQ+++ Sbjct: 303 TATLNDTKLNNMFNVKPDIVIKNKNSKEIFILDTKWK--ILDKLDNKFKISTDDIYQMLS 360 Query: 285 YLWSLKPENGE---NIGGLLIYPHVDTAVKH-------RYKINGFDIGLCTVNLGQEWPC 334 Y+ LIYP + ++K + F++ +C VNL E Sbjct: 361 YVKIYNDRYKNSYTCEKAYLIYPATNIRKNSFSSEDKIKFKTDNFELNICFVNLSSE-ET 419 Query: 335 IHQELLDIFDEYLK 348 ++L++I +++K Sbjct: 420 TEKDLVNILSKFIK 433 >UniRef50_Q2FNZ4 McrBC 5-methylcytosine restriction system component-like n=1 Tax=Methanospirillum hungatei JF-1 RepID=Q2FNZ4_METHJ Length = 437 Score = 214 bits (546), Expect = 3e-54, Method: Composition-based stats. Identities = 57/353 (16%), Positives = 116/353 (32%), Gaps = 27/353 (7%) Query: 9 RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 RN+ ML+Y ++L++ + ++ + L R Y E Sbjct: 81 RNLAVMLSYT-NLKPLSSDLTSMDQEDIDMLELFLRIFSEQLHHLLFRCQHRQYLNRDEH 139 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 + IKGRI+ K + TF L +DTL NRI K ++ +H + N ++ Sbjct: 140 LKFIKGRIQVNKYW--NPAQLERIPCTFKELTQDTLLNRIFKFCATLMSRHTQ-NEETKE 196 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 + + + L ++ H+T ++ + T + ++ C+ + +S + Sbjct: 197 HLKGILQILEPVTYTHVTSSETRFVILDRLTEQFAPLLRFCEIYLRHSTITLQASQVEIF 256 Query: 189 DFERNEKEMSLLYQKFLYEFCRR--ELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 M ++++F+ L T + + DI Sbjct: 257 SLL---IPMERVFEQFISGVLSEQSHLLPEGATVYSQYPGGHLAQTLDGRGIFELRPDIF 313 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 I +I+D KY + ++YQ+ Y +L+YP + Sbjct: 314 IDHPRIPVIIDTKYKMP--KKSSSNSGIKQSDIYQMFGYGAKKNVPAL-----MLLYPDI 366 Query: 307 DT--AVKHRYKINGFDIGL-------CTVNLGQE--WPCIHQELLDIFDEYLK 348 + + + + T +L W +EL I + Sbjct: 367 GEKIDIDLEFSYDNCRLSALLIRSITLTYDLADPVQWEMWLEELRGIMHDMYD 419 >UniRef50_Q2J945 McrBC 5-methylcytosine restriction system component-like n=1 Tax=Frankia sp. CcI3 RepID=Q2J945_FRASC Length = 416 Score = 214 bits (545), Expect = 4e-54, Method: Composition-based stats. Identities = 57/365 (15%), Positives = 121/365 (33%), Gaps = 40/365 (10%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P + +R + ++L YA + ++A +LL + + + G+ Y Sbjct: 68 RPKVTIRRLLFLLGYAQD-RGRWFEDEVQAAEEPDLLPAVAAAFARTASRALAHGVPRGY 126 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 +P ++GR+ + +R +D DT NR++ + L+ + Sbjct: 127 RQVDAALPVLRGRLRESAQLRQRSGVMFPLEVRYDERTVDTAENRLLLAATRSLLALAGV 186 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 E R + L G++ P + Y + + + ++ +S + Sbjct: 187 APATAQELRRIAAALDGVAEPAHGPVKPPDWVPTRVNAPYHAALRLAETVLRSSSFERED 246 Query: 183 GHY-RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 G R F +M +++ F+ LT D++L M Sbjct: 247 GETLRVDGFVV---KMWEVFEDFVTHAVDEVLTHRGGEVRLQDRTHHLDEDRTLE----M 299 Query: 242 ETDITIRSSEKI-------LIVDAKYYKSIFSRRMGTEKFHSQNLY-QLMNYLWSLKPEN 293 D+ + E +++DAKY +I ++Y Q++ Y L Sbjct: 300 CPDLVLYRPEGPGGRMIPAVVLDAKYRLAIRQG-------ARAHVYHQMIAYCARLGAR- 351 Query: 294 GENIGGLLIYPHVDTAVKHR--------YKINGFD---IGLCTVNLGQEWPCIHQELLDI 342 G L+Y + A +I G + ++L + + I Sbjct: 352 ----QGWLVYAGSERADGQPGGRGDVIRSRIGGPTPIGLVTYVLDLRLPLAELRARIERI 407 Query: 343 FDEYL 347 D+ + Sbjct: 408 ADDMV 412 >UniRef50_C2WWH8 Putative uncharacterized protein n=3 Tax=Bacillus cereus group RepID=C2WWH8_BACCE Length = 399 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. Identities = 61/341 (17%), Positives = 126/341 (36%), Gaps = 19/341 (5%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML+ + L LL +L R+G Y + E + Sbjct: 75 LLGMLSITEFLPISFYEEVLNGEDRGELLTAFLATFLTRLLNELRKGTYKTYERHEENLN 134 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G+IE +K I K FD E+ N++ K L I+ KH K+ T++ Sbjct: 135 TLRGKIELSKHIYKNVFQKTKAYCAFDEYTENNSLNQLFKCALLIVKKHTKI-HTLKLYL 193 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 L + ++ T + + + ++ K IV + + F Sbjct: 194 ERCLGYLEPVDVVYFTEKELKSITFNRQNERFRQAALFAKLIVERATIYSKGRGASSFSF 253 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 +M++L++K++ + + + S + +S ++ D I Sbjct: 254 LF---QMNMLFEKYIEVALQETIGNNKII-SQHAEKRLLRNKKSGRQNILLKPDFVI--- 306 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAV 310 ++I+D K+ + + R + ++YQ+ Y+ + K E +L+YP + Sbjct: 307 NNVIIMDTKWKSATNNGRSS---YVQSDIYQMYAYVTAYK----EVKRCILLYPKQEIEA 359 Query: 311 KHR---YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 H I +CT+ + E+ +EL +I + +K Sbjct: 360 VHPVWEVIDTEKTIEMCTIRID-EFSKTVRELKEILQKQVK 399 >UniRef50_Q7VG80 Putative uncharacterized protein n=1 Tax=Helicobacter hepaticus RepID=Q7VG80_HELHP Length = 485 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 55/323 (17%), Positives = 113/323 (34%), Gaps = 32/323 (9%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + + LL+I + + + +L ++GL+ DY E Sbjct: 140 LLKMLQSLKDSPFKQSHFAHLKLTKMPLLEIFILMFLQELEKLVKKGLKSDYIVCEENRN 199 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG++ F + ++ + + ++ D + + NRIIKSTL L+ + L++ + Sbjct: 200 FLKGKLLFHQNLKLNFAHRERFFTSSDEFSVNIAPNRIIKSTLE-LLNTQNLSTNTSAKL 258 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + I + S R YK ++ C+ + + + Y Sbjct: 259 MQMRFIFLDIPPSQSIDKDLSKCQNLGYFRNYKMILQWCEIFLKRKSFAPYQKDSKAYAL 318 Query: 191 ERNEKEMSLLYQKFLYEFCR----------------RELTSANTTRSYLKWD--ASSISD 232 + M+ L++ F+ + ++ + SYLK + + Sbjct: 319 LFD---MNKLFESFVASEMKKWLCDMKLSYENKVFIEQIFRESKKDSYLKTQEKSKYLIV 375 Query: 233 QSLNLLPRMETDITIRSSEKI---LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL 289 + + DI + I D K+ I S+ ++YQ+ YL Sbjct: 376 EGDKNRFLLNPDIVGYQKQTKETFFIADTKWK--ILSKEQQNYGVSQSDMYQIFAYLAKY 433 Query: 290 KPENGENIGGLLIYPHVDTAVKH 312 + G LIYP ++ Sbjct: 434 Q-----CNQGFLIYPKIEDCNDE 451 >UniRef50_D2QXZ3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Pirellula staleyi DSM 6068 RepID=D2QXZ3_9PLAN Length = 435 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 72/361 (19%), Positives = 134/361 (37%), Gaps = 32/361 (8%) Query: 3 QPVIPVRNI--YYMLTYAWGYLQEIKQANLE--AIPGNNLLDILGYVLNKGVLQLSRRGL 58 +P + N+ M+ + G +++ A G NLLD++ + + ++ R GL Sbjct: 73 RPKLAGDNLRLLQMIEFTTGLNTLKHYSSVLKLAAGGGNLLDLIMLMFVEECERILRGGL 132 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 DY E +P ++GR+ F + IR D +D N+++ L + + Sbjct: 133 LSDYVEEEEELPVVRGRMLFDRQIRKRLGRLDLIHCRHDERKQDVPENQLLAYVLDVCAR 192 Query: 119 HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSY--LNGGKNTRYYKFVISVCKFIVNNS 176 H T+R +AR L L G L + + + +Y+ +C IV+ Sbjct: 193 H-AFQPTLRRKARQLEHHLLGSCDPSLLDLVTTRGGIYYDRMNEHYRDAHELCWLIVDAL 251 Query: 177 IPG--QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQS 234 + G R + F + M+ L++ F+ + R+ ++ SY D S I D + Sbjct: 252 GISDIYSSGSSRIFAFLLD---MNRLFEVFVLQVLRQLTSTTALKVSYQSSDRSIIRDSA 308 Query: 235 LNL-LPRMETDITIRSS--EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP 291 N + D I + L++DAKY + ++YQ Y ++ Sbjct: 309 TNQPYSSVIPDFLIAAPSLSGKLVLDAKYKL------YDAAGVSNSDIYQSFFYAYAFGR 362 Query: 292 ENGENIGGLLIYPHVD-----TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 LIYP + R + N + + L +LD + Sbjct: 363 HQLGGHVAGLIYPSESTTASRKELTVRSQFNANAARVVLIGLPIP------AVLDEAKSH 416 Query: 347 L 347 + Sbjct: 417 V 417 >UniRef50_C9ZD40 Putative uncharacterized protein n=1 Tax=Streptomyces scabiei 87.22 RepID=C9ZD40_STRSW Length = 415 Score = 209 bits (531), Expect = 1e-52, Method: Composition-based stats. Identities = 48/347 (13%), Positives = 97/347 (27%), Gaps = 23/347 (6%) Query: 4 PV--IPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 P IP + L YA G P D++ L +L R GL D Sbjct: 70 PKFAIPGEQLMSWLAYALGTPVPATARRWATGPDG-YADLVAAALLDQCERLLREGLRRD 128 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y + P ++GR++ A + + D NR++ S L + Sbjct: 129 YVRRRSVEPVLRGRLDIAAQATRRYGQLDQLHVRTFDREADIPDNRVLGSALKAALGMT- 187 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 ++ + P T + + + Y+ + + ++ Sbjct: 188 VSPDLARALHGAAGAFPHAPTPAAALRALDRTHYTRLNARYRPAHTWARLLLRGGGVTDL 247 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 E M L++ + + Sbjct: 248 LTDQGTTA-EGLLLAMPALWEAVVRRLGTEAVGPHGGHAVPGGSGVGITVHGDRGNASTF 306 Query: 242 ETDITIRSS---------EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 D+ + +L VDAKY R + +++QL+ Y Sbjct: 307 RPDLLLSLPALPGHDTAHRTLLPVDAKYK------RYDHHGVSAADVHQLLTYSSGYASA 360 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVN-LGQEWPCIHQE 338 + ++++P + + G + L T+ LG + ++ Sbjct: 361 DAPT--AVIVHPQTGRHDRRTLHVRGPNGLLGTIAVLGVDTRTTPEQ 405 >UniRef50_Q10YL1 McrBC 5-methylcytosine restriction system component-like n=2 Tax=Oscillatoriales RepID=Q10YL1_TRIEI Length = 405 Score = 207 bits (528), Expect = 3e-52, Method: Composition-based stats. Identities = 57/350 (16%), Positives = 129/350 (36%), Gaps = 32/350 (9%) Query: 3 QPVIPVRNIYYMLTYAWGYLQE-IKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELD 61 +P +P+ N++ ML YA+ + + L +L + +L+ R+G Sbjct: 76 KPKVPLHNLFGMLEYAYNLRSFCFLDGLVNCNSLQEFYNCLVNILAQKILERGRKGFHRA 135 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y P TE + I+GR+ + + K + + N+I+ TL I+ + Sbjct: 136 YLPKTENLTYIRGRLNMRQVMHKPWGVSLK--CDYQEHTANIPDNQILAWTLFIISRSSF 193 Query: 122 LNSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + + + L G ++ + + Y+ + +C+F ++N Sbjct: 194 CSEKVAVTVTRAFHILQGLVTLQPFKSSDCLNIKYHRLNEDYQVLHGLCRFFLDNIGASH 253 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 +G+Y F +M+ LY+KF+ ++ + L+S + K + + Sbjct: 254 QQGNYSMLPFL---IDMAKLYEKFVAKWLKLHLSSNLRVKEQEKVEIV-------DDKIY 303 Query: 241 METDITIR---SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 + D+ I + + + I+D KY + + ++ Q++ Y + Sbjct: 304 CKIDLVIYEIKTCKVVYILDTKYKLDC--------RPSTDDINQVVAYATY-----KKCH 350 Query: 298 GGLLIYP-HVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQE-LLDIFDE 345 +LIYP + + + + T + + Q L ++ Sbjct: 351 EAILIYPQRLTNYINQLVGESQVRLRTLTFAIDSDLEKAGQSFLEELISN 400 >UniRef50_C7PTE7 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Chitinophaga pinensis DSM 2588 RepID=C7PTE7_CHIPD Length = 423 Score = 203 bits (517), Expect = 7e-51, Method: Composition-based stats. Identities = 59/323 (18%), Positives = 125/323 (38%), Gaps = 26/323 (8%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + + + N++LDI + + V L RRGL Y +T + Sbjct: 93 LLQMLQQCSLIKIDHVEKANLNLRSNSILDIYIRLFLEEVEVLLRRGLIKKYKRHTANLT 152 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG+++F K I ++ + + + + L N+II L LI N+++ D+ Sbjct: 153 TLKGKLDFGKHISANLIHKERFFVEHTIYSHENLFNQIINEVLK-LIPLLVSNTSLNDKL 211 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + P + + +T F + + + Y+ + + + ++ N P G Sbjct: 212 GRIRLDFPELPAIKVTAATFDKIQYDRKSSTYQPALEIARLLLLNYRPDITGGTNNVIAI 271 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR-METDITIRS 249 + M+ L++++++ +R L S + ++ L P+ + DI +R Sbjct: 272 LFD---MNELWEEYIFRKLQR-LNSEGIEVKRQQSQ--HFWKRNGALYPKSVRPDIVLRK 325 Query: 250 SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 EK +++D K+ K ++L Q+ Y E +L+YP Sbjct: 326 GEKTIVLDTKWKLI------PDYKPTDEDLKQMFVY-----NLYWECSHSVLLYPADRYH 374 Query: 310 VKHRYKIN-------GFDIGLCT 325 ++ + G + T Sbjct: 375 LETGAYFDFSRQTAAGNSCSVAT 397 >UniRef50_B1BM86 ATP-dependent helicase priA n=8 Tax=Clostridium RepID=B1BM86_CLOPE Length = 513 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 60/348 (17%), Positives = 137/348 (39%), Gaps = 22/348 (6%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML+ + + + +L +IL Y+ +K + + R+G+ +Y E I Sbjct: 177 LLNMLSKCGILKVNYSEISSLKLYKQSLNEILAYLFSKKLQKELRKGVYGEYVYIEENIN 236 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG + + I+ + K F+ + D N+I+ + ++K+ K T+R Sbjct: 237 SLKGSLRVQEQIKNMASHSSKAFCRFEEFSRDNKLNKILSFFVKEVMKNVKNRETLR-LL 295 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 R L + ++T + + + + ++ ++ K IV N G + Y Sbjct: 296 RISEMILGDVDERNVTLNEVNNFSFNRLNKPFEDAFTLGKMIVLGESALGNLGGNKAYSI 355 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 +M+ +++ ++ + ++ L + K+ I ++S + ++ DI I + Sbjct: 356 LF---KMNEIFEIYIGKLLKQLLYKETVHMQHSKYKL-LIKEESNRGVFKLIPDIVIEKN 411 Query: 251 E-KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 + +I+D K+ + ++LYQ+ YL K +L+YP+ + Sbjct: 412 GIERIIIDTKWKSV--ESKFNRHGVKREDLYQMYAYLTRYK----NVSTVILLYPYNERI 465 Query: 310 VKHRY---------KINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 + + + VNL E + L I +Y++ Sbjct: 466 EGEEGEYLESWYLDEEEHKRVRVYAVNLENEKETLKS-LDKIVRKYVE 512 >UniRef50_D2AT08 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2AT08_STRRD Length = 406 Score = 202 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 62/352 (17%), Positives = 125/352 (35%), Gaps = 26/352 (7%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P PV ++++L YA + +Q ++A LL L + + R+G+ Y Sbjct: 62 PKTPVDRVFFLLGYARRP-RGWRQGEVDAGDHPELLPALAHAYALAADRALRQGVLQGYL 120 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 E +P ++GRI A +R + +D D NR++ + A L++ L Sbjct: 121 EMEEALPVVRGRIREADQLRRRYGLPLPVEVRYDDYTVDIAENRLLLAASARLLRLPGLA 180 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 R R + +L G++ L + Y + + + ++ + + G Sbjct: 181 VQTRRTLRHVIARLAGVTALVPGRP-LPVWRPSRINTRYHTALGLAELVLRGASYELDDG 239 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + EM +++ F+ L ++ Sbjct: 240 T--AVRVDGLLVEMWRVFEDFVTVALTEALRPHGGRSELQDKRHHL----DHGRRVLLKP 293 Query: 244 DI---TIRSSEKIL---IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 D+ I S + +VD+KY S HS +LYQ++ Y L ++G Sbjct: 294 DLVRYVIDSGGTEIPAAVVDSKYKISTGPEG------HSADLYQMLAYCTVLGLDHGH-- 345 Query: 298 GGLLIYPHVD-TAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 L+Y D +H + G +I ++L + + + + ++ Sbjct: 346 ---LVYAEGDAEPYRHVVRGAGIEIMQHAIDLTLPPADLLAAIERLAESIVR 394 >UniRef50_A4J0H3 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Desulfotomaculum reducens MI-1 RepID=A4J0H3_DESRM Length = 334 Score = 202 bits (514), Expect = 1e-50, Method: Composition-based stats. Identities = 57/343 (16%), Positives = 120/343 (34%), Gaps = 30/343 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQE-IKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + + NI+ ML YA+ I +E + L +L +L +R+GL +Y Sbjct: 11 PRVKLSNIFTMLEYAYRLKSFRILDGMVECDSLQEFYERLAMILAGMILNRNRQGLYREY 70 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P I+G++ + + D N+I+ TL ++ Sbjct: 71 REQVDKLPYIRGQLNIRHQLVK--PWEVGFSCHYQEHTADIEDNQILTWTLNRILYSGLC 128 Query: 123 NSTIRDEARSLYR-KLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + + YR L S + L P FS + + Y+ + ++C+F + PG Sbjct: 129 SDRGLPVIKKAYRSLLSQTSLIPLDPGRFSSRVYSRLNQDYRPLHALCRFFLEQCGPGYE 188 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 G + F + M L++ F+ ++ R L + + + + ++ Sbjct: 189 VGDHSMIPFLVD---MPRLFELFVAQWLRTYLPPEYEITPQERVEIGENGELTFSI---- 241 Query: 242 ETDITIRSSEKI---LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 D+ + ++D KY + ++ Q++ Y + Sbjct: 242 --DMVLYRKRDETAMCVMDTKYKSAATP--------TQADINQVVTYAVA-----KGCRD 286 Query: 299 GLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLD 341 +LIYP + + I + L + L++ Sbjct: 287 AVLIYPSSNIRP-FKETIGDITVKTLAFPLAGNLEEAGKRLVE 328 >UniRef50_C5DAA2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Geobacillus sp. WCH70 RepID=C5DAA2_GEOSW Length = 411 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 48/339 (14%), Positives = 134/339 (39%), Gaps = 13/339 (3%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + + A + A ++L++++ + K V + +G+ +Y + + Sbjct: 74 VVDMLLFCEDLPLSYEHATMAAYDSHSLMEMIARLFVKEVEMILNKGIVKEYIVEEDNLT 133 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++GR++ + +R + K +D L+ + L N++I+ L ++KH L + Sbjct: 134 CLRGRVDIRQHLRTNFMTPTKVYCRYDELDTNILENQVIRMALE-VVKHFSLTKQTMRQI 192 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 L + I+ + + + + + ++Y+ + +I Q +++ Sbjct: 193 NRLADEFMMIADPYFSY-EWPNFSYHRLNQHYEKAHKLAYYIWKQIYVNQL-YQFQYRSH 250 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 +M+ L++KF+ + ++ L A + ++ + + + D+ + Sbjct: 251 YSYLIDMNELFEKFVAKLLKKYLPGAAKVHAQRRFKKAITKNGDGYHDIIL--DLLVEFP 308 Query: 251 EKI-LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 +K +++D KY + K + ++YQL Y ++ + ++++P Sbjct: 309 DKDPIVLDTKYK------QYSKYKVENADIYQLAFYAQ-FVTKSSNHYKAIIVHPEYAGE 361 Query: 310 VKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 I+ L + I + L + + ++ Sbjct: 362 DACEEVIDLLPGTFHQGKLFVKPVSIEKVLAAVKRKDIE 400 >UniRef50_B7D079 5-methylcytosine-specific restriction enzyme subunit McrC n=1 Tax=Burkholderia pseudomallei 576 RepID=B7D079_BURPS Length = 294 Score = 200 bits (508), Expect = 8e-50, Method: Composition-based stats. Identities = 63/270 (23%), Positives = 118/270 (43%), Gaps = 9/270 (3%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDYN 63 IPVRN++ ++ YA + + N ++ D++ +L V Q RR + Y Sbjct: 18 RIPVRNLWLLMLYASDLTRIKEVFNALVEDDLEDIPDLVAKLLAHTVEQRLRRNVTRGYQ 77 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 + + ++GRI+ +T L+ G+ F+ L +T NR++++ L L+ + Sbjct: 78 HRAQSLTRVRGRIDILRTEAQQLLSRGEVYCRFEELTANTPRNRLVRAALD-LLASLVRD 136 Query: 124 STIRDEARSLYRKL--PGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + + RSL L GI + + + G+N + + K + ++P + Sbjct: 137 RDLARQCRSLAAALGRSGIVGVRPSRAELAQDQIGRNDHDDWLMAELAKLAFDLALPTEE 196 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---TRSYLKWDASSISDQSLNLL 238 G ER + + L++K + F R EL + + W S+ SD + +L Sbjct: 197 AGPTTLVSPERGDVYVRRLFEKAVLGFARVELERIGWRVRGGTCMNWQVSAASDGAAEIL 256 Query: 239 PRMETDITIR--SSEKILIVDAKYYKSIFS 266 P M TDI I S+ + L++D K+ S Sbjct: 257 PGMITDIIIDDLSAGRRLVIDTKFTLSTSD 286 >UniRef50_A2TPW2 Putative uncharacterized protein n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TPW2_9FLAO Length = 448 Score = 197 bits (502), Expect = 4e-49, Method: Composition-based stats. Identities = 56/308 (18%), Positives = 113/308 (36%), Gaps = 20/308 (6%) Query: 10 NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 NI++ L+Y L+ + N+ +IL Y+ +K +L + Y Sbjct: 105 NIFWWLSYC-RKLRFPNYKTGLSGEKNDFFEILIYLFSKYTKELLNNAMYQRYVEIHREE 163 Query: 70 PGIKGRIEFAKTIRGFHLNH--GKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 +KGRI+F + K T+D D NR IK +L K + + Sbjct: 164 QFVKGRIDFTRYTNENLSRANFHKISCTYDSFEMDNQFNRCIKYVATLLFSVTK-DRQSK 222 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + R + L +S ++ + + ++ + C + N + K + Sbjct: 223 NNLREILFILDEVSDETMSASACRNIQFNPMFKEFETIRDYCVLFLENCVSYNYKDALQL 282 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI 247 + F M +++ F++ F +EL + + +++ D+ + Sbjct: 283 FAFL---IPMEYIFEDFIFGFIDKELHEVTAKAQSGQISL------DQSKNFKLKPDLIL 333 Query: 248 RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 + K +I D KY S +LYQ++ Y L+ ++ +L+YP D Sbjct: 334 EVNGKRIIADTKYKMLNLSGNDPKNGISQNDLYQMVAYAIRLQCDHI-----ILLYP--D 386 Query: 308 TAVKHRYK 315 + Y+ Sbjct: 387 HILNPGYR 394 >UniRef50_D2B1B6 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptosporangium roseum DSM 43021 RepID=D2B1B6_STRRD Length = 431 Score = 196 bits (499), Expect = 9e-49, Method: Composition-based stats. Identities = 61/322 (18%), Positives = 105/322 (32%), Gaps = 23/322 (7%) Query: 3 QPVIPVRNI--YYMLTYAWGY--LQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 +P + R + ML YA G L+ + + G++L D++ +L L R GL Sbjct: 67 RPKLVGRELAVLRMLDYASGLPALRHMDRLRNLPNQGHDLRDLICLLLTVECEALVRHGL 126 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 DY E +P I+GR+ + + + FD + D L NR+ + L + Sbjct: 127 RRDYIRRQETLPAIRGRLLADQQVLRRFGRLDRLECRFDEFDSDILDNRLCAAAL-RVAA 185 Query: 119 HEKLNSTIRDEARSLYRKLPGISTLHLTPQHF--SYLNGGKNTRYYKFVISVCKFIVNNS 176 H + +R AR + + T + +L + +Y+ ++ Sbjct: 186 HSARDEALRARARRVATDFSEVCTTDGLDVRWVAQHLTYHRPNEHYRQAHRWALLLLQAP 245 Query: 177 IPGQ--NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQS 234 + G F + M+ L++ F + R + + SIS Sbjct: 246 GFTDLLSTGGPSSRTFMLD---MNSLFEAFATQLLREATHRTGIAVRAQESLSRSISRPD 302 Query: 235 LNLLPRMETDITIRSSEK----ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK 290 + DI + VD KY +LYQ Y L Sbjct: 303 GRSYTSITPDIQLVHGHGPGAWRRSVDVKYKL------YADRTIKPSDLYQSFAYGQVLS 356 Query: 291 PENGENIGGLLIYPHVDTAVKH 312 E +L D H Sbjct: 357 SEETPTAY-ILFASDRDGEPDH 377 >UniRef50_C0GTU8 Putative uncharacterized protein n=1 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GTU8_9DELT Length = 424 Score = 195 bits (497), Expect = 1e-48, Method: Composition-based stats. Identities = 57/278 (20%), Positives = 105/278 (37%), Gaps = 19/278 (6%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFD 97 LLDI V QL+ GL Y N + + +KGR+ F K I + + + Sbjct: 113 LLDIFFRSFLSEVEQLAHHGLVRKYRKNQDNLTTLKGRLLFQKQITLNLVRRERFYTEHV 172 Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK 157 + N+I+ + L ILI N + +AR+L I ++ FS LN + Sbjct: 173 HYERNNPFNQILGTALDILI-LTSSNPHLSAQARNLALSFEDIDRINAAEVTFSRLNYTR 231 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 NT Y+ I + + I+ N P G + M+ L+++++Y +R + Sbjct: 232 NTERYRRAIQLARLIILNYCPDVRSGGEDVLAILFD---MNNLFERYVYAQLKRAEAMNS 288 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIR----SSEKILIVDAKYYKSIFSRRMGTEK 273 + ++ + + DI +K +++D K+ + T Sbjct: 289 EQNVSFRAQVQQPFWRTERIRKHIRPDIIAEIGQGYDQKRVVIDTKWKIPRDGKPADT-- 346 Query: 274 FHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVK 311 +L+Q+ Y + LL+YP + Sbjct: 347 ----DLHQMYAYNVHFGAKQS-----LLLYPRTSSTCD 375 >UniRef50_A8EUN6 McrBC catalytic subunit McrC, putative n=2 Tax=Campylobacterales RepID=A8EUN6_ARCB4 Length = 387 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 57/341 (16%), Positives = 126/341 (36%), Gaps = 40/341 (11%) Query: 10 NIY-YMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 NI+ YML YA+ +Q A + +L++ + G+LQ ++GL +Y + Sbjct: 76 NIFIYMLMYAYDVKLSNEQIASCANQKHTILEVFIQMFANGLLQELKKGLYKEYLTKQDN 135 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD 128 +P +KG+ + ++ K +D +E+ N+ T+ L K K + Sbjct: 136 LPVLKGKYLINENLKYNF-TKNKIYCEYDEFSENNSLNQFFLYTVKYLQKFVKD----KK 190 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 + + + +N + +K + ++ SI ++ F Sbjct: 191 LLKQCELIFDEVEYKQIDINRLETINFDRLNLRFKTSFEIAILLLKQSILLFSQDKKSF- 249 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 F + M++L++KF+ + L + ++ + ++ DI + Sbjct: 250 AFLFD---MNVLFEKFIARMVKE-LDNNAKIQNQDNFG-----------NLTLKPDIILE 294 Query: 249 SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT 308 + I+D KY K E + Q +Y + K +N +L+YP Sbjct: 295 NQ----IIDTKYKKI-----KSIEDIKQSDKLQAFSYGINYKVDN-----VMLLYPKHLD 340 Query: 309 AVKHRYKINGF----DIGLCTVNLGQEWPCIHQELLDIFDE 345 +K+ + + + T++L + + +I + Sbjct: 341 NIKYDLVLGKDDKKVKLKIRTIDLNFSGNNYKEYIDEIMER 381 >UniRef50_Q26DA3 Putative uncharacterized protein n=2 Tax=Flavobacteria RepID=Q26DA3_9BACT Length = 416 Score = 195 bits (496), Expect = 2e-48, Method: Composition-based stats. Identities = 54/294 (18%), Positives = 117/294 (39%), Gaps = 22/294 (7%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + A NLL++ + + + L R+GL Y T+ Sbjct: 84 LLKMLQACGKLKADSSGAANVKRQHLNLLEVYFELYLQELESLVRKGLIKQYRKQTKNTK 143 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG++EFA I+ ++ + +T + + + ++++ LAI+ + S ++D A Sbjct: 144 ALKGKLEFAGHIKSNIVHKERFYTTHQVYDSNHFLHQVLSKALAIVGQFTN-GSRLQDLA 202 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + P + +T +H + ++ + T YK + + + I+ N P + G + Sbjct: 203 SRVQLNFPEVDNKAITAKHLNEMSLNRKTLSYKNALELARLIILNYSPDISSGKEKMLSL 262 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 + M+ L++ ++ + ++ + + S +S + DI +R S Sbjct: 263 LFD---MNELWETYILKQLQKA-------SIGFEIEVSGQESKSFWANNSLRPDIVLRKS 312 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 K I+D K+ + +L Q+ Y E +L+YP Sbjct: 313 GKTYIIDTKWKRP------NKSTASVNDLRQMYTYCRFWDAE-----KAMLLYP 355 >UniRef50_Q466P1 Putative uncharacterized protein n=2 Tax=Methanosarcina RepID=Q466P1_METBF Length = 453 Score = 194 bits (494), Expect = 3e-48, Method: Composition-based stats. Identities = 54/330 (16%), Positives = 119/330 (36%), Gaps = 24/330 (7%) Query: 9 RNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI 68 R+++Y L+Y ++ L ++L Y+ + + ++ Y E Sbjct: 111 RHLFYWLSYC-KKVKFPFNQAFLDKFELELPELLIYLFARQIHEVISTRPFSAYEEVQEA 169 Query: 69 IPGIKGRIEFAKTI-RGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 + +GRI F + + R + N ++ D L NRIIK +L+ + T R Sbjct: 170 LFTPRGRINFDRYVTRISYGNCHLIDCDYEPFVFDNLLNRIIKYCTRLLLSKASIIETQR 229 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + L + Q L Y+ +I +C I+ N + + Sbjct: 230 -ILNEIIFMLEDVDDQVCFAQQLQTLRIPSIYSDYEEIIQICGMILENQAYSCAEYEMKN 288 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI 247 + M +++ F+ + ++ + S + + ++ DI + Sbjct: 289 WSLLL---PMEYIFEDFIAGYVQKYFSGTFKVEPQ----KSDLYLHTNPNTFNLQHDILL 341 Query: 248 --RSSEKILIVDAKYYKSI-FSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 + + + +I+D KY + + ++YQ+++Y + LLIYP Sbjct: 342 TNKKTGEQIIIDTKYKPRWNLEKSDSKKGIAQSDMYQMISYAY-----RRGTNKVLLIYP 396 Query: 305 HVDTAV--KHRYKIN----GFDIGLCTVNL 328 + + H + IN I + +++ Sbjct: 397 NTSNELAEDHTFLINKGTKDETINIKAIDV 426 >UniRef50_C6NTT9 Putative uncharacterized protein n=1 Tax=Acidithiobacillus caldus ATCC 51756 RepID=C6NTT9_9GAMM Length = 441 Score = 193 bits (490), Expect = 8e-48, Method: Composition-based stats. Identities = 53/339 (15%), Positives = 105/339 (30%), Gaps = 24/339 (7%) Query: 14 MLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIK 73 ML A Q + + D L + + RR + Y ++ +P + Sbjct: 94 MLATAL-SAQSMTAGLASISADGSRHDALMEMFCDELQLARRRQVIRRYASTSDSLPSPR 152 Query: 74 GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSL 133 GRI F G+ S + L ED NRI K L + ++ IR Sbjct: 153 GRISFPGQCYESIRRPGRFASAWVALTEDVPENRIFKEVLLRY--RPRCSARIRGRIDLC 210 Query: 134 YRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERN 193 +L + ++ + + Y ++ K +++ G G Sbjct: 211 LSELDSVDASGDHRLEWAKVRADRLPPIYHSLLRQSKALLDEEGAGVFAGDKLATA---E 267 Query: 194 EKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI--RSSE 251 S L+++F+ + +A + S + + D+ + + Sbjct: 268 IVFTSRLFEQFVAKELSWISPAAGLVSKAQDRGTFTCSRGDGKGVFELIPDVRLIDDRGK 327 Query: 252 KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP------- 304 LIVD K+ +R +++YQ++ Y +L+YP Sbjct: 328 TALIVDTKWKSLDMRKRH--LGISREDIYQVLTYGSRFN-----CADVVLLYPDVTNETG 380 Query: 305 HVDTAVKHRYKINGFDIGLCT--VNLGQEWPCIHQELLD 341 K + + + L + ++LL Sbjct: 381 KTGYYQKFESILGARKYSVHVLKIPLLAPTLMVARDLLR 419 >UniRef50_B5HWV8 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HWV8_9ACTO Length = 434 Score = 190 bits (484), Expect = 4e-47, Method: Composition-based stats. Identities = 54/312 (17%), Positives = 108/312 (34%), Gaps = 21/312 (6%) Query: 11 IYYMLTY-AWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 + ML Y A + +L D++ +L + +L RG+ DY + + Sbjct: 85 VLRMLEYTAGRGFPPLDATRTVREGAPHLRDLVALLLTEECERLLSRGVRQDYVTTEDDL 144 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 P ++GRI ++ + + + FD + D + NR+ + + L + +R Sbjct: 145 PAVRGRILPSRQLLRHYGRLDRLACRFDEHDTDIVDNRLCAAAVD-LAARTARSPAVRAR 203 Query: 130 ARSLYRKLPGISTLHLT--PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN--KGHY 185 AR ++ L + L+ ++ +Y+ +++ G Sbjct: 204 ARRAATSFARVAPTRLGDLRTALAGLDYHRHNTHYRSAHRWAALLLSGGGIADLLAPGPL 263 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNL-LPRMETD 244 F + M++L++ FL R T T + D+ + D Sbjct: 264 ASRAFLVD---MNVLFEVFLTHLLREAATGTGLTVRDQTRHRGVLYDERTERPYGEVRPD 320 Query: 245 IT----IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL-KPENGENIGG 299 + + VD KY + K +LYQ Y +L + G Sbjct: 321 VLVTGTLDGEPLRRPVDLKYKL------YDSRKLSPSDLYQAFLYAHALARQPAGGPPTC 374 Query: 300 LLIYPHVDTAVK 311 +LI+P +A + Sbjct: 375 VLIHPGSGSATR 386 >UniRef50_A4TAT5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4TAT5_MYCGI Length = 446 Score = 189 bits (480), Expect = 1e-46, Method: Composition-based stats. Identities = 64/343 (18%), Positives = 125/343 (36%), Gaps = 26/343 (7%) Query: 10 NIYYMLTYA--WGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTE 67 + M+ Y+ L + A+ G++L +L VL L R GL DY P + Sbjct: 80 RVLQMIEYSEGVRLLAHLPPDQQLAVSGDDLFQLLVRVLVGESKLLIRDGLLRDYRPTED 139 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 + ++GR+ + + + FD + D N+++ ++L H S +R Sbjct: 140 TLAVMRGRLRMRDQFLKRYGSLHRLECNFDEYDGDIAENQLLAASLTAAASH-VRASALR 198 Query: 128 DEARSLYRKLPGISTLHLTPQHF--SYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 +E R L + I + ++ G+ Y+ ++ Sbjct: 199 NETRMLAGVIGDICQPPTFDPDWYEQRIHYGRRNSRYEGAHKAALLVLRGLALNDLHSAS 258 Query: 186 R--FYDFERNEKEMSLLYQKFLYEFCRRELTSAN-TTRSYLKWDASSISDQSLNLLPRME 242 R F N M++++++F+ + L+ + L A + + + + Sbjct: 259 RQGVNAFMVN---MNVIFERFVSALVDQALSGTGLRSTPQLSIRAIVVDESTNRTYSNIR 315 Query: 243 TDITIR--SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 D+ I +S + + VD KY T KF S ++YQL Y ++L E + G+ Sbjct: 316 PDLVITEVNSARSVPVDIKYKL------YDTVKFSSADVYQLFTYAYALGAGAEEKMAGV 369 Query: 301 LIYPHVDTAVKHRYKINGF------DIGLCTVNLGQEWPCIHQ 337 IY T +I G + +++ I Sbjct: 370 -IYASTTTTSGPALRIKGNTGIAAARLRGAGLDVAAALDNIAS 411 >UniRef50_A1ZU11 Putative uncharacterized protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZU11_9SPHI Length = 438 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 56/365 (15%), Positives = 128/365 (35%), Gaps = 36/365 (9%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P + ++ Y L+Y+ A P L +I ++ QL Y Sbjct: 88 PKVCFDHVLYYLSYSQRVRFPFALARTHTSPSLFLPEICIFLFASYAEQLLIEQPLHLYQ 147 Query: 64 PNTEIIPGIKGRIEFAKTIRGFH--LNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 TE + +KG+++ + ++ N K S L + N+++K T L+ + Sbjct: 148 ERTEELDFLKGQLDIDQYLKENIATGNWQKLHSRHTPLLYNNRFNQLVKYTARQLLLMTQ 207 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 ++ + + + L +S + +T + + + + V+ +C + N + Sbjct: 208 YAPSL-EHLQHMIALLQNVSDVPMTYKDCMKIRLPEQQIALQTVVDMCAMFLGNEMINYE 266 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 G + F M L+Y+ F+ +F + + L+ + + + + Sbjct: 267 VGQKYNFAFLL---PMELVYEDFIGQFVQTHFAQW---QPRLQPKKYLGRNPTGKPVFSV 320 Query: 242 ETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 + D+ + S + +I D KY R ++YQ++ Y + +L Sbjct: 321 QPDMLLGSPQ--VIADTKYKIREVPLRHSQTAIEESDIYQMIAYALGYR-----CSEMVL 373 Query: 302 IYPHVDTAVK-----HRYKINGF------DIGLCTVNLGQEWPC---------IHQELLD 341 +YP K + I I ++++ + ++L Sbjct: 374 LYPASYRQPKTLSFSESFNIQSDLLTTPLRIRAESLDITTTAQTKFAEALEQKLKKQLQR 433 Query: 342 IFDEY 346 IF+ + Sbjct: 434 IFESH 438 >UniRef50_D1SMG2 McrBC catalytic subunit McrC, putative n=1 Tax=Methanocaldococcus sp. FS406-22 RepID=D1SMG2_9EURY Length = 445 Score = 187 bits (474), Expect = 6e-46, Method: Composition-based stats. Identities = 65/350 (18%), Positives = 124/350 (35%), Gaps = 27/350 (7%) Query: 13 YMLTYAWGY-LQEIKQANLEAIPGNN-LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 M+ +A+ ++E + A ++ I + +I Y+ +L +RG Y E Sbjct: 102 KMMNFAYDLNIKEQELAKVKDIASTPVIYEIFIYLFAYSLLNEIKRGFYKSYIKVREEKK 161 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG++ K IR K + E+ L N+I T I +K K + Sbjct: 162 FLKGKLLIDKQIRKLPHQRHKFSIEYHEFTENNLLNQIFYYTTYISLKKTKWREN-KKLL 220 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 L GI+ +T F ++ + +K ++ K I++ G D Sbjct: 221 SELMLIFEGINLRKITIHDFKRVHFTRLNERFKKPFNLAKIILSAF------GEIDGEDA 274 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 +M+ L++KF+ + L + + + D + Sbjct: 275 IGFFVDMNDLFEKFICSILSKSL----GFEIKYQSKFKLFKEVKGIKNIEQKPDYVVYKD 330 Query: 251 EK-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 K +L++DAKY + R K S L Q+ Y + I +LI+P Sbjct: 331 NKPVLVLDAKYTEIN--REYEKPKLPSDMLRQIYTYAKYYTLKCNYKIRSVLIFPKSKKY 388 Query: 310 VKHR---------YKINGFDIGLCTVNLGQ--EWPCIHQELLDIFDEYLK 348 + N ++ + T NL + E I +E ++ + + Sbjct: 389 NDFNSKAIIGEATFFDNEINLYVLTYNLKKLIEGDGIDEEFINCIKKLTE 438 >UniRef50_A6EMU6 5-Methylcytosine-specific restriction enzyme C n=1 Tax=unidentified eubacterium SCB49 RepID=A6EMU6_9BACT Length = 414 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 54/299 (18%), Positives = 113/299 (37%), Gaps = 22/299 (7%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML + +LLDI + V +L RRGL Y ++ + Sbjct: 90 LIEMLKQTKKLKVQQVGNAQVNKQSIHLLDIYFDWFLREVQELCRRGLIKKYYKESKNVK 149 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG++EFA+ + ++ + ++ + N+D ++II L ++ + + + Sbjct: 150 SLKGKLEFAQHLNKNLIHKERFYTSHQIYNKDHKLHQIINQALEVIE-LVSKGTYLYSKC 208 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + + P +ST+ FS LN + YK I + + I+ N P + G Sbjct: 209 KEVRLNFPEVSTIKCNESTFSKLNFNRKNSPYKTTIEIARLIILNFAPNVSTGSENMLAL 268 Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS 250 + M+ L+++++ + L + + + + DI + Sbjct: 269 LFD---MNNLWEEYVLLKLKEATRD-------LDIEVHGQNRKPFWNGITIRPDIVVSQG 318 Query: 251 EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA 309 + I+D K+ K ++ +L Q+ Y + +N LL+YP Sbjct: 319 DTTCIIDTKWK------NNRDNKPNTNDLRQMYVYNEYWQGKN-----ALLLYPSASKD 366 >UniRef50_Q4C3F9 Similar to McrBC 5-methylcytosine restriction system component n=4 Tax=Chroococcales RepID=Q4C3F9_CROWT Length = 294 Score = 185 bits (470), Expect = 2e-45, Method: Composition-based stats. Identities = 52/269 (19%), Positives = 108/269 (40%), Gaps = 20/269 (7%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 L + L + +L ++ L Y ++ + +KG+I+ I+ + Sbjct: 6 TLYNRLATIFAHRILNRIQKELYSTYIKQSQELNYVKGKIDIKTMIKH--PWKPTLTCQY 63 Query: 97 DMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNG 155 D +D N+I+ T+ I+ + + + R R +Y L G ++ +T Sbjct: 64 DNFTQDIEDNQILLWTIYIISRQQICQANTRILIRKVYHALQGYVTLSPVTANDCINRKC 123 Query: 156 GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTS 215 + Y + S+C+F + N+IP +KG+Y F + M+ LY+KF+ + + L Sbjct: 124 NRLNEDYHGLHSLCRFFLENTIPSHDKGNYNSLPFLVD---MNQLYEKFVAAWLIQHLPP 180 Query: 216 ANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKI---LIVDAKYYKSIFSRRMGTE 272 ++ + + S + D+ I + E ++D KY I + E Sbjct: 181 HLGIKTQHRVEYDS---------FSFKIDLIIYNKETQENLYVLDTKYKTKI--KTSDIE 229 Query: 273 KFHSQNLYQLMNYLWSLKPENGENIGGLL 301 + + Q N+ ++P N + I + Sbjct: 230 QIIAYTFQQNCNHAIIIEPTNNKPINAKI 258 >UniRef50_Q188G2 Putative uncharacterized protein n=9 Tax=Clostridium difficile RepID=Q188G2_CLOD6 Length = 422 Score = 180 bits (456), Expect = 8e-44, Method: Composition-based stats. Identities = 49/345 (14%), Positives = 129/345 (37%), Gaps = 22/345 (6%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + ML+ + + ++ NLL+ + + ++G+ +Y E + Sbjct: 89 LLQMLSICNKIPITMNEKIRLSLKNYNLLNFFVMYFIESMQTQMKKGIYFEYINKIENLN 148 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKST-LAILIKHEKLNSTIRDE 129 ++G+I + + ++ K +D +E+ N+++K ++IL + +++I+ + Sbjct: 149 VMRGKILLSTYAKEKGISPMKIRCEYDEYSENNFLNQVLKKACISILCR--INDNSIQGK 206 Query: 130 ARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYD 189 + + + +H+ + KN +K + + ++ N ++ + + Sbjct: 207 IKKILSYFQNVDLIHIDRKKLLDYKFYKNNDRFKDCYLLARLLLLNLSMDNSQNNQEAFS 266 Query: 190 FERNEKEMSLLYQKFLYEFCRRELTSA-NTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 + LY++++ + ++ T K ++Q+ + DI + Sbjct: 267 ILFEI---NTLYEEYIGILIKSIWDNSFRETYIQDKSKFLLKNEQTGKKNFNLRPDIVLY 323 Query: 249 --SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 +E +I+D K+ + S ++YQ+ Y+ + + +L+YP + Sbjct: 324 DLKNEYEIIIDTKWKAIEV---DSNVFYRSSDIYQMYAYITAYENAK----RCILLYPCI 376 Query: 307 DTAVKHRY-----KINGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 + G I TV L + +L I Y Sbjct: 377 QKDKNYSSWKLSESFKGKFIEAKTVRLD-DIKNTKNDLKKIIFNY 420 >UniRef50_Q97QG4 Conserved domain protein n=27 Tax=Streptococcus RepID=Q97QG4_STRPN Length = 442 Score = 178 bits (451), Expect = 3e-43, Method: Composition-based stats. Identities = 46/314 (14%), Positives = 111/314 (35%), Gaps = 32/314 (10%) Query: 13 YMLTYAWGYLQEIK--QANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + L Y + I ++ L +L Y+ K + R+GL +Y+ + Sbjct: 100 HFLHYLLNKVLHINLTSLDVALSREERLYQLLVYLFPKYLQAAIRKGLYKEYHRFSHNDS 159 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 +KG I+ ++ G D ++++ T+ + + + + D Sbjct: 160 HVKGVIDVRNHLKKNLPFTGNIAYATREFTYDNPLMQLVRHTIEYIKNQKSIGQGVLDNL 219 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNT----------RYYKFVISVCKFIVNNSIPGQ 180 + + I + + + + Y+ + +C I+N G Sbjct: 220 STSRENVSEIVRVTPSYKLADRAKIIRGNQSKPIRHAYFHEYRNLQELCLMILNQEKHGL 279 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 + Y + ++ L+++++Y + S+ + + Sbjct: 280 GYQDQKIYGILFD---VAWLWEEYVYTLLPKGFVHPRNKDKTDGISVFSVGKR------K 330 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 + D E+ +++DAKY K + + + ++L+QL++Y + LK E Sbjct: 331 VYPDF--YDRERKIVLDAKYKKLELT----EKGINREDLFQLISYSYILKAEKAG----- 379 Query: 301 LIYPHVDTAVKHRY 314 LI+P ++ +V Sbjct: 380 LIFPSMEQSVNSEI 393 >UniRef50_C5A3Z2 McrBC 5-methylcytosine restriction system component n=1 Tax=Thermococcus gammatolerans EJ3 RepID=C5A3Z2_THEGJ Length = 458 Score = 177 bits (450), Expect = 4e-43, Method: Composition-based stats. Identities = 54/309 (17%), Positives = 106/309 (34%), Gaps = 26/309 (8%) Query: 7 PVRNIYYMLTYAWGYLQEIKQANLEAIPG--NNLLDILGYVLNKGVLQLSRRGLELDYNP 64 P+ ML A+G + NL ++ Y+ K + +RG +Y Sbjct: 112 PILAFIRMLDMAYGLKIKDHDLAYLQGRNLRPNLYEVFIYLFAKSLWSEVQRGYHREYVE 171 Query: 65 NTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNS 124 ++G++ ++ IR L ED L NRI +++ ++ Sbjct: 172 VHREEKFLRGKLLMSRQIRKLPHQLNTFSVEVHELIEDNLLNRIFYASVREALRRTTWGL 231 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 R L GI+ +HL +HF ++ + ++ + K + +P KG Sbjct: 232 N-RKLLGELMLAFDGITPIHLRTEHFERVHFTRLNERFRRPFELAKLLF---MPASGKGR 287 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 R +M+ L+++F+ R L + + S + D Sbjct: 288 SREVS--GFFVDMNKLFERFIERVLVRNLPPEYKLFYQESYPFLKNQNGSSQ-----KPD 340 Query: 245 ITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHSQN-LYQLMNYLWSLKPENGENI----G 298 +R ++++DAKY + E+ S + L QL Y + Sbjct: 341 YVVRKGNTPVVVLDAKYREL-------KERIPSSDMLRQLYVYSRIWGYKTSHENDSKPP 393 Query: 299 GLLIYPHVD 307 +++ P Sbjct: 394 AVIVIPSSS 402 >UniRef50_Q5JJA9 Putative 5-methylcytosine restriction system, catalytic subunit n=1 Tax=Thermococcus kodakarensis RepID=Q5JJA9_PYRKO Length = 467 Score = 176 bits (447), Expect = 8e-43, Method: Composition-based stats. Identities = 51/294 (17%), Positives = 106/294 (36%), Gaps = 16/294 (5%) Query: 15 LTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKG 74 L Y G + + L + +DIL + + +L R G+ ++ E ++G Sbjct: 104 LYYNLGLREYDIKTALATEGNSPFIDILLDIFSNRLLNELRFGIYGEFVSTEETSSSLRG 163 Query: 75 RIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLY 134 ++ + I K + D NR+ K TL + ++H + + Sbjct: 164 QLLVEREILKLPTQKHKFDIRYKKFTVDNFLNRVFKYTLYLGLQHTNR-RETKRTLSEAW 222 Query: 135 RKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNE 194 L +S ++ L ++ + + K I++ + F Sbjct: 223 DMLKEVSLTPISVDSIEKLTLNSLNLRFELPLKLAKIIISGLDYQKGLISPGFI------ 276 Query: 195 KEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRS--SEK 252 M ++ F+Y+ R L R + + + + + DI I S + Sbjct: 277 IPMPDAFEFFVYKLLRAILGKDYRVRYHPQNREFVLETPRKFIENPPQPDIIIESSSGQP 336 Query: 253 ILIVDAKYYKSIFSRRMGTEKF--HSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 +++VDAKY + +++ +S +LYQ+ +Y G+L+YP Sbjct: 337 LVVVDAKYKTLYCPKCGEKQRYVKNSSDLYQIYSYTKLYNA-----HAGVLVYP 385 >UniRef50_UPI000185C9B8 conserved hypothetical protein n=1 Tax=Capnocytophaga sputigena ATCC 33612 RepID=UPI000185C9B8 Length = 411 Score = 176 bits (446), Expect = 1e-42, Method: Composition-based stats. Identities = 45/310 (14%), Positives = 89/310 (28%), Gaps = 22/310 (7%) Query: 21 YLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAK 80 I + D Y+ + + +GL Y +KG I+ + Sbjct: 70 LATNIFNFEQSPNNEETIWDFWLYLFPYCLKKAYAQGLYKAYQRKQYNDANVKGSIDVKR 129 Query: 81 TIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGI 140 + GK T + D +++I+ T+ L H N+ + +A + Sbjct: 130 HLLKNLPFAGKIAYTTKEHSYDNPLSQLIRHTIEYLRTHPIGNALLNTDAEMRTMVSQFV 189 Query: 141 STLHLTPQHFSYLN---------GGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFE 191 T + Y + +C I+N+ + + Y Sbjct: 190 FHTQNTYNKNARRKVIMANAKPFVHPYFTEYAPLQKICLNILNHEKLTFGEEKDKIYGLL 249 Query: 192 RNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSE 251 + L++++L F + + + Q + D +S Sbjct: 250 FDGAW---LWEEYLNTFLDEDFKHPENLKGNGREYLFKKGKQP------IYPDFISKSGS 300 Query: 252 KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVK 311 L+ DAKY + G + + ++Y Y E G L+YP Sbjct: 301 PKLVGDAKYIPLDKHKSYGEDSERAISIY----YKTITYMYRFETNRGFLLYPCSKEDSD 356 Query: 312 HRYKINGFDI 321 + I Sbjct: 357 KPFFSEELYI 366 >UniRef50_D1YX67 Putative uncharacterized protein n=1 Tax=Methanocella paludicola SANAE RepID=D1YX67_METPS Length = 433 Score = 174 bits (440), Expect = 6e-42, Method: Composition-based stats. Identities = 63/332 (18%), Positives = 114/332 (34%), Gaps = 27/332 (8%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELD 61 +P I ++ +L YA+ N E G N D+L Y L+ L RGL Sbjct: 73 KPKIEGDSLLRLLRYAYSLEDLDLYKNTEYSTGKMNFHDLLLYQLSIEANILISRGLHKK 132 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y P + +G + + R ++ + ++D NR++ + L I+ + Sbjct: 133 YKPTDSELKIPRGILNIQRIARRGGISKQSLPCKYYPRSDDNTMNRVLLAGL-IMGSNLT 191 Query: 122 LNSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNGG--KNTRYYKFVISVCKFIVNNSIP 178 + ++ R L L +S + L + S ++ + TR Y+ IS+ K ++++ Sbjct: 192 SSQKLKGRLRRLSFILRETVSPVKLNWEMMSVVDMDMSRLTRAYQPSISIIKMLMSSQGI 251 Query: 179 G-QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSIS---DQS 234 + K F + M+ +Q + + L + + Sbjct: 252 DLEQKDGMNLPGFLFD---MNHFFQVLISRYLHDYLHDYKVYDEPPLKGLMAYDNNYNPL 308 Query: 235 LNLLPRMETDITIRSSEKIL--IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 P + D +R + + I+D KY + LYQL Y S Sbjct: 309 KRRPPHLYPDFIVRDNMNKVVAILDTKYRDLW------KLPLPREMLYQLSIYAQSRNLG 362 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFDIGLC 324 I +YP D N I L Sbjct: 363 ENSTI----LYPTTDNVSSEN---NEQRINLY 387 >UniRef50_B1I205 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Candidatus Desulforudis audaxviator MP104C RepID=B1I205_DESAP Length = 435 Score = 171 bits (433), Expect = 4e-41, Method: Composition-based stats. Identities = 57/314 (18%), Positives = 101/314 (32%), Gaps = 22/314 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIP-GNNLLDILGYVLNKGVLQLSRRGLELD 61 +P + + +L YA+G +E D+L L+ +L RGL Sbjct: 76 RPKMTGFPLVALLRYAYGLRNLFLYGQVEMETTDRPFQDLLLSQLSAEAAELLSRGLHRA 135 Query: 62 YNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK 121 Y P E++ +GR+ F + R + + D L+N+++ + L Sbjct: 136 YRPRHELMASPRGRVNFQRLARTGGVRQSALPCYHHLRLADCLSNQVLVAGLRFGAGLTA 195 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG--GKNTRYYKFVISVCKFIVN--NSI 177 R ++ + L F+ L + TR Y+ + K + + Sbjct: 196 DLELRARLRRLAAVCGENVTPIRLDYHVFARLEREANRLTRAYEPAFRLTKILYRDAGAG 255 Query: 178 PGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSL-- 235 G+ G F + M+ +Q L F L Y + Sbjct: 256 LGREAGGLPVPGFLFD---MNRFFQAVLSRFLHENLDGFRVQDEYRLQGMFAYVPGFNPQ 312 Query: 236 -NLLPRMETDITIRSSEKI-LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 P D + ++ I+DAKY + LYQL Y Sbjct: 313 RRQAPAPRPDFVVFRGGRVAAILDAKYRDLWEN------ALPRDMLYQLALYA----LSQ 362 Query: 294 GENIGGLLIYPHVD 307 G + ++YP +D Sbjct: 363 GGGMRAAILYPTLD 376 >UniRef50_D1W5H2 Conserved domain protein n=3 Tax=Prevotella RepID=D1W5H2_9BACT Length = 440 Score = 170 bits (431), Expect = 6e-41, Method: Composition-based stats. Identities = 43/294 (14%), Positives = 98/294 (33%), Gaps = 23/294 (7%) Query: 13 YMLTYAWGYLQEIKQANL-EAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPG 71 Y+L Y + +L ++ D + ++ + R+G+ +Y + Sbjct: 106 YLLHYMLQKVLSFNLFDLSHNNEEEDVFDFIMFMFPYFLKAAMRQGVYREYQNFSHNDAN 165 Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEAR 131 +KG I+ A+ I G + + D +I+ T+ + S + + Sbjct: 166 LKGTIDIAQHIAKNVPFVGNIAYSTREYSHDNNMTELIRHTIEFMKTKRYGQSVLNVDHE 225 Query: 132 SLYRKLPGISTLHLTPQHFS--------YLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 ++ + L ++ + Y+ + +C I+ + K Sbjct: 226 TIENVKAIVEHTPLYNKNERGCIINKNLRVKAHPYFTEYRPLQMLCLQILR---MDEVKY 282 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 D + + L+++++ R + K D S P Sbjct: 283 GESNNDICGILFDGAWLWEEYVNTILRD-YDFKHPENKLHKGGIYLFDDHSGIRYPDFYK 341 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENI 297 D +++DAKY + K S +++Q+M Y+ +LK + G + Sbjct: 342 D--------DMVLDAKYK--LLGSYDKVSKVDSDDIHQVMAYMTALKVDQGGFV 385 >UniRef50_A1SC19 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Nocardioides sp. JS614 RepID=A1SC19_NOCSJ Length = 407 Score = 168 bits (425), Expect = 3e-40, Method: Composition-based stats. Identities = 50/343 (14%), Positives = 108/343 (31%), Gaps = 28/343 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQ-EIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + + +L A G ++ + P ++ +L + + G Y Sbjct: 65 PKVGAAKVLTLLARAQGVRGLKVDPELVGVAPHADISAVLAVLFAQEAATAMAAGPLRGY 124 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 + +P ++GR+ + T D DT NR ++ L+ + Sbjct: 125 RSEDQTLPVLRGRVRLREQHLRRFGLPVPLEVTVDEWTLDTDDNRRTRAAATALLALPGV 184 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 R L R L L + ++ + ++ ++ Sbjct: 185 PEHSTQALRRLDRLLGEAKLLAPGAP-LEPWTPTRLNVKMHRLLHLADVVLAHTSVEHEA 243 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G + + F N M+ L++ + + ++ S ++ Sbjct: 244 GATQTHGFVVN---MAWLFETLIARLLEEQ---TLGLVPQQTMPLDTLGRLS------IK 291 Query: 243 TDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 D+ ++ + D KY K + ++YQL+ Y L G L Sbjct: 292 PDLLFDGPGGVVAVADTKYKLL-----DDNGKVPNADVYQLVTYCARLGLSTGH-----L 341 Query: 302 IYPHVDTAVKHRYKINGFD--IGLCTVNLGQEWPCIHQELLDI 342 IY D + I G + + + V++ + I Q++ +I Sbjct: 342 IY-SSDEPGPDPFGIVGTNVLLVVHAVDVSRPVDVIEQQVREI 383 >UniRef50_A6EGF0 5-methylcytosine-specific restriction enzyme McrBC, subunit McrC n=1 Tax=Pedobacter sp. BAL39 RepID=A6EGF0_9SPHI Length = 232 Score = 166 bits (420), Expect = 1e-39, Method: Composition-based stats. Identities = 76/231 (32%), Positives = 131/231 (56%), Gaps = 8/231 (3%) Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 L+ +++EA+ +Y +L GI + ++PQ FS + +N +YKF + V + I + + Sbjct: 2 LSKALKNEAKGIYNQLTGIKDILISPQKFSLVTIHRNNIHYKFPLQVGQLITAQTAIEER 61 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDAS-SISDQSLNLLPR 240 G+Y F DF+RN +M+ L++ F+ F RE +R ++W + S S +L L+P+ Sbjct: 62 NGNYFFQDFDRNHHQMARLFESFVRRFYMREQKRFKVSRENIEWRINESESTGNLALIPK 121 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN------- 293 M+TDI++ S E+ +I+D K+Y S ++ R + K HS +LYQL +YL +L+ ++ Sbjct: 122 MQTDISLISPERKIIIDTKFYLSAYNSRYDSPKLHSSHLYQLYSYLCNLEEQSLSRNGGA 181 Query: 294 GENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFD 344 + G+L+YP A+ YKI I + T+NL W IH L+ + D Sbjct: 182 NKIYEGILLYPKNGIALDESYKIGSHRIKIYTINLEGPWQDIHDRLISLLD 232 >UniRef50_C1EZU3 Putative uncharacterized protein n=1 Tax=Bacillus cereus 03BB102 RepID=C1EZU3_BACC3 Length = 424 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 63/362 (17%), Positives = 133/362 (36%), Gaps = 37/362 (10%) Query: 11 IYYMLTYAWG----------YLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 ++ ML G + + ++ G N + L + + ++ + G Sbjct: 73 VFSMLEKVHGTEYKFKKNKKFWHFDPKTLVDIEQGMNFTEQLISMFISELWKVKKIGFSK 132 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 YN E + +KGR+ K I+ + K ++ LN T N I+ + LI H Sbjct: 133 TYNSKEENLNYLKGRLFIGKQIKYNVV-PKKFYCNYNELNYLTAENLILFEIINKLI-HL 190 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG--GKNTRYYKFVISVCKFIVNNSIP 178 ++S I+ E + + + + + +Y+ ++ + + + Sbjct: 191 SIDSKIKSELIYFKNEFAQALNID-RRINLKGIKYKATRLNMHYETIMYLSEMFLQKRFF 249 Query: 179 GQ-NKGHYRFYDFERNEKEMSLLYQKFLYEFCR---RELTSANTTRSYLKWDASSISDQS 234 G F +F +M L++K++ + + + D+ Sbjct: 250 STLESGENLFCNFL---IKMDDLFEKYILLLVKEIIENFFPKYRVEEQVNLNFVRKYDEK 306 Query: 235 L---NLLPRMETDITIRSS---EKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWS 288 L N M DI I + + I+++D KY K ++ YQ+++Y++S Sbjct: 307 LDRENGFLTMIPDIIIYNKTSNKPIVVIDTKYVDIT-----NKNKLNNNAYYQMLSYMFS 361 Query: 289 LKPENGENIGGLLI-YPHVDTAVKHRYKINGFDIGLCT--VNLGQEWPCIHQELLDIFDE 345 L +N + G+L+ + + Y+ G + + T V+L I L + D+ Sbjct: 362 LHLQNETLVTGILLSHGTSGHTYRINYRE-GQHMHIYTGSVDLLNTEEKIKDSLKLMLDK 420 Query: 346 YL 347 L Sbjct: 421 VL 422 >UniRef50_A7ZBW9 Putative uncharacterized protein n=1 Tax=Campylobacter concisus 13826 RepID=A7ZBW9_CAMC1 Length = 441 Score = 163 bits (414), Expect = 7e-39, Method: Composition-based stats. Identities = 53/332 (15%), Positives = 114/332 (34%), Gaps = 33/332 (9%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGY-VLNKGVLQLSRRGLELDYNPNTEII 69 + ML YA + NL I+ Y + + + + GL +Y Sbjct: 95 LERMLNYANDIYIDDVSLGKSVDAKENLSKIIIYYLFIQTLERAFLLGLPKEYKDKNYHE 154 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 + G+++ AK I+ GK ST + ++ L I+ K + + Sbjct: 155 AKVMGKVDVAKFIKSDIPFTGKISSTNRERQDMGDIVLLLHKALKIVQKE---SKELIKP 211 Query: 130 ARSLYRKLPGISTLHLTPQHF------SYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 + L I L + S YK V+ K I+ N G Sbjct: 212 VINTLSYLNEIREPRLVTPNVIHNALNSKALHNPIYTPYKKVLEYAKLIIENEDAGTKSN 271 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + F + ++ L++ ++ + ++E + T ++ + ++ Sbjct: 272 GKQNLGFLVD---VAELFEIYIRKLLQKEFKDWSVTSPKIEL------YKDKFFARKIIP 322 Query: 244 DITIRSSEKILIVDAKYYKSIFSR--RMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLL 301 DI + S +++L+ D KY + + G + +Q+ Y+ + + I G L Sbjct: 323 DIVMSSGDQVLVFDTKYKRMNMQGKDQYGLGDVDRNDFFQINTYMSYYQNQGKNVIAGGL 382 Query: 302 IYPHVD------------TAVKHRYKINGFDI 321 +YP + ++ ++G ++ Sbjct: 383 LYPMDKFSRDRCHNHSWFENLNTKFIVDGIEL 414 >UniRef50_D2Q5W0 3-isopropylmalate dehydrogenase n=2 Tax=Bifidobacterium dentium RepID=D2Q5W0_9BIFI Length = 510 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. Identities = 39/327 (11%), Positives = 90/327 (27%), Gaps = 42/327 (12%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + YM+ + I + + +L ++ + + +G+ Y Sbjct: 138 LAYMMDKVLNF--NILGLDFDTERDGAWQKMLMLMIPFHLERAMSKGIYKQYMSRRYNDS 195 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK--------HEKL 122 +G I+ + I G + D +I+ T+ + + Sbjct: 196 RPRGVIDIPRHISRNVPFRGTVAYNTREFDVDNPVTELIRHTIEYIRAQGNFGTQLLRNV 255 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGK----NTRYYKFVISVCKFIVNNSIP 178 N R + + + + Y+ + +C I+ Sbjct: 256 NRDTDTYVREIRQATWKHYDSNARAKIIHENRTHPVRHAFYSEYRDLQQLCLKILTKQGV 315 Query: 179 GQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSY---------LKWDASS 229 G + + S L++++L LKW Sbjct: 316 DTGCGEDAVHGLLFSC---SWLWEEYLNTLLSGHFKDYEVKHPRNLDQHKKDALKWPIFK 372 Query: 230 ISDQSLNLLPRMETDITIR---SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYL 286 + + D ++ + +++DAKY + + +QL+ Y Sbjct: 373 VGGTDDQTENWLIPDFLLKPTADDRENIVMDAKYK--------PRKNISRDDRFQLLAYK 424 Query: 287 WSLKPENGENIGGLLIYPHVDTAVKHR 313 + + GL +Y D A K + Sbjct: 425 LYFR-----SHKGLFLYAAKDEAEKEK 446 >UniRef50_C9LKR7 Putative uncharacterized protein n=1 Tax=Prevotella tannerae ATCC 51259 RepID=C9LKR7_9BACT Length = 405 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. Identities = 51/295 (17%), Positives = 92/295 (31%), Gaps = 36/295 (12%) Query: 40 DILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG--KTVSTFD 97 +L V ++ + L+ Y +E + +KGRI K R + +D Sbjct: 111 PLLVLHFLGVVSRI--KELKKGYVSRSENLKKVKGRISILKNERQNIAIRRYDRVFCEYD 168 Query: 98 MLNEDTLANRIIKSTLAILIK-----HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSY 152 + D NR+IK L + +E+ + + +S + + Sbjct: 169 EYSADIPENRLIKKALLFSQRLLQGLNERSAAVAKLRLNKSLALFSEVSD-KVEIKQVKR 227 Query: 153 LNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRE 212 L K Y I + K I+ +K +MSLLY+ ++Y Sbjct: 228 LRAHKLFTNYNEAIRLAKLILRLFDYNISKVGSHEGKVVPFWLDMSLLYEHYVYGLLHEA 287 Query: 213 LTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTE 272 T + K D +S E I+D KY + + + Sbjct: 288 YRERITYQFKGKTGF---------------PDFLYKSKEYKAILDTKYIPKYDEKSLDKD 332 Query: 273 KFHSQNLYQLMNYL------WSLKPENGENIGGLLIYPHVDTAVKHRYKINGFDI 321 QL Y L+ ++ I ++IYP + + N + Sbjct: 333 VVR-----QLSGYGRDLRILTHLEYKDVSPIPCIIIYPKEGKRKNNPFLGNNLRM 382 >UniRef50_Q08S71 Putative uncharacterized protein n=1 Tax=Stigmatella aurantiaca DW4/3-1 RepID=Q08S71_STIAU Length = 420 Score = 163 bits (413), Expect = 8e-39, Method: Composition-based stats. Identities = 53/350 (15%), Positives = 113/350 (32%), Gaps = 25/350 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P +P ++ + YA L + + LL +LG + + V +L+R G Y Sbjct: 62 RPQVPSLHLLALADYAAKGL-SWGEDLVGMAEAEELLPLLGALFLRRVERLARGGWVHGY 120 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 P ++GR + I + + FD D NR++ + L +L + Sbjct: 121 REEEAGQPVLRGRWLAGRDIAQPPTHRHRLTCRFDEFTRDVAPNRLLLAALRVLERARTF 180 Query: 123 NSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNK 182 + AR+L L G+ + L + Y + + I+ ++ Sbjct: 181 GPPVAARARALAATLDGVQARAEVHAAETMLASDRRFAAYGPAAKLARLILESTGVQAAP 240 Query: 183 GHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRME 242 G + F L++ + + + +++ ++ L + Sbjct: 241 GPHPLSSFILRLAP---LFEAAVTRALVQVAEARGL-ACHVQRPLVLDTEGRL----LLV 292 Query: 243 TDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLI 302 D + L++DAKY + + Q++ Y G L+ Sbjct: 293 PDAVVEQGGARLVIDAKYKFPA------KGLPPADDFQQIVTY-----LACVGTHRGALV 341 Query: 303 YPHVDTAVKHR---YKINGF--DIGLCTVNLGQEWPCIHQELLDIFDEYL 347 P + + G + + V LG + + L + + Sbjct: 342 LPALGAVPEEETLRLMTFGRTSQVRVVKVPLGGPAATLGRALESTAERLV 391 >UniRef50_B9CT57 Putative uncharacterized protein n=1 Tax=Staphylococcus capitis SK14 RepID=B9CT57_STACP Length = 415 Score = 163 bits (412), Expect = 1e-38, Method: Composition-based stats. Identities = 39/271 (14%), Positives = 90/271 (33%), Gaps = 13/271 (4%) Query: 27 QANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFH 86 N ++D+ ++ + + R+GL +Y N +KG I+ + I+ Sbjct: 86 DLNTIIDRDEKVIDLFIFIFPYYLKKAMRKGLYKEYTRNEYNNHNVKGTIDIQRHIKNNT 145 Query: 87 LNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE---KLNSTIRDEARSLYRKLPGISTL 143 GK + + D ++I+ T+ + K + + S +R+E R + + L Sbjct: 146 PFIGKIAYSQREFSYDNYILQLIRHTIEFIKKKKEGVNVLSNVRNEVREICEVTTSYNYL 205 Query: 144 HLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQK 203 YYK + K + ++ + + L+++ Sbjct: 206 DRNKVLMFNNKQPIRHAYYKEYRELQKLCLTILQQHKHYIGSNAEKIHGIIFDGAWLWEE 265 Query: 204 FLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKS 263 ++ + S Q + + D R+ +I DAKY Sbjct: 266 YIDTLIHEDYYHPKNKGGSGAQRL--FSTQMGAKIGLVYPDFIGRNQNYRIIGDAKYKPI 323 Query: 264 IFSRRMGTEKFHSQNLYQLMNYLWSLKPENG 294 + +++ Q++ Y++ + G Sbjct: 324 --------QNIGNRDYLQVLAYMYRFDAKTG 346 >UniRef50_UPI00016C4D94 hypothetical protein GobsU_06730 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4D94 Length = 385 Score = 161 bits (408), Expect = 3e-38, Method: Composition-based stats. Identities = 53/306 (17%), Positives = 101/306 (33%), Gaps = 37/306 (12%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P IP N+ ++L + IP LL L + ++R GL Y Sbjct: 66 PKIPWPNLQFLLGSGVHPTGGTTR-----IPEGGLLGTLATAFADQLEAVARAGLVAGYG 120 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDM--LNEDTLANRIIKSTLAILIKHEK 121 + P ++G++ A +R D + +T NRI +S L H Sbjct: 121 EVESVSPFLRGKLRTAAQMRDAASQAFPGHFHIDEPSFDLNTPWNRIARSAATTLGTHPD 180 Query: 122 LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQN 181 + R+ + R L + +H T FS Y ++ +C I + Sbjct: 181 VPRATRERIETAARPLAEVPNVHGTDADFSAARTEPRAVGYHALLDLCAIIQQGFSVA-D 239 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRREL--TSANTTRSYLKWDASSISDQSLNLLP 239 F + ++++L+ RREL + ++ + + Sbjct: 240 PLRTGSDAFLLDLG---QAFERYLFRSLRRELADRPGWSVDAHPAFALGPV--------- 287 Query: 240 RMETDITIRSSE-KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 + D+ +R ++DAK+ + +L+Q++ Y Sbjct: 288 TLRPDVLVRKRAVARGVLDAKWKTTA---------LDPADLHQVLAYAGLTGAPRVG--- 335 Query: 299 GLLIYP 304 L+YP Sbjct: 336 --LVYP 339 >UniRef50_D1YRX5 Conserved domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YRX5_9FIRM Length = 445 Score = 161 bits (408), Expect = 3e-38, Method: Composition-based stats. Identities = 49/313 (15%), Positives = 97/313 (30%), Gaps = 29/313 (9%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + Y+L I + + LLD L ++ K + R+GL Y Sbjct: 98 LSYLLESVLNI-PNIVSLDADTSSDKRLLDFLLFIFPKYLGSAIRKGLYKQYIYKQYNDM 156 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN---STIR 127 IKG+I+ + + G + + + D +I+ T+ + + S I+ Sbjct: 157 KIKGKIDIPRHLIRNIPFIGSIAYSQRLFSYDNTLIELIRHTIEFIKSKSYGSIILSDIK 216 Query: 128 DEARSLYRKLPGI---STLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 +E + + Q+ + R Y + +C I+ + G Sbjct: 217 EEVNLIVNATQSYRACDRQKIIEQNKKNIIRHAYFREYSVLQRLCILILKSEKHDIGGGI 276 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 Y + L+++++ S + D Sbjct: 277 QNSYGILFDGAW---LWEEYINILLNSHFYHPKNKSKSGAQQLFSDGKGL------IYPD 327 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 +S+ IVDAKY + ++ Q++ Y++ G IYP Sbjct: 328 FISKSTAPRSIVDAKYKPI--------DNIRGRDYLQVLAYMYRFDAY-----KGYYIYP 374 Query: 305 HVDTAVKHRYKIN 317 + V K+N Sbjct: 375 ESNEQVPEILKLN 387 >UniRef50_Q5JH86 Putative 5-methylcytosine restriction system, catalytic subunit n=1 Tax=Thermococcus kodakarensis RepID=Q5JH86_PYRKO Length = 476 Score = 159 bits (403), Expect = 1e-37, Method: Composition-based stats. Identities = 55/316 (17%), Positives = 106/316 (33%), Gaps = 25/316 (7%) Query: 10 NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 N+YY + G ++ + E L +I Y+ + + RGL +Y E Sbjct: 107 NMYYQMGLEPGEIRALVF---EYGRQKALDEIFKYLYVLMLSRALSRGLYYEYGEIEESS 163 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 ++GRI + R + +L ED NR++K L + +K +L+ T + Sbjct: 164 QTVRGRILVNELARRPA-WKADLPVRYSLLLEDNPLNRVLKGALEVAVKSARLSETRKVG 222 Query: 130 ARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYD 189 L + F ++ ++ V + + + G +F Sbjct: 223 GI-LLDLFRDVGDPKPG--DFGKVSFNHLNERFRTVFRLARVMYFGLAA---GGSRKFLP 276 Query: 190 FERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLP-------RME 242 M L++ +Y + L + R ++ + + Sbjct: 277 GVF--IRMDELFETLVYRTLKTVLDNEAEVRFQVQLPHVIKNAGEIEARFGALFMMGNPL 334 Query: 243 TDITIRSSEKILIVDAKY-----YKSIFSRRMGTEKFHSQNLYQLMNYLW-SLKPENGEN 296 DI + + E +V+ KY Y +R S LYQ Y + + Sbjct: 335 PDIVVSTDEGTCVVEVKYRNLYVYHRGENRAHRKLVRKSDELYQAYTYSRLVSEYLGAKR 394 Query: 297 IGGLLIYPHVDTAVKH 312 + LL+YP ++ H Sbjct: 395 VPVLLVYPRLEGIYNH 410 >UniRef50_D1PAJ0 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PAJ0_9BACT Length = 416 Score = 157 bits (397), Expect = 5e-37, Method: Composition-based stats. Identities = 46/342 (13%), Positives = 99/342 (28%), Gaps = 45/342 (13%) Query: 5 VIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNN----------LLDILGYVLNKGVLQLS 54 I ++ M + E N + ++ + V ++ Sbjct: 69 KIDFLRMF-MTCFCSDLAVESFSKIYSIDQENPAIVAPVLSSVVSPLIVFHFIGVVNRI- 126 Query: 55 RRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG--KTVSTFDMLNEDTLANRIIKST 112 + L Y + + +KG I+ K R + + + DT NR++K Sbjct: 127 -KSLRKGYVLRQKNLKKVKGHIKMLKNERINIAVKRYDRIYCEYADYSVDTPENRLLKKA 185 Query: 113 L-------AILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFV 165 L A + ++ + S + K +S + + K R Y Sbjct: 186 LVFSQRFVAKINRNNVVYSKVNQMVTKALSKFDYVSD-DININSIGQIRSNKLYREYAEA 244 Query: 166 ISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKW 225 + + K I+ + + +MSLLY+ ++Y + + Sbjct: 245 MRLAKVILKHFDYSLSNVEATENRVTPFVLDMSLLYEHYVYGLLHEAYREK------ISY 298 Query: 226 DASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLY----- 280 ++ D +S I+D KY + + Y Sbjct: 299 QYPGVTGL---------PDFLYKSKHFNAILDTKYIPKYEKGTLDNYVIRQLSGYSRDLT 349 Query: 281 --QLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKINGFD 320 + + Y + ++ ++IYP + +K N Sbjct: 350 ILRKLGYEDIDEDSPAPSVPCIIIYPKEGGDTTNPFKSNKLR 391 >UniRef50_UPI000197B531 hypothetical protein BACCOPRO_00002 n=1 Tax=Bacteroides coprophilus DSM 18228 RepID=UPI000197B531 Length = 446 Score = 157 bits (397), Expect = 6e-37, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 110/313 (35%), Gaps = 30/313 (9%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 ++YML+ + L D L Y+ + + + +G+ +Y N Sbjct: 110 LHYMLSKVFCINVLNLSHGTSDEQ---LFDFLLYMFPRFLNEALSQGIYKEYKRNEYNDA 166 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRD-- 128 ++G I + +R +G+ + + D +I+ T+ + K + + + + Sbjct: 167 NVRGTININRHLRTNMPFNGRIAYSTREFSHDNHVTELIRHTIDYISKSKFGRTLLENDS 226 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNGGK---NTRYYKFVISVCKFIVNNSIPGQNKGHY 185 E R+ ++ + + + + N YY + K + + K Sbjct: 227 ETRTSVTQIISATPSYCRQEREIVVKSNLKEINHPYYSRYTPLQKLCLRILRHEKIKYGE 286 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDI 245 + ++S L++++L ++ K I N LPR D Sbjct: 287 KKNKIHGILFDVSYLWEEYLATILTKQ----GFKHPNNKKGFGCIYLAKHNRLPR-YPDY 341 Query: 246 TIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 LIVDAKY K +++Q++ Y++ +K + G+ + P Sbjct: 342 YREYD--RLIVDAKYKKETN----------RDDIHQMITYMYQMKGK-----RGIFVQPG 384 Query: 306 VDTAVKHRYKING 318 +K + + G Sbjct: 385 DKEYLKDIFHLLG 397 >UniRef50_C7M8S6 McrBC 5-methylcytosine restriction system component-like protein n=2 Tax=Capnocytophaga RepID=C7M8S6_CAPOD Length = 437 Score = 157 bits (396), Expect = 7e-37, Method: Composition-based stats. Identities = 51/297 (17%), Positives = 102/297 (34%), Gaps = 39/297 (13%) Query: 33 IPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP-GIKGRIEFAKTIRGFHLNHGK 91 + L L + + R+GL+ Y E + IKG+I+ K ++ + K Sbjct: 134 QQKDTLTPFLMVQFLLLLKCIVRKGLKKSYYTVEENLNNRIKGKIQLDKHLKQN-VFKNK 192 Query: 92 TV---STFDMLNEDTLANRIIKSTLAILIKHEKL--------NSTIRDEARSLYRKLPGI 140 + D+L NR +K L +I + + +IR+ I Sbjct: 193 LTAHVCRYQEFGMDSLENRFLKKVLQFIISFKNTHSNYFAGNDESIRELITYCSPHFELI 252 Query: 141 STLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLL 200 S L + L + Y+ I + K I+ + + +M L Sbjct: 253 SE-ELDVESLKKLTTNPFFKEYEEAIRIGKQILKRFSYNITETTQQKVAIPPFWIDMPKL 311 Query: 201 YQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKY 260 ++ ++Y+ + + S + D + + D + + E +++DAKY Sbjct: 312 FELYVYKKLQEQFGSRGEVHYHFTGDYTEL-------------DFLLNTPEYKMVIDAKY 358 Query: 261 YKSIFSRRMGTEKFHSQNLYQLMNYLW------SLKPENGENIGGLLIYPHVDTAVK 311 R+ ++ Q+ Y +LK ++ I L+IYP + Sbjct: 359 KTVYEDSRV------IDDIRQVSAYARLERVYKALKIDDNRLIDCLIIYPSFEQNTD 409 >UniRef50_UPI0001B4EC67 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4EC67 Length = 424 Score = 156 bits (395), Expect = 1e-36, Method: Composition-based stats. Identities = 41/359 (11%), Positives = 103/359 (28%), Gaps = 25/359 (6%) Query: 3 QPVIPVR--NIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLEL 60 +P P+ + L YA + + + + ++ L +L R GL Sbjct: 69 RPKFPIAGDRLIDWLCYANKQEEPDETLRNWPLGSDGYAGLVPAALLHECRRLLRHGLRR 128 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE 120 DY + + ++GR++ + + + N + + L + + Sbjct: 129 DYVRHHRVDTTLRGRLDVEAQATRCYGAVDRLHLQTFEYQDGGWENLVCGAALTVAARRS 188 Query: 121 KLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQ 180 + R + P + +Y+ + + ++ Sbjct: 189 SDPAQTRWLLD-AAAQFPSPRQPLDAVSLLQRGQYTRLNTHYRAAHAWARMVLGGGGVTN 247 Query: 181 NKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPR 240 Y F +++L+++ + + + A Q + P Sbjct: 248 LLEPYGFGAKSL-MLNLNVLWERVVRRMAVDAAVDLGGRGARGEEKAIHTHGQRNDKTPT 306 Query: 241 METDITIRSSEKI--------LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 D+ + + L VDAKY + + + +QL+ Y+ Sbjct: 307 FNPDVLLAFPPQTDSSADIRFLAVDAKYK------GYMEKNVSAADRHQLLTYIAGYTAP 360 Query: 293 NGENIGGLLIYPHVDTAVKHRYKINGFD-----IGLCTVNLGQEWPCIHQELLDIFDEY 346 L+++P + ++ G I + ++ L + E+ Sbjct: 361 EYPL--ALVVHPSAAAPTERELRVQGPRGRLGLIKVLGLDTRTAPKDATGPLREAIAEF 417 >UniRef50_A9FMX9 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9FMX9_SORC5 Length = 400 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 53/340 (15%), Positives = 111/340 (32%), Gaps = 35/340 (10%) Query: 11 IYYMLTYA---WGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRR-GLELDYNPNT 66 ++ + + ++ A A + + I + + + R G DY Sbjct: 71 VFAWMCAVDPRFRPVKWRGTAPEGADGQHGVASIAVRAFAQILEEDLSRAGPRRDYQRRE 130 Query: 67 EIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTI 126 + ++G I +A+ R + + N DT NR+ + + H+ L Sbjct: 131 DDASVLRGTIRWAELARR--TSPVPVPCRYWERNIDTPLNRLFAAAVHAASAHDSLREAG 188 Query: 127 RDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYR 186 L+ + L + ++ S+ I+ G R Sbjct: 189 GMPLDRLHGIFGHVPRLPPAWILDRTRPLPRLEADFEAARSLAITILQAFGISHG-GAQR 247 Query: 187 FYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT 246 F + + L++ + R + +++ +D + D+ Sbjct: 248 ALAFHVDL---ARLFEMTVEAAARTQAWDGKVA---IQYQPPYEADVAGEES---RIDVL 298 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHV 306 +R+ + L++DAKY K+ F +LYQ++ Y+ L G L+YP Sbjct: 299 VRARGEALVIDAKYSKA----------FSKSHLYQVLAYMKMLGAR-----RGALVYPKG 343 Query: 307 DTAVKHRY-KINGF---DIGLCTVNLGQEWPCIHQELLDI 342 R+ G ++ L V+L +EL + Sbjct: 344 AELRGERFWSAPGAPEWEVRLHEVDLVAVASNGRRELERL 383 >UniRef50_Q7MVS1 Putative uncharacterized protein n=1 Tax=Porphyromonas gingivalis RepID=Q7MVS1_PORGI Length = 431 Score = 155 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 86/287 (29%), Gaps = 35/287 (12%) Query: 38 LLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG--KTVST 95 L ++ V + +RGL+ DY + +KG I ++ R + K + Sbjct: 133 LSPLIVVHFLSVVRGIVKRGLKKDYVQRENNLNKVKGHIAISRNERTNVIRKRFDKVLCK 192 Query: 96 FDMLNEDTLANRIIKSTL----AILIKHEKLNST--IRDEARSLYRKLPGISTLHLTPQH 149 + +E+ NR+IK L IL +S +R + + Sbjct: 193 YQEYSENIPENRLIKKALLFSREILENLAITSSLIPLRHAIHQYLSAFCNVDE-QIEVWE 251 Query: 150 FSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF-ERNEKEMSLLYQKFLYEF 208 + K + Y I + + I+ ++ +M+LLY+ ++ Sbjct: 252 VKNIKHHKIFKEYDEAIRLAQMILRRYDYSITNIRPAEEEYCPVFWLDMALLYEHYVLGL 311 Query: 209 CRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRR 268 + + +++ D +++D KY Sbjct: 312 LKVAYGNKIMYQAHGYTGY---------------PDFICYDP--KIVMDTKYIPRFEKDG 354 Query: 269 MGTEKFHSQNLYQLMNYLWS---LKPENGENIGGLLIYPHVDTAVKH 312 + QL Y K ++I L+IYP Sbjct: 355 IDVYIVR-----QLCGYSRDRRLFKTCPDKSIPCLIIYPKEGEPQNP 396 >UniRef50_C1QFM5 McrBC 5-methylcytosine restriction system component n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QFM5_9SPIR Length = 430 Score = 154 bits (390), Expect = 3e-36, Method: Composition-based stats. Identities = 50/313 (15%), Positives = 104/313 (33%), Gaps = 24/313 (7%) Query: 13 YMLTYAWGYLQEIKQANLEAIPG-NNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPG 71 Y L Y + + NLE +N D L Y+ + R+GL Y Sbjct: 106 YFLHYILMKVLNLNIINLEHSKDYDNSFDFLIYMFISFFKKALRQGLFKQYKLIKHNDCH 165 Query: 72 IKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEK----LNSTIR 127 +KG I+ + I+ +GK + D ++I+ T+ + + ++ I+ Sbjct: 166 VKGTIDINRYIKNNIPFNGKISYNTREYSYDNNMTQLIRHTIEYINTKNRYILGYDNEIK 225 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLN--GGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 + + ++ P L Y+ + +C I+ + Sbjct: 226 NYIQQIFYSTPSYEKNKRESIINKNLKKLSHPYYYEYEPLRKICIQILRHEKLKYGSDDN 285 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDI 245 Y + +++++L F + + + S LN + D Sbjct: 286 TVYGLLFDGAW---IFEEYLNTFL------SKINFIHAENRTSKNGINLLNNAWIVYPDF 336 Query: 246 TIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPH 305 S +++DAKY + +E + +Q+++Y ++L + IYP Sbjct: 337 YKLSENNNIVLDAKYKRL---DNYISENIDRNDKHQIVSYAYTLNAKKAG-----FIYPT 388 Query: 306 VDTAVKHRYKING 318 + K I Sbjct: 389 ENNNYKDYDYIGN 401 >UniRef50_C9BWA4 Guanosine 5'-monophosphate oxidoreductase n=4 Tax=Enterococcus faecium RepID=C9BWA4_ENTFC Length = 430 Score = 153 bits (386), Expect = 1e-35, Method: Composition-based stats. Identities = 42/307 (13%), Positives = 97/307 (31%), Gaps = 38/307 (12%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + YML Y + A D+L ++ + + R+G+ +Y Sbjct: 98 LRYMLQRVLNYNVI--NDHFSASKKMTYYDLLVFLFPYYLNEAMRKGIYKEYVKREYNNA 155 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAI-------LIKHEKLN 123 IKG ++ AK IR GK + + ++I+ T+ L+ ++ Sbjct: 156 NIKGAVDVAKHIRSNVPFVGKVAYRTREFSYNNHLTQLIRHTIEKIQNEYDFLLSGDEDT 215 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 + + ++ ++ + Y + +C I++ G Sbjct: 216 KENVVLIKQTTPDYARLDQFNILQENIFHPVKHSYYEEYSALQQICIQILSEEKSGFGSD 275 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 + ++S L+++++ W + E Sbjct: 276 KNQ---IHGIIIDVSWLWEEYI--------------GKVTGWKHYGRDKGLATMHLFQEP 318 Query: 244 DITIRSSE---KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGL 300 + + R + + +D KY K + + + QL+ Y+ + + + I G Sbjct: 319 NRSPRYPDFTFNNIPIDTKYKKHLDT---------RNDYNQLVTYIHIMNLDQADTIKGG 369 Query: 301 LIYPHVD 307 + P D Sbjct: 370 FLQPTSD 376 >UniRef50_B9KCB6 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KCB6_CAMLR Length = 459 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. Identities = 51/301 (16%), Positives = 106/301 (35%), Gaps = 17/301 (5%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLD-ILGYVLNKGVLQLSRRGLELDYNPNTEII 69 + ML +A + + NN+ + IL Y+ + + + GL +Y Sbjct: 108 LERMLNFANDVFVDDVSIYQDVKKENNISEFILFYIFVQKLEKSFLIGLPKNYQSKKYND 167 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL-IKHEKLNSTIRD 128 KG I+ + I+ K + ED ++ L ++ K+ ++ Sbjct: 168 LRFKGNIDMVEFIKHNIPLKAKVATKTREQIEDVYIINVLYKALEVIEKKNSGFLKNVKH 227 Query: 129 EARSLYRKLPGISTLH--LTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPG-QNKGHY 185 L + + L S +N + YK ++ K I+ + +NK Sbjct: 228 IKTYLVQNKSEHCFIKESLNKAFSSKALRNQNYQNYKELLKYAKMIIESQNFTSKNKNDQ 287 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDI 245 + Y F N + L++ ++ + R L+ +S + + DI Sbjct: 288 KSYGFIVNI---AELFEIYISKLLRNNFEEYMVDSPKLEIYKNSFYKR------HIIPDI 338 Query: 246 TIRSSEKILIVDAKYYKSIFSRRMGT--EKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 + + ++ D KY + R +L+Q+ Y+ + G+ + G L+Y Sbjct: 339 VLSKDDTYMVFDTKYKRMKMEGRSQNGMGDLDRNDLFQIHTYM-GYYQKIGKVLLGGLLY 397 Query: 304 P 304 P Sbjct: 398 P 398 >UniRef50_A7GF31 Putative uncharacterized protein n=1 Tax=Clostridium botulinum F str. Langeland RepID=A7GF31_CLOBL Length = 419 Score = 152 bits (384), Expect = 2e-35, Method: Composition-based stats. Identities = 63/363 (17%), Positives = 124/363 (34%), Gaps = 30/363 (8%) Query: 4 PVIPVRNIYYMLTYAWGYLQ-EIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 P + +Y +L YA+G + ++ I + D++ Y L L RRG++ Y Sbjct: 69 PKLNGLPLYQLLRYAYGLRELKLFNVAEHTIDNFSFFDLIIYELYVEAEDLLRRGIQKSY 128 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKL 122 E + +GRI+ + L + +E+ + N+ L L Sbjct: 129 IHREENLSSPRGRIDMNRLCGQGGLIKDTLPCKYFNRDENNILNQ-TLLAGLKLGLKLVL 187 Query: 123 NSTIRDEARSLYRKL-PGISTLHLTPQ--HFSYLNGGKNTRYYKFVISVCKFIVNNSIPG 179 +S ++ + + + L IS + LT + + + T Y V + + + Sbjct: 188 DSGLKIKLQRICSNLKENISDITLTRGSLQLARNSINRLTGRYSAVFEIINILYESQGI- 246 Query: 180 QNKGHYRFYDFERNEKEMSLLYQKFLYEFCR---RELTSANTTRSYLKWDASSISDQSLN 236 Q + R+ + +M+ ++ + + + + + + + Sbjct: 247 QLENASRYINLRGYFFDMNAFFETLVGRLLENCSDRYSIKDQFSLHDMFIYTPGFNPCRR 306 Query: 237 LLPRMETDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGE 295 P D + K++ ++DAKY LYQL Y S + Sbjct: 307 KSPTPRPDFALIRQGKVVKLLDAKYRNLWEKN------LPRDMLYQLAIYAVSGIGDKTA 360 Query: 296 NIGGLLIYPHVDTAVKHRYKINGFDIG--------LCTVNLGQEWPCIHQE--LLDIFDE 345 I +YP ++ + I L VNL + I+ + L DE Sbjct: 361 TI----LYPSLNDVTTVQMIDINDPISSSKMASVILKPVNLIKVAEMINDDKVLSKFVDE 416 Query: 346 YLK 348 LK Sbjct: 417 LLK 419 >UniRef50_D1PC49 Putative uncharacterized protein n=1 Tax=Prevotella copri DSM 18205 RepID=D1PC49_9BACT Length = 437 Score = 152 bits (383), Expect = 3e-35, Method: Composition-based stats. Identities = 48/333 (14%), Positives = 103/333 (30%), Gaps = 29/333 (8%) Query: 9 RNIYYMLTYAWGYLQEIKQANLE-AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTE 67 +N ++M Y + NL+ N+L IL + + ++G+ +Y Sbjct: 97 QNDFFM-HYMLQKVFSYNIFNLDFMSTEENILKILVLMFPTMLKTAMKQGIYKEYRKIQY 155 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 ++G I+ ++ IR G D D +I+ T+ + + L I Sbjct: 156 NDSNVRGTIDISRHIRENIPFCGNISYDTDEFCYDNAVMELIRHTIEYI-RTIPLGDMIL 214 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNG---------GKNTRYYKFVISVCKFIVNNSIP 178 + + + + +H L Y + +C I+N Sbjct: 215 SSNEVVEEYVSKVISYTPCYRHSDRLKIIHENLTPCRHPYYTGYNALQKICIQILNQEDM 274 Query: 179 GQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLL 238 G + L++++L + Sbjct: 275 KYGDGDGSVSGILFDGAW---LWEEYLNTLLCD-YDFNHPQNKQGTGAIYLFEHGGKR-- 328 Query: 239 PRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 D +K ++DAKY K +++ ++ Q++ Y++ LK + I Sbjct: 329 ---YPDFW----KKDFVLDAKYKK--YAQSGNKLDIAIDDINQIVTYMFRLKSQKSGIIC 379 Query: 299 GLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQE 331 L+ + + R NG++ + L Sbjct: 380 PLI--GEKNKTISERMNKNGYNGVMYIYALAIP 410 >UniRef50_C1QC92 McrBC 5-methylcytosine restriction system component n=1 Tax=Brachyspira murdochii DSM 12563 RepID=C1QC92_9SPIR Length = 409 Score = 151 bits (381), Expect = 4e-35, Method: Composition-based stats. Identities = 56/335 (16%), Positives = 107/335 (31%), Gaps = 52/335 (15%) Query: 3 QPVIPVRNIYYM------LTYAWGYLQEIKQANLEAIP----------GNNLLDILGYVL 46 P I NI +M L+Y+ K ++ L + Sbjct: 74 NPKID--NIDFMKMFSKCLSYSSMIKDFDKIYSINFEEPAIDYKGSILIKGLDALTSIHF 131 Query: 47 NKGVLQLSRRGLELDYNPNTEIIPG-IKGRIEFAKTIRGFH--LNHGKTVSTFDMLNEDT 103 + + GL+ ++ E + IKG+I+F+ I+ + + ++ + + Sbjct: 132 LRMLELELHNGLKRNFIRKEENLNSKIKGKIDFSNHIKKNIMTARNDRVYCSYFDYDINC 191 Query: 104 LANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYK 163 L NRI+K L I + +S L + + YK Sbjct: 192 LENRILKKALKICYSNIGSIYNSFS----CMTFFSEVSD-ELHFYELHNIKLNPLYKKYK 246 Query: 164 FVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYL 223 +I + I+ + F +MSLL++K++Y L + N Y Sbjct: 247 LLIKLAINIIKLKRYKDSNKENYAPPFY---IDMSLLFEKYVYALLDDSLKNKNAKILYQ 303 Query: 224 KWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLM 283 + + +++ D I+ + I D KY S + +++ QL Sbjct: 304 EVYSRH----------KLKPDFIIKCNGYDYIADTKYKSSC------NNGINIEDIRQLS 347 Query: 284 NYLWS-------LKPENGENIGGLLIYPHVDTAVK 311 Y ++IYP D+ K Sbjct: 348 GYGRVESIVKEFTNDIENYIPNCIIIYPSDDSNNK 382 >UniRef50_D0LPM4 5-methylcytosine restriction system component-like protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LPM4_HALO1 Length = 404 Score = 148 bits (373), Expect = 4e-34, Method: Composition-based stats. Identities = 51/351 (14%), Positives = 114/351 (32%), Gaps = 44/351 (12%) Query: 11 IYYMLTYAW--GYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRR-GLELDYNPNTE 67 + L YA + + E +L ++ V + GL Y+ + Sbjct: 67 LLTWLAYADPGLAALRLLRPLPETTREGDLGPLVARVFCAATWHAIQTSGLLRAYHRQSV 126 Query: 68 IIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIR 127 I+GRI+F + + + +T T NR++ + +A + + L ++ Sbjct: 127 RSSMIRGRIDFPR-LVHAGGDLSRTPCIVFSRLPQTPLNRLLAAAVAQIRRDPVLRASAG 185 Query: 128 DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 + L L +S + + + + + ++ I+ ++ + Sbjct: 186 ADLPPLATALADVSPHLDRALLSARIPLSRLEQPFAASHALACLILRSAGLASGS-EHEG 244 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISD---QSLNLLPRMETD 244 F + + L+++ + R + KW + + ME D Sbjct: 245 AGFLVDL---ANLFERAVARAFRDA-----PFAAEAKWRVQLLREAPSTPSTQGSSMELD 296 Query: 245 ITIRS-SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 + + + ++VDAKY + + NL Q++ Y + +L++ Sbjct: 297 VFLPDVRGQRVVVDAKYKTRVTTG----------NLQQMITYCVA-----SGTHQAVLVF 341 Query: 304 PH-----------VDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIF 343 P V Y+I+ + L +L W + L D Sbjct: 342 PAGHLTDRRAHVLVPHRGPPAYRIHLVEFELTQTDLAG-WRDAGRRLADAV 391 >UniRef50_C9PRC9 Putative uncharacterized protein n=1 Tax=Pasteurella dagmatis ATCC 43325 RepID=C9PRC9_9PAST Length = 481 Score = 145 bits (366), Expect = 2e-33, Method: Composition-based stats. Identities = 51/343 (14%), Positives = 114/343 (33%), Gaps = 37/343 (10%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + Y+++ A G+L+ + ++ +L Y+ N + + R G+ Y+ ++ I Sbjct: 124 LKYIISDADGFLEIKDFS--ATEKKDSYAWLLAYLWNIKLKRAYRLGIPKVYSSKSDRIS 181 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G I+ + + GK + +F + + A + + H T Sbjct: 182 TVRGSIDPLDYFKNH--SSGKCLCSFREHDYASTAISLFLMAYDTVKHHSFCQQT----- 234 Query: 131 RSLYRKLPGIST-LHLTPQHFSYLNG--GKNTRYYKFVISVCKFIVNNSIPGQNKGHYRF 187 R +Y + T + Y +I + K I++ Sbjct: 235 RYIYNAFMMANQGKKKTKKEILETPYFSNPYYSDYNTLIDLSKRIISQKSLDFGS-SNAS 293 Query: 188 YDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITI 247 + + S+L++ F+ + + S RS + + ++E D+ I Sbjct: 294 NAYLFDI---SMLFEYFIRKLL---IRSGINVRSKFESLRKIQTCSLGKYERKLEPDLII 347 Query: 248 RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP--- 304 + I D KY + ++L+QL Y+ GG I+P Sbjct: 348 EGENGVYIFDVKYKH-----FDEKYGVNREDLFQLHTYI-GQWSNKETVCGGGFIFPIPE 401 Query: 305 ---------HVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQE 338 + ++ + G ++ + L I +E Sbjct: 402 KKWEKLCLEKTQGVISNKIQQQGKEMDFYVIFLPIPKENIAKE 444 >UniRef50_B3JJV2 Putative uncharacterized protein n=1 Tax=Bacteroides coprocola DSM 17136 RepID=B3JJV2_9BACE Length = 439 Score = 144 bits (363), Expect = 6e-33, Method: Composition-based stats. Identities = 45/277 (16%), Positives = 96/277 (34%), Gaps = 34/277 (12%) Query: 51 LQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIK 110 + +G+ +Y N ++G I+ + ++ +G+ + D +I+ Sbjct: 146 NEALTQGIYKEYQRNEYNDANVRGTIDINRHLKTNLPFNGRIAYRTREFSHDNHVTELIR 205 Query: 111 STLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKN---------TRY 161 T+ + K ++++A ++ + + H++ K Y Sbjct: 206 HTIDYIGKTSFGKMLLKNDA----DTHTSVAQIIHSTPHYNRQEREKIVKANLKVITHPY 261 Query: 162 YKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRS 221 Y + K + +NK + ++S L++++L ++ Sbjct: 262 YSSYTPLQKLCLRILRHEKNKYGAKDDKIHGVLFDVSYLWEEYLATILSKQ----GFKHP 317 Query: 222 YLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQ 281 K I N PR D +I DAKY ++I + ++ Q Sbjct: 318 NNKRGTGRIYLALPNQFPR-YPDFYREKGS--VIADAKYKRNIDT---------RDDVNQ 365 Query: 282 LMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKING 318 ++ YL+ LK + G+ I P K RY + G Sbjct: 366 MITYLYRLKAQ-----KGVFILPTNKVRTKERYHLYG 397 >UniRef50_A9DR64 Putative uncharacterized protein n=1 Tax=Kordia algicida OT-1 RepID=A9DR64_9FLAO Length = 474 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 49/320 (15%), Positives = 107/320 (33%), Gaps = 26/320 (8%) Query: 11 IYYMLTYAWGYLQEIKQANLE-AIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 + ML + + + + N +I+ ++ + + + + GL +Y TE Sbjct: 121 LKRMLNFVNDIYVDNQSTKADKTEETNEFQNIIAFLFIQSLEKATVLGLPKNYQSITERS 180 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDE 129 ++G+I+ ++ G STF ++ L K I + Sbjct: 181 NKVRGKIDINAYLKREIPFTGNLTSTFREQIYVQEIIDVLYLACKALEKR--FGKEIHKK 238 Query: 130 ARSLYRKLP-GISTLHLTPQHFSYLNG-----GKNTRYYKFVISVCKFIVN--NSIPGQN 181 +Y+ L S + +K I + I+ N + Sbjct: 239 ILGVYQLLKLNYSGVFPQNSVIEKAKNHFVLQNPMFSAFKKTIGYAEIILREQNLLVSNT 298 Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 + + +S L++ +L + R T T Q + +M Sbjct: 299 DNQLTTNGYLFD---VSQLFEVYLEKLLSRYFTDWYVTG-----QEELNVYQKMFYKRKM 350 Query: 242 ETDITIRSS--EKILIVDAKYYKSIFSRRMGTE-KFHSQNLYQLMNYLWSLKPENGENIG 298 D+ ++ ++++ DAK+ K + + + YQ+ +Y+ +P+ I Sbjct: 351 FPDLVMKHKLTNQLIVFDAKFKKMRLHKTQSSYSDLDRSDFYQIHSYIHYYQPD---VIA 407 Query: 299 GLLIYPHVDT-AVKHRYKIN 317 G L+YP + + Y N Sbjct: 408 GGLLYPLSNEININTTYSEN 427 >UniRef50_Q139N0 Putative uncharacterized protein n=1 Tax=Rhodopseudomonas palustris BisB5 RepID=Q139N0_RHOPS Length = 434 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 55/356 (15%), Positives = 123/356 (34%), Gaps = 38/356 (10%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGN----NLLDILGYVLNKGVLQLSRRGL 58 +P PV N+ ++ + L I A+ + + ++L+ L L + ++ RGL Sbjct: 77 RPKFPVSNLARVIDTSKRQLNSIPGADRSYLANDLSGGSVLNFLAANLVDALRPIAARGL 136 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRG-FHLNHGKTVSTFDMLNEDTLANRIIKSTLAILI 117 +Y+ +E +GRIE A T+RG K + D NRI+K+ L ++ Sbjct: 137 HKEYSCRSETTSHPRGRIEIAGTMRGWSRGQFHKVQAQRFDQTSDLPVNRILKAALESVL 196 Query: 118 K----HEKLNSTIRDEARSLYRKLPGIS------TLHLTPQHFSYLNGGKNTRYYKFVIS 167 K H + + A + + + P + L + + + + YY I Sbjct: 197 KLMWPHSTESRRLIVRANASFLEFPQLVGSCKPLDLAESQAILAARSLPADRIYYYRAIE 256 Query: 168 VCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDA 227 + I+++ + L++++L + + + ++ Sbjct: 257 IALLILSSRGISLQEEGVDVLLDSF-IINFDDLFEEYLRRVLQARAPNLLSV-KDGNFEG 314 Query: 228 SSISDQSLNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY 285 + P + D+ + + + ++ + KY ++ Q + Y Sbjct: 315 KRQLFEDRKDQPA-QPDVVLTWQPTSVNVVGEIKYKDR----------PSRDDINQAITY 363 Query: 286 LWSLKPENGENIGGLLIYPHVDTA---VKHRYKINGFDIGLCTVNLGQEWPCIHQE 338 + +LI+ ++H I G + +LG +E Sbjct: 364 ALCYNTKC-----AVLIHQCRSGESRGLRHHGTIRGIRLENYAFDLGAANLDAEEE 414 >UniRef50_C8VZS1 Putative uncharacterized protein n=1 Tax=Desulfotomaculum acetoxidans DSM 771 RepID=C8VZS1_DESAS Length = 439 Score = 141 bits (356), Expect = 4e-32, Method: Composition-based stats. Identities = 49/299 (16%), Positives = 108/299 (36%), Gaps = 16/299 (5%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLD--ILGYVLNKGVLQLSRRGLELDYNPNTEI 68 ++ M+ + A L+ L ++ ++ + ++ GL Sbjct: 95 LFKMIEEIFNVKLVTSNAALQRKNDFGFLIRRLISFIWLHKLANANKHGLPRHNVKKNYT 154 Query: 69 IPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN-STIR 127 +KGRI K++ L + VS F D RI+ IL+K +L ++ Sbjct: 155 GYNVKGRINVKKSV-ISLLTKEQVVSEFYEKEIDETIARILVQAYWILVKDYELGILSLP 213 Query: 128 DEARSLYRKL--PGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 D AR + L S+ ++ + +N + + ++ V+ I+NN P ++ Sbjct: 214 DNAREIINLLKSSRFSSQSVSQNEYDRINYKEIYQSFREVVDFSWDIINNKTPSKSVCTQ 273 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDI 245 F +M+ +++ ++ + L ++ + + DI Sbjct: 274 SQNGFSF-FIDMAEIWELYIRTALSKHLKKDKWKVMLD----YAVVYEDTFFKRCLIPDI 328 Query: 246 TIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 ++ + + DAKY + + +Q+ Y+ + + + G LIYP Sbjct: 329 VVKRGADVAVFDAKYKAM----NYSSLDVDRNDFFQIHTYM-NYYAQGKRLLAGGLIYP 382 >UniRef50_C2BVA3 Putative uncharacterized protein n=1 Tax=Mobiluncus curtisii ATCC 43063 RepID=C2BVA3_9ACTO Length = 390 Score = 141 bits (355), Expect = 5e-32, Method: Composition-based stats. Identities = 62/336 (18%), Positives = 107/336 (31%), Gaps = 38/336 (11%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDI----LGYVLNKGVLQLSRRGLE 59 P N ML Y ++ + + D L +L++ +L + + Sbjct: 62 PKYTSLNPVSMLLYLHDSQSAKLMNDVPSEYSSGSHDFCLSSLAEMLSQELLSFAAKPKI 121 Query: 60 LDYNPNTEIIPGIKGRIEFAKT-IRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 P E G+I + T +R + ++ D NRIIKS ++ Sbjct: 122 FRRKPTLEATSSAVGQINWPVTNLRARRGDAAPILTRRHRPTFDVPENRIIKSAAKRVLG 181 Query: 119 HEKLN----STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVN 174 + D A G + Q N G + YYK +S+ I+ Sbjct: 182 LLSSDAPGRRVTHDWANWQAATFAGYDDIRKVSQMMRTTNIGGSHSYYKNALSLSLVILE 241 Query: 175 NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQS 234 S + + F N M LY+ F+ R A T ++ +S S Sbjct: 242 ASGIDHGE-SWESDGFLFN---MPGLYEDFVRTSLMRA---AQPTALSVQKGFASSSFLL 294 Query: 235 LNLLPRMETDITIRSSEKI-LIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 N + D+TI I ++D KY +LYQ+ Y+ + Sbjct: 295 ANGEIELIPDLTIYRGGTIEAVLDVKYKAPDAK-----------DLYQIYTYM-----QF 338 Query: 294 GENIGGLLIYP--HVDTAVKHRYKINGFDIGLCTVN 327 + +I P V+ +G I ++ Sbjct: 339 AQLNEAYIISPSVRTGDMVE---TFDGHRIRYLGLD 371 >UniRef50_B8DPA2 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Desulfovibrio vulgaris str. 'Miyazaki F' RepID=B8DPA2_DESVM Length = 397 Score = 140 bits (354), Expect = 5e-32, Method: Composition-based stats. Identities = 65/352 (18%), Positives = 110/352 (31%), Gaps = 46/352 (13%) Query: 4 PVIPVRNIYYMLTYAWGYL-----QEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGL 58 P I N +++L A G + A+ + I+ L V ++ R Sbjct: 61 PKIGDVNFFHLLFKAEGLQNNTLQELNSFASYFTNEDHTPPIIIAKNLLLSVNEILHRSP 120 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGK-TVSTFDMLNEDTLANRIIKSTLAILI 117 + + G + KTI G H K T DT NRI+ + + I I Sbjct: 121 TAKRFKVKKNGNFVAGSLNIQKTIFGIHSRAHKPIHYTVKEKTLDTPENRILTAAINIAI 180 Query: 118 KHEKLNSTIR------DEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKF 171 + + ++ L T Q+ + G YYK I + + Sbjct: 181 DLLPTEQRLSYEPIYLAWLQRFPHSTDILADLETTAQNIASNKYGGPRDYYKRSIILAQI 240 Query: 172 IVNNSIPGQNKGHYRFYDFERNEKEMS--LLYQKFLYEFCRRELTSANTTRSYLKWDASS 229 + G F + ++ +++KF+ + E T+ S S Sbjct: 241 LFGYRGY----GLSGTTSFTGDAILLNTAAVFEKFVRKIISLEYTAKGIVVSKETNSPYS 296 Query: 230 ISDQSLNLLPRMETDITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWS 288 + N + DI I LI DAKY K + YQL YL Sbjct: 297 LY---TNGSYSVCPDIIISEGGNLRLIADAKYKKPTI-----------SDHYQLYTYLSV 342 Query: 289 LKPENGENIGGLLIYPHVDT--------AVKHRYKINGFDIGLCTVNLGQEW 332 L + G LI P ++ I + + ++L + + Sbjct: 343 LGAK-----RGALIAPSFTGFDIETKEFQTPSQHTITEIYLPMQNIDLAENF 389 >UniRef50_UPI0001B550BC hypothetical protein StAA4_26514 n=1 Tax=Streptomyces sp. AA4 RepID=UPI0001B550BC Length = 449 Score = 140 bits (353), Expect = 7e-32, Method: Composition-based stats. Identities = 55/319 (17%), Positives = 104/319 (32%), Gaps = 25/319 (7%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDY 62 +P + + I + A + +A + ++ + + +R GL Sbjct: 84 RPRLGIDTIASWASAALNL-HTVPRAAEARGNSALIAELTAALWRAALTAAARHGLPSFR 142 Query: 63 NPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKST---LAILIKH 119 + ++G ++ T S +++I + L L+ H Sbjct: 143 TRRGHVGSAVRGSLDSPGTFALRAARSPFVASVERAKLLGNPVSQVIVAADQVLDTLLHH 202 Query: 120 EKLNSTIRDEARSLYRKLPG---ISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNS 176 D + +L + + + T YK V + IV + Sbjct: 203 --RPGWRGDRVEEIVPRLRESVGARPRLPSLRDLRSVRYTPITLPYKRVADLSWQIVQRT 260 Query: 177 IPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRR--ELTSANTTRSYLKWDASSISDQS 234 P + R + +++ L++ FL RR L + T+ ++ + Sbjct: 261 APQASPTDERTHGLL---IDVAELWELFLLRCARRATALPVTHGTQYHVPAPLLRSARHP 317 Query: 235 LNLLPRMETDITIRSSEK-ILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 +L R+ DI I +E IVDAKY ++LYQL YL + E Sbjct: 318 TAVLGRLFPDILIGPAESPTAIVDAKYKPLN-----DRRGVDREDLYQLNAYLTAHNAEL 372 Query: 294 GENIGGLLIYPHVDTAVKH 312 G L+YP +D Sbjct: 373 -----GALVYPTLDQHPSP 386 >UniRef50_C2LL98 Putative uncharacterized protein n=2 Tax=Proteus mirabilis RepID=C2LL98_PROMI Length = 475 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 51/337 (15%), Positives = 106/337 (31%), Gaps = 33/337 (9%) Query: 11 IYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP 70 + Y++ A G+L+ G +L Y+ N + R GL Y + I Sbjct: 133 LQYIIADADGFLELENIGGESHADGYEW--LLAYLWNIKFKRAYRLGLPKTYITRNDRIS 190 Query: 71 GIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 ++G I + + GK + ++ + D+ A + + + R+ Sbjct: 191 RVRGTINATDYFQNK--SSGKYLCSYREHSYDSPATSLFIKAYEAVAHYSFC-HQTRNIY 247 Query: 131 RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDF 190 + GI S+ Y +I + K ++ + F Sbjct: 248 SAFLTANQGIKRSQQEILRTSHFT-NPFYNDYNVLIDLSKQVIGRKGSDFDS-QQDSSAF 305 Query: 191 ERNEKEMSLLYQKFLYEFCRRE-LTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRS 249 + S+L++ F+ + +R+ L + + ++S + ++E D+ I Sbjct: 306 FFDI---SMLFEYFIRKLIKRDGLRLLGKFELCQEIASGALS----GYMRKLEPDLVIEI 358 Query: 250 SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP----- 304 E++ + D KY ++L+QL Y+ G IYP Sbjct: 359 DERLFVFDVKYKA-----FDSQFGVKREDLFQLHTYIGQYGNIAAIKGCG-FIYPISEER 412 Query: 305 -------HVDTAVKHRYKINGFDIGLCTVNLGQEWPC 334 + G +I + L Sbjct: 413 WASLNLEKTQGVISDIIHQQGQEIPFHVLFLKIPEDT 449 >UniRef50_Q2GBH5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Novosphingobium aromaticivorans DSM 12444 RepID=Q2GBH5_NOVAD Length = 687 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 38/248 (15%), Positives = 89/248 (35%), Gaps = 18/248 (7%) Query: 108 IIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVIS 167 I+ + L +H + + R L L I + +T + + + R ++ + Sbjct: 432 IMAAATVFLARHTR-SLATRRTLDELRHALADIPLMPITRLPWQAVRIDRTNRRWEALFR 490 Query: 168 VCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDA 227 + + ++ + H + D M+ L++K++ RR L + Sbjct: 491 LARLLLQRDWQATHH-HAKAPDGLTLLFPMNDLFEKYIAVLLRRALAGSGIEVIDQGGHR 549 Query: 228 SSISDQSLNLL-----PRMETDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQ 281 + + + L R + DI +R +I+ I+D K+ K ++YQ Sbjct: 550 ACLGSFTGGHLETGEVFRTKPDIMLRRGREIVAIIDTKWKKLSLDPLDRKHGVSQADVYQ 609 Query: 282 LMNYLWSLKPENGENIGGLLIYPHVDTAV---KHRYKING--FDIGLCTVNLGQEWPCIH 336 LM Y + +L+YP V + ++ + G + + ++ + + Sbjct: 610 LMAYARLYQ-----TAELMLLYPARPGQVCAERAQFGMAGGSERLRIAMADVSLDEKALA 664 Query: 337 QELLDIFD 344 + L + Sbjct: 665 EALGVLVM 672 >UniRef50_C9PUK8 Putative uncharacterized protein n=2 Tax=Bacteroidales RepID=C9PUK8_9BACT Length = 437 Score = 137 bits (344), Expect = 9e-31, Method: Composition-based stats. Identities = 52/321 (16%), Positives = 104/321 (32%), Gaps = 42/321 (13%) Query: 21 YLQEIKQANLEAIPGNNLLDIL-GYVLNKGVLQLSRRGLELDYNPNTEIIPG-IKGRIEF 78 + + +LL I + +++ +GL Y E + IKGR Sbjct: 114 VTISFHKPAIPISQQQDLLSIFLITEYLSVLRRIAAKGLRKSYYMVEENLNNKIKGRCLV 173 Query: 79 AKTIRGFHLNHGKT---VSTFDMLNEDTLANRIIKSTLAILI------KHEKLNSTIRDE 129 A+ ++ + G+ + + D+ NRI+K L + +H ST+ + Sbjct: 174 ARNVKQNL-SKGRVTNNFCRYQVYGIDSCENRILKRALRFCVKQLEVYRHAFDTSTLDNI 232 Query: 130 ARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYD 189 R + + +T + G + + + + + ++ Sbjct: 233 VRFVNPHFDNVGE-EVTTKAIQTFKGNPIFKEHSTAVELAQLLLRRYSYDITLAGNHQIT 291 Query: 190 FERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRS 249 +MS L++ +++ R TS N + K + D ++ Sbjct: 292 TPPFWIDMSKLFELYVFRHLRLVFTSKNEVCYHPKAHRQEL-------------DYLLKP 338 Query: 250 SE--KILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLW--------SLKPENGENIGG 299 + +VDAKY R G + Q+ Y L EN I Sbjct: 339 CHWAEPYVVDAKYK----PRYKGMIGIDKDDARQVAGYARLQKIYDMLKLDAENALPIKC 394 Query: 300 LLIYPHVDTAVKHRYKINGFD 320 L++YP D + R+ + Sbjct: 395 LIVYP--DQEQQERFTFTDTE 413 >UniRef50_C2WFQ0 Putative uncharacterized protein n=1 Tax=Bacillus cereus Rock3-44 RepID=C2WFQ0_BACCE Length = 439 Score = 134 bits (338), Expect = 4e-30, Method: Composition-based stats. Identities = 45/308 (14%), Positives = 98/308 (31%), Gaps = 22/308 (7%) Query: 11 IYYMLTYAWGYLQEIKQA-NLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEII 69 + YML +G I ++ + A +L ++ + + ++G Y Sbjct: 81 LSYMLKKVFGSKAFIFESMQVSASRDKAFEQMLLFIFVYLLEKAMKKGTFKQYTNFDFYD 140 Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAIL-IKHEKLNSTIRD 128 IKG I F ++ + G+ + N I + +L K+ + Sbjct: 141 SNIKGTINFNVYMKQIMIQDGRLPYHVRERSAMNPVNIGILTAYDVLKSKNPTFVQQVFK 200 Query: 129 EARSLYRKLPGISTLHLTPQHFSYLNG---------GKNTRYYKFVISVCKFIVNNSIPG 179 + L R + I + + + VC I+ +S Sbjct: 201 KNPVLSRFIAQIKGELPNYRSIDKKQLLKQLTKSIRHPYFTEVESLRKVCIEIIRHSGSD 260 Query: 180 QNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLP 239 + + +++ L++ FL E+ + D+ + + Sbjct: 261 IFQEENQM--ITGLLIDVNKLWEYFLEHTIFAEMKKMAASYEVSTQDSYPVLYELKGFDM 318 Query: 240 RMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 +++ D + R+ + + DAK + T ++YQ+ Y+ +L G Sbjct: 319 KIKPDFVLSRNKQNQAVFDAK--HRPAWSKKDTSAVKK-DVYQISMYMSALNVSIGGV-- 373 Query: 299 GLLIYPHV 306 IYP Sbjct: 374 ---IYPTT 378 >UniRef50_D2EQZ8 Putative uncharacterized protein n=1 Tax=Streptococcus sp. M143 RepID=D2EQZ8_9STRE Length = 478 Score = 134 bits (337), Expect = 6e-30, Method: Composition-based stats. Identities = 39/283 (13%), Positives = 94/283 (33%), Gaps = 16/283 (5%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHG-KTVSTFDML 99 I+ Y+ + +++ L Y + I+G I+ + I ++ K + Sbjct: 161 IVQYLFLVSLRKVAGTNLPKKYVYKKDRDYSIRGNIDIERYITNDLVSSDKKISFRYPER 220 Query: 100 NEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG--GK 157 I+ + + + I L G T + Sbjct: 221 ENAQNIIDILYCAIKECSVEQAVLPDILSVRNYLAESFSGRRPSKYTVNNILKDKILKNS 280 Query: 158 NTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 YK + ++I+N + + + S L++ ++Y + L Sbjct: 281 LYANYKKPLQYAQYILNLRELN-DGNTNKSNSVSGYLVDASFLWEMYIYNLMKIHLHDW- 338 Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITIRS--SEKILIVDAKYYKSIFSRRMGTEKFH 275 + + + + + D +R + +I+++DAK+ F+ R Sbjct: 339 EVDAQEELHFYEQTFYAKDNY----PDFVLRHRLTGEIVVLDAKFKNMEFNGRD----VD 390 Query: 276 SQNLYQLMNYLWSLKPENGENIGGL-LIYPHVDTAVKHRYKIN 317 + ++ QL Y + + G+ G LIYP + +++ ++ Sbjct: 391 NADIRQLHGYSYYYHLQYGDKFRGAGLIYPAKERIPQNKVNVD 433 >UniRef50_Q1LJY6 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Cupriavidus metallidurans CH34 RepID=Q1LJY6_RALME Length = 424 Score = 132 bits (331), Expect = 2e-29, Method: Composition-based stats. Identities = 50/311 (16%), Positives = 91/311 (29%), Gaps = 41/311 (13%) Query: 3 QPVIPVRNIYYMLTYAWG-----YLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRG 57 +P +P+ N+ +L Y+ + + L V ++ R G Sbjct: 71 RPKVPLVNLERIL-YSSNHKPYVLESFQRAYGHHTRASEPIETFLVDRFLDWVEEIHRFG 129 Query: 58 LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILI 117 L Y E +GR+ FA TIR N +D D NR IK+ L L Sbjct: 130 LLKQYRVIHEDGFTPRGRLNFAATIRLRSRNQQSLAYQWDDRTADNGPNRFIKAVLLQLA 189 Query: 118 KHEKLNSTIRDEAR--SLYRKLPGIST-------LHLTPQHFSYLNGGKNTRYYKFVISV 168 ++ R +AR +S + L + YYK I + Sbjct: 190 DRDEFLHDRRRKARLSVCVDYFSSVSDVDVHSVLIDPLVDDVEQLPSTR--EYYKTAIVL 247 Query: 169 CKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCR---------RELTSANTT 219 K ++ S + ++K++ + R Sbjct: 248 AKMLIEKSGLAFLSDQHSVLLPTLLLDLDEA-FEKYVLTLLQQVNIRRDDVRVFDGNIDG 306 Query: 220 RSYLKWDASSISDQSLNLLPRME--TDITIRSSE-----KILIVDAKYYKSIFSRRMGTE 272 K S + ++ D+ + + L++D KY + Sbjct: 307 ALGGKKQLLKDSGLENQVKSTVQATPDVLVEKYSIPSIPRNLVIDMKYKEV-------KN 359 Query: 273 KFHSQNLYQLM 283 +L Q++ Sbjct: 360 IVERGDLNQVI 370 >UniRef50_B0TG77 Putative uncharacterized protein n=1 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TG77_HELMI Length = 485 Score = 127 bits (319), Expect = 6e-28, Method: Composition-based stats. Identities = 40/324 (12%), Positives = 97/324 (29%), Gaps = 26/324 (8%) Query: 3 QPVIPVRNIYYMLTYA-WGYLQEIKQANLEAIPGNNLLDI-LGYVLNKGVLQLSRRGLEL 60 QP I ML W + E + + + L ++ + + R L+ Sbjct: 113 QPRFDWPGIGPMLARMGWRVIPEPLRLPMLPRSDRKVPPWVLSTMVLFRIRAMLER-LQR 171 Query: 61 DYNPNTEIIPGIKGRIEFAKTIRGFHL--NHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 + + +G +++ + R + F L +D + TL + Sbjct: 172 RFTIGEAELTAPRGTVDWGRYARTKVPTGRLLEVPCRFPDLRDDAALLGALHFTLRRQLA 231 Query: 119 HEKLNSTIR-------DEARSLYRKLPGISTLHLTPQH-FSYLNGGKNTRYYKFVISVCK 170 + + L ++ + T + +L T ++ + + Sbjct: 232 SLESQRATGTVVLPLIALCQGLLDRVRHVPPRRPTDRDRLLWLRAPLQTDVFRDGLRAME 291 Query: 171 FIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSI 230 + ++ + + + S + +++ R + + Sbjct: 292 WTIDERGLAGLA-EHTGLPWVMSMDAFSEAWCEYVVTELARHYGGHVRAGRLRETVTPLL 350 Query: 231 SDQSLNLLPR-METDITIRSSEKILIVDAKYYKSI----FSRRMGTEKF----HSQNLYQ 281 + R + D+ + ++ +I+DAKY R E H +L Q Sbjct: 351 WEPPFTGSQRFLMPDLVLERDDETIIIDAKYKSHWEDLSLERWFDLETTVRERHRNDLLQ 410 Query: 282 LMNYLWSLKPENGENIGGLLIYPH 305 ++ Y S + + L+YP Sbjct: 411 VLAYTTSYATKR---VTACLLYPC 431 >UniRef50_A8UWA1 Putative uncharacterized protein n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8UWA1_9AQUI Length = 443 Score = 124 bits (311), Expect = 5e-27, Method: Composition-based stats. Identities = 53/300 (17%), Positives = 101/300 (33%), Gaps = 37/300 (12%) Query: 34 PGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIP-GIKGRIEFAKTIRGFH--LNHG 90 + + L + +L ++GL+ + + + +KG+I KT + H Sbjct: 148 EDDTFMLFLVIHYLNLLAKLVKKGLKKGFVFVEQDLNGRVKGKINIKKTYQRHMSKGIHT 207 Query: 91 KTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN----STIRDEARSLYRKLPGISTLHLT 146 KT F +L D L N+I+K+ L IK +L S + + L +S + Sbjct: 208 KTTCRFQILTHDFLDNQILKAALIQAIKFVRLMKFEISGLNEIMNYLSYLFESVSLKRVL 267 Query: 147 PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLY 206 FS + YK + + + I+ N ++ + M L++ +++ Sbjct: 268 DTDFSKVRHSPFFPEYKEALELARMILKNLGNDPFSNVSKYTTIQPYIINMPKLFELYVW 327 Query: 207 EFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFS 266 + + + Y D + K LIVDAKY Sbjct: 328 LKLKGKFSRGKVIYQYNANGD--------------IPDFIVE--GKNLIVDAKYKYI--- 368 Query: 267 RRMGTEKFHSQNLYQLMNYLWSLKPE--------NGENIGGLLIYPHVDTAVKHRYKING 318 K ++++ QL Y + + ++ YP D K+ G Sbjct: 369 ---DESKPSTEDIGQLSRYGRNKEVRKLALGKHIKNREPRLVIAYPTFDCFNTKCLKVEG 425 >UniRef50_A5WH29 Putative uncharacterized protein n=1 Tax=Psychrobacter sp. PRwf-1 RepID=A5WH29_PSYWF Length = 542 Score = 122 bits (306), Expect = 2e-26, Method: Composition-based stats. Identities = 40/315 (12%), Positives = 92/315 (29%), Gaps = 65/315 (20%) Query: 47 NKGVLQLSRRG---LELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDT 103 K R+G L Y ++ P +G+I + ++ S + + + Sbjct: 191 LKKAEMQLRQGADILPSRYQSKSQNQPKAQGKINMSAQLKNNWHRPHYLYSEQTVFDTNK 250 Query: 104 LANRIIKST---LAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNG----- 155 + + + L L+ +++S A S ++ S TP L Sbjct: 251 RLAQFLFTAWQQLRYLLHPSQVDSNYAVSAHSHRQQHLNHSPYSATPIALQQLKALAADQ 310 Query: 156 --------------------GKNTRYYKFVISVCKFIVNNSIPGQNK-----GHYRFYDF 190 + + + +++++S+ Q Sbjct: 311 WLPTYKALQSEALTWRATLGTRQAQLLTQALDWAWWLLSHSLQSQPPDPSARNSTSLLPT 370 Query: 191 ERNEKEMSLLYQKFLYEFC----RRELTSAN-TTRSYLKWDASSISDQSL---------- 235 M +++++ ++ L + + +W + + D + Sbjct: 371 AALIINMQFAFERWVLGKLSVWVQQTLPGSRLIVQPSFEWLYAHLEDSNHLAFVCGKPIA 430 Query: 236 --NLLPRMETDITIRSSEKIL--IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP 291 +L+ R++ D I S + ++D KY ++ + QL Y L Sbjct: 431 AYHLVQRLQPDACIYDSRAKITHVIDIKYKALSTVQQ-----VSGSDWQQLYVYQHYLN- 484 Query: 292 ENGENIGGLLIYPHV 306 LIYP Sbjct: 485 ----RPQAWLIYPKN 495 >UniRef50_C8PX06 Putative uncharacterized protein n=1 Tax=Enhydrobacter aerosaccus SK60 RepID=C8PX06_9GAMM Length = 426 Score = 120 bits (302), Expect = 7e-26, Method: Composition-based stats. Identities = 43/246 (17%), Positives = 91/246 (36%), Gaps = 30/246 (12%) Query: 70 PGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN-----S 124 P ++G++ + ++ K + DT NR++K+T+ +I + Sbjct: 156 PYLQGKLLVKEQLQHNFHQPHKFYHQTENFAMDTAGNRLVKTTIERVIGSMAMPLPPQWQ 215 Query: 125 TIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGH 184 + A+ LY L + Q + L R Y IS C ++ ++G Sbjct: 216 VVDTVAKDLYDSLFSQAL-----QELTALPSLLAQRNY-TFISFCYALLTLQ-QASSQGQ 268 Query: 185 YRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETD 244 + + N M ++K++ + + N K ++ ++ D Sbjct: 269 FLTPTWLVN---MPFAFEKWVGRKIHEQFAAQNFELVEQKRQPLTVQQGL-----TIKPD 320 Query: 245 ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYP 304 I ++S++K++I D K+ K+ ++YQL+ Y + LI P Sbjct: 321 IWLKSADKLIIADVKWKKTPTFN-----DISLADMYQLLTYASEFDAD-----EAWLIVP 370 Query: 305 HVDTAV 310 + T + Sbjct: 371 TLGTQL 376 >UniRef50_Q4FV58 Putative uncharacterized protein n=2 Tax=Psychrobacter RepID=Q4FV58_PSYA2 Length = 514 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 93/319 (29%), Gaps = 61/319 (19%) Query: 37 NLLDILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 L D L + +L+ Y ++GR+ + +R + K V Sbjct: 164 PLSDWLITQF---LQRLAHHQPITHYQTQIHNQTALQGRLLIKEQLRHNSMQPHKFVCER 220 Query: 97 DMLNEDTLANRIIKSTLAILIKHEK-------LNSTIRDEARSLYRKLPGISTLHLTPQH 149 +L++ LANRIIKS L +L L + Y S Sbjct: 221 SVLSKGMLANRIIKSALKLLAPLLSQSNLLLYLQPWQQVSVLHQYEIRQLASIYFQAKHE 280 Query: 150 FSYLN-GGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFER--------NEKEMSLL 200 + + + + ++ +++ S F + +M+ Sbjct: 281 LAIQPLQAQQLQAAQQLVDFAYWLLCQSHAETGHSIDSQNPFHKKLTPQRLCLLIDMNQA 340 Query: 201 YQKFLYE---FCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSS------- 250 ++++ + ++L+ + ++D + D+ I Sbjct: 341 FEQWASQRIALFFQQLSDDYK--PLFQTQRVWLNDAEGQACLSIRPDLLIYKQIHSSAEN 398 Query: 251 ----EKIL----------------IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK 290 + + ++D K+ + + + YQL +Y + + Sbjct: 399 TAMYDNYVSQAKDAREKHSRHYSHVIDIKWKHLAHASA-----ISASDAYQLSSYAQAYQ 453 Query: 291 PENGENIGGLLIYPHVDTA 309 E L+YP D Sbjct: 454 AE-----QVWLVYPVQDDQ 467 >UniRef50_B9ZE69 Putative uncharacterized protein n=1 Tax=Natrialba magadii ATCC 43099 RepID=B9ZE69_NATMA Length = 586 Score = 110 bits (275), Expect = 9e-23, Method: Composition-based stats. Identities = 58/378 (15%), Positives = 130/378 (34%), Gaps = 56/378 (14%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANL-------EAIPGNNLLDILGYVLNKGVLQLSRR 56 P I I+ ML + + I + + I +++L ++ +G+ + R Sbjct: 100 PKIDWGPIFDMLLAVYDQNRSIDYHGIPLQDFLSDDIELDDVLVVMAINYLEGLETIQRN 159 Query: 57 GLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVST----FDMLNEDTLANRIIKST 112 G D +G I+ +T+ LNH + + D AN ++ Sbjct: 160 GYIRDLILRRTNSLEGRGEIDVEQTL----LNHARGTVEPNWIRNETEYDNAANSLLHYA 215 Query: 113 LAILIKH-------------EKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLN---GG 156 L++ +++ S + E L G+ + + + Sbjct: 216 GKTLLRLFRQNASEYDHPGYDRIFSEVHREIERLEGM--GVDSGLDRIDEYRRITLGDLP 273 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 K +YY+ V K ++++S+ Q K R M L++++ REL+ Sbjct: 274 KQRQYYRKAFDVAKAVLSSSLGQQLKDGPREL-VVDYVLNMESLFEQYSQVVIERELSYI 332 Query: 217 NTTRSYLKW------DASSISDQSLNLLPRMETDITIRSSEKIL-IVDAKYYKSIFSRRM 269 + + + S++ E D ++ E+ + ++D+KYY Sbjct: 333 KSYDHFDSIANVKPASSPSVNPFEGENQIYHEPDHALQEGEETIAVLDSKYYAEGHDPVK 392 Query: 270 GTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKINGFDIGLCT---- 325 ++ L +Y + L + + L+ P + R G ++ + + Sbjct: 393 ESQSRSR-----LFSYAYLLHADRLAFL-CPLLEPR-----RRRVTQTGAELEIVSPTGD 441 Query: 326 VNLGQEWPCIHQELLDIF 343 +L + +H+ L + Sbjct: 442 FSLDKYDEVVHEYLHSVL 459 >UniRef50_D0Z8V4 5-methylcytosine restriction system component n=2 Tax=Edwardsiella RepID=D0Z8V4_EDWTE Length = 185 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 45/134 (33%), Gaps = 8/134 (5%) Query: 182 KGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRM 241 KG R + M +++ ++ + R + + + + S+ + R+ Sbjct: 2 KGDNRAFSLLF---PMEKVFEHYVAKTLREQYAPQVAVHAQV--QSKSLVTHADAQWFRL 56 Query: 242 ETDITIRSSEKIL-IVDAKYY--KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIG 298 + D+ + ++++ ++D K+ + + YQ+ Y + Sbjct: 57 KPDMVMIQGKQVIAVLDTKWKLLDPTLANGADKYALQQSDFYQMFAYGHHYFDQQITVRE 116 Query: 299 GLLIYPHVDTAVKH 312 L+YP Sbjct: 117 MFLVYPAHANFTAP 130 >UniRef50_B9D5W2 Putative uncharacterized protein n=1 Tax=Campylobacter rectus RM3267 RepID=B9D5W2_WOLRE Length = 427 Score = 105 bits (261), Expect = 3e-21, Method: Composition-based stats. Identities = 43/320 (13%), Positives = 91/320 (28%), Gaps = 25/320 (7%) Query: 41 ILGYVLNKGVLQLSRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLN 100 +L + G Y + G ++ I+ GK S Sbjct: 118 VLYASFISRLKLAKLSGFPSVYKKIPFRDYALHGSLDVKNFIKKDQPFMGKISSRKSSRV 177 Query: 101 EDTLANRIIKSTLAILIKHEK----LNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGG 156 D + R++ IL++ + IR+ S + S Sbjct: 178 PDEVVARVLLKAYDILVRKNPKFALYDKEIRNFLL-ANANGEMKSVKDINVALNSKSVMN 236 Query: 157 KNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA 216 + + YK + + + I+ N+ + +F L++ ++ L Sbjct: 237 ELYKDYKIALQIARVIILQDSRYANENAVKNLNF-GYLLYAPNLFELYVKGLIEEALKIL 295 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITIR-SSEKILIVDAKYYKSIFSRRMGTEKFH 275 L + D ++ ++ +++DAKY E Sbjct: 296 REKH-NLDFMLLYQWPNDDERY---RVDYLLKDKKDRTIVIDAKYRYFCERCGYCNETDM 351 Query: 276 SQNLYQLMNYLWSLKPENGENIGGLLIYPHV-------DTAVKHRYKINGFDIGLCTVNL 328 NL Q+ Y+ + K + G+L+Y + Y IN D + Sbjct: 352 -SNLVQIGKYIAAYKAKM-----GILVYARNCSCHKVLSENKEPIYLINVLDCPSKK-DF 404 Query: 329 GQEWPCIHQELLDIFDEYLK 348 + L ++ + + Sbjct: 405 NESIEAFKLGLANVIKKNFE 424 >UniRef50_B9LVU8 Putative uncharacterized protein n=1 Tax=Halorubrum lacusprofundi ATCC 49239 RepID=B9LVU8_HALLT Length = 585 Score = 103 bits (256), Expect = 1e-20, Method: Composition-based stats. Identities = 52/345 (15%), Positives = 113/345 (32%), Gaps = 39/345 (11%) Query: 3 QPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPG-------NNLLDILGYVLNKGVLQLSR 55 P I +I+ ML + + I+ + +++ +L G+ + R Sbjct: 99 DPKIDWEHIFDMLLAVYDQNRSIEYHGIPLQDFLSDDIHLDDVFVVLAINYLDGLETIHR 158 Query: 56 RGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAI 115 +G D +G I+ +T+ + + D AN ++ Sbjct: 159 QGYIRDLVIRRLDSLDGRGEIDVEQTLLNHGRGTLEPHWIRNETEYDNAANSLLHFAGKT 218 Query: 116 LIKH-------------EKLNSTIRDEARSLYRKLPGIS---TLHLTPQHFSYLNGGKNT 159 L++ +++ S + E L G+S + S + K Sbjct: 219 LLRLFRQNSHENDHPAYDRIFSEVHREVERLESM--GVSSGLDRMDAYRRLSLSDLPKQR 276 Query: 160 RYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTT 219 RYY+ V K ++++S+ Q + R M L++++ REL+ + Sbjct: 277 RYYQKAFDVAKAVMSSSLGQQLRDGPREL-VVDYVLNMESLFEQYSQVVIERELSYIKSY 335 Query: 220 RSYLKW------DASSISDQSLNLLPRMETDITIRSSEKIL-IVDAKYYKSIFSRRMGTE 272 + S++ E D ++ +K L ++D+KYY + Sbjct: 336 DHLGDLDDVTPVRSPSVNPFEGEGQIYHEPDHALQEGDKTLAVLDSKYYAEGHDPVKESP 395 Query: 273 KFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKIN 317 L +Y + L + + L+ P + ++ Sbjct: 396 SRSR-----LFSYAYLLHSDRLAFL-CPLLEPKRRRVTQTDAELQ 434 >UniRef50_Q9ZMQ3 Putative n=6 Tax=Campylobacterales RepID=Q9ZMQ3_HELPJ Length = 406 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 38/258 (14%), Positives = 82/258 (31%), Gaps = 20/258 (7%) Query: 54 SRRGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKT-------VSTFDMLNEDTLAN 106 G + + KGRI F+KTI+ + F + + N Sbjct: 92 LSHGYYSE--NKSYYENNAKGRINFSKTIKKNRPIIQTFNNKNSFVYTRFQVKRKMINEN 149 Query: 107 RIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVI 166 +I + + HE + + + K + + K + Sbjct: 150 ELI-TAINKYCVHEAFSKFGFVFSSFMPPKFNLPTDKNYCIYLLENKLNNTFNDDKKILF 208 Query: 167 SVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWD 226 K I+ + DF+ +++++ + + + KW+ Sbjct: 209 QSMKNILLQ-----DDNILDKTDFKFGTHHFYVVWERMIDRAFG--IKNKEVYFPKTKWN 261 Query: 227 ASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYL 286 + L ++ D + +KI I+DAKYYK S + + Q++ Sbjct: 262 LRCSNQNPDYL---LQPDSIMLFDDKIYILDAKYYKYGISGVASDLPNSASIIKQIVYGE 318 Query: 287 WSLKPENGENIGGLLIYP 304 ++ K E + + + + P Sbjct: 319 YAAKLETKKEVYNIFLMP 336 >UniRef50_Q6L339 Putative uncharacterized protein n=1 Tax=Picrophilus torridus RepID=Q6L339_PICTO Length = 217 Score = 100 bits (250), Expect = 7e-20, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 74/216 (34%), Gaps = 26/216 (12%) Query: 150 FSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFC 209 F+ ++ + ++ + + I+ N +M++++Q F F Sbjct: 3 FNTVSFNRLNERFEIPYNYAEMIMKNMRLDIGNDKRTM----MMLFDMNMIFQNFFTIFI 58 Query: 210 ---RRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR----SSEKILIVDAKYYK 262 RR++ T R ++ + + L + D+ I + + I I+D KY Sbjct: 59 IRNRRKIFQGKTVRIIPQYSRRNFIFSDSHALRITKPDLYIEVEDINKKNIFILDMKYKL 118 Query: 263 S--------IFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHR- 313 I +LYQ+ Y + +L++P A+ + Sbjct: 119 LQKADIEEYINDHIEDVYSVSQLDLYQMFTYSDLYGTDGT-----ILVFPGRVGAISNPY 173 Query: 314 -YKINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 +K NG + +C + L + L++ + Sbjct: 174 MFKENGRILWICIIPLDFTGDSWEERLVECVKGFFD 209 >UniRef50_C3JNW8 Putative uncharacterized protein n=1 Tax=Rhodococcus erythropolis SK121 RepID=C3JNW8_RHOER Length = 415 Score = 100 bits (248), Expect = 1e-19, Method: Composition-based stats. Identities = 51/325 (15%), Positives = 98/325 (30%), Gaps = 70/325 (21%) Query: 63 NPNTEIIPGIKGRIEFAKTIR---GFHLNHGKTVSTFDMLNEDTLANRIIKSTL---AIL 116 E + + R++ A+ IR DT NR+ L ++ Sbjct: 83 RRREETL---RSRLDVAQYIRDRGRPAARPRSFPLVTQERQLDTPENRLAAGVLGNIRLI 139 Query: 117 IKHEKLNSTIRD--EARSLYRKLPGISTLHLTPQ-----------HFSYLNGGK----NT 159 + +E + + ARS +R L IS + + + N Sbjct: 140 LANEIFPARTAESTLARSHFRALTKISREPVFSSLKRTSFARKDMTLTRFRVNRRMTGND 199 Query: 160 RYYKFVISVC--KFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSAN 217 Y+ ++ + + +E +++ + E R L Sbjct: 200 HPYRQLLQWIDNWLSIAGLVGDAGGDQTVDLALPESESYWEKVFEVWCLEQTRSGLLRLG 259 Query: 218 TT----------RSYLKWDASSISDQSLNLLPRME--------------------TDITI 247 R+ S QS+++ +M+ DI I Sbjct: 260 WHTDSDFRLHSSRARSPIATFSKDGQSVDVFFQMQIPLGLGRWKSERTAAALVGIPDIAI 319 Query: 248 R-SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP-ENGENIGGLLIYPH 305 LI+DAK+ S+ E+ Y+++ Y + + GLLI+ Sbjct: 320 ACKGRAPLIIDAKWRFRSLSQGTSEEQ------YKMLGYAENFAHNQPALGFFGLLIF-- 371 Query: 306 VDTAVKHRYKINGFDIGLCTV--NL 328 + AV + G + L T+ +L Sbjct: 372 LSDAVDTQSFSRGENSRLTTLRTDL 396 >UniRef50_Q5UZU6 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5UZU6_HALMA Length = 186 Score = 97.0 bits (240), Expect = 8e-19, Method: Composition-based stats. Identities = 19/93 (20%), Positives = 40/93 (43%) Query: 4 PVIPVRNIYYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYN 63 P N+ Y+L YA + G++ +D L + N+ + ++ RRGL +Y Sbjct: 89 PKAAGNNLLYLLRYAQNVSPTTIEQQTGLGQGDSFVDALAALFNQELQEILRRGLHSEYQ 148 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTF 96 + ++GR++ + ++ K T+ Sbjct: 149 TVSSEEKQLRGRLDVQRQLQRQGPVPTKFECTY 181 >UniRef50_D2MHZ4 Putative uncharacterized protein (Fragment) n=1 Tax=Candidatus Poribacteria sp. WGA-A3 RepID=D2MHZ4_9BACT Length = 164 Score = 89.3 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 58/175 (33%), Gaps = 19/175 (10%) Query: 175 NSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQS 234 N G + M +Y+ F+ RR T + S Sbjct: 1 NQGLTTYSGRHINQSLLF---PMEQVYEDFVTHGFRRYQTEFQVVAQGPRERMLQPSSGH 57 Query: 235 LNLLPRMETDITI--RSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPE 292 M+ DI++ SS I+DAK+ + H +LYQ+ Y K + Sbjct: 58 NA----MKPDISLCKPSSNVRFILDAKWIHLGENENKTISDVHGSDLYQIYAYGKRYKCQ 113 Query: 293 NGENIGGLLIYPHVDTAVKH-RYK-INGFDIGLCTVNLGQEW---PCIHQELLDI 342 L+YP T ++ ++ +G + L ++ + + L ++ Sbjct: 114 T-----VALVYPRNSTFIRPCNFQFFDGLKLVLLPFDVSSPAGSERSVQRALREL 163 >UniRef50_B2UGA5 McrBC 5-methylcytosine restriction system component-like protein n=1 Tax=Ralstonia pickettii 12J RepID=B2UGA5_RALPJ Length = 164 Score = 85.0 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 44/145 (30%), Gaps = 24/145 (16%) Query: 218 TTRSYLKWDASSISDQSLNLLPRMETDITI-RSSEKILIVDAKYYKSIFSRRMGTEKFHS 276 R + + M+ D+T R I+D K+ + E Sbjct: 10 VIRLQSPQKYLAFEESQQRSAFLMKPDVTASRDGRVRWILDTKWKELSA--GEAKEGVAQ 67 Query: 277 QNLYQLMNYLWSLKPENGENIGGLLIYPHVDTA---------------VKHRYKINGFDI 321 +LYQ+ Y +L+YPH + + + Sbjct: 68 SDLYQMYAYASCYN-----CSEVVLLYPHHGALGQSAGVRATYLLNPWAERASQEPARRV 122 Query: 322 GLCTVNLGQEWPCIHQELLDIFDEY 346 + T++L + + ++L I +Y Sbjct: 123 RVATMDLA-DLKTVPRQLERIVLDY 146 >UniRef50_A6Y209 McrBC 5-methylcytosine restriction system component n=2 Tax=Gammaproteobacteria RepID=A6Y209_VIBCH Length = 434 Score = 76.6 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 37/320 (11%), Positives = 84/320 (26%), Gaps = 28/320 (8%) Query: 1 MEQPVIPVRNIYYMLTYAWGYLQEIKQANLEAIPG--NNLLDILGYVLNKGVLQLSRRGL 58 +E R ++ L+ + + + L A G +L ++ + RR Sbjct: 89 LEDTDAAWREDFFFLSTLSRHGRLLASERLSASGGTPRDLSTLVARSITSMYEARKRR-P 147 Query: 59 ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK 118 Y E I G + I + + N I + L+ Sbjct: 148 LRSYRRVREAEFFIDGDPDPVDLI---FPSPDGFEQELIRFDRKNGWNGDIVAAAKELLS 204 Query: 119 HEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIP 178 S R + R +K + + ++ Sbjct: 205 EVSDPSVASSLVRLIE------DLSPQNVAANRRNPIPARHRAWKPLHELSIDVLGGLGL 258 Query: 179 GQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLL 238 N+G + N +++ L R + + + S + Sbjct: 259 NYNQGQAHAPGYLVNT---WRVWEDLLTVAARLGFGRSAVV-PQQGYPLGTKIRMSTGAV 314 Query: 239 PRM--ETD--ITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENG 294 ++ D I + + +++DAKY + + + ++Y+ + + + Sbjct: 315 SKLSVYPDCVIELDGTRPRILLDAKYKGHVEKGQ---LRISEADIYEALAFSKA-----T 366 Query: 295 ENIGGLLIYPHVDTAVKHRY 314 L YP Sbjct: 367 GCNLVALAYPAQPGDAPQPV 386 >UniRef50_D0YRU6 Putative ATPase family associated with various cellular activities (AAA) n=1 Tax=Mobiluncus mulieris 28-1 RepID=D0YRU6_9ACTO Length = 461 Score = 72.3 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 41/94 (43%), Gaps = 7/94 (7%) Query: 262 KSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGEN-----IGGLLIYPHVDTAVKHR--Y 314 + S NLYQ+ Y+ + + E ++ + G+L+Y D ++ Y Sbjct: 364 HPFHPKNWDKHTIVSANLYQIFTYVKNKQAELNQSGGSRQVSGMLLYARTDEDIQPDGVY 423 Query: 315 KINGFDIGLCTVNLGQEWPCIHQELLDIFDEYLK 348 +++G I + T++L + + +L I + + Sbjct: 424 QMSGNQISVTTLDLNCPFEQLSAQLNSIAATHFE 457 >UniRef50_Q5UZU4 Putative uncharacterized protein n=1 Tax=Haloarcula marismortui RepID=Q5UZU4_HALMA Length = 157 Score = 71.2 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 61/161 (37%), Gaps = 29/161 (18%) Query: 197 MSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRS------- 249 M+ +Y++ + + S + + + +++ + + M D + + Sbjct: 1 MNTVYERVIERAVKSVAESHDRWEATGQAHTTNLISGTPTV--NMYPDFAVSNVETETDD 58 Query: 250 -SEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDT 308 E +++ DAK+ T + ++YQL +Y+ + K G++ YP D Sbjct: 59 GDETVVVGDAKWK---------TGSVSNDDIYQLTSYVLARKA------PGIVFYPAQDG 103 Query: 309 AVKHRYKINGF-DIGLC---TVNLGQEWPCIHQELLDIFDE 345 A + Y+I ++ + T + + + ++ Sbjct: 104 AAEREYQIKNEWELKIVELPTEDYSASFESFVDTIETAVED 144 >UniRef50_B5CQ36 Putative uncharacterized protein n=1 Tax=Ruminococcus lactaris ATCC 29176 RepID=B5CQ36_9FIRM Length = 124 Score = 67.7 bits (164), Expect = 6e-10, Method: Composition-based stats. Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 5/73 (6%) Query: 280 YQLMNYLWSLKPENG---ENIGGLLIYPHVDTA--VKHRYKINGFDIGLCTVNLGQEWPC 334 YQ+ Y+ + + E + G+L+Y D A + YK++G I + T++L ++ Sbjct: 46 YQIFTYVKNKEIELSAQPHEVFGMLLYAKTDEAVLPNNSYKMSGNTISVKTLDLDCDFSE 105 Query: 335 IHQELLDIFDEYL 347 I +L I + + Sbjct: 106 IANQLNKIVESHF 118 >UniRef50_Q60CI5 Conserved domain protein n=1 Tax=Methylococcus capsulatus RepID=Q60CI5_METCA Length = 549 Score = 62.3 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 96/319 (30%), Gaps = 63/319 (19%) Query: 85 FHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIK-----------------HEKLNSTIR 127 L G+ S DT NR +K L L + H ++ I Sbjct: 235 RRLFPGRVRSRRRHATVDTPENRFVKFFLTRLEQRLAQIKAMLGEGGGTFLHPEMGDEIG 294 Query: 128 DEARSLYRKLPGISTLHLT--PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHY 185 R L R L G + + + Y+ + + + ++ + Sbjct: 295 RLERCLDRFLDGPRWREVGELRHIPARSTVLQRRAGYRELFRLFALL---QQASRHDANL 351 Query: 186 RFYDFERNEKEMSLLYQKFLYEFCRREL------TSANTTRSYLKWDASSISDQSLNLLP 239 + K+ LLY+ + + ++ L + + + + ++ + Sbjct: 352 LDFTALVEIKDAPLLYEYWCFFQVKQRLDARFGLPRRASIIAEAQPEEDALREGLRLDYG 411 Query: 240 R------------------------METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFH 275 R + DI + +++LI+DAKY G E + Sbjct: 412 RGIALYYNATCAARSSGRLSSYSNDLRPDIILSDGDRLLILDAKYRGEASGDT-GIESPN 470 Query: 276 SQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKINGFD------IGLCTVNLG 329 ++ ++ Y +++ G ++YP + + G + ++ Sbjct: 471 PADIDKMHTYREAIRNVWG----AFVLYPGEKPEIHPAFTARGDSRYEGVGALVLAPDIL 526 Query: 330 QEWPCIHQELLDIFDEYLK 348 E+ + E+L+ Sbjct: 527 SGEVDGSDEIERLISEFLE 545 >UniRef50_A6T3N7 Uncharacterized conserved protein n=1 Tax=Janthinobacterium sp. Marseille RepID=A6T3N7_JANMA Length = 538 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 72/261 (27%), Gaps = 66/261 (25%) Query: 100 NEDTLANRIIKSTLAILIK--------------HEKLNSTIRDEARSLYRKLPGI----- 140 + DT NR +K L + + R L I Sbjct: 236 SFDTPENRFVKHVLLDIENVCVSINSIVMPENIQRASQEMLIKSRRWLRNDFFKILGTLQ 295 Query: 141 STLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLL 200 +P S + ++ K I N + K+++ L Sbjct: 296 FLPTTSPALTSRFGYKEIFGHFLKSRVAAKQIFN---------DLKRQSLYVELKDVAQL 346 Query: 201 YQKFLYEFCRRELTSANTTRS----------------------YLKWDASSISDQSLNLL 238 Y+ +++ L + + ++ S + + Sbjct: 347 YEYWVFYKIATSLLGEKAIVTARGLVTKNGRIANSISLENGNISVAFNESFVRSNRGSYS 406 Query: 239 PRMETDIT--IRSSEKILIV--DAKYYKSIFSRRMGT--------EKFHSQNLYQLMNYL 286 + D+ IR+ L++ DAKY + +LY++ Y+ Sbjct: 407 LTLRPDVVVRIRTKAGTLVILFDAKYRSRVAGSFEDELLEEVVHRRSVKPDDLYKMHCYV 466 Query: 287 WSLKPENGENIGGLLIYPHVD 307 ++ G+ + + +YP D Sbjct: 467 DAI----GDAVSAMAVYPGND 483 >UniRef50_B0MB39 Putative uncharacterized protein n=1 Tax=Anaerostipes caccae DSM 14662 RepID=B0MB39_9FIRM Length = 424 Score = 55.8 bits (133), Expect = 2e-06, Method: Composition-based stats. Identities = 31/258 (12%), Positives = 70/258 (27%), Gaps = 37/258 (14%) Query: 62 YNPNTEIIPGI-KGRIEFAKTIRGFHLN----HGKTVSTFDML--NED--TLANRIIKST 112 Y ++ +G+I+F ++R + + + + L +I K Sbjct: 103 YKEREQVRKTADRGKIDFPASLRKNVKFFQEDGSPFFDRYTVKGSSPNEKNLITQIHKYC 162 Query: 113 LAILIKH------EKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVI 166 + L S + K + Sbjct: 163 VYEAFSTLGWLFTPHLPPDPHITLESERFLFA-----------LRKKLSVTHNDKDKLLF 211 Query: 167 SVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWD 226 ++ + Y +++K + E + KW+ Sbjct: 212 QAMIQMLEYLDTENDNKQY-----YFGTDRFEYVWEKLIDEVFG--IRGKEEFFPRTKWN 264 Query: 227 ASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYL 286 +++ + +E D + +KI ++DAKYY+ + S Q+ Sbjct: 265 LKFNANKDNHA---LEPDSIMICDDKIYVLDAKYYRYGVTGEPRHLPESSSINKQITYGE 321 Query: 287 WSLKPENGENIGGLL-IY 303 + + + G L IY Sbjct: 322 YIDNCQELKTKYGDLPIY 339 >UniRef50_C0QR15 Putative uncharacterized protein n=1 Tax=Persephonella marina EX-H1 RepID=C0QR15_PERMH Length = 465 Score = 55.0 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 54/203 (26%), Gaps = 35/203 (17%) Query: 98 MLNEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGI-----STLHLTPQHFSY 152 DTL NR IK L + + L + + + + + SY Sbjct: 198 EETFDTLENRFIKYFLKEIETVLSEELEEFLYLKELREIKEEVEYALQTDIFVEVGNLSY 257 Query: 153 LNGGKN----TRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEF 208 Y+ + ++ + ++ IP K D + K+M+ L++ ++ Sbjct: 258 FPSNSQVLMKKAGYREIFNIYRLFHSSFIPQIFKN----LDIALSLKDMATLWEYYVLIE 313 Query: 209 CRRELTSAN----------------------TTRSYLKWDASSISDQSLNLLPRMETDIT 246 + L D Sbjct: 314 ILKSLKKIYGNYETKINFEERTKQGTIYDYAVFEFEDGLKLYFQKSLYSYSRLEFRPDFI 373 Query: 247 IRSSEKILIVDAKYYKSIFSRRM 269 + + K I DAK+ +R+ Sbjct: 374 MEMNNKKYIFDAKFRIFEDNRKD 396 >UniRef50_C4Z3V6 Putative uncharacterized protein n=1 Tax=Eubacterium eligens ATCC 27750 RepID=C4Z3V6_EUBE2 Length = 389 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 35/324 (10%), Positives = 84/324 (25%), Gaps = 52/324 (16%) Query: 60 LDYNPNTEIIPGIK--GRIEFAKTIRGFHLNHGKTVSTFDMLNE---DTLANRIIKSTLA 114 Y E++ + G+I + +TI+ + N +I Sbjct: 80 RGYYKEQEVLYKVAKSGKINWNRTIKTQKPYVQDMDVFYLDFVTKKNSVKENELITLIHE 139 Query: 115 ILIKHEKLNSTIRDEARSLYRKLP-GISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIV 173 + I + K P + L + + I+ Sbjct: 140 YCVYESF--ERIGWLYTEMMPKKPVIVKQERLFRSVLKDKIANTFNDKNRMLFRHMLAII 197 Query: 174 NNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQ 233 ++ +++K + + E + + + Sbjct: 198 -----DFEGDKDSDKNYRYGTYRFEYIWEKMIDKVFGIENKADYFP------KTTWYVNG 246 Query: 234 SLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 S +E D + + ++DAKYYK + + + Q+ + + Sbjct: 247 SKYDNASLEPDTVMLYGTDVYVLDAKYYKYGVTGKTWDLPESTSINKQITYGEYIANEDK 306 Query: 294 GE-------NIGGLLIYPHVD----------------------TAVKHRYKINGFDIGLC 324 + + + P + +KI G + + Sbjct: 307 FKKKHGENMKVYNAFLMPFDSLKSKYPNNANMLKVGQAVSNWKDNTEEYHKIQGVLLDVK 366 Query: 325 T---VNLGQEWPCIHQELLDIFDE 345 T +N+ QE I ++L + + Sbjct: 367 TLMSINVRQEMKEI-EKLAKLIEN 389 >UniRef50_A5G8Z2 Putative uncharacterized protein n=2 Tax=Proteobacteria RepID=A5G8Z2_GEOUR Length = 785 Score = 52.3 bits (124), Expect = 2e-05, Method: Composition-based stats. Identities = 49/398 (12%), Positives = 99/398 (24%), Gaps = 89/398 (22%) Query: 16 TYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSR---RGLELDYNPNT--EIIP 70 ++A E+ + L L L G+ + R P + Sbjct: 182 SFAQKTEHELATSRKPHERFPLLWLALFRSLRSGLENAVKLICRSPHSRLLPLERFQRPD 241 Query: 71 GIKGRI------EFAKTIRGF-HLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLN 123 +KGR+ + + R + L+ DT NR +K + I+ + Sbjct: 242 CLKGRLTPRLEEQVTEQCRNGELHRRHRI--ETRRLSVDTPENRFVKMVMTRCIRELSVF 299 Query: 124 STIRDEA-------RSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNS 176 S R + + + Y + + + + Sbjct: 300 SRRARLNNAVPDRERLSNAFFAELDGWQKPLEQRLAEPLFREVGSYNGMAQESLVLHHRA 359 Query: 177 IPGQNKGHYRFYDFERNEK---------EMSLLYQKFLYEFCRRELTSANTTRSYLKWDA 227 + ++ + ++ LY+ + RR L + T + A Sbjct: 360 GYAKVYRIWQELKLYLDLFGRQASISMKSVAELYEVWCLLEIRRMLMALGFTEVETRKAA 419 Query: 228 SSISDQSLNLLPRM----------------------------------------ETDITI 247 +L M + DI + Sbjct: 420 LRTKGLEKDLADGMGTAFRFTRRDGLEIRLAHEPPFSVTRNPDARGIYSWTTPQKPDILL 479 Query: 248 R-----SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP----------E 292 S I DAKY + + Q+ Y +L + Sbjct: 480 EAALPDGSRINWIFDAKYRIDAEDGKDDL--IPDDAINQMHRYRDALIHLKKADDGVTEK 537 Query: 293 NGENIGGLLIYPH--VDTAVKHRYKINGFDIGLCTVNL 328 + +G ++YP + K+ Y +G+ L Sbjct: 538 SRPVVGAFVLYPGWFDEETGKNPYTAAVEAVGIGGFPL 575 >UniRef50_D2RAE2 LlaJI restriction endonuclease n=1 Tax=Gardnerella vaginalis 409-05 RepID=D2RAE2_GARVA Length = 458 Score = 52.3 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 67/243 (27%), Gaps = 26/243 (10%) Query: 74 GRIEFAKTIRGFHLNHGK-------TVSTFDMLNE---DTLANRIIKSTLAILIKHEKLN 123 G+ ++A+T R + F++ + DT + + +E Sbjct: 125 GKQDWARTARKQMPLVQSRNGVSSFVFTQFEVRSSTPNDTKEI----TQINRFCVYEAFK 180 Query: 124 STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKG 183 + + + + + N K + N + Sbjct: 181 RLGWLYVPYMPEESGSHPDIKTSIRIVQNKLATTNDDRKKSLFRSM-----NDMLEYMYE 235 Query: 184 HYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMET 243 F + +++K + N +W + + Sbjct: 236 KTSDKQFYFGTDDFDHVWEKLIDRAFGE--KDKNKYFPRSRW---LLDCGKYKEKKPLIP 290 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMN--YLWSLKPENGENIGGLL 301 D + ++K ++DAKYYK + S Q+ YL K + +++ Sbjct: 291 DTIMIYNDKYYVLDAKYYKYGLTGIPDHLPNGSSINKQITYGEYLEKNKNVDAKSLFNAF 350 Query: 302 IYP 304 I P Sbjct: 351 IMP 353 >UniRef50_B5IWI4 Putative uncharacterized protein n=1 Tax=Thermococcus barophilus MP RepID=B5IWI4_9EURY Length = 554 Score = 51.5 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 37/359 (10%), Positives = 87/359 (24%), Gaps = 101/359 (28%) Query: 33 IPGNNLLDILGYVLN--------KGVLQLSRRGLELDYNPNTEIIPGIK----------- 73 + ++ Y ++ RR E + + Sbjct: 162 ESEEPMSELFVYHFLVNNRERIISAYEEIIRR-PHRKLVEREEWLNFWEVSEVDEDTVMS 220 Query: 74 -------------GRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKS--------- 111 I A+ + ++ DT NR K Sbjct: 221 IITHPEYLIKAEPSSIAVAEYLNNHVPTKVAQRIKYESF--DTHENRFAKHFLNELITWG 278 Query: 112 --TLAILIKHEKLNSTIRDEA-RSLYRKLPGI-----STLHLTPQHFSYLNGGKN----T 159 +A ++ L ++ A L L + S + Sbjct: 279 EKAIAAILTSSYLTKEQKENATSKLTSILGELEYYATSDIFDDVGEMVIFPYTSQVLLKR 338 Query: 160 RYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSA--- 216 Y+ ++ + + P ++ + K+++ LY+ + + EL Sbjct: 339 EGYRDLLQL-WREFRSYSPFFDEMQKAI-----DNKDIAKLYEYWCFFKLVEELGKILGQ 392 Query: 217 -------NTTRSYLKWDASSISDQSLNLLP----------RMETDITIRSSEKIL-IVDA 258 T + + L + D + I+ + DA Sbjct: 393 ENLRIIVEPTGELSERGHVYAEFDNGWRLYYNLKKRGYSVSLRPDFLLLKGRNIIGVFDA 452 Query: 259 KYY----KSIFSRRMGTEKFHS---------QNLYQLMNYLWSLKPENGENIGGLLIYP 304 K+ E + +++Y++ Y +L + +++YP Sbjct: 453 KFKLDVVDVNEFAEEDREMERAPNFQTWAKLEDIYKMHTYRDALNAKF-----AVVLYP 506 >UniRef50_C7DCR7 Putative uncharacterized protein n=1 Tax=Thalassiobium sp. R2A62 RepID=C7DCR7_9RHOB Length = 412 Score = 51.1 bits (121), Expect = 5e-05, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 44/119 (36%), Gaps = 7/119 (5%) Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIR 248 F ++ +++++ L E N ++ + S RM TDI +R Sbjct: 247 SFVFGIEDFHIVWEQMLAETLEGVEPHWNEKLPRAVYE-TLDGRASDAPERRMLTDIVLR 305 Query: 249 SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKP--ENGENIGGLLIYPH 305 + I+DAKYY + + + Q+ Y +L+ + I ++P Sbjct: 306 TPTGFTIIDAKYYAANSPNTVPGWPDIA---KQMF-YEMALRSVVGDEPEIRNCFVFPA 360 >UniRef50_Q30SQ1 Putative uncharacterized protein n=2 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30SQ1_SULDN Length = 736 Score = 51.1 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 50/290 (17%), Positives = 89/290 (30%), Gaps = 51/290 (17%) Query: 56 RGLELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAI 115 R + +DY IK ++ +K I+ L+ K + D L ++I +T+ Sbjct: 150 RKIHIDYKSQD-----IKYKLNLSKNIKE--LDKTKI---HQTQSLDVLYSQI--ATITY 197 Query: 116 ------------LIKHEKLNSTIRDEARSLYRKLPG-ISTLHLTPQHFSYLNGGKNTRYY 162 +IK E+ + EAR L + S LN K T+ Y Sbjct: 198 GALKLFAKRRIDIIKDEEYKKQLIQEARELISFIAKKYPIDKNYKFSLSKLNNSKTTKVY 257 Query: 163 -------------KFVISVCKFIVNNSIPGQNKGHYRFYDF-----ERNEKEMSLLYQKF 204 K + + +N + N+ F E + +++ Sbjct: 258 SNKSDTKLLLVDIKSLFGFEQMYQDNEVYVSNRYDLTTTSFFINPSTFYEWYIYDIFEST 317 Query: 205 LYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKIL--IVDAKYYK 262 + + S + + D + EK + ++DAK+ Sbjct: 318 VGHQYSVLFDKHKEASKKTQTQYDLTSKYDGDTKRNSKPDYILIDEEKKIKVVLDAKWKN 377 Query: 263 SIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKH 312 +G+E F L L NG + LIYP+ H Sbjct: 378 IPKFNEIGSEDFLKLKL------DTELLNNNGYSTLAYLIYPYYPHENDH 421 >UniRef50_A1RVV7 Putative uncharacterized protein n=1 Tax=Pyrobaculum islandicum DSM 4184 RepID=A1RVV7_PYRIL Length = 429 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 29/255 (11%), Positives = 68/255 (26%), Gaps = 26/255 (10%) Query: 112 TLAILIKHEKLNSTIRDEARSLYRKLPGI--STLHLTPQHFSYLNGGKNTRYYKFVISVC 169 ++L+ L+ +R R G+ + S L + Sbjct: 177 ASSVLLTLSNLSQFLRGALRYTSGLPEGVGKALELYISSLESALKVLHKDAEEEIPHLWV 236 Query: 170 KFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASS 229 + ++ R E + LY+ ++ R L L+ + Sbjct: 237 ED-LDPGDYFYLPAIARRVGSEFFLIPSTKLYETYVLALTVRTLKRDGLAAKVLRGGIAV 295 Query: 230 ISDQSLNLLPR---------------METDITIRSSEKILIVDAKYYKSIFSRRMGTEKF 274 + + DI + S +++V+AKY + R + Sbjct: 296 DVEGDGRVYFNKAPTSKLIRRIAGIAPRPDIVVESGRNLIVVEAKYRRLDERRLRRADAI 355 Query: 275 HSQNLYQLMNYLWSLKPENGENIGGLLIYP--HVDTAVKHRYKINGFDIGLCTVNLGQEW 332 L+ YL + ++ + P I+ + + ++ Sbjct: 356 R------LVAYLADVARDHRLRGVVAALEPPHQAGGCPAVSANIDSTQAEIRFSRVNPDY 409 Query: 333 PCIHQELLDIFDEYL 347 +E ++ L Sbjct: 410 EKSEEEFIECIRGIL 424 >UniRef50_Q6QPY9 R2.LlaJI n=1 Tax=Lactococcus lactis RepID=Q6QPY9_9LACT Length = 414 Score = 50.4 bits (119), Expect = 9e-05, Method: Composition-based stats. Identities = 40/250 (16%), Positives = 85/250 (34%), Gaps = 26/250 (10%) Query: 60 LDYNPNTEII--PGIKGRIEFAKTIRGFHL---NHGKTVSTFDMLNEDTL-----ANRII 109 +Y TE + IKG+ +++KT + +G V T + T +I Sbjct: 99 HEYYTETERVYKTAIKGKTDWSKTFKRQRPIVQQNGSLVYTQTTVQTSTPNDTKMITQIH 158 Query: 110 KSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVC 169 K + A+S + + S N N R +K +I + Sbjct: 159 KYCVYECFDKIGWLYVPNLPAKS-EISFDKNRFISILYDKLSNTNNDTNKRLFKAMIDMI 217 Query: 170 KFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASS 229 +FI + P + + F +++K + + E + KW+ Sbjct: 218 EFI--DERPNEKQ-------FYFGTDVFEYVWEKLIDKVFGIE--NKQDYFPKTKWNLRF 266 Query: 230 ISDQSLNLLPRMETDITIRSSEKILIVDAK-YYKSIFSRRMGTEKFHSQNLYQLMNYLWS 288 + +E D + +K+ ++DAK Y + ++ HS ++ + + Y Sbjct: 267 ---GKERMNRPLEPDTIMIYKDKVYVLDAKLYRYGAMDVPVASKLPHSSDINKQITYGQY 323 Query: 289 LKPENGENIG 298 + ++ Sbjct: 324 INKHPDPSLR 333 >UniRef50_Q5XBC0 Type II restriction-modification system restriction subunit n=3 Tax=Streptococcus pyogenes RepID=Q5XBC0_STRP6 Length = 426 Score = 50.4 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 39/295 (13%), Positives = 78/295 (26%), Gaps = 39/295 (13%) Query: 29 NLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPNTEI-IPGIKGRIEFAKTIRG--- 84 + L+ + RG Y E I + GRI +A+TI+ Sbjct: 78 TVNVNQALKTLNFPMEAYTFVIHDFINRGTY--YKETEEYYIQTLGGRINWARTIKRVQP 135 Query: 85 ----FHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHE----KLNSTIRDEARSLYRK 136 K S + TL I K + + +L + + Sbjct: 136 VVQGNGFKFLKMESKKQSDTDITLLTEINKFCVYEAFLNMGWLYQLPQPEKARINYNRSQ 195 Query: 137 LPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKE 196 I + + I+N + F Sbjct: 196 FESI---------IKKKMATTFNDMERRLFQSMIDILNYKDRQEEPDK-----FYFGTNN 241 Query: 197 MSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIV 256 +++ + + + + KWD + S +E D + ++ + ++ Sbjct: 242 FQYIWENLID--YTYGIANKTSYFPRTKWDLVFETKTSKGY--ALEPDTIMVHNDNVFVL 297 Query: 257 DAKYYKSIFSRRMGTEKFHSQNLYQLM--NYLWSLK-----PENGENIGGLLIYP 304 DAKYYK + M + Q+ Y+ + + + P Sbjct: 298 DAKYYKFGQTNLMKDLPPSTSINKQITYGGYVANQDLLKKLHGQNSKVYNAFLMP 352 >UniRef50_Q03XF1 Putative uncharacterized protein n=1 Tax=Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 RepID=Q03XF1_LEUMM Length = 158 Score = 50.0 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 17/104 (16%), Positives = 35/104 (33%), Gaps = 14/104 (13%) Query: 208 FCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSR 267 R S + + + R+ D ++S K +I DAKY Sbjct: 1 MIREIFYHLKNKSGIDAQQLFSK-SNNFSKIGRIYPDFISKNSNKRIIADAKYKPI---- 55 Query: 268 RMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVK 311 + +++ Q++ Y++ + G IYP + + Sbjct: 56 ----KNIANKDYLQVLAYMFRFNAKM-----GFYIYPANNNESE 90 >UniRef50_B7IJA5 Type II restriction-modification system restriction subunit n=1 Tax=Bacillus cereus G9842 RepID=B7IJA5_BACC2 Length = 449 Score = 48.8 bits (115), Expect = 3e-04, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 79/265 (29%), Gaps = 19/265 (7%) Query: 59 ELDYNPNTEIIP--GIKGRIEFAKTIRGFHLN---HGKTVSTFDMLNEDTLANRII---- 109 Y +E + +G+IE+++T+ + + L T N II Sbjct: 128 RYGYYEKSEEVSMENGEGQIEWSRTLNEITPHFLKNRPYYLNTYNLITVTERNNIIREIH 187 Query: 110 KSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFV-ISV 168 K + E + + + R L I+ L F + + Y I + Sbjct: 188 KWAVDFC--FEMFGTILGYNNIKIERSLLNINKLGNVEY-FKKIIYREINETYVDSNIQL 244 Query: 169 CKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDAS 228 K ++ N + +++++ E + +W + Sbjct: 245 LKSLITLLTRKSNFSKSNLLTLYG-TRSFAMVWEDVCSYIFNNEKDNYVNEIEKPRW-TN 302 Query: 229 SISDQSLNLLPRMETDITIR----SSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMN 284 I DI E LI+DAKYY F ++ + + Sbjct: 303 IIGTNVSEEKETFRPDIIKTFSSIDQEYFLILDAKYYLIEFLDGNLVNNPDVNDVAKQLL 362 Query: 285 YLWSLKPENGENIGGLLIYPHVDTA 309 Y + + + + ++P + Sbjct: 363 YEKAFHYKKEKIFRSIFLFPKSEQD 387 >UniRef50_O58602 Putative uncharacterized protein PH0872 n=1 Tax=Pyrococcus horikoshii RepID=O58602_PYRHO Length = 500 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 35/272 (12%), Positives = 83/272 (30%), Gaps = 65/272 (23%) Query: 100 NEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNT 159 + DT NR K L +LI+ + R + L + + + T Sbjct: 221 SFDTPENRFAKYFLDLLIEWGERAQRNVGGVRDIIEDLRFL-RSDPLWEEVGKMKIFPYT 279 Query: 160 RYYKFVISVCKFIVNNSIPGQNKGHYR-------FYDFER---NEKEMSLLYQKFLYEFC 209 + ++ + G Y+ F+ R + ++++ LY+ +++ Sbjct: 280 ---------SQTLLKGEGYRELLGLYQEFVVRMPFFAKLREAIDNRDIARLYEYWVFFKL 330 Query: 210 RRELTSANTT---------------------RSYLKWDASSISDQSLNLLP---RMETDI 245 +EL KW ++ D Sbjct: 331 VKELEKILGEKNVRITIEPARGLSEEGNVYAEFKGKWRLYYNRKLRPRKFSYSVSLKPDF 390 Query: 246 TIRSSEKIL-IVDAKYYKSI---------FSRRMGTEKFHS----QNLYQLMNYLWSLKP 291 ++ K++ + DAK+ + + +++Y++ Y +L+ Sbjct: 391 SLFREGKVVGVFDAKFKLELVDVDRFADEDKEMEENPSIETWAKLEDIYKMHTYRDALRC 450 Query: 292 ENGENIGGLLIYPHVDTAV--KHRYKINGFDI 321 + +++YP + R KI+G + Sbjct: 451 KF-----AVVLYPGSKSVFFDAKRGKIDGVSL 477 >UniRef50_B7HF08 Conserved domain protein n=29 Tax=Bacillus cereus group RepID=B7HF08_BACC4 Length = 815 Score = 46.9 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 52/420 (12%), Positives = 121/420 (28%), Gaps = 89/420 (21%) Query: 7 PVRNI-YYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLSRRGLELDYNPN 65 + N+ +++L + I N +L+ K + +R Sbjct: 176 EIYNLAFHLLKRTYLGASAIYATNPSKSEFYRILNDSFERFMKSISH-IKRQPHHTLMTR 234 Query: 66 TEIIPGIKG----RIEFA--KTIRGFHLN-------HGKTVSTFDMLNEDTLANRIIKST 112 + ++G +++ +R K ++ + ++ DTL NR +K Sbjct: 235 HQ---LVRGEKIRKLDSVGMNYLRKRPHLLQGENSIPTKGITAYKEVSYDTLENRFVKWM 291 Query: 113 LAILIKH-EKLNSTIRDEARSLY-----RKLPGIS----TLHLTPQHFSYLNGGKNTRY- 161 + ++ + L + +R L + + + GK R Sbjct: 292 IQRVVHKIDDLIKVLEKRSRYTRGETDEDLLERVKNMKYRMKNELNDPFWRRIGKLDRSV 351 Query: 162 YKFVISVC-------KFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELT 214 + VI + + + S +G F+ + K+++ LY+ + Y + L+ Sbjct: 352 FSLVIQMAAGYRDAYQIFLMLSRGLTLRGQI----FKMSVKDVARLYEYWTYLKLGQILS 407 Query: 215 SANTTRSYLKWDA--------------------SSISDQSLNLLP----------RMETD 244 +D+++ L + D Sbjct: 408 KKYIPLHQDVIQVKQDGLYVTLDESKTAKRTFKHPETDETIELYFQKRNGRLPTVTQKPD 467 Query: 245 --ITIRSSEK----ILIVDAKYYKSIF-----SRRMGTEKFHSQNLYQLMNYLWSLKPEN 293 + I K I DAKY + G +++ + Y +L E Sbjct: 468 TMLAIEKKGKSYQYQYIFDAKYRIDFAETSHYKGKYGAPGPMEEDINTMHRYRDALVIEE 527 Query: 294 GENIG-----GLLIYPHVDTAV--KHRYKINGFDIGLCTVN-LGQEWPCIHQELLDIFDE 345 +++P V H + + + + L + Q L + ++ Sbjct: 528 EGPFERTAYGAYVLFPWNQEEVYENHPFYKSIEKVNIGGFPFLPNATRLVEQFLDHLIEK 587 >UniRef50_O66886 Putative uncharacterized protein n=1 Tax=Aquifex aeolicus RepID=O66886_AQUAE Length = 440 Score = 46.5 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 28/239 (11%), Positives = 62/239 (25%), Gaps = 45/239 (18%) Query: 78 FAKTIRGFHLNHGKTVSTFDMLNEDTLANRIIKSTLAILIKHEKLNST-----IRDEARS 132 + + F+ + DT NR +K L L + Sbjct: 183 IIEHEGKRYSPTAVLQYEFEE-SFDTPENRFVKHLLKELEILLSEELKDFFFLEELKEIK 241 Query: 133 LYRKLPGISTLHLTPQHFSYLNGGKN----TRYYKFVISVCKFIVNNSIPGQNKGHYRFY 188 + S + ++ Y+ + + + + + +P + Sbjct: 242 EEIEYTLRSDVFSEVGDLNFFPSNSQVLMKKAGYRELFQIYRLLHLSFVPRIFED----L 297 Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYL----------------------KWD 226 D + K+M+ L++ ++ RE T + Sbjct: 298 DLAFSLKDMATLWEYYVLIEILREFKEKFGTYKVIIDFEEKVEGKTVYEEAQFKFENDLI 357 Query: 227 ASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY 285 R D ++ + I DAK+ + + +L Q M+Y Sbjct: 358 LYYQKSLYAYSGLRFRPDFYVKFKDNRFIFDAKFRIFENNEK---------DLLQNMHY 407 >UniRef50_Q6LVP0 Putative uncharacterized protein n=1 Tax=Photobacterium profundum RepID=Q6LVP0_PHOPR Length = 793 Score = 46.5 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 43/356 (12%), Positives = 89/356 (25%), Gaps = 95/356 (26%) Query: 71 GIKGRI--EFAKTIRGFHLNHGKTVSTFDML--------NEDTLANRIIKST-------L 113 +KGR+ A+ +R N +D + +T NR IK L Sbjct: 247 RLKGRLSNRLAEKVREDIANK-----VYDRRYKQDKKYLSLNTPENRFIKMVVIQCKQEL 301 Query: 114 AILIKHEKLN--STIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKF 171 L ++N +R L + + K Y + Sbjct: 302 EKLSDKLEVNNQKENNTHSRLSQHFLSELKNWQKPLIKMERHSFLKEVGDYTGLQRESLV 361 Query: 172 IVNNSIPGQNKGHYRFYDFERNEKE---------MSLLYQKFLYEFCRRELTSANTTRS- 221 + + ++ + + ++ Y+ + + R+ LT + Sbjct: 362 LQQKTGYNTVYKTWQELKYYLDIFAKHSTVSMKSVAETYEVWCFLAVRKVLTDVLGFKET 421 Query: 222 -----------YLKWDASSISDQSLNLLP----------------------------RME 242 Y + + + + Sbjct: 422 EQNKAKLKLNDYFELQLEDGLSGAGAFVFERSDGLKARLAHEPVFRRAGKKIRSFWVTQK 481 Query: 243 TDITIR---SSEKILI--VDAKYYKSIFSRRMGTEKFHSQN------LYQLMNYLWS--- 288 DI + K I DAKY + R + Q+ + Q+ Y + Sbjct: 482 PDILLEVTFPDGKKCIWLFDAKYRIKTKNDRYEQDDIDQQDFVPDDAINQMHRYRDALIR 541 Query: 289 ------LKPENGENIGGLLIYPH--VDTAVKHRYKINGFDIGLCTVNLGQEWPCIH 336 L ++ G +YP + + Y+ ++G+ L Sbjct: 542 IEGQEGLHSKSRPVFGAFALYPGFYDQSVASNPYQAVINEVGIGAFALLPSAENNK 597 >UniRef50_A6GXX4 Putative uncharacterized protein n=1 Tax=Flavobacterium psychrophilum JIP02/86 RepID=A6GXX4_FLAPJ Length = 573 Score = 46.1 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 64/223 (28%), Gaps = 42/223 (18%) Query: 118 KHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFI--VNN 175 + K I + + L + + F N ++ +Y F+ K I + N Sbjct: 320 RSLKRLGDINESLNEIKYYLDKYIPVTIETLDFLNTNKIESKEHYNFIFD--KLIQWLLN 377 Query: 176 SIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISD--- 232 +K F M L++K + + + ++ Sbjct: 378 KNATYSKDKKLFKGI----NRMDQLFEKACFYKLIDSFKKLEYEVQEISNNKVKLTKKGA 433 Query: 233 --------------------QSLNLLPRMETDITIR-SSEKILIVDAKYYKSIFSRRMGT 271 ++ D TI + K +I+DAKY K R Sbjct: 434 VHYLYSQILPDKLISVRSKNGYGGNSGVLQPDFTIELENGKFIIIDAKYKKLETITRYDY 493 Query: 272 EKFHSQNLYQLMNYLWSLKPENGE---NIGGLLIYPHVDTAVK 311 + YL + + G +G +++P ++ + Sbjct: 494 PDLA-------LKYLHGIGLKTGGFFNPMGLFMLFPRIENFID 529 >UniRef50_D0C730 Restriction endonuclease n=1 Tax=Acinetobacter baumannii ATCC 19606 RepID=D0C730_ACIBA Length = 426 Score = 45.4 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 43/313 (13%), Positives = 93/313 (29%), Gaps = 44/313 (14%) Query: 43 GYVLNKGVLQLSRRGLELDYNPNTEIIPGIK---GRIEFAKTIRGFHLNHGKT--VSTFD 97 + + + G+ + E++ ++ G+ ++ KTI + D Sbjct: 114 LEMFKYLINDFQQHGIFKN----EEVL--LRKNSGKTDWKKTINRLVSFPDSSGRPVYLD 167 Query: 98 ML-----NEDTLANRIIKSTLAILIKHEKLNSTIRDEARSLYRKLPGISTLHLTPQHFSY 152 + + ++ RI L + K+ T +++ ++ S Sbjct: 168 VYGKQRTSTNSEITRIHAGILKQVYKNYGFIFTGKNKVPYSLKQYGETSLSTDAQISVLK 227 Query: 153 LNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRE 212 +N + + + I KG+ + + ++ L Sbjct: 228 NEI-RNHFADRQTLLLKTLI---EYLEAYKGNEQKNQIIG-VTRFHVAWEHMLSRCL--- 279 Query: 213 LTSANTTRSYLKWDASSISDQSLNLLP----RMETDITIRSS--EKILIVDAKYYKSIFS 266 N + +P M TDI I +K+ I+DAKYY++ Sbjct: 280 ---DNVIDINSRLPKPVFIKPDGTAIPAKKSGMRTDIVIEDKAAKKLTILDAKYYEATTV 336 Query: 267 RRMGTEKFHSQNLYQLMNYLWSLKPEN---GENIGGLLIYPHVDTAVKHRYKINGFDIGL 323 +L + Y ++ G LI+P +A + N + Sbjct: 337 ENAPGW----ADLVKQFFYEKAISIMPEFSGYQFENALIFPGQKSAFDKIHMQNQKNGNY 392 Query: 324 CTVNLGQEWPCIH 336 L ++P I Sbjct: 393 ----LDTDFPIIK 401 >UniRef50_B9KEV3 Putative uncharacterized protein n=1 Tax=Campylobacter lari RM2100 RepID=B9KEV3_CAMLR Length = 208 Score = 44.6 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 15/71 (21%), Positives = 25/71 (35%), Gaps = 9/71 (12%) Query: 255 IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRY 314 I+D +++ R +LYQ+ Y + + LIYP + R Sbjct: 103 ILD----ENLVGRDEKKYGISQSDLYQMFAYANKYEIK-----EIYLIYPLCERTFDLRG 153 Query: 315 KINGFDIGLCT 325 K+ DI Sbjct: 154 KLKTKDIKFLA 164 >UniRef50_UPI0001AF063B hypothetical protein SghaA1_36952 n=1 Tax=Streptomyces ghanaensis ATCC 14672 RepID=UPI0001AF063B Length = 220 Score = 43.4 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 9/39 (23%), Positives = 17/39 (43%) Query: 92 TVSTFDMLNEDTLANRIIKSTLAILIKHEKLNSTIRDEA 130 V +D DT NRI+++ L++ + +R Sbjct: 101 LVYGYDAYTADTAENRILRAVGERLLRLPGVPGPVRRRL 139 >UniRef50_A4YFI8 Putative uncharacterized protein n=1 Tax=Metallosphaera sedula DSM 5348 RepID=A4YFI8_METS5 Length = 405 Score = 43.0 bits (100), Expect = 0.017, Method: Composition-based stats. Identities = 41/308 (13%), Positives = 90/308 (29%), Gaps = 68/308 (22%) Query: 47 NKGVLQLSRRGL-ELDYNPNTEIIPGIKGRIEFAKTIRGFHLNHGKTVSTFDMLNEDTLA 105 + Q L ++ + + P G I+ ++I ++ G + D Sbjct: 59 LAYLEQAIEARLTYREFVISYDREPM--GAIDLPRSI--PVMSRGIYAYYTYIKGYDAPE 114 Query: 106 ----NRIIK----STLAILIKHEKLNSTIR------------DEARSLYRKLPGISTLHL 145 N ++K + L K + + I+ D R G L Sbjct: 115 YAIMNYLLKRIYSTALQYYNKIKDVREEIKYFRVKGRMKTRLDRLRKGLSYFKGEYFRPL 174 Query: 146 TPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFL 205 T +L YY + G + + + + + LY+ ++ Sbjct: 175 TDYDPEWLRET-FNLYYTLSQ------LKELSLGISTQKAPSMNKKMLKVILWKLYELYV 227 Query: 206 YEFCRRELTSANTTRSYLK----------------------WDASSISDQSLNLLPRMET 243 + + L + S+ D + R Sbjct: 228 FFIFVKYLEREGFDVAKENGRYVAKKGNRRLSLILNSDLDFSQLDSVDDLDNTEIFRGRP 287 Query: 244 DITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIY 303 D+++ + +L+ + KY + + + ++LM Y + + +LIY Sbjct: 288 DLSLVAGNSVLV-ECKY--------SSKVGYITSSRFKLMAYAYEYN-----PLTAILIY 333 Query: 304 PHVDTAVK 311 P +D V+ Sbjct: 334 PGLDKEVE 341 >UniRef50_C5UQY5 Putative restriction endonuclease n=1 Tax=Clostridium botulinum E1 str. 'BoNT E Beluga' RepID=C5UQY5_CLOBO Length = 413 Score = 42.7 bits (99), Expect = 0.018, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 38/145 (26%), Gaps = 6/145 (4%) Query: 138 PGISTLHLTPQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEM 197 I +L + Y ++ V I+ + + + K Sbjct: 200 EEIVLPVDKEYAIKFLRAERQVTYNTRLLRVIDLIIKFIDSREEESKDNAV-MSLSTKSF 258 Query: 198 SLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVD 257 +++ + E W DI + + + I+D Sbjct: 259 YAVWELMCKKAFSDEYKDMKEEIPRPYWKI----ADKEPKYTEQIPDILYKENRSLFILD 314 Query: 258 AKYYKSIFSRRMGTEKFHSQNLYQL 282 AKYY ++ + Y++ Sbjct: 315 AKYYNVKKNKPGWHDLVKQY-FYEM 338 >UniRef50_Q81H81 Type II restriction-modification system restriction subunit n=2 Tax=Bacillus cereus RepID=Q81H81_BACCR Length = 444 Score = 42.7 bits (99), Expect = 0.019, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 46/148 (31%), Gaps = 9/148 (6%) Query: 170 KFIVNNSIPGQNKGHYRFYDFERNEK---EMSLLYQKFLYEFCRRELTSANTTRSYLKWD 226 K I+ ++ K E + E +++K E++ KW Sbjct: 242 KMILLKALIALIKKRTHSKLTELDIYGTREFEYIWEKVCGYVFDNEISQYENFLEKPKWT 301 Query: 227 ASSISDQSLNLLPRMETDIT---IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLM 283 + DI I LI+DAKYY F ++ + + Sbjct: 302 DFNAMQLITKKTF--RPDIIKTYIDIQRYFLILDAKYYNIRFKEGDLKGNPGVNDIAKQL 359 Query: 284 NYLWSLKPENGENI-GGLLIYPHVDTAV 310 Y +L +I + ++P+ + Sbjct: 360 LYEQALSAYTKGSITRNMFLFPYDKDEL 387 >UniRef50_B9DJ21 Putative uncharacterized protein n=1 Tax=Staphylococcus carnosus subsp. carnosus TM300 RepID=B9DJ21_STACT Length = 416 Score = 42.7 bits (99), Expect = 0.023, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 49/151 (32%), Gaps = 24/151 (15%) Query: 170 KFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKW---- 225 K ++ N I + SL++++ + + ++ YLK+ Sbjct: 233 KMLIKNLIIFFKNIDGGSQNLYIKANTFSLIWEEMVENYLNNYFIGMDSKGQYLKFSDTR 292 Query: 226 --------DASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQ 277 + I D S + +E D + E + I D+KYY + Sbjct: 293 LTPKLNFENRLFIPDASSENIYSLELDHYLNEDENVYIFDSKYYN----------EISKL 342 Query: 278 NLYQLMNY--LWSLKPENGENIGGLLIYPHV 306 + Q+ Y G+NI LI P Sbjct: 343 DYKQVSYYYLTRGFDSHLGKNIYNALITPQS 373 >UniRef50_C2EED9 LlaI.3 like protein n=1 Tax=Lactobacillus salivarius ATCC 11741 RepID=C2EED9_9LACO Length = 412 Score = 42.3 bits (98), Expect = 0.024, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 41/129 (31%), Gaps = 25/129 (19%) Query: 189 DFERNEKEMSLLYQKFLYEFCRRELTS--------ANTTRSYLKWDASSISDQSLNLLPR 240 ++ S +++K + + + K++ ++ S N Sbjct: 251 NYYLKHYSFSSVWEKMVNHYLNYHFEGIDDNRLIFKDNRSMVNKFEKATFYPNSANPRQN 310 Query: 241 METDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNY-------LWSLKPEN 293 ++ D + + I DAKYY K S N Q+ Y + + K + Sbjct: 311 IQPDYYLDDGDTQYIFDAKYYN----------KIDSINYKQVAYYFFLKNISVTNEKMKR 360 Query: 294 GENIGGLLI 302 L++ Sbjct: 361 KTTYNALIL 369 >UniRef50_C3WKP8 DNA helicase II n=1 Tax=Fusobacterium sp. 2_1_31 RepID=C3WKP8_9FUSO Length = 929 Score = 42.3 bits (98), Expect = 0.027, Method: Composition-based stats. Identities = 31/241 (12%), Positives = 77/241 (31%), Gaps = 29/241 (12%) Query: 121 KLNSTIRDEARSLYRKLPGISTLHLT-----------PQHFSYLNGGKNTRYYKFVISVC 169 L+ + + R + I+ FS +K + + Sbjct: 681 NLDEVTKKDERKILSYTTDIAPYRHCPMKYYLVREKEYSTFSKKIFNLGIITHKAIEHIN 740 Query: 170 KFIVNNSIP---GQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWD 226 K + P + + ++ ++ +++ + ++ + Y+K Sbjct: 741 KLFLQKKNPLFDDEYIENLLKNIYKFQNIDLDDNFER-IMSIVKKYIEDEKDNFEYIKKV 799 Query: 227 ASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYL 286 +S + + + D+ + +I I+D K + + ++ S QL Y Sbjct: 800 EASEFRIEDDYILYGQIDLILEDENEIQIIDFK------TGKYNELEYSSNYRQQLSLYK 853 Query: 287 WSLKPENGENIGGLLIYPHVDTAVKHRYKINGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 L+ + ++I L Y + K I + L +++ I++ DI D Sbjct: 854 LLLQKKYDKDIKTYLYY-LEEDEPKKEILITDEE-------LEEDFKNINKTTQDILDNK 905 Query: 347 L 347 Sbjct: 906 F 906 >UniRef50_A9EW27 Putative uncharacterized protein n=1 Tax=Sorangium cellulosum 'So ce 56' RepID=A9EW27_SORC5 Length = 118 Score = 42.3 bits (98), Expect = 0.027, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 26/77 (33%), Gaps = 8/77 (10%) Query: 242 ETDITIRSSEKIL-IVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSL-KPENGENIGG 299 D+T+ +++ ++D KY LYQL Y + E + Sbjct: 3 RPDLTVSRGGRVVMVLDTKYRDLAAKEIGD------GILYQLSIYGVAFCPAEPAPPVPV 56 Query: 300 LLIYPHVDTAVKHRYKI 316 + +YP + + Sbjct: 57 VALYPGDASRAEETAVE 73 >UniRef50_O34303 Type-2 restriction enzyme BsuMI component ydjA n=1 Tax=Bacillus subtilis RepID=YDJA_BACSU Length = 465 Score = 42.3 bits (98), Expect = 0.030, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 53/145 (36%), Gaps = 10/145 (6%) Query: 191 ERNEKEMSLLYQKFLYEFCRRELTSANTTRSYLKWDASSISDQSLNLLPRMETDIT---- 246 K ++++ + + + S W + + +E DI Sbjct: 264 LYGTKYFHRVWEE-VCKTVFSHVNEYVKKISRPNW-INFTDIEVNKEKKTLEPDIIKAFE 321 Query: 247 IRSSEKILIVDAKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLK-PENGENIGGLLIYPH 305 RS E LI+DAKYY F + +++ + + Y +L+ G+ ++P Sbjct: 322 YRSKEYFLILDAKYYNINFDGKKLEGNPGVEDITKQLLYDKALEKLSRGKTKHNAFLFPS 381 Query: 306 VDTAVKHRYKINGFDIGLCTVNLGQ 330 + + +K+ G + +++ Sbjct: 382 SN--STNTFKVFG-SVDFDFLDIAA 403 >UniRef50_A1RQM2 Putative uncharacterized protein n=2 Tax=Thermoproteaceae RepID=A1RQM2_PYRIL Length = 384 Score = 41.9 bits (97), Expect = 0.031, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 51/167 (30%), Gaps = 41/167 (24%) Query: 164 FVISVCKFIVNNSI-PGQNKGHYRFYDFERNEKEMSLLYQKFLYEFCRRELTSANT---- 218 + K +V I G+ +G + F LY+ ++ L Sbjct: 174 AIYRAAKALVEGEIYVGERRGVGKALKFVN-----WRLYEMYIAMLVLEALRRLGWRTVG 228 Query: 219 ------------------TRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKY 260 SI ++ R D+T+ +++ +V+ KY Sbjct: 229 VDVEKRAVLVERDGKTLAVYLNRALPHHSIIEEVAGDEVRGRPDLTVANADVKAVVECKY 288 Query: 261 YKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVD 307 + ++ +Q+M Y+ + G+L+YP V Sbjct: 289 --------SDRPGYIARGRFQVMTYMCEYGAKI-----GILVYPAVS 322 >UniRef50_Q01P70 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q01P70_SOLUE Length = 155 Score = 41.9 bits (97), Expect = 0.038, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 35/91 (38%), Gaps = 13/91 (14%) Query: 258 AKYYKSIFSRRMGTEKFHSQNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVK--HRYK 315 KY + + + + +LYQL+ + +L GG L+Y H Sbjct: 4 VKYKRLPAT------TYQNSDLYQLLACVVALDLP-----GGTLVYAADKGVSAAVHAVV 52 Query: 316 INGFDIGLCTVNLGQEWPCIHQELLDIFDEY 346 N + + ++L P + Q++ +I + Sbjct: 53 QNRKRLEMVALDLSAPRPKLRQQISEIAERI 83 >UniRef50_UPI0001B9ECEF Domain of unknown function DUF2357 n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI0001B9ECEF Length = 810 Score = 41.5 bits (96), Expect = 0.051, Method: Composition-based stats. Identities = 40/433 (9%), Positives = 105/433 (24%), Gaps = 106/433 (24%) Query: 8 VRNI-YYMLTYAWGYLQEIKQANLEAIPGNNLLDILGYVLNKGVLQLS---RRGLELDYN 63 + N+ + L + + N +L + L + V ++ L + Sbjct: 183 IYNLSFDFLRKTYHLTGLKETKNQSLTEFFTILQHVFEQLVQAVERIQTSPNYKLVKEMR 242 Query: 64 PNTEIIPGIKGRIEFAKTIRGFHLNH------------GKTVSTF-----DMLNEDTLAN 106 G+ A + HL + + DT+ N Sbjct: 243 KVEADRVKKSGKENIAYLTKHPHLLRQDDRIGFVPINGHSLYPSHALETKRQIQYDTMEN 302 Query: 107 RIIKSTLAILIKH--------------------EKLNSTIRDEARSLYRKLPGISTLHLT 146 R ++ + + +++ S R L + + Sbjct: 303 RFVRWVIERISSKLKELKLRLSGKSRLQDPLLIKRIGSMQNQLQRLLSLDFLNVGGMKQL 362 Query: 147 PQHFSYLNGGKNTRYYKFVISVCKFIVNNSIPGQNKGHYRFYDFERNEKEMSLLYQKFLY 206 Y+ + ++ + F + K+++ LY+ + + Sbjct: 363 SVSLVLQMAPGYREVYRNYL----MLMKGLSI-------QSDLFRLSMKDLAQLYEYWCF 411 Query: 207 EFCRRELTSANTTRSYLKWD--------------------------------ASSISDQS 234 L+ +++ Sbjct: 412 LKIHDLLSRKYELLKQDIIKVQRTGIFVTLDKSAKARMVYRNPSNGEVFTLYYNALPKGD 471 Query: 235 LNLLPRMETDITIRSSEK------ILIVDAKYYKSIF------SRRMGTEKFHSQNLYQL 282 + + D + + I DAKY + R +++ + Sbjct: 472 RSATLGQKPDNVLTLKKNEAAVEYKYIFDAKYRLNSAYEGTPYYERYKQPGPEEEDINTM 531 Query: 283 MNYLWSLKPENGENIG-------GLLIYPHVDTAV--KHRYKINGFDIGLCTVN-LGQEW 332 Y ++ +G +++P+ D +HR+ + + + L Sbjct: 532 HRYRDAIVYADGHGQEYERSMFGAYVLFPYHDEEKFREHRFYKSIELVNVGAFPFLPNST 591 Query: 333 PCIHQELLDIFDE 345 + L +I + Sbjct: 592 ELMEAFLDEIIRD 604 >UniRef50_Q2FY99 Phi APSE P51-like protein n=34 Tax=root RepID=Q2FY99_STAA8 Length = 388 Score = 40.3 bits (93), Expect = 0.090, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 51/161 (31%), Gaps = 14/161 (8%) Query: 165 VISVCKFI--VNNSIPGQNKGHYRFYDFERNEKEMSLL---YQKFLYEF---CRRELTSA 216 + + + Q + + F +++RN+ L ++++ L+ Sbjct: 48 AHELSELYFSLKYEGLTQFEFNKAFQNYKRNQYYSEELREYVEEYVANVEEKYNEALSRD 107 Query: 217 NTTRSYLKWDASSISDQSLNLLPRMETDITIRSSEKILIVDAKYYKSIFSRRMGTEKFHS 276 + + + + D+ I S + I+D KY K I + + Sbjct: 108 DDVIALFETKLDLGKYVPESFGTG---DVIIFSGGVLEIIDLKYGKGIEVSAIDNPQLR- 163 Query: 277 QNLYQLMNYLWSLKPENGENIGGLLIYPHVDTAVKHRYKIN 317 LY L Y + + +I P +D I+ Sbjct: 164 --LYGLGAYELLSLVYDIHTVRMTIIQPRIDNFSTEELPIS 202 >UniRef50_C7QAD2 McrBC 5-methylcytosine restriction system component-like protein n=4 Tax=Actinomycetales RepID=C7QAD2_CATAD Length = 72 Score = 40.3 bits (93), Expect = 0.091, Method: Composition-based stats. Identities = 15/65 (23%), Positives = 26/65 (40%), Gaps = 6/65 (9%) Query: 279 LYQLMNYLWSLKPENGENIGGLLIYPHVD-TAVKHRYKINGFDIGLCTVNLGQEWPCIHQ 337 LYQ+M Y L+ G L+Y + H + G I ++L Q + + Sbjct: 4 LYQVMAYCTVLRL-----AHGHLVYAAGEPAIATHTVRAAGLAITAHALDLDQPPTRVEE 58 Query: 338 ELLDI 342 ++ I Sbjct: 59 QIAAI 63 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.312 0.142 0.389 Lambda K H 0.267 0.0435 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,907,063,058 Number of Sequences: 3077464 Number of extensions: 69264699 Number of successful extensions: 219495 Number of sequences better than 1.0e-01: 203 Number of HSP's better than 0.1 without gapping: 289 Number of HSP's successfully gapped in prelim test: 134 Number of HSP's that attempted gapping in prelim test: 217937 Number of HSP's gapped (non-prelim): 444 length of query: 348 length of database: 1,040,396,356 effective HSP length: 129 effective length of query: 219 effective length of database: 643,403,500 effective search space: 140905366500 effective search space used: 140905366500 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 94 (40.7 bits)