BLASTP 2.2.22 [Sep-27-2009]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin,
John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.

Query= batch____
         (464 letters)

Database: uniref50.fasta
           3,077,464 sequences; 1,040,396,356 total letters

Searching..................................................done

Results from round 1

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

UniRef50_P05719 Type-1 restriction enzyme EcoKI specificity prot...  951  0.0
UniRef50_A1AJL9 HsdS, type I site-specific deoxyribonuclease n=2...  333  6e-90
UniRef50_B5BKY5 Subunit S of type I restriction-modification sys...  313  9e-84
UniRef50_P06990 Type-1 restriction enzyme EcoBI specificity prot...  301  3e-80
UniRef50_A9N788 Putative uncharacterized protein n=3 Tax=Salmone...  278  3e-73
UniRef50_B3YJG5 Type I restriction enzyme EcoKI specificity prot...  274  4e-72
UniRef50_P06187 Type-1 restriction enzyme StySJI specificity pro...  274  5e-72
UniRef50_D0KMA1 Restriction modification system DNA specificity ...  254  4e-66
UniRef50_P06991 Type-1 restriction enzyme EcoDI specificity prot...  252  2e-65
UniRef50_B7LQL4 Specificity determinant for hsdM and hsdR (Modul...  250  1e-64
UniRef50_A3JH04 Specificity determinant for hsdM and hsdR n=1 Ta...  226  1e-57
UniRef50_A1TWL9 Restriction modification system DNA specificity ...  214  5e-54
UniRef50_B5IN27 HsdS, type I site-specific deoxyribonuclease n=1...  213  1e-53
UniRef50_A4BLE7 Type I restriction enzyme StySPI specificity pro...  200  9e-50
UniRef50_D1UP80 Restriction modification system DNA specificity ...  198  3e-49
UniRef50_A3EKX4 Type I restriction modification DNA specificity ...  189  1e-46
UniRef50_D1C5W4 Restriction modification system DNA specificity ...  178  4e-43
UniRef50_A0KZG7 Restriction modification system DNA specificity ...  178  5e-43
UniRef50_C9P2L6 HsdS type I site-specific deoxyribonuclease n=1 ...  177  5e-43
UniRef50_C9NQJ7 HsdS type I site-specific deoxyribonuclease n=1 ...  169  2e-40
UniRef50_B8GGK0 Restriction modification system DNA specificity ...  169  2e-40
UniRef50_A8ZTW4 Restriction modification system DNA specificity ...  163  1e-38
UniRef50_Q1MKB2 Putative type I restriction enzyme specificity s...  161  4e-38
UniRef50_B2SI71 Type I restriction-modification system, S subuni...  161  4e-38
UniRef50_A3UV36 Type I restriction enzyme specificity protein n=...  158  3e-37
UniRef50_C0VG50 Type I restriction modification enzyme protein S...  157  7e-37
UniRef50_B0VPS8 Specificity determinant for hsdM and hsdR n=1 Ta...  152  2e-35
UniRef50_Q8PTL2 Type I restriction-modification system specifici...  151  4e-35
UniRef50_A6DQ81 Putative restriction-modification system specifi...  147  7e-34
UniRef50_C9YAL6 Putative uncharacterized protein n=1 Tax=Curviba...  147  1e-33
UniRef50_UPI0001695152 type I restriction enzyme specificity pro...  146  1e-33
UniRef50_Q2P0A3 Specificity determinant for hsdM and hsdR n=2 Ta...  146  1e-33
UniRef50_C5SE02 Restriction modification system DNA specificity ...  145  4e-33
UniRef50_B6R0S6 Restriction modification system DNA specificity ...  138  5e-31
UniRef50_A7IEA1 Restriction modification system DNA specificity ...  138  5e-31
UniRef50_A1BGI9 Restriction modification system DNA specificity ...  135  3e-30
UniRef50_Q210J8 Type I restriction enzyme StySPI specificity pro...  133  2e-29
UniRef50_A6E2R5 Restriction endonuclease S subunits-like protein...  130  1e-28
UniRef50_B4VXC6 Type I restriction modification DNA specificity ...  129  2e-28
UniRef50_B2A6M8 Restriction modification system DNA specificity ...  125  4e-27
UniRef50_C3MFA1 Putative restriction endonuclease type I, S subu...  121  5e-26
UniRef50_UPI0001C36A8C HsdS1 n=1 Tax=Clostridium hathewayi DSM 1...  119  3e-25
UniRef50_Q7UK98 Type I restriction enzyme EcoEI specificity prot...  118  5e-25
UniRef50_Q26D97 Putative type I site-speicific deoxyribonuclease...  118  6e-25
UniRef50_C5TIE5 Restriction modification system DNA specificity ...  116  2e-24
UniRef50_B8E4I3 Restriction modification system DNA specificity ...  115  4e-24
UniRef50_C9RY89 Restriction modification system DNA specificity ...  115  4e-24
UniRef50_B4B315 Restriction modification system DNA specificity ...  114  8e-24
UniRef50_Q466N9 Type I restriction-modification system specifici...  113  2e-23
UniRef50_A7N438 Putative uncharacterized protein n=1 Tax=Vibrio ...  112  4e-23
UniRef50_B0PEE2 Putative uncharacterized protein n=1 Tax=Anaerot...  111  6e-23
UniRef50_A3JFC5 Restriction modification system DNA specificity ...  110  2e-22
UniRef50_Q0VNH1 Type I restriction-modification system, S subuni...  109  2e-22
UniRef50_A4ILD9 Putative type I specificity subunit HsdS n=1 Tax...  108  4e-22
UniRef50_Q73D72 Type I restriction-modification enzyme, S subuni...  108  5e-22
UniRef50_Q1Z9T4 Type I restriction-modification system, S subuni...  107  9e-22
UniRef50_D2TNZ5 Putative type I restriction modification system ...  107  1e-21
UniRef50_UPI0001BC509B restriction modification system DNA speci...  107  1e-21
UniRef50_C4KDM6 Restriction modification system DNA specificity ...  106  2e-21
UniRef50_A7VYZ3 Putative uncharacterized protein n=1 Tax=Clostri...  106  2e-21
UniRef50_A6TLK6 Restriction modification system DNA specificity ...  106  2e-21
UniRef50_Q4HNM9 Type I restriction-modification system S subunit...  106  2e-21
UniRef50_C6IKX2 Type I restriction-modification system n=2 Tax=B...  105  3e-21
UniRef50_D0WYM6 Putative uncharacterized protein n=1 Tax=Vibrio ...  104  5e-21
UniRef50_Q3JBU1 Restriction modification system DNA specificity ...  104  6e-21
UniRef50_Q4C702 Restriction modification system DNA specificity ...  103  1e-20
UniRef50_UPI0001855288 conserved hypothetical protein n=1 Tax=Fr...  103  2e-20
UniRef50_UPI0001B4DA32 restriction endonuclease S subunits-like ...  102  2e-20
UniRef50_A3PYN5 Restriction modification system DNA specificity ...  101  6e-20
UniRef50_Q0W5N3 Type I restriction modification system, specific...  101  7e-20
UniRef50_A9I6S0 Type I restriction-modification system, S subuni...  100  8e-20
UniRef50_Q5LPW5 Type I restriction-modification system, S subuni...  100  1e-19
UniRef50_A1UJN5 Restriction endonuclease S subunits-like protein...  100  2e-19
UniRef50_B0NIH0 Putative uncharacterized protein n=1 Tax=Clostri...  99  3e-19
UniRef50_A4AEB7 Type I restriction-modification system, endonucl...  99  5e-19
UniRef50_C3Q383 Putative uncharacterized protein n=1 Tax=Bactero...  97  9e-19
UniRef50_A3SCN8 Restriction endonuclease S subunit-like protein ...  97  1e-18
UniRef50_C6J5M6 Putative uncharacterized protein n=1 Tax=Paeniba...  97  2e-18
UniRef50_UPI000196B4FC hypothetical protein CATMIT_01648 n=1 Tax...  97  2e-18
UniRef50_Q1NNI4 Restriction modification system DNA specificity ...  97  2e-18
UniRef50_C2CSZ9 Type I restriction modification DNA specificity ...  96  3e-18
UniRef50_B0QS41 Type I restriction enzyme EcoKI subunit R n=1 Ta...  96  3e-18
UniRef50_A4VH87 Type I restriction-modification system, S subuni...  96  4e-18
UniRef50_Q2J5T0 Restriction modification system DNA specificity ...  95  5e-18
UniRef50_Q0W4T6 Type I restriction modification system, specific...  94  1e-17
UniRef50_B0JHV8 Restriction modification system DNA specificity ...  93  2e-17
UniRef50_A6H2J0 Restriction-modification enzyme n=3 Tax=Gammapro...  93  2e-17
UniRef50_A1VBQ9 Restriction modification system DNA specificity ...  93  2e-17
UniRef50_D2NCT2 Putative uncharacterized protein n=1 Tax=Escheri...  92  3e-17
UniRef50_C5RH89 Restriction modification system DNA specificity ...  92  4e-17
UniRef50_Q1NN41 Restriction modification system DNA specificity ...  92  4e-17
UniRef50_Q12PV3 Restriction modification system DNA specificity ...  91  9e-17
UniRef50_B7VNG6 Type I restriction enzyme EcoKI, S subunit n=1 T...  91  9e-17
UniRef50_C2CF25 Restriction modification system DNA specificity ...  89  3e-16
UniRef50_A5UR98 Restriction modification system DNA specificity ...  88  6e-16
UniRef50_Q1R1F8 Restriction modification system DNA specificity ...  88  8e-16
UniRef50_Q5YW32 Putative restriction-modification system specifi...  87  2e-15
UniRef50_B5VW93 Restriction modification system DNA specificity ...  86  3e-15
UniRef50_C9P132 Type I restriction-modification system specifici...  85  6e-15
UniRef50_C1PCQ5 Restriction modification system DNA specificity ...  85  6e-15
UniRef50_Q8RJG0 HsdS n=12 Tax=Campylobacter jejuni RepID=Q8RJG0_...  85  7e-15
UniRef50_B5W475 Restriction modification system DNA specificity ...  85  7e-15
UniRef50_B1LRG3 Type I restriction modification DNA specificity ...  84  8e-15
UniRef50_B7JRE7 Restriction modification system DNA specificity ...  84  8e-15
UniRef50_C6JA10 Putative uncharacterized protein n=1 Tax=Ruminoc...  84  9e-15
UniRef50_D2LA90 Restriction modification system DNA specificity ...  84  1e-14
UniRef50_B3G223 Type I restriction modification DNA specificity ...  84  1e-14
UniRef50_A5KSY3 Restriction modification system DNA specificity ...  84  1e-14
UniRef50_UPI0001AF5E36 restriction modification system DNA speci...  84  1e-14
UniRef50_Q8TP22 Type I site-specific deoxyribonuclease n=1 Tax=M...  83  2e-14
UniRef50_A5W9C1 Restriction modification system DNA specificity ...  83  2e-14
UniRef50_B4RYU8 Type I site-specific deoxyribonuclease n=1 Tax=A...  82  4e-14
UniRef50_D2KHV4 Putative type I restriction-modification system ...  82  4e-14
UniRef50_UPI00016ADEAA restriction modification system DNA speci...  82  5e-14
UniRef50_B4TEJ6 Restriction modification system DNA specificity ...  82  6e-14
UniRef50_A1SW07 Restriction modification system DNA specificity ...  81  8e-14
UniRef50_Q4HFD9 HsdS n=3 Tax=Campylobacterales RepID=Q4HFD9_CAMCO  81  9e-14
UniRef50_UPI00016B0992 probable type I restriction-modification ...  81  9e-14
UniRef50_B0P5V5 Putative uncharacterized protein n=1 Tax=Anaerot...  81  9e-14
UniRef50_A4T4W1 Restriction modification system DNA specificity ...  80  1e-13
UniRef50_B5FA22 Restriction modification system DNA specificity ...  80  1e-13
UniRef50_D1YNY9 Type I restriction modification DNA specificity ...  80  2e-13
UniRef50_C1D7R6 Type I restriction-modification system, S subuni...  80  2e-13
UniRef50_B0RQ64 Type I site-specific DNA methyltransferase speci...  80  2e-13
UniRef50_Q0RKJ6 Type I restriction modification enzyme protein S...  80  2e-13
UniRef50_A6L7U8 Type I restriction enzyme EcoAI specificity prot...  80  2e-13
UniRef50_UPI0001694BE8 putative type I restriction enzyme specif...  79  3e-13
UniRef50_B3E2V8 Restriction modification system DNA specificity ...  79  3e-13
UniRef50_C6Q0B1 Restriction modification system DNA specificity ...  79  4e-13
UniRef50_A1TSH8 Restriction modification system DNA specificity ...  79  4e-13
UniRef50_UPI00016B1071 restriction modification system DNA speci...  78  5e-13
UniRef50_Q6D2H5 Subunit S of type I restriction-modification sys...  78  5e-13
UniRef50_C6JN70 Predicted protein n=1 Tax=Fusobacterium varium A...  78  5e-13
UniRef50_A3PPQ8 Restriction modification system DNA specificity ...  78  7e-13
UniRef50_D1JFQ8 Putative type I restriction enzyme, DNA specific...  77  1e-12
UniRef50_B2V7V7 Restriction modification system DNA specificity ...  77  2e-12
UniRef50_UPI000197A104 putative Type I restriction enzyme EcoR12...  77  2e-12
UniRef50_C3WQF7 Restriction modification system DNA specificity ...  76  2e-12
UniRef50_C0XBA7 Type I restriction-modification system, S subuni...  76  2e-12
UniRef50_A4T8B4 Restriction modification system DNA specificity ...  76  2e-12
UniRef50_B3JQ19 Putative uncharacterized protein n=2 Tax=Bacteri...  76  3e-12
UniRef50_C0QCH4 HsdS2 n=1 Tax=Desulfobacterium autotrophicum HRM...  76  3e-12
UniRef50_B3E898 Restriction modification system DNA specificity ...  76  3e-12
UniRef50_Q1NNJ9 Putative uncharacterized protein n=1 Tax=delta p...  75  4e-12
UniRef50_Q3IEL0 Putative type I restriction-modification system,...  75  5e-12
UniRef50_Q30YF3 Subunit S of type I restriction-modification sys...  75  5e-12
UniRef50_D0LNE2 Restriction modification system DNA specificity ...  75  6e-12
UniRef50_UPI0001907424 putative type I restriction enzyme specif...  75  7e-12
UniRef50_C1ZA47 Restriction endonuclease S subunit n=1 Tax=Planc...  75  7e-12
UniRef50_B3PQK6 Probable type I restriction-modification system ...  74  8e-12
UniRef50_UPI0001BC364B restriction modification system DNA speci...  74  9e-12
UniRef50_B0BR05 Type I restriction enzyme EcoAI specificity prot...  74  9e-12
UniRef50_Q8TP07 Type I site-specific deoxyribonuclease n=1 Tax=M...  74  1e-11
UniRef50_A7I739 Restriction modification system DNA specificity ...  74  1e-11
UniRef50_UPI0001BC2C80 restriction endonuclease S subunits-like ...  74  1e-11
UniRef50_B3H2F5 Type I restriction-modification system, S subuni...  74  1e-11
UniRef50_A3XPV6 Type I restriction-modification system specifici...  74  2e-11
UniRef50_D2QTT7 Restriction modification system DNA specificity ...  74  2e-11
UniRef50_B7CAX1 Putative uncharacterized protein n=1 Tax=Eubacte...  74  2e-11
UniRef50_Q028F8 Putative uncharacterized protein n=1 Tax=Candida...  74  2e-11
UniRef50_A4FXL8 Restriction modification system DNA specificity ...  73  2e-11
UniRef50_C7QRY1 Restriction modification system DNA specificity ...  73  2e-11
UniRef50_C4KBJ9 Restriction modification system DNA specificity ...  73  2e-11
UniRef50_C7RQC3 Type I restriction-modification system specifici...  73  2e-11
UniRef50_B7KLD8 Restriction modification system DNA specificity ...  73  2e-11
UniRef50_B9M293 Restriction endonuclease S subunit-like protein ...  73  3e-11
UniRef50_C3NN82 Restriction modification system DNA specificity ...  73  3e-11
UniRef50_Q3AQE4 Restriction endonuclease S subunits-like n=1 Tax...  72  4e-11
UniRef50_A1BGA0 Restriction modification system DNA specificity ...  72  5e-11
UniRef50_Q1K3D0 Restriction modification system DNA specificity ...  72  5e-11
UniRef50_Q21ZK2 Restriction modification system DNA specificity ...  71  7e-11
UniRef50_C7XC38 Putative uncharacterized protein n=1 Tax=Parabac...  71  7e-11
UniRef50_B7R237 Type I restriction modification system, subunit ...  71  9e-11
UniRef50_C7RNT4 Restriction endonuclease S subunits-like protein...  71  9e-11
UniRef50_Q6F778 Putative type I restriction-modification system ...  71  9e-11
UniRef50_A0ZMI3 Putative uncharacterized protein n=1 Tax=Nodular...  71  1e-10
UniRef50_D0C390 Type I restriction-modification system specifici...  71  1e-10
UniRef50_A8YFX5 HsdS protein n=2 Tax=Microcystis aeruginosa PCC ...  71  1e-10
UniRef50_Q8TN78 Type I restriction modification enzyme protein S...  70  1e-10
UniRef50_Q8GN10 Putative type I specificity subunit HsdS n=3 Tax...  70  1e-10
UniRef50_A5KSM3 Restriction modification system DNA specificity ...  70  2e-10
UniRef50_A8YCA1 HsdS protein n=1 Tax=Microcystis aeruginosa PCC ...  70  2e-10
UniRef50_Q1VSP4 Restriction endonuclease S subunits n=1 Tax=Psyc...  70  2e-10
UniRef50_Q307D8 Type I RM system S subunit n=1 Tax=Arthrospira p...  70  2e-10
UniRef50_A8RUN3 Putative uncharacterized protein n=1 Tax=Clostri...  70  2e-10
UniRef50_Q4FUM9 Possible type I restriction-modification system,...  70  2e-10
UniRef50_D1XRZ5 Restriction modification system DNA specificity ...  70  2e-10
UniRef50_A4FZ34 Restriction modification system DNA specificity ...  70  2e-10
UniRef50_D1PEN6 Type I restriction-modification enzyme S subunit...  69  3e-10
UniRef50_C3RBV6 Type I restriction-modification system n=3 Tax=B...  69  3e-10
UniRef50_C6CZ61 Restriction modification system DNA specificity ...  69  3e-10
UniRef50_UPI000190446B type I restriction-modification system, S...  69  3e-10
UniRef50_C6A4W8 Putative type I specificity subunit HsdS n=1 Tax...  69  3e-10
UniRef50_C2H9J2 Possible type I restriction-modification system ...  69  3e-10
UniRef50_D0BWI7 Predicted protein n=1 Tax=Acinetobacter sp. RUH2...  69  4e-10
UniRef50_B9KF72 Type I restriction-modification system, S subuni...  69  4e-10
UniRef50_B2J095 Restriction modification system DNA specificity ...  69  4e-10
UniRef50_A6UXD7 Type I restriction-modification system, S subuni...  69  5e-10
UniRef50_Q0EXK2 HsdS protein n=1 Tax=Mariprofundus ferrooxydans ...  69  5e-10
UniRef50_C7NM09 Putative uncharacterized protein n=1 Tax=Kytococ...  69  5e-10
UniRef50_Q0RV87 Type I restriction-modification system specifici...  69  5e-10
UniRef50_UPI0001BCA660 restriction modification system DNA speci...  68  6e-10
UniRef50_C3PVT7 Type I restriction enzyme EcoR124II specificity ...  68  7e-10
UniRef50_Q4HNY2 Type I restriction-modification system specifici...  68  8e-10
UniRef50_A6W078 Restriction modification system DNA specificity ...  68  9e-10
UniRef50_B2IP18 Type I restriction-modification system, S subuni...  68  9e-10
UniRef50_B6VTA2 Putative uncharacterized protein n=1 Tax=Bactero...  67  1e-09
UniRef50_Q0EWP9 Type I restriction-modification system, S subuni...  67  1e-09
UniRef50_B0JXI4 Putative type I restriction enzyme specificity p...  67  1e-09
UniRef50_A6CKF2 Putative type I restriction enzyme specificity p...  67  1e-09
UniRef50_Q89Z57 Putative type I restriction enzyme S.BthVORF4518...  67  2e-09
UniRef50_A6C679 Type I restriction-modification system, S subuni...  67  2e-09
UniRef50_Q12YI6 Restriction modification system DNA specificity ...  67  2e-09
UniRef50_B5IRS1 Type I restriction modification DNA specificity ...  67  2e-09
UniRef50_B1XQR8 Type 1 restriction-modification system specifici...  67  2e-09
UniRef50_Q1Q456 Putative uncharacterized protein n=1 Tax=Candida...  66  3e-09
UniRef50_C2I227 Restriction modification system DNA specificity ...  66  3e-09
UniRef50_C6DAR8 Restriction modification system DNA specificity ...  66  3e-09
UniRef50_B3QN66 Restriction modification system DNA specificity ...  66  3e-09
UniRef50_A6EUA9 Type I restriction-modification system, S subuni...  66  3e-09
UniRef50_A6Y5S9 Restriction endonuclease S subunit n=1 Tax=Vibri...  66  3e-09
UniRef50_B3R3C2 Type I restriction-modification methylase S subu...  66  3e-09
UniRef50_C6CR26 Restriction modification system DNA specificity ...  66  3e-09
UniRef50_B7K558 Restriction modification system DNA specificity ...  65  4e-09
UniRef50_A9CZ30 Restriction endonuclease S subunit n=1 Tax=Shewa...  65  4e-09
UniRef50_UPI0001AF6F3B polypeptide HsdS n=1 Tax=Mycobacterium ka...  65  5e-09
UniRef50_A8ZVS3 Restriction modification system DNA specificity ...  65  5e-09
UniRef50_B0RYC3 Type I site-specific deoxyribonuclease (Specific...  65  5e-09
UniRef50_A5GB19 Restriction modification system DNA specificity ...  65  5e-09
UniRef50_Q04LY7 Type I restriction-modification system, S subuni...  65  5e-09
UniRef50_C2RWF6 N-6 DNA methylase n=1 Tax=Bacillus cereus BDRD-S...  65  5e-09
UniRef50_C2QHW5 Putative uncharacterized protein n=2 Tax=Bacillu...  65  6e-09
UniRef50_B8K9P9 Restriction modification system DNA specificity ...  65  6e-09
UniRef50_Q3J7Q5 Restriction endonuclease S subunits-like n=2 Tax...  65  6e-09
UniRef50_C2GFC3 Restriction modification system DNA specificity ...  65  7e-09
UniRef50_UPI0001C15DDF Restriction modification system DNA speci...  65  8e-09
UniRef50_B1ZYW8 Restriction endonuclease S subunits-like protein...  65  8e-09
UniRef50_D0IJZ0 Type I restriction-modification system specifici...  65  8e-09
UniRef50_Q7MNA3 Restriction endonuclease S subunit n=1 Tax=Vibri...  64  9e-09
UniRef50_Q8YRH1 Type I restriction-modification enzyme S subunit...  64  9e-09
UniRef50_A3PKU6 Restriction modification system DNA specificity ...  64  9e-09
UniRef50_Q1VAF2 Hypothetical type I restriction-modification sys...  64  1e-08
UniRef50_Q112D6 Restriction modification system DNA specificity ...  64  1e-08
UniRef50_C9Q5S0 Possible type I restriction-modification system ...  64  2e-08
UniRef50_C8W862 Putative uncharacterized protein n=1 Tax=Atopobi...  63  2e-08
UniRef50_A0KWU0 Restriction modification system DNA specificity ...  63  2e-08
UniRef50_C5B9C5 Type I restriction-modification system, S subuni...  63  2e-08
UniRef50_C3XPA5 Restriction modification system DNA specificity ...  63  2e-08
UniRef50_B5GJX8 Type I restriction modification enzyme protein S...  63  2e-08
UniRef50_B9XT14 Restriction modification system DNA specificity ...  63  3e-08

>UniRef50_P05719 Type-1 restriction enzyme EcoKI specificity protein n=5 Tax=Enterobacteriaceae RepID=T1SK_ECOLI
          Length = 464

Score = 951 bits (2457), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 464/464 (100%), Positives = 464/464 (100%)

Query: 1   MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60
           MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV
Sbjct: 1   MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60

Query: 61  FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120
           FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG
Sbjct: 61  FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120

Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180
           FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD
Sbjct: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180

Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240
           STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS
Sbjct: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240

Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300
           SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV
Sbjct: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300

Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360
           GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI
Sbjct: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360

Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420
           SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG
Sbjct: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420

Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464
           ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS
Sbjct: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464

>UniRef50_A1AJL9 HsdS, type I site-specific deoxyribonuclease n=2 Tax=Escherichia coli RepID=A1AJL9_ECOK1
          Length = 455

Score = 333 bits (855), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 215/473 (45%), Positives = 276/473 (58%), Gaps = 27/473 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAIN---------YLKDDYLPLIRANNIQN 51 MSAGKLPEGW + + +I G T K A N +L L + I + Sbjct: 1 MSAGKLPEGWEQIEIGDIADVISGGTPKSGVAENFAPSGEGVAWLTPADLSGYKEKYISH 60 Query: 52 GKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL 111 G D T L + S K+ P+ ++ S V +A++ + F +F Sbjct: 61 GARDLTTLGYS-----SCSAKLMPKGTILFSSRAPIGYVAIAANE-IATNQGFKSFA--- 111 Query: 112 RPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 P IF + +F ++ R+ + G I +S + + P AEQKIIAEK Sbjct: 112 FPSD-IFPDYAYYFLRN--IRHIAEEMGTGTTFKEISGSSAKTLPFVLVPFAEQKIIAEK 168 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI 231 LDTLLAQVDSTKAR EQIPQILKRFRQAVLG AV GKLTE WR+ S +++ Sbjct: 169 LDTLLAQVDSTKARLEQIPQILKRFRQAVLGAAVRGKLTEDWRD-NSSLSGWREGKLGEF 227 Query: 232 LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 + + G SSK N+ G+ P+LR+ +++ G +D D+ + E+ ++KL+ D+LF Sbjct: 228 IKKPSYGTSSKSNKEGL-IPVLRMGNLQGGKLDWTDLVYT-SDTIEIEKYKLEYNDVLFN 285 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 R N S E VG + K Q +Y LIR + D P+Y+ +S R + Sbjct: 286 RTN-SPELVGKTAIYK--SEQPAIYAGYLIRVQCLPDLNPDYLNYHLNSILGRQYCYSVK 342 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 Q I+ + + + + +PP+ EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ Sbjct: 343 SDGVSQSNINAQKLIAYPITVPPLPEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 402 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 SILAKAFRGELTAQWRAENP+LISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 403 SILAKAFRGELTAQWRAENPELISGENSAAALLEKIKAERAASGGKKASRKKS 455 >UniRef50_B5BKY5 Subunit S of type I restriction-modification system n=7 Tax=Salmonella enterica subsp. enterica RepID=B5BKY5_SALPK Length = 462 Score = 313 bits (802), Expect = 9e-84, Method: Compositional matrix adjust. 
Identities = 203/477 (42%), Positives = 278/477 (58%), Gaps = 28/477 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPEGWV +S + + + K ++ +K +R +I G D + + Sbjct: 1 MSGGKLPEGWVTTHLSEICSKPQYGYTTKSSSMGDVK-----FLRTTDITKGAVDWSSVP 55 Query: 61 FV---PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 + P+++ K ++ DIVI S + SV Q+ P + F ++ +P Sbjct: 56 YCMDAPEDVSK--YQLQDRDIVI---SRAGSVGFSFLVQNPPSQVVFASYLIRFKPVNYF 110 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ F +SS Y N++S +SAG + N+ + +PIPP+AEQKIIAEKLDTLLA Sbjct: 111 SEYYLKRFLESSDYWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLA 170 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT-EKWRNFEPQHSVFKKLNFESI----- 231 QVDSTKAR EQIPQILKRFRQAVL AV+G L RN P S ++ + S Sbjct: 171 QVDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQWPDLPSTWSVHK 230 Query: 232 ---LTELRNG-LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 L + R G + K G L +VR D +++ + S+ E L+ GD Sbjct: 231 YSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGD 290 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L G C + + Q +++ L RAR+ +PE++ ++ + N Sbjct: 291 VLICEGGEP----GRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWL-VYNLKNDSNNIS 345 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++ + T + K ++GK + + + +PP++EQ EIVRRVEQLFA+ADTIEKQVNNAL RVN Sbjct: 346 LSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVN 405 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 +LTQSILAKAFRGELTAQWRAENP LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 406 SLTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 462 >UniRef50_P06990 Type-1 restriction enzyme EcoBI specificity protein n=2 Tax=Escherichia coli RepID=T1SB_ECOLX Length = 474 Score = 301 bits (771), Expect = 3e-80, Method: Compositional matrix adjust. 
Identities = 199/483 (41%), Positives = 274/483 (56%), Gaps = 59/483 (12%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W+ + +V + G +K + N + D +PLIR ++ G T +P Sbjct: 23 DSWLRISMDSVANITNGFAFKSSEFNN--RKDGVPLIRIRDVLKGNTSTYYSGQIP---- 76 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-----CGVLRPEKLIFSGFI 122 E + PED+++ M + + CS A C + E F Sbjct: 77 -EGYWVYPEDLIVGMDGDFNATIW----------CSEPALLNQRVCKIEVQEDKYNKRFF 125 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 H Y + I++ ++ + ++ + +P+PPLAEQKIIAEKLDTLLAQVDST Sbjct: 126 YHALPG--YLSAINANTSSVTVKHLSSRTLQDTLLPLPPLAEQKIIAEKLDTLLAQVDST 183 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNF------------ES 230 KAR EQIPQILKRFRQAVL AV G+LT++ ++F + KK+ E+ Sbjct: 184 KARLEQIPQILKRFRQAVLAAAVTGRLTKEDKDF-----ITKKVELDNYKILIPEDWSET 238 Query: 231 ILTELRNGLSSKPNESGVGHP---------ILRISSVRAGHVDQNDIRFLECS-ESELNR 280 IL + N + +P GV P ++R+ + G VD N +R + + + R Sbjct: 239 ILNNIIN--TQRPLCYGVVQPGDDIKDGIELIRVCDINDGEVDLNHLRKISKEIDLQYKR 296 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 K++ D+L T +G G++++ + N+ I K +P ++ I+ SS Sbjct: 297 SKVRKNDILVTIVGA----IGRIGIVREDINVNIARAVARISPEY-KIIVPMFLHIWLSS 351 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 P + ++ K + +K ++ KD+K+ V LP ++EQ EIVRRVEQLFAYAD+IEKQVN Sbjct: 352 PVMQTWLVQSSKEVA-RKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVN 410 Query: 401 NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKAS 460 NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKAS Sbjct: 411 NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKAS 470 Query: 461 RKK 463 RKK Sbjct: 471 RKK 473 >UniRef50_A9N788 Putative uncharacterized protein n=3 Tax=Salmonella enterica subsp. enterica RepID=A9N788_SALPB Length = 467 Score = 278 bits (711), Expect = 3e-73, Method: Compositional matrix adjust. 
Identities = 193/479 (40%), Positives = 268/479 (55%), Gaps = 27/479 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPE WV + + + G K +A+ ++ P IR + +NG + + + Sbjct: 1 MSGGKLPEEWVKTTIGVICEVKGGKRLPKGKALLNTATEH-PYIRVTDFENGSVNLSTIK 59 Query: 61 FVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQ--HLPFECSFGAFCGVLRPEKL 116 ++ + + IS D+ I+++ G+ ++G+ Q + + C +L +K Sbjct: 60 YLDSDTYSAISNYTISKNDLYISIA-GTIGLIGEIPEQLDNANLTENAAKLCFILGTDKK 118 Query: 117 IFSGFIAHFTKSSLYRNKI-SSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 ++ + +K SS + I+ F P P+ EQKIIAEKLDTL Sbjct: 119 YLKHVLSSNKTIEQFDDKTTSSGQPKLALFRIRDCEF-----PYAPINEQKIIAEKLDTL 173 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT-EKWRNFEPQHSVFKKLNFESI--- 231 LAQVDSTKAR EQIPQILKRFRQAVL AV+G L RN P S ++ + S Sbjct: 174 LAQVDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQWPDLPSTWSV 233 Query: 232 -----LTELRNG-LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 L + R G + K G L +VR D +++ + S+ E L+ Sbjct: 234 HKYSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKL 293 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 GD+L G C + + Q +++ L RAR+ +PE++ ++ + N Sbjct: 294 GDVLICEGGEP----GRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWL-VYNLKNDSNN 348 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 ++ + T + K ++GK + + + +PP++EQ EIVRRVEQLFAYADTIEKQVNNAL R Sbjct: 349 ISLSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAYADTIEKQVNNALTR 408 Query: 406 VNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 VN+LTQSILAKAFRGELTAQWRAENP LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 409 VNSLTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 467 >UniRef50_B3YJG5 Type I restriction enzyme EcoKI specificity protein n=3 Tax=Gammaproteobacteria RepID=B3YJG5_SALET Length = 486 Score = 274 bits (701), Expect = 4e-72, Method: Compositional matrix adjust. 
Identities = 202/519 (38%), Positives = 279/519 (53%), Gaps = 88/519 (16%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKK--EQAINYLKDD---------------YLPL 43 MSAGKLPEGWV + + V Y K ++ ++ + DD L Sbjct: 1 MSAGKLPEGWVDTQLGNI------VDYGKATKRVLSDVNDDTWVLELEDIEKESSKLLST 54 Query: 44 IRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS 103 IRA+ F +T F +++ + I+IA G + +P C+ Sbjct: 55 IRASE---RPFKSTKNSFKRGDVLYGKLRPYLNKIIIAKEDGV------CTTEIIPL-CA 104 Query: 104 FGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLA 163 + C + +I ++ KSS ++ ++ +S G N+ + A + + PLA Sbjct: 105 EPSCC----------NKYIFYWLKSSTFQGYVNDVSYGVNMPRLGTADGLKAPLRLAPLA 154 Query: 164 EQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV- 222 EQKIIAEKLDTLLAQ+DSTKAR EQIPQILKRFRQAVL AV+G LT +WR + V Sbjct: 155 EQKIIAEKLDTLLAQIDSTKARLEQIPQILKRFRQAVLAAAVSGNLTAEWRMNNNSNIVE 214 Query: 223 ---------------------FKKLN-------------FESILTELRNGLSSKPNESGV 248 + KL+ +SI T++ +G P Sbjct: 215 EEIEKVKNKLIAKKIIKKDLIYSKLDRKYPIPSDWLYVKLQSIATKITDGEHKTPKREPA 274 Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G ++ +++ G++ +D+ ++ +E + NR GD+L + +GS +G L+ Sbjct: 275 GQLLISARNIQDGYLKLSDVDYVGDAEFQKLRNRCDPDSGDVLIS-CSGS---IGRVCLV 330 Query: 307 KKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + ++ LI+ L +D + +Y+ SP + + K+T+ Q + I Sbjct: 331 DENSKYVMVRSVALIK--LMQDFVINKYMMYLLQSPLLQKEIEENSKSTA-QANLFLGPI 387 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQ 425 K+ + LPPV EQAEIVRRVEQLFAYADTIEKQVN+AL RVN+LTQSILAKAFRGELTAQ Sbjct: 388 KNLGIPLPPVPEQAEIVRRVEQLFAYADTIEKQVNSALTRVNSLTQSILAKAFRGELTAQ 447 Query: 426 WRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 WR ENP LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 448 WRTENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 486 >UniRef50_P06187 Type-1 restriction enzyme StySJI specificity protein n=8 Tax=Enterobacteriaceae RepID=T1S_SALTY Length = 469 Score = 274 bits (701), Expect = 5e-72, Method: Compositional matrix adjust. 
Identities = 204/506 (40%), Positives = 267/506 (52%), Gaps = 79/506 (15%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK--FDTTD 58 MS GKLPEGW + ++ + L K + + L ++P+ GK F+T Sbjct: 1 MSGGKLPEGWATSTINEMCNL-----NPKLKLDDDLDVGFMPMAGVPTTYLGKCNFETKK 55 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSS----GSKSVVGKSAHQHLPFECSFGA-------- 106 V K + +D++ A + K+VV K F +GA Sbjct: 56 WSEVKKGFTQ----FQNDDVIFAKITPCFENGKAVVIKE------FPNGYGAGSTEYYVL 105 Query: 107 --FCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAE 164 G++ P L A N ++S + + +P+PPLAE Sbjct: 106 RSINGLINPHWLF-----ALVKTKDFLTNGALNMSGSVGHKRVTKEFLENYGVPVPPLAE 160 Query: 165 QKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE------KWRNFEP 218 QK+IAEKLDTLLAQVDSTKAR EQIPQILKRFRQ+V+ AVNG+LT+ K++ E Sbjct: 161 QKVIAEKLDTLLAQVDSTKARLEQIPQILKRFRQSVIVAAVNGQLTKELHKKNKFKLTEL 220 Query: 219 QHSVFKKLNFESI--LTELRNGLSSKPNES----GVGHPILRISSVRAGHV-DQNDIRFL 271 S+ I +++ G ES G P +R ++ G V + + Sbjct: 221 NISIPSLWKISEIGQFADVKGGKRLPKGESLIAENTGFPYIRAGQLKNGTVLPEGQLYLE 280 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVC----GLLKKLQHQNLLYPDKLIRARLTK 327 E + ++R+ + GDL T VG C G++ PD A LT+ Sbjct: 281 EYIQKSISRYTVSSGDLYIT-------IVGACIGDAGII----------PDVYNNANLTE 323 Query: 328 DA-----LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK----DIKSQVVLLPPVKEQ 378 +A L E I F S R++ + + + + G GK IKS ++LPP++EQ Sbjct: 324 NAAKICNLNENIFNRFLSLWLRSSYLQDIINSEIKSGAQGKLALARIKSLPLILPPLQEQ 383 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGEN 438 EIVRRVEQLFAYADTIEKQVNNAL RVN+LTQSILAKAFRGELTAQWRAENP+LISGEN Sbjct: 384 HEIVRRVEQLFAYADTIEKQVNNALTRVNSLTQSILAKAFRGELTAQWRAENPELISGEN 443 Query: 439 SAAALLEKIKAERAASGGKKASRKKS 464 SAAALLEKIKAERAASGGKK SRKK+ Sbjct: 444 SAAALLEKIKAERAASGGKKTSRKKA 469 >UniRef50_D0KMA1 Restriction modification system DNA specificity domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KMA1_PECWW Length = 493 Score = 254 bits (650), Expect = 4e-66, Method: Compositional matrix adjust. 
Identities = 210/532 (39%), Positives = 272/532 (51%), Gaps = 108/532 (20%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKE--QAINYLKDDYLPLIRANNIQNGKFDTTD 58 MS GKLPEGW + V L G + + I Y P+ +N I GK Sbjct: 2 MSVGKLPEGWKNIHLGDVIELKYGKSLAAQVRDGIGY------PVFGSNGIV-GKHS--- 51 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 +P L+K+S +I GS VV KS P + ++ +P F Sbjct: 52 ---IP--LIKQSG-------LIVGRKGSYGVVQKSVEPFFPIDTTYYIDELFNQPINFWF 99 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 +++ + L R S+ G N ++ +++L +I +PPL EQKIIAEKLDTLLAQ Sbjct: 100 Y-YLSFLPLTKLNR---STTIPGLNRDD----AYNL-SINLPPLVEQKIIAEKLDTLLAQ 150 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT-------------------------EKW 213 VDSTKAR EQIP+ILKRFRQAVL A+ G+LT E W Sbjct: 151 VDSTKARLEQIPKILKRFRQAVLASALRGELTKKWRIDNKTGQDISSFKASVKKYRFESW 210 Query: 214 ----------RNFEPQHSVFKKLNFESILT--------------ELRNGL---------- 239 + +P++ +KK E+I++ E +GL Sbjct: 211 VKEQEQKFINKGKQPRNDNWKKKYQEAIISQDISDKDIPDGWLFEPLDGLVYISARIGWK 270 Query: 240 SSKPNESGVGHPI-LRISSVRAGHVDQNDIRFLECSESELNRH---KLQDGDLLFTRYNG 295 K +E V P+ L + S+ G + N + SE + KLQ+ D+L + Sbjct: 271 GLKASEYTVKGPLFLSVHSLNYGK-EANLEQAYHISEHRYDESPEIKLQNNDILLCKDGA 329 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK--- 352 +G ++K L + L+ R +PEY+ F S P M N VK Sbjct: 330 G---IGKLSIVKNLNEPATI-NSSLLLIRGGDFFVPEYLFYFLSGPE----MQNLVKERM 381 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 T S + +D+K V+ +PP+ EQ EIVRRVEQLFAYADTIEKQVN AL+RVNNLTQS Sbjct: 382 TGSAVPHLFQRDVKEFVLEVPPLNEQHEIVRRVEQLFAYADTIEKQVNTALSRVNNLTQS 441 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 ILAKAFRGELTAQWR ENPDLISGENSAA LLEKIKAERAAS GKKA RKK+ Sbjct: 442 ILAKAFRGELTAQWREENPDLISGENSAAVLLEKIKAERAASVGKKAPRKKA 493 >UniRef50_P06991 Type-1 restriction enzyme EcoDI specificity protein n=1 Tax=Escherichia coli RepID=T1SD_ECOLX Length = 444 Score = 252 bits (643), Expect = 2e-65, Method: Compositional matrix adjust. 
Identities = 169/361 (46%), Positives = 211/361 (58%), Gaps = 51/361 (14%) Query: 134 KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQIL 193 KI+ G+ I I+ I++ +P +EQ +IAEKLDTLLAQV+STKAR EQIPQIL Sbjct: 105 KITENGRGSTIPYIRKGDITDISVALPSPSEQTLIAEKLDTLLAQVESTKARLEQIPQIL 164 Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG-LSSKPNE------- 245 KRFRQAVL A+NG+LT++WR+ + F ++ L + RN L S PN Sbjct: 165 KRFRQAVLTFAMNGELTKEWRSQNNNPAFFPAE--KNSLKQFRNKELPSIPNNWSWMRFD 222 Query: 246 ------SGVGHPILRISSVRAGHVDQNDIRFLECSESELN----------RHKLQDGDLL 289 S + P+ +++ H+ N I S +H+ G ++ Sbjct: 223 QVADIASKLKSPLDYPNTI---HLAPNHIESWTGKASGYQTILEDGVTSAKHEFYTGQII 279 Query: 290 FTRYNGSL------EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 +++ L F G+C +YP I +++ L ++ + A Sbjct: 280 YSKIRPYLCKVTIATFDGMCSAD--------MYP---INSKIDTHFLFRWMLTNTFTDWA 328 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 NA V I+ KD+ V PP+ EQ EIVRRVEQLFAYADTIEKQVNNAL Sbjct: 329 SNAESRTV-----LPKINQKDLSEIPVPTPPLPEQHEIVRRVEQLFAYADTIEKQVNNAL 383 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK Sbjct: 384 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 443 Query: 464 S 464 S Sbjct: 444 S 444 Score = 44.7 bits (104), Expect = 0.007, Method: Compositional matrix adjust. 
Identities = 32/130 (24%), Positives = 52/130 (40%), Gaps = 13/130 (10%) Query: 99 PFEC-----SFGAFCGV-LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPA-- 150 P+ C +F C + P I S HF + N + ++ A + P Sbjct: 285 PYLCKVTIATFDGMCSADMYP---INSKIDTHFLFRWMLTNTFTDWASNAESRTVLPKIN 341 Query: 151 --SFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGK 208 I +P PPL EQ I +++ L A D+ + + + Q++L A G+ Sbjct: 342 QKDLSEIPVPTPPLPEQHEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 401 Query: 209 LTEKWRNFEP 218 LT +WR P Sbjct: 402 LTAQWRAENP 411 >UniRef50_B7LQL4 Specificity determinant for hsdM and hsdR (Modular protein) n=2 Tax=Escherichia RepID=B7LQL4_ESCF3 Length = 502 Score = 250 bits (638), Expect = 1e-64, Method: Compositional matrix adjust. Identities = 199/527 (37%), Positives = 260/527 (49%), Gaps = 88/527 (16%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWV + V + G T + Y + +P I+ ++ Sbjct: 1 MSAGKLPEGWVETNLQNVASWGSGGTPSRNHDEYY--NGNIPWIKTGDLGPKIITNASEY 58 Query: 61 FVPKNLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIF 118 + S K P+ + IAM + +GK++ L + + C V P E + Sbjct: 59 ITDAGVQNSSAKFFPKGSVAIAMYGAT---IGKTSI--LGIDATTNQACAVGTPLEGITS 113 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F+ +F + +N G NI I +PPLAEQKII EKLDTLLAQ Sbjct: 114 TLFLYYFLLNE--KNAFIKKGKGGAQPNISQTVIKEHIIYLPPLAEQKIITEKLDTLLAQ 171 Query: 179 VDSTKARFEQIPQILK-------------------------------------RFRQAVL 201 VDSTKAR EQIPQILK ++R+A L Sbjct: 172 VDSTKARLEQIPQILKRFRQAVLERAVNGKLTECWRDCVGELTSAEEIITEIKKYRKASL 231 Query: 202 GGAVNGKLTEKWRNFE---------PQHSVFKKLNFESILTELRNGLSSKPNESGV---G 249 + TE R P+ ++ K + + L + + + G Sbjct: 232 STEGSSASTESKRQIAKIEKHCFKVPKINLPKGWVWTTFLQSMEKVVDCHNKTAPYVDQG 291 Query: 250 HPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLLFTRYN--GSLEFVGVCG 304 ++R +R G + ++ ++++ +++ L R + GD++FTR G V Sbjct: 292 IHLIRTPDIRNGVISLDNTKYID-NDTYLYWSKRCPPRSGDIIFTREAPMGEAGIVPENT 350 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIE-------IFFSSPSARNAMMNCVKTTSGQ 357 ++ Q LL P 
+PEYI I SS R M +G Sbjct: 351 IICMGQRMMLLRP------------IPEYIHNKYVLLNILSSSFQTR---MISQAIGTGV 395 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 K + D++S LPP++EQ EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA Sbjct: 396 KHLRVADVESLTYPLPPIEEQHEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 455 Query: 418 FRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 FRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 456 FRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 502 >UniRef50_A3JH04 Specificity determinant for hsdM and hsdR n=1 Tax=Marinobacter sp. ELB17 RepID=A3JH04_9ALTE Length = 479 Score = 226 bits (576), Expect = 1e-57, Method: Compositional matrix adjust. Identities = 154/410 (37%), Positives = 219/410 (53%), Gaps = 81/410 (19%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIA 169 +L+PE +F F A++ SL +I + + +K F+ + PLAEQK IA Sbjct: 91 ILQPEPYLFPRF-AYYQLRSL---EIPNKGYSRHFKFLKELKFE-----VAPLAEQKTIA 141 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL--- 226 KLDTLLAQV++TKAR E+IP ILKRFRQ+VL AV+G+LTE+WRN S KKL Sbjct: 142 VKLDTLLAQVENTKARLERIPTILKRFRQSVLAAAVSGRLTEEWRNNRTTKSSPKKLLNH 201 Query: 227 -------------------------------------------NFESILTELRNGLSSKP 243 E++ T++ +G+ KP Sbjct: 202 FEELRQIAVQDENLRTGKKTKYKPVTIDTYGTPGDLPNSWYWIPVEALATKVTDGVHKKP 261 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECS-------ESELNRHKLQDGDLLFTRYNGS 296 G P + + ++ G N I F E + E R + GD+L ++ Sbjct: 262 TYISNGVPFITVKNLTKG----NGISFTETNYISTHDHEEFCKRTNPEKGDILISKD--- 314 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA---LPEYIEIFFSSPSARNAMMNCVKT 353 G G++++++ + + L K A + Y+E+ F S + M+ Sbjct: 315 ----GTLGVVRQIRTDAIF--SIFVSVALVKPADRSMSNYLELAFQSSVVQGQMIGV--- 365 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 +G + I D++ ++ +PP++EQ EIV +V+QLFAYA+ +E+QVNNALARVN LTQSI Sbjct: 366 GTGLQHIHLIDLRKDLIPVPPLEEQIEIVHQVDQLFAYAERVEQQVNNALARVNKLTQSI 425 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 LAKAFRGELT QWR 
+NP+LISGENSAAALLE+IK ERAA A+RK+ Sbjct: 426 LAKAFRGELTEQWRKDNPNLISGENSAAALLERIKVERAAMKPTNAARKR 475 Score = 60.5 bits (145), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 55/223 (24%), Positives = 101/223 (45%), Gaps = 18/223 (8%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDLV 60 + G LP W PV + T + +KK I+ + +P I N+ G T+ Sbjct: 233 TPGDLPNSWYWIPVEALATKVTDGVHKKPTYIS----NGVPFITVKNLTKGNGISFTETN 288 Query: 61 FVPKNLVKE-SQKISPE--DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGV--LRPEK 115 ++ + +E ++ +PE DI+I+ G+ VV + + + F F V ++P Sbjct: 289 YISTHDHEEFCKRTNPEKGDILIS-KDGTLGVV-----RQIRTDAIFSIFVSVALVKPAD 342 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 S ++ +SS+ + ++ + G ++ DLI P+PPL EQ I ++D L Sbjct: 343 RSMSNYLELAFQSSVVQGQMIGVGTGLQHIHLIDLRKDLI--PVPPLEEQIEIVHQVDQL 400 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 A + + + + + Q++L A G+LTE+WR P Sbjct: 401 FAYAERVEQQVNNALARVNKLTQSILAKAFRGELTEQWRKDNP 443 >UniRef50_A1TWL9 Restriction modification system DNA specificity domain n=2 Tax=Gammaproteobacteria RepID=A1TWL9_MARAV Length = 435 Score = 214 bits (545), Expect = 5e-54, Method: Compositional matrix adjust. 
Identities = 172/477 (36%), Positives = 248/477 (51%), Gaps = 60/477 (12%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 M + LP W +A + +++ I G T +K L+R +IQN +T Sbjct: 1 MQSQLLPANWQLANLGEISSDISYGYTASATSEPTGVK-----LLRITDIQN---NTVSW 52 Query: 60 VFVPKNLVKESQ----KISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFCGVLRP 113 VP ++ + ++ P D+V A + + VGKS +P E + ++ +R Sbjct: 53 PNVPNCKIEPEKVGKYRLKPSDLVFARTGAT---VGKSYLLKGEIP-ESVYASYLIRVRC 108 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 + + F+A++ +S Y +I+ SAG N+ +++P+PPLAEQK+IA+KLD Sbjct: 109 LEGVSIEFLANYFQSPYYWRQITDFSAGIGQPNVNGTKLKNLSVPVPPLAEQKVIADKLD 168 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 TLLAQV++TKAR E+IPQILKRFRQ+VL AV+G+L + Q KL L Sbjct: 169 TLLAQVENTKARLERIPQILKRFRQSVLAAAVSGRL------IDAQPESIAKLEE---LV 219 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRA--GHVDQ-NDI----RFLECSESELNRHKLQDG 286 ++ NG + KP + + I G VD ND R+L E N + Sbjct: 220 DIENG-ARKPVSATIRKTIQGTIPYYGATGIVDYLNDYTHEGRYLLVGEDGANLLS-KSK 277 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 DL F G + +LK+ NL ++++I +S Sbjct: 278 DLAFI-VEGKMWVNNHAHVLKERPGVNL-----------------DFVKIAINSLDLTPW 319 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + +K + G I + + EQ EIVRRV+QLF++AD IE+Q ++ALARV Sbjct: 320 ITGSAQPKLTKKSLCGLPITNFTL-----DEQTEIVRRVDQLFSHADRIEQQASSALARV 374 Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 NNLTQSILAKAFRGELT QWR +NP+LI GENSA ALLE+IKAERAA K +R K Sbjct: 375 NNLTQSILAKAFRGELTEQWRRDNPELIGGENSAEALLERIKAERAAMKPVKRTRNK 431 >UniRef50_B5IN27 HsdS, type I site-specific deoxyribonuclease n=1 Tax=Cyanobium sp. PCC 7001 RepID=B5IN27_9CHRO Length = 361 Score = 213 bits (542), Expect = 1e-53, Method: Compositional matrix adjust. 
Identities = 114/241 (47%), Positives = 156/241 (64%), Gaps = 2/241 (0%) Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP 251 ILKRFRQAVL A +G+LT +WR S+ +K+ ++ E+RNGLS KP+ + G Sbjct: 4 ILKRFRQAVLAAATSGELTREWREARGIESLPRKIPLGEVIHEMRNGLSPKPSLNPPGVK 63 Query: 252 ILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 ILRI +VR G +D D R+LE S+ +L +L+ GDL+FTRYNG+LEFVG C + Sbjct: 64 ILRIGAVRPGTIDWTDHRYLELSDKDLAAFRLEAGDLIFTRYNGTLEFVGACANATSIPD 123 Query: 312 QNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 +YPDKLIR R T ALP Y+EI FSS R+ + VK+++GQKGISG D+K+ Sbjct: 124 V-YVYPDKLIRVRCDTSRALPAYVEISFSSVEIRDHIEGLVKSSAGQKGISGTDLKNIFF 182 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 LP ++EQ EIV +V+ LF AD +E +++ A V+ LT ++LAKAFRG L Q + Sbjct: 183 PLPSIEEQIEIVHQVQALFTLADQLESRLSAARKLVDRLTPALLAKAFRGALVPQDPNDE 242 Query: 431 P 431 P Sbjct: 243 P 243 >UniRef50_A4BLE7 Type I restriction enzyme StySPI specificity protein n=2 Tax=Proteobacteria RepID=A4BLE7_9GAMM Length = 496 Score = 200 bits (509), Expect = 9e-50, Method: Compositional matrix adjust. 
Identities = 141/476 (29%), Positives = 227/476 (47%), Gaps = 51/476 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M LPE W V+ + LIRGVTYKK +A + + PL+RANNI NG+ + DLV Sbjct: 1 MENRALPENWARCRVTELAQLIRGVTYKKSEASKESQPGFAPLLRANNI-NGRINHEDLV 59 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 +V + + Q + D++IAMSSGS +VGK+A +FG+FCG LRP I Sbjct: 60 YVREARISNEQWLKESDVLIAMSSGSIGLVGKAAQLRKVKGETFGSFCGALRPTSEIDCH 119 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F F ++ YR +S + G+NINN+K ++ P+PP EQ+ I EK++TL +++D Sbjct: 120 FFGWFFQTRTYRECVSGDAKGSNINNLKRDHILHVDFPLPPANEQRRIVEKIETLFSRLD 179 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-----------------------NFE 217 + + ++L R+RQ+VL AV G+LT WR ++E Sbjct: 180 KGEEALRDVQKLLSRYRQSVLKAAVTGQLTADWRAENAHRLEHGRDLLARILQTRRESWE 239 Query: 218 -------------------PQHSVFKKLNFESILTELRNGLS---SKPNESGVGHPILRI 255 P V+ L + LT ++ G++ + +++ V P LR+ Sbjct: 240 GRGKYKEPIAPSTSGLPDLPDGWVWASL---AQLTHIKGGVTVDKKRESKNPVTVPYLRV 296 Query: 256 SSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLL 315 ++V+ GH+D +I+ + + + + L+ GD+L G + +G G + Q + Sbjct: 297 ANVQNGHIDLTEIKEITVNRDKAEQTLLKAGDILLNE-GGDRDKLGR-GWVWDGQIAPCI 354 Query: 316 YPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPV 375 + + + RAR + ++++ + M K + IS I + LP Sbjct: 355 HQNHVFRARPVIPEISSRFVSYYANAFGQGFFMQKGKQSVNLASISLTAISGFPIALPSA 414 Query: 376 KEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 EQ EIV R+E+ T+ + L R L QSIL AF G L Q ++ P Sbjct: 415 DEQREIVGRLEEKLIEVATVAEWCKTELTRSAALRQSILKDAFTGRLVPQNPSDEP 470 >UniRef50_D1UP80 Restriction modification system DNA specificity domain protein n=1 Tax=Burkholderia sp. CCGE1001 RepID=D1UP80_9BURK Length = 443 Score = 198 bits (504), Expect = 3e-49, Method: Compositional matrix adjust. 
Identities = 116/345 (33%), Positives = 195/345 (56%), Gaps = 17/345 (4%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ H+ + + + +S G N+ + + +PPLAEQK IA+KLD++L++V+ Sbjct: 110 YVFHWLRGPRFLSYAIGVSHGLNMPRLGTDAGRSAPFILPPLAEQKRIADKLDSVLSRVE 169 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + AR ++P IL R R+A L + G+ + ++ F S++ +R G + Sbjct: 170 AACARMGRVPTILTRLRRAALVATLLGQ--------DGDAKPTPRIAFGSLINSIRGGTT 221 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + P +PILR SSVR G +D D+R+L +S ++ +++ D+LFTR NG++ +V Sbjct: 222 AVPQSDKTAYPILRSSSVRQGRIDFEDVRYLTSEQSGEEKNFIRENDVLFTRLNGNVNYV 281 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G C ++ + YPD+L ARL + +P+Y F+ P R + K+++G K I Sbjct: 282 GNCAVVPSVSLNKYQYPDRLYCARLKETIVPKYCAYAFALPDIRKEIERRAKSSAGHKRI 341 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 S +DIK + LPPV EQ +V ++E++FA D +EK ++ A ++LT ++LAKAFRG Sbjct: 342 SIQDIKEMEIPLPPVAEQLRMVNQIERIFATCDRLEKTLDEAKIVADHLTPALLAKAFRG 401 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK-KASRKKS 464 EL Q +P+ + SA LLE++KA + G K K SR+ + Sbjct: 402 ELVGQ----DPN----DESAEQLLERLKALTTSLGTKGKRSRQSA 438 >UniRef50_A3EKX4 Type I restriction modification DNA specificity domain protein n=1 Tax=Vibrio cholerae V51 RepID=A3EKX4_VIBCH Length = 466 Score = 189 bits (481), Expect = 1e-46, Method: Compositional matrix adjust. 
Identities = 157/492 (31%), Positives = 239/492 (48%), Gaps = 78/492 (15%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKD-DYLPLIRANNIQNGKFDTTDLVFVP 63 +LP+GWV +S L G +K +Y +D D++ IR N+Q+G ++ +V Sbjct: 3 QLPKGWVCTSISQCFELKNGYAFKSS---DYTEDGDFV--IRIGNVQDGHIILSNPAYVA 57 Query: 64 -KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPE-KLIFS 119 + L +S K++ DI+I+++ G+ +G + +HLP C V E + +F Sbjct: 58 AEKLGADSFKLNEGDILISLT-GNVGRIGMVSKEHLPAVLNQRVAKICVVNSVEIRWLF- 115 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + ++ L++ + SL+ GA NI + +PPLAEQ I EKLD +LAQV Sbjct: 116 ----YLLRTRLFQQHVLSLAKGAAQLNISTKDIQSFDFALPPLAEQTRIVEKLDEVLAQV 171 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF---EPQHSVFKKLNFES-ILTEL 235 D+ KAR + IP ILKRFRQ+VL AV+GKLTE+WR +P H K+ +++ + Sbjct: 172 DTIKARLDGIPAILKRFRQSVLAAAVSGKLTEEWRQLNPNQPSHPKVGKVKYKTDLFDSA 231 Query: 236 RNGLSSKPNE------------------------SGVGHPILRISSVR--AGHVDQNDIR 269 L P E + G LR+S+VR +D +D++ Sbjct: 232 SKSLPELPPEWLVIPAAHLLEYVTSGSRGWANYYASSGALFLRMSNVRYDTTKLDLSDLQ 291 Query: 270 FLECSES-ELNRHKLQDGDLLFT---------RYNGSLEFVGVCGLLKKLQHQNLLYPDK 319 ++ E+ E R +++ DL+ + R + +E V QH L P Sbjct: 292 YVNLPENVEGKRSLVKENDLVISITADVGRVARVDSEIEEAYVN------QHLALARPAS 345 Query: 320 LIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQA 379 I A E++ +S + + +K + + G+ DI+S + P + EQ Sbjct: 346 HIDA--------EFLAKCIASVNIGIKQVQALKRGATKAGLGLDDIRSMAIPFPHLAEQK 397 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENS 439 EIVR V+Q FA+ADTIE V A ARV+ LTQSILAKAFRGEL Q + P Sbjct: 398 EIVRLVDQYFAFADTIEALVKKAQARVDKLTQSILAKAFRGELVPQDPNDEP-------- 449 Query: 440 AAALLEKIKAER 451 A LLE+I R Sbjct: 450 ADKLLERIATAR 461 >UniRef50_D1C5W4 Restriction modification system DNA specificity domain protein n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C5W4_SPHTD Length = 532 Score = 178 bits (451), Expect = 4e-43, Method: Compositional matrix adjust. 
Identities = 143/490 (29%), Positives = 230/490 (46%), Gaps = 93/490 (18%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK--FDTTDLVFVP 63 LP GW A + I G+ ++K D+ LP+IR N+ + F+ T P Sbjct: 9 LPPGWTWATIRDTGEYINGLAFRKSD----WGDEGLPIIRIQNLTDPSKPFNRTSRQVDP 64 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSA----HQHLPFECSFGAFCGVLRPEKLIFS 119 +V DI+++ S+ + + +QH+ V+ +L+ S Sbjct: 65 VYIVHRG------DILLSWSATLDAFTWRGETGVLNQHI---------FKVVPDNRLVHS 109 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQK-IIAE------KL 172 ++ H + ++ K SS G+ + +I F +P+ PLAEQ+ I+AE +L Sbjct: 110 PYLYHLLRHAIDLLKQSSHLHGSTMKHINRGPFLSFQVPLAPLAEQRRIVAEIEKHFTRL 169 Query: 173 DTLLAQVDSTKAR-------------------------------FEQIPQILKRF---RQ 198 D +A ++ +A +E Q+L+R R+ Sbjct: 170 DAAVAALERARANLKRYRAAVLKAACEGRLVPTEAELARAEGRDYETGEQLLQRILQERR 229 Query: 199 AVLGGAVNGKL--------TEKWRNFE--------------PQHSVFKKLNFESILTELR 236 A KL ++W+ P+ V+ +L+ +L LR Sbjct: 230 AKWEAEELAKLRAKGKEPKDDRWKARYKEPAAPDTSDLPELPEGWVWARLD--QLLGSLR 287 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 NG+S KP+ S G PILRI++VR V+ +IR+L S + + L GDLLFTRYNGS Sbjct: 288 NGISKKPD-SESGTPILRINAVRPLSVNMEEIRYLSGSVDQYADYVLCQGDLLFTRYNGS 346 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTS 355 E VGVCG ++ + + ++YPDKLIRARL L +++I + +R + ++TT+ Sbjct: 347 PELVGVCGAVRAVDRK-VVYPDKLIRARLASHLCLSSFVQIVLNVGLSREFIARRIRTTA 405 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 GQ G+SG DI+S + LPP+ EQ IV VE+ + + +E+Q+ L R L Q+IL Sbjct: 406 GQSGVSGSDIRSVPLPLPPLAEQRRIVAEVERRLSVVEELERQIEANLKRAERLRQAILK 465 Query: 416 KAFRGELTAQ 425 +AF G+L Q Sbjct: 466 RAFAGKLVPQ 475 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 55/233 (23%), Positives = 104/233 (44%), Gaps = 39/233 (16%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGWV A + + +R KK + + P++R N ++ + ++ ++ Sbjct: 269 ELPEGWVWARLDQLLGSLRNGISKKPDS-----ESGTPILRINAVRPLSVNMEEIRYLSG 323 Query: 65 NLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR--------PEK 115 ++ + + + + D++ +GS +VG CG +R P+K Sbjct: 324 SVDQYADYVLCQGDLLFTRYNGSPELVG---------------VCGAVRAVDRKVVYPDK 368 Query: 116 LIFSGFIAHFTKSS---------LYRNKISS-LSAGANINNIKPASFDLINIPIPPLAEQ 165 LI + +H SS L R I+ + A + + + + +P+PPLAEQ Sbjct: 369 LIRARLASHLCLSSFVQIVLNVGLSREFIARRIRTTAGQSGVSGSDIRSVPLPLPPLAEQ 428 Query: 166 KIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 + I +++ L+ V+ + + E + +R RQA+L A GKL + N EP Sbjct: 429 RRIVAEVERRLSVVEELERQIEANLKRAERLRQAILKRAFAGKLVPQDPNDEP 481 >UniRef50_A0KZG7 Restriction modification system DNA specificity domain n=6 Tax=Gammaproteobacteria RepID=A0KZG7_SHESA Length = 587 Score = 178 bits (451), Expect = 5e-43, Method: Compositional matrix adjust. 
Identities = 141/384 (36%), Positives = 198/384 (51%), Gaps = 55/384 (14%) Query: 113 PEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 P+K + F+ F ++ ++ S + +I+ + I + +PPLAEQ +IA+KL Sbjct: 115 PDKSDCNRFLYQFLRAI----DLTETSRSTTVPSIRKGDIEDIELYLPPLAEQIVIADKL 170 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV-------------NGKLTEKWRNFEPQ 219 DTLLAQV++TKAR E+IP+ILK FRQ+VL AV NG E + Sbjct: 171 DTLLAQVETTKARLERIPEILKSFRQSVLSAAVSGKLTQEWRESHGNGTGEEVVKADAIN 230 Query: 220 HSVF---------KKLNFES-ILTE--------------------LRNGLSSKPNESGVG 249 SV KK ES I TE + G + +S G Sbjct: 231 KSVLLNENPALKKKKSTIESQIDTEYIFDLPESWGFTTWGKISEWITYGFTKPMPKSDSG 290 Query: 250 HPILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 +L V+ V+ ND S +S ++ + GDLL T+ +GS+ + + Sbjct: 291 VKLLTAKDVQYFDVNINDAGLTTSSAFQSLSDKDRPIKGDLLITK-DGSIGRAALVRTDE 349 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + L + KD Y+E +S + + + + + Q +S D Sbjct: 350 PFCINQSVAVCWLRSTSMNKD----YLEFLANSEFTQRFVKDKAQGMAIQH-LSIIDYAK 404 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 + +P ++EQ EIVRRVE+LFA+AD+IE++ ALARVNNLTQSILAKAFRGELTA WR Sbjct: 405 CPLPVPSLEEQTEIVRRVEELFAFADSIEQKATAALARVNNLTQSILAKAFRGELTADWR 464 Query: 428 AENPDLISGENSAAALLEKIKAER 451 A NP+LISG+NSAAALLEKIK ER Sbjct: 465 AANPELISGDNSAAALLEKIKVER 488 >UniRef50_C9P2L6 HsdS type I site-specific deoxyribonuclease n=1 Tax=Vibrio metschnikovii CIP 69.14 RepID=C9P2L6_VIBME Length = 585 Score = 177 bits (450), Expect = 5e-43, Method: Compositional matrix adjust. 
Identities = 133/390 (34%), Positives = 188/390 (48%), Gaps = 60/390 (15%) Query: 108 CGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKI 167 CGV S + ++ +S R+ SL G I A + +PPLAEQK Sbjct: 113 CGV-------DSDYAYYYLRS--IRDLAESLGTGTTFKEISGAVAKTLPFLLPPLAEQKA 163 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE---------- 217 IA+KLD +LAQV +TK R E+IP ILK FRQ++L AV+GKLT WR Sbjct: 164 IADKLDLMLAQVATTKVRLERIPNILKTFRQSILTAAVSGKLTGNWRASSLKSAWTVREL 223 Query: 218 PQHS--------------VFKKLNFE------SILTELRNG--LSSKPNESGVGHP---- 251 P+++ K+ F S+ + LR G + K G HP Sbjct: 224 PENNKTRRGLPDSVALPDALKESRFPESWSILSVASLLRKGVIIDLKDGNHGSNHPKSLE 283 Query: 252 --------ILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 I G +D + + E + + + + + G++ VG+ Sbjct: 284 FTEKGLPFITAAQMSDNGKIDYDGAPKVSGKPLEKLKVGFSEAEDVIYSHKGTIGKVGIA 343 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 ++L P K L +Y + S +A + ++ +K+ + + + Sbjct: 344 ------DRASVLNPQTTYIRLNQKYVLNQYYALMLKS-NAFTSQVDAIKSQTTRDFVPIT 396 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 S ++PP+ EQ EIVRRVE+LFA AD IE++VN A VNNL QSI KAFRG+LT Sbjct: 397 AHYSLFAIIPPIDEQVEIVRRVEELFACADNIEQKVNMATELVNNLPQSIFTKAFRGDLT 456 Query: 424 AQWRAENPDLISGENSAAALLEKIKAERAA 453 A WR+ NP+LISG+NSA ALLEKIKAER A Sbjct: 457 ADWRSANPELISGKNSAKALLEKIKAERGA 486 >UniRef50_C9NQJ7 HsdS type I site-specific deoxyribonuclease n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NQJ7_9VIBR Length = 563 Score = 169 bits (428), Expect = 2e-40, Method: Compositional matrix adjust. 
Identities = 137/459 (29%), Positives = 222/459 (48%), Gaps = 36/459 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY---------LPLIRANNIQNGKFD 55 KLP WV + + ++ G T K +N+ + L + I NG+ D Sbjct: 3 KLPFNWVETEIGNLALVVSGGTPKAGDELNFAEPGAGIAWVTPADLSGYKQKEIANGRRD 62 Query: 56 TTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK 115 + PK L S K+ P+ ++ S V A + F +F Sbjct: 63 LS-----PKGLDSSSAKLMPKGTLLFSSRAPIGYVA-IAENEISTNQGFKSFIFT----D 112 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + S + ++ KS ++ S +G + A + + PL EQ IA+KLD++ Sbjct: 113 HVNSTYAYYYLKS--IKDLAESWGSGTTFKELSGAVAKKLPFRLAPLNEQIRIADKLDSI 170 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 LA+VD + R ++IP ILKRFRQ+VL A +G+LT +WR E + + ++ +S+ Sbjct: 171 LAKVDHAQERLDKIPDILKRFRQSVLAAATSGELTREWR--EGKEHQWPRVQLKSVGRGF 228 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 G S+K G P+LR+ +++ G + +++ + E E++++ L+ GD+LF R N Sbjct: 229 NYGSSAKSKPEG-EVPVLRMGNLQGGQLHWDNLVYTSDKE-EIDKYLLEKGDVLFNRTN- 285 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 S E VG + + Q +Y LIR + ++ E++ I +SP AR+ Sbjct: 286 SPELVGKTSIYRG--EQKAIYAGYLIRIKGSEHLDTEFLNIQLNSPHARDYCWQVKTDGV 343 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q I+ K +++ LP + EQ EIVRRV +LF+ AD E Q + +N LTQSIL Sbjct: 344 SQSNINAKKLQAYEFDLPEIDEQLEIVRRVSELFSRADLFEYQYLASKKYLNRLTQSILV 403 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 KAF G+L Q E D SA+ LL+ I++E A+ Sbjct: 404 KAFNGQLVPQ---EPTD-----ESASELLKLIESEMVAN 434 >UniRef50_B8GGK0 Restriction modification system DNA specificity domain protein n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GGK0_METPE Length = 471 Score = 169 bits (428), Expect = 2e-40, Method: Compositional matrix adjust. 
Identities = 113/353 (32%), Positives = 178/353 (50%), Gaps = 42/353 (11%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN-IPIPPLAEQKII 168 V+R I ++ ++ + +RN+ S G+ P F + IP+PPLAEQ+ I Sbjct: 121 VMRSRGEILPEYLFYYIRQKSFRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRI 180 Query: 169 AEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-------------- 214 +++ LL+ VD+ R ++P I+KRFRQAVL A +G+LTE+WR Sbjct: 181 VARIEALLSHVDAAGDRLSRVPLIMKRFRQAVLAAACSGRLTEEWREDKDNFEDPKLLLQ 240 Query: 215 ---NFEPQHSVFK-----KLNFESILTELRN---------------GLSSKPNESGVG-- 249 N+ QH + K K+N E+ N G+ +P + Sbjct: 241 DIQNYRLQHGINKIKIDSKVNITENPIEIPNTWIWSTIEKIADISGGIQKQPMRAPQRNF 300 Query: 250 HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 +P LR+++V G +D ++I+ +E EL R+ L+ D+L NGS +G + Sbjct: 301 YPYLRVANVLRGSLDLHEIKNMELFAGELERYHLELNDILIVEGNGSFSEIGRSAIWNG- 359 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 + +N ++ + +IR R+ K LP+Y+ ++++SP TTSG +S K I Sbjct: 360 EIENCVHQNHIIRVRVRK-FLPQYVNLYWNSPLGSELSSGAAVTTSGLYTLSTKKIAQLP 418 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + LPP+ EQ EIVRRV LF AD IE++V A R LTQ+++ KAF G L Sbjct: 419 IPLPPISEQHEIVRRVGLLFERADAIEREVVAAGRRCERLTQAVMIKAFSGRL 471 Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 42/157 (26%), Positives = 74/157 (47%), Gaps = 6/157 (3%) Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 +DGD++ + +E G +++ +++ + R + LPEY+ + S Sbjct: 83 FRDGDVIMAKITPCMEN-GKAAIVRGMKNGIGFGSTEFHVMRSRGEILPEYLFYYIRQKS 141 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 RN + + GQK + IK V+ LPP+ EQ IV R+E L ++ D +++ Sbjct: 142 FRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRIVARIEALLSHVDAAGDRLSRV 201 Query: 403 LARVNNLTQSILAKAFRGELTAQWRA-----ENPDLI 434 + Q++LA A G LT +WR E+P L+ Sbjct: 202 PLIMKRFRQAVLAAACSGRLTEEWREDKDNFEDPKLL 238 Score = 48.1 bits (113), Expect = 8e-04, Method: Compositional matrix adjust. 
Identities = 50/218 (22%), Positives = 96/218 (44%), Gaps = 27/218 (12%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P W+ + + + + G+ ++Q + + ++ P +R N+ G D ++ K Sbjct: 268 EIPNTWIWSTIEKIADISGGI---QKQPMRAPQRNFYPYLRVANVLRGSLDLHEI----K 320 Query: 65 NLVK-----ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIF 118 N+ E + DI+I +GS S +G+SA + E C +R K Sbjct: 321 NMELFAGELERYHLELNDILIVEGNGSFSEIGRSAIWNGEIENCVHQNHIIRVRVRK--- 377 Query: 119 SGFIAHFTKSSLYRNKI--SSLSAGANIN-----NIKPASFDLINIPIPPLAEQKIIAEK 171 F+ + +LY N S LS+GA + + + IP+PP++EQ I + Sbjct: 378 --FLPQYV--NLYWNSPLGSELSSGAAVTTSGLYTLSTKKIAQLPIPLPPISEQHEIVRR 433 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + L + D+ + + +R QAV+ A +G+L Sbjct: 434 VGLLFERADAIEREVVAAGRRCERLTQAVMIKAFSGRL 471 >UniRef50_A8ZTW4 Restriction modification system DNA specificity domain n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZTW4_DESOH Length = 477 Score = 163 bits (413), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 151/511 (29%), Positives = 244/511 (47%), Gaps = 94/511 (18%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPEGWV AP+ ++ ++ G K + + K P+ AN+I G +D+ Sbjct: 5 LPEGWVAAPLQKISQIVYGKGLPKNK---FNKQGLYPVFGANSII-GYYDSF-------- 52 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAH 124 L ++ Q ++I+ + + S P +C + V++ P L S + Sbjct: 53 LYEDPQ------VLISCRGANSGTINIS-----PPKCFVTSNSLVVQLPNTLHQSFKYLY 101 Query: 125 FTKSSLYRNKISSLSAG--ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + S + KI + +A I+N+K SF +P+PP EQK I +LD ++ ++D Sbjct: 102 YALESSDKEKIVTGTAQPQVTIDNLK--SF---CVPLPPFNEQKRIVARLDQIIPRIDKL 156 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP---------QHSVFKKLN------ 227 K R ++IP I+KRFRQ+VL AV G+LTEKWR P Q +++L+ Sbjct: 157 KTRLDKIPTIIKRFRQSVLTAAVTGRLTEKWREDHPDVEGAEATVQSIYYRRLDESQTNQ 216 Query: 228 --------FESILTE--------------------LRNGLSSKPNESGVGHPILRISSVR 259 F + TE + G SSK ++ G P+LR+ +++ Sbjct: 217 
QKNKIEKLFAEVETEDNGLLPETWKYTFLNKICESFQYGTSSKSSKKG-DIPVLRMGNLQ 275 Query: 260 AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK 319 G +D +++ + ++ E+ ++KL+ +LF R N S E VG + L + ++ Sbjct: 276 NGAIDWSNLVY-SSNKKEIEKYKLEKNTVLFNRTN-SPELVGKTAIY--LGERAAIFAGY 331 Query: 320 LIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS-GQKGISGKDIKSQVVLLPPVKEQ 378 LIR Y+ ++ A+ A N KT Q I+ + + + PP++EQ Sbjct: 332 LIRINNMDILDSHYLNYSLNTDYAK-AFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQ 390 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGEN 438 EIVR+VE+ FA AD +E NA ARV+ L +S+LAKAFRGELT Q + P Sbjct: 391 KEIVRQVERSFALADKLEAHYQNARARVDKLARSVLAKAFRGELTPQDPNDEP------- 443 Query: 439 SAAALLEKIKAER-----AASGGKKASRKKS 464 A LLE+I AE+ A +K +++KS Sbjct: 444 -AEKLLERILAEKEKMAAAVKKTRKQAKRKS 473 Score = 63.5 bits (153), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 55/228 (24%), Positives = 100/228 (43%), Gaps = 15/228 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G LPE W ++ + + T K K +P++R N+QNG D ++LV+ Sbjct: 234 GLLPETWKYTFLNKICESFQYGTSSKSS-----KKGDIPVLRMGNLQNGAIDWSNLVYSS 288 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 E K+ ++ V+ + S +VGK+A F + + ++ S ++ Sbjct: 289 NKKEIEKYKLE-KNTVLFNRTNSPELVGKTAIYLGERAAIFAGYLIRINNMDILDSHYLN 347 Query: 124 H-----FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + + K+ R K + G N +NI IP PPL EQK I +++ A Sbjct: 348 YSLNTDYAKAFCNREK----TDGVNQSNINAQKLGRFEIPFPPLEEQKEIVRQVERSFAL 403 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 D +A ++ + + ++VL A G+LT + N EP + +++ Sbjct: 404 ADKLEAHYQNARARVDKLARSVLAKAFRGELTPQDPNDEPAEKLLERI 451 >UniRef50_Q1MKB2 Putative type I restriction enzyme specificity subunit n=2 Tax=Alphaproteobacteria RepID=Q1MKB2_RHIL3 Length = 456 Score = 161 bits (408), Expect = 4e-38, Method: Compositional matrix adjust. 
Identities = 137/475 (28%), Positives = 224/475 (47%), Gaps = 39/475 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GWV A T+ L + L +++P+ ++ + G +V Sbjct: 4 LPKGWVEA---TLEELCQFNPKHDPDVDQSLGVNFVPMPAVDD-ETGAIIDKSVVRPLSE 59 Query: 66 LVKESQKISPEDIVIA-----MSSGSKSVVGKSAHQHLPFECSFGAFCG-----VLRPEK 115 + K + D++ A M +G +V A+ G CG VLR + Sbjct: 60 IWKGYTHFADRDVIFAKITPCMENGKIAVARDLAN---------GMACGSTEFHVLRSKG 110 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASF-DLINIPIPPLAEQKIIAEKLDT 174 + F+ F + YR GA P F + ++P+PPL EQK I KLDT Sbjct: 111 AVEPDFLWRFLRRKNYRQVAEHSMTGAVGQRRVPRQFLETTSLPLPPLNEQKRIVAKLDT 170 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE 234 L A+ + +I ++ RF+QAVL A +G+LT+ WR+ + + ++ L +++ Sbjct: 171 LNAKSARARTELARIEILVSRFKQAVLSKAFSGELTKDWRSGQTTLAPWENLPLSQLVSH 230 Query: 235 -LRNGLSSKPNESGVGHPILRISSVRAGHV--DQNDIRFLECSESELNRHKLQDGDLLFT 291 NG S K + G L++S+ +G + D++ I++L+ + E ++ L D++ Sbjct: 231 GPSNGWSPKADGKVSGLKSLKLSATSSGRLRLDESTIKYLDQTLPEDSKFWLLSDDIVIQ 290 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAM-MN 349 R N SLE +G L + ++PD ++R R+ K P Y+ + +S SAR+ N Sbjct: 291 RAN-SLELLGTTVLFDGPPGE-FIFPDLMMRIRVNDKKTNPRYLATYLNSDSARSYFRAN 348 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + I+G ++ V PP++EQ EIV R+E FA D + + AL V L Sbjct: 349 ATGSAGNMPKINGSTVRETRVPTPPLEEQQEIVHRIESAFAMTDRLAAEAMRALDLVGKL 408 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 ++ILAKAFRGEL Q + P A LLE+I+AER A+ K R+K+ Sbjct: 409 GEAILAKAFRGELVPQDENDEP--------AEKLLERIRAEREAAPEAKRGRRKT 455 >UniRef50_B2SI71 Type I restriction-modification system, S subunit n=1 Tax=Xanthomonas oryzae pv. oryzae PXO99A RepID=B2SI71_XANOP Length = 501 Score = 161 bits (408), Expect = 4e-38, Method: Compositional matrix adjust. 
Identities = 122/395 (30%), Positives = 188/395 (47%), Gaps = 64/395 (16%) Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I GF+ HF ++ ++SLS I +I+ + + I I +PPLAEQK I +KLD LL Sbjct: 122 IDDGFLYHFLRT----QDLASLSRSTTIPSIRKSDVEDITISLPPLAEQKRIVQKLDALL 177 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR---------------------- 214 AQVD+ KAR + +P +LKRFR+A L A++G LT+ WR Sbjct: 178 AQVDTLKARIDAMPALLKRFREATLTSAMSGTLTKDWRIESSQSTAPEAPRMCRQLLANE 237 Query: 215 -----------------------NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP 251 F V+ + + I +++G P + G Sbjct: 238 RERIWRGRGKYKPAVRSGEVDASEFSNLPEVWHRGTLDEITWSVKDGPHFSPKYATDGVR 297 Query: 252 ILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 + ++R G +D + +++ E E R K + D+L+T+ G+ F V + Sbjct: 298 FISGGNIRPGRIDLSTGKYISQELHEELSARCKPEYLDVLYTK-GGTTGFAAVN---RTE 353 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 N+ +++ P ++E +SP A G + + + + V Sbjct: 354 SEFNVWVHVAVLKMLPPSVVDPFFVEFALNSPEC-YAQSQRYTHGVGNQDLGLRRMIKIV 412 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAE 429 + +PP+ EQ EIVRRVEQLFAYAD +E +V A R++ LTQS+LAKAFRGEL Q A Sbjct: 413 LPVPPIGEQREIVRRVEQLFAYADQLEAKVATAKQRIDALTQSLLAKAFRGELVPQDPAA 472 Query: 430 NPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 P A+ LL++I+A+RAA+ K RK + Sbjct: 473 EP--------ASVLLDRIRAQRAATPKPKRGRKAA 499 Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 47/232 (20%), Positives = 96/232 (41%), Gaps = 11/232 (4%) Query: 6 LPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 LPE W + +T ++ G + + A + ++ I NI+ G+ D + ++ + Sbjct: 265 LPEVWHRGTLDEITWSVKDGPHFSPKYATDGVR-----FISGGNIRPGRIDLSTGKYISQ 319 Query: 65 NLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR--PEKLIFSGF 121 L +E S + PE + + + G + G +A E + VL+ P ++ F Sbjct: 320 ELHEELSARCKPEYLDVLYTKGGTT--GFAAVNRTESEFNVWVHVAVLKMLPPSVVDPFF 377 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S + + G ++ I +P+PP+ EQ+ I +++ L A D Sbjct: 378 VEFALNSPECYAQSQRYTHGVGNQDLGLRRMIKIVLPVPPIGEQREIVRRVEQLFAYADQ 437 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 +A+ Q + Q++L A G+L + EP + ++ + T Sbjct: 438 LEAKVATAKQRIDALTQSLLAKAFRGELVPQDPAAEPASVLLDRIRAQRAAT 489 >UniRef50_A3UV36 Type I restriction enzyme specificity protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UV36_VIBSP Length = 496 Score = 158 bits (400), Expect = 3e-37, Method: Compositional matrix adjust. 
Identities = 130/417 (31%), Positives = 205/417 (49%), Gaps = 62/417 (14%) Query: 77 DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKIS 136 D++++M+ + + V K ++ S G VL+P LI S ++ +S + + IS Sbjct: 69 DVLVSMTRPNLNAVAKVPEKYNGQVASTG--FDVLKP-FLIESDWLFSVVRSQPFIDSIS 125 Query: 137 SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRF 196 + GA K + +P+PPLAEQK I EKLD +LAQVD+ KAR + IP +LKRF Sbjct: 126 GTTIGALYPACKTSDIRDYEMPLPPLAEQKRIVEKLDEVLAQVDTIKARLDGIPDLLKRF 185 Query: 197 RQAVLGGAVNGKLTEKWR--------------NFEPQHSVFKKLNFESILTEL------- 235 RQ+VL AV+G LT++WR NF + K ++ +EL Sbjct: 186 RQSVLASAVSGTLTKEWRLTNELTKAEEELKSNFLAKSGKLKLRGKQTNFSELSLITLPD 245 Query: 236 --------------RNGLSSKP--------NESGVGHPILRISSVRAGHVDQNDIRFL-- 271 N + + P + G PI+ + V+ +QN ++ Sbjct: 246 SWTWAQNYKLAKDESNAICAGPFGTIFKAKDFRDEGVPIIFLRHVKEIGFNQNKPNYMDG 305 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-AL 330 + E + + G+LL T+ G C + + ++ PD +++ + +D L Sbjct: 306 DVWEELHQEYSVHGGELLVTKLGDP---PGECCIYPENMGTAMVTPD-VLKMNVDEDIVL 361 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 +Y+ +F+SP + ++ + + + I K + LP ++EQ EIVR V+Q FA Sbjct: 362 RKYLRSYFNSPIS-TEIIEALAFGATRLRIDIAMFKGFPIPLPSMEEQKEIVRLVDQYFA 420 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 +ADTIE QV A A+V+NLTQSILAKAFRGEL +Q ++ P A LLE+I Sbjct: 421 FADTIEAQVKKAQAKVDNLTQSILAKAFRGELVSQDPSDEP--------ADKLLERI 469 Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust. 
Identities = 33/100 (33%), Positives = 52/100 (52%), Gaps = 8/100 (8%) Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 DI+ + LPP+ EQ IV +++++ A DTI+ +++ + QS+LA A G LT Sbjct: 140 DIRDYEMPLPPLAEQKRIVEKLDEVLAQVDTIKARLDGIPDLLKRFRQSVLASAVSGTLT 199 Query: 424 AQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 +WR N +L E E++K+ A GK R K Sbjct: 200 KEWRLTN-ELTKAE-------EELKSNFLAKSGKLKLRGK 231 >UniRef50_C0VG50 Type I restriction modification enzyme protein S n=1 Tax=Acinetobacter sp. ATCC 27244 RepID=C0VG50_9GAMM Length = 399 Score = 157 bits (397), Expect = 7e-37, Method: Compositional matrix adjust. Identities = 126/416 (30%), Positives = 211/416 (50%), Gaps = 25/416 (6%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 I + ++T IRGV+Y K A++ +++ YLP++RANNIQ D V+VP++ + + Q Sbjct: 4 IVKIGNISTQIRGVSYSKSDAVSNMQEGYLPVLRANNIQEQGLILEDFVYVPESKISKKQ 63 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSL 130 +I D++IA SSGS S+VGK+A FGAFC +LRP +L+ + A++ ++ Sbjct: 64 RILAGDVIIAASSGSISLVGKAASAKEDINAGFGAFCKILRPNTELVDPRYFANYFQTQQ 123 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 YR IS+L+AGANINN+K D + IP+PPL+EQ+ IA LD Q D + + +Q Sbjct: 124 YRQIISNLAAGANINNLKNEHLDDLEIPLPPLSEQRRIASILD----QADVLRQKRQQAI 179 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV-- 248 + L + QA G + FE KKL+ + L ++ + E + Sbjct: 180 EKLDQLLQATFIDMF-GDPVSNPKGFE-----VKKLSEQVDLIQIGPFGTQLHQEDYIEN 233 Query: 249 GHPILRISSVRAGHVDQN-DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G P++ S ++ G + N + + EL+++ L+ D+L R G + G C ++ Sbjct: 234 GIPLINPSHIKNGKIVPNLKLSVSQLKYGELSQYHLKLHDVLLGR-RGEM---GRCAVVT 289 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS-GKDIK 366 + + L L + P ++E+ SS S + + N + GQ + K I Sbjct: 290 QNEVGWLCGTGSLFLRPNVEKINPFFLEMLLSSDSIKRYLENV---SQGQTMANLNKTIV 346 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + L+ P EI + + + ++ ++ N+ +VNNL QS+ AF G L Sbjct: 347 
GSIPLIAP---SIEIQNKFFLISEEINKMKTELENSKNQVNNLFQSLQNHAFNGTL 399 >UniRef50_B0VPS8 Specificity determinant for hsdM and hsdR n=1 Tax=Acinetobacter baumannii SDF RepID=B0VPS8_ACIBS Length = 386 Score = 152 bits (385), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 127/412 (30%), Positives = 200/412 (48%), Gaps = 41/412 (9%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M K P W IA + V LI G +K + D LP+IR N+ N + Sbjct: 2 MQVSKSPPSWCIASIGEVCNLINGRAFKSTE----WTDRGLPIIRIQNLNNPD---ANFN 54 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF----CGVLRPEKL 116 F +L ++ D++ A S G S H+ ++ GA ++ + L Sbjct: 55 FFNGDL-DNKHRVEKGDLLFAWSGTP----GTSFGAHI-WDGDIGALNQHIFKIVFNDSL 108 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I FI + +L +S G + ++ F+ I PPL EQKIIA+KLDTLL Sbjct: 109 IDKRFIRYAINQTL-DELVSGARGGVGLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLL 167 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 AQV +TK R E+I ILK FRQ++L AV+GKLTE+WR + + + K +I + Sbjct: 168 AQVATTKVRLERILNILKTFRQSILSSAVSGKLTEEWRKNKKLNWI--KSTLANICRSVS 225 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDI-RFLECS--ESELNRHKLQDGDLLFTRY 293 +G P + G P L IS++ G +D + + R++ S ES + K + D+L+T Sbjct: 226 DGDHQAPPRADFGIPFLVISNISKGEIDFSSVNRWVPESYYESLKDIRKPEINDILYT-V 284 Query: 294 NGSLEFVGVCGLLKKL------QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 GS G+ +K +H ++ P+ +Y+ + +SP Sbjct: 285 TGSF---GIPVTVKSTTPFCFQRHIAIIKPNH-------SSVDYKYLFYYLASPEVFKHA 334 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 + + T + QK +S +++ +LLPP++EQ EIV RVE+L A+AD IEK++ Sbjct: 335 TS-IATGTAQKTVSLSHLRNFNILLPPIEEQTEIVHRVEELLAFADGIEKKL 385 Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 54/204 (26%), Positives = 94/204 (46%), Gaps = 25/204 (12%) Query: 232 LTELRNGLSSKPNE-SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + L NG + K E + G PI+RI ++ + D F + N+H+++ GDLLF Sbjct: 19 VCNLINGRAFKSTEWTDRGLPIIRIQNL-----NNPDANFNFFNGDLDNKHRVEKGDLLF 73 Query: 291 TRYN------GSLEFVGVCGLLKKLQHQ-NLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 G+ + G G L QH +++ D LI R + A+ + ++ S A Sbjct: 74 AWSGTPGTSFGAHIWDGDIGALN--QHIFKIVFNDSLIDKRFIRYAINQTLDELVSG--A 129 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 R + G K ++ ++ ++ PP+ EQ I +++ L A T + ++ L Sbjct: 130 RGGV--------GLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVATTKVRLERIL 181 Query: 404 ARVNNLTQSILAKAFRGELTAQWR 427 + QSIL+ A G+LT +WR Sbjct: 182 NILKTFRQSILSSAVSGKLTEEWR 205 >UniRef50_Q8PTL2 Type I restriction-modification system specificity subunit n=2 Tax=Methanosarcina RepID=Q8PTL2_METMA Length = 440 Score = 151 bits (382), Expect = 4e-35, Method: Compositional matrix adjust. Identities = 136/479 (28%), Positives = 227/479 (47%), Gaps = 78/479 (16%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + + + G KK + G+FD VF Sbjct: 6 ELPEGWAECQIKDIVVINYGKGLKKSDRVE-----------------GQFD----VFGSN 44 Query: 65 NLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKLIFSGF 121 +V K +Q ++ VI GS + S+ P + ++ F G+ R F Sbjct: 45 GIVGKHNQSLTNGPTVIIGRKGSVGEINLSSEPCWPIDTTYYIDNFYGINRI-------F 97 Query: 122 IAHFTKS-SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + + K+ +L S+ G N N+I +P+PPL+EQ I ++ L A++D Sbjct: 98 LYYLLKTLNLANYDTSTAIPGINRNDIYSQL-----VPLPPLSEQHRIVSAIEALFARLD 152 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN-----------FEPQHSVFKKLNFE 229 +T + +++ +ILK+FR++VL A +G+LTE+WR E Q ++ K+ Sbjct: 153 ATNEKLDRVQEILKKFRESVLAAACDGRLTEEWRKENLHCNEYFAIDEDQFNLVKQWRIP 212 Query: 230 SI--LTELRNGLS-------SKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESE-L 278 ++ + L + S S P + +G +R S ++ GH+D ++ +++ E + E + Sbjct: 213 TVWSWSTLEDSCSHVVDCPHSTPKWTDIGVYCVRTSELKCGHIDFSNAKYVSEATYLERI 272 Query: 279 
NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 R K Q+GD+L++R G+ VG+ L+ + + +L+ R + +P + Sbjct: 273 KRLKPQEGDILYSR-EGT---VGIASLVP--SNVKICLGQRLMLFRTKNNLIPSFFVKVL 326 Query: 339 SSPSARNAMMNCVKTTSGQKG--ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +SP +++ K+T G + DIK LPP+ EQ EIVRRV+ LFA+AD+IE Sbjct: 327 NSPYIYDSVK---KSTMGSTAPRFNVADIKKFPTPLPPLPEQQEIVRRVDALFAFADSIE 383 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQW----RAENPDLISGENSAAALLEKIKAER 451 +V A + L QSILAKAF G+L R E D +A L+E+IK ER Sbjct: 384 TKVAAAREKTEKLRQSILAKAFSGQLVETQAEIVRREGRDY----ETAEVLIERIKEER 438 >UniRef50_A6DQ81 Putative restriction-modification system specificity determinant n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQ81_9BACT Length = 402 Score = 147 bits (371), Expect = 7e-34, Method: Compositional matrix adjust. Identities = 119/422 (28%), Positives = 202/422 (47%), Gaps = 39/422 (9%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKIS 74 + +++ IRGV+YKK ++ + Y P++RANNI G + LV+V ++KE Q + Sbjct: 6 IGDISSQIRGVSYKKNDVVDEPTERYTPVMRANNINEGFLNYDKLVYVKSEVIKEHQLLQ 65 Query: 75 PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRN 133 D++I SSGS ++VGK+ SFGAFC VLRP+ K +F F + +S Y+ Sbjct: 66 KGDVLICASSGSLNLVGKAGSFLDSTSSSFGAFCKVLRPDTKKVFPRFFHFYFQSQGYKR 125 Query: 134 KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQIL 193 I +L+ GANINNIK D + IP+P L EQK IA LD + Q + L Sbjct: 126 SIKALAEGANINNIKNEHLDDLKIPLPSLEEQKRIAAILDKADELRQKRREAISQCNEFL 185 Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP-------NES 246 K ++ G V P+ + K+ F+ +L + G S K +E Sbjct: 186 KSTFLSMFGDPVTN----------PKG--WDKIIFDELLDNIDGGWSPKCETWPATLDEW 233 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 GV +++ ++ + + + + + ++Q DLLF+R N + E V C + Sbjct: 234 GV----MKLGALTTCEYKEEENKAMLPGLETKSNIEIQPRDLLFSRKN-THELVAACAYV 288 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPE--YIEIFFSSPSARNAMMNCVKTTSG-QKGISGK 363 + Q L+ D + R + A Y+ + R + +G IS K Sbjct: 289 
WDTRPQ-LMMSDLMFRFKFKASAEVNSIYMWKLLVNERQRKEVQALASGAAGSMPNISKK 347 Query: 364 DIKSQVVLLPPVKEQ---AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 ++K+ + +PP++ Q AEI ++ E + + Q+ +L +++ +++ KAF+G Sbjct: 348 NLKTIKLPIPPIELQNQFAEIAKKTE-------SSKSQMQQSLKELDDNFDALMQKAFKG 400 Query: 421 EL 422 EL Sbjct: 401 EL 402 >UniRef50_C9YAL6 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YAL6_9BURK Length = 449 Score = 147 bits (370), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 134/470 (28%), Positives = 229/470 (48%), Gaps = 38/470 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK--FDTTDLVFVP 63 LP+ W AP+ + + ++ +A ++ +P++ A NI + K FD L+ P Sbjct: 3 LPQSWTTAPLGKLCEKLSDGSHNPPKA----QETGMPMLSARNINDRKITFDEFRLI-SP 57 Query: 64 KNLVKESQK--ISPEDIVIAM--SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + +E ++ +S D+++ + + G +VV + A Q + VL+P K S Sbjct: 58 EEFAEEDRRTRVSSGDVLLTIVGAIGRTAVVPQGAPQF-----TLQRSVAVLKPIK-SDS 111 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +I++ ++ + + + G I + + IP+ P EQK IA+KLDT+L +V Sbjct: 112 RYISYALEAPALQKYLQDNAKGTAQKGIYLKALAGVEIPVAPEPEQKRIADKLDTVLTRV 171 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNG 238 D+ R ++ +LKRFRQ+VL A +G+LTE WRN P+ + + + + +G Sbjct: 172 DAVNTRLARVAPLLKRFRQSVLAAATSGRLTEDWRNGSIPEVKEWSEKALSEVCRTITDG 231 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD--GDLLFTRYNGS 296 P + G P++ VR VD +D +F+ ++ +R + GD+L + Sbjct: 232 EHISPPLAPHGVPLVSAKDVREWGVDFSDTKFVSEEFADASRKRCGPICGDVLVVSRGAT 291 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIR--ARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 VG L+K + L+ L + A L K E++ +SP + T Sbjct: 292 ---VGRTCLVKSKEKFCLMGSVLLFQPTATLIKS---EFLAHVLASPLGLEQLTKASGAT 345 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 + Q I +D K + LP ++EQ EIVRRVE LFA+AD +E ++ A A LT ++L Sbjct: 346 A-QAAIYIRDAKGLKIRLPSIEEQTEIVRRVETLFAFADRLEARLAQAQAAATRLTPALL 404 Query: 415 
AKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 AKAF GEL Q +P+ + AA LL ++ A+ A+ + RK + Sbjct: 405 AKAFSGELVPQ----DPN----DEPAAELLRRL-AQAPATASPRKGRKAA 445 >UniRef50_UPI0001695152 type I restriction enzyme specificity protein n=1 Tax=Xanthomonas oryzae pv. oryzicola BLS256 RepID=UPI0001695152 Length = 451 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 131/484 (27%), Positives = 228/484 (47%), Gaps = 62/484 (12%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPL----------IRANNIQNG-- 52 +LP GWV + + + + + I ++ + P R + ++ Sbjct: 4 ELPGGWVETTIGEICAMGPKSAWDDDMEIGFVPMSHAPTNFRGPLNYEARRWHEVKKAYT 63 Query: 53 KFDTTDLVFVPKNLVKESQKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGV 110 F+ D++F K++P E+ A+ +G + G + + F + Sbjct: 64 HFENDDVIFA---------KVTPCFENGKAALVAGLPNGAGAGSSE----------FHVL 104 Query: 111 LRPEKLIFSGFIAHFTKSSLY-RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIA 169 R + I ++ KS+ + R +++ + + A + + +PP AEQK IA Sbjct: 105 RRRDAGISPSYLLAVIKSAQFLREGEENMTGAIGLRRVPRAFVENFPVRLPPEAEQKRIA 164 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT-EKWRNFEPQHSVFKKLNF 228 +KLD LLAQVD+ KAR + IP +LKRFRQ+V+ V+G L ++ +F+ + ++ + Sbjct: 165 QKLDALLAQVDTFKARIDAIPALLKRFRQSVINHGVSGSLALDQHASFD--TTTWRNMRA 222 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD--------QNDIRFLECSESELNR 280 E + T++++G + K + G P L++ ++ G ++ DI C +S Sbjct: 223 EDVCTKVQSGGTPKEGFTTEGIPFLKVYNIVDGIIEFEYRPQYIAADIHQGSCRKS---- 278 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 GD+L L + V + + + N+ L R ++ +I + Sbjct: 279 -ITIPGDVLMNIVGPPLGKIAV--VPQGVDEWNINQAITLFRP--SESISSAWIHLVLLE 333 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + + K ++GQ IS + V +PP + Q EIVRRVEQLFAYAD +E +V Sbjct: 334 GTNIRRVSQETKGSAGQVNISLSQCRDFVFPVPPTQIQDEIVRRVEQLFAYADQLEAKVA 393 Query: 401 NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKAS 460 A R++ LTQS+LAKAFRGEL Q ++ P A+ LL++I+A+RAA+ K Sbjct: 394 AAQQRIDALTQSLLAKAFRGELVPQDPSDEP--------ASVLLDRIRAQRAATPKPKRG 
445 Query: 461 RKKS 464 RK + Sbjct: 446 RKAA 449 >UniRef50_Q2P0A3 Specificity determinant for hsdM and hsdR n=2 Tax=Xanthomonas oryzae pv. oryzae RepID=Q2P0A3_XANOM Length = 450 Score = 146 bits (369), Expect = 1e-33, Method: Compositional matrix adjust. Identities = 125/418 (29%), Positives = 210/418 (50%), Gaps = 43/418 (10%) Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 K++ QK+ P+D+++ + + V ++ + + + + +P L FI Sbjct: 53 KDIGSTKQKVEPDDVLLCKINPRINRVWLVGKKNDHEQIASSEWIVIRQP--LFDPAFIR 110 Query: 124 HFTKSSLYRNKISS--LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S +R+++ + G ++ +P + + I PLAEQK IA+KLD LLAQVD+ Sbjct: 111 FQLQESSFRDRLCAEVSGVGGSLTRAQPKKVESYKLRIAPLAEQKRIAQKLDALLAQVDT 170 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR----NFEPQHSV-----FKKLNFESIL 232 KAR + IP +LKRFR++V+ AV G+L+ R E Q + ++++ S L Sbjct: 171 LKARIDAIPALLKRFRKSVVHSAVIGRLSADLRVPIEKSEEQEQLGPLESWREVTLAS-L 229 Query: 233 TELRNGLSS-KP-NES---GVGHPILRISSV--RAGHVDQNDIRFLECSESELNRHKLQD 285 EL G S +P N+S G +P ++ V G + + + + SE L + +L Sbjct: 230 GELSRGKSKHRPRNDSRLYGSEYPFIQTGDVANSGGALTSSKVFY---SEFGLKQSRLFP 286 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSAR 344 L ++ + + + +PD ++ KD + ++I+ Sbjct: 287 SGTLCITIAANIADTAMLAI-------DACFPDSVVGFIPNKDDCVAQFIKYVIDD---N 336 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + + QK I+ K + + +PP+KEQ EIVR VEQLFAYAD +E +V A Sbjct: 337 KESLEALAPATAQKNINLKVLNQVKLRIPPIKEQTEIVRHVEQLFAYADQLEAKVAAAQQ 396 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 R++ LTQS+LAKAFRGEL Q ++ P A+ LL++I+A+RAA+ K RK Sbjct: 397 RIDALTQSLLAKAFRGELVPQDPSDEP--------ASVLLDRIRAQRAATPKPKRGRK 446 >UniRef50_C5SE02 Restriction modification system DNA specificity domain protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SE02_CHRVI Length = 448 Score = 145 bits (365), Expect = 4e-33, Method: Compositional matrix adjust. 
Identities = 116/444 (26%), Positives = 204/444 (45%), Gaps = 29/444 (6%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W P+ V ++ G +K + N + P+IR ++ +G T +P Sbjct: 23 DWWERVPLGDVCDILNGFPFKSQHFNN---SEGAPVIRIRDVTSGFCKTFYSGDIPVGYW 79 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + P D+V+ M + S L C + E + F+++ Sbjct: 80 -----VEPFDMVVGMDGDFNCRLWSSERSLLN-----QRVCKLTPHEDFLDKKFLSYVLP 129 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 + Y I+ + + ++ + I P+PPLAEQ+ I KLD L + + Sbjct: 130 A--YLRLINDHTHSITVKHLSSKTIAKIPFPLPPLAEQRRIVAKLDRLFERTRRAREELS 187 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG 247 IP++++ +++A+L A G LT+ WR + + K++ + +L G S+K ++SG Sbjct: 188 HIPRLIENYKKAILVAAFRGDLTKDWRE-KRGLPMPKEVKLGEVAKKLSYGTSAKSSKSG 246 Query: 248 VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 P+LR+ +++ +D D+ + E E+ ++ L GD+LF R N S E VG + K Sbjct: 247 -DVPVLRMGNIQNMRIDWKDLVYTSDVE-EIEKYSLNAGDVLFNRTN-SPELVGKTAIYK 303 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + +Y LI+ + +PEY+ +SP R+ Q I+ K + Sbjct: 304 G--ERPAIYAGYLIKIKCGNRLVPEYLNYCLNSPLGRSYCWRVKSDGVSQSNINAKKLAD 361 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 LLP EQ EIV R+E+ + D++ + A +++L Q+ LAKAFRGEL Q Sbjct: 362 FSFLLPTHDEQKEIVFRIEKTLDWLDSLVIEERQASHLLDHLDQANLAKAFRGELVPQDP 421 Query: 428 AENPDLISGENSAAALLEKIKAER 451 ++ P A+ LLE+I A+R Sbjct: 422 SDEP--------ASVLLEQIYADR 437 >UniRef50_B6R0S6 Restriction modification system DNA specificity domain protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R0S6_9RHOB Length = 492 Score = 138 bits (347), Expect = 5e-31, Method: Compositional matrix adjust. 
Identities = 137/506 (27%), Positives = 228/506 (45%), Gaps = 72/506 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF--DTTDLVFV 62 +LPEGWV + + + RG + + ++ DD L I+ ++ +G + ++T+ Sbjct: 3 ELPEGWVETEIENIYEVARGGSPRPIKSYLTADDDGLNWIKISDATSGGYRIESTEQKIT 62 Query: 63 PKNLVKESQKISPEDIVIA--MSSGSK--SVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + L K ++ I P D++++ MS G S + H FG C R L Sbjct: 63 SEGLHK-TRLIYPGDLLLSNSMSFGKPYISAIEGCIHDGWLVLGGFGKKCVDTRYMHLAL 121 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S S + + ++G+ + N+ + + +P+ PLAEQK I K+++L A+ Sbjct: 122 S--------SEGVQKQFDEKASGSTVRNLNTGIVNSVRVPLAPLAEQKRIVAKIESLTAK 173 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR------------------------ 214 + +I + KR++QA+L A +G+LT WR Sbjct: 174 SRIARENLARIDTLTKRYKQAILKKAFSGELTADWREKSSKDCLIDLNDVLKEHEVIWQN 233 Query: 215 -----------NFEPQHSV--FKKLNFESI--LTELRNGLSSKPNESGVGHPILRISSVR 259 N +P + + +L+ E + + + + P E G G P + + V+ Sbjct: 234 NIAKKGKYARPNVKPADDLRSWHELSLEGLAYVVDPHPSHRTPPKEIG-GIPYVGVGDVK 292 Query: 260 -AGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 G +D R + + + L R+ L+ GD + G + +G LL + Q L Sbjct: 293 LDGKLDFAGARKVSPKVLKDHLKRYSLKRGDFAY----GKIGTIGQPFLLPEAQEYALSA 348 Query: 317 PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK 376 LI+ R +K A E++ FF SP ++ TS Q K ++ + LP + Sbjct: 349 NVILIQPR-SKFATAEFLYYFFLSPVVTQKILGASVATS-QAAFGIKKMREVLTPLPSLS 406 Query: 377 EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISG 436 EQ EIV R+E+ FA D + ++ AL V+ L + ILAKAFRGEL Q +PD Sbjct: 407 EQNEIVTRIEKAFAKIDKLAEEAKRALHSVDRLDEKILAKAFRGELVPQ----DPD---- 458 Query: 437 ENSAAALLEKIKAERAASGGKKASRK 462 + A+ LLE+IKAERAA K +RK Sbjct: 459 DEPASVLLERIKAERAAQPKVKRARK 484 >UniRef50_A7IEA1 Restriction modification system DNA specificity domain n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IEA1_XANP2 Length = 450 Score = 138 bits (347), Expect = 5e-31, Method: Compositional matrix adjust. 
Identities = 123/470 (26%), Positives = 222/470 (47%), Gaps = 43/470 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKK-------EQAINYLKDDYLPLIRANNIQNGKFDTT 57 ++P W+ A V ++ G T +Q + +L L R I G+ D + Sbjct: 7 QVPHSWLWASFGEVADIVGGGTPPTGDEANFTKQGVPWLTPADLTGYRETYISRGRRDLS 66 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFC--GVLRP 113 + K + + ++ P+ V+ S++ VG A +++ F +F G + P Sbjct: 67 E-----KGYRESAARLLPKGTVLF---SSRAPVGYCAIASENVSTNQGFKSFILKGDISP 118 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 E ++ H+ S S ++G + + + +P+PPL EQ+ I K+D Sbjct: 119 E------YVRHYLLGST--EYAESKASGTTFKELSGSRATELALPLPPLPEQRRIVAKID 170 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 +L A+ + E IP+++++++QA+L A +G+LTE P V +L E I Sbjct: 171 SLTAKSRRARDHLEHIPRLVEKYKQAILAAAFDGRLTE----LSPHDIVHPELG-ELIEF 225 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQ-NDIRFLECSESELNRHKLQDGDLLFTR 292 +NGL + G G PILRI + +D+ + + S++ + + DGDL+ R Sbjct: 226 GPQNGLYLPKDRYGEGTPILRIQNYGFNFIDEPTNWHRVTVSDAIAAQFAMSDGDLIINR 285 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 N S +G ++ K ++ ++R RL A P++++++ SS R ++ K Sbjct: 286 VN-SPSHLGKSMVVTKAM-AGAIFESNMMRIRLNALAEPKFVQLYLSSSQGRGSLTKDAK 343 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 Q I+ D+ V LP + +Q ++ R+E FA+ D + + +A ++ L Q+ Sbjct: 344 WAVNQASINQGDVSRTPVPLPGLSDQIAVLDRIETAFAWIDRLAAEATSARTLIDRLDQA 403 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 +LAKAFRGEL Q A+ P A+ LLE+I+AER A+ + R+ Sbjct: 404 VLAKAFRGELVPQDPADEP--------ASVLLERIRAERGAAPKARRGRR 445 >UniRef50_A1BGI9 Restriction modification system DNA specificity domain n=2 Tax=cellular organisms RepID=A1BGI9_CHLPD Length = 479 Score = 135 bits (341), Expect = 3e-30, Method: Compositional matrix adjust. 
Identities = 134/497 (26%), Positives = 226/497 (45%), Gaps = 83/497 (16%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVFVPKNLVK 68 VIA + V I G +K + + LP+IR N+ +N KF+ ++ VF + LVK Sbjct: 7 VIAILGDVAEYINGRAFKPSE----WGKEGLPIIRIKNLNDENSKFNYSNEVFEKRYLVK 62 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 + D++ A S+ + + K E +++P I ++ +F Sbjct: 63 KG------DLLFAWSASLGAYIWKKD------EAWLNQHIFLVKPSPFIAKLYLYYFLDK 110 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 ++ S + G+ + ++ F+ I +PPL+EQ+ I K++ L +++D+ A ++ Sbjct: 111 --ITQELYSAAHGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKK 168 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWR----NFEPQHSVFKKLNFE--------------- 229 + LK +RQAVL A G+LT+ WR N + + E Sbjct: 169 AQEQLKVYRQAVLKQAFEGELTKSWREQQANLPSAQDLLDTIKTEREQAAKNQGKKLKPV 228 Query: 230 --------SILTELRNGL----------------SSKPNESGVGHPILRISSVRAGHVDQ 265 LTEL +G S+K E G P++R+ +++ G +D Sbjct: 229 TPLAKVELDELTELPDGWCWIKLGELTIGVEYGTSTKSLEKG-EVPVIRMGNIQQGRIDW 287 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 ND+ F + ++++++++L GD+LF R N S E VG + ++ LIR Sbjct: 288 NDLAFTD-DKADISKYRLLKGDVLFNRTN-SPELVGKAAIYNG--EMPAIFAGYLIRVNQ 343 Query: 326 TKDALP-EYIEIFFSSPSARNAMMNCVKTTS-GQKGISGKDIKSQVVLLPPVKEQAEIVR 383 K+ L +Y+ F +S A+ N VKT Q I+G+ +KS + KEQ +IV+ Sbjct: 344 IKELLHCKYLNFFLNSHPAK-VYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQ 402 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG------ELTAQWRAENPDLISGE 437 +E + D +E + +L + L QSIL KAF G ELTA +PD Sbjct: 403 EIEARLSVCDNMEATIRESLEKAEALRQSILKKAFEGKLLSEEELTAT--RNDPDW---- 456 Query: 438 NSAAALLEKIKAERAAS 454 A LLE+I+AE+ S Sbjct: 457 EPAEKLLERIRAEKNQS 473 Score = 71.6 bits (174), Expect = 6e-11, Method: Compositional matrix adjust. 
Identities = 71/232 (30%), Positives = 112/232 (48%), Gaps = 26/232 (11%) Query: 232 LTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + E NG + KP+E G G PI+RI ++ D+N +F +E R+ ++ GDLLF Sbjct: 14 VAEYINGRAFKPSEWGKEGLPIIRIKNLN----DENS-KFNYSNEVFEKRYLVKKGDLLF 68 Query: 291 TRYNGSL-EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 ++ SL ++ QH L+ P I A+L Y+ F + + Sbjct: 69 A-WSASLGAYIWKKDEAWLNQHIFLVKPSPFI-AKL-------YLYYFLDKITQE---LY 116 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 SG ++ K + + LPP+ EQ IV ++EQLF+ D + A ++ Sbjct: 117 SAAHGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKKAQEQLKVY 176 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER---AASGGKK 458 Q++L +AF GELT WR + +L S ++ LL+ IK ER A + GKK Sbjct: 177 RQAVLKQAFEGELTKSWREQQANLPSAQD----LLDTIKTEREQAAKNQGKK 224 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 52/209 (24%), Positives = 94/209 (44%), Gaps = 12/209 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP+GW + +T GV Y + L+ +P+IR NIQ G+ D DL F Sbjct: 241 ELPDGWCWIKLGELTI---GVEYG--TSTKSLEKGEVPVIRMGNIQQGRIDWNDLAFTDD 295 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIFSGFIA 123 ++ D++ ++ S +VGK+A + F G V + ++L+ ++ Sbjct: 296 KADISKYRLLKGDVLFNRTN-SPELVGKAAIYNGEMPAIFAGYLIRVNQIKELLHCKYLN 354 Query: 124 HFTKS---SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F S +Y N + + G N +NI +P EQ+ I ++++ L+ D Sbjct: 355 FFLNSHPAKVYGNSVKT--DGVNQSNINGEKLKSYPLPYCSPKEQEQIVQEIEARLSVCD 412 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 + +A + + + RQ++L A GKL Sbjct: 413 NMEATIRESLEKAEALRQSILKKAFEGKL 441 >UniRef50_Q210J8 Type I restriction enzyme StySPI specificity protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q210J8_RHOPB Length = 460 Score = 133 bits (334), Expect = 2e-29, Method: Compositional matrix adjust. 
Identities = 126/482 (26%), Positives = 212/482 (43%), Gaps = 50/482 (10%) Query: 4 GKLPEGWVIAPVSTVTTL----IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 G LP GWV AP+ + L I Y ++ + ++R NI +F + D Sbjct: 3 GDLPSGWVAAPIDDLRALEPNAITDGPYGSSLKTSHYRSSGARVVRLGNIGFRRFLSADA 62 Query: 60 VFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS---FGAFCGVLRPE 114 V++ ++ K + D++IA VG+S P + S A C LR Sbjct: 63 VYISEDHFKALVKHHVRAGDVLIAALGDP---VGRSCIA--PSDISPALVKADCFRLRCS 117 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + + FI + S R SS + G I + F +P+PP EQ I K+D Sbjct: 118 PHLSAPFIMLWLNSECAREAFSSAAHGLGRVRINLSDFRTTVVPVPPATEQGRIVAKIDN 177 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-NFEPQHSVFKKLNFESILT 233 L A+ ++ + IPQ++++++QA+L A G+LT +WR N Q + + + I Sbjct: 178 LSAKSKRSRDHLDHIPQLVEKYKQAILAAAFRGELTHEWRVNNLDQKWPWPECSLSDI-A 236 Query: 234 ELRNGLSSKPNE----SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 + G + K E S P + +V+ V D E + E N G +L Sbjct: 237 NIGTGATPKRGEQRYYSNGNIPWITSGAVKHAVVQAADEYITEAAVRETNCKVFPAGTIL 296 Query: 290 FTRYN-----GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 Y G + +G+ + I+ R A+ +++ R Sbjct: 297 MAMYGEGKTRGRVTVLGINAATNQAV--------AAIQVRADSPAVRDFVVWHL-----R 343 Query: 345 NAMMNCVKTTSG--QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + + + +G Q ++ + + + LP EQ E+VRRV++ FA+ D + + +A Sbjct: 344 SGYLELRERAAGGVQPNLNLGIVNAWRIPLPSRDEQMEVVRRVQKAFAWIDRLTIETTSA 403 Query: 403 LARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 ++ L Q+ILAKAFRGEL Q +P+ + A+ LLE+IKA+RA S G +R+ Sbjct: 404 RKLIDRLDQAILAKAFRGELVPQ----DPN----DEPASILLERIKAKRAGSAGH--TRR 453 Query: 463 KS 464 +S Sbjct: 454 RS 455 >UniRef50_A6E2R5 Restriction endonuclease S subunits-like protein n=1 Tax=Roseovarius sp. TM1035 RepID=A6E2R5_9RHOB Length = 413 Score = 130 bits (327), Expect = 1e-28, Method: Compositional matrix adjust. 
Identities = 103/356 (28%), Positives = 162/356 (45%), Gaps = 27/356 (7%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIA 169 VL+P ++ GF KS L +I+ + G+ I ++ P+PPL EQ+ I Sbjct: 82 VLQPHEIDL-GFTFVALKSLL--PEITKDNRGSTIKYLRLGDIADTAAPLPPLPEQRRIV 138 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 KLDTL A+ + + I ++++R+R AVL A F Sbjct: 139 RKLDTLSARSTTARTHLTAIEKLVERYRTAVLEAA-----------FRTAWDAGFDTTIA 187 Query: 230 SILTELRNGL--SSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDG 286 L GL S +G G+P +R++ AG + D+ ++ + SE R++L+ Sbjct: 188 GCLEHAETGLVRSKAEQTAGEGYPYIRMNHYDLAGRWNDRDLTYVAATSSEFERYQLRAN 247 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 DLLF N S E VG + + L+ + L+R R + D LP + SSP R Sbjct: 248 DLLFNTRN-SAELVGKVAIWPE-GKDGYLFNNNLLRMRFSADVLPGFAFWQMSSPPFRRY 305 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + T+ I + + + +P EQ EIVRR+E FA D ++ + AL + Sbjct: 306 IEGFISATTSVAAIYQRSLMAAPFWVPDTDEQREIVRRIETAFAKIDRLKAEAAKALKLL 365 Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 +L Q ILAKAF G+L Q + P A LL +I+ RAA+ + R+ Sbjct: 366 GHLDQRILAKAFAGDLVPQDPTDEP--------AETLLARIREARAATQTSRRRRR 413 >UniRef50_B4VXC6 Type I restriction modification DNA specificity domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXC6_9CYAN Length = 506 Score = 129 bits (325), Expect = 2e-28, Method: Compositional matrix adjust. 
Identities = 98/350 (28%), Positives = 167/350 (47%), Gaps = 25/350 (7%) Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F G + L +S L I ++ + I + PL EQK IA+KLD LLA Sbjct: 87 FHGMPTRYWFYQLKNLGLSELDKATAIPSLNRKDAYRVQIHLSPLNEQKRIADKLDALLA 146 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 +VD+ + R ++ I+++ RQA+L ++GK+T+ W ++ + N L++ + Sbjct: 147 RVDACRDRLIRVSFIIQQLRQAILTDGISGKITQYWSKNNAENLAYNHQNIVGKLSDFAD 206 Query: 238 GLSSKPNE-----SGVGHPIL---RISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 + P+ G PIL ++S + + + E+ H + D++ Sbjct: 207 VIDPNPSHRYPSYKGGTIPILATEQMSGLNDWDTSSAKLIKYDFYEARKAAHDFLNDDII 266 Query: 290 FTRYNGSLEFVGVCGLLKKL-QHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAM 347 F R G GL + Q+ ++ + R+ D LP Y+ F + + Sbjct: 267 FAR-------KGRLGLARNPPQNIRYVFSHTVFIIRVKADNILPSYLLWFLRQEFCIDWL 319 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++ + + +G + ++ + +P EQ EIV+ +E+L+AYAD IE + NAL RV Sbjct: 320 LSEMNSNAGVPTLGKSVMERLPITIPDYAEQQEIVQCIEKLYAYADRIEARYQNALTRVE 379 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK 457 LT ++L+KAFRGEL Q +PD + + LLE+I+AERAA K Sbjct: 380 QLTPTLLSKAFRGELVPQ----DPD----DEPVSVLLERIRAERAAQPNK 421 >UniRef50_B2A6M8 Restriction modification system DNA specificity domain n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A6M8_NATTJ Length = 490 Score = 125 bits (313), Expect = 4e-27, Method: Compositional matrix adjust. 
Identities = 123/457 (26%), Positives = 208/457 (45%), Gaps = 48/457 (10%) Query: 5 KLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +LP W + + I+ G T K+ + +P+ R +IQN + + ++ Sbjct: 26 ELPNNWAWVALDILAEEIKNGTTIKQSKT-----KPGIPVTRIESIQNNEIQLDRVRYIR 80 Query: 64 K-NLVKESQKISPEDIVIAMSSGSKSVVGKSA---HQHLPFECSFGAFCGVLRPEKLIFS 119 + +K + DIV++ S VGK+A +LP + +I Sbjct: 81 DLDKIKNNDYYKIGDIVLS-HINSIEHVGKTALIKEDYLPLIHGMN-LLRIRVNNNMILP 138 Query: 120 GFIAHFTKSSLYRNKI-SSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ +T+S +R + + N ++ + I+IPI P EQ+ I K+D LL++ Sbjct: 139 QFLQLYTRSYNFRKAVLKRIKMAVNQVSLNQKNLKQISIPIAPKNEQRRIVYKVDRLLSK 198 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ----HSVFKKLNFESI--- 231 ++ K + + + R A+L A G+LT WR P+ ++ K+N E Sbjct: 199 INKAKELIGEAKETFELRRAAILDKAFKGELT--WREENPRVESVDTLLAKINSEKKTDI 256 Query: 232 ------LTELRN----------------GLSSKPNESGVGHPILRISSVR-AGHVDQNDI 268 L EL + G S+K + G P+LR+ +++ G +D ND+ Sbjct: 257 KKSPNGLYELPDNWCWIDLGELICHSSYGTSAKAYKDINGLPVLRMGNIKLTGSIDLNDL 316 Query: 269 RFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TK 327 ++L ++ ++KL++ DLLF R N S E VG +++ Y LI+ L K Sbjct: 317 KYLPFDHKDVEKYKLEEYDLLFNRTN-SYELVGKSAIVEPEHAGKFTYASYLIKISLFYK 375 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 L YI + +S R +++ VK GQ I+ K + S V LPP +E EI R +++ Sbjct: 376 KILAPYICYYINSHIGRKYLLSTVKQQVGQANINSKKLSSLPVPLPPEEEIKEINRIMKK 435 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 + A + I+ +N V L QSIL+KAFRGEL Sbjct: 436 VSAKENRIQNLLNLG-TYVAELEQSILSKAFRGELNT 471 Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 61/219 (27%), Positives = 118/219 (53%), Gaps = 9/219 (4%) Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 E++NG + K +++ G P+ RI S++ + + +R++ + N + GD++ + Sbjct: 42 EIKNGTTIKQSKTKPGIPVTRIESIQNNEIQLDRVRYIRDLDKIKNNDYYKIGDIVLSHI 101 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVK 352 N S+E VG L+K+ + L++ L+R R+ + LP++++++ S + R A++ +K Sbjct: 102 N-SIEHVGKTALIKE-DYLPLIHGMNLLRIRVNNNMILPQFLQLYTRSYNFRKAVLKRIK 159 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 Q ++ K++K + + P EQ IV +V++L + + ++ + A + Sbjct: 160 MAVNQVSLNQKNLKQISIPIAPKNEQRRIVYKVDRLLSKINKAKELIGEAKETFELRRAA 219 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 IL KAF+GELT WR ENP + S LL KI +E+ Sbjct: 220 ILDKAFKGELT--WREENPRV----ESVDTLLAKINSEK 252 >UniRef50_C3MFA1 Putative restriction endonuclease type I, S subunit n=1 Tax=Rhizobium sp. NGR234 RepID=C3MFA1_RHISN Length = 496 Score = 121 bits (304), Expect = 5e-26, Method: Compositional matrix adjust. 
Identities = 128/510 (25%), Positives = 216/510 (42%), Gaps = 74/510 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP GW + + + + G T K+ Y + +P I + + + D Sbjct: 3 ELPRGWCVTTIQEIADVGTGATPKRGTRAFY-ESGTIPWITSGAVSQRQITYADEFITEA 61 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + + K+ P ++ G G A + + VL + ++ S F+ + Sbjct: 62 AIRSTNCKVFPTGTILVAMYGEGKTRGSVARLAIDAATNQALAAIVLPNDDIVSSEFLMN 121 Query: 125 FTKSSLYRNKISSLSAGA-----NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F S +++ L+AG N+ I+ SF P+PPLAEQK I KLD L A+ Sbjct: 122 FLTSQY--SQLRGLAAGGVQPNLNLQLIRSTSF-----PLPPLAEQKRIVAKLDALSAKS 174 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-------------NFEPQHSVFKKL 226 + +I ++ R++QAVLG A +G+LT +R + + V +KL Sbjct: 175 ARARTELARIETLVYRYKQAVLGKAFSGELTVDFRLSRRHLQSEAKAGSIHGEEGVERKL 234 Query: 227 NFESILTELRNGLSSKP---------------NESGV------------------GHPIL 253 T++ G+ P N + G PI+ Sbjct: 235 KVRGT-TDVMKGIQLSPLPESWNWVKNHRLAQNRANAICAGPFGTIFKAKDFRDKGIPII 293 Query: 254 RISSVRAGHVDQNDIRFLECS-ESELNR-HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 + V AG + F++ EL++ + + G+LL T+ + GV + Sbjct: 294 FLRHVAAGEYRTHKPGFMDKKVWQELHQPYSVFGGELLVTKLG---DPPGVACIFPAGVG 350 Query: 312 QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 ++ PD + + ++P+++ +F+SP A+N + + + + K+ V Sbjct: 351 TAMVTPDVMKMSVDENASVPKFLMFYFNSPIAKNIIHQLAFGLTRLR-VDLAMFKTFPVP 409 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 P ++EQ EIVRR+E FA D + + AL V L ++ILAKAFRGEL Q + P Sbjct: 410 HPSLEEQLEIVRRIESAFAKIDRLAAEAKRALDLVGKLDEAILAKAFRGELVPQDENDEP 469 Query: 432 DLISGENSAAALLEKIKAERAASGGKKASR 461 EN LLE+I+AERAA+ K R Sbjct: 470 ----AEN----LLERIRAERAAAPKAKRGR 491 >UniRef50_UPI0001C36A8C HsdS1 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36A8C Length = 456 Score = 119 bits (297), Expect = 3e-25, Method: Compositional matrix adjust. 
Identities = 109/466 (23%), Positives = 212/466 (45%), Gaps = 63/466 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P W + V +I G T K + Y P + ++ G+ Sbjct: 30 EVPGNWCWVRLKDVAFVITGGTPSKNKPEYY--GGTFPFFKPADLDYGR----------- 76 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF--CGVLRPEKL------ 116 N+V S+ +S E ++ +KS C G+ CG L + Sbjct: 77 NMVAASEFLSEEGKAVSRCIPAKSTA----------VCCIGSIGKCGYLCVDGTTNQQIN 126 Query: 117 -----IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 + S F+ ++ + L+ ++ ++ I+ + + + P+PPL EQ+ IA Sbjct: 127 SAIPKVNSLFLYYYCNTILFTKQLRLKASATTISIVNKSKMEQCLFPLPPLREQQRIANH 186 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI 231 ++ + ++D K + + + + + + A+L A +G LT KWR +H K ++FE Sbjct: 187 IEEMFYKLDEIKEKTQLVLESSEDRKAAILYKAFSGALTAKWR----KH---KGVSFEGW 239 Query: 232 LTE-------LRNGL--SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHK 282 +T+ L+ GL + N+ V P LR+++V+ G++D +I+ +E ++ R++ Sbjct: 240 ITKPLSEVATLQTGLMKGKRNNQKTVLLPYLRVANVQDGYLDLKEIKNIEVDVLKIERYR 299 Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSP 341 L+ GD+LFT G + +G + + + + ++ + + R D L P ++ + S Sbjct: 300 LKKGDVLFTE-GGDFDKLGRSSVWNE-EIPDCIHQNHIFVVRTQTDTLDPYFLSLQAGSR 357 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 + + C K T+ I+ +K+ VL+P ++EQ EIV + + I++ Sbjct: 358 YGKTYFIGCSKQTTNLASINSTQLKNFPVLIPTIEEQREIVNILNFFLGKEEQIKQNCLK 417 Query: 402 ALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 L ++ + +SIL++AFRGEL NPD E S+ LL+ I Sbjct: 418 LLEKIEEIKKSILSRAFRGELGTN----NPD----EESSIELLKTI 455 >UniRef50_Q7UK98 Type I restriction enzyme EcoEI specificity protein n=1 Tax=Rhodopirellula baltica RepID=Q7UK98_RHOBA Length = 550 Score = 118 bits (295), Expect = 5e-25, Method: Compositional matrix adjust. 
Identities = 133/525 (25%), Positives = 223/525 (42%), Gaps = 103/525 (19%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDL 59 S LPEGW P+ + L+ G +K ++ + LP+IR N+ K++ D Sbjct: 37 SGEALPEGWADVPIGDLCDLVNGRAFKPKE----WSETGLPIIRIQNLNKAEAKYNHFDG 92 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFCGVLRPEKLI 117 + K+LV+ + + S G+ G A +QH+ F ++ + L Sbjct: 93 EYADKHLVRPGELLFAWSGTPGTSFGAHIWNGPKALLNQHI--------FRVLIDEDDLN 144 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + F F + I G + ++ F+ +P+PPLAEQ I +++L Sbjct: 145 MTFF--RFAINHKLEELIGKAHGGVGLRHVTKGKFEATQVPLPPLAEQSRIVSAIESL-- 200 Query: 178 QVDSTKARF--EQIPQILKRFRQAVLGGAVNGKLTEKWR----NFEPQHSVFKKLNFE-- 229 Q S++ARF ++ ++ + RQ+VL A +GKLT WR N EP + ++ E Sbjct: 201 QERSSRARFLLSEVGPLIGQLRQSVLRDAFSGKLTADWREANPNVEPAFKLLSRIRTERR 260 Query: 230 ----------------------------------SILTELRNG------------LSSKP 243 S L EL +G + Sbjct: 261 ERWEAEQLAKYEAKGKQPPKNWQDKYKEPEPVDESELPELPDGWCWCQVGDLIESFDAGR 320 Query: 244 NESGVGHP-------ILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 + + + HP +L++S+V D N + L+ + + + GDLL +R N + Sbjct: 321 SPTALSHPARDGEYGVLKVSAVTWREFDPNANKALKDGDEIGDTPTPRKGDLLISRAN-T 379 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRAR-LTKDALPEYIEIFFSSPSARNAMM-NCVKTT 354 +E +G +L K + NL+ DK +R +K+ +PEY+ S S R N T+ Sbjct: 380 VELIGAV-VLVKADYPNLMLSDKTLRMNPASKELVPEYLLYGLRSESVRKFFEDNATGTS 438 Query: 355 SGQKGIS-GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN----L 409 + + +S GK + + + L P ++QA V L D V + LA + + L Sbjct: 439 NSMRNLSQGKILDAPIALAPLAEQQA-----VADLLVTNDEACTSVASGLASMESSLTQL 493 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 QSIL+KAFRGEL Q + P A+ LL +I+A+R A+ Sbjct: 494 DQSILSKAFRGELVPQDPRDEP--------ASELLARIRAKRVAN 530 >UniRef50_Q26D97 Putative type I site-speicific deoxyribonuclease specificity subunit n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26D97_9BACT Length = 468 Score = 118 bits (295), Expect = 6e-25, Method: Compositional matrix adjust. 
Identities = 126/466 (27%), Positives = 211/466 (45%), Gaps = 49/466 (10%) Query: 5 KLPEGWVIAPVSTV---TTLIRGVTY--KKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 +LP+GWV +S++ T L + + K+Q N + LI+ +I G F Sbjct: 4 ELPKGWVETNISSLVDDTGLFKDGDWVESKDQDPN----GNVRLIQLADIGLGNFRDKSQ 59 Query: 60 VFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKL 116 F+ + + + DI++A +G+S L E ++RP +K Sbjct: 60 RFLNQETAERLNCNFLEQNDILVARMPDP---IGRSCLFPLKGENVTVVDVAIIRPSKKH 116 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I +++H+ S ++ IS L++G+ I + D I P+PP AEQ I K+D L+ Sbjct: 117 INYKWLSHWINSPVFHKNISELASGSTRKRISRRNLDKIPFPLPPRAEQDRIVAKVDALM 176 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 AQ + + E+IPQ+LK FRQ VL + RN E ++ E +++ Sbjct: 177 AQHAAIQQAMERIPQLLKDFRQQVLNQSFE-------RNIE-------RVALEDCCHKIQ 222 Query: 237 NGLSSKPN-----ESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLL 289 +G P P + ++R ++ + + ++ + + R + GD+L Sbjct: 223 DGAHHSPKYVSPIREKNMFPYVTSKNIRNDYMKLDTLTYVNEDFHNTIYPRCSPEFGDVL 282 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 T+ S G L + + +LL LI+ K +P Y++ F S + Sbjct: 283 LTKDGAS---TGNVTLNEFDEPISLLSSVCLIKTD-KKKLIPAYLKYFIQSSIGFSEFTG 338 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + T + K + K IK + LP V EQ EIVRRVE LF A IE++ ++++L Sbjct: 339 KM-TGTAIKRVVLKKIKKATIPLPSVPEQQEIVRRVESLFEKATAIEQRYEQLKLQIDSL 397 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG 455 Q+IL KAF+GEL Q + + SA LLE+IK ++ S Sbjct: 398 PQAILHKAFKGELVEQ--------LDSDGSAVELLEQIKNLKSNSN 435 >UniRef50_C5TIE5 Restriction modification system DNA specificity domain protein n=1 Tax=Zymomonas mobilis subsp. mobilis ATCC 10988 RepID=C5TIE5_ZYMMO Length = 419 Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust. 
Identities = 89/309 (28%), Positives = 156/309 (50%), Gaps = 23/309 (7%) Query: 156 NIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN 215 I +PPL EQ+ I K+D+L + + + IP+++++++QA+L A W Sbjct: 131 TINLPPLPEQRRIVAKIDSLTGKSRRARDHLDHIPRLVEKYKQAILSAAFRAD----WPL 186 Query: 216 FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE 275 ++ + +++ E R +ESGV +++S+V G D + L S Sbjct: 187 ISVGETIRAVVAGKNLRCEERPPFE---HESGV----VKVSAVSWGTFDARASKTLPESF 239 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE 335 + +++ GDLL +R N +LE VG ++ + NL DK++R + +D ++ Sbjct: 240 TPPENTRIKAGDLLISRAN-TLELVGAVVIVLECP-SNLFLSDKVLRLDV-EDGDKPWLM 296 Query: 336 IFFSSPSARNAMMNCVKTTS-GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 F SP R A+ + +S +KS + P +++ EIV R+E FA+ + Sbjct: 297 WFLRSPDGRAAIEGAATGNQLSMRNLSQAALKSISMPWPAAEQREEIVSRIESAFAWIEC 356 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 + +A +++L QS+LAKAF+GEL Q A+ P A+ALL++I+AERAA+ Sbjct: 357 LAADAASARKLIDHLDQSMLAKAFKGELVPQDPADEP--------ASALLDRIRAERAAA 408 Query: 455 GGKKASRKK 463 K R+K Sbjct: 409 PKAKRGRRK 417 >UniRef50_B8E4I3 Restriction modification system DNA specificity domain protein n=1 Tax=Shewanella baltica OS223 RepID=B8E4I3_SHEB2 Length = 642 Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust. 
Identities = 118/435 (27%), Positives = 201/435 (46%), Gaps = 36/435 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVP 63 KLPEGWV +T+ +I + Q D P IR +NI +GK + V Sbjct: 2 KLPEGWV---ETTIGNIIDDMQPGFSQKPGKEDGDTTPQIRTHNISPDGKLTLEGIKHVT 58 Query: 64 KNLVKESQKIS-PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSGF 121 + KES++ S + V+ ++ S+ VGK+A E F LR KLI F Sbjct: 59 AS-NKESERYSLTKGDVVFNNTNSEEWVGKTAVFDQEGEFVFSNHITRLRANSKLITPDF 117 Query: 122 IAHFTKSSLYRNKISSLSAGANINN--IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +A + + L+ S A ++ I+ ++ L IP+P L EQ E++ +L QV Sbjct: 118 LAAYLQF-LWSMGFSKTRAKRWVSQAGIEGSTLALFRIPLPSLPEQ----ERIVDVLQQV 172 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHS---VFKKLNFESILTELR 236 I+ + +Q++ N T W +F ++ + + I+ + + Sbjct: 173 G-----------IVAKAKQSIDDHIDNLVRTAYWEHFSEWYTADGLRDPVRISDIVADSQ 221 Query: 237 NGLSSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 G+S +E+G ILR++S+ +G ++ D+++ SE ++ L +GDLLF R N Sbjct: 222 YGVSEAMSETG-KQAILRMNSITTSGWLNLADLKYATLSEKDIKATTLLNGDLLFNRTN- 279 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 S E VG C + + + + Y ++R R+ + LPEYI +S + +MN K Sbjct: 280 SKELVGKCAIWRGAK-EPFSYASYIVRFRMKEGILPEYIWATLNSSYGKYRLMNSAKQAV 338 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 +S D+ V LPP+ Q + + L + +T+ +++ N + + L + Sbjct: 339 SMANVSPTDLGRITVPLPPLALQEKFAK----LINHIETLRQEMLNKQDQYSELQTLVTQ 394 Query: 416 KAFRGELTAQWRAEN 430 +A GE TAQWR EN Sbjct: 395 QALLGEHTAQWRDEN 409 >UniRef50_C9RY89 Restriction modification system DNA specificity domain protein n=2 Tax=Geobacillus RepID=C9RY89_GEOSY Length = 477 Score = 115 bits (288), Expect = 4e-24, Method: Compositional matrix adjust. 
Identities = 108/443 (24%), Positives = 195/443 (44%), Gaps = 41/443 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV V G T +++ Y D +P I+ + +G ++ + Sbjct: 26 EVPGNWVWVRSGHVAKWGSGGTPSRKRLEYYGGD--IPWIKTGELNDGIITGSEETITEE 83 Query: 65 NLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L K S KI P+ IVIAM + +G L + + C V +P + + S ++ Sbjct: 84 GLQKSSAKIFPKGSIVIAMYGATIGRLG-----ILGIDAATNQACAVGQPYEFLDSKYMF 138 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ + R+ + +L G NI +PPL EQK IA+K++ L A++D K Sbjct: 139 YYFFAR--RSDLVALGKGGAQPNISQTIIKDFPFALPPLNEQKRIADKIERLFAKIDEAK 196 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL------------------------TEKWRNFEPQ 219 E++ + +++ R +L A G+L E+W P Sbjct: 197 RLIEEVKESIEQRRAVMLEKAFKGQLGTNDPSEKSILETSDDLSEKDVIPKEQWPYEVPG 256 Query: 220 HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 + + KL +S L L+ G ++ + G LRI+ ++ +VD + + + + L Sbjct: 257 NWTWIKL--KSCLKRLQYGYTATSSTLTEGPKYLRITDIQNDNVDWETVPYCKIDDKLLE 314 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 ++KL GD++ R + G L+ + + ++ LIR + ++ P Y+ + Sbjct: 315 KYKLNKGDIVIARTGAT---TGKSFLIDDMPFCS-VFASYLIRLTMNENLNPYYLWNYLK 370 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 S S + VK Q G + + I +V LPPV EQ I +++ L + ++ V Sbjct: 371 S-SMYWKQITIVKKGIAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLENEKQLV 429 Query: 400 NNALARVNNLTQSILAKAFRGEL 422 +++ L QS+L KAFRGEL Sbjct: 430 LAVEEKLDLLKQSVLQKAFRGEL 452 >UniRef50_B4B315 Restriction modification system DNA specificity domain n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B315_9CHRO Length = 397 Score = 114 bits (285), Expect = 8e-24, Method: Compositional matrix adjust. 
Identities = 90/308 (29%), Positives = 152/308 (49%), Gaps = 15/308 (4%) Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I F+ +K+ + S G N +K F + IP+P L EQ+ I K++ L Sbjct: 103 ILPHFLNWMSKTPTFIELCKVASEGTTNRIRLKEDKFLSMKIPLPKLEEQQRIIAKIEEL 162 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 +A+++ + E R+ + +N ++ + + H KKL I+ + Sbjct: 163 VAKIEEARGLKEA------GIRECEM--LINAEIYNLFTICKNTHWANKKLG--DIVIDD 212 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 G S K ++ VG PILR+ +++ G +D +++++L+ E ++ LQ GD+L R N Sbjct: 213 CYGTSEKTHDYKVGIPILRMGNIQNGILDVSELKYLDIHEKNKDKLILQKGDILVNRTN- 271 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTT 354 S E VG C + + +IR RL K A P I ++ +S R M N K Sbjct: 272 SAELVGKCAVFNLKGEYG--FASYIIRLRLDKAQANPTLIAMYINSSLGRTYMFNERKQM 329 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 +GQ I+ K +K+ ++LPP+ EQ EIV ++ L D +++ +L +N L +IL Sbjct: 330 TGQANINAKKLKALPIILPPLSEQQEIVTYLDNLQTQIDEMKRLRQESLKELNALLPAIL 389 Query: 415 AKAFRGEL 422 KAF+GEL Sbjct: 390 DKAFKGEL 397 Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust. 
Identities = 54/179 (30%), Positives = 89/179 (49%), Gaps = 11/179 (6%) Query: 39 DY---LPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAH 95 DY +P++R NIQNG D ++L ++ + + + I + ++ + S +VGK A Sbjct: 222 DYKVGIPILRMGNIQNGILDVSELKYLDIHEKNKDKLILQKGDILVNRTNSAELVGKCAV 281 Query: 96 QHLPFECSFGAFCGVLRPEKLIFS-GFIAHFTKSSLYR----NKISSLSAGANINNIKPA 150 +L E F ++ LR +K + IA + SSL R N+ ++ ANIN K Sbjct: 282 FNLKGEYGFASYIIRLRLDKAQANPTLIAMYINSSLGRTYMFNERKQMTGQANINAKKLK 341 Query: 151 SFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + +I +PPL+EQ+ I LD L Q+D K ++ + L A+L A G+L Sbjct: 342 ALPII---LPPLSEQQEIVTYLDNLQTQIDEMKRLRQESLKELNALLPAILDKAFKGEL 397 >UniRef50_Q466N9 Type I restriction-modification system specificity subunit n=2 Tax=cellular organisms RepID=Q466N9_METBF Length = 492 Score = 113 bits (282), Expect = 2e-23, Method: Compositional matrix adjust. Identities = 115/458 (25%), Positives = 203/458 (44%), Gaps = 66/458 (14%) Query: 43 LIRANNIQNGKFDTTDLVFVP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF- 100 +R +IQN + + + + N K++ + D+V A + + VGKS F Sbjct: 51 FLRITDIQNNEVNWKSVPYCEIDNTKKQNYLLKDGDLVFARTGAT---VGKSYLLKGDFP 107 Query: 101 ECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP 160 E F ++ +R + I F+ +F +S Y +I+ G N+ L+ +P+ Sbjct: 108 ESVFASYLIRVRLLEEISESFVYNFFQSLTYWKQITEGQVGIGQPNVNGTKLSLLIVPVA 167 Query: 161 PLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP-- 218 PL EQ+ I K++ L +++D+ + + + LK +RQAVL A GKLT+KWR P Sbjct: 168 PLLEQRAIVSKIEQLFSELDNGISNLKLAQEQLKVYRQAVLKKAFEGKLTKKWREENPDV 227 Query: 219 ------------QHSVFKK----------------------LNFESILTELRNGLSSKPN 244 Q S KK ++ + + +G P Sbjct: 228 EDSKYVLNKIKNQISTQKKTKEIQDIQYGEVPYELPFKWNWVSLSDVSISITDGDHQAPP 287 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 ++ G P + IS++ +G +D ++ ++ + E+ + K Q D+L++ GS G+ Sbjct: 288 KADSGVPFIVISNISSGKLDMSETMYVPEKYYENLAAKRKPQPRDILYS-VTGSY---GI 343 Query: 303 CGLLKK------LQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 L+ + +H L+ P I ++ Y+ 
SP V T + Sbjct: 344 PILISENYRFCFQRHIALIRPHMEISSK--------YLYYILKSPFVYKQATK-VATGTA 394 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q + +++ V +PP+ EQ IV+ +E + + IE+ + + L R L QSIL K Sbjct: 395 QLTVPLSGLRTIKVPIPPIAEQQAIVQEIETRLSVCEKIEQDIKDNLERAEALRQSILKK 454 Query: 417 AFRGELTAQWRAENPDLISGEN--SAAALLEKIKAERA 452 AF G+L + E ++ E+ A LLE+IKAE+A Sbjct: 455 AFEGKLLNE--KELAEVRGAEDWEPAEVLLERIKAEKA 490 Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 61/218 (27%), Positives = 107/218 (49%), Gaps = 9/218 (4%) Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 I ++ G + ++ +G LRI+ ++ V+ + + E ++ + L+DGDL+F Sbjct: 30 IADNIQYGYTESSSDEPIGPKFLRITDIQNNEVNWKSVPYCEIDNTKKQNYLLKDGDLVF 89 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 R + VG LLK ++ LIR RL ++ ++ FF S + + Sbjct: 90 ARTGAT---VGKSYLLKG-DFPESVFASYLIRVRLLEEISESFVYNFFQSLTYWKQITEG 145 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + GQ ++G + +V + P+ EQ IV ++EQLF+ D + A ++ Sbjct: 146 -QVGIGQPNVNGTKLSLLIVPVAPLLEQRAIVSKIEQLFSELDNGISNLKLAQEQLKVYR 204 Query: 411 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 Q++L KAF G+LT +WR ENPD+ + +L KIK Sbjct: 205 QAVLKKAFEGKLTKKWREENPDV----EDSKYVLNKIK 238 Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 41/175 (23%), Positives = 84/175 (48%), Gaps = 7/175 (4%) Query: 38 DDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE---SQKISPEDIVIAMSSGSKSVVGKSA 94 D +P I +NI +GK D ++ ++VP+ + +K P DI+ +++ + S Sbjct: 290 DSGVPFIVISNISSGKLDMSETMYVPEKYYENLAAKRKPQPRDILYSVTGSYGIPILISE 349 Query: 95 HQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDL 154 + + F ++RP I S ++ + KS + + ++ G + + Sbjct: 350 N----YRFCFQRHIALIRPHMEISSKYLYYILKSPFVYKQATKVATGTAQLTVPLSGLRT 405 Query: 155 INIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 I +PIPP+AEQ+ I ++++T L+ + + + + + RQ++L A GKL Sbjct: 406 IKVPIPPIAEQQAIVQEIETRLSVCEKIEQDIKDNLERAEALRQSILKKAFEGKL 460 >UniRef50_A7N438 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N438_VIBHB Length = 432 Score = 112 bits (279), Expect = 4e-23, Method: Compositional matrix adjust. Identities = 112/429 (26%), Positives = 195/429 (45%), Gaps = 59/429 (13%) Query: 23 RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAM 82 RGV+YK E + D + +R+NNIQ+G + ++ VP +LV +SQ + DI + M Sbjct: 20 RGVSYKPENLKAAIDDKSVVFLRSNNIQSGTLNFENVQIVPDSLVSDSQILKKGDIAVCM 79 Query: 83 SSGSKSVVGKSAH-QH-LPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSA 140 S+GS+ +VGKS QH + + + GAFC V R + S ++ + +S Y++ I A Sbjct: 80 SNGSRQLVGKSGMLQHEVEYPLTVGAFCSVFRCQNEDDSEYVRYLFQSQAYQHGIDVTLA 139 Query: 141 GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAV 200 G+ INN+K + + I +P P A +K IAE L T+ Q+D+T+A ++ I +Q + Sbjct: 140 GSAINNLKNSDVEAIEVPTAPKALRKKIAEILSTIDNQIDATQALIDKYTAI----KQGM 195 Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS-------SKPNESGV----- 248 + + + + + P +L ++ L L G S+ SG Sbjct: 196 MADLFSRGIDPETKALRPTLEEAPELYHKTPLGMLPKGWDVKTLGDISEKITSGSRDWAK 255 Query: 249 -----GHPILRISSVRAGHVD--QNDIRFLEC-SESELNRHKLQDGDLLFTRYNGSLEFV 300 G +RIS++ HV+ + ++ + SE R +LQ GD+L + L V Sbjct: 256 FYSPEGDLFVRISNLTREHVNFRWDSVKHVNIGGGSEGERTQLQPGDILVS-ITADLGIV 314 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA--MMNCVKTTSGQK 358 GV P+ + RA + ++ + S NA + N + + GQ+ 
Sbjct: 315 GVV-------------PENMGRAYIN-----QHTALIRLSTYGENARFIGNYLSSRCGQE 356 Query: 359 ------------GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 GI+ I S +P KEQ I +++ L ++++ + +L+ Sbjct: 357 QFEKNNDSGAKAGINLPTIASLRCPIPEEKEQLLIASKIDALDEVIADLKREKSKSLSLK 416 Query: 407 NNLTQSILA 415 L Q +L Sbjct: 417 QGLMQDLLT 425 >UniRef50_B0PEE2 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PEE2_9FIRM Length = 388 Score = 111 bits (277), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 115/429 (26%), Positives = 200/429 (46%), Gaps = 59/429 (13%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL----V 67 ++ + V T I G +K + +D LP+IR N+ N P N + Sbjct: 1 MSTLGNVATYINGRAFKPSE----WEDSGLPIIRIQNLTNFS--------APYNYSSREL 48 Query: 68 KESQKISPEDIVIAMSS--GSKSVVGKSA--HQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 +E K++ D++ A S+ G+ G A +QH+ F V P + I ++ Sbjct: 49 EEKYKVTRGDLLFAWSASLGAHIWKGNDAWLNQHI--------FRVV--PSEQIEKKYLY 98 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F + ++ + + G+ + +I F IP+P L EQK I K++ L +++D++ Sbjct: 99 YFLLQVV--AELHAKTHGSGMVHITKGPFMNTPIPVPSLPEQKRIVSKIEELFSKLDASV 156 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 A + + LK +RQAVL A F+P +K+ E I+ + R G S K Sbjct: 157 AELQTAKEKLKVYRQAVLKEA-----------FDPVSK--EKILLEDIIEKPRYGTSKKC 203 Query: 244 NESGVG--HPILRISSV--RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 + + + RI ++ + G +D DI++ S+ EL L + DLL R NGS+ Sbjct: 204 SYAYKNGFKAVYRIPNICYQNGSIDHKDIKYAGFSDDELKNLDLIENDLLIIRSNGSVSL 263 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTK--DALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 VG ++K + + + LIR RL K + L +++ F S +AR + + K+TS Sbjct: 264 VGRSSIVKA-EDCDATFAGYLIRLRLKKPSEVLSKFLHYFLESHAARTYIEHVAKSTS-- 320 Query: 358 KGISGKDIKSQVVLLPP----VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 G++ + L P QA+ V ++E + D I++ ++ +L + L QSI Sbjct: 321 -GVNNINSNEISNLPVPKCDDFDMQAQTVVKIETNLSICDDIQQTIDTSLQQAEALRQSI 379 Query: 414 LAKAFRGEL 422 L +AF GEL Sbjct: 380 
LKQAFEGEL 388 >UniRef50_A3JFC5 Restriction modification system DNA specificity domain n=1 Tax=Marinobacter sp. ELB17 RepID=A3JFC5_9ALTE Length = 527 Score = 110 bits (274), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 109/416 (26%), Positives = 174/416 (41%), Gaps = 79/416 (18%) Query: 107 FCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQK 166 C + + + + F+ + S R KIS+L +G+ I + I +P+PPL EQ Sbjct: 112 ICAIRFGDSRVNAKFMMYLINSPSIRGKISALQSGSTRKRISRGNLATIPLPLPPLNEQH 171 Query: 167 IIAEKLDTLLAQVD-----------------------------------STKARFEQIPQ 191 I K++TL +++D K + E Q Sbjct: 172 RIVAKIETLFSELDKGIESLKTAREQLKVYRQAVLKHAFEGKLTAKWREQNKDKLETPQQ 231 Query: 192 ILKRFRQ---------------AVLGGAVNGKLTEK-----------------WRNFEPQ 219 +L R +Q AV NGK K RNF PQ Sbjct: 232 LLARIQQERQARYQQKLQEWQVAVKMWEENGKKENKPGKPKKLAALKETSENETRNF-PQ 290 Query: 220 HSV-FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL 278 V + + ++ E G S K + +LRI ++ G +D ++++F E E+ Sbjct: 291 LPVGWTYVRLGLLIEEPTYGTSKKCSYDSGQVGVLRIPNISHGAIDSSNLKFASFEEHEV 350 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIF 337 L GDLL R NGS+ VG C L+ + + + L+ LIR R D + P ++ Sbjct: 351 KALALAKGDLLTIRSNGSVSLVGSCALIAE-EDTDFLFAGYLIRLRPNHDLVAPFFLLSV 409 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 +S R + + K+TSG I+ +I++ +V LP + EQ E+++ +E E Sbjct: 410 LTSHLLRRQIESAAKSTSGVNNINTGEIQNLIVPLPSMVEQVELLKFLEISTPNIAVAEY 469 Query: 398 QVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 ++ L + L QSIL KAF G+L Q + P A+ALL +I +ERA Sbjct: 470 EIEVQLKKSEVLRQSILKKAFSGKLVPQDPNDEP--------ASALLARIHSERAG 517 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 55/204 (26%), Positives = 101/204 (49%), Gaps = 9/204 (4%) Query: 252 ILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 +++++ + G RFL S++ ELN L+ GD+L R L G C + + Sbjct: 46 LIQLADIGDGRFLDKSSRFLTRSKARELNCTFLRAGDILVARMPDPL---GRCCIFPLDE 102 Query: 311 HQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 + + R + +++ +SPS R ++ +++ S +K IS ++ + Sbjct: 103 DGRYVTVVDICAIRFGDSRVNAKFMMYLINSPSIR-GKISALQSGSTRKRISRGNLATIP 161 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAE 429 + LPP+ EQ IV ++E LF+ D + + A ++ Q++L AF G+LTA+WR + Sbjct: 162 LPLPPLNEQHRIVAKIETLFSELDKGIESLKTAREQLKVYRQAVLKHAFEGKLTAKWREQ 221 Query: 430 NPDLISGENSAAALLEKIKAERAA 453 N D + + LL +I+ ER A Sbjct: 222 NKDKL---ETPQQLLARIQQERQA 242 Score = 55.5 bits (132), Expect = 5e-06, Method: Compositional matrix adjust. Identities = 58/232 (25%), Positives = 107/232 (46%), Gaps = 14/232 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP- 63 +LP GW + LI TY + +Y + ++R NI +G D+++L F Sbjct: 290 QLPVGWTYV---RLGLLIEEPTYGTSKKCSY-DSGQVGVLRIPNISHGAIDSSNLKFASF 345 Query: 64 -KNLVKESQKISPEDIVIAMSSGSKSVVGKSA---HQHLPFECSFGAFCGVLRP-EKLIF 118 ++ VK + ++ D++ S+GS S+VG A + F F + LRP L+ Sbjct: 346 EEHEVK-ALALAKGDLLTIRSNGSVSLVGSCALIAEEDTDF--LFAGYLIRLRPNHDLVA 402 Query: 119 SGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ S L R +I S + + + +NNI + +P+P + EQ + + L+ Sbjct: 403 PFFLLSVLTSHLLRRQIESAAKSTSGVNNINTGEIQNLIVPLPSMVEQVELLKFLEISTP 462 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 + + E + + RQ++L A +GKL + N EP ++ +++ E Sbjct: 463 NIAVAEYEIEVQLKKSEVLRQSILKKAFSGKLVPQDPNDEPASALLARIHSE 514 >UniRef50_Q0VNH1 Type I restriction-modification system, S subunit n=1 Tax=Alcanivorax borkumensis SK2 RepID=Q0VNH1_ALCBS Length = 391 Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust. 
Identities = 89/325 (27%), Positives = 162/325 (49%), Gaps = 38/325 (11%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAG--ANINNIKPASFDLINIPIPPLAEQKI 167 + R E+ + GF+ H+ S L+ + + AG ++ +PA + I IP+PPL EQK Sbjct: 93 IFRDERF-YPGFLKHYFTSELFHRQFMNTVAGVGGSLVRARPAGVERIEIPLPPLEEQKR 151 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN 227 IA T+L + D+ + + +Q Q+ + F +AV +T P+ KK++ Sbjct: 152 IA----TILDKADAIRRKRQQAIQLAEEFLRAVFLDMFGDPVTN------PKGWKVKKID 201 Query: 228 FESILTELRNGL--SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 L E++ GL S K +E + P LR+++V + +I+ + +++E +R +L+ Sbjct: 202 ---DLCEVQGGLQVSKKRSELSISAPYLRVANVLRNRLYLGEIKEINLTQAEYDRVRLKR 258 Query: 286 GDLLFTRYNGSLEFVGVCGL----LKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSS 340 D+L +G+ +G L + + HQN LIR R+ +K+ P ++ + +S Sbjct: 259 DDVLIVEGHGNPNEIGRSALWTGEIDGMVHQN-----HLIRVRVKSKEIRPRFVNDYINS 313 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE---IVRRVEQLFAYADTIEK 397 P R MM TTSG IS +KS +++PP+ Q + +V + E + + E Sbjct: 314 PGGRVQMMKASNTTSGLNTISTGIVKSTEIIVPPIYLQDKYMSVVSKFEDVLVKSRMHEG 373 Query: 398 QVNNALARVNNLTQSILAKAFRGEL 422 ++++L S++ KAF+G L Sbjct: 374 GIDSSLF-------SLIKKAFKGNL 391 >UniRef50_A4ILD9 Putative type I specificity subunit HsdS n=1 Tax=Geobacillus thermodenitrificans NG80-2 RepID=A4ILD9_GEOTN Length = 509 Score = 108 bits (270), Expect = 4e-22, Method: Compositional matrix adjust. 
Identities = 121/486 (24%), Positives = 219/486 (45%), Gaps = 93/486 (19%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P+ WV ++ G T K+ Y + P ++ G D D V V Sbjct: 27 VPKNWVWTRTGITHEIVTGSTPSKKNNEYYGGN--FPFVKP-----GDLDQKDSVTVASE 79 Query: 66 LV----KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSG 120 + KE ++ P+ + GS VG + EC+ L P +K+I+ Sbjct: 80 YLTDKGKEVSRVIPKHSTLVCCIGSIGKVGFNL-----VECTTNQQINSLIPNKKVIYPK 134 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + +F+ SS+Y+N +S S+ ++ I + + +PPL EQK IAEK+D L A++D Sbjct: 135 YTYYFSLSSVYQNLLSKSSSSTTVSIINKSKMSKLPFALPPLNEQKHIAEKVDRLFAKID 194 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL----------------------TEKWRNF-- 216 K E++ + + R A+L A G+L +E+ R + Sbjct: 195 EAKRLIEEVKESFELRRAAILDKAFRGELTRSWRKKNEHLVSASLMLQEIASERKRKYSD 254 Query: 217 --------------------------EPQHSV-----FKKLNFESILTELRNGLSSKP-N 244 +P+HS+ + F + +T+L +K N Sbjct: 255 LCRLAKINGEKKPRKLYLDEVPVIEEKPRHSLPDTWTITNIGFLAHVTKLAGFEYTKYFN 314 Query: 245 ESGVGH-PILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLLFTRYNGSLEFVG- 301 + G P++R +V+ G +++I+++ S+L R ++ G++L F+G Sbjct: 315 LTETGDVPVIRAQNVQMGEFIESNIKYITKEVSDLLERSQVHGGEVLMV-------FIGA 367 Query: 302 ----VCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSG 356 VC + + + L P+ A++T D L EY+ ++ SP ++ + + +K T+ Sbjct: 368 GTGNVC-MAPRDNRRWHLAPN---VAKITVDEILAEYLNLYLQSPIGQSYIKSKMKATA- 422 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q+ +S + I+ +V +PP++EQ EIVR VE+L +N+ +++N+ QSIL+K Sbjct: 423 QQSLSMETIRDVLVYVPPLEEQYEIVRIVERLLDNLKNEYLILNDIHMKIDNIKQSILSK 482 Query: 417 AFRGEL 422 AFRGEL Sbjct: 483 AFRGEL 488 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 43/142 (30%), Positives = 69/142 (48%), Gaps = 12/142 (8%) Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 Q N L P+K K P+Y +FS S +++ +++ I+ + Sbjct: 119 QQINSLIPNK-------KVIYPKYT-YYFSLSSVYQNLLSKSSSSTTVSIINKSKMSKLP 170 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAE 429 LPP+ EQ I +V++LFA D ++ + +IL KAFRGELT WR + Sbjct: 171 FALPPLNEQKHIAEKVDRLFAKIDEAKRLIEEVKESFELRRAAILDKAFRGELTRSWRKK 230 Query: 430 NPDLISGENSAAALLEKIKAER 451 N L+ SA+ +L++I +ER Sbjct: 231 NEHLV----SASLMLQEIASER 248 >UniRef50_Q73D72 Type I restriction-modification enzyme, S subunit, putative n=1 Tax=Bacillus cereus ATCC 10987 RepID=Q73D72_BACC1 Length = 476 Score = 108 bits (270), Expect = 5e-22, Method: Compositional matrix adjust. Identities = 117/465 (25%), Positives = 200/465 (43%), Gaps = 78/465 (16%) Query: 5 KLPEGWVIAPVSTVTTLIRGVT-------YKKEQAINYLKDDYLPLIRANNIQNGKFDTT 57 ++PE W+ + +I G T Y K+ I+++ L + I GK + T Sbjct: 20 RVPENWIWTWTGAIAEVISGGTPKSKVEEYYKDGTISWITPADLSGYQDMYISKGKRNIT 79 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 +L L K S K+ P + V+ S V +A + L F +F P Sbjct: 80 EL-----GLNKSSAKMLPINTVLLSSRAPIGYVAIAA-KDLCTNQGFKSFA----PSNAY 129 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ + K S Y + S+++G+ + I IP+PP+ EQK ++EK++ LL Sbjct: 130 YPKYLYWYLKFSKYY--MESMASGSTFKELSSNKSKEIPIPLPPINEQKRVSEKVERLLN 187 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN---------------------F 216 +V+ K E+ + + R A+L A +G LT KWR F Sbjct: 188 KVEEAKTLIEEAKETFELRRAAILDKAFSGDLTGKWRKENSFQQNEECISDNELRDSEVF 247 Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPN---ESGVGHPILRISSVRAGHVDQNDIRF--- 270 P +K + + T +NG + K E G I +R G++ +N++R Sbjct: 248 YPIPKTWKWTKLKDVAT-FKNGYAFKSKDFVEQG-------IQLIRMGNLYKNELRLDRN 299 Query: 271 -----LECSESELNRHKLQDGDLLF----TRYNGSLEF-VGVCGLLKKLQHQNLLYPDKL 320 L+ E + ++ ++ GD+L T+Y + V V G + +NLL ++ Sbjct: 300 PVYIPLDFDEKIIEKYTVEKGDILLSLTGTKYKRDYGYAVRVDG-----RDKNLLLNQRI 354 Query: 321 
IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 + L + EYI + S RNA + Q + K ++S ++ +PP E E Sbjct: 355 L--SLKPHMMDEYIYYYLQSSVFRNAFFSFETGGVNQGNVGSKAVESILIPIPPADEAKE 412 Query: 381 IVRRVEQLFAYADTIEKQVNNALA---RVNNLTQSILAKAFRGEL 422 I +++ +L EK+ LA ++ L QS L+KAFRGEL Sbjct: 413 IEKKLARLL----NNEKEALVVLAIEEKLEVLKQSALSKAFRGEL 453 >UniRef50_Q1Z9T4 Type I restriction-modification system, S subunit n=5 Tax=Bacteria RepID=Q1Z9T4_PHOPR Length = 523 Score = 107 bits (267), Expect = 9e-22, Method: Compositional matrix adjust. Identities = 80/238 (33%), Positives = 125/238 (52%), Gaps = 20/238 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG-KFDTTDLVFVP 63 +LP+GW + + + RG + + + DD + I+ + + G K T+ + Sbjct: 3 QLPKGWAENSLGNLVVVERGSSPRPIKNFLTDSDDGVNWIKIGDAKKGQKLLTSTAEKIT 62 Query: 64 KNLVKESQKISPEDIVIA--MSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 K +S+ + D +++ MS G ++G + H G F V R K I S + Sbjct: 63 KEGAMKSRFVDVGDFILSNSMSFGLPYIMGIPGYIHD------GWF--VFRLPKQISSDY 114 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN---IPIPPLAEQKIIAEKLDTLLAQ 178 + SS + ++L+ G + NI S DL+ +P+PPLAEQ I EKLD +LAQ Sbjct: 115 FYYLLSSSYVGAQFNNLAVGGVVKNI---SGDLVKKAILPLPPLAEQTRIVEKLDEVLAQ 171 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 VD+ KAR + IP I+KRFRQ+VL AV+GKLTE+WR+ + K F S +T++R Sbjct: 172 VDTIKARLDGIPAIIKRFRQSVLAAAVSGKLTEEWRDINTAQDIEK---FCSEITDVR 226 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. 
Identities = 72/205 (35%), Positives = 112/205 (54%), Gaps = 16/205 (7%) Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G P++R+ ++R + + +F+ + E L+RHK+ +GD+LF + + G C + Sbjct: 307 GVPVIRMVNIRPFQFLRENRKFVSFEKFEGLSRHKINEGDVLFAKVGAT---TGDCCMYP 363 Query: 308 KLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + +L R + K E++ I + A + N + + Q ++ K IK Sbjct: 364 MNEPIAMLSTTGSCRITVDKQVYNSEFLVIVLN---AYRRIFNSITSQVAQPFLNMKTIK 420 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 S + +P ++EQ EIVR V+Q F++ADTIE QV A ARV++LTQSILAKAFRGEL AQ Sbjct: 421 SVPIPIPALEEQKEIVRLVDQYFSFADTIEAQVKKAQARVDSLTQSILAKAFRGELVAQD 480 Query: 427 RAENPDLISGENSAAALLEKIKAER 451 ++ P A LLE+I R Sbjct: 481 PSDEP--------ADKLLERIAQAR 497 Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 37/107 (34%), Positives = 56/107 (52%), Gaps = 1/107 (0%) Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 RL K +Y SS S A N + K ISG +K ++ LPP+ EQ IV Sbjct: 105 RLPKQISSDYFYYLLSS-SYVGAQFNNLAVGGVVKNISGDLVKKAILPLPPLAEQTRIVE 163 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 +++++ A DTI+ +++ A + QS+LA A G+LT +WR N Sbjct: 164 KLDEVLAQVDTIKARLDGIPAIIKRFRQSVLAAAVSGKLTEEWRDIN 210 >UniRef50_D2TNZ5 Putative type I restriction modification system HsdS component n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TNZ5_CITRO Length = 538 Score = 107 bits (267), Expect = 1e-21, Method: Compositional matrix adjust. 
Identities = 81/330 (24%), Positives = 154/330 (46%), Gaps = 34/330 (10%) Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I +I +F S +++++ + G I +I P I+ PI ++ Q+I+ K+D L Sbjct: 98 FILPQYIYYFLLSKF--DELNNNTRGMGIPHIDPVYLGEIDFPITSVSNQEILYSKIDQL 155 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH--------------- 220 +D + E+ + + A++GKLT+ WR+ Q Sbjct: 156 YNLIDDGFTKTEKALAQISILWSLRITEALSGKLTKNWRDSNSQGKPLPVDIISINNQLE 215 Query: 221 -------SVFKKLNFESILTELRNGLSSK----PNESGVGHPILRISSVRAGHVDQNDIR 269 S ++ + S++ + G S K P E+GV +RI ++ G + ND++ Sbjct: 216 ETLPVLPSDWRYVKLSSVIESISYGTSKKCTYEPQETGV----IRIPNIVNGEICDNDLK 271 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 F +E E +++ L++ D+L R NGSL VG C +K + L+ L+R R+ + Sbjct: 272 FANFTEKEKDKYSLKEDDILIIRSNGSLNLVGACARVKS-KDTGYLFAGYLLRLRINLEL 330 Query: 330 L-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 + P Y++ SP R + K++SG I+ ++I+S ++ + ++EQ IV +E + Sbjct: 331 VNPSYLKYALESPLLRKQIERIAKSSSGVNNINAEEIRSLIIPICSIEEQLVIVNELENI 390 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAF 418 + + Q+ N L + + I+ AF Sbjct: 391 KYNLEAQQVQLRNLLEKSELTKKEIVKDAF 420 Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust. 
Identities = 49/190 (25%), Positives = 88/190 (46%), Gaps = 12/190 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP W +S+V I ++Y + Y + +IR NI NG+ DL F N Sbjct: 221 LPSDWRYVKLSSV---IESISYGTSKKCTYEPQE-TGVIRIPNIVNGEICDNDLKFA--N 274 Query: 66 LV-KESQKIS--PEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRPE-KLIFSG 120 KE K S +DI+I S+GS ++VG A + F + LR +L+ Sbjct: 275 FTEKEKDKYSLKEDDILIIRSNGSLNLVGACARVKSKDTGYLFAGYLLRLRINLELVNPS 334 Query: 121 FIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + +S L R +I ++ + + +NNI + IPI + EQ +I +L+ + + Sbjct: 335 YLKYALESPLLRKQIERIAKSSSGVNNINAEEIRSLIIPICSIEEQLVIVNELENIKYNL 394 Query: 180 DSTKARFEQI 189 ++ + + + Sbjct: 395 EAQQVQLRNL 404 >UniRef50_UPI0001BC509B restriction modification system DNA specificity domain protein n=3 Tax=Fusobacterium RepID=UPI0001BC509B Length = 503 Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust. Identities = 63/224 (28%), Positives = 121/224 (54%), Gaps = 3/224 (1%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKK-EQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 ++P+ WV + ++ ++ RG++Y K ++ I D+ ++R N+ + D V+V Sbjct: 26 EIPDSWVWVRLGSIVSVHRGLSYSKVDEIIRENNDEGYLVLRGGNLTEDGLNFEDNVYVR 85 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + + + ++ D+++ S+GS V+G++ +H + + GAF + RP I S ++ Sbjct: 86 EEIGRRAIELEENDVILVASTGSSKVIGRACIVEHKLEKTTIGAFLMLCRPVTSI-SKWV 144 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + K + YRN IS++S G+NI NIK I PP+ EQ+ I +KLD L + Sbjct: 145 HYIFKGNSYRNYISNISKGSNIKNIKGEYITNYAISFPPIEEQQRIVKKLDFLFEKTKKA 204 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 K +++ + ++ + ++L A G+LT+KWR SV + L Sbjct: 205 KKLLQEVKEEIEMRKISILDKAFRGELTKKWREKNKTGSVLELL 248 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 54/199 (27%), Positives = 96/199 (48%), Gaps = 14/199 (7%) Query: 258 VRAGHVDQNDIRFLE--CSESELNRH--KLQDGDLLFTRYNGSLEFVG-VCGLLKKLQHQ 312 +R G++ ++ + F + E+ R +L++ D++ GS + +G C + KL+ Sbjct: 66 LRGGNLTEDGLNFEDNVYVREEIGRRAIELEENDVILVASTGSSKVIGRACIVEHKLEKT 125 Query: 313 NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL 372 + L R ++ +++ F S RN + N K S K I G+ I + + Sbjct: 126 TIGAFLMLCRPV---TSISKWVHYIFKGNSYRNYISNISKG-SNIKNIKGEYITNYAISF 181 Query: 373 PPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPD 432 PP++EQ IV++++ LF +K + + SIL KAFRGELT +WR +N Sbjct: 182 PPIEEQQRIVKKLDFLFEKTKKAKKLLQEVKEEIEMRKISILDKAFRGELTKKWREKNKT 241 Query: 433 LISGENSAAALLEKIKAER 451 S LL++I+ E+ Sbjct: 242 -----GSVLELLQEIQNEK 255 Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 53/199 (26%), Positives = 91/199 (45%), Gaps = 38/199 (19%) Query: 238 GLSSKPNESGVGHPIL-RISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 GL P++ G +PI+ R +S DI + + L ++ DG+ R Sbjct: 329 GLIGGPSDMGENYPIITRYTSQITKLSSIGDI--IVSIRATLGKNIFSDGEYCLGR---- 382 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 GVCG IR+++ + L + +F+ N++ K +SG Sbjct: 383 ----GVCG----------------IRSKIVNNIL---LRFYFT-----NSIEYLYKISSG 414 Query: 357 QK--GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 +S +DI + LPP++EQ EIVR +E++ + K++ + +++ L +SIL Sbjct: 415 TTFAQVSKEDISNLYFSLPPLEEQQEIVRVLEEVLEKEKKV-KELIDLEEKIDLLEKSIL 473 Query: 415 AKAFRGELTAQWRAENPDL 433 KAFRG+L Q + P L Sbjct: 474 DKAFRGKLGTQDINDEPAL 492 >UniRef50_C4KDM6 Restriction modification system DNA specificity domain protein n=1 Tax=Thauera sp. MZ1T RepID=C4KDM6_THASP Length = 532 Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust. 
Identities = 96/347 (27%), Positives = 157/347 (45%), Gaps = 37/347 (10%) Query: 121 FIAHFTKSSLYRNKISS--LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F H+ +R+ + S G ++ +P+ + + PLAEQ IA++L+ LLA+ Sbjct: 115 FAKHYFSEPSFRSLLCSEVSGVGGSLTRAQPSRVAKYPVLVAPLAEQARIADQLEALLAR 174 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + + R E IP +LKRFR+ VL A++G LTE WR Q + +I G Sbjct: 175 IQACQDRLEAIPALLKRFRKLVLSSALSGDLTEVWR--AEQGVGLDTWSARTIADVAEVG 232 Query: 239 LSSKPNESG------VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 S P S G P + ++ ++D D +++ ++ H+L+ Sbjct: 233 TGSTPLRSNSNFYAETGTPWVTSAATSRPYIDSAD---QYVTKAAIDAHRLR-------V 282 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIR---ARLTKD---ALPEYIEIFFSSPSARNA 346 Y + + G K + L D I A +T D A ++++ S + Sbjct: 283 YRPGTLIIAMYGEGKTRGQVSELRIDATINQACAAITVDEQQANAAFVKLALLS---QYE 339 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + Q ++ ++ + LP EQA+IV RV +LFA+ADTI+ +V A + Sbjct: 340 QTRALAEGGAQPNLNLSKVRGIPLRLPEGPEQAQIVHRVGELFAFADTIDSRVAAATGKT 399 Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 L LAKAFRG+L Q + P A+ LL +I A+RAA Sbjct: 400 RKLPSLTLAKAFRGDLVPQDPTDEP--------ASVLLARIAAQRAA 438 Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust. 
Identities = 39/160 (24%), Positives = 72/160 (45%), Gaps = 19/160 (11%) Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIF 337 + ++ GD+L + N + V G + H+ + + + DA+ P + + + Sbjct: 65 TKQTVRPGDVLVCKINPRINRVWTVG--TRRDHEQIASSEWI---GFRSDAMVPRFAKHY 119 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQV-------VLLPPVKEQAEIVRRVEQLFA 390 FS PS R+ + + V G+ G ++Q VL+ P+ EQA I ++E L A Sbjct: 120 FSEPSFRSLLCSEVS------GVGGSLTRAQPSRVAKYPVLVAPLAEQARIADQLEALLA 173 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 + ++ A + + +L+ A G+LT WRAE Sbjct: 174 RIQACQDRLEAIPALLKRFRKLVLSSALSGDLTEVWRAEQ 213 >UniRef50_A7VYZ3 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VYZ3_9CLOT Length = 444 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 111/451 (24%), Positives = 200/451 (44%), Gaps = 45/451 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 K+P+ W S + LI G K N L ++ A+N++N VF + Sbjct: 29 KVPKNWCWVRFSKIINLISGRDAKLTDC-NSLGIGIPYILGASNLENN-------VFTIE 80 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 ++ Q IS ++ V+ G+ +GK Q + + +R +F F Sbjct: 81 RWIENPQVISLKNDVLLSVKGT---IGKVYLQKEE-KVNISRQIMAIRTSSTLFPRFTYW 136 Query: 125 FTK--SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 S +R + L G + +I +P PPL EQ+ I +++++L A++D Sbjct: 137 LVNNISDSFRQAGNGLIPGISREDILQKE-----VPFPPLPEQQRIVDRIESLFAKLDEA 191 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV----FKKLNFESILTELRNG 238 K + ++ + + A+L A G+LT +WR +H + ++K F IL ++R+G Sbjct: 192 KQKTQEALNSYETRKAAILHKAFTGELTARWRK---EHGLGMESWEKYKFNDIL-DVRDG 247 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGS 296 P G P++ +++ G + D++F+ + + R K+ GD+LF Sbjct: 248 THDSPTYFDQGFPLITSKNLKDGKITDKDLKFISKEDYDKINERSKVDIGDILFA----- 302 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 +G G ++ Q + + A P +++ F S + M K ++ Sbjct: 303 
--MIGTIGNPVVVETQPKFAIKNVALFKNIGKASPYFVKYFLESKKVIDRMEKDAKGST- 359 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 QK +S +++ +LLP KEQ EIVR ++ L A ++ L +++ + +SILA+ Sbjct: 360 QKFVSLGYLRAFNILLPKSKEQTEIVRILDDLLAKEQQAKEAAEAVLDQIDLMKKSILAR 419 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKI 447 AFRGEL NP E SA L++ I Sbjct: 420 AFRGELGTN----NP----AEESAVELVKNI 442 >UniRef50_A6TLK6 Restriction modification system DNA specificity domain n=2 Tax=Clostridiaceae RepID=A6TLK6_ALKMQ Length = 467 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 116/461 (25%), Positives = 215/461 (46%), Gaps = 49/461 (10%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN--------IQNGKFDTT 57 +PE WV + VTT+I G T + I Y ++ +P I + I +GK + T Sbjct: 28 VPENWVWTRLGNVTTIIGGGT-PPSRVIEYYENGSIPWISPVDLSGYTDIYISHGKKNIT 86 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 +L L K S ++ PE+ V+ S V + ++ C+ F L P Sbjct: 87 EL-----GLKKSSARLLPENTVLLSSRAPIGYVAIADNEL----CTNQGFKSFL-PSPCY 136 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ + KSS + + + ++G + ++ P+PPLAEQ+ I +++++L Sbjct: 137 LPKYLYFYLKSS--KKLLEAYASGTTFLELSGRKAAIVEFPLPPLAEQQRIVDRIESLFE 194 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV----FKKLNFESILT 233 +++ KA + + + A+L A +G+LTEKWR ++ V +KK + + ++ Sbjct: 195 KLNQAKALIQDALDSFENRKAAILHKAFSGELTEKWRE---ENGVGMGSWKKKSIKEVV- 250 Query: 234 ELRNGLS-SKPNESGVGHPILRISSVRAGHVD--QNDIRFLE--CSESELNRHKLQDGDL 288 + R G + N S GH ++R+ ++ G +D +N + S + R + +GD+ Sbjct: 251 KFRAGYAFDSKNFSSTGHQVIRMGNLYNGVLDLTRNPVYISPDLIDNSIIKRFSINEGDI 310 Query: 289 LFTRYNGSLEF-VGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNA 346 L T + G L+K + +NLL +++ LT +++ Y+ + S R+ Sbjct: 311 LLTLTGTKYKRDYGYAVLIK--ESENLLLNQRIL--SLTPESIETNYLLYYLQSDFFRDV 366 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + Q +S K ++ + + EQ EIVR ++ +F D Q+ + + + Sbjct: 367 
FFSNETGGVNQGNVSSKFVEKIEIPIFSSLEQKEIVRILDYIFE-KDKNANQLCDLIDNI 425 Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 + + +SILA+AFRGEL NP+ E SA LL+ I Sbjct: 426 DLMKKSILARAFRGELGTN----NPE----EESAMELLKDI 458 >UniRef50_Q4HNM9 Type I restriction-modification system S subunit, putative n=1 Tax=Campylobacter upsaliensis RM3195 RepID=Q4HNM9_CAMUP Length = 544 Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust. Identities = 66/226 (29%), Positives = 113/226 (50%), Gaps = 8/226 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT---TDLVFV 62 +P W + + +I G +Y K+ L D+ + ++R NI + D V + Sbjct: 102 IPNSWAWVKLGDICEIISGTSYSKDD----LSDEGIRILRGGNINKNSHNIDLFADDVII 157 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGF 121 ++L + ++I DI++ S+GSK ++GKSA + E GAF ++R K + + Sbjct: 158 KQDLTNKEKQILKNDILMIASTGSKEIIGKSAFSDVALENTQIGAFLRIIRISKEQNAKY 217 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I H S ++ I S + G NI NIK + IP+PPL EQ+ I +KLD L+ + Sbjct: 218 IFHNLISQIFATHIKSCAGGTNILNIKNEYIENFLIPLPPLCEQQEIVKKLDLLVTLAND 277 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN 227 E + +I KR +++L A+ G L++ +R P F ++N Sbjct: 278 FAITKENLKRIEKRIEKSLLKLALEGSLSKLYRRSSPTLCAFNEIN 323 >UniRef50_C6IKX2 Type I restriction-modification system n=2 Tax=Bacteroidales RepID=C6IKX2_9BACE Length = 478 Score = 105 bits (262), Expect = 3e-21, Method: Compositional matrix adjust. 
Identities = 59/186 (31%), Positives = 105/186 (56%), Gaps = 8/186 (4%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDLVFVP 63 KLP+GW + V ++I GV+Y K ++D + ++R NIQNGK D D VF+ Sbjct: 297 KLPQGWYSVTANDVCSIIGGVSYNKAD----IQDTGIRVLRGGNIQNGKVIDCFDDVFIS 352 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKS--AHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + ++ DI++ S+GS++++GK+ A + +P + GAF ++RP++ S + Sbjct: 353 LSYQNNDNQVQRGDIIVVASTGSQTLIGKTGFADRDIP-KTQIGAFLRIVRPKQKTLSPY 411 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I ++ Y++ I +++ G+NINN+K A I +PPL EQ+ I +K++ L + +D Sbjct: 412 IRLIFQTDAYKDYIRNVAKGSNINNVKNAHLQNFQICLPPLEEQQRIVQKIEELFSSLDD 471 Query: 182 TKARFE 187 E Sbjct: 472 ILTALE 477 Score = 84.3 bits (207), Expect = 9e-15, Method: Compositional matrix adjust. Identities = 95/421 (22%), Positives = 176/421 (41%), Gaps = 48/421 (11%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P+ WV + V T G T + Y + +P ++ ++ +G + Sbjct: 70 EVPDNWVWMTLGEVGTWQSGGTPSRSNKTYYGGN--IPWLKTGDLNDGLISDIPESITEE 127 Query: 65 NLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + S KI+P ++IAM + +G L F + C I ++ Sbjct: 128 AVANSSAKINPAGSVLIAMYGATIGKLG-----ILTFPATTNQACCACIEFNAITQLYLF 182 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F S RN + G NI IP+PPL+EQ+ I +++ A +D + Sbjct: 183 YFLLSQ--RNGFIAKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVE 240 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS--- 240 + +K+ + +L A++GKL + N EP + K++N + T NG S Sbjct: 241 QGKADLQNTIKQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPD--FTPCDNGHSRKL 298 Query: 241 ---------------------SKPNESGVGHPILRISSVRAGHV-DQNDIRFLECSESEL 278 +K + G +LR +++ G V D D F+ S Sbjct: 299 PQGWYSVTANDVCSIIGGVSYNKADIQDTGIRVLRGGNIQNGKVIDCFDDVFISLSYQN- 357 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGL----LKKLQHQNLLYPDKLIRARLTKDALPEYI 334 N +++Q GD++ GS +G G + K Q L +++R + + L YI Sbjct: 358 NDNQVQRGDIIVVASTGSQTLIGKTGFADRDIPKTQIGAFL---RIVRPK--QKTLSPYI 412 
Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 + F + + ++ + N K S + +++ + LPP++EQ IV+++E+LF+ D Sbjct: 413 RLIFQTDAYKDYIRNVAK-GSNINNVKNAHLQNFQICLPPLEEQQRIVQKIEELFSSLDD 471 Query: 395 I 395 I Sbjct: 472 I 472 >UniRef50_D0WYM6 Putative uncharacterized protein n=1 Tax=Vibrio alginolyticus 40B RepID=D0WYM6_VIBAL Length = 371 Score = 104 bits (260), Expect = 5e-21, Method: Compositional matrix adjust. Identities = 84/275 (30%), Positives = 140/275 (50%), Gaps = 30/275 (10%) Query: 155 INIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 + IP+PPL QK IAE +L + D + +Q+ Q L Q+V +T Sbjct: 118 VQIPLPPLETQKQIAE----VLEKADQLRKDCQQMEQELNSLAQSVFIDMFGDPVTN--- 170 Query: 215 NFEPQHSVFKKLNFESILTELRNGL--SSKPNESGVGHPILRISSVRAGHVDQNDIRFLE 272 P+ K L S L E++ GL +SK + + P LR+++V H++ ++++ + Sbjct: 171 ---PKGWDLKPL---SSLGEVKGGLQVTSKRAANPISVPYLRVANVYRDHLELDEVKEIR 224 Query: 273 CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGL----LKKLQHQNLLYPDKLIRARLTKD 328 +E+EL R L+ GD+LF +G+ VG + + + HQN LIR R D Sbjct: 225 VTENELERVLLEKGDVLFVEGHGNANEVGRTAVWNDEVAQCVHQN-----HLIRFRPGAD 279 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 PEY+ F +S S + ++ KTTSG +S +IKS VL+PP+ EQ + + Sbjct: 280 VRPEYVSAFVNSASGKRQLLKMSKTTSGLNTLSTSNIKSIQVLVPPLLEQDDFLA----- 334 Query: 389 FAYADTIEKQVNNALA-RVNNLTQSILAKAFRGEL 422 F + ++ VN+ L+ ++ +++ KAF+GEL Sbjct: 335 FLASCKAQQVVNDQLSVELDQNFNALMQKAFKGEL 369 >UniRef50_Q3JBU1 Restriction modification system DNA specificity domain n=2 Tax=Nitrosococcus oceani RepID=Q3JBU1_NITOC Length = 547 Score = 104 bits (260), Expect = 6e-21, Method: Compositional matrix adjust. 
Identities = 67/228 (29%), Positives = 116/228 (50%), Gaps = 13/228 (5%) Query: 226 LNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 + + +NGL+ + SG P++R++ ++ VD +D+R ++ +E+ +++L Sbjct: 288 IQLRELFESTQNGLAKRQGTSGKPIPVIRLADIKNQEVDSSDLRSIKLDATEIQKYELSR 347 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLL-YPDKLIRARLTKD-ALPEYIEIFFSSPSA 343 DLL R NGS VG L K H N++ Y D IR R + LP YI++ F + + Sbjct: 348 NDLLCIRVNGSPNLVGRMILFK---HDNVMAYCDHFIRFRFPQGIVLPSYIQMLFDTQTV 404 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 R + +++GQ +S I + + + EQ IV R+E+ ++ ++ Sbjct: 405 RRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVSRLEEQLTSISAVKVEIEENF 464 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 R+ +L QSIL KAF G+L Q + P A+ LLE+I+AE+ Sbjct: 465 QRLKSLRQSILKKAFSGQLVPQDPKDEP--------ASKLLERIRAEK 504 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 38/96 (39%), Positives = 56/96 (58%), Gaps = 3/96 (3%) Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 + IS KDI++ + LPP EQ IV ++E+LF+ D + + A ++ Q++L A Sbjct: 143 QNISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDKGIESLKTAREQLKVYRQAVLKHA 202 Query: 418 FRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 F G+LTAQWR EN D + S LL +I+ ER A Sbjct: 203 FEGKLTAQWREENKDKL---ESPEQLLARIQQEREA 235 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 50/198 (25%), Positives = 94/198 (47%), Gaps = 15/198 (7%) Query: 41 LPLIRANNIQNGKFDTTDLVFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKS---AHQ 96 +P+IR +I+N + D++DL + + + + ++S D++ +GS ++VG+ H Sbjct: 312 IPVIRLADIKNQEVDSSDLRSIKLDATEIQKYELSRNDLLCIRVNGSPNLVGRMILFKHD 371 Query: 97 HLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR-----NKISSLSAGANINNIKPAS 151 ++ C F P+ ++ +I + R NK+SS A N + + Sbjct: 372 NVMAYCDH--FIRFRFPQGIVLPSYIQMLFDTQTVRRYIELNKVSS----AGQNTVSQTT 425 Query: 152 FDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE 211 + IP L EQKII +L+ L + + K E+ Q LK RQ++L A +G+L Sbjct: 426 ISALAIPYCSLMEQKIIVSRLEEQLTSISAVKVEIEENFQRLKSLRQSILKKAFSGQLVP 485 Query: 212 KWRNFEPQHSVFKKLNFE 229 + EP + +++ E Sbjct: 486 QDPKDEPASKLLERIRAE 503 Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 53/214 (24%), Positives = 91/214 (42%), Gaps = 13/214 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLK--DDYLPLIRANNIQNGKFDTTDLVFVPK 64 P GWV + + G ++ A K D +PLIR + + + + V++P Sbjct: 6 PTGWVFCRFGDIARIRNGYAFRSS-AFKKTKTHDCDVPLIRQSQLIGTAVNIGEAVYLPA 64 Query: 65 NLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR--PEKLIFSG 120 ++ + I+ DI+I MS +GK F G + E + S Sbjct: 65 EYLERFAQYVINKGDILIGMSGA----IGKVCRYKNGFPALQNQRTGKIEVFDESQMDSR 120 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F + S ++ + G + NI + + + +PP EQ+ I K++ L +++D Sbjct: 121 FFGLYLSS--IEGELIRQAKGMAVQNISAKDIEALPLGLPPYNEQQRIVAKIEELFSELD 178 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 + + LK +RQAVL A GKLT +WR Sbjct: 179 KGIESLKTAREQLKVYRQAVLKHAFEGKLTAQWR 212 >UniRef50_Q4C702 Restriction modification system DNA specificity domain n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C702_CROWT Length = 408 Score = 103 bits (258), Expect = 1e-20, Method: Compositional matrix adjust. 
Identities = 104/427 (24%), Positives = 187/427 (43%), Gaps = 38/427 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+ W + V + G Y+ Y P+I + N++ D +++ ++ + Sbjct: 10 LPQYWKWSKCQEVIDVRDGT----HDTPKYVSSGY-PVITSKNLKTSGIDFSNVSYISEA 64 Query: 66 LVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 KE K+ DI++AM +G + E S + I+ + Sbjct: 65 DHKEISKRSKVDKGDILLAMIG----TIGNPVIVDIEKEFSIKNVALFKLSKSNIYPEYF 120 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + SS+ ++ G + + IP+PPL EQK IA+ LD Sbjct: 121 KYLLDSSIISRQLDFEQRGGTQKFVSLKVLRNLLIPLPPLEEQKRIAKILDKADEIRRKR 180 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 K ++L+ + G V P+ K L S + EL+ G +SK Sbjct: 181 KESIRLTDELLRSTFLDMFGDPV----------INPKGWEVKTLG--SQIKELKYGTNSK 228 Query: 243 PNESGVGHPI--LRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 +E + I LRI ++ + ND+++ E+++ L++GDLLF R NG+ +++ Sbjct: 229 CSELQKNNNIAVLRIPNIDNEKISWNDLKYTNLDSKEISKLLLKNGDLLFVRSNGNPDYI 288 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTK--DALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 G C + ++ ++ +Y LIR RL D P +I + P+ R+ ++ +TT+G Sbjct: 289 GRCAIFEEESNRKAVYASYLIRGRLKSICDFHPAFIRDIIAFPTFRSFLIREARTTAGNY 348 Query: 359 GISGKDIKSQVVLLPPVKEQAE---IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 I+ +++ S ++ PP +Q E I ++ + F + KQ +L NL S+L Sbjct: 349 NINIQELSSLKLICPPQDKQEEYLDITTKINRSF-----LNKQ--KSLQESENLFNSLLQ 401 Query: 416 KAFRGEL 422 KAF+GEL Sbjct: 402 KAFKGEL 408 Score = 43.9 bits (102), Expect = 0.014, Method: Compositional matrix adjust. 
Identities = 46/225 (20%), Positives = 105/225 (46%), Gaps = 22/225 (9%) Query: 232 LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELN-RHKLQDGDLL 289 + ++R+G P G+P++ +++ +D +++ ++ E E++ R K+ GD+L Sbjct: 22 VIDVRDGTHDTPKYVSSGYPVITSKNLKTSGIDFSNVSYISEADHKEISKRSKVDKGDIL 81 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMM 348 G++ + + K+ +N+ +L+K + PEY + S S + + Sbjct: 82 LAMI-GTIGNPVIVDIEKEFSIKNVAL------FKLSKSNIYPEYFKYLLDS-SIISRQL 133 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + + QK +S K +++ ++ LPP++EQ +R+ ++ AD I ++ ++ + Sbjct: 134 DFEQRGGTQKFVSLKVLRNLLIPLPPLEEQ----KRIAKILDKADEIRRKRKESIRLTDE 189 Query: 409 LTQSILAKAF-------RGELTAQWRAENPDLISGENSAAALLEK 446 L +S F +G ++ +L G NS + L+K Sbjct: 190 LLRSTFLDMFGDPVINPKGWEVKTLGSQIKELKYGTNSKCSELQK 234 >UniRef50_UPI0001855288 conserved hypothetical protein n=1 Tax=Francisella novicida FTG RepID=UPI0001855288 Length = 414 Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 109/428 (25%), Positives = 190/428 (44%), Gaps = 44/428 (10%) Query: 5 KLPEGWVIAPVSTVTTLIRGV-TYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 KLP GW + V + G K + I Y PL+ + N++N D T F+ Sbjct: 21 KLPAGWEWKKLGEVFDVKDGTHDSPKYKEIGY------PLVTSKNLKNNSLDLTSCKFIS 74 Query: 64 KN-LVKESQ--KISPEDIVIAM--SSGSKSVVGKSAHQHLPFECSFG-AFCGVLRPEKLI 117 + +K +Q K+ D++ AM + GS ++V FE F + +P Sbjct: 75 NDDFIKINQRSKVDKGDLLFAMIGTIGSPTIVD--------FEPDFAIKNVALFKPSNTY 126 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ S L K+ + GA + P+PPLAEQK I KLD+L Sbjct: 127 LIELLKYWLSSHLTTQKMLEEAKGATQKFVGLTYLRNFPAPLPPLAEQKRIVAKLDSL-- 184 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGG--AVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 FE+I + ++ +Q + + L + ++ E ++S FK L + + + Sbjct: 185 --------FEKIDKAIELHQQNITNANTLMASALDKTFKKLEREYS-FKIL--DCLSENI 233 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 R G + K E G +RI+ + +++ +++ ++L+R+KL GD+L R Sbjct: 234 RYGYTDKAKEKGNAR-FIRITDINDQGKFKDESVYVDIKNTDLDRYKLLVGDILVARSGA 292 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTT 354 + V + L + ++ LIR RL D LP +I F S + N ++ +K Sbjct: 293 TAGKVALFTL-----DEFSVFASYLIRIRLQIDKVLPSFIFYFCYSSNYWN-QLDQIKIG 346 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 Q ++ ++K+ + LPP+ Q + V ++ + D I++ L + L SIL Sbjct: 347 GAQPNVNATNLKNIKIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASIL 406 Query: 415 AKAFRGEL 422 KAFRGEL Sbjct: 407 DKAFRGEL 414 >UniRef50_UPI0001B4DA32 restriction endonuclease S subunits-like protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B4DA32 Length = 416 Score = 102 bits (255), Expect = 2e-20, Method: Compositional matrix adjust. 
Identities = 85/342 (24%), Positives = 151/342 (44%), Gaps = 44/342 (12%) Query: 155 INIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE--- 211 I +P+P LAEQ+ I L+ +++++S + + ++R+ A G E Sbjct: 39 IELPVPSLAEQRRIVAALEEQISKIESGERGLTNAARRSGQYRRLAADLATKGGFAEPLT 98 Query: 212 --------------------KWRNFEPQH---------SVFKKLNFESILTELRNGLSSK 242 K R +P + + ++ + I + G S+K Sbjct: 99 GDGTGPELFESIRSARASRVKTRRLKPATLSGPVPKVPAHWTVVSLDEITELIEYGSSTK 158 Query: 243 PNESGV--GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 +ES G P+LR+ +++ G VD ++++ + R++LQ+GDLLF R N S E V Sbjct: 159 TSESAEVGGVPVLRMGNIKDGKVDPRVLKYISADHPDAVRYRLQEGDLLFNRTN-SFELV 217 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + + + + LIR R +++ + +S R + + GQ + Sbjct: 218 GKSAVYRD-KFGPMAFASYLIRCRFLPGVDTDWVNLVINSSIGRRYVRSVATQQVGQANV 276 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +G + + + LPP EQ I+ VE A A +E + A+ L +++L +AF G Sbjct: 277 NGTKLAAMPIPLPPEGEQRRILDVVETHQAAALRLESGIRQQGAKATRLRRALLTQAFAG 336 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 L Q A+ P A LL +I+AER A+G K R+ Sbjct: 337 RLVTQDPADEP--------AEILLARIRAEREAAGVTKTRRR 370 Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 34/124 (27%), Positives = 61/124 (49%), Gaps = 8/124 (6%) Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +PE++ FS R + VKTT+GQ GISG ++K + +P + EQ IV +E+ Sbjct: 1 MPEFVAYAFSWEGTRARVREYVKTTAGQAGISGGELKKIELPVPSLAEQRRIVAALEEQI 60 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKA 449 + ++ E+ + NA R + A +G + ++G+ + L E I++ Sbjct: 61 SKIESGERGLTNAARRSGQYRRLAADLATKGGFA--------EPLTGDGTGPELFESIRS 112 Query: 450 ERAA 453 RA+ Sbjct: 113 ARAS 116 Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 55/231 (23%), Positives = 103/231 (44%), Gaps = 12/231 (5%) Query: 5 KLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 K+P W + + +T LI G + K ++ +P++R NI++GK D L ++ Sbjct: 134 KVPAHWTVVSLDEITELIEYGSSTKTSESAEV---GGVPVLRMGNIKDGKVDPRVLKYIS 190 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFI 122 + + E ++ + S +VGKSA F +F ++ R + + ++ Sbjct: 191 ADHPDAVRYRLQEGDLLFNRTNSFELVGKSAVYRDKFGPMAFASYLIRCRFLPGVDTDWV 250 Query: 123 AHFTKSSLYRNKISSLS----AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 SS+ R + S++ AN+N K A+ + IP+PP EQ+ I + ++T A Sbjct: 251 NLVINSSIGRRYVRSVATQQVGQANVNGTKLAA---MPIPLPPEGEQRRILDVVETHQAA 307 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 ++ Q R R+A+L A G+L + EP + ++ E Sbjct: 308 ALRLESGIRQQGAKATRLRRALLTQAFAGRLVTQDPADEPAEILLARIRAE 358 >UniRef50_A3PYN5 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium sp. JLS RepID=A3PYN5_MYCSJ Length = 451 Score = 101 bits (251), Expect = 6e-20, Method: Compositional matrix adjust. 
Identities = 105/444 (23%), Positives = 183/444 (41%), Gaps = 42/444 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKE-------QAINYLKDDYLPLIRANNIQNGKFDT 56 G++P GW ++P+ V T+ K Q NY + N +G D Sbjct: 18 GRVPSGWAVSPLKNVATVFPSSVDKHSHDNEIPVQLCNYTD------VYKNERISGALDF 71 Query: 57 TDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAH--QHLPFECSFGAFCGVLRPE 114 P+ + K + K D +I S + +G SA+ + LP + G V+RP Sbjct: 72 MKATATPEEIKKFTLKQG--DTIITKDSETADDIGISAYVEETLP-DVLCGYHLSVVRPL 128 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + F+ S + + + G + + D +NIP+PP EQ IA+ L+ Sbjct: 129 PGLDGRFVKRLFDSHYLKASMEVSANGLTRVGLGQYAIDNLNIPLPPPDEQLQIADFLEA 188 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL-------- 226 A++D+ A+ E + L+ R A + AV L +P +S Sbjct: 189 ETAKIDALIAKQEHLIATLREDRTATITHAVTKGLDPTVDMVQPHNSELPACPKHWTLLI 248 Query: 227 ---NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL 283 + T L G S P E+ V P LR+++V+ V+ ++++ + SEL R+ L Sbjct: 249 SLKRLAEVQTGLTLGKSVDPAEA-VDVPYLRVANVQTSGVNLDEVKTVAVHRSELKRYLL 307 Query: 284 QDGDLLFTRYNGSLEFVG----VCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFF 338 +DGD+L T G ++ +G G + HQN ++ A DAL +++ Sbjct: 308 RDGDVLMTE-GGDIDKLGRGCVWSGEIAPCIHQNHVF------AVRCSDALSGDFLVYLL 360 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 + ARN K T+ + + + LPP EQ EIV + + A D + + Sbjct: 361 DTAVARNYFFMTAKKTTNLASTNSTTLGAFTFSLPPRAEQDEIVDHLNERCAGLDALIAK 420 Query: 399 VNNALARVNNLTQSILAKAFRGEL 422 N + + +++ A G++ Sbjct: 421 ANAVITVLREYRAALITDAVTGKI 444 >UniRef50_Q0W5N3 Type I restriction modification system, specificity subunit n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W5N3_UNCMA Length = 449 Score = 101 bits (251), Expect = 7e-20, Method: Compositional matrix adjust. 
Identities = 101/393 (25%), Positives = 177/393 (45%), Gaps = 23/393 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL-VFV 62 G++PE W I + + + +K+ D Y I +++ N + F Sbjct: 29 GRIPEEWSIVSIKNIVEKTEQIDPQKQ------PDKYFKYIDVSSVSNESLKVVSVNEFK 82 Query: 63 PKNLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG- 120 N +++I +DI+ A + V CS AFC VLR K I Sbjct: 83 GINAPSRARRIVRTDDIIFATIRPNLKRVAIICDDLEGQLCS-TAFC-VLRCMKNIAEPY 140 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE---KLDTLLA 177 F+ + + K+ L G+ + I +PP++EQ+ IA LD+L+ Sbjct: 141 FVFQTVTTDRFIGKLCDLQCGSGYPAVTDNDLLDQQILLPPISEQRKIAAILGTLDSLIE 200 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + D AR Q+ + L ++ + G N +L + P+H +K + F ++ +N Sbjct: 201 ETDRVVARTGQLKKGL--IQEFLTEGMGNVELEDTALGMIPKH--WKCVPFATLSLTYKN 256 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 G+ G G+P +R+ ++ G V+ + L +++EL ++L +GDLL R N S Sbjct: 257 GIYKHDKYYGSGYPCIRMYNIADGTVNTINSPLLNVTDAELKEYELAEGDLLINRVN-SR 315 Query: 298 EFVGVCGLLKK-LQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTS 355 + VG G++ L H + + K IR RL + LPE++ +F S RN + VK+ Sbjct: 316 DLVGKAGIVPAGLGH--VTFESKNIRVRLNRSMILPEFMGLFIQSSMYRNQVNKFVKSAI 373 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 Q I+ D+ + +V LPP EQ +I + ++ Sbjct: 374 AQSTINQDDLDNILVPLPPKDEQEKIASVIREI 406 Score = 54.3 bits (129), Expect = 9e-06, Method: Compositional matrix adjust. 
Identities = 56/213 (26%), Positives = 99/213 (46%), Gaps = 17/213 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD--LVF 61 G +P+ W P +T++ + YK ++ Y Y P IR NI +G +T + L+ Sbjct: 236 GMIPKHWKCVPFATLSLTYKNGIYKHDK---YYGSGY-PCIRMYNIADGTVNTINSPLLN 291 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSA-----HQHLPFECSFGAFCGVLRPEKL 116 V +KE + ++ D++I + S+ +VGK+ H+ FE V + Sbjct: 292 VTDAELKEYE-LAEGDLLINRVN-SRDLVGKAGIVPAGLGHVTFE---SKNIRVRLNRSM 346 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANI-NNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I F+ F +SS+YRN+++ A + I D I +P+PP EQ+ IA + + Sbjct: 347 ILPEFMGLFIQSSMYRNQVNKFVKSAIAQSTINQDDLDNILVPLPPKDEQEKIASVIREI 406 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGK 208 +++ E+I + K Q +L G + K Sbjct: 407 NSKITWEIRYRERIELVKKALMQDLLTGRIRVK 439 >UniRef50_A9I6S0 Type I restriction-modification system, S subunit n=3 Tax=Bacteria RepID=A9I6S0_BORPD Length = 797 Score = 100 bits (250), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 115/498 (23%), Positives = 202/498 (40%), Gaps = 99/498 (19%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW A + +T +IRG+T+ + + +R N+Q K + +DL+++ + Sbjct: 87 LPQGWEWARLGEITDIIRGITFPASEKTKEPASGRIACLRTANVQK-KIEWSDLLYIDRT 145 Query: 66 LV-KESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPF-ECSFGAFCGVLRPEKLIFSGFI 122 + K SQ + +DIV++M++ S+ +VGK A +P E +FG F GVLR K + ++ Sbjct: 146 FMSKNSQLVRQDDIVMSMAN-SRELVGKVAVVSEMPVNEATFGGFLGVLRTHK-VAPLYV 203 Query: 123 AHFTKSSLYRNK-ISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 H +S R+ I + S NI NI + +P+PP++EQ I K+D L+A+ D Sbjct: 204 LHLLNTSYARSSLIDAASQTTNIANISLGKLNPFLVPVPPISEQHRIVAKIDELMARCDE 263 Query: 182 -TKARFEQ-----------IPQILK-----------------------------RFRQAV 200 K R Q I Q+L R+A+ Sbjct: 264 LEKLRTAQQGARLTVHAAAIKQLLNVAEPGQHQRAQTFLAEHFGELYTIKGNVAELRKAI 323 Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLN--------------------------------- 227 L AV GKL + N +P + K++ Sbjct: 324 LQLAVMGKLVPQDPNDQPASELLKEIEAEKQRLVQEGKIKKTKPLPPVTEEEKPYALPQG 383 Query: 228 
-----FESILTELRNG----LSSKPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESE 277 F + TE+ G + K + G P++ S + G + + + E + Sbjct: 384 WEWVRFGDLTTEISTGPFGSMIHKSDYIVDGVPLVNPSHMVDGKIFHDPSVTVSEIMAKK 443 Query: 278 LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIF 337 L+ H+L D++ R G + G C ++ + L R +YI Sbjct: 444 LDSHRLNTNDIVMAR-RGEM---GRCAIVTA-ESDGFLCGTGSFVLRFVDRIYRQYILTI 498 Query: 338 FSSPSARNAM-MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 F + R + N V TT ++ + V LPP EQ IV ++++L D ++ Sbjct: 499 FKTEITREFLGGNSVGTT--MTNLNHGILNKMPVSLPPHPEQTRIVTKIDELMVMCDALD 556 Query: 397 KQVNNALARVNNLTQSIL 414 +Q+ ++ L +++ Sbjct: 557 QQIEATSSKRTELLNALI 574 >UniRef50_Q5LPW5 Type I restriction-modification system, S subunit n=1 Tax=Ruegeria pomeroyi RepID=Q5LPW5_SILPO Length = 434 Score = 100 bits (248), Expect = 1e-19, Method: Compositional matrix adjust. Identities = 57/170 (33%), Positives = 91/170 (53%), Gaps = 2/170 (1%) Query: 22 IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIA 81 IRGV+Y+ E D L+R+ NIQ+G+ D T + VP LVK +Q + D+V+ Sbjct: 16 IRGVSYRPEHLQEDFGRDRTVLLRSTNIQDGQLDFTSIQIVPSYLVKPAQSVGEGDLVVC 75 Query: 82 MSSGSKSVVGKSAHQHLPFEC--SFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLS 139 MS+GSK++VGK+A + + GAFC V P+ S F+ H + +R I + Sbjct: 76 MSNGSKALVGKAARYKGEYGAPLTVGAFCSVFHPKTESDSAFLRHVFQGEQFRRSIDIIL 135 Query: 140 AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 +G+ INN+K + + I+I E+ IA+ LD + + T E++ Sbjct: 136 SGSAINNLKNSDVEGISIRAHSPTERATIADILDAIDDAILETDTVIEKL 185 >UniRef50_A1UJN5 Restriction endonuclease S subunits-like protein n=2 Tax=Mycobacterium RepID=A1UJN5_MYCSK Length = 419 Score = 99.8 bits (247), Expect = 2e-19, Method: Compositional matrix adjust. 
Identities = 75/329 (22%), Positives = 146/329 (44%), Gaps = 26/329 (7%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ S + +K+ +L +G I + + IP+P L EQ I + L+ L+++D Sbjct: 116 YVMWALNSPRFHSKVVALQSGTTRKRISRKNLASLTIPLPTLDEQNRIVDLLEDHLSRLD 175 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + ++ Q A L + WR+ + + L E + Sbjct: 176 AAESSLRLAMQKADAMTTASLDRQTTAG-SRAWRD--------TTIGAMAELVEYGSSAK 226 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + P+LR+ +++ G ++ +++L +E + LQ GDL+F R N S E V Sbjct: 227 CAGQAADSDVPVLRMGNIQNGKINWTGLKYLPAGHAEFPKLLLQSGDLVFNRTN-SAELV 285 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + + + + LIR R ++ P + + +SP+ R + + GQ + Sbjct: 286 GKSAVFEDTRAAS--FASYLIRVRFGQEVNPAWANMVINSPAGRRYVKSVASQQVGQANV 343 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +G +K+ + LPP+ EQ VR +++ + + Q+ + + R L +++LA AF G Sbjct: 344 NGTKLKAFPLPLPPLDEQCRRVRAHDEVVVSRERLHHQIADLVVRAAGLRRALLAAAFTG 403 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKA 449 LT NSA LLE++++ Sbjct: 404 RLT--------------NSAEGLLEELES 418 Score = 53.1 bits (126), Expect = 2e-05, Method: Compositional matrix adjust. 
Identities = 49/180 (27%), Positives = 83/180 (46%), Gaps = 13/180 (7%) Query: 38 DDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQH 97 D +P++R NIQNGK + T L ++P + + + ++ + S +VGKSA Sbjct: 233 DSDVPVLRMGNIQNGKINWTGLKYLPAGHAEFPKLLLQSGDLVFNRTNSAELVGKSAVFE 292 Query: 98 LPFECSFGAFCGVLRPEKLI---FSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFD 153 SF ++ +R + + ++ + + Y ++S G AN+N K +F Sbjct: 293 DTRAASFASYLIRVRFGQEVNPAWANMVINSPAGRRYVKSVASQQVGQANVNGTKLKAFP 352 Query: 154 LINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKR---FRQAVLGGAVNGKLT 210 L P+PPL EQ D + V S + QI ++ R R+A+L A G+LT Sbjct: 353 L---PLPPLDEQCRRVRAHDEV---VVSRERLHHQIADLVVRAAGLRRALLAAAFTGRLT 406 >UniRef50_B0NIH0 Putative uncharacterized protein n=1 Tax=Clostridium scindens ATCC 35704 RepID=B0NIH0_EUBSP Length = 487 Score = 99.0 bits (245), Expect = 3e-19, Method: Compositional matrix adjust. Identities = 102/408 (25%), Positives = 178/408 (43%), Gaps = 67/408 (16%) Query: 57 TDLVFV-PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK 115 TD++ V P +LV I+ E +A+ +G + V+ + F+ EK Sbjct: 48 TDMILVKPGDLVISG--INVEKGALAIYTGEEDVLASIHYSAYEFDA-----------EK 94 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + ++ F KS ++R + + IK F I I +P L +Q + ++ + Sbjct: 95 IDID-YLKWFLKSGIFRKLLLKQTGRGIKKEIKAKHFLPIEIQLPSLNQQHEVVRQIQGV 153 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE------ 229 + + EQ + ++ RQ +L A+ GKL E+ + EP + +K+ E Sbjct: 154 ADYIVEINQQIEQQTKYMEILRQTILQQAIEGKLCEQNPSDEPASVLLEKIKAEKERLIV 213 Query: 230 --------------------------------SILTEL-RNGLSSKPNESGVGHPILRIS 256 IL E RNG S E +L ++ Sbjct: 214 EKKIKKQKTLPPISNAEKPFVLPKGWEWCRLGEILYEAPRNGYSPPKVERETNTRVLTLT 273 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG-VCGLLKKLQHQNLL 315 + +G +D +++E SE + ++ GDLL R N SL++VG VC L + + + Sbjct: 274 ATTSGILDLQHYKYVEDMISESSYLWIKQGDLLIQRSN-SLDYVGTVC--LCDVVIKGYI 330 Query: 316 YPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV----- 370 YPD +++A+++ +A YI + SP AR + TS S K IK VV Sbjct: 331 
YPDLMMKAKVSNEADSHYIVYYLKSPFARQYFKDRATGTSN----SMKKIKQSVVSEIPI 386 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 LPP+ EQ +IV ++++LFA + +++ A + L +S+L +AF Sbjct: 387 ALPPINEQKQIVAKMKELFALNQKMNQELLQAKKYASQLMESVLQEAF 434 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. Identities = 37/121 (30%), Positives = 61/121 (50%), Gaps = 11/121 (9%) Query: 332 EYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 +Y++ F S R ++ +T G +K I K + LP + +Q E+VR+++ + Sbjct: 98 DYLKWFLKSGIFRKLLLK--QTGRGIKKEIKAKHFLPIEIQLPSLNQQHEVVRQIQGVAD 155 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAE 450 Y I +Q+ + L Q+IL +A G+L Q NP + A+ LLEKIKAE Sbjct: 156 YIVEINQQIEQQTKYMEILRQTILQQAIEGKLCEQ----NP----SDEPASVLLEKIKAE 207 Query: 451 R 451 + Sbjct: 208 K 208 >UniRef50_A4AEB7 Type I restriction-modification system, endonuclease S subunit n=2 Tax=Proteobacteria RepID=A4AEB7_9GAMM Length = 398 Score = 98.6 bits (244), Expect = 5e-19, Method: Compositional matrix adjust. 
Identities = 92/344 (26%), Positives = 148/344 (43%), Gaps = 38/344 (11%) Query: 80 IAMSSGSKSVVGKSAHQHLPFECSFGAFCG----VLRPE-KLIFSGFIAHFTKSSLYRNK 134 I SG + A+Q F C VLRP+ ++ F+ F +S ++ N+ Sbjct: 61 ILFKSGDIIFGKRRAYQRKLCVADFDGICSAHAMVLRPKTDVVLEDFLPFFMQSEIFMNR 120 Query: 135 ISSLSAGANINNIKPASFDLINIPIPPLAEQKII------AEKLDTLLAQVDSTKARFEQ 188 +S G I +PPL EQ+ I AE+ L + S + + Sbjct: 121 AVKISVGGLSPTINWRDLAKEEFALPPLQEQRRIVQLLSAAERYQNALYDL-SERGTSSR 179 Query: 189 IPQILKRFRQAVLGGAVN----GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + R R A LG G+ W N P +LT + GLS + Sbjct: 180 DSLVDHRMRGATLGATTYHERVGRYFNGW-NLVP---------LGELLTAAQYGLSESLH 229 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G +PILR+ ++ G +D+++L+ S+S+ ++L GD+LF R N S E VG G Sbjct: 230 GKG-QYPILRMMNLEDGKATADDLKYLDLSDSDFETYRLVSGDVLFNRTN-SYELVGRTG 287 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 + + ++ LIR + D L PEY+ F +P R +M+ Q I+ Sbjct: 288 VYD--LPGDFVFASYLIRLKTDIDRLSPEYLSAFLRAPIGRRQVMSFATRGVSQANINAS 345 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++K +V LPP+ Q E+V +L AD+ + A+AR+ Sbjct: 346 NLKRVLVPLPPIGYQKEVV----ELLTVADSSRRW---AIARLQ 382 Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust. 
Identities = 40/183 (21%), Positives = 91/183 (49%), Gaps = 13/183 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G+ GW + P+ + T Y +++ + K Y P++R N+++GK DL ++ Sbjct: 202 GRYFNGWNLVPLGELLT---AAQYGLSESL-HGKGQY-PILRMMNLEDGKATADDLKYLD 256 Query: 64 -KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGF 121 + E+ ++ D++ ++ S +VG++ LP + F ++ L+ + + + Sbjct: 257 LSDSDFETYRLVSGDVLFNRTN-SYELVGRTGVYDLPGDFVFASYLIRLKTDIDRLSPEY 315 Query: 122 IAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ F ++ + R ++ S + G + NI ++ + +P+PP+ QK + E LL D Sbjct: 316 LSAFLRAPIGRRQVMSFATRGVSQANINASNLKRVLVPLPPIGYQKEVVE----LLTVAD 371 Query: 181 STK 183 S++ Sbjct: 372 SSR 374 >UniRef50_C3Q383 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3Q383_9BACE Length = 428 Score = 97.4 bits (241), Expect = 9e-19, Method: Compositional matrix adjust. Identities = 105/436 (24%), Positives = 189/436 (43%), Gaps = 43/436 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV- 62 G++P W + P+ + G+TY + N ++D ++R++NIQN K + D V+V Sbjct: 16 GEIPNHWEVVPLKRTGSFENGLTY----SPNDIRDKGYIVLRSSNIQNSKMNYEDTVYVE 71 Query: 63 --PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 P +L+ + DI+I +GS S+VGK A +FGAF P + Sbjct: 72 SVPNDLL-----VKKGDIIICSRNGSASLVGKCAKFDGKIAATFGAFMMRYSPS---INN 123 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 A F+ L RN L + IN + + P+PPL+EQ+ IA LD ++D Sbjct: 124 EYAFFSFQILMRN-YKGLFTTSTINQLTKNVIAQMVCPLPPLSEQQAIASYLDAKTEKID 182 Query: 181 STKARFEQIPQILKRFRQAVLGGAV------NGKLTE---KWRNFEPQHSVFKKLNFESI 231 A+ E+ + L +Q+++ AV N L + KW P+H ++ + + Sbjct: 183 KMIAKAEKKIEYLGELKQSLITRAVTRGLNPNASLKDSGVKWIGKVPEH--WETIKLSRV 240 Query: 232 LTELRNG---LSSKPN-ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + + +G LSS+ + S G+ L+ + G + Q + + + E Sbjct: 241 YSYIGSGTTPLSSQEDYYSEEGYNWLQTGDLNNGLITQTSKKITKKAIDECRMKFYPKHS 300 Query: 288 LLFTRYNGSLEFVGVCGLLKKL-QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 
++ Y ++ VG+ L Q ++ P + + T F+S +A+ Sbjct: 301 VVIAMYGATIGKVGLLDLESTTNQACCVISPTQKMNPLFT----------FYSFMAAKKE 350 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 ++ GQ IS IK V +PP++EQ I+ +++ D I +A + Sbjct: 351 LL-LASFGGGQPNISQDIIKKLRVPVPPLEEQNAIILSLKKECDTIDHIIATQKKKIAYL 409 Query: 407 NNLTQSILAKAFRGEL 422 L QS++ G++ Sbjct: 410 QELKQSLITNVVTGKI 425 >UniRef50_A3SCN8 Restriction endonuclease S subunit-like protein n=1 Tax=Sulfitobacter sp. EE-36 RepID=A3SCN8_9RHOB Length = 497 Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 86/335 (25%), Positives = 155/335 (46%), Gaps = 29/335 (8%) Query: 140 AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQA 199 AG + ++ + I P EQ+ I EKLD L + D +IP+++ +++ Sbjct: 91 AGTTVESLDFNRLKSYPLRIAPSLEQRRIVEKLDILTGRTDRAHDELSRIPELVAKYKSC 150 Query: 200 VLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI-----------LTELRNGLSSKPNESG- 247 L A G+LT +F +HS K E+I ++E++ G+ S Sbjct: 151 FLRLAFTGQLTS---DFRGEHS-RKGTGVENIPDSWAVKPLGEISEIQGGVQVGKKRSSS 206 Query: 248 ---VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 V P LR+++V+ G +D +I+ + + E R L+ GD+L G + +G G Sbjct: 207 TDLVEVPYLRVANVQRGWLDLEEIKTIGVTPQEKERLLLRMGDILMNE-GGDRDKLGR-G 264 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + Q + ++ + + R RL +LP +++ + ++ T+ IS + Sbjct: 265 WVWNNQIADCIHQNHVFRIRLKDSSLPPEFVSHYANEMGQQYFVDQGTQTTNLASISKRK 324 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 + + V +PP E EIV R++ FA+ + I + A + L +IL+KAFRGEL Sbjct: 325 LAALPVPVPPSDEAVEIVNRIDAAFAWLERISSEQAAASKLLPELDAAILSKAFRGELAR 384 Query: 425 QWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 Q NPD + A+ +L ++ E A+ +K+ Sbjct: 385 Q----NPD----DEPASRILARVSVEGQAAPTRKS 411 Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust. 
Identities = 45/239 (18%), Positives = 97/239 (40%), Gaps = 29/239 (12%) Query: 6 LPEGWVIAPVSTVTTLIRGVTY--KKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV- 62 +P+ W + P+ ++ + GV K+ + + ++ +P +R N+Q G D ++ + Sbjct: 178 IPDSWAVKPLGEISEIQGGVQVGKKRSSSTDLVE---VPYLRVANVQRGWLDLEEIKTIG 234 Query: 63 --PKNLVKESQKISPEDIVIAMSSGSKSVVGKS----------AHQHLPFECSFGAFCGV 110 P+ KE + DI++ G + +G+ HQ+ F Sbjct: 235 VTPQE--KERLLLRMGDILMN-EGGDRDKLGRGWVWNNQIADCIHQNHVFRIRLKDSS-- 289 Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE 170 L PE F++H+ + + + N+ +I + +P+PP E I Sbjct: 290 LPPE------FVSHYANEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVN 343 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 ++D A ++ + ++L A+L A G+L + + EP + +++ E Sbjct: 344 RIDAAFAWLERISSEQAAASKLLPELDAAILSKAFRGELARQNPDDEPASRILARVSVE 402 >UniRef50_C6J5M6 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5M6_9BACL Length = 403 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 67/210 (31%), Positives = 112/210 (53%), Gaps = 18/210 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P GW + P+ L++G+TY +Y L ++R++NIQ+GK D V+V + Sbjct: 3 VPNGWAVKPLLECCDLLQGLTYSPSNIQSY----GLLVLRSSNIQDGKLVLDDCVYVNCS 58 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + E + + P DI+I + +GS +++GKS P+ +FGAF VLR + +G++AH Sbjct: 59 -IDEIKYVKPNDILICVRNGSSALIGKSCVIDRPYNATFGAFMSVLRGDT---TGYLAHM 114 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 S + + +I + S+ A IN I F+ I IPIP EQ+ IA A + A Sbjct: 115 FASDVVQQQIRNRSS-ATINQITKRDFEDIKIPIPFDEEEQRAIA-------AALSDADA 166 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 + +++ + ++AV GA+ LT K R Sbjct: 167 YITALEKLITK-KRAVKQGAMQELLTGKRR 195 >UniRef50_UPI000196B4FC hypothetical protein CATMIT_01648 n=1 Tax=Catenibacterium mitsuokai DSM 15897 RepID=UPI000196B4FC Length = 300 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 63/193 (32%), Positives = 102/193 (52%), Gaps = 4/193 (2%) Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 +L NG S E VG +LR+++VR G +D ++ ++ SE E + +GDLL R Sbjct: 102 DLANGRSVPTAE--VGAKVLRLTAVRGGKIDLSEWKYGAWSEEEAKPFAVTEGDLLVVRG 159 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVK 352 NGSL VG GL+ K+ Q + YPD LIR R + + ++ + ++S +RN + + Sbjct: 160 NGSLALVGRAGLVGKVPDQ-VAYPDTLIRLRTIETVVRSAWMSLNWNSELSRNHLEKRAR 218 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 T++G IS DI S V +PP+ EQ I+ + ++E ++ AL + ++ Sbjct: 219 TSAGIYKISQPDIVSVRVPVPPLAEQDRILAEFDTHMKQIGSVEAALDAALKQATAQRKN 278 Query: 413 ILAKAFRGELTAQ 425 +L AF G L Q Sbjct: 279 LLKAAFAGHLVPQ 291 Score = 47.8 bits (112), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 51/221 (23%), Positives = 98/221 (44%), Gaps = 17/221 (7%) Query: 6 LPEGWVIAPVSTVT--TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 LP+GW A V ++ L G + + ++R ++ GK D ++ + Sbjct: 86 LPDGWTWANVDQLSPDDLANGRSVPTAEV-------GAKVLRLTAVRGGKIDLSEWKY-- 136 Query: 64 KNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRP-EKLIF 118 +E K ++ D+++ +GS ++VG++ +P + ++ LR E ++ Sbjct: 137 GAWSEEEAKPFAVTEGDLLVVRGNGSLALVGRAGLVGKVPDQVAYPDTLIRLRTIETVVR 196 Query: 119 SGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 S +++ S L RN + + A I I + +P+PPLAEQ I + DT + Sbjct: 197 SAWMSLNWNSELSRNHLEKRARTSAGIYKISQPDIVSVRVPVPPLAEQDRILAEFDTHMK 256 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 Q+ S +A + + R+ +L A G L + + EP Sbjct: 257 QIGSVEAALDAALKQATAQRKNLLKAAFAGHLVPQDPSDEP 297 >UniRef50_Q1NNI4 Restriction modification system DNA specificity domain n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NNI4_9DELT Length = 344 Score = 96.7 bits (239), Expect = 2e-18, Method: Compositional matrix adjust. 
Identities = 85/345 (24%), Positives = 146/345 (42%), Gaps = 63/345 (18%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M LP GWV+A V + I GV +K + + +P+IR N+ D + Sbjct: 1 MENADLPTGWVMANVDALGEFINGVAFKPADWV----ESGIPIIRIQNLT----DPDKPL 52 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKS----AHQHLPFECSFGAFCGVLRPEKL 116 + V++ + DI+++ S+ + + +QH+ F V PE Sbjct: 53 NRTEREVEDKYVVEHNDILVSWSATLDAFRWRGPRAYVNQHI--------FKVVPNPE-- 102 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + +GF+ + K S+ S G + +I F +PPL EQ+ I EK++TL Sbjct: 103 LDTGFVFYALKESIRELVHSEHLHGTTMKHINRKPFLAHPRALPPLNEQRRIVEKIETLF 162 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-----NFEPQHSVFKKL----- 226 A++D +A ++ ++L +RQ+VL AV G+LT WR EP + ++ Sbjct: 163 ARLDKGEAALREVQKLLASYRQSVLKAAVTGQLTADWRAENAHRLEPGRDLLTRILQTRR 222 Query: 227 ------------------------------NFESILTELRNGLSSKPNESGVGHPILRIS 256 + + + + G S+K +G P+LR+ Sbjct: 223 DTWQGRGKYKEPTTPDTTNLPELPEGWVWATVDQVSSSVDYGSSAKCTTDAIGVPVLRMG 282 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 ++ G +D + ++L SE + L+ DLLF R N S E VG Sbjct: 283 NIVGGTLDLRNFKYLPDDHSEFPKLLLESRDLLFNRTN-SAELVG 326 Score = 56.6 bits (135), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 59/211 (27%), Positives = 91/211 (43%), Gaps = 37/211 (17%) Query: 232 LTELRNGLSSKPN---ESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGD 287 L E NG++ KP ESG+ PI+RI ++ + + L +E E+ +++ ++ D Sbjct: 18 LGEFINGVAFKPADWVESGI--PIIRIQNL------TDPDKPLNRTEREVEDKYVVEHND 69 Query: 288 LLFT--------RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 +L + R+ G +V QH + P+ + AL E I Sbjct: 70 ILVSWSATLDAFRWRGPRAYVN--------QHIFKVVPNPELDTGFVFYALKESIRELVH 121 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 S M K I+ K + LPP+ EQ IV ++E LFA D E + Sbjct: 122 SEHLHGTTM---------KHINRKPFLAHPRALPPLNEQRRIVEKIETLFARLDKGEAAL 172 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAEN 430 + + QS+L A G+LTA WRAEN Sbjct: 173 REVQKLLASYRQSVLKAAVTGQLTADWRAEN 203 >UniRef50_C2CSZ9 Type I restriction modification DNA specificity protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CSZ9_CORST Length = 371 Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 92/339 (27%), Positives = 152/339 (44%), Gaps = 32/339 (9%) Query: 85 GSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANI 144 GS V S+ + +FG F + E+ + S ++ K ++ L A + Sbjct: 64 GSAGFVEWSSGNCWIIDTAFGVFP---KSEEQVDSRWLYWLLKDL----RLGRLQKHAAV 116 Query: 145 NNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGA 204 I A +PPL EQ+ IA LD +VD R Q L + +Q + Sbjct: 117 PGISKADVVEEKFLLPPLDEQRRIAAILD----EVDEALFRVNQSLGDLLQLKQELF--- 169 Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRA-GHV 263 T+ + E + ++ + L + G S K NE+ VG PILR+ +V G + Sbjct: 170 -----TDLFLRIERESTIIGEY-----LESTQYGTSDKANEN-VGIPILRMGNVSYNGEI 218 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 D +D++++E S+ ++ L+ GDLLF R N S + VG ++ +LQ + Y LIR Sbjct: 219 DLSDLKYVELDASDREKYSLKAGDLLFNRTN-SKDLVGKTAVVPELQEE-YTYAGYLIRC 276 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R+ A+PEYI F +S + + N K G I+ ++K + + EQ E Sbjct: 277 
RVNDKAVPEYISGFLNSVLGKKILRNTAKAIVGMANINANELKRLPIPQASLDEQQEFA- 335 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 L + D +E Q+ + L +S+ +AF+ EL Sbjct: 336 ---SLTSRIDDVESQMKRQRKLLQELQESLSTRAFQEEL 371 >UniRef50_B0QS41 Type I restriction enzyme EcoKI subunit R n=1 Tax=Haemophilus parasuis 29755 RepID=B0QS41_HAEPR Length = 397 Score = 95.9 bits (237), Expect = 3e-18, Method: Compositional matrix adjust. Identities = 78/268 (29%), Positives = 129/268 (48%), Gaps = 10/268 (3%) Query: 160 PPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN---F 216 P ++Q+ +A+K LL+QV K R E+IP +LK +RQ+VL AVNG+L+ KWR Sbjct: 134 PSFSQQQKLAKKFTVLLSQVAEIKQRLEKIPALLKTYRQSVLARAVNGELSAKWREENGV 193 Query: 217 EPQHSVFKKLNFESILTELRNGLSSK--PNESGVGHPILRISSVRAGHVDQNDIRFLECS 274 V++K + I ++++G + K P E P L++ ++ ++ + Sbjct: 194 SLDSWVYEKA--QHICDKVQSGSTPKGNPFEQNGTIPFLKVYNIVNQELNFDYKPQFVTK 251 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYI 334 E R D+L L V + + + N+ L R ++ ++ Sbjct: 252 EQHSQRSITLPNDVLMNIVGPPLGKVAI--VTNQYSEWNINQAITLFRCN-PRNLHYKFF 308 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 + + +K GQ IS + +V +P ++EQ I + VE+ +A+ Sbjct: 309 YFVLREGRFIREIEHDLKGIVGQINISLSQCRDMIVPVPTLEEQNYITQAVEKHLNFANQ 368 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGEL 422 +E QVN AL RVN +TQ+ILAK FRGEL Sbjct: 369 LEAQVNAALERVNLMTQAILAKGFRGEL 396 >UniRef50_A4VH87 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VH87_PSEU5 Length = 472 Score = 95.5 bits (236), Expect = 4e-18, Method: Compositional matrix adjust. 
Identities = 55/111 (49%), Positives = 74/111 (66%), Gaps = 8/111 (7%) Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 T+G+ ++ + S + +PP EQ EIVRRVEQLFA+AD +E +VN A A ++ LTQSI Sbjct: 369 TTGRAKLTQGALLSLPIQVPPATEQTEIVRRVEQLFAFADQLEARVNAAKACIDRLTQSI 428 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 LAKAFRGEL Q +P+ + A+ LLE+IKA+RAA+ K RK S Sbjct: 429 LAKAFRGELVPQ----DPN----DEPASVLLERIKAQRAAAPKTKRGRKAS 471 Score = 89.0 bits (219), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 51/111 (45%), Positives = 68/111 (61%), Gaps = 4/111 (3%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSA--GANINNIKPASFDLINIPIPPLAEQKI 167 VLRP + I F + KS + +I S + G + +I + +PPLAEQ Sbjct: 106 VLRPIEGIVPKFSFYMLKS--FGAEILSACSKDGTTVQSIDSEKLETFLFSLPPLAEQTR 163 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 IA+KLD LLAQVD+ KAR + IP +LKRFRQ+VL AV+G+LTE+WR P Sbjct: 164 IAQKLDELLAQVDTLKARIDAIPALLKRFRQSVLAAAVSGRLTEEWRGSIP 214 Score = 54.7 bits (130), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 27/89 (30%), Positives = 48/89 (53%) Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 C K + + I + +++ + LPP+ EQ I +++++L A DT++ +++ A + Sbjct: 133 CSKDGTTVQSIDSEKLETFLFSLPPLAEQTRIAQKLDELLAQVDTLKARIDAIPALLKRF 192 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGEN 438 QS+LA A G LT +WR P S E Sbjct: 193 RQSVLAAAVSGRLTEEWRGSIPASESAEE 221 Score = 44.3 bits (103), Expect = 0.009, Method: Compositional matrix adjust. 
Identities = 44/224 (19%), Positives = 84/224 (37%), Gaps = 23/224 (10%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PEGW++A VS+ + + ++ + + P AN G+ D D Sbjct: 252 EVPEGWIVASVSSFAECLDSMRVPVKKELRESGEGKYPYFGAN----GEVDRVDEYIFDD 307 Query: 65 NLV--KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 +LV E + IA K V H LR + ++ Sbjct: 308 DLVLVTEDETFYGRVKPIAYKYSGKCWVNNHVH--------------ALRAHDAVARDYL 353 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + ++ + + L+ + + + I +PP EQ I +++ L A D Sbjct: 354 CYVL---MHYDVVPWLTGTTGRAKLTQGALLSLPIQVPPATEQTEIVRRVEQLFAFADQL 410 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 +AR + R Q++L A G+L + N EP + +++ Sbjct: 411 EARVNAAKACIDRLTQSILAKAFRGELVPQDPNDEPASVLLERI 454 >UniRef50_Q2J5T0 Restriction modification system DNA specificity domain n=1 Tax=Frankia sp. CcI3 RepID=Q2J5T0_FRASC Length = 436 Score = 95.1 bits (235), Expect = 5e-18, Method: Compositional matrix adjust. Identities = 90/356 (25%), Positives = 164/356 (46%), Gaps = 35/356 (9%) Query: 42 PLIRANNIQNGKFDTTDLVFVPKNLVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPF 100 P +R N+Q G+ +D+ ++ + + + + D+++ + + +G+ A Q P Sbjct: 48 PYLRVANVQRGRLTLSDVAWLEASARERIRYALDDGDLLVVEGHANPAEIGRCA-QVGPE 106 Query: 101 E--CSFGAFCGVLRPEKLIFSGFIAHFTKSSL---YRNKISSLSAGANINNIKPASFDLI 155 C + LRP L + F H+ SS Y + + S+G + I + Sbjct: 107 SKNCLYQNHLFRLRPRNL-EARFALHWLNSSFSQSYWGRNCATSSG--LYTINSRQLGAL 163 Query: 156 NIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV--NGKLTEKW 213 IP+PP +Q+ I+E LD + ST+ ++ Q+ R +L V +G+L + W Sbjct: 164 PIPVPPPDKQRKISEILDAADEAIRSTERLVGKLEQVFDSLRGDLLQEHVIRSGRLPDCW 223 Query: 214 RNFEPQHSVFKKLNFESILTELRNGLSSKPNESG---VGHPILRISSVRAGHVDQNDIRF 270 R +L+ L+E+ G++ S V P LR+++V+ G++D DI+ Sbjct: 224 R--------MDRLDR---LSEITGGVTLGGVTSAGRSVELPYLRVANVQDGYIDTTDIKT 272 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVC----GLLKKLQHQNLLYPDKLIRARLT 326 + SE +R+ LQ GD+L T G + +G G + HQN ++ + + RL Sbjct: 273 
VTVRTSEFDRYLLQAGDVLMTE-GGDFDKLGRGAVWDGSIDPCLHQNHIFRVRCDKIRL- 330 Query: 327 KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 LPEY+ + +S + R+ M K T+ I+ + + V LPP+ Q I+ Sbjct: 331 ---LPEYLSTYSASTAGRSYFMGISKQTTNLASINKSQLSALPVPLPPLATQKMII 383 Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust. Identities = 53/216 (24%), Positives = 97/216 (44%), Gaps = 26/216 (12%) Query: 234 ELRNGLSSKPNESG--VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 E+++G++ P + P LR+++V+ G + +D+ +LE S E R+ L DGDLL Sbjct: 29 EIQSGITLSPRRTSGRKDAPYLRVANVQRGRLTLSDVAWLEASARERIRYALDDGDLLVV 88 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF------SSPSARN 345 + + +G C + + +N LY + L R R P +E F SS S Sbjct: 89 EGHANPAEIGRCAQVGP-ESKNCLYQNHLFRLR------PRNLEARFALHWLNSSFSQSY 141 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 NC T+SG I+ + + + + +PP +Q R++ ++ AD + + + Sbjct: 142 WGRNCA-TSSGLYTINSRQLGALPIPVPPPDKQ----RKISEILDAADEAIRSTERLVGK 196 Query: 406 VNNLTQSILAKAFR------GELTAQWRAENPDLIS 435 + + S+ + G L WR + D +S Sbjct: 197 LEQVFDSLRGDLLQEHVIRSGRLPDCWRMDRLDRLS 232 Score = 57.8 bits (138), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 49/216 (22%), Positives = 94/216 (43%), Gaps = 21/216 (9%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 + +G+LP+ W + + ++ + GVT + + LP +R N+Q+G DTTD+ Sbjct: 214 IRSGRLPDCWRMDRLDRLSEITGGVTLGGVTSAG--RSVELPYLRVANVQDGYIDTTDIK 271 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSA----------HQHLPFECSFGAFCGV 110 V + + + V+ G +G+ A HQ+ F C Sbjct: 272 TVTVRTSEFDRYLLQAGDVLMTEGGDFDKLGRGAVWDGSIDPCLHQNHIFRVR----CDK 327 Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIA 169 +R + +++ ++ S+ R+ +S N+ +I + + +P+PPLA QK+I Sbjct: 328 IR----LLPEYLSTYSASTAGRSYFMGISKQTTNLASINKSQLSALPVPLPPLATQKMII 383 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV 205 L Q+ STKA ++ + + +L G V Sbjct: 384 GSLGAAERQISSTKAELAKLRLVKQGLMDDLLMGRV 419 >UniRef50_Q0W4T6 Type I restriction modification system, specificity subunit n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4T6_UNCMA Length = 484 Score = 94.0 bits (232), Expect = 1e-17, Method: Compositional matrix adjust. 
Identities = 106/367 (28%), Positives = 164/367 (44%), Gaps = 74/367 (20%) Query: 148 KPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNG 207 K ASF+ I +PPLAEQ I K++ L Q+D+ ++ + +K++RQAVL A NG Sbjct: 139 KIASFE---IALPPLAEQHRIVAKIEELFTQLDAGVEALKKAKEQIKQYRQAVLESAFNG 195 Query: 208 KLTEKWR---------------NFEPQHSVFKKL---NFESILTELRNG----------- 238 KLTEKWR N + S K ES L E+ NG Sbjct: 196 KLTEKWRLSSKEYIAPISEFISNVQKTRSTDGKTVCDQLESTL-EMPNGWLGVLLYQIAD 254 Query: 239 -------LSSKPNESGVGH-PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 L S N G P + S+V + ++ + D E + E N L+ Sbjct: 255 IGTGATPLRSNKNYYENGTIPWITSSAVNSQYITKADEFITELAIKETNAKIFPKNSLII 314 Query: 291 TRYNGSLEFVGVCGLLKKLQHQN----LLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 Y V LL + +++ D+ + L +I+++F + Sbjct: 315 ALYGEGKTRGKVSELLIEAATNQACAAIIFNDQTV-------VLKPFIKLYF-----QKN 362 Query: 347 MMNCVKTTSG--QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + K SG Q ++ IKS ++ LPP+ EQ IV +E+ F + IEK ++ +L+ Sbjct: 363 YEDLRKLASGGVQPNLNLGIIKSTLIPLPPLAEQEIIVGEIEKKFPIMEDIEKTIDQSLS 422 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER--AASG-----GK 457 L QSIL++AF G+L Q NP+ + A LLE+I+AER A+G G Sbjct: 423 YSETLRQSILSQAFSGKLVPQ----NPN----DEPAEKLLERIRAERLNQAAGKPQNSGP 474 Query: 458 KASRKKS 464 + +RK++ Sbjct: 475 RRTRKQA 481 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 61/251 (24%), Positives = 116/251 (46%), Gaps = 18/251 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 S ++P GW+ + + + G T + NY ++ +P I ++ + N ++ T F Sbjct: 236 STLEMPNGWLGVLLYQIADIGTGATPLRSNK-NYYENGTIPWITSSAV-NSQYITKADEF 293 Query: 62 VPKNLVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCG--VLRPEKLIF 118 + + +KE+ KI P++ +I G GK + L E + C + + ++ Sbjct: 294 ITELAIKETNAKIFPKNSLIIALYGEGKTRGKVSE--LLIEAATNQACAAIIFNDQTVVL 351 Query: 119 SGFIA-HFTKSSLYRNKISSLSAGANIN-NIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 FI +F K+ K++S N+N I ++ IP+PPLAEQ+II +++ Sbjct: 352 KPFIKLYFQKNYEDLRKLASGGVQPNLNLGIIKSTL----IPLPPLAEQEIIVGEIEKKF 407 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 ++ + +Q + RQ++L A +GKL + N EP + +++ E + Sbjct: 408 PIMEDIEKTIDQSLSYSETLRQSILSQAFSGKLVPQNPNDEPAEKLLERIRAERL----- 462 Query: 237 NGLSSKPNESG 247 N + KP SG Sbjct: 463 NQAAGKPQNSG 473 Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 44/183 (24%), Positives = 81/183 (44%), Gaps = 23/183 (12%) Query: 262 HVDQND---IRFLECSESELNRHKLQDGDLLFTRYNGSL------EFVGVCGLLKKLQHQ 312 H++++ + F +E + GDLL+ + L E G+C + ++ Sbjct: 42 HIEKDTGKLLSFGNSTEVTSTKTVFHKGDLLYGKLRPYLNKVCVTEIDGICSTDILVFNE 101 Query: 313 NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL 372 +KL++ R+ P+++ R A N T + K I S + L Sbjct: 102 QRFLSNKLLKYRM---LCPDFV---------RYANQNA--TGVNHPRVDFKKIASFEIAL 147 Query: 373 PPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPD 432 PP+ EQ IV ++E+LF D + + A ++ Q++L AF G+LT +WR + + Sbjct: 148 PPLAEQHRIVAKIEELFTQLDAGVEALKKAKEQIKQYRQAVLESAFNGKLTEKWRLSSKE 207 Query: 433 LIS 435 I+ Sbjct: 208 YIA 210 >UniRef50_B0JHV8 Restriction modification system DNA specificity domain n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JHV8_MICAN Length = 395 Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 94/366 (25%), Positives = 150/366 (40%), Gaps = 36/366 (9%) Query: 64 KNLVKESQKISPEDIVI--AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 K VK S+ + P D ++ +MS G ++ S C + + + +I + Sbjct: 59 KTGVKNSRMVYPGDFLLTNSMSFGHPYIMKTSG-------CIHDGWLVLSNKKGVIDQDY 111 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 H S L + S L++G+ + N+ I + +PPL EQ+ IA LD Sbjct: 112 FYHLLGSDLIYAEFSRLASGSTVKNLNIEIVKGIKVSLPPLEEQRRIAAILDKADGVRRK 171 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 K ++LK + G V P+ K+L I T +NG+ Sbjct: 172 RKEAIRLTEELLKSTFLEMFGDPVTN----------PKGWEVKRLG--EICTNFQNGIGK 219 Query: 242 KPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 G G + IS + H L+ + E+ ++ L GDLLF R + E V Sbjct: 220 NSEHYGHGSKVANISDLYEWHRFIPEKYSLLDVTPKEIEKYSLMRGDLLFVRSSVKREGV 279 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 VC + + L+ +IR R D + PE++ + +P RN ++ TS Sbjct: 280 AVCSVYD--SDEICLFSSFMIRVRPRTDLINPEFLSLMLRTPPMRNRLI-LGSNTSTITN 336 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN---NALARVNNLTQSILAK 416 IS + V++PP+K Q I + IE+ V AL + NL S+L + Sbjct: 337 ISQPGLSKIEVVVPPIKTQNLITK-------VTKNIEESVRCHLQALEQSENLFNSLLQR 389 Query: 417 AFRGEL 422 AFRGEL Sbjct: 390 AFRGEL 395 >UniRef50_A6H2J0 Restriction-modification enzyme n=3 Tax=Gammaproteobacteria RepID=A6H2J0_PSEPU Length = 1289 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 74/263 (28%), Positives = 118/263 (44%), Gaps = 32/263 (12%) Query: 149 PASF--DLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVN 206 P SF DL IP+PPL Q I ++ + V S + + Q ++ +++ Sbjct: 1041 PESFYADL-RIPVPPLKVQSQICDEFTKVDKAVQSARTKIASTQQSIELLVESIYA---- 1095 Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQ 265 S ++ + + ++ GLS K NE G+G+ I R++ + G VD Sbjct: 1096 --------------STAPRIEIAKLSSNIQYGLSEKMNEVGIGYKIFRMNEIIQGRMVDD 1141 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR-AR 324 ++ + S E +KL GDLLF R NGSLE +G GL + Y L+R Sbjct: 1142 GAMKCADISVEEFANYKLNKGDLLFVRSNGSLEHIGKVGLFD--LEGDYCYASYLVRIVP 1199 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + ALP+Y+ +SP R M+ + G I+ +KS V P + EQ E V + Sbjct: 1200 DSSKALPQYLVSIMNSPIFRKGMVQLAVKSGGTNNINATKMKSIKVPTPSLAEQEEFVVK 1259 Query: 385 VEQLFAYADTIEKQVNNALARVN 407 V D + KQ+ +A A ++ Sbjct: 1260 V-------DALGKQIADAQAVID 1275 >UniRef50_A1VBQ9 Restriction modification system DNA specificity domain n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VBQ9_DESVV Length = 595 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. 
Identities = 67/217 (30%), Positives = 111/217 (51%), Gaps = 9/217 (4%) Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 G S K + + G +LRI ++ G +D +D+++ S E +++L+ GDLL R NGS+ Sbjct: 305 GTSRKSDYNIDGTGVLRIPNIVDGKIDSSDLKYTAFSPGEEEQYRLKAGDLLTIRSNGSV 364 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 VG C L++ +Y LIR R + +++ SS RN + + K+TSG Sbjct: 365 SLVGQCALIED-DDTRYVYAGYLIRLRTIGLLVSKFLLYCLSSLRLRNQIESKAKSTSGV 423 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 I+ +++ S +V L EQ E+ + + + A + L + L QSIL KA Sbjct: 424 NNINSQELSSLIVPLCSQLEQNEVSKLLADSLSTAGEQTSMIEIQLEHIRILKQSILDKA 483 Query: 418 FRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 F G L +Q +P+ + A+ LLE+IK ER ++ Sbjct: 484 FSGTLISQ----DPN----DEPASKLLERIKQERKSA 512 Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust. Identities = 37/95 (38%), Positives = 54/95 (56%), Gaps = 5/95 (5%) Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 K +S K + + LPP+ EQ IV ++E+LF+ D + + A ++ QS+L A Sbjct: 145 KHLSSKTVNEIPLPLPPLNEQNRIVAKIEELFSELDAGVENLTKAKEQLGVYRQSLLKHA 204 Query: 418 FRGELTAQWRAENPD-LISGENSAAALLEKIKAER 451 F G+LT WR N D L SGE ALL+++K ER Sbjct: 205 FEGKLTEAWRKRNADKLESGE----ALLKRVKKER 235 Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust. 
Identities = 62/228 (27%), Positives = 108/228 (47%), Gaps = 15/228 (6%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 GWV A + LI Y + +Y D ++R NI +GK D++DL + + + Sbjct: 289 GWVWARLGN---LIDPPAYGTSRKSDY-NIDGTGVLRIPNIVDGKIDSSDLKYTAFSPGE 344 Query: 69 ESQ-KISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRPEKLIFSGFIAHFT 126 E Q ++ D++ S+GS S+VG+ A + + + LR L+ S F+ + Sbjct: 345 EEQYRLKAGDLLTIRSNGSVSLVGQCALIEDDDTRYVYAGYLIRLRTIGLLVSKFLLYCL 404 Query: 127 KSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQ----KIIAEKLDTLLAQVDS 181 S RN+I S + + + +NNI + +P+ EQ K++A+ L T Q Sbjct: 405 SSLRLRNQIESKAKSTSGVNNINSQELSSLIVPLCSQLEQNEVSKLLADSLSTAGEQTSM 464 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 + + E I +ILK Q++L A +G L + N EP + +++ E Sbjct: 465 IEIQLEHI-RILK---QSILDKAFSGTLISQDPNDEPASKLLERIKQE 508 Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 49/213 (23%), Positives = 89/213 (41%), Gaps = 31/213 (14%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--------GKFDTTDLVF 61 W VS + + G +K + + +D+ +PLIR +I + G+FD LV Sbjct: 25 WKRVYVSEIAMVQNGFAFKSK---FFSRDEGIPLIRIRDILSAETEHKYFGQFDKEYLVH 81 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L+ D V A G + ++ + C ++ + F Sbjct: 82 NGDLLIGMDG-----DFVAAYWPGKEGLLNQRV-------------CRIVIESENYDKKF 123 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 F Y + I ++ + ++ + + I +P+PPL EQ I K++ L +++D+ Sbjct: 124 F--FLALQPYLDAIHEKTSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKIEELFSELDA 181 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 + + L +RQ++L A GKLTE WR Sbjct: 182 GVENLTKAKEQLGVYRQSLLKHAFEGKLTEAWR 214 >UniRef50_D2NCT2 Putative uncharacterized protein n=1 Tax=Escherichia coli SE15 RepID=D2NCT2_ECOLX Length = 415 Score = 92.4 bits (228), Expect = 3e-17, Method: Compositional matrix adjust. 
Identities = 62/208 (29%), Positives = 101/208 (48%), Gaps = 5/208 (2%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P GW + + L RGVTY KE D + ++RA N+ D DLVF+P Sbjct: 213 EIPAGWNDSILGKFIELDRGVTYSKEDVRTQDDKDTIGILRATNVTGNNVDIDDLVFIPS 272 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + V +Q ++ DI+I MSSGSK VGK+ + + +FGAFC + P + + FI Sbjct: 273 SRVNVNQMLNKFDILIVMSSGSKEHVGKNGVYYFEKKHAFGAFCSKITPVRK-YRYFINT 331 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 F +S +++ I++ G NINN+ I P + K+ + ++ S Sbjct: 332 FLQSKWFKSYINNQCLGTNINNLTNTHITNCEIICPTPDVVALFENKMMPIYNKLASNTQ 391 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEK 212 + Q+ R +L +NG++T K Sbjct: 392 ENSHLIQL----RDWLLPLLMNGQVTVK 415 >UniRef50_C5RH89 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RH89_CLOCL Length = 457 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 104/456 (22%), Positives = 207/456 (45%), Gaps = 38/456 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF--DTTDLVFV 62 ++PE WV + + ++ L+ G T K Y +P I+ ++ G+ +T+ + Sbjct: 23 EVPENWVWSNLKSIADLVTGNTPSKNNEEFY--GGKIPFIKPTDLNQGRILNSSTETL-- 78 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 N+ +I P+ G+ +GK A+ L E + + P+K I++ ++ Sbjct: 79 -SNIGATKARILPKGSTAVCCIGA--TIGKVAY--LNVEGATNQQINSIIPKK-IYNLYV 132 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++T SS + + + S+ + I + + IP+PPL EQ+ I +++ L ++D Sbjct: 133 YYYTLSSYFHDTLIENSSSTTLPIINKSRMGELLIPLPPLKEQQRIVNRIENLFEKLDKA 192 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT-EKWRNFEPQHSVFKKLNF-------ESILTE 234 K E+ + ++ + A+ A G L K P + F KL + E I + Sbjct: 193 KELIEEAREGFEKRKAAITSKAFRGILNYRKGEKVNPINEGFYKLPYNWKWTKLEDICEK 252 Query: 235 LRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSES--ELNRHKLQDGDLLFT 291 + +G + P G + + +++ +D + I ++ E R ++ GD+L+ Sbjct: 253 
ITDGTHNSPKSYEYGDYKYVTAKNIKEWGIDLSSITYVTKKEHIPIYKRCDVKYGDILYI 312 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + + G+ + + + +LL LIR D +Y+ +S + ++ V Sbjct: 313 KDGAT---TGIATINELTEEFSLLSSVALIRVGKCIDN--KYLYYILNSFEIKKRILESV 367 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 K + + ++ K I ++ LPP++EQ EIV+ +++L I K++ ++N + + Sbjct: 368 KGVAITR-LTLKKINDIIIPLPPLEEQKEIVKILDKLLEEESKI-KELTQLEDQINLIKK 425 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 SILAKAFRG+L + SA LL+KI Sbjct: 426 SILAKAFRGQLGTNCEE--------DESALELLKKI 453 >UniRef50_Q1NN41 Restriction modification system DNA specificity domain n=9 Tax=Bacteria RepID=Q1NN41_9DELT Length = 603 Score = 92.0 bits (227), Expect = 4e-17, Method: Compositional matrix adjust. Identities = 115/506 (22%), Positives = 201/506 (39%), Gaps = 108/506 (21%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP GW ++ + T G T K+ I D +P ++ + + + + Sbjct: 101 LPAGWAYCRLNEIGTWGSGATPKR--GITEYYDGGIPWFKSGELVGDFISSAEETITERA 158 Query: 66 LVKESQKIS-PEDIVIAM---SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L + S +++ P D++IAM + G S++ A + C+ F G+L + + Sbjct: 159 LKETSVRLNLPGDVLIAMYGATIGKASILKCHATTNQAV-CACTPFSGIL-------NTY 210 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +F K+S + +S+ AG NI + P+PPLAEQ I EK+D L+A D Sbjct: 211 LLNFLKAS--KRHFTSMGAGGAQPNISKEKIIAVVFPLPPLAEQHRIVEKVDELMALCDR 268 Query: 182 TK-------ARFEQIPQIL----------------------------------KRFRQAV 200 + A E + + L R +Q + Sbjct: 269 LEQQTSDQLAAHETLVETLLDTLSRSADATELAANWTRLQTHFDTLFTTESSIDRLKQTI 328 Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLNFESI----------------------------- 231 L AV G+L + N EP ++ KK+ E Sbjct: 329 LQLAVMGRLVSQDPNDEPASALLKKIAAEKARLVKEGKIKKTKPLPKISEEEKPFTLPDG 388 Query: 232 -----LTELRN----GLSSKPNE---SGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 L E+ N G S K ++ SG +L++S+V G ++ + L Sbjct: 389 WEWCRLGEIANQSEAGWSPKCDDVPKSGKEWGVLKVSAVTWGKFLSDENKRLPQHLEPRR 448 Query: 280 
RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 +H+++ D L +R N + E V ++ + +L+ DK+IR + P YI +F + Sbjct: 449 KHEVKPNDFLISRAN-TAELVARSVVVPEDVPSHLIMSDKIIRIEFSPLVFPGYINLFNA 507 Query: 340 SPSARNAMMNCV-KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 S AR T+S K +S + I++ V LPP EQ I+R+++++ + ++ Sbjct: 508 SSVARAYYARVAGGTSSSMKNVSREQIQALCVPLPPYPEQLRILRKMDKVVHLCEQLKAH 567 Query: 399 VNNAL--------ARVNNLTQSILAK 416 + A A NN S LA+ Sbjct: 568 LGRASQTRQRFAEAVANNTITSCLAR 593 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. Identities = 27/75 (36%), Positives = 40/75 (53%), Gaps = 4/75 (5%) Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q IS + I + V LPP+ EQ IV +V++L A D +E+Q ++ LA L +++L Sbjct: 230 AQPNISKEKIIAVVFPLPPLAEQHRIVEKVDELMALCDRLEQQTSDQLAAHETLVETLLD 289 Query: 416 KAFRG----ELTAQW 426 R EL A W Sbjct: 290 TLSRSADATELAANW 304 >UniRef50_Q12PV3 Restriction modification system DNA specificity domain n=2 Tax=Gammaproteobacteria RepID=Q12PV3_SHEDO Length = 633 Score = 90.9 bits (224), Expect = 9e-17, Method: Compositional matrix adjust. Identities = 63/187 (33%), Positives = 103/187 (55%), Gaps = 7/187 (3%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP GW + + ++IRG+T+ + L + IR N+Q+ + DL++V + Sbjct: 121 ELPNGWKWSRLGDFVSIIRGITFPSSEKHRELAPSRVACIRTTNVQDS-LEWDDLLYVDR 179 Query: 65 NLVK-ESQKISPEDIVIAMSSGSKSVVGK-SAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 + VK E Q + DIV++M++ S+ +VGK S H+P E SFG F V+RP + S F Sbjct: 180 SYVKREEQYLKLGDIVMSMAN-SRELVGKVSFITHIPVGESSFGGFLSVIRPYQFDAS-F 237 Query: 122 IAHFTKSSLYRNK-ISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + ++ L +N+ I S S NI NI + + I +PPL EQ I K+D L++ D Sbjct: 238 LMSVLRAPLTKNELIGSASQTTNIANISLEKLNPLVIAVPPLEEQHRIVAKVDELMSLCD 297 Query: 181 STKARFE 187 + +A+ E Sbjct: 298 ALEAQTE 304 Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust. 
Identities = 57/249 (22%), Positives = 123/249 (49%), Gaps = 29/249 (11%) Query: 186 FEQIPQILKRFR----QAVLGGAVNGK-----LTEKWRNFE-PQHSVFKKL-NFESILTE 234 +E ++LKR + Q + G + + +T++ + FE P+ + +L + +++T Sbjct: 396 YEPAAKLLKRIKAEKAQLIKDGKIKKQKPLPDITDEEKPFELPESGEWVRLGDLCTLVTS 455 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRY 293 G + ESG +R ++ V+ +D +++ E+ E R K+ G+LL T Sbjct: 456 GSRGWKTYYAESGAT--FIRSQDIKYDRVEFDDKAYVKLPETTEGKRTKVDVGNLLMTIT 513 Query: 294 NGSLEFVGVCGL-LKKL---QHQNLLYPDKLIRARLTKDALPEYIEIFFSSP-SARNAMM 348 ++ + + L + QH L+ KLI + + K YI ++ + R ++ Sbjct: 514 GANVAKTAIVEIELDEAYVSQHVALI---KLINSVMNK-----YIHLWLTGAFGGRGLLL 565 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 C + + G++ ++I ++ +PP++EQ IV +VE+L A D ++ ++++A + Sbjct: 566 EC--SYGAKPGLNLQNINELIIPIPPLEEQHRIVAKVEELMALCDKLKARLSDAQTTQLH 623 Query: 409 LTQSILAKA 417 LT +I+ +A Sbjct: 624 LTDAIVEQA 632 Score = 46.2 bits (108), Expect = 0.003, Method: Compositional matrix adjust. Identities = 37/166 (22%), Positives = 79/166 (47%), Gaps = 8/166 (4%) Query: 254 RISSVRAGHV----DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 R++ +R +V + +D+ +++ S + L+ GD++ + N S E VG + + Sbjct: 156 RVACIRTTNVQDSLEWDDLLYVDRSYVKREEQYLKLGDIVMSMAN-SRELVGKVSFITHI 214 Query: 310 QHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQ 368 + L R + DA ++ +P +N ++ T+ IS + + Sbjct: 215 PVGESSFGGFLSVIRPYQFDA--SFLMSVLRAPLTKNELIGSASQTTNIANISLEKLNPL 272 Query: 369 VVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 V+ +PP++EQ IV +V++L + D +E Q ++A L +++L Sbjct: 273 VIAVPPLEEQHRIVAKVDELMSLCDALEAQTEASIAAHQTLVETLL 318 >UniRef50_B7VNG6 Type I restriction enzyme EcoKI, S subunit n=1 Tax=Vibrio splendidus LGP32 RepID=B7VNG6_VIBSL Length = 522 Score = 90.9 bits (224), Expect = 9e-17, Method: Compositional matrix adjust. 
Identities = 49/116 (42%), Positives = 71/116 (61%) Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE 170 LRP + +A+ S +I++ + G+ I + +NI +PPLAEQK I E Sbjct: 113 LRPNPEVNRKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNINLPPLAEQKRIVE 172 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 K+D +LAQVD+ KAR + IP +LKRFRQ+VL AV+GKLTE+WR + + +L Sbjct: 173 KIDEVLAQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNEL 228 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 41/54 (75%), Positives = 46/54 (85%) Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQ 425 LPP++EQ EIVR V+Q FA+ADTIE QV A ARV+NLTQSILAKAFRGEL AQ Sbjct: 427 LPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLTQSILAKAFRGELVAQ 480 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 56/218 (25%), Positives = 106/218 (48%), Gaps = 9/218 (4%) Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G PI+RI +V+ +I+++ ++E L RH + GDLL T+ E +G+ + Sbjct: 42 GTPIVRIQNVKRMAFLNKNIKYVTDEKAEFLKRHSFKSGDLLLTKLG---EPLGLTCIAP 98 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + ++ ++ D ++R R + + + +S +N S + I+ +++ Sbjct: 99 EYLNEGIIVAD-IVRLRPNPEVNRKCLAYLLNSEGVIK-QINAHTKGSTRARINLSVVRN 156 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 + LPP+ EQ IV +++++ A DTI+ +++ + QS+L A G+LT +WR Sbjct: 157 LNINLPPLAEQKRIVEKIDEVLAQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWR 216 Query: 428 AENPDLISGENSAAALLEKIKAERAASG--GKKASRKK 463 E D N A +E+ + E S KK S+ K Sbjct: 217 EEQ-DAYPTLNELKATIEQERFEIWCSAELNKKISKGK 253 >UniRef50_C2CF25 Restriction modification system DNA specificity domain protein n=2 Tax=Clostridiales Family XI. Incertae Sedis RepID=C2CF25_9FIRM Length = 495 Score = 89.0 bits (219), Expect = 3e-16, Method: Compositional matrix adjust. 
Identities = 100/448 (22%), Positives = 183/448 (40%), Gaps = 80/448 (17%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDD-YLPLIRANNIQNGKFDTTDLVFVPK 64 +PE W + V I G K A + LK++ +P I A N+++G D +L+++ Sbjct: 67 IPESWKWVRLGDVFQFINGDRGKNYPAKSKLKENGDIPFISAINLKDGTVDENNLLYLDI 126 Query: 65 NLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGF 121 N + S K+ DIV+ + + +GK+ PFE + + +LR K I F Sbjct: 127 NQYERLGSGKLLKNDIVLCI----RGSLGKNCI--YPFEKGAIASSLVILRNYKKIKLEF 180 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ S L+ ++ G N+ + I +P+PPL EQ+ I EK++ L+ VD Sbjct: 181 VLNYLNSYLFYSETKKYDNGTAQPNLSAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDK 240 Query: 182 TKARFEQIPQILKRF----RQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE-------- 229 ++ + + K+F ++++L A+ G+L E+ + +F+ + E Sbjct: 241 YGKNWQMLEDLNKKFPEDLKKSLLQEAIKGRLVEQRKEEGTGEELFELIKEEKNKLIKEG 300 Query: 230 ------------------------------SILTELRNGLSSKPNESGVGHPILRISSVR 259 I +L +G P + G P L + + Sbjct: 301 KIKKQKPLPEITEEEIPFDIPESWKWVRLGEITLKLTDGAHKTPTYTNEGIPFLSVKDIS 360 Query: 260 AGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQN---- 313 +G +D + RF+ E + R + GDLL T+ VG G+ + Sbjct: 361 SGKIDYSSCRFISKKEHDKLFERCNPERGDLLLTK-------VGTTGIPVVIDTDEEFSL 413 Query: 314 ------LLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 L +P KLI K +SP + + + G K +DI + Sbjct: 414 FVSVALLKFPKKLINIYFLKH--------LINSPLVQVQVKENTRGV-GNKNWVMRDIAN 464 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTI 395 ++ LPP+ EQ +V ++E+L + + Sbjct: 465 TIIPLPPLAEQKRLVEKLEELLPLCEQV 492 Score = 44.3 bits (103), Expect = 0.011, Method: Compositional matrix adjust. 
Identities = 59/224 (26%), Positives = 98/224 (43%), Gaps = 38/224 (16%) Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEF 299 SK E+G P + +++ G VD+N++ +L+ ++ E L KL D++ GSL Sbjct: 95 SKLKENG-DIPFISAINLKDGTVDENNLLYLDINQYERLGSGKLLKNDIVLC-IRGSL-- 150 Query: 300 VGVCGLLKKLQHQNLLYP-------DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 +N +YP L+ R K E++ + +S + Sbjct: 151 -----------GKNCIYPFEKGAIASSLVILRNYKKIKLEFVLNYLNSYLFYSETKKYDN 199 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN----- 407 T+ Q +S ++ K ++ LPP+KEQ IV ++E L D K L +N Sbjct: 200 GTA-QPNLSAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDKYGKNW-QMLEDLNKKFPE 257 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 +L +S+L +A +G L Q R E E + L E IK E+ Sbjct: 258 DLKKSLLQEAIKGRLVEQ-RKE-------EGTGEELFELIKEEK 293 >UniRef50_A5UR98 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A5UR98_ROSS1 Length = 392 Score = 88.2 bits (217), Expect = 6e-16, Method: Compositional matrix adjust. Identities = 112/438 (25%), Positives = 190/438 (43%), Gaps = 63/438 (14%) Query: 1 MSAGKLPEGWVIAPVSTVTTLI--RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD 58 M +LP+GW + T+ T+ +G++ K+ +A N +P+ AN + G DT+ Sbjct: 2 MERWELPKGWGWKRLKTLVTVNYGKGLSEKQRKAGN------VPVYGANGVV-GFHDTS- 53 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKL 116 I+ ++ GS V S P + +F F +L P+ Sbjct: 54 --------------ITKGQTIVIGRKGSAGAVNWSEIACWPIDTTFFIDEFPEILYPQ-- 97 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-------PLAEQKIIA 169 F+ F +S +I L A I + + +PIP LAEQ+ I Sbjct: 98 ----FLYQFLRS----QQIDRLQQSAAIPGLNRDVLYSVEVPIPYPDDPAHSLAEQRRIV 149 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 +L+ LL + T+A E I Q ++R V+ A L E + N P + K ++ Sbjct: 150 ARLELLLGE---TRAMREDI-QAMRRDLAQVMESA----LAEVFPN--PNGEMPKGWGWK 199 Query: 230 SI--LTELRNGLSSKPN--ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 SI L EL+ G S P + P LR ++ G VD +D+ ++ +E E+ R KL+ Sbjct: 200 
SIDDLFELQQGASMSPRRRQGRNPQPFLRTKNILWGEVDTSDVDVMDFTEDEIERLKLRK 259 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSAR 344 GDLL VG + + Q ++Y + + R R + DA P++ + + Sbjct: 260 GDLLICEGGD----VGRAAVWED-QLPLVMYQNHIHRLRRKSDDADPKFYVYWMKAAYQL 314 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + ++ + +SG+ +K+ +V + EQ IV +E + ++ + L Sbjct: 315 FKIYQGEESRTAIPNLSGRRLKNFLVPTTSLTEQRRIVAYLEHIAEEIRAMDDLLAQDLR 374 Query: 405 RVNNLTQSILAKAFRGEL 422 + L QSILA AFRGE+ Sbjct: 375 DIEVLEQSILAAAFRGEV 392 >UniRef50_Q1R1F8 Restriction modification system DNA specificity protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R1F8_CHRSD Length = 538 Score = 87.8 bits (216), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 67/235 (28%), Positives = 116/235 (49%), Gaps = 20/235 (8%) Query: 223 FKKLNFESILTELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFLECSESELNR 280 +K +N +I +E+ G++ + +P LR+++V A ++ +DI F+ + E R Sbjct: 302 WKWINLGNI-SEISGGITKNQKRQSLPQKNPFLRVANVYANKLELDDIHFIGTTPDEAKR 360 Query: 281 HKLQDGDLLFTRYNGSLEFVGVC----GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 KL+ DLL NGS + +G G ++ HQN LIR+RL +++ Sbjct: 361 AKLKKDDLLIVEGNGSPDQIGRVAKWDGSIEHCTHQN-----HLIRSRLASPISADFVLH 415 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 F S + R A+ +TSG +S ++ + + EQ IV ++E + D +E Sbjct: 416 FLLSATGRKAIKKVASSTSGLYTLSLAKVEKLCIPVCSKNEQMMIVDQLESRLSQLDQLE 475 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 + + ++ + L QSIL +AF G L Q +PD + A+ LL +I+AER Sbjct: 476 RTLTASMKQAEALKQSILKRAFAGRLVPQ----DPD----DEPASELLARIRAER 522 Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust. 
Identities = 56/169 (33%), Positives = 87/169 (51%), Gaps = 4/169 (2%) Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 DGD+LF + +E G LL L + + +RLT+ ++ FF S S R Sbjct: 85 DGDILFAKITPCME-NGKVALLSNLTNGVGFGSTEFHVSRLTEAVEKKFYFYFFVSKSFR 143 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + ++GQ ++ + V L P +EQ IV ++E+LF+ D+ + + A A Sbjct: 144 KQAQANMAGSAGQLRVTTDYFSNVSVPLCPTREQQRIVTKIEELFSEIDSGVESLKTAQA 203 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 ++ QS+L AF G+LT QWR +N D + S ALLE+I+AER A Sbjct: 204 KLKTARQSLLKAAFEGKLTEQWRKDNAD---RQESPEALLERIQAEREA 249 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 50/228 (21%), Positives = 105/228 (46%), Gaps = 6/228 (2%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + ++ + G+T +++ K+ P +R N+ K + D+ F+ Sbjct: 297 ELPEGWKWINLGNISEISGGITKNQKRQSLPQKN---PFLRVANVYANKLELDDIHFIGT 353 Query: 65 NLVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGFI 122 + + K+ +D++I +GS +G+ A E C+ R I + F+ Sbjct: 354 TPDEAKRAKLKKDDLLIVEGNGSPDQIGRVAKWDGSIEHCTHQNHLIRSRLASPISADFV 413 Query: 123 AHFTKSSLYRNKISSL-SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 HF S+ R I + S+ + + + A + + IP+ EQ +I ++L++ L+Q+D Sbjct: 414 LHFLLSATGRKAIKKVASSTSGLYTLSLAKVEKLCIPVCSKNEQMMIVDQLESRLSQLDQ 473 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 + + + +Q++L A G+L + + EP + ++ E Sbjct: 474 LERTLTASMKQAEALKQSILKRAFAGRLVPQDPDDEPASELLARIRAE 521 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust. 
Identities = 26/94 (27%), Positives = 47/94 (50%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + F S + ++++ A + F +++P+ P EQ+ I K++ L +++D Sbjct: 133 YFYFFVSKSFRKQAQANMAGSAGQLRVTTDYFSNVSVPLCPTREQQRIVTKIEELFSEID 192 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 S + LK RQ++L A GKLTE+WR Sbjct: 193 SGVESLKTAQAKLKTARQSLLKAAFEGKLTEQWR 226 >UniRef50_Q5YW32 Putative restriction-modification system specificity determinant n=1 Tax=Nocardia farcinica RepID=Q5YW32_NOCFA Length = 394 Score = 86.7 bits (213), Expect = 2e-15, Method: Compositional matrix adjust. Identities = 85/306 (27%), Positives = 146/306 (47%), Gaps = 21/306 (6%) Query: 121 FIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ ++ + K+ LS GA N +K F + IP+PP+ EQ+ IA LD Sbjct: 106 YLLRLVQTRDFYRKVQDLSFGATNRQRVKEEEFLRLRIPLPPIEEQRRIAAILD----HA 161 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 D+ +A+ + L Q++ + + RN+ P +V +F +N + Sbjct: 162 DALRAKRREALARLDELTQSIFIDMFGDPVANE-RNW-PFGTVG---DFVDRFEGGKNIV 216 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 S +S G+ +L++S+V + +++ + L + H +Q GDLLF+R N S E Sbjct: 217 GS--GDSTDGYRVLKVSAVTSLSYRESESKPLPEGYVPPSNHIVQRGDLLFSRANTS-EL 273 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKD--ALPEYIEIFFSSPSARNAMMNCVKTTSG- 356 VG L+ + + L PDKL R + A+P Y+ F PS R + + +SG Sbjct: 274 VGATALVTETDGRTAL-PDKLWRFKWKNRTAAVPGYVAALFQRPSFRQTISDRATGSSGS 332 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 K IS + S + +PPV E+ + E + D+++ ALA ++ L S+ ++ Sbjct: 333 MKNISQSKVLSIPLGIPPV----ELQEKFESVRVEVDSMKNSNRIALAELDALFASLQSR 388 Query: 417 AFRGEL 422 AFRGEL Sbjct: 389 AFRGEL 394 >UniRef50_B5VW93 Restriction modification system DNA specificity domain n=1 Tax=Arthrospira maxima CS-328 RepID=B5VW93_SPIMA Length = 407 Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust. 
Identities = 81/311 (26%), Positives = 141/311 (45%), Gaps = 19/311 (6%) Query: 118 FSGFIAHFTKSSL--YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 F +F K +L + ++I+SLS G+ + I P ++EQK I LD Sbjct: 106 FKAIEPNFLKYALIAFVDEINSLSHGSTYKALPIEKLKKHKIYKPSISEQKRIVAILDEA 165 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 +D+ A ++ + ++ L G K + W V KKL I ++ Sbjct: 166 FEGIDAAIANTQKNLANARELFESYLNGIFTRK-GDGW--------VEKKLG--EICHKV 214 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 G SSK G P++R+ +++ +D D+ + + E+NR+ LQ D+LF R N Sbjct: 215 EYGSSSKSQPEG-DIPVIRMGNIQNNMIDWTDLVYT-SNPDEINRYLLQYNDVLFNRTN- 271 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTT 354 S + VG + K + ++ LIR KD + P+++ + + R + + + Sbjct: 272 SADHVGKSAIYK--GEKPAIFAGYLIRVHYKKDVIDPDFLNFYLNCYKTREYGKSVMSRS 329 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 Q I+G +K+ + P + Q +I++++ LF +E L + L QSIL Sbjct: 330 VNQVNINGTKLKNYPIYHPDLYTQKQIIKKLYFLFRETQRLETIYRRKLEALQELKQSIL 389 Query: 415 AKAFRGELTAQ 425 KAF GELT + Sbjct: 390 QKAFTGELTNE 400 Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 52/180 (28%), Positives = 79/180 (43%), Gaps = 19/180 (10%) Query: 41 LPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF 100 +P+IR NIQN D TDLV+ N + ++ + + V+ + S VGKSA Sbjct: 228 IPVIRMGNIQNNMIDWTDLVYTS-NPDEINRYLLQYNDVLFNRTNSADHVGKSAIYKGEK 286 Query: 101 ECSFGAFC-------GVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASF 152 F + V+ P+ F F + K+ Y + S S NIN K Sbjct: 287 PAIFAGYLIRVHYKKDVIDPD---FLNFYLNCYKTREYGKSVMSRSVNQVNINGTK---- 339 Query: 153 DLINIPI--PPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 L N PI P L QK I +KL L + + + + + L+ +Q++L A G+LT Sbjct: 340 -LKNYPIYHPDLYTQKQIIKKLYFLFRETQRLETIYRRKLEALQELKQSILQKAFTGELT 398 >UniRef50_C9P132 Type I restriction-modification system specificity subunit S n=1 Tax=Vibrio metschnikovii CIP 69.14 RepID=C9P132_VIBME Length = 405 Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 99/416 (23%), Positives = 183/416 (43%), Gaps = 37/416 (8%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 WV P++ L G+TY + ++ + +IR++N++NG+ D V+V +V Sbjct: 19 WVEKPLNHEVELFSGLTYSPKD----IRKQGVFVIRSSNVKNGQIVQADNVYVNPEVVNC 74 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLR---PEKLIFSGFIAHF 125 S + DI++ + +GS++++GK A L GAF +R PE FI Sbjct: 75 S-NVQKGDIIVVVRNGSRALIGKHAQVNSLMDNTVIGAFMTGVRAGHPE------FINAL 127 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 + + ++ + GA IN I +F+ + P EQ I L + ++ + + Sbjct: 128 FDTDKFTAQVEK-NLGATINQITNGAFNGMVFMFPEGQEQTAIGNTFQKLDSLINQHQKK 186 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 +++ I K + + +++ F + V K LN E EL +GL+ P + Sbjct: 187 HDKLSNIKKAMLEKMFPKPGETTPEIRFKGFSGEW-VEKPLNHE---VELFSGLTYSPKD 242 Query: 246 -SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G ++R S+V+ G + Q D ++ + +N +Q GD++ NGS +G Sbjct: 243 IRKQGVFVIRSSNVKNGQIVQADNVYV--NPEVVNCSNVQKGDIIVVVRNGSRALIG--- 297 Query: 305 LLKKLQHQNLLYPDKLIRARLT--KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 K N L + +I A 
+T + PE+I F + + + T Q I+ Sbjct: 298 ---KHAQVNSLMDNTVIGAFMTGVRAGHPEFINALFDTDKFTAQVEKNLGATINQ--ITN 352 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 V + P +EQ I ++L D++ Q + ++NN+ Q+ L+K F Sbjct: 353 GAFNGMVFMFPEGQEQTAIGNTFQKL----DSLINQHQQQITKLNNIKQACLSKMF 404 >UniRef50_C1PCQ5 Restriction modification system DNA specificity domain protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1PCQ5_BACCO Length = 483 Score = 84.7 bits (208), Expect = 6e-15, Method: Compositional matrix adjust. Identities = 96/393 (24%), Positives = 167/393 (42%), Gaps = 55/393 (13%) Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 Q ++ ++I++ + + V K + H F V+ K I+S ++ + KS Sbjct: 82 QLVNKDEILLCKINPRINRVWKVLNNHGKFRQLASTEWIVISENKAIYSEYLLYLLKSPY 141 Query: 131 YRNKISS--LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 +R I+S G ++ +P + I +PP+ EQK IA+K++ LL+++D K E+ Sbjct: 142 FRKLITSNVSGVGGSLTRARPKEVETYPIAVPPIKEQKRIADKVERLLSKIDEAKRLIEE 201 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWR----NFEPQHSVF----------KKLNFESILTE 234 + + R A+L A G+LT KWR N E S++ +K++ E + + Sbjct: 202 AKETFELRRAAILDKAFRGELTRKWREENKNIEDAESLYVKIKESQSIRRKVSKEINIKD 261 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 LR + S +G + ++ +G + I E N ++ G++ + N Sbjct: 262 LRYSIPSTWKWVRLGD----VFTITSGGTPKRTI----PEYYEGNIPWIKTGEIKWNAIN 313 Query: 295 GSLEFV---GVCGLLKKLQHQNL----LYPDKLIRAR-----------------LTKDAL 330 S E + V KL N +Y L R R L D + Sbjct: 314 ESEEQITPEAVANSSAKLLPPNTVLVAMYGQGLTRGRAAILSVEATCNQAVCALLPNDYI 373 Query: 331 -PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 PE+I +F R V Q+ +S I + LPP++EQ I+ ++ +F Sbjct: 374 APEFIFYYFMEGYQR---FRQVAKGGNQENLSVSLISDFIFPLPPLEEQRVIITTLQNIF 430 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 I+ + + + QSIL+KAFRGEL Sbjct: 431 KKESKIKDVIK---INTDEIKQSILSKAFRGEL 460 Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust. 
Identities = 40/124 (32%), Positives = 61/124 (49%), Gaps = 17/124 (13%) Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISG-------KDIKSQVVLLPPVKEQAEIVRR 384 EY+ SP R + TS G+ G K++++ + +PP+KEQ I + Sbjct: 131 EYLLYLLKSPYFRKLI------TSNVSGVGGSLTRARPKEVETYPIAVPPIKEQKRIADK 184 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 VE+L + D ++ + A +IL KAFRGELT +WR EN ++ A +L Sbjct: 185 VERLLSKIDEAKRLIEEAKETFELRRAAILDKAFRGELTRKWREENKNI----EDAESLY 240 Query: 445 EKIK 448 KIK Sbjct: 241 VKIK 244 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 46/205 (22%), Positives = 87/205 (42%), Gaps = 11/205 (5%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P W + V T+ G T K+ I + +P I+ I+ + ++ P+ Sbjct: 266 IPSTWKWVRLGDVFTITSGGTPKR--TIPEYYEGNIPWIKTGEIKWNAINESEEQITPEA 323 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + S K+ P + V+ G G++A + C+ A C +L P I FI ++ Sbjct: 324 VANSSAKLLPPNTVLVAMYGQGLTRGRAAILSVEATCN-QAVCALL-PNDYIAPEFIFYY 381 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 R ++ G N N+ + P+PPL EQ++I T L + +++ Sbjct: 382 FMEGYQR--FRQVAKGGNQENLSVSLISDFIFPLPPLEEQRVII----TTLQNIFKKESK 435 Query: 186 FEQIPQI-LKRFRQAVLGGAVNGKL 209 + + +I +Q++L A G+L Sbjct: 436 IKDVIKINTDEIKQSILSKAFRGEL 460 >UniRef50_Q8RJG0 HsdS n=12 Tax=Campylobacter jejuni RepID=Q8RJG0_CAMJE Length = 417 Score = 84.7 bits (208), Expect = 7e-15, Method: Compositional matrix adjust. 
Identities = 101/421 (23%), Positives = 185/421 (43%), Gaps = 17/421 (4%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + + ++ G T K Y KD P + ++ + G F + K Sbjct: 10 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKD--YPFFKPSDFEQGYFLENAGDNLSKL 67 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 +++++ P+ I++ + GS +GK A + C+ + P K I S +I ++ Sbjct: 68 GFDKARQLPPKTILV-VCIGS---LGKVALTRVIGSCN--QQINAIIPHKNIISEYIYYY 121 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 SS +++ + S + + F + I P + EQ+ I LD A++D + Sbjct: 122 CISSKFQSILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIK 181 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLS-SK 242 EQ L Q+ L A N N++ PQ +K L I ++NG + SK Sbjct: 182 ILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLG--EISNLIQNGFAASK 239 Query: 243 PNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 NE G+ LR ++ G+++ + + ++ + + ++ D+LF N S E VG Sbjct: 240 NNEIPSGYVHLRTHNISTDGNLNFDTLIKIKREFIKEKQSFIEKNDILFNNTN-STELVG 298 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 L+ Q+ N + + L + +L + + +F GQ GI+ Sbjct: 299 KTALVT--QNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGIN 356 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 +K + LPP+KEQ +I + ++ +F +++ L L QS+L KAF+GE Sbjct: 357 IDKLKKIQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGE 416 Query: 422 L 422 L Sbjct: 417 L 417 Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 56/215 (26%), Positives = 100/215 (46%), Gaps = 21/215 (9%) Query: 5 KLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQ---NGKFDTTDLV 60 KLP+GW + ++ LI+ G K N + Y+ L R +NI N FDT L+ Sbjct: 214 KLPQGWEWKSLGEISNLIQNGFAASKN---NEIPSGYVHL-RTHNISTDGNLNFDT--LI 267 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + +KE Q ++ ++ ++ S +VGK+A + +F ++ + S Sbjct: 268 KIKREFIKEKQSFIEKNDILFNNTNSTELVGKTALVTQNYNYAFSNHLTKIKLKNQYNSK 327 Query: 121 FIAHFTKSSL---YRNKISSL---SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + + L Y KI +G NI+ +K I IP+PPL EQ+ IA+ LD Sbjct: 328 LVVFYFVLLLKNKYFEKICHQWIGQSGINIDKLKK-----IQIPLPPLKEQEQIAKHLDF 382 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + + + K + + + + +Q++L A G+L Sbjct: 383 VFEKTKALKELYTKELKDYEELKQSLLNKAFKGEL 417 >UniRef50_B5W475 Restriction modification system DNA specificity domain n=1 Tax=Arthrospira maxima CS-328 RepID=B5W475_SPIMA Length = 493 Score = 84.7 bits (208), Expect = 7e-15, Method: Compositional matrix adjust. Identities = 69/218 (31%), Positives = 117/218 (53%), Gaps = 28/218 (12%) Query: 249 GHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVG-----V 302 G PI+R +V+ G + +I+++ E + L R +L ++L F+G V Sbjct: 288 GIPIIRAQNVQMGKFIETNIKYISEDVSNYLERSQLHGREVLMV-------FIGAGTGNV 340 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPE-YIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 C L + + L P+ A++ D + Y+ ++ S +N + + +K+T+ Q +S Sbjct: 341 C--LAPQERRWHLAPNV---AKIDVDEISSNYLCLYLQSSIGQNYVDSWIKSTA-QPSLS 394 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + I+ +V L P++EQ EIVRRVE+LF D IE++ A ++ L ++ L+KAFRGE Sbjct: 395 METIRKIIVFLSPLEEQKEIVRRVEKLFKAIDLIEQEHQKASKLLDRLEKATLSKAFRGE 454 Query: 422 LTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 L Q +P+ + AA LLE+I+AER +KA Sbjct: 455 LVPQ----DPN----DEPAAVLLERIQAERQTQPKRKA 484 Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 47/151 (31%), Positives = 73/151 (48%), Gaps = 26/151 (17%) Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 GV G+ + +Q+ L R RL DA + ++ T S + I Sbjct: 92 GVAGIRPRNVNQDWL------RYRLIGDA----------------SALDAAGTGSTFRQI 129 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + S + LPP+ EQ IV ++++LFA + +++ V Q++LA AFRG Sbjct: 130 DKQTLVSWNINLPPLNEQRRIVAKLDRLFARSRCAREELGRVSRLVQRYKQAVLAAAFRG 189 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAER 451 +LTA WRAENPD+ A+ LL +I R Sbjct: 190 DLTADWRAENPDV----EPASELLRQILIRR 216 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. Identities = 53/230 (23%), Positives = 107/230 (46%), Gaps = 14/230 (6%) Query: 6 LPEGWVIAPVSTV--TTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 LP+ W + + + T + G Y K N + +P+IRA N+Q GKF T++ ++ Sbjct: 254 LPKTWAVTNIDYLAHVTKLAGFEYTKHFKTNDVAG--IPIIRAQNVQMGKFIETNIKYIS 311 Query: 64 K---NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFS 119 + N ++ SQ E +++ + +G+ +V P E + V + + I S Sbjct: 312 EDVSNYLERSQLHGREVLMVFIGAGTGNVC------LAPQERRWHLAPNVAKIDVDEISS 365 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + +SS+ +N + S ++ + I + + PL EQK I +++ L + Sbjct: 366 NYLCLYLQSSIGQNYVDSWIKSTAQPSLSMETIRKIIVFLSPLEEQKEIVRRVEKLFKAI 425 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 D + ++ ++L R +A L A G+L + N EP + +++ E Sbjct: 426 DLIEQEHQKASKLLDRLEKATLSKAFRGELVPQDPNDEPAAVLLERIQAE 475 Score = 57.0 bits (136), Expect = 2e-06, Method: Compositional matrix adjust. 
Identities = 34/112 (30%), Positives = 53/112 (47%), Gaps = 1/112 (0%) Query: 135 ISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILK 194 + + G+ I + NI +PPL EQ+ I KLD L A+ + ++ ++++ Sbjct: 117 LDAAGTGSTFRQIDKQTLVSWNINLPPLNEQRRIVAKLDRLFARSRCAREELGRVSRLVQ 176 Query: 195 RFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 R++QAVL A G LT WR P +L IL + + K NES Sbjct: 177 RYKQAVLAAAFRGDLTADWRAENPDVEPASEL-LRQILIRRKQRYNEKYNES 227 >UniRef50_B1LRG3 Type I restriction modification DNA specificity domain protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRG3_ECOSM Length = 428 Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 67/281 (23%), Positives = 129/281 (45%), Gaps = 15/281 (5%) Query: 153 DLINIPIP--PLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 DL+ IP+P ++ QK ++ LD ++DS + ++LK RQA++ V L Sbjct: 146 DLLEIPVPLIDISLQKQVSTFLDRETQRIDSLIEEKQTFIKLLKEKRQALISHVVTKGLY 205 Query: 211 E---------KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAG 261 +W P+H KK+ + I + G S N+S VG+P+LRI ++++ Sbjct: 206 PNVEMQDSGIEWIGQVPKHWEVKKI--KHICSNFMYGTSQDCNQSDVGYPVLRIPNIKST 263 Query: 262 HVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI 321 +VD D+++ S+ + + L GD+L R NG+ VG L + L+ LI Sbjct: 264 NVDFEDLKYANISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDS--NGQYLFASYLI 321 Query: 322 RARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 + + ++ +S S R A+ +T+ G +S + + + +PP+ EQ I Sbjct: 322 KLTPKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAIPPIDEQKTI 381 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + D + ++ + ++ + S++ A G++ Sbjct: 382 TNYLSAATINIDLLIQETDKSIDLLKEHRTSLINAAVTGKI 422 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 50/208 (24%), Positives = 95/208 (45%), Gaps = 6/208 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P+ W + + + + Y Q N Y P++R NI++ D DL + Sbjct: 219 GQVPKHWEVKKIKHICS---NFMYGTSQDCNQSDVGY-PVLRIPNIKSTNVDFEDLKYAN 274 Query: 64 KNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + V + +S DI++ ++G+ ++VG+SA + F ++ L P++ + + F+ Sbjct: 275 ISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDSNGQYLFASYLIKLTPKQGVDTSFL 334 Query: 123 AHFTKSSLYRNKISSLSAGANIN-NIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 S R ++ S + N N+ S +I IPP+ EQK I L +D Sbjct: 335 VEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAIPPIDEQKTITNYLSAATINIDL 394 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++ +LK R +++ AV GK+ Sbjct: 395 LIQETDKSIDLLKEHRTSLINAAVTGKI 422 >UniRef50_B7JRE7 Restriction modification system DNA specificity domain protein n=2 Tax=Bacillus cereus RepID=B7JRE7_BACC0 Length = 495 Score = 84.3 bits (207), Expect = 8e-15, Method: Compositional matrix adjust. Identities = 112/496 (22%), Positives = 216/496 (43%), Gaps = 76/496 (15%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD-TTDLVFVP 63 ++P W+ +++++ LI ++ K++ P++ NI +G+ + TD Sbjct: 25 EVPGNWIWGNLNSLSKLIVDGSHNPPPK----KNEGFPMLSGRNILDGEINFETDRYVSE 80 Query: 64 KNLVKESQK--ISPEDIVIAM--SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + KE ++ I D+++ + + G +VV K + PF +++P ++ S Sbjct: 81 DDYQKEYKRTPIESNDVLLTIVGTIGRTTVVPK---EFSPF--VLQRSVALIKP--MVNS 133 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +++++ S ++ + + G + + IP+PPL EQK I EK++ LL +V Sbjct: 134 NYLSYYFSSPYFQYYLQKNAKGTAQKGVYLKTLKSSRIPLPPLMEQKRITEKVEGLLGRV 193 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN----FEPQHSVFKKLNFE------ 229 + KA E+ + + R +L A G+L+ KWR E S+ +++ + Sbjct: 194 EEAKALIEEAKKTFEVRRATILDKAFRGELSAKWREDNRIAEDASSLLERIQIQKRNSSI 253 Query: 230 --------SILT-----ELRNG------------LSSKPNE-----SGVGHPILRISSVR 259 S++ EL NG ++S + S G +R + Sbjct: 254 KSNTLKITSVIKEEEPFELPNGWTWVRLGEISYYVTSGSRDWSKYYSDEGAMFIRTQDIN 313 Query: 260 
AGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD 318 ++ +D+ ++ E E R ++ D+L T + VG C L++ + Sbjct: 314 KNSLNLSDVAYVSLPEKVEGKRSLVEKADILTTITGAN---VGKCALVET-NIKEAYVSQ 369 Query: 319 KLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQ 378 + +L + ++ +Y+ + SP + G+ +S +DIK+ + L P+ EQ Sbjct: 370 SVALTKLIEKSISKYVHLSLLSPCGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPMAEQ 429 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALA---RVNNLTQSILAKAFRGELTAQWRAENPDLIS 435 IV+ VE L EK+ N + + L QSIL KAFRGEL +P+ Sbjct: 430 QVIVKLVETLLEN----EKESLNLASIEKHLETLKQSILNKAFRGELGTN----DPN--- 478 Query: 436 GENSAAALLEKIKAER 451 E S+ LL+K+ E+ Sbjct: 479 -EESSMKLLKKVLQEK 493 Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust. Identities = 65/242 (26%), Positives = 111/242 (45%), Gaps = 22/242 (9%) Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQ 284 N S+ + +G + P + G P+L ++ G ++ R++ + + E R ++ Sbjct: 34 NLNSLSKLIVDGSHNPPPKKNEGFPMLSGRNILDGEINFETDRYVSEDDYQKEYKRTPIE 93 Query: 285 DGDLLFTRYNGSLEFVGVCG----LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 D+L T VG G + K+ L LI+ + + Y+ +FSS Sbjct: 94 SNDVLLT-------IVGTIGRTTVVPKEFSPFVLQRSVALIKPMVNSN----YLSYYFSS 142 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 P + + K T+ QKG+ K +KS + LPP+ EQ I +VE L + + + Sbjct: 143 PYFQYYLQKNAKGTA-QKGVYLKTLKSSRIPLPPLMEQKRITEKVEGLLGRVEEAKALIE 201 Query: 401 NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKAS 460 A +IL KAFRGEL+A+WR +N A++LLE+I+ ++ S K + Sbjct: 202 EAKKTFEVRRATILDKAFRGELSAKWREDN----RIAEDASSLLERIQIQKRNSSIKSNT 257 Query: 461 RK 462 K Sbjct: 258 LK 259 Score = 42.7 bits (99), Expect = 0.032, Method: Compositional matrix adjust. 
Identities = 53/234 (22%), Positives = 97/234 (41%), Gaps = 17/234 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV-- 62 +LP GW + ++ VT Y D+ IR +I + +D+ +V Sbjct: 271 ELPNGWTWVRLGEISYY---VTSGSRDWSKYYSDEGAMFIRTQDINKNSLNLSDVAYVSL 327 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 P+ + + + DI+ ++ + VGK A + ++ + L KLI I Sbjct: 328 PEKVEGKRSLVEKADILTTITGAN---VGKCALVETNIKEAYVSQSVALT--KLIEKS-I 381 Query: 123 AHFTKSSLYR-----NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + + SL N++ + G + I IP+ P+AEQ++I + ++TLL Sbjct: 382 SKYVHLSLLSPCGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPMAEQQVIVKLVETLLE 441 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI 231 + I + L+ +Q++L A G+L N E + KK+ E I Sbjct: 442 N-EKESLNLASIEKHLETLKQSILNKAFRGELGTNDPNEESSMKLLKKVLQEKI 494 >UniRef50_C6JA10 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JA10_9FIRM Length = 393 Score = 84.3 bits (207), Expect = 9e-15, Method: Compositional matrix adjust. 
Identities = 91/406 (22%), Positives = 175/406 (43%), Gaps = 33/406 (8%) Query: 21 LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVI 80 ++ G +K E NY+ D + +IR N+Q G + VF P + + + E ++ Sbjct: 12 ILNGFAFKSE---NYV-DSGIRVIRIANVQKGYIEDNTPVFYPLETNELDKYMLEEGDLL 67 Query: 81 AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSA 140 +G+ V + +P + C L+ ++ + ++ H S+ + + S Sbjct: 68 MALTGNVGRVAILKKEFMPAALNQRVACLRLKTDR-VAKDYLFHVLNSAFFEQQCIQSSK 126 Query: 141 GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAV 200 G N+ IP+ P +Q++IA+ LD + S +++ ++K + Sbjct: 127 GVAQKNMSTEWLKDYEIPMYPKEQQELIADILDKTRNIIISRNYELKKLDDLIKARFVEM 186 Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE---SGVGHPILRISS 257 G A + W+ + +++V E +NG+ ++ G G PILRI Sbjct: 187 FGDAYLNEFG--WKKIKIKNAV---------TVEPQNGMYKPQSDYVTDGSGIPILRIDG 235 Query: 258 VRAGHV-DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 G V D + ++ L CSE+E ++ L + D++ R N S+E++G C + L ++ +Y Sbjct: 236 FYDGVVTDFSSLKRLRCSENERQKYLLYEDDVVINRVN-SIEYLGKCAHINGLL-EDTVY 293 Query: 317 PDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPV 375 ++R P Y+ S + ++N K Q I+ KD+ + PP+ Sbjct: 294 ESNMMRMHFDSTRFHPVYVCRLLCSRFVYDQIVNHAKQAVNQASINQKDVLDFDIYEPPL 353 Query: 376 KEQ---AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 K Q A+ VR V D + ++ AL + L S++ + F Sbjct: 354 KLQIQFADFVRAV-------DKSKVEIQKALDKTQMLFDSLMQEYF 392 Score = 42.0 bits (97), Expect = 0.050, Method: Compositional matrix adjust. 
Identities = 43/176 (24%), Positives = 84/176 (47%), Gaps = 20/176 (11%) Query: 233 TELRNGLSSKP-NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 ++ NG + K N G ++RI++V+ G+++ N F +EL+++ L++GDLL Sbjct: 10 CDILNGFAFKSENYVDSGIRVIRIANVQKGYIEDNTPVFYPLETNELDKYMLEEGDLLMA 69 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 VG +LKK + ++ RL D + + + F ++ C+ Sbjct: 70 LTGN----VGRVAILKK-EFMPAALNQRVACLRLKTDRVAK--DYLFHVLNSAFFEQQCI 122 Query: 352 KTTSG--QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 +++ G QK +S + +K + + P KEQ E++ AD ++K N ++R Sbjct: 123 QSSKGVAQKNMSTEWLKDYEIPMYP-KEQQELI---------ADILDKTRNIIISR 168 >UniRef50_D2LA90 Restriction modification system DNA specificity domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2LA90_9DELT Length = 543 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 86/341 (25%), Positives = 152/341 (44%), Gaps = 24/341 (7%) Query: 101 ECSFGAFCGVLR--PEKLIFSGFIAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINI 157 +C G G++R ++I F+ + S YRN + S + GA ++ I F I Sbjct: 218 DCCLGRRMGLVRFKTNEVIPKFFLYQYISPS-YRNFLDSKTIRGATVDRISIKEFPFFPI 276 Query: 158 PIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 IP + EQK I LD +D+ A E+ + ++ L R F Sbjct: 277 AIPSIEEQKRIVSILDDAFECIDTAIANTEKNIANARELFESYLD-----------RVFA 325 Query: 218 PQHSVFKKLNFESILT-ELRNGLSS-KPNESGVGHPILRISSVRAGHVDQNDIRFLECSE 275 + +++ N E IL+ + RNG S + S G P+L +SSV + +++ Sbjct: 326 EKGDGWEEKNLEDILSFQPRNGWSPPASHHSDRGTPVLTLSSVTGFQFKKEALKYTSAQV 385 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYI 334 + + +++GDLL TR N + E VG + + N +YPD +++ ++ K AL E++ Sbjct: 386 NPKAHYWVENGDLLMTRSN-TPELVGHVAVCDGVS-ANTIYPDLIMKMKVDKHIALTEFV 443 Query: 335 EIFFSSPSARNAMMN-CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYAD 393 S RN + + K + +++ + +P + Q IV + L + Sbjct: 444 YFQLRSSKLRNIIKDGATGANPTMKKVKKSTVQNLPLAMPALPVQQAIVDNLRNLNETSR 503 Query: 394 TIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLI 434 + K+ + + + L QS+L KAF GEL + NPD + 
Sbjct: 504 LLVKKCVSKVKALTRLKQSLLQKAFSGELPMDF---NPDAL 541 >UniRef50_B3G223 Type I restriction modification DNA specificity protein n=1 Tax=Pseudomonas aeruginosa RepID=B3G223_PSEAE Length = 395 Score = 84.0 bits (206), Expect = 1e-14, Method: Compositional matrix adjust. Identities = 76/313 (24%), Positives = 141/313 (45%), Gaps = 21/313 (6%) Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE 170 LR + + + +I H S+ + I + S+GA + + + IP+PPL EQK IA Sbjct: 103 LRARQEVDTSYIQHAMNSTDVQRFIQN-SSGATVGTYTISRANETEIPLPPLPEQKRIAA 161 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFES 230 LD + D+ + + +Q Q+ F +AV +T F Sbjct: 162 ILD----KADAIRRKRQQAIQLADDFLRAVFLDMFGDPVT--------NSKGFPIGTIRD 209 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLL 289 ++ G S+K +E+ +PILR+ ++ G +D ++++ E E +++ ++ GDLL Sbjct: 210 LVATADYGSSAKASETYGEYPILRMGNITYQGRIDLEGLKYINLEEKERSKYLVEKGDLL 269 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 F R N S E VG + + LIR R + YI + +S + + + Sbjct: 270 FNRTN-SKELVGKTAVYD--MDDPVAIAGYLIRVRPNEMGNSHYISGYLNSAHGKATLRS 326 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K+ G I+ +++++ ++LP + E+ R+ ++L + + AL L Sbjct: 327 ICKSIVGMANINAQEMQNIPIMLPSI----ELQRKYQELVVVTKCKLQVFDTALKLTEQL 382 Query: 410 TQSILAKAFRGEL 422 S+ KAF G+L Sbjct: 383 FSSLSYKAFSGQL 395 >UniRef50_A5KSY3 Restriction modification system DNA specificity domain n=5 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KSY3_9BACT Length = 335 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. 
Identities = 76/307 (24%), Positives = 143/307 (46%), Gaps = 26/307 (8%) Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + K +L S G + ++ I IP P EQK I K++ L +++D+ ++ Sbjct: 47 YVKYALNYVDYQSYVTGTTRLKLNQSALKRIIIPFPDENEQKRIVAKIEELFSEIDNAES 106 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 K + Q+++ F + + + F I E++ G++ Sbjct: 107 AITTASGYYKSYEQSIIDSL-----------FAKYEAEAEMVEFGDI-AEIKGGITKGRK 154 Query: 245 ESG--VGH-PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 G +G P LR+++V+ G++ ++I+ + + EL ++ L +GD+LFT G + +G Sbjct: 155 LRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKYSLMNGDILFTE-GGDKDKLG 213 Query: 302 ----VCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 G ++ HQN ++ RAR+ + +PEYI + AR+ ++ K T+ Sbjct: 214 RGTIWHGEIELCIHQNHIF-----RARVDSGQFVPEYISYATKTTRARDYFLSKAKQTTN 268 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 ++ +K+ + P+ +Q EIV + + + K++ A R L QSILAK Sbjct: 269 LASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVAHHRSKALRQSILAK 328 Query: 417 AFRGELT 423 AF+GEL Sbjct: 329 AFKGELV 335 >UniRef50_UPI0001AF5E36 restriction modification system DNA specificity subunit n=1 Tax=Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191 RepID=UPI0001AF5E36 Length = 192 Score = 83.6 bits (205), Expect = 1e-14, Method: Compositional matrix adjust. 
Identities = 52/162 (32%), Positives = 88/162 (54%), Gaps = 6/162 (3%) Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 +IL +++ G S K + + G+P+LRI ++ G +D DI++ ++SEL L DLL Sbjct: 3 NILHDIKYGTSQKCDYNISGYPVLRIPNIVKGIIDLADIKYGALTDSELKDLTLNKNDLL 62 Query: 290 FTRYNGSLEFVGVCGLLKKLQH--QNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNA 346 F R NGS VG L +QH ++ Y +IR RL + + YI + S R Sbjct: 63 FIRSNGSTNIVGQSTL---VQHDLKDHAYAGYIIRVRLHNEYINARYINMVMKSNLIREQ 119 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 + ++TT+G K I+ ++ +V LPP EQ I++++ ++ Sbjct: 120 IEGPIRTTTGVKNINSNELMGLLVPLPPKNEQGIIIKKINEI 161 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 44/193 (22%), Positives = 87/193 (45%), Gaps = 11/193 (5%) Query: 21 LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ----KISPE 76 ++ + Y Q +Y Y P++R NI G D D+ + + +S+ ++ Sbjct: 4 ILHDIKYGTSQKCDYNISGY-PVLRIPNIVKGIIDLADIKY---GALTDSELKDLTLNKN 59 Query: 77 DIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLR-PEKLIFSGFIAHFTKSSLYRNK 134 D++ S+GS ++VG+S QH + ++ + +R + I + +I KS+L R + Sbjct: 60 DLLFIRSNGSTNIVGQSTLVQHDLKDHAYAGYIIRVRLHNEYINARYINMVMKSNLIREQ 119 Query: 135 ISS-LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQIL 193 I + + NI + +P+PP EQ II +K++ + + + K + Q Sbjct: 120 IEGPIRTTTGVKNINSNELMGLLVPLPPKNEQGIIIKKINEIDTTLSNLKVSIQSAQQTQ 179 Query: 194 KRFRQAVLGGAVN 206 A+ A+N Sbjct: 180 VHLADALTDAAIN 192 >UniRef50_Q8TP22 Type I site-specific deoxyribonuclease n=1 Tax=Methanosarcina acetivorans RepID=Q8TP22_METAC Length = 290 Score = 83.2 bits (204), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 75/299 (25%), Positives = 145/299 (48%), Gaps = 37/299 (12%) Query: 135 ISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILK 194 I +A + ++ + I IP+PPL Q +K+ ++L + + TK Q ++ + Sbjct: 16 IEDRTAFVTVKHLSAKQLNTIKIPVPPLETQ----QKIVSILKKAEETKKLRAQADELTQ 71 Query: 195 RFRQAVL-----GGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG--LSSKPNESG 247 + Q+V VN K W+ K + I++ + G L+ KP Sbjct: 72 KLLQSVFLEMFGDPVVNPK---NWKEI-------KLKDVSEIVSGVTKGRKLAGKPT--- 118 Query: 248 VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 V P LR+++V+ G++D +I+ +E S++ ++ LQ GD+L T G + +G + Sbjct: 119 VFVPYLRVANVQDGYLDLTEIKEIEVLPSDVEKYALQGGDILLTE-GGDPDKLGRGAVWN 177 Query: 308 KLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + Q ++ + + R R+ ++ L PEY+ + S + + K T+G I+ +K Sbjct: 178 R-QIPTCIHQNHIFRVRVNRECLVPEYLSMLIGSTYGKMYFLKSAKQTTGIASINSTQLK 236 Query: 367 SQVVLLPPVKEQ---AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + L+ + Q AE+V ++E+ T+ +Q ++ ++NNL S++ KAF GEL Sbjct: 237 NFPALIASLDLQLRFAEMVHQIEK-----TTVSQQQSS--FKINNLFDSLMQKAFTGEL 288 >UniRef50_A5W9C1 Restriction modification system DNA specificity domain n=1 Tax=Pseudomonas putida F1 RepID=A5W9C1_PSEP1 Length = 561 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. 
Identities = 113/500 (22%), Positives = 194/500 (38%), Gaps = 109/500 (21%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVFV 62 +LP+GW + I G+ +K + + P+IR N+ +N +F+ T+ Sbjct: 82 ELPDGWAWCRIVDTGNYINGLAFKPSDWSSTGR----PIIRIQNLSGRNAEFNRTE---- 133 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSA----HQHLPFECSFGAFCGVLRPEKLIF 118 V S ++P DI+++ S+ + + + +QH+ F + P K++ Sbjct: 134 --REVDASVVVNPGDILVSWSATLDTFIWRGEQGVLNQHI-FRVT---------PSKIVS 181 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ K ++ S + G + +I F I +PPL EQ I K+ L+A Sbjct: 182 VQYLYWLLKWAIKVLADSEHAHGLVMAHINRGPFLAQPIGLPPLTEQNKIVVKIAELMAL 241 Query: 179 VD---------------------------STKARFEQIPQILKR--------------FR 197 D S A F Q Q L + Sbjct: 242 CDRLEARQADADSAHAQLVQALLGSLTQASDAADFAQSWQRLAEHFHTLFTTESSIDALK 301 Query: 198 QAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE---------------------------- 229 Q +L AV GKL + EP + K+++ E Sbjct: 302 QTLLQLAVMGKLVPQDSRDEPASELLKRVSEEKARLVAEGKLKKQKPLGDVAISDIPFDV 361 Query: 230 ----------SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 I GLS K + G P+L++ ++ G V + + L Sbjct: 362 PDNWAWSRIGEIALNTEYGLSEKTFDLQDGVPVLKMGDIQEGRVLLGGQMAVSKNTEGLP 421 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFF 338 L+ DLL+ R N S E VG G+ Q + LIR R K+ P Y+ I Sbjct: 422 GLYLETEDLLYNRTN-SAELVGKTGVFLG-QAGEYSFASYLIRIRCLKELFSPLYLNISM 479 Query: 339 SSPSARNAMMN-CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 ++P R +N +K GQ ++G +K+ +V +PP+ EQ IV +V+QL A + ++ Sbjct: 480 NAPGFRETQINPHLKQQCGQANVNGTIMKNMLVSVPPLPEQHRIVAKVDQLMALCEQLKT 539 Query: 398 QVNNALARVNNLTQSILAKA 417 ++N AL +L +++ +A Sbjct: 540 RLNQALQVHEHLASALVEQA 559 >UniRef50_B4RYU8 Type I site-specific deoxyribonuclease n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYU8_ALTMD Length = 360 Score = 82.4 bits (202), Expect = 4e-14, Method: Compositional matrix adjust. 
Identities = 80/366 (21%), Positives = 164/366 (44%), Gaps = 35/366 (9%) Query: 65 NLVKESQK-----ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 NL+K + + P D++IA + +G + S A V+ P I + Sbjct: 16 NLIKYTDDDKGTFVEPSDVIIAWDGANAGTIGYGLEGLIG---STLARLKVIIPH--IDT 70 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ F +S +I + GA I ++ + + +P+PPL QK IA +L + Sbjct: 71 NYLGRFLQSKF--KEIRNNCTGATIPHVSKVHLNSLLVPVPPLPIQKQIA----AVLEKA 124 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 D+ + + +Q+ Q L Q+V + + ++ K E + ++R+G+ Sbjct: 125 DNLRQQSQQMEQELNSLAQSV--------FLDMFGDYRKDAMSLKSSLGE--VADVRSGV 174 Query: 240 SSKPNESG---VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 + G P +R+++V+ G++D ++I+ + + +++L+ GD+L T G Sbjct: 175 TKGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKDFEKYQLKAGDVLMTE-GGD 233 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + +G G + Q N ++ + + R RL + E+ + +P + + C K T+ Sbjct: 234 FDKLGR-GAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQTPFVKQYFLKCAKKTTN 292 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 I+ +K + + +Q +R +++L A + +Q A A N+L Q + Sbjct: 293 LASINITQLKGLPIPDESIGKQQSFLRIIDELKALKEANFEQQEQANAHFNSLMQ----R 348 Query: 417 AFRGEL 422 AF+GEL Sbjct: 349 AFKGEL 354 >UniRef50_D2KHV4 Putative type I restriction-modification system specificity subunit S n=1 Tax=Helicobacter pylori RepID=D2KHV4_HELPY Length = 430 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. 
Identities = 56/171 (32%), Positives = 88/171 (51%), Gaps = 4/171 (2%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVPKN 65 P+G + + IRGVTYKK Q IN L+ + ++RANNI + + D+ + KN Sbjct: 13 PKGVEFRKLGDIGEYIRGVTYKKNQEINNLECG-IKVLRANNITLSNHLNFEDIKVINKN 71 Query: 66 L-VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + +++ Q + DI+I SGS +GK A + F+ FG F GV+R + + S F+ H Sbjct: 72 VKIRKEQYLKKNDILICAGSGSSEHIGKVAFINTDFDYVFGGFMGVIRIRE-VNSRFVYH 130 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 S++++ + INN+ IPIPPL Q+ I + LD Sbjct: 131 IFTSNIFKQYLEKSLNTTTINNLNANILQNFLIPIPPLEIQQEIVKILDAF 181 >UniRef50_UPI00016ADEAA restriction modification system DNA specificity domain n=1 Tax=Burkholderia pseudomallei B7210 RepID=UPI00016ADEAA Length = 387 Score = 82.0 bits (201), Expect = 5e-14, Method: Compositional matrix adjust. Identities = 54/162 (33%), Positives = 84/162 (51%), Gaps = 4/162 (2%) Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 +S+ G G P+LR+ +++ G V + +++L EL L++GDLLF R N S E Sbjct: 194 TSQKTGDGAGVPVLRMGNIQRGQVVFDSMKYLHDQLGELPDLYLREGDLLFNRTN-SYEL 252 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNC-VKTTSGQ 357 VG GL + + LIR RL + P Y+ ++ +S R + + +GQ Sbjct: 253 VGKTGLFSA-ESNRFSFASYLIRVRLIPNLTNPRYVNLYMNSIVCRRTQIEPQIVQQNGQ 311 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 +G +K V LPP+ EQA IV RVE+L A D + K++ Sbjct: 312 ANFNGSKLKHICVPLPPLAEQARIVARVEELRALCDGLRKRL 353 >UniRef50_B4TEJ6 Restriction modification system DNA specificity domain n=1 Tax=Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 RepID=B4TEJ6_SALHS Length = 380 Score = 81.6 bits (200), Expect = 6e-14, Method: Compositional matrix adjust. 
Identities = 83/348 (23%), Positives = 158/348 (45%), Gaps = 32/348 (9%) Query: 41 LPLIRANNIQNGKFDTT-DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP 99 +PLIR +I +GK +T + + K L+K+ D+++ M K L Sbjct: 32 VPLIRIRDILSGKTETYYEGSYDLKYLIKKG------DLLVGMDGDFNREYWKGTDALLN 85 Query: 100 FECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPI 159 C + + + F+ HF + L +KI + + + ++ I I + Sbjct: 86 QRV-----CKITPNPETLDKNFLYHFLQKEL--DKIHATTDVVTVKHLSVKKIQDIKIRL 138 Query: 160 PPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ 219 P L EQK IA LD + D+ + + EQ ++ F +A K E + Sbjct: 139 PSLKEQKRIAAILD----KADAIRQKREQAIKLADDFLRA--------KFLEMFGTPANN 186 Query: 220 HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESEL 278 F K ++ + G S+K + +PILR+ ++ G D D+++L+ S E Sbjct: 187 IHRFPKGTIRDLVDSVNYGTSAKASIDSGEYPILRMGNITYQGRWDFTDLKYLDLSVKEK 246 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 +++ +++GDLLF R N S E VG + + + + + + LIR R YI + Sbjct: 247 DKYLVKEGDLLFNRTN-SKELVGKTAVYE--EDRPMAFAGYLIRVRPNSIGNNYYISGYL 303 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP--VKEQAEIVRR 384 +S + +MN K+ G I+ +++++ +L+PP ++++ EI+ + Sbjct: 304 NSIHGKITLMNMCKSIVGMANINAQELQNIEILIPPKHLQDEYEIIYK 351 Score = 43.9 bits (102), Expect = 0.012, Method: Compositional matrix adjust. 
Identities = 44/182 (24%), Positives = 79/182 (43%), Gaps = 12/182 (6%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVPKNLVKESQK 72 P T+ L+ V Y + +Y P++R NI G++D TDL ++ ++ ++ + Sbjct: 191 PKGTIRDLVDSVNYGTSAKASIDSGEY-PILRMGNITYQGRWDFTDLKYLDLSVKEKDKY 249 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL----IFSGFIAHFTKS 128 + E ++ + SK +VGK+A +F + +RP + SG++ Sbjct: 250 LVKEGDLLFNRTNSKELVGKTAVYEEDRPMAFAGYLIRVRPNSIGNNYYISGYLNSIHGK 309 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPP---LAEQKIIAEKLDTLLAQVDSTKAR 185 N S+ ANIN I I IPP E +II +K+ L+ D + + Sbjct: 310 ITLMNMCKSIVGMANIN---AQELQNIEILIPPKHLQDEYEIIYKKIKKGLSIYDKSAMQ 366 Query: 186 FE 187 + Sbjct: 367 LQ 368 >UniRef50_A1SW07 Restriction modification system DNA specificity domain n=3 Tax=Bacteria RepID=A1SW07_PSYIN Length = 611 Score = 81.3 bits (199), Expect = 8e-14, Method: Compositional matrix adjust. Identities = 105/505 (20%), Positives = 193/505 (38%), Gaps = 108/505 (21%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTDLVFVP 63 LP GW + +T+ + G N D + +R+ N+ N K D T + Sbjct: 120 LPGGWAFERLGNLTSRL-GSGSTPRGGKNAYVDKGVIFLRSQNVWNDGLKLDDTAYISDE 178 Query: 64 KNLVKESQKISPEDIVIAMSSGS--------KSVVGKSAHQHLPFECSFGAFCGVLRPEK 115 + E+ ++ P D+++ ++ S K++V + QH+ V+R Sbjct: 179 THHKMENTRVFPNDVLLNITGASLGRSTIFPKALVTANVSQHVT----------VIRLIH 228 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 ++ S L + G I + + P+PPL EQ I K+D L Sbjct: 229 PSICQYLHLAIMSPLVQELAWGRQVGMAIEGLSKKVLEQFEFPVPPLEEQHRIVAKVDEL 288 Query: 176 L-----------AQVDSTK-----------------------ARFEQIPQIL-------K 194 + + +D+ K AR + IL Sbjct: 289 MLLCDLFEQKTESSIDAHKTLVEVLLTTLTDSKNSDELNKNWARVSEFFDILFTTEHSID 348 Query: 195 RFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE------------------------- 229 + +Q +L AV GKL + N EP + +++ E Sbjct: 349 QLKQTILQLAVMGKLVAQNENDEPASKLLERIAAEKETLIKDKKIKKQKALPPITDEEKP 408 Query: 230 ---------------SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS 274 S+ TE G S K E G P+L++ +++G V + + + Sbjct: 409 
FSVPSGWEWCRIYDASLFTEY--GTSEKAFEGNDGVPVLKMGDIQSGKVYHGGQKVVPST 466 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEY 333 +L L+ GD+L+ R N S E VG G+ + + LIR R + + P+Y Sbjct: 467 IKDLPNLYLKYGDILYNRTN-SAELVGKTGMFEG-DDDIFTFASYLIRIRCDFEKVAPQY 524 Query: 334 IEIFFSSPSARNAMMN-CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 + + +P + ++ VK GQ ++G +KS ++ +P + EQ IV +VE+L Sbjct: 525 LTLSMQTPLFKKTQIDPHVKQQCGQANVNGTIMKSMLISIPSLSEQYRIVNKVEELMTLC 584 Query: 393 DTIEKQVNNALARVNNLTQSILAKA 417 D ++ ++N + +L +++ +A Sbjct: 585 DQLKTRLNESQQSQLHLADALIEQA 609 >UniRef50_Q4HFD9 HsdS n=3 Tax=Campylobacterales RepID=Q4HFD9_CAMCO Length = 408 Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 103/433 (23%), Positives = 177/433 (40%), Gaps = 44/433 (10%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP-- 63 L +GW + + + G + NY++ +P + NI G FD +D+ ++ Sbjct: 4 LSQGWKWKSLGEICFITDGT----HKTPNYIETG-IPFLSVKNISKGFFDLSDVKYISLE 58 Query: 64 -KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 N + + K EDI+I +GK+ L FE S G+L+P+ I S ++ Sbjct: 59 EHNKLIKRAKPEFEDILICRIG----TLGKAIKISLEFEFSIFVSLGLLKPKVKIISDYL 114 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPI--PPLAEQKIIAEKLDTLLAQVD 180 +F S I+ G + K L PI PPL EQ+ I LD A++D Sbjct: 115 VYFLNSCFIEEWINDNKVGGGTHTAKLNLNILEKCPIALPPLKEQERIVGILDENFAKID 174 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGL 239 EQ L Q+ L A N N++ PQ +K L + E+ G Sbjct: 175 ENIKILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSL---GEIGEIITGT 231 Query: 240 S---SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLLFTRY 293 + + PN G +P+ + S + + I++ + S+L N L +L Sbjct: 232 TPSKNNPNFYGNEYPLFKPSDLNGDII----IKYASDNLSKLGFDNARNLPKDTILVVCI 287 Query: 294 NGSLEFVGVCGLLKKLQHQ-NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 S+ VG+ G+ Q N + P+ ++ +FF S N +K Sbjct: 288 GASIGKVGLSGVNGSCNQQINAIIPNSAFTSKY----------LFFVCLS--NYFQTILK 335 Query: 353 TTSGQKG---ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + 
Q I+ + + LPP+KEQ +I +++L ++ +++ + + L Sbjct: 336 KNASQTTLPIINKTEFSKLQIPLPPLKEQEQIASHLDELSSHVKNLKQNYQAQIKNLQEL 395 Query: 410 TQSILAKAFRGEL 422 S+L KAF+G L Sbjct: 396 KNSLLDKAFKGNL 408 Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 49/210 (23%), Positives = 96/210 (45%), Gaps = 17/210 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV--FV 62 KLP+GW + + +I G T K N+ ++Y PL + +++ NG D++ + Sbjct: 211 KLPQGWEWKSLGEIGEIITGTTPSKNNP-NFYGNEY-PLFKPSDL-NG-----DIIIKYA 262 Query: 63 PKNLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 NL K ++ + P+D ++ + G+ +GK + C+ + P S Sbjct: 263 SDNLSKLGFDNARNLPKDTILVVCIGAS--IGKVGLSGVNGSCN--QQINAIIPNSAFTS 318 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ S+ ++ + ++ + I F + IP+PPL EQ+ IA LD L + V Sbjct: 319 KYLFFVCLSNYFQTILKKNASQTTLPIINKTEFSKLQIPLPPLKEQEQIASHLDELSSHV 378 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K ++ + L+ + ++L A G L Sbjct: 379 KNLKQNYQAQIKNLQELKNSLLDKAFKGNL 408 >UniRef50_UPI00016B0992 probable type I restriction-modification system n=1 Tax=Burkholderia pseudomallei BCC215 RepID=UPI00016B0992 Length = 442 Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust. 
Identities = 88/383 (22%), Positives = 155/383 (40%), Gaps = 63/383 (16%) Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 + + P D VI++ S + +H F VLR I F A+ KS Sbjct: 80 KHVEPNDFVISLRSFQGGI------EHSAFGGCVSPAYTVLRATSKIAPDFWAYLLKSDT 133 Query: 131 YRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 Y + + +++ G + NI F + +P+P + EQ IA LD ++D+ A E++ Sbjct: 134 YISALQTVTDGIRDGKNISYMQFGALCVPVPNIDEQSAIAAFLDCETGKIDALIAEQEKL 193 Query: 190 PQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESI-LTELRNGL 239 +L RQA L AV L W P H V +++ S+ +T G Sbjct: 194 IALLAEKRQAALSYAVTRGLNPDAPMKDSGVAWLGEVPAHWVIRRVKSVSVFMTSGPRGW 253 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS---------ESELNRHKLQDGDLLF 290 S + ++ G S+ D ND +E ++E R +L +GD++ Sbjct: 254 SERISDEG---------SIFVQSGDLNDFLGVEFEIAKRVSVEFDAEAERTRLANGDVVV 304 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 V VC + + + N L R + D LP + + N Sbjct: 305 CITGAKTGKVAVCASVPEPAYVN----QHLCLIRPSPDVLPLF-------------LGNS 347 Query: 351 VKTTSGQ-----------KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 +K+T GQ +G+S +++ +++LPP EQ EIV ++ A D ++ + Sbjct: 348 LKSTIGQTQFELSQYGLKQGLSLDNVREALIVLPPPGEQVEIVTFIDAETARLDELKAEA 407 Query: 400 NNALARVNNLTQSILAKAFRGEL 422 A+ + +++A A G++ Sbjct: 408 ARAIELLKERRSALIAAAVTGKI 430 Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 50/221 (22%), Positives = 95/221 (42%), Gaps = 13/221 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P WVI V +V+ + + I+ ++ N+ +F+ V V Sbjct: 228 GEVPAHWVIRRVKSVSVFMTSGPRGWSERISDEGSIFVQSGDLNDFLGVEFEIAKRVSVE 287 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + E +++ D+V+ ++ V A +P ++RP + F+ Sbjct: 288 FDAEAERTRLANGDVVVCITGAKTGKVAVCAS--VPEPAYVNQHLCLIRPSPDVLPLFLG 345 Query: 124 HFTKSSLYRNKIS----SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + KS++ + + L G +++N++ A I +PP EQ I +D A++ Sbjct: 346 NSLKSTIGQTQFELSQYGLKQGLSLDNVREAL-----IVLPPPGEQVEIVTFIDAETARL 400 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH 220 D KA + ++LK R A++ AV GK+ RN PQ Sbjct: 401 DELKAEAARAIELLKERRSALIAAAVTGKIDV--RNAAPQE 439 >UniRef50_B0P5V5 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0P5V5_9FIRM Length = 291 Score = 80.9 bits (198), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 55/183 (30%), Positives = 94/183 (51%), Gaps = 10/183 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP+GW S + T G+TY+ ++ D + ++R+ NI N D +DLV V K Sbjct: 117 ELPDGWEWCNFSMIGTTNLGLTYRPTD----IEPDGVIVLRSCNIVNDPIDLSDLVRV-K 171 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRPEKLIFSGFIA 123 ++++Q DI+I +GS+ +VGK A +L SFGAF + R E + +I Sbjct: 172 TTIRKNQYAQKNDILICARNGSRVLVGKCALISNLGEAASFGAFMAIYRTE---YFEYIV 228 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +SS +R+ ++ A IN + +P+PP +EQ+ I E +D L +++ + Sbjct: 229 QHLRSSFFRSVFDDSNSTA-INQLTQDMLKRAVVPLPPASEQRRITEMIDATLFELNQME 287 Query: 184 ARF 186 R Sbjct: 288 KRL 290 >UniRef50_A4T4W1 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4T4W1_MYCGI Length = 368 Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 84/322 (26%), Positives = 142/322 (44%), Gaps = 51/322 (15%) Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 E I ++ H R I+ G+ I S I +P+PPLA+Q+ IA LD Sbjct: 85 EARIHLRYLVHVLTPERLRRSIT----GSAQPQITRESLKAITVPLPPLADQRRIAAILD 140 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 Q D ++ + L+R+ + G S+F ++ L Sbjct: 141 ----QADRLRSHRHGL---LRRYSELKRAGFA---------------SMFAGISSSGKLG 178 Query: 234 ---ELRNGLS-SKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 E++ GL S+ ES + P LR++++ G +D +++ + +E+E R +L+ GDL Sbjct: 179 DYGEVQGGLQVSRKRESLPLERPYLRVANIYRGKLDLGEVKTIRVTEAESMRVRLEPGDL 238 Query: 289 LFTRYNGSLEFVGVC----GLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSA 343 LF + + VG G + HQN LIR RL + A+ P Y E +F+S Sbjct: 239 LFVEGHANPNEVGRVAEWNGSVPDCLHQN-----HLIRVRLDRSAVEPTYAEAWFNSRDG 293 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 KTTSG I+ +++ + +PP+ Q E V A+ I+ + + Sbjct: 294 SMHFQRAGKTTSGLNTINASQLRAAPLPVPPISLQREYV-------TVANAIDNHLRDQT 346 Query: 404 AR---VNNLTQSILAKAFRGEL 422 + V+ L S+ ++AF G+L Sbjct: 347 MQSELVDELFVSLQSRAFSGQL 368 >UniRef50_B5FA22 Restriction modification system DNA specificity domain protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5FA22_VIBFM Length = 376 Score = 80.5 bits (197), Expect = 1e-13, Method: Compositional matrix adjust. 
Identities = 71/290 (24%), Positives = 132/290 (45%), Gaps = 23/290 (7%) Query: 136 SSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKR 195 + + G N NN+ I P + QK +A +LD ++ + E+ Q + Sbjct: 106 TGVKPGINRNNVYK-----IQAKFPSYSTQKQVAGQLDKAFDGIEQARTNTEKNLQNARE 160 Query: 196 FRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRI 255 + L V + E W KK + T+ G SSK ++ G P++R+ Sbjct: 161 LFDSYLQ-QVFSECGEGW----------KKTTLNELCTKFEYGTSSKSSQEGE-VPVIRM 208 Query: 256 SSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLL 315 +++ G + + + + +E + +++L D+LF R N S E VG + K + + Sbjct: 209 GNIQDGRIVMDKLVY-SLNEEDNQKYRLNFNDVLFNRTN-SAELVGKTAIYK--SEERAI 264 Query: 316 YPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP- 373 + LIR + L +Y+ + +SP AR + ++ Q ISG +K+ + +P Sbjct: 265 FAGYLIRIHRNEKLLNADYLNFYLNSPIARKYGEQVMSQSTNQANISGTKLKTYPISIPV 324 Query: 374 PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++EQ IV ++ L + +E + L ++ L QS+L +AF G+LT Sbjct: 325 SLEEQQSIVDKISTLKEKVEELEATHKSKLTALDELKQSLLQQAFTGQLT 374 Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust. 
Identities = 55/210 (26%), Positives = 101/210 (48%), Gaps = 15/210 (7%) Query: 8 EGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 EGW ++ + T G + K Q + +P+IR NIQ+G+ LV+ Sbjct: 175 EGWKKTTLNELCTKFEYGTSSKSSQ------EGEVPVIRMGNIQDGRIVMDKLVYSLNEE 228 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIFSGFIAHF 125 + +++ D++ ++ S +VGK+A F G + R EKL+ + ++ + Sbjct: 229 DNQKYRLNFNDVLFNRTN-SAELVGKTAIYKSEERAIFAGYLIRIHRNEKLLNADYLNFY 287 Query: 126 TKSSL---YRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 S + Y ++ S S ANI+ K ++ I+IP+ L EQ+ I +K+ TL +V+ Sbjct: 288 LNSPIARKYGEQVMSQSTNQANISGTKLKTYP-ISIPV-SLEEQQSIVDKISTLKEKVEE 345 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE 211 +A + L +Q++L A G+LT+ Sbjct: 346 LEATHKSKLTALDELKQSLLQQAFTGQLTQ 375 >UniRef50_D1YNY9 Type I restriction modification DNA specificity domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YNY9_9FIRM Length = 427 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 81/362 (22%), Positives = 162/362 (44%), Gaps = 22/362 (6%) Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 I D VI S + G S ++ CS VL P+ + + + + K+ L+ Sbjct: 69 IKKNDFVINSRSDRRGSCGISEYEG---SCSLINI--VLAPKNNMVNRYYNYLFKTELFA 123 Query: 133 NKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 ++ G ++ + K ++ I +P P L EQ+ IAE LDT AQ+D+ A+ + + Sbjct: 124 DEFYKWGNGIVDDLWSTKWSNMKNIMVPFPSLEEQQAIAEHLDTKCAQIDTIIAKEQSVI 183 Query: 191 QILKRFRQAVLGGAVNGKLT---------EKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + L+ +++A++ AV L +W + P H K+L F + + Sbjct: 184 EKLQEYKRAIITYAVVKGLDITAETADSGIEWIDSIPSHWKIKRLIFSAYIRARLGWKGL 243 Query: 242 KPNE-SGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEF 299 K +E + GHP L +++ + D+ F+ + E KL+ GDLL + Sbjct: 244 KADEYTSEGHPFLSAVNIQNDKLVWEDLNFINDDRYDESPEIKLEIGDLLLVKDGAG--- 300 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 +G C ++ +L + L + Y+ FF S +N ++ +K G Sbjct: 301 
IGKCAVVDQLPYGTATTNSSLGVITPYPELNSMYLYYFFESAIFQN-YISRIKNGMGVPH 359 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ ++K+ +V++PP EQ IV +++ A D++ + + + ++ +S++ + Sbjct: 360 LTQGNLKNIMVIIPPYCEQEAIVTYLDEKCANLDSVILRKQSRIDKLTEYKKSLIYEVVT 419 Query: 420 GE 421 G+ Sbjct: 420 GK 421 Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 47/171 (27%), Positives = 83/171 (48%), Gaps = 7/171 (4%) Query: 42 PLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPE--DIVIAMSSGSKSVVGKSAH-QHL 98 P + A NIQN K DL F+ + ES +I E D+++ +GK A L Sbjct: 254 PFLSAVNIQNDKLVWEDLNFINDDRYDESPEIKLEIGDLLLVKDGAG---IGKCAVVDQL 310 Query: 99 PF-ECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINI 157 P+ + + GV+ P + S ++ +F +S++++N IS + G + ++ + I + Sbjct: 311 PYGTATTNSSLGVITPYPELNSMYLYYFFESAIFQNYISRIKNGMGVPHLTQGNLKNIMV 370 Query: 158 PIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGK 208 IPP EQ+ I LD A +DS R + L ++++++ V GK Sbjct: 371 IIPPYCEQEAIVTYLDEKCANLDSVILRKQSRIDKLTEYKKSLIYEVVTGK 421 >UniRef50_C1D7R6 Type I restriction-modification system, S subunit n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D7R6_LARHH Length = 453 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 75/288 (26%), Positives = 129/288 (44%), Gaps = 24/288 (8%) Query: 152 FDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE 211 F+ +P PP EQ IA LD A++D+ A E++ +L RQA + AV L Sbjct: 156 FNNFRLPCPPDDEQAAIATFLDRETAKIDALIAEQEKLIALLAEKRQATISHAVTRGLDP 215 Query: 212 ---------KWRNFEPQHSVFKKLNFESILTELRNGLS----SKPNESGVGHPILRISSV 258 +W P H V + + L + G S S+P E+G +L+ V Sbjct: 216 AVPMKDSGVEWLGQVPAHWVICSVRRK--LKRIEQGWSPECFSRPAEAGEWG-VLKAGCV 272 Query: 259 RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD 318 G + + L + + ++DGDLL +R +GS VG L +L+ D Sbjct: 273 NGGIFRPEENKALPDTLAPDENILIKDGDLLMSRASGSPALVGSVAYLSA-PPAHLMLSD 331 Query: 319 KLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK----DIKSQVVLLPP 374 K+ R L + LP+++ I F + R+ + + SG +G++ +K + +PP Sbjct: 332 KIFRLHLEQGTLPQFVAIAFGARYLRHQIEQAI---SGAEGLANNLPQTSLKGFTIAIPP 388 Query: 375 VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 EQ EIV +Q A D ++ +A++ + +++A A G++ Sbjct: 389 EVEQQEIVVFTQQETAKLDALKIAAEHAVSLLKERRAALIAAAVTGQI 436 Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust. Identities = 54/212 (25%), Positives = 89/212 (41%), Gaps = 9/212 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P WVI V I + E + +++A + G F + +P Sbjct: 228 GQVPAHWVICSVRRKLKRIEQ-GWSPECFSRPAEAGEWGVLKAGCVNGGIFRPEENKALP 286 Query: 64 KNLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLP---FECSFGAFCGVLRPEKLIFS 119 L E+ I D++++ +SGS ++VG A+ P S F L E+ Sbjct: 287 DTLAPDENILIKDGDLLMSRASGSPALVGSVAYLSAPPAHLMLSDKIF--RLHLEQGTLP 344 Query: 120 GFIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+A + R++I +GA NN+ S I IPP EQ+ I A Sbjct: 345 QFVAIAFGARYLRHQIEQAISGAEGLANNLPQTSLKGFTIAIPPEVEQQEIVVFTQQETA 404 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++D+ K E +LK R A++ AV G++ Sbjct: 405 KLDALKIAAEHAVSLLKERRAALIAAAVTGQI 436 >UniRef50_B0RQ64 Type I site-specific DNA methyltransferase specificity subunit n=3 Tax=Xanthomonas campestris pv. 
campestris RepID=B0RQ64_XANCB Length = 415 Score = 80.1 bits (196), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 98/423 (23%), Positives = 167/423 (39%), Gaps = 59/423 (13%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + ++ G T + Q Y D P ++ ++ N + TTD V Sbjct: 2 LPDGWRRTTLGNIGSVKSGSTPARSQHDRYFVDGKWPWVKTMDLTNSEILTTDEVITDAA 61 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG----- 120 L + S ++ P V+ G +G++ G+LR + I Sbjct: 62 LAESSCRLFPAGTVLVAMYGGFKQIGRT---------------GLLREKSAINQAISAID 106 Query: 121 ---------FIAHFTKSSL--YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIA 169 F+ H+ S+ ++N +S NI F +I +P L EQ+ IA Sbjct: 107 IERNQADPEFVLHWLNGSVETWKNYAASSRKDPNITRENVCDFPVI---LPTLGEQRRIA 163 Query: 170 EKLDTLLAQVDSTKARFEQI-PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNF 228 L T + +T+ + Q+ R LG W F Sbjct: 164 HILSTWDQAIATTERLLKNSQKQMDILLRDLTLGTQRTTSTPSPWAKFTLGE-------- 215 Query: 229 ESILTELRNGLSSKPNES-GVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDG 286 L +GLS K E G G + ++V + +D D ++ SE+E N+ +++ G Sbjct: 216 ---LGRTYSGLSGKKGEDFGFGAKFIPYTNVFKNNRIDIEDFSLVKISENE-NQTRVKSG 271 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI---RARLTKDALPEYIEIFFSSPSA 343 D++FT + + VG+ +L L N LY + R K LPEY +P Sbjct: 272 DIIFTISSETPNEVGMASVL--LDDVNELYLNSFCFGYRLNDFKTLLPEYAGFVLRAPHI 329 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 R A+M + S + IS ++ + LP + EQ +R+ + A + K + + L Sbjct: 330 R-ALMTQIAQGSTRFNISKANVMRMELALPSIAEQ----KRIASILGGAHSTVKNLRDQL 384 Query: 404 ARV 406 AR+ Sbjct: 385 ARL 387 >UniRef50_Q0RKJ6 Type I restriction modification enzyme protein S n=1 Tax=Frankia alni ACN14a RepID=Q0RKJ6_FRAAA Length = 399 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 76/315 (24%), Positives = 140/315 (44%), Gaps = 34/315 (10%) Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 G++ H+ + R+++ SL GA + ++ I +P+PPL+EQK I + LD Sbjct: 109 LPGYLYHWLRCQ--RSRLQSLGNGATFKELSKSATARIAVPLPPLSEQKRIEQMLD---- 162 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 Q D+ +AR + L+ Q++ + G + R ++++ ++ + + Sbjct: 163 QADTIRARRRETIARLEELAQSIF-SVMFGNPVQNERG-------WRRVPLSELVVRIDS 214 Query: 238 GLS-------SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 G S ++P E GV L++ +V + + + L + + +++ GDLLF Sbjct: 215 GRSPVCLDRPARPGEWGV----LKLGAVTSCVYRAGENKALPPDVAAFSACEVRPGDLLF 270 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL--PEYIEIFFSSPSARNAMM 348 +R N + E V C L+ + LL PD + R + + P Y+ + P R + Sbjct: 271 SRKN-TRELVAACALVDATPAR-LLLPDLIFRLVVEPRSAVDPVYLHRLLTHPEKRRKVQ 328 Query: 349 NCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 +S IS + + LPP++ Q E RV L + I+ +L + Sbjct: 329 GLASGSSASMPNISKSRLLGLEIELPPMEVQKEFANRVRAL----ERIKVAHQASLVEQD 384 Query: 408 NLTQSILAKAFRGEL 422 L S+ +AFRGEL Sbjct: 385 ELVASLAHRAFRGEL 399 >UniRef50_A6L7U8 Type I restriction enzyme EcoAI specificity protein n=7 Tax=Bacteroides RepID=A6L7U8_BACV8 Length = 449 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. 
Identities = 94/408 (23%), Positives = 174/408 (42%), Gaps = 40/408 (9%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVFVPK 64 LP GW + + ++ T +K ++ + ++R NI N G D ++LV+ Sbjct: 68 LPNGWEWCNLEDIVCELKYGTSEKSLSVGKIA-----VLRMGNITNVGTIDYSNLVYSSN 122 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 N + + +D++ ++ S+ V GK+A + + +RP LIFS ++ Sbjct: 123 NEDIKLYSLEKDDLLFNRTNSSEWV-GKTAIYKKEQPAIYAGYLIRIRP-ILIFSDYLNT 180 Query: 125 FTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 SS YRN ++ A N +NI + IPIPPL EQ+ I ++ ++ +D+ K Sbjct: 181 VMNSSYYRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTIK 240 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 E + +K+ + +L A++GKL + N EP + K++N + T NG ++ Sbjct: 241 NSKEDLQTTIKQAKSKILNLAIHGKLVPQDPNDEPAIELLKRINPD--FTPCDNGHYTQL 298 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 E + +I+S+ G +N E+ + + + R N L G Sbjct: 299 PEGWAICKMKQITSITNGKSQKN-------VETLNGIYPIYGSGGVIGRANQYLCIAGST 351 Query: 304 GLLKKLQHQNLLY-------PDKLIRARLTKDALPEYIEIF-----FSSPSARNAMMNCV 351 + +K N ++ D + L +Y+ F FS AM + Sbjct: 352 IIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDFSKLDKSTAMPSLT 411 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 KT+ G + ++ +PP KEQ IV +++ + + I + V Sbjct: 412 KTSIG----------NVLIPIPPYKEQERIVAKIDMVLDTMNEILRAV 449 Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust. 
Identities = 57/204 (27%), Positives = 94/204 (46%), Gaps = 7/204 (3%) Query: 223 FKKLNFESILTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRH 281 ++ N E I+ EL+ G S K G +LR+ ++ G +D +++ + +E ++ + Sbjct: 72 WEWCNLEDIVCELKYGTSEKSLSVG-KIAVLRMGNITNVGTIDYSNLVYSSNNE-DIKLY 129 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 L+ DLLF R N S E+VG + KK Q +Y LIR R +Y+ +S Sbjct: 130 SLEKDDLLFNRTNSS-EWVGKTAIYKK--EQPAIYAGYLIRIRPIL-IFSDYLNTVMNSS 185 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 RN N Q I+ + + ++ +PP+KEQ IV V + + DTI+ + Sbjct: 186 YYRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTIKNSKED 245 Query: 402 ALARVNNLTQSILAKAFRGELTAQ 425 + IL A G+L Q Sbjct: 246 LQTTIKQAKSKILNLAIHGKLVPQ 269 >UniRef50_UPI0001694BE8 putative type I restriction enzyme specificity subunit n=1 Tax=Paenibacillus larvae subsp. larvae BRL-230010 RepID=UPI0001694BE8 Length = 410 Score = 79.3 bits (194), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 81/329 (24%), Positives = 153/329 (46%), Gaps = 24/329 (7%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPI--PPLAEQKI 167 VLRP ++ FI + +S +R + ++ +GA P F LI+ P+ PPL EQK Sbjct: 67 VLRPSNIVEGRFIYYLLRSEKFRKEAKAVMSGAVGQQRVPKKF-LIDYPLCLPPLNEQKR 125 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR--NFEPQHSVFKK 225 IA+K+++L A++D K ++ + + R A+L A G+LT++WR E ++ K Sbjct: 126 IADKIESLFAKMDIAKRLIDEAKESFELRRAAILDKAFRGELTKEWRLSQVEILPNLETK 185 Query: 226 LNFESILTELRNGLSSKPNESGVGH----------PILRISSVRAGHVDQNDIRFLECSE 275 + + L + + P + + H P+ +S + +G +++ +++ + Sbjct: 186 IPYGWKHVILSDVVQVNPRRTKLQHISDEQECTFVPMGAVSEI-SGTIEEPEVKSFVIVK 244 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE 335 ++ D++F + +E G L KL + + R + +YI Sbjct: 245 KGYTY--FEENDIIFAKITPCME-NGKTALASKLINGFGFGSTEFHVIRAKQHINNKYIY 301 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT- 394 S R + GQ+ + +++ LPPV+EQA+IV +E+++ D Sbjct: 302 
FLLRSSKFRYEAKMHMTGAVGQQRVPKSFLENYKFQLPPVEEQAKIVDLLEKIYDKEDKA 361 Query: 395 -IEKQVNNALARVNNLTQSILAKAFRGEL 422 + +Q+ + + L QSI+ KAFR EL Sbjct: 362 LVIEQLEES---IKLLKQSIVQKAFRREL 387 Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 42/159 (26%), Positives = 69/159 (43%), Gaps = 15/159 (9%) Query: 283 LQDGDLLFTRYNGSLE---FVGVCGLLKKLQHQN----LLYPDKLIRARLTKDALPEYIE 335 Q+ D+LF + +E V G+L K + +L P ++ R +I Sbjct: 29 FQENDILFAKITPCMENGNTVIAKGMLNKFGFGSTEFYVLRPSNIVEGR--------FIY 80 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 S R + GQ+ + K + + LPP+ EQ I ++E LFA D Sbjct: 81 YLLRSEKFRKEAKAVMSGAVGQQRVPKKFLIDYPLCLPPLNEQKRIADKIESLFAKMDIA 140 Query: 396 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLI 434 ++ ++ A +IL KAFRGELT +WR +++ Sbjct: 141 KRLIDEAKESFELRRAAILDKAFRGELTKEWRLSQVEIL 179 >UniRef50_B3E2V8 Restriction modification system DNA specificity domain n=1 Tax=Geobacter lovleyi SZ RepID=B3E2V8_GEOLS Length = 514 Score = 79.0 bits (193), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 67/228 (29%), Positives = 108/228 (47%), Gaps = 17/228 (7%) Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 L +++G P LR SVR G ++ +D+ + E EL R+ L GD+L G Sbjct: 297 LDKTKHQTGAMLPYLRNISVRWGSIETHDLPEMYYEEDELERYGLASGDVLVCE-GGEPG 355 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VCG +H+ L Y L R RL + + +F+ A+ M+ T S K Sbjct: 356 RAAVCGK----EHEKLKYQKALHRVRLFSLYESDLL-VFYLEHLAKTGMLEQYFTGSTIK 410 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + + + + LPP+ EQ+EIV ++ A + + ++L + ++IL AF Sbjct: 411 HFTKESFIALPIPLPPICEQSEIVEHLKLAIQCAQEQDAAIIHSLTQAAAQRKNILKSAF 470 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKIKAER---AASGGKKASRKK 463 G+L Q + P A+ LLE+I+AER +S K++SRKK Sbjct: 471 SGQLVLQAPNDEP--------ASLLLERIRAERQKGESSTKKRSSRKK 510 Score = 59.3 bits (142), Expect = 3e-07, Method: Compositional matrix adjust. 
Identities = 63/243 (25%), Positives = 101/243 (41%), Gaps = 30/243 (12%) Query: 9 GWVIAPVSTVTTL----------IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD 58 GW+ P+S + +RG+T I L ++L NG F + Sbjct: 9 GWLTVPLSDLLMSLETGSRPKGGVRGIT----AGIPSLGGEHLD-------SNGGFKLDN 57 Query: 59 LVFVPKNLVKESQK--ISPEDIVI---AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP 113 + +VP + + I+ DI++ ++G S V S + C R Sbjct: 58 IRYVPLEFAELMTRGAINNGDILVVKDGATTGKVSFVDNSFPLSIAVVNEHVFLC---RC 114 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 L+ S +I + S+ +I GA I DL+ +P+ P AEQ I EKL+ Sbjct: 115 SSLLNSKYIFFYLFSNSGNQQILEDFRGAAQGGISQRFADLVKVPLAPAAEQTRIVEKLE 174 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 L + +D+ A + + L ++RQ++L AV G LT +WR +L E IL Sbjct: 175 ELFSDLDAGVAELKAAQKKLAQYRQSLLKAAVEGSLTAEWRTKNTPKETGAQL-LERILK 233 Query: 234 ELR 236 E R Sbjct: 234 ERR 236 Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 66/236 (27%), Positives = 105/236 (44%), Gaps = 29/236 (12%) Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN------DIRFLECSESEL-NRHKL 283 +L L G S+P + GV I S+ H+D N +IR++ +EL R + Sbjct: 18 LLMSLETG--SRP-KGGVRGITAGIPSLGGEHLDSNGGFKLDNIRYVPLEFAELMTRGAI 74 Query: 284 QDGDLLFTR---YNGSLEFVGVCGLLKKL---QHQNLLYPDKLIRARLTKDALPEYIEIF 337 +GD+L + G + FV L +H L L+ ++ YI + Sbjct: 75 NNGDILVVKDGATTGKVSFVDNSFPLSIAVVNEHVFLCRCSSLLNSK--------YIFFY 126 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 S S ++ + + Q GIS + V L P EQ IV ++E+LF+ D Sbjct: 127 LFSNSGNQQILEDFRG-AAQGGISQRFADLVKVPLAPAAEQTRIVEKLEELFSDLDAGVA 185 Query: 398 QVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 ++ A ++ QS+L A G LTA+WR +N + + + A LLE+I ER A Sbjct: 186 ELKAAQKKLAQYRQSLLKAAVEGSLTAEWRTKN----TPKETGAQLLERILKERRA 237 Score = 55.1 bits (131), Expect = 5e-06, Method: Compositional matrix adjust. 
Identities = 64/256 (25%), Positives = 106/256 (41%), Gaps = 38/256 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGWV A V V + G K + LP +R +++ G +T DL P+ Sbjct: 275 ELPEGWVWASVDQVGEVFLGKMLDK---TKHQTGAMLPYLRNISVRWGSIETHDL---PE 328 Query: 65 NLVKESQ----KISPEDIVIAMSS--GSKSVVGKSAHQHLPFECSFGA--FCGVLRPEKL 116 +E + ++ D+++ G +V GK H+ L ++ + + + L Sbjct: 329 MYYEEDELERYGLASGDVLVCEGGEPGRAAVCGKE-HEKLKYQKALHRVRLFSLYESDLL 387 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 +F ++ H K+ + + G+ I + SF + IP+PP+ EQ I E L Sbjct: 388 VF--YLEHLAKTGMLEQYFT----GSTIKHFTKESFIALPIPLPPICEQSEIVEHLKL-- 439 Query: 177 AQVDSTKARFEQIPQILKRFRQA------VLGGAVNGKLTEKWRNFEPQHSVFKKLNFES 230 + + EQ I+ QA +L A +G+L + N EP L E Sbjct: 440 ----AIQCAQEQDAAIIHSLTQAAAQRKNILKSAFSGQLVLQAPNDEP-----ASLLLER 490 Query: 231 ILTELRNGLSSKPNES 246 I E + G SS S Sbjct: 491 IRAERQKGESSTKKRS 506 >UniRef50_C6Q0B1 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6Q0B1_9CLOT Length = 407 Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust. 
Identities = 101/425 (23%), Positives = 185/425 (43%), Gaps = 28/425 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDT--TDLVF 61 KLP+ W + + K AI D+ +P + +I N G F+ L + Sbjct: 2 KLPKEWKEVNLKEYILTLESGKRPKGGAI----DNGVPSLGGEHINNTGGFNIQIDKLKY 57 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 VP+ K+ S + DI+I + + + E ++R + + + Sbjct: 58 VPREFFKKMKSGVVKKNDILIVKDGATTGKIAFVDNNFNLKEACINEHLFLIRTNERLNN 117 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F++++ +S+ R KI GA + I D NI +PPL QK I + L+ Sbjct: 118 KFLSYYLRSNTGRKKILEDFRGATVGGISKNFIDF-NILLPPLETQKKIVKVLE---KAE 173 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 ++ + R E I + K + +G + K N + SV K + NG Sbjct: 174 ETLEKRKESINLLDKLVKSRFIGMFGDPSSNPKGWNKDTIGSVVKSIT----AGWSANGE 229 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL-QDGDLLFTRYNGSLE 298 + + E +L++S+V G+ ++ + + + E+ ++ + GDLLF+R N + E Sbjct: 230 AREKREDE--KAVLKVSAVTQGYFKADEYKVI-GDDVEIKKYVFPEKGDLLFSRAN-TRE 285 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG ++ K + +LL PDKL + + Y++ S PS R TSG Sbjct: 286 MVGATCIIHK-DYPDLLLPDKLWKVSFVERVNVFYMKYILSEPSIRAEFSAKSTGTSGSM 344 Query: 359 -GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 +S KS + +PP++ Q + V Q+ D ++ ++ +L + + +S++ KA Sbjct: 345 YNVSMDKFKSIEITIPPIELQNQFADFVNQV----DKLKFEMEKSLKELEDNFKSLMQKA 400 Query: 418 FRGEL 422 F+GEL Sbjct: 401 FKGEL 405 >UniRef50_A1TSH8 Restriction modification system DNA specificity domain n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TSH8_ACIAC Length = 429 Score = 79.0 bits (193), Expect = 4e-13, Method: Compositional matrix adjust. 
Identities = 76/312 (24%), Positives = 146/312 (46%), Gaps = 31/312 (9%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN---IPIPPLAEQK 166 V+RP +L+ F+ + + + + + + GA + A++D I + +PPL EQ+ Sbjct: 105 VMRPSELLEPRFLLYSILTPDFVGAVDASTFGAKMPR---ANWDFIGSLEVKVPPLEEQR 161 Query: 167 IIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT---------EKWRNFE 217 +IA LD A +D A E++ +L+ R A++ V L ++W Sbjct: 162 LIANYLDRETAGIDGLIAEKERMLALLEEKRAALISRVVTRGLDPNAPLKPSGQEWLGEI 221 Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESG--VGHPILRISSVRAGHVDQNDIRFLECSE 275 P H ++L L E+R GL+ SG + +P LR+++V+ G++ +D+ +E Sbjct: 222 PVHWGLQRLK---QLAEVRGGLTLGKQYSGELLEYPYLRVANVQDGYLKLDDVLTVEVPA 278 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK----LQHQNLLYPDKLIRARLTKDALP 331 SE + L GD+L G ++ +G + + HQN ++ +R Sbjct: 279 SEAASNLLVYGDVLMNE-GGDIDKLGRGCVWRDEISPCLHQNHVFA---VRPHSVDS--- 331 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +++ ++ S+ A+ + K ++ ISG +IK V LPPV EQ I + + Sbjct: 332 DWLALWTSTIQAKRYFESRAKRSTNLASISGSNIKELPVPLPPVSEQLAIQNFLAVRHSR 391 Query: 392 ADTIEKQVNNAL 403 +T+ ++ ++L Sbjct: 392 LETLRGELRDSL 403 >UniRef50_UPI00016B1071 restriction modification system DNA specificity domain n=1 Tax=Burkholderia pseudomallei 112 RepID=UPI00016B1071 Length = 367 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 55/167 (32%), Positives = 87/167 (52%), Gaps = 22/167 (13%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD-TTDLVFVPKNLVK 68 W P+ T+ + RG+TY ++ + ++ L ++R++NIQ+G DLVFV K Sbjct: 26 WENKPLRTLGSFFRGLTYSADE----VSEEGLLVLRSSNIQDGSLVLDKDLVFVDKP-CP 80 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFCGVLRPE----KLIFSGFI 122 + + D+ I MS+GSK++VGKSA + + + GAFC + RP KLIF Sbjct: 81 DDLLLQDGDVAICMSNGSKALVGKSAEFQNNYDGQLTVGAFCSIFRPSLEFAKLIF---- 136 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP--PLAEQKI 167 ++ Y +S G NI N+K + + P+P PL +QKI Sbjct: 137 ----QTPRYSKFVSIAIGGGNIKNLKNSDLEEFEHPVPRMPLEQQKI 179 >UniRef50_Q6D2H5 Subunit S of type I restriction-modification system n=1 Tax=Pectobacterium atrosepticum RepID=Q6D2H5_ERWCT Length = 551 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. Identities = 63/223 (28%), Positives = 112/223 (50%), Gaps = 20/223 (8%) Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN 227 I++ DTL +T+A + I +Q +L AV G L E F + + K L+ Sbjct: 318 ISQHFDTLF----TTEASIDAI-------KQTILQLAVMGLLIES-AEFSQRSHLKKYLS 365 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 F +NGLS + +L++ + G++ + ++++ + + L+ D Sbjct: 366 FGP-----KNGLSPSEVKYETDVKVLKLGATSYGYLKLQETKYVDIDVKDKSYLFLKKND 420 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L R N S FVG C LL + +L+YPD +++ R + LPEY ++ SSP AR+ M Sbjct: 421 ILIQRGNSS-NFVG-CSLLIEEDFDDLIYPDLMMKIRTKDELLPEYAVLWLSSPFARDFM 478 Query: 348 MNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 + + TSG IS K ++ + +PP Q ++V ++++LF Sbjct: 479 WSKMTGTSGTMPKISKKVVEEIPIAVPPFAVQNQLVIKIKELF 521 >UniRef50_C6JN70 Predicted protein n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JN70_FUSVA Length = 507 Score = 78.2 bits (191), Expect = 5e-13, Method: Compositional matrix adjust. 
Identities = 111/499 (22%), Positives = 220/499 (44%), Gaps = 74/499 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W + V +I G T K Y ++ + ++ +++ + + ++ + Sbjct: 26 EIPENWEWVKLGKVNNVITGSTPSKANE-KYWENKNIFFVKPSDLYQKRNLKSSEEYIDE 84 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSGFIA 123 +++ +I P+ + GS +GK A+ + E S L P+K +IFS + Sbjct: 85 R-ARDNVRILPKYSTLICCIGS---IGKVAYSEV--EVSTNQQINSLVPKKEIIFSLYNY 138 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + S+ +++++ + + I + + + + P+PPL EQK I EKLD++ +++ K Sbjct: 139 YVANSNFFQSQMLNSAVATTIAILNKTNTENLRFPLPPLEEQKRIVEKLDSMFEKINRAK 198 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWR---NFEPQHSVFKKLNFESI--------- 231 ++ + ++ ++++L A G+LT +WR E + K +N E I Sbjct: 199 ELIQEAKENIENRKESILNKAFRGELTVEWRKNNQTEDAIELLKSINDEKIKNWEQECVE 258 Query: 232 -------------LTELRNGLSSK---PNESGVGHPILRISSV-------RAGHVDQND- 267 + +++N + SK P E +++ + + ++D+N+ Sbjct: 259 AEKNGKKKPSKPKIEDIQNMIISKEEEPYEIPSKWKWVKLEYIIEINPKKKMLNIDENEK 318 Query: 268 IRFL----------ECSESELNRH-KLQDG-------DLLFTRYNGSLEFVGVCGLLKKL 309 I FL E S E + KL+ G D+LF + +E G C + K L Sbjct: 319 ISFLPMRSISDITGEISNIEYESYSKLKKGYTQFLENDILFAKITPCME-NGKCVIAKNL 377 Query: 310 QHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQ 368 +++ + Y T L +++ F S R + + G + + + +K Sbjct: 378 KNE-IGYGTTEFHVLRTNYILNNKFLHNFLRQESFRQEAKYNMTGSVGFRRVPTEFLKEY 436 Query: 369 VVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRA 428 + LPP++EQ EIVR ++++ I K++ + L +SIL KAFRG+L Q + Sbjct: 437 MFPLPPLEEQKEIVRILDEILEKESKI-KELVELEEAIELLEKSILDKAFRGKLGTQNKD 495 Query: 429 ENPDLISGENSAAALLEKI 447 + P A LL+KI Sbjct: 496 DEP--------AIELLKKI 506 Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust. 
Identities = 35/98 (35%), Positives = 52/98 (53%), Gaps = 6/98 (6%) Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 LPP++EQ IV +++ +F + ++ + A + N +SIL KAFRGELT +WR N Sbjct: 174 LPPLEEQKRIVEKLDSMFEKINRAKELIQEAKENIENRKESILNKAFRGELTVEWRKNNQ 233 Query: 432 DLISGENSAAALLEKIK------AERAASGGKKASRKK 463 + E + EKIK E +G KK S+ K Sbjct: 234 TEDAIELLKSINDEKIKNWEQECVEAEKNGKKKPSKPK 271 >UniRef50_A3PPQ8 Restriction modification system DNA specificity domain n=1 Tax=Rhodobacter sphaeroides ATCC 17029 RepID=A3PPQ8_RHOS1 Length = 575 Score = 77.8 bits (190), Expect = 7e-13, Method: Compositional matrix adjust. Identities = 83/378 (21%), Positives = 138/378 (36%), Gaps = 82/378 (21%) Query: 104 FGAFCGVLRPEKLIF--SGFIAHFTKSSLY-RNKISSLSAGANINNIKPASFDLINIPIP 160 FGA L + IF +I + KS + N I ++ A + F P+P Sbjct: 171 FGAGTTELHIVRPIFVSPDYILTYLKSPQFIENGIPRMTGTAGQKRVPTEYFIGTPFPLP 230 Query: 161 PLAEQKIIAEKLDTLLAQVD---------------------------------------S 181 PLAEQ I K++ L+A +D Sbjct: 231 PLAEQHRIVAKVEELMALLDRIEAARAGREETRNRLTAATLARLTDPKADAPAATRFALD 290 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE------------ 229 T A P +K RQ +L AV G+L + EP + K+++ E Sbjct: 291 TLAPLTTRPDQIKTLRQTILNLAVRGRLVPQDPADEPASELLKRVSVERARLEKAGAIRS 350 Query: 230 -------------------------SILTELRNGLSSKPNE--SGVGHPILRISSVRAGH 262 L + G+ P P L + +V Sbjct: 351 TKRAASLEGTKLRFNPPPRWRWTNLECLFAITGGIQKTPGRMPKANAFPYLGVGNVYRNR 410 Query: 263 VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 +D +++ E + E+++ LQ D+L NGS +G C + + Q + ++ + LIR Sbjct: 411 IDLTNLKKFELQDGEVDKFGLQPFDILVVEGNGSATEIGRCAMWEG-QIEQCVHQNHLIR 469 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 R L Y ++ +SP + M T++G +S I + + LPP+ EQ IV Sbjct: 470 CRPIDPNLSRYALLYLNSPLGMDEMTELAITSAGLYNLSVGKISTVPLPLPPLAEQHRIV 529 Query: 383 RRVEQLFAYADTIEKQVN 400 +V+ L D +E ++ Sbjct: 530 AKVDALMRLLDDLEAALS 547 Score = 47.8 bits (112), Expect = 8e-04, Method: Compositional matrix adjust. 
Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 3/172 (1%) Query: 37 KDDYLPLIRANNIQNGKFDTTDLV-FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAH 95 K + P + N+ + D T+L F ++ + + P DI++ +GS + +G+ A Sbjct: 394 KANAFPYLGVGNVYRNRIDLTNLKKFELQDGEVDKFGLQPFDILVVEGNGSATEIGRCAM 453 Query: 96 QHLPFE-CSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFD 153 E C RP S + + S L ++++ L+ A + N+ Sbjct: 454 WEGQIEQCVHQNHLIRCRPIDPNLSRYALLYLNSPLGMDEMTELAITSAGLYNLSVGKIS 513 Query: 154 LINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV 205 + +P+PPLAEQ I K+D L+ +D +A R A L A+ Sbjct: 514 TVPLPLPPLAEQHRIVAKVDALMRLLDDLEAALSASSTTRARLLDATLRAAL 565 Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust. Identities = 28/86 (32%), Positives = 40/86 (46%) Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 P+YI + SP + + T+GQK + + LPP+ EQ IV +VE+L A Sbjct: 188 PDYILTYLKSPQFIENGIPRMTGTAGQKRVPTEYFIGTPFPLPPLAEQHRIVAKVEELMA 247 Query: 391 YADTIEKQVNNALARVNNLTQSILAK 416 D IE N LT + LA+ Sbjct: 248 LLDRIEAARAGREETRNRLTAATLAR 273 >UniRef50_D1JFQ8 Putative type I restriction enzyme, DNA specificity domain n=1 Tax=uncultured archaeon RepID=D1JFQ8_9ARCH Length = 323 Score = 77.4 bits (189), Expect = 1e-12, Method: Compositional matrix adjust. 
Identities = 56/192 (29%), Positives = 95/192 (49%), Gaps = 7/192 (3%) Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 ++ G ++K NE+G G LRI+ ++ V+ + + F E + E+ + +L++ +++F R Sbjct: 22 KIHYGYTAKANENGRGSKYLRITDIQENKVNWDTVPFCEIDDEEIEKFELKENNIVFART 81 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 G+ VG L+K ++ LIR +L+ +YI +FF S N Sbjct: 82 GGT---VGKSFLIKNDVPSKAVFASYLIRIKLSNYIDKKYIYLFFQS---LNYWSQIELG 135 Query: 354 TSGQKGISGKDIKSQVVL-LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 +G K I S++ L L P+ EQ IV ++EQLF D + A ++ Q+ Sbjct: 136 KTGLKTNVNAQILSKLKLNLAPLPEQRAIVAKIEQLFCDLDNGMANLKKAQEQLKIYRQA 195 Query: 413 ILAKAFRGELTA 424 +L KAF GE T Sbjct: 196 VLKKAFEGEFTG 207 Score = 50.1 bits (118), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 74/301 (24%), Positives = 130/301 (43%), Gaps = 36/301 (11%) Query: 6 LPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGK--FDTTDLVFV 62 +P+ W ++ V+ I G T K + N YL R +IQ K +DT + Sbjct: 7 IPDNWEECIINDVSIKIHYGYTAKANE--NGRGSKYL---RITDIQENKVNWDTVPFCEI 61 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKS--AHQHLPFECSFGAFCGVLRPEKLIFSG 120 + E ++ +IV A + G+ VGKS +P + F ++ ++ I Sbjct: 62 DDEEI-EKFELKENNIVFARTGGT---VGKSFLIKNDVPSKAVFASYLIRIKLSNYIDKK 117 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +I F +S Y ++I G N+ + + + PL EQ+ I K++ L +D Sbjct: 118 YIYLFFQSLNYWSQIELGKTGLK-TNVNAQILSKLKLNLAPLPEQRAIVAKIEQLFCDLD 176 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL---TEKWRNFEPQHSVFKKLNFESILTELRN 237 + A ++ + LK +RQAVL A G+ T++W KK+ + EL + Sbjct: 177 NGMANLKKAQEQLKIYRQAVLKKAFEGEFTGGTKRW--------ACKKM---EAVVELID 225 Query: 238 GLSS----KPNESGVGHPILRISS--VRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLF 290 G K N+ G L +S+ VR + N+ ++ E ++L + L GD++ Sbjct: 226 GDRGPNYPKRNDYLYGGYCLFLSTKNVRPDGFEFNETVYISEEKHNQLRKGTLNRGDIIL 285 Query: 291 T 291 T Sbjct: 286 T 286 >UniRef50_B2V7V7 Restriction modification system DNA specificity domain n=3 Tax=Sulfurihydrogenibium RepID=B2V7V7_SULSY Length = 435 
Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 97/401 (24%), Positives = 182/401 (45%), Gaps = 41/401 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRG--VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +PE W +A + V + +G ++ K+ + LK P +R +N+ K D ++L + Sbjct: 13 GLIPEDWEVARLGEVFEVKQGKQLSAKENRDGKVLK----PFLRTSNVLWNKIDLSELSY 68 Query: 62 VP------KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPE 114 +P KNL K+ DI++ VG++A E S+ LR Sbjct: 69 MPFSESEFKNL-----KLKKGDILVCEGGD----VGRTAVWDGQIDEISYQNHLHRLRSV 119 Query: 115 KL-IFSGFIAHFTKSSL-YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 K I + F A++ + ++ +N + I N+ + IP+PPL EQ+ IA+ L Sbjct: 120 KDNINNYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQRAIADIL 179 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLG-GAV------NGKLTEKWRNFEPQHSVFKK 225 T+ ++ T+ Q+ K + + GAV KL E P+H + Sbjct: 180 STVQNAIEKTEKVINATKQLKKSMMKHLFTYGAVAVDEIDRIKLKESEIGLIPEHWEVVR 239 Query: 226 LNFESILTELRNGLSSKPNESGV---GHPILRISSVRAGHVDQNDIRFLECSESELNRHK 282 L + +L G+S + E G GH I+ I +++ G++D N ++ + ++K Sbjct: 240 L---GEVVDLDRGISWRKFEEGSKDNGHLIISIPNIKDGYIDFNS-KYNHYLIKHIPKNK 295 Query: 283 -LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSS 340 +Q D+LF +GS+E VG ++ L + + + + RAR+ +P+++ F ++ Sbjct: 296 QIQLNDILFVGSSGSIENVGRNVFIENLSFEGIGFASFVFRARVKVNTVIPKFL-YFMAN 354 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 N +++ G+ + K+ + LPP+ EQ +I Sbjct: 355 SHWFNYKDYVRRSSDGKYNFQLTEFKTIKIPLPPLDEQQKI 395 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 53/211 (25%), Positives = 97/211 (45%), Gaps = 16/211 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG------KFDTT 57 G +PE W + + V L RG++++K + + KD+ +I NI++G K++ Sbjct: 229 GLIPEHWEVVRLGEVVDLDRGISWRKFEEGS--KDNGHLIISIPNIKDGYIDFNSKYNHY 286 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFE-CSFGAFCGVLRPE- 114 + +PKN ++I DI+ SSGS VG++ ++L FE F +F R + Sbjct: 287 LIKHIPKN-----KQIQLNDILFVGSSGSIENVGRNVFIENLSFEGIGFASFVFRARVKV 341 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + F+ S + K + N + F I IP+PPL EQ+ IA L T Sbjct: 342 NTVIPKFLYFMANSHWFNYKDYVRRSSDGKYNFQLTEFKTIKIPLPPLDEQQKIANILTT 401 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAV 205 + ++ + + + + + K ++ G + Sbjct: 402 IDQKIQAEEKKKVALRSLFKTLLHQLMTGKI 432 Score = 48.1 bits (113), Expect = 6e-04, Method: Compositional matrix adjust. Identities = 50/174 (28%), Positives = 84/174 (48%), Gaps = 15/174 (8%) Query: 234 ELRNG--LSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 E++ G LS+K N G V P LR S+V +D +++ ++ SESE KL+ GD+L Sbjct: 29 EVKQGKQLSAKENRDGKVLKPFLRTSNVLWNKIDLSELSYMPFSESEFKNLKLKKGDILV 88 Query: 291 TRYNGSLEFVGVC-GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF--SSPSARNAM 347 G + V G + ++ +QN L+ R R KD + Y ++ + + +N Sbjct: 89 CE-GGDVGRTAVWDGQIDEISYQNHLH-----RLRSVKDNINNYFFAYWMEYAITIKNLY 142 Query: 348 -MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 N KTT +S +K+ + LPP++EQ I + + + EK +N Sbjct: 143 HQNANKTTI--PNLSSSRLKAFPIPLPPLEEQRAIADILSTVQNAIEKTEKVIN 194 >UniRef50_UPI000197A104 putative Type I restriction enzyme EcoR124II specificity protein n=1 Tax=Helicobacter cinaedi CCUG 18818 RepID=UPI000197A104 Length = 270 Score = 76.6 bits (187), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 59/199 (29%), Positives = 106/199 (53%), Gaps = 13/199 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKK-EQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVPK 64 P+GW + V +IRG+TY K EQ ++ ++ A+NI N F+ + ++++ + Sbjct: 79 PQGWDTIKLGQVCEIIRGITYDKTEQTTEKTQN---IVLTADNITLNNTFELSKMIYLKQ 135 Query: 65 NLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + + KI DI + SSGS +GK A E G F G+LR + F+ Sbjct: 136 DFIGDKNKILRKNDIFMCFSSGSLKHIGKVAFIDKDTEYYAGGFMGILRSR--FNAKFVF 193 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPL-AEQKIIA--EKLDTLLAQVD 180 + + ++ K+ + + G+NINN+ DL IP+PPL A++KII+ EK+++ ++ ++ Sbjct: 194 YTIANDDFKQKLENSATGSNINNLSGKINDL-KIPLPPLEAQEKIISVVEKIESTISLLE 252 Query: 181 STKARFEQIPQILKRFRQA 199 K + +IL+ F Q Sbjct: 253 CHKPESKN-SEILQHFLQG 270 >UniRef50_C3WQF7 Restriction modification system DNA specificity subunit n=8 Tax=Bacteria RepID=C3WQF7_9FUSO Length = 598 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 72/276 (26%), Positives = 128/276 (46%), Gaps = 34/276 (12%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +I +F +++ + KI SL G + ++ +P+PPL Q I LD A Sbjct: 104 YIYYFLQNN--QMKIHSLKKGGGVPHVYFKDMQKFLVPVPPLEVQNEIVRILDNFTALTA 161 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKK--LNFES-------- 230 A L + +LT + + Q+S ++ L FE+ Sbjct: 162 ELTAE---------------LTAELTAELTAELTARKKQYSWYRDYLLKFENKVKMVKIG 206 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIR-FLECSESELNRHKLQDGDL 288 L E +NG++ G G PI+ +V + + D++ +E S EL R+ ++ GD+ Sbjct: 207 DLFEFKNGINKDKGSFGKGTPIINYVNVYKKNKIYFEDLKGLVEASNDELVRYGVKRGDV 266 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR-LTKDALPEYIEIFFSSPSARNAM 347 FTR + ++E +G +L + +N ++ L+RAR +T LPEY FS+ + RN + Sbjct: 267 FFTRTSETIEEIGYTSVLLE-DIENCVFSGFLLRARPITDLLLPEYCAYCFSTSNIRNTI 325 Query: 348 MNCVKTTSGQKGISGKDIKSQV-VLLPPVKEQAEIV 382 + K+T + ++ SQ+ + LPP++ Q IV Sbjct: 326 IK--KSTYTTRALTNGTSLSQIEIPLPPLEVQKRIV 359 >UniRef50_C0XBA7 Type I restriction-modification system, S subunit n=1 
Tax=Lactobacillus gasseri JV-V03 RepID=C0XBA7_9LACO Length = 468 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 87/425 (20%), Positives = 174/425 (40%), Gaps = 63/425 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYL-PLIRANNI-QNGKFDTTDLVFV 62 +LP W + + T G YKK++ L DD L P++R N+ N + +DL Sbjct: 54 ELPSSWDWITLGSGVTFYNGRAYKKKEL---LSDDKLTPVLRVGNLFTNSSWYYSDLS-- 108 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + E++ I D++ A S+ + H + + +I + F+ Sbjct: 109 ----LDENKYIDNGDLIYAWSASFGPKIWNGGHVIYHYHI-----WKLEYDNNVIDTNFL 159 Query: 123 AHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +F RN + G+ + +I + + + P+PPL EQ IA K+ L A + Sbjct: 160 YYFLLDK--RNVVGETDLHGSTMKHITKTNMEHLPFPLPPLEEQSRIAAKIAQLFALLRK 217 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE------------ 229 ++ +Q ++ + VL A+ GKL ++ + EP + +K+ E Sbjct: 218 VESSTQQYAKLQTLLKSKVLDLAMRGKLVKQDPHDEPASVLLEKIKAEKEQLIKEKKIKK 277 Query: 230 --------------------------SILTELRNGLSSKPNESGVGHPILRISSVRAGHV 263 + +R G ++ +G +LRI+ ++ +V Sbjct: 278 SKPLPPITDKEKPFDIPDSWEWVRLGEVAESIRYGYTASAQATGNAK-LLRITDIQNNNV 336 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 + N + S+ +L L D+L R G+ +G +K++ ++ LIR Sbjct: 337 NWNMVPLCNISDMKLKDLSLHKKDILIARTGGT---IGKNYFVKQIVEPT-VFASYLIRV 392 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R + +I+ +P N ++ K+ +GQ ++ +++ + +PP++EQ IV Sbjct: 393 RNINKKVSNFIQYVLDAPIYWN-FISAKKSGTGQPNVNAAKLENFIFPIPPLEEQNRIVD 451 Query: 384 RVEQL 388 ++ L Sbjct: 452 KIINL 456 Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust. 
Identities = 35/117 (29%), Positives = 57/117 (48%), Gaps = 10/117 (8%) Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 +++ RN + S K I+ +++ LPP++EQ+ I ++ QLFA + Sbjct: 159 LYYFLLDKRNVVGETDLHGSTMKHITKTNMEHLPFPLPPLEEQSRIAAKIAQLFALLRKV 218 Query: 396 EKQVNNALARVNNLTQS-ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 E A++ L +S +L A RG+L Q + P A+ LLEKIKAE+ Sbjct: 219 ESSTQQ-YAKLQTLLKSKVLDLAMRGKLVKQDPHDEP--------ASVLLEKIKAEK 266 Score = 43.5 bits (101), Expect = 0.015, Method: Compositional matrix adjust. Identities = 45/176 (25%), Positives = 75/176 (42%), Gaps = 16/176 (9%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP-- 63 +P+ W + V IR QA K L+R +IQN + VP Sbjct: 293 IPDSWEWVRLGEVAESIRYGYTASAQATGNAK-----LLRITDIQNNNVNWN---MVPLC 344 Query: 64 --KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAFCGVLRPEKLIFSG 120 ++ + + +DI+IA + G+ +GK+ E + F ++ +R S Sbjct: 345 NISDMKLKDLSLHKKDILIARTGGT---IGKNYFVKQIVEPTVFASYLIRVRNINKKVSN 401 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 FI + + +Y N IS+ +G N+ A + PIPPL EQ I +K+ L+ Sbjct: 402 FIQYVLDAPIYWNFISAKKSGTGQPNVNAAKLENFIFPIPPLEEQNRIVDKIINLI 457 >UniRef50_A4T8B4 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4T8B4_MYCGI Length = 442 Score = 76.3 bits (186), Expect = 2e-12, Method: Compositional matrix adjust. 
Identities = 92/437 (21%), Positives = 191/437 (43%), Gaps = 31/437 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVFVPKNLV 67 W + ++ V + G+ ++Q +D+ P +R N+ + D + + + Sbjct: 3 WPLVALADVAEIQGGI---QKQPKRTARDNAFPFLRVANVTARGLALDEVHTIELFDGEL 59 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAHFT 126 E ++ D+++ +GS S +G++A + +RP I F+ H Sbjct: 60 -ERYRLLRGDLLVVEGNGSASQIGRAAVWDGSITDAVHQNHLIRVRPGFQIDPRFLGHLW 118 Query: 127 KSSLYRNKISSL-SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 S L R+++S + S+ + ++ + I +P+P L EQ+ I + L+ L+++D+ ++ Sbjct: 119 NSPLIRDELSRVASSTSGLHTLSVTKLKRITLPLPSLTEQRRIVDLLEDHLSRLDAGRSE 178 Query: 186 FEQIPQILKRFR-----QAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN--- 237 E+ L R QA+ GGA + + + L + L + Sbjct: 179 VERAAAKLAILRERTVIQALTGGAEANREDARLTDVSTADGDLSALPIGWSWSRLGDVAD 238 Query: 238 ---GLS------SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 G++ S PN V P LR+++V+ G ++ +++ + +S+ + +L+ GD+ Sbjct: 239 VVGGVTKDSKKQSDPNYVEV--PYLRVANVQRGRLNLDEVTKIRVPQSKADALRLRPGDV 296 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAM 347 L + G + + Q + ++ + + RAR+T + P ++ ++ R A Sbjct: 297 LLNEGGDRDKL--ARGWVWEGQVPDCIHQNHVFRARITDPRIDPYFLSWTANTIGGRWAE 354 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 N K + IS I+ V++PP E I + + D +EK + + + R Sbjct: 355 RNG-KQSVNLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDRLEKSIRDGMDRAL 413 Query: 408 NLTQSILAKAFRGELTA 424 L +S+L AF G LT+ Sbjct: 414 VLKKSLLTAAFSGRLTS 430 >UniRef50_B3JQ19 Putative uncharacterized protein n=2 Tax=Bacteria RepID=B3JQ19_9BACE Length = 468 Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 92/413 (22%), Positives = 182/413 (44%), Gaps = 40/413 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP- 63 ++PE WV + G + Y ++ Y PLI + + +NG+FD + ++ Sbjct: 70 EVPESWVWCKFQDCMDVRDGT----HDSPKYTQEGY-PLITSKDFKNGQFDFSKTRYISE 124 Query: 64 ---KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK--LIF 118 KN++K S K+ DI+ +M G+ + H + F+ + + +P + I Sbjct: 125 VDYKNIIKRS-KVDIGDILYSMIGGNIGSMIYIQHDNY-FDMAIKN-VALFKPYQNSDIS 181 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + +IA+F +S + + +++ G + F +P+PPLAEQ I +++ LA Sbjct: 182 TKYIAYFLESKI--KEYQAIAIGGAQPFVGLDIFRNTLVPLPPLAEQHRIITEIEKWLAL 239 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 +D + + I+K+ + +L A++GKL + N EP + K++N + T NG Sbjct: 240 IDQIEQGKVDLQTIIKQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPD--FTPCDNG 297 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQND-------IRFLECSESELNRHKLQDGDLLFT 291 S K + + + + G + D I F + ++ L+ G +FT Sbjct: 298 HSRKLPQGWAYCQLSNVLKITMGQSPKGDSLNNKRGIEFHQGKICFSDKFLLESG--IFT 355 Query: 292 RYNGSL---EFVGVC-----GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 + + +C G++ ++Q + R ++ +F Sbjct: 356 NEPTKIAEPNSILLCVRAPVGVVNITKNQICIG-----RGLCALTPFEGNVDFYFYLLQT 410 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + T + K ISG+ I+++ ++LPP+ EQ IV+++E+LF D I+ Sbjct: 411 LQDSFDNQSTGTTFKAISGEIIRNENIILPPLAEQQRIVQKIEELFHVFDNIQ 463 Score = 56.2 bits (134), Expect = 3e-06, Method: Compositional matrix adjust. 
Identities = 61/288 (21%), Positives = 112/288 (38%), Gaps = 62/288 (21%) Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL--------------------------N 227 K+ RQ +L A++GKL + N EP + +++ + Sbjct: 4 KKLRQKILDLAIHGKLVPQDPNDEPASVLLERIKAEKERLIKEGKIKKSKKSTKASDTPH 63 Query: 228 FESI---------------LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLE 272 +E++ ++R+G P + G+P++ + G D + R++ Sbjct: 64 YENVPFEVPESWVWCKFQDCMDVRDGTHDSPKYTQEGYPLITSKDFKNGQFDFSKTRYIS 123 Query: 273 CSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD------KLIRAR 324 + + + R K+ GD+L++ G++ G + +QH N Y D L + Sbjct: 124 EVDYKNIIKRSKVDIGDILYSMIGGNI------GSMIYIQHDN--YFDMAIKNVALFKPY 175 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI-KSQVVLLPPVKEQAEIVR 383 D +YI F S + G + G DI ++ +V LPP+ EQ I+ Sbjct: 176 QNSDISTKYIAYFLESKIKEYQAI----AIGGAQPFVGLDIFRNTLVPLPPLAEQHRIIT 231 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 +E+ A D IE+ + + IL A G+L Q + P Sbjct: 232 EIEKWLALIDQIEQGKVDLQTIIKQTKSKILDLAIHGKLVPQDPNDEP 279 >UniRef50_C0QCH4 HsdS2 n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QCH4_DESAH Length = 426 Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust. 
Identities = 91/440 (20%), Positives = 193/440 (43%), Gaps = 41/440 (9%) Query: 4 GKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +PE W + + + G+T + + D +P R+ NI +G D+V++ Sbjct: 13 GWIPEDWDCVKLGGIVNKVGSGITPRGGSKV--YCDKGVPFFRSQNILHGTVSVKDIVYI 70 Query: 63 PKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-----GAFCGVLRPEK 115 +NL ++ + + P D+++ ++ G S + F +F ++RP+ Sbjct: 71 SENLHQKMKNTHLQPADVLL-------NITGASIGRCCVFPNNFKKGNVNQHVCIIRPDG 123 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I S ++ S + + +I + AG N + +P+PPL EQ+ IA+ L T+ Sbjct: 124 TIKSQYLCSLLNSPIGQKQIWNFQAGGNREGLNFQQIRSFILPLPPLPEQQKIADVLSTV 183 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 ++ S + +Q Q+ K + +L + G K + + + ++I + Sbjct: 184 DDKISSIDQQIQQTEQLKKGLMEKLLTEGI-GHTEFKDTEIGQIPASWDVVKLKTICHRI 242 Query: 236 RNGLSSKPNE--SGVGHPILRISSVRAGHVDQNDIRFLECSESELNR-HKLQDGDLLFTR 292 G+++ +E + G PI+R +++ + +D+ + +E N KL GD++ R Sbjct: 243 FVGIATSTSEHYTNDGIPIIRNQNIKENSISGDDLLKITNDFNEKNHSKKLMVGDIITAR 302 Query: 293 YNGSLEFVGV-CGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNC 350 + G+ C + KK + + +R K+ + P Y+ + +S + +++ Sbjct: 303 TG----YPGMSCVIPKKFEGAQTF---TTLVSRPNKERIFPHYLSRYINSDIGKKIVLSN 355 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + Q+ ++ +K ++LPP++EQ +I L + D I+ + R + Sbjct: 356 -QAGGAQQNLNAGRLKEIPIILPPLEEQKQIATI---LSSVDDKID------VLRSKKTS 405 Query: 411 QSILAKAFRGE-LTAQWRAE 429 + L K G+ LT Q R + Sbjct: 406 YTTLKKGLMGQLLTGQMRVK 425 Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 51/212 (24%), Positives = 94/212 (44%), Gaps = 15/212 (7%) Query: 4 GKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W + + T+ I G+ + + +D +P+IR NI+ DL+ + Sbjct: 224 GQIPASWDVVKLKTICHRIFVGIATSTSE---HYTNDGIPIIRNQNIKENSISGDDLLKI 280 Query: 63 PKNLVKE--SQKISPEDIVIAMSS--GSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LI 117 + ++ S+K+ DI+ A + G V+ K FE + V RP K I Sbjct: 281 TNDFNEKNHSKKLMVGDIITARTGYPGMSCVIPKK------FEGAQTFTTLVSRPNKERI 334 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F +++ + S + + + S AG N+ I I +PPL EQK IA L ++ Sbjct: 335 FPHYLSRYINSDIGKKIVLSNQAGGAQQNLNAGRLKEIPIILPPLEEQKQIATILSSVDD 394 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++D +++ + K +L G + K+ Sbjct: 395 KIDVLRSKKTSYTTLKKGLMGQLLTGQMRVKI 426 >UniRef50_B3E898 Restriction modification system DNA specificity domain n=1 Tax=Geobacter lovleyi SZ RepID=B3E898_GEOLS Length = 447 Score = 75.9 bits (185), Expect = 3e-12, Method: Compositional matrix adjust. Identities = 79/347 (22%), Positives = 137/347 (39%), Gaps = 61/347 (17%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +++HF + L + + AN++ + + IP+PPL EQ+ + +D + D Sbjct: 102 YLSHFKDTVL----VPLMKGAANVS-LSMKEIASVKIPVPPLDEQQSL---IDLIFRIED 153 Query: 181 STKARFEQIPQ---ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + + +LK+ RQA+L AV G+LT WR P + + ++L +++ Sbjct: 154 EHQELLTETNHQGVLLKQLRQALLQEAVAGELTTAWRKQHPVAKGDPQYDAAALLAQIKA 213 Query: 238 --------------------GLSSKPNESGVGHPILRISSV------------------- 258 KP + G R+ V Sbjct: 214 EKERLVKEGKIRKEKPLPPITDEDKPFDLPEGWGWCRLGEVADGFQYGSSVKSLKEGKVP 273 Query: 259 -------RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 + G +D +++ + E+ ++++ +GDLLF R N S E VG GL + Sbjct: 274 VLRMGNIQCGKIDWSNLVYTN-DTGEIRKYRVTNGDLLFNRTN-SRELVGKTGLFDGMYE 331 Query: 312 QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 ++ L+R + Y +S R GQ I+ ++ Sbjct: 332 A--IFAGYLVRVTMLGGISATYSNGVLNSKFHREWCDANKTDALGQSNINATKLRDYFFP 389 Query: 372 
LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 LPP+ EQ IV RV+ L A D +EKQV + L Q++L +AF Sbjct: 390 LPPLAEQQAIVARVDSLMATIDELEKQVAERKEQAQLLMQTVLREAF 436 Score = 61.6 bits (148), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 52/201 (25%), Positives = 93/201 (46%), Gaps = 15/201 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPEGW + V G Y ++ LK+ +P++R NIQ GK D ++LV+ Sbjct: 242 LPEGWGWCRLGEVAD---GFQYGS--SVKSLKEGKVPVLRMGNIQCGKIDWSNLVYTNDT 296 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 +++ D++ ++ S+ +VGK+ +E F + + I + + Sbjct: 297 GEIRKYRVTNGDLLFNRTN-SRELVGKTGLFDGMYEAIFAGYLVRVTMLGGISATYSNGV 355 Query: 126 TKSSLYR-----NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 S +R NK +L +NIN K + P+PPLAEQ+ I ++D+L+A +D Sbjct: 356 LNSKFHREWCDANKTDALGQ-SNINATKLRDY---FFPLPPLAEQQAIVARVDSLMATID 411 Query: 181 STKARFEQIPQILKRFRQAVL 201 + + + + + Q VL Sbjct: 412 ELEKQVAERKEQAQLLMQTVL 432 Score = 47.0 bits (110), Expect = 0.001, Method: Compositional matrix adjust. Identities = 30/94 (31%), Positives = 53/94 (56%), Gaps = 3/94 (3%) Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +S K+I S + +PP+ EQ ++ + ++ + + N+ + L Q++L +A Sbjct: 123 LSMKEIASVKIPVPPLDEQQSLIDLIFRIEDEHQELLTETNHQGVLLKQLRQALLQEAVA 182 Query: 420 GELTAQWRAENPDLISGE--NSAAALLEKIKAER 451 GELT WR ++P + G+ AAALL +IKAE+ Sbjct: 183 GELTTAWRKQHP-VAKGDPQYDAAALLAQIKAEK 215 >UniRef50_Q1NNJ9 Putative uncharacterized protein n=1 Tax=delta proteobacterium MLMS-1 RepID=Q1NNJ9_9DELT Length = 348 Score = 75.5 bits (184), Expect = 4e-12, Method: Compositional matrix adjust. 
Identities = 47/173 (27%), Positives = 88/173 (50%), Gaps = 13/173 (7%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISS--LSAGANINNIKPASFDLINIPIPPLAEQKI 167 V P + + F+ F K + R+ ++ G ++ +KP++ P+ PL EQ+ Sbjct: 99 VFPPSEGVEPKFLCFFLKQNAVRDFLAQNVSGVGGSLMRVKPSTLKGHPFPVAPLNEQRR 158 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV----- 222 I EK++TL A++D +A ++ ++L +RQ+VL AV G+LT WR E H + Sbjct: 159 IVEKIETLFARLDKGEAALREVQKLLASYRQSVLKAAVTGQLTADWRA-ENAHRLEHGRD 217 Query: 223 -----FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRF 270 + ++ +L ++R G + K G +LRI +V G ++ +++F Sbjct: 218 LLTLGWGRVTLGELLEDIRYGTAKKCQPDVDGIAVLRIPNVVNGTINLQELKF 270 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 37/129 (28%), Positives = 58/129 (44%), Gaps = 11/129 (8%) Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQ-KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 P+++ F + R+ + V G + +K + P+ EQ IV ++E LF Sbjct: 108 PKFLCFFLKQNAVRDFLAQNVSGVGGSLMRVKPSTLKGHPFPVAPLNEQRRIVEKIETLF 167 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP-------DLIS---GENS 439 A D E + + + QS+L A G+LTA WRAEN DL++ G + Sbjct: 168 ARLDKGEAALREVQKLLASYRQSVLKAAVTGQLTADWRAENAHRLEHGRDLLTLGWGRVT 227 Query: 440 AAALLEKIK 448 LLE I+ Sbjct: 228 LGELLEDIR 236 >UniRef50_Q3IEL0 Putative type I restriction-modification system, S subunit n=1 Tax=Pseudoalteromonas haloplanktis TAC125 RepID=Q3IEL0_PSEHT Length = 442 Score = 75.5 bits (184), Expect = 5e-12, Method: Compositional matrix adjust. 
Identities = 109/448 (24%), Positives = 201/448 (44%), Gaps = 59/448 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 K+P W P+ + +GV +K N + +++A++I+ ++++ V++P Sbjct: 27 KIPNYWQTIPLRLILDTRKGVAFKS----NDFTSSGIRVVKASDIKKLTINSSE-VYLPT 81 Query: 65 NLVKESQKISPEDI-----VIAMSSGSKSVVGKSAHQHLPF--ECSFGAFCG----VLRP 113 N + I P+ I +I + GS V SA + E GA V P Sbjct: 82 NYIS----IYPKAILRKGDIILSTVGSNPDVKNSAVGQIGVVPEHLDGALLNQNTVVFEP 137 Query: 114 -EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN--IPIPPLAEQKIIAE 170 E I F+ + + YR+ + L+A N + D++N IPIPP EQ+ IA Sbjct: 138 KEDKIHREFLFKVIQMNGYRDHLD-LNAHGTANQSSLSISDMLNFYIPIPPKNEQQKIAS 196 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHS 221 LD A++D+ A+ E++ ++LK RQAV+ AV L +W P+H Sbjct: 197 FLDHETAKIDTLIAKQEKLIELLKEKRQAVISHAVTKGLNPNAPMRDSGVEWLGEVPEHW 256 Query: 222 VFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH 281 + L ++ ++ GLSS E + +I + V + F E S N H Sbjct: 257 LIGSLRWKVSISS-GEGLSSNLVEKNKTE-LKKIPVIGGNGV----MGFSESS----NTH 306 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 K + R G+L CG + + + + + + L + + D E I Sbjct: 307 KTA---IAIGRV-GAL-----CGNVHLINYISWITDNALKIS--SWDGFDENYLISL--- 352 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 + A +N + +T+ Q I+G+ IKS +V++PP+KEQ +I ++ ++ D +EK+ + Sbjct: 353 -LKAANLNNLASTTAQPLITGEQIKSLIVVIPPLKEQIKINLKLTKIVNLFDKLEKRSKD 411 Query: 402 ALARVNNLTQSILAKAFRGELTAQ-WRA 428 + + ++++ A G++ + W+ Sbjct: 412 GINLLKERKTALISAAVTGKIDVRNWKV 439 >UniRef50_Q30YF3 Subunit S of type I restriction-modification system n=2 Tax=Bacteria RepID=Q30YF3_DESDG Length = 565 Score = 75.1 bits (183), Expect = 5e-12, Method: Compositional matrix adjust. 
Identities = 108/493 (21%), Positives = 207/493 (41%), Gaps = 95/493 (19%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYK-KEQAINYLKDDYLPLIRANNIQNG--KFDTTDLVFV 62 LP+ W + T+ + G + +E+ Y + L I ++ G D + +++ Sbjct: 87 LPQSWTWTRLGTIGNIFNGNSINAREKETKYAGANGLTYIATKDVGYGLDALDYKNGIYI 146 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSA---HQHLPFECSFGA---FCGVLRPEKL 116 P++ ++ KI+ + V+ + G + GK Q + F A F G+ P K Sbjct: 147 PES--EDKFKIAHQGAVLICAEGGSA--GKKCGITEQDICFGNKLFANELFGGI--PSK- 199 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 FI + S ++R ++ G I + A F + +P+PPL EQ I +K+D L+ Sbjct: 200 ----FILYLYLSPVFRESFNAAMTGI-IGGVSIAKFLELPVPLPPLKEQHRIVDKIDQLM 254 Query: 177 AQVDSTK-ARFEQ-----------IPQILK-------------------------RFRQA 199 A+ D + R E+ I Q+L R+ Sbjct: 255 ARCDELENLRTEREEKRLAVHAAAIKQLLDAPDGSAWDFIEQHFGELYTVKENVTELRKG 314 Query: 200 VLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR----------------------- 236 +L AV G+L+E+ N E ++ ++ E ++R Sbjct: 315 ILQLAVMGRLSEQKTNDESVSTLLTNVHAERQRLKIRKTTDLINSPRPLGYEIPEQWKWV 374 Query: 237 -----------NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 NG S + + L +S+ +G +F++ S + L+D Sbjct: 375 CLDDVLIYGPTNGFSPRAVDYETNIRSLTLSATTSGTFKGEYSKFIDADISNDSDLWLRD 434 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 GD+L R N ++E+VGV + + +YPD +++ R++ +Y+ SS AR Sbjct: 435 GDILVQRGN-TIEYVGVSAVYRG-NPGVYVYPDLMMKLRVSSHMDTDYVYYAMSSVPARE 492 Query: 346 AMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + TSG I+ K +KS + +PP++EQ IV ++++L + +++Q+++A Sbjct: 493 YLRAHASGTSGTMPKINQKTLKSLPIPVPPLEEQHRIVVKIKRLMDLCEILDQQIDDATG 552 Query: 405 RVNNLTQSILAKA 417 + L +++A+A Sbjct: 553 KQTELLNAVMAQA 565 >UniRef50_D0LNE2 Restriction modification system DNA specificity domain protein n=1 Tax=Haliangium ochraceum DSM 14365 RepID=D0LNE2_HALO1 Length = 423 Score = 75.1 bits (183), Expect = 6e-12, Method: Compositional matrix adjust. 
Identities = 78/341 (22%), Positives = 142/341 (41%), Gaps = 37/341 (10%) Query: 102 CSFGAFCGV----LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINI 157 SF C LR + + GF+ + + S+S I + + Sbjct: 90 ASFDGLCSADMYPLRAKTSVEPGFLLALLLGEEFSSFAESVSMRTGIPKLNRKELGSYHA 149 Query: 158 PIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 +PPL EQ+ IA L + + T+A EQ+ + K Q +L + G+ Sbjct: 150 RLPPLGEQRKIAAILGAVDEAIARTQAVIEQVQVVKKGLMQDLLTRGLPGR--------- 200 Query: 218 PQHSVFKKLNFESI--------LTELRNGLSSKPNESGVGHP-------ILRISSVRAGH 262 H+ FK+ I L ++ +G+ + + HP +L++SSV +G Sbjct: 201 --HTRFKQTEIGQIPESWSAVRLGDVLDGIDAGWSPKCANHPAGNGEWGVLKVSSVSSGI 258 Query: 263 VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 + + L +++ GD++ R +G L+ VGVC + K + + L+ DK +R Sbjct: 259 YKPEENKMLPDDLIPKPELEVRPGDVIIARASGVLDLVGVCSFVYKTRPR-LMLSDKTLR 317 Query: 323 ARLTKDALPE-YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 R + L Y+ + SP R+ ++ T S + IS K I S V LP + EQ ++ Sbjct: 318 VRPNRTLLDSFYLALTLQSPVVRSLVLEKA-TGSHMRNISQKAIGSVTVALPSLDEQVKV 376 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + A D + V + + L ++++ GE+ Sbjct: 377 SSGIMAMDARIDNDTRSVES----LTELKSALMSVLLTGEV 413 Score = 43.1 bits (100), Expect = 0.025, Method: Compositional matrix adjust. 
Identities = 39/206 (18%), Positives = 93/206 (45%), Gaps = 6/206 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + V I + + A + + +++ +++ +G + + +P Sbjct: 210 GQIPESWSAVRLGDVLDGI-DAGWSPKCANHPAGNGEWGVLKVSSVSSGIYKPEENKMLP 268 Query: 64 KNLV-KESQKISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFCGVLRPEKLIFSG 120 +L+ K ++ P D++IA +SG +VG + ++ P +RP + + Sbjct: 269 DDLIPKPELEVRPGDVIIARASGVLDLVGVCSFVYKTRP-RLMLSDKTLRVRPNRTLLDS 327 Query: 121 FIAHFT-KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F T +S + R+ + + G+++ NI + + + +P L EQ ++ + + A++ Sbjct: 328 FYLALTLQSPVVRSLVLEKATGSHMRNISQKAIGSVTVALPSLDEQVKVSSGIMAMDARI 387 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAV 205 D+ E + ++ +L G V Sbjct: 388 DNDTRSVESLTELKSALMSVLLTGEV 413 >UniRef50_UPI0001907424 putative type I restriction enzyme specificity subunit n=1 Tax=Rhizobium etli GR56 RepID=UPI0001907424 Length = 239 Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 44/102 (43%), Positives = 60/102 (58%), Gaps = 8/102 (7%) Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 T S ISG DI + + +PP++EQ +I RR+E FA D + ++ AL V L ++ Sbjct: 55 TGSDLPHISGDDISTCPIPIPPLEEQHKIARRIESAFAKIDRLAEEARRALQLVGRLDEA 114 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 ILAKAFRGEL Q + P A LLE+I+AERAA+ Sbjct: 115 ILAKAFRGELVPQDENDEP--------AEKLLERIRAERAAA 148 Score = 51.2 bits (121), Expect = 9e-05, Method: Compositional matrix adjust. 
Identities = 28/119 (23%), Positives = 55/119 (46%) Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE 170 R + + F+ S + + + G+++ +I IPIPPL EQ IA Sbjct: 26 FRASEFLTQDFLWWLLSSQTFLSHSLQRATGSDLPHISGDDISTCPIPIPPLEEQHKIAR 85 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 ++++ A++D + Q++ R +A+L A G+L + N EP + +++ E Sbjct: 86 RIESAFAKIDRLAEEARRALQLVGRLDEAILAKAFRGELVPQDENDEPAEKLLERIRAE 144 >UniRef50_C1ZA47 Restriction endonuclease S subunit n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA47_PLALI Length = 413 Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust. Identities = 105/447 (23%), Positives = 177/447 (39%), Gaps = 70/447 (15%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN------GKFDTTDLVFV 62 GW+ + V G+ +K E+ P + I+N G D +D+ ++ Sbjct: 4 GWIYKTLDDVCEFNNGL-WKGEKP---------PFVTVGVIRNTNFTKEGTLDDSDIAYI 53 Query: 63 PKNLVK-ESQKISPEDIVIAMSSGS-KSVVGKSA-HQHLPFECSFGAFCGVLR---PEKL 116 K E +++ D+++ S G K VG+ A + SF F +R P+ L Sbjct: 54 EVEAKKFEKRRLVFGDLILEKSGGGPKQPVGRVALFDKRAGDFSFSNFTAAIRVKDPKTL 113 Query: 117 IFSGFIAHFTKSSLYRNKISSL-----SAGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 F F L+ +S + S I N+ + I +P+PPL EQ+ I Sbjct: 114 DF-----RFLHKFLFWTHLSGVTETMQSHSTGIRNLNGDVYKCIEVPLPPLTEQRRIVGI 168 Query: 172 LDTLLAQVDSTKARFEQIPQ----ILKRFRQAVLGGAVNGKLTEKWRNF-EPQHSVFKKL 226 LD + + KA E+ Q + + QAV +G + + ++ P + Sbjct: 169 LDEAFEGLATAKANAEKNLQNARALFESHLQAVFTQRGDGWVEKTVKDVASPIKGSIRTG 228 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQD 285 F S L L S+ + G+ +L I + A RF+ + +L R+++ Sbjct: 229 PFGSQL------LHSEFVDEGIA--VLGIDNAVANEFRWGKSRFITKDKFGQLERYRVYP 280 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD----------ALPEYIE 335 GD+L T +G CG + + PD + A TK LP Y+ Sbjct: 281 GDVLIT-------IMGTCG-------RCAVVPDDIPTAINTKHICCITLDWKKCLPSYLH 326 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 ++F A + + G++ I+ VLLPP + Q+ IV L + Sbjct: 327 
LYFLHAQQSQAFLAKHAKGAIMAGLNMGLIQELPVLLPPTQVQSAIVEAANDLREETQRL 386 Query: 396 EKQVNNALARVNNLTQSILAKAFRGEL 422 E LA ++ L +S+L +AF GEL Sbjct: 387 ESLYQRKLAALDELKKSLLHRAFSGEL 413 >UniRef50_B3PQK6 Probable type I restriction-modification system protein, specificity subunit n=1 Tax=Rhizobium etli CIAT 652 RepID=B3PQK6_RHIE6 Length = 424 Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 96/430 (22%), Positives = 186/430 (43%), Gaps = 48/430 (11%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDD--YLPLIRANNIQNGKFDTTDLVFVPK 64 PEGW + + + L G T + + +Y +L L + I+ T + + Sbjct: 24 PEGWALERLCDIARLESGHTPSRNRP-DYWDGGIPWLSLHDSKTIEGKVLQNTKMTISAR 82 Query: 65 NLVKESQKISPEDIVI---AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L S ++ PE V + G +++G+ F C CG P + + + Sbjct: 83 GLANSSARLLPEGTVALSRTATIGKVALLGREMATSQDFACYI---CG---PR--LLNKY 134 Query: 122 IAHFTKSSLYRN---KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL---DTL 175 +AH L+R + L AG+ N I +F+ + I +PP+ EQ+ IA+ L D L Sbjct: 135 LAH-----LFRGMELEWERLMAGSTHNTIYMPTFENMQILVPPMEEQEAIADALSDADAL 189 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR--NFEPQHSVFKKLNFESILT 233 + ++ A+ I Q G + LT K R + + ++ K +F S Sbjct: 190 IEGLERLIAKKWLIKQ-----------GTMQDLLTAKRRLPGYSAEWTMAKLGDFLS--- 235 Query: 234 ELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIR-FLECSESELNRHKLQDGDLLFT 291 +NGL+ G G PI+ V R G +++ I +E +E+E + + +++GD+LFT Sbjct: 236 -FKNGLNKAKAFFGHGTPIINYMDVFRGGAINEGSIDGLVEVTEAEQSAYGIRNGDVLFT 294 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNC 350 R + + E +G+ + + ++ ++R R AL + + F S + R +++ Sbjct: 295 RTSETPEEIGLAAVADGVL-DGTVFSGFVLRGRPKSQALTIAFSKYCFRSGAVRRQIISR 353 Query: 351 VKTTSGQKGISGKDIKSQVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 T+ + +G+ + + + +P EQ I + + A +E +++ A + Sbjct: 354 ATYTT-RALTNGRQLSAVDISVPRDADEQNAIAEVLNDMDAEIQALETRLDKARQVKEGM 412 Query: 410 TQSILAKAFR 419 Q++L R Sbjct: 413 MQNLLTGRIR 422 >UniRef50_UPI0001BC364B restriction modification system DNA specificity subunit n=1 Tax=Butyrivibrio 
crossotus DSM 2876 RepID=UPI0001BC364B Length = 428 Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 101/440 (22%), Positives = 188/440 (42%), Gaps = 48/440 (10%) Query: 3 AGKLPEGWVIAPVSTVTTL---IRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTT 57 GK+PE W + L I G + + Q ++ K ++A N Q GK Sbjct: 12 VGKIPENWKVLKNKYNFELSKEIIGTKWVETQLLSLTKYG----VKAINDGEQTGK---- 63 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 VP++L QK++ +DIV+ + S V F+ +R + + Sbjct: 64 ----VPESL-STYQKVNKDDIVMCLFDLDCSAVFSGISN---FDGMISPAYKCIRCKPHL 115 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPI--PPLAEQKIIAEKLDTL 175 ++ ++ ++ K S + +S + +N+PI PP+ QK IAE L+ Sbjct: 116 CPQYVDYYFRTVFVDRKYKRYSKNVRFS---ISSDEFMNLPIIVPPIDIQKKIAEFLNFK 172 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE---------PQHSVFKKL 226 ++D+ + E+ + L+ ++++++ AV L + P+H L Sbjct: 173 CFEIDTLHSDIEKQIKTLEEYKKSIITEAVTKGLDPDVEMKDSGISYIGNIPKHWKVTNL 232 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQD 285 + L + +NG+S G G P + V + + QN + +++E N + ++ Sbjct: 233 KY---LGKCQNGISKGGEYFGNGFPFVSYGDVYKNYSIPQNVDGLIMSTKTEQNIYSVKY 289 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSAR 344 GD+ FTR + ++E +G K N ++ LIR R T D +PE+ + +F S R Sbjct: 290 GDVFFTRTSETIEEIGFASTCLK-SIDNSVFAGFLIRFRPTSSDLIPEFSKFYFRSNIHR 348 Query: 345 NAM---MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 MN V S + + G+ VLLPP+ EQ I + +E+ A D ++ Sbjct: 349 KFFVKEMNLVTRASLSQNLLGR----LPVLLPPLCEQQMIAKNLEKKCAEIDGAIEEKKE 404 Query: 402 ALARVNNLTQSILAKAFRGE 421 L + +S++ + G+ Sbjct: 405 QLETLEQYKKSLIYEYVTGK 424 >UniRef50_B0BR05 Type I restriction enzyme EcoAI specificity protein n=3 Tax=Bacteria RepID=B0BR05_ACTPJ Length = 470 Score = 74.3 bits (181), Expect = 9e-12, Method: Compositional matrix adjust. 
Identities = 56/177 (31%), Positives = 94/177 (53%), Gaps = 13/177 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD-TTDLVFVP 63 ++PE WV + + G+TY + D ++R+ NIQ+GK D ++D+V V Sbjct: 299 EIPESWVWVRLGEIGETNIGLTYNPSD----VASDGTIVLRSGNIQDGKIDVSSDIVKV- 353 Query: 64 KNL-VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 NL + E+++ D++I +GSK +VGK+A SFGAF + R F+ +I Sbjct: 354 -NLDIPENKRCYKNDLLICARNGSKKLVGKAAIIDKD-GYSFGAFMAIFRSP---FNKYI 408 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ S L+RN ++ IN I ++ + IP+P L EQ I EK++TL + + Sbjct: 409 YYYLSSPLFRNDFDGINT-TTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTL 464 Score = 62.4 bits (150), Expect = 4e-08, Method: Compositional matrix adjust. Identities = 72/294 (24%), Positives = 126/294 (42%), Gaps = 70/294 (23%) Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRF----RQAVLGGAVNGKLTEK 212 IP+PPL EQK I K++ LL ++ + E++ + ++F ++++L A+ GKLTE+ Sbjct: 179 IPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTEQ 238 Query: 213 WRNFEPQHSVF-------------KKLNFESILTE--LRN-------------------- 237 N EP ++ KKL +++E +R+ Sbjct: 239 NPNDEPASALIERIKAEKLRLIAEKKLKKPKVISEIIMRDNLPYEIVNGKERCIADEVPF 298 Query: 238 -------------------GLSSKPNE-SGVGHPILRISSVRAGHVD-QNDIRFLECSES 276 GL+ P++ + G +LR +++ G +D +DI + Sbjct: 299 EIPESWVWVRLGEIGETNIGLTYNPSDVASDGTIVLRSGNIQDGKIDVSSDIVKVNLDIP 358 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 E R DLL NGS + VG ++ K + + + R+ K YI Sbjct: 359 ENKR--CYKNDLLICARNGSKKLVGKAAIIDKDGYSFGAFM-AIFRSPFNK-----YIYY 410 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 + SSP RN TT Q I+ ++ ++++ LP + EQ IV ++E LF+ Sbjct: 411 YLSSPLFRNDFDGINTTTINQ--ITQSNLNNRLIPLPSLNEQLRIVEKIETLFS 462 Score = 51.6 bits (122), Expect = 7e-05, Method: Compositional matrix adjust. 
Identities = 37/108 (34%), Positives = 57/108 (52%), Gaps = 12/108 (11%) Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA----YADTIEKQVNNAL 403 +N T + Q G++ I ++ LPP+ EQ IV ++E+L YA+ EK Sbjct: 157 LNQYATATAQPGLAVNKINDVLIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQ 216 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 L +SIL A +G+LT Q NP+ + A+AL+E+IKAE+ Sbjct: 217 QFPEQLKKSILQAAIQGKLTEQ----NPN----DEPASALIERIKAEK 256 >UniRef50_Q8TP07 Type I site-specific deoxyribonuclease n=1 Tax=Methanosarcina acetivorans RepID=Q8TP07_METAC Length = 487 Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 120/506 (23%), Positives = 198/506 (39%), Gaps = 95/506 (18%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG-KFDTTDLVFVP 63 KLPEGWV + + L G + + +N K +P + +G K T P Sbjct: 17 KLPEGWVWIRLDSAGELFCGQSPSIAE-VNQEKRG-VPYVTGPEQWDGSKIKETKWTEFP 74 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPEKLIFSGFI 122 K LV PE + G+ VGK P C+ G P + + Sbjct: 75 KRLV-------PEGCIFITVKGAG--VGKI----FPGVSCAIGRDIYAFLPSSKVDFKYT 121 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 H K + + + A +I + I + L EQ+ I K++ L +++D+ Sbjct: 122 LHAIKHQI---DVLIMKAQGDIPGLSKNHILDHVIGLCSLEEQRAIVFKIEQLFSELDNG 178 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE------------------------- 217 A + + LK +RQAVL A G+LT+KWR + Sbjct: 179 IANLKLAQEQLKVYRQAVLKKAFEGELTKKWREQQVDLPDAGELLERIRKEREEVAKDTG 238 Query: 218 --------------------PQHSVFKKLNFESILTELRNGLSS-KPNES----GVGHPI 252 P+ ++ KL++ L +L G S +P G +P Sbjct: 239 KKVKIIKPPTNAELVELPMIPKEWMWVKLDY---LGDLGRGKSKHRPRNDKTLFGGKYPF 295 Query: 253 LRISSVRAGHVDQNDIRFLECSESE--LNRHKLQ-DGDLLFTRYNGSLE--FVGVCGLLK 307 ++ V+A + + I+ E + S+ L + KL G L T E F+G G Sbjct: 296 IQTGEVKAAN---HTIKSFEKTYSDVGLAQSKLWPKGTLCITIAANIAETAFLGFEGC-- 350 Query: 308 KLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 +PD ++ + + EY+ FF A + + + QK I+ ++ Sbjct: 351 
--------FPDSIVGFTAIESLVGKEYVYYFFK---ANQSKIESFAPATAQKNINLNILE 399 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 + ++ L + EQ +IV+ +E + D IE+ + L + L QSIL KAF G+L + Sbjct: 400 NLLIPLCSLPEQQDIVQEIETRLSVCDKIEQDIETNLEKAEALRQSILKKAFEGKLLNER 459 Query: 427 RAENPDLISGENSAAALLEKIKAERA 452 E A LLE++K+ERA Sbjct: 460 ELEEVRGAEDWEPAEVLLERVKSERA 485 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 39/103 (37%), Positives = 55/103 (53%), Gaps = 7/103 (6%) Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 G+S I V+ L ++EQ IV ++EQLF+ D + A ++ Q++L KAF Sbjct: 142 GLSKNHILDHVIGLCSLEEQRAIVFKIEQLFSELDNGIANLKLAQEQLKVYRQAVLKKAF 201 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKIKAER---AASGGKK 458 GELT +WR + DL A LLE+I+ ER A GKK Sbjct: 202 EGELTKKWREQQVDL----PDAGELLERIRKEREEVAKDTGKK 240 >UniRef50_A7I739 Restriction modification system DNA specificity domain n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7I739_METB6 Length = 457 Score = 73.9 bits (180), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 63/237 (26%), Positives = 102/237 (43%), Gaps = 33/237 (13%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKNLVK 68 W P+ + ++ G +K E + PLIR +I N K + D VF Sbjct: 25 WERVPLGKIAKVLNGFAFKSEL---FNDKKGTPLIRIRDIGNNKTECYYDGVF------D 75 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGFIAHFTK 127 E+ I P D+++ M F CS L +++ I + + Sbjct: 76 EAYVIHPGDLLVGMDGD--------------FNCSTWRGPKALLNQRVCKIEVNIEQYNR 121 Query: 128 SSL------YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 L Y I+ ++ + ++ S I +P PPL EQ+ I +++ LL+ V++ Sbjct: 122 KFLEYVLPGYLKAINENTSSQTVKHLSSRSISEILLPNPPLTEQQRIVARVEALLSHVNA 181 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL--NFESILTELR 236 + R ++P I+K+FRQAVL A +G LTE WR P KL ESI + + Sbjct: 182 ARERLSRVPLIMKKFRQAVLAAACSGGLTEGWRKENPDIEEANKLVKRLESIRKQFK 238 Score = 71.2 bits (173), Expect = 8e-11, Method: Compositional matrix adjust. 
Identities = 39/92 (42%), Positives = 55/92 (59%), Gaps = 1/92 (1%) Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y+ +S RN V++ G + +DI + ++ LPP+ EQ EIVRRV LF Sbjct: 364 DYLFWLLNSMFIRNQAFENVRSI-GVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFER 422 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 AD I+++V A R LTQ++L KAFRGELT Sbjct: 423 ADAIDREVEAATRRCERLTQAVLGKAFRGELT 454 Score = 52.8 bits (125), Expect = 3e-05, Method: Compositional matrix adjust. Identities = 34/101 (33%), Positives = 52/101 (51%), Gaps = 3/101 (2%) Query: 352 KTTSGQ--KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + TS Q K +S + I ++ PP+ EQ IV RVE L ++ + ++++ + Sbjct: 137 ENTSSQTVKHLSSRSISEILLPNPPLTEQQRIVARVEALLSHVNAARERLSRVPLIMKKF 196 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAE 450 Q++LA A G LT WR ENPD I N LE I+ + Sbjct: 197 RQAVLAAACSGGLTEGWRKENPD-IEEANKLVKRLESIRKQ 236 Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust. Identities = 37/148 (25%), Positives = 65/148 (43%), Gaps = 2/148 (1%) Query: 64 KNLVKESQKISPEDIVIAMSS-GSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + ++ S+K P + I S G+ + A Q + F S+ + + + ++ S ++ Sbjct: 308 EEFLRLSKKFVPRPLDILYSRIGADLGKARKAPQDIKFHISY-SLAVIRQLGEMENSDYL 366 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 S RN+ + ++ D IP+PPLAEQ I ++ L + D+ Sbjct: 367 FWLLNSMFIRNQAFENVRSIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFERADAI 426 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 E + +R QAVLG A G+LT Sbjct: 427 DREVEAATRRCERLTQAVLGKAFRGELT 454 >UniRef50_UPI0001BC2C80 restriction endonuclease S subunits-like protein n=1 Tax=Brevibacterium linens BL2 RepID=UPI0001BC2C80 Length = 383 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 82/311 (26%), Positives = 142/311 (45%), Gaps = 37/311 (11%) Query: 121 FIAHFTKSSLYRNKISSLSAG--ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ HF S ++ ++ + AG ++ +P I P+PPL EQ+ IA LD + Sbjct: 101 YLVHFLMSDVFHHQFLNTVAGVGGSLLRARPQYVRSIMAPLPPLDEQRRIAAILD----K 156 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI--LTELR 236 D+ + + Q L+ Q++ +L E + +I + +L+ Sbjct: 157 ADAIRQKRRQATTHLETLAQSIFQTMFGSRLAES--------------SSTTIGDVAQLQ 202 Query: 237 NG--LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 G LSS + + + +L+ISSV +G + + + S H GDLL +R N Sbjct: 203 GGKSLSSIDDSAATKNRVLKISSVTSGTFKPWESKPVPDDYSPPLSHFSHKGDLLISRAN 262 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 S E VG L+ ++ Q LL PDK+ R + PEY + + R + N + Sbjct: 263 TS-ELVGASALV-HVEPQGLLLPDKIWRFDWLIETQPEYFFHLLRTKAIRGRISNMATGS 320 Query: 355 SG-QKGISGKDIKSQVVLLPPVK--EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 G K IS + S V +P ++ EQ E V++V ++ D + + + + A + L Sbjct: 321 GGSMKNISKPKLLS--VQIPRIESNEQREFVKQVRKV----DVLRAKFDESNA--DQLFA 372 Query: 412 SILAKAFRGEL 422 S+ ++AFRGEL Sbjct: 373 SLQSRAFRGEL 383 >UniRef50_B3H2F5 Type I restriction-modification system, S subunit n=2 Tax=Bacteria RepID=B3H2F5_ACTP7 Length = 508 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. 
Identities = 56/177 (31%), Positives = 94/177 (53%), Gaps = 13/177 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD-TTDLVFVP 63 ++PE WV + + G+TY + D ++R+ NIQ+GK D ++D+V V Sbjct: 337 EIPESWVWVRLGEIGETNIGLTYNPSD----VASDGTIVLRSGNIQDGKIDVSSDIVKV- 391 Query: 64 KNL-VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 NL + E+++ D++I +GSK +VGK+A SFGAF + R F+ +I Sbjct: 392 -NLDIPENKRCYKNDLLICARNGSKKLVGKAAIIDKD-GYSFGAFMTIFRSP---FNKYI 446 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ S L+RN ++ IN I ++ + IP+P L EQ I EK++TL + + Sbjct: 447 YYYLSSPLFRNDFDGINT-TTINQITQSNLNNRLIPLPSLNEQLRIVEKIETLFSTL 502 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. Identities = 101/452 (22%), Positives = 190/452 (42%), Gaps = 87/452 (19%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLVFV 62 ++P+ WV + + +I G T K + N+ K +P I +++ +GK+ + + Sbjct: 70 EIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNKGS-IPWITPADMKYISGKYISKGNRNI 128 Query: 63 PKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 +N ++ S +S IV + S++ +G A C+ F + +++ Sbjct: 129 TENGLRSSSTRLLSKNSIVYS----SRAPIGYIAITETEL-CTNQGFKSID-----LYNK 178 Query: 121 FIAHFTKSSL--YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 I + SL + +I S ++G I +F IP+PPL EQK I K++ LL Sbjct: 179 EIVDYLYYSLIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPY 238 Query: 179 VDSTKARFEQIPQILKRF----RQAVLGGAVNGKLTEKWRNFEPQHSVF----------- 223 ++ + E++ + ++F ++++L A+ GKLT++ N EP + Sbjct: 239 IEQYAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLI 298 Query: 224 --KKLNFESILTE--LRN---------------------------------------GLS 240 KKL +++E LR+ GL+ Sbjct: 299 AEKKLKKPKVVSEIILRDNLPYEIVNGKERCIADEVPFEIPESWVWVRLGEIGETNIGLT 358 Query: 241 SKPNE-SGVGHPILRISSVRAGHVD-QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 P++ + G +LR +++ G +D +DI + E R DLL NGS + Sbjct: 359 YNPSDVASDGTIVLRSGNIQDGKIDVSSDIVKVNLDIPENKR--CYKNDLLICARNGSKK 416 Query: 299 
FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG ++ K + + + + +YI + SSP RN TT Q Sbjct: 417 LVGKAAIIDKDGYSFGAF------MTIFRSPFNKYIYYYLSSPLFRNDFDGINTTTINQ- 469 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 I+ ++ ++++ LP + EQ IV ++E LF+ Sbjct: 470 -ITQSNLNNRLIPLPSLNEQLRIVEKIETLFS 500 Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust. Identities = 34/98 (34%), Positives = 50/98 (51%), Gaps = 12/98 (12%) Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA----YADTIEKQVNNALARVNNLTQSI 413 K ISG + ++ LPP+ EQ IV ++E+L YA+ EK L +SI Sbjct: 205 KEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQYAEKEEKLTALHQQFPEQLKKSI 264 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 L A +G+LT Q + P L+ L+E+IKAE+ Sbjct: 265 LQAAIQGKLTKQDPNDEPALV--------LIERIKAEK 294 >UniRef50_A3XPV6 Type I restriction-modification system specificity subunit n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XPV6_9FLAO Length = 502 Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 63/216 (29%), Positives = 104/216 (48%), Gaps = 21/216 (9%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 + E WV + ++ L G +K + Y KD +P+IR +IQ+ D + + N Sbjct: 1 MREDWVECTLGSLLKLKNGYAFKSSK---YQKDG-IPVIRIGDIQDWNVDIENAKRIDDN 56 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVG-----KSAHQHLPFECSFGAFCGVLRP--EKLIF 118 + +S ++ DI+IAMS + G K A+Q+ G L P E+L Sbjct: 57 IEYDSHIVNKGDILIAMSGATTGKFGIYNSDKKAYQN--------QRVGNLIPHSEELTS 108 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + +I ++ SL R+ I + G NI + + + PL Q+ I +K++ L + Sbjct: 109 NNYI-YYLLYSLKRD-IEQQAYGGAQPNISATKIEALKTKLFPLPIQQAIVKKIEELFSS 166 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 +DS A ++ LK +RQAVL A GKLT++WR Sbjct: 167 LDSGIADLKKAQDQLKIYRQAVLKKAFEGKLTKEWR 202 Score = 60.1 bits (144), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 50/202 (24%), Positives = 104/202 (51%), Gaps = 17/202 (8%) Query: 257 SVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNL 314 +++ G +D +I ++ E E+ R ++ GD+L+ + + V L ++ + Sbjct: 309 NIKEGRIDLRNISYVTQEDHEAIFGRCDVKKGDVLYIKDGATTGRAAVNTLEEEFSLLSS 368 Query: 315 LYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP 374 + + I++ + P+++E F ++ RN M++ + + + ++ + + + L Sbjct: 369 VGVFRTIKSFIN----PKFLESFLNAQVTRNRMLSNIAGVAITR-LTLVKLNNSMFSLCS 423 Query: 375 VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL-----TAQWRAE 429 V+EQ +IV+ +E + D +E+ + ++L + L QSIL KAF G L A+ +A Sbjct: 424 VEEQHQIVQEIESRLSVCDAVEQNIQDSLEKAQALRQSILKKAFEGTLLSDKEIAKCKA- 482 Query: 430 NPDLISGENSAAALLEKIKAER 451 +PD A+ LLE+IKAE+ Sbjct: 483 HPDY----EPASVLLERIKAEK 500 Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust. Identities = 61/226 (26%), Positives = 105/226 (46%), Gaps = 22/226 (9%) Query: 232 LTELRNGL---SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 L +L+NG SSK + G+ P++RI ++ +VD + + ++ E + H + GD+ Sbjct: 13 LLKLKNGYAFKSSKYQKDGI--PVIRIGDIQDWNVDIENAKRID-DNIEYDSHIVNKGDI 69 Query: 289 LFTRYNGSLEFVGVCGLLKK-LQHQNL--LYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 L + G+ KK Q+Q + L P LT + Y+ + Sbjct: 70 LIAMSGATTGKFGIYNSDKKAYQNQRVGNLIPHS---EELTSNNYIYYLLYSLKRDIEQQ 126 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 A Q IS I++ L P+ Q IV+++E+LF+ D+ + A + Sbjct: 127 AY------GGAQPNISATKIEALKTKLFPLPIQQAIVKKIEELFSSLDSGIADLKKAQDQ 180 Query: 406 VNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 + Q++L KAF G+LT +WR + +L + E LL++IK ER Sbjct: 181 LKIYRQAVLKKAFEGKLTKEWREKQTELPTAEE----LLKEIKKER 222 >UniRef50_D2QTT7 Restriction modification system DNA specificity domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTT7_9SPHI Length = 441 Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 84/339 (24%), Positives = 143/339 (42%), Gaps = 49/339 (14%) Query: 109 GVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKII 168 VL P LI F+A KS + ++S+ S G + I + I PPL+EQ I Sbjct: 125 AVLNPLPLIMPKFLAMAVKSDSFTEQVSANSKGMSYPAINSTELGCLAICFPPLSEQTRI 184 Query: 169 AEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT--------------EKWR 214 AE LD AQ+D A+ EQ+ ++L RQ ++ AV L +W Sbjct: 185 AEFLDRKTAQIDQAIAQKEQLIELLNERRQVMIHRAVTRGLNPNAPMKDSGIDRGDARWI 244 Query: 215 NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV--DQNDIRFLE 272 P H ++N+ L ++ +E+G L I S+ +G D +D + Sbjct: 245 GEIPAHWEVSRINW----------LFTEKDETGYPDLPLLIVSINSGVTVRDMDDTEIRK 294 Query: 273 CSESELNRHKLQ-DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + N +K GD+ F + VGV + L+ PD ++ AR Sbjct: 295 QVAEDFNVYKRALAGDIAFNKMRMWQGAVGV------VPQDGLVSPDYVV-ARPNNFVNS 347 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSG----QKGISGKDIKSQVVLLPPVKEQAEIVRRV-- 385 Y F + R + VK + G + + +D KS ++PP++EQ +IV + Sbjct: 348 AYYGFLFKT---REYLAEFVKHSHGIAWDRNRLYWEDFKSIFAMVPPLEEQNQIVDFLNA 404 Query: 386 --EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 E++ + I+KQ+ ++ L +++ A G++ Sbjct: 405 QNEEMSFASTKIQKQIQ----KLQELKSTLINSAVTGKI 439 >UniRef50_B7CAX1 Putative uncharacterized protein n=1 Tax=Eubacterium biforme DSM 3989 RepID=B7CAX1_9FIRM Length = 365 Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 67/279 (24%), Positives = 125/279 (44%), Gaps = 22/279 (7%) Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 L++ ++ HF + Y ++ S G I IK + + + +P + EQK I ++ Sbjct: 82 LLYMPYLYHFLEG--YIGELRKQSIGGVIKYIKLGNLTDVLVELPSIVEQKYIVNLMNIS 139 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 L ++ K +++ ++K + G L KW + + +++ E Sbjct: 140 LELIELRKKTIDKLDSLVKARFIEMFGDPYTNPL--KWEKLKIK---------DAVTVEP 188 Query: 236 RNGLSSKPNESGV----GHPILRISSVRAGHV-DQNDIRFLECSESELNRHKLQDGDLLF 290 +NGL KP V G PILRI G V D ++ L+CSE+E ++ L + D++ Sbjct: 189 QNGLY-KPQSDYVTDRSGIPILRIDGFYDGIVTDFASLKRLKCSETEKQKYLLLEDDIVI 247 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE-YIEIFFSSPSARNAMMN 349 R N S+E++G C +K L ++ +Y ++R + YI S + ++N Sbjct: 248 NRVN-SIEYLGKCAHIKGLL-EDTVYESNMMRMHFDPETYNSVYICKLLCSQFIYDQIVN 305 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 K Q I+ KD+ + PP+ Q + ++Q+ Sbjct: 306 HAKKAVNQASINQKDVLDFNIYQPPIDLQNQFADFIQQV 344 >UniRef50_Q028F8 Putative uncharacterized protein n=1 Tax=Candidatus Solibacter usitatus Ellin6076 RepID=Q028F8_SOLUE Length = 169 Score = 73.6 bits (179), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 42/92 (45%), Positives = 55/92 (59%) Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 GQ IS + +S +V +P +EQ EI+RRVE FA AD +E + NA A V+ LTQSIL+ Sbjct: 40 GQSNISLEQCRSLIVSVPSSQEQREIIRRVEAFFALADRLEARCTNAKAHVDKLTQSILS 99 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKI 447 KAFRG+L A SAA LL ++ Sbjct: 100 KAFRGQLVETEAALAEAERREFESAAELLARL 131 >UniRef50_A4FXL8 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus maripaludis C5 RepID=A4FXL8_METM5 Length = 447 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 72/325 (22%), Positives = 142/325 (43%), Gaps = 23/325 (7%) Query: 112 RPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 RP + S ++ + KS +++I G + +I I++ +PP+ EQ+ IA+ Sbjct: 127 RPLINVNSTYLGYLLKSPDIKSQIQKRVVGIKVYSITQKILKSISLILPPVDEQQEIAQY 186 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSV 222 LD + Q+DS + + K ++Q+++ V L +W P+H Sbjct: 187 LDDKVGQIDSIIEKTKSSIDEYKSYKQSIITETVTKGLDPTVTMKDSGIEWIGDIPEHWD 246 Query: 223 FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR-FLECSESELNRH 281 K+ + L L+NG+S + G G+P + V + + +E +E + + + Sbjct: 247 IIKIRY---LGTLQNGISKSSSYFGSGYPFVSYGDVYKNYELPKSVEGLVESNEFDKSNY 303 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL--PEYIEIFFS 339 ++ GD+ FTR + +++ +G + + ++ LIR R L P Y + +F Sbjct: 304 SVEYGDVFFTRTSETIDEIGFTATCMHTMN-DAVFAGFLIRFRPFDSKLLNPLYSKYYFR 362 Query: 340 SPSARNAM---MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 S R MN V S +S + +K VL+PP EQ I + +E+ D + Sbjct: 363 SDMHRRFFVKEMNLVTRAS----LSQELLKKLPVLVPPHNEQIAIGKFIEETCQTIDQLI 418 Query: 397 KQVNNALARVNNLTQSILAKAFRGE 421 + + + +S++ + G+ Sbjct: 419 TKKQQLITELKAYKKSLIYEVVTGK 443 Score = 41.2 bits (95), Expect = 0.079, Method: Compositional matrix adjust. Identities = 31/135 (22%), Positives = 62/135 (45%), Gaps = 5/135 (3%) Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 +GD +F + +E G C +++ ++ + I R + Y+ SP + Sbjct: 88 EGDFIFCDTSEDIEGSGNCLFIRESNNKPIFAGSHTILGRPLINVNSTYLGYLLKSPDIK 147 Query: 345 NAMMNCVKTTSGQK--GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + + K G K I+ K +KS ++LPPV EQ EI + ++ D+I ++ ++ Sbjct: 148 SQIQ---KRVVGIKVYSITQKILKSISLILPPVDEQQEIAQYLDDKVGQIDSIIEKTKSS 204 Query: 403 LARVNNLTQSILAKA 417 + + QSI+ + Sbjct: 205 IDEYKSYKQSIITET 219 >UniRef50_C7QRY1 Restriction modification system DNA specificity domain protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QRY1_CYAP0 Length = 456 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 90/385 (23%), Positives = 166/385 (43%), Gaps = 52/385 (13%) Query: 72 KISPEDIVIAM--SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE--------KLIFSGF 121 KI+ DI+I+ + G +VV K+ Q G++ P + I S + Sbjct: 93 KINSGDILISCVGTFGKVAVVPKNIEQ------------GIINPRLIKLIPITEYINSVY 140 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + KS + ++ LS G + I I +PIPPL EQ+ IA+ LD A++D Sbjct: 141 LEKLLKSVVAFEQMEKLSRGGTMGVINIGLLSDILLPIPPLPEQEKIAQFLDKETAKIDK 200 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 E++ ++LK R A++ AV L +W F P+H K+L + I+ Sbjct: 201 LITLKERLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFIPEHWEVKRLKY--IV 258 Query: 233 TELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFLECSESELN-RHKLQDGDLL 289 + G+ P + V G P LR ++ +G +D +++ F+ +EL+ + K+ GDL+ Sbjct: 259 PNITVGIVVTPAKYYVESGIPCLRSVNISSGKIDNSNLVFISSQSNELHQKSKIYKGDLV 318 Query: 290 FTRYNGSLEFVGVCG----LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 R GV G + N + L+ R ++ L Y+ + +S S + Sbjct: 319 LVR-------TGVTGTAAIVTDNFDGANCV---DLLIIRNSRLILTLYLYYYLNS-STTS 367 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 +N + Q + + ++ PP +EQ +I +++ D I + ++ Sbjct: 368 YQVNNYSVGAIQAHYNTSTLSELIITFPPPQEQQKIAEYLDRKTEQIDQIINKTRESIEY 427 Query: 406 VNNLTQSILAKAFRGELTA-QWRAE 429 + +++ A G++ QW E Sbjct: 428 LKEYRTVLISAAVTGKIDVRQWGGE 452 Score = 61.6 bits (148), Expect = 5e-08, Method: Compositional matrix adjust. 
Identities = 61/222 (27%), Positives = 103/222 (46%), Gaps = 27/222 (12%) Query: 4 GKLPEGW------VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT 57 G +PE W I P TV ++ Y E I P +R+ NI +GK D + Sbjct: 243 GFIPEHWEVKRLKYIVPNITVGIVVTPAKYYVESGI--------PCLRSVNISSGKIDNS 294 Query: 58 DLVFVPK--NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCG---VLR 112 +LVF+ N + + KI D+V+ + V G +A F+ GA C ++R Sbjct: 295 NLVFISSQSNELHQKSKIYKGDLVLVRTG----VTGTAAIVTDNFD---GANCVDLLIIR 347 Query: 113 PEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 +LI + ++ ++ SS ++++ S GA + ++ + I PP EQ+ IAE L Sbjct: 348 NSRLILTLYLYYYLNSSTTSYQVNNYSVGAIQAHYNTSTLSELIITFPPPQEQQKIAEYL 407 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL-TEKW 213 D Q+D + + + LK +R ++ AV GK+ +W Sbjct: 408 DRKTEQIDQIINKTRESIEYLKEYRTVLISAAVTGKIDVRQW 449 >UniRef50_C4KBJ9 Restriction modification system DNA specificity domain protein n=1 Tax=Thauera sp. MZ1T RepID=C4KBJ9_THASP Length = 390 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 88/338 (26%), Positives = 151/338 (44%), Gaps = 28/338 (8%) Query: 91 GKSAHQHLPFECSFGAF-CGVLRPEKLIFSG-FIAHFTKSSLYRNKISSLSAGANINNIK 148 GK A HLP FG+ V+RP++ + G ++ H + + R + G+ Sbjct: 75 GKIAQAHLPHPNGFGSTEFHVIRPKESLLDGRYLHHLLRQADIRVEGERRMTGSGGQRRV 134 Query: 149 PASF-DLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNG 207 PA+F + IP+P L EQ+ +A LD Q D+ +A+ + +L ++ + Sbjct: 135 PATFLSSLRIPLPRLEEQRRVAAILD----QADALRAKRRKALALLDELQRGIFIEMFGD 190 Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE--SGVGHPILRISSVRA-GHVD 264 +T P+ L + E++ G NE S G I+RI+ + A G +D Sbjct: 191 PVTS------PKGCTAGTLG--DGIEEMQYGPRFH-NEAYSPEGIRIVRITDLDAAGSLD 241 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + + +E E ++ L+ GD++F R + VG L+K+ + + IR R Sbjct: 242 FDSMPRMEVDEETRDKFALRAGDVVFARTGAT---VGKVALIKE-RDPVCIAGAYFIRMR 297 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 LPEY S S +++ + Q+ SG ++ + +P ++ Q R Sbjct: 298 FQSRILPEYAFSVLQSESV-QSLIFAQSRQAAQQNFSGPGLRRLPMPVPSIERQRRFAER 356 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 VE A KQ++ ALA ++ L S+ +AFRGEL Sbjct: 357 VE---AVGSEKSKQLS-ALALLDELFSSLQHRAFRGEL 390 >UniRef50_C7RQC3 Type I restriction-modification system specificity subunit n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RQC3_9PROT Length = 475 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 84/402 (20%), Positives = 167/402 (41%), Gaps = 38/402 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV- 62 G++P W + + +L++GV KE A+ +P +R G+ TT FV Sbjct: 20 GQVPGHWDVRKPRHIGSLLKGVGGTKEDALP----AGVPCVR-----YGELYTTHAYFVR 70 Query: 63 -PKNLVKESQK-----ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 PK + + + D++ A S + +GKSA + G +LRP Sbjct: 71 RPKTFIHADRAADYTPLHYGDVLFAASGETLEDIGKSAVNLIDGTAVCGGDVIILRPSVP 130 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + F+ + N+ +++ G + ++ P + P+PP+ EQ I L+ Sbjct: 131 VHAPFLGYVMDCRPLANQKATMGRGTTVKHVYPDELKHLVFPLPPVPEQAAIVRFLNWAN 190 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLN 227 +++ ++ +L +QA++ AV L W P+H +L Sbjct: 191 GRLERAIRAKRKVIALLNEQKQAIVHRAVTRGLDPSVPLKPSGIPWLGDIPRHWRVWRLK 250 Query: 228 FESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQ 284 F ++ + + L + P S G HP +R + + AG V + + + + R + Q Sbjct: 251 FVAL--NIVDCLHATPRYSDAGTHPAIRTADIVAGVVLVDQAKKVSSRDYARWTTRLQPQ 308 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 +GD+L++R E G+ + L +++ R+ +++ +S S Sbjct: 309 EGDILYSREG---ERFGIAACVPAA--TQLCISQRMMVFRIATQHCSKFVMWLLNSRSTY 363 Query: 345 N-AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 A+ + + T+ IS I++ + LP +EQ +V R+ Sbjct: 364 GQALQDVMGATAPHVNIS--TIRNYYLALPLKREQEAVVERI 403 >UniRef50_B7KLD8 Restriction modification system DNA specificity domain protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KLD8_CYAP7 Length = 238 Score = 72.8 bits (177), Expect = 2e-11, Method: Compositional matrix adjust. 
Identities = 50/177 (28%), Positives = 90/177 (50%), Gaps = 11/177 (6%) Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK--- 307 P LR+++V+ G++D +D+ +E +E E+N+ KLQ GDLL T G + +G K Sbjct: 68 PYLRVANVKDGYLDLSDVYQIEATEEEINKLKLQFGDLLLTE-GGDPDKLGRGSFWKNKI 126 Query: 308 -KLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + HQN +Y R R D P +I SP ++ + K T+G I+ + + Sbjct: 127 SECIHQNHIY-----RVRFNFDEFYPPFISAQIGSPYGKSYFLAHAKQTTGIATINQQVL 181 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 K+ ++ P ++ Q +I + + + + + + L +N L ++L +AF GEL Sbjct: 182 KNFPLMNPSLEIQKQIASTLTEQMQEVERLTQSLQEQLDTINKLPAALLKRAFNGEL 238 >UniRef50_B9M293 Restriction endonuclease S subunit-like protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M293_GEOSF Length = 644 Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 102/442 (23%), Positives = 192/442 (43%), Gaps = 57/442 (12%) Query: 5 KLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFV 62 +LPE W +A V V L G K + D P IR +N+ +GK + + Sbjct: 2 RLPESWRVATVGNVLLDLQPGFAQKPGEE----DDGTTPQIRTHNVTPDGKITLEGIKHI 57 Query: 63 PKNLVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSG 120 + + ++ K+ D+V ++ S+ VGK+A + E F LRP +L+ Sbjct: 58 SASAKETARYKLMMGDVVFN-NTNSEEWVGKTAVFNQEGEYVFSNHMTRLRPHPELVTPE 116 Query: 121 FIAHFTKS----SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 ++A + + + + A I + ASF L +P L EQ I + +L Sbjct: 117 YLAFYLHQLWAIGYSKTRAKRWVSQAGIESKAIASFKL---SLPTLPEQHRIID----VL 169 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVN-GKLTEKWRNFEP--QHSVFKKLNFESILT 233 Q +++ EQ+ ++ +A+ + W EP +H+ + K Sbjct: 170 RQAQDLRSQKEQVLKLSAELAKALFEQHFGIAGASSAW-PMEPFGKHTTYSKYG------ 222 Query: 234 ELRNGLSSKPNE--SGVGHPILRISSVRAGHVDQNDIRFLEC-----SESELNRHKLQDG 286 P++ S G ILR + + + IR+ E +E ++ H L+ G Sbjct: 223 ------PRFPDQQYSDSGIHILRTTDMN----NDGTIRWWEAPKLALTEGQIQEHALKPG 272 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 L+ +R +G+ +G L Q + LI L PEY+ F++P + Sbjct: 
273 TLVVSR-SGT---IGPFALFDG-QEGRCVAGAYLIEFGLADSVQPEYVRALFATPYVQQM 327 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + V++ + Q I+ +I+S + +PP++ Q +++Q+ A+ I K + +++ Sbjct: 328 LKKAVRSVA-QPNINAPNIQSIKIPVPPLEIQEAFAVQIKQVRAWTSEIVK----SASKI 382 Query: 407 NNLTQSILAKAFRGELTAQWRA 428 + + ++++ +AF GELTAQWR Sbjct: 383 DEVIRAVVGEAFSGELTAQWRG 404 >UniRef50_C3NN82 Restriction modification system DNA specificity domain protein n=1 Tax=Sulfolobus islandicus Y.N.15.51 RepID=C3NN82_SULIN Length = 576 Score = 72.8 bits (177), Expect = 3e-11, Method: Compositional matrix adjust. Identities = 97/450 (21%), Positives = 195/450 (43%), Gaps = 43/450 (9%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKF-DTTD 58 + G+ P+ W + + V + + Y + +P + +I ++GK+ T+ Sbjct: 9 IDIGEFPKDWDVRKLKDVIIKAKSGGTPRRNVPEYWNGN-IPFAKIQDITKSGKYLYNTE 67 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 K L + I P+D ++ GS +G A +P + A G++ + +I Sbjct: 68 EFITEKGLENSNAWIVPKDSLLLTIYGS---LGFVAINKIPVATN-QAIIGIIPNKNIID 123 Query: 119 SGFIAHFTKSSLYRNKISS--LSAGANINNIKPASFDLI---NIPIPPLAEQKIIAEKLD 173 + F+ ++ LY S + G N + +++ ++PI PL EQK I E L Sbjct: 124 TEFLYYW---YLYFKPYWSKFIKKGTQPN----LTLEIVLNSSVPILPLEEQKKIVELLQ 176 Query: 174 -------TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 TL + + E I +++++ + + G + E P+ ++L Sbjct: 177 KATDIYYTLKDYIIQIRNSTETITKVIRK--ELLTKGIGHRDYVETDIGEFPKDWEVRRL 234 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-----ECSES-ELNR 280 N +I+ R+G S + + ++ +R ++D R + ES ++ R Sbjct: 235 NEIAII---RSGFSERKRDENS-----KVIHLRPDNIDNETDRIVFHRIVYIPESPKIER 286 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFS 339 + L+ D++ NGS++ +G G++ +Q + + + L R+ +KD P YI S Sbjct: 287 YLLRHLDIVLVNTNGSIDHIGKLGIIDMPLNQKITFSNHLTAIRIVSKDVEPYYIYYLLS 346 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 + VK +G+ ++ I++ ++ LPP++EQ +IV ++++ + Sbjct: 347 
WYHLNGSFKKVVKNQAGKWNLNLDTIRNLLIPLPPLEEQKKIVELLQKVDELIIRFNDFL 406 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAE 429 N N L +SIL A G+LT WR + Sbjct: 407 QNLEDEANTLYKSILRLALTGKLTEDWRRQ 436 >UniRef50_Q3AQE4 Restriction endonuclease S subunits-like n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3AQE4_CHLCH Length = 386 Score = 72.0 bits (175), Expect = 4e-11, Method: Compositional matrix adjust. Identities = 67/268 (25%), Positives = 135/268 (50%), Gaps = 18/268 (6%) Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF 216 IP+PPL +QK IA LL +V+ A+ +Q Q L + ++V + G + + N+ Sbjct: 122 IPLPPLDDQKRIAH----LLGKVERLIAQRKQHLQQLDQLLKSVFL-EMFGFFDKTYTNW 176 Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES 276 ++ + I++ + G K +E + P +R+++V+ H ++I+ + +++ Sbjct: 177 ----TIDTLTSHTEIVSGITKGKKYKTDEL-IEVPYMRVANVQDEHFVLDEIKTISVTKN 231 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL--PEYI 334 E+ +++L GDLL T G + +G G + + Q +N ++ + + R R+ + P+Y+ Sbjct: 232 EIKQYRLLAGDLLLTE-GGDPDKLG-RGAVWQNQIENCIHQNHIFRVRVNDKSRINPDYL 289 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 SP ++ K T+G I+ +K +++PP++ Q VE++ + Sbjct: 290 SALIGSPYGKSYFFRSAKQTTGIASINSTQLKKFPIVIPPIELQNRFATIVEKVESIKTH 349 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGEL 422 ++ +NN N L+Q KAF+GEL Sbjct: 350 YQQSLNNLETLYNALSQ----KAFKGEL 373 >UniRef50_A1BGA0 Restriction modification system DNA specificity domain n=1 Tax=Chlorobium phaeobacteroides DSM 266 RepID=A1BGA0_CHLPD Length = 557 Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust. 
Identities = 104/487 (21%), Positives = 183/487 (37%), Gaps = 93/487 (19%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P W+ + G T K + +D I +N+ GKF+ + V + Sbjct: 87 IPSSWIWVRFGDIARHNSGKTLDKGRNTGESRD----YITTSNLYWGKFELEN---VRQM 139 Query: 66 LVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L++E + K + + + + G ++ G++A E F R K I F+ Sbjct: 140 LIREDELEKCTAKKDDLLICEGGEA--GRAAMWPFDSEVCFQNHIHRARFYKDIDPYFVY 197 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS-- 181 F + +I+ G I+N+ S I P+PP +EQ I ++D L+A+ + Sbjct: 198 RFFEKLSATGEINQHRKGVGISNMSSKSLASIVFPLPPFSEQHRIVARIDQLMARCNELE 257 Query: 182 --TKARFEQ---------------------------------IPQILKRFRQAVLGGAVN 206 K R E+ + + + R+A+L AV Sbjct: 258 KLRKEREEKRLIVHAAAIKQLFDAPDGSAWGFIQQHFNELYSVKENVAELRKAILQLAVM 317 Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTE-----------------------------LRN 237 G+L + +N P + K++ E E +R Sbjct: 318 GRLVPQDQNDPPASELLKEIEKEKASHECTKSRRKGEKLPEIFNEEMPHKIPSNWAWVRF 377 Query: 238 GLSSKPNES-------GVGHP--ILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 G ++ N G P + S++ G + ++R + E EL + + DL Sbjct: 378 GDIAQHNSGKTLDKGRNTGQPREYITTSNLYRGRFELENVRQMLIREDELEKCTAKKDDL 437 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 L G V ++ QN ++ RAR KD P + FF SA + Sbjct: 438 LICE-GGEAGRAAVWPFDSEVCFQNHIH-----RARFYKDIDPYFAYRFFEKLSA-TGEI 490 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 N + G +S K + S V LPP EQ IV R +QL D +++Q+++A+ + Sbjct: 491 NQHRKGVGISNMSSKALASIVFPLPPQPEQHRIVARTDQLMTLCDQLDQQIDDAVGKQTE 550 Query: 409 LTQSILA 415 + ++LA Sbjct: 551 ILNAVLA 557 >UniRef50_Q1K3D0 Restriction modification system DNA specificity domain n=7 Tax=Bacteria RepID=Q1K3D0_DESAC Length = 417 Score = 72.0 bits (175), Expect = 5e-11, Method: Compositional matrix adjust. 
Identities = 93/420 (22%), Positives = 176/420 (41%), Gaps = 33/420 (7%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 GW P+ + T IR + Y K+ Y L ++NN++NGK + +F+ + Sbjct: 20 GWTENPLGEIYTKIRNA-FVGTATPYYTKNGYFYL-QSNNVKNGKINRKTEIFIDEEFYF 77 Query: 69 ESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL---RPEKLIFSGFIA 123 + +K + DIV+ S VG +A +P E + A ++ +P K ++ Sbjct: 78 KQEKNWLRTNDIVMVQSGH----VGHTAV--IPNELNNSAAHALIIISKPLKKSCPYYLN 131 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + ++ + I +++ G I +I N+ PP EQ K+ T ++D Sbjct: 132 FYFQTYRAKQDIGNITTGNTIKHILATDIKRFNVFFPPYEEQ----TKIGTYFKKLDRII 187 Query: 184 ARFEQIPQILKRFRQAVLGGAV--NGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 ++ L +QA+L +G T + R F ++K + GL++ Sbjct: 188 ELHQRKHDKLVTLKQAMLQKMFPQDGASTPEIR-FNGFEGDWEKKKLRDVCNSFDYGLNA 246 Query: 242 KPNESGVGHPILRISSVR--AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 + + +RI+ + + Q D+ E + L +GD+LF R S Sbjct: 247 AAKKYDGRNKYIRITDIDEFSRCFSQTDLTSPEADLPSSQNYLLCEGDILFARTGAS--- 303 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR-NAMMNCVKTTSGQK 358 VG L +++ + + + LIRAR++ ++I F+++ S+ + SGQ Sbjct: 304 VGKTYLYREIDGR-VFFAGFLIRARVSNTESTDFI--FYTTLSSNYENFVTITSQRSGQP 360 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 GI+ K+ L+P V EQ +++ F D + Q L ++ + + L K F Sbjct: 361 GINAKEYSEYTFLVPSVTEQ----KKIGTYFRKFDALISQHATQLKKLKQIKSACLGKMF 416 >UniRef50_Q21ZK2 Restriction modification system DNA specificity domain n=4 Tax=Bacteria RepID=Q21ZK2_RHOFD Length = 397 Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 91/402 (22%), Positives = 179/402 (44%), Gaps = 25/402 (6%) Query: 24 GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPED-IVIAM 82 G T + Q Y + +P I++ ++ + + L + S K+ P I++AM Sbjct: 18 GGTPSRAQMERYYEGGTIPWIKSGELRETVINGAEEHVTDVALKESSIKLVPAGAILLAM 77 Query: 83 SSGSKSVVGKSAHQHLPFECSFG-AFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAG 141 + +G L E + A C ++ ++ + ++ H S + + S+ G Sbjct: 78 YGATVGRLGI-----LGIEATTNQAVCHIIPDPRIAVTRYVYHALSSQV--PSLISMGVG 130 Query: 142 ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVL 201 NI + IP+P EQ+ IA LD Q D+ +A+ + L Q++ Sbjct: 131 GAQPNINQGIIKNLAIPLPAKPEQRRIAAILD----QADALRAKRREALAQLDSLTQSIF 186 Query: 202 GGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAG 261 + G + + ++ + N S +T+ RN L+ K + P L +++V+ Sbjct: 187 I-QMFGDPVSNPKGWPDATTLGQVANIASGVTKGRN-LTGKVTRT---IPYLAVANVQDK 241 Query: 262 HVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI 321 ++ + ++ ++ +E E+ R+ L+ DLL T G + +G G L K + ++ + + Sbjct: 242 SLNLSAVKEIDATEDEIERYLLKWNDLLLTE-GGDPDKLG-RGTLWKNELPECIHQNHIF 299 Query: 322 RARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 R R+T A+ P ++ S + + K T+G I+ ++S +LLPPV+ Q + Sbjct: 300 RVRVTSQAVTPLFLNWLVGSQRGKKYFLRSAKQTTGIASINMTQLRSFPLLLPPVELQRD 359 Query: 381 IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + ++ A I + +LA + L S+ +AFRGEL Sbjct: 360 F-ETIAEVVAEQHAIH---SVSLAELEALFVSLQHRAFRGEL 397 >UniRef50_C7XC38 Putative uncharacterized protein n=1 Tax=Parabacteroides sp. D13 RepID=C7XC38_9PORP Length = 369 Score = 71.2 bits (173), Expect = 7e-11, Method: Compositional matrix adjust. 
Identities = 54/182 (29%), Positives = 94/182 (51%), Gaps = 16/182 (8%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+GW +S + G+TYK EQ + DD ++R+ NIQ+GK +D+V V Sbjct: 185 PKGWPTKRLSELAEYSIGLTYKPEQ----ICDDGTIVLRSGNIQDGKISFSDIVRV-NAP 239 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 +KES + +DI++ +GS S+VGK A + +FGAF ++R + + ++ + Sbjct: 240 IKESLFVKEDDILMCSRNGSASLVGKVAMIPDINEPMTFGAFMTIIRSAE---AKYLYLY 296 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 +S +R ++S + +N I D + +P P K + E L + +Q D K++ Sbjct: 297 FQSQDFRERVSE-GKSSTMNQITQKMLDKVEVPFP----DKDVRETLSAIASQAD--KSK 349 Query: 186 FE 187 FE Sbjct: 350 FE 351 >UniRef50_B7R237 Type I restriction modification system, subunit S n=1 Tax=Thermococcus sp. AM4 RepID=B7R237_9EURY Length = 428 Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust. Identities = 70/296 (23%), Positives = 131/296 (44%), Gaps = 31/296 (10%) Query: 101 ECSFGAFCGVLRP--EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIP 158 E +F C L P + I F A++ K R + SLS G+ + A + +P Sbjct: 108 EATFNQGCKGLVPKDQNKIIPEFYAYYFK--FKRQHLESLSGGSTFKELAKAMLERFLVP 165 Query: 159 IPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVN---------GKL 209 +PP EQK IAE L T+ ++ T E+ ++ K +L + G++ Sbjct: 166 LPPRLEQKKIAEILRTVDEAIEKTDLAIEKTERLKKGLMLRLLTKGIKHERFKKTEIGEI 225 Query: 210 TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRA---GHVDQN 266 E+WR + E I ++ G S K +++ G ++ ++S G+++ + Sbjct: 226 PEEWRV----------VRLEEITRRIKRGPSKKTDDNETG--VVYVTSDYIDDHGNLNFD 273 Query: 267 DIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 + ++L + L+++ L++GDL+ N SLE +G + + + ++ + L Sbjct: 274 NPKYLSLEKIDRLDKYLLEEGDLIINCVN-SLEKIGKVAVFEGYSKKAIVGFNNFALT-L 331 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 P Y++ FF S + + + K Q S KD+ + LPP+ EQ +I Sbjct: 332 VSTVNPYYVKYFFLSYKGKALIKSISKAAVQQVSFSSKDLLRLKIPLPPLPEQKQI 387 Score = 42.4 bits (98), Expect = 0.040, Method: Compositional 
matrix adjust. Identities = 52/220 (23%), Positives = 91/220 (41%), Gaps = 40/220 (18%) Query: 4 GKLPEGWVIAPVSTVTTLIR------------GVTYKKEQAI------NYLKDDYLPLIR 45 G++PE W + + +T I+ GV Y I N+ YL L + Sbjct: 223 GEIPEEWRVVRLEEITRRIKRGPSKKTDDNETGVVYVTSDYIDDHGNLNFDNPKYLSLEK 282 Query: 46 ANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFG 105 + + + DL+ N V +KI + + K++VG F Sbjct: 283 IDRLDKYLLEEGDLII---NCVNSLEKIG--KVAVFEGYSKKAIVG------------FN 325 Query: 106 AFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLI--NIPIPPLA 163 F L + ++ +F S + I S+S A + + +S DL+ IP+PPL Sbjct: 326 NFALTLVST--VNPYYVKYFFLSYKGKALIKSISKAA-VQQVSFSSKDLLRLKIPLPPLP 382 Query: 164 EQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGG 203 EQK IAE L T+ +++ + R E++ + + + +L G Sbjct: 383 EQKQIAEILSTVDKKLELLRKRREKLELVKRGLMKGLLTG 422 >UniRef50_C7RNT4 Restriction endonuclease S subunits-like protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNT4_9PROT Length = 403 Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust. 
Identities = 96/418 (22%), Positives = 179/418 (42%), Gaps = 33/418 (7%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV-KESQKI 73 +S V IRG+T+K E + +R N+Q + D D+ +P++ V +E Q + Sbjct: 9 LSDVAAFIRGITFKPEDVVPVDTPGAAACMRTKNVQT-ELDLCDVWGIPQSFVRREDQYL 67 Query: 74 SPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLR--PEKL----IFSGFIAHFT 126 P D++++ S+ S ++VGK LP+ +FG F VLR P K+ +F F + T Sbjct: 68 IPGDVLVS-SANSWNLVGKCCLVPSLPWRSTFGGFISVLRANPAKVDPRYLFRWFASDRT 126 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 ++++ S NI+N+ + + +P L EQ+ IAE LD A +A Sbjct: 127 QATVR----SFGQQTTNISNLNVGRCLKLKLHLPALPEQRRIAEILDKADALRAKRRAAL 182 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 Q+ + + + G + F F S L S S Sbjct: 183 AQLDALTQSIFLDMFGDPATNPKGWPCAQLCTLGTKFSDGPFGSNLK------SDHYRAS 236 Query: 247 GVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 GV ++R+ ++ G D ++ E L +H+ GD+L G+L + Sbjct: 237 GVR--VVRLQNIGVGEFLGADAAYISEDHFRNLKKHECLPGDVLV----GTLGDPNLRAC 290 Query: 306 LKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 ++ L ++ R + A E++ + P + + + + + IS Sbjct: 291 IQPRWLSVALNKADCVQIRPDERTATSEFVCFLLNQPGTQRMAQDLMHGQTRIR-ISMGR 349 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++S + +PP+ Q + ++V A +T++ ALA+++ L S+ +AF G+L Sbjct: 350 LRSLAIPVPPIGLQRDFTQQV----AAMETLKTAHRAALAQLDALFASLQHRAFLGDL 403 >UniRef50_Q6F778 Putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) n=1 Tax=Acinetobacter sp. ADP1 RepID=Q6F778_ACIAD Length = 448 Score = 70.9 bits (172), Expect = 9e-11, Method: Compositional matrix adjust. 
Identities = 94/372 (25%), Positives = 162/372 (43%), Gaps = 50/372 (13%) Query: 77 DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL---RPEKLIFS-GFIAHFTKSSLYR 132 DI++A S + VGKS + H + + + G L R K + FI F +S Y Sbjct: 90 DILLARSGAT---VGKS-YLHKKDKVNVACYAGYLIRARFNKENYDPQFINLFLQSKAYW 145 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 + I S++ A I N+ ++ + + IP LAEQKIIA+ LD LAQVD+ A+ E + + Sbjct: 146 SWIESVNIQATIQNVSAEKYNDLALSIPSLAEQKIIADFLDKRLAQVDALIAKQETLLEK 205 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFE---------PQHSVFKKLNFESILTE-LRNGLSSK 242 L R A++ AV L E P K+L F +L+E L+ G + Sbjct: 206 LAEQRVALISHAVTKGLNPDVEMKESDVVLLGNIPNTWNIKRLKF--LLSEKLKYGANES 263 Query: 243 PNESGVGHP-ILRISSV-RAGHVDQNDIRFLECSESE------LNRHKLQDGDLLFTRYN 294 +P +RI+ + +G++ + LE +++ L+ + G + Y Sbjct: 264 AESEDKENPRYIRITDIDDSGNLKDETFKSLESEKAQEYLLDDLDILLARSGATVGKSYL 323 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKT 353 E VG+ Y LIRARL ++ PE++ F S + ++ + Sbjct: 324 YKAESVGIA-----------CYAGYLIRARLDQENYNPEFVNYFLQSKQYWD-WISSINI 371 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + + +S + + +P ++EQ +QL Y +++ N A+++ L Sbjct: 372 QATIQNVSAEKYNDLTLAIPSLEEQ-------KQLIEYLKNEDEKFNRAISKGKKLVH-- 422 Query: 414 LAKAFRGELTAQ 425 L +R L Q Sbjct: 423 LLNEYRSTLITQ 434 Score = 45.4 bits (106), Expect = 0.004, Method: Compositional matrix adjust. Identities = 23/89 (25%), Positives = 47/89 (52%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ +F +S Y + ISS++ A I N+ ++ + + IP L EQK + E L + + Sbjct: 352 FVNYFLQSKQYWDWISSINIQATIQNVSAEKYNDLTLAIPSLEEQKQLIEYLKNEDEKFN 411 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++ +++ +L +R ++ V GK+ Sbjct: 412 RAISKGKKLVHLLNEYRSTLITQVVTGKI 440 >UniRef50_A0ZMI3 Putative uncharacterized protein n=1 Tax=Nodularia spumigena CCY9414 RepID=A0ZMI3_NODSP Length = 437 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 99/439 (22%), Positives = 178/439 (40%), Gaps = 46/439 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W I S G KD +PL+R +N++ G D ++ Sbjct: 16 GDIPEHWEIVRFSNFINFQEGPGIMAAD----FKDYGVPLLRIHNLKPGFVDLERCNYLE 71 Query: 64 KNLVKESQK---ISPEDIVIAMS--SGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-- 116 V+++ K ++ +DI+I+ S +G S+V K A + A+ G++R + Sbjct: 72 PQKVEKTWKHFKLNEDDILISCSASTGLVSIVDKKAEGSI-------AYTGIIRLKPANS 124 Query: 117 -IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I FI S L+ +I L G I + P I I PPL EQK IA LD+ Sbjct: 125 NICREFIKIIVASELFFTQIELLKTGTTIQHYGPTHLRQIKITFPPLYEQKKIACFLDSK 184 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQH-SVFKK 225 L ++D + +++ ++LK + A++ AV L +W P H V + Sbjct: 185 LEEIDKFISNKQRLIELLKEQKTAIINRAVTKGLNPHAPMKPSGIEWLGDIPAHWEVTRA 244 Query: 226 LNFESILTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 + + RN KPN + +G P + + + + + ++ +L +N Sbjct: 245 KHISYVFVPQRN----KPNLNLNIGFPWITMEDITSPSISKSTFGYLVSEIDAMNA---- 296 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS-SPSA 343 G L + VG GL Q ++ ++ ++A + P Y+ S S Sbjct: 297 -GSKLLPEGSVIASCVGNFGLSSVNTLQVII--NQQLQAYIPIKINPYYLRYLIGISKSY 353 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + N TT +G ++LPP EQ IVR +++ D + + Sbjct: 354 FEQIANA--TTLAYVNQAG--FAELPIILPPNDEQLAIVRNIDKELTTIDKAITTIEKEI 409 Query: 404 ARVNNLTQSILAKAFRGEL 422 + +++++A G++ Sbjct: 410 ELIKEYRTTLISEAVTGKI 428 >UniRef50_D0C390 Type I restriction-modification system specificity determinant n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0C390_9GAMM Length = 461 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 95/374 (25%), Positives = 157/374 (41%), Gaps = 61/374 (16%) Query: 38 DDYL----PLIRANNIQNGKFDTTDLVFVPKNLVKESQKIS-------PEDIVIAMSS-- 84 DDYL PLI+ NNI++GK ++ F+ +N +KI P+DIVIA + Sbjct: 49 DDYLDEGIPLIQLNNIRDGKHILRNMKFISQN-----KKIDLIRHLALPQDIVIAKMAEP 103 Query: 85 -GSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRNKISSLSAGA 142 +VV +++ A C L P+ +L+ F+ S R +S G Sbjct: 104 VARAAVVSDEYDEYV-----IVADCVKLSPDLELVDLNFLIWAINSDCVRENAELVSTGT 158 Query: 143 NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLG 202 I + +P P L+EQ I + LD A++D+ A+ E++ +LK RQAV+ Sbjct: 159 TRIRINLGELKKLKVPYPSLSEQVKIRQYLDHETAKIDTLIAKQEELIALLKEKRQAVIS 218 Query: 203 GAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPIL 253 AV L +W P+H K + ++++ G S +P G P L Sbjct: 219 HAVTKGLNPNVPMKDSGVEWLGEVPEHWTVSKFGY---ISQVVRGGSPRP----AGDPAL 271 Query: 254 RISS----VRAGHVDQNDIRFLECSESELNRHK------LQDGDLLFTRYNGSLEFVGVC 303 V + ++D +L +E+ L + Q G LL + +L GV Sbjct: 272 FNGDYSPWVTVAEITKDDELYLTSTETFLTKKGSEQCRVFQSGTLLLSNSGATL---GVP 328 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 +L + N D ++ K + EY + S + N + VK SGQ ++ Sbjct: 329 KILSINANAN----DGVVGFEDLKIDI-EYAYFYLSILT--NDLRERVKQGSGQPNLNTD 381 Query: 364 DIKSQVVLLPPVKE 377 +K+ + +PP E Sbjct: 382 IVKAIPIAIPPENE 395 >UniRef50_A8YFX5 HsdS protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFX5_MICAE Length = 406 Score = 70.9 bits (172), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 94/405 (23%), Positives = 171/405 (42%), Gaps = 49/405 (12%) Query: 27 YKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ--KISPEDIVIAMSS 84 +K + N +KDD+ Q G + F+ ++ +E + + P D++I+ + Sbjct: 42 FKVYEQQNAIKDDF---------QIGNY------FIDEDKFREMEGFNVKPHDLIISCAG 86 Query: 85 GSKSVVGKSAHQHLPFECSFGAFCGVL---RPE-KLIFSGFIAHFTKSSLYRNKISSLSA 140 +GK A +P+E G L RP ++I ++ +S Y+ I SA Sbjct: 87 ----TIGKVAI--VPYEALPGVINQALMRIRPNPEIILCRYLKWLLESPKYQRDIFGKSA 140 Query: 141 GANINNIKPAS-FDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQA 199 G+ + N+ S IP+PPL EQ+ IA LD K ++L+ Sbjct: 141 GSALKNLAAISEIKKCKIPLPPLEEQRRIAAILDKADGVRRKRKEAIRLTEELLRSTFLE 200 Query: 200 VLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR 259 + G V P+ KL ++ + NG+ K +E G P++ + + Sbjct: 201 MFGDPVTN----------PKGWEIVKLG-SLVVGQPNNGIFKKNHEYGGDTPVVWVKELF 249 Query: 260 AGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD 318 +G+ +D ++ R L ++ E+ + L GD+LF R + + + +G + + + L+ Sbjct: 250 SGYTIDCSESRTLTPTDEEVKKFGLTKGDILFCRSSLNRDGIGFNNVFDGMDF-SALFEC 308 Query: 319 KLIRARLTKDALPE-YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 +IR RL + + ++ P R ++ T + I +IK LPP Sbjct: 309 HIIRVRLNQKKVNSIFLNYLLHFPGLRKQIIAKANTVT-MSTIGQSEIKKIEFYLPP--- 364 Query: 378 QAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 E+ + E T ++ N + NL S+L +AFRGEL Sbjct: 365 -KELQDKFEIFLRKIATNRTKLENKESE--NLFNSLLQRAFRGEL 406 >UniRef50_Q8TN78 Type I restriction modification enzyme protein S n=1 Tax=Methanosarcina acetivorans RepID=Q8TN78_METAC Length = 391 Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 91/423 (21%), Positives = 174/423 (41%), Gaps = 45/423 (10%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W P+ ++ T+I G T K + Y D +P + + D TD + + E Sbjct: 4 WPHQPIISLGTIITGSTPKTSEEHFYGGD--IPFVTP-----AELDQTDPIMNAARTLSE 56 Query: 70 S----QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + ++ PE V+ GS VG + S V+ K+I+ F F Sbjct: 57 TGSQESRLLPEGTVMVCCIGSLGKVGIAGRT----VASNQQINSVIFDPKIIWPRF--GF 110 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 L ++++ L+ + + + F + IP+PPL EQK IA+ LD A + Sbjct: 111 YACRLLKSRLEVLAPATTVPIVNKSKFGQLEIPVPPLPEQKRIADILDRAEALRAKRRVA 170 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV--FKKLNFESILTELRNGLSSKP 243 E + ++ + + G +V+ + W+ + +H V + F S+L K Sbjct: 171 LEHLDELTQAIFIDMFGDSVSNPM--GWKRYPLKHCVNHIQIGPFGSLL--------HKE 220 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGV 302 + G P++ + + G + + + + + +EL ++LQ GD++ R +G Sbjct: 221 DYVFGGIPLINPTHIENGKIVPDVNQSITVQKLAELQLYQLQQGDVIMGRRGE----MGR 276 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC---VKTTSGQKG 359 C ++ + L L A+ Y++ SS S R + + +G Sbjct: 277 CAIVGSEHNGTLCGTGSLFIRPDESKAIAMYLQATLSSESMRKHLEGFSLGATLPNLNRG 336 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 I G+ + LPP++ Q E +E + + ++ ++L ++ L S+ +AFR Sbjct: 337 IVGE----LAISLPPIELQKEFSHHIESI----EKLKTTYKSSLTEIDELFLSLQYRAFR 388 Query: 420 GEL 422 GEL Sbjct: 389 GEL 391 >UniRef50_Q8GN10 Putative type I specificity subunit HsdS n=3 Tax=Campylobacter jejuni RepID=Q8GN10_CAMJE Length = 420 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. 
Identities = 96/387 (24%), Positives = 153/387 (39%), Gaps = 71/387 (18%) Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL----IFSGFIAH 124 E+ K + D++I+ S +V LP + G L +L I + + + Sbjct: 72 EAFKATEGDLLISCSGTLGKIV------ELPKDTEMGIINQSLLKIRLNNIKILNSYFIY 125 Query: 125 FTKSSLYRNKISSLSAGANINNIKPAS-FDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + S + + KI + G+ I NI I IP+PPL +Q+ I LD ++D + Sbjct: 126 YFNSPIMQEKILESTLGSAIKNIASVKILKQIEIPLPPLKKQERIVGILDESFVKIDESI 185 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKL----NFESILTELRNG 238 EQ L Q+ L A N N++ PQ +K L N S T LRN Sbjct: 186 KILEQNLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEIGNTSSGGTPLRNK 245 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLE--CSESELNRHK---LQDGDLLFTRY 293 N S I +++G ++ I F+E +E + Q G LL Y Sbjct: 246 KEYWENGS--------IKWLKSGELNDGYIDFIEENITEEAIENSSAKIFQKGTLLIAMY 297 Query: 294 NGSLEFVG-----------VCGLLKKLQHQNLLYPDKL-------IRARLTKDALPEYIE 335 + +G VC L K ++N+ + +K IR ++ KD+ Sbjct: 298 GATAGRLGILNLDSATNQAVCAFLHK-DNKNIKFLEKFLFYFLFFIRDKIIKDSF----- 351 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 Q IS IK+ + LPP+KEQ +I + ++ +F + Sbjct: 352 ------------------GGAQPNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTKAL 393 Query: 396 EKQVNNALARVNNLTQSILAKAFRGEL 422 ++ L L QS+L KAF+GEL Sbjct: 394 KELYTKELKDYEELKQSLLNKAFKGEL 420 >UniRef50_A5KSM3 Restriction modification system DNA specificity domain n=1 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KSM3_9BACT Length = 200 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 56/199 (28%), Positives = 104/199 (52%), Gaps = 14/199 (7%) Query: 232 LTELRNGLSSKPNESG--VGH-PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 + E++ G++ G +G P LR+++V+ G++ ++I+ + + EL ++ L +GD+ Sbjct: 7 IAEIKGGITKGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKYSLMNGDI 66 Query: 289 LFTRYNGSLEFVGVC----GLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSA 343 LFT G + +G G ++ HQN ++ RAR+ + +PEYI + A Sbjct: 67 LFTE-GGDKDKLGRGTIWHGEIELCIHQNHIF-----RARVDSGQFVPEYISYATKTTRA 120 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 R+ ++ K T+ ++ +K+ + P+ +Q EIV + + + K++ A Sbjct: 121 RDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVAH 180 Query: 404 ARVNNLTQSILAKAFRGEL 422 R L QSILAKAF+GEL Sbjct: 181 HRSKALRQSILAKAFKGEL 199 >UniRef50_A8YCA1 HsdS protein n=1 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YCA1_MICAE Length = 510 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 38/108 (35%), Positives = 58/108 (53%) Query: 108 CGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKI 167 C + I S ++ + S R ++ +G+ I +F I PI PL EQ Sbjct: 122 CIIRNNSNFINSQYLLYLINSPQTRLEVDKYKSGSTRKRISRKNFAKIQFPIAPLPEQHR 181 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN 215 I EK++ L +++D+ A +++ + LK +RQAVL A GKLTEKWRN Sbjct: 182 IVEKIEELFSELDNGVASLKKVLEQLKTYRQAVLKWAFEGKLTEKWRN 229 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 61/191 (31%), Positives = 92/191 (48%), Gaps = 12/191 (6%) Query: 264 DQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGL--LKKLQHQNLLYPDKL 320 DQ+D RFL +S ELN LQ GD+L R L + L +KK + D Sbjct: 68 DQSD-RFLTFKKSIELNCTYLQKGDILVARLPDPLGRACIFPLSGIKKF----VTVVDVC 122 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 I + +Y+ +SP R ++ K+ S +K IS K+ + P+ EQ Sbjct: 123 IIRNNSNFINSQYLLYLINSPQTR-LEVDKYKSGSTRKRISRKNFAKIQFPIAPLPEQHR 181 Query: 381 IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSA 440 IV ++E+LF+ D + L ++ Q++L AF G+LT +WR + D + A Sbjct: 182 IVEKIEELFSELDNGVASLKKVLEQLKTYRQAVLKWAFEGKLTEKWRNTHQDSLE---DA 238 Query: 441 AALLEKIKAER 451 LLE+IKAER Sbjct: 239 DTLLEQIKAER 249 Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust. Identities = 53/219 (24%), Positives = 93/219 (42%), Gaps = 23/219 (10%) Query: 6 LPEGWVIAPVSTVTTLIR-GVTY--------KKEQAINYLKDDYLPLIRANNIQNGKFDT 56 LP+GW+ V + +L + G+T K E I+ + P++ NI NG F Sbjct: 300 LPDGWMWVKVDYLLSLDKKGMTTGPFGTLLKKSEHQISGI-----PVLGIENIGNGVFLP 354 Query: 57 TDLVFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGV---L 111 + +F+ + +E S ++S DI+I+ S VG+ F S + + L Sbjct: 355 KNKIFITEKKARELSSFEVSGGDIIISRSG----TVGEICLVPDYFGYSLISTNLIRISL 410 Query: 112 RPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 +I F+ F R ++ L G+ + + I P P L EQ I ++ Sbjct: 411 NKNIIIPKFFVFLFLGGGSVREQVKELCKGSTRDFLNQTILQTIIFPFPSLQEQTQIVQE 470 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +++ L+ D +A + + RQ++L A GKL Sbjct: 471 IESRLSVCDQLEATLTENLDKAEALRQSILKRAFEGKLV 509 >UniRef50_Q1VSP4 Restriction endonuclease S subunits n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VSP4_9FLAO Length = 574 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 111/512 (21%), Positives = 196/512 (38%), Gaps = 115/512 (22%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTDLV 60 A LP GW+ + V G T K+ NY +Y+P I +I N ++ T L Sbjct: 82 AYDLPNGWIWSRVRDSGFTQTGSTPPKKNPENY--GNYIPFIGPGDISNKLMRYPTEGL- 138 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS--FGAFCGVLRPEKLIF 118 L ++ PED ++ + G +GK + C+ +L P Sbjct: 139 ---SELGISVGRLIPEDSLMMVCIGGS--IGKCNINEIDVSCNQQINTITPILIP----- 188 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + +I +S +++ + S+G+ I ++ + IPIPPL EQK I + ++ L + Sbjct: 189 TIYIKAVCQSPFFQSNVLDKSSGSATPIINKGKWESLPIPIPPLEEQKEIVKVVEILFKE 248 Query: 179 VD------------------------STKARFEQIPQI-------------LKRFRQAVL 201 ++ ST E + +K+ R+ VL Sbjct: 249 IEQLEQLTSERIALKEDFVTSVLNQLSTNTTKENWTYLQAHFKPFFNETTNIKKLRETVL 308 Query: 202 GGAVNGKLTEKWRNFEPQ------HS------------------------VFKKLNFESI 231 AV GKLT WR P+ H+ V + + I Sbjct: 309 QLAVQGKLTADWRTCHPELAEGSHHASELLKRIQEEKAQLVKDKKIKKEKVLPAITEDEI 368 Query: 232 LTEL-------RNGLSSK---------PNESGVGHPILRISSVRAGHVDQNDIRFLE--C 273 EL R G +SK P G +L +VR G ++ + ++ Sbjct: 369 PYELPVGWVWCRLGDASKQITDGEHQTPPRIASGRKLLSAKNVRDGFINYENCDYISEIH 428 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR--ARLTKDAL- 330 + + R + GDLL G+ +G ++ K N+ + L+R A + L Sbjct: 429 YQKSIKRCNPEIGDLLIVSVGGT---IGRVSMVTK----NISF--ALVRSVAMVKNQGLE 479 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 P+Y+ +SP ++ ++ K Q + +IK + P++EQ IV +V L Sbjct: 480 PDYLRWVMNSPLLKD-IIESKKRGGAQPCLYLGEIKDFTFPIAPLEEQKAIVEKVNALME 538 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 D +E++V ++ + L +S L + F G++ Sbjct: 539 LCDGLEQEVRHSQEQSELLMKSCLREVFEGKI 570 Score = 48.5 bits (114), Expect = 5e-04, Method: Compositional matrix adjust. 
Identities = 38/157 (24%), Positives = 71/157 (45%), Gaps = 38/157 (24%) Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 YI+ SP ++ +++ + S I+ +S + +PP++EQ EIV+ VE LF Sbjct: 191 YIKAVCQSPFFQSNVLD-KSSGSATPIINKGKWESLPIPIPPLEEQKEIVKVVEILFKEI 249 Query: 393 DTIEKQVNNALA-------------------------------------RVNNLTQSILA 415 + +E+ + +A + L +++L Sbjct: 250 EQLEQLTSERIALKEDFVTSVLNQLSTNTTKENWTYLQAHFKPFFNETTNIKKLRETVLQ 309 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 A +G+LTA WR +P+L G + A+ LL++I+ E+A Sbjct: 310 LAVQGKLTADWRTCHPELAEGSHHASELLKRIQEEKA 346 >UniRef50_Q307D8 Type I RM system S subunit n=1 Tax=Arthrospira platensis RepID=Q307D8_SPIPL Length = 392 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 64/242 (26%), Positives = 108/242 (44%), Gaps = 26/242 (10%) Query: 159 IPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE------- 211 IPPL EQK IA LD A++D +++ +L R+A++ AV L Sbjct: 124 IPPLGEQKAIAHYLDIETAKIDQLIKAKKRLLALLDEKRRALITHAVTRGLNPDVPMRDS 183 Query: 212 --KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR 269 +W P+H ++ L IL + G+S G +LR+ V G + +++ Sbjct: 184 GVEWIGEIPKH--WEILPLRRILQTMDYGISESVGSEG-NIAVLRMGDVDEGEISYDNVG 240 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP----DKLIRARL 325 F++ + +L L+ DLLF R N SL+ +G + + N L+P L+R R Sbjct: 241 FVDDVDHDL---ILKANDLLFNRTN-SLDKIGKVAIFR----NNFLFPVSFASYLVRMRC 292 Query: 326 TKDALPEYIEIFFSS-PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 +PEY+ +S P A N + GQ ++ + +PP++EQ I Sbjct: 293 NDSVIPEYLNYLLNSLPVLTWAKSNALPAI-GQVNLNPNRYSYIKIPIPPIEEQLNITEY 351 Query: 385 VE 386 ++ Sbjct: 352 IQ 353 Score = 47.4 bits (111), Expect = 0.001, Method: Compositional matrix adjust. 
Identities = 44/210 (20%), Positives = 96/210 (45%), Gaps = 13/210 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P+ W I P+ + ++ + Y +++ + + ++R ++ G+ ++ FV Sbjct: 189 GEIPKHWEILPLRRI---LQTMDYGISESVG--SEGNIAVLRMGDVDEGEISYDNVGFVD 243 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFCGVLRPEKLIFSGF 121 V + D++ ++ S +GK A + F SF ++ +R + + Sbjct: 244 D--VDHDLILKANDLLFNRTN-SLDKIGKVAIFRNNFLFPVSFASYLVRMRCNDSVIPEY 300 Query: 122 IAHFTKS--SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + + S L K ++L A +N + P + I IPIPP+ EQ I E + T ++ Sbjct: 301 LNYLLNSLPVLTWAKSNALPAIGQVN-LNPNRYSYIKIPIPPIEEQLNITEYIQTNTKKI 359 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 E+ ++L+ R +++ AV G++ Sbjct: 360 KKLCLSSEETIKLLQERRTSLITAAVTGQI 389 >UniRef50_A8RUN3 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RUN3_9CLOT Length = 375 Score = 70.1 bits (170), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 62/265 (23%), Positives = 117/265 (44%), Gaps = 21/265 (7%) Query: 159 IPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 IP L EQ+ I +++ L +++D + Q L +RQAVL A + T FEP Sbjct: 132 IPSLPEQERIVARIEELFSELDKAVETLKTTKQQLAVYRQAVLKEAFSCADT-----FEP 186 Query: 219 QHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL 278 F SI+T + K G+ +R +VR D +D+ + E+ Sbjct: 187 ---------FGSIMTSRLGKMLDKEKNVGLPEQYIRNINVRWFSFDLSDLLKMRIETKEI 237 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 ++ ++ GDL+ G C + + + ++ Y L R R P+ + +++ Sbjct: 238 EKYSIKYGDLIICEGGEP----GRCAVWDR--NDSIFYQKALHRVRFKNGENPK-LYMYY 290 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 ++ + T +G K ++G+ + V + + +Q +V ++E + + IEK Sbjct: 291 LWFISQTGELEKYFTGTGIKHLTGQSLLKVPVPIISISKQNTVVLKIESQLSVCNQIEKM 350 Query: 399 VNNALARVNNLTQSILAKAFRGELT 423 + +L + + QSIL +AF G L Sbjct: 351 IEQSLQQAEAMRQSILKQAFEGRLV 375 >UniRef50_Q4FUM9 Possible type I restriction-modification system, S subunit n=1 
Tax=Psychrobacter arcticus 273-4 RepID=Q4FUM9_PSYA2 Length = 457 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 98/448 (21%), Positives = 190/448 (42%), Gaps = 38/448 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD----- 58 GK+P W ++ + + + RG+T K L D +P + + + D Sbjct: 23 GKIPSHWELSKLRYMFSFGRGLTITKAD----LLDTGVPCVNYGEVHSKYGFEVDPKRHY 78 Query: 59 LVFVPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEK 115 L V + ++ S ++ D+V A +S G Q + + F + V+ RP Sbjct: 79 LKCVDEGYLQSSPYALLTQGDLVFADTSEDIEGSGNFT-QLVSDDLIFAGYHTVIARPFD 137 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 S F A+ S R ++ + G + +I + + I +P L E++ IA LD Sbjct: 138 RQCSRFYAYLMDSKEIRTQVRHMVKGVKVFSITQSILKGVRIWLPSLDERETIANFLDFE 197 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 AQ+D+ + + + Q+LK RQAV+ AV L +W P+H KL Sbjct: 198 TAQIDTLIDKQKTLIQLLKEKRQAVISHAVTKGLNPDAPLKDSGVEWLGEVPEHWGVSKL 257 Query: 227 NFESILTE-LRNGLSSKPNESGVGHP-ILRISSVRA-GHVDQNDIRFLECSESELNRHKL 283 + +++E L+ G + + P +RI+ V G++ + R L +E + L Sbjct: 258 KY--LISEPLQYGANEAAEDVDKTQPRFVRITDVLPNGNLKDDTFRSLPQEIAE--PYML 313 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF--SSP 341 DGD+L R G+ VG + + + LI+A++ ++ P E F+ + Sbjct: 314 MDGDVLLARSGGT---VGKSFIYRD-SWGKCCFAGYLIKAKIDEEITPA--EWFYLNTLT 367 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 + ++ + + +S S V+ +PP++E +I+ + DT+ + Sbjct: 368 DFYWKWIESIQIQATIQNVSADKYNSFVIAVPPLEESYKIISYINYNLEVFDTLVMKAEQ 427 Query: 402 ALARVNNLTQSILAKAFRGELTAQ-WRA 428 A+ + ++++ A G++ + W A Sbjct: 428 AIQLMQERRTALISAAVTGKIDVRGWVA 455 >UniRef50_D1XRZ5 Restriction modification system DNA specificity domain protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XRZ5_9ACTO Length = 412 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. 
Identities = 85/386 (22%), Positives = 166/386 (43%), Gaps = 45/386 (11%) Query: 41 LPLIRANNIQNGKF---DTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQH 97 P +R N+ G+ D ++ F P V + + P DI++ S +VG+SA Sbjct: 35 FPYLRVANVHLGRIEYVDVNEMGFTPAERV--TYGLKPGDILLNEGQ-SLELVGRSAI-- 89 Query: 98 LPFECSFGAFCGV-----LRPEKLIFSGF----IAHFTKSSLYRNKISSLSAGANINNIK 148 ++ + G FC RP I S + H+ +S ++ ++ A++ + Sbjct: 90 --YDRAEGEFCFQNTLIRFRPNGCILSAYAQVVFEHWLRSGVFAAIAKQTTSIAHLGGDR 147 Query: 149 PASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGK 208 F + P+ P Q+ I LD+L +A ++ + K G Sbjct: 148 ---FAALKFPLLPTGMQQRIVAVLDSLAELERRIEASIVKLRSVRK------------GI 192 Query: 209 LTEKWRNFEPQH-SVFKKLNFESILTELRNGLSSKPNESG---VGHPILRISSVRAGHVD 264 ++E++ + + S +L L ++ +GL+ SG + P LR+++V+ G + Sbjct: 193 ISEQFSRADVEDGSPASRLRALDSLADVGSGLTLGGISSGGTLLEVPYLRVANVQDGFIS 252 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 +++ + + S++ R +++ D+L T G + VG G + + L + + R R Sbjct: 253 TLEMKSVRVTPSDMERFRVRRDDVLVTE-GGDFDKVGR-GAVWDGRIDPCLNQNHVFRVR 310 Query: 325 LTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 K+ L P ++ ++ SS + R + VK T+ I+ +K+ V PP++EQ R Sbjct: 311 CDKEVLDPHFLSLYMSSAAGRRYFLRVVKQTTNLASINSSQLKAMPVPCPPLEEQ----R 366 Query: 384 RVEQLFAYADTIEKQVNNALARVNNL 409 R +L D Q L ++ L Sbjct: 367 RTVELVGSCDEQIAQEEGELTKLREL 392 Score = 45.1 bits (105), Expect = 0.006, Method: Compositional matrix adjust. 
Identities = 42/189 (22%), Positives = 83/189 (43%), Gaps = 8/189 (4%) Query: 232 LTELRNG--LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 L E+R G LS E+ P LR+++V G ++ D+ + + +E + L+ GD+L Sbjct: 15 LGEVRMGKQLSPSSREAAGQFPYLRVANVHLGRIEYVDVNEMGFTPAERVTYGLKPGDIL 74 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 SLE VG + + + + + + LIR R L Y ++ F Sbjct: 75 LNE-GQSLELVGRSAIYDRAEGE-FCFQNTLIRFRPNGCILSAYAQVVFEHWLRSGVFAA 132 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K T+ + G + L P Q IV ++ L +E+++ ++ ++ ++ Sbjct: 133 IAKQTTSIAHLGGDRFAALKFPLLPTGMQQRIVAVLDSL----AELERRIEASIVKLRSV 188 Query: 410 TQSILAKAF 418 + I+++ F Sbjct: 189 RKGIISEQF 197 >UniRef50_A4FZ34 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus maripaludis C5 RepID=A4FZ34_METM5 Length = 402 Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust. Identities = 93/424 (21%), Positives = 172/424 (40%), Gaps = 31/424 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + + + G T + + Y + +P ++ +++ T + Sbjct: 5 LPDGWEVKKLGDIGNISAGGTPSRSKP-EYWNNGSIPWVKIADMKEKHVKNTSEFITEEG 63 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFG-AFCGVLRPEKLIFSGFIAH 124 L K S KI + ++ S VG L + S A G+ K + ++ + Sbjct: 64 LNKSSAKIFKKGTILISIFASLGTVG-----ILDIDASTNQAIAGINVNSKKVIPEYLYY 118 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + KS +N G NNI + I +PPL Q+ I E L+ + ++ + Sbjct: 119 YLKS--LKNYFMGAGRGVAQNNINLSILKDTEIFVPPLETQQKIVEILEKIEYGINLREK 176 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN--FESILTELRNGLSSK 242 + ++K + G V+ P KK+ I++ G + Sbjct: 177 AILETENLVKAVFLDMFGDPVSN----------PMGWDVKKIGTFVNDIISGWSVGGDER 226 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 P ++ +L+ISSV +G ++ + + ++ H L+ GDLLF+R N V Sbjct: 227 PKKAD-ELAVLKISSVTSGKFKSSEHKVVNSEITKKLVHPLK-GDLLFSRANTRELVAAV 284 Query: 303 
CGLLKKLQHQNLLYPDKLIRARLTKDALPE-YIEIFFSSPSARNAMMNCVKTTSGQK-GI 360 C + + +L PDKL + L K+ + Y P+ R + TSG I Sbjct: 285 C--IVDNDYMDLFLPDKLWKIILNKNIVSSYYFRQVLQDPTYRANLTKKATGTSGSMLNI 342 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 S + +PP+ Q + + +E+L + I+++ N+ + +L L KAF+G Sbjct: 343 SKSKLIENEFPIPPIGLQNKFAKIIEKL----EEIKEKQENSKKEMEDLFNLSLQKAFKG 398 Query: 421 ELTA 424 EL Sbjct: 399 ELAC 402 >UniRef50_D1PEN6 Type I restriction-modification enzyme S subunit n=1 Tax=Prevotella copri DSM 18205 RepID=D1PEN6_9BACT Length = 450 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 86/415 (20%), Positives = 173/415 (41%), Gaps = 59/415 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINY--------LKDDYLPLIRANNIQNGKFDT 56 +LP+GWV V +T V + E N ++ D +I+ N K + Sbjct: 67 ELPKGWVWTTVGEITNYGDSVNVQVEDIDNSDWVLELEDIEKDTAKIIQHLNKNERKING 126 Query: 57 TDLVFVPKNLVKESQKISPEDIVIAMSSG--SKSVVGKSAHQHLPFECSFGAFCGVLRPE 114 T F ++ + +++A + G + ++ ++ L S C VLR Sbjct: 127 TRHKFQKGQILYSKLRTYLNKVLVAPNDGFCTTEIMAFGSYGIL----SNNYICYVLR-- 180 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 S + +T Y K+ LS N + IP+PPLAEQ+ I ++ Sbjct: 181 ----SLYFLDYTLQCGYGVKMPRLSTTDACNGL---------IPLPPLAEQERIVNEIQR 227 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE 234 L + +D + + + +++ + +L A++GKL + N EP + K++N ++ +T Sbjct: 228 LFSIIDIVENGKDGLQTAIQQAKNKILDHAIHGKLVPQDPNDEPASELLKRINPKAEITC 287 Query: 235 LRNGLSSKPN---ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 P E+ +G+ I+ +++G ++ ++ + ++ + G+ + T Sbjct: 288 DNPQYGKLPKGWCETTLGNTIV----IKSGDA-------IKVRDNRIGKYPIYGGNGI-T 335 Query: 292 RYNGSLEFVGV----------CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 YN S G+ CG + + ++ + + + + P+++ Sbjct: 336 GYNESYNVDGINIIIGRVGFYCGSVHYVNNKIWVTDNAFVTKIMGNVYTPKFLYYLLQQY 395 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + ++ Q ISGK + V+LPP+ EQ IV ++E+LF+ D IE Sbjct: 396 
DLQQ-----YSNSTAQPVISGKTVYPINVMLPPLSEQYRIVAKIEELFSQLDKIE 445 >UniRef50_C3RBV6 Type I restriction-modification system n=3 Tax=Bacteroides RepID=C3RBV6_9BACE Length = 423 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 96/431 (22%), Positives = 165/431 (38%), Gaps = 39/431 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W +S V +I T +Y + L ++ ++ NG T P Sbjct: 17 GEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDLNNGLITETSKKITP 76 Query: 64 KNLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 K + + K P +VIAM + VG L E + C ++ P K I + Sbjct: 77 KAVDECKMKFYPIHSVVIAMYGATIGKVGL-----LDIETATNQACCIIVPSKRICPKYT 131 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F + + ++ S G NI + +P+PPL+EQ+ IA LD ++D Sbjct: 132 --FYSFIIAKEELLLSSFGGGQPNISQDIIRKLKVPVPPLSEQQSIASYLDVKTEKIDKM 189 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 A+ E+ + L +Q+++ AV L W P H L F Sbjct: 190 IAKAEKKIEYLGELKQSLITRAVTRGLNPNTPLKDSGVNWIGNIPMHWDIACLRF---FL 246 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 L NG + NE P + +R G+ ND + S EL K D D L + Sbjct: 247 RLINGRAYSQNEL---LPSGKYKVLRVGNFFTNDSWYY--SNMELEPDKYCDKDDLLYAW 301 Query: 294 NGSLEFVG--VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + S VG + K + H ++ ++ + D + Y + + + M Sbjct: 302 SAS---VGPYIWNEAKTIYHYHIW----KVQLATSMDKMYSYYLLRAVTNQKMSDMHG-- 352 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 S I+ D+ + +PP+ EQ +I ++ + D I +A + L Q Sbjct: 353 ---STMMHITMGDMNKTKIPIPPLSEQQQIATYLDTKCSKIDHIIATQKKKIAYLQELKQ 409 Query: 412 SILAKAFRGEL 422 S++ G++ Sbjct: 410 SLITNVVTGKI 420 >UniRef50_C6CZ61 Restriction modification system DNA specificity domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CZ61_PAESJ Length = 456 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. 
Identities = 81/348 (23%), Positives = 150/348 (43%), Gaps = 40/348 (11%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIA 169 VLR + + F+ +F S + + ++ G I I++ +PPL EQK IA Sbjct: 118 VLRCNHYVENIFLNYFLSSPQGKKLLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIA 177 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE 229 +K++ LL +++ K E+ + + A+L A G+LT+KWR +HS N Sbjct: 178 DKVERLLDKINQAKQLIEEAKATFELRQAAILDKAFRGELTKKWRG---EHS-----NQI 229 Query: 230 SILTELRNGLSSKPNES----GVGHPILRISSV------RAGHVDQNDIRFLECSESELN 279 S + + ++ PNE G +R+ + ++ H +ND + + Sbjct: 230 STVRSISEDIN--PNEIPFLLPAGWNWVRLKDLGTLERGKSKHRPRNDPKLFGGEYPFIQ 287 Query: 280 RHKLQDGDLLFTRYNGSLEFVG-----------VC-GLLKKLQHQNLL-----YPDKLIR 322 + + YN +L G VC + + LL +PD ++ Sbjct: 288 TGDVANAGDYIESYNQTLSEFGLLQSKLFPEGTVCITIAANIADTALLKFPCCFPDSVV- 346 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 + KDA + + + + ++ + + T+ QK I+ K ++ +V +PP E EI+ Sbjct: 347 GFIPKDAYISSLYLHYYMRTIKSNLEHYAPATA-QKNINLKVLQEILVPVPPKTEHDEIL 405 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 + L D + + N + + L QS+L+KAF+G L +EN Sbjct: 406 HMI-NLLMQKDEEAQTIMNVASDLEILKQSVLSKAFQGNLGTNESSEN 452 Score = 65.9 bits (159), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 50/183 (27%), Positives = 87/183 (47%), Gaps = 5/183 (2%) Query: 253 LRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ 312 +R++ +R G + E S L++ L G++L + VG ++ + Sbjct: 53 VRLTDLRLGLGHEGQKYVDETSYKFLSKSSLTGGEILIANIGAN---VGEVFVMPNVDLL 109 Query: 313 NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL 372 + P+ +I R ++ F SSP + ++ + T +GQ I+ +K+ V L Sbjct: 110 ATIAPN-MIVLRCNHYVENIFLNYFLSSPQGKK-LLGTIITGTGQPKINKTGLKTISVAL 167 Query: 373 PPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPD 432 PP+ EQ I +VE+L + ++ + A A +IL KAFRGELT +WR E+ + Sbjct: 168 PPLNEQKRIADKVERLLDKINQAKQLIEEAKATFELRQAAILDKAFRGELTKKWRGEHSN 227 Query: 433 LIS 435 IS Sbjct: 228 QIS 230 >UniRef50_UPI000190446B type I restriction-modification system, S subunit n=1 Tax=Rhizobium etli 8C-3 RepID=UPI000190446B Length = 283 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 71/291 (24%), Positives = 126/291 (43%), Gaps = 54/291 (18%) Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF-------------- 216 KLD L A +I ++ R++QAVL A +G+LT +WR Sbjct: 1 KLDALNANSSRACIELARIKTLVSRYKQAVLSKAFSGELTREWRERVGIALTASQKIAIA 60 Query: 217 -EPQHSVFKK---------LNFESIL---TELRNGLSSKPNESGVGHP------------ 251 +H++ + +NFE+I T +G+ + +E VG Sbjct: 61 EAVKHALSSRRGARTNTHQVNFEAIADIPTSWADGIIAIGSEMVVGFAFKSEWFRAAGIK 120 Query: 252 ILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 +LR +++ G ++ +D++ L+ S E +++ +++ D++ + + G+ Q Sbjct: 121 LLRGANIAPGAINWSDLKCLDTSIADEFSKYLIEEDDIVLA-MDRPVISTGLKVARVTCQ 179 Query: 311 HQNLLYPDKLIRARLTKDALPEYI------EIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 L ++ R R T+ ++ ++F S R T S ISG D Sbjct: 180 DAGCLLVQRVTRFRATEFVTQSFLWWLLNSQMFLSHSLQR-------ATGSDLPHISGDD 232 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 I + + +PP +EQ EIVRR+E FA D + + AL V L ++ILA Sbjct: 233 IATCPIPIPPKEEQHEIVRRIESAFAKIDRLAAEAKRALELVGKLDEAILA 283 Score = 54.3 bits (129), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 44/204 (21%), Positives = 89/204 (43%), Gaps = 15/204 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P W ++ + ++ G +K E + + + L+R NI G + +DL + + Sbjct: 88 IPTSWADGIIAIGSEMVVGFAFKSE----WFRAAGIKLLRGANIAPGAINWSDLKCLDTS 143 Query: 66 LVKESQK--ISPEDIVIAM-----SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + E K I +DIV+AM S+G K V + Q C R + + Sbjct: 144 IADEFSKYLIEEDDIVLAMDRPVISTGLK--VARVTCQDAG--CLLVQRVTRFRATEFVT 199 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ S ++ + + G+++ +I IPIPP EQ I ++++ A+ Sbjct: 200 QSFLWWLLNSQMFLSHSLQRATGSDLPHISGDDIATCPIPIPPKEEQHEIVRRIESAFAK 259 Query: 179 VDSTKARFEQIPQILKRFRQAVLG 202 +D A ++ +++ + +A+L Sbjct: 260 IDRLAAEAKRALELVGKLDEAILA 283 >UniRef50_C6A4W8 Putative type I specificity subunit HsdS n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A4W8_THESM Length = 434 Score = 69.3 bits (168), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 97/428 (22%), Positives = 189/428 (44%), Gaps = 37/428 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVP 63 +LPEGW + + L G T + + Y ++ +P ++ ++I +G + T+ Sbjct: 34 ELPEGWRWVRLGDIAELKAGGTPSR-RVKEYWENGTIPWVKISDIPDSGLVEKTEEKITE 92 Query: 64 KNLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L S K+ SP I+ ++ + + K +P + A G++ P+ I G++ Sbjct: 93 LGLKNSSAKLLSPGTILFSIFA----TISKVGILKIP-AATNQAIVGII-PKISIDRGYL 146 Query: 123 ----AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 +F + +Y+ + G +NI + IP+PP+ EQK I KLD + + Sbjct: 147 FYSLKYFGQELVYQGR------GGVQDNINMRILSKLKIPLPPIEEQKRIVAKLDEVHRR 200 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 ++ K + + +R + L + + W + + K++ E++ G Sbjct: 201 LEEAKRLAREAREEAERLMASALHEVFSKAEEKGW-----EWTTIGKVS-----REMKPG 250 Query: 239 LS-SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGS 296 + +K + S G P LR ++V G ++ I + + + + L+ GD+LF N S Sbjct: 251 FARNKKHISRDGVPHLRPNNVDVGRLNLKKIVKVTLDDKINIEEYYLKKGDVLFNNTN-S 309 Query: 297 
LEFVGVCGLL-KKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTT 354 E VG ++ + L++ Y + + R R+ K+ LPE++ + + + Sbjct: 310 FELVGRAAIVPEDLKYG---YSNHITRIRVKKEVILPEWLTLAINYLWMQGYFREVCTRW 366 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 GQ G++ + + LP ++EQ IV ++ + A + K + L +IL Sbjct: 367 VGQAGVNMNTLAKTRIPLPSLEEQKRIVSYLDSIQERAQKLVKLYEEREKELEKLFPAIL 426 Query: 415 AKAFRGEL 422 KAFRGEL Sbjct: 427 DKAFRGEL 434 >UniRef50_C2H9J2 Possible type I restriction-modification system specificity subunit n=9 Tax=Enterococcus faecium RepID=C2H9J2_ENTFC Length = 187 Score = 68.9 bits (167), Expect = 3e-10, Method: Compositional matrix adjust. Identities = 41/159 (25%), Positives = 87/159 (54%), Gaps = 6/159 (3%) Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 SI T+++ G + + G LRI+ ++ G V+ + + + + S S+L +L++ D+L Sbjct: 25 SISTKIQYGYTDSAKKQG-NVKFLRITDIQEGRVNWSSVPYCDISNSKLVDLRLEENDIL 83 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 R G++ G L+K++ ++ ++ LIR RL + L EY++ F SP ++ Sbjct: 84 IARTGGTM---GKSFLVKEISEES-VFASYLIRIRLVEKLLSEYVDCFLDSP-LYWKLLE 138 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 + +GQ ++G ++ ++ LPP++EQ + ++E + Sbjct: 139 KISYGTGQPNVNGTNLSKLLIPLPPLEEQQRMTTKIEMI 177 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 34/138 (24%), Positives = 67/138 (48%), Gaps = 11/138 (7%) Query: 43 LIRANNIQNGKFDTTDLVFVPKNLVKESQ----KISPEDIVIAMSSGSKSVVGKS-AHQH 97 +R +IQ G+ + + VP + S+ ++ DI+IA + G+ +GKS + Sbjct: 46 FLRITDIQEGRVNWSS---VPYCDISNSKLVDLRLEENDILIARTGGT---MGKSFLVKE 99 Query: 98 LPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINI 157 + E F ++ +R + + S ++ F S LY + +S G N+ + + I Sbjct: 100 ISEESVFASYLIRIRLVEKLLSEYVDCFLDSPLYWKLLEKISYGTGQPNVNGTNLSKLLI 159 Query: 158 PIPPLAEQKIIAEKLDTL 175 P+PPL EQ+ + K++ + Sbjct: 160 PLPPLEEQQRMTTKIEMI 177 >UniRef50_D0BWI7 Predicted protein n=1 Tax=Acinetobacter sp. 
RUH2624 RepID=D0BWI7_9GAMM Length = 396 Score = 68.9 bits (167), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 95/430 (22%), Positives = 185/430 (43%), Gaps = 52/430 (12%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 KLP+GW + V + ++ + + LP++ + N+ + + +P+ Sbjct: 7 KLPDGWDWKTLGDVCFKVTDGSHNPPKEVEV----GLPMLSSRNVMDNGLVWDNFRLIPE 62 Query: 65 NLVKESQK---ISPEDIVIAM-SSGSKSVVGKSAHQHLPFECSFGAFCGV-LRPEKLIFS 119 + + K +S D+++ + + +S V ++ + + S L PE L + Sbjct: 63 DAFESEHKRTRVSEGDVLLTIVGTIGRSCVVRNLDRLFTLQRSVAVLSSEELIPEFLSYQ 122 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F A F + N S G + +K A++ + PP+ EQ I EKLD L ++ Sbjct: 123 -FRAPFIQEHFISNAKGSAQKGIYLKQLK-ATY----LVCPPIEEQNRIVEKLDALFTRI 176 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI-LTELRNG 238 D + + K+ +VL FK + +S+ LT++ Sbjct: 177 DIAIEHLQSKLDLSKQLFDSVL------------------DEFFKLPDCDSVPLTQVVEF 218 Query: 239 LS-SKPNESGVG----HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 + S+P +S +R+ +R D N I +++ + + + D++ RY Sbjct: 219 IGGSQPPKSQFSDVQKEGYVRLIQIRDYKSD-NHIVYVDSAST---KKFCTKDDVMIGRY 274 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVK 352 V +L+ L Y L++A +D L +Y+ F SPS +N ++ + Sbjct: 275 GPP-----VFQILRGLDGA---YNVALMKAVPNEDLLMKDYLFWFLQSPSIQNYVIGISQ 326 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 +GQ G++ K ++ ++ +P Q +IV +V QL + + +E +V +A ++ L S Sbjct: 327 RAAGQSGVNKKALEKYLIPVPSKAIQNDIVDKVGQLVSKSRHLEAEVTAEIAFLSQLKAS 386 Query: 413 ILAKAFRGEL 422 IL AF+GEL Sbjct: 387 ILDSAFKGEL 396 >UniRef50_B9KF72 Type I restriction-modification system, S subunit n=2 Tax=Campylobacter RepID=B9KF72_CAMLR Length = 390 Score = 68.9 bits (167), Expect = 4e-10, Method: Compositional matrix adjust. 
Identities = 75/326 (23%), Positives = 135/326 (41%), Gaps = 34/326 (10%) Query: 110 VLRPE-KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKII 168 +L+P +++ + F+ +F S I+ GA + + I I +PPL EQ+ I Sbjct: 86 ILKPNNEILINKFLVYFLNYSNLEKYIT----GATVKKLNQQKLKQIEILLPPLKEQERI 141 Query: 169 AEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLN 227 LD A +D + EQ L Q+ L N N++ PQ +K L Sbjct: 142 VGILDESFANIDESIKILEQDLLNLDELMQSALQKTFNPLKDNAKENYQLPQDWEWKSLG 201 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQD 285 +T+ G PN G P L + ++ G D +DI+++ E + R K + Sbjct: 202 EICFITD---GTHKTPNYIETGIPFLSVKNISKGFFDLSDIKYISLEEHNKLIKRAKPEF 258 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQ---------NLLYPDKLIRARLTKDALPEYIEI 336 GD+L R +G G K+ + LL P + ++ D L ++ Sbjct: 259 GDILICR-------IGTLGKAIKISLEFEFSIFVSLGLLKP----KVKIISDYLVYFLNS 307 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +F N N V + ++ ++ + LP +KEQ +I +++ ++ Sbjct: 308 YFIEGWINN---NKVGGGTHTAKLNLNILEKCPIALPSLKEQEQIASYLDEFSLNIKDLK 364 Query: 397 KQVNNALARVNNLTQSILAKAFRGEL 422 + + + L +S+L KAF+G+L Sbjct: 365 QNYQAQIKNLQELKKSLLDKAFKGKL 390 Score = 57.8 bits (138), Expect = 8e-07, Method: Compositional matrix adjust. 
Identities = 54/213 (25%), Positives = 93/213 (43%), Gaps = 20/213 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP- 63 +LP+ W + + + G + NY++ +P + NI G FD +D+ ++ Sbjct: 190 QLPQDWEWKSLGEICFITDGT----HKTPNYIETG-IPFLSVKNISKGFFDLSDIKYISL 244 Query: 64 ---KNLVKESQKISPE--DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 L+K ++ PE DI+I +GK+ L FE S G+L+P+ I Sbjct: 245 EEHNKLIKRAK---PEFGDILICRIG----TLGKAIKISLEFEFSIFVSLGLLKPKVKII 297 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPI--PPLAEQKIIAEKLDTLL 176 S ++ +F S I++ G + K L PI P L EQ+ IA LD Sbjct: 298 SDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNLNILEKCPIALPSLKEQEQIASYLDEFS 357 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K ++ + L+ ++++L A GKL Sbjct: 358 LNIKDLKQNYQAQIKNLQELKKSLLDKAFKGKL 390 >UniRef50_B2J095 Restriction modification system DNA specificity domain protein n=1 Tax=Nostoc punctiforme PCC 73102 RepID=B2J095_NOSP7 Length = 530 Score = 68.6 bits (166), Expect = 4e-10, Method: Compositional matrix adjust. Identities = 53/215 (24%), Positives = 91/215 (42%), Gaps = 7/215 (3%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + V + G T ++ I D +P + + + + T Sbjct: 7 ELPEGWQWKNLGEVFEIFVGATPSRK--IPEYWDGSIPWVSSGEVAFCEIYETRETITEL 64 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIA 123 L S ++ P V+ G G++A L + +R ++ + ++ Sbjct: 65 GLKNTSTELHPPGTVLLGMIGEGKTRGQAAI--LKIYATHNQNSAAIRVSEIGLPPEYVY 122 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F K R + + +G N + + L++ P+PPL EQK I ++ L + K Sbjct: 123 YFLKLEYERTR--QIGSGNNQQALNKSRVQLMSFPVPPLNEQKRIVANIEELNDRTQRAK 180 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 + IPQ+ RFRQ+VL A G LT WR+ P Sbjct: 181 EALDSIPQLCDRFRQSVLAAAFRGDLTADWRDQNP 215 Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 34/115 (29%), Positives = 58/115 (50%), Gaps = 7/115 (6%) Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 PEY+ F R + + + Q+ ++ ++ +PP+ EQ IV +E+L Sbjct: 118 PEYVYYFLKLEYERTRQ---IGSGNNQQALNKSRVQLMSFPVPPLNEQKRIVANIEELND 174 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLE 445 ++ +++ + QS+LA AFRG+LTA WR +NPD+ A+ LLE Sbjct: 175 RTQRAKEALDSIPQLCDRFRQSVLAAAFRGDLTADWRDQNPDV----EPASVLLE 225 Score = 52.4 bits (124), Expect = 4e-05, Method: Compositional matrix adjust. Identities = 51/230 (22%), Positives = 102/230 (44%), Gaps = 14/230 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTY-KKEQAINYLKDDYLPLIRANNIQ-NGKFDTTD--LV 60 +LP GWV V G + KE N +K L+R N+ +G+ + D Sbjct: 272 ELPNGWVWTKWEQVGFCQNGRAFPSKEYQTNGVK-----LLRPGNLHVSGEIEWNDSNTR 326 Query: 61 FVPKNLVKESQK--ISPEDIVIAMSSGS--KSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 ++ ++ ++ IS ++VI +++ S +G+ C L P + Sbjct: 327 YLSEDWAEQYPDYLISTNELVINLTAQSLADEFLGRICLTGEDERCLLNQRIARLVP-II 385 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I F+ KS L+R+ + L+ G+ I +I + + P+PPL EQ++I ++T + Sbjct: 386 ISPRFLFWLFKSKLFRSYVDDLNTGSLIQHIFTPQINKFHFPLPPLKEQQMIVNLIETQI 445 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 +++ + Q+ Q++L A G+L + + EP + +++ Sbjct: 446 NSIENIGLKAGQMQNAFPHLNQSILAKAFRGELVPQEPDDEPASVLLERI 495 Score = 49.7 bits (117), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 36/83 (43%), Positives = 46/83 (55%), Gaps = 14/83 (16%) Query: 372 LPPVKEQAEIVRRVEQLFAYADTI---EKQVNNALARVNNLTQSILAKAFRGELTAQWRA 428 LPP+KEQ IV +E + I Q+ NA +N QSILAKAFRGEL Q Sbjct: 428 LPPLKEQQMIVNLIETQINSIENIGLKAGQMQNAFPHLN---QSILAKAFRGELVPQ--- 481 Query: 429 ENPDLISGENSAAALLEKIKAER 451 PD + A+ LLE+I+A+R Sbjct: 482 -EPD----DEPASVLLERIRAKR 499 >UniRef50_A6UXD7 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas aeruginosa PA7 RepID=A6UXD7_PSEA7 Length = 464 Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 73/328 (22%), Positives = 144/328 (43%), Gaps = 25/328 (7%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIA 169 + RP+ + FI + S Y S+L+ GA + I I + P L EQ IA Sbjct: 125 IFRPDLKFYKKFIVYLFSSEEYFKHTSNLARGATMQRISRGLLGNIRVVTPSLEEQTQIA 184 Query: 170 EKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQH 220 LD A++D+ +++ ++LK RQAV+ AV L +W P H Sbjct: 185 RFLDHETARIDALIEEQQRLIELLKEKRQAVISHAVTKGLDPTVPMKDSGVEWLGEVPAH 244 Query: 221 SVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRF------LECS 274 + ++ SI ++ NG + V P +R +++ H+ N I+F E Sbjct: 245 WEVRSIS--SISKKITNGYVGPTRDILVDEPGVRY--LQSLHIKSNKIKFEVPYFVSEQW 300 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYI 334 +E + L GD+L + G + V V +H +I + + + L E++ Sbjct: 301 SAEHAKSILASGDVLIVQ-TGDIGQVAVV----TEEHAGCNCHALIIVSPVREVVLGEWV 355 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 +S +++++ ++T + ++ ++K + +PP++EQA IV +E D+ Sbjct: 356 SWVLNSTYGYHSLLS-IQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIESGELEMDS 414 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGEL 422 + + +L + ++++ A G++ Sbjct: 415 LMSETKRSLLLLQERRTALISAAVTGKI 442 Score = 46.6 bits (109), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 51/227 (22%), Positives = 103/227 (45%), Gaps = 22/227 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIR----GVTYK---KEQAINYLKDDYLPLIRANNIQNGKFDT 56 G++P W + +S+++ I G T E + YL+ + I++N I KF+ Sbjct: 239 GEVPAHWEVRSISSISKKITNGYVGPTRDILVDEPGVRYLQSLH---IKSNKI---KFEV 292 Query: 57 TDLVFVPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE 114 FV + E K ++ D++I + +G V +H C +R Sbjct: 293 P--YFVSEQWSAEHAKSILASGDVLI-VQTGDIGQVAVVTEEHAGCNCHALIIVSPVR-- 347 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 +++ +++ S+ + + S+ GA ++ + +N+PIPPL EQ I +++ Sbjct: 348 EVVLGEWVSWVLNSTYGYHSLLSIQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIES 407 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHS 221 ++DS + ++ +L+ R A++ AV GK+ R ++P S Sbjct: 408 GELEMDSLMSETKRSLLLLQERRTALISAAVTGKIDV--RGWQPPAS 452 >UniRef50_Q0EXK2 HsdS protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EXK2_9PROT Length = 462 Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust. Identities = 52/211 (24%), Positives = 103/211 (48%), Gaps = 11/211 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W I+ + ++ + +K +A Y+ + Y+ + NI+ K D ++ ++ Sbjct: 234 GEVPAHWEISSLGFECSVKARLGWKGLKAEEYVDEGYI-FLATPNIKGEKIDFENVNYIT 292 Query: 64 KNLVKESQKI--SPEDIVI---AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 K ES +I + D+++ ++G+ ++V + LP + + VLR I Sbjct: 293 KARYDESPEIMLNEGDVLVTKDGSTTGTTNIV-----RELPSPATVNSSIAVLRSVGRID 347 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S ++ +F S+ +N I + G + ++ A N+ +PP EQK IA ++D L + Sbjct: 348 SSYLYYFFVSTYVQNVIKRIQGGMGVPHLFQADLRKFNVLMPPFKEQKEIAAEIDMRLPK 407 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 D A+ E ++K R A++ AV GK+ Sbjct: 408 FDDLIAKAEYSILLMKERRTALISAAVTGKI 438 Score = 59.7 bits (143), Expect = 2e-07, Method: Compositional matrix adjust. 
Identities = 98/463 (21%), Positives = 195/463 (42%), Gaps = 42/463 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPL--IRANNIQNGKFDTTDLVF 61 G++P WV+ T T I +T KK + + ++P+ ++ ++I + T D V+ Sbjct: 20 GEIPAHWVL----TRTKYISELTPKKPKISRDKECSFIPMEKLKTDSIVLDEVRTIDDVY 75 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSG 120 +S D+++A + + Q L FG+ V+R + + + Sbjct: 76 DGYTYFADS------DVLMAKVTPCFENKNIAIAQDLVNGVGFGSSEIYVIRANQRVSNR 129 Query: 121 FIAH-FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ + + S I++++ + + + +P EQ IA LD A++ Sbjct: 130 FLFYRLQEDSFMEIAIAAMTGAGGLKRVPSDVLNNYIAAVPQHDEQMEIANFLDRETAKI 189 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D+ + +Q+ ++LK RQAV+ AV L +W P H L FE Sbjct: 190 DTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMRNSGIEWLGEVPAHWEISSLGFEC 249 Query: 231 ILTELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGD 287 + + R G E V G+ L +++ +D ++ ++ + E L +GD Sbjct: 250 SV-KARLGWKGLKAEEYVDEGYIFLATPNIKGEKIDFENVNYITKARYDESPEIMLNEGD 308 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD-KLIRARLTKDALPEYIEIFFSSPSARNA 346 +L T+ +GS G ++++L + ++R+ D+ Y+ FF S Sbjct: 309 VLVTK-DGST--TGTTNIVRELPSPATVNSSIAVLRSVGRIDS--SYLYYFFVS----TY 359 Query: 347 MMNCVKTTSGQKGISG---KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + N +K G G+ D++ VL+PP KEQ EI ++ D + + ++ Sbjct: 360 VQNVIKRIQGGMGVPHLFQADLRKFNVLMPPFKEQKEIAAEIDMRLPKFDDLIAKAEYSI 419 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENP--DLISGENSAAALL 444 + ++++ A G++ + +P DL S +++ A L Sbjct: 420 LLMKERRTALISAAVTGKIDVRHHVSHPTGDLQSSKSAIHADL 462 >UniRef50_C7NM09 Putative uncharacterized protein n=1 Tax=Kytococcus sedentarius DSM 20547 RepID=C7NM09_KYTSD Length = 418 Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 65/274 (23%), Positives = 117/274 (42%), Gaps = 19/274 (6%) Query: 155 INIPIPPLAEQKIIAEKLDTLLAQVDST-KARFEQIPQILKRFRQAVLGGAVNGKLTEKW 213 + +P+PP +Q+ IA+ LD ++++D AR Q Q+ A G ++ +LT+ Sbjct: 147 LRLPLPPEPDQRRIADFLDDRVSRIDRIIAARNTQRGQV-----AAQAGQLIDHQLTD-- 199 Query: 214 RNFEPQHSVFKKLNFESILTELRNGLS----SKPNESGVGHPILRISSVRAGHVDQNDIR 269 + + +LT+L G S +P E G ++R V +G D + Sbjct: 200 -----HGDRWGAVRLGRLLTKLEQGWSPAADQQPAELGQWG-VMRAGCVNSGEFRAEDNK 253 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 L + ++++ GDL+ +R +GSL+ +G L+ LL DKL R R Sbjct: 254 RLPDAVEPRLEYEIKGGDLIMSRASGSLDLIGSVALVPDSVRDQLLLCDKLYRLRTVAGL 313 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQ-KGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 +P+Y + R + V G + I+S ++ LP Q E + R E Sbjct: 314 VPQYTAHALRHHANRQRIRQGVSGAEGMANNLPSGVIRSLMIPLPDRSTQIEAIDRWEDE 373 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 A + + ++ + QS++ A G+L Sbjct: 374 MAGNRRTQAALTRSIELLTEYKQSLITAAVTGQL 407 >UniRef50_Q0RV87 Type I restriction-modification system specificity subunit n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RV87_RHOSR Length = 391 Score = 68.6 bits (166), Expect = 5e-10, Method: Compositional matrix adjust. 
Identities = 70/269 (26%), Positives = 127/269 (47%), Gaps = 20/269 (7%) Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF 216 IP+PP+ EQ IA+ LD A++D+ ++ ++L+ R AV G V G W Sbjct: 133 IPLPPITEQGAIADFLDRETARIDTLIREQRRLIELLRERRIAVAEGPVVGL---SWST- 188 Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE- 275 P SV + ++L+ S E+G G P++ S + G ++ ++ + S+ Sbjct: 189 -PLRSVTALIQTGPFGSQLK----SDEYETG-GTPVINPSHLVMGRIEPDERVAVSASKA 242 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLL--YPDKLIRARLTKDALPEY 333 SEL RH L+ GD++ R G L G C +++ ++ L LIR R T A PE+ Sbjct: 243 SELGRHALRAGDVIAAR-RGEL---GRCAVVRA-ENTGFLCGTGSALIRLRETV-ADPEF 296 Query: 334 IEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYAD 393 + + FSS R++ ++ + ++ I + + +PP+ EQ IV V + D Sbjct: 297 LALVFSSRRNRDS-LSLASVGATMDNLNADIIATLRIPMPPLPEQRRIVESVAEATTKID 355 Query: 394 TIEKQVNNALARVNNLTQSILAKAFRGEL 422 T+ + + + +++ A G++ Sbjct: 356 TLITETESFIDLAKERRSALITAAVTGQI 384 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 48/198 (24%), Positives = 86/198 (43%), Gaps = 4/198 (2%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE--SQ 71 P+ +VT LI+ + + + + P+I +++ G+ + + V V + E Sbjct: 189 PLRSVTALIQTGPFGSQLKSDEYETGGTPVINPSHLVMGRIEPDERVAVSASKASELGRH 248 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 + D VIA G ++ F C G+ LR E + F+A S Sbjct: 249 ALRAGD-VIAARRGELGRCAVVRAENTGFLCGTGSALIRLR-ETVADPEFLALVFSSRRN 306 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 R+ +S S GA ++N+ + IP+PPL EQ+ I E + ++D+ E Sbjct: 307 RDSLSLASVGATMDNLNADIIATLRIPMPPLPEQRRIVESVAEATTKIDTLITETESFID 366 Query: 192 ILKRFRQAVLGGAVNGKL 209 + K R A++ AV G++ Sbjct: 367 LAKERRSALITAAVTGQI 384 >UniRef50_UPI0001BCA660 restriction modification system DNA specificity domain n=2 Tax=Fusobacterium periodonticum ATCC 33693 RepID=UPI0001BCA660 Length = 180 Score = 68.2 bits (165), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 44/151 (29%), Positives = 82/151 (54%), Gaps = 5/151 (3%) Query: 240 SSKPNESGVGH-PILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 +SK S VG PILR++++ +G ++ D++++E S+SE + L+ G+LLF R N S Sbjct: 29 TSKKATSVVGEFPILRMNNITYSGEMNYKDLKYIELSDSEKEKFLLKKGELLFNRTN-SK 87 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 E VG GL + + LI+ R + +++ F +S + + N K G Sbjct: 88 ELVGKTGLFN--LDIPMAFAGYLIKIRPSNLIHSKFLLFFMNSEFMKKLLYNKAKNIVGM 145 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 I+ K+++ ++LPP++ Q + R+E++ Sbjct: 146 ANINAKELEDFSIILPPIELQNKFAERIEKI 176 Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust. 
Identities = 37/140 (26%), Positives = 70/140 (50%), Gaps = 8/140 (5%) Query: 41 LPLIRANNIQ-NGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP 99 P++R NNI +G+ + DL ++ + ++ + + + ++ + SK +VGK+ +L Sbjct: 40 FPILRMNNITYSGEMNYKDLKYIELSDSEKEKFLLKKGELLFNRTNSKELVGKTGLFNLD 99 Query: 100 FECSFGAFCGVLRPEKLIFSGFIAHFTKSS----LYRNKISSLSAGANINNIKPASFDLI 155 +F + +RP LI S F+ F S L NK ++ ANIN + F +I Sbjct: 100 IPMAFAGYLIKIRPSNLIHSKFLLFFMNSEFMKKLLYNKAKNIVGMANINAKELEDFSII 159 Query: 156 NIPIPPLAEQKIIAEKLDTL 175 +PP+ Q AE+++ + Sbjct: 160 ---LPPIELQNKFAERIEKI 176 >UniRef50_C3PVT7 Type I restriction enzyme EcoR124II specificity protein n=3 Tax=Bacteroides RepID=C3PVT7_9BACE Length = 356 Score = 68.2 bits (165), Expect = 7e-10, Method: Compositional matrix adjust. Identities = 78/355 (21%), Positives = 153/355 (43%), Gaps = 48/355 (13%) Query: 77 DIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLR-----PEKLIFSGFIAHFTKSSL 130 D+++ ++ GS +G+ A F C + ++R PE ++F KS Sbjct: 10 DLLLNITGGS---LGRCAVVPADFNCGNVSQHVCIMRSVLVEPEYFHVLVLSSYFAKSM- 65 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 KI+ G+ + + + + P+PPL EQ+ I +++ A +D + + Sbjct: 66 ---KIT----GSGREGLPKYNLEQMGFPLPPLTEQQRIVAEIEHWFALIDQIEQGKADLQ 118 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN----------------------- 227 I+K+ + +L A++GKL + N EP + K++N Sbjct: 119 TIIKQTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTFDVPNGWNWCK 178 Query: 228 FESILTELRNGLSSKPNESGVGHPIL-RISSVRAGHVDQNDIRFLECS--ESELNRHKLQ 284 + + L G S K +E +P+ + +++ G + RFL+ S +++KLQ Sbjct: 179 LNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQARFLDPSTINKWDSKYKLQ 238 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKK--LQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSP 341 GD+L VG L + L + PD + T + + EY+ + SS Sbjct: 239 TGDVLVNSTGTGT--VGRTRLFDESYLGKYPFVVPDSHVAVVRTYEEINSEYVFAYMSSQ 296 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + + + + ++ QK + +++ PP+ EQ IV+++E+LF+ D I+ Sbjct: 297 LIQQYIEDNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIEELFSVLDNIQ 351 Score = 43.1 bits (100), Expect = 0.023, 
Method: Compositional matrix adjust. Identities = 40/146 (27%), Positives = 62/146 (42%), Gaps = 10/146 (6%) Query: 287 DLLFTRYNGSLEFVGVCGLL-KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 DLL GSL G C ++ N+ ++R+ L + PEY + S Sbjct: 10 DLLLNITGGSL---GRCAVVPADFNCGNVSQHVCIMRSVLVE---PEYFHVLVLSSYFAK 63 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 +M T SG++G+ +++ LPP+ EQ IV +E FA D IE+ + Sbjct: 64 SMK---ITGSGREGLPKYNLEQMGFPLPPLTEQQRIVAEIEHWFALIDQIEQGKADLQTI 120 Query: 406 VNNLTQSILAKAFRGELTAQWRAENP 431 + IL A G+L Q + P Sbjct: 121 IKQTKSKILDLAIHGKLVPQDPNDEP 146 >UniRef50_Q4HNY2 Type I restriction-modification system specificity subunit, putative n=1 Tax=Campylobacter upsaliensis RM3195 RepID=Q4HNY2_CAMUP Length = 427 Score = 67.8 bits (164), Expect = 8e-10, Method: Compositional matrix adjust. Identities = 53/211 (25%), Positives = 101/211 (47%), Gaps = 8/211 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYK--KEQAINYLKDDYLPLIRANNIQNGKF---DTTD 58 G++P+ W I + + + GV K K+ + Y K ++ P I N+ N ++ + Sbjct: 214 GEIPKHWEIKKLKYIGEIFGGVIGKTIKDFSKEY-KPNFKPYITFTNVCNNAIINPNSME 272 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 VF+ + ++ K+ DI+ SS + VGKSA E FC R E+ + Sbjct: 273 YVFI--DFDEKQNKVLKNDILFLQSSETFEDVGKSAIYLNDDEVYLNTFCKGFRIEREAY 330 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ + S Y+ S+ +G N++ F I + +PPL EQK IAE LD + Sbjct: 331 PMYLNYLLSSLSYKRYFMSVCSGFTRINLRQEHFLDIPLILPPLQEQKEIAEFLDEKCKK 390 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++S + ++ + ++ ++ ++ AV G++ Sbjct: 391 INSAIEKTKKQIEFVREYKNTLINEAVCGRI 421 Score = 54.7 bits (130), Expect = 7e-06, Method: Compositional matrix adjust. 
Identities = 79/370 (21%), Positives = 158/370 (42%), Gaps = 54/370 (14%) Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 ++ N+VK DI++ S V KS FE +RP + + Sbjct: 59 YIGYNIVKRG------DIILNPMDLSSGYVAKST-----FEGVISQAYIKIRPLETLNLS 107 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDL---INIPIPPLAEQKIIAEKLDTLLA 177 + +F ++ + + L G + ++ D+ I IP+PPL EQK IAE LD Sbjct: 108 YYENFFQNLYHYKILWHLGKGISYDHRWTLGNDVFLNIKIPLPPLQEQKEIAEFLDKKCE 167 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNF 228 ++ + + +++ +L+ +QA++ A+ L +W P+H KKL + Sbjct: 168 KIQNYINKKQKLITLLQEKKQALINEAITKGLNPNIEFKNSGIEWLGEIPKHWEIKKLKY 227 Query: 229 ESILTELRNGL----------SSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESE 277 + E+ G+ KPN P + ++V ++ N + ++ E Sbjct: 228 ---IGEIFGGVIGKTIKDFSKEYKPN----FKPYITFTNVCNNAIINPNSMEYVFIDFDE 280 Query: 278 LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA-RLTKDALPEYIEI 336 ++K+ D+LF + + + E VG + + + +Y + + R+ ++A P Y+ Sbjct: 281 -KQNKVLKNDILFLQSSETFEDVGKSAI---YLNDDEVYLNTFCKGFRIEREAYPMYLNY 336 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 SS S + M+ V + + + + ++LPP++EQ EI + D Sbjct: 337 LLSSLSYKRYFMS-VCSGFTRINLRQEHFLDIPLILPPLQEQKEIAE-------FLDEKC 388 Query: 397 KQVNNALARV 406 K++N+A+ + Sbjct: 389 KKINSAIEKT 398 >UniRef50_A6W078 Restriction modification system DNA specificity domain n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W078_MARMS Length = 400 Score = 67.8 bits (164), Expect = 9e-10, Method: Compositional matrix adjust. 
Identities = 93/427 (21%), Positives = 182/427 (42%), Gaps = 46/427 (10%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GWV+A + V +R T+ +A + +PL+ + ++ NGK D + ++ + Sbjct: 10 LPKGWVLAKANDVMD-VRDGTHDSPKA----QATGIPLVTSKSLVNGKIDYSTCTYISEQ 64 Query: 66 ---LVKESQKISPEDIVIAM--SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + + DI+ AM + G+ +V K F+ S + + + Sbjct: 65 DHESISKRSAVDDGDILYAMIGTIGNPVIVKKD------FDFSIKNVALFKFTKTDLSNR 118 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +I H+ S L + + + S G + + + IP+PPL EQK IA LD + D Sbjct: 119 YIFHYLNSGLAKRQFENNSRGGTQKFVSLGNIRELMIPLPPLEEQKRIAAILD----KAD 174 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE-KWRNFEPQHSVFKKLNFESILTELRNGL 239 + + + +Q + F ++V +T K + P + ++ +G Sbjct: 175 AIRRKRQQAIDLADEFLRSVFLDMFGDPVTNPKGKRIVP---------LIELCNKVTDGT 225 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHK-LQDGDLLFTRYNGSL 297 P G P L IS++ G + + +F+ + + EL R ++ GD+L+T Sbjct: 226 HQSPKWEESGIPFLFISNIVNGKISFDTNKFISKETLDELTRSTPIEKGDVLYTT----- 280 Query: 298 EFVGVCGLLKKLQHQN-LLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTS 355 VG G + ++ + + + + + E++ +S R + V+ + Sbjct: 281 --VGSYGNVARVTDDTEFCFQRHIAHIKPNHEIVNAEFLTSMLASSVVRRQADSLVRGIA 338 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 QK ++ +++K +V ++ Q ++ VE + D + VN L N+L Q Sbjct: 339 -QKTLNLRELKEILVFDVSLENQKSYLKIVEPIHKIKDNYDNSVNELLNNFNSLIQ---- 393 Query: 416 KAFRGEL 422 KAF GEL Sbjct: 394 KAFSGEL 400 >UniRef50_B2IP18 Type I restriction-modification system, S subunit, putative n=10 Tax=Streptococcus pneumoniae RepID=B2IP18_STRPS Length = 372 Score = 67.8 bits (164), Expect = 9e-10, Method: Compositional matrix adjust. 
Identities = 67/283 (23%), Positives = 123/283 (43%), Gaps = 29/283 (10%) Query: 139 SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILK-RFR 197 + G+ + ++ FD I +P L EQ+ IA +LD L + + + E++ ++K RF Sbjct: 115 THGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELNLLVKSRFN 174 Query: 198 QAVLGGAVNGKLTEKWRNFEPQ-HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRIS 256 + FE SVF ++ + ELR G S E + +L+ Sbjct: 175 EM----------------FEEYPDSVF----LDTYIKELRAG-KSLAGEENNKNKVLKTG 213 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 +V + + ++++ L L+ HK++ GD++ +R N S E VG G + + N+ Sbjct: 214 AVSYDYFNSSEVKNLPIDYIPLDEHKVEIGDVIISRMNTS-ELVGAAGYVWAINSDNIYL 272 Query: 317 PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPV 375 PD+L + L P ++ ++ + + TSG K IS + V PP+ Sbjct: 273 PDRLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRVPFPPL 332 Query: 376 KEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 Q E V A D + + +L + L +S++ + F Sbjct: 333 ALQNEFADFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 371 >UniRef50_B6VTA2 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VTA2_9BACE Length = 429 Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 70/275 (25%), Positives = 118/275 (42%), Gaps = 28/275 (10%) Query: 141 GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAV 200 G +++I F + IP+ P+AEQK I + A +D + + +K+ + + Sbjct: 159 GTTVDSIDFDKFRCLPIPLAPIAEQKRIIVETKRWFALIDQVEQGKVDLQTTIKQAKSKI 218 Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLN---------------FESILTELRNGLSSKP-- 243 LG A++GKL + N EP + K++N E+IL EL + + K Sbjct: 219 LGLAIHGKLVPQDLNDEPAIELLKRINPDFTPCDNGHYPVGWIETILGELFSHNTGKALN 278 Query: 244 --NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 N+ G+ L S+V D I+ + ESELN+ + GDLL G + Sbjct: 279 SSNKEGIFKDYLTTSNVYWNKFDFTAIKQMPFKESELNKCTVTKGDLLVCE-GGDIGRSA 337 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 + + QN ++ R R D + F+ N + G G+S Sbjct: 338 IWNYDYDICIQNHIH-----RLRPKIDLCVPFYYYTFAYLKENNLIG---GKGIGLLGLS 389 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + + LPP+ EQ IV+++E+LF+ D I+ Sbjct: 390 SNALHKIEMPLPPLAEQQRIVQKIEELFSVLDNIQ 424 >UniRef50_Q0EWP9 Type I restriction-modification system, S subunit n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EWP9_9PROT Length = 312 Score = 67.4 bits (163), Expect = 1e-09, Method: Compositional matrix adjust. 
Identities = 51/204 (25%), Positives = 91/204 (44%), Gaps = 8/204 (3%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVF 61 G +P+GW + + L G +K E D +P++R +NI ++G D ++ F Sbjct: 112 VGMIPKGWEVVRLGKYVKLQGGYAFKSEN----FTDKGVPVVRISNISKSGDVDLSNAAF 167 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + E+ ++S D +IAMS + VG+ + + G + ++ + Sbjct: 168 HDEINISEAFEVSHSDSLIAMSGATTGKVGRYNFREKAY---LNQRVGKFVSKGMVEMSY 224 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I H SS + K+ + G NI + + I PPL EQK I+ LD++ V + Sbjct: 225 IHHVVSSSSFTEKLLIDAIGGAQPNISGGQIEGVEIAFPPLDEQKNISSILDSIDNAVGA 284 Query: 182 TKARFEQIPQILKRFRQAVLGGAV 205 + + I + K Q +L G V Sbjct: 285 KQLKLMHIKSLKKSLMQDLLTGKV 308 >UniRef50_B0JXI4 Putative type I restriction enzyme specificity protein n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JXI4_MICAN Length = 388 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 81/347 (23%), Positives = 142/347 (40%), Gaps = 22/347 (6%) Query: 78 IVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISS 137 +++A G K+ + +C VLRP+K + +I L R ++ Sbjct: 62 VLLAEDGGHFGDADKTIAYQVEGKCWVNNHAHVLRPKKDVDIRYICR----HLERYDVTP 117 Query: 138 LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFR 197 G+ + + + I I +PPL EQ+ IA LD K ++LK Sbjct: 118 FITGSTRGKLTKTAANNIPIALPPLEEQRRIAAILDKADGVRRKRKEAIRLTDELLKSTF 177 Query: 198 QAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS 257 + G V + W E V + ES + + ++P E GV L++ + Sbjct: 178 LEMFGDPVTN--PKGWEVRELGDCV---KDIESGWSPKCDTRQAEPEEWGV----LKLGA 228 Query: 258 VRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 V GH + ++ + + + +++ GDLL TR N + E VG + ++ L+ P Sbjct: 229 VTYGHFNPDENKAMLPDDVPRQELEIKTGDLLVTRKN-TYELVGASAFV-QMTRPKLMLP 286 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVK 376 D + R RL P Y+ S + R + T+G IS +++ +PP Sbjct: 287 DLIFRLRLIDGIDPVYVWQTLSQKTMRLKLSGLAGGTAGSMPNISKARLRTLPFPVPPQL 346 Query: 
377 EQAEIVRRVEQLFAYADTIEKQVNNALARVN-NLTQSILAKAFRGEL 422 Q + Q + ++K+ ++ NL S+L +AFRGEL Sbjct: 347 LQLKYREIFNQFW-----LKKEHQKESEEISENLFNSLLQRAFRGEL 388 >UniRef50_A6CKF2 Putative type I restriction enzyme specificity protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CKF2_9BACI Length = 454 Score = 67.0 bits (162), Expect = 1e-09, Method: Compositional matrix adjust. Identities = 99/451 (21%), Positives = 199/451 (44%), Gaps = 38/451 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W + + I+G +K + D +P+I+ +I+NGK +D +F+ + Sbjct: 17 RVPEDWSEKKLKYLVETIKGYAFKSQ----LFGDKGVPIIKTTDIKNGKIQDSD-IFIDE 71 Query: 65 NLVKESQKISPEDIVIAMSS-GSK-----SVVGKSAHQHLPFECSFGAFCG----VLR-P 113 E + + + I MS+ GSK S VG+ +E GA +LR Sbjct: 72 RFEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYE---GALLNQNAVILRCK 128 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 K I + F+ +F S YR + + G AN ++ +P+P Q I+E L Sbjct: 129 SKDITNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQHQISEFL 188 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVF 223 D + V++ A +++ ++L+ RQA++ AV L KW P+H Sbjct: 189 DHKTSDVETLIADKQKLIELLEEKRQAIVTEAVTRGLNPDVKMKDSGVKWIGDIPEHWDI 248 Query: 224 KKLNFESILTELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFL-ECSESELNR 280 K+ + S + R G ++ + G ++ + + G + + + E SE Sbjct: 249 SKIKY-STYVKGRIGWQGLRSDEFIDEGPYLVTGTDFKDGIIHWDTCYHISEERYSEAPP 307 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 +L++ DLL T+ +G+ +G ++K + +L + K+ L +++ +S Sbjct: 308 IQLKENDLLITK-DGT---IGKVAIVKNKPGKAILNSGIFVTRCQDKEYLTKFMYWILTS 363 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 +N + ++T S K + + + LP ++EQ I +E D+++K+++ Sbjct: 364 EVFKN-YIKYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVREIDSVKKEIS 422 Query: 401 NALARVNNLTQSILAKAFRGELTAQWRAENP 431 + + + QS++ +A G++ + E P Sbjct: 423 DQIELLKEYRQSLIYEAVTGKIDLRDYQEVP 453 Score = 55.5 bits (132), Expect = 4e-06, Method: Compositional matrix adjust. 
Identities = 46/209 (22%), Positives = 99/209 (47%), Gaps = 7/209 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W I+ + T + + ++ ++ ++ D+ L+ + ++G + Sbjct: 240 GDIPEHWDISKIKYSTYVKGRIGWQGLRSDEFI-DEGPYLVTGTDFKDGIIHWDTCYHIS 298 Query: 64 KNLVKESQKIS-PEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLR-PEKLIFSG 120 + E+ I E+ ++ G+ +GK A ++ P + + V R +K + Sbjct: 299 EERYSEAPPIQLKENDLLITKDGT---IGKVAIVKNKPGKAILNSGIFVTRCQDKEYLTK 355 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ S +++N I + G+ I ++ +F + P+P + EQK I L+T + ++D Sbjct: 356 FMYWILTSEVFKNYIKYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVREID 415 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 S K ++LK +RQ+++ AV GK+ Sbjct: 416 SVKKEISDQIELLKEYRQSLIYEAVTGKI 444 Score = 45.8 bits (107), Expect = 0.004, Method: Compositional matrix adjust. Identities = 54/226 (23%), Positives = 94/226 (41%), Gaps = 15/226 (6%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFL 271 W P+ KKL + L E G + K G G PI++ + ++ G + +DI Sbjct: 14 WYERVPEDWSEKKLKY---LVETIKGYAFKSQLFGDKGVPIIKTTDIKNGKIQDSDIFID 70 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEF----VGVCGLLKKLQHQNLLYPDKLIRARLTK 327 E E E +++ D+L + +E VG G ++K LL + +I +K Sbjct: 71 ERFEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYEGALLNQNAVILRCKSK 130 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 D ++ F +S S R + T+ Q +S KDI + LP K Q +I ++ Sbjct: 131 DITNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQHQISEFLDH 190 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDL 433 + +T+ + + Q+I+ +A R NPD+ Sbjct: 191 KTSDVETLIADKQKLIELLEEKRQAIVTEAVT-------RGLNPDV 229 >UniRef50_Q89Z57 Putative type I restriction enzyme S.BthVORF4518AP n=2 Tax=Bacteroides RepID=Q89Z57_BACTN Length = 474 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 97/434 (22%), Positives = 172/434 (39%), Gaps = 70/434 (16%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV + ++ VT QA D +P + +++ GK + + FVP+ Sbjct: 70 EVPSSWVWC---KLEDYVKSVTDGDHQAPPK-SDIGIPFLVISDVAKGKLNFLNTRFVPQ 125 Query: 65 NLVKESQKIS----PE--DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 +KIS PE D++ ++ VV + F+ G + L Sbjct: 126 EYY---EKISFDRKPEKGDLLFTVTGSYGIVVPVNIDCKFCFQRHIGLI------KTLNT 176 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S ++ H KSS ++ + + G + + +PIPP AEQ+ I +++ + Sbjct: 177 SEYLLHLLKSSYFKGQCDEFATGTAQKTVGLETLRSFLLPIPPFAEQQRIVIEIEKWFSL 236 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 ++ + + + +K+ + +L A++GKL + N EP + K++N + T NG Sbjct: 237 IELIEGGKDDLQTTIKQAKSKILDLAIHGKLVPQDPNEEPAIKLLKRINPD--FTPCDNG 294 Query: 239 LSSK------PNESGVGH-PILRIS-------SVRAGHVDQNDIRFLECSES-------- 276 S K + H IL IS S N IR + + Sbjct: 295 HSGKLPYKIPKTWAWCSHNSILDISGGSQPAKSYFETIPKPNYIRLYQIRDYGESPVPVY 354 Query: 277 ---ELNRHKLQDGDLLFTRYNGSLEFV--------GVCGLLKKLQHQNLLYPDKLIRARL 325 L + + GD+L RY GSL V V + + +NL+Y + L Sbjct: 355 IPINLASKQTEKGDILLARYGGSLGKVFHAKQGAYNVAMVKVIFKFENLIYKEYAYYYYL 414 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + + EI + + Q G + D LPP+ EQ IV+++ Sbjct: 415 SDLYQGKLKEI----------------SRTAQTGFNITDFNDMYFPLPPINEQQRIVQKI 458 Query: 386 EQLFAYADTIEKQV 399 E+LF+ D I+K + Sbjct: 459 EELFSSLDNIQKSL 472 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 60/216 (27%), Positives = 92/216 (42%), Gaps = 13/216 (6%) Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL--ECSE 275 P V+ KL E + + +G P +S +G P L IS V G ++ + RF+ E E Sbjct: 72 PSSWVWCKL--EDYVKSVTDGDHQAPPKSDIGIPFLVISDVAKGKLNFLNTRFVPQEYYE 129 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE 335 K + GDLLFT GS V + K Q + LI+ T EY+ Sbjct: 130 KISFDRKPEKGDLLFT-VTGSYGIVVPVNIDCKFCFQRHI---GLIKTLNTS----EYLL 181 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 S S + T + QK + + ++S ++ +PP EQ IV +E+ F+ + I Sbjct: 182 HLLKS-SYFKGQCDEFATGTAQKTVGLETLRSFLLPIPPFAEQQRIVIEIEKWFSLIELI 240 Query: 396 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 E ++ + IL A G+L Q E P Sbjct: 241 EGGKDDLQTTIKQAKSKILDLAIHGKLVPQDPNEEP 276 >UniRef50_A6C679 Type I restriction-modification system, S subunit n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C679_9PLAN Length = 450 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 81/372 (21%), Positives = 154/372 (41%), Gaps = 25/372 (6%) Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 +++ P D+V M + G + E V RP+ + FI H ++ Sbjct: 79 KRVVPNDLVYNMMRAWQGGFGT-----VKVEGMVSPAYVVARPKIDFQTQFIEHLFRTPQ 133 Query: 131 YRNKISSLSAGANINNIK--PASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 ++ S G ++ F + + +P +EQ+ I + +D +++D+ A + Sbjct: 134 AIEQMRRYSHGVTDFRLRLYWDKFKNVRVALPDKSEQQEICDYIDVETSKIDALVAEQRR 193 Query: 189 IPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGL 239 + ++LK RQAV+ AV L +W P+H L + + G Sbjct: 194 LIELLKEKRQAVISHAVTKGLNPNAPMKDSGIEWLGDVPEHWEVCSLRRYAFFVDGDRG- 252 Query: 240 SSKPNESGV-GHPILRISS--VRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNG 295 S PNE+ + IL +SS + G +D + +F+ + + LNR K QDGDL+ + G Sbjct: 253 SEYPNENDLTSDGILFLSSKNIVGGKLDLKESKFISHEKFDALNRGKAQDGDLI-VKVRG 311 Query: 296 SLEFVGVCGLLK--KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 S +G L + +++ R P+Y+ S ++ Sbjct: 312 
STGRIGEMALFDVGAYSFETAFINAQMMIIRTGNKLTPKYLSKV-SQSIYWMEQLSVGAY 370 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + Q+ +S K V +PPV EQAEI ++ D++E + A+ + ++ Sbjct: 371 GTAQQQLSNKVFSDLFVTMPPVTEQAEIADFIDLKVGEFDSLETEAEQAIELLQERRTAL 430 Query: 414 LAKAFRGELTAQ 425 ++ A G++ + Sbjct: 431 ISAAVTGKINVR 442 Score = 57.8 bits (138), Expect = 9e-07, Method: Compositional matrix adjust. Identities = 50/214 (23%), Positives = 94/214 (43%), Gaps = 11/214 (5%) Query: 4 GKLPEGWVIAPVSTVTTLI---RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G +PE W + + + RG Y E N L D + + + NI GK D + Sbjct: 229 GDVPEHWEVCSLRRYAFFVDGDRGSEYPNE---NDLTSDGILFLSSKNIVGGKLDLKESK 285 Query: 61 FVPKNLVKESQKISPED-IVIAMSSGSKSVVGKSAHQHL---PFECSF-GAFCGVLRPEK 115 F+ + +D +I GS +G+ A + FE +F A ++R Sbjct: 286 FISHEKFDALNRGKAQDGDLIVKVRGSTGRIGEMALFDVGAYSFETAFINAQMMIIRTGN 345 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + +++ ++S + ++S + G + F + + +PP+ EQ IA+ +D Sbjct: 346 KLTPKYLSKVSQSIYWMEQLSVGAYGTAQQQLSNKVFSDLFVTMPPVTEQAEIADFIDLK 405 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + + DS + EQ ++L+ R A++ AV GK+ Sbjct: 406 VGEFDSLETEAEQAIELLQERRTALISAAVTGKI 439 >UniRef50_Q12YI6 Restriction modification system DNA specificity subunit n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YI6_METBU Length = 511 Score = 67.0 bits (162), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 42/106 (39%), Positives = 57/106 (53%), Gaps = 4/106 (3%) Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 K +SGK + +PP+ EQ IV ++EQLF+ D + A ++ QS+L KA Sbjct: 144 KELSGKAFAELPLCVPPLPEQRAIVSKIEQLFSELDNGIANLKLAQQQLKVYRQSVLKKA 203 Query: 418 FRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 F GELT QWR + DL A ALLE+I+ ER S +K K Sbjct: 204 FEGELTRQWREQQTDL----PDAKALLEQIQVEREESYNEKLDEWK 245 Score = 57.0 bits (136), Expect = 1e-06, Method: Compositional matrix adjust. 
Identities = 53/217 (24%), Positives = 96/217 (44%), Gaps = 20/217 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRAN-------NIQNGKFDTT 57 KL + WV +S ++ G T K + Y +D L + A+ I G+ T Sbjct: 10 KLGDDWVKGVLSDFGQVVSGGT-PKTKVPEYWGEDILWITPADLSGYSEKYIYKGRKSIT 68 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 L L S ++ P+ V+ S + + ++ C+ F L P + + Sbjct: 69 HL-----GLKNSSARLIPKGSVLFSSRAPIGYIAIAGNEL----CTNQGF-KTLIPSEAL 118 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ ++ KS + ++G + +F + + +PPL EQ+ I K++ L + Sbjct: 119 NRDFLYYYLKS--IKQLAEGRASGTTFKELSGKAFAELPLCVPPLPEQRAIVSKIEQLFS 176 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 ++D+ A + Q LK +RQ+VL A G+LT +WR Sbjct: 177 ELDNGIANLKLAQQQLKVYRQSVLKKAFEGELTRQWR 213 Score = 50.8 bits (120), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 34/98 (34%), Positives = 49/98 (50%) Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +S K ++ L ++EQ IV +E + D +E+ + + L L QSIL KAF Sbjct: 411 VSTKYLQEYPFPLFSLEEQQAIVTEIETRLSVCDKVEQDIEDNLKIAEALRQSILKKAFE 470 Query: 420 GELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK 457 G+L + E A LLEKI+AE+A SG K Sbjct: 471 GKLLNERELEEVRSAPDWEPAEVLLEKIRAEKAGSGKK 508 >UniRef50_B5IRS1 Type I restriction modification DNA specificity domain protein n=1 Tax=Thermococcus barophilus MP RepID=B5IRS1_9EURY Length = 408 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 58/230 (25%), Positives = 107/230 (46%), Gaps = 30/230 (13%) Query: 164 EQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVN---------GKLTEKWR 214 EQK IAE L T+ ++ T E+ ++ K Q +L + G++ E+WR Sbjct: 156 EQKQIAEILRTVDEAIEKTDLAIEKTERLKKGLMQRLLTKGIKHKRFKKTEIGEIPEEWR 215 Query: 215 NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS 274 + + + GLS K ++ G +PI+++ S+ G V +I++++ Sbjct: 216 V----------VRIGEVTGLFQYGLSIKMHDKG-KYPIIKMDSIINGEVKPVNIKYVDLD 264 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEY 333 E +++L+ GD+L R N S E VG G+ + + ++ LIR R K + P + Sbjct: 265 EDTFKKYRLEKGDILINRTN-SYELVGRTGVF--MLDGDYVFASYLIRIRPDKKQIDPRF 321 Query: 334 IEIF--FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 + + F++ R V Q I+ ++K + LPP++EQ +I Sbjct: 322 LTFYLIFANDKLRQLATRAV----SQANINASNLKKFKIPLPPLEEQKQI 367 Score = 57.4 bits (137), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 57/208 (27%), Positives = 102/208 (49%), Gaps = 21/208 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + + VT L + G++ K Y P+I+ ++I NG+ ++ +V Sbjct: 208 GEIPEEWRVVRIGEVTGLFQYGLSIKMHDKGKY------PIIKMDSIINGEVKPVNIKYV 261 Query: 63 PKNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-- 117 +L +++ K + DI+I ++ S +VG++ L + F ++ +RP+K Sbjct: 262 --DLDEDTFKKYRLEKGDILINRTN-SYELVGRTGVFMLDGDYVFASYLIRIRPDKKQID 318 Query: 118 --FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 F F F L R + + ANIN F IP+PPL EQK IAE L T+ Sbjct: 319 PRFLTFYLIFANDKL-RQLATRAVSQANINASNLKKF---KIPLPPLEEQKQIAEILMTV 374 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGG 203 +++ + R E++ +I + + +L G Sbjct: 375 DKKLELLRKRKEKLERIKRGLMKDLLTG 402 >UniRef50_B1XQR8 Type 1 restriction-modification system specificity subunit n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XQR8_SYNP2 Length = 398 Score = 66.6 bits (161), Expect = 2e-09, Method: Compositional matrix adjust. 
Identities = 82/364 (22%), Positives = 163/364 (44%), Gaps = 34/364 (9%) Query: 67 VKESQKISPEDIVIA--MSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 +K+S+ + P D +++ MS G ++ S C + + L ++ + Sbjct: 61 IKKSRFVEPGDFLLSNSMSFGRPYIMRTSG-------CIHDGWLVLKDKSGLFDQDYLYY 113 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 F S + L+AG+ + N+ + +P+PP+AEQK I E LD + ++ +A Sbjct: 114 FLGSQAAYKQFDKLAAGSTVRNLNTTLVKKVLVPVPPIAEQKRIVEILDESFSGIERAEA 173 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK-- 242 Q L R+ + L + + +F + + + LN +T+L K Sbjct: 174 IAR---QNLTNARE-----LFDSYLNKIFLDFVERKNT-QTLN---CITDLIVDCEHKTA 221 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFV 300 P + G P +R ++ GH+ +++ + E + R K Q GDL+ R + Sbjct: 222 PTQE-TGFPSIRTPNIGKGHLILDNVYRVSEETYKQWTRRAKPQSGDLILAREAPA---- 276 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G G++ + + L LIR + ++ P+Y+ F P + +++ + Q + Sbjct: 277 GNVGVIPEGERVCLGQRTVLIRPK--ENINPQYLAFFLLHPKMQERLLSKSSGATVQH-V 333 Query: 361 SGKDIKS-QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 + KDI++ ++ LPP++ Q ++ + + + +E+ + + L QSIL KAF Sbjct: 334 NMKDIRALKMGDLPPIEIQDRLIESLLDVQEKSKKLEEVYQRKIEALGKLKQSILQKAFS 393 Query: 420 GELT 423 G+LT Sbjct: 394 GQLT 397 >UniRef50_Q1Q456 Putative uncharacterized protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q456_9BACT Length = 137 Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 5/136 (3%) Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 L + D++ R G+ +G L+K + ++L + LIR +K+ PEY++ F SP Sbjct: 4 LSENDIVIARTGGT---IGKSFLIKDIPVRSL-FASYLIRVIPSKNIFPEYLKYFLESPE 59 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + + +GQ ++G + + +V L P+ EQ IV RV++L A +EKQV+ Sbjct: 60 YWEQLYDAA-WGAGQPNVNGTSLSNLIVSLSPLAEQQAIVERVDKLMAMIGELEKQVSER 118 Query: 403 LARVNNLTQSILAKAF 418 + L QS+L +AF Sbjct: 119 KEQSEMLMQSVLREAF 134 Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 36/130 (27%), Positives = 67/130 (51%), Gaps = 4/130 (3%) Query: 73 ISPEDIVIAMSSGSKSVVGKS-AHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 +S DIVIA + G+ +GKS + +P F ++ + P K IF ++ +F +S Y Sbjct: 4 LSENDIVIARTGGT---IGKSFLIKDIPVRSLFASYLIRVIPSKNIFPEYLKYFLESPEY 60 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 ++ + GA N+ S + + + PLAEQ+ I E++D L+A + + + + + Sbjct: 61 WEQLYDAAWGAGQPNVNGTSLSNLIVSLSPLAEQQAIVERVDKLMAMIGELEKQVSERKE 120 Query: 192 ILKRFRQAVL 201 + Q+VL Sbjct: 121 QSEMLMQSVL 130 >UniRef50_C2I227 Restriction modification system DNA specificity domain n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I227_VIBCH Length = 434 Score = 66.2 bits (160), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 74/316 (23%), Positives = 140/316 (44%), Gaps = 27/316 (8%) Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIP--PLAEQKIIAEKLDTLLAQVDSTKARF 186 +LY I + G N+ DL+ IP+P ++ QK ++ LD ++DS Sbjct: 124 ALYLTNIYNKLGGGVRQNLTAG--DLLEIPVPLIDISLQKQVSAFLDRETQRIDSLIEEK 181 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRN 237 + +LK RQA++ V L +W P+H V KK+ ++ + E Sbjct: 182 QTFITLLKEKRQALISHVVTKGLNPNVEMQDSGIEWIGQVPKHWVVKKIKYDVLGIE--Q 239 Query: 238 GLSSK------PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 G S + P++ G ++++ V G + + L + ++ GDLL + Sbjct: 240 GWSPQCESTPVPDDHTWG--VVKVGCVNRGIFNPEQNKKLPEELEPRKEYAIKKGDLLVS 297 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAM-MN 349 R N E+VG + + + NLL DK+ R +L + A PE+ + +S AR + ++ Sbjct: 298 RANAK-EWVGSAAVPDR-DYDNLLLCDKIYRIKLDLEKADPEFFAYYLASDQAREQIEID 355 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 T+S I I + + P + EQ IVR ++ + D + +V +++ + Sbjct: 356 ATGTSSSMLNIGQGTILNMPIPAPELPEQQSIVRGIKNKTSQIDRLMLEVLDSIELLKEH 415 Query: 410 TQSILAKAFRGELTAQ 425 S+++ A G++ + Sbjct: 416 RTSLISAAVTGKIDVR 431 >UniRef50_C6DAR8 Restriction modification system DNA specificity domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DAR8_PECCP Length = 390 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 66/273 (24%), Positives = 127/273 (46%), Gaps = 19/273 (6%) Query: 108 CGVLRPEK-LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQK 166 VL+P+ +I S F+ + +S L + + + + G I ++I +P + QK Sbjct: 98 VAVLKPKHGIITSRFLMYTLRSML--DVLLAGARGVAQQGIYLKQLHDLDIKVPSVEIQK 155 Query: 167 IIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 I LD + S + EQ ++ F +A G +NF P ++ Sbjct: 156 HIVNVLD----KASSLCRKREQGIKLADEFLRATFSNMF-GNPDNNIKNF-PIGTI---- 205 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQD 285 +++ GLSSK ++ +P+LR+ ++ G D D+++++ E + L+ Sbjct: 206 --RDLVSSASYGLSSKTSKHSGKYPVLRMGNITYQGDWDLIDLKYIDLDEKAQEKFLLEK 263 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 GDLLF R N S E VG + + +++ + LIR R + YI + +S +N Sbjct: 264 GDLLFNRTN-SKELVGKTAIFE--NDRDMAFAGYLIRVRTNEIGNNYYIAGYLNSLHGKN 320 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQ 378 ++N K+ G I+ +++++ +L+PP + Q Sbjct: 321 TLINMSKSIVGMANINAQEMQNIKILIPPKELQ 353 >UniRef50_B3QN66 Restriction modification system DNA specificity domain n=1 Tax=Chlorobaculum parvum NCIB 8327 RepID=B3QN66_CHLP8 Length = 578 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 80/386 (20%), Positives = 154/386 (39%), Gaps = 88/386 (22%) Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + S F+ K + + ++SL G + + + P+PPL EQ I ++D L+ Sbjct: 196 LHSDFLKWLLKRPAFLSYVNSLMYGVKMPRLGTDNAVASIHPLPPLPEQHRIVARIDELM 255 Query: 177 AQVDST---KARFEQ---------IPQILK-----------------------------R 195 A D +A EQ + Q+L Sbjct: 256 AHCDELEKLRAEREQKRVKVHAAAVRQLLDTTEPESSANAWQFISRNFRELYSDKENVAE 315 Query: 196 FRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE-------------------------- 229 R+A+L AV GKL + N P + K++ E Sbjct: 316 LRKAILQLAVMGKLVPQDPNDPPACELLKEIEAEKQRLVKEGKIKKPKAVSPIKPDEVPY 375 Query: 230 ------------SILTELRNGLSSKPNESGVGHP----ILRISSVRAGHVDQNDIRFLEC 273 +++ + G S K E+G +L+ ++V+ ++ + L Sbjct: 376 PLPDSWEWVRLGDVISYMDAGWSPK-CETGPASDSEWGVLKTTAVQKLEFLPHENKTLPI 434 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PE 332 + +++++ D+L TR G VG+C + ++ + L+ DK+IR ++ D + P+ Sbjct: 435 KLTPRPEYQVEEKDILITR-AGPKNRVGICCVATSIRPK-LMLSDKIIRFKIYGDLISPD 492 Query: 333 YIEIFFSSPSARNAM-MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 Y + ++ + M Q IS +K ++L+PP+ EQ IV R++QL A Sbjct: 493 YCALSLNTGYCSEQIEMFKSGMAESQMNISQDKVKRLLMLIPPLPEQHRIVARIDQLMAL 552 Query: 392 ADTIEKQVNNALARVNNLTQSILAKA 417 DT+E+Q+++A + L +++ + Sbjct: 553 CDTLEQQIDDATRKQTELLNAVMTQV 578 >UniRef50_A6EUA9 Type I restriction-modification system, S subunit n=1 Tax=unidentified eubacterium SCB49 RepID=A6EUA9_9BACT Length = 438 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 96/434 (22%), Positives = 176/434 (40%), Gaps = 30/434 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + ++ + G T K + Y D +P + + + G Sbjct: 16 GEIPEHWSSVSLKWISKIYSGGTPSKNKP-EYWSDGTIPWLNSGTVNQGDITEPSEYITE 74 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + L S K PE ++ +G G A FE + G++ P + ++ Sbjct: 75 EALANSSAKWIPEKAILIALAGQGKTKGMVAQTQ--FEATCNQSLGIIVPSYPELNRYLL 132 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + + Y+N I +L G + I I P+P EQ I LD ++D Sbjct: 133 FWLRKN-YQN-IRNLGGGDKRDGINLEMIGSIPTPLPTKKEQTAITNYLDKKTTEIDQLI 190 Query: 184 ARFEQIPQILKRFRQAVLGGAV------NGKLTE---KWRNFEPQHSVFKKLNFESILTE 234 + E++ Q+ + + A++ AV + KL +W P+ +L + L Sbjct: 191 SEKEELVQLYQEEKTALINQAVTKGIKPDAKLKNSGIEWLGEIPEDWNSLRLKY---LGN 247 Query: 235 LRNGLSSKPNE-SGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFT 291 NG S K + G +L+IS+++ +D +D F+ E +++ LQ+ DL+F Sbjct: 248 FINGYSFKSTDFKSSGVRVLKISNIQHMAIDWSDESFIDEEFYDTKSGFRVLQN-DLVFA 306 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + G+ L + LL I TK + ++I S + Sbjct: 307 LTRPIIS-TGIKVALMNFDEKILLNQRNSIFRPKTK--MTKWIYFILLSSRFVQEFDKRI 363 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 T Q IS DI + +P +EQ +IV +E+ A DT ++ A +N LT+ Sbjct: 364 DKTGQQPNISSNDIGEISIPVPTKEEQTKIVEHIEKETAKIDT---KIAKAEKYINLLTE 420 Query: 412 ---SILAKAFRGEL 422 S++++ G++ Sbjct: 421 YRTSLISEVVTGKI 434 >UniRef50_A6Y5S9 Restriction endonuclease S subunit n=1 Tax=Vibrio cholerae RC385 RepID=A6Y5S9_VIBCH Length = 437 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 74/267 (27%), Positives = 114/267 (42%), Gaps = 35/267 (13%) Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 N I A ++IKP + +P+PPL EQ+ I LD ++DS A ++ Sbjct: 141 NGILQHRAAIKWDDIKPQA-----VPVPPLEEQRAILYFLDRETQRIDSLIAEKLTFIKL 195 Query: 193 LKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 LK RQA++ V L +W P+H K+ + L + +NG++ Sbjct: 196 LKEKRQALISHIVTKGLNPNVEMQDSGIEWIGQVPKHWGISKVRY---LGQCQNGINIGG 252 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLEFVGV 302 G G P + V ++ L S E + + + + GD+LFTR + ++E +G Sbjct: 253 EFFGHGTPFVSYGDVYNNTSLPEKVQGLVLSTEKDRDNYSVIAGDVLFTRTSETIEEIGF 312 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM--------MNCVKTT 354 + K Q ++ LIR R + L E+ FS RN MN V Sbjct: 313 SAVCKSTIEQ-AVFAGFLIRFRPDEGNL----EVGFSEYYFRNEKLRAFFAKEMNLVTRA 367 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEI 381 S +S +K VLLPP+ EQ EI Sbjct: 368 S----LSQDLLKKMPVLLPPIDEQNEI 390 >UniRef50_B3R3C2 Type I restriction-modification methylase S subunit n=1 Tax=Cupriavidus taiwanensis RepID=B3R3C2_CUPTR Length = 458 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 64/241 (26%), Positives = 107/241 (44%), Gaps = 19/241 (7%) Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---- 212 I PPL+EQ I L + +++D+ + +++ +L RQA + V L K Sbjct: 170 IAFPPLSEQNAIVTFLYSETSKIDTLISEQDKLLVLLAEKRQATISRIVTRGLEPKVQIK 229 Query: 213 -----WRNFEPQHSVFKKLNFESILTELRNGLSSK----PNESGVGHPILRISSVRAGHV 263 W P H K++ + + + + G S + P E +L++ V G Sbjct: 230 SVGADWLGEIPIHWQAKRVKW--LTSSIEQGWSPQCENYPAEGENEWGVLKVGCVNGGVF 287 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 D + + L + L+ GDLL +R N + E VG ++ K H+ LL DKL R Sbjct: 288 DAAENKKLPPELEPFPEYSLRKGDLLISRAN-TRELVGSAAVVPKDFHR-LLLCDKLYRL 345 Query: 324 RLTK-DALPEYIEIFFSSPSARNAM-MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 RL + PE++ + ++ AR + + +S I I +V LPP +EQA I Sbjct: 346 RLDQAKCTPEFLAAYLATGEARGQIELGATGASSSMLNIGQSVIMDLLVPLPPAEEQAAI 405 Query: 382 V 382 + Sbjct: 406 M 406 >UniRef50_C6CR26 Restriction modification system DNA specificity domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CR26_DICZE Length = 462 Score = 65.9 bits (159), Expect = 3e-09, Method: Compositional matrix adjust. 
Identities = 86/446 (19%), Positives = 182/446 (40%), Gaps = 36/446 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + ++ G T K Y ++ +P + + ++ +G Sbjct: 19 GQVPVHWNAVSLKWISQRYSGGTPDKSNDA-YWENGDIPWLNSGSVNDGYITEPSTYITR 77 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + S K P++ ++ +G G A + C+ + ++ EK F+ Sbjct: 78 EGFASSSAKWVPKNALVMALAGQGKTKGMVAQLGIRATCN-QSMAAIIPKEK--FTPRFL 134 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ S Y+N I +++ G + + I P+ P EQ IA+ LD ++DS Sbjct: 135 YWWLVSNYQN-IRNMAGGEQRDGLNLDMLGSIPCPLLPRPEQTAIADFLDRETGRIDSLM 193 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE-------------------KWRNFEPQHSVFK 224 A+ Q+ +LK R A++ V L E +W P+ K Sbjct: 194 AKKRQLIALLKEKRCALISHIVTRGLPEAAADEFGLKPHTRFKNSDIEWLGQVPEGWGVK 253 Query: 225 KLNFESIL--TELRNGLSSKPNES-----GVGHPILRISSVRAGHVDQNDIRFLECSESE 277 K+ E + EL++G + + G G P + + + G +D N ++E +++ Sbjct: 254 KVWIERVSRNIELQDGNHGEQHPKAEDYVGEGIPFVMANHIDNGKIDFNKCNYIEKEQAD 313 Query: 278 LNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 R +GD+L T + G+ +G G+++K ++ ++ R ++ ++ Sbjct: 314 SLRIGFSNEGDVLLT-HKGT---IGRVGIVQKSHFPYVMLTPQVTYYRCLREIQNRFLFW 369 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 S ++ + S + I D K+ L+P KEQ I +++ + D + Sbjct: 370 LMQSKFWQDQLKLLAGLGSTRAYIGLLDQKTLSFLIPSEKEQFAIATYLDRETSKLDRLV 429 Query: 397 KQVNNALARVNNLTQSILAKAFRGEL 422 ++V+ +AR+ +++ A G++ Sbjct: 430 EKVDAVIARLQEYRTALITAAVTGKI 455 >UniRef50_B7K558 Restriction modification system DNA specificity domain protein n=2 Tax=Bacteria RepID=B7K558_CYAP8 Length = 453 Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 104/466 (22%), Positives = 193/466 (41%), Gaps = 80/466 (17%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P+GW + + + + I K A Y D + +R+ NI D+V++ Sbjct: 24 GDIPDGWEVKRLKWIVSKIGSGKTPKGGAEIY-SDSGIIFLRSQNIHFDGLRLDDVVYIN 82 Query: 64 KNLVK--ESQKISPEDIVIAMSSGS--------KSVVGKSAHQHLPFECSFGAFCGVLRP 113 K++ K S ++ P DI++ ++ S K + +QH+ C +LRP Sbjct: 83 KDIDKAMSSSRVKPLDILLNITGASLGRCMIIPKDFPSSNVNQHV---------C-ILRP 132 Query: 114 -EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPA-SFDLINIPIPPLAEQKIIAEK 171 I F+ S+ +N+I S G + + A + +LI++ P L EQ+ IA+ Sbjct: 133 IVTRINPYFLNRVMSSNAIQNQIFSSEVGVSREGLTFAQAGNLISV-FPSLPEQEKIAQF 191 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSV 222 LD A++D +++ ++LK R A++ AV L +W F P+H Sbjct: 192 LDEETAKIDKLITHKQRLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFIPEHWE 251 Query: 223 FKKLNFESILTELRNGLSSKPNESGVGHPIL-----RISSVRAGHVDQNDIRFLECSE-- 275 KK+ L+ ++ G S +P + PI VR V ++ LE + Sbjct: 252 VKKIKR---LSLVKRGASPRP----IDDPIYFDDNGEYVWVRISDVTASNKYLLEAEQKL 304 Query: 276 SELNRHK---LQDGDLLFTRYNGSLEFVGVCG-----LLKKLQ---HQNLLYPDKLIRAR 324 SE+ + K LQ +L F+ +C ++ K++ H +Y +L R Sbjct: 305 SEIGKRKSVPLQPNEL----------FLSICASVGKPIITKIKCCIHDGFVYFPELKENR 354 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 EY+ F + + Q ++ + I + +PPV EQ +I Sbjct: 355 -------EYLYYIFLG----GELYKGLGKMGTQLNLNTEIIGDVKLPIPPVSEQQKIAEY 403 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA-QWRAE 429 +++ D I K+ ++ + ++++ A G++ QW E Sbjct: 404 LDEKTEQIDPIIKKTRESIEYLKEYRTALISAAVTGKIDVRQWGCE 449 >UniRef50_A9CZ30 Restriction endonuclease S subunit n=1 Tax=Shewanella benthica KT99 RepID=A9CZ30_9GAMM Length = 601 Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust. 
Identities = 45/171 (26%), Positives = 82/171 (47%), Gaps = 6/171 (3%) Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR----YNGSLEFVGVCG 304 G I +I ++ H QN + E ++ L DGD++ T + + VG Sbjct: 138 GFMITKIQNLTDNHT-QNSVYIAPAKAMESKQYLLSDGDIVMTTVGSWFTAPISAVGRSF 196 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 L+ KL +LL + +R K+ P Y+ I +SP +N ++ + T+ Q I+ Sbjct: 197 LISKLFDNSLLNQNA-VRISSVKEFDPMYLYICVNSPIFKNYLVKEAQGTANQASITQAS 255 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 IK ++ +PP+ EQ IV + ++L D +E+Q +L+ L + +L+ Sbjct: 256 IKHFLICVPPLAEQHRIVAKADELMTLCDQLEQQTEESLSAHQTLVEVLLS 306 Score = 43.1 bits (100), Expect = 0.025, Method: Compositional matrix adjust. Identities = 45/202 (22%), Positives = 81/202 (40%), Gaps = 41/202 (20%) Query: 5 KLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +LP GW + +T+ L G T + Q+ Y+ D + +R+ N+ N D ++ Sbjct: 401 ELPSGWEFERLGNLTSRLGSGSTPRGGQS-AYV-DKGIIFLRSQNVWNDGLKLDDTAYIT 458 Query: 64 KNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + P D+++ ++ S +G+S ++ PEKL+ + Sbjct: 459 DETHDKMVNTHVFPNDVLLNITGAS---LGRS----------------IIFPEKLVTANV 499 Query: 122 IAHFT-----------------KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAE 164 H T S L + + G I + + P+PPLAE Sbjct: 500 SQHVTIIRLLEVSMCKFLHLGIMSPLVQKLVWGRQVGMAIEGLSKKVLEQFEFPVPPLAE 559 Query: 165 QKIIAEKLDTLLAQVDSTKARF 186 Q+ I K+D L+A + KAR Sbjct: 560 QQRIVAKVDELMALCEQLKARL 581 >UniRef50_UPI0001AF6F3B polypeptide HsdS n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF6F3B Length = 409 Score = 65.5 bits (158), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 82/388 (21%), Positives = 169/388 (43%), Gaps = 26/388 (6%) Query: 41 LPLIRANNIQNGKFDTTDLVFVP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP 99 +P +R N+Q G+ DT DL+ + + +E +S D+++ +G+SA H Sbjct: 31 VPYLRNVNVQWGRVDTDDLLTMELADDERERFGVSAGDLLVC----EGGEIGRSAIWH-- 84 Query: 100 FECSFGAFCGVL---RPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN 156 + + A+ L RP K + F+ + + ++ L+ G+ I ++ + Sbjct: 85 GQADYIAYQKALHRIRPGKSLDVRFLRYLLEHYSLNGTLAGLATGSTIAHLPQQQLRRVP 144 Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF 216 +P+PPL EQ I + ++ L+++++ + + L+ F A L + + ++R Sbjct: 145 VPVPPLNEQCRIVDLIEDHLSRLEAGQRWLSVGERKLEAFWLAALSASRRALVGAQFRTI 204 Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES 276 + T L L +K + G P LR +VR G D +D++ ++S Sbjct: 205 G-----------DVAETTLGKMLDAK-RQVGSPTPYLRNINVRWGEFDLSDVQLTPLTDS 252 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 E+ R ++ GD++ C + ++ Q L+ IR R + L ++ + Sbjct: 253 EVQRFDVRPGDVMACEGGEPGRCAVWCRPVGEVAFQKALH---RIRVRNPGEVLTSFLAL 309 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + R+ N + T + K + + ++ + +P + Q + V + +L + + Sbjct: 310 MLEE-AIRSGRCNRMFTGTTIKHLPQEKLRVIEIPVPALHTQRQAVDCLAELVGAQERLR 368 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTA 424 + NA AR+ + S+L AF G L A Sbjct: 369 AALANAAARIAAMRSSLLTAAFSGRLIA 396 >UniRef50_A8ZVS3 Restriction modification system DNA specificity domain n=2 Tax=Proteobacteria RepID=A8ZVS3_DESOH Length = 577 Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 101/500 (20%), Positives = 194/500 (38%), Gaps = 110/500 (22%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 K+P GW + + V ++ G YK + + + PL+R N+ T+D+ + Sbjct: 101 KIPSGWNVTRLGEVLNVLNGRAYKNHEMLQ----EGTPLLRVGNLF-----TSDIWYYSD 151 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSA----HQHLPFECSFGAFCGVLRPEKLIFSG 120 ++ + I D++ A S+ + + H H+ F C ++ Sbjct: 152 LALEPEKYIDNGDLIYAWSASFGPFIWQGGKVIYHYHIWKLDLFDESC--------LYKN 203 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ H+ + KI + +G + ++ A + + I +PPLAEQ I K+D L+A D Sbjct: 204 FLYHYLAA--VTEKIKASGSGIAMIHMTKARMEKLVIMVPPLAEQHRIVTKVDELMALCD 261 Query: 181 -------------------------------STKARFEQIP----------QILKRFRQA 199 + ++QI Q + +Q Sbjct: 262 RLEQEQSQSIETHQTLVKTLLAALTTAGDAKACAQTWQQIADHFEILFTTEQSIDHLKQT 321 Query: 200 VLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI----LTELRNGL-------SSKPNESGV 248 +L AV GKL + N EP + +K++ E +++N KP + Sbjct: 322 ILQLAVMGKLVPQDPNDEPASVLLEKIDKEKARLIKAGKIKNQTPLPKITEDEKPFDLPE 381 Query: 249 GHPILRISSVRAGHVD------------QNDIRFLECSESEL-NRHKLQ--------DGD 287 G +R + + ++ +N + F+ + +L N KL D Sbjct: 382 GWEWVRFNQLIEPNIPISYGVLVPGPDVENGVPFVRIGDLDLINPPKLPEKSIDKEIDRQ 441 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK---------LIRARLTKDALPEYIEIFF 338 TR G +GV G + KL + PD + R T+ L +Y+ Sbjct: 442 YERTRLLGGEILMGVVGSIGKLG----VAPDSWRGANIARAICRIAPTRLILKQYLIWLL 497 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 + ++ + +T + Q ++ I++ LPP+ EQ IV +V++L A DT++++ Sbjct: 498 QTDLMQSGFIGATRTLA-QPTLNVGLIRAAATPLPPLAEQHRIVAKVDKLMALCDTLKER 556 Query: 399 VNNALARVNNLTQSILAKAF 418 ++ A L+ +I+ +A Sbjct: 557 LHQAQTIQTQLSDAIVGQAL 576 >UniRef50_B0RYC3 Type I site-specific deoxyribonuclease (Specificity subunit) n=2 Tax=Xanthomonas campestris pv. campestris RepID=B0RYC3_XANCB Length = 438 Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 75/334 (22%), Positives = 146/334 (43%), Gaps = 30/334 (8%) Query: 110 VLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIP-IPPLAEQKII 168 VLRP + + F+ + T + +R+ + GA+ P F P +P + Q+ I Sbjct: 109 VLRPSATLDTRFLFYLTIAHDFRSHGEAEMLGASGQKRVPEEFLKDWTPSLPRMDVQQRI 168 Query: 169 AEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQ 219 A LD A++D+ + +++ + L+ RQA++ AV L W + P+ Sbjct: 169 ARFLDDKTARIDALIEKKQELLERLEEKRQALITRAVTKGLNPDLPMKPSGVDWLGYVPR 228 Query: 220 HSVFKKLNFESILTELRNGLS-------SKPNESGVGHPILRISSVRAGHVDQNDIRFLE 272 H K L + + G S ++P+E GV L+ V G D+N+ + L Sbjct: 229 HWEVKTLRRH--VQRIEQGWSPQTERRMAEPDEWGV----LKSGCVNLGIYDENEQKALP 282 Query: 273 CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE 332 + +++ D+L R +GS++++G L+++ + + L++ DK R L+ Sbjct: 283 GTLDPKPELEVRANDVLMCRASGSMQYIGSVALVERTRTK-LMFSDKTYRISLSSANTDR 341 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV----LLPPVKEQAEIVRRVEQL 388 E F SA++ + SG +G++ +S V+ PP+ EQ +I + + Sbjct: 342 --EYFVRMMSAKHLREQIRLSVSGAEGLANNIPQSNVLEYLHAFPPLLEQVQIADFLRES 399 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 D E ++ + +++ A G+L Sbjct: 400 IGDLDEAEGKIRASSESWRAYRLALVTAAVTGQL 433 >UniRef50_A5GB19 Restriction modification system DNA specificity domain n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GB19_GEOUR Length = 420 Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 107/431 (24%), Positives = 178/431 (41%), Gaps = 45/431 (10%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + +S G T + A +Y + +P +++ + T+ + + Sbjct: 3 WPMVEISRFCQTGSGGTPSRNNAGDYYGGN-IPWVKSGELNQEFVLNTEERITELAIKES 61 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 S KI P ++ G+ VGKSA + + A C ++ + + ++ + K+ Sbjct: 62 SAKIVPAGAILVAMYGA--TVGKSALLGID-AATNQAICNIIPDPEAADTRYVWYALKNQ 118 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST-KARFE- 187 L + + G NI IP+P L+EQ+ I E LD Q D K R E Sbjct: 119 L--PYLLAQRVGGAQPNISQQIIKNTQIPLPLLSEQRRIVEILD----QADHLRKLRGEA 172 Query: 188 --QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 + IL + GG + W + + K S ++E R +E Sbjct: 173 DKKAELILPALFNKMFGGPATNPM--GWPEMPLRQVIAKVEAGWSAVSEAR---GCTKDE 227 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSES-----ELNRHKL--QDGDLLFTRYNGSLE 298 GV L++S+V +G RFL C + +R L + GDLLF+R N + E Sbjct: 228 FGV----LKVSAVTSG-------RFLACEHKAVLVLQTDRGLLTPRRGDLLFSRAN-TRE 275 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYI-EIFFSSPSARNAMMNCVKTTSG 356 V +++ H NL PDKL R L D A Y+ E+F+++ + ++ Sbjct: 276 LVAASCVVED-DHPNLFLPDKLWRLILHPDRATAMYLKELFWNNGFRDRFRASASGSSGS 334 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 IS + + + + +PP K Q E + L A I K+ A ++ L ++L + Sbjct: 335 MLNISQEAMLNTIAPIPPFKLQEEYSAKAWSLAA----IAKERRLAGDALDTLWSNLLQR 390 Query: 417 AFRGELTAQWR 427 AF G LTA WR Sbjct: 391 AFSGTLTAAWR 401 >UniRef50_Q04LY7 Type I restriction-modification system, S subunit n=48 Tax=Bacteria RepID=Q04LY7_STRP2 Length = 522 Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 75/335 (22%), Positives = 139/335 (41%), Gaps = 68/335 (20%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ + S++ ++ SL +GA + N+ I IP+PPL+EQ+ I E +++ L +VD Sbjct: 195 YLFYILSSNVVYSQFLSLISGAVVKNLNSDKVASILIPLPPLSEQQRIIEAIESALEKVD 254 Query: 181 STKARFEQIPQILKRF----RQAVLGGAVNGKLT------------------EKWRNFE- 217 + ++ Q+ K F ++++L A+ GKL EK + FE Sbjct: 255 EYAESYNRLEQLDKEFPDKLKKSILQYAMQGKLVEQDPNDESVEVLLEKIRAEKQKLFEE 314 Query: 218 -----------------------------PQHSVFKKLNFESILTELRNGLSSKPNESGV 248 P+ + +LN I + ++ G S K + + Sbjct: 315 GKIKKKDLDISIVSQGDDNSYYEEVPCEIPESWEWVRLN--DITSYIQRGKSPKYSNIPI 372 Query: 249 GHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 +P++ + + RF+ E S L+DGDL++ +G G L Sbjct: 373 -YPVIAQKCNQWSGFSIDLARFIDPETVHSYQKERLLRDGDLMWNSTG-----LGTLGRL 426 Query: 307 KKLQHQNLLYPDKLIRARLTKDAL------PEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 Y + + +T + +I F SSP ++ + ++ QK + Sbjct: 427 AIYHENKNPYAWAVADSHVTVIRVLSGVINCHFIYNFLSSPIVQSVIEEKASGSTKQKEL 486 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 K IK ++ LPP+ EQ+ IV ++EQ FA+ D + Sbjct: 487 LTKTIKEYLIPLPPLPEQSRIVDKIEQFFAHIDAL 521 >UniRef50_C2RWF6 N-6 DNA methylase n=1 Tax=Bacillus cereus BDRD-ST24 RepID=C2RWF6_BACCE Length = 1009 Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust. 
Identities = 71/289 (24%), Positives = 130/289 (44%), Gaps = 35/289 (12%) Query: 131 YRNKISSLSAGANINNIKPASFDLI---NIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 Y N I+++ G +N +FD I +P+PP+ Q+ I + ++ Sbjct: 749 YVNSIANIGKGVRMN----LTFDEIGNFELPLPPMEIQEEIVRE--------------YK 790 Query: 188 QIPQILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 +I ++L + + V+ L TE NF P H++ L S+ G S K + Sbjct: 791 KISEVLYGSKAILDNWDVDSTLFTEG--NF-PLHNI-GDLTINSLY-----GSSEKSDYE 841 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G+ I+RI ++ ND++ + + ++L+ GDLL R NG+ + VG C + Sbjct: 842 IDGYDIIRIGNIGYCSFKLNDLKRVPLPLKKFKNYELKKGDLLIVRSNGNPKLVGKCAIW 901 Query: 307 KKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + + N +Y L+R R ++A +PEYI + S ++ + K G + + I Sbjct: 902 QD-EIPNAVYASYLVRFRFNEEAVVPEYIMYYLMSSVGKSYIK--PKAGGGTYNFNAERI 958 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 K + LP + Q I+ RV+ +EK + + R+ +L + L Sbjct: 959 KEIPIPLPDKQTQLSIIERVKSEQETVSRVEKLMIKSEERIKSLLKKYL 1007 >UniRef50_C2QHW5 Putative uncharacterized protein n=2 Tax=Bacillus cereus RepID=C2QHW5_BACCE Length = 441 Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust. 
Identities = 47/208 (22%), Positives = 106/208 (50%), Gaps = 5/208 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W+ + V+ + + +K A Y ++ Y+ + NI+ + D ++ ++ Sbjct: 223 GEMPEHWITKRLDFVSVVKARLGWKGLTASEYQENGYI-FLAIPNIKKFQIDFENVNYIS 281 Query: 64 KNLVKESQKISPE--DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + KES +I + D+++A + V + ++LP + + V+RP+ + S F Sbjct: 282 EKRYKESPEIMLQVGDVLLAKDGSTLGEV--NVVRYLPSPATVNSSIAVIRPKGDLHSVF 339 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ KS+ + I G + ++ + I +PPL EQ IA+ LD ++++++ Sbjct: 340 LYYYLKSNYIQKIIQKKKDGMGVPHLFQKDINKFIIQVPPLDEQVKIAKYLDGKISEINN 399 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++ IL+++RQ+++ V GK+ Sbjct: 400 LIIETQEQIDILQQYRQSLVYEVVTGKI 427 Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust. Identities = 73/370 (19%), Positives = 154/370 (41%), Gaps = 26/370 (7%) Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAHFTKSS 129 +K+ D VI S K G S F+ S C V++P+ + + + H ++ Sbjct: 72 KKVLKNDFVINSRSDRKGSCGVSK-----FDGSVSLICTVIKPKTINTYMDYYHHLFRNK 126 Query: 130 LYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 ++ + G ++ + K F I IPIPP EQK I L+ + ++ + Sbjct: 127 MFSEEFYRWGRGIVDDLWSTKWDEFKRILIPIPPHEEQKSIVSYLNHIYEAIEELITHKQ 186 Query: 188 QIPQILKRFRQAVLGGAVNGKL---------TEKWRNFEPQHSVFKKLNFESILTEL--R 236 Q + +++++++++ AV L + +W P+H + K+L+F S++ Sbjct: 187 QQIETIQQYQRSLITEAVTSGLNPHAKMKDSSVEWIGEMPEHWITKRLDFVSVVKARLGW 246 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNG 295 GL++ + G+ L I +++ +D ++ ++ E E LQ GD+L + Sbjct: 247 KGLTASEYQEN-GYIFLAIPNIKKFQIDFENVNYISEKRYKESPEIMLQVGDVLLAKDGS 305 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 +L V V L N I K L ++ + ++ K Sbjct: 306 TLGEVNVVRYLPSPATVN-----SSIAVIRPKGDLHSVFLYYYLKSNYIQKIIQKKKDGM 360 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 G + KDI ++ +PP+ EQ +I + ++ + + + + + + QS++ Sbjct: 361 
GVPHLFQKDINKFIIQVPPLDEQVKIAKYLDGKISEINNLIIETQEQIDILQQYRQSLVY 420 Query: 416 KAFRGELTAQ 425 + G++ + Sbjct: 421 EVVTGKIDVR 430 >UniRef50_B8K9P9 Restriction modification system DNA specificity domain protein n=1 Tax=Vibrio parahaemolyticus 16 RepID=B8K9P9_VIBPA Length = 594 Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust. Identities = 104/466 (22%), Positives = 182/466 (39%), Gaps = 102/466 (21%) Query: 43 LIRANNIQNGKFDTTDLVFVPKNLVKE----SQKISPEDIVIAMSSGSKSVVGKSAH-QH 97 L+R +IQN D + VP + E S + +DI+IA + G+ +GKS ++ Sbjct: 139 LLRITDIQN---DKVNWGTVPACDITEEKAKSYLLENDDILIARTGGT---IGKSYLVEN 192 Query: 98 LPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINI 157 + + F ++ ++ + +++ F F S LY ++ SAG N+ + + Sbjct: 193 IDLQAVFASYLIRVKRVQAVYAPFTKVFLGSQLYWKQLIENSAGTGQPNVNATALKQLLF 252 Query: 158 PIPPLAEQKIIAEKLDTLLAQVD----STKARFE-------------------------- 187 +PP +QK I K+D L+A D T+A E Sbjct: 253 IVPPFNQQKRIVAKVDELMALCDQLEQQTEASIEAHQVLVTTLLDTLTNSADADELMQNW 312 Query: 188 -----------QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE------S 230 + + + +Q +L AV GKL + N EP + K++ E Sbjct: 313 ERISEHFDTLFTTEESIDQLKQTILQLAVMGKLVSQDPNDEPASELLKRIAEEKAQLVKE 372 Query: 231 ILTELRNGL-----SSKPNESGVGHPILRISSVRA---GHVDQNDIRFLECSESEL---- 278 + + L KP E G R+ V A G+ ++ FLE S + Sbjct: 373 KKIKKQKALPPISEDEKPFELPSGWEWCRVDDVVALKHGYAFKSSY-FLESSGPYVLTTP 431 Query: 279 -------------NRHKLQDG-----------DLLFTRYNGSLEFVGVCGLLKKLQHQNL 314 +R K DG DL+ + +G + + + Sbjct: 432 GNFYETGGFRDRGDRTKYYDGPLEVEFIFEANDLIIPLTEQAPGLLGSAAFIPE-DGRTY 490 Query: 315 LYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL-- 371 L+ +L + DA+ +YI +F+SP R+ + +T +G K QV L Sbjct: 491 LHNQRLAKLTPYHDAVRKDYISWYFNSPYLRSEL---ARTCTGTTVRHSSPTKVQVTLFA 547 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 LPP EQ IV R++ L + ++ ++N + A +LT +I+ +A Sbjct: 548 LPPTNEQKNIVERIDSLLSICQQLKARLNESQATQLHLTDAIVEQA 593 Score = 58.5 bits (140), Expect = 5e-07, Method: Compositional matrix adjust. 
Identities = 39/164 (23%), Positives = 81/164 (49%), Gaps = 7/164 (4%) Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 S+KPN GV +LRI+ ++ V+ + + +E + + L++ D+L R G+ Sbjct: 129 SAKPNSEGVR--LLRITDIQNDKVNWGTVPACDITEEKAKSYLLENDDILIARTGGT--- 183 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 +G L++ + Q ++ LIR + + + ++F S ++ T GQ Sbjct: 184 IGKSYLVENIDLQA-VFASYLIRVKRVQAVYAPFTKVFLGSQLYWKQLIENSAGT-GQPN 241 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 ++ +K + ++PP +Q IV +V++L A D +E+Q ++ Sbjct: 242 VNATALKQLLFIVPPFNQQKRIVAKVDELMALCDQLEQQTEASI 285 Score = 41.6 bits (96), Expect = 0.059, Method: Compositional matrix adjust. Identities = 42/210 (20%), Positives = 79/210 (37%), Gaps = 16/210 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF-DTTDLVFVP 63 +LP GW V V L G +K + Y+ N + G F D D Sbjct: 392 ELPSGWEWCRVDDVVALKHGYAFKSSYFLES-SGPYVLTTPGNFYETGGFRDRGDRTKYY 450 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSA--------HQHLPFECSFGAFCGVLRPEK 115 ++ D++I ++ + ++G +A + H + +R + Sbjct: 451 DGPLEVEFIFEANDLIIPLTEQAPGLLGSAAFIPEDGRTYLHNQRLAKLTPYHDAVRKD- 509 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 +I+ + S R++++ G + + P + +PP EQK I E++D+L Sbjct: 510 -----YISWYFNSPYLRSELARTCTGTTVRHSSPTKVQVTLFALPPTNEQKNIVERIDSL 564 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAV 205 L+ KAR + A++ AV Sbjct: 565 LSICQQLKARLNESQATQLHLTDAIVEQAV 594 >UniRef50_Q3J7Q5 Restriction endonuclease S subunits-like n=2 Tax=Nitrosococcus oceani RepID=Q3J7Q5_NITOC Length = 487 Score = 65.1 bits (157), Expect = 6e-09, Method: Compositional matrix adjust. 
Identities = 106/469 (22%), Positives = 195/469 (41%), Gaps = 57/469 (12%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + P + T G + + A + D + L + +G ++ TD V Sbjct: 43 GEVPSFWEVKPFKWLLTHNEGGVWGDDPA---GEGDTIVLRSTDQTVDGNWNVTDPA-VR 98 Query: 64 KNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE---CSFGAFCGVLRPEKLIF 118 VKE S + D+V+ SSGS +GK+ ++ +G F LR + Sbjct: 99 HLTVKENASAVLEAGDLVVTKSSGSALHIGKTTLVNVDMAKLGYCYGNFMQRLRLGQKYI 158 Query: 119 SGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + + L R +++ LS + + N+ I +P+PP+ EQ IA LD A Sbjct: 159 PKLAWYVMNNDLVRLQLNLLSNSTTGLANLNATLIGEILLPVPPVEEQTQIARFLDHETA 218 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNF 228 ++D+ +++ ++LK RQA++ AV L +W P H + K L Sbjct: 219 RIDALIEEQQRLIELLKEKRQAIISHAVTKGLDPTVPMKDSGVEWLGEVPAHWITKPLKH 278 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 + L ++G +E P+ ++ + G + ++ RF+ S +DGD+ Sbjct: 279 LAELNPKKSGYHGDRDELCSFVPMEKL---KTGVIQLDEERFIADVISGYTY--FEDGDV 333 Query: 289 LFTRYNGSLEFVGVC---GLLKKL----QHQNLL--YPD---KLIRARLTKDALPEYIEI 336 L + E + GL + N+L +PD + RL +D Y+ I Sbjct: 334 LQAKVTPCFENRNIAIADGLTNGVGFGSSEINVLRPFPDVNASFLYYRLQEDG---YMGI 390 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +S M+ G K + G+ I V +P EQ +I ++ A D + Sbjct: 391 CTAS------MIGA----GGLKRVPGEVINGFTVAVPERHEQTQIAHFLDHETARVDKLV 440 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQ-WR----AENPDLISGENSA 440 ++ N + + ++++ A G++ + W+ A +P+L EN A Sbjct: 441 EEANVGIELLKERRSALISAAVTGKIDVRGWQPPASAPSPEL---ENEA 486 >UniRef50_C2GFC3 Restriction modification system DNA specificity subunit n=1 Tax=Corynebacterium glucuronolyticum ATCC 51866 RepID=C2GFC3_9CORY Length = 332 Score = 64.7 bits (156), Expect = 7e-09, Method: Compositional matrix adjust. 
Identities = 79/311 (25%), Positives = 148/311 (47%), Gaps = 39/311 (12%) Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ H+ S R +I S + G NNI + + IP+PPL EQ+ IA T+L + + Sbjct: 52 YLYHYL--SYLRPQIESRAKGVAQNNINLKTLKQLEIPLPPLEEQRRIA----TILEKAN 105 Query: 181 STK---ARFE-QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 S + R E I I+ +F V +L R+ E F KL S L +++ Sbjct: 106 SLRNAPPRTEVHINNIVSQF--------VENRL---LRSNEK----FVKL---SELCDIQ 147 Query: 237 NGLS-SKPNESGVGH--PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 +G++ + + + P L +S+V+ G++D + ++ +E + E+ ++ L GD+L T Sbjct: 148 SGITKGRKTKKALAAKIPYLAVSNVKDGYLDLSKVKEIEVTNEEIEKYALHKGDILLTE- 206 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEI-FFSSPSARNAMMNCV 351 G + +G G L + N L+ + + R RL K A+P + + SS ++ + Sbjct: 207 GGDPDKLGR-GCLWNDEIPNCLHQNHIFRVRLKDKQAIPANVLMAILSSKELKSYFLKSA 265 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 K T+G I+ + + + P+ + E + ++ L + + + ++ L Q Sbjct: 266 KQTTGIASINRTQLSNASI---PILDN-ETIAEIDCLLFMCEKLMATNTSRTLLLDELIQ 321 Query: 412 SILAKAFRGEL 422 S+ A+AF GEL Sbjct: 322 SLSARAFTGEL 332 >UniRef50_UPI0001C15DDF Restriction modification system DNA specificity domain protein n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C15DDF Length = 445 Score = 64.7 bits (156), Expect = 8e-09, Method: Compositional matrix adjust. 
Identities = 101/450 (22%), Positives = 184/450 (40%), Gaps = 75/450 (16%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+PE W + VS I T +Y + + +P + + ++ T Sbjct: 31 GKIPEHWEVRKVSHAFQKIGSGTTPSTNHYDYYEGN-IPWVNTSELREKVITDTSAKLTN 89 Query: 64 KNLVKES--QKISPEDIVIAM---SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 K L+ S P ++IAM + G ++G +A C+ A C + P I Sbjct: 90 KALLDHSVLNLYPPGTLLIAMYGATIGRLGILGITA-------CTNQACCALANPIS-IN 141 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F ++ + RN++ LS+G NI I IP PPL EQ+ IA+ LD A+ Sbjct: 142 AKFAFYWL--WMRRNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQAIAQFLDRETAK 199 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D+ A+ E++ ++LK R A++ AV L +W P++ +L Sbjct: 200 IDTLVAKKERLIELLKEKRTALISHAVTKGLNPDAPMKDSGVEWLGEVPRNWPMIRLKHV 259 Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-----ECSESELNRHKLQ 284 + ++ + L+ KP+ + + H++ R L E ES ++ + Sbjct: 260 APVSSAK--LTQKPD---------NLPYIGLEHIESKTGRLLLDTPVENVESTVS--CFE 306 Query: 285 DGDLLFTRYNGSL------EFVGVCGL----LKKLQHQNLLYPDKLIRARLTKDALPEYI 334 GD+LF + L EF GV LK Q N K + +L + + + Sbjct: 307 KGDVLFGKLRPYLAKVLLAEFEGVSTTELLALKPSQDVN----GKFLFFQLIAEGFIDQV 362 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKG--ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 F T G K + + I + + LPP+ EQ I + +++ A Sbjct: 363 NSF----------------TYGTKMPRVGPEQITNLFIPLPPLPEQQAIAQFLDRETAKI 406 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGEL 422 DT+ + ++ ++ ++++ A G++ Sbjct: 407 DTLVAKTRTSIEKLKEYRTALISAAVTGKI 436 >UniRef50_B1ZYW8 Restriction endonuclease S subunits-like protein n=1 Tax=Opitutus terrae PB90-1 RepID=B1ZYW8_OPITP Length = 388 Score = 64.7 bits (156), Expect = 8e-09, Method: Compositional matrix adjust. 
Identities = 79/316 (25%), Positives = 135/316 (42%), Gaps = 45/316 (14%) Query: 117 IFSGFIAHFTKSSLYRNK-ISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLD- 173 IF G+I H + K +S++S G ++ +PA I +P+PPLAEQ+ IAE LD Sbjct: 108 IFPGYIRHLLVEDRFHAKFMSTVSGVGGSLLRARPAHVARIRVPLPPLAEQRRIAEVLDR 167 Query: 174 --TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI 231 L A+ +T A+ + + Q L F A N K W P+ + + + F Sbjct: 168 AEALRAKRRATLAQLDSLTQCL--FLDLFGDPATNPK---GW----PKTVLGEIIEF--- 215 Query: 232 LTELRNGLSSKPNESGVGHP---ILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 G S P E+ P +R+ +R D+ F L R + D+ Sbjct: 216 -----VGGSQPPRETFTYEPSPDTIRLVQIRDFKSDE----FKTYIPRRLARRFFNEDDV 266 Query: 289 LFTRYNGSLEFV--GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 + RY + + G+CG N+ L + ++KD ++ + Sbjct: 267 MIGRYGPPVFQILRGLCG------SYNVALMKALPKDEVSKD----FVFHLLQEQRLHSY 316 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 ++ + T+GQ G++ + ++ PP Q E RRV A + ++ +LA + Sbjct: 317 VVARSERTAGQTGVNLELLEKYPAFRPPASLQREFARRV----AAVEKLKTTQRASLAEL 372 Query: 407 NNLTQSILAKAFRGEL 422 + L S+ +AFRG+L Sbjct: 373 DALFASLQHRAFRGDL 388 >UniRef50_D0IJZ0 Type I restriction-modification system specificity subunit S n=1 Tax=Vibrio sp. RC586 RepID=D0IJZ0_9VIBR Length = 391 Score = 64.7 bits (156), Expect = 8e-09, Method: Compositional matrix adjust. 
Identities = 74/318 (23%), Positives = 141/318 (44%), Gaps = 31/318 (9%) Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 EKL+F + ++ S L + + S++ + + + F+ + IP+PPL EQK IA LD Sbjct: 96 EKLVFPKY-GYYALSRL-KPILESIAPATTVAIVSKSKFEELEIPLPPLEEQKRIAAILD 153 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT--EKWRNFEPQHSVFKKLNFESI 231 + D+ + + +Q + F ++V +T + W E + V Sbjct: 154 ----KADAIRQKRKQAITLADEFLRSVFLEMFGDPVTNPKGWSRKEIKEGV--------- 200 Query: 232 LTELRNGLSSKPNESGVGH---PILRISSVRAGHVDQNDIRFLECSESELNRHKL--QDG 286 + + +G S+K + G +L+IS+V +G + +F+E ++ + + G Sbjct: 201 -SRITSGWSAKGDSRPCGQGEVGVLKISAVTSGEFKPKENKFVEKHIIPEGKNLIFPKKG 259 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARN 345 DLLF+R N + E V ++ K ++ PDKL L+ + L PEY + + Sbjct: 260 DLLFSRAN-TRELVAATCIVPK-DCDDVFLPDKLWNIELSSEELMPEYFHMLLQDDKFKE 317 Query: 346 AMMNCVKTTSGQK-GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + +SG IS + ++ + P+ Q + L A ++ + + Sbjct: 318 TLTSQATGSSGSMLNISKQKFETTLAPFAPIDLQMKFKNIYWHLKDNAANMKNSEDYLIE 377 Query: 405 RVNNLTQSILAKAFRGEL 422 + N L+Q KAF G+L Sbjct: 378 QFNALSQ----KAFSGQL 391 >UniRef50_Q7MNA3 Restriction endonuclease S subunit n=1 Tax=Vibrio vulnificus YJ016 RepID=Q7MNA3_VIBVY Length = 433 Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust. 
Identities = 53/195 (27%), Positives = 104/195 (53%), Gaps = 17/195 (8%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + + LI G+TY + ++ L ++R++N+QNGK D VFV N +K Sbjct: 241 WEEVELRKLGDLISGLTYSPDD----VRASGLLVLRSSNVQNGKIVYGDNVFVEPN-IKG 295 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSA--HQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + P+DI+I + +GSK+++GK+A Q++P + GAF + R + ++ F + Sbjct: 296 ANISEPDDILICVRNGSKALIGKNALIPQNVPL-STHGAFMTIFRSK---YAQFTFQLFQ 351 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS-TKARF 186 ++ Y+ ++ + GA IN+I IP ++K EK+ L+ +D A+ Sbjct: 352 TNAYQKQVDA-DLGATINSINGKQLLKYKFKIPRSNDEK---EKIVKCLSSLDDLINAQT 407 Query: 187 EQIPQILKRFRQAVL 201 ++I ++LK +++ ++ Sbjct: 408 DKI-EVLKEYKKGLM 421 >UniRef50_Q8YRH1 Type I restriction-modification enzyme S subunit n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YRH1_ANASP Length = 383 Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 87/342 (25%), Positives = 146/342 (42%), Gaps = 63/342 (18%) Query: 109 GVLRPEKLIFSGFIAHFTKSSLYRNKI-SSLSAGANINNIKPASFDLINIPIPPLAEQKI 167 G +R K +++ + F KS + + + SS + AN NI + + P+PPLAEQK Sbjct: 33 GCIRIIKPVYNQLVNIFLKSPIIVDYLLSSATGTANQANIGANTLRELPFPLPPLAEQKR 92 Query: 168 IAEKLDTLLAQVDSTKARFEQ--------------------------------------- 188 I EK D LL+ D + R +Q Sbjct: 93 IVEKCDRLLSICDEIEKRHQQRQESIVRMNESAIAQLLSSQNPDDFRQHWQRICNNFDLL 152 Query: 189 --IPQILKRFRQAVLGGAVNGKLT-EKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 IP+ + + RQA+L AV GKLT + + + K ++ SIL NG + K Sbjct: 153 YSIPETIPKLRQAILQLAVQGKLTNQSSKEIKKISDTHKVSDYVSIL----NGYAFKSTW 208 Query: 246 -SGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G +LR ++V G + +D+ + E E R KL D++ SL+ + Sbjct: 209 FINDGIRLLRNANVGHGDLRWDDVATISEERAQEFQRFKLDIDDIVI-----SLDRPIIS 263 Query: 304 GLLKKLQHQNLLYPDKLIRARL------TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 LK + P L++ R+ T +P++ ++ SP NA+ ++G Sbjct: 264 TGLKVARITKNDLPCLLLQ-RVGKFEFKTDKVIPDFFFLWLQSPIFINAIDP--GRSNGV 320 Query: 358 
KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 IS K I++ + P +EQ IV + ++L + DT+E ++ Sbjct: 321 PHISSKSIEAILFNPPSREEQKRIVEKCDRLMSLCDTLEAKL 362 Score = 43.9 bits (102), Expect = 0.013, Method: Compositional matrix adjust. Identities = 26/109 (23%), Positives = 49/109 (44%), Gaps = 8/109 (7%) Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R+ K + + IF SP + +++ T+ Q I ++ LPP+ EQ IV Sbjct: 36 RIIKPVYNQLVNIFLKSPIIVDYLLSSATGTANQANIGANTLRELPFPLPPLAEQKRIVE 95 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPD 432 + ++L + D IEK+ + + +S +A+ ++NPD Sbjct: 96 KCDRLLSICDEIEKRHQQRQESIVRMNESAIAQLL--------SSQNPD 136 >UniRef50_A3PKU6 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A3PKU6_RHOS1 Length = 456 Score = 64.3 bits (155), Expect = 9e-09, Method: Compositional matrix adjust. Identities = 101/453 (22%), Positives = 177/453 (39%), Gaps = 63/453 (13%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PEGW + + + ++ + + +P++ +NI +G+ D + V Sbjct: 16 GEVPEGWEVKCLRMIADELQTGPFGSQLHTEDYVTAGVPIVNPSNILDGQIVPDDEIGVD 75 Query: 64 KN--LVKESQKISPEDIVIAMSSGSKSVVGKSA---HQHLPFECSFGAFCGVLRPEKLIF 118 + L + + P DI++ G + +G+ A +P C G+ L+ + + Sbjct: 76 EATALRLANHALLPGDIIL----GRRGELGRCAVVPDGTMPLLCGTGSLRIRLKSSQAL- 130 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 FIA ++ R +S S G+ ++N+ A I I +P L EQ+ I L+ A+ Sbjct: 131 PDFIAECIRTPRVREWLSLQSVGSTMDNLNTAIVGKIQIALPSLPEQRAITAFLNRETAK 190 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVN-----------------GKLTEKWRNFEPQHS 221 +D+ ++ +L RQAVL AV G + E W Sbjct: 191 IDALVEEQRRLIALLAEKRQAVLNHAVTRGLNPDALLKPSGIDWLGDIPEGWEVV----P 246 Query: 222 VFKKLNFESILTELRNGLSSKPNESGVGH----PILRISSVRAGHVDQNDIRFLECSE-- 275 + K ES T R S+P H + I VR G V+ E +E Sbjct: 247 IRKVARLESGHTPSR----SRPEWWVDCHIPWFSLADIWQVRPGRVEY----VYETAEAV 298 Query: 276 SELNRHK-----LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 SEL L G ++ +R S+ F V G+ + + + RL L Sbjct: 299 
SELGLQNSSARLLPAGTVMLSR-TASVGFSAVMGIAMATTQD---FANWVCGCRL----L 350 Query: 331 PEYIEIFFSS-PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 P+Y+ PS + K S I DI++ + LPP++EQ IV V Sbjct: 351 PDYLLYCLRGMPSEFERL----KMGSTHNTIYMPDIRTLTIPLPPLEEQKAIVDHVRASV 406 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 D + A+ + ++++ A G++ Sbjct: 407 GALDELMDTATTAITLLQERRAALISAAVTGKI 439 Score = 55.8 bits (133), Expect = 3e-06, Method: Compositional matrix adjust. Identities = 55/228 (24%), Positives = 91/228 (39%), Gaps = 28/228 (12%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PEGW + P+ V L G T + + + D ++P +I + + V+ Sbjct: 236 GDIPEGWEVVPIRKVARLESGHTPSRSRP-EWWVDCHIPWFSLADIWQVRPGRVEYVYET 294 Query: 64 KNLVKE------SQKISPEDIVI---AMSSGSKSVVGKSAHQHLPFECSFGAFCGV-LRP 113 V E S ++ P V+ S G +V+G + F CG L P Sbjct: 295 AEAVSELGLQNSSARLLPAGTVMLSRTASVGFSAVMGIAMATTQDFA---NWVCGCRLLP 351 Query: 114 EKLIFS--GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 + L++ G + F + L G+ N I + IP+PPL EQK I + Sbjct: 352 DYLLYCLRGMPSEFER----------LKMGSTHNTIYMPDIRTLTIPLPPLEEQKAIVDH 401 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ 219 + + +D +L+ R A++ AV GK+ R+ PQ Sbjct: 402 VRASVGALDELMDTATTAITLLQERRAALISAAVTGKIDV--RDLSPQ 447 >UniRef50_Q1VAF2 Hypothetical type I restriction-modification system specificity determinant n=1 Tax=Vibrio alginolyticus 12G01 RepID=Q1VAF2_VIBAL Length = 464 Score = 64.3 bits (155), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 88/387 (22%), Positives = 169/387 (43%), Gaps = 36/387 (9%) Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAF-----CGVLRPEK 115 +PK++ + + DI++A S + VGKS F +C F + C R Sbjct: 82 LPKDIASDY-LLKDRDILLARSGAT---VGKSFIYRKEFGDCCFAGYLIKVSCDSAR--- 134 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDT 174 + S + F +SS Y IS A I N+ + + I +P + EQ IA LD Sbjct: 135 -LNSDYAFWFFQSSSYWQYISGSQIQATIQNVSAEKYGEMYISLPEHVEEQTQIANFLDH 193 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKK 225 A++D+ + +Q+ ++LK RQAV+ AV L + W P+H +++ Sbjct: 194 ETAKIDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPQAPMKNSGVEWLGEVPEH--WEQ 251 Query: 226 LNFESILTELRNG-LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS--ESELNRHK 282 + + I ++ + + P + + R ++VR G + + ++ + E R + Sbjct: 252 IKLKHITHQIVDAEHKTAPYFDDGEYLVCRTTNVRDGKLRLDGGKYTNHAIYEEWTKRGQ 311 Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 + GD+LFTR + E G + Q ++ L + T+ LPE++ S Sbjct: 312 PEVGDILFTREAPAGEACVYTGEVPLCLGQRMV----LFKLNQTR-VLPEFVLHSIYSGL 366 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 A + + + S + DI++ + PP EQA+IV + ++ A D + + Sbjct: 367 ADD-FVKQLSQGSTVAHFNMSDIQNIPLFEPPKDEQAQIVDHLAKVLAKYDALTSSASLK 425 Query: 403 LARVNNLTQSILAKAFRGELTAQ-WRA 428 + + ++++ A G++ + W+A Sbjct: 426 IELMQERRTALISAAVTGKIDVRNWQA 452 >UniRef50_Q112D6 Restriction modification system DNA specificity domain n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q112D6_TRIEI Length = 402 Score = 63.9 bits (154), Expect = 1e-08, Method: Compositional matrix adjust. 
Identities = 90/411 (21%), Positives = 175/411 (42%), Gaps = 70/411 (17%) Query: 41 LPLIRANNIQNGKFDTTDLVFVPKNLVKE--SQKISPEDIVIAMSS--GSKSVVGKSAHQ 96 +P +R NNIQ+GK + D++F+ + +I +D++I+++ G +V+ +A Sbjct: 33 IPFLRVNNIQDGKINLGDVLFIDSKTDQALARSRILKKDVIISIAGTIGKTAVIPTNAPA 92 Query: 97 HLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN 156 C+ ++R + + H+ + +I+ A I+N+ + Sbjct: 93 ---MNCNQA--LAIIRLHNNVDPYYFNHWLNTGDAFRQITGSKVTATISNLSLGCIKKLK 147 Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ----IPQILKRFRQAVLGGAVNGKLTEK 212 IP+PP+ EQ+ IA LD Q D+ + + +Q ++L+ + G V Sbjct: 148 IPLPPIEEQRRIAAILD----QADAIRRKRQQAIALTDELLRSTFLEMFGDPV------- 196 Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV--------GHPILRISSVRAGHVD 264 P+ KKL E + + + + P S + G P+ I +V+ Sbjct: 197 ---INPKGWEVKKL--EEVALKRKGAIKCGPFGSQLLISEFVKDGIPVYGIDNVQKNEFV 251 Query: 265 QNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 +++ + E L +QD D+L +R G+ VG + +++L P+ L + Sbjct: 252 WAKPKYITTEKYEQLKSFSIQDEDVLISR-TGT---VGRTCVAPPDIPRSILGPNLLKVS 307 Query: 324 RLTKDALPEYI------------EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 T LP+Y+ EI SP A A+ N ++K+ + Sbjct: 308 LNTNKMLPKYLSYALNHSNPLIEEIKRMSPGATVAVFNTT------------NLKALRLT 355 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +P + Q++ V E + + +++ +N L NNL S+L +AF+G+L Sbjct: 356 IPHINLQSQFVNFTENV----ELTKQKESNYLTESNNLFNSLLQRAFKGQL 402 >UniRef50_C9Q5S0 Possible type I restriction-modification system S subunit n=1 Tax=Vibrio sp. RC341 RepID=C9Q5S0_9VIBR Length = 469 Score = 63.5 bits (153), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 76/303 (25%), Positives = 124/303 (40%), Gaps = 24/303 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P W+ + + + +G+T KE L+D +P + + + D P Sbjct: 21 IPAHWLTSKLRYTFSFGKGLTITKEN----LRDTGIPCVSYGEVHSKYGFEIDPARHPLK 76 Query: 66 LVKESQ-KISPE------DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLI 117 V + K SP DIV A +S G Q + E F + ++ RP Sbjct: 77 CVGDDYLKTSPYALLKKGDIVFADTSEDIDGSGNFT-QLVSNEQVFAGYHTIIARPYNHE 135 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 S F A+ S R +I G + +I A +NI +PPL E+ IA LD A Sbjct: 136 CSRFYAYLLDSKELRTQIRHAVKGVKVFSITQAILRGVNIWLPPLKERNQIANFLDHETA 195 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNF 228 ++D+ + +Q+ ++LK RQAV+ AV L + W P+H L Sbjct: 196 KIDTLIEKQQQLIKLLKEKRQAVVSHAVTKGLNPQAPMKDSGVEWLGEVPEHWSISPLK- 254 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 + T G SS N G P +R +++ + + DI + + R L DG+L Sbjct: 255 HHVNTVNGFGFSSN-NFQDEGVPFIRAGNIKNKTIVKPDIHLPQAVVDKYQRVILNDGEL 313 Query: 289 LFT 291 + + Sbjct: 314 VIS 316 Score = 53.9 bits (128), Expect = 1e-05, Method: Compositional matrix adjust. 
Identities = 50/214 (23%), Positives = 91/214 (42%), Gaps = 13/214 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W I+P+ + G + N +D+ +P IRA NI+N D + +P Sbjct: 242 GEVPEHWSISPLKHHVNTVNGFGFSS----NNFQDEGVPFIRAGNIKNKTIVKPD-IHLP 296 Query: 64 KNLVKESQKISPED--IVIAMSSGSKSVVGKSAHQHLPFECSFGAFCG-----VLRPEKL 116 + +V + Q++ D +VI+M + + Q S +LR + Sbjct: 297 QAVVDKYQRVILNDGELVISMVGSDPKIKASAVGQVGLVPPSLAGSVPNQNVVILREQSS 356 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + F+ + + YR+ + S AN + I + P L EQK I + LDT Sbjct: 357 LLKKFLFYVVCGTPYRHHLDVFSHKLANQSIISSSLIICAQFTFPELDEQKEIVDFLDTQ 416 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 L + D + + + + + A++ V GK+ Sbjct: 417 LRKYDWLMEKATRSIEFMNERKTALISATVTGKI 450 >UniRef50_C8W862 Putative uncharacterized protein n=1 Tax=Atopobium parvulum DSM 20469 RepID=C8W862_ATOPD Length = 459 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 56/186 (30%), Positives = 89/186 (47%), Gaps = 17/186 (9%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN---ESGVGHPILRISSVRAGHV 263 GKL + +F+ +L ES+ GLS K + ESG+ PILR+S++ G + Sbjct: 251 GKLERVYGSFKCPFKSISELTKESLF-----GLSLKASLKQESGM-IPILRMSNIVNGEI 304 Query: 264 DQNDIRFL----ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK 319 D + +++L + E ++ L+ GD L R N S E VG + + Y Sbjct: 305 DCSSLKYLPYKSAVTPREPDKWLLRKGDFLINRTN-SKELVGKSAVFN--LDGDYTYASY 361 Query: 320 LIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQ 378 +IR R T LPEY+ I F P R + + T+GQ I+ +I S + +P + EQ Sbjct: 362 IIRYRFDTSVVLPEYVNIMFMLPLVRIQIDTMSRQTAGQCNINSGEIGSIRIPIPSIPEQ 421 Query: 379 AEIVRR 384 I+ + Sbjct: 422 QAIIDK 427 Score = 46.2 bits (108), Expect = 0.002, Method: Compositional matrix adjust. 
Identities = 44/158 (27%), Positives = 81/158 (51%), Gaps = 8/158 (5%) Query: 41 LPLIRANNIQNGKFDTTDLVFVP-KNLV--KESQK--ISPEDIVIAMSSGSKSVVGKSAH 95 +P++R +NI NG+ D + L ++P K+ V +E K + D +I ++ SK +VGKSA Sbjct: 291 IPILRMSNIVNGEIDCSSLKYLPYKSAVTPREPDKWLLRKGDFLINRTN-SKELVGKSAV 349 Query: 96 QHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFD 153 +L + ++ ++ R + ++ ++ L R +I ++S A NI Sbjct: 350 FNLDGDYTYASYIIRYRFDTSVVLPEYVNIMFMLPLVRIQIDTMSRQTAGQCNINSGEIG 409 Query: 154 LINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 I IPIP + EQ+ I +K + D+ A+ E++ Q Sbjct: 410 SIRIPIPSIPEQQAIIDKYYSTKDGADAFYAKAEELKQ 447 >UniRef50_A0KWU0 Restriction modification system DNA specificity domain n=1 Tax=Shewanella sp. ANA-3 RepID=A0KWU0_SHESA Length = 391 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 93/395 (23%), Positives = 169/395 (42%), Gaps = 40/395 (10%) Query: 38 DDYLPLIRANNIQNGK--FDTTDLVFVPKNLVKESQKISPEDIVI--AMSSGSKSVVGKS 93 DD L I + N ++T L P+ L K ++ + P D ++ +MS G ++ + Sbjct: 27 DDGLNWISIKDASNSNKYINSTKLKIKPEGLTK-TRMVYPGDFLLTNSMSFGRPYIMNTT 85 Query: 94 AHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFD 153 H + G P+K + S + + S + + S L+AGA + N+ Sbjct: 86 GCIHDGWLVLSG------NPDK-VNSDYFYYLLGSDTLKQRFSGLAAGAVVKNLNTELVK 138 Query: 154 LINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKW 213 + +P+PPLAEQK IA LD + D+ + + +Q Q+ +AV +T Sbjct: 139 SVEVPLPPLAEQKRIAAILD----KADAIRRKRQQAIQLADDLLRAVFLEMFGDPVT--- 191 Query: 214 RNFEPQHSVFKKLNFESILTELRNGLSSKPNE----SGVGHPILRISSVRAGHVDQNDIR 269 P+ KL S L ++ G + K E S + R + G+ + D Sbjct: 192 ---NPKGFQKSKL---SALADVITGFAFKSAEYVEDSDDAVRLCRGVNTLTGYFEWKDTA 245 Query: 270 FLECSE-SELNRHKLQDGDLLFTRYNGSLEF-VGVCGLLKKLQHQNLLYPDKLIRARLTK 327 F + ++ + L+ +KL+ GD++ + + VC + + L+ IR++ Sbjct: 246 FWDSNKINGLHNYKLEAGDVILAMDRPWISSGLKVCVFPENERDTYLVQRVARIRSK--- 302 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 P Y + +SS + +C T + IS ++K+ +L+P K ++ V + Sbjct: 303 
--QPRYTDYLYSSILSPAFEKHCCPTETTVPHISPVELKNFEILVPDEKSVSKYHDIVSK 360 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 L D +E + A N+L+Q KAF G+L Sbjct: 361 LRRSKDRMEMNLTEANQIFNSLSQ----KAFSGQL 391 >UniRef50_C5B9C5 Type I restriction-modification system, S subunit, putative n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5B9C5_EDWI9 Length = 585 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 98/504 (19%), Positives = 187/504 (37%), Gaps = 113/504 (22%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPEGW + +T + G +K + + +++ +IQ+G+ ++ V ++ Sbjct: 101 LPEGWAWGSIGYITEFVNGYAFKSSD----FASEGVGIVKIGDIQDGEIVVDNMSRVSQH 156 Query: 66 LVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 +V E+ ++ D++IAMS + +G + + + G F L ++ F Sbjct: 157 VVDGLNENLQVKSGDMLIAMSGATTGKLGFNKTDEIFYLNQRVGKFITYLVDKE-----F 211 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD- 180 + + + + N ++ G+ I NI + I I +PPLAEQ I K+D L+A D Sbjct: 212 LYYPLATKIAENLAKAM--GSAIPNISTKQINEITIALPPLAEQHRIVAKVDELMALCDQ 269 Query: 181 -----------------------STKARFEQIPQILKR-----------------FRQAV 200 + + E++ Q R +Q + Sbjct: 270 LEQCSESQLAAHQTLVEALLATLTDSSDTEELAQNWARLNTHFDTLFTTEASIDALKQTI 329 Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLNFE------------------------------- 229 L AV GKL ++ + EP ++ ++ E Sbjct: 330 LQLAVMGKLVQQDPSDEPASALLARIAAEKAQLIKEKKIKKEKPLPAISEDEKPFSLPKG 389 Query: 230 -------SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---N 279 + + +G P + G P + V+ R++ +L N Sbjct: 390 WDFAYMQDLCYLITDGTHQTPKYTDDGRPFISAQCVKPFRFMPEFCRYVSEEHYQLYIKN 449 Query: 280 RHKLQDGDLLFTRYN---GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 R + GD+L +R G + C LL P++ +Y+E+ Sbjct: 450 RRP-EFGDILLSRVGAGIGEAAVIDSCLEFAIYVSTGLLKPNR-------GAVYSKYLEL 501 Query: 337 FFSSPSARN-AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 + +SP R + N + Q ++ I+S +V LPP KEQ IV +V ++ D + Sbjct: 502 WLNSPIGRGFSERNTLGKGVSQGNLNLSLIRSFIVSLPPKKEQKLIVAKVGEMITLCDQL 561 Query: 396 
EKQVNNA----LARVNNLTQSILA 415 + + + LA +L + +A Sbjct: 562 KSCLQTSQQTQLALAESLVEGAIA 585 >UniRef50_C3XPA5 Restriction modification system DNA specificity subunit (Fragment) n=1 Tax=Helicobacter winghamensis ATCC BAA-430 RepID=C3XPA5_9HELI Length = 203 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 8/179 (4%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 E W + V ++ G +K ++ +N + D LP+I+ N+ NG + D+VF P + Sbjct: 8 EQWQEVRLGEVAEIVNGYAFKSKEFLNIQQRDSLPIIKIKNVANGDVNLNDVVFYPYSKQ 67 Query: 68 KESQKISPEDIVIAMS----SGSKSVVGK-SAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 E I DI+++++ VVG+ S +++ F ++ + F+ Sbjct: 68 LEKFLIKYGDILVSLTGNHPQAQSQVVGQISKYKYKQFALLNQRVAKIVTKDAE--QDFL 125 Query: 123 AHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + K++ N ++S S+G AN NI + + IP+PPL Q+ IAE L + ++D Sbjct: 126 YYLLKTNKIHNILASHSSGSANQANISSKDIENLTIPLPPLTIQQKIAEILSSFDDKID 184 Score = 51.2 bits (121), Expect = 8e-05, Method: Compositional matrix adjust. Identities = 44/170 (25%), Positives = 82/170 (48%), Gaps = 10/170 (5%) Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY-NGSLEFVGVCGLLKKL 309 PI++I +V G V+ ND+ F S+ +L + ++ GD+L + N V G + K Sbjct: 42 PIIKIKNVANGDVNLNDVVFYPYSK-QLEKFLIKYGDILVSLTGNHPQAQSQVVGQISKY 100 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 +++ ++ + +TKDA +++ + N + + ++ Q IS KDI++ Sbjct: 101 KYKQFALLNQRVAKIVTKDAEQDFLYYLLKTNKIHNILASHSSGSANQANISSKDIENLT 160 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 + LPP+ Q +I E L ++ D I+ L R N +S+ FR Sbjct: 161 IPLPPLTIQQKI---AEILSSFDDKID-----LLHRQNKTLESLALTLFR 202 >UniRef50_B5GJX8 Type I restriction modification enzyme protein S n=1 Tax=Streptomyces sp. SPB74 RepID=B5GJX8_9ACTO Length = 385 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. 
Identities = 74/280 (26%), Positives = 122/280 (43%), Gaps = 34/280 (12%) Query: 154 LINIPI--PPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVL-----GGAVN 206 L N P+ PPLAEQ+ IA LL VD+ +A+ + +L Q+V A N Sbjct: 119 LKNFPVVKPPLAEQQRIA----ALLDHVDALRAKRREATTLLDSLAQSVFLDMFGDPAAN 174 Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP-NESGVGHPILRISSVRAGHVDQ 265 + +W P +V ++ +G S P ++ G +L++S+V +G Sbjct: 175 PR---QW----PAGTV------ADLVAGFESGKSIAPGSDEGAEKRVLKVSAVTSGEFRG 221 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR--A 323 ++ + + + H +++GDLLF+R N + + +G L+++ L PDKL R Sbjct: 222 SESKPVPEDYTVPPAHLVREGDLLFSRAN-TEDLIGAVALVEEFT-GALALPDKLWRFVW 279 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIV 382 +D P Y+ F R + TSG K IS + +PP +AE Sbjct: 280 HDGQDGHPLYVRHLFRQKEFRRRIRERASGTSGSMKNISQPKVLGIRCGIPPEGLRAEFC 339 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 RV + D + LA ++ L S+ +AF G L Sbjct: 340 ARVRSI----DASRRAHRGHLAALDELFTSLRHRAFSGSL 375 >UniRef50_B9XT14 Restriction modification system DNA specificity domain protein n=1 Tax=bacterium Ellin514 RepID=B9XT14_9BACT Length = 405 Score = 62.8 bits (151), Expect = 3e-08, Method: Compositional matrix adjust. 
Identities = 57/215 (26%), Positives = 102/215 (47%), Gaps = 20/215 (9%) Query: 223 FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHK 282 ++ L F ++ + G+S+ + G PIL + ++ G V + + +E+ L + + Sbjct: 7 WRVLPFGEVVEHSQYGISTPTSPDGT-IPILGMKNINDGQVVVGNPDRVSITEAVLAKQR 65 Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSP 341 L+DGDLLF R N SL+ VG GL + + + + L+R RL ++ + P Y+ F++ Sbjct: 66 LKDGDLLFNRTN-SLDLVGKTGLFR--ESGDFVCASYLVRFRLRRNLVDPRYVCYLFNTS 122 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVL-LPPVKEQAEIVRRVEQLFAYADTI---EK 397 ++ M Q I+ ++ + +L LPP +EQ I +E + D I E Sbjct: 123 HSQRIMRQLATKAVAQANINPTSLQRKFLLPLPPRQEQVAIADLLE---FWDDDICRTES 179 Query: 398 QVNNALARVNNLTQSILA-----KAFRGELTAQWR 427 ++ L L Q +L K F+G+ WR Sbjct: 180 RLGKKLEFKRGLMQQLLTGQTQFKEFKGK---PWR 211 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P05719 Type-1 restriction enzyme EcoKI specificity prot... 593 e-168 UniRef50_A1AJL9 HsdS, type I site-specific deoxyribonuclease n=2... 390 e-107 UniRef50_B5BKY5 Subunit S of type I restriction-modification sys... 370 e-101 UniRef50_A9N788 Putative uncharacterized protein n=3 Tax=Salmone... 350 7e-95 UniRef50_B7LQL4 Specificity determinant for hsdM and hsdR (Modul... 346 1e-93 UniRef50_A4BLE7 Type I restriction enzyme StySPI specificity pro... 333 8e-90 UniRef50_P06990 Type-1 restriction enzyme EcoBI specificity prot... 328 3e-88 UniRef50_B3YJG5 Type I restriction enzyme EcoKI specificity prot... 318 3e-85 UniRef50_C5SE02 Restriction modification system DNA specificity ... 311 4e-83 UniRef50_A3EKX4 Type I restriction modification DNA specificity ... 310 9e-83 UniRef50_C9RY89 Restriction modification system DNA specificity ... 307 8e-82 UniRef50_P06187 Type-1 restriction enzyme StySJI specificity pro... 300 8e-80 UniRef50_A1BGI9 Restriction modification system DNA specificity ... 
298 4e-79 UniRef50_C9NQJ7 HsdS type I site-specific deoxyribonuclease n=1 ... 296 9e-79 UniRef50_A7IEA1 Restriction modification system DNA specificity ... 296 1e-78 UniRef50_C3MFA1 Putative restriction endonuclease type I, S subu... 295 2e-78 UniRef50_Q1MKB2 Putative type I restriction enzyme specificity s... 294 4e-78 UniRef50_Q210J8 Type I restriction enzyme StySPI specificity pro... 294 5e-78 UniRef50_B2SI71 Type I restriction-modification system, S subuni... 291 3e-77 UniRef50_B6R0S6 Restriction modification system DNA specificity ... 286 1e-75 UniRef50_D1C5W4 Restriction modification system DNA specificity ... 286 1e-75 UniRef50_A8ZTW4 Restriction modification system DNA specificity ... 285 3e-75 UniRef50_Q466N9 Type I restriction-modification system specifici... 284 4e-75 UniRef50_C9YAL6 Putative uncharacterized protein n=1 Tax=Curviba... 281 3e-74 UniRef50_D1UP80 Restriction modification system DNA specificity ... 281 3e-74 UniRef50_UPI0001C36A8C HsdS1 n=1 Tax=Clostridium hathewayi DSM 1... 277 6e-73 UniRef50_A3JFC5 Restriction modification system DNA specificity ... 275 2e-72 UniRef50_UPI0001695152 type I restriction enzyme specificity pro... 274 6e-72 UniRef50_A1TWL9 Restriction modification system DNA specificity ... 273 8e-72 UniRef50_Q1Z9T4 Type I restriction-modification system, S subuni... 273 1e-71 UniRef50_C0VG50 Type I restriction modification enzyme protein S... 271 5e-71 UniRef50_A6DQ81 Putative restriction-modification system specifi... 268 3e-70 UniRef50_A0KZG7 Restriction modification system DNA specificity ... 266 2e-69 UniRef50_A4VH87 Type I restriction-modification system, S subuni... 265 3e-69 UniRef50_UPI0001855288 conserved hypothetical protein n=1 Tax=Fr... 265 4e-69 UniRef50_A6TLK6 Restriction modification system DNA specificity ... 264 5e-69 UniRef50_Q8PTL2 Type I restriction-modification system specifici... 262 2e-68 UniRef50_P06991 Type-1 restriction enzyme EcoDI specificity prot... 
261 3e-68 UniRef50_Q3JBU1 Restriction modification system DNA specificity ... 260 1e-67 UniRef50_C3Q383 Putative uncharacterized protein n=1 Tax=Bactero... 259 1e-67 UniRef50_B2A6M8 Restriction modification system DNA specificity ... 259 2e-67 UniRef50_B0PEE2 Putative uncharacterized protein n=1 Tax=Anaerot... 258 5e-67 UniRef50_C5RH89 Restriction modification system DNA specificity ... 257 8e-67 UniRef50_A1VBQ9 Restriction modification system DNA specificity ... 256 1e-66 UniRef50_A3PYN5 Restriction modification system DNA specificity ... 254 4e-66 UniRef50_Q26D97 Putative type I site-speicific deoxyribonuclease... 254 5e-66 UniRef50_C6A4W8 Putative type I specificity subunit HsdS n=1 Tax... 252 2e-65 UniRef50_C6CR26 Restriction modification system DNA specificity ... 251 4e-65 UniRef50_D0KMA1 Restriction modification system DNA specificity ... 251 6e-65 UniRef50_Q7UK98 Type I restriction enzyme EcoEI specificity prot... 249 1e-64 UniRef50_A3JH04 Specificity determinant for hsdM and hsdR n=1 Ta... 249 1e-64 UniRef50_Q4C702 Restriction modification system DNA specificity ... 248 2e-64 UniRef50_Q21ZK2 Restriction modification system DNA specificity ... 248 4e-64 UniRef50_A6CKF2 Putative type I restriction enzyme specificity p... 248 5e-64 UniRef50_C0QCH4 HsdS2 n=1 Tax=Desulfobacterium autotrophicum HRM... 246 9e-64 UniRef50_Q73D72 Type I restriction-modification enzyme, S subuni... 244 6e-63 UniRef50_C6IKX2 Type I restriction-modification system n=2 Tax=B... 244 7e-63 UniRef50_Q8RJG0 HsdS n=12 Tax=Campylobacter jejuni RepID=Q8RJG0_... 243 9e-63 UniRef50_A3UV36 Type I restriction enzyme specificity protein n=... 243 1e-62 UniRef50_A6E2R5 Restriction endonuclease S subunits-like protein... 243 1e-62 UniRef50_B5VW93 Restriction modification system DNA specificity ... 241 4e-62 UniRef50_B0VPS8 Specificity determinant for hsdM and hsdR n=1 Ta... 241 5e-62 UniRef50_A6UXD7 Type I restriction-modification system, S subuni... 
241 6e-62 UniRef50_B5ECU4 Restriction modification system DNA specificity ... 240 6e-62 UniRef50_B7JRE7 Restriction modification system DNA specificity ... 239 1e-61 UniRef50_A3XPV6 Type I restriction-modification system specifici... 238 3e-61 UniRef50_Q4HFD9 HsdS n=3 Tax=Campylobacterales RepID=Q4HFD9_CAMCO 238 3e-61 UniRef50_C6J5M6 Putative uncharacterized protein n=1 Tax=Paeniba... 238 4e-61 UniRef50_B8GGK0 Restriction modification system DNA specificity ... 238 5e-61 UniRef50_A9I6S0 Type I restriction-modification system, S subuni... 237 5e-61 UniRef50_Q2P0A3 Specificity determinant for hsdM and hsdR n=2 Ta... 237 6e-61 UniRef50_UPI0001C15DDF Restriction modification system DNA speci... 237 6e-61 UniRef50_A1UJN5 Restriction endonuclease S subunits-like protein... 237 7e-61 UniRef50_C5TIE5 Restriction modification system DNA specificity ... 236 1e-60 UniRef50_UPI0001B4DA32 restriction endonuclease S subunits-like ... 236 1e-60 UniRef50_Q1R1F8 Restriction modification system DNA specificity ... 235 2e-60 UniRef50_A7VYZ3 Putative uncharacterized protein n=1 Tax=Clostri... 235 3e-60 UniRef50_A4T8B4 Restriction modification system DNA specificity ... 234 5e-60 UniRef50_A6EUA9 Type I restriction-modification system, S subuni... 234 6e-60 UniRef50_C3NN82 Restriction modification system DNA specificity ... 234 6e-60 UniRef50_Q1VAF2 Hypothetical type I restriction-modification sys... 234 7e-60 UniRef50_B8E4I3 Restriction modification system DNA specificity ... 233 8e-60 UniRef50_C3RBV6 Type I restriction-modification system n=3 Tax=B... 233 1e-59 UniRef50_Q4FUM9 Possible type I restriction-modification system,... 231 3e-59 UniRef50_B2V7V7 Restriction modification system DNA specificity ... 231 5e-59 UniRef50_B0TZ98 Type I restriction-modification system, subunit ... 231 5e-59 UniRef50_A4FXL8 Restriction modification system DNA specificity ... 231 5e-59 UniRef50_A7N438 Putative uncharacterized protein n=1 Tax=Vibrio ... 
230 7e-59 UniRef50_B0JHV8 Restriction modification system DNA specificity ... 229 1e-58 UniRef50_C7QRY1 Restriction modification system DNA specificity ... 229 2e-58 UniRef50_A3PKU6 Restriction modification system DNA specificity ... 229 2e-58 UniRef50_A4FZ34 Restriction modification system DNA specificity ... 228 2e-58 UniRef50_B5IN27 HsdS, type I site-specific deoxyribonuclease n=1... 228 4e-58 UniRef50_B8H0M3 Type I restriction-modification system specifici... 228 4e-58 UniRef50_Q2J5T0 Restriction modification system DNA specificity ... 227 8e-58 UniRef50_C2CF25 Restriction modification system DNA specificity ... 226 1e-57 UniRef50_B7K558 Restriction modification system DNA specificity ... 226 2e-57 UniRef50_C6JA10 Putative uncharacterized protein n=1 Tax=Ruminoc... 225 2e-57 UniRef50_A6W078 Restriction modification system DNA specificity ... 225 3e-57 UniRef50_Q0W4T6 Type I restriction modification system, specific... 225 3e-57 UniRef50_A3YSG6 Putative type I restriction enzyme specificity p... 225 3e-57 UniRef50_Q5QX28 Restriction endonuclease S subunit n=1 Tax=Idiom... 224 4e-57 UniRef50_A6C679 Type I restriction-modification system, S subuni... 224 4e-57 UniRef50_B7VNG6 Type I restriction enzyme EcoKI, S subunit n=1 T... 224 7e-57 UniRef50_UPI0001BC509B restriction modification system DNA speci... 224 7e-57 UniRef50_C7RQC3 Type I restriction-modification system specifici... 223 9e-57 UniRef50_C1D7R6 Type I restriction-modification system, S subuni... 223 1e-56 UniRef50_Q3J7Q5 Restriction endonuclease S subunits-like n=2 Tax... 223 1e-56 UniRef50_B3G223 Type I restriction modification DNA specificity ... 223 1e-56 UniRef50_B1XQR8 Type 1 restriction-modification system specifici... 223 1e-56 UniRef50_B9M293 Restriction endonuclease S subunit-like protein ... 223 1e-56 UniRef50_B4B315 Restriction modification system DNA specificity ... 223 2e-56 UniRef50_D2LA90 Restriction modification system DNA specificity ... 
222 2e-56 UniRef50_B5W475 Restriction modification system DNA specificity ... 222 2e-56 UniRef50_Q7UE18 Restriction modification system S chain homolog ... 222 3e-56 UniRef50_B7R237 Type I restriction modification system, subunit ... 222 3e-56 UniRef50_A3SCN8 Restriction endonuclease S subunit-like protein ... 221 5e-56 UniRef50_D0BWI7 Predicted protein n=1 Tax=Acinetobacter sp. RUH2... 221 5e-56 UniRef50_C6Q0B1 Restriction modification system DNA specificity ... 221 6e-56 UniRef50_C1ZA47 Restriction endonuclease S subunit n=1 Tax=Planc... 220 9e-56 UniRef50_C9Q5S0 Possible type I restriction-modification system ... 220 9e-56 UniRef50_A1ZUE4 Type I restriction-modification system specifici... 219 1e-55 UniRef50_A5G3B9 Restriction modification system DNA specificity ... 219 1e-55 UniRef50_Q0EXK2 HsdS protein n=1 Tax=Mariprofundus ferrooxydans ... 219 2e-55 UniRef50_A5UR98 Restriction modification system DNA specificity ... 219 2e-55 UniRef50_B4RYU8 Type I site-specific deoxyribonuclease n=1 Tax=A... 219 2e-55 UniRef50_Q112D6 Restriction modification system DNA specificity ... 219 2e-55 UniRef50_D2EQS4 Putative type I restriction-modification system,... 218 3e-55 UniRef50_A4CWB5 Type I restriction-modification system, S subuni... 218 4e-55 UniRef50_C1PCQ5 Restriction modification system DNA specificity ... 217 6e-55 UniRef50_C6JN70 Predicted protein n=1 Tax=Fusobacterium varium A... 217 8e-55 UniRef50_A1RES4 Restriction modification system DNA specificity ... 217 8e-55 UniRef50_A7I739 Restriction modification system DNA specificity ... 216 1e-54 UniRef50_A0ZMI3 Putative uncharacterized protein n=1 Tax=Nodular... 216 1e-54 UniRef50_C6MBL0 Restriction modification system DNA specificity ... 216 1e-54 UniRef50_B4VXC6 Type I restriction modification DNA specificity ... 216 1e-54 UniRef50_C2CSZ9 Type I restriction modification DNA specificity ... 216 2e-54 UniRef50_B8GLU3 Type I restriction-modification system, S subuni... 
215 2e-54 UniRef50_B0CE92 Type I restriction-modification enzyme S subunit... 214 5e-54 UniRef50_A3J6X3 Type I restriction-modification system, S subuni... 214 6e-54 UniRef50_B7KF57 Restriction modification system DNA specificity ... 213 1e-53 UniRef50_A8V066 Type I restriction-modification enzyme, S subuni... 213 1e-53 UniRef50_UPI00016B0992 probable type I restriction-modification ... 213 1e-53 UniRef50_Q0W5N3 Type I restriction modification system, specific... 213 1e-53 UniRef50_A1TSH8 Restriction modification system DNA specificity ... 213 1e-53 UniRef50_A8YFX5 HsdS protein n=2 Tax=Microcystis aeruginosa PCC ... 213 2e-53 UniRef50_C5SDH7 Putative uncharacterized protein n=1 Tax=Allochr... 212 2e-53 UniRef50_Q8GN10 Putative type I specificity subunit HsdS n=3 Tax... 211 3e-53 UniRef50_D2TNZ5 Putative type I restriction modification system ... 211 7e-53 UniRef50_B3R3C2 Type I restriction-modification methylase S subu... 210 7e-53 UniRef50_A2TPX3 RmeS n=1 Tax=Dokdonia donghaensis MED134 RepID=A... 210 8e-53 UniRef50_A1ZTI8 Type I restriction enzyme StySJI specificity pro... 210 8e-53 UniRef50_C0XBA7 Type I restriction-modification system, S subuni... 210 9e-53 UniRef50_C9NRR1 Type I restriction-modification system specifici... 210 9e-53 UniRef50_C2QHW5 Putative uncharacterized protein n=2 Tax=Bacillu... 210 9e-53 UniRef50_D1J921 Putative type I restriction enzyme, DNA specific... 210 9e-53 UniRef50_D1XRZ5 Restriction modification system DNA specificity ... 210 1e-52 UniRef50_C7RNT4 Restriction endonuclease S subunits-like protein... 209 1e-52 UniRef50_A5GB19 Restriction modification system DNA specificity ... 209 2e-52 UniRef50_D1YNY9 Type I restriction modification DNA specificity ... 209 2e-52 UniRef50_UPI0001AF6F3B polypeptide HsdS n=1 Tax=Mycobacterium ka... 209 2e-52 UniRef50_Q8TN78 Type I restriction modification enzyme protein S... 209 2e-52 UniRef50_B1LRG3 Type I restriction modification DNA specificity ... 
208 3e-52 UniRef50_B0RQ64 Type I site-specific DNA methyltransferase speci... 208 3e-52 UniRef50_B9ZS45 Restriction modification system DNA specificity ... 207 8e-52 UniRef50_A6L7U8 Type I restriction enzyme EcoAI specificity prot... 207 9e-52 UniRef50_Q1K3D0 Restriction modification system DNA specificity ... 206 1e-51 UniRef50_UPI0001BC364B restriction modification system DNA speci... 206 2e-51 UniRef50_C9P132 Type I restriction-modification system specifici... 206 2e-51 UniRef50_B4TEJ6 Restriction modification system DNA specificity ... 205 2e-51 UniRef50_A1K1C0 Type I site-specific deoxyribonuclease n=3 Tax=B... 205 4e-51 UniRef50_B5IRS1 Type I restriction modification DNA specificity ... 205 4e-51 UniRef50_Q0RV87 Type I restriction-modification system specifici... 204 4e-51 UniRef50_C6CZ61 Restriction modification system DNA specificity ... 204 8e-51 UniRef50_C5C353 Restriction modification system DNA specificity ... 203 1e-50 UniRef50_Q64AS2 Restriction endonuclease S subunits n=1 Tax=uncu... 203 1e-50 UniRef50_Q0RKJ6 Type I restriction modification enzyme protein S... 203 2e-50 UniRef50_D0C390 Type I restriction-modification system specifici... 202 2e-50 UniRef50_A3US47 Type I site-specific deoxyribonuclease n=1 Tax=V... 202 2e-50 UniRef50_B3PQK6 Probable type I restriction-modification system ... 202 3e-50 UniRef50_Q2SCB3 Restriction endonuclease S subunit n=1 Tax=Hahel... 201 3e-50 UniRef50_A6E2C3 Restriction modification system DNA specificity ... 201 4e-50 UniRef50_Q8EJT0 Type I restriction-modification system, S subuni... 201 4e-50 UniRef50_C4KDM6 Restriction modification system DNA specificity ... 201 7e-50 UniRef50_B9KF72 Type I restriction-modification system, S subuni... 200 7e-50 UniRef50_C2KFA2 Restriction endonuclease S subunit n=4 Tax=Lacto... 200 8e-50 UniRef50_C9KLK0 Putative phosphoribosylformylglycinamidine synth... 200 1e-49 UniRef50_A6Y5S9 Restriction endonuclease S subunit n=1 Tax=Vibri... 
200 1e-49 UniRef50_C5BH70 Restriction modification system DNA specificity ... 199 1e-49 UniRef50_A8RUN3 Putative uncharacterized protein n=1 Tax=Clostri... 199 2e-49 UniRef50_A0L1U2 Restriction modification system DNA specificity ... 199 2e-49 UniRef50_Q1VR15 Type I restriction-modification enzyme 1, S subu... 199 2e-49 UniRef50_A7JK69 Type I restriction-modification system n=1 Tax=F... 198 3e-49 UniRef50_D0WYM6 Putative uncharacterized protein n=1 Tax=Vibrio ... 198 3e-49 UniRef50_C2I227 Restriction modification system DNA specificity ... 198 4e-49 UniRef50_B3E898 Restriction modification system DNA specificity ... 197 5e-49 UniRef50_Q4HNY2 Type I restriction-modification system specifici... 197 5e-49 UniRef50_B6VTA2 Putative uncharacterized protein n=1 Tax=Bactero... 197 5e-49 UniRef50_A5KSY3 Restriction modification system DNA specificity ... 197 6e-49 UniRef50_Q6F778 Putative type I restriction-modification system ... 197 6e-49 UniRef50_C0VJ61 Restriction modification system DNA specificity ... 197 1e-48 UniRef50_B5FA22 Restriction modification system DNA specificity ... 196 2e-48 UniRef50_Q1GLF5 Type I restriction-modification system; S subuni... 196 2e-48 UniRef50_Q307D8 Type I RM system S subunit n=1 Tax=Arthrospira p... 195 3e-48 UniRef50_C6DAR8 Restriction modification system DNA specificity ... 195 3e-48 UniRef50_Q8KLM8 Restriction-modification enzyme type I S subunit... 195 3e-48 UniRef50_D0J4L5 Putative uncharacterized protein n=1 Tax=Comamon... 195 4e-48 UniRef50_B4SA10 Restriction modification system DNA specificity ... 193 1e-47 UniRef50_A0KWU0 Restriction modification system DNA specificity ... 192 3e-47 UniRef50_B8EFW7 Restriction modification system DNA specificity ... 191 4e-47 UniRef50_C7D880 Restriction modification system DNA specificity ... 191 7e-47 UniRef50_B0JXI4 Putative type I restriction enzyme specificity p... 190 1e-46 UniRef50_Q57594 Type-1 restriction enzyme MjaXIP specificity pro... 
189 2e-46 UniRef50_C6YVW3 Predicted protein n=2 Tax=Francisella philomirag... 189 2e-46 UniRef50_D2QTT7 Restriction modification system DNA specificity ... 189 2e-46 UniRef50_B3H2F5 Type I restriction-modification system, S subuni... 189 2e-46 UniRef50_A3XVN0 Type I restriction-modification system, S subuni... 188 3e-46 UniRef50_Q89Z57 Putative type I restriction enzyme S.BthVORF4518... 188 3e-46 UniRef50_C3PVT7 Type I restriction enzyme EcoR124II specificity ... 188 3e-46 UniRef50_A5KY57 Type I restriction-modification enzyme, S subuni... 188 3e-46 UniRef50_C5DB08 Restriction modification system DNA specificity ... 188 4e-46 UniRef50_Q167L9 Type I restriction enzyme specificity subunit, p... 188 5e-46 UniRef50_B5EKM3 Restriction modification system DNA specificity ... 187 5e-46 UniRef50_D0KYE4 Restriction modification system DNA specificity ... 187 7e-46 UniRef50_C5VLJ8 HsdS protein n=1 Tax=Prevotella melaninogenica A... 187 8e-46 UniRef50_B2IP18 Type I restriction-modification system, S subuni... 187 9e-46 UniRef50_Q3AQE4 Restriction endonuclease S subunits-like n=1 Tax... 187 9e-46 UniRef50_B0QS41 Type I restriction enzyme EcoKI subunit R n=1 Ta... 186 1e-45 UniRef50_A1WW67 Restriction modification system DNA specificity ... 186 2e-45 Sequences not found previously or not previously below threshold: UniRef50_C6RQJ9 Restriction endonuclease S subunit n=2 Tax=Acine... 227 9e-58 UniRef50_B4VK59 Putative uncharacterized protein n=1 Tax=Microco... 204 5e-51 UniRef50_A0Q725 Type I restriction-modification system, subunit ... 203 9e-51 UniRef50_Q30XD2 Type I restriction-modification system, S subuni... 203 2e-50 UniRef50_Q6GD64 Putative type I restriction enzyme specificity p... 203 2e-50 UniRef50_A5GE25 Restriction endonuclease S subunits-like protein... 197 5e-49 UniRef50_UPI0001C42656 hypothetical protein BpOF4_03730 n=1 Tax=... 197 8e-49 UniRef50_Q3J746 Restriction modification system DNA specificity ... 
196 1e-48 UniRef50_C6WNJ9 Restriction modification system DNA specificity ... 195 3e-48 UniRef50_A9A374 Restriction modification system DNA specificity ... 194 5e-48 UniRef50_A3JE98 Type I restriction-modification system, S subuni... 194 7e-48 UniRef50_Q31PC5 Type I restriction-modification n=2 Tax=Synechoc... 193 1e-47 UniRef50_UPI0001973978 type I restriction-modification system, S... 191 3e-47 UniRef50_C3DG13 Putative uncharacterized protein n=1 Tax=Bacillu... 191 4e-47 UniRef50_B0A8Q7 Putative uncharacterized protein n=1 Tax=Clostri... 191 7e-47 UniRef50_A8TH56 Restriction modification system DNA specificity ... 190 8e-47 UniRef50_C4ZFR7 Type I restriction-modification system specifici... 190 9e-47 UniRef50_C4LDK7 Restriction modification system DNA specificity ... 189 1e-46 UniRef50_A3ZCQ6 HsdS n=5 Tax=Campylobacter jejuni RepID=A3ZCQ6_C... 188 4e-46 UniRef50_C7P6A9 Restriction modification system DNA specificity ... 186 2e-45 UniRef50_Q307C7 Type I RM system S subunit n=2 Tax=Arthrospira p... 186 2e-45 >UniRef50_P05719 Type-1 restriction enzyme EcoKI specificity protein n=5 Tax=Enterobacteriaceae RepID=T1SK_ECOLI Length = 464 Score = 593 bits (1529), Expect = e-168, Method: Composition-based stats. 
Identities = 464/464 (100%), Positives = 464/464 (100%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV Sbjct: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG Sbjct: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD Sbjct: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS Sbjct: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV Sbjct: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI Sbjct: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG Sbjct: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 >UniRef50_A1AJL9 HsdS, type I site-specific deoxyribonuclease n=2 Tax=Escherichia coli RepID=A1AJL9_ECOK1 Length = 455 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. 
Identities = 207/468 (44%), Positives = 271/468 (57%), Gaps = 17/468 (3%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINY-LKDDYLPLIRANNI---QNGKFDT 56 MSAGKLPEGW + + +I G T K A N+ + + + ++ + Sbjct: 1 MSAGKLPEGWEQIEIGDIADVISGGTPKSGVAENFAPSGEGVAWLTPADLSGYKEKYISH 60 Query: 57 TDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 S K+ P+ ++ S V +A++ + F P Sbjct: 61 GARDLTTLGYSSCSAKLMPKGTILFSSRAPIGYVAIAANEI----ATNQGFKSFAFPSD- 115 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 IF + +F ++ R+ + G I +S + + P AEQKIIAEKLDTLL Sbjct: 116 IFPDYAYYFLRN--IRHIAEEMGTGTTFKEISGSSAKTLPFVLVPFAEQKIIAEKLDTLL 173 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 AQVDSTKAR EQIPQILKRFRQAVLG AV GKLTE WR+ S +++ + + Sbjct: 174 AQVDSTKARLEQIPQILKRFRQAVLGAAVRGKLTEDWRDNSSL-SGWREGKLGEFIKKPS 232 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 G SSK N+ G+ P+LR+ +++ G +D D+ + + E+ ++KL+ D+LF R N S Sbjct: 233 YGTSSKSNKEGL-IPVLRMGNLQGGKLDWTDLVYTSDTI-EIEKYKLEYNDVLFNRTN-S 289 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 E VG + K Q +Y LIR + D P+Y+ +S R + Sbjct: 290 PELVGKTAIYK--SEQPAIYAGYLIRVQCLPDLNPDYLNYHLNSILGRQYCYSVKSDGVS 347 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q I+ + + + + +PP+ EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK Sbjct: 348 QSNINAQKLIAYPITVPPLPEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 407 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 AFRGELTAQWRAENP+LISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 408 AFRGELTAQWRAENPELISGENSAAALLEKIKAERAASGGKKASRKKS 455 >UniRef50_B5BKY5 Subunit S of type I restriction-modification system n=7 Tax=Salmonella enterica subsp. enterica RepID=B5BKY5_SALPK Length = 462 Score = 370 bits (951), Expect = e-101, Method: Composition-based stats. 
Identities = 200/477 (41%), Positives = 267/477 (55%), Gaps = 28/477 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 MS GKLPEGWV +S + + G T K + +R +I G D + + Sbjct: 1 MSGGKLPEGWVTTHLSEICSKPQYGYTTKSSSM------GDVKFLRTTDITKGAVDWSSV 54 Query: 60 VFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLI 117 + ++ DIVI+ VG S Q+ P + F ++ +P Sbjct: 55 PYCMDAPEDVSKYQLQDRDIVISR----AGSVGFSFLVQNPPSQVVFASYLIRFKPVNYF 110 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ F +SS Y N++S +SAG + N+ + +PIPP+AEQKIIAEKLDTLLA Sbjct: 111 SEYYLKRFLESSDYWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLA 170 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKKLNFESILT--- 233 QVDSTKAR EQIPQILKRFRQAVL AV+G L RN P S ++ + S + Sbjct: 171 QVDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQWPDLPSTWSVHK 230 Query: 234 ------ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + K G L +VR D +++ + S+ E L+ GD Sbjct: 231 YSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGD 290 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L G C + + Q +++ L RAR+ +PE++ + S N Sbjct: 291 VLICEGGEP----GRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDS-NNIS 345 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++ + T + K ++GK + + + +PP++EQ EIVRRVEQLFA+ADTIEKQVNNAL RVN Sbjct: 346 LSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVN 405 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 +LTQSILAKAFRGELTAQWRAENP LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 406 SLTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 462 >UniRef50_A9N788 Putative uncharacterized protein n=3 Tax=Salmonella enterica subsp. enterica RepID=A9N788_SALPB Length = 467 Score = 350 bits (898), Expect = 7e-95, Method: Composition-based stats. 
Identities = 189/478 (39%), Positives = 262/478 (54%), Gaps = 25/478 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPE WV + + + G K +A+ ++ P IR + +NG + + + Sbjct: 1 MSGGKLPEEWVKTTIGVICEVKGGKRLPKGKALLNTATEH-PYIRVTDFENGSVNLSTIK 59 Query: 61 FVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF--CGVLRPEKL 116 ++ + + IS D+ I++ +G+ ++G+ Q + A C +L +K Sbjct: 60 YLDSDTYSAISNYTISKNDLYISI-AGTIGLIGEIPEQLDNANLTENAAKLCFILGTDK- 117 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 ++ H S+ + + + + P P+ EQKIIAEKLDTLL Sbjct: 118 ---KYLKHVLSSNKTIEQFDDKTTSSGQPKLALFRIRDCEFPYAPINEQKIIAEKLDTLL 174 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKKLNFESILT-- 233 AQVDSTKAR EQIPQILKRFRQAVL AV+G L RN P S ++ + S + Sbjct: 175 AQVDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQWPDLPSTWSVH 234 Query: 234 -------ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDG 286 + K G L +VR D +++ + S+ E L+ G Sbjct: 235 KYSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLG 294 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 D+L G C + + Q +++ L RAR+ +PE++ + S N Sbjct: 295 DVLICEGGEP----GRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDS-NNI 349 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 ++ + T + K ++GK + + + +PP++EQ EIVRRVEQLFAYADTIEKQVNNAL RV Sbjct: 350 SLSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAYADTIEKQVNNALTRV 409 Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 N+LTQSILAKAFRGELTAQWRAENP LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 410 NSLTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 467 >UniRef50_B7LQL4 Specificity determinant for hsdM and hsdR (Modular protein) n=2 Tax=Escherichia RepID=B7LQL4_ESCF3 Length = 502 Score = 346 bits (888), Expect = 1e-93, Method: Composition-based stats. 
Identities = 199/517 (38%), Positives = 264/517 (51%), Gaps = 68/517 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWV + V + G T + Y + +P I+ ++ Sbjct: 1 MSAGKLPEGWVETNLQNVASWGSGGTPSRNHDEYY--NGNIPWIKTGDLGPKIITNASEY 58 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + S K P+ V G + +GK++ L + + C V P + I S Sbjct: 59 ITDAGVQNSSAKFFPKGSVAIAMYG--ATIGKTSI--LGIDATTNQACAVGTPLEGITST 114 Query: 121 -FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ +F + +N G NI I +PPLAEQKII EKLDTLLAQV Sbjct: 115 LFLYYFLLNE--KNAFIKKGKGGAQPNISQTVIKEHIIYLPPLAEQKIITEKLDTLLAQV 172 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV----------------- 222 DSTKAR EQIPQILKRFRQAVL AVNGKLTE WR+ + + Sbjct: 173 DSTKARLEQIPQILKRFRQAVLERAVNGKLTECWRDCVGELTSAEEIITEIKKYRKASLS 232 Query: 223 --------------------------------FKKLNFESILTELRNGLSSKPNESGVGH 250 + F + ++ + + G Sbjct: 233 TEGSSASTESKRQIAKIEKHCFKVPKINLPKGWVWTTFLQSMEKVVDCHNKTAPYVDQGI 292 Query: 251 PILRISSVRAGHVDQNDIRFLECSES--ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 ++R +R G + ++ ++++ R + GD++FTR +G G++ + Sbjct: 293 HLIRTPDIRNGVISLDNTKYIDNDTYLYWSKRCPPRSGDIIFTREAP----MGEAGIVPE 348 Query: 309 LQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + +++ R + +Y+ + S S + M++ +G K + D++S Sbjct: 349 NTI--ICMGQRMMLLRPIPEYIHNKYVLLNILSSSFQTRMISQAI-GTGVKHLRVADVES 405 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 LPP++EQ EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR Sbjct: 406 LTYPLPPIEEQHEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 465 Query: 428 AENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 AENPDLISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 466 AENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 502 >UniRef50_A4BLE7 Type I restriction enzyme StySPI specificity protein n=2 Tax=Proteobacteria RepID=A4BLE7_9GAMM Length = 496 Score = 333 bits (854), Expect = 8e-90, Method: Composition-based stats. 
Identities = 147/506 (29%), Positives = 232/506 (45%), Gaps = 53/506 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M LPE W V+ + LIRGVTYKK +A + + PL+RANNI NG+ + DLV Sbjct: 1 MENRALPENWARCRVTELAQLIRGVTYKKSEASKESQPGFAPLLRANNI-NGRINHEDLV 59 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 +V + + Q + D++IAMSSGS +VGK+A +FG+FCG LRP I Sbjct: 60 YVREARISNEQWLKESDVLIAMSSGSIGLVGKAAQLRKVKGETFGSFCGALRPTSEIDCH 119 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F F ++ YR +S + G+NINN+K ++ P+PP EQ+ I EK++TL +++D Sbjct: 120 FFGWFFQTRTYRECVSGDAKGSNINNLKRDHILHVDFPLPPANEQRRIVEKIETLFSRLD 179 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH-------------------- 220 + + ++L R+RQ+VL AV G+LT WR Sbjct: 180 KGEEALRDVQKLLSRYRQSVLKAAVTGQLTADWRAENAHRLEHGRDLLARILQTRRESWE 239 Query: 221 ----------------------SVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSV 258 V+ L + + + +++ V P LR+++V Sbjct: 240 GRGKYKEPIAPSTSGLPDLPDGWVWASLAQLTHIKGGVTVDKKRESKNPVTVPYLRVANV 299 Query: 259 RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD 318 + GH+D +I+ + + + + L+ GD+L G + +G G + Q ++ + Sbjct: 300 QNGHIDLTEIKEITVNRDKAEQTLLKAGDILLNE-GGDRDKLGR-GWVWDGQIAPCIHQN 357 Query: 319 KLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQ 378 + RAR + ++++ + M K + IS I + LP EQ Sbjct: 358 HVFRARPVIPEISSRFVSYYANAFGQGFFMQKGKQSVNLASISLTAISGFPIALPSADEQ 417 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGEN 438 EIV R+E+ T+ + L R L QSIL AF G L Q ++ P Sbjct: 418 REIVGRLEEKLIEVATVAEWCKTELTRSAALRQSILKDAFTGRLVPQNPSDEP------- 470 Query: 439 SAAALLEKIKAERAASGGKKASRKKS 464 AA LL +I+A R A+ K RK + Sbjct: 471 -AAELLARIRAARQAAPKGKTRRKAT 495 >UniRef50_P06990 Type-1 restriction enzyme EcoBI specificity protein n=2 Tax=Escherichia coli RepID=T1SB_ECOLX Length = 474 Score = 328 bits (840), Expect = 3e-88, Method: Composition-based stats. 
Identities = 191/471 (40%), Positives = 264/471 (56%), Gaps = 35/471 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W+ + +V + G +K + N + D +PLIR ++ G T +P+ Sbjct: 23 DSWLRISMDSVANITNGFAFKSSEFNN--RKDGVPLIRIRDVLKGNTSTYYSGQIPEG-- 78 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + PED+++ M + + S L C + E F H Sbjct: 79 ---YWVYPEDLIVGMDGDFNATIWCSEPALLN-----QRVCKIEVQEDKYNKRFFYHAL- 129 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 Y + I++ ++ + ++ + +P+PPLAEQKIIAEKLDTLLAQVDSTKAR E Sbjct: 130 -PGYLSAINANTSSVTVKHLSSRTLQDTLLPLPPLAEQKIIAEKLDTLLAQVDSTKARLE 188 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL--------------NFESILT 233 QIPQILKRFRQAVL AV G+LT++ ++F + N + Sbjct: 189 QIPQILKRFRQAVLAAAVTGRLTKEDKDFITKKVELDNYKILIPEDWSETILNNIINTQR 248 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTR 292 L G+ ++ G ++R+ + G VD N +R + + R K++ D+L T Sbjct: 249 PLCYGVVQPGDDIKDGIELIRVCDINDGEVDLNHLRKISKEIDLQYKRSKVRKNDILVTI 308 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 +G G++++ + N+ I K +P ++ I+ SSP + ++ K Sbjct: 309 VGA----IGRIGIVREDINVNIARAVARISPEY-KIIVPMFLHIWLSSPVMQTWLVQSSK 363 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 +K ++ KD+K+ V LP ++EQ EIVRRVEQLFAYAD+IEKQVNNALARVNNLTQS Sbjct: 364 E-VARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVNNALARVNNLTQS 422 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK Sbjct: 423 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 473 Score = 124 bits (312), Expect = 5e-27, Method: Composition-based stats. 
Identities = 46/216 (21%), Positives = 88/216 (40%), Gaps = 8/216 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK- 64 +PE W ++ + R + Y Q + +KD + LIR +I +G+ D L + K Sbjct: 231 IPEDWSETILNNIINTQRPLCYGVVQPGDDIKD-GIELIRVCDINDGEVDLNHLRKISKE 289 Query: 65 -NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 +L + K+ DI++ + +G+ + + PE K+I F+ Sbjct: 290 IDLQYKRSKVRKNDILVTIVGA----IGRIGIVREDINVNIARAVARISPEYKIIVPMFL 345 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + S + + + S + +P+P + EQ I +++ L A DS Sbjct: 346 HIWLSSPVMQTWLVQSSKEVARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSI 405 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 + + + Q++L A G+LT +WR P Sbjct: 406 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 441 >UniRef50_B3YJG5 Type I restriction enzyme EcoKI specificity protein n=3 Tax=Gammaproteobacteria RepID=B3YJG5_SALET Length = 486 Score = 318 bits (814), Expect = 3e-85, Method: Composition-based stats. Identities = 186/502 (37%), Positives = 261/502 (51%), Gaps = 54/502 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWV + + K + + +D ++ +I+ + Sbjct: 1 MSAGKLPEGWVDTQLGNIVDYG-----KATKRVLSDVNDDTWVLELEDIEKESSKLLSTI 55 Query: 61 FVPKNLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + K ++ D++ + + A + C E + Sbjct: 56 RASERPFKSTKNSFKRGDVLYGKLRPYLNKI-IIAKEDGVCTTEIIPLCA----EPSCCN 110 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +I ++ KSS ++ ++ +S G N+ + A + + PLAEQKIIAEKLDTLLAQ+ Sbjct: 111 KYIFYWLKSSTFQGYVNDVSYGVNMPRLGTADGLKAPLRLAPLAEQKIIAEKLDTLLAQI 170 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE---------------------- 217 DSTKAR EQIPQILKRFRQAVL AV+G LT +WR Sbjct: 171 DSTKARLEQIPQILKRFRQAVLAAAVSGNLTAEWRMNNNSNIVEEEIEKVKNKLIAKKII 230 Query: 218 -------------PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD 264 P S + + +SI T++ +G P G ++ +++ G++ Sbjct: 231 KKDLIYSKLDRKYPIPSDWLYVKLQSIATKITDGEHKTPKREPAGQLLISARNIQDGYLK 290 Query: 265 
QNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 +D+ ++ +E + NR GD+L + +G L+ + ++ LI+ Sbjct: 291 LSDVDYVGDAEFQKLRNRCDPDSGDVLISCSGS----IGRVCLVDENSKYVMVRSVALIK 346 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 + + +Y+ SP + + K+T+ Q + IK+ + LPPV EQAEIV Sbjct: 347 L-MQDFVINKYMMYLLQSPLLQKEIEENSKSTA-QANLFLGPIKNLGIPLPPVPEQAEIV 404 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 RRVEQLFAYADTIEKQVN+AL RVN+LTQSILAKAFRGELTAQWR ENP LISGENSAAA Sbjct: 405 RRVEQLFAYADTIEKQVNSALTRVNSLTQSILAKAFRGELTAQWRTENPSLISGENSAAA 464 Query: 443 LLEKIKAERAASGGKKASRKKS 464 LLEKIKAERAASGGKK SRKK+ Sbjct: 465 LLEKIKAERAASGGKKTSRKKA 486 >UniRef50_C5SE02 Restriction modification system DNA specificity domain protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SE02_CHRVI Length = 448 Score = 311 bits (796), Expect = 4e-83, Method: Composition-based stats. Identities = 116/452 (25%), Positives = 205/452 (45%), Gaps = 29/452 (6%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W P+ V ++ G +K + N + P+IR ++ +G T +P Sbjct: 25 WERVPLGDVCDILNGFPFKSQHFNN---SEGAPVIRIRDVTSGFCKTFYSGDIPVG---- 77 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 + P D+V+ M + S L C + E + F+++ Sbjct: 78 -YWVEPFDMVVGMDGDFNCRLWSSERSLLN-----QRVCKLTPHEDFLDKKFLSYVL--P 129 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 Y I+ + + ++ + I P+PPLAEQ+ I KLD L + + I Sbjct: 130 AYLRLINDHTHSITVKHLSSKTIAKIPFPLPPLAEQRRIVAKLDRLFERTRRAREELSHI 189 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG 249 P++++ +++A+L A G LT+ WR + + K++ + +L G S+K ++SG Sbjct: 190 PRLIENYKKAILVAAFRGDLTKDWRE-KRGLPMPKEVKLGEVAKKLSYGTSAKSSKSGD- 247 Query: 250 HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 P+LR+ +++ +D D+ + E E+ ++ L GD+LF R N S E VG + K Sbjct: 248 VPVLRMGNIQNMRIDWKDLVYTSDVE-EIEKYSLNAGDVLFNRTN-SPELVGKTAIYKG- 304 Query: 310 
QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 + +Y LI+ + +PEY+ +SP R+ Q I+ K + Sbjct: 305 -ERPAIYAGYLIKIKCGNRLVPEYLNYCLNSPLGRSYCWRVKSDGVSQSNINAKKLADFS 363 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAE 429 LLP EQ EIV R+E+ + D++ + A +++L Q+ LAKAFRGEL Q ++ Sbjct: 364 FLLPTHDEQKEIVFRIEKTLDWLDSLVIEERQASHLLDHLDQANLAKAFRGELVPQDPSD 423 Query: 430 NPDLISGENSAAALLEKIKAERAASGGKKASR 461 P A+ LLE+I A+R + ++ Sbjct: 424 EP--------ASVLLEQIYADREKQVKIRKNK 447 >UniRef50_A3EKX4 Type I restriction modification DNA specificity domain protein n=1 Tax=Vibrio cholerae V51 RepID=A3EKX4_VIBCH Length = 466 Score = 310 bits (793), Expect = 9e-83, Method: Composition-based stats. Identities = 148/483 (30%), Positives = 228/483 (47%), Gaps = 54/483 (11%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP+GWV +S L G +K +D +IR N+Q+G ++ Sbjct: 1 MS--QLPKGWVCTSISQCFELKNGYAFKSSD----YTEDGDFVIRIGNVQDGHIILSNPA 54 Query: 61 FVP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 +V + L +S K++ DI+I+++ G+ +G + +HLP + + Sbjct: 55 YVAAEKLGADSFKLNEGDILISLT-GNVGRIGMVSKEHLP--AVLNQRVAKICVVNSVEI 111 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + ++ L++ + SL+ GA NI + +PPLAEQ I EKLD +LAQV Sbjct: 112 RWLFYLLRTRLFQQHVLSLAKGAAQLNISTKDIQSFDFALPPLAEQTRIVEKLDEVLAQV 171 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 D+ KAR + IP ILKRFRQ+VL AV+GKLTE+WR P K+ T+L + Sbjct: 172 DTIKARLDGIPAILKRFRQSVLAAAVSGKLTEEWRQLNPNQPSHPKVGKVKYKTDLFDSA 231 Query: 240 SSK----------------------------PNESGVGHPILRISSVRAG--HVDQNDIR 269 S + G LR+S+VR +D +D++ Sbjct: 232 SKSLPELPPEWLVIPAAHLLEYVTSGSRGWANYYASSGALFLRMSNVRYDTTKLDLSDLQ 291 Query: 270 FLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 ++ E+ E R +++ DL+ + VG + + + L AR Sbjct: 292 YVNLPENVEGKRSLVKENDLVISIT----ADVGRVARVDSEIEEAYV-NQHLALARPASH 346 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 E++ +S + + +K + 
+ G+ DI+S + P + EQ EIVR V+Q Sbjct: 347 IDAEFLAKCIASVNIGIKQVQALKRGATKAGLGLDDIRSMAIPFPHLAEQKEIVRLVDQY 406 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 FA+ADTIE V A ARV+ LTQSILAKAFRGEL Q + P A LLE+I Sbjct: 407 FAFADTIEALVKKAQARVDKLTQSILAKAFRGELVPQDPNDEP--------ADKLLERIA 458 Query: 449 AER 451 R Sbjct: 459 TAR 461 >UniRef50_C9RY89 Restriction modification system DNA specificity domain protein n=2 Tax=Geobacillus RepID=C9RY89_GEOSY Length = 477 Score = 307 bits (785), Expect = 8e-82, Method: Composition-based stats. Identities = 105/450 (23%), Positives = 194/450 (43%), Gaps = 41/450 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV V G T +++ Y D +P I+ + +G ++ + Sbjct: 26 EVPGNWVWVRSGHVAKWGSGGTPSRKRLEYYGGD--IPWIKTGELNDGIITGSEETITEE 83 Query: 65 NLVKESQKISPED-IVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L K S KI P+ IVIAM + +G L + + C V +P + + S ++ Sbjct: 84 GLQKSSAKIFPKGSIVIAMYGATIGRLGI-----LGIDAATNQACAVGQPYEFLDSKYMF 138 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ + R+ + +L G NI +PPL EQK IA+K++ L A++D K Sbjct: 139 YYFFAR--RSDLVALGKGGAQPNISQTIIKDFPFALPPLNEQKRIADKIERLFAKIDEAK 196 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL------------------------TEKWRNFEPQ 219 E++ + +++ R +L A G+L E+W P Sbjct: 197 RLIEEVKESIEQRRAVMLEKAFKGQLGTNDPSEKSILETSDDLSEKDVIPKEQWPYEVPG 256 Query: 220 HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 + + + +S L L+ G ++ + G LRI+ ++ +VD + + + + L Sbjct: 257 NWTW--IKLKSCLKRLQYGYTATSSTLTEGPKYLRITDIQNDNVDWETVPYCKIDDKLLE 314 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 ++KL GD++ R + G L+ + + ++ LIR + ++ P Y+ + Sbjct: 315 KYKLNKGDIVIARTGAT---TGKSFLIDDMPFCS-VFASYLIRLTMNENLNPYYLWNYLK 370 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 S + VK Q G + + I +V LPPV EQ I +++ L + ++ V Sbjct: 371 SSMYWKQI-TIVKKGIAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLENEKQLV 429 Query: 400 
NNALARVNNLTQSILAKAFRGELTAQWRAE 429 +++ L QS+L KAFRGEL + Sbjct: 430 LAVEEKLDLLKQSVLQKAFRGELGTNDPND 459 Score = 141 bits (356), Expect = 5e-32, Method: Composition-based stats. Identities = 55/224 (24%), Positives = 97/224 (43%), Gaps = 11/224 (4%) Query: 5 KLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 ++P W + + L G T K +R +IQN D + + Sbjct: 253 EVPGNWTWIKLKSCLKRLQYGYTATSSTLTEGPK-----YLRITDIQNDNVDWETVPYCK 307 Query: 64 -KNLVKESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + E K++ DIVIA + + GKS +PF F ++ L + + + Sbjct: 308 IDDKLLEKYKLNKGDIVIARTGATT---GKSFLIDDMPFCSVFASYLIRLTMNENLNPYY 364 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ KSS+Y +I+ + G + +P+PP+ EQK IAEKLD LL ++++ Sbjct: 365 LWNYLKSSMYWKQITIVKKGIAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLEN 424 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKK 225 K + + L +Q+VL A G+L N + K+ Sbjct: 425 EKQLVLAVEEKLDLLKQSVLQKAFRGELGTNDPNDGHAMELVKE 468 Score = 122 bits (306), Expect = 3e-26, Method: Composition-based stats. 
Identities = 41/246 (16%), Positives = 90/246 (36%), Gaps = 12/246 (4%) Query: 197 RQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRIS 256 + +L A+ K + + P + V+ + + + G P ++ Sbjct: 9 MEQLLEEALVPKDEQPYE--VPGNWVWVRSGHVAKWGSGGTPSRKRLEYYGGDIPWIKTG 66 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 + G + ++ E + + G ++ Y ++ +G+ G+ + Sbjct: 67 ELNDGIITGSEETITEEGLQKSSAKIFPKGSIVIAMYGATIGRLGILGIDAATNQACAVG 126 Query: 317 PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK 376 + +Y+ +F AR + + + Q IS IK LPP+ Sbjct: 127 Q-------PYEFLDSKYMFYYF---FARRSDLVALGKGGAQPNISQTIIKDFPFALPPLN 176 Query: 377 EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISG 436 EQ I ++E+LFA D ++ + + +L KAF+G+L +E L + Sbjct: 177 EQKRIADKIERLFAKIDEAKRLIEEVKESIEQRRAVMLEKAFKGQLGTNDPSEKSILETS 236 Query: 437 ENSAAA 442 ++ + Sbjct: 237 DDLSEK 242 >UniRef50_P06187 Type-1 restriction enzyme StySJI specificity protein n=8 Tax=Enterobacteriaceae RepID=T1S_SALTY Length = 469 Score = 300 bits (768), Expect = 8e-80, Method: Composition-based stats. 
Identities = 188/483 (38%), Positives = 262/483 (54%), Gaps = 33/483 (6%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPEGW + ++ + L K + + L ++P+ GK + Sbjct: 1 MSGGKLPEGWATSTINEMCNLN-----PKLKLDDDLDVGFMPMAGVPTTYLGKCNFETKK 55 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGA----FCGVLRPEKL 116 + VK+ D VI GK A F +GA + + L Sbjct: 56 WSE---VKKGFTQFQNDDVIFAKITPCFENGK-AVVIKEFPNGYGAGSTEYYVLRSINGL 111 Query: 117 IFSGFIAHFTKSSLY-RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I ++ K+ + N ++S + + +P+PPLAEQK+IAEKLDTL Sbjct: 112 INPHWLFALVKTKDFLTNGALNMSGSVGHKRVTKEFLENYGVPVPPLAEQKVIAEKLDTL 171 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF---------EPQHSVFKKL 226 LAQVDSTKAR EQIPQILKRFRQ+V+ AVNG+LT++ S++K + Sbjct: 172 LAQVDSTKARLEQIPQILKRFRQSVIVAAVNGQLTKELHKKNKFKLTELNISIPSLWK-I 230 Query: 227 NFESILTELRNGLSSKPNES----GVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRH 281 + +++ G ES G P +R ++ G V +LE ++R+ Sbjct: 231 SEIGQFADVKGGKRLPKGESLIAENTGFPYIRAGQLKNGTVLPEGQLYLEEYIQKSISRY 290 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 + GDL T +G G++ + + L + L ++ ++ ++ S Sbjct: 291 TVSSGDLYITIVGAC---IGDAGIIPDVYNNANLTENAAKICNLNENIFNRFLSLWLRSS 347 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 ++ + + +K+ + Q ++ IKS ++LPP++EQ EIVRRVEQLFAYADTIEKQVNN Sbjct: 348 YLQDIINSEIKSGA-QGKLALARIKSLPLILPPLQEQHEIVRRVEQLFAYADTIEKQVNN 406 Query: 402 ALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASR 461 AL RVN+LTQSILAKAFRGELTAQWRAENP+LISGENSAAALLEKIKAERAASGGKK SR Sbjct: 407 ALTRVNSLTQSILAKAFRGELTAQWRAENPELISGENSAAALLEKIKAERAASGGKKTSR 466 Query: 462 KKS 464 KK+ Sbjct: 467 KKA 469 >UniRef50_A1BGI9 Restriction modification system DNA specificity domain n=2 Tax=cellular organisms RepID=A1BGI9_CHLPD Length = 479 Score = 298 bits (762), Expect = 4e-79, Method: Composition-based stats. 
Identities = 116/492 (23%), Positives = 213/492 (43%), Gaps = 69/492 (14%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLVFVPKNLVK 68 VIA + V I G +K + + LP+IR N+ N KF+ ++ VF + Sbjct: 7 VIAILGDVAEYINGRAFKPSE----WGKEGLPIIRIKNLNDENSKFNYSNEVF------E 56 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 + + D++ A S+ + + K E +++P I ++ +F Sbjct: 57 KRYLVKKGDLLFAWSASLGAYIWK------KDEAWLNQHIFLVKPSPFIAKLYLYYFL-- 108 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 ++ S + G+ + ++ F+ I +PPL+EQ+ I K++ L +++D+ A ++ Sbjct: 109 DKITQELYSAAHGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKK 168 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFE------------------------------- 217 + LK +RQAVL A G+LT+ WR + Sbjct: 169 AQEQLKVYRQAVLKQAFEGELTKSWREQQANLPSAQDLLDTIKTEREQAAKNQGKKLKPV 228 Query: 218 ------------PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQ 265 + + + + G S+K E G P++R+ +++ G +D Sbjct: 229 TPLAKVELDELTELPDGWCWIKLGELTIGVEYGTSTKSLEKGE-VPVIRMGNIQQGRIDW 287 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 ND+ F + ++++++++L GD+LF R N S E VG + ++ LIR Sbjct: 288 NDLAFTD-DKADISKYRLLKGDVLFNRTN-SPELVGKAAIYNG--EMPAIFAGYLIRVNQ 343 Query: 326 TKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 K+ L +Y+ F +S A+ + Q I+G+ +KS + KEQ +IV+ Sbjct: 344 IKELLHCKYLNFFLNSHPAKVYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQE 403 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 +E + D +E + +L + L QSIL KAF G+L ++ A LL Sbjct: 404 IEARLSVCDNMEATIRESLEKAEALRQSILKKAFEGKLLSEEELTATRNDPDWEPAEKLL 463 Query: 445 EKIKAERAASGG 456 E+I+AE+ S Sbjct: 464 ERIRAEKNQSKK 475 Score = 147 bits (370), Expect = 1e-33, Method: Composition-based stats. 
Identities = 48/210 (22%), Positives = 92/210 (43%), Gaps = 8/210 (3%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP+GW + +T + T K L+ +P+IR NIQ G+ D DL F Sbjct: 241 ELPDGWCWIKLGELTIGVEYGTSTKS-----LEKGEVPVIRMGNIQQGRIDWNDLAFTDD 295 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIA 123 ++ D++ ++ S +VGK+A + F + + ++L+ ++ Sbjct: 296 KADISKYRLLKGDVLFNRTN-SPELVGKAAIYNGEMPAIFAGYLIRVNQIKELLHCKYLN 354 Query: 124 HFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F S + +S+ G N +NI +P EQ+ I ++++ L+ D+ Sbjct: 355 FFLNSHPAKVYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQEIEARLSVCDNM 414 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 +A + + + RQ++L A GKL + Sbjct: 415 EATIRESLEKAEALRQSILKKAFEGKLLSE 444 Score = 123 bits (309), Expect = 1e-26, Method: Composition-based stats. Identities = 62/231 (26%), Positives = 102/231 (44%), Gaps = 21/231 (9%) Query: 234 ELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 E NG + KP+E G G PI+RI ++ + +F +E R+ ++ GDLLF Sbjct: 16 EYINGRAFKPSEWGKEGLPIIRIKNLND-----ENSKFNYSNEVFEKRYLVKKGDLLFA- 69 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 ++ SL + KK + + + + Y+ F + Sbjct: 70 WSASLG----AYIWKKDE---AWLNQHIFLVKPSPFIAKLYLYYFLDKI---TQELYSAA 119 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 SG ++ K + + LPP+ EQ IV ++EQLF+ D + A ++ Q+ Sbjct: 120 HGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKKAQEQLKVYRQA 179 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 +L +AF GELT WR + +L SA LL+ IK ER + + + K Sbjct: 180 VLKQAFEGELTKSWREQQANLP----SAQDLLDTIKTEREQAAKNQGKKLK 226 >UniRef50_C9NQJ7 HsdS type I site-specific deoxyribonuclease n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NQJ7_9VIBR Length = 563 Score = 296 bits (759), Expect = 9e-79, Method: Composition-based stats. 
Identities = 134/459 (29%), Positives = 222/459 (48%), Gaps = 36/459 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDD-YLPLIRANN--------IQNGKFD 55 KLP WV + + ++ G T K +N+ + + + + I NG+ D Sbjct: 3 KLPFNWVETEIGNLALVVSGGTPKAGDELNFAEPGAGIAWVTPADLSGYKQKEIANGRRD 62 Query: 56 TTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK 115 + PK L S K+ P+ ++ S V A + F +F Sbjct: 63 LS-----PKGLDSSSAKLMPKGTLLFSSRAPIGYVAI-AENEISTNQGFKSFIF----TD 112 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + S + ++ KS ++ S +G + A + + PL EQ IA+KLD++ Sbjct: 113 HVNSTYAYYYLKS--IKDLAESWGSGTTFKELSGAVAKKLPFRLAPLNEQIRIADKLDSI 170 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 LA+VD + R ++IP ILKRFRQ+VL A +G+LT +WR E + + ++ +S+ Sbjct: 171 LAKVDHAQERLDKIPDILKRFRQSVLAAATSGELTREWR--EGKEHQWPRVQLKSVGRGF 228 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 G S+K G P+LR+ +++ G + +++ + E E++++ L+ GD+LF R N Sbjct: 229 NYGSSAKSKPEGE-VPVLRMGNLQGGQLHWDNLVYTSDKE-EIDKYLLEKGDVLFNRTN- 285 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 S E VG + + Q +Y LIR + ++ E++ I +SP AR+ Sbjct: 286 SPELVGKTSIYRG--EQKAIYAGYLIRIKGSEHLDTEFLNIQLNSPHARDYCWQVKTDGV 343 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q I+ K +++ LP + EQ EIVRRV +LF+ AD E Q + +N LTQSIL Sbjct: 344 SQSNINAKKLQAYEFDLPEIDEQLEIVRRVSELFSRADLFEYQYLASKKYLNRLTQSILV 403 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 KAF G+L Q + SA+ LL+ I++E A+ Sbjct: 404 KAFNGQLVPQEPTDE--------SASELLKLIESEMVAN 434 >UniRef50_A7IEA1 Restriction modification system DNA specificity domain n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IEA1_XANP2 Length = 450 Score = 296 bits (758), Expect = 1e-78, Method: Composition-based stats. 
Identities = 118/475 (24%), Positives = 219/475 (46%), Gaps = 39/475 (8%) Query: 1 MS--AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN--------IQ 50 MS ++P W+ A V ++ G T N+ K +P + + I Sbjct: 1 MSEARWQVPHSWLWASFGEVADIVGGGTPPTGDEANFTK-QGVPWLTPADLTGYRETYIS 59 Query: 51 NGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGV 110 G+ D ++ K + + ++ P+ V+ S++ VG A + G + Sbjct: 60 RGRRDLSE-----KGYRESAARLLPKGTVLFS---SRAPVGYCAIASENVSTNQGFKSFI 111 Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE 170 L + I ++ H+ S S ++G + + + +P+PPL EQ+ I Sbjct: 112 L--KGDISPEYVRHYLLGST--EYAESKASGTTFKELSGSRATELALPLPPLPEQRRIVA 167 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFES 230 K+D+L A+ + E IP+++++++QA+L A +G+LTE + + + + F Sbjct: 168 KIDSLTAKSRRARDHLEHIPRLVEKYKQAILAAAFDGRLTELSPHDIVHPELGELIEF-- 225 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVD-QNDIRFLECSESELNRHKLQDGDLL 289 +NGL + G G PILRI + +D + + S++ + + DGDL+ Sbjct: 226 ---GPQNGLYLPKDRYGEGTPILRIQNYGFNFIDEPTNWHRVTVSDAIAAQFAMSDGDLI 282 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 R N S +G ++ K ++ ++R RL A P++++++ SS R ++ Sbjct: 283 INRVN-SPSHLGKSMVVTKAM-AGAIFESNMMRIRLNALAEPKFVQLYLSSSQGRGSLTK 340 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K Q I+ D+ V LP + +Q ++ R+E FA+ D + + +A ++ L Sbjct: 341 DAKWAVNQASINQGDVSRTPVPLPGLSDQIAVLDRIETAFAWIDRLAAEATSARTLIDRL 400 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 Q++LAKAFRGEL Q A+ P A+ LLE+I+AER A+ + R+ + Sbjct: 401 DQAVLAKAFRGELVPQDPADEP--------ASVLLERIRAERGAAPKARRGRRPA 447 >UniRef50_C3MFA1 Putative restriction endonuclease type I, S subunit n=1 Tax=Rhizobium sp. NGR234 RepID=C3MFA1_RHISN Length = 496 Score = 295 bits (756), Expect = 2e-78, Method: Composition-based stats. 
Identities = 121/511 (23%), Positives = 208/511 (40%), Gaps = 64/511 (12%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP GW + + + + G T K+ Y + +P I + + + D Sbjct: 1 MS--ELPRGWCVTTIQEIADVGTGATPKRGTRAFY-ESGTIPWITSGAVSQRQITYADEF 57 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + K+ P ++ G G A + + VL + ++ S Sbjct: 58 ITEAAIRSTNCKVFPTGTILVAMYGEGKTRGSVARLAIDAATNQALAAIVLPNDDIVSSE 117 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ +F S +++ L+AG N+ + P+PPLAEQK I KLD L A+ Sbjct: 118 FLMNFLTSQY--SQLRGLAAGGVQPNLNLQLIRSTSFPLPPLAEQKRIVAKLDALSAKSA 175 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-------------------------- 214 + +I ++ R++QAVLG A +G+LT +R Sbjct: 176 RARTELARIETLVYRYKQAVLGKAFSGELTVDFRLSRRHLQSEAKAGSIHGEEGVERKLK 235 Query: 215 -----------NFEPQHSVFKKL-------NFESILTELRNGLSSKPNES-GVGHPILRI 255 P + + N + + G K + G PI+ + Sbjct: 236 VRGTTDVMKGIQLSPLPESWNWVKNHRLAQNRANAICAGPFGTIFKAKDFRDKGIPIIFL 295 Query: 256 SSVRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQN 313 V AG + F++ + + + G+LL T+ GV + Sbjct: 296 RHVAAGEYRTHKPGFMDKKVWQELHQPYSVFGGELLVTKLGDPP---GVACIFPAGVGTA 352 Query: 314 LLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP 373 ++ PD + + ++P+++ +F+SP A+N +++ + + + K+ V P Sbjct: 353 MVTPDVMKMSVDENASVPKFLMFYFNSPIAKN-IIHQLAFGLTRLRVDLAMFKTFPVPHP 411 Query: 374 PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDL 433 ++EQ EIVRR+E FA D + + AL V L ++ILAKAFRGEL Q + P Sbjct: 412 SLEEQLEIVRRIESAFAKIDRLAAEAKRALDLVGKLDEAILAKAFRGELVPQDENDEP-- 469 Query: 434 ISGENSAAALLEKIKAERAASGGKKASRKKS 464 A LLE+I+AERAA+ K R + Sbjct: 470 ------AENLLERIRAERAAAPKAKRGRGNA 494 >UniRef50_Q1MKB2 Putative type I restriction enzyme specificity subunit n=2 Tax=Alphaproteobacteria RepID=Q1MKB2_RHIL3 Length = 456 Score = 294 bits (753), Expect = 4e-78, Method: Composition-based stats. 
Identities = 128/477 (26%), Positives = 217/477 (45%), Gaps = 35/477 (7%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTD 58 MS LP+GWV A + + + + + + + + + + G Sbjct: 1 MSG--LPKGWVEATLEELCQ------FNPKHDPDVDQSLGVNFVPMPAVDDETGAIIDKS 52 Query: 59 LVFVPKNLVKESQKISPEDIVI-----AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP 113 +V + K + D++ M +G + A VLR Sbjct: 53 VVRPLSEIWKGYTHFADRDVIFAKITPCMENGKIA----VARDLANGMACGSTEFHVLRS 108 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKL 172 + + F+ F + YR GA + + ++P+PPL EQK I KL Sbjct: 109 KGAVEPDFLWRFLRRKNYRQVAEHSMTGAVGQRRVPRQFLETTSLPLPPLNEQKRIVAKL 168 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESIL 232 DTL A+ + +I ++ RF+QAVL A +G+LT+ WR+ + + ++ L ++ Sbjct: 169 DTLNAKSARARTELARIEILVSRFKQAVLSKAFSGELTKDWRSGQTTLAPWENLPLSQLV 228 Query: 233 TE-LRNGLSSKPNESGVGHPILRISSVRAGHV--DQNDIRFLECSESELNRHKLQDGDLL 289 + NG S K + G L++S+ +G + D++ I++L+ + E ++ L D++ Sbjct: 229 SHGPSNGWSPKADGKVSGLKSLKLSATSSGRLRLDESTIKYLDQTLPEDSKFWLLSDDIV 288 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMM 348 R N SLE +G L + ++PD ++R R+ K P Y+ + +S SAR+ Sbjct: 289 IQRAN-SLELLGTTVLFDGPPGE-FIFPDLMMRIRVNDKKTNPRYLATYLNSDSARSYFR 346 Query: 349 NCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++G I+G ++ V PP++EQ EIV R+E FA D + + AL V Sbjct: 347 ANATGSAGNMPKINGSTVRETRVPTPPLEEQQEIVHRIESAFAMTDRLAAEAMRALDLVG 406 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 L ++ILAKAFRGEL Q + P A LLE+I+AER A+ K R+K+ Sbjct: 407 KLGEAILAKAFRGELVPQDENDEP--------AEKLLERIRAEREAAPEAKRGRRKT 455 >UniRef50_Q210J8 Type I restriction enzyme StySPI specificity protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q210J8_RHOPB Length = 460 Score = 294 bits (752), Expect = 5e-78, Method: Composition-based stats. 
Identities = 122/471 (25%), Positives = 197/471 (41%), Gaps = 30/471 (6%) Query: 3 AGKLPEGWVIAPVSTVTTL----IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD 58 G LP GWV AP+ + L I Y ++ + ++R NI +F + D Sbjct: 2 TGDLPSGWVAAPIDDLRALEPNAITDGPYGSSLKTSHYRSSGARVVRLGNIGFRRFLSAD 61 Query: 59 LVFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEK 115 V++ ++ K + D++IA VG+S A C LR Sbjct: 62 AVYISEDHFKALVKHHVRAGDVLIAALG---DPVGRSCIAPSDISPALVKADCFRLRCSP 118 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + + FI + S R SS + G I + F +P+PP EQ I K+D L Sbjct: 119 HLSAPFIMLWLNSECAREAFSSAAHGLGRVRINLSDFRTTVVPVPPATEQGRIVAKIDNL 178 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-NFEPQHSVFKKLNFESILTE 234 A+ ++ + IPQ++++++QA+L A G+LT +WR N Q + + + S + Sbjct: 179 SAKSKRSRDHLDHIPQLVEKYKQAILAAAFRGELTHEWRVNNLDQKWPWPECSL-SDIAN 237 Query: 235 LRNGLSSKP----NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + G + K S P + +V+ V D E + E N G +L Sbjct: 238 IGTGATPKRGEQRYYSNGNIPWITSGAVKHAVVQAADEYITEAAVRETNCKVFPAGTILM 297 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 Y G +L N I+ R A+ +++ S + Sbjct: 298 AMYGEGKTR-GRVTVLGINAATN--QAVAAIQVRADSPAVRDFVVWHLRSGYL--ELRER 352 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 Q ++ + + + LP EQ E+VRRV++ FA+ D + + +A ++ L Sbjct: 353 AAGGV-QPNLNLGIVNAWRIPLPSRDEQMEVVRRVQKAFAWIDRLTIETTSARKLIDRLD 411 Query: 411 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASR 461 Q+ILAKAFRGEL Q + P A+ LLE+IKA+RA S G R Sbjct: 412 QAILAKAFRGELVPQDPNDEP--------ASILLERIKAKRAGSAGHTRRR 454 >UniRef50_B2SI71 Type I restriction-modification system, S subunit n=1 Tax=Xanthomonas oryzae pv. oryzae PXO99A RepID=B2SI71_XANOP Length = 501 Score = 291 bits (746), Expect = 3e-77, Method: Composition-based stats. 
Identities = 134/515 (26%), Positives = 223/515 (43%), Gaps = 74/515 (14%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 +LP GW + + ++ G+ + E + + P+ + ++ G ++ Sbjct: 6 VSELPAGWAETTLGAIGSVQSGMGFPLE--MQGQTEGVYPVYKVGDVSRGVLLDRGILRR 63 Query: 63 PKNLVKESQ------KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 N V I PE ++ G + + A + ++ Sbjct: 64 STNYVDAEAAAILKGHIFPEGSILFAKIGEALRLNRRAIVFREGLADNN--VMGFKADQG 121 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I GF+ HF ++ ++SLS I +I+ + + I I +PPLAEQK I +KLD LL Sbjct: 122 IDDGFLYHFLRTQD----LASLSRSTTIPSIRKSDVEDITISLPPLAEQKRIVQKLDALL 177 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR---------------------- 214 AQVD+ KAR + +P +LKRFR+A L A++G LT+ WR Sbjct: 178 AQVDTLKARIDAMPALLKRFREATLTSAMSGTLTKDWRIESSQSTAPEAPRMCRQLLANE 237 Query: 215 -----------------------NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP 251 F V+ + + I +++G P + G Sbjct: 238 RERIWRGRGKYKPAVRSGEVDASEFSNLPEVWHRGTLDEITWSVKDGPHFSPKYATDGVR 297 Query: 252 ILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 + ++R G +D + +++ E R K + D+L+T+ + G + + Sbjct: 298 FISGGNIRPGRIDLSTGKYISQELHEELSARCKPEYLDVLYTKGGTT----GFAAVNRTE 353 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 N+ +++ P ++E +SP A G + + + + V Sbjct: 354 SEFNVWVHVAVLKMLPPSVVDPFFVEFALNSPECY-AQSQRYTHGVGNQDLGLRRMIKIV 412 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAE 429 + +PP+ EQ EIVRRVEQLFAYAD +E +V A R++ LTQS+LAKAFRGEL Q A Sbjct: 413 LPVPPIGEQREIVRRVEQLFAYADQLEAKVATAKQRIDALTQSLLAKAFRGELVPQDPAA 472 Query: 430 NPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 P A+ LL++I+A+RAA+ K RK + Sbjct: 473 EP--------ASVLLDRIRAQRAATPKPKRGRKAA 499 >UniRef50_B6R0S6 Restriction modification system DNA specificity domain protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R0S6_9RHOB Length = 492 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. 
Identities = 129/508 (25%), Positives = 221/508 (43%), Gaps = 70/508 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTD 58 MS +LPEGWV + + + RG + + ++ DD L I+ ++ +G + ++T+ Sbjct: 1 MS--ELPEGWVETEIENIYEVARGGSPRPIKSYLTADDDGLNWIKISDATSGGYRIESTE 58 Query: 59 LVFVPKNLVKESQKISPEDIVIA--MSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 + L K ++ I P D++++ MS G + H G F +K Sbjct: 59 QKITSEGLHK-TRLIYPGDLLLSNSMSFGKPYISAIEGCIH-DGWLVLGGF-----GKKC 111 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + ++ S + + ++G+ + N+ + + +P+ PLAEQK I K+++L Sbjct: 112 VDTRYMHLALSSEGVQKQFDEKASGSTVRNLNTGIVNSVRVPLAPLAEQKRIVAKIESLT 171 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKK----------- 225 A+ + +I + KR++QA+L A +G+LT WR + + Sbjct: 172 AKSRIARENLARIDTLTKRYKQAILKKAFSGELTADWREKSSKDCLIDLNDVLKEHEVIW 231 Query: 226 ----------------------------LNFESILTELRNGLSSKPNESGVGHPILRISS 257 L + + + + P E G G P + + Sbjct: 232 QNNIAKKGKYARPNVKPADDLRSWHELSLEGLAYVVDPHPSHRTPPKEIG-GIPYVGVGD 290 Query: 258 VR-AGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNL 314 V+ G +D R + + L R+ L+ GD + G + +G LL + Q L Sbjct: 291 VKLDGKLDFAGARKVSPKVLKDHLKRYSLKRGDFAY----GKIGTIGQPFLLPEAQEYAL 346 Query: 315 LYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP 374 LI+ R +K A E++ FF SP ++ + Q K ++ + LP Sbjct: 347 SANVILIQPR-SKFATAEFLYYFFLSPVVTQKILG-ASVATSQAAFGIKKMREVLTPLPS 404 Query: 375 VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLI 434 + EQ EIV R+E+ FA D + ++ AL V+ L + ILAKAFRGEL Q + P Sbjct: 405 LSEQNEIVTRIEKAFAKIDKLAEEAKRALHSVDRLDEKILAKAFRGELVPQDPDDEP--- 461 Query: 435 SGENSAAALLEKIKAERAASGGKKASRK 462 A+ LLE+IKAERAA K +RK Sbjct: 462 -----ASVLLERIKAERAAQPKVKRARK 484 >UniRef50_D1C5W4 Restriction modification system DNA specificity domain protein n=1 Tax=Sphaerobacter thermophilus DSM 20745 RepID=D1C5W4_SPHTD Length = 532 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. 
Identities = 135/502 (26%), Positives = 217/502 (43%), Gaps = 87/502 (17%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP GW A + I G+ ++K D+ LP+IR N+ D + Sbjct: 9 LPPGWTWATIRDTGEYINGLAFRKSD----WGDEGLPIIRIQNLT----DPSKPFNRTSR 60 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSGFIAH 124 V + DI+++ S+ + + + P+ L+ S ++ H Sbjct: 61 QVDPVYIVHRGDILLSWSATLDAFTWRG------ETGVLNQHIFKVVPDNRLVHSPYLYH 114 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + ++ K SS G+ + +I F +P+ PLAEQ+ I +++ ++D+ A Sbjct: 115 LLRHAIDLLKQSSHLHGSTMKHINRGPFLSFQVPLAPLAEQRRIVAEIEKHFTRLDAAVA 174 Query: 185 RFEQIPQILKR----------------------------------FRQAVLGGAVNGKLT 210 E+ LKR Q +L Sbjct: 175 ALERARANLKRYRAAVLKAACEGRLVPTEAELARAEGRDYETGEQLLQRILQERRAKWEA 234 Query: 211 EK---------------WR------------NFEPQHSVFKKLNFESILTELRNGLSSKP 243 E+ W+ + + + +L LRNG+S KP Sbjct: 235 EELAKLRAKGKEPKDDRWKARYKEPAAPDTSDLPELPEGWVWARLDQLLGSLRNGISKKP 294 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 +S G PILRI++VR V+ +IR+L S + + L GDLLFTRYNGS E VGVC Sbjct: 295 -DSESGTPILRINAVRPLSVNMEEIRYLSGSVDQYADYVLCQGDLLFTRYNGSPELVGVC 353 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 G ++ + + ++YPDKLIRARL L +++I + +R + ++TT+GQ G+SG Sbjct: 354 GAVRAVDRK-VVYPDKLIRARLASHLCLSSFVQIVLNVGLSREFIARRIRTTAGQSGVSG 412 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 DI+S + LPP+ EQ IV VE+ + + +E+Q+ L R L Q+IL +AF G+L Sbjct: 413 SDIRSVPLPLPPLAEQRRIVAEVERRLSVVEELERQIEANLKRAERLRQAILKRAFAGKL 472 Query: 423 TAQWRAENPDLISGENSAAALL 444 Q + P A+ LL Sbjct: 473 VPQDPNDEP--------ASVLL 486 Score = 122 bits (307), Expect = 2e-26, Method: Composition-based stats. 
Identities = 50/224 (22%), Positives = 100/224 (44%), Gaps = 11/224 (4%) Query: 5 KLPEGWVIAPVSTV-TTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +LPEGWV A + + +L G++ K + + P++R N ++ + ++ ++ Sbjct: 269 ELPEGWVWARLDQLLGSLRNGISKKPD------SESGTPILRINAVRPLSVNMEEIRYLS 322 Query: 64 KNLVK-ESQKISPEDIVIAMSSGSKSVVGKS-AHQHLPFECSFGAFCGVLR-PEKLIFSG 120 ++ + + D++ +GS +VG A + + + + R L S Sbjct: 323 GSVDQYADYVLCQGDLLFTRYNGSPELVGVCGAVRAVDRKVVYPDKLIRARLASHLCLSS 382 Query: 121 FIAHFTKSSLYRNKI-SSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ L R I + A + + + + +P+PPLAEQ+ I +++ L+ V Sbjct: 383 FVQIVLNVGLSREFIARRIRTTAGQSGVSGSDIRSVPLPLPPLAEQRRIVAEVERRLSVV 442 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVF 223 + + + E + +R RQA+L A GKL + N EP + Sbjct: 443 EELERQIEANLKRAERLRQAILKRAFAGKLVPQDPNDEPASVLL 486 Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats. Identities = 52/265 (19%), Positives = 92/265 (34%), Gaps = 38/265 (14%) Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSES 276 P + + E NGL+ + ++ G G PI+RI ++ + + + Sbjct: 10 PPGWTWATIRDTG---EYINGLAFRKSDWGDEGLPIIRIQNLT------DPSKPFNRTSR 60 Query: 277 ELNR-HKLQDGDLL--FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEY 333 +++ + + GD+L ++ + + G G+L QH + PD + Y Sbjct: 61 QVDPVYIVHRGDILLSWSATLDAFTWRGETGVL--NQHIFKVVPD-------NRLVHSPY 111 Query: 334 IEIFFSSPSARNAMMNCVK-TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 + A + + S K I+ S V L P+ EQ IV +E+ F Sbjct: 112 LYHLLR--HAIDLLKQSSHLHGSTMKHINRGPFLSFQVPLAPLAEQRRIVAEIEKHFTRL 169 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 D + A A + ++L A G L + LL++I ER Sbjct: 170 DAAVAALERARANLKRYRAAVLKAACEGRLVPTEAELARAEGRDYETGEQLLQRILQERR 229 Query: 453 AS-------------GGKKASRKKS 464 A K R K+ Sbjct: 230 AKWEAEELAKLRAKGKEPKDDRWKA 254 >UniRef50_A8ZTW4 Restriction modification system DNA specificity domain n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZTW4_DESOH Length = 477 Score = 285 bits (729), Expect = 3e-75, Method: Composition-based 
stats. Identities = 134/504 (26%), Positives = 219/504 (43%), Gaps = 84/504 (16%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPEGWV AP+ ++ ++ G K + + K P+ AN+I + + Sbjct: 5 LPEGWVAAPLQKISQIVYGKGLPKNK---FNKQGLYPVFGANSI---------IGYYDSF 52 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS-GFIAH 124 L ++ Q ++I+ + + S P V P L S ++ + Sbjct: 53 LYEDPQ------VLISCRGANSGTINIS----PPKCFVTSNSLVVQLPNTLHQSFKYLYY 102 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 +SS + G + + +P+PP EQK I +LD ++ ++D K Sbjct: 103 ALESSDK----EKIVTGTAQPQVTIDNLKSFCVPLPPFNEQKRIVARLDQIIPRIDKLKT 158 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ------------------------- 219 R ++IP I+KRFRQ+VL AV G+LTEKWR P Sbjct: 159 RLDKIPTIIKRFRQSVLTAAVTGRLTEKWREDHPDVEGAEATVQSIYYRRLDESQTNQQK 218 Query: 220 ------------------HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAG 261 +K I + G SSK ++ G P+LR+ +++ G Sbjct: 219 NKIEKLFAEVETEDNGLLPETWKYTFLNKICESFQYGTSSKSSKKGD-IPVLRMGNLQNG 277 Query: 262 HVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI 321 +D +++ + ++ E+ ++KL+ +LF R N S E VG + L + ++ LI Sbjct: 278 AIDWSNLVY-SSNKKEIEKYKLEKNTVLFNRTN-SPELVGKTAIY--LGERAAIFAGYLI 333 Query: 322 RARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 R Y+ ++ A+ Q I+ + + + PP++EQ EI Sbjct: 334 RINNMDILDSHYLNYSLNTDYAKAFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQKEI 393 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAA 441 VR+VE+ FA AD +E NA ARV+ L +S+LAKAFRGELT Q + P A Sbjct: 394 VRQVERSFALADKLEAHYQNARARVDKLARSVLAKAFRGELTPQDPNDEP--------AE 445 Query: 442 ALLEKIKAERAA-SGGKKASRKKS 464 LLE+I AE+ + K +RK++ Sbjct: 446 KLLERILAEKEKMAAAVKKTRKQA 469 Score = 157 bits (398), Expect = 5e-37, Method: Composition-based stats. 
Identities = 51/226 (22%), Positives = 96/226 (42%), Gaps = 7/226 (3%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G LPE W ++ + + T K K +P++R N+QNG D ++LV+ Sbjct: 232 DNGLLPETWKYTFLNKICESFQYGTSSKSS-----KKGDIPVLRMGNLQNGAIDWSNLVY 286 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 E K+ ++ ++ S +VGK+A F + + ++ S + Sbjct: 287 SSNKKEIEKYKLEKNTVLFNRTN-SPELVGKTAIYLGERAAIFAGYLIRINNMDILDSHY 345 Query: 122 IAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + + + + + G N +NI IP PPL EQK I +++ A D Sbjct: 346 LNYSLNTDYAKAFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQKEIVRQVERSFALAD 405 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 +A ++ + + ++VL A G+LT + N EP + +++ Sbjct: 406 KLEAHYQNARARVDKLARSVLAKAFRGELTPQDPNDEPAEKLLERI 451 >UniRef50_Q466N9 Type I restriction-modification system specificity subunit n=2 Tax=cellular organisms RepID=Q466N9_METBF Length = 492 Score = 284 bits (727), Expect = 4e-75, Method: Composition-based stats. 
Identities = 109/488 (22%), Positives = 197/488 (40%), Gaps = 56/488 (11%) Query: 6 LPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP- 63 LP W + + + G T K +R +IQN + + + + Sbjct: 18 LPNDWQWTRLGEIADNIQYGYTESSSDEPIGPK-----FLRITDIQNNEVNWKSVPYCEI 72 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFI 122 N K++ + D+V A + + VGKS F E F ++ +R + I F+ Sbjct: 73 DNTKKQNYLLKDGDLVFARTGAT---VGKSYLLKGDFPESVFASYLIRVRLLEEISESFV 129 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +F +S Y +I+ G N+ L+ +P+ PL EQ+ I K++ L +++D+ Sbjct: 130 YNFFQSLTYWKQITEGQVGIGQPNVNGTKLSLLIVPVAPLLEQRAIVSKIEQLFSELDNG 189 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV-------------------- 222 + + + LK +RQAVL A GKLT+KWR P Sbjct: 190 ISNLKLAQEQLKVYRQAVLKKAFEGKLTKKWREENPDVEDSKYVLNKIKNQISTQKKTKE 249 Query: 223 ----------------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 + ++ + + +G P ++ G P + IS++ +G +D + Sbjct: 250 IQDIQYGEVPYELPFKWNWVSLSDVSISITDGDHQAPPKADSGVPFIVISNISSGKLDMS 309 Query: 267 DIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + ++ E + K Q D+L++ G+ L+ + + + + R Sbjct: 310 ETMYVPEKYYENLAAKRKPQPRDILYSVTGSY----GIPILISE--NYRFCFQRHIALIR 363 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + +Y+ SP T + Q + +++ V +PP+ EQ IV+ Sbjct: 364 PHMEISSKYLYYILKSPFVYKQATKVA-TGTAQLTVPLSGLRTIKVPIPPIAEQQAIVQE 422 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 +E + + IE+ + + L R L QSIL KAF G+L + A LL Sbjct: 423 IETRLSVCEKIEQDIKDNLERAEALRQSILKKAFEGKLLNEKELAEVRGAEDWEPAEVLL 482 Query: 445 EKIKAERA 452 E+IKAE+A Sbjct: 483 ERIKAEKA 490 Score = 161 bits (406), Expect = 7e-38, Method: Composition-based stats. 
Identities = 63/254 (24%), Positives = 120/254 (47%), Gaps = 9/254 (3%) Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD 264 + + E+ + + ++ I ++ G + ++ +G LRI+ ++ V+ Sbjct: 4 IKPIIEEEIAEYPNLPNDWQWTRLGEIADNIQYGYTESSSDEPIGPKFLRITDIQNNEVN 63 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + + E ++ + L+DGDL+F R + VG LLK ++ ++ LIR R Sbjct: 64 WKSVPYCEIDNTKKQNYLLKDGDLVFARTGAT---VGKSYLLKGDFPES-VFASYLIRVR 119 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 L ++ ++ FF S + + + GQ ++G + +V + P+ EQ IV + Sbjct: 120 LLEEISESFVYNFFQSLTYWKQITEG-QVGIGQPNVNGTKLSLLIVPVAPLLEQRAIVSK 178 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 +EQLF+ D + A ++ Q++L KAF G+LT +WR ENPD+ + +L Sbjct: 179 IEQLFSELDNGISNLKLAQEQLKVYRQAVLKKAFEGKLTKKWREENPDV----EDSKYVL 234 Query: 445 EKIKAERAASGGKK 458 KIK + + K Sbjct: 235 NKIKNQISTQKKTK 248 Score = 137 bits (345), Expect = 8e-31, Method: Composition-based stats. Identities = 47/213 (22%), Positives = 94/213 (44%), Gaps = 11/213 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP W +S V+ I ++ D +P I +NI +GK D ++ ++VP+ Sbjct: 261 ELPFKWNWVSLSDVSISITDGDHQAPPKA----DSGVPFIVISNISSGKLDMSETMYVPE 316 Query: 65 NLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + +K P DI+ +++ G + F ++RP I S + Sbjct: 317 KYYENLAAKRKPQPRDILYSVT----GSYGIPILISENYRFCFQRHIALIRPHMEISSKY 372 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + KS + + ++ G + + I +PIPP+AEQ+ I ++++T L+ + Sbjct: 373 LYYILKSPFVYKQATKVATGTAQLTVPLSGLRTIKVPIPPIAEQQAIVQEIETRLSVCEK 432 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 + + + + RQ++L A GKL + Sbjct: 433 IEQDIKDNLERAEALRQSILKKAFEGKLLNEKE 465 >UniRef50_C9YAL6 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YAL6_9BURK Length = 449 Score = 281 bits (720), Expect = 3e-74, Method: Composition-based stats. 
Identities = 125/466 (26%), Positives = 220/466 (47%), Gaps = 30/466 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV-PK 64 LP+ W AP+ + + ++ +A ++ +P++ A NI + K + + P+ Sbjct: 3 LPQSWTTAPLGKLCEKLSDGSHNPPKA----QETGMPMLSARNINDRKITFDEFRLISPE 58 Query: 65 NLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 +E ++ +S D+++ + +G++A + + VL+P K S + Sbjct: 59 EFAEEDRRTRVSSGDVLLTIVGA----IGRTAVVPQGAPQFTLQRSVAVLKPIKS-DSRY 113 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I++ ++ + + + G I + + IP+ P EQK IA+KLDT+L +VD+ Sbjct: 114 ISYALEAPALQKYLQDNAKGTAQKGIYLKALAGVEIPVAPEPEQKRIADKLDTVLTRVDA 173 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN-FEPQHSVFKKLNFESILTELRNGLS 240 R ++ +LKRFRQ+VL A +G+LTE WRN P+ + + + + +G Sbjct: 174 VNTRLARVAPLLKRFRQSVLAAATSGRLTEDWRNGSIPEVKEWSEKALSEVCRTITDGEH 233 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD--GDLLFTRYNGSLE 298 P + G P++ VR VD +D +F+ ++ +R + GD+L + Sbjct: 234 ISPPLAPHGVPLVSAKDVREWGVDFSDTKFVSEEFADASRKRCGPICGDVLVVSRGAT-- 291 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG L+K + L+ L + T E++ +SP + + Q Sbjct: 292 -VGRTCLVKSKEKFCLMGSVLLFQPTAT-LIKSEFLAHVLASPLGLEQLTK-ASGATAQA 348 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I +D K + LP ++EQ EIVRRVE LFA+AD +E ++ A A LT ++LAKAF Sbjct: 349 AIYIRDAKGLKIRLPSIEEQTEIVRRVETLFAFADRLEARLAQAQAAATRLTPALLAKAF 408 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 GEL Q + P AA LL ++ A+ A+ + RK + Sbjct: 409 SGELVPQDPNDEP--------AAELLRRL-AQAPATASPRKGRKAA 445 >UniRef50_D1UP80 Restriction modification system DNA specificity domain protein n=1 Tax=Burkholderia sp. CCGE1001 RepID=D1UP80_9BURK Length = 443 Score = 281 bits (719), Expect = 3e-74, Method: Composition-based stats. 
Identities = 134/472 (28%), Positives = 234/472 (49%), Gaps = 42/472 (8%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDL 59 MS +LP+GW+ + V T K + D++ ++ +I+ K + L Sbjct: 1 MS--RLPKGWLETTLGEVVDY---GTTLKAEPDEISDDEW--VLELEDIEKDKSRIVSRL 53 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVV------GKSAHQHLPFECSFGAFCGVLRP 113 F + + S D++ + V G + +P + + Sbjct: 54 TFADRKSKSTKNRFSKGDVLYGKLRPYLNKVVLADSNGLCTTEIIPIKQTAA-------- 105 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 + + ++ H+ + + + +S G N+ + + +PPLAEQK IA+KLD Sbjct: 106 ---VDNRYVFHWLRGPRFLSYAIGVSHGLNMPRLGTDAGRSAPFILPPLAEQKRIADKLD 162 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 ++L++V++ AR ++P IL R R+A L + G+ + ++ F S++ Sbjct: 163 SVLSRVEAACARMGRVPTILTRLRRAALVATLLGQDGDAKPT--------PRIAFGSLIN 214 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 +R G ++ P +PILR SSVR G +D D+R+L +S ++ +++ D+LFTR Sbjct: 215 SIRGGTTAVPQSDKTAYPILRSSSVRQGRIDFEDVRYLTSEQSGEEKNFIRENDVLFTRL 274 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 NG++ +VG C ++ + YPD+L ARL + +P+Y F+ P R + K+ Sbjct: 275 NGNVNYVGNCAVVPSVSLNKYQYPDRLYCARLKETIVPKYCAYAFALPDIRKEIERRAKS 334 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 ++G K IS +DIK + LPPV EQ +V ++E++FA D +EK ++ A ++LT ++ Sbjct: 335 SAGHKRISIQDIKEMEIPLPPVAEQLRMVNQIERIFATCDRLEKTLDEAKIVADHLTPAL 394 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK-KASRKKS 464 LAKAFRGEL Q + SA LLE++KA + G K K SR+ + Sbjct: 395 LAKAFRGELVGQDPND--------ESAEQLLERLKALTTSLGTKGKRSRQSA 438 >UniRef50_UPI0001C36A8C HsdS1 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36A8C Length = 456 Score = 277 bits (709), Expect = 6e-73, Method: Composition-based stats. 
Identities = 103/449 (22%), Positives = 206/449 (45%), Gaps = 29/449 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P W + V +I G T K + Y P + ++ G+ ++V + Sbjct: 30 EVPGNWCWVRLKDVAFVITGGTPSKNKPEYY--GGTFPFFKPADLDYGR----NMVAASE 83 Query: 65 NLVKESQ---KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L +E + + P GS +GK + L + + P+ + S F Sbjct: 84 FLSEEGKAVSRCIPAKSTAVCCIGS---IGKCGY--LCVDGTTNQQINSAIPK--VNSLF 136 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ + L+ ++ ++ I+ + + + P+PPL EQ+ IA ++ + ++D Sbjct: 137 LYYYCNTILFTKQLRLKASATTISIVNKSKMEQCLFPLPPLREQQRIANHIEEMFYKLDE 196 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL-- 239 K + + + + + + A+L A +G LT KWR + S + L+ GL Sbjct: 197 IKEKTQLVLESSEDRKAAILYKAFSGALTAKWRKHKGVSFEGWITKPLSEVATLQTGLMK 256 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 + N+ V P LR+++V+ G++D +I+ +E ++ R++L+ GD+LFT G + Sbjct: 257 GKRNNQKTVLLPYLRVANVQDGYLDLKEIKNIEVDVLKIERYRLKKGDVLFTE-GGDFDK 315 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQK 358 +G + + + + ++ + + R D L P ++ + S + + C K T+ Sbjct: 316 LGRSSVWNE-EIPDCIHQNHIFVVRTQTDTLDPYFLSLQAGSRYGKTYFIGCSKQTTNLA 374 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ +K+ VL+P ++EQ EIV + + I++ L ++ + +SIL++AF Sbjct: 375 SINSTQLKNFPVLIPTIEEQREIVNILNFFLGKEEQIKQNCLKLLEKIEEIKKSILSRAF 434 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKI 447 RGEL NPD E S+ LL+ I Sbjct: 435 RGELGTN----NPD----EESSIELLKTI 455 Score = 108 bits (270), Expect = 4e-22, Method: Composition-based stats. 
Identities = 42/245 (17%), Positives = 90/245 (36%), Gaps = 19/245 (7%) Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + E+ + ++ +QA++ E+W P + + +L + + +KP Sbjct: 5 KKKEENLTLEEKLKQALVPE-------EEWPYEVPGNWCWVRLKDVAFVITGGTPSKNKP 57 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P + + + G FL ++R + + +G C Sbjct: 58 EYYGGTFPFFKPADLDYGRNMVAASEFLSEEGKAVSRCIPAKSTAVCC-----IGSIGKC 112 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 G L N + ++ + ++ + + + ++ Sbjct: 113 GYLCVDGTTNQQINSAI------PKVNSLFLYYYCNTILFTKQL-RLKASATTISIVNKS 165 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++ + LPP++EQ I +E++F D I+++ L + +IL KAF G LT Sbjct: 166 KMEQCLFPLPPLREQQRIANHIEEMFYKLDEIKEKTQLVLESSEDRKAAILYKAFSGALT 225 Query: 424 AQWRA 428 A+WR Sbjct: 226 AKWRK 230 >UniRef50_A3JFC5 Restriction modification system DNA specificity domain n=1 Tax=Marinobacter sp. ELB17 RepID=A3JFC5_9ALTE Length = 527 Score = 275 bits (704), Expect = 2e-72, Method: Composition-based stats. 
Identities = 120/534 (22%), Positives = 211/534 (39%), Gaps = 90/534 (16%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLK---DDYLPLIRANNIQNGKFDTTDLVF 61 +L +GWV + V + +G +K + + + LI+ +I +G+F F Sbjct: 7 ELADGWVECVIEDV--VGKGGIFKDGDWVESKDQDPNGDVRLIQLADIGDGRFLDKSSRF 64 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC---SFGAFCGVLRPEKL 116 + ++ +E + DI++A +G+ L + + C + + Sbjct: 65 LTRSKARELNCTFLRAGDILVARM---PDPLGRCCIFPLDEDGRYVTVVDICAIRFGDSR 121 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + F+ + S R KIS+L +G+ I + I +P+PPL EQ I K++TL Sbjct: 122 VNAKFMMYLINSPSIRGKISALQSGSTRKRISRGNLATIPLPLPPLNEQHRIVAKIETLF 181 Query: 177 AQVDST------------------------------------------KARFEQIPQILK 194 +++D + +I Q + Sbjct: 182 SELDKGIESLKTAREQLKVYRQAVLKHAFEGKLTAKWREQNKDKLETPQQLLARIQQERQ 241 Query: 195 RFRQAVLGG-------------------------AVNGKLTEKWRNFEPQHSVFKKLNFE 229 Q L A+ + RNF + + Sbjct: 242 ARYQQKLQEWQVAVKMWEENGKKENKPGKPKKLAALKETSENETRNFPQLPVGWTYVRLG 301 Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 ++ E G S K + +LRI ++ G +D ++++F E E+ L GDLL Sbjct: 302 LLIEEPTYGTSKKCSYDSGQVGVLRIPNISHGAIDSSNLKFASFEEHEVKALALAKGDLL 361 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMM 348 R NGS+ VG C L+ + + + L+ LIR R D + P ++ +S R + Sbjct: 362 TIRSNGSVSLVGSCALIAE-EDTDFLFAGYLIRLRPNHDLVAPFFLLSVLTSHLLRRQIE 420 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + K+TSG I+ +I++ +V LP + EQ E+++ +E E ++ L + Sbjct: 421 SAAKSTSGVNNINTGEIQNLIVPLPSMVEQVELLKFLEISTPNIAVAEYEIEVQLKKSEV 480 Query: 409 LTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 L QSIL KAF G+L Q + P A+ALL +I +ERA K K Sbjct: 481 LRQSILKKAFSGKLVPQDPNDEP--------ASALLARIHSERAGRSPVKTRVK 526 Score = 151 bits (380), Expect = 7e-35, Method: Composition-based stats. 
Identities = 62/262 (23%), Positives = 118/262 (45%), Gaps = 14/262 (5%) Query: 204 AVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV 263 A +L + W + V K + + + + SK + +++++ + G Sbjct: 3 AELNELADGWVECVIEDVVGK-----GGIFKDGDWVESKDQDPNGDVRLIQLADIGDGRF 57 Query: 264 DQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 RFL S++ ELN L+ GD+L R + +G C + + + + Sbjct: 58 LDKSSRFLTRSKARELNCTFLRAGDILVARM---PDPLGRCCIFPLDEDGRYVTVVDICA 114 Query: 323 ARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 R +++ +SPS R + + +++ S +K IS ++ + + LPP+ EQ I Sbjct: 115 IRFGDSRVNAKFMMYLINSPSIRGKI-SALQSGSTRKRISRGNLATIPLPLPPLNEQHRI 173 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAA 441 V ++E LF+ D + + A ++ Q++L AF G+LTA+WR +N D + Sbjct: 174 VAKIETLFSELDKGIESLKTAREQLKVYRQAVLKHAFEGKLTAKWREQNKDKLETPQ--- 230 Query: 442 ALLEKIKAERAASGGKKASRKK 463 LL +I+ ER A +K + Sbjct: 231 QLLARIQQERQARYQQKLQEWQ 252 >UniRef50_UPI0001695152 type I restriction enzyme specificity protein n=1 Tax=Xanthomonas oryzae pv. oryzicola BLS256 RepID=UPI0001695152 Length = 451 Score = 274 bits (700), Expect = 6e-72, Method: Composition-based stats. 
Identities = 128/469 (27%), Positives = 219/469 (46%), Gaps = 28/469 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 +LP GWV + + + + + I ++ + P N + G + + Sbjct: 2 VSELPGGWVETTIGEICAMGPKSAWDDDMEIGFVPMSHAP----TNFR-GPLNYEARRWH 56 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGA--FCGVLRPEKLIFS 119 VK++ D VI GK+A LP G+ F + R + I Sbjct: 57 E---VKKAYTHFENDDVIFAKVTPCFENGKAALVAGLPNGAGAGSSEFHVLRRRDAGISP 113 Query: 120 GFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ KS+ + + GA + + A + + +PP AEQK IA+KLD LLAQ Sbjct: 114 SYLLAVIKSAQFLREGEENMTGAIGLRRVPRAFVENFPVRLPPEAEQKRIAQKLDALLAQ 173 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 VD+ KAR + IP +LKRFRQ+V+ V+G L ++ + ++ + E + T++++G Sbjct: 174 VDTFKARIDAIPALLKRFRQSVINHGVSGSL-ALDQHASFDTTTWRNMRAEDVCTKVQSG 232 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESE--LNRHKLQDGDLLFTRYNG 295 + K + G P L++ ++ G ++ +++ + + GD+L Sbjct: 233 GTPKEGFTTEGIPFLKVYNIVDGIIEFEYRPQYIAADIHQGSCRKSITIPGDVLMNIVGP 292 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 L G ++ + + + + R ++ +I + + + K ++ Sbjct: 293 PL---GKIAVVPQGVDEWNI-NQAITLFRPSESISSAWIHLVLLEGTNIRRVSQETKGSA 348 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 GQ IS + V +PP + Q EIVRRVEQLFAYAD +E +V A R++ LTQS+LA Sbjct: 349 GQVNISLSQCRDFVFPVPPTQIQDEIVRRVEQLFAYADQLEAKVAAAQQRIDALTQSLLA 408 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 KAFRGEL Q ++ P A+ LL++I+A+RAA+ K RK + Sbjct: 409 KAFRGELVPQDPSDEP--------ASVLLDRIRAQRAATPKPKRGRKAA 449 >UniRef50_A1TWL9 Restriction modification system DNA specificity domain n=2 Tax=Gammaproteobacteria RepID=A1TWL9_MARAV Length = 435 Score = 273 bits (699), Expect = 8e-72, Method: Composition-based stats. 
Identities = 151/468 (32%), Positives = 231/468 (49%), Gaps = 42/468 (8%) Query: 1 MSAGKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 M + LP W +A + ++ + G T + + L+R +IQN ++ Sbjct: 1 MQSQLLPANWQLANLGEISSDISYGYTASATS-----EPTGVKLLRITDIQNNTVSWPNV 55 Query: 60 ---VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEK 115 P+ + K ++ P D+V A + + VGKS E + ++ +R + Sbjct: 56 PNCKIEPEKVGK--YRLKPSDLVFARTGAT---VGKSYLLKGEIPESVYASYLIRVRCLE 110 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + F+A++ +S Y +I+ SAG N+ +++P+PPLAEQK+IA+KLDTL Sbjct: 111 GVSIEFLANYFQSPYYWRQITDFSAGIGQPNVNGTKLKNLSVPVPPLAEQKVIADKLDTL 170 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 LAQV++TKAR E+IPQILKRFRQ+VL AV+G+L + + + + Sbjct: 171 LAQVENTKARLERIPQILKRFRQSVLAAAVSGRLIDAQPESIAKLEELVDIENGA---RK 227 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 + + G P + + +L E R+ L D Sbjct: 228 PVSATIRKTIQGT-IPYYGATGIVD---------YLNDYTHE-GRYLLVGED-------- 268 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + L + + + + ++++I +S + T S Sbjct: 269 GANLLSKSKDLAFIVEGKMWVNNHAHVLKERPGVNLDFVKIAINSLDLTPWI-----TGS 323 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q ++ K + + + EQ EIVRRV+QLF++AD IE+Q ++ALARVNNLTQSILA Sbjct: 324 AQPKLTKKSLCGLPITNFTLDEQTEIVRRVDQLFSHADRIEQQASSALARVNNLTQSILA 383 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 KAFRGELT QWR +NP+LI GENSA ALLE+IKAERAA K +R K Sbjct: 384 KAFRGELTEQWRRDNPELIGGENSAEALLERIKAERAAMKPVKRTRNK 431 >UniRef50_Q1Z9T4 Type I restriction-modification system, S subunit n=5 Tax=Bacteria RepID=Q1Z9T4_PHOPR Length = 523 Score = 273 bits (698), Expect = 1e-71, Method: Composition-based stats. 
Identities = 146/521 (28%), Positives = 234/521 (44%), Gaps = 94/521 (18%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG-KFDTTDL 59 MS +LP+GW + + + RG + + + DD + I+ + + G K T+ Sbjct: 1 MS--QLPKGWAENSLGNLVVVERGSSPRPIKNFLTDSDDGVNWIKIGDAKKGQKLLTSTA 58 Query: 60 VFVPKNLVKESQKISPEDIVIA--MSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 + K +S+ + D +++ MS G ++G + H + V R K I Sbjct: 59 EKITKEGAMKSRFVDVGDFILSNSMSFGLPYIMGIPGYIHDGW--------FVFRLPKQI 110 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 S + + SS + ++L+ G + NI +P+PPLAEQ I EKLD +LA Sbjct: 111 SSDYFYYLLSSSYVGAQFNNLAVGGVVKNISGDLVKKAILPLPPLAEQTRIVEKLDEVLA 170 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 QVD+ KAR + IP I+KRFRQ+VL AV+GKLTE+WR+ + K + + + + + Sbjct: 171 QVDTIKARLDGIPAIIKRFRQSVLAAAVSGKLTEEWRDINTAQDIEKFCSEITDVRKEQY 230 Query: 238 GLSS------------KPNESGVGH----------------------------------- 250 ++ KP+ Sbjct: 231 LVTCQKAKLAKSKKPRKPSNIDDKIEPHLDVLDLLPSIPEQWTQKVLSFVTDNYADSIVD 290 Query: 251 -PILRISSVRAGHVDQ-----------------NDIRFLECSESE-LNRHKLQDGDLLFT 291 P +V+ ++D + +F+ + E L+RHK+ +GD+LF Sbjct: 291 GPFGASINVKTDYIDDGVPVIRMVNIRPFQFLRENRKFVSFEKFEGLSRHKINEGDVLFA 350 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNC 350 + + G C + + +L R + K E++ I + A + N Sbjct: 351 KVGAT---TGDCCMYPMNEPIAMLSTTGSCRITVDKQVYNSEFLVIVLN---AYRRIFNS 404 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + + Q ++ K IKS + +P ++EQ EIVR V+Q F++ADTIE QV A ARV++LT Sbjct: 405 ITSQVAQPFLNMKTIKSVPIPIPALEEQKEIVRLVDQYFSFADTIEAQVKKAQARVDSLT 464 Query: 411 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 QSILAKAFRGEL AQ ++ P A LLE+I R Sbjct: 465 QSILAKAFRGELVAQDPSDEP--------ADKLLERIAQAR 497 Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats. 
Identities = 54/245 (22%), Positives = 93/245 (37%), Gaps = 19/245 (7%) Query: 230 SILTELRNGLSSKP-----NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 L + G S +P +S G ++I + G E + + Sbjct: 14 GNLVVVERGSSPRPIKNFLTDSDDGVNWIKIGDAKKGQKLLTSTAEKITKEGAMKSRFVD 73 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 GD + + + S + G+ + ++ RL K +Y SS Sbjct: 74 VGDFILSN-SMSFGLPYIMGIPGYIHDGWFVF-------RLPKQISSDYFYYLLSSSYVG 125 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 N K ISG +K ++ LPP+ EQ IV +++++ A DTI+ +++ A Sbjct: 126 AQFNNLAVGGV-VKNISGDLVKKAILPLPPLAEQTRIVEKLDEVLAQVDTIKARLDGIPA 184 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGE---NSAAALLEK--IKAERAASGGKKA 459 + QS+LA A G+LT +WR N + E+ + ++A K Sbjct: 185 IIKRFRQSVLAAAVSGKLTEEWRDINTAQDIEKFCSEITDVRKEQYLVTCQKAKLAKSKK 244 Query: 460 SRKKS 464 RK S Sbjct: 245 PRKPS 249 >UniRef50_C0VG50 Type I restriction modification enzyme protein S n=1 Tax=Acinetobacter sp. ATCC 27244 RepID=C0VG50_9GAMM Length = 399 Score = 271 bits (692), Expect = 5e-71, Method: Composition-based stats. 
Identities = 115/413 (27%), Positives = 205/413 (49%), Gaps = 19/413 (4%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 I + ++T IRGV+Y K A++ +++ YLP++RANNIQ D V+VP++ + + Q Sbjct: 4 IVKIGNISTQIRGVSYSKSDAVSNMQEGYLPVLRANNIQEQGLILEDFVYVPESKISKKQ 63 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSL 130 +I D++IA SSGS S+VGK+A FGAFC +LRP +L+ + A++ ++ Sbjct: 64 RILAGDVIIAASSGSISLVGKAASAKEDINAGFGAFCKILRPNTELVDPRYFANYFQTQQ 123 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 YR IS+L+AGANINN+K D + IP+PPL+EQ+ IA LD + E++ Sbjct: 124 YRQIISNLAAGANINNLKNEHLDDLEIPLPPLSEQRRIASILDQADVLRQKRQQAIEKLD 183 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 Q+L+ + G V+ + + Q + + F + L + + G Sbjct: 184 QLLQATFIDMFGDPVSNPKGFEVKKLSEQVDLIQIGPFGTQLHQ--------EDYIENGI 235 Query: 251 PILRISSVRAGHVDQN-DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 P++ S ++ G + N + + EL+++ L+ D+L R +G C ++ + Sbjct: 236 PLINPSHIKNGKIVPNLKLSVSQLKYGELSQYHLKLHDVLLGRRGE----MGRCAVVTQN 291 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 + L L + P ++E+ SS S + + N V ++ + S Sbjct: 292 EVGWLCGTGSLFLRPNVEKINPFFLEMLLSSDSIKRYLEN-VSQGQTMANLNKTIVGSIP 350 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++ P ++ Q + + + ++ ++ N+ +VNNL QS+ AF G L Sbjct: 351 LIAPSIEIQNKFF----LISEEINKMKTELENSKNQVNNLFQSLQNHAFNGTL 399 Score = 92.1 bits (227), Expect = 4e-17, Method: Composition-based stats. 
Identities = 40/209 (19%), Positives = 82/209 (39%), Gaps = 16/209 (7%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+G+ + +S LI+ + + ++ +PLI ++I+NGK + V + Sbjct: 201 PKGFEVKKLSEQVDLIQIGPFGTQLHQEDYIENGIPLINPSHIKNGKIVPNLKLSVSQLK 260 Query: 67 VKE--SQKISPEDIVIAMSSGSKSVVGKSAH---QHLPFECSFGAFCGVLRPE-KLIFSG 120 E + D+++ G + +G+ A + + C G+ LRP + I Sbjct: 261 YGELSQYHLKLHDVLL----GRRGEMGRCAVVTQNEVGWLCGTGSL--FLRPNVEKINPF 314 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ S + + ++S G + N+ I + P + Q K + +++ Sbjct: 315 FLEMLLSSDSIKRYLENVSQGQTMANLNKTIVGSIPLIAPSIEIQ----NKFFLISEEIN 370 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 K E + Q++ A NG L Sbjct: 371 KMKTELENSKNQVNNLFQSLQNHAFNGTL 399 >UniRef50_A6DQ81 Putative restriction-modification system specificity determinant n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQ81_9BACT Length = 402 Score = 268 bits (685), Expect = 3e-70, Method: Composition-based stats. Identities = 110/415 (26%), Positives = 195/415 (46%), Gaps = 25/415 (6%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKIS 74 + +++ IRGV+YKK ++ + Y P++RANNI G + LV+V ++KE Q + Sbjct: 6 IGDISSQIRGVSYKKNDVVDEPTERYTPVMRANNINEGFLNYDKLVYVKSEVIKEHQLLQ 65 Query: 75 PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRN 133 D++I SSGS ++VGK+ SFGAFC VLRP+ K +F F + +S Y+ Sbjct: 66 KGDVLICASSGSLNLVGKAGSFLDSTSSSFGAFCKVLRPDTKKVFPRFFHFYFQSQGYKR 125 Query: 134 KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQIL 193 I +L+ GANINNIK D + IP+P L EQK IA LD + Q + L Sbjct: 126 SIKALAEGANINNIKNEHLDDLKIPLPSLEEQKRIAAILDKADELRQKRREAISQCNEFL 185 Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG---VGH 250 K ++ G V + K+ F+ +L + G S K Sbjct: 186 KSTFLSMFGDPVTNPKG------------WDKIIFDELLDNIDGGWSPKCETWPATLDEW 233 Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 ++++ ++ + + + + + ++Q DLLF+R N + E V C + + Sbjct: 234 
GVMKLGALTTCEYKEEENKAMLPGLETKSNIEIQPRDLLFSRKN-THELVAACAYVWDTR 292 Query: 311 HQNLLYPDKLIRARLT--KDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKS 367 Q L+ D + R + + Y+ + R + +G IS K++K+ Sbjct: 293 PQ-LMMSDLMFRFKFKASAEVNSIYMWKLLVNERQRKEVQALASGAAGSMPNISKKNLKT 351 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + +PP++ Q + ++ ++ + Q+ +L +++ +++ KAF+GEL Sbjct: 352 IKLPIPPIELQNQFA----EIAKKTESSKSQMQQSLKELDDNFDALMQKAFKGEL 402 >UniRef50_A0KZG7 Restriction modification system DNA specificity domain n=6 Tax=Gammaproteobacteria RepID=A0KZG7_SHESA Length = 587 Score = 266 bits (679), Expect = 2e-69, Method: Composition-based stats. Identities = 152/515 (29%), Positives = 236/515 (45%), Gaps = 70/515 (13%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI------QNGKFDTTDL 59 LP+GW + + V + GV + + + P+ + ++ ++G Sbjct: 6 LPKGWAVTTIGAVARVSSGVGFPIKYQGK--SEGLYPVYKVGDVSKAVTSKHGNLAVAGH 63 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + +I P + G + + A P + P+K + Sbjct: 64 YVDKEEAAELKGEIFPVGATLFAKIGEAVKLNRRAFVRKPGLADNNVMAVI--PDKSDCN 121 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ F ++ ++ S + +I+ + I + +PPLAEQ +IA+KLDTLLAQV Sbjct: 122 RFLYQFLRAID----LTETSRSTTVPSIRKGDIEDIELYLPPLAEQIVIADKLDTLLAQV 177 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE---------------------- 217 ++TKAR E+IP+ILK FRQ+VL AV+GKLT++WR Sbjct: 178 ETTKARLERIPEILKSFRQSVLSAAVSGKLTQEWRESHGNGTGEEVVKADAINKSVLLNE 237 Query: 218 ---------------------PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRIS 256 + + I + G + +S G +L Sbjct: 238 NPALKKKKSTIESQIDTEYIFDLPESWGFTTWGKISEWITYGFTKPMPKSDSGVKLLTAK 297 Query: 257 SVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNL 314 V+ V+ ND S + ++ + GDLL T+ +G L++ + + Sbjct: 298 DVQYFDVNINDAGLTTSSAFQSLSDKDRPIKGDLLITKDGS----IGRAALVRTDEPFCI 353 Query: 315 LYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP 374 + R T +Y+E +S + + + + + Q +S D + +P Sbjct: 354 
NQSVAVCWLRSTS-MNKDYLEFLANSEFTQRFVKDKAQGMAIQ-HLSIIDYAKCPLPVPS 411 Query: 375 VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLI 434 ++EQ EIVRRVE+LFA+AD+IE++ ALARVNNLTQSILAKAFRGELTA WRA NP+LI Sbjct: 412 LEEQTEIVRRVEELFAFADSIEQKATAALARVNNLTQSILAKAFRGELTADWRAANPELI 471 Query: 435 SGENSAAALLEKIKAERA---ASGGKKAS--RKKS 464 SG+NSAAALLEKIK ER K S +KK+ Sbjct: 472 SGDNSAAALLEKIKVEREVMKKQPKPKRSNIKKKT 506 >UniRef50_A4VH87 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VH87_PSEU5 Length = 472 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. Identities = 137/493 (27%), Positives = 216/493 (43%), Gaps = 51/493 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP GW + + L G T K + +P + +++ + + Sbjct: 1 MS--ELPSGWTRFALKDLGGLSGGKTPSKAN-PEFWSTRDVPWVSPKDMKKNLLEDAEDR 57 Query: 61 FVPKNLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + + P +++ SG A + E + VLRP + I Sbjct: 58 ISQNAVDEAGMTLYPSGSVLMVTRSGILQHTFPVALAGV--ELTVNQDIKVLRPIEGIVP 115 Query: 120 GFIAHFTKSSLYRNKISSLSA--GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F + KS + +I S + G + +I + +PPLAEQ IA+KLD LLA Sbjct: 116 KFSFYMLKS--FGAEILSACSKDGTTVQSIDSEKLETFLFSLPPLAEQTRIAQKLDELLA 173 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF-------EPQHSVFKKLNFES 230 QVD+ KAR + IP +LKRFRQ+VL AV+G+LTE+WR E S ++ + Sbjct: 174 QVDTLKARIDAIPALLKRFRQSVLAAAVSGRLTEEWRGSIPASESAEEYLSRVIQVRRQK 233 Query: 231 ILTELRNGLSSKPNESGVGHP--ILRISSVRAGHVDQNDIRFL---ECSESELNRH---- 281 + + + + + P + ++SV + + +R E ES ++ Sbjct: 234 PIVKFKEPVPPDLETRELEVPEGWI-VASVSSFAECLDSMRVPVKKELRESGEGKYPYFG 292 Query: 282 ----------KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + D DL+ + + F G + + + R Sbjct: 293 ANGEVDRVDEYIFDDDLVLVTEDET--FYGRVKPIAYKYSGKCWVNNHVHALRAHDAVAR 350 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y+ + T+G+ ++ + S + +PP EQ EIVRRVEQLFA+ Sbjct: 351 DYLCYVLMHYDVVPWL----TGTTGRAKLTQGALLSLPIQVPPATEQTEIVRRVEQLFAF 406 
Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 AD +E +VN A A ++ LTQSILAKAFRGEL Q + P A+ LLE+IKA+R Sbjct: 407 ADQLEARVNAAKACIDRLTQSILAKAFRGELVPQDPNDEP--------ASVLLERIKAQR 458 Query: 452 AASGGKKASRKKS 464 AA+ K RK S Sbjct: 459 AAAPKTKRGRKAS 471 >UniRef50_UPI0001855288 conserved hypothetical protein n=1 Tax=Francisella novicida FTG RepID=UPI0001855288 Length = 414 Score = 265 bits (676), Expect = 4e-69, Method: Composition-based stats. Identities = 99/422 (23%), Positives = 178/422 (42%), Gaps = 32/422 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 KLP GW + V + G T+ + K+ PL+ + N++N D T F+ Sbjct: 21 KLPAGWEWKKLGEVFDVKDG-THDSPK----YKEIGYPLVTSKNLKNNSLDLTSCKFISN 75 Query: 65 N---LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + K+ D++ AM +G + + + +P Sbjct: 76 DDFIKINQRSKVDKGDLLFAM----IGTIGSPTIVDFEPDFAIKN-VALFKPSNTYLIEL 130 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ S L K+ + GA + P+PPLAEQK I KLD+L ++D Sbjct: 131 LKYWLSSHLTTQKMLEEAKGATQKFVGLTYLRNFPAPLPPLAEQKRIVAKLDSLFEKIDK 190 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 +Q + L + ++ E ++S FK L+ S +R G + Sbjct: 191 AIELHQQNITNANTLMASALD--------KTFKKLEREYS-FKILDCLS--ENIRYGYTD 239 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 K E G +RI+ + +++ +++ ++L+R+KL GD+L R + V Sbjct: 240 KAKEKGNA-RFIRITDINDQGKFKDESVYVDIKNTDLDRYKLLVGDILVARSGATAGKVA 298 Query: 302 VCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 + L + ++ LIR RL LP +I F S + N + + +K Q + Sbjct: 299 LFTL-----DEFSVFASYLIRIRLQIDKVLPSFIFYFCYSSNYWNQL-DQIKIGGAQPNV 352 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + ++K+ + LPP+ Q + V ++ + D I++ L + L SIL KAFRG Sbjct: 353 NATNLKNIKIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRG 412 Query: 421 EL 422 EL Sbjct: 413 EL 414 >UniRef50_A6TLK6 Restriction modification system DNA specificity domain n=2 
Tax=Clostridiaceae RepID=A6TLK6_ALKMQ Length = 467 Score = 264 bits (675), Expect = 5e-69, Method: Composition-based stats. Identities = 105/451 (23%), Positives = 192/451 (42%), Gaps = 29/451 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTDLVFV 62 +PE WV + VTT+I G T + I Y ++ +P I ++ + Sbjct: 28 VPENWVWTRLGNVTTIIGGGTPP-SRVIEYYENGSIPWISPVDLSGYTDIYISHGKKNIT 86 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L K S ++ PE+ V+ S V A L F +F P ++ Sbjct: 87 ELGLKKSSARLLPENTVLLSSRAPIGYVAI-ADNELCTNQGFKSFL----PSPCYLPKYL 141 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + KSS + + + ++G + ++ P+PPLAEQ+ I +++++L +++ Sbjct: 142 YFYLKSS--KKLLEAYASGTTFLELSGRKAAIVEFPLPPLAEQQRIVDRIESLFEKLNQA 199 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS- 241 KA + + + A+L A +G+LTEKWR K + + R G + Sbjct: 200 KALIQDALDSFENRKAAILHKAFSGELTEKWREENGVGMGSWKKKSIKEVVKFRAGYAFD 259 Query: 242 KPNESGVGHPILRISSVRAGHVDQN-DIRFLE---CSESELNRHKLQDGDLLFTRYNGSL 297 N S GH ++R+ ++ G +D + ++ S + R + +GD+L T Sbjct: 260 SKNFSSTGHQVIRMGNLYNGVLDLTRNPVYISPDLIDNSIIKRFSINEGDILLTLTGTKY 319 Query: 298 EF-VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + G L+K+ + NLL +++ + Y+ + S R+ + Sbjct: 320 KRDYGYAVLIKESE--NLLLNQRILSLTP-ESIETNYLLYYLQSDFFRDVFFSNETGGVN 376 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q +S K ++ + + EQ EIVR ++ +F D Q+ + + ++ + +SILA+ Sbjct: 377 QGNVSSKFVEKIEIPIFSSLEQKEIVRILDYIFEK-DKNANQLCDLIDNIDLMKKSILAR 435 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKI 447 AFRGEL E SA LL+ I Sbjct: 436 AFRGELGTNNPEEE--------SAMELLKDI 458 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. 
Identities = 50/251 (19%), Positives = 94/251 (37%), Gaps = 22/251 (8%) Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 A+ +Q+ + + QA++ + N P++ V+ +L + + S Sbjct: 2 AKKKQVLPVEELLEQALVPK-------SEKSNVVPENWVWTRLGNVTTIIGGGTPPSRVI 54 Query: 244 NESGVG-HPILRISSV---RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 G P + + ++ E + + L + +L + Sbjct: 55 EYYENGSIPWISPVDLSGYTDIYISHGKKNITELGLKKSSARLLPENTVLLSSRAP---- 110 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 +G + N + L + LP+Y+ + S ++ + + Sbjct: 111 IGYVAIADNELCTNQGFKSFL----PSPCYLPKYLYFYLKSS---KKLLEAYASGTTFLE 163 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +SG+ LPP+ EQ IV R+E LF + + + +AL N +IL KAF Sbjct: 164 LSGRKAAIVEFPLPPLAEQQRIVDRIESLFEKLNQAKALIQDALDSFENRKAAILHKAFS 223 Query: 420 GELTAQWRAEN 430 GELT +WR EN Sbjct: 224 GELTEKWREEN 234 >UniRef50_Q8PTL2 Type I restriction-modification system specificity subunit n=2 Tax=Methanosarcina RepID=Q8PTL2_METMA Length = 440 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. 
Identities = 120/472 (25%), Positives = 207/472 (43%), Gaps = 64/472 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + + + G KK + G+FD VF Sbjct: 6 ELPEGWAECQIKDIVVINYGKGLKKSDRVE-----------------GQFD----VFGSN 44 Query: 65 NLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS--FGAFCGVLRPEKLIFSGF 121 +V K +Q ++ VI GS + S+ P + + F G+ R F Sbjct: 45 GIVGKHNQSLTNGPTVIIGRKGSVGEINLSSEPCWPIDTTYYIDNFYGINRI-------F 97 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K+ +++ I I +P+PPL+EQ I ++ L A++D+ Sbjct: 98 LYYLLKTLN----LANYDTSTAIPGINRNDIYSQLVPLPPLSEQHRIVSAIEALFARLDA 153 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH--------------------S 221 T + +++ +ILK+FR++VL A +G+LTE+WR + Sbjct: 154 TNEKLDRVQEILKKFRESVLAAACDGRLTEEWRKENLHCNEYFAIDEDQFNLVKQWRIPT 213 Query: 222 VFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLEC--SESELN 279 V+ E + + + S P + +G +R S ++ GH+D ++ +++ + Sbjct: 214 VWSWSTLEDSCSHVVDCPHSTPKWTDIGVYCVRTSELKCGHIDFSNAKYVSEATYLERIK 273 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 R K Q+GD+L++R VG+ L+ + + +L+ R + +P + + Sbjct: 274 RLKPQEGDILYSREG----TVGIASLVP--SNVKICLGQRLMLFRTKNNLIPSFFVKVLN 327 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 SP +++ ++ + DIK LPP+ EQ EIVRRV+ LFA+AD+IE +V Sbjct: 328 SPYIYDSVKKSTMGSTA-PRFNVADIKKFPTPLPPLPEQQEIVRRVDALFAFADSIETKV 386 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 A + L QSILAKAF G+L +A L+E+IK ER Sbjct: 387 AAAREKTEKLRQSILAKAFSGQLVETQAEIVRREGRDYETAEVLIERIKEER 438 >UniRef50_P06991 Type-1 restriction enzyme EcoDI specificity protein n=1 Tax=Escherichia coli RepID=T1SD_ECOLX Length = 444 Score = 261 bits (668), Expect = 3e-68, Method: Composition-based stats. 
Identities = 184/489 (37%), Positives = 246/489 (50%), Gaps = 70/489 (14%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIR----ANNIQNGKFDT 56 MSAGKLP W + + L G K A D P + I + FDT Sbjct: 1 MSAGKLPVDWKTVELGELIKLSTG----KLDANAADNDGQYPFFTCAESVSQINSWAFDT 56 Query: 57 TDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF--CGVLRPE 114 + V+ +GS S+ + F A+ V+ P Sbjct: 57 S--------------------AVLLAGNGSFSI--------KKYTGKFNAYQRTYVIEPI 88 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 LI + F+ + ++ KI+ G+ I I+ I++ +P +EQ +IAEKLDT Sbjct: 89 -LIKTEFLYWLLRGNI--KKITENGRGSTIPYIRKGDITDISVALPSPSEQTLIAEKLDT 145 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKK--------- 225 LLAQV+STKAR EQIPQILKRFRQAVL A+NG+LT++WR+ + F Sbjct: 146 LLAQVESTKARLEQIPQILKRFRQAVLTFAMNGELTKEWRSQNNNPAFFPAEKNSLKQFR 205 Query: 226 -------LNFESILTELRNGLSSKPNESGVGHP---ILRISSVRAGHVDQNDIRFLECSE 275 N S + + + +S + +P L + + + + + + Sbjct: 206 NKELPSIPNNWSWMRFDQVADIASKLKSPLDYPNTIHLAPNHIESWTGKASGYQTILEDG 265 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE 335 +H+ G +++++ L V + +YP + ++ Sbjct: 266 VTSAKHEFYTGQIIYSKIRPYLCKVTIATFDGMCSAD--MYP-------INSKIDTHFLF 316 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 + + + + N T I+ KD+ V PP+ EQ EIVRRVEQLFAYADTI Sbjct: 317 RWMLTNTFTDWASNAESRTV-LPKINQKDLSEIPVPTPPLPEQHEIVRRVEQLFAYADTI 375 Query: 396 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG 455 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG Sbjct: 376 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG 435 Query: 456 GKKASRKKS 464 GKKASRKKS Sbjct: 436 GKKASRKKS 444 >UniRef50_Q3JBU1 Restriction modification system DNA specificity domain n=2 Tax=Nitrosococcus oceani RepID=Q3JBU1_NITOC Length = 547 Score = 260 bits (663), Expect = 1e-67, Method: Composition-based stats. 
Identities = 109/527 (20%), Positives = 199/527 (37%), Gaps = 87/527 (16%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQ-AINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 P GWV + + G ++ D +PLIR + + + + V++P Sbjct: 6 PTGWVFCRFGDIARIRNGYAFRSSAFKKTKTHDCDVPLIRQSQLIGTAVNIGEAVYLPAE 65 Query: 66 LVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR--PEKLIFSGF 121 ++ I+ DI+I MS +GK F G + E + S F Sbjct: 66 YLERFAQYVINKGDILIGMSGA----IGKVCRYKNGFPALQNQRTGKIEVFDESQMDSRF 121 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S ++ + G + NI + + + +PP EQ+ I K++ L +++D Sbjct: 122 FGLYLSS--IEGELIRQAKGMAVQNISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDK 179 Query: 182 TKARF----EQIPQILKRFRQAVLG-----------------------------GAVNGK 208 EQ+ + + A + Sbjct: 180 GIESLKTAREQLKVYRQAVLKHAFEGKLTAQWREENKDKLESPEQLLARIQQEREARYQQ 239 Query: 209 LTEKWRNFEPQHSV-------------------------------FKKLNFESILTELRN 237 E+W+ + + + +N Sbjct: 240 QLEEWKAAVKAWEATGKEGKKPGKPKKSLAIKINSFKIPKNFPNGWISIQLRELFESTQN 299 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 GL+ + SG P++R++ ++ VD +D+R ++ +E+ +++L DLL R NGS Sbjct: 300 GLAKRQGTSGKPIPVIRLADIKNQEVDSSDLRSIKLDATEIQKYELSRNDLLCIRVNGSP 359 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSG 356 VG L K + Y D IR R + LP YI++ F + + R + +++G Sbjct: 360 NLVGRMILFK--HDNVMAYCDHFIRFRFPQGIVLPSYIQMLFDTQTVRRYIELNKVSSAG 417 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q +S I + + + EQ IV R+E+ ++ ++ R+ +L QSIL K Sbjct: 418 QNTVSQTTISALAIPYCSLMEQKIIVSRLEEQLTSISAVKVEIEENFQRLKSLRQSILKK 477 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG-GKKASRK 462 AF G+L Q + P A+ LLE+I+AE+ + +RK Sbjct: 478 AFSGQLVPQDPKDEP--------ASKLLERIRAEKEKIPHPTRRTRK 516 Score = 127 bits (318), Expect = 1e-27, Method: Composition-based stats. 
Identities = 56/225 (24%), Positives = 93/225 (41%), Gaps = 11/225 (4%) Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEF 299 K P++R S + V+ + +L E ++ + GD+L Sbjct: 32 KKTKTHDCDVPLIRQSQLIGTAVNIGEAVYLPAEYLERFAQYVINKGDILIGMSGA---- 87 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 +G K I + ++ SS ++ K + Q Sbjct: 88 IGKVCRYKNGFPALQNQRTGKIEVFDESQMDSRFFGLYLSSIEG--ELIRQAKGMAVQ-N 144 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 IS KDI++ + LPP EQ IV ++E+LF+ D + + A ++ Q++L AF Sbjct: 145 ISAKDIEALPLGLPPYNEQQRIVAKIEELFSELDKGIESLKTAREQLKVYRQAVLKHAFE 204 Query: 420 GELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 G+LTAQWR EN D + S LL +I+ ER A ++ K+ Sbjct: 205 GKLTAQWREENKDKL---ESPEQLLARIQQEREARYQQQLEEWKA 246 Score = 118 bits (295), Expect = 5e-25, Method: Composition-based stats. Identities = 52/243 (21%), Positives = 104/243 (42%), Gaps = 7/243 (2%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P GW+ + + + K++ +P+IR +I+N + D++DL + + Sbjct: 282 PNGWISIQLRELFESTQNGLAKRQGTSGKP----IPVIRLADIKNQEVDSSDLRSIKLDA 337 Query: 67 VK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAH 124 + + ++S D++ +GS ++VG+ ++ R P+ ++ +I Sbjct: 338 TEIQKYELSRNDLLCIRVNGSPNLVGRMILFKHDNVMAYCDHFIRFRFPQGIVLPSYIQM 397 Query: 125 FTKSSLYRNKIS-SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + R I + + A N + + + IP L EQKII +L+ L + + K Sbjct: 398 LFDTQTVRRYIELNKVSSAGQNTVSQTTISALAIPYCSLMEQKIIVSRLEEQLTSISAVK 457 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 E+ Q LK RQ++L A +G+L + EP + +++ E + KP Sbjct: 458 VEIEENFQRLKSLRQSILKKAFSGQLVPQDPKDEPASKLLERIRAEKEKIPHPTRRTRKP 517 Query: 244 NES 246 S Sbjct: 518 TAS 520 >UniRef50_C3Q383 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3Q383_9BACE Length = 428 Score = 259 bits (662), Expect = 1e-67, Method: Composition-based stats. 
Identities = 98/431 (22%), Positives = 172/431 (39%), Gaps = 31/431 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W + P+ + G+TY ++D ++R++NIQN K + D V+V Sbjct: 15 IGEIPNHWEVVPLKRTGSFENGLTYSPND----IRDKGYIVLRSSNIQNSKMNYEDTVYV 70 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 V + DI+I +GS S+VGK A +FGAF P I + + Sbjct: 71 ES--VPNDLLVKKGDIIICSRNGSASLVGKCAKFDGKIAATFGAFMMRYSPS--INNEY- 125 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F + L + IN + + P+PPL+EQ+ IA LD ++D Sbjct: 126 -AFFSFQILMRNYKGLFTTSTINQLTKNVIAQMVCPLPPLSEQQAIASYLDAKTEKIDKM 184 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHS-VFKKLNFESIL 232 A+ E+ + L +Q+++ AV L KW P+H K S + Sbjct: 185 IAKAEKKIEYLGELKQSLITRAVTRGLNPNASLKDSGVKWIGKVPEHWETIKLSRVYSYI 244 Query: 233 TELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 LSS+ + G+ L+ + G + Q + + + E ++ Sbjct: 245 GSGTTPLSSQEDYYSEEGYNWLQTGDLNNGLITQTSKKITKKAIDECRMKFYPKHSVVIA 304 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 Y ++ VG+ L + T+ P + F+S +A+ ++ Sbjct: 305 MYGATIGKVGLLDLESTTNQACCVIS-------PTQKMNPLF--TFYSFMAAKKELL-LA 354 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 GQ IS IK V +PP++EQ I+ +++ D I +A + L Q Sbjct: 355 SFGGGQPNISQDIIKKLRVPVPPLEEQNAIILSLKKECDTIDHIIATQKKKIAYLQELKQ 414 Query: 412 SILAKAFRGEL 422 S++ G++ Sbjct: 415 SLITNVVTGKI 425 >UniRef50_B2A6M8 Restriction modification system DNA specificity domain n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A6M8_NATTJ Length = 490 Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats. 
Identities = 118/481 (24%), Positives = 211/481 (43%), Gaps = 50/481 (10%) Query: 5 KLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV- 62 +LP W + + + G T K+ + +P+ R +IQN + + ++ Sbjct: 26 ELPNNWAWVALDILAEEIKNGTTIKQSKTKP-----GIPVTRIESIQNNEIQLDRVRYIR 80 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLR-PEKLIFSG 120 + +K + DIV++ + S VGK+A + G +R +I Sbjct: 81 DLDKIKNNDYYKIGDIVLSHIN-SIEHVGKTALIKEDYLPLIHGMNLLRIRVNNNMILPQ 139 Query: 121 FIAHFTKSSLYRNK-ISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ +T+S +R + + N ++ + I+IPI P EQ+ I K+D LL+++ Sbjct: 140 FLQLYTRSYNFRKAVLKRIKMAVNQVSLNQKNLKQISIPIAPKNEQRRIVYKVDRLLSKI 199 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN------------------------ 215 + K + + + R A+L A G+LT + N Sbjct: 200 NKAKELIGEAKETFELRRAAILDKAFKGELTWREENPRVESVDTLLAKINSEKKTDIKKS 259 Query: 216 ---FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQNDIRFL 271 + ++ ++ G S+K + G P+LR+ +++ G +D ND+++L Sbjct: 260 PNGLYELPDNWCWIDLGELICHSSYGTSAKAYKDINGLPVLRMGNIKLTGSIDLNDLKYL 319 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDAL 330 ++ ++KL++ DLLF R N S E VG +++ Y LI+ L K L Sbjct: 320 PFDHKDVEKYKLEEYDLLFNRTN-SYELVGKSAIVEPEHAGKFTYASYLIKISLFYKKIL 378 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 YI + +S R +++ VK GQ I+ K + S V LPP +E EI R ++++ A Sbjct: 379 APYICYYINSHIGRKYLLSTVKQQVGQANINSKKLSSLPVPLPPEEEIKEINRIMKKVSA 438 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAE 450 + I+ + N V L QSIL+KAFRGEL + SA LL+++ + Sbjct: 439 KENRIQNLL-NLGTYVAELEQSILSKAFRGELNTNDPKDE--------SAIELLKEVLKD 489 Query: 451 R 451 + Sbjct: 490 K 490 Score = 152 bits (383), Expect = 3e-35, Method: Composition-based stats. 
Identities = 62/262 (23%), Positives = 129/262 (49%), Gaps = 9/262 (3%) Query: 197 RQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRIS 256 + + + + + + + + + + E++NG + K +++ G P+ RI Sbjct: 5 KSKRIEELLEETIVHEDEEPYELPNNWAWVALDILAEEIKNGTTIKQSKTKPGIPVTRIE 64 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 S++ + + +R++ + N + GD++ + N S+E VG L+K+ + L++ Sbjct: 65 SIQNNEIQLDRVRYIRDLDKIKNNDYYKIGDIVLSHIN-SIEHVGKTALIKE-DYLPLIH 122 Query: 317 PDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPV 375 L+R R+ + LP++++++ S + R A++ +K Q ++ K++K + + P Sbjct: 123 GMNLLRIRVNNNMILPQFLQLYTRSYNFRKAVLKRIKMAVNQVSLNQKNLKQISIPIAPK 182 Query: 376 KEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLIS 435 EQ IV +V++L + + ++ + A +IL KAF+GELT WR ENP + Sbjct: 183 NEQRRIVYKVDRLLSKINKAKELIGEAKETFELRRAAILDKAFKGELT--WREENPRV-- 238 Query: 436 GENSAAALLEKIKAERAASGGK 457 S LL KI +E+ K Sbjct: 239 --ESVDTLLAKINSEKKTDIKK 258 >UniRef50_B0PEE2 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PEE2_9FIRM Length = 388 Score = 258 bits (658), Expect = 5e-67, Method: Composition-based stats. 
Identities = 108/418 (25%), Positives = 192/418 (45%), Gaps = 37/418 (8%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 ++ + V T I G +K + +D LP+IR N+ N + ++E Sbjct: 1 MSTLGNVATYINGRAFKPSE----WEDSGLPIIRIQNLTN----FSAPYNYSSRELEEKY 52 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 K++ D++ A S+ + + K + + P + I ++ +F Sbjct: 53 KVTRGDLLFAWSASLGAHIWKG------NDAWLNQHIFRVVPSEQIEKKYLYYFLL--QV 104 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 ++ + + G+ + +I F IP+P L EQK I K++ L +++D++ A + + Sbjct: 105 VAELHAKTHGSGMVHITKGPFMNTPIPVPSLPEQKRIVSKIEELFSKLDASVAELQTAKE 164 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE-SGVGH 250 LK +RQAVL A + EK + E I+ + R G S K + G Sbjct: 165 KLKVYRQAVLKEAFDPVSKEK-------------ILLEDIIEKPRYGTSKKCSYAYKNGF 211 Query: 251 P-ILRISSV--RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 + RI ++ + G +D DI++ S+ EL L + DLL R NGS+ VG ++K Sbjct: 212 KAVYRIPNICYQNGSIDHKDIKYAGFSDDELKNLDLIENDLLIIRSNGSVSLVGRSSIVK 271 Query: 308 KLQHQNLLYPDKLIRARLTK--DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + + + LIR RL K + L +++ F S +AR + + K+TSG I+ +I Sbjct: 272 -AEDCDATFAGYLIRLRLKKPSEVLSKFLHYFLESHAARTYIEHVAKSTSGVNNINSNEI 330 Query: 366 KSQVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + V QA+ V ++E + D I++ ++ +L + L QSIL +AF GEL Sbjct: 331 SNLPVPKCDDFDMQAQTVVKIETNLSICDDIQQTIDTSLQQAEALRQSILKQAFEGEL 388 >UniRef50_C5RH89 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RH89_CLOCL Length = 457 Score = 257 bits (656), Expect = 8e-67, Method: Composition-based stats. 
Identities = 96/454 (21%), Positives = 198/454 (43%), Gaps = 34/454 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE WV + + ++ L+ G T K Y +P I+ ++ G+ + + Sbjct: 23 EVPENWVWSNLKSIADLVTGNTPSKNNEEFY--GGKIPFIKPTDLNQGRILNSSTETLSN 80 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 +++ + + + +GK A+ L E + + P+K I++ ++ + Sbjct: 81 IGATKARILPKGSTAVCCIGAT---IGKVAY--LNVEGATNQQINSIIPKK-IYNLYVYY 134 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 +T SS + + + S+ + I + + IP+PPL EQ+ I +++ L ++D K Sbjct: 135 YTLSSYFHDTLIENSSSTTLPIINKSRMGELLIPLPPLKEQQRIVNRIENLFEKLDKAKE 194 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKW--------RNFEPQHSVFKKLNFESILTELR 236 E+ + ++ + A+ A G L + F +K E I ++ Sbjct: 195 LIEEAREGFEKRKAAITSKAFRGILNYRKGEKVNPINEGFYKLPYNWKWTKLEDICEKIT 254 Query: 237 NGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSES--ELNRHKLQDGDLLFTRY 293 +G + P G + + +++ +D + I ++ E R ++ GD+L+ + Sbjct: 255 DGTHNSPKSYEYGDYKYVTAKNIKEWGIDLSSITYVTKKEHIPIYKRCDVKYGDILYIKD 314 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 + G+ + + + +LL LIR K +Y+ +S + ++ VK Sbjct: 315 GAT---TGIATINELTEEFSLLSSVALIRV--GKCIDNKYLYYILNSFEIKKRILESVK- 368 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 ++ K I ++ LPP++EQ EIV+ +++L I K++ ++N + +SI Sbjct: 369 GVAITRLTLKKINDIIIPLPPLEEQKEIVKILDKLLEEESKI-KELTQLEDQINLIKKSI 427 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 LAKAFRG+L + SA LL+KI Sbjct: 428 LAKAFRGQLGTN--------CEEDESALELLKKI 453 Score = 99.4 bits (246), Expect = 2e-19, Method: Composition-based stats. 
Identities = 38/224 (16%), Positives = 77/224 (34%), Gaps = 10/224 (4%) Query: 197 RQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRIS 256 + L + + + P++ V+ L + L + G P ++ + Sbjct: 4 KNLTLEEKLEDAIVKDVPYEVPENWVWSNLKSIADLVTGNTPSKNNEEFYGGKIPFIKPT 63 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 + G + N + L G ++ V + Q Sbjct: 64 DLNQGRI-LNSSTETLSNIGATKARILPKGSTAVCCIGATIGKVAYLNVEGATNQQ---- 118 Query: 317 PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK 376 I + + K Y+ + S + ++ +++ I+ + ++ LPP+K Sbjct: 119 ----INSIIPKKIYNLYVYYYTLSSYFHDTLIEN-SSSTTLPIINKSRMGELLIPLPPLK 173 Query: 377 EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 EQ IV R+E LF D ++ + A +I +KAFRG Sbjct: 174 EQQRIVNRIENLFEKLDKAKELIEEAREGFEKRKAAITSKAFRG 217 >UniRef50_A1VBQ9 Restriction modification system DNA specificity domain n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VBQ9_DESVV Length = 595 Score = 256 bits (654), Expect = 1e-66, Method: Composition-based stats. Identities = 105/517 (20%), Positives = 196/517 (37%), Gaps = 91/517 (17%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W VS + + G +K + + +D+ +PLIR +I + + T+ + Sbjct: 23 DHWKRVYVSEIAMVQNGFAFKSK---FFSRDEGIPLIRIRDILSAE---TEHKYF--GQF 74 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + + D++I M + A+ C ++ + F + Sbjct: 75 DKEYLVHNGDLLIGMDGDFVA-----AYWPGKEGLLNQRVCRIVIESENYDKKFFFLALQ 129 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF- 186 Y + I ++ + ++ + + I +P+PPL EQ I K++ L +++D+ Sbjct: 130 --PYLDAIHEKTSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKIEELFSELDAGVENLT 187 Query: 187 ---EQIPQILKRFRQAVLGGAVNG-----------------------------KLTEKWR 214 EQ+ + + G + K E+W Sbjct: 188 KAKEQLGVYRQSLLKHAFEGKLTEAWRKRNADKLESGEALLKRVKKEREEYFKKQLEQWE 247 Query: 215 NFEPQHSV----------------------------------FKKLNFESILTELRNGLS 240 Q + +++ G S Sbjct: 248 KDVAQWEADGKPGKKPTQPKKPKKLAPISEEELKELPELPEGWVWARLGNLIDPPAYGTS 307 Query: 241 
SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 K + + G +LRI ++ G +D +D+++ S E +++L+ GDLL R NGS+ V Sbjct: 308 RKSDYNIDGTGVLRIPNIVDGKIDSSDLKYTAFSPGEEEQYRLKAGDLLTIRSNGSVSLV 367 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G C L++ +Y LIR R + +++ SS RN + + K+TSG I Sbjct: 368 GQCALIED-DDTRYVYAGYLIRLRTIGLLVSKFLLYCLSSLRLRNQIESKAKSTSGVNNI 426 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + +++ S +V L EQ E+ + + + A + L + L QSIL KAF G Sbjct: 427 NSQELSSLIVPLCSQLEQNEVSKLLADSLSTAGEQTSMIEIQLEHIRILKQSILDKAFSG 486 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK 457 L +Q + P A+ LLE+IK ER ++ Sbjct: 487 TLISQDPNDEP--------ASKLLERIKQERKSAPNP 515 Score = 117 bits (293), Expect = 1e-24, Method: Composition-based stats. Identities = 54/257 (21%), Positives = 106/257 (41%), Gaps = 21/257 (8%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES--GVGHPILRISSVRAGHVDQ 265 ++ ++ +N + K + S + ++NG + K G P++RI + + Sbjct: 9 EIVKEGKNPLLGKADHWKRVYVSEIAMVQNGFAFKSKFFSRDEGIPLIRIRDILSAE--- 65 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 + ++ + E + + +GDLL V + L ++ R + Sbjct: 66 TEHKYFGQFDKE---YLVHNGDLLIGMDGDF-----VAAYWPGKEG---LLNQRVCRIVI 114 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + + P ++ ++ K +S K + + LPP+ EQ IV ++ Sbjct: 115 ESENYDKKFFFLALQPYL--DAIHEKTSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKI 172 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLE 445 E+LF+ D + + A ++ QS+L AF G+LT WR N D + S ALL+ Sbjct: 173 EELFSELDAGVENLTKAKEQLGVYRQSLLKHAFEGKLTEAWRKRNADKL---ESGEALLK 229 Query: 446 KIKAERAASGGKKASRK 462 ++K ER K+ + Sbjct: 230 RVKKEREEYFKKQLEQW 246 >UniRef50_A3PYN5 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium sp. JLS RepID=A3PYN5_MYCSJ Length = 451 Score = 254 bits (649), Expect = 4e-66, Method: Composition-based stats. 
Identities = 94/441 (21%), Positives = 179/441 (40%), Gaps = 30/441 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLI--RANNIQN-----GKFDT 56 G++P GW ++P+ V T+ + D+ +P+ ++ G D Sbjct: 18 GRVPSGWAVSPLKNVATVF------PSSVDKHSHDNEIPVQLCNYTDVYKNERISGALDF 71 Query: 57 TDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEK 115 P+ + K + D +I S + +G SA+ + G V+RP Sbjct: 72 MKATATPEEIKK--FTLKQGDTIITKDSETADDIGISAYVEETLPDVLCGYHLSVVRPLP 129 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + F+ S + + + G + + D +NIP+PP EQ IA+ L+ Sbjct: 130 GLDGRFVKRLFDSHYLKASMEVSANGLTRVGLGQYAIDNLNIPLPPPDEQLQIADFLEAE 189 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESIL--- 232 A++D+ A+ E + L+ R A + AV L +P +S L Sbjct: 190 TAKIDALIAKQEHLIATLREDRTATITHAVTKGLDPTVDMVQPHNSELPACPKHWTLLIS 249 Query: 233 --------TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 T L G S P E+ V P LR+++V+ V+ ++++ + SEL R+ L+ Sbjct: 250 LKRLAEVQTGLTLGKSVDPAEA-VDVPYLRVANVQTSGVNLDEVKTVAVHRSELKRYLLR 308 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 DGD+L T G ++ +G G + + ++ + + R + +++ + AR Sbjct: 309 DGDVLMTE-GGDIDKLGR-GCVWSGEIAPCIHQNHVFAVRCSDALSGDFLVYLLDTAVAR 366 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 N K T+ + + + LPP EQ EIV + + A D + + N + Sbjct: 367 NYFFMTAKKTTNLASTNSTTLGAFTFSLPPRAEQDEIVDHLNERCAGLDALIAKANAVIT 426 Query: 405 RVNNLTQSILAKAFRGELTAQ 425 + +++ A G++ + Sbjct: 427 VLREYRAALITDAVTGKIDVR 447 >UniRef50_Q26D97 Putative type I site-speicific deoxyribonuclease specificity subunit n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26D97_9BACT Length = 468 Score = 254 bits (649), Expect = 5e-66, Method: Composition-based stats. 
Identities = 119/463 (25%), Positives = 206/463 (44%), Gaps = 45/463 (9%) Query: 5 KLPEGWVIAPVSTVTT---LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 +LP+GWV +S++ L + + + + + + + LI+ +I G F F Sbjct: 4 ELPKGWVETNISSLVDDTGLFKDGDWVESKDQD--PNGNVRLIQLADIGLGNFRDKSQRF 61 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIF 118 + + + + DI++A +G+S L E ++RP +K I Sbjct: 62 LNQETAERLNCNFLEQNDILVARM---PDPIGRSCLFPLKGENVTVVDVAIIRPSKKHIN 118 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 +++H+ S ++ IS L++G+ I + D I P+PP AEQ I K+D L+AQ Sbjct: 119 YKWLSHWINSPVFHKNISELASGSTRKRISRRNLDKIPFPLPPRAEQDRIVAKVDALMAQ 178 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + E+IPQ+LK FRQ VL + +++ E ++++G Sbjct: 179 HAAIQQAMERIPQLLKDFRQQVLNQSFERN--------------IERVALEDCCHKIQDG 224 Query: 239 LSSKPNE-----SGVGHPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFT 291 P P + ++R ++ + + ++ R + GD+L T Sbjct: 225 AHHSPKYVSPIREKNMFPYVTSKNIRNDYMKLDTLTYVNEDFHNTIYPRCSPEFGDVLLT 284 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + S G L + + +LL LI+ K +P Y++ F S + + Sbjct: 285 KDGAS---TGNVTLNEFDEPISLLSSVCLIKT-DKKKLIPAYLKYFIQSSIGFSEFTGKM 340 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 T + K + K IK + LP V EQ EIVRRVE LF A IE++ ++++L Q Sbjct: 341 -TGTAIKRVVLKKIKKATIPLPSVPEQQEIVRRVESLFEKATAIEQRYEQLKLQIDSLPQ 399 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 +IL KAF+GEL Q + + SA LLE+IK ++ S Sbjct: 400 AILHKAFKGELVEQ--------LDSDGSAVELLEQIKNLKSNS 434 >UniRef50_C6A4W8 Putative type I specificity subunit HsdS n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A4W8_THESM Length = 434 Score = 252 bits (643), Expect = 2e-65, Method: Composition-based stats. 
Identities = 91/422 (21%), Positives = 174/422 (41%), Gaps = 25/422 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVP 63 +LPEGW + + L G T + + Y ++ +P ++ ++I +G + T+ Sbjct: 34 ELPEGWRWVRLGDIAELKAGGTPSR-RVKEYWENGTIPWVKISDIPDSGLVEKTEEKITE 92 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L S K+ ++ + S VG L + + P+ I G++ Sbjct: 93 LGLKNSSAKLLSPGTILFSIFATISKVGI-----LKIPAATNQAIVGIIPKISIDRGYLF 147 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + K + ++ G +NI + IP+PP+ EQK I KLD + +++ K Sbjct: 148 YSLK--YFGQELVYQGRGGVQDNINMRILSKLKIPLPPIEEQKRIVAKLDEVHRRLEEAK 205 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS-SK 242 + + +R + L + + W + + E++ G + +K Sbjct: 206 RLAREAREEAERLMASALHEVFSKAEEKGW----------EWTTIGKVSREMKPGFARNK 255 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVG 301 + S G P LR ++V G ++ I + + + + L+ GD+LF N S E VG Sbjct: 256 KHISRDGVPHLRPNNVDVGRLNLKKIVKVTLDDKINIEEYYLKKGDVLFNNTN-SFELVG 314 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 ++ + Y + + R R+ K+ LPE++ + + + GQ G+ Sbjct: 315 RAAIVPEDLKYG--YSNHITRIRVKKEVILPEWLTLAINYLWMQGYFREVCTRWVGQAGV 372 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + + LP ++EQ IV ++ + A + K + L +IL KAFRG Sbjct: 373 NMNTLAKTRIPLPSLEEQKRIVSYLDSIQERAQKLVKLYEEREKELEKLFPAILDKAFRG 432 Query: 421 EL 422 EL Sbjct: 433 EL 434 Score = 65.5 bits (158), Expect = 4e-09, Method: Composition-based stats. 
Identities = 33/213 (15%), Positives = 65/213 (30%), Gaps = 42/213 (19%) Query: 238 GLSSKPNESGVGHPILRISSV-------------------------------RAGHVDQN 266 G P E G +R+ + +G V++ Sbjct: 27 GKVKGPWELPEGWRWVRLGDIAELKAGGTPSRRVKEYWENGTIPWVKISDIPDSGLVEKT 86 Query: 267 DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT 326 + + E + L G +LF+ + G+LK N ++ Sbjct: 87 EEKITELGLKNSSAKLLSPGTILFSI----FATISKVGILKIPAATN----QAIVGIIPK 138 Query: 327 KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 Y+ F S + Q I+ + + + LPP++EQ IV +++ Sbjct: 139 ISIDRGYL---FYSLKYFGQELVYQGRGGVQDNINMRILSKLKIPLPPIEEQKRIVAKLD 195 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ + ++ A L S L + F Sbjct: 196 EVHRRLEEAKRLAREAREEAERLMASALHEVFS 228 >UniRef50_C6CR26 Restriction modification system DNA specificity domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CR26_DICZE Length = 462 Score = 251 bits (641), Expect = 4e-65, Method: Composition-based stats. Identities = 81/449 (18%), Positives = 172/449 (38%), Gaps = 36/449 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + ++ G T K Y ++ +P + + ++ +G Sbjct: 19 GQVPVHWNAVSLKWISQRYSGGTPDKSNDA-YWENGDIPWLNSGSVNDGYITEPSTYITR 77 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + S K P++ ++ +G G A L + + P++ F+ Sbjct: 78 EGFASSSAKWVPKNALVMALAGQGKTKGMVAQ--LGIRATCNQSMAAIIPKEKFTPRFLY 135 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + S+ I +++ G + + I P+ P EQ IA+ LD ++DS Sbjct: 136 WWLVSNY--QNIRNMAGGEQRDGLNLDMLGSIPCPLLPRPEQTAIADFLDRETGRIDSLM 193 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE-------------------KWRNFEPQHSVFK 224 A+ Q+ +LK R A++ V L E +W P+ K Sbjct: 194 AKKRQLIALLKEKRCALISHIVTRGLPEAAADEFGLKPHTRFKNSDIEWLGQVPEGWGVK 253 Query: 225 --KLNFESILTELRNGLS-----SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE 277 + S EL++G + G G P + + + G +D N ++E +++ Sbjct: 254 KVWIERVSRNIELQDGNHGEQHPKAEDYVGEGIPFVMANHIDNGKIDFNKCNYIEKEQAD 313 Query: 278 
LNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 R +GD+L T +G G+++K ++ ++ R ++ ++ Sbjct: 314 SLRIGFSNEGDVLLTHKG----TIGRVGIVQKSHFPYVMLTPQVTYYRCLREIQNRFLFW 369 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 S ++ + S + I D K+ L+P KEQ I +++ + D + Sbjct: 370 LMQSKFWQDQLKLLAGLGSTRAYIGLLDQKTLSFLIPSEKEQFAIATYLDRETSKLDRLV 429 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQ 425 ++V+ +AR+ +++ A G++ + Sbjct: 430 EKVDAVIARLQEYRTALITAAVTGKIDVR 458 >UniRef50_D0KMA1 Restriction modification system DNA specificity domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KMA1_PECWW Length = 493 Score = 251 bits (640), Expect = 6e-65, Method: Composition-based stats. Identities = 185/526 (35%), Positives = 241/526 (45%), Gaps = 96/526 (18%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPEGW + V L Y K A P+ +N I Sbjct: 2 MSVGKLPEGWKNIHLGDVIELK----YGKSLAAQVRDGIGYPVFGSNGIVG--------- 48 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 K S + + +I GS VV KS P + ++ +P Sbjct: 49 -------KHSIPLIKQSGLIVGRKGSYGVVQKSVEPFFPIDTTYYIDELFNQPIN----- 96 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F ++ ++ L+ I + ++I +PPL EQKIIAEKLDTLLAQVD Sbjct: 97 FWFYYLSFLP----LTKLNRSTTIPGLNRDDAYNLSINLPPLVEQKIIAEKLDTLLAQVD 152 Query: 181 STKARFEQIPQILKR------------------------------FRQAVLGGAVNGKLT 210 STKAR EQIP+ILKR F+ +V + Sbjct: 153 STKARLEQIPKILKRFRQAVLASALRGELTKKWRIDNKTGQDISSFKASVKKYRFESWVK 212 Query: 211 EK---------------WRN--------------FEPQHSVFKKLNFESILTELRNGLSS 241 E+ W+ P +F+ L+ ++ Sbjct: 213 EQEQKFINKGKQPRNDNWKKKYQEAIISQDISDKDIPDGWLFEPLDGLVYISARIGWKGL 272 Query: 242 KPNESGVGHP-ILRISSVRAGH-VDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLE 298 K +E V P L + S+ G + + E E KLQ+ D+L + Sbjct: 273 KASEYTVKGPLFLSVHSLNYGKEANLEQAYHISEHRYDESPEIKLQNNDILLCKDGAG-- 330 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 +G ++K L + L+ R +PEY+ F S P +N + + T S Sbjct: 331 
-IGKLSIVKNLNEPATI-NSSLLLIRGGDFFVPEYLFYFLSGPEMQNLVKERM-TGSAVP 387 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + +D+K V+ +PP+ EQ EIVRRVEQLFAYADTIEKQVN AL+RVNNLTQSILAKAF Sbjct: 388 HLFQRDVKEFVLEVPPLNEQHEIVRRVEQLFAYADTIEKQVNTALSRVNNLTQSILAKAF 447 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 RGELTAQWR ENPDLISGENSAA LLEKIKAERAAS GKKA RKK+ Sbjct: 448 RGELTAQWREENPDLISGENSAAVLLEKIKAERAASVGKKAPRKKA 493 >UniRef50_Q7UK98 Type I restriction enzyme EcoEI specificity protein n=1 Tax=Rhodopirellula baltica RepID=Q7UK98_RHOBA Length = 550 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 115/523 (21%), Positives = 200/523 (38%), Gaps = 93/523 (17%) Query: 2 SAGK------LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--K 53 S G+ LPEGW P+ + L+ G +K ++ + LP+IR N+ K Sbjct: 31 SGGESASGEALPEGWADVPIGDLCDLVNGRAFKPKE----WSETGLPIIRIQNLNKAEAK 86 Query: 54 FDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP 113 ++ D + K+L + P +++ A S + G + P VL Sbjct: 87 YNHFDGEYADKHL------VRPGELLFAWSGTPGTSFGAH-IWNGPKALLNQHIFRVLID 139 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 E + F L I G + ++ F+ +P+PPLAEQ I ++ Sbjct: 140 EDDLNMTFFRFAINHKL-EELIGKAHGGVGLRHVTKGKFEATQVPLPPLAEQSRIVSAIE 198 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKW-----------------RNF 216 +L + + ++ ++ + RQ+VL A +GKLT W R Sbjct: 199 SLQERSSRARFLLSEVGPLIGQLRQSVLRDAFSGKLTADWREANPNVEPAFKLLSRIRTE 258 Query: 217 EPQHSV---------------------------------------FKKLNFESILTELRN 237 + + ++ Sbjct: 259 RRERWEAEQLAKYEAKGKQPPKNWQDKYKEPEPVDESELPELPDGWCWCQVGDLIESFDA 318 Query: 238 GLS----SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 G S S P G + +L++S+V D N + L+ + + + GDLL +R Sbjct: 319 GRSPTALSHPARDGE-YGVLKVSAVTWREFDPNANKALKDGDEIGDTPTPRKGDLLISRA 377 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVK 352 N ++E +G L+K + NL+ DK +R +K+ +PEY+ S S R + Sbjct: 378 
N-TVELIGAVVLVK-ADYPNLMLSDKTLRMNPASKELVPEYLLYGLRSESVRKFFEDNAT 435 Query: 353 TTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 TS + +S I + L P+ EQ + + ++ + + + + L Q Sbjct: 436 GTSNSMRNLSQGKILDAPIALAPLAEQQAVADLLVTNDEACTSVASGLASMESSLTQLDQ 495 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 SIL+KAFRGEL Q + P A+ LL +I+A+R A+ Sbjct: 496 SILSKAFRGELVPQDPRDEP--------ASELLARIRAKRVAN 530 Score = 128 bits (321), Expect = 5e-28, Method: Composition-based stats. Identities = 61/269 (22%), Positives = 107/269 (39%), Gaps = 19/269 (7%) Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 A+ +Q P K R + A +G + + + L +L NG + KP Sbjct: 9 AKQKQTPTQDKLSRGESVEVARSGGESASGEALPEGWADVPIGD----LCDLVNGRAFKP 64 Query: 244 NES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 E G PI+RI + +++ + ++ ++H ++ G+LLF G Sbjct: 65 KEWSETGLPIIRIQN-----LNKAEAKYNHFDGEYADKHLVRPGELLFAWSGTPGTSFGA 119 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 L + R + +D L F + ++ G + ++ Sbjct: 120 ----HIWNGPKALLNQHIFRVLIDEDDLNMTFFRFAINHKL-EELIGKAHGGVGLRHVTK 174 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++ V LPP+ EQ+ IV +E L + ++ + L QS+L AF G+L Sbjct: 175 GKFEATQVPLPPLAEQSRIVSAIESLQERSSRARFLLSEVGPLIGQLRQSVLRDAFSGKL 234 Query: 423 TAQWRAENPDLISGENSAAALLEKIKAER 451 TA WR NP++ A LL +I+ ER Sbjct: 235 TADWREANPNV----EPAFKLLSRIRTER 259 >UniRef50_A3JH04 Specificity determinant for hsdM and hsdR n=1 Tax=Marinobacter sp. ELB17 RepID=A3JH04_9ALTE Length = 479 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. 
Identities = 160/489 (32%), Positives = 240/489 (49%), Gaps = 87/489 (17%) Query: 26 TYKKEQAINYLKDDYLPLI-RANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSS 84 T KK + + L+ P+I + N G D D + I+ D +I Sbjct: 23 TGKKVKTKDCLQTGRFPVIDQGQNPVAGYVDDPD------------RLINVSDPLIVFGD 70 Query: 85 GSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGAN 143 +++V + F GA +L+PE +F F + +S NK S Sbjct: 71 HTRAVKW------VDFSFVPGADGTKILQPEPYLFPRFAYYQLRSLEIPNKGYSR----- 119 Query: 144 INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGG 203 + + PLAEQK IA KLDTLLAQV++TKAR E+IP ILKRFRQ+VL Sbjct: 120 ----HFKFLKELKFEVAPLAEQKTIAVKLDTLLAQVENTKARLERIPTILKRFRQSVLAA 175 Query: 204 AVNGKLTEKWRNFEPQHSV----------------------------------------- 222 AV+G+LTE+WRN S Sbjct: 176 AVSGRLTEEWRNNRTTKSSPKKLLNHFEELRQIAVQDENLRTGKKTKYKPVTIDTYGTPG 235 Query: 223 -----FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSES 276 + + E++ T++ +G+ KP G P + + ++ G+ + + ++ + Sbjct: 236 DLPNSWYWIPVEALATKVTDGVHKKPTYISNGVPFITVKNLTKGNGISFTETNYISTHDH 295 Query: 277 E--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYI 334 E R + GD+L ++ +GV ++ ++ L+ + ++ Y+ Sbjct: 296 EEFCKRTNPEKGDILISKDG----TLGVVRQIRTDAIFSIFVSVALV--KPADRSMSNYL 349 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 E+ F S + M+ +G + I D++ ++ +PP++EQ EIV +V+QLFAYA+ Sbjct: 350 ELAFQSSVVQGQMIGV---GTGLQHIHLIDLRKDLIPVPPLEEQIEIVHQVDQLFAYAER 406 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 +E+QVNNALARVN LTQSILAKAFRGELT QWR +NP+LISGENSAAALLE+IK ERAA Sbjct: 407 VEQQVNNALARVNKLTQSILAKAFRGELTEQWRKDNPNLISGENSAAALLERIKVERAAM 466 Query: 455 GGKKASRKK 463 A+RK+ Sbjct: 467 KPTNAARKR 475 Score = 125 bits (313), Expect = 4e-27, Method: Composition-based stats. 
Identities = 53/223 (23%), Positives = 96/223 (43%), Gaps = 18/223 (8%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDLV 60 + G LP W PV + T + +KK I+ + +P I N+ G T+ Sbjct: 233 TPGDLPNSWYWIPVEALATKVTDGVHKKPTYIS----NGVPFITVKNLTKGNGISFTETN 288 Query: 61 FVPKNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGV--LRPEK 115 ++ + +E K DI+I+ G+ VV + + + F F V ++P Sbjct: 289 YISTHDHEEFCKRTNPEKGDILISK-DGTLGVV-----RQIRTDAIFSIFVSVALVKPAD 342 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 S ++ +SS+ + ++ + G + +I IP+PPL EQ I ++D L Sbjct: 343 RSMSNYLELAFQSSVVQGQM--IGVGTGLQHIHLIDLRKDLIPVPPLEEQIEIVHQVDQL 400 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 A + + + + + Q++L A G+LTE+WR P Sbjct: 401 FAYAERVEQQVNNALARVNKLTQSILAKAFRGELTEQWRKDNP 443 >UniRef50_Q4C702 Restriction modification system DNA specificity domain n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C702_CROWT Length = 408 Score = 248 bits (634), Expect = 2e-64, Method: Composition-based stats. 
Identities = 94/424 (22%), Positives = 176/424 (41%), Gaps = 32/424 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+ W + V + G P+I + N++ D +++ ++ + Sbjct: 10 LPQYWKWSKCQEVIDVRDG-----THDTPKYVSSGYPVITSKNLKTSGIDFSNVSYISEA 64 Query: 66 LVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 KE K+ DI++AM +G + E S + I+ + Sbjct: 65 DHKEISKRSKVDKGDILLAM----IGTIGNPVIVDIEKEFSIKNVALFKLSKSNIYPEYF 120 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + SS+ ++ G + + IP+PPL EQK IA+ LD Sbjct: 121 KYLLDSSIISRQLDFEQRGGTQKFVSLKVLRNLLIPLPPLEEQKRIAKILDKADEIRRKR 180 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 K ++L+ + G V ++ S + EL+ G +SK Sbjct: 181 KESIRLTDELLRSTFLDMFGDPV------------INPKGWEVKTLGSQIKELKYGTNSK 228 Query: 243 PNE--SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 +E +LRI ++ + ND+++ E+++ L++GDLLF R NG+ +++ Sbjct: 229 CSELQKNNNIAVLRIPNIDNEKISWNDLKYTNLDSKEISKLLLKNGDLLFVRSNGNPDYI 288 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTK--DALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 G C + ++ ++ +Y LIR RL D P +I + P+ R+ ++ +TT+G Sbjct: 289 GRCAIFEEESNRKAVYASYLIRGRLKSICDFHPAFIRDIIAFPTFRSFLIREARTTAGNY 348 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ +++ S ++ PP +Q E + + + +L NL S+L KAF Sbjct: 349 NINIQELSSLKLICPPQDKQEEYLD----ITTKINRSFLNKQKSLQESENLFNSLLQKAF 404 Query: 419 RGEL 422 +GEL Sbjct: 405 KGEL 408 >UniRef50_Q21ZK2 Restriction modification system DNA specificity domain n=4 Tax=Bacteria RepID=Q21ZK2_RHOFD Length = 397 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. 
Identities = 83/415 (20%), Positives = 168/415 (40%), Gaps = 29/415 (6%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 +S G T + Q Y + +P I++ ++ + + L + S K Sbjct: 7 VTLSEFCATGSGGTPSRAQMERYYEGGTIPWIKSGELRETVINGAEEHVTDVALKESSIK 66 Query: 73 ISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 + P I++AM + +G + + A C ++ ++ + ++ H S + Sbjct: 67 LVPAGAILLAMYGATVGRLGILGIE----ATTNQAVCHIIPDPRIAVTRYVYHALSSQV- 121 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 + S+ G NI + IP+P EQ+ IA LD A + Q+ Sbjct: 122 -PSLISMGVGGAQPNINQGIIKNLAIPLPAKPEQRRIAAILDQADALRAKRREALAQLDS 180 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV--- 248 + + + G V+ + + + +G++ N +G Sbjct: 181 LTQSIFIQMFGDPVSNPKG------------WPDATTLGQVANIASGVTKGRNLTGKVTR 228 Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 P L +++V+ ++ + ++ ++ +E E+ R+ L+ DLL T G + +G G L K Sbjct: 229 TIPYLAVANVQDKSLNLSAVKEIDATEDEIERYLLKWNDLLLTE-GGDPDKLGR-GTLWK 286 Query: 309 LQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + ++ + + R R+T A+ P ++ S + + K T+G I+ ++S Sbjct: 287 NELPECIHQNHIFRVRVTSQAVTPLFLNWLVGSQRGKKYFLRSAKQTTGIASINMTQLRS 346 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +LLPPV+ Q + E + + +LA + L S+ +AFRGEL Sbjct: 347 FPLLLPPVELQRDF----ETIAEVVAEQHAIHSVSLAELEALFVSLQHRAFRGEL 397 >UniRef50_A6CKF2 Putative type I restriction enzyme specificity protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CKF2_9BACI Length = 454 Score = 248 bits (632), Expect = 5e-64, Method: Composition-based stats. 
Identities = 90/448 (20%), Positives = 189/448 (42%), Gaps = 32/448 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W + + I+G +K + D +P+I+ +I+NGK +D+ F+ + Sbjct: 17 RVPEDWSEKKLKYLVETIKGYAFKSQ----LFGDKGVPIIKTTDIKNGKIQDSDI-FIDE 71 Query: 65 NLVKESQ--KISPEDIVIAMSSGSKSV----VGKSAHQHLPFE-CSFGAFCGVLRPE-KL 116 E + ++ DI+++ V VG+ +E +LR + K Sbjct: 72 RFEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYEGALLNQNAVILRCKSKD 131 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I + F+ +F S YR + + G N ++ +P+P Q I+E LD Sbjct: 132 ITNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQHQISEFLDHK 191 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 + V++ A +++ ++L+ RQA++ AV L KW P+H K+ Sbjct: 192 TSDVETLIADKQKLIELLEEKRQAIVTEAVTRGLNPDVKMKDSGVKWIGDIPEHWDISKI 251 Query: 227 NFESILTEL--RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKL 283 + + + GL S G ++ + + G + + + SE +L Sbjct: 252 KYSTYVKGRIGWQGLRSD-EFIDEGPYLVTGTDFKDGIIHWDTCYHISEERYSEAPPIQL 310 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 ++ DLL T+ +G ++K + +L + K+ L +++ +S Sbjct: 311 KENDLLITKDG----TIGKVAIVKNKPGKAILNSGIFVTRCQDKEYLTKFMYWILTSEVF 366 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 +N + ++T S K + + + LP ++EQ I +E D+++K++++ + Sbjct: 367 KNYI-KYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVREIDSVKKEISDQI 425 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENP 431 + QS++ +A G++ + E P Sbjct: 426 ELLKEYRQSLIYEAVTGKIDLRDYQEVP 453 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. 
Identities = 46/212 (21%), Positives = 96/212 (45%), Gaps = 9/212 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +PE W I+ + +T ++G + + D+ L+ + ++G + Sbjct: 239 IGDIPEHWDISKIKY-STYVKGRIGWQGLRSDEFIDEGPYLVTGTDFKDGIIHWDTCYHI 297 Query: 63 PKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRP-EKLIF 118 + E+ ++ D++I +GK A ++ P + + V R +K Sbjct: 298 SEERYSEAPPIQLKENDLLITK----DGTIGKVAIVKNKPGKAILNSGIFVTRCQDKEYL 353 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F+ S +++N I + G+ I ++ +F + P+P + EQK I L+T + + Sbjct: 354 TKFMYWILTSEVFKNYIKYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVRE 413 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +DS K ++LK +RQ+++ AV GK+ Sbjct: 414 IDSVKKEISDQIELLKEYRQSLIYEAVTGKID 445 Score = 124 bits (311), Expect = 7e-27, Method: Composition-based stats. Identities = 54/244 (22%), Positives = 96/244 (39%), Gaps = 15/244 (6%) Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV-GHPILRISSVRAGHV 263 W P+ KKL L E G + K G G PI++ + ++ G + Sbjct: 6 FQNSADINWYERVPEDWSEKKLK---YLVETIKGYAFKSQLFGDKGVPIIKTTDIKNGKI 62 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF----VGVCGLLKKLQHQNLLYPDK 319 +DI E E E +++ D+L + +E VG G ++K LL + Sbjct: 63 QDSDIFIDERFEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYEGALLNQNA 122 Query: 320 LIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQA 379 +I +KD ++ F +S S R + T+ Q +S KDI + LP K Q Sbjct: 123 VILRCKSKDITNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQH 182 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENS 439 +I ++ + +T+ + + Q+I+ +A R NPD+ ++ Sbjct: 183 QISEFLDHKTSDVETLIADKQKLIELLEEKRQAIVTEAVT-------RGLNPDVKMKDSG 235 Query: 440 AAAL 443 + Sbjct: 236 VKWI 239 >UniRef50_C0QCH4 HsdS2 n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QCH4_DESAH Length = 426 Score = 246 bits (629), Expect = 9e-64, Method: Composition-based stats. 
Identities = 78/425 (18%), Positives = 179/425 (42%), Gaps = 21/425 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +PE W + + + G D +P R+ NI +G D+V++ Sbjct: 12 IGWIPEDWDCVKLGGIVNKV-GSGITPRGGSKVYCDKGVPFFRSQNILHGTVSVKDIVYI 70 Query: 63 PKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFS 119 +NL ++ + + P D+++ ++ S +G+ F+ + ++RP+ I S Sbjct: 71 SENLHQKMKNTHLQPADVLLNITGAS---IGRCCVFPNNFKKGNVNQHVCIIRPDGTIKS 127 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ S + + +I + AG N + +P+PPL EQ+ IA+ L T+ ++ Sbjct: 128 QYLCSLLNSPIGQKQIWNFQAGGNREGLNFQQIRSFILPLPPLPEQQKIADVLSTVDDKI 187 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 S + +Q Q+ K + +L + G K + + + ++I + G+ Sbjct: 188 SSIDQQIQQTEQLKKGLMEKLLTEGI-GHTEFKDTEIGQIPASWDVVKLKTICHRIFVGI 246 Query: 240 --SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH-KLQDGDLLFTRYNGS 296 S+ + + G PI+R +++ + +D+ + +E N KL GD++ R Sbjct: 247 ATSTSEHYTNDGIPIIRNQNIKENSISGDDLLKITNDFNEKNHSKKLMVGDIITARTGYP 306 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 G+ ++ K + + R + P Y+ + +S + +++ + Sbjct: 307 ----GMSCVIPKKFEGAQTFTTLVSRPN-KERIFPHYLSRYINSDIGKKIVLSNQAGGA- 360 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q+ ++ +K ++LPP++EQ +I + + D + + + L + ++ + Sbjct: 361 QQNLNAGRLKEIPIILPPLEEQKQIATILSSVDDKIDVLRSKKTS----YTTLKKGLMGQ 416 Query: 417 AFRGE 421 G+ Sbjct: 417 LLTGQ 421 Score = 128 bits (322), Expect = 4e-28, Method: Composition-based stats. 
Identities = 49/212 (23%), Positives = 92/212 (43%), Gaps = 11/212 (5%) Query: 2 SAGKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++P W + + T+ + G+ + + +D +P+IR NI+ DL+ Sbjct: 222 EIGQIPASWDVVKLKTICHRIFVGIATSTSE---HYTNDGIPIIRNQNIKENSISGDDLL 278 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-I 117 + + ++ S+K+ DI+ A + G S FE + V RP K I Sbjct: 279 KITNDFNEKNHSKKLMVGDIITARTGYP----GMSCVIPKKFEGAQTFTTLVSRPNKERI 334 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F +++ + S + + + S AG N+ I I +PPL EQK IA L ++ Sbjct: 335 FPHYLSRYINSDIGKKIVLSNQAGGAQQNLNAGRLKEIPIILPPLEEQKQIATILSSVDD 394 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++D +++ + K +L G + K+ Sbjct: 395 KIDVLRSKKTSYTTLKKGLMGQLLTGQMRVKI 426 >UniRef50_Q73D72 Type I restriction-modification enzyme, S subunit, putative n=1 Tax=Bacillus cereus ATCC 10987 RepID=Q73D72_BACC1 Length = 476 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. Identities = 100/471 (21%), Positives = 185/471 (39%), Gaps = 41/471 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTDLVF 61 ++PE W+ + +I G T K + Y KD + I ++ Q+ Sbjct: 20 RVPENWIWTWTGAIAEVISGGTPK-SKVEEYYKDGTISWITPADLSGYQDMYISKGKRNI 78 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L K S K+ P + V+ S V +A + P + + Sbjct: 79 TELGLNKSSAKMLPINTVLLSSRAPIGYVAIAAK-----DLCTNQGFKSFAPSNAYYPKY 133 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K S Y + S+++G+ + I IP+PP+ EQK ++EK++ LL +V+ Sbjct: 134 LYWYLKFSKY--YMESMASGSTFKELSSNKSKEIPIPLPPINEQKRVSEKVERLLNKVEE 191 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN---------------------FEPQH 220 K E+ + + R A+L A +G LT KWR F P Sbjct: 192 AKTLIEEAKETFELRRAAILDKAFSGDLTGKWRKENSFQQNEECISDNELRDSEVFYPIP 251 Query: 221 SVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESE-- 277 +K + + T + G ++R+ ++ + + + ++ E Sbjct: 252 KTWKWTKLKDVATFKNGYAFKSKDFVEQGIQLIRMGNLYKNELRLDRNPVYIPLDFDEKI 311 Query: 278 
LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIF 337 + ++ ++ GD+L + + + + +NLL +++ + + EYI + Sbjct: 312 IEKYTVEKGDILLSLTGTKYKRDYGYAVRVDGRDKNLLLNQRILSLKP--HMMDEYIYYY 369 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 S RNA + Q + K ++S ++ +PP E EI +++ +L + Sbjct: 370 LQSSVFRNAFFSFETGGVNQGNVGSKAVESILIPIPPADEAKEIEKKLARLLNN-EKEAL 428 Query: 398 QVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 V ++ L QS L+KAFRGEL E + E L EKIK Sbjct: 429 VVLAIEEKLEVLKQSALSKAFRGELGTNDPTEE---NTIELLKEVLKEKIK 476 Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats. Identities = 37/238 (15%), Positives = 86/238 (36%), Gaps = 17/238 (7%) Query: 197 RQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRI 255 ++ + + + +++R P++ ++ + + S G + Sbjct: 3 KKKTVDELIVPEAEQRFR--VPENWIWTWTGAIAEVISGGTPKSKVEEYYKDGTISWITP 60 Query: 256 SSV---RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ 312 + + + ++ + E ++ + L +L + + +V + + Sbjct: 61 ADLSGYQDMYISKGKRNITELGLNKSSAKMLPINTVLLSS-RAPIGYVAIAA-------K 112 Query: 313 NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL 372 +L + P+Y+ + M + + S K +S K + L Sbjct: 113 DLCTNQGFKSFAPSNAYYPKYLYWYLKFS---KYYMESMASGSTFKELSSNKSKEIPIPL 169 Query: 373 PPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 PP+ EQ + +VE+L + + + A +IL KAF G+LT +WR EN Sbjct: 170 PPINEQKRVSEKVERLLNKVEEAKTLIEEAKETFELRRAAILDKAFSGDLTGKWRKEN 227 >UniRef50_C6IKX2 Type I restriction-modification system n=2 Tax=Bacteroidales RepID=C6IKX2_9BACE Length = 478 Score = 244 bits (622), Expect = 7e-63, Method: Composition-based stats. 
Identities = 90/420 (21%), Positives = 174/420 (41%), Gaps = 36/420 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P+ WV + V T G T + Y +P ++ ++ +G + Sbjct: 70 EVPDNWVWMTLGEVGTWQSGGTPSRSNKTYY--GGNIPWLKTGDLNDGLISDIPESITEE 127 Query: 65 NLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + S KI+P ++IAM + +G L F + C I ++ Sbjct: 128 AVANSSAKINPAGSVLIAMYGATIGKLGI-----LTFPATTNQACCACIEFNAITQLYLF 182 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F S RN + G NI IP+PPL+EQ+ I +++ A +D + Sbjct: 183 YFLLSQ--RNGFIAKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVE 240 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE-------------- 229 + +K+ + +L A++GKL + N EP + K++N + Sbjct: 241 QGKADLQNTIKQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPDFTPCDNGHSRKLPQ 300 Query: 230 -------SILTELRNGLSS-KPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNR 280 + + + G+S K + G +LR +++ G +D D F+ S + N Sbjct: 301 GWYSVTANDVCSIIGGVSYNKADIQDTGIRVLRGGNIQNGKVIDCFDDVFISLSY-QNND 359 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 +++Q GD++ GS +G G + + + L R + L YI + F + Sbjct: 360 NQVQRGDIIVVASTGSQTLIGKTGFADRDIPKTQIGA-FLRIVRPKQKTLSPYIRLIFQT 418 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + ++ + N K S + +++ + LPP++EQ IV+++E+LF+ D I + Sbjct: 419 DAYKDYIRNVAK-GSNINNVKNAHLQNFQICLPPLEEQQRIVQKIEELFSSLDDILTALE 477 Score = 126 bits (317), Expect = 2e-27, Method: Composition-based stats. 
Identities = 53/270 (19%), Positives = 89/270 (32%), Gaps = 18/270 (6%) Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 ++ KA E++ + K R + E P + V+ L Sbjct: 32 LLERIKAEKERLIKEGKIKRSKKSAKTSDTPHYENVPFEVPDNWVWMTLGEVGTWQSGGT 91 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 S G P L+ + G + E + + + G +L Y ++ Sbjct: 92 PSRSNKTYYGGNIPWLKTGDLNDGLISDIPESITEEAVANSSAKINPAGSVLIAMYGATI 151 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G+ +Q I Y+ F S RN + Q Sbjct: 152 GKLGILT-FPATTNQACC---ACIEFNAITQL---YLFYFLLSQ--RNGFI-AKGGGGAQ 201 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 IS + I + + LPP+ EQ IV +E+ FA D +E+ + + IL A Sbjct: 202 PNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVEQGKADLQNTIKQTKSKILDLA 261 Query: 418 FRGELTAQWRAENPDLISGENSAAALLEKI 447 G+L Q + P A LL++I Sbjct: 262 IHGKLVPQDPNDEP--------AIKLLKRI 283 Score = 60.5 bits (145), Expect = 1e-07, Method: Composition-based stats. Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 11/61 (18%) Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA---ASGGKKASRKK 463 L Q IL A G+L Q + P A+ LLE+IKAE+ G K S+K Sbjct: 4 KALRQKILDLAIHGKLVPQDPNDEP--------ASVLLERIKAEKERLIKEGKIKRSKKS 55 Query: 464 S 464 + Sbjct: 56 A 56 >UniRef50_Q8RJG0 HsdS n=12 Tax=Campylobacter jejuni RepID=Q8RJG0_CAMJE Length = 417 Score = 243 bits (621), Expect = 9e-63, Method: Composition-based stats. 
Identities = 97/420 (23%), Positives = 178/420 (42%), Gaps = 15/420 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + + ++ G T K Y KD P + ++ + G F + K Sbjct: 10 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKD--YPFFKPSDFEQGYFLENAGDNLSKL 67 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 +++++ P+ I++ +GK A + C + P K I S +I ++ Sbjct: 68 GFDKARQLPPKTILVVC----IGSLGKVALTRVIGSC--NQQINAIIPHKNIISEYIYYY 121 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 SS +++ + S + + F + I P + EQ+ I LD A++D + Sbjct: 122 CISSKFQSILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIK 181 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLSSKP 243 EQ L Q+ L A N N++ PQ +K L S L + S K Sbjct: 182 ILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEISNLIQNGFAAS-KN 240 Query: 244 NESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 NE G+ LR ++ G+++ + + ++ + + ++ D+LF N + E VG Sbjct: 241 NEIPSGYVHLRTHNISTDGNLNFDTLIKIKREFIKEKQSFIEKNDILFNNTNST-ELVGK 299 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 L+ Q+ N + + L + +L + + +F GQ GI+ Sbjct: 300 TALV--TQNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGINI 357 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +K + LPP+KEQ +I + ++ +F +++ L L QS+L KAF+GEL Sbjct: 358 DKLKKIQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGEL 417 Score = 124 bits (312), Expect = 6e-27, Method: Composition-based stats. 
Identities = 52/214 (24%), Positives = 90/214 (42%), Gaps = 13/214 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDL 59 KLP+GW + ++ LI G K I +R +NI +G + L Sbjct: 211 ENYKLPQGWEWKSLGEISNLIQNGFAASKNNEIP----SGYVHLRTHNISTDGNLNFDTL 266 Query: 60 VFVPKNLVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + + +KE Q I DI+ ++ S +VGK+A + +F ++ + Sbjct: 267 IKIKREFIKEKQSFIEKNDILFNNTN-STELVGKTALVTQNYNYAFSNHLTKIKLKNQYN 325 Query: 119 SG---FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 S F + Y KI G + I I IP+PPL EQ+ IA+ LD + Sbjct: 326 SKLVVFYFVLLLKNKYFEKICHQWIG--QSGINIDKLKKIQIPLPPLKEQEQIAKHLDFV 383 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + + K + + + + +Q++L A G+L Sbjct: 384 FEKTKALKELYTKELKDYEELKQSLLNKAFKGEL 417 >UniRef50_A3UV36 Type I restriction enzyme specificity protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UV36_VIBSP Length = 496 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. Identities = 136/492 (27%), Positives = 222/492 (45%), Gaps = 68/492 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP+GW+ + ++ K+ + K +Y+ I + + + + Sbjct: 1 MS--ELPKGWITIKIDSLCAK-----PKQLKPEASWKFNYID-ISSVDREKKLICEPSEI 52 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + ++ D++++M+ + + V K ++ S G VL+P LI S Sbjct: 53 LGSDAPSRARKIVNTGDVLVSMTRPNLNAVAKVPEKYNGQVASTG--FDVLKP-FLIESD 109 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ +S + + IS + GA K + +P+PPLAEQK I EKLD +LAQVD Sbjct: 110 WLFSVVRSQPFIDSISGTTIGALYPACKTSDIRDYEMPLPPLAEQKRIVEKLDEVLAQVD 169 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-------------------------- 214 + KAR + IP +LKRFRQ+VL AV+G LT++WR Sbjct: 170 TIKARLDGIPDLLKRFRQSVLASAVSGTLTKEWRLTNELTKAEEELKSNFLAKSGKLKLR 229 Query: 215 NFEPQHSVFKKLNFESILTE-------------LRNG----LSSKPNESGVGHPILRISS 257 + S + T + G + + G PI+ + Sbjct: 230 GKQTNFSELSLITLPDSWTWAQNYKLAKDESNAICAGPFGTIFKAKDFRDEGVPIIFLRH 289 Query: 258 
VRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLL 315 V+ +QN +++ E + + G+LL T+ G C + + ++ Sbjct: 290 VKEIGFNQNKPNYMDGDVWEELHQEYSVHGGELLVTKLGDPP---GECCIYPENMGTAMV 346 Query: 316 YPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPV 375 PD L L +Y+ +F+SP + ++ + + + I K + LP + Sbjct: 347 TPDVLKMNVDEDIVLRKYLRSYFNSPIS-TEIIEALAFGATRLRIDIAMFKGFPIPLPSM 405 Query: 376 KEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLIS 435 +EQ EIVR V+Q FA+ADTIE QV A A+V+NLTQSILAKAFRGEL +Q ++ P Sbjct: 406 EEQKEIVRLVDQYFAFADTIEAQVKKAQAKVDNLTQSILAKAFRGELVSQDPSDEP---- 461 Query: 436 GENSAAALLEKI 447 A LLE+I Sbjct: 462 ----ADKLLERI 469 >UniRef50_A6E2R5 Restriction endonuclease S subunits-like protein n=1 Tax=Roseovarius sp. TM1035 RepID=A6E2R5_9RHOB Length = 413 Score = 243 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 108/458 (23%), Positives = 183/458 (39%), Gaps = 49/458 (10%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P+GW + ++ + G K A ++ P + + Sbjct: 4 VPQGWAQSRLADWLDISTG----KLDANAATENGQYPFFTCAE---------QVSRIDTF 50 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + + G + + VL+P + I GF Sbjct: 51 AFDCEAVLLAGN-------------GNFNLHKYTGKFNAYQRTYVLQPHE-IDLGFTFVA 96 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 KS L +I+ + G+ I ++ P+PPL EQ+ I KLDTL A+ + + Sbjct: 97 LKSLL--PEITKDNRGSTIKYLRLGDIADTAAPLPPLPEQRRIVRKLDTLSARSTTARTH 154 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 I ++++R+R AVL A + + S Sbjct: 155 LTAIEKLVERYRTAVLEAAFRTAWDAGFDTTIAGCLEHAETGLVR---------SKAEQT 205 Query: 246 SGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 +G G+P +R++ AG + D+ ++ + SE R++L+ DLLF N S E VG Sbjct: 206 AGEGYPYIRMNHYDLAGRWNDRDLTYVAATSSEFERYQLRANDLLFNTRN-SAELVGKVA 264 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + + + L+ + L+R R + D LP + SSP R + + T+ I + Sbjct: 265 
IWPEGKD-GYLFNNNLLRMRFSADVLPGFAFWQMSSPPFRRYIEGFISATTSVAAIYQRS 323 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 + + +P EQ EIVRR+E FA D ++ + AL + +L Q ILAKAF G+L Sbjct: 324 LMAAPFWVPDTDEQREIVRRIETAFAKIDRLKAEAAKALKLLGHLDQRILAKAFAGDLVP 383 Query: 425 QWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 Q + P A LL +I+ RAA+ + R+ Sbjct: 384 QDPTDEP--------AETLLARIREARAATQTSRRRRR 413 >UniRef50_B5VW93 Restriction modification system DNA specificity domain n=1 Tax=Arthrospira maxima CS-328 RepID=B5VW93_SPIMA Length = 407 Score = 241 bits (615), Expect = 4e-62, Method: Composition-based stats. Identities = 95/429 (22%), Positives = 173/429 (40%), Gaps = 33/429 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ---NGKFDTTDLVFVPK 64 +GW I + + + K+ + D +P R I+ NGK +T+L Sbjct: 2 KGWDIVALEDLGKITSSKRIFKKDYV----DSGIPFYRTKEIKELANGKEVSTELFISRD 57 Query: 65 NLVKESQKI---SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + K S D++I VG+ LR K I F Sbjct: 58 SFNEIKAKFGTPSVGDLLITA----IGTVGEIYVVDRTDFYFKDGNVLWLRDFKAIEPNF 113 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + ++I+SLS G+ + I P ++EQK I LD +D+ Sbjct: 114 LKYALI--AFVDEINSLSHGSTYKALPIEKLKKHKIYKPSISEQKRIVAILDEAFEGIDA 171 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 A ++ + ++ L G + + + I ++ G SS Sbjct: 172 AIANTQKNLANARELFESYLNGIFT-----------RKGDGWVEKKLGEICHKVEYGSSS 220 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 K G P++R+ +++ +D D+ + + E+NR+ LQ D+LF R N S + VG Sbjct: 221 KSQPEGD-IPVIRMGNIQNNMIDWTDLVYTS-NPDEINRYLLQYNDVLFNRTN-SADHVG 277 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 + K + ++ LIR KD + P+++ + + R + + + Q I Sbjct: 278 KSAIYKG--EKPAIFAGYLIRVHYKKDVIDPDFLNFYLNCYKTREYGKSVMSRSVNQVNI 335 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +G +K+ + P + Q +I++++ LF +E L + L QSIL KAF G Sbjct: 336 NGTKLKNYPIYHPDLYTQKQIIKKLYFLFRETQRLETIYRRKLEALQELKQSILQKAFTG 
395 Query: 421 ELTAQWRAE 429 ELT + + Sbjct: 396 ELTNEKAKD 404 Score = 140 bits (352), Expect = 1e-31, Method: Composition-based stats. Identities = 49/214 (22%), Positives = 82/214 (38%), Gaps = 8/214 (3%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 +GWV + + + + K Q + +P+IR NIQN D TDLV+ Sbjct: 200 DGWVEKKLGEICHKVEYGSSSKSQP-----EGDIPVIRMGNIQNNMIDWTDLVYTSNPDE 254 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAHFT 126 + D++ ++ S VGKSA F + + + +I F+ + Sbjct: 255 INRYLLQYNDVLFNRTN-SADHVGKSAIYKGEKPAIFAGYLIRVHYKKDVIDPDFLNFYL 313 Query: 127 KSSLYRNKISS-LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 R S +S N NI I P L QK I +KL L + + Sbjct: 314 NCYKTREYGKSVMSRSVNQVNINGTKLKNYPIYHPDLYTQKQIIKKLYFLFRETQRLETI 373 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ 219 + + + L+ +Q++L A G+LT + Sbjct: 374 YRRKLEALQELKQSILQKAFTGELTNEKAKDVAA 407 >UniRef50_B0VPS8 Specificity determinant for hsdM and hsdR n=1 Tax=Acinetobacter baumannii SDF RepID=B0VPS8_ACIBS Length = 386 Score = 241 bits (614), Expect = 5e-62, Method: Composition-based stats. 
Identities = 118/403 (29%), Positives = 188/403 (46%), Gaps = 23/403 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M K P W IA + V LI G +K + D LP+IR N+ N + Sbjct: 2 MQVSKSPPSWCIASIGEVCNLINGRAFKSTE----WTDRGLPIIRIQNLNN---PDANFN 54 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 F +L + ++ D++ A S + G ++ + LI Sbjct: 55 FFNGDLDNK-HRVEKGDLLFAWSGTPGTSFGAH-IWDGDIGALNQHIFKIVFNDSLIDKR 112 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FI + +L +S G + ++ F+ I PPL EQKIIA+KLDTLLAQV Sbjct: 113 FIRYAINQTL-DELVSGARGGVGLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVA 171 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 +TK R E+I ILK FRQ++L AV+GKLTE+WR + + + K +I + +G Sbjct: 172 TTKVRLERILNILKTFRQSILSSAVSGKLTEEWRKNKKLNWI--KSTLANICRSVSDGDH 229 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDI-RFLECSESE--LNRHKLQDGDLLFTRYNGSL 297 P + G P L IS++ G +D + + R++ S E + K + D+L+T Sbjct: 230 QAPPRADFGIPFLVISNISKGEIDFSSVNRWVPESYYESLKDIRKPEINDILYTVTGS-- 287 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 G+ +K + + + +Y+ + +SP + T + Sbjct: 288 --FGIPVTVKSTTP--FCFQRHIAIIKPNHSSVDYKYLFYYLASPEVFKHATSIA-TGTA 342 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 QK +S +++ +LLPP++EQ EIV RVE+L A+AD IEK++ Sbjct: 343 QKTVSLSHLRNFNILLPPIEEQTEIVHRVEELLAFADGIEKKL 385 Score = 96.7 bits (239), Expect = 2e-18, Method: Composition-based stats. 
Identities = 43/199 (21%), Positives = 82/199 (41%), Gaps = 13/199 (6%) Query: 232 LTELRNGLSSKPNESGV-GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + L NG + K E G PI+RI ++ + N F N+H+++ GDLLF Sbjct: 19 VCNLINGRAFKSTEWTDRGLPIIRIQNLNNPDANFN---FFNGDLD--NKHRVEKGDLLF 73 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE-YIEIFFSSPSARNAMMN 349 G + L + + + + +I + + +++ Sbjct: 74 AWSGTPGTSFG--AHIWDGDIGAL--NQHIFKIVFNDSLIDKRFIRYAINQTL--DELVS 127 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + G K ++ ++ ++ PP+ EQ I +++ L A T + ++ L + Sbjct: 128 GARGGVGLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVATTKVRLERILNILKTF 187 Query: 410 TQSILAKAFRGELTAQWRA 428 QSIL+ A G+LT +WR Sbjct: 188 RQSILSSAVSGKLTEEWRK 206 >UniRef50_A6UXD7 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas aeruginosa PA7 RepID=A6UXD7_PSEA7 Length = 464 Score = 241 bits (614), Expect = 6e-62, Method: Composition-based stats. Identities = 83/455 (18%), Positives = 181/455 (39%), Gaps = 29/455 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAIN--YLKDDYLPLIRANNIQNGKFDTTDLVFV 62 ++P+ W P+ + L R + I + D + I N+ G + F+ Sbjct: 19 QVPDHWSSVPIKYMA-LERNSLFLDGDWIESKDISSDGIRYITTGNVGEGAYKEQGAGFI 77 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGK-SAHQHLPFECSFGAFCGVLRPEKLIFS 119 + ++ D++++ + + +G+ +L + RP+ + Sbjct: 78 SEETFHALRCTEVYEGDVLVSRLN---NPIGRACVVPNLGGRVVTSVDNVIFRPDLKFYK 134 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 FI + S Y S+L+ GA + I I + P L EQ IA LD A++ Sbjct: 135 KFIVYLFSSEEYFKHTSNLARGATMQRISRGLLGNIRVVTPSLEEQTQIARFLDHETARI 194 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL-NFE 229 D+ +++ ++LK RQAV+ AV L +W P H + + + Sbjct: 195 DALIEEQQRLIELLKEKRQAVISHAVTKGLDPTVPMKDSGVEWLGEVPAHWEVRSISSIS 254 Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDL 288 +T G + G L+ +++ + F+ E +E + L GD+ Sbjct: 255 KKITNGYVGPTRDILVDEPGVRYLQSLHIKSNKIKFEVPYFVSEQWSAEHAKSILASGDV 314 Query: 289 
LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 L + +G ++ + +H +I + + + L E++ +S ++++ Sbjct: 315 LIVQTG----DIGQVAVVTE-EHAGCNCHALIIVSPVREVVLGEWVSWVLNSTYGYHSLL 369 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + ++T + ++ ++K + +PP++EQA IV +E D++ + +L + Sbjct: 370 S-IQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIESGELEMDSLMSETKRSLLLLQE 428 Query: 409 LTQSILAKAFRGELTAQ-WRAENPDLISGENSAAA 442 ++++ A G++ + W+ P A A Sbjct: 429 RRTALISAAVTGKIDVRGWQP--PASTQAPEPAVA 461 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 41/210 (19%), Positives = 95/210 (45%), Gaps = 8/210 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + +S+++ I + I + + + +++ +I++ K FV Sbjct: 239 GEVPAHWEVRSISSISKKITNGYVGPTRDI-LVDEPGVRYLQSLHIKSNKIKFEVPYFVS 297 Query: 64 KNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSG 120 + E K ++ D++I + +G+ A C+ A V +++ Sbjct: 298 EQWSAEHAKSILASGDVLIVQT----GDIGQVAVVTEEHAGCNCHALIIVSPVREVVLGE 353 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +++ S+ + + S+ GA ++ + +N+PIPPL EQ I +++ ++D Sbjct: 354 WVSWVLNSTYGYHSLLSIQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIESGELEMD 413 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 S + ++ +L+ R A++ AV GK+ Sbjct: 414 SLMSETKRSLLLLQERRTALISAAVTGKID 443 >UniRef50_B5ECU4 Restriction modification system DNA specificity domain n=1 Tax=Geobacter bemidjiensis Bem RepID=B5ECU4_GEOBB Length = 395 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. 
Identities = 73/418 (17%), Positives = 155/418 (37%), Gaps = 30/418 (7%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLVFVPKNL 66 GWV + + + RG + + + D + I+ + + + TT+ P+ Sbjct: 4 GWVTKKLGEICDIERGGSPRPIDSFLTDAPDGINWIKIGDTKTISKYIFTTEQKIRPEG- 62 Query: 67 VKESQKISPEDIVIA--MSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 K S+ + D +++ MS G ++ + C + + E + ++ H Sbjct: 63 AKRSRMVFEGDFILSNSMSFGRPYIMKTTG-------CIHDGWLVLREKEPNVNQDYLYH 115 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S L + L+AG+ + N+ + +PIP ++EQ+ I LD ++ + KA Sbjct: 116 VLSSDLVYRQFDRLAAGSTVRNLNIGLVKGVEVPIPSISEQQRIVGILDEAFDRIATAKA 175 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 E+ Q + ++ L + K + + + + K Sbjct: 176 NAEKNLQNARALFESHLQSTFTQR---------CAGWTVKTIGDLAEHS--LGKMLDKAK 224 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G P LR +VR + +D+ + +E+ ++ GD+L G Sbjct: 225 NKGELQPYLRNINVRWFTFNLSDLLEMPFRTTEVGKYTAVKGDVLICEGGYP----GRAA 280 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + + + + L R R + ++ + + + + +G + +G+ Sbjct: 281 IW--TEDYPVYFQKALHRVRFHEPEHNKWFLYYLYAQDKSGELKKHF-SGTGIQHFTGEA 337 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + L P+ E V R E L +E L + L +S+L +AF G+L Sbjct: 338 LSRFKLPLAPLPELRRNVARFEVLLEETQRLESICQRKLTALEELKKSLLDRAFTGQL 395 >UniRef50_B7JRE7 Restriction modification system DNA specificity domain protein n=2 Tax=Bacillus cereus RepID=B7JRE7_BACC0 Length = 495 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. 
Identities = 98/492 (19%), Positives = 202/492 (41%), Gaps = 68/492 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P W+ +++++ LI ++ K++ P++ NI +G+ + +V + Sbjct: 25 EVPGNWIWGNLNSLSKLIVDGSHNPPPK----KNEGFPMLSGRNILDGEINFETDRYVSE 80 Query: 65 NLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSG 120 + ++ K I D+++ + +G++ F +++P ++ S Sbjct: 81 DDYQKEYKRTPIESNDVLLTI----VGTIGRTTVVPKEFSPFVLQRSVALIKP--MVNSN 134 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +++++ S ++ + + G + + IP+PPL EQK I EK++ LL +V+ Sbjct: 135 YLSYYFSSPYFQYYLQKNAKGTAQKGVYLKTLKSSRIPLPPLMEQKRITEKVEGLLGRVE 194 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF----EPQHSVFKKLNFESILTEL- 235 KA E+ + + R +L A G+L+ KWR E S+ +++ + + + Sbjct: 195 EAKALIEEAKKTFEVRRATILDKAFRGELSAKWREDNRIAEDASSLLERIQIQKRNSSIK 254 Query: 236 ------------------RNGLS-----------------SKPNESGVGHPILRISSVRA 260 NG + S G +R + Sbjct: 255 SNTLKITSVIKEEEPFELPNGWTWVRLGEISYYVTSGSRDWSKYYSDEGAMFIRTQDINK 314 Query: 261 GHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK 319 ++ +D+ ++ E E R ++ D+L T + VG C L++ + + Sbjct: 315 NSLNLSDVAYVSLPEKVEGKRSLVEKADILTTITGAN---VGKCALVETNIKEAYV-SQS 370 Query: 320 LIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQA 379 + +L + ++ +Y+ + SP + G+ +S +DIK+ + L P+ EQ Sbjct: 371 VALTKLIEKSISKYVHLSLLSPCGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPMAEQQ 430 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENS 439 IV+ VE L + + + + L QSIL KAFRGEL E S Sbjct: 431 VIVKLVETLLEN-EKESLNLASIEKHLETLKQSILNKAFRGELGTNDPNE--------ES 481 Query: 440 AAALLEKIKAER 451 + LL+K+ E+ Sbjct: 482 SMKLLKKVLQEK 493 Score = 157 bits (397), Expect = 8e-37, Method: Composition-based stats. 
Identities = 69/271 (25%), Positives = 121/271 (44%), Gaps = 18/271 (6%) Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPIL 253 K+ Q +L A+ TE+ P + ++ LN S L + +G + P + G P+L Sbjct: 5 KKTLQELLEDAL--IPTEEHPYEVPGNWIWGNLNSLSKL--IVDGSHNPPPKKNEGFPML 60 Query: 254 RISSVRAGHVDQNDIRFLECSE--SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 ++ G ++ R++ + E R ++ D+L T +G ++ K Sbjct: 61 SGRNILDGEINFETDRYVSEDDYQKEYKRTPIESNDVLLTIVG----TIGRTTVVPKEFS 116 Query: 312 QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 +L + + Y+ +FSSP + + K + QKG+ K +KS + Sbjct: 117 PFVLQRSVAL---IKPMVNSNYLSYYFSSPYFQYYLQKNAK-GTAQKGVYLKTLKSSRIP 172 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 LPP+ EQ I +VE L + + + A +IL KAFRGEL+A+WR +N Sbjct: 173 LPPLMEQKRITEKVEGLLGRVEEAKALIEEAKKTFEVRRATILDKAFRGELSAKWREDN- 231 Query: 432 DLISGENSAAALLEKIKAERAASGGKKASRK 462 A++LLE+I+ ++ S K + K Sbjct: 232 ---RIAEDASSLLERIQIQKRNSSIKSNTLK 259 >UniRef50_A3XPV6 Type I restriction-modification system specificity subunit n=1 Tax=Leeuwenhoekiella blandensis MED217 RepID=A3XPV6_9FLAO Length = 502 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. 
Identities = 109/512 (21%), Positives = 204/512 (39%), Gaps = 82/512 (16%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 E WV + ++ L G +K + + D +P+IR +IQ+ D + + N+ Sbjct: 3 EDWVECTLGSLLKLKNGYAFKSSK----YQKDGIPVIRIGDIQDWNVDIENAKRIDDNIE 58 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP--EKLIFSGFIAHF 125 +S ++ DI+IAMS + GK + + G L P E+L + +I + Sbjct: 59 YDSHIVNKGDILIAMSGATT---GKFGIYNSDKKAYQNQRVGNLIPHSEELTSNNYIYYL 115 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 S + I + G NI + + + PL Q+ I +K++ L + +DS A Sbjct: 116 LYS--LKRDIEQQAYGGAQPNISATKIEALKTKLFPLPIQQAIVKKIEELFSSLDSGIAD 173 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRN----------------------FEPQHSVF 223 ++ LK +RQAVL A GKLT++WR +E Q + + Sbjct: 174 LKKAQDQLKIYRQAVLKKAFEGKLTKEWREKQTELPTAEELLKEIKKERQKHYEQQLAKW 233 Query: 224 KKLNFESILTELRNG---------------------LSSKPNESGVGHPILRI------- 255 K+ + + + +G+ L+I Sbjct: 234 KEAVISWENNDKEGKKPGKPGKIKEFELNEIEELPIIPNTWAWEKLGNVCLKIMDGTHFS 293 Query: 256 -SSVRAGHV-------------DQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEF 299 ++ G D +I ++ + E R ++ GD+L+ + + Sbjct: 294 PKNIEKGDFKYITAKNIKEGRIDLRNISYVTQEDHEAIFGRCDVKKGDVLYIKDGAT--- 350 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 G + + +LL + R + P+++E F ++ RN M++ + Sbjct: 351 TGRAAVNTLEEEFSLLSSVGVFRT-IKSFINPKFLESFLNAQVTRNRMLSNIA-GVAITR 408 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ + + + L V+EQ +IV+ +E + D +E+ + ++L + L QSIL KAF Sbjct: 409 LTLVKLNNSMFSLCSVEEQHQIVQEIESRLSVCDAVEQNIQDSLEKAQALRQSILKKAFE 468 Query: 420 GELTAQWRAENPDLISGENSAAALLEKIKAER 451 G L + A+ LLE+IKAE+ Sbjct: 469 GTLLSDKEIAKCKAHPDYEPASVLLERIKAEK 500 Score = 134 bits (337), Expect = 7e-30, Method: Composition-based stats. 
Identities = 50/245 (20%), Positives = 101/245 (41%), Gaps = 11/245 (4%) Query: 219 QHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL 278 + + S+L G P++RI ++ +VD + + ++ + E Sbjct: 1 MREDWVECTLGSLLKLKNGYAFKSSKYQKDGIPVIRIGDIQDWNVDIENAKRIDDNI-EY 59 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 + H + GD+L + G+ + ++ + L I++ Sbjct: 60 DSHIVNKGDILIAMSGATTGKFGIY-----NSDKKAYQNQRVGNLIPHSEELTSNNYIYY 114 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 S + + Q IS I++ L P+ Q IV+++E+LF+ D+ Sbjct: 115 LLYSLKRDIEQQA-YGGAQPNISATKIEALKTKLFPLPIQQAIVKKIEELFSSLDSGIAD 173 Query: 399 VNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKK 458 + A ++ Q++L KAF G+LT +WR + +L + A LL++IK ER ++ Sbjct: 174 LKKAQDQLKIYRQAVLKKAFEGKLTKEWREKQTELPT----AEELLKEIKKERQKHYEQQ 229 Query: 459 ASRKK 463 ++ K Sbjct: 230 LAKWK 234 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 84/208 (40%), Gaps = 10/208 (4%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P W + V I T+ + I + I A NI+ G+ D ++ +V + Sbjct: 270 IPNTWAWEKLGNVCLKIMDGTHFSPKNI---EKGDFKYITAKNIKEGRIDLRNISYVTQE 326 Query: 66 LVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSGF 121 + + D++ + G++A L E S + GV R + I F Sbjct: 327 DHEAIFGRCDVKKGDVLYIKDGATT---GRAAVNTLEEEFSLLSSVGVFRTIKSFINPKF 383 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + F + + RN++ S AG I + + + + EQ I +++++ L+ D+ Sbjct: 384 LESFLNAQVTRNRMLSNIAGVAITRLTLVKLNNSMFSLCSVEEQHQIVQEIESRLSVCDA 443 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKL 209 + + + + RQ++L A G L Sbjct: 444 VEQNIQDSLEKAQALRQSILKKAFEGTL 471 >UniRef50_Q4HFD9 HsdS n=3 Tax=Campylobacterales RepID=Q4HFD9_CAMCO Length = 408 Score = 238 bits (607), Expect = 3e-61, Method: Composition-based stats. 
Identities = 94/425 (22%), Positives = 167/425 (39%), Gaps = 32/425 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 +GW + + I T+K I + +P + NI G FD +D+ ++ Sbjct: 6 QGWKWKSLGEIC-FITDGTHKTPNYI----ETGIPFLSVKNISKGFFDLSDVKYISLEEH 60 Query: 68 KE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + K EDI+I +GK+ L FE S G+L+P+ I S ++ + Sbjct: 61 NKLIKRAKPEFEDILICR----IGTLGKAIKISLEFEFSIFVSLGLLKPKVKIISDYLVY 116 Query: 125 FTKSSLYRNKIS--SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F S I+ + G + + + I +PPL EQ+ I LD A++D Sbjct: 117 FLNSCFIEEWINDNKVGGGTHTAKLNLNILEKCPIALPPLKEQERIVGILDENFAKIDEN 176 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLSS 241 EQ L Q+ L A N N++ PQ +K L + + Sbjct: 177 IKILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEIGEIITGTTPSKN 236 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLLFTRYNGSLE 298 PN G +P+ + S + + I++ + S+L N L +L S+ Sbjct: 237 NPNFYGNEYPLFKPSDLNGDII----IKYASDNLSKLGFDNARNLPKDTILVVCIGASIG 292 Query: 299 FVGVCGLLKKLQHQ-NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 VG+ G+ Q N + P+ +Y+ S + + T+ Sbjct: 293 KVGLSGVNGSCNQQINAIIPNSAF--------TSKYLFFVCLSNYFQTILKKNASQTT-L 343 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 I+ + + LPP+KEQ +I +++L ++ +++ + + L S+L KA Sbjct: 344 PIINKTEFSKLQIPLPPLKEQEQIASHLDELSSHVKNLKQNYQAQIKNLQELKNSLLDKA 403 Query: 418 FRGEL 422 F+G L Sbjct: 404 FKGNL 408 Score = 134 bits (336), Expect = 1e-29, Method: Composition-based stats. 
Identities = 43/208 (20%), Positives = 81/208 (38%), Gaps = 7/208 (3%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 KLP+GW + + +I G T K Y + PL + +++ Sbjct: 208 ENYKLPQGWEWKSLGEIGEIITGTTPSKNNPNFYGNE--YPLFKPSDLNGDIIIKYASDN 265 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + K ++ + + I++ S VG S S + P S + Sbjct: 266 LSKLGFDNARNLPKDTILVVCIGASIGKVGLSGV-----NGSCNQQINAIIPNSAFTSKY 320 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S+ ++ + ++ + I F + IP+PPL EQ+ IA LD L + V + Sbjct: 321 LFFVCLSNYFQTILKKNASQTTLPIINKTEFSKLQIPLPPLKEQEQIASHLDELSSHVKN 380 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKL 209 K ++ + L+ + ++L A G L Sbjct: 381 LKQNYQAQIKNLQELKNSLLDKAFKGNL 408 >UniRef50_C6J5M6 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5M6_9BACL Length = 403 Score = 238 bits (607), Expect = 4e-61, Method: Composition-based stats. Identities = 90/419 (21%), Positives = 175/419 (41%), Gaps = 32/419 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P GW + P+ L++G+TY +Y L ++R++NIQ+GK D V+V + Sbjct: 3 VPNGWAVKPLLECCDLLQGLTYSPSNIQSY----GLLVLRSSNIQDGKLVLDDCVYVNCS 58 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + E + + P DI+I + +GS +++GKS P+ +FGAF VLR + +G++AH Sbjct: 59 -IDEIKYVKPNDILICVRNGSSALIGKSCVIDRPYNATFGAFMSVLRGDT---TGYLAHM 114 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 S + + +I + S+ A IN I F+ I IPIP EQ+ IA L A + + + Sbjct: 115 FASDVVQQQIRNRSS-ATINQITKRDFEDIKIPIPFDEEEQRAIAAALSDADAYITALEK 173 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + + + Q +L G + ++ + KK++ + S P Sbjct: 174 LITKKRAVKQGAMQELLTG---KRRLPGFKGE----WIEKKIHEIGDTSSGGTPSRSVPT 226 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 P + S + ++ + + + + G +L Y ++ +G+ Sbjct: 227 YFNGNIPWVTTSELNDNYIRSTAEKITSEALNNSSAKLFPKGTVLMAMYGATIGKLGI-- 284 Query: 305 
LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + KD ++ R ++ + +GQ IS Sbjct: 285 -----LDVDATTNQACCALFFNKDIDSVFMYFLLL--YHRTEIIEL-GSGAGQPNISQMI 336 Query: 365 IKSQVVLLPP-VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 I++ +PP + EQ I + + A D + ++ A + Q ++++ G + Sbjct: 337 IRNLTFTIPPTLAEQTAIAAVLSDMDAEIDALTAKLEKA----RRIKQGMMSELLTGRI 391 >UniRef50_B8GGK0 Restriction modification system DNA specificity domain protein n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GGK0_METPE Length = 471 Score = 238 bits (606), Expect = 5e-61, Method: Composition-based stats. Identities = 122/463 (26%), Positives = 206/463 (44%), Gaps = 54/463 (11%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLV-F 61 ++PEGW + + + K D + + + G ++ + Sbjct: 18 EVPEGWKLVTILNACEVN----PPKPPRDFLPADAPVTFVPMPAVDADMGAITNPEIKPY 73 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGA-FCGVLRPEKLIFS 119 + S + D+++A + GK+A FG+ V+R I Sbjct: 74 LEVRNGFTSFR--DGDVIMAKITPCMEN-GKAAIVRGMKNGIGFGSTEFHVMRSRGEILP 130 Query: 120 GFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ ++ + +RN+ S G+ + IP+PPLAEQ+ I +++ LL+ Sbjct: 131 EYLFYYIRQKSFRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRIVARIEALLSH 190 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 VD+ R ++P I+KRFRQAVL A +G+LTE+WR + K L + L++G Sbjct: 191 VDAAGDRLSRVPLIMKRFRQAVLAAACSGRLTEEWREDKDNFEDPKLLLQDIQNYRLQHG 250 Query: 239 LSSKPNESGVG---------------------------------------HPILRISSVR 259 ++ +S V +P LR+++V Sbjct: 251 INKIKIDSKVNITENPIEIPNTWIWSTIEKIADISGGIQKQPMRAPQRNFYPYLRVANVL 310 Query: 260 AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK 319 G +D ++I+ +E EL R+ L+ D+L NGS +G + + +N ++ + Sbjct: 311 RGSLDLHEIKNMELFAGELERYHLELNDILIVEGNGSFSEIGRSAIWNG-EIENCVHQNH 369 Query: 320 LIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQA 379 +IR R+ K LP+Y+ ++++SP TTSG +S K I + LPP+ EQ Sbjct: 370 IIRVRVRK-FLPQYVNLYWNSPLGSELSSGAAVTTSGLYTLSTKKIAQLPIPLPPISEQH 428 Query: 380 
EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 EIVRRV LF AD IE++V A R LTQ+++ KAF G L Sbjct: 429 EIVRRVGLLFERADAIEREVVAAGRRCERLTQAVMIKAFSGRL 471 Score = 110 bits (276), Expect = 9e-23, Method: Composition-based stats. Identities = 45/176 (25%), Positives = 78/176 (44%), Gaps = 5/176 (2%) Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 +DGD++ + +E G +++ +++ + R + LPEY+ + S Sbjct: 83 FRDGDVIMAKITPCMEN-GKAAIVRGMKNGIGFGSTEFHVMRSRGEILPEYLFYYIRQKS 141 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 RN + + GQK + IK V+ LPP+ EQ IV R+E L ++ D +++ Sbjct: 142 FRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRIVARIEALLSHVDAAGDRLSRV 201 Query: 403 LARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKK 458 + Q++LA A G LT +WR + LL+ I+ R G K Sbjct: 202 PLIMKRFRQAVLAAACSGRLTEEWRED----KDNFEDPKLLLQDIQNYRLQHGINK 253 >UniRef50_A9I6S0 Type I restriction-modification system, S subunit n=3 Tax=Bacteria RepID=A9I6S0_BORPD Length = 797 Score = 237 bits (605), Expect = 5e-61, Method: Composition-based stats. 
Identities = 97/498 (19%), Positives = 188/498 (37%), Gaps = 95/498 (19%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW A + +T +IRG+T+ + + +R N+Q K + +DL+++ + Sbjct: 87 LPQGWEWARLGEITDIIRGITFPASEKTKEPASGRIACLRTANVQK-KIEWSDLLYIDRT 145 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPEKLIFSGFIA 123 + ++ ++ +D ++ + S+ +VGK A E +FG F GVLR K + ++ Sbjct: 146 FMSKNSQLVRQDDIVMSMANSRELVGKVAVVSEMPVNEATFGGFLGVLRTHK-VAPLYVL 204 Query: 124 HFTKSSLYRN-KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 H +S R+ I + S NI NI + +P+PP++EQ I K+D L+A+ D Sbjct: 205 HLLNTSYARSSLIDAASQTTNIANISLGKLNPFLVPVPPISEQHRIVAKIDELMARCDEL 264 Query: 183 KA-----------------------------------------RFEQIPQILKRFRQAVL 201 + I + R+A+L Sbjct: 265 EKLRTAQQGARLTVHAAAIKQLLNVAEPGQHQRAQTFLAEHFGELYTIKGNVAELRKAIL 324 Query: 202 GGAVNGKLTEKWRNFEPQHSV--------------------------------------F 223 AV GKL + N +P + + Sbjct: 325 QLAVMGKLVPQDPNDQPASELLKEIEAEKQRLVQEGKIKKTKPLPPVTEEEKPYALPQGW 384 Query: 224 KKLNFESILTELRNG----LSSKPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESEL 278 + + F + TE+ G + K + G P++ S + G + + + E +L Sbjct: 385 EWVRFGDLTTEISTGPFGSMIHKSDYIVDGVPLVNPSHMVDGKIFHDPSVTVSEIMAKKL 444 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 + H+L D++ R +G C ++ L + R +YI F Sbjct: 445 DSHRLNTNDIVMARRGE----MGRCAIVTAESDGFLCGTGSFV-LRFVDRIYRQYILTIF 499 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 + R + + ++ + V LPP EQ IV ++++L D +++Q Sbjct: 500 KTEITR-EFLGGNSVGTTMTNLNHGILNKMPVSLPPHPEQTRIVTKIDELMVMCDALDQQ 558 Query: 399 VNNALARVNNLTQSILAK 416 + ++ L +++ Sbjct: 559 IEATSSKRTELLNALIHA 576 Score = 146 bits (368), Expect = 2e-33, Method: Composition-based stats. 
Identities = 67/363 (18%), Positives = 134/363 (36%), Gaps = 64/363 (17%) Query: 149 PASFDLINIPIPPLAEQKIIAEK---LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV 205 P + I LA Q + + + +A +Q+ + + + + Sbjct: 20 PDGIKKLRELILTLAMQGKLVSQDPSEQPASDLLQEIEAEKQQLVKEGQIKK----PKPL 75 Query: 206 NGKLTEKWRNFEPQHSVFKKLN-FESILTELRNGLSSKPNESGVG-HPILRISSVRAGHV 263 E+ PQ + +L I+ + S K E G LR ++V+ + Sbjct: 76 PPVAEEEKPYALPQGWEWARLGEITDIIRGITFPASEKTKEPASGRIACLRTANVQK-KI 134 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 + +D+ +++ + N ++ D++ + N S E VG ++ ++ + L Sbjct: 135 EWSDLLYIDRTFMSKNSQLVRQDDIVMSMAN-SRELVGKVAVVSEMPVNEATFGGFLGVL 193 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R T P Y+ ++ AR+++++ T+ IS + +V +PP+ EQ IV Sbjct: 194 R-THKVAPLYVLHLLNTSYARSSLIDAASQTTNIANISLGKLNPFLVPVPPISEQHRIVA 252 Query: 384 RVEQLFAYADTIEK---------------------------QVNNALARV---------- 406 ++++L A D +EK Q A + Sbjct: 253 KIDELMARCDELEKLRTAQQGARLTVHAAAIKQLLNVAEPGQHQRAQTFLAEHFGELYTI 312 Query: 407 ----NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA---ASGGKKA 459 L ++IL A G+L Q + P A+ LL++I+AE+ G K Sbjct: 313 KGNVAELRKAILQLAVMGKLVPQDPNDQP--------ASELLKEIEAEKQRLVQEGKIKK 364 Query: 460 SRK 462 ++ Sbjct: 365 TKP 367 Score = 47.8 bits (112), Expect = 8e-04, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 11/74 (14%) Query: 392 ADTIEKQVNNALAR---VNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 A+ +EK + A A + L + IL A +G+L +Q +E P A+ LL++I+ Sbjct: 6 AELLEKHFDTAFAAPDGIKKLRELILTLAMQGKLVSQDPSEQP--------ASDLLQEIE 57 Query: 449 AERAASGGKKASRK 462 AE+ + +K Sbjct: 58 AEKQQLVKEGQIKK 71 >UniRef50_Q2P0A3 Specificity determinant for hsdM and hsdR n=2 Tax=Xanthomonas oryzae pv. oryzae RepID=Q2P0A3_XANOM Length = 450 Score = 237 bits (605), Expect = 6e-61, Method: Composition-based stats. 
Identities = 132/485 (27%), Positives = 222/485 (45%), Gaps = 58/485 (11%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLI--RANNIQNGKFDTT 57 MS +LP GW + V T +A + Y +P+ R I +GK Sbjct: 1 MS--ELPGGWSETEIGPV-NTYSSETLNPAKAPKQTFELYSVPVFAKRKPEIVDGK---- 53 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 ++ QK+ P+D+++ + + V ++ + + + + +P L Sbjct: 54 -------DIGSTKQKVEPDDVLLCKINPRINRVWLVGKKNDHEQIASSEWIVIRQP--LF 104 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 FI + S +R+++ + +G + +P + + I PLAEQK IA+KLD L Sbjct: 105 DPAFIRFQLQESSFRDRLCAEVSGVGGSLTRAQPKKVESYKLRIAPLAEQKRIAQKLDAL 164 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 LAQVD+ KAR + IP +LKRFR++V+ AV G+L+ R + ++L E+ Sbjct: 165 LAQVDTLKARIDAIPALLKRFRKSVVHSAVIGRLSADLRVPIEKSEEQEQLGPLESWREV 224 Query: 236 R--------NGLSSKPNE-----SGVGHPILRISSVRA--GHVDQNDIRFLECSESELNR 280 G S G +P ++ V G + + + + SE L + Sbjct: 225 TLASLGELSRGKSKHRPRNDSRLYGSEYPFIQTGDVANSGGALTSSKVFY---SEFGLKQ 281 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFS 339 +L L ++ + + + +PD ++ KD + ++I+ Sbjct: 282 SRLFPSGTLCITIAANIADTAMLAI-------DACFPDSVVGFIPNKDDCVAQFIKYVID 334 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 + + + QK I+ K + + +PP+KEQ EIVR VEQLFAYAD +E +V Sbjct: 335 DN---KESLEALAPATAQKNINLKVLNQVKLRIPPIKEQTEIVRHVEQLFAYADQLEAKV 391 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 A R++ LTQS+LAKAFRGEL Q ++ P A+ LL++I+A+RAA+ K Sbjct: 392 AAAQQRIDALTQSLLAKAFRGELVPQDPSDEP--------ASVLLDRIRAQRAATPKPKR 443 Query: 460 SRKKS 464 RK + Sbjct: 444 GRKAA 448 >UniRef50_UPI0001C15DDF Restriction modification system DNA specificity domain protein n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C15DDF Length = 445 Score = 237 bits (605), Expect = 6e-61, Method: Composition-based stats. 
Identities = 84/429 (19%), Positives = 171/429 (39%), Gaps = 27/429 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+PE W + VS I T +Y + +P + + ++ T Sbjct: 31 GKIPEHWEVRKVSHAFQKIGSGTTPSTNHYDYY-EGNIPWVNTSELREKVITDTSAKLTN 89 Query: 64 KNLVKES--QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 K L+ S P ++IAM + +G L C L I + F Sbjct: 90 KALLDHSVLNLYPPGTLLIAMYGATIGRLGI-----LGITACTNQACCALANPISINAKF 144 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 ++ + RN++ LS+G NI I IP PPL EQ+ IA+ LD A++D+ Sbjct: 145 AFYWL--WMRRNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQAIAQFLDRETAKIDT 202 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL-- 239 A+ E++ ++LK R A++ AV L + ++ + L++ Sbjct: 203 LVAKKERLIELLKEKRTALISHAVTKGLNPDAPMKDSGVEWLGEVPRNWPMIRLKHVAPV 262 Query: 240 -SSKPNESGVGHPILRISSV--RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 S+K + P + + + + G + + E + GD+LF + Sbjct: 263 SSAKLTQKPDNLPYIGLEHIESKTGRLLLD----TPVENVESTVSCFEKGDVLFGKLRPY 318 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 L V LL + + + +L+ + ++D +++ + + + N + Sbjct: 319 LAKV----LLAEFEG---VSTTELLALKPSQDVNGKFLFFQLIAEGFIDQV-NSFTYGTK 370 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 + + I + + LPP+ EQ I + +++ A DT+ + ++ ++ ++++ Sbjct: 371 MPRVGPEQITNLFIPLPPLPEQQAIAQFLDRETAKIDTLVAKTRTSIEKLKEYRTALISA 430 Query: 417 AFRGELTAQ 425 A G++ + Sbjct: 431 AVTGKIDVR 439 Score = 111 bits (277), Expect = 6e-23, Method: Composition-based stats. 
Identities = 34/236 (14%), Positives = 84/236 (35%), Gaps = 23/236 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSK---PNESGVGHPILRISSVRAGHVDQNDI 268 +W P+H +K++ ++ +G + + P + S +R + Sbjct: 28 EWLGKIPEHWEVRKVSHA--FQKIGSGTTPSTNHYDYYEGNIPWVNTSELREKVITDTSA 85 Query: 269 RFLECSESELNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 + + + + L G LL Y ++ +G+ G+ + + Sbjct: 86 KLTNKALLDHSVLNLYPPGTLLIAMYGATIGRLGILGITACTNQACCALANPI------- 138 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 ++ + R + + + GQ I+ + I+S + PP+ EQ I + +++ Sbjct: 139 SINAKFAFYWL---WMRRNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQAIAQFLDR 195 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A DT+ + + + ++++ A + NPD ++ L Sbjct: 196 ETAKIDTLVAKKERLIELLKEKRTALISHAVT-------KGLNPDAPMKDSGVEWL 244 >UniRef50_A1UJN5 Restriction endonuclease S subunits-like protein n=2 Tax=Mycobacterium RepID=A1UJN5_MYCSK Length = 419 Score = 237 bits (604), Expect = 7e-61, Method: Composition-based stats. Identities = 87/449 (19%), Positives = 177/449 (39%), Gaps = 40/449 (8%) Query: 9 GWV-IAPVSTVTTLIRGVTYKKEQAINYLK---DDYLPLIRANNIQNGKF-DTTDLV-FV 62 W ++ + G + + + L + ++ G+F D +D Sbjct: 2 SWAQEVTLAELAE---GGLFSDGDWVESKDQDASGDVRLTQLADVGVGEFRDRSDRWMRR 58 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLR-PEKLIFSG 120 + + +D++IA +G+S +LR + Sbjct: 59 DQAHRLRCTFLEGDDVLIARM---PDPIGRSCLVPSSVGSAVTVVDVAILRLARRDANPR 115 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ S + +K+ +L +G I + + IP+P L EQ I + L+ L+++D Sbjct: 116 YVMWALNSPRFHSKVVALQSGTTRKRISRKNLASLTIPLPTLDEQNRIVDLLEDHLSRLD 175 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + ++ Q A L + + + L E + Sbjct: 176 AAESSLRLAMQKADAMTTASLDRQTTAG---------SRAWRDTTIGAMAELVEYGSSAK 226 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + P+LR+ +++ G ++ +++L +E + LQ GDL+F R N S E V Sbjct: 227 CAGQAADSDVPVLRMGNIQNGKINWTGLKYLPAGHAEFPKLLLQSGDLVFNRTN-SAELV 285 Query: 
301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + + + + LIR R ++ P + + +SP+ R + + GQ + Sbjct: 286 GKSAVFEDTRAAS--FASYLIRVRFGQEVNPAWANMVINSPAGRRYVKSVASQQVGQANV 343 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +G +K+ + LPP+ EQ VR +++ + + Q+ + + R L +++LA AF G Sbjct: 344 NGTKLKAFPLPLPPLDEQCRRVRAHDEVVVSRERLHHQIADLVVRAAGLRRALLAAAFTG 403 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKA 449 LT NSA LLE++++ Sbjct: 404 RLT--------------NSAEGLLEELES 418 >UniRef50_C5TIE5 Restriction modification system DNA specificity domain protein n=1 Tax=Zymomonas mobilis subsp. mobilis ATCC 10988 RepID=C5TIE5_ZYMMO Length = 419 Score = 236 bits (603), Expect = 1e-60, Method: Composition-based stats. Identities = 110/467 (23%), Positives = 203/467 (43%), Gaps = 54/467 (11%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVT-YKKEQAINYLKDDYLPLIRAN--NIQNGKFDTT 57 MS LP+GW+ + +T G + K + + + P A+ ++ F+ Sbjct: 1 MSN--LPQGWIQTTFADITNQRSGNSKLVKGKLESQESNGLYPAFSASGPDVWRDAFEY- 57 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 D +I + G + GK+ + P +++ Sbjct: 58 -----------------EGDAIIVSAVG--ARCGKAFRAKGQWSAIANTHIVWPEP-QVV 97 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + F+ + K G+ +K + I +PPL EQ+ I K+D+L Sbjct: 98 ETEFLFLLLNDENFWEK-----GGSAQPFVKVRATFERTINLPPLPEQRRIVAKIDSLTG 152 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + + + IP+++++++QA+L A W ++ + +++ E R Sbjct: 153 KSRRARDHLDHIPRLVEKYKQAILSAAFRA----DWPLISVGETIRAVVAGKNLRCEER- 207 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 P E G ++++S+V G D + L S + +++ GDLL +R N +L Sbjct: 208 ----PPFEHESG--VVKVSAVSWGTFDARASKTLPESFTPPENTRIKAGDLLISRAN-TL 260 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS-G 356 E VG ++ + NL DK++R + +D ++ F SP R A+ Sbjct: 261 ELVGAVVIVLEC-PSNLFLSDKVLRLDV-EDGDKPWLMWFLRSPDGRAAIEGAATGNQLS 318 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 
+ +S +KS + P +++ EIV R+E FA+ + + +A +++L QS+LAK Sbjct: 319 MRNLSQAALKSISMPWPAAEQREEIVSRIESAFAWIECLAADAASARKLIDHLDQSMLAK 378 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 AF+GEL Q A+ P A+ALL++I+AERAA+ K R+K Sbjct: 379 AFKGELVPQDPADEP--------ASALLDRIRAERAAAPKAKRGRRK 417 >UniRef50_UPI0001B4DA32 restriction endonuclease S subunits-like protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B4DA32 Length = 416 Score = 236 bits (602), Expect = 1e-60, Method: Composition-based stats. Identities = 86/380 (22%), Positives = 155/380 (40%), Gaps = 45/380 (11%) Query: 119 SGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+A+ R ++ I I +P+P LAEQ+ I L+ ++ Sbjct: 2 PEFVAYAFSWEGTRARVREYVKTTAGQAGISGGELKKIELPVPSLAEQRRIVAALEEQIS 61 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV--------------- 222 +++S + + ++R+ A G E + Sbjct: 62 KIESGERGLTNAARRSGQYRRLAADLATKGGFAEPLTGDGTGPELFESIRSARASRVKTR 121 Query: 223 -----------------FKKLNFESILTELRNGLSSKPNESGV--GHPILRISSVRAGHV 263 + ++ + I + G S+K +ES G P+LR+ +++ G V Sbjct: 122 RLKPATLSGPVPKVPAHWTVVSLDEITELIEYGSSTKTSESAEVGGVPVLRMGNIKDGKV 181 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 D ++++ + R++LQ+GDLLF R N S E VG + + + + + LIR Sbjct: 182 DPRVLKYISADHPDAVRYRLQEGDLLFNRTN-SFELVGKSAVYRD-KFGPMAFASYLIRC 239 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R +++ + +S R + + GQ ++G + + + LPP EQ I+ Sbjct: 240 RFLPGVDTDWVNLVINSSIGRRYVRSVATQQVGQANVNGTKLAAMPIPLPPEGEQRRILD 299 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 VE A A +E + A+ L +++L +AF G L Q A+ P A L Sbjct: 300 VVETHQAAALRLESGIRQQGAKATRLRRALLTQAFAGRLVTQDPADEP--------AEIL 351 Query: 444 LEKIKAERAASGGKKASRKK 463 L +I+AER A+G K R+ Sbjct: 352 LARIRAEREAAGVTKTRRRS 371 Score = 147 bits (371), Expect = 8e-34, Method: Composition-based stats. 
Identities = 50/227 (22%), Positives = 99/227 (43%), Gaps = 8/227 (3%) Query: 5 KLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 K+P W + + +T LI G + K ++ +P++R NI++GK D L ++ Sbjct: 134 KVPAHWTVVSLDEITELIEYGSSTKTSESAEV---GGVPVLRMGNIKDGKVDPRVLKYIS 190 Query: 64 KNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 + ++ D++ ++ S +VGKSA F +F ++ R + + + Sbjct: 191 ADHPDAVRYRLQEGDLLFNRTN-SFELVGKSAVYRDKFGPMAFASYLIRCRFLPGVDTDW 249 Query: 122 IAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + SS+ R + S++ N+ + IP+PP EQ+ I + ++T A Sbjct: 250 VNLVINSSIGRRYVRSVATQQVGQANVNGTKLAAMPIPLPPEGEQRRILDVVETHQAAAL 309 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN 227 ++ Q R R+A+L A G+L + EP + ++ Sbjct: 310 RLESGIRQQGAKATRLRRALLTQAFAGRLVTQDPADEPAEILLARIR 356 Score = 80.5 bits (197), Expect = 1e-13, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 66/135 (48%), Gaps = 8/135 (5%) Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +PE++ FS R + VKTT+GQ GISG ++K + +P + EQ IV +E+ Sbjct: 1 MPEFVAYAFSWEGTRARVREYVKTTAGQAGISGGELKKIELPVPSLAEQRRIVAALEEQI 60 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKA 449 + ++ E+ + NA R + LA +L + P ++G+ + L E I++ Sbjct: 61 SKIESGERGLTNAARRSGQYRR--LAA----DLATKGGFAEP--LTGDGTGPELFESIRS 112 Query: 450 ERAASGGKKASRKKS 464 RA+ + + + Sbjct: 113 ARASRVKTRRLKPAT 127 >UniRef50_Q1R1F8 Restriction modification system DNA specificity protein n=1 Tax=Chromohalobacter salexigens DSM 3043 RepID=Q1R1F8_CHRSD Length = 538 Score = 235 bits (600), Expect = 2e-60, Method: Composition-based stats. 
Identities = 108/526 (20%), Positives = 202/526 (38%), Gaps = 91/526 (17%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W+ + + + KK + L ++P+ ++G++ T D +++ K Sbjct: 25 HWLWIEHNQIAEIN----PKKPKLDEELSVSFIPMGAVAE-ESGRYTTDDSKKF-EDVKK 78 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGA-FCGVLRPEKLIFSGFIAHFT 126 S DI+ A + GK A +L FG+ V R + + F +F Sbjct: 79 GYTYFSDGDILFAKITPCMEN-GKVALLSNLTNGVGFGSTEFHVSRLTEAVEKKFYFYFF 137 Query: 127 KSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST--- 182 S +R + + AG+ + F +++P+ P EQ+ I K++ L +++DS Sbjct: 138 VSKSFRKQAQANMAGSAGQLRVTTDYFSNVSVPLCPTREQQRIVTKIEELFSEIDSGVES 197 Query: 183 ---------------------------------------KARFEQIPQILKRFRQAVLGG 203 +A E+I + Q L Sbjct: 198 LKTAQAKLKTARQSLLKAAFEGKLTEQWRKDNADRQESPEALLERIQAEREAHYQQQLTD 257 Query: 204 -------------------------AVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 A+ ++ +K +N ++E+ G Sbjct: 258 WQHQLKDWEAAGKEGKKPRKPKVPKALPPLTQQELAELPELPEGWKWINL-GNISEISGG 316 Query: 239 LSS--KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 ++ K +P LR+++V A ++ +DI F+ + E R KL+ DLL NGS Sbjct: 317 ITKNQKRQSLPQKNPFLRVANVYANKLELDDIHFIGTTPDEAKRAKLKKDDLLIVEGNGS 376 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + +G ++ + + LIR+RL +++ F S + R A+ +TSG Sbjct: 377 PDQIGRVAKWDG-SIEHCTHQNHLIRSRLASPISADFVLHFLLSATGRKAIKKVASSTSG 435 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 +S ++ + + EQ IV ++E + D +E+ + ++ + L QSIL + Sbjct: 436 LYTLSLAKVEKLCIPVCSKNEQMMIVDQLESRLSQLDQLERTLTASMKQAEALKQSILKR 495 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 AF G L Q + P A+ LL +I+AER + +A RK Sbjct: 496 AFAGRLVPQDPDDEP--------ASELLARIRAERESQP--RAPRK 531 Score = 151 bits (381), Expect = 5e-35, Method: Composition-based stats. 
Identities = 64/259 (24%), Positives = 115/259 (44%), Gaps = 13/259 (5%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSV--RAGHVD 264 + +K N + H ++ + N + + + + + + +V +G Sbjct: 12 HSMEKKLANIKTPHWLWIEHNQIAEINP-----KKPKLDEELSVSFIPMGAVAEESGRYT 66 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 +D + E + DGD+LF + +E G LL L + + +R Sbjct: 67 TDDSKKFEDVKKGYTYFS--DGDILFAKITPCMEN-GKVALLSNLTNGVGFGSTEFHVSR 123 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 LT+ ++ FF S S R + ++GQ ++ + V L P +EQ IV + Sbjct: 124 LTEAVEKKFYFYFFVSKSFRKQAQANMAGSAGQLRVTTDYFSNVSVPLCPTREQQRIVTK 183 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 +E+LF+ D+ + + A A++ QS+L AF G+LT QWR +N D S ALL Sbjct: 184 IEELFSEIDSGVESLKTAQAKLKTARQSLLKAAFEGKLTEQWRKDNADRQ---ESPEALL 240 Query: 445 EKIKAERAASGGKKASRKK 463 E+I+AER A ++ + + Sbjct: 241 ERIQAEREAHYQQQLTDWQ 259 Score = 147 bits (372), Expect = 7e-34, Method: Composition-based stats. Identities = 49/226 (21%), Positives = 104/226 (46%), Gaps = 6/226 (2%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + ++ + G+T +++ K+ P +R N+ K + D+ F+ Sbjct: 297 ELPEGWKWINLGNISEISGGITKNQKRQSLPQKN---PFLRVANVYANKLELDDIHFIGT 353 Query: 65 NLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGFI 122 + + K+ +D++I +GS +G+ A E C+ R I + F+ Sbjct: 354 TPDEAKRAKLKKDDLLIVEGNGSPDQIGRVAKWDGSIEHCTHQNHLIRSRLASPISADFV 413 Query: 123 AHFTKSSLYRNKISSL-SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 HF S+ R I + S+ + + + A + + IP+ EQ +I ++L++ L+Q+D Sbjct: 414 LHFLLSATGRKAIKKVASSTSGLYTLSLAKVEKLCIPVCSKNEQMMIVDQLESRLSQLDQ 473 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN 227 + + + +Q++L A G+L + + EP + ++ Sbjct: 474 LERTLTASMKQAEALKQSILKRAFAGRLVPQDPDDEPASELLARIR 519 >UniRef50_A7VYZ3 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VYZ3_9CLOT Length = 444 Score = 235 bits (599), Expect = 3e-60, Method: Composition-based stats. 
Identities = 106/449 (23%), Positives = 191/449 (42%), Gaps = 41/449 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLI-RANNIQNGKFDTTDLVFVP 63 K+P+ W S + LI G K + +P I A+N++N VF Sbjct: 29 KVPKNWCWVRFSKIINLISGRDAKLTDCNSL--GIGIPYILGASNLENN-------VFTI 79 Query: 64 KNLVKESQKIS-PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + ++ Q IS D+++++ K +GK Q + + +R +F F Sbjct: 80 ERWIENPQVISLKNDVLLSV----KGTIGKVYLQK-EEKVNISRQIMAIRTSSTLFPRFT 134 Query: 123 AHFTK--SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 S +R + L I I +P PPL EQ+ I +++++L A++D Sbjct: 135 YWLVNNISDSFRQAGNGL-----IPGISREDILQKEVPFPPLPEQQRIVDRIESLFAKLD 189 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 K + ++ + + A+L A G+LT +WR + + + ++R+G Sbjct: 190 EAKQKTQEALNSYETRKAAILHKAFTGELTARWRKEHGLGMESWEKYKFNDILDVRDGTH 249 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLE 298 P G P++ +++ G + D++F+ + + R K+ GD+LF Sbjct: 250 DSPTYFDQGFPLITSKNLKDGKITDKDLKFISKEDYDKINERSKVDIGDILFAMIGTIGN 309 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 V + + Q + + L + A P +++ F S + M K S QK Sbjct: 310 PV-----VVETQPKFAIKNVALF--KNIGKASPYFVKYFLESKKVIDRMEKDAK-GSTQK 361 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +S +++ +LLP KEQ EIVR ++ L A ++ L +++ + +SILA+AF Sbjct: 362 FVSLGYLRAFNILLPKSKEQTEIVRILDDLLAKEQQAKEAAEAVLDQIDLMKKSILARAF 421 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKI 447 RGEL AE SA L++ I Sbjct: 422 RGELGTNNPAEE--------SAVELVKNI 442 >UniRef50_A4T8B4 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4T8B4_MYCGI Length = 442 Score = 234 bits (597), Expect = 5e-60, Method: Composition-based stats. 
Identities = 90/460 (19%), Positives = 197/460 (42%), Gaps = 39/460 (8%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W + ++ V + G+ ++Q +D+ P +R N+ ++ + + Sbjct: 2 SWPLVALADVAEIQGGI---QKQPKRTARDNAFPFLRVANVTARGLALDEVHTIELFDGE 58 Query: 69 -ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAHFT 126 E ++ D+++ +GS S +G++A + +RP I F+ H Sbjct: 59 LERYRLLRGDLLVVEGNGSASQIGRAAVWDGSITDAVHQNHLIRVRPGFQIDPRFLGHLW 118 Query: 127 KSSLYRNKISSLSAGANINN-IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 S L R+++S +++ + + + I +P+P L EQ+ I + L+ L+++D+ ++ Sbjct: 119 NSPLIRDELSRVASSTSGLHTLSVTKLKRITLPLPSLTEQRRIVDLLEDHLSRLDAGRSE 178 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLT--------------EKWRNFEPQHSVFKKLNFESI 231 E+ L R+ + A+ G + + P + +L + Sbjct: 179 VERAAAKLAILRERTVIQALTGGAEANREDARLTDVSTADGDLSALPIGWSWSRLGDVAD 238 Query: 232 LTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + S K ++ V P LR+++V+ G ++ +++ + +S+ + +L+ GD+L Sbjct: 239 VVGGVTKDSKKQSDPNYVEVPYLRVANVQRGRLNLDEVTKIRVPQSKADALRLRPGDVLL 298 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMN 349 G + + G + + Q + ++ + + RAR+T P ++ ++ R A N Sbjct: 299 NE-GGDRDKLAR-GWVWEGQVPDCIHQNHVFRARITDPRIDPYFLSWTANTIGGRWAERN 356 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K + IS I+ V++PP E I + + D +EK + + + R L Sbjct: 357 G-KQSVNLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDRLEKSIRDGMDRALVL 415 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKA 449 +S+L AF G LT +S+ LLE++++ Sbjct: 416 KKSLLTAAFSGRLT--------------SSSNELLEELES 441 Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats. 
Identities = 39/209 (18%), Positives = 84/209 (40%), Gaps = 7/209 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL--VFVP 63 LP GW + + V ++ GVT K + + +P +R N+Q G+ + ++ + VP Sbjct: 224 LPIGWSWSRLGDVADVVGGVT-KDSKKQSDPNYVEVPYLRVANVQRGRLNLDEVTKIRVP 282 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSV-VGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGF 121 ++ + ++ P D+++ + G +P +C R + I F Sbjct: 283 QSKAD-ALRLRPGDVLLNEGGDRDKLARGWVWEGQVP-DCIHQNHVFRARITDPRIDPYF 340 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 ++ + R + N+ +I + + + +PP E IA +L + D Sbjct: 341 LSWTANTIGGRWAERNGKQSVNLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDR 400 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + ++++L A +G+LT Sbjct: 401 LEKSIRDGMDRALVLKKSLLTAAFSGRLT 429 >UniRef50_A6EUA9 Type I restriction-modification system, S subunit n=1 Tax=unidentified eubacterium SCB49 RepID=A6EUA9_9BACT Length = 438 Score = 234 bits (596), Expect = 6e-60, Method: Composition-based stats. Identities = 80/431 (18%), Positives = 159/431 (36%), Gaps = 22/431 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + ++ + G T K + Y D +P + + + G Sbjct: 15 IGEIPEHWSSVSLKWISKIYSGGTPSKNK-PEYWSDGTIPWLNSGTVNQGDITEPSEYIT 73 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + L S K PE ++ +G G A FE + G++ P + ++ Sbjct: 74 EEALANSSAKWIPEKAILIALAGQGKTKGMVAQTQ--FEATCNQSLGIIVPSYPELNRYL 131 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + + + I +L G + I I P+P EQ I LD ++D Sbjct: 132 LFWLRKNY--QNIRNLGGGDKRDGINLEMIGSIPTPLPTKKEQTAITNYLDKKTTEIDQL 189 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + E++ Q+ + + A++ AV + +W P+ L Sbjct: 190 ISEKEELVQLYQEEKTALINQAVTKGIKPDAKLKNSGIEWLGEIPEDW---NSLRLKYLG 246 Query: 234 ELRNGLSSKPNE-SGVGHPILRISSVRAGHVDQNDIRFLECSESELNR-HKLQDGDLLFT 291 NG S K + G +L+IS+++ +D +D F++ + ++ DL+F Sbjct: 247 NFINGYSFKSTDFKSSGVRVLKISNIQHMAIDWSDESFIDEEFYDTKSGFRVLQNDLVFA 306 Query: 292 
RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + L+ + L + + R + + ++I S + Sbjct: 307 LTRPIISTGIKVALMNFDEKILLNQRNSIFRPKTK---MTKWIYFILLSSRFVQEFDKRI 363 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 T Q IS DI + +P +EQ +IV +E+ A DT + + + Sbjct: 364 DKTGQQPNISSNDIGEISIPVPTKEEQTKIVEHIEKETAKIDTKIAKAEKYINLLTEYRT 423 Query: 412 SILAKAFRGEL 422 S++++ G++ Sbjct: 424 SLISEVVTGKI 434 >UniRef50_C3NN82 Restriction modification system DNA specificity domain protein n=1 Tax=Sulfolobus islandicus Y.N.15.51 RepID=C3NN82_SULIN Length = 576 Score = 234 bits (596), Expect = 6e-60, Method: Composition-based stats. Identities = 93/439 (21%), Positives = 179/439 (40%), Gaps = 21/439 (4%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTD 58 + G+ P+ W + + V + + Y + +P + +I T+ Sbjct: 9 IDIGEFPKDWDVRKLKDVIIKAKSGGTPRRNVPEYW-NGNIPFAKIQDITKSGKYLYNTE 67 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 K L + I P+D ++ GS V A +P + A G++ + +I Sbjct: 68 EFITEKGLENSNAWIVPKDSLLLTIYGSLGFV---AINKIPV-ATNQAIIGIIPNKNIID 123 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F+ ++ ++ S N+ ++PI PL EQK I E L Sbjct: 124 TEFLYYW--YLYFKPYWSKFIKKGTQPNLTLEIVLNSSVPILPLEEQKKIVELLQKATDI 181 Query: 179 VDSTKARFEQI----PQILKRFRQAVLGGAVNGK-LTEKWRNFEPQHSVFKKLNFESILT 233 + K QI I K R+ +L + + E P+ ++LN +I Sbjct: 182 YYTLKDYIIQIRNSTETITKVIRKELLTKGIGHRDYVETDIGEFPKDWEVRRLNEIAI-- 239 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRA--GHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 +R+G S + + LR ++ + + I ++ S + R+ L+ D++ Sbjct: 240 -IRSGFSERKRDENSKVIHLRPDNIDNETDRIVFHRIVYIPESPK-IERYLLRHLDIVLV 297 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNC 350 NGS++ +G G++ +Q + + + L R+ +KD P YI S + Sbjct: 298 NTNGSIDHIGKLGIIDMPLNQKITFSNHLTAIRIVSKDVEPYYIYYLLSWYHLNGSFKKV 357 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 VK +G+ ++ I++ ++ LPP++EQ +IV ++++ + N N L Sbjct: 358 
VKNQAGKWNLNLDTIRNLLIPLPPLEEQKKIVELLQKVDELIIRFNDFLQNLEDEANTLY 417 Query: 411 QSILAKAFRGELTAQWRAE 429 +SIL A G+LT WR + Sbjct: 418 KSILRLALTGKLTEDWRRQ 436 >UniRef50_Q1VAF2 Hypothetical type I restriction-modification system specificity determinant n=1 Tax=Vibrio alginolyticus 12G01 RepID=Q1VAF2_VIBAL Length = 464 Score = 234 bits (596), Expect = 7e-60, Method: Composition-based stats. Identities = 94/454 (20%), Positives = 187/454 (41%), Gaps = 33/454 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVP 63 +P+ W + + ++Y +A IR ++ +G +P Sbjct: 26 DIPKDWCTRRLKHMLE--SPMSYGANEAAERAVSTEPRYIRITDMNSDGTLKEDTFRSLP 83 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEK-LIFSGF 121 K++ + DI++A S + VGKS F +C F + + + + S + Sbjct: 84 KDIA-SDYLLKDRDILLARSGAT---VGKSFIYRKEFGDCCFAGYLIKVSCDSARLNSDY 139 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLLAQVD 180 F +SS Y IS A I N+ + + I +P + EQ IA LD A++D Sbjct: 140 AFWFFQSSSYWQYISGSQIQATIQNVSAEKYGEMYISLPEHVEEQTQIANFLDHETAKID 199 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESI 231 + + +Q+ ++LK RQAV+ AV L + W P+H +++ + I Sbjct: 200 TLIEKQQQLIKLLKEKRQAVISHAVTKGLNPQAPMKNSGVEWLGEVPEHW--EQIKLKHI 257 Query: 232 LTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDL 288 ++ + G + + R ++VR G + + ++ + E R + + GD+ Sbjct: 258 THQIVDAEHKTAPYFDDGEYLVCRTTNVRDGKLRLDGGKYTNHAIYEEWTKRGQPEVGDI 317 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAM 347 LFTR + E G + L +++ +L + LPE++ S A + Sbjct: 318 LFTREAPAGEACVYTGEVP------LCLGQRMVLFKLNQTRVLPEFVLHSIYSGLA-DDF 370 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + + S + DI++ + PP EQA+IV + ++ A D + + + + Sbjct: 371 VKQLSQGSTVAHFNMSDIQNIPLFEPPKDEQAQIVDHLAKVLAKYDALTSSASLKIELMQ 430 Query: 408 NLTQSILAKAFRGELTAQ-WRAENPDLISGENSA 440 ++++ A G++ + W+A + E +A Sbjct: 431 ERRTALISAAVTGKIDVRNWQAPISQEQALEQTA 464 Score = 117 bits (293), Expect = 8e-25, Method: Composition-based stats. 
Identities = 42/211 (19%), Positives = 83/211 (39%), Gaps = 11/211 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + +T I +K Y D + R N+++GK + Sbjct: 243 GEVPEHWEQIKLKHITHQIVDAEHK---TAPYFDDGEYLVCRTTNVRDGKLRLDGGKYTN 299 Query: 64 KNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFS 119 + +E K DI+ ++ G++ G + + + + Sbjct: 300 HAIYEEWTKRGQPEVGDILFTR----EAPAGEACVYTGEVPLCLGQRMVLFKLNQTRVLP 355 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ H S L + + LS G+ + + + I + PP EQ I + L +LA+ Sbjct: 356 EFVLHSIYSGLADDFVKQLSQGSTVAHFNMSDIQNIPLFEPPKDEQAQIVDHLAKVLAKY 415 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 D+ + ++++ R A++ AV GK+ Sbjct: 416 DALTSSASLKIELMQERRTALISAAVTGKID 446 >UniRef50_B8E4I3 Restriction modification system DNA specificity domain protein n=1 Tax=Shewanella baltica OS223 RepID=B8E4I3_SHEB2 Length = 642 Score = 233 bits (595), Expect = 8e-60, Method: Composition-based stats. Identities = 115/468 (24%), Positives = 204/468 (43%), Gaps = 36/468 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVP 63 KLPEGWV + + I + Q D P IR +NI +GK + V Sbjct: 2 KLPEGWVETTIGNI---IDDMQPGFSQKPGKEDGDTTPQIRTHNISPDGKLTLEGIKHVT 58 Query: 64 KNLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGF 121 + + + ++ D+V ++ S+ VGK+A E F LR KLI F Sbjct: 59 ASNKESERYSLTKGDVVFNNTN-SEEWVGKTAVFDQEGEFVFSNHITRLRANSKLITPDF 117 Query: 122 IAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +A + + + + + I+ ++ L IP+P L EQ+ I + L Sbjct: 118 LAAYLQFLWSMGFSKTRAKRWVSQAGIEGSTLALFRIPLPSLPEQERIVDVL-------- 169 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHS---VFKKLNFESILTELRN 237 Q I+ + +Q++ N T W +F ++ + + I+ + + Sbjct: 170 -------QQVGIVAKAKQSIDDHIDNLVRTAYWEHFSEWYTADGLRDPVRISDIVADSQY 222 Query: 238 GLSSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 G+S +E+G ILR++S+ +G ++ D+++ SE ++ L +GDLLF R N S Sbjct: 223 GVSEAMSETGKQA-ILRMNSITTSGWLNLADLKYATLSEKDIKATTLLNGDLLFNRTN-S 280 Query: 
297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 E VG C + + + Y ++R R+ + LPEYI +S + +MN K Sbjct: 281 KELVGKCAIWRGAKEP-FSYASYIVRFRMKEGILPEYIWATLNSSYGKYRLMNSAKQAVS 339 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 +S D+ V LPP+ Q + +L + +T+ +++ N + + L + + Sbjct: 340 MANVSPTDLGRITVPLPPLALQEKFA----KLINHIETLRQEMLNKQDQYSELQTLVTQQ 395 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA--SGGKKASRK 462 A GE TAQWR EN + + A +L + + + + KK +K Sbjct: 396 ALLGEHTAQWRDENREKVLEAAKARDILLREQGVKITKFALEKKHPKK 443 >UniRef50_C3RBV6 Type I restriction-modification system n=3 Tax=Bacteroides RepID=C3RBV6_9BACE Length = 423 Score = 233 bits (594), Expect = 1e-59, Method: Composition-based stats. Identities = 87/430 (20%), Positives = 157/430 (36%), Gaps = 35/430 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W +S V +I T +Y + L ++ ++ NG T Sbjct: 16 IGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDLNNGLITETSKKIT 75 Query: 63 PKNLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 PK + + K P +VIAM + VG L E + C ++ P K I + Sbjct: 76 PKAVDECKMKFYPIHSVVIAMYGATIGKVG-----LLDIETATNQACCIIVPSKRICPKY 130 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + ++ S G NI + +P+PPL+EQ+ IA LD ++D Sbjct: 131 TFYSFI--IAKEELLLSSFGGGQPNISQDIIRKLKVPVPPLSEQQSIASYLDVKTEKIDK 188 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESIL 232 A+ E+ + L +Q+++ AV L W P H L F L Sbjct: 189 MIAKAEKKIEYLGELKQSLITRAVTRGLNPNTPLKDSGVNWIGNIPMHWDIACLRFFLRL 248 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 R S + +LR+ + ND + E E +++ DLL+ Sbjct: 249 INGRA-YSQNELLPSGKYKVLRVGN-----FFTNDSWYYSNMELEPDKY-CDKDDLLYAW 301 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 ++ +Y + + +L Y + N M+ + Sbjct: 302 SASVGPYI--------WNEAKTIYHYHIWKVQLATSMDKMYSYYLLR--AVTNQKMSDM- 350 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 S I+ D+ + +PP+ 
EQ +I ++ + D I +A + L QS Sbjct: 351 HGSTMMHITMGDMNKTKIPIPPLSEQQQIATYLDTKCSKIDHIIATQKKKIAYLQELKQS 410 Query: 413 ILAKAFRGEL 422 ++ G++ Sbjct: 411 LITNVVTGKI 420 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 79/234 (33%), Gaps = 19/234 (8%) Query: 212 KWRNFEPQHSV-FKKLNFESILTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIR 269 KW P H K I+ LSS+ + G L+ + G + + + Sbjct: 14 KWIGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDLNNGLITETSKK 73 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + E ++ Y + +G GLL N +K Sbjct: 74 ITPKAVDECKMKFYPIHSVVIAMYGAT---IGKVGLLDIETATN----QACCIIVPSKRI 126 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 P+Y F+S A+ ++ GQ IS I+ V +PP+ EQ I ++ Sbjct: 127 CPKY--TFYSFIIAKEELL-LSSFGGGQPNISQDIIRKLKVPVPPLSEQQSIASYLDVKT 183 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + + L QS++ +A R NP+ ++ + Sbjct: 184 EKIDKMIAKAEKKIEYLGELKQSLITRAVT-------RGLNPNTPLKDSGVNWI 230 >UniRef50_Q4FUM9 Possible type I restriction-modification system, S subunit n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FUM9_PSYA2 Length = 457 Score = 231 bits (590), Expect = 3e-59, Method: Composition-based stats. 
Identities = 92/446 (20%), Positives = 179/446 (40%), Gaps = 30/446 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF-- 61 GK+P W ++ + + + RG+T K + D +P + + + D Sbjct: 23 GKIPSHWELSKLRYMFSFGRGLTITKADLL----DTGVPCVNYGEVHSKYGFEVDPKRHY 78 Query: 62 ---VPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 V + ++ S ++ D+V A +S G G + RP Sbjct: 79 LKCVDEGYLQSSPYALLTQGDLVFADTSEDIEGSGNFTQLVSDDLIFAGYHTVIARPFDR 138 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 S F A+ S R ++ + G + +I + + I +P L E++ IA LD Sbjct: 139 QCSRFYAYLMDSKEIRTQVRHMVKGVKVFSITQSILKGVRIWLPSLDERETIANFLDFET 198 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 AQ+D+ + + + Q+LK RQAV+ AV L +W P+H KL Sbjct: 199 AQIDTLIDKQKTLIQLLKEKRQAVISHAVTKGLNPDAPLKDSGVEWLGEVPEHWGVSKLK 258 Query: 228 FESILTELRNGLSSKPNESGVGHP-ILRISSVR-AGHVDQNDIRFLECSESELNRHKLQD 285 + I L+ G + + P +RI+ V G++ + R L +E + L D Sbjct: 259 YL-ISEPLQYGANEAAEDVDKTQPRFVRITDVLPNGNLKDDTFRSLPQEIAE--PYMLMD 315 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 GD+L R G+ VG + + + + LI+A++ ++ P + Sbjct: 316 GDVLLARSGGT---VGKSFIYRDSWGK-CCFAGYLIKAKIDEEITPAEWFYLNTLTDFYW 371 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + ++ + + +S S V+ +PP++E +I+ + DT+ + A+ Sbjct: 372 KWIESIQIQATIQNVSADKYNSFVIAVPPLEESYKIISYINYNLEVFDTLVMKAEQAIQL 431 Query: 406 VNNLTQSILAKAFRGELTAQ-WRAEN 430 + ++++ A G++ + W A Sbjct: 432 MQERRTALISAAVTGKIDVRGWVAPE 457 >UniRef50_B2V7V7 Restriction modification system DNA specificity domain n=3 Tax=Sulfurihydrogenibium RepID=B2V7V7_SULSY Length = 435 Score = 231 bits (588), Expect = 5e-59, Method: Composition-based stats. 
Identities = 90/440 (20%), Positives = 186/440 (42%), Gaps = 31/440 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYL--PLIRANNIQNGKFDTTDL 59 G +PE W +A + V + +G ++ +D + P +R +N+ K D ++L Sbjct: 11 EIGLIPEDWEVARLGEVFEVKQGKQLSAKEN----RDGKVLKPFLRTSNVLWNKIDLSEL 66 Query: 60 VFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRP-EKL 116 ++P + + ++ K+ DI++ VG++A E S+ LR + Sbjct: 67 SYMPFSESEFKNLKLKKGDILVC----EGGDVGRTAVWDGQIDEISYQNHLHRLRSVKDN 122 Query: 117 IFSGFIAHFTKSSL-YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I + F A++ + ++ +N + I N+ + IP+PPL EQ+ IA+ L T+ Sbjct: 123 INNYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQRAIADILSTV 182 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVL-------GGAVNGKLTEKWRNFEPQHSVFKKLNF 228 ++ T+ Q+ K + + KL E P+H +L Sbjct: 183 QNAIEKTEKVINATKQLKKSMMKHLFTYGAVAVDEIDRIKLKESEIGLIPEHWEVVRL-- 240 Query: 229 ESILTELRNGLSSKPNESGV---GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 + +L G+S + E G GH I+ I +++ G++D N + ++Q Sbjct: 241 -GEVVDLDRGISWRKFEEGSKDNGHLIISIPNIKDGYIDFNSKYNHYLIKHIPKNKQIQL 299 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 D+LF +GS+E VG ++ L + + + + RAR+ + + F ++ N Sbjct: 300 NDILFVGSSGSIENVGRNVFIENLSFEGIGFASFVFRARVKVNTVIPKFLYFMANSHWFN 359 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 +++ G+ + K+ + LPP+ EQ +I + D + Sbjct: 360 YKDYVRRSSDGKYNFQLTEFKTIKIPLPPLDEQQKIAN----ILTTIDQKIQAEEKKKVA 415 Query: 406 VNNLTQSILAKAFRGELTAQ 425 + +L +++L + G++ + Sbjct: 416 LRSLFKTLLHQLMTGKIRVR 435 >UniRef50_B0TZ98 Type I restriction-modification system, subunit S n=1 Tax=Francisella philomiragia subsp. philomiragia ATCC 25017 RepID=B0TZ98_FRAP2 Length = 407 Score = 231 bits (588), Expect = 5e-59, Method: Composition-based stats. 
Identities = 86/424 (20%), Positives = 162/424 (38%), Gaps = 28/424 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 KLP GW + G K + + +P++ A N+ D + L F+ + Sbjct: 6 KLPAGWEWKKLGEECLFENGDRGKNYPSKSAFVSKGIPVVSATNLTGWSIDRSKLNFITE 65 Query: 65 NLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 KI DI+ + +GK A + ++R + + + F+ Sbjct: 66 ERYNLIGGGKIKKNDILFCLR----GSLGKCALVTDIERGVIASSLVIIRTCENLSNIFL 121 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++ S L ++ I+ + GA N+ + L NIP+PPLAEQK I KLD+L ++D Sbjct: 122 MYYLNSHLIQDFINKYNNGAAQPNLSAKNLSLFNIPLPPLAEQKRIVAKLDSLFEKIDKA 181 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 +Q + L + ++ E ++S+ + + + K Sbjct: 182 IELHQQNITNANTLMASTLD--------KTFKKLEGEYSLIPLHKITTAVGGGTPKRNIK 233 Query: 243 PNESGVGHPILRISSVRA----GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 L + + A ++ ++ + E S+ + L G +L++ S Sbjct: 234 EYWGNGEIVWLSPTDLGAIGEILNIRESRDKITELGLSKSSARLLPVGTVLYS----SRA 289 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 +G + + N + + + D Y S + + + ++ K Sbjct: 290 TIGKIAINEIEVCTNQGFTNFIC------DKDKIYNYFLAYSLAKYTEEITSLSNSTTFK 343 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +S IK + LPP+ Q + V ++ + D I++ L + L SIL KAF Sbjct: 344 EVSKTSIKKFEIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAF 403 Query: 419 RGEL 422 RGEL Sbjct: 404 RGEL 407 >UniRef50_A4FXL8 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus maripaludis C5 RepID=A4FXL8_METM5 Length = 447 Score = 231 bits (588), Expect = 5e-59, Method: Composition-based stats. 
Identities = 86/440 (19%), Positives = 184/440 (41%), Gaps = 30/440 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDT----T 57 G +P W + + + L G++ K + + ++ + + +I + FD Sbjct: 13 IGDIPADWGVKKLKYILGLNTGLSITKAELV----ENGVDCVNYGDIHSKYTFDIVSSRD 68 Query: 58 DLVFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKS-AHQHLPFECSFGAFCGVL-RP 113 +L VP + S S D + +S G + + F +L RP Sbjct: 69 NLPKVPVEFIDTNPSAIASEGDFIFCDTSEDIEGSGNCLFIRESNNKPIFAGSHTILGRP 128 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 + S ++ + KS +++I G + +I I++ +PP+ EQ+ IA+ LD Sbjct: 129 LINVNSTYLGYLLKSPDIKSQIQKRVVGIKVYSITQKILKSISLILPPVDEQQEIAQYLD 188 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFK 224 + Q+DS + + K ++Q+++ V L +W P+H Sbjct: 189 DKVGQIDSIIEKTKSSIDEYKSYKQSIITETVTKGLDPTVTMKDSGIEWIGDIPEHWDII 248 Query: 225 KLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKL 283 K+ + L +NG+S + G G+P + V + + ++ +E +E + + + + Sbjct: 249 KIRYLGTL---QNGISKSSSYFGSGYPFVSYGDVYKNYELPKSVEGLVESNEFDKSNYSV 305 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL--TKDALPEYIEIFFSSP 341 + GD+ FTR + +++ +G + + ++ LIR R +K P Y + +F S Sbjct: 306 EYGDVFFTRTSETIDEIGFTATCMHTMN-DAVFAGFLIRFRPFDSKLLNPLYSKYYFRSD 364 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 R + + + + +S + +K VL+PP EQ I + +E+ D + + Sbjct: 365 MHRRFFVKEMNLVT-RASLSQELLKKLPVLVPPHNEQIAIGKFIEETCQTIDQLITKKQQ 423 Query: 402 ALARVNNLTQSILAKAFRGE 421 + + +S++ + G+ Sbjct: 424 LITELKAYKKSLIYEVVTGK 443 >UniRef50_A7N438 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N438_VIBHB Length = 432 Score = 230 bits (587), Expect = 7e-59, Method: Composition-based stats. 
Identities = 101/433 (23%), Positives = 194/433 (44%), Gaps = 35/433 (8%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKI 73 + + + RGV+YK E + D + +R+NNIQ+G + ++ VP +LV +SQ + Sbjct: 11 RLGELASGNRGVSYKPENLKAAIDDKSVVFLRSNNIQSGTLNFENVQIVPDSLVSDSQIL 70 Query: 74 SPEDIVIAMSSGSKSVVGKSAH--QHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 DI + MS+GS+ +VGKS + + + GAFC V R + S ++ + +S Y Sbjct: 71 KKGDIAVCMSNGSRQLVGKSGMLQHEVEYPLTVGAFCSVFRCQNEDDSEYVRYLFQSQAY 130 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 ++ I AG+ INN+K + + I +P P A +K IAE L T+ Q+D+T+A ++ Sbjct: 131 QHGIDVTLAGSAINNLKNSDVEAIEVPTAPKALRKKIAEILSTIDNQIDATQALIDK--- 187 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK--------- 242 +Q ++ + + + + P +L ++ L L G K Sbjct: 188 -YTAIKQGMMADLFSRGIDPETKALRPTLEEAPELYHKTPLGMLPKGWDVKTLGDISEKI 246 Query: 243 --------PNESGVGHPILRISSVRAGHVD--QNDIRFLEC-SESELNRHKLQDGDLLFT 291 S G +RIS++ HV+ + ++ + SE R +LQ GD+L + Sbjct: 247 TSGSRDWAKFYSPEGDLFVRISNLTREHVNFRWDSVKHVNIGGGSEGERTQLQPGDILVS 306 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 +G+ G++ + + + + T +I + SS + Sbjct: 307 IT----ADLGIVGVVPENMGRAYINQHTALIRLSTYGENARFIGNYLSSRCGQEQFEKNN 362 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + + + GI+ I S +P KEQ I +++ L ++++ + +L +L Q Sbjct: 363 DSGA-KAGINLPTIASLRCPIPEEKEQLLIASKIDALDEVIADLKREKSKSL----SLKQ 417 Query: 412 SILAKAFRGELTA 424 ++ G+++ Sbjct: 418 GLMQDLLTGKVSV 430 Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats. 
Identities = 38/208 (18%), Positives = 78/208 (37%), Gaps = 11/208 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ----NGKFDTTDL 59 G LP+GW + + ++ I T + + +R +N+ N ++D+ Sbjct: 228 GMLPKGWDVKTLGDISEKI---TSGSRDWAKFYSPEGDLFVRISNLTREHVNFRWDSVKH 284 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IF 118 V + E ++ P DI+++++ +VG ++R Sbjct: 285 VNIGGGSEGERTQLQPGDILVSIT-ADLGIVGVVPEN--MGRAYINQHTALIRLSTYGEN 341 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + FI ++ S + + + I + + PIP EQ +IA K+D L Sbjct: 342 ARFIGNYLSSRCGQEQFEKNNDSGAKAGINLPTIASLRCPIPEEKEQLLIASKIDALDEV 401 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVN 206 + K + + + Q +L G V+ Sbjct: 402 IADLKREKSKSLSLKQGLMQDLLTGKVS 429 >UniRef50_B0JHV8 Restriction modification system DNA specificity domain n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JHV8_MICAN Length = 395 Score = 229 bits (585), Expect = 1e-58, Method: Composition-based stats. Identities = 95/420 (22%), Positives = 165/420 (39%), Gaps = 31/420 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG-KFDTTDLVFVPKNL 66 + W + + + RG + + Q + D + + + + K+ T + K Sbjct: 2 KDWPSVALGDIFEIARGGSPRPIQNFLTEEPDGVNWVMIGDASDSSKYITHTKKRILKTG 61 Query: 67 VKESQKISPEDIVI--AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 VK S+ + P D ++ +MS G ++ S C + + + +I + H Sbjct: 62 VKNSRMVYPGDFLLTNSMSFGHPYIMKTSG-------CIHDGWLVLSNKKGVIDQDYFYH 114 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S L + S L++G+ + N+ I + +PPL EQ+ IA LD K Sbjct: 115 LLGSDLIYAEFSRLASGSTVKNLNIEIVKGIKVSLPPLEEQRRIAAILDKADGVRRKRKE 174 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 ++LK + G V ++ I T +NG+ Sbjct: 175 AIRLTEELLKSTFLEMFGDPVTNPKG------------WEVKRLGEICTNFQNGIGKNSE 222 Query: 245 ESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G G + IS + H L+ + E+ ++ L GDLLF R + E V VC Sbjct: 223 HYGHGSKVANISDLYEWHRFIPEKYSLLDVTPKEIEKYSLMRGDLLFVRSSVKREGVAVC 282 Query: 304 
GLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 + + L+ +IR R D PE++ + +P RN ++ TS IS Sbjct: 283 SVYDSDEI--CLFSSFMIRVRPRTDLINPEFLSLMLRTPPMRNRLI-LGSNTSTITNISQ 339 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + V++PP+K Q I ++ + + AL + NL S+L +AFRGEL Sbjct: 340 PGLSKIEVVVPPIKTQNLIT----KVTKNIEESVRCHLQALEQSENLFNSLLQRAFRGEL 395 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 37/207 (17%), Positives = 73/207 (35%), Gaps = 13/207 (6%) Query: 7 PEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG-KFDTTDLVFVPK 64 P+GW + + + G+ E + +++ +F + Sbjct: 198 PKGWEVKRLGEICTNFQNGIGKNSEHY-----GHGSKVANISDLYEWHRFIPEKYSLLDV 252 Query: 65 NLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 + E + D++ SS + V + C F +F +RP LI F+ Sbjct: 253 TPKEIEKYSLMRGDLLFVRSSVKREGVAVCSVYDSDEICLFSSFMIRVRPRTDLINPEFL 312 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + ++ RN++ S + I NI I + +PP+ Q +I + + V Sbjct: 313 SLMLRTPPMRNRLILGSNTSTITNISQPGLSKIEVVVPPIKTQNLITKVTKNIEESVRCH 372 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 EQ + ++L A G+L Sbjct: 373 LQALEQ----SENLFNSLLQRAFRGEL 395 >UniRef50_C7QRY1 Restriction modification system DNA specificity domain protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QRY1_CYAP0 Length = 456 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. 
Identities = 87/442 (19%), Positives = 167/442 (37%), Gaps = 27/442 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P+ W + + ++ I + N + + F D ++ Sbjct: 24 GDIPDSWEVKRLRYLSKKITAGPFGSNLTKNIYTSTGYKIYGQEQVIASDFSIGDY-YIS 82 Query: 64 KNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KLIFS 119 K KI+ DI+I+ GK A E L P + I S Sbjct: 83 KEKYDQMSQYKINSGDILISC----VGTFGKVAVVPKNIEQGIINPRLIKLIPITEYINS 138 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ KS + ++ LS G + I I +PIPPL EQ+ IA+ LD A++ Sbjct: 139 VYLEKLLKSVVAFEQMEKLSRGGTMGVINIGLLSDILLPIPPLPEQEKIAQFLDKETAKI 198 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D E++ ++LK R A++ AV L +W F P+H K+L + Sbjct: 199 DKLITLKERLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFIPEHWEVKRLKYIV 258 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLL 289 + ++ G P LR ++ +G +D +++ F+ +EL + K+ GDL+ Sbjct: 259 PNITVGIVVTPAKYYVESGIPCLRSVNISSGKIDNSNLVFISSQSNELHQKSKIYKGDLV 318 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 R + G ++ +IR + ++ + S + +N Sbjct: 319 LVRTGVT----GTAAIVTDNFDGANCVDLLIIR---NSRLILTLYLYYYLNSSTTSYQVN 371 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + Q + + ++ PP +EQ +I +++ D I + ++ + Sbjct: 372 NYSVGAIQAHYNTSTLSELIITFPPPQEQQKIAEYLDRKTEQIDQIINKTRESIEYLKEY 431 Query: 410 TQSILAKAFRGELTA-QWRAEN 430 +++ A G++ QW E Sbjct: 432 RTVLISAAVTGKIDVRQWGGEE 453 >UniRef50_A3PKU6 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A3PKU6_RHOS1 Length = 456 Score = 229 bits (583), Expect = 2e-58, Method: Composition-based stats. 
Identities = 85/444 (19%), Positives = 167/444 (37%), Gaps = 37/444 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PEGW + + + ++ + + +P++ +NI +G+ D + V Sbjct: 16 GEVPEGWEVKCLRMIADELQTGPFGSQLHTEDYVTAGVPIVNPSNILDGQIVPDDEIGVD 75 Query: 64 KNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQ---HLPFECSFGAFCGVLRPEKLIF 118 + + + P DI++ G + +G+ A +P C G+ L+ + + Sbjct: 76 EATALRLANHALLPGDIIL----GRRGELGRCAVVPDGTMPLLCGTGSLRIRLKSSQAL- 130 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 FIA ++ R +S S G+ ++N+ A I I +P L EQ+ I L+ A+ Sbjct: 131 PDFIAECIRTPRVREWLSLQSVGSTMDNLNTAIVGKIQIALPSLPEQRAITAFLNRETAK 190 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFE 229 +D+ ++ +L RQAVL AV L W P+ + Sbjct: 191 IDALVEEQRRLIALLAEKRQAVLNHAVTRGLNPDALLKPSGIDWLGDIPEGWEVVPIRKV 250 Query: 230 SILTELRNGLSSKPNES-GVGHPILRISSV------RAGHVDQNDIRFLECSESELNRHK 282 + L S+P P ++ + R +V + E + Sbjct: 251 ARLESGHTPSRSRPEWWVDCHIPWFSLADIWQVRPGRVEYVYETAEAVSELGLQNSSARL 310 Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 L G ++ +R VG ++ + + + RL LP+Y+ Sbjct: 311 LPAGTVMLSRT----ASVGFSAVMGIAMATTQDFANWVCGCRL----LPDYLLYCLR--- 359 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + +K S I DI++ + LPP++EQ IV V D + A Sbjct: 360 GMPSEFERLKMGSTHNTIYMPDIRTLTIPLPPLEEQKAIVDHVRASVGALDELMDTATTA 419 Query: 403 LARVNNLTQSILAKAFRGELTAQW 426 + + ++++ A G++ + Sbjct: 420 ITLLQERRAALISAAVTGKIDVRD 443 >UniRef50_A4FZ34 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus maripaludis C5 RepID=A4FZ34_METM5 Length = 402 Score = 228 bits (582), Expect = 2e-58, Method: Composition-based stats. 
Identities = 88/423 (20%), Positives = 168/423 (39%), Gaps = 33/423 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + + + G T + + Y + +P ++ +++ T + Sbjct: 5 LPDGWEVKKLGDIGNISAGGTPSRSK-PEYWNNGSIPWVKIADMKEKHVKNTSEFITEEG 63 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAH 124 L K S KI + ++ S VG L + S + K + ++ + Sbjct: 64 LNKSSAKIFKKGTILISIFASLGTVGI-----LDIDASTNQAIAGINVNSKKVIPEYLYY 118 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + KS +N G NNI + I +PPL Q+ I E L +++ Sbjct: 119 YLKS--LKNYFMGAGRGVAQNNINLSILKDTEIFVPPLETQQKIVEIL----EKIEYGIN 172 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 E+ + +AV ++ + + + ++ +G S + Sbjct: 173 LREKAILETENLVKAVFLDMFGDPVSN--------PMGWDVKKIGTFVNDIISGWSVGGD 224 Query: 245 ESG---VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 E +L+ISSV +G ++ + + ++ H L+ GDLLF+R N + E V Sbjct: 225 ERPKKADELAVLKISSVTSGKFKSSEHKVVNSEITKKLVHPLK-GDLLFSRAN-TRELVA 282 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPE-YIEIFFSSPSARNAMMNCVKTTSG-QKG 359 ++ + +L PDKL + L K+ + Y P+ R + TSG Sbjct: 283 AVCIVDN-DYMDLFLPDKLWKIILNKNIVSSYYFRQVLQDPTYRANLTKKATGTSGSMLN 341 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 IS + +PP+ Q + + +E+L + I+++ N+ + +L L KAF+ Sbjct: 342 ISKSKLIENEFPIPPIGLQNKFAKIIEKL----EEIKEKQENSKKEMEDLFNLSLQKAFK 397 Query: 420 GEL 422 GEL Sbjct: 398 GEL 400 >UniRef50_B5IN27 HsdS, type I site-specific deoxyribonuclease n=1 Tax=Cyanobium sp. PCC 7001 RepID=B5IN27_9CHRO Length = 361 Score = 228 bits (581), Expect = 4e-58, Method: Composition-based stats. 
Identities = 116/253 (45%), Positives = 159/253 (62%), Gaps = 10/253 (3%) Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPIL 253 KRFRQAVL A +G+LT +WR S+ +K+ ++ E+RNGLS KP+ + G IL Sbjct: 6 KRFRQAVLAAATSGELTREWREARGIESLPRKIPLGEVIHEMRNGLSPKPSLNPPGVKIL 65 Query: 254 RISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQN 313 RI +VR G +D D R+LE S+ +L +L+ GDL+FTRYNG+LEFVG C + Sbjct: 66 RIGAVRPGTIDWTDHRYLELSDKDLAAFRLEAGDLIFTRYNGTLEFVGACANATSIPDV- 124 Query: 314 LLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL 372 +YPDKLIR R T ALP Y+EI FSS R+ + VK+++GQKGISG D+K+ L Sbjct: 125 YVYPDKLIRVRCDTSRALPAYVEISFSSVEIRDHIEGLVKSSAGQKGISGTDLKNIFFPL 184 Query: 373 PPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPD 432 P ++EQ EIV +V+ LF AD +E +++ A V+ LT ++LAKAFRG L Q + P Sbjct: 185 PSIEEQIEIVHQVQALFTLADQLESRLSAARKLVDRLTPALLAKAFRGALVPQDPNDEP- 243 Query: 433 LISGENSAAALLE 445 A+ LLE Sbjct: 244 -------ASVLLE 249 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. 
Identities = 44/265 (16%), Positives = 92/265 (34%), Gaps = 20/265 (7%) Query: 2 SAGKLPEGWVIAP----------VSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQ 50 ++G+L W A + V + G++ K + ++R ++ Sbjct: 17 TSGELTREWREARGIESLPRKIPLGEVIHEMRNGLSPKPSLNPP-----GVKILRIGAVR 71 Query: 51 NGKFDTTDLVFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFC 108 G D TD ++ + + ++ D++ +G+ VG A+ +P + Sbjct: 72 PGTIDWTDHRYLELSDKDLAAFRLEAGDLIFTRYNGTLEFVGACANATSIPDVYVYPDKL 131 Query: 109 GVLRPE-KLIFSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQK 166 +R + ++ S R+ I L + I I P+P + EQ Sbjct: 132 IRVRCDTSRALPAYVEISFSSVEIRDHIEGLVKSSAGQKGISGTDLKNIFFPLPSIEEQI 191 Query: 167 IIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 I ++ L D ++R +++ R A+L A G L + N EP + +++ Sbjct: 192 EIVHQVQALFTLADQLESRLSAARKLVDRLTPALLAKAFRGALVPQDPNDEPASVLLERI 251 Query: 227 NFESILTELRNGLSSKPNESGVGHP 251 S + +P Sbjct: 252 RAARQAEAAAGKASRRGRCKAAANP 276 >UniRef50_B8H0M3 Type I restriction-modification system specificity subunit n=2 Tax=Caulobacter vibrioides RepID=B8H0M3_CAUCN Length = 450 Score = 228 bits (581), Expect = 4e-58, Method: Composition-based stats. 
Identities = 82/434 (18%), Positives = 169/434 (38%), Gaps = 22/434 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W P+ + + G T KE+ +P A +++ T Sbjct: 18 GRVPSHWNFRPLKHLVIMRSGGTPSKER--EDYWGGEIPWASAKDLKVDTLTDTQDHLTA 75 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + L + + ++ P + V+ + G ++ ++ L + L + + ++ Sbjct: 76 EALDEGAAQLLPANAVVVLVRG--MMLARTFPVCRLSRPMTINQDLKGLIANRGVDPNYL 133 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 A ++S AG ++ ++ + +P P LAEQ+ IA LD A++D+ Sbjct: 134 AWSLRASEVETLCRLDEAGHGTKALRMDAWSTMELPAPSLAEQQAIAAFLDRETAKIDAL 193 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHS-VFKKLNFESIL 232 E++ +LK RQAV+ AV L +W P H V N + Sbjct: 194 VEAQERLIALLKEKRQAVISHAVTKGLDPSAQMKDSGVEWLGQMPAHWEVVPAKNLADSI 253 Query: 233 TELRNGLS-SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 G + +K S G+ + V G D +EL++++++ GDLL + Sbjct: 254 KAGPFGSALTKDMYSSAGYRVYGQEQVIPGDFRIGDYYVTSDRYNELSQYRVEVGDLLVS 313 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + G + + ++ P +LIR R P Y+ + S + + + Sbjct: 314 ----CVGTFGKIAIFPQGAEPGIINP-RLIRFRPNNQVDPTYLCVLLRSAVSFEQF-SYL 367 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 I+ + VV +PP++EQ I + ++ D++ A+ + Sbjct: 368 SRGGTMDVINIGILGEIVVPVPPMQEQISIAGYLAEVQEQFDSLSAASEAAITLLQERRA 427 Query: 412 SILAKAFRGELTAQ 425 ++++ A G++ + Sbjct: 428 ALISAAVTGKIDVR 441 Score = 77.4 bits (189), Expect = 1e-12, Method: Composition-based stats. 
Identities = 28/232 (12%), Positives = 67/232 (28%), Gaps = 12/232 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H F+ L I+ + + G P ++ + Sbjct: 15 EWLGRVPSHWNFRPLKHLVIMRSGGTPSKEREDYWGGEIPWASAKDLKVDTLTDTQDHLT 74 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + E L ++ + + +L + D L + P Sbjct: 75 AEALDEGAAQLLPANAVVVLVRGM---MLARTFPVCRLSRPMTINQD-LKGLIANRGVDP 130 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 Y+ + + + G K + + + P + EQ I +++ A Sbjct: 131 NYLAWSLRASEV-ETLCRLDEAGHGTKALRMDAWSTMELPAPSLAEQQAIAAFLDRETAK 189 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + +A + Q++++ A + +P ++ L Sbjct: 190 IDALVEAQERLIALLKEKRQAVISHAVT-------KGLDPSAQMKDSGVEWL 234 >UniRef50_Q2J5T0 Restriction modification system DNA specificity domain n=1 Tax=Frankia sp. CcI3 RepID=Q2J5T0_FRASC Length = 436 Score = 227 bits (578), Expect = 8e-58, Method: Composition-based stats. Identities = 95/430 (22%), Positives = 186/430 (43%), Gaps = 31/430 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 S G++ G I+ V T + G+T + P +R N+Q G+ +D+ + Sbjct: 12 SFGEIFPG-RISTVGTEFEIQSGITLSPRRTS---GRKDAPYLRVANVQRGRLTLSDVAW 67 Query: 62 VPKNLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPEKLIFS 119 + + + + + D+++ + + +G+ A C + LRP + + + Sbjct: 68 LEASARERIRYALDDGDLLVVEGHANPAEIGRCAQVGPESKNCLYQNHLFRLRP-RNLEA 126 Query: 120 GFIAHFTKSSLYRNKI-SSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F H+ SS ++ + + + + I + IP+PP +Q+ I+E LD Sbjct: 127 RFALHWLNSSFSQSYWGRNCATSSGLYTINSRQLGALPIPVPPPDKQRKISEILDAADEA 186 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAV--NGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 + ST+ ++ Q+ R +L V +G+L + WR ++ L+E+ Sbjct: 187 IRSTERLVGKLEQVFDSLRGDLLQEHVIRSGRLPDCWR-----------MDRLDRLSEIT 235 Query: 237 NGLSSKPNES---GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 G++ S V P LR+++V+ G++D DI+ + SE +R+ LQ GD+L T Sbjct: 236 
GGVTLGGVTSAGRSVELPYLRVANVQDGYIDTTDIKTVTVRTSEFDRYLLQAGDVLMTE- 294 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVK 352 G + +G G + L+ + + R R K LPEY+ + +S + R+ M K Sbjct: 295 GGDFDKLGR-GAVWDGSIDPCLHQNHIFRVRCDKIRLLPEYLSTYSASTAGRSYFMGISK 353 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 T+ I+ + + V LPP+ Q I+ + A+ LA++ + Q Sbjct: 354 QTTNLASINKSQLSALPVPLPPLATQKMIIGSL----GAAERQISSTKAELAKLRLVKQG 409 Query: 413 ILAKAFRGEL 422 ++ G + Sbjct: 410 LMDDLLMGRV 419 Score = 125 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 48/214 (22%), Positives = 97/214 (45%), Gaps = 6/214 (2%) Query: 234 ELRNGLSSKPNESG--VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 E+++G++ P + P LR+++V+ G + +D+ +LE S E R+ L DGDLL Sbjct: 29 EIQSGITLSPRRTSGRKDAPYLRVANVQRGRLTLSDVAWLEASARERIRYALDDGDLLVV 88 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + + +G C + + +N LY + L R R ++ + + +S +++ Sbjct: 89 EGHANPAEIGRCAQV-GPESKNCLYQNHLFRLRP-RNLEARFALHWLNSSFSQSYWGRNC 146 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 T+SG I+ + + + + +PP +Q +I ++ + E+ V ++L Sbjct: 147 ATSSGLYTINSRQLGALPIPVPPPDKQRKISEILDAADEAIRSTERLVGKLEQVFDSLRG 206 Query: 412 SILAKAF--RGELTAQWRAENPDLISGENSAAAL 443 +L + G L WR + D +S L Sbjct: 207 DLLQEHVIRSGRLPDCWRMDRLDRLSEITGGVTL 240 Score = 121 bits (303), Expect = 6e-26, Method: Composition-based stats. 
Identities = 46/210 (21%), Positives = 95/210 (45%), Gaps = 7/210 (3%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 + +G+LP+ W + + ++ + GVT + + LP +R N+Q+G DTTD+ Sbjct: 214 IRSGRLPDCWRMDRLDRLSEITGGVTLG--GVTSAGRSVELPYLRVANVQDGYIDTTDIK 271 Query: 61 FVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKL-I 117 V + + + D+++ G +G+ A + C +R +K+ + Sbjct: 272 TVTVRTSEFDRYLLQAGDVLMTE-GGDFDKLGRGAVWDGSIDPCLHQNHIFRVRCDKIRL 330 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 +++ ++ S+ R+ +S + +I + + +P+PPLA QK+I L Sbjct: 331 LPEYLSTYSASTAGRSYFMGISKQTTNLASINKSQLSALPVPLPPLATQKMIIGSLGAAE 390 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVN 206 Q+ STKA ++ + + +L G V Sbjct: 391 RQISSTKAELAKLRLVKQGLMDDLLMGRVQ 420 >UniRef50_C6RQJ9 Restriction endonuclease S subunit n=2 Tax=Acinetobacter RepID=C6RQJ9_ACIRA Length = 461 Score = 227 bits (578), Expect = 9e-58, Method: Composition-based stats. Identities = 78/456 (17%), Positives = 173/456 (37%), Gaps = 32/456 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFV 62 G +P W+I + + G + + D P+IR +I+ +G + + ++ Sbjct: 19 GVVPSHWIITTLKRYCYVKGGFAFSS----DAFIDTGYPVIRIGDIKTDGSINLENCKYI 74 Query: 63 PKNLV--KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFS 119 P++L + +++AM+ + +GK+ G + + Sbjct: 75 PESLAVNSRDYLVEKNQLLMAMTGAT---IGKAGLYTSNQPAFLNQRVGKFELLAQNMNY 131 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + K+ Y+ I + G NI + IP EQ IA LD +++ Sbjct: 132 RYLWYILKTDGYQEYIKLTAFGGAQPNISDTAMVDYPATIPSFDEQTQIANFLDHETSKI 191 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D + +++ ++LK RQAV+ AV L +W P+H +L + + Sbjct: 192 DHLIEKQQRLIELLKEKRQAVISHAVTKGLNPNVPMKDSGVEWLGEVPEHWRISRLKYNA 251 Query: 231 ILTEL--RNGLSSKP-NESGVGHPILRISSVRA-GHVDQNDIRFLECSES-ELNRHKLQD 285 + G + + G +L S++ + +L + E + + Sbjct: 252 SIFGRIGFRGYTVDDIVDEDEGALVLSPSNISNANKLTLEKKTYLSWKKYFESPEIIVDE 311 Query: 286 
GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 DLL + + + ++ KL+ + LI+ P ++ F S ++ Sbjct: 312 NDLLLVKTGSTFGKSAI--IVNKLEPMTINPQMALIK---KSKIEPRFLGYLFGSKLIKS 366 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 ++ T SG ++ ++I + + LP +E I ++ D + ++ + Sbjct: 367 -IIENSNTGSGMPTMTQENINNFPIPLPSDEEAIIISNYLDNKTYKIDFLIEKSEQTILL 425 Query: 406 VNNLTQSILAKAFRGELTAQ-WRAENPDLISGENSA 440 + ++++ A G++ + W+A E SA Sbjct: 426 MQERRTALISAAVTGKIDVRNWQAPTVAEADAEFSA 461 >UniRef50_C2CF25 Restriction modification system DNA specificity domain protein n=2 Tax=Clostridiales Family XI. Incertae Sedis RepID=C2CF25_9FIRM Length = 495 Score = 226 bits (576), Expect = 1e-57, Method: Composition-based stats. Identities = 92/441 (20%), Positives = 182/441 (41%), Gaps = 58/441 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDLVFVP 63 +PE W + V I G K A + LK++ +P I A N+++G D +L+++ Sbjct: 66 DIPESWKWVRLGDVFQFINGDRGKNYPAKSKLKENGDIPFISAINLKDGTVDENNLLYLD 125 Query: 64 KNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 N + S K+ DIV+ + +GK+ + + + +LR K I F Sbjct: 126 INQYERLGSGKLLKNDIVLCIR----GSLGKNCIYPFE-KGAIASSLVILRNYKKIKLEF 180 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ S L+ ++ G N+ + I +P+PPL EQ+ I EK++ L+ VD Sbjct: 181 VLNYLNSYLFYSETKKYDNGTAQPNLSAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDK 240 Query: 182 TKARFEQIPQILKRF----RQAVLGGAVNGKLTEKWRNFEPQHSV--------------- 222 ++ + + K+F ++++L A+ G+L E+ + + Sbjct: 241 YGKNWQMLEDLNKKFPEDLKKSLLQEAIKGRLVEQRKEEGTGEELFELIKEEKNKLIKEG 300 Query: 223 -----------------------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR 259 +K + I +L +G P + G P L + + Sbjct: 301 KIKKQKPLPEITEEEIPFDIPESWKWVRLGEITLKLTDGAHKTPTYTNEGIPFLSVKDIS 360 Query: 260 AGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 +G +D + RF+ E + R + GDLL T+ + G+ ++ + +L Sbjct: 361 SGKIDYSSCRFISKKEHDKLFERCNPERGDLLLTKVGTT----GIPVVIDTDEEFSLFVS 416 Query: 318 
DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 L++ K +++ +SP + + G K +DI + ++ LPP+ E Sbjct: 417 VALLKF-PKKLINIYFLKHLINSPLVQVQVKEN-TRGVGNKNWVMRDIANTIIPLPPLAE 474 Query: 378 QAEIVRRVEQLFAYADTIEKQ 398 Q +V ++E+L + + K Sbjct: 475 QKRLVEKLEELLPLCEQVIKN 495 Score = 113 bits (283), Expect = 1e-23, Method: Composition-based stats. Identities = 57/310 (18%), Positives = 117/310 (37%), Gaps = 27/310 (8%) Query: 165 QKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQA---VLGGAVNGKLTEKWRNFEPQHS 221 Q+ I KL + + + ++ I + + + + E+ P+ Sbjct: 12 QRAIEGKLVEQRKEEGTGEELYKLIQEEKNKLIKEGKVKKQKPLPEITEEEIPFDIPESW 71 Query: 222 VFKKLNFESILTELRNGLS--SKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESE- 277 + +L G + +K G P + +++ G VD+N++ +L+ ++ E Sbjct: 72 KWVRLGDVFQFINGDRGKNYPAKSKLKENGDIPFISAINLKDGTVDENNLLYLDINQYER 131 Query: 278 LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIF 337 L KL D++ +G + + L+ R K E++ + Sbjct: 132 LGSGKLLKNDIVLCIRGS----LGKNCIYP---FEKGAIASSLVILRNYKKIKLEFVLNY 184 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 +S + + Q +S ++ K ++ LPP+KEQ IV ++E L D K Sbjct: 185 LNSYLFYSE-TKKYDNGTAQPNLSAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDKYGK 243 Query: 398 Q---VNNALARV-NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 + + + +L +S+L +A +G L Q E + L E IK E+ Sbjct: 244 NWQMLEDLNKKFPEDLKKSLLQEAIKGRLVEQ--------RKEEGTGEELFELIKEEKNK 295 Query: 454 SGGKKASRKK 463 + +K+ Sbjct: 296 LIKEGKIKKQ 305 >UniRef50_B7K558 Restriction modification system DNA specificity domain protein n=2 Tax=Bacteria RepID=B7K558_CYAP8 Length = 453 Score = 226 bits (575), Expect = 2e-57, Method: Composition-based stats. 
Identities = 91/450 (20%), Positives = 180/450 (40%), Gaps = 46/450 (10%) Query: 4 GKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P+GW + + + + I G T K D + +R+ NI D+V++ Sbjct: 24 GDIPDGWEVKRLKWIVSKIGSGKTPK--GGAEIYSDSGIIFLRSQNIHFDGLRLDDVVYI 81 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPE-KLIF 118 K++ K S ++ P DI++ ++ S +G+ F + +LRP I Sbjct: 82 NKDIDKAMSSSRVKPLDILLNITGAS---LGRCMIIPKDFPSSNVNQHVCILRPIVTRIN 138 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ S+ +N+I S G + + A + P L EQ+ IA+ LD A+ Sbjct: 139 PYFLNRVMSSNAIQNQIFSSEVGVSREGLTFAQAGNLISVFPSLPEQEKIAQFLDEETAK 198 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D +++ ++LK R A++ AV L +W F P+H KK+ Sbjct: 199 IDKLITHKQRLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFIPEHWEVKKIKRL 258 Query: 230 SILTELRNGLSSKPNESGV------GHPILRISSVR--AGHVDQNDIRFLECSESELNRH 281 S++ G S +P + + + +RIS V ++ + + + E + Sbjct: 259 SLVKR---GASPRPIDDPIYFDDNGEYVWVRISDVTASNKYLLEAEQKLSEIGKR--KSV 313 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 LQ +L + VG + + + ++ + L ++ EY+ F Sbjct: 314 PLQPNELFLSIC----ASVGKPII---TKIKCCIHDGFVYFPELKENR--EYLYYIFLGG 364 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 + Q ++ + I + +PPV EQ +I +++ D I K+ Sbjct: 365 ELYKGLGKM----GTQLNLNTEIIGDVKLPIPPVSEQQKIAEYLDEKTEQIDPIIKKTRE 420 Query: 402 ALARVNNLTQSILAKAFRGELTA-QWRAEN 430 ++ + ++++ A G++ QW E Sbjct: 421 SIEYLKEYRTALISAAVTGKIDVRQWGCEE 450 >UniRef50_C6JA10 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JA10_9FIRM Length = 393 Score = 225 bits (574), Expect = 2e-57, Method: Composition-based stats. 
Identities = 85/411 (20%), Positives = 162/411 (39%), Gaps = 27/411 (6%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK-ESQ 71 + ++ G +K E + D + +IR N+Q G + VF P + + Sbjct: 4 IRLGDACDILNGFAFKSENYV----DSGIRVIRIANVQKGYIEDNTPVFYPLETNELDKY 59 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPE-KLIFSGFIAHFTKSS 129 + D+++A++ VG+ A F + LR + + ++ H S+ Sbjct: 60 MLEEGDLLMALTGN----VGRVAILKKEFMPAALNQRVACLRLKTDRVAKDYLFHVLNSA 115 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 + + S G N+ IP+ P +Q++IA+ LD + S +++ Sbjct: 116 FFEQQCIQSSKGVAQKNMSTEWLKDYEIPMYPKEQQELIADILDKTRNIIISRNYELKKL 175 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG 249 ++K + G A + K + +V + ++ G G Sbjct: 176 DDLIKARFVEMFGDAYLNEFGWKKIKIKNAVTVEPQNGMYKPQSDYVT--------DGSG 227 Query: 250 HPILRISSVRAGHV-DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 PILRI G V D + ++ L CSE+E ++ L + D++ R N S+E++G C + Sbjct: 228 IPILRIDGFYDGVVTDFSSLKRLRCSENERQKYLLYEDDVVINRVN-SIEYLGKCAHING 286 Query: 309 LQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 L +Y ++R P Y+ S + ++N K Q I+ KD+ Sbjct: 287 LLEDT-VYESNMMRMHFDSTRFHPVYVCRLLCSRFVYDQIVNHAKQAVNQASINQKDVLD 345 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + PP+K Q + V D + ++ AL + L S++ + F Sbjct: 346 FDIYEPPLKLQIQFADFV----RAVDKSKVEIQKALDKTQMLFDSLMQEYF 392 Score = 90.9 bits (224), Expect = 1e-16, Method: Composition-based stats. 
Identities = 39/189 (20%), Positives = 75/189 (39%), Gaps = 9/189 (4%) Query: 224 KKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL 283 KK+ L N G ++RI++V+ G+++ N F +EL+++ L Sbjct: 2 KKIRLGDACDILNGFAFKSENYVDSGIRVIRIANVQKGYIEDNTPVFYPLETNELDKYML 61 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 ++GDLL VG +LKK L T +Y+ +S Sbjct: 62 EEGDLLMALTG----NVGRVAILKKEFMPAALNQRVACLRLKTDRVAKDYLFHVLNSAFF 117 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + K QK +S + +K + + P ++Q I +++ I N L Sbjct: 118 EQQCIQSSK-GVAQKNMSTEWLKDYEIPMYPKEQQELIADILDK----TRNIIISRNYEL 172 Query: 404 ARVNNLTQS 412 ++++L ++ Sbjct: 173 KKLDDLIKA 181 Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 69/198 (34%), Gaps = 4/198 (2%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF-DTTDLVFVPKNLV 67 GW + T+ K Q+ +P++R + +G D + L + + Sbjct: 196 GWKKIKIKNAVTVEPQNGMYKPQSDYVTDGSGIPILRIDGFYDGVVTDFSSLKRLRCSEN 255 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQH-LPFECSFGAFCGVLRPEKL-IFSGFIAHF 125 + + + ED V+ S +GK AH + L + + + + + ++ Sbjct: 256 ERQKYLLYEDDVVINRVNSIEYLGKCAHINGLLEDTVYESNMMRMHFDSTRFHPVYVCRL 315 Query: 126 TKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S ++I + + A N +I +I PPL Q A+ + + + Sbjct: 316 LCSRFVYDQIVNHAKQAVNQASINQKDVLDFDIYEPPLKLQIQFADFVRAVDKSKVEIQK 375 Query: 185 RFEQIPQILKRFRQAVLG 202 ++ + Q G Sbjct: 376 ALDKTQMLFDSLMQEYFG 393 >UniRef50_A6W078 Restriction modification system DNA specificity domain n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W078_MARMS Length = 400 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. 
Identities = 87/423 (20%), Positives = 169/423 (39%), Gaps = 38/423 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GWV+A + V +R T+ +A + +PL+ + ++ NGK D + ++ + Sbjct: 10 LPKGWVLAKANDVMD-VRDGTHDSPKA----QATGIPLVTSKSLVNGKIDYSTCTYISEQ 64 Query: 66 LVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + K + DI+ AM +G F+ S + + + +I Sbjct: 65 DHESISKRSAVDDGDILYAM----IGTIGNPVIVKKDFDFSIKNVALFKFTKTDLSNRYI 120 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 H+ S L + + + S G + + + IP+PPL EQK IA L + D+ Sbjct: 121 FHYLNSGLAKRQFENNSRGGTQKFVSLGNIRELMIPLPPLEEQKRIAAIL----DKADAI 176 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 + + +Q + F ++V +T + + + ++ +G Sbjct: 177 RRKRQQAIDLADEFLRSVFLDMFGDPVTNPKGK--------RIVPLIELCNKVTDGTHQS 228 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHK-LQDGDLLFTRYNGSLEFV 300 P G P L IS++ G + + +F+ EL R ++ GD+L+T GS V Sbjct: 229 PKWEESGIPFLFISNIVNGKISFDTNKFISKETLDELTRSTPIEKGDVLYTTV-GSYGNV 287 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKG 359 + + + + + E++ +S R + V+ QK Sbjct: 288 ARV-----TDDTEFCFQRHIAHIKPNHEIVNAEFLTSMLASSVVRRQADSLVR-GIAQKT 341 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ +++K +V ++ Q ++ VE + D + VN L S++ KAF Sbjct: 342 LNLRELKEILVFDVSLENQKSYLKIVEPIHKIKDNYDNSVNELLNN----FNSLIQKAFS 397 Query: 420 GEL 422 GEL Sbjct: 398 GEL 400 >UniRef50_Q0W4T6 Type I restriction modification system, specificity subunit n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W4T6_UNCMA Length = 484 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. 
Identities = 107/504 (21%), Positives = 190/504 (37%), Gaps = 72/504 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP GW + + + ++ I +K + +P I +I+ L F Sbjct: 6 ELPTGWCSTDLGDIIS-------PSKEKIEPVKTESIPYIGLEHIEKDTGKL--LSFGNS 56 Query: 65 NLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 V ++ + D++ + K + CS V ++ + + + Sbjct: 57 TEVTSTKTVFHKGDLLYGKLRPYLN---KVCVTEIDGICSTD--ILVFNEQRFLSNKLLK 111 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + + + G N + I +PPLAEQ I K++ L Q+D+ Sbjct: 112 YRMLCPDFVRYANQNATGVNHPRVDFKKIASFEIALPPLAEQHRIVAKIEELFTQLDAGV 171 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH----------------------- 220 ++ + +K++RQAVL A NGKLTEKWR ++ Sbjct: 172 EALKKAKEQIKQYRQAVLESAFNGKLTEKWRLSSKEYIAPISEFISNVQKTRSTDGKTVC 231 Query: 221 ------------SVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQND 267 + L + + L S N G P + S+V + ++ + D Sbjct: 232 DQLESTLEMPNGWLGVLLYQIADIGTGATPLRSNKNYYENGTIPWITSSAVNSQYITKAD 291 Query: 268 IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 E + E N L+ Y V LL + I Sbjct: 292 EFITELAIKETNAKIFPKNSLIIALYGEGKTRGKVSELLIEAATNQACAA---IIFNDQT 348 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 L +I+++F + + + Q ++ IKS ++ LPP+ EQ IV +E+ Sbjct: 349 VVLKPFIKLYFQKNY---EDLRKLASGGVQPNLNLGIIKSTLIPLPPLAEQEIIVGEIEK 405 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 F + IEK ++ +L+ L QSIL++AF G+L Q + P A LLE+I Sbjct: 406 KFPIMEDIEKTIDQSLSYSETLRQSILSQAFSGKLVPQNPNDEP--------AEKLLERI 457 Query: 448 KAERAA-------SGGKKASRKKS 464 +AER + G + +RK++ Sbjct: 458 RAERLNQAAGKPQNSGPRRTRKQA 481 >UniRef50_A3YSG6 Putative type I restriction enzyme specificity protein n=2 Tax=Campylobacter jejuni subsp. jejuni RepID=A3YSG6_CAMJE Length = 433 Score = 225 bits (573), Expect = 3e-57, Method: Composition-based stats. 
Identities = 86/435 (19%), Positives = 171/435 (39%), Gaps = 32/435 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + ++ + T + G ++ + +P+IR ++Q K + + Sbjct: 13 GEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFE---IPVIRIGDMQKEKILYDNCLKTK 69 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + IS DI+IA+S + GK A + ++R + + + Sbjct: 70 EKEKLKQFLISNNDILIALSGATT---GKIAFCDTDNKAYINQRVAIVRSKLKL----VK 122 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ + + I G+ NI IP+PPL EQ+ IA LD Q+ + Sbjct: 123 YYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANFLDEKCEQIANFI 182 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTE 234 + E++ +LK +QA + + L + +W PQH KK +L Sbjct: 183 EKKEKLISLLKEQKQAFINETITKGLDKNINFKDSGIEWLGEIPQHWEVKK---FKMLFT 239 Query: 235 LRNGLS-SKPNESGVGHPILRISSVRAGH-----VDQNDIRFLECSE-SELNRHKLQDGD 287 L NGL+ +K + G P + + + + + + F+ + ++ + LQ GD Sbjct: 240 LGNGLNITKADFVSYGIPCVSYGEIHSKYPCRLNTTIHTLPFVSKTYLADKPQSLLQKGD 299 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +F + +E G ++ Y I + Y F S RN + Sbjct: 300 FVFADTSEDIEGSGNFTSIQSDTPIFAGY--HTIILKYKGKINSLYFSFLFDSIFTRNQI 357 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 V I+ +K L+PP+KEQ +I +++ D + ++ + + Sbjct: 358 RKEVC-GVKVFSITKSILKEVQCLIPPLKEQEQIANFLDEKCEKIDLLIEKTKKQIKLIK 416 Query: 408 NLTQSILAKAFRGEL 422 +++ +A G + Sbjct: 417 EYKTTLINQAVCGRI 431 >UniRef50_Q5QX28 Restriction endonuclease S subunit n=1 Tax=Idiomarina loihiensis RepID=Q5QX28_IDILO Length = 448 Score = 224 bits (572), Expect = 4e-57, Method: Composition-based stats. 
Identities = 92/455 (20%), Positives = 180/455 (39%), Gaps = 43/455 (9%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPE W + + V + G + E + P+IR +I+N + + +++ V Sbjct: 20 LPERWKLIKLKLVCNIETGFAFPSE----VFGETGTPVIRITDIKNREINLSEIKRVDDL 75 Query: 66 LVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L+K K ++ DI++AM+ + +GK + + P I G++ Sbjct: 76 LLKSKPKRPSVNKGDIIMAMTGAT---IGKVGYYNSDKPSYLNQRVCRFIPAS-IDRGYL 131 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLA-EQKIIAEKLDTLLAQVDS 181 H S +Y+ I + G NI + P+P L EQ+ IA+ LD A++D+ Sbjct: 132 WHTLNSEIYKKYIELEAFGGAQANISDSQLLNFPAPLPELEAEQQKIAQFLDYETAKIDA 191 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 +++ ++LK RQAV+ AV L +W P+H KKL F S + Sbjct: 192 LIDEQKRLIELLKEKRQAVISHAVTKGLNPDAPMKDSGIEWLGEVPEHWEIKKLKFCSRM 251 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 LS K ++ + + ++ G S + + D+LF + Sbjct: 252 ------LSDKGKDNTNA---ISLENIENG----TGAFIKTESNFDQEGVLFEPLDILFGK 298 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 L V +H + L ++ R KD PE++ S + + Sbjct: 299 LRPYLAKV-----YLAREHGSAL--GDILVFRANKDISPEFLFFRLISQEFIRQV-DQSS 350 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL-FAYADTIEKQVNNALARVNNLTQ 411 S + + IKS + +PP++EQ ++ + L F ++ + + Sbjct: 351 YGSKMPRANPELIKSLQIAVPPIEEQVKVSDYLANLQFNKIMPSVINASSLVKLLEERRS 410 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEK 446 ++++ A G++ + + +++A+ E+ Sbjct: 411 ALISAAVTGKIDVRDWQPPAGSDTVDSNASVQTER 445 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. 
Identities = 40/263 (15%), Positives = 97/263 (36%), Gaps = 23/263 (8%) Query: 201 LGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV-GHPILRISSVR 259 + ++ T R+ + KL ++ + G + G G P++RI+ ++ Sbjct: 1 MNKPIHESETRTVRDLKELLPERWKLIKLKLVCNIETGFAFPSEVFGETGTPVIRITDIK 60 Query: 260 AGHVDQNDIRFLEC--SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 ++ ++I+ ++ +S+ R + GD++ ++ VG + Sbjct: 61 NREINLSEIKRVDDLLLKSKPKRPSVNKGDIIMAMTGATIGKVGY-----YNSDKPSYLN 115 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK- 376 ++ R Y+ +S + + + Q IS + + LP ++ Sbjct: 116 QRVCRFIPAS-IDRGYLWHTLNSEIYKKYIELEAFGGA-QANISDSQLLNFPAPLPELEA 173 Query: 377 EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISG 436 EQ +I + ++ A D + + + + Q++++ A + NPD Sbjct: 174 EQQKIAQFLDYETAKIDALIDEQKRLIELLKEKRQAVISHAVT-------KGLNPDAPMK 226 Query: 437 ENSAAALLE-----KIKAERAAS 454 ++ L E +IK + S Sbjct: 227 DSGIEWLGEVPEHWEIKKLKFCS 249 Score = 90.1 bits (222), Expect = 2e-16, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 75/208 (36%), Gaps = 20/208 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W I + + ++ A I NI+NG T + Sbjct: 234 GEVPEHWEIKKLKFCSRMLSDKGKDNTNA-----------ISLENIENG---TGAFIKTE 279 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 N +E P DI+ + V S V R K I F+ Sbjct: 280 SNFDQEGVLFEPLDILFGKLRPYLAKV-----YLAREHGSALGDILVFRANKDISPEFLF 334 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL-LAQVDST 182 S + ++ S G+ + P + I +PP+ EQ +++ L L ++ + Sbjct: 335 FRLISQEFIRQVDQSSYGSKMPRANPELIKSLQIAVPPIEEQVKVSDYLANLQFNKIMPS 394 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + ++L+ R A++ AV GK+ Sbjct: 395 VINASSLVKLLEERRSALISAAVTGKID 422 >UniRef50_A6C679 Type I restriction-modification system, S subunit n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C679_9PLAN Length = 450 Score = 224 bits (572), Expect = 4e-57, Method: Composition-based stats. 
Identities = 91/446 (20%), Positives = 174/446 (39%), Gaps = 43/446 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD------TT 57 GK+PE W + + + + +D LP+++ + I +G D + Sbjct: 18 GKVPEHWDVFRMGILFA-----------EVAESGNDDLPVLQVS-IHHGVSDRELSESES 65 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 D + + +++ P D+V M + G E V RP+ Sbjct: 66 DRKITRIDDKSKYKRVVPNDLVYNMMRAWQGGFGTV-----KVEGMVSPAYVVARPKIDF 120 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + FI H ++ ++ S G + F + + +P +EQ+ I + +D Sbjct: 121 QTQFIEHLFRTPQAIEQMRRYSHGVTDFRLRLYWDKFKNVRVALPDKSEQQEICDYIDVE 180 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 +++D+ A ++ ++LK RQAV+ AV L +W P+H L Sbjct: 181 TSKIDALVAEQRRLIELLKEKRQAVISHAVTKGLNPNAPMKDSGIEWLGDVPEHWEVCSL 240 Query: 227 NFESILTELRNGLSSKPNESG---VGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHK 282 + + G S PNE+ G L ++ G +D + +F+ + + LNR K Sbjct: 241 RRYAFFVDGDRG-SEYPNENDLTSDGILFLSSKNIVGGKLDLKESKFISHEKFDALNRGK 299 Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKK--LQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 QDGD L + GS +G L + +++ R P+Y+ S Sbjct: 300 AQDGD-LIVKVRGSTGRIGEMALFDVGAYSFETAFINAQMMIIRTGNKLTPKYLSKVSQS 358 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + + + Q+ +S K V +PPV EQAEI ++ D++E + Sbjct: 359 IYWMEQL-SVGAYGTAQQQLSNKVFSDLFVTMPPVTEQAEIADFIDLKVGEFDSLETEAE 417 Query: 401 NALARVNNLTQSILAKAFRGELTAQW 426 A+ + ++++ A G++ + Sbjct: 418 QAIELLQERRTALISAAVTGKINVRD 443 Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats. 
Identities = 33/199 (16%), Positives = 72/199 (36%), Gaps = 21/199 (10%) Query: 251 PILRISSVRAGHVD-----QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 P+L++S + G D R + + + ++ DL++ G + Sbjct: 45 PVLQVS-IHHGVSDRELSESESDRKITRIDDKSKYKRVVPNDLVYNMMRAWQGGFGTVKV 103 Query: 306 LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS-GQKGISGKD 364 + ++ P ++ AR D ++IE F +P A M + + + Sbjct: 104 ------EGMVSPAYVV-ARPKIDFQTQFIEHLFRTPQAIEQMRRYSHGVTDFRLRLYWDK 156 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 K+ V LP EQ EI ++ + D + + + + Q++++ A Sbjct: 157 FKNVRVALPDKSEQQEICDYIDVETSKIDALVAEQRRLIELLKEKRQAVISHAVT----- 211 Query: 425 QWRAENPDLISGENSAAAL 443 + NP+ ++ L Sbjct: 212 --KGLNPNAPMKDSGIEWL 228 >UniRef50_B7VNG6 Type I restriction enzyme EcoKI, S subunit n=1 Tax=Vibrio splendidus LGP32 RepID=B7VNG6_VIBSL Length = 522 Score = 224 bits (570), Expect = 7e-57, Method: Composition-based stats. Identities = 112/521 (21%), Positives = 195/521 (37%), Gaps = 93/521 (17%) Query: 1 MSAGKLPEGWVIAPVSTVTT----LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT 56 MS +LP+GW+ S + I + + D+ P++R N++ F Sbjct: 1 MS--ELPKGWIACTPSDLANDPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLN 58 Query: 57 TDLVFVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE 114 ++ +V + + D+++ + E A LRP Sbjct: 59 KNIKYVTDEKAEFLKRHSFKSGDLLLTKLGEPLGL--TCIAPEYLNEGIIVADIVRLRPN 116 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + +A+ S +I++ + G+ I + +NI +PPLAEQK I EK+D Sbjct: 117 PEVNRKCLAYLLNSEGVIKQINAHTKGSTRARINLSVVRNLNINLPPLAEQKRIVEKIDE 176 Query: 175 LLAQVDSTKARFE--------------------QIPQILKR----------FRQAVLGGA 204 +LAQVD+ KAR + ++ + + + + Sbjct: 177 VLAQVDTIKARLDGIPDLLKRFRQSVLTSAVSGKLTEEWREEQDAYPTLNELKATIEQER 236 Query: 205 VNGKL---------------TEKWRN-------------------FEPQHSVFKKLNFES 230 +KW+ + L+ S Sbjct: 237 FEIWCSAELNKKISKGKPPANDKWKEKYQPGNPKHNDSNKRTAVEEIKAPWLLTSLDAVS 296 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 ILT + ++K + + + N R++ + ++ + G L Sbjct: 297 
ILTTGKTPSTAKDEYWNGDTMFVSPAQIHPEGYLHNPSRYVSKAGCQIVP-LISKGSTLI 355 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 + VG GLL + +++ ++ +Y+ + + +++ Sbjct: 356 V----CIGTVGKVGLLTE----DVVINQQINAITPLPSVTHKYMYYWCKTLY--PWIIDT 405 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + T ++ + + LPP++EQ EIVR V+Q FA+ADTIE QV A ARV+NLT Sbjct: 406 ARATVNAAILNKSTMSTAPFALPPLEEQKEIVRLVDQYFAFADTIEAQVKKAQARVDNLT 465 Query: 411 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 QSILAKAFRGEL AQ + P A LL +I R Sbjct: 466 QSILAKAFRGELVAQDPNDEP--------ADKLLARIAEAR 498 Score = 149 bits (377), Expect = 2e-34, Method: Composition-based stats. Identities = 59/241 (24%), Positives = 115/241 (47%), Gaps = 10/241 (4%) Query: 227 NFESILTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQ 284 + ++ + + G + K +E G PI+RI +V+ +I+++ ++E L RH + Sbjct: 19 DPKNEIVDGPFGSNLKASEYTDEGTPIVRIQNVKRMAFLNKNIKYVTDEKAEFLKRHSFK 78 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 GDLL T+ L G+ + + ++ ++ D ++R R + + + +S Sbjct: 79 SGDLLLTKLGEPL---GLTCIAPEYLNEGIIVAD-IVRLRPNPEVNRKCLAYLLNSEGVI 134 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + N S + I+ +++ + LPP+ EQ IV +++++ A DTI+ +++ Sbjct: 135 KQI-NAHTKGSTRARINLSVVRNLNINLPPLAEQKRIVEKIDEVLAQVDTIKARLDGIPD 193 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG--GKKASRK 462 + QS+L A G+LT +WR E D N A +E+ + E S KK S+ Sbjct: 194 LLKRFRQSVLTSAVSGKLTEEWR-EEQDAYPTLNELKATIEQERFEIWCSAELNKKISKG 252 Query: 463 K 463 K Sbjct: 253 K 253 >UniRef50_UPI0001BC509B restriction modification system DNA specificity domain protein n=3 Tax=Fusobacterium RepID=UPI0001BC509B Length = 503 Score = 224 bits (570), Expect = 7e-57, Method: Composition-based stats. 
Identities = 107/485 (22%), Positives = 212/485 (43%), Gaps = 54/485 (11%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKK-EQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 ++P+ WV + ++ ++ RG++Y K ++ I D+ ++R N+ + D V+V Sbjct: 26 EIPDSWVWVRLGSIVSVHRGLSYSKVDEIIRENNDEGYLVLRGGNLTEDGLNFEDNVYVR 85 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGK-SAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + + + ++ D+++ S+GS V+G+ +H + + GAF + RP I S ++ Sbjct: 86 EEIGRRAIELEENDVILVASTGSSKVIGRACIVEHKLEKTTIGAFLMLCRPVTSI-SKWV 144 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + K + YRN IS++S G+NI NIK I PP+ EQ+ I +KLD L + Sbjct: 145 HYIFKGNSYRNYISNISKGSNIKNIKGEYITNYAISFPPIEEQQRIVKKLDFLFEKTKKA 204 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN--------------- 227 K +++ + ++ + ++L A G+LT+KWR SV + L Sbjct: 205 KKLLQEVKEEIEMRKISILDKAFRGELTKKWREKNKTGSVLELLQEIQNEKMKKWEEECC 264 Query: 228 -------------FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS 274 S + E+ +P + +++ + + E Sbjct: 265 EAEKNGRKKPKKIKLSKIEEMIVPKEEEPYKIPDTWKWVKLGEIS--QISMGQSPLGEKV 322 Query: 275 ESELNRHKL-QDGDL-----LFTRYNG---SLEFVG-VCGLLKKLQHQNLLYPDKLIRAR 324 S + + D+ + TRY L +G + ++ +N+ + R Sbjct: 323 NSLIGVGLIGGPSDMGENYPIITRYTSQITKLSSIGDIIVSIRATLGKNIFSDGEYCLGR 382 Query: 325 LTKDALPEYIEIFFSSPSARNAM--MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 + + N++ + + + + +S +DI + LPP++EQ EIV Sbjct: 383 GVCGIRSKIVNNILLRFYFTNSIEYLYKISSGTTFAQVSKEDISNLYFSLPPLEEQQEIV 442 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 R +E++ +++ + + +++ L +SIL KAFRG+L Q + P A Sbjct: 443 RVLEEVLEKEKKVKELI-DLEEKIDLLEKSILDKAFRGKLGTQDINDEP--------ALE 493 Query: 443 LLEKI 447 LL+KI Sbjct: 494 LLKKI 498 Score = 127 bits (318), Expect = 1e-27, Method: Composition-based stats. 
Identities = 55/265 (20%), Positives = 112/265 (42%), Gaps = 16/265 (6%) Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP------NES 246 + + ++ + + L K + + S + + GLS + Sbjct: 1 MAKKKELTIEEKLQAALVSKEEQPYEIPDSWVWVRLGS-IVSVHRGLSYSKVDEIIRENN 59 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G+ +LR ++ ++ D ++ +L++ D++ GS + +G ++ Sbjct: 60 DEGYLVLRGGNLTEDGLNFEDNVYVREEIGRRAI-ELEENDVILVASTGSSKVIGRACIV 118 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + + + ++ +T + +++ F S RN + N K S K I G+ I Sbjct: 119 EHKLEKTTIGAFLMLCRPVTS--ISKWVHYIFKGNSYRNYISNISK-GSNIKNIKGEYIT 175 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 + + PP++EQ IV++++ LF +K + + SIL KAFRGELT +W Sbjct: 176 NYAISFPPIEEQQRIVKKLDFLFEKTKKAKKLLQEVKEEIEMRKISILDKAFRGELTKKW 235 Query: 427 RAENPDLISGENSAAALLEKIKAER 451 R +N S LL++I+ E+ Sbjct: 236 REKN-----KTGSVLELLQEIQNEK 255 >UniRef50_C7RQC3 Type I restriction-modification system specificity subunit n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RQC3_9PROT Length = 475 Score = 223 bits (569), Expect = 9e-57, Method: Composition-based stats. 
Identities = 78/436 (17%), Positives = 173/436 (39%), Gaps = 26/436 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVF 61 G++P W + + +L++GV KE A+ +P +R + + F F Sbjct: 20 GQVPGHWDVRKPRHIGSLLKGVGGTKEDALP----AGVPCVRYGELYTTHAYFVRRPKTF 75 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + + D++ A S + +GKSA + G +LRP + + F Sbjct: 76 IHADRAADYTPLHYGDVLFAASGETLEDIGKSAVNLIDGTAVCGGDVIILRPSVPVHAPF 135 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + N+ +++ G + ++ P + P+PP+ EQ I L+ +++ Sbjct: 136 LGYVMDCRPLANQKATMGRGTTVKHVYPDELKHLVFPLPPVPEQAAIVRFLNWANGRLER 195 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 ++ +L +QA++ AV L W P+H +L F ++ Sbjct: 196 AIRAKRKVIALLNEQKQAIVHRAVTRGLDPSVPLKPSGIPWLGDIPRHWRVWRLKFVAL- 254 Query: 233 TELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSES--ELNRHKLQDGDLL 289 + + L + P S G HP +R + + AG V + + + + R + Q+GD+L Sbjct: 255 -NIVDCLHATPRYSDAGTHPAIRTADIVAGVVLVDQAKKVSSRDYARWTTRLQPQEGDIL 313 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 ++R E G+ + L +++ R+ +++ +S S + Sbjct: 314 YSREG---ERFGIAACVPAA--TQLCISQRMMVFRIATQHCSKFVMWLLNSRSTYGQALQ 368 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 V + ++ I++ + LP +EQ +V R+ + ++ + + Sbjct: 369 DVMGATA-PHVNISTIRNYYLALPLKREQEAVVERIGAETHPIEVAIDRLKREIELLREY 427 Query: 410 TQSILAKAFRGELTAQ 425 ++A G++ + Sbjct: 428 RTRLIADVVTGKVDVR 443 >UniRef50_C1D7R6 Type I restriction-modification system, S subunit n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D7R6_LARHH Length = 453 Score = 223 bits (569), Expect = 1e-56, Method: Composition-based stats. 
Identities = 89/445 (20%), Positives = 168/445 (37%), Gaps = 50/445 (11%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVFVP 63 +P W + + + + + + + ++ + I+ +I +G+ + Sbjct: 20 IPSHWEVVRLKNIFEIRKRIAGELGHSVLSITQRG---IKVKDIESNDGQISMDYSKY-- 74 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVV------GKSAHQHLPFECSFGAFCGVLRPEKLI 117 Q + P D + V G ++ + F A C Sbjct: 75 -------QIVLPGDFAMNHMDLLTGYVDISSTHGVTSPDYRVFAMLDNAHCV-------- 119 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGA---NINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + H ++ + + GA F+ +P PP EQ IA LD Sbjct: 120 -PRYFLHLFQNGYRQKIFYAFGQGASEFGRWRFPTDQFNNFRLPCPPDDEQAAIATFLDR 178 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKK 225 A++D+ A E++ +L RQA + AV L +W P H V Sbjct: 179 ETAKIDALIAEQEKLIALLAEKRQATISHAVTRGLDPAVPMKDSGVEWLGQVPAHWVICS 238 Query: 226 LNFESILTELRNGLS----SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH 281 + + L + G S S+P E+G +L+ V G + + L + + Sbjct: 239 VRRK--LKRIEQGWSPECFSRPAEAGE-WGVLKAGCVNGGIFRPEENKALPDTLAPDENI 295 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 ++DGDLL +R +GS VG L +L+ DK+ R L + LP+++ I F + Sbjct: 296 LIKDGDLLMSRASGSPALVGSVAYLS-APPAHLMLSDKIFRLHLEQGTLPQFVAIAFGAR 354 Query: 342 SARNAMMNCVKTTSGQKG-ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 R+ + + G + +K + +PP EQ EIV +Q A D ++ Sbjct: 355 YLRHQIEQAISGAEGLANNLPQTSLKGFTIAIPPEVEQQEIVVFTQQETAKLDALKIAAE 414 Query: 401 NALARVNNLTQSILAKAFRGELTAQ 425 +A++ + +++A A G++ + Sbjct: 415 HAVSLLKERRAALIAAAVTGQIDVR 439 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. 
Identities = 50/215 (23%), Positives = 88/215 (40%), Gaps = 13/215 (6%) Query: 4 GKLPEGWVIA----PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 G++P WVI + + + E + +++A + G F + Sbjct: 228 GQVPAHWVICSVRRKLKRIEQ-----GWSPECFSRPAEAGEWGVLKAGCVNGGIFRPEEN 282 Query: 60 VFVPKNLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLI 117 +P L E+ I D++++ +SGS ++VG A+ P + L E+ Sbjct: 283 KALPDTLAPDENILIKDGDLLMSRASGSPALVGSVAYLSAPPAHLMLSDKIFRLHLEQGT 342 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 F+A + R++I +GA NN+ S I IPP EQ+ I Sbjct: 343 LPQFVAIAFGARYLRHQIEQAISGAEGLANNLPQTSLKGFTIAIPPEVEQQEIVVFTQQE 402 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 A++D+ K E +LK R A++ AV G++ Sbjct: 403 TAKLDALKIAAEHAVSLLKERRAALIAAAVTGQID 437 >UniRef50_Q3J7Q5 Restriction endonuclease S subunits-like n=2 Tax=Nitrosococcus oceani RepID=Q3J7Q5_NITOC Length = 487 Score = 223 bits (568), Expect = 1e-56, Method: Composition-based stats. Identities = 88/456 (19%), Positives = 174/456 (38%), Gaps = 28/456 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDL-V 60 G++P W + P + T G + + A + ++R+ + +G ++ TD V Sbjct: 42 IGEVPSFWEVKPFKWLLTHNEGGVWGDDPA----GEGDTIVLRSTDQTVDGNWNVTDPAV 97 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE---CSFGAFCGVLRPEKLI 117 S + D+V+ SSGS +GK+ ++ +G F LR + Sbjct: 98 RHLTVKENASAVLEAGDLVVTKSSGSALHIGKTTLVNVDMAKLGYCYGNFMQRLRLGQKY 157 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + L R +++ LS + N+ I +P+PP+ EQ IA LD Sbjct: 158 IPKLAWYVMNNDLVRLQLNLLSNSTTGLANLNATLIGEILLPVPPVEEQTQIARFLDHET 217 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 A++D+ +++ ++LK RQA++ AV L +W P H + K L Sbjct: 218 ARIDALIEEQQRLIELLKEKRQAIISHAVTKGLDPTVPMKDSGVEWLGEVPAHWITKPLK 277 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + L ++G +E P+ + ++ G + ++ RF+ S +DGD Sbjct: 278 HLAELNPKKSGYHGDRDELCSFVPMEK---LKTGVIQLDEERFIADVISGYTYF--EDGD 332 Query: 288 
LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L + E + + L + ++ R D ++ Sbjct: 333 VLQAKVTPCFENRNI-AIADGLTNGVGFGSSEINVLRPFPDVNASFLYYRLQEDGYMGIC 391 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + G K + G+ I V +P EQ +I ++ A D + ++ N + + Sbjct: 392 TASMIGAGGLKRVPGEVINGFTVAVPERHEQTQIAHFLDHETARVDKLVEEANVGIELLK 451 Query: 408 NLTQSILAKAFRGELTA---QWRAENPDLISGENSA 440 ++++ A G++ Q A P + Sbjct: 452 ERRSALISAAVTGKIDVRGWQPPASAPSPELENEAV 487 Score = 118 bits (296), Expect = 5e-25, Method: Composition-based stats. Identities = 46/234 (19%), Positives = 92/234 (39%), Gaps = 13/234 (5%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNG-LSSKPNESGVGHPILRISSVR-AGHVDQNDIRF 270 W P K F+ +LT G P G +LR + G+ + D Sbjct: 41 WIGEVPSFWEVK--PFKWLLTHNEGGVWGDDPAGEGDTI-VLRSTDQTVDGNWNVTDPAV 97 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ-NLLYPDKLIRARLTKDA 329 + E L+ GDL+ T+ +GS +G L+ + Y + + R RL + Sbjct: 98 RHLTVKENASAVLEAGDLVVTKSSGSALHIGKTTLVNVDMAKLGYCYGNFMQRLRLGQKY 157 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +P+ ++ R + +T+G ++ I ++ +PPV+EQ +I R ++ Sbjct: 158 IPKLAWYVMNNDLVRLQLNLLSNSTTGLANLNATLIGEILLPVPPVEEQTQIARFLDHET 217 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + ++ + + Q+I++ A + +P + ++ L Sbjct: 218 ARIDALIEEQQRLIELLKEKRQAIISHAVT-------KGLDPTVPMKDSGVEWL 264 >UniRef50_B3G223 Type I restriction modification DNA specificity protein n=1 Tax=Pseudomonas aeruginosa RepID=B3G223_PSEAE Length = 395 Score = 223 bits (568), Expect = 1e-56, Method: Composition-based stats. 
Identities = 91/422 (21%), Positives = 178/422 (42%), Gaps = 36/422 (8%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI----QNGKFDTTDLVFVPK 64 W I + + + T K +++ +P RA + + G+ D + +F+ + Sbjct: 2 SWPIVKLGEIFDI----TSSKRVHEIDWRNEGVPFYRAREVAVLAKEGRVD--NDLFIDE 55 Query: 65 NLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 ++ +E + ++ + G+ V A Q A LR + + + + Sbjct: 56 SMYEEFKAKYGVPKVGDLLVTAVGTLGKV--YAVQESDRFYFKDASVIWLRARQEVDTSY 113 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I H S+ + I + S+GA + + + IP+PPL EQK IA L + D+ Sbjct: 114 IQHAMNSTDVQRFIQN-SSGATVGTYTISRANETEIPLPPLPEQKRIAAIL----DKADA 168 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + + +Q Q+ F +AV +T F ++ G S+ Sbjct: 169 IRRKRQQAIQLADDFLRAVFLDMFGDPVTNSK--------GFPIGTIRDLVATADYGSSA 220 Query: 242 KPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 K +E+ +PILR+ ++ G +D ++++ E E +++ ++ GDLLF R N S E V Sbjct: 221 KASETYGEYPILRMGNITYQGRIDLEGLKYINLEEKERSKYLVEKGDLLFNRTN-SKELV 279 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + LIR R + YI + +S + + + K+ G I Sbjct: 280 GKTAVYDMDDPVAI--AGYLIRVRPNEMGNSHYISGYLNSAHGKATLRSICKSIVGMANI 337 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + +++++ ++LP ++ Q R+ ++L + + AL L S+ KAF G Sbjct: 338 NAQEMQNIPIMLPSIELQ----RKYQELVVVTKCKLQVFDTALKLTEQLFSSLSYKAFSG 393 Query: 421 EL 422 +L Sbjct: 394 QL 395 >UniRef50_B1XQR8 Type 1 restriction-modification system specificity subunit n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XQR8_SYNP2 Length = 398 Score = 223 bits (568), Expect = 1e-56, Method: Composition-based stats. 
Identities = 78/421 (18%), Positives = 169/421 (40%), Gaps = 33/421 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLVFVPKNLV 67 W + + + + RG + + ++ + D + I+ + + T P+ + Sbjct: 3 WEVKTLDDLCDIARGGSPRPIKSYLTNEPDGINWIKIGDASASSKYIYETQEKIKPEG-I 61 Query: 68 KESQKISPEDIVIA--MSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 K+S+ + P D +++ MS G ++ S C + + L ++ +F Sbjct: 62 KKSRFVEPGDFLLSNSMSFGRPYIMRTSG-------CIHDGWLVLKDKSGLFDQDYLYYF 114 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 S + L+AG+ + N+ + +P+PP+AEQK I E LD + ++ +A Sbjct: 115 LGSQAAYKQFDKLAAGSTVRNLNTTLVKKVLVPVPPIAEQKRIVEILDESFSGIERAEAI 174 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 Q + + L + K I + + Sbjct: 175 ARQNLTNARELFDSYLNKIFLDFVERK-----------NTQTLNCITDLIVDCEHKTAPT 223 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P +R ++ GH+ +++ + E + R K Q GDL+ R G Sbjct: 224 QETGFPSIRTPNIGKGHLILDNVYRVSEETYKQWTRRAKPQSGDLILAREAP----AGNV 279 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 G++ + + + + + R ++ P+Y+ F P + +++ + + + ++ K Sbjct: 280 GVIPEGER--VCLGQRTVLIRPKENINPQYLAFFLLHPKMQERLLSK-SSGATVQHVNMK 336 Query: 364 DIKSQVV-LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 DI++ + LPP++ Q ++ + + + +E+ + + L QSIL KAF G+L Sbjct: 337 DIRALKMGDLPPIEIQDRLIESLLDVQEKSKKLEEVYQRKIEALGKLKQSILQKAFSGQL 396 Query: 423 T 423 T Sbjct: 397 T 397 Score = 106 bits (264), Expect = 2e-21, Method: Composition-based stats. 
Identities = 38/202 (18%), Positives = 84/202 (41%), Gaps = 12/202 (5%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK---ES 70 ++ +T LI +K ++ P IR NI G ++ V + K Sbjct: 205 TLNCITDLIVDCEHKTAP----TQETGFPSIRTPNIGKGHLILDNVYRVSEETYKQWTRR 260 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 K D+++A ++ G G ++RP++ I ++A F Sbjct: 261 AKPQSGDLILAR----EAPAGNVGVIPEGERVCLGQRTVLIRPKENINPQYLAFFLLHPK 316 Query: 131 YRNKISSLSAGANINNIKPASFDLINI-PIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 + ++ S S+GA + ++ + + +PP+ Q + E L + + + +++ Sbjct: 317 MQERLLSKSSGATVQHVNMKDIRALKMGDLPPIEIQDRLIESLLDVQEKSKKLEEVYQRK 376 Query: 190 PQILKRFRQAVLGGAVNGKLTE 211 + L + +Q++L A +G+LT+ Sbjct: 377 IEALGKLKQSILQKAFSGQLTQ 398 >UniRef50_B9M293 Restriction endonuclease S subunit-like protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M293_GEOSF Length = 644 Score = 223 bits (567), Expect = 1e-56, Method: Composition-based stats. Identities = 94/439 (21%), Positives = 177/439 (40%), Gaps = 35/439 (7%) Query: 5 KLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFV 62 +LPE W +A V V L G K + D P IR +N+ +GK + + Sbjct: 2 RLPESWRVATVGNVLLDLQPGFAQKPGEE----DDGTTPQIRTHNVTPDGKITLEGIKHI 57 Query: 63 PKNLVKES-QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSG 120 + + + K+ D+V ++ S+ VGK+A + E F LRP +L+ Sbjct: 58 SASAKETARYKLMMGDVVFNNTN-SEEWVGKTAVFNQEGEYVFSNHMTRLRPHPELVTPE 116 Query: 121 FIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++A + + + + I+ + + +P L EQ I + L Q Sbjct: 117 YLAFYLHQLWAIGYSKTRAKRWVSQAGIESKAIASFKLSLPTLPEQHRIIDVLR----QA 172 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 +++ EQ+ ++ +A+ S + F T + G Sbjct: 173 QDLRSQKEQVLKLSAELAKALFEQHF---------GIAGASSAWPMEPFGKHTTYSKYGP 223 Query: 240 S-SKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 S G ILR + + G + + L +E ++ H L+ G L+ +R Sbjct: 224 RFPDQQYSDSGIHILRTTDMNNDGTIRWWEAPKLALTEGQIQEHALKPGTLVVSRSG--- 280 Query: 298 
EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G L Q + LI L PEY+ F++P + M+ + Q Sbjct: 281 -TIGPFALFDG-QEGRCVAGAYLIEFGLADSVQPEYVRALFATPYVQ-QMLKKAVRSVAQ 337 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 I+ +I+S + +PP++ Q +++Q+ A+ I K ++++ + ++++ +A Sbjct: 338 PNINAPNIQSIKIPVPPLEIQEAFAVQIKQVRAWTSEIVKSA----SKIDEVIRAVVGEA 393 Query: 418 FRGELTAQWRAENPDLISG 436 F GELTAQWR + I+ Sbjct: 394 FSGELTAQWRGMHASEITT 412 >UniRef50_B4B315 Restriction modification system DNA specificity domain n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B315_9CHRO Length = 397 Score = 223 bits (567), Expect = 2e-56, Method: Composition-based stats. Identities = 95/417 (22%), Positives = 174/417 (41%), Gaps = 24/417 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W + P+ + LI+ T+ + +A K + + V + Sbjct: 3 QNWDLVPLGEI--LIKSNTWIQIEANKKYKQITVKY------WGKGVVERNEVIGTEIAA 54 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + ++ +++ G L F I F+ +K Sbjct: 55 SQRLQVRSGQFIVSRIDARHGSFG-LIPDCLNGAIVTNDFPVFNLNINRILPHFLNWMSK 113 Query: 128 SSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 + + S G +K F + IP+P L EQ+ I K++ L+A+++ + Sbjct: 114 TPTFIELCKVASEGTTNRIRLKEDKFLSMKIPLPKLEEQQRIIAKIEELVAKIEEARGLK 173 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 E + + A + +++ + I+ + G S K ++ Sbjct: 174 EAGIRECEMLINAEIYNLFT----------ICKNTHWANKKLGDIVIDDCYGTSEKTHDY 223 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 VG PILR+ +++ G +D +++++L+ E ++ LQ GD+L R N S E VG C + Sbjct: 224 KVGIPILRMGNIQNGILDVSELKYLDIHEKNKDKLILQKGDILVNRTN-SAELVGKCAVF 282 Query: 307 KKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + +IR RL K A P I ++ +S R M N K +GQ I+ K + Sbjct: 283 NLKGEYG--FASYIIRLRLDKAQANPTLIAMYINSSLGRTYMFNERKQMTGQANINAKKL 340 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 K+ ++LPP+ EQ EIV ++ L D +++ +L +N L +IL KAF+GEL Sbjct: 341 
KALPIILPPLSEQQEIVTYLDNLQTQIDEMKRLRQESLKELNALLPAILDKAFKGEL 397 >UniRef50_D2LA90 Restriction modification system DNA specificity domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2LA90_9DELT Length = 543 Score = 222 bits (566), Expect = 2e-56, Method: Composition-based stats. Identities = 98/435 (22%), Positives = 183/435 (42%), Gaps = 34/435 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + V + G A D + + +G + + V +N K Sbjct: 133 WNTKGIGEVADIFDG-----PHATPKTVDTGPIFLGIGALNDGMINLRETRHVTENDFKT 187 Query: 70 -SQKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHF 125 ++++ P D+V + ++ +G++A +C G G++R + + F + Sbjct: 188 WTRRVRPQAGDVVFSY----ETRLGQAAIIPDNIDCCLGRRMGLVRFKTNEVIPKFFLYQ 243 Query: 126 TKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S YRN + S + GA ++ I F I IP + EQK I LD +D+ A Sbjct: 244 YISPSYRNFLDSKTIRGATVDRISIKEFPFFPIAIPSIEEQKRIVSILDDAFECIDTAIA 303 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT-ELRNGLSSK- 242 E+ + ++ L R F + +++ N E IL+ + RNG S Sbjct: 304 NTEKNIANARELFESYLD-----------RVFAEKGDGWEEKNLEDILSFQPRNGWSPPA 352 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 + S G P+L +SSV + +++ + + +++GDLL TR N + E VG Sbjct: 353 SHHSDRGTPVLTLSSVTGFQFKKEALKYTSAQVNPKAHYWVENGDLLMTRSN-TPELVGH 411 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTS-GQKGI 360 + + N +YPD +++ ++ K L E++ S RN + + + K + Sbjct: 412 VAVCDGVS-ANTIYPDLIMKMKVDKHIALTEFVYFQLRSSKLRNIIKDGATGANPTMKKV 470 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +++ + +P + Q IV + L + + K+ + + + L QS+L KAF G Sbjct: 471 KKSTVQNLPLAMPALPVQQAIVDNLRNLNETSRLLVKKCVSKVKALTRLKQSLLQKAFSG 530 Query: 421 ELTAQWRAENPDLIS 435 EL + NPD + Sbjct: 531 ELPMDF---NPDALE 542 >UniRef50_B5W475 Restriction modification system DNA specificity domain n=1 Tax=Arthrospira maxima CS-328 RepID=B5W475_SPIMA Length = 493 Score = 222 bits (566), Expect = 2e-56, Method: Composition-based stats. 
Identities = 111/518 (21%), Positives = 199/518 (38%), Gaps = 93/518 (17%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP+GW + ++ L G + + K +PLI G D Sbjct: 1 MS--ELPKGWAETKLGEISQLEMGQSPPGTATNSDAK--GIPLI------GGASDFVGEQ 50 Query: 61 FVPKNLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 P KI P D+++ + + +GK A + G +RP + + Sbjct: 51 IKPNRFTSAPTKICQPNDLILCVR----ATIGKLAVAESAY--CLGRGVAGIRP-RNVNQ 103 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + + + + G+ I + NI +PPL EQ+ I KLD L A+ Sbjct: 104 DWLRYRLIGDA--SALDAAGTGSTFRQIDKQTLVSWNINLPPLNEQRRIVAKLDRLFARS 161 Query: 180 DSTK-----------------------------------------ARFEQIPQILKRFRQ 198 + QI K+ Sbjct: 162 RCAREELGRVSRLVQRYKQAVLAAAFRGDLTADWRAENPDVEPASELLRQILIRRKQRYN 221 Query: 199 AVLGGAVNGKLTEKWRNFE--------------PQHSVFKKLNFESILTELRNGLSSKPN 244 + + ++F P+ +++ + +T+L +K Sbjct: 222 EKYNESKLKNKKKPRKDFVDQIPSIQSEVEISLPKTWAVTNIDYLAHVTKLAGFEYTKHF 281 Query: 245 ESGV--GHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLEFVG 301 ++ G PI+R +V+ G + +I+++ + L R +L ++L G Sbjct: 282 KTNDVAGIPIIRAQNVQMGKFIETNIKYISEDVSNYLERSQLHGREVLMVFIGAG---TG 338 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 L + + +L I + Y+ ++ S +N + + +K+T+ Q +S Sbjct: 339 NVCLAPQERRWHLAPNVAKIDV---DEISSNYLCLYLQSSIGQNYVDSWIKSTA-QPSLS 394 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + I+ +V L P++EQ EIVRRVE+LF D IE++ A ++ L ++ L+KAFRGE Sbjct: 395 METIRKIIVFLSPLEEQKEIVRRVEKLFKAIDLIEQEHQKASKLLDRLEKATLSKAFRGE 454 Query: 422 LTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 L Q + P AA LLE+I+AER +KA Sbjct: 455 LVPQDPNDEP--------AAVLLERIQAERQTQPKRKA 484 Score = 121 bits (303), Expect = 6e-26, Method: Composition-based stats. 
Identities = 50/243 (20%), Positives = 104/243 (42%), Gaps = 10/243 (4%) Query: 6 LPEGWVIAPVSTVT--TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 LP+ W + + + T + G Y K N + +P+IRA N+Q GKF T++ ++ Sbjct: 254 LPKTWAVTNIDYLAHVTKLAGFEYTKHFKTNDVA--GIPIIRAQNVQMGKFIETNIKYIS 311 Query: 64 KNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 +++ E ++ ++++ V + + A V I S + Sbjct: 312 EDVSNYLERSQLHGREVLMVFIGAGTGNVCLAPQERRWHLAPNVAKIDV----DEISSNY 367 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + +SS+ +N + S ++ + I + + PL EQK I +++ L +D Sbjct: 368 LCLYLQSSIGQNYVDSWIKSTAQPSLSMETIRKIIVFLSPLEEQKEIVRRVEKLFKAIDL 427 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + ++ ++L R +A L A G+L + N EP + +++ E R S+ Sbjct: 428 IEQEHQKASKLLDRLEKATLSKAFRGELVPQDPNDEPAAVLLERIQAERQTQPKRKAKST 487 Query: 242 KPN 244 + Sbjct: 488 RKP 490 >UniRef50_Q7UE18 Restriction modification system S chain homolog n=1 Tax=Rhodopirellula baltica RepID=Q7UE18_RHOBA Length = 389 Score = 222 bits (565), Expect = 3e-56, Method: Composition-based stats. 
Identities = 82/417 (19%), Positives = 161/417 (38%), Gaps = 39/417 (9%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 +S + G T + + Y D +P +++ ++ T L + S Sbjct: 5 EVALSEICDTGSGGTPSRAKQEIYY-DGSIPWVKSGELRESVITETGESITELGLKESSA 63 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 K+ P D ++ G + VG+ + + A C ++ + + ++ H +S + Sbjct: 64 KLLPADTLLVALYG--ATVGRVGMLGIE-AATNQAVCYLIPDDTRVERRYLYHALRSKV- 119 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 + G NI IP+PPL+EQK IAE LD A +A + Sbjct: 120 -PYWLTQRVGGGQPNISQGVIKNTKIPLPPLSEQKRIAEILDRAEALRAKRRAALALL-- 176 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL---SSKPNESGV 248 Q++L ++G +I ++ G+ + K Sbjct: 177 --DELTQSILARLLDGSAD------------LGTTTLGNISRDMHQGINTVTEKIEYQND 222 Query: 249 GHPILRISSVRAGHVDQNDIRFLEC--SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G PI++ G++D +D RF+ +++ DLL +G L+ Sbjct: 223 GFPIIQSKHTTQGYLDLSDARFVSKATYLKYKEKYRPARNDLLLCNIG----TIGKSLLM 278 Query: 307 KKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + Q + L L +L P + + +F ++++ + + K IS K + Sbjct: 279 E--QENDFLIAWNLFLIKLDLDQVSPSFCKHYFDRLASQHYFDRFLTGGT-VKFISKKTL 335 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + LP + Q E E+ A + ++++ +A+A ++ L S+ +AFRGEL Sbjct: 336 NATPIPLPSMDRQREF----EEQIASVEVLKEKHRSAVAELDQLFASLQHRAFRGEL 388 >UniRef50_B7R237 Type I restriction modification system, subunit S n=1 Tax=Thermococcus sp. AM4 RepID=B7R237_9EURY Length = 428 Score = 222 bits (565), Expect = 3e-56, Method: Composition-based stats. 
Identities = 85/421 (20%), Positives = 173/421 (41%), Gaps = 19/421 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFD--TTD 58 G++P W + V + + G T +Q +Y ++ + I ++ NG ++ Sbjct: 12 IGEIPRDWKVVRVREIFDVKTGTTPSTKQ-TDYWENGEMNWITPTDLSKLNGNIYMGDSE 70 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE--KL 116 K L + + P+ +I + + L E +F C L P+ Sbjct: 71 RKITKKALEDYNLSLLPKGSLILSTRAPVGYIA-----VLTEEATFNQGCKGLVPKDQNK 125 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I F A++ K R + SLS G+ + A + +P+PP EQK IAE L T+ Sbjct: 126 IIPEFYAYYFK--FKRQHLESLSGGSTFKELAKAMLERFLVPLPPRLEQKKIAEILRTVD 183 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 ++ T E+ ++ K +L + + +K ++ + E I ++ Sbjct: 184 EAIEKTDLAIEKTERLKKGLMLRLLTKGIKHERFKK-TEIGEIPEEWRVVRLEEITRRIK 242 Query: 237 NGLSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYN 294 G S K +++ G + + G+++ ++ ++L + + L+++ L++GDL+ N Sbjct: 243 RGPSKKTDDNETGVVYVTSDYIDDHGNLNFDNPKYLSLEKIDRLDKYLLEEGDLIINCVN 302 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 SLE +G + + + ++ + L P Y++ FF S + + + K Sbjct: 303 -SLEKIGKVAVFEGYSKKAIVGFNN-FALTLVSTVNPYYVKYFFLSYKGKALIKSISKAA 360 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 Q S KD+ + LPP+ EQ +I + + + + K+ L + +L Sbjct: 361 VQQVSFSSKDLLRLKIPLPPLPEQKQIAEILSTVDKKLELLRKRREKLELVKRGLMKGLL 420 Query: 415 A 415 Sbjct: 421 T 421 Score = 114 bits (284), Expect = 1e-23, Method: Composition-based stats. 
Identities = 41/207 (19%), Positives = 84/207 (40%), Gaps = 10/207 (4%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLV 60 G++PE W + + +T I+ KK + + + ++ I + G + + Sbjct: 221 EIGEIPEEWRVVRLEEITRRIKRGPSKKTDD----NETGVVYVTSDYIDDHGNLNFDNPK 276 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQH-LPFECSFGAFCGVLRPEKLI 117 ++ + + D++I + S +GK A + G L + Sbjct: 277 YLSLEKIDRLDKYLLEEGDLIINCVN-SLEKIGKVAVFEGYSKKAIVGFNNFALTLVSTV 335 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANIN-NIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 ++ +F S + I S+S A + + IP+PPL EQK IAE L T+ Sbjct: 336 NPYYVKYFFLSYKGKALIKSISKAAVQQVSFSSKDLLRLKIPLPPLPEQKQIAEILSTVD 395 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGG 203 +++ + R E++ + + + +L G Sbjct: 396 KKLELLRKRREKLELVKRGLMKGLLTG 422 Score = 67.8 bits (164), Expect = 8e-10, Method: Composition-based stats. Identities = 33/234 (14%), Positives = 79/234 (33%), Gaps = 23/234 (9%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVR--AGHVD 264 KL + P+ ++ + + + + G + + + G++ Sbjct: 6 KLKKTPIGEIPRDWKVVRVREIFDVKTGTTPSTKQTDYWENGEMNWITPTDLSKLNGNIY 65 Query: 265 --QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 ++ + + + + N L G L+ + VG +L + N K + Sbjct: 66 MGDSERKITKKALEDYNLSLLPKGSLILSTRAP----VGYIAVLTEEATFN--QGCKGLV 119 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 + +PE+ +F + + + S K ++ ++ +V LPP EQ +I Sbjct: 120 PKDQNKIIPEFYAYYFK---FKRQHLESLSGGSTFKELAKAMLERFLVPLPPRLEQKKIA 176 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR---------GELTAQWR 427 + + + + + L +L K + GE+ +WR Sbjct: 177 EILRTVDEAIEKTDLAIEKTERLKKGLMLRLLTKGIKHERFKKTEIGEIPEEWR 230 >UniRef50_A3SCN8 Restriction endonuclease S subunit-like protein n=1 Tax=Sulfitobacter sp. EE-36 RepID=A3SCN8_9RHOB Length = 497 Score = 221 bits (563), Expect = 5e-56, Method: Composition-based stats. 
Identities = 91/427 (21%), Positives = 178/427 (41%), Gaps = 27/427 (6%) Query: 52 GKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHL-PFECSFGAFCGV 110 + T + + ++ E+ ++ ++ ++ S L + +F Sbjct: 4 DRIGDTKDYVTDLGIENSTTRVVAENSLLIVT--RSGILRHSLPVALANKDVAFNQDIKA 61 Query: 111 LRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAE 170 L I ++ + K+ + AG + ++ + I P EQ+ I E Sbjct: 62 LTLFSGIDPEYVLYHLKADADDILDACAKAGTTVESLDFNRLKSYPLRIAPSLEQRRIVE 121 Query: 171 KLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF----------EPQH 220 KLD L + D +IP+++ +++ L A G+LT +R P Sbjct: 122 KLDILTGRTDRAHDELSRIPELVAKYKSCFLRLAFTGQLTSDFRGEHSRKGTGVENIPDS 181 Query: 221 SVFKKLNFESILTE-LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 K L S + ++ G + V P LR+++V+ G +D +I+ + + E Sbjct: 182 WAVKPLGEISEIQGGVQVGKKRSSSTDLVEVPYLRVANVQRGWLDLEEIKTIGVTPQEKE 241 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 R L+ GD+L G + +G G + Q + ++ + + R RL +LP ++ Sbjct: 242 RLLLRMGDILMNE-GGDRDKLGR-GWVWNNQIADCIHQNHVFRIRLKDSSLPPEFVSHYA 299 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 + + ++ T+ IS + + + V +PP E EIV R++ FA+ + I + Sbjct: 300 NEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVNRIDAAFAWLERISSEQ 359 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 A + L +IL+KAFRGEL Q + P A+ +L ++ E A+ +K+ Sbjct: 360 AAASKLLPELDAAILSKAFRGELARQNPDDEP--------ASRILARVSVEGQAAPTRKS 411 Query: 460 ---SRKK 463 +RK+ Sbjct: 412 PHNTRKR 418 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. 
Identities = 37/225 (16%), Positives = 95/225 (42%), Gaps = 5/225 (2%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P+ W + P+ ++ + GV K+++ + + +P +R N+Q G D ++ + Sbjct: 178 IPDSWAVKPLGEISEIQGGVQVGKKRSSSTDLVE-VPYLRVANVQRGWLDLEEIKTIGVT 236 Query: 66 LVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLR-PEKLIFSGFI 122 ++ + + DI++ G + +G+ + +C +R + + F+ Sbjct: 237 PQEKERLLLRMGDILMNE-GGDRDKLGRGWVWNNQIADCIHQNHVFRIRLKDSSLPPEFV 295 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +H+ + + + N+ +I + +P+PP E I ++D A ++ Sbjct: 296 SHYANEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVNRIDAAFAWLERI 355 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN 227 + ++L A+L A G+L + + EP + +++ Sbjct: 356 SSEQAAASKLLPELDAAILSKAFRGELARQNPDDEPASRILARVS 400 >UniRef50_D0BWI7 Predicted protein n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0BWI7_9GAMM Length = 396 Score = 221 bits (563), Expect = 5e-56, Method: Composition-based stats. Identities = 85/423 (20%), Positives = 175/423 (41%), Gaps = 38/423 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 KLP+GW + V + ++ + + LP++ + N+ + + +P+ Sbjct: 7 KLPDGWDWKTLGDVCFKVTDGSHNPPKEVEV----GLPMLSSRNVMDNGLVWDNFRLIPE 62 Query: 65 NLVKESQK---ISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRPEKLIFSG 120 + + K +S D+++ + +G+S ++L + VL E+LI Sbjct: 63 DAFESEHKRTRVSEGDVLLTI----VGTIGRSCVVRNLDRLFTLQRSVAVLSSEELI-PE 117 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+++ ++ + S + G+ I + PP+ EQ I EKLD L ++D Sbjct: 118 FLSYQFRAPFIQEHFISNAKGSAQKGIYLKQLKATYLVCPPIEEQNRIVEKLDALFTRID 177 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + + K+ +VL P + + S Sbjct: 178 IAIEHLQSKLDLSKQLFDSVLDEFFKL----------PDCDSVPLTQVVEFIGGSQPPKS 227 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + G+ +R+ +R D N I +++ + + + D++ RY Sbjct: 228 QFSDVQKEGY--VRLIQIRDYKSD-NHIVYVDSAST---KKFCTKDDVMIGRYGPP---- 277 Query: 301 
GVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 + + L+ + Y L++A +D L +Y+ F SPS +N ++ + +GQ G Sbjct: 278 ----VFQILRGLDGAYNVALMKAVPNEDLLMKDYLFWFLQSPSIQNYVIGISQRAAGQSG 333 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ K ++ ++ +P Q +IV +V QL + + +E +V +A ++ L SIL AF+ Sbjct: 334 VNKKALEKYLIPVPSKAIQNDIVDKVGQLVSKSRHLEAEVTAEIAFLSQLKASILDSAFK 393 Query: 420 GEL 422 GEL Sbjct: 394 GEL 396 >UniRef50_C6Q0B1 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6Q0B1_9CLOT Length = 407 Score = 221 bits (562), Expect = 6e-56, Method: Composition-based stats. Identities = 95/428 (22%), Positives = 178/428 (41%), Gaps = 34/428 (7%) Query: 5 KLPEGWVIAPVST-VTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDT--TDLV 60 KLP+ W + + TL G K D+ +P + +I N G F+ L Sbjct: 2 KLPKEWKEVNLKEYILTLESGKRPKGGAI-----DNGVPSLGGEHINNTGGFNIQIDKLK 56 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 +VP+ K+ S + DI+I + + + E ++R + + Sbjct: 57 YVPREFFKKMKSGVVKKNDILIVKDGATTGKIAFVDNNFNLKEACINEHLFLIRTNERLN 116 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F++++ +S+ R KI GA + I +F NI +PPL QK I + L+ Sbjct: 117 NKFLSYYLRSNTGRKKILEDFRGATVGGIS-KNFIDFNILLPPLETQKKIVKVLEKAEET 175 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 ++ K + +++K + G + K S++ + G Sbjct: 176 LEKRKESINLLDKLVKSRFIGMFGDP------------SSNPKGWNKDTIGSVVKSITAG 223 Query: 239 LSSK---PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 S+ + +L++S+V G+ ++ + + + GDLLF+R N Sbjct: 224 WSANGEAREKREDEKAVLKVSAVTQGYFKADEYKVIGDDVEIKKYVFPEKGDLLFSRAN- 282 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + E VG ++ K + +LL PDKL + + Y++ S PS R TS Sbjct: 283 TREMVGATCIIHK-DYPDLLLPDKLWKVSFVERVNVFYMKYILSEPSIRAEFSAKSTGTS 341 Query: 356 G-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 G +S KS + +PP++ Q + V Q+ D ++ ++ +L + + +S++ Sbjct: 342 
GSMYNVSMDKFKSIEITIPPIELQNQFADFVNQV----DKLKFEMEKSLKELEDNFKSLM 397 Query: 415 AKAFRGEL 422 KAF+GEL Sbjct: 398 QKAFKGEL 405 >UniRef50_C1ZA47 Restriction endonuclease S subunit n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA47_PLALI Length = 413 Score = 220 bits (560), Expect = 9e-56, Method: Composition-based stats. Identities = 92/423 (21%), Positives = 164/423 (38%), Gaps = 22/423 (5%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVPKNLV 67 GW+ + V G+ +K E+ + +IR N + G D +D+ ++ Sbjct: 4 GWIYKTLDDVCEFNNGL-WKGEKPPFV----TVGVIRNTNFTKEGTLDDSDIAYIEVEAK 58 Query: 68 K-ESQKISPEDIVIAMSSGSKS-VVGKSAHQHL-PFECSFGAFCGVLRPE--KLIFSGFI 122 K E +++ D+++ S G VG+ A + SF F +R + K + F+ Sbjct: 59 KFEKRRLVFGDLILEKSGGGPKQPVGRVALFDKRAGDFSFSNFTAAIRVKDPKTLDFRFL 118 Query: 123 AHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 F + ++ + + I N+ + I +P+PPL EQ+ I LD + + Sbjct: 119 HKFLFWTHLSGVTETMQSHSTGIRNLNGDVYKCIEVPLPPLTEQRRIVGILDEAFEGLAT 178 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 KA E+ Q + ++ L AV + + W + L S Sbjct: 179 AKANAEKNLQNARALFESHLQ-AVFTQRGDGWVEKTVKDVASPIKGSIRTGPFGSQLLHS 237 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFV 300 G +L I + A RF+ + +L R+++ GD+L T Sbjct: 238 --EFVDEGIAVLGIDNAVANEFRWGKSRFITKDKFGQLERYRVYPGDVLITIMGTC---- 291 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 G C ++ + + L K LP Y+ ++F A + + G Sbjct: 292 GRCAVVPD-DIPTAINTKHICCITLDWKKCLPSYLHLYFLHAQQSQAFLAKHAKGAIMAG 350 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ I+ VLLPP + Q+ IV L +E LA ++ L +S+L +AF Sbjct: 351 LNMGLIQELPVLLPPTQVQSAIVEAANDLREETQRLESLYQRKLAALDELKKSLLHRAFS 410 Query: 420 GEL 422 GEL Sbjct: 411 GEL 413 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. 
Identities = 34/211 (16%), Positives = 82/211 (38%), Gaps = 13/211 (6%) Query: 8 EGWVIAPVSTVTTLIRG----VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +GWV V V + I+G + + + D+ + ++ +N +F F+ Sbjct: 207 DGWVEKTVKDVASPIKGSIRTGPFGSQLLHSEFVDEGIAVLGIDNAVANEFRWGKSRFIT 266 Query: 64 KNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGA--FCGVLRPEKLIFS 119 K+ E ++ P D++I + G+ A + C + K Sbjct: 267 KDKFGQLERYRVYPGDVLITIM----GTCGRCAVVPDDIPTAINTKHICCITLDWKKCLP 322 Query: 120 GFIA-HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ +F + + ++ + GA + + + + +PP Q I E + L + Sbjct: 323 SYLHLYFLHAQQSQAFLAKHAKGAIMAGLNMGLIQELPVLLPPTQVQSAIVEAANDLREE 382 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++ +++ L ++++L A +G+L Sbjct: 383 TQRLESLYQRKLAALDELKKSLLHRAFSGEL 413 >UniRef50_C9Q5S0 Possible type I restriction-modification system S subunit n=1 Tax=Vibrio sp. RC341 RepID=C9Q5S0_9VIBR Length = 469 Score = 220 bits (560), Expect = 9e-56, Method: Composition-based stats. Identities = 98/456 (21%), Positives = 177/456 (38%), Gaps = 28/456 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG---KFDTTD--LV 60 +P W+ + + + +G+T KE L+D +P + + + + D L Sbjct: 21 IPAHWLTSKLRYTFSFGKGLTITKEN----LRDTGIPCVSYGEVHSKYGFEIDPARHPLK 76 Query: 61 FVPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 V + +K S + DIV A +S G + G + RP Sbjct: 77 CVGDDYLKTSPYALLKKGDIVFADTSEDIDGSGNFTQLVSNEQVFAGYHTIIARPYNHEC 136 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F A+ S R +I G + +I A +NI +PPL E+ IA LD A+ Sbjct: 137 SRFYAYLLDSKELRTQIRHAVKGVKVFSITQAILRGVNIWLPPLKERNQIANFLDHETAK 196 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFE 229 +D+ + +Q+ ++LK RQAV+ AV L + W P+H L Sbjct: 197 IDTLIEKQQQLIKLLKEKRQAVVSHAVTKGLNPQAPMKDSGVEWLGEVPEHWSISPLKHH 256 Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 T G SS N G P +R +++ + + DI + + R L DG+L+ Sbjct: 257 VN-TVNGFGFSSN-NFQDEGVPFIRAGNIKNKTIVKPDIHLPQAVVDKYQRVILNDGELV 314 Query: 290 
FTRYNGSLEF----VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 + + VG GL+ ++ + +I R L +++ R+ Sbjct: 315 ISMVGSDPKIKASAVGQVGLVPPSLAGSVPNQNVVI-LREQSSLLKKFLFYVVCGTPYRH 373 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + + Q IS I P + EQ EIV ++ D + ++ ++ Sbjct: 374 HLDVFSHKLANQSIISSSLIICAQFTFPELDEQKEIVDFLDTQLRKYDWLMEKATRSIEF 433 Query: 406 VNNLTQSILAKAFRGELTAQ-WRAENPDLISGENSA 440 +N ++++ G++ + W+A + E +A Sbjct: 434 MNERKTALISATVTGKIDVRNWQAPTLQNQAVEETA 469 Score = 114 bits (284), Expect = 1e-23, Method: Composition-based stats. Identities = 50/215 (23%), Positives = 91/215 (42%), Gaps = 13/215 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W I+P+ + G + +D+ +P IRA NI+N D+ +P Sbjct: 242 GEVPEHWSISPLKHHVNTVNGFGFSSNN----FQDEGVPFIRAGNIKNKTIVKPDIH-LP 296 Query: 64 KNLVKESQKISPED--IVIAMSSGSKSV----VGKSAHQHLPFECSF-GAFCGVLRPEKL 116 + +V + Q++ D +VI+M + VG+ S +LR + Sbjct: 297 QAVVDKYQRVILNDGELVISMVGSDPKIKASAVGQVGLVPPSLAGSVPNQNVVILREQSS 356 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + F+ + + YR+ + S AN + I + P L EQK I + LDT Sbjct: 357 LLKKFLFYVVCGTPYRHHLDVFSHKLANQSIISSSLIICAQFTFPELDEQKEIVDFLDTQ 416 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 L + D + + + + + A++ V GK+ Sbjct: 417 LRKYDWLMEKATRSIEFMNERKTALISATVTGKID 451 >UniRef50_A1ZUE4 Type I restriction-modification system specificity subunit n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUE4_9SPHI Length = 424 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. 
Identities = 76/424 (17%), Positives = 161/424 (37%), Gaps = 19/424 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + + + + G T + + Y + ++P ++ ++ N + T+ Sbjct: 12 GEIPEDWEVVKLGDIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITS 71 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFI 122 L + S + P++ V+ G + +G++ L E + L I+ FI Sbjct: 72 LALKETSCNLLPKNTVLVAMYGGFNQIGRTGL--LKIEATTNQAISALNIKSDNIYPEFI 129 Query: 123 AHFTKSSL-YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + K ++ S NI + I IPPLAEQ+ IA+ L T+ ++ + Sbjct: 130 LAWLNAKVEVWKKFAASSR--KDPNITKKDVEHFPIVIPPLAEQQEIADILSTVDEKIAT 187 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 R Q+ K Q + + P+ KL + ++ L Sbjct: 188 IDERLAHTQQLKKGLMQRLFTRGLGHTSFKASPLGEIPESWEVVKLGDIAKVSAGGTPLR 247 Query: 241 SKPNESGVG--HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 SK E P ++ + ++ + + + E + + L +L Y G Sbjct: 248 SKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITSLALKETSCNLLPKNTVLVAMYGG-FN 306 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 +G GLLK N I+ + + PE+I + ++ ++ Sbjct: 307 QIGRTGLLKIEATTNQAISALNIK---SDNIYPEFILAWLNAKV--EVWKKFAASSRKDP 361 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ KD++ +++PP+ EQ EI + + + + L + ++ + Sbjct: 362 NITKKDVEHFPIVIPPLAEQQEIADILGGVDEKLELL----AEKKEAYQGLKKGLMQQLL 417 Query: 419 RGEL 422 G++ Sbjct: 418 TGKV 421 Score = 142 bits (357), Expect = 4e-32, Method: Composition-based stats. 
Identities = 43/205 (20%), Positives = 86/205 (41%), Gaps = 6/205 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + + + + G T + + Y + ++P ++ ++ N + T+ Sbjct: 222 GEIPESWEVVKLGDIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITS 281 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFI 122 L + S + P++ V+ G + +G++ L E + L I+ FI Sbjct: 282 LALKETSCNLLPKNTVLVAMYGGFNQIGRTGL--LKIEATTNQAISALNIKSDNIYPEFI 339 Query: 123 AHFTKSSL-YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + K ++ S NI + I IPPLAEQ+ IA+ L + +++ Sbjct: 340 LAWLNAKVEVWKKFAASSR--KDPNITKKDVEHFPIVIPPLAEQQEIADILGGVDEKLEL 397 Query: 182 TKARFEQIPQILKRFRQAVLGGAVN 206 + E + K Q +L G V Sbjct: 398 LAEKKEAYQGLKKGLMQQLLTGKVR 422 >UniRef50_A5G3B9 Restriction modification system DNA specificity domain n=2 Tax=Proteobacteria RepID=A5G3B9_GEOUR Length = 393 Score = 219 bits (559), Expect = 1e-55, Method: Composition-based stats. Identities = 70/411 (17%), Positives = 147/411 (35%), Gaps = 24/411 (5%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 P+ + T+ G T + + +P ++++ T P+ L + Sbjct: 6 VPLGGLVTISGGGTPSRNN--DAYWGGSIPWATVKDLKDTMLSGTQETITPEGLRDSASN 63 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 + P VI ++ +GK A + + + ++ +F ++ Sbjct: 64 LIPAGSVIV---ATRMGLGKVAINTMDV--TINQDLKAFSCGADLEPRYLLYFLLANA-- 116 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 + + S+ GA + I +++P+PPL EQK IA L + DS + + ++ ++ Sbjct: 117 SHLDSMGKGATVKGITLDVLKDLSVPLPPLPEQKRIAAIL----DKADSIRRKRQEAVRL 172 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI 252 + ++V + W ++ L S G + Sbjct: 173 TEELLRSVFLDMFGDPESNNWPMMTIAGVALPGVSAIRTGPFGSQLLHS--EFVDEGVAV 230 Query: 253 LRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 L I + A N+ R++ E EL+R+ ++ GD++ T G C ++ Sbjct: 231 LGIDNAVANEFRWNERRYISEAKYRELSRYTVRPGDVIITIMGTC----GRCAVVPDDIP 286 Query: 312 
QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 + LP ++ +F + + G++ IK + Sbjct: 287 VAINTKHLCCITLDQTKCLPVFVHAYFLQHCIARRYLEKTAKGAIMDGLNMGIIKDMPIP 346 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +PP+K Q + + A + + + LA + L S+L +AF G L Sbjct: 347 IPPLKLQEKFACSI----AAIEKLRHTTRSTLAEQDTLFHSLLQRAFNGAL 393 >UniRef50_Q0EXK2 HsdS protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EXK2_9PROT Length = 462 Score = 219 bits (558), Expect = 2e-55, Method: Composition-based stats. Identities = 84/456 (18%), Positives = 180/456 (39%), Gaps = 28/456 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P WV+ ++ L T KK + + ++P+ + ++ ++ + Sbjct: 20 GEIPAHWVLTRTKYISEL----TPKKPKISRDKECSFIPMEK---LKTDSIVLDEVRTID 72 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSGFI 122 ++ + D+++A + + Q L FG+ V+R + + + F+ Sbjct: 73 -DVYDGYTYFADSDVLMAKVTPCFENKNIAIAQDLVNGVGFGSSEIYVIRANQRVSNRFL 131 Query: 123 AHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + + GA + + + +P EQ IA LD A++D+ Sbjct: 132 FYRLQEDSFMEIAIAAMTGAGGLKRVPSDVLNNYIAAVPQHDEQMEIANFLDRETAKIDT 191 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + +Q+ ++LK RQAV+ AV L +W P H L FE + Sbjct: 192 LIEKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMRNSGIEWLGEVPAHWEISSLGFECSV 251 Query: 233 TELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLE-CSESELNRHKLQDGDLLF 290 K E G+ L +++ +D ++ ++ E L +GD+L Sbjct: 252 KARLGWKGLKAEEYVDEGYIFLATPNIKGEKIDFENVNYITKARYDESPEIMLNEGDVLV 311 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 T+ + G ++++L + + R Y+ FF S +N ++ Sbjct: 312 TKDGST---TGTTNIVRELPSPATV-NSSIAVLRSVGRIDSSYLYYFFVSTYVQN-VIKR 366 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 ++ G + D++ VL+PP KEQ EI ++ D + + ++ + Sbjct: 367 IQGGMGVPHLFQADLRKFNVLMPPFKEQKEIAAEIDMRLPKFDDLIAKAEYSILLMKERR 426 Query: 411 QSILAKAFRGELTAQWRAENP--DLISGENSAAALL 444 ++++ A G++ + +P DL S +++ A L Sbjct: 427 
TALISAAVTGKIDVRHHVSHPTGDLQSSKSAIHADL 462 >UniRef50_A5UR98 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A5UR98_ROSS1 Length = 392 Score = 219 bits (558), Expect = 2e-55, Method: Composition-based stats. Identities = 97/432 (22%), Positives = 177/432 (40%), Gaps = 51/432 (11%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M +LP+GW + T+ T+ G ++Q K +P+ AN + G DT+ Sbjct: 2 MERWELPKGWGWKRLKTLVTVNYGKGLSEKQR----KAGNVPVYGANGVV-GFHDTS--- 53 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 I+ ++ GS V S P + +F + ++++ Sbjct: 54 ------------ITKGQTIVIGRKGSAGAVNWSEIACWPIDTTF----FIDEFPEILYPQ 97 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-------PLAEQKIIAEKLD 173 F+ F +S +I L A I + + +PIP LAEQ+ I +L+ Sbjct: 98 FLYQFLRSQ----QIDRLQQSAAIPGLNRDVLYSVEVPIPYPDDPAHSLAEQRRIVARLE 153 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 LL + + + + + + L + ++ L E + +K ++ L Sbjct: 154 LLLGETRAMREDIQAMRRDLAQVMESALAEVFPNPNGEMPKG-----WGWKSID---DLF 205 Query: 234 ELRNGLSSKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 EL+ G S P P LR ++ G VD +D+ ++ +E E+ R KL+ GDLL Sbjct: 206 ELQQGASMSPRRRQGRNPQPFLRTKNILWGEVDTSDVDVMDFTEDEIERLKLRKGDLLIC 265 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNC 350 VG + + Q ++Y + + R R DA P++ + + + Sbjct: 266 EGG----DVGRAAVWED-QLPLVMYQNHIHRLRRKSDDADPKFYVYWMKAAYQLFKIYQG 320 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 ++ + +SG+ +K+ +V + EQ IV +E + ++ + L + L Sbjct: 321 EESRTAIPNLSGRRLKNFLVPTTSLTEQRRIVAYLEHIAEEIRAMDDLLAQDLRDIEVLE 380 Query: 411 QSILAKAFRGEL 422 QSILA AFRGE+ Sbjct: 381 QSILAAAFRGEV 392 >UniRef50_B4RYU8 Type I site-specific deoxyribonuclease n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYU8_ALTMD Length = 360 Score = 219 bits (557), Expect = 2e-55, Method: Composition-based stats. 
Identities = 73/388 (18%), Positives = 165/388 (42%), Gaps = 37/388 (9%) Query: 42 PLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE 101 I+ ++++N + + + + + P D++IA + +G + Sbjct: 5 RYIQIDDLRNDNL----IKYTDDD---KGTFVEPSDVIIAWDGANAGTIGYGLEGLI--- 54 Query: 102 CSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP 161 S A V+ P I + ++ F +S +I + GA I ++ + + +P+PP Sbjct: 55 GSTLARLKVIIPH--IDTNYLGRFLQSKF--KEIRNNCTGATIPHVSKVHLNSLLVPVPP 110 Query: 162 LAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHS 221 L QK IA L + D+ + + +Q+ Q L Q+V + Sbjct: 111 LPIQKQIAAVL----EKADNLRQQSQQMEQELNSLAQSVFLDMFGDYRKD---------- 156 Query: 222 VFKKLNFESILTELRNGLSSKPNESG---VGHPILRISSVRAGHVDQNDIRFLECSESEL 278 + + ++R+G++ G P +R+++V+ G++D ++I+ + + Sbjct: 157 AMSLKSSLGEVADVRSGVTKGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKDF 216 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 +++L+ GD+L T G + +G G + Q N ++ + + R RL + E+ + Sbjct: 217 EKYQLKAGDVLMTE-GGDFDKLGR-GAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYL 274 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 +P + + C K T+ I+ +K + + +Q +R +++L A +++ Sbjct: 275 QTPFVKQYFLKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIIDELKA----LKEA 330 Query: 399 VNNALARVNNLTQSILAKAFRGELTAQW 426 + N S++ +AF+GEL + Sbjct: 331 NFEQQEQANAHFNSLMQRAFKGELDLKD 358 Score = 100 bits (248), Expect = 2e-19, Method: Composition-based stats. 
Identities = 36/196 (18%), Positives = 72/196 (36%), Gaps = 6/196 (3%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-FVPKNLVKESQK 72 + V + GVT K Q + K +P +R N+Q+G D +++ K E + Sbjct: 163 SLGEVADVRSGVT--KGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKDFEKYQ 220 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 + D+++ G +G+ A C +R S F A++ ++ Sbjct: 221 LKAGDVLMTE-GGDFDKLGRGAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQTPFV 279 Query: 132 RNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 + + N+ +I + IP + +Q+ +D L A ++ + EQ Sbjct: 280 KQYFLKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIIDELKALKEANFEQQEQAN 339 Query: 191 QILKRFRQAVLGGAVN 206 Q G ++ Sbjct: 340 AHFNSLMQRAFKGELD 355 >UniRef50_Q112D6 Restriction modification system DNA specificity domain n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q112D6_TRIEI Length = 402 Score = 219 bits (557), Expect = 2e-55, Method: Composition-based stats. Identities = 76/417 (18%), Positives = 164/417 (39%), Gaps = 21/417 (5%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W V V ++ T ++ K+ +P +R NNIQ+GK + D++F+ + Sbjct: 3 WQRVFVEDVAKIVTKGTTPTSIGFSFSKE-GIPFLRVNNIQDGKINLGDVLFIDSKTDQA 61 Query: 70 --SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPEKLIFSGFIAHFT 126 +I +D++I++ +GK+A + ++R + + H+ Sbjct: 62 LARSRILKKDVIISI----AGTIGKTAVIPTNAPAMNCNQALAIIRLHNNVDPYYFNHWL 117 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 + +I+ A I+N+ + IP+PP+ EQ+ IA L Q D+ + + Sbjct: 118 NTGDAFRQITGSKVTATISNLSLGCIKKLKIPLPPIEEQRRIAAIL----DQADAIRRKR 173 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 +Q + ++ + + +I Sbjct: 174 QQAIALTDELLRSTFLEMFGDPVINPKGWEVKKLEEVALKRKGAIKCGPFGSQLLISEFV 233 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 G P+ I +V+ +++ + E L +QD D+L +R VG + Sbjct: 234 KDGIPVYGIDNVQKNEFVWAKPKYITTEKYEQLKSFSIQDEDVLISRTG----TVGRTCV 289 Query: 306 LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 +++L P+ L + T 
LP+Y+ + + + + + + ++ Sbjct: 290 APPDIPRSILGPNLLKVSLNTNKMLPKYLSYALNHSNPLIEEIKRMSPGATVAVFNTTNL 349 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 K+ + +P + Q++ V E + + +++ +N L NNL S+L +AF+G+L Sbjct: 350 KALRLTIPHINLQSQFVNFTENV----ELTKQKESNYLTESNNLFNSLLQRAFKGQL 402 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. Identities = 44/228 (19%), Positives = 93/228 (40%), Gaps = 20/228 (8%) Query: 223 FKKLNFESILTELRNGLSSKP---NESGVGHPILRISSVRAGHVDQNDIRFLECSESE-L 278 ++++ E + + G + + S G P LR+++++ G ++ D+ F++ + L Sbjct: 3 WQRVFVEDVAKIVTKGTTPTSIGFSFSKEGIPFLRVNNIQDGKINLGDVLFIDSKTDQAL 62 Query: 279 NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF 338 R ++ D++ + +G ++ + L RL + P Y + Sbjct: 63 ARSRILKKDVIISIAG----TIGKTAVIPTNAP-AMNCNQALAIIRLHNNVDPYYFNHWL 117 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 ++ A + K T+ +S IK + LPP++EQ I ++Q AD I ++ Sbjct: 118 NTGDAFRQITG-SKVTATISNLSLGCIKKLKIPLPPIEEQRRIAAILDQ----ADAIRRK 172 Query: 399 VNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEK 446 A+A + L +S + F G NP + L++ Sbjct: 173 RQQAIALTDELLRSTFLEMF-G-----DPVINPKGWEVKKLEEVALKR 214 Score = 98.6 bits (244), Expect = 4e-19, Method: Composition-based stats. 
Identities = 36/212 (16%), Positives = 76/212 (35%), Gaps = 17/212 (8%) Query: 7 PEGWVIAPVSTVTTLIRG----VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 P+GW + + V +G + + I+ D +P+ +N+Q +F ++ Sbjct: 199 PKGWEVKKLEEVALKRKGAIKCGPFGSQLLISEFVKDGIPVYGIDNVQKNEFVWAKPKYI 258 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGK--SAHQHLPFECSFGAFCGVLRPEKLIF 118 + +S I ED++I+ + VG+ A +P V + Sbjct: 259 TTEKYEQLKSFSIQDEDVLISRT----GTVGRTCVAPPDIPRSILGPNLLKVSLNTNKML 314 Query: 119 SGFIAHFTK-SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++++ S+ +I +S GA + + + + IP + Q Sbjct: 315 PKYLSYALNHSNPLIEEIKRMSPGATVAVFNTTNLKALRLTIPHINLQSQFVNF----TE 370 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 V+ TK + ++L A G+L Sbjct: 371 NVELTKQKESNYLTESNNLFNSLLQRAFKGQL 402 >UniRef50_D2EQS4 Putative type I restriction-modification system, S subunit n=1 Tax=Streptococcus sp. M143 RepID=D2EQS4_9STRE Length = 384 Score = 218 bits (556), Expect = 3e-55, Method: Composition-based stats. Identities = 81/410 (19%), Positives = 162/410 (39%), Gaps = 34/410 (8%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK--ES 70 + V ++ G +K +N + + +IR N+Q G + +D + P E Sbjct: 4 VKLGEVCEILNGFAFKSLLYVN----EGIRIIRITNVQKGYIEDSDPKYYPIEYTNSIEK 59 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSGFIAHFTKSS 129 + D++++++ G+ VG + LP + LR + LI ++ F S Sbjct: 60 YILKENDLLMSLT-GNVGRVGLISKTMLP--AALNQRVACLRTIDSLISKEYVFQFLNSD 116 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 L+ S G N+ + I P + +Q++I L+ + + K + +++ Sbjct: 117 LFEQSAIRSSNGVAQKNLSTDWLKKVEITYPSVEQQELITSTLNLIERLICCRKEQNKKL 176 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG 249 +++K + G V ++ +WR + + +KL + S + + G Sbjct: 177 NELVKSRFNEMFGDPVFNEM--RWRRCKLKDISIEKLAYGSGASAIDF----------SG 224 Query: 250 HPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 +RI+ + G++ + + E ++ L GD+LF R + VG L K Sbjct: 225 LRYIRITDIDECGNLKLD--KKSPSHYDE--KYLLNTGDILFARSGAT---VGKTFLYSK 277 Query: 309 
LQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQ 368 ++ LY LIR + F++ N + V+ T Q I+ K Sbjct: 278 EKYGPALYAGYLIRLIPNLSLVNPVFVYHFTNTKFYNDFIAKVQNTVAQPNINAKQYSEL 337 Query: 369 VVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +LPP+ Q E V A D + + +L + L +S++ + F Sbjct: 338 DFILPPLSLQNEFADFV----AQVDKSQLAIQKSLEELETLKKSLMQEYF 383 Score = 95.9 bits (237), Expect = 3e-18, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 14/197 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVFVPKNLVK 68 W + ++ I + Y + L IR +I G + Sbjct: 198 WRRCKLKDIS--IEKLAYGSGASAIDF--SGLRYIRITDIDECGNLKLDKK---SPSHYD 250 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPE-KLIFSGFIAHF 125 E ++ DI+ A S + VGK+ + + L P L+ F+ HF Sbjct: 251 EKYLLNTGDILFARSGAT---VGKTFLYSKEKYGPALYAGYLIRLIPNLSLVNPVFVYHF 307 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 T + Y + I+ + NI + ++ +PPL+ Q A+ + + + + Sbjct: 308 TNTKFYNDFIAKVQNTVAQPNINAKQYSELDFILPPLSLQNEFADFVAQVDKSQLAIQKS 367 Query: 186 FEQIPQILKRFRQAVLG 202 E++ + K Q G Sbjct: 368 LEELETLKKSLMQEYFG 384 >UniRef50_A4CWB5 Type I restriction-modification system, S subunit n=1 Tax=Synechococcus sp. WH 7805 RepID=A4CWB5_SYNPV Length = 405 Score = 218 bits (555), Expect = 4e-55, Method: Composition-based stats. 
Identities = 74/417 (17%), Positives = 161/417 (38%), Gaps = 22/417 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 E W V L G T +Y + +P + + + + + T + L Sbjct: 3 ESWSKLRVGDFCNLSAGGTPDTNN-PDYWEGGDIPWMSSGEVHDQRIRRTRSHITERGLQ 61 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 S K P V+ +G GK A + + + ++ + + F+ + Sbjct: 62 DSSAKFFPIGSVLVALAGQGKTRGKVAISEIEL-TTNQSIAAIIADKGVCEPDFLFYNLD 120 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 S ++ +LS G+ + + + I +PPL EQK IAE L + Q+ + + + Sbjct: 121 SRY--EELRTLSGGSGRAGLNLSILSDVEISLPPLPEQKKIAEILSGVDKQIYALENKIS 178 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG 247 ++ + + + + S K + ES+ + + + P + Sbjct: 179 KLI----STKTEIFRDLFSCFDELGGNGVCKKESDTKIMPLESVCEAVIDCKNRTPPYTE 234 Query: 248 VGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGL 305 GHP++R +VR G + +ND+++ + S E+ R + D+LFTR +G L Sbjct: 235 SGHPVVRTPNVRNGKLVRNDLKYTDISSYEIWTARSVPRPMDVLFTREAP----LGEVCL 290 Query: 306 LKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + ++ +++ R K + P Y+ SP ++ ++ K + D Sbjct: 291 VP--ENFKCCLGQRMMLFRADKSLIDPRYLLFSLMSPFVQDQLLK-SKGGTTVGHARVAD 347 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 ++ ++ + P ++Q I +F+ +T + V ++ ++ + G Sbjct: 348 VRDLLIPIVPKEKQLRIAS----VFSSIETFLEGVTRKKEKLEIQKSALASDLLSGR 400 >UniRef50_C1PCQ5 Restriction modification system DNA specificity domain protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1PCQ5_BACCO Length = 483 Score = 217 bits (553), Expect = 6e-55, Method: Composition-based stats. 
Identities = 99/485 (20%), Positives = 181/485 (37%), Gaps = 68/485 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV + T+ K+ KD+ L + G F+ Sbjct: 25 EVPGNWVWVKLKTI-----NKDKKRNIDPKSFKDETFELYSVPSFPEG-----SPEFIKG 74 Query: 65 NLVKESQKISPED-IVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + S+++ +D I++ + + V K + H F V+ K I+S ++ Sbjct: 75 DEIGSSKQLVNKDEILLCKINPRINRVWKVLNNHGKFRQLASTEWIVISENKAIYSEYLL 134 Query: 124 HFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + KS +R I+S +G + +P + I +PP+ EQK IA+K++ LL+++D Sbjct: 135 YLLKSPYFRKLITSNVSGVGGSLTRARPKEVETYPIAVPPIKEQKRIADKVERLLSKIDE 194 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ---------------------- 219 K E+ + + R A+L A G+LT KWR Sbjct: 195 AKRLIEEAKETFELRRAAILDKAFRGELTRKWREENKNIEDAESLYVKIKESQSIRRKVS 254 Query: 220 ------------HSVFKKLNFESILTELRNGLSSK--PNESGVGHPILRISSVRAGHVDQ 265 S +K + + T G + P P ++ ++ +++ Sbjct: 255 KEINIKDLRYSIPSTWKWVRLGDVFTITSGGTPKRTIPEYYEGNIPWIKTGEIKWNAINE 314 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 ++ + + + + L +L Y L G +L N + Sbjct: 315 SEEQITPEAVANSSAKLLPPNTVLVAMYGQGLTR-GRAAILSVEATCN----QAVCALLP 369 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 PE+I +F R V Q+ +S I + LPP++EQ I+ + Sbjct: 370 NDYIAPEFIFYYFMEGYQR---FRQVAKGGNQENLSVSLISDFIFPLPPLEEQRVIITTL 426 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLE 445 + +F I+ + + + QSIL+KAFRGEL E SA LL+ Sbjct: 427 QNIFKKESKIKDVIKI---NTDEIKQSILSKAFRGELGTNDPTEE--------SAIELLK 475 Query: 446 KIKAE 450 ++ E Sbjct: 476 EVLQE 480 Score = 108 bits (270), Expect = 4e-22, Method: Composition-based stats. 
Identities = 52/256 (20%), Positives = 97/256 (37%), Gaps = 13/256 (5%) Query: 195 RFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILR 254 R +Q + + L + + + ++I + + + K + Sbjct: 2 RKKQKTMEELLEEALVPEGEQPYEVPGNWVWVKLKTINKDKKRNIDPKSFKDETFELY-- 59 Query: 255 ISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNL 314 + F++ E ++ + ++L + N + V +L Sbjct: 60 ----SVPSFPEGSPEFIKGDEIGSSKQLVNKDEILLCKINPRINRV--WKVLNNHGKFRQ 113 Query: 315 LYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLP 373 L + I K EY+ SP R + + V G K++++ + +P Sbjct: 114 LASTEWIVISENKAIYSEYLLYLLKSPYFRKLITSNVSGVGGSLTRARPKEVETYPIAVP 173 Query: 374 PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDL 433 P+KEQ I +VE+L + D ++ + A +IL KAFRGELT +WR EN ++ Sbjct: 174 PIKEQKRIADKVERLLSKIDEAKRLIEEAKETFELRRAAILDKAFRGELTRKWREENKNI 233 Query: 434 ISGENSAAALLEKIKA 449 A +L KIK Sbjct: 234 ----EDAESLYVKIKE 245 >UniRef50_C6JN70 Predicted protein n=1 Tax=Fusobacterium varium ATCC 27725 RepID=C6JN70_FUSVA Length = 507 Score = 217 bits (552), Expect = 8e-55, Method: Composition-based stats. 
Identities = 91/497 (18%), Positives = 202/497 (40%), Gaps = 70/497 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W + V +I G T K Y ++ + ++ +++ + + ++ + Sbjct: 26 EIPENWEWVKLGKVNNVITGSTPSKAN-EKYWENKNIFFVKPSDLYQKRNLKSSEEYIDE 84 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + + +I +GK A+ + + ++ +++IFS + + Sbjct: 85 RARDNVRILPKYSTLICC----IGSIGKVAYSEVEVSTN-QQINSLVPKKEIIFSLYNYY 139 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S+ +++++ + + I + + + + P+PPL EQK I EKLD++ +++ K Sbjct: 140 VANSNFFQSQMLNSAVATTIAILNKTNTENLRFPLPPLEEQKRIVEKLDSMFEKINRAKE 199 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVF--------------------- 223 ++ + ++ ++++L A G+LT +WR Sbjct: 200 LIQEAKENIENRKESILNKAFRGELTVEWRKNNQTEDAIELLKSINDEKIKNWEQECVEA 259 Query: 224 ----KKLNFESILTELRNGLSSK---PNESGVGHPILRISSV-------RAGHVDQND-- 267 KK + + +++N + SK P E +++ + + ++D+N+ Sbjct: 260 EKNGKKKPSKPKIEDIQNMIISKEEEPYEIPSKWKWVKLEYIIEINPKKKMLNIDENEKI 319 Query: 268 -----------------IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 I + S+ + + + D+LF + +E G C + K L+ Sbjct: 320 SFLPMRSISDITGEISNIEYESYSKLKKGYTQFLENDILFAKITPCMEN-GKCVIAKNLK 378 Query: 311 HQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 ++ + R +++ F S R + + G + + + +K + Sbjct: 379 NEIGYGTTEFHVLRTNYILNNKFLHNFLRQESFRQEAKYNMTGSVGFRRVPTEFLKEYMF 438 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 LPP++EQ EIVR ++++ I++ V A +SIL KAFRG+L Q + + Sbjct: 439 PLPPLEEQKEIVRILDEILEKESKIKELVELEEAIELL-EKSILDKAFRGKLGTQNKDDE 497 Query: 431 PDLISGENSAAALLEKI 447 P A LL+KI Sbjct: 498 P--------AIELLKKI 506 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. 
Identities = 55/291 (18%), Positives = 113/291 (38%), Gaps = 35/291 (12%) Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVF-KKLNFESILTELRNGLSSKPN 244 + ++ ++A++ E+ P++ + K +++T +++ Sbjct: 3 KTKGLTFEEKLKEALIPK-------EEQPYEIPENWEWVKLGKVNNVITGSTPSKANEKY 55 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 ++ S + ++ +++ + R + L+ GS+ V Sbjct: 56 WENKNIFFVKPSDLYQKRNLKSSEEYIDERARDNVRILPKYSTLICCI--GSIGKVAYSE 113 Query: 305 L-LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 + + Q N L P K I Y +S ++ M+N + ++ Sbjct: 114 VEVSTNQQINSLVPKKEI-------IFSLYNYYVANSNFFQSQMLNSA-VATTIAILNKT 165 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 + ++ LPP++EQ IV +++ +F + ++ + A + N +SIL KAFRGELT Sbjct: 166 NTENLRFPLPPLEEQKRIVEKLDSMFEKINRAKELIQEAKENIENRKESILNKAFRGELT 225 Query: 424 AQWRAENPDLISGENSAAALLEKIKAER-----------AASGGKKASRKK 463 +WR N A LL+ I E+ +G KK S+ K Sbjct: 226 VEWRKNNQT-----EDAIELLKSINDEKIKNWEQECVEAEKNGKKKPSKPK 271 >UniRef50_A1RES4 Restriction modification system DNA specificity domain n=1 Tax=Shewanella sp. W3-18-1 RepID=A1RES4_SHESW Length = 417 Score = 217 bits (552), Expect = 8e-55, Method: Composition-based stats. 
Identities = 73/428 (17%), Positives = 170/428 (39%), Gaps = 23/428 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P+GW++ V + L G T Q Y ++ +P + + + + + D Sbjct: 4 RVPDGWMLKIVRDTSKLSAGGTP-STQVTEYWENGTIPWMSSGEVHKKRVHSVDNCITTL 62 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 L S K+ P ++ +G G A + + + ++ +K ++ F+ H Sbjct: 63 GLENSSAKMFPSKSILVALAGQGKTRGTVAISEIEL-TTNQSIAAIIVKDKSVYPDFLYH 121 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S ++ +S G+ + A +++ +PPL EQ+ IA+ L ++ ++ T+A Sbjct: 122 NLDSRY--EELRGVSGGSGRAGLNLAILGDLDVLLPPLPEQQKIAKILTSVDQVIEKTQA 179 Query: 185 RFEQIPQILKRFRQAVLGG--AVNGKLTEKWRNFEPQH--SVFKKLNFESILTELRNGLS 240 + +++ + Q +L V+GK ++++ + + T + G + Sbjct: 180 QIDKLKDLKTGMMQELLTQGVGVDGKPHTEFKDSPVGWIPKTWDLEPLANFTTFISYGFT 239 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE---LNRHKLQDGDLLFTRYNGSL 297 + E+ VG ++ V V + R + + + Q D+L T+ Sbjct: 240 NPMPEAEVGPYMITAKDVNDLKVQYSTSRKTTQEAFDNLLTRKSRPQVNDILLTKDG--- 296 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G L+ N + + +P+++ +SP + M+ S Sbjct: 297 -TLGRVALV---TDSNCCINQSVAVLTPNERVIPKFLLYLLASPRYQQEMLENA-GGSTI 351 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 K I + +V +P V EQ ++V + +F + N L+++N+ ++++ Sbjct: 352 KHIYITVVDKMLVGVPSVTEQQKLVDIFDSVFRKLE----LTENKLSKLNDTKKALMQDL 407 Query: 418 FRGELTAQ 425 G++ Sbjct: 408 LTGKVRVN 415 Score = 117 bits (293), Expect = 8e-25, Method: Composition-based stats. 
Identities = 40/209 (19%), Positives = 86/209 (41%), Gaps = 15/209 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +P+ W + P++ TT I G T +A + +I A ++ + K + Sbjct: 215 VGWIPKTWDLEPLANFTTFISYGFTNPMPEA-----EVGPYMITAKDVNDLKVQYSTSRK 269 Query: 62 VPK----NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 + NL+ + DI++ +G+ A C VL P + + Sbjct: 270 TTQEAFDNLLTRKSRPQVNDILLTKD----GTLGRVALVT-DSNCCINQSVAVLTPNERV 324 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ + S Y+ ++ + G+ I +I D + + +P + EQ+ + + D++ Sbjct: 325 IPKFLLYLLASPRYQQEMLENAGGSTIKHIYITVVDKMLVGVPSVTEQQKLVDIFDSVFR 384 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVN 206 +++ T+ + ++ K Q +L G V Sbjct: 385 KLELTENKLSKLNDTKKALMQDLLTGKVR 413 >UniRef50_A7I739 Restriction modification system DNA specificity domain n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7I739_METB6 Length = 457 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 109/451 (24%), Positives = 184/451 (40%), Gaps = 56/451 (12%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W P+ + ++ G +K E + PLIR +I N K T+ + + Sbjct: 24 SWERVPLGKIAKVLNGFAFKSEL---FNDKKGTPLIRIRDIGNNK---TECYY--DGVFD 75 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 E+ I P D+++ M + P C + + F+ + Sbjct: 76 EAYVIHPGDLLVGMDGDF-----NCSTWRGPKALLNQRVCKIEVNIEQYNRKFLEYVL-- 128 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 Y I+ ++ + ++ S I +P PPL EQ+ I +++ LL+ V++ + R + Sbjct: 129 PGYLKAINENTSSQTVKHLSSRSISEILLPNPPLTEQQRIVARVEALLSHVNAARERLSR 188 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL---------------------- 226 +P I+K+FRQAVL A +G LTE WR P KL Sbjct: 189 VPLIMKKFRQAVLAAACSGGLTEGWRKENPDIEEANKLVKRLESIRKQFKIREISSIDNL 248 Query: 227 ---NFESILTEL--------RNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECS 274 + T + + P S G + + + +D + + Sbjct: 249 ELSDLPDSWTWIRLANIAIVMDPDHKMPKSSDGGIIFISPKDFKENYQIDMTKTKRISDE 308 Query: 275 
E--SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE 332 E + + D+L++R L G + ++ Y +IR +L + + Sbjct: 309 EFLRLSKKFVPRPLDILYSRIGADL---GKARKAPQDIKFHISYSLAVIR-QLGEMENSD 364 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 Y+ +S RN V+ + G + +DI + ++ LPP+ EQ EIVRRV LF A Sbjct: 365 YLFWLLNSMFIRNQAFENVR-SIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFERA 423 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELT 423 D I+++V A R LTQ++L KAFRGELT Sbjct: 424 DAIDREVEAATRRCERLTQAVLGKAFRGELT 454 Score = 114 bits (286), Expect = 7e-24, Method: Composition-based stats. Identities = 45/211 (21%), Positives = 83/211 (39%), Gaps = 13/211 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDLVFV- 62 LP+ W ++ + ++ + + D + I + + D T + Sbjct: 252 DLPDSWTWIRLANIAIVM-----DPDHKMPKSSDGGIIFISPKDFKENYQIDMTKTKRIS 306 Query: 63 PKNLVKESQKISPE--DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFS 119 + ++ S+K P DI+ + + A Q + F S+ V+R ++ S Sbjct: 307 DEEFLRLSKKFVPRPLDILYSRIGADLGK-ARKAPQDIKFHISYS--LAVIRQLGEMENS 363 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ S RN+ + ++ D IP+PPLAEQ I ++ L + Sbjct: 364 DYLFWLLNSMFIRNQAFENVRSIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFERA 423 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 D+ E + +R QAVLG A G+LT Sbjct: 424 DAIDREVEAATRRCERLTQAVLGKAFRGELT 454 Score = 91.7 bits (226), Expect = 6e-17, Method: Composition-based stats. 
Identities = 43/203 (21%), Positives = 79/203 (38%), Gaps = 19/203 (9%) Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 G P++RI + + + E + + GDLL Sbjct: 52 GTPLIRIRDIGNNK----TECYYDGVFDE--AYVIHPGDLLVGMDGD--------FNCST 97 Query: 309 LQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + L ++ + + + +++E +N ++ K +S + I Sbjct: 98 WRGPKALLNQRVCKIEVNIEQYNRKFLEYVL---PGYLKAINENTSSQTVKHLSSRSISE 154 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 ++ PP+ EQ IV RVE L ++ + ++++ + Q++LA A G LT WR Sbjct: 155 ILLPNPPLTEQQRIVARVEALLSHVNAARERLSRVPLIMKKFRQAVLAAACSGGLTEGWR 214 Query: 428 AENPDLISGENSAAALLEKIKAE 450 ENPD I N LE I+ + Sbjct: 215 KENPD-IEEANKLVKRLESIRKQ 236 >UniRef50_A0ZMI3 Putative uncharacterized protein n=1 Tax=Nodularia spumigena CCY9414 RepID=A0ZMI3_NODSP Length = 437 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 88/442 (19%), Positives = 168/442 (38%), Gaps = 36/442 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W I S G KD +PL+R +N++ G D ++ Sbjct: 16 GDIPEHWEIVRFSNFINFQEG----PGIMAADFKDYGVPLLRIHNLKPGFVDLERCNYLE 71 Query: 64 KNLVKES---QKISPEDIVIAMS--SGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLI 117 V+++ K++ +DI+I+ S +G S+V K A + + L+P I Sbjct: 72 PQKVEKTWKHFKLNEDDILISCSASTGLVSIVDKKAEGSIAYTG-----IIRLKPANSNI 126 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 FI S L+ +I L G I + P I I PPL EQK IA LD+ L Sbjct: 127 CREFIKIIVASELFFTQIELLKTGTTIQHYGPTHLRQIKITFPPLYEQKKIACFLDSKLE 186 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNF 228 ++D + +++ ++LK + A++ AV L +W P H + Sbjct: 187 EIDKFISNKQRLIELLKEQKTAIINRAVTKGLNPHAPMKPSGIEWLGDIPAHWEVTRAKH 246 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 S + + + +G P + + + + + ++ +L +N G Sbjct: 247 ISYVFVPQR--NKPNLNLNIGFPWITMEDITSPSISKSTFGYLVSEIDAMN-----AGSK 299 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 
348 L + VG GL Q ++ ++A + P Y+ + Sbjct: 300 LLPEGSVIASCVGNFGLSSVNTLQVIINQQ--LQAYIPIKINPYYLRYLIG---ISKSYF 354 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + + ++ ++LPP EQ IVR +++ D + + + Sbjct: 355 EQIANATTLAYVNQAGFAELPIILPPNDEQLAIVRNIDKELTTIDKAITTIEKEIELIKE 414 Query: 409 LTQSILAKAFRGELTAQWRAEN 430 +++++A G++ + A + Sbjct: 415 YRTTLISEAVTGKIDVRETAAH 436 >UniRef50_C6MBL0 Restriction modification system DNA specificity domain protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MBL0_9PROT Length = 467 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. Identities = 84/446 (18%), Positives = 176/446 (39%), Gaps = 32/446 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G+ P W + V + ++ D+ LI ++ + + + Sbjct: 26 IGEYPLNWNLTRVK-FESYVKARVGWHGLKSEDFTDEGPFLITGSDFRGPVINWNECYHC 84 Query: 63 PKNLVKESQKI--SPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRP-EKLIF 118 ++ I D++I +GK A L + + + V+RP Sbjct: 85 DLARYEQDPYIQLKDGDLLITKD----GTIGKVALVSGLAGKATLNSGVFVVRPLTNNYT 140 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F ++S++ + G+ I ++ +F IP EQ IA LD A+ Sbjct: 141 SRFYFWLLQASVFTGFVDFNKTGSTIVHLYQDTFVNFKYAIPSFNEQLTIANFLDHETAK 200 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFK----- 224 +D+ + +Q+ ++LK RQAV+ AV L +W P+H K Sbjct: 201 IDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPNAKMRDSGVEWLGEVPEHWSMKIKLVS 260 Query: 225 --KLNFESILTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRH 281 + + S + VG P++ I + + G++ ++ + E +L Sbjct: 261 VAEGSRGSFVNGPFGSDLLSLELQDVGVPVIYIRDLKQTGYMRKSAVCVTEEKARQLEIC 320 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSS 340 K+ GD+L + G + + + ++ D +IR R+ + + P Y+ + +S Sbjct: 321 KVVSGDVLIAKVGDPP---GEACIYPENEPAAIITQD-VIRIRVNRGVINPYYLVMLLNS 376 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + +++ + S +K IS D K ++P + EQ++IV VE DT+ + Sbjct: 377 DLGK-VVVDNISIESTRKRISLGDFKQVRFIIPSLSEQSDIVSFVELRCRKIDTLIAKAQ 435 Query: 401 
NALARVNNLTQSILAKAFRGELTAQW 426 + ++ + ++++ A G++ + Sbjct: 436 SMVSLIIERRTALISAAVTGKIDVRD 461 Score = 108 bits (270), Expect = 4e-22, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 83/234 (35%), Gaps = 14/234 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI-LRISSVRAGHVDQNDIRF 270 +W P + ++ FES + K + P + S R ++ N+ Sbjct: 24 EWIGEYPLNWNLTRVKFESYVKARVGWHGLKSEDFTDEGPFLITGSDFRGPVINWNECYH 83 Query: 271 LECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + + E + + +L+DGDLL T+ +G L+ L + L + LT + Sbjct: 84 CDLARYEQDPYIQLKDGDLLITKDG----TIGKVALVSGLAGKATLNSGVFVVRPLTNNY 139 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 + + ++ KT S + + +P EQ I ++ Sbjct: 140 TSRFYFWLLQASVF-TGFVDFNKTGSTIVHLYQDTFVNFKYAIPSFNEQLTIANFLDHET 198 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A DT+ ++ + + Q++++ A + NP+ ++ L Sbjct: 199 AKIDTLIEKQQQLIKLLKEKRQAVISHAVT-------KGLNPNAKMRDSGVEWL 245 >UniRef50_B4VXC6 Type I restriction modification DNA specificity domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXC6_9CYAN Length = 506 Score = 216 bits (551), Expect = 1e-54, Method: Composition-based stats. 
Identities = 111/463 (23%), Positives = 196/463 (42%), Gaps = 58/463 (12%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P W+ + + Y K P+ +N I + + + L Sbjct: 5 PLSWIGVTLGDLLRFN----YGKSLPERARSGAGFPVYGSNGI---------VGYHDEPL 51 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKLIFSGFIAH 124 + +I GS V S P + ++ F G + + + + Sbjct: 52 TD-------GETLIIGRKGSVGEVHFSPGACFPIDTTYYVDQFHG-------MPTRYWFY 97 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 K+ +S L I ++ + I + PL EQK IA+KLD LLA+VD+ + Sbjct: 98 QLKNLG----LSELDKATAIPSLNRKDAYRVQIHLSPLNEQKRIADKLDALLARVDACRD 153 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 R ++ I+++ RQA+L ++GK+T+ W ++ + N L++ + + P+ Sbjct: 154 RLIRVSFIIQQLRQAILTDGISGKITQYWSKNNAENLAYNHQNIVGKLSDFADVIDPNPS 213 Query: 245 -----ESGVGHPIL---RISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 G PIL ++S + + + E+ H + D++F R Sbjct: 214 HRYPSYKGGTIPILATEQMSGLNDWDTSSAKLIKYDFYEARKAAHDFLNDDIIFARK--- 270 Query: 297 LEFVGVCGLLKKL-QHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTT 354 G GL + Q+ ++ + R+ D LP Y+ F + +++ + + Sbjct: 271 ----GRLGLARNPPQNIRYVFSHTVFIIRVKADNILPSYLLWFLRQEFCIDWLLSEMNSN 326 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 +G + ++ + +P EQ EIV+ +E+L+AYAD IE + NAL RV LT ++L Sbjct: 327 AGVPTLGKSVMERLPITIPDYAEQQEIVQCIEKLYAYADRIEARYQNALTRVEQLTPTLL 386 Query: 415 AKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK 457 +KAFRGEL Q + P + LLE+I+AERAA K Sbjct: 387 SKAFRGELVPQDPDDEP--------VSVLLERIRAERAAQPNK 421 >UniRef50_C2CSZ9 Type I restriction modification DNA specificity protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CSZ9_CORST Length = 371 Score = 216 bits (549), Expect = 2e-54, Method: Composition-based stats. 
Identities = 97/415 (23%), Positives = 165/415 (39%), Gaps = 52/415 (12%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W + + V L G KKE+ + P+ + + V N V Sbjct: 8 DWPMVRLGDVCHLKYGKALKKEERVA----GEFPVFGSA--------GSVGSHVEANFV- 54 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 + + GS V S+ + +FG F + E+ + S ++ K Sbjct: 55 -------GPVSVVGRKGSAGFVEWSSGNCWIIDTAFGVFP---KSEEQVDSRWLYWLLKD 104 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 ++ L A + I A +PPL EQ+ IA LD + + Sbjct: 105 L----RLGRLQKHAAVPGISKADVVEEKFLLPPLDEQRRIAAILDEVDEALFRVNQSLGD 160 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 + Q+ + + E + ++ + L + G S K NE V Sbjct: 161 LLQLKQELFTDLF------------LRIERESTIIGE-----YLESTQYGTSDKANE-NV 202 Query: 249 GHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G PILR+ +V G +D +D++++E S+ ++ L+ GDLLF R N S + VG ++ Sbjct: 203 GIPILRMGNVSYNGEIDLSDLKYVELDASDREKYSLKAGDLLFNRTN-SKDLVGKTAVVP 261 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 +LQ + Y LIR R+ A+PEYI F +S + + N K G I+ ++K Sbjct: 262 ELQEE-YTYAGYLIRCRVNDKAVPEYISGFLNSVLGKKILRNTAKAIVGMANINANELKR 320 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + EQ E L + D +E Q+ + L +S+ +AF+ EL Sbjct: 321 LPIPQASLDEQQEFAS----LTSRIDDVESQMKRQRKLLQELQESLSTRAFQEEL 371 >UniRef50_B8GLU3 Type I restriction-modification system, S subunit n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GLU3_THISH Length = 458 Score = 215 bits (548), Expect = 2e-54, Method: Composition-based stats. 
Identities = 85/453 (18%), Positives = 175/453 (38%), Gaps = 31/453 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT---DLVFV 62 +P W+ + ++ G + + D LP +RA N+ G D + ++ F Sbjct: 21 IPVHWMTGQIKNAHDVVLGKMLQSD--AKTPADRLLPYLRAANVNWGGVDLSTVKEMWFS 78 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 P +++ ++ D+VI+ VG+SA EC F RP+ S + Sbjct: 79 PAE--RKALRLMVGDVVIS----EGGDVGRSAVWQGELPECYFQNAINRARPKGEHSSRY 132 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ I + + I + PP EQ IA LD A++D Sbjct: 133 LYYWMSFIKSAGYIDIICNKSTIPHYTAEKVQGTPFLFPPAGEQAGIAAFLDHETAKIDR 192 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A+ +++ ++LK RQAV+ AV L +W P H +KL + +I Sbjct: 193 LIAKQQRLIELLKEKRQAVISHAVTKGLNPDAPMKDSGVEWLGEVPAHWRLEKLKYTAIF 252 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 P G P + +++ +V + + + + + + G +L Sbjct: 253 KGGGTPSKDSPEYWGGDIPWVSPKDMKSRYVADSQDKITVEAIAASSTSLIGPGQVLVVV 312 Query: 293 YNGSLEFV--GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 +G L+ L++ +Q++ K I R + E+ F N ++ Sbjct: 313 RSGILQRTIPVAVNLVEVTLNQDM----KAIDFR--DETRSEFFSY-FVEGHEDNLLLEW 365 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 K + + I + + + +V +PP E EI++ + + ++ A+ + Sbjct: 366 RKQGATVESIEQEYLGNTMVPMPPPSEMMEILQFLNGQLEKYRLLTEKATRAIELLREHR 425 Query: 411 QSILAKAFRGELTAQ-WRAENPDLISGENSAAA 442 ++++ A G++ + W+ N + +A+A Sbjct: 426 TALISAAVTGKIDVRGWQKPNTEPQEAAEAASA 458 Score = 100 bits (250), Expect = 9e-20, Method: Composition-based stats. 
Identities = 36/208 (17%), Positives = 78/208 (37%), Gaps = 5/208 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + G T K + Y D +P + ++++ + Sbjct: 235 GEVPAHWRLEKLKYTAIFKGGGTPSK-DSPEYWGGD-IPWVSPKDMKSRYVADSQDKITV 292 Query: 64 KNLVKES-QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + S I P +++ + SG A + E + + S F Sbjct: 293 EAIAASSTSLIGPGQVLVVVRSGILQRTIPVAVNLV--EVTLNQDMKAIDFRDETRSEFF 350 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++F + + GA + +I+ +P+PP +E I + L+ L + Sbjct: 351 SYFVEGHEDNLLLEWRKQGATVESIEQEYLGNTMVPMPPPSEMMEILQFLNGQLEKYRLL 410 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + + ++L+ R A++ AV GK+ Sbjct: 411 TEKATRAIELLREHRTALISAAVTGKID 438 >UniRef50_B0CE92 Type I restriction-modification enzyme S subunit n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CE92_ACAM1 Length = 382 Score = 214 bits (546), Expect = 5e-54, Method: Composition-based stats. Identities = 78/414 (18%), Positives = 169/414 (40%), Gaps = 38/414 (9%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN--LVKESQ 71 + V + G T K++ + + +P I +I NG + ++ + L ++ Sbjct: 2 KLKEVCRFLNGGTPSKKKPEYF--EGEIPWITGADI-NGPIVNSARSYITEEAILNSATK 58 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSL 130 ++ P +++ +++ VGK A + E + L P+ + + ++ HF +S Sbjct: 59 RVPPNTVLLV----TRTSVGKVAVSGM--ELCYSQDITSLWPDLEKLDIYYLTHFLRSR- 111 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 + S GA I + + +++ +PP+AEQK IA LD A + + Sbjct: 112 -ETYLKGQSRGATIKGVTKGVLENLSLHLPPIAEQKRIAGILDAADALRVKRRDAISTLD 170 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 +L+ + G + + + + E++ ++ +G P + G Sbjct: 171 ALLQSTFLTLFGDPITNPMG------------WDASDLEAVSEKITDGTHKTPKYTESGI 218 Query: 251 PILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 L ++ G + N +F+ E + + R + GD+L + +G ++ + Sbjct: 219 EFLSAKDIKNGSIKWNTGKFISEDEHKSLITRCHPEIGDVLLAKSGS----LGSVAIIDR 274 Query: 309 
LQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQ 368 H+ L+ + + +++ SP + +++ K S K + DI+ Sbjct: 275 -DHEFSLFESLCLIKHNRQKIEAQFLTAMLESPRMQMHLLSRNKGIS-IKHLHLTDIRKL 332 Query: 369 VVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +LLPP+ +Q + V A + + Q LA ++ L S+ ++AF GEL Sbjct: 333 KILLPPLDKQRKFATIV----ASIEKQKAQQCAHLAELDTLFASLQSRAFNGEL 382 Score = 102 bits (254), Expect = 3e-20, Method: Composition-based stats. Identities = 41/207 (19%), Positives = 83/207 (40%), Gaps = 16/207 (7%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P GW + + V+ I T+K + + + + A +I+NG F+ ++ Sbjct: 188 PMGWDASDLEAVSEKITDGTHKTPK----YTESGIEFLSAKDIKNGSIKWNTGKFISEDE 243 Query: 67 VKE-SQKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAFCGVLRPEKLIFSGFI 122 K + P D+++A S +G A E S F + C + + I + F+ Sbjct: 244 HKSLITRCHPEIGDVLLAKS----GSLGSVAIIDRDHEFSLFESLCLIKHNRQKIEAQFL 299 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +S + + S + G +I ++ + I +PPL +Q+ A + ++ Q Sbjct: 300 TAMLESPRMQMHLLSRNKGISIKHLHLTDIRKLKILLPPLDKQRKFATIVASIEKQKAQQ 359 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 A ++ ++ A NG+L Sbjct: 360 CAHLAEL----DTLFASLQSRAFNGEL 382 >UniRef50_A3J6X3 Type I restriction-modification system, S subunit n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J6X3_9FLAO Length = 450 Score = 214 bits (545), Expect = 6e-54, Method: Composition-based stats. 
Identities = 83/442 (18%), Positives = 175/442 (39%), Gaps = 37/442 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W V + G T ++ + +P + + +QN + + + + Sbjct: 17 EIPENWDYCKVKHIANTYAGGTP-STVVDSFWHNGDIPWLPSGKLQNCEIISAEKFITNE 75 Query: 65 NLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L+ S K I P +++A++ + + +G Q C+ + V + S F+ Sbjct: 76 GLIGSSTKWIKPNTVLVALTGATCANIGYLTFQ----ACANQSVIAVDENPEKANSRFLY 131 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + R++I + G I + + + P L EQ IA+ LD +D+T Sbjct: 132 YMFLN--MRSQILTHQTGGAQAGINDSDVKNLYLLNPSLEEQIKIADYLDYKTNLIDATI 189 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTE 234 + +++ ++LK RQAV+ AV L +W P++ KK+ L Sbjct: 190 EKKKRLIELLKEKRQAVINEAVTKGLNPNAPMKDSGLEWLGEIPENWEVKKVK---YLLS 246 Query: 235 LRNGLSSKPNES--------GVGHPILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQ 284 NG+ P S G I +V R+++ E + ++++ Sbjct: 247 SENGIKIGPFGSALKLDTLTDNGIKIYGQGNVIKDDFTLG-HRYIDPERFEKDFKQYEIL 305 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSA 343 DGD+L T + G + + +L L+R R +D I Sbjct: 306 DGDILITMMGTT----GKSKVFNSSYEKGIL-DSHLLRLRFNEDLFDGRLFSILLEQSDY 360 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + S G++ +K +++ P ++ Q EI+ +++ D I ++ + + Sbjct: 361 VFQQLALNSVGSIMAGLNSSIVKELIIITPKLEIQKEILNYIDENCKIIDIISSKILSQI 420 Query: 404 ARVNNLTQSILAKAFRGELTAQ 425 ++ QS++++A G++ + Sbjct: 421 EKLQTYRQSLISEAVTGKIDVR 442 Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats. 
Identities = 43/215 (20%), Positives = 86/215 (40%), Gaps = 12/215 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRG---VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++PE W + V + + G + ++ L D+ + + N+ F Sbjct: 230 GEIPENWEVKKVKYLLSSENGIKIGPFGSALKLDTLTDNGIKIYGQGNVIKDDFTLGHRY 289 Query: 61 FVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLI 117 P+ K + +I DI+I M + GKS + +E + LR + + Sbjct: 290 IDPERFEKDFKQYEILDGDILITMMGTT----GKSKVFNSSYEKGILDSHLLRLRFNEDL 345 Query: 118 FSGFIAHFT--KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 F G + +S +++ S G+ + + + + I P L QK I +D Sbjct: 346 FDGRLFSILLEQSDYVFQQLALNSVGSIMAGLNSSIVKELIIITPKLEIQKEILNYIDEN 405 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +D ++ + L+ +RQ+++ AV GK+ Sbjct: 406 CKIIDIISSKILSQIEKLQTYRQSLISEAVTGKID 440 Score = 82.8 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 30/232 (12%), Positives = 77/232 (33%), Gaps = 17/232 (7%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFL 271 W P++ + K+ + + + G P L ++ + + Sbjct: 14 WYPEIPENWDYCKVKHIANTYAGGTPSTVVDSFWHNGDIPWLPSGKLQNCEIISAEKFIT 73 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + ++ +L + C + L Q + + A Sbjct: 74 NEGLIGSSTKWIKPNTVLVALTGAT------CANIGYLTFQACANQSVIAVDENPEKANS 127 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ F + R+ ++ +T Q GI+ D+K+ +L P ++EQ +I ++ Sbjct: 128 RFLYYMFLN--MRSQILTH-QTGGAQAGINDSDVKNLYLLNPSLEEQIKIADYLDYKTNL 184 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D ++ + + Q+++ +A + NP+ ++ L Sbjct: 185 IDATIEKKKRLIELLKEKRQAVINEAVT-------KGLNPNAPMKDSGLEWL 229 >UniRef50_B7KF57 Restriction modification system DNA specificity domain protein n=1 Tax=Cyanothece sp. PCC 7424 RepID=B7KF57_CYAP7 Length = 417 Score = 213 bits (543), Expect = 1e-53, Method: Composition-based stats. 
Identities = 80/428 (18%), Positives = 169/428 (39%), Gaps = 35/428 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW +S + T G T ++ + + ++ ++ + + + ++ + Sbjct: 14 LPDGWEWKKISDIATTTSGGTPSRKNSEYFT--GHINWFKSGELGDSEIFNSEEKITEEA 71 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE----KLIFSGF 121 + K S KI P+D ++ G + VGK + + A C + + K++ F Sbjct: 72 IKKSSAKIFPKDTLLIAMYG--ATVGKLGILGID-AATNQAVCAIFPKKNLGIKIVEEKF 128 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-------PLAEQKIIAEKLDT 174 + +F K R+++ S G NI + + IPIP L Q+ I ++++ Sbjct: 129 LFYFFK--FIRSQLIERSFGGAQPNISQTIINNVTIPIPYPNNPKLSLDIQQRIVARIES 186 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE 234 LL ++ ++ EQ+ Q ++ + + E W+N + K + + T Sbjct: 187 LLGEIKHNRSLLEQMRQDTEQLLDSAIKECFALSRMETWKNHSCLGEIAKIIAKQVDPTL 246 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 + P + + ++A D R +E ++ G +L+++ Sbjct: 247 PQYQT----------LPHIGVDVIQANTCQLEDYRTIEEDGVTSGKYLFTSGSILYSKIR 296 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 L + + L D + ++ + P+++ F SP + + Sbjct: 297 PYLRKSVLV------DFEGLCSADIYPLSVISDEIEPKFLMWFLISPLFTDYAKSHSGR- 349 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 + I+ + S ++ P +EQ I+ ++ + I+K + N L Q+IL Sbjct: 350 ARIPKINRDALFSFKLVYPNYEEQISIISYLDLIRFEVQKIDKLLKEDEKNFNYLEQAIL 409 Query: 415 AKAFRGEL 422 KAFRGEL Sbjct: 410 EKAFRGEL 417 >UniRef50_A8V066 Type I restriction-modification enzyme, S subunit (Fragment) n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V066_9AQUI Length = 475 Score = 213 bits (543), Expect = 1e-53, Method: Composition-based stats. 
Identities = 91/440 (20%), Positives = 174/440 (39%), Gaps = 39/440 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT----DLVF 61 +PE W + + + + +G T K++ Y +I+ + +N KF + F Sbjct: 2 IPEDWEVVRLGDIAEIQQGKTPKRDL---YDDRKGYRIIKVKDFENEKFVKHYPNGERSF 58 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSA---HQHLPFECSFGAFCGVLRPEKLIF 118 V +L + DI+I + S VVG+ + + + F + +R Sbjct: 59 VKVDLGNR-YTLEQGDILILSAGHSSKVVGQKIGFYNVNSNNKVFFVSELLRIRANNKTN 117 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ S R +I G ++ P + IP+PPL EQK IA T+L + Sbjct: 118 PLFLFFSIISQKSRKQIKEEIKGG---HLYPRDLVNLKIPLPPLPEQKAIA----TVLDK 170 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVN-----------GKLTEKWRNFEPQHSVFKKLN 227 + + E++ Q K +++++ KL E P+H K L Sbjct: 171 IRQAIEQTEEVIQANKELKKSLMKHFFTYGVVPPEETDKVKLKETEIGLIPEHWEIKTLK 230 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDG 286 E +S NE G PI+ + + + G + N IR ++ + + L+DG Sbjct: 231 DSVDSIEYGYSVSIPANEDQKGIPIISTADITKEGKLLYNKIRKIKPPKRLTEKLILKDG 290 Query: 287 DLLFTRYNGSLEFVGVCGLL---KKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPS 342 D+LF N S E +G + K + +Y ++R R + ++ Y++ + Sbjct: 291 DVLFNWRN-SPELIGKTTVFEAEKVSKDDFYIYASFILRIRSKESESNNFYLKYLLNYYR 349 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + + Q + +I + + LPP+ EQ +I + + ++ + E N Sbjct: 350 EIGTFIKLARRAVNQANYNRNEIYNLKIPLPPIDEQKQIAKILNKIDNKIEAEE----NK 405 Query: 403 LARVNNLTQSILAKAFRGEL 422 + L +S+L G++ Sbjct: 406 KEALEKLFKSLLNNLMTGKI 425 Score = 101 bits (252), Expect = 5e-20, Method: Composition-based stats. 
Identities = 41/230 (17%), Positives = 85/230 (36%), Gaps = 14/230 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLV 60 G +PE W I + I Y N +P+I +I + GK + Sbjct: 216 EIGLIPEHWEIKTLKDSVDSIE-YGYSVSIPANE-DQKGIPIISTADITKEGKLLYNKIR 273 Query: 61 FV-PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQH-----LPFECSFGAFCGVLRPE 114 + P + E + D++ + S ++GK+ + +F +R + Sbjct: 274 KIKPPKRLTEKLILKDGDVLFNWRN-SPELIGKTTVFEAEKVSKDDFYIYASFILRIRSK 332 Query: 115 K-LIFSGFIAHFTKSSL-YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 + + ++ + I N N + IP+PP+ EQK IA+ L Sbjct: 333 ESESNNFYLKYLLNYYREIGTFIKLARRAVNQANYNRNEIYNLKIPLPPIDEQKQIAKIL 392 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV--NGKLTEKWRNFEPQH 220 + + ++++ + + E + ++ K ++ G + N EK+ E Sbjct: 393 NKIDNKIEAEENKKEALEKLFKSLLNNLMTGKIRLNKNFIEKFEKEEIHQ 442 >UniRef50_UPI00016B0992 probable type I restriction-modification system n=1 Tax=Burkholderia pseudomallei BCC215 RepID=UPI00016B0992 Length = 442 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. 
Identities = 94/442 (21%), Positives = 170/442 (38%), Gaps = 36/442 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVT-YKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W + + R EQ K +P Q + D V Sbjct: 18 GRVPTSWAVVQARRLFEQRRDAALPGDEQLSASQKYGVVP-------QRLFMELEDQKVV 70 Query: 63 -PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + ++ + + P D VI++ S + +H F VLR I F Sbjct: 71 LALSGLENFKHVEPNDFVISLRSFQGGI------EHSAFGGCVSPAYTVLRATSKIAPDF 124 Query: 122 IAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 A+ KS Y + + +++ G NI F + +P+P + EQ IA LD ++D Sbjct: 125 WAYLLKSDTYISALQTVTDGIRDGKNISYMQFGALCVPVPNIDEQSAIAAFLDCETGKID 184 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFES- 230 + A E++ +L RQA L AV L W P H V +++ S Sbjct: 185 ALIAEQEKLIALLAEKRQAALSYAVTRGLNPDAPMKDSGVAWLGEVPAHWVIRRVKSVSV 244 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAG-HVDQNDIRFLECS-ESELNRHKLQDGDL 288 +T G S + S G ++ + V+ + + ++E R +L +GD+ Sbjct: 245 FMTSGPRGWSER--ISDEGSIFVQSGDLNDFLGVEFEIAKRVSVEFDAEAERTRLANGDV 302 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 + V VC + + + N L R + D LP ++ S + Sbjct: 303 VVCITGAKTGKVAVCASVPEPAYVN----QHLCLIRPSPDVLPLFLGNSLKSTIGQTQF- 357 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + Q G+S +++ +++LPP EQ EIV ++ A D ++ + A+ + Sbjct: 358 ELSQYGLKQ-GLSLDNVREALIVLPPPGEQVEIVTFIDAETARLDELKAEAARAIELLKE 416 Query: 409 LTQSILAKAFRGELTAQWRAEN 430 +++A A G++ + A Sbjct: 417 RRSALIAAAVTGKIDVRNAAPQ 438 >UniRef50_Q0W5N3 Type I restriction modification system, specificity subunit n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W5N3_UNCMA Length = 449 Score = 213 bits (542), Expect = 1e-53, Method: Composition-based stats. 
Identities = 99/440 (22%), Positives = 185/440 (42%), Gaps = 24/440 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTDLVF 61 G++PE W I + + + +K+ D Y I +++ N K + + Sbjct: 29 GRIPEEWSIVSIKNIVEKTEQIDPQKQP------DKYFKYIDVSSVSNESLKVVSVNEFK 82 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSG 120 + + + +DI+ A + V CS AFC VLR K I Sbjct: 83 GINAPSRARRIVRTDDIIFATIRPNLKRVAIICDDLEGQLCST-AFC-VLRCMKNIAEPY 140 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ + + K+ L G+ + I +PP++EQ+ IA L TL + ++ Sbjct: 141 FVFQTVTTDRFIGKLCDLQCGSGYPAVTDNDLLDQQILLPPISEQRKIAAILGTLDSLIE 200 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 T + Q+ K Q L + G + + +K + F ++ +NG+ Sbjct: 201 ETDRVVARTGQLKKGLIQEFLTEGM-GNVELEDTALGMIPKHWKCVPFATLSLTYKNGIY 259 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 G G+P +R+ ++ G V+ + L +++EL ++L +GDLL R N S + V Sbjct: 260 KHDKYYGSGYPCIRMYNIADGTVNTINSPLLNVTDAELKEYELAEGDLLINRVN-SRDLV 318 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 G G++ ++ + K IR RL + LPE++ +F S RN + VK+ Q Sbjct: 319 GKAGIVPAGL-GHVTFESKNIRVRLNRSMILPEFMGLFIQSSMYRNQVNKFVKSAIAQST 377 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 I+ D+ + +V LPP EQ +I + ++ + R+ + ++++ Sbjct: 378 INQDDLDNILVPLPPKDEQEKIASVIREINSKI----TWEIRYRERIELVKKALMQDLLT 433 Query: 420 GELTAQWRAENPDLISGENS 439 G + + PD I+ E + Sbjct: 434 GRIRVK-----PDTIAPEAT 448 >UniRef50_A1TSH8 Restriction modification system DNA specificity domain n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TSH8_ACIAC Length = 429 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. 
Identities = 86/432 (19%), Positives = 174/432 (40%), Gaps = 35/432 (8%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTDLVFVPK 64 PE W +A + V L Y + NI++ G+ + + Sbjct: 10 PEVWRLARLKFVAPLRNERMSAGSDHPGY--------LGLENIESWTGRIIEVESKRDDE 61 Query: 65 NLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + D++ + H P + V+RP +L+ F Sbjct: 62 PADQSAGLANIFREGDVLFCKLRPYLAK-----ACHAPRDGVGSTELLVMRPSELLEPRF 116 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + + + + + GA + + + +PPL EQ++IA LD A +D Sbjct: 117 LLYSILTPDFVGAVDASTFGAKMPRANWDFIGSLEVKVPPLEEQRLIANYLDRETAGIDG 176 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A E++ +L+ R A++ V L +W P H ++L L Sbjct: 177 LIAEKERMLALLEEKRAALISRVVTRGLDPNAPLKPSGQEWLGEIPVHWGLQRLK---QL 233 Query: 233 TELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 E+R GL+ SG +P LR+++V+ G++ +D+ +E SE + L GD+L Sbjct: 234 AEVRGGLTLGKQYSGELLEYPYLRVANVQDGYLKLDDVLTVEVPASEAASNLLVYGDVLM 293 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 G ++ +G G + + + L+ + + R +++ ++ S+ A+ + Sbjct: 294 NE-GGDIDKLGR-GCVWRDEISPCLHQNHVFAVRPHS-VDSDWLALWTSTIQAKRYFESR 350 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 K ++ ISG +IK V LPPV EQ I + + +T+ ++ ++L + Sbjct: 351 AKRSTNLASISGSNIKELPVPLPPVSEQLAIQNFLAVRHSRLETLRGELRDSLRLLIERR 410 Query: 411 QSILAKAFRGEL 422 +++ G++ Sbjct: 411 AALITAGVTGQI 422 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. 
Identities = 39/210 (18%), Positives = 90/210 (42%), Gaps = 8/210 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + + + G+T K+ + L+ P +R N+Q+G D++ V Sbjct: 219 GEIPVHWGLQRLKQLAEVRGGLTLGKQYSGELLE---YPYLRVANVQDGYLKLDDVLTVE 275 Query: 64 KNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 + S + D+++ G +G+ C +RP + S + Sbjct: 276 VPASEAASNLLVYGDVLMNE-GGDIDKLGRGCVWRDEISPCLHQNHVFAVRPHS-VDSDW 333 Query: 122 IAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +A +T + + S + N+ +I ++ + +P+PP++EQ I L ++++ Sbjct: 334 LALWTSTIQAKRYFESRAKRSTNLASISGSNIKELPVPLPPVSEQLAIQNFLAVRHSRLE 393 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + + ++L R A++ V G++ Sbjct: 394 TLRGELRDSLRLLIERRAALITAGVTGQIP 423 Score = 79.4 bits (194), Expect = 3e-13, Method: Composition-based stats. Identities = 36/225 (16%), Positives = 85/225 (37%), Gaps = 16/225 (7%) Query: 247 GVGHP-ILRISSVRA--GHVDQNDIRFLECSESELN--RHKLQDGDLLFTRYNGSLEFVG 301 G HP L + ++ + G + + + + + + + ++GD+LF + L Sbjct: 32 GSDHPGYLGLENIESWTGRIIEVESKRDDEPADQSAGLANIFREGDVLFCKLRPYLAKAC 91 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 ++ + +L+ R ++ P ++ +P A+ + + + Sbjct: 92 HA-------PRDGVGSTELLVMRPSELLEPRFLLYSILTPDFVGAV-DASTFGAKMPRAN 143 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 I S V +PP++EQ I +++ A D + + LA + +++++ Sbjct: 144 WDFIGSLEVKVPPLEEQRLIANYLDRETAGIDGLIAEKERMLALLEEKRAALISRVVTRG 203 Query: 422 LTAQWRAENPDLIS--GENSAAALLEKIKAERAASGGKKASRKKS 464 L + P GE L+++K GG ++ S Sbjct: 204 LDPNAPLK-PSGQEWLGEIPVHWGLQRLKQLAEVRGGLTLGKQYS 247 >UniRef50_A8YFX5 HsdS protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFX5_MICAE Length = 406 Score = 213 bits (541), Expect = 2e-53, Method: Composition-based stats. 
Identities = 87/426 (20%), Positives = 164/426 (38%), Gaps = 31/426 (7%) Query: 6 LPEGWVIAPVSTVTTLIRG----VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 LP+ W + + + +G + + + N F + Sbjct: 3 LPKTWSLVALGDIAAHEKGAIRRGPFGGSLKKEIFVESGFKVYEQQNAIKDDFQIGNYFI 62 Query: 62 VPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFS 119 + E + P D++I+ +G+ V ++ LP +RP ++I Sbjct: 63 DEDKFREMEGFNVKPHDLIISC-AGTIGKVAIVPYEALP--GVINQALMRIRPNPEIILC 119 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIK-PASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ +S Y+ I SAG+ + N+ + IP+PPL EQ+ IA LD Sbjct: 120 RYLKWLLESPKYQRDIFGKSAGSALKNLAAISEIKKCKIPLPPLEEQRRIAAILDKADGV 179 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 K ++L+ + G V + W + V + N NG Sbjct: 180 RRKRKEAIRLTEELLRSTFLEMFGDPVTN--PKGWEIVKLGSLVVGQPN---------NG 228 Query: 239 LSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 + K +E G P++ + + +G+ +D ++ R L ++ E+ + L GD+LF R + + Sbjct: 229 IFKKNHEYGGDTPVVWVKELFSGYTIDCSESRTLTPTDEEVKKFGLTKGDILFCRSSLNR 288 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + +G + + + L+ +IR RL K ++ P R ++ T Sbjct: 289 DGIGFNNVFDGMDF-SALFECHIIRVRLNQKKVNSIFLNYLLHFPGLRKQIIAKA-NTVT 346 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 I +IK LPP + Q + + ++ +E + NL S+L + Sbjct: 347 MSTIGQSEIKKIEFYLPPKELQDKFEIFLRKIATNRTKLENK------ESENLFNSLLQR 400 Query: 417 AFRGEL 422 AFRGEL Sbjct: 401 AFRGEL 406 >UniRef50_C5SDH7 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SDH7_CHRVI Length = 453 Score = 212 bits (540), Expect = 2e-53, Method: Composition-based stats. 
Identities = 82/440 (18%), Positives = 167/440 (37%), Gaps = 30/440 (6%) Query: 4 GKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLV 60 G++PE W++ + I G+ + +D +P IR + + DL Sbjct: 18 GEVPEHWILDRLKWSVEGCINGLW-----GDDPNGEDVIPCIRVADFDRAKNRVRAEDLT 72 Query: 61 FVPKNLVKE-SQKISPEDIVIAMSSGSKS-VVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + K ++ + D++I S G + VG F + Sbjct: 73 YRSISEEKRLNRSLKNGDLLIEKSGGGDNQPVGVVVLFDHNLNAVCSNFVARMPVRSNFS 132 Query: 119 SGFIAHFTKSSLY--RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 F+ + S LY R S+ I N+ AS+ IP + EQ +IA+ LD Sbjct: 133 PRFLCY-LHSVLYALRLNTKSIKQNTGIQNLDSASYLDERFGIPTVYEQGLIADFLDRET 191 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHS-VFKKL 226 A++D+ A +++ ++LK RQAV+ AV L +W P+H + Sbjct: 192 AKIDALIAEQQRLVELLKEKRQAVISHAVTKGLNPDAPMKDSGIEWLGEVPEHWVIVPLK 251 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDI-RFLECSESELNRHKLQD 285 + + ++ G+ G PI++ VR + + R E E+ R +L+ Sbjct: 252 HLTAPGRDIMYGIVLPGPNVDNGVPIVKGGDVRPHRLRLELLNRTTEAIEAPYARARLRP 311 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 D++++ +G L+ + D + R + ++ S Sbjct: 312 SDIVYSIRGS----IGDAELVPDELLDANITQD-VARISPDQTVNSLWLLFVMKSVRVFV 366 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + + +GI+ D+K + P ++EQ I +++ D + + A+ Sbjct: 367 QLEQR-SLGAAVRGINIFDLKRARIPFPDIQEQKTIATFLDRETTKLDALTAEAQTAITL 425 Query: 406 VNNLTQSILAKAFRGELTAQ 425 + ++++ A G++ + Sbjct: 426 LQERRTALISAAVTGKIDVR 445 Score = 100 bits (249), Expect = 1e-19, Method: Composition-based stats. 
Identities = 43/235 (18%), Positives = 85/235 (36%), Gaps = 14/235 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR--AGHVDQNDIR 269 +W P+H + +L + NGL P +R++ V D+ Sbjct: 15 EWLGEVPEHWILDRLK--WSVEGCINGLWGDDPNGEDVIPCIRVADFDRAKNRVRAEDLT 72 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEF-VGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + SE + L++GDLL + G VGV L + N + + + R + + Sbjct: 73 YRSISEEKRLNRSLKNGDLLIEKSGGGDNQPVGVVVLFD--HNLNAVCSNFVARMPVRSN 130 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P ++ S A +K +G + + + +P V EQ I +++ Sbjct: 131 FSPRFLCYLHSVLYALRLNTKSIKQNTGIQNLDSASYLDERFGIPTVYEQGLIADFLDRE 190 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + + + Q++++ A + NPD ++ L Sbjct: 191 TAKIDALIAEQQRLVELLKEKRQAVISHAVT-------KGLNPDAPMKDSGIEWL 238 >UniRef50_Q8GN10 Putative type I specificity subunit HsdS n=3 Tax=Campylobacter jejuni RepID=Q8GN10_CAMJE Length = 420 Score = 211 bits (538), Expect = 3e-53, Method: Composition-based stats. Identities = 92/428 (21%), Positives = 165/428 (38%), Gaps = 22/428 (5%) Query: 6 LPEGWVIAPVSTVTT-----LIRGVTYKKEQAINYLKDDYLPLIRANN-IQNGKFDTTDL 59 LP+GW + + + + + RG + ++ + + + N I N Sbjct: 4 LPQGWKMETLGEILSSDKYSIKRG-PFGSTLKKSFFVEKGIRIFEQYNPINNDPHWKRYF 62 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPEKL-I 117 + K E+ K + D++I+ S +GK E +R + I Sbjct: 63 ISHEKFQELEAFKATEGDLLISCS----GTLGKIVELPKDTEMGIINQSLLKIRLNNIKI 118 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNI-KPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + ++ S + + KI + G+ I NI I IP+PPL +Q+ I LD Sbjct: 119 LNSYFIYYFNSPIMQEKILESTLGSAIKNIASVKILKQIEIPLPPLKKQERIVGILDESF 178 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTEL 235 ++D + EQ L Q+ L A N N++ PQ +K L + Sbjct: 179 VKIDESIKILEQNLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEIGNTSSG 238 Query: 236 RNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 L +K G L+ + G++D + E + + Q G LL Y Sbjct: 239 GTPLRNKKEYWENGSIKWLKSGELNDGYIDFIEENITEEAIENSSAKIFQKGTLLIAMYG 
298 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 + +G+ L + L + L +++ F R+ ++ Sbjct: 299 ATAGRLGILNLDSATNQAVCAF---LHKDNKNIKFLEKFLFYFLFF--IRDKIIKDSFGG 353 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 + Q IS IK+ + LPP+KEQ +I + ++ +F +++ L L QS+L Sbjct: 354 A-QPNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLL 412 Query: 415 AKAFRGEL 422 KAF+GEL Sbjct: 413 NKAFKGEL 420 Score = 144 bits (362), Expect = 8e-33, Method: Composition-based stats. Identities = 45/209 (21%), Positives = 84/209 (40%), Gaps = 5/209 (2%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 KLP+GW + + G T + + Y ++ + +++ + +G D + Sbjct: 216 ENYKLPQGWEWKSLGEIGNTSSGGTPLRNKK-EYWENGSIKWLKSGELNDGYIDFIEENI 274 Query: 62 VPKNLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + S KI ++IAM + +G + AF Sbjct: 275 TEEAIENSSAKIFQKGTLLIAMYGATAGRLGILNLDSATNQAVC-AFLHKDNKNIKFLEK 333 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ +F R+KI S G NI + IP+PPL EQ+ IA+ LD + + Sbjct: 334 FLFYFLF--FIRDKIIKDSFGGAQPNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTK 391 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K + + + + +Q++L A G+L Sbjct: 392 ALKELYTKELKDYEELKQSLLNKAFKGEL 420 >UniRef50_D2TNZ5 Putative type I restriction modification system HsdS component n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TNZ5_CITRO Length = 538 Score = 211 bits (536), Expect = 7e-53, Method: Composition-based stats. 
Identities = 87/430 (20%), Positives = 177/430 (41%), Gaps = 42/430 (9%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 +S + +G K + +P I +N + + + Sbjct: 11 VKLSELLITTKGR--KPANVGDRSSVREIPYIDIKAFENNEI-------TSYCSPENAVL 61 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 + D+++ +VG + + G+ + I +I +F S Sbjct: 62 CNETDVLMVWDGSRSGLVGMGIY------GALGSTLVAI-SIPFILPQYIYYFLLSKF-- 112 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 +++++ + G I +I P I+ PI ++ Q+I+ K+D L +D + E+ Sbjct: 113 DELNNNTRGMGIPHIDPVYLGEIDFPITSVSNQEILYSKIDQLYNLIDDGFTKTEKALAQ 172 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQ----------------------HSVFKKLNFES 230 + + A++GKLT+ WR+ Q S ++ + S Sbjct: 173 ISILWSLRITEALSGKLTKNWRDSNSQGKPLPVDIISINNQLEETLPVLPSDWRYVKLSS 232 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 ++ + G S K ++RI ++ G + ND++F +E E +++ L++ D+L Sbjct: 233 VIESISYGTSKKCTYEPQETGVIRIPNIVNGEICDNDLKFANFTEKEKDKYSLKEDDILI 292 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMN 349 R NGSL VG C +K + L+ L+R R+ + P Y++ SP R + Sbjct: 293 IRSNGSLNLVGACARVKS-KDTGYLFAGYLLRLRINLELVNPSYLKYALESPLLRKQIER 351 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K++SG I+ ++I+S ++ + ++EQ IV +E + + + Q+ N L + Sbjct: 352 IAKSSSGVNNINAEEIRSLIIPICSIEEQLVIVNELENIKYNLEAQQVQLRNLLEKSELT 411 Query: 410 TQSILAKAFR 419 + I+ AF Sbjct: 412 KKEIVKDAFS 421 Score = 120 bits (300), Expect = 1e-25, Method: Composition-based stats. 
Identities = 48/211 (22%), Positives = 93/211 (44%), Gaps = 8/211 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP-K 64 LP W +S+V I T KK + +IR NI NG+ DL F Sbjct: 221 LPSDWRYVKLSSVIESISYGTSKK----CTYEPQETGVIRIPNIVNGEICDNDLKFANFT 276 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 K+ + +DI+I S+GS ++VG A + F + LR +L+ ++ Sbjct: 277 EKEKDKYSLKEDDILIIRSNGSLNLVGACARVKSKDTGYLFAGYLLRLRINLELVNPSYL 336 Query: 123 AHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +S L R +I ++ + + +NNI + IPI + EQ +I +L+ + +++ Sbjct: 337 KYALESPLLRKQIERIAKSSSGVNNINAEEIRSLIIPICSIEEQLVIVNELENIKYNLEA 396 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 + + + + + ++ ++ A + E Sbjct: 397 QQVQLRNLLEKSELTKKEIVKDAFSIGFKEM 427 Score = 61.3 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 33/168 (19%), Positives = 65/168 (38%), Gaps = 10/168 (5%) Query: 267 DIRFLEC---SESELNRHKLQDGDLLFTRYNGSLEFVG-VCGLLKKLQHQNLLYPDKLIR 322 +I +++ +E+ + + +L + + + G GL+ + L + Sbjct: 36 EIPYIDIKAFENNEITSYCSPENAVLCNETDVLMVWDGSRSGLVGMGIYGAL---GSTLV 92 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 A LP+YI F S + +N G I + + V Q + Sbjct: 93 AISIPFILPQYIYYFLLS---KFDELNNNTRGMGIPHIDPVYLGEIDFPITSVSNQEILY 149 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 +++QL+ D + ALA+++ L + +A G+LT WR N Sbjct: 150 SKIDQLYNLIDDGFTKTEKALAQISILWSLRITEALSGKLTKNWRDSN 197 >UniRef50_B3R3C2 Type I restriction-modification methylase S subunit n=1 Tax=Cupriavidus taiwanensis RepID=B3R3C2_CUPTR Length = 458 Score = 210 bits (535), Expect = 7e-53, Method: Composition-based stats. 
Identities = 93/455 (20%), Positives = 170/455 (37%), Gaps = 36/455 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFV 62 G +P W + + K + + +D + + + I + G + V Sbjct: 18 GDMPAHWQVRRLRFAAEFN----PSKSEVSHLDRDTLVSFLPMDAIGEEGSLVLEQVRQV 73 Query: 63 PKNLVKESQKISPEDIVIAMSS----GSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI- 117 + + D+ A + K V + + F + V RP + Sbjct: 74 SQ-VETGYTYFHEGDVAFAKITPCFENGKGAVMRGLLGGVGFGTTE---LIVARPRSDVT 129 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASF-DLINIPIPPLAEQKIIAEKLDTLL 176 S ++ S +R GA P F I PPL+EQ I L + Sbjct: 130 CSEYLHWLFCSIPFRKLGEGAMYGAGGQKRVPEDFARDFAIAFPPLSEQNAIVTFLYSET 189 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLN 227 +++D+ + +++ +L RQA + V L K W P H K++ Sbjct: 190 SKIDTLISEQDKLLVLLAEKRQATISRIVTRGLEPKVQIKSVGADWLGEIPIHWQAKRVK 249 Query: 228 FESILTELRNGLSSK----PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL 283 + + + G S + P E +L++ V G D + + L + L Sbjct: 250 --WLTSSIEQGWSPQCENYPAEGENEWGVLKVGCVNGGVFDAAENKKLPPELEPFPEYSL 307 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPS 342 + GDLL +R N + E VG ++ K H+ LL DKL R RL + PE++ + ++ Sbjct: 308 RKGDLLISRAN-TRELVGSAAVVPKDFHR-LLLCDKLYRLRLDQAKCTPEFLAAYLATGE 365 Query: 343 ARNAM-MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 AR + + +S I I +V LPP +EQA I+ + + + N Sbjct: 366 ARGQIELGATGASSSMLNIGQSVIMDLLVPLPPAEEQAAIMDFLNAELDRLERLSLAANK 425 Query: 402 ALARVNNLTQSILAKAFRGELTAQWRAENPDLISG 436 ++ + +++ A G++ R PD ++ Sbjct: 426 SIDLLKARRTALITAAVTGKIDV--RNAVPDTLAA 458 >UniRef50_A2TPX3 RmeS n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TPX3_9FLAO Length = 395 Score = 210 bits (535), Expect = 8e-53, Method: Composition-based stats. 
Identities = 79/416 (18%), Positives = 163/416 (39%), Gaps = 33/416 (7%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL-VFVPKNLVKES-- 70 ++TV + G +K + + +PL+R +N +G+ D ++V +K Sbjct: 7 TLTTVCAIKNGFAFKSKDYLT----KGIPLLRISNFNDGEVYINDNQIYVDAKYLKSKND 62 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP--EKLIFSGFIAHFTKS 128 + D++IA+S + GK + F G+++ + S + ++ Sbjct: 63 FIVEKGDVLIALSGATT---GKYGIYNFDFPSLLNQRIGLIKSGESDTLNSRYFYYYLN- 118 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + +++I + GA NI IP+PPL QK IA+ LD A D+T ++ Sbjct: 119 -ILKSEILRNAGGAAQPNISTKKIGTFEIPLPPLETQKRIAQILDDAAALRDTTAQLLKE 177 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 + + + G V F N S L G+ +E Sbjct: 178 YDLLAQSIFLEMFGDPVMNPKEWIKTRFA---------NLVSSNCPLTYGIVQPGDEYEN 228 Query: 249 GHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G P +R + + ++ ++++ ++ + S + +R L+ G++L + VGV + Sbjct: 229 GIPCVRPVDLTSQYISVDNLKKIDPAISNKFSRTILEGGEILLSVRGS----VGVISIAD 284 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + + K + Y + + +N + K + I+ KD++ Sbjct: 285 DSLKGANVTRGIVPIWFDKKISNRLYFYYLYKTKRIQNQIKRLSK-GATLVQINLKDLRE 343 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++ PP++ Q + ++ A + + L +L +L KAF+GEL Sbjct: 344 LKIIQPPIELQNQFANKI----ALIEQQKALAKQELQESEDLFNCLLQKAFKGELV 395 Score = 95.5 bits (236), Expect = 4e-18, Method: Composition-based stats. 
Identities = 39/205 (19%), Positives = 79/205 (38%), Gaps = 7/205 (3%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+ W+ + + + +TY Q + ++ +P +R ++ + +L + + Sbjct: 197 PKEWIKTRFANLVSSNCPLTYGIVQPGDEY-ENGIPCVRPVDLTSQYISVDNLKKIDPAI 255 Query: 67 VKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + S+ I ++ GS V+ A L + +K+ + + Sbjct: 256 SNKFSRTILEGGEILLSVRGSVGVI-SIADDSLKGANVTRGIVPIWFDKKISNRLYFYYL 314 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 K+ +N+I LS GA + I + I PP+ Q A K+ A ++ KA Sbjct: 315 YKTKRIQNQIKRLSKGATLVQINLKDLRELKIIQPPIELQNQFANKI----ALIEQQKAL 370 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLT 210 +Q Q + +L A G+L Sbjct: 371 AKQELQESEDLFNCLLQKAFKGELV 395 >UniRef50_A1ZTI8 Type I restriction enzyme StySJI specificity protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZTI8_9SPHI Length = 436 Score = 210 bits (535), Expect = 8e-53, Method: Composition-based stats. Identities = 71/436 (16%), Positives = 175/436 (40%), Gaps = 30/436 (6%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 W + + G + + + P +R ++ NG DT++L +V + + Sbjct: 2 SNWEEKKIQDFAEVKGGKRLPAGKEFSLTPTKH-PYLRVTDMVNGSIDTSNLQYVDEEIE 60 Query: 68 K--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 K + +IS +D+ I + +G+ VG + A + +I ++ ++ Sbjct: 61 KVIRNYRISADDLYITI-AGTIGSVGNIPELLHNALLTENAAKITNIDKSIIDKNYLQYY 119 Query: 126 TKSSLYRNKISS-LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S +++I+ + G + + + + PPL Q+ IA+ L T+ +D T+ Sbjct: 120 LSSEETKSQINKEIGIGGGVPKLALYRILNLVVQYPPLTYQRKIAQILSTVDRVIDGTQR 179 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE---------- 234 E+ + + Q + ++ + + +++K I + Sbjct: 180 AIEKYQTLKEGLMQDLFSRGIDVSTGKLRPPRQVAPELYQKTELGWIPKDYSFVRLEDLT 239 Query: 235 --LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLF 290 + +G P + G P LR++ V+ ++ + ++F+ E ++ R + GDLL Sbjct: 240 LKIIDGTHHTPKYTESGIPFLRVTDVQTKDINFDKLKFVSLEEHQILTKRCNPEKGDLLL 299 Query: 291 
TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 ++ +G+ ++ ++ LI+ + EY+ F S +N ++ Sbjct: 300 SKNG----TIGIPKVVDWDWEFSIFVSLALIKPN-HRLINVEYLLYFLKSELIKNQIIRQ 354 Query: 351 VKTTSGQKGISGKDIKSQVVLLPP-VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K + + ++I+ + PP ++EQ IV ++ L + + + ++ L Sbjct: 355 AKQGT-VTNLHLEEIREFKIAQPPSIQEQNNIVEKLNNL----EKQIESEQKSFQKLKTL 409 Query: 410 TQSILAKAFRGELTAQ 425 Q+++ G+++ + Sbjct: 410 KQALMQDLLTGKVSVE 425 Score = 117 bits (292), Expect = 1e-24, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 90/215 (41%), Gaps = 13/215 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +P+ + + +T I T+ + + +P +R ++Q + L F Sbjct: 222 ELGWIPKDYSFVRLEDLTLKIIDGTHHTPK----YTESGIPFLRVTDVQTKDINFDKLKF 277 Query: 62 VPKN---LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLI 117 V ++ + D++++ +G +E S +++P +LI Sbjct: 278 VSLEEHQILTKRCNPEKGDLLLSK----NGTIGIPKVVDWDWEFSIFVSLALIKPNHRLI 333 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLL 176 ++ +F KS L +N+I + + N+ I PP + EQ I EKL+ L Sbjct: 334 NVEYLLYFLKSELIKNQIIRQAKQGTVTNLHLEEIREFKIAQPPSIQEQNNIVEKLNNLE 393 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE 211 Q++S + F+++ + + Q +L G V+ + E Sbjct: 394 KQIESEQKSFQKLKTLKQALMQDLLTGKVSVEAAE 428 >UniRef50_C0XBA7 Type I restriction-modification system, S subunit n=1 Tax=Lactobacillus gasseri JV-V03 RepID=C0XBA7_9LACO Length = 468 Score = 210 bits (535), Expect = 9e-53, Method: Composition-based stats. 
Identities = 81/428 (18%), Positives = 172/428 (40%), Gaps = 59/428 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP W + + T G YKK++ ++ D P++R N+ F + + Sbjct: 54 ELPSSWDWITLGSGVTFYNGRAYKKKELLS--DDKLTPVLRVGNL----FTNSSWYYSDL 107 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 +L E++ I D++ A S+ + H + + +I + F+ + Sbjct: 108 SL-DENKYIDNGDLIYAWSASFGPKIWNGGHVIYHYH-----IWKLEYDNNVIDTNFLYY 161 Query: 125 FTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 F RN + G+ + +I + + + P+PPL EQ IA K+ L A + + Sbjct: 162 FLLDK--RNVVGETDLHGSTMKHITKTNMEHLPFPLPPLEEQSRIAAKIAQLFALLRKVE 219 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV--------------------- 222 + +Q ++ + VL A+ GKL ++ + EP + Sbjct: 220 SSTQQYAKLQTLLKSKVLDLAMRGKLVKQDPHDEPASVLLEKIKAEKEQLIKEKKIKKSK 279 Query: 223 -----------------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQ 265 ++ + + +R G ++ +G +LRI+ ++ +V+ Sbjct: 280 PLPPITDKEKPFDIPDSWEWVRLGEVAESIRYGYTASAQATGNA-KLLRITDIQNNNVNW 338 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 N + S+ +L L D+L R G+ +G +K++ ++ LIR R Sbjct: 339 NMVPLCNISDMKLKDLSLHKKDILIARTGGT---IGKNYFVKQIVEPT-VFASYLIRVRN 394 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + +I+ +P N ++ K+ +GQ ++ +++ + +PP++EQ IV ++ Sbjct: 395 INKKVSNFIQYVLDAPIYWN-FISAKKSGTGQPNVNAAKLENFIFPIPPLEEQNRIVDKI 453 Query: 386 EQLFAYAD 393 L + Sbjct: 454 INLIDLFN 461 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. 
Identities = 61/289 (21%), Positives = 110/289 (38%), Gaps = 44/289 (15%) Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE---------- 234 + + + R+ +L A+ GKL + + EP + KK N + + E Sbjct: 2 KTNTLEFDAQALREKILDLAMRGKLVPQDPDDEPASELLKKNNLKRSIEEPHELPSSWDW 61 Query: 235 --------LRNGLSSKPNE---SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL 283 NG + K E P+LR+ ++ + + + S E + Sbjct: 62 ITLGSGVTFYNGRAYKKKELLSDDKLTPVLRVGNL----FTNSSWYYSDLSLDENK--YI 115 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 +GDL++ ++ S K +++Y + + + + +F Sbjct: 116 DNGDLIYA-WSASFGP-------KIWNGGHVIYHYHIWKLEYDNNVIDTNFLYYFLLDK- 166 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 RN + S K I+ +++ LPP++EQ+ I ++ QLFA +E Sbjct: 167 RNVVGETDLHGSTMKHITKTNMEHLPFPLPPLEEQSRIAAKIAQLFALLRKVESSTQQYA 226 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 L +L A RG+L Q + P A+ LLEKIKAE+ Sbjct: 227 KLQTLLKSKVLDLAMRGKLVKQDPHDEP--------ASVLLEKIKAEKE 267 Score = 118 bits (296), Expect = 4e-25, Method: Composition-based stats. Identities = 44/181 (24%), Positives = 77/181 (42%), Gaps = 10/181 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +P+ W + V IR QA K L+R +IQN + + Sbjct: 292 DIPDSWEWVRLGEVAESIRYGYTASAQATGNAK-----LLRITDIQNNNVNWNMVPLCNI 346 Query: 65 NLVK-ESQKISPEDIVIAMSSGSKSVVGKS-AHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + +K + + +DI+IA + G+ +GK+ + + F ++ +R S FI Sbjct: 347 SDMKLKDLSLHKKDILIARTGGT---IGKNYFVKQIVEPTVFASYLIRVRNINKKVSNFI 403 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + + +Y N IS+ +G N+ A + PIPPL EQ I +K+ L+ + Sbjct: 404 QYVLDAPIYWNFISAKKSGTGQPNVNAAKLENFIFPIPPLEEQNRIVDKIINLIDLFNVG 463 Query: 183 K 183 K Sbjct: 464 K 464 >UniRef50_C9NRR1 Type I restriction-modification system specificity subunit S n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NRR1_9VIBR Length = 424 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. 
Identities = 72/426 (16%), Positives = 151/426 (35%), Gaps = 26/426 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G+ W + +T+ + + Y +P IR+ N+ + D+ ++P Sbjct: 13 GEFEGSWKTTKLGALTSKVGSGATPRGGEKAYSTS-GIPFIRSQNVNYNRLLLNDIRYIP 71 Query: 64 KNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSG 120 +N ++ I P+DI++ ++ S +G+S F + + ++R + Sbjct: 72 ENTHASMKRSQIQPKDILLNITGAS---IGRSCVVPDCFQDGNLNQHVCIIRLKND-DPY 127 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F S + AG + S + P L EQ+ IA L + ++ Sbjct: 128 FTQSLLASYRGEKLVFQGMAGGGREGLNFESIKGFKMAFPTLPEQQKIASFLSKVDEKIA 187 Query: 181 STKARFEQIPQILKRFRQAVL--------GGAVNGKLTEKWRNFEPQHSVFKKLNFESIL 232 + +++ + K Q + G T +++ + + Sbjct: 188 LLTEKKDKLAEYKKGVMQQLFNGKWQEQDGQLTFIPPTLRFKADDGSEFPDWEEKALGDF 247 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 + +G P G P + V A ++ E E R L+ GD+L TR Sbjct: 248 ARIYDGTHQTPKYVDEGVPFYSVEHVTANQFEKTKYISEEVYAKECKRVTLKKGDILLTR 307 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 VG L+ + L++ + + +Y+ F SP+ ++ + + Sbjct: 308 IGS----VGDVRLIDWDVRASFYVSLALVKY--NDEIVGQYLASFMQSPNFQSELWKRMI 361 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + K I+ +I +V +P EQ +I + + D N+ L + + Sbjct: 362 HVAFPKKINLGEIGHCLVSVPSRDEQTKIANFLSAIDQKID----LANSELEKAKEWKRG 417 Query: 413 ILAKAF 418 +L + F Sbjct: 418 LLQQMF 423 Score = 104 bits (259), Expect = 8e-21, Method: Composition-based stats. 
Identities = 37/226 (16%), Positives = 88/226 (38%), Gaps = 14/226 (6%) Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK---PNESGVGHPILRISSVRAG 261 + ++ F +K ++ +++ +G + + S G P +R +V Sbjct: 1 MTEQMNVPKLRFGEFEGSWKTTKLGALTSKVGSGATPRGGEKAYSTSGIPFIRSQNVNYN 60 Query: 262 HVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL 320 + NDIR++ E + + + R ++Q D+L S +G ++ Q+ + Sbjct: 61 RLLLNDIRYIPENTHASMKRSQIQPKDILLNITGAS---IGRSCVVPDC-FQDGNLNQHV 116 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 RL D P + + +S + + G++G++ + IK + P + EQ + Sbjct: 117 CIIRLKND-DPYFTQSLLASYRGEKLVFQGMAGG-GREGLNFESIKGFKMAFPTLPEQQK 174 Query: 381 IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 I + + D + ++ + ++ + F G+ Q Sbjct: 175 IASFL----SKVDEKIALLTEKKDKLAEYKKGVMQQLFNGKWQEQD 216 >UniRef50_C2QHW5 Putative uncharacterized protein n=2 Tax=Bacillus cereus RepID=C2QHW5_BACCE Length = 441 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. Identities = 80/438 (18%), Positives = 179/438 (40%), Gaps = 39/438 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+P+ W + +S++ ++ + ++ + L + + ++ ++ Sbjct: 17 IGKVPKHWELKKISSIFE-------QRNEKVSDKDFEPLSVTKMGILKQ----LENVAKT 65 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGF 121 N +K+ D VI S K G S F+ S C V++P+ + + + Sbjct: 66 DNN--DNRKKVLKNDFVINSRSDRKGSCGVS-----KFDGSVSLICTVIKPKTINTYMDY 118 Query: 122 IAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 H ++ ++ + G ++ + K F I IPIPP EQK I L+ + + Sbjct: 119 YHHLFRNKMFSEEFYRWGRGIVDDLWSTKWDEFKRILIPIPPHEEQKSIVSYLNHIYEAI 178 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 + +Q + +++++++++ AV L +W P+H + K+L+F S Sbjct: 179 EELITHKQQQIETIQQYQRSLITEAVTSGLNPHAKMKDSSVEWIGEMPEHWITKRLDFVS 238 Query: 231 ILTEL--RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGD 287 ++ GL++ G+ L I +++ +D ++ ++ E E LQ GD Sbjct: 239 VVKARLGWKGLTAS-EYQENGYIFLAIPNIKKFQIDFENVNYISEKRYKESPEIMLQVGD 297 Query: 
288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L + +L V V L N + R D ++ + S + + Sbjct: 298 VLLAKDGSTLGEVNVVRYLPSPATVN----SSIAVIRPKGDLHSVFLYYYLKSNYIQK-I 352 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + K G + KDI ++ +PP+ EQ +I + ++ + + + + + + Sbjct: 353 IQKKKDGMGVPHLFQKDINKFIIQVPPLDEQVKIAKYLDGKISEINNLIIETQEQIDILQ 412 Query: 408 NLTQSILAKAFRGELTAQ 425 QS++ + G++ + Sbjct: 413 QYRQSLVYEVVTGKIDVR 430 >UniRef50_D1J921 Putative type I restriction enzyme, DNA specificity subunit n=1 Tax=uncultured archaeon RepID=D1J921_9ARCH Length = 445 Score = 210 bits (534), Expect = 9e-53, Method: Composition-based stats. Identities = 77/442 (17%), Positives = 158/442 (35%), Gaps = 39/442 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-- 60 G++PE W P+ V ++ G + Y + P +RA NI K DT D+ Sbjct: 16 IGEIPEHWEAKPIKYVGDIVLGKMLTPDDKEGYFRK---PYLRAQNITWEKVDTEDIKEM 72 Query: 61 -FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRPEKLIF 118 F K L ++ D++++ VG++A + EC + + Sbjct: 73 WFSEKEL--SQYRLKENDLLVS----EGGEVGRTAIWQNELNECYIQNSVHKITIKSKNN 126 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + + + S+ +I ++ I P EQ+ IA LD Q Sbjct: 127 PHYYLYHFQIYGKTGYFDSIVNRVSIAHLTREKLKEIMFLSPTFHEQQTIANYLDRKTHQ 186 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D+ +++ +LK R A++ AV L +W P+H +K+ Sbjct: 187 IDTFIENKQKLIDLLKEQRAAIINQAVTKGLNPNVKLKDSGIEWLGEIPEHWELRKVGR- 245 Query: 230 SILTELRNGLSSKPNESGV----GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 + +G + K G + + G +D+ + E + E + K+ Sbjct: 246 -SFNLIGSGTTPKSENIGYYENGTINWVITGDLNDGILDKTSKKITEKALDEYSTLKIYP 304 Query: 286 -GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 G LL Y ++ + L + + E+ +F A Sbjct: 305 VGTLLIAMYGATIGKI-------SLMNFEGCVNQACCALSNSPYLSNEFSFYWFL---AN 354 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + GQ IS + ++S + PP EQ I+ +++ D + ++ + Sbjct: 355 
KQNIINMSFGGGQPNISQEVVRSLKIPTPPSSEQQAIIYHLDEQTTRIDKLMERQGRQIE 414 Query: 405 RVNNLTQSILAKAFRGELTAQW 426 + +++++ G++ + Sbjct: 415 HLKEYRTTLISEVVTGKIDVRD 436 >UniRef50_D1XRZ5 Restriction modification system DNA specificity domain protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XRZ5_9ACTO Length = 412 Score = 210 bits (534), Expect = 1e-52, Method: Composition-based stats. Identities = 85/411 (20%), Positives = 161/411 (39%), Gaps = 38/411 (9%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF---DTTDLVFVPKNLVK 68 PV + + G P +R N+ G+ D ++ F P V Sbjct: 9 WVPVRELGEVRMGKQLSPSSREAA---GQFPYLRVANVHLGRIEYVDVNEMGFTPAERV- 64 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHL-PFECSFGAFCGVLRPEKLIFSGF----IA 123 + + P DI++ S +VG+SA E F RP I S + Sbjct: 65 -TYGLKPGDILLNE-GQSLELVGRSAIYDRAEGEFCFQNTLIRFRPNGCILSAYAQVVFE 122 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 H+ +S ++ +I ++ F + P+ P Q+ I LD+L + Sbjct: 123 HWLRSGVFAAIAKQT---TSIAHLGGDRFAALKFPLLPTGMQQRIVAVLDSLAELERRIE 179 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 A ++ + R+ ++ + R S +L L ++ +GL+ Sbjct: 180 ASIVKL----RSVRKGIISEQFS-------RADVEDGSPASRLRALDSLADVGSGLTLGG 228 Query: 244 NESG---VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 SG + P LR+++V+ G + +++ + + S++ R +++ D+L T G + V Sbjct: 229 ISSGGTLLEVPYLRVANVQDGFISTLEMKSVRVTPSDMERFRVRRDDVLVTE-GGDFDKV 287 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 G G + + L + + R R K+ L P ++ ++ SS + R + VK T+ Sbjct: 288 GR-GAVWDGRIDPCLNQNHVFRVRCDKEVLDPHFLSLYMSSAAGRRYFLRVVKQTTNLAS 346 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 I+ +K+ V PP++EQ V +L D Q L ++ L Sbjct: 347 INSSQLKAMPVPCPPLEEQRRTV----ELVGSCDEQIAQEEGELTKLRELK 393 Score = 101 bits (251), Expect = 7e-20, Method: Composition-based stats. 
Identities = 33/195 (16%), Positives = 77/195 (39%), Gaps = 7/195 (3%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK-ESQKI 73 + ++ + G+T + L + +P +R N+Q+G T ++ V E ++ Sbjct: 214 LDSLADVGSGLTLGGISSGGTLLE--VPYLRVANVQDGFISTLEMKSVRVTPSDMERFRV 271 Query: 74 SPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLI-FSGFIAHFTKSSLY 131 +D+++ G VG+ A + C +R +K + F++ + S+ Sbjct: 272 RRDDVLVTE-GGDFDKVGRGAVWDGRIDPCLNQNHVFRVRCDKEVLDPHFLSLYMSSAAG 330 Query: 132 RNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 R + + +I + + +P PPL EQ+ E + + Q+ + ++ Sbjct: 331 RRYFLRVVKQTTNLASINSSQLKAMPVPCPPLEEQRRTVELVGSCDEQIAQEEGELTKLR 390 Query: 191 QILKRFRQAVLGGAV 205 ++ +L V Sbjct: 391 ELKVGLVDDLLSRRV 405 >UniRef50_C7RNT4 Restriction endonuclease S subunits-like protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RNT4_9PROT Length = 403 Score = 209 bits (533), Expect = 1e-52, Method: Composition-based stats. Identities = 89/416 (21%), Positives = 170/416 (40%), Gaps = 25/416 (6%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK-ESQ 71 +S V IRG+T+K E + +R N+Q + D D+ +P++ V+ E Q Sbjct: 7 VELSDVAAFIRGITFKPEDVVPVDTPGAAACMRTKNVQT-ELDLCDVWGIPQSFVRREDQ 65 Query: 72 KISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEK-LIFSGFIAHFTKSS 129 + P D++++ S+ S ++VGK LP+ +FG F VLR + ++ + S Sbjct: 66 YLIPGDVLVS-SANSWNLVGKCCLVPSLPWRSTFGGFISVLRANPAKVDPRYLFRWFASD 124 Query: 130 LYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + + S I+N+ + + +P L EQ+ IAE LD A +A Q Sbjct: 125 RTQATVRSFGQQTTNISNLNVGRCLKLKLHLPALPEQRRIAEILDKADALRAKRRAALAQ 184 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 + + + + G + F F S L + Sbjct: 185 LDALTQSIFLDMFGDPATNPKGWPCAQLCTLGTKFSDGPFGSNL--------KSDHYRAS 236 Query: 249 GHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G ++R+ ++ G D ++ E L +H+ GD+L G+L + ++ Sbjct: 237 GVRVVRLQNIGVGEFLGADAAYISEDHFRNLKKHECLPGDVLV----GTLGDPNLRACIQ 292 Query: 308 
KLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 L ++ R + A E++ + P + M + + IS ++ Sbjct: 293 PRWLSVALNKADCVQIRPDERTATSEFVCFLLNQPGTQR-MAQDLMHGQTRIRISMGRLR 351 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 S + +PP+ Q + ++V A +T++ ALA+++ L S+ +AF G+L Sbjct: 352 SLAIPVPPIGLQRDFTQQV----AAMETLKTAHRAALAQLDALFASLQHRAFLGDL 403 Score = 95.9 bits (237), Expect = 3e-18, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 76/206 (36%), Gaps = 9/206 (4%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+GW A + T+ T + ++ + + ++R NI G+F D ++ ++ Sbjct: 204 PKGWPCAQLCTLGTKFSDGPFGSNLKSDHYRASGVRVVRLQNIGVGEFLGADAAYISEDH 263 Query: 67 VKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSGFIA 123 + +K P D+++ + G ++ A C +RP++ S F+ Sbjct: 264 FRNLKKHECLPGDVLVG-TLGDPNLRA-CIQPRWLSVALNKADCVQIRPDERTATSEFVC 321 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + L G I + IP+PP+ Q+ +++ + + + Sbjct: 322 FLLNQPGTQRMAQDLMHGQTRIRISMGRLRSLAIPVPPIGLQRDFTQQVAAMETLKTAHR 381 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL 209 A Q+ ++ A G L Sbjct: 382 AALAQL----DALFASLQHRAFLGDL 403 >UniRef50_A5GB19 Restriction modification system DNA specificity domain n=1 Tax=Geobacter uraniireducens Rf4 RepID=A5GB19_GEOUR Length = 420 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. 
Identities = 91/426 (21%), Positives = 171/426 (40%), Gaps = 29/426 (6%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + +S G T + A +Y +P +++ + T+ + + Sbjct: 3 WPMVEISRFCQTGSGGTPSRNNAGDYY-GGNIPWVKSGELNQEFVLNTEERITELAIKES 61 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 S KI P ++ G + VGKSA + + A C ++ + + ++ + K+ Sbjct: 62 SAKIVPAGAILVAMYG--ATVGKSALLGID-AATNQAICNIIPDPEAADTRYVWYALKNQ 118 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 L + + G NI IP+P L+EQ+ I E L Q D + + Sbjct: 119 L--PYLLAQRVGGAQPNISQQIIKNTQIPLPLLSEQRRIVEIL----DQADHLRKLRGEA 172 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG-- 247 + + A+ G T + ++ ++ ++ G S+ G Sbjct: 173 DKKAELILPALFNKMFGGPATN--------PMGWPEMPLRQVIAKVEAGWSAVSEARGCT 224 Query: 248 -VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 +L++S+V +G + + + +++ + GDLLF+R N + E V ++ Sbjct: 225 KDEFGVLKVSAVTSGRFLACEHKAVLVLQTDRGLLTPRRGDLLFSRAN-TRELVAASCVV 283 Query: 307 KKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMM-NCVKTTSGQKGISGKD 364 + H NL PDKL R L D A Y++ F + R+ + ++ IS + Sbjct: 284 ED-DHPNLFLPDKLWRLILHPDRATAMYLKELFWNNGFRDRFRASASGSSGSMLNISQEA 342 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 + + + +PP K Q E + L A I K+ A ++ L ++L +AF G LTA Sbjct: 343 MLNTIAPIPPFKLQEEYSAKAWSLAA----IAKERRLAGDALDTLWSNLLQRAFSGTLTA 398 Query: 425 QWRAEN 430 WR + Sbjct: 399 AWREAH 404 >UniRef50_D1YNY9 Type I restriction modification DNA specificity domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YNY9_9FIRM Length = 427 Score = 209 bits (532), Expect = 2e-52, Method: Composition-based stats. 
Identities = 87/431 (20%), Positives = 176/431 (40%), Gaps = 35/431 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P+ W + + +L + + D P + + Sbjct: 13 GMIPKSWD---LDKIVSLY-------SERSTKVSDKDYPALS---VTKQGIVPQLESAAK 59 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + I D VI S + G S +E S VL P+ + + + Sbjct: 60 TDNGDNRKLIKKNDFVINSRSDRRGSCGIS-----EYEGSCSLINIVLAPKNNMVNRYYN 114 Query: 124 HFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + K+ L+ ++ G ++ + K ++ I +P P L EQ+ IAE LDT AQ+D+ Sbjct: 115 YLFKTELFADEFYKWGNGIVDDLWSTKWSNMKNIMVPFPSLEEQQAIAEHLDTKCAQIDT 174 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A+ + + + L+ +++A++ AV L +W + P H K+L F + + Sbjct: 175 IIAKEQSVIEKLQEYKRAIITYAVVKGLDITAETADSGIEWIDSIPSHWKIKRLIFSAYI 234 Query: 233 TELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLF 290 K +E GHP L +++ + D+ F+ E KL+ GDLL Sbjct: 235 RARLGWKGLKADEYTSEGHPFLSAVNIQNDKLVWEDLNFINDDRYDESPEIKLEIGDLLL 294 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 + +G C ++ +L + L + Y+ FF S +N + + Sbjct: 295 VKDGAG---IGKCAVVDQLPYGTATTNSSLGVITPYPELNSMYLYYFFESAIFQNYI-SR 350 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 +K G ++ ++K+ +V++PP EQ IV +++ A D++ + + + ++ Sbjct: 351 IKNGMGVPHLTQGNLKNIMVIIPPYCEQEAIVTYLDEKCANLDSVILRKQSRIDKLTEYK 410 Query: 411 QSILAKAFRGE 421 +S++ + G+ Sbjct: 411 KSLIYEVVTGK 421 >UniRef50_UPI0001AF6F3B polypeptide HsdS n=1 Tax=Mycobacterium kansasii ATCC 12478 RepID=UPI0001AF6F3B Length = 409 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. 
Identities = 84/439 (19%), Positives = 179/439 (40%), Gaps = 37/439 (8%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W + + + G K + + +P +R N+Q G+ DT DL+ + + Sbjct: 2 NWQVRQLGEIAETALGKMLDKGKQKGLPQ---VPYLRNVNVQWGRVDTDDLLTMELADDE 58 Query: 69 ESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPEKLIFSGFIAHFT 126 + +S D+++ +G+SA H + ++ +RP K + F+ + Sbjct: 59 RERFGVSAGDLLVC----EGGEIGRSAIWHGQADYIAYQKALHRIRPGKSLDVRFLRYLL 114 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 + ++ L+ G+ I ++ + +P+PPL EQ I + ++ L+++++ + Sbjct: 115 EHYSLNGTLAGLATGSTIAHLPQQQLRRVPVPVPPLNEQCRIVDLIEDHLSRLEAGQRWL 174 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWR--NFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + L+ F A L + + ++R + ++ K L+ + + Sbjct: 175 SVGERKLEAFWLAALSASRRALVGAQFRTIGDVAETTLGKMLDAKRQV------------ 222 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G P LR +VR G D +D++ ++SE+ R ++ GD++ C Sbjct: 223 --GSPTPYLRNINVRWGEFDLSDVQLTPLTDSEVQRFDVRPGDVMACEGGEPGRCAVWCR 280 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + ++ Q L IR R + L ++ + + R+ N + T + K + + Sbjct: 281 PVGEVAFQKAL---HRIRVRNPGEVLTSFLALMLE-EAIRSGRCNRMFTGTTIKHLPQEK 336 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 ++ + +P + Q + V + +L + + + NA AR+ + S+L AF G L A Sbjct: 337 LRVIEIPVPALHTQRQAVDCLAELVGAQERLRAALANAAARIAAMRSSLLTAAFSGRLIA 396 Query: 425 QWRAENPDLISGENSAAAL 443 S SA L Sbjct: 397 --------AKSSLPSAEEL 407 >UniRef50_Q8TN78 Type I restriction modification enzyme protein S n=1 Tax=Methanosarcina acetivorans RepID=Q8TN78_METAC Length = 391 Score = 209 bits (531), Expect = 2e-52, Method: Composition-based stats. 
Identities = 83/422 (19%), Positives = 168/422 (39%), Gaps = 43/422 (10%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV----FVPKN 65 W P+ ++ T+I G T K + Y D +P + + D TD + Sbjct: 4 WPHQPIISLGTIITGSTPKTSEEHFYGGD--IPFVTPAEL-----DQTDPIMNAARTLSE 56 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + ++ PE V+ GS VG + S V+ K+I+ F + Sbjct: 57 TGSQESRLLPEGTVMVCCIGSLGKVGIAGRT----VASNQQINSVIFDPKIIWPRFGFYA 112 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 + L ++++ L+ + + + F + IP+PPL EQK IA+ LD A + Sbjct: 113 CR--LLKSRLEVLAPATTVPIVNKSKFGQLEIPVPPLPEQKRIADILDRAEALRAKRRVA 170 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG----LSS 241 E + ++ + + G +V+ + +K+ + + ++ G L Sbjct: 171 LEHLDELTQAIFIDMFGDSVSNPMG------------WKRYPLKHCVNHIQIGPFGSLLH 218 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFV 300 K + G P++ + + G + + + + + +EL ++LQ GD++ R + Sbjct: 219 KEDYVFGGIPLINPTHIENGKIVPDVNQSITVQKLAELQLYQLQQGDVIMGRRGE----M 274 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G C ++ + L L A+ Y++ SS S R + + + Sbjct: 275 GRCAIVGSEHNGTLCGTGSLFIRPDESKAIAMYLQATLSSESMRKHLEGF-SLGATLPNL 333 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + + LPP++ Q E +E + ++ ++L ++ L S+ +AFRG Sbjct: 334 NRGIVGELAISLPPIELQKEFSHHIES----IEKLKTTYKSSLTEIDELFLSLQYRAFRG 389 Query: 421 EL 422 EL Sbjct: 390 EL 391 Score = 95.1 bits (235), Expect = 5e-18, Method: Composition-based stats. 
Identities = 36/207 (17%), Positives = 78/207 (37%), Gaps = 12/207 (5%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKN 65 P GW P+ I+ + +PLI +I+NGK + + Sbjct: 193 PMGWKRYPLKHCVNHIQIGPFGSLLHKEDYVFGGIPLINPTHIENGKIVPDVNQSITVQK 252 Query: 66 LVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAFCGVLRPEK-LIFSGFI 122 L + + ++ D+++ G + +G+ A + G +RP++ + ++ Sbjct: 253 LAELQLYQLQQGDVIM----GRRGEMGRCAIVGSEHNGTLCGTGSLFIRPDESKAIAMYL 308 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 S R + S GA + N+ + I +PP+ QK + ++++ + Sbjct: 309 QATLSSESMRKHLEGFSLGATLPNLNRGIVGELAISLPPIELQKEFSHHIESIEKLKTTY 368 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 K+ +I ++ A G+L Sbjct: 369 KSSLTEI----DELFLSLQYRAFRGEL 391 >UniRef50_B1LRG3 Type I restriction modification DNA specificity domain protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRG3_ECOSM Length = 428 Score = 208 bits (530), Expect = 3e-52, Method: Composition-based stats. Identities = 76/368 (20%), Positives = 153/368 (41%), Gaps = 22/368 (5%) Query: 71 QKISPEDIVIAMSSGSKSV----VGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFT 126 +I +D ++ +++ VG + + A+ V I+ F + Sbjct: 67 YQIFEKDDLVFKLIDLENIKTSRVGIVHERGIMSP----AYIRVSASSNSIYPRFYYWYF 122 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 +LY I + G N+ I +P+ ++ QK ++ LD ++DS Sbjct: 123 F-ALYLTNIYNKLGGGVRQNLTAGDLLEIPVPLIDISLQKQVSTFLDRETQRIDSLIEEK 181 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRN 237 + ++LK RQA++ V L +W P+H KK+ + Sbjct: 182 QTFIKLLKEKRQALISHVVTKGLYPNVEMQDSGIEWIGQVPKHWEVKKIKHI--CSNFMY 239 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 G S N+S VG+P+LRI ++++ +VD D+++ S+ + + L GD+L R NG+ Sbjct: 240 GTSQDCNQSDVGYPVLRIPNIKSTNVDFEDLKYANISDVDALTYLLSRGDILVIRTNGNP 299 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 VG L + L+ LI+ + ++ +S S R A+ +T+ G Sbjct: 300 NLVGQSALFD--SNGQYLFASYLIKLTPKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGN 357 Query: 358 
KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 +S + + + +PP+ EQ I + D + ++ + ++ + S++ A Sbjct: 358 YNLSIPSLANTSIAIPPIDEQKTITNYLSAATINIDLLIQETDKSIDLLKEHRTSLINAA 417 Query: 418 FRGELTAQ 425 G++ + Sbjct: 418 VTGKIDVR 425 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. Identities = 50/210 (23%), Positives = 94/210 (44%), Gaps = 6/210 (2%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+ W + + + Y Q N D P++R NI++ D DL + Sbjct: 218 IGQVPKHWEVKKIKHIC---SNFMYGTSQDCN-QSDVGYPVLRIPNIKSTNVDFEDLKYA 273 Query: 63 PKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + V + +S DI++ ++G+ ++VG+SA + F ++ L P++ + + F Sbjct: 274 NISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDSNGQYLFASYLIKLTPKQGVDTSF 333 Query: 122 IAHFTKSSLYRNKISSLSAGANIN-NIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + S R ++ S + N N+ S +I IPP+ EQK I L +D Sbjct: 334 LVEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAIPPIDEQKTITNYLSAATINID 393 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++ +LK R +++ AV GK+ Sbjct: 394 LLIQETDKSIDLLKEHRTSLINAAVTGKID 423 >UniRef50_B0RQ64 Type I site-specific DNA methyltransferase specificity subunit n=3 Tax=Xanthomonas campestris pv. campestris RepID=B0RQ64_XANCB Length = 415 Score = 208 bits (530), Expect = 3e-52, Method: Composition-based stats. 
Identities = 92/434 (21%), Positives = 171/434 (39%), Gaps = 33/434 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + ++ G T + Q Y D P ++ ++ N + TTD V Sbjct: 2 LPDGWRRTTLGNIGSVKSGSTPARSQHDRYFVDGKWPWVKTMDLTNSEILTTDEVITDAA 61 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAH 124 L + S ++ P V+ G +G++ L + + + E+ F+ H Sbjct: 62 LAESSCRLFPAGTVLVAMYGGFKQIGRTGL--LREKSAINQAISAIDIERNQADPEFVLH 119 Query: 125 FTKSSL--YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + S+ ++N +S NI + + +P L EQ+ IA L T + +T Sbjct: 120 WLNGSVETWKNYAASSRK---DPNITRENVCDFPVILPTLGEQRRIAHILSTWDQAIATT 176 Query: 183 KARFEQIPQILK-RFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS- 240 + + + + R LG W F L +GLS Sbjct: 177 ERLLKNSQKQMDILLRDLTLGTQRTTSTPSPWAKFTL-----------GELGRTYSGLSG 225 Query: 241 SKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 K + G G + ++V + +D D ++ SE+E N+ +++ GD++FT + + Sbjct: 226 KKGEDFGFGAKFIPYTNVFKNNRIDIEDFSLVKISENE-NQTRVKSGDIIFTISSETPNE 284 Query: 300 VGVCGLLKKLQHQNLLYPDKLI---RARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 VG+ +L L N LY + R K LPEY +P R A+M + S Sbjct: 285 VGMASVL--LDDVNELYLNSFCFGYRLNDFKTLLPEYAGFVLRAPHIR-ALMTQIAQGST 341 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 + IS ++ + LP + EQ I + + K + + LAR+ ++++ Sbjct: 342 RFNISKANVMRMELALPSIAEQKRIASILGGAHSTV----KNLRDQLARLKAEKVILMSQ 397 Query: 417 AFRGELTAQWRAEN 430 G+ + + Sbjct: 398 LLTGKRRVRLPTDE 411 >UniRef50_B9ZS45 Restriction modification system DNA specificity domain protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZS45_9GAMM Length = 419 Score = 207 bits (526), Expect = 8e-52, Method: Composition-based stats. 
Identities = 77/427 (18%), Positives = 161/427 (37%), Gaps = 35/427 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKD--DYLPLIRANNIQNGKFDT--TDLVFVP 63 EGW A +S + + G T + + + ++ + ++ + + ++ Sbjct: 3 EGWKTAKLSELCDIQLGKTPARANSSYWDQERSTGNVWLSIADLLKSEANNVSDSKEYLS 62 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 K + + ++++ K +G+ A + + E++I ++ Sbjct: 63 DKGAKLCKIVKKGTLLVS----FKLTLGRVAFAGKDLYTNEAIAALTIHDEQIINRDYLF 118 Query: 124 HFTKSSLYRNKISSLSAGANINN--IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +F + + + + + + A I + +PPL EQK I LD A +D+ Sbjct: 119 YFLH---FFDWVKAAQDDVKLKGMTLNKAKLKEILVVVPPLPEQKRIVAILDEAFASIDT 175 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 A E+ + ++ L V+ + + E+ +G Sbjct: 176 AVANTEKNLANARELFESYLNAVVDTAFRKS-----------TVTVLSDLAEEITDGDHM 224 Query: 242 KPNESGVGHPILRISSV--RAGHVDQNDIRFLECSESELNR--HKLQDGDLLFTRYNGSL 297 P ++ G P + I ++ R VD + + S E + + + GD+L+T Sbjct: 225 PPPKAPSGVPFITIKNIDKRTRKVDFENTFRVPRSYFEGLKPNKRPRKGDVLYTVTGS-- 282 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 G+ ++ Q + + R ++ SP + T + Q Sbjct: 283 --FGIPVVV--GQKTEFCFQRHIGLIRPKSGTDSSWLYYLLMSPQIFAQATDGA-TGTAQ 337 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 K +S K ++S V P+ +Q + V++++ L A + +E L + L QS+L KA Sbjct: 338 KTVSLKVLRSFRVPTIPLDQQVDNVQQLDNLLADVEGLESIYRQQLRNLGELKQSLLQKA 397 Query: 418 FRGELTA 424 F GELTA Sbjct: 398 FSGELTA 404 >UniRef50_A6L7U8 Type I restriction enzyme EcoAI specificity protein n=7 Tax=Bacteroides RepID=A6L7U8_BACV8 Length = 449 Score = 207 bits (526), Expect = 9e-52, Method: Composition-based stats. 
Identities = 89/400 (22%), Positives = 166/400 (41%), Gaps = 32/400 (8%) Query: 6 LPEGWVIAPVSTV-TTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVFVP 63 LP GW + + L G + K L + ++R NI N G D ++LV+ Sbjct: 68 LPNGWEWCNLEDIVCELKYGTSEKS------LSVGKIAVLRMGNITNVGTIDYSNLVYSS 121 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 N + + +D++ ++ S VGK+A + + +RP LIFS ++ Sbjct: 122 NNEDIKLYSLEKDDLLFNRTN-SSEWVGKTAIYKKEQPAIYAGYLIRIRPI-LIFSDYLN 179 Query: 124 HFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 SS YRN ++ N +NI + IPIPPL EQ+ I ++ ++ +D+ Sbjct: 180 TVMNSSYYRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLIDTI 239 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 K E + +K+ + +L A++GKL + N EP + K++N T NG ++ Sbjct: 240 KNSKEDLQTTIKQAKSKILNLAIHGKLVPQDPNDEPAIELLKRINP--DFTPCDNGHYTQ 297 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 E + +I+S+ G +N E+ + + + R N L G Sbjct: 298 LPEGWAICKMKQITSITNGKSQKN-------VETLNGIYPIYGSGGVIGRANQYLCIAGS 350 Query: 303 CGLLKKLQHQNLLY-------PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + +K N ++ D + L +Y+ F S + + ++ Sbjct: 351 TIIGRKGTINNPIFVEEHFWNVDTAFGLKANDAILDKYLYYFCLSFDF-----SKLDKST 405 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 ++ I + ++ +PP KEQ IV +++ + + I Sbjct: 406 AMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNEI 445 Score = 154 bits (388), Expect = 8e-36, Method: Composition-based stats. 
Identities = 72/294 (24%), Positives = 117/294 (39%), Gaps = 54/294 (18%) Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSV------------------------------- 222 K RQ +L A++GKL + N EP + Sbjct: 4 KALRQKILDLAIHGKLVPQDPNDEPASVLLERIKAEKERLIKEGKIKRSKKSAKSSDTPH 63 Query: 223 --------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLEC 273 ++ N E I+ EL+ G S K G +LR+ ++ G +D +++ + Sbjct: 64 YPYLLPNGWEWCNLEDIVCELKYGTSEKSLSVG-KIAVLRMGNITNVGTIDYSNLVY-SS 121 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEY 333 + ++ + L+ DLLF R N S E+VG + KK Q +Y LIR R +Y Sbjct: 122 NNEDIKLYSLEKDDLLFNRTNSS-EWVGKTAIYKK--EQPAIYAGYLIRIRPI-LIFSDY 177 Query: 334 IEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYAD 393 + +S RN N Q I+ + + ++ +PP+KEQ IV V + + D Sbjct: 178 LNTVMNSSYYRNWCYNVKTDAVNQSNINAQKLSQLMIPIPPLKEQERIVVEVAKWISLID 237 Query: 394 TIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 TI+ + + IL A G+L Q + P A LL++I Sbjct: 238 TIKNSKEDLQTTIKQAKSKILNLAIHGKLVPQDPNDEP--------AIELLKRI 283 Score = 72.8 bits (177), Expect = 3e-11, Method: Composition-based stats. Identities = 34/178 (19%), Positives = 67/178 (37%), Gaps = 29/178 (16%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW I + +T++ G + K + +N P+ + + Sbjct: 297 QLPEGWAICKMKQITSITNGKSQKNVETLN----GIYPIYGSGGV--------------- 337 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + + +Q + I G+ + + +FG L+ I ++ + Sbjct: 338 -IGRANQYLCIAGSTIIGRKGTINNPIFVEEHFWNVDTAFG-----LKANDAILDKYLYY 391 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F S + S L + ++ S + IPIPP EQ+ I K+D +L ++ Sbjct: 392 FCLSFDF----SKLDKSTAMPSLTKTSIGNVLIPIPPYKEQERIVAKIDMVLDTMNEI 445 Score = 55.5 bits (132), Expect = 4e-06, Method: Composition-based stats. 
Identities = 17/46 (36%), Positives = 22/46 (47%), Gaps = 8/46 (17%) Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 L Q IL A G+L Q + P A+ LLE+IKAE+ Sbjct: 4 KALRQKILDLAIHGKLVPQDPNDEP--------ASVLLERIKAEKE 41 >UniRef50_Q1K3D0 Restriction modification system DNA specificity domain n=7 Tax=Bacteria RepID=Q1K3D0_DESAC Length = 417 Score = 206 bits (525), Expect = 1e-51, Method: Composition-based stats. Identities = 88/417 (21%), Positives = 161/417 (38%), Gaps = 25/417 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 GW P+ + T IR + Y K+ Y +++NN++NGK + +F+ + Sbjct: 19 NGWTENPLGEIYTKIR-NAFVGTATPYYTKNGYF-YLQSNNVKNGKINRKTEIFIDEEFY 76 Query: 68 --KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLIFSGFIAH 124 +E + DIV+ S VG +A S ++ +P K ++ Sbjct: 77 FKQEKNWLRTNDIVMVQS----GHVGHTAVIPNELNNSAAHALIIISKPLKKSCPYYLNF 132 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + ++ + I +++ G I +I N+ PP EQ I T ++D Sbjct: 133 YFQTYRAKQDIGNITTGNTIKHILATDIKRFNVFFPPYEEQTKI----GTYFKKLDRIIE 188 Query: 185 RFEQIPQILKRFRQAVLGGAV-NGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 ++ L +QA+L + F ++K + GL++ Sbjct: 189 LHQRKHDKLVTLKQAMLQKMFPQDGASTPEIRFNGFEGDWEKKKLRDVCNSFDYGLNAAA 248 Query: 244 NESGVGHPILRISSVRAGH--VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 + + +RI+ + Q D+ E + L +GD+LF R S VG Sbjct: 249 KKYDGRNKYIRITDIDEFSRCFSQTDLTSPEADLPSSQNYLLCEGDILFARTGAS---VG 305 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 L +++ + + + LIRAR++ ++I S + N + SGQ GI+ Sbjct: 306 KTYLYREIDGR-VFFAGFLIRARVSNTESTDFIFYTTLSSNYEN-FVTITSQRSGQPGIN 363 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 K+ L+P V EQ +I F D + Q L ++ + + L K F Sbjct: 364 AKEYSEYTFLVPSVTEQKKIGTY----FRKFDALISQHATQLKKLKQIKSACLGKMF 416 >UniRef50_UPI0001BC364B restriction modification system DNA specificity subunit n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC364B Length = 428 Score = 206 bits (524), Expect = 2e-51, Method: 
Composition-based stats. Identities = 91/430 (21%), Positives = 179/430 (41%), Gaps = 28/430 (6%) Query: 3 AGKLPEGWVIAPVSTVTTL---IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 GK+PE W + L I G + + Q ++ K I +G+ Sbjct: 12 VGKIPENWKVLKNKYNFELSKEIIGTKWVETQLLSLTKYG------VKAINDGE----QT 61 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 VP++L QK++ +DIV+ + S V F+ +R + + Sbjct: 62 GKVPESL-STYQKVNKDDIVMCLFDLDCSAV---FSGISNFDGMISPAYKCIRCKPHLCP 117 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ ++ ++ K S +I F + I +PP+ QK IAE L+ ++ Sbjct: 118 QYVDYYFRTVFVDRKYKRYSKNV-RFSISSDEFMNLPIIVPPIDIQKKIAEFLNFKCFEI 176 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT------ 233 D+ + E+ + L+ ++++++ AV L + S + +T Sbjct: 177 DTLHSDIEKQIKTLEEYKKSIITEAVTKGLDPDVEMKDSGISYIGNIPKHWKVTNLKYLG 236 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTR 292 + +NG+S G G P + V + + QN + +++E N + ++ GD+ FTR Sbjct: 237 KCQNGISKGGEYFGNGFPFVSYGDVYKNYSIPQNVDGLIMSTKTEQNIYSVKYGDVFFTR 296 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCV 351 + ++E +G K N ++ LIR R T D +PE+ + +F S R + + Sbjct: 297 TSETIEEIGFASTCLK-SIDNSVFAGFLIRFRPTSSDLIPEFSKFYFRSNIHRKFFVKEM 355 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + + +S + VLLPP+ EQ I + +E+ A D ++ L + + Sbjct: 356 NLVT-RASLSQNLLGRLPVLLPPLCEQQMIAKNLEKKCAEIDGAIEEKKEQLETLEQYKK 414 Query: 412 SILAKAFRGE 421 S++ + G+ Sbjct: 415 SLIYEYVTGK 424 Score = 74.3 bits (181), Expect = 9e-12, Method: Composition-based stats. 
Identities = 18/139 (12%), Positives = 50/139 (35%), Gaps = 9/139 (6%) Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 + + + + + R P+Y++ +F + K + IS Sbjct: 89 SAVFSGISNFDGMISPAYKCIRCKPHLCPQYVDYYFRTVFVDRKYKRYSKNV--RFSISS 146 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + +++PP+ Q +I + DT+ + + + +SI+ +A Sbjct: 147 DEFMNLPIIVPPIDIQKKIAEFLNFKCFEIDTLHSDIEKQIKTLEEYKKSIITEAVT--- 203 Query: 423 TAQWRAENPDLISGENSAA 441 + +PD+ ++ + Sbjct: 204 ----KGLDPDVEMKDSGIS 218 >UniRef50_C9P132 Type I restriction-modification system specificity subunit S n=1 Tax=Vibrio metschnikovii CIP 69.14 RepID=C9P132_VIBME Length = 405 Score = 206 bits (523), Expect = 2e-51, Method: Composition-based stats. Identities = 93/411 (22%), Positives = 177/411 (43%), Gaps = 27/411 (6%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 WV P++ L G+TY + ++ + +IR++N++NG+ D V+V +V Sbjct: 19 WVEKPLNHEVELFSGLTYSPKD----IRKQGVFVIRSSNVKNGQIVQADNVYVNPEVVNC 74 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQH-LPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 S + DI++ + +GS++++GK A + L GAF +R FI + Sbjct: 75 SN-VQKGDIIVVVRNGSRALIGKHAQVNSLMDNTVIGAFMTGVRAG---HPEFINALFDT 130 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + ++ + GA IN I +F+ + P EQ I L + ++ + + ++ Sbjct: 131 DKFTAQVEK-NLGATINQITNGAFNGMVFMFPEGQEQTAIGNTFQKLDSLINQHQKKHDK 189 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 + I K + + +++ F + V K LN E EL +GL+ P + Sbjct: 190 LSNIKKAMLEKMFPKPGETTPEIRFKGFSGE-WVEKPLNHE---VELFSGLTYSPKDIRK 245 Query: 249 -GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G ++R S+V+ G + Q D ++ +N +Q GD++ NGS +G + Sbjct: 246 QGVFVIRSSNVKNGQIVQADNVYVNPEV--VNCSNVQKGDIIVVVRNGSRALIGKHAQVN 303 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 L ++ + R PE+I F + + + T Q I+ Sbjct: 304 SLMDNTVIGA-FMTGVRA---GHPEFINALFDTDKFTAQVEKNLGATINQ--ITNGAFNG 357 Query: 368 
QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 V + P +EQ I ++L D++ Q + ++NN+ Q+ L+K F Sbjct: 358 MVFMFPEGQEQTAIGNTFQKL----DSLINQHQQQITKLNNIKQACLSKMF 404 Score = 83.2 bits (204), Expect = 2e-14, Method: Composition-based stats. Identities = 46/217 (21%), Positives = 88/217 (40%), Gaps = 16/217 (7%) Query: 214 RNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV-GHPILRISSVRAGHVDQNDIRFLE 272 F+ + + + EL +GL+ P + G ++R S+V+ G + Q D ++ Sbjct: 10 IRFKGFSGEWVEKPLNHEV-ELFSGLTYSPKDIRKQGVFVIRSSNVKNGQIVQADNVYVN 68 Query: 273 CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE 332 +N +Q GD++ NGS +G + L ++ + R PE Sbjct: 69 PEV--VNCSNVQKGDIIVVVRNGSRALIGKHAQVNSLMDNTVIGA-FMTGVRA---GHPE 122 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 +I F + + + T Q I+ V + P +EQ I ++L Sbjct: 123 FINALFDTDKFTAQVEKNLGATINQ--ITNGAFNGMVFMFPEGQEQTAIGNTFQKL---- 176 Query: 393 DTIEKQVNNALARVNNLTQSILAKAF--RGELTAQWR 427 D++ Q +++N+ +++L K F GE T + R Sbjct: 177 DSLINQHQKKHDKLSNIKKAMLEKMFPKPGETTPEIR 213 >UniRef50_B4TEJ6 Restriction modification system DNA specificity domain n=1 Tax=Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 RepID=B4TEJ6_SALHS Length = 380 Score = 205 bits (522), Expect = 2e-51, Method: Composition-based stats. 
Identities = 83/407 (20%), Positives = 165/407 (40%), Gaps = 35/407 (8%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 + ++ G + + +++ +PLIR +I +GK T+ + + Sbjct: 7 VTLGKHIDILSGCAFPSS---GFNRNNGVPLIRIRDILSGK---TETYY--EGSYDLKYL 58 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 I D+++ M K L C + + + F+ HF + L Sbjct: 59 IKKGDLLVGMDGDFNREYWKGTDALLN-----QRVCKITPNPETLDKNFLYHFLQKEL-- 111 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 +KI + + + ++ I I +P L EQK IA L + D+ + + EQ ++ Sbjct: 112 DKIHATTDVVTVKHLSVKKIQDIKIRLPSLKEQKRIAAIL----DKADAIRQKREQAIKL 167 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI 252 F +A F K ++ + G S+K + +PI Sbjct: 168 ADDFLRAKFLEMFGTPANN--------IHRFPKGTIRDLVDSVNYGTSAKASIDSGEYPI 219 Query: 253 LRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 LR+ ++ G D D+++L+ S E +++ +++GDLLF R N S E VG + + + Sbjct: 220 LRMGNITYQGRWDFTDLKYLDLSVKEKDKYLVKEGDLLFNRTN-SKELVGKTAVYE--ED 276 Query: 312 QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 + + + LIR R YI + +S + +MN K+ G I+ +++++ +L Sbjct: 277 RPMAFAGYLIRVRPNSIGNNYYISGYLNSIHGKITLMNMCKSIVGMANINAQELQNIEIL 336 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +PP Q E E ++ + + ++ L ++ K F Sbjct: 337 IPPKHLQDEY----EIIYKKIKKGLSIYDKSAMQLQLLASNLSNKYF 379 Score = 97.8 bits (242), Expect = 8e-19, Method: Composition-based stats. 
Identities = 38/188 (20%), Positives = 81/188 (43%), Gaps = 13/188 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVFVPKN 65 P+G + + + T K + P++R NI G++D TDL ++ + Sbjct: 191 PKG----TIRDLVDSVNYGTSAKA----SIDSGEYPILRMGNITYQGRWDFTDLKYLDLS 242 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + ++ + + E ++ + SK +VGK+A +F + +RP + + +I+ + Sbjct: 243 VKEKDKYLVKEGDLLFNRTNSKELVGKTAVYEEDRPMAFAGYLIRVRPNSIGNNYYISGY 302 Query: 126 TKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQ---KIIAEKLDTLLAQVDS 181 S + + ++ + NI I I IPP Q +II +K+ L+ D Sbjct: 303 LNSIHGKITLMNMCKSIVGMANINAQELQNIEILIPPKHLQDEYEIIYKKIKKGLSIYDK 362 Query: 182 TKARFEQI 189 + + + + Sbjct: 363 SAMQLQLL 370 >UniRef50_A1K1C0 Type I site-specific deoxyribonuclease n=3 Tax=Bacteria RepID=A1K1C0_AZOSB Length = 449 Score = 205 bits (521), Expect = 4e-51, Method: Composition-based stats. Identities = 82/438 (18%), Positives = 165/438 (37%), Gaps = 26/438 (5%) Query: 6 LPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-FVP 63 +P W +P+ V L+ GV+ D + +++ + + +G F + V Sbjct: 20 IPAHWEPSPLKRVVALVESGVSVNAVDEP--AGPDAVGVLKTSCVYSGNFSHGENKAVVA 77 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIFSGFI 122 + L + + + ++++ + + ++VG + + F + + F Sbjct: 78 EELDRVACPVRAGTLIVSRMN-TPALVGAAGLVEENADNLFLPDRLWQVHFSGAV-PKFA 135 Query: 123 AHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++T S YR ++ AG + + N+ F +P+PP EQ IA LD A++D Sbjct: 136 HYWTASPSYRAQVQMACAGTSASMQNLSQDEFLRFVMPLPPKDEQTAIAAFLDRETAKID 195 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESI 231 + A+ E++ +L RQA + AV L W P H L++ + Sbjct: 196 ALIAKQEKLIALLAEKRQATISHAVTRGLNPDAPMKDSGVAWLGEVPAHWSVSALSYLAS 255 Query: 232 LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 L +P+ P L+ + + + + + + G LL Sbjct: 256 LETGATPDRGEPSYWNGTIPWLKTGEINWAPICEAEEFITDAGLENSAAKIAKPGTLLMA 315 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 Y + G LL+ N R+R +PE+ FF + + Sbjct: 
316 MYGQGVTR-GRVALLEIEATYNQACAAINFRSR----IIPEFGRYFFMAAY---DHVRDA 367 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + Q +S I + +PP+ EQ +VR ++ A D + + + + Sbjct: 368 GNETSQMNLSAGLISKIRLPVPPLDEQQAVVRFLDVETAKLDVLGAESERGITLLKERRS 427 Query: 412 SILAKAFRGELTAQWRAE 429 +++A A G++ + AE Sbjct: 428 ALIAAAVTGQIDVRNTAE 445 Score = 134 bits (337), Expect = 8e-30, Method: Composition-based stats. Identities = 42/207 (20%), Positives = 84/207 (40%), Gaps = 6/207 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W ++ +S + +L G T + + + + +P ++ I + Sbjct: 239 GEVPAHWSVSALSYLASLETGATPDRGEPSYW--NGTIPWLKTGEINWAPICEAEEFITD 296 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L + KI+ ++ G G+ A L E ++ C + I F Sbjct: 297 AGLENSAAKIAKPGTLLMAMYGQGVTRGRVAL--LEIEATYNQACAAINFRSRIIPEFGR 354 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F ++ + + + N+ I +P+PPL EQ+ + LD A++D Sbjct: 355 YFFMAAY--DHVRDAGNETSQMNLSAGLISKIRLPVPPLDEQQAVVRFLDVETAKLDVLG 412 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLT 210 A E+ +LK R A++ AV G++ Sbjct: 413 AESERGITLLKERRSALIAAAVTGQID 439 Score = 123 bits (308), Expect = 2e-26, Method: Composition-based stats. 
Identities = 49/225 (21%), Positives = 100/225 (44%), Gaps = 14/225 (6%) Query: 223 FKKLNFESILTELRNGLSSKPNESGVG---HPILRISSVRAGHVDQNDIRFLECSESELN 279 ++ + ++ + +G+S + G +L+ S V +G+ + + + E + Sbjct: 24 WEPSPLKRVVALVESGVSVNAVDEPAGPDAVGVLKTSCVYSGNFSHGENKAVVAEELDRV 83 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 ++ G L+ +R N + VG GL+++ NL PD+L + + A+P++ + + Sbjct: 84 ACPVRAGTLIVSRMN-TPALVGAAGLVEENAD-NLFLPDRLWQVHFS-GAVPKFAHYWTA 140 Query: 340 SPSARNA-MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 SPS R M C T++ + +S + V+ LPP EQ I +++ A D + + Sbjct: 141 SPSYRAQVQMACAGTSASMQNLSQDEFLRFVMPLPPKDEQTAIAAFLDRETAKIDALIAK 200 Query: 399 VNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 +A + Q+ ++ A R NPD ++ A L Sbjct: 201 QEKLIALLAEKRQATISHAVT-------RGLNPDAPMKDSGVAWL 238 >UniRef50_B5IRS1 Type I restriction modification DNA specificity domain protein n=1 Tax=Thermococcus barophilus MP RepID=B5IRS1_9EURY Length = 408 Score = 205 bits (521), Expect = 4e-51, Method: Composition-based stats. 
Identities = 83/421 (19%), Positives = 176/421 (41%), Gaps = 29/421 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + + + +G K + KD +LP + ++N + T V + Sbjct: 10 IGEIPEDWQVVKLGKIIGYTKGK--KPKMVAKEPKDGWLPYLSTEYLRNN--NPTQFVKI 65 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF-SGF 121 N + + DI++ + L + + + +K ++ S F Sbjct: 66 TGNEII----VEDGDILLLWDGSNAG------EFFLAKKGVLSSTMVKIFLKKHVYDSLF 115 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K + + G I ++ + + +P+PPL EQK IAE L T+ ++ Sbjct: 116 LFYLLKHR--EPFLKGQTKGTGIPHVDKNVLNALLLPLPPLEEQKQIAEILRTVDEAIEK 173 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 T E+ ++ K Q +L + K +K ++ + + + GLS Sbjct: 174 TDLAIEKTERLKKGLMQRLLTKGIKHKRFKK-TEIGEIPEEWRVVRIGEVTGLFQYGLSI 232 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 K ++ G +PI+++ S+ G V +I++++ E +++L+ GD+L R N S E VG Sbjct: 233 KMHDKG-KYPIIKMDSIINGEVKPVNIKYVDLDEDTFKKYRLEKGDILINRTN-SYELVG 290 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G+ + + ++ LIR R K P ++ + A + + Q I Sbjct: 291 RTGVF--MLDGDYVFASYLIRIRPDKKQIDPRFLTFYLIF--ANDKLRQLATRAVSQANI 346 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + ++K + LPP++EQ +I + + + + K+ ++ + + ++ G Sbjct: 347 NASNLKKFKIPLPPLEEQKQIAEILMTVDKKLELLRKR----KEKLERIKRGLMKDLLTG 402 Query: 421 E 421 Sbjct: 403 R 403 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. 
Identities = 49/206 (23%), Positives = 103/206 (50%), Gaps = 13/206 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++PE W + + VT L + G++ K Y P+I+ ++I NG+ ++ Sbjct: 206 EIGEIPEEWRVVRIGEVTGLFQYGLSIKMHDKGKY------PIIKMDSIINGEVKPVNIK 259 Query: 61 FVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IF 118 +V + + ++ DI+I ++ S +VG++ L + F ++ +RP+K I Sbjct: 260 YVDLDEDTFKKYRLEKGDILINRTN-SYELVGRTGVFMLDGDYVFASYLIRIRPDKKQID 318 Query: 119 SGFI-AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ + ++ ++++ + + NI ++ IP+PPL EQK IAE L T+ Sbjct: 319 PRFLTFYLIFANDKLRQLATRA--VSQANINASNLKKFKIPLPPLEEQKQIAEILMTVDK 376 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGG 203 +++ + R E++ +I + + +L G Sbjct: 377 KLELLRKRKEKLERIKRGLMKDLLTG 402 >UniRef50_Q0RV87 Type I restriction-modification system specificity subunit n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RV87_RHOSR Length = 391 Score = 204 bits (520), Expect = 4e-51, Method: Composition-based stats. Identities = 85/424 (20%), Positives = 163/424 (38%), Gaps = 49/424 (11%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP GWV+A + + T G YK + + P+ + Sbjct: 11 LPSGWVVAQMRRIATFRNGADYK----EVEVTEGGYPVYGSG----------------GE 50 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + SQ + + V+ G K + K F F L I ++ ++ Sbjct: 51 FRRASQYLYDGESVLF---GRKGTIDKPLLVSGRFWTVDTMFFTEL--TSNIEPRYLHYY 105 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 + + S + ++ IP+PP+ EQ IA+ LD A++D+ Sbjct: 106 ATTMPF----DYYSTSTALPSMTQGELGGHRIPLPPITEQGAIADFLDRETARIDTLIRE 161 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 ++ ++L+ R AV G V G + +++ G K +E Sbjct: 162 QRRLIELLRERRIAVAEGPVVG-----------LSWSTPLRSVTALIQTGPFGSQLKSDE 210 Query: 246 SGVG-HPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P++ S + G ++ ++ + S+ SEL RH L+ GD++ R +G C Sbjct: 211 YETGGTPVINPSHLVMGRIEPDERVAVSASKASELGRHALRAGDVIAARRGE----LGRC 266 Query: 304 
GLLKKLQHQNLL-YPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 +++ L LIR R T A PE++ + FSS R+++ + + ++ Sbjct: 267 AVVRAENTGFLCGTGSALIRLRET-VADPEFLALVFSSRRNRDSL-SLASVGATMDNLNA 324 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 I + + +PP+ EQ IV V + DT+ + + + +++ A G++ Sbjct: 325 DIIATLRIPMPPLPEQRRIVESVAEATTKIDTLITETESFIDLAKERRSALITAAVTGQI 384 Query: 423 TAQW 426 + Sbjct: 385 DVRD 388 >UniRef50_B4VK59 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VK59_9CYAN Length = 430 Score = 204 bits (519), Expect = 5e-51, Method: Composition-based stats. Identities = 85/434 (19%), Positives = 172/434 (39%), Gaps = 38/434 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYK--KEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++PE W + LI K E+ ++ D + R + K TTD + Sbjct: 18 GQIPEHWETLRTKNIFRLITEAAPKNNDEELLSVYSDIGVKPRRELEERGNKASTTDGYW 77 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + D+++ +G S ++ VLR K I S + Sbjct: 78 I----------VKKGDVIVNKLLAWMGAIGIS-----DYDGVTSPAYDVLRAYKPIDSKY 122 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + +S + +K+ S G + F I +P PP QK I E LD ++ Sbjct: 123 YHYLFRSPICLSKLKQHSRGIMEMRLRLYFDEFGRIRLPYPPFEIQKRIVEFLDRKCGEI 182 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 + A +++ ++L+ + ++ AV L +W P H KKL S Sbjct: 183 EDAIAHKKRLIELLEEQKTILINQAVTKGLDPNAPMKDSGIEWIGEIPTHWEVKKLKRIS 242 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLL 289 + ++ G LR +++ + D ++ E S L++ K+ GD++ Sbjct: 243 PCITVGIVITPSKYYVEEGVICLRSLNIKPNKILVKDSVYISERSNKYLSKSKIFAGDIV 302 Query: 290 FTRYNGSLEFVGVCGLLKK-LQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 R GV ++ + N + LI R K+ LP+++ + +S R+ + Sbjct: 303 CVRTGQP----GVSAVVDRRFDGANCI---DLIIIRKPKNDLPKFVSLAMNSEVCRSQYL 355 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + + Q+ + + ++ V+ +PP+ EQ +I + ++ + + + +N Sbjct: 356 TGA-SGAIQQHFNIEMAQNLVIAIPPLPEQIKIYNHISKIQKNTMDLMNFIKREIDLMNE 
414 Query: 409 LTQSILAKAFRGEL 422 L Q ++A+A G++ Sbjct: 415 LKQILIAEAVTGKI 428 Score = 69.3 bits (168), Expect = 3e-10, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 46/149 (30%), Gaps = 8/149 (5%) Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 ++ GD++ + + +G+ + + R K +Y F SP Sbjct: 79 VKKGDVIVNKLLAWMGAIGIS-------DYDGVTSPAYDVLRAYKPIDSKYYHYLFRSPI 131 Query: 343 ARNAMMNCVKTTSGQK-GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 + + + + + + + PP + Q IV +++ + Sbjct: 132 CLSKLKQHSRGIMEMRLRLYFDEFGRIRLPYPPFEIQKRIVEFLDRKCGEIEDAIAHKKR 191 Query: 402 ALARVNNLTQSILAKAFRGELTAQWRAEN 430 + + ++ +A L ++ Sbjct: 192 LIELLEEQKTILINQAVTKGLDPNAPMKD 220 >UniRef50_C6CZ61 Restriction modification system DNA specificity domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CZ61_PAESJ Length = 456 Score = 204 bits (518), Expect = 8e-51, Method: Composition-based stats. Identities = 88/460 (19%), Positives = 173/460 (37%), Gaps = 50/460 (10%) Query: 5 KLPEGWVIAPVSTVT---TLIRGVTYKK--EQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 ++P WV + ++ + +++ E DY +R +++ G Sbjct: 9 EVPGNWVWVKLGSLAYLTDFVANGSFQSLRENVEVSDDTDYALYVRLTDLRLG-LGHEGQ 67 Query: 60 VFVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 +V + K ++ +I+IA + V ++ + VLR + Sbjct: 68 KYVDETSYKFLSKSSLTGGEILIANIGANVGEV--FVMPNVDLLATIAPNMIVLRCNHYV 125 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + F+ +F S + + ++ G I I++ +PPL EQK IA+K++ LL Sbjct: 126 ENIFLNYFLSSPQGKKLLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIADKVERLLD 185 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-------------------- 217 +++ K E+ + + A+L A G+LT+KWR Sbjct: 186 KINQAKQLIEEAKATFELRQAAILDKAFRGELTKKWRGEHSNQISTVRSISEDINPNEIP 245 Query: 218 ---PQHSVFKKLNFESILTELRNGLSSK--PNESGVGHPILRISSV-RAGHVDQNDIRFL 271 P + +L L ++ + P G +P ++ V AG ++ + L Sbjct: 246 FLLPAGWNWVRLKDLGTLERGKSKHRPRNDPKLFGGEYPFIQTGDVANAGDYIESYNQTL 305 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 L +G + T + LLK 
+PD ++ + Sbjct: 306 S-EFGLLQSKLFPEGTVCITI----AANIADTALLK----FPCCFPDSVVGFIPKDAYIS 356 Query: 332 E-YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 Y+ + + + + + QK I+ K ++ +V +PP E EI+ + L Sbjct: 357 SLYLHYYMRTIKSN---LEHYAPATAQKNINLKVLQEILVPVPPKTEHDEILHMI-NLLM 412 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 D + + N + + L QS+L+KAF+G L +EN Sbjct: 413 QKDEEAQTIMNVASDLEILKQSVLSKAFQGNLGTNESSEN 452 Score = 126 bits (317), Expect = 1e-27, Method: Composition-based stats. Identities = 60/239 (25%), Positives = 103/239 (43%), Gaps = 12/239 (5%) Query: 211 EKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-------ILRISSVRAGHV 263 E P + V+ KL + LT+ S + V +R++ +R G Sbjct: 4 EDQPYEVPGNWVWVKLGSLAYLTDFVANGSFQSLRENVEVSDDTDYALYVRLTDLRLGLG 63 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 + E S L++ L G++L + VG ++ + + P+ +I Sbjct: 64 HEGQKYVDETSYKFLSKSSLTGGEILIANIGAN---VGEVFVMPNVDLLATIAPN-MIVL 119 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R ++ F SSP + ++ + T +GQ I+ +K+ V LPP+ EQ I Sbjct: 120 RCNHYVENIFLNYFLSSPQGKK-LLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIAD 178 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 +VE+L + ++ + A A +IL KAFRGELT +WR E+ + IS S + Sbjct: 179 KVERLLDKINQAKQLIEEAKATFELRQAAILDKAFRGELTKKWRGEHSNQISTVRSISE 237 >UniRef50_A0Q725 Type I restriction-modification system, subunit S n=2 Tax=Francisella novicida RepID=A0Q725_FRATN Length = 407 Score = 203 bits (517), Expect = 9e-51, Method: Composition-based stats. 
Identities = 83/426 (19%), Positives = 158/426 (37%), Gaps = 32/426 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI----QNGKFDTTDLV 60 KLP GW + + + K+ D +P RA I QNG D + + Sbjct: 6 KLPAGWEWKKLGDLFKITSSKRVHKKD----WLDKGIPFYRAREIVKLAQNGYVD--NEL 59 Query: 61 FVPKNLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 F+ +++ + E+ ++ G+ + ++ F G L+ E Sbjct: 60 FISEDMYNSFASKYGLPKENDILVTGVGTLG-IPFVVKKNDKFYFKDGNIIW-LKNENGT 117 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 +I + S RN+I+S + G+ + + + IP+PPLAEQK I KLD+L Sbjct: 118 NPKYIEYCFSSQDVRNQINS-NNGSTVATYTITNANNTIIPLPPLAEQKRIVAKLDSLFE 176 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 ++D +Q + L E N + I R Sbjct: 177 KIDKAIELHQQNITNANTLMASTLDKTFKKLEGEYGMNDI----------LDGIYIGCRK 226 Query: 238 GLSSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 G KP P + + + + ++ N + + + + K + +L + Sbjct: 227 GY--KPEIIDGKVPFIGMQDIDQYNGINTNYVLE-DYEKVSKGKTKFEKNAVLVGKITPC 283 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + ++ + ++ + P Y+ F S + +++ + +G Sbjct: 284 TQN-NKTSIVPSNINGGFAT-TEVYALHSKNNMNPFYLNYFVRSKDINDYLVSTMIGATG 341 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 ++ + I S + LPP+ Q + V ++ + D I++ L + L SIL K Sbjct: 342 RQRVPSDAITSLKIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDK 401 Query: 417 AFRGEL 422 AFRGEL Sbjct: 402 AFRGEL 407 >UniRef50_C5C353 Restriction modification system DNA specificity domain protein n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C353_BEUC1 Length = 427 Score = 203 bits (516), Expect = 1e-50, Method: Composition-based stats. 
Identities = 79/413 (19%), Positives = 160/413 (38%), Gaps = 33/413 (7%) Query: 31 QAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE--SQKISPEDIVIAMSSGSKS 88 ++ + D + L++ ++ +GKF ++ + + + P DI+IA Sbjct: 23 ESKDQDPDGDIRLLQLADVGDGKFKDKSDRWINEETFRRLRCSWVHPGDILIARM---PD 79 Query: 89 VVGK-SAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRNKISSLSAGANINN 146 +G+ + VLRP+ +G++ + S+ R+++ GA Sbjct: 80 PLGRACVVPEGLGKTITVVDVAVLRPDPDQADAGYLTYAINSAKTRSEVERQQDGATRQR 139 Query: 147 IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVN 206 I ++IP+PPL EQ+ IA+ LD Q+D+ A E++ +LK R + + AV Sbjct: 140 IPRKRLGRVSIPLPPLEEQRRIADFLDAETTQIDALIAEQERLIGLLKERRASGILQAVT 199 Query: 207 GKLTE--------KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES-GVGHPILRISS 257 L + W + P H + + + S P P ++ Sbjct: 200 RGLRDVDLKPSTLTWVDAVPLHWTVANIRRFAAMKTGHTPSRSNPEYWVDTHIPWFTLAD 259 Query: 258 V------RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 V R H+ + + + + L G ++ +R VG G++ + Sbjct: 260 VWQVRDGRRTHLGETENTISDLGLANSAAELLPAGTVVLSRT----ASVGFSGVMPRPMA 315 Query: 312 QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 + + + + + +PEY+ F A N + S K I + V Sbjct: 316 TSQDFWNWVCG----PELVPEYLMYLFR---AMRGEFNALMIGSTHKTIYQPVAAAIRVP 368 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 +PP++EQ EIV R+++ D + + + +A +++ A G++ Sbjct: 369 VPPLEEQHEIVARIDERTRKTDALINEAEHNIALSKERRAALITAAVTGQIDV 421 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. 
Identities = 43/211 (20%), Positives = 82/211 (38%), Gaps = 14/211 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGK---FDTTDL 59 +P W +A + + G T + Y D ++P + +++G+ T+ Sbjct: 218 VPLHWTVANIRRFAAMKTGHTPSRSN-PEYWVDTHIPWFTLADVWQVRDGRRTHLGETEN 276 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 L + ++ P V+ + S VG S P S + V PE + Sbjct: 277 TISDLGLANSAAELLPAGTVVLSRTAS---VGFSGVMPRPMATSQDFWNWVCGPE--LVP 331 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + + R + ++L G+ I I +P+PPL EQ I ++D + Sbjct: 332 EYLMYLFR--AMRGEFNALMIGSTHKTIYQPVAAAIRVPVPPLEEQHEIVARIDERTRKT 389 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 D+ E + K R A++ AV G++ Sbjct: 390 DALINEAEHNIALSKERRAALITAAVTGQID 420 >UniRef50_Q64AS2 Restriction endonuclease S subunits n=1 Tax=uncultured archaeon GZfos29E12 RepID=Q64AS2_9ARCH Length = 438 Score = 203 bits (515), Expect = 1e-50, Method: Composition-based stats. Identities = 74/435 (17%), Positives = 167/435 (38%), Gaps = 29/435 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PEGW + + T+ ++G D+ L+ + ++G + D V Sbjct: 17 IGEIPEGWEVNKIKN-TSYVKGRIGWHGLTSEEYSDEGAYLVTGTDFKDGVIEWEDCHHV 75 Query: 63 PKNLVKESQKIS-PEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRP-EKLIFS 119 + KE I ED ++ G+ +GK A + LP + + + ++RP K F Sbjct: 76 GWDRYKEDPYIHLKEDDLLITKDGT---IGKVALIKFLPNKATLNSGIFLVRPLNKKYFP 132 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ S+++ + GA I+++ +F+ PIP EQ IA LD A++ Sbjct: 133 KFMYWMLNSTVFERFFDYIKTGATISHLYQETFERFFFPIPLKQEQVAIASFLDKKTAKI 192 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFES 230 D+ + +++ ++LK R A++ AV L W P+ + Sbjct: 193 DALIEKDKRLIELLKEKRTALIDHAVTKGLDPNVKMKDFGIVWIGKIPEDAKIMPFRRVC 252 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + + K + I+ ++ H D++ ++ E + + D+L Sbjct: 253 YVNQGLQFPEDKRLSEPDEKSKIYIT-IKYIHADEDGVK--EYIPNPPRGVICKKEDVLL 309 Query: 291 
TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 R + E + +Q ++ + + +Y+ + S + ++ Sbjct: 310 ARTGATGEVI---------TNQEGVFHNNFFKVNYNSKIDRDYLVYYLKMDSIKKVLL-L 359 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + ++ S +L +++Q +I +++ A D K + + + Sbjct: 360 KAGVTTIPDLNHDAFLSTPFILYSIEKQKQIAEYLDKKTAKIDKNIKLIEKKIKLLEEYK 419 Query: 411 QSILAKAFRGELTAQ 425 +S++ G++ + Sbjct: 420 KSLINHVVTGKVDVR 434 >UniRef50_Q30XD2 Type I restriction-modification system, S subunit n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30XD2_DESDG Length = 448 Score = 203 bits (515), Expect = 2e-50, Method: Composition-based stats. Identities = 87/437 (19%), Positives = 161/437 (36%), Gaps = 27/437 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W IAPV G + + +D +P RA +Q + +D+ + Sbjct: 18 IGQVPEHWKIAPVKYHYDARLGKMIQPAAVSD--RDIEVPYHRAQTVQWERIVESDIKEM 75 Query: 63 PKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPEKLIFS 119 + E +S D++I V ++A P F +R + Sbjct: 76 WASPRDIEQFSVSEGDLLIC----EGGDVCRAAIVKQPPEKNMIFQKSIHRIRSKGEYGV 131 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 G++ + I L I + + P+PP EQ IA LD A++ Sbjct: 132 GWVMRLMQHLRSSEWIDVLCNKNTIVHFTSDKLGSLECPLPPPDEQASIAAALDRETARI 191 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSV---FKKLN 227 D+ + + ++LK RQA++ AV L +W P+H K + Sbjct: 192 DALIQKKTRFIELLKEKRQALITHAVTKGLDPNVKMKDSGVEWLGEVPEHWSSVPIKYMA 251 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDG 286 E L + S G + +V G + F+ E + L ++ G Sbjct: 252 LERNSLFLDGDWIESKDISTDGIRYITTGNVGEGVYKEQGSGFISEETFHALGCTEVYGG 311 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 D+L +R N +G ++ L + + D +I R ++I FSS Sbjct: 312 DVLVSRLNNP---IGRACMVPDLGVRVVTSVDNVI-FRPDSKFNKKFIVYLFSSEEYFKH 367 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 N + + + IS + + V P ++EQ +I R ++ A D + + ++ + Sbjct: 368 
TSNLAR-GATMQRISRGLLGNIRVATPSIEEQTQIARFLDHETARIDALIGKAEQSITLL 426 Query: 407 NNLTQSILAKAFRGELT 423 + + A G++ Sbjct: 427 KERRAAFITAAVTGQID 443 Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 88/232 (37%), Gaps = 12/232 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H + + + + ++ + P R +V+ + ++DI+ + Sbjct: 16 EWIGQVPEHWKIAPVKYHYDARLGKMIQPAAVSDRDIEVPYHRAQTVQWERIVESDIKEM 75 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 S ++ + + +GDLL V ++K+ +N+++ + R R + Sbjct: 76 WASPRDIEQFSVSEGDLLICEGG----DVCRAAIVKQPPEKNMIFQKSIHRIRSKGEYGV 131 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ + + + + + + + S LPP EQA I +++ A Sbjct: 132 GWVMRLMQHLRSSEWI-DVLCNKNTIVHFTSDKLGSLECPLPPPDEQASIAAALDRETAR 190 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q+++ A + +P++ ++ L Sbjct: 191 IDALIQKKTRFIELLKEKRQALITHAVT-------KGLDPNVKMKDSGVEWL 235 >UniRef50_Q6GD64 Putative type I restriction enzyme specificity protein n=1 Tax=Staphylococcus aureus subsp. aureus MSSA476 RepID=Q6GD64_STAAS Length = 436 Score = 203 bits (515), Expect = 2e-50, Method: Composition-based stats. 
Identities = 77/432 (17%), Positives = 167/432 (38%), Gaps = 24/432 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P+ W I + + I G +K E + + D+ +I + + +L + Sbjct: 13 IGYIPKYWTITKLKNIIDFISGYAFKSE--LFTISDNNKKVITIKSFNTKEIILDNLSYS 70 Query: 63 PKNLVKESQKISPE-DIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 ++L ++ + DI+ AMS G+ + + G++R FS F Sbjct: 71 NESLKFPTKYLLKNNDILFAMSGGTTGK--NLLIEQVDDLYYINQRVGIIRSS---FSKF 125 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I ++ + L+ I+ S+G+ NI I +P K I ++ L + + Sbjct: 126 IYYYINTGLFSEYINLFSSGSAQPNISATDIQNFIIALPEKETIKKIEIYINYQLKIISN 185 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESIL 232 Q + LK+++Q+++ AV + W P + +K+ + L Sbjct: 186 IIDTTYQSIEELKKYKQSLITEAVTKGIDPNVEMKESGNDWIGSIPSNWSVRKIKHDFNL 245 Query: 233 TEL--RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLL 289 GL+S VG ++ + + G + + + E +++ DLL Sbjct: 246 KGRIGWQGLTSN-EYQTVGPYLITGTDFKKGIIRWDSCVRISEERFEEAPDIHIKENDLL 304 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 T+ +G++ V + + K +L LIR +L +++ S N + Sbjct: 305 ITK-DGTIGKVALATNVPK--KVSLNSGVLLIREKLKNTINKKFMYYNLLSNMFWNWYNS 361 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + S K + + +P + EQ +IV+ ++ + D + + + + N Sbjct: 362 NNQGASTIKHLYQGQFYNYSYAIPLLHEQQQIVQYLDDKVSTIDRLIEDKTKVIKELENY 421 Query: 410 TQSILAKAFRGE 421 +S++ + G+ Sbjct: 422 KKSLIYEYVTGK 433 >UniRef50_Q0RKJ6 Type I restriction modification enzyme protein S n=1 Tax=Frankia alni ACN14a RepID=Q0RKJ6_FRAAA Length = 399 Score = 203 bits (515), Expect = 2e-50, Method: Composition-based stats. 
Identities = 87/416 (20%), Positives = 160/416 (38%), Gaps = 29/416 (6%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTDLVFVPKNLVKE 69 P+ +I G T K A +P ++ + +T L Sbjct: 7 TPLGEFCEIISGATPKT--ASEEYWGGEIPWATPRDLGSLNSKFLASTSRAITEAGLRSC 64 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 + + P V+ S V +A + F ++ G++ H+ + Sbjct: 65 ATHVLPAGSVLLTSRAPIGSVAINARPM----ATNQGFKSLVPDTSRALPGYLYHWLRCQ 120 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 R+++ SL GA + ++ I +P+PPL+EQK I + L Q D+ +AR + Sbjct: 121 --RSRLQSLGNGATFKELSKSATARIAVPLPPLSEQKRIEQML----DQADTIRARRRET 174 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG 249 L+ Q++ + + R + + +S + + ++P E G Sbjct: 175 IARLEELAQSIFSVMFGNPVQNE-RGWRRVPLSELVVRIDSGRSPVCLDRPARPGEWG-- 231 Query: 250 HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 +L++ +V + + + L + + +++ GDLLF+R N + E V C L+ Sbjct: 232 --VLKLGAVTSCVYRAGENKALPPDVAAFSACEVRPGDLLFSRKN-TRELVAACALVDAT 288 Query: 310 QHQNLLYPDKLIR--ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIK 366 LL PD + R P Y+ + P R + +S IS + Sbjct: 289 -PARLLLPDLIFRLVVEPRSAVDPVYLHRLLTHPEKRRKVQGLASGSSASMPNISKSRLL 347 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + LPP++ Q E RV L + I+ +L + L S+ +AFRGEL Sbjct: 348 GLEIELPPMEVQKEFANRVRAL----ERIKVAHQASLVEQDELVASLAHRAFRGEL 399 >UniRef50_D0C390 Type I restriction-modification system specificity determinant n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0C390_9GAMM Length = 461 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. 
Identities = 93/467 (19%), Positives = 181/467 (38%), Gaps = 50/467 (10%) Query: 5 KLPEGWVIAPVSTVT----TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 +LP W + ++ + + + D+ +PLI+ NNI++GK ++ Sbjct: 16 ELPSHWQEKRLGFLSMQTKNAFVDGPFGSDLKSDDYLDEGIPLIQLNNIRDGKHILRNMK 75 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPE-KL 116 F+ +N + P+DIVIA V ++A + E A C L P+ +L Sbjct: 76 FISQNKKIDLIRHLALPQDIVIAKM---AEPVARAAVVSDEYDEYVIVADCVKLSPDLEL 132 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + F+ S R +S G I + +P P L+EQ I + LD Sbjct: 133 VDLNFLIWAINSDCVRENAELVSTGTTRIRINLGELKKLKVPYPSLSEQVKIRQYLDHET 192 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 A++D+ A+ E++ +LK RQAV+ AV L +W P+H K Sbjct: 193 AKIDTLIAKQEELIALLKEKRQAVISHAVTKGLNPNVPMKDSGVEWLGEVPEHWTVSK-- 250 Query: 228 FESILTELRNGLSSKPN-----ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHK 282 ++++ G S +P +G P + ++ + ++D +L +E+ L + Sbjct: 251 -FGYISQVVRGGSPRPAGDPALFNGDYSPWVTVAEIT-----KDDELYLTSTETFLTKKG 304 Query: 283 ------LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 Q G LL + +L + + + + D I Sbjct: 305 SEQCRVFQSGTLLLSNSGATLGVPKILSINANANDGVVGFEDLKIDIE----------YA 354 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +F N + VK SGQ ++ +K+ + +PP E +IV +++ + + Sbjct: 355 YFYLSILTNDLRERVKQGSGQPNLNTDIVKAIPIAIPPENEIKKIVVDIKKKIDHFSKLM 414 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQ-WRAENPDLISGENSAAA 442 A+ + ++++ G++ + W+ N + +A Sbjct: 415 GSAEKAIQLMQERRTALISAVVTGKIDVRNWQHLNKNNNQDNMELSA 461 Score = 94.8 bits (234), Expect = 6e-18, Method: Composition-based stats. 
Identities = 37/209 (17%), Positives = 76/209 (36%), Gaps = 9/209 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK--FDTTDLVF 61 G++PE W ++ ++ ++RG + + DY P + I + T+ F Sbjct: 240 GEVPEHWTVSKFGYISQVVRGGSPRPAGDPALFNGDYSPWVTVAEITKDDELYLTSTETF 299 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + K ++ + ++++ S + V K + F + I + Sbjct: 300 LTKKGSEQCRVFQSGTLLLSNSGATLG-VPKILSINANANDGVVGFEDL-----KIDIEY 353 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S L + + G+ N+ I I IPP E K I + + Sbjct: 354 AYFYL-SILTNDLRERVKQGSGQPNLNTDIVKAIPIAIPPENEIKKIVVDIKKKIDHFSK 412 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLT 210 E+ Q+++ R A++ V GK+ Sbjct: 413 LMGSAEKAIQLMQERRTALISAVVTGKID 441 >UniRef50_A3US47 Type I site-specific deoxyribonuclease n=1 Tax=Vibrio splendidus 12B01 RepID=A3US47_VIBSP Length = 413 Score = 202 bits (513), Expect = 2e-50, Method: Composition-based stats. Identities = 80/419 (19%), Positives = 166/419 (39%), Gaps = 21/419 (5%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P GW I + ++ T+ RG + + +P ++ +I + K + Sbjct: 2 VPNGWSIKTLESLATVERGKFSARPRNDPKYYGGEIPFVQTGDIASAKTYLSSFNQTLNE 61 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + ++ PE+ ++ + + +G +A C ++P++ I ++ F Sbjct: 62 DGLKVSRLFPENSILITIAAN---IGDTAITTFEVACPDS--LVGIQPKQDIDCFWLNSF 116 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 ++ ++++ + NI + I PP EQ+ IA+ L T + +T+ Sbjct: 117 LETC--KDELDGKATQNAQKNINLQVLKPLEILTPPYKEQQKIAKILSTWDKAITTTEKL 174 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 Q K Q +L G + + FE + K + S +T+ G P Sbjct: 175 IATSKQQKKALMQQLLTGKKRLVNPDTGKTFEGEWEEVKLGDVCSKVTD---GAHHSPKS 231 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSESE---LNRHKLQDGDLLFTRYNGSLEFVGV 302 G+P+L + +RA +N R + + E K + D+L + L++ Sbjct: 232 VECGYPMLSVKDMRATKFSENTARHISKEDYEALVKQNCKPELNDILIAKDGSILKY--- 288 Query: 303 
CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 C ++++ +L L+R +L+ P +I +FS S R + + + SG I Sbjct: 289 CFVVREEIEGVILSSIALLRPKLS-IISPNFIAQYFSQESVRFFVGKALTSGSGVPRIIL 347 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 KD K + +P + EQ +I + AD + LA ++++ + G+ Sbjct: 348 KDFKGIHLRIPSLLEQQKIAS----VLTAADKEIEVFEAKLAHFKQEKKALMQQLLTGK 402 >UniRef50_B3PQK6 Probable type I restriction-modification system protein, specificity subunit n=1 Tax=Rhizobium etli CIAT 652 RepID=B3PQK6_RHIE6 Length = 424 Score = 202 bits (513), Expect = 3e-50, Method: Composition-based stats. Identities = 85/420 (20%), Positives = 174/420 (41%), Gaps = 28/420 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGKFDTTDLVFVP 63 PEGW + + + L G T + + D +P + ++ I+ T + Sbjct: 24 PEGWALERLCDIARLESGHTPSRNRP--DYWDGGIPWLSLHDSKTIEGKVLQNTKMTISA 81 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + L S ++ PE V + + +GK A S C + P + + ++A Sbjct: 82 RGLANSSARLLPEGTVALSRTAT---IGKVALLGREMATSQDFACYICGPR--LLNKYLA 136 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 H + + L AG+ N I +F+ + I +PP+ EQ+ IA+ L A ++ + Sbjct: 137 HLFRGMEL--EWERLMAGSTHNTIYMPTFENMQILVPPMEEQEAIADALSDADALIEGLE 194 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + I + Q +L R + + L+ +NGL+ Sbjct: 195 RLIAKKWLIKQGTMQDLLTA---------KRRLPGYSAEWTMAKLGDFLS-FKNGLNKAK 244 Query: 244 NESGVGHPILRISSV-RAGHVDQNDIR-FLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 G G PI+ V R G +++ I +E +E+E + + +++GD+LFTR + + E +G Sbjct: 245 AFFGHGTPIINYMDVFRGGAINEGSIDGLVEVTEAEQSAYGIRNGDVLFTRTSETPEEIG 304 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 + + + ++ ++R R AL + + F S + R +++ T+ + Sbjct: 305 LAAVADGVLDGT-VFSGFVLRGRPKSQALTIAFSKYCFRSGAVRRQIISRATYTT-RALT 362 Query: 361 SGKDIKSQVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +G+ + + + +P EQ I + + A +E +++ A + Q++L R Sbjct: 363 
NGRQLSAVDISVPRDADEQNAIAEVLNDMDAEIQALETRLDKARQVKEGMMQNLLTGRIR 422 >UniRef50_Q2SCB3 Restriction endonuclease S subunit n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2SCB3_HAHCH Length = 406 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 86/431 (19%), Positives = 169/431 (39%), Gaps = 38/431 (8%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVPKNLV 67 W +STV + G T K + +D+ P+++ ++ +N KF FV Sbjct: 2 SWDRVGLSTVADVFNGKTPSKAEQ----RDEGFPVLKIKDVDENFKFRGAFQSFVDDEFY 57 Query: 68 --KESQKISPEDIVIAMSSGSKSVVGK---SAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 +++KI D +I ++ + VG A + + + G + + ++ F+ Sbjct: 58 AKHKAKKIQLHDSMILNAAHNSDYVGSKQYCAEEDVVDSVATGEWLVCRAKQGVLSPKFL 117 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + +S R ++ L G ++ P + IP+PPL QK IA L + D Sbjct: 118 NFWLRSEATRFEMKGLVKGI---HLYPKDVARLEIPLPPLETQKQIAAIL----EKADQL 170 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS-- 240 + +Q+ Q L Q+V ++ + K + SI T+ +G Sbjct: 171 RKDCQQMEQELNNLAQSVFMDMFGDPVSNPK--------GWNKASLRSISTKFNDGPFGS 222 Query: 241 --SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSL 297 + G ++R++++ G +D F+ +E L + + GD++ Sbjct: 223 NLKTSHYRDSGVQVIRLTNIGTGWFKNDDRAFVSVEHAETLEKFHCKPGDIVIATLGDPN 282 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 ++ + D + TK EY+ F + PS ++ N + + Sbjct: 283 L---RACIIPDEVPLAINKADCVHCVPNTKIVRKEYLVEFLNLPSTLRSIENKL-HGQTR 338 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 IS + VL+PP+ EQ + + + D K++ + +L S++ KA Sbjct: 339 TRISSGQLAEVDVLIPPLSEQDKFMNAIW----LRDKELKRLQDQNVAFEDLFNSLMQKA 394 Query: 418 FRGELTAQWRA 428 F GEL + +A Sbjct: 395 FNGELNIKNKA 405 Score = 109 bits (271), Expect = 3e-22, Method: Composition-based stats. 
Identities = 43/210 (20%), Positives = 79/210 (37%), Gaps = 7/210 (3%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+GW A + +++T + ++ +D + +IR NI G F D FV Sbjct: 200 PKGWNKASLRSISTKFNDGPFGSNLKTSHYRDSGVQVIRLTNIGTGWFKNDDRAFVSVEH 259 Query: 67 VK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + E P DIVIA + G ++ +P + + K++ ++ Sbjct: 260 AETLEKFHCKPGDIVIA-TLGDPNLRACIIPDEVPLAINKADCVHCVPNTKIVRKEYLVE 318 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 F I + G I +++ IPPL+EQ +K + D Sbjct: 319 FLNLPSTLRSIENKLHGQTRTRISSGQLAEVDVLIPPLSEQ----DKFMNAIWLRDKELK 374 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 R + + +++ A NG+L K + Sbjct: 375 RLQDQNVAFEDLFNSLMQKAFNGELNIKNK 404 >UniRef50_A6E2C3 Restriction modification system DNA specificity domain n=3 Tax=cellular organisms RepID=A6E2C3_9RHOB Length = 394 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 84/413 (20%), Positives = 157/413 (38%), Gaps = 26/413 (6%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 P+ + ++ G T + + +P + + D+T + K + Sbjct: 5 IPLGELVSIRGGGTPSRGKK--EFWGGPIPWATVKDFKTTSLDSTLESITEDGVRKSATN 62 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 I P ++ ++ VGK+A + + L P+ I + F+ HF S Sbjct: 63 IVPAGSIVV---PTRMAVGKAAINTIDV--AINQDLKALLPKGEIDTRFLLHFLLSKS-- 115 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 N + S + GA + IK + P L EQ+ IA L + D+ + + EQ + Sbjct: 116 NFLESQAQGATVKGIKLDLLKSLPFPDLSLNEQRRIAAIL----DKADAIRRKREQALNL 171 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES-GVGHP 251 F +V G E NF + + G + K +E G P Sbjct: 172 ADEFLMSVFLEMF-GDPIENPHNFPKEKVKLHLSKSRAGTQSGPFGAALKKHEYVPEGIP 230 Query: 252 ILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 + + +V+ +D+ + E ++L R+ +Q GD+L +R VG + + Sbjct: 231 VWGVENVQYNRFIDKPRLFITEDKFNDLLRYSVQHGDILISRAG----TVGRMCIASTSE 286 Query: 311 
HQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 ++++ LIR L +L EY FS R + ++ K +K Sbjct: 287 ERSII-STNLIRVALDPASLTAEYFVSLFSYLPGRVGALKANNKDDAFTFLNPKTLKEIE 345 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + +P + +Q V + ++ + LA ++L S+ +AFRGEL Sbjct: 346 IPIPDMTQQKRFVSILHRVQHSIRR----QGDQLAGFSDLFSSLSQRAFRGEL 394 >UniRef50_Q8EJT0 Type I restriction-modification system, S subunit n=1 Tax=Shewanella oneidensis RepID=Q8EJT0_SHEON Length = 439 Score = 201 bits (512), Expect = 4e-50, Method: Composition-based stats. Identities = 80/444 (18%), Positives = 169/444 (38%), Gaps = 35/444 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT--TDLVF 61 GK+P W + + G + + + L+R N+ G + Sbjct: 10 GKIPNDWEYQIIIDNVEFLTGPAFDSS--LFNTESRGARLVRGINLTQGSTRWGEDKTKY 67 Query: 62 VPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHL--PFECSFGAFCGVLRPEKLIF 118 L + +++ DI+I M S+VGK+ LR + + Sbjct: 68 WDVELNNLKKYQLAINDILIGMDG---SLVGKNYAYLKQSDLPALLVQRVARLRAKSNLH 124 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S ++ + + + + + + + I +I P PPL EQ+ IA L ++ Sbjct: 125 SKYLYYMYATDFWLDYVEVVKTNSGIPHISNGDIKNFRFPFPPLPEQQKIAAILTSVDEV 184 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKW----------RNFEPQHSVFKKLNF 228 ++ T+A+ +++ + Q +L V K +K+ P+ K LN Sbjct: 185 IEKTQAQIDKLKDLKSGMMQELLTKGVGIKQGDKYVPHIEFKDSPVGKIPKSWEVKPLN- 243 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDG 286 +L + + P + ++R S+VR G + +D+++ + NR G Sbjct: 244 SVVLKIIDCEHKTAPYVDKSEYLVVRTSNVRHGELVLDDMKYTHADGYAEWTNRAIPSLG 303 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARN 345 D+LFTR G L+ ++ + +++ R + + +F +S +A Sbjct: 304 DVLFTREAP----AGESCLVP--ENTKVCMGQRMVLLRPDANVIFSNFFSLFLTSEAASC 357 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 A+ + I+ +DIK ++PP+ EQ EI + ++ + L Sbjct: 358 AIYER-SIGTTVSRINIEDIKRIPCIVPPLSEQQEISKAIQSVQNSI----LNKQEKLQS 412 Query: 406 VNNLTQSILAKAFRGELTAQWRAE 429 + NL ++++ G++ + + Sbjct: 413 
LKNLKKALMQDLLTGKVRVKVDND 436 Score = 121 bits (304), Expect = 5e-26, Method: Composition-based stats. Identities = 44/214 (20%), Positives = 93/214 (43%), Gaps = 11/214 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+P+ W + P+++V I +K Y+ ++R +N+++G+ D+ + Sbjct: 230 VGKIPKSWEVKPLNSVVLKIIDCEHK---TAPYVDKSEYLVVRTSNVRHGELVLDDMKYT 286 Query: 63 -PKNLVKESQKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIF 118 + + + P D++ ++ G+S + G +LRP+ +IF Sbjct: 287 HADGYAEWTNRAIPSLGDVLFTR----EAPAGESCLVPENTKVCMGQRMVLLRPDANVIF 342 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F + F S I S G ++ I I +PPL+EQ+ I++ + ++ Sbjct: 343 SNFFSLFLTSEAASCAIYERSIGTTVSRINIEDIKRIPCIVPPLSEQQEISKAIQSVQNS 402 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 + + + + + + + K Q +L G V K+ Sbjct: 403 ILNKQEKLQSLKNLKKALMQDLLTGKVRVKVDND 436 >UniRef50_C4KDM6 Restriction modification system DNA specificity domain protein n=1 Tax=Thauera sp. MZ1T RepID=C4KDM6_THASP Length = 532 Score = 201 bits (510), Expect = 7e-50, Method: Composition-based stats. 
Identities = 102/461 (22%), Positives = 187/461 (40%), Gaps = 34/461 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 A LP GW +A + + + + +P F T Sbjct: 7 QASDLPAGWDVASFGELNSFSGSTVNPATRPDEVFELYSVP----------SFPTKHPEQ 56 Query: 62 VP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 +P + + Q + P D+++ + + V + + + + G R + ++ Sbjct: 57 LPGRAIGSTKQTVRPGDVLVCKINPRINRVWTVGTRRDHEQIASSEWIG-FRSDAMV-PR 114 Query: 121 FIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F H+ +R+ + S +G + +P+ + + PLAEQ IA++L+ LLA+ Sbjct: 115 FAKHYFSEPSFRSLLCSEVSGVGGSLTRAQPSRVAKYPVLVAPLAEQARIADQLEALLAR 174 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + + R E IP +LKRFR+ VL A++G LTE WR + + + E+ G Sbjct: 175 IQACQDRLEAIPALLKRFRKLVLSSALSGDLTEVWRAEQGVGLDTWSARTIADVAEVGTG 234 Query: 239 LSS----KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 + + G P + ++ ++D D + + + G L+ Y Sbjct: 235 STPLRSNSNFYAETGTPWVTSAATSRPYIDSADQYVTKAAIDAHRLRVYRPGTLIIAMYG 294 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 G + +L+ + + A ++++ S + + Sbjct: 295 EGKTR----GQVSELRIDATINQACAAITVDEQQANAAFVKLALLSQYEQT---RALAEG 347 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 Q ++ ++ + LP EQA+IV RV +LFA+ADTI+ +V A + L L Sbjct: 348 GAQPNLNLSKVRGIPLRLPEGPEQAQIVHRVGELFAFADTIDSRVAAATGKTRKLPSLTL 407 Query: 415 AKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG 455 AKAFRG+L Q + P A+ LL +I A+RAA Sbjct: 408 AKAFRGDLVPQDPTDEP--------ASVLLARIAAQRAAPP 440 >UniRef50_B9KF72 Type I restriction-modification system, S subunit n=2 Tax=Campylobacter RepID=B9KF72_CAMLR Length = 390 Score = 200 bits (509), Expect = 7e-50, Method: Composition-based stats. 
Identities = 82/420 (19%), Positives = 155/420 (36%), Gaps = 36/420 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W I+ + ++ Q P A+ I + ++ K + Sbjct: 2 KYWKISIIDNTCEILNNKRVPISQKDRI--SGIYPYYGASGIVD---------YIDKYIF 50 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF-SGFIAHFT 126 E ++I SA + +L+P I + F+ +F Sbjct: 51 DEEL------VLIGEDGAKWGAFENSAFI-ASGKYWVNNHAHILKPNNEILINKFLVYFL 103 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 S I GA + + I I +PPL EQ+ I LD A +D + Sbjct: 104 NYSNLEKYI----TGATVKKLNQQKLKQIEILLPPLKEQERIVGILDESFANIDESIKIL 159 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLSSKPNE 245 EQ L Q+ L N N++ PQ +K L +T+ G PN Sbjct: 160 EQDLLNLDELMQSALQKTFNPLKDNAKENYQLPQDWEWKSLGEICFITD---GTHKTPNY 216 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSE--SELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P L + ++ G D +DI+++ E + R K + GD+L R +G Sbjct: 217 IETGIPFLSVKNISKGFFDLSDIKYISLEEHNKLIKRAKPEFGDILICRIG----TLGKA 272 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM-MNCVKTTSGQKGISG 362 + ++ L++ ++ + +Y+ F +S + N V + ++ Sbjct: 273 IKISLEFEFSIFVSLGLLKPKV--KIISDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNL 330 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++ + LP +KEQ +I +++ +++ + + L +S+L KAF+G+L Sbjct: 331 NILEKCPIALPSLKEQEQIASYLDEFSLNIKDLKQNYQAQIKNLQELKKSLLDKAFKGKL 390 Score = 135 bits (339), Expect = 5e-30, Method: Composition-based stats. 
Identities = 49/213 (23%), Positives = 88/213 (41%), Gaps = 14/213 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 +LP+ W + + I T+K I + +P + NI G FD +D+ + Sbjct: 187 ENYQLPQDWEWKSLGEIC-FITDGTHKTPNYI----ETGIPFLSVKNISKGFFDLSDIKY 241 Query: 62 VPKNLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + K DI+I +GK+ L FE S G+L+P+ I Sbjct: 242 ISLEEHNKLIKRAKPEFGDILICR----IGTLGKAIKISLEFEFSIFVSLGLLKPKVKII 297 Query: 119 SGFIAHFTKSSLYRNKISS--LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 S ++ +F S I++ + G + + + I +P L EQ+ IA LD Sbjct: 298 SDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNLNILEKCPIALPSLKEQEQIASYLDEFS 357 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K ++ + L+ ++++L A GKL Sbjct: 358 LNIKDLKQNYQAQIKNLQELKKSLLDKAFKGKL 390 >UniRef50_C2KFA2 Restriction endonuclease S subunit n=4 Tax=Lactobacillus crispatus RepID=C2KFA2_9LACO Length = 480 Score = 200 bits (509), Expect = 8e-50, Method: Composition-based stats. Identities = 80/426 (18%), Positives = 158/426 (37%), Gaps = 61/426 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +P+ W + V L G T KKE + D+ P + ++ N ++ Sbjct: 73 DIPDSWEWVRLGDVGLLKNGKTPKKEDTSS---DNIYPYFKVKDMNNNNLYMENVK--NW 127 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 K S+++ P++ +I +G + K G L P + FI + Sbjct: 128 VGEKYSRQVMPKNTIIFPKNGGAILTAKKRILSQDSLVDLNT--GGLIPYNDLNHKFIFY 185 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S ++ + G+ + I +P+PPL EQ IA K+ L A + ++ Sbjct: 186 LFLSLDIKDFV----KGSAVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVES 241 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV---------------------- 222 +Q ++ + VL A+ GKL E+ + EP + Sbjct: 242 STQQYAKLQTLLKSKVLDLAMRGKLVEQDPHDEPASVLLEKIKAEKRKMIKEKEIKKSKP 301 Query: 223 ----------------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 ++ + +I + +G + P S G ++ +++ G +D + Sbjct: 302 LPPITDEEKPFDIPDSWEWVRLGNIAKRITDGTHNPPPNSHEGKQVISAINIKKGKIDFS 361 Query: 267 
-DIRFLECSE--SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 RF+ + E R ++ GD+L T +G ++ + +I Sbjct: 362 LSNRFVSEDQFLKEDKRTNIRKGDVLLTIVGS----LGNAAVVDTDKLFTAQRSVAVI-- 415 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 + + L +++ S + + K + QKGI + + + LPP+ EQ IV Sbjct: 416 --SSNILSKFLYYVLISAMFKTQIFANAK-GTTQKGIYLSKLINLKLPLPPLAEQNRIVD 472 Query: 384 RVEQLF 389 +++ LF Sbjct: 473 KIDNLF 478 Score = 117 bits (293), Expect = 9e-25, Method: Composition-based stats. Identities = 56/276 (20%), Positives = 107/276 (38%), Gaps = 29/276 (10%) Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 ++ KA EQ+ + K + + ++ P + +L +L N Sbjct: 39 LLEKIKAEKEQLIKEKKIKK----SKPLAPITDDEKPFDIPDSWEWVRLGDVGLLK---N 91 Query: 238 GLSSKPNE--SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 G + K + S +P ++ + ++ +++ + +R + ++F + NG Sbjct: 92 GKTPKKEDTSSDNIYPYFKVKDMNNNNLYMENVKNWVGEK--YSRQVMPKNTIIFPK-NG 148 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 +L + +L + D ++I F S ++ + S Sbjct: 149 GAILTAKKRILSQDSLVDLNTGGLI----PYNDLNHKFIFYLFLSLDIKDFVK-----GS 199 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 I+ K +K +V LPP++EQ+ I ++ QLFA +E L +L Sbjct: 200 AVPTINSKKLKETLVPLPPLEEQSRIAAKIAQLFALLRKVESSTQQYAKLQTLLKSKVLD 259 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 A RG+L Q + P A+ LLEKIKAE+ Sbjct: 260 LAMRGKLVEQDPHDEP--------ASVLLEKIKAEK 287 Score = 51.6 bits (122), Expect = 6e-05, Method: Composition-based stats. Identities = 17/47 (36%), Positives = 22/47 (46%), Gaps = 8/47 (17%) Query: 406 VNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 L + IL A G+L Q + P A+ LLEKIKAE+ Sbjct: 10 AQALREKILDLAMHGKLVPQDPNDEP--------ASVLLEKIKAEKE 48 >UniRef50_C9KLK0 Putative phosphoribosylformylglycinamidine synthase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KLK0_9FIRM Length = 489 Score = 200 bits (508), Expect = 1e-49, Method: Composition-based stats. 
Identities = 78/441 (17%), Positives = 148/441 (33%), Gaps = 65/441 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +PE WV + + + T+K K++ +P + NI N K D +++ ++ Sbjct: 66 DIPENWVWTRLEEILLSLTDGTHK----TPVYKNEGIPFLSVKNISNHKIDFSNIKYISI 121 Query: 65 NLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + K E DI+++ G E S +L+ I + + Sbjct: 122 DEHKKLCERCYPKKGDILLSK----VGTTGIPVIIDTEKEFSIFVSVALLKFSSSIDAKY 177 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +S L + + + + G N +P+PPLAEQ I K++ L +D+ Sbjct: 178 LLFLLESPLVQEQCRTHTRGIGNKNWVLTDIANTIVPLPPLAEQHRIVAKIEELQPDIDA 237 Query: 182 TKARFEQIPQILKRF----RQAVLGGAVNGKLTEKWRN---------------------- 215 ++ I + F ++++L A+ GKL + + Sbjct: 238 YDKAQTKLQSIEQSFPDAMKKSLLQYAIEGKLVPQRKEEGTAKDLLAKIRAEKARLVKEK 297 Query: 216 ------------------FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS 257 P + +L P G P L+ Sbjct: 298 KIKKSKPLPAITDDEKPFDIPDSWEWVRLGELGEWCSGATPSRQHPEYFGGKIPWLKTGD 357 Query: 258 VRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 + G++ + + + G +L Y + +G G+LK N Sbjct: 358 LNDGYIKEVPEYITDDGFKNSSTKINPIGSVLIAMYGAT---IGKLGILKIPATTN---- 410 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 L + +Y+ F + R + + Q IS I + V+ LPP+ E Sbjct: 411 QACCACELVHEMYNKYLFYFLFAN--RKYFIKKGAGGA-QPNISKAKITNTVMPLPPLAE 467 Query: 378 QAEIVRRVEQLFAYADTIEKQ 398 Q IV ++E+L + Q Sbjct: 468 QYRIVAKLEELLPLCQQLASQ 488 Score = 151 bits (381), Expect = 6e-35, Method: Composition-based stats. 
Identities = 61/297 (20%), Positives = 117/297 (39%), Gaps = 26/297 (8%) Query: 165 QKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT---EKWRNFEPQHS 221 Q+ I KL + + K +I R + + + P++ Sbjct: 12 QRAIEGKLVPQRKEEGTAKELLAEIRAEKARLIKEKKIKKQRKLIPISMDDLPFDIPENW 71 Query: 222 VFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE--LN 279 V+ E IL L +G P G P L + ++ +D ++I+++ E + Sbjct: 72 VWT--RLEEILLSLTDGTHKTPVYKNEGIPFLSVKNISNHKIDFSNIKYISIDEHKKLCE 129 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 R + GD+L ++ + G+ ++ + ++ L++ + +Y+ Sbjct: 130 RCYPKKGDILLSKVGTT----GIPVIIDTEKEFSIFVSVALLKF--SSSIDAKYLLFLLE 183 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 SP + G K DI + +V LPP+ EQ IV ++E+L D +K Sbjct: 184 SPLVQEQCRTH-TRGIGNKNWVLTDIANTIVPLPPLAEQHRIVAKIEELQPDIDAYDKAQ 242 Query: 400 N--NALARV--NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 ++ + + + +S+L A G+L Q E +A LL KI+AE+A Sbjct: 243 TKLQSIEQSFPDAMKKSLLQYAIEGKLVPQ--------RKEEGTAKDLLAKIRAEKA 291 Score = 48.9 bits (115), Expect = 4e-04, Method: Composition-based stats. Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 8/48 (16%) Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 +L SIL +A G+L Q E +A LL +I+AE+A Sbjct: 2 NAQDLKNSILQRAIEGKLVPQ--------RKEEGTAKELLAEIRAEKA 41 >UniRef50_A6Y5S9 Restriction endonuclease S subunit n=1 Tax=Vibrio cholerae RC385 RepID=A6Y5S9_VIBCH Length = 437 Score = 200 bits (508), Expect = 1e-49, Method: Composition-based stats. 
Identities = 90/441 (20%), Positives = 177/441 (40%), Gaps = 41/441 (9%) Query: 4 GKLPEGWVIAPVSTVTT--LIRGVTYKKEQAINYLKDDYLPLIRA---NNIQNGKFDTTD 58 GK+P W + P + + + + K E+ ++ + + + ++R ++ N K Sbjct: 16 GKIPSHWKLLPCRAIVDNQVEKNDSGKIEEYLSLMAN--IGVVRYEEKGDVGNKK----- 68 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI- 117 P++L K + + ++VI + + G S PF VL P++ I Sbjct: 69 ----PEDLTK-CKLVKQGNLVINSMNYAIGSYGMS-----PFNGVCSPVYIVLEPKEQIV 118 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + ++ + ++ L G + IK +P+PPL EQ+ I LD Sbjct: 119 ERRYALRLFENKPMQKHLAQLGNGILQHRAAIKWDDIKPQAVPVPPLEEQRAILYFLDRE 178 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 ++DS A ++LK RQA++ V L +W P+H K+ Sbjct: 179 TQRIDSLIAEKLTFIKLLKEKRQALISHIVTKGLNPNVEMQDSGIEWIGQVPKHWGISKV 238 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR-FLECSESELNRHKLQD 285 + +NG++ G G P + V ++ + +E + + + + Sbjct: 239 RYLGQC---QNGINIGGEFFGHGTPFVSYGDVYNNTSLPEKVQGLVLSTEKDRDNYSVIA 295 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSAR 344 GD+LFTR + ++E +G + K Q ++ LIR R + L + E +F + R Sbjct: 296 GDVLFTRTSETIEEIGFSAVCKSTIEQ-AVFAGFLIRFRPDEGNLEVGFSEYYFRNEKLR 354 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + + +S +K VLLPP+ EQ EI ++ I + + Sbjct: 355 AFFAKEMNLVT-RASLSQDLLKKMPVLLPPIDEQNEIANYLQAECNKFSEIFAETEKTIL 413 Query: 405 RVNNLTQSILAKAFRGELTAQ 425 + S+++ A G++ + Sbjct: 414 LLKERRTSLISAAVTGKIDVR 434 >UniRef50_C5BH70 Restriction modification system DNA specificity domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BH70_EDWI9 Length = 441 Score = 199 bits (507), Expect = 1e-49, Method: Composition-based stats. 
Identities = 76/454 (16%), Positives = 162/454 (35%), Gaps = 48/454 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W I + + + G YK Q DD P++ + G+F F Sbjct: 19 GLVPESWTICRLKNLAAIKNGQDYKSVQ-----TDDGYPVMGSG----GQFT-----FAS 64 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 K + + + G K + K + + PF + L + + ++ Sbjct: 65 KFMYDKPSVLL----------GRKGTIDKPLYINEPFWTVDTMYYTEL--NEGFDARYLY 112 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLA-EQKIIAEKLDTLLAQVDST 182 + + + S S + ++ +P E+K I + LD A++D+ Sbjct: 113 YLALTIQF----SRYSTNTALPSMTQEHLSNYKFSVPKAESERKKITKFLDHETAKIDNL 168 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + +Q+ ++LK R AV+ AV L +W P+H L + Sbjct: 169 IEKQQQLIELLKEKRHAVISHAVTKGLNPDVPMKDSGVEWLGEVPEHWTISTLKHHAKFI 228 Query: 234 ELRNGLSSKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLF 290 + G + G L ++ ++ +D ++ + + LNR K +GD+ Sbjct: 229 DGDRGSEYPNDNDLVDDGVVFLSSKNISNWEINIDDANYISREKFNRLNRGKAINGDV-I 287 Query: 291 TRYNGSLEFVGVCGLLKKLQHQN--LLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 + GS +G + + + +++ RL ++ + Sbjct: 288 VKVRGSTGRIGELAIFETERLNKSTAFINAQMMIIRLKNSFNNRFLCNVAQGHYWMEQL- 346 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 N + Q+ ++ ++++PP+ EQ I + +E D + K +N + + Sbjct: 347 NVGAYGTAQQQLNNAIFSGMIMVVPPIDEQLTINKFLELEIKRFDGLIKNTSNMIQLIQE 406 Query: 409 LTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 ++++ A G++ + PD E Sbjct: 407 RRTALISAAVTGKIDVRDWVA-PDTQEAEEPQEV 439 >UniRef50_A8RUN3 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8RUN3_9CLOT Length = 375 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. 
Identities = 73/411 (17%), Positives = 157/411 (38%), Gaps = 39/411 (9%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 + T+ + +G K ++ ++D +LP + + G D ++ Sbjct: 4 VELGTILHMEKGK--KPQKQSKEIEDGFLPYVDIKAFEKGIID-------SYASPEKCVL 54 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 D++I + G++ + G+ + + + ++ +F +S Sbjct: 55 CDDGDLLIVCDGSRSGLTGRA------IKGVVGSTLSKISADG-LTREYLRYFIQSKY-- 105 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 +++ G ++ + IP L EQ+ I +++ L +++D + Q Sbjct: 106 TLLNTQKKGTGTPHLNAQILKQSKLIIPSLPEQERIVARIEELFSELDKAVETLKTTKQQ 165 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI 252 L +RQAVL A + T + + K L+ K G+ Sbjct: 166 LAVYRQAVLKEAFSCADTFEPFGSIMTSRLGKMLD--------------KEKNVGLPEQY 211 Query: 253 LRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ 312 +R +VR D +D+ + E+ ++ ++ GDL+ G C + ++ Sbjct: 212 IRNINVRWFSFDLSDLLKMRIETKEIEKYSIKYGDLIICEGGEP----GRCAVWD--RND 265 Query: 313 NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL 372 ++ Y L R R P+ + S + T +G K ++G+ + V + Sbjct: 266 SIFYQKALHRVRFKNGENPKLYMYYLWFISQTGELEKYF-TGTGIKHLTGQSLLKVPVPI 324 Query: 373 PPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 + +Q +V ++E + + IEK + +L + + QSIL +AF G L Sbjct: 325 ISISKQNTVVLKIESQLSVCNQIEKMIEQSLQQAEAMRQSILKQAFEGRLV 375 >UniRef50_A0L1U2 Restriction modification system DNA specificity domain n=1 Tax=Shewanella sp. ANA-3 RepID=A0L1U2_SHESA Length = 425 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. 
Identities = 81/432 (18%), Positives = 171/432 (39%), Gaps = 31/432 (7%) Query: 6 LPEGWVIAPVSTVTTLI---RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 +P+ W + P+ +V + RG T KK + + ANN+Q G+ D ++ Sbjct: 5 VPDNWNVLPLGSVIKQVIDFRGRTPKK--LGMEWGGGNIRALSANNVQMGRVDFNKECYL 62 Query: 63 PKNLVKESQKIS----PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LI 117 + + + DI+ M ++ +G A +L+ +K Sbjct: 63 ASDELYDKWMTKGTTEVGDILFTM----EAPLGNIALVPNDDRYILSQRVILLKNDKSKA 118 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 S F+ +S +++ + + G I+ +++ +PPL EQ+ IA+ L ++ Sbjct: 119 SSDFLFQQLRSDSFQDTLRENATGTTAQGIQQKRLVTLDVVLPPLPEQQKIAKILTSVDE 178 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAV--NGKLTEKWRNFE--PQHSVFKKLNFESILT 233 ++ T+A+ +++ + Q +L V +GK ++++ + + +++ Sbjct: 179 VIEKTQAQIDKLKDLKTGMMQELLTQGVGIDGKPHTEFKDSPVGRIPKAWNCVTLKNLSK 238 Query: 234 ELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLF 290 + +G S G P L +S VR G++D FL EL K ++GD+L+ Sbjct: 239 RITDGTHQTVKTSPDGTIPFLYVSCVRDGNIDWEKASFLTEEMYELASKGRKPENGDILY 298 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 T G ++ + I+ + E++ F +SP + + Sbjct: 299 TAVGSY----GHAAIVSGDNRFSFQRHIAFIQPN-HEKIDSEFLVSFLNSPLGKKQ-ADL 352 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + Q ++ D+ V LP + EQ I ++F D V L + N Sbjct: 353 YAIGNAQLTVTLGDLGKFKVALPDIAEQQRIA----KIFNGIDNRIIVVQRKLTSLGNTK 408 Query: 411 QSILAKAFRGEL 422 ++++ G++ Sbjct: 409 KALMQDLLTGKV 420 Score = 123 bits (308), Expect = 2e-26, Method: Composition-based stats. 
Identities = 37/208 (17%), Positives = 80/208 (38%), Gaps = 11/208 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+ W + ++ I T+ Q + D +P + + +++G D F+ Sbjct: 221 VGRIPKAWNCVTLKNLSKRITDGTH---QTVKTSPDGTIPFLYVSCVRDGNIDWEKASFL 277 Query: 63 PKNLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIF 118 + + + + +K DI+ G +A SF ++P + I Sbjct: 278 TEEMYELASKGRKPENGDILYTA----VGSYGHAAIVSGDNRFSFQRHIAFIQPNHEKID 333 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F+ F S L + + + G + + +P +AEQ+ IA+ + + + Sbjct: 334 SEFLVSFLNSPLGKKQADLYAIGNAQLTVTLGDLGKFKVALPDIAEQQRIAKIFNGIDNR 393 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVN 206 + + + + K Q +L G V Sbjct: 394 IIVVQRKLTSLGNTKKALMQDLLTGKVR 421 >UniRef50_Q1VR15 Type I restriction-modification enzyme 1, S subunit n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VR15_9FLAO Length = 441 Score = 199 bits (506), Expect = 2e-49, Method: Composition-based stats. Identities = 81/422 (19%), Positives = 157/422 (37%), Gaps = 21/422 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL--VF 61 G +PE W + + + +G K+ + + LP +R I T + Sbjct: 32 GWIPEDWNVKSLDQLGEFSKGKGITKKDILED-EVGGLPCVRYAEIYTIYHYNTTVLKSK 90 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + S I+ DI+ A S + +GKS G +L+ F Sbjct: 91 INQESAANSNPINCGDILFAGSGETLEDIGKSIAYLNKETAYAGGDICILKHHNQ-DPQF 149 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + + R+++ + G ++ +I + +++PIPPL EQ+ IA L+T D Sbjct: 150 LGYLFNNDVVRSQLYKIGQGHSVVHIYSSGLKKVSVPIPPLPEQQKIASILNTW----DK 205 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 A E++ + + ++ + GK + F +++ + I+ L Sbjct: 206 AIAAQEKLIAQKQALKNGLMQQLLTGK-----KRFAGFVEEWEEKSLNDIVKYLGGEAFK 260 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEF 299 N+ G L+I++V G V D E ++ L+ GD + L Sbjct: 261 STNQVENGVRWLKIANVGIGVVKWGDSTTFLPTSFIDENPKYVLKAGDAVMALTRPILND 320 Query: 300 
VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 + K LL ++ + ++I +P MN + + Sbjct: 321 KLKIAVFNKEDGIALL-NQRVAKLISKNKNDLKFIYYIHQTPYFI-YTMNAMMAGTDPPN 378 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 IS KD+ + V +P +EQ +IV +E D + N + Q ++ + Sbjct: 379 ISIKDLAKKKVFIPGYEEQKKIVSVIESFDNEIDNLI----NKGKHLKKQKQGLMQQLLT 434 Query: 420 GE 421 GE Sbjct: 435 GE 436 Score = 84.7 bits (208), Expect = 6e-15, Method: Composition-based stats. Identities = 37/217 (17%), Positives = 79/217 (36%), Gaps = 12/217 (5%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP--NESGVGHPILRISSVRA-GHVD 264 + + P+ K L+ ++ G++ K + G P +R + + H + Sbjct: 25 GYKKTKLGWIPEDWNVKSLDQLGEFSK-GKGITKKDILEDEVGGLPCVRYAEIYTIYHYN 83 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 ++ ES N + + GD+LF +LE +G + + D I Sbjct: 84 TTVLKSKINQESAANSNPINCGDILFAGSGETLEDIGKS-IAYLNKETAYAGGDICILKH 142 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 +D P+++ F++ R+ + + I +K V +PP+ EQ +I Sbjct: 143 HNQD--PQFLGYLFNNDVVRSQLYK-IGQGHSVVHIYSSGLKKVSVPIPPLPEQQKIASI 199 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + D +A+ L ++ + G+ Sbjct: 200 LNT----WDKAIAAQEKLIAQKQALKNGLMQQLLTGK 232 >UniRef50_A7JK69 Type I restriction-modification system n=1 Tax=Francisella novicida GA99-3548 RepID=A7JK69_FRANO Length = 394 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. 
Identities = 83/428 (19%), Positives = 168/428 (39%), Gaps = 42/428 (9%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-D 58 MS +LP+GW + +T+ + GV K Y + + +I I+ G + Sbjct: 1 MSNSELPKGWKAIELGEITSYVNRGVAPK------YTDEHGITVINQKCIREGNINLELA 54 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLI 117 V P +++ DI+I + G+ ++R ++ Sbjct: 55 RVHNPDKKYTAEKQLHLGDILINSTG--VGTAGRVGIFTDSINAIVDTHVSIVRLNKEYA 112 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + F+ + + ++ + G+ +K + +NI +P L EQK IA+ L +L Sbjct: 113 YPKFVYYNLR--FREKELEETAEGSTGQIELKRDAIKSLNILLPQLTEQKAIADVLSSLD 170 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 ++D + Q L+ Q + K E W S + + Sbjct: 171 DKID----LLHKQNQTLEDMAQTLFREWFIEKADEGWEEMPL-----------SEVCSVT 215 Query: 237 NGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELN-RHKLQDGDLLFTRYN 294 G + K + +G P+++I ++ GH+D ND++F++ SES++ +++L D D++ Sbjct: 216 AGYAFKSKDFVDIGVPVVKIKNISNGHIDYNDLQFIDISESDVESKYRLYDNDIVMAMTG 275 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 + +G GL+ +H LL ++ R AL + +S N ++N Sbjct: 276 AT---IGKIGLVSTFEHDYLLLNQRVAVLRSNHQAL---LWFMLNSLDLENEILNL-SNG 328 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 + Q IS I + + + V +F +KQ+ + + ++L Sbjct: 329 AVQANISSTSIGQVPIPGMSNQMMQKFNNAVHPMFEKIQQNKKQIKS----LEQTRDTLL 384 Query: 415 AKAFRGEL 422 K G++ Sbjct: 385 PKLMSGQV 392 Score = 118 bits (296), Expect = 4e-25, Method: Composition-based stats. 
Identities = 46/203 (22%), Positives = 84/203 (41%), Gaps = 14/203 (6%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP--KN 65 EGW P+S V ++ G +K + + D +P+++ NI NG D DL F+ ++ Sbjct: 201 EGWEEMPLSEVCSVTAGYAFKSKDFV----DIGVPVVKIKNISNGHIDYNDLQFIDISES 256 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 V+ ++ DIV+AM+ + +G + + VLR + Sbjct: 257 DVESKYRLYDNDIVMAMTGATIGKIGLVSTFEHDY-LLLNQRVAVLRSNHQAL---LWFM 312 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQ--KIIAEKLDTLLAQVDSTK 183 S N+I +LS GA NI S +PIP ++ Q + + + ++ K Sbjct: 313 LNSLDLENEILNLSNGAVQANISSTSIGQ--VPIPGMSNQMMQKFNNAVHPMFEKIQQNK 370 Query: 184 ARFEQIPQILKRFRQAVLGGAVN 206 + + + Q ++ G V Sbjct: 371 KQIKSLEQTRDTLLPKLMSGQVR 393 >UniRef50_D0WYM6 Putative uncharacterized protein n=1 Tax=Vibrio alginolyticus 40B RepID=D0WYM6_VIBAL Length = 371 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 94/413 (22%), Positives = 172/413 (41%), Gaps = 46/413 (11%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 + + ++ R +K A L D Y+ NGK + + + Sbjct: 1 MVKLDSIC---RPKQWKTIAASQLLDDGYVVYG-----ANGKI-----GYYSEYTHENPT 47 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGV--LRPEKLIFSGFIAHFTKSS 129 ++I + V H P G + + PE+ + ++ + Sbjct: 48 ------VMITCRGATCGNV----HISEPKAYINGNAMALDDVDPER-VDINYLRYCLIDR 96 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 +R+ I +G+ I + IP+PPL QK IAE L + D + +Q+ Sbjct: 97 GFRDVI----SGSAQPQITGKGLSKVQIPLPPLETQKQIAEVL----EKADQLRKDCQQM 148 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG 249 Q L Q+V +T P+ K L+ + ++SK + + Sbjct: 149 EQELNSLAQSVFIDMFGDPVTN------PKGWDLKPLSSLGEVKGGLQ-VTSKRAANPIS 201 Query: 250 HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 P LR+++V H++ ++++ + +E+EL R L+ GD+LF +G+ VG + Sbjct: 202 VPYLRVANVYRDHLELDEVKEIRVTENELERVLLEKGDVLFVEGHGNANEVGRTAVWNDE 261 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 Q 
++ + LIR R D PEY+ F +S S + ++ KTTSG +S +IKS Sbjct: 262 VAQ-CVHQNHLIRFRPGADVRPEYVSAFVNSASGKRQLLKMSKTTSGLNTLSTSNIKSIQ 320 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 VL+PP+ EQ + + + A + + ++ +++ KAF+GEL Sbjct: 321 VLVPPLLEQDDFLAFL----ASCKAQQVVNDQLSVELDQNFNALMQKAFKGEL 369 Score = 112 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 90/207 (43%), Gaps = 10/207 (4%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+GW + P+S++ + G+ ++A N + +P +R N+ + ++ + Sbjct: 171 PKGWDLKPLSSLGEVKGGLQVTSKRAANPIS---VPYLRVANVYRDHLELDEVKEIRVTE 227 Query: 67 VK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAH 124 + E + D++ G+ + VG++A + +C RP + +++ Sbjct: 228 NELERVLLEKGDVLFVEGHGNANEVGRTAVWNDEVAQCVHQNHLIRFRPGADVRPEYVSA 287 Query: 125 FTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 F S+ + ++ +S + +N + ++ I + +PPL EQ L + AQ + Sbjct: 288 FVNSASGKRQLLKMSKTTSGLNTLSTSNIKSIQVLVPPLLEQDDFLAFLASCKAQ----Q 343 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLT 210 +Q+ L + A++ A G+L Sbjct: 344 VVNDQLSVELDQNFNALMQKAFKGELN 370 >UniRef50_C2I227 Restriction modification system DNA specificity domain n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I227_VIBCH Length = 434 Score = 198 bits (503), Expect = 4e-49, Method: Composition-based stats. 
Identities = 88/434 (20%), Positives = 171/434 (39%), Gaps = 36/434 (8%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + P + T ++ +K+ + + N D DL + + Sbjct: 16 WNLVPAKRLFT-------SSKEINQGMKESNRLALTMKGVINRSLD--DLQGLQSSDYSV 66 Query: 70 SQKISPEDI---VIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFT 126 Q +D+ +I + + S VG + + A+ V I+ F + Sbjct: 67 YQIFEKDDLVFKLIDLENIKTSRVGIVHERGIMSP----AYIRVSACSNSIYPRFYYWYF 122 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 +LY I + G N+ I +P+ ++ QK ++ LD ++DS Sbjct: 123 F-ALYLTNIYNKLGGGVRQNLTAGDLLEIPVPLIDISLQKQVSAFLDRETQRIDSLIEEK 181 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRN 237 + +LK RQA++ V L +W P+H V KK+ ++ + + Sbjct: 182 QTFITLLKEKRQALISHVVTKGLNPNVEMQDSGIEWIGQVPKHWVVKKIKYD--VLGIEQ 239 Query: 238 GLS----SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 G S S P ++++ V G + + L + ++ GDLL +R Sbjct: 240 GWSPQCESTPVPDDHTWGVVKVGCVNRGIFNPEQNKKLPEELEPRKEYAIKKGDLLVSRA 299 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAM-MNCV 351 N E+VG + + NLL DK+ R +L + A PE+ + +S AR + ++ Sbjct: 300 NAK-EWVGSAA-VPDRDYDNLLLCDKIYRIKLDLEKADPEFFAYYLASDQAREQIEIDAT 357 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 T+S I I + + P + EQ IVR ++ + D + +V +++ + Sbjct: 358 GTSSSMLNIGQGTILNMPIPAPELPEQQSIVRGIKNKTSQIDRLMLEVLDSIELLKEHRT 417 Query: 412 SILAKAFRGELTAQ 425 S+++ A G++ + Sbjct: 418 SLISAAVTGKIDVR 431 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. 
Identities = 41/214 (19%), Positives = 90/214 (42%), Gaps = 8/214 (3%) Query: 3 AGKLPEGWVIAPVS-TVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++P+ WV+ + V + +G + + ++ D +++ + G F+ Sbjct: 218 IGQVPKHWVVKKIKYDVLGIEQGWSP-QCESTPVPDDHTWGVVKVGCVNRGIFNPEQNKK 276 Query: 62 VPKNLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC--SFGAFCGVLRPEKLIF 118 +P+ L ++ I D++++ ++ K VG +A ++ + + Sbjct: 277 LPEELEPRKEYAIKKGDLLVSRANA-KEWVGSAAVPDRDYDNLLLCDKIYRIKLDLEKAD 335 Query: 119 SGFIAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 F A++ S R +I + G ++ NI + + IP P L EQ+ I + Sbjct: 336 PEFFAYYLASDQAREQIEIDATGTSSSMLNIGQGTILNMPIPAPELPEQQSIVRGIKNKT 395 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +Q+D ++LK R +++ AV GK+ Sbjct: 396 SQIDRLMLEVLDSIELLKEHRTSLISAAVTGKID 429 >UniRef50_B3E898 Restriction modification system DNA specificity domain n=1 Tax=Geobacter lovleyi SZ RepID=B3E898_GEOLS Length = 447 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 92/461 (19%), Positives = 165/461 (35%), Gaps = 76/461 (16%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT---TDLVFVPK 64 W A + + +I+G T + PL+ + TD V +P Sbjct: 2 SNWKRARIGDLCEIIKGET-----GLASAPPGEYPLVATGADRRSCTTWQFDTDAVCIP- 55 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 +++ + K + +Q F + + ++ + F+ Sbjct: 56 --------------LVSSTGHGKKTLNYVHYQSGKFALGTILAAVIPKDPSVLTARFLHL 101 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + S + L GA ++ + IP+PPL EQ+ + + + + + Sbjct: 102 YL-SHFKDTVLVPLMKGAANVSLSMKEIASVKIPVPPLDEQQSLIDLIFRIEDEHQELLT 160 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP-------------------------- 218 +LK+ RQA+L AV G+LT WR P Sbjct: 161 ETNHQGVLLKQLRQALLQEAVAGELTTAWRKQHPVAKGDPQYDAAALLAQIKAEKERLVK 220 Query: 219 ---------------------QHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS 257 + + + G S K + G P+LR+ + Sbjct: 221 EGKIRKEKPLPPITDEDKPFDLPEGWGWCRLGEVADGFQYGSSVKSLKEG-KVPVLRMGN 279 Query: 258 
VRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 ++ G +D +++ + + E+ ++++ +GDLLF R N S E VG GL + ++ Sbjct: 280 IQCGKIDWSNLVYTNDT-GEIRKYRVTNGDLLFNRTN-SRELVGKTGLFDGMYE--AIFA 335 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 L+R + Y +S R GQ I+ ++ LPP+ E Sbjct: 336 GYLVRVTMLGGISATYSNGVLNSKFHREWCDANKTDALGQSNINATKLRDYFFPLPPLAE 395 Query: 378 QAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 Q IV RV+ L A D +EKQV + L Q++L +AF Sbjct: 396 QQAIVARVDSLMATIDELEKQVAERKEQAQLLMQTVLREAF 436 Score = 153 bits (387), Expect = 1e-35, Method: Composition-based stats. Identities = 49/203 (24%), Positives = 89/203 (43%), Gaps = 7/203 (3%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 LPEGW + V G Y + LK+ +P++R NIQ GK D ++LV+ Sbjct: 241 DLPEGWGWCRLGEVAD---GFQYGSS--VKSLKEGKVPVLRMGNIQCGKIDWSNLVYTND 295 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 +++ D++ ++ S+ +VGK+ +E F + + I + + Sbjct: 296 TGEIRKYRVTNGDLLFNRTN-SRELVGKTGLFDGMYEAIFAGYLVRVTMLGGISATYSNG 354 Query: 125 FTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 S +R + A +NI P+PPLAEQ+ I ++D+L+A +D + Sbjct: 355 VLNSKFHREWCDANKTDALGQSNINATKLRDYFFPLPPLAEQQAIVARVDSLMATIDELE 414 Query: 184 ARFEQIPQILKRFRQAVLGGAVN 206 + + + + Q VL A + Sbjct: 415 KQVAERKEQAQLLMQTVLREAFD 437 Score = 102 bits (253), Expect = 4e-20, Method: Composition-based stats. 
Identities = 32/132 (24%), Positives = 64/132 (48%), Gaps = 3/132 (2%) Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ ++ S ++ ++ + + +S K+I S + +PP+ EQ ++ + ++ Sbjct: 97 RFLHLYLS--HFKDTVLVPLMKGAANVSLSMKEIASVKIPVPPLDEQQSLIDLIFRIEDE 154 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGEN-SAAALLEKIKAE 450 + + N+ + L Q++L +A GELT WR ++P AAALL +IKAE Sbjct: 155 HQELLTETNHQGVLLKQLRQALLQEAVAGELTTAWRKQHPVAKGDPQYDAAALLAQIKAE 214 Query: 451 RAASGGKKASRK 462 + + RK Sbjct: 215 KERLVKEGKIRK 226 >UniRef50_Q4HNY2 Type I restriction-modification system specificity subunit, putative n=1 Tax=Campylobacter upsaliensis RM3195 RepID=Q4HNY2_CAMUP Length = 427 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 83/439 (18%), Positives = 179/439 (40%), Gaps = 41/439 (9%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD---TTDL 59 GK+P W + + + + + +D++ ++ QNG + TT+ Sbjct: 5 GGKIPAHWEVRRLKYLFYISK----------EESRDEFPNVLSLT--QNGIIERDITTNK 52 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + +N + + + DI++ S V KS FE +RP + + Sbjct: 53 GQLAQNYIGYN-IVKRGDIILNPMDLSSGYVAKST-----FEGVISQAYIKIRPLETLNL 106 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINN---IKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + +F ++ + + L G + ++ + F I IP+PPL EQK IAE LD Sbjct: 107 SYYENFFQNLYHYKILWHLGKGISYDHRWTLGNDVFLNIKIPLPPLQEQKEIAEFLDKKC 166 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 ++ + + +++ +L+ +QA++ A+ L +W P+H KKL Sbjct: 167 EKIQNYINKKQKLITLLQEKKQALINEAITKGLNPNIEFKNSGIEWLGEIPKHWEIKKLK 226 Query: 228 FESILTELRNGLSSK---PNESGVGHPILRISSVRAG-HVDQNDIRFLECSESELNRHKL 283 + + G + K P + ++V ++ N + ++ E ++K+ Sbjct: 227 YIGEIFGGVIGKTIKDFSKEYKPNFKPYITFTNVCNNAIINPNSMEYVFIDFDE-KQNKV 285 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 D+LF + + + E VG + L + R+ ++A P Y+ SS S Sbjct: 286 LKNDILFLQSSETFEDVGKSAIY--LNDDEVYLNTFCKGFRIEREAYPMYLNYLLSSLSY 343 Query: 344 
RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + M+ V + + + + ++LPP++EQ EI +++ ++ ++ + Sbjct: 344 KRYFMS-VCSGFTRINLRQEHFLDIPLILPPLQEQKEIAEFLDEKCKKINSAIEKTKKQI 402 Query: 404 ARVNNLTQSILAKAFRGEL 422 V +++ +A G + Sbjct: 403 EFVREYKNTLINEAVCGRI 421 >UniRef50_B6VTA2 Putative uncharacterized protein n=1 Tax=Bacteroides dorei DSM 17855 RepID=B6VTA2_9BACE Length = 429 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. Identities = 80/416 (19%), Positives = 151/416 (36%), Gaps = 36/416 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP GW + + + G T + Y + + + +++ + + K Sbjct: 28 LPNGWEWCNLEDIVSFGGGKTPSMDNK-EYWDNGNHLWVTSKDMKYSYITNSLMKITDKA 86 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 L E I + ++ ++ S + L + + P S ++ Sbjct: 87 L--EVMTIYEKGTLLVVTR-SGILRHTLPLSILEKPATVNQDLKTISPHIQELSEYLYVV 143 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 K++ + G +++I F + IP+ P+AEQK I + A +D + Sbjct: 144 IKANEHFILKEYHKDGTTVDSIDFDKFRCLPIPLAPIAEQKRIIVETKRWFALIDQVEQG 203 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS---- 241 + +K+ + +LG A++GKL + N EP + K++N T NG Sbjct: 204 KVDLQTTIKQAKSKILGLAIHGKLVPQDLNDEPAIELLKRINP--DFTPCDNGHYPVGWI 261 Query: 242 -----------------KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 N+ G+ L S+V D I+ + ESELN+ + Sbjct: 262 ETILGELFSHNTGKALNSSNKEGIFKDYLTTSNVYWNKFDFTAIKQMPFKESELNKCTVT 321 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 GDLL +G + ++ + + R R D + F+ Sbjct: 322 KGDLLVCEGG----DIGRSAIW--NYDYDICIQNHIHRLRPKIDLCVPFYYYTFAYLKEN 375 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 N + G S + + LPP+ EQ IV+++E+LF+ D I+ + Sbjct: 376 NLIGGKGIGLLGL---SSNALHKIEMPLPPLAEQQRIVQKIEELFSVLDNIQNALE 428 Score = 105 bits (263), Expect = 2e-21, Method: Composition-based stats. 
Identities = 37/187 (19%), Positives = 72/187 (38%), Gaps = 11/187 (5%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G P GW+ + + + G KD + +N+ KFD T + Sbjct: 252 DNGHYPVGWIETILGELFSHNTGKALNSSNKEGIFKD----YLTTSNVYWNKFDFTAIKQ 307 Query: 62 VP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 +P K ++ D+++ +G+SA + ++ LRP+ + Sbjct: 308 MPFKESELNKCTVTKGDLLVC----EGGDIGRSAIWNYDYDICIQNHIHRLRPKIDLCVP 363 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F + N I + + + I +P+PPLAEQ+ I +K++ L + +D Sbjct: 364 FYYYTFAYLKENNLIGGKGI--GLLGLSSNALHKIEMPLPPLAEQQRIVQKIEELFSVLD 421 Query: 181 STKARFE 187 + + E Sbjct: 422 NIQNALE 428 Score = 85.5 bits (210), Expect = 4e-15, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 80/244 (32%), Gaps = 16/244 (6%) Query: 205 VNGKLTEKWRNFEPQHSVFKKL-NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV 263 T + P + L + S + +K H + ++ ++ Sbjct: 16 FFPSDTPHYPYLLPNGWEWCNLEDIVSFGGGKTPSMDNKEYWDNGNHLWVTSKDMKYSYI 75 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 + ++ + + + + + G LL +G L +L+K N L Sbjct: 76 TNSLMKITDKALEVMTIY--EKGTLLVVTRSGILRHTLPLSILEKPATVN----QDLKTI 129 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 L EY+ + + + K + I + + L P+ EQ I+ Sbjct: 130 SPHIQELSEYLYVVIKANEHF-ILKEYHKDGTTVDSIDFDKFRCLPIPLAPIAEQKRIIV 188 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ FA D +E+ + + IL A G+L Q + P A L Sbjct: 189 ETKRWFALIDQVEQGKVDLQTTIKQAKSKILGLAIHGKLVPQDLNDEP--------AIEL 240 Query: 444 LEKI 447 L++I Sbjct: 241 LKRI 244 >UniRef50_A5GE25 Restriction endonuclease S subunits-like protein n=2 Tax=Proteobacteria RepID=A5GE25_GEOUR Length = 443 Score = 197 bits (502), Expect = 5e-49, Method: Composition-based stats. 
Identities = 80/438 (18%), Positives = 168/438 (38%), Gaps = 34/438 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDLVFV 62 G +P W + + ++T+ RG + + Y D+ R ++ + + Sbjct: 20 GDVPSHWEVIQIKHLSTVRRGASPRPIDDAKYFDDEGEYAWTRIADVTASEMYLFNAPQR 79 Query: 63 PKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 +L S K+ P + +++ VGK + C F V PE I S F Sbjct: 80 LSDLGSSLSVKLEPGALFLSI----AGTVGKPCITGMK-ACIHDGF--VYFPELKIPSKF 132 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + N+ + I I ++ + I + LD A++D+ Sbjct: 133 LFYVFAGEQAYKGLGKF---GTQLNLNTDTVGGIKIGCTENSQLEKIVQFLDHETAKIDT 189 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + +Q+ ++LK RQAV+ AV L +W P+H F++ Sbjct: 190 LIDKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMKDSGVEWLGEVPEHWDVCLAKFKTH- 248 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLL 289 + +G P+ H + I + G ++ D + K + GD+L Sbjct: 249 -AITDGAHISPDTKNGEHYFVSIKDMCDGLINFEDALLTSKESYKYLVNTGCKPEPGDIL 307 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMM 348 F++ +G + ++ + + LI + K P++ + S + + Sbjct: 308 FSKDG----TIGKT--VVTPENVDFVVASSLIIIKPNLKKLSPQFFDYLCQSCVIQEQVN 361 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + VK + K +S +++ + PP+ EQ I + +++ IE+ NNA+A + Sbjct: 362 SFVK-GAALKRLSIQNLLKVWGVFPPLDEQVVIAKHIDKKLIRYQQIEQTANNAIALMQE 420 Query: 409 LTQSILAKAFRGELTAQW 426 ++++ A G++ + Sbjct: 421 RRTALISAAVTGKIDVRD 438 Score = 68.2 bits (165), Expect = 7e-10, Method: Composition-based stats. 
Identities = 35/239 (14%), Positives = 80/239 (33%), Gaps = 29/239 (12%) Query: 211 EKWRNFEPQHSVFKKLNFESILTELRNGLSSKP------NESGVGHPILRISSVRAGHVD 264 E+W P H ++ L+ +R G S +P + + RI+ V A + Sbjct: 16 EEWLGDVPSHWEVIQIK---HLSTVRRGASPRPIDDAKYFDDEGEYAWTRIADVTASEMY 72 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + KL+ G LF G++ + G+ + + +P+ Sbjct: 73 LFNAPQRLSDLGSSLSVKLEPGA-LFLSIAGTVGKPCITGMKACIHDGFVYFPEL----- 126 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 +++ F+ A + Q ++ + + + +IV+ Sbjct: 127 ---KIPSKFLFYVFAGEQAYKGLGKF----GTQLNLNTDTVGGIKIGCTENSQLEKIVQF 179 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ A DT+ + + + Q++++ A + NPD ++ L Sbjct: 180 LDHETAKIDTLIDKQQQLIKLLKEKRQAVISHAVT-------KGLNPDAPMKDSGVEWL 231 >UniRef50_A5KSY3 Restriction modification system DNA specificity domain n=5 Tax=candidate division TM7 genomosp. GTL1 RepID=A5KSY3_9BACT Length = 335 Score = 197 bits (502), Expect = 6e-49, Method: Composition-based stats. 
Identities = 67/310 (21%), Positives = 143/310 (46%), Gaps = 22/310 (7%) Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ + Y++ + G + ++ I IP P EQK I K++ L + Sbjct: 44 NNKYVKYALNYVDYQSYV----TGTTRLKLNQSALKRIIIPFPDENEQKRIVAKIEELFS 99 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 ++D+ ++ K + Q+++ ++ ++ + E++ Sbjct: 100 EIDNAESAITTASGYYKSYEQSIIDSLF------------AKYEAEAEMVEFGDIAEIKG 147 Query: 238 GLSSKPNESGVGH---PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 G++ G+ P LR+++V+ G++ ++I+ + + EL ++ L +GD+LFT Sbjct: 148 GITKGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELRKYSLMNGDILFTE-G 206 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKT 353 G + +G G + + + ++ + + RAR+ +PEYI + AR+ ++ K Sbjct: 207 GDKDKLGR-GTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYATKTTRARDYFLSKAKQ 265 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 T+ ++ +K+ + P+ +Q EIV + + + K++ A R L QSI Sbjct: 266 TTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKELIVAHHRSKALRQSI 325 Query: 414 LAKAFRGELT 423 LAKAF+GEL Sbjct: 326 LAKAFKGELV 335 Score = 104 bits (260), Expect = 6e-21, Method: Composition-based stats. 
Identities = 42/204 (20%), Positives = 82/204 (40%), Gaps = 7/204 (3%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK-E 69 + + + G+T K + + + P +R N+Q+G ++ + + Sbjct: 135 EMVEFGDIAEIKGGIT--KGRKLRGMPIGETPYLRVANVQDGYLYLDEIKTINVTAEELR 192 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIF-SGFIAHFTK 127 + DI+ G K +G+ H E C R + F +I++ TK Sbjct: 193 KYSLMNGDILFTE-GGDKDKLGRGTIWHGEIELCIHQNHIFRARVDSGQFVPEYISYATK 251 Query: 128 SSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 ++ R+ S + + ++ S + +P PLA+QK I E + T L+++ S + Sbjct: 252 TTRARDYFLSKAKQTTNLASLNMTSLKNLQLPSIPLAQQKEIVESIVTKLSEIKSARKEL 311 Query: 187 EQIPQILKRFRQAVLGGAVNGKLT 210 K RQ++L A G+L Sbjct: 312 IVAHHRSKALRQSILAKAFKGELV 335 >UniRef50_Q6F778 Putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) n=1 Tax=Acinetobacter sp. ADP1 RepID=Q6F778_ACIAD Length = 448 Score = 197 bits (501), Expect = 6e-49, Method: Composition-based stats. Identities = 91/438 (20%), Positives = 172/438 (39%), Gaps = 19/438 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + L + Y ++ D IR +I + D Sbjct: 19 GEIPSHWEVKRMK--FLLSEKLKYGANESAESEDKDQPRYIRITDINDSGTLREDTFKSL 76 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + ++ DI++A S + + C G ++ FI Sbjct: 77 EIEKAQEYLLNDLDILLARSGATVGKSYLHKKDKVNVACYAGYLIRARFNKENYDPQFIN 136 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 F +S Y + I S++ A I N+ ++ + + IP LAEQKIIA+ LD LAQVD+ Sbjct: 137 LFLQSKAYWSWIESVNIQATIQNVSAEKYNDLALSIPSLAEQKIIADFLDKRLAQVDALI 196 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK- 242 A+ E + + L R A++ AV L E + + + L+ LS K Sbjct: 197 AKQETLLEKLAEQRVALISHAVTKGLNPDVEMKESDVVLLGNIPNTWNIKRLKFLLSEKL 256 Query: 243 --------PNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 +E +RI+ + G++ + LE +++ + L D D+L R Sbjct: 257 KYGANESAESEDKENPRYIRITDIDDSGNLKDETFKSLESEKAQ--EYLLDDLDILLARS 314 
Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVK 352 + VG L K Y LIRARL ++ PE++ F S + + + Sbjct: 315 GAT---VGKSYLYKAESVGIACYAGYLIRARLDQENYNPEFVNYFLQSKQYWDWISSINI 371 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + Q +S + + +P ++EQ +++ ++ + + + +N + Sbjct: 372 QATIQ-NVSAEKYNDLTLAIPSLEEQKQLIEYLKNEDEKFNRAISKGKKLVHLLNEYRST 430 Query: 413 ILAKAFRGELTAQWRAEN 430 ++ + G++ Q N Sbjct: 431 LITQVVTGKIDVQNLKVN 448 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 51/239 (21%), Positives = 100/239 (41%), Gaps = 18/239 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVRA-GHVDQNDIR 269 +W P H K++ F + +L+ G + P +RI+ + G + ++ + Sbjct: 16 QWLGEIPSHWEVKRMKFL-LSEKLKYGANESAESEDKDQPRYIRITDINDSGTLREDTFK 74 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD- 328 LE +++ + L D D+L R + VG L KK + Y LIRAR K+ Sbjct: 75 SLEIEKAQ--EYLLNDLDILLARSGAT---VGKSYLHKKDKVNVACYAGYLIRARFNKEN 129 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P++I +F S + + + + + Q +S + + +P + EQ I +++ Sbjct: 130 YDPQFINLFLQSKAYWSWIESVNIQATIQ-NVSAEKYNDLALSIPSLAEQKIIADFLDKR 188 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 A D + + L ++ ++++ A + NPD+ E+ LL I Sbjct: 189 LAQVDALIAKQETLLEKLAEQRVALISHAVT-------KGLNPDVEMKESDV-VLLGNI 239 >UniRef50_UPI0001C42656 hypothetical protein BpOF4_03730 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42656 Length = 443 Score = 197 bits (500), Expect = 8e-49, Method: Composition-based stats. 
Identities = 78/439 (17%), Positives = 172/439 (39%), Gaps = 37/439 (8%) Query: 9 GWVIAPVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKNL 66 W + + V + I ++ + + +D +P + A +++NG + ++ + Sbjct: 15 DWQVMKIKRVLDIPITDGPHETPELL----EDGVPFLSAESVKNGNLNFDLKRGYISQED 70 Query: 67 VKES-QKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGFI 122 ++ +K P +DI + S + + A E S + ++R +K I ++ Sbjct: 71 HEKYIKKCKPQRDDIFMVKSGATTGNI---AMVDTDEEFSIWSPLALIRAKKEIVIPKYL 127 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +F S +R ++ + NI + + I IP L QK I ++ + +D Sbjct: 128 YYFVGSLAFREQVEVSWSYGTQQNIGMKVIENLFISIPSLEIQKRIVRYIEYKVKDIDIL 187 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + + ++L++ RQ++L AV L KW P+H KK+ + Sbjct: 188 IKQKGKFIKLLEQQRQSILTEAVTKGLNPNMNMKDSGVKWIGEIPEHWEVKKVKHFA--- 244 Query: 234 ELRNGLSSKPN-----ESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGD 287 + G P+ G P LR +V + D+ F+ + E+ ++Q D Sbjct: 245 -IHVGSGKTPSGGAEIYLDEGIPFLRSLNVHFDGIHLKDLAFISEEINEEMKTSQVQPLD 303 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L S +G ++ K + RL ++ + Y + N Sbjct: 304 ILLNITGAS---IGRTTIVPK-DFGRANVNQHVCIIRLNQNKVYPYYFNMLMASDVINQQ 359 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPP-VKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + S ++G++ ++ + +PP ++EQ EI + + V + ++ Sbjct: 360 IWFAQNGSSREGLNFAQVRELIFAIPPTLEEQREINEWIYNKQMKIFNLINLVKEQIEKL 419 Query: 407 NNLTQSILAKAFRGELTAQ 425 QS++ +A G++ + Sbjct: 420 KEYRQSLIYEAVTGKIDVR 438 Score = 128 bits (322), Expect = 4e-28, Method: Composition-based stats. 
Identities = 41/213 (19%), Positives = 90/213 (42%), Gaps = 9/213 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + V + G D+ +P +R+ N+ DL F+ Sbjct: 228 IGEIPEHWEVKKVKHFA-IHVGSGKTPSGGAEIYLDEGIPFLRSLNVHFDGIHLKDLAFI 286 Query: 63 PKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLR-PEKLIF 118 + + +E + ++ P DI++ ++ S +G++ F + ++R + ++ Sbjct: 287 SEEINEEMKTSQVQPLDILLNITGAS---IGRTTIVPKDFGRANVNQHVCIIRLNQNKVY 343 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLLA 177 + S + +I G++ + A + IPP L EQ+ I E + Sbjct: 344 PYYFNMLMASDVINQQIWFAQNGSSREGLNFAQVRELIFAIPPTLEEQREINEWIYNKQM 403 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++ + ++ + LK +RQ+++ AV GK+ Sbjct: 404 KIFNLINLVKEQIEKLKEYRQSLIYEAVTGKID 436 Score = 112 bits (279), Expect = 4e-23, Method: Composition-based stats. Identities = 49/239 (20%), Positives = 97/239 (40%), Gaps = 16/239 (6%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQND 267 L E RN K+ + + +G P G P L SV+ G+++ + Sbjct: 3 PLFELERNNVNYDWQVMKIKRVLDI-PITDGPHETPELLEDGVPFLSAESVKNGNLNFDL 61 Query: 268 IR-FLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 R ++ + E + + K Q D+ + + G ++ + ++ P LIRA+ Sbjct: 62 KRGYISQEDHEKYIKKCKPQRDDIFMVKSGAT---TGNIAMVDTDEEFSIWSPLALIRAK 118 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + +P+Y+ F S + R + + Q+ I K I++ + +P ++ Q IVR Sbjct: 119 -KEIVIPKYLYYFVGSLAFREQVEVSWSYGT-QQNIGMKVIENLFISIPSLEIQKRIVRY 176 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 +E D + KQ + + QSIL +A + NP++ ++ + Sbjct: 177 IEYKVKDIDILIKQKGKFIKLLEQQRQSILTEAVT-------KGLNPNMNMKDSGVKWI 228 >UniRef50_C0VJ61 Restriction modification system DNA specificity domain protein n=2 Tax=Acinetobacter RepID=C0VJ61_9GAMM Length = 401 Score = 197 bits (500), Expect = 1e-48, Method: Composition-based stats. 
Identities = 85/403 (21%), Positives = 160/403 (39%), Gaps = 40/403 (9%) Query: 36 LKDDYLPLIRANNI-QNGKFDT--TDLVFVPKNLVKESQK--ISPEDIVIAMSSGS---- 86 K+ + L+ NI + GK D TD + + + Q I D+VIA S + Sbjct: 23 FKESGIKLLNVANITKQGKIDLNKTDRHLSTEEVDSKYQHFLIDEGDLVIASSGITNDED 82 Query: 87 ---KSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGFIAHFTKSSLYRNKISSLSAGA 142 ++ + QHLP + + + F+ H+ S +R +I+ G Sbjct: 83 NLLRTKIAFIEKQHLPL--CLNTSTIRFKAKDGVSDLKFLKHWLNSLEFRQQITKEVTGI 140 Query: 143 NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLG 202 N P+ I I +PPL EQ+ IA LD + E++ Q+L+ + G Sbjct: 141 AQKNFGPSHLKKIKISLPPLTEQRRIASILDQADELRQKRQQAIEKLDQLLQATFIDMFG 200 Query: 203 GAVNG--KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRA 260 V+ ++ + + K L+ + +E+ + LR ++V+ Sbjct: 201 DPVSNPKGWDLRYVGEISESKLGKMLDKKKQSSEIDQ------------YKYLRNANVQW 248 Query: 261 GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL 320 D +D+ +E +E + +L+ GD+L G + K +N + L Sbjct: 249 FRFDLSDVFEMEFNEKDRKNCELKFGDVLVCEGGEP----GRAAIWKN-DLENCFFQKAL 303 Query: 321 IRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQA 379 R RL LPEY F S + + T + ++G +K+ + +PP+ Q Sbjct: 304 HRVRLDMTQILPEYFVWLFWFYSKNGGFDDHI-TVATIAHLTGVKMKAMQIPIPPLSLQE 362 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + ++V + + ++ + N+ +L S+ +AF G L Sbjct: 363 DFQQKVNE----IEVLKTTLENSSKLFESLFSSLQNQAFNGTL 401 Score = 87.8 bits (216), Expect = 7e-16, Method: Composition-based stats. 
Identities = 37/206 (17%), Positives = 79/206 (38%), Gaps = 13/206 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP-KN 65 P+GW + V ++ G K++ + + D +R N+Q +FD +D+ + Sbjct: 206 PKGWDLRYVGEISESKLGKMLDKKKQSSEI--DQYKYLRNANVQWFRFDLSDVFEMEFNE 263 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KLIFSGFIA 123 +++ ++ D+++ G++A E C F +R + I + Sbjct: 264 KDRKNCELKFGDVLVCEGGEP----GRAAIWKNDLENCFFQKALHRVRLDMTQILPEYFV 319 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 A I ++ + IPIPPL+ Q+ +K++ ++ K Sbjct: 320 WLFWFYSKNGGFDDHITVATIAHLTGVKMKAMQIPIPPLSLQEDFQQKVNE----IEVLK 375 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL 209 E ++ + ++ A NG L Sbjct: 376 TTLENSSKLFESLFSSLQNQAFNGTL 401 >UniRef50_Q3J746 Restriction modification system DNA specificity domain n=3 Tax=Proteobacteria RepID=Q3J746_NITOC Length = 425 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 71/437 (16%), Positives = 167/437 (38%), Gaps = 30/437 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLI--RANNIQNGKFDTTDLVFVP 63 +PEGW + P+ + + + + +P+ ++ T+ + F+ Sbjct: 5 VPEGWEVKPLGKLVDV------RSSNIDKKTETSEIPVRLCNYTDVYYNNRITSAIDFMA 58 Query: 64 KNLVKES---QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEK-LIF 118 + + + D++I S + + ++ G +L+P++ Sbjct: 59 ASAKQREIDRFSLEKGDVIITKDSETPDDIAVPSYVSDDLSGVVCGYHLTLLKPDQDESD 118 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F++H + ++ L+ G + + + + PPL EQ+ IA L ++ Sbjct: 119 GEFLSHLFQLPSVQHYFYILANGITRFGLTADAINEAPLLTPPLPEQQKIAAILSSVDDV 178 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 ++ T+A+ ++ + Q +L + G K + + + + G Sbjct: 179 IEKTRAQIHKLKDLKTAMMQELLTKGI-GHTEFKDSPVGRIPVGWSICSAGEVAVAIMVG 237 Query: 239 LSSKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNG 295 + KP + G P LR ++VR + +++++ +E L + +L GDLL R Sbjct: 238 VVVKPAQYYVESGVPALRSANVRENGLTMDNLKYFSEDSNEILKKSRLIKGDLLTVRTGY 297 Query: 296 
SLEFVGVCGLL-KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 G ++ + + N + ++ R + ++ ++ +S + ++ + Sbjct: 298 P----GTTAVVTDEFEGCNCI---DVVITRPSSRIDSDFFCLWVNSDHGKGQVLK-AQGG 349 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 Q+ + D+K+ V++P + EQ I V + EK++ L L Q +L Sbjct: 350 LAQQHFNVSDMKNLTVVVPSLTEQKAIFNAVNSVTKKIALTEKRLTLLLDTKKALMQDLL 409 Query: 415 AKAFRGELTAQWRAENP 431 G++ E P Sbjct: 410 ----TGKVRVNVEQEEP 422 Score = 126 bits (316), Expect = 2e-27, Method: Composition-based stats. Identities = 40/213 (18%), Positives = 84/213 (39%), Gaps = 10/213 (4%) Query: 3 AGKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++P GW I V ++ GV K Q Y + +P +R+ N++ +L + Sbjct: 215 VGRIPVGWSICSAGEVAVAIMVGVVVKPAQ---YYVESGVPALRSANVRENGLTMDNLKY 271 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 ++ + + ++ D++ + G +A FE + RP I S Sbjct: 272 FSEDSNEILKKSRLIKGDLLTVRTGYP----GTTAVVTDEFEGCNCIDVVITRPSSRIDS 327 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F + S + ++ G + + + + +P L EQK I ++++ ++ Sbjct: 328 DFFCLWVNSDHGKGQVLKAQGGLAQQHFNVSDMKNLTVVVPSLTEQKAIFNAVNSVTKKI 387 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 T+ R + K Q +L G V + ++ Sbjct: 388 ALTEKRLTLLLDTKKALMQDLLTGKVRVNVEQE 420 >UniRef50_B5FA22 Restriction modification system DNA specificity domain protein n=1 Tax=Vibrio fischeri MJ11 RepID=B5FA22_VIBFM Length = 376 Score = 196 bits (497), Expect = 2e-48, Method: Composition-based stats. 
Identities = 86/417 (20%), Positives = 165/417 (39%), Gaps = 46/417 (11%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 WV + V L G K + P AN I+ +D F + Sbjct: 2 SWVEKSLDEVLKLEYGKPLDKSLRK---EGGKYPAYGANGIKA----WSDEYFHDEE--- 51 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 ++ GS + + + P + ++ V + F+ + S Sbjct: 52 ---------TIVVGRKGSAGELTLTDGKFWPLDVTY----FVKTNKNDYDIKFLYYLLLS 98 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + + + G N NN+ I P + QK +A +LD ++ + E+ Sbjct: 99 LDLPSLATGVKPGINRNNVY-----KIQAKFPSYSTQKQVAGQLDKAFDGIEQARTNTEK 153 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 Q + + L + +KK + T+ G SSK ++ G Sbjct: 154 NLQNARELFDSYLQQVFS-----------ECGEGWKKTTLNELCTKFEYGTSSKSSQEGE 202 Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 P++R+ +++ G + + + + +E + +++L D+LF R N S E VG + K Sbjct: 203 -VPVIRMGNIQDGRIVMDKLVY-SLNEEDNQKYRLNFNDVLFNRTN-SAELVGKTAIYKS 259 Query: 309 LQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + ++ LIR + L +Y+ + +SP AR + ++ Q ISG +K+ Sbjct: 260 EER--AIFAGYLIRIHRNEKLLNADYLNFYLNSPIARKYGEQVMSQSTNQANISGTKLKT 317 Query: 368 QVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 + +P ++EQ IV ++ L + +E + L ++ L QS+L +AF G+LT Sbjct: 318 YPISIPVSLEEQQSIVDKISTLKEKVEELEATHKSKLTALDELKQSLLQQAFTGQLT 374 Score = 144 bits (363), Expect = 7e-33, Method: Composition-based stats. 
Identities = 51/208 (24%), Positives = 92/208 (44%), Gaps = 9/208 (4%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 EGW ++ + T T K ++ +P+IR NIQ+G+ LV+ Sbjct: 175 EGWKKTTLNELCTKFEYGTSSKSS-----QEGEVPVIRMGNIQDGRIVMDKLVYSLNEED 229 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLIFSGFIAHFT 126 + +++ D++ ++ S +VGK+A F + + R EKL+ + ++ + Sbjct: 230 NQKYRLNFNDVLFNRTN-SAELVGKTAIYKSEERAIFAGYLIRIHRNEKLLNADYLNFYL 288 Query: 127 KSSLYRNKISS-LSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 S + R +S N NI I IP L EQ+ I +K+ TL +V+ +A Sbjct: 289 NSPIARKYGEQVMSQSTNQANISGTKLKTYPISIPVSLEEQQSIVDKISTLKEKVEELEA 348 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEK 212 + L +Q++L A G+LT+ Sbjct: 349 THKSKLTALDELKQSLLQQAFTGQLTQG 376 >UniRef50_Q1GLF5 Type I restriction-modification system; S subunit n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GLF5_SILST Length = 387 Score = 196 bits (497), Expect = 2e-48, Method: Composition-based stats. Identities = 77/412 (18%), Positives = 155/412 (37%), Gaps = 30/412 (7%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 + + + G T K+ + D +P + ++ +T + + + Sbjct: 4 VALGELVEIRGGGTPDKK--VPDYWDGDIPWASVKDFKSTSLASTIDRITQAGVANSATQ 61 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 + P +I ++ VGK+A + + L P + I ++ H ++ Sbjct: 62 VIPAGNIIV---PTRMAVGKAAINEIDL--AINQDLKALIPSQRIDRQYLLHALLANA-- 114 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 + + GA + IK + + IP+PPL EQ+ IA LD A +++ + Sbjct: 115 KTLEDQATGATVKGIKLDALRSLQIPLPPLQEQRRIAGILDQADALRRFRTRALDKLGTL 174 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI 252 + + G + W V + G+ G PI Sbjct: 175 GQAIFHEMFGA--SSPDHAAWEKINLSELVLPDDR-------INYGVVQPGPHDPEGVPI 225 Query: 253 LRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 +R++ + + V + I+ + S ++E R +L+ G++L G + +G ++ + Sbjct: 226 IRVADLASPVVAFDSIKRIAPSIDAEYGRSRLKGGEVLI----GCVGSIG-TTIIAPPEF 280 Query: 312 
QNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 + R L T P ++ S +N V+ Q ++ K I+ + Sbjct: 281 AGANVARAVARVPLDTSRCEPRFVAEQLRSQRIQNYFTKEVRL-VAQPTLNIKQIRETEI 339 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +LPP + Q V RV + + + Q AL + L S+ + AFRGE+ Sbjct: 340 ILPPKELQVSFVERVHE----IEAQKAQHAAALTACDVLFASLQSTAFRGEV 387 Score = 66.3 bits (160), Expect = 2e-09, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 67/216 (31%), Gaps = 29/216 (13%) Query: 7 PEG--WVIAPVSTVTTLIRGVTYKKEQAINYL-------KDDYLPLIRANNIQNGKFDTT 57 P+ W +S + + INY + +P+IR ++ + Sbjct: 188 PDHAAWEKINLSELVL--------PDDRINYGVVQPGPHDPEGVPIIRVADLASPVVAFD 239 Query: 58 DLVFVPKNLVKES--QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-- 113 + + ++ E ++ +++I G +G + F + A P Sbjct: 240 SIKRIAPSIDAEYGRSRLKGGEVLI----GCVGSIGTTIIAPPEFAGANVARAVARVPLD 295 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 F+A +S +N + + I +PP Q E++ Sbjct: 296 TSRCEPRFVAEQLRSQRIQNYFTKEVRLVAQPTLNIKQIRETEIILPPKELQVSFVERVH 355 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + AQ KA+ ++ A G++ Sbjct: 356 EIEAQ----KAQHAAALTACDVLFASLQSTAFRGEV 387 >UniRef50_Q307D8 Type I RM system S subunit n=1 Tax=Arthrospira platensis RepID=Q307D8_SPIPL Length = 392 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. 
Identities = 77/422 (18%), Positives = 149/422 (35%), Gaps = 44/422 (10%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W+ A + V G K+Q R + K ++ + Sbjct: 3 WLQAKLKYVAHFAYGDALPKDQE------------REGDF---KVFGSNGAYDNYGRANT 47 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 + +I GS V S H + +F + ++ + ++ Sbjct: 48 QAPV-----IIVGRKGSYGKVNWSDHPCFASDTTF----FIDATTTHHHLRWLFYLLQTL 98 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 + + A + + + IPPL EQK IA LD A++D +++ Sbjct: 99 N----LDQGTDEAAVPGLSRDDAYAKKVFIPPLGEQKAIAHYLDIETAKIDQLIKAKKRL 154 Query: 190 PQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGLS 240 +L R+A++ AV L +W P+H L IL + G+S Sbjct: 155 LALLDEKRRALITHAVTRGLNPDVPMRDSGVEWIGEIPKHWEI--LPLRRILQTMDYGIS 212 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 G +LR+ V G + +++ F++ + +L L+ DLLF R N SL+ + Sbjct: 213 ESVGSEGN-IAVLRMGDVDEGEISYDNVGFVDDVDHDL---ILKANDLLFNRTN-SLDKI 267 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + + + L+R R +PEY+ +S + GQ + Sbjct: 268 GKVAIFRNNFLFPVSFASYLVRMRCNDSVIPEYLNYLLNSLPVLTWAKSNALPAIGQVNL 327 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + +PP++EQ I ++ + + + S++ A G Sbjct: 328 NPNRYSYIKIPIPPIEEQLNITEYIQTNTKKIKKLCLSSEETIKLLQERRTSLITAAVTG 387 Query: 421 EL 422 ++ Sbjct: 388 QI 389 Score = 129 bits (325), Expect = 2e-28, Method: Composition-based stats. 
Identities = 41/210 (19%), Positives = 89/210 (42%), Gaps = 11/210 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+ W I P+ + + Y +++ + + ++R ++ G+ ++ FV Sbjct: 188 IGEIPKHWEILPLRRILQT---MDYGISESVG--SEGNIAVLRMGDVDEGEISYDNVGFV 242 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAH--QHLPFECSFGAFCGVLRPEKLIFSG 120 V + D++ ++ S +GK A + F SF ++ +R + Sbjct: 243 DD--VDHDLILKANDLLFNRTN-SLDKIGKVAIFRNNFLFPVSFASYLVRMRCNDSVIPE 299 Query: 121 FIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + S S + N+ P + I IPIPP+ EQ I E + T ++ Sbjct: 300 YLNYLLNSLPVLTWAKSNALPAIGQVNLNPNRYSYIKIPIPPIEEQLNITEYIQTNTKKI 359 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 E+ ++L+ R +++ AV G++ Sbjct: 360 KKLCLSSEETIKLLQERRTSLITAAVTGQI 389 Score = 61.3 bits (147), Expect = 8e-08, Method: Composition-based stats. Identities = 23/143 (16%), Positives = 49/143 (34%), Gaps = 12/143 (8%) Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G G + H T ++ + + ++ + G+ Sbjct: 58 GSYGKVNWSDHPCFASDTTFFIDATTTHHHLRWLFYLLQTLN-----LDQGTDEAAVPGL 112 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 S D ++ V +PP+ EQ I ++ A D + K LA ++ ++++ A Sbjct: 113 SRDDAYAKKVFIPPLGEQKAIAHYLDIETAKIDQLIKAKKRLLALLDEKRRALITHAVT- 171 Query: 421 ELTAQWRAENPDLISGENSAAAL 443 R NPD+ ++ + Sbjct: 172 ------RGLNPDVPMRDSGVEWI 188 >UniRef50_C6DAR8 Restriction modification system DNA specificity domain protein n=1 Tax=Pectobacterium carotovorum subsp. carotovorum PC1 RepID=C6DAR8_PECCP Length = 390 Score = 195 bits (496), Expect = 3e-48, Method: Composition-based stats. 
Identities = 76/411 (18%), Positives = 168/411 (40%), Gaps = 34/411 (8%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W ++ + I ++ I+ K ++ + NI + + + ++ K+ + Sbjct: 2 SWPTYKLTDLCNKITDGSHNPPPGISESK---FLMLSSKNIFDDDINFHNPRYLTKDDFE 58 Query: 69 ESQK---ISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRPE-KLIFSGFIA 123 + +S D+++ + VG++A + + VL+P+ +I S F+ Sbjct: 59 RENRRTDVSSGDVLLTI----VGTVGRAAVVPDGSPKFTLQRSVAVLKPKHGIITSRFLM 114 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + +S L + + + + G I ++I +P + QK I LD + + Sbjct: 115 YTLRSML--DVLLAGARGVAQQGIYLKQLHDLDIKVPSVEIQKHIVNVLDKASSLCRKRE 172 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + + L+ + G N + F +++ GLSSK Sbjct: 173 QGIKLADEFLRATFSNMFG------------NPDNNIKNFPIGTIRDLVSSASYGLSSKT 220 Query: 244 NESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 ++ +P+LR+ ++ G D D+++++ E + L+ GDLLF R N S E VG Sbjct: 221 SKHSGKYPVLRMGNITYQGDWDLIDLKYIDLDEKAQEKFLLEKGDLLFNRTN-SKELVGK 279 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 + + +++ + LIR R + YI + +S +N ++N K+ G I+ Sbjct: 280 TAIFEN--DRDMAFAGYLIRVRTNEIGNNYYIAGYLNSLHGKNTLINMSKSIVGMANINA 337 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 +++++ +L+PP + Q E+++ K + L ++ Sbjct: 338 QEMQNIKILIPPKELQ----DNYEKIYKTVKNKIKIHIESKKESEMLFNNL 384 Score = 100 bits (248), Expect = 2e-19, Method: Composition-based stats. 
Identities = 37/192 (19%), Positives = 76/192 (39%), Gaps = 5/192 (2%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVFVP-KNLVKESQ 71 P+ T+ L+ +Y + Y P++R NI G +D DL ++ +E Sbjct: 201 PIGTIRDLVSSASYGLSSKTSKHSGKY-PVLRMGNITYQGDWDLIDLKYIDLDEKAQEKF 259 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 + D++ ++ SK +VGK+A + +F + +R ++ + +IA + S Sbjct: 260 LLEKGDLLFNRTN-SKELVGKTAIFENDRDMAFAGYLIRVRTNEIGNNYYIAGYLNSLHG 318 Query: 132 RNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 +N + ++S + NI I I IPP Q + T+ ++ ++ Sbjct: 319 KNTLINMSKSIVGMANINAQEMQNIKILIPPKELQDNYEKIYKTVKNKIKIHIESKKESE 378 Query: 191 QILKRFRQAVLG 202 + Sbjct: 379 MLFNNLSDGFFN 390 >UniRef50_Q8KLM8 Restriction-modification enzyme type I S subunit n=2 Tax=Streptococcaceae RepID=Q8KLM8_STRTR Length = 407 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. Identities = 74/418 (17%), Positives = 162/418 (38%), Gaps = 33/418 (7%) Query: 8 EGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 + W + ++ I G+ + + +D +P I+ +I+ + +T +L ++ K Sbjct: 15 DDWEQRKLGELSQKISVGIATSSSKYFSS-QDHGVPFIKNQDIKENRINTKNLEYISKEF 73 Query: 67 --VKESQKISPEDIVIAMSS--GSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSGF 121 +++++ DI+ A + G +VV K F + RP ++I S + Sbjct: 74 DNKNKNKRVKQGDIITARTGYPGLSAVVPKELEGAQTFTTL------ITRPISEMILSEY 127 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I+ F S +IS + AG N+ + IP+P L EQK I+ + L + Sbjct: 128 ISIFINSPYGMKQISGMEAGGAQKNVNAGIVQNLLIPLPSLDEQKKISNFILKLDDTIAL 187 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + + + + + K + Q + ++ F V K + ++++ +G Sbjct: 188 HQRKLDLLKEQKKGYLQKMFPKNGAKVPELRFAGFADDWEVRKL----NEVSDIYDGTHQ 243 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN-RHKLQDGDLLFTRYNGSLEFV 300 P G L + +++ +F+ E + + Q GD+L TR + Sbjct: 244 TPKYQDNGVMFLSVENIK----TLTSNKFISREAFEDEFKIRPQRGDVLMTRIG----DI 295 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G +++ + L + 
+++ P +++ +P ++ + + K I Sbjct: 296 GTANVVETDEDLAYYVSLALFK---SEELNPYFLQASIYAPFVQDQIWKRTLHIAFPKKI 352 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + +I + +P + EQ +I +QL D L + + L K F Sbjct: 353 NKNEIGQVPINVPTLAEQTKIGSFFKQL----DKTIALHQRKLDLLKEQKKGFLQKMF 406 >UniRef50_C6WNJ9 Restriction modification system DNA specificity domain protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WNJ9_ACTMD Length = 442 Score = 195 bits (495), Expect = 3e-48, Method: Composition-based stats. Identities = 86/441 (19%), Positives = 160/441 (36%), Gaps = 37/441 (8%) Query: 6 LP--EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +P + W +P+ +T+++ + A Y+ + + +I Q G D + F Sbjct: 7 IPISDTWTTSPLKRITSVLN-----RGSAPEYVDESPVRVISQAANQYGGLDWSRTRFHN 61 Query: 64 KNLVKESQK--ISPEDIVIAMSS-GSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFS 119 N K + DI+I + G+ VG C V+R +K + Sbjct: 62 FNGDPTKLKGHLQENDIIINSTGTGTLGRVGYFTEPLNGIPCMADGHVTVVRVKKHKVNP 121 Query: 120 GFIAHFTKSSLYRNKI-SSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ ++ S ++ I SSL+ GA + +IP PP++EQ+ I + L+ A Sbjct: 122 RFVYYWLTSKPFQEYIHSSLAIGATNQIELNRDRLSDTHIPNPPISEQQRIVDFLEAETA 181 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE-------KWRNFEPQHSVFKKLNFES 230 +D ++ + L R A + AV+G W P +L+ + Sbjct: 182 HIDRLIETQNRVLEKLAERRMAGITQAVSGTDQTGTRPSSLTWLEKIPSTWKEVRLSLIA 241 Query: 231 ILTELRNGLSSKPNES-GVGHPILRISSVRAGHVD-QNDIRFLECSESELN-----RHKL 283 + S P P + VR D D+ SEL Sbjct: 242 RMGSGHTPSRSHPEWWVDCTIPWITTGEVRQVRNDRLEDLHETREKISELGLANSAAELR 301 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 G ++ R G ++ + + RL P Y+ + Sbjct: 302 PAGTVVLCRT----ASAGYSAVMGTDMATSQDFVTWTCGPRLN----PYYLLWCLR--AM 351 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 R ++ + S K I D++ + LPP+ EQ +IV+++ + A D + V + Sbjct: 352 RPDLLGRLAMGSTHKTIYVPDLQMLRIPLPPIGEQQKIVQQIREQNARIDRLADAVRLQV 411 Query: 404 ARVNNLTQSILAKAFRGELTA 424 A + Q+++ A G++ Sbjct: 412 ALLAERRQALITAAVTGQIDV 432 Score = 108 bits (269), Expect = 
5e-22, Method: Composition-based stats. Identities = 37/212 (17%), Positives = 80/212 (37%), Gaps = 13/212 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGKFDT---TD 58 K+P W +S + + G T + + D +P I ++N + + T Sbjct: 227 KIPSTWKEVRLSLIARMGSGHTPSRS-HPEWWVDCTIPWITTGEVRQVRNDRLEDLHETR 285 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 L + ++ P V+ + S G SA + + + + Sbjct: 286 EKISELGLANSAAELRPAGTVVLCRTASA---GYSAV--MGTDMATSQDFVTWTCGPRLN 340 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ ++ + + L+ G+ I ++ IP+PP+ EQ+ I +++ A+ Sbjct: 341 PYYLLWCLRAMRP-DLLGRLAMGSTHKTIYVPDLQMLRIPLPPIGEQQKIVQQIREQNAR 399 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +D +L RQA++ AV G++ Sbjct: 400 IDRLADAVRLQVALLAERRQALITAAVTGQID 431 >UniRef50_D0J4L5 Putative uncharacterized protein n=1 Tax=Comamonas testosteroni CNB-2 RepID=D0J4L5_COMTE Length = 429 Score = 195 bits (495), Expect = 4e-48, Method: Composition-based stats. Identities = 77/431 (17%), Positives = 168/431 (38%), Gaps = 31/431 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P W + P+ VT+L ++ ++ + +++ + D + + Sbjct: 16 GNVPSHWDVQPLRAVTSLKSDKNRPDLPVLSVYREYGVI------LKDSRDDNHNATSLD 69 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + K + P D+V+ + +G S+H + R ++ Sbjct: 70 TSTYK---VVKPGDLVVNKMKAWQGSMGVSSHHGIVSPAYITCTTKADRAR----PAYLH 122 Query: 124 HFTKSSLYRNKISSLSAGANINN--IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +SS +SLS G + + F I IP+PP EQ I LD A++D+ Sbjct: 123 YLLRSSPLIGVYNSLSYGVRVGQWDMHYEDFKQIPIPLPPNDEQDRIVAFLDQKTAEIDA 182 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN-------FEPQHSVFKKLNFESILTE 234 + E++ +LK + ++ AV L E + ++ + + +L Sbjct: 183 AIEKKERLASLLKEQQFKLINLAVTKGLDPNAAMTCGRSPWIESYPAHWQLMRIKHVLRA 242 Query: 235 LRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQ-DGDLLFT 291 + + P G ++R S+V+ G + + ++ E + R + GD+LFT Sbjct: 243 IVDTEHKTPPMYEEGPALMVRTSNVKNGELVFKNAKYTDELTYRRWTRRAIPVAGDILFT 302 Query: 292 
RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 R G +L + +++ ++ + L + + A A + + Sbjct: 303 REAP----AGEACVLPDGIKAAI--GQRMVLFKVDPERLDPHFAVHSIYSGAAKAFIELL 356 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 S + DI + +LLPP++EQ +I ++ + + N + ++ L + Sbjct: 357 SVGSTVAHFNMSDIGNIPLLLPPLQEQQKIAVGIKSIQRQFQPLIDSAANGIEQLQELKR 416 Query: 412 SILAKAFRGEL 422 +++A A G++ Sbjct: 417 TLIASAVLGQI 427 Score = 76.3 bits (186), Expect = 2e-12, Method: Composition-based stats. Identities = 44/244 (18%), Positives = 75/244 (30%), Gaps = 22/244 (9%) Query: 210 TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR 269 W P H + L S K +++ P+L + R V D R Sbjct: 11 EATWLGNVPSHWDVQPLRAV---------TSLKSDKNRPDLPVLSV--YREYGVILKDSR 59 Query: 270 FLECSESELN---RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT 326 + + L+ ++ GDL+ + +GV H ++ P + Sbjct: 60 DDNHNATSLDTSTYKVVKPGDLVVNKMKAWQGSMGVS------SHHGIVSPAYITCTTKA 113 Query: 327 KDALPEYIEIFFSSPSARNAMMNCVKTT-SGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 A P Y+ S + GQ + +D K + LPP EQ IV + Sbjct: 114 DRARPAYLHYLLRSSPLIGVYNSLSYGVRVGQWDMHYEDFKQIPIPLPPNDEQDRIVAFL 173 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR-AENPDLISGENSAAALL 444 +Q A D ++ + + ++ A L A L Sbjct: 174 DQKTAEIDAAIEKKERLASLLKEQQFKLINLAVTKGLDPNAAMTCGRSPWIESYPAHWQL 233 Query: 445 EKIK 448 +IK Sbjct: 234 MRIK 237 >UniRef50_A9A374 Restriction modification system DNA specificity domain n=1 Tax=Nitrosopumilus maritimus SCM1 RepID=A9A374_NITMS Length = 438 Score = 194 bits (494), Expect = 5e-48, Method: Composition-based stats. 
Identities = 72/432 (16%), Positives = 160/432 (37%), Gaps = 23/432 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYK-KEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 ++PE W I + + T I+ Y + D +P IR I D + ++ Sbjct: 18 EIPETWKICNLGDLLTKIQDGNYGESYPKESEFLDSGIPFIRGTEITKNFIDGKKVKYIS 77 Query: 64 KNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSG 120 K E QK I D++ G V ++ + + G +LR K+I + Sbjct: 78 KTKHDELQKAHIETGDVLFLNRGGITRTVAIVPPKY--DDANIGPQLTLLRCNTKIIHNK 135 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ +F + ++ ++ S AG + I +P + EQ+ I L+++ + Sbjct: 136 YLYYFIQGENFKKQVISSDAGTALQFFGIEKTKKFKITLPEIREQQKIVSVLNSIDNLLS 195 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR---NFEPQHSVFKKLNFESILTELRN 237 S + ++ K Q +L ++ K +K E + ++ L +L++ Sbjct: 196 SYDKTIQTTQKLKKGLMQKLLTKGIDHKKFKKVPWLFGKEIEIPEEWEIKKIEDLFKLKS 255 Query: 238 GLSSK---PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 G + P P + + + + + + + N L G L Y Sbjct: 256 GSTPSRKIPEYFAGNIPWITSTDLNRSKITSTLEKITPEAVKQTNLKLLPKGTFLIATYG 315 Query: 295 -GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 + G CG+ K N + + + E++ F+ ++ + Sbjct: 316 LEAAGTRGKCGITKMESTCN----QACMAFLPSSEITSEFLFYFYL--YFGEKIIFSIAQ 369 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + Q+ + +K + +PP KEQ IV ++Q D+ ++ + ++ + + + Sbjct: 370 GTKQQNLYSDTLKKVSMFVPPQKEQKRIVNFLDQ----IDSHLFELESKKTGLDKIKKGL 425 Query: 414 LAKAFRGELTAQ 425 + K ++ + Sbjct: 426 IQKLLTSKIRVK 437 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. 
Identities = 38/199 (19%), Positives = 77/199 (38%), Gaps = 7/199 (3%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 + + L G T ++ I +P I + ++ K +T P+ + + + K Sbjct: 245 KKIEDLFKLKSGSTPSRK--IPEYFAGNIPWITSTDLNRSKITSTLEKITPEAVKQTNLK 302 Query: 73 ISPEDIVIAMSSG--SKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 + P+ + + G + GK + C C P I S F+ +F Sbjct: 303 LLPKGTFLIATYGLEAAGTRGKCGITKMESTC--NQACMAFLPSSEITSEFLFYFYLY-F 359 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 I S++ G N+ + +++ +PP EQK I LD + + + +++ + Sbjct: 360 GEKIIFSIAQGTKQQNLYSDTLKKVSMFVPPQKEQKRIVNFLDQIDSHLFELESKKTGLD 419 Query: 191 QILKRFRQAVLGGAVNGKL 209 +I K Q +L + K Sbjct: 420 KIKKGLIQKLLTSKIRVKF 438 >UniRef50_A3JE98 Type I restriction-modification system, S subunit n=1 Tax=Marinobacter sp. ELB17 RepID=A3JE98_9ALTE Length = 429 Score = 194 bits (493), Expect = 7e-48, Method: Composition-based stats. Identities = 81/436 (18%), Positives = 168/436 (38%), Gaps = 35/436 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT---DLVFV 62 +P W+ A V + G + + A D+ +RA NI D + + Sbjct: 7 VPSHWIKASVGNYCDVQLGKMLQSDPASQ--NDESKRYLRAINITKHGLDLSHDFSMWIK 64 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP--EKLIFSG 120 P+ + E ++ DI+++ G++A E F +RP I Sbjct: 65 PQEM--EKFRLQRGDILVS----EGGDAGRTAVFDCDEEFYFQNAINRIRPAGNSTILPE 118 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FI ++ + + A I + + +PPL Q IA+ LD A++D Sbjct: 119 FIYYWFTFLKVAGYVEMVCNVATIAHFTAEKVKAAPLALPPLKTQHSIAQFLDEKTARID 178 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESI 231 + + L RQA++ A+ L +W P + KKL Sbjct: 179 GLIEKKCALLDRLAEKRQALITRAITKGLDPNAIMKPSGTEWLGHIPANWEVKKLRRVRR 238 Query: 232 LTELRNGLSSKPNES-GVGHPILRISSVRAGHV--DQNDIRFLECS-ESELNRHKLQDGD 287 + +G G LR+++V + D ++ R++ +E R +++GD Sbjct: 239 Y--MTSGSRDWAAYYADEGDRFLRMTNVTGEGIELDLSETRYVNLDGATEGTRTSVREGD 296 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNA 346 +L 
T +G +++K + L R + + ++ F S+ AR Sbjct: 297 ILITITAE----LGAVAVIRKEIEGAYI-NQHLALFRPSPELCESGFLVNFLSTDMARAQ 351 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 M + + Q G+ + + + ++ PP++EQ I ++ ++++E+ + ++ ++ Sbjct: 352 FMLSGQGGTKQ-GLGFEQVNNVIIGFPPLREQELIGNFCSEIRRQSESVEQPLKLSIDKL 410 Query: 407 NNLTQSILAKAFRGEL 422 +++ A G+L Sbjct: 411 IEYRSAVITAAVTGQL 426 Score = 113 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 44/213 (20%), Positives = 81/213 (38%), Gaps = 13/213 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF--DTTDLVF 61 G +P W + + V R +T Y D+ +R N+ D ++ + Sbjct: 222 GHIPANWEVKKLRRV---RRYMTSGSRDWAAYYADEGDRFLRMTNVTGEGIELDLSETRY 278 Query: 62 VPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIF 118 V + E + + DI+I ++ + +G A E + RP + Sbjct: 279 VNLDGATEGTRTSVREGDILITIT----AELGAVAVIRKEIEGAYINQHLALFRPSPELC 334 Query: 119 -SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 SGF+ +F + + R + G + + + I PPL EQ++I + Sbjct: 335 ESGFLVNFLSTDMARAQFMLSGQGGTKQGLGFEQVNNVIIGFPPLREQELIGNFCSEIRR 394 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 Q +S + + L +R AV+ AV G+L Sbjct: 395 QSESVEQPLKLSIDKLIEYRSAVITAAVTGQLE 427 Score = 100 bits (250), Expect = 9e-20, Method: Composition-based stats. 
Identities = 35/206 (16%), Positives = 80/206 (38%), Gaps = 16/206 (7%) Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 S +++ LR ++ +D + + E+ + +LQ GD+L + Sbjct: 30 SDPASQNDESKRYLRAINITKHGLDLSHDFSMWIKPQEMEKFRLQRGDILVSEGG----D 85 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARL--TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 G + + + + + R R LPE+I +F+ + V + Sbjct: 86 AGRTAVFDCDEE--FYFQNAINRIRPAGNSTILPEFIYYWFTFLKVAGYV-EMVCNVATI 142 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 + + +K+ + LPP+K Q I + +++ A D + ++ L R+ Q+++ +A Sbjct: 143 AHFTAEKVKAAPLALPPLKTQHSIAQFLDEKTARIDGLIEKKCALLDRLAEKRQALITRA 202 Query: 418 FRGELTAQWRAENPDLISGENSAAAL 443 + +P+ I + L Sbjct: 203 IT-------KGLDPNAIMKPSGTEWL 221 >UniRef50_B4SA10 Restriction modification system DNA specificity domain n=1 Tax=Pelodictyon phaeoclathratiforme BU-1 RepID=B4SA10_PELPB Length = 392 Score = 193 bits (491), Expect = 1e-47, Method: Composition-based stats. Identities = 81/415 (19%), Positives = 163/415 (39%), Gaps = 38/415 (9%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKES 70 + VTT+I G + + + D LP + KF + + Sbjct: 2 KTVELQQVTTIIAGQSPESSTYNSIA--DGLPFFQGKADFQDKFPKVRIWCNSAKRKEAD 59 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 P DI++++ + VG + +C G +RP+ + + F+ ++ K + Sbjct: 60 ----PGDILMSVR----APVGSVNICN--QKCIIGRGLSAIRPDANLNNYFLYYYLKCN- 108 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 ++SL G+ I + +++P+PPL +Q A L + + + + +Q+ Sbjct: 109 -EKNVASLGTGSTFQAITQTTLKRLDVPLPPLDDQIRSATLLSKVENLIFRRREQLKQLD 167 Query: 191 QILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG 249 ++LK + G V ++ E R E S K+ + +T Sbjct: 168 ELLKSVFLEMFGDPVRNEMGWEMKRMDEISDSRLGKMRDKKFITGNHL------------ 215 Query: 250 HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 + S+V+ +D+ ++ E E L DGDLL +G C + + Sbjct: 216 RKYIGNSNVQWFRFKLDDLEEMDFDERERVLFALMDGDLLICEGG----DIGRCAIWRSN 271 Query: 310 
QHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQ 368 + + + R RL K A+PEY++ S N N + ++G+ +K Sbjct: 272 LSE-CYFQKAIHRVRLHKSQAIPEYLQYVMLFFSLYNGFKNVTCK-ATISHLTGEKLKET 329 Query: 369 VVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++ LP ++ Q V+++ + I+ ++ + +L + KAF+GEL Sbjct: 330 LIPLPSLELQNRFSTIVKKV----EKIKITYTHSFINLESLYGILSQKAFKGELD 380 >UniRef50_Q31PC5 Type I restriction-modification n=2 Tax=Synechococcus elongatus RepID=Q31PC5_SYNE7 Length = 453 Score = 193 bits (490), Expect = 1e-47, Method: Composition-based stats. Identities = 73/440 (16%), Positives = 174/440 (39%), Gaps = 31/440 (7%) Query: 5 KLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLP-LIRANNIQNGKFDTTDLV-F 61 KLP W + + + + GV+ A+++ D+ +P +++ + + G F + Sbjct: 19 KLPSHWNVLQLRRLIPEIESGVSV---NALDHAPDEGIPSVLKTSCVYTGSFRPEERKEI 75 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIFSG 120 + +++ + + + ++++ + + +VG + + ++ F +R ++ Sbjct: 76 IQEDIDRAACPVKSGRLIVSRMN-TPDLVGAAGLSLVDYDYVFLPDRLWQVRISN-VYPN 133 Query: 121 FIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F ++T++ +YR+++ + +G + + N+ +F +P+P EQ IA LD A+ Sbjct: 134 FAYYWTQTQIYRDQVKMVCSGTSSSMQNLSQDNFLSFILPVPSDEEQIAIASFLDRETAK 193 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D+ A +++ +L+ RQAV+ AV L +W P H K+ Sbjct: 194 IDALIAEQQRLIALLQEKRQAVISHAVTKGLNPDAPLKDSGIEWLGQVPAHWKTGKIKHY 253 Query: 230 SILTELRNGLSSKP----NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 + + + +S G P +R + + V ++ + + L Sbjct: 254 FKTSSGGTPNTEEQALYYADSDSGIPWVRTTDIENQEVRSAEVSITNQAIQDTACEILPV 313 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 +L Y G + L + A+P + + R Sbjct: 314 DTVLVALYGGGGTV-----GKNGILTFPAAINQALCALLPSYYAVPMFTFRYIQF--LRP 366 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 M + IS + ++ V LPP+ EQ IV+ + ++E + +L+ Sbjct: 367 FWMERAVSARKAGNISQELVRDTVFALPPLDEQILIVKHIHSQLEEITSLENESTKSLSL 426 Query: 406 VNNLTQSILAKAFRGELTAQ 425 + ++++ A G++ + Sbjct: 427 
LQERRSALISAAVTGQIDVR 446 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. Identities = 44/234 (18%), Positives = 98/234 (41%), Gaps = 12/234 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVRAGHVDQNDIRF 270 +W P H +L E +++ + G P +L+ S V G + + Sbjct: 15 EWLEKLPSHWNVLQLRRLIPEIESGVSVNALDHAPDEGIPSVLKTSCVYTGSFRPEERKE 74 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + + + ++ G L+ +R N + + VG GL + + + PD+L + R++ + Sbjct: 75 IIQEDIDRAACPVKSGRLIVSRMN-TPDLVGAAGL-SLVDYDYVFLPDRLWQVRIS-NVY 131 Query: 331 PEYIEIFFSSPSARNAM-MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 P + + + R+ + M C T+S + +S + S ++ +P +EQ I +++ Sbjct: 132 PNFAYYWTQTQIYRDQVKMVCSGTSSSMQNLSQDNFLSFILPVPSDEEQIAIASFLDRET 191 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + +A + Q++++ A + NPD ++ L Sbjct: 192 AKIDALIAEQQRLIALLQEKRQAVISHAVT-------KGLNPDAPLKDSGIEWL 238 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. Identities = 41/212 (19%), Positives = 79/212 (37%), Gaps = 11/212 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYL--KDDYLPLIRANNIQNGKFDTTDLVF 61 G++P W + G T E+ Y D +P +R +I+N + + ++ Sbjct: 239 GQVPAHWKTGKIKHYFKTSSGGTPNTEEQALYYADSDSGIPWVRTTDIENQEVRSAEVSI 298 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + +I P D V+ G + L F + L P F Sbjct: 299 TNQAIQDTACEILPVDTVLVALYGGGGT--VGKNGILTFPAAINQALCALLPSYYAVPMF 356 Query: 122 IAHF---TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + + +S+ AG NI +PPL EQ +I + + + L + Sbjct: 357 TFRYIQFLRPFWMERAVSARKAG----NISQELVRDTVFALPPLDEQILIVKHIHSQLEE 412 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + S + + +L+ R A++ AV G++ Sbjct: 413 ITSLENESTKSLSLLQERRSALISAAVTGQID 444 >UniRef50_A0KWU0 Restriction modification system DNA specificity domain n=1 Tax=Shewanella sp. ANA-3 RepID=A0KWU0_SHESA Length = 391 Score = 192 bits (487), Expect = 3e-47, Method: Composition-based stats. 
Identities = 84/420 (20%), Positives = 167/420 (39%), Gaps = 38/420 (9%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTDLVFVPKNLVKE 69 + + + + RG + + DD L I + N ++T L P+ L K Sbjct: 1 MVKLGDIFDIARGGSPRPIDDYITDADDGLNWISIKDASNSNKYINSTKLKIKPEGLTK- 59 Query: 70 SQKISPEDIVI--AMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 ++ + P D ++ +MS G ++ + C + + + S + + Sbjct: 60 TRMVYPGDFLLTNSMSFGRPYIMNTTG-------CIHDGWLVLSGNPDKVNSDYFYYLLG 112 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 S + + S L+AGA + N+ + +P+PPLAEQK IA L + D+ + + + Sbjct: 113 SDTLKQRFSGLAAGAVVKNLNTELVKSVEVPLPPLAEQKRIAAIL----DKADAIRRKRQ 168 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES- 246 Q Q+ +AV +T + + S L ++ G + K E Sbjct: 169 QAIQLADDLLRAVFLEMFGDPVTNPKGFQKSKLSA---------LADVITGFAFKSAEYV 219 Query: 247 ---GVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGV 302 + R + G+ + D F + ++ + L+ +KL+ GD++ + Sbjct: 220 EDSDDAVRLCRGVNTLTGYFEWKDTAFWDSNKINGLHNYKLEAGDVILAMDRPWISSGLK 279 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 + + + L ++ R R + P Y + +SS + +C T + IS Sbjct: 280 VCVFPENERDTYLVQ-RVARIRSKQ---PRYTDYLYSSILSPAFEKHCCPTETTVPHISP 335 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++K+ +L+P K ++ V +L D +E + A N + S+ KAF G+L Sbjct: 336 VELKNFEILVPDEKSVSKYHDIVSKLRRSKDRMEMNLTEA----NQIFNSLSQKAFSGQL 391 Score = 88.6 bits (218), Expect = 4e-16, Method: Composition-based stats. 
Identities = 35/206 (16%), Positives = 76/206 (36%), Gaps = 10/206 (4%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+G+ + +S + +I G +K + + DD + L R N G F+ D F N Sbjct: 193 PKGFQKSKLSALADVITGFAFKSAEYVED-SDDAVRLCRGVNTLTGYFEWKDTAFWDSNK 251 Query: 67 VK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIA 123 + + K+ D+++AM S K + +R ++ ++ ++ Sbjct: 252 INGLHNYKLEAGDVILAMDRPWISSGLKVCVFPENERDTYLVQRVARIRSKQPRYTDYLY 311 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 S + + +I P I +P + K +++++ +K Sbjct: 312 SSILSPAFEKHCCPTE--TTVPHISPVELKNFEILVP----DEKSVSKYHDIVSKLRRSK 365 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL 209 R E + ++ A +G+L Sbjct: 366 DRMEMNLTEANQIFNSLSQKAFSGQL 391 >UniRef50_UPI0001973978 type I restriction-modification system, S subunit n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001973978 Length = 435 Score = 191 bits (486), Expect = 3e-47, Method: Composition-based stats. Identities = 70/429 (16%), Positives = 166/429 (38%), Gaps = 38/429 (8%) Query: 13 APVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKNLVK-- 68 + + + I ++ + + D+ +P + A +++NG D ++ + K Sbjct: 18 KKLKYIVSTPITDGPHETPELL----DEGIPFLSAESVKNGILDFNYKRGYISLSDHKLF 73 Query: 69 -ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAHFT 126 + + DI I S + G E S + ++R + + + FI +++ Sbjct: 74 CKKVRPQKNDIFIVKSGATT---GNCGIVTTDEEFSIWSPLALIRCDNISVLQKFIYYYS 130 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 + +++ + NI + + +P EQ+ I + LD AQ+DS A Sbjct: 131 LCYSFTHQVEQSWSYGTQQNIGMGVLGNLYVTLPSSNEQQSIVDYLDKECAQIDSIAADL 190 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRN 237 E+ +L++++++++ V L + +W P+H + + + Sbjct: 191 EKQIALLQQYKKSLITETVTKGLDKSVPMKDSGVEWIGKIPEHWDVEPIKYRVTFHNGDR 250 Query: 238 G--LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLLFTRYN 294 G SK G P + + ++ +++ ++ + + KL+ GD+L+ Sbjct: 251 GENYPSKSELQSEGIPFINAGHLEGDGLNMDNMDYISEEKYRIMGGVKLRPGDILYCLRG 310 Query: 295 
GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS--ARNAMMNCVK 352 VG ++ Q L+ R + L EY+ +S + + + Sbjct: 311 S----VGKNAIVDMNQGT---VASSLVAIR-SVRILAEYLYYCLNSHIEEVQRYLWDN-- 360 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + Q +S ++ +PPV+EQ IV+ + + + D + L+ + +S Sbjct: 361 -GTAQPNLSADNLGKYKFCIPPVEEQKAIVKYLNNICSQIDNLVIGKKKQLSTIQQHKKS 419 Query: 413 ILAKAFRGE 421 ++ + G+ Sbjct: 420 LIYEYVTGK 428 Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 92/208 (44%), Gaps = 8/208 (3%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+PE W + P+ T G + + + L+ + +P I A +++ + ++ ++ Sbjct: 227 IGKIPEHWDVEPIKYRVTFHNGDRGENYPSKSELQSEGIPFINAGHLEGDGLNMDNMDYI 286 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + K+ P DI+ + VGK+A + + + + +R + I + Sbjct: 287 SEEKYRIMGGVKLRPGDILYCLR----GSVGKNAIVDMN-QGTVASSLVAIRSVR-ILAE 340 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ + S + + G N+ + IPP+ EQK I + L+ + +Q+D Sbjct: 341 YLYYCLNSHIEEVQRYLWDNGTAQPNLSADNLGKYKFCIPPVEEQKAIVKYLNNICSQID 400 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGK 208 + ++ +++ +++++ V GK Sbjct: 401 NLVIGKKKQLSTIQQHKKSLIYEYVTGK 428 Score = 111 bits (277), Expect = 7e-23, Method: Composition-based stats. 
Identities = 44/211 (20%), Positives = 84/211 (39%), Gaps = 9/211 (4%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR-F 270 W KKL + + T + +G P G P L SV+ G +D N R + Sbjct: 6 TWEEENGHTFKKKKLKYI-VSTPITDGPHETPELLDEGIPFLSAESVKNGILDFNYKRGY 64 Query: 271 LECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + S+ +L + + Q D+ + + G CG++ + ++ P LIR Sbjct: 65 ISLSDHKLFCKKVRPQKNDIFIVKSGAT---TGNCGIVTTDEEFSIWSPLALIRC-DNIS 120 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 L ++I + S + + + Q+ I + + V LP EQ IV +++ Sbjct: 121 VLQKFIYYYSLCYSFTHQVEQSWSYGT-QQNIGMGVLGNLYVTLPSSNEQQSIVDYLDKE 179 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFR 419 A D+I + +A + +S++ + Sbjct: 180 CAQIDSIAADLEKQIALLQQYKKSLITETVT 210 >UniRef50_C3DG13 Putative uncharacterized protein n=1 Tax=Bacillus thuringiensis serovar sotto str. T04001 RepID=C3DG13_BACTS Length = 409 Score = 191 bits (486), Expect = 4e-47, Method: Composition-based stats. Identities = 77/405 (19%), Positives = 158/405 (39%), Gaps = 29/405 (7%) Query: 38 DDYLPLIRANNIQNGKF-DTTDLVFVPKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAH 95 D +P IR + D+ +V K LVK + K+ P V+ S S Sbjct: 13 DGDIPWIRIEDFNGKYISDSKSRQYVSKELVKGMNLKVFPIGTVLCTCSCSMGATAIVEQ 72 Query: 96 QHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLI 155 + + G + P + + S ++ + ++S R ++ + GA + +F+ + Sbjct: 73 PLISNQTFIG-----IVPGENLDSEYLFYLMQASAERLQL--FAQGAIQQYLSKHNFEHL 125 Query: 156 NIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---- 211 IP+P L QK + L+ L +D +Q+ +L+ RQ ++ AV L Sbjct: 126 KIPLPSLKIQKRLLVFLNRKLKDLDELIENKKQLIDLLEEKRQTLITEAVTRGLNPNVKM 185 Query: 212 -----KWRNFEPQHSVFKKLNFESILTELRNGLSSKPN---ESGVGHPILRISSVRAGHV 263 +W P+H KK+ S L + +G + K G LR +V + Sbjct: 186 KDSGVEWIGEIPEHWTIKKIKHISNL--VGSGKTPKGGSEIYPESGVLFLRSMNVHYDGI 243 Query: 264 DQNDIRFLECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 DI + E R +++ D+L S +G ++ + + + I Sbjct: 244 RLKDIVHITPEIDEDMRSTRVKSKDVLLNITGAS---IGRSCIVPESLGKANVNQHVCII 300 Query: 323 
ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLL-PPVKEQAEI 381 TK +PE + +S ++ + S ++G++ +K+ L ++EQ EI Sbjct: 301 RSNTKVVVPELLSKIMASNFIMQQIL-MSQNGSSREGLNFTQVKNLEFPLTRDLQEQIEI 359 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 + +++ + + ++ QS++ + G++ + Sbjct: 360 ANHISVETNKINSLIGMIEEQIQKLKEYRQSLIYEVVTGKIDVRD 404 Score = 124 bits (310), Expect = 9e-27, Method: Composition-based stats. Identities = 37/213 (17%), Positives = 85/213 (39%), Gaps = 9/213 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++PE W I + ++ L+ G T K + + +R+ N+ D+V Sbjct: 193 IGEIPEHWTIKKIKHISNLVGSGKTPK--GGSEIYPESGVLFLRSMNVHYDGIRLKDIVH 250 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIF 118 + + + S ++ +D+++ ++ S + + ++R K++ Sbjct: 251 ITPEIDEDMRSTRVKSKDVLLNITGASIGR--SCIVPESLGKANVNQHVCIIRSNTKVVV 308 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLA 177 ++ S+ +I G++ + + P+ L EQ IA + Sbjct: 309 PELLSKIMASNFIMQQILMSQNGSSREGLNFTQVKNLEFPLTRDLQEQIEIANHISVETN 368 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +++S E+ Q LK +RQ+++ V GK+ Sbjct: 369 KINSLIGMIEEQIQKLKEYRQSLIYEVVTGKID 401 Score = 80.9 bits (198), Expect = 9e-14, Method: Composition-based stats. 
Identities = 30/204 (14%), Positives = 72/204 (35%), Gaps = 20/204 (9%) Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSE--SELNRHKLQDGDLLFTRYNGSLEF 299 KP + P +RI ++ + R E +N G +L T Sbjct: 8 KPTKFDGDIPWIRIEDFNGKYISDSKSRQYVSKELVKGMNLKVFPIGTVLCT----CSCS 63 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 +G +++ Q L+ I ++ EY+ + + R + + Q+ Sbjct: 64 MGATAIVE----QPLISNQTFIGIVPGENLDSEYLFYLMQASAERLQLF---AQGAIQQY 116 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +S + + + LP +K Q ++ + + D + + + + Q+++ +A Sbjct: 117 LSKHNFEHLKIPLPSLKIQKRLLVFLNRKLKDLDELIENKKQLIDLLEEKRQTLITEAVT 176 Query: 420 GELTAQWRAENPDLISGENSAAAL 443 R NP++ ++ + Sbjct: 177 -------RGLNPNVKMKDSGVEWI 193 >UniRef50_B8EFW7 Restriction modification system DNA specificity domain protein n=1 Tax=Shewanella baltica OS223 RepID=B8EFW7_SHEB2 Length = 406 Score = 191 bits (485), Expect = 4e-47, Method: Composition-based stats. Identities = 78/422 (18%), Positives = 152/422 (36%), Gaps = 28/422 (6%) Query: 6 LPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVP 63 LPEGW + + V + L+ G T ++A Y + ++ + + Sbjct: 8 LPEGWHLETIGEVASKLVTGKTPSTKKA-EYYSSSEVDWFTPSDFGSTAVLNNSRRKLSS 66 Query: 64 KNLVKESQKISPED-IVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + K P+D I++ + VG E F + ++ I + Sbjct: 67 LAIEDGTIKKMPKDSILLVAIGATIGKVG-----LAEDESCFNQQVTGIHFKEKIHPKYA 121 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++ S + +I + S+ A + I ++ P EQK I EKLD LL ++D+ Sbjct: 122 YYWL--SYIKPEIITKSSQATLPIINQTGIKGLSFLYPEKEEQKCIVEKLDALLTRIDTA 179 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 ++ + Q+ L G + + K+L L++ Sbjct: 180 IEHLQESITLKNSLLQSALDGQFSAITERMTIESLAEVKGGKRLPKGEKLSD-------- 231 Query: 243 PNESGVGHPILRISSVRA-GHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFV 300 HP +R++ G +D + I+++ E + R+ + DL + + Sbjct: 232 ---EETEHPYIRVADFTDKGTIDLSGIKYISKEIHEQIKRYVISKDDLYISIAG----TI 284 Query: 301 
GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G G + L + K L F+ S +A T Q + Sbjct: 285 GKTGFVPSELDGANLTENAAKLVIKDKQQLDLSYLYLFTLTSDFSAQAGLATKTVAQPKL 344 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + + + ++EQ +V +E L + E + + + +L SIL AF+G Sbjct: 345 ALTRLSKIEIPICSLEEQKSLVSTIEALKSKIHDAEAVLLGKIEDLKSLKASILDSAFKG 404 Query: 421 EL 422 EL Sbjct: 405 EL 406 >UniRef50_B0A8Q7 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A8Q7_9CLOT Length = 380 Score = 191 bits (484), Expect = 7e-47, Method: Composition-based stats. Identities = 64/412 (15%), Positives = 151/412 (36%), Gaps = 34/412 (8%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKES 70 + + + + G T K++ Y D +P I+ ++++ + + S Sbjct: 2 ELKKLGDIFKITSGGTPSKKK-EEYYLDGDIPWIKTGDLKSKNIYKSSQYITELGVKNSS 60 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 K+ P+D V+ G + +G A L E + C P K + ++ +F K + Sbjct: 61 AKLFPKDTVLIAMYG--ATIG--ATSILKIEAATNQACAAFLPTKDVMPEYLYYFFKYN- 115 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 + KI S G NI IP+ L EQ+ I L+ + K + + Sbjct: 116 -KEKIISKGIGGAQPNISATILKDFKIPLLCLDEQEKIVNILNKAQNTTNKRKEQINLLD 174 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 +++K + G + + K+++ + + ++ K N Sbjct: 175 ELVKSRFIEMFGDPIR----------NIKCWQTKRMDEVAPV------INYKGNFKQNEI 218 Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 +L + V + ++ SE + ++L+++ L V + + Sbjct: 219 WLLNLDMVESNTGKIIAYNYVTASEVGSSTCTFDTTNVLYSKLRPYLNKVVIP------K 272 Query: 311 HQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 + + + Y+ + + + V + + ++ D + V Sbjct: 273 EIGYATSEMMPLQPVKGILDRYYLAYMLRNKVFVDYISEKV-SGAKMPRVTMNDFRDFKV 331 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +PP++ Q + V ++ D ++ ++ +L + + S++ +AF+GEL Sbjct: 332 PIPPIELQNQFANFVIEV----DKLKFEMEKSLKELEDNFNSLMQRAFKGEL 379 Score = 69.3 bits (168), Expect = 
3e-10, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 72/199 (36%), Gaps = 21/199 (10%) Query: 10 WVIAPVSTVTTLIRG-VTYKKEQ----AINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 W + V +I +K+ + ++ ++ + +I N + + ++ F Sbjct: 195 WQTKRMDEVAPVINYKGNFKQNEIWLLNLDMVESNTGKIIAYNYVTASEVGSSTCTFDTT 254 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 N++ + +VI +G + + +P + + ++ ++A+ Sbjct: 255 NVLYSKLRPYLNKVVI------PKEIGYATSEMMPLQPV----------KGILDRYYLAY 298 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 ++ ++ + IS +GA + + F +PIPP+ Q A + + + Sbjct: 299 MLRNKVFVDYISEKVSGAKMPRVTMNDFRDFKVPIPPIELQNQFANFVIEVDKLKFEMEK 358 Query: 185 RFEQIPQILKRFRQAVLGG 203 +++ Q G Sbjct: 359 SLKELEDNFNSLMQRAFKG 377 >UniRef50_C7D880 Restriction modification system DNA specificity domain protein n=1 Tax=Thalassiobium sp. R2A62 RepID=C7D880_9RHOB Length = 380 Score = 191 bits (484), Expect = 7e-47, Method: Composition-based stats. Identities = 84/419 (20%), Positives = 154/419 (36%), Gaps = 44/419 (10%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P AP V + G K I+ D Y+ L++ + + K VFVP Sbjct: 3 PAH--FAPFLEVCDIQGGTQPPKSTFIDEPTDGYVRLLQIQDFKTDK----KAVFVPD-- 54 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHF 125 + +K S DI+IA S + E ++ P+ + + + AHF Sbjct: 55 KQTLKKCSKNDIMIARYGASLGKI------LSGLEGAYNVALVKTIPDLERLDRAYFAHF 108 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 +++ +++ I +L A A + I IP+PPL EQK IA L Q D+ + Sbjct: 109 LRANAFQSFILNLGGRAAQAGFNKADLERIKIPLPPLEEQKRIAGIL----DQADALRRL 164 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 + L QA+ L + ++R+G P Sbjct: 165 RTRALDRLNTLGQAIFHEMFGDPTHN------------FSLATLGEVCDVRDGTHDSPKY 212 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVC 303 G+P+L + G + + + + + R K+ GD++ +G Sbjct: 213 VETGYPLLTSKNFSTGVLSFDGAKSISEEDYFKINKRSKVDLGDIVMPMIG----TIGSP 268 Query: 304 
GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 ++++ + + LI+ + +I+ S ++ QK +S Sbjct: 269 VVIEE-EAAFAIKNVALIKF-VEGSPKASFIQTLL-SGVYLERIVKTQGRGGTQKFVSLG 325 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 D++ LPP ++Q E L + + ++ N + L S+ +AFRGEL Sbjct: 326 DLRKLQFPLPPKEQQEAF----EGLISEIKKQKSKLCNLVTTQETLFASLQHRAFRGEL 380 >UniRef50_A8TH56 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus voltae A3 RepID=A8TH56_METVO Length = 412 Score = 190 bits (483), Expect = 8e-47, Method: Composition-based stats. Identities = 71/424 (16%), Positives = 152/424 (35%), Gaps = 24/424 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVF 61 G +P W + + V + I + + Y + I NN++NG T D Sbjct: 11 IGLIPNDWEVKKLGDVCSFIGDGIHSTPK---YCTNGKYYFINGNNLKNGTIVHTNDTKL 67 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + ++ ED ++ +G+ +G ++ + + G + + F Sbjct: 68 ISFEEFNKLKQKIAEDALLLSINGT---IGNCSYYN-NEKILLGKSVAYINLKNKNIKNF 123 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I + +S ++ S G+ I N+ S + IP+PPL EQ+ IAE L +++ Sbjct: 124 IYYVIQSPRTVSQFYSELTGSTIKNLSLKSLRNLCIPLPPLKEQQKIAEILTKWDNHIET 183 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + + + K Q +L G V + F + K L NGLS Sbjct: 184 LENLISKKEEYKKGLMQNLLTGKVR------FPGFNEEWKEVKLGEICKFLKG--NGLSK 235 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 + + + + I+ + + + GD+L + Sbjct: 236 EKLNKNGKFKCILYGELYTTY--SEVIKEVLSKTDFKEKIHSEKGDILIPASTTTTGIDL 293 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 ++ L ++R + E++ + + + + + + Sbjct: 294 ANATAINEENVILGGDINILRKKYENKYNNEFLAYYLT--YGKKYELAKYAQGTTIVHLY 351 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 GKDIK+ + LP ++EQ +I ++ + D + + L + + ++ K GE Sbjct: 352 GKDIKNMKIQLPTLEEQEQIA----EVLSLQDKEIEILKEKLELLKMQKKGLMQKLLTGE 407 Query: 422 LTAQ 425 + + Sbjct: 408 IRVK 411 
>UniRef50_C4ZFR7 Type I restriction-modification system specificity subunit n=3 Tax=Firmicutes RepID=C4ZFR7_EUBR3 Length = 412 Score = 190 bits (483), Expect = 9e-47, Method: Composition-based stats. Identities = 80/416 (19%), Positives = 155/416 (37%), Gaps = 22/416 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W ++ V I V + Y + +P+ R N+ + DL +V Sbjct: 13 KDWEQRKLNEVAEKIC-VGFVGTCEKFYTDESGIPMYRTGNLNGLSLNRDDLKYVTNEFH 71 Query: 68 KESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAH 124 + +QK + DI+IA S GK+ + E + ++RP+ K F+ + Sbjct: 72 QHNQKSQLKAGDILIARHGDS----GKAVNYENSEEANCLNIV-IIRPDFKKCNYKFLTN 126 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTK 183 S + I SLSAG+ I + + + + IP + EQ IA TL + + Sbjct: 127 CINSPECQKHIKSLSAGSTQAVINTSEIEKLGVVIPANIDEQNRIARYFSTLDNLITLHQ 186 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + EQ ++ K Q + ++ F K + G ++ P Sbjct: 187 RKCEQTKKLKKYMLQKMFPRNGAKVPEIRFDGFTYDWEQRKLGEIYGSIGNAFVG-TATP 245 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNR-HKLQDGDLLFTRYNGSLEFVGV 302 + GH L ++V+ G ++ N F+ E + L GD++ + VG Sbjct: 246 YYAEHGHFYLESNNVKDGQINHNAEIFINDEFYEKQKDKWLHTGDMVMVQSG----HVGH 301 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 ++ + + + R ++ P ++ + + A+ + N + T + K I Sbjct: 302 AAVIPEELDNTAAHALIMFR-NPKEEIEPYFLNYEYQTDKAKKQIEN-ITTGNTIKHILA 359 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 D++ VV +P +EQ I F D + + + + +L F Sbjct: 360 SDMQEFVVDIPKYEEQKVIASY----FCKLDHLITLHQRKCDELKKMKKYMLQNMF 411 Score = 98.2 bits (243), Expect = 6e-19, Method: Composition-based stats. 
Identities = 36/208 (17%), Positives = 84/208 (40%), Gaps = 17/208 (8%) Query: 5 KLPE--------GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT 56 K+PE W + + I G + A Y + + +NN+++G+ + Sbjct: 210 KVPEIRFDGFTYDWEQRKLGEIYGSI-GNAF-VGTATPYYAEHGHFYLESNNVKDGQINH 267 Query: 57 TDLVFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-P 113 +F+ ++ + + D+V+ S VG +A + + + R P Sbjct: 268 NAEIFINDEFYEKQKDKWLHTGDMVMVQS----GHVGHAAVIPEELDNTAAHALIMFRNP 323 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 ++ I F+ + ++ + +I +++ G I +I + + IP EQK+IA Sbjct: 324 KEEIEPYFLNYEYQTDKAKKQIENITTGNTIKHILASDMQEFVVDIPKYEEQKVIASYFC 383 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVL 201 L + + + +++ ++ K Q + Sbjct: 384 KLDHLITLHQRKCDELKKMKKYMLQNMF 411 Score = 95.1 bits (235), Expect = 5e-18, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 80/210 (38%), Gaps = 15/210 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +++ F K + G K G P+ R ++ ++++D++++ Sbjct: 7 RFKGFTKDWEQRKLNEVAEKICVGFVGTCEKFYTDESGIPMYRTGNLNGLSLNRDDLKYV 66 Query: 272 ECSESELN-RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDA 329 + N + +L+ GD+L R+ S G + + N L ++ R K Sbjct: 67 TNEFHQHNQKSQLKAGDILIARHGDS----GKAVNYENSEEANCL---NIVIIRPDFKKC 119 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP-PVKEQAEIVRRVEQL 388 +++ +SP + + + S Q I+ +I+ V++P + EQ I R L Sbjct: 120 NYKFLTNCINSPECQKHIKSLSA-GSTQAVINTSEIEKLGVVIPANIDEQNRIARYFSTL 178 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAF 418 D + + L + +L K F Sbjct: 179 ----DNLITLHQRKCEQTKKLKKYMLQKMF 204 >UniRef50_B0JXI4 Putative type I restriction enzyme specificity protein n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JXI4_MICAN Length = 388 Score = 190 bits (482), Expect = 1e-46, Method: Composition-based stats. 
Identities = 86/423 (20%), Positives = 157/423 (37%), Gaps = 45/423 (10%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++ + W P+S V + Q P A NG+ D+ + Sbjct: 6 EITKKWPHRPLSEVVDFLDSKRKPITQKDRVPGP--YPYYGA----NGQQDSVADYIFDE 59 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 L +++A G K+ + +C VLRP+K + +I Sbjct: 60 PL-----------VLLAEDGGHFGDADKTIAYQVEGKCWVNNHAHVLRPKKDVDIRYICR 108 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 L R ++ G+ + + + I I +PPL EQ+ IA LD K Sbjct: 109 ----HLERYDVTPFITGSTRGKLTKTAANNIPIALPPLEEQRRIAAILDKADGVRRKRKE 164 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 ++LK + G V ++ + ++ +G S K + Sbjct: 165 AIRLTDELLKSTFLEMFGDPVTNPKG------------WEVRELGDCVKDIESGWSPKCD 212 Query: 245 E---SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 +L++ +V GH + ++ + + + +++ GDLL TR N + E VG Sbjct: 213 TRQAEPEEWGVLKLGAVTYGHFNPDENKAMLPDDVPRQELEIKTGDLLVTRKN-TYELVG 271 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGI 360 ++ + + L+ PD + R RL P Y+ S + R + T+G I Sbjct: 272 ASAFVQMTRPK-LMLPDLIFRLRLIDGIDPVYVWQTLSQKTMRLKLSGLAGGTAGSMPNI 330 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV-NNLTQSILAKAFR 419 S +++ +PP Q + Q + ++K+ + NL S+L +AFR Sbjct: 331 SKARLRTLPFPVPPQLLQLKYREIFNQFW-----LKKEHQKESEEISENLFNSLLQRAFR 385 Query: 420 GEL 422 GEL Sbjct: 386 GEL 388 >UniRef50_C4LDK7 Restriction modification system DNA specificity domain protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LDK7_TOLAT Length = 445 Score = 189 bits (481), Expect = 1e-46, Method: Composition-based stats. 
Identities = 86/436 (19%), Positives = 164/436 (37%), Gaps = 34/436 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +PE W I + T G + + D +P++ I +GK T + Sbjct: 16 EVGVIPEDWDIQRLGVHATFKTG-PFGSALHKSDYVDGGIPVVNPMQIIDGKVKPTSSMA 74 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLIF 118 + K+ ++ DIVI G + +G+ A + G ++R ++ Sbjct: 75 ISDEAAKKLSEYRLIAGDIVI----GRRGDMGRCAVISEIENGWLCGTGSMIVRVKENAD 130 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLA 177 + F+ + I S S G + N+ + + I IP EQ IA L + A Sbjct: 131 AAFLQRVLSNPQTITAIESASVGTTMINLNQGTLRALLILIPRDKQEQTAIANALSDVDA 190 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGG-------AVNGKLTEKWRNFEPQHSVFKKLNFES 230 ++ + + I Q +L G A+ T K + + S Sbjct: 191 LINELEKLIAKKQAIKTATMQQLLTGKTRLPQFALREDGTPKGYKASELGEIPEDWEVVS 250 Query: 231 --ILTELRNGLSSKPNE-SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + + GL+ PN+ + G +LR S+V+ + ++ ++ E R ++ GD Sbjct: 251 LAEIGQTIIGLTYSPNDVAEHGTLVLRSSNVQNNVLAYDNNVYVNMDLPE--RVIVKKGD 308 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYP-DKLIRARLTKDALPEYIEIFFSSPSARNA 346 +L NGS + +G C L+ K + R + ++ F S +N Sbjct: 309 ILICVRNGSRQLIGKCALIDKNADGAAFGAFMSIFRTKSFG-----FVFYQFQSDIIQNQ 363 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVK-EQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + + T Q I+ KD+ + LP ++ EQ I + + DT + + L + Sbjct: 364 INEIMGATINQ--ITNKDMAGFRIPLPTLQKEQVAITS----ILSDMDTEIQSLQQRLTK 417 Query: 406 VNNLTQSILAKAFRGE 421 + Q ++ + G+ Sbjct: 418 TRQIKQGMMQELLTGK 433 Score = 97.8 bits (242), Expect = 8e-19, Method: Composition-based stats. 
Identities = 30/222 (13%), Positives = 87/222 (39%), Gaps = 13/222 (5%) Query: 203 GAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS-SKPNESGVGHPILRISSVRAG 261 + + P+ ++L + G + K + G P++ + G Sbjct: 6 QVIPEGYKQTEVGVIPEDWDIQRLGVHATFKTGPFGSALHKSDYVDGGIPVVNPMQIIDG 65 Query: 262 HVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL 320 V + +L+ ++L GD++ R +G C ++ ++++ L + Sbjct: 66 KVKPTSSMAISDEAAKKLSEYRLIAGDIVIGRRG----DMGRCAVISEIENGWLCGTGSM 121 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP-PVKEQA 379 I R+ ++A +++ S+P A+ + + ++ +++ ++L+P +EQ Sbjct: 122 I-VRVKENADAAFLQRVLSNPQTITAIES-ASVGTTMINLNQGTLRALLILIPRDKQEQT 179 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 I + + D + ++ +A+ + + + + G+ Sbjct: 180 AIANALSDV----DALINELEKLIAKKQAIKTATMQQLLTGK 217 >UniRef50_Q57594 Type-1 restriction enzyme MjaXIP specificity protein n=2 Tax=Methanocaldococcus RepID=T1S1_METJA Length = 425 Score = 189 bits (480), Expect = 2e-46, Method: Composition-based stats. Identities = 67/432 (15%), Positives = 159/432 (36%), Gaps = 27/432 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDL 59 G++PE W I + V I+ K Y K+ +P ++ +I N T + Sbjct: 12 EIGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNSNKYLTNTKI 71 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + L + I P++ V+ GS +G++A + + A G++ + ++ S Sbjct: 72 KITEEGLNNSNAWIVPKNSVLFAMYGS---IGETAINKIEV-ATNQAILGIIPKDNILES 127 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ + + +N S L N+ IP+PPL EQK IA+ L + + Sbjct: 128 EFLYYILAKN--KNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKILTKIDEGI 185 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + ++ +I K +L + + + P+ ++ + Sbjct: 186 EIIEKSINKLERIKKGLMHKLLTKGIGHSRFKKSEIGEIPEDWEVFEIKDIFEVKTGTTP 245 Query: 239 LSSKPNESGVG-HPILRISSVR----AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 + K G + + ++ ++ + + + + N + + G ++ + Sbjct: 246 STKKSEYWENGEINWITPLDLSRLNEKIYIGSSERKVTKIALEKCNLNLIPKGSIIISTR 305 Query: 294 
NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 VG +L N K + + E+ + + ++ + Sbjct: 306 AP----VGYVAVLTVESTFN--QGCKGLFQKNNDSVNTEFYAYYL---KFKKNLLENLSG 356 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 S K +S +++ + LPP++EQ +I ++ + D + ++ + + I Sbjct: 357 GSTFKELSKSMLENFKIPLPPLEEQKQIA----KILSSVDKSIELKKQKKEKLQRMKKKI 412 Query: 414 LAKAFRGELTAQ 425 + G++ + Sbjct: 413 MELLLTGKVRVK 424 >UniRef50_C6YVW3 Predicted protein n=2 Tax=Francisella philomiragia subsp. philomiragia ATCC 25015 RepID=C6YVW3_9GAMM Length = 379 Score = 189 bits (480), Expect = 2e-46, Method: Composition-based stats. Identities = 78/422 (18%), Positives = 151/422 (35%), Gaps = 50/422 (11%) Query: 7 PEGWVIAPVSTVTTLIRG-VTYKKEQAINYLKDDYLPLIRANN-IQNGKFDTTDLVFVPK 64 P GW + V ++ KK + +D P+ A I+N F + ++ Sbjct: 2 PAGWEWEKLEKVCDKASSNLSLKKIEN----EDGEYPIYGAKGFIKNISFFHREEPYIS- 56 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 I V L + S L P+ I ++ Sbjct: 57 ---------------IIKDGAGVGRV-----TMLDSKSSVIGTLQYLLPKNCIDIKYLYF 96 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + + +G I +I + +P+PPLAEQK I KLD+L ++D Sbjct: 97 LLLVIDFGKYV----SGTTIPHIYYRDYKEHLVPLPPLAEQKRIVAKLDSLFEKIDKAIE 152 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK-- 242 +Q + L + F+ + N + I ++ +G + K Sbjct: 153 LHQQNITNANTLMASTLD-----------KTFKKLEGEYSYKNLKDITIKIGSGATPKGG 201 Query: 243 -PNESGVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFV 300 G ++R +V + + F++ S+++ L ++ D+L S V Sbjct: 202 QKAYKQKGTSLIRSMNVHDMGFSKKGLAFIDDSQADKLKNVIVEKDDVLLNITGAS---V 258 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 C ++ + + I RL + +++ + SP + ++ + ++ I Sbjct: 259 ARCCVVCESALPARVNQHVSI-IRLNDSFISKFLHYYLISPMKKTELLFSSSGGATREAI 317 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + I++ V + Q + V ++ + D I++ L + L SIL KAFRG Sbjct: 318 TKSMIENLQVPDISLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRG 377 
Query: 421 EL 422 EL Sbjct: 378 EL 379 >UniRef50_D2QTT7 Restriction modification system DNA specificity domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTT7_9SPHI Length = 441 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. Identities = 93/433 (21%), Positives = 169/433 (39%), Gaps = 31/433 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W + + V + + + + K + + ++ + F Sbjct: 25 IGEIPAHWEVGRIKYVCKINQ-----RSLPESTAKSFPIHYVDIGSVTLEEGIVQTEEFE 79 Query: 63 PKNLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 KN +++I + D +I+ + Q F S G VL P LI F Sbjct: 80 FKNAPSRARRIANAGDTIISTVRTYLKAIAFVDEQQSQFIYSTG--FAVLNPLPLIMPKF 137 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +A KS + ++S+ S G + I + I PPL+EQ IAE LD AQ+D Sbjct: 138 LAMAVKSDSFTEQVSANSKGMSYPAINSTELGCLAICFPPLSEQTRIAEFLDRKTAQIDQ 197 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE----KWRNFEPQHSVFKKLNFESILTELRN 237 A+ EQ+ ++L RQ ++ AV L K + + + N Sbjct: 198 AIAQKEQLIELLNERRQVMIHRAVTRGLNPNAPMKDSGIDRGDARWIGEIPAHWEVSRIN 257 Query: 238 GLSSKPNESGVGHPILRISSVRAGH----VDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 L ++ +E+G L I S+ +G +D +IR + + + L GD+ F + Sbjct: 258 WLFTEKDETGYPDLPLLIVSINSGVTVRDMDDTEIRKQVAEDFNVYKRAL-AGDIAFNKM 316 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 VGV L+ PD ++ AR Y F + R + VK Sbjct: 317 RMWQGAVGVV------PQDGLVSPDYVV-ARPNNFVNSAYYGFLFKT---REYLAEFVKH 366 Query: 354 TSG----QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + G + + +D KS ++PP++EQ +IV + ++ + ++ L Sbjct: 367 SHGIAWDRNRLYWEDFKSIFAMVPPLEEQNQIVDFLNAQNEEMSFASTKIQKQIQKLQEL 426 Query: 410 TQSILAKAFRGEL 422 +++ A G++ Sbjct: 427 KSTLINSAVTGKI 439 Score = 84.0 bits (206), Expect = 1e-14, Method: Composition-based stats. 
Identities = 37/244 (15%), Positives = 78/244 (31%), Gaps = 18/244 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR--AGHVDQNDIR 269 +W P H ++ + + + S+ + + I SV G V + Sbjct: 23 EWIGEIPAHWEVGRIKYVCKINQRSLPESTAKSFP---IHYVDIGSVTLEEGIVQTEEFE 79 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 F + R GD + + L+ + Q +Y Sbjct: 80 F--KNAPSRARRIANAGDTIISTVRTYLKAIA----FVDEQQSQFIYSTGFAVLNPLPLI 133 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +P+++ + S S + K S I+ ++ + PP+ EQ I +++ Sbjct: 134 MPKFLAMAVKSDSFTEQVSANSKGMS-YPAINSTELGCLAICFPPLSEQTRIAEFLDRKT 192 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGEN------SAAAL 443 A D Q + +N Q ++ +A L ++ + G+ A Sbjct: 193 AQIDQAIAQKEQLIELLNERRQVMIHRAVTRGLNPNAPMKDSGIDRGDARWIGEIPAHWE 252 Query: 444 LEKI 447 + +I Sbjct: 253 VSRI 256 >UniRef50_B3H2F5 Type I restriction-modification system, S subunit n=2 Tax=Bacteria RepID=B3H2F5_ACTP7 Length = 508 Score = 189 bits (479), Expect = 2e-46, Method: Composition-based stats. 
Identities = 87/457 (19%), Positives = 170/457 (37%), Gaps = 81/457 (17%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGKFDTTDLVF 61 ++P+ WV + + +I G T K + N+ K +P I + I + Sbjct: 70 EIPKSWVWVRLDFLGEIIGGGTPKTNEDDNWNK-GSIPWITPADMKYISGKYISKGNRNI 128 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L S ++ ++ ++ S + + E + + Sbjct: 129 TENGLRSSSTRLLSKNSIVYSSRAPIGYIAIT-----ETELCTNQGFKSIDLYNKEIVDY 183 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + +I S ++G I +F IP+PPL EQK I K++ LL ++ Sbjct: 184 LYYSLI--YFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLPYIEQ 241 Query: 182 TKARFEQI----PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLN---------- 227 + E++ Q ++ ++++L A+ GKLT++ N EP + +++ Sbjct: 242 YAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTKQDPNDEPALVLIERIKAEKLRLIAEK 301 Query: 228 -------------FESILTELRNGL-----SSKPNESGVGHPILRI-------------- 255 +++ E+ NG P E +R+ Sbjct: 302 KLKKPKVVSEIILRDNLPYEIVNGKERCIADEVPFEIPESWVWVRLGEIGETNIGLTYNP 361 Query: 256 -------------SSVRAGHVD-QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 +++ G +D +DI + E R DLL NGS + VG Sbjct: 362 SDVASDGTIVLRSGNIQDGKIDVSSDIVKVNLDIPENKRC--YKNDLLICARNGSKKLVG 419 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 ++ K + + + R+ K YI + SSP RN TT Q I+ Sbjct: 420 KAAIIDKDGYSFGAF-MTIFRSPFNK-----YIYYYLSSPLFRNDFDGINTTTINQ--IT 471 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 ++ ++++ LP + EQ IV ++E LF+ + ++ Sbjct: 472 QSNLNNRLIPLPSLNEQLRIVEKIETLFSTLQNLSQK 508 Score = 116 bits (290), Expect = 2e-24, Method: Composition-based stats. 
Identities = 58/316 (18%), Positives = 114/316 (36%), Gaps = 40/316 (12%) Query: 165 QKIIAEKLDTLLA-------QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 Q I KL +D K +++ K + + + + Sbjct: 12 QLAIQGKLVPQDPTDEPASVLLDKIKKEKDRLIAEGKIKKSKKTTDNLPSVSQQDFSFEI 71 Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESGVG----HPILR---ISSVRAGHVDQNDIRF 270 P+ V+ +L+F L E+ G + K NE P + + + ++ + + Sbjct: 72 PKSWVWVRLDF---LGEIIGGGTPKTNEDDNWNKGSIPWITPADMKYISGKYISKGNRNI 128 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 E + L ++++ +G + + N + + Sbjct: 129 TENGLRSSSTRLLSKNSIVYSSRAP----IGYIAITETELCTNQGF-------KSIDLYN 177 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 E ++ + S + + + K ISG + ++ LPP+ EQ IV ++E+L Sbjct: 178 KEIVDYLYYSLIYFTPEIQSRASGTTFKEISGTAFGNTIIPLPPLNEQKRIVAKIEELLP 237 Query: 391 YADTIEKQVNNALARV----NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEK 446 Y + ++ A L +SIL A +G+LT Q + P A L+E+ Sbjct: 238 YIEQYAEKEEKLTALHQQFPEQLKKSILQAAIQGKLTKQDPNDEP--------ALVLIER 289 Query: 447 IKAERAASGGKKASRK 462 IKAE+ +K +K Sbjct: 290 IKAEKLRLIAEKKLKK 305 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 11/61 (18%) Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA---ASGGKKASR 461 + L +IL A +G+L Q + P A+ LL+KIK E+ A G K S+ Sbjct: 2 KAQQLKNAILQLAIQGKLVPQDPTDEP--------ASVLLDKIKKEKDRLIAEGKIKKSK 53 Query: 462 K 462 K Sbjct: 54 K 54 >UniRef50_A3XVN0 Type I restriction-modification system, S subunit n=1 Tax=Vibrio sp. MED222 RepID=A3XVN0_9VIBR Length = 424 Score = 188 bits (478), Expect = 3e-46, Method: Composition-based stats. 
Identities = 76/422 (18%), Positives = 171/422 (40%), Gaps = 33/422 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTT-DLVFVPK- 64 E W ++ +S + I+ T+ + +PL+ A N+ +GK + V + Sbjct: 13 EDWNVSNLSECSLFIKDGTHGTHKRTPT----GIPLLSAKNVTASGKIKWDVNDSLVSEA 68 Query: 65 --NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGF 121 + + ++ +D+++ + +G+ A + + GV+RP+K + F Sbjct: 69 DYSKIHSKYELEKDDLLLTV----VGTLGRRALVDGSAKFTIQRSVGVIRPDKNKVTPNF 124 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I HF S ++N++ + + + +P PPL EQK IA L ++ ++ Sbjct: 125 IFHFCGSDFFQNQLELRANATAQAGVYLGELAKVPVPSPPLPEQKKIAAILTSVDEVIEK 184 Query: 182 TKARFEQIPQILKRFRQAVL--GGAVNGKLTEKWR----NFEPQHSVFKKLNFESILTEL 235 T+A+ +++ + Q +L G V+GK +++ P+ +L+ + + + Sbjct: 185 TQAKIDKLKDLKTGMMQELLTCGVGVDGKPHTEFKDSPVGRVPKGWEVVELDRAAKVIDC 244 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRY 293 + P G P+++ ++R G ++ + + ++ H GD++++R Sbjct: 245 ---KHATPKYFSNGFPVVKPGNIREGFLELRGCSLTDKAGFDNLNENHTPTIGDIIYSR- 300 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 N + G + + D + + K ++ +SP + + + Sbjct: 301 NQTYG----VGAYVNRSMEFCIGQDVCVIS--PKKCNSIFLFYMINSPLVKEQV-ELLAA 353 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 S K I+ I+ + LP ++EQ I E + +EK++ L Q + Sbjct: 354 GSTFKRINLGSIRKLKIALPCIEEQQAIGAVFESIDNKVSLLEKKLIKKKDTKKALMQDL 413 Query: 414 LA 415 L Sbjct: 414 LT 415 Score = 110 bits (274), Expect = 1e-22, Method: Composition-based stats. 
Identities = 41/204 (20%), Positives = 83/204 (40%), Gaps = 13/204 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+GW + + +I + A + P+++ NI+ G + Sbjct: 223 VGRVPKGWEVVELDRAAKVI-----DCKHATPKYFSNGFPVVKPGNIREGFLELRGCSLT 277 Query: 63 PK---NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 K + + E+ + DI+ + VG A+ + E G V+ P+K S Sbjct: 278 DKAGFDNLNENHTPTIGDIIYSR--NQTYGVG--AYVNRSMEFCIGQDVCVISPKK-CNS 332 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ + S L + ++ L+AG+ I S + I +P + EQ+ I +++ +V Sbjct: 333 IFLFYMINSPLVKEQVELLAAGSTFKRINLGSIRKLKIALPCIEEQQAIGAVFESIDNKV 392 Query: 180 DSTKARFEQIPQILKRFRQAVLGG 203 + + + K Q +L G Sbjct: 393 SLLEKKLIKKKDTKKALMQDLLTG 416 Score = 98.2 bits (243), Expect = 5e-19, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 76/211 (36%), Gaps = 10/211 (4%) Query: 209 LTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQ-- 265 +++ + N +++G + G P+L +V +G + Sbjct: 1 MSDSPLTLVLDTEDWNVSNLSECSLFIKDGTHGTHKRTPTGIPLLSAKNVTASGKIKWDV 60 Query: 266 NDIRFLECSESEL-NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 ND E S++ ++++L+ DLL T +G L+ + +IR Sbjct: 61 NDSLVSEADYSKIHSKYELEKDDLLLTVVG----TLGRRALVDGSAKFTIQRSVGVIRP- 115 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 P +I F S +N + + Q G+ ++ V PP+ EQ +I Sbjct: 116 DKNKVTPNFIFHFCGSDFFQNQL-ELRANATAQAGVYLGELAKVPVPSPPLPEQKKIAAI 174 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILA 415 + + + + +++ + Q +L Sbjct: 175 LTSVDEVIEKTQAKIDKLKDLKTGMMQELLT 205 >UniRef50_Q89Z57 Putative type I restriction enzyme S.BthVORF4518AP n=2 Tax=Bacteroides RepID=Q89Z57_BACTN Length = 474 Score = 188 bits (478), Expect = 3e-46, Method: Composition-based stats. 
Identities = 77/426 (18%), Positives = 163/426 (38%), Gaps = 52/426 (12%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV + + ++ D +P + +++ GK + + FVP+ Sbjct: 70 EVPSSWVWCKLEDYVKSVTDGDHQAPPK----SDIGIPFLVISDVAKGKLNFLNTRFVPQ 125 Query: 65 NLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 ++ +K D++ ++ G ++ + F G++ + L S + Sbjct: 126 EYYEKISFDRKPEKGDLLFTVT----GSYGIVVPVNIDCKFCFQRHIGLI--KTLNTSEY 179 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + H KSS ++ + + G + + +PIPP AEQ+ I +++ + ++ Sbjct: 180 LLHLLKSSYFKGQCDEFATGTAQKTVGLETLRSFLLPIPPFAEQQRIVIEIEKWFSLIEL 239 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNF------------- 228 + + + +K+ + +L A++GKL + N EP + K++N Sbjct: 240 IEGGKDDLQTTIKQAKSKILDLAIHGKLVPQDPNEEPAIKLLKRINPDFTPCDNGHSGKL 299 Query: 229 -------ESILTE----LRNGLSSKPNESGVGHP---ILRISSVRAGHVDQNDIRFLECS 274 + + +G S P +R+ +R D + Sbjct: 300 PYKIPKTWAWCSHNSILDISGGSQPAKSYFETIPKPNYIRLYQIR----DYGESPVPVYI 355 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYI 334 L + + GD+L RY GSL G K+ + + + + EY Sbjct: 356 PINLASKQTEKGDILLARYGGSL---GKVFHAKQGAYNVAMVK---VIFKFENLIYKEYA 409 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 ++ S + + + + Q G + D LPP+ EQ IV+++E+LF+ D Sbjct: 410 YYYYLSDLYQGKLKEISR--TAQTGFNITDFNDMYFPLPPINEQQRIVQKIEELFSSLDN 467 Query: 395 IEKQVN 400 I+K + Sbjct: 468 IQKSLE 473 Score = 132 bits (332), Expect = 3e-29, Method: Composition-based stats. 
Identities = 60/272 (22%), Positives = 102/272 (37%), Gaps = 21/272 (7%) Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 ++ KA E++ + K R + + P V+ E + + + Sbjct: 32 LLERIKAEKERLIKEGKIKRSKKSAKTSDTPHYQNVPFEVPSSWVW--CKLEDYVKSVTD 89 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH--KLQDGDLLFTRYNG 295 G P +S +G P L IS V G ++ + RF+ E K + GDLLFT G Sbjct: 90 GDHQAPPKSDIGIPFLVISDVAKGKLNFLNTRFVPQEYYEKISFDRKPEKGDLLFT-VTG 148 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 S V + + + + EY+ S + T + Sbjct: 149 SYGIV-----VPVNIDCKFCFQRHIGLIKTLNT--SEYLLHLLKSSYFKGQCDEFA-TGT 200 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 QK + + ++S ++ +PP EQ IV +E+ F+ + IE ++ + IL Sbjct: 201 AQKTVGLETLRSFLLPIPPFAEQQRIVIEIEKWFSLIELIEGGKDDLQTTIKQAKSKILD 260 Query: 416 KAFRGELTAQWRAENPDLISGENSAAALLEKI 447 A G+L Q E P A LL++I Sbjct: 261 LAIHGKLVPQDPNEEP--------AIKLLKRI 284 Score = 60.1 bits (144), Expect = 2e-07, Method: Composition-based stats. Identities = 21/61 (34%), Positives = 28/61 (45%), Gaps = 11/61 (18%) Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA---ASGGKKASRKK 463 L Q IL A G+L Q + P A+ LLE+IKAE+ G K S+K Sbjct: 4 KALRQKILDLAIHGKLVPQDPNDEP--------ASVLLERIKAEKERLIKEGKIKRSKKS 55 Query: 464 S 464 + Sbjct: 56 A 56 >UniRef50_C3PVT7 Type I restriction enzyme EcoR124II specificity protein n=3 Tax=Bacteroides RepID=C3PVT7_9BACE Length = 356 Score = 188 bits (478), Expect = 3e-46, Method: Composition-based stats. 
Identities = 69/362 (19%), Positives = 150/362 (41%), Gaps = 38/362 (10%) Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPEKLIFSGFIAHFTK 127 + ++ D+++ ++ GS +G+ A F C + ++R L+ + Sbjct: 2 KGTEVLANDLLLNITGGS---LGRCAVVPADFNCGNVSQHVCIMR-SVLVEPEYFHVLVL 57 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 SS + + G+ + + + + P+PPL EQ+ I +++ A +D + Sbjct: 58 SSYFAKSMK--ITGSGREGLPKYNLEQMGFPLPPLTEQQRIVAEIEHWFALIDQIEQGKA 115 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV-----------------------FK 224 + I+K+ + +L A++GKL + N EP + + Sbjct: 116 DLQTIIKQTKSKILDLAIHGKLVPQDPNDEPAIELLKRINPDFTPCDNGHYTFDVPNGWN 175 Query: 225 KLNFESILTELRNGLSSKPNESGVGHPILRIS-SVRAGHVDQNDIRFLECS--ESELNRH 281 + + L G S K +E +P+ +++ G + RFL+ S +++ Sbjct: 176 WCKLNDLCSFLSRGKSPKYSEDDKTYPVFAQKCNLKEGGISLEQARFLDPSTINKWDSKY 235 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKL---QHQNLLYPDKLIRARLTKDALPEYIEIFF 338 KLQ GD+L VG L + ++ ++ + R ++ EY+ + Sbjct: 236 KLQTGDVLVNSTG--TGTVGRTRLFDESYLGKYPFVVPDSHVAVVRTYEEINSEYVFAYM 293 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 SS + + + + ++ QK + +++ PP+ EQ IV+++E+LF+ D I+ Sbjct: 294 SSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIEELFSVLDNIQNA 353 Query: 399 VN 400 + Sbjct: 354 LE 355 Score = 121 bits (303), Expect = 7e-26, Method: Composition-based stats. 
Identities = 35/194 (18%), Positives = 77/194 (39%), Gaps = 18/194 (9%) Query: 5 KLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRAN-NIQNGKFDTTDLVFV 62 +P GW ++ + + + RG + K + D P+ N++ G F+ Sbjct: 169 DVPNGWNWCKLNDLCSFLSRGKSPKYSE-----DDKTYPVFAQKCNLKEGGISLEQARFL 223 Query: 63 PK---NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-----GAFCGVLRPE 114 N K+ D+++ + VG++ + + + V+R Sbjct: 224 DPSTINKWDSKYKLQTGDVLVNSTG--TGTVGRTRLFDESYLGKYPFVVPDSHVAVVRTY 281 Query: 115 KLIFSGFIAHFTKSSLYRNKIS-SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 + I S ++ + S L + I +L+ N + + + P PP+ EQ+ I +K++ Sbjct: 282 EEINSEYVFAYMSSQLIQQYIEDNLAGSTNQKELYIGVLENLYFPFPPINEQQRIVQKIE 341 Query: 174 TLLAQVDSTKARFE 187 L + +D+ + E Sbjct: 342 ELFSVLDNIQNALE 355 Score = 104 bits (260), Expect = 7e-21, Method: Composition-based stats. Identities = 42/162 (25%), Positives = 63/162 (38%), Gaps = 16/162 (9%) Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 DLL GSL G C ++ + R + PEY + S Sbjct: 9 NDLLLNITGGSL---GRCAVVP-ADFNCGNVSQHVCIMR-SVLVEPEYFHVLVLSSYFAK 63 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 +M T SG++G+ +++ LPP+ EQ IV +E FA D IE+ + Sbjct: 64 SMK---ITGSGREGLPKYNLEQMGFPLPPLTEQQRIVAEIEHWFALIDQIEQGKADLQTI 120 Query: 406 VNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 + IL A G+L Q + P A LL++I Sbjct: 121 IKQTKSKILDLAIHGKLVPQDPNDEP--------AIELLKRI 154 >UniRef50_A5KY57 Type I restriction-modification enzyme, S subunit n=2 Tax=Vibrionales bacterium SWAT-3 RepID=A5KY57_9GAMM Length = 411 Score = 188 bits (478), Expect = 3e-46, Method: Composition-based stats. 
Identities = 78/416 (18%), Positives = 153/416 (36%), Gaps = 19/416 (4%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P GW + + ++Y Q + ++ +P +R ++ + +++ Sbjct: 2 VPNGWEEKSLKDICKKT--ISYGIVQTGENI-ENGVPCVRVVDLSKNTLNPVEMIKTSDK 58 Query: 66 LVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + + +K I E ++ G +V K + + + G L P K + S ++ Sbjct: 59 IHQSYKKTILCEGELMMALRGEIGLVKKVTPELVGANITRG--LARLSPIKSVDSDYLLW 116 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 +S+ +N++S S G+ + I S + +PIPPL EQ+ IA+ L T + +T+ Sbjct: 117 TLRSNKIKNELSRKSGGSALQEIALGSLRKVVLPIPPLPEQRKIAQILSTWDRGIATTEK 176 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 E Q K Q +L G E + FE + S L + K Sbjct: 177 LIETSKQQKKALMQQLLTGKKRLVNPETGKAFEGEWERHS----MSDLVFIDRKSLGKKT 232 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 + +S V G + + S R +Q+GD+L + +L+ Sbjct: 233 PDDFEFQYISLSDVAVGSISKELEVHKFASAPSRARRVIQEGDILLSTVRPNLKGFAKV- 291 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 +H + + K +YI + S + + V S I+ D Sbjct: 292 ---SEKHADCIASTGFSVLTPKKRVSGDYIHQYIFSSHVTGQIDSLV-VGSNYPAINSSD 347 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + V P +EQ +I + AD + + LA ++++ + G Sbjct: 348 VAGLKVYCPTYEEQQKIAS----VLTAADKEIEVLEAKLAHFKQEKKALMQQLLTG 399 Score = 92.5 bits (228), Expect = 3e-17, Method: Composition-based stats. 
Identities = 45/213 (21%), Positives = 79/213 (37%), Gaps = 9/213 (4%) Query: 2 SAGKLPEG-WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DL 59 GK EG W +S + + R K D I +++ G ++ Sbjct: 203 ETGKAFEGEWERHSMSDLVFIDR-----KSLGKKTPDDFEFQYISLSDVAVGSISKELEV 257 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + I DI+++ + K + +H S G VL P+K + Sbjct: 258 HKFASAPSRARRVIQEGDILLSTVRPNLKGFAKVSEKHADCIASTG--FSVLTPKKRVSG 315 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +I + SS +I SL G+N I + + + P EQ+ IA L ++ Sbjct: 316 DYIHQYIFSSHVTGQIDSLVVGSNYPAINSSDVAGLKVYCPTYEEQQKIASVLTAADKEI 375 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 + +A+ Q K Q +L G K+ E+ Sbjct: 376 EVLEAKLAHFKQEKKALMQQLLTGNRRVKVDEE 408 Score = 84.4 bits (207), Expect = 8e-15, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 76/227 (33%), Gaps = 20/227 (8%) Query: 216 FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQ-NDIRFLECS 274 P K L T + G+ G P +R+ + ++ I+ + Sbjct: 1 MVPNGWEEKSLKDICKKT-ISYGIVQTGENIENGVPCVRVVDLSKNTLNPVEMIKTSDKI 59 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ--NLLYPDKLIRARLTKDALPE 332 + L +G+L+ G GL+KK+ + L R K + Sbjct: 60 HQSYKKTILCEGELMMA-------LRGEIGLVKKVTPELVGANITRGLARLSPIKSVDSD 112 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 Y+ S +N + + S + I+ ++ V+ +PP+ EQ +I Q+ + Sbjct: 113 YLLWTLRSNKIKNEL-SRKSGGSALQEIALGSLRKVVLPIPPLPEQRKIA----QILSTW 167 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENS 439 D + ++++ + G+ R NP+ Sbjct: 168 DRGIATTEKLIETSKQQKKALMQQLLTGK----KRLVNPETGKAFEG 210 >UniRef50_A3ZCQ6 HsdS n=5 Tax=Campylobacter jejuni RepID=A3ZCQ6_CAMJE Length = 398 Score = 188 bits (477), Expect = 4e-46, Method: Composition-based stats. 
Identities = 89/423 (21%), Positives = 168/423 (39%), Gaps = 34/423 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVPK 64 LP+GW + + + + G K + + + IR + Q NG + ++ F+ + Sbjct: 4 LPQGWEVKKLEEIANIKGGKRLPKGENLLD-NNTKFAYIRVADFQDNGTINLQNIKFINE 62 Query: 65 NLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKLIFSG 120 N ++ KI +++ I++ +GKS + + + I + Sbjct: 63 NTYNVLKNYKIYDDNLYISI----AGTIGKSGIIPKELNGAILTENAVKLEYIQNNISNK 118 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ FT S++++ +I + + + I IP+PPL EQ+ I LD A++D Sbjct: 119 FMYFFTLSNIFKTQIQTSTKIVAQPKLAITRLKQIQIPLPPLKEQERIVGILDESFAKID 178 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + EQ L Q+ L A N N++ S ++ + E I + G Sbjct: 179 ESIKILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQS-WEWKSLEEISENISAGGD 237 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 N + +I V A V+ N + + + L G++ FV Sbjct: 238 KPKNCTESKTAKNQIP-VYANGVNNNGLVGYTDKATIIKPS-------LTISARGTIGFV 289 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKG 359 ++K + ++ +LI ++ L Y+ + A+ S Sbjct: 290 ----CIRKEPYFPIV---RLISLIPCENILCLHYLYFCLNFFIAKGE-------GSSIPQ 335 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ KS + LPP+KEQ +I ++ +F A +++ L L QS+L KAF+ Sbjct: 336 LTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAKALKELYTKELKDYEELKQSLLDKAFK 395 Query: 420 GEL 422 GEL Sbjct: 396 GEL 398 Score = 95.5 bits (236), Expect = 4e-18, Method: Composition-based stats. 
Identities = 38/209 (18%), Positives = 83/209 (39%), Gaps = 23/209 (11%) Query: 2 SAGKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 KLP+ W + ++ + G K + + +P+ AN + N + Sbjct: 212 ENYKLPQSWEWKSLEEISENISAGGDKPKNCTESKTAKNQIPVY-ANGVNNNGL----VG 266 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + K + I P +++ ++ +G + P+ ++ E ++ Sbjct: 267 YTDKATI-----IKP-----SLTISARGTIGFVCIRKEPY-FPIVRLISLIPCENILCLH 315 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ N + G++I + F + IP+PPL EQ+ IAE LD + + Sbjct: 316 YLYFCL------NFFIAKGEGSSIPQLTIPKFKSLQIPLPPLKEQEQIAEHLDFVFEKAK 369 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K + + + + +Q++L A G+L Sbjct: 370 ALKELYTKELKDYEELKQSLLDKAFKGEL 398 >UniRef50_C5DB08 Restriction modification system DNA specificity domain protein n=1 Tax=Geobacillus sp. WCH70 RepID=C5DB08_GEOSW Length = 445 Score = 188 bits (477), Expect = 4e-46, Method: Composition-based stats. Identities = 74/446 (16%), Positives = 178/446 (39%), Gaps = 44/446 (9%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF- 61 G++P W I + V ++ + + +K + + + I+ G + Sbjct: 13 IGEIPSDWKILRLKNVLK-------ERNEKNSPIKTNEILSLT---IEKGVIPYKEKKSG 62 Query: 62 --VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 K + + P DIV+ + VG S + C + + + Sbjct: 63 GNKAKEDLSNYKLAYPNDIVLNSMNVIVGAVGISKYYG----CVSPVYYVLYSDDVEQNI 118 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINN------------IKPASFDLINIPIPPLAEQKI 167 F + +SS ++ + L G + I + +P+PP++ Q+ Sbjct: 119 RFYNYLFQSSAFQKSLIGLGNGIMMKQSSTGKLNTIRLRIPLDRLKNVYLPVPPVSVQQK 178 Query: 168 IAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEP 218 I LD ++ +D+ + +Q + LK+++Q+++ V L +W P Sbjct: 179 IVNFLDEKVSHIDTIIEKNKQSIEELKKYKQSLIAETVTKGLDPNVEMKDSGIEWVGEIP 238 Query: 219 QHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSV---RAGHVDQNDIRFLECSE 275 +H ++L SI+T S + NE +++ ++V R ++ +D + SE Sbjct: 239 KHWEIRRLRDISIITRGTVDKSKEKNEIP--VYLVQYTNVYYKREQKINDDDYLPITVSE 296 Query: 276 
SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE 335 +E ++K++ GD+L T + + + +G ++ + N ++ +IR R+ + + Sbjct: 297 NEYKKYKVRKGDILLTASSETKDDIGHSTVIVE-DLPNHVFGSDIIRIRIPNKIVDLNYK 355 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 +F A + + + KS ++PP++EQ +I + ++ + + + + Sbjct: 356 KYFMENYYYLAKFDKLSRGITRFRFGMDQFKSLKYVIPPIEEQVKIAKYLDNITNHINQL 415 Query: 396 EKQVNNALARVNNLTQSILAKAFRGE 421 + + + +S++ + G+ Sbjct: 416 ICNKEKLINELESYKKSLIYEYVTGK 441 Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 37/227 (16%), Positives = 89/227 (39%), Gaps = 27/227 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P S +K L +++L E S IL ++ + G + + + Sbjct: 11 EWIGEIP--SDWKILRLKNVLKERNEKNSPIKTNE-----ILSLT-IEKGVIPYKEKKSG 62 Query: 272 -ECSESELNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 ++ +L+ +KL D++ N + VG+ ++ + P + + Sbjct: 63 GNKAKEDLSNYKLAYPNDIVLNSMNVIVGAVGIS------KYYGCVSPVYYVLYSDDVEQ 116 Query: 330 LPEYIEIFFSSPSARNAMM-----------NCVKTTSGQKGISGKDIKSQVVLLPPVKEQ 378 + F S + + +++ + K + + I +K+ + +PPV Q Sbjct: 117 NIRFYNYLFQSSAFQKSLIGLGNGIMMKQSSTGKLNTIRLRIPLDRLKNVYLPVPPVSVQ 176 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQ 425 +IV +++ ++ DTI ++ ++ + QS++A+ L Sbjct: 177 QKIVNFLDEKVSHIDTIIEKNKQSIEELKKYKQSLIAETVTKGLDPN 223 >UniRef50_Q167L9 Type I restriction enzyme specificity subunit, putative n=1 Tax=Roseobacter denitrificans OCh 114 RepID=Q167L9_ROSDO Length = 379 Score = 188 bits (477), Expect = 5e-46, Method: Composition-based stats. 
Identities = 87/420 (20%), Positives = 154/420 (36%), Gaps = 49/420 (11%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 GW + P+ V L G + + + +P+ A NG +D Sbjct: 4 GWEVKPLGEVAKLHYGKALAESERSP---NGTVPVYGA----NGVLGWSD---------- 46 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 ++ +I GS V + PF S + P +L F + + + Sbjct: 47 --HTLTEGPSLIVGRKGSAGEVNRV---DGPFWPSDVTYYTEHDPNRLDF-DYFHYGLMT 100 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + SL+ G I + +PIPPL EQK I LD ++D K E Sbjct: 101 LN----LPSLAKGVK-PGINRNDVYELGLPIPPLEEQKRIVAILDAAFERLDRAKENAEA 155 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 Q + L R F +V + E + +++ G S K Sbjct: 156 NLQNARELFDRTLE-----------RVFAELVAVHATIKLEEVTSKITKGSSPKWQGFSY 204 Query: 249 ----GHPILRISSVRAGHVDQNDIRFLECSESELNRHK-LQDGDLLFTRYNGSLEFVGVC 303 G + +V + +++E ++ +R L GD+L S +G Sbjct: 205 VDSPGVLFVTSENVGKNELLLEKTKYVEEGFNQKDRKSILAPGDVLSNIVGAS---IGRT 261 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 + N+ L+R L + P+++ +SP + + ++ + +S Sbjct: 262 AVFDLDAVANINQAVCLMRC-LPERLSPKFLSFLLNSPYFKARLHEG-ESNMARANLSLA 319 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 + +V LP ++ Q IV+ +E+L ++ E L + +L QS+L KAF GELT Sbjct: 320 FFREFLVPLPELEAQERIVQEIEELATHSAECETNYRTKLTDIADLRQSLLQKAFAGELT 379 >UniRef50_B5EKM3 Restriction modification system DNA specificity domain n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EKM3_ACIF5 Length = 395 Score = 187 bits (476), Expect = 5e-46, Method: Composition-based stats. 
Identities = 78/421 (18%), Positives = 157/421 (37%), Gaps = 34/421 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV----FVP 63 EGW + + V+ + G Q +Y + +P IR +++ G+ ++ V Sbjct: 3 EGWEVKLLGEVSAIGAGN--PAPQDRHYFEQGTIPFIRTSDV--GRIHIGEIFGAADLVN 58 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFI 122 + ++ + I+ S S + + + E + ++ + + F+ Sbjct: 59 ELAARKLAMLPVGTILFPKSGASTFI---NHRVIMGIEAVASSHLATIKAKPHTLLDKFL 115 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++ + + ++ +N +++ + I+ P+PPL EQ+ I LD + + Sbjct: 116 FYYLLTIDAKTLVAD----SNYPSLRISDIATISTPLPPLPEQRRIVAILDEAFEGIATA 171 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 KA E+ Q ++ L AV + E W + + +S + Sbjct: 172 KANAEKNLQNAHEIFESYLN-AVFSQRGEGWVDRRLGDVAMEFGRGKSKHRPRND----- 225 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 P G P ++ VR +++ L + KL L ++ G+ Sbjct: 226 PKLYGGNFPFIQTGDVRN-SSHLITSYDQTYNDAGLAQSKLWPKGTLCITIAANIAETGI 284 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 + +PD +I + +YIE +S +R + S Q I+ Sbjct: 285 -------LDFDACFPDSIIGLVANEKISTNKYIEYLLTSFKSRLQFL---GKGSAQDNIN 334 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 +SQ PP+ Q EIV + L ++ LA ++ L QS+L +AF G+ Sbjct: 335 LATFESQYFPFPPLSNQKEIVSIFDDLHEETQHLKFIYQQKLAALDELKQSLLHQAFNGD 394 Query: 422 L 422 L Sbjct: 395 L 395 >UniRef50_D0KYE4 Restriction modification system DNA specificity domain protein n=1 Tax=Halothiobacillus neapolitanus c2 RepID=D0KYE4_HALNC Length = 401 Score = 187 bits (475), Expect = 7e-46, Method: Composition-based stats. 
Identities = 83/422 (19%), Positives = 158/422 (37%), Gaps = 34/422 (8%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ----NGKFDTTDLVFVPKNL 66 + P+ + + K Q K + +P R + +G D + +F+ + Sbjct: 4 KVVPLKDLFQIGSSKRVLKSQ----WKAEGVPFYRGREVTRLAMDGFVD--NELFISEAH 57 Query: 67 VKE--SQKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIFSGF 121 E +Q +P +DIVI +G S F A ++ + S F Sbjct: 58 YAELANQYGAPRTDDIVITA----IGTIGNSYIVQDGDRFYFKDASILWMKRISDVSSKF 113 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + KS+++ +++ GA ++ + + I +PP+AEQ I LD + Sbjct: 114 VNFWLKSTMFLDQLD-HGNGATVDTLTIQKLQSVQIWVPPIAEQHRIVSILDEAFEGIAK 172 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 +A EQ Q + ++ L + E W + V L G+ Sbjct: 173 ARAHAEQNRQNARALFESHLQSVFT-QRGEGWAEKSLEEVV-------DAQCTLSYGIVQ 224 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFV 300 +E G PI+R + + A + N ++ ++ ++ R L+ G+LL + Sbjct: 225 PGHEYAKGMPIVRPTDLTAKLITLNGLKRIDPKLADGYRRTTLRGGELLLCVRGSTGVLA 284 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 L + P L + F + A + + + I Sbjct: 285 VTSSELAGANVTRGIVP-----IMFDPSLLSQDFGYFLMTSEAVQSQIRIKTYGTALMQI 339 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + D++ V PP+KEQ + ++E+L A +E LA ++ L +S+L +AF G Sbjct: 340 NIGDLRKIAVSFPPLKEQERMTAQLEELSAETQRLESIYQQKLAALDELKKSLLHQAFSG 399 Query: 421 EL 422 L Sbjct: 400 SL 401 >UniRef50_C5VLJ8 HsdS protein n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VLJ8_9BACT Length = 428 Score = 187 bits (474), Expect = 8e-46, Method: Composition-based stats. 
Identities = 78/439 (17%), Positives = 159/439 (36%), Gaps = 47/439 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+P W + + L + D+ + R + ++ Sbjct: 16 GKVPSHWNYSRIK--FGLKSSFSGVWGD-DEKGDDNDVVCYRVADFDYKNGGLSEEKITI 72 Query: 64 KNLVKESQK---ISPEDIVIAMSSGS-KSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 +N+ +++ K I P DI+I S G + VG++ +L + + F +R + + + Sbjct: 73 RNIDEKTFKEREILPNDILIEKSGGGDVNPVGRAVIANLDHKATCSNFIHCVRCNENVLN 132 Query: 120 GFIAHFTKSSLYRNKISSL--SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ S+Y K++ L + I N+K + + +PPL+EQ+ IA LD Sbjct: 133 TRLLYYFFYSIYVQKVNLLFFNQTTGIQNLKVPEYLGQVMFLPPLSEQQSIASFLDAKTK 192 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNF 228 +D A+ EQ +L+ + A++ AV L + W P++ + Sbjct: 193 PIDDIIAKREQQIALLEEMKSAIISRAVTKGLNPEAKMKDSGIEWIGEVPENWNLLRFRL 252 Query: 229 ESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 L + G S + G +P ++ + E + +GD Sbjct: 253 ---LCRISTGDSDTQDAEPDGEYPF-----------------YVRSPQVERSSKFTCEGD 292 Query: 288 -LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 +L V + + ++ I + K Y+ F + Sbjct: 293 AILMAGDGAGAGRV-----FHHVDGKYAVHQRVYIFNQFNKVVDSNYLYQFMRIMFPQR- 346 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 MN S + I++ VV +P + EQ I ++ A D + +A + Sbjct: 347 -MNMGSAQSTVPSVRLHMIQNFVVPIPSIDEQRTITSYLDTETAKIDVRIDKRRKQIALL 405 Query: 407 NNLTQSILAKAFRGELTAQ 425 Q+++ A G++ + Sbjct: 406 QEYKQALITDAVTGKIDVR 424 Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats. 
Identities = 37/236 (15%), Positives = 84/236 (35%), Gaps = 14/236 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS--VRAGHVDQNDIR 269 +W P H + ++ F + + R++ + G + + I Sbjct: 13 QWLGKVPSHWNYSRIKF-GLKSSFSGVWGDDEKGDDNDVVCYRVADFDYKNGGLSEEKIT 71 Query: 270 FLECSESELNRHKLQDGDLLFTR-YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 E ++ D+L + G + VG ++ L H+ + + R ++ Sbjct: 72 IRNIDEKTFKEREILPNDILIEKSGGGDVNPVGR-AVIANLDHKATC-SNFIHCVRCNEN 129 Query: 329 ALP-EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 L + FF S + + T+G + + + QV+ LPP+ EQ I ++ Sbjct: 130 VLNTRLLYYFFYSIYVQKVNLLFFNQTTGIQNLKVPEYLGQVMFLPPLSEQQSIASFLDA 189 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D I + +A + + +I+++A + NP+ ++ + Sbjct: 190 KTKPIDDIIAKREQQIALLEEMKSAIISRAVT-------KGLNPEAKMKDSGIEWI 238 >UniRef50_B2IP18 Type I restriction-modification system, S subunit, putative n=10 Tax=Streptococcus pneumoniae RepID=B2IP18_STRPS Length = 372 Score = 187 bits (474), Expect = 9e-46, Method: Composition-based stats. 
Identities = 80/409 (19%), Positives = 159/409 (38%), Gaps = 44/409 (10%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 + V T I G +K + + K+ +IR N+ T+ + + + Sbjct: 4 VKLGQVATFINGYAFKPQDWSSEGKE----IIRIQNLTK----TSKGINYYSGTIDKKYI 55 Query: 73 ISPEDIVIAMSS--GSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 + DI+I+ S G G+SA + V+ + I + + + L Sbjct: 56 VEAGDILISWSGTLGVFQWCGRSAVLN-------QHIFKVVFDKIDIDKSYFKYVVEKGL 108 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 + G+ + ++ FD I +P L EQ+ IA +LD L + + + E++ Sbjct: 109 --QDAVKHTHGSTMKHLTKKYFDNIIVPYTNLGEQQRIASELDLLSKLILRRQEQLEELN 166 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 ++K + E++ + + K+L L E + Sbjct: 167 LLVKSRFNEMF---------EEYPDSVFLDTYIKELRAGKSL----------AGEENNKN 207 Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 +L+ +V + + ++++ L L+ HK++ GD++ +R N S E VG G + + Sbjct: 208 KVLKTGAVSYDYFNSSEVKNLPIDYIPLDEHKVEIGDVIISRMNTS-ELVGAAGYVWAIN 266 Query: 311 HQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQV 369 N+ PD+L + L P ++ ++ + + TSG K IS + Sbjct: 267 SDNIYLPDRLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIR 326 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 V PP+ Q E V A D + + +L + L +S++ + F Sbjct: 327 VPFPPLALQNEFADFV----ALVDKSQLAIQKSLEELETLKKSLMQEYF 371 Score = 69.7 bits (169), Expect = 2e-10, Method: Composition-based stats. 
Identities = 26/165 (15%), Positives = 67/165 (40%), Gaps = 6/165 (3%) Query: 43 LIRANNIQNGKFDTTDLVFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQ--HLP 99 +++ + F+++++ +P + + + K+ D++I+ + + +VG + + Sbjct: 209 VLKTGAVSYDYFNSSEVKNLPIDYIPLDEHKVEIGDVIISRMN-TSELVGAAGYVWAINS 267 Query: 100 FECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINI 157 + + F+ + + KI +S+G + + NI + I + Sbjct: 268 DNIYLPDRLWKVILNDRVNPVFLWKLITNEKTKLKIKRISSGTSGSMKNISKSQLLQIRV 327 Query: 158 PIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLG 202 P PPLA Q A+ + + + + E++ + K Q G Sbjct: 328 PFPPLALQNEFADFVALVDKSQLAIQKSLEELETLKKSLMQEYFG 372 >UniRef50_Q3AQE4 Restriction endonuclease S subunits-like n=1 Tax=Chlorobium chlorochromatii CaD3 RepID=Q3AQE4_CHLCH Length = 386 Score = 187 bits (474), Expect = 9e-46, Method: Composition-based stats. Identities = 79/392 (20%), Positives = 168/392 (42%), Gaps = 38/392 (9%) Query: 45 RANNIQNGKFDT---TDLVFVP-----KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQ 96 + +I+ GK D T+ P K + + +Q + ++ +G + + Sbjct: 8 KLVDIKTGKLDVNAGTEYGKYPFFTCAKTVYRINQYAFDNEAILVAGNGDLN----VKYF 63 Query: 97 HLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLIN 156 F + + L+ ++ +F ++ + + + + G I IK Sbjct: 64 KGKFNAYQRTYVIENKEVNLLSMKYLYYFMETYMI--HLRNGAIGGIIKYIKIDHLTKAE 121 Query: 157 IPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF 216 IP+PPL +QK IA L + + K +Q+ Q+LK + G K W Sbjct: 122 IPLPPLDDQKRIAHLLGKVERLIAQRKQHLQQLDQLLKSVFLEMFG--FFDKTYTNW--- 176 Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESG---VGHPILRISSVRAGHVDQNDIRFLEC 273 ++ + TE+ +G++ + P +R+++V+ H ++I+ + Sbjct: 177 --------TIDTLTSHTEIVSGITKGKKYKTDELIEVPYMRVANVQDEHFVLDEIKTISV 228 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD--ALP 331 +++E+ +++L GDLL T G + +G G + + Q +N ++ + + R R+ P Sbjct: 229 TKNEIKQYRLLAGDLLLTE-GGDPDKLGR-GAVWQNQIENCIHQNHIFRVRVNDKSRINP 286 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y+ SP ++ K T+G I+ +K +++PP++ Q VE++ Sbjct: 287 
DYLSALIGSPYGKSYFFRSAKQTTGIASINSTQLKKFPIVIPPIELQNRFATIVEKV--- 343 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++I+ +L + L ++ KAF+GEL Sbjct: 344 -ESIKTHYQQSLNNLETLYNALSQKAFKGELD 374 Score = 97.1 bits (240), Expect = 1e-18, Method: Composition-based stats. Identities = 39/207 (18%), Positives = 84/207 (40%), Gaps = 12/207 (5%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W I +++ T ++ G+T K+ + L + +P +R N+Q+ F ++ + + Sbjct: 175 NWTIDTLTSHTEIVSGITKGKKYKTDELIE--VPYMRVANVQDEHFVLDEIKTISVTKNE 232 Query: 69 -ESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLR--PEKLIFSGFIAH 124 + ++ D+++ G +G+ A Q+ C +R + I +++ Sbjct: 233 IKQYRLLAGDLLLTE-GGDPDKLGRGAVWQNQIENCIHQNHIFRVRVNDKSRINPDYLSA 291 Query: 125 FTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 S ++ + I +I I IPP+ Q A T++ +V+S K Sbjct: 292 LIGSPYGKSYFFRSAKQTTGIASINSTQLKKFPIVIPPIELQNRFA----TIVEKVESIK 347 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++Q L+ A+ A G+L Sbjct: 348 THYQQSLNNLETLYNALSQKAFKGELD 374 >UniRef50_B0QS41 Type I restriction enzyme EcoKI subunit R n=1 Tax=Haemophilus parasuis 29755 RepID=B0QS41_HAEPR Length = 397 Score = 186 bits (473), Expect = 1e-45, Method: Composition-based stats. 
Identities = 103/422 (24%), Positives = 175/422 (41%), Gaps = 34/422 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPEGW ++ V T I T K + L P+I ++ Sbjct: 6 LPEGWNKINITKVFTQIS-TTGKNIATKDCLSVGKYPVIDQG-----------AEYISGY 53 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSGFIAH 124 E++ I E+ VI +++ + + F+ GA + +P K I F + Sbjct: 54 FNDETKVIPVENKVIVFGDHTRNF------KLIDFDFIVGADGVKIFQPAKDIDPDFFYY 107 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S ++ + G + + P ++Q+ +A+K LL+QV K Sbjct: 108 QCLS------LNLPNKG---YHRHFRYLKECDFIYPSFSQQQKLAKKFTVLLSQVAEIKQ 158 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH-SVFKKLNFESILTELRNGLSSK- 242 R E+IP +LK +RQ+VL AVNG+L+ KWR + + I ++++G + K Sbjct: 159 RLEKIPALLKTYRQSVLARAVNGELSAKWREENGVSLDSWVYEKAQHICDKVQSGSTPKG 218 Query: 243 -PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 P E P L++ ++ ++ + E R D+L L V Sbjct: 219 NPFEQNGTIPFLKVYNIVNQELNFDYKPQFVTKEQHSQRSITLPNDVLMNIVGPPLGKVA 278 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 + + N+ L R ++ ++ + + +K GQ IS Sbjct: 279 IVT--NQYSEWNINQAITLFRCNP-RNLHYKFFYFVLREGRFIREIEHDLKGIVGQINIS 335 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + +V +P ++EQ I + VE+ +A+ +E QVN AL RVN +TQ+ILAK FRGE Sbjct: 336 LSQCRDMIVPVPTLEEQNYITQAVEKHLNFANQLEAQVNAALERVNLMTQAILAKGFRGE 395 Query: 422 LT 423 L Sbjct: 396 LI 397 Score = 69.0 bits (167), Expect = 3e-10, Method: Composition-based stats. 
Identities = 26/125 (20%), Positives = 52/125 (41%), Gaps = 10/125 (8%) Query: 306 LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 K + ++ D + + KD P++ S + N + + Sbjct: 77 FKLIDFDFIVGADGVKIFQPAKDIDPDFFYYQCLSLNLPNK----------GYHRHFRYL 126 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQ 425 K + P +Q ++ ++ L + I++++ A + QS+LA+A GEL+A+ Sbjct: 127 KECDFIYPSFSQQQKLAKKFTVLLSQVAEIKQRLEKIPALLKTYRQSVLARAVNGELSAK 186 Query: 426 WRAEN 430 WR EN Sbjct: 187 WREEN 191 >UniRef50_C7P6A9 Restriction modification system DNA specificity domain protein n=1 Tax=Methanocaldococcus fervens AG86 RepID=C7P6A9_METFA Length = 402 Score = 186 bits (471), Expect = 2e-45, Method: Composition-based stats. Identities = 77/433 (17%), Positives = 165/433 (38%), Gaps = 47/433 (10%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + + +G K + I ++ LP + A+ + G + + Sbjct: 3 ELPEGWKWVKLKEIIKTEKGK--KPKNLIKEKNNNALPYLTADYFRTGILK----QYSEE 56 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 N ++ + + P D+V+ + S + + ++ K + FI Sbjct: 57 N--EKLRIVKPGDLVLIWDGSKAGDIFISDIEGILAST----MVKLIIKNKEVHPKFIYF 110 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP------PLAEQKIIAEKLDTLLAQ 178 K Y ++ + GA I ++ F+ + IPIP L +QK I EK++ + + Sbjct: 111 VIKH--YFPILNKNTTGAGIPHVSKEVFNNLLIPIPFKDGKPDLEKQKQIVEKIEKIFNE 168 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 +D E+ K AVL E + + K + Sbjct: 169 IDKAIKLREKAINETKELFNAVLNKIF-------KEAEEGERWKWVKFENIVDFKMGKTP 221 Query: 239 LSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKL-QDGDLLFTRYNGS 296 S+ G + + I ++ +++ + E + E+ + K+ G LL + Sbjct: 222 KRSEKRYWENGVYHWVSIGDMQDKYINTTKEKISEEAFREVFKGKIVPKGTLLMSF---- 277 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 +G +L + + ++ + +I ++ L +Y+ S + + +K + Sbjct: 278 KLTIGRTAIL----NIDAVHNEAIISIYPKEEILRDYLYWVLQSIDYKKYINPAIKGHT- 332 Query: 357 QKGISGKDIKSQVVLLP------PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 ++ + 
+K+ ++ +P +++Q +I ++ L +E+ L L Sbjct: 333 ---LNKEILKNLLIPIPYKDNKPDIEKQKQIANYLDNLSEKIKQLEQLQEKQLNLFKELK 389 Query: 411 QSILAKAFRGELT 423 +SIL KAF GEL Sbjct: 390 ESILNKAFEGELI 402 >UniRef50_Q307C7 Type I RM system S subunit n=2 Tax=Arthrospira platensis RepID=Q307C7_SPIPL Length = 395 Score = 186 bits (471), Expect = 2e-45, Method: Composition-based stats. Identities = 73/426 (17%), Positives = 138/426 (32%), Gaps = 46/426 (10%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVPKNL 66 + W I ++ ++ LI T + + ++ I NGKF + L + + Sbjct: 2 KDWKIVSLNEISELITKGTTPTSVGFKFFDTGKVNFVKVETITDNGKFLPSKLAHIEMDC 61 Query: 67 VKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 ++ + DI+ ++ +G+ LP + L+ I F+ Sbjct: 62 HHSLKRSQLKSGDILFSI-AGALGRTAIVTSDILPANTNQALAIIRLKSSNAIHPEFVFR 120 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S + +I G N+ IP+PPL EQK I LD +D+ A Sbjct: 121 SLSSGMLIKQIKKSKGGVAQQNLSLTQIKNFKIPLPPLEEQKRIVAILDEAFEGIDAAIA 180 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG-LSSKP 243 ++ + + L S+ + + + +++ G L++ Sbjct: 181 NTQKNLANARELFDSYLQ------------------SLDAEKRYLGEIVDIKTGKLNANA 222 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 +P ++ + +L N VG Sbjct: 223 ATEDGQYPFFT----------------CSKEIYRISEYAFDCEAILLAGNNA----VGDF 262 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 + N +I L Y+ M+ + K + Sbjct: 263 NVKHYKGKFNAYQRTYVIAVSEASQVLYRYLYFQLLKSL---KMLKIQSVGANTKFLKLD 319 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 IK+ + LP +++Q ++V + +L + +E L + L QSIL KAF GELT Sbjct: 320 MIKNLQIALPDIEKQQKLVLVLNELESETQRLESIYQRKLEALKELKQSILQKAFTGELT 379 Query: 424 AQWRAE 429 E Sbjct: 380 GDTVKE 385 >UniRef50_A1WW67 Restriction modification system DNA specificity domain n=1 Tax=Halorhodospira halophila SL1 RepID=A1WW67_HALHL Length = 429 Score = 186 bits (471), Expect = 2e-45, Method: Composition-based stats. 
Identities = 69/434 (15%), Positives = 156/434 (35%), Gaps = 51/434 (11%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINY-----LKDDYLPLIRANNIQNGKFDTTD 58 G++PE W ++ + V L G + + + + +G Sbjct: 18 GEVPEHWSVSALKRVARLESGDAISSDHISEEGEYAVYGGNGIRGFSSGYTHDG------ 71 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 F P +I + G + F S V+ P + I Sbjct: 72 --FYP---------------LIGRQGA---LCGNVNYAKGRFWAS--EHAVVVWPGRQID 109 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ +S ++ + A + + + + +P+PP EQ+ IAE LD A+ Sbjct: 110 GFWLGELLRSMN----LNQYATSAAQPGLSVETIENLYVPVPPDEEQQKIAELLDHETAR 165 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN-FEPQHSVFKKLNFESILTELRN 237 +D+ +++ ++LK RQAV+ AV L + + ++ +R Sbjct: 166 IDALIEEQQRLIELLKEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWDVVKFVRC 225 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL-----ECSESELNRHKLQDGDLLFTR 292 ++ P + V H++ R + E +E ++ GD+++++ Sbjct: 226 AKIAEGQVDPKQEPYRSMMLVAPNHIESGTGRLMARETAEEQGAESGKYYCYAGDVIYSK 285 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 SL V +++ L + R +Y+ S S + + Sbjct: 286 IRPSLRKACVA-------YEDCLCSADMYPLRAQSGVYGDYLRWTILSESF-STLAFLES 337 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 ++ + I+ + +PP +EQ +I R +E+ A D + ++ + + + + Sbjct: 338 ERVAMPKVNRESIEEIRIPMPPPEEQLQISRTLEKETARIDALMEEAESGIQLLQERRSA 397 Query: 413 ILAKAFRGELTAQW 426 +++ A G++ + Sbjct: 398 LISAAVTGKIDVRD 411 Score = 77.4 bits (189), Expect = 1e-12, Method: Composition-based stats. 
Identities = 21/107 (19%), Positives = 49/107 (45%), Gaps = 7/107 (6%) Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 F+ R+ +N T++ Q G+S + I++ V +PP +EQ +I ++ A D + Sbjct: 111 FWLGELLRSMNLNQYATSAAQPGLSVETIENLYVPVPPDEEQQKIAELLDHETARIDALI 170 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ + + Q++++ A + +PD+ ++ L Sbjct: 171 EEQQRLIELLKEKRQAVISHAVT-------KGLDPDVPMKDSGVEWL 210

Searching..................................................done

Results from round 3

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

Sequences used in model and found again:

UniRef50_P05719 Type-1 restriction enzyme EcoKI specificity prot...  488  e-136
UniRef50_A1AJL9 HsdS, type I site-specific deoxyribonuclease n=2...  412  e-113
UniRef50_B5BKY5 Subunit S of type I restriction-modification sys...  377  e-103
UniRef50_B7LQL4 Specificity determinant for hsdM and hsdR (Modul...  365  2e-99
UniRef50_A9N788 Putative uncharacterized protein n=3 Tax=Salmone...  359  2e-97
UniRef50_C5SE02 Restriction modification system DNA specificity ...  346  9e-94
UniRef50_C9RY89 Restriction modification system DNA specificity ...  345  1e-93
UniRef50_A3EKX4 Type I restriction modification DNA specificity ...  340  7e-92
UniRef50_P06990 Type-1 restriction enzyme EcoBI specificity prot...  338  3e-91
UniRef50_B3YJG5 Type I restriction enzyme EcoKI specificity prot...  334  3e-90
UniRef50_C9YAL6 Putative uncharacterized protein n=1 Tax=Curviba...  332  2e-89
UniRef50_Q1MKB2 Putative type I restriction enzyme specificity s...  330  5e-89
UniRef50_A6UXD7 Type I restriction-modification system, S subuni...  326  1e-87
UniRef50_C3Q383 Putative uncharacterized protein n=1 Tax=Bactero...  326  1e-87
UniRef50_C6CR26 Restriction modification system DNA specificity ...  325  1e-87
UniRef50_UPI0001C15DDF Restriction modification system DNA speci...  325  2e-87
UniRef50_A8ZTW4 Restriction modification system DNA specificity ...  323  1e-86
UniRef50_B8H0M3 Type I restriction-modification system specifici...  321  3e-86
UniRef50_C6RQJ9 Restriction endonuclease S subunit n=2 Tax=Acine...  320  8e-86
UniRef50_Q210J8 Type I restriction enzyme StySPI specificity pro...  320  9e-86
UniRef50_C9NQJ7 HsdS type I site-specific deoxyribonuclease n=1 ...  320  1e-85
UniRef50_A1BGI9 Restriction modification system DNA specificity ...  319  2e-85
UniRef50_A3PYN5 Restriction modification system DNA specificity ...  318  2e-85
UniRef50_Q1VAF2 Hypothetical type I restriction-modification sys...  316  1e-84
UniRef50_C7QRY1 Restriction modification system DNA specificity ...  316  1e-84
UniRef50_A4VH87 Type I restriction-modification system, S subuni...  315  2e-84
UniRef50_UPI0001695152 type I restriction enzyme specificity pro...  315  2e-84
UniRef50_P06187 Type-1 restriction enzyme StySJI specificity pro...  315  3e-84
UniRef50_A6CKF2 Putative type I restriction enzyme specificity p...  313  8e-84
UniRef50_Q5QX28 Restriction endonuclease S subunit n=1 Tax=Idiom...  312  1e-83
UniRef50_Q466N9 Type I restriction-modification system specifici...  312  2e-83
UniRef50_B6R0S6 Restriction modification system DNA specificity ...  310  6e-83
UniRef50_C0QCH4 HsdS2 n=1 Tax=Desulfobacterium autotrophicum HRM...  310  1e-82
UniRef50_A3PKU6 Restriction modification system DNA specificity ...  309  2e-82
UniRef50_UPI0001855288 conserved hypothetical protein n=1 Tax=Fr...  308  2e-82
UniRef50_B7K558 Restriction modification system DNA specificity ...  308  2e-82
UniRef50_A7IEA1 Restriction modification system DNA specificity ...  308  4e-82
UniRef50_A6C679 Type I restriction-modification system, S subuni...  307  6e-82
UniRef50_C6MBL0 Restriction modification system DNA specificity ...  305  2e-81
UniRef50_C3RBV6 Type I restriction-modification system n=3 Tax=B...  305  3e-81
UniRef50_D1J921 Putative type I restriction enzyme, DNA specific...  304  4e-81
UniRef50_Q3J7Q5 Restriction endonuclease S subunits-like n=2 Tax...  304  6e-81
UniRef50_Q0EXK2 HsdS protein n=1 Tax=Mariprofundus ferrooxydans ...  303  8e-81
UniRef50_UPI0001C36A8C HsdS1 n=1 Tax=Clostridium hathewayi DSM 1...  303  1e-80
UniRef50_Q4FUM9 Possible type I restriction-modification system,...  301  3e-80
UniRef50_D1UP80 Restriction modification system DNA specificity ...  301  4e-80
UniRef50_A1K1C0 Type I site-specific deoxyribonuclease n=3 Tax=B...  301  4e-80
UniRef50_C6J5M6 Putative uncharacterized protein n=1 Tax=Paeniba...  300  5e-80
UniRef50_A4FXL8 Restriction modification system DNA specificity ...  300  6e-80
UniRef50_A1ZUE4 Type I restriction-modification system specifici...  298  4e-79
UniRef50_B8GLU3 Type I restriction-modification system, S subuni...  297  7e-79
UniRef50_A3YSG6 Putative type I restriction enzyme specificity p...  296  1e-78
UniRef50_B2V7V7 Restriction modification system DNA specificity ...  296  1e-78
UniRef50_C1D7R6 Type I restriction-modification system, S subuni...  295  2e-78
UniRef50_Q30XD2 Type I restriction-modification system, S subuni...  295  3e-78
UniRef50_A6TLK6 Restriction modification system DNA specificity ...  295  3e-78
UniRef50_A1TWL9 Restriction modification system DNA specificity ...  294  4e-78
UniRef50_C0VG50 Type I restriction modification enzyme protein S...  293  9e-78
UniRef50_A6EUA9 Type I restriction-modification system, S subuni...  293  1e-77
UniRef50_B3R3C2 Type I restriction-modification methylase S subu...  292  1e-77
UniRef50_C7RQC3 Type I restriction-modification system specifici...  290  6e-77
UniRef50_A6DQ81 Putative restriction-modification system specifi...  290  6e-77
UniRef50_Q8PTL2 Type I restriction-modification system specifici...  290  8e-77
UniRef50_P06991 Type-1 restriction enzyme EcoDI specificity prot...  290  1e-76
UniRef50_Q4HFD9 HsdS n=3 Tax=Campylobacterales RepID=Q4HFD9_CAMCO    289  1e-76
UniRef50_UPI0001C42656 hypothetical protein BpOF4_03730 n=1 Tax=...  288  3e-76
UniRef50_C5RH89 Restriction modification system DNA specificity ... 
288 3e-76 UniRef50_B0TZ98 Type I restriction-modification system, subunit ... 287 5e-76 UniRef50_Q21ZK2 Restriction modification system DNA specificity ... 287 5e-76 UniRef50_Q4C702 Restriction modification system DNA specificity ... 287 8e-76 UniRef50_Q8RJG0 HsdS n=12 Tax=Campylobacter jejuni RepID=Q8RJG0_... 286 1e-75 UniRef50_C9Q5S0 Possible type I restriction-modification system ... 286 1e-75 UniRef50_A5GE25 Restriction endonuclease S subunits-like protein... 286 1e-75 UniRef50_UPI00016B0992 probable type I restriction-modification ... 286 1e-75 UniRef50_D1YNY9 Type I restriction modification DNA specificity ... 285 2e-75 UniRef50_A1RES4 Restriction modification system DNA specificity ... 284 4e-75 UniRef50_A3J6X3 Type I restriction-modification system, S subuni... 284 6e-75 UniRef50_C6A4W8 Putative type I specificity subunit HsdS n=1 Tax... 283 7e-75 UniRef50_C5C353 Restriction modification system DNA specificity ... 283 1e-74 UniRef50_C5SDH7 Putative uncharacterized protein n=1 Tax=Allochr... 283 1e-74 UniRef50_B5VW93 Restriction modification system DNA specificity ... 283 1e-74 UniRef50_A1TSH8 Restriction modification system DNA specificity ... 282 2e-74 UniRef50_C6IKX2 Type I restriction-modification system n=2 Tax=B... 282 2e-74 UniRef50_Q26D97 Putative type I site-speicific deoxyribonuclease... 281 3e-74 UniRef50_C2QHW5 Putative uncharacterized protein n=2 Tax=Bacillu... 281 5e-74 UniRef50_B4VK59 Putative uncharacterized protein n=1 Tax=Microco... 280 6e-74 UniRef50_UPI0001973978 type I restriction-modification system, S... 280 6e-74 UniRef50_A6Y5S9 Restriction endonuclease S subunit n=1 Tax=Vibri... 280 7e-74 UniRef50_Q31PC5 Type I restriction-modification n=2 Tax=Synechoc... 280 1e-73 UniRef50_Q3J746 Restriction modification system DNA specificity ... 280 1e-73 UniRef50_B0JHV8 Restriction modification system DNA specificity ... 279 2e-73 UniRef50_Q4HNY2 Type I restriction-modification system specifici... 
278 3e-73 UniRef50_B5ECU4 Restriction modification system DNA specificity ... 278 4e-73 UniRef50_Q8GN10 Putative type I specificity subunit HsdS n=3 Tax... 277 5e-73 UniRef50_A1WW67 Restriction modification system DNA specificity ... 277 5e-73 UniRef50_A1VBQ9 Restriction modification system DNA specificity ... 277 6e-73 UniRef50_A5G3B9 Restriction modification system DNA specificity ... 276 1e-72 UniRef50_Q73D72 Type I restriction-modification enzyme, S subuni... 276 1e-72 UniRef50_B1LRG3 Type I restriction modification DNA specificity ... 275 2e-72 UniRef50_D0KMA1 Restriction modification system DNA specificity ... 275 3e-72 UniRef50_C0EPF1 Putative uncharacterized protein n=1 Tax=Neisser... 275 3e-72 UniRef50_B2A6M8 Restriction modification system DNA specificity ... 274 4e-72 UniRef50_A0ZMI3 Putative uncharacterized protein n=1 Tax=Nodular... 274 4e-72 UniRef50_D0C390 Type I restriction-modification system specifici... 274 6e-72 UniRef50_C5BH70 Restriction modification system DNA specificity ... 273 7e-72 UniRef50_A0L1U2 Restriction modification system DNA specificity ... 273 9e-72 UniRef50_Q64AS2 Restriction endonuclease S subunits n=1 Tax=uncu... 273 1e-71 UniRef50_B7R237 Type I restriction modification system, subunit ... 272 2e-71 UniRef50_C6WNJ9 Restriction modification system DNA specificity ... 272 2e-71 UniRef50_B7JRE7 Restriction modification system DNA specificity ... 271 3e-71 UniRef50_C8NC88 Type I restriction-modification system specifici... 271 4e-71 UniRef50_A3JE98 Type I restriction-modification system, S subuni... 271 4e-71 UniRef50_B3G223 Type I restriction modification DNA specificity ... 271 5e-71 UniRef50_Q8EJT0 Type I restriction-modification system, S subuni... 271 5e-71 UniRef50_A1TX70 Restriction modification system DNA specificity ... 270 1e-70 UniRef50_A7JZU8 Type I restriction-modification system specifici... 269 1e-70 UniRef50_Q0RV87 Type I restriction-modification system specifici... 
269 2e-70 UniRef50_A6E2R5 Restriction endonuclease S subunits-like protein... 268 3e-70 UniRef50_B0VPS8 Specificity determinant for hsdM and hsdR n=1 Ta... 268 3e-70 UniRef50_C2I227 Restriction modification system DNA specificity ... 268 4e-70 UniRef50_A0Q725 Type I restriction-modification system, subunit ... 268 4e-70 UniRef50_C9NRR1 Type I restriction-modification system specifici... 267 6e-70 UniRef50_D2LA90 Restriction modification system DNA specificity ... 267 7e-70 UniRef50_Q2J5T0 Restriction modification system DNA specificity ... 266 9e-70 UniRef50_C3DG13 Putative uncharacterized protein n=1 Tax=Bacillu... 266 1e-69 UniRef50_Q5KVU6 Type I restriction-modification system specifici... 266 1e-69 UniRef50_A8V066 Type I restriction-modification enzyme, S subuni... 266 1e-69 UniRef50_A3JH04 Specificity determinant for hsdM and hsdR n=1 Ta... 265 2e-69 UniRef50_Q112D6 Restriction modification system DNA specificity ... 265 2e-69 UniRef50_C6Q0B1 Restriction modification system DNA specificity ... 265 2e-69 UniRef50_A3UV36 Type I restriction enzyme specificity protein n=... 265 2e-69 UniRef50_Q57594 Type-1 restriction enzyme MjaXIP specificity pro... 265 2e-69 UniRef50_D0J4L5 Putative uncharacterized protein n=1 Tax=Comamon... 265 3e-69 UniRef50_Q2B8V0 Type I restriction modification system, subunit ... 265 3e-69 UniRef50_B1XQR8 Type 1 restriction-modification system specifici... 265 3e-69 UniRef50_A6W078 Restriction modification system DNA specificity ... 265 3e-69 UniRef50_C3NN82 Restriction modification system DNA specificity ... 265 3e-69 UniRef50_B8GGK0 Restriction modification system DNA specificity ... 264 4e-69 UniRef50_Q307D8 Type I RM system S subunit n=1 Tax=Arthrospira p... 264 5e-69 UniRef50_A4CWB5 Type I restriction-modification system, S subuni... 264 6e-69 UniRef50_C4LDK7 Restriction modification system DNA specificity ... 264 6e-69 UniRef50_Q2P0A3 Specificity determinant for hsdM and hsdR n=2 Ta... 
264 6e-69 UniRef50_A4T8B4 Restriction modification system DNA specificity ... 264 6e-69 UniRef50_C5VLJ8 HsdS protein n=1 Tax=Prevotella melaninogenica A... 263 9e-69 UniRef50_A1ZTI8 Type I restriction enzyme StySJI specificity pro... 263 9e-69 UniRef50_B0CE92 Type I restriction-modification enzyme S subunit... 263 9e-69 UniRef50_B0PEE2 Putative uncharacterized protein n=1 Tax=Anaerot... 263 9e-69 UniRef50_A3ZEA3 Type I restriction modification DNA specificity ... 263 1e-68 UniRef50_C1PCQ5 Restriction modification system DNA specificity ... 262 2e-68 UniRef50_Q6GD64 Putative type I restriction enzyme specificity p... 262 2e-68 UniRef50_A8YFX5 HsdS protein n=2 Tax=Microcystis aeruginosa PCC ... 262 2e-68 UniRef50_D0BWI7 Predicted protein n=1 Tax=Acinetobacter sp. RUH2... 261 3e-68 UniRef50_A4FZ34 Restriction modification system DNA specificity ... 261 3e-68 UniRef50_A4XMW3 Restriction modification system DNA specificity ... 261 3e-68 UniRef50_A2TQ01 Possible type I restriction-modification system,... 261 3e-68 UniRef50_A4U327 Type I restriction-modification system, S subuni... 261 4e-68 UniRef50_A7N438 Putative uncharacterized protein n=1 Tax=Vibrio ... 261 4e-68 UniRef50_UPI0001BC364B restriction modification system DNA speci... 261 4e-68 UniRef50_Q7UE18 Restriction modification system S chain homolog ... 261 4e-68 UniRef50_Q1VR15 Type I restriction-modification enzyme 1, S subu... 261 4e-68 UniRef50_A7VYZ3 Putative uncharacterized protein n=1 Tax=Clostri... 261 5e-68 UniRef50_A3XVN0 Type I restriction-modification system, S subuni... 261 5e-68 UniRef50_C1SJS8 Restriction endonuclease S subunit n=1 Tax=Denit... 260 6e-68 UniRef50_C9KLK0 Putative phosphoribosylformylglycinamidine synth... 260 1e-67 UniRef50_C5DB08 Restriction modification system DNA specificity ... 259 1e-67 UniRef50_C0N6F0 Type I restriction modification DNA specificity ... 259 2e-67 UniRef50_UPI0001B4DA32 restriction endonuclease S subunits-like ... 
259 2e-67 UniRef50_Q0W5N3 Type I restriction modification system, specific... 258 3e-67 UniRef50_C6JA10 Putative uncharacterized protein n=1 Tax=Ruminoc... 258 3e-67 UniRef50_B5IRS1 Type I restriction modification DNA specificity ... 258 4e-67 UniRef50_D2QTT7 Restriction modification system DNA specificity ... 258 4e-67 UniRef50_Q6F778 Putative type I restriction-modification system ... 257 5e-67 UniRef50_B5VW68 Restriction modification system DNA specificity ... 257 7e-67 UniRef50_A1TXP8 Restriction modification system DNA specificity ... 257 7e-67 UniRef50_Q07ZW7 Restriction modification system DNA specificity ... 256 9e-67 UniRef50_B0RQ64 Type I site-specific DNA methyltransferase speci... 256 1e-66 UniRef50_C5TIE5 Restriction modification system DNA specificity ... 256 1e-66 UniRef50_A7JK69 Type I restriction-modification system n=1 Tax=F... 255 2e-66 UniRef50_Q3IEL0 Putative type I restriction-modification system,... 255 2e-66 UniRef50_A1UJN5 Restriction endonuclease S subunits-like protein... 255 2e-66 UniRef50_A9A374 Restriction modification system DNA specificity ... 255 3e-66 UniRef50_Q8KLM8 Restriction-modification enzyme type I S subunit... 255 3e-66 UniRef50_Q0A7Q2 Restriction modification system DNA specificity ... 255 3e-66 UniRef50_Q63WE0 Putative type I restriction enzyme specificity p... 255 3e-66 UniRef50_A6TIP1 Putative restriction endonuclease S subunit n=2 ... 254 4e-66 UniRef50_C1ZA47 Restriction endonuclease S subunit n=1 Tax=Planc... 254 4e-66 UniRef50_Q310I9 Type I restriction enzyme, S subunit n=2 Tax=Del... 254 6e-66 UniRef50_A2TPX3 RmeS n=1 Tax=Dokdonia donghaensis MED134 RepID=A... 253 9e-66 UniRef50_UPI000038E018 type I restriction-modification enzyme, S... 252 2e-65 UniRef50_A8TH56 Restriction modification system DNA specificity ... 252 2e-65 UniRef50_B4S4B7 Restriction modification system DNA specificity ... 252 2e-65 UniRef50_D2TPV5 Putative Type I restriction-modification system,... 
252 3e-65 UniRef50_A3US47 Type I site-specific deoxyribonuclease n=1 Tax=V... 251 3e-65 UniRef50_B0RYC3 Type I site-specific deoxyribonuclease (Specific... 251 3e-65 UniRef50_D2MXN5 Putative uncharacterized protein n=1 Tax=Campylo... 251 4e-65 UniRef50_B9KF72 Type I restriction-modification system, S subuni... 251 6e-65 UniRef50_D2EQS4 Putative type I restriction-modification system,... 250 6e-65 UniRef50_A3SCN8 Restriction endonuclease S subunit-like protein ... 250 9e-65 UniRef50_B0K6N9 Restriction modification system DNA specificity ... 250 9e-65 UniRef50_Q8YTM8 Type I restriction-modification enzyme S subunit... 250 1e-64 UniRef50_B2K7C3 Restriction modification system DNA specificity ... 250 1e-64 UniRef50_B9ZS45 Restriction modification system DNA specificity ... 249 1e-64 UniRef50_B0A8Q7 Putative uncharacterized protein n=1 Tax=Clostri... 249 2e-64 UniRef50_A0YWS0 Putative uncharacterized protein n=1 Tax=Lyngbya... 249 2e-64 UniRef50_B4RYU8 Type I site-specific deoxyribonuclease n=1 Tax=A... 249 2e-64 UniRef50_Q6LTT0 Hypothetical type I restriction-modification sys... 248 3e-64 UniRef50_Q8TN78 Type I restriction modification enzyme protein S... 248 3e-64 UniRef50_B3PQK6 Probable type I restriction-modification system ... 248 4e-64 UniRef50_C2CSZ9 Type I restriction modification DNA specificity ... 248 4e-64 UniRef50_B8E4I3 Restriction modification system DNA specificity ... 247 5e-64 UniRef50_B4VXC6 Type I restriction modification DNA specificity ... 247 7e-64 UniRef50_Q0RKJ6 Type I restriction modification enzyme protein S... 246 1e-63 UniRef50_Q1K3D0 Restriction modification system DNA specificity ... 246 1e-63 UniRef50_Q1GLF5 Type I restriction-modification system; S subuni... 246 1e-63 UniRef50_A5UR98 Restriction modification system DNA specificity ... 246 1e-63 UniRef50_Q8PSD7 Type I restriction-modification system specifici... 246 2e-63 UniRef50_D0S8M5 Type I restriction-modification system protein n... 
246 2e-63 UniRef50_B4B315 Restriction modification system DNA specificity ... 245 4e-63 UniRef50_Q6MH62 Type I restriction-modification system, S subuni... 244 5e-63 UniRef50_Q11QY3 Probable type I restriction-modification system ... 244 5e-63 UniRef50_B8D1X6 Restriction modification system DNA specificity ... 244 6e-63 UniRef50_A4VG57 Type I restriction-modification system, S subuni... 244 6e-63 UniRef50_UPI000178969C restriction modification system DNA speci... 244 6e-63 UniRef50_C4ZFR7 Type I restriction-modification system specifici... 243 8e-63 UniRef50_Q2JGK8 Type I restriction-modification system specifici... 243 9e-63 UniRef50_A7I739 Restriction modification system DNA specificity ... 243 1e-62 UniRef50_A6LY63 Restriction modification system DNA specificity ... 243 1e-62 UniRef50_A5UFR1 Type I restriction modification DNA specificity ... 243 1e-62 UniRef50_A3J917 Type I restriction-modification n=1 Tax=Marinoba... 242 2e-62 UniRef50_Q6ZE86 Type I restriction-modification system S subunit... 242 2e-62 UniRef50_C2CF25 Restriction modification system DNA specificity ... 242 2e-62 UniRef50_D1XRZ5 Restriction modification system DNA specificity ... 242 2e-62 UniRef50_C7IKN2 Restriction modification system DNA specificity ... 242 2e-62 UniRef50_B9M293 Restriction endonuclease S subunit-like protein ... 241 3e-62 UniRef50_A1VAP4 Restriction modification system DNA specificity ... 241 3e-62 UniRef50_A6L5S4 Type I restriction-modification system S subunit... 241 4e-62 UniRef50_B3ENS6 Putative type I restriction-modification system ... 241 4e-62 UniRef50_C6YVW3 Predicted protein n=2 Tax=Francisella philomirag... 241 5e-62 UniRef50_Q58615 Uncharacterized protein MJ1218 n=1 Tax=Methanoca... 241 5e-62 UniRef50_Q7UE33 Type I restriction modification enzyme, S subuni... 241 6e-62 UniRef50_C0VJ61 Restriction modification system DNA specificity ... 240 1e-61 UniRef50_C6CZ61 Restriction modification system DNA specificity ... 
240 1e-61 UniRef50_C9MBB6 Restriction endonuclease S n=5 Tax=Haemophilus i... 239 1e-61 UniRef50_C4KBJ9 Restriction modification system DNA specificity ... 239 2e-61 UniRef50_B1GZ39 Type I restriction-modification system substrate... 239 2e-61 UniRef50_D0WYM6 Putative uncharacterized protein n=1 Tax=Vibrio ... 239 2e-61 UniRef50_A6CAE7 Type I restriction enzyme specificity protein n=... 239 2e-61 UniRef50_Q30S09 Restriction modification system DNA specificity ... 238 3e-61 Sequences not found previously or not previously below threshold: UniRef50_Q12YI6 Restriction modification system DNA specificity ... 257 6e-67 UniRef50_B5JV80 Restriction modification system DNA specificity ... 239 2e-61 >UniRef50_P05719 Type-1 restriction enzyme EcoKI specificity protein n=5 Tax=Enterobacteriaceae RepID=T1SK_ECOLI Length = 464 Score = 488 bits (1257), Expect = e-136, Method: Composition-based stats. Identities = 464/464 (100%), Positives = 464/464 (100%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV Sbjct: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG Sbjct: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD Sbjct: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS Sbjct: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV Sbjct: 241 
SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI Sbjct: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG Sbjct: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 >UniRef50_A1AJL9 HsdS, type I site-specific deoxyribonuclease n=2 Tax=Escherichia coli RepID=A1AJL9_ECOK1 Length = 455 Score = 412 bits (1060), Expect = e-113, Method: Composition-based stats. Identities = 206/468 (44%), Positives = 266/468 (56%), Gaps = 17/468 (3%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINY-LKDDYLPLIRANNI---QNGKFDT 56 MSAGKLPEGW + + +I G T K A N+ + + + ++ + Sbjct: 1 MSAGKLPEGWEQIEIGDIADVISGGTPKSGVAENFAPSGEGVAWLTPADLSGYKEKYISH 60 Query: 57 TDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 S K+ P+ ++ S V +A E + Sbjct: 61 GARDLTTLGYSSCSAKLMPKGTILFSSRAPIGYVAIAA-----NEIATNQGFKSFAFPSD 115 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 IF + +F + R+ + G I +S + + P AEQKIIAEKLDTLL Sbjct: 116 IFPDYAYYFLR--NIRHIAEEMGTGTTFKEISGSSAKTLPFVLVPFAEQKIIAEKLDTLL 173 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 AQVDSTKAR EQIPQILKRFRQAVLG AV GKLTE WR S +++ + + Sbjct: 174 AQVDSTKARLEQIPQILKRFRQAVLGAAVRGKLTEDWR-DNSSLSGWREGKLGEFIKKPS 232 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 G SSK N+ G+ P+LR+ +++ G +D D+ + + E+ ++KL+ D+LF R N S Sbjct: 233 YGTSSKSNKEGL-IPVLRMGNLQGGKLDWTDLVYTSDTI-EIEKYKLEYNDVLFNRTN-S 289 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 E VG + K Q +Y LIR + D P+Y+ +S R + Sbjct: 
290 PELVGKTAIYKSEQP--AIYAGYLIRVQCLPDLNPDYLNYHLNSILGRQYCYSVKSDGVS 347 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q I+ + + + + +PP+ EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK Sbjct: 348 QSNINAQKLIAYPITVPPLPEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 407 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 AFRGELTAQWRAENP+LISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 408 AFRGELTAQWRAENPELISGENSAAALLEKIKAERAASGGKKASRKKS 455 >UniRef50_B5BKY5 Subunit S of type I restriction-modification system n=7 Tax=Salmonella enterica subsp. enterica RepID=B5BKY5_SALPK Length = 462 Score = 377 bits (969), Expect = e-103, Method: Composition-based stats. Identities = 198/477 (41%), Positives = 266/477 (55%), Gaps = 28/477 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 MS GKLPEGWV +S + + G T K + +R +I G D + + Sbjct: 1 MSGGKLPEGWVTTHLSEICSKPQYGYTTKSSSM------GDVKFLRTTDITKGAVDWSSV 54 Query: 60 VFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRPEKLI 117 + ++ DIVI+ VG S + P + F ++ +P Sbjct: 55 PYCMDAPEDVSKYQLQDRDIVISR----AGSVGFSFLVQNPPSQVVFASYLIRFKPVNYF 110 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ F +SS Y N++S +SAG + N+ + +PIPP+AEQKIIAEKLDTLLA Sbjct: 111 SEYYLKRFLESSDYWNQLSLMSAGNAVQNVNAQKLSTLTVPIPPIAEQKIIAEKLDTLLA 170 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKK---------LN 227 QVDSTKAR EQIPQILKRFRQAVL AV+G L RN P S ++ Sbjct: 171 QVDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQWPDLPSTWSVHK 230 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + ++ + K G L +VR D +++ + S+ E L+ GD Sbjct: 231 YSELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGD 290 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L G C + + Q +++ L RAR+ +PE++ + S N Sbjct: 291 VLICEGGEP----GRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDS-NNIS 345 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++ + T + K ++GK + + + +PP++EQ 
EIVRRVEQLFA+ADTIEKQVNNAL RVN Sbjct: 346 LSQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAWADTIEKQVNNALNRVN 405 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 +LTQSILAKAFRGELTAQWRAENP LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 406 SLTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 462 >UniRef50_B7LQL4 Specificity determinant for hsdM and hsdR (Modular protein) n=2 Tax=Escherichia RepID=B7LQL4_ESCF3 Length = 502 Score = 365 bits (938), Expect = 2e-99, Method: Composition-based stats. Identities = 197/517 (38%), Positives = 261/517 (50%), Gaps = 68/517 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWV + V + G T + + + +P I+ ++ Sbjct: 1 MSAGKLPEGWVETNLQNVASWGSGGTPSRNH--DEYYNGNIPWIKTGDLGPKIITNASEY 58 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS- 119 + S K P+ V G + +GK++ + + C V P + I S Sbjct: 59 ITDAGVQNSSAKFFPKGSVAIAMYG--ATIGKTSILGID--ATTNQACAVGTPLEGITST 114 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ +F + +N G NI I +PPLAEQKII EKLDTLLAQV Sbjct: 115 LFLYYFLLNE--KNAFIKKGKGGAQPNISQTVIKEHIIYLPPLAEQKIITEKLDTLLAQV 172 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN------------------------ 215 DSTKAR EQIPQILKRFRQAVL AVNGKLTE WR+ Sbjct: 173 DSTKARLEQIPQILKRFRQAVLERAVNGKLTECWRDCVGELTSAEEIITEIKKYRKASLS 232 Query: 216 -------------------------FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 + F + ++ + + G Sbjct: 233 TEGSSASTESKRQIAKIEKHCFKVPKINLPKGWVWTTFLQSMEKVVDCHNKTAPYVDQGI 292 Query: 251 PILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 ++R +R G + ++ ++++ R + GD++FTR +G G++ + Sbjct: 293 HLIRTPDIRNGVISLDNTKYIDNDTYLYWSKRCPPRSGDIIFTREAP----MGEAGIVPE 348 Query: 309 LQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 + +++ R + +Y+ + S S + M++ +G K + D++S Sbjct: 349 NTI--ICMGQRMMLLRPIPEYIHNKYVLLNILSSSFQTRMISQAI-GTGVKHLRVADVES 405 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 LPP++EQ 
EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR Sbjct: 406 LTYPLPPIEEQHEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWR 465 Query: 428 AENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 AENPDLISGENSAAALLEKIKAERAASGGKKASRKKS Sbjct: 466 AENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 502 >UniRef50_A9N788 Putative uncharacterized protein n=3 Tax=Salmonella enterica subsp. enterica RepID=A9N788_SALPB Length = 467 Score = 359 bits (921), Expect = 2e-97, Method: Composition-based stats. Identities = 184/476 (38%), Positives = 254/476 (53%), Gaps = 21/476 (4%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPE WV + + + G K +A+ ++ P IR + +NG + + + Sbjct: 1 MSGGKLPEEWVKTTIGVICEVKGGKRLPKGKALLNTATEH-PYIRVTDFENGSVNLSTIK 59 Query: 61 FVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 ++ + + IS D + +G+ ++G+ Q + A Sbjct: 60 YLDSDTYSAISNYTISKND-LYISIAGTIGLIGEIPEQLDNANLTENAAKLCFIL--GTD 116 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ H S+ + + + + P P+ EQKIIAEKLDTLLAQ Sbjct: 117 KKYLKHVLSSNKTIEQFDDKTTSSGQPKLALFRIRDCEFPYAPINEQKIIAEKLDTLLAQ 176 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKK---------LNF 228 VDSTKAR EQIPQILKRFRQAVL AV+G L RN P S ++ + Sbjct: 177 VDSTKARLEQIPQILKRFRQAVLAAAVSGLLIGSNKRNHHPLCSEWQWPDLPSTWSVHKY 236 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 ++ + K G L +VR D +++ + S+ E L+ GD+ Sbjct: 237 SELVDSRLGKMLDKAKNFGSATKYLGNINVRWFSFDLENLQDILISDIERRELSLKLGDV 296 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 L G C + + Q +++ L RAR+ +PE++ + S N + Sbjct: 297 LICEGGEP----GRCAIWSEPQDIPVIFQKALHRARVKDKIIPEWLVYNLKNDS-NNISL 351 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + + T + K ++GK + + + +PP++EQ EIVRRVEQLFAYADTIEKQVNNAL RVN+ Sbjct: 352 SQLFTGTTIKHLTGKALANYPIRVPPLEEQHEIVRRVEQLFAYADTIEKQVNNALTRVNS 411 Query: 409 LTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 LTQSILAKAFRGELTAQWRAENP 
LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 412 LTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEKIKAERAASGGKKTSRKKA 467 >UniRef50_C5SE02 Restriction modification system DNA specificity domain protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SE02_CHRVI Length = 448 Score = 346 bits (889), Expect = 9e-94, Method: Composition-based stats. Identities = 114/453 (25%), Positives = 201/453 (44%), Gaps = 31/453 (6%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W P+ V ++ G +K + N + P+IR ++ +G T +P Sbjct: 25 WERVPLGDVCDILNGFPFKSQHFNN---SEGAPVIRIRDVTSGFCKTFYSGDIPVG---- 77 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAHFTKS 128 + P D+V+ M + S L P + + F+++ Sbjct: 78 -YWVEPFDMVVGMDGDFNCRLWSS------ERSLLNQRVCKLTPHEDFLDKKFLSYVL-- 128 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 Y I+ + + ++ + I P+PPLAEQ+ I KLD L + + Sbjct: 129 PAYLRLINDHTHSITVKHLSSKTIAKIPFPLPPLAEQRRIVAKLDRLFERTRRAREELSH 188 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 IP++++ +++A+L A G LT+ WR K++ + +L G S+K ++SG Sbjct: 189 IPRLIENYKKAILVAAFRGDLTKDWREKRGLPMP-KEVKLGEVAKKLSYGTSAKSSKSGD 247 Query: 249 GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKK 308 P+LR+ +++ +D D+ + E+ ++ L GD+LF R N S E VG + K Sbjct: 248 -VPVLRMGNIQNMRIDWKDLVYTS-DVEEIEKYSLNAGDVLFNRTN-SPELVGKTAIYKG 304 Query: 309 LQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQ 368 + +Y LI+ + +PEY+ +SP R+ Q I+ K + Sbjct: 305 ERP--AIYAGYLIKIKCGNRLVPEYLNYCLNSPLGRSYCWRVKSDGVSQSNINAKKLADF 362 Query: 369 VVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRA 428 LLP EQ EIV R+E+ + D++ + A +++L Q+ LAKAFRGEL Q + Sbjct: 363 SFLLPTHDEQKEIVFRIEKTLDWLDSLVIEERQASHLLDHLDQANLAKAFRGELVPQDPS 422 Query: 429 ENPDLISGENSAAALLEKIKAERAASGGKKASR 461 + P A+ LLE+I A+R + ++ Sbjct: 423 DEP--------ASVLLEQIYADREKQVKIRKNK 447 >UniRef50_C9RY89 Restriction modification system DNA specificity domain protein n=2 Tax=Geobacillus RepID=C9RY89_GEOSY Length = 477 Score = 
345 bits (887), Expect = 1e-93, Method: Composition-based stats. Identities = 103/450 (22%), Positives = 189/450 (42%), Gaps = 41/450 (9%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV V G T +++ Y +P I+ + +G ++ + Sbjct: 26 EVPGNWVWVRSGHVAKWGSGGTPSRKRLEYY--GGDIPWIKTGELNDGIITGSEETITEE 83 Query: 65 NLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L K S KI P IVIAM + +G + + C V +P + + S ++ Sbjct: 84 GLQKSSAKIFPKGSIVIAMYGATIGRLGILGI-----DAATNQACAVGQPYEFLDSKYMF 138 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ + R+ + +L G NI +PPL EQK IA+K++ L A++D K Sbjct: 139 YYFFAR--RSDLVALGKGGAQPNISQTIIKDFPFALPPLNEQKRIADKIERLFAKIDEAK 196 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL------------------------TEKWRNFEPQ 219 E++ + +++ R +L A G+L E+W P Sbjct: 197 RLIEEVKESIEQRRAVMLEKAFKGQLGTNDPSEKSILETSDDLSEKDVIPKEQWPYEVPG 256 Query: 220 HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 + + + +S L L+ G ++ + G LRI+ ++ +VD + + + + L Sbjct: 257 NWTW--IKLKSCLKRLQYGYTATSSTLTEGPKYLRITDIQNDNVDWETVPYCKIDDKLLE 314 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 ++KL GD++ R + G L+ ++ LIR + ++ P Y+ + Sbjct: 315 KYKLNKGDIVIARTGATT---GKSFLID-DMPFCSVFASYLIRLTMNENLNPYYLWNYLK 370 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 S + VK Q G + + I +V LPPV EQ I +++ L + ++ V Sbjct: 371 SSMYWKQIT-IVKKGIAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLENEKQLV 429 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAE 429 +++ L QS+L KAFRGEL + Sbjct: 430 LAVEEKLDLLKQSVLQKAFRGELGTNDPND 459 Score = 166 bits (422), Expect = 1e-39, Method: Composition-based stats. 
Identities = 40/246 (16%), Positives = 90/246 (36%), Gaps = 12/246 (4%) Query: 197 RQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRIS 256 + +L A+ ++ P + V+ + + + G P ++ Sbjct: 9 MEQLLEEAL--VPKDEQPYEVPGNWVWVRSGHVAKWGSGGTPSRKRLEYYGGDIPWIKTG 66 Query: 257 SVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 + G + ++ E + + G ++ Y ++ +G+ G+ + Sbjct: 67 ELNDGIITGSEETITEEGLQKSSAKIFPKGSIVIAMYGATIGRLGILGI-------DAAT 119 Query: 317 PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK 376 + + +Y+ +F AR + + + Q IS IK LPP+ Sbjct: 120 NQACAVGQPYEFLDSKYMFYYF---FARRSDLVALGKGGAQPNISQTIIKDFPFALPPLN 176 Query: 377 EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISG 436 EQ I ++E+LFA D ++ + + +L KAF+G+L +E L + Sbjct: 177 EQKRIADKIERLFAKIDEAKRLIEEVKESIEQRRAVMLEKAFKGQLGTNDPSEKSILETS 236 Query: 437 ENSAAA 442 ++ + Sbjct: 237 DDLSEK 242 Score = 156 bits (394), Expect = 3e-36, Method: Composition-based stats. Identities = 51/223 (22%), Positives = 93/223 (41%), Gaps = 9/223 (4%) Query: 5 KLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 ++P W + + L G T +R +IQN D + + Sbjct: 253 EVPGNWTWIKLKSCLKRLQYGYTATSSTLTE-----GPKYLRITDIQNDNVDWETVPYCK 307 Query: 64 -KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + E K++ DIVIA + + +PF F ++ L + + ++ Sbjct: 308 IDDKLLEKYKLNKGDIVIARTGATTGK--SFLIDDMPFCSVFASYLIRLTMNENLNPYYL 365 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++ KSS+Y +I+ + G + +P+PP+ EQK IAEKLD LL ++++ Sbjct: 366 WNYLKSSMYWKQITIVKKGIAQPGANARIIGELIVPLPPVPEQKRIAEKLDNLLEKLENE 425 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKK 225 K + + L +Q+VL A G+L N + K+ Sbjct: 426 KQLVLAVEEKLDLLKQSVLQKAFRGELGTNDPNDGHAMELVKE 468 >UniRef50_A3EKX4 Type I restriction modification DNA specificity domain protein n=1 Tax=Vibrio cholerae V51 RepID=A3EKX4_VIBCH Length = 466 Score = 340 bits (873), Expect = 7e-92, Method: Composition-based stats. 
Identities = 149/485 (30%), Positives = 229/485 (47%), Gaps = 54/485 (11%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP+GWV +S L G +K Y +D +IR N+Q+G ++ Sbjct: 1 MS--QLPKGWVCTSISQCFELKNGYAFKSSD---YTEDGDF-VIRIGNVQDGHIILSNPA 54 Query: 61 FVP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 +V + L +S K++ DI+I+++ G+ +G + +HLP + + Sbjct: 55 YVAAEKLGADSFKLNEGDILISLT-GNVGRIGMVSKEHLP--AVLNQRVAKICVVNSVEI 111 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + ++ L++ + SL+ GA NI + +PPLAEQ I EKLD +LAQV Sbjct: 112 RWLFYLLRTRLFQQHVLSLAKGAAQLNISTKDIQSFDFALPPLAEQTRIVEKLDEVLAQV 171 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 D+ KAR + IP ILKRFRQ+VL AV+GKLTE+WR P K+ T+L + Sbjct: 172 DTIKARLDGIPAILKRFRQSVLAAAVSGKLTEEWRQLNPNQPSHPKVGKVKYKTDLFDSA 231 Query: 240 SSK----------------------------PNESGVGHPILRISSVRAG--HVDQNDIR 269 S + G LR+S+VR +D +D++ Sbjct: 232 SKSLPELPPEWLVIPAAHLLEYVTSGSRGWANYYASSGALFLRMSNVRYDTTKLDLSDLQ 291 Query: 270 FLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 ++ E+ E R +++ DL+ + VG + + + L AR Sbjct: 292 YVNLPENVEGKRSLVKENDLVISIT----ADVGRVARVDSEIEEAYV-NQHLALARPASH 346 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 E++ +S + + +K + + G+ DI+S + P + EQ EIVR V+Q Sbjct: 347 IDAEFLAKCIASVNIGIKQVQALKRGATKAGLGLDDIRSMAIPFPHLAEQKEIVRLVDQY 406 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 FA+ADTIE V A ARV+ LTQSILAKAFRGEL Q + P A LLE+I Sbjct: 407 FAFADTIEALVKKAQARVDKLTQSILAKAFRGELVPQDPNDEP--------ADKLLERIA 458 Query: 449 AERAA 453 R Sbjct: 459 TARKK 463 >UniRef50_P06990 Type-1 restriction enzyme EcoBI specificity protein n=2 Tax=Escherichia coli RepID=T1SB_ECOLX Length = 474 Score = 338 bits (868), Expect = 3e-91, Method: Composition-based stats. 
Identities = 190/473 (40%), Positives = 265/473 (56%), Gaps = 39/473 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W+ + +V + G +K + N + D +PLIR ++ G T +P+ Sbjct: 23 DSWLRISMDSVANITNGFAFKSSEFNN--RKDGVPLIRIRDVLKGNTSTYYSGQIPEG-- 78 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSGFIAHFT 126 + PED+++ M + + S + ++ F H Sbjct: 79 ---YWVYPEDLIVGMDGDFNATIWCS------EPALLNQRVCKIEVQEDKYNKRFFYHAL 129 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 Y + I++ ++ + ++ + +P+PPLAEQKIIAEKLDTLLAQVDSTKAR Sbjct: 130 --PGYLSAINANTSSVTVKHLSSRTLQDTLLPLPPLAEQKIIAEKLDTLLAQVDSTKARL 187 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-------------PQHSVFKKLN-FESIL 232 EQIPQILKRFRQAVL AV G+LT++ ++F P+ LN + Sbjct: 188 EQIPQILKRFRQAVLAAAVTGRLTKEDKDFITKKVELDNYKILIPEDWSETILNNIINTQ 247 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFT 291 L G+ ++ G ++R+ + G VD N +R + + R K++ D+L T Sbjct: 248 RPLCYGVVQPGDDIKDGIELIRVCDINDGEVDLNHLRKISKEIDLQYKRSKVRKNDILVT 307 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNC 350 +G G++++ N+ + R K +P ++ I+ SSP + ++ Sbjct: 308 IVGA----IGRIGIVRE--DINVNIARAVARISPEYKIIVPMFLHIWLSSPVMQTWLVQS 361 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 K +K ++ KD+K+ V LP ++EQ EIVRRVEQLFAYAD+IEKQVNNALARVNNLT Sbjct: 362 SKE-VARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSIEKQVNNALARVNNLT 420 Query: 411 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK Sbjct: 421 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 473 Score = 136 bits (342), Expect = 3e-30, Method: Composition-based stats. 
Identities = 45/216 (20%), Positives = 88/216 (40%), Gaps = 8/216 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +PE W ++ + R + Y Q + +KD + LIR +I +G+ D L + K Sbjct: 231 IPEDWSETILNNIINTQRPLCYGVVQPGDDIKD-GIELIRVCDINDGEVDLNHLRKISKE 289 Query: 66 LVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 + + + K+ DI++ + +G+ + + PE K+I F+ Sbjct: 290 IDLQYKRSKVRKNDILVTIVGA----IGRIGIVREDINVNIARAVARISPEYKIIVPMFL 345 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + S + + + S + +P+P + EQ I +++ L A DS Sbjct: 346 HIWLSSPVMQTWLVQSSKEVARKTLNLKDLKNAFVPLPSIEEQHEIVRRVEQLFAYADSI 405 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 + + + Q++L A G+LT +WR P Sbjct: 406 EKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 441 >UniRef50_B3YJG5 Type I restriction enzyme EcoKI specificity protein n=3 Tax=Gammaproteobacteria RepID=B3YJG5_SALET Length = 486 Score = 334 bits (858), Expect = 3e-90, Method: Composition-based stats. Identities = 183/502 (36%), Positives = 260/502 (51%), Gaps = 54/502 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLPEGWV + + K + + +D ++ +I+ + Sbjct: 1 MSAGKLPEGWVDTQLGNIVDY-----GKATKRVLSDVNDDTWVLELEDIEKESSKLLSTI 55 Query: 61 FVPKNLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + K ++ D++ + + + L E + Sbjct: 56 RASERPFKSTKNSFKRGDVLYGKLRPYLNKI-----IIAKEDGVCTTEIIPLCAEPSCCN 110 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +I ++ KSS ++ ++ +S G N+ + A + + PLAEQKIIAEKLDTLLAQ+ Sbjct: 111 KYIFYWLKSSTFQGYVNDVSYGVNMPRLGTADGLKAPLRLAPLAEQKIIAEKLDTLLAQI 170 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE---------------------- 217 DSTKAR EQIPQILKRFRQAVL AV+G LT +WR Sbjct: 171 DSTKARLEQIPQILKRFRQAVLAAAVSGNLTAEWRMNNNSNIVEEEIEKVKNKLIAKKII 230 Query: 218 -------------PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD 264 P S + + +SI T++ +G P G ++ +++ G++ Sbjct: 231 KKDLIYSKLDRKYPIPSDWLYVKLQSIATKITDGEHKTPKREPAGQLLISARNIQDGYLK 290 Query: 265 
QNDIRFLECSESELNRHKLQD--GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 +D+ ++ +E + R++ GD+L + +G L+ + ++ LI+ Sbjct: 291 LSDVDYVGDAEFQKLRNRCDPDSGDVLISCSGS----IGRVCLVDENSKYVMVRSVALIK 346 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 + + +Y+ SP + + K+ + Q + IK+ + LPPV EQAEIV Sbjct: 347 L-MQDFVINKYMMYLLQSPLLQKEIEENSKS-TAQANLFLGPIKNLGIPLPPVPEQAEIV 404 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 RRVEQLFAYADTIEKQVN+AL RVN+LTQSILAKAFRGELTAQWR ENP LISGENSAAA Sbjct: 405 RRVEQLFAYADTIEKQVNSALTRVNSLTQSILAKAFRGELTAQWRTENPSLISGENSAAA 464 Query: 443 LLEKIKAERAASGGKKASRKKS 464 LLEKIKAERAASGGKK SRKK+ Sbjct: 465 LLEKIKAERAASGGKKTSRKKA 486 >UniRef50_C9YAL6 Putative uncharacterized protein n=1 Tax=Curvibacter putative symbiont of Hydra magnipapillata RepID=C9YAL6_9BURK Length = 449 Score = 332 bits (851), Expect = 2e-89, Method: Composition-based stats. Identities = 124/471 (26%), Positives = 215/471 (45%), Gaps = 33/471 (7%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS LP+ W AP+ + + ++ +A ++ +P++ A NI + K + Sbjct: 1 MS---LPQSWTTAPLGKLCEKLSDGSHNPPKA----QETGMPMLSARNINDRKITFDEFR 53 Query: 61 FVPKNLV---KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKL 116 + ++S D+++ + +G++A + VL+P K Sbjct: 54 LISPEEFAEEDRRTRVSSGDVLLTIVGA----IGRTAVVPQGAPQFTLQRSVAVLKPIKS 109 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 S +I++ ++ + + + G I + + IP+ P EQK IA+KLDT+L Sbjct: 110 -DSRYISYALEAPALQKYLQDNAKGTAQKGIYLKALAGVEIPVAPEPEQKRIADKLDTVL 168 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-NFEPQHSVFKKLNFESILTEL 235 +VD+ R ++ +LKRFRQ+VL A +G+LTE WR P+ + + + + Sbjct: 169 TRVDAVNTRLARVAPLLKRFRQSVLAAATSGRLTEDWRNGSIPEVKEWSEKALSEVCRTI 228 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD--GDLLFTRY 293 +G P + G P++ VR VD +D +F+ ++ +R + GD+L Sbjct: 229 TDGEHISPPLAPHGVPLVSAKDVREWGVDFSDTKFVSEEFADASRKRCGPICGDVLVVSR 288 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 
353 + VG L+K + L+ L + T E++ +SP + Sbjct: 289 GAT---VGRTCLVKSKEKFCLMGSVLLFQPTAT-LIKSEFLAHVLASPLGLEQLTK-ASG 343 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + Q I +D K + LP ++EQ EIVRRVE LFA+AD +E ++ A A LT ++ Sbjct: 344 ATAQAAIYIRDAKGLKIRLPSIEEQTEIVRRVETLFAFADRLEARLAQAQAAATRLTPAL 403 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 LAKAF GEL Q + P AA LL ++ A+ A+ + RK + Sbjct: 404 LAKAFSGELVPQDPNDEP--------AAELLRRL-AQAPATASPRKGRKAA 445 >UniRef50_Q1MKB2 Putative type I restriction enzyme specificity subunit n=2 Tax=Alphaproteobacteria RepID=Q1MKB2_RHIL3 Length = 456 Score = 330 bits (848), Expect = 5e-89, Method: Composition-based stats. Identities = 128/474 (27%), Positives = 215/474 (45%), Gaps = 29/474 (6%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTD 58 MS LP+GWV A + + + + + + + + + + G Sbjct: 1 MSG--LPKGWVEATLEELCQ------FNPKHDPDVDQSLGVNFVPMPAVDDETGAIIDKS 52 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHL--PFECSFGAFCGVLRPEKL 116 +V + K + D++ A + GK A VLR + Sbjct: 53 VVRPLSEIWKGYTHFADRDVIFAKITPCMEN-GKIAVARDLANGMACGSTEFHVLRSKGA 111 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + F+ F + YR GA + + ++P+PPL EQK I KLDTL Sbjct: 112 VEPDFLWRFLRRKNYRQVAEHSMTGAVGQRRVPRQFLETTSLPLPPLNEQKRIVAKLDTL 171 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 A+ + +I ++ RF+QAVL A +G+LT+ WR+ + + ++ L +++ Sbjct: 172 NAKSARARTELARIEILVSRFKQAVLSKAFSGELTKDWRSGQTTLAPWENLPLSQLVSHG 231 Query: 236 -RNGLSSKPNESGVGHPILRISSVRAGHVDQND--IRFLECSESELNRHKLQDGDLLFTR 292 NG S K + G L++S+ +G + ++ I++L+ + E ++ L D++ R Sbjct: 232 PSNGWSPKADGKVSGLKSLKLSATSSGRLRLDESTIKYLDQTLPEDSKFWLLSDDIVIQR 291 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCV 351 N SLE +G L ++PD ++R R+ K P Y+ + +S SAR+ Sbjct: 292 AN-SLELLGTTVLFDGP-PGEFIFPDLMMRIRVNDKKTNPRYLATYLNSDSARSYFRANA 349 Query: 352 KTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 ++G I+G ++ V PP++EQ 
EIV R+E FA D + + AL V L Sbjct: 350 TGSAGNMPKINGSTVRETRVPTPPLEEQQEIVHRIESAFAMTDRLAAEAMRALDLVGKLG 409 Query: 411 QSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 ++ILAKAFRGEL Q + P A LLE+I+AER A+ K R+K+ Sbjct: 410 EAILAKAFRGELVPQDENDEP--------AEKLLERIRAEREAAPEAKRGRRKT 455 >UniRef50_A6UXD7 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas aeruginosa PA7 RepID=A6UXD7_PSEA7 Length = 464 Score = 326 bits (837), Expect = 1e-87, Method: Composition-based stats. Identities = 79/453 (17%), Positives = 175/453 (38%), Gaps = 24/453 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAIN--YLKDDYLPLIRANNIQNGKFDTTDLVFV 62 ++P+ W P+ + L R + I + D + I N+ G + F+ Sbjct: 19 QVPDHWSSVPIKYMA-LERNSLFLDGDWIESKDISSDGIRYITTGNVGEGAYKEQGAGFI 77 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + ++ D++++ + +L + RP+ + Sbjct: 78 SEETFHALRCTEVYEGDVLVSRLNNPIGR--ACVVPNLGGRVVTSVDNVIFRPDLKFYKK 135 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FI + S Y S+L+ GA + I I + P L EQ IA LD A++D Sbjct: 136 FIVYLFSSEEYFKHTSNLARGATMQRISRGLLGNIRVVTPSLEEQTQIARFLDHETARID 195 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN-FES 230 + +++ ++LK RQAV+ AV L +W P H + ++ Sbjct: 196 ALIEEQQRLIELLKEKRQAVISHAVTKGLDPTVPMKDSGVEWLGEVPAHWEVRSISSISK 255 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLL 289 +T G + G L+ +++ + F+ +E + L GD+L Sbjct: 256 KITNGYVGPTRDILVDEPGVRYLQSLHIKSNKIKFEVPYFVSEQWSAEHAKSILASGDVL 315 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 + +G ++ + +H +I + + + L E++ +S +++++ Sbjct: 316 IVQTG----DIGQVAVVTE-EHAGCNCHALIIVSPVREVVLGEWVSWVLNSTYGYHSLLS 370 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 ++T + ++ ++K + +PP++EQA IV +E D++ + +L + Sbjct: 371 -IQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIESGELEMDSLMSETKRSLLLLQER 429 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 ++++ A G++ + + E + A Sbjct: 430 RTALISAAVTGKIDVRGWQPPASTQAPEPAVAE 462 Score = 166 
bits (420), Expect = 2e-39, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 86/236 (36%), Gaps = 16/236 (6%) Query: 212 KWRNFEPQHSV---FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDI 268 +W + P H K + E L + S G + +V G + Sbjct: 15 EWLDQVPDHWSSVPIKYMALERNSLFLDGDWIESKDISSDGIRYITTGNVGEGAYKEQGA 74 Query: 269 RFLECSESELNRHK-LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 F+ R + +GD+L +R N +G ++ L + + D +I R Sbjct: 75 GFISEETFHALRCTEVYEGDVLVSRLN---NPIGRACVVPNLGGRVVTSVDNVI-FRPDL 130 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 ++I FSS + + + + IS + + V+ P ++EQ +I R ++ Sbjct: 131 KFYKKFIVYLFSSEEYFKH-TSNLARGATMQRISRGLLGNIRVVTPSLEEQTQIARFLDH 189 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + ++ + + Q++++ A + +P + ++ L Sbjct: 190 ETARIDALIEEQQRLIELLKEKRQAVISHAVT-------KGLDPTVPMKDSGVEWL 238 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 37/210 (17%), Positives = 91/210 (43%), Gaps = 8/210 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + +S+++ I + I + + +++ +I++ K FV Sbjct: 239 GEVPAHWEVRSISSISKKITNGYVGPTRDILVDEP-GVRYLQSLHIKSNKIKFEVPYFVS 297 Query: 64 K--NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSG 120 + + ++ D++I +G+ A ++ P +++ Sbjct: 298 EQWSAEHAKSILASGDVLIV----QTGDIGQVAVVTEEHAGCNCHALIIVSPVREVVLGE 353 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +++ S+ + + S+ GA ++ + +N+PIPPL EQ I +++ ++D Sbjct: 354 WVSWVLNSTYGYHSLLSIQTGAMHPHLNCGNVKFLNLPIPPLEEQARIVSFIESGELEMD 413 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 S + ++ +L+ R A++ AV GK+ Sbjct: 414 SLMSETKRSLLLLQERRTALISAAVTGKID 443 >UniRef50_C3Q383 Putative uncharacterized protein n=1 Tax=Bacteroides sp. 9_1_42FAA RepID=C3Q383_9BACE Length = 428 Score = 326 bits (836), Expect = 1e-87, Method: Composition-based stats. 
Identities = 97/433 (22%), Positives = 165/433 (38%), Gaps = 31/433 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W + P+ + G+TY +D ++R++NIQN K + D V+V Sbjct: 15 IGEIPNHWEVVPLKRTGSFENGLTYSPNDI----RDKGYIVLRSSNIQNSKMNYEDTVYV 70 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 V + DI+I +GS S+VGK A +FGAF P I + + Sbjct: 71 ES--VPNDLLVKKGDIIICSRNGSASLVGKCAKFDGKIAATFGAFMMRYSPS--INNEYA 126 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + + L + IN + + P+PPL+EQ+ IA LD ++D Sbjct: 127 FFSFQ--ILMRNYKGLFTTSTINQLTKNVIAQMVCPLPPLSEQQAIASYLDAKTEKIDKM 184 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSV-FKKLNFESIL 232 A+ E+ + L +Q+++ AV L KW P+H K S + Sbjct: 185 IAKAEKKIEYLGELKQSLITRAVTRGLNPNASLKDSGVKWIGKVPEHWETIKLSRVYSYI 244 Query: 233 TELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 LSS+ + G+ L+ + G + Q + + + E ++ Sbjct: 245 GSGTTPLSSQEDYYSEEGYNWLQTGDLNNGLITQTSKKITKKAIDECRMKFYPKHSVVIA 304 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 Y + +G GLL T+ P + S A + Sbjct: 305 MYGAT---IGKVGLLDLES----TTNQACCVISPTQKMNPLFTFY---SFMAAKKELLLA 354 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 GQ IS IK V +PP++EQ I+ +++ D I +A + L Q Sbjct: 355 SFGGGQPNISQDIIKKLRVPVPPLEEQNAIILSLKKECDTIDHIIATQKKKIAYLQELKQ 414 Query: 412 SILAKAFRGELTA 424 S++ G++ Sbjct: 415 SLITNVVTGKIKV 427 Score = 161 bits (409), Expect = 4e-38, Method: Composition-based stats. 
Identities = 50/233 (21%), Positives = 88/233 (37%), Gaps = 21/233 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRF 270 KW P H L NGL+ PN+ G+ +LR S+++ ++ D + Sbjct: 13 KWIGEIPNHWEVVPLKRTG---SFENGLTYSPNDIRDKGYIVLRSSNIQNSKMNYEDTVY 69 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 +E ++L ++ GD++ NGS VG C + ++R + Sbjct: 70 VESVPNDL---LVKKGDIIICSRNGSASLVGKCAKFDG--KIAATFGAFMMRYSPS--IN 122 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 EY F RN + TTS ++ I V LPP+ EQ I ++ Sbjct: 123 NEYAFFSFQ-ILMRNY--KGLFTTSTINQLTKNVIAQMVCPLPPLSEQQAIASYLDAKTE 179 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + + L QS++ +A R NP+ ++ + Sbjct: 180 KIDKMIAKAEKKIEYLGELKQSLITRAVT-------RGLNPNASLKDSGVKWI 225 >UniRef50_C6CR26 Restriction modification system DNA specificity domain protein n=1 Tax=Dickeya zeae Ech1591 RepID=C6CR26_DICZE Length = 462 Score = 325 bits (835), Expect = 1e-87, Method: Composition-based stats. 
Identities = 81/449 (18%), Positives = 172/449 (38%), Gaps = 36/449 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + ++ G T K Y ++ +P + + ++ +G Sbjct: 19 GQVPVHWNAVSLKWISQRYSGGTPDKSNDA-YWENGDIPWLNSGSVNDGYITEPSTYITR 77 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + S K P++ ++ +G G A L + + P++ F+ Sbjct: 78 EGFASSSAKWVPKNALVMALAGQGKTKGMVA--QLGIRATCNQSMAAIIPKEKFTPRFLY 135 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + S+ I +++ G + + I P+ P EQ IA+ LD ++DS Sbjct: 136 WWLVSNY--QNIRNMAGGEQRDGLNLDMLGSIPCPLLPRPEQTAIADFLDRETGRIDSLM 193 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE-------------------KWRNFEPQHSVFK 224 A+ Q+ +LK R A++ V L E +W P+ K Sbjct: 194 AKKRQLIALLKEKRCALISHIVTRGLPEAAADEFGLKPHTRFKNSDIEWLGQVPEGWGVK 253 Query: 225 --KLNFESILTELRNGLS-----SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE 277 + S EL++G + G G P + + + G +D N ++E +++ Sbjct: 254 KVWIERVSRNIELQDGNHGEQHPKAEDYVGEGIPFVMANHIDNGKIDFNKCNYIEKEQAD 313 Query: 278 LNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 R +GD+L T +G G+++K ++ ++ R ++ ++ Sbjct: 314 SLRIGFSNEGDVLLTHKG----TIGRVGIVQKSHFPYVMLTPQVTYYRCLREIQNRFLFW 369 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 S ++ + S + I D K+ L+P KEQ I +++ + D + Sbjct: 370 LMQSKFWQDQLKLLAGLGSTRAYIGLLDQKTLSFLIPSEKEQFAIATYLDRETSKLDRLV 429 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQ 425 ++V+ +AR+ +++ A G++ + Sbjct: 430 EKVDAVIARLQEYRTALITAAVTGKIDVR 458 Score = 133 bits (336), Expect = 1e-29, Method: Composition-based stats. 
Identities = 30/243 (12%), Positives = 66/243 (27%), Gaps = 16/243 (6%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQ 265 + KW P H L + S S G P L SV G++ + Sbjct: 11 KESDVKWLGQVPVHWNAVSLKWISQRYSGGTPDKSNDAYWENGDIPWLNSGSVNDGYITE 70 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 + + + L+ G + + Sbjct: 71 PSTYITREGFASSSAKWVPKNALVMALAGQ-----GKTKGMVAQLGIRATCNQSMAAIIP 125 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + P ++ + S + + + G++ + S L P EQ I + Sbjct: 126 KEKFTPRFLYWWLVSNY---QNIRNMAGGEQRDGLNLDMLGSIPCPLLPRPEQTAIADFL 182 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFR-GELTAQWRAEN----PDLISGENSA 440 ++ D++ + +A + ++++ G + A+ P + Sbjct: 183 DRETGRIDSLMAKKRQLIALLKEKRCALISHIVTRG--LPEAAADEFGLKPHTRFKNSDI 240 Query: 441 AAL 443 L Sbjct: 241 EWL 243 >UniRef50_UPI0001C15DDF Restriction modification system DNA specificity domain protein n=1 Tax=Cylindrospermopsis raciborskii CS-505 RepID=UPI0001C15DDF Length = 445 Score = 325 bits (835), Expect = 2e-87, Method: Composition-based stats. 
Identities = 83/435 (19%), Positives = 169/435 (38%), Gaps = 39/435 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+PE W + VS I T +Y + +P + + ++ T Sbjct: 31 GKIPEHWEVRKVSHAFQKIGSGTTPSTNHYDYY-EGNIPWVNTSELREKVITDTSAKLTN 89 Query: 64 KNLVKES--QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 K L+ S P ++IAM + +G C L I + F Sbjct: 90 KALLDHSVLNLYPPGTLLIAMYGATIGRLGILGIT-----ACTNQACCALANPISINAKF 144 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 ++ RN++ LS+G NI I IP PPL EQ+ IA+ LD A++D+ Sbjct: 145 AFYWLWMR--RNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQAIAQFLDRETAKIDT 202 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A+ E++ ++LK R A++ AV L +W P++ +L + + Sbjct: 203 LVAKKERLIELLKEKRTALISHAVTKGLNPDAPMKDSGVEWLGEVPRNWPMIRLKHVAPV 262 Query: 233 TELRNGLSSKPNESGVGHPILRISSV--RAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + S+K + P + + + + G + + E + GD+LF Sbjct: 263 S------SAKLTQKPDNLPYIGLEHIESKTGRLLLD----TPVENVESTVSCFEKGDVLF 312 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 + L V L + +L+ + ++D +++ + + + + Sbjct: 313 GKLRPYLAKV-------LLAEFEGVSTTELLALKPSQDVNGKFLFFQLIAEGFIDQVNSF 365 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + + + I + + LPP+ EQ I + +++ A DT+ + ++ ++ Sbjct: 366 -TYGTKMPRVGPEQITNLFIPLPPLPEQQAIAQFLDRETAKIDTLVAKTRTSIEKLKEYR 424 Query: 411 QSILAKAFRGELTAQ 425 ++++ A G++ + Sbjct: 425 TALISAAVTGKIDVR 439 Score = 176 bits (448), Expect = 1e-42, Method: Composition-based stats. 
Identities = 33/234 (14%), Positives = 80/234 (34%), Gaps = 19/234 (8%) Query: 212 KWRNFEPQHSVFKKLNF-ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRF 270 +W P+H +K++ + ++ + P + S +R + + Sbjct: 28 EWLGKIPEHWEVRKVSHAFQKIGSGTTPSTNHYDYYEGNIPWVNTSELREKVITDTSAKL 87 Query: 271 LECSESELNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + + + L G LL Y ++ +G+ G+ Sbjct: 88 TNKALLDHSVLNLYPPGTLLIAMYGATIGRLGILGI-------TACTNQACCALANPISI 140 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 ++ + R + + + GQ I+ + I+S + PP+ EQ I + +++ Sbjct: 141 NAKFAFYWL---WMRRNELILLSSGGGQPNINQEKIRSIRIPAPPLTEQQAIAQFLDRET 197 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A DT+ + + + ++++ A + NPD ++ L Sbjct: 198 AKIDTLVAKKERLIELLKEKRTALISHAVT-------KGLNPDAPMKDSGVEWL 244 >UniRef50_A8ZTW4 Restriction modification system DNA specificity domain n=1 Tax=Desulfococcus oleovorans Hxd3 RepID=A8ZTW4_DESOH Length = 477 Score = 323 bits (828), Expect = 1e-86, Method: Composition-based stats. 
Identities = 134/504 (26%), Positives = 219/504 (43%), Gaps = 84/504 (16%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPEGWV AP+ ++ ++ G K + + K P+ AN+I + + Sbjct: 5 LPEGWVAAPLQKISQIVYGKGLPKNK---FNKQGLYPVFGANSI---------IGYYDSF 52 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS-GFIAH 124 L ++ Q ++I+ + + S P V P L S ++ + Sbjct: 53 LYEDPQ------VLISCRGANSGTINIS----PPKCFVTSNSLVVQLPNTLHQSFKYLYY 102 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 +SS + G + + +P+PP EQK I +LD ++ ++D K Sbjct: 103 ALESSDK----EKIVTGTAQPQVTIDNLKSFCVPLPPFNEQKRIVARLDQIIPRIDKLKT 158 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ------------------------- 219 R ++IP I+KRFRQ+VL AV G+LTEKWR P Sbjct: 159 RLDKIPTIIKRFRQSVLTAAVTGRLTEKWREDHPDVEGAEATVQSIYYRRLDESQTNQQK 218 Query: 220 ------------------HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAG 261 +K I + G SSK ++ G P+LR+ +++ G Sbjct: 219 NKIEKLFAEVETEDNGLLPETWKYTFLNKICESFQYGTSSKSSKKGD-IPVLRMGNLQNG 277 Query: 262 HVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI 321 +D +++ + ++ E+ ++KL+ +LF R N S E VG + L + ++ LI Sbjct: 278 AIDWSNLVY-SSNKKEIEKYKLEKNTVLFNRTN-SPELVGKTAIY--LGERAAIFAGYLI 333 Query: 322 RARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 R Y+ ++ A+ Q I+ + + + PP++EQ EI Sbjct: 334 RINNMDILDSHYLNYSLNTDYAKAFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQKEI 393 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAA 441 VR+VE+ FA AD +E NA ARV+ L +S+LAKAFRGELT Q + P A Sbjct: 394 VRQVERSFALADKLEAHYQNARARVDKLARSVLAKAFRGELTPQDPNDEP--------AE 445 Query: 442 ALLEKIKAERAA-SGGKKASRKKS 464 LLE+I AE+ + K +RK++ Sbjct: 446 KLLERILAEKEKMAAAVKKTRKQA 469 Score = 167 bits (423), Expect = 1e-39, Method: Composition-based stats. 
Identities = 51/226 (22%), Positives = 96/226 (42%), Gaps = 7/226 (3%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G LPE W ++ + + T K K +P++R N+QNG D ++LV+ Sbjct: 232 DNGLLPETWKYTFLNKICESFQYGTSSKS-----SKKGDIPVLRMGNLQNGAIDWSNLVY 286 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 E K+ ++ ++ S +VGK+A F + + ++ S + Sbjct: 287 SSNKKEIEKYKLEKNTVLFNRTN-SPELVGKTAIYLGERAAIFAGYLIRINNMDILDSHY 345 Query: 122 IAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + + + + + G N +NI IP PPL EQK I +++ A D Sbjct: 346 LNYSLNTDYAKAFCNREKTDGVNQSNINAQKLGRFEIPFPPLEEQKEIVRQVERSFALAD 405 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL 226 +A ++ + + ++VL A G+LT + N EP + +++ Sbjct: 406 KLEAHYQNARARVDKLARSVLAKAFRGELTPQDPNDEPAEKLLERI 451 >UniRef50_B8H0M3 Type I restriction-modification system specificity subunit n=2 Tax=Caulobacter vibrioides RepID=B8H0M3_CAUCN Length = 450 Score = 321 bits (824), Expect = 3e-86, Method: Composition-based stats. 
Identities = 79/434 (18%), Positives = 165/434 (38%), Gaps = 22/434 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W P+ + + G T KE+ +P A +++ T Sbjct: 18 GRVPSHWNFRPLKHLVIMRSGGTPSKER--EDYWGGEIPWASAKDLKVDTLTDTQDHLTA 75 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + L + + ++ P + V+ + G ++ ++ L + L + + ++ Sbjct: 76 EALDEGAAQLLPANAVVVLVRG--MMLARTFPVCRLSRPMTINQDLKGLIANRGVDPNYL 133 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 A ++S AG ++ ++ + +P P LAEQ+ IA LD A++D+ Sbjct: 134 AWSLRASEVETLCRLDEAGHGTKALRMDAWSTMELPAPSLAEQQAIAAFLDRETAKIDAL 193 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN-FESIL 232 E++ +LK RQAV+ AV L +W P H + Sbjct: 194 VEAQERLIALLKEKRQAVISHAVTKGLDPSAQMKDSGVEWLGQMPAHWEVVPAKNLADSI 253 Query: 233 TELRNGLSSKPN-ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 G + + S G+ + V G D +EL++++++ GDLL + Sbjct: 254 KAGPFGSALTKDMYSSAGYRVYGQEQVIPGDFRIGDYYVTSDRYNELSQYRVEVGDLLVS 313 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 G + + ++ P +LIR R P Y+ + S + + + Sbjct: 314 CVG----TFGKIAIFPQGAEPGIINP-RLIRFRPNNQVDPTYLCVLLRSAVSFEQF-SYL 367 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 I+ + VV +PP++EQ I + ++ D++ A+ + Sbjct: 368 SRGGTMDVINIGILGEIVVPVPPMQEQISIAGYLAEVQEQFDSLSAASEAAITLLQERRA 427 Query: 412 SILAKAFRGELTAQ 425 ++++ A G++ + Sbjct: 428 ALISAAVTGKIDVR 441 Score = 134 bits (337), Expect = 1e-29, Method: Composition-based stats. 
Identities = 28/232 (12%), Positives = 65/232 (28%), Gaps = 12/232 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H F+ L I+ + + G P ++ + Sbjct: 15 EWLGRVPSHWNFRPLKHLVIMRSGGTPSKEREDYWGGEIPWASAKDLKVDTLTDTQDHLT 74 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + E L ++ L L + + L + P Sbjct: 75 AEALDEGAAQLLPANAVVVLVRGMMLARTFPVCRLS----RPMTINQDLKGLIANRGVDP 130 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 Y+ + + + G K + + + P + EQ I +++ A Sbjct: 131 NYLAWSLRASEV-ETLCRLDEAGHGTKALRMDAWSTMELPAPSLAEQQAIAAFLDRETAK 189 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + +A + Q++++ A + +P ++ L Sbjct: 190 IDALVEAQERLIALLKEKRQAVISHAVT-------KGLDPSAQMKDSGVEWL 234 >UniRef50_C6RQJ9 Restriction endonuclease S subunit n=2 Tax=Acinetobacter RepID=C6RQJ9_ACIRA Length = 461 Score = 320 bits (820), Expect = 8e-86, Method: Composition-based stats. Identities = 71/442 (16%), Positives = 162/442 (36%), Gaps = 31/442 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFV 62 G +P W+I + + G + + D P+IR +I+ +G + + ++ Sbjct: 19 GVVPSHWIITTLKRYCYVKGGFAFSS----DAFIDTGYPVIRIGDIKTDGSINLENCKYI 74 Query: 63 PKNLV--KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFS 119 P++L + +++AM+ + +GK+ G + + Sbjct: 75 PESLAVNSRDYLVEKNQLLMAMTGAT---IGKAGLYTSNQPAFLNQRVGKFELLAQNMNY 131 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + K+ Y+ I + G NI + IP EQ IA LD +++ Sbjct: 132 RYLWYILKTDGYQEYIKLTAFGGAQPNISDTAMVDYPATIPSFDEQTQIANFLDHETSKI 191 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D + +++ ++LK RQAV+ AV L +W P+H +L + + Sbjct: 192 DHLIEKQQRLIELLKEKRQAVISHAVTKGLNPNVPMKDSGVEWLGEVPEHWRISRLKYNA 251 Query: 231 ILTE--LRNGLSSKP-NESGVGHPILRISSVRA-GHVDQNDIRFLE-CSESELNRHKLQD 285 + G + + G +L S++ + +L E + + Sbjct: 252 SIFGRIGFRGYTVDDIVDEDEGALVLSPSNISNANKLTLEKKTYLSWKKYFESPEIIVDE 311 Query: 286 
GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 DLL + + G ++ + P + P ++ F S ++ Sbjct: 312 NDLLLVKTGSTF---GKSAIIVNKLEPMTINPQ--MALIKKSKIEPRFLGYLFGSKLIKS 366 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 ++ T SG ++ ++I + + LP +E I ++ D + ++ + Sbjct: 367 -IIENSNTGSGMPTMTQENINNFPIPLPSDEEAIIISNYLDNKTYKIDFLIEKSEQTILL 425 Query: 406 VNNLTQSILAKAFRGELTAQWR 427 + ++++ A G++ + Sbjct: 426 MQERRTALISAAVTGKIDVRNW 447 Score = 166 bits (422), Expect = 1e-39, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 84/235 (35%), Gaps = 18/235 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQNDIRF 270 +W P H + L + S G+P++RI ++ G ++ + ++ Sbjct: 16 EWLGVVPSHWIITTLKRYCYVKGGFAFSSD--AFIDTGYPVIRIGDIKTDGSINLENCKY 73 Query: 271 LECSESELNR-HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKD 328 + S + +R + ++ LL + +G GL +Q ++ + L ++ Sbjct: 74 IPESLAVNSRDYLVEKNQLLMAMTGAT---IGKAGLY--TSNQPAFLNQRVGKFELLAQN 128 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 Y+ + + + Q IS + +P EQ +I ++ Sbjct: 129 MNYRYLWYILKTDGYQEYI-KLTAFGGAQPNISDTAMVDYPATIPSFDEQTQIANFLDHE 187 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + D + ++ + + Q++++ A + NP++ ++ L Sbjct: 188 TSKIDHLIEKQQRLIELLKEKRQAVISHAVT-------KGLNPNVPMKDSGVEWL 235 >UniRef50_Q210J8 Type I restriction enzyme StySPI specificity protein n=1 Tax=Rhodopseudomonas palustris BisB18 RepID=Q210J8_RHOPB Length = 460 Score = 320 bits (820), Expect = 9e-86, Method: Composition-based stats. 
Identities = 122/470 (25%), Positives = 197/470 (41%), Gaps = 30/470 (6%) Query: 4 GKLPEGWVIAPVSTVTTL----IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 G LP GWV AP+ + L I Y ++ + ++R NI +F + D Sbjct: 3 GDLPSGWVAAPIDDLRALEPNAITDGPYGSSLKTSHYRSSGARVVRLGNIGFRRFLSADA 62 Query: 60 VFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKL 116 V++ ++ K + D++IA VG+S A C LR Sbjct: 63 VYISEDHFKALVKHHVRAGDVLIAALG---DPVGRSCIAPSDISPALVKADCFRLRCSPH 119 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + FI + S R SS + G I + F +P+PP EQ I K+D L Sbjct: 120 LSAPFIMLWLNSECAREAFSSAAHGLGRVRINLSDFRTTVVPVPPATEQGRIVAKIDNLS 179 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-NFEPQHSVFKKLNFESILTEL 235 A+ ++ + IPQ++++++QA+L A G+LT +WR N Q + + + S + + Sbjct: 180 AKSKRSRDHLDHIPQLVEKYKQAILAAAFRGELTHEWRVNNLDQKWPWPECSL-SDIANI 238 Query: 236 RNGLSSK----PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 G + K S P + +V+ V D E + E N G +L Sbjct: 239 GTGATPKRGEQRYYSNGNIPWITSGAVKHAVVQAADEYITEAAVRETNCKVFPAGTILMA 298 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 Y G +L N I+ R A+ +++ S + Sbjct: 299 MYGEGKTR-GRVTVLGINAATN--QAVAAIQVRADSPAVRDFVVWHLRSGYL--ELRERA 353 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 Q ++ + + + LP EQ E+VRRV++ FA+ D + + +A ++ L Q Sbjct: 354 AGGV-QPNLNLGIVNAWRIPLPSRDEQMEVVRRVQKAFAWIDRLTIETTSARKLIDRLDQ 412 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASR 461 +ILAKAFRGEL Q + P A+ LLE+IKA+RA S G R Sbjct: 413 AILAKAFRGELVPQDPNDEP--------ASILLERIKAKRAGSAGHTRRR 454 >UniRef50_C9NQJ7 HsdS type I site-specific deoxyribonuclease n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NQJ7_9VIBR Length = 563 Score = 320 bits (820), Expect = 1e-85, Method: Composition-based stats. 
Identities = 126/454 (27%), Positives = 215/454 (47%), Gaps = 26/454 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDD-YLPLIRANNI---QNGKFDTTDLV 60 KLP WV + + ++ G T K +N+ + + + ++ + + Sbjct: 3 KLPFNWVETEIGNLALVVSGGTPKAGDELNFAEPGAGIAWVTPADLSGYKQKEIANGRRD 62 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 PK L S K+ P+ ++ S V + E S + S Sbjct: 63 LSPKGLDSSSAKLMPKGTLLFSSRAPIGYVAIA-----ENEISTNQGFKSFIFTDHVNST 117 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + ++ KS ++ S +G + A + + PL EQ IA+KLD++LA+VD Sbjct: 118 YAYYYLKS--IKDLAESWGSGTTFKELSGAVAKKLPFRLAPLNEQIRIADKLDSILAKVD 175 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + R ++IP ILKRFRQ+VL A +G+LT +WR + + ++ +S+ G S Sbjct: 176 HAQERLDKIPDILKRFRQSVLAAATSGELTREWREGKEH--QWPRVQLKSVGRGFNYGSS 233 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 +K P+LR+ +++ G + +++ + + E++++ L+ GD+LF R N S E V Sbjct: 234 AKSK-PEGEVPVLRMGNLQGGQLHWDNLVYTS-DKEEIDKYLLEKGDVLFNRTN-SPELV 290 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + Q +Y LIR + ++ E++ I +SP AR+ Q I Sbjct: 291 GKTSIYRGEQ--KAIYAGYLIRIKGSEHLDTEFLNIQLNSPHARDYCWQVKTDGVSQSNI 348 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + K +++ LP + EQ EIVRRV +LF+ AD E Q + +N LTQSIL KAF G Sbjct: 349 NAKKLQAYEFDLPEIDEQLEIVRRVSELFSRADLFEYQYLASKKYLNRLTQSILVKAFNG 408 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 +L Q + SA+ LL+ I++E A+ Sbjct: 409 QLVPQEPTDE--------SASELLKLIESEMVAN 434 >UniRef50_A1BGI9 Restriction modification system DNA specificity domain n=2 Tax=cellular organisms RepID=A1BGI9_CHLPD Length = 479 Score = 319 bits (818), Expect = 2e-85, Method: Composition-based stats. 
Identities = 109/487 (22%), Positives = 205/487 (42%), Gaps = 65/487 (13%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKIS 74 + V I G +K + + LP+IR N+ D + ++ + Sbjct: 11 LGDVAEYINGRAFKPSE----WGKEGLPIIRIKNLN----DENSKFNYSNEVFEKRYLVK 62 Query: 75 PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNK 134 D++ A S+ + + K E +++P I ++ +F + Sbjct: 63 KGDLLFAWSASLGAYIWK------KDEAWLNQHIFLVKPSPFIAKLYLYYFL--DKITQE 114 Query: 135 ISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILK 194 + S + G+ + ++ F+ I +PPL+EQ+ I K++ L +++D+ A ++ + LK Sbjct: 115 LYSAAHGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKKAQEQLK 174 Query: 195 RFRQAVLGGAVNGKLTEKWRNFE------------------------------------- 217 +RQAVL A G+LT+ WR + Sbjct: 175 VYRQAVLKQAFEGELTKSWREQQANLPSAQDLLDTIKTEREQAAKNQGKKLKPVTPLAKV 234 Query: 218 ------PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 + + + + G S+K E G P++R+ +++ G +D ND+ F Sbjct: 235 ELDELTELPDGWCWIKLGELTIGVEYGTSTKSLEKGE-VPVIRMGNIQQGRIDWNDLAFT 293 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + ++++++++L GD+LF R N S E VG + ++ LIR K+ L Sbjct: 294 D-DKADISKYRLLKGDVLFNRTN-SPELVGKAAIYNGEMP--AIFAGYLIRVNQIKELLH 349 Query: 332 -EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 +Y+ F +S A+ + Q I+G+ +KS + KEQ +IV+ +E + Sbjct: 350 CKYLNFFLNSHPAKVYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQEIEARLS 409 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAE 450 D +E + +L + L QSIL KAF G+L ++ A LLE+I+AE Sbjct: 410 VCDNMEATIRESLEKAEALRQSILKKAFEGKLLSEEELTATRNDPDWEPAEKLLERIRAE 469 Query: 451 RAASGGK 457 + S + Sbjct: 470 KNQSKKQ 476 Score = 149 bits (378), Expect = 1e-34, Method: Composition-based stats. 
Identities = 52/242 (21%), Positives = 105/242 (43%), Gaps = 16/242 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP+GW + +T + T K L+ +P+IR NIQ G+ D DL F Sbjct: 241 ELPDGWCWIKLGELTIGVEYGTSTKS-----LEKGEVPVIRMGNIQQGRIDWNDLAFTDD 295 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLIFSGFIA 123 ++ D++ ++ S +VGK+A + F + + + ++L+ ++ Sbjct: 296 KADISKYRLLKGDVLFNRTN-SPELVGKAAIYNGEMPAIFAGYLIRVNQIKELLHCKYLN 354 Query: 124 HFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F S + +S+ G N +NI +P EQ+ I ++++ L+ D+ Sbjct: 355 FFLNSHPAKVYGNSVKTDGVNQSNINGEKLKSYPLPYCSPKEQEQIVQEIEARLSVCDNM 414 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL--------TEKWRNFEPQHSVFKKLNFESILTE 234 +A + + + RQ++L A GKL T ++EP + +++ E ++ Sbjct: 415 EATIRESLEKAEALRQSILKKAFEGKLLSEEELTATRNDPDWEPAEKLLERIRAEKNQSK 474 Query: 235 LR 236 + Sbjct: 475 KQ 476 Score = 139 bits (351), Expect = 2e-31, Method: Composition-based stats. Identities = 58/239 (24%), Positives = 99/239 (41%), Gaps = 21/239 (8%) Query: 226 LNFESILTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 + + E NG + KP+E G G PI+RI ++ + +F +E R+ ++ Sbjct: 8 IAILGDVAEYINGRAFKPSEWGKEGLPIIRIKNLND-----ENSKFNYSNEVFEKRYLVK 62 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 GDLLF ++ + + + + Y+ F Sbjct: 63 KGDLLFAWSASLGAYI--------WKKDEAWLNQHIFLVKPSPFIAKLYLYYFL---DKI 111 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + SG ++ K + + LPP+ EQ IV ++EQLF+ D + A Sbjct: 112 TQELYSAAHGSGMVHVTKKKFEETKIGLPPLSEQRSIVSKIEQLFSELDNGIACLKKAQE 171 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 ++ Q++L +AF GELT WR + +L SA LL+ IK ER + + + K Sbjct: 172 QLKVYRQAVLKQAFEGELTKSWREQQANLP----SAQDLLDTIKTEREQAAKNQGKKLK 226 >UniRef50_A3PYN5 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium sp. JLS RepID=A3PYN5_MYCSJ Length = 451 Score = 318 bits (817), Expect = 2e-85, Method: Composition-based stats. 
Identities = 89/438 (20%), Positives = 179/438 (40%), Gaps = 24/438 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-----GKFDTTD 58 G++P GW ++P+ V T+ ++ + + L ++ G D Sbjct: 18 GRVPSGWAVSPLKNVATVF----PSSVDKHSHDNEIPVQLCNYTDVYKNERISGALDFMK 73 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLI 117 P+ + + + D +I S + +G SA+ + G V+RP + Sbjct: 74 ATATPEEI--KKFTLKQGDTIITKDSETADDIGISAYVEETLPDVLCGYHLSVVRPLPGL 131 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ S + + + G + + D +NIP+PP EQ IA+ L+ A Sbjct: 132 DGRFVKRLFDSHYLKASMEVSANGLTRVGLGQYAIDNLNIPLPPPDEQLQIADFLEAETA 191 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV-------FKKLNFES 230 ++D+ A+ E + L+ R A + AV L +P +S + L Sbjct: 192 KIDALIAKQEHLIATLREDRTATITHAVTKGLDPTVDMVQPHNSELPACPKHWTLLISLK 251 Query: 231 ILTELRNGLSSKPNESGVG---HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 L E++ GL+ + P LR+++V+ V+ ++++ + SEL R+ L+DGD Sbjct: 252 RLAEVQTGLTLGKSVDPAEAVDVPYLRVANVQTSGVNLDEVKTVAVHRSELKRYLLRDGD 311 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L T G ++ +G G + + ++ + + R + +++ + ARN Sbjct: 312 VLMTE-GGDIDKLGR-GCVWSGEIAPCIHQNHVFAVRCSDALSGDFLVYLLDTAVARNYF 369 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 K T+ + + + LPP EQ EIV + + A D + + N + + Sbjct: 370 FMTAKKTTNLASTNSTTLGAFTFSLPPRAEQDEIVDHLNERCAGLDALIAKANAVITVLR 429 Query: 408 NLTQSILAKAFRGELTAQ 425 +++ A G++ + Sbjct: 430 EYRAALITDAVTGKIDVR 447 Score = 133 bits (336), Expect = 1e-29, Method: Composition-based stats. 
Identities = 34/236 (14%), Positives = 81/236 (34%), Gaps = 17/236 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI--LRISSVRAGHVDQNDIR 269 +W P L + + S + P+ + V + Sbjct: 15 EWLGRVPSGWAVSPLKNVATVF----PSSVDKHSHDNEIPVQLCNYTDVYKNERISGALD 70 Query: 270 FL--ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 F+ + E+ + L+ GD + T+ + + + +G+ +++ ++L L R Sbjct: 71 FMKATATPEEIKKFTLKQGDTIITKDSETADDIGISAYVEETLP-DVLCGYHLSVVRPLP 129 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 +++ F S + +M + G+ I + + LPP EQ +I +E Sbjct: 130 GLDGRFVKRLFDSHYLKASM-EVSANGLTRVGLGQYAIDNLNIPLPPPDEQLQIADFLEA 188 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + + +A + + + A + +P + + + L Sbjct: 189 ETAKIDALIAKQEHLIATLREDRTATITHAVT-------KGLDPTVDMVQPHNSEL 237 >UniRef50_Q1VAF2 Hypothetical type I restriction-modification system specificity determinant n=1 Tax=Vibrio alginolyticus 12G01 RepID=Q1VAF2_VIBAL Length = 464 Score = 316 bits (810), Expect = 1e-84, Method: Composition-based stats. 
Identities = 87/439 (19%), Positives = 173/439 (39%), Gaps = 30/439 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +P+ W + + ++Y +A IR ++ + D Sbjct: 26 DIPKDWCTRRLKHMLE--SPMSYGANEAAERAVSTEPRYIRITDMNSDGTLKEDTFRSLP 83 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEK-LIFSGFI 122 + + DI++A S + VGKS F +C F + + + + S + Sbjct: 84 KDIASDYLLKDRDILLARSGAT---VGKSFIYRKEFGDCCFAGYLIKVSCDSARLNSDYA 140 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLLAQVDS 181 F +SS Y IS A I N+ + + I +P + EQ IA LD A++D+ Sbjct: 141 FWFFQSSSYWQYISGSQIQATIQNVSAEKYGEMYISLPEHVEEQTQIANFLDHETAKIDT 200 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + +Q+ ++LK RQAV+ AV L +W P+H ++ + I Sbjct: 201 LIEKQQQLIKLLKEKRQAVISHAVTKGLNPQAPMKNSGVEWLGEVPEHWE--QIKLKHIT 258 Query: 233 TELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLL 289 ++ + G + + R ++VR G + + ++ + E R + + GD+L Sbjct: 259 HQIVDAEHKTAPYFDDGEYLVCRTTNVRDGKLRLDGGKYTNHAIYEEWTKRGQPEVGDIL 318 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMM 348 FTR G + L +++ +L + LPE++ S A + + Sbjct: 319 FTREAP----AGEACVYTGEVP--LCLGQRMVLFKLNQTRVLPEFVLHSIYSGLA-DDFV 371 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + S + DI++ + PP EQA+IV + ++ A D + + + + Sbjct: 372 KQLSQGSTVAHFNMSDIQNIPLFEPPKDEQAQIVDHLAKVLAKYDALTSSASLKIELMQE 431 Query: 409 LTQSILAKAFRGELTAQWR 427 ++++ A G++ + Sbjct: 432 RRTALISAAVTGKIDVRNW 450 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. 
Identities = 40/238 (16%), Positives = 91/238 (38%), Gaps = 19/238 (7%) Query: 210 TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVR-AGHVDQND 267 +W + P+ ++L + + G + + P +RI+ + G + ++ Sbjct: 20 DVEWLDDIPKDWCTRRLKHMLE-SPMSYGANEAAERAVSTEPRYIRITDMNSDGTLKEDT 78 Query: 268 IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 R L + + + L+D D+L R + VG + +K + + + LI+ Sbjct: 79 FRSLPKDIA--SDYLLKDRDILLARSGAT---VGKSFIYRK-EFGDCCFAGYLIKVSCDS 132 Query: 328 -DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP-VKEQAEIVRRV 385 +Y FF S S + + + + + +S + + LP V+EQ +I + Sbjct: 133 ARLNSDYAFWFFQSSSYWQYI-SGSQIQATIQNVSAEKYGEMYISLPEHVEEQTQIANFL 191 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + A DT+ ++ + + Q++++ A + NP + L Sbjct: 192 DHETAKIDTLIEKQQQLIKLLKEKRQAVISHAVT-------KGLNPQAPMKNSGVEWL 242 Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats. Identities = 42/211 (19%), Positives = 83/211 (39%), Gaps = 11/211 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + +T I +K Y D + R N+++GK + Sbjct: 243 GEVPEHWEQIKLKHITHQIVDAEHKT---APYFDDGEYLVCRTTNVRDGKLRLDGGKYTN 299 Query: 64 KNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFS 119 + +E K DI+ ++ G++ G + + + + Sbjct: 300 HAIYEEWTKRGQPEVGDILFTR----EAPAGEACVYTGEVPLCLGQRMVLFKLNQTRVLP 355 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ H S L + + LS G+ + + + I + PP EQ I + L +LA+ Sbjct: 356 EFVLHSIYSGLADDFVKQLSQGSTVAHFNMSDIQNIPLFEPPKDEQAQIVDHLAKVLAKY 415 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 D+ + ++++ R A++ AV GK+ Sbjct: 416 DALTSSASLKIELMQERRTALISAAVTGKID 446 >UniRef50_C7QRY1 Restriction modification system DNA specificity domain protein n=1 Tax=Cyanothece sp. PCC 8802 RepID=C7QRY1_CYAP0 Length = 456 Score = 316 bits (810), Expect = 1e-84, Method: Composition-based stats. 
Identities = 84/441 (19%), Positives = 165/441 (37%), Gaps = 26/441 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P+ W + + ++ I + N + + F D ++ Sbjct: 24 GDIPDSWEVKRLRYLSKKITAGPFGSNLTKNIYTSTGYKIYGQEQVIASDFSIGD-YYIS 82 Query: 64 KNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KLIFS 119 K KI+ DI+I+ GK A E L P + I S Sbjct: 83 KEKYDQMSQYKINSGDILISC----VGTFGKVAVVPKNIEQGIINPRLIKLIPITEYINS 138 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ KS + ++ LS G + I I +PIPPL EQ+ IA+ LD A++ Sbjct: 139 VYLEKLLKSVVAFEQMEKLSRGGTMGVINIGLLSDILLPIPPLPEQEKIAQFLDKETAKI 198 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D E++ ++LK R A++ AV L +W F P+H K+L + Sbjct: 199 DKLITLKERLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFIPEHWEVKRLKYIV 258 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLL 289 + ++ G P LR ++ +G +D +++ F+ +EL + K+ GDL+ Sbjct: 259 PNITVGIVVTPAKYYVESGIPCLRSVNISSGKIDNSNLVFISSQSNELHQKSKIYKGDLV 318 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 R + G ++ +IR + ++ + S + +N Sbjct: 319 LVRTGVT----GTAAIVTDNFDGANCVDLLIIR---NSRLILTLYLYYYLNSSTTSYQVN 371 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + Q + + ++ PP +EQ +I +++ D I + ++ + Sbjct: 372 NYSVGAIQAHYNTSTLSELIITFPPPQEQQKIAEYLDRKTEQIDQIINKTRESIEYLKEY 431 Query: 410 TQSILAKAFRGELTAQWRAEN 430 +++ A G++ + Sbjct: 432 RTVLISAAVTGKIDVRQWGGE 452 Score = 163 bits (413), Expect = 1e-38, Method: Composition-based stats. 
Identities = 39/234 (16%), Positives = 85/234 (36%), Gaps = 14/234 (5%) Query: 212 KWRNFEPQHSVFKKLN-FESILTELRNGLSSKPN-ESGVGHPILRISSVRAGHVDQNDIR 269 W P K+L +T G + N + G+ I V A D Sbjct: 21 DWLGDIPDSWEVKRLRYLSKKITAGPFGSNLTKNIYTSTGYKIYGQEQVIASDFSIGDYY 80 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + +++++K+ GD+L + G ++ K Q ++ P + +T+ Sbjct: 81 ISKEKYDQMSQYKINSGDILISCVG----TFGKVAVVPKNIEQGIINPRLIKLIPITEYI 136 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 Y+E S A M + + I+ + ++ +PP+ EQ +I + +++ Sbjct: 137 NSVYLEKLLKSVVAFEQMEKLSRGGT-MGVINIGLLSDILLPIPPLPEQEKIAQFLDKET 195 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + + ++++ A + NPD+ ++ L Sbjct: 196 AKIDKLITLKERLIELLKEKRTALISHAVT-------KGLNPDVPMKDSGVEWL 242 >UniRef50_A4VH87 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VH87_PSEU5 Length = 472 Score = 315 bits (809), Expect = 2e-84, Method: Composition-based stats. 
Identities = 133/493 (26%), Positives = 213/493 (43%), Gaps = 51/493 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP GW + + L G T K + +P + +++ + + Sbjct: 1 MS--ELPSGWTRFALKDLGGLSGGKTPSKAN-PEFWSTRDVPWVSPKDMKKNLLEDAEDR 57 Query: 61 FVPKNLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + + P +++ SG A + + VLRP + I Sbjct: 58 ISQNAVDEAGMTLYPSGSVLMVTRSGILQHTFPVALAGVEL--TVNQDIKVLRPIEGIVP 115 Query: 120 GFIAHFTKSSLYRNKISSLSA--GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F + KS + +I S + G + +I + +PPLAEQ IA+KLD LLA Sbjct: 116 KFSFYMLKS--FGAEILSACSKDGTTVQSIDSEKLETFLFSLPPLAEQTRIAQKLDELLA 173 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKK-------LNFES 230 QVD+ KAR + IP +LKRFRQ+VL AV+G+LTE+WR P ++ + + Sbjct: 174 QVDTLKARIDAIPALLKRFRQSVLAAAVSGRLTEEWRGSIPASESAEEYLSRVIQVRRQK 233 Query: 231 ILTELRNGLSSKPNESGVGHP--ILRISSVRAGHVDQNDIRF-LECSESELNRHK----- 282 + + + + + P + ++SV + + +R ++ E K Sbjct: 234 PIVKFKEPVPPDLETRELEVPEGWI-VASVSSFAECLDSMRVPVKKELRESGEGKYPYFG 292 Query: 283 -----------LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + D DL+ + + G + + + R Sbjct: 293 ANGEVDRVDEYIFDDDLVLVTEDETF--YGRVKPIAYKYSGKCWVNNHVHALRAHDAVAR 350 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y+ + T+G+ ++ + S + +PP EQ EIVRRVEQLFA+ Sbjct: 351 DYLCYVLMHYDVVPWL----TGTTGRAKLTQGALLSLPIQVPPATEQTEIVRRVEQLFAF 406 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAER 451 AD +E +VN A A ++ LTQSILAKAFRGEL Q + P A+ LLE+IKA+R Sbjct: 407 ADQLEARVNAAKACIDRLTQSILAKAFRGELVPQDPNDEP--------ASVLLERIKAQR 458 Query: 452 AASGGKKASRKKS 464 AA+ K RK S Sbjct: 459 AAAPKTKRGRKAS 471 >UniRef50_UPI0001695152 type I restriction enzyme specificity protein n=1 Tax=Xanthomonas oryzae pv. oryzicola BLS256 RepID=UPI0001695152 Length = 451 Score = 315 bits (808), Expect = 2e-84, Method: Composition-based stats. 
Identities = 121/468 (25%), Positives = 212/468 (45%), Gaps = 26/468 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 +LP GWV + + K + ++ ++P+ A G + + Sbjct: 2 VSELPGGWVETTIGEIC-----AMGPKSAWDDDMEIGFVPMSHAPTNFRGPLNYEARRWH 56 Query: 63 PKNLVKESQKISPEDIVIAMSSGSK--SVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + K +D++ A + A F + R + I Sbjct: 57 --EVKKAYTHFENDDVIFAKVTPCFENGKAALVAGLPNGAGAGSSEFHVLRRRDAGISPS 114 Query: 121 FIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ KS+ + + GA + + A + + +PP AEQK IA+KLD LLAQV Sbjct: 115 YLLAVIKSAQFLREGEENMTGAIGLRRVPRAFVENFPVRLPPEAEQKRIAQKLDALLAQV 174 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 D+ KAR + IP +LKRFRQ+V+ V+G L ++ + ++ + E + T++++G Sbjct: 175 DTFKARIDAIPALLKRFRQSVINHGVSGSL-ALDQHASFDTTTWRNMRAEDVCTKVQSGG 233 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDI-RFLECSESELN--RHKLQDGDLLFTRYNGS 296 + K + G P L++ ++ G ++ +++ + + + GD+L Sbjct: 234 TPKEGFTTEGIPFLKVYNIVDGIIEFEYRPQYIAADIHQGSCRKSITIPGDVLMNIVGPP 293 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 L G ++ + + + R ++ +I + + + K ++G Sbjct: 294 L---GKIAVVPQGVDE-WNINQAITLFRPSESISSAWIHLVLLEGTNIRRVSQETKGSAG 349 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q IS + V +PP + Q EIVRRVEQLFAYAD +E +V A R++ LTQS+LAK Sbjct: 350 QVNISLSQCRDFVFPVPPTQIQDEIVRRVEQLFAYADQLEAKVAAAQQRIDALTQSLLAK 409 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 AFRGEL Q ++ P A+ LL++I+A+RAA+ K RK + Sbjct: 410 AFRGELVPQDPSDEP--------ASVLLDRIRAQRAATPKPKRGRKAA 449 >UniRef50_P06187 Type-1 restriction enzyme StySJI specificity protein n=8 Tax=Enterobacteriaceae RepID=T1S_SALTY Length = 469 Score = 315 bits (807), Expect = 3e-84, Method: Composition-based stats. 
Identities = 177/480 (36%), Positives = 256/480 (53%), Gaps = 27/480 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPEGW + ++ + L K + + L ++P+ GK + Sbjct: 1 MSGGKLPEGWATSTINEMCNLN-----PKLKLDDDLDVGFMPMAGVPTTYLGKCNFETKK 55 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGA--FCGVLRPEKLIF 118 + + K + +D++ A + + P G+ + + LI Sbjct: 56 W--SEVKKGFTQFQNDDVIFAKITPCFENGKAVVIKEFPNGYGAGSTEYYVLRSINGLIN 113 Query: 119 SGFIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ K+ + + +G+ + + +P+PPLAEQK+IAEKLDTLLA Sbjct: 114 PHWLFALVKTKDFLTNGALNMSGSVGHKRVTKEFLENYGVPVPPLAEQKVIAEKLDTLLA 173 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF-----------EPQHSVFKKL 226 QVDSTKAR EQIPQILKRFRQ+V+ AVNG+LT++ P ++ Sbjct: 174 QVDSTKARLEQIPQILKRFRQSVIVAAVNGQLTKELHKKNKFKLTELNISIPSLWKISEI 233 Query: 227 NFESILTELRN-GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQ 284 + + + G P +R ++ G V +LE ++R+ + Sbjct: 234 GQFADVKGGKRLPKGESLIAENTGFPYIRAGQLKNGTVLPEGQLYLEEYIQKSISRYTVS 293 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 GDL T +G G++ + + L + L ++ ++ ++ S + Sbjct: 294 SGDLYITIVGAC---IGDAGIIPDVYNNANLTENAAKICNLNENIFNRFLSLWLRSSYLQ 350 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + + +K+ + Q ++ IKS ++LPP++EQ EIVRRVEQLFAYADTIEKQVNNAL Sbjct: 351 DIINSEIKSGA-QGKLALARIKSLPLILPPLQEQHEIVRRVEQLFAYADTIEKQVNNALT 409 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 RVN+LTQSILAKAFRGELTAQWRAENP+LISGENSAAALLEKIKAERAASGGKK SRKK+ Sbjct: 410 RVNSLTQSILAKAFRGELTAQWRAENPELISGENSAAALLEKIKAERAASGGKKTSRKKA 469 >UniRef50_A6CKF2 Putative type I restriction enzyme specificity protein n=1 Tax=Bacillus sp. SG-1 RepID=A6CKF2_9BACI Length = 454 Score = 313 bits (803), Expect = 8e-84, Method: Composition-based stats. 
Identities = 86/448 (19%), Positives = 187/448 (41%), Gaps = 30/448 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W + + I+G +K + D +P+I+ +I+NGK +D+ + Sbjct: 17 RVPEDWSEKKLKYLVETIKGYAFKSQ----LFGDKGVPIIKTTDIKNGKIQDSDIFIDER 72 Query: 65 NLVK-ESQKISPEDIVIAMSSG----SKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KLI 117 + ++ ++ DI+++ + S VG+ +E +LR + K I Sbjct: 73 FEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYEGALLNQNAVILRCKSKDI 132 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + F+ +F S YR + + G ++ +P+P Q I+E LD Sbjct: 133 TNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQHQISEFLDHKT 192 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 + V++ A +++ ++L+ RQA++ AV L KW P+H K+ Sbjct: 193 SDVETLIADKQKLIELLEEKRQAIVTEAVTRGLNPDVKMKDSGVKWIGDIPEHWDISKIK 252 Query: 228 FESILTELRNGLSSKPNESGVGHPIL-RISSVRAGHVDQNDIRFLECS-ESELNRHKLQD 285 + + + + +E P L + + G + + + SE +L++ Sbjct: 253 YSTYVKGRIGWQGLRSDEFIDEGPYLVTGTDFKDGIIHWDTCYHISEERYSEAPPIQLKE 312 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSAR 344 DLL T+ +G ++K + + + R K+ L +++ +S + Sbjct: 313 NDLLITKDG----TIGKVAIVKN-KPGKAILNSGIFVTRCQDKEYLTKFMYWILTSEVFK 367 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 N + ++T S K + + + LP ++EQ I +E D+++K++++ + Sbjct: 368 NYI-KYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVREIDSVKKEISDQIE 426 Query: 405 RVNNLTQSILAKAFRGELTAQWRAENPD 432 + QS++ +A G++ + E P Sbjct: 427 LLKEYRQSLIYEAVTGKIDLRDYQEVPS 454 Score = 164 bits (415), Expect = 8e-39, Method: Composition-based stats. 
Identities = 49/243 (20%), Positives = 92/243 (37%), Gaps = 13/243 (5%) Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD 264 W P+ KKL + + S G PI++ + ++ G + Sbjct: 6 FQNSADINWYERVPEDWSEKKLKYLVETIKGYAFKSQ--LFGDKGVPIIKTTDIKNGKIQ 63 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNG----SLEFVGVCGLLKKLQHQNLLYPDKL 320 +DI E E E +++ D+L + + VG G ++K LL + + Sbjct: 64 DSDIFIDERFEHEYKNVRVKKNDILMSTVGSKVEVTNSAVGQIGKVQKKYEGALLNQNAV 123 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 I +KD ++ F +S S R + T+ Q +S KDI + LP K Q + Sbjct: 124 ILRCKSKDITNNFLFYFLNSHSYRKYLDLFAHGTANQASLSLKDILDFKMPLPSRKIQHQ 183 Query: 381 IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSA 440 I ++ + +T+ + + Q+I+ +A R NPD+ ++ Sbjct: 184 ISEFLDHKTSDVETLIADKQKLIELLEEKRQAIVTEAVT-------RGLNPDVKMKDSGV 236 Query: 441 AAL 443 + Sbjct: 237 KWI 239 Score = 157 bits (398), Expect = 7e-37, Method: Composition-based stats. Identities = 47/220 (21%), Positives = 97/220 (44%), Gaps = 9/220 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +PE W I+ + +T ++G + + D+ L+ + ++G + Sbjct: 239 IGDIPEHWDISKIKY-STYVKGRIGWQGLRSDEFIDEGPYLVTGTDFKDGIIHWDTCYHI 297 Query: 63 PKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRP-EKLIF 118 + E+ ++ D++I +GK A + P + + V R +K Sbjct: 298 SEERYSEAPPIQLKENDLLITKD----GTIGKVAIVKNKPGKAILNSGIFVTRCQDKEYL 353 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F+ S +++N I + G+ I ++ +F + P+P + EQK I L+T + + Sbjct: 354 TKFMYWILTSEVFKNYIKYMETGSTIKHLYQETFVNFSYPLPNIEEQKAIEYFLETKVRE 413 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 +DS K ++LK +RQ+++ AV GK+ + P Sbjct: 414 IDSVKKEISDQIELLKEYRQSLIYEAVTGKIDLRDYQEVP 453 >UniRef50_Q5QX28 Restriction endonuclease S subunit n=1 Tax=Idiomarina loihiensis RepID=Q5QX28_IDILO Length = 448 Score = 312 bits (801), Expect = 1e-83, Method: Composition-based stats. 
Identities = 88/455 (19%), Positives = 177/455 (38%), Gaps = 43/455 (9%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LPE W + + V + G + + P+IR +I+N + + +++ V Sbjct: 20 LPERWKLIKLKLVCNIETGFAFPS----EVFGETGTPVIRITDIKNREINLSEIKRVDDL 75 Query: 66 LVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L+K K ++ DI++AM+ + +GK + + P I G++ Sbjct: 76 LLKSKPKRPSVNKGDIIMAMTGAT---IGKVGYYNSDKPSYLNQRVCRFIPAS-IDRGYL 131 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLA-EQKIIAEKLDTLLAQVDS 181 H S +Y+ I + G NI + P+P L EQ+ IA+ LD A++D+ Sbjct: 132 WHTLNSEIYKKYIELEAFGGAQANISDSQLLNFPAPLPELEAEQQKIAQFLDYETAKIDA 191 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 +++ ++LK RQAV+ AV L +W P+H KKL F S + Sbjct: 192 LIDEQKRLIELLKEKRQAVISHAVTKGLNPDAPMKDSGIEWLGEVPEHWEIKKLKFCSRM 251 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 + ++ + + ++ G S + + D+LF + Sbjct: 252 LSDKGKDNTNA---------ISLENIENG----TGAFIKTESNFDQEGVLFEPLDILFGK 298 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 L V +H + L ++ R KD PE++ S + + Sbjct: 299 LRPYLAKV-----YLAREHGSAL--GDILVFRANKDISPEFLFFRLISQEFIRQV-DQSS 350 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL-FAYADTIEKQVNNALARVNNLTQ 411 S + + IKS + +PP++EQ ++ + L F ++ + + Sbjct: 351 YGSKMPRANPELIKSLQIAVPPIEEQVKVSDYLANLQFNKIMPSVINASSLVKLLEERRS 410 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEK 446 ++++ A G++ + + +++A+ E+ Sbjct: 411 ALISAAVTGKIDVRDWQPPAGSDTVDSNASVQTER 445 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. 
Identities = 42/208 (20%), Positives = 75/208 (36%), Gaps = 20/208 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W I + + ++ A I NI+NG T + Sbjct: 234 GEVPEHWEIKKLKFCSRMLSDKGKDNTNA-----------ISLENIENG---TGAFIKTE 279 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 N +E P DI+ + V S V R K I F+ Sbjct: 280 SNFDQEGVLFEPLDILFGKLRPYLAKV-----YLAREHGSALGDILVFRANKDISPEFLF 334 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL-LAQVDST 182 S + ++ S G+ + P + I +PP+ EQ +++ L L ++ + Sbjct: 335 FRLISQEFIRQVDQSSYGSKMPRANPELIKSLQIAVPPIEEQVKVSDYLANLQFNKIMPS 394 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + ++L+ R A++ AV GK+ Sbjct: 395 VINASSLVKLLEERRSALISAAVTGKID 422 >UniRef50_Q466N9 Type I restriction-modification system specificity subunit n=2 Tax=cellular organisms RepID=Q466N9_METBF Length = 492 Score = 312 bits (801), Expect = 2e-83, Method: Composition-based stats. Identities = 106/489 (21%), Positives = 194/489 (39%), Gaps = 56/489 (11%) Query: 6 LPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP- 63 LP W + + + G T +R +IQN + + + + Sbjct: 18 LPNDWQWTRLGEIADNIQYGYTESSSDEPI-----GPKFLRITDIQNNEVNWKSVPYCEI 72 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGFI 122 N K++ + D+V A + + VGKS F F ++ +R + I F+ Sbjct: 73 DNTKKQNYLLKDGDLVFARTGAT---VGKSYLLKGDFPESVFASYLIRVRLLEEISESFV 129 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +F +S Y +I+ G N+ L+ +P+ PL EQ+ I K++ L +++D+ Sbjct: 130 YNFFQSLTYWKQITEGQVGIGQPNVNGTKLSLLIVPVAPLLEQRAIVSKIEQLFSELDNG 189 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV-------------------- 222 + + + LK +RQAVL A GKLT+KWR P Sbjct: 190 ISNLKLAQEQLKVYRQAVLKKAFEGKLTKKWREENPDVEDSKYVLNKIKNQISTQKKTKE 249 Query: 223 ----------------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 + ++ + + +G P ++ G P + IS++ +G +D + Sbjct: 250 IQDIQYGEVPYELPFKWNWVSLSDVSISITDGDHQAPPKADSGVPFIVISNISSGKLDMS 309 Query: 267 
DIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + ++ E + K Q D+L++ G+ L+ ++ + + R Sbjct: 310 ETMYVPEKYYENLAAKRKPQPRDILYSVTGSY----GIPILI--SENYRFCFQRHIALIR 363 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + +Y+ SP + Q + +++ V +PP+ EQ IV+ Sbjct: 364 PHMEISSKYLYYILKSPFVYKQATKVAT-GTAQLTVPLSGLRTIKVPIPPIAEQQAIVQE 422 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 +E + + IE+ + + L R L QSIL KAF G+L + A LL Sbjct: 423 IETRLSVCEKIEQDIKDNLERAEALRQSILKKAFEGKLLNEKELAEVRGAEDWEPAEVLL 482 Query: 445 EKIKAERAA 453 E+IKAE+A Sbjct: 483 ERIKAEKAR 491 Score = 151 bits (381), Expect = 8e-35, Method: Composition-based stats. Identities = 49/224 (21%), Positives = 97/224 (43%), Gaps = 12/224 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LP W +S V+ I ++ D +P I +NI +GK D ++ ++VP+ Sbjct: 261 ELPFKWNWVSLSDVSISITDGDHQAPPKA----DSGVPFIVISNISSGKLDMSETMYVPE 316 Query: 65 NLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + +K P DI+ +++ G + F ++RP I S + Sbjct: 317 KYYENLAAKRKPQPRDILYSVT----GSYGIPILISENYRFCFQRHIALIRPHMEISSKY 372 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + KS + + ++ G + + I +PIPP+AEQ+ I ++++T L+ + Sbjct: 373 LYYILKSPFVYKQATKVATGTAQLTVPLSGLRTIKVPIPPIAEQQAIVQEIETRLSVCEK 432 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGK-LTEKWRNFEPQHSVFK 224 + + + + RQ++L A GK L EK ++ Sbjct: 433 IEQDIKDNLERAEALRQSILKKAFEGKLLNEKELAEVRGAEDWE 476 >UniRef50_B6R0S6 Restriction modification system DNA specificity domain protein n=1 Tax=Pseudovibrio sp. JE062 RepID=B6R0S6_9RHOB Length = 492 Score = 310 bits (796), Expect = 6e-83, Method: Composition-based stats. 
Identities = 127/506 (25%), Positives = 219/506 (43%), Gaps = 66/506 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTD 58 MS +LPEGWV + + + RG + + ++ DD L I+ ++ +G + ++T+ Sbjct: 1 MS--ELPEGWVETEIENIYEVARGGSPRPIKSYLTADDDGLNWIKISDATSGGYRIESTE 58 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLI 117 + L K + I P D++++ S GK + C + + +K + Sbjct: 59 QKITSEGLHKT-RLIYPGDLLLSNS----MSFGKPYISAIEG-CIHDGWLVLGGFGKKCV 112 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ S + + ++G+ + N+ + + +P+ PLAEQK I K+++L A Sbjct: 113 DTRYMHLALSSEGVQKQFDEKASGSTVRNLNTGIVNSVRVPLAPLAEQKRIVAKIESLTA 172 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH------SVFKKLNFESI 231 + + +I + KR++QA+L A +G+LT WR + V K+ Sbjct: 173 KSRIARENLARIDTLTKRYKQAILKKAFSGELTADWREKSSKDCLIDLNDVLKEHEVIWQ 232 Query: 232 LTELRNGLSSKPNESG--------------------------------VGHPILRISSVR 259 + G ++PN G P + + V+ Sbjct: 233 NNIAKKGKYARPNVKPADDLRSWHELSLEGLAYVVDPHPSHRTPPKEIGGIPYVGVGDVK 292 Query: 260 -AGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLY 316 G +D R + + L R+ L+ GD + + +G LL + Q L Sbjct: 293 LDGKLDFAGARKVSPKVLKDHLKRYSLKRGDFAYGKIG----TIGQPFLLPEAQEYALSA 348 Query: 317 PDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK 376 LI+ R +K A E++ FF SP ++ + Q K ++ + LP + Sbjct: 349 NVILIQPR-SKFATAEFLYYFFLSPVVTQKILG-ASVATSQAAFGIKKMREVLTPLPSLS 406 Query: 377 EQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISG 436 EQ EIV R+E+ FA D + ++ AL V+ L + ILAKAFRGEL Q + P Sbjct: 407 EQNEIVTRIEKAFAKIDKLAEEAKRALHSVDRLDEKILAKAFRGELVPQDPDDEP----- 461 Query: 437 ENSAAALLEKIKAERAASGGKKASRK 462 A+ LLE+IKAERAA K +RK Sbjct: 462 ---ASVLLERIKAERAAQPKVKRARK 484 >UniRef50_C0QCH4 HsdS2 n=1 Tax=Desulfobacterium autotrophicum HRM2 RepID=C0QCH4_DESAH Length = 426 Score = 310 bits (794), Expect = 1e-82, Method: Composition-based stats. 
Identities = 78/429 (18%), Positives = 179/429 (41%), Gaps = 21/429 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +PE W + + + G+T + D +P R+ NI +G D+V+ Sbjct: 12 IGWIPEDWDCVKLGGIVNKVGSGITPRGG--SKVYCDKGVPFFRSQNILHGTVSVKDIVY 69 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIF 118 + +NL ++ + + P D+++ ++ S +G+ F+ + ++RP+ I Sbjct: 70 ISENLHQKMKNTHLQPADVLLNITGAS---IGRCCVFPNNFKKGNVNQHVCIIRPDGTIK 126 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S ++ S + + +I + AG N + +P+PPL EQ+ IA+ L T+ + Sbjct: 127 SQYLCSLLNSPIGQKQIWNFQAGGNREGLNFQQIRSFILPLPPLPEQQKIADVLSTVDDK 186 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + S + +Q Q+ K + +L + + + P KL + Sbjct: 187 ISSIDQQIQQTEQLKKGLMEKLLTEGIGHTEFKDTEIGQIPASWDVVKLKTICHRIFVGI 246 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN-RHKLQDGDLLFTRYNGS 296 S+ + + G PI+R +++ + +D+ + +E N KL GD++ R Sbjct: 247 ATSTSEHYTNDGIPIIRNQNIKENSISGDDLLKITNDFNEKNHSKKLMVGDIITARTGYP 306 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 G+ ++ K + + R + P Y+ + +S + +++ + Sbjct: 307 ----GMSCVIPKKFEGAQTFTTLVSRPN-KERIFPHYLSRYINSDIGKKIVLSNQAGGA- 360 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q+ ++ +K ++LPP++EQ +I + + D + + + L + ++ + Sbjct: 361 QQNLNAGRLKEIPIILPPLEEQKQIATILSSVDDKIDVLRSKKTS----YTTLKKGLMGQ 416 Query: 417 AFRGELTAQ 425 G++ + Sbjct: 417 LLTGQMRVK 425 Score = 154 bits (391), Expect = 6e-36, Method: Composition-based stats. 
Identities = 46/205 (22%), Positives = 87/205 (42%), Gaps = 11/205 (5%) Query: 2 SAGKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++P W + + T+ + G+ + + +D +P+IR NI+ DL+ Sbjct: 222 EIGQIPASWDVVKLKTICHRIFVGIATSTSE---HYTNDGIPIIRNQNIKENSISGDDLL 278 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-I 117 + + ++ S+K+ DI+ + G S FE + V RP K I Sbjct: 279 KITNDFNEKNHSKKLMVGDII----TARTGYPGMSCVIPKKFEGAQTFTTLVSRPNKERI 334 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F +++ + S + + + S AG N+ I I +PPL EQK IA L ++ Sbjct: 335 FPHYLSRYINSDIGKKIVLSNQAGGAQQNLNAGRLKEIPIILPPLEEQKQIATILSSVDD 394 Query: 178 QVDSTKARFEQIPQILKRFRQAVLG 202 ++D +++ + K +L Sbjct: 395 KIDVLRSKKTSYTTLKKGLMGQLLT 419 >UniRef50_A3PKU6 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A3PKU6_RHOS1 Length = 456 Score = 309 bits (792), Expect = 2e-82, Method: Composition-based stats. Identities = 82/443 (18%), Positives = 161/443 (36%), Gaps = 35/443 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PEGW + + + ++ + + +P++ +NI +G+ D + V Sbjct: 16 GEVPEGWEVKCLRMIADELQTGPFGSQLHTEDYVTAGVPIVNPSNILDGQIVPDDEIGVD 75 Query: 64 KNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPEKL-IFS 119 + + + P DI++ G + +G+ A G +R + Sbjct: 76 EATALRLANHALLPGDIIL----GRRGELGRCAVVPDGTMPLLCGTGSLRIRLKSSQALP 131 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 FIA ++ R +S S G+ ++N+ A I I +P L EQ+ I L+ A++ Sbjct: 132 DFIAECIRTPRVREWLSLQSVGSTMDNLNTAIVGKIQIALPSLPEQRAITAFLNRETAKI 191 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D+ ++ +L RQAVL AV L W P+ + + Sbjct: 192 DALVEEQRRLIALLAEKRQAVLNHAVTRGLNPDALLKPSGIDWLGDIPEGWEVVPIRKVA 251 Query: 231 ILTELRNGLSSKPNES-GVGHPILRISSV------RAGHVDQNDIRFLECSESELNRHKL 283 L S+P P ++ + R +V + E + L Sbjct: 252 RLESGHTPSRSRPEWWVDCHIPWFSLADIWQVRPGRVEYVYETAEAVSELGLQNSSARLL 311 Query: 284 
QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 G ++ +R VG ++ + + + R LP+Y+ Sbjct: 312 PAGTVMLSRT----ASVGFSAVMGIAMATTQDFANWVCGCR----LLPDYLLYCLR---G 360 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + +K S I DI++ + LPP++EQ IV V D + A+ Sbjct: 361 MPSEFERLKMGSTHNTIYMPDIRTLTIPLPPLEEQKAIVDHVRASVGALDELMDTATTAI 420 Query: 404 ARVNNLTQSILAKAFRGELTAQW 426 + ++++ A G++ + Sbjct: 421 TLLQERRAALISAAVTGKIDVRD 443 Score = 159 bits (404), Expect = 1e-37, Method: Composition-based stats. Identities = 48/236 (20%), Positives = 86/236 (36%), Gaps = 17/236 (7%) Query: 212 KWRNFEPQHSVFKKLNFE-SILTELRNGLS-SKPNESGVGHPILRISSVRAGHVDQNDIR 269 +W P+ K L L G + G PI+ S++ G + +D Sbjct: 13 EWLGEVPEGWEVKCLRMIADELQTGPFGSQLHTEDYVTAGVPIVNPSNILDGQIVPDDEI 72 Query: 270 FLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-K 327 ++ + + L H L GD++ R +G C ++ L L R RL Sbjct: 73 GVDEATALRLANHALLPGDIILGRRGE----LGRCAVVPDGTMPLLCGTGSL-RIRLKSS 127 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 ALP++I +P R + + S ++ + + LP + EQ I + + Sbjct: 128 QALPDFIAECIRTPRVREWL-SLQSVGSTMDNLNTAIVGKIQIALPSLPEQRAITAFLNR 186 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + ++ +A + Q++L A R NPD + + L Sbjct: 187 ETAKIDALVEEQRRLIALLAEKRQAVLNHAVT-------RGLNPDALLKPSGIDWL 235 Score = 137 bits (345), Expect = 1e-30, Method: Composition-based stats. 
Identities = 46/219 (21%), Positives = 81/219 (36%), Gaps = 18/219 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN------GKFDTT 57 G +PEGW + P+ V L G T + + D ++P +I T Sbjct: 236 GDIPEGWEVVPIRKVARLESGHTPSRS-RPEWWVDCHIPWFSLADIWQVRPGRVEYVYET 294 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS--FGAFCGVLRPEK 115 L S ++ P V+ + S VG SA + + F + R Sbjct: 295 AEAVSELGLQNSSARLLPAGTVMLSRTAS---VGFSAVMGIAMATTQDFANWVCGCR--- 348 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + ++ + + ++ L G+ N I + IP+PPL EQK I + + Sbjct: 349 -LLPDYLLYCLRGMP--SEFERLKMGSTHNTIYMPDIRTLTIPLPPLEEQKAIVDHVRAS 405 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR 214 + +D +L+ R A++ AV GK+ + Sbjct: 406 VGALDELMDTATTAITLLQERRAALISAAVTGKIDVRDL 444 >UniRef50_UPI0001855288 conserved hypothetical protein n=1 Tax=Francisella novicida FTG RepID=UPI0001855288 Length = 414 Score = 308 bits (791), Expect = 2e-82, Method: Composition-based stats. Identities = 94/425 (22%), Positives = 167/425 (39%), Gaps = 32/425 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 KLP GW + V + G K+ PL+ + N++N D T F Sbjct: 18 ELYKLPAGWEWKKLGEVFDVKDG-----THDSPKYKEIGYPLVTSKNLKNNSLDLTSCKF 72 Query: 62 VPKNLV---KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + + K+ D++ AM +G + + + +P Sbjct: 73 ISNDDFIKINQRSKVDKGDLLFAMI----GTIGSPTIVDFEPDFAIKN-VALFKPSNTYL 127 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + ++ S L K+ + GA + P+PPLAEQK I KLD+L + Sbjct: 128 IELLKYWLSSHLTTQKMLEEAKGATQKFVGLTYLRNFPAPLPPLAEQKRIVAKLDSLFEK 187 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 +D +Q + L KL ++ + + +R G Sbjct: 188 IDKAIELHQQNITNANTLMASALDKTF-KKLEREYSFKI----------LDCLSENIRYG 236 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 + K E G +RI+ + +++ +++ ++L+R+KL GD+L R + Sbjct: 237 YTDKAKEKGNA-RFIRITDINDQGKFKDESVYVDIKNTDLDRYKLLVGDILVARSGATA- 294 Query: 299 
FVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 G L + ++ LIR RL LP +I F S + N + + +K Q Sbjct: 295 --GKVALF--TLDEFSVFASYLIRIRLQIDKVLPSFIFYFCYSSNYWNQL-DQIKIGGAQ 349 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 ++ ++K+ + LPP+ Q + V ++ + D I++ L + L SIL KA Sbjct: 350 PNVNATNLKNIKIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKA 409 Query: 418 FRGEL 422 FRGEL Sbjct: 410 FRGEL 414 >UniRef50_B7K558 Restriction modification system DNA specificity domain protein n=2 Tax=Bacteria RepID=B7K558_CYAP8 Length = 453 Score = 308 bits (791), Expect = 2e-82, Method: Composition-based stats. Identities = 86/445 (19%), Positives = 170/445 (38%), Gaps = 37/445 (8%) Query: 4 GKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P+GW + + + + I G T K D + +R+ NI D+V++ Sbjct: 24 GDIPDGWEVKRLKWIVSKIGSGKTPKGG--AEIYSDSGIIFLRSQNIHFDGLRLDDVVYI 81 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAFCGVLRPE-KLIF 118 K++ K S ++ P DI++ ++ S +G+ F S +LRP I Sbjct: 82 NKDIDKAMSSSRVKPLDILLNITGAS---LGRCMIIPKDFPSSNVNQHVCILRPIVTRIN 138 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ S+ +N+I S G + + A + P L EQ+ IA+ LD A+ Sbjct: 139 PYFLNRVMSSNAIQNQIFSSEVGVSREGLTFAQAGNLISVFPSLPEQEKIAQFLDEETAK 198 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D +++ ++LK R A++ AV L +W F P+H KK+ Sbjct: 199 IDKLITHKQRLIELLKEKRTALISHAVTKGLNPDVPMKDSGVEWLGFIPEHWEVKKIKRL 258 Query: 230 SILTELRNG---LSSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQD 285 S++ + + + +RIS V + + L + LQ Sbjct: 259 SLVKRGASPRPIDDPIYFDDNGEYVWVRISDVTASNKYLLEAEQKLS-EIGKRKSVPLQP 317 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 +L + VG + + + ++ + L + EY+ F Sbjct: 318 NELFLSI----CASVGKPII---TKIKCCIHDGFVYFPELKE--NREYLYYIFLGGELYK 368 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + Q ++ + I + +PPV EQ +I +++ D I K+ ++ Sbjct: 369 
GLGKM----GTQLNLNTEIIGDVKLPIPPVSEQQKIAEYLDEKTEQIDPIIKKTRESIEY 424 Query: 406 VNNLTQSILAKAFRGELTAQWRAEN 430 + ++++ A G++ + Sbjct: 425 LKEYRTALISAAVTGKIDVRQWGCE 449 Score = 161 bits (409), Expect = 5e-38, Method: Composition-based stats. Identities = 41/261 (15%), Positives = 103/261 (39%), Gaps = 27/261 (10%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSK---PNESGVGHPILRISSVRAGHVDQNDI 268 + P K+L I++++ +G + K S G LR ++ + +D+ Sbjct: 21 DFLGDIPDGWEVKRLK--WIVSKIGSGKTPKGGAEIYSDSGIIFLRSQNIHFDGLRLDDV 78 Query: 269 RFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 ++ + ++ +++ D+L SL G C ++ K + + I + Sbjct: 79 VYINKDIDKAMSSSRVKPLDILLNITGASL---GRCMIIPKDFPSSNVNQHVCILRPIVT 135 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 P ++ SS + +N + + + ++G++ + + + P + EQ +I + +++ Sbjct: 136 RINPYFLNRVMSSNAIQNQIFS-SEVGVSREGLTFAQAGNLISVFPSLPEQEKIAQFLDE 194 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL---- 443 A D + + + ++++ A + NPD+ ++ L Sbjct: 195 ETAKIDKLITHKQRLIELLKEKRTALISHAVT-------KGLNPDVPMKDSGVEWLGFIP 247 Query: 444 ----LEKIKAERAASGGKKAS 460 ++KIK R + + AS Sbjct: 248 EHWEVKKIK--RLSLVKRGAS 266 >UniRef50_A7IEA1 Restriction modification system DNA specificity domain n=1 Tax=Xanthobacter autotrophicus Py2 RepID=A7IEA1_XANP2 Length = 450 Score = 308 bits (789), Expect = 4e-82, Method: Composition-based stats. 
Identities = 112/464 (24%), Positives = 207/464 (44%), Gaps = 27/464 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ---NGKFDTTDLVF 61 ++P W+ A V ++ G T N+ K +P + ++ Sbjct: 7 QVPHSWLWASFGEVADIVGGGTPPTGDEANFTK-QGVPWLTPADLTGYRETYISRGRRDL 65 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 K + + ++ P+ V+ S VG A S + I + Sbjct: 66 SEKGYRESAARLLPKGTVLFSSRA---PVGYCAIAS--ENVSTNQGFKSFILKGDISPEY 120 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + H+ S S ++G + + + +P+PPL EQ+ I K+D+L A+ Sbjct: 121 VRHYLLGST--EYAESKASGTTFKELSGSRATELALPLPPLPEQRRIVAKIDSLTAKSRR 178 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + E IP+++++++QA+L A +G+LTE + + + + F +NGL Sbjct: 179 ARDHLEHIPRLVEKYKQAILAAAFDGRLTELSPHDIVHPELGELIEFG-----PQNGLYL 233 Query: 242 KPNESGVGHPILRISSVRAGHVD-QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + G G PILRI + +D + + S++ + + DGDL+ R N S + Sbjct: 234 PKDRYGEGTPILRIQNYGFNFIDEPTNWHRVTVSDAIAAQFAMSDGDLIINRVN-SPSHL 292 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G ++ K ++ ++R RL A P++++++ SS R ++ K Q I Sbjct: 293 GKSMVVTKAM-AGAIFESNMMRIRLNALAEPKFVQLYLSSSQGRGSLTKDAKWAVNQASI 351 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + D+ V LP + +Q ++ R+E FA+ D + + +A ++ L Q++LAKAFRG Sbjct: 352 NQGDVSRTPVPLPGLSDQIAVLDRIETAFAWIDRLAAEATSARTLIDRLDQAVLAKAFRG 411 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 EL Q A+ P A+ LLE+I+AER A+ + R+ + Sbjct: 412 ELVPQDPADEP--------ASVLLERIRAERGAAPKARRGRRPA 447 >UniRef50_A6C679 Type I restriction-modification system, S subunit n=1 Tax=Planctomyces maris DSM 8797 RepID=A6C679_9PLAN Length = 450 Score = 307 bits (787), Expect = 6e-82, Method: Composition-based stats. 
Identities = 87/445 (19%), Positives = 169/445 (37%), Gaps = 41/445 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD------TT 57 GK+PE W + + + + +D LP+++ + I +G D + Sbjct: 18 GKVPEHWDVFRMGILF-----------AEVAESGNDDLPVLQVS-IHHGVSDRELSESES 65 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 D + + +++ P D+V M + G E V RP+ Sbjct: 66 DRKITRIDDKSKYKRVVPNDLVYNMMRAWQGGFGTVKV-----EGMVSPAYVVARPKIDF 120 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + FI H ++ ++ S G + F + + +P +EQ+ I + +D Sbjct: 121 QTQFIEHLFRTPQAIEQMRRYSHGVTDFRLRLYWDKFKNVRVALPDKSEQQEICDYIDVE 180 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 +++D+ A ++ ++LK RQAV+ AV L +W P+H L Sbjct: 181 TSKIDALVAEQRRLIELLKEKRQAVISHAVTKGLNPNAPMKDSGIEWLGDVPEHWEVCSL 240 Query: 227 NFESILTELRNGLSSKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKL 283 + + G G L ++ G +D + +F+ + + LNR K Sbjct: 241 RRYAFFVDGDRGSEYPNENDLTSDGILFLSSKNIVGGKLDLKESKFISHEKFDALNRGKA 300 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLK--KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSP 341 QDGD L + GS +G L + +++ R P+Y+ S Sbjct: 301 QDGD-LIVKVRGSTGRIGEMALFDVGAYSFETAFINAQMMIIRTGNKLTPKYLSKVSQSI 359 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 + + + Q+ +S K V +PPV EQAEI ++ D++E + Sbjct: 360 YWMEQL-SVGAYGTAQQQLSNKVFSDLFVTMPPVTEQAEIADFIDLKVGEFDSLETEAEQ 418 Query: 402 ALARVNNLTQSILAKAFRGELTAQW 426 A+ + ++++ A G++ + Sbjct: 419 AIELLQERRTALISAAVTGKINVRD 443 Score = 133 bits (336), Expect = 1e-29, Method: Composition-based stats. 
Identities = 35/244 (14%), Positives = 79/244 (32%), Gaps = 32/244 (13%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD-- 264 + +W P+H ++ + P+L++S + G D Sbjct: 10 KESGIEWLGKVPEHWDVFRMGIL---------FAEVAESGNDDLPVLQVS-IHHGVSDRE 59 Query: 265 ---QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI 321 R + + + ++ DL++ G + + ++ P ++ Sbjct: 60 LSESESDRKITRIDDKSKYKRVVPNDLVYNMMRAWQGGFGTVKV------EGMVSPAYVV 113 Query: 322 RARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG--QKGISGKDIKSQVVLLPPVKEQA 379 R D ++IE F +P A M + + K+ V LP EQ Sbjct: 114 A-RPKIDFQTQFIEHLFRTPQAIEQM-RRYSHGVTDFRLRLYWDKFKNVRVALPDKSEQQ 171 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENS 439 EI ++ + D + + + + Q++++ A + NP+ ++ Sbjct: 172 EICDYIDVETSKIDALVAEQRRLIELLKEKRQAVISHAVT-------KGLNPNAPMKDSG 224 Query: 440 AAAL 443 L Sbjct: 225 IEWL 228 >UniRef50_C6MBL0 Restriction modification system DNA specificity domain protein n=1 Tax=Nitrosomonas sp. AL212 RepID=C6MBL0_9PROT Length = 467 Score = 305 bits (782), Expect = 2e-81, Method: Composition-based stats. 
Identities = 83/449 (18%), Positives = 176/449 (39%), Gaps = 32/449 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G+ P W + V + ++ D+ LI ++ + + + Sbjct: 26 IGEYPLNWNLTRVK-FESYVKARVGWHGLKSEDFTDEGPFLITGSDFRGPVINWNECYHC 84 Query: 63 PKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRP-EKLIF 118 ++ ++ D++I +GK A L + + + V+RP Sbjct: 85 DLARYEQDPYIQLKDGDLLITKD----GTIGKVALVSGLAGKATLNSGVFVVRPLTNNYT 140 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F ++S++ + G+ I ++ +F IP EQ IA LD A+ Sbjct: 141 SRFYFWLLQASVFTGFVDFNKTGSTIVHLYQDTFVNFKYAIPSFNEQLTIANFLDHETAK 200 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFK----- 224 +D+ + +Q+ ++LK RQAV+ AV L +W P+H K Sbjct: 201 IDTLIEKQQQLIKLLKEKRQAVISHAVTKGLNPNAKMRDSGVEWLGEVPEHWSMKIKLVS 260 Query: 225 --KLNFESILTELRNGLSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRH 281 + + S + VG P++ I ++ G++ ++ + E +L Sbjct: 261 VAEGSRGSFVNGPFGSDLLSLELQDVGVPVIYIRDLKQTGYMRKSAVCVTEEKARQLEIC 320 Query: 282 KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSS 340 K+ GD+L + G + + + ++ D +IR R+ + P Y+ + +S Sbjct: 321 KVVSGDVLIAKVGDPP---GEACIYPENEPAAIITQD-VIRIRVNRGVINPYYLVMLLNS 376 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + +++ + S +K IS D K ++P + EQ++IV VE DT+ + Sbjct: 377 DLGKV-VVDNISIESTRKRISLGDFKQVRFIIPSLSEQSDIVSFVELRCRKIDTLIAKAQ 435 Query: 401 NALARVNNLTQSILAKAFRGELTAQWRAE 429 + ++ + ++++ A G++ + Sbjct: 436 SMVSLIIERRTALISAAVTGKIDVRDWQP 464 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. 
Identities = 39/234 (16%), Positives = 82/234 (35%), Gaps = 14/234 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI-LRISSVRAGHVDQNDIRF 270 +W P + ++ FES + K + P + S R ++ N+ Sbjct: 24 EWIGEYPLNWNLTRVKFESYVKARVGWHGLKSEDFTDEGPFLITGSDFRGPVINWNECYH 83 Query: 271 LECSESELNRHK-LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + + E + + L+DGDLL T+ +G L+ L + L + LT + Sbjct: 84 CDLARYEQDPYIQLKDGDLLITKDG----TIGKVALVSGLAGKATLNSGVFVVRPLTNNY 139 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 + + ++ KT S + + +P EQ I ++ Sbjct: 140 TSRFYFWLLQASVF-TGFVDFNKTGSTIVHLYQDTFVNFKYAIPSFNEQLTIANFLDHET 198 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A DT+ ++ + + Q++++ A + NP+ ++ L Sbjct: 199 AKIDTLIEKQQQLIKLLKEKRQAVISHAVT-------KGLNPNAKMRDSGVEWL 245 >UniRef50_C3RBV6 Type I restriction-modification system n=3 Tax=Bacteroides RepID=C3RBV6_9BACE Length = 423 Score = 305 bits (782), Expect = 3e-81, Method: Composition-based stats. Identities = 89/432 (20%), Positives = 160/432 (37%), Gaps = 35/432 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W +S V +I T +Y + L ++ ++ NG T Sbjct: 16 IGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDLNNGLITETSKKIT 75 Query: 63 PKNLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 PK + + K P +VIAM + VG L E + C ++ P K I + Sbjct: 76 PKAVDECKMKFYPIHSVVIAMYGATIGKVG-----LLDIETATNQACCIIVPSKRICPKY 130 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + ++ S G NI + +P+PPL+EQ+ IA LD ++D Sbjct: 131 TFYSF--IIAKEELLLSSFGGGQPNISQDIIRKLKVPVPPLSEQQSIASYLDVKTEKIDK 188 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A+ E+ + L +Q+++ AV L W P H L F L Sbjct: 189 MIAKAEKKIEYLGELKQSLITRAVTRGLNPNTPLKDSGVNWIGNIPMHWDIACLRFFLRL 248 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 R S + +LR+ + ND + E E +++ DLL+ Sbjct: 249 INGRA-YSQNELLPSGKYKVLRVGN-----FFTNDSWYYSNMELEPDKY-CDKDDLLYA- 300 Query: 293 
YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 VG + + + +Y + + +L Y + N M+ + Sbjct: 301 ---WSASVG-PYIWNEAKT---IYHYHIWKVQLATSMDKMYSYYLLR--AVTNQKMSDM- 350 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 S I+ D+ + +PP+ EQ +I ++ + D I +A + L QS Sbjct: 351 HGSTMMHITMGDMNKTKIPIPPLSEQQQIATYLDTKCSKIDHIIATQKKKIAYLQELKQS 410 Query: 413 ILAKAFRGELTA 424 ++ G++ Sbjct: 411 LITNVVTGKIKV 422 Score = 169 bits (430), Expect = 1e-40, Method: Composition-based stats. Identities = 40/234 (17%), Positives = 72/234 (30%), Gaps = 19/234 (8%) Query: 212 KWRNFEPQHSV-FKKLNFESILTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIR 269 KW P H K I+ LSS+ + G L+ + G + + + Sbjct: 14 KWIGEIPNHWEAIKISRVHPIIGSGTTPLSSREDYYSEKGLNWLQTGDLNNGLITETSKK 73 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + E ++ Y ++ VG L +K Sbjct: 74 ITPKAVDECKMKFYPIHSVVIAMYGATIGKVG-------LLDIETATNQACCIIVPSKRI 126 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 P+Y S + GQ IS I+ V +PP+ EQ I ++ Sbjct: 127 CPKYTFY---SFIIAKEELLLSSFGGGQPNISQDIIRKLKVPVPPLSEQQSIASYLDVKT 183 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + + L QS++ +A R NP+ ++ + Sbjct: 184 EKIDKMIAKAEKKIEYLGELKQSLITRAVT-------RGLNPNTPLKDSGVNWI 230 >UniRef50_D1J921 Putative type I restriction enzyme, DNA specificity subunit n=1 Tax=uncultured archaeon RepID=D1J921_9ARCH Length = 445 Score = 304 bits (780), Expect = 4e-81, Method: Composition-based stats. 
Identities = 75/442 (16%), Positives = 156/442 (35%), Gaps = 39/442 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-- 60 G++PE W P+ V ++ G + Y + P +RA NI K DT D+ Sbjct: 16 IGEIPEHWEAKPIKYVGDIVLGKMLTPDDKEGYFRK---PYLRAQNITWEKVDTEDIKEM 72 Query: 61 -FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIF 118 F K L ++ D++++ VG++A EC + + Sbjct: 73 WFSEKEL--SQYRLKENDLLVS----EGGEVGRTAIWQNELNECYIQNSVHKITIKSKNN 126 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + + + S+ +I ++ I P EQ+ IA LD Q Sbjct: 127 PHYYLYHFQIYGKTGYFDSIVNRVSIAHLTREKLKEIMFLSPTFHEQQTIANYLDRKTHQ 186 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D+ +++ +LK R A++ AV L +W P+H +K+ Sbjct: 187 IDTFIENKQKLIDLLKEQRAAIINQAVTKGLNPNVKLKDSGIEWLGEIPEHWELRKVGR- 245 Query: 230 SILTELRNGLSSKPN----ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 + +G + K + + G +D+ + E + E + K+ Sbjct: 246 -SFNLIGSGTTPKSENIGYYENGTINWVITGDLNDGILDKTSKKITEKALDEYSTLKIYP 304 Query: 286 -GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 G LL Y ++ + L + + E+ +F + Sbjct: 305 VGTLLIAMYGATIGKI-------SLMNFEGCVNQACCALSNSPYLSNEFSFYWFLAN--- 354 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + GQ IS + ++S + PP EQ I+ +++ D + ++ + Sbjct: 355 KQNIINMSFGGGQPNISQEVVRSLKIPTPPSSEQQAIIYHLDEQTTRIDKLMERQGRQIE 414 Query: 405 RVNNLTQSILAKAFRGELTAQW 426 + +++++ G++ + Sbjct: 415 HLKEYRTTLISEVVTGKIDVRD 436 Score = 161 bits (408), Expect = 6e-38, Method: Composition-based stats. 
Identities = 49/215 (22%), Positives = 88/215 (40%), Gaps = 9/215 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + V LI T K + I Y ++ + + ++ +G D T Sbjct: 232 GEIPEHWELRKVGRSFNLIGSGTTPKSENIGYYENGTINWVITGDLNDGILDKTSKKITE 291 Query: 64 KNLVK-ESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 K L + + KI P ++IAM + + + FE C L + + F Sbjct: 292 KALDEYSTLKIYPVGTLLIAMYGATIGKI-----SLMNFEGCVNQACCALSNSPYLSNEF 346 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 ++ ++ + I ++S G NI + IP PP +EQ+ I LD ++D Sbjct: 347 SFYWFLAN--KQNIINMSFGGGQPNISQEVVRSLKIPTPPSSEQQAIIYHLDEQTTRIDK 404 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF 216 R + + LK +R ++ V GK+ + Sbjct: 405 LMERQGRQIEHLKEYRTTLISEVVTGKIDVRDYGI 439 Score = 156 bits (394), Expect = 2e-36, Method: Composition-based stats. Identities = 39/234 (16%), Positives = 87/234 (37%), Gaps = 18/234 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH--PILRISSVRAGHVDQNDIR 269 +W P+H K + + + G P++ P LR ++ VD DI+ Sbjct: 14 EWIGEIPEHWEAKPIKYVGDIVL---GKMLTPDDKEGYFRKPYLRAQNITWEKVDTEDIK 70 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + SE EL++++L++ DLL + VG + + ++ + + + + Sbjct: 71 EMWFSEKELSQYRLKENDLLVSEGGE----VGRTAIWQNELNE-CYIQNSVHKITIKSKN 125 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 P Y F + + + ++ + +K + L P EQ I +++ Sbjct: 126 NPHYYLYHFQ-IYGKTGYFDSIVNRVSIAHLTREKLKEIMFLSPTFHEQQTIANYLDRKT 184 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 DT + + + +I+ +A + NP++ ++ L Sbjct: 185 HQIDTFIENKQKLIDLLKEQRAAIINQAVT-------KGLNPNVKLKDSGIEWL 231 >UniRef50_Q3J7Q5 Restriction endonuclease S subunits-like n=2 Tax=Nitrosococcus oceani RepID=Q3J7Q5_NITOC Length = 487 Score = 304 bits (779), Expect = 6e-81, Method: Composition-based stats. 
Identities = 86/450 (19%), Positives = 172/450 (38%), Gaps = 25/450 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDL-V 60 G++P W + P + T G + + A + ++R+ + +G ++ TD V Sbjct: 42 IGEVPSFWEVKPFKWLLTHNEGGVWGDDPA----GEGDTIVLRSTDQTVDGNWNVTDPAV 97 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE---CSFGAFCGVLRPEKLI 117 S + D+V+ SSGS +GK+ ++ +G F LR + Sbjct: 98 RHLTVKENASAVLEAGDLVVTKSSGSALHIGKTTLVNVDMAKLGYCYGNFMQRLRLGQKY 157 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + L R +++ LS + N+ I +P+PP+ EQ IA LD Sbjct: 158 IPKLAWYVMNNDLVRLQLNLLSNSTTGLANLNATLIGEILLPVPPVEEQTQIARFLDHET 217 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 A++D+ +++ ++LK RQA++ AV L +W P H + K L Sbjct: 218 ARIDALIEEQQRLIELLKEKRQAIISHAVTKGLDPTVPMKDSGVEWLGEVPAHWITKPLK 277 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + L ++G +E P + ++ G + ++ RF+ ++ +DGD Sbjct: 278 HLAELNPKKSGYHGDRDELCSFVP---MEKLKTGVIQLDEERFI--ADVISGYTYFEDGD 332 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L + E + L + ++ R D ++ Sbjct: 333 VLQAKVTPCFENR-NIAIADGLTNGVGFGSSEINVLRPFPDVNASFLYYRLQEDGYMGIC 391 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + G K + G+ I V +P EQ +I ++ A D + ++ N + + Sbjct: 392 TASMIGAGGLKRVPGEVINGFTVAVPERHEQTQIAHFLDHETARVDKLVEEANVGIELLK 451 Query: 408 NLTQSILAKAFRGELTAQWRAENPDLISGE 437 ++++ A G++ + S E Sbjct: 452 ERRSALISAAVTGKIDVRGWQPPASAPSPE 481 Score = 166 bits (422), Expect = 1e-39, Method: Composition-based stats. 
Identities = 44/233 (18%), Positives = 91/233 (39%), Gaps = 11/233 (4%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS-VRAGHVDQNDIRFL 271 W P K F+ +LT G+ +LR + G+ + D Sbjct: 41 WIGEVPSFWEVK--PFKWLLTHNEGGVWGDDPAGEGDTIVLRSTDQTVDGNWNVTDPAVR 98 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ-NLLYPDKLIRARLTKDAL 330 + E L+ GDL+ T+ +GS +G L+ + Y + + R RL + + Sbjct: 99 HLTVKENASAVLEAGDLVVTKSSGSALHIGKTTLVNVDMAKLGYCYGNFMQRLRLGQKYI 158 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 P+ ++ R + +T+G ++ I ++ +PPV+EQ +I R ++ A Sbjct: 159 PKLAWYVMNNDLVRLQLNLLSNSTTGLANLNATLIGEILLPVPPVEEQTQIARFLDHETA 218 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q+I++ A + +P + ++ L Sbjct: 219 RIDALIEEQQRLIELLKEKRQAIISHAVT-------KGLDPTVPMKDSGVEWL 264 >UniRef50_Q0EXK2 HsdS protein n=1 Tax=Mariprofundus ferrooxydans PV-1 RepID=Q0EXK2_9PROT Length = 462 Score = 303 bits (777), Expect = 8e-81, Method: Composition-based stats. Identities = 78/441 (17%), Positives = 169/441 (38%), Gaps = 26/441 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P WV+ ++ L T KK + + ++P ++ ++ + Sbjct: 20 GEIPAHWVLTRTKYISEL----TPKKPKISRDKECSFIP---MEKLKTDSIVLDEVRTID 72 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSGFI 122 ++ + D+++A + + Q L FG+ V+R + + + F+ Sbjct: 73 -DVYDGYTYFADSDVLMAKVTPCFENKNIAIAQDLVNGVGFGSSEIYVIRANQRVSNRFL 131 Query: 123 AHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + + GA + + + +P EQ IA LD A++D+ Sbjct: 132 FYRLQEDSFMEIAIAAMTGAGGLKRVPSDVLNNYIAAVPQHDEQMEIANFLDRETAKIDT 191 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + +Q+ ++LK RQAV+ AV L +W P H L FE + Sbjct: 192 LIEKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMRNSGIEWLGEVPAHWEISSLGFECSV 251 Query: 233 TELRNGLSSKP-NESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLF 290 K G+ L +++ +D ++ ++ + E L +GD+L Sbjct: 252 KARLGWKGLKAEEYVDEGYIFLATPNIKGEKIDFENVNYITKARYDESPEIMLNEGDVLV 311 Query: 291 
TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 T+ + G ++++L + R Y+ FF S +N ++ Sbjct: 312 TKDGSTT---GTTNIVREL-PSPATVNSSIAVLRSVGRIDSSYLYYFFVSTYVQN-VIKR 366 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 ++ G + D++ VL+PP KEQ EI ++ D + + ++ + Sbjct: 367 IQGGMGVPHLFQADLRKFNVLMPPFKEQKEIAAEIDMRLPKFDDLIAKAEYSILLMKERR 426 Query: 411 QSILAKAFRGELTAQWRAENP 431 ++++ A G++ + +P Sbjct: 427 TALISAAVTGKIDVRHHVSHP 447 Score = 141 bits (357), Expect = 5e-32, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 78/232 (33%), Gaps = 15/232 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H V + + S LT + +S + + ++ + +++R + Sbjct: 17 EWLGEIPAHWVLTRTKYISELTPKKPKIS-----RDKECSFIPMEKLKTDSIVLDEVRTI 71 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + D D+L + E + + L + ++ R + Sbjct: 72 DDVYDGYTYFA--DSDVLMAKVTPCFENK-NIAIAQDLVNGVGFGSSEIYVIRANQRVSN 128 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ S + + G K + + + + +P EQ EI +++ A Sbjct: 129 RFLFYRLQEDSFMEIAIAAMTGAGGLKRVPSDVLNNYIAAVPQHDEQMEIANFLDRETAK 188 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 DT+ ++ + + Q++++ A + NPD + L Sbjct: 189 IDTLIEKQQQLIKLLKEKRQAVISHAVT-------KGLNPDAPMRNSGIEWL 233 >UniRef50_UPI0001C36A8C HsdS1 n=1 Tax=Clostridium hathewayi DSM 13479 RepID=UPI0001C36A8C Length = 456 Score = 303 bits (776), Expect = 1e-80, Method: Composition-based stats. 
Identities = 93/446 (20%), Positives = 194/446 (43%), Gaps = 23/446 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P W + V +I G T K + Y P + ++ G+ F+ + Sbjct: 30 EVPGNWCWVRLKDVAFVITGGTPSKNKPEYY--GGTFPFFKPADLDYGRNMVAASEFLSE 87 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 S+ I + + +GK + + + P+ + S F+ + Sbjct: 88 EGKAVSRCIPAKSTAVCCI----GSIGKCGYLCVD--GTTNQQINSAIPK--VNSLFLYY 139 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + + L+ ++ ++ I+ + + + P+PPL EQ+ IA ++ + ++D K Sbjct: 140 YCNTILFTKQLRLKASATTISIVNKSKMEQCLFPLPPLREQQRIANHIEEMFYKLDEIKE 199 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + + + + + + A+L A +G LT KWR + S + L+ GL Sbjct: 200 KTQLVLESSEDRKAAILYKAFSGALTAKWRKHKGVSFEGWITKPLSEVATLQTGLMKGKR 259 Query: 245 ESGVGH--PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 + P LR+++V+ G++D +I+ +E ++ R++L+ GD+LFT G + +G Sbjct: 260 NNQKTVLLPYLRVANVQDGYLDLKEIKNIEVDVLKIERYRLKKGDVLFTE-GGDFDKLGR 318 Query: 303 CGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 + + + + ++ + + R T P ++ + S + + C K T+ I+ Sbjct: 319 SSVWNE-EIPDCIHQNHIFVVRTQTDTLDPYFLSLQAGSRYGKTYFIGCSKQTTNLASIN 377 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 +K+ VL+P ++EQ EIV + + I++ L ++ + +SIL++AFRGE Sbjct: 378 STQLKNFPVLIPTIEEQREIVNILNFFLGKEEQIKQNCLKLLEKIEEIKKSILSRAFRGE 437 Query: 422 LTAQWRAENPDLISGENSAAALLEKI 447 L E S+ LL+ I Sbjct: 438 LGTNNPDEE--------SSIELLKTI 455 Score = 141 bits (355), Expect = 7e-32, Method: Composition-based stats. 
Identities = 42/244 (17%), Positives = 89/244 (36%), Gaps = 19/244 (7%) Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + E+ + ++ +QA++ E+W P + + +L + + +KP Sbjct: 5 KKKEENLTLEEKLKQALVPE-------EEWPYEVPGNWCWVRLKDVAFVITGGTPSKNKP 57 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P + + + G FL ++R + + +G C Sbjct: 58 EYYGGTFPFFKPADLDYGRNMVAASEFLSEEGKAVSRCIPAKSTAVCC-----IGSIGKC 112 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 G L N + ++ + ++ + + ++ Sbjct: 113 GYLCVDGTTNQQINSAI------PKVNSLFLYYYCNTILFTKQLRLKAS-ATTISIVNKS 165 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++ + LPP++EQ I +E++F D I+++ L + +IL KAF G LT Sbjct: 166 KMEQCLFPLPPLREQQRIANHIEEMFYKLDEIKEKTQLVLESSEDRKAAILYKAFSGALT 225 Query: 424 AQWR 427 A+WR Sbjct: 226 AKWR 229 >UniRef50_Q4FUM9 Possible type I restriction-modification system, S subunit n=1 Tax=Psychrobacter arcticus 273-4 RepID=Q4FUM9_PSYA2 Length = 457 Score = 301 bits (772), Expect = 3e-80, Method: Composition-based stats. 
Identities = 90/441 (20%), Positives = 177/441 (40%), Gaps = 31/441 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF-- 61 GK+P W ++ + + + RG+T K + D +P + + + D Sbjct: 23 GKIPSHWELSKLRYMFSFGRGLTITKADLL----DTGVPCVNYGEVHSKYGFEVDPKRHY 78 Query: 62 ---VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 V + ++ ++ D+V A +S G G + RP Sbjct: 79 LKCVDEGYLQSSPYALLTQGDLVFADTSEDIEGSGNFTQLVSDDLIFAGYHTVIARPFDR 138 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 S F A+ S R ++ + G + +I + + I +P L E++ IA LD Sbjct: 139 QCSRFYAYLMDSKEIRTQVRHMVKGVKVFSITQSILKGVRIWLPSLDERETIANFLDFET 198 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 AQ+D+ + + + Q+LK RQAV+ AV L +W P+H KL Sbjct: 199 AQIDTLIDKQKTLIQLLKEKRQAVISHAVTKGLNPDAPLKDSGVEWLGEVPEHWGVSKLK 258 Query: 228 FESILTELRNGLSSKPNESGVGHP-ILRISSVR-AGHVDQNDIRFLECSESELNRHKLQD 285 + I L+ G + + P +RI+ V G++ + R L +E + L D Sbjct: 259 YL-ISEPLQYGANEAAEDVDKTQPRFVRITDVLPNGNLKDDTFRSLPQEIAEP--YMLMD 315 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSAR 344 GD+L R G+ VG + + + LI+A++ ++ P E+ + + Sbjct: 316 GDVLLARSGGT---VGKSFIYRDSW-GKCCFAGYLIKAKIDEEITPAEWFYLNTLTDFYW 371 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALA 404 + + + + +S S V+ +PP++E +I+ + DT+ + A+ Sbjct: 372 KWIESIQIQ-ATIQNVSADKYNSFVIAVPPLEESYKIISYINYNLEVFDTLVMKAEQAIQ 430 Query: 405 RVNNLTQSILAKAFRGELTAQ 425 + ++++ A G++ + Sbjct: 431 LMQERRTALISAAVTGKIDVR 451 Score = 139 bits (352), Expect = 2e-31, Method: Composition-based stats. 
Identities = 41/239 (17%), Positives = 85/239 (35%), Gaps = 20/239 (8%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRF- 270 +W P H KL + + R +K + G P + V + + + D + Sbjct: 20 EWLGKIPSHWELSKLRYM--FSFGRGLTITKADLLDTGVPCVNYGEVHSKYGFEVDPKRH 77 Query: 271 ----LECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD-KLIRAR 324 ++ + + + L GDL+F + +E G +L +L++ + AR Sbjct: 78 YLKCVDEGYLQSSPYALLTQGDLVFADTSEDIEGSGN---FTQLVSDDLIFAGYHTVIAR 134 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + S R + + VK I+ +K + LP + E+ I Sbjct: 135 PFDRQCSRFYAYLMDSKEIRTQVRHMVK-GVKVFSITQSILKGVRIWLPSLDERETIANF 193 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ A DT+ + + + Q++++ A + NPD ++ L Sbjct: 194 LDFETAQIDTLIDKQKTLIQLLKEKRQAVISHAVT-------KGLNPDAPLKDSGVEWL 245 >UniRef50_D1UP80 Restriction modification system DNA specificity domain protein n=1 Tax=Burkholderia sp. CCGE1001 RepID=D1UP80_9BURK Length = 443 Score = 301 bits (771), Expect = 4e-80, Method: Composition-based stats. 
Identities = 130/466 (27%), Positives = 225/466 (48%), Gaps = 30/466 (6%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDL 59 MS +LP+GW+ + V G T K E + ++ +I ++ + L Sbjct: 1 MS--RLPKGWLETTLGEVVDY--GTTLKAEPDEISDDEW---VLELEDIEKDKSRIVSRL 53 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 F + + S D++ + V ++ + + Sbjct: 54 TFADRKSKSTKNRFSKGDVLYGKLRPYLNKV-----VLADSNGLCTTEIIPIKQTAAVDN 108 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ H+ + + + +S G N+ + + +PPLAEQK IA+KLD++L++V Sbjct: 109 RYVFHWLRGPRFLSYAIGVSHGLNMPRLGTDAGRSAPFILPPLAEQKRIADKLDSVLSRV 168 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 ++ AR ++P IL R R+A L + ++ F S++ +R G Sbjct: 169 EAACARMGRVPTILTRLRRAAL--------VATLLGQDGDAKPTPRIAFGSLINSIRGGT 220 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 ++ P +PILR SSVR G +D D+R+L +S ++ +++ D+LFTR NG++ + Sbjct: 221 TAVPQSDKTAYPILRSSSVRQGRIDFEDVRYLTSEQSGEEKNFIRENDVLFTRLNGNVNY 280 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 VG C ++ + YPD+L ARL + +P+Y F+ P R + K+++G K Sbjct: 281 VGNCAVVPSVSLNKYQYPDRLYCARLKETIVPKYCAYAFALPDIRKEIERRAKSSAGHKR 340 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 IS +DIK + LPPV EQ +V ++E++FA D +EK ++ A ++LT ++LAKAFR Sbjct: 341 ISIQDIKEMEIPLPPVAEQLRMVNQIERIFATCDRLEKTLDEAKIVADHLTPALLAKAFR 400 Query: 420 GELTAQWRAENPDLISGENSAAALLEKIKAERAASG-GKKASRKKS 464 GEL Q + SA LLE++KA + G K SR+ + Sbjct: 401 GELVGQDPNDE--------SAEQLLERLKALTTSLGTKGKRSRQSA 438 >UniRef50_A1K1C0 Type I site-specific deoxyribonuclease n=3 Tax=Bacteria RepID=A1K1C0_AZOSB Length = 449 Score = 301 bits (771), Expect = 4e-80, Method: Composition-based stats. 
Identities = 79/437 (18%), Positives = 158/437 (36%), Gaps = 24/437 (5%) Query: 6 LPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-FVP 63 +P W +P+ V L+ GV+ +++ + + +G F + V Sbjct: 20 IPAHWEPSPLKRVVALVESGVSVNAVDEPAGPDAVG--VLKTSCVYSGNFSHGENKAVVA 77 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + L + + + ++++ + + ++VG + + F F Sbjct: 78 EELDRVACPVRAGTLIVSRMN-TPALVGAAGLVEENADNLFLPDRLWQVHFSGAVPKFAH 136 Query: 124 HFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 ++T S YR ++ AG + + N+ F +P+PP EQ IA LD A++D+ Sbjct: 137 YWTASPSYRAQVQMACAGTSASMQNLSQDEFLRFVMPLPPKDEQTAIAAFLDRETAKIDA 196 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESIL 232 A+ E++ +L RQA + AV L W P H L++ + L Sbjct: 197 LIAKQEKLIALLAEKRQATISHAVTRGLNPDAPMKDSGVAWLGEVPAHWSVSALSYLASL 256 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 +P+ P L+ + + + + + + G LL Sbjct: 257 ETGATPDRGEPSYWNGTIPWLKTGEINWAPICEAEEFITDAGLENSAAKIAKPGTLLMAM 316 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 Y + G LL+ Y +PE+ FF + + Sbjct: 317 YGQGVTR-GRVALLEIE----ATYNQACAAINFRSRIIPEFGRYFFMAAY---DHVRDAG 368 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + Q +S I + +PP+ EQ +VR ++ A D + + + + + Sbjct: 369 NETSQMNLSAGLISKIRLPVPPLDEQQAVVRFLDVETAKLDVLGAESERGITLLKERRSA 428 Query: 413 ILAKAFRGELTAQWRAE 429 ++A A G++ + AE Sbjct: 429 LIAAAVTGQIDVRNTAE 445 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. 
Identities = 42/207 (20%), Positives = 81/207 (39%), Gaps = 6/207 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W ++ +S + +L G T + + + +P ++ I + Sbjct: 239 GEVPAHWSVSALSYLASLETGATPDRGEPS--YWNGTIPWLKTGEINWAPICEAEEFITD 296 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L + KI+ ++ G G+ A L E ++ C + I F Sbjct: 297 AGLENSAAKIAKPGTLLMAMYGQGVTRGRVAL--LEIEATYNQACAAINFRSRIIPEFGR 354 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F + + + N+ I +P+PPL EQ+ + LD A++D Sbjct: 355 YFF--MAAYDHVRDAGNETSQMNLSAGLISKIRLPVPPLDEQQAVVRFLDVETAKLDVLG 412 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLT 210 A E+ +LK R A++ AV G++ Sbjct: 413 AESERGITLLKERRSALIAAAVTGQID 439 Score = 159 bits (403), Expect = 2e-37, Method: Composition-based stats. Identities = 53/231 (22%), Positives = 96/231 (41%), Gaps = 16/231 (6%) Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVG---HPILRISSVRAGHVDQNDIRFLEC 273 P H L L E +G+S + G +L+ S V +G+ + + + Sbjct: 20 IPAHWEPSPLKRVVALVE--SGVSVNAVDEPAGPDAVGVLKTSCVYSGNFSHGENKAVVA 77 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEY 333 E + ++ G L+ +R N + VG GL+++ NL PD+L + A+P++ Sbjct: 78 EELDRVACPVRAGTLIVSRMN-TPALVGAAGLVEENAD-NLFLPDRLWQVHF-SGAVPKF 134 Query: 334 IEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 + +SPS R + TS + +S + V+ LPP EQ I +++ A Sbjct: 135 AHYWTASPSYRAQVQMACAGTSASMQNLSQDEFLRFVMPLPPKDEQTAIAAFLDRETAKI 194 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + +A + Q+ ++ A R NPD ++ A L Sbjct: 195 DALIAKQEKLIALLAEKRQATISHAVT-------RGLNPDAPMKDSGVAWL 238 >UniRef50_C6J5M6 Putative uncharacterized protein n=1 Tax=Paenibacillus sp. oral taxon 786 str. D14 RepID=C6J5M6_9BACL Length = 403 Score = 300 bits (770), Expect = 5e-80, Method: Composition-based stats. 
Identities = 90/420 (21%), Positives = 175/420 (41%), Gaps = 32/420 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P GW + P+ L++G+TY +Y L ++R++NIQ+GK D V+V + Sbjct: 3 VPNGWAVKPLLECCDLLQGLTYSPSNIQSY----GLLVLRSSNIQDGKLVLDDCVYVNCS 58 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + E + + P DI+I + +GS +++GKS P+ +FGAF VLR + +G++AH Sbjct: 59 I-DEIKYVKPNDILICVRNGSSALIGKSCVIDRPYNATFGAFMSVLRGDT---TGYLAHM 114 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 S + + +I + S+ A IN I F+ I IPIP EQ+ IA L A + + + Sbjct: 115 FASDVVQQQIRNRSS-ATINQITKRDFEDIKIPIPFDEEEQRAIAAALSDADAYITALEK 173 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + + + Q +L G + ++ + KK++ + S P Sbjct: 174 LITKKRAVKQGAMQELLTG---KRRLPGFKGE----WIEKKIHEIGDTSSGGTPSRSVPT 226 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 P + S + ++ + + + + G +L Y ++ +G+ Sbjct: 227 YFNGNIPWVTTSELNDNYIRSTAEKITSEALNNSSAKLFPKGTVLMAMYGATIGKLGI-- 284 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + KD ++ R ++ + + +GQ IS Sbjct: 285 -----LDVDATTNQACCALFFNKDIDSVFMYFLLL--YHRTEIIE-LGSGAGQPNISQMI 336 Query: 365 IKSQVVLLPP-VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 I++ +PP + EQ I + + A D + L + + Q ++++ G + Sbjct: 337 IRNLTFTIPPTLAEQTAIAAVLSDMDAEIDAL----TAKLEKARRIKQGMMSELLTGRIR 392 >UniRef50_A4FXL8 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus maripaludis C5 RepID=A4FXL8_METM5 Length = 447 Score = 300 bits (770), Expect = 6e-80, Method: Composition-based stats. 
Identities = 83/440 (18%), Positives = 179/440 (40%), Gaps = 30/440 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT-----T 57 G +P W + + + L G++ K + + ++ + + +I + Sbjct: 13 IGDIPADWGVKKLKYILGLNTGLSITKAELV----ENGVDCVNYGDIHSKYTFDIVSSRD 68 Query: 58 DLVFVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHL--PFECSFGAFCGVLRP 113 +L VP + S S D + +S G G+ + RP Sbjct: 69 NLPKVPVEFIDTNPSAIASEGDFIFCDTSEDIEGSGNCLFIRESNNKPIFAGSHTILGRP 128 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 + S ++ + KS +++I G + +I I++ +PP+ EQ+ IA+ LD Sbjct: 129 LINVNSTYLGYLLKSPDIKSQIQKRVVGIKVYSITQKILKSISLILPPVDEQQEIAQYLD 188 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFK 224 + Q+DS + + K ++Q+++ V L +W P+H Sbjct: 189 DKVGQIDSIIEKTKSSIDEYKSYKQSIITETVTKGLDPTVTMKDSGIEWIGDIPEHWDII 248 Query: 225 KLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKL 283 K+ + L +NG+S + G G+P + V + + ++ +E +E + + + + Sbjct: 249 KIRYLGTL---QNGISKSSSYFGSGYPFVSYGDVYKNYELPKSVEGLVESNEFDKSNYSV 305 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL--TKDALPEYIEIFFSSP 341 + GD+ FTR + +++ +G + + ++ LIR R +K P Y + +F S Sbjct: 306 EYGDVFFTRTSETIDEIGFTATCMHTMN-DAVFAGFLIRFRPFDSKLLNPLYSKYYFRSD 364 Query: 342 SARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNN 401 R + + + +S + +K VL+PP EQ I + +E+ D + + Sbjct: 365 MHRRFFVKEMNL-VTRASLSQELLKKLPVLVPPHNEQIAIGKFIEETCQTIDQLITKKQQ 423 Query: 402 ALARVNNLTQSILAKAFRGE 421 + + +S++ + G+ Sbjct: 424 LITELKAYKKSLIYEVVTGK 443 Score = 149 bits (376), Expect = 3e-34, Method: Composition-based stats. 
Identities = 43/239 (17%), Positives = 94/239 (39%), Gaps = 18/239 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLS-SKPNESGVGHPILRISSVRAGH-VDQ---- 265 +W P KKL + + L GLS +K G + + + + D Sbjct: 11 EWIGDIPADWGVKKLKY---ILGLNTGLSITKAELVENGVDCVNYGDIHSKYTFDIVSSR 67 Query: 266 NDIRFLECSESELNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 +++ + + N + +GD +F + +E G C +++ ++ + I R Sbjct: 68 DNLPKVPVEFIDTNPSAIASEGDFIFCDTSEDIEGSGNCLFIRESNNKPIFAGSHTILGR 127 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + Y+ SP ++ + V I+ K +KS ++LPPV EQ EI + Sbjct: 128 PLINVNSTYLGYLLKSPDIKSQIQKRV-VGIKVYSITQKILKSISLILPPVDEQQEIAQY 186 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ D+I ++ +++ + QSI+ + + +P + ++ + Sbjct: 187 LDDKVGQIDSIIEKTKSSIDEYKSYKQSIITETVT-------KGLDPTVTMKDSGIEWI 238 Score = 144 bits (365), Expect = 5e-33, Method: Composition-based stats. Identities = 35/213 (16%), Positives = 78/213 (36%), Gaps = 10/213 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +PE W I + + TL G++ P + ++ + + Sbjct: 238 IGDIPEHWDIIKIRYLGTLQNGISKSSS-----YFGSGYPFVSYGDVYKNYELPKSVEGL 292 Query: 63 --PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRP--EKLI 117 K + + D+ +S + +G +A H + F F RP KL+ Sbjct: 293 VESNEFDKSNYSVEYGDVFFTRTSETIDEIGFTATCMHTMNDAVFAGFLIRFRPFDSKLL 352 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ +S ++R ++ + + +PP EQ I + ++ Sbjct: 353 NPLYSKYYFRSDMHRRFFVKEMNLVTRASLSQELLKKLPVLVPPHNEQIAIGKFIEETCQ 412 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +D + +Q+ LK ++++++ V GK Sbjct: 413 TIDQLITKKQQLITELKAYKKSLIYEVVTGKKE 445 >UniRef50_A1ZUE4 Type I restriction-modification system specificity subunit n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZUE4_9SPHI Length = 424 Score = 298 bits (763), Expect = 4e-79, Method: Composition-based stats. 
Identities = 76/426 (17%), Positives = 163/426 (38%), Gaps = 19/426 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + + + + G T + + Y + ++P ++ ++ N + T+ Sbjct: 12 GEIPEDWEVVKLGDIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITS 71 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 L + S + P++ V+ G + +G++ L E + L + I+ FI Sbjct: 72 LALKETSCNLLPKNTVLVAMYGGFNQIGRTGL--LKIEATTNQAISALNIKSDNIYPEFI 129 Query: 123 AHFTKSS-LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K ++ S NI + I IPPLAEQ+ IA+ L T+ ++ + Sbjct: 130 LAWLNAKVEVWKKFAASSR--KDPNITKKDVEHFPIVIPPLAEQQEIADILSTVDEKIAT 187 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 R Q+ K Q + + P+ KL + ++ L Sbjct: 188 IDERLAHTQQLKKGLMQRLFTRGLGHTSFKASPLGEIPESWEVVKLGDIAKVSAGGTPLR 247 Query: 241 SKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 SK E P ++ + ++ + + + E + + L +L Y G Sbjct: 248 SKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITSLALKETSCNLLPKNTVLVAMYGG-FN 306 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 +G GLLK N I+ + + PE+I + ++ ++ Sbjct: 307 QIGRTGLLKIEATTNQAISALNIK---SDNIYPEFILAWLNAKV--EVWKKFAASSRKDP 361 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ KD++ +++PP+ EQ EI + + + + ++ L + ++ + Sbjct: 362 NITKKDVEHFPIVIPPLAEQQEIADILGGVDEKLELLAEKKEA----YQGLKKGLMQQLL 417 Query: 419 RGELTA 424 G++ Sbjct: 418 TGKVRV 423 Score = 167 bits (423), Expect = 9e-40, Method: Composition-based stats. 
Identities = 43/207 (20%), Positives = 86/207 (41%), Gaps = 6/207 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + + + + G T + + Y + ++P ++ ++ N + T+ Sbjct: 222 GEIPESWEVVKLGDIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSIIEDTEEKITS 281 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 L + S + P++ V+ G + +G++ L E + L + I+ FI Sbjct: 282 LALKETSCNLLPKNTVLVAMYGGFNQIGRTGL--LKIEATTNQAISALNIKSDNIYPEFI 339 Query: 123 AHFTKSS-LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K ++ S NI + I IPPLAEQ+ IA+ L + +++ Sbjct: 340 LAWLNAKVEVWKKFAASSR--KDPNITKKDVEHFPIVIPPLAEQQEIADILGGVDEKLEL 397 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGK 208 + E + K Q +L G V Sbjct: 398 LAEKKEAYQGLKKGLMQQLLTGKVRVG 424 Score = 132 bits (332), Expect = 3e-29, Method: Composition-based stats. Identities = 36/214 (16%), Positives = 77/214 (35%), Gaps = 12/214 (5%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES--GVGHPILRISSVRAGHVDQ 265 + P+ KL + ++ L SK E P ++ + ++ Sbjct: 5 GYKDSPLGEIPEDWEVVKLGDIAKVSAGGTPLRSKQEEYFTNGHIPWVKTLDLNNSIIED 64 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 + + + E + + L +L Y G +G GLLK N I+ Sbjct: 65 TEEKITSLALKETSCNLLPKNTVLVAMYGG-FNQIGRTGLLKIEATTNQAISALNIK--- 120 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + + PE+I + ++ ++ I+ KD++ +++PP+ EQ EI + Sbjct: 121 SDNIYPEFILAWLNAKV--EVWKKFAASSRKDPNITKKDVEHFPIVIPPLAEQQEIADIL 178 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 + ++ LA L + ++ + F Sbjct: 179 STVDEKI----ATIDERLAHTQQLKKGLMQRLFT 208 >UniRef50_B8GLU3 Type I restriction-modification system, S subunit n=1 Tax=Thioalkalivibrio sp. HL-EbGR7 RepID=B8GLU3_THISH Length = 458 Score = 297 bits (760), Expect = 7e-79, Method: Composition-based stats. 
Identities = 79/451 (17%), Positives = 166/451 (36%), Gaps = 27/451 (5%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT---DLVFV 62 +P W+ + ++ G + D LP +RA N+ G D + ++ F Sbjct: 21 IPVHWMTGQIKNAHDVVLGKMLQS--DAKTPADRLLPYLRAANVNWGGVDLSTVKEMWFS 78 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGF 121 P +++ ++ D+VI+ VG+SA C F RP+ S + Sbjct: 79 PAE--RKALRLMVGDVVIS----EGGDVGRSAVWQGELPECYFQNAINRARPKGEHSSRY 132 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ I + + I + PP EQ IA LD A++D Sbjct: 133 LYYWMSFIKSAGYIDIICNKSTIPHYTAEKVQGTPFLFPPAGEQAGIAAFLDHETAKIDR 192 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A+ +++ ++LK RQAV+ AV L +W P H +KL + +I Sbjct: 193 LIAKQQRLIELLKEKRQAVISHAVTKGLNPDAPMKDSGVEWLGEVPAHWRLEKLKYTAIF 252 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 P G P + +++ +V + + + + + + G +L Sbjct: 253 KGGGTPSKDSPEYWGGDIPWVSPKDMKSRYVADSQDKITVEAIAASSTSLIGPGQVLVVV 312 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 +G L+ + + + + E+ F N ++ K Sbjct: 313 RSGILQRTIPVAV----NLVEVTLNQDMKAIDFRDETRSEFFSYFVEGHED-NLLLEWRK 367 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + + I + + + +V +PP E EI++ + + ++ A+ + + Sbjct: 368 QGATVESIEQEYLGNTMVPMPPPSEMMEILQFLNGQLEKYRLLTEKATRAIELLREHRTA 427 Query: 413 ILAKAFRGELTAQ-WRAENPDLISGENSAAA 442 +++ A G++ + W+ N + +A+A Sbjct: 428 LISAAVTGKIDVRGWQKPNTEPQEAAEAASA 458 Score = 159 bits (402), Expect = 3e-37, Method: Composition-based stats. 
Identities = 42/240 (17%), Positives = 90/240 (37%), Gaps = 21/240 (8%) Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES 276 P H + ++ + + S + P LR ++V G VD + ++ + S + Sbjct: 21 IPVHWMTGQIKNAHDVVLGKMLQSDAKTPADRLLPYLRAANVNWGGVDLSTVKEMWFSPA 80 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 E +L GD++ + VG + + + + + + RAR + Y+ Sbjct: 81 ERKALRLMVGDVVISEGG----DVGRSAVWQGELPE-CYFQNAINRARPKGEHSSRYLYY 135 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + S + + + + S + + ++ L PP EQA I ++ A D + Sbjct: 136 WMSFIKSAGYI-DIICNKSTIPHYTAEKVQGTPFLFPPAGEQAGIAAFLDHETAKIDRLI 194 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL--------LEKIK 448 + + + Q++++ A + NPD ++ L LEK+K Sbjct: 195 AKQQRLIELLKEKRQAVISHAVT-------KGLNPDAPMKDSGVEWLGEVPAHWRLEKLK 247 Score = 122 bits (306), Expect = 4e-26, Method: Composition-based stats. Identities = 33/208 (15%), Positives = 75/208 (36%), Gaps = 5/208 (2%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + G T K +P + ++++ + Sbjct: 235 GEVPAHWRLEKLKYTAIFKGGGTPSK--DSPEYWGGDIPWVSPKDMKSRYVADSQDKITV 292 Query: 64 KNLVKESQKIS-PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + S + P +++ + SG A + E + + S F Sbjct: 293 EAIAASSTSLIGPGQVLVVVRSGILQRTIPVAVNLV--EVTLNQDMKAIDFRDETRSEFF 350 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++F + + GA + +I+ +P+PP +E I + L+ L + Sbjct: 351 SYFVEGHEDNLLLEWRKQGATVESIEQEYLGNTMVPMPPPSEMMEILQFLNGQLEKYRLL 410 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + + ++L+ R A++ AV GK+ Sbjct: 411 TEKATRAIELLREHRTALISAAVTGKID 438 >UniRef50_A3YSG6 Putative type I restriction enzyme specificity protein n=2 Tax=Campylobacter jejuni subsp. jejuni RepID=A3YSG6_CAMJE Length = 433 Score = 296 bits (759), Expect = 1e-78, Method: Composition-based stats. 
Identities = 82/434 (18%), Positives = 165/434 (38%), Gaps = 30/434 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + ++ + T + G ++ + +P+IR ++Q K + + Sbjct: 13 GEIPEHWEVVKINKIVTFVNGYAFENFDFNPIFE---IPVIRIGDMQKEKILYDNCLKTK 69 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + IS DI+IA+S + GK A + ++R + + + Sbjct: 70 EKEKLKQFLISNNDILIALSGATT---GKIAFCDTDNKAYINQRVAIVRSKLKL----VK 122 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ + + I G+ NI IP+PPL EQ+ IA LD Q+ + Sbjct: 123 YYFLTRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANFLDEKCEQIANFI 182 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTE 234 + E++ +LK +QA + + L + +W PQH KK L Sbjct: 183 EKKEKLISLLKEQKQAFINETITKGLDKNINFKDSGIEWLGEIPQHWEVKKFKMLFTLGN 242 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGH-----VDQNDIRFLECSE-SELNRHKLQDGDL 288 N +K + G P + + + + + + F+ + ++ + LQ GD Sbjct: 243 GLN--ITKADFVSYGIPCVSYGEIHSKYPCRLNTTIHTLPFVSKTYLADKPQSLLQKGDF 300 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 +F + +E G + + I + Y F S RN + Sbjct: 301 VFADTSEDIEGSGNFTSI--QSDTPIFAGYHTIILKYKGKINSLYFSFLFDSIFTRNQIR 358 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 V I+ +K L+PP+KEQ +I +++ D + ++ + + Sbjct: 359 KEVC-GVKVFSITKSILKEVQCLIPPLKEQEQIANFLDEKCEKIDLLIEKTKKQIKLIKE 417 Query: 409 LTQSILAKAFRGEL 422 +++ +A G + Sbjct: 418 YKTTLINQAVCGRI 431 Score = 133 bits (336), Expect = 1e-29, Method: Composition-based stats. 
Identities = 31/234 (13%), Positives = 80/234 (34%), Gaps = 23/234 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIR 269 +W P+H K+N + NG + + + P++RI ++ + ++ Sbjct: 10 EWLGEIPEHWEVVKIN---KIVTFVNGYAFENFDFNPIFEIPVIRIGDMQKEKILYDNCL 66 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + +L + + + D+L + + C ++ R Sbjct: 67 KT-KEKEKLKQFLISNNDILIALSGATTGKIAFC-----DTDNKAYINQRVAIVRSKLKL 120 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 + Y + ++ S Q IS K+I + LPP+KEQ +I +++ Sbjct: 121 VKYYFL-----TRGFSLLIELACNGSAQPNISTKEIGEFKIPLPPLKEQEQIANFLDEKC 175 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ ++ + Q+ + + + + ++ ++ L Sbjct: 176 EQIANFIEKKEKLISLLKEQKQAFINETIT-------KGLDKNINFKDSGIEWL 222 >UniRef50_B2V7V7 Restriction modification system DNA specificity domain n=3 Tax=Sulfurihydrogenibium RepID=B2V7V7_SULSY Length = 435 Score = 296 bits (759), Expect = 1e-78, Method: Composition-based stats. Identities = 85/438 (19%), Positives = 181/438 (41%), Gaps = 27/438 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYL--PLIRANNIQNGKFDTTDL 59 G +PE W +A + V + +G ++ +D + P +R +N+ K D ++L Sbjct: 11 EIGLIPEDWEVARLGEVFEVKQGKQLSAKEN----RDGKVLKPFLRTSNVLWNKIDLSEL 66 Query: 60 VFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRP-EKL 116 ++P + + ++ K+ DI++ VG++A + S+ LR + Sbjct: 67 SYMPFSESEFKNLKLKKGDILVC----EGGDVGRTAVWDGQIDEISYQNHLHRLRSVKDN 122 Query: 117 IFSGFIAHFTKSSLYRNKIS-SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I + F A++ + ++ + + I N+ + IP+PPL EQ+ IA+ L T+ Sbjct: 123 INNYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQRAIADILSTV 182 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLG-------GAVNGKLTEKWRNFEPQHSVFKKLNF 228 ++ T+ Q+ K + + KL E P+H +L Sbjct: 183 QNAIEKTEKVINATKQLKKSMMKHLFTYGAVAVDEIDRIKLKESEIGLIPEHWEVVRLGE 242 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 L + + GH I+ I +++ G++D N + ++Q D+ Sbjct: 243 
VVDLDRGISWRKFEEGSKDNGHLIISIPNIKDGYIDFNSKYNHYLIKHIPKNKQIQLNDI 302 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAM 347 LF +GS+E VG ++ L + + + + RAR+ +P+++ +S N Sbjct: 303 LFVGSSGSIENVGRNVFIENLSFEGIGFASFVFRARVKVNTVIPKFLYFMANSHWF-NYK 361 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 +++ G+ + K+ + LPP+ EQ +I + D + + Sbjct: 362 DYVRRSSDGKYNFQLTEFKTIKIPLPPLDEQQKIANIL----TTIDQKIQAEEKKKVALR 417 Query: 408 NLTQSILAKAFRGELTAQ 425 +L +++L + G++ + Sbjct: 418 SLFKTLLHQLMTGKIRVR 435 Score = 150 bits (380), Expect = 1e-34, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 79/215 (36%), Gaps = 10/215 (4%) Query: 206 NGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQ 265 + E P+ +L + + + + + + V P LR S+V +D Sbjct: 4 SKGFKETEIGLIPEDWEVARLGEVFEVKQGKQLSAKENRDGKVLKPFLRTSNVLWNKIDL 63 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR- 324 +++ ++ SESE KL+ GD+L VG + + + Y + L R R Sbjct: 64 SELSYMPFSESEFKNLKLKKGDILVCEGG----DVGRTAVWDGQIDE-ISYQNHLHRLRS 118 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + + + + + + + +S +K+ + LPP++EQ I Sbjct: 119 VKDNINNYFFAYWMEYAITIKNLYHQNANKTTIPNLSSSRLKAFPIPLPPLEEQRAIADI 178 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 + + ++ + L +S++ F Sbjct: 179 LSTV----QNAIEKTEKVINATKQLKKSMMKHLFT 209 >UniRef50_C1D7R6 Type I restriction-modification system, S subunit n=1 Tax=Laribacter hongkongensis HLHK9 RepID=C1D7R6_LARHH Length = 453 Score = 295 bits (757), Expect = 2e-78, Method: Composition-based stats. 
Identities = 81/438 (18%), Positives = 159/438 (36%), Gaps = 36/438 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVFVP 63 +P W + + + + + + + ++ + + + +I +G+ Sbjct: 20 IPSHWEVVRLKNIFEIRKRIAGELGHSVLSITQRGIKV---KDIESNDGQISMD------ 70 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + Q + P D + V S+ + + + Sbjct: 71 ---YSKYQIVLPGDFAMNHMDLLTGYVDISSTHGVTSP---DYRVFAMLDNAHCVPRYFL 124 Query: 124 HFTKSSLYRNKISSLSAGA---NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 H ++ + + GA F+ +P PP EQ IA LD A++D Sbjct: 125 HLFQNGYRQKIFYAFGQGASEFGRWRFPTDQFNNFRLPCPPDDEQAAIATFLDRETAKID 184 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESI 231 + A E++ +L RQA + AV L +W P H V + Sbjct: 185 ALIAEQEKLIALLAEKRQATISHAVTRGLDPAVPMKDSGVEWLGQVPAHWVICSVRR--K 242 Query: 232 LTELRNGLSSKPNESG---VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDL 288 L + G S + +L+ V G + + L + + ++DGDL Sbjct: 243 LKRIEQGWSPECFSRPAEAGEWGVLKAGCVNGGIFRPEENKALPDTLAPDENILIKDGDL 302 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 L +R +GS VG L +L+ DK+ R L + LP+++ I F + R+ + Sbjct: 303 LMSRASGSPALVGSVAYL-SAPPAHLMLSDKIFRLHLEQGTLPQFVAIAFGARYLRHQIE 361 Query: 349 NCVKTTSGQK-GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + G + +K + +PP EQ EIV +Q A D ++ +A++ + Sbjct: 362 QAISGAEGLANNLPQTSLKGFTIAIPPEVEQQEIVVFTQQETAKLDALKIAAEHAVSLLK 421 Query: 408 NLTQSILAKAFRGELTAQ 425 +++A A G++ + Sbjct: 422 ERRAALIAAAVTGQIDVR 439 Score = 125 bits (314), Expect = 4e-27, Method: Composition-based stats. 
Identities = 38/258 (14%), Positives = 80/258 (31%), Gaps = 44/258 (17%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS--VRAGHVDQNDIR 269 +W P H +L + + G +GH +L I+ ++ ++ ND + Sbjct: 15 EWLRSIPSHWEVVRLKNIFEIRKRIAG--------ELGHSVLSITQRGIKVKDIESNDGQ 66 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK-LIRARLTKD 328 S + GD + + G + + PD + Sbjct: 67 I---SMDYSKYQIVLPGDFAMNHMDL------LTGYVDISSTHGVTSPDYRVFAMLDNAH 117 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTS--GQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 +P Y F + + + S G+ + + PP EQA I ++ Sbjct: 118 CVPRYFLHLFQNGYRQKIFYAFGQGASEFGRWRFPTDQFNNFRLPCPPDDEQAAIATFLD 177 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL--- 443 + A D + + +A + Q+ ++ A R +P + ++ L Sbjct: 178 RETAKIDALIAEQEKLIALLAEKRQATISHAVT-------RGLDPAVPMKDSGVEWLGQV 230 Query: 444 ------------LEKIKA 449 L++I+ Sbjct: 231 PAHWVICSVRRKLKRIEQ 248 Score = 110 bits (277), Expect = 7e-23, Method: Composition-based stats. Identities = 50/215 (23%), Positives = 87/215 (40%), Gaps = 13/215 (6%) Query: 4 GKLPEGWVIA----PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 G++P WVI + + + E + +++A + G F + Sbjct: 228 GQVPAHWVICSVRRKLKRI-----EQGWSPECFSRPAEAGEWGVLKAGCVNGGIFRPEEN 282 Query: 60 VFVPKNLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAFCGVLRPEKLI 117 +P L E+ I D++++ +SGS ++VG A+ P L E+ Sbjct: 283 KALPDTLAPDENILIKDGDLLMSRASGSPALVGSVAYLSAPPAHLMLSDKIFRLHLEQGT 342 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANI--NNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 F+A + R++I +GA NN+ S I IPP EQ+ I Sbjct: 343 LPQFVAIAFGARYLRHQIEQAISGAEGLANNLPQTSLKGFTIAIPPEVEQQEIVVFTQQE 402 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 A++D+ K E +LK R A++ AV G++ Sbjct: 403 TAKLDALKIAAEHAVSLLKERRAALIAAAVTGQID 437 >UniRef50_Q30XD2 Type I restriction-modification system, S subunit n=1 Tax=Desulfovibrio desulfuricans subsp. desulfuricans str. G20 RepID=Q30XD2_DESDG Length = 448 Score = 295 bits (755), Expect = 3e-78, Method: Composition-based stats. 
Identities = 85/439 (19%), Positives = 160/439 (36%), Gaps = 27/439 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W IAPV G + + +D +P RA +Q + +D+ + Sbjct: 18 IGQVPEHWKIAPVKYHYDARLGKMIQPAAVSD--RDIEVPYHRAQTVQWERIVESDIKEM 75 Query: 63 PKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPEKLIFS 119 + E +S D++I V ++A P F +R + Sbjct: 76 WASPRDIEQFSVSEGDLLIC----EGGDVCRAAIVKQPPEKNMIFQKSIHRIRSKGEYGV 131 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 G++ + I L I + + P+PP EQ IA LD A++ Sbjct: 132 GWVMRLMQHLRSSEWIDVLCNKNTIVHFTSDKLGSLECPLPPPDEQASIAAALDRETARI 191 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSV---FKKLN 227 D+ + + ++LK RQA++ AV L +W P+H K + Sbjct: 192 DALIQKKTRFIELLKEKRQALITHAVTKGLDPNVKMKDSGVEWLGEVPEHWSSVPIKYMA 251 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDG 286 E L + S G + +V G + F+ L ++ G Sbjct: 252 LERNSLFLDGDWIESKDISTDGIRYITTGNVGEGVYKEQGSGFISEETFHALGCTEVYGG 311 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 D+L +R N +G ++ L + + D +I R ++I FSS Sbjct: 312 DVLVSRLN---NPIGRACMVPDLGVRVVTSVDNVI-FRPDSKFNKKFIVYLFSSEEYFKH 367 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + + + IS + + V P ++EQ +I R ++ A D + + ++ + Sbjct: 368 -TSNLARGATMQRISRGLLGNIRVATPSIEEQTQIARFLDHETARIDALIGKAEQSITLL 426 Query: 407 NNLTQSILAKAFRGELTAQ 425 + + A G++ + Sbjct: 427 KERRAAFITAAVTGQIDLR 445 Score = 144 bits (363), Expect = 9e-33, Method: Composition-based stats. 
Identities = 32/232 (13%), Positives = 86/232 (37%), Gaps = 12/232 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H + + + + ++ + P R +V+ + ++DI+ + Sbjct: 16 EWIGQVPEHWKIAPVKYHYDARLGKMIQPAAVSDRDIEVPYHRAQTVQWERIVESDIKEM 75 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 S ++ + + +GDLL V ++K+ +N+++ + R R + Sbjct: 76 WASPRDIEQFSVSEGDLLICEGG----DVCRAAIVKQPPEKNMIFQKSIHRIRSKGEYGV 131 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ + + + + + S LPP EQA I +++ A Sbjct: 132 GWVMRLMQHLRSSEWIDVLCNKNTIV-HFTSDKLGSLECPLPPPDEQASIAAALDRETAR 190 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q+++ A + +P++ ++ L Sbjct: 191 IDALIQKKTRFIELLKEKRQALITHAVT-------KGLDPNVKMKDSGVEWL 235 >UniRef50_A6TLK6 Restriction modification system DNA specificity domain n=2 Tax=Clostridiaceae RepID=A6TLK6_ALKMQ Length = 467 Score = 295 bits (755), Expect = 3e-78, Method: Composition-based stats. Identities = 101/451 (22%), Positives = 188/451 (41%), Gaps = 29/451 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTDLVFV 62 +PE WV + VTT+I G T + I Y ++ +P I ++ + Sbjct: 28 VPENWVWTRLGNVTTIIGGGTPPS-RVIEYYENGSIPWISPVDLSGYTDIYISHGKKNIT 86 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L K S ++ PE+ V+ S V + E P ++ Sbjct: 87 ELGLKKSSARLLPENTVLLSSRAPIGYVAIA-----DNELCTNQGFKSFLPSPCYLPKYL 141 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + KSS + + + ++G + ++ P+PPLAEQ+ I +++++L +++ Sbjct: 142 YFYLKSS--KKLLEAYASGTTFLELSGRKAAIVEFPLPPLAEQQRIVDRIESLFEKLNQA 199 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS- 241 KA + + + A+L A +G+LTEKWR K + + R G + Sbjct: 200 KALIQDALDSFENRKAAILHKAFSGELTEKWREENGVGMGSWKKKSIKEVVKFRAGYAFD 259 Query: 242 KPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESE---LNRHKLQDGDLLFTRYNG-S 296 N S GH ++R+ ++ G +D + ++ + + R + +GD+L T Sbjct: 260 
SKNFSSTGHQVIRMGNLYNGVLDLTRNPVYISPDLIDNSIIKRFSINEGDILLTLTGTKY 319 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 G L+K+ + NLL +++ + Y+ + S R+ + Sbjct: 320 KRDYGYAVLIKESE--NLLLNQRILSLTP-ESIETNYLLYYLQSDFFRDVFFSNETGGVN 376 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q +S K ++ + + EQ EIVR ++ +F D Q+ + + ++ + +SILA+ Sbjct: 377 QGNVSSKFVEKIEIPIFSSLEQKEIVRILDYIFEK-DKNANQLCDLIDNIDLMKKSILAR 435 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKI 447 AFRGEL E SA LL+ I Sbjct: 436 AFRGELGTNNPEEE--------SAMELLKDI 458 Score = 139 bits (352), Expect = 2e-31, Method: Composition-based stats. Identities = 46/223 (20%), Positives = 82/223 (36%), Gaps = 15/223 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSV---RAGHVDQND 267 + N P++ V+ +L + + S G P + + ++ Sbjct: 23 EKSNVVPENWVWTRLGNVTTIIGGGTPPSRVIEYYENGSIPWISPVDLSGYTDIYISHGK 82 Query: 268 IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 E + + L + +L + S +G + L + Sbjct: 83 KNITELGLKKSSARLLPENTVLLS----SRAPIGYVAIADNE----LCTNQGFKSFLPSP 134 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 LP+Y+ + S ++ + + +SG+ LPP+ EQ IV R+E Sbjct: 135 CYLPKYLYFYLKSS---KKLLEAYASGTTFLELSGRKAAIVEFPLPPLAEQQRIVDRIES 191 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 LF + + + +AL N +IL KAF GELT +WR EN Sbjct: 192 LFEKLNQAKALIQDALDSFENRKAAILHKAFSGELTEKWREEN 234 >UniRef50_A1TWL9 Restriction modification system DNA specificity domain n=2 Tax=Gammaproteobacteria RepID=A1TWL9_MARAV Length = 435 Score = 294 bits (754), Expect = 4e-78, Method: Composition-based stats. 
Identities = 152/467 (32%), Positives = 230/467 (49%), Gaps = 40/467 (8%) Query: 1 MSAGKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 M + LP W +A + ++ + G T A + + L+R +IQN ++ Sbjct: 1 MQSQLLPANWQLANLGEISSDISYGYT-----ASATSEPTGVKLLRITDIQNNTVSWPNV 55 Query: 60 VFVPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLI 117 K ++ P D+V A + + VGKS + ++ +R + + Sbjct: 56 PNCKIEPEKVGKYRLKPSDLVFARTGAT---VGKSYLLKGEIPESVYASYLIRVRCLEGV 112 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+A++ +S Y +I+ SAG N+ +++P+PPLAEQK+IA+KLDTLLA Sbjct: 113 SIEFLANYFQSPYYWRQITDFSAGIGQPNVNGTKLKNLSVPVPPLAEQKVIADKLDTLLA 172 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 QV++TKAR E+IPQILKRFRQ+VL AV+G+L + + + + R Sbjct: 173 QVENTKARLERIPQILKRFRQSVLAAAVSGRLIDAQPESIAKLEELVDIENGA-----RK 227 Query: 238 GLSSK-PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 +S+ P + + +L E R+ LL + Sbjct: 228 PVSATIRKTIQGTIPYYGATGIVD---------YLNDYTHE-GRY------LLVGEDGAN 271 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 L L + + + + ++++I +S + T S Sbjct: 272 LLS--KSKDLAFIVEGKMWVNNHAHVLKERPGVNLDFVKIAINSLDLTPWI-----TGSA 324 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q ++ K + + + EQ EIVRRV+QLF++AD IE+Q ++ALARVNNLTQSILAK Sbjct: 325 QPKLTKKSLCGLPITNFTLDEQTEIVRRVDQLFSHADRIEQQASSALARVNNLTQSILAK 384 Query: 417 AFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 AFRGELT QWR +NP+LI GENSA ALLE+IKAERAA K +R K Sbjct: 385 AFRGELTEQWRRDNPELIGGENSAEALLERIKAERAAMKPVKRTRNK 431 >UniRef50_C0VG50 Type I restriction modification enzyme protein S n=1 Tax=Acinetobacter sp. ATCC 27244 RepID=C0VG50_9GAMM Length = 399 Score = 293 bits (751), Expect = 9e-78, Method: Composition-based stats. 
Identities = 117/416 (28%), Positives = 209/416 (50%), Gaps = 23/416 (5%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKES 70 I + ++T IRGV+Y K A++ +++ YLP++RANNIQ D V+VP++ + + Sbjct: 3 QIVKIGNISTQIRGVSYSKSDAVSNMQEGYLPVLRANNIQEQGLILEDFVYVPESKISKK 62 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSS 129 Q+I D++IA SSGS S+VGK+A FGAFC +LRP +L+ + A++ ++ Sbjct: 63 QRILAGDVIIAASSGSISLVGKAASAKEDINAGFGAFCKILRPNTELVDPRYFANYFQTQ 122 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 YR IS+L+AGANINN+K D + IP+PPL+EQ+ IA LD + E++ Sbjct: 123 QYRQIISNLAAGANINNLKNEHLDDLEIPLPPLSEQRRIASILDQADVLRQKRQQAIEKL 182 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS--KPNESG 247 Q+L+ + G V+ P+ KKL+ + L ++ + + + Sbjct: 183 DQLLQATFIDMFGDPVSN----------PKGFEVKKLSEQVDLIQIGPFGTQLHQEDYIE 232 Query: 248 VGHPILRISSVRAGHVDQN-DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G P++ S ++ G + N + + EL+++ L+ D+L R +G C ++ Sbjct: 233 NGIPLINPSHIKNGKIVPNLKLSVSQLKYGELSQYHLKLHDVLLGRRGE----MGRCAVV 288 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + + L L + P ++E+ SS S + + N V ++ + Sbjct: 289 TQNEVGWLCGTGSLFLRPNVEKINPFFLEMLLSSDSIKRYLEN-VSQGQTMANLNKTIVG 347 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 S ++ P ++ Q + + + + ++ ++ N+ +VNNL QS+ AF G L Sbjct: 348 SIPLIAPSIEIQNKF--FL--ISEEINKMKTELENSKNQVNNLFQSLQNHAFNGTL 399 Score = 107 bits (267), Expect = 1e-21, Method: Composition-based stats. 
Identities = 38/207 (18%), Positives = 77/207 (37%), Gaps = 12/207 (5%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+G+ + +S LI+ + + ++ +PLI ++I+NGK + V + Sbjct: 201 PKGFEVKKLSEQVDLIQIGPFGTQLHQEDYIENGIPLINPSHIKNGKIVPNLKLSVSQLK 260 Query: 67 VK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPE-KLIFSGFI 122 + D+++ G + +G+ A G LRP + I F+ Sbjct: 261 YGELSQYHLKLHDVLL----GRRGEMGRCAVVTQNEVGWLCGTGSLFLRPNVEKINPFFL 316 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 S + + ++S G + N+ I + P + Q K + +++ Sbjct: 317 EMLLSSDSIKRYLENVSQGQTMANLNKTIVGSIPLIAPSIEIQ----NKFFLISEEINKM 372 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 K E + Q++ A NG L Sbjct: 373 KTELENSKNQVNNLFQSLQNHAFNGTL 399 >UniRef50_A6EUA9 Type I restriction-modification system, S subunit n=1 Tax=unidentified eubacterium SCB49 RepID=A6EUA9_9BACT Length = 438 Score = 293 bits (750), Expect = 1e-77, Method: Composition-based stats. Identities = 77/434 (17%), Positives = 160/434 (36%), Gaps = 20/434 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + ++ + G T K + Y D +P + + + G Sbjct: 15 IGEIPEHWSSVSLKWISKIYSGGTPSKNK-PEYWSDGTIPWLNSGTVNQGDITEPSEYIT 73 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + L S K PE ++ +G G A FE + G++ P + ++ Sbjct: 74 EEALANSSAKWIPEKAILIALAGQGKTKGMVAQTQ--FEATCNQSLGIIVPSYPELNRYL 131 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + + + I +L G + I I P+P EQ I LD ++D Sbjct: 132 LFWLRKNY--QNIRNLGGGDKRDGINLEMIGSIPTPLPTKKEQTAITNYLDKKTTEIDQL 189 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + E++ Q+ + + A++ AV + +W P+ +L + Sbjct: 190 ISEKEELVQLYQEEKTALINQAVTKGIKPDAKLKNSGIEWLGEIPEDWNSLRLKYLGNFI 249 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNR-HKLQDGDLLFTR 292 + S+ + G +L+IS+++ +D +D F++ + ++ DL+F Sbjct: 250 NGYSFKST--DFKSSGVRVLKISNIQHMAIDWSDESFIDEEFYDTKSGFRVLQNDLVFAL 307 Query: 293 
YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 + L+ + L + + R + + ++I S + Sbjct: 308 TRPIISTGIKVALMNFDEKILLNQRNSIFRPKTK---MTKWIYFILLSSRFVQEFDKRID 364 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 T Q IS DI + +P +EQ +IV +E+ A DT + + + S Sbjct: 365 KTGQQPNISSNDIGEISIPVPTKEEQTKIVEHIEKETAKIDTKIAKAEKYINLLTEYRTS 424 Query: 413 ILAKAFRGELTAQW 426 ++++ G++ Sbjct: 425 LISEVVTGKIKVID 438 Score = 134 bits (339), Expect = 5e-30, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 68/233 (29%), Gaps = 16/233 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRF 270 +W P+H L + S + +KP G P L +V G + + Sbjct: 13 EWIGEIPEHWSSVSLKWISKIYSGGTPSKNKPEYWSDGTIPWLNSGTVNQGDITEPSEYI 72 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 E + + + + + +L G + L + L Sbjct: 73 TEEALANSSAKWIPEKAILIALAGQ-----GKTKGMVAQTQFEATCNQSLGIIVPSYPEL 127 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 Y+ + + + + GI+ + I S LP KEQ I +++ Sbjct: 128 NRYLLFWLRKNY---QNIRNLGGGDKRDGINLEMIGSIPTPLPTKKEQTAITNYLDKKTT 184 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + +++ +A + PD + L Sbjct: 185 EIDQLISEKEELVQLYQEEKTALINQAVT-------KGIKPDAKLKNSGIEWL 230 >UniRef50_B3R3C2 Type I restriction-modification methylase S subunit n=1 Tax=Cupriavidus taiwanensis RepID=B3R3C2_CUPTR Length = 458 Score = 292 bits (749), Expect = 1e-77, Method: Composition-based stats. 
Identities = 88/447 (19%), Positives = 162/447 (36%), Gaps = 30/447 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFV 62 G +P W + + K + + +D + + + I + G + V Sbjct: 18 GDMPAHWQVRRLRFAAEF----NPSKSEVSHLDRDTLVSFLPMDAIGEEGSLVLEQVRQV 73 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAF-CGVLRPEKLI-FS 119 + + D+ A + GK A FG V RP + S Sbjct: 74 SQ-VETGYTYFHEGDVAFAKITPCFEN-GKGAVMRGLLGGVGFGTTELIVARPRSDVTCS 131 Query: 120 GFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ S +R GA + I PPL+EQ I L + ++ Sbjct: 132 EYLHWLFCSIPFRKLGEGAMYGAGGQKRVPEDFARDFAIAFPPLSEQNAIVTFLYSETSK 191 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFE 229 +D+ + +++ +L RQA + V L K W P H K++ Sbjct: 192 IDTLISEQDKLLVLLAEKRQATISRIVTRGLEPKVQIKSVGADWLGEIPIHWQAKRVK-- 249 Query: 230 SILTELRNGLSSKPNES----GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQD 285 + + + G S + +L++ V G D + + L + L+ Sbjct: 250 WLTSSIEQGWSPQCENYPAEGENEWGVLKVGCVNGGVFDAAENKKLPPELEPFPEYSLRK 309 Query: 286 GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSAR 344 GDLL +R N + E VG ++ K H+ LL DKL R RL + PE++ + ++ AR Sbjct: 310 GDLLISRAN-TRELVGSAAVVPKDFHR-LLLCDKLYRLRLDQAKCTPEFLAAYLATGEAR 367 Query: 345 NAMMNCVKT-TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + +S I I +V LPP +EQA I+ + + + N ++ Sbjct: 368 GQIELGATGASSSMLNIGQSVIMDLLVPLPPAEEQAAIMDFLNAELDRLERLSLAANKSI 427 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAEN 430 + +++ A G++ + + Sbjct: 428 DLLKARRTALITAAVTGKIDVRNAVPD 454 Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats. 
Identities = 43/234 (18%), Positives = 80/234 (34%), Gaps = 14/234 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIRF 270 W P H ++L F + ++ +S + L + ++ G + +R Sbjct: 15 DWLGDMPAHWQVRRLRFAAEFNPSKSEVSH--LDRDTLVSFLPMDAIGEEGSLVLEQVRQ 72 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + E +GD+ F + E G +++ L +LI AR D Sbjct: 73 VSQ--VETGYTYFHEGDVAFAKITPCFEN-GKGAVMRGLLGGVGFGTTELIVARPRSDVT 129 Query: 331 -PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 EY+ F S R + GQK + + + PP+ EQ IV + Sbjct: 130 CSEYLHWLFCSIPFRKLGEGAMYGAGGQKRVPEDFARDFAIAFPPLSEQNAIVTFLYSET 189 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + DT+ + + L + Q+ +++ R P + A L Sbjct: 190 SKIDTLISEQDKLLVLLAEKRQATISRIVT-------RGLEPKVQIKSVGADWL 236 >UniRef50_C7RQC3 Type I restriction-modification system specificity subunit n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RQC3_9PROT Length = 475 Score = 290 bits (744), Expect = 6e-77, Method: Composition-based stats. 
Identities = 78/436 (17%), Positives = 173/436 (39%), Gaps = 26/436 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI--QNGKFDTTDLVF 61 G++P W + + +L++GV KE A+ +P +R + + F F Sbjct: 20 GQVPGHWDVRKPRHIGSLLKGVGGTKEDALPA----GVPCVRYGELYTTHAYFVRRPKTF 75 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + + D++ A S + +GKSA + G +LRP + + F Sbjct: 76 IHADRAADYTPLHYGDVLFAASGETLEDIGKSAVNLIDGTAVCGGDVIILRPSVPVHAPF 135 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + N+ +++ G + ++ P + P+PP+ EQ I L+ +++ Sbjct: 136 LGYVMDCRPLANQKATMGRGTTVKHVYPDELKHLVFPLPPVPEQAAIVRFLNWANGRLER 195 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 ++ +L +QA++ AV L W P+H +L F ++ Sbjct: 196 AIRAKRKVIALLNEQKQAIVHRAVTRGLDPSVPLKPSGIPWLGDIPRHWRVWRLKFVAL- 254 Query: 233 TELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSES--ELNRHKLQDGDLL 289 + + L + P S G HP +R + + AG V + + + + R + Q+GD+L Sbjct: 255 -NIVDCLHATPRYSDAGTHPAIRTADIVAGVVLVDQAKKVSSRDYARWTTRLQPQEGDIL 313 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 ++R E G+ + L +++ R+ +++ +S S + Sbjct: 314 YSREG---ERFGIAACVPAA--TQLCISQRMMVFRIATQHCSKFVMWLLNSRSTYGQALQ 368 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 V + ++ I++ + LP +EQ +V R+ + ++ + + Sbjct: 369 DVMGATA-PHVNISTIRNYYLALPLKREQEAVVERIGAETHPIEVAIDRLKREIELLREY 427 Query: 410 TQSILAKAFRGELTAQ 425 ++A G++ + Sbjct: 428 RTRLIADVVTGKVDVR 443 Score = 140 bits (353), Expect = 1e-31, Method: Composition-based stats. 
Identities = 44/233 (18%), Positives = 79/233 (33%), Gaps = 15/233 (6%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAG--HVDQNDIRF 270 W P H +K L + G K + G P +R + + + F Sbjct: 18 WLGQVPGHWDVRKPRHIGSLLKGVGGT--KEDALPAGVPCVRYGELYTTHAYFVRRPKTF 75 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + + + L GD+LF +LE +G + L + +I R + Sbjct: 76 IHADRA-ADYTPLHYGDVLFAASGETLEDIGKSAV--NLIDGTAVCGGDVIILRPSVPVH 132 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 ++ N + + K + ++K V LPPV EQA IVR + Sbjct: 133 APFLGYVMDCRPLANQ-KATMGRGTTVKHVYPDELKHLVFPLPPVPEQAAIVRFLNWANG 191 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + + +A +N Q+I+ +A R +P + + L Sbjct: 192 RLERAIRAKRKVIALLNEQKQAIVHRAVT-------RGLDPSVPLKPSGIPWL 237 >UniRef50_A6DQ81 Putative restriction-modification system specificity determinant n=1 Tax=Lentisphaera araneosa HTCC2155 RepID=A6DQ81_9BACT Length = 402 Score = 290 bits (744), Expect = 6e-77, Method: Composition-based stats. 
Identities = 110/415 (26%), Positives = 195/415 (46%), Gaps = 25/415 (6%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKIS 74 + +++ IRGV+YKK ++ + Y P++RANNI G + LV+V ++KE Q + Sbjct: 6 IGDISSQIRGVSYKKNDVVDEPTERYTPVMRANNINEGFLNYDKLVYVKSEVIKEHQLLQ 65 Query: 75 PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRN 133 D++I SSGS ++VGK+ SFGAFC VLRP+ K +F F + +S Y+ Sbjct: 66 KGDVLICASSGSLNLVGKAGSFLDSTSSSFGAFCKVLRPDTKKVFPRFFHFYFQSQGYKR 125 Query: 134 KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQIL 193 I +L+ GANINNIK D + IP+P L EQK IA LD + Q + L Sbjct: 126 SIKALAEGANINNIKNEHLDDLKIPLPSLEEQKRIAAILDKADELRQKRREAISQCNEFL 185 Query: 194 KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG---VGH 250 K ++ G V + K+ F+ +L + G S K Sbjct: 186 KSTFLSMFGDPVTN------------PKGWDKIIFDELLDNIDGGWSPKCETWPATLDEW 233 Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 ++++ ++ + + + + + ++Q DLLF+R N + E V C + + Sbjct: 234 GVMKLGALTTCEYKEEENKAMLPGLETKSNIEIQPRDLLFSRKN-THELVAACAYVWDTR 292 Query: 311 HQNLLYPDKLIRARLT--KDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKS 367 Q L+ D + R + + Y+ + R + +G IS K++K+ Sbjct: 293 PQ-LMMSDLMFRFKFKASAEVNSIYMWKLLVNERQRKEVQALASGAAGSMPNISKKNLKT 351 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + +PP++ Q + ++ ++ + Q+ +L +++ +++ KAF+GEL Sbjct: 352 IKLPIPPIELQNQFA----EIAKKTESSKSQMQQSLKELDDNFDALMQKAFKGEL 402 Score = 82.5 bits (203), Expect = 4e-14, Method: Composition-based stats. 
Identities = 39/213 (18%), Positives = 81/213 (38%), Gaps = 20/213 (9%) Query: 7 PEGWVIAPVSTVTTLIRGV-TYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 P+GW + I G + K E L + +++ + ++ + + Sbjct: 200 PKGWDKIIFDELLDNIDGGWSPKCETWPATLDEWG--VMKLGALTTCEYKEEENKAMLPG 257 Query: 66 LV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFC---GVLRPEKLIFSGF 121 L K + +I P D++ + + + +V A+ + + + S + Sbjct: 258 LETKSNIEIQPRDLLFSRKN-THELVAACAYVWDTRPQLMMSDLMFRFKFKASAEVNSIY 316 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + + R ++ +L++GA + NI + I +PIPP+ Q AE Sbjct: 317 MWKLLVNERQRKEVQALASGAAGSMPNISKKNLKTIKLPIPPIELQNQFAEI-------A 369 Query: 180 DSTKARFEQIPQILKRF---RQAVLGGAVNGKL 209 T++ Q+ Q LK A++ A G+L Sbjct: 370 KKTESSKSQMQQSLKELDDNFDALMQKAFKGEL 402 >UniRef50_Q8PTL2 Type I restriction-modification system specificity subunit n=2 Tax=Methanosarcina RepID=Q8PTL2_METMA Length = 440 Score = 290 bits (743), Expect = 8e-77, Method: Composition-based stats. Identities = 116/473 (24%), Positives = 203/473 (42%), Gaps = 62/473 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +LPEGW + + + G KK + + + +N I Sbjct: 6 ELPEGWAECQIKDIVVINYGKGLKKSDRV----EGQFDVFGSNGI--------------- 46 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAH--QHLPFECSFGAFCGVLRPEKLIFSGFI 122 + K +Q ++ VI GS + S+ + F G+ R F+ Sbjct: 47 -VGKHNQSLTNGPTVIIGRKGSVGEINLSSEPCWPIDTTYYIDNFYGINRI-------FL 98 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + K+ +++ I I +P+PPL+EQ I ++ L A++D+T Sbjct: 99 YYLLKTLN----LANYDTSTAIPGINRNDIYSQLVPLPPLSEQHRIVSAIEALFARLDAT 154 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE--------------------PQHSV 222 + +++ +ILK+FR++VL A +G+LTE+WR +V Sbjct: 155 NEKLDRVQEILKKFRESVLAAACDGRLTEEWRKENLHCNEYFAIDEDQFNLVKQWRIPTV 214 Query: 223 FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE--LNR 280 + E + + + S P + +G +R S ++ GH+D ++ +++ + + R Sbjct: 215 WSWSTLEDSCSHVVDCPHSTPKWTDIGVYCVRTSELKCGHIDFSNAKYVSEATYLERIKR 274 Query: 281 
HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 K Q+GD+L++R VG+ L+ + + +L+ R + +P + +S Sbjct: 275 LKPQEGDILYSREG----TVGIASLVPS--NVKICLGQRLMLFRTKNNLIPSFFVKVLNS 328 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 P +++ S + DIK LPP+ EQ EIVRRV+ LFA+AD+IE +V Sbjct: 329 PYIYDSV-KKSTMGSTAPRFNVADIKKFPTPLPPLPEQQEIVRRVDALFAFADSIETKVA 387 Query: 401 NALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAA 453 A + L QSILAKAF G+L +A L+E+IK ER Sbjct: 388 AAREKTEKLRQSILAKAFSGQLVETQAEIVRREGRDYETAEVLIERIKEERKQ 440 >UniRef50_P06991 Type-1 restriction enzyme EcoDI specificity protein n=1 Tax=Escherichia coli RepID=T1SD_ECOLX Length = 444 Score = 290 bits (742), Expect = 1e-76, Method: Composition-based stats. Identities = 169/488 (34%), Positives = 235/488 (48%), Gaps = 68/488 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MSAGKLP W + + L G K A D P + Sbjct: 1 MSAGKLPVDWKTVELGELIKLSTG----KLDANAADNDGQYPFFTCAE---------SVS 47 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + S + + G + + + + V+ P LI + Sbjct: 48 QINSWAFDTSAVLLAGN-------------GSFSIKKYTGKFNAYQRTYVIEPI-LIKTE 93 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ + + KI+ G+ I I+ I++ +P +EQ +IAEKLDTLLAQV+ Sbjct: 94 FLYWLLRGN--IKKITENGRGSTIPYIRKGDITDISVALPSPSEQTLIAEKLDTLLAQVE 151 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF------------------------ 216 STKAR EQIPQILKRFRQAVL A+NG+LT++WR+ Sbjct: 152 STKARLEQIPQILKRFRQAVLTFAMNGELTKEWRSQNNNPAFFPAEKNSLKQFRNKELPS 211 Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES 276 P + + + + + + P + L + + + + + + Sbjct: 212 IPNNWSWMRFDQVADIAS----KLKSPLDYPNTI-HLAPNHIESWTGKASGYQTILEDGV 266 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 +H+ G +++++ L V + + + + ++ Sbjct: 267 TSAKHEFYTGQIIYSKIRPYLCKV-------TIATFDGMCSADMYPI--NSKIDTHFLFR 317 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + + + + + 
++ + I+ KD+ V PP+ EQ EIVRRVEQLFAYADTIE Sbjct: 318 WMLTNTFTDW-ASNAESRTVLPKINQKDLSEIPVPTPPLPEQHEIVRRVEQLFAYADTIE 376 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGG 456 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGG Sbjct: 377 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGG 436 Query: 457 KKASRKKS 464 KKASRKKS Sbjct: 437 KKASRKKS 444 >UniRef50_Q4HFD9 HsdS n=3 Tax=Campylobacterales RepID=Q4HFD9_CAMCO Length = 408 Score = 289 bits (741), Expect = 1e-76, Method: Composition-based stats. Identities = 90/424 (21%), Positives = 163/424 (38%), Gaps = 30/424 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 +GW + + I T+K + +P + NI G FD +D+ ++ Sbjct: 6 QGWKWKSLGEIC-FITDGTHKTPN----YIETGIPFLSVKNISKGFFDLSDVKYISLEEH 60 Query: 68 KE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + K EDI+I +GK+ L FE S G+L+P+ I S ++ + Sbjct: 61 NKLIKRAKPEFEDILICRI----GTLGKAIKISLEFEFSIFVSLGLLKPKVKIISDYLVY 116 Query: 125 FTKSSLYRNKIS--SLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F S I+ + G + + + I +PPL EQ+ I LD A++D Sbjct: 117 FLNSCFIEEWINDNKVGGGTHTAKLNLNILEKCPIALPPLKEQERIVGILDENFAKIDEN 176 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLSS 241 EQ L Q+ L A N N++ PQ +K L + + Sbjct: 177 IKILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEIGEIITGTTPSKN 236 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLLFTRYNGSLE 298 PN G +P+ + S + + I++ + S+L N L +L S+ Sbjct: 237 NPNFYGNEYPLFKPSDLNGDII----IKYASDNLSKLGFDNARNLPKDTILVVCIGASIG 292 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG+ G+ N ++ +Y+ S + + + Sbjct: 293 KVGLSGV-------NGSCNQQINAIIPNSAFTSKYLFFVCLSNYFQTILKKNASQ-TTLP 344 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ + + LPP+KEQ +I +++L ++ +++ + + L S+L KAF Sbjct: 345 IINKTEFSKLQIPLPPLKEQEQIASHLDELSSHVKNLKQNYQAQIKNLQELKNSLLDKAF 404 Query: 419 RGEL 422 +G L Sbjct: 405 KGNL 408 Score = 154 bits (391), Expect = 5e-36, 
Method: Composition-based stats. Identities = 43/208 (20%), Positives = 81/208 (38%), Gaps = 7/208 (3%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 KLP+GW + + +I G T K Y + PL + +++ Sbjct: 208 ENYKLPQGWEWKSLGEIGEIITGTTPSKNNPNFY--GNEYPLFKPSDLNGDIIIKYASDN 265 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + K ++ + + I++ S VG S S + P S + Sbjct: 266 LSKLGFDNARNLPKDTILVVCIGASIGKVGLSGV-----NGSCNQQINAIIPNSAFTSKY 320 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S+ ++ + ++ + I F + IP+PPL EQ+ IA LD L + V + Sbjct: 321 LFFVCLSNYFQTILKKNASQTTLPIINKTEFSKLQIPLPPLKEQEQIASHLDELSSHVKN 380 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKL 209 K ++ + L+ + ++L A G L Sbjct: 381 LKQNYQAQIKNLQELKNSLLDKAFKGNL 408 Score = 144 bits (363), Expect = 9e-33, Method: Composition-based stats. Identities = 43/204 (21%), Positives = 81/204 (39%), Gaps = 12/204 (5%) Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE 277 Q +K L + + +G PN G P L + ++ G D +D++++ E Sbjct: 5 SQGWKWKSLG---EICFITDGTHKTPNYIETGIPFLSVKNISKGFFDLSDVKYISLEEHN 61 Query: 278 --LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE 335 + R K + D+L R +G + ++ L + + +Y+ Sbjct: 62 KLIKRAKPEFEDILICRIG----TLGKAIKISLEFEFSIFVS--LGLLKPKVKIISDYLV 115 Query: 336 IFFSSPSARNAMMNCVKTTSGQ-KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 F +S + + ++ ++ + LPP+KEQ IV +++ FA D Sbjct: 116 YFLNSCFIEEWINDNKVGGGTHTAKLNLNILEKCPIALPPLKEQERIVGILDENFAKIDE 175 Query: 395 IEKQVNNALARVNNLTQSILAKAF 418 K + L ++ L QS L KAF Sbjct: 176 NIKILEQDLLNLDELMQSALQKAF 199 >UniRef50_UPI0001C42656 hypothetical protein BpOF4_03730 n=1 Tax=Bacillus pseudofirmus OF4 RepID=UPI0001C42656 Length = 443 Score = 288 bits (738), Expect = 3e-76, Method: Composition-based stats. 
Identities = 78/441 (17%), Positives = 171/441 (38%), Gaps = 31/441 (7%) Query: 9 GWVIAPVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKNL 66 W + + V + I ++ + + +D +P + A +++NG + ++ + Sbjct: 15 DWQVMKIKRVLDIPITDGPHETPELL----EDGVPFLSAESVKNGNLNFDLKRGYISQED 70 Query: 67 VKES-QKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGFI 122 ++ +K P +DI + S + + A E S + ++R +K I ++ Sbjct: 71 HEKYIKKCKPQRDDIFMVKSGATTGNI---AMVDTDEEFSIWSPLALIRAKKEIVIPKYL 127 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +F S +R ++ + NI + + I IP L QK I ++ + +D Sbjct: 128 YYFVGSLAFREQVEVSWSYGTQQNIGMKVIENLFISIPSLEIQKRIVRYIEYKVKDIDIL 187 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESI-L 232 + + ++L++ RQ++L AV L KW P+H KK+ +I + Sbjct: 188 IKQKGKFIKLLEQQRQSILTEAVTKGLNPNMNMKDSGVKWIGEIPEHWEVKKVKHFAIHV 247 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFT 291 + G P LR +V + D+ F+ + E+ ++Q D+L Sbjct: 248 GSGKTPSGGAEIYLDEGIPFLRSLNVHFDGIHLKDLAFISEEINEEMKTSQVQPLDILLN 307 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNC 350 S +G ++ K + RL P Y + +S + Sbjct: 308 ITGAS---IGRTTIVPK-DFGRANVNQHVCIIRLNQNKVYPYYFNMLMASDVINQQIW-F 362 Query: 351 VKTTSGQKGISGKDIKSQVVLLPP-VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + S ++G++ ++ + +PP ++EQ EI + + V + ++ Sbjct: 363 AQNGSSREGLNFAQVRELIFAIPPTLEEQREINEWIYNKQMKIFNLINLVKEQIEKLKEY 422 Query: 410 TQSILAKAFRGELTAQWRAEN 430 QS++ +A G++ + + Sbjct: 423 RQSLIYEAVTGKIDVRELELD 443 Score = 151 bits (383), Expect = 4e-35, Method: Composition-based stats. 
Identities = 47/240 (19%), Positives = 96/240 (40%), Gaps = 16/240 (6%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 L E RN K+ + + +G P G P L SV+ G+++ + Sbjct: 2 KPLFELERNNVNYDWQVMKIKRVLDI-PITDGPHETPELLEDGVPFLSAESVKNGNLNFD 60 Query: 267 -DIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 ++ + E + + K Q D+ + + + ++ + ++ P LIRA Sbjct: 61 LKRGYISQEDHEKYIKKCKPQRDDIFMVKSGATTGNI---AMVDTDEEFSIWSPLALIRA 117 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 + + +P+Y+ F S + R + + Q+ I K I++ + +P ++ Q IVR Sbjct: 118 K-KEIVIPKYLYYFVGSLAFREQVEVSWSYGT-QQNIGMKVIENLFISIPSLEIQKRIVR 175 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 +E D + KQ + + QSIL +A + NP++ ++ + Sbjct: 176 YIEYKVKDIDILIKQKGKFIKLLEQQRQSILTEAVT-------KGLNPNMNMKDSGVKWI 228 Score = 144 bits (365), Expect = 5e-33, Method: Composition-based stats. Identities = 42/221 (19%), Positives = 92/221 (41%), Gaps = 11/221 (4%) Query: 3 AGKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++PE W + V + G T D+ +P +R+ N+ DL F Sbjct: 228 IGEIPEHWEVKKVKHFAIHVGSGKTPSGG--AEIYLDEGIPFLRSLNVHFDGIHLKDLAF 285 Query: 62 VPKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPE-KLI 117 + + + +E + ++ P DI++ ++ S +G++ F + ++R + Sbjct: 286 ISEEINEEMKTSQVQPLDILLNITGAS---IGRTTIVPKDFGRANVNQHVCIIRLNQNKV 342 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLL 176 + + S + +I G++ + A + IPP L EQ+ I E + Sbjct: 343 YPYYFNMLMASDVINQQIWFAQNGSSREGLNFAQVRELIFAIPPTLEEQREINEWIYNKQ 402 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 ++ + ++ + LK +RQ+++ AV GK+ + + Sbjct: 403 MKIFNLINLVKEQIEKLKEYRQSLIYEAVTGKIDVRELELD 443 >UniRef50_C5RH89 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium cellulovorans 743B RepID=C5RH89_CLOCL Length = 457 Score = 288 bits (738), Expect = 3e-76, Method: Composition-based stats. 
Identities = 95/454 (20%), Positives = 199/454 (43%), Gaps = 34/454 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE WV + + ++ L+ G T K Y +P I+ ++ G+ + + Sbjct: 23 EVPENWVWSNLKSIADLVTGNTPSKNNEEFY--GGKIPFIKPTDLNQGRILNSSTETLSN 80 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 +++ + + + +GK A+ L E + + P+K I++ ++ + Sbjct: 81 IGATKARILPKGSTAVCCIGAT---IGKVAY--LNVEGATNQQINSIIPKK-IYNLYVYY 134 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 +T SS + + + S+ + I + + IP+PPL EQ+ I +++ L ++D K Sbjct: 135 YTLSSYFHDTLIENSSSTTLPIINKSRMGELLIPLPPLKEQQRIVNRIENLFEKLDKAKE 194 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKW--------RNFEPQHSVFKKLNFESILTELR 236 E+ + ++ + A+ A G L + F +K E I ++ Sbjct: 195 LIEEAREGFEKRKAAITSKAFRGILNYRKGEKVNPINEGFYKLPYNWKWTKLEDICEKIT 254 Query: 237 NGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSES--ELNRHKLQDGDLLFTRY 293 +G + P G + + +++ +D + I ++ E R ++ GD+L+ + Sbjct: 255 DGTHNSPKSYEYGDYKYVTAKNIKEWGIDLSSITYVTKKEHIPIYKRCDVKYGDILYIKD 314 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 + G+ + + + +LL LIR K +Y+ +S + ++ VK Sbjct: 315 GATT---GIATINELTEEFSLLSSVALIRV--GKCIDNKYLYYILNSFEIKKRILESVK- 368 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 ++ K I ++ LPP++EQ EIV+ +++L ++ K++ ++N + +SI Sbjct: 369 GVAITRLTLKKINDIIIPLPPLEEQKEIVKILDKLLEE-ESKIKELTQLEDQINLIKKSI 427 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 LAKAFRG+L + SA LL+KI Sbjct: 428 LAKAFRGQLGTN--------CEEDESALELLKKI 453 Score = 125 bits (316), Expect = 2e-27, Method: Composition-based stats. 
Identities = 39/226 (17%), Positives = 77/226 (34%), Gaps = 10/226 (4%) Query: 195 RFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILR 254 + L + + + P++ V+ L + L + G P ++ Sbjct: 2 AKKNLTLEEKLEDAIVKDVPYEVPENWVWSNLKSIADLVTGNTPSKNNEEFYGGKIPFIK 61 Query: 255 ISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNL 314 + + G + + L + R L G + +G L N Sbjct: 62 PTDLNQGRILNSSTETLSNIGATKAR-ILPKGSTAVCCIGAT---IGKVAYLNVEGATNQ 117 Query: 315 LYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP 374 + K Y+ + S + ++ +++ I+ + ++ LPP Sbjct: 118 QINSII-----PKKIYNLYVYYYTLSSYFHDTLIEN-SSSTTLPIINKSRMGELLIPLPP 171 Query: 375 VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +KEQ IV R+E LF D ++ + A +I +KAFRG Sbjct: 172 LKEQQRIVNRIENLFEKLDKAKELIEEAREGFEKRKAAITSKAFRG 217 >UniRef50_B0TZ98 Type I restriction-modification system, subunit S n=1 Tax=Francisella philomiragia subsp. philomiragia ATCC 25017 RepID=B0TZ98_FRAP2 Length = 407 Score = 287 bits (736), Expect = 5e-76, Method: Composition-based stats. 
Identities = 84/427 (19%), Positives = 159/427 (37%), Gaps = 28/427 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 KLP GW + G K + + +P++ A N+ D + L F Sbjct: 3 ELYKLPAGWEWKKLGEECLFENGDRGKNYPSKSAFVSKGIPVVSATNLTGWSIDRSKLNF 62 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + KI DI+ + +GK A + ++R + + + Sbjct: 63 ITEERYNLIGGGKIKKNDILFCLR----GSLGKCALVTDIERGVIASSLVIIRTCENLSN 118 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ ++ S L ++ I+ + GA N+ + L NIP+PPLAEQK I KLD+L ++ Sbjct: 119 IFLMYYLNSHLIQDFINKYNNGAAQPNLSAKNLSLFNIPLPPLAEQKRIVAKLDSLFEKI 178 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 D +Q + L E ++S+ + + Sbjct: 179 DKAIELHQQNITNANTLMASTLDKTFKK--------LEGEYSLIPLHKITTAVGGGTPKR 230 Query: 240 SSKPNESGVGHPILRISSVRAG----HVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 + K L + + A ++ ++ + E S+ + L G +L++ Sbjct: 231 NIKEYWGNGEIVWLSPTDLGAIGEILNIRESRDKITELGLSKSSARLLPVGTVLYS---- 286 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 S +G + + N + + + ++ S + + + ++ Sbjct: 287 SRATIGKIAINEIEVCTNQGFTNFIC---DKDKIYNYFLAY---SLAKYTEEITSLSNST 340 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 K +S IK + LPP+ Q + V ++ + D I++ L + L SIL Sbjct: 341 TFKEVSKTSIKKFEIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILD 400 Query: 416 KAFRGEL 422 KAFRGEL Sbjct: 401 KAFRGEL 407 >UniRef50_Q21ZK2 Restriction modification system DNA specificity domain n=4 Tax=Bacteria RepID=Q21ZK2_RHOFD Length = 397 Score = 287 bits (736), Expect = 5e-76, Method: Composition-based stats. 
Identities = 82/416 (19%), Positives = 163/416 (39%), Gaps = 31/416 (7%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 +S G T + Q Y + +P I++ ++ + + L + S K Sbjct: 7 VTLSEFCATGSGGTPSRAQMERYYEGGTIPWIKSGELRETVINGAEEHVTDVALKESSIK 66 Query: 73 ISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGFIAHFTKSSL 130 + P I++AM + +G E + + P+ I + ++ H S + Sbjct: 67 LVPAGAILLAMYGATVGRLGILGI-----EATTNQAVCHIIPDPRIAVTRYVYHALSSQV 121 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 + S+ G NI + IP+P EQ+ IA LD A + Q+ Sbjct: 122 --PSLISMGVGGAQPNINQGIIKNLAIPLPAKPEQRRIAAILDQADALRAKRREALAQLD 179 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV-- 248 + + + G V+ + + + +G++ N +G Sbjct: 180 SLTQSIFIQMFGDPVSN------------PKGWPDATTLGQVANIASGVTKGRNLTGKVT 227 Query: 249 -GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 P L +++V+ ++ + ++ ++ +E E+ R+ L+ DLL T G + +G L K Sbjct: 228 RTIPYLAVANVQDKSLNLSAVKEIDATEDEIERYLLKWNDLLLTE-GGDPDKLGRGTLWK 286 Query: 308 KLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + ++ + + R R+T P ++ S + + K T+G I+ ++ Sbjct: 287 NELPE-CIHQNHIFRVRVTSQAVTPLFLNWLVGSQRGKKYFLRSAKQTTGIASINMTQLR 345 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 S +LLPPV+ Q + E + + +LA + L S+ +AFRGEL Sbjct: 346 SFPLLLPPVELQRDF----ETIAEVVAEQHAIHSVSLAELEALFVSLQHRAFRGEL 397 >UniRef50_Q4C702 Restriction modification system DNA specificity domain n=1 Tax=Crocosphaera watsonii WH 8501 RepID=Q4C702_CROWT Length = 408 Score = 287 bits (734), Expect = 8e-76, Method: Composition-based stats. 
Identities = 96/424 (22%), Positives = 179/424 (42%), Gaps = 32/424 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+ W + V + G P+I + N++ D +++ ++ + Sbjct: 10 LPQYWKWSKCQEVIDVRDG-----THDTPKYVSSGYPVITSKNLKTSGIDFSNVSYISEA 64 Query: 66 LVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 K + K+ DI++AM +G + E S + I+ + Sbjct: 65 DHKEISKRSKVDKGDILLAMI----GTIGNPVIVDIEKEFSIKNVALFKLSKSNIYPEYF 120 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + SS+ ++ G + + IP+PPL EQK IA+ LD Sbjct: 121 KYLLDSSIISRQLDFEQRGGTQKFVSLKVLRNLLIPLPPLEEQKRIAKILDKADEIRRKR 180 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 K ++L+ + G V P+ K L S + EL+ G +SK Sbjct: 181 KESIRLTDELLRSTFLDMFGDPV----------INPKGWEVKTL--GSQIKELKYGTNSK 228 Query: 243 PNE--SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 +E +LRI ++ + ND+++ E+++ L++GDLLF R NG+ +++ Sbjct: 229 CSELQKNNNIAVLRIPNIDNEKISWNDLKYTNLDSKEISKLLLKNGDLLFVRSNGNPDYI 288 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTK--DALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 G C + ++ ++ +Y LIR RL D P +I + P+ R+ ++ +TT+G Sbjct: 289 GRCAIFEEESNRKAVYASYLIRGRLKSICDFHPAFIRDIIAFPTFRSFLIREARTTAGNY 348 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ +++ S ++ PP +Q E ++ + + +L NL S+L KAF Sbjct: 349 NINIQELSSLKLICPPQDKQEE---YLD-ITTKINRSFLNKQKSLQESENLFNSLLQKAF 404 Query: 419 RGEL 422 +GEL Sbjct: 405 KGEL 408 >UniRef50_Q8RJG0 HsdS n=12 Tax=Campylobacter jejuni RepID=Q8RJG0_CAMJE Length = 417 Score = 286 bits (733), Expect = 1e-75, Method: Composition-based stats. 
Identities = 97/420 (23%), Positives = 179/420 (42%), Gaps = 15/420 (3%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + + ++ G T K Y KD P + ++ + G F + K Sbjct: 10 LPQGWEVKKLGEIGEIVTGSTPSKSNLDFYGKD--YPFFKPSDFEQGYFLENAGDNLSKL 67 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 +++++ P+ I++ +GK A + S + P K I S +I ++ Sbjct: 68 GFDKARQLPPKTILVVCI----GSLGKVALTRV--IGSCNQQINAIIPHKNIISEYIYYY 121 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTKA 184 SS +++ + S + + F + I P + EQ+ I LD A++D + Sbjct: 122 CISSKFQSILFSKAPQTTLAIFNKTEFSKLEIIYPKDIKEQERIVGILDESFAKIDESIK 181 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLSSKP 243 EQ L Q+ L A N N++ PQ +K L S L + +SK Sbjct: 182 ILEQDLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEISNLIQ-NGFAASKN 240 Query: 244 NESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 NE G+ LR ++ G+++ + + ++ + + ++ D+LF N + E VG Sbjct: 241 NEIPSGYVHLRTHNISTDGNLNFDTLIKIKREFIKEKQSFIEKNDILFNNTNST-ELVGK 299 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 L+ Q+ N + + L + +L + + +F GQ GI+ Sbjct: 300 TALV--TQNYNYAFSNHLTKIKLKNQYNSKLVVFYFVLLLKNKYFEKICHQWIGQSGINI 357 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +K + LPP+KEQ +I + ++ +F +++ L L QS+L KAF+GEL Sbjct: 358 DKLKKIQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLNKAFKGEL 417 Score = 135 bits (341), Expect = 3e-30, Method: Composition-based stats. 
Identities = 47/215 (21%), Positives = 88/215 (40%), Gaps = 15/215 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDL 59 KLP+GW + ++ LI G K I +R +NI +G + L Sbjct: 211 ENYKLPQGWEWKSLGEISNLIQNGFAASKNNEIP----SGYVHLRTHNISTDGNLNFDTL 266 Query: 60 VFVPKNLVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + + +KE Q I DI+ ++ + +VGK+A + +F ++ + Sbjct: 267 IKIKREFIKEKQSFIEKNDILFNNTNST-ELVGKTALVTQNYNYAFSNHLTKIKLKNQYN 325 Query: 119 SG----FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 S + K+ + + I I IP+PPL EQ+ IA+ LD Sbjct: 326 SKLVVFYFVLLLKNKYFEKICHQW---IGQSGINIDKLKKIQIPLPPLKEQEQIAKHLDF 382 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + + + K + + + + +Q++L A G+L Sbjct: 383 VFEKTKALKELYTKELKDYEELKQSLLNKAFKGEL 417 >UniRef50_C9Q5S0 Possible type I restriction-modification system S subunit n=1 Tax=Vibrio sp. RC341 RepID=C9Q5S0_9VIBR Length = 469 Score = 286 bits (733), Expect = 1e-75, Method: Composition-based stats. Identities = 89/442 (20%), Positives = 163/442 (36%), Gaps = 27/442 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK---FDTTD--LV 60 +P W+ + + + +G+T KE L+D +P + + + D L Sbjct: 21 IPAHWLTSKLRYTFSFGKGLTITKEN----LRDTGIPCVSYGEVHSKYGFEIDPARHPLK 76 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 V + +K + DIV A +S G + G + RP Sbjct: 77 CVGDDYLKTSPYALLKKGDIVFADTSEDIDGSGNFTQLVSNEQVFAGYHTIIARPYNHEC 136 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F A+ S R +I G + +I A +NI +PPL E+ IA LD A+ Sbjct: 137 SRFYAYLLDSKELRTQIRHAVKGVKVFSITQAILRGVNIWLPPLKERNQIANFLDHETAK 196 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D+ + +Q+ ++LK RQAV+ AV L +W P+H L Sbjct: 197 IDTLIEKQQQLIKLLKEKRQAVVSHAVTKGLNPQAPMKDSGVEWLGEVPEHWSISPLKHH 256 Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLL 289 S N G P +R +++ + + DI + + R L DG+L+ Sbjct: 257 VNTVNGFGFSS--NNFQDEGVPFIRAGNIKNKTIVKPDIHLPQAVVDKYQRVILNDGELV 314 Query: 290 
FTRYNGSLE----FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 + + VG GL+ + ++ R L +++ R+ Sbjct: 315 ISMVGSDPKIKASAVGQVGLVP-PSLAGSVPNQNVVILREQSSLLKKFLFYVVCGTPYRH 373 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + + Q IS I P + EQ EIV ++ D + ++ ++ Sbjct: 374 HLDVFSHKLANQSIISSSLIICAQFTFPELDEQKEIVDFLDTQLRKYDWLMEKATRSIEF 433 Query: 406 VNNLTQSILAKAFRGELTAQWR 427 +N ++++ G++ + Sbjct: 434 MNERKTALISATVTGKIDVRNW 455 Score = 144 bits (365), Expect = 6e-33, Method: Composition-based stats. Identities = 40/238 (16%), Positives = 86/238 (36%), Gaps = 18/238 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQND---- 267 W P H + KL + + + +K N G P + V + + + D Sbjct: 16 DWLETIPAHWLTSKLRY--TFSFGKGLTITKENLRDTGIPCVSYGEVHSKYGFEIDPARH 73 Query: 268 -IRFLECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 ++ + + + + L+ GD++F + ++ G L + Y I AR Sbjct: 74 PLKCVGDDYLKTSPYALLKKGDIVFADTSEDIDGSGNFTQLVSNEQVFAGY--HTIIARP 131 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + S R + + VK I+ ++ + LPP+KE+ +I + Sbjct: 132 YNHECSRFYAYLLDSKELRTQIRHAVK-GVKVFSITQAILRGVNIWLPPLKERNQIANFL 190 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + A DT+ ++ + + Q++++ A + NP ++ L Sbjct: 191 DHETAKIDTLIEKQQQLIKLLKEKRQAVVSHAVT-------KGLNPQAPMKDSGVEWL 241 Score = 123 bits (310), Expect = 1e-26, Method: Composition-based stats. 
Identities = 50/215 (23%), Positives = 91/215 (42%), Gaps = 13/215 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W I+P+ + G + +D+ +P IRA NI+N D+ +P Sbjct: 242 GEVPEHWSISPLKHHVNTVNGFGFSSNN----FQDEGVPFIRAGNIKNKTIVKPDI-HLP 296 Query: 64 KNLVKESQKISPED--IVIAMSSGSKSV----VGKSAHQHLPFECSF-GAFCGVLRPEKL 116 + +V + Q++ D +VI+M + VG+ S +LR + Sbjct: 297 QAVVDKYQRVILNDGELVISMVGSDPKIKASAVGQVGLVPPSLAGSVPNQNVVILREQSS 356 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + F+ + + YR+ + S AN + I + P L EQK I + LDT Sbjct: 357 LLKKFLFYVVCGTPYRHHLDVFSHKLANQSIISSSLIICAQFTFPELDEQKEIVDFLDTQ 416 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 L + D + + + + + A++ V GK+ Sbjct: 417 LRKYDWLMEKATRSIEFMNERKTALISATVTGKID 451 >UniRef50_A5GE25 Restriction endonuclease S subunits-like protein n=2 Tax=Proteobacteria RepID=A5GE25_GEOUR Length = 443 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. Identities = 81/441 (18%), Positives = 166/441 (37%), Gaps = 34/441 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDLVFV 62 G +P W + + ++T+ RG + + Y D+ R ++ + + Sbjct: 20 GDVPSHWEVIQIKHLSTVRRGASPRPIDDAKYFDDEGEYAWTRIADVTASEMYLFNAPQR 79 Query: 63 PKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 +L S K+ P + +++ VGK + C F V PE I S F Sbjct: 80 LSDLGSSLSVKLEPGALFLSI----AGTVGKPCITGMK-ACIHDGF--VYFPELKIPSKF 132 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + L N+ + I I ++ + I + LD A++D+ Sbjct: 133 LFYVF---AGEQAYKGLGKFGTQLNLNTDTVGGIKIGCTENSQLEKIVQFLDHETAKIDT 189 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + +Q+ ++LK RQAV+ AV L +W P+H F Sbjct: 190 LIDKQQQLIKLLKEKRQAVISHAVTKGLNPDAPMKDSGVEWLGEVPEHWDVCLAKF--KT 247 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLL 289 + +G P+ H + I + G ++ D + K + GD+L Sbjct: 248 HAITDGAHISPDTKNGEHYFVSIKDMCDGLINFEDALLTSKESYKYLVNTGCKPEPGDIL 307 Query: 290 
FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMM 348 F++ +G + + + + + LI + K P++ + S + + Sbjct: 308 FSKDG----TIGKTVVTPE--NVDFVVASSLIIIKPNLKKLSPQFFDYLCQSCVIQEQVN 361 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + VK + K +S +++ + PP+ EQ I + +++ IE+ NNA+A + Sbjct: 362 SFVK-GAALKRLSIQNLLKVWGVFPPLDEQVVIAKHIDKKLIRYQQIEQTANNAIALMQE 420 Query: 409 LTQSILAKAFRGELTAQWRAE 429 ++++ A G++ + Sbjct: 421 RRTALISAAVTGKIDVRDWQP 441 Score = 118 bits (297), Expect = 4e-25, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 74/236 (31%), Gaps = 23/236 (9%) Query: 211 EKWRNFEPQHSVFKKLNFESILTELRNG---LSSKPNESGVGHPILRISSVRAGHVDQND 267 E+W P H ++ S + + +K + + RI+ V A + + Sbjct: 16 EEWLGDVPSHWEVIQIKHLSTVRRGASPRPIDDAKYFDDEGEYAWTRIADVTASEMYLFN 75 Query: 268 IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 KL+ G L + VG + + ++ + L Sbjct: 76 APQRLSDLGSSLSVKLEPGALFLSIAG----TVGKPCI---TGMKACIHDGFVYFPELK- 127 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 +++ F+ A + Q ++ + + + +IV+ ++ Sbjct: 128 -IPSKFLFYVFAGEQAYKGLGKF----GTQLNLNTDTVGGIKIGCTENSQLEKIVQFLDH 182 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A DT+ + + + Q++++ A + NPD ++ L Sbjct: 183 ETAKIDTLIDKQQQLIKLLKEKRQAVISHAVT-------KGLNPDAPMKDSGVEWL 231 >UniRef50_UPI00016B0992 probable type I restriction-modification system n=1 Tax=Burkholderia pseudomallei BCC215 RepID=UPI00016B0992 Length = 442 Score = 286 bits (732), Expect = 1e-75, Method: Composition-based stats. 
Identities = 93/441 (21%), Positives = 165/441 (37%), Gaps = 36/441 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVT-YKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P W + + R EQ K +P Q + D V Sbjct: 18 GRVPTSWAVVQARRLFEQRRDAALPGDEQLSASQKYGVVP-------QRLFMELEDQKVV 70 Query: 63 -PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + ++ + + P D VI++ S +H F VLR I F Sbjct: 71 LALSGLENFKHVEPNDFVISLRSFQGG------IEHSAFGGCVSPAYTVLRATSKIAPDF 124 Query: 122 IAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 A+ KS Y + + +++ G NI F + +P+P + EQ IA LD ++D Sbjct: 125 WAYLLKSDTYISALQTVTDGIRDGKNISYMQFGALCVPVPNIDEQSAIAAFLDCETGKID 184 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESI 231 + A E++ +L RQA L AV L W P H V +++ S+ Sbjct: 185 ALIAEQEKLIALLAEKRQAALSYAVTRGLNPDAPMKDSGVAWLGEVPAHWVIRRVKSVSV 244 Query: 232 L-TELRNGLSSKPNESGVGHPILRISSVRAG-HVDQNDIRFLECSES-ELNRHKLQDGDL 288 T G S + S G ++ + V+ + + E R +L +GD+ Sbjct: 245 FMTSGPRGWSER--ISDEGSIFVQSGDLNDFLGVEFEIAKRVSVEFDAEAERTRLANGDV 302 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 + V VC + + L R + D LP ++ S + Sbjct: 303 VVCITGAKTGKVAVCASVPE----PAYVNQHLCLIRPSPDVLPLFLGNSLKSTIGQTQF- 357 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + Q G+S +++ +++LPP EQ EIV ++ A D ++ + A+ + Sbjct: 358 ELSQYGLKQ-GLSLDNVREALIVLPPPGEQVEIVTFIDAETARLDELKAEAARAIELLKE 416 Query: 409 LTQSILAKAFRGELTAQWRAE 429 +++A A G++ + A Sbjct: 417 RRSALIAAAVTGKIDVRNAAP 437 Score = 107 bits (269), Expect = 6e-22, Method: Composition-based stats. 
Identities = 31/231 (13%), Positives = 70/231 (30%), Gaps = 19/231 (8%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLE 272 W P + + + + ++ + ++ D + + Sbjct: 16 WLGRVPTSWAVVQARRLFEQRRDAALPGDEQLSASQKYGVVP----QRLFMELEDQKVVL 71 Query: 273 CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE 332 N ++ D + + + G ++ + P + R T P+ Sbjct: 72 ALSGLENFKHVEPNDFVISLRS-------FQGGIEHSAFGGCVSPAYTV-LRATSKIAPD 123 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 + S + +A+ K IS + V +P + EQ+ I ++ Sbjct: 124 FWAYLLKSDTYISALQTVTDGIRDGKNISYMQFGALCVPVPNIDEQSAIAAFLDCETGKI 183 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + +A + Q+ L+ A R NPD ++ A L Sbjct: 184 DALIAEQEKLIALLAEKRQAALSYAVT-------RGLNPDAPMKDSGVAWL 227 >UniRef50_D1YNY9 Type I restriction modification DNA specificity domain protein n=1 Tax=Veillonella parvula ATCC 17745 RepID=D1YNY9_9FIRM Length = 427 Score = 285 bits (731), Expect = 2e-75, Method: Composition-based stats. 
Identities = 87/431 (20%), Positives = 175/431 (40%), Gaps = 35/431 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P+ W + + +L + K + D P + + Sbjct: 13 GMIPKSWD---LDKIVSLYSERSTK-------VSDKDYPALS---VTKQGIVPQLESAAK 59 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + I D VI S + G S +E S VL P+ + + + Sbjct: 60 TDNGDNRKLIKKNDFVINSRSDRRGSCGIS-----EYEGSCSLINIVLAPKNNMVNRYYN 114 Query: 124 HFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + K+ L+ ++ G + + K ++ I +P P L EQ+ IAE LDT AQ+D+ Sbjct: 115 YLFKTELFADEFYKWGNGIVDDLWSTKWSNMKNIMVPFPSLEEQQAIAEHLDTKCAQIDT 174 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A+ + + + L+ +++A++ AV L +W + P H K+L F + + Sbjct: 175 IIAKEQSVIEKLQEYKRAIITYAVVKGLDITAETADSGIEWIDSIPSHWKIKRLIFSAYI 234 Query: 233 TELRNGLSSKP-NESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLF 290 K + GHP L +++ + D+ F+ E KL+ GDLL Sbjct: 235 RARLGWKGLKADEYTSEGHPFLSAVNIQNDKLVWEDLNFINDDRYDESPEIKLEIGDLLL 294 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 + +G C ++ +L + L + Y+ FF S +N + + Sbjct: 295 VKDGAG---IGKCAVVDQLPYGTATTNSSLGVITPYPELNSMYLYYFFESAIFQNYI-SR 350 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 +K G ++ ++K+ +V++PP EQ IV +++ A D++ + + + ++ Sbjct: 351 IKNGMGVPHLTQGNLKNIMVIIPPYCEQEAIVTYLDEKCANLDSVILRKQSRIDKLTEYK 410 Query: 411 QSILAKAFRGE 421 +S++ + G+ Sbjct: 411 KSLIYEVVTGK 421 Score = 152 bits (385), Expect = 2e-35, Method: Composition-based stats. 
Identities = 50/210 (23%), Positives = 91/210 (43%), Gaps = 10/210 (4%) Query: 6 LPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +P W I + + + IR K + + P + A NIQN K DL F+ Sbjct: 219 IPSHWKIKRL--IFSAYIRARLGWKGLKADEYTSEGHPFLSAVNIQNDKLVWEDLNFIND 276 Query: 65 NLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPEKLIFSG 120 + ES K+ D+++ +GK A + + GV+ P + S Sbjct: 277 DRYDESPEIKLEIGDLLLVKDGAG---IGKCAVVDQLPYGTATTNSSLGVITPYPELNSM 333 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ +F +S++++N IS + G + ++ + I + IPP EQ+ I LD A +D Sbjct: 334 YLYYFFESAIFQNYISRIKNGMGVPHLTQGNLKNIMVIIPPYCEQEAIVTYLDEKCANLD 393 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 S R + L ++++++ V GK Sbjct: 394 SVILRKQSRIDKLTEYKKSLIYEVVTGKKE 423 Score = 105 bits (263), Expect = 4e-21, Method: Composition-based stats. Identities = 30/233 (12%), Positives = 77/233 (33%), Gaps = 27/233 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+ K+ S+ +E +S K +P L ++ + G V + Sbjct: 10 RWLGMIPKSWDLDKI--VSLYSERSTKVSDK------DYPALSVT--KQGIVP--QLESA 57 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 +++ NR ++ D + + G+ N++ + + Sbjct: 58 AKTDNGDNRKLIKKNDFVINSRSDRRGSCGISEYEGSCSLINIVLA-------PKNNMVN 110 Query: 332 EYIEIFFSSPSARNAMMNCVKTTS-GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 Y F + + ++K+ +V P ++EQ I ++ A Sbjct: 111 RYYNYLFKTELFADEFYKWGNGIVDDLWSTKWSNMKNIMVPFPSLEEQQAIAEHLDTKCA 170 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 DTI + + + ++ ++I+ A + + + ++ + Sbjct: 171 QIDTIIAKEQSVIEKLQEYKRAIITYAV-------VKGLDITAETADSGIEWI 216 >UniRef50_A1RES4 Restriction modification system DNA specificity domain n=1 Tax=Shewanella sp. W3-18-1 RepID=A1RES4_SHESW Length = 417 Score = 284 bits (728), Expect = 4e-75, Method: Composition-based stats. 
Identities = 74/429 (17%), Positives = 171/429 (39%), Gaps = 25/429 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P+GW++ V + L G T Q Y ++ +P + + + + + D Sbjct: 4 RVPDGWMLKIVRDTSKLSAGGTPST-QVTEYWENGTIPWMSSGEVHKKRVHSVDNCITTL 62 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 L S K+ P ++ +G G A + + + ++ +K ++ F+ H Sbjct: 63 GLENSSAKMFPSKSILVALAGQGKTRGTVAISEIEL-TTNQSIAAIIVKDKSVYPDFLYH 121 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S ++ +S G+ + A +++ +PPL EQ+ IA+ L ++ ++ T+A Sbjct: 122 NLDSRY--EELRGVSGGSGRAGLNLAILGDLDVLLPPLPEQQKIAKILTSVDQVIEKTQA 179 Query: 185 RFEQIPQILKRFRQAVLGG--AVNGKLTEKWRNFEPQHSV---FKKLNFESILTELRNGL 239 + +++ + Q +L V+GK +++ P + + + T + G Sbjct: 180 QIDKLKDLKTGMMQELLTQGVGVDGKPHTEFK-DSPVGWIPKTWDLEPLANFTTFISYGF 238 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL---NRHKLQDGDLLFTRYNGS 296 ++ E+ VG ++ V V + R + + + Q D+L T+ Sbjct: 239 TNPMPEAEVGPYMITAKDVNDLKVQYSTSRKTTQEAFDNLLTRKSRPQVNDILLTKDG-- 296 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 +G L+ N + + +P+++ +SP + M+ S Sbjct: 297 --TLGRVALV---TDSNCCINQSVAVLTPNERVIPKFLLYLLASPRYQQEMLENA-GGST 350 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 K I + +V +P V EQ ++V + +F + N L+++N+ ++++ Sbjct: 351 IKHIYITVVDKMLVGVPSVTEQQKLVDIFDSVFRKLE----LTENKLSKLNDTKKALMQD 406 Query: 417 AFRGELTAQ 425 G++ Sbjct: 407 LLTGKVRVN 415 Score = 141 bits (355), Expect = 7e-32, Method: Composition-based stats. 
Identities = 38/209 (18%), Positives = 83/209 (39%), Gaps = 15/209 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +P+ W + P++ TT I G T +A + +I A ++ + K + Sbjct: 215 VGWIPKTWDLEPLANFTTFISYGFTNPMPEA-----EVGPYMITAKDVNDLKVQYSTSRK 269 Query: 62 VPKNLVK----ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 + + DI++ +G+ A C VL P + + Sbjct: 270 TTQEAFDNLLTRKSRPQVNDILLTKD----GTLGRVALVT-DSNCCINQSVAVLTPNERV 324 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ + S Y+ ++ + G+ I +I D + + +P + EQ+ + + D++ Sbjct: 325 IPKFLLYLLASPRYQQEMLENAGGSTIKHIYITVVDKMLVGVPSVTEQQKLVDIFDSVFR 384 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVN 206 +++ T+ + ++ K Q +L G V Sbjct: 385 KLELTENKLSKLNDTKKALMQDLLTGKVR 413 Score = 124 bits (313), Expect = 5e-27, Method: Composition-based stats. Identities = 29/231 (12%), Positives = 66/231 (28%), Gaps = 17/231 (7%) Query: 215 NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLEC 273 P + K + S L+ + G P + V V D Sbjct: 3 ERVPDGWMLKIVRDTSKLSAGGTPSTQVTEYWENGTIPWMSSGEVHKKRVHSVDNCITTL 62 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPE 332 + +L G + L + + P+ Sbjct: 63 GLENSSAKMFPSKSILVALAGQGKTR-GTVAI----SEIELTTNQSIAAIIVKDKSVYPD 117 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 ++ S + V SG+ G++ + VLLPP+ EQ +I + + Sbjct: 118 FLYHNLDSRY---EELRGVSGGSGRAGLNLAILGDLDVLLPPLPEQQKIAKIL----TSV 170 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + ++ +L ++ + + + P ++ + Sbjct: 171 DQVIEKTQAQIDKLKDLKTGMMQELLTQGVGVDGK---PHTEFKDSPVGWI 218 >UniRef50_A3J6X3 Type I restriction-modification system, S subunit n=1 Tax=Flavobacteria bacterium BAL38 RepID=A3J6X3_9FLAO Length = 450 Score = 284 bits (727), Expect = 6e-75, Method: Composition-based stats. 
Identities = 79/442 (17%), Positives = 176/442 (39%), Gaps = 33/442 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++PE W V + G T ++ + +P + + +QN + + + + Sbjct: 17 EIPENWDYCKVKHIANTYAGGTPSTV-VDSFWHNGDIPWLPSGKLQNCEIISAEKFITNE 75 Query: 65 NLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L+ S K I P +++A++ + + +G Q C+ + V + S F+ Sbjct: 76 GLIGSSTKWIKPNTVLVALTGATCANIGYLTFQ----ACANQSVIAVDENPEKANSRFLY 131 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + R++I + G I + + + P L EQ IA+ LD +D+T Sbjct: 132 YMFLNM--RSQILTHQTGGAQAGINDSDVKNLYLLNPSLEEQIKIADYLDYKTNLIDATI 189 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE----S 230 + +++ ++LK RQAV+ AV L +W P++ KK+ + + Sbjct: 190 EKKKRLIELLKEKRQAVINEAVTKGLNPNAPMKDSGLEWLGEIPENWEVKKVKYLLSSEN 249 Query: 231 ILTELRNGLSSK-PNESGVGHPILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQDGD 287 + G + K + G I +V R+++ E + ++++ DGD Sbjct: 250 GIKIGPFGSALKLDTLTDNGIKIYGQGNVIKDDFTLG-HRYIDPERFEKDFKQYEILDGD 308 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFF--SSPSARN 345 +L T + G + + + L+R R +D + S Sbjct: 309 ILITMMGTT----GKSKVFNSSYEKG-ILDSHLLRLRFNEDLFDGRLFSILLEQSDYVFQ 363 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + S G++ +K +++ P ++ Q EI+ +++ D I ++ + + + Sbjct: 364 QLA-LNSVGSIMAGLNSSIVKELIIITPKLEIQKEILNYIDENCKIIDIISSKILSQIEK 422 Query: 406 VNNLTQSILAKAFRGELTAQWR 427 + QS++++A G++ + Sbjct: 423 LQTYRQSLISEAVTGKIDVREW 444 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. 
Identities = 29/232 (12%), Positives = 76/232 (32%), Gaps = 17/232 (7%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFL 271 W P++ + K+ + + + G P L ++ + + Sbjct: 14 WYPEIPENWDYCKVKHIANTYAGGTPSTVVDSFWHNGDIPWLPSGKLQNCEIISAEKFIT 73 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + ++ +L + +G L Q + + A Sbjct: 74 NEGLIGSSTKWIKPNTVLVALTGATCANIG------YLTFQACANQSVIAVDENPEKANS 127 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ F + R+ ++ + Q GI+ D+K+ +L P ++EQ +I ++ Sbjct: 128 RFLYYMFLN--MRSQILTHQTGGA-QAGINDSDVKNLYLLNPSLEEQIKIADYLDYKTNL 184 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D ++ + + Q+++ +A + NP+ ++ L Sbjct: 185 IDATIEKKKRLIELLKEKRQAVINEAVT-------KGLNPNAPMKDSGLEWL 229 Score = 115 bits (290), Expect = 3e-24, Method: Composition-based stats. Identities = 43/215 (20%), Positives = 86/215 (40%), Gaps = 12/215 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRG---VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++PE W + V + + G + ++ L D+ + + N+ F Sbjct: 230 GEIPENWEVKKVKYLLSSENGIKIGPFGSALKLDTLTDNGIKIYGQGNVIKDDFTLGHRY 289 Query: 61 FVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLI 117 P+ K + +I DI+I M + GKS + +E + LR + + Sbjct: 290 IDPERFEKDFKQYEILDGDILITMMGTT----GKSKVFNSSYEKGILDSHLLRLRFNEDL 345 Query: 118 FSGFIAHFT--KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 F G + +S +++ S G+ + + + + I P L QK I +D Sbjct: 346 FDGRLFSILLEQSDYVFQQLALNSVGSIMAGLNSSIVKELIIITPKLEIQKEILNYIDEN 405 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +D ++ + L+ +RQ+++ AV GK+ Sbjct: 406 CKIIDIISSKILSQIEKLQTYRQSLISEAVTGKID 440 >UniRef50_C6A4W8 Putative type I specificity subunit HsdS n=1 Tax=Thermococcus sibiricus MM 739 RepID=C6A4W8_THESM Length = 434 Score = 283 bits (726), Expect = 7e-75, Method: Composition-based stats. 
Identities = 90/421 (21%), Positives = 171/421 (40%), Gaps = 23/421 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVP 63 +LPEGW + + L G T + + Y ++ +P ++ ++I +G + T+ Sbjct: 34 ELPEGWRWVRLGDIAELKAGGTPSR-RVKEYWENGTIPWVKISDIPDSGLVEKTEEKITE 92 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L S K+ ++ + S VG L + + P+ I G++ Sbjct: 93 LGLKNSSAKLLSPGTILFSIFATISKVGI-----LKIPAATNQAIVGIIPKISIDRGYLF 147 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + K + ++ G +NI + IP+PP+ EQK I KLD + +++ K Sbjct: 148 YSLK--YFGQELVYQGRGGVQDNINMRILSKLKIPLPPIEEQKRIVAKLDEVHRRLEEAK 205 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + + +R + L + E + + + S + +K Sbjct: 206 RLAREAREEAERLMASALHEVFSK--------AEEKGWEWTTIGKVSREMKPGF-ARNKK 256 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGV 302 + S G P LR ++V G ++ I + + + + L+ GD+LF N S E VG Sbjct: 257 HISRDGVPHLRPNNVDVGRLNLKKIVKVTLDDKINIEEYYLKKGDVLFNNTN-SFELVGR 315 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 ++ + Y + + R R+ K+ LPE++ + + + GQ G++ Sbjct: 316 AAIVPEDLKYG--YSNHITRIRVKKEVILPEWLTLAINYLWMQGYFREVCTRWVGQAGVN 373 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + + LP ++EQ IV ++ + A + K + L +IL KAFRGE Sbjct: 374 MNTLAKTRIPLPSLEEQKRIVSYLDSIQERAQKLVKLYEEREKELEKLFPAILDKAFRGE 433 Query: 422 L 422 L Sbjct: 434 L 434 >UniRef50_C5C353 Restriction modification system DNA specificity domain protein n=1 Tax=Beutenbergia cavernae DSM 12333 RepID=C5C353_BEUC1 Length = 427 Score = 283 bits (725), Expect = 1e-74, Method: Composition-based stats. 
Identities = 81/413 (19%), Positives = 160/413 (38%), Gaps = 33/413 (7%) Query: 31 QAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE--SQKISPEDIVIAMSSGSKS 88 ++ + D + L++ ++ +GKF ++ + + + P DI+IA Sbjct: 23 ESKDQDPDGDIRLLQLADVGDGKFKDKSDRWINEETFRRLRCSWVHPGDILIARM---PD 79 Query: 89 VVGKSAHQHLPF-ECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLYRNKISSLSAGANINN 146 +G++ + VLRP+ +G++ + S+ R+++ GA Sbjct: 80 PLGRACVVPEGLGKTITVVDVAVLRPDPDQADAGYLTYAINSAKTRSEVERQQDGATRQR 139 Query: 147 IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVN 206 I ++IP+PPL EQ+ IA+ LD Q+D+ A E++ +LK R + + AV Sbjct: 140 IPRKRLGRVSIPLPPLEEQRRIADFLDAETTQIDALIAEQERLIGLLKERRASGILQAVT 199 Query: 207 GKL--------TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES-GVGHPILRISS 257 L T W + P H + + + S P P ++ Sbjct: 200 RGLRDVDLKPSTLTWVDAVPLHWTVANIRRFAAMKTGHTPSRSNPEYWVDTHIPWFTLAD 259 Query: 258 ---VRAG---HVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQH 311 VR G H+ + + + + L G ++ +R VG G++ Sbjct: 260 VWQVRDGRRTHLGETENTISDLGLANSAAELLPAGTVVLSRT----ASVGFSGVMP---- 311 Query: 312 QNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVL 371 + + + +PEY+ F + R N + S K I + V Sbjct: 312 RPMATSQDFWNWVCGPELVPEYLMYLFR--AMRGEF-NALMIGSTHKTIYQPVAAAIRVP 368 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 +PP++EQ EIV R+++ D + + + +A +++ A G++ Sbjct: 369 VPPLEEQHEIVARIDERTRKTDALINEAEHNIALSKERRAALITAAVTGQIDV 421 Score = 134 bits (338), Expect = 7e-30, Method: Composition-based stats. 
Identities = 37/211 (17%), Positives = 78/211 (36%), Gaps = 14/211 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGK---FDTTDL 59 +P W +A + + G T + Y D ++P + +++G+ T+ Sbjct: 218 VPLHWTVANIRRFAAMKTGHTPSRSN-PEYWVDTHIPWFTLADVWQVRDGRRTHLGETEN 276 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 L + ++ P V+ + S G +P + + Sbjct: 277 TISDLGLANSAAELLPAGTVVLSRTASVGFSG-----VMPRPMATSQDFWNWVCGPELVP 331 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + ++ R + ++L G+ I I +P+PPL EQ I ++D + Sbjct: 332 EYLMYLFRAM--RGEFNALMIGSTHKTIYQPVAAAIRVPVPPLEEQHEIVARIDERTRKT 389 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 D+ E + K R A++ AV G++ Sbjct: 390 DALINEAEHNIALSKERRAALITAAVTGQID 420 >UniRef50_C5SDH7 Putative uncharacterized protein n=1 Tax=Allochromatium vinosum DSM 180 RepID=C5SDH7_CHRVI Length = 453 Score = 283 bits (724), Expect = 1e-74, Method: Composition-based stats. Identities = 80/440 (18%), Positives = 166/440 (37%), Gaps = 30/440 (6%) Query: 4 GKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLV 60 G++PE W++ + I G+ + +P IR + + DL Sbjct: 18 GEVPEHWILDRLKWSVEGCINGLWGDDPNGEDV-----IPCIRVADFDRAKNRVRAEDLT 72 Query: 61 FVPKNLVKE-SQKISPEDIVIAMSSGSKS-VVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + K ++ + D++I S G + VG F + Sbjct: 73 YRSISEEKRLNRSLKNGDLLIEKSGGGDNQPVGVVVLFDHNLNAVCSNFVARMPVRSNFS 132 Query: 119 SGFIAHFTKSSLYRNKISSLS--AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 F+ + S LY ++++ S I N+ AS+ IP + EQ +IA+ LD Sbjct: 133 PRFLCY-LHSVLYALRLNTKSIKQNTGIQNLDSASYLDERFGIPTVYEQGLIADFLDRET 191 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 A++D+ A +++ ++LK RQAV+ AV L +W P+H V L Sbjct: 192 AKIDALIAEQQRLVELLKEKRQAVISHAVTKGLNPDAPMKDSGIEWLGEVPEHWVIVPLK 251 Query: 228 FE-SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQD 285 + ++ G+ G PI++ VR + + + E+ R +L+ Sbjct: 252 HLTAPGRDIMYGIVLPGPNVDNGVPIVKGGDVRPHRLRLELLNRTTEAIEAPYARARLRP 311 Query: 286 
GDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARN 345 D++++ +G L+ + D + R + ++ S Sbjct: 312 SDIVYSIRGS----IGDAELVPDELLDANITQD-VARISPDQTVNSLWLLFVMKSVRVFV 366 Query: 346 AMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALAR 405 + + +GI+ D+K + P ++EQ I +++ D + + A+ Sbjct: 367 QLEQR-SLGAAVRGINIFDLKRARIPFPDIQEQKTIATFLDRETTKLDALTAEAQTAITL 425 Query: 406 VNNLTQSILAKAFRGELTAQ 425 + ++++ A G++ + Sbjct: 426 LQERRTALISAAVTGKIDVR 445 Score = 143 bits (362), Expect = 1e-32, Method: Composition-based stats. Identities = 43/235 (18%), Positives = 84/235 (35%), Gaps = 14/235 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR--AGHVDQNDIR 269 +W P+H + +L + NGL P +R++ V D+ Sbjct: 15 EWLGEVPEHWILDRLK--WSVEGCINGLWGDDPNGEDVIPCIRVADFDRAKNRVRAEDLT 72 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEF-VGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + SE + L++GDLL + G VGV L N + + + R + + Sbjct: 73 YRSISEEKRLNRSLKNGDLLIEKSGGGDNQPVGVVVLFDHNL--NAVCSNFVARMPVRSN 130 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P ++ S A +K +G + + + +P V EQ I +++ Sbjct: 131 FSPRFLCYLHSVLYALRLNTKSIKQNTGIQNLDSASYLDERFGIPTVYEQGLIADFLDRE 190 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + + + Q++++ A + NPD ++ L Sbjct: 191 TAKIDALIAEQQRLVELLKEKRQAVISHAVT-------KGLNPDAPMKDSGIEWL 238 >UniRef50_B5VW93 Restriction modification system DNA specificity domain n=1 Tax=Arthrospira maxima CS-328 RepID=B5VW93_SPIMA Length = 407 Score = 283 bits (724), Expect = 1e-74, Method: Composition-based stats. 
Identities = 96/423 (22%), Positives = 169/423 (39%), Gaps = 33/423 (7%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ---NGKFDTTDLVFVPK 64 +GW I + + + K+ D +P R I+ NGK +T+L Sbjct: 2 KGWDIVALEDLGKITSSKRIFKKD----YVDSGIPFYRTKEIKELANGKEVSTELFISRD 57 Query: 65 NLVKESQKI---SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + K S D++I VG+ LR K I F Sbjct: 58 SFNEIKAKFGTPSVGDLLITAI----GTVGEIYVVDRTDFYFKDGNVLWLRDFKAIEPNF 113 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + ++I+SLS G+ + I P ++EQK I LD +D+ Sbjct: 114 LKYAL--IAFVDEINSLSHGSTYKALPIEKLKKHKIYKPSISEQKRIVAILDEAFEGIDA 171 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 A ++ + ++ L G K + + I ++ G SS Sbjct: 172 AIANTQKNLANARELFESYLNGIFTRK-----------GDGWVEKKLGEICHKVEYGSSS 220 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 K G P++R+ +++ +D D+ + + E+NR+ LQ D+LF R N S + VG Sbjct: 221 KSQPEGD-IPVIRMGNIQNNMIDWTDLVYTS-NPDEINRYLLQYNDVLFNRTN-SADHVG 277 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 + K + ++ LIR KD P+++ + + R + + + Q I Sbjct: 278 KSAIYKG--EKPAIFAGYLIRVHYKKDVIDPDFLNFYLNCYKTREYGKSVMSRSVNQVNI 335 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +G +K+ + P + Q +I++++ LF +E L + L QSIL KAF G Sbjct: 336 NGTKLKNYPIYHPDLYTQKQIIKKLYFLFRETQRLETIYRRKLEALQELKQSILQKAFTG 395 Query: 421 ELT 423 ELT Sbjct: 396 ELT 398 Score = 150 bits (379), Expect = 1e-34, Method: Composition-based stats. 
Identities = 50/212 (23%), Positives = 82/212 (38%), Gaps = 8/212 (3%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 +GWV + + + + K Q + +P+IR NIQN D TDLV+ Sbjct: 200 DGWVEKKLGEICHKVEYGSSSKSQP-----EGDIPVIRMGNIQNNMIDWTDLVYTSNPDE 254 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAHFT 126 + D++ ++ S VGKSA F + + +K I F+ + Sbjct: 255 INRYLLQYNDVLFNRTN-SADHVGKSAIYKGEKPAIFAGYLIRVHYKKDVIDPDFLNFYL 313 Query: 127 KSSLYRNKISS-LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 R S +S N NI I P L QK I +KL L + + Sbjct: 314 NCYKTREYGKSVMSRSVNQVNINGTKLKNYPIYHPDLYTQKQIIKKLYFLFRETQRLETI 373 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 + + + L+ +Q++L A G+LT + Sbjct: 374 YRRKLEALQELKQSILQKAFTGELTNEKAKDV 405 >UniRef50_A1TSH8 Restriction modification system DNA specificity domain n=1 Tax=Acidovorax citrulli AAC00-1 RepID=A1TSH8_ACIAC Length = 429 Score = 282 bits (722), Expect = 2e-74, Method: Composition-based stats. Identities = 84/432 (19%), Positives = 173/432 (40%), Gaps = 35/432 (8%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTDLVFVPK 64 PE W +A + V L Y + NI++ G+ + + Sbjct: 10 PEVWRLARLKFVAPLRNERMSAGSDHPGY--------LGLENIESWTGRIIEVESKRDDE 61 Query: 65 NLVKES---QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + D++ + H P + V+RP +L+ F Sbjct: 62 PADQSAGLANIFREGDVLFCKLRPYLAK-----ACHAPRDGVGSTELLVMRPSELLEPRF 116 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + + + + + GA + + + +PPL EQ++IA LD A +D Sbjct: 117 LLYSILTPDFVGAVDASTFGAKMPRANWDFIGSLEVKVPPLEEQRLIANYLDRETAGIDG 176 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A E++ +L+ R A++ V L +W P H ++L + + Sbjct: 177 LIAEKERMLALLEEKRAALISRVVTRGLDPNAPLKPSGQEWLGEIPVHWGLQRLKQLAEV 236 Query: 233 TELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 R GL+ SG +P LR+++V+ G++ +D+ +E SE + L GD+L Sbjct: 237 ---RGGLTLGKQYSGELLEYPYLRVANVQDGYLKLDDVLTVEVPASEAASNLLVYGDVLM 293 Query: 291 
TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 G ++ +G G + + + L+ + + R +++ ++ S+ A+ + Sbjct: 294 NE-GGDIDKLGR-GCVWRDEISPCLHQNHVFAVRPHS-VDSDWLALWTSTIQAKRYFESR 350 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 K ++ ISG +IK V LPPV EQ I + + +T+ ++ ++L + Sbjct: 351 AKRSTNLASISGSNIKELPVPLPPVSEQLAIQNFLAVRHSRLETLRGELRDSLRLLIERR 410 Query: 411 QSILAKAFRGEL 422 +++ G++ Sbjct: 411 AALITAGVTGQI 422 Score = 132 bits (333), Expect = 3e-29, Method: Composition-based stats. Identities = 38/207 (18%), Positives = 88/207 (42%), Gaps = 8/207 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + + + G+T K+ + L+ P +R N+Q+G D++ V Sbjct: 219 GEIPVHWGLQRLKQLAEVRGGLTLGKQYSGELLE---YPYLRVANVQDGYLKLDDVLTVE 275 Query: 64 KNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 + S + D+++ G +G+ C +RP + S + Sbjct: 276 VPASEAASNLLVYGDVLMNE-GGDIDKLGRGCVWRDEISPCLHQNHVFAVRPHS-VDSDW 333 Query: 122 IAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +A +T + + S + + + +I ++ + +P+PP++EQ I L ++++ Sbjct: 334 LALWTSTIQAKRYFESRAKRSTNLASISGSNIKELPVPLPPVSEQLAIQNFLAVRHSRLE 393 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNG 207 + + ++L R A++ V G Sbjct: 394 TLRGELRDSLRLLIERRAALITAGVTG 420 Score = 129 bits (324), Expect = 3e-28, Method: Composition-based stats. 
Identities = 39/261 (14%), Positives = 97/261 (37%), Gaps = 35/261 (13%) Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVRA--GHVDQNDIRFLEC 273 +P+ +L F + L ++ +G HP L + ++ + G + + + + + Sbjct: 9 DPEVWRLARLKFVA-------PLRNERMSAGSDHPGYLGLENIESWTGRIIEVESKRDDE 61 Query: 274 SESELN--RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + + ++GD+LF + L ++ + +L+ R ++ P Sbjct: 62 PADQSAGLANIFREGDVLFCKLRPYLAKACHA-------PRDGVGSTELLVMRPSELLEP 114 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ +P A+ + + + I S V +PP++EQ I +++ A Sbjct: 115 RFLLYSILTPDFVGAV-DASTFGAKMPRANWDFIGSLEVKVPPLEEQRLIANYLDRETAG 173 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL-------- 443 D + + LA + +++++ R +P+ + L Sbjct: 174 IDGLIAEKERMLALLEEKRAALISRVVT-------RGLDPNAPLKPSGQEWLGEIPVHWG 226 Query: 444 LEKIKAERAASGGKKASRKKS 464 L+++K GG ++ S Sbjct: 227 LQRLKQLAEVRGGLTLGKQYS 247 >UniRef50_C6IKX2 Type I restriction-modification system n=2 Tax=Bacteroidales RepID=C6IKX2_9BACE Length = 478 Score = 282 bits (722), Expect = 2e-74, Method: Composition-based stats. 
Identities = 90/420 (21%), Positives = 174/420 (41%), Gaps = 36/420 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P+ WV + V T G T + Y +P ++ ++ +G + Sbjct: 70 EVPDNWVWMTLGEVGTWQSGGTPSRSNKTYY--GGNIPWLKTGDLNDGLISDIPESITEE 127 Query: 65 NLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + S KI+P ++IAM + +G L F + C I ++ Sbjct: 128 AVANSSAKINPAGSVLIAMYGATIGKLGI-----LTFPATTNQACCACIEFNAITQLYLF 182 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +F S RN + G NI IP+PPL+EQ+ I +++ A +D + Sbjct: 183 YFLLSQ--RNGFIAKGGGGAQPNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVE 240 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFE-------------- 229 + +K+ + +L A++GKL + N EP + K++N + Sbjct: 241 QGKADLQNTIKQTKSKILDLAIHGKLVPQDPNDEPAIKLLKRINPDFTPCDNGHSRKLPQ 300 Query: 230 -------SILTELRNGLSS-KPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNR 280 + + + G+S K + G +LR +++ G +D D F+ S + N Sbjct: 301 GWYSVTANDVCSIIGGVSYNKADIQDTGIRVLRGGNIQNGKVIDCFDDVFISLSY-QNND 359 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 +++Q GD++ GS +G G + + + L R + L YI + F + Sbjct: 360 NQVQRGDIIVVASTGSQTLIGKTGFADRDIPKTQIGA-FLRIVRPKQKTLSPYIRLIFQT 418 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + ++ + N K S + +++ + LPP++EQ IV+++E+LF+ D I + Sbjct: 419 DAYKDYIRNVAK-GSNINNVKNAHLQNFQICLPPLEEQQRIVQKIEELFSSLDDILTALE 477 Score = 171 bits (433), Expect = 8e-41, Method: Composition-based stats. 
Identities = 51/270 (18%), Positives = 86/270 (31%), Gaps = 18/270 (6%) Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 ++ KA E++ + K R + E P + V+ L Sbjct: 32 LLERIKAEKERLIKEGKIKRSKKSAKTSDTPHYENVPFEVPDNWVWMTLGEVGTWQSGGT 91 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 S G P L+ + G + E + + + G +L Y ++ Sbjct: 92 PSRSNKTYYGGNIPWLKTGDLNDGLISDIPESITEEAVANSSAKINPAGSVLIAMYGATI 151 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G+ Y+ F S RN + Q Sbjct: 152 GKLGI-------LTFPATTNQACCACIEFNAITQLYLFYFLLSQ--RNGFIAK-GGGGAQ 201 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 IS + I + + LPP+ EQ IV +E+ FA D +E+ + + IL A Sbjct: 202 PNISKEIIVNTFIPLPPLSEQQRIVMEIEKWFALIDQVEQGKADLQNTIKQTKSKILDLA 261 Query: 418 FRGELTAQWRAENPDLISGENSAAALLEKI 447 G+L Q + P A LL++I Sbjct: 262 IHGKLVPQDPNDEP--------AIKLLKRI 283 Score = 69.4 bits (169), Expect = 3e-10, Method: Composition-based stats. Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 11/61 (18%) Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK---KASRKK 463 L Q IL A G+L Q + P A+ LLE+IKAE+ + K S+K Sbjct: 4 KALRQKILDLAIHGKLVPQDPNDEP--------ASVLLERIKAEKERLIKEGKIKRSKKS 55 Query: 464 S 464 + Sbjct: 56 A 56 >UniRef50_Q26D97 Putative type I site-speicific deoxyribonuclease specificity subunit n=1 Tax=Flavobacteria bacterium BBFL7 RepID=Q26D97_9BACT Length = 468 Score = 281 bits (721), Expect = 3e-74, Method: Composition-based stats. 
Identities = 115/463 (24%), Positives = 199/463 (42%), Gaps = 45/463 (9%) Query: 5 KLPEGWVIAPVSTVTT---LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 +LP+GWV +S++ L + + ++ + + + LI+ +I G F F Sbjct: 4 ELPKGWVETNISSLVDDTGLFKDGDW--VESKDQDPNGNVRLIQLADIGLGNFRDKSQRF 61 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IF 118 + + + + DI++A +G+S L E ++RP K I Sbjct: 62 LNQETAERLNCNFLEQNDILVARM---PDPIGRSCLFPLKGENVTVVDVAIIRPSKKHIN 118 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 +++H+ S ++ IS L++G+ I + D I P+PP AEQ I K+D L+AQ Sbjct: 119 YKWLSHWINSPVFHKNISELASGSTRKRISRRNLDKIPFPLPPRAEQDRIVAKVDALMAQ 178 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + E+IPQ+LK FRQ VL + +++ E ++++G Sbjct: 179 HAAIQQAMERIPQLLKDFRQQVLNQSFER--------------NIERVALEDCCHKIQDG 224 Query: 239 LSSKPNE-----SGVGHPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFT 291 P P + ++R ++ + + ++ R + GD+L T Sbjct: 225 AHHSPKYVSPIREKNMFPYVTSKNIRNDYMKLDTLTYVNEDFHNTIYPRCSPEFGDVLLT 284 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 + S V L + L + K +P Y++ F S + + Sbjct: 285 KDGASTGNV----TLNEFDEPISLLSSVCLIKTDKKKLIPAYLKYFIQSSIGFSEFTGKM 340 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + K + K IK + LP V EQ EIVRRVE LF A IE++ ++++L Q Sbjct: 341 T-GTAIKRVVLKKIKKATIPLPSVPEQQEIVRRVESLFEKATAIEQRYEQLKLQIDSLPQ 399 Query: 412 SILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 +IL KAF+GEL Q + + SA LLE+IK ++ S Sbjct: 400 AILHKAFKGELVEQ--------LDSDGSAVELLEQIKNLKSNS 434 Score = 134 bits (339), Expect = 6e-30, Method: Composition-based stats. 
Identities = 37/231 (16%), Positives = 93/231 (40%), Gaps = 8/231 (3%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 + I+ + + ++ +++ P + + NI+N L +V ++ Sbjct: 210 ERVALEDCCHKIQDGAHHSPKYVSPIREKNMFPYVTSKNIRNDYMKLDTLTYVNEDFHNT 269 Query: 70 SQ-KISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAHF 125 + SP D+++ S G S + +++ +K + ++ +F Sbjct: 270 IYPRCSPEFGDVLLTKDGAST---GNVTLNEFDEPISLLSSVCLIKTDKKKLIPAYLKYF 326 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 +SS+ ++ + G I + IP+P + EQ+ I ++++L + + + R Sbjct: 327 IQSSIGFSEFTGKMTGTAIKRVVLKKIKKATIPLPSVPEQQEIVRRVESLFEKATAIEQR 386 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 +EQ+ + QA+L A G+L E+ + + +++ + L Sbjct: 387 YEQLKLQIDSLPQAILHKAFKGELVEQLDSDGSAVELLEQIKNLKSNSNLN 437 >UniRef50_C2QHW5 Putative uncharacterized protein n=2 Tax=Bacillus cereus RepID=C2QHW5_BACCE Length = 441 Score = 281 bits (719), Expect = 5e-74, Method: Composition-based stats. Identities = 77/438 (17%), Positives = 164/438 (37%), Gaps = 39/438 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+P+ W + +S++ K+ + Sbjct: 17 IGKVPKHWELKKISSIFEQRNEKVSDKDFEPLS-------------VTKMGILKQLENVA 63 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSGF 121 + +K+ D VI S K G S F+ S C V++P+ + + Sbjct: 64 KTDNNDNRKKVLKNDFVINSRSDRKGSCGVS-----KFDGSVSLICTVIKPKTINTYMDY 118 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 H ++ ++ + G + + K F I IPIPP EQK I L+ + + Sbjct: 119 YHHLFRNKMFSEEFYRWGRGIVDDLWSTKWDEFKRILIPIPPHEEQKSIVSYLNHIYEAI 178 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 + +Q + +++++++++ AV L +W P+H + K+L+F S Sbjct: 179 EELITHKQQQIETIQQYQRSLITEAVTSGLNPHAKMKDSSVEWIGEMPEHWITKRLDFVS 238 Query: 231 ILTE--LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGD 287 ++ GL++ G+ L I +++ +D ++ ++ E LQ GD Sbjct: 239 VVKARLGWKGLTAS-EYQENGYIFLAIPNIKKFQIDFENVNYISEKRYKESPEIMLQVGD 297 Query: 288 
LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L + +L V V L + R D ++ + S + + Sbjct: 298 VLLAKDGSTLGEVNVVRYLPS----PATVNSSIAVIRPKGDLHSVFLYYYLKSNYIQKII 353 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 K G + KDI ++ +PP+ EQ +I + ++ + + + + + + Sbjct: 354 QKK-KDGMGVPHLFQKDINKFIIQVPPLDEQVKIAKYLDGKISEINNLIIETQEQIDILQ 412 Query: 408 NLTQSILAKAFRGELTAQ 425 QS++ + G++ + Sbjct: 413 QYRQSLVYEVVTGKIDVR 430 Score = 99.8 bits (248), Expect = 2e-19, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 81/235 (34%), Gaps = 26/235 (11%) Query: 210 TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR 269 +W P+H KK++ SI + +S K E SV + + Sbjct: 13 DVQWIGKVPKHWELKKIS--SIFEQRNEKVSDKDFE---------PLSVTKMGI-LKQLE 60 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + +++ NR K+ D + + + G CG+ K +L+ +I+ + Sbjct: 61 NVAKTDNNDNRKKVLKNDFVINSRS---DRKGSCGVSKFDGSVSLICT--VIKPKTINTY 115 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTS-GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 + +Y F + + + K ++ +PP +EQ IV + + Sbjct: 116 M-DYYHHLFRNKMFSEEFYRWGRGIVDDLWSTKWDEFKRILIPIPPHEEQKSIVSYLNHI 174 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + + + + + +S++ +A NP ++S + Sbjct: 175 YEAIEELITHKQQQIETIQQYQRSLITEAVT-------SGLNPHAKMKDSSVEWI 222 >UniRef50_B4VK59 Putative uncharacterized protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VK59_9CYAN Length = 430 Score = 280 bits (718), Expect = 6e-74, Method: Composition-based stats. 
Identities = 82/433 (18%), Positives = 165/433 (38%), Gaps = 36/433 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAI--NYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++PE W + LI K + D + R + K TTD + Sbjct: 18 GQIPEHWETLRTKNIFRLITEAAPKNNDEELLSVYSDIGVKPRRELEERGNKASTTDGYW 77 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + D+++ +G S ++ VLR K I S + Sbjct: 78 I----------VKKGDVIVNKLLAWMGAIGIS-----DYDGVTSPAYDVLRAYKPIDSKY 122 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + +S + +K+ S G + F I +P PP QK I E LD ++ Sbjct: 123 YHYLFRSPICLSKLKQHSRGIMEMRLRLYFDEFGRIRLPYPPFEIQKRIVEFLDRKCGEI 182 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 + A +++ ++L+ + ++ AV L +W P H KKL S Sbjct: 183 EDAIAHKKRLIELLEEQKTILINQAVTKGLDPNAPMKDSGIEWIGEIPTHWEVKKLKRIS 242 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLEC-SESELNRHKLQDGDLL 289 + ++ G LR +++ + D ++ S L++ K+ GD++ Sbjct: 243 PCITVGIVITPSKYYVEEGVICLRSLNIKPNKILVKDSVYISERSNKYLSKSKIFAGDIV 302 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 R GV ++ + LI R K+ LP+++ + +S R+ + Sbjct: 303 CVRTGQP----GVSAVVDRRFDGANCID--LIIIRKPKNDLPKFVSLAMNSEVCRSQYLT 356 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + Q+ + + ++ V+ +PP+ EQ +I + ++ + + + +N L Sbjct: 357 GAS-GAIQQHFNIEMAQNLVIAIPPLPEQIKIYNHISKIQKNTMDLMNFIKREIDLMNEL 415 Query: 410 TQSILAKAFRGEL 422 Q ++A+A G++ Sbjct: 416 KQILIAEAVTGKI 428 Score = 124 bits (311), Expect = 9e-27, Method: Composition-based stats. 
Identities = 42/211 (19%), Positives = 84/211 (39%), Gaps = 10/211 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++P W + + ++ I G+ + Y ++ + +R+ NI+ K D V+ Sbjct: 226 IGEIPTHWEVKKLKRISPCITVGIVITPSK---YYVEEGVICLRSLNIKPNKILVKDSVY 282 Query: 62 VPK--NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + N KI DIV + G SA F+ + ++R K Sbjct: 283 ISERSNKYLSKSKIFAGDIVCVRTGQP----GVSAVVDRRFDGANCIDLIIIRKPKNDLP 338 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F++ S + R++ + ++GA + + I IPPL EQ I + + Sbjct: 339 KFVSLAMNSEVCRSQYLTGASGAIQQHFNIEMAQNLVIAIPPLPEQIKIYNHISKIQKNT 398 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++ ++ +Q ++ AV GK+ Sbjct: 399 MDLMNFIKREIDLMNELKQILIAEAVTGKIK 429 Score = 111 bits (279), Expect = 5e-23, Method: Composition-based stats. Identities = 24/252 (9%), Positives = 70/252 (27%), Gaps = 41/252 (16%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPN----ESGVGHPILRISSVRAGHVDQND 267 +W P+H + L ++ S +G +++ Sbjct: 15 EWLGQIPEHWETLRTKNIFRLITEAAPKNNDEELLSVYSDIGVK-------PRRELEERG 67 Query: 268 IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 + ++ GD++ + + +G+ + + R K Sbjct: 68 NKASTTD----GYWIVKKGDVIVNKLLAWMGAIGIS-------DYDGVTSPAYDVLRAYK 116 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQK-GISGKDIKSQVVLLPPVKEQAEIVRRVE 386 +Y F SP + + + + + + + PP + Q IV ++ Sbjct: 117 PIDSKYYHYLFRSPICLSKLKQHSRGIMEMRLRLYFDEFGRIRLPYPPFEIQKRIVEFLD 176 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL--- 443 + + + + ++ +A + +P+ ++ + Sbjct: 177 RKCGEIEDAIAHKKRLIELLEEQKTILINQAVT-------KGLDPNAPMKDSGIEWIGEI 229 Query: 444 --------LEKI 447 L++I Sbjct: 230 PTHWEVKKLKRI 241 >UniRef50_UPI0001973978 type I restriction-modification system, S subunit n=1 Tax=Clostridium sp. M62/1 RepID=UPI0001973978 Length = 435 Score = 280 bits (718), Expect = 6e-74, Method: Composition-based stats. 
Identities = 70/435 (16%), Positives = 167/435 (38%), Gaps = 38/435 (8%) Query: 11 VIAPVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKNLVK 68 + + + I ++ + + D+ +P + A +++NG D ++ + K Sbjct: 16 KKKKLKYIVSTPITDGPHETPELL----DEGIPFLSAESVKNGILDFNYKRGYISLSDHK 71 Query: 69 ---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAH 124 + + DI I S + G E S + ++R + + + FI + Sbjct: 72 LFCKKVRPQKNDIFIVKSGATT---GNCGIVTTDEEFSIWSPLALIRCDNISVLQKFIYY 128 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 ++ + +++ + NI + + +P EQ+ I + LD AQ+DS A Sbjct: 129 YSLCYSFTHQVEQSWSYGTQQNIGMGVLGNLYVTLPSSNEQQSIVDYLDKECAQIDSIAA 188 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTEL 235 E+ +L++++++++ V L + +W P+H + + + Sbjct: 189 DLEKQIALLQQYKKSLITETVTKGLDKSVPMKDSGVEWIGKIPEHWDVEPIKYRVTFHNG 248 Query: 236 RNG--LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLLFTR 292 G SK G P + + ++ +++ ++ + + KL+ GD+L+ Sbjct: 249 DRGENYPSKSELQSEGIPFINAGHLEGDGLNMDNMDYISEEKYRIMGGVKLRPGDILYCL 308 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS--ARNAMMNC 350 VG ++ Q L+ R + L EY+ +S + + + Sbjct: 309 RGS----VGKNAIVDMNQGT---VASSLVAIR-SVRILAEYLYYCLNSHIEEVQRYLWD- 359 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + Q +S ++ +PPV+EQ IV+ + + + D + L+ + Sbjct: 360 --NGTAQPNLSADNLGKYKFCIPPVEEQKAIVKYLNNICSQIDNLVIGKKKQLSTIQQHK 417 Query: 411 QSILAKAFRGELTAQ 425 +S++ + G+ + Sbjct: 418 KSLIYEYVTGKKRVK 432 Score = 152 bits (384), Expect = 3e-35, Method: Composition-based stats. 
Identities = 44/235 (18%), Positives = 90/235 (38%), Gaps = 16/235 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR-F 270 W KKL + + T + +G P G P L SV+ G +D N R + Sbjct: 6 TWEEENGHTFKKKKLKYI-VSTPITDGPHETPELLDEGIPFLSAESVKNGILDFNYKRGY 64 Query: 271 LECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + S+ +L + + Q D+ + + G CG++ + ++ P LIR Sbjct: 65 ISLSDHKLFCKKVRPQKNDIFIVKSGATT---GNCGIVTTDEEFSIWSPLALIRC-DNIS 120 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 L ++I + S + + + Q+ I + + V LP EQ IV +++ Sbjct: 121 VLQKFIYYYSLCYSFTHQVEQSWSYGT-QQNIGMGVLGNLYVTLPSSNEQQSIVDYLDKE 179 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D+I + +A + +S++ + + + + ++ + Sbjct: 180 CAQIDSIAADLEKQIALLQQYKKSLITETVT-------KGLDKSVPMKDSGVEWI 227 Score = 149 bits (376), Expect = 3e-34, Method: Composition-based stats. Identities = 38/208 (18%), Positives = 91/208 (43%), Gaps = 8/208 (3%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+PE W + P+ T G + + + L+ + +P I A +++ + ++ ++ Sbjct: 227 IGKIPEHWDVEPIKYRVTFHNGDRGENYPSKSELQSEGIPFINAGHLEGDGLNMDNMDYI 286 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + K+ P DI+ + VGK+A + + + + +R I + Sbjct: 287 SEEKYRIMGGVKLRPGDILYCLR----GSVGKNAIVDMN-QGTVASSLVAIR-SVRILAE 340 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ + S + + G N+ + IPP+ EQK I + L+ + +Q+D Sbjct: 341 YLYYCLNSHIEEVQRYLWDNGTAQPNLSADNLGKYKFCIPPVEEQKAIVKYLNNICSQID 400 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGK 208 + ++ +++ +++++ V GK Sbjct: 401 NLVIGKKKQLSTIQQHKKSLIYEYVTGK 428 >UniRef50_A6Y5S9 Restriction endonuclease S subunit n=1 Tax=Vibrio cholerae RC385 RepID=A6Y5S9_VIBCH Length = 437 Score = 280 bits (717), Expect = 7e-74, Method: Composition-based stats. 
Identities = 84/439 (19%), Positives = 164/439 (37%), Gaps = 37/439 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRA---NNIQNGKFDTTDLV 60 GK+P W + P + + + ++R ++ N K Sbjct: 16 GKIPSHWKLLPCRAIVDNQVEKNDSGKIEEYLSLMANIGVVRYEEKGDVGNKK------- 68 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FS 119 + + + + ++VI + + G S PF VL P++ I Sbjct: 69 ---PEDLTKCKLVKQGNLVINSMNYAIGSYGMS-----PFNGVCSPVYIVLEPKEQIVER 120 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINN--IKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ + ++ L G + IK +P+PPL EQ+ I LD Sbjct: 121 RYALRLFENKPMQKHLAQLGNGILQHRAAIKWDDIKPQAVPVPPLEEQRAILYFLDRETQ 180 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNF 228 ++DS A ++LK RQA++ V L +W P+H K+ + Sbjct: 181 RIDSLIAEKLTFIKLLKEKRQALISHIVTKGLNPNVEMQDSGIEWIGQVPKHWGISKVRY 240 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR-FLECSESELNRHKLQDGD 287 +NG++ G G P + V ++ + +E + + + + GD Sbjct: 241 LGQC---QNGINIGGEFFGHGTPFVSYGDVYNNTSLPEKVQGLVLSTEKDRDNYSVIAGD 297 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNA 346 +LFTR + ++E +G + K Q ++ LIR R + + + E +F + R Sbjct: 298 VLFTRTSETIEEIGFSAVCKSTIEQ-AVFAGFLIRFRPDEGNLEVGFSEYYFRNEKLRAF 356 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + +S +K VLLPP+ EQ EI ++ I + + + Sbjct: 357 FAKEMNL-VTRASLSQDLLKKMPVLLPPIDEQNEIANYLQAECNKFSEIFAETEKTILLL 415 Query: 407 NNLTQSILAKAFRGELTAQ 425 S+++ A G++ + Sbjct: 416 KERRTSLISAAVTGKIDVR 434 Score = 137 bits (347), Expect = 6e-31, Method: Composition-based stats. 
Identities = 37/212 (17%), Positives = 78/212 (36%), Gaps = 9/212 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-- 60 G++P+ W I+ V + G+ P + ++ N + Sbjct: 226 IGQVPKHWGISKVRYLGQCQNGI-----NIGGEFFGHGTPFVSYGDVYNNTSLPEKVQGL 280 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEK-LIF 118 + +++ + D++ +S + +G SA E F F RP++ + Sbjct: 281 VLSTEKDRDNYSVIAGDVLFTRTSETIEEIGFSAVCKSTIEQAVFAGFLIRFRPDEGNLE 340 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 GF ++ ++ R + ++ + + +PP+ EQ IA L + Sbjct: 341 VGFSEYYFRNEKLRAFFAKEMNLVTRASLSQDLLKKMPVLLPPIDEQNEIANYLQAECNK 400 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 A E+ +LK R +++ AV GK+ Sbjct: 401 FSEIFAETEKTILLLKERRTSLISAAVTGKID 432 Score = 97.9 bits (243), Expect = 8e-19, Method: Composition-based stats. Identities = 31/240 (12%), Positives = 74/240 (30%), Gaps = 34/240 (14%) Query: 212 KWRNFEPQHSVFKKLNFE--SILTELRNGLSSKPNESGVGHPILRI---SSVRAGHVDQN 266 W P H + + + +G + ++R V Sbjct: 13 PWLGKIPSHWKLLPCRAIVDNQVEKNDSGKIEEYLSLMANIGVVRYEEKGDVGNKK---- 68 Query: 267 DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT 326 E ++ G+L+ N ++ G+ N + I Sbjct: 69 -------PEDLTKCKLVKQGNLVINSMNYAIGSYGMS-------PFNGVCSPVYIVLEPK 114 Query: 327 KDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKG--ISGKDIKSQVVLLPPVKEQAEIVR 383 + + Y F + + + + Q I DIK Q V +PP++EQ I+ Sbjct: 115 EQIVERRYALRLFENKPMQKHLAQ-LGNGILQHRAAIKWDDIKPQAVPVPPLEEQRAILY 173 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 +++ D++ + + + Q++++ + NP++ ++ + Sbjct: 174 FLDRETQRIDSLIAEKLTFIKLLKEKRQALISHIVT-------KGLNPNVEMQDSGIEWI 226 >UniRef50_Q31PC5 Type I restriction-modification n=2 Tax=Synechococcus elongatus RepID=Q31PC5_SYNE7 Length = 453 Score = 280 bits (716), Expect = 1e-73, Method: Composition-based stats. 
Identities = 72/439 (16%), Positives = 172/439 (39%), Gaps = 29/439 (6%) Query: 5 KLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLP-LIRANNIQNGKFDTTDLV-F 61 KLP W + + + + GV+ A+++ D+ +P +++ + + G F + Sbjct: 19 KLPSHWNVLQLRRLIPEIESGVSV---NALDHAPDEGIPSVLKTSCVYTGSFRPEERKEI 75 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + +++ + + + ++++ + + +VG + + ++ F ++ F Sbjct: 76 IQEDIDRAACPVKSGRLIVSRMN-TPDLVGAAGLSLVDYDYVFLPDRLWQVRISNVYPNF 134 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++T++ +YR+++ + +G + + N+ +F +P+P EQ IA LD A++ Sbjct: 135 AYYWTQTQIYRDQVKMVCSGTSSSMQNLSQDNFLSFILPVPSDEEQIAIASFLDRETAKI 194 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFES 230 D+ A +++ +L+ RQAV+ AV L +W P H K+ Sbjct: 195 DALIAEQQRLIALLQEKRQAVISHAVTKGLNPDAPLKDSGIEWLGQVPAHWKTGKIKHYF 254 Query: 231 ILTELRNGLSSKP----NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDG 286 + + + +S G P +R + + V ++ + + L Sbjct: 255 KTSSGGTPNTEEQALYYADSDSGIPWVRTTDIENQEVRSAEVSITNQAIQDTACEILPVD 314 Query: 287 DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 +L Y G + L + A+P + + R Sbjct: 315 TVLVALYGGGGTV-----GKNGILTFPAAINQALCALLPSYYAVPMFTFRYIQ--FLRPF 367 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 M + IS + ++ V LPP+ EQ IV+ + ++E + +L+ + Sbjct: 368 WMERAVSARKAGNISQELVRDTVFALPPLDEQILIVKHIHSQLEEITSLENESTKSLSLL 427 Query: 407 NNLTQSILAKAFRGELTAQ 425 ++++ A G++ + Sbjct: 428 QERRSALISAAVTGQIDVR 446 Score = 161 bits (407), Expect = 7e-38, Method: Composition-based stats. 
Identities = 44/236 (18%), Positives = 97/236 (41%), Gaps = 16/236 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES--GVGHP-ILRISSVRAGHVDQNDI 268 +W P H L ++ E+ +G+S + G P +L+ S V G + Sbjct: 15 EWLEKLPSHWNV--LQLRRLIPEIESGVSVNALDHAPDEGIPSVLKTSCVYTGSFRPEER 72 Query: 269 RFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + + + + ++ G L+ +R N + + VG GL + + + PD+L + R++ Sbjct: 73 KEIIQEDIDRAACPVKSGRLIVSRMN-TPDLVGAAGL-SLVDYDYVFLPDRLWQVRISN- 129 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 P + + + R+ + TS + +S + S ++ +P +EQ I +++ Sbjct: 130 VYPNFAYYWTQTQIYRDQVKMVCSGTSSSMQNLSQDNFLSFILPVPSDEEQIAIASFLDR 189 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + +A + Q++++ A + NPD ++ L Sbjct: 190 ETAKIDALIAEQQRLIALLQEKRQAVISHAVT-------KGLNPDAPLKDSGIEWL 238 Score = 124 bits (312), Expect = 7e-27, Method: Composition-based stats. Identities = 42/218 (19%), Positives = 81/218 (37%), Gaps = 9/218 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYL--KDDYLPLIRANNIQNGKFDTTDLVF 61 G++P W + G T E+ Y D +P +R +I+N + + ++ Sbjct: 239 GQVPAHWKTGKIKHYFKTSSGGTPNTEEQALYYADSDSGIPWVRTTDIENQEVRSAEVSI 298 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + +I P D V+ G VGK+ L F + L P F Sbjct: 299 TNQAIQDTACEILPVDTVLVALYGGGGTVGKNGI--LTFPAAINQALCALLPSYYAVPMF 356 Query: 122 IAHFTKSSLYRNKISSLSAGANIN--NIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + + R + + NI +PPL EQ +I + + + L ++ Sbjct: 357 TFRYIQ--FLRPFWMERAV-SARKAGNISQELVRDTVFALPPLDEQILIVKHIHSQLEEI 413 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 S + + +L+ R A++ AV G++ + Sbjct: 414 TSLENESTKSLSLLQERRSALISAAVTGQIDVRGLAEV 451 >UniRef50_Q3J746 Restriction modification system DNA specificity domain n=3 Tax=Proteobacteria RepID=Q3J746_NITOC Length = 425 Score = 280 bits (716), Expect = 1e-73, Method: Composition-based stats. 
Identities = 62/433 (14%), Positives = 156/433 (36%), Gaps = 22/433 (5%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +PEGW + P+ + + KK + + L ++ T+ + F+ + Sbjct: 5 VPEGWEVKPLGKLVDVRSSNIDKKTETSEIP----VRLCNYTDVYYNNRITSAIDFMAAS 60 Query: 66 LVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPE-KLIFSG 120 + + D++I S + + ++ G +L+P+ Sbjct: 61 AKQREIDRFSLEKGDVIITKDSETPDDIAVPSYVSDDLSGVVCGYHLTLLKPDQDESDGE 120 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F++H + ++ L+ G + + + + PPL EQ+ IA L ++ ++ Sbjct: 121 FLSHLFQLPSVQHYFYILANGITRFGLTADAINEAPLLTPPLPEQQKIAAILSSVDDVIE 180 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 T+A+ ++ + Q +L + + + P ++ + + Sbjct: 181 KTRAQIHKLKDLKTAMMQELLTKGIGHTEFKDSPVGRIPVGWSICSAGEVAVAIMVGVVV 240 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLLFTRYNGSLE 298 G P LR ++VR + +++++ +E+ + +L GDLL R Sbjct: 241 KPAQYYVESGVPALRSANVRENGLTMDNLKYFSEDSNEILKKSRLIKGDLLTVRTGYP-- 298 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 G ++ ++ R + ++ ++ +S + ++ + Q+ Sbjct: 299 --GTTAVVTDEFEGCNCID--VVITRPSSRIDSDFFCLWVNSDHGKGQVLK-AQGGLAQQ 353 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + D+K+ V++P + EQ I V + L + + ++++ Sbjct: 354 HFNVSDMKNLTVVVPSLTEQKAIFNAVNSVTKKI----ALTEKRLTLLLDTKKALMQDLL 409 Query: 419 RGELTAQWRAENP 431 G++ E P Sbjct: 410 TGKVRVNVEQEEP 422 Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats. 
Identities = 40/207 (19%), Positives = 81/207 (39%), Gaps = 10/207 (4%) Query: 3 AGKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++P GW I V ++ GV K Q Y + +P +R+ N++ +L + Sbjct: 215 VGRIPVGWSICSAGEVAVAIMVGVVVKPAQ---YYVESGVPALRSANVRENGLTMDNLKY 271 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 ++ + + ++ D++ + G +A FE + RP I S Sbjct: 272 FSEDSNEILKKSRLIKGDLLTVRTGYP----GTTAVVTDEFEGCNCIDVVITRPSSRIDS 327 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F + S + ++ G + + + + +P L EQK I ++++ ++ Sbjct: 328 DFFCLWVNSDHGKGQVLKAQGGLAQQHFNVSDMKNLTVVVPSLTEQKAIFNAVNSVTKKI 387 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVN 206 T+ R + K Q +L G V Sbjct: 388 ALTEKRLTLLLDTKKALMQDLLTGKVR 414 >UniRef50_B0JHV8 Restriction modification system DNA specificity domain n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JHV8_MICAN Length = 395 Score = 279 bits (715), Expect = 2e-73, Method: Composition-based stats. Identities = 89/419 (21%), Positives = 157/419 (37%), Gaps = 29/419 (6%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTDLVFVPKN 65 + W + + + RG + + Q + D + + + + T + Sbjct: 2 KDWPSVALGDIFEIARGGSPRPIQNFLTEEPDGVNWVMIGDASDSSKYITHTKKRILKTG 61 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + + + P D ++ S + C + + + +I + H Sbjct: 62 VKNS-RMVYPGDFLLTNSMSFGHP-----YIMKTSGCIHDGWLVLSNKKGVIDQDYFYHL 115 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 S L + S L++G+ + N+ I + +PPL EQ+ IA LD K Sbjct: 116 LGSDLIYAEFSRLASGSTVKNLNIEIVKGIKVSLPPLEEQRRIAAILDKADGVRRKRKEA 175 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 ++LK + G V P+ K+L T +NG+ Sbjct: 176 IRLTEELLKSTFLEMFGDPVTN----------PKGWEVKRLGEI--CTNFQNGIGKNSEH 223 Query: 246 SGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G G + IS + H L+ + E+ ++ L GDLLF R + E V VC Sbjct: 224 YGHGSKVANISDLYEWHRFIPEKYSLLDVTPKEIEKYSLMRGDLLFVRSSVKREGVAVCS 283 Query: 305 
LLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 + + L+ +IR R D PE++ + +P RN ++ TS IS Sbjct: 284 VYDSDEI--CLFSSFMIRVRPRTDLINPEFLSLMLRTPPMRNRLI-LGSNTSTITNISQP 340 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + V++PP+K Q I ++ + + AL + NL S+L +AFRGEL Sbjct: 341 GLSKIEVVVPPIKTQNLIT----KVTKNIEESVRCHLQALEQSENLFNSLLQRAFRGEL 395 Score = 109 bits (273), Expect = 3e-22, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 76/206 (36%), Gaps = 11/206 (5%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVPKN 65 P+GW + + + T + K + + +++ + +F + Sbjct: 198 PKGWEVKRLGEICTNFQNGIGKNSEHY----GHGSKVANISDLYEWHRFIPEKYSLLDVT 253 Query: 66 LVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSGFIA 123 + E + D++ SS + V + C F +F +RP LI F++ Sbjct: 254 PKEIEKYSLMRGDLLFVRSSVKREGVAVCSVYDSDEICLFSSFMIRVRPRTDLINPEFLS 313 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 ++ RN++ S + I NI I + +PP+ Q +I + ++ + Sbjct: 314 LMLRTPPMRNRLILGSNTSTITNISQPGLSKIEVVVPPIKTQNLI----TKVTKNIEESV 369 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL 209 Q + + ++L A G+L Sbjct: 370 RCHLQALEQSENLFNSLLQRAFRGEL 395 >UniRef50_Q4HNY2 Type I restriction-modification system specificity subunit, putative n=1 Tax=Campylobacter upsaliensis RM3195 RepID=Q4HNY2_CAMUP Length = 427 Score = 278 bits (712), Expect = 3e-73, Method: Composition-based stats. 
Identities = 80/435 (18%), Positives = 164/435 (37%), Gaps = 35/435 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+P W + + + + KE++ + + + + N I T Sbjct: 6 GKIPAHWEVRRLKYLFYI------SKEESRDEFPN--VLSLTQNGIIERDITTNKGQL-- 55 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + DI++ S V KS FE +RP + + + Sbjct: 56 AQNYIGYNIVKRGDIILNPMDLSSGYVAKST-----FEGVISQAYIKIRPLETLNLSYYE 110 Query: 124 HFTKSSLYRNKISSLSAGAN---INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +F ++ + + L G + + F I IP+PPL EQK IAE LD ++ Sbjct: 111 NFFQNLYHYKILWHLGKGISYDHRWTLGNDVFLNIKIPLPPLQEQKEIAEFLDKKCEKIQ 170 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESI 231 + + +++ +L+ +QA++ A+ L +W P+H KKL + Sbjct: 171 NYINKKQKLITLLQEKKQALINEAITKGLNPNIEFKNSGIEWLGEIPKHWEIKKLKYIGE 230 Query: 232 LTELRNGLSSK---PNESGVGHPILRISSVRAG-HVDQNDIRFLECSESELNRHKLQDGD 287 + G + K P + ++V ++ N + ++ E L D Sbjct: 231 IFGGVIGKTIKDFSKEYKPNFKPYITFTNVCNNAIINPNSMEYVFIDFDEKQNKVL-KND 289 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +LF + + + E VG + L + R+ ++A P Y+ SS S + Sbjct: 290 ILFLQSSETFEDVGKSAIY--LNDDEVYLNTFCKGFRIEREAYPMYLNYLLSSLSYKRYF 347 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 M+ + + + ++LPP++EQ EI +++ ++ ++ + V Sbjct: 348 MSVCS-GFTRINLRQEHFLDIPLILPPLQEQKEIAEFLDEKCKKINSAIEKTKKQIEFVR 406 Query: 408 NLTQSILAKAFRGEL 422 +++ +A G + Sbjct: 407 EYKNTLINEAVCGRI 421 Score = 115 bits (290), Expect = 2e-24, Method: Composition-based stats. 
Identities = 31/232 (13%), Positives = 81/232 (34%), Gaps = 27/232 (11%) Query: 215 NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN-DIRFLEC 273 P H ++L + + SK +L ++ + G ++++ + Sbjct: 6 GKIPAHWEVRRLKYLFYI--------SKEESRDEFPNVLSLT--QNGIIERDITTNKGQL 55 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEY 333 +++ + + ++ GD++ + S +V + I+ R + Y Sbjct: 56 AQNYIGYNIVKRGDIILNPMDLSSGYVAKST-------FEGVISQAYIKIRPLETLNLSY 108 Query: 334 IEIFFSSPSARNAMMNCVKTTS--GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 E FF + + + K S + + + + LPP++EQ EI +++ Sbjct: 109 YENFFQNLYHYKILWHLGKGISYDHRWTLGNDVFLNIKIPLPPLQEQKEIAEFLDKKCEK 168 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + + + Q+++ +A + NP++ + L Sbjct: 169 IQNYINKKQKLITLLQEKKQALINEAIT-------KGLNPNIEFKNSGIEWL 213 >UniRef50_B5ECU4 Restriction modification system DNA specificity domain n=1 Tax=Geobacter bemidjiensis Bem RepID=B5ECU4_GEOBB Length = 395 Score = 278 bits (711), Expect = 4e-73, Method: Composition-based stats. 
Identities = 72/416 (17%), Positives = 152/416 (36%), Gaps = 26/416 (6%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLVFVPKNL 66 GWV + + + RG + + + D + I+ + + + TT+ P+ Sbjct: 4 GWVTKKLGEICDIERGGSPRPIDSFLTDAPDGINWIKIGDTKTISKYIFTTEQKIRPEG- 62 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFT 126 K S+ + D +++ S G+ C + + E + ++ H Sbjct: 63 AKRSRMVFEGDFILSNS----MSFGRPYIMKTTG-CIHDGWLVLREKEPNVNQDYLYHVL 117 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 S L + L+AG+ + N+ + +PIP ++EQ+ I LD ++ + KA Sbjct: 118 SSDLVYRQFDRLAAGSTVRNLNIGLVKGVEVPIPSISEQQRIVGILDEAFDRIATAKANA 177 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 E+ Q + ++ L + K + + + + + K Sbjct: 178 EKNLQNARALFESHLQSTFTQR---------CAGWTVKTIGDLAEHSLGK--MLDKAKNK 226 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G P LR +VR + +D+ + +E+ ++ GD+L G + Sbjct: 227 GELQPYLRNINVRWFTFNLSDLLEMPFRTTEVGKYTAVKGDVLICEGGYP----GRAAIW 282 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + + + L R R + ++ + + + +G + +G+ + Sbjct: 283 --TEDYPVYFQKALHRVRFHEPEHNKWFLYYLYAQDKSGELKKHFS-GTGIQHFTGEALS 339 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + L P+ E V R E L +E L + L +S+L +AF G+L Sbjct: 340 RFKLPLAPLPELRRNVARFEVLLEETQRLESICQRKLTALEELKKSLLDRAFTGQL 395 >UniRef50_Q8GN10 Putative type I specificity subunit HsdS n=3 Tax=Campylobacter jejuni RepID=Q8GN10_CAMJE Length = 420 Score = 277 bits (710), Expect = 5e-73, Method: Composition-based stats. 
Identities = 93/427 (21%), Positives = 163/427 (38%), Gaps = 20/427 (4%) Query: 6 LPEGWVIAPVSTVTTL----IRGVTYKKEQAINYLKDDYLPLIRANN-IQNGKFDTTDLV 60 LP+GW + + + + I+ + ++ + + + N I N + Sbjct: 4 LPQGWKMETLGEILSSDKYSIKRGPFGSTLKKSFFVEKGIRIFEQYNPINNDPHWKRYFI 63 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEK-LIF 118 K E+ K + D++I+ S +GK E +R I Sbjct: 64 SHEKFQELEAFKATEGDLLISCS----GTLGKIVELPKDTEMGIINQSLLKIRLNNIKIL 119 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNI-KPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + + ++ S + + KI + G+ I NI I IP+PPL +Q+ I LD Sbjct: 120 NSYFIYYFNSPIMQEKILESTLGSAIKNIASVKILKQIEIPLPPLKKQERIVGILDESFV 179 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELR 236 ++D + EQ L Q+ L A N N++ PQ +K L + Sbjct: 180 KIDESIKILEQNLLNLDELMQSALQKAFNPLKDNAKENYKLPQGWEWKSLGEIGNTSSGG 239 Query: 237 NGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 L +K G L+ + G++D + E + + Q G LL Y Sbjct: 240 TPLRNKKEYWENGSIKWLKSGELNDGYIDFIEENITEEAIENSSAKIFQKGTLLIAMYGA 299 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + G G+L N L + L +++ F R+ ++ + Sbjct: 300 TA---GRLGILNLDSATNQAVCAFLHKDNKNIKFLEKFLFYFL--FFIRDKIIKDSFGGA 354 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q IS IK+ + LPP+KEQ +I + ++ +F +++ L L QS+L Sbjct: 355 -QPNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTKALKELYTKELKDYEELKQSLLN 413 Query: 416 KAFRGEL 422 KAF+GEL Sbjct: 414 KAFKGEL 420 Score = 160 bits (405), Expect = 1e-37, Method: Composition-based stats. 
Identities = 45/209 (21%), Positives = 84/209 (40%), Gaps = 5/209 (2%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 KLP+GW + + G T + + Y ++ + +++ + +G D + Sbjct: 216 ENYKLPQGWEWKSLGEIGNTSSGGTPLRNKK-EYWENGSIKWLKSGELNDGYIDFIEENI 274 Query: 62 VPKNLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + S KI ++IAM + +G + AF Sbjct: 275 TEEAIENSSAKIFQKGTLLIAMYGATAGRLGILNLDSATNQAVC-AFLHKDNKNIKFLEK 333 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ +F R+KI S G NI + IP+PPL EQ+ IA+ LD + + Sbjct: 334 FLFYFLF--FIRDKIIKDSFGGAQPNISQTYIKNLQIPLPPLKEQEQIAKHLDFVFEKTK 391 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K + + + + +Q++L A G+L Sbjct: 392 ALKELYTKELKDYEELKQSLLNKAFKGEL 420 >UniRef50_A1WW67 Restriction modification system DNA specificity domain n=1 Tax=Halorhodospira halophila SL1 RepID=A1WW67_HALHL Length = 429 Score = 277 bits (710), Expect = 5e-73, Method: Composition-based stats. Identities = 70/435 (16%), Positives = 157/435 (36%), Gaps = 47/435 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W ++ + V L G + + + N I+ F Sbjct: 18 GEVPEHWSVSALKRVARLESGDAISSDHISE---EGEYAVYGGNGIRG---------FSS 65 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + ++ G + F V+ P + I ++ Sbjct: 66 GYTHDGFYPL---------IGRQGALCGNVNYAKGRFWA--SEHAVVVWPGRQIDGFWLG 114 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 +S ++ + A + + + + +P+PP EQ+ IAE LD A++D+ Sbjct: 115 ELLRSMN----LNQYATSAAQPGLSVETIENLYVPVPPDEEQQKIAELLDHETARIDALI 170 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTE 234 +++ ++LK RQAV+ AV L +W P H K + + E Sbjct: 171 EEQQRLIELLKEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWDVVKFVRCAKIAE 230 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 + +P S ++ + + +G E +E ++ GD+++++ Sbjct: 231 GQVDPKQEPYRS---MMLVAPNHIESGTGRLMARETAEEQGAESGKYYCYAGDVIYSKIR 287 Query: 295 
GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 SL + +++ L + R +Y+ S S + + Sbjct: 288 PSLR---KACV----AYEDCLCSADMYPLRAQSGVYGDYLRWTILSESF-STLAFLESER 339 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 ++ + I+ + +PP +EQ +I R +E+ A D + ++ + + + +++ Sbjct: 340 VAMPKVNRESIEEIRIPMPPPEEQLQISRTLEKETARIDALMEEAESGIQLLQERRSALI 399 Query: 415 AKAFRGELTAQWRAE 429 + A G++ + A Sbjct: 400 SAAVTGKIDVRDWAP 414 Score = 124 bits (313), Expect = 6e-27, Method: Composition-based stats. Identities = 31/232 (13%), Positives = 70/232 (30%), Gaps = 36/232 (15%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H L + L +SS + + + +R F Sbjct: 15 EWLGEVPEHWSVSALKRVARLESGDA-ISSDHISEEGEYAVYGGNGIRGFSSGYTHDGFY 73 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 L R V + + + Sbjct: 74 P----------------LIGRQGALCGNV-------NYAKGRFWASEHAVVVWPGRQIDG 110 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ S + T++ Q G+S + I++ V +PP +EQ +I ++ A Sbjct: 111 FWLGELLRSMNLNQY-----ATSAAQPGLSVETIENLYVPVPPDEEQQKIAELLDHETAR 165 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q++++ A + +PD+ ++ L Sbjct: 166 IDALIEEQQRLIELLKEKRQAVISHAVT-------KGLDPDVPMKDSGVEWL 210 >UniRef50_A1VBQ9 Restriction modification system DNA specificity domain n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VBQ9_DESVV Length = 595 Score = 277 bits (710), Expect = 6e-73, Method: Composition-based stats. 
Identities = 106/517 (20%), Positives = 197/517 (38%), Gaps = 91/517 (17%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W VS + + G +K + + +D+ +PLIR +I + + T+ + Sbjct: 23 DHWKRVYVSEIAMVQNGFAFKSK---FFSRDEGIPLIRIRDILSAE---TEHKYF--GQF 74 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + + D++I M + A+ C ++ + F + Sbjct: 75 DKEYLVHNGDLLIGMDGDFVA-----AYWPGKEGLLNQRVCRIVIESENYDKKFFFLALQ 129 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 Y + I ++ + ++ + + I +P+PPL EQ I K++ L +++D+ Sbjct: 130 --PYLDAIHEKTSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKIEELFSELDAGVENLT 187 Query: 188 QIPQ--------ILKRFRQAVLGGAVNG-------------------------KLTEKWR 214 + + +LK + L A K E+W Sbjct: 188 KAKEQLGVYRQSLLKHAFEGKLTEAWRKRNADKLESGEALLKRVKKEREEYFKKQLEQWE 247 Query: 215 NFEPQHS----------------------------------VFKKLNFESILTELRNGLS 240 Q + +++ G S Sbjct: 248 KDVAQWEADGKPGKKPTQPKKPKKLAPISEEELKELPELPEGWVWARLGNLIDPPAYGTS 307 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 K + + G +LRI ++ G +D +D+++ S E +++L+ GDLL R NGS+ V Sbjct: 308 RKSDYNIDGTGVLRIPNIVDGKIDSSDLKYTAFSPGEEEQYRLKAGDLLTIRSNGSVSLV 367 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G C L+ + +Y LIR R + +++ SS RN + + K+TSG I Sbjct: 368 GQCALI-EDDDTRYVYAGYLIRLRTIGLLVSKFLLYCLSSLRLRNQIESKAKSTSGVNNI 426 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + +++ S +V L EQ E+ + + + A + L + L QSIL KAF G Sbjct: 427 NSQELSSLIVPLCSQLEQNEVSKLLADSLSTAGEQTSMIEIQLEHIRILKQSILDKAFSG 486 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK 457 L +Q + P A+ LLE+IK ER ++ Sbjct: 487 TLISQDPNDEP--------ASKLLERIKQERKSAPNP 515 Score = 135 bits (340), Expect = 4e-30, Method: Composition-based stats. 
Identities = 52/239 (21%), Positives = 104/239 (43%), Gaps = 24/239 (10%) Query: 223 FKKLNFESILTELRNGLSSKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSESELNR 280 +K++ + S + ++NG + K G P++RI + + + ++ + E Sbjct: 25 WKRV-YVSEIAMVQNGFAFKSKFFSRDEGIPLIRIRDILSAE---TEHKYFGQFDKE--- 77 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFS 339 + + +GDLL + + L ++ R + +++ ++ + Sbjct: 78 YLVHNGDLLIGMDGDFV-----AAYWPGKE---GLLNQRVCRIVIESENYDKKFFFLALQ 129 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 +A+ ++ K +S K + + LPP+ EQ IV ++E+LF+ D + + Sbjct: 130 --PYLDAIHEK-TSSVTVKHLSSKTVNEIPLPLPPLNEQNRIVAKIEELFSELDAGVENL 186 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKK 458 A ++ QS+L AF G+LT WR N D S ALL+++K ER K+ Sbjct: 187 TKAKEQLGVYRQSLLKHAFEGKLTEAWRKRNAD---KLESGEALLKRVKKEREEYFKKQ 242 >UniRef50_A5G3B9 Restriction modification system DNA specificity domain n=2 Tax=Proteobacteria RepID=A5G3B9_GEOUR Length = 393 Score = 276 bits (707), Expect = 1e-72, Method: Composition-based stats. 
Identities = 73/411 (17%), Positives = 149/411 (36%), Gaps = 28/411 (6%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 P+ + T+ G T + + +P ++++ T P+ L + Sbjct: 6 VPLGGLVTISGGGTPSRNN--DAYWGGSIPWATVKDLKDTMLSGTQETITPEGLRDSASN 63 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 + P VI ++ +GK A + + + ++ +F ++ Sbjct: 64 LIPAGSVIV---ATRMGLGKVAINTMD--VTINQDLKAFSCGADLEPRYLLYFLLANA-- 116 Query: 133 NKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQI 192 + + S+ GA + I +++P+PPL EQK IA LD DS + + ++ ++ Sbjct: 117 SHLDSMGKGATVKGITLDVLKDLSVPLPPLPEQKRIAAILDKA----DSIRRKRQEAVRL 172 Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS-SKPNESGVGHP 251 + ++V + W L S + G G Sbjct: 173 TEELLRSVFLDMFGDPESNNWPMMTIAGVA---LPGVSAIRTGPFGSQLLHSEFVDEGVA 229 Query: 252 ILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 +L I + A N+ R++ ++ EL+R+ ++ GD++ T G C ++ Sbjct: 230 VLGIDNAVANEFRWNERRYISEAKYRELSRYTVRPGDVIITIMGTC----GRCAVVPDDI 285 Query: 311 HQNLLYPDKLIRARLTKDALPEYIE-IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 + LP ++ F AR + K + G++ IK Sbjct: 286 PVAINTKHLCCITLDQTKCLPVFVHAYFLQHCIARRYLEKTAK-GAIMDGLNMGIIKDMP 344 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + +PP+K Q + + A + + + LA + L S+L +AF G Sbjct: 345 IPIPPLKLQEKFACSI----AAIEKLRHTTRSTLAEQDTLFHSLLQRAFNG 391 Score = 115 bits (289), Expect = 4e-24, Method: Composition-based stats. 
Identities = 31/215 (14%), Positives = 74/215 (34%), Gaps = 25/215 (11%) Query: 8 EGWVIAPVSTVT----TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 W + ++ V + IR + + + D+ + ++ +N +F + ++ Sbjct: 191 NNWPMMTIAGVALPGVSAIRTGPFGSQLLHSEFVDEGVAVLGIDNAVANEFRWNERRYIS 250 Query: 64 KNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGA-FCGVLRPEK-LIFS 119 + + + P D++I + G+ A + + ++ Sbjct: 251 EAKYRELSRYTVRPGDVIITIM----GTCGRCAVVPDDIPVAINTKHLCCITLDQTKCLP 306 Query: 120 GFIA-HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKI----IAEKLDT 174 F+ +F + + R + + GA ++ + + IPIPPL Q+ IA Sbjct: 307 VFVHAYFLQHCIARRYLEKTAKGAIMDGLNMGIIKDMPIPIPPLKLQEKFACSIAA---- 362 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++ + ++L A NG L Sbjct: 363 ----IEKLRHTTRSTLAEQDTLFHSLLQRAFNGAL 393 >UniRef50_Q73D72 Type I restriction-modification enzyme, S subunit, putative n=1 Tax=Bacillus cereus ATCC 10987 RepID=Q73D72_BACC1 Length = 476 Score = 276 bits (706), Expect = 1e-72, Method: Composition-based stats. Identities = 99/471 (21%), Positives = 184/471 (39%), Gaps = 41/471 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTDLVF 61 ++PE W+ + +I G T K + Y KD + I ++ Q+ Sbjct: 20 RVPENWIWTWTGAIAEVISGGTPKS-KVEEYYKDGTISWITPADLSGYQDMYISKGKRNI 78 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L K S K+ P + V+ S V +A + P + + Sbjct: 79 TELGLNKSSAKMLPINTVLLSSRAPIGYVAIAA-----KDLCTNQGFKSFAPSNAYYPKY 133 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K S Y + S+++G+ + I IP+PP+ EQK ++EK++ LL +V+ Sbjct: 134 LYWYLKFSKY--YMESMASGSTFKELSSNKSKEIPIPLPPINEQKRVSEKVERLLNKVEE 191 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE---------------------PQH 220 K E+ + + R A+L A +G LT KWR P Sbjct: 192 AKTLIEEAKETFELRRAAILDKAFSGDLTGKWRKENSFQQNEECISDNELRDSEVFYPIP 251 Query: 221 SVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESE-- 277 +K + + T + G ++R+ ++ + + + ++ E Sbjct: 252 KTWKWTKLKDVATFKNGYAFKSKDFVEQGIQLIRMGNLYKNELRLDRNPVYIPLDFDEKI 311 Query: 278 
LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIF 337 + ++ ++ GD+L + + + + +NLL +++ + + EYI + Sbjct: 312 IEKYTVEKGDILLSLTGTKYKRDYGYAVRVDGRDKNLLLNQRILSLKPH--MMDEYIYYY 369 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 S RNA + Q + K ++S ++ +PP E EI +++ +L + Sbjct: 370 LQSSVFRNAFFSFETGGVNQGNVGSKAVESILIPIPPADEAKEIEKKLARLLNN-EKEAL 428 Query: 398 QVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 V ++ L QS L+KAFRGEL E + E L EKIK Sbjct: 429 VVLAIEEKLEVLKQSALSKAFRGELGTNDPTEE---NTIELLKEVLKEKIK 476 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 79/226 (34%), Gaps = 15/226 (6%) Query: 209 LTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSV---RAGHVD 264 + R P++ ++ + + S G + + + + ++ Sbjct: 13 PEAEQRFRVPENWIWTWTGAIAEVISGGTPKSKVEEYYKDGTISWITPADLSGYQDMYIS 72 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + E ++ + L +L + S +G + K +L Sbjct: 73 KGKRNITELGLNKSSAKMLPINTVLLS----SRAPIGYVAIAAK----DLCTNQGFKSFA 124 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 + P+Y+ + M + + S K +S K + LPP+ EQ + + Sbjct: 125 PSNAYYPKYLYWYLK---FSKYYMESMASGSTFKELSSNKSKEIPIPLPPINEQKRVSEK 181 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 VE+L + + + A +IL KAF G+LT +WR EN Sbjct: 182 VERLLNKVEEAKTLIEEAKETFELRRAAILDKAFSGDLTGKWRKEN 227 >UniRef50_B1LRG3 Type I restriction modification DNA specificity domain protein n=1 Tax=Escherichia coli SMS-3-5 RepID=B1LRG3_ECOSM Length = 428 Score = 275 bits (705), Expect = 2e-72, Method: Composition-based stats. 
Identities = 81/426 (19%), Positives = 162/426 (38%), Gaps = 26/426 (6%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W P + T ++ +K+ + + N D DL + + Sbjct: 16 WNSVPAKRLFT-------SSKEINQGMKESNRLALTMKGVINRSLD--DLQGLQSSDYSV 66 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAHFTKS 128 Q +D+V + + H + I+ F + + Sbjct: 67 YQIFEKDDLVFKLIDLENIKTSRVGIVH--ERGIMSPAYIRVSASSNSIYPRFYYWYFFA 124 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 N + L G N+ I +P+ ++ QK ++ LD ++DS + Sbjct: 125 LYLTNIYNKLGGGV-RQNLTAGDLLEIPVPLIDISLQKQVSTFLDRETQRIDSLIEEKQT 183 Query: 189 IPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGL 239 ++LK RQA++ V L +W P+H KK+ + G Sbjct: 184 FIKLLKEKRQALISHVVTKGLYPNVEMQDSGIEWIGQVPKHWEVKKIKHI--CSNFMYGT 241 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 S N+S VG+P+LRI ++++ +VD D+++ S+ + + L GD+L R NG+ Sbjct: 242 SQDCNQSDVGYPVLRIPNIKSTNVDFEDLKYANISDVDALTYLLSRGDILVIRTNGNPNL 301 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 VG L + L+ LI+ + ++ +S S R A+ +T+ G Sbjct: 302 VGQSALFDS--NGQYLFASYLIKLTPKQGVDTSFLVEAMNSLSVRQALTFQSRTSVGNYN 359 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +S + + + +PP+ EQ I + D + ++ + ++ + S++ A Sbjct: 360 LSIPSLANTSIAIPPIDEQKTITNYLSAATINIDLLIQETDKSIDLLKEHRTSLINAAVT 419 Query: 420 GELTAQ 425 G++ + Sbjct: 420 GKIDVR 425 Score = 152 bits (386), Expect = 2e-35, Method: Composition-based stats. 
Identities = 49/210 (23%), Positives = 92/210 (43%), Gaps = 6/210 (2%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+ W + + + Y Q N D P++R NI++ D DL + Sbjct: 218 IGQVPKHWEVKKIKHIC---SNFMYGTSQDCN-QSDVGYPVLRIPNIKSTNVDFEDLKYA 273 Query: 63 PKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + V + +S DI++ ++G+ ++VG+SA + F ++ L P++ + + F Sbjct: 274 NISDVDALTYLLSRGDILVIRTNGNPNLVGQSALFDSNGQYLFASYLIKLTPKQGVDTSF 333 Query: 122 IAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + S R ++ S N+ S +I IPP+ EQK I L +D Sbjct: 334 LVEAMNSLSVRQALTFQSRTSVGNYNLSIPSLANTSIAIPPIDEQKTITNYLSAATINID 393 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++ +LK R +++ AV GK+ Sbjct: 394 LLIQETDKSIDLLKEHRTSLINAAVTGKID 423 Score = 94.8 bits (235), Expect = 7e-18, Method: Composition-based stats. Identities = 23/227 (10%), Positives = 79/227 (34%), Gaps = 21/227 (9%) Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES 276 + + + + E+ G+ + L + V +D D++ L+ S+ Sbjct: 13 DSKWNSVPAKRLFTSSKEINQGMKESNRLA------LTMKGVINRSLD--DLQGLQSSDY 64 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 + + DL+F + G++ + + ++ P + + + P + Sbjct: 65 SV-YQIFEKDDLVFKLIDLENIKTSRVGIVHE---RGIMSPAYIRVSASSNSIYPRFYYW 120 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +F + N ++ ++ D+ V L + Q ++ +++ D++ Sbjct: 121 YFFALYLTNIYNKL--GGGVRQNLTAGDLLEIPVPLIDISLQKQVSTFLDRETQRIDSLI 178 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ + + Q++++ + P++ ++ + Sbjct: 179 EEKQTFIKLLKEKRQALISHVVT-------KGLYPNVEMQDSGIEWI 218 >UniRef50_D0KMA1 Restriction modification system DNA specificity domain protein n=1 Tax=Pectobacterium wasabiae WPP163 RepID=D0KMA1_PECWW Length = 493 Score = 275 bits (704), Expect = 3e-72, Method: Composition-based stats. 
Identities = 190/526 (36%), Positives = 249/526 (47%), Gaps = 96/526 (18%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS GKLPEGW + V L G K A P+ +N I Sbjct: 2 MSVGKLPEGWKNIHLGDVIELKYG----KSLAAQVRDGIGYPVFGSNGI----------- 46 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + K S + + +I GS VV KS P + + + + I Sbjct: 47 -----VGKHSIPLIKQSGLIVGRKGSYGVVQKSVEPFFPIDTT---YYIDELFNQPIN-- 96 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F ++ ++ L+ I + ++I +PPL EQKIIAEKLDTLLAQVD Sbjct: 97 FWFYYLSFLP----LTKLNRSTTIPGLNRDDAYNLSINLPPLVEQKIIAEKLDTLLAQVD 152 Query: 181 STKARFEQIPQILKRFRQAVLGGA-------------------------VNGKLTEKW-- 213 STKAR EQIP+ILKRFRQAVL A V E W Sbjct: 153 STKARLEQIPKILKRFRQAVLASALRGELTKKWRIDNKTGQDISSFKASVKKYRFESWVK 212 Query: 214 --------RNFEPQHSVFKKLNFESILT------ELRNGLSSKP---------------- 243 + +P++ +KK E+I++ ++ +G +P Sbjct: 213 EQEQKFINKGKQPRNDNWKKKYQEAIISQDISDKDIPDGWLFEPLDGLVYISARIGWKGL 272 Query: 244 ---NESGVGHPILRISSVRAGH-VDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLE 298 + G L + S+ G + + E KLQ+ D+L + Sbjct: 273 KASEYTVKGPLFLSVHSLNYGKEANLEQAYHISEHRYDESPEIKLQNNDILLCKDGAG-- 330 Query: 299 FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 +G ++K L + L+ R +PEY+ F S P +N + + S Sbjct: 331 -IGKLSIVKNLNEPATI-NSSLLLIRGGDFFVPEYLFYFLSGPEMQNLVKERMT-GSAVP 387 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + +D+K V+ +PP+ EQ EIVRRVEQLFAYADTIEKQVN AL+RVNNLTQSILAKAF Sbjct: 388 HLFQRDVKEFVLEVPPLNEQHEIVRRVEQLFAYADTIEKQVNTALSRVNNLTQSILAKAF 447 Query: 419 RGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKKS 464 RGELTAQWR ENPDLISGENSAA LLEKIKAERAAS GKKA RKK+ Sbjct: 448 RGELTAQWREENPDLISGENSAAVLLEKIKAERAASVGKKAPRKKA 493 >UniRef50_C0EPF1 Putative uncharacterized protein n=1 Tax=Neisseria flavescens NRL30031/H210 RepID=C0EPF1_NEIFL Length = 430 Score = 275 bits (703), Expect = 3e-72, Method: Composition-based stats. 
Identities = 81/433 (18%), Positives = 161/433 (37%), Gaps = 30/433 (6%) Query: 4 GKLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+P W + V R KE+ + L + +I+ LV + Sbjct: 16 GKIPSQWELTIGMNVFRENKRDNKGMKEKTVLSLSYGQI-IIKPEE---------KLVGL 65 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + Q + P DI+I + + + L+ + F+ Sbjct: 66 VPESFETYQIVEPNDIIIRCTDLQNDQTSLRTGLAKD-KGIITSAYLNLKVINNHSAKFL 124 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 ++ + + +G N+ F + I PL+EQ+ IA+ LD A++D Sbjct: 125 HYYLHTLDITKVLYKFGSGL-RQNLSFLDFKRLPIIDIPLSEQQKIAQFLDDKTAKIDQA 183 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE-SIL 232 E+ +LK +Q ++ AV L +W P+H KK+ S + Sbjct: 184 VDLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWSVKKIKHVTSKI 243 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLEC-SESELNRHKLQDGDLLFT 291 L N G P+LR ++ +D ND+ + + + + K++ GD+L Sbjct: 244 GSGITPLGGGSNYIDGGIPLLRSQNIHFDRIDLNDVARISEFTHNSMKNSKVRKGDVLLN 303 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 GSL G C + + N+ + R K ++ + +S + + Sbjct: 304 ITGGSL---GRCFYVDSNEEMNV--NQHVCIIRPNKKINTIFLNMLLASEVGQKQIW-FF 357 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + G++G++ + IK+ + LP +KEQ +I +++ A D + ++ Sbjct: 358 QQGGGREGLNFQAIKNFYLPLPDLKEQQKIAIYLDKQVAKIDQAIALKTAHIEKLKEYKS 417 Query: 412 SILAKAFRGELTA 424 ++ G++ Sbjct: 418 VLINDVVTGKVRV 430 Score = 177 bits (450), Expect = 7e-43, Method: Composition-based stats. 
Identities = 47/208 (22%), Positives = 92/208 (44%), Gaps = 6/208 (2%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + + VT+ I G + D +PL+R+ NI + D D+ + Sbjct: 224 IGQVPEHWSVKKIKHVTSKI-GSGITPLGGGSNYIDGGIPLLRSQNIHFDRIDLNDVARI 282 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + ++ K+ D+++ ++ GS +G+ + E + ++RP K I + Sbjct: 283 SEFTHNSMKNSKVRKGDVLLNITGGS---LGRCFYVDSNEEMNVNQHVCIIRPNKKINTI 339 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ S + + +I G + + +P+P L EQ+ IA LD +A++D Sbjct: 340 FLNMLLASEVGQKQIWFFQQGGGREGLNFQAIKNFYLPLPDLKEQQKIAIYLDKQVAKID 399 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGK 208 A + LK ++ ++ V GK Sbjct: 400 QAIALKTAHIEKLKEYKSVLINDVVTGK 427 Score = 107 bits (268), Expect = 9e-22, Method: Composition-based stats. Identities = 23/233 (9%), Positives = 67/233 (28%), Gaps = 22/233 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P + R L + + + + + + Sbjct: 13 EWLGKIPSQWELTIG-----MNVFRENKRDNKGMKEKTVLSLSYGQI----IIKPEEKLV 63 Query: 272 ECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + ++ D++ + + L L + + ++ + Sbjct: 64 GLVPESFETYQIVEPNDIIIRCTDLQNDQ---TSLRTGLAKDKGIITSAYLNLKVINNHS 120 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 +++ + + + ++ +S D K ++ P+ EQ +I + ++ A Sbjct: 121 AKFLHYYLHTLDITKVLYKFGSG--LRQNLSFLDFKRLPIIDIPLSEQQKIAQFLDDKTA 178 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D +A + Q ++ A R NPD+ ++ + Sbjct: 179 KIDQAVDLAEKQIALLKEHKQILIQNAVT-------RGLNPDVPLKDSGVEWI 224 >UniRef50_B2A6M8 Restriction modification system DNA specificity domain n=1 Tax=Natranaerobius thermophilus JW/NM-WN-LF RepID=B2A6M8_NATTJ Length = 490 Score = 274 bits (702), Expect = 4e-72, Method: Composition-based stats. 
Identities = 118/486 (24%), Positives = 209/486 (43%), Gaps = 54/486 (11%) Query: 2 SAGKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 +LP W + + + G T K+ + +P+ R +IQN + + Sbjct: 23 EPYELPNNWAWVALDILAEEIKNGTTIKQSKTKP-----GIPVTRIESIQNNEIQLDRVR 77 Query: 61 FV-PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKL-I 117 ++ + +K + DIV++ + S VGK+A + G +R I Sbjct: 78 YIRDLDKIKNNDYYKIGDIVLSHIN-SIEHVGKTALIKEDYLPLIHGMNLLRIRVNNNMI 136 Query: 118 FSGFIAHFTKSSLYRN-KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 F+ +T+S +R + + N ++ + I+IPI P EQ+ I K+D LL Sbjct: 137 LPQFLQLYTRSYNFRKAVLKRIKMAVNQVSLNQKNLKQISIPIAPKNEQRRIVYKVDRLL 196 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP------------------ 218 ++++ K + + + R A+L A G+LT WR P Sbjct: 197 SKINKAKELIGEAKETFELRRAAILDKAFKGELT--WREENPRVESVDTLLAKINSEKKT 254 Query: 219 -----------QHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQN 266 + ++ ++ G S+K + G P+LR+ +++ G +D N Sbjct: 255 DIKKSPNGLYELPDNWCWIDLGELICHSSYGTSAKAYKDINGLPVLRMGNIKLTGSIDLN 314 Query: 267 DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR-L 325 D+++L ++ ++KL++ DLLF R N S E VG +++ Y LI+ Sbjct: 315 DLKYLPFDHKDVEKYKLEEYDLLFNRTN-SYELVGKSAIVEPEHAGKFTYASYLIKISLF 373 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 K L YI + +S R +++ VK GQ I+ K + S V LPP +E EI R + Sbjct: 374 YKKILAPYICYYINSHIGRKYLLSTVKQQVGQANINSKKLSSLPVPLPPEEEIKEINRIM 433 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLE 445 +++ A + + + N V L QSIL+KAFRGEL + SA LL+ Sbjct: 434 KKVSAK-ENRIQNLLNLGTYVAELEQSILSKAFRGELNTNDPKDE--------SAIELLK 484 Query: 446 KIKAER 451 ++ ++ Sbjct: 485 EVLKDK 490 Score = 156 bits (395), Expect = 2e-36, Method: Composition-based stats. 
Identities = 67/277 (24%), Positives = 136/277 (49%), Gaps = 21/277 (7%) Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 T + ++I ++L+ ++ P + + L+ + E++NG + Sbjct: 2 TDKKSKRIEELLEE----------TIVHEDEEPYELPNNWAWVALDILAE--EIKNGTTI 49 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 K +++ G P+ RI S++ + + +R++ + N + GD++ + N S+E VG Sbjct: 50 KQSKTKPGIPVTRIESIQNNEIQLDRVRYIRDLDKIKNNDYYKIGDIVLSHIN-SIEHVG 108 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 L+K+ + L++ L+R R+ + LP++++++ S + R A++ +K Q + Sbjct: 109 KTALIKE-DYLPLIHGMNLLRIRVNNNMILPQFLQLYTRSYNFRKAVLKRIKMAVNQVSL 167 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + K++K + + P EQ IV +V++L + + ++ + A +IL KAF+G Sbjct: 168 NQKNLKQISIPIAPKNEQRRIVYKVDRLLSKINKAKELIGEAKETFELRRAAILDKAFKG 227 Query: 421 ELTAQWRAENPDLISGENSAAALLEKIKAERAASGGK 457 ELT WR ENP + S LL KI +E+ K Sbjct: 228 ELT--WREENPRV----ESVDTLLAKINSEKKTDIKK 258 >UniRef50_A0ZMI3 Putative uncharacterized protein n=1 Tax=Nodularia spumigena CCY9414 RepID=A0ZMI3_NODSP Length = 437 Score = 274 bits (702), Expect = 4e-72, Method: Composition-based stats. 
Identities = 84/438 (19%), Positives = 162/438 (36%), Gaps = 38/438 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W I S G KD +PL+R +N++ G D ++ Sbjct: 16 GDIPEHWEIVRFSNFINFQEG----PGIMAADFKDYGVPLLRIHNLKPGFVDLERCNYLE 71 Query: 64 KNLVKES---QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGA-FCGVLRP-EKLIF 118 V+++ K++ +DI+I+ S+ + G + E S L+P I Sbjct: 72 PQKVEKTWKHFKLNEDDILISCSAST----GLVSIVDKKAEGSIAYTGIIRLKPANSNIC 127 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 FI S L+ +I L G I + P I I PPL EQK IA LD+ L + Sbjct: 128 REFIKIIVASELFFTQIELLKTGTTIQHYGPTHLRQIKITFPPLYEQKKIACFLDSKLEE 187 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D + +++ ++LK + A++ AV L +W P H + Sbjct: 188 IDKFISNKQRLIELLKEQKTAIINRAVTKGLNPHAPMKPSGIEWLGDIPAHWEVTRAKHI 247 Query: 230 SILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNR--HKLQDGD 287 S + + + +G P + + + + + ++ +L +N L +G Sbjct: 248 SYVFVPQR--NKPNLNLNIGFPWITMEDITSPSISKSTFGYLVSEIDAMNAGSKLLPEGS 305 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 ++ VG GL Q ++ + P Y+ + Sbjct: 306 VI-------ASCVGNFGLSSVNTLQVIINQQLQAYIPIK--INPYYLRYLI---GISKSY 353 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + + ++ ++LPP EQ IVR +++ D + + + Sbjct: 354 FEQIANATTLAYVNQAGFAELPIILPPNDEQLAIVRNIDKELTTIDKAITTIEKEIELIK 413 Query: 408 NLTQSILAKAFRGELTAQ 425 +++++A G++ + Sbjct: 414 EYRTTLISEAVTGKIDVR 431 Score = 146 bits (370), Expect = 1e-33, Method: Composition-based stats. 
Identities = 41/235 (17%), Positives = 84/235 (35%), Gaps = 18/235 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 W P+H + + E + G P+LRI +++ G VD +L Sbjct: 13 DWLGDIPEHWEIVRFSNFINFQEGPG--IMAADFKDYGVPLLRIHNLKPGFVDLERCNYL 70 Query: 272 ECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKD 328 E + E KL + D+L + + G+ ++ K ++ Y +IR + + Sbjct: 71 EPQKVEKTWKHFKLNEDDILISCSAST----GLVSIVDKKAEGSIAYTG-IIRLKPANSN 125 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 E+I+I +S + +KT + + ++ + PP+ EQ +I ++ Sbjct: 126 ICREFIKIIVASELFFTQI-ELLKTGTTIQHYGPTHLRQIKITFPPLYEQKKIACFLDSK 184 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + +I+ +A + NP + L Sbjct: 185 LEEIDKFISNKQRLIELLKEQKTAIINRAVT-------KGLNPHAPMKPSGIEWL 232 >UniRef50_D0C390 Type I restriction-modification system specificity determinant n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0C390_9GAMM Length = 461 Score = 274 bits (701), Expect = 6e-72, Method: Composition-based stats. 
Identities = 90/448 (20%), Positives = 173/448 (38%), Gaps = 43/448 (9%) Query: 5 KLPEGWVIAPVSTVT----TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 +LP W + ++ + + + D+ +PLI+ NNI++GK ++ Sbjct: 16 ELPSHWQEKRLGFLSMQTKNAFVDGPFGSDLKSDDYLDEGIPLIQLNNIRDGKHILRNMK 75 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KL 116 F+ +N + P+DIVIA V ++A ++ A C L P+ +L Sbjct: 76 FISQNKKIDLIRHLALPQDIVIAKM---AEPVARAAVVSDEYDEYVIVADCVKLSPDLEL 132 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + F+ S R +S G I + +P P L+EQ I + LD Sbjct: 133 VDLNFLIWAINSDCVRENAELVSTGTTRIRINLGELKKLKVPYPSLSEQVKIRQYLDHET 192 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 A++D+ A+ E++ +LK RQAV+ AV L +W P+H K Sbjct: 193 AKIDTLIAKQEELIALLKEKRQAVISHAVTKGLNPNVPMKDSGVEWLGEVPEHWTVSKFG 252 Query: 228 FESILTELRN--GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH---- 281 + S + + +G P + ++ + +D +L +E+ L + Sbjct: 253 YISQVVRGGSPRPAGDPALFNGDYSPWVTVAEITK-----DDELYLTSTETFLTKKGSEQ 307 Query: 282 --KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS 339 Q G LL + +L + K+ N D ++ K EY + Sbjct: 308 CRVFQSGTLLLSNSGATLG-------VPKILSINANANDGVVGFEDLK-IDIEYAYFYL- 358 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 N + VK SGQ ++ +K+ + +PP E +IV +++ + + Sbjct: 359 -SILTNDLRERVKQGSGQPNLNTDIVKAIPIAIPPENEIKKIVVDIKKKIDHFSKLMGSA 417 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWR 427 A+ + ++++ G++ + Sbjct: 418 EKAIQLMQERRTALISAVVTGKIDVRNW 445 Score = 153 bits (388), Expect = 1e-35, Method: Composition-based stats. 
Identities = 40/238 (16%), Positives = 98/238 (41%), Gaps = 18/238 (7%) Query: 213 WRNFEPQHSVFKKLNFESI-----LTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQN 266 ++ P H K+L F S+ + G K ++ G P+++++++R G Sbjct: 13 FKTELPSHWQEKRLGFLSMQTKNAFVDGPFGSDLKSDDYLDEGIPLIQLNNIRDGKHILR 72 Query: 267 DIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 +++F+ ++ +L RH D++ + E V ++ + ++ D + + Sbjct: 73 NMKFISQNKKIDLIRHLALPQDIVIAKM---AEPVARAAVVSDEYDEYVIVADCVKLSPD 129 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 + ++ +S R V T + + I+ ++K V P + EQ +I + + Sbjct: 130 LELVDLNFLIWAINSDCVREN-AELVSTGTTRIRINLGELKKLKVPYPSLSEQVKIRQYL 188 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + A DT+ + +A + Q++++ A + NP++ ++ L Sbjct: 189 DHETAKIDTLIAKQEELIALLKEKRQAVISHAVT-------KGLNPNVPMKDSGVEWL 239 Score = 99.8 bits (248), Expect = 2e-19, Method: Composition-based stats. Identities = 36/209 (17%), Positives = 77/209 (36%), Gaps = 9/209 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTD-LVF 61 G++PE W ++ ++ ++RG + + DY P + I ++ + T F Sbjct: 240 GEVPEHWTVSKFGYISQVVRGGSPRPAGDPALFNGDYSPWVTVAEITKDDELYLTSTETF 299 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + K ++ + ++++ S + V + + G + I + Sbjct: 300 LTKKGSEQCRVFQSGTLLLSNSGATLGVPKILSINANANDGVVG------FEDLKIDIEY 353 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S L + + G+ N+ I I IPP E K I + + Sbjct: 354 AYFYL-SILTNDLRERVKQGSGQPNLNTDIVKAIPIAIPPENEIKKIVVDIKKKIDHFSK 412 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLT 210 E+ Q+++ R A++ V GK+ Sbjct: 413 LMGSAEKAIQLMQERRTALISAVVTGKID 441 >UniRef50_C5BH70 Restriction modification system DNA specificity domain protein n=1 Tax=Edwardsiella ictaluri 93-146 RepID=C5BH70_EDWI9 Length = 441 Score = 273 bits (700), Expect = 7e-72, Method: Composition-based stats. 
Identities = 76/454 (16%), Positives = 160/454 (35%), Gaps = 48/454 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W I + + + G YK Q DD P++ + G+F F Sbjct: 19 GLVPESWTICRLKNLAAIKNGQDYKSVQ-----TDDGYPVMGSG----GQFT-----FAS 64 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 K + + + G K + K + + PF + L + + ++ Sbjct: 65 KFMYDKPSVLL----------GRKGTIDKPLYINEPFWTVDTMYYTEL--NEGFDARYLY 112 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLA-EQKIIAEKLDTLLAQVDST 182 + + + S S + ++ +P E+K I + LD A++D+ Sbjct: 113 YLALTI----QFSRYSTNTALPSMTQEHLSNYKFSVPKAESERKKITKFLDHETAKIDNL 168 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + +Q+ ++LK R AV+ AV L +W P+H L + Sbjct: 169 IEKQQQLIELLKEKRHAVISHAVTKGLNPDVPMKDSGVEWLGEVPEHWTISTLKHHAKFI 228 Query: 234 ELRNGLSSKPNES--GVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLF 290 + G + G L ++ ++ +D ++ + LNR K +GD+ Sbjct: 229 DGDRGSEYPNDNDLVDDGVVFLSSKNISNWEINIDDANYISREKFNRLNRGKAINGDV-I 287 Query: 291 TRYNGSLEFVGVCGLLKKL--QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 + GS +G + + +++ RL ++ + Sbjct: 288 VKVRGSTGRIGELAIFETERLNKSTAFINAQMMIIRLKNSFNNRFLCNVAQGHYWMEQL- 346 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 N + Q+ ++ ++++PP+ EQ I + +E D + K +N + + Sbjct: 347 NVGAYGTAQQQLNNAIFSGMIMVVPPIDEQLTINKFLELEIKRFDGLIKNTSNMIQLIQE 406 Query: 409 LTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 ++++ A G++ + PD E Sbjct: 407 RRTALISAAVTGKIDVRDWVA-PDTQEAEEPQEV 439 Score = 112 bits (281), Expect = 3e-23, Method: Composition-based stats. 
Identities = 26/233 (11%), Positives = 72/233 (30%), Gaps = 40/233 (17%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+ +L L ++NG K ++ G+P++ G Sbjct: 16 EWLGLVPESWTICRLK---NLAAIKNGQDYKSVQTDDGYPVMGSG----GQFTFA----- 63 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 ++ +L R + ++ D + L + Sbjct: 64 -------SKFMYDKPSVLLGRKGTIDK--------PLYINEPFWTVDTMYYTELNEGFDA 108 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVK-EQAEIVRRVEQLFA 390 Y+ + T + ++ + + + +P + E+ +I + ++ A Sbjct: 109 RYLYYLALTIQFSRY-----STNTALPSMTQEHLSNYKFSVPKAESERKKITKFLDHETA 163 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + ++++ A + NPD+ ++ L Sbjct: 164 KIDNLIEKQQQLIELLKEKRHAVISHAVT-------KGLNPDVPMKDSGVEWL 209 >UniRef50_A0L1U2 Restriction modification system DNA specificity domain n=1 Tax=Shewanella sp. ANA-3 RepID=A0L1U2_SHESA Length = 425 Score = 273 bits (699), Expect = 9e-72, Method: Composition-based stats. Identities = 86/442 (19%), Positives = 171/442 (38%), Gaps = 38/442 (8%) Query: 1 MSAGKLPEGWVIAPVSTVTTLI---RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT 57 MS +P+ W + P+ +V + RG T KK + + ANN+Q G+ D Sbjct: 1 MSN-TVPDNWNVLPLGSVIKQVIDFRGRTPKK--LGMEWGGGNIRALSANNVQMGRVDFN 57 Query: 58 DLVFV-PKNLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP 113 ++ L + DI+ M ++ +G A +L+ Sbjct: 58 KECYLASDELYDKWMTKGTTEVGDILFTM----EAPLGNIALVPNDDRYILSQRVILLKN 113 Query: 114 EK-LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 +K S F+ +S +++ + + G I+ +++ +PPL EQ+ IA+ L Sbjct: 114 DKSKASSDFLFQQLRSDSFQDTLRENATGTTAQGIQQKRLVTLDVVLPPLPEQQKIAKIL 173 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAV--NGKLTEKWR----NFEPQHSVFKKL 226 ++ ++ T+A+ +++ + Q +L V +GK +++ P+ L Sbjct: 174 TSVDEVIEKTQAQIDKLKDLKTGMMQELLTQGVGIDGKPHTEFKDSPVGRIPKAWNCVTL 233 Query: 227 NFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRH--KL 283 S + +G S G P L +S VR G++D FL EL K Sbjct: 234 KNLS--KRITDGTHQTVKTSPDGTIPFLYVSCVRDGNIDWEKASFLTEEMYELASKGRKP 291 Query: 
284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPS 342 ++GD+L+T G ++ + + + + + E++ F +SP Sbjct: 292 ENGDILYTAVGSY----GHAAIVSGDNRFS--FQRHIAFIQPNHEKIDSEFLVSFLNSPL 345 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + + + Q ++ D+ V LP + EQ I ++F D V Sbjct: 346 GKKQ-ADLYAIGNAQLTVTLGDLGKFKVALPDIAEQQRIA----KIFNGIDNRIIVVQRK 400 Query: 403 LARVNNLTQSILAKAFRGELTA 424 L + N ++++ G++ Sbjct: 401 LTSLGNTKKALMQDLLTGKVRV 422 >UniRef50_Q64AS2 Restriction endonuclease S subunits n=1 Tax=uncultured archaeon GZfos29E12 RepID=Q64AS2_9ARCH Length = 438 Score = 273 bits (699), Expect = 1e-71, Method: Composition-based stats. Identities = 70/438 (15%), Positives = 161/438 (36%), Gaps = 35/438 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PEGW + + T+ ++G D+ L+ + ++G + D V Sbjct: 17 IGEIPEGWEVNKIKN-TSYVKGRIGWHGLTSEEYSDEGAYLVTGTDFKDGVIEWEDCHHV 75 Query: 63 PKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRP-EKLIF 118 + KE + +D++I +GK A + LP + + + ++RP K F Sbjct: 76 GWDRYKEDPYIHLKEDDLLITKD----GTIGKVALIKFLPNKATLNSGIFLVRPLNKKYF 131 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ S+++ + GA I+++ +F+ PIP EQ IA LD A+ Sbjct: 132 PKFMYWMLNSTVFERFFDYIKTGATISHLYQETFERFFFPIPLKQEQVAIASFLDKKTAK 191 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFE 229 +D+ + +++ ++LK R A++ AV L W P+ + Sbjct: 192 IDALIEKDKRLIELLKEKRTALIDHAVTKGLDPNVKMKDFGIVWIGKIPEDAKIMPFRRV 251 Query: 230 SILTELRNGLSSK--PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + + K + I ++ ++ E + + D Sbjct: 252 CYVNQGLQFPEDKRLSEPDEKSKIYITIK-----YIHADEDGVKEYIPNPPRGVICKKED 306 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 +L R + E + +Q ++ + + +Y+ + S + + Sbjct: 307 VLLARTGATGEVI---------TNQEGVFHNNFFKVNYNSKIDRDYLVYYLKMDSIKKVL 357 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 + + ++ S +L +++Q +I +++ A D K + + + Sbjct: 358 
LLKA-GVTTIPDLNHDAFLSTPFILYSIEKQKQIAEYLDKKTAKIDKNIKLIEKKIKLLE 416 Query: 408 NLTQSILAKAFRGELTAQ 425 +S++ G++ + Sbjct: 417 EYKKSLINHVVTGKVDVR 434 Score = 149 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 35/235 (14%), Positives = 85/235 (36%), Gaps = 16/235 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLS--SKPNESGVGHPILRISSVRAGHVDQNDIR 269 +W P+ K+ S + + R G + S G ++ + + G ++ D Sbjct: 15 EWIGEIPEGWEVNKIKNTSYV-KGRIGWHGLTSEEYSDEGAYLVTGTDFKDGVIEWEDCH 73 Query: 270 FLECS-ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + E L++ DLL T+ +G L+K L ++ L + L K Sbjct: 74 HVGWDRYKEDPYIHLKEDDLLITKDG----TIGKVALIKFLPNKATLNSGIFLVRPLNKK 129 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P+++ +S + +KT + + + + +P +EQ I +++ Sbjct: 130 YFPKFMYWMLNSTVF-ERFFDYIKTGATISHLYQETFERFFFPIPLKQEQVAIASFLDKK 188 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + ++ + + +++ A + +P++ + + Sbjct: 189 TAKIDALIEKDKRLIELLKEKRTALIDHAVT-------KGLDPNVKMKDFGIVWI 236 >UniRef50_B7R237 Type I restriction modification system, subunit S n=1 Tax=Thermococcus sp. AM4 RepID=B7R237_9EURY Length = 428 Score = 272 bits (697), Expect = 2e-71, Method: Composition-based stats. 
Identities = 83/431 (19%), Positives = 173/431 (40%), Gaps = 25/431 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFD--TTD 58 G++P W + V + + G T +Q +Y ++ + I ++ NG ++ Sbjct: 12 IGEIPRDWKVVRVREIFDVKTGTTPSTKQ-TDYWENGEMNWITPTDLSKLNGNIYMGDSE 70 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE--KL 116 K L + + P+ +I + + L E +F C L P+ Sbjct: 71 RKITKKALEDYNLSLLPKGSLILSTRAPVGYIA-----VLTEEATFNQGCKGLVPKDQNK 125 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I F A++ K R + SLS G+ + A + +P+PP EQK IAE L T+ Sbjct: 126 IIPEFYAYYFKFK--RQHLESLSGGSTFKELAKAMLERFLVPLPPRLEQKKIAEILRTVD 183 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGK-LTEKWRNFEPQHSVFKKLNFESILTEL 235 ++ T E+ ++ K +L + + + P+ + E I + Sbjct: 184 EAIEKTDLAIEKTERLKKGLMLRLLTKGIKHERFKKTEIGEIPEEWRV--VRLEEITRRI 241 Query: 236 RNGLSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRY 293 + G S K +++ G + + G+++ ++ ++L + + L+++ L++GDL+ Sbjct: 242 KRGPSKKTDDNETGVVYVTSDYIDDHGNLNFDNPKYLSLEKIDRLDKYLLEEGDLIINCV 301 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 N SLE +G + + + ++ + L P Y++ FF S + + + K Sbjct: 302 N-SLEKIGKVAVFEGYSKKAIVGFNN-FALTLVSTVNPYYVKYFFLSYKGKALIKSISKA 359 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 Q S KD+ + LPP+ EQ +I + + + + ++ + + + Sbjct: 360 AVQQVSFSSKDLLRLKIPLPPLPEQKQIAEILSTVDKKLELL----RKRREKLELVKRGL 415 Query: 414 LAKAFRGELTA 424 + G Sbjct: 416 MKGLLTGRRRV 426 Score = 141 bits (355), Expect = 7e-32, Method: Composition-based stats. 
Identities = 40/206 (19%), Positives = 83/206 (40%), Gaps = 10/206 (4%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLV 60 G++PE W + + +T I+ KK + + + ++ I + G + + Sbjct: 221 EIGEIPEEWRVVRLEEITRRIKRGPSKKTDD----NETGVVYVTSDYIDDHGNLNFDNPK 276 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHL-PFECSFGAFCGVLRPEKLI 117 ++ + + D++I + S +GK A + G L + Sbjct: 277 YLSLEKIDRLDKYLLEEGDLIINCVN-SLEKIGKVAVFEGYSKKAIVGFNNFALTLVSTV 335 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANIN-NIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 ++ +F S + I S+S A + + IP+PPL EQK IAE L T+ Sbjct: 336 NPYYVKYFFLSYKGKALIKSISKAAVQQVSFSSKDLLRLKIPLPPLPEQKQIAEILSTVD 395 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLG 202 +++ + R E++ + + + +L Sbjct: 396 KKLELLRKRREKLELVKRGLMKGLLT 421 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. Identities = 32/238 (13%), Positives = 83/238 (34%), Gaps = 31/238 (13%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVR--AGHVD 264 KL + P+ ++ + + + + G + + + G++ Sbjct: 6 KLKKTPIGEIPRDWKVVRVREIFDVKTGTTPSTKQTDYWENGEMNWITPTDLSKLNGNIY 65 Query: 265 --QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 ++ + + + + N L G L+ + + VG +L + N + Sbjct: 66 MGDSERKITKKALEDYNLSLLPKGSLILS----TRAPVGYIAVLTEEATFNQGCKGLV-- 119 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 + +PE+ +F + + + S K ++ ++ +V LPP EQ +I Sbjct: 120 PKDQNKIIPEFYAYYFK---FKRQHLESLSGGSTFKELAKAMLERFLVPLPPRLEQKKIA 176 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR-------------GELTAQWR 427 + + D ++ + A+ + L + ++ + GE+ +WR Sbjct: 177 EILRTV----DEAIEKTDLAIEKTERLKKGLMLRLLTKGIKHERFKKTEIGEIPEEWR 230 >UniRef50_C6WNJ9 Restriction modification system DNA specificity domain protein n=1 Tax=Actinosynnema mirum DSM 43827 RepID=C6WNJ9_ACTMD Length = 442 Score = 272 bits (697), Expect = 2e-71, Method: Composition-based stats. 
Identities = 79/441 (17%), Positives = 156/441 (35%), Gaps = 37/441 (8%) Query: 6 LP--EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +P + W +P+ +T+++ + A Y+ + + +I Q G D + F Sbjct: 7 IPISDTWTTSPLKRITSVLNRGS-----APEYVDESPVRVISQAANQYGGLDWSRTRFHN 61 Query: 64 KNLVKESQK--ISPEDIVIAMSS-GSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFS 119 N K + DI+I + G+ VG C V+R +K + Sbjct: 62 FNGDPTKLKGHLQENDIIINSTGTGTLGRVGYFTEPLNGIPCMADGHVTVVRVKKHKVNP 121 Query: 120 GFIAHFTKSSLYRNKI-SSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ ++ S ++ I SSL+ GA + +IP PP++EQ+ I + L+ A Sbjct: 122 RFVYYWLTSKPFQEYIHSSLAIGATNQIELNRDRLSDTHIPNPPISEQQRIVDFLEAETA 181 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAV-------NGKLTEKWRNFEPQHSVFKKLNFES 230 +D ++ + L R A + AV + W P +L+ + Sbjct: 182 HIDRLIETQNRVLEKLAERRMAGITQAVSGTDQTGTRPSSLTWLEKIPSTWKEVRLSLIA 241 Query: 231 ILTELRNGLSSKPNES-GVGHPILRISSVRA------GHVDQNDIRFLECSESELNRHKL 283 + S P P + VR + + + E + Sbjct: 242 RMGSGHTPSRSHPEWWVDCTIPWITTGEVRQVRNDRLEDLHETREKISELGLANSAAELR 301 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 G ++ R G + ++ + P Y+ + Sbjct: 302 PAGTVVLCRT----ASAGYSAV----MGTDMATSQDFVTWTCGPRLNPYYLLWCLR--AM 351 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 R ++ + S K I D++ + LPP+ EQ +IV+++ + A D + V + Sbjct: 352 RPDLLGRLAMGSTHKTIYVPDLQMLRIPLPPIGEQQKIVQQIREQNARIDRLADAVRLQV 411 Query: 404 ARVNNLTQSILAKAFRGELTA 424 A + Q+++ A G++ Sbjct: 412 ALLAERRQALITAAVTGQIDV 432 Score = 132 bits (333), Expect = 3e-29, Method: Composition-based stats. 
Identities = 37/212 (17%), Positives = 80/212 (37%), Gaps = 13/212 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGKFDT---TD 58 K+P W +S + + G T + + D +P I ++N + + T Sbjct: 227 KIPSTWKEVRLSLIARMGSGHTPSRSH-PEWWVDCTIPWITTGEVRQVRNDRLEDLHETR 285 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 L + ++ P V+ + S G SA + + + + Sbjct: 286 EKISELGLANSAAELRPAGTVVLCRTASA---GYSAV--MGTDMATSQDFVTWTCGPRLN 340 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ ++ + + L+ G+ I ++ IP+PP+ EQ+ I +++ A+ Sbjct: 341 PYYLLWCLRAMR-PDLLGRLAMGSTHKTIYVPDLQMLRIPLPPIGEQQKIVQQIREQNAR 399 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +D +L RQA++ AV G++ Sbjct: 400 IDRLADAVRLQVALLAERRQALITAAVTGQID 431 >UniRef50_B7JRE7 Restriction modification system DNA specificity domain protein n=2 Tax=Bacillus cereus RepID=B7JRE7_BACC0 Length = 495 Score = 271 bits (695), Expect = 3e-71, Method: Composition-based stats. Identities = 99/497 (19%), Positives = 198/497 (39%), Gaps = 74/497 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P W+ +++++ LI ++ K++ P++ NI +G+ + +V + Sbjct: 25 EVPGNWIWGNLNSLSKLIVDGSHNPPPK----KNEGFPMLSGRNILDGEINFETDRYVSE 80 Query: 65 NLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSG 120 + ++ K I D+++ + +G++ F +++P ++ S Sbjct: 81 DDYQKEYKRTPIESNDVLLTI----VGTIGRTTVVPKEFSPFVLQRSVALIKP--MVNSN 134 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +++++ S ++ + + G + + IP+PPL EQK I EK++ LL +V+ Sbjct: 135 YLSYYFSSPYFQYYLQKNAKGTAQKGVYLKTLKSSRIPLPPLMEQKRITEKVEGLLGRVE 194 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE----------------------- 217 KA E+ + + R +L A G+L+ KWR Sbjct: 195 EAKALIEEAKKTFEVRRATILDKAFRGELSAKWREDNRIAEDASSLLERIQIQKRNSSIK 254 Query: 218 ------------------PQHSVFKKLNFES-ILTELRNGLSSKPNESGVGHPILRISSV 258 P + +L S +T S S G +R + Sbjct: 255 SNTLKITSVIKEEEPFELPNGWTWVRLGEISYYVTSGSRDWSK--YYSDEGAMFIRTQDI 312 Query: 259 
RAGHVDQNDIRFLEC-SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 ++ +D+ ++ + E R ++ D+L T VG C L+ + + Sbjct: 313 NKNSLNLSDVAYVSLPEKVEGKRSLVEKADILTTITGA---NVGKCALV-ETNIKEAYVS 368 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 + +L + ++ +Y+ + SP + G+ +S +DIK+ + L P+ E Sbjct: 369 QSVALTKLIEKSISKYVHLSLLSPCGGGNELEERAYGIGRPVLSLEDIKNIKIPLAPMAE 428 Query: 378 QAEIVRRVEQLFAYADTIEKQVNNALAR-VNNLTQSILAKAFRGELTAQWRAENPDLISG 436 Q IV+ VE L + E ++ + + L QSIL KAFRGEL E Sbjct: 429 QQVIVKLVETLLE--NEKESLNLASIEKHLETLKQSILNKAFRGELGTNDPNEE------ 480 Query: 437 ENSAAALLEKIKAERAA 453 S+ LL+K+ E+ Sbjct: 481 --SSMKLLKKVLQEKIK 495 Score = 170 bits (432), Expect = 9e-41, Method: Composition-based stats. Identities = 68/272 (25%), Positives = 121/272 (44%), Gaps = 18/272 (6%) Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI 252 K+ Q +L A+ TE+ P + ++ LN S L + +G + P + G P+ Sbjct: 4 KKKTLQELLEDAL--IPTEEHPYEVPGNWIWGNLNSLSKL--IVDGSHNPPPKKNEGFPM 59 Query: 253 LRISSVRAGHVDQNDIRFLECSE--SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 L ++ G ++ R++ + E R ++ D+L T +G ++ K + Sbjct: 60 LSGRNILDGEINFETDRYVSEDDYQKEYKRTPIESNDVLLTIVG----TIGRTTVVPK-E 114 Query: 311 HQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 + + + Y+ +FSSP + + K + QKG+ K +KS + Sbjct: 115 FSPFVLQRSVALIKPM--VNSNYLSYYFSSPYFQYYLQKNAK-GTAQKGVYLKTLKSSRI 171 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 LPP+ EQ I +VE L + + + A +IL KAFRGEL+A+WR +N Sbjct: 172 PLPPLMEQKRITEKVEGLLGRVEEAKALIEEAKKTFEVRRATILDKAFRGELSAKWREDN 231 Query: 431 PDLISGENSAAALLEKIKAERAASGGKKASRK 462 A++LLE+I+ ++ S K + K Sbjct: 232 RIA----EDASSLLERIQIQKRNSSIKSNTLK 259 >UniRef50_C8NC88 Type I restriction-modification system specificity determinant n=1 Tax=Cardiobacterium hominis ATCC 15826 RepID=C8NC88_9GAMM Length = 465 Score = 271 bits (694), Expect = 4e-71, Method: Composition-based stats. 
Identities = 75/451 (16%), Positives = 156/451 (34%), Gaps = 34/451 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAI--NYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 LPE W I + L+ Q + + ++ + K TTD +V Sbjct: 19 LPESWGILRAKQMFRLVIEKAPANNQMELLSVYTHIGVRPRKSLEQRGNKASTTDGYWV- 77 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + DI+ +G S + + +LRP K + + Sbjct: 78 ---------VKEGDIICNKLLAWMGAIGASHY-----QGVTSPAYDILRPVKPCNTDYYH 123 Query: 124 HFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 ++ Y + S G + F I IP+P +EQ I L A + Sbjct: 124 FLFRTKKYLQQFKIRSRGIMDMRLRLYFDQFGQIPIPVPSRSEQDQIVAYLRAQDAYIAR 183 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + ++L + ++ AV L +W P+H ++L + + Sbjct: 184 FIKAKRDLIKLLTEQKLRIIDHAVTRGLDSSVALRPSGIEWLGEVPEHWEVQRLKNVANM 243 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 + + G P LR ++V+ D D++ + +++E+ + +++ GDLL + Sbjct: 244 VLGKMLTTEAKAGDGDFKPYLRSTNVQWIKPDVRDVKEMWVAKAEMAQLRIRKGDLLVSE 303 Query: 293 YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 VG + + + + R LPE++ F + R N + Sbjct: 304 GGE----VGRACMWNDELPE-CYIQNSVHRVAAKPMMLPEFLFHQFFTYGKRGRF-NAIV 357 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 ++ + + + +PP++EQ I R + + D + + + Sbjct: 358 NRVSIAHLTREKLVTVPFTVPPIEEQKAICRWITEECQPLDDAIARAEEEIKLIREYRDR 417 Query: 413 ILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++A G++ + PD + + +AL Sbjct: 418 LIADVVTGQVDVRGWQPGPDDMVDDALLSAL 448 Score = 129 bits (324), Expect = 3e-28, Method: Composition-based stats. 
Identities = 39/221 (17%), Positives = 90/221 (40%), Gaps = 10/221 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + + V ++ G E D+ P +R+ N+Q K D D+ + Sbjct: 226 GEVPEHWEVQRLKNVANMVLGKMLTTE--AKAGDGDFKPYLRSTNVQWIKPDVRDVKEMW 283 Query: 64 KNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSGF 121 + +I D++++ VG++ + C + + ++ F Sbjct: 284 VAKAEMAQLRIRKGDLLVS----EGGEVGRACMWNDELPECYIQNSVHRVAAKPMMLPEF 339 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + H + R + +++ +I ++ + +PP+ EQK I + +D Sbjct: 340 LFHQFFTYGKRGRFNAIVNRVSIAHLTREKLVTVPFTVPPIEEQKAICRWITEECQPLDD 399 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV 222 AR E+ ++++ +R ++ V G++ R ++P Sbjct: 400 AIARAEEEIKLIREYRDRLIADVVTGQVD--VRGWQPGPDD 438 >UniRef50_A3JE98 Type I restriction-modification system, S subunit n=1 Tax=Marinobacter sp. ELB17 RepID=A3JE98_9ALTE Length = 429 Score = 271 bits (693), Expect = 4e-71, Method: Composition-based stats. Identities = 80/436 (18%), Positives = 167/436 (38%), Gaps = 35/436 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT---DLVFV 62 +P W+ A V + G + + A + +RA NI D + + Sbjct: 7 VPSHWIKASVGNYCDVQLGKMLQSDPASQNDESK--RYLRAINITKHGLDLSHDFSMWIK 64 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP--EKLIFSG 120 P+ + E ++ DI+++ G++A E F +RP I Sbjct: 65 PQEM--EKFRLQRGDILVS----EGGDAGRTAVFDCDEEFYFQNAINRIRPAGNSTILPE 118 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FI ++ + + A I + + +PPL Q IA+ LD A++D Sbjct: 119 FIYYWFTFLKVAGYVEMVCNVATIAHFTAEKVKAAPLALPPLKTQHSIAQFLDEKTARID 178 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESI 231 + + L RQA++ A+ L +W P + KKL Sbjct: 179 GLIEKKCALLDRLAEKRQALITRAITKGLDPNAIMKPSGTEWLGHIPANWEVKKLRRVRR 238 Query: 232 LTELRNGLSSKPNES-GVGHPILRISSVRAGHV--DQNDIRFLECSES-ELNRHKLQDGD 287 + +G G LR+++V + D ++ R++ + E R +++GD Sbjct: 239 --YMTSGSRDWAAYYADEGDRFLRMTNVTGEGIELDLSETRYVNLDGATEGTRTSVREGD 296 Query: 288 
LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNA 346 +L T +G +++K + L R + + ++ F S+ AR Sbjct: 297 ILITIT----AELGAVAVIRKEIEGAYI-NQHLALFRPSPELCESGFLVNFLSTDMARAQ 351 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 M + + Q G+ + + + ++ PP++EQ I ++ ++++E+ + ++ ++ Sbjct: 352 FMLSGQGGTKQ-GLGFEQVNNVIIGFPPLREQELIGNFCSEIRRQSESVEQPLKLSIDKL 410 Query: 407 NNLTQSILAKAFRGEL 422 +++ A G+L Sbjct: 411 IEYRSAVITAAVTGQL 426 Score = 145 bits (367), Expect = 3e-33, Method: Composition-based stats. Identities = 37/232 (15%), Positives = 86/232 (37%), Gaps = 16/232 (6%) Query: 214 RNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLEC 273 P H + + + + S +++ LR ++ +D + + Sbjct: 4 LEAVPSHWIKASVGNYCDVQLGKMLQSDPASQNDESKRYLRAINITKHGLDLSHDFSMWI 63 Query: 274 SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL--TKDALP 331 E+ + +LQ GD+L + G + + + + + R R LP Sbjct: 64 KPQEMEKFRLQRGDILVSEGG----DAGRTAVFDCDEE--FYFQNAINRIRPAGNSTILP 117 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 E+I +F+ + V + + + +K+ + LPP+K Q I + +++ A Sbjct: 118 EFIYYWFTFLKVAGYV-EMVCNVATIAHFTAEKVKAAPLALPPLKTQHSIAQFLDEKTAR 176 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ L R+ Q+++ +A + +P+ I + L Sbjct: 177 IDGLIEKKCALLDRLAEKRQALITRAIT-------KGLDPNAIMKPSGTEWL 221 Score = 129 bits (326), Expect = 2e-28, Method: Composition-based stats. 
Identities = 45/213 (21%), Positives = 82/213 (38%), Gaps = 13/213 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF--DTTDLVF 61 G +P W + + V R +T Y D+ +R N+ D ++ + Sbjct: 222 GHIPANWEVKKLRRV---RRYMTSGSRDWAAYYADEGDRFLRMTNVTGEGIELDLSETRY 278 Query: 62 VPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KLI 117 V + E + + DI+I ++ + +G A E + RP +L Sbjct: 279 VNLDGATEGTRTSVREGDILITIT----AELGAVAVIRKEIEGAYINQHLALFRPSPELC 334 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 SGF+ +F + + R + G + + + I PPL EQ++I + Sbjct: 335 ESGFLVNFLSTDMARAQFMLSGQGGTKQGLGFEQVNNVIIGFPPLREQELIGNFCSEIRR 394 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 Q +S + + L +R AV+ AV G+L Sbjct: 395 QSESVEQPLKLSIDKLIEYRSAVITAAVTGQLE 427 >UniRef50_B3G223 Type I restriction modification DNA specificity protein n=1 Tax=Pseudomonas aeruginosa RepID=B3G223_PSEAE Length = 395 Score = 271 bits (693), Expect = 5e-71, Method: Composition-based stats. Identities = 90/424 (21%), Positives = 175/424 (41%), Gaps = 40/424 (9%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI----QNGKFDTTDLVFVPK 64 W I + + + T K +++ +P RA + + G+ D + +F+ + Sbjct: 2 SWPIVKLGEIFDI----TSSKRVHEIDWRNEGVPFYRAREVAVLAKEGRVD--NDLFIDE 55 Query: 65 NLVK----ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIFS 119 ++ + + D+++ +GK F A LR + + + Sbjct: 56 SMYEEFKAKYGVPKVGDLLVTA----VGTLGKVYAVQESDRFYFKDASVIWLRARQEVDT 111 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 +I H S+ + I + S+GA + + + IP+PPL EQK IA LD A Sbjct: 112 SYIQHAMNSTDVQRFIQN-SSGATVGTYTISRANETEIPLPPLPEQKRIAAILDKADA-- 168 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 + + +Q Q+ F +AV +T F ++ G Sbjct: 169 --IRRKRQQAIQLADDFLRAVFLDMFGDPVTNSK--------GFPIGTIRDLVATADYGS 218 Query: 240 SSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 S+K +E+ +PILR+ ++ G +D ++++ E E +++ ++ GDLLF R N S E Sbjct: 219 SAKASETYGEYPILRMGNITYQGRIDLEGLKYINLEEKERSKYLVEKGDLLFNRTN-SKE 277 Query: 299 
FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG + + LIR R + YI + +S + + + K+ G Sbjct: 278 LVGKTAVYD--MDDPVAIAGYLIRVRPNEMGNSHYISGYLNSAHGKATLRSICKSIVGMA 335 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ +++++ ++LP ++ Q + ++L + + AL L S+ KAF Sbjct: 336 NINAQEMQNIPIMLPSIELQRKY----QELVVVTKCKLQVFDTALKLTEQLFSSLSYKAF 391 Query: 419 RGEL 422 G+L Sbjct: 392 SGQL 395 >UniRef50_Q8EJT0 Type I restriction-modification system, S subunit n=1 Tax=Shewanella oneidensis RepID=Q8EJT0_SHEON Length = 439 Score = 271 bits (693), Expect = 5e-71, Method: Composition-based stats. Identities = 73/443 (16%), Positives = 165/443 (37%), Gaps = 33/443 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT--TDLVF 61 GK+P W + + G + + + L+R N+ G + Sbjct: 10 GKIPNDWEYQIIIDNVEFLTGPAFDSS--LFNTESRGARLVRGINLTQGSTRWGEDKTKY 67 Query: 62 VPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 L + +++ DI+I M + + LR + + S Sbjct: 68 WDVELNNLKKYQLAINDILIGMDGSLVGK-NYAYLKQSDLPALLVQRVARLRAKSNLHSK 126 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ + + + + + + + I +I P PPL EQ+ IA L ++ ++ Sbjct: 127 YLYYMYATDFWLDYVEVVKTNSGIPHISNGDIKNFRFPFPPLPEQQKIAAILTSVDEVIE 186 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKW----------RNFEPQHSVFKKLNFES 230 T+A+ +++ + Q +L V K +K+ P+ K S Sbjct: 187 KTQAQIDKLKDLKSGMMQELLTKGVGIKQGDKYVPHIEFKDSPVGKIPKSWEVK--PLNS 244 Query: 231 ILTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGD 287 ++ ++ + + ++R S+VR G + +D+++ + NR GD Sbjct: 245 VVLKIIDCEHKTAPYVDKSEYLVVRTSNVRHGELVLDDMKYTHADGYAEWTNRAIPSLGD 304 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNA 346 +LFTR G L+ + + + +++ R + + + +F +S +A A Sbjct: 305 VLFTREAP----AGESCLVPE--NTKVCMGQRMVLLRPDANVIFSNFFSLFLTSEAASCA 358 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + I+ +DIK ++PP+ EQ EI + ++ + L + Sbjct: 359 IYER-SIGTTVSRINIEDIKRIPCIVPPLSEQQEISKAIQSV----QNSILNKQEKLQSL 413 Query: 407 
NNLTQSILAKAFRGELTAQWRAE 429 NL ++++ G++ + + Sbjct: 414 KNLKKALMQDLLTGKVRVKVDND 436 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. Identities = 45/214 (21%), Positives = 92/214 (42%), Gaps = 11/214 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+P+ W + P+++V I +K Y+ ++R +N+++G+ D+ + Sbjct: 230 VGKIPKSWEVKPLNSVVLKIIDCEHKT---APYVDKSEYLVVRTSNVRHGELVLDDMKYT 286 Query: 63 PKNLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIF 118 + E S D++ ++ G+S + G +LRP+ +IF Sbjct: 287 HADGYAEWTNRAIPSLGDVLFTR----EAPAGESCLVPENTKVCMGQRMVLLRPDANVIF 342 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 S F + F S I S G ++ I I +PPL+EQ+ I++ + ++ Sbjct: 343 SNFFSLFLTSEAASCAIYERSIGTTVSRINIEDIKRIPCIVPPLSEQQEISKAIQSVQNS 402 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 + + + + + + + K Q +L G V K+ Sbjct: 403 ILNKQEKLQSLKNLKKALMQDLLTGKVRVKVDND 436 >UniRef50_A1TX70 Restriction modification system DNA specificity domain n=1 Tax=Marinobacter aquaeolei VT8 RepID=A1TX70_MARAV Length = 461 Score = 270 bits (690), Expect = 1e-70, Method: Composition-based stats. 
Identities = 72/457 (15%), Positives = 162/457 (35%), Gaps = 44/457 (9%) Query: 5 KLPEGWVIAPV-STVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 ++P W + P S K E ++ + G+ D+ + Sbjct: 19 RIPSSWQLLPFFSRFFERKESNKGMKSENLLS--------------LSFGRIVRKDITTL 64 Query: 63 P---KNLVKESQKISPEDIVIAMSSGSKSVVG-KSAHQHLPFECSFGAFCGVLRPEKLIF 118 + Q + P +IV ++ +SA + A+ V K Sbjct: 65 EGLLPASFETYQVVHPGNIVFRLTDLQNDKRSLRSAIVN-EKGIITSAYLAV--SAKDFN 121 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F + ++ S+ G ++K + I P + EQ IA LD A+ Sbjct: 122 PTFSNYLFRAYDLMKVFYSMGGGL-RQSMKYDDMKWLPIVCPSINEQTQIARFLDHETAK 180 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE 229 +D+ E++ ++L+ RQAV+ AV L +W P H + Sbjct: 181 IDALIREQERLIELLQEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWDRTLIKHC 240 Query: 230 SILTELRNGLSSKP-----NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH-KL 283 + + +G +++ +G P +R +++ V + ++ ++ R +L Sbjct: 241 CYINDGNHGEEYPKGDDFVDDADIGVPFIRGGNLKDMTVTTEGMLYITAEKNRSMRKGRL 300 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 Q GD+LF +G ++ + L + P Y+ + +S + Sbjct: 301 QVGDILFVNRGE----IGKLAVIPSSMNGANLNSQIAYLRVENRIIDPHYLVHYLASDTI 356 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + + + S D+ + V +PP EQ +I +++ + + + +N++ Sbjct: 357 KAEI-KAAQEGSVLTQYPISDLAAIHVPVPPKDEQQKISTYLKEQLFSFNVLTSEASNSI 415 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSA 440 ++ ++++ A G++ + D + + Sbjct: 416 NLLSERRSALISAAVTGKIDVRNWQPPSDESAFDEEV 452 Score = 132 bits (332), Expect = 3e-29, Method: Composition-based stats. 
Identities = 41/214 (19%), Positives = 84/214 (39%), Gaps = 11/214 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGV---TYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++P W + + G Y K D +P IR N+++ T ++ Sbjct: 226 GEVPAHWDRTLIKHCCYINDGNHGEEYPKGDDFVDDADIGVPFIRGGNLKDMTVTTEGML 285 Query: 61 FVPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRP-EKL 116 ++ + +K + DI+ ++ +GK A + + LR ++ Sbjct: 286 YITAEKNRSMRKGRLQVGDILFV----NRGEIGKLAVIPSSMNGANLNSQIAYLRVENRI 341 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 I ++ H+ S + +I + G+ + + I++P+PP EQ+ I+ L L Sbjct: 342 IDPHYLVHYLASDTIKAEIKAAQEGSVLTQYPISDLAAIHVPVPPKDEQQKISTYLKEQL 401 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + + +L R A++ AV GK+ Sbjct: 402 FSFNVLTSEASNSINLLSERRSALISAAVTGKID 435 Score = 105 bits (263), Expect = 3e-21, Method: Composition-based stats. Identities = 37/243 (15%), Positives = 86/243 (35%), Gaps = 25/243 (10%) Query: 204 AVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV 263 A + + S ++ L F S E ES G + S+ G + Sbjct: 5 AYPEYKNTEIPWMQRIPSSWQLLPFFSRFFE--------RKESNKGMKSENLLSLSFGRI 56 Query: 264 DQNDIRFLE--CSESELNRHKLQDGDLLFTRYN-GSLEFVGVCGLLKKLQHQNLLYPDKL 320 + DI LE S + G+++F + + + ++ + + ++ L Sbjct: 57 VRKDITTLEGLLPASFETYQVVHPGNIVFRLTDLQNDKRSLRSAIVNE---KGIITSAYL 113 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 + KD P + F + + ++ + D+K ++ P + EQ + Sbjct: 114 AVS--AKDFNPTFSNYLFRAYDLMKVFYSM--GGGLRQSMKYDDMKWLPIVCPSINEQTQ 169 Query: 381 IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSA 440 I R ++ A D + ++ + + Q++++ A + +PD+ ++ Sbjct: 170 IARFLDHETAKIDALIREQERLIELLQEKRQAVISHAVT-------KGLDPDVPMKDSGV 222 Query: 441 AAL 443 L Sbjct: 223 EWL 225 >UniRef50_A7JZU8 Type I restriction-modification system specificity subunit S n=1 Tax=Vibrio sp. Ex25 RepID=A7JZU8_VIBSE Length = 437 Score = 269 bits (689), Expect = 1e-70, Method: Composition-based stats. 
Identities = 73/433 (16%), Positives = 144/433 (33%), Gaps = 33/433 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P W AP+ +V+ L + E ++ D + IR + ++ + + T L Sbjct: 22 GSIPSHWEAAPLCSVSKLKSITNHVGEPLLSVYLDKGV--IRFDEVEAKRTNVTSL---- 75 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + Q + P D V+ + VG SAH VL+ I+ F Sbjct: 76 --DLSKYQLVEPGDFVLNNQQAWRGSVGISAH-----RGIVSPAYLVLQLSSKIYPRFGN 128 Query: 124 HFTK--SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + S + ++S G N+ + P L EQ IA LD +Q+D Sbjct: 129 YLFRDGSMVANYLVNSKGVGTIQRNLYWPQLKRALVFFPGLDEQIAIANYLDEKTSQIDE 188 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A ++ ++LK +Q ++ AV L W P+H K+ + + Sbjct: 189 AIAIKQKQIELLKERKQIIIQQAVTQGLNPDVPMKDSGVDWIGKIPEHWTVSKIGHYARV 248 Query: 233 TELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 G P + V + + E + G +L Sbjct: 249 YNGSTPSRDVKRYWDEGTIPWMSSGKVNDYIISTPSELITTAALRECSLRIFPKGTVLIG 308 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 G + + + + ++ L E++ + + Sbjct: 309 IVGQ-----GKTRGTSAMLAIDAVINQNVAGIIPSEKILSEFLHQYLIQAY---DEVRNQ 360 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 S Q+ ++ + + S + P + EQ EIV + D N + ++ Sbjct: 361 GQGSNQEALNCQILSSFKIAFPSIIEQKEIVHFIAIQSQKLDQSIDIQFNQIEKLKEYKT 420 Query: 412 SILAKAFRGELTA 424 +++ A G++ Sbjct: 421 TLINSAVTGKIKV 433 Score = 142 bits (359), Expect = 3e-32, Method: Composition-based stats. 
Identities = 39/208 (18%), Positives = 78/208 (37%), Gaps = 5/208 (2%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 GK+PE W ++ + + G T + Y + +P + + + + T + Sbjct: 230 IGKIPEHWTVSKIGHYARVYNGSTPSR-DVKRYWDEGTIPWMSSGKVNDYIISTPSELIT 288 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L + S +I P+ V+ G G SA L + + P + I S F+ Sbjct: 289 TAALRECSLRIFPKGTVLIGIVGQGKTRGTSAM--LAIDAVINQNVAGIIPSEKILSEFL 346 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + +++ + G+N + I P + EQK I + ++D + Sbjct: 347 HQYL--IQAYDEVRNQGQGSNQEALNCQILSSFKIAFPSIIEQKEIVHFIAIQSQKLDQS 404 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + LK ++ ++ AV GK+ Sbjct: 405 IDIQFNQIEKLKEYKTTLINSAVTGKIK 432 Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats. Identities = 39/235 (16%), Positives = 85/235 (36%), Gaps = 28/235 (11%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDI--RF 270 W P H L S L + N VG P+L + + G + +++ + Sbjct: 20 WLGSIPSHWEAAPLCSVSKLKSITN---------HVGEPLLSVY-LDKGVIRFDEVEAKR 69 Query: 271 LECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + +L+++ ++ GD + VG+ H+ ++ P L+ +L+ Sbjct: 70 TNVTSLDLSKYQLVEPGDFVLNNQQAWRGSVGISA------HRGIVSPAYLV-LQLSSKI 122 Query: 330 LPEYIEIFFSS-PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P + F N ++N + Q+ + +K +V P + EQ I +++ Sbjct: 123 YPRFGNYLFRDGSMVANYLVNSKGVGTIQRNLYWPQLKRALVFFPGLDEQIAIANYLDEK 182 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + D + + Q I+ +A + NPD+ ++ + Sbjct: 183 TSQIDEAIAIKQKQIELLKERKQIIIQQAVT-------QGLNPDVPMKDSGVDWI 230 >UniRef50_Q0RV87 Type I restriction-modification system specificity subunit n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RV87_RHOSR Length = 391 Score = 269 bits (688), Expect = 2e-70, Method: Composition-based stats. 
Identities = 83/425 (19%), Positives = 162/425 (38%), Gaps = 49/425 (11%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP GWV+A + + T G YK + + P+ + Sbjct: 11 LPSGWVVAQMRRIATFRNGADYK----EVEVTEGGYPVYGSG----------------GE 50 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + SQ + + V+ G K + K F F L I ++ ++ Sbjct: 51 FRRASQYLYDGESVLF---GRKGTIDKPLLVSGRFWTVDTMFFTEL--TSNIEPRYLHYY 105 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 + + S + ++ IP+PP+ EQ IA+ LD A++D+ Sbjct: 106 ATTMPF----DYYSTSTALPSMTQGELGGHRIPLPPITEQGAIADFLDRETARIDTLIRE 161 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 ++ ++L+ R AV G V + +++ G K +E Sbjct: 162 QRRLIELLRERRIAVAEGPV-----------VGLSWSTPLRSVTALIQTGPFGSQLKSDE 210 Query: 246 SG-VGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P++ S + G ++ ++ + S++ EL RH L+ GD++ R +G C Sbjct: 211 YETGGTPVINPSHLVMGRIEPDERVAVSASKASELGRHALRAGDVIAARRGE----LGRC 266 Query: 304 GLLKKLQHQNLL-YPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 +++ L LIR R T A PE++ + FSS R+++ + + ++ Sbjct: 267 AVVRAENTGFLCGTGSALIRLRET-VADPEFLALVFSSRRNRDSL-SLASVGATMDNLNA 324 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 I + + +PP+ EQ IV V + DT+ + + + +++ A G++ Sbjct: 325 DIIATLRIPMPPLPEQRRIVESVAEATTKIDTLITETESFIDLAKERRSALITAAVTGQI 384 Query: 423 TAQWR 427 + Sbjct: 385 DVRDE 389 >UniRef50_A6E2R5 Restriction endonuclease S subunits-like protein n=1 Tax=Roseovarius sp. TM1035 RepID=A6E2R5_9RHOB Length = 413 Score = 268 bits (687), Expect = 3e-70, Method: Composition-based stats. 
Identities = 108/458 (23%), Positives = 183/458 (39%), Gaps = 49/458 (10%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P+GW + ++ + G K A ++ P + + Sbjct: 4 VPQGWAQSRLADWLDISTG----KLDANAATENGQYPFFTCAE---------QVSRIDTF 50 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + + G + + VL+P + I GF Sbjct: 51 AFDCEAVLLAGN-------------GNFNLHKYTGKFNAYQRTYVLQPHE-IDLGFTFVA 96 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 KS L +I+ + G+ I ++ P+PPL EQ+ I KLDTL A+ + + Sbjct: 97 LKSLL--PEITKDNRGSTIKYLRLGDIADTAAPLPPLPEQRRIVRKLDTLSARSTTARTH 154 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 I ++++R+R AVL A + + S Sbjct: 155 LTAIEKLVERYRTAVLEAAFRTAWDAGFDTTIAGCLEHAETGLVR---------SKAEQT 205 Query: 246 SGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 +G G+P +R++ AG + D+ ++ + SE R++L+ DLLF N S E VG Sbjct: 206 AGEGYPYIRMNHYDLAGRWNDRDLTYVAATSSEFERYQLRANDLLFNTRN-SAELVGKVA 264 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 + + + L+ + L+R R + D LP + SSP R + + T+ I + Sbjct: 265 IWPEGKD-GYLFNNNLLRMRFSADVLPGFAFWQMSSPPFRRYIEGFISATTSVAAIYQRS 323 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTA 424 + + +P EQ EIVRR+E FA D ++ + AL + +L Q ILAKAF G+L Sbjct: 324 LMAAPFWVPDTDEQREIVRRIETAFAKIDRLKAEAAKALKLLGHLDQRILAKAFAGDLVP 383 Query: 425 QWRAENPDLISGENSAAALLEKIKAERAASGGKKASRK 462 Q + P A LL +I+ RAA+ + R+ Sbjct: 384 QDPTDEP--------AETLLARIREARAATQTSRRRRR 413 >UniRef50_B0VPS8 Specificity determinant for hsdM and hsdR n=1 Tax=Acinetobacter baumannii SDF RepID=B0VPS8_ACIBS Length = 386 Score = 268 bits (686), Expect = 3e-70, Method: Composition-based stats. 
Identities = 116/403 (28%), Positives = 185/403 (45%), Gaps = 23/403 (5%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M K P W IA + V LI G +K + D LP+IR N+ N + Sbjct: 2 MQVSKSPPSWCIASIGEVCNLINGRAFKSTE----WTDRGLPIIRIQNLNN---PDANFN 54 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 F +L + ++ D++ A S + G + ++ + LI Sbjct: 55 FFNGDLDNK-HRVEKGDLLFAWSGTPGTSFG-AHIWDGDIGALNQHIFKIVFNDSLIDKR 112 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 FI + +S G + ++ F+ I PPL EQKIIA+KLDTLLAQV Sbjct: 113 FIRYAIN-QTLDELVSGARGGVGLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVA 171 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 +TK R E+I ILK FRQ++L AV+GKLTE+WR + + + K +I + +G Sbjct: 172 TTKVRLERILNILKTFRQSILSSAVSGKLTEEWRKNKKLNWI--KSTLANICRSVSDGDH 229 Query: 241 SKPNESGVGHPILRISSVRAGHVDQND-IRFLECSESELNR--HKLQDGDLLFTRYNGSL 297 P + G P L IS++ G +D + R++ S E + K + D+L+T Sbjct: 230 QAPPRADFGIPFLVISNISKGEIDFSSVNRWVPESYYESLKDIRKPEINDILYTVTGS-- 287 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 G+ +K + + + +Y+ + +SP + + Sbjct: 288 --FGIPVTVKST--TPFCFQRHIAIIKPNHSSVDYKYLFYYLASPEVFKHATSIAT-GTA 342 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 QK +S +++ +LLPP++EQ EIV RVE+L A+AD IEK++ Sbjct: 343 QKTVSLSHLRNFNILLPPIEEQTEIVHRVEELLAFADGIEKKL 385 Score = 117 bits (294), Expect = 8e-25, Method: Composition-based stats. 
Identities = 41/213 (19%), Positives = 78/213 (36%), Gaps = 14/213 (6%) Query: 216 FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE 275 P + L R + G PI+RI ++ + N F Sbjct: 6 KSPPSWCIASIGEVCNLINGR--AFKSTEWTDRGLPIIRIQNLNNPDANFN---FFNGDL 60 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYI 334 N+H+++ GDLLF G + L + + +I Sbjct: 61 D--NKHRVEKGDLLFAWSGTPGTSFG--AHIWDGDIGAL--NQHIFKIVFNDSLIDKRFI 114 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 + + +++ + G K ++ ++ ++ PP+ EQ I +++ L A T Sbjct: 115 RYAINQTL--DELVSGARGGVGLKHVTKGMFETTKIIFPPLYEQKIIADKLDTLLAQVAT 172 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGELTAQWR 427 + ++ L + QSIL+ A G+LT +WR Sbjct: 173 TKVRLERILNILKTFRQSILSSAVSGKLTEEWR 205 >UniRef50_C2I227 Restriction modification system DNA specificity domain n=1 Tax=Vibrio cholerae TM 11079-80 RepID=C2I227_VIBCH Length = 434 Score = 268 bits (685), Expect = 4e-70, Method: Composition-based stats. Identities = 82/432 (18%), Positives = 158/432 (36%), Gaps = 32/432 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + P + T ++ +K+ + + N D DL + + Sbjct: 16 WNLVPAKRLFT-------SSKEINQGMKESNRLALTMKGVINRSLD--DLQGLQSSDYSV 66 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAHFTKS 128 Q +D+V + + H + I+ F + + Sbjct: 67 YQIFEKDDLVFKLIDLENIKTSRVGIVH--ERGIMSPAYIRVSACSNSIYPRFYYWYFFA 124 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 N + L G N+ I +P+ ++ QK ++ LD ++DS + Sbjct: 125 LYLTNIYNKLGGGV-RQNLTAGDLLEIPVPLIDISLQKQVSAFLDRETQRIDSLIEEKQT 183 Query: 189 IPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGL 239 +LK RQA++ V L +W P+H V KK+ + + + G Sbjct: 184 FITLLKEKRQALISHVVTKGLNPNVEMQDSGIEWIGQVPKHWVVKKIKY--DVLGIEQGW 241 Query: 240 SS----KPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 S P ++++ V G + + L + ++ GDLL +R N Sbjct: 242 SPQCESTPVPDDHTWGVVKVGCVNRGIFNPEQNKKLPEELEPRKEYAIKKGDLLVSRANA 301 Query: 296 
SLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTT 354 E+VG + + NLL DK+ R +L + A PE+ + +S AR + T Sbjct: 302 -KEWVGSAA-VPDRDYDNLLLCDKIYRIKLDLEKADPEFFAYYLASDQAREQIEIDATGT 359 Query: 355 -SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 S I I + + P + EQ IVR ++ + D + +V +++ + S+ Sbjct: 360 SSSMLNIGQGTILNMPIPAPELPEQQSIVRGIKNKTSQIDRLMLEVLDSIELLKEHRTSL 419 Query: 414 LAKAFRGELTAQ 425 ++ A G++ + Sbjct: 420 ISAAVTGKIDVR 431 Score = 129 bits (326), Expect = 2e-28, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 90/214 (42%), Gaps = 8/214 (3%) Query: 3 AGKLPEGWVIAPVS-TVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++P+ WV+ + V + +G + + ++ D +++ + G F+ Sbjct: 218 IGQVPKHWVVKKIKYDVLGIEQGWSPQC-ESTPVPDDHTWGVVKVGCVNRGIFNPEQNKK 276 Query: 62 VPKNLV-KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKLIF 118 +P+ L ++ I D++++ ++ K VG +A ++ + + Sbjct: 277 LPEELEPRKEYAIKKGDLLVSRANA-KEWVGSAAVPDRDYDNLLLCDKIYRIKLDLEKAD 335 Query: 119 SGFIAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 F A++ S R +I + G ++ NI + + IP P L EQ+ I + Sbjct: 336 PEFFAYYLASDQAREQIEIDATGTSSSMLNIGQGTILNMPIPAPELPEQQSIVRGIKNKT 395 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +Q+D ++LK R +++ AV GK+ Sbjct: 396 SQIDRLMLEVLDSIELLKEHRTSLISAAVTGKID 429 Score = 99.0 bits (246), Expect = 4e-19, Method: Composition-based stats. 
Identities = 24/227 (10%), Positives = 81/227 (35%), Gaps = 21/227 (9%) Query: 217 EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSES 276 + + ++ + E+ G+ + L + V +D D++ L+ S+ Sbjct: 13 DSKWNLVPAKRLFTSSKEINQGMKESNRLA------LTMKGVINRSLD--DLQGLQSSDY 64 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 + + DL+F + G++ + + ++ P + + + P + Sbjct: 65 SV-YQIFEKDDLVFKLIDLENIKTSRVGIVHE---RGIMSPAYIRVSACSNSIYPRFYYW 120 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +F + N ++ ++ D+ V L + Q ++ +++ D++ Sbjct: 121 YFFALYLTNIYNKL--GGGVRQNLTAGDLLEIPVPLIDISLQKQVSAFLDRETQRIDSLI 178 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ + + Q++++ + NP++ ++ + Sbjct: 179 EEKQTFITLLKEKRQALISHVVT-------KGLNPNVEMQDSGIEWI 218 >UniRef50_A0Q725 Type I restriction-modification system, subunit S n=2 Tax=Francisella novicida RepID=A0Q725_FRATN Length = 407 Score = 268 bits (685), Expect = 4e-70, Method: Composition-based stats. Identities = 82/430 (19%), Positives = 151/430 (35%), Gaps = 34/430 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI----QNGKFDTT 57 KLP GW + + + K+ D +P RA I QNG D Sbjct: 3 ELYKLPAGWEWKKLGDLFKITSSKRVHKKD----WLDKGIPFYRAREIVKLAQNGYVD-- 56 Query: 58 DLVFVPKNLVK----ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLR 112 + +F+ +++ + DI++ +G + F L+ Sbjct: 57 NELFISEDMYNSFASKYGLPKENDILVT----GVGTLGIPFVVKKNDKFYFKDGNIIWLK 112 Query: 113 PEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKL 172 E +I + S RN+I+S G+ + + + IP+PPLAEQK I KL Sbjct: 113 NENGTNPKYIEYCFSSQDVRNQINSN-NGSTVATYTITNANNTIIPLPPLAEQKRIVAKL 171 Query: 173 DTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESIL 232 D+L ++D +Q + L E N + I Sbjct: 172 DSLFEKIDKAIELHQQNITNANTLMASTLDKTFKKLEGEYGMNDI----------LDGIY 221 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 R G KP P + + + + + + + + K + +L + Sbjct: 222 IGCRKGY--KPEIIDGKVPFIGMQDIDQYNGINTNYVLEDYEKVSKGKTKFEKNAVLVGK 279 Query: 293 
YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 + ++ + ++ + P Y+ F S + +++ + Sbjct: 280 ITPCTQN-NKTSIVPSNINGGFAT-TEVYALHSKNNMNPFYLNYFVRSKDINDYLVSTMI 337 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 +G++ + I S + LPP+ Q + V ++ + D I++ L + L S Sbjct: 338 GATGRQRVPSDAITSLKIPLPPLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKAS 397 Query: 413 ILAKAFRGEL 422 IL KAFRGEL Sbjct: 398 ILDKAFRGEL 407 >UniRef50_C9NRR1 Type I restriction-modification system specificity subunit S n=1 Tax=Vibrio coralliilyticus ATCC BAA-450 RepID=C9NRR1_9VIBR Length = 424 Score = 267 bits (684), Expect = 6e-70, Method: Composition-based stats. Identities = 72/427 (16%), Positives = 151/427 (35%), Gaps = 28/427 (6%) Query: 4 GKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G+ W + +T+ + G T + + +P IR+ N+ + D+ ++ Sbjct: 13 GEFEGSWKTTKLGALTSKVGSGATPRGGEKA--YSTSGIPFIRSQNVNYNRLLLNDIRYI 70 Query: 63 PKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFS 119 P+N ++ I P+DI++ ++ S +G+S F + + ++R + Sbjct: 71 PENTHASMKRSQIQPKDILLNITGAS---IGRSCVVPDCFQDGNLNQHVCIIRLKND-DP 126 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F S + AG + S + P L EQ+ IA L + ++ Sbjct: 127 YFTQSLLASYRGEKLVFQGMAGGGREGLNFESIKGFKMAFPTLPEQQKIASFLSKVDEKI 186 Query: 180 DSTKARFEQIPQILKRFRQAVLG--------GAVNGKLTEKWRNFEPQHSVFKKLNFESI 231 + +++ + K Q + T +++ + + Sbjct: 187 ALLTEKKDKLAEYKKGVMQQLFNGKWQEQDGQLTFIPPTLRFKADDGSEFPDWEEKALGD 246 Query: 232 LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 + +G P G P + V A ++ E E R L+ GD+L T Sbjct: 247 FARIYDGTHQTPKYVDEGVPFYSVEHVTANQFEKTKYISEEVYAKECKRVTLKKGDILLT 306 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 R VG L+ + L + + + +Y+ F SP+ ++ + + Sbjct: 307 RIGS----VGDVRLIDWDVRASFYVS--LALVKYNDEIVGQYLASFMQSPNFQSELWKRM 360 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + K I+ +I +V +P EQ +I + + D N+ L + + Sbjct: 361 
IHVAFPKKINLGEIGHCLVSVPSRDEQTKIANFLSAIDQKID----LANSELEKAKEWKR 416 Query: 412 SILAKAF 418 +L + F Sbjct: 417 GLLQQMF 423 Score = 146 bits (369), Expect = 2e-33, Method: Composition-based stats. Identities = 35/226 (15%), Positives = 87/226 (38%), Gaps = 14/226 (6%) Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK---PNESGVGHPILRISSVRAG 261 + ++ F +K ++ +++ +G + + S G P +R +V Sbjct: 1 MTEQMNVPKLRFGEFEGSWKTTKLGALTSKVGSGATPRGGEKAYSTSGIPFIRSQNVNYN 60 Query: 262 HVDQNDIRFLEC-SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL 320 + NDIR++ + + + R ++Q D+L S +G ++ Q+ + Sbjct: 61 RLLLNDIRYIPENTHASMKRSQIQPKDILLNITGAS---IGRSCVVPDC-FQDGNLNQHV 116 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAE 380 RL D P + + +S + + G++G++ + IK + P + EQ + Sbjct: 117 CIIRLKND-DPYFTQSLLASYRGEKLVFQGMAGG-GREGLNFESIKGFKMAFPTLPEQQK 174 Query: 381 IVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 I + ++ + ++ + ++ + F G+ Q Sbjct: 175 IASFLSKVDEKI----ALLTEKKDKLAEYKKGVMQQLFNGKWQEQD 216 >UniRef50_D2LA90 Restriction modification system DNA specificity domain protein n=1 Tax=Desulfovibrio sp. FW1012B RepID=D2LA90_9DELT Length = 543 Score = 267 bits (683), Expect = 7e-70, Method: Composition-based stats. 
Identities = 90/432 (20%), Positives = 169/432 (39%), Gaps = 32/432 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W + V + G A D + + +G + + V +N K Sbjct: 133 WNTKGIGEVADIFDG-----PHATPKTVDTGPIFLGIGALNDGMINLRETRHVTENDFKT 187 Query: 70 -SQKISP--EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHF 125 ++++ P D+V + ++ +G++A +C G G++R + + F + Sbjct: 188 WTRRVRPQAGDVVFS----YETRLGQAAIIPDNIDCCLGRRMGLVRFKTNEVIPKFFLYQ 243 Query: 126 TKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S YRN + S + GA ++ I F I IP + EQK I LD +D+ A Sbjct: 244 YISPSYRNFLDSKTIRGATVDRISIKEFPFFPIAIPSIEEQKRIVSILDDAFECIDTAIA 303 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 E+ + ++ L + K L + Sbjct: 304 NTEKNIANARELFESYLDRVF---------AEKGDGWEEKNLEDILSFQPRNGWSPPASH 354 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 S G P+L +SSV + +++ + + +++GDLL TR N + E VG Sbjct: 355 HSDRGTPVLTLSSVTGFQFKKEALKYTSAQVNPKAHYWVENGDLLMTRSN-TPELVGHVA 413 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTS-GQKGISG 362 + + N +YPD +++ ++ K L E++ S RN + + + K + Sbjct: 414 VCDGVS-ANTIYPDLIMKMKVDKHIALTEFVYFQLRSSKLRNIIKDGATGANPTMKKVKK 472 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +++ + +P + Q IV + L + + K+ + + + L QS+L KAF GEL Sbjct: 473 STVQNLPLAMPALPVQQAIVDNLRNLNETSRLLVKKCVSKVKALTRLKQSLLQKAFSGEL 532 Query: 423 TAQWRAE-NPDL 433 + NPD Sbjct: 533 ----PMDFNPDA 540 Score = 108 bits (270), Expect = 6e-22, Method: Composition-based stats. 
Identities = 37/220 (16%), Positives = 85/220 (38%), Gaps = 12/220 (5%) Query: 8 EGWVIAPVSTVTTL--IRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +GW + + + G + D P++ +++ +F L + Sbjct: 329 DGWEEKNLEDILSFQPRNGWSPPASHH----SDRGTPVLTLSSVTGFQFKKEALKYTSAQ 384 Query: 66 LVKES-QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCG-VLRPEKLI-FSGFI 122 + ++ + D+++ S+ + +VG A + ++ +K I + F+ Sbjct: 385 VNPKAHYWVENGDLLMTRSN-TPELVGHVAVCDGVSANTIYPDLIMKMKVDKHIALTEFV 443 Query: 123 AHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +SS RN I + GA + +K ++ + + +P L Q+ I + L L Sbjct: 444 YFQLRSSKLRNIIKDGATGANPTMKKVKKSTVQNLPLAMPALPVQQAIVDNLRNLNETSR 503 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH 220 + + L R +Q++L A +G+L + +H Sbjct: 504 LLVKKCVSKVKALTRLKQSLLQKAFSGELPMDFNPDALEH 543 >UniRef50_Q2J5T0 Restriction modification system DNA specificity domain n=1 Tax=Frankia sp. CcI3 RepID=Q2J5T0_FRASC Length = 436 Score = 266 bits (682), Expect = 9e-70, Method: Composition-based stats. Identities = 90/427 (21%), Positives = 174/427 (40%), Gaps = 21/427 (4%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 S G++ G I+ V T + G+T + P +R N+Q G+ +D+ + Sbjct: 12 SFGEIFPG-RISTVGTEFEIQSGITLSPRRTSGR---KDAPYLRVANVQRGRLTLSDVAW 67 Query: 62 VPKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPEKLIFS 119 + + + + D+++ + + +G+ A C + LRP + + + Sbjct: 68 LEASARERIRYALDDGDLLVVEGHANPAEIGRCAQVGPESKNCLYQNHLFRLRP-RNLEA 126 Query: 120 GFIAHFTKSSLYRNKI-SSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F H+ SS ++ + + + + I + IP+PP +Q+ I+E LD Sbjct: 127 RFALHWLNSSFSQSYWGRNCATSSGLYTINSRQLGALPIPVPPPDKQRKISEILDAADEA 186 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + ST+ ++ Q+ R +L V P +L+ S +T Sbjct: 187 IRSTERLVGKLEQVFDSLRGDLLQEHVIRS------GRLPDCWRMDRLDRLSEITGGVTL 240 Query: 239 LSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 V P LR+++V+ G++D DI+ + SE +R+ LQ GD+L T G + Sbjct: 241 GGVTSAGRSVELPYLRVANVQDGYIDTTDIKTVTVRTSEFDRYLLQAGDVLMTE-GGDFD 299 Query: 
299 FVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G + L+ + + R R K LPEY+ + +S + R+ M K T+ Sbjct: 300 KLGRGAVWDGSID-PCLHQNHIFRVRCDKIRLLPEYLSTYSASTAGRSYFMGISKQTTNL 358 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 I+ + + V LPP+ Q I+ + A+ LA++ + Q ++ Sbjct: 359 ASINKSQLSALPVPLPPLATQKMIIGSLGA----AERQISSTKAELAKLRLVKQGLMDDL 414 Query: 418 FRGELTA 424 G + Sbjct: 415 LMGRVQV 421 >UniRef50_C3DG13 Putative uncharacterized protein n=1 Tax=Bacillus thuringiensis serovar sotto str. T04001 RepID=C3DG13_BACTS Length = 409 Score = 266 bits (682), Expect = 1e-69, Method: Composition-based stats. Identities = 74/403 (18%), Positives = 151/403 (37%), Gaps = 25/403 (6%) Query: 38 DDYLPLIRANNIQNGKF-DTTDLVFVPKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAH 95 D +P IR + D+ +V K LVK + K+ P V+ S S Sbjct: 13 DGDIPWIRIEDFNGKYISDSKSRQYVSKELVKGMNLKVFPIGTVLCTCSCSMGATAIV-- 70 Query: 96 QHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLI 155 + P + + S ++ + ++S R ++ + GA + +F+ + Sbjct: 71 ---EQPLISNQTFIGIVPGENLDSEYLFYLMQASAERLQL--FAQGAIQQYLSKHNFEHL 125 Query: 156 NIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---- 211 IP+P L QK + L+ L +D +Q+ +L+ RQ ++ AV L Sbjct: 126 KIPLPSLKIQKRLLVFLNRKLKDLDELIENKKQLIDLLEEKRQTLITEAVTRGLNPNVKM 185 Query: 212 -----KWRNFEPQHSVFKKLNFESILT-ELRNGLSSKPNESGVGHPILRISSVRAGHVDQ 265 +W P+H KK+ S L + G LR +V + Sbjct: 186 KDSGVEWIGEIPEHWTIKKIKHISNLVGSGKTPKGGSEIYPESGVLFLRSMNVHYDGIRL 245 Query: 266 NDIRFLECSESELNRH-KLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 DI + E R +++ D+L S +G ++ + + + I Sbjct: 246 KDIVHITPEIDEDMRSTRVKSKDVLLNITGAS---IGRSCIVPESLGKANVNQHVCIIRS 302 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP-PVKEQAEIVR 383 TK +PE + +S ++ + S ++G++ +K+ L ++EQ EI Sbjct: 303 NTKVVVPELLSKIMASNFIMQQIL-MSQNGSSREGLNFTQVKNLEFPLTRDLQEQIEIAN 361 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQW 426 + +++ + + ++ QS++ + G++ + Sbjct: 362 HISVETNKINSLIGMIEEQIQKLKEYRQSLIYEVVTGKIDVRD 404 Score = 148 bits (374), 
Expect = 5e-34, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 89/214 (41%), Gaps = 11/214 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++PE W I + ++ L+ G T K + + +R+ N+ D+V Sbjct: 193 IGEIPEHWTIKKIKHISNLVGSGKTPKGG--SEIYPESGVLFLRSMNVHYDGIRLKDIVH 250 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPE-KLI 117 + + + S ++ +D+++ ++ S +G+S + + ++R K++ Sbjct: 251 ITPEIDEDMRSTRVKSKDVLLNITGAS---IGRSCIVPESLGKANVNQHVCIIRSNTKVV 307 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLL 176 ++ S+ +I G++ + + P+ L EQ IA + Sbjct: 308 VPELLSKIMASNFIMQQILMSQNGSSREGLNFTQVKNLEFPLTRDLQEQIEIANHISVET 367 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +++S E+ Q LK +RQ+++ V GK+ Sbjct: 368 NKINSLIGMIEEQIQKLKEYRQSLIYEVVTGKID 401 Score = 123 bits (310), Expect = 1e-26, Method: Composition-based stats. Identities = 29/208 (13%), Positives = 74/208 (35%), Gaps = 20/208 (9%) Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIR-FLECSESE-LNRHKLQDGDLLFTRYNG 295 + KP + P +RI ++ + R ++ + +N G +L T Sbjct: 4 PMRDKPTKFDGDIPWIRIEDFNGKYISDSKSRQYVSKELVKGMNLKVFPIGTVLCT---- 59 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 +G ++ Q L+ I ++ EY+ + + R + + Sbjct: 60 CSCSMGATAIV----EQPLISNQTFIGIVPGENLDSEYLFYLMQASAERLQLF---AQGA 112 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q+ +S + + + LP +K Q ++ + + D + + + + Q+++ Sbjct: 113 IQQYLSKHNFEHLKIPLPSLKIQKRLLVFLNRKLKDLDELIENKKQLIDLLEEKRQTLIT 172 Query: 416 KAFRGELTAQWRAENPDLISGENSAAAL 443 +A R NP++ ++ + Sbjct: 173 EAVT-------RGLNPNVKMKDSGVEWI 193 >UniRef50_Q5KVU6 Type I restriction-modification system specificity determinant n=1 Tax=Geobacillus kaustophilus RepID=Q5KVU6_GEOKA Length = 438 Score = 266 bits (681), Expect = 1e-69, Method: Composition-based stats. 
Identities = 74/433 (17%), Positives = 164/433 (37%), Gaps = 29/433 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDLVFVP 63 ++P W + + +T + RG + + Y D+ +R +++ + Sbjct: 20 EVPSEWQVLQIKRLTRVRRGASPRPIDDPIYFDDNGEYSWVRISDVTKSNMYLEETEQKL 79 Query: 64 KNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 NL S K+ P ++ +++ + VGK ++ G V P+ F+ Sbjct: 80 SNLGSSLSVKLEPGELFLSI----AATVGKPCITNVKCCIYDG---FVYFPDYRGDKRFL 132 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + ++ + L N+ + I I +P + EQK+I++ LD + ++DS Sbjct: 133 YYIFEAGEAYRGLGKLG---TQLNLNTDTVGSIYIAVPTIQEQKMISDFLDEKVHEIDSL 189 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 A E++ ++L+ RQ ++ AV L +W P+ K+ +++ + Sbjct: 190 IADKEKLIELLEEKRQVIITEAVTKGLNPNVKMKDSGVEWIGEMPESWEVSKIKYQADIN 249 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 + S+ + + + ISSV + N ++ R L+ GD + + Sbjct: 250 KY---TLSENTDEDLEIKYIDISSVNSRGEVVNIEKYYFKDAPSRARRILRKGDTIISTV 306 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 L + +NL+ + P+Y+ S + ++ Sbjct: 307 RTYL----KAITWFEEVEENLICSTGFAVLSPKETIYPKYLFYLMRSTKYIDEIVKR-SI 361 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 I+ +I LLP + EQ IV ++ D + ++ + ++ QS+ Sbjct: 362 GVSYPAITSTEIGMMECLLPNINEQKMIVEYIDNELKKIDGLVDEIKLQIQKLKEYRQSL 421 Query: 414 LAKAFRGELTAQW 426 + +A G++ + Sbjct: 422 IYEAVTGKIDVRD 434 Score = 139 bits (350), Expect = 3e-31, Method: Composition-based stats. 
Identities = 39/209 (18%), Positives = 89/209 (42%), Gaps = 8/209 (3%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVF 61 G++PE W ++ + + K + N +D + I +++ + G+ + + Sbjct: 230 IGEMPESWEVSKIKYQADIN-----KYTLSENTDEDLEIKYIDISSVNSRGEVVNIEKYY 284 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + D +I+ + + + + VL P++ I+ + Sbjct: 285 FKDAPSRARRILRKGDTIISTVRTYLKAI--TWFEEVEENLICSTGFAVLSPKETIYPKY 342 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + +S+ Y ++I S G + I ++ +P + EQK+I E +D L ++D Sbjct: 343 LFYLMRSTKYIDEIVKRSIGVSYPAITSTEIGMMECLLPNINEQKMIVEYIDNELKKIDG 402 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + Q LK +RQ+++ AV GK+ Sbjct: 403 LVDEIKLQIQKLKEYRQSLIYEAVTGKID 431 Score = 116 bits (292), Expect = 2e-24, Method: Composition-based stats. Identities = 33/238 (13%), Positives = 78/238 (32%), Gaps = 29/238 (12%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKP------NESGVGHPILRISSVRAGHVDQ 265 +W P ++ LT +R G S +P + + +RIS V ++ Sbjct: 16 EWLREVPSEWQVLQIKR---LTRVRRGASPRPIDDPIYFDDNGEYSWVRISDVTKSNMYL 72 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 + + KL+ G+L + VG + + +Y + Sbjct: 73 EETEQKLSNLGSSLSVKLEPGELFLSI----AATVGKPCI---TNVKCCIYDGFVYF--P 123 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 ++ F + A + Q ++ + S + +P ++EQ I + Sbjct: 124 DYRGDKRFLYYIFEAGEAYRGLGKL----GTQLNLNTDTVGSIYIAVPTIQEQKMISDFL 179 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ D++ + + Q I+ +A + NP++ ++ + Sbjct: 180 DEKVHEIDSLIADKEKLIELLEEKRQVIITEAVT-------KGLNPNVKMKDSGVEWI 230 >UniRef50_A8V066 Type I restriction-modification enzyme, S subunit (Fragment) n=1 Tax=Hydrogenivirga sp. 128-5-R1-1 RepID=A8V066_9AQUI Length = 475 Score = 266 bits (681), Expect = 1e-69, Method: Composition-based stats. 
Identities = 90/437 (20%), Positives = 168/437 (38%), Gaps = 31/437 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT----DLVF 61 +PE W + + + + +G T K+ Y +I+ + +N KF + F Sbjct: 2 IPEDWEVVRLGDIAEIQQGKTPKR---DLYDDRKGYRIIKVKDFENEKFVKHYPNGERSF 58 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGK---SAHQHLPFECSFGAFCGVLRPEKLIF 118 V +L + DI+I + S VVG+ + + + F + +R Sbjct: 59 VKVDLGNRYT-LEQGDILILSAGHSSKVVGQKIGFYNVNSNNKVFFVSELLRIRANNKTN 117 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F+ S R +I G ++ P + IP+PPL EQK IA LD + Sbjct: 118 PLFLFFSIISQKSRKQIKEEIKGG---HLYPRDLVNLKIPLPPLPEQKAIATVLDKIRQA 174 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAV-------NGKLTEKWRNFEPQHSVFKKLNFESI 231 ++ T+ + ++ K + V KL E P+H K L Sbjct: 175 IEQTEEVIQANKELKKSLMKHFFTYGVVPPEETDKVKLKETEIGLIPEHWEIKTLKDSVD 234 Query: 232 LTELRNGLSSKPNESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 E +S NE G PI+ + + + G + N IR ++ + + L+DGD+LF Sbjct: 235 SIEYGYSVSIPANEDQKGIPIISTADITKEGKLLYNKIRKIKPPKRLTEKLILKDGDVLF 294 Query: 291 TRYNGSLEFVGVCGLLKKLQ---HQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNA 346 N S E +G + + + +Y ++R R + ++ Y++ + Sbjct: 295 NWRN-SPELIGKTTVFEAEKVSKDDFYIYASFILRIRSKESESNNFYLKYLLNYYREIGT 353 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + Q + +I + + LPP+ EQ +I + + D + N + Sbjct: 354 FIKLARRAVNQANYNRNEIYNLKIPLPPIDEQKQIAKILN----KIDNKIEAEENKKEAL 409 Query: 407 NNLTQSILAKAFRGELT 423 L +S+L G++ Sbjct: 410 EKLFKSLLNNLMTGKIR 426 Score = 127 bits (321), Expect = 6e-28, Method: Composition-based stats. 
Identities = 38/240 (15%), Positives = 90/240 (37%), Gaps = 16/240 (6%) Query: 2 SAGKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDL 59 G +PE W I + + G + +P+I +I + GK + Sbjct: 216 EIGLIPEHWEIKTLKDSVDSIEYGYSVS---IPANEDQKGIPIISTADITKEGKLLYNKI 272 Query: 60 VFV-PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAH-----QHLPFECSFGAFCGVLRP 113 + P + E + D++ + S ++GK+ + +F +R Sbjct: 273 RKIKPPKRLTEKLILKDGDVLFNWRN-SPELIGKTTVFEAEKVSKDDFYIYASFILRIRS 331 Query: 114 EK-LIFSGFIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEK 171 ++ + ++ + L+ N N + IP+PP+ EQK IA+ Sbjct: 332 KESESNNFYLKYLLNYYREIGTFIKLARRAVNQANYNRNEIYNLKIPLPPIDEQKQIAKI 391 Query: 172 LDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI 231 L+ + ++++ + + E + ++ K ++ G + L + + + + +K+ S Sbjct: 392 LNKIDNKIEAEENKKEALEKLFKSLLNNLMTGKIR--LNKNFIEKFEKEEIHQKMKELSK 449 >UniRef50_A3JH04 Specificity determinant for hsdM and hsdR n=1 Tax=Marinobacter sp. ELB17 RepID=A3JH04_9ALTE Length = 479 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. 
Identities = 159/491 (32%), Positives = 238/491 (48%), Gaps = 91/491 (18%) Query: 26 TYKKEQAINYLKDDYLPLI-RANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSS 84 T KK + + L+ P+I + N G D D + I+ D +I Sbjct: 23 TGKKVKTKDCLQTGRFPVIDQGQNPVAGYVDDPD------------RLINVSDPLIVFGD 70 Query: 85 GSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSAGAN 143 ++ A + + F GA +L+PE +F F + +S NK S Sbjct: 71 HTR------AVKWVDFSFVPGADGTKILQPEPYLFPRFAYYQLRSLEIPNKGYSR----- 119 Query: 144 INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGG 203 + + PLAEQK IA KLDTLLAQV++TKAR E+IP ILKRFRQ+VL Sbjct: 120 ----HFKFLKELKFEVAPLAEQKTIAVKLDTLLAQVENTKARLERIPTILKRFRQSVLAA 175 Query: 204 AVNGKLTEKWR------------------------------------------------N 215 AV+G+LTE+WR Sbjct: 176 AVSGRLTEEWRNNRTTKSSPKKLLNHFEELRQIAVQDENLRTGKKTKYKPVTIDTYGTPG 235 Query: 216 FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECS 274 P + + E++ T++ +G+ KP G P + + ++ G+ + + ++ Sbjct: 236 DLPNSWYW--IPVEALATKVTDGVHKKPTYISNGVPFITVKNLTKGNGISFTETNYISTH 293 Query: 275 ESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE 332 + E R + GD+L ++ +GV ++ ++ L + ++ Sbjct: 294 DHEEFCKRTNPEKGDILISKDG----TLGVVRQIRTDAIFSIFVSVAL--VKPADRSMSN 347 Query: 333 YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYA 392 Y+E+ F S + M+ +G + I D++ ++ +PP++EQ EIV +V+QLFAYA Sbjct: 348 YLELAFQSSVVQGQMIGV---GTGLQHIHLIDLRKDLIPVPPLEEQIEIVHQVDQLFAYA 404 Query: 393 DTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERA 452 + +E+QVNNALARVN LTQSILAKAFRGELT QWR +NP+LISGENSAAALLE+IK ERA Sbjct: 405 ERVEQQVNNALARVNKLTQSILAKAFRGELTEQWRKDNPNLISGENSAAALLERIKVERA 464 Query: 453 ASGGKKASRKK 463 A A+RK+ Sbjct: 465 AMKPTNAARKR 475 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. 
Identities = 48/219 (21%), Positives = 87/219 (39%), Gaps = 14/219 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDLVFV 62 G LP W PV + T + +KK + +P I N+ G T+ ++ Sbjct: 235 GDLPNSWYWIPVEALATKVTDGVHKK----PTYISNGVPFITVKNLTKGNGISFTETNYI 290 Query: 63 PKNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + +E K DI+I+ +G S +++P S Sbjct: 291 STHDHEEFCKRTNPEKGDILISKD----GTLGVVRQIRTDAIFSIFVSVALVKPADRSMS 346 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ +SS+ + ++ + G + +I IP+PPL EQ I ++D L A Sbjct: 347 NYLELAFQSSVVQGQM--IGVGTGLQHIHLIDLRKDLIPVPPLEEQIEIVHQVDQLFAYA 404 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 + + + + + Q++L A G+LTE+WR P Sbjct: 405 ERVEQQVNNALARVNKLTQSILAKAFRGELTEQWRKDNP 443 >UniRef50_Q112D6 Restriction modification system DNA specificity domain n=1 Tax=Trichodesmium erythraeum IMS101 RepID=Q112D6_TRIEI Length = 402 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. Identities = 74/417 (17%), Positives = 158/417 (37%), Gaps = 21/417 (5%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W V V ++ T + +P +R NNIQ+GK + D++F+ + Sbjct: 3 WQRVFVEDVAKIVTKGTTPTS-IGFSFSKEGIPFLRVNNIQDGKINLGDVLFIDSKTDQA 61 Query: 70 --SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPEKLIFSGFIAHFT 126 +I +D++I++ +GK+A + ++R + + H+ Sbjct: 62 LARSRILKKDVIISI----AGTIGKTAVIPTNAPAMNCNQALAIIRLHNNVDPYYFNHWL 117 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 + +I+ A I+N+ + IP+PP+ EQ+ IA LD A + + Sbjct: 118 NTGDAFRQITGSKVTATISNLSLGCIKKLKIPLPPIEEQRRIAAILDQADA----IRRKR 173 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 +Q + ++ + + +I Sbjct: 174 QQAIALTDELLRSTFLEMFGDPVINPKGWEVKKLEEVALKRKGAIKCGPFGSQLLISEFV 233 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 G P+ I +V+ +++ + E L +QD D+L +R VG + Sbjct: 234 KDGIPVYGIDNVQKNEFVWAKPKYITTEKYEQLKSFSIQDEDVLISRTG----TVGRTCV 289 Query: 306 
LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 +++L P+ L + T LP+Y+ + + + + + + ++ Sbjct: 290 APPDIPRSILGPNLLKVSLNTNKMLPKYLSYALNHSNPLIEEIKRMSPGATVAVFNTTNL 349 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 K+ + +P + Q++ V + +++ +N L NNL S+L +AF+G+L Sbjct: 350 KALRLTIPHINLQSQFVNF----TENVELTKQKESNYLTESNNLFNSLLQRAFKGQL 402 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 75/212 (35%), Gaps = 17/212 (8%) Query: 7 PEGWVIAPVSTVTTLIRG----VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 P+GW + + V +G + + I+ D +P+ +N+Q +F ++ Sbjct: 199 PKGWEVKKLEEVALKRKGAIKCGPFGSQLLISEFVKDGIPVYGIDNVQKNEFVWAKPKYI 258 Query: 63 PKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKLIF 118 + +S I ED++I+ + VG++ S V + Sbjct: 259 TTEKYEQLKSFSIQDEDVLISRT----GTVGRTCVAPPDIPRSILGPNLLKVSLNTNKML 314 Query: 119 SGFIAHFTK-SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++++ S+ +I +S GA + + + + IP + Q Sbjct: 315 PKYLSYALNHSNPLIEEIKRMSPGATVAVFNTTNLKALRLTIPHINLQSQFVNF----TE 370 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 V+ TK + ++L A G+L Sbjct: 371 NVELTKQKESNYLTESNNLFNSLLQRAFKGQL 402 >UniRef50_C6Q0B1 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium carboxidivorans P7 RepID=C6Q0B1_9CLOT Length = 407 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. 
Identities = 91/428 (21%), Positives = 173/428 (40%), Gaps = 34/428 (7%) Query: 5 KLPEGWVIAPVST-VTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG---KFDTTDLV 60 KLP+ W + + TL G K D+ +P + +I N L Sbjct: 2 KLPKEWKEVNLKEYILTLESGKRPKGGAI-----DNGVPSLGGEHINNTGGFNIQIDKLK 56 Query: 61 FVPKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 +VP+ K+ + + DI+I + + + E ++R + + Sbjct: 57 YVPREFFKKMKSGVVKKNDILIVKDGATTGKIAFVDNNFNLKEACINEHLFLIRTNERLN 116 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + F++++ +S+ R KI GA + I NI +PPL QK I + L+ Sbjct: 117 NKFLSYYLRSNTGRKKILEDFRGATVGGISKNFI-DFNILLPPLETQKKIVKVLEKAEET 175 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 ++ K + +++K + G + K S++ + G Sbjct: 176 LEKRKESINLLDKLVKSRFIGMFGDP------------SSNPKGWNKDTIGSVVKSITAG 223 Query: 239 LSSK---PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 S+ + +L++S+V G+ ++ + + + GDLLF+R N Sbjct: 224 WSANGEAREKREDEKAVLKVSAVTQGYFKADEYKVIGDDVEIKKYVFPEKGDLLFSRAN- 282 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + E VG ++ K + +LL PDKL + + Y++ S PS R TS Sbjct: 283 TREMVGATCIIHK-DYPDLLLPDKLWKVSFVERVNVFYMKYILSEPSIRAEFSAKSTGTS 341 Query: 356 G-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 G +S KS + +PP++ Q + V Q+ D ++ ++ +L + + +S++ Sbjct: 342 GSMYNVSMDKFKSIEITIPPIELQNQFADFVNQV----DKLKFEMEKSLKELEDNFKSLM 397 Query: 415 AKAFRGEL 422 KAF+GEL Sbjct: 398 QKAFKGEL 405 >UniRef50_A3UV36 Type I restriction enzyme specificity protein n=1 Tax=Vibrio splendidus 12B01 RepID=A3UV36_VIBSP Length = 496 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. 
Identities = 136/492 (27%), Positives = 222/492 (45%), Gaps = 68/492 (13%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 MS +LP+GW+ + ++ K+ + K +Y+ I + + + + Sbjct: 1 MS--ELPKGWITIKIDSLCAK-----PKQLKPEASWKFNYID-ISSVDREKKLICEPSEI 52 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + ++ D++++M+ + + V K ++ S G VL+P LI S Sbjct: 53 LGSDAPSRARKIVNTGDVLVSMTRPNLNAVAKVPEKYNGQVASTG--FDVLKPF-LIESD 109 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ +S + + IS + GA K + +P+PPLAEQK I EKLD +LAQVD Sbjct: 110 WLFSVVRSQPFIDSISGTTIGALYPACKTSDIRDYEMPLPPLAEQKRIVEKLDEVLAQVD 169 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE----------------------- 217 + KAR + IP +LKRFRQ+VL AV+G LT++WR Sbjct: 170 TIKARLDGIPDLLKRFRQSVLASAVSGTLTKEWRLTNELTKAEEELKSNFLAKSGKLKLR 229 Query: 218 --------------PQHSVFKK-----LNFESILTELRNGLSSK-PNESGVGHPILRISS 257 P + + + + + G K + G PI+ + Sbjct: 230 GKQTNFSELSLITLPDSWTWAQNYKLAKDESNAICAGPFGTIFKAKDFRDEGVPIIFLRH 289 Query: 258 VRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLL 315 V+ +QN +++ E + + G+LL T+ G C + + ++ Sbjct: 290 VKEIGFNQNKPNYMDGDVWEELHQEYSVHGGELLVTKLGDPP---GECCIYPENMGTAMV 346 Query: 316 YPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPV 375 PD L L +Y+ +F+SP ++ + + + I K + LP + Sbjct: 347 TPDVLKMNVDEDIVLRKYLRSYFNSP-ISTEIIEALAFGATRLRIDIAMFKGFPIPLPSM 405 Query: 376 KEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLIS 435 +EQ EIVR V+Q FA+ADTIE QV A A+V+NLTQSILAKAFRGEL +Q ++ P Sbjct: 406 EEQKEIVRLVDQYFAFADTIEAQVKKAQAKVDNLTQSILAKAFRGELVSQDPSDEP---- 461 Query: 436 GENSAAALLEKI 447 A LLE+I Sbjct: 462 ----ADKLLERI 469 >UniRef50_Q57594 Type-1 restriction enzyme MjaXIP specificity protein n=2 Tax=Methanocaldococcus RepID=T1S1_METJA Length = 425 Score = 265 bits (679), Expect = 2e-69, Method: Composition-based stats. 
Identities = 66/432 (15%), Positives = 158/432 (36%), Gaps = 27/432 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTDL 59 G++PE W I + V I+ K Y K+ +P ++ +I N T + Sbjct: 12 EIGEIPEDWEIVELKDVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNSNKYLTNTKI 71 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + L + I P++ V+ GS +G++A + + A G++ + ++ S Sbjct: 72 KITEEGLNNSNAWIVPKNSVLFAMYGS---IGETAINKIEV-ATNQAILGIIPKDNILES 127 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ + + +N S L N+ IP+PPL EQK IA+ L + + Sbjct: 128 EFLYYILAKN--KNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQIAKILTKIDEGI 185 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + ++ +I K +L + + + P+ ++ + Sbjct: 186 EIIEKSINKLERIKKGLMHKLLTKGIGHSRFKKSEIGEIPEDWEVFEIKDIFEVKTGTTP 245 Query: 239 LSSKPNESGVG-HPILRISSVR----AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 + K G + + ++ ++ + + + + N + + G ++ + Sbjct: 246 STKKSEYWENGEINWITPLDLSRLNEKIYIGSSERKVTKIALEKCNLNLIPKGSIIIS-- 303 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 + VG +L N + E+ + + ++ + Sbjct: 304 --TRAPVGYVAVLTVESTFNQGCKGLF--QKNNDSVNTEFYAYYLK---FKKNLLENLSG 356 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 S K +S +++ + LPP++EQ +I + + + D + ++ + + I Sbjct: 357 GSTFKELSKSMLENFKIPLPPLEEQKQIAKILSSV----DKSIELKKQKKEKLQRMKKKI 412 Query: 414 LAKAFRGELTAQ 425 + G++ + Sbjct: 413 MELLLTGKVRVK 424 Score = 124 bits (311), Expect = 9e-27, Method: Composition-based stats. 
Identities = 27/218 (12%), Positives = 76/218 (34%), Gaps = 24/218 (11%) Query: 209 LTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK----PNESGVGHPILRISSVR--AGH 262 + P+ +L + +++ G + K P ++I + + Sbjct: 8 FKKTEIGEIPEDWEIVELK--DVCKKIKAGGTPKTSVEEYYKNGTIPFVKIEDITNSNKY 65 Query: 263 VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 + I+ E + N + +LF Y +G + + ++ Sbjct: 66 LTNTKIKITEEGLNNSNAWIVPKNSVLFAMYGS----IGETAI----NKIEVATNQAILG 117 Query: 323 ARLTKDAL-PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 + L E++ + + + + QK ++ + +KS + LPP++EQ +I Sbjct: 118 IIPKDNILESEFLYYILAKN---KNYYSKLGMQTTQKNLNAQIVKSFKIPLPPLEEQKQI 174 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 + + D + + ++ ++ + + ++ K Sbjct: 175 AKIL----TKIDEGIEIIEKSINKLERIKKGLMHKLLT 208 >UniRef50_D0J4L5 Putative uncharacterized protein n=1 Tax=Comamonas testosteroni CNB-2 RepID=D0J4L5_COMTE Length = 429 Score = 265 bits (678), Expect = 3e-69, Method: Composition-based stats. Identities = 81/433 (18%), Positives = 163/433 (37%), Gaps = 35/433 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P W + P+ VT+L ++ ++ + L + ++ + T L Sbjct: 16 GNVPSHWDVQPLRAVTSLKSDKNRPDLPVLSVYREYGVIL---KDSRDDNHNATSL---- 68 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + P D+V+ + +G S+H + R ++ Sbjct: 69 --DTSTYKVVKPGDLVVNKMKAWQGSMGVSSHHGIVSPAYITCTTKADRAR----PAYLH 122 Query: 124 HFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +SS +SLS G ++ F I IP+PP EQ I LD A++D+ Sbjct: 123 YLLRSSPLIGVYNSLSYGVRVGQWDMHYEDFKQIPIPLPPNDEQDRIVAFLDQKTAEIDA 182 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESIL 232 + E++ +LK + ++ AV L W P H ++ +L Sbjct: 183 AIEKKERLASLLKEQQFKLINLAVTKGLDPNAAMTCGRSPWIESYPAHWQLMRIKH--VL 240 Query: 233 TELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLEC--SESELNRHKLQDGDLL 289 + + P G ++R S+V+ G + + ++ + R GD+L Sbjct: 241 RAIVDTEHKTPPMYEEGPALMVRTSNVKNGELVFKNAKYTDELTYRRWTRRAIPVAGDIL 300 Query: 290 
FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 FTR G +L + L + + P + S +A+ A + Sbjct: 301 FTREAP----AGEACVLPDGIKAAIGQRMVLFKVDP-ERLDPHFAVHSIYSGAAK-AFIE 354 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + S + DI + +LLPP++EQ +I ++ + + N + ++ L Sbjct: 355 LLSVGSTVAHFNMSDIGNIPLLLPPLQEQQKIAVGIKSIQRQFQPLIDSAANGIEQLQEL 414 Query: 410 TQSILAKAFRGEL 422 ++++A A G++ Sbjct: 415 KRTLIASAVLGQI 427 Score = 115 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 44/254 (17%), Positives = 77/254 (30%), Gaps = 36/254 (14%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV--D 264 W P H + L S K +++ P+L + G + D Sbjct: 8 KPSEATWLGNVPSHWDVQPLRAV---------TSLKSDKNRPDLPVLSVYR-EYGVILKD 57 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 D S ++ GDL+ + +GV H ++ P + Sbjct: 58 SRDDNHNATSLDTSTYKVVKPGDLVVNKMKAWQGSMGVSS------HHGIVSPAYITCTT 111 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTT--SGQKGISGKDIKSQVVLLPPVKEQAEIV 382 A P Y+ S + N + GQ + +D K + LPP EQ IV Sbjct: 112 KADRARPAYLHYLLRSSPLIG-VYNSLSYGVRVGQWDMHYEDFKQIPIPLPPNDEQDRIV 170 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLIS------- 435 ++Q A D ++ + + ++ A + +P+ Sbjct: 171 AFLDQKTAEIDAAIEKKERLASLLKEQQFKLINLAVT-------KGLDPNAAMTCGRSPW 223 Query: 436 -GENSAAALLEKIK 448 A L +IK Sbjct: 224 IESYPAHWQLMRIK 237 >UniRef50_Q2B8V0 Type I restriction modification system, subunit S n=1 Tax=Bacillus sp. NRRL B-14911 RepID=Q2B8V0_9BACI Length = 435 Score = 265 bits (678), Expect = 3e-69, Method: Composition-based stats. 
Identities = 67/436 (15%), Positives = 145/436 (33%), Gaps = 30/436 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTD 58 G++P W + + V +I G T K Y + + +I + T+ Sbjct: 17 ELGEIPVEWEVRLIKEVADVISGGTPSKA-VTEYWNEGTILWATPTDITRNNSKYIYETE 75 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 L L K S + P ++ S + + S + Sbjct: 76 LSITELGLKKSSANLLPAGSILMTSRATIGERSIAT-----APISTNQGFKSFVCHDGLS 130 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 + ++ ++ + + ++G+ + + + IPP EQ+ I E L T+ Q Sbjct: 131 NEYMYYYL--EILKQYFLLNASGSTFLEVSKQVIENQVMAIPPHKEQQKIVEVLSTVDEQ 188 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFE---SILTE 234 +++T+ E+ ++ K Q +L + + P KKL ++ Sbjct: 189 IENTEQLIEKTKELKKGLMQQLLTKGIGHTEFKVTEIGEIPVEWEAKKLEDLISDKVVIS 248 Query: 235 LRNGLSSK-----PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDG-DL 288 +G G P + +S+ +G +D + ++L + + D+ Sbjct: 249 HIDGNHGSLYPRASEFVDRGTPYISANSIVSGSIDFSKAKYLSEERGNKFKKGVAKNEDV 308 Query: 289 LFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMM 348 LF VG +LK + +L + LP Y+ + SP + Sbjct: 309 LFAH----NATVGPVAILKTSAPKVILSTSLTLYRCDNNFLLPSYLSYYLDSPMFKIQYQ 364 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + + + + + + L+P ++EQ I + D R Sbjct: 365 KVMSQ-TTRNQVPITAQRKFLFLIPTIQEQEIIANTL----GLVDERINYFTQEKERYTE 419 Query: 409 LTQSILAKAFRGELTA 424 L + ++ + G++ Sbjct: 420 LKKGLMQQLLTGKIRV 435 >UniRef50_B1XQR8 Type 1 restriction-modification system specificity subunit n=1 Tax=Synechococcus sp. PCC 7002 RepID=B1XQR8_SYNP2 Length = 398 Score = 265 bits (678), Expect = 3e-69, Method: Composition-based stats. 
Identities = 75/419 (17%), Positives = 165/419 (39%), Gaps = 29/419 (6%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ--NGKFDTTDLVFVPKNLV 67 W + + + + RG + + ++ + D + I+ + + T P+ + Sbjct: 3 WEVKTLDDLCDIARGGSPRPIKSYLTNEPDGINWIKIGDASASSKYIYETQEKIKPEGI- 61 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 K+S+ + P D +++ S G+ C + + L ++ +F Sbjct: 62 KKSRFVEPGDFLLSNS----MSFGRPYIMRT-SGCIHDGWLVLKDKSGLFDQDYLYYFLG 116 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 S + L+AG+ + N+ + +P+PP+AEQK I E LD + ++ +A Sbjct: 117 SQAAYKQFDKLAAGSTVRNLNTTLVKKVLVPVPPIAEQKRIVEILDESFSGIERAEAIAR 176 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESG 247 Q + + L + K I + + Sbjct: 177 QNLTNARELFDSYLNKIFLDFVERK-----------NTQTLNCITDLIVDCEHKTAPTQE 225 Query: 248 VGHPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 G P +R ++ GH+ +++ + + R K Q GDL+ R G G+ Sbjct: 226 TGFPSIRTPNIGKGHLILDNVYRVSEETYKQWTRRAKPQSGDLILAREAP----AGNVGV 281 Query: 306 LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + + + + + + R ++ P+Y+ F P + +++ + + + ++ KDI Sbjct: 282 IPEGE--RVCLGQRTVLIRPKENINPQYLAFFLLHPKMQERLLSK-SSGATVQHVNMKDI 338 Query: 366 KSQVV-LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++ + LPP++ Q ++ + + + +E+ + + L QSIL KAF G+LT Sbjct: 339 RALKMGDLPPIEIQDRLIESLLDVQEKSKKLEEVYQRKIEALGKLKQSILQKAFSGQLT 397 >UniRef50_A6W078 Restriction modification system DNA specificity domain n=1 Tax=Marinomonas sp. MWYL1 RepID=A6W078_MARMS Length = 400 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. 
Identities = 83/423 (19%), Positives = 163/423 (38%), Gaps = 38/423 (8%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GWV+A + V + G + +PL+ + ++ NGK D + ++ + Sbjct: 10 LPKGWVLAKANDVMDVRDG-----THDSPKAQATGIPLVTSKSLVNGKIDYSTCTYISEQ 64 Query: 66 LVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + K + DI+ AM +G F+ S + + + +I Sbjct: 65 DHESISKRSAVDDGDILYAMI----GTIGNPVIVKKDFDFSIKNVALFKFTKTDLSNRYI 120 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 H+ S L + + + S G + + + IP+PPL EQK IA LD A Sbjct: 121 FHYLNSGLAKRQFENNSRGGTQKFVSLGNIRELMIPLPPLEEQKRIAAILDKADA----I 176 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 + + +Q + F ++V +T + + ++ +G Sbjct: 177 RRKRQQAIDLADEFLRSVFLDMFGDPVTNPKGKRI--------VPLIELCNKVTDGTHQS 228 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLE-CSESELNRHK-LQDGDLLFTRYNGSLEFV 300 P G P L IS++ G + + +F+ + EL R ++ GD+L+T Sbjct: 229 PKWEESGIPFLFISNIVNGKISFDTNKFISKETLDELTRSTPIEKGDVLYTTVGSY---- 284 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 G + + + + + E++ +S R + V QK Sbjct: 285 GNVARV--TDDTEFCFQRHIAHIKPNHEIVNAEFLTSMLASSVVRRQADSLV-RGIAQKT 341 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 ++ +++K +V ++ Q ++ VE I+ +N++ + N S++ KAF Sbjct: 342 LNLRELKEILVFDVSLENQKSYLKIVEP----IHKIKDNYDNSVNELLNNFNSLIQKAFS 397 Query: 420 GEL 422 GEL Sbjct: 398 GEL 400 >UniRef50_C3NN82 Restriction modification system DNA specificity domain protein n=1 Tax=Sulfolobus islandicus Y.N.15.51 RepID=C3NN82_SULIN Length = 576 Score = 265 bits (677), Expect = 3e-69, Method: Composition-based stats. 
Identities = 89/438 (20%), Positives = 175/438 (39%), Gaps = 23/438 (5%) Query: 1 MSAGKLPEGWVIAPVST-VTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTT 57 + G+ P+ W + + + G T ++ + + +P + +I T Sbjct: 9 IDIGEFPKDWDVRKLKDVIIKAKSGGTPRRN--VPEYWNGNIPFAKIQDITKSGKYLYNT 66 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 + K L + I P+D ++ GS V + + A G++ + +I Sbjct: 67 EEFITEKGLENSNAWIVPKDSLLLTIYGSLGFVAINKIPV----ATNQAIIGIIPNKNII 122 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + F+ ++ ++ S N+ ++PI PL EQK I E L Sbjct: 123 DTEFLYYW--YLYFKPYWSKFIKKGTQPNLTLEIVLNSSVPILPLEEQKKIVELLQKATD 180 Query: 178 ----QVDSTKARFEQIPQILKRFRQAVLGGAVNGKL-TEKWRNFEPQHSVFKKLNFESIL 232 D I K R+ +L + + E P+ ++LN +I Sbjct: 181 IYYTLKDYIIQIRNSTETITKVIRKELLTKGIGHRDYVETDIGEFPKDWEVRRLNEIAI- 239 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRA--GHVDQNDIRFLECSESELNRHKLQDGDLLF 290 +R+G S + + LR ++ + + I ++ S + R+ L+ D++ Sbjct: 240 --IRSGFSERKRDENSKVIHLRPDNIDNETDRIVFHRIVYIPESPK-IERYLLRHLDIVL 296 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR-LTKDALPEYIEIFFSSPSARNAMMN 349 NGS++ +G G++ +Q + + + L R ++KD P YI S + Sbjct: 297 VNTNGSIDHIGKLGIIDMPLNQKITFSNHLTAIRIVSKDVEPYYIYYLLSWYHLNGSFKK 356 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 VK +G+ ++ I++ ++ LPP++EQ +IV ++++ + N N L Sbjct: 357 VVKNQAGKWNLNLDTIRNLLIPLPPLEEQKKIVELLQKVDELIIRFNDFLQNLEDEANTL 416 Query: 410 TQSILAKAFRGELTAQWR 427 +SIL A G+LT WR Sbjct: 417 YKSILRLALTGKLTEDWR 434 Score = 139 bits (352), Expect = 1e-31, Method: Composition-based stats. 
Identities = 46/222 (20%), Positives = 90/222 (40%), Gaps = 11/222 (4%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTDL 59 G+ P+ W + ++ + + G + +K ++ + +R +NI N + + Sbjct: 221 DIGEFPKDWEVRRLNEIAIIRSGFSERKRD-----ENSKVIHLRPDNIDNETDRIVFHRI 275 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE--CSFGAFCGVLRP-EKL 116 V++P++ E + DIV+ ++GS +GK +P +F +R K Sbjct: 276 VYIPESPKIERYLLRHLDIVLVNTNGSIDHIGKLGIIDMPLNQKITFSNHLTAIRIVSKD 335 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + +I + + A N+ + + IP+PPL EQK I E L + Sbjct: 336 VEPYYIYYLLSWYHLNGSFKKVVKNQAGKWNLNLDTIRNLLIPLPPLEEQKKIVELLQKV 395 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE 217 + + + +++L A+ GKLTE WR Sbjct: 396 DELIIRFNDFLQNLEDEANTLYKSILRLALTGKLTEDWRRQI 437 >UniRef50_B8GGK0 Restriction modification system DNA specificity domain protein n=1 Tax=Methanosphaerula palustris E1-9c RepID=B8GGK0_METPE Length = 471 Score = 264 bits (676), Expect = 4e-69, Method: Composition-based stats. 
Identities = 119/464 (25%), Positives = 202/464 (43%), Gaps = 56/464 (12%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP- 63 ++PEGW + + + K D + + + T+ P Sbjct: 18 EVPEGWKLVTILNACEV----NPPKPPRDFLPADAPVTFVPMPAVDADMGAITNPEIKPY 73 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECS-FGAF-CGVLRPEKLIFSGF 121 + D+++A + GK+A FG+ V+R I + Sbjct: 74 LEVRNGFTSFRDGDVIMAKITPCMEN-GKAAIVRGMKNGIGFGSTEFHVMRSRGEILPEY 132 Query: 122 IAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + ++ + +RN+ S G+ + IP+PPLAEQ+ I +++ LL+ VD Sbjct: 133 LFYYIRQKSFRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRIVARIEALLSHVD 192 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRN------------------------- 215 + R ++P I+KRFRQAVL A +G+LTE+WR Sbjct: 193 AAGDRLSRVPLIMKRFRQAVLAAACSGRLTEEWREDKDNFEDPKLLLQDIQNYRLQHGIN 252 Query: 216 ---------------FEPQHSVFKKLNFESILTELRNGLSSKPNESG--VGHPILRISSV 258 P ++ + + ++ G+ +P + +P LR+++V Sbjct: 253 KIKIDSKVNITENPIEIPNTWIWSTI---EKIADISGGIQKQPMRAPQRNFYPYLRVANV 309 Query: 259 RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPD 318 G +D ++I+ +E EL R+ L+ D+L NGS +G + + +N ++ + Sbjct: 310 LRGSLDLHEIKNMELFAGELERYHLELNDILIVEGNGSFSEIGRSAIW-NGEIENCVHQN 368 Query: 319 KLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQ 378 +IR R+ K LP+Y+ ++++SP TTSG +S K I + LPP+ EQ Sbjct: 369 HIIRVRVRK-FLPQYVNLYWNSPLGSELSSGAAVTTSGLYTLSTKKIAQLPIPLPPISEQ 427 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 EIVRRV LF AD IE++V A R LTQ+++ KAF G L Sbjct: 428 HEIVRRVGLLFERADAIEREVVAAGRRCERLTQAVMIKAFSGRL 471 Score = 124 bits (313), Expect = 5e-27, Method: Composition-based stats. 
Identities = 50/244 (20%), Positives = 94/244 (38%), Gaps = 7/244 (2%) Query: 215 NFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS 274 + P+ + + + P ++ + + +V A + Sbjct: 17 DEVPEGWKLVTILNACEVNPPKPPRDFLPADAP--VTFVPMPAVDADMGAITNPEIKPYL 74 Query: 275 ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYI 334 E +DGD++ + +E G +++ +++ + R + LPEY+ Sbjct: 75 EVRNGFTSFRDGDVIMAKITPCMEN-GKAAIVRGMKNGIGFGSTEFHVMRSRGEILPEYL 133 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 394 + S RN + + GQK + IK V+ LPP+ EQ IV R+E L ++ D Sbjct: 134 FYYIRQKSFRNEAESHFTGSVGQKRVPTDFIKQSVIPLPPLAEQRRIVARIEALLSHVDA 193 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 +++ + Q++LA A G LT +WR + LL+ I+ R Sbjct: 194 AGDRLSRVPLIMKRFRQAVLAAACSGRLTEEWRED----KDNFEDPKLLLQDIQNYRLQH 249 Query: 455 GGKK 458 G K Sbjct: 250 GINK 253 >UniRef50_Q307D8 Type I RM system S subunit n=1 Tax=Arthrospira platensis RepID=Q307D8_SPIPL Length = 392 Score = 264 bits (675), Expect = 5e-69, Method: Composition-based stats. 
Identities = 76/422 (18%), Positives = 149/422 (35%), Gaps = 44/422 (10%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W+ A + V G K+Q ++ + +N + Sbjct: 3 WLQAKLKYVAHFAYGDALPKDQE----REGDFKVFGSNGAYDNY-----------GRANT 47 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 + +I GS V S H + +F + ++ + ++ Sbjct: 48 QAPV-----IIVGRKGSYGKVNWSDHPCFASDTTF----FIDATTTHHHLRWLFYLLQTL 98 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 + + A + + + IPPL EQK IA LD A++D +++ Sbjct: 99 N----LDQGTDEAAVPGLSRDDAYAKKVFIPPLGEQKAIAHYLDIETAKIDQLIKAKKRL 154 Query: 190 PQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELRNGLS 240 +L R+A++ AV L +W P+H L IL + G+S Sbjct: 155 LALLDEKRRALITHAVTRGLNPDVPMRDSGVEWIGEIPKHWEI--LPLRRILQTMDYGIS 212 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 S +LR+ V G + +++ F++ + +L L+ DLLF R N SL+ + Sbjct: 213 ES-VGSEGNIAVLRMGDVDEGEISYDNVGFVDDVDHDL---ILKANDLLFNRTN-SLDKI 267 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + + + L+R R +PEY+ +S + GQ + Sbjct: 268 GKVAIFRNNFLFPVSFASYLVRMRCNDSVIPEYLNYLLNSLPVLTWAKSNALPAIGQVNL 327 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + +PP++EQ I ++ + + + S++ A G Sbjct: 328 NPNRYSYIKIPIPPIEEQLNITEYIQTNTKKIKKLCLSSEETIKLLQERRTSLITAAVTG 387 Query: 421 EL 422 ++ Sbjct: 388 QI 389 Score = 139 bits (352), Expect = 2e-31, Method: Composition-based stats. 
Identities = 40/208 (19%), Positives = 83/208 (39%), Gaps = 11/208 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+ W I P+ + T + + + + ++R ++ G+ ++ FV Sbjct: 188 IGEIPKHWEILPLRRILQ-----TMDYGISESVGSEGNIAVLRMGDVDEGEISYDNVGFV 242 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP--FECSFGAFCGVLRPEKLIFSG 120 + D++ ++ S +GK A F SF ++ +R + Sbjct: 243 DDVDHD--LILKANDLLFNRTN-SLDKIGKVAIFRNNFLFPVSFASYLVRMRCNDSVIPE 299 Query: 121 FIAHFTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + S S + N+ P + I IPIPP+ EQ I E + T ++ Sbjct: 300 YLNYLLNSLPVLTWAKSNALPAIGQVNLNPNRYSYIKIPIPPIEEQLNITEYIQTNTKKI 359 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNG 207 E+ ++L+ R +++ AV G Sbjct: 360 KKLCLSSEETIKLLQERRTSLITAAVTG 387 Score = 89.0 bits (220), Expect = 3e-16, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 44/112 (39%), Gaps = 12/112 (10%) Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++ + + ++ + G+S D ++ V +PP+ EQ I ++ A Sbjct: 89 RWLFYLLQTLN-----LDQGTDEAAVPGLSRDDAYAKKVFIPPLGEQKAIAHYLDIETAK 143 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + K LA ++ ++++ A R NPD+ ++ + Sbjct: 144 IDQLIKAKKRLLALLDEKRRALITHAVT-------RGLNPDVPMRDSGVEWI 188 >UniRef50_A4CWB5 Type I restriction-modification system, S subunit n=1 Tax=Synechococcus sp. WH 7805 RepID=A4CWB5_SYNPV Length = 405 Score = 264 bits (675), Expect = 6e-69, Method: Composition-based stats. 
Identities = 76/421 (18%), Positives = 158/421 (37%), Gaps = 24/421 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 E W V L G T +Y + +P + + + + + T + L Sbjct: 3 ESWSKLRVGDFCNLSAGGTPDTNN-PDYWEGGDIPWMSSGEVHDQRIRRTRSHITERGLQ 61 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF-SGFIAHFT 126 S K P V+ +G GK A E + + +K + F+ + Sbjct: 62 DSSAKFFPIGSVLVALAGQGKTRGKVAI--SEIELTTNQSIAAIIADKGVCEPDFLFYNL 119 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 S ++ +LS G+ + + + I +PPL EQK IAE L + Q+ + + + Sbjct: 120 DSRY--EELRTLSGGSGRAGLNLSILSDVEISLPPLPEQKKIAEILSGVDKQIYALENKI 177 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 ++ + + + S K + ES+ + + + P + Sbjct: 178 SKLISTKTEIFRDLFSCFDELGGN----GVCKKESDTKIMPLESVCEAVIDCKNRTPPYT 233 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCG 304 GHP++R +VR G + +ND+++ + S E+ R + D+LFTR +G Sbjct: 234 ESGHPVVRTPNVRNGKLVRNDLKYTDISSYEIWTARSVPRPMDVLFTREAP----LGEVC 289 Query: 305 LLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 L+ + + +++ R K P Y+ SP ++ ++ K + Sbjct: 290 LVPE--NFKCCLGQRMMLFRADKSLIDPRYLLFSLMSPFVQDQLLK-SKGGTTVGHARVA 346 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 D++ ++ + P ++Q I +F+ +T + V ++ ++ + G Sbjct: 347 DVRDLLIPIVPKEKQLRIAS----VFSSIETFLEGVTRKKEKLEIQKSALASDLLSGRKR 402 Query: 424 A 424 Sbjct: 403 V 403 Score = 115 bits (290), Expect = 2e-24, Method: Composition-based stats. 
Identities = 23/198 (11%), Positives = 61/198 (30%), Gaps = 10/198 (5%) Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSES 276 + ++ L+ ++ P+ G P + V + + E Sbjct: 2 SESWSKLRVGDFCNLSAGGTPDTNNPDYWEGGDIPWMSSGEVHDQRIRRTRSHITERGLQ 61 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIE 335 + + G +L G + L + K P+++ Sbjct: 62 DSSAKFFPIGSVLVALAGQGKTR-GKVAI----SEIELTTNQSIAAIIADKGVCEPDFLF 116 Query: 336 IFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTI 395 S + + SG+ G++ + + LPP+ EQ +I + + + Sbjct: 117 YNLDSRY---EELRTLSGGSGRAGLNLSILSDVEISLPPLPEQKKIAEILSGVDKQIYAL 173 Query: 396 EKQVNNALARVNNLTQSI 413 E +++ ++ + + + Sbjct: 174 ENKISKLISTKTEIFRDL 191 >UniRef50_C4LDK7 Restriction modification system DNA specificity domain protein n=1 Tax=Tolumonas auensis DSM 9187 RepID=C4LDK7_TOLAT Length = 445 Score = 264 bits (675), Expect = 6e-69, Method: Composition-based stats. Identities = 82/438 (18%), Positives = 154/438 (35%), Gaps = 38/438 (8%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +PE W I + T G + + D +P++ I +GK T + Sbjct: 16 EVGVIPEDWDIQRLGVHATFKTG-PFGSALHKSDYVDGGIPVVNPMQIIDGKVKPTSSMA 74 Query: 62 VPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRPEKLIF 118 + K+ ++ DIVI G + +G+ A G ++R ++ Sbjct: 75 ISDEAAKKLSEYRLIAGDIVI----GRRGDMGRCAVISEIENGWLCGTGSMIVRVKENAD 130 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLA 177 + F+ + I S S G + N+ + + I IP EQ IA L + A Sbjct: 131 AAFLQRVLSNPQTITAIESASVGTTMINLNQGTLRALLILIPRDKQEQTAIANALSDVDA 190 Query: 178 QVDSTKARFEQIPQILKRFRQAVLG------------GAVNGKLTEKWRNFEPQHSVFKK 225 ++ + + I Q +L P+ Sbjct: 191 LINELEKLIAKKQAIKTATMQQLLTGKTRLPQFALREDGTPKGYKASELGEIPEDWEVVS 250 Query: 226 LNFESILTELRNGLSSKPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 L GL+ PN+ G +LR S+V+ + ++ ++ E R ++ Sbjct: 251 LAEIGQTI---IGLTYSPNDVAEHGTLVLRSSNVQNNVLAYDNNVYVNMDLPE--RVIVK 305 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 GD+L NGS + +G C L+ K 
+ + R ++ F S + Sbjct: 306 KGDILICVRNGSRQLIGKCALIDKNADGAA-FGAFMSIFRTKSF---GFVFYQFQSDIIQ 361 Query: 345 NAMMNCVKTTSGQKGISGKDIKSQVVLLPPV-KEQAEIVRRVEQLFAYADTIEKQVNNAL 403 N + + + I+ KD+ + LP + KEQ I + + DT + + L Sbjct: 362 NQINEIM--GATINQITNKDMAGFRIPLPTLQKEQVAITSILSDM----DTEIQSLQQRL 415 Query: 404 ARVNNLTQSILAKAFRGE 421 + + Q ++ + G+ Sbjct: 416 TKTRQIKQGMMQELLTGK 433 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 92/244 (37%), Gaps = 13/244 (5%) Query: 203 GAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS-SKPNESGVGHPILRISSVRAG 261 + + P+ ++L + G + K + G P++ + G Sbjct: 6 QVIPEGYKQTEVGVIPEDWDIQRLGVHATFKTGPFGSALHKSDYVDGGIPVVNPMQIIDG 65 Query: 262 HVDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL 320 V + +L+ ++L GD++ R +G C ++ ++++ L + Sbjct: 66 KVKPTSSMAISDEAAKKLSEYRLIAGDIVIGRRG----DMGRCAVISEIENGWLCGTGSM 121 Query: 321 IRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP-PVKEQA 379 I R+ ++A +++ S+P A+ + + ++ +++ ++L+P +EQ Sbjct: 122 I-VRVKENADAAFLQRVLSNPQTITAIES-ASVGTTMINLNQGTLRALLILIPRDKQEQT 179 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENS 439 I + + D + ++ +A+ + + + + G+ A D Sbjct: 180 AIANALSDV----DALINELEKLIAKKQAIKTATMQQLLTGKTRLPQFALREDGTPKGYK 235 Query: 440 AAAL 443 A+ L Sbjct: 236 ASEL 239 >UniRef50_Q2P0A3 Specificity determinant for hsdM and hsdR n=2 Tax=Xanthomonas oryzae pv. oryzae RepID=Q2P0A3_XANOM Length = 450 Score = 264 bits (675), Expect = 6e-69, Method: Composition-based stats. 
Identities = 127/485 (26%), Positives = 216/485 (44%), Gaps = 58/485 (11%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLI--RANNIQNGKFDTTD 58 MS +LP GW + V T + + +P+ R I +GK Sbjct: 1 MS--ELPGGWSETEIGPVNTYSSETLNPAKAPKQTFELYSVPVFAKRKPEIVDGK----- 53 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 ++ QK+ P+D+++ + + V ++ + + + + +P L Sbjct: 54 ------DIGSTKQKVEPDDVLLCKINPRINRVWLVGKKNDHEQIASSEWIVIRQP--LFD 105 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKP---ASFDLINIPIPPLAEQKIIAEKLDTL 175 FI + S +R+++ + +G ++ + + I PLAEQK IA+KLD L Sbjct: 106 PAFIRFQLQESSFRDRLCAEVSGVG-GSLTRAQPKKVESYKLRIAPLAEQKRIAQKLDAL 164 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP-----------QHSVFK 224 LAQVD+ KAR + IP +LKRFR++V+ AV G+L+ R + Sbjct: 165 LAQVDTLKARIDAIPALLKRFRKSVVHSAVIGRLSADLRVPIEKSEEQEQLGPLESWREV 224 Query: 225 KLNFESILTELRNGLSSKPNE--SGVGHPILRISSVRA--GHVDQNDIRFLECSESELNR 280 L L+ ++ + + G +P ++ V G + + + + E + Sbjct: 225 TLASLGELSRGKSKHRPRNDSRLYGSEYPFIQTGDVANSGGALTSSKVFYSEFGLKQSR- 283 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFS 339 G L T + +L + +PD ++ KD + ++I+ Sbjct: 284 -LFPSGTLCITI----AANIADTAMLA----IDACFPDSVVGFIPNKDDCVAQFIKYVID 334 Query: 340 SPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQV 399 + + + QK I+ K + + +PP+KEQ EIVR VEQLFAYAD +E +V Sbjct: 335 DN---KESLEALAPATAQKNINLKVLNQVKLRIPPIKEQTEIVRHVEQLFAYADQLEAKV 391 Query: 400 NNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 A R++ LTQS+LAKAFRGEL Q ++ P A+ LL++I+A+RAA+ K Sbjct: 392 AAAQQRIDALTQSLLAKAFRGELVPQDPSDEP--------ASVLLDRIRAQRAATPKPKR 443 Query: 460 SRKKS 464 RK + Sbjct: 444 GRKAA 448 >UniRef50_A4T8B4 Restriction modification system DNA specificity domain n=1 Tax=Mycobacterium gilvum PYR-GCK RepID=A4T8B4_MYCGI Length = 442 Score = 264 bits (675), Expect = 6e-69, Method: Composition-based stats. 
Identities = 83/434 (19%), Positives = 183/434 (42%), Gaps = 25/434 (5%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W + ++ V + G+ +Q +D+ P +R N+ ++ + + Sbjct: 2 SWPLVALADVAEIQGGIQ---KQPKRTARDNAFPFLRVANVTARGLALDEVHTIELFDGE 58 Query: 69 -ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAHFT 126 E ++ D+++ +GS S +G++A + +RP I F+ H Sbjct: 59 LERYRLLRGDLLVVEGNGSASQIGRAAVWDGSITDAVHQNHLIRVRPGFQIDPRFLGHLW 118 Query: 127 KSSLYRNKISSLSAGANINN-IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 S L R+++S +++ + + + I +P+P L EQ+ I + L+ L+++D+ ++ Sbjct: 119 NSPLIRDELSRVASSTSGLHTLSVTKLKRITLPLPSLTEQRRIVDLLEDHLSRLDAGRSE 178 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLT--------------EKWRNFEPQHSVFKKLNFESI 231 E+ L R+ + A+ G + + P + +L + Sbjct: 179 VERAAAKLAILRERTVIQALTGGAEANREDARLTDVSTADGDLSALPIGWSWSRLGDVAD 238 Query: 232 LTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + S K ++ V P LR+++V+ G ++ +++ + +S+ + +L+ GD+L Sbjct: 239 VVGGVTKDSKKQSDPNYVEVPYLRVANVQRGRLNLDEVTKIRVPQSKADALRLRPGDVLL 298 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMN 349 G + + G + + Q + ++ + + RAR+ P ++ ++ R Sbjct: 299 NE-GGDRDKLAR-GWVWEGQVPDCIHQNHVFRARITDPRIDPYFLSWTANTIGGR-WAER 355 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K + IS I+ V++PP E I + + D +EK + + + R L Sbjct: 356 NGKQSVNLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDRLEKSIRDGMDRALVL 415 Query: 410 TQSILAKAFRGELT 423 +S+L AF G LT Sbjct: 416 KKSLLTAAFSGRLT 429 Score = 109 bits (272), Expect = 3e-22, Method: Composition-based stats. 
Identities = 37/208 (17%), Positives = 80/208 (38%), Gaps = 5/208 (2%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP GW + + V ++ GVT K + + +P +R N+Q G+ + ++ + Sbjct: 224 LPIGWSWSRLGDVADVVGGVT-KDSKKQSDPNYVEVPYLRVANVQRGRLNLDEVTKIRVP 282 Query: 66 LVKESQ-KISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLR-PEKLIFSGFI 122 K ++ P D+++ G + + + +C R + I F+ Sbjct: 283 QSKADALRLRPGDVLLNE-GGDRDKLARGWVWEGQVPDCIHQNHVFRARITDPRIDPYFL 341 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + + R + N+ +I + + + +PP E IA +L + D Sbjct: 342 SWTANTIGGRWAERNGKQSVNLASISLSMIRRMPVIVPPPGEAVRIATELRDSRSDFDRL 401 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + ++++L A +G+LT Sbjct: 402 EKSIRDGMDRALVLKKSLLTAAFSGRLT 429 >UniRef50_C5VLJ8 HsdS protein n=1 Tax=Prevotella melaninogenica ATCC 25845 RepID=C5VLJ8_9BACT Length = 428 Score = 263 bits (673), Expect = 9e-69, Method: Composition-based stats. Identities = 76/439 (17%), Positives = 155/439 (35%), Gaps = 47/439 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+P W + + L + D+ + R + ++ Sbjct: 16 GKVPSHWNYSRIK--FGLKSSFSGVWGD-DEKGDDNDVVCYRVADFDYKNGGLSEEKITI 72 Query: 64 KNLVKESQK---ISPEDIVIAMSSGS-KSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-F 118 +N+ +++ K I P DI+I S G + VG++ +L + + F +R + + Sbjct: 73 RNIDEKTFKEREILPNDILIEKSGGGDVNPVGRAVIANLDHKATCSNFIHCVRCNENVLN 132 Query: 119 SGFIAHFTKSSLYRN-KISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + + +F S + + + I N+K + + +PPL+EQ+ IA LD Sbjct: 133 TRLLYYFFYSIYVQKVNLLFFNQTTGIQNLKVPEYLGQVMFLPPLSEQQSIASFLDAKTK 192 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNF 228 +D A+ EQ +L+ + A++ AV L + W P++ + Sbjct: 193 PIDDIIAKREQQIALLEEMKSAIISRAVTKGLNPEAKMKDSGIEWIGEVPENWNLLRFRL 252 Query: 229 ESILTELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 L + G S + G +P ++ + E + +GD Sbjct: 253 ---LCRISTGDSDTQDAEPDGEYPF-----------------YVRSPQVERSSKFTCEGD 292 Query: 288 -LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 
+L V + + ++ I + K Y+ F + Sbjct: 293 AILMAGDGAGAGRV-----FHHVDGKYAVHQRVYIFNQFNKVVDSNYLYQFMRIMFPQR- 346 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 MN S + I++ VV +P + EQ I ++ A D + +A + Sbjct: 347 -MNMGSAQSTVPSVRLHMIQNFVVPIPSIDEQRTITSYLDTETAKIDVRIDKRRKQIALL 405 Query: 407 NNLTQSILAKAFRGELTAQ 425 Q+++ A G++ + Sbjct: 406 QEYKQALITDAVTGKIDVR 424 Score = 146 bits (369), Expect = 2e-33, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 80/237 (33%), Gaps = 16/237 (6%) Query: 212 KWRNFEPQHSVFKKLNFE-SILTELRNGLSSKPNESGVGHPILRISSV--RAGHVDQNDI 268 +W P H + ++ F G K + R++ + G + + I Sbjct: 13 QWLGKVPSHWNYSRIKFGLKSSFSGVWGDDEKGD--DNDVVCYRVADFDYKNGGLSEEKI 70 Query: 269 RFLECSESELNRHKLQDGDLLFTRYNG-SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 E ++ D+L + G + VG + + + R + Sbjct: 71 TIRNIDEKTFKEREILPNDILIEKSGGGDVNPVGRAVI--ANLDHKATCSNFIHCVRCNE 128 Query: 328 DA-LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 + + FF S + + T+G + + + QV+ LPP+ EQ I ++ Sbjct: 129 NVLNTRLLYYFFYSIYVQKVNLLFFNQTTGIQNLKVPEYLGQVMFLPPLSEQQSIASFLD 188 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D I + +A + + +I+++A + NP+ ++ + Sbjct: 189 AKTKPIDDIIAKREQQIALLEEMKSAIISRAVT-------KGLNPEAKMKDSGIEWI 238 Score = 129 bits (324), Expect = 3e-28, Method: Composition-based stats. 
Identities = 35/208 (16%), Positives = 79/208 (37%), Gaps = 23/208 (11%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + + + G + D P Sbjct: 238 IGEVPENWNLLRFRLLCRISTG----DSDTQDAEPDGEYPFY----------------VR 277 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + S+ D ++ +G + G+ H + K++ S ++ Sbjct: 278 SPQVERSSKFTCEGDAIL--MAGDGAGAGRVFHHVDGKYAVHQRVYIFNQFNKVVDSNYL 335 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 F + ++ +++ SA + + +++ +PIP + EQ+ I LDT A++D Sbjct: 336 YQFMR-IMFPQRMNMGSAQSTVPSVRLHMIQNFVVPIPSIDEQRTITSYLDTETAKIDVR 394 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + + +L+ ++QA++ AV GK+ Sbjct: 395 IDKRRKQIALLQEYKQALITDAVTGKID 422 >UniRef50_A1ZTI8 Type I restriction enzyme StySJI specificity protein n=1 Tax=Microscilla marina ATCC 23134 RepID=A1ZTI8_9SPHI Length = 436 Score = 263 bits (673), Expect = 9e-69, Method: Composition-based stats. Identities = 72/436 (16%), Positives = 173/436 (39%), Gaps = 30/436 (6%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 W + + G + + + P +R ++ NG DT++L +V + + Sbjct: 2 SNWEEKKIQDFAEVKGGKRLPAGKEFSLTPTKH-PYLRVTDMVNGSIDTSNLQYVDEEIE 60 Query: 68 K--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 K + +IS +D + +G+ VG + A + +I ++ ++ Sbjct: 61 KVIRNYRISADD-LYITIAGTIGSVGNIPELLHNALLTENAAKITNIDKSIIDKNYLQYY 119 Query: 126 TKSSLYRNKISS-LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S +++I+ + G + + + + PPL Q+ IA+ L T+ +D T+ Sbjct: 120 LSSEETKSQINKEIGIGGGVPKLALYRILNLVVQYPPLTYQRKIAQILSTVDRVIDGTQR 179 Query: 185 RFEQIPQILKRFRQAVLGGAV---NGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 E+ + + Q + + GKL + + + E + Sbjct: 180 AIEKYQTLKEGLMQDLFSRGIDVSTGKLRPPRQVAPELYQKTELGWIPKDYSFVRLEDLT 239 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLF 290 ++ +G P + G P LR++ V+ ++ + ++F+ E ++ R + GDLL Sbjct: 240 LKIIDGTHHTPKYTESGIPFLRVTDVQTKDINFDKLKFVSLEEHQILTKRCNPEKGDLLL 299 Query: 291 
TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 ++ +G+ ++ ++ LI+ + EY+ F S +N ++ Sbjct: 300 SKNG----TIGIPKVVDWDWEFSIFVSLALIKPN-HRLINVEYLLYFLKSELIKNQIIRQ 354 Query: 351 VKTTSGQKGISGKDIKSQVVLLPP-VKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 K + + ++I+ + PP ++EQ IV ++ L + + + ++ L Sbjct: 355 AKQGT-VTNLHLEEIREFKIAQPPSIQEQNNIVEKLNNL----EKQIESEQKSFQKLKTL 409 Query: 410 TQSILAKAFRGELTAQ 425 Q+++ G+++ + Sbjct: 410 KQALMQDLLTGKVSVE 425 Score = 137 bits (345), Expect = 1e-30, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 87/210 (41%), Gaps = 13/210 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +P+ + + +T I T+ + + +P +R ++Q + L F Sbjct: 222 ELGWIPKDYSFVRLEDLTLKIIDGTHHTPK----YTESGIPFLRVTDVQTKDINFDKLKF 277 Query: 62 VPKNLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLI 117 V + + D++++ +G +E S +++P +LI Sbjct: 278 VSLEEHQILTKRCNPEKGDLLLSK----NGTIGIPKVVDWDWEFSIFVSLALIKPNHRLI 333 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLL 176 ++ +F KS L +N+I + + N+ I PP + EQ I EKL+ L Sbjct: 334 NVEYLLYFLKSELIKNQIIRQAKQGTVTNLHLEEIREFKIAQPPSIQEQNNIVEKLNNLE 393 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVN 206 Q++S + F+++ + + Q +L G V+ Sbjct: 394 KQIESEQKSFQKLKTLKQALMQDLLTGKVS 423 >UniRef50_B0CE92 Type I restriction-modification enzyme S subunit n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0CE92_ACAM1 Length = 382 Score = 263 bits (673), Expect = 9e-69, Method: Composition-based stats. 
Identities = 74/413 (17%), Positives = 162/413 (39%), Gaps = 36/413 (8%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE-SQK 72 + V + G T K++ + +P I +I ++ + ++ +++ Sbjct: 2 KLKEVCRFLNGGTPSKKK--PEYFEGEIPWITGADINGPIVNSARSYITEEAILNSATKR 59 Query: 73 ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSLY 131 + P +++ + VGK A + + L P+ + + ++ HF +S Sbjct: 60 VPPNTVLLVTRT----SVGKVAVSGMEL--CYSQDITSLWPDLEKLDIYYLTHFLRSRE- 112 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 + S GA I + + +++ +PP+AEQK IA LD A + + Sbjct: 113 -TYLKGQSRGATIKGVTKGVLENLSLHLPPIAEQKRIAGILDAADALRVKRRDAISTLDA 171 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP 251 +L+ + G + + + + E++ ++ +G P + G Sbjct: 172 LLQSTFLTLFGDPITNPM------------GWDASDLEAVSEKITDGTHKTPKYTESGIE 219 Query: 252 ILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKL 309 L ++ G + N +F+ E + + R + GD+L + +G ++ + Sbjct: 220 FLSAKDIKNGSIKWNTGKFISEDEHKSLITRCHPEIGDVLLAKSGS----LGSVAIIDRD 275 Query: 310 QHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQV 369 +L LI+ + +++ SP + +++ K K + DI+ Sbjct: 276 HEFSLFESLCLIK-HNRQKIEAQFLTAMLESPRMQMHLLSRNK-GISIKHLHLTDIRKLK 333 Query: 370 VLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +LLPP+ +Q + V A + + Q LA ++ L S+ ++AF GEL Sbjct: 334 ILLPPLDKQRKFATIV----ASIEKQKAQQCAHLAELDTLFASLQSRAFNGEL 382 Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats. 
Identities = 40/207 (19%), Positives = 81/207 (39%), Gaps = 16/207 (7%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P GW + + V+ I T+K + + + + A +I+NG F+ ++ Sbjct: 188 PMGWDASDLEAVSEKITDGTHKTPK----YTESGIEFLSAKDIKNGSIKWNTGKFISEDE 243 Query: 67 VKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 K D+++A S +G A E S +++ + I + F+ Sbjct: 244 HKSLITRCHPEIGDVLLAKS----GSLGSVAIIDRDHEFSLFESLCLIKHNRQKIEAQFL 299 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +S + + S + G +I ++ + I +PPL +Q+ A + A ++ Sbjct: 300 TAMLESPRMQMHLLSRNKGISIKHLHLTDIRKLKILLPPLDKQRKFATIV----ASIEKQ 355 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 KA+ L ++ A NG+L Sbjct: 356 KAQQCAHLAELDTLFASLQSRAFNGEL 382 >UniRef50_B0PEE2 Putative uncharacterized protein n=1 Tax=Anaerotruncus colihominis DSM 17241 RepID=B0PEE2_9FIRM Length = 388 Score = 263 bits (673), Expect = 9e-69, Method: Composition-based stats. Identities = 107/418 (25%), Positives = 191/418 (45%), Gaps = 37/418 (8%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 ++ + V T I G +K + +D LP+IR N+ N + ++E Sbjct: 1 MSTLGNVATYINGRAFKPSE----WEDSGLPIIRIQNLTN----FSAPYNYSSRELEEKY 52 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 K++ D++ A S+ + + K + + P + I ++ +F Sbjct: 53 KVTRGDLLFAWSASLGAHIWK------GNDAWLNQHIFRVVPSEQIEKKYLYYFLL--QV 104 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 ++ + + G+ + +I F IP+P L EQK I K++ L +++D++ A + + Sbjct: 105 VAELHAKTHGSGMVHITKGPFMNTPIPVPSLPEQKRIVSKIEELFSKLDASVAELQTAKE 164 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE-SGVGH 250 LK +RQAVL A +K+ E I+ + R G S K + G Sbjct: 165 KLKVYRQAVLKEAF-------------DPVSKEKILLEDIIEKPRYGTSKKCSYAYKNGF 211 Query: 251 P-ILRISSV--RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 + RI ++ + G +D DI++ S+ EL L + DLL R NGS+ VG ++K Sbjct: 212 KAVYRIPNICYQNGSIDHKDIKYAGFSDDELKNLDLIENDLLIIRSNGSVSLVGRSSIVK 271 Query: 308 
KLQHQNLLYPDKLIRARLTK--DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + + + LIR RL K + L +++ F S +AR + + K+TSG I+ +I Sbjct: 272 -AEDCDATFAGYLIRLRLKKPSEVLSKFLHYFLESHAARTYIEHVAKSTSGVNNINSNEI 330 Query: 366 KSQVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + V QA+ V ++E + D I++ ++ +L + L QSIL +AF GEL Sbjct: 331 SNLPVPKCDDFDMQAQTVVKIETNLSICDDIQQTIDTSLQQAEALRQSILKQAFEGEL 388 >UniRef50_A3ZEA3 Type I restriction modification DNA specificity domain protein n=5 Tax=Bacteria RepID=A3ZEA3_CAMJE Length = 422 Score = 263 bits (672), Expect = 1e-68, Method: Composition-based stats. Identities = 73/432 (16%), Positives = 150/432 (34%), Gaps = 38/432 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + L K + L + N + + + F Sbjct: 13 GEIPEHWKLIKCKNFFVL------KSIPIGDLWNKTKLLSLTLNGVIERDINNPEGKF-- 64 Query: 64 KNLVKESQKISPEDIVIAM--SSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + Q + D++ + + + +G S + + + F Sbjct: 65 PSDFSTYQIVKEGDLIFCLFDVAETPRTIGLS-----KLNGMITSAYTIFEIKNQ-EKRF 118 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +F R + L G N I + IP+PPL EQ+ IA LD Q+ + Sbjct: 119 LEYFFIDLDNRKNLKFLYRGL-RNTISKEDLLNLKIPLPPLKEQEQIANFLDEKCEQIKN 177 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + E++ +LK +QA + A L + ++ PQH +L Sbjct: 178 FIEKKEKLITLLKEQKQAFINKATTKGLDKNVNFKDSGIEYLGEIPQHWKLVRLGLILKT 237 Query: 233 TELRNGLSSKPNESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD-LLF 290 + S G + + G + + + + + + + K+ D D L+ Sbjct: 238 SSGTTPDSGNDKYYKGGQIVWINSGDLNDGFLKDSKRKITQDALDDYSVLKIFDKDSLII 297 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 Y + +G +LK N + Y+ F+ + +++ Sbjct: 298 AMYGAT---IGKTAILK----VNACVNQACCVLEKSAWYNTFYLFYLFN--RYKKELIS- 347 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + + GQ IS IK+ + LPP+KEQ +I +++ D + ++ + + Sbjct: 348 MGSGGGQPNISQDIIKNLKIPLPPLKEQEQIANFLDEKCKKIDLLIEKTEKQIKLIKEYK 407 Query: 411 QSILAKAFRGEL 422 ++ +A G + Sbjct: 408 
TTLTNQAVCGRI 419 Score = 169 bits (428), Expect = 2e-40, Method: Composition-based stats. Identities = 43/209 (20%), Positives = 86/209 (41%), Gaps = 8/209 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P+ W + + + G T Y K + I + ++ +G + Sbjct: 220 GEIPQHWKLVRLGLILKTSSGTTPDSGN-DKYYKGGQIVWINSGDLNDGFLKDSKRKITQ 278 Query: 64 KNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 L KI +D +I G + +GK+A + C VL + ++ Sbjct: 279 DALDDYSVLKIFDKDSLIIAMYG--ATIGKTAILKVN--ACVNQACCVLEKSAWYNTFYL 334 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + Y+ ++ S+ +G NI + IP+PPL EQ+ IA LD ++D Sbjct: 335 FYLFN--RYKKELISMGSGGGQPNISQDIIKNLKIPLPPLKEQEQIANFLDEKCKKIDLL 392 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE 211 + E+ +++K ++ + AV G++ + Sbjct: 393 IEKTEKQIKLIKEYKTTLTNQAVCGRIGK 421 Score = 98.6 bits (245), Expect = 5e-19, Method: Composition-based stats. Identities = 34/234 (14%), Positives = 82/234 (35%), Gaps = 26/234 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H K +L + G + L ++ V ++ + +F Sbjct: 10 EWLGEIPEHWKLIKCKNFFVLKSIPIG----DLWNKTKLLSLTLNGVIERDINNPEGKFP 65 Query: 272 ECSESELNRHKLQDGDLLFTR--YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 +++GDL+F + +G+ L ++ I ++ Sbjct: 66 S---DFSTYQIVKEGDLIFCLFDVAETPRTIGLSKL------NGMITSAYTIFEIKNQE- 115 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 ++E FF R + + + IS +D+ + + LPP+KEQ +I +++ Sbjct: 116 -KRFLEYFFIDLDNRKNLKFLYRG--LRNTISKEDLLNLKIPLPPLKEQEQIANFLDEKC 172 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ + + Q+ + KA + + ++ ++ L Sbjct: 173 EQIKNFIEKKEKLITLLKEQKQAFINKATT-------KGLDKNVNFKDSGIEYL 219 >UniRef50_C1PCQ5 Restriction modification system DNA specificity domain protein n=1 Tax=Bacillus coagulans 36D1 RepID=C1PCQ5_BACCO Length = 483 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. 
Identities = 95/484 (19%), Positives = 176/484 (36%), Gaps = 66/484 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 ++P WV + T+ K+ KD+ L + G + Sbjct: 25 EVPGNWVWVKLKTI-----NKDKKRNIDPKSFKDETFELYSVPSFPEG----SPEFIKGD 75 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + Q ++ ++I++ + + V K + H F V+ K I+S ++ + Sbjct: 76 EIGSSKQLVNKDEILLCKINPRINRVWKVLNNHGKFRQLASTEWIVISENKAIYSEYLLY 135 Query: 125 FTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 KS +R I+S +G + +P + I +PP+ EQK IA+K++ LL+++D Sbjct: 136 LLKSPYFRKLITSNVSGVGGSLTRARPKEVETYPIAVPPIKEQKRIADKVERLLSKIDEA 195 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF-------------------------- 216 K E+ + + R A+L A G+LT KWR Sbjct: 196 KRLIEEAKETFELRRAAILDKAFRGELTRKWREENKNIEDAESLYVKIKESQSIRRKVSK 255 Query: 217 ----------EPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 P + +L +T + P P ++ ++ ++++ Sbjct: 256 EINIKDLRYSIPSTWKWVRLGDVFTITSGGTPKRTIPEYYEGNIPWIKTGEIKWNAINES 315 Query: 267 DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT 326 + + + + + L +L Y L G +L + Sbjct: 316 EEQITPEAVANSSAKLLPPNTVLVAMYGQGLTR-GRAAILSVE----ATCNQAVCALLPN 370 Query: 327 KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 PE+I +F R V Q+ +S I + LPP++EQ I+ ++ Sbjct: 371 DYIAPEFIFYYFMEGYQR---FRQVAKGGNQENLSVSLISDFIFPLPPLEEQRVIITTLQ 427 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEK 446 +F I+ + + + QSIL+KAFRGEL E SA LL++ Sbjct: 428 NIFKKESKIKDVIKI---NTDEIKQSILSKAFRGELGTNDPTEE--------SAIELLKE 476 Query: 447 IKAE 450 + E Sbjct: 477 VLQE 480 Score = 122 bits (307), Expect = 3e-26, Method: Composition-based stats. 
Identities = 54/272 (19%), Positives = 103/272 (37%), Gaps = 15/272 (5%) Query: 193 LKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPI 252 ++ + +L A+ + P + V+ KL + + + + Sbjct: 4 KQKTMEELLEEAL--VPEGEQPYEVPGNWVWVKLKTINKDKKRN---IDPKSFKDETFEL 58 Query: 253 LRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQ 312 + S G F++ E ++ + ++L + N + V +L Sbjct: 59 YSVPSFPEG-----SPEFIKGDEIGSSKQLVNKDEILLCKINPRINRVWK--VLNNHGKF 111 Query: 313 NLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT-TSGQKGISGKDIKSQVVL 371 L + I K EY+ SP R + + V K++++ + Sbjct: 112 RQLASTEWIVISENKAIYSEYLLYLLKSPYFRKLITSNVSGVGGSLTRARPKEVETYPIA 171 Query: 372 LPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENP 431 +PP+KEQ I +VE+L + D ++ + A +IL KAFRGELT +WR EN Sbjct: 172 VPPIKEQKRIADKVERLLSKIDEAKRLIEEAKETFELRRAAILDKAFRGELTRKWREENK 231 Query: 432 DLISGENSAAALLE--KIKAERAASGGKKASR 461 ++ E+ + E I+ + + K R Sbjct: 232 NIEDAESLYVKIKESQSIRRKVSKEINIKDLR 263 >UniRef50_Q6GD64 Putative type I restriction enzyme specificity protein n=1 Tax=Staphylococcus aureus subsp. aureus MSSA476 RepID=Q6GD64_STAAS Length = 436 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. 
Identities = 75/433 (17%), Positives = 160/433 (36%), Gaps = 26/433 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P+ W I + + I G +K E + + D+ +I + + +L + Sbjct: 13 IGYIPKYWTITKLKNIIDFISGYAFKSE--LFTISDNNKKVITIKSFNTKEIILDNLSYS 70 Query: 63 PKNL-VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 ++L + DI+ AMS G+ + + G++R FS F Sbjct: 71 NESLKFPTKYLLKNNDILFAMSGGTTGK--NLLIEQVDDLYYINQRVGIIRSS---FSKF 125 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I ++ + L+ I+ S+G+ NI I +P K I ++ L + + Sbjct: 126 IYYYINTGLFSEYINLFSSGSAQPNISATDIQNFIIALPEKETIKKIEIYINYQLKIISN 185 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESIL 232 Q + LK+++Q+++ AV + W P + +K+ + L Sbjct: 186 IIDTTYQSIEELKKYKQSLITEAVTKGIDPNVEMKESGNDWIGSIPSNWSVRKIKHDFNL 245 Query: 233 TE--LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLL 289 GL+ VG ++ + + G + + + E +++ DLL Sbjct: 246 KGRIGWQGLT-SNEYQTVGPYLITGTDFKKGIIRWDSCVRISEERFEEAPDIHIKENDLL 304 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPD-KLIRARLTKDALPEYIEIFFSSPSARNAMM 348 T+ +G L + + L LIR +L +++ S N Sbjct: 305 ITKDG----TIGKVALATNVPKKVSLNSGVLLIREKLKNTINKKFMYYNLLSNMFWNWYN 360 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + + S K + + +P + EQ +IV+ ++ + D + + + + N Sbjct: 361 SNNQGASTIKHLYQGQFYNYSYAIPLLHEQQQIVQYLDDKVSTIDRLIEDKTKVIKELEN 420 Query: 409 LTQSILAKAFRGE 421 +S++ + G+ Sbjct: 421 YKKSLIYEYVTGK 433 Score = 130 bits (327), Expect = 1e-28, Method: Composition-based stats. 
Identities = 34/232 (14%), Positives = 81/232 (34%), Gaps = 15/232 (6%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W + P++ KL S S ++ I S + +++ + Sbjct: 11 EWIGYIPKYWTITKLKNIIDFISGYAFKSELFTISDNNKKVITIKSFNTKEIILDNLSYS 70 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 S ++ L++ D+LF G+ LL + ++ R + Sbjct: 71 NESLKFPTKYLLKNNDILFAMSGGTTGK----NLLIEQVDDLYYINQRVGIIRSS---FS 123 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 ++I + ++ + N + S Q IS DI++ ++ LP + +I + Sbjct: 124 KFIYYYINTGLFSEYI-NLFSSGSAQPNISATDIQNFIIALPEKETIKKIEIYINYQLKI 182 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 I ++ + QS++ +A + +P++ E+ + Sbjct: 183 ISNIIDTTYQSIEELKKYKQSLITEAVT-------KGIDPNVEMKESGNDWI 227 Score = 127 bits (319), Expect = 1e-27, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 88/214 (41%), Gaps = 11/214 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P W + + L G + N + LI + + G V + Sbjct: 227 IGSIPSNWSVRKIKHDFNLK-GRIGWQGLTSNEYQTVGPYLITGTDFKKGIIRWDSCVRI 285 Query: 63 PKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLR--PEKLI 117 + +E+ I D++I +GK A ++P + S + ++R + I Sbjct: 286 SEERFEEAPDIHIKENDLLITKD----GTIGKVALATNVPKKVSLNSGVLLIREKLKNTI 341 Query: 118 FSGFIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 F+ + S+++ N +S + G + I ++ F + IP L EQ+ I + LD + Sbjct: 342 NKKFMYYNLLSNMFWNWYNSNNQGASTIKHLYQGQFYNYSYAIPLLHEQQQIVQYLDDKV 401 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + +D ++ + L+ ++++++ V GK Sbjct: 402 STIDRLIEDKTKVIKELENYKKSLIYEYVTGKKE 435 >UniRef50_A8YFX5 HsdS protein n=2 Tax=Microcystis aeruginosa PCC 7806 RepID=A8YFX5_MICAE Length = 406 Score = 262 bits (670), Expect = 2e-68, Method: Composition-based stats. 
Identities = 86/427 (20%), Positives = 160/427 (37%), Gaps = 33/427 (7%) Query: 6 LPEGWVIAPVSTVTTLIRG----VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 LP+ W + + + +G + + + N F + Sbjct: 3 LPKTWSLVALGDIAAHEKGAIRRGPFGGSLKKEIFVESGFKVYEQQNAIKDDFQIGNYFI 62 Query: 62 VPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPE-KLIF 118 + E + P D++I+ +GK A +RP ++I Sbjct: 63 DEDKFREMEGFNVKPHDLIISC----AGTIGKVAIVPYEALPGVINQALMRIRPNPEIIL 118 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIK-PASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ +S Y+ I SAG+ + N+ + IP+PPL EQ+ IA LD Sbjct: 119 CRYLKWLLESPKYQRDIFGKSAGSALKNLAAISEIKKCKIPLPPLEEQRRIAAILDKADG 178 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 K ++L+ + G V P+ KL ++ + N Sbjct: 179 VRRKRKEAIRLTEELLRSTFLEMFGDPVTN----------PKGWEIVKLGSL-VVGQPNN 227 Query: 238 GLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 G+ K +E G P++ + + +G+ +D ++ R L ++ E+ + L GD+LF R + + Sbjct: 228 GIFKKNHEYGGDTPVVWVKELFSGYTIDCSESRTLTPTDEEVKKFGLTKGDILFCRSSLN 287 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + +G + + L+ +IR RL K ++ P R ++ T Sbjct: 288 RDGIGFNNVFDG-MDFSALFECHIIRVRLNQKKVNSIFLNYLLHFPGLRKQIIAKA-NTV 345 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 I +IK LPP + Q + + ++ +E + NL S+L Sbjct: 346 TMSTIGQSEIKKIEFYLPPKELQDKFEIFLRKIATNRTKLENK------ESENLFNSLLQ 399 Query: 416 KAFRGEL 422 +AFRGEL Sbjct: 400 RAFRGEL 406 >UniRef50_D0BWI7 Predicted protein n=1 Tax=Acinetobacter sp. RUH2624 RepID=D0BWI7_9GAMM Length = 396 Score = 261 bits (669), Expect = 3e-68, Method: Composition-based stats. 
Identities = 82/429 (19%), Positives = 172/429 (40%), Gaps = 40/429 (9%) Query: 1 MS--AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD 58 M KLP+GW + V + ++ + + LP++ + N+ + + Sbjct: 1 MEQVLYKLPDGWDWKTLGDVCFKVTDGSHNPPKEVEV----GLPMLSSRNVMDNGLVWDN 56 Query: 59 LVFVPKNLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRPE 114 +P++ + + ++S D+++ + +G+S +L + VL Sbjct: 57 FRLIPEDAFESEHKRTRVSEGDVLLTI----VGTIGRSCVVRNLDRLFTLQRSVAVL-SS 111 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + + F+++ ++ + S + G+ I + PP+ EQ I EKLD Sbjct: 112 EELIPEFLSYQFRAPFIQEHFISNAKGSAQKGIYLKQLKATYLVCPPIEEQNRIVEKLDA 171 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE 234 L ++D + + K+ +VL P + Sbjct: 172 LFTRIDIAIEHLQSKLDLSKQLFDSVLDEFF----------KLPDCDSVPLTQVVEFIGG 221 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 + S + G +R+ +R D N I +++ + + D++ RY Sbjct: 222 SQPPKSQFSDVQKEG--YVRLIQIRDYKSD-NHIVYVDSA---STKKFCTKDDVMIGRYG 275 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFFSSPSARNAMMNCVKT 353 + + L+ + Y L++A +D L +Y+ F SPS +N ++ + Sbjct: 276 PP--------VFQILRGLDGAYNVALMKAVPNEDLLMKDYLFWFLQSPSIQNYVIGISQR 327 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 +GQ G++ K ++ ++ +P Q +IV +V QL + + +E +V +A ++ L SI Sbjct: 328 AAGQSGVNKKALEKYLIPVPSKAIQNDIVDKVGQLVSKSRHLEAEVTAEIAFLSQLKASI 387 Query: 414 LAKAFRGEL 422 L AF+GEL Sbjct: 388 LDSAFKGEL 396 >UniRef50_A4FZ34 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus maripaludis C5 RepID=A4FZ34_METM5 Length = 402 Score = 261 bits (669), Expect = 3e-68, Method: Composition-based stats. 
Identities = 91/423 (21%), Positives = 172/423 (40%), Gaps = 33/423 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + + + G T + + Y + +P ++ +++ T + Sbjct: 5 LPDGWEVKKLGDIGNISAGGTPSRSK-PEYWNNGSIPWVKIADMKEKHVKNTSEFITEEG 63 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAH 124 L K S KI + ++ S VG L + S + K + ++ + Sbjct: 64 LNKSSAKIFKKGTILISIFASLGTVGI-----LDIDASTNQAIAGINVNSKKVIPEYLYY 118 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + KS +N G NNI + I +PPL Q+ I E L+ + ++ + Sbjct: 119 YLKS--LKNYFMGAGRGVAQNNINLSILKDTEIFVPPLETQQKIVEILEKIEYGINLREK 176 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + ++K + G V+ P KK+ + + ++ +G S + Sbjct: 177 AILETENLVKAVFLDMFGDPVSN----------PMGWDVKKI--GTFVNDIISGWSVGGD 224 Query: 245 ESG---VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 E +L+ISSV +G ++ + + ++ H L GDLLF+R N + E V Sbjct: 225 ERPKKADELAVLKISSVTSGKFKSSEHKVVNSEITKKLVHPL-KGDLLFSRAN-TRELVA 282 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIE-IFFSSPSARNAMMNCVKTTSG-QKG 359 ++ + +L PDKL + L K+ + Y P+ R + TSG Sbjct: 283 AVCIVDN-DYMDLFLPDKLWKIILNKNIVSSYYFRQVLQDPTYRANLTKKATGTSGSMLN 341 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 IS + +PP+ Q + + +E+L + I+++ N+ + +L L KAF+ Sbjct: 342 ISKSKLIENEFPIPPIGLQNKFAKIIEKL----EEIKEKQENSKKEMEDLFNLSLQKAFK 397 Query: 420 GEL 422 GEL Sbjct: 398 GEL 400 >UniRef50_A4XMW3 Restriction modification system DNA specificity domain n=1 Tax=Caldicellulosiruptor saccharolyticus DSM 8903 RepID=A4XMW3_CALS8 Length = 455 Score = 261 bits (669), Expect = 3e-68, Method: Composition-based stats. 
Identities = 70/452 (15%), Positives = 159/452 (35%), Gaps = 47/452 (10%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDL--VF 61 + P+ W I + LI G+ K D+ +P + +I +G+ + +D+ + Sbjct: 22 EFPKEWTIVSLERDCVLISGLRPKGG-----ASDEGIPSLGGEHITLDGRINFSDVNAKY 76 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 +P+ K K DI+I + V + + + ++R +KL Sbjct: 77 IPEKFFKIMTKGKTEENDILINKDGANTGKVAM-LKKKFYKDIAINEHLFIIRSKKLFVQ 135 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ ++ S + +I+ G+ + +P PPL EQ+ IAE L+T+ + + Sbjct: 136 QYLFYWLFSRFGQKQITDRITGSAQPGLSSTFIKNFLVPRPPLPEQRKIAEILETIDSAI 195 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAV---NGKLTEKWR--------------NFEPQHSV 222 + T A E+ +I + Q +L V +E+WR P+ Sbjct: 196 EKTDAIIEKYKRIKQGLMQDLLTKGVVSEGEGESERWRLRDENIDKFKDSPLGRIPEEWE 255 Query: 223 FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLEC-SESELNR 280 + L +++P P L + G + +++ + Sbjct: 256 VVDVYGRVNLINGGTPSTARPEFWNGSIPWLSVEDFNIGKRWVFSSSKYITELGLKQSAT 315 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA--RLTKDALPEYIEIFF 338 L+ G L+ + VGV ++ + + +++ Sbjct: 316 KLLKKGMLIISARG----TVGVLA----QLGADMAFNQSCYGLDAKDKMKLSNDFLYYAL 367 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 + + ++ + I+ + K ++ LPP+ EQ I + Q D + ++ Sbjct: 368 KN--FITSFLSLA-YGNVFNTITRETFKEILIPLPPLPEQQRIASILSQ----IDEVIEK 420 Query: 399 VNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 ++ + + ++ G++ E Sbjct: 421 EQAYKEKLERIKKGLMEDLLTGKVRVNHLIEE 452 Score = 132 bits (332), Expect = 4e-29, Method: Composition-based stats. 
Identities = 47/211 (22%), Positives = 84/211 (39%), Gaps = 21/211 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK--FDTTDLVF 61 G++PE W + V LI G T + + +P + + GK ++ Sbjct: 248 GRIPEEWEVVDVYGRVNLINGGTPSTAR--PEFWNGSIPWLSVEDFNIGKRWVFSSSKYI 305 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL--IFS 119 L + + K+ + ++I + G+ L + +F C L + + + Sbjct: 306 TELGLKQSATKLLKKGMLIISARGTVG-----VLAQLGADMAFNQSCYGLDAKDKMKLSN 360 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ + K + SL+ G N I +F I IP+PPL EQ+ IA L Q+ Sbjct: 361 DFLYYALK--NFITSFLSLAYGNVFNTITRETFKEILIPLPPLPEQQRIASILS----QI 414 Query: 180 DSTKAR----FEQIPQILKRFRQAVLGGAVN 206 D + E++ +I K + +L G V Sbjct: 415 DEVIEKEQAYKEKLERIKKGLMEDLLTGKVR 445 >UniRef50_A2TQ01 Possible type I restriction-modification system, S subunit n=4 Tax=Bacteria RepID=A2TQ01_9FLAO Length = 444 Score = 261 bits (669), Expect = 3e-68, Method: Composition-based stats. Identities = 77/438 (17%), Positives = 157/438 (35%), Gaps = 34/438 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W + + ++ + + E ++ ++ + + N + ++ F+P Sbjct: 23 GEIPEHWQLGRLGSILNPVSSKNHPNETLLSITREKGVIVRDIEN------EDSNHNFIP 76 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 +L + + + + G S++ K I F Sbjct: 77 DDLT-GYKLLKKGQFGMNKMKAWQGSYGVSSYT-----GIVSPAYYTFEFTKEIEPRFFH 130 Query: 124 HFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +S +Y + S G ++ I + +PPL EQ IAE LD ++D Sbjct: 131 IAIRSKMYVSFFGKASDGVRIGQWDLSKDRMKRIPLAVPPLPEQTAIAEFLDDKTTKIDD 190 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 +Q +LK +Q ++ AV L + +W P+H K+ + L Sbjct: 191 AIGIKQQQINLLKERKQILIHKAVTRGLDDSVTLKDSGVEWIGEIPEHWKVKRFRYIFQL 250 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRAGH---VDQN--DIRFLECSESELNRH-KLQDG 286 + +K N G + + + + VD N ++ ++ E N + +++G Sbjct: 251 GKGLT--ITKENLKEEGVFCVNYGEIHSKYGFEVDTNIQQLKCVDDDYLESNTNALIKEG 308 Query: 287 
DLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNA 346 D +F + +E G LK Y + A+ + F S S RN Sbjct: 309 DFVFADTSEDIEGSGNFTYLKSKDEIFAGY--HTVVAKPKFKINSRFFAYVFESQSFRNQ 366 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + VK ++ +K V P ++EQ EIV ++ +T + ++ Sbjct: 367 IRTKVK-GVKVYSVTQSILKEPNVWYPSIQEQREIVDFLDIGTRKIETAIGLKEQEIEKL 425 Query: 407 NNLTQSILAKAFRGELTA 424 S++ G++ Sbjct: 426 KEYKGSLINGVVTGKVRV 443 Score = 149 bits (377), Expect = 2e-34, Method: Composition-based stats. Identities = 48/213 (22%), Positives = 91/213 (42%), Gaps = 11/213 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-----FDTT 57 G++PE W + + L +G+T KE LK++ + + I + + Sbjct: 232 IGEIPEHWKVKRFRYIFQLGKGLTITKEN----LKEEGVFCVNYGEIHSKYGFEVDTNIQ 287 Query: 58 DLVFVPKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK 115 L V + ++ + I D V A +S G + E G V +P+ Sbjct: 288 QLKCVDDDYLESNTNALIKEGDFVFADTSEDIEGSGNFTYLKSKDEIFAGYHTVVAKPKF 347 Query: 116 LIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I S F A+ +S +RN+I + G + ++ + N+ P + EQ+ I + LD Sbjct: 348 KINSRFFAYVFESQSFRNQIRTKVKGVKVYSVTQSILKEPNVWYPSIQEQREIVDFLDIG 407 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGK 208 ++++ EQ + LK ++ +++ G V GK Sbjct: 408 TRKIETAIGLKEQEIEKLKEYKGSLINGVVTGK 440 Score = 112 bits (280), Expect = 3e-23, Method: Composition-based stats. 
Identities = 35/233 (15%), Positives = 67/233 (28%), Gaps = 21/233 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H +L + N +SSK + + I R V ++ D Sbjct: 20 EWLGEIPEHWQLGRLG------SILNPVSSKNHPNETLLSITREKGVIVRDIENEDSNHN 73 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + L+ G + GV + TK+ P Sbjct: 74 FIPDDLTGYKLLKKGQFGMNKMKAWQGSYGVSSY-------TGIVSPAYYTFEFTKEIEP 126 Query: 332 EYIEIFFSSPSARNAMMNCVKT-TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 + I S + GQ +S +K + +PP+ EQ I ++ Sbjct: 127 RFFHIAIRSKMYVSFFGKASDGVRIGQWDLSKDRMKRIPLAVPPLPEQTAIAEFLDDKTT 186 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + Q ++ KA R + + ++ + Sbjct: 187 KIDDAIGIKQQQINLLKERKQILIHKAVT-------RGLDDSVTLKDSGVEWI 232 >UniRef50_A4U327 Type I restriction-modification system, S subunit n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U327_9PROT Length = 431 Score = 261 bits (668), Expect = 4e-68, Method: Composition-based stats. Identities = 79/442 (17%), Positives = 160/442 (36%), Gaps = 56/442 (12%) Query: 4 GKLPEGWVIAPVS-TVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF--DTTDLV 60 G++P W + P+ + L G + DD IR N+ D +D+ Sbjct: 18 GEVPGHWDVFPLKRDLAFLTSG----SRGWAEHYSDDGALFIRIGNLTRDGIHLDLSDIQ 73 Query: 61 FV--PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKL- 116 V P E ++ D++ +++ + +G A E + R + Sbjct: 74 RVEVPDGAEGERTRVVGGDVLFSIT----AYLGSVAVAPEELEVAYVSQHVALARLHQRR 129 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 ++ + T S++ + + G + + + PPL EQ IA LD Sbjct: 130 FIPAWVGYVTLSNIGETYLGTQGYGGTKVQLSLDDVANLIMTAPPLPEQSAIAAFLDRQT 189 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLN 227 ++D+ A E++ +L RQAV+ AV L +W P+H L Sbjct: 190 GKIDALVAEQERLLTLLAEKRQAVISHAVTKGLNPAAPMKDSGIEWLGEVPEHWKVIPLR 249 Query: 228 FESILTELR----NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL 283 + +G+ ++ +E P++ G + + Sbjct: 250 WFCTCKSGDSISADGVEAECDEDRTA-PVIG----GNGVMGYTYAPNITHPV-------- 296 Query: 284 
QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 L+ R +CG + ++ + + LI EY+ S + Sbjct: 297 ----LVIGRVGA------LCGNVHSIKLPAWVTDNALILDIAEGVFNQEYLSHLLRSRN- 345 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 +N + + + Q I+G ++ Q + L P+ EQ+ IV + + A DT+ + A+ Sbjct: 346 ----LNEIASKTAQPLITGSQVRDQRIPLAPMDEQSAIVEFLNEQTAKIDTLTAEALRAI 401 Query: 404 ARVNNLTQSILAKAFRGELTAQ 425 A + ++++ A G++ + Sbjct: 402 ALLKEHRSALISAAVTGKIDVR 423 Score = 147 bits (371), Expect = 1e-33, Method: Composition-based stats. Identities = 40/237 (16%), Positives = 89/237 (37%), Gaps = 20/237 (8%) Query: 212 KWRNFEPQHSVF-KKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV--DQNDI 268 +W P H + LT G + + S G +RI ++ + D +DI Sbjct: 15 EWLGEVPGHWDVFPLKRDLAFLTSGSRGWAE--HYSDDGALFIRIGNLTRDGIHLDLSDI 72 Query: 269 RFLEC-SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 + +E +E R ++ GD+LF+ ++G + + + + ARL + Sbjct: 73 QRVEVPDGAEGERTRVVGGDVLFSIT----AYLGSVAVAPEELEVAYV-SQHVALARLHQ 127 Query: 328 D-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 +P ++ S + + + +S D+ + ++ PP+ EQ+ I ++ Sbjct: 128 RRFIPAWVGYVTLSNIGETYLGTQGYGGTKVQ-LSLDDVANLIMTAPPLPEQSAIAAFLD 186 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + D + + L + Q++++ A + NP ++ L Sbjct: 187 RQTGKIDALVAEQERLLTLLAEKRQAVISHAVT-------KGLNPAAPMKDSGIEWL 236 >UniRef50_A7N438 Putative uncharacterized protein n=1 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N438_VIBHB Length = 432 Score = 261 bits (668), Expect = 4e-68, Method: Composition-based stats. 
Identities = 102/432 (23%), Positives = 190/432 (43%), Gaps = 33/432 (7%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQKI 73 + + + RGV+YK E + D + +R+NNIQ+G + ++ VP +LV +SQ + Sbjct: 11 RLGELASGNRGVSYKPENLKAAIDDKSVVFLRSNNIQSGTLNFENVQIVPDSLVSDSQIL 70 Query: 74 SPEDIVIAMSSGSKSVVGKSAHQ--HLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 DI + MS+GS+ +VGKS + + + GAFC V R + S ++ + +S Y Sbjct: 71 KKGDIAVCMSNGSRQLVGKSGMLQHEVEYPLTVGAFCSVFRCQNEDDSEYVRYLFQSQAY 130 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 ++ I AG+ INN+K + + I +P P A +K IAE L T+ Q+D+T+A ++ Sbjct: 131 QHGIDVTLAGSAINNLKNSDVEAIEVPTAPKALRKKIAEILSTIDNQIDATQALIDKYTA 190 Query: 192 ILKRFRQAVLGGAV---NGKLTE-----------KWRNFEPQHSVFKKLNFESILTELRN 237 I + + + L P+ K L S ++ + Sbjct: 191 IKQGMMADLFSRGIDPETKALRPTLEEAPELYHKTPLGMLPKGWDVKTLGDISE--KITS 248 Query: 238 GLSS-KPNESGVGHPILRISSVRAGH--VDQNDIRFLEC-SESELNRHKLQDGDLLFTRY 293 G S G +RIS++ H + ++ + SE R +LQ GD+L + Sbjct: 249 GSRDWAKFYSPEGDLFVRISNLTREHVNFRWDSVKHVNIGGGSEGERTQLQPGDILVSIT 308 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPD-KLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 +G+ G++ + + + LIR + +I + SS + Sbjct: 309 ----ADLGIVGVVPENMGRAYINQHTALIRLSTYGE-NARFIGNYLSSRCGQEQFEKNND 363 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + + GI+ I S +P KEQ I +++ L ++++ + +L+ L Q Sbjct: 364 SGAK-AGINLPTIASLRCPIPEEKEQLLIASKIDALDEVIADLKREKSKSLS----LKQG 418 Query: 413 ILAKAFRGELTA 424 ++ G+++ Sbjct: 419 LMQDLLTGKVSV 430 Score = 127 bits (319), Expect = 1e-27, Method: Composition-based stats. 
Identities = 34/209 (16%), Positives = 73/209 (34%), Gaps = 13/209 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD--TTDLVF 61 G LP+GW + + ++ I + + +R +N+ + + Sbjct: 228 GMLPKGWDVKTLGDISEKITSG---SRDWAKFYSPEGDLFVRISNLTREHVNFRWDSVKH 284 Query: 62 VPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPEK-LI 117 V E ++ P DI+++++ + +G ++R Sbjct: 285 VNIGGGSEGERTQLQPGDILVSIT----ADLGIVGVVPENMGRAYINQHTALIRLSTYGE 340 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + FI ++ S + + + I + + PIP EQ +IA K+D L Sbjct: 341 NARFIGNYLSSRCGQEQFEKNNDSGAKAGINLPTIASLRCPIPEEKEQLLIASKIDALDE 400 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVN 206 + K + + + Q +L G V+ Sbjct: 401 VIADLKREKSKSLSLKQGLMQDLLTGKVS 429 Score = 122 bits (308), Expect = 2e-26, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 82/219 (37%), Gaps = 13/219 (5%) Query: 226 LNFESILTELRNGLSSKPNE-----SGVGHPILRISSVRAGHVDQNDIRFLECSESELNR 280 L L G+S KP LR +++++G ++ +++ + S ++ Sbjct: 9 LQRLGELASGNRGVSYKPENLKAAIDDKSVVFLRSNNIQSGTLNFENVQIVPDSLVSDSQ 68 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 L+ GD+ NGS + VG G+L+ L R + EY+ F S Sbjct: 69 -ILKKGDIAVCMSNGSRQLVGKSGMLQHEVEYPLTVGAFCSVFRCQNEDDSEYVRYLFQS 127 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + ++ + + S + D+++ V P + +I + + D Sbjct: 128 QAYQHGIDVTLA-GSAINNLKNSDVEAIEVPTAPKALRKKIAEILSTIDNQIDA----TQ 182 Query: 401 NALARVNNLTQSILAKAF-RGELTAQWRAENPDLISGEN 438 + + + Q ++A F RG + + +A P L Sbjct: 183 ALIDKYTAIKQGMMADLFSRG-IDPETKALRPTLEEAPE 220 >UniRef50_UPI0001BC364B restriction modification system DNA specificity subunit n=1 Tax=Butyrivibrio crossotus DSM 2876 RepID=UPI0001BC364B Length = 428 Score = 261 bits (668), Expect = 4e-68, Method: Composition-based stats. 
Identities = 92/435 (21%), Positives = 180/435 (41%), Gaps = 38/435 (8%) Query: 3 AGKLPEGWVIAPVS---TVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL 59 GK+PE W + ++ I G + + Q ++ K I +G Sbjct: 12 VGKIPENWKVLKNKYNFELSKEIIGTKWVETQLLSLTK------YGVKAINDG----EQT 61 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVV--GKSAHQHLPFECSFGAFCGVLRPEKLI 117 VP++L QK++ +DIV+ + S V G S F+ +R + + Sbjct: 62 GKVPESL-STYQKVNKDDIVMCLFDLDCSAVFSGIS-----NFDGMISPAYKCIRCKPHL 115 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ ++ ++ K S +I F + I +PP+ QK IAE L+ Sbjct: 116 CPQYVDYYFRTVFVDRKYKRYSKNVRF-SISSDEFMNLPIIVPPIDIQKKIAEFLNFKCF 174 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNF 228 ++D+ + E+ + L+ ++++++ AV L + P+H L + Sbjct: 175 EIDTLHSDIEKQIKTLEEYKKSIITEAVTKGLDPDVEMKDSGISYIGNIPKHWKVTNLKY 234 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLECSESELNRHKLQDGD 287 +NG+S G G P + V + + QN + +++E N + ++ GD Sbjct: 235 LGKC---QNGISKGGEYFGNGFPFVSYGDVYKNYSIPQNVDGLIMSTKTEQNIYSVKYGD 291 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNA 346 + FTR + ++E +G K N ++ LIR R + D +PE+ + +F S R Sbjct: 292 VFFTRTSETIEEIGFASTCLKSID-NSVFAGFLIRFRPTSSDLIPEFSKFYFRSNIHRKF 350 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + + +S + VLLPP+ EQ I + +E+ A D ++ L + Sbjct: 351 FVKEMNL-VTRASLSQNLLGRLPVLLPPLCEQQMIAKNLEKKCAEIDGAIEEKKEQLETL 409 Query: 407 NNLTQSILAKAFRGE 421 +S++ + G+ Sbjct: 410 EQYKKSLIYEYVTGK 424 Score = 133 bits (335), Expect = 2e-29, Method: Composition-based stats. 
Identities = 29/212 (13%), Positives = 80/212 (37%), Gaps = 9/212 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-- 60 G +P+ W + + + G++ + P + ++ ++ Sbjct: 220 IGNIPKHWKVTNLKYLGKCQNGIS-----KGGEYFGNGFPFVSYGDVYKNYSIPQNVDGL 274 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLRP-EKLIF 118 + + + D+ +S + +G ++ + F F RP + Sbjct: 275 IMSTKTEQNIYSVKYGDVFFTRTSETIEEIGFASTCLKSIDNSVFAGFLIRFRPTSSDLI 334 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F + +S+++R ++ + + +PPL EQ++IA+ L+ A+ Sbjct: 335 PEFSKFYFRSNIHRKFFVKEMNLVTRASLSQNLLGRLPVLLPPLCEQQMIAKNLEKKCAE 394 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 +D ++ + L++++++++ V GK Sbjct: 395 IDGAIEEKKEQLETLEQYKKSLIYEYVTGKKE 426 Score = 111 bits (278), Expect = 6e-23, Method: Composition-based stats. Identities = 27/230 (11%), Positives = 76/230 (33%), Gaps = 21/230 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P++ K + L++ G + + ++ G + Sbjct: 10 EWVGKIPENWKVLKNKYNFELSKEIIGTKWVETQLLSLTKY-GVKAINDG----EQTGKV 64 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 S S + + D++ ++ + + + + + R P Sbjct: 65 PESLSTYQK--VNKDDIVMCLFDLDC-----SAVFSGISNFDGMISPAYKCIRCKPHLCP 117 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y++ +F + K + IS + + +++PP+ Q +I + Sbjct: 118 QYVDYYFRTVFVDRKYKRYSKN--VRFSISSDEFMNLPIIVPPIDIQKKIAEFLNFKCFE 175 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAA 441 DT+ + + + +SI+ +A + +PD+ ++ + Sbjct: 176 IDTLHSDIEKQIKTLEEYKKSIITEAVT-------KGLDPDVEMKDSGIS 218 >UniRef50_Q7UE18 Restriction modification system S chain homolog n=1 Tax=Rhodopirellula baltica RepID=Q7UE18_RHOBA Length = 389 Score = 261 bits (667), Expect = 4e-68, Method: Composition-based stats. 
Identities = 81/416 (19%), Positives = 166/416 (39%), Gaps = 37/416 (8%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 +S + G T + + Y D +P +++ ++ T L + S Sbjct: 5 EVALSEICDTGSGGTPSRAKQEIYY-DGSIPWVKSGELRESVITETGESITELGLKESSA 63 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 K+ P D ++ G + VG+ + + A C ++ + + ++ H +S + Sbjct: 64 KLLPADTLLVALYG--ATVGRVGMLGIE-AATNQAVCYLIPDDTRVERRYLYHALRSKV- 119 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 + G NI IP+PPL+EQK IAE LD ++ +A+ Sbjct: 120 -PYWLTQRVGGGQPNISQGVIKNTKIPLPPLSEQKRIAEILDRA----EALRAKRRAALA 174 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL---SSKPNESGV 248 +L Q++L ++G +I ++ G+ + K Sbjct: 175 LLDELTQSILARLLDGSAD------------LGTTTLGNISRDMHQGINTVTEKIEYQND 222 Query: 249 GHPILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G PI++ G++D +D RF+ + +++ DLL +G L+ Sbjct: 223 GFPIIQSKHTTQGYLDLSDARFVSKATYLKYKEKYRPARNDLLLCNIG----TIGKSLLM 278 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 ++ + + LI+ L P + + +F ++++ + + K IS K + Sbjct: 279 EQENDFLIAWNLFLIKLDL-DQVSPSFCKHYFDRLASQHYFDRFLTGGT-VKFISKKTLN 336 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + LP + Q E E+ A + ++++ +A+A ++ L S+ +AFRGEL Sbjct: 337 ATPIPLPSMDRQREF----EEQIASVEVLKEKHRSAVAELDQLFASLQHRAFRGEL 388 >UniRef50_Q1VR15 Type I restriction-modification enzyme 1, S subunit n=1 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VR15_9FLAO Length = 441 Score = 261 bits (667), Expect = 4e-68, Method: Composition-based stats. 
Identities = 78/426 (18%), Positives = 154/426 (36%), Gaps = 21/426 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDL--VF 61 G +PE W + + + +G K+ + + LP +R I T + Sbjct: 32 GWIPEDWNVKSLDQLGEFSKGKGITKKDILED-EVGGLPCVRYAEIYTIYHYNTTVLKSK 90 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + S I+ DI+ A S + +GKS G +L+ F Sbjct: 91 INQESAANSNPINCGDILFAGSGETLEDIGKSIAYLNKETAYAGGDICILKHHNQ-DPQF 149 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + + + R+++ + G ++ +I + +++PIPPL EQ+ IA L+T + + Sbjct: 150 LGYLFNNDVVRSQLYKIGQGHSVVHIYSSGLKKVSVPIPPLPEQQKIASILNTWDKAIAA 209 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 + Q + Q +L + F +++ + I+ L Sbjct: 210 QEKLIAQKQALKNGLMQQLLT---------GKKRFAGFVEEWEEKSLNDIVKYLGGEAFK 260 Query: 242 KPNESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGDLLFTRYNGSLEF 299 N+ G L+I++V G V D E ++ L+ GD + L Sbjct: 261 STNQVENGVRWLKIANVGIGVVKWGDSTTFLPTSFIDENPKYVLKAGDAVMALTRPILND 320 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 + K + L ++ + ++I +P MN + + Sbjct: 321 KLKIAVFNK-EDGIALLNQRVAKLISKNKNDLKFIYYIHQTPYFI-YTMNAMMAGTDPPN 378 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 IS KD+ + V +P +EQ +IV +E D + N + Q ++ + Sbjct: 379 ISIKDLAKKKVFIPGYEEQKKIVSVIESFDNEIDNLI----NKGKHLKKQKQGLMQQLLT 434 Query: 420 GELTAQ 425 GE + Sbjct: 435 GEKRVK 440 Score = 120 bits (302), Expect = 1e-25, Method: Composition-based stats. 
Identities = 34/217 (15%), Positives = 79/217 (36%), Gaps = 12/217 (5%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP--NESGVGHPILRISSVRAGH-VD 264 + + P+ K L+ ++ + G++ K + G P +R + + + + Sbjct: 25 GYKKTKLGWIPEDWNVKSLDQLGEFSKGK-GITKKDILEDEVGGLPCVRYAEIYTIYHYN 83 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 ++ ES N + + GD+LF +LE +G + L + + + Sbjct: 84 TTVLKSKINQESAANSNPINCGDILFAGSGETLEDIGKS--IAYLNKETAYAGGDICILK 141 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 P+++ F++ R+ + + I +K V +PP+ EQ +I Sbjct: 142 HHNQ-DPQFLGYLFNNDVVRSQLYK-IGQGHSVVHIYSSGLKKVSVPIPPLPEQQKIASI 199 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + D +A+ L ++ + G+ Sbjct: 200 LNTW----DKAIAAQEKLIAQKQALKNGLMQQLLTGK 232 >UniRef50_A7VYZ3 Putative uncharacterized protein n=1 Tax=Clostridium leptum DSM 753 RepID=A7VYZ3_9CLOT Length = 444 Score = 261 bits (667), Expect = 5e-68, Method: Composition-based stats. Identities = 98/446 (21%), Positives = 184/446 (41%), Gaps = 35/446 (7%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 K+P+ W S + LI G K + +P I G + + VF + Sbjct: 29 KVPKNWCWVRFSKIINLISGRDAKLTDCNSL--GIGIPYI------LGASNLENNVFTIE 80 Query: 65 NLVKESQKI-SPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 ++ Q I D+++++ K +GK + + + +R +F F Sbjct: 81 RWIENPQVISLKNDVLLSV----KGTIGKV-YLQKEEKVNISRQIMAIRTSSTLFPRFTY 135 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + G I I +P PPL EQ+ I +++++L A++D K Sbjct: 136 WLVN--NISDSFRQAGNGL-IPGISREDILQKEVPFPPLPEQQRIVDRIESLFAKLDEAK 192 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + ++ + + A+L A G+LT +WR + + + ++R+G P Sbjct: 193 QKTQEALNSYETRKAAILHKAFTGELTARWRKEHGLGMESWEKYKFNDILDVRDGTHDSP 252 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVG 301 G P++ +++ G + D++F+ + + R K+ GD+LF +G Sbjct: 253 TYFDQGFPLITSKNLKDGKITDKDLKFISKEDYDKINERSKVDIGDILFAMIG----TIG 308 Query: 302 
VCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 +++ + + + A P +++ F S + M K S QK +S Sbjct: 309 NPVVVETQPKFAI---KNVALFKNIGKASPYFVKYFLESKKVIDRMEKDAK-GSTQKFVS 364 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 +++ +LLP KEQ EIVR ++ L A ++ L +++ + +SILA+AFRGE Sbjct: 365 LGYLRAFNILLPKSKEQTEIVRILDDLLAKEQQAKEAAEAVLDQIDLMKKSILARAFRGE 424 Query: 422 LTAQWRAENPDLISGENSAAALLEKI 447 L AE SA L++ I Sbjct: 425 LGTNNPAEE--------SAVELVKNI 442 Score = 118 bits (296), Expect = 5e-25, Method: Composition-based stats. Identities = 56/250 (22%), Positives = 96/250 (38%), Gaps = 28/250 (11%) Query: 184 ARFEQIPQIL-KRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 A+ ++ + + QA L + P++ + + + L R+ + Sbjct: 2 AKAKKKETLTPEERLQAAL------VPDWEQPYKVPKNWCWVRFSKIINLISGRDAKLTD 55 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 N G+G P + G + + F E + D+L + +G Sbjct: 56 CNSLGIGIPYI------LGASNLENNVFTIERWIENPQVISLKNDVLLSVKG----TIGK 105 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFS--SPSARNAMMNCVKTTSGQKGI 360 L + + + +++ R + P + + S S R GI Sbjct: 106 VYL---QKEEKVNISRQIMAIRTSSTLFPRFTYWLVNNISDSFRQ------AGNGLIPGI 156 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 S +DI + V PP+ EQ IV R+E LFA D +++ AL +IL KAF G Sbjct: 157 SREDILQKEVPFPPLPEQQRIVDRIESLFAKLDEAKQKTQEALNSYETRKAAILHKAFTG 216 Query: 421 ELTAQWRAEN 430 ELTA+WR E+ Sbjct: 217 ELTARWRKEH 226 >UniRef50_A3XVN0 Type I restriction-modification system, S subunit n=1 Tax=Vibrio sp. MED222 RepID=A3XVN0_9VIBR Length = 424 Score = 261 bits (667), Expect = 5e-68, Method: Composition-based stats. 
Identities = 73/432 (16%), Positives = 172/432 (39%), Gaps = 37/432 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTT-DLVFVPKN 65 E W ++ +S + I+ T+ + +PL+ A N+ +GK + V + Sbjct: 13 EDWNVSNLSECSLFIKDGTHGTHKRTPT----GIPLLSAKNVTASGKIKWDVNDSLVSEA 68 Query: 66 LVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGF 121 + ++ +D+++ + +G+ A + + GV+RP+K + F Sbjct: 69 DYSKIHSKYELEKDDLLLTV----VGTLGRRALVDGSAKFTIQRSVGVIRPDKNKVTPNF 124 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I HF S ++N++ + + + +P PPL EQK IA L ++ ++ Sbjct: 125 IFHFCGSDFFQNQLELRANATAQAGVYLGELAKVPVPSPPLPEQKKIAAILTSVDEVIEK 184 Query: 182 TKARFEQIPQILKRFRQAVLG--GAVNGKLTEKWR----NFEPQHSVFKKLNFESILTEL 235 T+A+ +++ + Q +L V+GK +++ P+ +L+ + + + Sbjct: 185 TQAKIDKLKDLKTGMMQELLTCGVGVDGKPHTEFKDSPVGRVPKGWEVVELDRAAKVIDC 244 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECS--ESELNRHKLQDGDLLFTRY 293 + + P G P+++ ++R G ++ + + ++ H GD++++R Sbjct: 245 K---HATPKYFSNGFPVVKPGNIREGFLELRGCSLTDKAGFDNLNENHTPTIGDIIYSR- 300 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 N + GV + + + K ++ +SP + + + Sbjct: 301 NQTY---GVGAYVNRSME--FCIGQDVCVISPKK-CNSIFLFYMINSPLVKEQV-ELLAA 353 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 S K I+ I+ + LP ++EQ I +F D + L + + +++ Sbjct: 354 GSTFKRINLGSIRKLKIALPCIEEQQAIGA----VFESIDNKVSLLEKKLIKKKDTKKAL 409 Query: 414 LAKAFRGELTAQ 425 + G+ + Sbjct: 410 MQDLLTGKKRVK 421 Score = 128 bits (322), Expect = 5e-28, Method: Composition-based stats. 
Identities = 39/203 (19%), Positives = 81/203 (39%), Gaps = 13/203 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+GW + + +I + A + P+++ NI+ G + Sbjct: 223 VGRVPKGWEVVELDRAAKVID-----CKHATPKYFSNGFPVVKPGNIREGFLELRGCSLT 277 Query: 63 PK---NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 K + + E+ + DI+ + G A+ + E G V+ P+K S Sbjct: 278 DKAGFDNLNENHTPTIGDIIYSR----NQTYGVGAYVNRSMEFCIGQDVCVISPKK-CNS 332 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ + S L + ++ L+AG+ I S + I +P + EQ+ I +++ +V Sbjct: 333 IFLFYMINSPLVKEQVELLAAGSTFKRINLGSIRKLKIALPCIEEQQAIGAVFESIDNKV 392 Query: 180 DSTKARFEQIPQILKRFRQAVLG 202 + + + K Q +L Sbjct: 393 SLLEKKLIKKKDTKKALMQDLLT 415 >UniRef50_C1SJS8 Restriction endonuclease S subunit n=1 Tax=Denitrovibrio acetiphilus DSM 12809 RepID=C1SJS8_9BACT Length = 441 Score = 260 bits (666), Expect = 6e-68, Method: Composition-based stats. Identities = 74/446 (16%), Positives = 151/446 (33%), Gaps = 43/446 (9%) Query: 4 GKLPEGWVIAPVSTVTTLIRG------VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT 57 G++PE W I L G + + L D+ + + NI FD Sbjct: 18 GEIPEHWAIERFK--FQLRAGFEGLKIGPFGSQIKAELLSDEGIKVYGQENIIKNNFDLG 75 Query: 58 DLVFVPKNLV--KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE 114 FV + L E + P DI++ M G+ + LR Sbjct: 76 -HRFVSEELFCELEVYETLPGDILVTMM----GTAGRCQVTPEKINQGIIDSHLIRLRVN 130 Query: 115 KLIFSGFIAHFTK-SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 K + S F + S+ ++I + G+ ++ + + +PPL EQ II + LD Sbjct: 131 KCLLSRFCKYLINDSAYIEHQIRLMGKGSIMHGLNSTIIKNLIFILPPLKEQSIILKYLD 190 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFK 224 AQ+D + +++ + L R A++ AV + +W P+H Sbjct: 191 KKTAQIDELIDKKKKLIEKLDEKRTALITHAVTKGMNPDVKMKDSGVEWLGEVPEHWDIV 250 Query: 225 KLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQ 284 K + + + G S +++ + +G + ++ Sbjct: 251 KAKYLFTIEKRIAGFLGHDVLSITQTG-IKVKDIESGEGQLS--------MDYTKYQIVK 301 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSAR 344 
GD + + G + Q + PD + ++ P+Y Sbjct: 302 VGDFAMNHMDL------LTGYVDISQFDGVTSPDYRVFRLSAQNCNPQYYLYHMQRGYKE 355 Query: 345 NAMMNCVKTTS--GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 N ++ G+ + + K +PP +EQ I + D++ + + Sbjct: 356 KIFFNYGHGSAQLGRWRLPTDEFKELSFPVPPYEEQQAIAEYISSETILIDSLISKTEES 415 Query: 403 LARVNNLTQSILAKAFRGELTAQWRA 428 ++ + +++ A G++ + A Sbjct: 416 ISLLKEKRSALITAAVTGKIDVREEA 441 Score = 159 bits (404), Expect = 2e-37, Method: Composition-based stats. Identities = 44/238 (18%), Positives = 92/238 (38%), Gaps = 18/238 (7%) Query: 212 KWRNFEPQHSVFKKLNF-----ESILTELRNGLSSKPNE-SGVGHPILRISSVRAGHVDQ 265 +W P+H ++ F L G K S G + ++ + D Sbjct: 15 EWLGEIPEHWAIERFKFQLRAGFEGLKIGPFGSQIKAELLSDEGIKVYGQENIIKNNFDL 74 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 E EL ++ GD+L T G C + + +Q ++ LIR R+ Sbjct: 75 GHRFVSEELFCELEVYETLPGDILVTMMG----TAGRCQVTPEKINQGII-DSHLIRLRV 129 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 K L + + + + + + S G++ IK+ + +LPP+KEQ+ I++ + Sbjct: 130 NKCLLSRFCKYLINDSAYIEHQIRLMGKGSIMHGLNSTIIKNLIFILPPLKEQSIILKYL 189 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ A D + + + +++ +++ A + NPD+ ++ L Sbjct: 190 DKKTAQIDELIDKKKKLIEKLDEKRTALITHAVT-------KGMNPDVKMKDSGVEWL 240 >UniRef50_C9KLK0 Putative phosphoribosylformylglycinamidine synthase n=1 Tax=Mitsuokella multacida DSM 20544 RepID=C9KLK0_9FIRM Length = 489 Score = 260 bits (665), Expect = 1e-67, Method: Composition-based stats. 
Identities = 73/441 (16%), Positives = 145/441 (32%), Gaps = 65/441 (14%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 +PE WV + + + T+K K++ +P + NI N K D +++ ++ Sbjct: 66 DIPENWVWTRLEEILLSLTDGTHKT----PVYKNEGIPFLSVKNISNHKIDFSNIKYISI 121 Query: 65 NLVKE---SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + K+ DI+++ G E S +L+ I + + Sbjct: 122 DEHKKLCERCYPKKGDILLSK----VGTTGIPVIIDTEKEFSIFVSVALLKFSSSIDAKY 177 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + +S L + + + + G N +P+PPLAEQ I K++ L +D+ Sbjct: 178 LLFLLESPLVQEQCRTHTRGIGNKNWVLTDIANTIVPLPPLAEQHRIVAKIEELQPDIDA 237 Query: 182 TKARFEQIPQILKRF----RQAVLGGAVNGKLTE-------------------------- 211 ++ I + F ++++L A+ GKL Sbjct: 238 YDKAQTKLQSIEQSFPDAMKKSLLQYAIEGKLVPQRKEEGTAKDLLAKIRAEKARLVKEK 297 Query: 212 --------------KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS 257 + P + +L P G P L+ Sbjct: 298 KIKKSKPLPAITDDEKPFDIPDSWEWVRLGELGEWCSGATPSRQHPEYFGGKIPWLKTGD 357 Query: 258 VRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 + G++ + + + G +L Y ++ +G+ + Sbjct: 358 LNDGYIKEVPEYITDDGFKNSSTKINPIGSVLIAMYGATIGKLGILKI-------PATTN 410 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 L + +Y+ F + R + + Q IS I + V+ LPP+ E Sbjct: 411 QACCACELVHEMYNKYLFYFLFAN--RKYFIKKGAGGA-QPNISKAKITNTVMPLPPLAE 467 Query: 378 QAEIVRRVEQLFAYADTIEKQ 398 Q IV ++E+L + Q Sbjct: 468 QYRIVAKLEELLPLCQQLASQ 488 Score = 162 bits (412), Expect = 2e-38, Method: Composition-based stats. 
Identities = 53/247 (21%), Positives = 100/247 (40%), Gaps = 25/247 (10%) Query: 216 FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE 275 P++ V+ +L L L +G P G P L + ++ +D ++I+++ E Sbjct: 66 DIPENWVWTRLEEI--LLSLTDGTHKTPVYKNEGIPFLSVKNISNHKIDFSNIKYISIDE 123 Query: 276 SEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEY 333 + R + GD+L ++ + G+ ++ + ++ L + + +Y Sbjct: 124 HKKLCERCYPKKGDILLSKVGTT----GIPVIIDTEKEFSIFVSVAL--LKFSSSIDAKY 177 Query: 334 IEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYAD 393 + SP + G K DI + +V LPP+ EQ IV ++E+L D Sbjct: 178 LLFLLESPLVQEQCRTH-TRGIGNKNWVLTDIANTIVPLPPLAEQHRIVAKIEELQPDID 236 Query: 394 TIEKQVNNALARVNN-----LTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIK 448 + L + + +S+L A G+L Q + E +A LL KI+ Sbjct: 237 AY-DKAQTKLQSIEQSFPDAMKKSLLQYAIEGKLVPQRK--------EEGTAKDLLAKIR 287 Query: 449 AERAASG 455 AE+A Sbjct: 288 AEKARLV 294 >UniRef50_C5DB08 Restriction modification system DNA specificity domain protein n=1 Tax=Geobacillus sp. WCH70 RepID=C5DB08_GEOSW Length = 445 Score = 259 bits (663), Expect = 1e-67, Method: Composition-based stats. 
Identities = 73/445 (16%), Positives = 174/445 (39%), Gaps = 42/445 (9%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVT--YKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++P W I + V K + ++ + + I ++G Sbjct: 13 IGEIPSDWKILRLKNVLKERNEKNSPIKTNEILSLTIEKGV--IPYKEKKSGG------- 63 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 K + + P DIV+ + VG S + C + + + Sbjct: 64 NKAKEDLSNYKLAYPNDIVLNSMNVIVGAVGISKYYG----CVSPVYYVLYSDDVEQNIR 119 Query: 121 FIAHFTKSSLYRNKISSLSAGANIN------------NIKPASFDLINIPIPPLAEQKII 168 F + +SS ++ + L G + I + +P+PP++ Q+ I Sbjct: 120 FYNYLFQSSAFQKSLIGLGNGIMMKQSSTGKLNTIRLRIPLDRLKNVYLPVPPVSVQQKI 179 Query: 169 AEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQ 219 LD ++ +D+ + +Q + LK+++Q+++ V L +W P+ Sbjct: 180 VNFLDEKVSHIDTIIEKNKQSIEELKKYKQSLIAETVTKGLDPNVEMKDSGIEWVGEIPK 239 Query: 220 HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAG---HVDQNDIRFLECSES 276 H ++L SI+T S + NE +++ ++V ++ +D + SE+ Sbjct: 240 HWEIRRLRDISIITRGTVDKSKEKNEIP--VYLVQYTNVYYKREQKINDDDYLPITVSEN 297 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 E ++K++ GD+L T + + + +G ++ + N ++ +IR R+ + + Sbjct: 298 EYKKYKVRKGDILLTASSETKDDIGHSTVIVEDLP-NHVFGSDIIRIRIPNKIVDLNYKK 356 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 +F A + + + KS ++PP++EQ +I + ++ + + + + Sbjct: 357 YFMENYYYLAKFDKLSRGITRFRFGMDQFKSLKYVIPPIEEQVKIAKYLDNITNHINQLI 416 Query: 397 KQVNNALARVNNLTQSILAKAFRGE 421 + + + +S++ + G+ Sbjct: 417 CNKEKLINELESYKKSLIYEYVTGK 441 Score = 137 bits (346), Expect = 9e-31, Method: Composition-based stats. 
Identities = 46/214 (21%), Positives = 89/214 (41%), Gaps = 10/214 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG---KFDTTD- 58 G++P+ W I + ++ + RG K ++ + L++ N+ K + D Sbjct: 234 VGEIPKHWEIRRLRDISIITRGTVDKSKEKNEIP----VYLVQYTNVYYKREQKINDDDY 289 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFEC-SFGAFCGVLR-PEKL 116 L + K+ DI++ SS +K +G S FG+ +R P K+ Sbjct: 290 LPITVSENEYKKYKVRKGDILLTASSETKDDIGHSTVIVEDLPNHVFGSDIIRIRIPNKI 349 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + +F ++ Y K LS G F + IPP+ EQ IA+ LD + Sbjct: 350 VDLNYKKYFMENYYYLAKFDKLSRGITRFRFGMDQFKSLKYVIPPIEEQVKIAKYLDNIT 409 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++ E++ L+ ++++++ V GK Sbjct: 410 NHINQLICNKEKLINELESYKKSLIYEYVTGKKE 443 Score = 112 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 33/244 (13%), Positives = 88/244 (36%), Gaps = 34/244 (13%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P +L +N IL ++ + G + + + Sbjct: 11 EWIGEIPSDWKILRLKNVLKERNEKNSPIKTNE-------ILSLT-IEKGVIPYKEKKSG 62 Query: 272 -ECSESELNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 ++ +L+ +KL D++ N + VG+ ++ + P + + Sbjct: 63 GNKAKEDLSNYKLAYPNDIVLNSMNVIVGAVGIS------KYYGCVSPVYYVLYSDDVEQ 116 Query: 330 LPEYIEIFFSSPSARNAMM--------NCVKTT---SGQKGISGKDIKSQVVLLPPVKEQ 378 + F S + + +++ T + + I +K+ + +PPV Q Sbjct: 117 NIRFYNYLFQSSAFQKSLIGLGNGIMMKQSSTGKLNTIRLRIPLDRLKNVYLPVPPVSVQ 176 Query: 379 AEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGEN 438 +IV +++ ++ DTI ++ ++ + QS++A+ + +P++ ++ Sbjct: 177 QKIVNFLDEKVSHIDTIIEKNKQSIEELKKYKQSLIAETVT-------KGLDPNVEMKDS 229 Query: 439 SAAA 442 Sbjct: 230 GIEW 233 >UniRef50_C0N6F0 Type I restriction modification DNA specificity domain protein n=1 Tax=Methylophaga thiooxidans DMS010 RepID=C0N6F0_9GAMM Length = 454 Score = 259 bits (663), Expect = 2e-67, Method: Composition-based stats. 
Identities = 69/439 (15%), Positives = 175/439 (39%), Gaps = 41/439 (9%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+ W++ + +++ G + + + + P ++ N + G +F Sbjct: 26 PKEWMLTRLKFTSSINMGQSPNSDDCND--EGHGRPFLQ-GNAEFGMRTPKAKLFCEAA- 81 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFT 126 + S D+++++ + VG+ + + G + + + + F+ Sbjct: 82 ---KKTCSEGDVLLSVR----APVGELNIANQEYG--IGRGLCAITA-QSVKADFMWWLL 131 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 ++S+ +++ +++ G+ + + +P +EQ IA LD A++D + Sbjct: 132 QASV--SQLRAVATGSTFQAVSAEQVSNLTCLLPAQSEQTQIATFLDRETAKIDRLIEKQ 189 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 +++ ++L+ RQAV+ AV L + ++ + +LR G+ + Sbjct: 190 QRLIKLLEEKRQAVISHAVTKGLNPDVPMKDSGVEWLGEIPSMWSIVQLRRGIDFLTDFE 249 Query: 247 GVGHP-----------------ILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQDGD 287 G +R + + D E S S L++ L G+ Sbjct: 250 ANGSFAEVKKNVSLDTDNKYAWYVRATDLEHRRFGLVDGNRSCNEKSYSYLSKTTLDGGE 309 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 LL + +G L+ ++ + L P+ L RL + P++ +F S ++ + Sbjct: 310 LLVAKRGE----IGKVYLMPEIDCRATLAPN-LYLIRLNDNFFPQFTYYWFISSYGKSEL 364 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 +N ++ + D+++ ++ +PPV+EQ IV+ + + + +V ++A Sbjct: 365 VN-ADKSTTIGALYKDDVRACIIPMPPVQEQILIVKHISERTDKIQRLITKVQKSIALST 423 Query: 408 NLTQSILAKAFRGELTAQW 426 ++++ A G++ + Sbjct: 424 ERRAALISAAVTGKIDVRD 442 Score = 121 bits (304), Expect = 6e-26, Method: Composition-based stats. 
Identities = 33/217 (15%), Positives = 79/217 (36%), Gaps = 14/217 (6%) Query: 4 GKLPEGWVIAPVST----VTTLIRGVTYK--KEQAINYLKDDYLPLIRANNIQNGKFDTT 57 G++P W I + +T ++ K+ + Y +RA ++++ +F Sbjct: 227 GEIPSMWSIVQLRRGIDFLTDFEANGSFAEVKKNVSLDTDNKYAWYVRATDLEHRRFGLV 286 Query: 58 DLVFVPKNL---VKESQKISPEDIVIAMSSGSKSVVGKSAH-QHLPFECSFGAFCGVLRP 113 D + ++++A +GK + + ++R Sbjct: 287 DGNRSCNEKSYSYLSKTTLDGGELLVAKR----GEIGKVYLMPEIDCRATLAPNLYLIRL 342 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 F F ++ SS ++++ + I + IP+PP+ EQ +I + + Sbjct: 343 NDNFFPQFTYYWFISSYGKSELVNADKSTTIGALYKDDVRACIIPMPPVQEQILIVKHIS 402 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++ + ++ + R A++ AV GK+ Sbjct: 403 ERTDKIQRLITKVQKSIALSTERRAALISAAVTGKID 439 Score = 117 bits (295), Expect = 7e-25, Method: Composition-based stats. Identities = 38/232 (16%), Positives = 85/232 (36%), Gaps = 25/232 (10%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 W + +P+ + +L F S + ++ S N+ G G P L+ + G F Sbjct: 20 PWFDTKPKEWMLTRLKFTSSINMGQSPNSDDCNDEGHGRPFLQ-GNAEFGMRTPKAKLFC 78 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 E + +GD+L + VG + + L + Sbjct: 79 -----EAAKKTCSEGDVLLSVRAP----VGELNIANQE----YGIGRGLCAITA-QSVKA 124 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +++ + ++ + V T S + +S + + + LLP EQ +I +++ A Sbjct: 125 DFMWWLLQASVSQ---LRAVATGSTFQAVSAEQVSNLTCLLPAQSEQTQIATFLDRETAK 181 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q++++ A + NPD+ ++ L Sbjct: 182 IDRLIEKQQRLIKLLEEKRQAVISHAVT-------KGLNPDVPMKDSGVEWL 226 >UniRef50_UPI0001B4DA32 restriction endonuclease S subunits-like protein n=1 Tax=Streptomyces viridochromogenes DSM 40736 RepID=UPI0001B4DA32 Length = 416 Score = 259 bits (662), Expect = 2e-67, Method: Composition-based stats. 
Identities = 85/380 (22%), Positives = 152/380 (40%), Gaps = 45/380 (11%) Query: 119 SGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+A+ R ++ I I +P+P LAEQ+ I L+ ++ Sbjct: 2 PEFVAYAFSWEGTRARVREYVKTTAGQAGISGGELKKIELPVPSLAEQRRIVAALEEQIS 61 Query: 178 QVDSTKARFEQI------------------------------PQILKRFRQAVLGGAVNG 207 +++S + P++ + R A Sbjct: 62 KIESGERGLTNAARRSGQYRRLAADLATKGGFAEPLTGDGTGPELFESIRSARASRVKTR 121 Query: 208 KLTEKWRN----FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV 263 +L + P H L+ + L E + + + G P+LR+ +++ G V Sbjct: 122 RLKPATLSGPVPKVPAHWTVVSLDEITELIEYGSSTKTSESAEVGGVPVLRMGNIKDGKV 181 Query: 264 DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 D ++++ + R++LQ+GDLLF R N S E VG + + + + + LIR Sbjct: 182 DPRVLKYISADHPDAVRYRLQEGDLLFNRTN-SFELVGKSAVYR-DKFGPMAFASYLIRC 239 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 R +++ + +S R + + GQ ++G + + + LPP EQ I+ Sbjct: 240 RFLPGVDTDWVNLVINSSIGRRYVRSVATQQVGQANVNGTKLAAMPIPLPPEGEQRRILD 299 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 VE A A +E + A+ L +++L +AF G L Q A+ P A L Sbjct: 300 VVETHQAAALRLESGIRQQGAKATRLRRALLTQAFAGRLVTQDPADEP--------AEIL 351 Query: 444 LEKIKAERAASGGKKASRKK 463 L +I+AER A+G K R+ Sbjct: 352 LARIRAEREAAGVTKTRRRS 371 Score = 155 bits (392), Expect = 4e-36, Method: Composition-based stats. 
Identities = 52/247 (21%), Positives = 104/247 (42%), Gaps = 8/247 (3%) Query: 5 KLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 K+P W + + +T LI G + K ++ +P++R NI++GK D L ++ Sbjct: 134 KVPAHWTVVSLDEITELIEYGSSTKTSESAEV---GGVPVLRMGNIKDGKVDPRVLKYIS 190 Query: 64 KNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGF 121 + ++ D++ ++ S +VGKSA F +F ++ R + + + Sbjct: 191 ADHPDAVRYRLQEGDLLFNRTN-SFELVGKSAVYRDKFGPMAFASYLIRCRFLPGVDTDW 249 Query: 122 IAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + SS+ R + S++ N+ + IP+PP EQ+ I + ++T A Sbjct: 250 VNLVINSSIGRRYVRSVATQQVGQANVNGTKLAAMPIPLPPEGEQRRILDVVETHQAAAL 309 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 ++ Q R R+A+L A G+L + EP + ++ E + Sbjct: 310 RLESGIRQQGAKATRLRRALLTQAFAGRLVTQDPADEPAEILLARIRAEREAAGVTKTRR 369 Query: 241 SKPNESG 247 P+ + Sbjct: 370 RSPHRAP 376 Score = 78.6 bits (193), Expect = 4e-13, Method: Composition-based stats. Identities = 33/134 (24%), Positives = 63/134 (47%), Gaps = 8/134 (5%) Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 PE++ FS R + VKTT+GQ GISG ++K + +P + EQ IV +E+ + Sbjct: 2 PEFVAYAFSWEGTRARVREYVKTTAGQAGISGGELKKIELPVPSLAEQRRIVAALEEQIS 61 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAE 450 ++ E+ + NA R + + A + + ++G+ + L E I++ Sbjct: 62 KIESGERGLTNAARRSGQYRR-LAADLAT-------KGGFAEPLTGDGTGPELFESIRSA 113 Query: 451 RAASGGKKASRKKS 464 RA+ + + + Sbjct: 114 RASRVKTRRLKPAT 127 >UniRef50_Q0W5N3 Type I restriction modification system, specificity subunit n=1 Tax=uncultured methanogenic archaeon RC-I RepID=Q0W5N3_UNCMA Length = 449 Score = 258 bits (661), Expect = 3e-67, Method: Composition-based stats. 
Identities = 92/427 (21%), Positives = 176/427 (41%), Gaps = 21/427 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTDLVF 61 G++PE W I + + + +K+ D Y I +++ N K + + Sbjct: 29 GRIPEEWSIVSIKNIVEKTEQIDPQKQP------DKYFKYIDVSSVSNESLKVVSVNEFK 82 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSG 120 + + + +DI+ A + V VLR K I Sbjct: 83 GINAPSRARRIVRTDDIIFATIRPNLKRVAI--ICDDLEGQLCSTAFCVLRCMKNIAEPY 140 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ + + K+ L G+ + I +PP++EQ+ IA L TL + ++ Sbjct: 141 FVFQTVTTDRFIGKLCDLQCGSGYPAVTDNDLLDQQILLPPISEQRKIAAILGTLDSLIE 200 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 T + Q+ K Q L + +L + P+H K + F ++ +NG+ Sbjct: 201 ETDRVVARTGQLKKGLIQEFLTEGMGNVELEDTALGMIPKHW--KCVPFATLSLTYKNGI 258 Query: 240 SSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 G G+P +R+ ++ G V+ + L +++EL ++L +GDLL R N S + Sbjct: 259 YKHDKYYGSGYPCIRMYNIADGTVNTINSPLLNVTDAELKEYELAEGDLLINRVN-SRDL 317 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG G++ ++ + K IR RL + LPE++ +F S RN + VK+ Q Sbjct: 318 VGKAGIVPAGL-GHVTFESKNIRVRLNRSMILPEFMGLFIQSSMYRNQVNKFVKSAIAQS 376 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ D+ + +V LPP EQ +I + ++ + R+ + ++++ Sbjct: 377 TINQDDLDNILVPLPPKDEQEKIASVIREINSKITWEI----RYRERIELVKKALMQDLL 432 Query: 419 RGELTAQ 425 G + + Sbjct: 433 TGRIRVK 439 Score = 122 bits (308), Expect = 2e-26, Method: Composition-based stats. 
Identities = 49/219 (22%), Positives = 94/219 (42%), Gaps = 9/219 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P+ W P +T++ + YK + P IR NI +G +T + + Sbjct: 236 GMIPKHWKCVPFATLSLTYKNGIYK----HDKYYGSGYPCIRMYNIADGTVNTINSPLLN 291 Query: 64 KNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKL-IFSG 120 + + +++ D++I + S+ +VGK+ +F + +R + I Sbjct: 292 VTDAELKEYELAEGDLLINRVN-SRDLVGKAGIVPAGLGHVTFESKNIRVRLNRSMILPE 350 Query: 121 FIAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 F+ F +SS+YRN+++ A + I D I +P+PP EQ+ IA + + +++ Sbjct: 351 FMGLFIQSSMYRNQVNKFVKSAIAQSTINQDDLDNILVPLPPKDEQEKIASVIREINSKI 410 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEP 218 E+I + K Q +L G + K P Sbjct: 411 TWEIRYRERIELVKKALMQDLLTGRIRVKPDTIAPEATP 449 Score = 81.7 bits (201), Expect = 5e-14, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 75/220 (34%), Gaps = 25/220 (11%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 E P+ + TE + + +SSV Sbjct: 21 DGYKETPMGRIPEEWSIVSIKNIVEKTEQIDPQKQPDKYF----KYIDVSSVSN-----E 71 Query: 267 DIRFLECSESE------LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKL 320 ++ + +E + R ++ D++F +L+ V + + L Sbjct: 72 SLKVVSVNEFKGINAPSRARRIVRTDDIIFATIRPNLKRVAIIC----DDLEGQLCSTAF 127 Query: 321 IRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQA 379 R K+ P ++ ++ + + SG ++ D+ Q +LLPP+ EQ Sbjct: 128 CVLRCMKNIAEPYFVFQTVTTDRFIGKLCDLQC-GSGYPAVTDNDLLDQQILLPPISEQR 186 Query: 380 EIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +I + L D++ ++ + +AR L + ++ + Sbjct: 187 KIAAILGTL----DSLIEETDRVVARTGQLKKGLIQEFLT 222 >UniRef50_C6JA10 Putative uncharacterized protein n=1 Tax=Ruminococcus sp. 5_1_39BFAA RepID=C6JA10_9FIRM Length = 393 Score = 258 bits (660), Expect = 3e-67, Method: Composition-based stats. 
Identities = 89/417 (21%), Positives = 170/417 (40%), Gaps = 35/417 (8%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK-E 69 + ++ G +K E D + +IR N+Q G + VF P + + Sbjct: 2 KKIRLGDACDILNGFAFKSEN----YVDSGIRVIRIANVQKGYIEDNTPVFYPLETNELD 57 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPE-KLIFSGFIAHFTK 127 + D+++A++ VG+ A F + LR + + ++ H Sbjct: 58 KYMLEEGDLLMALT----GNVGRVAILKKEFMPAALNQRVACLRLKTDRVAKDYLFHVLN 113 Query: 128 SSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 S+ + + S G N+ IP+ P +Q++IA+ LD + S + Sbjct: 114 SAFFEQQCIQSSKGVAQKNMSTEWLKDYEIPMYPKEQQELIADILDKTRNIIISRNYELK 173 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT-ELRNGLSSKPNES 246 ++ ++K + G A + +KK+ ++ +T E +NG+ ++ Sbjct: 174 KLDDLIKARFVEMFGDAYLNEF------------GWKKIKIKNAVTVEPQNGMYKPQSDY 221 Query: 247 ---GVGHPILRISSVRAGHV-DQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 G G PILRI G V D + ++ L CSE+E ++ L + D++ R N S+E++G Sbjct: 222 VTDGSGIPILRIDGFYDGVVTDFSSLKRLRCSENERQKYLLYEDDVVINRVN-SIEYLGK 280 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 C + L +Y ++R P Y+ S + ++N K Q I+ Sbjct: 281 CAHINGLLEDT-VYESNMMRMHFDSTRFHPVYVCRLLCSRFVYDQIVNHAKQAVNQASIN 339 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 KD+ + PP+K Q + V + D + ++ AL + L S++ + F Sbjct: 340 QKDVLDFDIYEPPLKLQIQFADFVRAV----DKSKVEIQKALDKTQMLFDSLMQEYF 392 Score = 83.6 bits (206), Expect = 1e-14, Method: Composition-based stats. 
Identities = 31/198 (15%), Positives = 67/198 (33%), Gaps = 4/198 (2%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF-DTTDLVFVPKNLV 67 GW + T+ K Q+ +P++R + +G D + L + + Sbjct: 196 GWKKIKIKNAVTVEPQNGMYKPQSDYVTDGSGIPILRIDGFYDGVVTDFSSLKRLRCSEN 255 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEK-LIFSGFIAHF 125 + + + ED V+ S +GK A L + + + + + ++ Sbjct: 256 ERQKYLLYEDDVVINRVNSIEYLGKCAHINGLLEDTVYESNMMRMHFDSTRFHPVYVCRL 315 Query: 126 TKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S ++I + + A N +I +I PPL Q A+ + + + Sbjct: 316 LCSRFVYDQIVNHAKQAVNQASINQKDVLDFDIYEPPLKLQIQFADFVRAVDKSKVEIQK 375 Query: 185 RFEQIPQILKRFRQAVLG 202 ++ + Q G Sbjct: 376 ALDKTQMLFDSLMQEYFG 393 >UniRef50_B5IRS1 Type I restriction modification DNA specificity domain protein n=1 Tax=Thermococcus barophilus MP RepID=B5IRS1_9EURY Length = 408 Score = 258 bits (659), Expect = 4e-67, Method: Composition-based stats. Identities = 82/426 (19%), Positives = 176/426 (41%), Gaps = 31/426 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + + + +G K + KD +LP + ++N + T V + Sbjct: 10 IGEIPEDWQVVKLGKIIGYTKGK--KPKMVAKEPKDGWLPYLSTEYLRNN--NPTQFVKI 65 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGF 121 N + + DI++ + L + + + +K + S F Sbjct: 66 TGNEI----IVEDGDILLLWDGSNAGE------FFLAKKGVLSSTMVKIFLKKHVYDSLF 115 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + K + + G I ++ + + +P+PPL EQK IAE L T+ ++ Sbjct: 116 LFYLLKHRE--PFLKGQTKGTGIPHVDKNVLNALLLPLPPLEEQKQIAEILRTVDEAIEK 173 Query: 182 TKARFEQIPQILKRFRQAVLGGAV-NGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 T E+ ++ K Q +L + + + + P+ ++ + L + GLS Sbjct: 174 TDLAIEKTERLKKGLMQRLLTKGIKHKRFKKTEIGEIPEEWRVVRIGEVTGL--FQYGLS 231 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 K ++ G +PI+++ S+ G V +I++++ E +++L+ GD+L R N S E V Sbjct: 232 IKMHDKG-KYPIIKMDSIINGEVKPVNIKYVDLDEDTFKKYRLEKGDILINRTN-SYELV 289 Query: 301 
GVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 G G+ + + ++ LIR R K P ++ + A + + Q Sbjct: 290 GRTGVF--MLDGDYVFASYLIRIRPDKKQIDPRFLTFYL--IFANDKLRQLATRAVSQAN 345 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 I+ ++K + LPP++EQ +I + + + + ++ + + ++ Sbjct: 346 INASNLKKFKIPLPPLEEQKQIAEILMTVDKKLELL----RKRKEKLERIKRGLMKDLLT 401 Query: 420 GELTAQ 425 G + Sbjct: 402 GRRRVK 407 Score = 154 bits (390), Expect = 6e-36, Method: Composition-based stats. Identities = 45/205 (21%), Positives = 97/205 (47%), Gaps = 13/205 (6%) Query: 2 SAGKLPEGWVIAPVSTV-TTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++PE W + + V G++ K P+I+ ++I NG+ ++ Sbjct: 206 EIGEIPEEWRVVRIGEVTGLFQYGLSIKMHDK------GKYPIIKMDSIINGEVKPVNIK 259 Query: 61 FVP-KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IF 118 +V + ++ DI+I ++ S +VG++ L + F ++ +RP+K I Sbjct: 260 YVDLDEDTFKKYRLEKGDILINRTN-SYELVGRTGVFMLDGDYVFASYLIRIRPDKKQID 318 Query: 119 SGFI-AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ + ++ ++++ + + NI ++ IP+PPL EQK IAE L T+ Sbjct: 319 PRFLTFYLIFANDKLRQLATRA--VSQANINASNLKKFKIPLPPLEEQKQIAEILMTVDK 376 Query: 178 QVDSTKARFEQIPQILKRFRQAVLG 202 +++ + R E++ +I + + +L Sbjct: 377 KLELLRKRKEKLERIKRGLMKDLLT 401 Score = 76.7 bits (188), Expect = 2e-12, Method: Composition-based stats. 
Identities = 32/234 (13%), Positives = 78/234 (33%), Gaps = 36/234 (15%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQND 267 K + P+ KL T+ + P L +R N Sbjct: 4 KFKKTPIGEIPEDWQVVKLGKIIGYTKGKKPKMVAKEPKDGWLPYLSTEYLRNN----NP 59 Query: 268 IRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 +F++ + +E+ ++DGD+L + + + + +++ L K Sbjct: 60 TQFVKITGNEI---IVEDGDILLLWDGSNAG--------EFFLAKKGVLSSTMVKIFLKK 108 Query: 328 DA-LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 ++ R + +G + + + ++ LPP++EQ +I + Sbjct: 109 HVYDSLFLFYLLK---HREPFLKGQTKGTGIPHVDKNVLNALLLPLPPLEEQKQIAEILR 165 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFR-------------GELTAQWR 427 + D ++ + A+ + L + ++ + GE+ +WR Sbjct: 166 TV----DEAIEKTDLAIEKTERLKKGLMQRLLTKGIKHKRFKKTEIGEIPEEWR 215 >UniRef50_D2QTT7 Restriction modification system DNA specificity domain protein n=1 Tax=Spirosoma linguale DSM 74 RepID=D2QTT7_9SPHI Length = 441 Score = 258 bits (659), Expect = 4e-67, Method: Composition-based stats. 
Identities = 79/438 (18%), Positives = 152/438 (34%), Gaps = 37/438 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVF 61 G++P W + + V + + + + K + + ++ T+ Sbjct: 25 IGEIPAHWEVGRIKYVCKINQ-----RSLPESTAKSFPIHYVDIGSVTLEEGIVQTEEFE 79 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + D +I+ + Q F S G VL P LI F Sbjct: 80 FKNAPSRARRIANAGDTIISTVRTYLKAIAFVDEQQSQFIYSTG--FAVLNPLPLIMPKF 137 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +A KS + ++S+ S G + I + I PPL+EQ IAE LD AQ+D Sbjct: 138 LAMAVKSDSFTEQVSANSKGMSYPAINSTELGCLAICFPPLSEQTRIAEFLDRKTAQIDQ 197 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEK--------------WRNFEPQHSVFKKLN 227 A+ EQ+ ++L RQ ++ AV L W P H ++N Sbjct: 198 AIAQKEQLIELLNERRQVMIHRAVTRGLNPNAPMKDSGIDRGDARWIGEIPAHWEVSRIN 257 Query: 228 FESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGD 287 + + G P + + + + + + + + R GD Sbjct: 258 WL-FTEKDETGYPDLPLLIVSINSGVTVRDMDDTEIR----KQVAEDFNVYKRAL--AGD 310 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 + F + VGV L+ PD ++ R Y F + Sbjct: 311 IAFNKMRMWQGAVGVV------PQDGLVSPDYVVA-RPNNFVNSAYYGFLFKTREYLAEF 363 Query: 348 MNCVKTTS-GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + + + + +D KS ++PP++EQ +IV + ++ + ++ Sbjct: 364 VKHSHGIAWDRNRLYWEDFKSIFAMVPPLEEQNQIVDFLNAQNEEMSFASTKIQKQIQKL 423 Query: 407 NNLTQSILAKAFRGELTA 424 L +++ A G++ Sbjct: 424 QELKSTLINSAVTGKIKV 441 Score = 126 bits (318), Expect = 1e-27, Method: Composition-based stats. 
Identities = 36/232 (15%), Positives = 75/232 (32%), Gaps = 19/232 (8%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR--AGHVDQNDIR 269 +W P H ++ + + + S+ + + I SV G V + Sbjct: 23 EWIGEIPAHWEVGRIKYVCKINQRSLPESTAKSFP---IHYVDIGSVTLEEGIVQTEEFE 79 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 F R GD + + L+ + + + Q Q +Y Sbjct: 80 FKNAPS--RARRIANAGDTIISTVRTYLKAI---AFVDEQQSQ-FIYSTGFAVLNPLPLI 133 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +P+++ + S S + K I+ ++ + PP+ EQ I +++ Sbjct: 134 MPKFLAMAVKSDSFTEQVSANSK-GMSYPAINSTELGCLAICFPPLSEQTRIAEFLDRKT 192 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAA 441 A D Q + +N Q ++ +A R NP+ ++ Sbjct: 193 AQIDQAIAQKEQLIELLNERRQVMIHRAVT-------RGLNPNAPMKDSGID 237 >UniRef50_Q6F778 Putative type I restriction-modification system specificity determinant for hsdM and hsdR (HsdS) n=1 Tax=Acinetobacter sp. ADP1 RepID=Q6F778_ACIAD Length = 448 Score = 257 bits (658), Expect = 5e-67, Method: Composition-based stats. 
Identities = 88/434 (20%), Positives = 169/434 (38%), Gaps = 19/434 (4%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + + + + Y ++ D IR +I + D Sbjct: 19 GEIPSHWEVKRMKFLLSEK--LKYGANESAESEDKDQPRYIRITDINDSGTLREDTFKSL 76 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + ++ DI++A S + + C G ++ FI Sbjct: 77 EIEKAQEYLLNDLDILLARSGATVGKSYLHKKDKVNVACYAGYLIRARFNKENYDPQFIN 136 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 F +S Y + I S++ A I N+ ++ + + IP LAEQKIIA+ LD LAQVD+ Sbjct: 137 LFLQSKAYWSWIESVNIQATIQNVSAEKYNDLALSIPSLAEQKIIADFLDKRLAQVDALI 196 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKW---------RNFEPQHSVFKKLNFESILTE 234 A+ E + + L R A++ AV L P K+L F + + Sbjct: 197 AKQETLLEKLAEQRVALISHAVTKGLNPDVEMKESDVVLLGNIPNTWNIKRLKFL-LSEK 255 Query: 235 LRNGLSSKPNESGVGHP-ILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 L+ G + +P +RI+ + + D F + + L D D+L R Sbjct: 256 LKYGANESAESEDKENPRYIRITDIDDSG-NLKDETFKSLESEKAQEYLLDDLDILLARS 314 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVK 352 + VG L K Y LIRARL ++ PE++ F S + + + Sbjct: 315 GAT---VGKSYLYKAESVGIACYAGYLIRARLDQENYNPEFVNYFLQSKQYWDWISSINI 371 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + + +S + + +P ++EQ +++ ++ + + + +N + Sbjct: 372 Q-ATIQNVSAEKYNDLTLAIPSLEEQKQLIEYLKNEDEKFNRAISKGKKLVHLLNEYRST 430 Query: 413 ILAKAFRGELTAQW 426 ++ + G++ Q Sbjct: 431 LITQVVTGKIDVQN 444 Score = 147 bits (373), Expect = 6e-34, Method: Composition-based stats. 
Identities = 50/239 (20%), Positives = 100/239 (41%), Gaps = 18/239 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVRA-GHVDQNDIR 269 +W P H K++ F + +L+ G + P +RI+ + G + ++ + Sbjct: 16 QWLGEIPSHWEVKRMKFL-LSEKLKYGANESAESEDKDQPRYIRITDINDSGTLREDTFK 74 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-D 328 LE +++ + L D D+L R + VG L KK + Y LIRAR K + Sbjct: 75 SLEIEKAQ--EYLLNDLDILLARSGAT---VGKSYLHKKDKVNVACYAGYLIRARFNKEN 129 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P++I +F S + + + + + + +S + + +P + EQ I +++ Sbjct: 130 YDPQFINLFLQSKAYWSWIESVNIQ-ATIQNVSAEKYNDLALSIPSLAEQKIIADFLDKR 188 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKI 447 A D + + L ++ ++++ A + NPD+ E+ LL I Sbjct: 189 LAQVDALIAKQETLLEKLAEQRVALISHAVT-------KGLNPDVEMKESDV-VLLGNI 239 >UniRef50_Q12YI6 Restriction modification system DNA specificity subunit n=1 Tax=Methanococcoides burtonii DSM 6242 RepID=Q12YI6_METBU Length = 511 Score = 257 bits (658), Expect = 6e-67, Method: Composition-based stats. 
Identities = 96/517 (18%), Positives = 176/517 (34%), Gaps = 84/517 (16%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ---NGKFDTTDLVFVPK 64 + WV +S ++ G T K + + + + I ++ Sbjct: 13 DDWVKGVLSDFGQVVSGGTPKTK--VPEYWGEDILWITPADLSGYSEKYIYKGRKSITHL 70 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 L S ++ P+ V+ S + + E L P + + F+ + Sbjct: 71 GLKNSSARLIPKGSVLFSSRAPIGYIAIAG-----NELCTNQGFKTLIPSEALNRDFLYY 125 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + KS + ++G + +F + + +PPL EQ+ I K++ L +++D+ A Sbjct: 126 YLKS--IKQLAEGRASGTTFKELSGKAFAELPLCVPPLPEQRAIVSKIEQLFSELDNGIA 183 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF----------------------EPQHSV 222 + Q LK +RQ+VL A G+LT +WR + Sbjct: 184 NLKLAQQQLKVYRQSVLKKAFEGELTRQWREQQTDLPDAKALLEQIQVEREESYNEKLDE 243 Query: 223 FKKLNFESI------------------------------LTELRNGLS----------SK 242 +K+ E G + K Sbjct: 244 WKRAVKEWEDAGKEGKKPTKPKKLKKIEPFTETELAELPTLPNGYGWTRLGELHHLKSDK 303 Query: 243 PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 SG + + + D ++ ++ + GDLL+ + L V Sbjct: 304 HTGSGESLFYIGLEHISKNQGTLTDEVKIDV--INTVKNSFKKGDLLYGKLRPYLNKV-- 359 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 L +++ + ++ Y + +F S N M + + +S Sbjct: 360 -----YLANEDGVCSTDILVFESIPSLDLNYSKYYFLSYKFVNDMTHN-SSGVNLPRVST 413 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 K ++ L ++EQ IV +E + D +E+ + + L L QSIL KAF G+L Sbjct: 414 KYLQEYPFPLFSLEEQQAIVTEIETRLSVCDKVEQDIEDNLKIAEALRQSILKKAFEGKL 473 Query: 423 TAQWRAENPDLISGENSAAALLEKIKAERAASGGKKA 459 + E A LLEKI+AE+A SG K Sbjct: 474 LNERELEEVRSAPDWEPAEVLLEKIRAEKAGSGKKGK 510 Score = 154 bits (389), Expect = 1e-35, Method: Composition-based stats. 
Identities = 54/259 (20%), Positives = 95/259 (36%), Gaps = 18/259 (6%) Query: 208 KLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR---AGHVD 264 K+ + V L+ + + P G + + + ++ Sbjct: 2 KVKNRLGEKLGDDWVKGVLSDFGQVVSGGTPKTKVPEYWGEDILWITPADLSGYSEKYIY 61 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 + + + G +LF+ S +G + L Sbjct: 62 KGRKSITHLGLKNSSARLIPKGSVLFS----SRAPIGYIAIAGNE----LCTNQGFKTLI 113 Query: 325 LTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRR 384 ++ +++ + S + + + K +SGK + +PP+ EQ IV + Sbjct: 114 PSEALNRDFLYYYLKS---IKQLAEGRASGTTFKELSGKAFAELPLCVPPLPEQRAIVSK 170 Query: 385 VEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALL 444 +EQLF+ D + A ++ QS+L KAF GELT QWR + DL A ALL Sbjct: 171 IEQLFSELDNGIANLKLAQQQLKVYRQSVLKKAFEGELTRQWREQQTDLPD----AKALL 226 Query: 445 EKIKAERAASGGKKASRKK 463 E+I+ ER S +K K Sbjct: 227 EQIQVEREESYNEKLDEWK 245 Score = 112 bits (282), Expect = 2e-23, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 82/242 (33%), Gaps = 18/242 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK- 64 LP G+ + + L +++ Y I +I + TD V + Sbjct: 284 LPNGYGWTRLGELHHLKSDKHTGSGESLFY--------IGLEHISKNQGTLTDEVKIDVI 335 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 N VK S K D++ + V + V + + + Sbjct: 336 NTVKNSFK--KGDLLYGKLRPYLNKV-----YLANEDGVCSTDILVFESIPSLDLNYSKY 388 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + S + N ++ S+G N+ + P+ L EQ+ I +++T L+ D + Sbjct: 389 YFLSYKFVNDMTHNSSGVNLPRVSTKYLQEYPFPLFSLEEQQAIVTEIETRLSVCDKVEQ 448 Query: 185 RFEQIPQILKRFRQAVLGGAVNGK-LTEKWRNFEPQHSVFKKLN-FESILTELRNGLSSK 242 E +I + RQ++L A GK L E+ ++ + + G K Sbjct: 449 DIEDNLKIAEALRQSILKKAFEGKLLNERELEEVRSAPDWEPAEVLLEKIRAEKAGSGKK 508 Query: 243 PN 244 Sbjct: 509 GK 510 >UniRef50_B5VW68 Restriction modification system DNA specificity domain n=1 Tax=Arthrospira maxima CS-328 RepID=B5VW68_SPIMA Length = 415 Score = 257 bits (657), Expect = 7e-67, 
Method: Composition-based stats. Identities = 72/429 (16%), Positives = 157/429 (36%), Gaps = 46/429 (10%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P GW + V + I G +K + P++R N+ N D + Sbjct: 17 PLGWKKSYVKYLGNYINGYPFKPDN----WSFQGKPILRIQNLSNPNADFNRY----EGE 68 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHF 125 + E+ + DI+I+ S S ++ L + + KL+F + Sbjct: 69 ISEAYLVHKGDILISWS-ASLG-----VYKWLGEDAWLNQHIFKVEINTKLVFEEYFVWL 122 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 +S + ++ + G+ + ++ +F + +PP+ EQK IA LD A++D Sbjct: 123 --ASWFIKELEHKAHGSTMQHLTWNAFGNFPVLLPPMPEQKAIAHYLDKETAKIDQLIEA 180 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTELR 236 +++ ++L R+A++ AV L +W P+H + + E+ Sbjct: 181 KKRLLELLDEKRRALITHAVTRGLNPDVPMRDSGVEWIGEIPKHWKVEFAK--WLFKEID 238 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYN 294 + ++ E +L +S + + + + ++E Q GDL+ Sbjct: 239 DRSTTGQEE------LLTVSHIT--GITPRSEKDVNMFKAESMEGYKVCQSGDLIINTLW 290 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT- 353 + +GV Q + R + P Y++ P + K Sbjct: 291 AWMGAMGVS-------FQPGIVSPSYHVYRPQGEYHPVYLDYLVRIPIFAEEAIRYSKGV 343 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + + ++ ++ +PP++EQ +I + + + D + + + S+ Sbjct: 344 WISRLRLYPEEFFQILLPVPPLEEQYKIGKYLMEKTKKLDNLSIATKKTMDLLQERRTSL 403 Query: 414 LAKAFRGEL 422 + A G+L Sbjct: 404 ITAAVTGQL 412 Score = 136 bits (342), Expect = 2e-30, Method: Composition-based stats. 
Identities = 38/221 (17%), Positives = 81/221 (36%), Gaps = 23/221 (10%) Query: 224 KKLNFESILTELRNGLSSKPNESG-VGHPILRISSVRAGHVDQNDIRFLECSESELNRHK 282 K ++ L NG KP+ G PILRI ++ + D N + Sbjct: 20 WKKSYVKYLGNYINGYPFKPDNWSFQGKPILRIQNLSNPNADFNRY-----EGEISEAYL 74 Query: 283 LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPS 342 + GD+L + +G + K ++ + + + + E ++ +S Sbjct: 75 VHKGDILIS----WSASLG----VYKWLGEDAWLNQHIFKVEINTKLVFEEYFVWLASWF 126 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 + + S + ++ + VLLPP+ EQ I +++ A D + + Sbjct: 127 IKE--LEHKAHGSTMQHLTWNAFGNFPVLLPPMPEQKAIAHYLDKETAKIDQLIEAKKRL 184 Query: 403 LARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 L ++ ++++ A R NPD+ ++ + Sbjct: 185 LELLDEKRRALITHAVT-------RGLNPDVPMRDSGVEWI 218 Score = 114 bits (286), Expect = 9e-24, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 75/213 (35%), Gaps = 22/213 (10%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++P+ W + + I + ++ L+ ++I V Sbjct: 218 IGEIPKHWKVEFAKWLFKEIDDRSTTGQEE----------LLTVSHIT--GITPRSEKDV 265 Query: 63 P---KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 ++ + D++I +G S V RP+ Sbjct: 266 NMFKAESMEGYKVCQSGDLIINTLWAWMGAMGVSF-----QPGIVSPSYHVYRPQGEYHP 320 Query: 120 GFIAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 ++ + + ++ + S G + + P F I +P+PPL EQ I + L Sbjct: 321 VYLDYLVRIPIFAEEAIRYSKGVWISRLRLYPEEFFQILLPVPPLEEQYKIGKYLMEKTK 380 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 ++D+ ++ +L+ R +++ AV G+L Sbjct: 381 KLDNLSIATKKTMDLLQERRTSLITAAVTGQLK 413 >UniRef50_A1TXP8 Restriction modification system DNA specificity domain n=2 Tax=Gammaproteobacteria RepID=A1TXP8_MARAV Length = 439 Score = 257 bits (657), Expect = 7e-67, Method: Composition-based stats. 
Identities = 69/454 (15%), Positives = 149/454 (32%), Gaps = 54/454 (11%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W I + + + G YK + P+I + G+F Sbjct: 18 GEVPSNWKIGRLKHLLRIRGGQDYKS---VESYVPTDFPVIGSG----GQFTYATD---- 66 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + V+ G K + K + F F + P + Sbjct: 67 --------YLYDGESVLL---GRKGTIDKPLYVKGKFWTVDTMFYTEVLP--GTNGRYAY 113 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + + S + ++ +P+PP EQ IA LD A++D+ Sbjct: 114 YLATTIPF----DLYSTNTALPSMSQFDLANHGLPLPPKCEQTQIARFLDHETAKIDALI 169 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTE 234 E++ ++L+ RQAV+ AV L +W P H + ++ + + Sbjct: 170 REQERLIELLQEKRQAVISHAVTKGLDPDVPMKDSGVEWLGEVPAHWIVARIKNFARVES 229 Query: 235 LRNGLSSKPNES-GVGHPILRISSVRA----GHVDQNDIRFLECSESELNRHKLQDGDLL 289 K P + ++ + ++ + + + + L ++ Sbjct: 230 GHTPDKKKEEYWVDCDIPWVSLNDSKQLKKADYIADTSTKVNDLGIANSSARLLPAAAVV 289 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR-ARLTKDALPEYIEIFFSSPSARNAMM 348 FTR +G+ + + + LI + +PEY+ + F A + Sbjct: 290 FTRD----ASIGLSAI----TTKPMAVSQHLIAWLCAGEKLVPEYLLLIF---YAMESEF 338 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + K I D++S PP++EQ ++V + + + Sbjct: 339 ERYTFGATIKTIGMDDVRSLTAAFPPMEEQKQLVTWAFRKKETLQAGLDAAEKTILLLKE 398 Query: 409 LTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 ++++ A G++ + D + + A Sbjct: 399 RRSALISSAVTGKIDVRNWQPPADEGAFDEEVRA 432 Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats. 
Identities = 32/265 (12%), Positives = 74/265 (27%), Gaps = 45/265 (16%) Query: 207 GKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQN 266 +W P + +L + ++ S + P++ G Sbjct: 10 KGSGVQWLGEVPSNWKIGRLKHLLRIRGGQDYKSVES-YVPTDFPVIGSG----GQFTYA 64 Query: 267 DIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT 326 + +L R + D + + Sbjct: 65 TDYLYDGES------------VLLGRKGTIDK--------PLYVKGKFWTVDTMFYTEVL 104 Query: 327 KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVE 386 Y ++ + T + +S D+ + + LPP EQ +I R ++ Sbjct: 105 PGTNGRYAYYLATTIPF-----DLYSTNTALPSMSQFDLANHGLPLPPKCEQTQIARFLD 159 Query: 387 QLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL--- 443 A D + ++ + + Q++++ A + +PD+ ++ L Sbjct: 160 HETAKIDALIREQERLIELLQEKRQAVISHAVT-------KGLDPDVPMKDSGVEWLGEV 212 Query: 444 -----LEKIKAERAASGGKKASRKK 463 + +IK G +KK Sbjct: 213 PAHWIVARIKNFARVESGHTPDKKK 237 >UniRef50_Q07ZW7 Restriction modification system DNA specificity domain n=1 Tax=Shewanella frigidimarina NCIMB 400 RepID=Q07ZW7_SHEFN Length = 462 Score = 256 bits (656), Expect = 9e-67, Method: Composition-based stats. 
Identities = 71/447 (15%), Positives = 155/447 (34%), Gaps = 34/447 (7%) Query: 6 LPEGWVIAP----VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 LP W + + + I+ + + + + + + NI F Sbjct: 22 LPSTWQVLKVKFLLKNGSEGIKIGPFGSALKLEDMVEKGIRVYGQENIIKRDFTLGKRFI 81 Query: 62 VPKNLVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFS 119 + DI+I M S GK + + LR I Sbjct: 82 SQTKYKDMKVYTAEAGDILITMMGTS----GKCQVVPENADLGIIDSHLLKLRTNSKILP 137 Query: 120 GFIAHFT--KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + ++ +++IS G+ + + + + P+P + EQ I LD A Sbjct: 138 E-LFRLLVDEAQEIKDQISKQGKGSIMLGLNSSIVKELEFPLPSIEEQTQILCFLDHETA 196 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNF 228 ++D A+ E++ ++LK RQAV+ AV L W P+H V L Sbjct: 197 KIDDLIAKQEKLIELLKEKRQAVISHAVTKGLNPDSPMKNSGVVWLGEVPEHWVVCCLKH 256 Query: 229 ES-----ILTELRNGLSSKPN-ESGVGHPILRISSV-RAGHVDQNDIRFLECSESE-LNR 280 + G + K G + S+ G +D + ++ + + E ++R Sbjct: 257 IKGKEKGSFVDGPFGSNLKSEHFVDDGDVYVIESNFATTGMLDTSKLKTISVAHFETISR 316 Query: 281 HKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSS 340 + ++G ++ + G+ +L L H+ ++ + + ++ + + + Sbjct: 317 SETKEGAIILAKIGA---RYGMNSILPCLPHKAVVSGN-CLSLKINEKTMDVLYCHQLLT 372 Query: 341 PSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVN 400 + M+ + Q +S + + L PP KEQ+EI ++Q + + Sbjct: 373 HLKQEGAMDDGVNVTAQPALSLGQLNNLPFLSPPQKEQSEIASFIQQRDESFSILINKAI 432 Query: 401 NALARVNNLTQSILAKAFRGELTAQWR 427 + ++++ G++ Sbjct: 433 KLIELSKERKTALISAVLTGKIDVLDW 459 Score = 143 bits (361), Expect = 2e-32, Method: Composition-based stats. 
Identities = 35/238 (14%), Positives = 78/238 (32%), Gaps = 18/238 (7%) Query: 212 KWRNFEPQHSVFKKLNFE-----SILTELRNGLSSKPNE-SGVGHPILRISSVRAGHVDQ 265 +W P K+ F + G + K + G + ++ Sbjct: 17 EWLKLLPSTWQVLKVKFLLKNGSEGIKIGPFGSALKLEDMVEKGIRVYGQENIIKRDFTL 76 Query: 266 NDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL 325 + ++ + + GD+L T S G C ++ + ++ L++ R Sbjct: 77 GKRFISQTKYKDMKVYTAEAGDILITMMGTS----GKCQVVPENADLGII-DSHLLKLRT 131 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 LPE + ++ S G++ +K LP ++EQ +I+ + Sbjct: 132 NSKILPELFRLLVDEAQEIKDQISKQGKGSIMLGLNSSIVKELEFPLPSIEEQTQILCFL 191 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + A D + + + + Q++++ A + NPD + L Sbjct: 192 DHETAKIDDLIAKQEKLIELLKEKRQAVISHAVT-------KGLNPDSPMKNSGVVWL 242 Score = 102 bits (255), Expect = 3e-20, Method: Composition-based stats. Identities = 30/215 (13%), Positives = 72/215 (33%), Gaps = 10/215 (4%) Query: 4 GKLPEGWVIAPVSTV-----TTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD 58 G++PE WV+ + + + + G ++ +++ D + +I +N G DT+ Sbjct: 243 GEVPEHWVVCCLKHIKGKEKGSFVDGPFGSNLKSEHFVDDGDVYVIESNFATTGMLDTSK 302 Query: 59 LVFVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 L + + + I++A + S LP + C L+ + Sbjct: 303 LKTISVAHFETISRSETKEGAIILAKIGARYGM--NSILPCLPHKAVVSGNCLSLKINEK 360 Query: 117 -IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + + + + + + PP EQ IA + Sbjct: 361 TMDVLYCHQLLTHLKQEGAMDDGVNVTAQPALSLGQLNNLPFLSPPQKEQSEIASFIQQR 420 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + ++ ++ K + A++ + GK+ Sbjct: 421 DESFSILINKAIKLIELSKERKTALISAVLTGKID 455 >UniRef50_B0RQ64 Type I site-specific DNA methyltransferase specificity subunit n=3 Tax=Xanthomonas campestris pv. campestris RepID=B0RQ64_XANCB Length = 415 Score = 256 bits (655), Expect = 1e-66, Method: Composition-based stats. 
Identities = 80/429 (18%), Positives = 161/429 (37%), Gaps = 23/429 (5%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + ++ G T + Q Y D P ++ ++ N + TTD V Sbjct: 2 LPDGWRRTTLGNIGSVKSGSTPARSQHDRYFVDGKWPWVKTMDLTNSEILTTDEVITDAA 61 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLIFSGFIAH 124 L + S ++ P V+ G +G++ + + + F+ H Sbjct: 62 LAESSCRLFPAGTVLVAMYGGFKQIGRTGLLR--EKSAINQAISAIDIERNQADPEFVLH 119 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + S + ++ NI + + +P L EQ+ IA L T + +T+ Sbjct: 120 WLNGS-VETWKNYAASSRKDPNITRENVCDFPVILPTLGEQRRIAHILSTWDQAIATTER 178 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + + + + + L + P L +G K Sbjct: 179 LLKNSQKQMDILLRDL-------TLGTQRTTSTPSPWAKFTLGELGRTYSGLSG--KKGE 229 Query: 245 ESGVGHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 + G G + ++V + +D D ++ SE+E N+ +++ GD++FT + + VG+ Sbjct: 230 DFGFGAKFIPYTNVFKNNRIDIEDFSLVKISENE-NQTRVKSGDIIFTISSETPNEVGMA 288 Query: 304 GLLKKLQHQNLLYPDKLIRARLT--KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGIS 361 +L ++ L RL K LPEY +P R A+M + S + IS Sbjct: 289 SVLLDDVNE-LYLNSFCFGYRLNDFKTLLPEYAGFVLRAPHIR-ALMTQIAQGSTRFNIS 346 Query: 362 GKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 ++ + LP + EQ I + + + + LAR+ ++++ G+ Sbjct: 347 KANVMRMELALPSIAEQKRIASILGGAHSTVKNL----RDQLARLKAEKVILMSQLLTGK 402 Query: 422 LTAQWRAEN 430 + + Sbjct: 403 RRVRLPTDE 411 >UniRef50_C5TIE5 Restriction modification system DNA specificity domain protein n=1 Tax=Zymomonas mobilis subsp. mobilis ATCC 10988 RepID=C5TIE5_ZYMMO Length = 419 Score = 256 bits (655), Expect = 1e-66, Method: Composition-based stats. 
Identities = 105/470 (22%), Positives = 199/470 (42%), Gaps = 60/470 (12%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVT-YKKEQAINYLKDDYLPLIRAN--NIQNGKFDTT 57 MS LP+GW+ + +T G + K + + + P A+ ++ F+ Sbjct: 1 MSN--LPQGWIQTTFADITNQRSGNSKLVKGKLESQESNGLYPAFSASGPDVWRDAFEY- 57 Query: 58 DLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 D +I + G + GK+ + P +++ Sbjct: 58 -----------------EGDAIIVSAVG--ARCGKAFRAKGQWSAIANTHIVWPEP-QVV 97 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + F+ + K G+ +K + I +PPL EQ+ I K+D+L Sbjct: 98 ETEFLFLLLNDENFWEK-----GGSAQPFVKVRATFERTINLPPLPEQRRIVAKIDSLTG 152 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + + + IP+++++++QA+L A + + ++ + + Sbjct: 153 KSRRARDHLDHIPRLVEKYKQAILSAAFR--------------ADWPLISVGETIRAVVA 198 Query: 238 GLSSKPNE---SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 G + + E ++++S+V G D + L S + +++ GDLL +R N Sbjct: 199 GKNLRCEERPPFEHESGVVKVSAVSWGTFDARASKTLPESFTPPENTRIKAGDLLISRAN 258 Query: 295 GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTT 354 +LE VG ++ + NL DK++R + ++ F SP R A+ Sbjct: 259 -TLELVGAVVIVLEC-PSNLFLSDKVLRLDVEDG-DKPWLMWFLRSPDGRAAIEGAATGN 315 Query: 355 S-GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + +S +KS + P +++ EIV R+E FA+ + + +A +++L QS+ Sbjct: 316 QLSMRNLSQAALKSISMPWPAAEQREEIVSRIESAFAWIECLAADAASARKLIDHLDQSM 375 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 LAKAF+GEL Q A+ P A+ALL++I+AERAA+ K R+K Sbjct: 376 LAKAFKGELVPQDPADEP--------ASALLDRIRAERAAAPKAKRGRRK 417 >UniRef50_A7JK69 Type I restriction-modification system n=1 Tax=Francisella novicida GA99-3548 RepID=A7JK69_FRANO Length = 394 Score = 255 bits (653), Expect = 2e-66, Method: Composition-based stats. 
Identities = 75/429 (17%), Positives = 165/429 (38%), Gaps = 40/429 (9%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIR-GVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-D 58 MS +LP+GW + +T+ + GV K Y + + +I I+ G + Sbjct: 1 MSNSELPKGWKAIELGEITSYVNRGVAPK------YTDEHGITVINQKCIREGNINLELA 54 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI- 117 V P +++ DI+I + G+ ++R K Sbjct: 55 RVHNPDKKYTAEKQLHLGDILINSTG--VGTAGRVGIFTDSINAIVDTHVSIVRLNKEYA 112 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + F+ + + ++ + G+ +K + +NI +P L EQK IA+ L +L Sbjct: 113 YPKFVYYNLRFRE--KELEETAEGSTGQIELKRDAIKSLNILLPQLTEQKAIADVLSSLD 170 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 ++D + + + + + + E ++++ + + Sbjct: 171 DKIDLLHKQNQTLEDMAQTLFREWF--------------IEKADEGWEEMPLSEVCSVTA 216 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLLFTRYNG 295 + +G P+++I ++ GH+D ND++F++ SES++ ++++L D D++ Sbjct: 217 GYAFKSKDFVDIGVPVVKIKNISNGHIDYNDLQFIDISESDVESKYRLYDNDIVMAMTGA 276 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 + +G GL+ +H LL ++ R AL + +S N ++N + + Sbjct: 277 T---IGKIGLVSTFEHDYLLLNQRVAVLRSNHQAL---LWFMLNSLDLENEILN-LSNGA 329 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 Q IS I + + + V +F +Q + + ++L Sbjct: 330 VQANISSTSIGQVPIPGMSNQMMQKFNNAVHPMFEKI----QQNKKQIKSLEQTRDTLLP 385 Query: 416 KAFRGELTA 424 K G++ Sbjct: 386 KLMSGQVRV 394 >UniRef50_Q3IEL0 Putative type I restriction-modification system, S subunit n=1 Tax=Pseudoalteromonas haloplanktis TAC125 RepID=Q3IEL0_PSEHT Length = 442 Score = 255 bits (653), Expect = 2e-66, Method: Composition-based stats. 
Identities = 81/444 (18%), Positives = 181/444 (40%), Gaps = 54/444 (12%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 K+P W P+ + +GV +K + +++A++I+ ++++ V++P Sbjct: 27 KIPNYWQTIPLRLILDTRKGVAFKSND----FTSSGIRVVKASDIKKLTINSSE-VYLPT 81 Query: 65 NLVKESQK--ISPEDIVIAMSSGSKS----VVGKSAHQHLPFE-CSFGAFCGVLRPEK-L 116 N + K + DI+++ + VG+ + V P++ Sbjct: 82 NYISIYPKAILRKGDIILSTVGSNPDVKNSAVGQIGVVPEHLDGALLNQNTVVFEPKEDK 141 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I F+ + + YR+ + + G +++ + IPIPP EQ+ IA LD Sbjct: 142 IHREFLFKVIQMNGYRDHLDLNAHGTANQSSLSISDMLNFYIPIPPKNEQQKIASFLDHE 201 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 A++D+ A+ E++ ++LK RQAV+ AV L +W P+H + L Sbjct: 202 TAKIDTLIAKQEKLIELLKEKRQAVISHAVTKGLNPNAPMRDSGVEWLGEVPEHWLIGSL 261 Query: 227 NFESILTELRNGLS---SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKL 283 ++ ++ S K P++ G + ++ + Sbjct: 262 RWKVSISSGEGLSSNLVEKNKTELKKIPVIG----GNGVMGFSESSNTHKTA-------- 309 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 + R +CG + + + + + + L + Y+ + + Sbjct: 310 ----IAIGRVGA------LCGNVHLINYISWITDNAL-KISSWDGFDENYLISLLKAAN- 357 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 +N + +T+ Q I+G+ IKS +V++PP+KEQ +I ++ ++ D +EK+ + + Sbjct: 358 ----LNNLASTTAQPLITGEQIKSLIVVIPPLKEQIKINLKLTKIVNLFDKLEKRSKDGI 413 Query: 404 ARVNNLTQSILAKAFRGELTAQWR 427 + ++++ A G++ + Sbjct: 414 NLLKERKTALISAAVTGKIDVRNW 437 Score = 157 bits (399), Expect = 6e-37, Method: Composition-based stats. 
Identities = 33/238 (13%), Positives = 85/238 (35%), Gaps = 13/238 (5%) Query: 210 TEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIR 269 W P + L + + + G +++ S ++ ++ +++ Sbjct: 21 DIDWLRKIPNYWQTIPLRLILDTRKGV--AFKSNDFTSSGIRVVKASDIKKLTINSSEVY 78 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLE----FVGVCGLLKKLQHQNLLYPDKLIRARL 325 S + L+ GD++ + + + VG G++ + LL + ++ Sbjct: 79 LPTNYISIYPKAILRKGDIILSTVGSNPDVKNSAVGQIGVVPEHLDGALLNQNTVVFEPK 138 Query: 326 TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRV 385 E++ R+ + T+ Q +S D+ + + +PP EQ +I + Sbjct: 139 EDKIHREFLFKVIQMNGYRDHLDLNAHGTANQSSLSISDMLNFYIPIPPKNEQQKIASFL 198 Query: 386 EQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + A DT+ + + + Q++++ A + NP+ ++ L Sbjct: 199 DHETAKIDTLIAKQEKLIELLKEKRQAVISHAVT-------KGLNPNAPMRDSGVEWL 249 Score = 80.2 bits (197), Expect = 2e-13, Method: Composition-based stats. Identities = 35/208 (16%), Positives = 72/208 (34%), Gaps = 25/208 (12%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKD-DYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W+I + ++ G + +P+I NG ++ Sbjct: 250 GEVPEHWLIGSLRWKVSISSGEGLSSNLVEKNKTELKKIPVIGG----NGVMGFSE---- 301 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 N K + ++ G + + A + ++ Sbjct: 302 SSNTHKTA----------IAIGRVGALCGNVHLINYISWITDNA--LKISSWDGFDENYL 349 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 K++ +++L++ I + + IPPL EQ I KL ++ D Sbjct: 350 ISLLKAAN----LNNLASTTAQPLITGEQIKSLIVVIPPLKEQIKINLKLTKIVNLFDKL 405 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLT 210 + R + +LK + A++ AV GK+ Sbjct: 406 EKRSKDGINLLKERKTALISAAVTGKID 433 >UniRef50_A1UJN5 Restriction endonuclease S subunits-like protein n=2 Tax=Mycobacterium RepID=A1UJN5_MYCSK Length = 419 Score = 255 bits (652), Expect = 2e-66, Method: Composition-based stats. 
Identities = 85/446 (19%), Positives = 173/446 (38%), Gaps = 40/446 (8%) Query: 9 GW-VIAPVSTVTTLIRGVTYKKEQAINYLK---DDYLPLIRANNIQNGKFDTTDLVFVPK 64 W ++ + G + + + L + ++ G+F ++ + Sbjct: 2 SWAQEVTLAELAE---GGLFSDGDWVESKDQDASGDVRLTQLADVGVGEFRDRSDRWMRR 58 Query: 65 NLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLR-PEKLIFSG 120 + + +D++IA +G+S +LR + Sbjct: 59 DQAHRLRCTFLEGDDVLIARM---PDPIGRSCLVPSSVGSAVTVVDVAILRLARRDANPR 115 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ S + +K+ +L +G I + + IP+P L EQ I + L+ L+++D Sbjct: 116 YVMWALNSPRFHSKVVALQSGTTRKRISRKNLASLTIPLPTLDEQNRIVDLLEDHLSRLD 175 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 + ++ Q A L + + + L E + Sbjct: 176 AAESSLRLAMQKADAMTTASLDRQTTAG---------SRAWRDTTIGAMAELVEYGSSAK 226 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + P+LR+ +++ G ++ +++L +E + LQ GDL+F R N S E V Sbjct: 227 CAGQAADSDVPVLRMGNIQNGKINWTGLKYLPAGHAEFPKLLLQSGDLVFNRTN-SAELV 285 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + + + + LIR R ++ P + + +SP+ R + + GQ + Sbjct: 286 GKSAVFEDTR--AASFASYLIRVRFGQEVNPAWANMVINSPAGRRYVKSVASQQVGQANV 343 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 +G +K+ + LPP+ EQ VR +++ + + Q+ + + R L +++LA AF G Sbjct: 344 NGTKLKAFPLPLPPLDEQCRRVRAHDEVVVSRERLHHQIADLVVRAAGLRRALLAAAFTG 403 Query: 421 ELTAQWRAENPDLISGENSAAALLEK 446 LT NSA LLE+ Sbjct: 404 RLT--------------NSAEGLLEE 415 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. 
Identities = 43/216 (19%), Positives = 81/216 (37%), Gaps = 5/216 (2%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK- 68 W + + L+ + D +P++R NIQNGK + T L ++P + Sbjct: 207 WRDTTIGAMAELVEYG--SSAKCAGQAADSDVPVLRMGNIQNGKINWTGLKYLPAGHAEF 264 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 + D+V ++ S +VGKSA SF ++ +R + + + S Sbjct: 265 PKLLLQSGDLVFNRTN-SAELVGKSAVFEDTRAASFASYLIRVRFGQEVNPAWANMVINS 323 Query: 129 SLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 R + S+++ N+ +P+PPL EQ D ++ + + Sbjct: 324 PAGRRYVKSVASQQVGQANVNGTKLKAFPLPLPPLDEQCRRVRAHDEVVVSRERLHHQIA 383 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVF 223 + R+A+L A G+LT + Sbjct: 384 DLVVRAAGLRRALLAAAFTGRLTNSAEGLLEELESV 419 >UniRef50_A9A374 Restriction modification system DNA specificity domain n=1 Tax=Nitrosopumilus maritimus SCM1 RepID=A9A374_NITMS Length = 438 Score = 255 bits (652), Expect = 3e-66, Method: Composition-based stats. Identities = 70/432 (16%), Positives = 151/432 (34%), Gaps = 23/432 (5%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYK-KEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 ++PE W I + + T I+ Y + D +P IR I D + ++ Sbjct: 18 EIPETWKICNLGDLLTKIQDGNYGESYPKESEFLDSGIPFIRGTEITKNFIDGKKVKYIS 77 Query: 64 KNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSG 120 K E QK I D++ G V + + G +LR K+I + Sbjct: 78 KTKHDELQKAHIETGDVLFLNRGGITRTVAIV--PPKYDDANIGPQLTLLRCNTKIIHNK 135 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ +F + ++ ++ S AG + I +P + EQ+ I L+++ + Sbjct: 136 YLYYFIQGENFKKQVISSDAGTALQFFGIEKTKKFKITLPEIREQQKIVSVLNSIDNLLS 195 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQ------HSVFKKLNFESILTE 234 S + ++ K Q +L ++ K +K + KK+ L Sbjct: 196 SYDKTIQTTQKLKKGLMQKLLTKGIDHKKFKKVPWLFGKEIEIPEEWEIKKIEDLFKLKS 255 Query: 235 LRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYN 294 P P + + + + + + + N L G L Y Sbjct: 256 GSTPSRKIPEYFAGNIPWITSTDLNRSKITSTLEKITPEAVKQTNLKLLPKGTFLIATYG 315 Query: 295 
-GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKT 353 + G CG+ K + + + E++ F+ ++ + Sbjct: 316 LEAAGTRGKCGITKMES----TCNQACMAFLPSSEITSEFLFYFYL--YFGEKIIFSIAQ 369 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 + Q+ + +K + +PP KEQ IV ++Q D+ ++ + ++ + + + Sbjct: 370 GTKQQNLYSDTLKKVSMFVPPQKEQKRIVNFLDQ----IDSHLFELESKKTGLDKIKKGL 425 Query: 414 LAKAFRGELTAQ 425 + K ++ + Sbjct: 426 IQKLLTSKIRVK 437 >UniRef50_Q8KLM8 Restriction-modification enzyme type I S subunit n=2 Tax=Streptococcaceae RepID=Q8KLM8_STRTR Length = 407 Score = 255 bits (652), Expect = 3e-66, Method: Composition-based stats. Identities = 69/415 (16%), Positives = 155/415 (37%), Gaps = 27/415 (6%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL- 66 + W + ++ I +D +P I+ +I+ + +T +L ++ K Sbjct: 15 DDWEQRKLGELSQKISVGIATSSSKYFSSQDHGVPFIKNQDIKENRINTKNLEYISKEFD 74 Query: 67 -VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIFSGFIAH 124 +++++ DI+ + G SA E + + RP ++I S +I+ Sbjct: 75 NKNKNKRVKQGDII----TARTGYPGLSAVVPKELEGAQTFTTLITRPISEMILSEYISI 130 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 F S +IS + AG N+ + IP+P L EQK I+ + L + + Sbjct: 131 FINSPYGMKQISGMEAGGAQKNVNAGIVQNLLIPLPSLDEQKKISNFILKLDDTIALHQR 190 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + + + + K + Q + ++ F + + ++++ +G P Sbjct: 191 KLDLLKEQKKGYLQKMFPKNGAKVPELRFAGFADDWE----VRKLNEVSDIYDGTHQTPK 246 Query: 245 ESGVGHPILRISSVRAGHVDQNDIRFLECSESELN-RHKLQDGDLLFTRYNGSLEFVGVC 303 G L + +++ +F+ E + + Q GD+L TR +G Sbjct: 247 YQDNGVMFLSVENIK----TLTSNKFISREAFEDEFKIRPQRGDVLMTRIG----DIGTA 298 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 +++ + L + +++ P +++ +P ++ + + K I+ Sbjct: 299 NVVETDEDLAYYVSLALFK---SEELNPYFLQASIYAPFVQDQIWKRTLHIAFPKKINKN 355 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +I + +P + EQ +I +QL D L + + L K F Sbjct: 356 EIGQVPINVPTLAEQTKIGSFFKQL----DKTIALHQRKLDLLKEQKKGFLQKMF 406 
>UniRef50_Q0A7Q2 Restriction modification system DNA specificity domain n=2 Tax=Proteobacteria RepID=Q0A7Q2_ALHEH Length = 419 Score = 255 bits (651), Expect = 3e-66, Method: Composition-based stats. Identities = 68/427 (15%), Positives = 151/427 (35%), Gaps = 27/427 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVPK 64 LP W + + T V + ++ + + + + + + + + K Sbjct: 6 LPATWSSKRLKYLATYNDEVLPESTD-----EEAEIDYVEISGVSLSRGVEQVERITFGK 60 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + +K+ DI+I+ + K S G FC V + + SG++ Sbjct: 61 APSRARRKVRSGDILISTVRTYLRAIAKVDEASPDLIASTG-FCVVRPDREEVDSGYLGW 119 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 KS + +++ S S G + I + I +P+PPL Q+ IA+ LD A++D Sbjct: 120 AAKSEPFVSEVVSRSVGVSYPAINASELVTIEMPLPPLETQRRIAQFLDEKTARIDGLIE 179 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFESILTEL 235 + + L RQA++ AV L + W P H + + Sbjct: 180 KKRALLDRLAEKRQALITRAVTKGLNPEAPMKPSGIDWLGDIPAHWDLVPFKWRCQVQSG 239 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 + P + P++ + +G D+ E + ++ +G +L+++ Sbjct: 240 QVDPRE-PEYTD--MPLIAPDYIESGTGRLYDVPSAEEQGAISGKYFCSEGSVLYSKIRP 296 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTS 355 +L L + L + K Y+ F + + Sbjct: 297 ALR---KVALFDSV----CLCSADMYAIDPGKYFERRYLFYFLLTDAFTAY-AELESLRV 348 Query: 356 GQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILA 415 ++ + + + V+ +P + EQ EI ++ +V ++ ++ +++ Sbjct: 349 AMPKVNREALGAFVLPIPFLDEQTEIADYCSRVDRENRFAADEVKRSVQKLEEYRSALIT 408 Query: 416 KAFRGEL 422 A G++ Sbjct: 409 AAVTGQI 415 Score = 127 bits (320), Expect = 8e-28, Method: Composition-based stats. 
Identities = 30/226 (13%), Positives = 77/226 (34%), Gaps = 14/226 (6%) Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE 277 P K+L + + + + + + + IS V + R Sbjct: 7 PATWSSKRLKYLA---TYNDEVLPESTDEEAEIDYVEISGVSLSRGVEQVERITFGKAPS 63 Query: 278 LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIF 337 R K++ GD+L + L + + + + + ++ Y+ Sbjct: 64 RARRKVRSGDILISTVRTYLRAIAK---VDEASPDLIASTGFCVVRPDREEVDSGYLGWA 120 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 S + +++ I+ ++ + + LPP++ Q I + +++ A D + + Sbjct: 121 AKSEPFVSEVVSR-SVGVSYPAINASELVTIEMPLPPLETQRRIAQFLDEKTARIDGLIE 179 Query: 398 QVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + L R+ Q+++ +A + NP+ + L Sbjct: 180 KKRALLDRLAEKRQALITRAVT-------KGLNPEAPMKPSGIDWL 218 Score = 108 bits (271), Expect = 4e-22, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 70/205 (34%), Gaps = 11/205 (5%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +P W + P + G +E + +PLI + I++G D+ Sbjct: 219 GDIPAHWDLVPFKWRCQVQSGQVDPREP-----EYTDMPLIAPDYIESGTGRLYDVPSAE 273 Query: 64 K-NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + S ++ + + V C A + P K ++ Sbjct: 274 EQGAISGKYFCSEGSVLYSKIRPALRKVA-----LFDSVCLCSADMYAIDPGKYFERRYL 328 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +F + + S + + + +PIP L EQ IA+ + + Sbjct: 329 FYFLLTDAFTAYAELESLRVAMPKVNREALGAFVLPIPFLDEQTEIADYCSRVDRENRFA 388 Query: 183 KARFEQIPQILKRFRQAVLGGAVNG 207 ++ Q L+ +R A++ AV G Sbjct: 389 ADEVKRSVQKLEEYRSALITAAVTG 413 >UniRef50_Q63WE0 Putative type I restriction enzyme specificity protein n=2 Tax=Burkholderia pseudomallei RepID=Q63WE0_BURPS Length = 429 Score = 255 bits (651), Expect = 3e-66, Method: Composition-based stats. 
Identities = 78/436 (17%), Positives = 163/436 (37%), Gaps = 45/436 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLP-LIRANNIQNGKFDTTDLV-F 61 G++P W++ + V I AI+ + P +++ + + +G+F ++ Sbjct: 18 GQVPTHWLVQRLKEVIAFIESGV--SVNAIDTPAGEGEPGVLKTSCVYSGEFTPSENKLV 75 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 VP+ L + + + ++++ + + +VG S + + K F Sbjct: 76 VPEELGRVACPVKAGTVIVSRMN-TPDLVGASGVVRQNYANLYLPDRLWQVHFKNACPEF 134 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + +++++ YR ++ S AG + + N+ F +P+PP +EQ IA L ++ Sbjct: 135 VHYWSQTHSYRAQVESACAGTSSSMKNLSQDEFRSFILPLPPPSEQSAIATFLKHETRKI 194 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFES 230 ++ A E++ +L RQA + AV L W P H K + Sbjct: 195 NALIAEQEKLLTLLAEKRQATISRAVTRGLNPDAPTKDSGVAWLREVPAHWNLKPMKRAV 254 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + + + P++ + H N + ++ Sbjct: 255 VFQRGHD--LPSEDRVEGNIPVVSSGGISGWH----------------NAAATKGPTIVT 296 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 RY EFV L+ L ++ + P+Y+ S ++N Sbjct: 297 GRYGTIGEFV-------LLEEDCWPLNTALYTVQMHDNV-PKYLWYMLQSLKHI-FILNS 347 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 +K S G+ DI +V LPP +EQ IV ++ + D + A+ + Sbjct: 348 LK--SAVPGVDRNDIHPAIVCLPPAEEQPAIVAFLDAEISKLDALRADAERAIDLLKERR 405 Query: 411 QSILAKAFRGELTAQW 426 +++A A G++ + Sbjct: 406 SALIAAAVTGKIDVRN 421 Score = 166 bits (421), Expect = 2e-39, Method: Composition-based stats. 
Identities = 49/236 (20%), Positives = 102/236 (43%), Gaps = 12/236 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHP-ILRISSVRAGHVDQNDIRF 270 W P H + ++L E +++ +G G P +L+ S V +G ++ + Sbjct: 15 PWLGQVPTHWLVQRLKEVIAFIESGVSVNAIDTPAGEGEPGVLKTSCVYSGEFTPSENKL 74 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + E ++ G ++ +R N + + VG G++++ + NL PD+L + Sbjct: 75 VVPEELGRVACPVKAGTVIVSRMN-TPDLVGASGVVRQ-NYANLYLPDRLWQVHFKNAC- 131 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 PE++ + + S R + + TS K +S + +S ++ LPP EQ+ I ++ Sbjct: 132 PEFVHYWSQTHSYRAQVESACAGTSSSMKNLSQDEFRSFILPLPPPSEQSAIATFLKHET 191 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLE 445 + + + L + Q+ +++A R NPD + ++ A L E Sbjct: 192 RKINALIAEQEKLLTLLAEKRQATISRAVT-------RGLNPDAPTKDSGVAWLRE 240 >UniRef50_A6TIP1 Putative restriction endonuclease S subunit n=2 Tax=Proteobacteria RepID=A6TIP1_KLEP7 Length = 438 Score = 254 bits (650), Expect = 4e-66, Method: Composition-based stats. 
Identities = 74/438 (16%), Positives = 147/438 (33%), Gaps = 35/438 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF- 61 G++PE W + + V KK Y + L ++ +F + D+ F Sbjct: 18 IGQVPEHWEVKRLRHVGRYSNSGVDKKS----YEDQQTVELCNYTDVYYNEFISDDMPFM 73 Query: 62 --VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEKLIF 118 E + D++I S S +G A G ++R + Sbjct: 74 QATASAHEIEQFTLKKGDVIITKDSEDPSDIGIPAFVPHDMPGVVCGYHLTMIRALNDNY 133 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 +I +S R S G + + + +PP EQ IA LD A+ Sbjct: 134 GSYIHRSIQSDHTRAHFFVESPGITRYGLNQNTIGNAPVALPPPEEQATIAATLDRETAR 193 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKK-LNF 228 +D+ + + ++LK RQA++ AV L +W P+H K Sbjct: 194 IDALVEKKIRFIELLKEKRQALITHAVTKGLDPNVKMKDSGVEWIGQVPEHWEVKPFFAL 253 Query: 229 ESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH-KLQDGD 287 S L GL + L ++ + + + R + + + ++ G+ Sbjct: 254 VSELNRKNVGL------AETNILSLSYGNI----IQKPETRNMGLTPESYETYQIVESGE 303 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 ++F + + L Q + + + Y S Sbjct: 304 VVFRFTDLQND---KRSLRSAQVTQRGIITSAYMAVKPHS-IGSTYFAWLMRSYDLCKVF 359 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 ++ + +D++ VL+PPV EQ+EI + A D + ++ ++ + Sbjct: 360 --YAMGGGLRQSLKFEDVRRLPVLIPPVGEQSEITNTINAGTARIDALVEKTEQSITLLK 417 Query: 408 NLTQSILAKAFRGELTAQ 425 + + A G++ + Sbjct: 418 ERRAAFITAAVTGQIDLR 435 Score = 162 bits (411), Expect = 2e-38, Method: Composition-based stats. 
Identities = 40/234 (17%), Positives = 86/234 (36%), Gaps = 13/234 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H K+L + +G+ K E + + V +D+ F+ Sbjct: 16 EWIGQVPEHWEVKRLRHVGRYS--NSGVDKKSYEDQQTVELCNYTDVYYNEFISDDMPFM 73 Query: 272 --ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 S E+ + L+ GD++ T+ + +G+ + ++ L R D Sbjct: 74 QATASAHEIEQFTLKKGDVIITKDSEDPSDIGIPAFVPHDMP-GVVCGYHLTMIRALNDN 132 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 YI S R + G++ I + V LPP +EQA I +++ Sbjct: 133 YGSYIHRSIQSDHTRAHFF-VESPGITRYGLNQNTIGNAPVALPPPEEQATIAATLDRET 191 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + ++ + + Q+++ A + +P++ ++ + Sbjct: 192 ARIDALVEKKIRFIELLKEKRQALITHAVT-------KGLDPNVKMKDSGVEWI 238 >UniRef50_C1ZA47 Restriction endonuclease S subunit n=1 Tax=Planctomyces limnophilus DSM 3776 RepID=C1ZA47_PLALI Length = 413 Score = 254 bits (650), Expect = 4e-66, Method: Composition-based stats. Identities = 92/432 (21%), Positives = 162/432 (37%), Gaps = 40/432 (9%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI-QNGKFDTTDLVFVPKNLV 67 GW+ + V G+ +K E+ + +IR N + G D +D+ ++ Sbjct: 4 GWIYKTLDDVCEFNNGL-WKGEKPPFVT----VGVIRNTNFTKEGTLDDSDIAYIEVEAK 58 Query: 68 K-ESQKISPEDIVIAMSSGSKS-VVGKSAHQHL-PFECSFGAFCGVLRPE--KLIFSGFI 122 K E +++ D+++ S G VG+ A + SF F +R + K + F+ Sbjct: 59 KFEKRRLVFGDLILEKSGGGPKQPVGRVALFDKRAGDFSFSNFTAAIRVKDPKTLDFRFL 118 Query: 123 AHFTKSSLYRNKISS-----LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F + +S S I N+ + I +P+PPL EQ+ I LD Sbjct: 119 HKFL----FWTHLSGVTETMQSHSTGIRNLNGDVYKCIEVPLPPLTEQRRIVGILDEAFE 174 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTE--L 235 + + KA E+ Q + ++ L + V K + + + + Sbjct: 175 GLATAKANAEKNLQNARALFESHLQAVFTQR---------GDGWVEKTVKDVASPIKGSI 225 Query: 236 RNGLSSK----PNESGVGHPILRISSVRAGHVDQNDIRFLECSES-ELNRHKLQDGDLLF 290 R G G +L I + A RF+ + +L R+++ GD+L Sbjct: 226 
RTGPFGSQLLHSEFVDEGIAVLGIDNAVANEFRWGKSRFITKDKFGQLERYRVYPGDVLI 285 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNC 350 T G C ++ + K LP Y+ ++F A + Sbjct: 286 TIMGTC----GRCAVVPDDIPTAINTKHICCITLDWKKCLPSYLHLYFLHAQQSQAFLAK 341 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLT 410 + G++ I+ VLLPP + Q+ IV L +E LA ++ L Sbjct: 342 HAKGAIMAGLNMGLIQELPVLLPPTQVQSAIVEAANDLREETQRLESLYQRKLAALDELK 401 Query: 411 QSILAKAFRGEL 422 +S+L +AF GEL Sbjct: 402 KSLLHRAFSGEL 413 Score = 127 bits (319), Expect = 1e-27, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 82/211 (38%), Gaps = 13/211 (6%) Query: 8 EGWVIAPVSTVTTLIRG----VTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 +GWV V V + I+G + + + D+ + ++ +N +F F+ Sbjct: 207 DGWVEKTVKDVASPIKGSIRTGPFGSQLLHSEFVDEGIAVLGIDNAVANEFRWGKSRFIT 266 Query: 64 KNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGA-FCGVLRPE-KLIFS 119 K+ E ++ P D++I + G+ A + + + K Sbjct: 267 KDKFGQLERYRVYPGDVLITIM----GTCGRCAVVPDDIPTAINTKHICCITLDWKKCLP 322 Query: 120 GFIA-HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ +F + + ++ + GA + + + + +PP Q I E + L + Sbjct: 323 SYLHLYFLHAQQSQAFLAKHAKGAIMAGLNMGLIQELPVLLPPTQVQSAIVEAANDLREE 382 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 ++ +++ L ++++L A +G+L Sbjct: 383 TQRLESLYQRKLAALDELKKSLLHRAFSGEL 413 >UniRef50_Q310I9 Type I restriction enzyme, S subunit n=2 Tax=Deltaproteobacteria RepID=Q310I9_DESDG Length = 474 Score = 254 bits (649), Expect = 6e-66, Method: Composition-based stats. 
Identities = 75/457 (16%), Positives = 154/457 (33%), Gaps = 41/457 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P W I + ++ ++ ++I V Sbjct: 18 VGSIPAHWPEKRAKYYFKEIDDRSQTGDEE----------MLSVSHIT--GVTPRSQKNV 65 Query: 63 PKNLVKES---QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 + + ++ P D++I S +G S GV RP Sbjct: 66 TMFKAESNVGQKRCQPGDLIINTMWAWMSALGVS-----NHAGIVSPAYGVYRPRSNQDY 120 Query: 120 GFIAH--FTKSSLYRNKISSLSAG--ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + + YR++ S G ++ + P F + + PP EQ+ IA L Sbjct: 121 DYYYLDSLLRIEGYRSEYICRSTGIRSSRLRLYPDKFLSMPVVCPPQEEQQTIARFLKAQ 180 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKL 226 + ++LK +Q V+ AV L +W P+H ++L Sbjct: 181 DRLFRKFIRNKRRFIELLKEQKQNVINQAVTRGLDPKVQFKPSGVEWIGDIPEHWDARRL 240 Query: 227 NFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL--ECSESELNRHKLQ 284 + + +G+ NE V + V I F+ + E+ +L+ Sbjct: 241 RTLAAVRA--SGVDKNTNEDEVPVMLCNYVDVYKNDRITAAIDFMKATATPEEIRAFELK 298 Query: 285 DGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSA 343 GD++ T+ + S + + + + + ++ L R + + E++ FSS Sbjct: 299 AGDVIITKDSESWDDIAIPTFVPET-IPGVVCAYHLALIRPFSGEIEGEFLFRAFSSDPV 357 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + T + G++ IK LPP++EQ I+ + + A + + Sbjct: 358 ADQF-RIAATGVTRFGLAQGAIKGAFFPLPPLEEQRAIIAHINEKCAEISQAISRAEREI 416 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSA 440 + +++ G++ + E P++ E A Sbjct: 417 ELMREYRTRLISDVVTGQVDVR-GIEVPEVADEELLA 452 Score = 132 bits (333), Expect = 2e-29, Method: Composition-based stats. 
Identities = 35/241 (14%), Positives = 83/241 (34%), Gaps = 18/241 (7%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLI--RANNIQNGK-----FD 55 G +PE W + T+ + +D +P++ ++ D Sbjct: 228 IGDIPEHWDARRLRTLAAVRASG------VDKNTNEDEVPVMLCNYVDVYKNDRITAAID 281 Query: 56 TTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRP- 113 P+ + + ++ D++I S S + ++RP Sbjct: 282 FMKATATPEEI--RAFELKAGDVIITKDSESWDDIAIPTFVPETIPGVVCAYHLALIRPF 339 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 I F+ S ++ + G + + P+PPL EQ+ I ++ Sbjct: 340 SGEIEGEFLFRAFSSDPVADQFRIAATGVTRFGLAQGAIKGAFFPLPPLEEQRAIIAHIN 399 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 A++ +R E+ ++++ +R ++ V G++ + P+ + + L E Sbjct: 400 EKCAEISQAISRAEREIELMREYRTRLISDVVTGQVDVRGI-EVPEVADEELLALEEDTA 458 Query: 234 E 234 + Sbjct: 459 D 459 Score = 112 bits (281), Expect = 3e-23, Method: Composition-based stats. Identities = 37/233 (15%), Positives = 77/233 (33%), Gaps = 23/233 (9%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLE 272 W P H K+ + E+ + S +E +L +S + + Sbjct: 17 WVGSIPAHWPEKRAKY--YFKEIDD-RSQTGDEE-----MLSVSHITGVTPRSQKNVTMF 68 Query: 273 CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK-LIRARLTKDALP 331 +ES + + + Q GDL+ + +GV H ++ P + R R +D Sbjct: 69 KAESNVGQKRCQPGDLIINTMWAWMSALGVS------NHAGIVSPAYGVYRPRSNQDYDY 122 Query: 332 EYIEIFFSSPSAR-NAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 Y++ R + S + + S V+ PP +EQ I R ++ Sbjct: 123 YYLDSLLRIEGYRSEYICRSTGIRSSRLRLYPDKFLSMPVVCPPQEEQQTIARFLKAQDR 182 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 + + + Q+++ +A R +P + + + Sbjct: 183 LFRKFIRNKRRFIELLKEQKQNVINQAVT-------RGLDPKVQFKPSGVEWI 228 >UniRef50_A2TPX3 RmeS n=1 Tax=Dokdonia donghaensis MED134 RepID=A2TPX3_9FLAO Length = 395 Score = 253 bits (647), Expect = 9e-66, Method: Composition-based stats. 
Identities = 75/417 (17%), Positives = 162/417 (38%), Gaps = 35/417 (8%) Query: 14 PVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ-- 71 ++TV + G +K + +PL+R +N +G+ D +S+ Sbjct: 7 TLTTVCAIKNGFAFKSKD----YLTKGIPLLRISNFNDGEVYINDNQIYVDAKYLKSKND 62 Query: 72 -KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP--EKLIFSGFIAHFTKS 128 + D++IA+S + G + F G+++ + S + ++ Sbjct: 63 FIVEKGDVLIALSGATTGKYG---IYNFDFPSLLNQRIGLIKSGESDTLNSRYFYYYLN- 118 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + +++I + GA NI IP+PPL QK IA+ LD A D+T ++ Sbjct: 119 -ILKSEILRNAGGAAQPNISTKKIGTFEIPLPPLETQKRIAQILDDAAALRDTTAQLLKE 177 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKL-NFESILTELRNGLSSKPNESG 247 + + + G V P+ + + N S L G+ +E Sbjct: 178 YDLLAQSIFLEMFGDPV----------MNPKEWIKTRFANLVSSNCPLTYGIVQPGDEYE 227 Query: 248 VGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 G P +R + + ++ ++++ ++ + ++ +R L+ G++L + VGV + Sbjct: 228 NGIPCVRPVDLTSQYISVDNLKKIDPAISNKFSRTILEGGEILLSVRGS----VGVISIA 283 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + + K + Y + + +N + + + I+ KD++ Sbjct: 284 DDSLKGANVTRGIVPIWFDKKISNRLYFYYLYKTKRIQNQI-KRLSKGATLVQINLKDLR 342 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 ++ PP++ Q + ++ A + + L +L +L KAF+GEL Sbjct: 343 ELKIIQPPIELQNQFANKI----ALIEQQKALAKQELQESEDLFNCLLQKAFKGELV 395 Score = 104 bits (259), Expect = 1e-20, Method: Composition-based stats. 
Identities = 36/207 (17%), Positives = 81/207 (39%), Gaps = 13/207 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+ W+ + + + +TY Q + ++ +P +R ++ + +L + + Sbjct: 197 PKEWIKTRFANLVSSNCPLTYGIVQPGDEYEN-GIPCVRPVDLTSQYISVDNLKKIDPAI 255 Query: 67 VKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLIFSG-FI 122 + + +I++++ VG + + + + +K I + + Sbjct: 256 SNKFSRTILEGGEILLSVR----GSVGVISIADDSLKGANVTRGIVPIWFDKKISNRLYF 311 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + K+ +N+I LS GA + I + I PP+ Q A K+ A ++ Sbjct: 312 YYLYKTKRIQNQIKRLSKGATLVQINLKDLRELKIIQPPIELQNQFANKI----ALIEQQ 367 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 KA +Q Q + +L A G+L Sbjct: 368 KALAKQELQESEDLFNCLLQKAFKGEL 394 >UniRef50_UPI000038E018 type I restriction-modification enzyme, S subunit, putative n=1 Tax=Ferroplasma acidarmanus fer1 RepID=UPI000038E018 Length = 420 Score = 252 bits (645), Expect = 2e-65, Method: Composition-based stats. Identities = 81/438 (18%), Positives = 159/438 (36%), Gaps = 35/438 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLI-RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 G++P+ W + V +LI GVTYK+ + KD P+ R I K DT + Sbjct: 3 EIGEIPQEWGFVKLGDVLSLIKNGVTYKQNK-----KDSGYPVTRIETISEEKIDTAKVG 57 Query: 61 FVPKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR--PEKL 116 ++ + ++ DI+ + + S +GK+A E +L + Sbjct: 58 YIDNIKTENINDYRLIEGDILFSHIN-SLEHIGKTAIYEGEPELLLHGMNLLLLRSDKSK 116 Query: 117 IFSGFIAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 I ++ + K + S++ N +I I IP+PPL EQ+ IAE L T Sbjct: 117 IEPSYLVYSLKFYRAKELFKSMAKRAVNQASINQTELKRIKIPLPPLPEQQKIAEILSTA 176 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTE 234 ++ + Q+ K Q +L + + P+ L Sbjct: 177 DDEIQKMDEQIALAEQLKKGLMQKLLTRGIGHTRFKTTEIGEIPEEWDTFGLGEIFKTIT 236 Query: 235 LRNGLSSKPNESGVG-HPILRISSVR--AGHVDQ--NDIRFLECSESELNRHKLQDGDLL 289 + + G L + + ++ + E + E N + L + +L Sbjct: 237 GTTPSTKVKDYWHGGTIEWLTPKDLNKLNNTITLPPSERKVTEKALKENNLNILPENSIL 296 Query: 290 
FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD--ALPEYIEIFFSSPSARNAM 347 + + VG G+ + + + + P + + S + Sbjct: 297 IS----TRAPVGYVGI----NNTKITFNQGCKGLVPLNRDVSFPFFYAYYLKS---KTTF 345 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 +N + T S K +S + + VV LPP+ EQ +I + + + + N ++N Sbjct: 346 LNSLSTGSTFKELSKEGLDDVVVPLPPLPEQQKIGEILSTVDNKLELL----GNKREKLN 401 Query: 408 NLTQSILAKAFRGELTAQ 425 L + ++ F G++ + Sbjct: 402 VLKKGLMNDLFTGKVRVK 419 >UniRef50_A8TH56 Restriction modification system DNA specificity domain n=1 Tax=Methanococcus voltae A3 RepID=A8TH56_METVO Length = 412 Score = 252 bits (645), Expect = 2e-65, Method: Composition-based stats. Identities = 71/425 (16%), Positives = 154/425 (36%), Gaps = 26/425 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVF 61 G +P W + + V + I + + Y + I NN++NG T D Sbjct: 11 IGLIPNDWEVKKLGDVCSFIGDGIHSTPK---YCTNGKYYFINGNNLKNGTIVHTNDTKL 67 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + ++ ED ++ + +G ++ + + G + + F Sbjct: 68 ISFEEFNKLKQKIAEDALLLSIN---GTIGNCSYYN-NEKILLGKSVAYINLKNKNIKNF 123 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I + +S ++ S G+ I N+ S + IP+PPL EQ+ IAE L +++ Sbjct: 124 IYYVIQSPRTVSQFYSELTGSTIKNLSLKSLRNLCIPLPPLKEQQKIAEILTKWDNHIET 183 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR-NGLS 240 + + + K Q +L G V F + +K++ I L+ NGLS Sbjct: 184 LENLISKKEEYKKGLMQNLLTGKVR---------FPGFNEEWKEVKLGEICKFLKGNGLS 234 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + + + + I+ + + + GD+L + Sbjct: 235 KEKLNKNGKFKCILYGELYTTY--SEVIKEVLSKTDFKEKIHSEKGDILIPASTTTTGID 292 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 ++ L ++R + E++ + + + + + + Sbjct: 293 LANATAINEENVILGGDINILRKKYENKYNNEFLAYYLT--YGKKYELAKYAQGTTIVHL 350 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 GKDIK+ + LP ++EQ +I ++ + D + + L + + ++ K G Sbjct: 351 
YGKDIKNMKIQLPTLEEQEQIA----EVLSLQDKEIEILKEKLELLKMQKKGLMQKLLTG 406 Query: 421 ELTAQ 425 E+ + Sbjct: 407 EIRVK 411 >UniRef50_B4S4B7 Restriction modification system DNA specificity domain n=3 Tax=Bacteria RepID=B4S4B7_PROA2 Length = 456 Score = 252 bits (644), Expect = 2e-65, Method: Composition-based stats. Identities = 70/421 (16%), Positives = 138/421 (32%), Gaps = 22/421 (5%) Query: 9 GWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN-L 66 W V V+T + G T +A +P I + + + TT FV + + Sbjct: 45 DWKKTTVGKVSTGFLSGGTPSTSRA--DYWKGEIPWITSKWLGDKLELTTGEKFVSEEAI 102 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCG-VLRPEKLIFSGFIAHF 125 + KI P+D +I + VG + + + VL + F+A+ Sbjct: 103 KNTATKIVPKDSIIFATRVGVGKVGINRI-----DLAINQDLAGVLIDNENYDIKFLAYQ 157 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 + ++ GA I I + I + IPPL EQK IA L T+ +++ + Sbjct: 158 LGIDSIQQYVAMNKRGATIKGITRDCLEQIQLNIPPLPEQKKIAHILSTVQRAIEAQERI 217 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEK-WRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + ++ K + + + ++ P+ K+ + + P Sbjct: 218 IQTTTELKKALMHKLFTEGLRNEPQKETEIGLVPESWEVCKVGDVAKIQSGGTPSRDVPE 277 Query: 245 ESGVG-HPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P ++ + + + + + G LL Y + G Sbjct: 278 NWRDGTIPWVKTGEINYCVIKDTEEKITPTGLANSAAQLFPTGTLLMAMYGQGITR-GKV 336 Query: 304 GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 GLL N I ++ FF + + + + Q+ +S Sbjct: 337 GLLGIEAATNQACAS--IIPIDQDQISSVFLYYFF---EFQYENLRQLGHGANQRNMSAG 391 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 I+ + P +EQA ++ E L D + L +++L + + Sbjct: 392 LIRGFPLSFPKFEEQAAMIAAFESL----DKKRYFHERKRTQFQGLFRTLLHELMNAKTR 447 Query: 424 A 424 Sbjct: 448 V 448 Score = 138 bits (349), Expect = 3e-31, Method: Composition-based stats. 
Identities = 38/204 (18%), Positives = 69/204 (33%), Gaps = 9/204 (4%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +PE W + V V + G T + +D +P ++ I T+ Sbjct: 246 EIGLVPESWEVCKVGDVAKIQSGGTPSR-DVPENWRDGTIPWVKTGEINYCVIKDTEEKI 304 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE--KLIFS 119 P L + ++ P ++ G GK L E + C + P I S Sbjct: 305 TPTGLANSAAQLFPTGTLLMAMYGQGITRGKVGL--LGIEAATNQACASIIPIDQDQISS 362 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKI-IAEKLDTLLAQ 178 F+ +F + + L GAN N+ + P EQ IA ++L + Sbjct: 363 VFLYYFFEFQY--ENLRQLGHGANQRNMSAGLIRGFPLSFPKFEEQAAMIAAF-ESLDKK 419 Query: 179 VDSTKARFEQIPQILKRFRQAVLG 202 + + Q + + ++ Sbjct: 420 RYFHERKRTQFQGLFRTLLHELMN 443 >UniRef50_D2TPV5 Putative Type I restriction-modification system, specificity (S) subunit n=1 Tax=Citrobacter rodentium ICC168 RepID=D2TPV5_CITRO Length = 446 Score = 252 bits (644), Expect = 3e-65, Method: Composition-based stats. Identities = 68/441 (15%), Positives = 157/441 (35%), Gaps = 46/441 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + V +G + + LP + ++N T + Sbjct: 19 GEIPIHWKMLRHKYVAFFTKGKNP--TNLLEQPLKNTLPYLSMECLRNN--TTDKYALIS 74 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVV--GKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 ++ + ++ + GKS + ++ P + S + Sbjct: 75 NDV----RVALEGQPLVIWDGSNAGEFLKGKSGILSSTMAAAT-----LIYP---LHSQY 122 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + S ++ + G I ++ I+ IP + EQK +A+ LD A++D+ Sbjct: 123 YWYLCIS--IEPEMRKNAVGMGIPHVNGDELRSISFGIPSIYEQKQVADFLDHETAKIDN 180 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 + +Q+ ++LK RQAV+ AV L +W P+H ++ + + Sbjct: 181 LIEKQQQLIELLKEKRQAVISHAVTKGLNPDVPMKDSGVEWLGDVPEHWRVSRIKNYAKI 240 Query: 233 TELRNGLSSKPNES-GVGHPILRISSVRA----GHVDQNDIRFLECSESELNRHKLQDGD 287 +KP P + ++ + +++ + E + + H L Sbjct: 241 ESGHTPSRTKPEYWISCNIPWVSLNDSKQLKEIDYIEDTFYKISELGMANSSAHLLPARA 300 Query: 288 
LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNA 346 ++FTR +G+ + +++ LI + +PE++ + F A Sbjct: 301 VVFTRD----ASIGLSAI----TTKSMAVSQHLIAWICDEKFIIPEFLLLVF---YAMEK 349 Query: 347 MMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARV 406 + K I +++ PPV+EQ ++ + +V + L+ + Sbjct: 350 EFERYTFGATIKTIGMDNVRGLKSTFPPVEEQRNLIDWAFSKIEKIKSSINKVEDMLSLL 409 Query: 407 NNLTQSILAKAFRGELTAQWR 427 ++++ A G++ + Sbjct: 410 QERRTALISAAVTGKIDVRDW 430 Score = 122 bits (306), Expect = 4e-26, Method: Composition-based stats. Identities = 32/232 (13%), Positives = 75/232 (32%), Gaps = 25/232 (10%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H + + + T+ +N + P L + +R D Sbjct: 16 EWLGEIPIHWKMLRHKYVAFFTKGKNPTNLLEQPLKNTLPYLSMECLRNNTTD------- 68 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 ++ L D+ L + L+ ++ + + A L Sbjct: 69 --------KYALISNDVRVALEGQPLVIWDGSNAGEFLKGKSGILSSTMAAATLIYPLHS 120 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y S M G ++G +++S +P + EQ ++ ++ A Sbjct: 121 QYYWYLCIS---IEPEMRKNAVGMGIPHVNGDELRSISFGIPSIYEQKQVADFLDHETAK 177 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q++++ A + NPD+ ++ L Sbjct: 178 IDNLIEKQQQLIELLKEKRQAVISHAVT-------KGLNPDVPMKDSGVEWL 222 >UniRef50_A3US47 Type I site-specific deoxyribonuclease n=1 Tax=Vibrio splendidus 12B01 RepID=A3US47_VIBSP Length = 413 Score = 251 bits (643), Expect = 3e-65, Method: Composition-based stats. 
Identities = 75/423 (17%), Positives = 166/423 (39%), Gaps = 21/423 (4%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P GW I + ++ T+ RG + + +P ++ +I + K + Sbjct: 2 VPNGWSIKTLESLATVERGKFSARPRNDPKYYGGEIPFVQTGDIASAKTYLSSFNQTLNE 61 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + ++ PE+ ++ + + + FE + ++P++ I ++ F Sbjct: 62 DGLKVSRLFPENSILITIAANIGDTAITT-----FEVACPDSLVGIQPKQDIDCFWLNSF 116 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 ++ ++++ + NI + I PP EQ+ IA+ L T + +T+ Sbjct: 117 LETC--KDELDGKATQNAQKNINLQVLKPLEILTPPYKEQQKIAKILSTWDKAITTTEKL 174 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE 245 Q K Q +L G +L + ++++ + +++ +G P Sbjct: 175 IATSKQQKKALMQQLLTG--KKRLVNPDTGKTFEG-EWEEVKLGDVCSKVTDGAHHSPKS 231 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSESE---LNRHKLQDGDLLFTRYNGSLEFVGV 302 G+P+L + +RA +N R + + E K + D+L + L++ Sbjct: 232 VECGYPMLSVKDMRATKFSENTARHISKEDYEALVKQNCKPELNDILIAKDGSILKY--- 288 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 C ++++ +L L+R +L P +I +FS S R + + + SG I Sbjct: 289 CFVVREEIEGVILSSIALLRPKL-SIISPNFIAQYFSQESVRFFVGKALTSGSGVPRIIL 347 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 KD K + +P + EQ +I + AD + LA ++++ + G+ Sbjct: 348 KDFKGIHLRIPSLLEQQKIASVL----TAADKEIEVFEAKLAHFKQEKKALMQQLLTGKR 403 Query: 423 TAQ 425 + Sbjct: 404 RVK 406 >UniRef50_B0RYC3 Type I site-specific deoxyribonuclease (Specificity subunit) n=2 Tax=Xanthomonas campestris pv. campestris RepID=B0RYC3_XANCB Length = 438 Score = 251 bits (642), Expect = 3e-65, Method: Composition-based stats. 
Identities = 79/434 (18%), Positives = 163/434 (37%), Gaps = 26/434 (5%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 LP+GW + + K + + ++P+ + + D T + + Sbjct: 9 LPQGWTRRRLR--FDCLSNPVKSKLDIPDDTEVSFVPMDAVGELGGLRLDQTREL---AD 63 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF--GAFCGVLRPEKLIFSGFIA 123 + + D+ IA + GK A VLRP + + F+ Sbjct: 64 VYNGYTYFADGDVCIAKITPCFEN-GKGAIAEGLVNGVAFGTTELHVLRPSATLDTRFLF 122 Query: 124 HFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + T + +R+ + G + + +P + Q+ IA LD A++D+ Sbjct: 123 YLTIAHDFRSHGEAEMLGASGQKRVPEEFLKDWTPSLPRMDVQQRIARFLDDKTARIDAL 182 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + +++ + L+ RQA++ AV L W + P+H K L + Sbjct: 183 IEKKQELLERLEEKRQALITRAVTKGLNPDLPMKPSGVDWLGYVPRHWEVKTLRRH--VQ 240 Query: 234 ELRNGLSSKPNE---SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLF 290 + G S + +L+ V G D+N+ + L + +++ D+L Sbjct: 241 RIEQGWSPQTERRMAEPDEWGVLKSGCVNLGIYDENEQKALPGTLDPKPELEVRANDVLM 300 Query: 291 TRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMN 349 R +GS++++G L+++ + L++ DK R L+ + EY S+ R + Sbjct: 301 CRASGSMQYIGSVALVERTR-TKLMFSDKTYRISLSSANTDREYFVRMMSAKHLREQIRL 359 Query: 350 CVKTTSGQK-GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 V G I ++ + PP+ EQ +I + + D E ++ + Sbjct: 360 SVSGAEGLANNIPQSNVLEYLHAFPPLLEQVQIADFLRESIGDLDEAEGKIRASSESWRA 419 Query: 409 LTQSILAKAFRGEL 422 +++ A G+L Sbjct: 420 YRLALVTAAVTGQL 433 Score = 131 bits (329), Expect = 7e-29, Method: Composition-based stats. 
Identities = 39/227 (17%), Positives = 81/227 (35%), Gaps = 14/227 (6%) Query: 218 PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSES 276 PQ ++L F+ + +++ L + + +V G + + R E ++ Sbjct: 10 PQGWTRRRLRFDCLSNPVKSKLDIP---DDTEVSFVPMDAVGELGGLRLDQTR--ELADV 64 Query: 277 ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEI 336 DGD+ + E G + + L + +L R + ++ Sbjct: 65 YNGYTYFADGDVCIAKITPCFEN-GKGAIAEGLVNGVAFGTTELHVLRPSATLDTRFLFY 123 Query: 337 FFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIE 396 + R+ + SGQK + + +K LP + Q I R ++ A D + Sbjct: 124 LTIAHDFRSHGEAEMLGASGQKRVPEEFLKDWTPSLPRMDVQQRIARFLDDKTARIDALI 183 Query: 397 KQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ L R+ Q+++ +A + NPDL + L Sbjct: 184 EKKQELLERLEEKRQALITRAVT-------KGLNPDLPMKPSGVDWL 223 Score = 95.9 bits (238), Expect = 3e-18, Method: Composition-based stats. Identities = 39/214 (18%), Positives = 84/214 (39%), Gaps = 8/214 (3%) Query: 4 GKLPEGWVIAPVSTVTT-LIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P W + + + +G + + E+ + + ++++ + G +D + + Sbjct: 224 GYVPRHWEVKTLRRHVQRIEQGWSPQTERRMAEPDEWG--VLKSGCVNLGIYDENEQKAL 281 Query: 63 PKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPEK-LIFS 119 P L + ++ D+++ +SGS +G A + F + Sbjct: 282 PGTLDPKPELEVRANDVLMCRASGSMQYIGSVALVERTRTKLMFSDKTYRISLSSANTDR 341 Query: 120 GFIAHFTKSSLYRNKISSLSAGANI--NNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + + R +I +GA NNI ++ PPL EQ IA+ L + Sbjct: 342 EYFVRMMSAKHLREQIRLSVSGAEGLANNIPQSNVLEYLHAFPPLLEQVQIADFLRESIG 401 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE 211 +D + + + + +R A++ AV G+L E Sbjct: 402 DLDEAEGKIRASSESWRAYRLALVTAAVTGQLPE 435 >UniRef50_D2MXN5 Putative uncharacterized protein n=1 Tax=Campylobacter jejuni subsp. jejuni 414 RepID=D2MXN5_CAMJE Length = 411 Score = 251 bits (642), Expect = 4e-65, Method: Composition-based stats. 
Identities = 71/427 (16%), Positives = 161/427 (37%), Gaps = 36/427 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF---DTTDLV 60 G++P+ W + P+ + D+ PL+ I NG D TD Sbjct: 13 GEIPQDWEVVPIRCCF----------GEFNIRCNDNDYPLLSVT-IANGVVYQNDITDKK 61 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + + + I + VG ++ + V P K I Sbjct: 62 DISNDDKSNYKIVPLGAIAYNKMRMWQGAVGI----NMLEKGIVSPAYVVAIPNKQINIS 117 Query: 121 FIAHFTKSSLYRNKISSLSAG--ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 F + KS + S G +++NN++ F I IP+PPL EQ+ I LD Q Sbjct: 118 FSYYLLKSRNIIGEYEKNSYGLCSDMNNLRYEDFQNIKIPLPPLKEQEQIVNFLDEKCEQ 177 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 + + + E++ +LK +QA++ + L + + ++ + +L++ Sbjct: 178 IANFIEKKEKLISLLKEQKQALINETITKGLNKNVNFKDSGIEWLGEIPEHWKILKLKHI 237 Query: 239 LSSKPNESGVGHPILRISSV--RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 S + +S + + ++ + G + E + GD+LF + Sbjct: 238 ASLRNQKSNNIDFRIGLENIESKTGKFIPSSEIVFEEDG-----IGFEKGDILFGKLRPY 292 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 L V L ++ + + + ++ ++ ++I+ S + +++ + Sbjct: 293 LAKV-------FLTDRDGICVSEFLVLKIKSESN-KFIKFLMLSSLFID-IVDSSTYGTK 343 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 + + I + + LPP+KEQ +I +++ D + ++ + + +++ + Sbjct: 344 MPRANWEFIGNLKIPLPPLKEQEQIANFLDKKCEKIDLLIEKTKKQIKLIKEYKTTLINQ 403 Query: 417 AFRGELT 423 A G + Sbjct: 404 AVCGRMD 410 Score = 116 bits (292), Expect = 1e-24, Method: Composition-based stats. 
Identities = 30/236 (12%), Positives = 79/236 (33%), Gaps = 27/236 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV---DQNDI 268 +W PQ + +R + +P+L ++ + G V D D Sbjct: 10 EWLGEIPQDWEVVPIRCCFGEFNIR--------CNDNDYPLLSVT-IANGVVYQNDITDK 60 Query: 269 RFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 + + + N + G + + + VG+ + + ++ P ++ K Sbjct: 61 KDISNDDK-SNYKIVPLGAIAYNKMRMWQGAVGI-----NMLEKGIVSPAYVVAI-PNKQ 113 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTT-SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 + S + S + +D ++ + LPP+KEQ +IV +++ Sbjct: 114 INISFSYYLLKSRNIIGEYEKNSYGLCSDMNNLRYEDFQNIKIPLPPLKEQEQIVNFLDE 173 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 ++ ++ + Q+++ + + N ++ ++ L Sbjct: 174 KCEQIANFIEKKEKLISLLKEQKQALINETIT-------KGLNKNVNFKDSGIEWL 222 >UniRef50_B9KF72 Type I restriction-modification system, S subunit n=2 Tax=Campylobacter RepID=B9KF72_CAMLR Length = 390 Score = 251 bits (641), Expect = 6e-65, Method: Composition-based stats. Identities = 81/420 (19%), Positives = 148/420 (35%), Gaps = 36/420 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W I+ + ++ Q P A+ G D D + L Sbjct: 2 KYWKISIIDNTCEILNNKRVPISQKDRI--SGIYPYYGAS----GIVDYIDKYIFDEEL- 54 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF-SGFIAHFT 126 ++I SA + +L+P I + F+ +F Sbjct: 55 ----------VLIGEDGAKWGAFENSAFIASG-KYWVNNHAHILKPNNEILINKFLVYFL 103 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 S I GA + + I I +PPL EQ+ I LD A +D + Sbjct: 104 NYSNLEKYI----TGATVKKLNQQKLKQIEILLPPLKEQERIVGILDESFANIDESIKIL 159 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-PQHSVFKKLNFESILTELRNGLSSKPNE 245 EQ L Q+ L N N++ PQ +K L + + +G PN Sbjct: 160 EQDLLNLDELMQSALQKTFNPLKDNAKENYQLPQDWEWKSLG---EICFITDGTHKTPNY 216 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVC 303 G P L + ++ G D +DI+++ E + R K + GD+L R +G Sbjct: 217 IETGIPFLSVKNISKGFFDLSDIKYISLEEHNKLIKRAKPEFGDILICRIG----TLGKA 272 Query: 304 
GLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ-KGISG 362 + ++ L + + +Y+ F +S + N ++ Sbjct: 273 IKISLEFEFSIFVS--LGLLKPKVKIISDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNL 330 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++ + LP +KEQ +I +++ +++ + + L +S+L KAF+G+L Sbjct: 331 NILEKCPIALPSLKEQEQIASYLDEFSLNIKDLKQNYQAQIKNLQELKKSLLDKAFKGKL 390 Score = 149 bits (378), Expect = 2e-34, Method: Composition-based stats. Identities = 48/213 (22%), Positives = 86/213 (40%), Gaps = 14/213 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 +LP+ W + + I T+K + +P + NI G FD +D+ + Sbjct: 187 ENYQLPQDWEWKSLGEIC-FITDGTHKTPN----YIETGIPFLSVKNISKGFFDLSDIKY 241 Query: 62 VPKNLVK---ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 + + K DI+I +GK+ L FE S G+L+P+ I Sbjct: 242 ISLEEHNKLIKRAKPEFGDILICRI----GTLGKAIKISLEFEFSIFVSLGLLKPKVKII 297 Query: 119 SGFIAHFTKSSLYRNKISSL--SAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 S ++ +F S I++ G + + + I +P L EQ+ IA LD Sbjct: 298 SDYLVYFLNSYFIEGWINNNKVGGGTHTAKLNLNILEKCPIALPSLKEQEQIASYLDEFS 357 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKL 209 + K ++ + L+ ++++L A GKL Sbjct: 358 LNIKDLKQNYQAQIKNLQELKKSLLDKAFKGKL 390 >UniRef50_D2EQS4 Putative type I restriction-modification system, S subunit n=1 Tax=Streptococcus sp. M143 RepID=D2EQS4_9STRE Length = 384 Score = 250 bits (640), Expect = 6e-65, Method: Composition-based stats. 
Identities = 75/415 (18%), Positives = 157/415 (37%), Gaps = 40/415 (9%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE- 69 + V ++ G +K ++ + +IR N+Q G + +D + P Sbjct: 2 KKVKLGEVCEILNGFAFKS----LLYVNEGIRIIRITNVQKGYIEDSDPKYYPIEYTNSI 57 Query: 70 -SQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRP-EKLIFSGFIAHFT 126 + D++++++ VG+ + LR + LI ++ F Sbjct: 58 EKYILKENDLLMSLT----GNVGRVGLISKTMLPAALNQRVACLRTIDSLISKEYVFQFL 113 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 S L+ S G N+ + I P + +Q++I L+ + + K + Sbjct: 114 NSDLFEQSAIRSSNGVAQKNLSTDWLKKVEITYPSVEQQELITSTLNLIERLICCRKEQN 173 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESI-LTELRNGLSSKPNE 245 +++ +++K + G V ++ +++ + I + +L G + + Sbjct: 174 KKLNELVKSRFNEMFGDPVFNEM------------RWRRCKLKDISIEKLAYGSGASAID 221 Query: 246 SGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G +RI+ + G++ + + E ++ L GD+LF R + VG Sbjct: 222 F-SGLRYIRITDIDECGNLKLD--KKSPSHYDE--KYLLNTGDILFARSGAT---VGKTF 273 Query: 305 LLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGK 363 L K ++ LY LIR P ++ F ++ N + V+ T Q I+ K Sbjct: 274 LYSKEKYGPALYAGYLIRLIPNLSLVNPVFVYHFTNT-KFYNDFIAKVQNTVAQPNINAK 332 Query: 364 DIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +LPP+ Q E V A D + + +L + L +S++ + F Sbjct: 333 QYSELDFILPPLSLQNEFADFV----AQVDKSQLAIQKSLEELETLKKSLMQEYF 383 Score = 101 bits (253), Expect = 5e-20, Method: Composition-based stats. 
Identities = 36/197 (18%), Positives = 69/197 (35%), Gaps = 14/197 (7%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVFVPKNLVK 68 W + ++ I + Y + L IR +I G + Sbjct: 198 WRRCKLKDIS--IEKLAYGSGASAIDFS--GLRYIRITDIDECGNLKLDKK---SPSHYD 250 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF--ECSFGAFCGVLRPE-KLIFSGFIAHF 125 E ++ DI+ A S + VGK+ + + L P L+ F+ HF Sbjct: 251 EKYLLNTGDILFARSGAT---VGKTFLYSKEKYGPALYAGYLIRLIPNLSLVNPVFVYHF 307 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 T + Y + I+ + NI + ++ +PPL+ Q A+ + + + + Sbjct: 308 TNTKFYNDFIAKVQNTVAQPNINAKQYSELDFILPPLSLQNEFADFVAQVDKSQLAIQKS 367 Query: 186 FEQIPQILKRFRQAVLG 202 E++ + K Q G Sbjct: 368 LEELETLKKSLMQEYFG 384 >UniRef50_A3SCN8 Restriction endonuclease S subunit-like protein n=1 Tax=Sulfitobacter sp. EE-36 RepID=A3SCN8_9RHOB Length = 497 Score = 250 bits (639), Expect = 9e-65, Method: Composition-based stats. Identities = 92/433 (21%), Positives = 177/433 (40%), Gaps = 33/433 (7%) Query: 49 IQNGKFDTTDLVFVPKNLVKESQKIS-PEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF 107 ++ + T + + ++ ++I SG A + +F Sbjct: 1 MKADRIGDTKDYVTDLGIENSTTRVVAENSLLIVTRSGILRHSLPVALANKD--VAFNQD 58 Query: 108 CGVLRPEKLIFSGFIAHFTKSSLYRNKISSLSA-GANINNIKPASFDLINIPIPPLAEQK 166 L I ++ + K+ + + + + G + ++ + I P EQ+ Sbjct: 59 IKALTLFSGIDPEYVLYHLKADA-DDILDACAKAGTTVESLDFNRLKSYPLRIAPSLEQR 117 Query: 167 IIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF---------- 216 I EKLD L + D +IP+++ +++ L A G+LT +R Sbjct: 118 RIVEKLDILTGRTDRAHDELSRIPELVAKYKSCFLRLAFTGQLTSDFRGEHSRKGTGVEN 177 Query: 217 EPQHSVFKKLNFESILTEL-RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE 275 P K L S + + G + V P LR+++V+ G +D +I+ + + Sbjct: 178 IPDSWAVKPLGEISEIQGGVQVGKKRSSSTDLVEVPYLRVANVQRGWLDLEEIKTIGVTP 237 Query: 276 SELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYI 334 E R L+ GD+L G + +G G + Q + ++ + + R RL +LP E++ Sbjct: 238 QEKERLLLRMGDILMNE-GGDRDKLGR-GWVWNNQIADCIHQNHVFRIRLKDSSLPPEFV 295 Query: 335 EIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADT 
394 + ++ + ++ T+ IS + + + V +PP E EIV R++ FA+ + Sbjct: 296 SHY-ANEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVNRIDAAFAWLER 354 Query: 395 IEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAAS 454 I + A + L +IL+KAFRGEL Q + P A+ +L ++ E A+ Sbjct: 355 ISSEQAAASKLLPELDAAILSKAFRGELARQNPDDEP--------ASRILARVSVEGQAA 406 Query: 455 GGKK-----ASRK 462 +K RK Sbjct: 407 PTRKSPHNTRKRK 419 Score = 120 bits (301), Expect = 2e-25, Method: Composition-based stats. Identities = 39/244 (15%), Positives = 98/244 (40%), Gaps = 5/244 (2%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +P+ W + P+ ++ + GV K+++ + + +P +R N+Q G D ++ + Sbjct: 178 IPDSWAVKPLGEISEIQGGVQVGKKRSSSTDLVE-VPYLRVANVQRGWLDLEEIKTIGVT 236 Query: 66 LVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKL-IFSGFI 122 ++ + + DI++ G + +G+ + +C +R + + F+ Sbjct: 237 PQEKERLLLRMGDILMNE-GGDRDKLGRGWVWNNQIADCIHQNHVFRIRLKDSSLPPEFV 295 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +H+ + + + N+ +I + +P+PP E I ++D A ++ Sbjct: 296 SHYANEMGQQYFVDQGTQTTNLASISKRKLAALPVPVPPSDEAVEIVNRIDAAFAWLERI 355 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 + ++L A+L A G+L + + EP + +++ E R + Sbjct: 356 SSEQAAASKLLPELDAAILSKAFRGELARQNPDDEPASRILARVSVEGQAAPTRKSPHNT 415 Query: 243 PNES 246 Sbjct: 416 RKRK 419 >UniRef50_B0K6N9 Restriction modification system DNA specificity domain n=3 Tax=Thermoanaerobacter RepID=B0K6N9_THEPX Length = 463 Score = 250 bits (639), Expect = 9e-65, Method: Composition-based stats. 
Identities = 70/437 (16%), Positives = 146/437 (33%), Gaps = 42/437 (9%) Query: 6 LPEGWVIAPVSTVT----TLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 +P W + + + Y ++ K +P I N Sbjct: 19 IPNHWESHKIRELFVERSEKVSDKDYSP---LSVSKAGVVPQIATVAKTNNG-------- 67 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + D VI S + G S ++ S VL+P + + Sbjct: 68 ------DNRKLVIKGDFVINSRSDRRGSSGIS-----NYDGSVSLINIVLKPRSFVNGRY 116 Query: 122 IAHFTKSSLYRNKISSLSAGAN--INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 + + KS + + G + + I +P+P + EQ I LD LA++ Sbjct: 117 MHYLLKSHYFIEEFYRNGRGIVADLWTTRYTEMKSIYLPVPSIEEQDQIVRFLDWKLAKI 176 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKLNFES 230 + ++ +L +R+A + + + W P H KL Sbjct: 177 NKLIQAKKKQIALLTEYRKATIDNVIMYGINPHANRKESGVIWLGEIPSHWSVMKLKRIC 236 Query: 231 ILTELRNGLSSKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLL 289 + K + L + ++ G +D + R L+ + + D++ Sbjct: 237 RINASITSQLEKYSL-EDYVVFLPMENISSDGKIDCCEKRKLKDVRNGFSSFA--KNDVI 293 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 + E G L L+ +LI R + LP Y+ + R N Sbjct: 294 VAKITPCFEN-GKGACLDTLETNIGFGTTELIVLRANEKVLPRYLYMITQLQQFRIEGAN 352 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + ++GQK + I + + +P + EQ+EI+ ++ A D + + +N + + Sbjct: 353 VMTGSAGQKRVPSSFISNFELGIPSIAEQSEILEYLDNRLAKFDKLYETLNREIELLTEY 412 Query: 410 TQSILAKAFRGELTAQW 426 +++ G++ + Sbjct: 413 RIRLISDVVTGKVDVRD 429 Score = 117 bits (294), Expect = 8e-25, Method: Composition-based stats. 
Identities = 43/221 (19%), Positives = 90/221 (40%), Gaps = 8/221 (3%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFV 62 G++P W + + + + +T Q Y +DY+ + NI +GK D + + Sbjct: 221 GEIPSHWSVMKLKRICRINASIT---SQLEKYSLEDYVVFLPMENISSDGKIDCCEKRKL 277 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLIFSGF 121 K++ + D+++A + + L FG VLR + + + Sbjct: 278 -KDVRNGFSSFAKNDVIVAKITPCFENGKGACLDTLETNIGFGTTELIVLRANEKVLPRY 336 Query: 122 IAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + T+ +R + +++ G+ + + + IP +AEQ I E LD LA+ D Sbjct: 337 LYMITQLQQFRIEGANVMTGSAGQKRVPSSFISNFELGIPSIAEQSEILEYLDNRLAKFD 396 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHS 221 + ++L +R ++ V GK+ + P++ Sbjct: 397 KLYETLNREIELLTEYRIRLISDVVTGKVDVRDI-EIPEYE 436 Score = 98.3 bits (244), Expect = 6e-19, Method: Composition-based stats. Identities = 37/247 (14%), Positives = 73/247 (29%), Gaps = 38/247 (15%) Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLE 272 W N P H K+ + S + L +S +AG V I + Sbjct: 15 WLNSIPNHWESHKIREL--------FVERSEKVSDKDYSPLSVS--KAGVVP--QIATVA 62 Query: 273 CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPE 332 + + NR + GD + + G+ + N I + Sbjct: 63 KTNNGDNRKLVIKGDFVINSRSDRRGSSGISNYDGSVSLIN-------IVLKPRSFVNGR 115 Query: 333 YIEIFFSSPSARNAMMNCVKTTSG-QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 Y+ S + ++KS + +P ++EQ +IVR ++ A Sbjct: 116 YMHYLLKSHYFIEEFYRNGRGIVADLWTTRYTEMKSIYLPVPSIEEQDQIVRFLDWKLAK 175 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL-------- 443 + + + +A + ++ + NP E+ L Sbjct: 176 INKLIQAKKKQIALLTEYRKATIDNVIM-------YGINPHANRKESGVIWLGEIPSHWS 228 Query: 444 ---LEKI 447 L++I Sbjct: 229 VMKLKRI 235 >UniRef50_Q8YTM8 Type I restriction-modification enzyme S subunit n=1 Tax=Nostoc sp. PCC 7120 RepID=Q8YTM8_ANASP Length = 427 Score = 250 bits (639), Expect = 1e-64, Method: Composition-based stats. 
Identities = 68/430 (15%), Positives = 152/430 (35%), Gaps = 34/430 (7%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G +P W I + I T+ + + +P + A +++ G D + + Sbjct: 15 EFGIVPNDWKIRKLVECCNKITDGTHDTPKPLAQ----GIPFLTAIHVKEGFIDFNNCYY 70 Query: 62 VPKNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 +P+++ + K D+++ V +A + +E S + + + Sbjct: 71 LPQSIHESIYKRCNPEKNDVLMVNIGAG---VATTALIDVEYEFSLKNVALLKPDKNNLI 127 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPP-LAEQKIIAEKLDTLLA 177 ++ + + +R + L +G + I+IPIPP + EQ+ IA+ L + A Sbjct: 128 GSYLNYCLSLNKFR-ITNQLLSGGAQPFLSLKQIGEISIPIPPTIEEQEAIAQSLSDVDA 186 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + + + Q +L G ++ F + ++ + + Sbjct: 187 LITECDRIIAKKHNTKQGTMQQLLTG------EKRLPGFSGE-WEVEEFEQVLKVVDGDR 239 Query: 238 GLSSKPNE--SGVGH-PILRISSVRAGHVDQNDIRFLECSESEL-NRHKLQDGDLLFTRY 293 G + N+ G+ L +V G +D F+ + L KL D++ T Sbjct: 240 GDNYPSNDELFDNGYCLFLSAKNVTKGGFKFSDCTFITKEKDNLLGNGKLCKKDVVLT-- 297 Query: 294 NGSLEFVGVCGLLKKLQH-QNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCV 351 + VG +N+ ++ R K+ Y+ F S + + + Sbjct: 298 --TRGTVGNIAFFDYSVPFENIRINSGMVILRSEDKNLDNSYLYSFLKSHLFQTQI-DRA 354 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 S Q ++ K I + + + EQ I + + + DT + + + Q Sbjct: 355 VFGSAQPQLTVKGISKFKIPVSSLPEQKAIAQILSDM----DTEIAALEQKRDKYKAIKQ 410 Query: 412 SILAKAFRGE 421 ++ + G+ Sbjct: 411 GMMQELLTGK 420 Score = 127 bits (320), Expect = 8e-28, Method: Composition-based stats. 
Identities = 39/214 (18%), Positives = 77/214 (35%), Gaps = 16/214 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 + F + +K ++ +G P G P L V+ G +D N+ +L Sbjct: 12 QKTEFGIVPNDWKIRKLVECCNKITDGTHDTPKPLAQGIPFLTAIHVKEGFIDFNNCYYL 71 Query: 272 ECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKD 328 S E R + D+L V L+ +L + + + Sbjct: 72 PQSIHESIYKRCNPEKNDVLMVNIGAG---VATTALIDVEYEFSL---KNVALLKPDKNN 125 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPP-VKEQAEIVRRVEQ 387 + Y+ S R + N + + Q +S K I + +PP ++EQ I + + Sbjct: 126 LIGSYLNYCLSLNKFR--ITNQLLSGGAQPFLSLKQIGEISIPIPPTIEEQEAIAQSLSD 183 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGE 421 + D + + + +A+ +N Q + + GE Sbjct: 184 V----DALITECDRIIAKKHNTKQGTMQQLLTGE 213 >UniRef50_B2K7C3 Restriction modification system DNA specificity domain protein n=2 Tax=Yersinia pseudotuberculosis RepID=B2K7C3_YERPB Length = 409 Score = 250 bits (638), Expect = 1e-64, Method: Composition-based stats. Identities = 75/426 (17%), Positives = 149/426 (34%), Gaps = 28/426 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKN 65 +PEGW ++ + G +K + N +D + L+R +NI+ + D F P Sbjct: 2 VPEGWKLSTFGNHVDCLTGFAFKSKSYSNNPED--IRLLRGDNIEPSRLRWRDAKFWPAQ 59 Query: 66 LVKE--SQKISPEDIVIAMSSGSKSVVGKSA-HQHLPFECSFGAFCGVLRPEKLIFSGFI 122 ++ ++ D VIAM S K A QH C +R + + Sbjct: 60 EYEKLEKFQLRKGDFVIAMDRTWVSSGLKVAEVQHTDIPCLLVQRVARIRARSTLEQSLL 119 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + + + + S+ + +I P +PP+ EQK IA L T + +T Sbjct: 120 RQYFSDNKFEQYVKSVQTATAVPHISPNDIKDFTFLLPPINEQKKIARILSTWDKAIATT 179 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 + K Q +L G +++ F + + S L + S K Sbjct: 180 EQLLANSQLQKKALMQQLLTG------KKRFPGFSEEWTEV----HLSDLCFINPSRSEK 229 Query: 243 PNESGVGHPILRISSVRAGH--VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 P + + V + D + + S+ + +D D+L + E Sbjct: 230 PE--NGVVSFISMDGVSEDAKLIKTEDRYYSDVSKGFTS---FKDDDVLVAKITPCFEN- 283 Query: 301 
GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 G + L + + R + +YI R ++ ++GQK + Sbjct: 284 GKGAYVINLTNGIGFGSTEFHVLRAKEGVNAKYIYYLTVMTEFRVRGEMNMQGSAGQKRV 343 Query: 361 SGKDIKSQVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 + +KS + +P EQ +I + +D + L + ++++ + Sbjct: 344 TTDYLKSLKLTVPISFTEQNKIA----TVLTVSDQEIATLKQKLNHLKQEKKALMQQLLT 399 Query: 420 GELTAQ 425 G+ + Sbjct: 400 GKRRVK 405 >UniRef50_B9ZS45 Restriction modification system DNA specificity domain protein n=1 Tax=Thioalkalivibrio sp. K90mix RepID=B9ZS45_9GAMM Length = 419 Score = 249 bits (637), Expect = 1e-64, Method: Composition-based stats. Identities = 77/427 (18%), Positives = 156/427 (36%), Gaps = 35/427 (8%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKD--DYLPLIRANNIQNGKFDT--TDLVFVP 63 EGW A +S + + G T + + + ++ + ++ + + ++ Sbjct: 3 EGWKTAKLSELCDIQLGKTPARANSSYWDQERSTGNVWLSIADLLKSEANNVSDSKEYLS 62 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 K + + ++++ K +G+ A + + E++I ++ Sbjct: 63 DKGAKLCKIVKKGTLLVS----FKLTLGRVAFAGKDLYTNEAIAALTIHDEQIINRDYLF 118 Query: 124 HFTKSSLYRNKISS--LSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +F + G + A I + +PPL EQK I LD A +D+ Sbjct: 119 YFLHFFDWVKAAQDDVKLKGMT---LNKAKLKEILVVVPPLPEQKRIVAILDEAFASIDT 175 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 A E+ + ++ L V+ + + E+ +G Sbjct: 176 AVANTEKNLANARELFESYLNAVVDTAFRKS-----------TVTVLSDLAEEITDGDHM 224 Query: 242 KPNESGVGHPILRISSV--RAGHVDQNDIRFLECSESE--LNRHKLQDGDLLFTRYNGSL 297 P ++ G P + I ++ R VD + + S E + + GD+L+T Sbjct: 225 PPPKAPSGVPFITIKNIDKRTRKVDFENTFRVPRSYFEGLKPNKRPRKGDVLYTVTGS-- 282 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 G+ ++ Q + + R ++ SP + + Q Sbjct: 283 --FGIPVVV--GQKTEFCFQRHIGLIRPKSGTDSSWLYYLLMSPQIFAQATDGAT-GTAQ 337 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 K +S K ++S V P+ +Q + V++++ L A + +E L + L QS+L KA Sbjct: 338 KTVSLKVLRSFRVPTIPLDQQVDNVQQLDNLLADVEGLESIYRQQLRNLGELKQSLLQKA 
397 Query: 418 FRGELTA 424 F GELTA Sbjct: 398 FSGELTA 404 Score = 113 bits (283), Expect = 2e-23, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 69/203 (33%), Gaps = 17/203 (8%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTDLVFVPKNLVKE--- 69 +S + I + +P I NI K D + VP++ + Sbjct: 211 LSDLAEEITDGDHMPPPKAP----SGVPFITIKNIDKRTRKVDFENTFRVPRSYFEGLKP 266 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 +++ D++ ++ G E F G++RP+ S ++ + S Sbjct: 267 NKRPRKGDVLYTVT----GSFGIPVVVGQKTEFCFQRHIGLIRPKSGTDSSWLYYLLMSP 322 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST----KAR 185 + + + G + +P PL +Q ++LD LLA V+ + + Sbjct: 323 QIFAQATDGATGTAQKTVSLKVLRSFRVPTIPLDQQVDNVQQLDNLLADVEGLESIYRQQ 382 Query: 186 FEQIPQILKRFRQAVLGGAVNGK 208 + ++ + Q G + Sbjct: 383 LRNLGELKQSLLQKAFSGELTAG 405 >UniRef50_B0A8Q7 Putative uncharacterized protein n=1 Tax=Clostridium bartlettii DSM 16795 RepID=B0A8Q7_9CLOT Length = 380 Score = 249 bits (637), Expect = 2e-64, Method: Composition-based stats. 
Identities = 63/412 (15%), Positives = 151/412 (36%), Gaps = 34/412 (8%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKES 70 + + + + G T K++ Y D +P I+ ++++ + + S Sbjct: 2 ELKKLGDIFKITSGGTPSKKK-EEYYLDGDIPWIKTGDLKSKNIYKSSQYITELGVKNSS 60 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSL 130 K+ P+D V+ G + +G ++ L E + C P K + ++ +F K + Sbjct: 61 AKLFPKDTVLIAMYG--ATIGATSI--LKIEAATNQACAAFLPTKDVMPEYLYYFFKYN- 115 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 + KI S G NI IP+ L EQ+ I L+ + K + + Sbjct: 116 -KEKIISKGIGGAQPNISATILKDFKIPLLCLDEQEKIVNILNKAQNTTNKRKEQINLLD 174 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 +++K + G + + K+++ + + ++ K N Sbjct: 175 ELVKSRFIEMFGDPIRNI----------KCWQTKRMDEVAPV------INYKGNFKQNEI 218 Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 +L + V + ++ SE + ++L+++ L V + + Sbjct: 219 WLLNLDMVESNTGKIIAYNYVTASEVGSSTCTFDTTNVLYSKLRPYLNKVVI------PK 272 Query: 311 HQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 + + + Y+ + + + V + ++ D + V Sbjct: 273 EIGYATSEMMPLQPVKGILDRYYLAYMLRNKVFVDYISEKVS-GAKMPRVTMNDFRDFKV 331 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +PP++ Q + V ++ D ++ ++ +L + + S++ +AF+GEL Sbjct: 332 PIPPIELQNQFANFVIEV----DKLKFEMEKSLKELEDNFNSLMQRAFKGEL 379 >UniRef50_A0YWS0 Putative uncharacterized protein n=1 Tax=Lyngbya sp. PCC 8106 RepID=A0YWS0_9CYAN Length = 433 Score = 249 bits (636), Expect = 2e-64, Method: Composition-based stats. 
Identities = 67/438 (15%), Positives = 167/438 (38%), Gaps = 39/438 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G +PE W + + + LI G + ++ +P ++ + G + + Sbjct: 19 GNIPEHWELRKLKFIADLIMGQSPDSTDY--NYEEIGVPFLQGTA-EFGIINPN--PRLS 73 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG-FI 122 K+ + +D+++++ + VG+ G +RP+ +F+ F Sbjct: 74 CESAKKYAR--KDDLLLSVR----APVGE--INVADQVYGIGRGLCAIRPKINVFNKTFT 125 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 +F + + ++ S + G+ + + + PPL EQK+IA LD ++D+ Sbjct: 126 RYFL--EIGKVELVSGATGSIYDAVTVNQVANLQCLTPPLKEQKLIATFLDRETTRIDTL 183 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + ++ +L++ R A++ AV L +W P++ KKL + + + Sbjct: 184 ITKKCELINLLEKKRTAIITNAVTKGLEPELPMKDSGVEWLGKVPRNWEVKKLKYIAQIV 243 Query: 234 ELRNGLSSK--PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFT 291 + + P +P ++ + A + + + G L+ T Sbjct: 244 RGKFTHRPRNDPRFYDGNYPFIQTGDISAANKYITSYQQTLNELGLSVSKEFPKGTLVMT 303 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCV 351 +G +L +PD ++ L +++ + + ++ M+ Sbjct: 304 I----AANIGDLAILD----FPACFPDSIVGFLPRNYCL-DFLYYNLT--AMKSEMVKTA 352 Query: 352 KTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQ 411 + Q ++ + I + PP+ Q +I ++++ D + + +++ + Q Sbjct: 353 TLNT-QMNLNIERIGGLFSICPPIAIQKQIATYLDKVNIRIDELIDKTATSISELTKYRQ 411 Query: 412 SILAKAFRGELTAQWRAE 429 S++ A G++ + E Sbjct: 412 SLITAAVTGKIDVREEVE 429 Score = 114 bits (287), Expect = 6e-24, Method: Composition-based stats. 
Identities = 38/233 (16%), Positives = 82/233 (35%), Gaps = 25/233 (10%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H +KL F + L ++ S+ N +G P L+ ++ G ++ N Sbjct: 16 EWLGNIPEHWELRKLKFIADLIMGQSPDSTDYNYEEIGVPFLQGTA-EFGIINPN----- 69 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 E + + DLL + VG + ++ L R + Sbjct: 70 PRLSCESAKKYARKDDLLLSVRAP----VGEINVADQV----YGIGRGLCAIRPKINVFN 121 Query: 332 E-YIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 + + F + +++ S ++ + + L PP+KEQ I +++ Sbjct: 122 KTFTRYFL--EIGKVELVSGAT-GSIYDAVTVNQVANLQCLTPPLKEQKLIATFLDRETT 178 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 DT+ + + + +I+ A + P+L ++ L Sbjct: 179 RIDTLITKKCELINLLEKKRTAIITNAVT-------KGLEPELPMKDSGVEWL 224 >UniRef50_B4RYU8 Type I site-specific deoxyribonuclease n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYU8_ALTMD Length = 360 Score = 249 bits (636), Expect = 2e-64, Method: Composition-based stats. 
Identities = 71/389 (18%), Positives = 163/389 (41%), Gaps = 39/389 (10%) Query: 42 PLIRANNIQNGKFDTTDLVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFE 101 I+ ++++N + + + + + P D++IA + +G E Sbjct: 5 RYIQIDDLRNDNL----IKYTDDD---KGTFVEPSDVIIAWDGANAGTIGY------GLE 51 Query: 102 CSFGAFCGVLR-PEKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP 160 G+ L+ I + ++ F +S +I + GA I ++ + + +P+P Sbjct: 52 GLIGSTLARLKVIIPHIDTNYLGRFLQSKF--KEIRNNCTGATIPHVSKVHLNSLLVPVP 109 Query: 161 PLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH 220 PL QK IA L+ D+ + + +Q+ Q L Q+V + Sbjct: 110 PLPIQKQIAAVLEKA----DNLRQQSQQMEQELNSLAQSVFLDMFGDYRKD--------- 156 Query: 221 SVFKKLNFESILTELRNGLSSKPNESG---VGHPILRISSVRAGHVDQNDIRFLECSESE 277 + + ++R+G++ G P +R+++V+ G++D ++I+ + + Sbjct: 157 -AMSLKSSLGEVADVRSGVTKGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKD 215 Query: 278 LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIF 337 +++L+ GD+L T G + +G + Q N ++ + + R RL + E+ + Sbjct: 216 FEKYQLKAGDVLMTE-GGDFDKLGRGAIW-SGQIANCIHQNHVFRVRLCDRYISEFFAYY 273 Query: 338 FSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEK 397 +P + + C K T+ I+ +K + + +Q +R +++L A +++ Sbjct: 274 LQTPFVKQYFLKCAKKTTNLASINITQLKGLPIPDESIGKQQSFLRIIDELKA----LKE 329 Query: 398 QVNNALARVNNLTQSILAKAFRGELTAQW 426 + N S++ +AF+GEL + Sbjct: 330 ANFEQQEQANAHFNSLMQRAFKGELDLKD 358 Score = 110 bits (275), Expect = 1e-22, Method: Composition-based stats. 
Identities = 37/205 (18%), Positives = 75/205 (36%), Gaps = 16/205 (7%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV-FVPKNLVKESQKI 73 + V + GVT K Q + K +P +R N+Q+G D +++ K E ++ Sbjct: 164 LGEVADVRSGVT--KGQKLEGHKLTTVPYMRVANVQDGYLDLSEIKDITVKAKDFEKYQL 221 Query: 74 SPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAHFTKSSLYR 132 D+++ G +G+ A C +R S F A++ ++ + Sbjct: 222 KAGDVLMTE-GGDFDKLGRGAIWSGQIANCIHQNHVFRVRLCDRYISEFFAYYLQTPFVK 280 Query: 133 NKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE---Q 188 + + +I + IP + +Q+ + L +D KA E + Sbjct: 281 QYFLKCAKKTTNLASINITQLKGLPIPDESIGKQQ-------SFLRIIDELKALKEANFE 333 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKW 213 + +++ A G+L K Sbjct: 334 QQEQANAHFNSLMQRAFKGELDLKD 358 Score = 79.8 bits (196), Expect = 2e-13, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 60/172 (34%), Gaps = 24/172 (13%) Query: 248 VGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 + ++I +R ++ I++ + ++ D++ + +G Sbjct: 2 TNNRYIQIDDLRNDNL----IKYTDDD----KGTFVEPSDVIIAWDGANAGTIGY----- 48 Query: 308 KLQHQNLLYPDKLIRARL-TKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 L L R ++ Y+ F S + N + +S + Sbjct: 49 ---GLEGLIGSTLARLKVIIPHIDTNYLGRFLQSKF--KEIRNNCT-GATIPHVSKVHLN 102 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 S +V +PP+ Q +I +E AD + +Q +N+L QS+ F Sbjct: 103 SLLVPVPPLPIQKQIAAVLE----KADNLRQQSQQMEQELNSLAQSVFLDMF 150 >UniRef50_Q6LTT0 Hypothetical type I restriction-modification system specificity determinant n=1 Tax=Photobacterium profundum RepID=Q6LTT0_PHOPR Length = 437 Score = 248 bits (635), Expect = 3e-64, Method: Composition-based stats. 
Identities = 75/442 (16%), Positives = 151/442 (34%), Gaps = 46/442 (10%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +PE W I + + + + L+ ++I V Sbjct: 21 IGNIPEHWNITKAKYLFNEVDERSVTGHEE----------LLSVSHIT--GVTPRSEKNV 68 Query: 63 P---KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE--KLI 117 S+ +DIV +G S GV R + Sbjct: 69 SMFMAEDYSGSKTCQADDIVFNTMWAWMGALGVS-----ERSGIVSPSYGVFRQKFTNTF 123 Query: 118 FSGFIAHFTKSSLYRNKISSLSAG--ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 + ++ + K+ Y + +S G ++ F + + P + EQ I + LD Sbjct: 124 NAKYLEYLLKTPKYIEHYNKVSTGLHSSRLRFYGHMFFDMKMGYPHIDEQNGIIKFLDNK 183 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEK---------WRNFEPQHSVFKKL 226 ++D A E+ +LK +Q ++ AV L W P H + + Sbjct: 184 TNKIDEAAAIKEKQISLLKERKQIIIQQAVTRGLNPDVPMRDSGVDWIGEIPDHWCSEPI 243 Query: 227 NFESILTELRNGLSSKPNESGV-GHPILRISSVRAGHVDQNDIRFLECS--ESELNRHKL 283 + L + + ++R S+V+ G + D ++ + +R Sbjct: 244 KY--SLKGIIDCEHKTAPFVDKKEFFVVRTSNVKQGKLVIEDAKYTNEYGYKEWTSRGVP 301 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPS 342 GD+L TR G L+ + L +++ ++ + LPE+ S Sbjct: 302 FPGDILLTREAP----AGEACLVP--DDRKLCLGQRMVWLKVDRTRLLPEFALSLIYSSV 355 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 R + + + S + DIK+ V+LPP+ EQA +V +++ D + Sbjct: 356 VRTYI-DFLSAGSTVLHFNMADIKNIPVILPPINEQAILVTHIKKHSDKIDKAIELEQQQ 414 Query: 403 LARVNNLTQSILAKAFRGELTA 424 ++++ ++ A G++ Sbjct: 415 ISKLKEYKSILINSAVTGKIKV 436 Score = 118 bits (297), Expect = 5e-25, Method: Composition-based stats. 
Identities = 33/234 (14%), Positives = 72/234 (30%), Gaps = 23/234 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H K + + R+ + +L +S + + Sbjct: 19 EWIGNIPEHWNITKAKYLFNEVDERSVTGHEE--------LLSVSHITGVTPRSEKNVSM 70 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDK-LIRARLTKDAL 330 +E Q D++F + +GV + ++ P + R + T Sbjct: 71 FMAEDYSGSKTCQADDIVFNTMWAWMGALGVS------ERSGIVSPSYGVFRQKFTNTFN 124 Query: 331 PEYIEIFFSSPSARNAMMNCVKT-TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +Y+E +P S + G + P + EQ I++ ++ Sbjct: 125 AKYLEYLLKTPKYIEHYNKVSTGLHSSRLRFYGHMFFDMKMGYPHIDEQNGIIKFLDNKT 184 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D ++ + Q I+ +A R NPD+ ++ + Sbjct: 185 NKIDEAAAIKEKQISLLKERKQIIIQQAVT-------RGLNPDVPMRDSGVDWI 231 >UniRef50_Q8TN78 Type I restriction modification enzyme protein S n=1 Tax=Methanosarcina acetivorans RepID=Q8TN78_METAC Length = 391 Score = 248 bits (634), Expect = 3e-64, Method: Composition-based stats. Identities = 79/418 (18%), Positives = 165/418 (39%), Gaps = 35/418 (8%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKE 69 W P+ ++ T+I G T K + Y +P + + + + +E Sbjct: 4 WPHQPIISLGTIITGSTPKTSEEHFY--GGDIPFVTPAELDQTDPIMNAARTLSETGSQE 61 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSS 129 S+ + V+ GS VG + S V+ K+I+ F + + Sbjct: 62 SRLLPEG-TVMVCCIGSLGKVGIAGRTV----ASNQQINSVIFDPKIIWPRFGFYACR-- 114 Query: 130 LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQI 189 L ++++ L+ + + + F + IP+PPL EQK IA+ LD A + E + Sbjct: 115 LLKSRLEVLAPATTVPIVNKSKFGQLEIPVPPLPEQKRIADILDRAEALRAKRRVALEHL 174 Query: 190 PQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG----LSSKPNE 245 ++ + + G +V+ + +K+ + + ++ G L K + Sbjct: 175 DELTQAIFIDMFGDSVSNPM------------GWKRYPLKHCVNHIQIGPFGSLLHKEDY 222 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGSLEFVGVCG 304 G P++ + + G + + + + + +EL ++LQ GD++ R +G C Sbjct: 223 VFGGIPLINPTHIENGKIVPDVNQSITVQKLAELQLYQLQQGDVIMGRRGE----MGRCA 
278 Query: 305 LLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKD 364 ++ + L L A+ Y++ SS S R + + ++ Sbjct: 279 IVGSEHNGTLCGTGSLFIRPDESKAIAMYLQATLSSESMRKHLEGF-SLGATLPNLNRGI 337 Query: 365 IKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + LPP++ Q E +E + ++ ++L ++ L S+ +AFRGEL Sbjct: 338 VGELAISLPPIELQKEFSHHIES----IEKLKTTYKSSLTEIDELFLSLQYRAFRGEL 391 Score = 107 bits (268), Expect = 1e-21, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 75/207 (36%), Gaps = 12/207 (5%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DLVFVPKN 65 P GW P+ I+ + +PLI +I+NGK + + Sbjct: 193 PMGWKRYPLKHCVNHIQIGPFGSLLHKEDYVFGGIPLINPTHIENGKIVPDVNQSITVQK 252 Query: 66 LVK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPEK-LIFSGFI 122 L + + ++ D+++ G + +G+ A + G +RP++ + ++ Sbjct: 253 LAELQLYQLQQGDVIM----GRRGEMGRCAIVGSEHNGTLCGTGSLFIRPDESKAIAMYL 308 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 S R + S GA + N+ + I +PP+ QK + ++ Sbjct: 309 QATLSSESMRKHLEGFSLGATLPNLNRGIVGELAISLPPIELQKE----FSHHIESIEKL 364 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKL 209 K ++ + ++ A G+L Sbjct: 365 KTTYKSSLTEIDELFLSLQYRAFRGEL 391 Score = 95.6 bits (237), Expect = 4e-18, Method: Composition-based stats. 
Identities = 30/200 (15%), Positives = 69/200 (34%), Gaps = 17/200 (8%) Query: 220 HSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN 279 + + + +S+ + G P + + + N R L + S+ + Sbjct: 3 PWPHQPIISLGTIITGSTPKTSEEHFYGGDIPFVTPAELDQTDPIMNAARTLSETGSQES 62 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL-PEYIEIFF 338 R L +G ++ SL VG+ G + + ++ + P + Sbjct: 63 R-LLPEGTVMVCCIG-SLGKVGIAG-------RTVASNQQINSVIFDPKIIWPRFGFYAC 113 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 +R + + + ++ + +PP+ EQ I ++ A+ + + Sbjct: 114 RLLKSR---LEVLAPATTVPIVNKSKFGQLEIPVPPLPEQKRIADILD----RAEALRAK 166 Query: 399 VNNALARVNNLTQSILAKAF 418 AL ++ LTQ+I F Sbjct: 167 RRVALEHLDELTQAIFIDMF 186 >UniRef50_B3PQK6 Probable type I restriction-modification system protein, specificity subunit n=1 Tax=Rhizobium etli CIAT 652 RepID=B3PQK6_RHIE6 Length = 424 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. Identities = 81/424 (19%), Positives = 171/424 (40%), Gaps = 32/424 (7%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANN---IQNGKFDTTDLVFVP 63 PEGW + + + L G T + + D +P + ++ I+ T + Sbjct: 24 PEGWALERLCDIARLESGHTPSRNR--PDYWDGGIPWLSLHDSKTIEGKVLQNTKMTISA 81 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + L S ++ PE V + + +GK A L E + + + ++A Sbjct: 82 RGLANSSARLLPEGTVALSRTAT---IGKVAL--LGREMATSQDFACYICGPRLLNKYLA 136 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 H + + L AG+ N I +F+ + I +PP+ EQ+ IA+ L A ++ + Sbjct: 137 HLFRGMEL--EWERLMAGSTHNTIYMPTFENMQILVPPMEEQEAIADALSDADALIEGLE 194 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + I + Q +L + P +S + +NGL+ Sbjct: 195 RLIAKKWLIKQGTMQDLLTA----------KRRLPGYSAEWTMAKLGDFLSFKNGLNKAK 244 Query: 244 NESGVGHPILRISSV-RAGHVDQNDIR-FLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 G G PI+ V R G +++ I +E +E+E + + +++GD+LFTR + + E +G Sbjct: 245 AFFGHGTPIINYMDVFRGGAINEGSIDGLVEVTEAEQSAYGIRNGDVLFTRTSETPEEIG 304 Query: 302 
VCGLLKKLQHQNLLYPDKLIRARLTKDALP-EYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 + + + ++ ++R R AL + + F S + R +++ + + Sbjct: 305 LAAVADGVLDGT-VFSGFVLRGRPKSQALTIAFSKYCFRSGAVRRQIISRATY-TTRALT 362 Query: 361 SGKDIKSQVVLLP-PVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +G+ + + + +P EQ I + + A +E + L + + + ++ Sbjct: 363 NGRQLSAVDISVPRDADEQNAIAEVLNDMDAEIQALETR----LDKARQVKEGMMQNLLT 418 Query: 420 GELT 423 G + Sbjct: 419 GRIR 422 Score = 115 bits (288), Expect = 5e-24, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 70/211 (33%), Gaps = 18/211 (8%) Query: 214 RNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS---VRAGHVDQNDIRF 270 + EP+ ++L + L ++P+ G P L + + + + Sbjct: 20 PDVEPEGWALERLCDIARLESGHTPSRNRPDYWDGGIPWLSLHDSKTIEGKVLQNTKMTI 79 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + + L +G + +R +G LL + + + + R L Sbjct: 80 SARGLANSSARLLPEGTVALSRT----ATIGKVALLGREMATSQDFACYICGPR----LL 131 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 +Y+ F + S I ++ +L+PP++EQ I + Sbjct: 132 NKYLAHLFR---GMELEWERLMAGSTHNTIYMPTFENMQILVPPMEEQEAIADALSD--- 185 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGE 421 AD + + + +A+ + Q + + Sbjct: 186 -ADALIEGLERLIAKKWLIKQGTMQDLLTAK 215 >UniRef50_C2CSZ9 Type I restriction modification DNA specificity protein n=1 Tax=Corynebacterium striatum ATCC 6940 RepID=C2CSZ9_CORST Length = 371 Score = 248 bits (633), Expect = 4e-64, Method: Composition-based stats. 
Identities = 93/415 (22%), Positives = 153/415 (36%), Gaps = 52/415 (12%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W + + V L G KKE+ + P+ + Sbjct: 8 DWPMVRLGDVCHLKYGKALKKEERVA----GEFPVFGSAG--------------SVGSHV 49 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 E+ + P G K G AF + E+ + S ++ K Sbjct: 50 EANFVGP-----VSVVGRKGSAGFVEWSSGNCWIIDTAFGVFPKSEEQVDSRWLYWLLKD 104 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 + L A + I A +PPL EQ+ IA LD + + Sbjct: 105 LR----LGRLQKHAAVPGISKADVVEEKFLLPPLDEQRRIAAILDEVDEALFRVNQSLGD 160 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGV 248 + Q+ + + + L + G S K NE V Sbjct: 161 LLQLKQELFTDLFLRI-----------------ERESTIIGEYLESTQYGTSDKANE-NV 202 Query: 249 GHPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G PILR+ +V G +D +D++++E S+ ++ L+ GDLLF R N S + VG ++ Sbjct: 203 GIPILRMGNVSYNGEIDLSDLKYVELDASDREKYSLKAGDLLFNRTN-SKDLVGKTAVVP 261 Query: 308 KLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKS 367 +LQ + Y LIR R+ A+PEYI F +S + + N K G I+ ++K Sbjct: 262 ELQEE-YTYAGYLIRCRVNDKAVPEYISGFLNSVLGKKILRNTAKAIVGMANINANELKR 320 Query: 368 QVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + EQ E L + D +E Q+ + L +S+ +AF+ EL Sbjct: 321 LPIPQASLDEQQEFAS----LTSRIDDVESQMKRQRKLLQELQESLSTRAFQEEL 371 >UniRef50_B8E4I3 Restriction modification system DNA specificity domain protein n=1 Tax=Shewanella baltica OS223 RepID=B8E4I3_SHEB2 Length = 642 Score = 247 bits (632), Expect = 5e-64, Method: Composition-based stats. 
Identities = 110/465 (23%), Positives = 194/465 (41%), Gaps = 30/465 (6%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFVP 63 KLPEGWV + + + Q D P IR +NI +GK + V Sbjct: 2 KLPEGWVETTIGNIID---DMQPGFSQKPGKEDGDTTPQIRTHNISPDGKLTLEGIKHVT 58 Query: 64 KNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGF 121 + + ++ D+V ++ S+ VGK+A E F LR I F Sbjct: 59 ASNKESERYSLTKGDVVFNNTN-SEEWVGKTAVFDQEGEFVFSNHITRLRANSKLITPDF 117 Query: 122 IAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 +A + + + + + I+ ++ L IP+P L EQ+ I + L + V Sbjct: 118 LAAYLQFLWSMGFSKTRAKRWVSQAGIEGSTLALFRIPLPSLPEQERIVDVLQQV-GIVA 176 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS 240 K + L V E + + + + I+ + + G+S Sbjct: 177 KAKQSIDDHIDNL-----------VRTAYWEHFSEWYTADGLRDPVRISDIVADSQYGVS 225 Query: 241 SKPNESGVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 +E+G ILR++S+ +G ++ D+++ SE ++ L +GDLLF R N S E Sbjct: 226 EAMSETGKQ-AILRMNSITTSGWLNLADLKYATLSEKDIKATTLLNGDLLFNRTN-SKEL 283 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG 359 VG C + + + Y ++R R+ + LPEYI +S + +MN K Sbjct: 284 VGKCAIWRGAKE-PFSYASYIVRFRMKEGILPEYIWATLNSSYGKYRLMNSAKQAVSMAN 342 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 +S D+ V LPP+ Q + +L + +T+ +++ N + + L + +A Sbjct: 343 VSPTDLGRITVPLPPLALQEKFA----KLINHIETLRQEMLNKQDQYSELQTLVTQQALL 398 Query: 420 GELTAQWRAENPDLISGENSAAALLEKIKAERAA--SGGKKASRK 462 GE TAQWR EN + + A +L + + + + KK +K Sbjct: 399 GEHTAQWRDENREKVLEAAKARDILLREQGVKITKFALEKKHPKK 443 >UniRef50_B4VXC6 Type I restriction modification DNA specificity domain protein n=1 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VXC6_9CYAN Length = 506 Score = 247 bits (631), Expect = 7e-64, Method: Composition-based stats. 
Identities = 110/467 (23%), Positives = 193/467 (41%), Gaps = 61/467 (13%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P W+ + + G + P+ + NG D Sbjct: 5 PLSWIGVTLGDLLRFNYGKSLP----ERARSGAGFPVYGS----NGIVGYHDEPLTD--- 53 Query: 67 VKESQKISPEDIVIAMSSGSKSVVGKS--AHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 + +I GS V S A + F G + + + + Sbjct: 54 ---------GETLIIGRKGSVGEVHFSPGACFPIDTTYYVDQFHG-------MPTRYWFY 97 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 K+ +S L I ++ + I + PL EQK IA+KLD LLA+VD+ + Sbjct: 98 QLKNLG----LSELDKATAIPSLNRKDAYRVQIHLSPLNEQKRIADKLDALLARVDACRD 153 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQH------SVFKKLNFESILTELRNG 238 R ++ I+++ RQA+L ++GK+T+ W ++ ++ KL+ + + + N Sbjct: 154 RLIRVSFIIQQLRQAILTDGISGKITQYWSKNNAENLAYNHQNIVGKLSDFADVIDP-NP 212 Query: 239 LSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNR--HKLQDGDLLFTRYNG 295 P+ G PIL + D + + ++ E + H + D++F R Sbjct: 213 SHRYPSYKGGTIPILATEQMSGLNDWDTSSAKLIKYDFYEARKAAHDFLNDDIIFARK-- 270 Query: 296 SLEFVGVCGLLKKL-QHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKT 353 G GL + Q+ ++ + R+ + LP Y+ F + +++ + + Sbjct: 271 -----GRLGLARNPPQNIRYVFSHTVFIIRVKADNILPSYLLWFLRQEFCIDWLLSEMNS 325 Query: 354 TSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSI 413 +G + ++ + +P EQ EIV+ +E+L+AYAD IE + NAL RV LT ++ Sbjct: 326 NAGVPTLGKSVMERLPITIPDYAEQQEIVQCIEKLYAYADRIEARYQNALTRVEQLTPTL 385 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASG-GKKA 459 L+KAFRGEL Q + P + LLE+I+AERAA K Sbjct: 386 LSKAFRGELVPQDPDDEP--------VSVLLERIRAERAAQPNKPKR 424 >UniRef50_Q0RKJ6 Type I restriction modification enzyme protein S n=1 Tax=Frankia alni ACN14a RepID=Q0RKJ6_FRAAA Length = 399 Score = 246 bits (630), Expect = 1e-63, Method: Composition-based stats. 
Identities = 86/420 (20%), Positives = 160/420 (38%), Gaps = 37/420 (8%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNI---QNGKFDTTDLVFVPKNLVKE 69 P+ +I G T K A +P ++ + +T L Sbjct: 7 TPLGEFCEIISGATPKT--ASEEYWGGEIPWATPRDLGSLNSKFLASTSRAITEAGLRSC 64 Query: 70 SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKS 128 + + P V+ S V +A + L P+ G++ H+ + Sbjct: 65 ATHVLPAGSVLLTSRAPIGSVAINAR-----PMATNQGFKSLVPDTSRALPGYLYHWLRC 119 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 R+++ SL GA + ++ I +P+PPL+EQK I + LD Q D+ +AR + Sbjct: 120 Q--RSRLQSLGNGATFKELSKSATARIAVPLPPLSEQKRIEQMLD----QADTIRARRRE 173 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNE--- 245 L+ Q++ + + ++++ ++ + +G S + Sbjct: 174 TIARLEELAQSIFSVMFGNPVQNER--------GWRRVPLSELVVRIDSGRSPVCLDRPA 225 Query: 246 SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 +L++ +V + + + L + + +++ GDLLF+R N + E V C L Sbjct: 226 RPGEWGVLKLGAVTSCVYRAGENKALPPDVAAFSACEVRPGDLLFSRKN-TRELVAACAL 284 Query: 306 LKKLQHQNLLYPDKLIRA--RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG-QKGISG 362 + LL PD + R P Y+ + P R + +S IS Sbjct: 285 VDAT-PARLLLPDLIFRLVVEPRSAVDPVYLHRLLTHPEKRRKVQGLASGSSASMPNISK 343 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + LPP++ Q E RV L + I+ +L + L S+ +AFRGEL Sbjct: 344 SRLLGLEIELPPMEVQKEFANRVRAL----ERIKVAHQASLVEQDELVASLAHRAFRGEL 399 >UniRef50_Q1K3D0 Restriction modification system DNA specificity domain n=7 Tax=Bacteria RepID=Q1K3D0_DESAC Length = 417 Score = 246 bits (629), Expect = 1e-63, Method: Composition-based stats. 
Identities = 82/416 (19%), Positives = 157/416 (37%), Gaps = 23/416 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 GW P+ + T IR A Y + +++NN++NGK + +F+ + Sbjct: 19 NGWTENPLGEIYTKIRNAFVGT--ATPYYTKNGYFYLQSNNVKNGKINRKTEIFIDEEFY 76 Query: 68 --KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVL-RPEKLIFSGFIAH 124 +E + DIV+ VG +A S ++ +P K ++ Sbjct: 77 FKQEKNWLRTNDIVMV----QSGHVGHTAVIPNELNNSAAHALIIISKPLKKSCPYYLNF 132 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + ++ + I +++ G I +I N+ PP EQ I L ++ + Sbjct: 133 YFQTYRAKQDIGNITTGNTIKHILATDIKRFNVFFPPYEEQTKIGTYFKKLDRIIELHQR 192 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + +++ + + Q + + F ++K + GL++ Sbjct: 193 KHDKLVTLKQAMLQKMFPQ---DGASTPEIRFNGFEGDWEKKKLRDVCNSFDYGLNAAAK 249 Query: 245 ESGVGHPILRISSVRAGH--VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 + + +RI+ + Q D+ E + L +GD+LF R S VG Sbjct: 250 KYDGRNKYIRITDIDEFSRCFSQTDLTSPEADLPSSQNYLLCEGDILFARTGAS---VGK 306 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 L +++ + + LIRAR++ ++I S + N + SGQ GI+ Sbjct: 307 TYLYREI-DGRVFFAGFLIRARVSNTESTDFIFYTTLSSNYEN-FVTITSQRSGQPGINA 364 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 K+ L+P V EQ +I F D + Q L ++ + + L K F Sbjct: 365 KEYSEYTFLVPSVTEQKKIGTY----FRKFDALISQHATQLKKLKQIKSACLGKMF 416 >UniRef50_Q1GLF5 Type I restriction-modification system; S subunit n=1 Tax=Ruegeria sp. TM1040 RepID=Q1GLF5_SILST Length = 387 Score = 246 bits (629), Expect = 1e-63, Method: Composition-based stats. 
Identities = 77/416 (18%), Positives = 162/416 (38%), Gaps = 38/416 (9%) Query: 13 APVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQK 72 + + + G T K+ + D +P + ++ +T + + + Sbjct: 4 VALGELVEIRGGGTPDKK--VPDYWDGDIPWASVKDFKSTSLASTIDRITQAGVANSATQ 61 Query: 73 ISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKSSLY 131 + P +I++ ++ VGK+A + + L P + I ++ H ++ Sbjct: 62 VIPAGNIIV----PTRMAVGKAAINEIDL--AINQDLKALIPSQRIDRQYLLHALLANA- 114 Query: 132 RNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQ 191 + + GA + IK + + IP+PPL EQ+ IA LD A +++ Sbjct: 115 -KTLEDQATGATVKGIKLDALRSLQIPLPPLQEQRRIAGILDQADALRRFRTRALDKLGT 173 Query: 192 ILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESIL---TELRNGLSSKPNESGV 248 + + + G P H+ ++K+N ++ + G+ Sbjct: 174 LGQAIFHEMFGA------------SSPDHAAWEKINLSELVLPDDRINYGVVQPGPHDPE 221 Query: 249 GHPILRISSVRAGHVDQNDIRFL-ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLK 307 G PI+R++ + + V + I+ + ++E R +L+ G++L G + +G ++ Sbjct: 222 GVPIIRVADLASPVVAFDSIKRIAPSIDAEYGRSRLKGGEVLI----GCVGSIG-TTIIA 276 Query: 308 KLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + + R L P ++ S +N V+ Q ++ K I+ Sbjct: 277 PPEFAGANVARAVARVPLDTSRCEPRFVAEQLRSQRIQNYFTKEVRL-VAQPTLNIKQIR 335 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 ++LPP + Q V RV + + + Q AL + L S+ + AFRGE+ Sbjct: 336 ETEIILPPKELQVSFVERVHE----IEAQKAQHAAALTACDVLFASLQSTAFRGEV 387 >UniRef50_A5UR98 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A5UR98_ROSS1 Length = 392 Score = 246 bits (628), Expect = 1e-63, Method: Composition-based stats. 
Identities = 91/430 (21%), Positives = 173/430 (40%), Gaps = 47/430 (10%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M +LP+GW + T+ T+ G ++Q K +P+ A NG D Sbjct: 2 MERWELPKGWGWKRLKTLVTVNYGKGLSEKQR----KAGNVPVYGA----NGVVGFHDTS 53 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 ++ GS V S P + +F + ++++ Sbjct: 54 IT------------KGQTIVIGRKGSAGAVNWSEIACWPIDTTF----FIDEFPEILYPQ 97 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-------PLAEQKIIAEKLD 173 F+ F +S +I L A I + + +PIP LAEQ+ I +L+ Sbjct: 98 FLYQFLRS----QQIDRLQQSAAIPGLNRDVLYSVEVPIPYPDDPAHSLAEQRRIVARLE 153 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILT 233 LL + + + + + + L + ++ L P+ +K ++ L Sbjct: 154 LLLGETRAMREDIQAMRRDLAQVMESALAEVF-----PNPNGEMPKGWGWKSIDDLFELQ 208 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRY 293 + + +S + + P LR ++ G VD +D+ ++ +E E+ R KL+ GDLL Sbjct: 209 QGAS-MSPRRRQGRNPQPFLRTKNILWGEVDTSDVDVMDFTEDEIERLKLRKGDLLICEG 267 Query: 294 NGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR-LTKDALPEYIEIFFSSPSARNAMMNCVK 352 VG + + Q ++Y + + R R + DA P++ + + + + Sbjct: 268 G----DVGRAAVW-EDQLPLVMYQNHIHRLRRKSDDADPKFYVYWMKAAYQLFKIYQGEE 322 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + + +SG+ +K+ +V + EQ IV +E + ++ + L + L QS Sbjct: 323 SRTAIPNLSGRRLKNFLVPTTSLTEQRRIVAYLEHIAEEIRAMDDLLAQDLRDIEVLEQS 382 Query: 413 ILAKAFRGEL 422 ILA AFRGE+ Sbjct: 383 ILAAAFRGEV 392 >UniRef50_Q8PSD7 Type I restriction-modification system specificity subunit n=1 Tax=Methanosarcina mazei RepID=Q8PSD7_METMA Length = 398 Score = 246 bits (628), Expect = 2e-63, Method: Composition-based stats. 
Identities = 67/429 (15%), Positives = 139/429 (32%), Gaps = 38/429 (8%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN--GKFDTTD 58 M KLPEGW + + + K + + + + I+ G D T Sbjct: 1 MEN-KLPEGWEWKKLGEIAEIN-----PKFDKKSVSESTEVTFLPMKCIEELTGNVD-TS 53 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQH--LPFECSFGAFCGVLRPEKL 116 + + + K + D++ A + GK+A V+R +K Sbjct: 54 ITKSLEEVSKGYTPLIENDLIYAKITPCMEN-GKAAIATGLKNNLGFASTEFHVIRFKKN 112 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTL 175 ++ F + R + G+ + + +P+PPL Q+ I L+ Sbjct: 113 AYNKFFFFYLIQKRIREHAAMNMTGSAGQKRVPATFLKNLLVPLPPLETQQKIVSILEKA 172 Query: 176 LAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTEL 235 + T+ Q ++ ++ Q+V + + KL T Sbjct: 173 ----EETRKLRAQADELTQKLLQSVFLEMFGDPV------KNSREWKLHKLGEIGNWTSG 222 Query: 236 RNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNG 295 S P P + +V + + + + + + G +L Y+ Sbjct: 223 GTPSRSMPEYFHGEIPWFTAGELNDSYVYGSKEKITKEALNSSSAKLFPAGTMLIGMYDT 282 Query: 296 SLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTT 354 + +G+ + + F +++ ++ + Sbjct: 283 AAFKMGI-------LKNPASSNQACAAFSPKVEVINTLFALYLFK--EMKDSFLSQ-RRG 332 Query: 355 SGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSIL 414 QK +S IK V +PP++ Q + V+ D I++ + NNL +++ Sbjct: 333 IRQKNLSQSIIKKFEVPVPPIELQKQFADMVQ----KIDQIKESQKQSSLETNNLFDALM 388 Query: 415 AKAFRGELT 423 KAF G+L Sbjct: 389 QKAFTGKLV 397 >UniRef50_D0S8M5 Type I restriction-modification system protein n=2 Tax=Acinetobacter RepID=D0S8M5_ACIJO Length = 412 Score = 246 bits (628), Expect = 2e-63, Method: Composition-based stats. 
Identities = 78/416 (18%), Positives = 154/416 (37%), Gaps = 24/416 (5%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W ++ VT + T+ + + I + NI+N D T++ ++ K+ + Sbjct: 14 DWSRYKIAEVTEYLVDGTHFSPK----TTEGEFKYITSKNIRNDGLDLTNISYISKDEHE 69 Query: 69 ESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKLIFSGFIAH 124 + K + DI++ + G L E S + VLR + + FI Sbjct: 70 KIYKRCKVQLGDILLTKDGANT---GNCCLNTLDEEFSLLSSVAVLRGKKDSFNNNFILQ 126 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 +S L ++ I S +G I I A + P L EQ I L + ++ Sbjct: 127 ILQSDLGQDTIISSMSGQAITRITLAKLKDYSFFFPELTEQTQITSFLSAVDEKISQLTQ 186 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + E + Q + Q + + K + E K+ + S + Sbjct: 187 KHELLSQYKQGMMQKLFSQQIRFKADD---GSEFGEWGKAKVGNITETIFGYPFDSKEMV 243 Query: 245 ESGVGHPILRISSVRAGHVDQNDI--RFLECSESELNRHKLQDGDLLFTRYNGSLEFVGV 302 E G P++R ++ H+ + RF S+L ++ ++ D++ + VG Sbjct: 244 EDTNGIPLMRGINIGECHIRHSFELDRFFLKDTSKLEKYFVRVNDIVLSMDGS---KVGR 300 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 + L ++ R + +Y+ + S + + VKT+SG ISG Sbjct: 301 NSAFVTEKDAGSLLVQRVCILREKANTNIQYVYQWIISKEFHRYV-DQVKTSSGIPHISG 359 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 K I+ + P ++EQ +I + + D + V + + + +L + F Sbjct: 360 KQIQDYEISYPCLEEQTKIANFL----SAIDQKIEVVAQQIEQAKTWKKGLLQQMF 411 Score = 130 bits (328), Expect = 1e-28, Method: Composition-based stats. 
Identities = 35/233 (15%), Positives = 83/233 (35%), Gaps = 13/233 (5%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 F+ + + + L +G P + + ++R +D +I ++ Sbjct: 4 PKLRFKEFDGDWSRYKIAEVTEYLVDGTHFSPKTTEGEFKYITSKNIRNDGLDLTNISYI 63 Query: 272 ECSESE--LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 E E R K+Q GD+L T+ G C L + +LL ++R + Sbjct: 64 SKDEHEKIYKRCKVQLGDILLTKDGA---NTGNCCLNTLDEEFSLLSSVAVLRGK-KDSF 119 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +I S ++ +++ + I+ +K P + EQ +I + + Sbjct: 120 NNNFILQILQSDLGQDTIISSMS-GQAITRITLAKLKDYSFFFPELTEQTQITSFLSAVD 178 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 + ++ ++ Q ++ K F ++ +++A++ A Sbjct: 179 EKISQLTQKH----ELLSQYKQGMMQKLFSQQI--RFKADDGSEFGEWGKAKV 225 Score = 125 bits (315), Expect = 3e-27, Method: Composition-based stats. Identities = 36/203 (17%), Positives = 67/203 (33%), Gaps = 10/203 (4%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT---D 58 G+ W A V +T I G + ++ + + +PL+R NI + D Sbjct: 216 EFGE----WGKAKVGNITETIFGYPFDSKEMVEDT--NGIPLMRGINIGECHIRHSFELD 269 Query: 59 LVFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIF 118 F+ E + DIV++M + +LR + Sbjct: 270 RFFLKDTSKLEKYFVRVNDIVLSMDGSKVGR-NSAFVTEKDAGSLLVQRVCILREKANTN 328 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ + S + + + + I +I I P L EQ IA L + + Sbjct: 329 IQYVYQWIISKEFHRYVDQVKTSSGIPHISGKQIQDYEISYPCLEEQTKIANFLSAIDQK 388 Query: 179 VDSTKARFEQIPQILKRFRQAVL 201 ++ + EQ K Q + Sbjct: 389 IEVVAQQIEQAKTWKKGLLQQMF 411 >UniRef50_B4B315 Restriction modification system DNA specificity domain n=1 Tax=Cyanothece sp. PCC 7822 RepID=B4B315_9CHRO Length = 397 Score = 245 bits (625), Expect = 4e-63, Method: Composition-based stats. 
Identities = 95/417 (22%), Positives = 174/417 (41%), Gaps = 24/417 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W + P+ + LI+ T+ + +A K + + V + Sbjct: 3 QNWDLVPLGEI--LIKSNTWIQIEANKKYKQITVKY------WGKGVVERNEVIGTEIAA 54 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTK 127 + ++ +++ G L F I F+ +K Sbjct: 55 SQRLQVRSGQFIVSRIDARHGSFGLIP-DCLNGAIVTNDFPVFNLNINRILPHFLNWMSK 113 Query: 128 SSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 + + S G +K F + IP+P L EQ+ I K++ L+A+++ + Sbjct: 114 TPTFIELCKVASEGTTNRIRLKEDKFLSMKIPLPKLEEQQRIIAKIEELVAKIEEARGLK 173 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 E + + A + +++ + I+ + G S K ++ Sbjct: 174 EAGIRECEMLINAEIYNLFT----------ICKNTHWANKKLGDIVIDDCYGTSEKTHDY 223 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 VG PILR+ +++ G +D +++++L+ E ++ LQ GD+L R N S E VG C + Sbjct: 224 KVGIPILRMGNIQNGILDVSELKYLDIHEKNKDKLILQKGDILVNRTN-SAELVGKCAVF 282 Query: 307 KKLQHQNLLYPDKLIRARLTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 + +IR RL K A P I ++ +S R M N K +GQ I+ K + Sbjct: 283 NLKGEYG--FASYIIRLRLDKAQANPTLIAMYINSSLGRTYMFNERKQMTGQANINAKKL 340 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 K+ ++LPP+ EQ EIV ++ L D +++ +L +N L +IL KAF+GEL Sbjct: 341 KALPIILPPLSEQQEIVTYLDNLQTQIDEMKRLRQESLKELNALLPAILDKAFKGEL 397 >UniRef50_Q6MH62 Type I restriction-modification system, S subunit n=1 Tax=Bdellovibrio bacteriovorus RepID=Q6MH62_BDEBA Length = 417 Score = 244 bits (624), Expect = 5e-63, Method: Composition-based stats. 
Identities = 67/423 (15%), Positives = 160/423 (37%), Gaps = 15/423 (3%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P W + + +TY Q + + + IR + GK + L + K++ Sbjct: 5 PADWDKHILDELLEDNFNITYGVVQPGDEAPN-GVKFIRGGDFPKGKIEENKLRTISKDI 63 Query: 67 VKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHF 125 + ++ + ++ G V + ++R + ++ +F Sbjct: 64 SESYKRTVLNGGELLVALVGYPGTVAVVPRSLRG--ANIARQTALIRLAPKYLNTYVKYF 121 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 +S + +I S G+ I L+ + P + EQK IAE L ++ ++ T+ Sbjct: 122 LESDFGQGEILRGSLGSAQQVINLKDLKLVQVYTPKIDEQKKIAEFLTSVDKVIELTEIE 181 Query: 186 FEQIPQILKRFRQAVLGGAVNGKLT-EKWRNFEPQHSVFKKL-NFESILTELRNGLSSKP 243 E++ + K Q +L + T E P+ + L + ++ G+ Sbjct: 182 IEKLQNLKKGMMQDLLSKGIGHSTTIESAVGPVPKSWSIEVLSDLVLKGRKITYGIVQPG 241 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECS-ESELNRHKLQDGDLLFTRYNGSLEFVGV 302 + G ++R +G + ++ + E + R +L GD++ VG Sbjct: 242 SYDERGVLLVRGQDYISGWAEAGEVFKVSVEIEKKFERARLNVGDVVICIAGAG---VGA 298 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 ++ + + + K L +Y+ + + + +K S Q G++ Sbjct: 299 VNVVPMRFNGANITQTTARVSCDEKKILGKYLYYYLQEGTGLKQIQKYIK-GSAQPGLNL 357 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 D++ ++ +PP+ EQ+ IV+ ++ + + + LA+ +L ++++ G + Sbjct: 358 NDVEKFLIKVPPLAEQSSIVKALDSVELKVENTKV----LLAKYQSLKKALMQDLLTGRV 413 Query: 423 TAQ 425 + Sbjct: 414 RVK 416 Score = 122 bits (307), Expect = 3e-26, Method: Composition-based stats. 
Identities = 42/206 (20%), Positives = 83/206 (40%), Gaps = 4/206 (1%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G +P+ W I +S + R +TY Q + + + L+R + +G + ++ V Sbjct: 211 VGPVPKSWSIEVLSDLVLKGRKITYGIVQPGS-YDERGVLLVRGQDYISGWAEAGEVFKV 269 Query: 63 PKNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + K+ +++ D+VI ++ V V EK I Sbjct: 270 SVEIEKKFERARLNVGDVVICIAGAGVGAV-NVVPMRFNGANITQTTARVSCDEKKILGK 328 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ ++ + +I G+ + + I +PPLAEQ I + LD++ +V+ Sbjct: 329 YLYYYLQEGTGLKQIQKYIKGSAQPGLNLNDVEKFLIKVPPLAEQSSIVKALDSVELKVE 388 Query: 181 STKARFEQIPQILKRFRQAVLGGAVN 206 +TK + + K Q +L G V Sbjct: 389 NTKVLLAKYQSLKKALMQDLLTGRVR 414 >UniRef50_Q11QY3 Probable type I restriction-modification system n=1 Tax=Cytophaga hutchinsonii ATCC 33406 RepID=Q11QY3_CYTH3 Length = 432 Score = 244 bits (624), Expect = 5e-63, Method: Composition-based stats. Identities = 68/432 (15%), Positives = 158/432 (36%), Gaps = 31/432 (7%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P+ W + + + E+ ++ ++ + + R+ + +P Sbjct: 20 GEIPKHWECIRMKHLFRDYSEKNKQNEELLSVTQNQGV-VPRS--------WVESRMVMP 70 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 ++ + I D I++ S ++ + VL+ ++ I + + Sbjct: 71 SGALESFKFIQKGDFAISLRSFEGG------LEYCHHDGIISPAYTVLKTKRKIANQYYK 124 Query: 124 HFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 + KSS + +++ + G NI +PIP + EQ IA LD A++D Sbjct: 125 YLFKSSAFISELQTSIVGIREGKNISYPELSYSLLPIPKIDEQSCIATFLDDKTAKIDQA 184 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILT 233 + ++ ++LK RQ ++ AV L +W P+ KKL Sbjct: 185 ISIKQKQIELLKERRQILIHKAVTRGLNPKVKMKDSGVEWIGEVPEGWEVKKLLGLCNFI 244 Query: 234 ELRNGLSSKPNESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 + + + L+ V++ F+ + ++ + GD + Sbjct: 245 RGNSSFGKDDLLNDGEYVALQYGKTYKVNEVNEEYNYFVNNEFYKASQ-IVNYGDTIIIA 303 Query: 293 
YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 + ++E +G K+ L+ ++++ Y+ +F+S + Sbjct: 304 TSETIEELGHTAYYKR-NDLGLIGGEQILLNPNNDKINSHYL--YFTSRVFSKELRKYAT 360 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 + D+K+ + +PP+ EQ +IV +E A T N + ++ + Sbjct: 361 -GIKVFRFNINDLKTIYIAIPPLSEQQQIVEYIETTTAKIATAISLKENEIEKLKEYKAN 419 Query: 413 ILAKAFRGELTA 424 ++ A G++ Sbjct: 420 LVNSAVTGKIKV 431 Score = 146 bits (368), Expect = 3e-33, Method: Composition-based stats. Identities = 46/210 (21%), Positives = 87/210 (41%), Gaps = 6/210 (2%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQN-GKFDTTDLVF 61 G++PEGW + + + IRG + L D ++ + + F Sbjct: 225 IGEVPEGWEVKKLLGLCNFIRGN--SSFGKDDLLNDGEYVALQYGKTYKVNEVNEEYNYF 282 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSG 120 V K SQ ++ D +I +S + +G +A+ G +L P I S Sbjct: 283 VNNEFYKASQIVNYGDTIIIATSETIEELGHTAYYKRNDLGLIGGEQILLNPNNDKINSH 342 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 ++ S ++ ++ + G + I I IPPL+EQ+ I E ++T A++ Sbjct: 343 YLY--FTSRVFSKELRKYATGIKVFRFNINDLKTIYIAIPPLSEQQQIVEYIETTTAKIA 400 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + + E + LK ++ ++ AV GK+ Sbjct: 401 TAISLKENEIEKLKEYKANLVNSAVTGKIK 430 Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats. 
Identities = 33/234 (14%), Positives = 77/234 (32%), Gaps = 27/234 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHV--DQNDIR 269 +W P+H ++ L S K ++ +L ++ G V + R Sbjct: 17 EWLGEIPKHWECIRMKH------LFRDYSEKNKQNEE---LLSVTQ-NQGVVPRSWVESR 66 Query: 270 FLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + S + + +Q GD + + + H + + + + Sbjct: 67 MVMPSGALESFKFIQKGDFAISLRSFEGGL--------EYCHHDGIISPAYTVLKTKRKI 118 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLF 389 +Y + F S + + + + K IS ++ ++ +P + EQ+ I ++ Sbjct: 119 ANQYYKYLFKSSAFISELQTSIVGIREGKNISYPELSYSLLPIPKIDEQSCIATFLDDKT 178 Query: 390 AYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D + + Q ++ KA R NP + ++ + Sbjct: 179 AKIDQAISIKQKQIELLKERRQILIHKAVT-------RGLNPKVKMKDSGVEWI 225 >UniRef50_B8D1X6 Restriction modification system DNA specificity domain protein n=1 Tax=Halothermothrix orenii H 168 RepID=B8D1X6_HALOH Length = 422 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. 
Identities = 60/430 (13%), Positives = 147/430 (34%), Gaps = 31/430 (7%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNG--KFDTTDLVFVP 63 +P+ W ++ I+ K Y + ++ ++ T Sbjct: 18 IPKEWEFRNFGLISKYIKAGGTPKADKKEYY-GGEILFVKIEDMTKNGKYIYNTKSTITE 76 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 L S I P+ ++ GS V E + + P + + ++ Sbjct: 77 DGLKNSSAWIVPKKSLLLSMYGSYGKVSI-----NKVELATNQAILGIIPSEEVNLDYLY 131 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + SL N+ + + PPL EQK IA L T+ ++ T Sbjct: 132 Y-FSLGCLKPYFKSLVKATTQANLTKQIVNNTPVLSPPLPEQKKIAAILSTVDKAIEKTD 190 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWR---NFEPQHSVFKKLNFESILTELRNGLS 240 E+ ++ K Q +L + ++ R V+ + F + + Sbjct: 191 EIIEKSKELKKGLMQQLLTKGIGHSEFKEVRIGTKKIKIPVVWTLIKFGEVFKKRN---- 246 Query: 241 SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + + + + + G ++ + + ++ ++ + GD+L+ + L+ Sbjct: 247 -EKANVEKEYKYVGLEHLGTGEINL--LGYDRNGNNKSSKRLFKSGDILYGKLRPYLKKA 303 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 + + + +I TK ++ Y+ S + ++ ++ + Sbjct: 304 AIT-------DFDGICSTDIIPIYATKKSVNNYLIYLVHSKMFVDFAVSTME-GTNLPRT 355 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 S + IK+ ++ LPP++EQ +I + + ++ ++ L + ++ K G Sbjct: 356 SWRVIKNLIIPLPPLQEQKKIASILSSVDEKI----QKEQEYREKLEELKKGLMQKLLTG 411 Query: 421 ELTAQWRAEN 430 E+ + E Sbjct: 412 EVRVKVEDEE 421 Score = 118 bits (297), Expect = 4e-25, Method: Composition-based stats. 
Identities = 34/209 (16%), Positives = 73/209 (34%), Gaps = 17/209 (8%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPK 64 K+P W + V + + ++ G+ + L + Sbjct: 228 KIPVVWTLIKFGEVFKKRNEKA---------NVEKEYKYVGLEHLGTGEINL--LGYDRN 276 Query: 65 NLVKESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 K S+++ DI+ + F+ + K + ++ Sbjct: 277 GNNKSSKRLFKSGDILYGKLRPYLKKAAIT-----DFDGICSTDIIPIYATKKSVNNYLI 331 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + S ++ + S G N+ + IP+PPL EQK IA L ++ ++ + Sbjct: 332 YLVHSKMFVDFAVSTMEGTNLPRTSWRVIKNLIIPLPPLQEQKKIASILSSVDEKIQKEQ 391 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEK 212 E++ ++ K Q +L G V K+ ++ Sbjct: 392 EYREKLEELKKGLMQKLLTGEVRVKVEDE 420 >UniRef50_A4VG57 Type I restriction-modification system, S subunit n=1 Tax=Pseudomonas stutzeri A1501 RepID=A4VG57_PSEU5 Length = 421 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. Identities = 78/436 (17%), Positives = 154/436 (35%), Gaps = 50/436 (11%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++PE W I P + G YK DD P+I + Sbjct: 19 GRVPEHWTIGPYKATIQIENGSDYK-----EVEADDGYPVIGSG---------------- 57 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 S+ + + V+ G K + K + + F + +++P F Sbjct: 58 GPFAYSSKLMYDGESVLL---GRKGTIDKPLYVNGAFWAVDTMYWSIIKP--GAHGRFAY 112 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + + S + ++ + + P EQ+ IA LD A++D+ Sbjct: 113 YTATTIPF----DMYSTNTALPSMTKSVLGSHVVAFPGFEEQQAIAGHLDRETARIDALV 168 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESILTE 234 + + ++L+ RQA++ AV L +W P+H V K+ I Sbjct: 169 EKKIRFIELLREKRQALITHAVTKGLDPSVKMKDSGVEWLGAVPEHWVIKRFRDICISIS 228 Query: 235 LRNGLSS--KPNESGVGHPILRISSVRAGHVDQN-DIRFLECSESELNRHKLQDGDLLFT 291 ++ + G P++ S + + DI + L+ ++ GDL+ Sbjct: 229 TGPFGTALGNEDYITGGIPVINPSHIIDEQCSPDPDITVSTETALRLSFWAMRAGDLVTA 288 Query: 292 RYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNC 
350 R +G ++ Q + L R R AL EY+ S AR + N Sbjct: 289 RRGE----LGRAAIIFGEQDGWICGTGSL-RVRPNPSQALTEYLHTVLQSRYAREWL-NL 342 Query: 351 VKTTSGQKGISGKDIKSQVVLLPPV-KEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + ++ + S + LPP EQ +++ + IE++ ++A + Sbjct: 343 ASVGATMANLNEGILGSLPLALPPSTAEQEKLLSSLAAQSERLIKIEQKAALSVALLKEC 402 Query: 410 TQSILAKAFRGELTAQ 425 +++ A G++ + Sbjct: 403 RSALITAAVTGQIDLR 418 Score = 104 bits (259), Expect = 9e-21, Method: Composition-based stats. Identities = 27/232 (11%), Positives = 70/232 (30%), Gaps = 39/232 (16%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P+H + ++ NG K E+ G+P++ Sbjct: 16 EWLGRVPEHW---TIGPYKATIQIENGSDYKEVEADDGYPVIGSG--------------- 57 Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 + ++ +L R + + D + + + A Sbjct: 58 -GPFAYSSKLMYDGESVLLGRKGTIDK--------PLYVNGAFWAVDTMYWSIIKPGAHG 108 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 + ++ + T + ++ + S VV P +EQ I +++ A Sbjct: 109 RFAYYTATTIPF-----DMYSTNTALPSMTKSVLGSHVVAFPGFEEQQAIAGHLDRETAR 163 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + ++ + + Q+++ A + +P + ++ L Sbjct: 164 IDALVEKKIRFIELLREKRQALITHAVT-------KGLDPSVKMKDSGVEWL 208 >UniRef50_UPI000178969C restriction modification system DNA specificity domain protein n=1 Tax=Geobacillus sp. Y412MC10 RepID=UPI000178969C Length = 384 Score = 244 bits (623), Expect = 6e-63, Method: Composition-based stats. 
Identities = 62/412 (15%), Positives = 136/412 (33%), Gaps = 36/412 (8%) Query: 10 WVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD--TTDLVFVPKNLV 67 W + V +I G T K + D + I + + T K + Sbjct: 4 WEKVRLGDVCEVIGGSTPKTS--VKEYWDGEILWITPAELNDTTIIIRDTQRKITDKAIS 61 Query: 68 K-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFT 126 + +K+ ++++ + +GK A E L + +F+ ++ F Sbjct: 62 ELSLKKLPVGTVLLSSR----APIGKVAITGK--EMYCNQGFKNLVCSESVFNKYLFWFL 115 Query: 127 KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARF 186 K ++SL GA I + + I P+PPL QK IA LD + K + Sbjct: 116 KGKG--EFLNSLGRGATFKEISKSIVENIVFPLPPLEVQKQIAATLDAASELLTMRKQQL 173 Query: 187 EQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES 246 ++ +++K + G V + + + +L S + Sbjct: 174 SELDELIKSVFYEMFGDPVTNE----------KGWILSTFGNIGVLNSGGTPSRSNNSYF 223 Query: 247 GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLL 306 + ++ ++ + + + + + + G LL Y+ + +G+ Sbjct: 224 KGSINWFSAGELNQRYLLNSNEKITQLAIEQSSAKIFKAGSLLIGMYDTAAFKLGILAY- 282 Query: 307 KKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIK 366 + ++ + + IE + + QK ++ IK Sbjct: 283 ------DAASNQACANIQINEQLVN--IEWLYDCARIMRPHFLSNRRGVRQKNLNLGMIK 334 Query: 367 SQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + + LPP+ Q + V + + V A+ L S++++ F Sbjct: 335 NLEIPLPPLDLQIQFADIV----TKIEEQKTLVKQAIDETQQLFDSLMSQYF 382 Score = 104 bits (260), Expect = 9e-21, Method: Composition-based stats. 
Identities = 29/198 (14%), Positives = 65/198 (32%), Gaps = 11/198 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 +GW+++ + L G T + N + A + ++ + Sbjct: 196 KGWILSTFGNIGVLNSGGTPSRSN--NSYFKGSINWFSAGELNQRYLLNSNEKITQLAIE 253 Query: 68 KESQKISP-EDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL-IFSGFIAHF 125 + S KI ++I M + +G A+ + + C ++ + + ++ Sbjct: 254 QSSAKIFKAGSLLIGMYDTAAFKLGILAY-----DAASNQACANIQINEQLVNIEWLYDC 308 Query: 126 TKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKAR 185 + + R S G N+ + IP+PPL Q A+ + + Q K Sbjct: 309 --ARIMRPHFLSNRRGVRQKNLNLGMIKNLEIPLPPLDLQIQFADIVTKIEEQKTLVKQA 366 Query: 186 FEQIPQILKRFRQAVLGG 203 ++ Q+ Sbjct: 367 IDETQQLFDSLMSQYFDD 384 >UniRef50_C4ZFR7 Type I restriction-modification system specificity subunit n=3 Tax=Firmicutes RepID=C4ZFR7_EUBR3 Length = 412 Score = 243 bits (622), Expect = 8e-63, Method: Composition-based stats. Identities = 78/416 (18%), Positives = 153/416 (36%), Gaps = 22/416 (5%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + W ++ V I V + Y + +P+ R N+ + DL +V Sbjct: 13 KDWEQRKLNEVAEKIC-VGFVGTCEKFYTDESGIPMYRTGNLNGLSLNRDDLKYVTNEFH 71 Query: 68 KESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAH 124 + +QK + DI+IA S V +++ ++RP+ K F+ + Sbjct: 72 QHNQKSQLKAGDILIARHGDSGKAVN---YENSEEANCLN--IVIIRPDFKKCNYKFLTN 126 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIP-PLAEQKIIAEKLDTLLAQVDSTK 183 S + I SLSAG+ I + + + + IP + EQ IA TL + + Sbjct: 127 CINSPECQKHIKSLSAGSTQAVINTSEIEKLGVVIPANIDEQNRIARYFSTLDNLITLHQ 186 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 + EQ ++ K Q + ++ F K + G ++ P Sbjct: 187 RKCEQTKKLKKYMLQKMFPRNGAKVPEIRFDGFTYDWEQRKLGEIYGSIGNAFVG-TATP 245 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLECSESELNR-HKLQDGDLLFTRYNGSLEFVGV 302 + GH L ++V+ G ++ N F+ E + L GD++ + VG Sbjct: 246 YYAEHGHFYLESNNVKDGQINHNAEIFINDEFYEKQKDKWLHTGDMVMVQSG----HVGH 301 Query: 303 CGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISG 362 ++ + + + R ++ P 
++ + + A+ + N + T + K I Sbjct: 302 AAVIPEELDNTAAHALIMFR-NPKEEIEPYFLNYEYQTDKAKKQIEN-ITTGNTIKHILA 359 Query: 363 KDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 D++ VV +P +EQ I F D + + + + +L F Sbjct: 360 SDMQEFVVDIPKYEEQKVIASY----FCKLDHLITLHQRKCDELKKMKKYMLQNMF 411 Score = 124 bits (311), Expect = 1e-26, Method: Composition-based stats. Identities = 38/210 (18%), Positives = 81/210 (38%), Gaps = 15/210 (7%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +++ F K + G K G P+ R ++ ++++D++++ Sbjct: 7 RFKGFTKDWEQRKLNEVAEKICVGFVGTCEKFYTDESGIPMYRTGNLNGLSLNRDDLKYV 66 Query: 272 ECSESELN-RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDA 329 + N + +L+ GD+L R+ S G + + N L ++ R K Sbjct: 67 TNEFHQHNQKSQLKAGDILIARHGDS----GKAVNYENSEEANCL---NIVIIRPDFKKC 119 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLP-PVKEQAEIVRRVEQL 388 +++ +SP + + + S Q I+ +I+ V++P + EQ I R Sbjct: 120 NYKFLTNCINSPECQKHI-KSLSAGSTQAVINTSEIEKLGVVIPANIDEQNRIARY---- 174 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAF 418 F+ D + + L + +L K F Sbjct: 175 FSTLDNLITLHQRKCEQTKKLKKYMLQKMF 204 Score = 118 bits (297), Expect = 4e-25, Method: Composition-based stats. 
Identities = 35/208 (16%), Positives = 83/208 (39%), Gaps = 17/208 (8%) Query: 5 KLPE--------GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDT 56 K+PE W + + I G + A Y + + +NN+++G+ + Sbjct: 210 KVPEIRFDGFTYDWEQRKLGEIYGSI-GNAFVGT-ATPYYAEHGHFYLESNNVKDGQINH 267 Query: 57 TDLVFVPKNLVKESQ--KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-P 113 +F+ ++ + + D+V+ VG +A + + + R P Sbjct: 268 NAEIFINDEFYEKQKDKWLHTGDMVMV----QSGHVGHAAVIPEELDNTAAHALIMFRNP 323 Query: 114 EKLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLD 173 ++ I F+ + ++ + +I +++ G I +I + + IP EQK+IA Sbjct: 324 KEEIEPYFLNYEYQTDKAKKQIENITTGNTIKHILASDMQEFVVDIPKYEEQKVIASYFC 383 Query: 174 TLLAQVDSTKARFEQIPQILKRFRQAVL 201 L + + + +++ ++ K Q + Sbjct: 384 KLDHLITLHQRKCDELKKMKKYMLQNMF 411 >UniRef50_Q2JGK8 Type I restriction-modification system specificity determinant n=1 Tax=Frankia sp. CcI3 RepID=Q2JGK8_FRASC Length = 416 Score = 243 bits (622), Expect = 9e-63, Method: Composition-based stats. Identities = 72/427 (16%), Positives = 156/427 (36%), Gaps = 35/427 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+P W P+ ++ I+ V + +EQ ++ ++ + A + ++ T + + Sbjct: 17 GKVPPHWTTKPLWSMFERIKDVDHPEEQMLSVFREYGVV---AKDSRDNINKTAENRSI- 72 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 Q + P +V + VG S+ P + ++ Sbjct: 73 ------YQLVHPGWLVANRMKAWQGSVGISSL-----RGIVSGHYICFAPRHSEDARYLN 121 Query: 124 HFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +S+ Y N + LS G I F L+ I +PPL EQ+ IA+ LD A++D+ Sbjct: 122 WLLRSTTYTNGYALLSRGVRIGQAEIDNDEFRLMPILLPPLGEQRAIADYLDRETARIDT 181 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSS 241 +++ ++L+ R+AV A++ ++ KL + + Sbjct: 182 LIEEQQRLIEMLRERRRAVALHAIDQQIHAGATTD--------KLGRSTRIGNGSTPRRE 233 Query: 242 KPNES-GVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFV 300 + P L S+V V D + + E + + G +L Sbjct: 234 TASYWRDGEFPWLNSSAVNESRVTHADQFVTDIALYECHLPVVAPGSVLVGLTGQ----- 288 Query: 301 
GVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPSARNAMMNCVK-TTSGQK 358 G + L + + LPEY+ + + + + S + Sbjct: 289 GKTRGMATLLEIEATVNQHVAYIAPDRGTWLPEYLLWSLRASY--DDLRRLSEENGSTKG 346 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 G++ + +K + +PP+ EQ + +++ A D++ + + +++ A Sbjct: 347 GLTCQALKQYRLAVPPLDEQRRVAAYLDEQTAKIDSLIGETERFIELARERRVALITAAV 406 Query: 419 RGELTAQ 425 G++ + Sbjct: 407 TGQVDVR 413 >UniRef50_A7I739 Restriction modification system DNA specificity domain n=1 Tax=Candidatus Methanoregula boonei 6A8 RepID=A7I739_METB6 Length = 457 Score = 243 bits (621), Expect = 1e-62, Method: Composition-based stats. Identities = 106/452 (23%), Positives = 183/452 (40%), Gaps = 58/452 (12%) Query: 9 GWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK 68 W P+ + ++ G +K E + PLIR +I N K T+ + + Sbjct: 24 SWERVPLGKIAKVLNGFAFKSELFND---KKGTPLIRIRDIGNNK---TECYY--DGVFD 75 Query: 69 ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAHFTKS 128 E+ I P D+++ M + P C + + F+ + Sbjct: 76 EAYVIHPGDLLVGMDGDF-----NCSTWRGPKALLNQRVCKIEVNIEQYNRKFLEYVL-- 128 Query: 129 SLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQ 188 Y I+ ++ + ++ S I +P PPL EQ+ I +++ LL+ V++ + R + Sbjct: 129 PGYLKAINENTSSQTVKHLSSRSISEILLPNPPLTEQQRIVARVEALLSHVNAARERLSR 188 Query: 189 IPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSV-------------------------- 222 +P I+K+FRQAVL A +G LTE WR P Sbjct: 189 VPLIMKKFRQAVLAAACSGGLTEGWRKENPDIEEANKLVKRLESIRKQFKIREISSIDNL 248 Query: 223 --------FKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFLEC 273 + + + + + + P S G + + + +D + + Sbjct: 249 ELSDLPDSWTWIRL-ANIAIVMDPDHKMPKSSDGGIIFISPKDFKENYQIDMTKTKRISD 307 Query: 274 SES--ELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALP 331 E + + D+L++R L G + ++ Y +IR +L + Sbjct: 308 EEFLRLSKKFVPRPLDILYSRIGADL---GKARKAPQDIKFHISYSLAVIR-QLGEMENS 363 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 +Y+ +S RN V + G + +DI + ++ LPP+ EQ EIVRRV LF Sbjct: 364 
DYLFWLLNSMFIRNQAFENV-RSIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLFER 422 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 AD I+++V A R LTQ++L KAFRGELT Sbjct: 423 ADAIDREVEAATRRCERLTQAVLGKAFRGELT 454 Score = 126 bits (317), Expect = 2e-27, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 76/214 (35%), Gaps = 13/214 (6%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGK-FDTTDLV 60 LP+ W ++ + ++ + + D + I + + D T Sbjct: 249 ELSDLPDSWTWIRLANIAIVMD-----PDHKMPKSSDGGIIFISPKDFKENYQIDMTKTK 303 Query: 61 FVPKNLV---KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLR-PEKL 116 + + P DI+ + +GK+ + V+R ++ Sbjct: 304 RISDEEFLRLSKKFVPRPLDILYSRIGA---DLGKARKAPQDIKFHISYSLAVIRQLGEM 360 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 S ++ S RN+ + ++ D IP+PPLAEQ I ++ L Sbjct: 361 ENSDYLFWLLNSMFIRNQAFENVRSIGVPDLGLRDIDNFIIPLPPLAEQYEIVRRVGLLF 420 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT 210 + D+ E + +R QAVLG A G+LT Sbjct: 421 ERADAIDREVEAATRRCERLTQAVLGKAFRGELT 454 Score = 101 bits (253), Expect = 6e-20, Method: Composition-based stats. 
Identities = 44/232 (18%), Positives = 89/232 (38%), Gaps = 22/232 (9%) Query: 222 VFKKLNFESILTELRNGLSSKPNESGV--GHPILRISSVRAGHVDQNDIRFLECSESELN 279 ++++ + ++ NG + K G P++RI + + + E Sbjct: 24 SWERVPLGK-IAKVLNGFAFKSELFNDKKGTPLIRIRDIGNNK----TECYYDGVFDE-- 76 Query: 280 RHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFF 338 + + GDLL + L ++ + + + +++E Sbjct: 77 AYVIHPGDLLVGMDGD--------FNCSTWRGPKALLNQRVCKIEVNIEQYNRKFLEYVL 128 Query: 339 SSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ 398 +N ++ K +S + I ++ PP+ EQ IV RVE L ++ + ++ Sbjct: 129 ---PGYLKAINENTSSQTVKHLSSRSISEILLPNPPLTEQQRIVARVEALLSHVNAARER 185 Query: 399 VNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAE 450 ++ + Q++LA A G LT WR ENPD+ LE I+ + Sbjct: 186 LSRVPLIMKKFRQAVLAAACSGGLTEGWRKENPDIEEANKLVKR-LESIRKQ 236 >UniRef50_A6LY63 Restriction modification system DNA specificity domain n=2 Tax=Bacteria RepID=A6LY63_CLOB8 Length = 469 Score = 243 bits (621), Expect = 1e-62, Method: Composition-based stats. Identities = 78/452 (17%), Positives = 173/452 (38%), Gaps = 39/452 (8%) Query: 3 AGKLPEGWVIAPVSTV-----TTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT 57 GK+P W ++ + + + + G ++ +++++ + +I +N G Sbjct: 19 IGKIPRDWEVSKIKYIKSPDKNSFVDGPFGSNLKSEHFIENGEVYVIESNFATQGILKLD 78 Query: 58 DLVFVPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSF-GAFCGVLRPE 114 L + + ++ + DIVIA + + + G + + Sbjct: 79 SLKKISTEHFETIKRSEVKENDIVIAKIGAQFGLSNI--LPRIDKKAVVSGNSLKLSVDK 136 Query: 115 KLIFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDT 174 + + +I + + + + I + INI +P + Q I + L+ Sbjct: 137 QKSNTQYIHYQLLHIKNNGTLDLIVSTTAQPAISLGDMNNINIVLPNVQRQDKIVKFLNE 196 Query: 175 LLAQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLT----------------------EK 212 AQVDS ++ E + QIL+ +++++ AV GK+ K Sbjct: 197 KTAQVDSIISKKEALIQILEEAKKSLISDAVTGKVKVVKTSDGYELVERKKEEMKDSGVK 256 Query: 213 WRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRFL 271 W P+ K+L F L +NG+S +E G G+P + V + + + Sbjct: 257 WLGDVPKEWDVKRLRFLGNL---QNGISKSGDEFGFGYPFVSYGDVYKNISIPKFVNGLV 313 
Query: 272 ECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARL-TKDAL 330 S ++ + + +GD+ FTR + +++ +G + + LIR R Sbjct: 314 NSSLNDRRIYSVLEGDVFFTRTSETVDEIGFASTCLNT-ITDATFAGFLIRFRPFKDKLY 372 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 + + +F R + + + +S + + V LP KEQ EI +E Sbjct: 373 KGFSKYYFRCDLNRKFFVKEMNL-VTRASLSQNLLNNLAVALPLYKEQQEIYSALEFKVG 431 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + ++ + ++ Q+++++A G++ Sbjct: 432 GIECSINKLRCQIQKLKEAKQALISEAVTGKI 463 Score = 129 bits (326), Expect = 2e-28, Method: Composition-based stats. Identities = 34/246 (13%), Positives = 95/246 (38%), Gaps = 18/246 (7%) Query: 212 KWRNFEPQHSVFKKLNFESI-----LTELRNGLSSKPNES--GVGHPILRISSVRAGHVD 264 KW P+ K+ + + G + K ++ + G + Sbjct: 17 KWIGKIPRDWEVSKIKYIKSPDKNSFVDGPFGSNLKSEHFIENGEVYVIESNFATQGILK 76 Query: 265 QNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRA 323 + ++ + E + R ++++ D++ + G+ +L ++ + ++ + L + Sbjct: 77 LDSLKKISTEHFETIKRSEVKENDIVIAKIGAQF---GLSNILPRIDKKAVVSGNSLKLS 133 Query: 324 RLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 + + +YI N ++ + +T+ Q IS D+ + ++LP V+ Q +IV+ Sbjct: 134 VDKQKSNTQYIHYQLL-HIKNNGTLDLIVSTTAQPAISLGDMNNINIVLPNVQRQDKIVK 192 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAE------NPDLISGE 437 + + A D+I + + + +S+++ A G++ ++ + Sbjct: 193 FLNEKTAQVDSIISKKEALIQILEEAKKSLISDAVTGKVKVVKTSDGYELVERKKEEMKD 252 Query: 438 NSAAAL 443 + L Sbjct: 253 SGVKWL 258 >UniRef50_A5UFR1 Type I restriction modification DNA specificity domain protein n=1 Tax=Haemophilus influenzae PittGG RepID=A5UFR1_HAEIG Length = 424 Score = 243 bits (620), Expect = 1e-62, Method: Composition-based stats. 
Identities = 67/437 (15%), Positives = 147/437 (33%), Gaps = 46/437 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + P + L + V K+ + L + + + + F Sbjct: 16 GEVPSHWNLIPNKYIFKLRKNVVGKRSSEYDLLS------LSLKGVIKRDMENPEGKF-- 67 Query: 64 KNLVKESQKISPEDIVIAMSS--GSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 Q++ D + + + VG S++ + + + F Sbjct: 68 PAEFDTYQEVKEGDFIFCLFDVEETPRTVGLSSYHGMITGAYT------IFETNNVDKKF 121 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 I +F + ++ L G N I +F N IPPL+EQ+ IA+ LD A++D Sbjct: 122 IYYFYLNLDSDKRLKPLYKGL-RNTISKETFFSFNTFIPPLSEQQKIAQFLDDKTAKIDQ 180 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 E+ +LK +Q ++ +V L +W P+H + S + Sbjct: 181 AVDLAEKQIALLKEHKQILIQNSVTRGLNPDVPLKDSGVEWIGQVPEHWEILSIKRLSQV 240 Query: 233 TELRNG---LSSKPNESGVGHPILRISSVR--AGHVDQNDIRFLECSESELNRHKLQDGD 287 + + K ++ + +RIS V ++ + + +S L G Sbjct: 241 KRGASPRPIDNPKYFDNDGEYAWVRISDVTASNMYLLETTQKLSNLGKSYS--VPLMPGS 298 Query: 288 LLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAM 347 L + VG + + + ++ + +++ F S + Sbjct: 299 LFLSIAGS----VGKPII---TKIKVCIHDGFVYF--PENKQNTKFLYYIFYSEQPYIGL 349 Query: 348 MNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVN 407 Q ++ + + + +PP+ EQ +I ++ A D + ++ Sbjct: 350 GKM----GTQLNLNTDTVGAIKIPIPPLCEQQKIADYLDTQTAKIDQAIALKTAHIEKLK 405 Query: 408 NLTQSILAKAFRGELTA 424 ++ G++ Sbjct: 406 EYKSVLINDVVTGKVRV 422 Score = 143 bits (362), Expect = 1e-32, Method: Composition-based stats. 
Identities = 49/208 (23%), Positives = 84/208 (40%), Gaps = 12/208 (5%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDLVF 61 G++PE W I + ++ + RG + + Y +D +R +++ + Sbjct: 222 IGQVPEHWEILSIKRLSQVKRGASPRPIDNPKYFDNDGEYAWVRISDVTASNMYLLETTQ 281 Query: 62 VPKNLVKESQK-ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 NL K + P + +++ VGK + C F V PE + Sbjct: 282 KLSNLGKSYSVPLMPGSLFLSI----AGSVGKPIITKIKV-CIHDGF--VYFPENKQNTK 334 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 F+ + S + I L N+ + I IPIPPL EQ+ IA+ LDT A++D Sbjct: 335 FLYYIFYSE--QPYI-GLGKMGTQLNLNTDTVGAIKIPIPPLCEQQKIADYLDTQTAKID 391 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGK 208 A + LK ++ ++ V GK Sbjct: 392 QAIALKTAHIEKLKEYKSVLINDVVTGK 419 Score = 105 bits (262), Expect = 4e-21, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 82/235 (34%), Gaps = 28/235 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H + L + G S + +L +S G + ++ Sbjct: 13 EWLGEVPSHWNLIPNKYIFKLRKNVVGKRSS------EYDLLSLS--LKGVIKRDMENPE 64 Query: 272 ECSESELNRH-KLQDGDLLFTR--YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD 328 +E + + ++++GD +F + VG+ + ++ I T + Sbjct: 65 GKFPAEFDTYQEVKEGDFIFCLFDVEETPRTVGLSS------YHGMITGAYTIF--ETNN 116 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 ++I F+ + + + K + IS + S +PP+ EQ +I + ++ Sbjct: 117 VDKKFIYYFYLNLDSDKRLKPLYKG--LRNTISKETFFSFNTFIPPLSEQQKIAQFLDDK 174 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D +A + Q ++ + R NPD+ ++ + Sbjct: 175 TAKIDQAVDLAEKQIALLKEHKQILIQNSVT-------RGLNPDVPLKDSGVEWI 222 >UniRef50_A3J917 Type I restriction-modification n=1 Tax=Marinobacter sp. ELB17 RepID=A3J917_9ALTE Length = 444 Score = 242 bits (619), Expect = 2e-62, Method: Composition-based stats. 
Identities = 73/448 (16%), Positives = 145/448 (32%), Gaps = 45/448 (10%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 ++P GW IA +S + ++ G + + P N N + Sbjct: 17 IAQIPTGWQIASLSKLFSIKAGGDVNT-DVFSETRTHDRPFPIYTNANNPNIVYG---YT 72 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 K + ++ + VG +A + F+ VL P+K + F Sbjct: 73 SKAKYGPN----------CITVSGRGYVGFAAFRDHIFDAII--RLLVLTPKKDLNCKFF 120 Query: 123 AHFTKSS-LYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 +F +R + + I + + P EQ I LD A++D+ Sbjct: 121 EYFINEVVDFREE------SSAIGQLSTNQIAPYKVAFPDCREQSKITHFLDHETAKIDT 174 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 +++ ++LK RQAV+ AV L +W P H + + Sbjct: 175 LIHEQKRLIELLKEKRQAVISHAVTKGLDPDVPIKDSGVEWLGDVPAHWGVATIRRFAKA 234 Query: 233 --TELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELN-RHKLQDGDLL 289 T L +E G + + L S G +L Sbjct: 235 VRTGGTPSLEMPNSEIADGINWFTPGDFNGSLMLHESEKQLRISSISSGDAKLFPGGSVL 294 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 +L V + ++ K ++ S SA+ + M Sbjct: 295 VVGIGATLGKVAKV-------DDDFSANQQINVIVPGKRINGHFLVY---SLSAQKSQMR 344 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 V S ++ + K V++LPPV+EQ +I +++ D + + + + + Sbjct: 345 FVSNASTIGIMNQEKTKDIVLVLPPVEEQTQITESLDRGVQNLDQLVIKAASGILLLKER 404 Query: 410 TQSILAKAFRGELTAQWRAENPDLISGE 437 ++++ A G++ + D + + Sbjct: 405 RSALISAAVTGKIDVRDWQPPADESTFD 432 Score = 94.4 bits (234), Expect = 9e-18, Method: Composition-based stats. 
Identities = 27/233 (11%), Positives = 70/233 (30%), Gaps = 32/233 (13%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSK-PNESGVGHPILRISSVRAGHVDQNDIRF 270 W P L+ + + + P ++ ++ Sbjct: 15 NWIAQIPTGWQIASLSKLFSIKAGGDVNTDVFSETRTHDRPFPIYTNANNPNIVY----- 69 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 + ++ +VG + + +L+ KD Sbjct: 70 ---GYTSKAKYGPN------CITVSGRGYVGFAAFRDHIFDAII----RLLVLTPKKDLN 116 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 ++ E F + +++ + +S +S I V P +EQ++I ++ A Sbjct: 117 CKFFEYFIN------EVVDFREESSAIGQLSTNQIAPYKVAFPDCREQSKITHFLDHETA 170 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 DT+ + + + Q++++ A + +PD+ ++ L Sbjct: 171 KIDTLIHEQKRLIELLKEKRQAVISHAVT-------KGLDPDVPIKDSGVEWL 216 >UniRef50_Q6ZE86 Type I restriction-modification system S subunit n=2 Tax=Cyanobacteria RepID=Q6ZE86_SYNY3 Length = 464 Score = 242 bits (619), Expect = 2e-62, Method: Composition-based stats. Identities = 69/449 (15%), Positives = 149/449 (33%), Gaps = 38/449 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFD---TTDLV 60 G++P W I P G + E+ ++ + G+ L Sbjct: 18 GQIPAHWDIKP---------GFAFLSERKEKNTGMKESTVLS---LSYGQIVVKPPEKLH 65 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSG 120 + + Q P +I+I + V + + L ++ Sbjct: 66 GLVPESFETYQIAEPGNIIIRGTDLQNDKV-SLRVGKVRNRGIITSAYLCLETKEKFNPD 124 Query: 121 FIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + + +G N+ + F + + PP +EQ I + L ++ Q++ Sbjct: 125 YAHLLLHGYDLMKIYYGMGSGL-RQNLSFSDFKRLPLLAPPESEQSKINKYLQSIQVQIN 183 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESI 231 ++ ++LK +Q ++ AV L KW P++ F KL + Sbjct: 184 KFIRNKRRLIELLKEQKQNIINQAVTRGLDPNVKLKPSGVKWIGDIPEYWSFLKLKRIAC 243 Query: 232 LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQND--IRFLECSESELNRHKLQDGDLL 289 + + VG P++RI ++ + ++ E + + K+Q GDLL Sbjct: 244 VKTGY--AFKSDHYKSVGIPLIRIGDLKHSGLVDIKQAVKLQESDLTHFSCFKIQYGDLL 301 Query: 290 
FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR-LTKDALPEYIEIFFSSPSARNAMM 348 ++ V L ++ R +Y+ SS + Sbjct: 302 MAMTGATIGKVAK-----YQHQTEALLNQRVCSFRSFESKCFQDYLLFILSSEVYLKQVT 356 Query: 349 NCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNN 408 + Q IS + S + +PP+ EQ I+ V++ D+ + + + Sbjct: 357 IFCYGGA-QPNISDSTLMSFKIPVPPISEQQAILSYVQEQTKTTDSAVSRAEREIELIQE 415 Query: 409 LTQSILAKAFRGELTAQWRAENPDLISGE 437 +++ G++ + E P++ + Sbjct: 416 YYTRLMSDVVTGQVDVRD-IEVPEITEED 443 Score = 145 bits (367), Expect = 3e-33, Method: Composition-based stats. Identities = 49/247 (19%), Positives = 101/247 (40%), Gaps = 12/247 (4%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVF 61 G +PE W + + + G +K ++ K +PLIR +++ +G D V Sbjct: 226 IGDIPEYWSFLKLKRIACVKTGYAFKS----DHYKSVGIPLIRIGDLKHSGLVDIKQAVK 281 Query: 62 VPKNL--VKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRP-EKLIF 118 + ++ KI D+++AM+ + +GK A E R E F Sbjct: 282 LQESDLTHFSCFKIQYGDLLMAMTGAT---IGKVAKYQHQTEALLNQRVCSFRSFESKCF 338 Query: 119 SGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 ++ S +Y +++ G NI ++ IP+PP++EQ+ I + Sbjct: 339 QDYLLFILSSEVYLKQVTIFCYGGAQPNISDSTLMSFKIPVPPISEQQAILSYVQEQTKT 398 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNG 238 DS +R E+ ++++ + ++ V G++ + P+ + L E + + Sbjct: 399 TDSAVSRAEREIELIQEYYTRLMSDVVTGQVDVRDI-EVPEITEEDLLTLEDKSEVVDDD 457 Query: 239 LSSKPNE 245 L + +E Sbjct: 458 LVQEEDE 464 Score = 93.3 bits (231), Expect = 2e-17, Method: Composition-based stats. 
Identities = 33/248 (13%), Positives = 79/248 (31%), Gaps = 35/248 (14%) Query: 213 WRNFEPQHSVFKK-LNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 W P H K F S E G+ L + V + + Sbjct: 16 WLGQIPAHWDIKPGFAFLSERKEKNTGM------KESTVLSLSYGQI----VVKPPEKLH 65 Query: 272 ECSESELNRHKL-QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 +++ + G+++ L+ V + K++++ ++ L + Sbjct: 66 GLVPESFETYQIAEPGNIII--RGTDLQNDKVSLRVGKVRNRGIITSAYLC-LETKEKFN 122 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 P+Y + ++ +S D K +L PP EQ++I + ++ + Sbjct: 123 PDYAHLLLHGYDLMKIYYGMGSG--LRQNLSFSDFKRLPLLAPPESEQSKINKYLQSIQV 180 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL------- 443 + + + + Q+I+ +A R +P++ + + Sbjct: 181 QINKFIRNKRRLIELLKEQKQNIINQAVT-------RGLDPNVKLKPSGVKWIGDIPEYW 233 Query: 444 ----LEKI 447 L++I Sbjct: 234 SFLKLKRI 241 >UniRef50_C2CF25 Restriction modification system DNA specificity domain protein n=2 Tax=Clostridiales Family XI. Incertae Sedis RepID=C2CF25_9FIRM Length = 495 Score = 242 bits (619), Expect = 2e-62, Method: Composition-based stats. 
Identities = 90/441 (20%), Positives = 179/441 (40%), Gaps = 58/441 (13%) Query: 5 KLPEGWVIAPVSTVTTLIRGVTYKKEQAINYL-KDDYLPLIRANNIQNGKFDTTDLVFVP 63 +PE W + V I G K A + L ++ +P I A N+++G D +L+++ Sbjct: 66 DIPESWKWVRLGDVFQFINGDRGKNYPAKSKLKENGDIPFISAINLKDGTVDENNLLYLD 125 Query: 64 KNLVKE--SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 N + S K+ DIV+ + +GK+ + + + +LR K I F Sbjct: 126 INQYERLGSGKLLKNDIVLCIR----GSLGKNCIYPFE-KGAIASSLVILRNYKKIKLEF 180 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + ++ S L+ ++ G N+ + I +P+PPL EQ+ I EK++ L+ VD Sbjct: 181 VLNYLNSYLFYSETKKYDNGTAQPNLSAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDK 240 Query: 182 TKARFEQIPQILKR----FRQAVLGGAVNGKLTEKWRNFE-------------------- 217 ++ + + K+ ++++L A+ G+L E+ + Sbjct: 241 YGKNWQMLEDLNKKFPEDLKKSLLQEAIKGRLVEQRKEEGTGEELFELIKEEKNKLIKEG 300 Query: 218 ------------------PQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVR 259 +K + I +L +G P + G P L + + Sbjct: 301 KIKKQKPLPEITEEEIPFDIPESWKWVRLGEITLKLTDGAHKTPTYTNEGIPFLSVKDIS 360 Query: 260 AGHVDQNDIRFLECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYP 317 +G +D + RF+ E + R + GDLL T+ + G+ ++ + +L Sbjct: 361 SGKIDYSSCRFISKKEHDKLFERCNPERGDLLLTKVGTT----GIPVVIDTDEEFSLFVS 416 Query: 318 DKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKE 377 L++ K +++ +SP + + G K +DI + ++ LPP+ E Sbjct: 417 VALLKF-PKKLINIYFLKHLINSPLVQVQVKEN-TRGVGNKNWVMRDIANTIIPLPPLAE 474 Query: 378 QAEIVRRVEQLFAYADTIEKQ 398 Q +V ++E+L + + K Sbjct: 475 QKRLVEKLEELLPLCEQVIKN 495 Score = 149 bits (378), Expect = 2e-34, Method: Composition-based stats. 
Identities = 53/290 (18%), Positives = 109/290 (37%), Gaps = 28/290 (9%) Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLS- 240 + ++ + K +Q L E+ P+ + +L G + Sbjct: 36 IQEEKNKLIKEGKVKKQKPLPEI----TEEEIPFDIPESWKWVRLGDVFQFINGDRGKNY 91 Query: 241 --SKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESE-LNRHKLQDGDLLFTRYNGSL 297 + P + +++ G VD+N++ +L+ ++ E L KL D++ Sbjct: 92 PAKSKLKENGDIPFISAINLKDGTVDENNLLYLDINQYERLGSGKLLKNDIVLCIRGS-- 149 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G + + L+ R K E++ + +S + + Q Sbjct: 150 --LGKNCIYP---FEKGAIASSLVILRNYKKIKLEFVLNYLNSYLFYSE-TKKYDNGTAQ 203 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQ---VNNALARV-NNLTQSI 413 +S ++ K ++ LPP+KEQ IV ++E L D K + + + +L +S+ Sbjct: 204 PNLSAQNAKKILLPLPPLKEQERIVEKIEDLMLLVDKYGKNWQMLEDLNKKFPEDLKKSL 263 Query: 414 LAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 L +A +G L Q + E + L E IK E+ + +K+ Sbjct: 264 LQEAIKGRLVEQRK--------EEGTGEELFELIKEEKNKLIKEGKIKKQ 305 Score = 43.6 bits (102), Expect = 0.014, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 23/57 (40%), Gaps = 8/57 (14%) Query: 407 NNLTQSILAKAFRGELTAQWRAENPDLISGENSAAALLEKIKAERAASGGKKASRKK 463 L SIL +A G+L Q + E + L + I+ E+ + +K+ Sbjct: 4 QELKNSILQRAIEGKLVEQRK--------EEGTGEELYKLIQEEKNKLIKEGKVKKQ 52 >UniRef50_D1XRZ5 Restriction modification system DNA specificity domain protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XRZ5_9ACTO Length = 412 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. 
Identities = 85/421 (20%), Positives = 160/421 (38%), Gaps = 38/421 (9%) Query: 11 VIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKF---DTTDLVFVPKNLV 67 PV + + G P +R N+ G+ D ++ F P V Sbjct: 8 QWVPVRELGEVRMGKQLSPSSREAA---GQFPYLRVANVHLGRIEYVDVNEMGFTPAERV 64 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQH-LPFECSFGAFCGVLRPEKLIFSGF----I 122 + + P DI++ S +VG+SA E F RP I S + Sbjct: 65 --TYGLKPGDILLNE-GQSLELVGRSAIYDRAEGEFCFQNTLIRFRPNGCILSAYAQVVF 121 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 H+ +S ++ +I ++ F + P+ P Q+ I LD+L Sbjct: 122 EHWLRSGVFAAIAKQT---TSIAHLGGDRFAALKFPLLPTGMQQRIVAVLDSL----AEL 174 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK 242 + R E L+ R+ ++ + R S +L L ++ +GL+ Sbjct: 175 ERRIEASIVKLRSVRKGIISEQFS-------RADVEDGSPASRLRALDSLADVGSGLTLG 227 Query: 243 PNESGV---GHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEF 299 SG P LR+++V+ G + +++ + + S++ R +++ D+L T G + Sbjct: 228 GISSGGTLLEVPYLRVANVQDGFISTLEMKSVRVTPSDMERFRVRRDDVLVTE-GGDFDK 286 Query: 300 VGVCGLLKKLQHQNLLYPDKLIRARLTKDA-LPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 VG + + L + + R R K+ P ++ ++ SS + R + VK T+ Sbjct: 287 VGRGAVWDG-RIDPCLNQNHVFRVRCDKEVLDPHFLSLYMSSAAGRRYFLRVVKQTTNLA 345 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 I+ +K+ V PP++EQ V +L D Q L ++ L ++ Sbjct: 346 SINSSQLKAMPVPCPPLEEQRRTV----ELVGSCDEQIAQEEGELTKLRELKVGLVDDLL 401 Query: 419 R 419 Sbjct: 402 S 402 Score = 112 bits (281), Expect = 3e-23, Method: Composition-based stats. 
Identities = 33/195 (16%), Positives = 77/195 (39%), Gaps = 7/195 (3%) Query: 15 VSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVK-ESQKI 73 + ++ + G+T + L + +P +R N+Q+G T ++ V E ++ Sbjct: 214 LDSLADVGSGLTLGGISSGGTLLE--VPYLRVANVQDGFISTLEMKSVRVTPSDMERFRV 271 Query: 74 SPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPEKLI-FSGFIAHFTKSSLY 131 +D+++ G VG+ A + C +R +K + F++ + S+ Sbjct: 272 RRDDVLVTE-GGDFDKVGRGAVWDGRIDPCLNQNHVFRVRCDKEVLDPHFLSLYMSSAAG 330 Query: 132 RNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 R + + +I + + +P PPL EQ+ E + + Q+ + ++ Sbjct: 331 RRYFLRVVKQTTNLASINSSQLKAMPVPCPPLEEQRRTVELVGSCDEQIAQEEGELTKLR 390 Query: 191 QILKRFRQAVLGGAV 205 ++ +L V Sbjct: 391 ELKVGLVDDLLSRRV 405 >UniRef50_C7IKN2 Restriction modification system DNA specificity domain protein n=1 Tax=Clostridium papyrosolvens DSM 2782 RepID=C7IKN2_9CLOT Length = 397 Score = 242 bits (618), Expect = 2e-62, Method: Composition-based stats. Identities = 70/426 (16%), Positives = 159/426 (37%), Gaps = 44/426 (10%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++P+ W + + + +++ G T + + Y + + ++ ++ + T Sbjct: 10 ELGEIPQEWEVRKIEDLYSVLTGATPLRGKQ-EYYLNGNVAWVKTLDLNDRYIYDTQEKI 68 Query: 62 VPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 L + S K+ E V+ G + +G++ L + + L + I+ + Sbjct: 69 TDLALKETSCKVQDEGTVLIAMYGGFNQIGRTGI--LKTKAATNQAICSLPLIEEIYPEY 126 Query: 122 IAHFTKSSLYRNKISSLSAGA-NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 + +F + RN +++A NI + NI +PPL+EQ IA+ L T+ Q+D Sbjct: 127 LNYFLIKN--RNVWRNVAASTRKDPNITKGDVEKFNIIVPPLSEQYKIADILSTIDEQID 184 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNG-KLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 T A E+ ++ K Q +L + + + P+ KKL + ++ G Sbjct: 185 KTDALIEKTRELKKGLMQKLLIKGIGHTEFRDTEIGRIPKGWEVKKL---EEIVQICYGK 241 Query: 240 SSKPNESGVGH-PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLE 298 + K E G IL V N + +L R + Sbjct: 242 NQKEVEIEGGIYKILGTGGVIGNT----------------NDYLWDKPSVLIGRKGTIDK 285 Query: 299 
FVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQK 358 + D L ++ + + +++ + + + +G Sbjct: 286 --------PMYIEEPFWTVDTLFYTKVDEGYVAKWLYYYLNKIDLKKY-----NEATGVP 332 Query: 359 GISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 +S + + ++L+PP KEQ +I + + + + D + ++ N ++++ Sbjct: 333 SLSVAVLNTILILVPPFKEQQKISKILSAVDSDID----VYESKKNKLENAKKALMNHLL 388 Query: 419 RGELTA 424 G++ Sbjct: 389 TGKIRV 394 Score = 124 bits (312), Expect = 7e-27, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 71/216 (32%), Gaps = 12/216 (5%) Query: 204 AVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNES-GVGHPILRISSVRAGH 262 + PQ +K+ + L K ++ + + Sbjct: 1 MIREGYKMTELGEIPQEWEVRKIEDLYSVLTGATPLRGKQEYYLNGNVAWVKTLDLNDRY 60 Query: 263 VDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIR 322 + + + + E + +G +L Y G +G G+LK + Sbjct: 61 IYDTQEKITDLALKETSCKVQDEGTVLIAMYGG-FNQIGRTGILK----TKAATNQAICS 115 Query: 323 ARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIV 382 L ++ PEY+ F RN N +T I+ D++ +++PP+ EQ +I Sbjct: 116 LPLIEEIYPEYLNYFL--IKNRNVWRNVAASTRKDPNITKGDVEKFNIIVPPLSEQYKIA 173 Query: 383 RRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAF 418 + + D + + + L + ++ K Sbjct: 174 DILSTIDEQIDK----TDALIEKTRELKKGLMQKLL 205 >UniRef50_B9M293 Restriction endonuclease S subunit-like protein n=1 Tax=Geobacter sp. FRC-32 RepID=B9M293_GEOSF Length = 644 Score = 241 bits (617), Expect = 3e-62, Method: Composition-based stats. 
Identities = 93/438 (21%), Positives = 175/438 (39%), Gaps = 35/438 (7%) Query: 5 KLPEGWVIAPVSTVT-TLIRGVTYKKEQAINYLKDDYLPLIRANNIQ-NGKFDTTDLVFV 62 +LPE W +A V V L G K + D P IR +N+ +GK + + Sbjct: 2 RLPESWRVATVGNVLLDLQPGFAQKPGEE----DDGTTPQIRTHNVTPDGKITLEGIKHI 57 Query: 63 PKNLVKE-SQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSG 120 + + K+ D+V ++ S+ VGK+A + E F LRP +L+ Sbjct: 58 SASAKETARYKLMMGDVVFNNTN-SEEWVGKTAVFNQEGEYVFSNHMTRLRPHPELVTPE 116 Query: 121 FIAHFTKSSLYRNKISSLSAG-ANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++A + + + + I+ + + +P L EQ I + L Sbjct: 117 YLAFYLHQLWAIGYSKTRAKRWVSQAGIESKAIASFKLSLPTLPEQHRIIDVLRQAQD-- 174 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGL 239 +++ EQ+ ++ +A+ S + F T + G Sbjct: 175 --LRSQKEQVLKLSAELAKALFEQHF---------GIAGASSAWPMEPFGKHTTYSKYGP 223 Query: 240 SSKPN-ESGVGHPILRISSVRA-GHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 S G ILR + + G + + L +E ++ H L+ G L+ +R Sbjct: 224 RFPDQQYSDSGIHILRTTDMNNDGTIRWWEAPKLALTEGQIQEHALKPGTLVVSRSG--- 280 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 +G L Q + LI L PEY+ F++P + M+ + Q Sbjct: 281 -TIGPFALFDG-QEGRCVAGAYLIEFGLADSVQPEYVRALFATPYVQ-QMLKKAVRSVAQ 337 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 I+ +I+S + +PP++ Q +++Q+ A+ I K ++++ + ++++ +A Sbjct: 338 PNINAPNIQSIKIPVPPLEIQEAFAVQIKQVRAWTSEIVKSA----SKIDEVIRAVVGEA 393 Query: 418 FRGELTAQWRAENPDLIS 435 F GELTAQWR + I+ Sbjct: 394 FSGELTAQWRGMHASEIT 411 >UniRef50_A1VAP4 Restriction modification system DNA specificity domain n=1 Tax=Desulfovibrio vulgaris DP4 RepID=A1VAP4_DESVV Length = 438 Score = 241 bits (617), Expect = 3e-62, Method: Composition-based stats. 
Identities = 67/436 (15%), Positives = 151/436 (34%), Gaps = 38/436 (8%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+P W + + ++ + + + + L + I ++ D + Sbjct: 18 GKIPSHWSVTSLYSLA---SECDFPNKDML----ESNLLSLSYGRIIRKDINSNDG--LL 68 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHL-PFECSFGAFCGVLRPEKLIFSGFI 122 + Q + DIV+ ++ +S L + +RP +S ++ Sbjct: 69 PESFETYQIVDHGDIVLRLTDLQNDQ--RSLRSGLVKERGIITSAYTAIRPTASHYS-YL 125 Query: 123 AHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDST 182 A+ ++ S+ G ++K + + I P +EQ IA LD A++D+ Sbjct: 126 AYLLRAYDTLKIFYSMGGGL-RQSMKFSDLRRLPILKPAYSEQSAIAVFLDHETAKIDAL 184 Query: 183 KARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFE--SI 231 E++ ++LK RQAV+ AV L +W P+H KL ++ Sbjct: 185 ITEQEKLIELLKEKRQAVISHAVTKGLAPNVPMKDSGVEWLGEVPEHWKVAKLRRFVRAV 244 Query: 232 LTELRNGLSSKPNESGVGHPILRISSVRAGHVDQ--NDIRFLECSESELNRHKLQDGDLL 289 T S + G +G + + + + + G + Sbjct: 245 QTGSTPSASPPNTDIEDGTYWFTPGDF-SGPIRLGSSSKKVPPEAIKQGEVKVFPAGAVF 303 Query: 290 FTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMN 349 +L +G ++ D ++ SS + + M Sbjct: 304 VVSIGATLGKIG-------YLLTLASANQQINAIIPNADVEGLFLAYSLSS---KTSEMM 353 Query: 350 CVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNL 409 + S ++ + K + +PP+ EQ I + +++ +D + + A+ + Sbjct: 354 NLSNASTIGIMNQEKTKEIWLTVPPLCEQERITKFLDEDCVTSDALVNESQRAIDLLKER 413 Query: 410 TQSILAKAFRGELTAQ 425 ++++ A G++ + Sbjct: 414 RSALISAAVTGKIDVR 429 Score = 106 bits (265), Expect = 2e-21, Method: Composition-based stats. 
Identities = 29/233 (12%), Positives = 70/233 (30%), Gaps = 23/233 (9%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H L + + + L + ++ ND L Sbjct: 15 EWLGKIPSHWSVTSLYSLASECDF-----PNKDMLESNLLSLSYGRIIRKDINSND-GLL 68 Query: 272 ECSESELNRHKLQDGDLLFTRYN-GSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 S + GD++ + + + GL+K+ + R T Sbjct: 69 PESFETYQ--IVDHGDIVLRLTDLQNDQRSLRSGLVKE----RGIITSAYTAIRPTAS-H 121 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 Y+ + + ++ + D++ +L P EQ+ I ++ A Sbjct: 122 YSYLAYLLRAYDTLKIFYSM--GGGLRQSMKFSDLRRLPILKPAYSEQSAIAVFLDHETA 179 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + + Q++++ A + P++ ++ L Sbjct: 180 KIDALITEQEKLIELLKEKRQAVISHAVT-------KGLAPNVPMKDSGVEWL 225 >UniRef50_A6L5S4 Type I restriction-modification system S subunit n=1 Tax=Bacteroides vulgatus ATCC 8482 RepID=A6L5S4_BACV8 Length = 430 Score = 241 bits (616), Expect = 4e-62, Method: Composition-based stats. Identities = 68/432 (15%), Positives = 150/432 (34%), Gaps = 29/432 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 GK+P W I + T N L+ ++ + D+ + Sbjct: 16 GKIPSHWEIKRSRLIFD-ENVETNSTCNNTNQLQ---FRFGTIEPKKSQEMDSDLKKIIS 71 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + DI+I + + V + + + + LRP++ I S + Sbjct: 72 -----KYTIVQNGDIMINGLNLNYDFVSQ-RVAQVKEKGIITSAYIALRPKENICSDYFT 125 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + K R + G + F +PIPPL EQ+ +A LD A++D Sbjct: 126 YLLKGMDARKVFHGMGCGV-RLTLSFKEFRNELLPIPPLEEQQSMATYLDKATAEIDKAI 184 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLT---------EKWRNFEPQHSVFKKLNFESILTE 234 A+ +++ +L +Q ++ AV L W P H L + + Sbjct: 185 AQQQRMIDLLNERKQIIIQRAVTKGLDGNVEMKNSGLNWLGQIPSHWESLPLTYVFEMRN 244 Query: 235 LRNGLSSKPNESGVG-HPILRISSV-RAGHVDQNDIRFLECSESELNRHKLQDGDLLFTR 292 + P G P R+ + ++G + ++++ +++ + + G + Sbjct: 245 GYTPSKNDPTYWTNGSIPWYRMEDIRKSGRFLREAMQYVT-TKAINGKGTFKAGSYIMAI 303 Query: 293 
YNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVK 352 +G +L N + + IR L + P ++ + Sbjct: 304 ---CTASIGEHAMLIADSLANQRFANFKIRKSLIESFYPLFLFYYM---YVVGDFCRENS 357 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 ++ + + +K + P ++EQ IV + Q +T +++ + + Q Sbjct: 358 NSTCFQYVDMGALKRFPIPKPSMEEQKNIVSSLTQNLQQINTALERIQKQITLLQERKQI 417 Query: 413 ILAKAFRGELTA 424 I+++ G++ Sbjct: 418 IISEVVTGKIKV 429 Score = 116 bits (291), Expect = 2e-24, Method: Composition-based stats. Identities = 34/235 (14%), Positives = 81/235 (34%), Gaps = 26/235 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P H K+ N + N+ R G ++ + + Sbjct: 13 QWLGKIPSHWEIKRSRLIFDENVETNSTCNNTNQL----------QFRFGTIEPKKSQEM 62 Query: 272 ECSESEL--NRHKLQDGDLLFTRYNGSLEFVG-VCGLLKKLQHQNLLYPDKLIRARLTKD 328 + ++ +Q+GD++ N + +FV +K+ + I R ++ Sbjct: 63 DSDLKKIISKYTIVQNGDIMINGLNLNYDFVSQRVAQVKE----KGIITSAYIALRPKEN 118 Query: 329 ALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 +Y AR + +S K+ +++++ +PP++EQ + +++ Sbjct: 119 ICSDYFTYLLKGMDARKVFHGMGCG--VRLTLSFKEFRNELLPIPPLEEQQSMATYLDKA 176 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 A D Q + +N Q I+ +A + + ++ + L Sbjct: 177 TAEIDKAIAQQQRMIDLLNERKQIIIQRAVT-------KGLDGNVEMKNSGLNWL 224 >UniRef50_B3ENS6 Putative type I restriction-modification system n=1 Tax=Chlorobium phaeobacteroides BS1 RepID=B3ENS6_CHLPB Length = 436 Score = 241 bits (616), Expect = 4e-62, Method: Composition-based stats. 
Identities = 77/447 (17%), Positives = 152/447 (34%), Gaps = 48/447 (10%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYL-PLIRANNIQNGKFDTTDLVFV 62 G++PE W + + + + ++ + + P Q+ ++ V + Sbjct: 18 GEVPEHWQMINSRRLFHQAKESPLTDDIQLSATQKYGVVP-------QSLFMESDGKVAL 70 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 + + + + +D VI++ S + + VLRP + I + Sbjct: 71 ALSGLGNFKHVEVDDFVISLRSFQGG------IERSKYSGCVSPAYTVLRPAEPIDGSYW 124 Query: 123 AHFTKSSLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 KS Y + +++ G +I F I +P PPLAEQ IAE LD ++D Sbjct: 125 GFLLKSRRYVEILQTMNDGLRDGKSISYQQFGQIPLPSPPLAEQTAIAEFLDRETGKIDE 184 Query: 182 TKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNFESIL 232 A ++ ++LK RQAV+ AV L +W P KL S Sbjct: 185 LVAEQRRLMELLKEKRQAVISHAVTKGLNPHAPMKPSGIEWLGDVPVGWSVLKLGNISRF 244 Query: 233 TELRNGLSSKPNESGVGHPILRISSVRA----GHVDQNDIRFLECSESELNRHKLQDGDL 288 S ++ P ++ + + + + E + EL + + Sbjct: 245 KGGAGFPDSYQGQTDNEIPFFKVGDMVNADDARVMRRANHTITEATARELRAFVFPESTI 304 Query: 289 LFTRYNGSL-----EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSA 343 +F + +L +G + + + L Sbjct: 305 VFAKVGAALLLKRYRLLGQRSCIDNNMMGMTVGDGSSVDYLL---------------YVL 349 Query: 344 RNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNAL 403 + + I+ I Q + LPP+ EQ EIV + + A DT+ + + Sbjct: 350 PLLDLELIVNPGAVPSINEGQISGQRIALPPIDEQREIVEFLTSVTAKFDTLTAEAQRTI 409 Query: 404 ARVNNLTQSILAKAFRGELTAQWRAEN 430 + ++++ A G++ + N Sbjct: 410 DLLQERRTALISAAVTGQIDVRQPPRN 436 Score = 101 bits (252), Expect = 7e-20, Method: Composition-based stats. 
Identities = 30/236 (12%), Positives = 70/236 (29%), Gaps = 28/236 (11%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISS---VRAGHVDQNDI 268 +W P+H ++ R +K + L + V + Sbjct: 15 EWLGEVPEHWQ--------MINSRRLFHQAKESPLTDDIQ-LSATQKYGVVPQSLFMESD 65 Query: 269 RFLECSESELNRHK-LQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 + + S L K ++ D + + + G +++ ++ + P + R + Sbjct: 66 GKVALALSGLGNFKHVEVDDFVISLRS-------FQGGIERSKYSGCVSPAYTV-LRPAE 117 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 Y S + K IS + + PP+ EQ I +++ Sbjct: 118 PIDGSYWGFLLKSRRYVEILQTMNDGLRDGKSISYQQFGQIPLPSPPLAEQTAIAEFLDR 177 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + + Q++++ A + NP + L Sbjct: 178 ETGKIDELVAEQRRLMELLKEKRQAVISHAVT-------KGLNPHAPMKPSGIEWL 226 >UniRef50_C6YVW3 Predicted protein n=2 Tax=Francisella philomiragia subsp. philomiragia ATCC 25015 RepID=C6YVW3_9GAMM Length = 379 Score = 241 bits (615), Expect = 5e-62, Method: Composition-based stats. Identities = 75/422 (17%), Positives = 149/422 (35%), Gaps = 50/422 (11%) Query: 7 PEGWVIAPVSTVTTL-IRGVTYKKEQAINYLKDDYLPLIRANN-IQNGKFDTTDLVFVPK 64 P GW + V ++ KK + +D P+ A I+N F + ++ Sbjct: 2 PAGWEWEKLEKVCDKASSNLSLKKIEN----EDGEYPIYGAKGFIKNISFFHREEPYIS- 56 Query: 65 NLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIAH 124 I V L + S L P+ I ++ Sbjct: 57 ---------------IIKDGAGVGRV-----TMLDSKSSVIGTLQYLLPKNCIDIKYLYF 96 Query: 125 FTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 + + +G I +I + +P+PPLAEQK I KLD+L ++D Sbjct: 97 LLLVIDFGKYV----SGTTIPHIYYRDYKEHLVPLPPLAEQKRIVAKLDSLFEKIDKAIE 152 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSK-- 242 +Q + L + N + I ++ +G + K Sbjct: 153 LHQQNITNANTLMASTLDKTFKK-----------LEGEYSYKNLKDITIKIGSGATPKGG 201 Query: 243 -PNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRH-KLQDGDLLFTRYNGSLEFV 300 G ++R +V + + F++ S+++ ++ ++ D+L S V Sbjct: 202 QKAYKQKGTSLIRSMNVHDMGFSKKGLAFIDDSQADKLKNVIVEKDDVLLNITGAS---V 258 Query: 301 
GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 C ++ + + + RL + +++ + SP + ++ + ++ I Sbjct: 259 ARCCVVCESALPARV-NQHVSIIRLNDSFISKFLHYYLISPMKKTELLFSSSGGATREAI 317 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + I++ V + Q + V ++ + D I++ L + L SIL KAFRG Sbjct: 318 TKSMIENLQVPDISLPIQQQTVEYLDSIATKVDKIKQLNEQKLENLKALKASILDKAFRG 377 Query: 421 EL 422 EL Sbjct: 378 EL 379 >UniRef50_Q58615 Uncharacterized protein MJ1218 n=1 Tax=Methanocaldococcus jannaschii RepID=Y1218_METJA Length = 425 Score = 241 bits (615), Expect = 5e-62, Method: Composition-based stats. Identities = 66/432 (15%), Positives = 154/432 (35%), Gaps = 27/432 (6%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M ++PE W + + I+G K ++ + Y P + +++G + + Sbjct: 16 MHGLRVPEDWEVVRIGDFIKYIKGK--KPAVMVDEELEGYYPYLSTEYLRDG-IASKFVK 72 Query: 61 FVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI-FS 119 K ++ ++ DI++ + + L + + L + I Sbjct: 73 ITNKEII-----VNENDILLLWDGSNAGEI------FLGKKGILSSTMVKLEQKNKIMDD 121 Query: 120 GFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQV 179 ++ + K L + + S + G I ++ F+ I IP+PPL EQK IA+ L + Sbjct: 122 LYLFYSLK--LKESFLKSQTKGTGIPHVDKKIFENIKIPLPPLEEQKQIAKILSDFDNLI 179 Query: 180 DSTKARFEQIPQILKRFRQAVLGGAV--NGKLTEKWRNFEPQHSVFKKLNFESILTELRN 237 + + E + + K + + V + + P+ KL + + Sbjct: 180 GTINKQIEVLNKAKKGMMKKLFTKGVFEHKSFKKSEIGEIPEDWEVVKLKEVVDIQSGKY 239 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSE-SELNRHKLQDGDLLFTRYNGS 296 G L+I +V G + + FL ++ + L+ GD++ Sbjct: 240 FKY--SEFCENGVKCLKIDNVGFGKIFWETVSFLPEDYLNKYPQLVLKSGDIVLALNRPI 297 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + G+LK + +LY ++ +++ S + + + + Sbjct: 298 IGGKIKIGILKDIDEPAILYQRVGRFIFKSEKIDKQFLFYLLMSEYFKKELSKLL-IGTD 356 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 Q I + + + LP ++EQ + R++ D + + ++ + I+ Sbjct: 357 QPYIRTPVLLNIKIPLPHLEEQKAMAERLKS----IDNLIEIKRKEKEQIEKAKKKIMNL 412 Query: 417 
AFRGELTAQWRA 428 G++ + Sbjct: 413 LLTGKIRVKNLN 424 Score = 133 bits (336), Expect = 1e-29, Method: Composition-based stats. Identities = 42/219 (19%), Positives = 95/219 (43%), Gaps = 12/219 (5%) Query: 2 SAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVF 61 G++PE W + + V + G +K + ++ + ++ +N+ GK + F Sbjct: 215 EIGEIPEDWEVVKLKEVVDIQSGKYFK----YSEFCENGVKCLKIDNVGFGKIFWETVSF 270 Query: 62 VPKNLVKESQKI--SPEDIVIAMSSGSKSVVGKSAHQ-HLPFECSFGAFCGVLRPE-KLI 117 +P++ + + ++ DIV+A++ K + G + + I Sbjct: 271 LPEDYLNKYPQLVLKSGDIVLALNRPIIGGKIKIGILKDIDEPAILYQRVGRFIFKSEKI 330 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F+ + S ++ ++S L G + I+ I IP+P L EQK +AE+L + Sbjct: 331 DKQFLFYLLMSEYFKKELSKLLIGTDQPYIRTPVLLNIKIPLPHLEEQKAMAERLKS--- 387 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNF 216 +D+ + + +++ ++ ++ + GK+ K NF Sbjct: 388 -IDNLIEIKRKEKEQIEKAKKKIMNLLLTGKIRVKNLNF 425 >UniRef50_Q7UE33 Type I restriction modification enzyme, S subunit n=1 Tax=Rhodopirellula baltica RepID=Q7UE33_RHOBA Length = 393 Score = 241 bits (615), Expect = 6e-62, Method: Composition-based stats. 
Identities = 64/426 (15%), Positives = 142/426 (33%), Gaps = 37/426 (8%) Query: 1 MSAGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLV 60 M + + P GW + +S + + Y Q + D +P +R + I + D+ Sbjct: 1 MKSHETPAGWSLTKLSEICDPNAPIMYGILQPGPVILD-GVPYVRPSEIDPDRIRLEDIK 59 Query: 61 FVPKNLVKESQK--ISPEDIVIAMSSGSKSVVGKSAHQHLPFE-CSFGAFCGVLRPE-KL 116 + + ++ + ED++I + +G+ A + +R K Sbjct: 60 RTTPEIAERYRRSTLQTEDLLITI----VGTLGRIAVVPPELNGANITQSSARIRLNRKT 115 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 +I +S + + G + + + IP+PPL+EQK IAE LD Sbjct: 116 ANLRYIRQLLRSPIAIRQYDFHRLGTGVPRLNIHHVRDLQIPLPPLSEQKRIAEILDRAE 175 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 A +A + ++ + + G V+ P+ + L+ +T Sbjct: 176 ALRAKRRAALALLDELTQSIFLDMFGDPVSN----------PKGWPVESLSDLGKITTGG 225 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGS 296 S K G P + + + + + + G + Sbjct: 226 TPSSKKEGMFGGTVPFVTPGDLESDEL----PKRTLSDHGASEAKTVPAGATFVCCIGAT 281 Query: 297 LEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSG 356 + +G + + +L + ++ ++ ++ Sbjct: 282 IGKMGQASV-------RSAFNQQLNAIEWSNSVNDDFGLGVLR---FFKKLIATWGASTT 331 Query: 357 QKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAK 416 + + + +PP++ QA R + + + N+L+ ++ L S+ + Sbjct: 332 LPILKKSSFEKIEIPVPPIESQAIYADR----KSEIEQLRSLHRNSLSELDQLFASLQHR 387 Query: 417 AFRGEL 422 AFRGEL Sbjct: 388 AFRGEL 393 >UniRef50_C0VJ61 Restriction modification system DNA specificity domain protein n=2 Tax=Acinetobacter RepID=C0VJ61_9GAMM Length = 401 Score = 240 bits (612), Expect = 1e-61, Method: Composition-based stats. 
Identities = 80/399 (20%), Positives = 147/399 (36%), Gaps = 32/399 (8%) Query: 36 LKDDYLPLIRANNIQN-GKFDT--TDLVFVPKNLVKESQK--ISPEDIVIAMSSGSKS-- 88 K+ + L+ NI GK D TD + + + Q I D+VIA S + Sbjct: 23 FKESGIKLLNVANITKQGKIDLNKTDRHLSTEEVDSKYQHFLIDEGDLVIASSGITNDED 82 Query: 89 ---VVGKSAHQHLPFECSFGAFCGVLRPEKLI-FSGFIAHFTKSSLYRNKISSLSAGANI 144 + + + + + F+ H+ S +R +I+ G Sbjct: 83 NLLRTKIAFIEKQHLPLCLNTSTIRFKAKDGVSDLKFLKHWLNSLEFRQQITKEVTGIAQ 142 Query: 145 NNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIPQILKRFRQAVLGGA 204 N P+ I I +PPL EQ+ IA LD + E++ Q+L+ + G Sbjct: 143 KNFGPSHLKKIKISLPPLTEQRRIASILDQADELRQKRQQAIEKLDQLLQATFIDMFGDP 202 Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD 264 V+ P+ + + S + K + + LR ++V+ D Sbjct: 203 VSN----------PKGWDLRYVGEISESKLGKMLDKKKQSSEIDQYKYLRNANVQWFRFD 252 Query: 265 QNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR 324 +D+ +E +E + +L+ GD+L G + K N + L R R Sbjct: 253 LSDVFEMEFNEKDRKNCELKFGDVLVCEGGEP----GRAAIWKNDLE-NCFFQKALHRVR 307 Query: 325 LTK-DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVR 383 L LPEY F S + + + ++G +K+ + +PP+ Q + Sbjct: 308 LDMTQILPEYFVWLFWFYSKNGGFDDHITV-ATIAHLTGVKMKAMQIPIPPLSLQEDF-- 364 Query: 384 RVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 +Q + ++ + N+ +L S+ +AF G L Sbjct: 365 --QQKVNEIEVLKTTLENSSKLFESLFSSLQNQAFNGTL 401 Score = 92.1 bits (228), Expect = 4e-17, Method: Composition-based stats. 
Identities = 35/206 (16%), Positives = 77/206 (37%), Gaps = 13/206 (6%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP-KN 65 P+GW + V ++ G K++ + + D +R N+Q +FD +D+ + Sbjct: 206 PKGWDLRYVGEISESKLGKMLDKKKQSSEI--DQYKYLRNANVQWFRFDLSDVFEMEFNE 263 Query: 66 LVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLP-FECSFGAFCGVLRPE-KLIFSGFIA 123 +++ ++ D+++ G++A C F +R + I + Sbjct: 264 KDRKNCELKFGDVLVC----EGGEPGRAAIWKNDLENCFFQKALHRVRLDMTQILPEYFV 319 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 A I ++ + IPIPPL+ Q+ + + +++ K Sbjct: 320 WLFWFYSKNGGFDDHITVATIAHLTGVKMKAMQIPIPPLSLQE---DF-QQKVNEIEVLK 375 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKL 209 E ++ + ++ A NG L Sbjct: 376 TTLENSSKLFESLFSSLQNQAFNGTL 401 >UniRef50_C6CZ61 Restriction modification system DNA specificity domain protein n=1 Tax=Paenibacillus sp. JDR-2 RepID=C6CZ61_PAESJ Length = 456 Score = 240 bits (612), Expect = 1e-61, Method: Composition-based stats. Identities = 83/459 (18%), Positives = 166/459 (36%), Gaps = 48/459 (10%) Query: 5 KLPEGWVIAPVSTVT---TLIRGVTYKK-EQAINYLKDDYL-PLIRANNIQNGKFDTTDL 59 ++P WV + ++ + +++ + + D +R +++ G Sbjct: 9 EVPGNWVWVKLGSLAYLTDFVANGSFQSLRENVEVSDDTDYALYVRLTDLRLG-LGHEGQ 67 Query: 60 VFVPKNLVK--ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLI 117 +V + K ++ +I+IA + V ++ + VLR + Sbjct: 68 KYVDETSYKFLSKSSLTGGEILIANIGANVGEV--FVMPNVDLLATIAPNMIVLRCNHYV 125 Query: 118 FSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 + F+ +F S + + ++ G I I++ +PPL EQK IA+K++ LL Sbjct: 126 ENIFLNYFLSSPQGKKLLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIADKVERLLD 185 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFE-------------------- 217 +++ K E+ + + A+L A G+LT+KWR Sbjct: 186 KINQAKQLIEEAKATFELRQAAILDKAFRGELTKKWRGEHSNQISTVRSISEDINPNEIP 245 Query: 218 ---PQHSVFKKLNFESILTELRNGLSSK--PNESGVGHPILRISSVRAGHVDQNDIRFLE 272 P + +L L ++ + P G +P ++ V Sbjct: 246 FLLPAGWNWVRLKDLGTLERGKSKHRPRNDPKLFGGEYPFIQTGDVANAGDYIESYNQTL 305 Query: 273 
CSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK-DALP 331 L +G + T + LLK +PD ++ Sbjct: 306 SEFGLLQSKLFPEGTVCITI----AANIADTALLK----FPCCFPDSVVGFIPKDAYISS 357 Query: 332 EYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAY 391 Y+ + + + + + QK I+ K ++ +V +PP E EI+ + L Sbjct: 358 LYLHYYMRT---IKSNLEHYAPATAQKNINLKVLQEILVPVPPKTEHDEILHMINLLMQK 414 Query: 392 ADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAEN 430 D + + N + + L QS+L+KAF+G L +EN Sbjct: 415 -DEEAQTIMNVASDLEILKQSVLSKAFQGNLGTNESSEN 452 Score = 142 bits (360), Expect = 2e-32, Method: Composition-based stats. Identities = 58/235 (24%), Positives = 104/235 (44%), Gaps = 14/235 (5%) Query: 216 FEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH-------PILRISSVRAGHVDQNDI 268 P + V+ KL + LT+ S + V +R++ +R G + Sbjct: 9 EVPGNWVWVKLGSLAYLTDFVANGSFQSLRENVEVSDDTDYALYVRLTDLRLG-LGHEGQ 67 Query: 269 RFLEC-SESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTK 327 ++++ S L++ L G++L VG ++ + + P+ +I R Sbjct: 68 KYVDETSYKFLSKSSLTGGEILIANIGA---NVGEVFVMPNVDLLATIAPN-MIVLRCNH 123 Query: 328 DALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQ 387 ++ F SSP + ++ + T +GQ I+ +K+ V LPP+ EQ I +VE+ Sbjct: 124 YVENIFLNYFLSSPQGKK-LLGTIITGTGQPKINKTGLKTISVALPPLNEQKRIADKVER 182 Query: 388 LFAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAA 442 L + ++ + A A +IL KAFRGELT +WR E+ + IS S + Sbjct: 183 LLDKINQAKQLIEEAKATFELRQAAILDKAFRGELTKKWRGEHSNQISTVRSISE 237 >UniRef50_C9MBB6 Restriction endonuclease S n=5 Tax=Haemophilus influenzae RepID=C9MBB6_HAEIN Length = 416 Score = 239 bits (611), Expect = 1e-61, Method: Composition-based stats. 
Identities = 57/425 (13%), Positives = 133/425 (31%), Gaps = 28/425 (6%) Query: 4 GKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVP 63 G++P W + + + KK + L + GK V Sbjct: 16 GEVPSHWELKRLKQLF------VEKKHKQSLSLNCGAISF--------GKVIEKSDDKVT 61 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFIA 123 + + Q++ + +I + + ++ + A VL+ +++I + + Sbjct: 62 EATKRSYQEVLKGEFLINPLNLNYDLI-SLRIALSEIDVVVSAGYIVLKEKQIINKKYFS 120 Query: 124 HFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 + + L +G I + IPPL+EQ+ IA+ LD A++D Sbjct: 121 YLLHRYDV-AYMKLLGSGV-RQTINYGHISDSILVIPPLSEQQKIAQFLDDKTAKIDQAV 178 Query: 184 ARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKP 243 E+ +LK +Q ++ AV L + ++ + + Sbjct: 179 DLAEKQIALLKEHKQILIQNAVTRGLNPDVPLKDSGVEWIGQVPEHWDVQRSKFIFKKIE 238 Query: 244 NESGVGHPILRISSVRAGHVDQNDIRFLE---CSESELNRHKLQDGDLLFTRYNGSLEFV 300 + I+ R G V R E + E ++ GDL+ + + Sbjct: 239 RKVNEEDQIVT--CFRDGQVTLRANRRTEGFTNALKEHGYQGIRKGDLVIHAMDAFAGAI 296 Query: 301 GVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKG- 359 G+ + + + + + + + + K + Sbjct: 297 GIS-----DSDGKATPVYSVCLPHDKQKIDVYFYAYYLRNLALSGFISSLAKGIRERSTD 351 Query: 360 ISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFR 419 D ++ +PP EQ +I +++ + D + ++ ++ Sbjct: 352 FRYSDFAELLLPIPPYLEQQKIADYLDKQTSKIDRAIALKTAHIEKLKEYKNVLINDVVT 411 Query: 420 GELTA 424 G++ Sbjct: 412 GKVRV 416 Score = 121 bits (305), Expect = 4e-26, Method: Composition-based stats. 
Identities = 37/233 (15%), Positives = 84/233 (36%), Gaps = 28/233 (12%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGH-VDQNDIRF 270 +W P H K+L + + + LS L ++ G ++++D + Sbjct: 13 EWLGEVPSHWELKRLKQLFVEKKHKQSLS------------LNCGAISFGKVIEKSDDKV 60 Query: 271 LECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDAL 330 E ++ + ++ G+ L N + + + L L +++ I + + Sbjct: 61 TEATK--RSYQEVLKGEFLINPLNLNYDLI---SLRIALSEIDVVVSAGYIVLKEKQIIN 115 Query: 331 PEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFA 390 +Y A M + + Q I+ I ++++PP+ EQ +I + ++ A Sbjct: 116 KKYFSYLLHRYDV--AYMKLLGSGVRQ-TINYGHISDSILVIPPLSEQQKIAQFLDDKTA 172 Query: 391 YADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D +A + Q ++ A R NPD+ ++ + Sbjct: 173 KIDQAVDLAEKQIALLKEHKQILIQNAVT-------RGLNPDVPLKDSGVEWI 218 Score = 120 bits (302), Expect = 1e-25, Method: Composition-based stats. Identities = 42/208 (20%), Positives = 80/208 (38%), Gaps = 14/208 (6%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PE W + + +KK + +D + R + + + F Sbjct: 218 IGQVPEHWDVQRSKFI--------FKKIERKVNEEDQIVTCFRDGQVTL-RANRRTEGFT 268 Query: 63 PKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGFI 122 Q I D+VI +G S + + + ++ I F Sbjct: 269 NALKEHGYQGIRKGDLVIHAMDAFAGAIGIS---DSDGKATPVYSVCLPHDKQKIDVYFY 325 Query: 123 AHFTKSSLYRNKISSLSAGANINN--IKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVD 180 A++ ++ ISSL+ G + + + F + +PIPP EQ+ IA+ LD +++D Sbjct: 326 AYYLRNLALSGFISSLAKGIRERSTDFRYSDFAELLLPIPPYLEQQKIADYLDKQTSKID 385 Query: 181 STKARFEQIPQILKRFRQAVLGGAVNGK 208 A + LK ++ ++ V GK Sbjct: 386 RAIALKTAHIEKLKEYKNVLINDVVTGK 413 >UniRef50_C4KBJ9 Restriction modification system DNA specificity domain protein n=1 Tax=Thauera sp. MZ1T RepID=C4KBJ9_THASP Length = 390 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. 
Identities = 81/417 (19%), Positives = 165/417 (39%), Gaps = 34/417 (8%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLK-DDYLPLIRANNIQNGKFDTTDLVFVPKNLVKES 70 + + V ++ + + + ++P+ + + + +T+ + + K Sbjct: 2 MVNLGDVASI----NPRLSDPLQQTELVSFVPMASLSA-EEARVVSTETRAYSE-VSKGY 55 Query: 71 QKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAF-CGVLRPEKLI-FSGFIAHFTKS 128 D+++A + GK A HLP FG+ V+RP++ + ++ H + Sbjct: 56 TPFRNGDVLVAKITPCFEN-GKIAQAHLPHPNGFGSTEFHVIRPKESLLDGRYLHHLLRQ 114 Query: 129 SLYRNKISSLSAGAN-INNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFE 187 + R + G+ + + IP+P L EQ+ +A LD A + Sbjct: 115 ADIRVEGERRMTGSGGQRRVPATFLSSLRIPLPRLEEQRRVAAILDQADALRAKRRKALA 174 Query: 188 QIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN-ES 246 + ++ + + G V + E++ G S Sbjct: 175 LLDELQRGIFIEMFGDPVT------------SPKGCTAGTLGDGIEEMQYGPRFHNEAYS 222 Query: 247 GVGHPILRISSVR-AGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGL 305 G I+RI+ + AG +D + + +E E ++ L+ GD++F R + VG L Sbjct: 223 PEGIRIVRITDLDAAGSLDFDSMPRMEVDEETRDKFALRAGDVVFARTGAT---VGKVAL 279 Query: 306 LKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDI 365 +K+ + + IR R LPEY S S ++ + + + Q+ SG + Sbjct: 280 IKE-RDPVCIAGAYFIRMRFQSRILPEYAFSVLQSESVQSLIFAQSRQ-AAQQNFSGPGL 337 Query: 366 KSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 + + +P ++ Q RVE + + + + +ALA ++ L S+ +AFRGEL Sbjct: 338 RRLPMPVPSIERQRRFAERVEAVGSE----KSKQLSALALLDELFSSLQHRAFRGEL 390 >UniRef50_B1GZ39 Type I restriction-modification system substrate-binding subunit n=1 Tax=uncultured Termite group 1 bacterium phylotype Rs-D17 RepID=B1GZ39_UNCTG Length = 434 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. 
Identities = 73/439 (16%), Positives = 157/439 (35%), Gaps = 42/439 (9%) Query: 3 AGKLPEGWVIAPVSTVTTLIR--GVTYKKEQAINYLKDDY-LPLIRANNIQNGKFDTTDL 59 G +P+ W + + K ++ + + +P +I N K Sbjct: 15 IGDIPKNWNFVSCRLIVSERNERNKGMKNNNYLSLMANIGVIPYEEKGDIGNKK------ 68 Query: 60 VFVPKNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFS 119 +++ + + D++I + G S + + + + K+I Sbjct: 69 ----PENLEKCKIVYEGDLIINSMNYFIGSYGISKYDGICSPV----YIVLYANTKVIEP 120 Query: 120 GFIAHFTKSSLYRNKISSLSAGA--NINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLA 177 F ++ ++ S G + I I IP+P L EQ+ I LD Sbjct: 121 RFAFRVFENPKFQGVAQSFGNGILEHRRAINWDILKNIKIPVPLLEEQRNILSFLDKKTE 180 Query: 178 QVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTE---------KWRNFEPQHSVFKKLNF 228 ++D+ + E++ ++L+ +RQ+++ V L + +W P K N Sbjct: 181 KIDALISDKEKLIKLLREYRQSIISETVTKGLDKKVQMKHSGIEWIGDIPYDWKVNKFNR 240 Query: 229 ESILTELRNGLSSKPNES--GVGHPILRISSVRAGHVDQND-IRFLECSESEL--NRHKL 283 + + GL+ + N + I + + G + ++ + + R L Sbjct: 241 I--IIRVSTGLNPRNNFKLGDGDCYYVTIKNFKKGKLFLDEKCDRMTKEALNIINERSDL 298 Query: 284 QDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKD-ALPEYIEIFFSSPS 342 + D+LF+ E L N + + R+ KD LP + ++ S Sbjct: 299 KIDDILFSSIGEEAE-----AYLISEHPTNWNINESVFTIRVNKDLVLPNFFYYLIANKS 353 Query: 343 ARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNA 402 N ++ S K I + + V +P +K Q EI ++ D + + + Sbjct: 354 FFNDLLKDAT-GSTFKSIKINSLIEKKVPVPSLKTQKEIANLLDDKTEKIDNLIENITKQ 412 Query: 403 LARVNNLTQSILAKAFRGE 421 + ++ +SI+ +A G+ Sbjct: 413 IKKLQEYRKSIIGEAVTGK 431 Score = 103 bits (257), Expect = 2e-20, Method: Composition-based stats. 
Identities = 29/235 (12%), Positives = 78/235 (33%), Gaps = 24/235 (10%) Query: 212 KWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVDQNDIRFL 271 +W P++ F RN N + ++++ G + + + Sbjct: 13 EWIGDIPKNWNFVSCRLIVSERNERNKGMKNNNYLSL------MANI--GVIPYEEKGDI 64 Query: 272 ECSESEL--NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLIRARLTKDA 329 + E + +GDL+ N + G+ ++ + P ++ TK Sbjct: 65 GNKKPENLEKCKIVYEGDLIINSMNYFIGSYGIS------KYDGICSPVYIVLYANTKVI 118 Query: 330 LPEYIEIFFSSPSARNAMMNCVKTTSGQKG-ISGKDIKSQVVLLPPVKEQAEIVRRVEQL 388 P + F +P + + + I+ +K+ + +P ++EQ I+ +++ Sbjct: 119 EPRFAFRVFENPKFQGVAQSFGNGILEHRRAINWDILKNIKIPVPLLEEQRNILSFLDKK 178 Query: 389 FAYADTIEKQVNNALARVNNLTQSILAKAFRGELTAQWRAENPDLISGENSAAAL 443 D + + + QSI+++ + + + + + Sbjct: 179 TEKIDALISDKEKLIKLLREYRQSIISETVT-------KGLDKKVQMKHSGIEWI 226 >UniRef50_D0WYM6 Putative uncharacterized protein n=1 Tax=Vibrio alginolyticus 40B RepID=D0WYM6_VIBAL Length = 371 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. Identities = 90/412 (21%), Positives = 167/412 (40%), Gaps = 44/412 (10%) Query: 12 IAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLVKESQ 71 + + ++ + T Q + DD + A NGK + + + Sbjct: 1 MVKLDSICRPKQWKTIAASQLL----DDGYVVYGA----NGKI-----GYYSEYTHENPT 47 Query: 72 KISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFIAHFTKSSL 130 ++I + V S P G + + + + ++ + Sbjct: 48 ------VMITCRGATCGNVHIS----EPKAYINGNAMALDDVDPERVDINYLRYCLIDRG 97 Query: 131 YRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKARFEQIP 190 +R+ I +G+ I + IP+PPL QK IAE L+ D + +Q+ Sbjct: 98 FRDVI----SGSAQPQITGKGLSKVQIPLPPLETQKQIAEVLEKA----DQLRKDCQQME 149 Query: 191 QILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGH 250 Q L Q+V +T P+ K L+ + S + + Sbjct: 150 QELNSLAQSVFIDMFGDPVTN------PKGWDLKPLSSLGEVKGGLQVTSKRAAN-PISV 202 Query: 251 PILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQ 310 P LR+++V H++ ++++ + +E+EL R L+ GD+LF +G+ VG + Sbjct: 203 PYLRVANVYRDHLELDEVKEIRVTENELERVLLEKGDVLFVEGHGNANEVGRTAVWNDEV 262 Query: 311 
HQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVV 370 Q ++ + LIR R D PEY+ F +S S + ++ KTTSG +S +IKS V Sbjct: 263 AQ-CVHQNHLIRFRPGADVRPEYVSAFVNSASGKRQLLKMSKTTSGLNTLSTSNIKSIQV 321 Query: 371 LLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGEL 422 L+PP+ EQ + + + A + + ++ +++ KAF+GEL Sbjct: 322 LVPPLLEQDDFLAFL----ASCKAQQVVNDQLSVELDQNFNALMQKAFKGEL 369 Score = 115 bits (289), Expect = 3e-24, Method: Composition-based stats. Identities = 42/210 (20%), Positives = 93/210 (44%), Gaps = 16/210 (7%) Query: 7 PEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNL 66 P+GW + P+S++ + G+ ++A N + +P +R N+ + ++ + Sbjct: 171 PKGWDLKPLSSLGEVKGGLQVTSKRAANPIS---VPYLRVANVYRDHLELDEVKEIRVTE 227 Query: 67 VK-ESQKISPEDIVIAMSSGSKSVVGKSAHQHLPF-ECSFGAFCGVLRPEKLIFSGFIAH 124 + E + D++ G+ + VG++A + +C RP + +++ Sbjct: 228 NELERVLLEKGDVLFVEGHGNANEVGRTAVWNDEVAQCVHQNHLIRFRPGADVRPEYVSA 287 Query: 125 FTKSSLYRNKISSLS-AGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTK 183 F S+ + ++ +S + +N + ++ I + +PPL EQ D LA + S K Sbjct: 288 FVNSASGKRQLLKMSKTTSGLNTLSTSNIKSIQVLVPPLLEQ-------DDFLAFLASCK 340 Query: 184 ARF---EQIPQILKRFRQAVLGGAVNGKLT 210 A+ +Q+ L + A++ A G+L Sbjct: 341 AQQVVNDQLSVELDQNFNALMQKAFKGELN 370 >UniRef50_A6CAE7 Type I restriction enzyme specificity protein n=1 Tax=Planctomyces maris DSM 8797 RepID=A6CAE7_9PLAN Length = 398 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. 
Identities = 66/430 (15%), Positives = 146/430 (33%), Gaps = 44/430 (10%) Query: 8 EGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFVPKNLV 67 + + + + TL+ G YKK++ ++ P++R N F T + + Sbjct: 3 DNYQKCRLGEICTLLNGRAYKKKELLD---SGKYPVLRVGN-----FFTNRSWYYSDLEL 54 Query: 68 KESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEK-LIFSGFIAHFT 126 +++ D++ A S+ + + + ++ ++ + F+ ++ Sbjct: 55 DDNKYCEEGDLLYAWSASFGPRIW------SGPKVIYHYHIWKVQLDESKVNKNFLCYWF 108 Query: 127 --KSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDSTKA 184 S R++ G + ++ S + + +PPL+EQK I LD + K Sbjct: 109 GWDSEKIRSE---QGTGTTMIHVTKGSMEDRELCLPPLSEQKRIVAILDEAFGAIARAKE 165 Query: 185 RFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPN 244 + + + L + + KKL+ + N Sbjct: 166 NAARNLANARELFDSYLNRVFT---------EKGEGWEEKKLSEIAKTFGRGKSRHRPRN 216 Query: 245 E---SGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSLEFVG 301 + G +P ++ +R + + G L T + Sbjct: 217 DKSLYGGEYPFIQTGEIRNANHYITKFTQTYNEKGLAQSKLWPVGTLCITI----AANIA 272 Query: 302 VCGLLKKLQHQNLLYPDKLIRARLT-KDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGI 360 +L + PD +I + A +++E + + + S Q I Sbjct: 273 ETAIL----TFDACIPDSVIGLVCDPEKANVDFVEYLLQN---FKSGLQAEGKGSAQDNI 325 Query: 361 SGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRG 420 + + + P V EQ +IV + + + + L ++ L QS+L KAF G Sbjct: 326 NMGTFERMLFPFPSVSEQEKIVCELNAIAESCNNLSPIYQQKLTALDELKQSLLQKAFTG 385 Query: 421 ELTAQWRAEN 430 +LT++ + Sbjct: 386 QLTSKTKELE 395 >UniRef50_B5JV80 Restriction modification system DNA specificity domain n=1 Tax=gamma proteobacterium HTCC5015 RepID=B5JV80_9GAMM Length = 416 Score = 239 bits (610), Expect = 2e-61, Method: Composition-based stats. 
Identities = 62/431 (14%), Positives = 143/431 (33%), Gaps = 28/431 (6%) Query: 6 LPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTD--LVFVP 63 +P+ W P+S+V E+ ++ + + G + D V Sbjct: 5 IPKDWKRVPLSSV----------SERMKRRNSAGNTNVLTISAVH-GLVNQKDFFNKIVA 53 Query: 64 KNLVKESQKISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPE-KLIFSGFI 122 + + + D S VG + E + + + F Sbjct: 54 SDNLSNYFHLKKGDFAYNKSYSHGYPVGVVRRLEMYDEGVLSPLYICFSMKGEGVDDKFA 113 Query: 123 AHFTKSSLYRNKISSLSAGANIN----NIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQ 178 A+F S + +I+ ++ N N+ F ++ +PPL EQ+ IA L ++ Sbjct: 114 AYFFDSHWFIEEINEIAKEGARNHGLLNVGVGDFFDLDFVLPPLPEQQKIAAILSSVDEV 173 Query: 179 VDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWR-NFEPQHSVFKKLNFESILTELRN 237 ++ T+A+ +++ + Q +L + + P+ L Sbjct: 174 IEKTRAQIDKLKDLKTGMMQELLTKGIGHAAFKDSPVGRIPEGWDVVALGDLGKWKGGGT 233 Query: 238 GLSSKPNESGVGHPILRISSVRAGHVDQNDIRFLECSESELNRHKLQDGDLLFTRYNGSL 297 S + P + +++ + Q + E + SE + + + +L +G L Sbjct: 234 PSKSNKDYWNGNIPWVSPKDMKSEFITQTSDQITEEAISESSTNLVSRDSVLVVVRSGIL 293 Query: 298 EFVGVCGLLKKLQHQNLLYPDKLIRARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQ 357 + L +L + + D ++ + + + + + +K + Sbjct: 294 KHTLPVALASC----DLALNQDMRALSVNSDHSERFVFQYLQANNHKV-LRATLKAGNTV 348 Query: 358 KGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKA 417 + I K ++ PP++EQ +I VE + L + Q+++ Sbjct: 349 ESIDFKVFSDYLIPCPPLEEQEKIALAVEAVGNRIRA----KAAQLDAYVIMKQALMQDL 404 Query: 418 FRGELTAQWRA 428 G++ R Sbjct: 405 LTGKVRVNTRD 415 Score = 132 bits (333), Expect = 3e-29, Method: Composition-based stats. 
Identities = 36/205 (17%), Positives = 80/205 (39%), Gaps = 5/205 (2%) Query: 3 AGKLPEGWVIAPVSTVTTLIRGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTTDLVFV 62 G++PEGW + + + G T K + +P + ++++ T Sbjct: 210 VGRIPEGWDVVALGDLGKWKGGGTPSKSNK--DYWNGNIPWVSPKDMKSEFITQTSDQIT 267 Query: 63 PKNLVKESQKISPED-IVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKLIFSGF 121 + + + S + D +++ + SG A + L F Sbjct: 268 EEAISESSTNLVSRDSVLVVVRSGILKHTLPVALASCDL--ALNQDMRALSVNSDHSERF 325 Query: 122 IAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLLAQVDS 181 + + +++ ++ ++L AG + +I F IP PPL EQ+ IA ++ + ++ + Sbjct: 326 VFQYLQANNHKVLRATLKAGNTVESIDFKVFSDYLIPCPPLEEQEKIALAVEAVGNRIRA 385 Query: 182 TKARFEQIPQILKRFRQAVLGGAVN 206 A+ + + + Q +L G V Sbjct: 386 KAAQLDAYVIMKQALMQDLLTGKVR 410 >UniRef50_Q30S09 Restriction modification system DNA specificity domain n=1 Tax=Sulfurimonas denitrificans DSM 1251 RepID=Q30S09_SULDN Length = 420 Score = 238 bits (609), Expect = 3e-61, Method: Composition-based stats. Identities = 79/433 (18%), Positives = 160/433 (36%), Gaps = 36/433 (8%) Query: 3 AGKLPEGWVIAPVSTVTTLI--RGVTYKKEQAINYLKDDYLPLIRANNIQNGKFDTT-DL 59 G +PE W + + T+ + RG T K + L+ A NI+ G D Sbjct: 13 VGIIPEDWEVVKIKEATSYVDYRGKTPIKT-------GKGIFLVTAKNIKQGFIDYEASS 65 Query: 60 VFVPKNLVKESQK---ISPEDIVIAMSSGSKSVVGKSAHQHLPFECSFGAFCGVLRPEKL 116 FV + E K DI+I +++ +G A + R +K Sbjct: 66 EFVSEVEYHEIMKRGMPKIGDILIT----TEAPLGNVAQIDKE-NIALAQRVIKFRSKKN 120 Query: 117 IFSGFIAHFTKSSLYRNKISSLSAGANINNIKPASFDLINIPIPPLAEQKIIAEKLDTLL 176 + + F+ H+ S+ +++ + ++ G + I+ ++I +PPL EQ+ IA+ L T Sbjct: 121 VKNDFLKHYFLSNRFQSYLYRMAIGTTVLGIQGKELHNMSIVLPPLKEQEKIAQILTTWD 180 Query: 177 AQVDSTKARFEQIPQILKRFRQAVLGGAVNGKLTEKWRNFEPQHSVFKKLNFESILTELR 236 + E + K Q +L G V F +++ + ++ Sbjct: 181 EAITKQTELLEAKELLKKALMQKLLSGEVR---------FSGFSDEWEEARLDKLVFFQE 231 Query: 237 NGLSSKPNESGVGHPILRISSVRAGHVDQND-IRFLECSESE--LNRHKLQDGDLLFTRY 293 G +L + ++ ++ + ++ E+ + +GDLL + Sbjct: 232 GPGVRNTQYRKSGVKLLNVGNLNNNTLNLSSTETYISEEEAYGAYKHFLIDEGDLLISCS 291 Query: 294 
NGSLEFVGVCGLLKKLQHQNLLYPDKLIRAR-LTKDALPEYIEIFFSSPSARNAMMNCVK 352 + E K + L +R + L L EY+ FF + + + Sbjct: 292 GINSESFKKKIAFAKKEDLPLCMNTSTMRFKNLKNKLLLEYLYFFFQTLFFEKQVFGVLT 351 Query: 353 TTSGQKGISGKDIKSQVVLLPPVKEQAEIVRRVEQLFAYADTIEKQVNNALARVNNLTQS 412 S Q IK + LP + EQ +I ++ + AD Q+ + L + ++ Sbjct: 352 -GSAQFNFGPTHIKWFKIKLPTLPEQQKIA----EVLSVADDEINQLKSELEELKLQKKA 406 Query: 413 ILAKAFRGELTAQ 425 ++ + G++ + Sbjct: 407 LMQQLLTGQVRVK 419 Score = 125 bits (316), Expect = 3e-27, Method: Composition-based stats. Identities = 40/222 (18%), Positives = 91/222 (40%), Gaps = 18/222 (8%) Query: 205 VNGKLTEKWRNFEPQHSVFKKLNFESILTELRNGLSSKPNESGVGHPILRISSVRAGHVD 264 + + P+ K+ + + R P ++G G ++ +++ G +D Sbjct: 4 IKQGYKQTKVGIIPEDWEVVKIKEATSYVDYRG---KTPIKTGKGIFLVTAKNIKQGFID 60 Query: 265 Q--NDIRFLECSESEL-NRHKLQDGDLLFTRYNGSLEFVGVCGLLKKLQHQNLLYPDKLI 321 + E E+ R + GD+L T + +G + K N+ ++I Sbjct: 61 YEASSEFVSEVEYHEIMKRGMPKIGDILIT----TEAPLGNVAQIDKE---NIALAQRVI 113 Query: 322 RARLTKDALPEYIEIFFSSPSARNAMMNCVKTTSGQKGISGKDIKSQVVLLPPVKEQAEI 381 + R K+ ++++ +F S ++ + + + GI GK++ + ++LPP+KEQ +I Sbjct: 114 KFRSKKNVKNDFLKHYFLSNRFQSYLY-RMAIGTTVLGIQGKELHNMSIVLPPLKEQEKI 172 Query: 382 VRRVEQLFAYADTIEKQVNNALARVNNLTQSILAKAFRGELT 423 + + D + L L ++++ K GE+ Sbjct: 173 AQIL----TTWDEAITKQTELLEAKELLKKALMQKLLSGEVR 210 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.313 0.169 0.471 Lambda K H 0.267 0.0517 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 2,697,683,922 Number of Sequences: 3077464 Number of extensions: 116617780 Number of successful extensions: 645127 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 3414 Number of HSP's successfully gapped in prelim test: 1020 Number of HSP's that attempted gapping in prelim test: 610834 Number of HSP's gapped (non-prelim): 10933 length of query: 464 length of database: 
1,040,396,356
effective HSP length: 132
effective length of query: 332
effective length of database: 634,171,108
effective search space: 210544807856
effective search space used: 210544807856
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.1 bits)
S2: 96 (41.3 bits)
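The footer values above can be cross-checked against each other and against the scores reported for individual hits. The following is a minimal sketch, not part of the BLAST output: it assumes the second "Lambda K H" line (0.267, 0.0517) holds the gapped parameters, and it uses the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits). The function names are illustrative only. Because this search uses composition-based statistics, E-values in the report can differ slightly from this approximation.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 464
DB_LETTERS = 1_040_396_356
DB_SEQUENCES = 3_077_464
EFFECTIVE_HSP_LENGTH = 132

# Assumption: the second "Lambda K H" line is the gapped parameter set.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0517


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Each database sequence is shortened by the effective HSP length.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Approximate E-value for a raw score in the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective length of query:", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:", space)
    # Raw score 615 is the value in parentheses for the 241-bit hits.
    print("bits for raw score 615:", round(bit_score(615), 1))
    print("approximate E-value:", f"{expect_value(615, space):.1e}")
```

With the footer values, the three effective-length figures reproduce the numbers printed in the report exactly, and a raw score of 615 comes out near 241 bits, in line with the "Score = 241 bits (615)" lines above.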