BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= batch____ (378 letters) Database: uniref50.fasta 3,077,464 sequences; 1,040,396,356 total letters Searching..................................................done Results from round 1 Score E Sequences producing significant alignments: (bits) Value UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 782 0.0 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 377 e-103 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 324 4e-87 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 300 4e-80 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 286 9e-76 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 283 5e-75 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 280 9e-74 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 279 1e-73 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 262 2e-68 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 256 7e-67 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 244 3e-63 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 239 2e-61 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 225 2e-57 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 224 3e-57 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 222 2e-56 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 221 3e-56 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 220 5e-56 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 219 9e-56 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 218 2e-55 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 216 1e-54 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 216 1e-54 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 214 4e-54 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 212 2e-53 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 211 3e-53 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 211 4e-53 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 211 4e-53 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 210 9e-53 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 209 2e-52 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 208 2e-52 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 206 1e-51 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 203 7e-51 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 202 2e-50 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 202 2e-50 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 198 3e-49 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 197 4e-49 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 194 6e-48 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 190 6e-47 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 188 2e-46 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 184 4e-45 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 182 2e-44 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 177 6e-43 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 176 1e-42 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 174 5e-42 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 172 2e-41 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 168 3e-40 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 166 2e-39 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 165 2e-39 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 165 3e-39 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 165 3e-39 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 164 4e-39 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 160 6e-38 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 158 4e-37 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 157 5e-37 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 157 5e-37 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 153 8e-36 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 153 1e-35 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 152 2e-35 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 152 2e-35 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 148 3e-34 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 143 1e-32 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 143 1e-32 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 142 2e-32 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 140 1e-31 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 138 4e-31 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 137 4e-31 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 132 3e-29 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 130 8e-29 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 125 3e-27 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 118 3e-25 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 115 3e-24 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 114 5e-24 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 111 6e-23 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 109 1e-22 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 109 2e-22 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 103 2e-20 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 102 3e-20 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 100 8e-20 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 99 2e-19 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 98 6e-19 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 97 7e-19 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 97 1e-18 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 96 2e-18 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 96 2e-18 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 96 2e-18 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 96 2e-18 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 93 2e-17 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 93 2e-17 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 92 3e-17 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 91 5e-17 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 90 2e-16 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 89 2e-16 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 89 4e-16 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 87 8e-16 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 86 3e-15 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 84 1e-14 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 83 2e-14 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 83 2e-14 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 82 4e-14 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 82 4e-14 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 81 7e-14 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 80 9e-14 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 80 1e-13 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 80 1e-13 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 80 2e-13 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 79 2e-13 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 79 2e-13 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 79 3e-13 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 78 5e-13 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 77 9e-13 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 77 1e-12 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 76 2e-12 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 75 6e-12 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 74 8e-12 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 74 9e-12 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 74 1e-11 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 73 2e-11 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 71 8e-11 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 70 1e-10 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 70 1e-10 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 70 1e-10 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 68 6e-10 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 68 6e-10 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 67 2e-09 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 66 2e-09 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 66 2e-09 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 66 2e-09 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 66 2e-09 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 65 5e-09 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 65 6e-09 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 64 7e-09 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 63 2e-08 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 62 3e-08 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 62 4e-08 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 61 5e-08 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 61 6e-08 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 61 7e-08 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 61 8e-08 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 60 1e-07 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 59 2e-07 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 59 3e-07 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 57 1e-06 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 57 1e-06 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 57 1e-06 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 56 2e-06 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 55 4e-06 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 55 5e-06 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 55 6e-06 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 54 7e-06 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 54 8e-06 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 54 1e-05 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 54 1e-05 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 52 5e-05 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 51 6e-05 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 51 6e-05 UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia ... 51 8e-05 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 50 1e-04 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 50 2e-04 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 49 2e-04 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 49 3e-04 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 49 3e-04 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 49 4e-04 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 48 7e-04 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 48 7e-04 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 47 0.001 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 46 0.003 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 45 0.005 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 45 0.005 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 43 0.022 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 42 0.026 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 42 0.038 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 41 0.057 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 782 bits (2020), Expect = 0.0, Method: Compositional matrix adjust. Identities = 378/378 (100%), Positives = 378/378 (100%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLAGSGLS 378 AAMDRNYLASVLAGSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 377 bits (968), Expect = e-103, Method: Compositional matrix adjust. Identities = 182/372 (48%), Positives = 250/372 (67%), Gaps = 4/372 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+ +SII D RQ KV H L D+L L I AVISG EGWE+I+DFG LD+L++Y F Sbjct: 6 LINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRKYLPFS 65 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 GIP DTI+R+ I P +F +CF WM+ C DVIAIDGKTLR S++K + Sbjct: 66 GGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKKDKSDT 125 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ IA+KI Sbjct: 126 IHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKIAKKIV 185 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 +GGDYL VKG Q RL A + F ++ L PE ++Y EK HGRE+ R+ +V D + Sbjct: 186 DKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMVADA-N 244 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 E+ D FEW GLK L AVSFR+ E+ + + V++YISSA L A+ A R HW V Sbjct: 245 EIGDLVFEWPGLKTLGYAVSFRT---EKDMQTTVAVKFYISSAKLDAKSLLEASRAHWTV 301 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++A Sbjct: 302 ENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQANRSD 361 Query: 366 NYLASVLAGSGL 377 +Y V++G L Sbjct: 362 SYRELVVSGLSL 373 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 324 bits (830), Expect = 4e-87, Method: Compositional matrix adjust. Identities = 170/372 (45%), Positives = 238/372 (63%), Gaps = 8/372 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+E SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G L++L++ G F+ Sbjct: 7 LVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFFK 66 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ A Sbjct: 67 KGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKSA 126 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH++SA++ + +V+GQ KTD+KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 127 IHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKIV 186 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLK---ELNNPEHDSYAISEKSHGREEIRLHIVCD 242 + GDY+ AVK Q +L++ + F + HD + S K HGR E+R + + D Sbjct: 187 TKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWISD 246 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 + L + W L+ + + S R I + E RY+I+S A+ FA A+R H Sbjct: 247 MLSTLGN-PERWASLQSIGMVESERYIDGKTTAE----TRYFITSIAPDAKIFANAVRKH 301 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 302 WAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKAT 361 Query: 363 MDRNYLASVLAG 374 + +Y VL G Sbjct: 362 LQPDYAQKVLNG 373 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 300 bits (769), Expect = 4e-80, Method: Compositional matrix adjust. Identities = 161/372 (43%), Positives = 229/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +H S I D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A I +GGDYL AVK QG L KA + F D I EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGLSDDHVNI-EKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFT-HWEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 286 bits (732), Expect = 9e-76, Method: Compositional matrix adjust. Identities = 159/376 (42%), Positives = 223/376 (59%), Gaps = 10/376 (2%) Query: 3 LKKLMEHISIIPD-YRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +K E+ + D R+T H DIL++ +CA+ISGA + +IE FG + ++ + + Sbjct: 6 VKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQTF 65 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 66 LALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKKN 125 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + +H++SA++T +LVIGQIKT+E SNEITAIPELLN LD+KG +++ DAMGCQ +IA Sbjct: 126 GKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEIA 185 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH---DSYAISEKSHGREEIRLH 238 EKI ++ DY+ A+KG Q +L+++ E F L N E D E S+GREEIR Sbjct: 186 EKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRCA 245 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 246 YATNEIEKIIA-NDEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLKV 299 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 300 VRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATKR 359 Query: 359 RKAAMDRNYLASVLAG 374 A D YL +L G Sbjct: 360 LMAGWDEKYLLKLLNG 375 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 283 bits (725), Expect = 5e-75, Method: Compositional matrix adjust. Identities = 147/368 (39%), Positives = 220/368 (59%), Gaps = 20/368 (5%) Query: 23 EHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCIS 82 +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NGIP HDT RV S ++ Sbjct: 26 KHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNGIPSHDTFGRVFSLLN 85 Query: 83 PAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ +ISA++T + LV+GQ Sbjct: 86 PEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQMISAWATTNGLVLGQ 145 Query: 143 IKTDEKSNEITAIPE---------------LLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 DEKSNEITAIP+ LL +L + G I+T DA+GCQK+I ++I +Q Sbjct: 146 SIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLDAIGCQKEIVKQITEQ 205 Query: 188 GGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE---HDSYAISEKSHGREEIRLHIVCDVP 244 DY+ +K QG L + E F ++N E Y + ++ HGR+E+R + + Sbjct: 206 DADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEGHGRQEVRYYQMLSNV 265 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 E ID ++W L + R + + + RY+ISS + + FA+++R HW Sbjct: 266 AEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLNNNIKLFASSVREHWC 323 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K K G++ K +KA D Sbjct: 324 IENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKTLKVGVKAKRKKAGWD 383 Query: 365 RNYLASVL 372 NYL VL Sbjct: 384 ENYLLKVL 391 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 280 bits (715), Expect = 9e-74, Method: Compositional matrix adjust. Identities = 158/373 (42%), Positives = 224/373 (60%), Gaps = 14/373 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + + H S I D RQ KV + L +ILLLT+CAV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTV-TGVVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS-----EKSHGREEIRLHI 239 + DY+ A+KG QG L K + + + E ++D ++ EKSHGR E R Sbjct: 203 ISKEADYILALKGNQGSLRK--DTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVT 260 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 VC D L W GLK + V V + +I+ ++ + RYYISS AE A AI Sbjct: 261 VCTDIDWL-KADHNWPGLKSI-VMVQYHAILQDKTRAE---TRYYISSMTSDAEHHAKAI 315 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K Sbjct: 316 RDHWGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRH 374 Query: 360 KAAMDRNYLASVL 372 A+ D ++LA ++ Sbjct: 375 IASWDDDFLAEII 387 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 279 bits (713), Expect = 1e-73, Method: Compositional matrix adjust. Identities = 147/375 (39%), Positives = 219/375 (58%), Gaps = 9/375 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++EH S + D R ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 6 FASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQWI 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 NG+P HDT V + + P + +CF+NW + + ++IAIDGKTLR + + Sbjct: 66 ALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGEQ 125 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 IH++SA+++ + LV+GQ DEKSNEITAIPELL +L+++G +++ DAMGCQ IAE Sbjct: 126 CSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIAE 185 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLHI 239 I + GDY+ A+KG QG L + F + EHDSY EK HGR E R + Sbjct: 186 TIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTYW 245 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP-EMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R Q P + RYY+ S + A++FA A Sbjct: 246 TMGQTDYLLGAE-RWAQLKSIGCVESCR----RQPGHPGTLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLA 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 262 bits (669), Expect = 2e-68, Method: Compositional matrix adjust. Identities = 138/373 (36%), Positives = 216/373 (57%), Gaps = 12/373 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+ ++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 31 LLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLELP 90 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + A Sbjct: 91 HGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQCA 150 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 ++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 151 LYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQIC 210 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELN---NPEHDSYAISEKSHGREEIRLHIVCD 242 +Q DY+ +K L ++ F + N EHD Y K H R E R V Sbjct: 211 RQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRY--VWA 268 Query: 243 VPDELIDFTF---EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 +P + + +W GL+ + V R + + + +++Y++S A+ AI Sbjct: 269 IPVAAMGELYQQQQWHGLQTIVVVERIRHLWNKTTHD----IQFYLTSLPPNAQFLCHAI 324 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM+ Sbjct: 325 RTHWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMK 384 Query: 360 KAAMDRNYLASVL 372 +AAM+ NY+ +VL Sbjct: 385 QAAMNNNYMMTVL 397 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 256 bits (655), Expect = 7e-67, Method: Compositional matrix adjust. Identities = 139/380 (36%), Positives = 226/380 (59%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L+EH I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQGQSP 126 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 I +SA++ +SLV+GQI+ +K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 127 RVI--VSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGTQGRLN---KAF-EEKFPLKELNNP-EHDSYAI-----SEKSHGRE 233 I + +Y+ A+KG QG+ + KA+ E+ + P E ++ A+ +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + ++ P + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLADRQ-QWAGLRSVGVVESVRQV---GQQAPTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLA 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 244 bits (624), Expect = 3e-63, Method: Compositional matrix adjust. Identities = 133/379 (35%), Positives = 222/379 (58%), Gaps = 20/379 (5%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++E+ + + D R+ +H L D+L++ + AVI+GA+G I + E H+++LK + Sbjct: 13 ILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSRLELP 72 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINW---MRDCHSSDD--KDVIAIDGKTLRHSYDKS 120 +G+P HDTI R+++ + P F +CF W MR ++DD +++IAIDGKTLR S+D+ Sbjct: 73 SGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRSHDRG 132 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + G + + SA++ + +GQ+ +KSNEI PEL+ +D++ I+T DA GCQ+D+ Sbjct: 133 KGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGCQRDV 192 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN------PEHDSYAISEKSHGREE 234 AEKI GDY+ A+K Q RL++ + + N+ H+ A K HGR + Sbjct: 193 AEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEA---KGHGRLD 249 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R + +PDE + +W+GLK + VA+ I+++ RYYISS A++ Sbjct: 250 KRFYYQVKLPDE-VPAGEDWRGLKTIGVAIR----ISQENGRETCDTRYYISSLKPDAKQ 304 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K ++ + Sbjct: 305 FAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKSKESVV 364 Query: 355 RRKMRKAAMDRNYLASVLA 373 R+ R A + N+LA +L Sbjct: 365 MRR-RMAGWNVNFLAEILG 382 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 239 bits (609), Expect = 2e-61, Method: Compositional matrix adjust. Identities = 132/360 (36%), Positives = 206/360 (57%), Gaps = 10/360 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ LD+L+++ F+ Sbjct: 3 FITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAFK 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ A Sbjct: 63 EGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFD-GDRKTA 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 122 LHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAIN 181 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH---DSYAISEKSHGREEIRLHIVCD 242 +GGDY+ VK QG+L F + P+ +S ++ HGR E R ++ Sbjct: 182 AKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQLP 241 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 + L + W +K + R + + KE T YYISS ++ + A AIR+H Sbjct: 242 ITPWLTQ-SQGWTNIKPVIEVTRKRYL---KDKETSETA-YYISSLEVNLPQIAKAIRSH 296 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN HW LD+ EDD +IRRG+A E + R A+N L K ++ K+++AA Sbjct: 297 WSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMN-LARLSPIKDSMKGKLKQAA 355 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 225 bits (574), Expect = 2e-57, Method: Compositional matrix adjust. Identities = 138/342 (40%), Positives = 192/342 (56%), Gaps = 13/342 (3%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP 83 H ++L++ I AV+S + EDI +G D+L+Q+ NG+ +T R+ + P Sbjct: 28 HDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFLVLLNGVASEETFLRIFRALDP 87 Query: 84 AKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 +F F W+ + + +DGKT+R S S AIH++SAF+T +V+GQ Sbjct: 88 KQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGGESAIHMVSAFATELGVVLGQE 144 Query: 144 KTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLN 203 K KSNEITAIPELL L I G ++T DAMGCQK+IA +I QGGDYL AVKG Q L Sbjct: 145 KVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIARQITDQGGDYLLAVKGNQPTLL 204 Query: 204 KAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A E +F + + + + D + SHGR I I +P E I +W KK+ Sbjct: 205 DAIETEF-IDQYQSDDVDRHRQVHPSHGR--IVAQIASVLPAEGIVDLADWPECKKIARV 261 Query: 264 VSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCK 323 S R + E ++ RYYISS +LTAE+ A A+R HW +EN+LHW LDV ED Sbjct: 262 DSLRKV---GNHESKLERRYYISSRELTAEQLAAAVRAHWGIENRLHWVLDVSFGEDAST 318 Query: 324 IRRGNAAELFSGIRHIAINIL---TNDKVFKAGLRRKMRKAA 362 IR+GNA + S ++ I +N++ T DK K LR K + AA Sbjct: 319 IRKGNAPQNLSLLKKIVLNLIRLDTADKT-KTSLRLKRKCAA 359 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 224 bits (572), Expect = 3e-57, Method: Compositional matrix adjust. Identities = 138/379 (36%), Positives = 201/379 (53%), Gaps = 24/379 (6%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M K L++++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV--IAIDGKTLRHSYD 118 + GIP HDT R+ + + PA F W+ D DDK V +A+DGK LR + Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMG-DDKLVGQLAVDGKALRATA- 118 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R A+H+++ +ST + +GQ K +KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 119 KGRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQV 178 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF-PLKELNNPEHDSYAISEK---SHGREE 234 IA+ I K+ GDYL AVK Q LN +E+F + N + + +E+ HGR+E Sbjct: 179 KIADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKE 238 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-----VRYYISSAD 289 R V V DE + +WK ++IIA Q + E VR+YISS Sbjct: 239 HRRCWVLMV-DESMPVCQQWKA----------KTIIAVQAERIENGKGYDFVRFYISSRA 287 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 L A A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K Sbjct: 288 LDATSALKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKS 347 Query: 350 FKAGLRRKMRKAAMDRNYL 368 + K R ++ YL Sbjct: 348 RNLSMANKRRLCCLNEQYL 366 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 222 bits (565), Expect = 2e-56, Method: Compositional matrix adjust. Identities = 121/363 (33%), Positives = 189/363 (52%), Gaps = 9/363 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L +H+S++ D R H L D+L L + AV SG +GW +I+ FGE L++L+++ F Sbjct: 3 LFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPFA 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP TIAR++ + P C +W+ D ++ K +IAIDGKTLR + Sbjct: 63 NGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLG--CNT 120 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 121 LHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAIV 180 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 + GDY+ VK Q L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 181 ARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIPS 238 Query: 246 ELIDFTFE-WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 +L E W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 239 KLSPKLQEKWPSVKTLIAVERHRKI----GNKTSIETSFYLSSHDIDPEYIATAVRGHWR 294 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLLS 354 Query: 365 RNY 367 Y Sbjct: 355 DEY 357 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 221 bits (563), Expect = 3e-56, Method: Compositional matrix adjust. Identities = 133/379 (35%), Positives = 206/379 (54%), Gaps = 14/379 (3%) Query: 1 MELKKLMEHISI---IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDF 57 M++ KL + + + + D+R + H+LS++L + +CAV+SGA+ +E+I +G + + Sbjct: 1 MDIGKLADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPW 60 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHS 116 L+ + + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + Sbjct: 61 LRGFLRLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRT 120 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K+ +H++SAF+ +V+GQ T EKSNEITAIPELL +LDI+G I+T DAMG Sbjct: 121 TSKAAA-APLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGT 179 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRL--NKAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 Q IA I+++G Y+ VK +L + F + P L ++ + HGR E Sbjct: 180 QTKIARAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLT--PSSTHETTSTGHGRIE 237 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 +R D D L WK + V R++ E YYISS AE+ Sbjct: 238 VRRCTAFDATDRLHKAE-AWKDVASFAVVERVRTVGERTSTERV----YYISSLPADAER 292 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A AIR+HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K + Sbjct: 293 IAVAIRSHWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSI 352 Query: 355 RRKMRKAAMDRNYLASVLA 373 + K AA + A++L Sbjct: 353 KTKRLLAATSDEFRAALLG 371 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 220 bits (561), Expect = 5e-56, Method: Compositional matrix adjust. Identities = 141/370 (38%), Positives = 202/370 (54%), Gaps = 15/370 (4%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++ I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGTQGRLNKAFE---EKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 KQ DY+ A+KG Q L K + E+F E+ + E +H R E R V Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRR--VFQ 251 Query: 243 VPDELIDFT----FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 VP E + FT +W GL+ L V S R + + E RY++SS A FA Sbjct: 252 VPVEQV-FTPKQGRDWAGLRSLVVIQSQRCLWNKDTTE----TRYFLSSLSTDAATFAHY 306 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 IR HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K Sbjct: 307 IRAHWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKR 365 Query: 359 RKAAMDRNYL 368 +A +D ++ Sbjct: 366 YRAGLDDQFM 375 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 219 bits (559), Expect = 9e-56, Method: Compositional matrix adjust. Identities = 130/344 (37%), Positives = 192/344 (55%), Gaps = 11/344 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S Y Q +H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQ----KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFN-P 115 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK I Sbjct: 116 ETQSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEIRLHI 239 AEKI ++ GDY+ +K + E F + PE ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V D L EWKG+K + RS + KE + V +YISS D+ + A + Sbjct: 236 KLKVSDWLSKAE-EWKGIKSVLEVCRKRS---DNGKESQEKV-FYISSLDVDIQILAKCV 290 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 R HW VENK HW LDVV ED+C + AE + +R +A+N+ Sbjct: 291 RGHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNL 334 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 218 bits (556), Expect = 2e-55, Method: Compositional matrix adjust. Identities = 138/382 (36%), Positives = 208/382 (54%), Gaps = 25/382 (6%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ L+EH S I D R ++ H L +ILLL +C ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGR-ADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD----IKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q +K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 IA I+ QG DYL AVK Q L E F + + + HD +K HGR E R H Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHDL----DKGHGRVEER-H 245 Query: 239 IVCDVPDELIDFTFEWKGLKKL-----CVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 + + + T + G +L V V + IA++ + RY+ISSA LTAE Sbjct: 246 VSVIREVDWLSGTRRFPGEMRLPDVAAIVRVHTTAHIADRTR---TDTRYFISSAPLTAE 302 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL--TND-KVF 350 A A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ ND K Sbjct: 303 HAADAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQKSL 362 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 K RRKM A +YLAS+L Sbjct: 363 KT--RRKM--AGWSDDYLASLL 380 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 216 bits (550), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 139/349 (39%), Positives = 196/349 (56%), Gaps = 17/349 (4%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP 83 H +IL++ I AV+S + EDI + T +L+++ +NGIP +T R++ + P Sbjct: 19 HDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLKNGIPSEETFLRILRALDP 78 Query: 84 AKFHECFINWMRDCHS--SDDKDV---IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 +F F W+ SDD + IAIDGKT+R S S AIH++SAF+T L Sbjct: 79 KQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GSGGESAIHMVSAFATELGL 136 Query: 139 VIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 V+GQ K KSNEITAIPELL L IKG ++T DAMGCQK IA++I + GDYL VKG Sbjct: 137 VLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSIAKQIVAKKGDYLLMVKGN 196 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q +L +A E F + + D + E+ HGR ++ V ++D +W Sbjct: 197 QPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASVLSAKG-IVD-PADWPK-- 251 Query: 259 KLCVAVS-FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVM 317 CV + S+ K+ ++ RYYISS L+AE+ A A+R HW VEN+LHW LDV Sbjct: 252 --CVTIGRIDSMRVVGDKQSDLERRYYISSRALSAEQLAAAVRAHWGVENRLHWILDVSF 309 Query: 318 NEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKAAMD 364 +ED + + NA + S +R IA+ I+ DK K+ LR K + AA D Sbjct: 310 SEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKRKGAAWD 358 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 216 bits (549), Expect = 1e-54, Method: Compositional matrix adjust. Identities = 130/378 (34%), Positives = 199/378 (52%), Gaps = 16/378 (4%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L E I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 16 LREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLALP 75 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINW----MRDCHS----SDDKDVIAIDGKTLRHSY 117 NGIP HDT +V S + P +F E F W +R S S K VIAIDGK LR + Sbjct: 76 NGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRGAV 135 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 DK + I + A+++ SL +GQ+K +KSNEI A+PELL ML +KG I+T DAMGCQ Sbjct: 136 DKGQAPAVI--VGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMGCQ 193 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE-LNNPEHDSYAISEKSHGREEIR 236 +++A KI +Q GDY+ A+K Q L++ E L E + + HGR E+R Sbjct: 194 REVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHEVR 253 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 V + + + +W GL+ + R++ + + RY+ISS A A Sbjct: 254 RCWVSEEVECWLQGAEKWAGLRSVAAVECERTVAGQTTVQR----RYFISSLKADAALIA 309 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K + Sbjct: 310 ASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKSVN 369 Query: 356 RKMRKAAMDRNYLASVLA 373 ++ +A + +YL ++L Sbjct: 370 QRRFEAGLSTDYLQTLLG 387 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 214 bits (546), Expect = 4e-54, Method: Compositional matrix adjust. Identities = 133/355 (37%), Positives = 192/355 (54%), Gaps = 13/355 (3%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+++LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KG QG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKEL--NNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + F +L HD I HGR E R V D L + W GL Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTCI---GHGRIEERTCQVADASAWLTEQHSGWAGLAS 237 Query: 260 LCVAVSFRSIIAEQKKEPEMT--VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVM 317 + ++ R+ KK E++ R YISS + A R+HW VEN LHW+LDV Sbjct: 238 IAAVIATRT----DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTF 293 Query: 318 NEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 294 REDECRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 212 bits (539), Expect = 2e-53, Method: Compositional matrix adjust. Identities = 116/373 (31%), Positives = 200/373 (53%), Gaps = 14/373 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+EH++++ + R +H L D++ L I A++SGAEGW DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS--EKSHGREEIR--LHIVC 241 ++ + VK Q +L +A + +F + L + + + + E HGR+E R + Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQF--QSLFDAQKEKIVVEHKESGHGRQEERYVFQLKA 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 +P EL T +W ++ + RS + + YY+SS + IR Sbjct: 246 KLPPEL---TEKWPTIRSIIAVERHRSA----NGKGTVDTSYYVSSLSPKHKLLGHYIRQ 298 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN H+ LDVV NED +I +A E + R +NI+ R K+++A Sbjct: 299 HWRIENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRA 358 Query: 362 AMDRNYLASVLAG 374 + +Y A + G Sbjct: 359 GWNDDYRAQLFFG 371 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 211 bits (538), Expect = 3e-53, Method: Compositional matrix adjust. Identities = 141/372 (37%), Positives = 198/372 (53%), Gaps = 33/372 (8%) Query: 23 EHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCIS 82 +H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP HDT R S + Sbjct: 33 KHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTFNRFFSALD 92 Query: 83 PAKFHECFINWMRD---CHSSDDKDVIAIDGKTLRHSYD-----KSRRRGAI-------- 126 P KF E + W++ C+S IAIDGKT+R +Y+ + R++G + Sbjct: 93 PLKFEESYRQWVQSILKCYSGH----IAIDGKTIRGAYESEQDKRHRKQGVLPDSNTGKY 148 Query: 127 --HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 HVISAF+T + +GQ+ T EK NEI IPELL+ML IK IIT DA+GCQ+ IAEK+ Sbjct: 149 KLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRTIAEKV 208 Query: 185 QKQGGDYLFAVKGTQGRLNK---AFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 K GDY+F VK Q +L + + E K D Y E+ HGR E R+ C Sbjct: 209 IKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKG-TTVRFDKYETHEEGHGRNESRICYCC 267 Query: 242 DVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 + P L D +WK ++ + R+ K + R +ISS + A+K R Sbjct: 268 NDPGFLGADIRKKWKNIQSFGYIENTRNT----NKGTTVEKRCFISSLEPDAQKILKNSR 323 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L N+K + + RK Sbjct: 324 EHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNKR-EIPINRKRLI 381 Query: 361 AAMDRNYLASVL 372 A D +L ++ Sbjct: 382 AGWDNEFLWELI 393 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 211 bits (537), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 126/374 (33%), Positives = 185/374 (49%), Gaps = 11/374 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + I D RQ KV H++ ++L++ C+ + E + D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDLEGRH----IAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI EKSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE---LNNPEHDSYAISEKSHGREEIRLHI 239 +I G DY+ A+K R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ VA R + P V Y++ S E+ A + Sbjct: 237 ITEELD-WYHKSWKWAGLQS--VAQVRRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHPA-KVSLRRKRK 352 Query: 360 KAAMDRNYLASVLA 373 A MD + +L Sbjct: 353 LATMDPAFRLQMLG 366 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 211 bits (536), Expect = 4e-53, Method: Compositional matrix adjust. Identities = 132/374 (35%), Positives = 204/374 (54%), Gaps = 19/374 (5%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + +E ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTDEKSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNK--AFEEKFPLKELNNPE---HDSYAIS-EKSHG 231 K+IA KI ++GGDY+ AVKG Q +L + L++ + E YA++ EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTF--EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 R E R C + ++L F +W+G+ + + R + + K + S + Sbjct: 241 RIEKR---ECYLSNDLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKE 297 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L Sbjct: 298 AQAKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDT 357 Query: 350 FKAGLRRKMRKAAM 363 K G+R K + + Sbjct: 358 CKCGMRSKRKLCGL 371 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 210 bits (534), Expect = 9e-53, Method: Compositional matrix adjust. Identities = 139/378 (36%), Positives = 203/378 (53%), Gaps = 19/378 (5%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-QYGDFENGIPVH 71 I D R K HK+ I+ ++I AVI GA+ W +IE+FG + F K + D E IP H Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPDLE-FIPSH 70 Query: 72 DTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSRRRGA 125 DT R S I P F F NW++ + K V+AIDGK +R + + Sbjct: 71 DTFNRFFSIIKPEYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTTGKEGFK 129 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+S ++ + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI + I Sbjct: 130 LWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDITQTII 189 Query: 186 KQGGDYLFAVKGTQGR---LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 ++ +Y+ A+K + + L K + + ++ + HGR E R V Sbjct: 190 ERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEKRTCTVVS 249 Query: 243 VPDELIDFTFEWK--GLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAI 299 +++ F+ K GLK + S R+I+A + E VRYY++S D T E+ A+AI Sbjct: 250 Y-GSIMEKMFKKKLVGLKSIVGIKSERTIVATGEYTQE--VRYYVTSLDNTKPEEIASAI 306 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K + K Sbjct: 307 RQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGSMNLKRL 365 Query: 360 KAAMDRNYLASVLAGSGL 377 KA D YL+ +L + Sbjct: 366 KAGWDEKYLSQLLQNNNF 383 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 209 bits (531), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 123/366 (33%), Positives = 191/366 (52%), Gaps = 11/366 (3%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + G+P Sbjct: 23 IKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVNMRCGVPSTL 82 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T ARV S I P +F C WM D+I +DGK+L S + + + A H+++A+ Sbjct: 83 TFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQKATHIVNAY 142 Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 + +G+++ +KSNEI AIP LLN L+++G II+ DAMG QK IA I+ + DY+ Sbjct: 143 LPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANLIRLKQADYV 202 Query: 193 FAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEK---SHGREEIRLHIVCDVPDELI- 248 A+K R + E F + + + Y E HGR E R + C +P Sbjct: 203 LALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSY--CVLPMMYFH 260 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA-EKFATAIRNHWHVEN 307 + W+ L+ + S R + E E RYYI+S + + AIR HW +EN Sbjct: 261 KYKKYWRDLQAIVRVQSKR----HKGNEIETATRYYITSLPFAEHRRMSQAIRQHWAIEN 316 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNY 367 +LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K +AA+ Y Sbjct: 317 QLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRIQAALSTRY 376 Query: 368 LASVLA 373 L V+ Sbjct: 377 LRKVVG 382 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 208 bits (530), Expect = 2e-52, Method: Compositional matrix adjust. Identities = 126/372 (33%), Positives = 191/372 (51%), Gaps = 19/372 (5%) Query: 13 IPDYR-QTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVH 71 +PD R +T H L+DIL + CAVI+GAEGWEDI ++G + F +++ + +NG+P H Sbjct: 12 LPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLELKNGVPSH 71 Query: 72 DTIARVVSCISPAKFHECFINWMRD-CHSS-------DDKDVIAIDGKTLRHSYDKSRRR 123 DT RV + + P F + F W + C ++ D +A+DGK+ R S K Sbjct: 72 DTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRSA-KPTFS 130 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G +H++ + +L++GQ E +EIT ++L LD+ G ++T DA GCQ + E Sbjct: 131 GCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGCQTETLEV 190 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFP-LKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 I+ +GG+Y+ VKG Q L A F E D + +HGR E R V Sbjct: 191 IRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEERNVTVVH 250 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 PD L W G+ + + R + + K E T YY+SS + A + A IR H Sbjct: 251 DPDGL---PAGWAGVGSVALVCRDRQV---KGKANESTAHYYLSSLRVGAAELAGYIRGH 304 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 WH+E+ +HW LDV ED+ + R G+A IR +A+++L K + + +A Sbjct: 305 WHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAGK-KGSIHTRRLRAG 362 Query: 363 MDRNYLASVLAG 374 D Y+A VL G Sbjct: 363 WDDQYMAQVLQG 374 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 206 bits (523), Expect = 1e-51, Method: Compositional matrix adjust. Identities = 115/341 (33%), Positives = 176/341 (51%), Gaps = 12/341 (3%) Query: 38 ISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 -----PEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 P D++ + HGR +R + D + W L ++ + R I Sbjct: 184 GAAGRPVFDAF----EGHGR-LVRRRVFVDAAATALAPLSGWPDLSRVLAVETLRGIPGT 238 Query: 273 QKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 + +RY+++S IR HW VEN LHW L+V EDD ++R AA Sbjct: 239 GTVVAD--IRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARN 296 Query: 333 FSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 F+ +R IA+N++ D+ +A LR + +KAA D +Y+ ++A Sbjct: 297 FALVRKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIA 337 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 203 bits (517), Expect = 7e-51, Method: Compositional matrix adjust. Identities = 130/353 (36%), Positives = 194/353 (54%), Gaps = 29/353 (8%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK ENGIP HD Sbjct: 16 VKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVLQLENGIPSHD 75 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSD---------DKDVIAIDGKTLRHSYDKSRRR 123 T+ RV + + P E W SD K ++AIDGKT+R + S ++ Sbjct: 76 TLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTIRG--NGSAKQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A+H+++A++T + GQ+ T+EKSNEITAIPELL+M+ +KG +++ DAMG QK IA+K Sbjct: 134 KALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDAMGTQKAIADK 193 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDV 243 I K+ DY AVK Q L E+ P E++ D Y EK+HG+ E R + V Sbjct: 194 IIKKKADYCLAVKENQKTL---LEDIVPFFEMSQEADDHYHTVEKAHGQIETRAYEVIHD 250 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 L E+ ++ + A R + + +E E + RY+I S ++A++ +R HW Sbjct: 251 VSWLRKTHPEFGHIQSIGRA---RIHLDKNGQESEES-RYFILSCQVSAKELCDYVRGHW 306 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 +E+ +HW LDVV ED K + +A N+ DK A L++ Sbjct: 307 QIES-MHWLLDVVFREDANKTLN----------KQLAFNLNVMDKFCLAVLKQ 348 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 202 bits (514), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 117/373 (31%), Positives = 191/373 (51%), Gaps = 11/373 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H+ I D R EH + DI L + AVISGA+ W +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLK-GAKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC--DV 243 K+GGD + VKG Q +L +A + +F NNP+ + + + K HGR E R+ C ++ Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 P E+ +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 PAEI---KMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHW 292 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 293 QTENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHPA-KTSQTQKFNRACW 351 Query: 364 DRNYLASVLAGSG 376 ++ ++ G+G Sbjct: 352 SDDFREEIIFGTG 364 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 202 bits (514), Expect = 2e-50, Method: Compositional matrix adjust. Identities = 131/368 (35%), Positives = 199/368 (54%), Gaps = 26/368 (7%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN--PEHDSY-AISEKSHGREEIR 236 IAEKI+ + DY+ ++K QG L + E F E E Y EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK---KEPEMTV--RYYISSADLT 291 + + + + WKGLK SII E+K KE + + RY+ISS Sbjct: 239 EYYQTE-KIKWLSQKKAWKGLK---------SIIMERKTLEKEGKRLIEYRYFISSLKEE 288 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-- 349 E + A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V Sbjct: 289 IETVSRAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSR 347 Query: 350 FKAGLRRK 357 K +R+K Sbjct: 348 HKLSMRKK 355 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 198 bits (503), Expect = 3e-49, Method: Compositional matrix adjust. Identities = 126/375 (33%), Positives = 193/375 (51%), Gaps = 13/375 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY--GD 63 L+E S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVID-GVVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKF---PLKELNNPEHDSYAISEKSHGREEIRLHI- 239 I +GGDY+ VK Q L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I + + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLA 373 A + +Y S++A Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 197 bits (502), Expect = 4e-49, Method: Compositional matrix adjust. Identities = 104/196 (53%), Positives = 134/196 (68%), Gaps = 13/196 (6%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L L +H + + D RQ KV +KL D+L L + AVISGAEGWE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TDEKSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVK 196 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVK 183 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 194 bits (492), Expect = 6e-48, Method: Compositional matrix adjust. Identities = 134/363 (36%), Positives = 184/363 (50%), Gaps = 19/363 (5%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K IP Sbjct: 8 AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKARLPGLVSIPS 67 Query: 71 HDTIARVVSCISPAKFHECFINWMRD-CHSSDDKDVIAIDGKTLRHSYDKSRR-----RG 124 HDT++R S + F ECF W+ D C V+AIDGK + + DKS R Sbjct: 68 HDTLSRFFSILDIDWFEECFRLWVDDICRRI--PGVVAIDGKAICDNPDKSSNSKNGVRS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++++SA+S + + +GQ K +EKSNE AIPEL+ LD++ IIT DA+GCQK I + I Sbjct: 126 KLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIGCQKSITKLI 185 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN---PEHDSYAISEKSHGREEIRLHIVC 241 + DY+ K L E F L E + Y K HGR E R VC Sbjct: 186 IENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGRSEYR-ECVC 242 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 L F W G+K L + S R + KE M RYYISS + +IR Sbjct: 243 ISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPIIILKSIRP 299 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L + K G+ K + Sbjct: 300 HWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQSDI-KLGMAGKRKAC 357 Query: 362 AMD 364 D Sbjct: 358 GWD 360 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 190 bits (483), Expect = 6e-47, Method: Compositional matrix adjust. Identities = 117/373 (31%), Positives = 184/373 (49%), Gaps = 26/373 (6%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R H L D+L + + A I GAE D F +++ + G+P HD Sbjct: 12 LPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T +RV + P F CF ++ D D V+AIDGKTLR S+D++ R A+HV+SAF Sbjct: 72 TFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVVSAF 130 Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 ++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GGD+L Sbjct: 131 ASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGGDWL 190 Query: 193 FAVKGTQGRLNKAFEEKF--PLKELNNPEHDSYAISEKSHGREEIRLHIVC-DV------ 243 F +K + L E F P L P + ++ HGR E+R H V DV Sbjct: 191 FPLKDNRPALRAEVERYFADPATVLAVP----HVTTDADHGRIEVRRHWVSHDVAWLASD 246 Query: 244 ---PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 PDE + GLK L + + T Y+SSA L + A A+R Sbjct: 247 RRFPDEAV-----LPGLKILGL---VERTVTSPDGRTTATRTLYLSSAALEPKTLARAVR 298 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++ Sbjct: 299 AHWSIEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANN-QDSIRLRRKR 357 Query: 361 AAMDRNYLASVLA 373 A +Y ++L Sbjct: 358 AGWSDDYARTILG 370 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 188 bits (478), Expect = 2e-46, Method: Compositional matrix adjust. Identities = 110/307 (35%), Positives = 164/307 (53%), Gaps = 7/307 (2%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L + E +PD R + H LS++L + +CAV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 TS-GPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A I+ +G DY+ VK L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + E YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTVDGKTSVE----RHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 184 bits (468), Expect = 4e-45, Method: Compositional matrix adjust. Identities = 106/260 (40%), Positives = 151/260 (58%), Gaps = 5/260 (1%) Query: 12 IIPDYRQ-TWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 +IPD R+ T H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NGIP Sbjct: 20 LIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANGIPS 79 Query: 71 HDTIARVVSCISPAKFHECFINWMRDCH-SSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+H++ Sbjct: 80 HDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-ALHLL 138 Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + GG Sbjct: 139 HAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITEAGG 198 Query: 190 DYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELID 249 DY+ A+KG Q L+ + +P+ + A+ EK HGR E R V D D L Sbjct: 199 DYVLALKGNQSALHDDVRLFMETQADRHPQGQAEAV-EKDHGRIETRRIWVNDEIDWLTQ 257 Query: 250 FTFEWKGLKKLCVAVSFRSI 269 +W GLK L + S R + Sbjct: 258 KP-DWPGLKTLVMVESRREL 276 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 182 bits (461), Expect = 2e-44, Method: Compositional matrix adjust. Identities = 113/374 (30%), Positives = 198/374 (52%), Gaps = 17/374 (4%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + ++K +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKS---HGREEIR--LHIV 240 + D++ +KG Q L A + F ++P + AISE++ HGR+E R + I Sbjct: 182 SKKSDFVIQIKGNQPALLAAVKAAF-AACYDSP---ALAISEQTNTGHGRKECRRVMQIE 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 ++P EL + +W ++ L S R++ + + R+Y+SS + + A IR Sbjct: 238 GNLPPELSE---KWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIR 290 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN+LHW LDVV ED+ + + A+ + A++++ + K L K + Sbjct: 291 AHWAIENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQS 350 Query: 361 AAMDRNYLASVLAG 374 AA D + + +L G Sbjct: 351 AAWDPAFRSELLFG 364 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 177 bits (449), Expect = 6e-43, Method: Compositional matrix adjust. Identities = 87/207 (42%), Positives = 133/207 (64%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKF 210 I K+ DY+ AVK Q +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 176 bits (446), Expect = 1e-42, Method: Compositional matrix adjust. Identities = 84/99 (84%), Positives = 90/99 (90%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVLAG+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 174 bits (441), Expect = 5e-42, Method: Compositional matrix adjust. Identities = 123/371 (33%), Positives = 189/371 (50%), Gaps = 18/371 (4%) Query: 15 DYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRHSYDKSRRRGAIHVISAFS 133 RV+S ++ + E + + + S D +I++DGKT+R + K+++ +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRGNRGKNQK--PVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ +EKSNEI AIP+LL +DI+ I+T DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGTQGRLNKAFEEKFP----LKELN-NPEHDSYAISEKSHGREEIRLHIVCDVPDELI 248 AVKG Q L F L+EL N ++ Y EKS G+ E+R + V L Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQY--YQTVEKSRGQIEVREYWVSSDIKWLC 254 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 +W L+ + + R+ I ++ + RY+I S FA +R HW +E+ Sbjct: 255 QNHPKWHKLRGIGMT---RNTI-DKDGQLSQENRYFIFSFKPDVLTFANCVRGHWQIES- 309 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL--RRKMRKAAMD-R 365 +HW LDVV +ED + AA + IR + + L K L RRK R ++ Sbjct: 310 MHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKKDLSYRRKQRYISVHLE 369 Query: 366 NYLASVLAGSG 376 +YL + G Sbjct: 370 DYLVQLFGERG 380 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 172 bits (435), Expect = 2e-41, Method: Compositional matrix adjust. Identities = 90/233 (38%), Positives = 140/233 (60%), Gaps = 3/233 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K E++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKF---PLKELNNPEHDSYAISEKSHGREEI 235 +G DY A+KG Q L + +E F E EH + EK R E+ Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEV 241 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 168 bits (425), Expect = 3e-40, Method: Compositional matrix adjust. Identities = 106/371 (28%), Positives = 176/371 (47%), Gaps = 19/371 (5%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E +PD R H L++IL + + A + GA D+ F + Sbjct: 5 MDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDVL 63 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMR----DCHSSDDKDVIAIDGKTLRHSYD 118 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y+ Sbjct: 64 VLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGYE 123 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 R +++A++ + + ++ +NE +L+ +L +KG ++T DA+ C + Sbjct: 124 SGRSHMPPVMVTAWAAQTRMALANVQA-PNNNEAAGALQLIELLQLKGCVVTADALHCHR 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 +AE I+ +GGDY+ AVK Q L + + K ++ S + HGR+E R Sbjct: 183 GMAEAIKARGGDYVLAVKDNQPALMR--DAKAAIRAATRQGKPSTITVDAGHGRKEKRRA 240 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV-RYYISSADLTAEKFAT 297 +V VP D F GLK + S K+ + TV RY++ S + Sbjct: 241 VVAAVPQMAQDHDFA--GLKAVARITS--------KRGTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYL 368 +++A + +L Sbjct: 351 LKRAGWNDTFL 361 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 166 bits (419), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 109/365 (29%), Positives = 180/365 (49%), Gaps = 19/365 (5%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R + V H L ++L++ +V+ G+ ++ FG F + + ++ IP HD Sbjct: 22 VPDPRAS-NVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRNFLKLKHAIPSHD 80 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 T + V I P F + D D D+IAIDGK LR + D ++SA Sbjct: 81 TFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDPGESARTRMMVSA 140 Query: 132 FSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 +++ L + + D + E++A E L ++D++GK++T DA+ C + I GGD+ Sbjct: 141 YASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRRTVAAINAGGGDW 199 Query: 192 LFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKS-HGREEIRLHIVCDVPDELIDF 250 A+KG Q L F ++P A++E + HGR+E R +V V + + Sbjct: 200 CLALKGNQESLLSDARGCFSKGHKSDPT----AVTENTGHGRKETRKAVV--VSAKALAE 253 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKEPEMT--VRYYISSADLTAEKFATAIRNHWHVENK 308 E+ GLK F I A ++ ++T RY+ S T E A+R+HW +EN Sbjct: 254 YHEFPGLK------GFGRIEATRETGGKVTSETRYFALSWVPTPEVLLAAVRDHWAIENA 307 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 LHW+LDV ED + R+ N + +R A+++L D K L K+++A D +L Sbjct: 308 LHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIKRAGWDTTFL 366 Query: 369 ASVLA 373 S+L+ Sbjct: 367 RSILS 371 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 165 bits (418), Expect = 2e-39, Method: Compositional matrix adjust. Identities = 91/240 (37%), Positives = 138/240 (57%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN--PEHDSY-AISEKSHGREEIR 236 IAEKI+ + DY+ ++K QG L + E F E E Y EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 165 bits (418), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 133/410 (32%), Positives = 196/410 (47%), Gaps = 44/410 (10%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINW---MR-DCHSSDDKDV------------- 105 P HDT+ R I + C+ W MR D S +D D Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 106 ---IAIDGKTL----------RHSYDKSRRRGA----IHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT+ + S K + A +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDIK-GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFE 207 NEI AIP+LL+ +DI+ G ++T DA+G QK I EKI ++ DYL VK +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPEHDSYAISEKS---HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ E+D +E++ HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA + + E +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIATGEIQNEKHC--FISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVL 372 + N+A+ FS + +A+ IL N D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLI 423 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 165 bits (417), Expect = 3e-39, Method: Compositional matrix adjust. Identities = 93/252 (36%), Positives = 145/252 (57%), Gaps = 7/252 (2%) Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++H+++A+ + +L++GQ+K D+KSNEITAIP+LL ML ++G I+T DAMGCQK IA++I Sbjct: 2 SLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQI 61 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP-EHDSYAISEKSHGREEIRLHIVCDV 243 + DY+ AVK Q L + + F ++N H + + HGR E R + V Sbjct: 62 GSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTI-V 120 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 D+L+ W L + + S R + E RY+I S + A++F A+R HW Sbjct: 121 GDDLLAGITGWDNLNAIGMVESKREVGNTISNEK----RYFIMSINGHAQRFGDAVREHW 176 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 177 GIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAGW 235 Query: 364 DRNYLASVLAGS 375 D ++L VL G+ Sbjct: 236 DNSFLIKVLTGN 247 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 164 bits (416), Expect = 4e-39, Method: Compositional matrix adjust. Identities = 103/343 (30%), Positives = 164/343 (47%), Gaps = 21/343 (6%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R +H L +IL + + AV+ GA ++E F + LD L+Q+ E G P HD Sbjct: 10 VPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLERGAPSHD 68 Query: 73 TIARVVSCISPAKFHECFINWM----RDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 T +RV++ + P +E F+ +M K +A+DGK+LR +Y K R V Sbjct: 69 TFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGRSHMPPLV 128 Query: 129 ISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 ++ F + + Q E E+ A L +L +KG +T DA+ C + + + ++ G Sbjct: 129 VTVFGCDTFMSLAQTVAQE-GGEVQAAIAALELLSLKGLTVTADALHCHRRMTKTVRDGG 187 Query: 189 GDYLFAVKGTQGRLNKAFEEKFPL-KELNNPEHDSYAISEKSHGREEIRLHIVCDVPDEL 247 G Y+ A+KG Q +L A E L K + E +HGR E+R V Sbjct: 188 GHYVIAIKGNQSKL--AAEANTALDKAAAGKATKFHQTEEDAHGRHEVRRAFVIPFAQ-- 243 Query: 248 IDFTFEWKGLKKLCV---AVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 T L LC S+R++ + + VR Y S + A + +R HW Sbjct: 244 ---TPGKNALVDLCAIGRVESWRTVEGKTTHK----VRCYALSRKMPAHELLATVRRHWS 296 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 +EN LHW+LDV++ ED + R+ N A + +R + +N+L D Sbjct: 297 IENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRAD 339 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 160 bits (406), Expect = 6e-38, Method: Compositional matrix adjust. Identities = 125/383 (32%), Positives = 184/383 (48%), Gaps = 43/383 (11%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINW---MR-DCHSSDDKDV----------------IAIDGKTL----------RHSYDK 119 + W MR D S +D D IAIDGKT+ + S K Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 120 SRRRGA----IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK-GKIITTDAM 174 + A +H++SAF + SL +GQ + K NEI AIP+LL+ +DI+ G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKS---HG 231 G QK I EKI ++ DYL VK +L + E ++ E+D +E++ HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA + + E +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIATGEIQNEKHC--FISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN--DKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL N D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVL 372 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLI 380 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 158 bits (399), Expect = 4e-37, Method: Compositional matrix adjust. Identities = 81/195 (41%), Positives = 120/195 (61%), Gaps = 2/195 (1%) Query: 94 MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKG QG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D+ I EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 121 RRAPIDRDTCQI-EKQKGRVEARTYHVLSASDLIRDFS-TWSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKEPEMTVRYYISSA 288 + + + + + S+ Sbjct: 179 RARVGVPLLHKVQSS 193 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 157 bits (398), Expect = 5e-37, Method: Compositional matrix adjust. Identities = 89/278 (32%), Positives = 143/278 (51%), Gaps = 17/278 (6%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L+E + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------VIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + D IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN-----PEHDSYAISEKSHG 231 ++++A+ I +G YL +K Q +++ F + P D++ + +HG Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAF---DDTHG 238 Query: 232 REEIRLHIVCDVPDELIDFTFE-WKGLKKLCVAVSFRS 268 R R C PD T W GL + + + R+ Sbjct: 239 RLVRRRVFAC--PDAGCFTTLRGWPGLTTVLASETIRA 274 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 157 bits (397), Expect = 5e-37, Method: Compositional matrix adjust. Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 46/351 (13%) Query: 15 DYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ KV H+ I++ + V + + W ++ DF +DF++++ P HDT+ Sbjct: 29 DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFFPDIQKAPSHDTL 88 Query: 75 ARVVSCISPAKFHECFINW---MRDCHSSDDKD-----------------VIAIDGKTLR 114 R + P + W MR+ ++ ++ IAIDGKT++ Sbjct: 89 RRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKPFRQIAIDGKTIK 148 Query: 115 HSYDKSRRRG--------------AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + ++ RRR +H++SAFS L +GQ + D+K NEI AIP LL+ Sbjct: 149 KAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKKENEIVAIPRLLD 208 Query: 161 MLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRL------NKAFEEKFPLK 213 LDI +G ++T DAMG QKDI +I K+ YL VK Q L N E+ PL Sbjct: 209 DLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIAGNMRDFERIPLP 268 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 N + + E HG +R VC L +W+ L+ + + R + E Sbjct: 269 ---NEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIRTER--VDEA 323 Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 324 TGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 153 bits (387), Expect = 8e-36, Method: Compositional matrix adjust. Identities = 104/345 (30%), Positives = 180/345 (52%), Gaps = 16/345 (4%) Query: 10 ISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-- 67 I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 9 IAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGKE 68 Query: 68 -----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 +P HDT V I P +F E + ++ + + IAIDGKT R ++ Sbjct: 69 LKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPR-GIKQTAN 127 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G ++ E Sbjct: 128 SHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVIE 187 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIR-LHIVC 241 I +GG+++ VKG Q +L + E++F N D+ + HGR E R ++ + Sbjct: 188 MILSKGGNFVLPVKGNQKKLLEFIEKEFREYRGNTVSADTQ--EDIGHGRVEKRTVYCIT 245 Query: 242 DV-PDELIDFTFE-WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 ++ D+ ID + WKG+K L V R + + K + YYI++ + ++ AI Sbjct: 246 EIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPKEINRAI 302 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 R HW +EN LH LDV++NED + N E F + +A+ I+ Sbjct: 303 RAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFII 347 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 153 bits (386), Expect = 1e-35, Method: Compositional matrix adjust. Identities = 106/366 (28%), Positives = 170/366 (46%), Gaps = 19/366 (5%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R H L ++L++ +V+ GA ++ FG + + ++ +P HD Sbjct: 44 VPDPRAE-NTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLKHAVPSHD 102 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDKSRRRGAIHVISA 131 T + V I P F + D + D DVIA+DGK LR + D ++SA Sbjct: 103 TFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGRTRMMVSA 162 Query: 132 FSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDY 191 ++ L + + D + E+ A E L ++ +KGK++T DA+ C + I GGD+ Sbjct: 163 YAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAINAGGGDW 221 Query: 192 LFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEK-SHGREEIRLHIVCDVPDELIDF 250 A+K Q L F + P+ A+SE HGR E R V V + + Sbjct: 222 CLALKANQDSLLSDARASFGAE----PDAHPSALSEDIGHGRTETRKATV--VSSKALAE 275 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKEPEMT--VRYYISSADLTAEKFATAIRNHWHVENK 308 E+ GLK +F + A +K T RY+ S T E +R HW +EN Sbjct: 276 HHEFPGLK------AFGRVEATRKTAEGTTSETRYFALSWVPTPEVLLATVRAHWAIENS 329 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D ++L Sbjct: 330 LHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWDDDFL 388 Query: 369 ASVLAG 374 +VL G Sbjct: 389 RNVLNG 394 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 152 bits (385), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 75/165 (45%), Positives = 107/165 (64%), Gaps = 3/165 (1%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 +PD R+ + H+L ++LL IC VISGAE W + + + LD+L+ Y + +GI HD Sbjct: 15 LPDPRRR-ECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPYAHGIASHD 73 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAF 132 T RV S + ++F CF+ W+ S + +AIDGK LR S+D + R IH++SA+ Sbjct: 74 TFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RSPIHLVSAW 131 Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S+ +L +GQ++T +KSNEITAIPELL LDI+G IT DAMGC Sbjct: 132 SSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCH 176 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 152 bits (384), Expect = 2e-35, Method: Compositional matrix adjust. Identities = 74/87 (85%), Positives = 77/87 (88%) Query: 272 EQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 EQKKEPEMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAAE Sbjct: 19 EQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAAE 78 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKM 358 LFSGIR IAINILT DK+ KAG R KM Sbjct: 79 LFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 148 bits (374), Expect = 3e-34, Method: Compositional matrix adjust. Identities = 111/350 (31%), Positives = 171/350 (48%), Gaps = 16/350 (4%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+ K + HKLSDI++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----DDKDVIAIDGKTLRH 115 NGIP T+ R+ I + H +++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + +EKSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 QKDI +KI+++ GD++ +K Q L E+K +KEL +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDK--IKEL-SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ + Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVFS 375 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 143 bits (361), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 110/356 (30%), Positives = 180/356 (50%), Gaps = 30/356 (8%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K L E + +PDYR+T K ++KL DILLL I + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISP-------AKFHECFINWMRDCHSSDDKDVIAIDGKTL 113 G +G+P T+ R+ I ++F F + + C D++ IDGK + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAG----DILCIDGKAM 133 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + ++ R I +SA+S + + +EKSNEIT++P+LL+ +D+ G I+T DA Sbjct: 134 RGTVLENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADA 191 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGRE 233 M QK I +KI+++GGD+L +K Q L E+ L E + + + HGR Sbjct: 192 MSFQKAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSEGPFL---EHGRI 248 Query: 234 EIRLHIVCDV--PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV--RYYISSAD 289 E R VC + ++LI +W G L V V R+ E+K + + + R+Y+SS Sbjct: 249 ETR---VCRIFRGNDLITDREKWNG--NLTV-VEIRT-ATERKSDGQKSSERRFYVSSFH 301 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 +A + T R HW +E+ +HW LD + +D + +A I+ + + IL+ Sbjct: 302 GSARRLGTIARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAILS 356 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 143 bits (360), Expect = 1e-32, Method: Compositional matrix adjust. Identities = 124/374 (33%), Positives = 182/374 (48%), Gaps = 31/374 (8%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L E + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 5 LFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEML 64 Query: 66 NG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH---- 115 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 65 TGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRGVKKL 124 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 S+D HV+SAFS + Q+ D K+NEI AI +LL++LD+ G +++ DA+G Sbjct: 125 SFDTQS-----HVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIG 179 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF-PL--KELNNPEHDSYAISEKSHGR 232 Q I E+I +GGDY+ VK Q + E F PL K + E +E SHGR Sbjct: 180 TQTAIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLDEQ-----TELSHGR 234 Query: 233 EEIRLH--IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS-AD 289 E R + I+ + E + KGL+ + V R K E V YYISS D Sbjct: 235 IETRRYESILNPLEIEANEVLTRRKGLRSIHKVVRKRRDKKSDKTSEE--VAYYISSLTD 292 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +++ K AIR HW +ENKLH LDV D R N A++ I+ I + I+ K Sbjct: 293 VSSLK--QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKT 350 Query: 350 -FKAGLRRKMRKAA 362 K+ + R +K A Sbjct: 351 NMKSSIPRIQKKPA 364 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 142 bits (359), Expect = 2e-32, Method: Compositional matrix adjust. Identities = 83/197 (42%), Positives = 113/197 (57%), Gaps = 8/197 (4%) Query: 133 STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 S +LV+GQ K ++KSNEITAIP L+ ML+I+ IIT DAMGCQK+I I+K+ GDY+ Sbjct: 28 SLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESSIITIDAMGCQKEITSLIRKKKGDYI 87 Query: 193 FAVKGTQGRLNKAFEEKFPL---KELNNPEHDSYAISEKSHGREEIRLHIVCDVPD-ELI 248 +K Q L + +E F + +E + EH Y E H R E R I V + Sbjct: 88 ITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQEIETGHHRIEKREVIAVSVSSLPCL 147 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W LK + + S R + + E VR+YISS + ++K ATAIR+HW +EN Sbjct: 148 HNQDLWTELKTVVMVKSERRLWNKTTTE----VRFYISSVEKNSQKIATAIRSHWEIENS 203 Query: 309 LHWRLDVVMNEDDCKIR 325 LHW LDV +ED +IR Sbjct: 204 LHWTLDVTFSEDKSRIR 220 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 140 bits (352), Expect = 1e-31, Method: Compositional matrix adjust. Identities = 91/237 (38%), Positives = 123/237 (51%), Gaps = 9/237 (3%) Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRL 202 + T++KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK Q +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EK EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPGFAA-KGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV + A+ + E VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAVRI-TTHADGTQSDE--VRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VL G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHPE-KDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 138 bits (347), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 102/334 (30%), Positives = 157/334 (47%), Gaps = 13/334 (3%) Query: 40 GAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP---AKFHECFINWMRD 96 GA+ +I +F E LK+ +G P HDT +R+ I P A+ F+ +R Sbjct: 37 GAKNCVEIAEFVEGREAELKEIVTLRHGCPSHDTFSRIFRLIDPDELARALGAFLAALRQ 96 Query: 97 CHS--SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITA 154 + V+A+DGK LR Y+K R ++S + L + K E S+E+ A Sbjct: 97 GLGLGPRPRGVVAVDGKALRRGYEKGRAFMPPVMVSVWDAETRLSVA-TKRAEGSDEVAA 155 Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 LL +D+KG I+T DA+ C+ D A+ + + Y A+K +GRL E F + Sbjct: 156 TLALLKSIDLKGCIVTADALHCRPDTAKALIGRKAHYALALKANRGRLFACAEAGFVAAD 215 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 + + E HGR E R V +P + + GLK + + R Sbjct: 216 AAG-DLAFHETRETGHGRLETRRASV--LPLKAFKQAPAFPGLKAIGRIQATRQ---GAD 269 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 +VRY S L K A +R HW +EN+LHW LDVV +EDD + R+ NA + + Sbjct: 270 GRAVTSVRYIALSKVLAPHKLAEVVRAHWTIENQLHWSLDVVFHEDDARSRKDNAPQNLA 329 Query: 335 GIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 IR +A +IL + K + KMR+ +R++ Sbjct: 330 VIRRLARDILAAHPLDKP-IASKMRRVNWNRDFF 362 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 137 bits (346), Expect = 4e-31, Method: Compositional matrix adjust. Identities = 78/190 (41%), Positives = 109/190 (57%), Gaps = 7/190 (3%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHD 72 I D R K HK+ I+ ++I AVI GA+ W +IE+FG + F K IP HD Sbjct: 12 IEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPSLEFIPSHD 71 Query: 73 TIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR--HSYDKSRRRGA----I 126 T R S I P F F NW++ + K V+AIDGK +R D RG + Sbjct: 72 TFNRFFSMIKPDYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTRGKEGFKL 130 Query: 127 HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 ++SA+S + + +GQ+K D+KS+EITAIP L+N L++ G I+T DAMGCQKDI + I Sbjct: 131 WMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQKDITQTIIG 190 Query: 187 QGGDYLFAVK 196 +Y+ A+K Sbjct: 191 HDANYIIAIK 200 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 132 bits (331), Expect = 3e-29, Method: Compositional matrix adjust. Identities = 114/375 (30%), Positives = 180/375 (48%), Gaps = 36/375 (9%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDF-GETHLDFLKQ 60 E+ L+E ++ +PD R V H L+ +L LT CAV++GA + ++ E + L++ Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV--IAIDGKT 112 G + + P TI RV++ I W+ C D + +A+DGK+ Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWL-ACRQQDAGGLRALAVDGKS 156 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITT 171 LR + RR +H+++A + LV+ Q+ EK+NEIT LL+ L D+ G ++T+ Sbjct: 157 LRGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTS 214 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHG 231 DA+ Q D A ++ + Y+ VK +L+ + P +++ P D + HG Sbjct: 215 DALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQI--PLQDRTRTT--GHG 269 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R EIR VC V + L + G ++ V + R + K T+ Y ++S L Sbjct: 270 RCEIRRLKVCTVNNLL------FPGARQ-AVQIVRRRVNRTTGKVSLKTI-YAVTS--LA 319 Query: 292 AEKFATA-----IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 AE+ A IR HW VE H R DV ED ++R GNA + + R++AI L Sbjct: 320 AEQAPPARVAQLIRGHWTVEALHHVR-DVTFAEDASQLRSGNAPQAMATYRNLAIGALRL 378 Query: 347 DKV--FKAGLRRKMR 359 V AGLRR R Sbjct: 379 AGVRNIAAGLRRTAR 393 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 130 bits (327), Expect = 8e-29, Method: Compositional matrix adjust. Identities = 89/250 (35%), Positives = 133/250 (53%), Gaps = 16/250 (6%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ + K V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQV-CQEVKGVVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGTQGR---LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 + I + +Y+ A+K + + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDVPDELIDFTFEWK--GLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKF 295 V +++ F+ K GLK + S R+I+A + E VRYY++S D T E+ Sbjct: 183 TVVSY-GSIMEKMFKKKLVGLKSIVGIKSERTIVATGEYTQE--VRYYVTSLDNTKPEEI 239 Query: 296 ATAIRNHWHV 305 A+AIR HW + Sbjct: 240 ASAIRQHWSI 249 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 125 bits (313), Expect = 3e-27, Method: Compositional matrix adjust. Identities = 65/133 (48%), Positives = 90/133 (67%), Gaps = 4/133 (3%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D +R IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGAR--SPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHD--SY 223 G IT DAMGCQ DIAE+I ++G DY+ VKG Q L +A + F + E + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AISEKSHGREEIR 236 + ++K+HGR E R Sbjct: 119 SQTDKNHGRIETR 131 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 118 bits (296), Expect = 3e-25, Method: Compositional matrix adjust. Identities = 95/344 (27%), Positives = 161/344 (46%), Gaps = 16/344 (4%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ-YGDF 64 L+E + + D+R+ H L +L++ I + G G+ ++ +F + + L Q + Sbjct: 4 LIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEFNII 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSDDKDVIAIDGKTLRHSYDK--SR 121 +P + TI RV+ + + F W + + DD + + +DGK+L+++ + Sbjct: 64 PERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNPNNE 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKT-DEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ I +S FS LV+ + ++K +EI ++ ++ K+ T DA+ CQK Sbjct: 124 QQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQKKT 183 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 I K DY+ VKG Q L K ++ L + PE + + SHGR+ R V Sbjct: 184 ISLIAKTKNDYVITVKGNQKNLYKRIQD---LSNSSKPE-SCFLEQDNSHGRKISRKIEV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V E +G + L + + K E T YYISS +A+ FA IR Sbjct: 240 FKVRKN------ERQGFENLRRVIKVERKGSRGDKTYEETA-YYISSLTESAQVFAKIIR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 293 GHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLF 336 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 115 bits (288), Expect = 3e-24, Method: Compositional matrix adjust. Identities = 89/349 (25%), Positives = 168/349 (48%), Gaps = 22/349 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRHSYDKS 120 +P TI +V + + +D + +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T +KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQG-GDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 A +++Q +Y+ VK Q L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSDPV---ERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK----- 294 + V L F + +++ + R ++ E+ Y I S L E+ Sbjct: 276 ILTVARGL-RFPYA----QQVIQIIRRRRVLGAGAWSTEVV--YAICS--LPCEQAPPKL 326 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 A+ IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 327 LASWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGL 375 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 114 bits (286), Expect = 5e-24, Method: Compositional matrix adjust. Identities = 76/195 (38%), Positives = 99/195 (50%), Gaps = 12/195 (6%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK Q RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPEHDSYAISE--KSHGREEIRLHIVCDVPDE---LIDFTFEWKGLKKLCVAVSFRS 268 E + +E K HGR E R VC V ++ L W GL++L + R Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETR---VCRVSEDVAWLASTGQHWAGLQRLVMLERTRQ 117 Query: 269 IIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 I QK E YYISS + A + A IR HW +EN+LHW LDV ED IR Sbjct: 118 I--GQKVTTERC--YYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTV 173 Query: 329 AAELFSGIRHIAINI 343 AA + +R I +N+ Sbjct: 174 AARNMASLRKITLNL 188 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 111 bits (277), Expect = 6e-23, Method: Compositional matrix adjust. Identities = 93/340 (27%), Positives = 145/340 (42%), Gaps = 52/340 (15%) Query: 52 ETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGK 111 +THL+ L+++ + GI TI R++ I F+ W+ + S + +A+DGK Sbjct: 24 KTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALYAFMEWVGEIVDSRNTH-LAVDGK 82 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITT 171 L + +K++ +++ T+ L++ Q+ D K+NEIT IPELL +LDI G I+T Sbjct: 83 ALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSKTNEITVIPELLKLLDISGSIVTI 142 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNK---AFEEKFPLKELNNP---------- 218 DA+G Q I E+I +QGG + VK Q + F +K ++ Sbjct: 143 DAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHTFMDKLEAADVQRKKGEVLDSGMR 202 Query: 219 ----EHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR------- 267 +++ EK+ R E R +C L EW ++ + R Sbjct: 203 EYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQKEWPHVQSIGRIKQVRIPSEKDS 262 Query: 268 ---------------------SIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVE 306 + AE+ ++ IS LTAE+ + R HW +E Sbjct: 263 HGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTALISDLILTAEELGSIKRMHWSIE 322 Query: 307 NKLHWRLDVVMNED--DCKIRRGNAAELFSGIRHIAINIL 344 N+LH LD ED K R N S IR A NIL Sbjct: 323 NRLHHVLDDTFREDRSPAKKSRNN----LSLIRKYAYNIL 358 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 109 bits (273), Expect = 1e-22, Method: Compositional matrix adjust. Identities = 106/390 (27%), Positives = 173/390 (44%), Gaps = 37/390 (9%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDF 57 ++ L+ + I D R+ + LS +L + A ++GA G +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENG---IPVHDTIARVVSCISPAKFHECFINWM--RDCHSSDDKDVIAIDGKT 112 L D G P I + + A F W+ + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL-NMLDIKGKIITT 171 LR ++ + +R + ++SA LV GQ++ + +NEIT + LL N+ DI G ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 172 -DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRL-NKAFEEKFPLKELNNPEHDSYAISEKS 229 DA+ Q + A + + G DY VKG Q L K FE+ PL + P+H+ + E+ Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLYRKTFEQTLPLLQ-KPPQHE---VEERG 254 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR + + +T E KG+ VA + E + R Y Sbjct: 255 HGRIK-----------KWQAWTTEAKGIGFPEVATAAVIRRDEFDLKGIRVSREYAHILT 303 Query: 290 LTAEKFATA------IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 A ATA IR HW +EN++H+ D ED + GN+ + R++AI I Sbjct: 304 SVAGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGI 363 Query: 344 LTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + + + K ++ + A DR+ + +LA Sbjct: 364 IRRNGIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 109 bits (273), Expect = 2e-22, Method: Compositional matrix adjust. Identities = 58/132 (43%), Positives = 89/132 (67%), Gaps = 3/132 (2%) Query: 104 DVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD 163 D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT+EKSNE TAIP+L +L Sbjct: 8 DIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKLFTLLA 67 Query: 164 IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHD-S 222 ++ +T DA+G Q+DIA++I + DYL VK Q L++ + + E D + Sbjct: 68 LEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGFTEDFT 127 Query: 223 YAISEKS--HGR 232 +++E+ HGR Sbjct: 128 DSVTEEGDKHGR 139 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 103 bits (256), Expect = 2e-20, Method: Compositional matrix adjust. Identities = 59/171 (34%), Positives = 91/171 (53%), Gaps = 19/171 (11%) Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAF----EEKFPLKELNNPEHDSYAISEKSHGRE 233 K + I + G DY+ AVKG Q RL++ E++ P+ E S I+ +S Sbjct: 3 KKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTEQRLPVSLDITTERRSDRITTRS---- 58 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 V D+L +++W+GL++L F + +P + YYISS + A Sbjct: 59 -------VSVFDDLSGISYDWEGLQRLVKVERF----GTRAGKPYHQIVYYISSLTINAA 107 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 +FA IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL Sbjct: 108 QFAQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTIL 158 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 102 bits (253), Expect = 3e-20, Method: Compositional matrix adjust. Identities = 91/325 (28%), Positives = 143/325 (44%), Gaps = 29/325 (8%) Query: 50 FGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ +T NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQ-QTAPGRNEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGTQ-GRLNKAFEEKFPLKELNNPEHDSYAISEK 228 T DA+ C+ D A I GGDY A+K Q G L + ++ L +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLG-----VQTAAEN 204 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA 288 H R E R + V D IDF GL+ + +V S A+ + VRY++ S Sbjct: 205 DHDRCERRRACIVAVND--IDF----PGLQAIG-SVEATSRHADGRLTSH--VRYFLLST 255 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 256 IMSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHP 315 Query: 349 VFKAGLRRKMRKAAMDRNYLASVLA 373 KA +RRK++ A D +L S++A Sbjct: 316 -DKASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 100 bits (249), Expect = 8e-20, Method: Compositional matrix adjust. Identities = 57/135 (42%), Positives = 81/135 (60%), Gaps = 3/135 (2%) Query: 105 VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDI 164 VIAI+GK+LR + + A+H +SA++ + L +GQ+ EKSNEITAI ELL L + Sbjct: 5 VIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLPTLAL 64 Query: 165 KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF-PLKELNNPEHDS- 222 +G ++T DA+GCQ +AE+I GGDY+ AVK Q L A + F L +P + Sbjct: 65 EGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPVRQTC 124 Query: 223 -YAISEKSHGREEIR 236 + +K HGR E R Sbjct: 125 VHETLDKGHGRIETR 139 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 99.0 bits (245), Expect = 2e-19, Method: Compositional matrix adjust. Identities = 55/148 (37%), Positives = 83/148 (56%), Gaps = 3/148 (2%) Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + NG P DT RV+ I P + C + ++ S + IAIDGK L+ S K+ Sbjct: 17 ELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHIAIDGKRLKGSKKKT-- 74 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 G+ H++SA+ L + Q EK NE+ AIPE+L+ LD+ G +I+ DAMG Q +IAE Sbjct: 75 -GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSGAVISIDAMGTQTNIAE 133 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 +I + DY+ ++KG Q L + + F Sbjct: 134 QIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 97.8 bits (242), Expect = 6e-19, Method: Compositional matrix adjust. Identities = 56/144 (38%), Positives = 77/144 (53%), Gaps = 4/144 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G H++SA++T H + +G + T+EKSNEITAI LL L K ++T DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFE---EKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 I GGD++ AV+ Q +L A EK E H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDELIDFTFEWKGLKKLCVAV 264 VP + EW +K + AV Sbjct: 122 AQVPPDFA-AKGEWPWIKAIGTAV 144 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 97.4 bits (241), Expect = 7e-19, Method: Compositional matrix adjust. Identities = 63/208 (30%), Positives = 100/208 (48%), Gaps = 5/208 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H D + +A+DGK L S D Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRDGQVP- 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 122 -GTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 +Q +GGD + K QG L E F Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAF 208 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 97.1 bits (240), Expect = 1e-18, Method: Compositional matrix adjust. Identities = 99/374 (26%), Positives = 162/374 (43%), Gaps = 35/374 (9%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDF----GETHLDFLK 59 L+ ++ +PD R V H L +L + AV++GA + ++ + L L Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 60 QYGDFENGI---PVHDTIARVVSCISPAKFHECFINWMRDCH--SSDDKDVIAIDGKTLR 114 + D G+ P T R+++ + + W+ C ++ + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 S + +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GSGPAGEQ---VHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIAE-KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGRE 233 Q++ A + + Y+F VK Q RL + + P ++ P D S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQL-KTLPWTKI--PIQDE--TSTRGHGRY 258 Query: 234 EIR--LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY----ISS 287 +IR + C P L DF + A+ R TV Y +S+ Sbjct: 259 DIRRLQAVTCTGPLAL-DFPHAVQ-------ALRIRRRRLNLATGRWSTVTVYAITNLSA 310 Query: 288 ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI--LT 345 A + A +R HW +E H R D ED ++R GNA + +R+ AIN+ LT Sbjct: 311 AQAGPAELADWLRGHWAIETLHHIR-DTTYAEDASRLRTGNAPRAMATLRNTAINLLRLT 369 Query: 346 NDKVFKAGLRRKMR 359 A LR R Sbjct: 370 GITTIAAALRHNSR 383 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 43/90 (47%), Positives = 60/90 (66%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +A+N + +K A + RK + A M L Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVL 90 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 64/211 (30%), Positives = 96/211 (45%), Gaps = 3/211 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + L E +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 QVP--GQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 +A + G DY+ K Q L + E Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGL 209 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 59/189 (31%), Positives = 94/189 (49%), Gaps = 12/189 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK Q L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN +++ ++K HG E H + + +W GL++ +S R Sbjct: 62 LN-----AWSWTQKGHGHES---HCRLKIWEATESMKMQWAGLERF---ISIRRQGFRHH 110 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINI 343 +R+IA N+ Sbjct: 170 ILRNIAFNL 178 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 95.9 bits (237), Expect = 2e-18, Method: Compositional matrix adjust. Identities = 95/326 (29%), Positives = 142/326 (43%), Gaps = 28/326 (8%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLD-FLKQYG-DFENGIPVHDTIARVVSCISPAKF 86 +L + + A +G G+ + T D L Q G F P T V+S + PA Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPADL 60 Query: 87 HECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 + ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 61 NARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQL 118 Query: 144 KTDEKSNEITAIPELLNMLDIKGK-IITTDAMGCQKDIAEKI-QKQGGDYLFAVKGTQGR 201 EKSNEI + LL +L + ++T DAM Q A+ I YL VK Q + Sbjct: 119 AVAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAK 178 Query: 202 LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIR-LHIVCDVPDELIDFTFEWKGLKKL 260 + A P E+ D + HGR E R L I+ I F + K++ Sbjct: 179 I-LARITALPWAEVPAAATDD----SRGHGRVETRTLQIITAARG--IGFPYA----KQI 227 Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVM 317 R I A ++ E V Y I S + T +R H +EN LHW DV Sbjct: 228 IRITRERLITATDQRSVE--VVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTF 285 Query: 318 NEDDCKIRRGNAAELFSGIRHIAINI 343 +ED + GN A++ + +R+ AIN+ Sbjct: 286 DEDRQRAHTGNGAQVLATLRNTAINL 311 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 93.2 bits (230), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 64/179 (35%), Positives = 94/179 (52%), Gaps = 10/179 (5%) Query: 189 GDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELI 248 GDYL VKG Q +L +A E F + + + D A+ E+ HGR ++ V + I Sbjct: 7 GDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSA--KGI 63 Query: 249 DFTFEWKGLKKLCVAVS-FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 +W CV + S+ +KE ++ YYI+S LTAE+ A ++R W VEN Sbjct: 64 INPGDWPN----CVTIGRIDSMRVVDEKESDLERCYYITSRALTAEQLAASVRARWGVEN 119 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKAAMD 364 + HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + AA D Sbjct: 120 RFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKGAARD 178 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 92.8 bits (229), Expect = 2e-17, Method: Compositional matrix adjust. Identities = 59/148 (39%), Positives = 81/148 (54%), Gaps = 11/148 (7%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS-----EK 228 MGCQK+IAE I +Q DY+ AVK Q L++A ++ F +E N +SY I K Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYF--EEANEANFESYNIDFAETYNK 58 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA 288 SHGR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 59 SHGRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISST 114 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVV 316 TA + R HW +EN LHWRLD+ Sbjct: 115 MATAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 92.0 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 43/113 (38%), Positives = 68/113 (60%), Gaps = 4/113 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W LK + + S I + + + RY+ISS D E+ A ++R+HW +EN LHW L Sbjct: 15 WSNLKSVGMVES----IGQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIENSLHWVL 70 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 DV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 71 DVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 91.3 bits (225), Expect = 5e-17, Method: Compositional matrix adjust. Identities = 59/168 (35%), Positives = 88/168 (52%), Gaps = 19/168 (11%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K E SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGTQGRLNKAFEEKFP------LKELNNPEHDSYAISEKSHGREEIRLHIVC 241 DY+ +K QG L ++ E+ F +EL +H +Y E HG EIR Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQEL---QHSTYKPEETGHGLHEIRNFGFQ 117 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 PD + W LK +V I + + + RY+ISS D Sbjct: 118 LDPDSV------WSNLK----SVGMVEPIGQVDDKTTVETRYFISSLD 155 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 89.7 bits (221), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 52/127 (40%), Positives = 69/127 (54%), Gaps = 1/127 (0%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 + ++ +KI ++ DYL AVKG QG L AF++ F LNN + + Y E+S GR E Sbjct: 12 VRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHES 71 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 72 RAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEEL 130 Query: 296 ATAIRNH 302 TA R H Sbjct: 131 LTASRLH 137 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 89.4 bits (220), Expect = 2e-16, Method: Compositional matrix adjust. Identities = 87/374 (23%), Positives = 158/374 (42%), Gaps = 45/374 (12%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDF-GETHLDFLKQ 60 E++ L + ++ +PD R + H+L IL L+ AV +G + E+I + L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHEC---FINWMRDCHSSDDKDVIAIDGK 111 G + + P DT+ RV+S + + F + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGK 167 TLR + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGTQGRLNKAFEE-----KFPLKELNNPEHD 221 ++T DA+ + A+ I + G ++F VK L+ + K P+ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI--------- 266 Query: 222 SYAISEKSHGREEIRLHIVCDVPDELIDFTFEW--------KGLKKLCVAVSFRSIIAEQ 273 ++ ++HGR E R I E I + + +++ + R+ + Sbjct: 267 GHSAEGRAHGRFERRT-IQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARV--T 323 Query: 274 KKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + P + ++S L T A R HW +ENK+HW DV ED ++R G Sbjct: 324 RTIPSTVTVHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLP 383 Query: 331 ELFSGIRHIAINIL 344 + + +R++ I ++ Sbjct: 384 RIMTTLRNLIIGLI 397 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 88.6 bits (218), Expect = 4e-16, Method: Compositional matrix adjust. Identities = 53/182 (29%), Positives = 90/182 (49%), Gaps = 4/182 (2%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + L+ + +PD R+ + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 61 -YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D+KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QK 178 QK Sbjct: 191 QK 192 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 87.4 bits (215), Expect = 8e-16, Method: Compositional matrix adjust. Identities = 65/191 (34%), Positives = 91/191 (47%), Gaps = 22/191 (11%) Query: 192 LFAVKGTQG----RLNKAFE--EKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 + AVK Q R+ A + E F L + +H +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREV---DKGHGRIETRRCLALDFPG 57 Query: 246 ELIDFTFE---WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 FE W GL+ + + S R I RYY+SS A + A A+R H Sbjct: 58 P-----FEPDLWPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAH 108 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +E+ +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA Sbjct: 109 WGIES-MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAG 167 Query: 363 MDRNYLASVLA 373 +Y A +L Sbjct: 168 ASDDYRAQLLG 178 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 85.5 bits (210), Expect = 3e-15, Method: Compositional matrix adjust. Identities = 63/226 (27%), Positives = 108/226 (47%), Gaps = 17/226 (7%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ LM+ +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDF----ENGI---PVHDTIARVVSCISPAKFHECFINWMRD----CHSSDDKDVIAIDG 110 F E I P T+ R + I + W + C D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIIT 170 K +R + K++ IH ++AF +V+ Q DEK+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKI-QKQGGDYLFAVKGTQGRLNKAFE----EKFP 211 DA+ Q + A I + + DY+F VK Q + + E E FP Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIESLPWEAFP 445 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 83.6 bits (205), Expect = 1e-14, Method: Composition-based stats. Identities = 38/90 (42%), Positives = 58/90 (64%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +E S IPD R +H +I+ L + +V++GA+ + +IEDF E H+D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMR 95 NGIP HDT +RV S I+PA F + F+ W++ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLK 94 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 92/373 (24%), Positives = 146/373 (39%), Gaps = 47/373 (12%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMR--------------DCHSSDDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRGAIHVISAFSTMHSL--VIGQIKTDEKSN---EITAIPELLNML 162 DGKT+R +RRR I+ + L G + E N EI A+ ++ L Sbjct: 151 ADGKTMR----GARRRTGDGKIAQDQVVEILDHASGAVVACEPVNDGDEIGAVRTVMGRL 206 Query: 163 -----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 + G ++ TDA Q + E++ GG +L VK Q R+ A P ++ Sbjct: 207 ADRWGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRI-LAKVRALPWAQVR- 264 Query: 218 PEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP 277 D+ K+HGR E R V P +D G ++ + ++ + P Sbjct: 265 -AQDT--CRGKAHGRAETRTVRVVQAPTH-VDLALA--GTAQV-IKITRHTRRRPHPGAP 317 Query: 278 EMTVR---YYISSADLTAE-----KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + R Y ++S L AE A +R+HW +EN++HW D +ED R GN Sbjct: 318 AASTRENAYLLTS--LPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNG 375 Query: 330 AELFSGIRHIAIN 342 + +R+ AI Sbjct: 376 PINLACLRNTAIT 388 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 82.8 bits (203), Expect = 2e-14, Method: Compositional matrix adjust. Identities = 55/148 (37%), Positives = 74/148 (50%), Gaps = 6/148 (4%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFE--WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY 284 +K HGR E R D L + WK + + S R I ++ E RY Sbjct: 137 DKGHGRIETRRCTAAGDLDWLATLGLKERWKKITSVAGIDSSRVIGSKT----ETDRRYV 192 Query: 285 ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ISS +E+ A+R HW +EN LHW LDV ED C IR NAA FS +R A+N+ Sbjct: 193 ISSLPADSERILHAVRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLF 252 Query: 345 TNDKVFKAGLRRKMRKAAMDRNYLASVL 372 D GL +K + AA + +YLA++L Sbjct: 253 RADHSRAMGLPKKRKAAAWNPDYLANIL 280 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 82.0 bits (201), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 39/73 (53%), Positives = 51/73 (69%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H + D R +H L DI+LL I AV+SG+EGWEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 81.6 bits (200), Expect = 4e-14, Method: Compositional matrix adjust. Identities = 56/180 (31%), Positives = 88/180 (48%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ-Y 61 + L + + IPD+R+ L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F ++ VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHF--RAHAARLAEGAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDE--KSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI ++ K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 80.9 bits (198), Expect = 7e-14, Method: Compositional matrix adjust. Identities = 101/394 (25%), Positives = 158/394 (40%), Gaps = 85/394 (21%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAV-------ISGAEGW------EDIE 48 +++ L+ + D R V +++S +L L +CA+ I+ A W E++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 49 DFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRD-----CHSSDD- 102 FG L + G + IP T+ V+ + P + + +R HS + Sbjct: 90 AFG---LPYHPLRGRYR--IPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPL 144 Query: 103 --------------------------KDVIAIDGKTLRHS--YDKSRRRGAIHVISAFST 134 + IA+DGK LR + D SR + V+SA Sbjct: 145 MPDGGIEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR----VFVLSAVRH 200 Query: 135 MHSLVIGQIKTDEKSNEITAIPEL------LNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 + + + K+NEI PE L+ D+KG ++T DA+ Q+D A + ++G Sbjct: 201 GDGITLASREIGAKTNEI---PEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERG 257 Query: 189 GDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELI 248 YL +K Q R P KE+ D + HGR E RL V V L Sbjct: 258 AHYLLTIKNNQ-RGQARQLHALPWKEIPVIHRDD----ARGHGRHEQRLVQVVTVNGLLF 312 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA-----IRNHW 303 + + + R + KK TV Y I+ DL AE+ + A R HW Sbjct: 313 PHAAQ-------VLRIQRRRRLYGAKKWSSETV-YAIT--DLPAEEASAAEIASWARGHW 362 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 VEN +HW DV NED ++R N + + +R Sbjct: 363 TVENTVHWCRDVTFNEDKSQVRTHNTPSVLAAVR 396 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 80.5 bits (197), Expect = 9e-14, Method: Compositional matrix adjust. Identities = 68/247 (27%), Positives = 106/247 (42%), Gaps = 14/247 (5%) Query: 40 GAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS 99 GA+ ++ +F E + L++ +G P HDT +RV + P + F +M Sbjct: 37 GAKTCVEMAEFSEARQEELREIVALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRG 96 Query: 100 S----DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAI 155 + K V+AIDGK+LR YDK R ++S + I ++ +EI A Sbjct: 97 ALGLPAPKGVVAIDGKSLRRGYDKGRAFMPPLMVSVWDVETRPSIAAMRAP-GGDEIKAT 155 Query: 156 PELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKEL 215 +L L +KG +T DA+ C +A+ + Y +K G L +A E F Sbjct: 156 LSVLKALTLKGCTVTADALHCHPAMAQALLAAKAQYALGLKANHGPLFRAAEAGFAAVT- 214 Query: 216 NNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK 275 + + E+ HGREE R V V D L+ GLK + + R+ Sbjct: 215 ---DLAVFETRERGHGREEQRRASVLPV-DRLVKRP-SLPGLKAIGRIEAVRT---GANG 266 Query: 276 EPEMTVR 282 +PE VR Sbjct: 267 KPEQAVR 273 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 80.1 bits (196), Expect = 1e-13, Method: Compositional matrix adjust. Identities = 58/194 (29%), Positives = 94/194 (48%), Gaps = 19/194 (9%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDFLK---QYGD 63 + + D R+ + H +LL+ + V++G +E I +D ++ L L G Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + ++ HSS IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYIV-AHSSGR--AIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVK 196 +I+++GGDY+F VK Sbjct: 398 RIREKGGDYVFTVK 411 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 79.7 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 37/48 (77%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSSDD DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 79.7 bits (195), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 85/357 (23%), Positives = 156/357 (43%), Gaps = 27/357 (7%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG-ETHLDFLKQ 60 ++ L+ + +PD+R V ++L+ +L L + I+G + + ++ + L Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS--SDDKDVIAI--DGKTLRHS 116 G F +P TI R+V P + + W +D ++A+ DGK ++ + Sbjct: 84 LG-FPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGA 142 Query: 117 YDKSRRRGAIH---VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 + + G++ V+ A +G + +EI ++ L+N + ++TTD Sbjct: 143 RSRPPQ-GSVRQEAVVEAVRHDTGTALGHQRV-VAGDEIASVRRLVNRVCDHNTLVTTDC 200 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGR- 232 + + +A I+ +GG +LF++KG Q + +A P E N + EK+HGR Sbjct: 201 LHAHEPLARAIRAKGGHWLFSIKGNQPTV-RAKLAGLPWDEFGN----QHVTREKAHGRI 255 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLC-VAVSFRSIIAEQKKEPEMTVRYY----ISS 287 EE L + L+ F +G +++ +A + R T +Y +S+ Sbjct: 256 EERALKALTPSAPSLVGF----RGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLST 311 Query: 288 ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 + + A R HW VE H R D M+ED IR NAA ++ R I+ L Sbjct: 312 DQASPAQLARWARGHWTVEAIHHVR-DRTMDEDRHTIRTKNAALNWAIARDTTISAL 367 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 79.3 bits (194), Expect = 2e-13, Method: Compositional matrix adjust. Identities = 99/394 (25%), Positives = 159/394 (40%), Gaps = 72/394 (18%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVIS-GAEGWEDIEDFGETH----LDFLKQ 60 L++ ++I D R T H L+ IL + CA ++ G + IE + + L L Sbjct: 30 LIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLHI 89 Query: 61 YGDFENGI---PVHDTIARVVSCISPAKFHEC---FINWMRDCHSSDDKDVI-------- 106 + D G+ P TI RV++ + + C F+N +++ D + Sbjct: 90 WRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRRT 149 Query: 107 ----------------------AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 A+DGK L+ + G +H+IS + + + V Q + Sbjct: 150 EREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDG--GRVHLISLAAHLDATVHAQRQ 207 Query: 145 TDEKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAEK-IQKQGGDYLFAVKGTQG 200 KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK Q Sbjct: 208 IPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQP 267 Query: 201 RLNKAFEEKFPLKELNNPEHDSYAISEK----SHGREEIRLHIVCDVPDELIDFTFEWKG 256 L+ + L + D A++ + HGR E R I+ P + IDF + + Sbjct: 268 TLHATA-----ITALTGTDTDFAAVTHRETHRGHGRTEYR--ILRTAPADGIDFPYAAQV 320 Query: 257 LKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK-----FATAIRNHWH-VENKLH 310 + L I KE V Y I+ DLTA + A +R HW +EN +H Sbjct: 321 FRVLRHRGGLDGI--RHSKE----VCYGIT--DLTARQAGPAHLAAYVRGHWKAIENGVH 372 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 DV ED C+ R + R++A L Sbjct: 373 HVRDVTFAEDACQARTATLPRALAAFRNLATGTL 406 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 79.3 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 49/109 (44%), Positives = 64/109 (58%), Gaps = 4/109 (3%) Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNEDDCK 323 S R+I+A + E VRYY++S D T EK A+AIR HW + N LHW+LDV ED K Sbjct: 5 SERTIVAIGEYTQE--VRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFREDYSK 62 Query: 324 IRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + NAA FS +A+ IL N+K K + K KA D NYL+ +L Sbjct: 63 -KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLL 110 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 78.6 bits (192), Expect = 3e-13, Method: Compositional matrix adjust. Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW L Sbjct: 7 WEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCL 62 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 D+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 63 DIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 78.2 bits (191), Expect = 5e-13, Method: Composition-based stats. Identities = 43/105 (40%), Positives = 63/105 (60%) Query: 270 IAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E +IR+G+A Sbjct: 10 LVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQSRIRKGHA 69 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 70 DINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLG 114 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 77.0 bits (188), Expect = 9e-13, Method: Compositional matrix adjust. Identities = 48/170 (28%), Positives = 80/170 (47%), Gaps = 3/170 (1%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSCIS 82 H L +L L AV+ G + I FG + L F G P T+++ + I Sbjct: 6 HPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTLRRID 65 Query: 83 PAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQ 142 P + W+ + D + +A+DGK LR S D H ++A++ + V+GQ Sbjct: 66 PQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDVP--GPHRVAAYAPHAAAVLGQ 123 Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYL 192 I+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ Sbjct: 124 IRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYV 173 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 76.6 bits (187), Expect = 1e-12, Method: Composition-based stats. Identities = 34/75 (45%), Positives = 52/75 (69%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++++E + + D R + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 75.9 bits (185), Expect = 2e-12, Method: Compositional matrix adjust. Identities = 52/171 (30%), Positives = 87/171 (50%), Gaps = 11/171 (6%) Query: 206 FEEKFPLKELNNPEHDSYAISEKSHGREEIR-LHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F++ + L E + +SY EK HGR+E+R ++++ E + +W +K + V Sbjct: 3 FQDYWALPE---DKQESYITEEKGHGRKEVREVYVLPAAFSEAL--RQKWCLVKSIVAVV 57 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 RS+ K + YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I Sbjct: 58 RDRSV----KGKGSYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRI 113 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 G++A + R N+ + + RKM +AA +++Y VL S Sbjct: 114 YAGDSALNMACCRRFVQNLFRKSEG-NLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 74.7 bits (182), Expect = 6e-12, Method: Compositional matrix adjust. Identities = 37/73 (50%), Positives = 49/73 (67%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H I D R +H L +I+LL I AV+SG+EGWE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVV 78 GIP HDTIARV+ Sbjct: 67 AGIPRHDTIARVI 79 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 74.3 bits (181), Expect = 8e-12, Method: Compositional matrix adjust. Identities = 70/228 (30%), Positives = 105/228 (46%), Gaps = 12/228 (5%) Query: 100 SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPEL 158 S +K + DGK LR S + ++RG V+ I Q D +K +EI + L Sbjct: 51 SQEKQWFSGDGKELRGSIESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRAL 109 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP 218 L+ D+ + IT DA+ E I K GG +L +K Q L + + P Sbjct: 110 LSKDDLASQKITLDALHLCPSTTEMITKAGGVFLIGLKENQPTLLAH------MTDCALP 163 Query: 219 EHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 D + +HGR E R + + DV + D ++ K+L V V R+ I ++ + Sbjct: 164 PIDQKTTFDFNHGRVEQRKYWLYDVSKQGFDPRWDNTAFKRL-VKVQ-RTRINQKNAKIS 221 Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 V YYIS+ + E A+RNHW VE H R DV +NED K ++ Sbjct: 222 REVSYYISN-ETAKEGIFDAVRNHWSVEVNNHIR-DVTLNEDQLKSKK 267 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 73.9 bits (180), Expect = 9e-12, Method: Compositional matrix adjust. Identities = 53/163 (32%), Positives = 79/163 (48%), Gaps = 6/163 (3%) Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEIRLHIV 240 EKI ++ GDY+ +K + E F + PE +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + KE + V +YISS D+ + A +R Sbjct: 61 LKVSDWLSKAE-EWKGIKSVLEVCRKRS---DNGKESQEKV-FYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 HW VENK HW LDVV ED+C + AE + +R +A+N+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNL 158 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust. Identities = 40/65 (61%), Positives = 43/65 (66%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISIIPDYRQ WKVEHKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 LDFLK 59 LDFLK Sbjct: 55 LDFLK 59 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 73.2 bits (178), Expect = 2e-11, Method: Compositional matrix adjust. Identities = 38/81 (46%), Positives = 51/81 (62%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S Y Q +H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLSDPRAYNQ----KHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 70.9 bits (172), Expect = 8e-11, Method: Compositional matrix adjust. Identities = 40/124 (32%), Positives = 68/124 (54%), Gaps = 11/124 (8%) Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 SEK HGR E R + P ++ +WKGLK+ R++ K + + V Y I Sbjct: 4 SEKGHGRIEKR--TLETTP--IVTVGQKWKGLKQGLRITRERAV----KGKKTVEVVYGI 55 Query: 286 SS---ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 +S A A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ ++ Sbjct: 56 TSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVVVH 115 Query: 343 ILTN 346 +L + Sbjct: 116 LLAS 119 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 65/244 (26%), Positives = 105/244 (43%), Gaps = 24/244 (9%) Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 A+DGKT R + K +H++ + ++GQ + D KSNE T LL L++ G Sbjct: 151 AVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALLAPLELAG 208 Query: 167 KIITTDAM-GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAI 225 ++ DA+ + ++ + ++ YL K Q +L +AF P E+ P D Sbjct: 209 AFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKL-RAFLAALPWTEI--PTAD--LT 263 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 ++ HGREE R V V +DF A+ R ++ + Y I Sbjct: 264 RDRGHGREETRTLKVATV--THLDFPHA-------AQAIRIRRWRRQKGQPASHETIYAI 314 Query: 286 SSADLTAEKFATAI-----RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 + D TA++ + A+ R WH+E K H+ DV ED R G + + R Sbjct: 315 T--DATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLALFRATV 372 Query: 341 INIL 344 + L Sbjct: 373 ADTL 376 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 70.1 bits (170), Expect = 1e-10, Method: Compositional matrix adjust. Identities = 49/165 (29%), Positives = 86/165 (52%), Gaps = 12/165 (7%) Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF--PLKELNNPEHDSYAISEKSHGREEIRL 237 ++E+ ++ DY+ A+KG + + ++ F P+ + H ++ +K HGR E R+ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFLSPVTSTRSV-HTTF---DKGHGRIERRI 56 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + + D + EWK L + S++ + KE +RY+I+S ++FA Sbjct: 57 YTL-DTNIGWFEDKKEWKHLAGFGMV---DSMVTRKGKECR-EIRYFITSV-TDVKQFAK 110 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 + +HW +EN LHW LDV+ +D+C + NAAE + IR I N Sbjct: 111 GVCSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYN 155 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 70.1 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 30/82 (36%), Positives = 54/82 (65%), Gaps = 2/82 (2%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + +++H S + D RQ+W+V + L +I LL +CA +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV--VSC 80 + +E G+P HDT+ + +SC Sbjct: 77 FLPYERGLPAHDTLKGLSGISC 98 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 50/181 (27%), Positives = 88/181 (48%), Gaps = 6/181 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD+R + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS---YDKS 120 P T RV+ I F NW+ ++D + +DGK+++ + YD++ Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 121 RRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + I+V+S FS + I Q +++ +EI + LL LD++G + T D++ CQK Sbjct: 124 -YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKK 182 Query: 180 I 180 + Sbjct: 183 L 183 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 67.8 bits (164), Expect = 6e-10, Method: Compositional matrix adjust. Identities = 54/204 (26%), Positives = 93/204 (45%), Gaps = 16/204 (7%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGE--THLDFLKQYGDFENGI-- 68 +PD R +H L IL + + AV++ A+ + + ++ T + F Sbjct: 230 LPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKRIRARFNPRTQR 289 Query: 69 ---PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P T+ RV+ + W+ + +A+DGK L+ + R G+ Sbjct: 290 YVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGFE---AVAVDGKVLKGAV---REDGS 343 Query: 126 -IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE-K 183 +H++SAF I Q + K+NEI + LL +DI+ K++T DA+ Q+ A Sbjct: 344 QVHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADALHTQRKTARFL 403 Query: 184 IQKQGGDYLF-AVKGTQGRLNKAF 206 ++ + DYLF AVKG Q +L + Sbjct: 404 VEDKKADYLFTAVKGNQRKLRNSL 427 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 66.6 bits (161), Expect = 2e-09, Method: Composition-based stats. Identities = 29/70 (41%), Positives = 43/70 (61%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ H + I D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPV 70 G G+PV Sbjct: 72 KGILTEGVPV 81 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 60/256 (23%), Positives = 111/256 (43%), Gaps = 22/256 (8%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+E ++ +PD R+ V ++ + +L + +CA++SGA + I ++ + Sbjct: 51 LLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAGLGLT 110 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-------------DVIAIDGKT 112 +P TI RV+ + A W++ + D V+A+DGK Sbjct: 111 GRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAVDGKA 170 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITT 171 +R + + +H++ +V+ Q+ DEK+NEI +L+ + D+ +IT Sbjct: 171 MRATRHGTH---PVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDVLITV 227 Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHG 231 DAM Q A+ + +G L VK Q ++ + P K++ + + + HG Sbjct: 228 DAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRL-KTLPWKDVPV----GHTTTGRGHG 282 Query: 232 REEIRLHIVCDVPDEL 247 R E R VP L Sbjct: 283 RIETRTLKAVTVPAGL 298 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 13/167 (7%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+ K + HKL D+++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAK-------FHECFINWMRDCHSSDDKDVIAIDGKTL 113 NGIP T+ R+ I F E F + + ++++ IDGK Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCA--QEIVCIDGKAE 152 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 R + K+ R I +SA S + + +EKSNEI A+P L++ Sbjct: 153 RGTVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLID 197 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 66.2 bits (160), Expect = 2e-09, Method: Compositional matrix adjust. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 64.7 bits (156), Expect = 5e-09, Method: Compositional matrix adjust. Identities = 40/119 (33%), Positives = 65/119 (54%), Gaps = 7/119 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-VRYYI 285 ++ HGR R + +P+EL + G+K C+AV I+ E K EP+ + YYI Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHALS--GIKS-CIAVE--RIVQEGKGEPKTSHFSYYI 88 Query: 286 SSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ++ + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 89 TNHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLV 146 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 64.7 bits (156), Expect = 6e-09, Method: Composition-based stats. Identities = 31/60 (51%), Positives = 34/60 (56%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 64.3 bits (155), Expect = 7e-09, Method: Compositional matrix adjust. Identities = 47/182 (25%), Positives = 78/182 (42%), Gaps = 17/182 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEK--------FPLKELNNPEHDSYAI 225 M Q D+ +Q++GGDY+ K QG L E FP + D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 E S G + + L ++ W G++++ R + + E V Y I Sbjct: 61 CEVSKGHGWVERRTMTST-IWLNEYLTRWPGVQQVFRLTRTRQVGGKTTVE----VVYGI 115 Query: 286 SSADLTA---EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS A + R HW +E++ H R D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESRHHIR-DATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 IL 344 +L Sbjct: 175 LL 176 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 63.2 bits (152), Expect = 2e-08, Method: Compositional matrix adjust. Identities = 75/314 (23%), Positives = 128/314 (40%), Gaps = 47/314 (14%) Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-------DKDVIAIDGKTLRHSYDK 119 G P T+ R+++ SPA E ++D + V++ DGK D Sbjct: 98 GKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTWSRTDG 157 Query: 120 SRRRGAIHVI-----SAFSTMHSL-----------VIGQIKTDEKSNEITA----IPELL 159 + +GA S+ T +L +GQ K E TA +P + Sbjct: 158 EKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRLLPAIS 217 Query: 160 NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE 219 L + +I+T DA C ++ AE + G Y+F +K Q L+ + +L P Sbjct: 218 EQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHD-IARDYGQYDLGTPL 276 Query: 220 HDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR--SIIAEQKKEP 277 + +E+ G +R DV + L +C + R I+A ++ Sbjct: 277 ART---AERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRRGEIVAVEQ--- 330 Query: 278 EMTVRYYISS---ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD---CKIRRGNAAE 331 RY+++S LT ++ +R HW +EN HW +DV++ ED+ C+ R + E Sbjct: 331 ----RYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRAS-IE 385 Query: 332 LFSGIRHIAINILT 345 S +R I N ++ Sbjct: 386 TVSWLRLIGYNAVS 399 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 62.4 bits (150), Expect = 3e-08, Method: Compositional matrix adjust. Identities = 38/120 (31%), Positives = 62/120 (51%), Gaps = 9/120 (7%) Query: 264 VSFRSIIAEQ---KKEPEMTV----RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 V +SIIA + K E + RYY++S + +RNHW +EN+LHW LDV Sbjct: 20 VGIKSIIATETISSKTNETAISAEWRYYVTSHETEKSDLHLYVRNHWSIENELHWHLDVH 79 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRKAAMDRNYLASVLAG 374 +N+D K R A FS I+ + ++++ K +R ++++ D YL S+L+ Sbjct: 80 LNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKRSVRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 61.6 bits (148), Expect = 4e-08, Method: Composition-based stats. Identities = 33/83 (39%), Positives = 49/83 (59%), Gaps = 1/83 (1%) Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE-K 183 A+H++SAF + +V+ Q+ EKSNEI A ELL LDI G +T DAM Q++ A Sbjct: 8 AVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARFA 67 Query: 184 IQKQGGDYLFAVKGTQGRLNKAF 206 ++ + D++ VK Q L +A Sbjct: 68 VEDKRADFVMTVKDNQPELREAL 90 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 61.2 bits (147), Expect = 5e-08, Method: Compositional matrix adjust. Identities = 61/215 (28%), Positives = 92/215 (42%), Gaps = 16/215 (7%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLD-FLKQYG-DFENGIPVHDTIARVVSCISPAKF 86 +L + + A + G+ + T D L Q G F P T V+S + PA Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRR--PSEKTFRAVLSRLDPADL 60 Query: 87 HECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 + ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 61 NARMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQL 118 Query: 144 KTDEKSNEITAIPELLNMLDIKGK-IITTDAMGCQKDIAEKI-QKQGGDYLFAVKGTQGR 201 EKSNEI + LL +L + ++T DAM Q A+ I YL VK Q + Sbjct: 119 AVAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAK 178 Query: 202 LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIR 236 + A P E+ D + HGR + R Sbjct: 179 I-LARITALPWAEVPAAATD----DSRGHGRVKTR 208 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 61.2 bits (147), Expect = 6e-08, Method: Compositional matrix adjust. Identities = 48/176 (27%), Positives = 78/176 (44%), Gaps = 15/176 (8%) Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 K E + G D L +KG +L A L + SY + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRT---LCQSRAHAEQSYTVDLGRRNRIEQRT 62 Query: 238 HIVCDVP-----DELID-FTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 + +P D D F +G +++ V + +++ P YY+++ + Sbjct: 63 VRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQESPA----YYLATCTAS 118 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRNPG--VFALLRHFALNLLRHN 172 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 60.8 bits (146), Expect = 7e-08, Method: Compositional matrix adjust. Identities = 45/121 (37%), Positives = 61/121 (50%), Gaps = 11/121 (9%) Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTF-EWKGLKK-LCVAVSFRSIIAEQKKEPEMTVRY 283 S +S GREE R C E + EW+ ++ LCV + Q K T Y Sbjct: 7 SIQSRGREEHR----CIQVYEPVGIALQEWEAIRSVLCV----QRWGTRQGKAYHNTA-Y 57 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 YISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I INI Sbjct: 58 YISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVINI 117 Query: 344 L 344 L Sbjct: 118 L 118 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 60.8 bits (146), Expect = 8e-08, Method: Compositional matrix adjust. Identities = 40/131 (30%), Positives = 63/131 (48%), Gaps = 11/131 (8%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN--- 217 M +KG ++T DAMGCQ+ IA+++++ G D + ++KG QG+ A F ++ Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 218 --PEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK 275 P+HD + E SHGR R V + E + W ++ L V R A + Sbjct: 61 LKPDHDEF---EDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSE 114 Query: 276 EPEMTVRYYIS 286 RYY+S Sbjct: 115 TVTSDFRYYLS 125 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 60.1 bits (144), Expect = 1e-07, Method: Compositional matrix adjust. Identities = 56/198 (28%), Positives = 85/198 (42%), Gaps = 14/198 (7%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 IP +L +++ GK IT DA+ QK +AE I + YLF VK Q L + F ++ Sbjct: 3 IP-ILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHRK 61 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 E D HGR + R +E ++F + + +S + Sbjct: 62 ----EPDYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KEPEMTVRYYISSADLTAEKFATAI---RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S A + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKV 349 + +R AI +L + V Sbjct: 172 NTNRLRGFAIGLLKSKGV 189 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 27/43 (62%), Positives = 38/43 (88%) Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTDEKSNE Sbjct: 29 DGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNE 71 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 58.9 bits (141), Expect = 3e-07, Method: Compositional matrix adjust. Identities = 30/92 (32%), Positives = 51/92 (55%), Gaps = 1/92 (1%) Query: 281 VRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 R+ ISS DL + A+R+HW VE+ +HW LD+ D+ +I R +F+ +R IA Sbjct: 54 TRWNISSLDLHVVQALNAVRSHWQVES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIA 112 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 + + D + RK + A +D +Y +++L Sbjct: 113 MTLFKQDTTKLVSMARKKKMAGLDDDYRSNLL 144 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 41/157 (26%), Positives = 74/157 (47%), Gaps = 10/157 (6%) Query: 99 SSDDKDVIAIDGKTLRHSYD-KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++ + IA+DGK L+ S S RR H++SA + + + +++ K+NE T Sbjct: 127 TAGPRRAIAVDGKALKASARLTSPRR---HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIITTDAM-GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN 216 LL LD+ ++T DA+ + +I+ ++ + Y+ +K Q + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIP 242 Query: 217 NPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFE 253 +A SE HGR E C +PDEL + Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYP 275 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 73/332 (21%), Positives = 126/332 (37%), Gaps = 65/332 (19%) Query: 59 KQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 + G P ++T+ +++C+ WM + A DGK L Sbjct: 15 RPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVL----G 69 Query: 119 KSRRRGA--IHVISAFSTMHSLVIGQ---IKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 S+R GA +H + + + + Q + D + + + E + G++++ DA Sbjct: 70 GSKRAGAPALHGVELVTHTTGMALAQREAVGGDAAAALLALLTEA----PLDGRMVSMDA 125 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFP-----------------LKELN 216 + + I ++ G+YL VKG Q ++ P L ++ Sbjct: 126 GFLNAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIA 185 Query: 217 NPEHDSYAIS----------------EKSHGREEIRLHIVCDVPD--ELIDFTFEWK--- 255 P I E+S GR EIR V D D + + W+ Sbjct: 186 PPRRKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVT 245 Query: 256 ---GLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWR 312 GL++ C R A+ E+TV +SS T +F +IRNHW +EN++H Sbjct: 246 QIGGLRRWC-----RRRHADLWTVEEVTV---VSSRQRTPAQFLASIRNHWTIENQVHRP 297 Query: 313 LDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 D M ED R + + R++ IN++ Sbjct: 298 RDGSMQEDRLHGR--AIGVILAVCRNVVINLI 327 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 56.6 bits (135), Expect = 1e-06, Method: Compositional matrix adjust. Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + +KE + V +SS + + +R HW +EN+LHW D V ED C R GN A Sbjct: 38 GKTRKETALGV-TSLSSGQASPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGA 96 Query: 331 ELFSGIRHIAINIL 344 + + +R++ I++L Sbjct: 97 HVMATLRNMTISLL 110 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 56.2 bits (134), Expect = 2e-06, Method: Compositional matrix adjust. Identities = 26/85 (30%), Positives = 44/85 (51%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 27 VLKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLP 86 Query: 66 NGIPVHDTIARVVSCISPAKFHECF 90 GIP HDT RV+ + P + F Sbjct: 87 KGIPSHDTFGRVLRILEPKQLQSGF 111 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 55.1 bits (131), Expect = 4e-06, Method: Compositional matrix adjust. Identities = 33/95 (34%), Positives = 47/95 (49%), Gaps = 4/95 (4%) Query: 69 PVHDTIARVVSCISPAKFHECFINWM----RDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 PV+ ++ ++ I P F R C + IAIDGKTLR S+D Sbjct: 12 PVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFSDTK 71 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL 159 A +V+SAF+ H +++ DEKSNEI A L+ Sbjct: 72 AAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALI 106 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 25/67 (37%), Positives = 41/67 (61%) Query: 307 NKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K A M+ + Sbjct: 23 HQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKRLLACMEDD 82 Query: 367 YLASVLA 373 + +L Sbjct: 83 FREELLG 89 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 54.7 bits (130), Expect = 6e-06, Method: Compositional matrix adjust. Identities = 31/95 (32%), Positives = 47/95 (49%), Gaps = 13/95 (13%) Query: 266 FRSIIAEQKKEPEMTVR-----------YYISSADLTAEKFATAIRNHWHVENKLHWRLD 314 FR++I Q+ R YY+ L A +F+ AIRNHW VEN+ H+ D Sbjct: 70 FRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNHWRVENRAHYVRD 129 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ED +IRR F+ +R A+N++ ++V Sbjct: 130 TRFQEDASRIRRNPCT--FALLRSFALNLMRFNRV 162 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 54.3 bits (129), Expect = 7e-06, Method: Compositional matrix adjust. Identities = 41/138 (29%), Positives = 67/138 (48%), Gaps = 8/138 (5%) Query: 216 NNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSII---AE 272 N+ D+ K R+E R V V D L EW+ K + V+ R+++ A Sbjct: 18 NDAPADTAFSRNKGRSRQEDRTVEVFPVGDALAGT--EWQPFIKTIIRVTRRTLLHSAAT 75 Query: 273 QKKEPEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 + V +Y+SSA + A +A AIR HW +EN+ H+ DV +ED +IR + Sbjct: 76 GLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNHYVRDVSCDEDKSRIR--DNPG 133 Query: 332 LFSGIRHIAINILTNDKV 349 + + R A+NI+ + + Sbjct: 134 IMARARSFALNIMRKNGI 151 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 53.9 bits (128), Expect = 8e-06, Method: Compositional matrix adjust. Identities = 49/189 (25%), Positives = 80/189 (42%), Gaps = 6/189 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+ H+ IPD R V +LL+ + ++S E D+E F H L + E Sbjct: 13 LISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGIE 72 Query: 66 NGIPVHDTIARVVSC-ISPAKFHECFINWM--RDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 P D+ R + A +W + + D D + DGKTLR S + + Sbjct: 73 LKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTSG 132 Query: 123 RGA--IHVISAFSTMHSLVIGQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 GA I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 133 GGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQA 192 Query: 180 IAEKIQKQG 188 Q +G Sbjct: 193 FFGSSQSRG 201 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 53.9 bits (128), Expect = 1e-05, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 44/77 (57%), Gaps = 5/77 (6%) Query: 272 EQKKEPEMTVRYYISSADLTAEKFATA-----IRNHWHVENKLHWRLDVVMNEDDCKIRR 326 E+ + TV + L+AEK A +R HW +EN+LH+ DV + ED C++R Sbjct: 10 ERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRVRM 69 Query: 327 GNAAELFSGIRHIAINI 343 G+A ++ + +R+ +++ Sbjct: 70 GHAPQVLAALRNAVVHL 86 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 53.5 bits (127), Expect = 1e-05, Method: Compositional matrix adjust. Identities = 37/123 (30%), Positives = 64/123 (52%), Gaps = 17/123 (13%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK---EPEMTVRYY 284 K HGR E R L ++ W G++++ FR + Q++ + + V Y Sbjct: 3 KGHGRVERR---SITTTTWLNEYLTRWPGVQQV-----FR--LERQRRADGKTTVEVVYG 52 Query: 285 ISSADLTAEKFATAI---RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 ISS A T + R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ Sbjct: 53 ISSLSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAV 111 Query: 342 NIL 344 +L Sbjct: 112 YLL 114 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 51.6 bits (122), Expect = 5e-05, Method: Composition-based stats. Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 4/81 (4%) Query: 267 RSIIAEQKKEPE--MTVR--YYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 RSI E+ +E +TV+ +Y+SS + +A + IR HW VEN++H+ DV ED Sbjct: 15 RSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGEDRS 74 Query: 323 KIRRGNAAELFSGIRHIAINI 343 +IR +++S R A+N+ Sbjct: 75 RIRTLPLVQVWSVARSFALNL 95 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 49/206 (23%), Positives = 91/206 (44%), Gaps = 14/206 (6%) Query: 7 MEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-----QY 61 + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL + Sbjct: 10 LPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDEVHIRT 69 Query: 62 GDFENGIPVHDTIARVVSCISP--AKFHECFINWMRDC-----HSSDDKDVIAIDGKTLR 114 E +P T+ R+ +S + ++W R+ D+ +A+DGK LR Sbjct: 70 RRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVDGKHLR 129 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 + R A+ +SA L +G Q D ++ + + L + ++T DA Sbjct: 130 GTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WVLTGDA 188 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQ 199 C +++A + +Q G A KGT+ Sbjct: 189 ALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 51.2 bits (121), Expect = 6e-05, Method: Compositional matrix adjust. Identities = 25/60 (41%), Positives = 35/60 (58%), Gaps = 2/60 (3%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI--LTNDKVFKAGLRRKMR 359 HW +EN+LHW DV +ED + R GNA ++ + +R++AI I LT K LR R Sbjct: 100 HWAIENRLHWVRDVTYDEDRHRARTGNAPQVMTSLRNLAITILRLTGAKNIAKALRHHAR 159 >UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3B6_ECO57 Length = 50 Score = 50.8 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 25/36 (69%), Positives = 27/36 (75%) Query: 343 ILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS 378 I ND VFKAGL KMRKA MDRN+LAS +A GLS Sbjct: 15 ISDNDNVFKAGLSCKMRKAVMDRNFLASGIAACGLS 50 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 50.4 bits (119), Expect = 1e-04, Method: Compositional matrix adjust. Identities = 28/55 (50%), Positives = 31/55 (56%), Gaps = 13/55 (23%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF----SG---------IRHIAINILTNDKVF 350 +HWRLDV MNEDDC+IRRGN F SG +R I INIL VF Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILKCTLVF 55 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 29/108 (26%), Positives = 54/108 (50%), Gaps = 5/108 (4%) Query: 267 RSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 R ++ + E Y ++S A++ R HW VEN+LH + D V+ ED + R+ Sbjct: 15 RRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKRDTVLGEDASRSRK 74 Query: 327 GNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 G A ++ +R + +N+L + + + R +RK + D L ++ G Sbjct: 75 GAAGLMY--LRDVILNLL---HLKRWPVLRSVRKFSADPKVLLRLIRG 117 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 49.3 bits (116), Expect = 2e-04, Method: Compositional matrix adjust. Identities = 44/187 (23%), Positives = 85/187 (45%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L E +S IPD R + ++ L +L L + A +S + +E F + L G + Sbjct: 3 LREVLSQIPDPRARNR-QYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRK 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + +D +V+ +DGK L+ S + Sbjct: 62 P--PGHTILTLLLHRLDPEKLQEALLQVF---PGADLGEVLVVDGKHLKGSGKGKSPQ-- 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD---IKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + ++ A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGREDQ--ALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 49.3 bits (116), Expect = 3e-04, Method: Composition-based stats. Identities = 22/38 (57%), Positives = 28/38 (73%), Gaps = 1/38 (2%) Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 18 RYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 48.9 bits (115), Expect = 3e-04, Method: Compositional matrix adjust. Identities = 33/98 (33%), Positives = 49/98 (50%), Gaps = 6/98 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVI----SGAEGWEDIEDFGETHLDF 57 +LKKL+E S IPD R+ V+H+L+ +LL + + + S E D+ L Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSR--PAFLQA 136 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMR 95 L+ +P DT+ARV+ I P K E FI +R Sbjct: 137 LQGLFPELETLPHGDTLARVLERIEPQKLEESFIRLLR 174 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 48.5 bits (114), Expect = 4e-04, Method: Compositional matrix adjust. Identities = 48/203 (23%), Positives = 78/203 (38%), Gaps = 52/203 (25%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVK----GTQGRLNKAFEEKFPLKELNNPEHDSYAISEKS 229 MGCQK+IA+ I KQ DY+ A+K G QG L +A+ K + D + + Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGEL-EAWWHKCQREGFTADNFDEHTTIDSG 59 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR E R V ++ ++W GLK + I Sbjct: 60 HGRIETRRCQQVLVNKSWLNNKYQWVGLKSI------------------------IKVTS 95 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 EK T + +IR+G F+ +R IA+ + ++ Sbjct: 96 DVHEKTTT-----------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQT 132 Query: 350 FKAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 133 KRASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 47.8 bits (112), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 15/188 (7%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L + +S +PD R + + L +L L + A +S + +E F + L G + Sbjct: 3 LRQALSQVPDPRAHNR-RYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLGLRK 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS-YDKSRRRG 124 P H I ++ + P K + +D +V+ +DGK LR S KS + Sbjct: 62 A--PGHTAITLLLHRLDPEKLQAALGQVFPE---ADLGEVLVVDGKHLRGSGKGKSPQVK 116 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD---IKGKIITTDAMGCQKDIA 181 + V++ +H+ + Q + + E A ELL+ L+ ++GK++ DA ++A Sbjct: 117 LVEVLALH--LHT-TLAQARAE--GREEKAFLELLDRLEARELEGKVVVGDAGYLYPEVA 171 Query: 182 EKIQKQGG 189 +++K+GG Sbjct: 172 ARVRKKGG 179 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 47.8 bits (112), Expect = 7e-04, Method: Compositional matrix adjust. Identities = 35/115 (30%), Positives = 56/115 (48%), Gaps = 8/115 (6%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L ++ IPD+R+ + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 2 QLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQL 61 Query: 65 E-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV----IAIDGKTLR 114 PVH +I + + AK E + + R D + IA+DGKTLR Sbjct: 62 HWKRAPVHTSIRYALQGLD-AKAGE--LAFHRHASGLDGEGAQHASIAMDGKTLR 113 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 46.6 bits (109), Expect = 0.001, Method: Compositional matrix adjust. Identities = 38/125 (30%), Positives = 54/125 (43%), Gaps = 9/125 (7%) Query: 223 YAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 + S HGR E R C + DEL F G L V + + +E TV Sbjct: 25 HTASSAGHGRRESRSIKTCGIADELGGIAFP-HGRLALRVHRRRKQTGGCESRE---TV- 79 Query: 283 YYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHI 339 Y ++S D T + A A+R HW VE H R DV E+ + G A + R++ Sbjct: 80 YAVTSLDAHETTPAELAAAVRGHWTVEALRHVR-DVTYAEEASTLHTGTAPRAMATFRNL 138 Query: 340 AINIL 344 A+ +L Sbjct: 139 AVGLL 143 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 45.8 bits (107), Expect = 0.003, Method: Compositional matrix adjust. Identities = 39/122 (31%), Positives = 53/122 (43%), Gaps = 16/122 (13%) Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS-------FRSIIAEQKKEPEMTVR 282 HGR+E R V DV L W GL V+ +S + + +E + Sbjct: 12 HGRQEHRWVEVFDVSGRLGP---TWDGLIAAVARVTRLTWHKDTKSGLWHKTQETAL--- 65 Query: 283 YYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R A+N Sbjct: 66 -YACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIR--TKPGHFARLRSFALN 122 Query: 343 IL 344 IL Sbjct: 123 IL 124 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 45.1 bits (105), Expect = 0.005, Method: Compositional matrix adjust. Identities = 33/115 (28%), Positives = 53/115 (46%), Gaps = 9/115 (7%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGET-HLDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T + L++ G Sbjct: 16 LWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGCQ 75 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTL 113 E+ P T+ RV+ I NW+ S +A+DGKTL Sbjct: 76 ESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSLGLS--PAALAVDGKTL 128 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 45.1 bits (105), Expect = 0.005, Method: Composition-based stats. Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 8/82 (9%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS----- 226 D +GCQK IA+ I +Q DYL AVK Q L++A F +E N Y I Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYF--EEANKARFAGYNIDYDEKI 65 Query: 227 EKSHGR-EEIRLHIVCDVPDEL 247 K GR E+ R + ++PD + Sbjct: 66 NKGPGRLEQRRCWVGYEIPDTI 87 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 42.7 bits (99), Expect = 0.022, Method: Compositional matrix adjust. Identities = 29/103 (28%), Positives = 54/103 (52%), Gaps = 15/103 (14%) Query: 267 RSIIAEQKKEPEMTVRYYISS-----ADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 R ++ + E TV Y ++S AD A + + + W VEN+ W D +++ED Sbjct: 51 REVVRKGTGEVRRTVSYALTSLGPEVAD--ARRLGELLLSRWEVENRSFWVRDFLLHEDA 108 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 C++ RG A++ + +R +++L + G+R K KAA++ Sbjct: 109 CQV-RGVGAQVLAALRAFLVSLL-----HRQGVREK--KAALE 143 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 42.4 bits (98), Expect = 0.026, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 41/84 (48%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ + + + D R T +H+ DI+++ +C V+ G +G I + ++L+ + Sbjct: 8 VESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQGFL 67 Query: 63 DFENGIPVHDTIARVVSCISPAKF 86 + NG+P D I + + P F Sbjct: 68 ELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 42.0 bits (97), Expect = 0.038, Method: Compositional matrix adjust. Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 6/74 (8%) Query: 103 KDVIAIDGKTLRHS--YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 + +A+DGKT RH+ D S+ +H++ S ++ Q++ + K+NE LL Sbjct: 153 ESAVALDGKTSRHAKRADGSK----VHLVGVASHGDGRLLAQVEVEAKTNETAVFRRLLR 208 Query: 161 MLDIKGKIITTDAM 174 LD+ ++T DA+ Sbjct: 209 PLDLTNVLVTADAL 222 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 41.2 bits (95), Expect = 0.057, Method: Compositional matrix adjust. Identities = 29/118 (24%), Positives = 52/118 (44%), Gaps = 9/118 (7%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 E HGR+ + + P + W G + ++ + ++P +I+ Sbjct: 35 EIGHGRDILWTLRAKEAPQHI---KANWHGTSWIAEVIA----TGTRDRKPFKATHRFIT 87 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 S T + +R W VE+ HW D ++EDD + RGN A + + +R A+N+L Sbjct: 88 SLRTTPDALLRLVRERWSVES-WHWIRDTQLHEDDHRY-RGNGAGVMAALRTAAMNLL 143 Searching..................................................done Results from round 2 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 505 e-141 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 426 e-118 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 420 e-116 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 417 e-115 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 414 e-114 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 413 e-114 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 407 e-112 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 401 e-110 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 394 e-108 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 384 e-105 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 376 e-103 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 373 e-102 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 371 e-101 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 370 e-101 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 366 e-100 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 361 2e-98 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 359 1e-97 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 357 4e-97 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 357 5e-97 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 353 5e-96 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 352 9e-96 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 350 4e-95 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 349 1e-94 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 347 3e-94 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 347 3e-94 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 347 5e-94 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 346 8e-94 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 345 1e-93 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 342 1e-92 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 338 2e-91 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 337 5e-91 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 337 5e-91 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 336 9e-91 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 334 3e-90 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 333 4e-90 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 330 5e-89 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 330 5e-89 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 324 3e-87 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 323 8e-87 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 320 4e-86 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 316 9e-85 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 312 2e-83 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 310 6e-83 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 306 1e-81 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 303 1e-80 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 285 2e-75 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 284 3e-75 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 279 1e-73 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 274 5e-72 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 271 2e-71 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 271 3e-71 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 270 7e-71 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 266 7e-70 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 266 7e-70 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 266 9e-70 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 266 1e-69 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 261 3e-68 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 258 2e-67 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 250 9e-65 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 250 9e-65 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 248 4e-64 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 246 1e-63 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 241 3e-62 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 240 6e-62 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 236 9e-61 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 224 3e-57 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 224 3e-57 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 221 2e-56 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 221 4e-56 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 220 6e-56 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 220 7e-56 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 213 1e-53 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 204 3e-51 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 204 4e-51 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 203 8e-51 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 198 3e-49 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 196 1e-48 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 194 4e-48 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 186 8e-46 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 186 1e-45 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 184 4e-45 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 180 7e-44 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 179 1e-43 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 179 2e-43 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 167 7e-40 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 166 1e-39 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 166 1e-39 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 166 1e-39 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 165 2e-39 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 165 2e-39 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 165 3e-39 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 164 3e-39 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 156 2e-36 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 154 6e-36 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 152 2e-35 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 151 4e-35 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 151 5e-35 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 149 1e-34 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 148 3e-34 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 146 1e-33 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 145 3e-33 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 145 3e-33 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 144 3e-33 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 144 4e-33 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 142 2e-32 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 140 7e-32 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 139 1e-31 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 139 2e-31 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 136 1e-30 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 134 6e-30 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 134 7e-30 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 133 1e-29 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 131 5e-29 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 130 1e-28 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 129 1e-28 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 128 3e-28 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 128 3e-28 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 127 7e-28 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 124 4e-27 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 123 1e-26 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 123 1e-26 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 117 8e-25 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 115 2e-24 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 115 3e-24 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 115 4e-24 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 114 7e-24 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 108 3e-22 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 107 5e-22 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 107 1e-21 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 104 6e-21 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 103 1e-20 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 102 2e-20 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 100 2e-19 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 98 4e-19 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 98 4e-19 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 98 4e-19 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 98 4e-19 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 98 6e-19 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 96 2e-18 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 96 2e-18 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 96 3e-18 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 96 3e-18 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 94 1e-17 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 94 1e-17 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 91 6e-17 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 90 2e-16 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 89 2e-16 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 89 3e-16 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 88 6e-16 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 86 2e-15 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 83 1e-14 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 82 4e-14 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 82 4e-14 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 80 2e-13 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 79 2e-13 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 75 4e-12 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 72 3e-11 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 65 4e-09 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 63 2e-08 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 59 3e-07 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 57 1e-06 UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX 47 0.001 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 46 0.002 Sequences not found previously or not previously below threshold: UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 130 7e-29 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 99 2e-19 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 97 7e-19 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 92 3e-17 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 91 6e-17 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 91 8e-17 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 87 1e-15 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 86 2e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 85 3e-15 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 85 4e-15 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 84 8e-15 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 84 9e-15 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 82 3e-14 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 80 1e-13 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 80 1e-13 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 79 4e-13 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 73 1e-11 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 73 2e-11 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 73 2e-11 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 72 4e-11 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 69 2e-10 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 69 2e-10 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 67 1e-09 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 66 3e-09 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 65 3e-09 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 64 9e-09 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 63 2e-08 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 62 5e-08 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 61 7e-08 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 60 1e-07 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 59 2e-07 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 59 3e-07 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 59 3e-07 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 58 7e-07 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 57 9e-07 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 57 1e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 57 1e-06 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 55 3e-06 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 55 4e-06 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 55 5e-06 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 54 7e-06 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 54 1e-05 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 52 3e-05 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 51 6e-05 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 50 1e-04 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 50 1e-04 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 50 2e-04 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 49 2e-04 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 49 4e-04 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 48 6e-04 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 48 6e-04 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 48 7e-04 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 47 0.001 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 47 0.002 UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodoco... 45 0.004 UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodosp... 45 0.004 UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus p... 45 0.004 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 45 0.005 UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewane... 45 0.006 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 44 0.007 UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromon... 44 0.009 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 44 0.009 UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia ... 44 0.011 UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms Rep... 44 0.014 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 43 0.019 UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=... 43 0.021 UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b... 42 0.025 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 42 0.035 UniRef50_B5JUE3 Transposase, IS4 family protein n=4 Tax=gamma pr... 42 0.052 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 41 0.066 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 41 0.073 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 505 bits (1300), Expect = e-141, Method: Composition-based stats. Identities = 378/378 (100%), Positives = 378/378 (100%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLAGSGLS 378 AAMDRNYLASVLAGSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 426 bits (1095), Expect = e-118, Method: Composition-based stats. Identities = 158/377 (41%), Positives = 222/377 (58%), Gaps = 10/377 (2%) Query: 2 ELKKLMEHISIIPDYRQT-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K E+ + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 + NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + +H++SA++T +LVIGQIKT+E SNEITAIPELLN LD+KG +++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN---NPEHDSYAISEKSHGREEIRL 237 AEKI ++ DY+ A+KG Q +L+++ E F L N E D E S+GREEIR Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 245 AYATNEIEKIIAN-DEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 MRKAAMDRNYLASVLAG 374 A D YL +L G Sbjct: 359 RLMAGWDEKYLLKLLNG 375 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 420 bits (1079), Expect = e-116, Method: Composition-based stats. Identities = 183/377 (48%), Positives = 250/377 (66%), Gaps = 4/377 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ +SII D RQ KV H L D+L L I AVISG EGWE+I+DFG LD+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y F GIP DTI+R+ I P +F +CF WM+ C DVIAIDGKTLR S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A+KI +GGDYL VKG Q RL A + F ++ L PE ++Y EK HGRE+ R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +E+ D FEW GLK L AVSFR E+ + + V++YISSA L A+ A R Sbjct: 241 ADA-NEIGDLVFEWPGLKTLGYAVSFR---TEKDMQTTVAVKFYISSAKLDAKSLLEASR 296 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VEN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++ Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 AAMDRNYLASVLAGSGL 377 A +Y V++G L Sbjct: 357 ANRSDSYRELVVSGLSL 373 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 417 bits (1073), Expect = e-115, Method: Composition-based stats. Identities = 169/373 (45%), Positives = 235/373 (63%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G L++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD+KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKE---LNNPEHDSYAISEKSHGREEIRLHIVC 241 + GDY+ AVK Q +L++ + F HD + S K HGR E+R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D+ L W L+ + + S R I + RY+I+S A+ FA A+R Sbjct: 246 DMLSTLG-NPERWASLQSIGMVESERYI----DGKTTAETRYFITSIAPDAKIFANAVRK 300 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKA 360 Query: 362 AMDRNYLASVLAG 374 + +Y VL G Sbjct: 361 TLQPDYAQKVLNG 373 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 414 bits (1065), Expect = e-114, Method: Composition-based stats. Identities = 149/385 (38%), Positives = 225/385 (58%), Gaps = 20/385 (5%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 ++ + D R +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIH 127 IP HDT RV S ++P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE---------------LLNMLDIKGKIITTD 172 +ISA++T + LV+GQ DEKSNEITAIP+ LL +L + G I+T D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE---HDSYAISEKS 229 A+GCQK+I ++I +Q DY+ +K QG L + E F ++N E Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR+E+R + + E ID ++W L + R + + + RY+ISS + Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLN 308 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + FA+++R HW +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K Sbjct: 309 NNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKT 368 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K G++ K +KA D NYL VL Sbjct: 369 LKVGVKAKRKKAGWDENYLLKVLRN 393 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 413 bits (1062), Expect = e-114, Method: Composition-based stats. Identities = 145/375 (38%), Positives = 218/375 (58%), Gaps = 7/375 (1%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++EH S + D R ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 NG+P HDT V + + P + +CF+NW + + ++IAIDGKTLR + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ DEKSNEITAIPELL +L+++G +++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLH 238 E I + GDY+ A+KG QG L + F + EHDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R + + RYY+ S + A++FA A Sbjct: 245 WTMGQTDYLLG-AERWAQLKSIGCVESCRR---QPGHPGTLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLA 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 407 bits (1045), Expect = e-112, Method: Composition-based stats. Identities = 137/373 (36%), Positives = 212/373 (56%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ ++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP---EHDSYAISEKSHGREEIRLHIVC 241 +Q DY+ +K L ++ F + N EHD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V + +W GL+ + V R + + +++Y++S A+ AIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWN----KTTHDIQFYLTSLPPNAQFLCHAIR 325 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM++ Sbjct: 326 THWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMKQ 385 Query: 361 AAMDRNYLASVLA 373 AAM+ NY+ +VL Sbjct: 386 AAMNNNYMMTVLN 398 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 401 bits (1029), Expect = e-110, Method: Composition-based stats. Identities = 160/372 (43%), Positives = 229/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +H S I D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A I +GGDYL AVK QG L KA + F D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGL-SDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 DFT W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDFTH-WEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 394 bits (1011), Expect = e-108, Method: Composition-based stats. Identities = 155/372 (41%), Positives = 216/372 (58%), Gaps = 10/372 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + + H S I D RQ KV + L +ILLLT+CAV+SGA W I +G L FLK++ F Sbjct: 24 EFLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPF 83 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + Sbjct: 84 ADGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKA 142 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 143 AIHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKI 202 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLK---ELNNPEHDSYAISEKSHGREEIRLHIVC 241 + DY+ A+KG QG L K E + + ++ + EKSHGR E R VC Sbjct: 203 ISKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVC 262 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D L W GLK + + A + + RYYISS AE A AIR+ Sbjct: 263 TDIDWL-KADHNWPGLKSIVMVQY----HAILQDKTRAETRYYISSMTSDAEHHAKAIRD 317 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A Sbjct: 318 HWGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVK-GKHSLRSKRHIA 376 Query: 362 AMDRNYLASVLA 373 + D ++LA ++ Sbjct: 377 SWDDDFLAEIIN 388 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 384 bits (986), Expect = e-105, Method: Composition-based stats. Identities = 134/380 (35%), Positives = 218/380 (57%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L+EH I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQG--Q 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++SA++ +SLV+GQI+ +K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLK----------ELNNPEHDSYAISEKSHGRE 233 I + +Y+ A+KG QG+ ++ + E N +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + ++ P + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLAD-RQQWAGLRSVGVVESVRQV---GQQAPTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLA 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 376 bits (965), Expect = e-103, Method: Composition-based stats. Identities = 127/380 (33%), Positives = 197/380 (51%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L E I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDC--------HSSDDKDVIAIDGKTLRH 115 NGIP HDT +V S + P +F E F W + S K VIAIDGK LR Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + DK + ++ A+++ SL +GQ+K +KSNEI A+PELL ML +KG I+T DAMG Sbjct: 134 AVDKG--QAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE-LNNPEHDSYAISEKSHGREE 234 CQ+++A KI +Q GDY+ A+K Q L++ E L E + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 +R V + + + +W GL+ + R++ + + RY+ISS A Sbjct: 252 VRRCWVSEEVECWLQGAEKWAGLRSVAAVECERTV----AGQTTVQRRYFISSLKADAAL 307 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAG 353 A ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKS 367 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + ++ +A + +YL ++L Sbjct: 368 VNQRRFEAGLSTDYLQTLLG 387 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 373 bits (957), Expect = e-102, Method: Composition-based stats. Identities = 136/384 (35%), Positives = 198/384 (51%), Gaps = 15/384 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ + K V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQ-EVKGVVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGR---LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 DI + I ++ +Y+ A+K + + L K + + ++ + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AE 293 R V F + GLK + S R+I+A E VRYY++S D T E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A+AIR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K Sbjct: 301 EIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGS 359 Query: 354 LRRKMRKAAMDRNYLASVLAGSGL 377 + K KA D YL+ +L + Sbjct: 360 MNLKRLKAGWDEKYLSQLLQNNNF 383 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 371 bits (953), Expect = e-101, Method: Composition-based stats. Identities = 134/369 (36%), Positives = 195/369 (52%), Gaps = 9/369 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++ I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGTQGRLN---KAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 KQ DY+ A+KG Q L + + E+F E+ + E +H R E R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 V +W GL+ L V S R + + RY++SS A FA IR Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKD----TTETRYFLSSLSTDAATFAHYIRA 309 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K +A Sbjct: 310 HWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKRYRA 368 Query: 362 AMDRNYLAS 370 +D ++ Sbjct: 369 GLDDQFMMQ 377 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 370 bits (949), Expect = e-101, Method: Composition-based stats. Identities = 128/372 (34%), Positives = 198/372 (53%), Gaps = 7/372 (1%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L ++E + D+R + H+LS++L + +CAV+SGA+ +E+I +G + +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKSR 121 + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + K+ Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +H++SAF+ +V+GQ T EKSNEITAIPELL +LDI+G I+T DAMG Q IA Sbjct: 126 -AAPLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 I+++G Y+ VK +L + ++ + HGR E+R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D D L WK + V R++ + YYISS AE+ A AIR+ Sbjct: 245 DATDRLHK-AEAWKDVASFAVVERVRTV----GERTSTERVYYISSLPADAERIAVAIRS 299 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K ++ K A Sbjct: 300 HWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLLA 359 Query: 362 AMDRNYLASVLA 373 A + A++L Sbjct: 360 ATSDEFRAALLG 371 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 366 bits (940), Expect = e-100, Method: Composition-based stats. Identities = 141/377 (37%), Positives = 202/377 (53%), Gaps = 12/377 (3%) Query: 4 KKLMEHISIIPDYR-QTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 LM + D R ++ H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 7 ASLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFL 66 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 NG+ +T R+ + P +F F W+ + + +DGKT+R S S Sbjct: 67 VLLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGG 123 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++T DAMGCQK+IA Sbjct: 124 ESAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIAR 183 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 +I QGGDYL AVKG Q L A E +F + + + + D + SHGR ++ V Sbjct: 184 QITDQGGDYLLAVKGNQPTLLDAIETEF-IDQYQSDDVDRHRQVHPSHGRIVAQIASVL- 241 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 P E I +W KK+ S R + E ++ RYYISS +LTAE+ A A+R H Sbjct: 242 -PAEGIVDLADWPECKKIARVDSLRKV---GNHESKLERRYYISSRELTAEQLAAAVRAH 297 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV--FKAGLRRKMRK 360 W +EN+LHW LDV ED IR+GNA + S ++ I +N++ D K LR K + Sbjct: 298 WGIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKC 357 Query: 361 AAMDRNYLASVLAGSGL 377 AA + +L + L Sbjct: 358 AAWTDDVRMRILGFTSL 374 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 361 bits (926), Expect = 2e-98, Method: Composition-based stats. Identities = 130/373 (34%), Positives = 206/373 (55%), Gaps = 10/373 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ LD+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGDRK-T 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE---HDSYAISEKSHGREEIRLHIVC 241 +GGDY+ VK QG+L F + P+ +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 + L + W +K + R + K + YYISS ++ + A AIR+ Sbjct: 241 PITPWLTQ-SQGWTNIKPVIEVTRKRYL----KDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN HW LD+ EDD +IRRG+A E + R A+N+ K ++ K+++A Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSP-IKDSMKGKLKQA 354 Query: 362 AMDRNYLASVLAG 374 A +L Sbjct: 355 AWSDEVREKLLFA 367 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 359 bits (920), Expect = 1e-97, Method: Composition-based stats. Identities = 127/380 (33%), Positives = 218/380 (57%), Gaps = 14/380 (3%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ ++E+ + + D R+ +H L D+L++ + AVI+GA+G I + E H+++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----DDKDVIAIDGKTLRHS 116 + +G+P HDTI R+++ + P F +CF W+ + D +++IAIDGKTLR S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ + G + + SA++ + +GQ+ +KSNEI PEL+ +D++ I+T DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK---ELNNPEHDSYAISEKSHGRE 233 Q+D+AEKI GDY+ A+K Q RL++ + + + + + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 + R + +PDE + +W+GLK + VA+ I+++ RYYISS A+ Sbjct: 249 DKRFYYQVKLPDE-VPAGEDWRGLKTIGVAIR----ISQENGRETCDTRYYISSLKPDAK 303 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKS-KES 362 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + + R A + N+LA +L Sbjct: 363 VVMRRRMAGWNVNFLAEILG 382 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 357 bits (915), Expect = 4e-97, Method: Composition-based stats. Identities = 132/378 (34%), Positives = 197/378 (52%), Gaps = 12/378 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M K L++++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-DVIAIDGKTLRHSYDK 119 + GIP HDT R+ + + PA F W+ D D +A+DGK LR + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A+H+++ +ST + +GQ K +KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFP-LKELNNPEHDSYAISEK---SHGREEI 235 IA+ I K+ GDYL AVK Q LN +E+F + N + + +E+ HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V V DE + +WK K + + R + + VR+YISS L A Sbjct: 240 RRCWVLMV-DESMPVCQQWKA-KTIIAVQAERI----ENGKGYDFVRFYISSRALDATSA 293 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K + Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 356 RKMRKAAMDRNYLASVLA 373 K R ++ YL + Sbjct: 354 NKRRLCCLNEQYLFECMG 371 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 357 bits (915), Expect = 5e-97, Method: Composition-based stats. Identities = 141/375 (37%), Positives = 199/375 (53%), Gaps = 16/375 (4%) Query: 7 MEHISIIPDYRQT-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 M + I D R+ H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-----IAIDGKTLRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGKT+R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++T DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A++I + GDYL VKG Q +L +A E F + + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAF-IDQHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 I +W + S R + K+ ++ RYYISS L+AE+ A A+R Sbjct: 238 LSAKG--IVDPADWPKCVTIGRIDSMRVV---GDKQSDLERRYYISSRALSAEQLAAAVR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKM 358 HW VEN+LHW LDV +ED + + NA + S +R IA+ I+ DK K+ LR K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 359 RKAAMDRNYLASVLA 373 + AA D +L Sbjct: 353 KGAAWDDGVRERMLG 367 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 353 bits (906), Expect = 5e-96, Method: Composition-based stats. Identities = 128/351 (36%), Positives = 187/351 (53%), Gaps = 5/351 (1%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+++LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KG QG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L+ + F +L HGR E R V D L + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTC-IGHGRIEERTCQVADASAWLTEQHSGWAGLASIA 239 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 ++ R+ ++ E R YISS + A R+HW VEN LHW+LDV ED+ Sbjct: 240 AVIATRT--DKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 352 bits (904), Expect = 9e-96, Method: Composition-based stats. Identities = 126/375 (33%), Positives = 193/375 (51%), Gaps = 13/375 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG--D 63 L+E S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGSNE 121 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 122 -LLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH---DSYAISEKSHGREEIRLH-I 239 I +GGDY+ VK Q L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I + + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLA 373 A + +Y S++A Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 350 bits (899), Expect = 4e-95, Method: Composition-based stats. Identities = 129/382 (33%), Positives = 199/382 (52%), Gaps = 15/382 (3%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + +E ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTDEKSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKA--FEEKFPLKELNNPE----HDSYAISEKSHG 231 K+IA KI ++GGDY+ AVKG Q +L + L++ + E EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R E R + + + +W+G+ + + R + + K + S + Sbjct: 241 RIEKRECYLSNDLS-WFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQ 299 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K Sbjct: 300 AKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCK 359 Query: 352 AGLRRKMRKAAMDRNYLASVLA 373 G+R K + + VL Sbjct: 360 CGMRSKRKLCGLGIPTALQVLG 381 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 132/375 (35%), Positives = 199/375 (53%), Gaps = 12/375 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S D R + +H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNPE 116 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK I Sbjct: 117 T-QSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEIRLHI 239 AEKI ++ GDY+ +K + E F + PE ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V D L EWKG+K + RS + +YISS D+ + A + Sbjct: 236 KLKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDIQILAKCV 290 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 291 RGHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLT 349 Query: 360 KAAMDRNYLASVLAG 374 A + +L G Sbjct: 350 AAGWSDEFRDELLLG 364 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 130/377 (34%), Positives = 198/377 (52%), Gaps = 13/377 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ L+EH S I D R ++ H L +ILLL +C ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFP-GRADFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q +K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 IA I+ QG DYL AVK Q L E F + + + HD +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGADHHHD----LDKGHGRVEERHV 246 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ--KKEPEMTVRYYISSADLTAEKFA 296 V D L T + G +L + + RY+ISSA LTAE A Sbjct: 247 SVIREVDWLSG-TRRFPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ K L+ Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQK-SLKT 364 Query: 357 KMRKAAMDRNYLASVLA 373 + + A +YLAS+L Sbjct: 365 RRKMAGWSDDYLASLLN 381 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 347 bits (891), Expect = 3e-94, Method: Composition-based stats. Identities = 124/374 (33%), Positives = 185/374 (49%), Gaps = 11/374 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + I D RQ KV H++ ++L++ C+ + E + D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDL----EGRHIAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI EKSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE---LNNPEHDSYAISEKSHGREEIRLHI 239 +I G DY+ A+K R ++ + F E L+ H E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ + R + P V Y++ S E+ A + Sbjct: 237 ITEELDWYHK-SWKWAGLQSVAQVR--RQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHP-AKVSLRRKRK 352 Query: 360 KAAMDRNYLASVLA 373 A MD + +L Sbjct: 353 LATMDPAFRLQMLG 366 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 347 bits (889), Expect = 5e-94, Method: Composition-based stats. Identities = 113/369 (30%), Positives = 191/369 (51%), Gaps = 6/369 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+EH++++ + R +H L D++ L I A++SGAEGW DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 ++ + VK Q +L +A + +F E E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + T +W ++ + RS + + YY+SS + IR HW + Sbjct: 248 PP-ELTEKWPTIRSIIAVERHRS----ANGKGTVDTSYYVSSLSPKHKLLGHYIRQHWRI 302 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN H+ LDVV NED +I +A E + R +NI+ R K+++A + Sbjct: 303 ENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWND 362 Query: 366 NYLASVLAG 374 +Y A + G Sbjct: 363 DYRAQLFFG 371 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 346 bits (888), Expect = 8e-94, Method: Composition-based stats. Identities = 126/381 (33%), Positives = 191/381 (50%), Gaps = 19/381 (4%) Query: 6 LMEHISIIPDYR-QTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L + +PD R +T H L+DIL + CAVI+GAEGWEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------DDKDVIAIDGKTLRHS 116 +NG+P HDT RV + + P F + F W + + D +A+DGK+ R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K G +H++ + +L++GQ E +EIT ++L LD+ G ++T DA GC Sbjct: 125 A-KPTFSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFP-LKELNNPEHDSYAISEKSHGREEI 235 Q + E I+ +GG+Y+ VKG Q L A F E D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V PD L W G+ + + R + + K E T YY+SS + A + Sbjct: 244 RNVTVVHDPDGL---PAGWAGVGSVALVCRDRQV---KGKANESTAHYYLSSLRVGAAEL 297 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HWH+E+ +HW LDV ED+ + R G+A IR +A+++L K + Sbjct: 298 AGYIRGHWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAG-KKGSIH 355 Query: 356 RKMRKAAMDRNYLASVLAGSG 376 + +A D Y+A VL G Sbjct: 356 TRRLRAGWDDQYMAQVLQGLS 376 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 345 bits (885), Expect = 1e-93, Method: Composition-based stats. Identities = 123/374 (32%), Positives = 192/374 (51%), Gaps = 9/374 (2%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 G+P T ARV S I P +F C WM D+I +DGK+L S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A H+++A+ + +G+++ +KSNEI AIP LLN L+++G II+ DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEK---SHGREEIRLHIV 240 I+ + DY+ A+K R + E F + + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD-LTAEKFATAI 299 + + W+ L+ + S R + E E RYYI+S + + AI Sbjct: 254 LPM-MYFHKYKKYWRDLQAIVRVQSKRH----KGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRI 368 Query: 360 KAAMDRNYLASVLA 373 +AA+ YL V+ Sbjct: 369 QAALSTRYLRKVVG 382 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 342 bits (877), Expect = 1e-92, Method: Composition-based stats. Identities = 135/381 (35%), Positives = 187/381 (49%), Gaps = 20/381 (5%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 IP HDT++R S + F ECF W+ D V+AIDGK + + DKS Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 121 RR-----RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 R ++++SA+S + + +GQ K +EKSNE AIPEL+ LD++ IIT DA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN---PEHDSYAISEKSHGR 232 CQK I + I + DY+ K L E F L E + Y K HGR Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIE--FNLSEESRYYLCHAKRYFEENKGHGR 234 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 E R VC L F W G+K L + S R + KE M RYYISS + Sbjct: 235 SEYREC-VCISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDP 290 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 +IR HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L K Sbjct: 291 IIILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQ-SDIKL 348 Query: 353 GLRRKMRKAAMDRNYLASVLA 373 G+ K + D V+ Sbjct: 349 GMAGKRKACGWDEKIRDKVIG 369 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 338 bits (867), Expect = 2e-91, Method: Composition-based stats. Identities = 120/371 (32%), Positives = 191/371 (51%), Gaps = 9/371 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L +H+S++ D R H L D+L L + AV SG +GW +I+ FGE L++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 NGIP TIAR++ + P C +W+ D ++ K +IAIDGKTLR + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 + GDY+ VK Q L +A + ++ + ++ + +A SEK HGR E R I +P Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQR--ITFQIP 237 Query: 245 DELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 +L +W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 238 SKLSPKLQEKWPSVKTLIAVERHRKI----GNKTSIETSFYLSSHDIDPEYIATAVRGHW 293 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 294 RIENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLL 353 Query: 364 DRNYLASVLAG 374 Y ++ Sbjct: 354 SDEYRELMIFA 364 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 337 bits (863), Expect = 5e-91, Method: Composition-based stats. Identities = 138/376 (36%), Positives = 192/376 (51%), Gaps = 25/376 (6%) Query: 15 DYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D R +H+ S I+L+ I AVI GA+ W IEDFG++ F NGIP HDT Sbjct: 25 DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKLSNFNGIPSHDTF 84 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSY---------------DK 119 R S + P KF E + W++ IAIDGKT+R +Y D Sbjct: 85 NRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQDKRHRKQGVLPDS 143 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + + +HVISAF+T + +GQ+ T EK NEI IPELL+ML IK IIT DA+GCQ+ Sbjct: 144 NTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDCIITIDALGCQRT 203 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFP--LKELNNPEHDSYAISEKSHGREEIRL 237 IAEK+ K GDY+F VK Q +L + + + D Y E+ HGR E R+ Sbjct: 204 IAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYETHEEGHGRNESRI 263 Query: 238 HIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 C+ P L D +WK ++ + R K + R +ISS + A+K Sbjct: 264 CYCCNDPGFLGADIRKKWKNIQSFGYIENTR----NTNKGTTVEKRCFISSLEPDAQKIL 319 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L N+K + + R Sbjct: 320 KNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATLRNNK-REIPINR 377 Query: 357 KMRKAAMDRNYLASVL 372 K A D +L ++ Sbjct: 378 KRLIAGWDNEFLWELI 393 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 337 bits (863), Expect = 5e-91, Method: Composition-based stats. Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 17/375 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + + E +PD R H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDGKTLRHSY 117 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 + +AE I+ +GGDY+ AVK Q L + + S + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGKP--STITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 +V VP D ++ GLK + S R + RY++ S + Sbjct: 240 AVVAAVPQMAQD--HDFAGLKAVARITSKR-------GTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYLASVL 372 +++A + +L ++ Sbjct: 351 LKRAGWNDTFLFELI 365 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 336 bits (861), Expect = 9e-91, Method: Composition-based stats. Identities = 126/379 (33%), Positives = 199/379 (52%), Gaps = 17/379 (4%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHD---SYAISEKSHGREEIR 236 IAEKI+ + DY+ ++K QG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + + WKGLK + + E++ + + RY+ISS E + Sbjct: 239 EYYQT-EKIKWLSQKKAWKGLKSIIM----ERKTLEKEGKRLIEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF--KAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRKMRKAAMDR-NYLASVL 372 R+K + +L VL Sbjct: 353 RKKRYVIGLRPIKHLEEVL 371 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 334 bits (856), Expect = 3e-90, Method: Composition-based stats. Identities = 109/369 (29%), Positives = 176/369 (47%), Gaps = 12/369 (3%) Query: 10 ISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 +PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT +RV + P F CF ++ D V+AIDGKTLR S+D++ R A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFLDHL-GEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELID 249 D+LF +K + L E F + + ++ HGR E+R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 FTF-----EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 GLK L + + T Y+SSA L + A A+R HW Sbjct: 246 DRRFPDEAVLPGLKILGLVE---RTVTSPDGRTTATRTLYLSSAALEPKTLARAVRAHWS 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +E +HW LD +ED + R+ + E + +R +A+N++ + + +R + ++A Sbjct: 303 IEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANN-QDSIRLRRKRAGWS 361 Query: 365 RNYLASVLA 373 +Y ++L Sbjct: 362 DDYARTILG 370 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 333 bits (855), Expect = 4e-90, Method: Composition-based stats. Identities = 115/371 (30%), Positives = 189/371 (50%), Gaps = 7/371 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H+ I D R EH + DI L + AVISGA+ W +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ + S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKGAKA-SASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 K+GGD + VKG Q +L +A + +F NNP+ + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 PA-EIKMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHP-AKTSQTQKFNRACWSD 353 Query: 366 NYLASVLAGSG 376 ++ ++ G+G Sbjct: 354 DFREEIIFGTG 364 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 330 bits (846), Expect = 5e-89, Method: Composition-based stats. Identities = 104/374 (27%), Positives = 173/374 (46%), Gaps = 13/374 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + + + +PD R V H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-DDKDVIAIDGKTLRHSYDK 119 + ++ IP HDT + V I P F + D D D+IAIDGK LR + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 I GGD+ A+KG Q L F ++P + HGR+E R + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDP---TAVTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V + + E+ GLK + R E + RY+ S T E A+ Sbjct: 245 VVSA--KALAEYHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW+LDV ED + R+ N + +R A+++L D K L K++ Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIK 357 Query: 360 KAAMDRNYLASVLA 373 +A D +L S+L+ Sbjct: 358 RAGWDTTFLRSILS 371 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 330 bits (846), Expect = 5e-89, Method: Composition-based stats. Identities = 129/381 (33%), Positives = 200/381 (52%), Gaps = 21/381 (5%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W SD K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T+EKSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGRE 233 MG QK IA+KI K+ DY AVK Q L + F + + + D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A I ++ + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRA----RIHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKA 352 + +R HW +E+ +HW LDVV ED K A + + + +L K Sbjct: 297 ELCDYVRGHWQIES-MHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 353 GLRRKMRKAAMD-RNYLASVL 372 +RRK ++ YL +L Sbjct: 356 SMRRKKYALSLSFDKYLKQLL 376 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 324 bits (831), Expect = 3e-87, Method: Composition-based stats. Identities = 123/412 (29%), Positives = 185/412 (44%), Gaps = 44/412 (10%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK------------------- 103 P HDT+ R I + C+ W + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKTLRHSYDKSR--------------RRGAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT+ + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFE 207 NEI AIP+LL+ +DI +G ++T DA+G QK I EKI ++ DYL VK +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPEHDSYAISE---KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ E+D +E + HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA E + +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIAT--GEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILT--NDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 + N+A+ FS + +A+ IL D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 323 bits (827), Expect = 8e-87, Method: Composition-based stats. Identities = 100/370 (27%), Positives = 166/370 (44%), Gaps = 13/370 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R H L ++L++ +V+ GA ++ FG + + + Sbjct: 37 ILSAFEDVPDPRAE-NTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHS-SDDKDVIAIDGKTLRHSYDKSRRRG 124 + +P HDT + V I P F + D + D DVIA+DGK LR + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++T DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 GGD+ A+K Q L F + +P S + HGR E R V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDAHP---SALSEDIGHGRTETRKATVVS-- 269 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 + + E+ GLK + R + + RY+ S T E +R HW Sbjct: 270 SKALAEHHEFPGLKAFGRVEATR----KTAEGTTSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWD 384 Query: 365 RNYLASVLAG 374 ++L +VL G Sbjct: 385 DDFLRNVLNG 394 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 320 bits (821), Expect = 4e-86, Method: Composition-based stats. Identities = 118/369 (31%), Positives = 180/369 (48%), Gaps = 14/369 (3%) Query: 15 DYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 D RQ+WK+ + LS IL L ++G E +++EDF E + Y D G P HDT+ Sbjct: 19 DSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEPLFATYVDLSEGCPSHDTL 78 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRHSYDKSRRRGAIHVISAFS 133 RV+S ++ + E + + + S D +I++DGKT+R ++ + + +H+++A+ Sbjct: 79 ERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG--NRGKNQKPVHIVTAYD 136 Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLF 193 H L +GQ+ +EKSNEI AIP+LL +DI+ I+T DAMG Q I + I K DY Sbjct: 137 GGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMGTQTAIVDTIIKGKADYCL 196 Query: 194 AVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDF 250 AVKG Q L F L E Y EKS G+ E+R + V L Sbjct: 197 AVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQIEVREYWVSSDIKWLCQN 256 Query: 251 TFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLH 310 +W L+ + + ++ + RY+I S FA +R HW +E+ +H Sbjct: 257 HPKWHKLRGIGMT----RNTIDKDGQLSQENRYFIFSFKPDVLTFANCVRGHWQIES-MH 311 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL--RRKMRKAAMD-RNY 367 W LDVV +ED + AA + IR + + L K L RRK R ++ +Y Sbjct: 312 WLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKKDLSYRRKQRYISVHLEDY 371 Query: 368 LASVLAGSG 376 L + G Sbjct: 372 LVQLFGERG 380 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 316 bits (809), Expect = 9e-85, Method: Composition-based stats. Identities = 109/307 (35%), Positives = 164/307 (53%), Gaps = 7/307 (2%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L + E +PD R + H LS++L + +CAV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 G +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 T-SGPLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A I+ +G DY+ VK L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + + YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTV----DGKTSVERHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 312 bits (798), Expect = 2e-83, Method: Composition-based stats. Identities = 103/369 (27%), Positives = 186/369 (50%), Gaps = 7/369 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + ++K +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 + D++ +KG Q A + ++P + HGR+E R + + Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + + +W ++ L S R++ + + R+Y+SS + + A IR HW + Sbjct: 241 PP-ELSEKWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIRAHWAI 295 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA D Sbjct: 296 ENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWDP 355 Query: 366 NYLASVLAG 374 + + +L G Sbjct: 356 AFRSELLFG 364 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 310 bits (794), Expect = 6e-83, Method: Composition-based stats. Identities = 114/339 (33%), Positives = 170/339 (50%), Gaps = 4/339 (1%) Query: 38 ISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 PEHDSYAISE-KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 + HGR R V D + W L ++ + R I Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFV-DAAATALAPLSGWPDLSRVLAVETLRGI--PGTGT 240 Query: 277 PEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGI 336 +RY+++S IR HW VEN LHW L+V EDD ++R AA F+ + Sbjct: 241 VVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFALV 300 Query: 337 RHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 R IA+N++ D+ +A LR + +KAA D +Y+ ++A Sbjct: 301 RKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIANQ 339 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 306 bits (783), Expect = 1e-81, Method: Composition-based stats. Identities = 115/385 (29%), Positives = 173/385 (44%), Gaps = 43/385 (11%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMRDCHSSDDK--------------------DVIAIDGKTLRHSYDKSR-------- 121 + W + IAIDGKT+ + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDI-KGKIITTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI +G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISE---KSHG 231 G QK I EKI ++ DYL VK +L + E ++ E+D +E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E + +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIAT--GEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 303 bits (775), Expect = 1e-80, Method: Composition-based stats. Identities = 103/372 (27%), Positives = 168/372 (45%), Gaps = 14/372 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R +H L +IL + + AV+ GA ++E F + LD L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDGKTLRHSYDKSR 121 G P HDT +RV++ + P +E F+ +M K +A+DGK+LR +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 V++ F + + Q E E+ A L +L +KG +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQTVAQEGG-EVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 + ++ GG Y+ A+KG Q +L + E +HGR E+R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKAA-AGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 L + S+R++ + + VR Y S + A + +R Sbjct: 240 PFAQTPGKNALV--DLCAIGRVESWRTV----EGKTTHKVRCYALSRKMPAHELLATVRR 293 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW+LDV++ ED + R+ N A + +R + +N+L D K L K KA Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP-EKIPLSHKRLKA 352 Query: 362 AMDRNYLASVLA 373 L S+ Sbjct: 353 RWADQDLLSLFT 364 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 285 bits (728), Expect = 2e-75, Method: Composition-based stats. Identities = 116/367 (31%), Positives = 173/367 (47%), Gaps = 15/367 (4%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L E + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K HV+SAFS + Q+ D K+NEI AI +LL++LD+ G +++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 I E+I +GGDY+ VK Q + E F + D +E SHGR E R + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 --IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 I+ + E + KGL+ + V R ++ + V YYISS Sbjct: 241 ESILNPLEIEANEVLTRRKGLRSIHKVVRKRR--DKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 AIR HW +ENKLH LDV D R N A++ I+ I + I+ K K+ + Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNMKSSIP 357 Query: 356 RKMRKAA 362 R +K A Sbjct: 358 RIQKKPA 364 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 284 bits (727), Expect = 3e-75, Method: Composition-based stats. Identities = 105/372 (28%), Positives = 168/372 (45%), Gaps = 14/372 (3%) Query: 7 MEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 + + I D R H L+++L L + A + GA+ +I +F E LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLRHSYDKSR 121 G P HDT +R+ I P + ++ + V+A+DGK LR Y+K R Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 ++S + L + + + S+E+ A LL +D+KG I+T DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 + + + Y A+K +GRL E F + + + E HGR E R V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAG-DLAFHETRETGHGRLETRRASVL 241 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 P + + GLK + + R +VRY S L K A +R Sbjct: 242 --PLKAFKQAPAFPGLKAIGRIQATRQ---GADGRAVTSVRYIALSKVLAPHKLAEVVRA 296 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV +EDD + R+ NA + + IR +A +IL + K + KMR+ Sbjct: 297 HWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDK-PIASKMRRV 355 Query: 362 AMDRNYLASVLA 373 +R++ Sbjct: 356 NWNRDFFHEFFT 367 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 279 bits (714), Expect = 1e-73, Method: Composition-based stats. Identities = 101/367 (27%), Positives = 177/367 (48%), Gaps = 17/367 (4%) Query: 9 HISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG- 67 I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ +G Sbjct: 8 AIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYTKLSGK 67 Query: 68 ------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 +P HDT V I P +F E + ++ + + IAIDGKT R ++ Sbjct: 68 ELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG-IKQTA 126 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G ++ Sbjct: 127 NSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGTYVEVI 186 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 E I +GG+++ VKG Q +L + E++F +E + + HGR E R Sbjct: 187 EMILSKGGNFVLPVKGNQKKLLEFIEKEF--REYRGNTVSADTQEDIGHGRVEKRTVYCI 244 Query: 242 DVP---DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D++ +WKG+K L R + + K + YYI++ + ++ A Sbjct: 245 TEIKTDDDIDGCMQKWKGVKTLVKI--VREVYKKADKSTRIETVYYITNL-IDPKEINRA 301 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA-GLRRK 357 IR HW +EN LH LDV++NED + N E F + +A+ I+ + + R Sbjct: 302 IRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGISMNRT 361 Query: 358 MRKAAMD 364 + Sbjct: 362 RKLCGYS 368 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 274 bits (700), Expect = 5e-72, Method: Composition-based stats. Identities = 107/286 (37%), Positives = 154/286 (53%), Gaps = 9/286 (3%) Query: 9 HISIIPDYRQT-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 +IPD R+ H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCH-SSDDKDVIAIDGKTLRHSYDKSRRRGAI 126 IP HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-AL 135 Query: 127 HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDE 246 GGDY+ A+KG Q L+ + +P+ + EK HGR E R V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQA-EAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 L +W GLK L + S R + + R +I+S Sbjct: 255 LTQKP-DWPGLKTLVMVESRREL----NGQVSCERRCFITSHTADP 295 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 271 bits (694), Expect = 2e-71, Method: Composition-based stats. Identities = 108/350 (30%), Positives = 167/350 (47%), Gaps = 16/350 (4%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+ K + HKLSDI++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + +EKSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 QKDI +KI+++ GD++ +K Q L E+K + +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKI---KELSPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ + Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVFS 375 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 271 bits (693), Expect = 3e-71, Method: Composition-based stats. Identities = 92/253 (36%), Positives = 144/253 (56%), Gaps = 7/253 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D+KSNEITAIP+LL ML ++G I+T DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP-EHDSYAISEKSHGREEIRLHIVCD 242 I + DY+ AVK Q L + + F ++N H + + HGR E R + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYS-TI 119 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V D+L+ W L + + S R + RY+I S + A++F A+R H Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREV----GNTISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAG 234 Query: 363 MDRNYLASVLAGS 375 D ++L VL G+ Sbjct: 235 WDNSFLIKVLTGN 247 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 270 bits (689), Expect = 7e-71, Method: Composition-based stats. Identities = 102/372 (27%), Positives = 165/372 (44%), Gaps = 30/372 (8%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 E+ L+E ++ +PD R V H L+ +L LT CAV++GA + ++ + L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCISPAKFHECFINWMR-DCHSSDDKDVIAIDGKTL 113 P TI RV++ I W+ + +A+DGK+L Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTD 172 R + RR +H+++A + LV+ Q+ EK+NEIT LL+ L D+ G ++T+D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGR 232 A+ Q D A ++ + Y+ VK +L+ + P +++ + HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLK-SLPWQQIPLQDR----TRTTGHGR 270 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---D 289 EIR VC V + L + G ++ R + + + Y ++S Sbjct: 271 CEIRRLKVCTVNNLL------FPGARQAVQI--VRRRVNRTTGKVSLKTIYAVTSLAAEQ 322 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + A IR HW VE LH DV ED ++R GNA + + R++AI L V Sbjct: 323 APPARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGV 381 Query: 350 --FKAGLRRKMR 359 AGLRR R Sbjct: 382 RNIAAGLRRTAR 393 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 266 bits (681), Expect = 7e-70, Method: Composition-based stats. Identities = 105/360 (29%), Positives = 161/360 (44%), Gaps = 41/360 (11%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ + + I D RQ KV H+ I++ + V + + W ++ DF +DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINW--------------------MRDCHSSDD 102 P HDT+ R + P + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKTLRHSYDKSRRR--------------GAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT++ + ++ RRR +H++SAFS L +GQ + D+K Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAF- 206 NEI AIP LL+ LDI +G ++T DAMG QKDI +I K+ YL VK Q L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F L N + + E HG +R VC L +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIR 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + R + E E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 317 TER--VDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 266 bits (681), Expect = 7e-70, Method: Composition-based stats. Identities = 91/348 (26%), Positives = 159/348 (45%), Gaps = 16/348 (4%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ-Y 61 + L+E + + D+R+ H L +L++ I + G G+ ++ +F + + L Q + Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSDDKDVIAIDGKTLRHSYDK- 119 +P + TI RV+ + + F W + + DD + + +DGK+L+++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + ++ I +S FS LV+ + + +K +EI ++ ++ K+ T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 K I K DY+ VKG Q L K ++ ++ + + SHGR+ R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDL----SNSSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 V V E +G + L + + K E YYISS +A+ FA Sbjct: 237 IEVFKVRKN------ERQGFENLRRVIKVERKGSRGDKTYE-ETAYYISSLTESAQVFAK 289 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFR 337 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 266 bits (680), Expect = 9e-70, Method: Composition-based stats. Identities = 90/247 (36%), Positives = 141/247 (57%), Gaps = 3/247 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K E++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKF---PLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 +G DY A+KG Q L + +E F E EH + EK R E+ + Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRTE 248 Query: 243 VPDELID 249 Sbjct: 249 QERLWSH 255 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 266 bits (680), Expect = 1e-69, Method: Composition-based stats. Identities = 87/365 (23%), Positives = 166/365 (45%), Gaps = 18/365 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-IAIDGKTLRHSYDKS 120 +P TI +V + + +D + +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T +KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQGG-DYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 A +++Q +Y+ VK Q L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSD---PVERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---DLTAEKFA 296 + V L + +++ + R ++ V Y I S + A Sbjct: 276 ILTVARGL-----RFPYAQQVIQIIRRRRVLGAGAW--STEVVYAICSLPCEQAPPKLLA 328 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 + IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 329 SWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARAC 388 Query: 357 KMRKA 361 + A Sbjct: 389 RRLAA 393 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 261 bits (667), Expect = 3e-68, Method: Composition-based stats. Identities = 108/381 (28%), Positives = 177/381 (46%), Gaps = 23/381 (6%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K L E + +PDYR+T K ++KL DILLL I + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---DKDVIAIDGKTLRHSY 117 G +G+P T+ R+ I E + H D++ IDGK +R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 ++ R I +SA+S + + +EKSNEIT++P+LL+ +D+ G I+T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 K I +KI+++GGD+L +K Q L E+ L E + + + HGR E R+ Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSEGPFLE---HGRIETRV 252 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + D LI +W G L V + + + R+Y+SS +A + T Sbjct: 253 CRIFRGND-LITDREKWNG--NLTVVEIRTATERKSDGQKSSERRFYVSSFHGSARRLGT 309 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL-RR 356 R HW +E+ +HW LD + +D + +A I+ + + IL + + Sbjct: 310 IARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL--------SIWKG 360 Query: 357 KMRKAAMDRNYLASVLAGSGL 377 K +K + A ++ L Sbjct: 361 KRKKPSEKAKGTAELIGELSL 381 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 258 bits (660), Expect = 2e-67, Method: Composition-based stats. Identities = 87/207 (42%), Positives = 133/207 (64%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKF 210 I K+ DY+ AVK Q +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 250 bits (637), Expect = 9e-65, Method: Composition-based stats. Identities = 89/241 (36%), Positives = 137/241 (56%), Gaps = 8/241 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R + Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRSNKRN 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + G H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 121 GEKPG--HIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHD---SYAISEKSHGREEIR 236 IAEKI+ + DY+ ++K QG L + E F E + EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 L 237 Sbjct: 239 E 239 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 250 bits (637), Expect = 9e-65, Method: Composition-based stats. Identities = 98/389 (25%), Positives = 162/389 (41%), Gaps = 35/389 (8%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDF 57 ++ L+ + I D R+ + LS +L + A ++GA G +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQYGDFENGI---PVHDTIARVVSCISPAKFHECFINWM--RDCHSSDDKDVIAIDGKT 112 L D G P I + + A F W+ + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKII-T 170 LR ++ + +R + ++SA LV GQ++ + +NEIT + LL L DI G ++ T Sbjct: 141 LRGAWSEGNKR--VTLLSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSH 230 DA+ Q + A + + G DY VKG Q L + F + + + E+ H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLY---RKTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD- 289 GR + +T E KG+ VA + E + R Y Sbjct: 256 GRI-----------KKWQAWTTEAKGIGFPEVATAAVIRRDEFDLKGIRVSREYAHILTS 304 Query: 290 -----LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 TA IR HW +EN++H+ D ED + GN+ + R++AI I+ Sbjct: 305 VAGNRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGII 364 Query: 345 TNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + + K ++ + A DR+ + +LA Sbjct: 365 RRNGIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 248 bits (632), Expect = 4e-64, Method: Composition-based stats. Identities = 84/274 (30%), Positives = 136/274 (49%), Gaps = 9/274 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L+E + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------VIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + D IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE--HDSYAISEKSHGREE 234 ++++A+ I +G YL +K Q +++ F + + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRS 268 R C W GL + + + R+ Sbjct: 242 RRRVFACPDAG-CFTTLRGWPGLTTVLASETIRA 274 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 246 bits (627), Expect = 1e-63, Method: Composition-based stats. Identities = 91/362 (25%), Positives = 146/362 (40%), Gaps = 48/362 (13%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + +THL+ L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRNT-HLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEE 208 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK Q + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPEH-----------------DSYAISEKSHGREEIRLHIVCDVPDELIDFT 251 E + + + EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFR----------------------------SIIAEQKKEPEMTVRY 283 EW ++ + R + AE+ ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 IS LTAE+ + R HW +EN+LH LD ED ++ S IR A NI Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYNI 357 Query: 344 LT 345 L Sbjct: 358 LR 359 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 241 bits (615), Expect = 3e-62, Method: Composition-based stats. Identities = 83/324 (25%), Positives = 134/324 (41%), Gaps = 27/324 (8%) Query: 50 FGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKS 229 T DA+ C+ D A I GGDY A+K Q L + E + +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 H R E R + V D ++ GL+ + + VRY++ S Sbjct: 206 HDRCERRRACIVAVND------IDFPGLQAIGSVEATSRH---ADGRLTSHVRYFLLSTI 256 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHP- 315 Query: 350 FKAGLRRKMRKAAMDRNYLASVLA 373 KA +RRK++ A D +L S++A Sbjct: 316 DKASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 240 bits (613), Expect = 6e-62, Method: Composition-based stats. Identities = 80/372 (21%), Positives = 132/372 (35%), Gaps = 35/372 (9%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------------SSDDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRG--AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML--- 162 DGKT+R + ++ V+ V+ ++ +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVACEPVND-GDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH 220 + G ++ TDA Q + E++ GG +L VK Q R+ A P ++ + Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRIL-AKVRALPWAQVRAQD- 267 Query: 221 DSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSF--RSIIAEQKKEPE 278 K+HGR E R V P G ++ R Sbjct: 268 ---TCRGKAHGRAETRTVRVVQAP---THVDLALAGTAQVIKITRHTRRRPHPGAPAAST 321 Query: 279 MTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSG 335 Y ++S A +R+HW +EN++HW D +ED R GN + Sbjct: 322 RENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLAC 381 Query: 336 IRHIAINILTND 347 +R+ AI Sbjct: 382 LRNTAITRHRAH 393 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 236 bits (603), Expect = 9e-61, Method: Composition-based stats. Identities = 85/397 (21%), Positives = 159/397 (40%), Gaps = 35/397 (8%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHL-DFLKQ 60 E++ L + ++ +PD R + H+L IL L+ AV +G + E+I + L Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD---VIAIDGK 111 G + + P DT+ RV+S + + + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGK 167 TLR + R H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGRA--PHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQ-KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS 226 ++T DA+ + A+ I + G ++F VK L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI----GHSAE 271 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-----V 281 ++HGR E R + + + + + ++ V + T Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 282 RYYISSADLTA---EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 + ++S L A A R HW +ENK+HW DV ED ++R G + + +R+ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 IAINILTN--DKVFKAGLRRKMRKAAMDRNYLASVLA 373 + I ++ +RR D L ++L Sbjct: 392 LIIGLIRLAGHNRIAPTIRRIRH----DNALLLAILT 424 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. Identities = 80/383 (20%), Positives = 148/383 (38%), Gaps = 17/383 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G + + ++ + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SDDKDVIAIDGKTLRHSY 117 F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRRGAIH--VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRV-VAGDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 + +A I+ +GG +LF++KG Q + A P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVR-AKLAGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDFTFEWKGLK-KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 R L+ F + +K + E + +S+ + Sbjct: 258 RALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPA 317 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A R HW VE +H D M+ED IR NAA ++ R I+ L Sbjct: 318 QLARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKN-- 374 Query: 354 LRRKMRKAAMDRNYLASVLAGSG 376 +R+ R D + ++A + Sbjct: 375 IRQARRATIRDPGLVLQIIALTS 397 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 224 bits (572), Expect = 3e-57, Method: Composition-based stats. Identities = 86/327 (26%), Positives = 135/327 (41%), Gaps = 22/327 (6%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L + + A +G G+ + T D + P T V+S + PA + Sbjct: 3 LLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLNA 62 Query: 89 CFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKT 145 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 63 RMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLAV 120 Query: 146 DEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGTQGRLN 203 EKSNEI + LL +L ++T DAM Q A+ I YL VK Q ++ Sbjct: 121 AEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKIL 180 Query: 204 KAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A P E+ D + HGR E R + + + K++ Sbjct: 181 -ARITALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG-----FPYAKQIIRI 230 Query: 264 VSFRSIIAEQKKEPEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNED 320 R I A + + V Y I S + T +R H +EN LHW DV +ED Sbjct: 231 TRERLITATD--QRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDED 288 Query: 321 DCKIRRGNAAELFSGIRHIAINILTND 347 + GN A++ + +R+ AIN+ + Sbjct: 289 RQRAHTGNGAQVLATLRNTAINLHRLN 315 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 221 bits (564), Expect = 2e-56, Method: Composition-based stats. Identities = 104/197 (52%), Positives = 134/197 (68%), Gaps = 13/197 (6%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L L +H + + D RQ KV +KL D+L L + AVISGAEGWE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TDEKSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVKG 197 + I K+ DY AVK Sbjct: 168 VKTIVKKKADYCIAVKK 184 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 221 bits (563), Expect = 4e-56, Method: Composition-based stats. Identities = 88/369 (23%), Positives = 152/369 (41%), Gaps = 25/369 (6%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDF-LKQYG 62 L+ ++ +PD R V H L +L + AV++GA + ++ L + G Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 63 DFE------NGIPVHDTIARVVSCISPAKFHECFINWMRDCHS--SDDKDVIAIDGKTLR 114 F + P T R+++ + + W+ C + + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIAE-KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGRE 233 Q++ A + + Y+F VK Q RL + + P ++ + S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPIQD----ETSTRGHGRY 258 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP-EMTVRYYISSADLTA 292 +IR L ++ + R +A + + +S+A Sbjct: 259 DIRRLQAVTCTGPL---ALDFPHAVQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGP 315 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VF 350 + A +R HW +E LH D ED ++R GNA + +R+ AIN+L Sbjct: 316 AELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGITTI 374 Query: 351 KAGLRRKMR 359 A LR R Sbjct: 375 AAALRHNSR 383 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 220 bits (561), Expect = 6e-56, Method: Composition-based stats. Identities = 88/249 (35%), Positives = 128/249 (51%), Gaps = 14/249 (5%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ + K V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQ-EVKGVVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGTQGR---LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 + I + +Y+ A+K + + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFA 296 V F + GLK + S R+I+A E VRYY++S D T E+ A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPEEIA 240 Query: 297 TAIRNHWHV 305 +AIR HW + Sbjct: 241 SAIRQHWSI 249 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 220 bits (560), Expect = 7e-56, Method: Composition-based stats. Identities = 75/219 (34%), Positives = 119/219 (54%), Gaps = 7/219 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ + K V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQ-EVKGVVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + + ++SA+S + + +GQ+K D+KS+EITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 DI + I +Y+ A+K + + + ++ + + Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRD 221 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 213 bits (541), Expect = 1e-53, Method: Composition-based stats. Identities = 89/418 (21%), Positives = 151/418 (36%), Gaps = 62/418 (14%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVIS-GAEGWEDIEDFG----ETHLDFLK 59 L++ ++I D R T H L+ IL + CA ++ G + IE + + L L Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 60 QYGDFENGI---PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------------ 104 + D G+ P TI RV++ + + C ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 A+DGK L+ + R +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDEKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAE-KIQKQGGDYLFAVKGTQ 199 + KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK Q Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + + ++ + + HGR E R I+ P + IDF + + + Sbjct: 267 PTLHATAITALTGTDTDFAAV-THRETHRGHGRTEYR--ILRTAPADGIDFPYAAQVFRV 323 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSA---DLTAEKFATAIRNHWH-VENKLHWRLDV 315 L R V Y I+ A +R HW +EN +H DV Sbjct: 324 L------RHRGGLDGIRHSKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDV 377 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 ED C+ R + R++A L + R+ D + + Sbjct: 378 TFAEDACQARTATLPRALAAFRNLATGTLRRAGHVN--IAHARREHGYDHQRVLDLFN 433 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 204 bits (520), Expect = 3e-51, Method: Composition-based stats. Identities = 86/390 (22%), Positives = 143/390 (36%), Gaps = 61/390 (15%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVI-SGAEGWEDIEDFGE-THLDFLK 59 +++ L+ + D R V +++S +L L +CA+ +G + ++ + L Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 60 QY------GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS------------- 100 + IP T+ V+ + P + + +R S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 101 -------------------DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 + IA+DGK LR + R + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDEKSNEITAIPELLNMLDI---KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 + K+NEI LL+ LD KG ++T DA+ Q+D A + ++G YL +K Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q R P KE+ D + HGR E RL V V L + Sbjct: 268 Q-RGQARQLHALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLL------FPHAA 316 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT---AIRNHWHVENKLHWRLDV 315 ++ R + +K Y I+ A R HW VEN +HW DV Sbjct: 317 QVLRIQRRRRLYGAKKW--SSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDV 374 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILT 345 NED ++R N + + +R + L Sbjct: 375 TFNEDKSQVRTHNTPSVLAAVRDLIRGALK 404 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 204 bits (519), Expect = 4e-51, Method: Composition-based stats. Identities = 90/237 (37%), Positives = 119/237 (50%), Gaps = 9/237 (3%) Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRL 202 + T++KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK Q +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFE---EKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EK EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVPPG-FAAKGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV I VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAV---RITTHADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VL G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHP-EKDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 203 bits (517), Expect = 8e-51, Method: Composition-based stats. Identities = 79/385 (20%), Positives = 141/385 (36%), Gaps = 46/385 (11%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAV-ISGAEGWEDIEDFGETHLDFLKQYGDFE- 65 E ++ IPD+R + + L + + +CAV +G + + ++ + Sbjct: 26 ERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRLRLPW 85 Query: 66 -----NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI-------------- 106 + +P TI R ++ + ++ +D D + Sbjct: 86 NPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAGDQAV 145 Query: 107 -----AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNM 161 A+DGKT R + K +H++ + ++GQ + D KSNE T LL Sbjct: 146 PVRAYAVDGKTSRGA--KRADGSQVHLLGVAAHGAGALLGQREIDAKSNETTEFRALLAP 203 Query: 162 LDIKGKIITTDAMGC-QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH 220 L++ G ++ DA+ + ++ + ++ YL K Q +L AF P E+ + Sbjct: 204 LELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLR-AFLAALPWTEIPTAD- 261 Query: 221 DSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT 280 ++ HGREE R V V +DF + ++ R ++ + Sbjct: 262 ---LTRDRGHGREETRTLKVATVTH--LDFPHAAQAIR-------IRRWRRQKGQPASHE 309 Query: 281 VRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 Y I+ A A R WH+E K H+ DV ED R G + + R Sbjct: 310 TIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVLALFR 369 Query: 338 HIAINILTNDKVFKAGLRRKMRKAA 362 + L R+ K A Sbjct: 370 ATVADTLRRAGHRSVPACRRAHKTA 394 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 198 bits (504), Expect = 3e-49, Method: Composition-based stats. Identities = 81/215 (37%), Positives = 118/215 (54%), Gaps = 11/215 (5%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ +PD R+ + H+L ++LL IC VISGAE W + + + LD+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GI HDT RV S + ++F CF+ W+ S + +AIDGK LR S+D + R Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHDGA--RS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+G IT DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARHR 183 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE 219 + RL E + +P+ Sbjct: 184 RADCSARC--------RLRAECEGQSAESCRGHPD 210 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 196 bits (499), Expect = 1e-48, Method: Composition-based stats. Identities = 78/225 (34%), Positives = 106/225 (47%), Gaps = 9/225 (4%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK Q RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPEHDSYAISE--KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 E + +E K HGR E R+ V + L W GL++L + R I Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQHWAGLQRLVMLERTRQI-- 118 Query: 272 EQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 ++ YYISS + A + A IR HW +EN+LHW LDV ED IR AA Sbjct: 119 --GQKVTTERCYYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 LFSGIRHIAINILT---NDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + +R I +N+ N + K L+ AA D +L Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILG 221 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 194 bits (493), Expect = 4e-48, Method: Composition-based stats. Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 11/224 (4%) Query: 111 KTLRHSYDKSRRRGAIHVISAF---STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGK 167 K + S + S S +LV+GQ K ++KSNEITAIP L+ ML+I+ Sbjct: 3 KGFQRSVKTEEKHKPSQKKSQVLKDSLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESS 62 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF---PLKELNNPEHDSYA 224 IIT DAMGCQK+I I+K+ GDY+ +K Q L + +E F +E + EH Y Sbjct: 63 IITIDAMGCQKEITSLIRKKKGDYIITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQ 122 Query: 225 ISEKSHGREEIRLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 E H R E R I V + W LK + + S R + + VR+ Sbjct: 123 EIETGHHRIEKREVIAVSVSSLPCLHNQDLWTELKTVVMVKSERRLWN----KTTTEVRF 178 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRG 327 YISS + ++K ATAIR+HW +EN LHW LDV +ED +IR Sbjct: 179 YISSVEKNSQKIATAIRSHWEIENSLHWTLDVTFSEDKSRIRTR 222 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 186 bits (473), Expect = 8e-46, Method: Composition-based stats. Identities = 64/218 (29%), Positives = 99/218 (45%), Gaps = 3/218 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + L E +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRDG 120 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 121 --QVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 +A + G DY+ K Q L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 186 bits (472), Expect = 1e-45, Method: Composition-based stats. Identities = 64/229 (27%), Positives = 103/229 (44%), Gaps = 5/229 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H D + +A+DGK L S D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRDG--QV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD-IKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHG 231 +Q +GGD + K QG L E F + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 184 bits (467), Expect = 4e-45, Method: Composition-based stats. Identities = 81/194 (41%), Positives = 119/194 (61%), Gaps = 2/194 (1%) Query: 94 MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKG QG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D+ I EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 121 RRAPIDRDTCQI-EKQKGRVEARTYHVLSASDLIRDFST-WSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKEPEMTVRYYISS 287 + + + + + S Sbjct: 179 RARVGVPLLHKVQS 192 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 180 bits (457), Expect = 7e-44, Method: Composition-based stats. Identities = 60/266 (22%), Positives = 112/266 (42%), Gaps = 22/266 (8%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + L+E ++ +PD R+ V ++ + +L + +CA++SGA + I ++ + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-------------KDVIAI 108 +P TI RV+ + A W++ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGK 167 DGK +R + +H++ +V+ Q+ DEK+NEI +L+ + D+ Sbjct: 167 DGKAMRAT---RHGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISE 227 +IT DAM Q A+ + +G L VK Q ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPV----GHTTTG 278 Query: 228 KSHGREEIRLHIVCDVPDELIDFTFE 253 + HGR E R VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPHAA 304 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 179 bits (455), Expect = 1e-43, Method: Composition-based stats. Identities = 72/270 (26%), Positives = 117/270 (43%), Gaps = 12/270 (4%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L+ + + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD----KDVIAIDGKTLRHSYD 118 +G P HDT +RV + P + F +M + K V+AIDGK+LR YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 +A+ + Y +K G L +A E F + + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFA----AVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRS 268 V V D L+ GLK + + R+ Sbjct: 235 SVLPV-DRLVKRP-SLPGLKAIGRIEAVRT 262 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 179 bits (453), Expect = 2e-43, Method: Composition-based stats. Identities = 58/228 (25%), Positives = 104/228 (45%), Gaps = 14/228 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ LM+ +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDG 110 F P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIIT 170 K +R + K++ IH ++AF +V+ Q DEK+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGA-SKAKGGQKIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 DA+ Q + A I + + DY+F VK Q + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIE-SLPWEAFPP 446 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 167 bits (422), Expect = 7e-40, Method: Composition-based stats. Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 4/187 (2%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-K 59 + L+ + +PD R+ + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D+KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats. Identities = 70/273 (25%), Positives = 108/273 (39%), Gaps = 13/273 (4%) Query: 58 LKQYGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 L + D + + ++ + F S +K + DGK LR S Sbjct: 8 LCAFLDIPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGS 67 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMG 175 + ++RG V+ I Q D K +EI + LL+ D+ + IT DA+ Sbjct: 68 IESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALH 126 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 E I K GG +L +K Q L + + P D + +HGR E Sbjct: 127 LCPSTTEMITKAGGVFLIGLKENQPTLLAH------MTDCALPPIDQKTTFDFNHGRVEQ 180 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + + DV + D ++ K+L R I ++ + V YYIS+ E Sbjct: 181 RKYWLYDVSKQGFDPRWDNTAFKRLVKVQRTR--INQKNAKISREVSYYISNETA-KEGI 237 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 A+RNHW VE H DV +NED K ++ Sbjct: 238 FDAVRNHWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 166 bits (421), Expect = 1e-39, Method: Composition-based stats. Identities = 55/227 (24%), Positives = 100/227 (44%), Gaps = 15/227 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETH-LDFLKQ 60 +++ L + +PD R +H L IL + + AV++ A+ + + ++ LK+ Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 61 YGDFENGI------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR 114 N P T+ RV+ + W+ + +A+DGK L+ Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAWLLGIAGFEA---VAVDGKVLK 335 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 + + + +H++SAF I Q + K+NEI + LL +DI+ K++T DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGTQGRLNKAFEEKFPLKELNNPE 219 Q+ A + + + DYLF AVKG Q +L + P + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFPPQR 439 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 166 bits (419), Expect = 1e-39, Method: Composition-based stats. Identities = 56/223 (25%), Positives = 97/223 (43%), Gaps = 19/223 (8%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLD-FLKQYGDFENG-- 67 + + D R+ + H +LL+ + V++G +E I + + L++ G + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 68 ----IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + ++ + IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQYI---VAHSSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAI 225 +I+++GGDY+F VK + L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 64/189 (33%), Positives = 94/189 (49%), Gaps = 8/189 (4%) Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDV 243 I + GDYL VKG Q +L +A E F + + + D A+ E+ HGR ++ V Sbjct: 2 IIAKKGDYLLMVKGNQPKLLEAIEIAF-IDQHDVKSVDRSALVERGHGRTVGQIASVLSA 60 Query: 244 PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHW 303 I +W + S R + +KE ++ YYI+S LTAE+ A ++R W Sbjct: 61 KG--IINPGDWPNCVTIGRIDSMRVVD---EKESDLERCYYITSRALTAEQLAASVRARW 115 Query: 304 HVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRKA 361 VEN+ HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + A Sbjct: 116 GVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKGA 175 Query: 362 AMDRNYLAS 370 A D Sbjct: 176 ARDDGVREP 184 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 165 bits (417), Expect = 2e-39, Method: Composition-based stats. Identities = 61/189 (32%), Positives = 85/189 (44%), Gaps = 10/189 (5%) Query: 192 LFAVKGTQGRLNKAFEEKFPLKELNNPEHDS---YAISEKSHGREEIRLHIVCDVPDELI 248 + AVK Q L E + S + +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W GL+ + + S R I RYY+SS A + A A+R HW +E+ Sbjct: 61 PDL--WPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIES- 113 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA +Y Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 369 ASVLAGSGL 377 A +L L Sbjct: 174 AQLLGLKTL 182 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 165 bits (417), Expect = 3e-39, Method: Composition-based stats. Identities = 56/164 (34%), Positives = 87/164 (53%), Gaps = 3/164 (1%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI 106 + + L+ + NG P DT RV+ I P + C + ++ S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 AIDGK L+ S K+ G+ H++SA+ L + Q EK NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKT---GSTHILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 +I+ DAMG Q +IAE+I + DY+ ++KG Q L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 164 bits (416), Expect = 3e-39, Method: Composition-based stats. Identities = 55/194 (28%), Positives = 85/194 (43%), Gaps = 7/194 (3%) Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEIRLHIV 240 EKI ++ GDY+ +K + E F + PE +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + +YISS D+ + A +R Sbjct: 61 LKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTA 174 Query: 361 AAMDRNYLASVLAG 374 A + +L G Sbjct: 175 AGWSDEFRDELLLG 188 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 156 bits (393), Expect = 2e-36, Method: Composition-based stats. Identities = 57/142 (40%), Positives = 81/142 (57%), Gaps = 3/142 (2%) Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 VIAI+GK+LR + + A+H +SA++ + L +GQ+ EKSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF-PLKELNNPE 219 L ++G ++T DA+GCQ +AE+I GGDY+ AVK Q L A + F L +P Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 220 HDS--YAISEKSHGREEIRLHI 239 + + +K HGR E R Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 154 bits (388), Expect = 6e-36, Method: Composition-based stats. Identities = 65/142 (45%), Positives = 92/142 (64%), Gaps = 4/142 (2%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D + R IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHDGA--RSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDS--Y 223 G IT DAMGCQ DIAE+I ++G DY+ VKG Q L +A + F + E + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AISEKSHGREEIRLHIVCDVPD 245 + ++K+HGR E R + + Sbjct: 119 SQTDKNHGRIETRRCVATNDVA 140 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 152 bits (383), Expect = 2e-35, Method: Composition-based stats. Identities = 59/244 (24%), Positives = 95/244 (38%), Gaps = 17/244 (6%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L + + A + G+ + T D + P T V+S + PA + Sbjct: 3 LLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLNA 62 Query: 89 CFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKT 145 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 63 RMGSYFTAHVASSDPSGLVPIALDGKMLRGALRA--KATATHLVSVFAHRARLVLGQLAV 120 Query: 146 DEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGTQGRLN 203 EKSNEI + LL +L ++T DAM Q A+ I YL VK Q ++ Sbjct: 121 AEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKIL 180 Query: 204 KAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVA 263 A P E+ D + HGR + R + + + K++ Sbjct: 181 -ARITALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG-----FPYAKQIIRI 230 Query: 264 VSFR 267 R Sbjct: 231 TRER 234 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 151 bits (382), Expect = 4e-35, Method: Composition-based stats. Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 4/180 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD+R + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 EN-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK--SR 121 P T RV+ I F NW+ ++D + +DGK+++ + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 122 RRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +++ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 151 bits (381), Expect = 5e-35, Method: Composition-based stats. Identities = 65/326 (19%), Positives = 115/326 (35%), Gaps = 43/326 (13%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 + G P ++T+ +++C+ WM + A DGK L S Sbjct: 13 RWRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS 71 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K A+H + + + + Q + + A+ LL + G++++ DA Sbjct: 72 --KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFL 128 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFP------------------------- 211 + + I ++ G+YL VKG Q ++ P Sbjct: 129 NAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPR 188 Query: 212 --------LKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD--ELIDFTFEWKGLKKLC 261 +EL E+S GR EIR V D D + + W+ + ++ Sbjct: 189 RKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIG 248 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 + E +SS T +F +IRNHW +EN++H D M ED Sbjct: 249 GLRRWCRRRHADLWTVEEVTV--VSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQEDR 306 Query: 322 CKIRRGNAAELFSGIRHIAINILTND 347 R + + R++ IN++ Sbjct: 307 LHGR--AIGVILAVCRNVVINLIRRH 330 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 149 bits (377), Expect = 1e-34, Method: Composition-based stats. Identities = 59/182 (32%), Positives = 90/182 (49%), Gaps = 13/182 (7%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 K + I + G DY+ AVKG Q RL++ L +E+ R Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQ----IKLTTEQRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V D+L +++W+GL++L F + +P + YYISS + A +F Sbjct: 57 RS---VSVFDDLSGISYDWEGLQRLVKVERF----GTRAGKPYHQIVYYISSLTINAAQF 109 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAG 353 A IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL + G Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILRYNGYSSITTG 169 Query: 354 LR 355 +R Sbjct: 170 IR 171 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 148 bits (374), Expect = 3e-34, Method: Composition-based stats. Identities = 50/196 (25%), Positives = 85/196 (43%), Gaps = 9/196 (4%) Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 ++E+ ++ DY+ A+KG + + ++ F + +K HGR E R++ Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFFL--SPVTSTRSVHTTFDKGHGRIERRIYT 58 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 D + EWK L + S +K + +RY+I+S ++FA + Sbjct: 59 -LDTNIGWFEDKKEWKHLAGFGMVDSMV----TRKGKECREIRYFITSVT-DVKQFAKGV 112 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 +HW +EN LHW LDV+ +D+C + NAAE + IR I N + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKR- 171 Query: 360 KAAMDRNYLASVLAGS 375 D + A +L Sbjct: 172 ACIYDDEFRAQILFSC 187 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 146 bits (368), Expect = 1e-33, Method: Composition-based stats. Identities = 55/208 (26%), Positives = 85/208 (40%), Gaps = 15/208 (7%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 +L +++ GK IT DA+ QK +AE I + YLF VK Q L + F Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEH-- 59 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 E D HGR + R +E ++F + + +S + Sbjct: 60 --RKEPDYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF-----CIHKKSYDPKTN 112 Query: 275 KEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 K E T Y ++S + R HW +EN H+ LD +ED +IR GN Sbjct: 113 KVCENTF-YGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRKMR 359 + +R AI +L + V + +K+R Sbjct: 172 NTNRLRGFAIGLLKSKGVK--DIAQKVR 197 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. Identities = 50/183 (27%), Positives = 83/183 (45%), Gaps = 4/183 (2%) Query: 20 WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVV 78 H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 I P + W+ + D + +A+DGK LR S D H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGDV--PGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 V+GQI+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQG- 178 Query: 199 QGR 201 Q Sbjct: 179 QPT 181 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. Identities = 56/180 (31%), Positives = 87/180 (48%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-QY 61 + L + + IPD+R+ L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F ++ VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARL--AEGAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--EKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 144 bits (364), Expect = 3e-33, Method: Composition-based stats. Identities = 73/318 (22%), Positives = 121/318 (38%), Gaps = 45/318 (14%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS-------SDDKDVIAIDGKTLR 114 G P T+ R+++ SPA E ++D + V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRRGAIHVISAFSTMHS------------------LVIGQIKTDEKSNEITAIP 156 D + +GA SA+ S +GQ K E TA Sbjct: 153 SRTDGEKVKGAQQ--SAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFR 210 Query: 157 ELL----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPL 212 LL L + +I+T DA C ++ AE + G Y+F +K Q L+ + Sbjct: 211 RLLPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDY-GQ 269 Query: 213 KELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAE 272 +L P +E+ G +R DV + L +C + R Sbjct: 270 YDLGTPLA---RTAERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRR---- 322 Query: 273 QKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + RY+++S LT ++ +R HW +EN HW +DV++ ED+ + + Sbjct: 323 -GEIVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASR 381 Query: 330 A--ELFSGIRHIAINILT 345 A E S +R I N ++ Sbjct: 382 ASIETVSWLRLIGYNAVS 399 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 144 bits (364), Expect = 4e-33, Method: Composition-based stats. Identities = 61/194 (31%), Positives = 85/194 (43%), Gaps = 11/194 (5%) Query: 186 KQGGDYLF-AVKGTQGR-LNKAFEEKFPLKELNNPEHDS---YAISEKSHGREEIRLHIV 240 +G + A + Q L A + F + + +K HGR E R Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 CDVPDEL--IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L + WK + + S R I + E RY ISS +E+ A Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVI----GSKTETDRRYVISSLPADSERILHA 206 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +EN LHW LDV ED C IR NAA FS +R A+N+ D GL +K Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKR 266 Query: 359 RKAAMDRNYLASVL 372 + AA + +YLA++L Sbjct: 267 KAAAWNPDYLANIL 280 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 142 bits (358), Expect = 2e-32, Method: Composition-based stats. Identities = 62/201 (30%), Positives = 96/201 (47%), Gaps = 13/201 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK Q L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDTAKNSP 61 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 LN +++ ++K HG E H + + +W GL++ S R Sbjct: 62 LN-----AWSWTQKGHGHE---SHCRLKIWEATESMKMQWAGLERFI---SIRRQGFRHH 110 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINILTNDKVFKAGLR 355 +R+IA N L V L+ Sbjct: 170 ILRNIAFN-LRLGTVSNPSLK 189 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 140 bits (353), Expect = 7e-32, Method: Composition-based stats. Identities = 60/158 (37%), Positives = 94/158 (59%), Gaps = 3/158 (1%) Query: 99 SSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPEL 158 + D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT+EKSNE TAIP+L Sbjct: 3 ARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIPKL 62 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP 218 +L ++ +T DA+G Q+DIA++I + DYL VK Q L++ + + E Sbjct: 63 FTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAKGF 122 Query: 219 EHD-SYAISEKS--HGREEIRLHIVCDVPDELIDFTFE 253 D + +++E+ HGR + V L + Sbjct: 123 TEDFTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADK 160 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 56/167 (33%), Positives = 85/167 (50%), Gaps = 13/167 (7%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K E SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 DY+ +K QG L ++ E+ F +H +Y E HG EIR P Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIRNFGFQLDP 120 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 D + W LK + + I + + + RY+ISS D Sbjct: 121 DSV------WSNLKSVGMV----EPIGQVDDKTTVETRYFISSLDSN 157 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 139 bits (350), Expect = 2e-31, Method: Composition-based stats. Identities = 57/146 (39%), Positives = 77/146 (52%), Gaps = 7/146 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH---DSYAISEKSH 230 MGCQK+IAE I +Q DY+ AVK Q L++A ++ F N E D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 61 GRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISSTMA 116 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVV 316 TA + R HW +EN LHWRLD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDIA 142 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 47/190 (24%), Positives = 77/190 (40%), Gaps = 6/190 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ H+ IPD R V +LL+ + ++S E D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 ENGIPVHDTIARVVSC-ISPAKFHECFINW--MRDCHSSDDKDVIAIDGKTLRHSYDK-- 119 E P D+ R + A +W + + D D + DGKTLR S + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 120 SRRRGAIHVISAFSTMHSLVIGQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 134 bits (336), Expect = 6e-30, Method: Composition-based stats. Identities = 42/187 (22%), Positives = 80/187 (42%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L + +S +PD R + L +L L + A +S + +E F + L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H I ++ + P K + D +V+ +DGK LR S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQVFPEA---DLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 134 bits (336), Expect = 7e-30, Method: Composition-based stats. Identities = 52/171 (30%), Positives = 81/171 (47%), Gaps = 9/171 (5%) Query: 205 AFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F++ + L E +SY EK HGR+E+R V +W +K + V Sbjct: 2 QFQDYWALPEDK---QESYITEEKGHGRKEVREVYVLPAAFS-EALRQKWCLVKSIVAVV 57 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 RS+ K + YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I Sbjct: 58 RDRSV----KGKGSYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRI 113 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 G++A + R N+ + + RKM +AA +++Y VL S Sbjct: 114 YAGDSALNMACCRRFVQNLFRKSE-GNLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 49/167 (29%), Positives = 80/167 (47%), Gaps = 9/167 (5%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+ K + HKL D+++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-----DKDVIAIDGKTLRH 115 NGIP T+ R+ I + H ++++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML 162 + K+ R I +SA S + + +EKSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 131 bits (329), Expect = 5e-29, Method: Composition-based stats. Identities = 56/157 (35%), Positives = 80/157 (50%), Gaps = 4/157 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 G H++SA++T H + +G + T+EKSNEITAI LL L K ++T DAMGCQKDIA Sbjct: 2 GPRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARN 61 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFE---EKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 I GGD++ AV+ Q +L A EK E H ++ HGR + R + Sbjct: 62 IVAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWG 121 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEP 277 VP + EW +K + AV + + + Sbjct: 122 AQVPPD-FAAKGEWPWIKAIGTAVRITTHPDGTQTDE 157 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 130 bits (328), Expect = 7e-29, Method: Composition-based stats. Identities = 67/359 (18%), Positives = 113/359 (31%), Gaps = 72/359 (20%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 L+ +L L V++G + + + ++ L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHECFINWMRDCHSSDDKDV--IAIDGKTLRH--SYDKSRRRGAIHVISAFSTMHSLVIG 141 E W+ + D +A DGKTL+ S+ ++ V+ A + G Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGITAG 167 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGR 201 + +EI A+ L LD+ ++TT ++G Sbjct: 168 HQRVVG-GDEIAALEALAGRLDLTDVLVTT-------------AEKG------------- 200 Query: 202 LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 HGR E+R V + G K++ Sbjct: 201 ----------------------------HGRVEVRSLKALTVT---TPKLVGFWGTKQVI 229 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT-------AIRNHWHVENKLHWRLD 314 P ++ + L AE+ R HW VE +H D Sbjct: 230 ELRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRD 288 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 V++ED R NA ++ R AI+ L + + +R A + +A Sbjct: 289 RVLDEDRHTARTANAPLAWAIARDTAISALRL--TGHRSIAKALRTTARQPERVLQTIA 345 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 82/187 (43%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L E +S IPD R ++ L +L L + A +S + +E F + L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + +D +V+ +DGK L+ S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEALLQVFP---GADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + E A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGR--EDQALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 129 bits (325), Expect = 1e-28, Method: Composition-based stats. Identities = 84/99 (84%), Positives = 90/99 (90%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVLAG+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 44/119 (36%), Positives = 70/119 (58%), Gaps = 4/119 (3%) Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 +D W LK + + S I + + + RY+ISS D E+ A ++R+HW +EN Sbjct: 9 LDPDSVWSNLKSVGMVES----IGQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIEN 64 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 65 SLHWVLDVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 128 bits (322), Expect = 3e-28, Method: Composition-based stats. Identities = 42/109 (38%), Positives = 61/109 (55%), Gaps = 4/109 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW L Sbjct: 7 WEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHWCL 62 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 D+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 63 DIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 127 bits (319), Expect = 7e-28, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 76/187 (40%), Gaps = 17/187 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEK--------FPLKELNNPEHDSYAI 225 M Q D+ +Q++GGDY+ K QG L E FP + D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 E S G + + L ++ W G++++ R + + + V Y I Sbjct: 61 CEVSKGHGWVERRTMTS-TIWLNEYLTRWPGVQQVFRLTRTRQV----GGKTTVEVVYGI 115 Query: 286 SSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS A R HW +E++ H D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESRHH-IRDATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 ILTNDKV 349 +L Sbjct: 175 LLRRLGT 181 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 124 bits (312), Expect = 4e-27, Method: Composition-based stats. Identities = 48/211 (22%), Positives = 92/211 (43%), Gaps = 14/211 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-- 59 + + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKD-----VIAID 109 + E +P T+ R+ + + ++W R+ + K+ +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKI 168 GK LR + R A+ +SA L +G Q D ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WV 183 Query: 169 ITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQ 199 +T DA C +++A + +Q G A KGT+ Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 46/202 (22%), Positives = 76/202 (37%), Gaps = 50/202 (24%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLN---KAFEEKFPLKELNNPEHDSYAISEKSH 230 MGCQK+IA+ I KQ DY+ A+KG L +A+ K + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V ++ ++W GLK + S Sbjct: 61 GRIETRRCQQVLVNKSWLNNKYQWVGLKSIIKVTS------------------------D 96 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 EK T + +IR+G F+ +R IA+ + ++ Sbjct: 97 VHEKTTT-----------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 123 bits (308), Expect = 1e-26, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 4/118 (3%) Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV E Sbjct: 1 VRIKSERTIVAI--GEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRE 58 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 D K + NAA FS +A+ IL N+K K + K KA D NYL+ +L + Sbjct: 59 DYSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLLQDNNF 115 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 117 bits (292), Expect = 8e-25, Method: Composition-based stats. Identities = 46/176 (26%), Positives = 76/176 (43%), Gaps = 15/176 (8%) Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 K E + G D L +KG +L A L + SY + R E R Sbjct: 6 KKTVETVLATGNDLLVQLKGNHPKLLAAVRT---LCQSRAHAEQSYTVDLGRRNRIEQRT 62 Query: 238 HIVCDVPD------ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 + +P F +G +++ V + +++ P YY+++ + Sbjct: 63 VRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQESP----AYYLATCTAS 118 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 119 AATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHN 172 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 115 bits (289), Expect = 2e-24, Method: Composition-based stats. Identities = 38/92 (41%), Positives = 58/92 (63%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +E S IPD R +H +I+ L + +V++GA+ + +IEDF E H+D+LK Y + Sbjct: 5 FVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTYFNLP 64 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 NGIP HDT +RV S I+PA F + F+ W++ Sbjct: 65 NGIPSHDTFSRVFSAINPASFQDSFLIWLKAI 96 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 115 bits (287), Expect = 3e-24, Method: Composition-based stats. Identities = 48/205 (23%), Positives = 84/205 (40%), Gaps = 18/205 (8%) Query: 100 SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL 159 + + IA+DGK L+ S + R H++SA + + + +++ K+NE T LL Sbjct: 128 AGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKPLL 185 Query: 160 NMLDIKGKIITTDAMG-CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP 218 LD+ ++T DA+ + +I+ ++ + Y+ +K Q + P +++ Sbjct: 186 APLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQLAT-LPWRDIPV- 243 Query: 219 EHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 +A SE HGR E C +PDEL + L A+ K Sbjct: 244 ---QHAASEVGHGRRESSSIKTCAIPDELGGIAYPHARL-----AIRVHRRCQPTGKRES 295 Query: 279 MTVRYYISSADLTAEKFATAIRNHW 303 Y ++S D A R W Sbjct: 296 RESVYAVTSLDAH-----QATRPIW 315 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 115 bits (287), Expect = 4e-24, Method: Composition-based stats. Identities = 33/128 (25%), Positives = 61/128 (47%), Gaps = 3/128 (2%) Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 + + + GLK + + + + R+ ISS DL + A+R+HW Sbjct: 20 KKWLAKAYRRSGLKSIIKV--HTQVHDKSTGKDTAETRWNISSLDLHVVQALNAVRSHWQ 77 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 VE+ +HW LD+ D+ +I R +F+ +R IA+ + D + RK + A +D Sbjct: 78 VES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMARKKKMAGLD 136 Query: 365 RNYLASVL 372 +Y +++L Sbjct: 137 DDYRSNLL 144 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 114 bits (284), Expect = 7e-24, Method: Composition-based stats. Identities = 43/96 (44%), Positives = 62/96 (64%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +A+N + +K A + RK + A M L ++ Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVLDLIVNA 96 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 108 bits (270), Expect = 3e-22, Method: Composition-based stats. Identities = 74/88 (84%), Positives = 77/88 (87%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKEPEMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 107 bits (268), Expect = 5e-22, Method: Composition-based stats. Identities = 44/112 (39%), Positives = 65/112 (58%) Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDC 322 A+ + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E Sbjct: 3 AIGMTINLVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEHQS 62 Query: 323 KIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +IR+G+A FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 63 RIRKGHADINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLG 114 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 107 bits (266), Expect = 1e-21, Method: Composition-based stats. Identities = 52/128 (40%), Positives = 69/128 (53%), Gaps = 1/128 (0%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 + ++ +KI ++ DYL AVKG QG L AF++ F LNN + + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 71 SRAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FATAIRNH 302 TA R H Sbjct: 130 LLTASRLH 137 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 104 bits (259), Expect = 6e-21, Method: Composition-based stats. Identities = 39/171 (22%), Positives = 62/171 (36%), Gaps = 8/171 (4%) Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 G L +K Q L+ A E + + + R E R V + Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYTRGHPFVD---EHHTHEIGRRNRIEKRAVHVWHLH 58 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQ--KKEPEMTVRYYISSADLTAEKFATAIRNH 302 L + + + L + YY+ L A +F+ AIRNH Sbjct: 59 PSLGSAPW-YDHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNH 117 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 W VEN+ H+ D ED +IRR F+ +R A+N++ ++V Sbjct: 118 WRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENIS 166 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 102 bits (254), Expect = 2e-20, Method: Composition-based stats. Identities = 38/125 (30%), Positives = 63/125 (50%), Gaps = 11/125 (8%) Query: 224 AISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 SEK HGR E R + +WKGLK+ R++ K + + V Y Sbjct: 2 TTSEKGHGRIEKRTLETTPIVT----VGQKWKGLKQGLRITRERAV----KGKKTVEVVY 53 Query: 284 YISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 I+S A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ Sbjct: 54 GITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVV 113 Query: 341 INILT 345 +++L Sbjct: 114 VHLLA 118 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 99.8 bits (247), Expect = 2e-19, Method: Composition-based stats. Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 4/120 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H I D R +H L +I+LL I AV+SG+EGWE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKDVIAIDG--KTLRHSYDKSR 121 GIP HDTIARV+ + + + + D + + G + H + Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREG 126 Score = 62.0 bits (149), Expect = 4e-08, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 33/79 (41%), Gaps = 3/79 (3%) Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLN---KAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 K+IA+ I KQ DY+ A+KG L +A+ K + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 IRLHIVCDVPDELIDFTFE 253 R V ++ + Sbjct: 147 TRRCQQVLVNKSWLNNKYR 165 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 99.0 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 29/131 (22%), Positives = 55/131 (41%), Gaps = 6/131 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L ++ +PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRD------CHSSDDKDVIAIDGKTLRHSY 117 F + +P T+ R++ I + W+R VIA+DGK +R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRRGAIHV 128 ++ A+ + Sbjct: 149 LRAAGPSALGL 159 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 98.3 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 43/85 (50%) Query: 7 MEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 ++H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCISPAKFHECFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 98.3 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 34/88 (38%), Positives = 50/88 (56%), Gaps = 1/88 (1%) Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 S A+H++SAF + +V+ Q+ EKSNEI A ELL LDI G +T DAM Q+ Sbjct: 2 ASETVKAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQR 61 Query: 179 DIAE-KIQKQGGDYLFAVKGTQGRLNKA 205 + A ++ + D++ VK Q L +A Sbjct: 62 EHARFAVEDKRADFVMTVKDNQPELREA 89 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 98.3 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 36/142 (25%), Positives = 65/142 (45%), Gaps = 6/142 (4%) Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R + +P + + G+K + + S + RYY++S + Sbjct: 2 RRRYFAYRLPKTINTGSLV--GIKSIIATETISS--KTNETAISAEWRYYVTSHETEKSD 57 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKA 352 +RNHW +EN+LHW LDV +N+D K R A FS I+ + ++++ K Sbjct: 58 LHLYVRNHWSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKR 117 Query: 353 GLRRKMRKAAMDRNYLASVLAG 374 +R ++++ D YL S+L+ Sbjct: 118 SVRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 98.3 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 39/80 (48%), Positives = 53/80 (66%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H + D R +H L DI+LL I AV+SG+EGWEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCISPAK 85 GIP HDTIARV+ + + Sbjct: 67 AGIPRHDTIARVICRLKADE 86 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 97.9 bits (242), Expect = 6e-19, Method: Composition-based stats. Identities = 34/122 (27%), Positives = 57/122 (46%), Gaps = 11/122 (9%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS 287 K HGR E R L ++ W G++++ R + + V Y ISS Sbjct: 3 KGHGRVERRSITTTT---WLNEYLTRWPGVQQVFRLERQRR----ADGKTTVEVVYGISS 55 Query: 288 ADLTAE---KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 A R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ +L Sbjct: 56 LSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLL 114 Query: 345 TN 346 Sbjct: 115 RR 116 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 97.5 bits (241), Expect = 7e-19, Method: Composition-based stats. Identities = 62/371 (16%), Positives = 110/371 (29%), Gaps = 45/371 (12%) Query: 10 ISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 + +PD R L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSD-------DKDVIAIDGKTLRHSYDK--- 119 T + + +R V+A+DGK Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 120 ----------SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIK 165 + S I + ++NE +L L Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAI 225 +++T DA + + G DY+FA+K + K E E+ D Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARREDVLDN 259 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFE------WKGLKKLCVAVSFRSIIAEQKKEPEM 279 + + R + V + W + S + E Sbjct: 260 ATTA-----TREIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTS---TVRRSGVVIER 311 Query: 280 TVRYYISSA---DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCK--IRRGNAAELFS 334 R ++SS LT +++ +R HW VEN H LD ED+ N Sbjct: 312 DSRLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVL 371 Query: 335 GIRHIAINILT 345 +R IA +L Sbjct: 372 LLRRIAYTLLA 382 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 96.3 bits (238), Expect = 2e-18, Method: Composition-based stats. Identities = 34/75 (45%), Positives = 52/75 (69%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++++E + + D R + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 95.9 bits (237), Expect = 2e-18, Method: Composition-based stats. Identities = 38/126 (30%), Positives = 60/126 (47%), Gaps = 7/126 (5%) Query: 222 SYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV 281 + S +S GREE R V + + EW+ ++ + ++ + Sbjct: 3 EHTHSIQSRGREEHRCIQVY---EPVGIALQEWEAIRSVLCVQR----WGTRQGKAYHNT 55 Query: 282 RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAI 341 YYISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I I Sbjct: 56 AYYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVI 115 Query: 342 NILTND 347 NIL + Sbjct: 116 NILRLN 121 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 ++ HGR R + +P+EL + G+K R + + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHN--HALSGIKSCIAV--ERIVQEGKGEPKTSHFSYYIT 89 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 + + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 90 NHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 DK--VFKAGLRR 356 K ++ Sbjct: 149 KDWAGKKKSVKS 160 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 95.6 bits (236), Expect = 3e-18, Method: Composition-based stats. Identities = 28/77 (36%), Positives = 51/77 (66%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + +++H S + D RQ+W+V + L +I LL +CA +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV 77 + +E G+P HDT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 93.6 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 45/174 (25%), Positives = 74/174 (42%), Gaps = 18/174 (10%) Query: 195 VKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K Q L + N+ D+ K R+E R V V D L EW Sbjct: 1 MKANQSNLFETACAI----AANDAPADTAFSRNKGRSRQEDRTVEVFPVGDALAG--TEW 54 Query: 255 KGLKKLCVAVSFRSII---AEQKKEPEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLH 310 + K + V+ R+++ A + V +Y+SSA + A +A AIR HW +EN+ H Sbjct: 55 QPFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNH 114 Query: 311 WRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 + DV +ED +IR + + R A+NI+ + + +A + Sbjct: 115 YVRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIANVA------QALWN 160 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 93.6 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 5/120 (4%) Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV-RYYISSADL 290 R E + V L+ ++ L+++ + K E + +SS Sbjct: 1 RIETQTIRVSS----LLKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQA 56 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + + +R HW +EN+LHW D V ED C R GN A + + +R++ I++L Sbjct: 57 SPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGSK 116 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 92.1 bits (227), Expect = 3e-17, Method: Composition-based stats. Identities = 30/117 (25%), Positives = 49/117 (41%), Gaps = 6/117 (5%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA + ++ F THLD L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 F-ENGIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLR 114 P + T+ ++ I + F + + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWC 119 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 91.3 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 36/154 (23%), Positives = 54/154 (35%), Gaps = 14/154 (9%) Query: 195 VKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K Q + P + + S HGR E R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 KGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHW 311 L A+ + Y ++S D T + A A+R HW VE H Sbjct: 57 GRL-----ALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 DV E+ + G A + R++A+ +L Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLK 144 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 91.3 bits (225), Expect = 6e-17, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + +PD + H+L +L L A + G +G++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W + ++ +A+DGK ++ Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAW--QAAQLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTD 146 D + + H++S + Q K+ Sbjct: 125 VDHTGAQT--HIVSLIGHESKHCVAQKKSA 152 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 90.5 bits (223), Expect = 8e-17, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 60/148 (40%), Gaps = 9/148 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L ++ + D R+T H++S +L + A + G +G++ I + +Q Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 62 GD-----FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 + IP I V+ P + + D + +A DGKT++++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNA 331 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQT--HIASVVGHESKTTHTQKK 357 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 89.8 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S D R + +H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 89.4 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 43/75 (57%) Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 RKAAMDRNYLASVLA 373 A M+ ++ +L Sbjct: 75 LLACMEDDFREELLG 89 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 88.6 bits (218), Expect = 3e-16, Method: Composition-based stats. Identities = 34/130 (26%), Positives = 55/130 (42%), Gaps = 2/130 (1%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L ++ IPD+R+ + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 2 QLKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-IAIDGKTLRHSYDKSRR 122 PVH +I + + F + IA+DGKTLR + + R Sbjct: 62 HWKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSR 121 Query: 123 RGAIHVISAF 132 SA Sbjct: 122 TARPLRYSAH 131 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 87.8 bits (216), Expect = 6e-16, Method: Composition-based stats. Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 5/128 (3%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH 220 M +KG ++T DAMGCQ+ IA+++++ G D + ++KG QG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 + E SHGR R V + E + W ++ L V R A + Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSETVT 117 Query: 279 MTVRYYIS 286 RYY+S Sbjct: 118 SDFRYYLS 125 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 86.7 bits (213), Expect = 1e-15, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 53/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + +PD R+ H+L + LT A + G +G++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W +S D + +A+DGK ++ Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRRGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 68/212 (32%), Gaps = 34/212 (16%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICA-VISGAEGWEDIEDFGETHLDFLKQYG 62 + + E + + D R T + + + +C+ +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFEN------GIPVHDTIARVVSCISPAKFHECFINWM---------------------- 94 +P TI + + + ++ Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 95 ---RDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 + +A+DGKT RH+ K +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHA--KRADGSKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 LL LD+ ++T DA+ + + Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDT 231 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 85.9 bits (211), Expect = 2e-15, Method: Composition-based stats. Identities = 47/167 (28%), Positives = 70/167 (41%), Gaps = 30/167 (17%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVI----SGAEGWEDIEDFGETHLDF 57 +LKKL+E S IPD R+ V+H+L+ +LL + + + S E D+ L Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSR--PAFLQA 136 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD--------VIAID 109 L+ +P DT+ARV+ I P K E FI +R IAID Sbjct: 137 LQGLFPELETLPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAID 196 Query: 110 G--KTLR-------------HSYDKSRRRGAIHVISA-FSTMHSLVI 140 G K +R + D + + I+V+ A F + L I Sbjct: 197 GTQKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTI 243 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 85.2 bits (209), Expect = 3e-15, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 42/86 (48%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + ++ + + + D R T +H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCISPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 85.2 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 38/136 (27%), Positives = 51/136 (37%), Gaps = 8/136 (5%) Query: 221 DSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK---KEP 277 + HGR+E R V DV L W GL V+ + + K Sbjct: 3 SAETTDRGRHGRQEHRWVEVFDVSGRLGPT---WDGLIAAVARVTRLTWHKDTKSGLWHK 59 Query: 278 EMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIR 337 Y +L A TAIR HW VE + H+ DV ED +IR F+ +R Sbjct: 60 TQETALYACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARLR 117 Query: 338 HIAINILTNDKVFKAG 353 A+NIL + Sbjct: 118 SFALNILRANGTNNIS 133 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 84.0 bits (206), Expect = 8e-15, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 9/118 (7%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 + P T+ RV+ I NW+ +A+DGKTL Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSL--GLSPAALAVDGKTLAG 130 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 84.0 bits (206), Expect = 9e-15, Method: Composition-based stats. Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 4/108 (3%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 +L L + AV++G E I FG L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTM 135 W+ D H D D IA+DGK L S D + H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGSRDGAV--PGTHLLAAYAPQ 107 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 83.2 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 28/69 (40%), Positives = 42/69 (60%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ H + I D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIP 69 G G+P Sbjct: 72 KGILTEGVP 80 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 82.1 bits (201), Expect = 3e-14, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 46/129 (35%), Gaps = 13/129 (10%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L E ++ + D R+ H +LL+ AV++GA + I ++ + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 P TI RV+ P + H D +AIDGK+ R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRRG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 81.7 bits (200), Expect = 4e-14, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 46/106 (43%), Gaps = 1/106 (0%) Query: 261 CVAVSFR-SIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R E + + +Y+SS + +A + IR HW VEN++H+ DV E Sbjct: 12 GRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGE 71 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 D +IR +++S R A+N+ + + ++ + Sbjct: 72 DRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRRCMFGLST 117 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 27/113 (23%), Positives = 46/113 (40%), Gaps = 6/113 (5%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L ++S IPD+R+ + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 2 QLKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQL 61 Query: 65 -ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-----DKDVIAIDGK 111 P H +I + + F D VI + K Sbjct: 62 HRKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 80.1 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 3/92 (3%) Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMN 318 V R + + + Y I+S + + R HW +EN LH+ D Sbjct: 29 VFCIHRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIENGLHYVRDTAFR 88 Query: 319 EDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 ED +IR NA + ++++ + + V Sbjct: 89 EDHSQIRTQNAPRAMASLKNLVVGLFHFLNVP 120 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 79.8 bits (195), Expect = 2e-13, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 58/122 (47%), Gaps = 7/122 (5%) Query: 253 EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWR 312 W+G + R ++ + E Y ++S A++ R HW VEN+LH + Sbjct: 3 GWRGSRMALRMR--RRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHK 60 Query: 313 LDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 D V+ ED + R+G A ++ +R + +N+L + + + R +RK + D L ++ Sbjct: 61 RDTVLGEDASRSRKGAAGLMY--LRDVILNLL---HLKRWPVLRSVRKFSADPKVLLRLI 115 Query: 373 AG 374 G Sbjct: 116 RG 117 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 79.0 bits (193), Expect = 2e-13, Method: Composition-based stats. Identities = 37/129 (28%), Positives = 52/129 (40%), Gaps = 13/129 (10%) Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRD----CHSSDDKDVIAIDGKTLRHSYDKS 120 PV+ ++ ++ I P F C + IAIDGKTLR S+D Sbjct: 8 LRRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAF 67 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL---------NMLDIKGKIITT 171 A +V+SAF+ H +++ DEKSNEI A L+ I + Sbjct: 68 SDTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVML 127 Query: 172 DAMGCQKDI 180 DAM I Sbjct: 128 DAMTFAPAI 136 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 78.6 bits (192), Expect = 4e-13, Method: Composition-based stats. Identities = 29/96 (30%), Positives = 39/96 (40%), Gaps = 4/96 (4%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPL---KELNNPEHDSYAISEK 228 D +GCQK IA+ I +Q DYL AVK Q L++A F D K Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 GR E R V + I + W L+ + + Sbjct: 68 GPGRLEQRRCWVGYEIPDTI-NSQNWAKLETIVMVE 102 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 75.1 bits (183), Expect = 4e-12, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 268 SIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + + V + I+S A +R HW +EN+LH+ DV + ED C++ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 R G+A ++ + +R+ +++ K + + MD ++ Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVKAVSCPEAIERLQ--MDPAMAKGLIG 114 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 73.2 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 12/153 (7%) Query: 197 GTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 G Q L + ++ K + E HGR+ + + W G Sbjct: 8 GDQKTLYRQIADQLLGKRHIPLMATDH---EIGHGRD---ILWTLRAKEAPQHIKANWHG 61 Query: 257 LKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 + ++ + ++P +I+S T + +R W VE+ HW D Sbjct: 62 TSWIAEVIAT----GTRDRKPFKATHRFITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++EDD + R GN A + + +R A+N+L Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTGF 148 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 27/64 (42%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + +PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFEN 66 Sbjct: 60 PLPY 63 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 72.8 bits (177), Expect = 2e-11, Method: Composition-based stats. Identities = 29/138 (21%), Positives = 49/138 (35%), Gaps = 11/138 (7%) Query: 42 EGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRD---CH 98 + + G + P I R++ I P W+ Sbjct: 221 RATSALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAP 280 Query: 99 SSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE- 157 + + IA+DGKTLR S + HV++A +V+ D K+NEIT Sbjct: 281 APGSRRAIAVDGKTLRGSRTRDSAAR--HVLAAADQHTGIVLASTDVDTKTNEITRFTAS 338 Query: 158 -----LLNMLDIKGKIIT 170 LL+ I+ +++ Sbjct: 339 GSHADLLSSRCIRSGVVS 356 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 72.4 bits (176), Expect = 3e-11, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 41/92 (44%), Gaps = 5/92 (5%) Query: 274 KKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + ++S + A + HW +EN+LHW DV +ED + R GNA Sbjct: 69 GGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAP 128 Query: 331 ELFSGIRHIAINILTN--DKVFKAGLRRKMRK 360 ++ + +R++AI IL K LR R Sbjct: 129 QVMTSLRNLAITILRLTGAKNIAKALRHHARH 160 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 71.7 bits (174), Expect = 4e-11, Method: Composition-based stats. Identities = 21/93 (22%), Positives = 37/93 (39%), Gaps = 3/93 (3%) Query: 273 QKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + ++S T E R HW +EN+ H D +ED +IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + +R +A++IL V + A+ Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAAS 94 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 82/230 (35%), Gaps = 37/230 (16%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L E I+ + D R V+ +S I + + + + +E + K+ Sbjct: 16 VYHLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKAL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHE--------CFINWMRDCHSSDDKDVIAIDGKTLR 114 + +P DTI RV+S +E N + + D V+AIDG L Sbjct: 72 PKKTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELF 131 Query: 115 HSYDKSRRR--------------GAIHVISAFSTMHSLVIGQIKTDEKSN-------EIT 153 S K V S + L++GQ + K + EIT Sbjct: 132 ESTKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEIT 191 Query: 154 AIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQ 199 A L+ L + II DA+ C+ +++ G D + VK + Sbjct: 192 AGKRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 69.4 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 15/81 (18%), Positives = 27/81 (33%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 +PD R V H+ S IL + A +GA + I ++ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCISPAKFHECFI 91 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLG 129 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 66.7 bits (161), Expect = 1e-09, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C++ A F+ +R IAI++L D+ K LR + RK A D +Y+ + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 65.5 bits (158), Expect = 3e-09, Method: Composition-based stats. Identities = 28/109 (25%), Positives = 42/109 (38%), Gaps = 11/109 (10%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 + HGR E R L+ W GLK R++ K + V + I+ Sbjct: 2 DPGHGRIETRTVRATP----LLTCHDRWTGLKHGFRITRTRTV----KGVTTVEVVHGIT 53 Query: 287 SAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 S A +R+HW +EN+ H DV + ED+ + R A Sbjct: 54 SRPVERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 65.1 bits (157), Expect = 3e-09, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 46/106 (43%), Gaps = 3/106 (2%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHECFINWMRD---CHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 + +W+ + VIA+DGK +R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 65.1 bits (157), Expect = 4e-09, Method: Composition-based stats. Identities = 31/60 (51%), Positives = 34/60 (56%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 11 LLKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 64.0 bits (154), Expect = 9e-09, Method: Composition-based stats. Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 1/63 (1%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + D R+T +H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFE 65 Sbjct: 61 PLP 63 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 27/48 (56%), Positives = 40/48 (83%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTDEKSNE Sbjct: 26 LSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 62.8 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 27/122 (22%), Positives = 55/122 (45%), Gaps = 7/122 (5%) Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---DL 290 E+ + V P L + + G ++ R ++ + E TV Y ++S Sbjct: 21 EVWTYRVWASPY-LPEEMRAFPGCGQVVRM--EREVVRKGTGEVRRTVSYALTSLGPEVA 77 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 A + + + W VEN+ W D +++ED C++R G A++ + +R +++L V Sbjct: 78 DARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVGAQVLAALRAFLVSLLHRQGVR 136 Query: 351 KA 352 + Sbjct: 137 EK 138 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 61.7 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 16/58 (27%), Positives = 24/58 (41%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 R WH+EN+LHW DV E + R G + + +R+ AI + Sbjct: 11 AQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFHRGN 68 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 61.3 bits (147), Expect = 7e-08, Method: Composition-based stats. Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 2/100 (2%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W GL + + R VR+ + S+ +E A AIR H + W L Sbjct: 7 WPGLTTVLATETLR--GGNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALATGEPWVL 64 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +V E+ ++R AA + +R +A++ D A Sbjct: 65 EVSFGEERSRVRERCAARHLALLRRVALDRRRADASLTAS 104 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 60.1 bits (144), Expect = 1e-07, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 33/98 (33%), Gaps = 1/98 (1%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ-YGDFEN 66 + S + D R+ + L +L + +++SG+ ++ F E L L + +G Sbjct: 10 DVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTSWR 69 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD 104 P I + + + F S Sbjct: 70 KAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 59.3 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 51/336 (15%), Positives = 88/336 (26%), Gaps = 54/336 (16%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 E IPD R L D+L+ + A F LD ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------DDKDVIAIDG--------- 110 P + V+ + P F + ++ D V+A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 111 -----KTLRHSYDKSRRRGAIHVISAFSTMHSLVIG------QIKTDEKSN--EITAIPE 157 T RH+ + + S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 L ++ DA +QK +L VK A Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVK-------AADHAHLFAH 254 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + ++ + E + R +R + L + + + + Sbjct: 255 VCARQDQHAFEVVEDADPRTGLRRSYLWIADLPLNESNDD-------VRVNFVHLVELDP 307 Query: 274 KKEPEMTVRYY-ISSADLTAEKFATAIRNHWHVENK 308 P ++ + A A R W +EN+ Sbjct: 308 DGTPREWTWVADMAVTGANVRQLARAGRARWRIENE 343 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 10/158 (6%) Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYA------ISEKSHGR 232 ++A ++ + +G Q L +A + L+ H A + + G Sbjct: 38 ELAAQVPDRISQPRLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGS 97 Query: 233 EEIRLHIVCDVPDEL-IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 + R VP L + L + ++ + E K+ Y I + Sbjct: 98 RQTRALKAVTVPAGLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAH 157 Query: 292 ---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 + AT IR HW +E +L W DV + ED + R Sbjct: 158 DALPAELATWIRGHWSIEVRLRWVRDVTLGEDLHQART 195 Score = 47.8 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 23/38 (60%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVIS 39 + L+E ++ +PD R+ V H + +L + +CA+++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 40/65 (61%), Positives = 43/65 (66%), Gaps = 12/65 (18%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTI------CAVISGAEGWEDIEDFGETH 54 MELKKLMEHISIIPDYRQ WKVEHKL DIL + C ++ G FGETH Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRG------FGETH 54 Query: 55 LDFLK 59 LDFLK Sbjct: 55 LDFLK 59 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 59.0 bits (141), Expect = 3e-07, Method: Composition-based stats. Identities = 55/354 (15%), Positives = 112/354 (31%), Gaps = 57/354 (16%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHK----LSDILLLTICAVISGAEGWEDIEDFGETHLDF 57 EL L+ + IPD R K HK L LL+ + S E ++ L Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD--------KDVIAID 109 L++ +P DT+ R++ I A + ++ +R IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 G------------KTLRHSYDKSRRRGAIHVI----SAFSTMHSLV-----------IGQ 142 G + L+ K R + + ++ + LV +G Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 IKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 + ++ E+ L + L ++ D + + ++ + ++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKD- 311 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + +E + ++ GR + V D+ L Sbjct: 312 -------KDLPTVWEEFRALQPRQLPTLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKLH 364 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT----AEKFATAIRNHWHVENK 308 + ++ + E + E ++SS L+ E+ R+ W +E Sbjct: 365 VVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAG 418 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 57.8 bits (138), Expect = 7e-07, Method: Composition-based stats. Identities = 23/93 (24%), Positives = 40/93 (43%), Gaps = 4/93 (4%) Query: 84 AKFHECFINWMRDCHSSDDK-DVIAIDGKTLRHSYD--KSRRRGAIHVISAFSTMHSLVI 140 F + WM + D D + DGKTLR S D I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDEKSNEITAIPELLNMLDIKGKIITTD 172 Q ++S+E ++ LL+ +++ ++ D Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQAD 94 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 57.4 bits (137), Expect = 9e-07, Method: Composition-based stats. Identities = 16/62 (25%), Positives = 29/62 (46%) Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +TA T +R +W +EN++H+ D ED GN + R++AI ++ + Sbjct: 88 SVTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNG 147 Query: 349 VF 350 Sbjct: 148 TR 149 Score = 44.3 bits (103), Expect = 0.008, Method: Composition-based stats. Identities = 13/68 (19%), Positives = 24/68 (35%), Gaps = 1/68 (1%) Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 + P T+ + I F W+ + + +AIDGK LR ++ Sbjct: 28 HFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKVLRGAWSGD 86 Query: 121 RRRGAIHV 128 A ++ Sbjct: 87 ESVTAAYL 94 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 55/385 (14%), Positives = 107/385 (27%), Gaps = 58/385 (15%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + + D R+ +++ I + E E ++ + +Q Sbjct: 38 VYGFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLV 93 Query: 63 DFENGIPVHDTIARVVSCISPAK--------FHECFINWMRDCHSSDDKDVIAIDGKTL- 113 +P HDT+ + + + S + V AIDG L Sbjct: 94 PKNIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELF 153 Query: 114 ------------RHSYDKSRRRGAIHVISAFSTMHSLVIG-------QIKTDEKSNEITA 154 R DK+ V++ ++ +I Q D+ E T Sbjct: 154 HTKAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTV 213 Query: 155 IPELLNML-DIKGK---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 L+ + + GK + T DA+ + + G + +K + R+ K F Sbjct: 214 AQRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF 273 Query: 211 PLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSII 270 + ++ + V + +W ++ V Sbjct: 274 ANRLPDSTWEERDGKGNT------------VYVQAWDEEGLAQWPQVRVPMRIVKIIRHT 321 Query: 271 AEQKKEPEMTV----------RYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNED 320 + E V SS + A W +EN L D Sbjct: 322 NKTVIEANKEVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALD 381 Query: 321 DCKIRRGNAAELFSGIRHIAINILT 345 C + A + G + +A N+ Sbjct: 382 HCFVHDSVAIKAMIGFQVLAFNLKR 406 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 57.0 bits (136), Expect = 1e-06, Method: Composition-based stats. Identities = 14/46 (30%), Positives = 19/46 (41%), Gaps = 1/46 (2%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGET 53 E IPD R V H+L +L L AV+ G G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAA 114 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 56.6 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 1/46 (2%) Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + + RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 10 GNKLVLEYRYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 55.5 bits (132), Expect = 3e-06, Method: Composition-based stats. Identities = 20/56 (35%), Positives = 29/56 (51%) Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K Q +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 55.1 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 7/71 (9%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT---AEKFATAIRNHWHVENK 308 +WKGLK+ R++ + V + I+S A + +R+HW +EN+ Sbjct: 9 QDWKGLKQGFQITRERTV----NGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQ 64 Query: 309 LHWRLDVVMNE 319 LH+ DV + E Sbjct: 65 LHYVPDVTLGE 75 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 54.7 bits (130), Expect = 5e-06, Method: Composition-based stats. Identities = 14/42 (33%), Positives = 27/42 (64%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI 47 L++ SI+PD R + L +++++T+ AV+ GA+ W D+ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDV 43 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 54.3 bits (129), Expect = 7e-06, Method: Composition-based stats. Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 5/84 (5%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +T ++ R HW + LH+ D NED +IR G+ + + AI +L + Sbjct: 1 MTPQQVLAINRGHWSI-ASLHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 FKAGLRRKMRKAAMDR----NYLA 369 + MR+ A +YL Sbjct: 60 PGQYIPEMMRQLARRPRQVLDYLR 83 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 53.6 bits (127), Expect = 1e-05, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 42/137 (30%), Gaps = 18/137 (13%) Query: 192 LFAVKGTQGRLNKAFEEKFPLKE-----------LNNPEHDSYAISEKSHGREEIRLHIV 240 + K Q L E ++ L P+ + G R+ Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFAT 297 L+ W GLK R++ K + V + I+S A Sbjct: 61 TVRATPLLTCHDRWTGLKHGSRITRARTV----KGVTTVEVLHGITSLTVERADARALLG 116 Query: 298 AIRNHWHVENKLHWRLD 314 +R+HW +EN+ H D Sbjct: 117 LVRSHWRIENQRHDVRD 133 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 52.4 bits (124), Expect = 3e-05, Method: Composition-based stats. Identities = 19/56 (33%), Positives = 32/56 (57%), Gaps = 1/56 (1%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDF 57 EL++L ++ + D R HKL +++L+ +CAVI+GA+G IE + L Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQL 73 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 51.3 bits (121), Expect = 6e-05, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 35/111 (31%), Gaps = 7/111 (6%) Query: 41 AEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 + +E F + L G ++ + P K E + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALHQVFPEA--- 55 Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 D V+ +DGK LR S + + ++ + + Q + + K E Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGKVVE 104 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 50.5 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 66/176 (37%), Gaps = 21/176 (11%) Query: 43 GWEDIEDF-----GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 +E++ F G+ D L +Y +F+N P + + + + I P F F + + Sbjct: 40 SFEEVMKFMLTMEGKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSF 99 Query: 98 HSSD---DKDVIAIDGKTLRHSYDK------------SRRRGAIHVISAFS-TMHSLVIG 141 + +IA DG L +++ + +H+ + + Sbjct: 100 TDNVTYNGLRLIACDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDA 159 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG 197 I+ +NE A+ E+++ + I D +I ++ +G YL VK Sbjct: 160 IIQPSRLANERRAMCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKD 215 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 50.1 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 15/48 (31%), Positives = 25/48 (52%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWM 94 + F + + ++ D + G P DT+ RV + I P KF E F +W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWI 48 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 49.7 bits (117), Expect = 2e-04, Method: Composition-based stats. Identities = 55/351 (15%), Positives = 108/351 (30%), Gaps = 74/351 (21%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI-EDFGETH-LDFLKQ 60 K L++ + + D R + + IL + + G + + E F + ++ ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFE--NGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDV-IAID 109 + N +P +DTI ++ + P + F + +K I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GKTL-------------RHSYDKSRRRGAI----HVISA--FSTMHSLVIGQIKTDEKSN 150 G + R DK + HV+ A L I + +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 151 -------EITAIPELLNMLD-----IKGKIITTDAMGCQKDIAEKIQKQGGD-YLFAVKG 197 E+ A L++ L + +I D++ + + E I + Y+F K Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFE-ICDKHNWKYIFRFKE 265 Query: 198 TQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGL 257 + + E N + + V D+ Sbjct: 266 DRIKTVSQEFRAIQSLETNGKSSEYF---------------WVNDIAYNDR--------- 301 Query: 258 KKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 V + + + E +K+ E + AE A R W +EN+ Sbjct: 302 ---LVNLVEKVKVTENEKKQEFLFITNFRITERNAEILVQAGRRRWKIENE 349 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 49.3 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 25/45 (55%) Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 T + + Q++ E +NEIT LL+ D++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 48.6 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 18/69 (26%), Positives = 26/69 (37%), Gaps = 5/69 (7%) Query: 268 SIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + V Y I+S A R HW +EN LH+ DV + ED C + Sbjct: 5 ERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRCPV 64 Query: 325 --RRGNAAE 331 R Sbjct: 65 GARSRPTPR 73 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 48.2 bits (113), Expect = 6e-04, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 32/84 (38%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L +S IPD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCISPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 47.8 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 14/64 (21%), Positives = 23/64 (35%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L + +PD V H+L+ +L+ ICAV + I ++ G Sbjct: 13 AGLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGG 72 Query: 64 FENG 67 G Sbjct: 73 HRPG 76 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 47.8 bits (112), Expect = 7e-04, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 11/60 (18%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L + I D RQ K H L +L++TI +I + LD+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 >UniRef50_Q6TKW2 Aec6 n=2 Tax=Escherichia RepID=Q6TKW2_ECOLX Length = 98 Score = 47.0 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 37/48 (77%), Positives = 39/48 (81%) Query: 78 VSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +SCI KFHECFIN MR+CHSSDD DVIAIDGK L HS DKSRRR A Sbjct: 1 MSCIRSVKFHECFINRMRECHSSDDIDVIAIDGKALPHSCDKSRRRRA 48 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 46.6 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 1/55 (1%) Query: 8 EHISIIPDYRQTW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 EH +PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 46.6 bits (109), Expect = 0.002, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 40/108 (37%), Gaps = 6/108 (5%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFIN-WMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 G P + + + P + + V+ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITT 171 +H+ + +++ Q+ DEK+NE + L + D+ G +IT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITA 134 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 46.2 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 13/50 (26%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF-------------SGIRHIAINILT 345 +HWRLDV MNEDDC+IRRGN F +R I INIL Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILK 50 >UniRef50_Q0RW00 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW00_RHOSR Length = 98 Score = 45.5 bits (106), Expect = 0.004, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 2/48 (4%) Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 GKT R + D S H+++A +V+ Q+ + NEI + + Sbjct: 18 GKTWRGAKDGSG--HLTHLLAAVDHDAGVVLRQVAVGARINEIPLLLD 63 >UniRef50_Q2RR82 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RR82_RHORT Length = 84 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 1/46 (2%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQ 199 A E++ LD+ G++ T DA+ CQ E ++ G L K Q Sbjct: 36 ATQEMIAPLDLTGRLFTLDALHCQ-KTFEIARQAGNHLLVQAKINQ 80 >UniRef50_B2ISL1 IS1380-Spn1 transposase n=83 Tax=Streptococcus pneumoniae RepID=B2ISL1_STRPS Length = 535 Score = 45.1 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 36/202 (17%), Positives = 67/202 (33%), Gaps = 31/202 (15%) Query: 18 QTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG--IPVHDTIA 75 Q + SDIL+ + +++G D+ L + G + T++ Sbjct: 142 QRRYCRYSDSDILVQFLFQLLTGYGT-----DYACKELSADAYFPKLLEGGQLASQPTLS 196 Query: 76 RVVSCISPA----------KFHECFINWMR--DCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 R +S + E F+ + + D GK +Y+ R Sbjct: 197 RFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTYGKQEGVAYNAHYRA 256 Query: 124 GAIHVISAFSTMHSLVI-GQIK------TDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 H + AF Q++ ++E + IT + E N L + D+ Sbjct: 257 HGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQL-----LFRMDSGFA 311 Query: 177 QKDIAEKIQKQGGDYLFAVKGT 198 + + I+K G YL +K Sbjct: 312 TPKLYDLIEKTGQYYLIKLKKN 333 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 44.7 bits (104), Expect = 0.005, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_B0TLQ7 Putative uncharacterized protein n=1 Tax=Shewanella halifaxensis HAW-EB4 RepID=B0TLQ7_SHEHH Length = 74 Score = 44.7 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 19/44 (43%), Positives = 25/44 (56%) Query: 7 MEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDF 50 EH+SII R + EH DI+ L A+ S EGW DI++F Sbjct: 4 FEHLSIIKAPRSSINHEHDPVDIMFLVNSAIASDCEGWLDIDEF 47 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 44.3 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 18/31 (58%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 HW VEN LHW L+V NED ++R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_B2RI66 Partial transposase in ISPg2 n=1 Tax=Porphyromonas gingivalis ATCC 33277 RepID=B2RI66_PORG3 Length = 87 Score = 43.9 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 13/46 (28%), Positives = 22/46 (47%) Query: 17 RQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 R K + L + L+ + +SG W +IED+ E + + LK Sbjct: 23 RIESKEVYPLDFLFLIVFLSTLSGDTSWYEIEDYAEEYEEVLKSRY 68 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 43.9 bits (102), Expect = 0.009, Method: Composition-based stats. Identities = 39/275 (14%), Positives = 87/275 (31%), Gaps = 33/275 (12%) Query: 56 DFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDG- 110 L + DF+ P + + I P F F + + + + ++A DG Sbjct: 59 KELLDFFDFDVNAPTVSAYTQQRAKILPEAFEYLFHAFTEENAQTKNLYEGYQLLACDGS 118 Query: 111 -----------KTLRHSYDKSRRRGAIHVISAFSTMHSLVI-GQIKTDEKSNEITAIPEL 158 +TL S +H+ + + ++ I ++T E A ++ Sbjct: 119 NLTIAPNLNDPETLWKSNQLGATGNHLHLNALYDVLNRTYIDALVQTASTYQEHRACIQM 178 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP 218 + + + I+ D +I ++G +L +K + L + Sbjct: 179 IERVTLDKVILIADRGYENYNIMSHAIEKGWKFLIRIKDVH---SNGIASGLELPQTAVF 235 Query: 219 EHDSYAISEKSHGREEIRLHIVCDVPDELIDF-----TFEWKGLKKLCVAVSFRSIIAEQ 273 + D I ++ + + + + D+ ++ +SFR + Sbjct: 236 DMDINLILTRNQTKSKKQAGYKFMPTVQTFDYLPIGSKEDYP--------ISFRIARFKI 287 Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV + +AEK W +E Sbjct: 288 ADDSYETVITNLDRFCFSAEKLKELYHLRWGIETS 322 >UniRef50_Q8X3B6 H repeat-associated protein n=1 Tax=Escherichia coli O157:H7 RepID=Q8X3B6_ECO57 Length = 50 Score = 43.9 bits (102), Expect = 0.011, Method: Composition-based stats. Identities = 25/38 (65%), Positives = 28/38 (73%) Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGLS 378 + I ND VFKAGL KMRKA MDRN+LAS +A GLS Sbjct: 13 LLISDNDNVFKAGLSCKMRKAVMDRNFLASGIAACGLS 50 >UniRef50_Q11MU1 Transposase, IS4 n=11 Tax=cellular organisms RepID=Q11MU1_MESSB Length = 447 Score = 43.6 bits (101), Expect = 0.014, Method: Composition-based stats. Identities = 40/256 (15%), Positives = 76/256 (29%), Gaps = 29/256 (11%) Query: 5 KLMEHISI-IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 KL E ++ I D R +V H L+DIL I A+ G E D++ F G Sbjct: 45 KLAEKLAAAIRDPRDPARVRHSLTDILRARIFAIACGYEDANDLDRL-RNDPAFKLACGR 103 Query: 64 FENG---IPVHDTIARVVSCIS---PAKFHECFIN-WMRDCHSSDDKDVIAID------- 109 + + T +R+ + + ++ W+ + + ID Sbjct: 104 LPDSGQDLCSQPTCSRLENLPDLRTVIRLGRVLVDLWLSSYPAPPKSVTLDIDDTLDVVH 163 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML------- 162 G ++ I + + I K+ I L L Sbjct: 164 GHQQLSLFNGHHDERCFLPIHIYDAATGRPVAMILRPGKTPSGKEIRGHLRRLARCIRAR 223 Query: 163 -DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHD 221 ++ D+ + ++ ++ DY+F + G K + + Sbjct: 224 WPDTRILVRGDSHYGRVEVMAWCEENAIDYVFGLAGN-----KVLKRLVDASADDIRTRR 278 Query: 222 SYAISEKSHGREEIRL 237 + G E R Sbjct: 279 ALEQKPVLRGYVETRY 294 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 42.8 bits (99), Expect = 0.019, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 64/205 (31%), Gaps = 19/205 (9%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 I+PD R +V+H L +L I A+ +G E D D G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLND-HD-GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCISPAKFHEC-------FINWMRDCHSSDDKDVIAID----GKTLRHSYDK 119 T+ R+ + FI + D A D G + Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT-AIPELLNMLDIKGK-----IITTDA 173 + F H LV ++ + AI LL + + D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGT 198 C+ + + ++ DY+ + Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARN 251 >UniRef50_A7BZU6 Transposase, IS4 n=2 Tax=Beggiatoa sp. PS RepID=A7BZU6_9GAMM Length = 270 Score = 42.8 bits (99), Expect = 0.021, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 58/188 (30%), Gaps = 27/188 (14%) Query: 24 HKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISP 83 + +L+ A G + + F +T L L + T SP Sbjct: 43 YDFFILLMYYFVA---GKQS---VGLFVKTELKLLPITLGLRQV--AYSTFNDAFERFSP 94 Query: 84 AKFHECF------INWMRDCHSSDDKDVIAIDG-------KTLRHSYDKSRRRGAIHVIS 130 F E F I + + S + IDG L Y +H+ Sbjct: 95 NLFQEVFKYILSTIPFKQISELSTLGVLYCIDGSLFPVINSMLWAEYTSKHCALKLHL-- 152 Query: 131 AFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGD 190 F +V+ + T +E A+ E+L G D ++ + ++ Sbjct: 153 CFELNRMIVVEFLVTAANGSERKALQEMLK----AGVTYIGDRGYMSFELCHLMMQKEAY 208 Query: 191 YLFAVKGT 198 ++F +K Sbjct: 209 FVFRLKRN 216 >UniRef50_A6FLE0 Transposase, IS4 n=2 Tax=Roseobacter sp. AzwK-3b RepID=A6FLE0_9RHOB Length = 136 Score = 42.4 bits (98), Expect = 0.025, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 30/72 (41%), Gaps = 2/72 (2%) Query: 13 IPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP--V 70 +PD R+ KV H L DI+ I + +G E D D + L D E G Sbjct: 41 LPDPREPGKVRHSLEDIIRFRIMMIAAGYEDGNDAGDLRDDPAFKLALERDPETGAALCS 100 Query: 71 HDTIARVVSCIS 82 TI+R+ + Sbjct: 101 QPTISRMENMAD 112 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 42.0 bits (97), Expect = 0.035, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 68/188 (36%), Gaps = 25/188 (13%) Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID-------- 109 L + + E +P H T +R + + + C +D+ + +D Sbjct: 91 LLRLLNVELPVPDHTTFSRRCANLVVSSLTRCTRR-----DGTDEPLHVIVDSTGMKIYE 145 Query: 110 ---GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 +H +R+ +H+ A + VI + TD+ +++++ +P+LL+M+D Sbjct: 146 AGQWLEEKHGAKSARKWLKLHL--AIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRPI 203 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS 226 D + ++ + R+ E + + + D ++ + Sbjct: 204 ACFMADGAYDSDQTYQALRSHSPGVSIII---PPRIRDLQEASYGPPD----QRDWHSRT 256 Query: 227 EKSHGREE 234 GR E Sbjct: 257 NAQRGRME 264 >UniRef50_B5JUE3 Transposase, IS4 family protein n=4 Tax=gamma proteobacterium HTCC5015 RepID=B5JUE3_9GAMM Length = 451 Score = 41.6 bits (96), Expect = 0.052, Method: Composition-based stats. Identities = 51/361 (14%), Positives = 112/361 (31%), Gaps = 60/361 (16%) Query: 18 QTWKVEHKLSDILLLTICAVISGAEGWED-IEDFGETHLDFLKQYGDFENGIPVHDTIAR 76 + + L+D + +T+ +I+GA + + + L + + +P T+ R Sbjct: 55 RGDNATYSLTDSVFMTLLGMIAGANSLLKVVAVWSDQVLRDVSGWL----SVPDDSTLGR 110 Query: 77 VVSCIS---PAKFHECFINWMRDCHSSDDK-----------DVIAID--GKTLRHSYDKS 120 + A+ E K + +D KT+ + + Sbjct: 111 IFRQGKQKHVAQLEEVNHRLRHRVWERCLKSGAMQPGDFYRSWVDVDSTVKTVFGRQEGA 170 Query: 121 --------RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGK----- 167 + + H + AFS+ ++ + I E + L ++ Sbjct: 171 AKGYNAAKKGALSYHPLVAFSSHTKEILQAWLRTGSAYTSNGIVEFMKQLAVQMPPRMRM 230 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISE 227 + D+ ++ + +++ G YL VK + A +K + ++N + E Sbjct: 231 LFRGDSGFFSGELMDWLEQGGHGYLIKVK---LKGLSALLDKQTWQAVSNCP--GWEQCE 285 Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS 287 +H + L E + EP Y+++ Sbjct: 286 FTHACGKWSRSRRFIAVRRLAKVKKEGPQQSLII--------------EPVYDYFCYVTT 331 Query: 288 ADLTAEKFAT-----AIRNHWHVENKLHWRLDVVMNED--DCKIRRGNAAELFSGIRHIA 340 LT + A W E K L + + D + +A ++ IR +A Sbjct: 332 ERLTPWQAHKTYGQRATCETWLDEAKNQMGLAHLKSHDFVASSLMFQSAVLAYNTIRWMA 391 Query: 341 I 341 + Sbjct: 392 L 392 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 41.2 bits (95), Expect = 0.066, Method: Composition-based stats. Identities = 8/39 (20%), Positives = 16/39 (41%) Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 D ED +IR NA + ++++ + + V Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVP 39 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 40.9 bits (94), Expect = 0.073, Method: Composition-based stats. Identities = 40/278 (14%), Positives = 88/278 (31%), Gaps = 25/278 (8%) Query: 51 GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMR---DCHSSDDKDVIA 107 G + L + + + + I P F + R +S D ++A Sbjct: 3 GNSLSKELYDWLGYSSETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENSLQDYRLLA 62 Query: 108 IDGKTLR------------HSYDKSRRRGAIHVISAFST-MHSLVIGQIKTDEKSNEITA 154 +DG LR + + S+ +H+ + + V +++ + NE A Sbjct: 63 VDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMNEHKA 122 Query: 155 IPELLNMLDIKGKIITT-DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 + +++ +I G +I D + Q++ Y+ K + + L Sbjct: 123 LVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSWYYIIRAKE-----SYGIISRLSLP 177 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK---KLCVAVSFRSII 270 + + + + +E + L I + +K + FR++ Sbjct: 178 DYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKFYDLHFRAVR 237 Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 TV +++ D EK W +E Sbjct: 238 FAIADGVYETVYTNLNAEDFPPEKLKQLYNLRWGIETS 275 Searching..................................................done Results from round 3 Score E Sequences producing significant alignments: (bits) Value Sequences used in model and found again: UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Prote... 427 e-118 UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroher... 405 e-111 UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancour... 393 e-108 UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacter... 391 e-107 UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales ... 390 e-107 UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria Re... 389 e-107 UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_... 387 e-106 UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobact... 377 e-103 UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter a... 371 e-101 UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides Rep... 365 2e-99 UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium... 361 3e-98 UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=V... 358 2e-97 UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocycl... 355 1e-96 UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatu... 349 8e-95 UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothec... 349 1e-94 UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsie... 346 7e-94 UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ... 346 1e-93 UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula mar... 343 5e-93 UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera s... 341 2e-92 UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing ma... 340 5e-92 UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C... 339 8e-92 UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasui... 338 2e-91 UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_C... 338 3e-91 UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatu... 335 2e-90 UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammapro... 329 1e-88 UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4... 328 2e-88 UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=V... 325 2e-87 UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE 325 2e-87 UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens Rep... 324 2e-87 UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexi... 324 3e-87 UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum c... 323 6e-87 UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus Re... 322 1e-86 UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID... 322 1e-86 UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=G... 322 1e-86 UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putre... 315 2e-84 UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepI... 314 3e-84 UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum c... 312 1e-83 UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadeca... 308 2e-82 UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaprot... 303 8e-81 UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3Q... 302 2e-80 UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5... 298 3e-79 UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria Rep... 297 3e-79 UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1... 297 6e-79 UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaprot... 293 9e-78 UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizo... 290 7e-77 UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingi... 282 1e-74 UniRef50_C3KML6 Putative transposase for insertion sequence NGRI... 271 4e-71 UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingiv... 270 5e-71 UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythro... 269 1e-70 UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_... 266 9e-70 UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomy... 266 1e-69 UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Ta... 263 6e-69 UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7... 263 7e-69 UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides ... 262 2e-68 UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales Re... 260 6e-68 UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragme... 254 3e-66 UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimon... 244 4e-63 UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostri... 243 1e-62 UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 Re... 242 2e-62 UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia R... 240 5e-62 UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus tor... 235 3e-60 UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=... 232 1e-59 UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridi... 232 1e-59 UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 230 5e-59 UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales ... 227 4e-58 UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium R... 227 5e-58 UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylo... 223 8e-57 UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_R... 221 3e-56 UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID... 217 7e-55 UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Strepto... 215 2e-54 UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC 211 2e-53 UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6K... 211 4e-53 UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC 206 1e-51 UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=B... 203 6e-51 UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscurig... 201 3e-50 UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothec... 200 9e-50 UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idi... 194 5e-48 UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacte... 181 3e-44 UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Ta... 174 5e-42 UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus... 173 1e-41 UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla... 172 2e-41 UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli C... 172 2e-41 UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscu... 169 1e-40 UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptosp... 168 2e-40 UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholde... 166 2e-39 UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaprot... 165 2e-39 UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 165 2e-39 UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_... 164 4e-39 UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospi... 158 3e-37 UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobac... 158 4e-37 UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria R... 157 8e-37 UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Trepone... 154 5e-36 UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfon... 153 1e-35 UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidith... 151 3e-35 UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcol... 150 7e-35 UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacte... 147 6e-34 UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflex... 146 9e-34 UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacte... 145 3e-33 UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacte... 143 7e-33 UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 141 3e-32 UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microco... 139 1e-31 UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7... 139 2e-31 UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyx... 138 4e-31 UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliob... 137 6e-31 UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A... 136 1e-30 UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396... 136 1e-30 UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=... 136 1e-30 UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=G... 134 6e-30 UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azolla... 134 6e-30 UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobac... 134 6e-30 UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaerom... 133 1e-29 UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae... 133 1e-29 UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp... 132 2e-29 UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus ... 130 1e-28 UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata ob... 127 8e-28 UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=B... 126 1e-27 UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia ... 126 2e-27 UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obsc... 123 2e-26 UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 ... 121 5e-26 UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae R... 119 1e-25 UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammap... 119 2e-25 UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewane... 119 2e-25 UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=B... 119 2e-25 UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus ... 116 1e-24 UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus... 114 5e-24 UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica... 113 8e-24 UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata ob... 113 9e-24 UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiral... 113 2e-23 UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=... 112 2e-23 UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mo... 112 3e-23 UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina... 109 1e-22 UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothe... 109 2e-22 UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legione... 106 1e-21 UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candida... 106 1e-21 UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris mari... 105 2e-21 UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata ob... 104 4e-21 UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis... 104 7e-21 UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legione... 103 1e-20 UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswald... 102 2e-20 UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX 102 3e-20 UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum fe... 101 4e-20 UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=... 101 6e-20 UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillace... 100 8e-20 UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewane... 99 1e-19 UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio ... 99 1e-19 UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfo... 99 1e-19 UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoa... 99 1e-19 UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 99 2e-19 UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis... 99 3e-19 UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Strepto... 98 4e-19 UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobac... 97 9e-19 UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobac... 96 2e-18 UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia ... 94 1e-17 UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteoba... 93 2e-17 UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica... 91 5e-17 UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II ... 91 7e-17 UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum ... 90 1e-16 UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewane... 90 2e-16 UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscurig... 90 2e-16 UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II ... 89 3e-16 UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-... 88 5e-16 UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus Re... 88 5e-16 UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 87 1e-15 UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematop... 85 4e-15 UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=... 85 4e-15 UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia... 85 4e-15 UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax... 84 9e-15 UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Vermine... 83 1e-14 UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synecho... 83 2e-14 UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodosp... 83 2e-14 UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferro... 83 2e-14 UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothe... 81 5e-14 UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburi... 81 7e-14 UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_... 81 9e-14 UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 81 9e-14 UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggr... 80 1e-13 UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S... 80 1e-13 UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus... 79 2e-13 UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiat... 78 4e-13 UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=... 76 3e-12 UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mo... 76 3e-12 UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodoco... 75 3e-12 UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Ta... 74 5e-12 UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinis... 74 8e-12 UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=... 74 9e-12 UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia... 73 1e-11 UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewane... 71 5e-11 UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus... 70 1e-10 UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylob... 69 2e-10 UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 3914... 69 2e-10 UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitroso... 67 1e-09 UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium m... 66 1e-09 UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitroco... 66 2e-09 UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia... 64 8e-09 UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=... 63 2e-08 UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia... 63 2e-08 UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=... 62 5e-08 UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD... 58 6e-07 UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synecho... 58 8e-07 UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria... 57 1e-06 UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylo... 57 1e-06 UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobi... 56 2e-06 UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia... 55 3e-06 UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Strepto... 55 4e-06 UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata ob... 55 4e-06 UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polarom... 52 3e-05 UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinis... 51 7e-05 UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 T... 51 1e-04 UniRef50_Q7MY60 Similarities with the N-terminal region of H rep... 50 1e-04 UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 Re... 49 2e-04 UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastop... 49 4e-04 UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiat... 48 0.001 UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitroso... 46 0.002 Sequences not found previously or not previously below threshold: UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanoba... 74 7e-12 UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltapr... 70 2e-10 UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkalip... 60 1e-07 UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM 59 2e-07 UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coproco... 59 3e-07 UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicute... 57 9e-07 UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostri... 55 3e-06 UniRef50_C7GHC1 Transposase, IS4 family (Fragment) n=6 Tax=Roseb... 51 8e-05 UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea f... 51 8e-05 UniRef50_B0JNZ6 Transposase n=20 Tax=Cyanobacteria RepID=B0JNZ6_... 49 2e-04 UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=... 48 4e-04 UniRef50_C7G6U9 Putative uncharacterized protein (Fragment) n=7 ... 48 6e-04 UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia R... 48 7e-04 UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepI... 47 0.001 UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteoba... 47 0.001 UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aro... 47 0.001 UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 R... 46 0.002 UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus... 46 0.002 UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID... 46 0.003 UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synecho... 46 0.003 UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmone... 45 0.004 UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus... 45 0.006 UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia ... 45 0.006 UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostri... 44 0.012 UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitroco... 43 0.017 UniRef50_A7JYJ5 Putative uncharacterized protein n=1 Tax=Vibrio ... 43 0.019 UniRef50_A7BZU0 Putative uncharacterized protein n=1 Tax=Beggiat... 42 0.033 UniRef50_A1TX01 Transposase, IS4 family protein n=5 Tax=Marinoba... 42 0.037 UniRef50_A1SU90 Putative uncharacterized protein n=2 Tax=Gammapr... 42 0.041 UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 42 0.043 UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphapr... 41 0.067 UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001... 41 0.073 UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=St... 41 0.087 >UniRef50_P28912 H repeat-associated protein yhhI n=208 Tax=Proteobacteria RepID=YHHI_ECOLI Length = 378 Score = 427 bits (1097), Expect = e-118, Method: Composition-based stats. Identities = 378/378 (100%), Positives = 378/378 (100%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ Sbjct: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS Sbjct: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI Sbjct: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV Sbjct: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR Sbjct: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK Sbjct: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 Query: 361 AAMDRNYLASVLAGSGLS 378 AAMDRNYLASVLAGSGLS Sbjct: 361 AAMDRNYLASVLAGSGLS 378 >UniRef50_B3QRV6 Transposase IS4 family protein n=1 Tax=Chloroherpeton thalassium ATCC 35110 RepID=B3QRV6_CHLT3 Length = 378 Score = 405 bits (1040), Expect = e-111, Method: Composition-based stats. Identities = 158/379 (41%), Positives = 222/379 (58%), Gaps = 10/379 (2%) Query: 2 ELKKLMEHISIIPDYRQTW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K E+ + D R+ H DIL++ +CA+ISGA + +IE FG + ++ + Sbjct: 5 AVKSFSEYFKSLKDPRRETLNKRHNFLDILIIAVCAMISGANNFVEIEQFGHSKKEWFQT 64 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 + NGIP HDT V++ +SP +F CF+ W + IAID KTLR S DK Sbjct: 65 FLALPNGIPSHDTFNNVLAKLSPDEFEACFMTWANSFRLFFSGEHIAIDCKTLRGSADKK 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + +H++SA++T +LVIGQIKT+E SNEITAIPELLN LD+KG +++ DAMGCQ +I Sbjct: 125 NGKSPLHLVSAWATETALVIGQIKTEENSNEITAIPELLNFLDLKGCLVSIDAMGCQTEI 184 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN---NPEHDSYAISEKSHGREEIRL 237 AEKI ++ DY+ A+KG Q +L+++ E F L N E D E S+GREEIR Sbjct: 185 AEKIVEKDADYVLALKGNQPKLHQSVIEYFKLAADNEGEGYEIDFAKTDETSYGREEIRC 244 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + +++I EWK +K + + S R KKE E +RYYISSA L+AE Sbjct: 245 AYATNEIEKIIAN-DEWKNIKTVAMIESQRI-----KKEKEFDIRYYISSAKLSAEDCLK 298 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +ENKLHW LDV ED+ +IR+ N AE + +R IA+N++ +K K G K Sbjct: 299 VVRKHWEIENKLHWTLDVAFREDESRIRQRNTAENMAILRRIALNLVKQEKTAKVGQATK 358 Query: 358 MRKAAMDRNYLASVLAGSG 376 A D YL +L G Sbjct: 359 RLMAGWDEKYLLKLLNGLA 377 >UniRef50_C6N6J0 Putative transposase n=2 Tax=Legionella drancourtii LLAP12 RepID=C6N6J0_9GAMM Length = 375 Score = 393 bits (1009), Expect = e-108, Method: Composition-based stats. Identities = 169/373 (45%), Positives = 235/373 (63%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E SII D RQ K++H+L DIL L + AVI GAEGW+DIE+ G L++L++ G F Sbjct: 6 SLVECFSIIRDPRQESKIDHELIDILELCVLAVICGAEGWQDIEEVGHARLNWLQERGFF 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + GIPV DTIAR++S ++P + CFI WM + D +IA+DGK++RHSYDK +R+ Sbjct: 66 KKGIPVDDTIARIISSLNPEELQRCFIKWMAAVEEATDGKIIAVDGKSIRHSYDKKKRKS 125 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 AIH++SA++ + +V+GQ KTD+KSNEI AIP LL++LDIKG I+T DAMGCQ+ IAEKI Sbjct: 126 AIHMVSAYAAENGVVLGQKKTDDKSNEIIAIPALLDLLDIKGCIVTIDAMGCQEKIAEKI 185 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKE---LNNPEHDSYAISEKSHGREEIRLHIVC 241 + GDY+ AVK Q +L++ + F HD + S K HGR E+R + + Sbjct: 186 VTKEGDYVLAVKDNQKQLHEEIIDFFETSRRFKFKRVRHDYFEESHKGHGRVELRRYWIS 245 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D+ L W L+ + + S R I + RY+I+S A+ FA A+R Sbjct: 246 DMLSTLG-NPERWASLQSIGMVESERYI----DGKTTAETRYFITSIAPDAKIFANAVRK 300 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDV EDD ++RR NA+E F RH+AIN L N+K K G++ K KA Sbjct: 301 HWAIENQLHWVLDVSFREDDSRVRRDNASENFGVFRHVAINALRNEKSCKKGIKAKRYKA 360 Query: 362 AMDRNYLASVLAG 374 + +Y VL G Sbjct: 361 TLQPDYAQKVLNG 373 >UniRef50_B7KLR5 Transposase ISAs1 family protein n=49 Tax=Bacteria RepID=B7KLR5_CYAP7 Length = 393 Score = 391 bits (1004), Expect = e-107, Method: Composition-based stats. Identities = 149/385 (38%), Positives = 225/385 (58%), Gaps = 20/385 (5%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 ++ + D R +HKL DI+ +TICAVI GA+ W DIE FG+ +LK++ + NG Sbjct: 11 DYFGELEDPRIERTKKHKLIDIITITICAVICGADSWIDIEVFGKCKYKWLKKFLELPNG 70 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIH 127 IP HDT RV S ++P + F+ W++ S +++AIDGKTLRHSYD+S+ + A+ Sbjct: 71 IPSHDTFGRVFSLLNPEHLQQIFLKWIQSISSFTQGEIVAIDGKTLRHSYDRSKDKPALQ 130 Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE---------------LLNMLDIKGKIITTD 172 +ISA++T + LV+GQ DEKSNEITAIP+ LL +L + G I+T D Sbjct: 131 MISAWATTNGLVLGQSIVDEKSNEITAIPDGQLTVRRATAHSCPHLLKVLSLSGCIVTLD 190 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE---HDSYAISEKS 229 A+GCQK+I ++I +Q DY+ +K QG L + E F ++N E Y + ++ Sbjct: 191 AIGCQKEIVKQITEQDADYVITLKKNQGGLYERVENLFKKALMSNFEGFIKSEYKVKDEG 250 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 HGR+E+R + + E ID ++W L + R + + + RY+ISS + Sbjct: 251 HGRQEVRYYQMLSNVAEEIDPDWQWLNLNSIGYVEYLR--VENGTDKTSLERRYFISSLN 308 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + FA+++R HW +EN+ HW LDV NEDD +IR+ NA + +RH+A+N+L +K Sbjct: 309 NNIKLFASSVREHWCIENQCHWILDVQFNEDDSRIRKDNAPANMAILRHLALNLLKQEKT 368 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K G++ K +KA D NYL VL Sbjct: 369 LKVGVKAKRKKAGWDENYLLKVLRN 393 >UniRef50_A1ST22 Transposase, IS4 family n=5 Tax=Alteromonadales RepID=A1ST22_PSYIN Length = 374 Score = 390 bits (1002), Expect = e-107, Method: Composition-based stats. Identities = 183/377 (48%), Positives = 249/377 (66%), Gaps = 4/377 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ +SII D RQ KV H L D+L L I AVISG EGWE+I+DFG LD+L++ Sbjct: 1 MSQITLINQLSIIRDTRQPRKVHHNLVDVLFLAITAVISGCEGWEEIQDFGNDKLDWLRK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y F GIP DTI+R+ I P +F +CF WM+ C DVIAIDGKTLR S++K Sbjct: 61 YLPFSGGIPTDDTISRIFQLIDPKEFQKCFATWMKSCCEMSHGDVIAIDGKTLRGSFNKK 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + IH++SAF+ +S+V+GQ+KT+ KSNEITAIP+LL++LD++G ++T DAMGCQ I Sbjct: 121 DKSDTIHMVSAFAAANSVVLGQVKTNAKSNEITAIPKLLDLLDVRGCLVTIDAMGCQTKI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A+KI +GGDYL VKG Q RL A + F ++ L PE ++Y EK HGRE+ R+ +V Sbjct: 181 AKKIVDKGGDYLLPVKGNQERLQTALDGIFSIRRLELPETEAYTTKEKGHGREDSRMCMV 240 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D E+ D FEW GLK L AVSFR E+ + + V++YISSA L A+ A R Sbjct: 241 ADAN-EIGDLVFEWPGLKTLGYAVSFR---TEKDMQTTVAVKFYISSAKLDAKSLLEASR 296 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VEN LHW+LD+ MNED C+IR+ N+ E + +RH ++N+L N+K F G++RK ++ Sbjct: 297 AHWTVENNLHWQLDISMNEDSCRIRQQNSTENLATVRHTSLNLLKNEKSFDGGIKRKHKQ 356 Query: 361 AAMDRNYLASVLAGSGL 377 A +Y V++G L Sbjct: 357 ANRSDSYRELVVSGLSL 373 >UniRef50_B0C8F4 Transposase, IS4 family n=7 Tax=Cyanobacteria RepID=B0C8F4_ACAM1 Length = 384 Score = 389 bits (1000), Expect = e-107, Method: Composition-based stats. Identities = 145/375 (38%), Positives = 218/375 (58%), Gaps = 7/375 (1%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++EH S + D R ++E+ L DI+++T+CAV+ GA+ W ++ ++G + +LKQ+ Sbjct: 5 PFASIIEHFSDLDDPRAAHRIEYSLEDIIIITLCAVLCGADNWVEVANYGRSKAQWLKQW 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 NG+P HDT V + + P + +CF+NW + + ++IAIDGKTLR + Sbjct: 65 IALPNGVPSHDTFEWVFARLKPQQLQQCFLNWTQAIYQLSAGELIAIDGKTLRGAIAPGE 124 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + IH++SA+++ + LV+GQ DEKSNEITAIPELL +L+++G +++ DAMGCQ IA Sbjct: 125 QCSLIHMVSAWASHNRLVLGQRTVDEKSNEITAIPELLKVLELEGALVSIDAMGCQTAIA 184 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLH 238 E I + GDY+ A+KG QG L + F + EHDSY EK HGR E R + Sbjct: 185 ETIIEGQGDYVLALKGNQGDLYNDVVQLFDHACQTQFQGIEHDSYQTVEKGHGRIEHRTY 244 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L+ W LK + S R + + RYY+ S + A++FA A Sbjct: 245 WTMGQTDYLLG-AERWAQLKSIGCVESCR---RQPGHPGTLQRRYYLLSIESDAQRFADA 300 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R+HW +EN+LHW LDV ED + +G +A+ S IRHIA N+L + K G++ K Sbjct: 301 VRSHWGIENQLHWILDVGFREDKLRACQGCSAQNLSVIRHIAANLLQQESTAKCGVKAKR 360 Query: 359 RKAAMDRNYLASVLA 373 KA D NYL +L+ Sbjct: 361 LKAGWDDNYLVKILS 375 >UniRef50_B1WZS9 Transposase n=10 Tax=Cyanobacteria RepID=B1WZS9_CYAA5 Length = 406 Score = 387 bits (993), Expect = e-106, Method: Composition-based stats. Identities = 137/373 (36%), Positives = 212/373 (56%), Gaps = 8/373 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ ++ I D R +H L D+L + I AVI+G++GWED+E++G ++L ++ + Sbjct: 30 NLLGYVKEIEDPRVQRSKKHLLKDVLAIAILAVIAGSQGWEDMENYGIAKQEWLSEFLEL 89 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GIP DT RV I P +C W++ +S ++I IDGKTLR SYD++ + Sbjct: 90 PHGIPSDDTFRRVFERIDPESLQKCLQKWVQSIMNSIQGEIIPIDGKTLRGSYDRNAGQC 149 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A++ ++A+++ SLV+GQ+K + SNEITAIP LL +LDI G IIT DAMG Q I ++I Sbjct: 150 ALYTVTAWASQQSLVLGQVKVENYSNEITAIPALLELLDITGSIITIDAMGTQTSIIQQI 209 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP---EHDSYAISEKSHGREEIRLHIVC 241 +Q DY+ +K L ++ F + N EHD Y K H R E R Sbjct: 210 CRQKADYIVTLKANHPTLFSQVKQWFTDTQNNGWDGIEHDYYKSVTKGHHRTEKRYVWAI 269 Query: 242 DVPDE-LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V + +W GL+ + V R + + +++Y++S A+ AIR Sbjct: 270 PVAAMGELYQQQQWHGLQTIVVVERIRHLWN----KTTHDIQFYLTSLPPNAQFLCHAIR 325 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +EN LHW LDV +ED C+IR + + F+ +R +A+N+L +K FK LR+KM++ Sbjct: 326 THWSIENNLHWTLDVTFSEDQCRIRSEYSPQNFALLRRLALNVLHQEKTFKRSLRQKMKQ 385 Query: 361 AAMDRNYLASVLA 373 AAM+ NY+ +VL Sbjct: 386 AAMNNNYMMTVLN 398 >UniRef50_Q12RN8 Transposase, IS4 family n=23 Tax=Gammaproteobacteria RepID=Q12RN8_SHEDO Length = 374 Score = 377 bits (968), Expect = e-103, Method: Composition-based stats. Identities = 159/372 (42%), Positives = 228/372 (61%), Gaps = 7/372 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +H S I D+RQ+ KV + L D+L ++CAVI+ + GW +I ++ H + K+ Sbjct: 1 MYIESFKQHFSAIDDHRQSAKVTYPLFDVLFGSLCAVIASSNGWFEIREYILGHHSWFKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 F +GIP DTIAR+VS I P F+ CF+ WM+ H + +VIAIDGKTLR SY++ Sbjct: 61 QKMFIDGIPADDTIARIVSMIDPDSFNACFLAWMKSVHQLTNGEVIAIDGKTLRGSYNRD 120 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 R IH+ISA+++ + LV+GQ+K ++KSNEITAIP LL MLD++G ++T DAMGCQ I Sbjct: 121 DRSSTIHMISAYASANKLVLGQLKAEQKSNEITAIPTLLKMLDLRGALVTIDAMGCQTAI 180 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A I +GGDYL AVK QG L KA + F D + EKSHGR E R V Sbjct: 181 ATTIIDKGGDYLLAVKNNQGNLAKAVNKAFSPHRSAGL-SDDHVNIEKSHGRIENRTCYV 239 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 DF W+ LK + + SFR++ + K + RYYISS L+AE+ +A R Sbjct: 240 LSSAALDGDF-THWEALKSIVMVESFRAV---KGKTASLEYRYYISSKVLSAEQALSATR 295 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW +E+ +HW LDV MNED+C+I + N AE + +RH+++N+L + K + K ++ Sbjct: 296 EHWGIES-MHWVLDVSMNEDECQIYKNNGAENLAYLRHMSLNMLQKEPT-KLSIVGKRKR 353 Query: 361 AAMDRNYLASVL 372 M+ +L VL Sbjct: 354 CLMNPAFLEKVL 365 >UniRef50_B5K361 Transposase, is4 family n=8 Tax=Octadecabacter antarcticus 238 RepID=B5K361_9RHOB Length = 389 Score = 371 bits (951), Expect = e-101, Method: Composition-based stats. Identities = 155/371 (41%), Positives = 215/371 (57%), Gaps = 10/371 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 + H S I D RQ KV + L +ILLLT+CAV+SGA W I +G L FLK++ F Sbjct: 25 FLTHFSEISDSRQEVKVTYPLPEILLLTLCAVLSGANDWTAISIYGTKKLGFLKRFLPFA 84 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +G P HD + + + + F CFI+W+ + + V+AIDGKT R S DK+ + A Sbjct: 85 DGTPSHDQLGNIFAALDAEAFQACFIDWVASLNKTVTG-VVAIDGKTSRRSLDKAGGKAA 143 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+ISA+S+ +L + Q + D KSNEITAIPELL +L +KG I+T DAMGCQ++IA KI Sbjct: 144 IHMISAWSSEWNLTLAQRQVDGKSNEITAIPELLELLTLKGAIVTIDAMGCQREIAAKII 203 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLK---ELNNPEHDSYAISEKSHGREEIRLHIVCD 242 + DY+ A+KG QG L K E + + ++ + EKSHGR E R VC Sbjct: 204 SKEADYILALKGNQGSLRKDTELFMTEQAAVDYDDTTVTYHETVEKSHGRIETRRVTVCT 263 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 D L W GLK + + A + + RYYISS AE A AIR+H Sbjct: 264 DIDWL-KADHNWPGLKSIVMVQY----HAILQDKTRAETRYYISSMTSDAEHHAKAIRDH 318 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN LHW +D+V +D+C+IR GNA F+ I+H+A N+L + K K LR K A+ Sbjct: 319 WGIENGLHWVMDMVFRDDECRIRNGNAPANFTTIKHVASNMLRSVKG-KHSLRSKRHIAS 377 Query: 363 MDRNYLASVLA 373 D ++LA ++ Sbjct: 378 WDDDFLAEIIN 388 >UniRef50_D0TYT5 Transposase (IS4 family) n=3 Tax=Bacteroides RepID=D0TYT5_9BACE Length = 383 Score = 365 bits (936), Expect = 2e-99, Method: Composition-based stats. Identities = 135/384 (35%), Positives = 196/384 (51%), Gaps = 15/384 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPD 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + ++SA+S ++ + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 TGKEGFKLWMVSAWSAVNGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGR---LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 DI + I ++ +Y+ A+K + + L K + + ++ + HGR E Sbjct: 183 DITQTIIERDANYIIAIKENKKKNYQLAKQIIDDYQDRDEIINRVTRHVSENTGHGRVEK 242 Query: 236 RLHIVCDV-PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AE 293 R V F + GLK + S R+I+A E VRYY++S D T E Sbjct: 243 RTCTVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A+AIR HW +EN LHW+LDV ED K + NAA FS +A+ IL DK K Sbjct: 301 EIASAIRQHWSIENNLHWQLDVTFREDYSK-KVKNAARNFSVATKMALTILKKDKTTKGS 359 Query: 354 LRRKMRKAAMDRNYLASVLAGSGL 377 + K KA D YL+ +L + Sbjct: 360 MNLKRLKAGWDEKYLSQLLQNNNF 383 >UniRef50_B9XEG9 Transposase IS4 family protein n=1 Tax=bacterium Ellin514 RepID=B9XEG9_9BACT Length = 381 Score = 361 bits (925), Expect = 3e-98, Method: Composition-based stats. Identities = 134/380 (35%), Positives = 217/380 (57%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L+EH I D R + +H+L D+L++ +C ++ G E + D+EDFG+ + K + Sbjct: 7 QNLIEHFKEIRDPRVKGRCDHELVDVLMIGLCCLLCGGETFNDMEDFGKAKRKWFKTFLR 66 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 +GIP HDT RV + + P F +CF+ W + ++ +++A+DGK LR + ++ + Sbjct: 67 LRHGIPKHDTFNRVFAALKPEAFLDCFMRWTQSVRATVADEIVALDGKALRRALEQ--GQ 124 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++SA++ +SLV+GQI+ +K+NEITA+P+LL +L++ G I+T DAMGCQK+IA + Sbjct: 125 SPRVIVSAWAAGNSLVLGQIQVPDKTNEITAVPQLLRVLELSGCIVTLDAMGCQKEIARE 184 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLK----------ELNNPEHDSYAISEKSHGRE 233 I + +Y+ A+KG QG+ ++ + E N +EK HGR Sbjct: 185 IVEADANYVLALKGNQGQCHQEIKAYLEDAVARHDQERPVEKNAVALAYKETTEKDHGRL 244 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + L D +W GL+ + V S R + + P + RYY+SS ++ E Sbjct: 245 ETRRYWQSGDVSWLAD-RQQWAGLRSVGVVESVRQVGQQA---PTVERRYYLSSLNVDVE 300 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 KFA A+R HW VEN LHW LDV ED + R G+AAE + +R +A+N+L + K G Sbjct: 301 KFARAVRGHWSVENSLHWVLDVQCGEDRNRARSGHAAENLATLRRLALNLLKRESTKKRG 360 Query: 354 LRRKMRKAAMDRNYLASVLA 373 ++ K A+ D +YL +L+ Sbjct: 361 IKGKQLNASWDHDYLLRLLS 380 >UniRef50_UPI0001745ADF transposase, is4 family protein n=2 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001745ADF Length = 392 Score = 358 bits (918), Expect = 2e-97, Method: Composition-based stats. Identities = 126/380 (33%), Positives = 196/380 (51%), Gaps = 16/380 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L E I D+R H L+DIL++ CA++ G + +E FG +L+ + Sbjct: 14 SNLREVFQSIDDWRVQRTQRHDLADILVIATCAMLCGQGHYTHMEAFGNLKRTWLESFLA 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD--------KDVIAIDGKTLRH 115 NGIP HDT +V S + P +F E F W + K VIAIDGK LR Sbjct: 74 LPNGIPSHDTFRKVFSLLDPKRFMEAFSLWTQGVLRQLSSEGLESGLKGVIAIDGKALRG 133 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + DK + ++ A+++ SL +GQ+K +KSNEI A+PELL ML +KG I+T DAMG Sbjct: 134 AVDK--GQAPAVIVGAWASELSLCLGQVKVADKSNEIGAMPELLEMLALKGCIVTIDAMG 191 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE-LNNPEHDSYAISEKSHGREE 234 CQ+++A KI +Q GDY+ A+K Q L++ E L E + + HGR E Sbjct: 192 CQREVARKIIQQKGDYILALKSNQESLHQQVSHYLDTGEDLARAEGNFHQEESDGHGRHE 251 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 +R V + + + +W GL+ + R++ + + RY+ISS A Sbjct: 252 VRRCWVSEEVECWLQGAEKWAGLRSVAAVECERTV----AGQTTVQRRYFISSLKADAAL 307 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN-DKVFKAG 353 A ++R HW +EN LHW LDV ED+ + RRG +AE + +R + ++ + K Sbjct: 308 IAASVRAHWGIENSLHWVLDVTFGEDESRSRRGYSAENLATLRRLTHAMIKRENPNSKKS 367 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + ++ +A + +YL ++L Sbjct: 368 VNQRRFEAGLSTDYLQTLLG 387 >UniRef50_C4KA79 Transposase IS4 family protein n=4 Tax=Rhodocyclaceae RepID=C4KA79_THASP Length = 381 Score = 355 bits (911), Expect = 1e-96, Method: Composition-based stats. Identities = 128/372 (34%), Positives = 198/372 (53%), Gaps = 7/372 (1%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L ++E + D+R + H+LS++L + +CAV+SGA+ +E+I +G + +L+ + Sbjct: 6 LADMVEVFEGLEDWRNAQQTRHRLSELLTVAVCAVLSGADDFEEISQWGRAKVPWLRGFL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKSR 121 + G+ DT RV + + P +F + F W+ + KD VIAIDGK+ R + K+ Sbjct: 66 RLDYGVASPDTFERVFALLDPKQFEQAFRTWVGGIIPAVGKDQVIAIDGKSSRRTTSKAA 125 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 +H++SAF+ +V+GQ T EKSNEITAIPELL +LDI+G I+T DAMG Q IA Sbjct: 126 AA-PLHLVSAFAANVGVVLGQTATAEKSNEITAIPELLKVLDIEGCIVTIDAMGTQTKIA 184 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 I+++G Y+ VK +L + ++ + HGR E+R Sbjct: 185 RAIRERGAHYVLCVKDNHPKLLDSIMFADIDPRGPLTPSSTHETTSTGHGRIEVRRCTAF 244 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 D D L WK + V R++ + YYISS AE+ A AIR+ Sbjct: 245 DATDRLHK-AEAWKDVASFAVVERVRTV----GERTSTERVYYISSLPADAERIAVAIRS 299 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW VEN+LHW LDV +D + R G+ A + +RH+A+N++ DK K ++ K A Sbjct: 300 HWEVENRLHWCLDVQFGDDYARGRIGHIAHNLALVRHMALNLIRLDKSIKTSIKTKRLLA 359 Query: 362 AMDRNYLASVLA 373 A + A++L Sbjct: 360 ATSDEFRAALLG 371 >UniRef50_C7RT35 Transposase IS4 family protein n=2 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RT35_9PROT Length = 379 Score = 349 bits (896), Expect = 8e-95, Method: Composition-based stats. Identities = 140/377 (37%), Positives = 199/377 (52%), Gaps = 12/377 (3%) Query: 4 KKLMEHISIIPDYRQTW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 LM + D R+ H ++L++ I AV+S + EDI +G D+L+Q+ Sbjct: 7 ASLMCCFREVDDPRKRSNGTLHDFQEVLVIAIAAVLSDCDTIEDIALWGREKEDWLRQFL 66 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 NG+ +T R+ + P +F F W+ + + +DGKT+R S S Sbjct: 67 VLLNGVASEETFLRIFRALDPKQFEAAFRRWVAGVVGTLTGG-LGVDGKTVRGS--GSGG 123 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 AIH++SAF+T +V+GQ K KSNEITAIPELL L I G ++T DAMGCQK+IA Sbjct: 124 ESAIHMVSAFATELGVVLGQEKVASKSNEITAIPELLKALYINGLLLTIDAMGCQKNIAR 183 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 +I QGGDYL AVKG Q L A E +F + + + D + SHGR ++ V Sbjct: 184 QITDQGGDYLLAVKGNQPTLLDAIETEFID-QYQSDDVDRHRQVHPSHGRIVAQIASVLP 242 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 E I +W KK+ S R + E ++ RYYISS +LTAE+ A A+R H Sbjct: 243 A--EGIVDLADWPECKKIARVDSLRKV---GNHESKLERRYYISSRELTAEQLAAAVRAH 297 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAGLRRKMRK 360 W +EN+LHW LDV ED IR+GNA + S ++ I +N++ D K LR K + Sbjct: 298 WGIENRLHWVLDVSFGEDASTIRKGNAPQNLSLLKKIVLNLIRLDTADKTKTSLRLKRKC 357 Query: 361 AAMDRNYLASVLAGSGL 377 AA + +L + L Sbjct: 358 AAWTDDVRMRILGFTSL 374 >UniRef50_B8HPB8 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 7425 RepID=B8HPB8_CYAP4 Length = 388 Score = 349 bits (895), Expect = 1e-94, Method: Composition-based stats. Identities = 134/369 (36%), Positives = 194/369 (52%), Gaps = 9/369 (2%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++ I D R H+L DI+ + + AV++GA+ W IE +G+ +L+ + Sbjct: 14 LEQYFGEITDPRVERTRAHQLLDIIAIALFAVLAGADSWVGIETYGQAKRAWLETFLALP 73 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + + P F +W++ S+ VIAIDGKT + SYD+ A Sbjct: 74 NGIPSHDTFARVFARLDPEALETRFQHWVKGVVSTLGAQVIAIDGKTAKGSYDREGGVKA 133 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + ++SA+++ H LV+GQ D KSNEITAIP LL L + G I++ DAMG + IA +I Sbjct: 134 LQLVSAWASEHRLVLGQCAVDTKSNEITAIPVLLEQLALAGCIVSIDAMGTHQTIAAQIH 193 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPL---KELNNPEHDSYAISEKSHGREEIRLHIVCD 242 KQ DY+ A+KG Q L K ++ F E+ + E +H R E R Sbjct: 194 KQQADYILALKGNQPTLFKQAQQWFEEFQTGSNAGAEYTQHQQCETAHHRIESRRVFQVP 253 Query: 243 VPDELIDFT-FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 V +W GL+ L V S R + + RY++SS A FA IR Sbjct: 254 VEQVFTPKQGRDWAGLRSLVVIQSQRCLWNKD----TTETRYFLSSLSTDAATFAHYIRA 309 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV NED +IR+ +A FS +R + +N+L D K L K +A Sbjct: 310 HWGIENQLHWCLDVVFNEDKSRIRKDHAPRNFSLLRRLTLNLLHRDSS-KGSLVMKRYRA 368 Query: 362 AMDRNYLAS 370 +D ++ Sbjct: 369 GLDDQFMMQ 377 >UniRef50_Q9L3G3 Putative uncharacterized protein n=1 Tax=Klebsiella pneumoniae RepID=Q9L3G3_KLEPN Length = 375 Score = 346 bits (888), Expect = 7e-94, Method: Composition-based stats. Identities = 131/378 (34%), Positives = 197/378 (52%), Gaps = 12/378 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M K L++++ IPD R K H LS+++ + ICA++ G + W +I F + + ++ Sbjct: 1 MAQKSLLDYLESIPDPRNQAKCSHILSEVVFMAICAMMCGFDTWSEITLFAQEREPWFRR 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-DVIAIDGKTLRHSYDK 119 + GIP HDT R+ + + PA F W+ D D +A+DGK LR + K Sbjct: 61 WLSLPGGIPSHDTFNRIFATLPPASLKSIFQAWIGDIMGDDKLVGQLAVDGKALR-ATAK 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A+H+++ +ST + +GQ K +KSNEITAIPELL +L++KG +++ DAMG Q Sbjct: 120 GRGANAVHMVNVWSTELGMCVGQQKVADKSNEITAIPELLQLLELKGCLVSIDAMGTQVK 179 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK-ELNNPEHDSYAISEK---SHGREEI 235 IA+ I K+ GDYL AVK Q LN +E+F + N + + +E+ HGR+E Sbjct: 180 IADTILKKSGDYLLAVKDNQPSLNTEVKEQFQAYWDKNETDVSGHGFTEQFDSEHGRKEH 239 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V + DE + +WK K + + R + + VR+YISS L A Sbjct: 240 RRCWVL-MVDESMPVCQQWKA-KTIIAVQAERI----ENGKGYDFVRFYISSRALDATSA 293 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A R HW VEN LHW LD+ ED + R G A E + IR +N+L +K + Sbjct: 294 LKATRAHWSVENLLHWTLDIAFGEDQLQARAGFAGENLAVIRQWILNMLKQNKSRNLSMA 353 Query: 356 RKMRKAAMDRNYLASVLA 373 K R ++ YL + Sbjct: 354 NKRRLCCLNEQYLFECMG 371 >UniRef50_B4RYK8 Transposase n=1 Tax=Alteromonas macleodii 'Deep ecotype' RepID=B4RYK8_ALTMD Length = 367 Score = 346 bits (886), Expect = 1e-93, Method: Composition-based stats. Identities = 130/373 (34%), Positives = 206/373 (55%), Gaps = 10/373 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 + H + D R +H L D++ LT+ A++SGAEGW+DI+ FG++ LD+L+++ F Sbjct: 2 SFITHFESLEDKRSHINKKHDLLDVIFLTVVAILSGAEGWKDIKQFGDSKLDWLRKFRAF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 + G+PV DTIAR++S + P FI+W+ + + VIA DGKTLRHS+D R+ Sbjct: 62 KEGVPVDDTIARIISSLEPNALLTSFISWVNEIREDQGQPVIAFDGKTLRHSFDGD-RKT 120 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 A+H +SA++ LV+ Q K+ K NE++ + EL+ +L++KG I+T DAM C K +A+ I Sbjct: 121 ALHSVSAYAVEQGLVLAQCKSKGKKNEVSTVLELIELLELKGSIVTADAMHCLKKVAKAI 180 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE---HDSYAISEKSHGREEIRLHIVC 241 +GGDY+ VK QG+L F + P+ +S ++ HGR E R ++ Sbjct: 181 NAKGGDYVLQVKNNQGKLATEIAAYFHKTRRDTPQDIKTNSLTLTNDGHGRIEERTYVQL 240 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 + L + W +K + R + K + YYISS ++ + A AIR+ Sbjct: 241 PITPWLTQ-SQGWTNIKPVIEVTRKRYL----KDKETSETAYYISSLEVNLPQIAKAIRS 295 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN HW LD+ EDD +IRRG+A E + R A+N+ K ++ K+++A Sbjct: 296 HWSIENASHWVLDMTYREDDSRIRRGDAPENMATFRRFAMNLARLSP-IKDSMKGKLKQA 354 Query: 362 AMDRNYLASVLAG 374 A +L Sbjct: 355 AWSDEVREKLLFA 367 >UniRef50_A3ZYT1 Putative transposase n=2 Tax=Blastopirellula marina DSM 3645 RepID=A3ZYT1_9PLAN Length = 386 Score = 343 bits (880), Expect = 5e-93, Method: Composition-based stats. Identities = 126/380 (33%), Positives = 217/380 (57%), Gaps = 14/380 (3%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ ++E+ + + D R+ +H L D+L++ + AVI+GA+G I + E H+++LK Sbjct: 9 DVVSILEYFAELDDPRRHINRKHLLGDLLVICVPAVIAGADGPRSIAIWAEAHVEWLKSR 68 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS-----DDKDVIAIDGKTLRHS 116 + +G+P HDTI R+++ + P F +CF W+ + D +++IAIDGKTLR S Sbjct: 69 LELPSGVPSHDTIGRLLAQLKPTAFQQCFDEWLTAMRQAATTDDDAREIIAIDGKTLRRS 128 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ + G + + SA++ + +GQ+ +KSNEI PEL+ +D++ I+T DA GC Sbjct: 129 HDRGKGLGPLCLGSAWAVRAGVSLGQMAAADKSNEIVVFPELIEQIDVRKAIVTLDAAGC 188 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK---ELNNPEHDSYAISEKSHGRE 233 Q+D+AEKI GDY+ A+K Q RL++ + + + + + + K HGR Sbjct: 189 QRDVAEKIIAGKGDYVLALKANQERLHEQVRDYITTQLENDFAGVKVERHEEEAKGHGRL 248 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 + R + +PDE + +W+GLK + VA+ +++ RYYISS A+ Sbjct: 249 DKRFYYQVKLPDE-VPAGEDWRGLKTIGVAIRI----SQENGRETCDTRYYISSLKPDAK 303 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 +FA A+R HW +EN LHW LDV ED+ ++R AAE + ++ +A++++ K K Sbjct: 304 QFAAAVRGHWGIENSLHWTLDVTFREDESRLRNRIAAENLAWLKRLAVSLIKQHKS-KES 362 Query: 354 LRRKMRKAAMDRNYLASVLA 373 + + R A + N+LA +L Sbjct: 363 VVMRRRMAGWNVNFLAEILG 382 >UniRef50_C4KCI0 Transposase IS4 family protein n=3 Tax=Thauera sp. MZ1T RepID=C4KCI0_THASP Length = 372 Score = 341 bits (875), Expect = 2e-92, Method: Composition-based stats. Identities = 141/375 (37%), Positives = 198/375 (52%), Gaps = 16/375 (4%) Query: 7 MEHISIIPDYRQTW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 M + I D R+ H +IL++ I AV+S + EDI + T +L+++ + Sbjct: 1 MSCLDEIDDPRKPSNGTLHDFREILVILIAAVLSDCDTVEDITFWARTKQAWLRRFLVLK 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-----IAIDGKTLRHSYDKS 120 NGIP +T R++ + P +F F W+ + D IAIDGKT+R S S Sbjct: 61 NGIPSEETFLRILRALDPKQFENMFRRWVGGVVGALSDDAGLAGTIAIDGKTVRGS--GS 118 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 AIH++SAF+T LV+GQ K KSNEITAIPELL L IKG ++T DAMGCQK I Sbjct: 119 GGESAIHMVSAFATELGLVLGQEKVAAKSNEITAIPELLEALSIKGLLVTIDAMGCQKSI 178 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A++I + GDYL VKG Q +L +A E F + D + E+ HGR ++ V Sbjct: 179 AKQIVAKKGDYLLMVKGNQPKLLEAIETAFID-QHGVESVDRSSRVERGHGRTVGQIASV 237 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 I +W + S R + K+ ++ RYYISS L+AE+ A A+R Sbjct: 238 LSAKG--IVDPADWPKCVTIGRIDSMRVV---GDKQSDLERRYYISSRALSAEQLAAAVR 292 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKM 358 HW VEN+LHW LDV +ED + + NA + S +R IA+ I+ DK K+ LR K Sbjct: 293 AHWGVENRLHWILDVSFSEDASTVAKDNAPQNLSLLRKIALTIIRADKTDTRKSSLRLKR 352 Query: 359 RKAAMDRNYLASVLA 373 + AA D +L Sbjct: 353 KGAAWDDGVRERMLG 367 >UniRef50_C4RAC6 Transposase, IS4 n=1 Tax=magnetite-containing magnetic vibrio RepID=C4RAC6_9PROT Length = 348 Score = 340 bits (871), Expect = 5e-92, Method: Composition-based stats. Identities = 128/351 (36%), Positives = 186/351 (52%), Gaps = 5/351 (1%) Query: 22 VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCI 81 V + L+++LL T+ +I A +++IE G LD+L+Q+ FE+G+P T ++ + Sbjct: 2 VTYPLNEVLLTTLVGLICRAADFDEIELTGLEQLDWLRQFLPFEHGVPQAQTFRKIFRLL 61 Query: 82 SPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 P F W+ V AIDGKTLR S + GA+H++SA++ LVIG Sbjct: 62 DPKYLETAFSAWVESLRVHVGGGV-AIDGKTLRGSKQAAGGDGALHLVSAYAHEAGLVIG 120 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGR 201 Q + KSNEITAIPELL+ L + G I+T DAMG QK IA K+ +G DY+ A+KG QG Sbjct: 121 QRAVNAKSNEITAIPELLDALVLNGAIVTIDAMGTQKLIAAKLIDKGADYVLALKGNQGT 180 Query: 202 LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L+ + F +L HGR E R V D L + W GL + Sbjct: 181 LHDDVRDFFADPDLLRECARHDDTCI-GHGRIEERTCQVADASAWLTEQHSGWAGLASIA 239 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 ++ R ++ E R YISS + A R+HW VEN LHW+LDV ED+ Sbjct: 240 AVIATR--TDKKSGEVSSETRLYISSLPPDPKTILNACRSHWSVENNLHWQLDVTFREDE 297 Query: 322 CKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 C+ R+ +A + IRH A N+L + K ++RK KAAM++ + +V+ Sbjct: 298 CRTRKDHAPLSLAIIRHAAFNMLKREPS-KMSIKRKRLKAAMNQAFRKTVI 347 >UniRef50_C5J502 Transposase n=2 Tax=Bacteroides fragilis RepID=C5J502_BACFR Length = 373 Score = 339 bits (870), Expect = 8e-92, Method: Composition-based stats. Identities = 132/379 (34%), Positives = 183/379 (48%), Gaps = 16/379 (4%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M ++ +IIPD R ++ ++I+ + + AVI GA+ W +IE FG+TH + K Sbjct: 1 MTIQAFS---AIIPDGRDDKNKIYQSNEIVFIALVAVICGADTWNEIETFGKTHESYFKA 57 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 IP HDT++R S + F ECF W+ D V+AIDGK + + DKS Sbjct: 58 RLPGLVSIPSHDTLSRFFSILDIDWFEECFRLWVDDICRRIPG-VVAIDGKAICDNPDKS 116 Query: 121 RR-----RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 R ++++SA+S + + +GQ K +EKSNE AIPEL+ LD++ IIT DA+G Sbjct: 117 SNSKNGVRSKLYMVSAWSVSNGICLGQRKVEEKSNETKAIPELIKTLDLENCIITIDAIG 176 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF-PLKELNNPEHDSYAISEKSHGREE 234 CQK I + I + DY+ K L E Y K HGR E Sbjct: 177 CQKSITKLIIENKADYILCAKDNHEALRNIIEFNLSEESRYYLCHAKRYFEENKGHGRSE 236 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R VC L F W G+K L + S R + KE M RYYISS + Sbjct: 237 YREC-VCISAKNLQYFLKGWTGIKTLAMINSIRKM---GDKEAVMETRYYISSLEPDPII 292 Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 +IR HW VEN LHW LD+ EDD + + GNAA FS I +A+ +L K G+ Sbjct: 293 ILKSIRPHWEVENNLHWVLDIGFREDDDR-KIGNAAINFSAIPKLALMLLKQ-SDIKLGM 350 Query: 355 RRKMRKAAMDRNYLASVLA 373 K + D V+ Sbjct: 351 AGKRKACGWDEKIRDKVIG 369 >UniRef50_B8F572 ISAma3, family ISAs1 n=2 Tax=Haemophilus parasuis SH0165 RepID=B8F572_HAEPS Length = 365 Score = 338 bits (866), Expect = 2e-91, Method: Composition-based stats. Identities = 132/375 (35%), Positives = 199/375 (53%), Gaps = 12/375 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S D R + +H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 Y FE GIPV DTIARV+ I P F+E F+N++ + + ++VIAIDGKTLRHS++ Sbjct: 57 YRPFECGIPVDDTIARVIKRIEPQAFNEVFLNFINEIRTQQGREVIAIDGKTLRHSFNPE 116 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + A+H ++ +S L++ Q K+ K NE A+ E+++ +K +IT DAM QK I Sbjct: 117 T-QSALHSVTVWSQSRGLILSQKKSSGKQNEQQAVMEIIDSFCLKNAVITVDAMNTQKKI 175 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEIRLHI 239 AEKI ++ GDY+ +K + E F + PE ++Y R + R + Sbjct: 176 AEKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETYEEVNAERSRIDERYYR 235 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V D L EWKG+K + RS + +YISS D+ + A + Sbjct: 236 KLKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDIQILAKCV 290 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 291 RGHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLT 349 Query: 360 KAAMDRNYLASVLAG 374 A + +L G Sbjct: 350 AAGWSDEFRDELLLG 364 >UniRef50_B6J4J2 Transposase n=8 Tax=Legionellales RepID=B6J4J2_COXB1 Length = 383 Score = 338 bits (866), Expect = 3e-91, Method: Composition-based stats. Identities = 123/374 (32%), Positives = 192/374 (51%), Gaps = 9/374 (2%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L I D R + + L +ILL+T+CA+I G + W+ I DFG+ +L Q+ + Sbjct: 14 QYLFHCFLSIKDPRVPGRCIYPLINILLITLCALICGVDTWKGIADFGKKRYRWLSQFVN 73 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 G+P T ARV S I P +F C WM D+I +DGK+L S + + + Sbjct: 74 MRCGVPSTLTFARVFSLIEPEQFQHCLSAWMSQFFQLLRFDMIHLDGKSLCGSARRGKAQ 133 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 A H+++A+ + +G+++ +KSNEI AIP LLN L+++G II+ DAMG QK IA Sbjct: 134 KATHIVNAYLPKEQVTLGEVRVPDKSNEIKAIPILLNSLNVQGCIISIDAMGTQKGIANL 193 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEK---SHGREEIRLHIV 240 I+ + DY+ A+K R + E F + + + Y E HGR E R + V Sbjct: 194 IRLKQADYVLALKKNHTRFYRYVERLFSCSDERDYQGMCYRTEETKDYGHGRIEERSYCV 253 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD-LTAEKFATAI 299 + + W+ L+ + S R + E E RYYI+S + + AI Sbjct: 254 LPM-MYFHKYKKYWRDLQAIVRVQSKRH----KGNEIETATRYYITSLPFAEHRRMSQAI 308 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW +EN+LHW+LD+ + ED I RG A + + +R + + +L N+ K G+ K Sbjct: 309 RQHWAIENQLHWKLDIGLGEDASLITRGYADQNLATLRKMVLKMLENENSSKQGIAGKRI 368 Query: 360 KAAMDRNYLASVLA 373 +AA+ YL V+ Sbjct: 369 QAALSTRYLRKVVG 382 >UniRef50_C7RKT8 Transposase IS4 family protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKT8_9PROT Length = 386 Score = 335 bits (858), Expect = 2e-90, Method: Composition-based stats. Identities = 126/375 (33%), Positives = 193/375 (51%), Gaps = 13/375 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG--D 63 L+E S +PD R+ + L+ IL++ +CA++ GA+ W ++ D+ + ++L + Sbjct: 3 LLEAFSGLPDGRKGPAQRYSLAQILIMAVCAILCGADNWVEVADWCKDRKEWLSERFRWP 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 E G P HDT + + F F +W+R+ D V+AIDGKTLR S K Sbjct: 63 LEGGTPSHDTFGDLFRVLDAHIFETRFRDWIRELAGVIDG-VVAIDGKTLRGSGKKGS-N 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H+++A++ L + Q T K +E+ + LL++L +KG I+T DA+GCQ ++AEK Sbjct: 121 ELLHMVTAYAVQSGLCLAQEGTCGKGHELAGMKALLDVLVLKGCIVTMDALGCQTELAEK 180 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNP---EHDSYAISEKSHGREEIRLH-I 239 I +GGDY+ VK Q L +A E F + + +EK HGR E R + Sbjct: 181 IVARGGDYVLQVKDNQKNLAEALREFFDTGAAAGFGRLTVEHFEETEKDHGRIETRRYTW 240 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL-TAEKFATA 298 + DV WK L + + S R I + + RY I S + T E FA A Sbjct: 241 INDVTWMDRPMRAAWKKLGGVGMIESIRQI----GDKVSVDQRYAIGSCGVQTVEMFAKA 296 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 R+HW +EN LHW LDVV ED C+ R GN+A S +R + L ++ K GL R+ Sbjct: 297 SRSHWGIENGLHWTLDVVFREDQCRARLGNSARNLSVLRKFVLTTLRKEEGCKMGLNRRR 356 Query: 359 RKAAMDRNYLASVLA 373 A + +Y S++A Sbjct: 357 LHADRNESYRESLIA 371 >UniRef50_A6WIU3 Transposase IS4 family protein n=13 Tax=Gammaproteobacteria RepID=A6WIU3_SHEB8 Length = 369 Score = 329 bits (842), Expect = 1e-88, Method: Composition-based stats. Identities = 117/370 (31%), Positives = 187/370 (50%), Gaps = 7/370 (1%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L +H+S++ D R H L D+L L + AV SG +GW +I+ FGE L++L+++ F Sbjct: 2 SLFDHLSLVEDTRSHINQRHNLVDVLFLILSAVASGQDGWAEIQQFGELKLEWLRKFRPF 61 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 NGIP TIAR++ + P C +W+ D ++ K +IAIDGKTLR + Sbjct: 62 ANGIPRRHTIARILKAVGPENLQLCLFSWINDIRTASAKPIIAIDGKTLRGASKLGC--N 119 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 +H + AF + L + Q K EI + L+ ML+I +IT DA+ Q+ E I Sbjct: 120 TLHSVGAFDINNGLALYQEMASGKGKEIETVQSLITMLNINKALITMDALHAQRATLEAI 179 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 + GDY+ VK Q L +A + ++ + ++ + +A SEK HGR E R+ Sbjct: 180 VARKGDYVVQVKSNQRTLFQAVKAQYDVAFQDDSQLAQFACSEKGHGRTEQRITFQIPSK 239 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 +W +K L R I + + +Y+SS D+ E ATA+R HW Sbjct: 240 LSP-KLQEKWPSVKTLIAVERHRKI----GNKTSIETSFYLSSHDIDPEYIATAVRGHWR 294 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW LDVV ED C++ AE + +R +A+N+ + K ++ K+ ++ + Sbjct: 295 IENSLHWVLDVVYREDACRVHEQRTAESLAIVRRMALNLAKLEITQKRSMKSKLHRSLLS 354 Query: 365 RNYLASVLAG 374 Y ++ Sbjct: 355 DEYRELMIFA 364 >UniRef50_A4SU78 Transposase n=4 Tax=Gammaproteobacteria RepID=A4SU78_AERS4 Length = 371 Score = 328 bits (841), Expect = 2e-88, Method: Composition-based stats. Identities = 113/369 (30%), Positives = 191/369 (51%), Gaps = 6/369 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L+EH++++ + R +H L D++ L I A++SGAEGW DIE +G++ +D+L+Q+ F Sbjct: 9 LIEHLTLVEETRSEINRKHNLVDVMFLVISAIMSGAEGWNDIETYGDSKIDWLRQHRPFA 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP T+AR++ CI E + W+ + + K +IA DGK LR S+ + + A Sbjct: 69 NGIPRRHTVARILRCIVIDTLLEALLCWVNEQRTHQGKPIIAFDGKVLRGSF-RGNAKDA 127 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + +++A+ T + LV+ Q T K EI + ++L++L++KG ++T DA+ CQ++ EKI Sbjct: 128 LQLVTAYDTENGLVLSQKATPNKKGEIETVKDMLDILELKGAVVTLDALHCQRETLEKIS 187 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 ++ + VK Q +L +A + +F E E HGR+E R Sbjct: 188 EKKAHVVVQVKNNQPKLYQAVQSQFQSLFDAQKEKIVVEHKESGHGRQEERYVFQLKAKL 247 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + T +W ++ + RS + + YY+SS + IR HW + Sbjct: 248 PP-ELTEKWPTIRSIIAVERHRS----ANGKGTVDTSYYVSSLSPKHKLLGHYIRQHWRI 302 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN H+ LDVV NED +I +A E + R +NI+ R K+++A + Sbjct: 303 ENSQHYILDVVFNEDASRIAMEDAVENMALFRRFVLNIVKQSNCGARSQRNKLKRAGWND 362 Query: 366 NYLASVLAG 374 +Y A + G Sbjct: 363 DYRAQLFFG 371 >UniRef50_UPI00017448CC transposase, is4 family protein n=6 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI00017448CC Length = 370 Score = 325 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 122/376 (32%), Positives = 184/376 (48%), Gaps = 11/376 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + I D RQ KV H++ ++L++ C+ + E + D+ DF ++ L +L+ + Sbjct: 1 MEALRAKFAQIHDPRQAGKVRHRIDEVLIIAFCSTLCDGESYLDMGDFAQSQLSWLQSFL 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 ++G P HD V+ I P E W D + IAIDGK LR +++ Sbjct: 61 PLKHGAPSHDVFRNVLMAIQPQALLEVLTGWCGDL----EGRHIAIDGKALRGTHNAETG 116 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 R +H++ A+ + L GQI EKSNEI AIP LL L +KG +T DAMG Q IAE Sbjct: 117 RHLVHLLRAWVDDYHLSAGQITCHEKSNEIEAIPRLLESLQLKGATVTIDAMGTQAHIAE 176 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYA---ISEKSHGREEIRLHI 239 +I G DY+ A+K R ++ + F E + + E SHGR E R + Sbjct: 177 QITGAGADYVLALKANHPRAHETVRKHFTEAERLDLSPSHHRKSVTLELSHGRCERREYT 236 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 + + D +++W GL+ + R + P V Y++ S E+ A + Sbjct: 237 ITEELDWYHK-SWKWAGLQSVAQV--RRQVQRSHDGPPLEEVHYFLCSFKADVERLAKLV 293 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R HW VEN+ HW LDV NED C++R NAA + +R + I L K LRRK + Sbjct: 294 RGHWSVENRCHWVLDVTFNEDHCQVRDRNAAHNLTILREMVITTLHRHP-AKVSLRRKRK 352 Query: 360 KAAMDRNYLASVLAGS 375 A MD + +L Sbjct: 353 LATMDPAFRLQMLGLL 368 >UniRef50_C3R2Y4 Transposase n=5 Tax=Bacteroides RepID=C3R2Y4_9BACE Length = 397 Score = 325 bits (832), Expect = 2e-87, Method: Composition-based stats. Identities = 140/388 (36%), Positives = 197/388 (50%), Gaps = 26/388 (6%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L + + + +I D R +H+ S I+L+ I AVI GA+ W IEDFG++ F Sbjct: 14 LHEFADSLILI-DNRIDRCKKHQASTIVLIAISAVICGADTWNSIEDFGKSKESFFAAKL 72 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSY----- 117 NGIP HDT R S + P KF E + W++ IAIDGKT+R +Y Sbjct: 73 SNFNGIPSHDTFNRFFSALDPLKFEESYRQWVQSILKCYSG-HIAIDGKTIRGAYESEQD 131 Query: 118 ----------DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGK 167 D + + +HVISAF+T + +GQ+ T EK NEI IPELL+ML IK Sbjct: 132 KRHRKQGVLPDSNTGKYKLHVISAFATELGVSLGQLCTQEKENEIVVIPELLDMLCIKDC 191 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPL--KELNNPEHDSYAI 225 IIT DA+GCQ+ IAEK+ K GDY+F VK Q +L + + D Y Sbjct: 192 IITIDALGCQRTIAEKVIKGEGDYIFIVKDNQPKLKEIVLSVTESIVSKGTTVRFDKYET 251 Query: 226 SEKSHGREEIRLHIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYY 284 E+ HGR E R+ C+ P L D +WK ++ + R K + R + Sbjct: 252 HEEGHGRNESRICYCCNDPGFLGADIRKKWKNIQSFGYIENTR----NTNKGTTVEKRCF 307 Query: 285 ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 ISS + A+K R HW +EN LHW+LDV +ED+ + RR +A FS + IA+ L Sbjct: 308 ISSLEPDAQKILKNSREHWEIENNLHWQLDVNFHEDNTR-RRNISALNFSVLAKIALATL 366 Query: 345 TNDKVFKAGLRRKMRKAAMDRNYLASVL 372 N+K + + RK A D +L ++ Sbjct: 367 RNNK-REIPINRKRLIAGWDNEFLWELI 393 >UniRef50_B0ZEA1 Transposase n=4 Tax=Pyramidobacter piscolens RepID=B0ZEA1_9BACT Length = 390 Score = 324 bits (831), Expect = 2e-87, Method: Composition-based stats. Identities = 127/382 (33%), Positives = 194/382 (50%), Gaps = 15/382 (3%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + +E ++ I D+R + ++L DILL++ AVI + + ++ F + +L+ + D Sbjct: 3 QTFLELLAEIEDFRTGNAIHYRLQDILLVSALAVICNMDTYTEMAMFADHQRKYLEPFCD 62 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV------IAIDGKTLRHSY 117 F +G P HDT +V+S + P E F WM + + K V +AIDGKT+ S Sbjct: 63 FRHGPPSHDTFGKVLSRLDPRVLSERFSTWMSELYFHLGKLVESKGMTVAIDGKTICRS- 121 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 S + A HV++AF++ LV+GQIKTDEKSNEITAIPELL + +K ++T DAMG Q Sbjct: 122 -GSAEQNASHVLTAFASRMQLVLGQIKTDEKSNEITAIPELLELFQVKDTVVTIDAMGTQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHD------SYAISEKSHG 231 K+IA KI ++GGDY+ AVKG Q +L + + + EK HG Sbjct: 181 KNIAAKIIEKGGDYVLAVKGNQKKLRDDIIWHLHSELQDRSTRELKAKGQYAVTLEKDHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R E R + +W+G+ + + R + + K + S + Sbjct: 241 RIEKRECY-LSNDLSWFEGLEDWRGIAGVGWIRNTRHVDNKNDKTTTEDHYFIYSLKEAQ 299 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 A+ R HW +EN LHW LD+ EDDC+ R NAAE+ + +R +A+ +L K Sbjct: 300 AKDLLRIKREHWAIENNLHWMLDMAFREDDCRARAKNAAEVMNILRKLALQMLKTCDTCK 359 Query: 352 AGLRRKMRKAAMDRNYLASVLA 373 G+R K + + VL Sbjct: 360 CGMRSKRKLCGLGIPTALQVLG 381 >UniRef50_C6LAE6 IS1548, transposase n=7 Tax=Bryantella formatexigens DSM 14469 RepID=C6LAE6_9FIRM Length = 373 Score = 324 bits (830), Expect = 3e-87, Method: Composition-based stats. Identities = 127/379 (33%), Positives = 197/379 (51%), Gaps = 17/379 (4%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN---NPEHDSYAISEKSHGREEIR 236 IAEKI+ + DY+ ++K QG L + E F E EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFRKEIKERGIYKKTQEKAHGQIETR 238 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + L WKGLK + + E++ + + RY+ISS E + Sbjct: 239 EYYQTEKIKWLSQ-KKAWKGLKSIIMERK----TLEKEGKRLIEYRYFISSLKEEIETVS 293 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF--KAGL 354 A+R HW +E+ +HW LDV ED AA+ + IR +++IL +V K + Sbjct: 294 RAVRGHWSIES-MHWHLDVTFREDANTTIDKMAAQNLNIIRKWSLSILKTAEVSRHKLSM 352 Query: 355 RRKMRKAAMDR-NYLASVL 372 R+K + +L VL Sbjct: 353 RKKRYVIGLRPIKHLEEVL 371 >UniRef50_B6IV35 Transposase, is4 family n=2 Tax=Rhodospirillum centenum SW RepID=B6IV35_RHOCS Length = 385 Score = 323 bits (828), Expect = 6e-87, Method: Composition-based stats. Identities = 128/377 (33%), Positives = 196/377 (51%), Gaps = 13/377 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ L+EH S I D R ++ H L +ILLL +C ++ + +E+I +G HL FL+++ Sbjct: 12 LRVLLEHFSRIEDPRDERRILHPLPEILLLVVCGTMADCDDYENIAAWGAAHLPFLRRHL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR 122 + +G+P + +++ I PA F F W+R D +AIDGKT R S+D+ Sbjct: 72 PYAHGVPGERWLTILMNRIDPALFSAAFTAWVRATFPGRA-DFVAIDGKTSRRSHDRRAG 130 Query: 123 RGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD----IKGKIITTDAMGCQK 178 IH++SAF+T LV+ Q +K+NE+ AIP LL+ L + G +++ DA+ Sbjct: 131 TAPIHLVSAFATTSRLVLAQEAVPDKANELAAIPVLLDRLGENGGLAGALVSIDAIATNP 190 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 IA I+ QG DYL AVK Q L E F + + D + +K HGR E R Sbjct: 191 TIAAAIRGQGADYLLAVKANQPTLRAELEAAFAVGDGA----DHHHDLDKGHGRVEERHV 246 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK--KEPEMTVRYYISSADLTAEKFA 296 V D L + G +L + + RY+ISSA LTAE A Sbjct: 247 SVIREVDWLSGTRR-FPGEMRLPDVAAIVRVHTTAHIADRTRTDTRYFISSAPLTAEHAA 305 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R HW +EN+LHW LDV+ +D ++R G+ A+ + +RH A+N++ K L+ Sbjct: 306 DAVRGHWGIENRLHWVLDVLFKDDLSRLRTGHGAKNMAVVRHFALNLIRAANDQK-SLKT 364 Query: 357 KMRKAAMDRNYLASVLA 373 + + A +YLAS+L Sbjct: 365 RRKMAGWSDDYLASLLN 381 >UniRef50_B4U1L2 Transposase IS1548-like n=8 Tax=Streptococcus RepID=B4U1L2_STREM Length = 376 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 129/381 (33%), Positives = 200/381 (52%), Gaps = 21/381 (5%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + ++ + + D R+ WK++H LSDI+LL A +SGAE W++IE FG+ + LK Sbjct: 6 MDNIINFLITVKDDREPWKIKHVLSDIVLLIFFARLSGAEYWDEIEAFGQAYEATLKTVL 65 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---------DKDVIAIDGKTL 113 ENGIP HDT+ RV + + P E W SD K ++AIDGKT+ Sbjct: 66 QLENGIPSHDTLQRVFATLDPQVLVEVTQMWSDILEESDLSSRNLFSFSKRLVAIDGKTI 125 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDA 173 R + S ++ A+H+++A++T + GQ+ T+EKSNEITAIPELL+M+ +KG +++ DA Sbjct: 126 RG--NGSAKQKALHIVTAYATDLGISYGQVATNEKSNEITAIPELLDMISVKGCMVSIDA 183 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGRE 233 MG QK IA+KI K+ DY AVK Q L + F + + + D Y EK+HG+ Sbjct: 184 MGTQKAIADKIIKKKADYCLAVKENQKTLLEDIVPFFEMSQEAD---DHYHTVEKAHGQI 240 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 E R + V L E+ ++ + A I ++ + RY+I S ++A+ Sbjct: 241 ETRAYEVIHDVSWLRKTHPEFGHIQSIGRAR----IHLDKNGQESEESRYFILSCQVSAK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKA 352 + +R HW +E+ +HW LDVV ED K A + + + +L K Sbjct: 297 ELCDYVRGHWQIES-MHWLLDVVFREDANKTLNKQLAFNLNVMDKFCLAVLKQLDFGKKM 355 Query: 353 GLRRKMRKAAMD-RNYLASVL 372 +RRK ++ YL +L Sbjct: 356 SMRRKKYALSLSFDKYLKQLL 376 >UniRef50_D0TYS6 Transposase n=1 Tax=Bacteroides sp. 2_1_22 RepID=D0TYS6_9BACE Length = 430 Score = 322 bits (826), Expect = 1e-86, Method: Composition-based stats. Identities = 123/412 (29%), Positives = 185/412 (44%), Gaps = 44/412 (10%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E I I D R+ KV + I+L+T+ V + W DI DF DFL+++ Sbjct: 18 MASMSEAIDTI-DPREKNKVTYSGKLIMLVTLSGVFCDCQSWNDIADFARYKKDFLRRFI 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK------------------- 103 P HDT+ R I + C+ W + Sbjct: 77 PDLETTPSHDTLRRFFCIIKTERLESCYREWACNMRGDSPSIEDCDWSKVQIGEGNDLYT 136 Query: 104 -DVIAIDGKTLRHSYDKSR--------------RRGAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT+ + + + +H++SAF + SL +GQ + K Sbjct: 137 NRHIAIDGKTICGAINADKLVQESAGKITKEQAASAKLHIVSAFLSDMSLSLGQERVSIK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFE 207 NEI AIP+LL+ +DI +G ++T DA+G QK I EKI ++ DYL VK +L + E Sbjct: 197 ENEIVAIPKLLDDIDIRQGDVVTIDALGTQKKIVEKITEKQADYLLEVKDNHLKLRENIE 256 Query: 208 EKFPLKELNNPEHDSYAISE---KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 ++ E+D +E + HG R I C P L +WK L+ + Sbjct: 257 NDAEYLLISGRENDFIKRAEETTEGHGFMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIK 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + IA E + +ISS E R HW VEN LHW+LDV NEDD + Sbjct: 317 TEKINIAT--GEIQNEKHCFISSLVNNPELILKYKRKHWAVENGLHWQLDVTFNEDDGR- 373 Query: 325 RRGNAAELFSGIRHIAINILT--NDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 + N+A+ FS + +A+ IL D+ K + RK +KA YLA+++ Sbjct: 374 KMMNSAQNFSTLTKMALTILKNYQDEDKKTSVNRKRKKAGWSDEYLANLINN 425 >UniRef50_UPI00016C36BB transposase, is4 family protein n=3 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36BB Length = 380 Score = 322 bits (824), Expect = 1e-86, Method: Composition-based stats. Identities = 123/381 (32%), Positives = 188/381 (49%), Gaps = 19/381 (4%) Query: 6 LMEHISIIPDYRQT-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L + +PD R H L+DIL + CAVI+GAEGWEDI ++G + F +++ + Sbjct: 5 LTSVFADLPDPRTETANKIHTLTDILTIATCAVIAGAEGWEDIAEYGRSKEAFFRRFLEL 64 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--------DDKDVIAIDGKTLRHS 116 +NG+P HDT RV + + P F + F W + + D +A+DGK+ R S Sbjct: 65 KNGVPSHDTFYRVFTKLDPDAFADRFARWTAEVCEATGLTRPDIDGPTHVAVDGKSARRS 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 + G +H++ + +L++GQ E +EIT ++L LD+ G ++T DA GC Sbjct: 125 AKPT-FSGCLHLVEVWDIGSNLILGQRSVPEGGHEITTFRDVLATLDLTGAVVTLDAAGC 183 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEI 235 Q + E I+ +GG+Y+ VKG Q L A F D + +HGR E Sbjct: 184 QTETLEVIRARGGEYVVCVKGNQPTLRDAIAGVFDRAGEAEFAGCDGHTSVTDAHGRHEE 243 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V PD L W G+ + + R + + K E T YY+SS + A + Sbjct: 244 RNVTVVHDPDGL---PAGWAGVGSVALVCRDRQV---KGKANESTAHYYLSSLRVGAAEL 297 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 A IR HWH+E+ +HW LDV ED+ + R G+A IR +A+++L K + Sbjct: 298 AGYIRGHWHIES-MHWVLDVAFREDESRTRVGHAGANLGMIRRVAVSLLKRAG-KKGSIH 355 Query: 356 RKMRKAAMDRNYLASVLAGSG 376 + +A D Y+A VL G Sbjct: 356 TRRLRAGWDDQYMAQVLQGLS 376 >UniRef50_A4Y1Z8 Transposase, IS4 family n=1 Tax=Shewanella putrefaciens CN-32 RepID=A4Y1Z8_SHEPC Length = 367 Score = 315 bits (807), Expect = 2e-84, Method: Composition-based stats. Identities = 115/371 (30%), Positives = 188/371 (50%), Gaps = 7/371 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 +++H+ I D R EH + DI L + AVISGA+ W +FG L++L++Y F Sbjct: 1 MLKHLDNITDCRSHINQEHDVIDICFLVLSAVISGAQSWSACHEFGTLELEWLRKYRPFT 60 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP +I R+ +S + ++W+ + + + IAIDGK L+ S A Sbjct: 61 NGIPSQQSIGRIFRGVSKQSLLDALMSWVNEYRVNTGQSSIAIDGKVLKG-AKASASSAA 119 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H+++A+ T LV K +E+ + ELL LD+K +++T DA+ CQ + I Sbjct: 120 LHMVTAYDTGSGLVFSAKSGASKKSELKLVQELLTCLDLKSELLTFDALHCQSQTLDYIA 179 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 K+GGD + VKG Q +L +A + +F NNP+ + + + K HGR E R+ C + Sbjct: 180 KEGGDCILQVKGNQPKLYEALQAQFTDYIENNPDAECFTETNKGHGRVEKRITFQCPLNL 239 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + +W LK L R + + + +Y+SSA LT+E F AIR HW Sbjct: 240 P-AEIKMKWSQLKTLIAVERHRKV----GNKTSIDTHFYVSSAVLTSEAFGRAIRAHWQT 294 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN HW LD + ED K+ + A + + +R A+N++ K +K +A Sbjct: 295 ENNQHWLLDKLFQEDKQKMYDEDGASILAILRRWALNLVKLHP-AKTSQTQKFNRACWSD 353 Query: 366 NYLASVLAGSG 376 ++ ++ G+G Sbjct: 354 DFREEIIFGTG 364 >UniRef50_Q216R2 Transposase, IS4 family n=6 Tax=Rhizobiales RepID=Q216R2_RHOPB Length = 369 Score = 314 bits (805), Expect = 3e-84, Method: Composition-based stats. Identities = 101/375 (26%), Positives = 171/375 (45%), Gaps = 17/375 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + + E +PD R H L++IL + + A + GA D+ F + Sbjct: 4 PMDRFAECFEDLPDPR-AGNALHDLTEILFIALMATLCGATSCTDMALFARMKAYLWRDV 62 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDGKTLRHSY 117 +NG+P HDT +RV + P F + F +M+ K VIA+DGK LR Y Sbjct: 63 LVLKNGLPSHDTFSRVFRMLDPEAFEKAFQRFMKAFAKGAKIKPPKGVIALDGKALRRGY 122 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + R +++A++ + + ++ NE +L+ +L +KG ++T DA+ C Sbjct: 123 ESGRSHMPPVMVTAWAAQTRMALANVQAPNN-NEAAGALQLIELLQLKGCVVTADALHCH 181 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 + +AE I+ +GGDY+ AVK Q L + + + HGR+E R Sbjct: 182 RGMAEAIKARGGDYVLAVKDNQPALMRDAKAAIRAATRQGKPST--ITVDAGHGRKEKRR 239 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 +V VP D ++ GLK + S R + RY++ S + Sbjct: 240 AVVAAVPQMAQD--HDFAGLKAVARITSKR-------GTDKTVERYFLMSQAYPPKDVLR 290 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 +R HW +EN LHW LDVV++ED + R+ NA + +R +A+N+ LR K Sbjct: 291 IVRTHWTIENSLHWPLDVVLDEDLARNRKDNAPANLAVLRRLALNVARAHPDNTTSLRGK 350 Query: 358 MRKAAMDRNYLASVL 372 +++A + +L ++ Sbjct: 351 LKRAGWNDTFLFELI 365 >UniRef50_B6IML7 Transposase, is4 family n=1 Tax=Rhodospirillum centenum SW RepID=B6IML7_RHOCS Length = 373 Score = 312 bits (799), Expect = 1e-83, Method: Composition-based stats. Identities = 110/369 (29%), Positives = 176/369 (47%), Gaps = 12/369 (3%) Query: 10 ISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 +PD R H L D+L + + A I GAE D F +++ + G+P Sbjct: 9 FQGLPDPRTGNARRHALLDVLTIALTASICGAESCVDFATFARDRRALFEEFLELPGGLP 68 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVI 129 HDT +RV + P F CF ++ D D V+AIDGKTLR S+D++ R A+HV+ Sbjct: 69 SHDTFSRVFRLLDPVAFSRCFQQFL-DHLGEDGAGVLAIDGKTLRRSFDRAAGRSALHVV 127 Query: 130 SAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGG 189 SAF++ +++GQ NEI A LL + D+KG ++T DA+ Q+ A+ I ++GG Sbjct: 128 SAFASGARMIVGQRAVAAGENEIVAARALLELFDLKGVLVTGDALHAQERTAQTILERGG 187 Query: 190 DYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELID 249 D+LF +K + L E F + + ++ HGR E+R H V L Sbjct: 188 DWLFPLKDNRPALRAEVERYF--ADPATVLAVPHVTTDADHGRIEVRRHWVSHDVAWLAS 245 Query: 250 FTF-----EWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 GLK L + + T Y+SSA L + A A+R HW Sbjct: 246 DRRFPDEAVLPGLKILGLVER---TVTSPDGRTTATRTLYLSSAALEPKTLARAVRAHWS 302 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +E +HW LD +ED + R+ + E + +R +A+N++ + +R + ++A Sbjct: 303 IEAAVHWVLDTSFDEDRARNRKDHGPENLATLRKLALNVVRSANNQD-SIRLRRKRAGWS 361 Query: 365 RNYLASVLA 373 +Y ++L Sbjct: 362 DDYARTILG 370 >UniRef50_B5K1I5 Transposase, IS4 family protein n=9 Tax=Octadecabacter antarcticus 238 RepID=B5K1I5_9RHOB Length = 376 Score = 308 bits (789), Expect = 2e-82, Method: Composition-based stats. Identities = 104/377 (27%), Positives = 173/377 (45%), Gaps = 13/377 (3%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + + + +PD R V H L ++L++ +V+ G+ ++ FG F + Sbjct: 10 IAMHIFLSAFDEVPDPR-ASNVRHDLGELLVIAFVSVLCGSTSCAEMAAFGRAKESFFRN 68 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-DKDVIAIDGKTLRHSYDK 119 + ++ IP HDT + V I P F + D D D+IAIDGK LR + D Sbjct: 69 FLKLKHAIPSHDTFSEVFRIIDPKALDAAFSKVLADVTKLLKDGDIIAIDGKALRGARDP 128 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 ++SA+++ L + + D + E++A E L ++D++GK++T DA+ C + Sbjct: 129 GESARTRMMVSAYASRLRLTLATVPAD-RGTELSAAIEALGLIDLRGKVVTGDALHCNRR 187 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 I GGD+ A+KG Q L F ++P + HGR+E R + Sbjct: 188 TVAAINAGGGDWCLALKGNQESLLSDARGCFSKGHKSDP---TAVTENTGHGRKETRKAV 244 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 V + + E+ GLK + R E + RY+ S T E A+ Sbjct: 245 VVSA--KALAEYHEFPGLKGFGRIEATR----ETGGKVTSETRYFALSWVPTPEVLLAAV 298 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 R+HW +EN LHW+LDV ED + R+ N + +R A+++L D K L K++ Sbjct: 299 RDHWAIENALHWQLDVSFREDAARNRKDNGPGNIAVLRRRALDVLRRD-TSKGSLSIKIK 357 Query: 360 KAAMDRNYLASVLAGSG 376 +A D +L S+L+ Sbjct: 358 RAGWDTTFLRSILSDLA 374 >UniRef50_C8WDT5 Transposase IS4 family protein n=4 Tax=Alphaproteobacteria RepID=C8WDT5_ZYMMN Length = 397 Score = 303 bits (775), Expect = 8e-81, Method: Composition-based stats. Identities = 99/371 (26%), Positives = 165/371 (44%), Gaps = 13/371 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R H L ++L++ +V+ GA ++ FG + + + Sbjct: 37 ILSAFEDVPDPR-AENTRHDLGELLVIAFVSVLCGATSCAEMAAFGRAKESVFRGFLKLK 95 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-DKDVIAIDGKTLRHSYDKSRRRG 124 + +P HDT + V I P F + D + D DVIA+DGK LR + D Sbjct: 96 HAVPSHDTFSAVFRMIDPKALDAAFGRVLADVAALLRDGDVIAVDGKALRGARDAGESGR 155 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 ++SA++ L + + D + E+ A E L ++ +KGK++T DA+ C + I Sbjct: 156 TRMMVSAYAARLRLTLASVPAD-RGTELEAAIEALGLIALKGKVVTADALHCNRRTVAAI 214 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 GGD+ A+K Q L F + +P + HGR E R V Sbjct: 215 NAGGGDWCLALKANQDSLLSDARASFGAEPDAHPSA---LSEDIGHGRTETRKATVVS-- 269 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWH 304 + + E+ GLK + R + + RY+ S T E +R HW Sbjct: 270 SKALAEHHEFPGLKAFGRVEATR----KTAEGTTSETRYFALSWVPTPEVLLATVRAHWA 325 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD 364 +EN LHW+LDV ED + R+ N+ + +R A++++ D K L K+++A D Sbjct: 326 IENSLHWQLDVSFREDAARNRKDNSPGNIAILRRRALDVMRRD-TSKGSLSIKLKRAGWD 384 Query: 365 RNYLASVLAGS 375 ++L +VL G Sbjct: 385 DDFLRNVLNGL 395 >UniRef50_C3QM62 Transposase n=1 Tax=Bacteroides sp. D1 RepID=C3QM62_9BACE Length = 387 Score = 302 bits (773), Expect = 2e-80, Method: Composition-based stats. Identities = 115/385 (29%), Positives = 173/385 (44%), Gaps = 43/385 (11%) Query: 30 LLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHEC 89 +L+T+ V + W DI DF DFL+++ P HDT+ R I + C Sbjct: 1 MLVTLSGVFCDCQSWNDIADFARYKKDFLRRFIPDLETTPSHDTLRRFFCIIKTERLESC 60 Query: 90 FINWMRDCHSSDDK--------------------DVIAIDGKTLRHSYDKSR-------- 121 + W + IAIDGKT+ + + + Sbjct: 61 YREWACNMRGDSPSIEDCDWSKVQIGEGNDLYTNRHIAIDGKTICGAINADKLVQESAGK 120 Query: 122 ------RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDI-KGKIITTDAM 174 +H++SAF + SL +GQ + K NEI AIP+LL+ +DI +G ++T DA+ Sbjct: 121 ITKEQAASAKLHIVSAFLSDMSLSLGQERVSIKENEIVAIPKLLDDIDIRQGDVVTIDAL 180 Query: 175 GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISE---KSHG 231 G QK I EKI ++ DYL VK +L + E ++ E+D +E + HG Sbjct: 181 GTQKKIVEKITEKQADYLLEVKDNHLKLRENIENDAEYLLISGRENDFIKRAEETTEGHG 240 Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 R I C P L +WK L+ + + + IA E + +ISS Sbjct: 241 FMVTRTCISCSEPSRLGFCYRDWKNLRTYGIIKTEKINIAT--GEIQNEKHCFISSLVNN 298 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT--NDKV 349 E R HW VEN LHW+LDV NEDD + + N+A+ FS + +A+ IL D+ Sbjct: 299 PELILKYKRKHWAVENGLHWQLDVTFNEDDGR-KMMNSAQNFSTLTKMALTILKNYQDED 357 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAG 374 K + RK +KA YLA+++ Sbjct: 358 KKTSVNRKRKKAGWSDEYLANLINN 382 >UniRef50_Q5P2A2 Predicted transposase n=11 Tax=Bacteria RepID=Q5P2A2_AZOSE Length = 491 Score = 298 bits (762), Expect = 3e-79, Method: Composition-based stats. Identities = 108/307 (35%), Positives = 163/307 (53%), Gaps = 7/307 (2%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L + E +PD R + H LS++L + +CAV+ GA + D+ +G+++L +L+++ Sbjct: 5 QLMPVSEVFVSVPDPRSKRQARHDLSELLTVAVCAVLCGANDFVDVALWGKSNLAWLRKF 64 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 + G+P HDT RV++ I PA F F+ W+ + D V+AIDGKT R S K Sbjct: 65 LKLKAGVPSHDTFCRVLAMIDPAAFEAAFLRWVGVLVPALAPDSVVAIDGKTSRRSGGKD 124 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 +H++SAF+ LV+GQ TD+KSNEITAIPELL ML ++G I+T DAMG Q I Sbjct: 125 TSG-PLHMVSAFAAGMGLVLGQRATDQKSNEITAIPELLAMLALEGTIVTIDAMGTQAAI 183 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIV 240 A I+ +G DY+ VK L + + K HGR E+R Sbjct: 184 ARTIRSRGADYVLCVKDNPPTLTDSILLTLAGVAEKIAPASHFEEQTKGHGRVEVRRCWA 243 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 D +L + +W GL+ + R++ + + YYISS A + A A+R Sbjct: 244 YDAVSQLYK-SEQWAGLQSFALVERERTV----DGKTSVERHYYISSLPADAARIAQAVR 298 Query: 301 NHWHVEN 307 +HW VE+ Sbjct: 299 SHWAVES 305 >UniRef50_A2UVU9 Transposase, IS4 n=2 Tax=Gammaproteobacteria RepID=A2UVU9_SHEPU Length = 364 Score = 297 bits (761), Expect = 3e-79, Method: Composition-based stats. Identities = 103/369 (27%), Positives = 186/369 (50%), Gaps = 7/369 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H+ II D R ++H L D++ LT+ A++SGA GW+ IE FG LD+L+ Y FE Sbjct: 3 LLKHLEIISDPRTDINIKHNLIDVVFLTLSAILSGATGWKSIEQFGIHQLDWLRLYRPFE 62 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 +GIP IA ++ + + W+ D K +IA+DGKT+R ++ + A Sbjct: 63 HGIPRRHCIANIIKSLDSELLLQAIFGWLNDKRLQTGKPIIALDGKTMRRAWADDIHQ-A 121 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 +H++SAF + + + ++K +E ++++ L + ++T DA+ CQK +KI Sbjct: 122 LHIVSAFDVRNGMALYLEAAEKKGHEAAIARDIIDALALDNAVVTLDALHCQKATMDKII 181 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPD 245 + D++ +KG Q A + ++P + HGR+E R + + Sbjct: 182 SKKSDFVIQIKGNQPA-LLAAVKAAFAACYDSPALAISEQTNTGHGRKECRRVMQIEGNL 240 Query: 246 ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHV 305 + + +W ++ L S R++ + + R+Y+SS + + A IR HW + Sbjct: 241 PP-ELSEKWPHIRTLVEVASERTV----GNKTACSSRWYVSSLPVDTAQLADIIRAHWAI 295 Query: 306 ENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 EN+LHW LDVV ED+ + + A+ + A++++ + K L K + AA D Sbjct: 296 ENQLHWVLDVVFREDELNVSDPDGAKHLALFNRAALSVIKQHQGKKDSLAAKRQSAAWDP 355 Query: 366 NYLASVLAG 374 + + +L G Sbjct: 356 AFRSELLFG 364 >UniRef50_Q1J9L7 Transposase n=54 Tax=cellular organisms RepID=Q1J9L7_STRPB Length = 380 Score = 297 bits (759), Expect = 6e-79, Method: Composition-based stats. Identities = 117/387 (30%), Positives = 183/387 (47%), Gaps = 20/387 (5%) Query: 3 LKKLMEHISIIPD------YRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLD 56 + +++ I I D RQ+WK+ + LS IL L ++G E +++EDF E + Sbjct: 1 MTTMIDFIISIDDCAVELDSRQSWKIRYPLSTILFLVFVCQLAGIETSKEMEDFIEMNEP 60 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-KDVIAIDGKTLRH 115 Y D G P HDT+ RV+S ++ + E + + + S D +I++DGKT+R Sbjct: 61 LFATYVDLSEGCPSHDTLERVISLVNSDRLKELKVQFEQSLTSLDAVHQLISVDGKTIRG 120 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 ++ + + +H+++A+ H L +GQ+ +EKSNEI AIP+LL +DI+ I+T DAMG Sbjct: 121 --NRGKNQKPVHIVTAYDGGHHLSLGQVAVEEKSNEIVAIPQLLRTIDIRKSIVTIDAMG 178 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPL---KELNNPEHDSYAISEKSHGR 232 Q I + I K DY AVKG Q L F E Y EKS G+ Sbjct: 179 TQTAIVDTIIKGKADYCLAVKGNQETLYDDIALYFSDVNLLEELQENAQYYQTVEKSRGQ 238 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 E+R + V L +W L+ + + ++ + RY+I S Sbjct: 239 IEVREYWVSSDIKWLCQNHPKWHKLRGIGM----TRNTIDKDGQLSQENRYFIFSFKPDV 294 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVF 350 FA +R HW +E+ +HW LDVV +ED + AA + IR + + L Sbjct: 295 LTFANCVRGHWQIES-MHWLLDVVYHEDHHQTLDKRAAFNLNLIRKMCLYFLKVMVFPKK 353 Query: 351 KAGLRRKMRKAAMD-RNYLASVLAGSG 376 RRK R ++ +YL + G Sbjct: 354 DLSYRRKQRYISVHLEDYLVQLFGERG 380 >UniRef50_B8IA82 Transposase IS4 family protein n=2 Tax=Alphaproteobacteria RepID=B8IA82_METNO Length = 342 Score = 293 bits (749), Expect = 9e-78, Method: Composition-based stats. Identities = 114/340 (33%), Positives = 170/340 (50%), Gaps = 4/340 (1%) Query: 38 ISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 ++ AE WEDIE +G + +L+ + NGIP HDT RV + F CF ++ Sbjct: 4 VACAESWEDIELYGRSKQAWLQTFLTLPNGIPSHDTFRRVFMLLDADAFERCFTRCVQFR 63 Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++V+A+DGK++R S G +H++S +++ L +GQ D KSNEI AIPE Sbjct: 64 AGGIAREVVAVDGKSVRRSASHRHEHGPLHLVSTWASRRGLALGQRALDGKSNEIRAIPE 123 Query: 158 LLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 LL L + G I+T DAMGCQ IAE+I+ +G D L +K G +A F L + Sbjct: 124 LLETLQLDGCIVTLDAMGCQTSIAERIRAKGADDLLVLKANHGGAYRAVRMHFERTCLGS 183 Query: 218 PEHDSYAISE-KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 + HGR R V D + W L ++ + R I Sbjct: 184 GAAGRPVFDAFEGHGRLVRRRVFV-DAAATALAPLSGWPDLSRVLAVETLRGI--PGTGT 240 Query: 277 PEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGI 336 +RY+++S IR HW VEN LHW L+V EDD ++R AA F+ + Sbjct: 241 VVADIRYFLTSCRDDPSVLVGVIRRHWSVENALHWVLEVSFREDDSRVRDRTAARNFALV 300 Query: 337 RHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 R IA+N++ D+ +A LR + +KAA D +Y+ ++A Sbjct: 301 RKIALNLIAQDRSTQASLRGRRKKAAWDDDYMLQIIANQA 340 >UniRef50_C8SXA2 Transposase IS4 family protein n=1 Tax=Mesorhizobium opportunistum WSM2075 RepID=C8SXA2_9RHIZ Length = 367 Score = 290 bits (741), Expect = 7e-77, Method: Composition-based stats. Identities = 102/372 (27%), Positives = 169/372 (45%), Gaps = 14/372 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++ +PD R +H L +IL + + AV+ GA ++E F + LD L+Q+ E Sbjct: 3 FLDVFGEVPDPRD-LTAQHPLPEILFVALAAVLCGATHCTEMELFARSRLDLLRQFIPLE 61 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD----DKDVIAIDGKTLRHSYDKSR 121 G P HDT +RV++ + P +E F+ +M K +A+DGK+LR +Y K R Sbjct: 62 RGAPSHDTFSRVLAALDPVALNEAFMRFMAAFGEQARIDAPKGQVAVDGKSLRRAYAKGR 121 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 V++ F + + Q ++ E+ A L +L +KG +T DA+ C + + Sbjct: 122 SHMPPLVVTVFGCDTFMSLAQT-VAQEGGEVQAAIAALELLSLKGLTVTADALHCHRRMT 180 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 + ++ GG Y+ A+KG Q +L + E +HGR E+R V Sbjct: 181 KTVRDGGGHYVIAIKGNQSKLAAEANTALDKA-AAGKATKFHQTEEDAHGRHEVRRAFVI 239 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 L + S+R++ + + VR Y S + A + +R Sbjct: 240 PFAQTPGKNALV--DLCAIGRVESWRTV----EGKTTHKVRCYALSRKMPAHELLATVRR 293 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN LHW+LDV++ ED + R+ N A + +R + +N+L D K L K KA Sbjct: 294 HWSIENDLHWQLDVLLGEDHIRGRKNNTAANHAILRRLTLNVLRADP-EKIPLSHKRLKA 352 Query: 362 AMDRNYLASVLA 373 L S+ Sbjct: 353 RWADQDLLSLFT 364 >UniRef50_B2RKL8 Transposase in ISPg2 n=7 Tax=Porphyromonas gingivalis RepID=B2RKL8_PORG3 Length = 376 Score = 282 bits (721), Expect = 1e-74, Method: Composition-based stats. Identities = 115/368 (31%), Positives = 172/368 (46%), Gaps = 15/368 (4%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L E + ++P R K + L +LL+ + +SG W +IED+ E + + LK + Sbjct: 4 SLFESLCMVPGPRIERKKIYPLDFLLLIVFLSTLSGNTSWYEIEDYAEEYEEELKSLYEM 63 Query: 65 ENG------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYD 118 G +P HDT+ R +S + F + W+ S+ I IDGKT+R Sbjct: 64 LTGHQLMHTMPSHDTLNRSISLLDVEAFEGAYKRWIEGFISATSGKHICIDGKTMRG-VK 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K HV+SAFS + Q+ D K+NEI AI +LL++LD+ G +++ DA+G Q Sbjct: 123 KLSFDTQSHVVSAFSPQDMCSLAQLYIDRKTNEIPAIHQLLDLLDLNGAVVSIDAIGTQT 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 I E+I +GGDY+ VK Q + E F + D +E SHGR E R + Sbjct: 183 AIVEQIIDKGGDYVLCVKANQSLSLQEIEAYFCPLFQKHILLD--EQTELSHGRIETRRY 240 Query: 239 I--VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 + + E + KGL+ + V R ++ + V YYISS Sbjct: 241 ESILNPLEIEANEVLTRRKGLRSIHKVVRKR--RDKKSDKTSEEVAYYISSLT-DVSSLK 297 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV-FKAGLR 355 AIR HW +ENKLH LDV D R N A++ I+ I + I+ K K+ + Sbjct: 298 QAIRGHWAIENKLHHCLDVYFGHDASHKRTRNVAQIMDIIQKINLLIIERLKTNMKSSIP 357 Query: 356 RKMRKAAM 363 R +K A Sbjct: 358 RIQKKPAR 365 >UniRef50_C3KML6 Putative transposase for insertion sequence NGRIS-17a n=1 Tax=Rhizobium sp. NGR234 RepID=C3KML6_RHISN Length = 370 Score = 271 bits (692), Expect = 4e-71, Method: Composition-based stats. Identities = 104/372 (27%), Positives = 168/372 (45%), Gaps = 14/372 (3%) Query: 7 MEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 + + I D R H L+++L L + A + GA+ +I +F E LK+ + Sbjct: 5 LSILREIHDPRD-INARHDLAELLFLALAATLCGAKNCVEIAEFVEGREAELKEIVTLRH 63 Query: 67 GIPVHDTIARVVSCISPAKFHECFINWMRDCH-----SSDDKDVIAIDGKTLRHSYDKSR 121 G P HDT +R+ I P + ++ + V+A+DGK LR Y+K R Sbjct: 64 GCPSHDTFSRIFRLIDPDELARALGAFLAALRQGLGLGPRPRGVVAVDGKALRRGYEKGR 123 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 ++S + L + + + S+E+ A LL +D+KG I+T DA+ C+ D A Sbjct: 124 AFMPPVMVSVWDAETRLSVATKRAEG-SDEVAATLALLKSIDLKGCIVTADALHCRPDTA 182 Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVC 241 + + + Y A+K +GRL E F + + + E HGR E R V Sbjct: 183 KALIGRKAHYALALKANRGRLFACAEAGFVAADAAG-DLAFHETRETGHGRLETRRASVL 241 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRN 301 + + + GLK + + R +VRY S L K A +R Sbjct: 242 PL--KAFKQAPAFPGLKAIGRIQATRQ---GADGRAVTSVRYIALSKVLAPHKLAEVVRA 296 Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKA 361 HW +EN+LHW LDVV +EDD + R+ NA + + IR +A +IL + K + KMR+ Sbjct: 297 HWTIENQLHWSLDVVFHEDDARSRKDNAPQNLAVIRRLARDILAAHPLDK-PIASKMRRV 355 Query: 362 AMDRNYLASVLA 373 +R++ Sbjct: 356 NWNRDFFHEFFT 367 >UniRef50_C2M822 ISPg2, transposase n=1 Tax=Capnocytophaga gingivalis ATCC 33624 RepID=C2M822_CAPGI Length = 376 Score = 270 bits (691), Expect = 5e-71, Method: Composition-based stats. Identities = 101/372 (27%), Positives = 179/372 (48%), Gaps = 17/372 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 ++ I+++ D R ++++ L ILL+++ A ISG + WE IED+ H + L+ Sbjct: 3 AEIWNAIAVVKDPRVQGRIDYPLGLILLVSLYATISGCDDWEQIEDYANIHSEDLRNLYT 62 Query: 64 F-------ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 + +P HDT V I P +F E + ++ + + IAIDGKT R Sbjct: 63 KLSGKELKVSRMPTHDTFNHVFQVIDPKEFLEVYKKFIISIYETLTGKTIAIDGKTPRG- 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 ++ +++SA+ T H VI I ++ K +E+++I +L+ +L ++ +T DA G Sbjct: 122 IKQTANSHPSNIVSAYCTDHHFVIDHINSEVKGHELSSILDLIKLLFLENNTVTIDAAGT 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIR 236 ++ E I +GG+++ VKG Q +L + E++F +E + + HGR E R Sbjct: 182 YVEVIEMILSKGGNFVLPVKGNQKKLLEFIEKEF--REYRGNTVSADTQEDIGHGRVEKR 239 Query: 237 LHIVCDVP---DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 D++ +WKG+K L V R + + K + YYI++ + + Sbjct: 240 TVYCITEIKTDDDIDGCMQKWKGVKTLVKIV--REVYKKADKSTRIETVYYITNL-IDPK 296 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA- 352 + AIR HW +EN LH LDV++NED + N E F + +A+ I+ + Sbjct: 297 EINRAIRAHWGIENNLHRSLDVLLNEDHSQKSNHNVVENFHIMSLLALFIIKEISKQRGI 356 Query: 353 GLRRKMRKAAMD 364 + R + Sbjct: 357 SMNRTRKLCGYS 368 >UniRef50_Q6XMU0 Putative transposase n=1 Tax=Rhodococcus erythropolis RepID=Q6XMU0_RHOER Length = 405 Score = 269 bits (688), Expect = 1e-70, Method: Composition-based stats. Identities = 87/365 (23%), Positives = 165/365 (45%), Gaps = 18/365 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD R ++L ++ + +CAV +GA + I D+ + Q Sbjct: 42 QMPDLLTTLEAVPDPRARRGRRYRLPSLIAVALCAVSAGARSFYAIADWAAGVPRQVLQR 101 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD-VIAIDGKTLRHSYDKS 120 +P TI +V + + +D +A+DGKT+R + + Sbjct: 102 CGIRFRVPSEATIRQVFGRVDGDALDRVLGRHLAARAGADTVRLAVAVDGKTIRGA--RI 159 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 ++ A H+++A + ++V+GQ +T +KSNEI + LL +DI G ++T DAM QK Sbjct: 160 GKQAAPHLVAAVTHGDTVVLGQCRTADKSNEIPTVRRLLRGIDITGAVVTVDAMHTQKAT 219 Query: 181 AEKIQKQG-GDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 A +++Q +Y+ VK Q L ++ P +++ D E+ HGREE R + Sbjct: 220 ARCLREQCRAEYVMIVKANQPGLLARVRDQ-PWEQVPVVWSD---PVERGHGREEHRSYK 275 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFA 296 + V L + +++ + R ++ V Y I S + A Sbjct: 276 ILTVARGL-----RFPYAQQVIQIIRRRRVLGAGAW--STEVVYAICSLPCEQAPPKLLA 328 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 + IR HWH+EN++H+ DV +ED +R G+ ++ + +R++ + + Sbjct: 329 SWIRGHWHIENRIHYVRDVTFDEDRSAVRTGHGPQVMATLRNVVVGLHRRAGHSNIARAC 388 Query: 357 KMRKA 361 + A Sbjct: 389 RRLAA 393 >UniRef50_B0JJ38 Transposase n=42 Tax=Cyanobacteria RepID=B0JJ38_MICAN Length = 363 Score = 266 bits (680), Expect = 9e-70, Method: Composition-based stats. Identities = 86/348 (24%), Positives = 159/348 (45%), Gaps = 16/348 (4%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-KQY 61 + L+E + + D+R+ H L +L++ I + G G+ ++ +F + + L +++ Sbjct: 1 MLSLIEKLKQVKDFRKDKGKRHPLWIVLVVIILGTMLGYSGYRELGEFAKNNRHRLSQEF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWM-RDCHSSDDKDVIAIDGKTLRHSYDK- 119 +P + TI RV+ + + F W + DD + + +DGK+L+++ Sbjct: 61 NIIPERVPSYSTIRRVMMGVEWQILLKMFNEWALEEYGQRDDINWLGMDGKSLKNTLKNP 120 Query: 120 -SRRRGAIHVISAFSTMHSLVIGQIKTD-EKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 + ++ I +S FS LV+ + + +K +EI ++ ++ K+ T DA+ CQ Sbjct: 121 NNEQQNFIMFVSLFSQESGLVLHLKRIENKKGSEIDEGQAIIEDCSLQNKVFTGDALHCQ 180 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 K I K DY+ VKG Q L K ++ ++ + + SHGR+ R Sbjct: 181 KKTISLIAKTKNDYVITVKGNQKNLYKRIQDLSN----SSKPESCFLEQDNSHGRKISRK 236 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 V V + ++ L+++ + + YYISS +A+ FA Sbjct: 237 IEVFKVRK---NERQGFENLRRVIKVERK----GSRGDKTYEETAYYISSLTESAQVFAK 289 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILT 345 IR HW +EN+LHW DV+ ED +I AA +S + I +N+ Sbjct: 290 IIRGHWKIENQLHWVKDVIFEEDKSEISDFQAASNWSILTTIGLNLFR 337 >UniRef50_C9NKZ6 Transposase IS4 family protein n=1 Tax=Streptomyces flavogriseus ATCC 33331 RepID=C9NKZ6_9ACTO Length = 405 Score = 266 bits (679), Expect = 1e-69, Method: Composition-based stats. Identities = 101/386 (26%), Positives = 169/386 (43%), Gaps = 30/386 (7%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 E+ L+E ++ +PD R V H L+ +L LT CAV++GA + ++ + L + Sbjct: 38 EIPDLLECLAQVPDPRNPRGVRHPLAAVLALTACAVLAGARSLLAVSEWVAEAPEELLER 97 Query: 62 GDFE-------NGIPVHDTIARVVSCISPAKFHECFINW-MRDCHSSDDKDVIAIDGKTL 113 P TI RV++ I W + +A+DGK+L Sbjct: 98 LGIRVDPLFPKRSWPAETTIRRVLARIDANALDRAVGTWLACRQQDAGGLRALAVDGKSL 157 Query: 114 RHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTD 172 R + RR +H+++A + LV+ Q+ EK+NEIT LL+ L D+ G ++T+D Sbjct: 158 RGAARAKGRR--VHLLAACDHVGGLVLAQMDVGEKTNEITRFRPLLDTLPDLSGTVVTSD 215 Query: 173 AMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGR 232 A+ Q D A ++ + Y+ VK +L+ + P +++ + HGR Sbjct: 216 ALHTQTDHATYLRGRDTHYIVIVKRNTKKLSTQLKS-LPWQQIPLQDR----TRTTGHGR 270 Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---D 289 EIR VC V + L + G ++ V R + + + Y ++S Sbjct: 271 CEIRRLKVCTVNNLL------FPGARQAVQIVRRR--VNRTTGKVSLKTIYAVTSLAAEQ 322 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 + A IR HW VE LH DV ED ++R GNA + + R++AI L V Sbjct: 323 APPARVAQLIRGHWTVEA-LHHVRDVTFAEDASQLRSGNAPQAMATYRNLAIGALRLAGV 381 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAGS 375 + +R+ A D+ + L + Sbjct: 382 RN--IAAGLRRTARDQTRTLTHLGLT 405 >UniRef50_D1UC34 Transposase IS4 family protein (Fragment) n=2 Tax=Bacteria RepID=D1UC34_9DELT Length = 247 Score = 263 bits (673), Expect = 6e-69, Method: Composition-based stats. Identities = 92/253 (36%), Positives = 144/253 (56%), Gaps = 7/253 (2%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 ++H+++A+ + +L++GQ+K D+KSNEITAIP+LL ML ++G I+T DAMGCQK IA++ Sbjct: 1 NSLHLVNAWLSQDNLILGQVKVDDKSNEITAIPKLLEMLHLEGAIVTIDAMGCQKAIAKQ 60 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN-PEHDSYAISEKSHGREEIRLHIVCD 242 I + DY+ AVK Q L + + F ++N H + + HGR E R + Sbjct: 61 IGSKKADYVLAVKQNQPELYEYIDLLFNESKVNTSLLHQTRRTIDSGHGRIETREYSTI- 119 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V D+L+ W L + + S R + RY+I S + A++F A+R H Sbjct: 120 VGDDLLAGITGWDNLNAIGMVESKREV----GNTISNEKRYFIMSINGHAQRFGDAVREH 175 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 W +EN +HW LDV ED +IR+ N+ E S +R IA+N + + K ++RK + A Sbjct: 176 WGIENTVHWVLDVSFGEDQSRIRKDNSPENLSMLRKIALNCVKQEST-KTSMKRKRKMAG 234 Query: 363 MDRNYLASVLAGS 375 D ++L VL G+ Sbjct: 235 WDNSFLIKVLTGN 247 >UniRef50_UPI0001B4AD82 transposase n=1 Tax=Bacteroides sp. 2_1_7 RepID=UPI0001B4AD82 Length = 375 Score = 263 bits (672), Expect = 7e-69, Method: Composition-based stats. Identities = 105/360 (29%), Positives = 161/360 (44%), Gaps = 41/360 (11%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 L+ + + I D RQ KV H+ I++ + V + + W ++ DF +DF++++ Sbjct: 18 LESMYNALGEI-DSRQESKVVHRAGVIMMSALIGVFANCQSWNEVADFSAERIDFIRKFF 76 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD--------------------D 102 P HDT+ R + P + W + + Sbjct: 77 PDIQKAPSHDTLRRFFCLVCPNALERRYRAWALNMRENLATSKEEGLVEEGIAEEREKKP 136 Query: 103 KDVIAIDGKTLRHSYDKSRRR--------------GAIHVISAFSTMHSLVIGQIKTDEK 148 IAIDGKT++ + ++ RRR +H++SAFS L +GQ + D+K Sbjct: 137 FRQIAIDGKTIKKAMNERRRRDEDGFFMTQEERSNDKLHIVSAFSVDDCLSLGQERVDKK 196 Query: 149 SNEITAIPELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAF- 206 NEI AIP LL+ LDI +G ++T DAMG QKDI +I K+ YL VK Q L + Sbjct: 197 ENEIVAIPRLLDDLDISEGDVVTIDAMGTQKDIVSRIAKKRAGYLLEVKKNQATLWETIA 256 Query: 207 --EEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 F L N + + E HG +R VC L +W+ L+ + Sbjct: 257 GNMRDFERIPLPNEVYKVHKEGENGHGFVFLRECRVCSSLHSLGKIYKDWENLRSYGLIR 316 Query: 265 SFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + R + E E + Y+ISS + EK R HW +EN LHW+LD+ EDD ++ Sbjct: 317 TER--VDEATGESAVETHYFISSLENDPEKIMRVKRKHWGIENGLHWQLDITFKEDDGRM 374 >UniRef50_C6Z7R2 H repeat-containing protein n=4 Tax=Bacteroides RepID=C6Z7R2_9BACE Length = 393 Score = 262 bits (669), Expect = 2e-68, Method: Composition-based stats. Identities = 106/381 (27%), Positives = 175/381 (45%), Gaps = 23/381 (6%) Query: 3 LKKLMEHISIIPDYRQ--TWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 +K L E + +PDYR+ ++KL DILLL I + DI FG+ +L + Sbjct: 18 MKHLKEFVKSVPDYRRTDKGNYKYKLEDILLLVILGRLGKCITSPDIIRFGKRNLKRFRS 77 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---DKDVIAIDGKTLRHSY 117 G +G+P T+ R+ I E + H D++ IDGK +R + Sbjct: 78 LGILLDGVPSEPTLCRIFKHIDDEAMSERMSEFTSTFHDELVGCAGDILCIDGKAMRGTV 137 Query: 118 DKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQ 177 ++ R I +SA+S + + +EKSNEIT++P+LL+ +D+ G I+T DAM Q Sbjct: 138 LENGRNPDI--VSAYSLEGGVTLATDMCEEKSNEITSVPKLLDKVDVSGCIVTADAMSFQ 195 Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRL 237 K I +KI+++GGD+L +K Q L E+ L E + + + HGR E R+ Sbjct: 196 KAIIDKIREKGGDFLIELKANQRTLRYGVEDNVELAEPVDVYSEGPFLE---HGRIETRV 252 Query: 238 HIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT 297 + D LI +W G L V + + + R+Y+SS +A + T Sbjct: 253 CRIFRGND-LITDREKWNG--NLTVVEIRTATERKSDGQKSSERRFYVSSFHGSARRLGT 309 Query: 298 AIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL-RR 356 R HW +E+ +HW LD + +D + +A I+ + + IL + + Sbjct: 310 IARMHWAIES-MHWDLDRNLRQDFIRRNSARSARNLDTIQRMVLAIL--------SIWKG 360 Query: 357 KMRKAAMDRNYLASVLAGSGL 377 K +K + A ++ L Sbjct: 361 KRKKPSEKAKGTAELIGELSL 381 >UniRef50_D1VY64 Transposase, IS4 family n=8 Tax=Bacteroidales RepID=D1VY64_9BACT Length = 412 Score = 260 bits (664), Expect = 6e-68, Method: Composition-based stats. Identities = 108/349 (30%), Positives = 165/349 (47%), Gaps = 16/349 (4%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+ K + HKLSDI++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLSDIIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H ++I IDGK R Sbjct: 95 LDILVNGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELVGMCCTQEIICIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + K+ R I +SA S + + +EKSNEI A+P L++ +DI GKI+T DAM Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEACEEKSNEIKAVPLLIDKIDISGKIVTADAMS 212 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 QKDI +KI+++ GD++ +K Q L E+K +P + E HGR E Sbjct: 213 MQKDIVDKIREKNGDFIIELKSNQRSLRYGVEDKIKEL---SPVYSYCGEPELGHGRIET 269 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + V D D LI +W G L + + + R ++SS + Sbjct: 270 RSYRVFDGTD-LIANKEKWNG--NLTIIEYECETVKKSTGNCTTEKRLHVSSLPANTPRL 326 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 T +RNHW +E+ +HW LD + +D K + AA I+ I ++ Sbjct: 327 GTPVRNHWSIES-MHWGLDRNLLQDKIKRKSARAARNLDTIQRIVYSVF 374 >UniRef50_P22644 Uncharacterized protein in dhlA 3'region (Fragment) n=1 Tax=Xanthobacter autotrophicus RepID=YDH2_XANAU Length = 295 Score = 254 bits (649), Expect = 3e-66, Method: Composition-based stats. Identities = 107/286 (37%), Positives = 154/286 (53%), Gaps = 9/286 (3%) Query: 9 HISIIPDYRQT-WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 +IPD R+ H LSDIL + +CAV+SG + WE + +FG T +L+Q+ NG Sbjct: 17 FFELIPDPRRATPNKLHSLSDILSIALCAVLSGMDDWEAVAEFGRTKEGWLRQFLPLANG 76 Query: 68 IPVHDTIARVVSCISPAKFHECFINWMR-DCHSSDDKDVIAIDGKTLRHSYDKSRRRGAI 126 IP HDT RV S I P F F +W D D +A+DGKT+R S+ S R A+ Sbjct: 77 IPSHDTFGRVFSLIDPEAFEAAFFDWAAHARIGGDVLDQLALDGKTVRRSHRGSAGR-AL 135 Query: 127 HVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQK 186 H++ A+S L++ Q + D KSNEITAIP++L++ D++G I+ DA+GCQK +A +I + Sbjct: 136 HLLHAWSCETRLLVAQRRVDTKSNEITAIPDILSLFDLRGVTISIDAIGCQKAVARQITE 195 Query: 187 QGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDE 246 GGDY+ A+KG Q L+ + +P+ + EK HGR E R V D D Sbjct: 196 AGGDYVLALKGNQSALHDDVRLFMETQADRHPQGQA-EAVEKDHGRIETRRIWVNDEIDW 254 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA 292 L +W GLK L + S R + + R +I+S Sbjct: 255 LTQ-KPDWPGLKTLVMVESRREL----NGQVSCERRCFITSHTADP 295 >UniRef50_D1BSD0 Transposase IS4 family protein n=1 Tax=Xylanimonas cellulosilytica DSM 15894 RepID=D1BSD0_XYLCX Length = 483 Score = 244 bits (623), Expect = 4e-63, Method: Composition-based stats. Identities = 79/378 (20%), Positives = 131/378 (34%), Gaps = 35/378 (9%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L+E + +PD R+ V L +L L + AV GA G+ +I + L Sbjct: 31 QVEGLLEAFAQVPDPRKRRGVRFGLPVVLGLALLAVACGAVGFAEIAEVAADLDPELTAA 90 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH--------------SSDDKDVIA 107 P T RV+ P E W + VI+ Sbjct: 91 FGLVRCAPSAATFRRVLCTTDPTALDEALCRWAQARARPDAAGQPPPAPADGQRRVRVIS 150 Query: 108 IDGKTLRHSYDKSRRRG--AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML--- 162 DGKT+R + ++ V+ V+ ++ +EI A+ ++ L Sbjct: 151 ADGKTMRGARRRTGDGKIAQDQVVEILDHASGAVVACEPVND-GDEIGAVRTVMGRLADR 209 Query: 163 --DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH 220 + G ++ TDA Q + E++ GG +L VK Q R+ P ++ + Sbjct: 210 WGSLAGVVVVTDAKHTQHKLVEQVGAVGGWWLLPVKANQPRILAKVR-ALPWAQVRAQD- 267 Query: 221 DSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSF--RSIIAEQKKEPE 278 K+HGR E R V P G ++ R Sbjct: 268 ---TCRGKAHGRAETRTVRVVQAP---THVDLALAGTAQVIKITRHTRRRPHPGAPAAST 321 Query: 279 MTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSG 335 Y ++S A +R+HW +EN++HW D +ED R GN + Sbjct: 322 RENAYLLTSLPAEVADPATLAAMVRSHWLIENQVHWVRDTAYDEDRHTARTGNGPINLAC 381 Query: 336 IRHIAINILTNDKVFKAG 353 +R+ AI Sbjct: 382 LRNTAITRHRAHGASNIA 399 >UniRef50_A8S043 Putative uncharacterized protein n=1 Tax=Clostridium bolteae ATCC BAA-613 RepID=A8S043_9CLOT Length = 397 Score = 243 bits (619), Expect = 1e-62, Method: Composition-based stats. Identities = 89/363 (24%), Positives = 144/363 (39%), Gaps = 48/363 (13%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHE 88 +L+ + G + +THL+ L+++ + GI TI R++ I Sbjct: 1 MLVCVTLGFLCGRTTIRRSLKWCKTHLEELRKHMKLKYGIASPSTITRMLCGIDEELALY 60 Query: 89 CFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEK 148 F+ W+ + S + +A+DGK L + +K++ +++ T+ L++ Q+ D K Sbjct: 61 AFMEWVGEIVDSRN-THLAVDGKALCGATEKTKGETTPMLLNVVETVRGLMLAQLPVDSK 119 Query: 149 SNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEE 208 +NEIT IPELL +LDI G I+T DA+G Q I E+I +QGG + VK Q + Sbjct: 120 TNEITVIPELLKLLDISGSIVTIDAVGTQTAIMEQIHEQGGHFALTVKKNQPEAYEEIHT 179 Query: 209 KFPLKELNNPE-----------------HDSYAISEKSHGREEIRLHIVCDVPDELIDFT 251 E + + ++ EK+ R E R +C L Sbjct: 180 FMDKLEAADVQRKKGEVLDSGMREYLEKYEEIIRIEKNRDRNEYRTCQICKDASNLTKSQ 239 Query: 252 FEWKGLKKLCVAVSFRSIIAEQ----------------------------KKEPEMTVRY 283 EW ++ + R + ++ Sbjct: 240 KEWPHVQSIGRIKQVRIPSEKDSHGNDVTPSKEEFLEKGSRRVPAPSAEEGTGKDVQCTA 299 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINI 343 IS LTAE+ + R HW +EN+LH LD ED ++ S IR A NI Sbjct: 300 LISDLILTAEELGSIKRMHWSIENRLHHVLDDTFREDRSPAKKSR--NNLSLIRKYAYNI 357 Query: 344 LTN 346 L Sbjct: 358 LRL 360 >UniRef50_B1X344 Transposase n=1 Tax=Cyanothece sp. ATCC 51142 RepID=B1X344_CYAA5 Length = 255 Score = 242 bits (617), Expect = 2e-62, Method: Composition-based stats. Identities = 90/247 (36%), Positives = 141/247 (57%), Gaps = 3/247 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L++H + D R +HKL DI+++ +CA+I GA+ + +E +G + ++LKQ+ + E Sbjct: 9 LIDHFEKLTDPRVERTKDHKLIDIIVIALCAMICGADSFVAMETYGNSKYEWLKQFLELE 68 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 NGIP HDT ARV + I P +F + F +W+ DV+ IDGKT++HS +K + A Sbjct: 69 NGIPSHDTFARVFARIDPDEFEQYFRDWVSSITELMPGDVVNIDGKTVKHSVNKVEGKKA 128 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 IH+++A+++ LV+ Q K E++ EITAIP L+ +L++ G ++T DAMG Q DIAE + Sbjct: 129 IHLVNAWASEQRLVLAQQKVHERTKEITAIPHLIKVLELNGCLVTIDAMGTQTDIAELLH 188 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 +G DY A+KG Q L + +E F E EH + EK R E+ + Sbjct: 189 SKGADYCLALKGNQRGLFQEVKEVFDNAQQTEWIGIEHSFHRTVEKRTARAEVSSAYRTE 248 Query: 243 VPDELID 249 Sbjct: 249 QERLWSH 255 >UniRef50_A8LG82 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LG82_FRASN Length = 395 Score = 240 bits (613), Expect = 5e-62, Method: Composition-based stats. Identities = 90/386 (23%), Positives = 156/386 (40%), Gaps = 29/386 (7%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDF 57 ++ L+ + I D R+ + LS +L + A ++GA G +I DFG+ L Sbjct: 21 DISGLLAMLGGITDPRKARGKIYSLSFMLASALVATLAGATGLREIGSRVADFGQDLLAR 80 Query: 58 LKQ---YGDFENGIPVHDTIARVVSCISPAKFHECFINW--MRDCHSSDDKDVIAIDGKT 112 L + P I + + A F W + V+A+D K Sbjct: 81 LGAPFDHFTGRYRAPSEKAIRALFEKMDVAAVDAAFGAWLFAHAVWEPGEDIVLAMDVKV 140 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKII-T 170 LR ++ + +R + +SA LV GQ++ + +NEIT + LL L DI G ++ T Sbjct: 141 LRGAWSEGNKRVTL--LSAMVHAKGLVAGQVRVPDGTNEITQVAALLENLPDISGPVVAT 198 Query: 171 TDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSH 230 DA+ Q + A + + G DY VKG Q L + F + + + E+ H Sbjct: 199 LDAVHTQHETAFLLVEHGIDYALTVKGNQPTLY---RKTFEQTLPLLQKPPQHEVEERGH 255 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD- 289 GR + + + + V R + + ++S Sbjct: 256 GRIKKWQAWTTEAKG------IGFPEVATAAVIR--RDEFDLKGIRVSREYAHILTSVAG 307 Query: 290 --LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND 347 TA IR HW +EN++H+ D ED + GN+ + R++AI I+ + Sbjct: 308 NRATAAYIHRLIRGHWGIENEIHYPRDTAWREDANQTHTGNSPHTLASFRNLAIGIIRRN 367 Query: 348 KVFKAGLRRKMRKAAMDRNYLASVLA 373 + K ++ + A DR+ + +LA Sbjct: 368 GIRK--IKETLEYIAGDRDRVLPLLA 391 >UniRef50_Q1VRU5 Transposase (Fragment) n=3 Tax=Psychroflexus torquis ATCC 700755 RepID=Q1VRU5_9FLAO Length = 223 Score = 235 bits (598), Expect = 3e-60, Method: Composition-based stats. Identities = 87/207 (42%), Positives = 133/207 (64%), Gaps = 1/207 (0%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 +KL IPD+R++ K + L ILL+ I +VI GA+ W ++E++ + +FL+ + D Sbjct: 5 QKLKTIFGQIPDFRRSHKQLYDLESILLIGIISVICGADSWNEMENYANSKEEFLRSFLD 64 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NGIP HDT RV S I +F +CFI W+ +++IAIDGKT+R + ++ Sbjct: 65 LPNGIPSHDTFNRVFSNIDSNQFEKCFIQWVSALAQLQPREIIAIDGKTIRGA-KAGGKK 123 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEK 183 +H++SA++ ++LV+GQ+K KSNEITAIP+LL +L I+ I+T DAMGCQ IA+ Sbjct: 124 SPVHMVSAWANDNNLVLGQVKVSHKSNEITAIPKLLKVLSIENTIVTIDAMGCQTKIAKA 183 Query: 184 IQKQGGDYLFAVKGTQGRLNKAFEEKF 210 I K+ DY+ AVK Q +L + E++F Sbjct: 184 IVKKNADYILAVKENQPQLLEHIEDEF 210 >UniRef50_Q2JDS4 Transposase, IS4 n=3 Tax=Frankia sp. CcI3 RepID=Q2JDS4_FRASC Length = 433 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 81/398 (20%), Positives = 156/398 (39%), Gaps = 31/398 (7%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 E++ L + ++ +PD R + H+L IL L+ AV +G + E+I + + Sbjct: 38 EVQGLADVLAGVPDPRDPRGIRHRLPVILGLSAAAVAAGEKSVEEIAAWAAHAPTQVLTA 97 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD---VIAIDGK 111 P DT+ RV+S + + + V+A+DGK Sbjct: 98 LGARVHPVTGQPQAPSVDTMIRVLSAVDSSALARAVGMFAAARARQARGGGRRVVAVDGK 157 Query: 112 TLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML----DIKGK 167 TLR + R A H+++ +V+ + + K+NE+TA LL L + G Sbjct: 158 TLRGAAGPEGR--APHLLAVAEHGTGVVLAEHEVGAKTNEVTAFAPLLRELHSHDPLDGV 215 Query: 168 IITTDAMGCQKDIAEKIQ-KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAIS 226 ++T DA+ + A+ I + G ++F VK L+ + ++ ++ Sbjct: 216 VVTADALHTTRAHADLIVTELGAHFVFTVKANTPALSVDCHQATDWTKIPI----GHSAE 271 Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMT-----V 281 ++HGR E R + + + + + ++ V + T Sbjct: 272 GRAHGRFERRTIQLAQASEAIRARYPHARTVARIRRHVRRTVTTGTGRARVTRTIPSTVT 331 Query: 282 RYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 + ++S L A R HW +ENK+HW DV ED ++R G + + +R+ Sbjct: 332 VHVLTSLTLDAVTPADLAGYARGHWTIENKVHWVRDVTFREDASRVRTGPLPRIMTTLRN 391 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 + I ++ + + + D L ++L Sbjct: 392 LIIGLIRLAGHNRIAPTIRRIRH--DNALLLAILTLDN 427 >UniRef50_C6LM02 IS1548, transposase (Fragment) n=7 Tax=Clostridiales RepID=C6LM02_9FIRM Length = 239 Score = 232 bits (592), Expect = 1e-59, Method: Composition-based stats. Identities = 89/240 (37%), Positives = 135/240 (56%), Gaps = 8/240 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++L+E + I D RQ KV H L DIL++ + A ++ A+ W ++ F E + D+L++Y Sbjct: 1 MQELLEWMEYIEDSRQQSKVRHTLKDILVIVLFATLANADDWVEMALFAEDYQDYLRKYI 60 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHSYDK 119 + +NG P HDT+ RV+ +SP + + W + ++ K +I IDGKT+R +K Sbjct: 61 ELKNGPPSHDTLRRVMGMVSPEILQQLYGKWQERLNRNEGELLKKIICIDGKTMRS--NK 118 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 H++SA+S +GQ EKSNEITAIPELL + +KG+I+T DAMG Q Sbjct: 119 RNGEKPGHIVSAWSKEDGYCLGQKAVGEKSNEITAIPELLEKIQVKGQIVTIDAMGTQTA 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN---NPEHDSYAISEKSHGREEIR 236 IAEKI+ + DY+ ++K QG L + E F E EK+HG+ E R Sbjct: 179 IAEKIRNKRADYVLSLKANQGTLYEDVREYFEDPEFQKEIKERGIYKKTQEKAHGQIETR 238 >UniRef50_Q2JA58 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JA58_FRASC Length = 401 Score = 230 bits (587), Expect = 5e-59, Method: Composition-based stats. Identities = 79/383 (20%), Positives = 147/383 (38%), Gaps = 17/383 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L+ + +PD+R V ++L+ +L L + I+G + + ++ + Sbjct: 24 QVAGLVAVLRRVPDWRDPRGVRYELAPVLALWVAGNIAGHDTTVAVWEWACALPVGVLAG 83 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS----SDDKDVIAIDGKTLRHSY 117 F +P TI R+V P + + W +A DGK ++ + Sbjct: 84 LGFPRRVPSERTIRRIVEEGPPGQLDQALSGWTAAVAPGPADPGGLVAVAFDGKVMKGAR 143 Query: 118 DKSRRRGAIH--VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + V+ A +G + +EI ++ L+N + ++TTD + Sbjct: 144 SRPPQGSVRQEAVVEAVRHDTGTALGHQRVVA-GDEIASVRRLVNRVCDHNTLVTTDCLH 202 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 + +A I+ +GG +LF++KG Q + P E N + EK+HGR E Sbjct: 203 AHEPLARAIRAKGGHWLFSIKGNQPTVRAKL-AGLPWDEFGN----QHVTREKAHGRIEE 257 Query: 236 RLHIV-CDVPDELIDFTFEWKGLK-KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE 293 R L+ F + +K + E + +S+ + Sbjct: 258 RALKALTPSAPSLVGFRGTRQVVKLARTTRRKKTTTSPAATSTEEFYLVTSLSTDQASPA 317 Query: 294 KFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAG 353 + A R HW VE +H D M+ED IR NAA ++ R I+ L Sbjct: 318 QLARWARGHWTVEA-IHHVRDRTMDEDRHTIRTKNAALNWAIARDTTISALRLAGYKN-- 374 Query: 354 LRRKMRKAAMDRNYLASVLAGSG 376 +R+ R D + ++A + Sbjct: 375 IRQARRATIRDPGLVLQIIALTS 397 >UniRef50_A4X3L9 Transposase, IS4 family n=5 Tax=Actinomycetales RepID=A4X3L9_SALTO Length = 395 Score = 227 bits (579), Expect = 4e-58, Method: Composition-based stats. Identities = 83/381 (21%), Positives = 153/381 (40%), Gaps = 25/381 (6%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDF------ 57 L+ ++ +PD R V H L +L + AV++GA + ++ Sbjct: 27 SSLVTALAAVPDRRDPRGVVHALPAVLATAVAAVLTGARSAAAVAEWAADAPQQVLTELG 86 Query: 58 -LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS--DDKDVIAIDGKTLR 114 + + P T R+++ + + W+ C + + V ++DGKTLR Sbjct: 87 VFRDPFTGVHRAPDESTFRRILAGVDADALDDTVGRWVLACQPAATTGRRVYSVDGKTLR 146 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 S +H+++ V+GQ+ D K+NE+T LL LD+ ++T DA+ Sbjct: 147 GS---GPAGEQVHLLAVLDQHTGTVLGQVDVDGKTNELTRFQPLLGPLDLTAVVVTADAL 203 Query: 175 GCQKDIAE-KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGRE 233 Q++ A + + Y+F VK Q RL + + P ++ + S + HGR Sbjct: 204 HTQREHARWLVDTKKAAYVFTVKKNQPRLYRQLKT-LPWTKIPIQD----ETSTRGHGRY 258 Query: 234 EIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKK-EPEMTVRYYISSADLTA 292 +IR L ++ + R +A + + +S+A Sbjct: 259 DIRRLQAVTCTGPL---ALDFPHAVQALRIRRRRLNLATGRWSTVTVYAITNLSAAQAGP 315 Query: 293 EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 + A +R HW +E LH D ED ++R GNA + +R+ AIN+L + Sbjct: 316 AELADWLRGHWAIET-LHHIRDTTYAEDASRLRTGNAPRAMATLRNTAINLLRLTGI--T 372 Query: 353 GLRRKMRKAAMDRNYLASVLA 373 + +R + + +L Sbjct: 373 TIAAALRHNSRNPYRPLQLLG 393 >UniRef50_A0PKM2 Transposase for IS2404 n=187 Tax=Mycobacterium RepID=A0PKM2_MYCUA Length = 397 Score = 227 bits (579), Expect = 5e-58, Method: Composition-based stats. Identities = 85/338 (25%), Positives = 135/338 (39%), Gaps = 22/338 (6%) Query: 28 DILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A +G G+ + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAAGMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALR--AKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGTQGRL 202 EKSNEI + LL +L ++T DAM Q A+ I YL VK Q ++ Sbjct: 120 VAEKSNEIPCVCALLTLLPGSLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 P E+ D + HGR E R + + + K++ Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVETRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFRSIIAEQKKEPEMTVRYYISSADLTAEK---FATAIRNHWHVENKLHWRLDVVMNE 319 R I A + + V Y I S + T +R H +EN LHW DV +E Sbjct: 230 ITRERLITATD--QRSVEVVYAICSLPFEHARPTAIMTWMRQHCRIENSLHWIRDVTFDE 287 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRK 357 D + GN A++ + +R+ AIN+ + + Sbjct: 288 DRQRAHTGNGAQVLATLRNTAINLHRLNGADNIAEACR 325 >UniRef50_C7CN18 Putative uncharacterized protein n=1 Tax=Methylobacterium extorquens DM4 RepID=C7CN18_METED Length = 319 Score = 223 bits (568), Expect = 8e-57, Method: Composition-based stats. Identities = 83/273 (30%), Positives = 135/273 (49%), Gaps = 9/273 (3%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L+E + + D R K+EH+L DIL++ +CAV++ AE +EDI +G +L + Sbjct: 2 IEGLVEQFATLEDPRCPGKIEHRLVDILVIAVCAVVAEAETFEDIALYGRCKEAWLCGFL 61 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD------KDVIAIDGKTLRHS 116 D GIP HDT RV I P F CF+NW R + + IA+DGK +RHS Sbjct: 62 DLPGGIPSHDTFRRVFMLIDPDAFERCFLNWARAVFRTQGDDEPAEPEQIAVDGKLVRHS 121 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D+ R +H++SA++T LV+ Q D K E A+P +L L + G +++ DA+ C Sbjct: 122 FDRRHGRSPLHLVSAYATGRGLVLAQHAVDHKGGEPAALPVVLEGLHLDGCLVSLDALSC 181 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE--HDSYAISEKSHGREE 234 ++++A+ I +G YL +K Q +++ F + + +HGR Sbjct: 182 RREVADHILSRGAYYLLTLKANQSKIHDEVRTWFAGNAFAHAADLRPCADAFDDTHGRLV 241 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFR 267 R C W GL + + + R Sbjct: 242 RRRVFACPDAGCFT-TLRGWPGLTTVLASETIR 273 >UniRef50_D2JTK3 Transposase n=13 Tax=Rhizobiaceae RepID=D2JTK3_RHIET Length = 342 Score = 221 bits (563), Expect = 3e-56, Method: Composition-based stats. Identities = 83/324 (25%), Positives = 134/324 (41%), Gaps = 27/324 (8%) Query: 50 FGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAID 109 FG + +LK GI H T + V C++ F ++ Sbjct: 43 FGVSKKKWLKTIVPLPYGIAGHGTFSTVFRCLNQVAFEAALPKPLQRA------------ 90 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKII 169 K S + + ++ ++ LVIGQ + NE+ + L +L ++G I+ Sbjct: 91 WKAWWRSTARRYEAPPLPLVKVWAAGCGLVIGQQTAPGR-NEVQGALDALALLSLEGAIV 149 Query: 170 TTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKS 229 T DA+ C+ D A I GGDY A+K Q L + E + +E Sbjct: 150 TADALHCRADTAHAILSAGGDYALALKANQPGLLAQTIARLDDVEPLGVQ----TAAEND 205 Query: 230 HGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD 289 H R E R + V D ++ GL+ + + VRY++ S Sbjct: 206 HDRCERRRACIVAVND------IDFPGLQAIGSVEA---TSRHADGRLTSHVRYFLLSTI 256 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 ++A R HW +ENKLHW LDV ED + R+ + + +R IA+N++ Sbjct: 257 MSAAALIEVTRTHWQIENKLHWVLDVQFREDAARNRKDHGPANIALLRKIALNLIRAHPD 316 Query: 350 FKAGLRRKMRKAAMDRNYLASVLA 373 KA +RRK++ A D +L S++A Sbjct: 317 -KASIRRKIKNAGWDDQFLISIIA 339 >UniRef50_C3R476 Transposase (Fragment) n=6 Tax=Bacteroides RepID=C3R476_9BACE Length = 249 Score = 217 bits (551), Expect = 7e-55, Method: Composition-based stats. Identities = 87/249 (34%), Positives = 126/249 (50%), Gaps = 14/249 (5%) Query: 68 IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YDKSR 121 IP HDT R S I P F F NW++ V+AIDGK +R + + Sbjct: 4 IPSHDTFNRFFSIIKPEYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHTTGK 62 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIA 181 + ++SA+S + + +GQ+K D+KSNEITAIP L+N L++ G I+T DAMGCQKDI Sbjct: 63 EGFKLWMVSAWSAANGISLGQVKVDDKSNEITAIPLLINSLELSGCIVTIDAMGCQKDIT 122 Query: 182 EKIQKQGGDYLFAVKGTQGR---LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 + I + +Y+ A+K + + L K + + K+ + HGR E R Sbjct: 123 QTIIEHDANYIIAIKENKKKNYQLAKQIIDDYQDKDEIINRVTRHVSENTGHGRIETRTC 182 Query: 239 IVCDV-PDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFA 296 V F + GLK + S R+I+A E VRYY++S D T E+ A Sbjct: 183 TVVSYGSIMEKMFKKKLVGLKSIVGIKSERTIVAT--GEYTQEVRYYVTSLDNTKPEEIA 240 Query: 297 TAIRNHWHV 305 +AIR HW + Sbjct: 241 SAIRQHWSI 249 >UniRef50_D2BC62 Putative uncharacterized protein n=2 Tax=Streptosporangium roseum DSM 43021 RepID=D2BC62_STRRD Length = 437 Score = 215 bits (547), Expect = 2e-54, Method: Composition-based stats. Identities = 79/418 (18%), Positives = 142/418 (33%), Gaps = 62/418 (14%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVIS-GAEGWEDIEDFGETHLDF------ 57 L++ ++I D R T H L+ IL + CA ++ G + IE + + Sbjct: 29 DLIDRFTLISDPRSTRGRRHCLASILAIVACATVAVGGDCLTAIEQWADNAPQHILADLH 88 Query: 58 -LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD------------ 104 + + P TI RV++ + + C ++ + Sbjct: 89 IWRDPFTGLHRPPSERTIRRVLAALDGDELDACLTTFLNRPPGLPAAETHDRLPAPASRR 148 Query: 105 ---------------------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQI 143 A+DGK L+ + R +H+IS + + + V Q Sbjct: 149 TEREARRAAHRSPTPAPGLLPAYAVDGKRLKGARHPDGGR--VHLISLAAHLDATVHAQR 206 Query: 144 KTDEKSNEITAIPELLNM---LDIKGKIITTDAMGCQKDIAEK-IQKQGGDYLFAVKGTQ 199 + KS+EI A+ LL D+ G +IT DA+ Q+ A I++ Y+ VK Q Sbjct: 207 QIPAKSSEIGALDALLRQAGGTDLAGAVITADALDTQRASARLLIEEHHAHYVMIVKANQ 266 Query: 200 GRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 L+ + + ++ + + HGR E R+ ++ + Sbjct: 267 PTLHATAITALTGTD-TDFAAVTHRETHRGHGRTEYRILRTAPA------DGIDFPYAAQ 319 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHW-HVENKLHWRLDV 315 + + R V Y I+ A +R HW +EN +H DV Sbjct: 320 VFRVLRHR--GGLDGIRHSKEVCYGITDLTARQAGPAHLAAYVRGHWKAIENGVHHVRDV 377 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 ED C+ R + R++A L + R+ D + + Sbjct: 378 TFAEDACQARTATLPRALAAFRNLATGTLRRAGHVN--IAHARREHGYDHQRVLDLFN 433 >UniRef50_Q2JE04 Transposase, IS4 n=4 Tax=Frankia RepID=Q2JE04_FRASC Length = 404 Score = 211 bits (538), Expect = 2e-53, Method: Composition-based stats. Identities = 73/389 (18%), Positives = 135/389 (34%), Gaps = 46/389 (11%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAV-ISGAEGWEDIEDFGETHLDFLKQYG 62 + E ++ IPD+R + + L + + +CAV +G + + ++ + Sbjct: 22 AGIWERLAAIPDHRSARGLVYPLPVLAAVWLCAVTAAGHDRVAAVTEWLAATSWTERVRL 81 Query: 63 DFEN------GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD--------------- 101 +P TI R ++ + ++ +D Sbjct: 82 RLPWNPWDGHLLPDEATIRRFLNTVDDQALATALLDPPLADTPADLTDAVPSAPVRPPAG 141 Query: 102 ----DKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 A+DGKT R + + +H++ + ++GQ + D KSNE T Sbjct: 142 DQAVPVRAYAVDGKTSRGAKRADGSQ--VHLLGVAAHGAGALLGQREIDAKSNETTEFRA 199 Query: 158 LLNMLDIKGKIITTDAMGC-QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN 216 LL L++ G ++ DA+ + ++ + ++ YL K Q +L P E+ Sbjct: 200 LLAPLELAGAFVSFDALHTVRSNLDWLVVRKNAHYLAVAKHNQPKLRAFL-AALPWTEIP 258 Query: 217 NPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 + ++ HGREE R V V ++ + +R ++ + Sbjct: 259 TADL----TRDRGHGREETRTLKVATVT------HLDFPHAAQAIRIRRWR---RQKGQP 305 Query: 277 PEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELF 333 Y I+ A A R WH+E K H+ DV ED R G + Sbjct: 306 ASHETIYAITDATADQASPALLADLARGQWHIEVKQHYVRDVTFGEDSSTSRTGRGPAVL 365 Query: 334 SGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + R + L R+ K A Sbjct: 366 ALFRATVADTLRRAGHRSVPACRRAHKTA 394 >UniRef50_A6KWS0 Transposase n=6 Tax=cellular organisms RepID=A6KWS0_BACV8 Length = 240 Score = 211 bits (536), Expect = 4e-53, Method: Composition-based stats. Identities = 75/237 (31%), Positives = 119/237 (50%), Gaps = 7/237 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +++ I D R K HK+ I+ ++I AVI GA+ W +IE+FG + F K Sbjct: 4 GIIDLCKQIEDPRMNRKKVHKMETIIYISIAAVICGAQSWNEIEEFGNAKIAFFKSRIPS 63 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS------YD 118 IP HDT R S I P F F NW++ V+AIDGK +R + Sbjct: 64 LEFIPSHDTFNRFFSMIKPDYFELIFRNWVKQVCQEVKG-VVAIDGKLMRGPSQCDGEHT 122 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 + + + ++SA+S + + +GQ+K D+KS+EITAIP L+N L++ G I+T DAMGCQK Sbjct: 123 RGKEGFKLWMVSAWSAANGISLGQVKVDDKSSEITAIPLLINSLELSGCIVTIDAMGCQK 182 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 DI + I +Y+ A+K + + + ++ + + + R Sbjct: 183 DITQTIIGHDANYIIAIKENKKKKYQPAKQIIDDYQDRDEIINRVIRHVSEKCRTWK 239 >UniRef50_O52240 Transposase n=3 Tax=Streptomyces RepID=O52240_STRSC Length = 431 Score = 206 bits (524), Expect = 1e-51, Method: Composition-based stats. Identities = 83/404 (20%), Positives = 141/404 (34%), Gaps = 61/404 (15%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVI-SGAEGWEDIEDFGETHLDFLKQ 60 +++ L+ + D R V +++S +L L +CA+ +G + ++ Sbjct: 30 QVRGLVAEFESVTDPRGACGVRYRVSSLLALVVCAMTPAGHDSITAAAEWCRRATPEELA 89 Query: 61 YGDFEN-------GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD----------- 102 IP T+ V+ + P + + +R S+ Sbjct: 90 AFGLPYHPLRGRYRIPSEKTLRTVLGRLDPGEVSAAGYDHLRPLLSTVSHSPEPLMPDGG 149 Query: 103 ---------------------KDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG 141 + IA+DGK LR + R + V+SA + + Sbjct: 150 IEREQRRAHRAAARAEPVRSRRRAIAVDGKCLRSAKRPDGSR--VFVLSAVRHGDGITLA 207 Query: 142 QIKTDEKSNEITAIP---ELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 + K+NEI + L+ D+KG ++T DA+ Q+D A + ++G YL +K Sbjct: 208 SREIGAKTNEIPEFQPLLDQLDDADLKGAVVTADALHAQRDHATYLHERGAHYLLTIKNN 267 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 Q P KE+ D + HGR E RL V V L + Sbjct: 268 QRG-QARQLHALPWKEIPVIHRD----DARGHGRHEQRLVQVVTVNGLL------FPHAA 316 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFAT---AIRNHWHVENKLHWRLDV 315 ++ R + +K Y I+ A R HW VEN +HW DV Sbjct: 317 QVLRIQRRRRLYGAKKW--SSETVYAITDLPAEEASAAEIASWARGHWTVENTVHWCRDV 374 Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 NED ++R N + + +R + L R+ Sbjct: 375 TFNEDKSQVRTHNTPSVLAAVRDLIRGALKLAGYVNTAAGRRAH 418 >UniRef50_UPI00016ACDF9 transposase, is4 family protein n=2 Tax=Burkholderia thailandensis MSMB43 RepID=UPI00016ACDF9 Length = 223 Score = 203 bits (517), Expect = 6e-51, Method: Composition-based stats. Identities = 77/225 (34%), Positives = 105/225 (46%), Gaps = 9/225 (4%) Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 AIPELL LD++G +T DA+G Q IA I + G DY+ AVK Q RL + + F Sbjct: 1 AIPELLAALDLQGATVTIDAIGTQLGIAHTIVEAGADYVLAVKDNQPRLAEGVRQWFEAA 60 Query: 214 ELNNPEHDSYAISE--KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIA 271 E + +E K HGR E R+ V + L W GL++L + R I Sbjct: 61 HDGKLEGSYWEHTEHDKGHGRLETRVCRVSEDVAWLASTGQHWAGLQRLVMLERTRQI-- 118 Query: 272 EQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 ++ YYISS + A + A IR HW +EN+LHW LDV ED IR AA Sbjct: 119 --GQKVTTERCYYISSKAVKAAQMAQLIRAHWGIENQLHWVLDVSWGEDASLIRDTVAAR 176 Query: 332 LFSGIRHIAINILTND---KVFKAGLRRKMRKAAMDRNYLASVLA 373 + +R I +N+ + K L+ AA D +L Sbjct: 177 NMASLRKITLNLARLAQNRQPKKVSLKNIRNLAAWDTAMRDDILG 221 >UniRef50_UPI00016C4673 transposase, IS4 n=2 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4673 Length = 232 Score = 201 bits (512), Expect = 3e-50, Method: Composition-based stats. Identities = 87/237 (36%), Positives = 117/237 (49%), Gaps = 9/237 (3%) Query: 143 IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRL 202 + T++KSNEITAIP LL L+ K ++T DAMGCQKDIA I GGD++ AVK Q +L Sbjct: 1 MATEDKSNEITAIPVLLGQLERKKAVVTIDAMGCQKDIARDIVAGGGDFVIAVKDNQPKL 60 Query: 203 NKAFEEKFPLK---ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKK 259 A EL H +Y HGR + R H V VP EW +K Sbjct: 61 AAAIASVVEKHLEGELQARRHRNYQTDTHGHGRRDERFHWVAQVP-PGFAAKGEWPWIKA 119 Query: 260 LCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + AV + VRYY+ S L+ ++F +R HW +E+ +HW LDV E Sbjct: 120 IGTAVRITTH---ADGTQSDEVRYYMLSRFLSGKRFGEVVRGHWGIES-MHWVLDVTFGE 175 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 D + R+ A S +R AI +L K +R KM + MD ++L VL G Sbjct: 176 DRTRTRQRVLANNLSWVRRFAITLLKRHP-EKDSIRGKMIRCLMDTSFLNEVLTLQG 231 >UniRef50_B7K570 Transposase IS4 family protein n=1 Tax=Cyanothece sp. PCC 8801 RepID=B7K570_CYAP8 Length = 222 Score = 200 bits (507), Expect = 9e-50, Method: Composition-based stats. Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 11/224 (4%) Query: 111 KTLRHSYDKSRRRGAIHVISAF---STMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGK 167 K + S + S S +LV+GQ K ++KSNEITAIP L+ ML+I+ Sbjct: 3 KGFQRSVKTEEKHKPSQKKSQVLKDSLSQNLVLGQKKVNDKSNEITAIPALIEMLEIESS 62 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF---PLKELNNPEHDSYA 224 IIT DAMGCQK+I I+K+ GDY+ +K Q L + +E F +E + EH Y Sbjct: 63 IITIDAMGCQKEITSLIRKKKGDYIITLKANQKSLRQEIKEWFKIAEAEEFKDREHSYYQ 122 Query: 225 ISEKSHGREEIRLHIVCDVPD-ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 E H R E R I V + W LK + + S R + + VR+ Sbjct: 123 EIETGHHRIEKREVIAVSVSSLPCLHNQDLWTELKTVVMVKSERRLWN----KTTTEVRF 178 Query: 284 YISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRG 327 YISS + ++K ATAIR+HW +EN LHW LDV +ED +IR Sbjct: 179 YISSVEKNSQKIATAIRSHWEIENSLHWTLDVTFSEDKSRIRTR 222 >UniRef50_A3WIX0 Putative H repeat-associated protein n=2 Tax=Idiomarina baltica OS145 RepID=A3WIX0_9GAMM Length = 225 Score = 194 bits (492), Expect = 5e-48, Method: Composition-based stats. Identities = 105/219 (47%), Positives = 139/219 (63%), Gaps = 13/219 (5%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L L +H + + D RQ KV +KL D+L L + AVISGAEGWE+IEDFG L +LK+ Sbjct: 1 MSLTLLTDHFADVEDPRQASKVTYKLFDVLFLNLTAVISGAEGWEEIEDFGHLRLKWLKK 60 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 YGDF +GIPVHDTIAR+V I P FH+ FI WM+ DK V+A+DGKTL Sbjct: 61 YGDFSHGIPVHDTIARLVCRIDPQTFHQRFIQWMQATEKLTDKQVVAVDGKTL------- 113 Query: 121 RRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 H+ISAF+T + +V+GQ +TDEKSNEITA+PELL +L+++G ++T DAM CQK I Sbjct: 114 ------HMISAFATKNGVVLGQRRTDEKSNEITAVPELLELLELEGAMVTLDAMSCQKQI 167 Query: 181 AEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE 219 + I K+ DY AVK + + E + + Sbjct: 168 VKTIVKKKADYCIAVKKIKSPYIRHSRMHLSSVEATSQD 206 >UniRef50_A1WM70 Transposase, IS4 family n=1 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WM70_VEREI Length = 212 Score = 181 bits (460), Expect = 3e-44, Method: Composition-based stats. Identities = 81/215 (37%), Positives = 117/215 (54%), Gaps = 11/215 (5%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ +PD R+ + H+L ++LL IC VISGAE W + + + LD+L+ Y + Sbjct: 7 SLLTAFDDLPDPRR-RECPHRLDELLLAAICGVISGAESWTSVVQWSQMKLDWLRHYLPY 65 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 +GI HDT RV S + ++F CF+ W+ S + +AIDGK LR S+D R Sbjct: 66 AHGIASHDTFGRVFSLLDASRFEHCFMRWIGGLCPSLEGQHVAIDGKCLRGSHD--GARS 123 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+G IT DAMGC A Sbjct: 124 PIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIRGSTITIDAMGCHGMPARHR 183 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPE 219 + RL E + +P+ Sbjct: 184 RADCSARC--------RLRAECEGQSAESCRGHPD 210 >UniRef50_B0QXL9 Transposase IS4 family protein (Fragment) n=4 Tax=Haemophilus parasuis 29755 RepID=B0QXL9_HAEPR Length = 189 Score = 174 bits (441), Expect = 5e-42, Method: Composition-based stats. Identities = 55/194 (28%), Positives = 85/194 (43%), Gaps = 7/194 (3%) Query: 182 EKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH-DSYAISEKSHGREEIRLHIV 240 EKI ++ GDY+ +K + E F + PE +++ R + R + Sbjct: 1 EKIIEKKGDYVMPLKKNHRQFQSEVEAYFHKISRDCPEMLETFEEVNAERSRIDERYYRK 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIR 300 V D L EWKG+K + RS + +YISS D+ + A +R Sbjct: 61 LKVSDWLSK-AEEWKGIKSVLEVCRKRS----DNGKESQEKVFYISSLDVDVQILAKCVR 115 Query: 301 NHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRK 360 HW VENK HW LDVV ED+C + AE + +R +A+N+ K ++ K+ Sbjct: 116 GHWEVENKAHWVLDVVYKEDECAVTDEWGAENLAILRRLALNLARLHP-KKQSMKGKLTA 174 Query: 361 AAMDRNYLASVLAG 374 A + +L G Sbjct: 175 AGWSDEFRDELLLG 188 >UniRef50_UPI00016C35D6 transposase n=7 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D6 Length = 232 Score = 173 bits (437), Expect = 1e-41, Method: Composition-based stats. Identities = 64/229 (27%), Positives = 103/229 (44%), Gaps = 5/229 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD R ++ L +L L + AV+ G E I FG L F Sbjct: 4 SLVERLAELPDPRSRHGRQYPLVGLLTLCLVAVMGGHTTPEAISQFGRLRQKRLGHALGF 63 Query: 65 ENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 NG +P +TIA ++ + P + W+RD H D + +A+DGK L S D + Sbjct: 64 RNGNMPCPNTIAGLLRRLDPDRLDAIIGAWLRDRHP-DGWEHLALDGKRLCGSRD--GQV 120 Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLD-IKGKIITTDAMGCQKDIAE 182 H+++A++ S V+ Q+ + +NE A LL +L + G ++T DA+ Q D+ Sbjct: 121 PGTHLLAAYAPQVSAVVAQMTVEATTNEHKAALRLLGVLPSLGGTVVTGDAIFTQPDVCA 180 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHG 231 +Q +GGD + K QG L E F + G Sbjct: 181 AVQHKGGDSILYAKSNQGTLRADLEAAFATAAGGDFSPRVTGRVGSGRG 229 >UniRef50_A1ZVQ9 ISPg2, transposase, putative n=4 Tax=Microscilla marina ATCC 23134 RepID=A1ZVQ9_9SPHI Length = 271 Score = 172 bits (436), Expect = 2e-41, Method: Composition-based stats. Identities = 70/273 (25%), Positives = 108/273 (39%), Gaps = 13/273 (4%) Query: 58 LKQYGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 L + D + + ++ + F S +K + DGK LR S Sbjct: 8 LCAFLDIPETTVVSRSHLPVLLQKVDVEVFDYLLFTHYGFRLDSQEKQWFSGDGKELRGS 67 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMG 175 + ++RG V+ I Q D K +EI + LL+ D+ + IT DA+ Sbjct: 68 IESGKKRGQA-VVQIVHHHSGEAIAQNYYDGQKESEIPTLRALLSKDDLASQKITLDALH 126 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 E I K GG +L +K Q L + + P D + +HGR E Sbjct: 127 LCPSTTEMITKAGGVFLIGLKENQPTLLAH------MTDCALPPIDQKTTFDFNHGRVEQ 180 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + + DV + D ++ K+L R I ++ + V YYIS+ E Sbjct: 181 RKYWLYDVSKQGFDPRWDNTAFKRLVKVQRTR--INQKNAKISREVSYYISNETA-KEGI 237 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGN 328 A+RNHW VE H DV +NED K ++ Sbjct: 238 FDAVRNHWSVEVNNH-IRDVTLNEDQLKSKKRQ 269 >UniRef50_UPI0001909E25 transposase, IS4 n=1 Tax=Rhizobium etli CIAT 894 RepID=UPI0001909E25 Length = 273 Score = 172 bits (435), Expect = 2e-41, Method: Composition-based stats. Identities = 74/284 (26%), Positives = 119/284 (41%), Gaps = 15/284 (5%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L+ + + D R H L ++L L + A + GA+ ++ +F E + L++ Sbjct: 1 MSVLISILREVRDPRD-VNARHDLGELLFLALLATLRGAKTCVEMAEFSEARQEELREIV 59 Query: 63 DFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD----KDVIAIDGKTLRHSYD 118 +G P HDT +RV + P + F +M + K V+AIDGK+LR YD Sbjct: 60 ALRHGAPSHDTFSRVFRLLDPTELERAFGAFMTALRGALGLPAPKGVVAIDGKSLRRGYD 119 Query: 119 KSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 K R ++S + I ++ +EI A +L L +KG +T DA+ C Sbjct: 120 KGRAFMPPLMVSVWDVETRPSIAAMRAPG-GDEIKATLSVLKALTLKGCTVTADALHCHP 178 Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLH 238 +A+ + Y +K G L +A E F + + E+ HGREE R Sbjct: 179 AMAQALLAAKAQYALGLKANHGPLFRAAEAGFA----AVTDLAVFETRERGHGREEQRRA 234 Query: 239 IVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 V V + GLK + + R +PE VR Sbjct: 235 SVLPVDR--LVKRPSLPGLKAIGRIEAVR---TGANGKPEQAVR 273 >UniRef50_UPI00016C489D ISPg2, transposase n=12 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C489D Length = 354 Score = 169 bits (428), Expect = 1e-40, Method: Composition-based stats. Identities = 64/218 (29%), Positives = 99/218 (45%), Gaps = 3/218 (1%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + L E +S IPD R + H L +L L A++ G + I FG H L Sbjct: 1 MPARSLYEELSTIPDPRGLQGLIHPLPAVLGLVTLALLMGRTSLQGIARFGRQHGFPLAH 60 Query: 61 YGDFENG-IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK 119 F G P T++R + P + W+ + IA+DGKTLR S D Sbjct: 61 ALGFRRGKTPAASTLSRTLRRFDPQQLEAALTRWLDGRVGPVARTHIALDGKTLRGSRD- 119 Query: 120 SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 + H+++A++ V+ Q++ D K+NE A LL +L + G ++T DAM CQ+D Sbjct: 120 -GQVPGQHLVAAYAPAAHAVLAQVRVDAKTNEHKAALALLGILPVAGSVLTGDAMFCQRD 178 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 +A + G DY+ K Q L + E ++ Sbjct: 179 VAAAVIAGGADYVLVAKDNQPGLVASIEAGLGFEDAAR 216 >UniRef50_B6AN48 Putative uncharacterized protein n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AN48_9BACT Length = 454 Score = 168 bits (426), Expect = 2e-40, Method: Composition-based stats. Identities = 57/228 (25%), Positives = 102/228 (44%), Gaps = 14/228 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ LM+ +S D R+ + H ++ +CA++SGA + ++ + LK+ Sbjct: 221 DIQHLMDDLSRTSDTRKRRGIRHSQISLVATLVCAILSGACHTRAMAEWAANLSNSLKKR 280 Query: 62 GDFENGI-------PVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVIAIDG 110 F P T+ R + I + W + D V++IDG Sbjct: 281 LGFRRHPETKILIAPSEPTLRRTLQSIDVLEVDRVLSGWFKQVLPQCGLGGDVSVLSIDG 340 Query: 111 KTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIIT 170 K +R + + IH ++AF +V+ Q DEK+NEI + LL ++I+G+I+T Sbjct: 341 KAVRGASKAKGGQ-KIHALAAFLQNRGIVVAQKNVDEKTNEIPELRALLAPMEIEGQIVT 399 Query: 171 TDAMGCQKDIAEKIQK-QGGDYLFAVKGTQGRLNKAFEEKFPLKELNN 217 DA+ Q + A I + + DY+F VK Q + + E P + Sbjct: 400 ADALHTQTETARFITEDKKADYVFTVKKNQPTMFEDIES-LPWEAFPP 446 >UniRef50_B1TH11 Transposase IS4 family protein n=2 Tax=Burkholderia ambifaria MEX-5 RepID=B1TH11_9BURK Length = 186 Score = 166 bits (419), Expect = 2e-39, Method: Composition-based stats. Identities = 60/189 (31%), Positives = 83/189 (43%), Gaps = 10/189 (5%) Query: 192 LFAVKGTQGRLNKAFEEKFPLKELNN---PEHDSYAISEKSHGREEIRLHIVCDVPDELI 248 + AVK Q L E + +K HGR E R + D P Sbjct: 1 MLAVKDNQSTLLSRIRYALDAAEHFALVYRSASDHREVDKGHGRIETRRCLALDFPGPFE 60 Query: 249 DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 W GL+ + + S R I RYY+SS A + A A+R HW +E+ Sbjct: 61 PDL--WPGLQSIPMVESTREI----GDTVTTGRRYYVSSLPADAVRIAHAVRAHWGIES- 113 Query: 309 LHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYL 368 +HW LDV NED C+ R NAA+ F+ +R IA ++ D KAG+R + KA +Y Sbjct: 114 MHWVLDVTFNEDQCRTRLENAAQNFAILRRIATTLIRRDNSTKAGIRIRRLKAGASDDYR 173 Query: 369 ASVLAGSGL 377 A +L L Sbjct: 174 AQLLGLKTL 182 >UniRef50_A6VYG7 Transposase IS4 family protein n=3 Tax=Gammaproteobacteria RepID=A6VYG7_MARMS Length = 193 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 80/194 (41%), Positives = 118/194 (60%), Gaps = 2/194 (1%) Query: 94 MRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 M+ H +V+AIDGKTLR SYD+ R+ IH++SA+++ + LV+GQ+KT+ KSNEIT Sbjct: 1 MKAVHKLTCGEVVAIDGKTLRGSYDRDDRQSTIHMVSAYASANQLVLGQLKTNTKSNEIT 60 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 AIP L+ MLD++G I+T DAM CQ IA+ I ++GGDYL AVKG QG+L A + F Sbjct: 61 AIPALIQMLDLRGAIVTIDAMACQTKIAKAITRKGGDYLLAVKGNQGKLAAAVQAAFTPH 120 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + D+ EK GR E R + V D + DF+ W GL + + ++R+ Q Sbjct: 121 RRAPIDRDTCQ-IEKQKGRVEARTYHVLSASDLIRDFS-TWSGLTSIVMVENYRAAKGRQ 178 Query: 274 KKEPEMTVRYYISS 287 + + + + + S Sbjct: 179 RARVGVPLLHKVQS 192 >UniRef50_Q2JGT3 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JGT3_FRASC Length = 331 Score = 165 bits (418), Expect = 2e-39, Method: Composition-based stats. Identities = 60/264 (22%), Positives = 112/264 (42%), Gaps = 22/264 (8%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + L+E ++ +PD R+ V ++ + +L + +CA++SGA + I ++ + Sbjct: 47 DQTALLEALARVPDPRRRRGVRYRFAAVLAIAVCAMLSGARSFAAIGEWAADLPADARAG 106 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-------------KDVIAI 108 +P TI RV+ + A W++ + D + V+A+ Sbjct: 107 LGLTGRVPGPVTIWRVLVRVDRAALETAIGAWIQARLDTIDTAGHQPPQRRRRVRRVLAV 166 Query: 109 DGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGK 167 DGK +R + +H++ +V+ Q+ DEK+NEI +L+ + D+ Sbjct: 167 DGKAMRATR---HGTHPVHLLGVLDHARGVVLAQVDVDEKTNEIPLFSTVLDQIPDLTDV 223 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISE 227 +IT DAM Q A+ + +G L VK Q ++ + P K++ + + Sbjct: 224 LITVDAMHAQTAHADHLHARGAHLLVTVKRNQPTVHTRLKT-LPWKDVPV----GHTTTG 278 Query: 228 KSHGREEIRLHIVCDVPDELIDFT 251 + HGR E R VP L Sbjct: 279 RGHGRIETRTLKAVTVPAGLGFPH 302 >UniRef50_B4Y365 Transposase n=1 Tax=Thauera sp. B4 RepID=B4Y365_9RHOO Length = 220 Score = 164 bits (415), Expect = 4e-39, Method: Composition-based stats. Identities = 64/189 (33%), Positives = 93/189 (49%), Gaps = 8/189 (4%) Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCD 242 I + GDYL VKG Q +L +A E F + + D A+ E+ HGR ++ V Sbjct: 1 MIIAKKGDYLLMVKGNQPKLLEAIEIAFID-QHDVKSVDRSALVERGHGRTVGQIASVLS 59 Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 I +W + S R + +KE ++ YYI+S LTAE+ A ++R Sbjct: 60 AKG--IINPGDWPNCVTIGRIDSMRVV---DEKESDLERCYYITSRALTAEQLAASVRAR 114 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK--VFKAGLRRKMRK 360 W VEN+ HW LDV +ED + + NA + S +R IA+NI+ DK K+ LR K + Sbjct: 115 WGVENRFHWILDVSFSEDASTVAKDNAPQNLSMLRKIALNIIRADKTDTRKSSLRLKRKG 174 Query: 361 AAMDRNYLA 369 AA D Sbjct: 175 AARDDGVRE 183 >UniRef50_C6I132 Transposase, IS4 family protein n=6 Tax=Leptospirillum ferrodiazotrophum RepID=C6I132_9BACT Length = 454 Score = 158 bits (399), Expect = 3e-37, Method: Composition-based stats. Identities = 54/223 (24%), Positives = 92/223 (41%), Gaps = 19/223 (8%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGI-- 68 + + D R+ + H +LL+ + V++G +E I + + + Sbjct: 229 TTVRDPRKARGIRHNYLSVLLVGLAGVLAGKRSYEGIARWAQDLSQSQLRRLGCRWSPGK 288 Query: 69 -----PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRR 123 P TI R++S P + + + IAIDGKT+R S Sbjct: 289 ERFLPPSEPTIRRILSKADPVELDRILSQY---IVAHSSGRAIAIDGKTIRSS------- 338 Query: 124 GAIHVISAFSTMHSLVIGQIKTDE-KSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE 182 ++ +++A V+ Q D K +EI A LL LD+ GK++T DA+ Q +A Sbjct: 339 -SVGLMAALVHKDGTVVAQKSLDGPKGHEIPAAHTLLEPLDLSGKVVTADALHTQNALAS 397 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAI 225 +I+++GGDY+F VK + L +P D Sbjct: 398 RIREKGGDYVFTVKDNRKTLKDEISGLDDEAFSPSPYDDLLRT 440 >UniRef50_A8TWU0 ISMca6, transposase, OrfA n=5 Tax=Alphaproteobacteria RepID=A8TWU0_9PROT Length = 219 Score = 158 bits (399), Expect = 4e-37, Method: Composition-based stats. Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 4/187 (2%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-K 59 + L+ + +PD R+ + L +L+ T+ A++SGA + I F E + L Sbjct: 11 VPFSNLLAALQDVPDPRRAQGKRYPLPYLLMFTVLALLSGARSYRGIITFLEHRREHLTH 70 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD---KDVIAIDGKTLRHS 116 +G PV +T+ V+ + + F + + K V+A+DGKTLR S Sbjct: 71 HFGVDLKRAPVVNTLRTVLQSLDTETLEDAFRRHAKTLLPEGEVGEKAVVALDGKTLRGS 130 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 +D R A ++AF + ++V+ + D+KSNEI A +++ L + G + T DAM C Sbjct: 131 FDHINDRKAAQTLTAFGSASAIVLAHTEIDDKSNEIPAAQQMIRDLGLTGVVFTADAMHC 190 Query: 177 QKDIAEK 183 QK + + Sbjct: 191 QKKHSRR 197 >UniRef50_B2RJC8 Partial transposase in ISPg6 n=12 Tax=Bacteria RepID=B2RJC8_PORG3 Length = 170 Score = 157 bits (396), Expect = 8e-37, Method: Composition-based stats. Identities = 55/164 (33%), Positives = 85/164 (51%), Gaps = 3/164 (1%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI 106 + + L+ + NG P DT RV+ I P + C + ++ S + I Sbjct: 1 MHELCLERGASLRPPVELPNGCPSVDTFERVLQRIEPQSLYACLQVYGKELISDLEGKHI 60 Query: 107 AIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKG 166 AIDGK L+ S K+ H++SA+ L + Q EK NE+ AIPE+L+ LD+ G Sbjct: 61 AIDGKRLKGSKKKTGS---THILSAWVDEVGLSLAQESVAEKRNELQAIPEVLDSLDLSG 117 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 +I+ DAMG Q +IAE+I + DY+ ++KG Q L + + F Sbjct: 118 AVISIDAMGTQTNIAEQIIQSEADYILSLKGNQKHLYEDVRDCF 161 >UniRef50_C8PNH0 H repeat-associated protein YdcC n=1 Tax=Treponema vincentii ATCC 35580 RepID=C8PNH0_9SPIO Length = 187 Score = 154 bits (389), Expect = 5e-36, Method: Composition-based stats. Identities = 50/196 (25%), Positives = 83/196 (42%), Gaps = 9/196 (4%) Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHI 239 ++E+ ++ DY+ A+KG + + ++ F + +K HGR E R Sbjct: 1 MSERNSEKDNDYILALKGNHPLMEQEVKDFF--LSPVTSTRSVHTTFDKGHGRIE-RRIY 57 Query: 240 VCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAI 299 D + EWK L + S + K + +RY+I+S ++FA + Sbjct: 58 TLDTNIGWFEDKKEWKHLAGFGMVDSMVTR----KGKECREIRYFITSVT-DVKQFAKGV 112 Query: 300 RNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 +HW +EN LHW LDV+ +D+C + NAAE + IR I N + K Sbjct: 113 CSHWMIENNLHWCLDVLFCDDECTVLDRNAAENLAIIRRIVYNRIKMLSKMDTLSMGKR- 171 Query: 360 KAAMDRNYLASVLAGS 375 D + A +L Sbjct: 172 ACIYDDEFRAQILFSC 187 >UniRef50_C0GV83 Transposase, IS4 family protein n=3 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV83_9DELT Length = 221 Score = 153 bits (385), Expect = 1e-35, Method: Composition-based stats. Identities = 49/206 (23%), Positives = 79/206 (38%), Gaps = 13/206 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 +L +++ GK IT DA+ QK +AE I + YLF VK Q L + F + Sbjct: 2 FIPILEQIEVSGKTITADALLTQKKLAEYIVGRNAAYLFTVKKNQPTLYFDIKNYFEHR- 60 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 E D HGR + R +E ++F + + + Sbjct: 61 ---KEPDYCLQDPPGHGRIDTRSIWTTTELNEYLEFPHVGQAF------CIHKKSYDPKT 111 Query: 275 KEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAE 331 + Y ++S + R HW +EN H+ LD +ED +IR GN Sbjct: 112 NKVCENTFYGVTSHHPNKADPARILQIHRGHWSIENSKHYILDWTYDEDRNRIRTGNGPA 171 Query: 332 LFSGIRHIAINILTNDKVFKAGLRRK 357 + +R AI +L + V + + Sbjct: 172 NTNRLRGFAIGLLKSKGVKDIAQKVR 197 >UniRef50_B5EK20 Putative uncharacterized protein n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK20_ACIF5 Length = 439 Score = 151 bits (382), Expect = 3e-35, Method: Composition-based stats. Identities = 53/227 (23%), Positives = 100/227 (44%), Gaps = 15/227 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG----ETHLDF 57 +++ L + +PD R +H L IL + + AV++ A+ + + ++ + L Sbjct: 219 QMEGLRQIFFRLPDARSRLGKQHPLPAILTIAVAAVLTQAQSYIAMAEWAARLTQAQLKR 278 Query: 58 LKQYGDFE---NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR 114 ++ + P T+ RV+ + W + +A+DGK L+ Sbjct: 279 IRARFNPRTQRYVAPSEPTLRRVLQGANVTALDAAIGAW---LLGIAGFEAVAVDGKVLK 335 Query: 115 HSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAM 174 + + + +H++SAF I Q + K+NEI + LL +DI+ K++T DA+ Sbjct: 336 GAVREDGSQ--VHLLSAFMHGQGATIAQKEIARKTNEIPELRSLLKDVDIQDKVVTADAL 393 Query: 175 GCQKDIAEKIQK-QGGDYLF-AVKGTQGRLNKAFEEKFPLKELNNPE 219 Q+ A + + + DYLF AVKG Q +L + P + Sbjct: 394 HTQRKTARFLVEDKKADYLFTAVKGNQRKLRNSLI-CLPWGDFPPQR 439 >UniRef50_B4VIU0 Transposase, IS4 family protein n=4 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU0_9CYAN Length = 204 Score = 150 bits (379), Expect = 7e-35, Method: Composition-based stats. Identities = 55/173 (31%), Positives = 85/173 (49%), Gaps = 11/173 (6%) Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 K + I + G DY+ AVKG Q RL++ + +E+ R Sbjct: 1 MPKKTVQLIIEGGNDYVIAVKGNQKRLHEQIKLTTE----QRLPVSLDITTERRSDRITT 56 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R V D L +++W+GL++L + +P + YYISS + A +F Sbjct: 57 RSVSVFDD---LSGISYDWEGLQRLVKVER----FGTRAGKPYHQIVYYISSLTINAAQF 109 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 A IR HW +EN+LHW DVV++ED+ ++R+GNA FS IR + + IL + Sbjct: 110 AQGIRGHWGIENRLHWVKDVVLDEDNSRMRQGNAPANFSIIRSLVLTILRYNG 162 >UniRef50_A1WE63 Transposase, IS4 family n=7 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WE63_VEREI Length = 285 Score = 147 bits (371), Expect = 6e-34, Method: Composition-based stats. Identities = 61/194 (31%), Positives = 85/194 (43%), Gaps = 11/194 (5%) Query: 186 KQGGDYLF-AVKGTQGR-LNKAFEEKFPLKELNNPEHDS---YAISEKSHGREEIRLHIV 240 +G + A + Q L A + F + + +K HGR E R Sbjct: 91 DRGRWWRLRACRQGQPTHLAHALRDFFGTLDAPGYPVRQTCVHETLDKGHGRIETRRCTA 150 Query: 241 CDVPDEL--IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATA 298 D L + WK + + S R I + E RY ISS +E+ A Sbjct: 151 AGDLDWLATLGLKERWKKITSVAGIDSSRVI----GSKTETDRRYVISSLPADSERILHA 206 Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R HW +EN LHW LDV ED C IR NAA FS +R A+N+ D GL +K Sbjct: 207 VRMHWGIENGLHWCLDVAFGEDACPIRLRNAALDFSLLRRAAMNLFRADHSRAMGLPKKR 266 Query: 359 RKAAMDRNYLASVL 372 + AA + +YLA++L Sbjct: 267 KAAAWNPDYLANIL 280 >UniRef50_A7NPW4 Transposase IS4 family protein n=1 Tax=Roseiflexus castenholzii DSM 13941 RepID=A7NPW4_ROSCS Length = 360 Score = 146 bits (369), Expect = 9e-34, Method: Composition-based stats. Identities = 63/326 (19%), Positives = 114/326 (34%), Gaps = 43/326 (13%) Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 + G P ++T+ +++C+ WM + A DGK L S Sbjct: 13 RWRPLGAHSPHFPAYNTVRDLLACVDADDLDRRLRPWMERLLGMPVGGIRA-DGKVLGGS 71 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K A+H + + + + Q + + A+ LL + G++++ DA Sbjct: 72 --KRAGAPALHGVELVTHTTGMALAQREAVG-GDAAAALLALLTEAPLDGRMVSMDAGFL 128 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFP------------------------- 211 + + I ++ G+YL VKG Q ++ P Sbjct: 129 NAAVTQTIVQEHGNYLGGVKGDQAECRRSSMTGSPRRCFSPPDTPPAPLPVALDQIAPPR 188 Query: 212 --------LKELNNPEHDSYAISEKSHGREEIRLHIVCDV--PDELIDFTFEWKGLKKLC 261 +EL E+S GR EIR V D + + W+ + ++ Sbjct: 189 RKRQPIGFRRELQPRRAPDAQTIEQSRGRLEIRELWVVDAGDVGPSLMTAYGWRQVTQIG 248 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDD 321 + + +SS T +F +IRNHW +EN++H D M ED Sbjct: 249 GLRRWCRRRHADLW--TVEEVTVVSSRQRTPAQFLASIRNHWTIENQVHRPRDGSMQEDR 306 Query: 322 CKIRRGNAAELFSGIRHIAINILTND 347 R + + R++ IN++ Sbjct: 307 LHGRA--IGVILAVCRNVVINLIRRH 330 >UniRef50_A0PQJ8 N-term transposase for IS2404 n=30 Tax=Mycobacterium ulcerans Agy99 RepID=A0PQJ8_MYCUA Length = 234 Score = 145 bits (365), Expect = 3e-33, Method: Composition-based stats. Identities = 58/245 (23%), Positives = 94/245 (38%), Gaps = 17/245 (6%) Query: 28 DILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFH 87 +L + + A + G+ + T D + P T V+S + PA + Sbjct: 2 ALLAIAVLATAARMRGYAGFATWAATASDDVLAQLGVRFRRPSEKTFRAVLSRLDPADLN 61 Query: 88 ECFINWMRDCHSSDDKD---VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIK 144 ++ +S D IA+DGK LR + + A H++S F+ LV+GQ+ Sbjct: 62 ARMGSYFTAHVASSDPSGLVPIALDGKMLRGALR--AKATATHLVSVFAHRARLVLGQLA 119 Query: 145 TDEKSNEITAIPELLNMLDIKG-KIITTDAMGCQKDIAEKIQKQ-GGDYLFAVKGTQGRL 202 EKSNEI + LL +L ++T DAM Q A+ I YL VK Q ++ Sbjct: 120 VAEKSNEIPCVRALLTLLPDNLRWLVTVDAMHTQVVTAKLICATLKSHYLMIVKSNQAKI 179 Query: 203 NKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCV 262 P E+ D + HGR + R + + + K++ Sbjct: 180 LARI-TALPWAEVPAAATD----DSRGHGRVKTRTLQIITAARGIG-----FPYAKQIIR 229 Query: 263 AVSFR 267 R Sbjct: 230 ITRER 234 >UniRef50_A1WL48 Transposase, IS4 family n=2 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WL48_VEREI Length = 185 Score = 143 bits (361), Expect = 7e-33, Method: Composition-based stats. Identities = 55/142 (38%), Positives = 77/142 (54%), Gaps = 3/142 (2%) Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLN 160 VIAI+GK+LR + + A+H +SA++ + L +GQ+ EKSNEITAI ELL Sbjct: 1 MGGLVIAINGKSLRGAARPACGLRALHQVSAYAAGYGLTLGQLACQEKSNEITAIGELLP 60 Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH 220 L ++G ++T DA+GCQ +AE+I GGDY+ AVK Q L A + F Sbjct: 61 TLALEGAVVTIDAIGCQSAMAEQIVGGGGDYVLAVKDNQPHLAHALRDFFGTLGAPGDPV 120 Query: 221 DS---YAISEKSHGREEIRLHI 239 + +K HGR E R Sbjct: 121 RQTCVHETLDKGHGRIETRRCT 142 >UniRef50_A7BWL4 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7BWL4_9GAMM Length = 142 Score = 141 bits (356), Expect = 3e-32, Method: Composition-based stats. Identities = 57/145 (39%), Positives = 77/145 (53%), Gaps = 7/145 (4%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH---DSYAISEKSH 230 MGCQK+IAE I +Q DY+ AVK Q L++A ++ F N E D KSH Sbjct: 1 MGCQKNIAEIIVEQEADYIEAVKDNQPTLHQAIQDYFEEANEANFESYNIDFAETYNKSH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V L D + W+GL+ + + S R++ K++ + RYYISS Sbjct: 61 GRIESRRCWVGYDALPLTDDSQNWEGLQTIVMVESERTL----KEKTTIEHRYYISSTMA 116 Query: 291 TAEKFATAIRNHWHVENKLHWRLDV 315 TA + R HW +EN LHWRLD+ Sbjct: 117 TAAYLLNSSREHWGIENSLHWRLDI 141 >UniRef50_B4VIU1 Putative uncharacterized protein n=3 Tax=Microcoleus chthonoplastes PCC 7420 RepID=B4VIU1_9CYAN Length = 185 Score = 139 bits (350), Expect = 1e-31, Method: Composition-based stats. Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 4/180 (2%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+E ++ +PD+R + L +LLL I +S G+ +EDF H + L Sbjct: 4 NLLEALTQVPDFRAARGRRYPLWLLLLLVIMGTLSDCLGYRALEDFCRRHHEALVTTLQL 63 Query: 65 EN-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDK--SR 121 P T RV+ I F NW+ ++D + +DGK+++ + Sbjct: 64 PPTRFPSDSTFRRVMMGIDFTDLANIFNNWVYSSLPQGEQDWLGVDGKSIKATVSNYDQA 123 Query: 122 RRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDI 180 + I+V+S FS + I Q +++ +EI + LL LD++G + T D++ CQK + Sbjct: 124 YQDFINVVSVFSAQKGVPIALQQFHNKQGSEIAVVQNLLATLDLEGVVFTLDSLHCQKKL 183 >UniRef50_A5L7Q7 Transposase, IS4 n=24 Tax=Vibrionales RepID=A5L7Q7_9GAMM Length = 164 Score = 139 bits (349), Expect = 2e-31, Method: Composition-based stats. Identities = 50/165 (30%), Positives = 77/165 (46%), Gaps = 6/165 (3%) Query: 211 PLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSII 270 L + +SY EK HGR+E+R V +W +K + V RS+ Sbjct: 5 DYWALPEDKQESYITEEKGHGRKEVREVYVLPAAFS-EALRQKWCLVKSIVAVVRDRSV- 62 Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 K + YYI + L+ E + A R HWH+EN+ HW LDV+ ED+ +I G++A Sbjct: 63 ---KGKGSYETSYYICTDHLSLELASKATRKHWHIENQQHWALDVIFKEDEQRIYAGDSA 119 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 + R N+ + + RKM +AA +++Y VL S Sbjct: 120 LNMACCRRFVQNLFRKSEG-NLSVPRKMNQAAWNKDYREKVLFTS 163 >UniRef50_A7H7X3 Transposase IS4 family protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H7X3_ANADF Length = 442 Score = 138 bits (347), Expect = 4e-31, Method: Composition-based stats. Identities = 69/315 (21%), Positives = 116/315 (36%), Gaps = 41/315 (13%) Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCH-------SSDDKDVIAIDGKTLR 114 G P T+ R+++ SPA E ++D V++ DGK Sbjct: 93 LGLGRGKPCDTTLYRLLAEQSPAGLEETVFAQVKDLIARKVVRNDLLALGVVSFDGKGTW 152 Query: 115 HSYDKSRRRGAIH----------------VISAFSTMHSLVIGQIKTDEKSNEITAIPEL 158 D + +GA + S+ +GQ K E TA L Sbjct: 153 SRTDGEKVKGAQQSAYDAEGSSLQTFGALRAALTSSSVCPCVGQRLIGSKEGESTAFRRL 212 Query: 159 L----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 L L + +I+T DA C ++ AE + G Y+F +K Q L+ + Sbjct: 213 LPAISEQLGGQFRIVTGDAGLCARENAELVTSLGRWYVFGLKDNQPYLHDIARDYGQY-- 270 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 + +E+ G +R DV + L +C + R Sbjct: 271 --DLGTPLARTAERYRGHTIVRELYARDVAGNPAAAIEAAQQLWYVCQTTTDRR-----G 323 Query: 275 KEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA- 330 + + RY+++S LT ++ +R HW +EN HW +DV++ ED+ + + A Sbjct: 324 EIVAVEQRYFVTSIPTGTLTRDQELALVRMHWAIENGCHWTMDVMLGEDEGHPCQASRAS 383 Query: 331 -ELFSGIRHIAINIL 344 E S +R I N + Sbjct: 384 IETVSWLRLIGYNAV 398 >UniRef50_B0TCH7 Putative uncharacterized protein n=11 Tax=Heliobacterium modesticaldum Ice1 RepID=B0TCH7_HELMI Length = 453 Score = 137 bits (345), Expect = 6e-31, Method: Composition-based stats. Identities = 55/398 (13%), Positives = 108/398 (27%), Gaps = 58/398 (14%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + + D R+ +++ I + E E ++ + +Q Sbjct: 38 VYGFSQMVRQAKDGRKQPRIK--APAIFTVAFFGAFFCMESMEQMDRW--QKTGVFRQLV 93 Query: 63 DFENGIPVHDTIARVVSCISPAK--------FHECFINWMRDCHSSDDKDVIAIDGKTL- 113 +P HDT+ + + + S + V AIDG L Sbjct: 94 PKNIRLPSHDTVRQALMKWDLKEQREQHNCVIQRYKEQRGPQKESINGWRVTAIDGVELF 153 Query: 114 ------------RHSYDKSRRRGAIHVISAFSTMHSLVIG-------QIKTDEKSNEITA 154 R DK+ V++ ++ +I Q D+ E T Sbjct: 154 HTKAYRCPECLTREHRDKTTDYYHAVVVAQQVGGNANLIYDWEMRKPQDGVDKDEGETTV 213 Query: 155 IPELLNML-DIKGK---IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKF 210 L+ + + GK + T DA+ + + G + +K + R+ K F Sbjct: 214 AQRLIRRMAETYGKITDVYTLDALFAKAPVIHAALDAGAHVVVRMKEERRRIMKEANACF 273 Query: 211 PLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSII 270 + ++ + V + +W ++ V Sbjct: 274 ANRLPDSTWEERDGKGNT------------VYVQAWDEEGLAQWPQVRVPMRIVKIIRHT 321 Query: 271 AEQ----------KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNED 320 + E + SS + A W +EN L D Sbjct: 322 NKTVIEANKEVFVTDVVERWIATTCSSEKADTQTIAQIAAARWDIENIGFRNLKTFNALD 381 Query: 321 DCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 C + A + G + +A N+ R Sbjct: 382 HCFVHDSVAIKAMIGFQVLAFNLKRLFFFHHLPASRHR 419 >UniRef50_A1WSG1 Transposase, IS4 family n=6 Tax=Bacteria RepID=A1WSG1_VEREI Length = 140 Score = 136 bits (343), Expect = 1e-30, Method: Composition-based stats. Identities = 65/141 (46%), Positives = 91/141 (64%), Gaps = 4/141 (2%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIK 165 +AIDGK LR S+D R IH++SA+S+ +L +GQ++T +KSNEITAIPELL LDI+ Sbjct: 1 MAIDGKCLRGSHD--GARSPIHLVSAWSSTVALTLGQVRTADKSNEITAIPELLQALDIR 58 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDS--Y 223 G IT DAMGCQ DIAE+I ++G DY+ VKG Q L +A + F + E + Sbjct: 59 GSTITIDAMGCQHDIAEQIVRRGADYVLNVKGNQPNLAEAIQTWFDAADAGTLERPFWQH 118 Query: 224 AISEKSHGREEIRLHIVCDVP 244 + ++K+HGR E R + + Sbjct: 119 SQTDKNHGRIETRRCVATNDV 139 >UniRef50_Q2S6S5 Transposase n=1 Tax=Hahella chejuensis KCTC 2396 RepID=Q2S6S5_HAHCH Length = 186 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 59/160 (36%), Positives = 88/160 (55%), Gaps = 3/160 (1%) Query: 97 CHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIP 156 + D+IA+DGKTLR SYD++ + AIH++SA+ST + LV+GQ+KT+EKSNE TAIP Sbjct: 1 MAARIPGDIIAVDGKTLRGSYDRASSKAAIHMVSAWSTANELVLGQLKTEEKSNEFTAIP 60 Query: 157 ELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN 216 +L +L ++ +T DA+G Q+DIA++I + DYL VK Q L++ + + E Sbjct: 61 KLFTLLALEDCPVTIDAIGRQRDIAKQIVDKNADYLLVVKHNQHTLHEGIHDLYIEAEAK 120 Query: 217 NPEHDSY---AISEKSHGREEIRLHIVCDVPDELIDFTFE 253 D HGR + V L + Sbjct: 121 GFTEDFTDSVTEEGDKHGRIDKLHCRVTHRFSGLGALADK 160 >UniRef50_A7C6J6 Transposase, IS4 n=1 Tax=Beggiatoa sp. PS RepID=A7C6J6_9GAMM Length = 193 Score = 136 bits (342), Expect = 1e-30, Method: Composition-based stats. Identities = 61/201 (30%), Positives = 96/201 (47%), Gaps = 13/201 (6%) Query: 155 IPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 + +L IK I T DA+ CQK E I ++ Y+ VK Q L +A E+ Sbjct: 2 VQQLFESFQIKKTIFTLDALHCQKKTVEVIIRKQNGYIIPVKKNQPTLRRAIEDT----- 56 Query: 215 LNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK 274 N ++++ ++K HG E H + + +W GL+ +S R Sbjct: 57 AKNSPLNAWSWTQKGHGHE---SHCRLKIWEATESMKMQWAGLE---RFISIRRQGFRHH 110 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFS 334 K+ + T Y+I+S L++ + A IR H +EN LHW DV++NED+C IR + A + Sbjct: 111 KKFDSTT-YHITSETLSSYRLAGFIRGHRRIENNLHWTKDVILNEDNCGIREPHPAAILG 169 Query: 335 GIRHIAINILTNDKVFKAGLR 355 +R+IA N L V L+ Sbjct: 170 ILRNIAFN-LRLGTVSNPSLK 189 >UniRef50_UPI00016C5729 transposase, IS4 family protein n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C5729 Length = 240 Score = 134 bits (337), Expect = 6e-30, Method: Composition-based stats. Identities = 53/226 (23%), Positives = 91/226 (40%), Gaps = 11/226 (4%) Query: 20 WKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGI-PVHDTIARVV 78 H L +L L AV+ G + I FG + L F G P T+++ + Sbjct: 2 QGRIHPLPAVLGLVAVAVLVCRTGLQGIARFGRQYGTPLAHALGFRRGPTPSASTLSQTL 61 Query: 79 SCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSL 138 I P + W+ + D + +A+DGK LR S D H ++A++ + Sbjct: 62 RRIDPQQLEAALGRWIAGRLTPDARAHVALDGKCLRGSRDGD--VPGPHRVAAYAPHAAA 119 Query: 139 VIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 V+GQI+ D ++NE A LL ++ + G ++T A C +D+A + GG Y+ +G Sbjct: 120 VLGQIRVDAQTNEHQAALALLGIVPVGGSVLTGGATFCPRDVAAAVVDGGGHYVSHGQGQ 179 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKS--------HGREEIR 236 R + ++ + +S R R Sbjct: 180 PTRPGGRHRGRVGVRGRRPRARGGHVPLSRSRRPSNWGARPRRWTR 225 >UniRef50_B9YJB3 Transposase-like protein n=11 Tax='Nostoc azollae' 0708 RepID=B9YJB3_ANAAZ Length = 124 Score = 134 bits (336), Expect = 6e-30, Method: Composition-based stats. Identities = 42/113 (37%), Positives = 67/113 (59%), Gaps = 4/113 (3%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W LK + + S + + + RY+ISS D E+ A ++R+HW +EN LHW L Sbjct: 15 WSNLKSVGMVESI----GQVDDKTTVETRYFISSLDSNGEQLANSVRSHWAIENSLHWVL 70 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRN 366 DV + +DDC+IR+ NA + F+ +R IA+++L + K G++ K AA+D N Sbjct: 71 DVALKQDDCQIRKDNAPQNFAVMRQIAVDLLGKENPVKRGIKNKQFLAAVDNN 123 >UniRef50_Q607Y9 ISMca6, transposase, OrfA n=3 Tax=Gammaproteobacteria RepID=Q607Y9_METCA Length = 179 Score = 134 bits (336), Expect = 6e-30, Method: Composition-based stats. Identities = 56/180 (31%), Positives = 85/180 (47%), Gaps = 5/180 (2%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-QY 61 + L + + IPD+R+ L +LL +I A++SGA + I F TH L + Sbjct: 1 MSALKQSLLAIPDHRRAQGRLFDLPHMLLFSILAIMSGATSYRRIHQFLHTHQAALNAAF 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSR 121 G P + +I + + F VIA+DGKTLR S D+ Sbjct: 61 GCRWRRTPAYSSIRYALQGLDVQALAPHFRAHAARLAE--GAAVIALDGKTLRGSLDRFE 118 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTD--EKSNEITAIPELLNMLDIKGKIITTDAMGCQKD 179 R A V+SAF+T +V+GQI + K +EI A L+ L + G++ T DA+ QK+ Sbjct: 119 DRKAAQVLSAFATEERIVLGQILIEDAGKDHEIQAAQRLIETLGLSGRLYTLDALHLQKN 178 >UniRef50_A7H9Y8 Putative uncharacterized protein n=1 Tax=Anaeromyxobacter sp. Fw109-5 RepID=A7H9Y8_ANADF Length = 431 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 59/370 (15%), Positives = 107/370 (28%), Gaps = 45/370 (12%) Query: 10 ISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIP 69 + +PD R L++IL + +++GA + E+ + ++ +P Sbjct: 22 LEAVPDVRAREG-RWSLAEILTGVLLGIVAGARSLAEAEELTDGMSPAARRLASVPRRLP 80 Query: 70 VHDTIARVVSCISPAKFHECFINWMRDCHSS-------DDKDVIAIDGKTLRHSYDK--- 119 T + + +R V+A+DGK Sbjct: 81 -DTTARDALCAVPLDGLRAALHRLVRAAWRRKALTPVDLPVGVVALDGKVTALPTLNHPL 139 Query: 120 ----------SRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE----LLNMLDIK 165 + S I + ++NE L+ Sbjct: 140 IQNQHPDVGLPYGLARTVTCALVSAPGRPCIDAVPIPAETNEAGHFQHVLAGLVETYGAL 199 Query: 166 GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAI 225 +++T DA + + G DY+FA+K + K E E+ D Sbjct: 200 FQVVTYDAGALSEANGAAVVAAGKDYVFALKNDHFTMVKLATELLDPHEIAARREDVLDN 259 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFE------WKGLKKLCVAVSFRSIIAEQKKEPEM 279 + + R + V + W + S + E Sbjct: 260 ATTA-----TREIQILAVDPSHGYGAGKGPEESVWSHARTFLRVTS---TVRRSGVVIER 311 Query: 280 TVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCK--IRRGNAAELFS 334 R ++SS +++ +R HW VEN H LD ED+ N Sbjct: 312 DSRLFVSSRAADQLTPDQWLQVVRAHWGVENNNHHTLDTAFAEDERPWIAADANGMLAVL 371 Query: 335 GIRHIAINIL 344 +R IA +L Sbjct: 372 LLRRIAYTLL 381 >UniRef50_B9YN76 Transposase-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN76_ANAAZ Length = 158 Score = 133 bits (334), Expect = 1e-29, Method: Composition-based stats. Identities = 55/167 (32%), Positives = 83/167 (49%), Gaps = 13/167 (7%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 ++S ++T + LV+GQ+K E SNEITAIPELL +L++ G I+ A+ C KDI + I ++ Sbjct: 1 MVSVWATTNRLVLGQVKVYENSNEITAIPELLKVLELAGCIVRIYAIRCHKDIVKLITQE 60 Query: 188 GGDYLFAVKGTQGRLNKAFEEKFP---LKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 DY+ +K QG L ++ E+ F +H +Y E HG EIR P Sbjct: 61 NADYVITLKKNQGNLYESVEQLFKSGISTGFQELQHSTYKPEETGHGLHEIRNFGFQLDP 120 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 D W LK + + + + + RY+ISS D Sbjct: 121 DS------VWSNLKSVGMVEPI----GQVDDKTTVETRYFISSLDSN 157 >UniRef50_A7BZC8 H repeat-associated protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZC8_9GAMM Length = 154 Score = 132 bits (332), Expect = 2e-29, Method: Composition-based stats. Identities = 42/111 (37%), Positives = 61/111 (54%), Gaps = 4/111 (3%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHW 311 W+ L+ + + S R+ +K E + RYYISS TA R HW +E LHW Sbjct: 5 ENWEELQTIVMVESERA----EKGETTIEHRYYISSTLGTAAYLLDYKREHWGIETSLHW 60 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 LD+ ED+ +I +GN AE F+ +RHIA+N+L + K G++ K KA Sbjct: 61 CLDIAFREDESRISKGNGAENFAILRHIALNLLKKEDTAKIGIKNKRLKAG 111 >UniRef50_A3Z4H3 ISMca6, transposase, OrfA n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H3_9SYNE Length = 205 Score = 130 bits (326), Expect = 1e-28, Method: Composition-based stats. Identities = 46/190 (24%), Positives = 74/190 (38%), Gaps = 6/190 (3%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L+ H+ IPD R V +LL+ + ++S E D+E F H L + Sbjct: 12 DLISHLKAIPDARMRRGVRIPAWYLLLVAVLGILSKCESLRDLERFARRHHSVLTESLGI 71 Query: 65 E-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK--DVIAIDGKTLRHSYDKSR 121 E P + A +W D + DGKTLR S + + Sbjct: 72 ELKRPPSDSAFRYFFLQVDVAAICGAIRDWTLAQIPGGAGDLDQLICDGKTLRGSIEPTS 131 Query: 122 RRGA--IHVISAFSTMHSLVIGQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 GA I ++ +S + I Q + +E + +LL LD++G +I DA+ Q+ Sbjct: 132 GGGAAFIAQVTLYSAALGVAIAQACYATGEDHERAVLQKLLGELDLEGVLIQADALHTQQ 191 Query: 179 DIAEKIQKQG 188 Q +G Sbjct: 192 AFFGSSQSRG 201 >UniRef50_UPI00016C3AB4 transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3AB4 Length = 210 Score = 127 bits (318), Expect = 8e-28, Method: Composition-based stats. Identities = 44/187 (23%), Positives = 76/187 (40%), Gaps = 17/187 (9%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEK--------FPLKELNNPEHDSYAI 225 M Q D+ +Q++GGDY+ K QG L E FP + D+ Sbjct: 1 MFTQPDVCAAVQERGGDYILYAKSNQGTLCTDLEAAFATAAGAIFPPGLPRQWDRDAGTA 60 Query: 226 SEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYI 285 E S G + + L ++ W G++++ R + + + V Y I Sbjct: 61 CEVSKGHGWVERRTMTS-TIWLNEYLTRWPGVQQVFRLTRTRQV----GGKTTVEVVYGI 115 Query: 286 SSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 SS + R HW +E++ H D + ED C++RRG A + + +R++A+ Sbjct: 116 SSLSSVAAAPDALLRYTRTHWGIESR-HHIRDATLGEDRCRVRRGAAPRVLAVLRNVAVY 174 Query: 343 ILTNDKV 349 +L Sbjct: 175 LLRRLGT 181 >UniRef50_C6Z3A5 H repeat-containing protein (Fragment) n=4 Tax=Bacteroides RepID=C6Z3A5_9BACE Length = 115 Score = 126 bits (316), Expect = 1e-27, Method: Composition-based stats. Identities = 49/118 (41%), Positives = 64/118 (54%), Gaps = 4/118 (3%) Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLT-AEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R+I+A E VRYY++S D T EK A+AIR HW + N LHW+LDV E Sbjct: 1 VRIKSERTIVAI--GEYTQEVRYYVTSLDNTRPEKIASAIRQHWSIGNNLHWQLDVTFRE 58 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 D K + NAA FS +A+ IL N+K K + K KA D NYL+ +L + Sbjct: 59 DYSK-KVKNAAGNFSVATKMALTILKNEKTTKGSMNLKRLKAGWDENYLSQLLQDNNF 115 >UniRef50_A8L0T1 Transposase, IS4 family protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L0T1_FRASN Length = 352 Score = 126 bits (315), Expect = 2e-27, Method: Composition-based stats. Identities = 66/362 (18%), Positives = 109/362 (30%), Gaps = 72/362 (19%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 L+ +L L V++G + + + ++ L GIP T R+V P Sbjct: 48 LASVLGLWFAGVLAGQQTFTAVWEWAVDLPAELLAGFGLTRGIPSERTTRRLVEGCDPVA 107 Query: 86 FHECFINWMRDCH--SSDDKDVIAIDGKTLRH--SYDKSRRRGAIHVISAFSTMHSLVIG 141 E W+ +A DGKTL+ S+ ++ V+ A + Sbjct: 108 LDEALSGWIARAATVGDPGPRGLAFDGKTLKGTRSFTEAGAMSQEAVLEAVWHDTGI--- 164 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGR 201 A + GGD + A++ GR Sbjct: 165 --------------------------------------TAGHQRVVGGDEIAALEALAGR 186 Query: 202 LNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC 261 L + +EK HGR E+R V + G K++ Sbjct: 187 L--------------DLTDVLVTTAEKGHGRVEVRSLKALTVTTP---KLVGFWGTKQVI 229 Query: 262 VAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA-------TAIRNHWHVENKLHWRLD 314 P ++ + L AE+ R HW VE +H D Sbjct: 230 ELRRRTRRKKTVTAAPTVSEEVFYLVTSLPAEQAHPRDLAARARARGHWTVEA-IHHVRD 288 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 V++ED R NA ++ R AI+ L + + +R A + +A Sbjct: 289 RVLDEDRHTARTANAPLAWAIARDTAISALRL--TGHRSIAKALRTTARQPERVLQTIAL 346 Query: 375 SG 376 Sbjct: 347 IS 348 >UniRef50_UPI00016C4FBE putative transposase n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4FBE Length = 159 Score = 123 bits (307), Expect = 2e-26, Method: Composition-based stats. Identities = 55/161 (34%), Positives = 76/161 (47%), Gaps = 7/161 (4%) Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKI 184 H++SA++T H + +G + T+EKSNEITAI LL L K ++T DAMGCQKDIA I Sbjct: 3 PRHIVSAWATEHGVALGPVATEEKSNEITAIAVLLRQLGRKKAVVTIDAMGCQKDIARNI 62 Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLK---ELNNPEHDSYAISEKSHGREEIRLHIVC 241 GGD++ AV+ Q +L A E H ++ HGR + R + Sbjct: 63 VAGGGDFVLAVQDNQPKLAAAIAAVVEKHLEGERKALRHRNHQTDTHGHGRRDERFYWGA 122 Query: 242 DVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 VP EW +K + AV + VR Sbjct: 123 QVP-PDFAAKGEWPWIKAIGTAVRITTH---PDGTQTDEVR 159 >UniRef50_D1VW65 Putative uncharacterized protein (Fragment) n=1 Tax=Prevotella timonensis CRIS 5C-B1 RepID=D1VW65_9BACT Length = 200 Score = 121 bits (302), Expect = 5e-26, Method: Composition-based stats. Identities = 49/167 (29%), Positives = 79/167 (47%), Gaps = 9/167 (5%) Query: 3 LKKLMEHISIIPDYRQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 L+KL E S IPD+R+ K + HKL D+++L I +S +I +FG+ +L ++ Sbjct: 35 LEKLYEFSSSIPDFRRAEKGNIRHKLGDVIMLLILGRVSNCVSRAEIIEFGKHNLKSFRK 94 Query: 61 YGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDK-----DVIAIDGKTLRH 115 NGIP T+ R+ I + H +++ IDGK R Sbjct: 95 LDILANGIPSEATLCRMEEGIDDQAMANQLQVFAETFHKELLGMCCAQEIVCIDGKAERG 154 Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML 162 + K+ R I +SA S + + +EKSNEI A+P L++ + Sbjct: 155 TVLKNGRNPDI--VSAHSFNTDITLATEVCEEKSNEIKAVPLLIDKI 199 >UniRef50_C1XPN1 Transposase family protein n=10 Tax=Thermaceae RepID=C1XPN1_9DEIN Length = 184 Score = 119 bits (299), Expect = 1e-25, Method: Composition-based stats. Identities = 42/187 (22%), Positives = 81/187 (43%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L + +S +PD R + L +L L + A +S + +E F + L G Sbjct: 3 LRQALSQVPDPR-AHNRRYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H I ++ + P K + +D +V+ +DGK LR S + Sbjct: 60 RKAPGHTAITLLLHRLDPEKLQAALGQVFPE---ADLGEVLVVDGKHLRGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q + + + E A ELL+ L +++GK++ DA ++A Sbjct: 115 VKLVEVLALHLHTTLAQARAEGR--EEKAFLELLDRLEARELEGKVVVGDAGYLYPEVAA 172 Query: 183 KIQKQGG 189 +++K+GG Sbjct: 173 RVRKKGG 179 >UniRef50_Q47YV8 Putative uncharacterized protein n=45 Tax=Gammaproteobacteria RepID=Q47YV8_COLP3 Length = 151 Score = 119 bits (298), Expect = 2e-25, Method: Composition-based stats. Identities = 34/136 (25%), Positives = 65/136 (47%), Gaps = 3/136 (2%) Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFA 296 +H+ + + + + GLK + + + + R+ ISS DL + Sbjct: 12 IHLRTLIDKKWLAKAYRRSGLKSIIKV--HTQVHDKSTGKDTAETRWNISSLDLHVVQAL 69 Query: 297 TAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRR 356 A+R+HW VE+ +HW LD+ D+ +I R +F+ +R IA+ + D + R Sbjct: 70 NAVRSHWQVES-IHWMLDMTFRVDESRICRKQGPHVFNVMRKIAMTLFKQDTTKLVSMAR 128 Query: 357 KMRKAAMDRNYLASVL 372 K + A +D +Y +++L Sbjct: 129 KKKMAGLDDDYRSNLL 144 >UniRef50_A9D8S1 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9D8S1_9GAMM Length = 162 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 43/202 (21%), Positives = 73/202 (36%), Gaps = 50/202 (24%) Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQGRLN---KAFEEKFPLKELNNPEHDSYAISEKSH 230 MGCQK+IA+ I KQ DY+ A+KG L +A+ K + D + + H Sbjct: 1 MGCQKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGH 60 Query: 231 GREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADL 290 GR E R V ++ ++W GLK + S Sbjct: 61 GRIETRRCQQVLVNKSWLNNKYQWVGLKSIIKVTSDVHEKTTT----------------- 103 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + +IR+G F+ +R IA+ + ++ Sbjct: 104 ------------------------------ESRIRKGRGPLAFNVMRKIAMTLFKQEQTK 133 Query: 351 KAGLRRKMRKAAMDRNYLASVL 372 +A + K + A +D Y +++L Sbjct: 134 RASIVAKKKMAGLDDEYRSTLL 155 >UniRef50_B7UED9 Truncated H repeat-associated protein n=11 Tax=Bacteria RepID=B7UED9_ECOLX Length = 104 Score = 119 bits (297), Expect = 2e-25, Method: Composition-based stats. Identities = 84/99 (84%), Positives = 90/99 (90%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 MTVRYYISSAD TAEKF TAIRNHWH+EN L+WRLDVVMNEDD KIRRGNAAE FSGIRH Sbjct: 1 MTVRYYISSADSTAEKFVTAIRNHWHMENNLYWRLDVVMNEDDYKIRRGNAAESFSGIRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSGL 377 IAINILTN++VFKA RRKMRKA MD+NYLASVLAG+G Sbjct: 61 IAINILTNNQVFKARSRRKMRKATMDKNYLASVLAGAGF 99 >UniRef50_Q60CS8 ISMca6, transposase, OrfB n=2 Tax=Methylococcus capsulatus RepID=Q60CS8_METCA Length = 197 Score = 116 bits (290), Expect = 1e-24, Method: Composition-based stats. Identities = 45/184 (24%), Positives = 75/184 (40%), Gaps = 15/184 (8%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 K E + G D L +KG +L A + SY + R E Sbjct: 3 STFKKTVETVLATGNDLLVQLKGNHPKLLAAVRTLCQSRAHA---EQSYTVDLGRRNRIE 59 Query: 235 IRLHIVCDVPD------ELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA 288 R + +P F +G +++ V + +++ P YY+++ Sbjct: 60 QRTVRLWPLPPGSGTDPWHDHFQTVIEGQRQIEVFNPYHRRFEPRQESP----AYYLATC 115 Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +A A IR HW +EN+LH LDV + ED +IRR +F+ +RH A+N+L ++ Sbjct: 116 TASAATLAQVIRGHWAIENRLHHVLDVSLGEDSSRIRRN--PGVFALLRHFALNLLRHNG 173 Query: 349 VFKA 352 Sbjct: 174 QANI 177 >UniRef50_B7ABT9 Putative uncharacterized protein n=2 Tax=Thermus aquaticus Y51MC23 RepID=B7ABT9_THEAQ Length = 184 Score = 114 bits (285), Expect = 5e-24, Method: Composition-based stats. Identities = 45/187 (24%), Positives = 82/187 (43%), Gaps = 13/187 (6%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L E +S IPD R ++ L +L L + A +S + +E F + L G Sbjct: 3 LREVLSQIPDPR-ARNRQYPLWGLLALILVAFLSRVDSLRGVERFARANPHLLPHLG--L 59 Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 P H + ++ + P K E + +D +V+ +DGK L+ S + Sbjct: 60 RKPPGHTILTLLLHRLDPEKLQEALLQVFP---GADLGEVLVVDGKHLKGS--GKGKSPQ 114 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML---DIKGKIITTDAMGCQKDIAE 182 + ++ + + Q K + + E A+ ELL+ L +KGK++ DA ++A Sbjct: 115 VRLVEVLALHLLTTLAQAKAEGR--EDQALLELLDRLGAEGLKGKVVVGDAGYLYPELAG 172 Query: 183 KIQKQGG 189 K+ ++GG Sbjct: 173 KVVQKGG 179 >UniRef50_A3WIW9 Transposase, putative n=2 Tax=Idiomarina baltica OS145 RepID=A3WIW9_9GAMM Length = 96 Score = 113 bits (283), Expect = 8e-24, Method: Composition-based stats. Identities = 43/96 (44%), Positives = 62/96 (64%) Query: 279 MTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRH 338 M RYYISSA L+AE+FA+ +R HW +EN+LHW LDV + ED+C I RG+AA+ + RH Sbjct: 1 MQYRYYISSASLSAEEFASTVRAHWGIENRLHWVLDVTIREDNCPISRGHAADNLACFRH 60 Query: 339 IAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAG 374 +A+N + +K A + RK + A M L ++ Sbjct: 61 VALNQIRREKTIDASVNRKQKMATMSEEVLDLIVNA 96 >UniRef50_UPI00016C429C transposase for IS2404 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C429C Length = 149 Score = 113 bits (283), Expect = 9e-24, Method: Composition-based stats. Identities = 38/153 (24%), Positives = 72/153 (47%), Gaps = 13/153 (8%) Query: 224 AISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY 283 SEK HGR E R + +WKGLK+ R++ K + + V Y Sbjct: 2 TTSEKGHGRIEKRTLETTPIVT----VGQKWKGLKQGLRITRERAV----KGKKTVEVVY 53 Query: 284 YISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIA 340 I+S A T +R+HW +EN LH+ DV + ED C++R+G A ++ + +R++ Sbjct: 54 GITSLSMARANAATLLTILRDHWQIENGLHYVRDVTLGEDACRVRKGTAPQVLAAVRNVV 113 Query: 341 INILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 +++L + + ++ + + +++ Sbjct: 114 VHLLASVEAKSRPEAIELLQ--LHPENARNLIG 144 >UniRef50_B5HJP4 Transposase n=1 Tax=Streptomyces pristinaespiralis ATCC 25486 RepID=B5HJP4_STRPR Length = 317 Score = 113 bits (281), Expect = 2e-23, Method: Composition-based stats. Identities = 48/208 (23%), Positives = 85/208 (40%), Gaps = 18/208 (8%) Query: 98 HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 ++ + IA+DGK L+ S + R H++SA + + + +++ K+NE T Sbjct: 126 ATAGPRRAIAVDGKALKASARLTSPRR--HLLSAVTHGRVVTLARVEVGAKTNETTHFKP 183 Query: 158 LLNMLDIKGKIITTDAMG-CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELN 216 LL LD+ ++T DA+ + +I+ ++ + Y+ +K Q + P +++ Sbjct: 184 LLAPLDLADAVVTFDALHSVKANISWLVEAKKAHYIAVIKTNQPTAHHQL-ATLPWRDIP 242 Query: 217 NPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKE 276 +A SE HGR E C +PDEL + L A+ K Sbjct: 243 V----QHAASEVGHGRRESSSIKTCAIPDELGGIAYPHARL-----AIRVHRRCQPTGKR 293 Query: 277 PEMTVRYYISSADLTAEKFATAIRNHWH 304 Y ++S D A R W Sbjct: 294 ESRESVYAVTSLDAH-----QATRPIWP 316 >UniRef50_UPI00016C378D hypothetical protein GobsU_02748 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C378D Length = 453 Score = 112 bits (280), Expect = 2e-23, Method: Composition-based stats. Identities = 56/365 (15%), Positives = 96/365 (26%), Gaps = 58/365 (15%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 E IPD R L D+L+ + A F LD ++ G Sbjct: 27 ERFETIPDAR--RGPTFSLPDVLMAGLALFALKAPSLLA---FQRRTLDHNLRHVFGLTG 81 Query: 68 IPVHDTIARVVSCISPAKFHECFIN--------WMRDCHSSDDKDVIAIDG--------- 110 P + V+ + P F + + D + D V+A+DG Sbjct: 82 RPSDSQMRAVLDDVDPDHLRPVFRDVFARLQAAHVLDEYRVDGCYVVALDGVEYFCSQKV 141 Query: 111 -----KTLRHSYDKSRRRGAIHVISAFSTMHSLVIG------QIKTDEKSN--EITAIPE 157 T RH+ + + S V+ Q N E A Sbjct: 142 HCPHCMTRRHANGAVSYYHQMLGAAVVHPDFSAVLALAPEPIQRADGGTKNDCERNAARR 201 Query: 158 LLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 L ++ DA +QK +L VK A Sbjct: 202 WLGRFREEHPDLAVLVVEDARSSNAPHVRDLQKARCHFLLGVK-------AADHAHLFAH 254 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQ 273 + ++ + E + R +R + L + + + + Sbjct: 255 VCARQDQHAFEVVEDADPRTGLRRSYLWIADLPLNESNDD-------VRVNFVHLVELDP 307 Query: 274 KKEPEMTVRYY-ISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR-GNAAE 331 P ++ + A A R W +EN+ L N+ G+ Sbjct: 308 DGTPREWTWVADMAVTGANVRQLARAGRARWRIENETFNTLK---NQGYHFAHNFGHGDN 364 Query: 332 LFSGI 336 S + Sbjct: 365 NLSVV 369 >UniRef50_A4BTE7 ISMca6, transposase, OrfB n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE7_9GAMM Length = 185 Score = 112 bits (279), Expect = 3e-23, Method: Composition-based stats. Identities = 39/172 (22%), Positives = 61/172 (35%), Gaps = 8/172 (4%) Query: 185 QKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVP 244 G L +K Q L+ A E + + + R E R V + Sbjct: 2 IATGNHLLVQLKRNQPLLHDAMVEYTRGHPFVD---EHHTHEIGRRNRIEKRAVHVWHLH 58 Query: 245 DELIDFTFEWKGLKKLCVAVSFRSIIAEQ--KKEPEMTVRYYISSADLTAEKFATAIRNH 302 L + + L + YY+ L A +F+ AIRNH Sbjct: 59 PSLGSAPWY-DHFRALIRVQRHTERFDTRLRDWRVSKECAYYLCDLVLPAARFSEAIRNH 117 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 W VEN+ H+ D ED +IRR F+ +R A+N++ ++V Sbjct: 118 WRVENRAHYVRDTRFQEDASRIRRN--PCTFALLRSFALNLMRFNRVENISQ 167 >UniRef50_A3ZX26 Transposase (IS4) n=6 Tax=Blastopirellula marina DSM 3645 RepID=A3ZX26_9PLAN Length = 115 Score = 109 bits (273), Expect = 1e-22, Method: Composition-based stats. Identities = 44/115 (38%), Positives = 65/115 (56%) Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNED 320 A+ + +Q + VRYYI S LT +FA A+R HW +EN LHW+LDV E Sbjct: 1 MKAIGMTINLVKQNGKEASEVRYYIVSKYLTGRRFAEAVRGHWGIENSLHWQLDVTFGEH 60 Query: 321 DCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGS 375 +IR+G+A FS +R ++++L N+K + G++ K KA + YL VL G Sbjct: 61 QSRIRKGHADINFSLLRRTSLSLLKNNKTARVGVKNKRLKAGRNDKYLLEVLLGK 115 >UniRef50_C1XRV2 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XRV2_9DEIN Length = 219 Score = 109 bits (272), Expect = 2e-22, Method: Composition-based stats. Identities = 48/211 (22%), Positives = 92/211 (43%), Gaps = 14/211 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK-- 59 + + +++ IPD R+ K +H+ D+LL+ + AV SG + + + FL Sbjct: 5 SIPSPLPYLAQIPDPREYLKTQHRWQDLLLICLMAVGSGRHNILAVSQWVQDQRRFLLDE 64 Query: 60 ---QYGDFENGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKD-----VIAID 109 + E +P T+ R+ + + ++W R+ + K+ +A+D Sbjct: 65 VHIRTRRGERKLPGQATLYRLFWSLSKDLQPLQKALLSWAREVLKALGKEGDEPLPVAVD 124 Query: 110 GKTLRHSYDKSRRRGAIHVISAFSTMHSLVIG-QIKTDEKSNEITAIPELLNMLDIKGKI 168 GK LR + R A+ +SA L +G Q D ++ + + L + + Sbjct: 125 GKHLRGTRCAWRGEEALVFLSALVQGLGLSLGSQAIADGEAAAAQGLVVHMEGLGVD-WV 183 Query: 169 ITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQ 199 +T DA C +++A + +Q G A KGT+ Sbjct: 184 LTGDAALCTQELAAVVVEQKGGICSASKGTR 214 >UniRef50_D1REV9 Putative uncharacterized protein n=1 Tax=Legionella longbeachae D-4968 RepID=D1REV9_LEGLO Length = 139 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 38/141 (26%), Positives = 65/141 (46%), Gaps = 6/141 (4%) Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKF 295 R + +P + T G+K + + S E RYY++S + Sbjct: 3 RRYFAYRLPKTI--NTGSLVGIKSIIATETISSKTNET--AISAEWRYYVTSHETEKSDL 58 Query: 296 ATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTND--KVFKAG 353 +RNHW +EN+LHW LDV +N+D K R A FS I+ + ++++ K Sbjct: 59 HLYVRNHWSIENELHWHLDVHLNDDADKKRDDTTAINFSSIKRMLLSLVKTKLPPGKKRS 118 Query: 354 LRRKMRKAAMDRNYLASVLAG 374 +R ++++ D YL S+L+ Sbjct: 119 VRSRLKQVGWDTEYLVSLLSA 139 >UniRef50_C7RKL6 Putative uncharacterized protein n=1 Tax=Candidatus Accumulibacter phosphatis clade IIA str. UW-1 RepID=C7RKL6_9PROT Length = 506 Score = 106 bits (265), Expect = 1e-21, Method: Composition-based stats. Identities = 58/386 (15%), Positives = 122/386 (31%), Gaps = 57/386 (14%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILL----LTICAVISGAEGWEDIEDFGETHLDF 57 EL L+ + IPD R K HKL+ +LL + + S E ++ L Sbjct: 75 ELPALLGQLEQIPDPRDPRKRRHKLTVLLLYGLLMFVFQFASRRETNREMTR--PQFLAN 132 Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD--------KDVIAID 109 L++ +P DT+ R++ I A + ++ +R IAID Sbjct: 133 LQRLFPEIEALPHADTLYRLLRDIDLAHLEQAHVDLVRRLIRGKSFRRYLINHCHPIAID 192 Query: 110 G------------KTLRHSYDKSRRRGAIHVI----SAFSTMHSLV-----------IGQ 142 G + L+ K R + + ++ + LV +G Sbjct: 193 GSQKLAGDTLWAEELLQRHVGKDETRHTQYFVYVLEASLVFHNGLVIPLLSEFLEHALGD 252 Query: 143 IKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 + ++ E+ L + L ++ D + + ++ + ++ +K Sbjct: 253 SEAQKQDCELRGFARLSDRLKRLFPRLPILLLLDGLYANGPVMQRCLRAHWQFMIVLKDK 312 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + +E + ++ GR + V D+ L Sbjct: 313 --------DLPTVWEEFRALQPRQLPTLQQDWGRRQQHFSWVNDIEYAYGSNGRCRLKLH 364 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT----AEKFATAIRNHWHVENKLHWRLD 314 + ++ + E + E ++SS L+ E+ R+ W +E Sbjct: 365 VVVCEERWQGVDQEARIVTETARHAWLSSQPLSRENVHERCNLGARHRWGIEAGFLVEKH 424 Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIA 340 + + NA + + +A Sbjct: 425 QGYHYEHAFALDWNAMRGYHLLMRLA 450 >UniRef50_B0C6B8 Transposase, putative n=1 Tax=Acaryochloris marina MBIC11017 RepID=B0C6B8_ACAM1 Length = 145 Score = 105 bits (262), Expect = 2e-21, Method: Composition-based stats. Identities = 44/151 (29%), Positives = 71/151 (47%), Gaps = 9/151 (5%) Query: 223 YAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVR 282 + S +S GREE R V + + EW+ ++ + + ++ + Sbjct: 4 HTHSIQSRGREEHRCIQVYE---PVGIALQEWEAIRSVLCVQRW----GTRQGKAYHNTA 56 Query: 283 YYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAIN 342 YYISSA + + + +R HW +EN+LHW DVV EDD ++ A +S +R I IN Sbjct: 57 YYISSAATSPHHWQSLVREHWGIENRLHWPKDVVFGEDDYRLEDEQALLNWSVLRTIVIN 116 Query: 343 ILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 IL + L+ M K A + + S+L Sbjct: 117 ILRLNGYQ--SLKTAMTKLANRVDIIFSLLT 145 >UniRef50_UPI00016C35D5 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C35D5 Length = 148 Score = 104 bits (260), Expect = 4e-21, Method: Composition-based stats. Identities = 33/124 (26%), Positives = 57/124 (45%), Gaps = 11/124 (8%) Query: 228 KSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS 287 K HGR E R L ++ W G++++ R + + V Y ISS Sbjct: 3 KGHGRVERRSITTTT---WLNEYLTRWPGVQQVFRLERQR----RADGKTTVEVVYGISS 55 Query: 288 AD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINIL 344 + R+HW +E+ LH+ DV ++ED C++RRG A + + +R++A+ +L Sbjct: 56 LSPVAAPPDTVLGYTRSHWGIES-LHYVRDVTLDEDRCRVRRGTAPRVLASLRNVAVYLL 114 Query: 345 TNDK 348 Sbjct: 115 RRLG 118 >UniRef50_A4BR51 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BR51_9GAMM Length = 153 Score = 104 bits (258), Expect = 7e-21, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 59/150 (39%), Gaps = 9/150 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + +PD + H+L +L L A + G +G++ + ++ Sbjct: 7 QMRSLPQYFTDLPDPHRAEGRRHRLPVVLTLAAGASLCGMQGYKSMAEWASSLAQAARRR 66 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W + ++ +A+DGK ++ Sbjct: 67 FGCRRVNGHYLVPSLYVIRDCLVRLGPEALDRRLQAWQAA--QLNSEEALAMDGKIMKGG 124 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTD 146 D + + H++S + Q K+ Sbjct: 125 VDHTGAQ--THIVSLIGHESKHCVAQKKSA 152 >UniRef50_C6MZL0 Putative uncharacterized protein n=1 Tax=Legionella drancourtii LLAP12 RepID=C6MZL0_9GAMM Length = 114 Score = 103 bits (257), Expect = 1e-20, Method: Composition-based stats. Identities = 39/99 (39%), Positives = 62/99 (62%), Gaps = 1/99 (1%) Query: 3 LKKLM-EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 ++ L E S IPD R +H +I+ L + +V++GA+ + +IEDF E H+D+LK Y Sbjct: 1 MEGLFVEIFSQIPDPRINRTRKHLFLNIIGLALFSVLAGAQSYTEIEDFCEHHIDWLKTY 60 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 + NGIP HDT +RV S I+PA F + F+ W++ + + Sbjct: 61 FNLPNGIPSHDTFSRVFSAINPASFQDSFLIWLKAINDA 99 >UniRef50_A4U3V5 Transposase n=1 Tax=Magnetospirillum gryphiswaldense RepID=A4U3V5_9PROT Length = 174 Score = 102 bits (255), Expect = 2e-20, Method: Composition-based stats. Identities = 40/164 (24%), Positives = 66/164 (40%), Gaps = 10/164 (6%) Query: 195 VKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K Q L + N+ D+ K R+E R V V D L ++ Sbjct: 1 MKANQSNLFETACAI----AANDAPADTAFSRNKGRSRQEDRTVEVFPVGDALAGTEWQ- 55 Query: 255 KGLKKLCVAVSFRSII--AEQKKEPEMTVRYYISSA-DLTAEKFATAIRNHWHVENKLHW 311 +K + + A + V +Y+SSA + A +A AIR HW +EN+ H+ Sbjct: 56 PFIKTIIRVTRRTLLHSAATGLWDQRGEVAFYVSSAANFPATAWAAAIRGHWGIENRNHY 115 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLR 355 DV +ED +IR + + R A+NI+ + + Sbjct: 116 VRDVSCDEDKSRIRDN--PGIMARARSFALNIMRKNGIANVAQA 157 >UniRef50_Q6TKV9 Aec9 n=1 Tax=Escherichia coli RepID=Q6TKV9_ECOLX Length = 105 Score = 102 bits (253), Expect = 3e-20, Method: Composition-based stats. Identities = 74/88 (84%), Positives = 77/88 (87%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 EQKKEPEMT RYY SADLTAEKFATA RNHW+VENKLHW LDVVMN+DDCKIRRGNAA Sbjct: 18 TEQKKEPEMTDRYYSISADLTAEKFATANRNHWYVENKLHWHLDVVMNKDDCKIRRGNAA 77 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKM 358 ELFSGIR IAINILT DK+ KAG R KM Sbjct: 78 ELFSGIRKIAINILTKDKILKAGARCKM 105 >UniRef50_C6HY37 Transposase (Fragment) n=5 Tax=Leptospirillum ferrodiazotrophum RepID=C6HY37_9BACT Length = 138 Score = 101 bits (252), Expect = 4e-20, Method: Composition-based stats. Identities = 29/120 (24%), Positives = 51/120 (42%), Gaps = 5/120 (4%) Query: 232 REEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTV-RYYISSADL 290 R E + V L+ ++ L+++ + K E + +SS Sbjct: 1 RIETQTIRVSS----LLKGYSDFPHLEQVFRIDRVTRFKKKGKTRKETALGVTSLSSGQA 56 Query: 291 TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVF 350 + + +R HW +EN+LHW D V ED C R GN A + + +R++ I++L Sbjct: 57 SPRELLDFVRGHWEIENRLHWIRDSVFREDTCTTRTGNGAHVMATLRNMTISLLRVAGSK 116 >UniRef50_Q2JDM9 Transposase, IS4 n=1 Tax=Frankia sp. CcI3 RepID=Q2JDM9_FRASC Length = 414 Score = 101 bits (250), Expect = 6e-20, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 67/216 (31%), Gaps = 34/216 (15%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICA-VISGAEGWEDIEDFGETHLDFLKQYG 62 + + E + + D R T + + + +C+ +G + + + Sbjct: 22 EGIWERLDRVTDPRSTRGRVYSWLCLAAVWLCSLTAAGHHRVSAVRAWLARTSGAERARL 81 Query: 63 DFEN------GIPVHDTIARVVSCISPAKFHEC------------------FINWMRDCH 98 +P TI + + + Sbjct: 82 RLPWDPFAGWRLPSTATIHCFLQAVDDGELAVALLDPPLDPDPPAEQGDDTDQRTEPSAA 141 Query: 99 SSDDKD-------VIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 D +A+DGKT RH+ +H++ S ++ Q++ + K+NE Sbjct: 142 PVDPGHGCQPVESAVALDGKTSRHAKRADG--SKVHLVGVASHGDGRLLAQVEVEAKTNE 199 Query: 152 ITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQ 187 LL LD+ ++T DA+ + + ++ Sbjct: 200 TAVFRRLLRPLDLTNVLVTADALHTVRANLDTRSRR 235 >UniRef50_C5D2E6 Transposase IS4 family protein n=6 Tax=Bacillaceae RepID=C5D2E6_GEOSW Length = 437 Score = 100 bits (249), Expect = 8e-20, Method: Composition-based stats. Identities = 64/416 (15%), Positives = 123/416 (29%), Gaps = 83/416 (19%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI-EDFGETH-LDFLKQ 60 K L++ + + D R + + IL + + G + + E F + ++ ++ Sbjct: 28 FKDLVDQLKKVKDKRHQSYITYGPETILYTILLKSVFGIKSMRSMTELFNKDECIENIRV 87 Query: 61 YGDFE--NGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDV-IAID 109 + N +P +DTI ++ + P + F + +K I D Sbjct: 88 VLGLKELNELPHYDTINDFLAKLEPKELETIRIYLIKKLFEKRCLESFRILNKYWPIVFD 147 Query: 110 GKTL-------------RHSYDKSRRRGAI----HVISA--FSTMHSLVIGQIKTDEKS- 149 G + R DK + HV+ A L I + +S Sbjct: 148 GTGIHTFKEKHCEHCLRREYKDKETGETKVVYMHHVLEAKLVVGDMVLSIATEFIENESE 207 Query: 150 ------NEITAIPELLNMLD-----IKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGT 198 E+ A L++ L + +I D++ + + E K Y+F K Sbjct: 208 NVPKQDCELKAFMRLVDKLKKTFKRLPICLI-ADSLYACEPVFEICDKHNWKYIFRFKED 266 Query: 199 QGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK 258 + + E N + + V D+ Sbjct: 267 RIKTVSQEFRAIQSLETNGKSSEYF---------------WVNDIAYNDR---------- 301 Query: 259 KLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMN 318 V + + + E +K+ E + AE A R W +EN+ Sbjct: 302 --LVNLVEKVKVTENEKKQEFLFITNFRITERNAEILVQAGRRRWKIENEGFNNQKNGWY 359 Query: 319 EDDC-KIRRGNAAELFSGIRHIA----------INILTNDKVFKAGLRRKMRKAAM 363 E + NA + + IA +L K + K+ +A Sbjct: 360 EIEHVNCHNYNALKNHYLLVQIADILVQLYKYGSKLLKQLKKSAKEISSKLLEAIR 415 >UniRef50_A9DIV7 Putative uncharacterized protein n=2 Tax=Shewanella benthica KT99 RepID=A9DIV7_9GAMM Length = 133 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 33/107 (30%), Positives = 58/107 (54%), Gaps = 3/107 (2%) Query: 243 VPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNH 302 V ++ ++W GLK + S + + + R+YISS DL AE+ +++RNH Sbjct: 3 VNKSWLNNKYQWVGLKSIIKVTS--DVHEKTTGKETTETRWYISSLDLNAEQALSSVRNH 60 Query: 303 WHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 W VE+ +HW L++ ED+ + R+G F+ +R IA+ + D+ Sbjct: 61 WQVES-MHWVLEMTFREDESRFRKGRGPLAFNVMRKIAMTLFKQDQT 106 >UniRef50_A7N3J1 Putative uncharacterized protein n=2 Tax=Vibrio harveyi ATCC BAA-1116 RepID=A7N3J1_VIBHB Length = 184 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 34/132 (25%), Positives = 60/132 (45%), Gaps = 7/132 (5%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 ++ HGR R + +P+EL + G+K R + + + YYI+ Sbjct: 34 DEGHGRLVRRRYFAFPLPEELHNHALS--GIKSCIAVE--RIVQEGKGEPKTSHFSYYIT 89 Query: 287 SADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTN 346 + + K A +R HW +E+ HW LDV N+D K N+AE F+ I+ + +N++ Sbjct: 90 NHPASDPKLADYVRQHWEIES-YHWLLDVYFNDDRDKKYEENSAENFAQIKRLPLNLVKA 148 Query: 347 DK--VFKAGLRR 356 K ++ Sbjct: 149 KDWAGKKKSVKS 160 >UniRef50_C0GV84 Putative uncharacterized protein n=2 Tax=Desulfonatronospira thiodismutans ASO3-1 RepID=C0GV84_9DELT Length = 357 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 29/148 (19%), Positives = 59/148 (39%), Gaps = 9/148 (6%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +++ L ++ + D R+T H++S +L + A + G +G++ I + +Q Sbjct: 214 QMESLPDYFKTVTDPRRTHGRRHRISTVLSIAAAATLCGMKGYKAIYGWANKLGQKARQR 273 Query: 62 GDFE-----NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 IP I V+ P + + D + +A DGKT++++ Sbjct: 274 FRCRKENGKYVIPSQFVIRDVLVRADPVELDLAVQRFNED--QGLEDTCLAFDGKTMKNA 331 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIK 144 D++ R+ H+ S Q K Sbjct: 332 IDENARQ--THIASVVGHESKTTHTQKK 357 >UniRef50_C6PA49 Putative uncharacterized protein n=1 Tax=Thermoanaerobacterium thermosaccharolyticum DSM 571 RepID=C6PA49_CLOTS Length = 245 Score = 99 bits (247), Expect = 1e-19, Method: Composition-based stats. Identities = 49/230 (21%), Positives = 82/230 (35%), Gaps = 37/230 (16%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L E I+ + D R V+ +S I + + + + +E + K+ Sbjct: 16 VYHLGEKINTLKDKRVKSSVK--ISTITFVVLFGFMLQIRSFNRLEHW--LKKGKFKKAL 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDVIAIDGKTLR 114 + +P DTI RV+S +E N + + D V+AIDG L Sbjct: 72 PKKTKMPRIDTIRRVLSNFDLDGLNELNNSIIKTSIKNKVFRRGTIDGLKVVAIDGVELF 131 Query: 115 HSYDKSRRRG--------------AIHVISAFSTMHSLVIGQIKTDEKSN-------EIT 153 S K V S + L++GQ + K + EIT Sbjct: 132 ESTKKCCGNCLTRVQKDGITHYFHRTVVCSTIGSDSHLILGQEILEPKKDGSDKDEGEIT 191 Query: 154 AIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQ 199 A L+ L + II DA+ C+ +++ G D + VK + Sbjct: 192 AGKRLIRKLHREFHHFADIIVADALYCKSTWVKEVLSIGMDAVVRVKDER 241 >UniRef50_C4RAP0 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RAP0_9ACTO Length = 223 Score = 99.1 bits (245), Expect = 2e-19, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 54/129 (41%), Gaps = 6/129 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 + L ++ +PD R + L IL + +CAV++GA + I D+ + Sbjct: 29 RSLFVVLAAVPDPRDPRGRRYPLVSILSVVVCAVLAGACTFAVITDWVRDQNRSVWDRLG 88 Query: 64 FENGIPVHDTIARVVSCISPAKFHECFINWMRDC------HSSDDKDVIAIDGKTLRHSY 117 F + +P T+ R++ I + W+R VIA+DGK +R + Sbjct: 89 FTDRVPAATTVWRLLIRIDAEVLPQVLARWLRARTAPVVVTGRRLCLVIAVDGKVVRGAR 148 Query: 118 DKSRRRGAI 126 ++ A+ Sbjct: 149 LRAAGPSAL 157 >UniRef50_A4BLK3 Putative transposase n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BLK3_9GAMM Length = 190 Score = 98.7 bits (244), Expect = 3e-19, Method: Composition-based stats. Identities = 23/129 (17%), Positives = 53/129 (41%), Gaps = 6/129 (4%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG-----ETHLD 56 +++ L ++ + +PD R+ H+L + LT A + G +G++ + ++ Sbjct: 59 QMRALPQYFTDLPDPRRAQGRRHRLPVVPALTAGASLCGMQGYKAMAEWASSLGQAARQR 118 Query: 57 FLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 F + + +P I + + P W +S D + +A+DGK ++ Sbjct: 119 FGCRRVNGHYLVPSLYVIRDCLVRLGPKALDRRLQAWQAAQLNSSD-EALAMDGKIMKGG 177 Query: 117 YDKSRRRGA 125 D + + Sbjct: 178 VDHTGAQTQ 186 >UniRef50_D1XPC5 Putative uncharacterized protein n=1 Tax=Streptomyces sp. ACTE RepID=D1XPC5_9ACTO Length = 180 Score = 98.4 bits (243), Expect = 4e-19, Method: Composition-based stats. Identities = 39/186 (20%), Positives = 60/186 (32%), Gaps = 18/186 (9%) Query: 195 VKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEW 254 +K Q + P + + S HGR E R C + DEL F Sbjct: 2 IKRNQPTTYRQL-AALPWPDSAV----QHTASSAGHGRRESRSIKTCGIADELGGIAFPH 56 Query: 255 KGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHW 311 L A+ + Y ++S D + A A+R HW VE H Sbjct: 57 GRL-----ALRVHRRRKQTGGCESRETVYAVTSLDAHETTPAELAAAVRGHWTVEALRH- 110 Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMD-RNYLAS 370 DV E+ + G A + R++A+ +L K +A D Sbjct: 111 VRDVTYAEEASTLHTGTAPRAMATFRNLAVGLLKTLGAINIA---KTTRAIRDQPERALP 167 Query: 371 VLAGSG 376 +L + Sbjct: 168 LLGITN 173 >UniRef50_A8FXP1 Rfbqrso22-4-like protein n=23 Tax=Gammaproteobacteria RepID=A8FXP1_SHESH Length = 137 Score = 97.2 bits (240), Expect = 9e-19, Method: Composition-based stats. Identities = 52/128 (40%), Positives = 69/128 (53%), Gaps = 1/128 (0%) Query: 175 GCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 + ++ +KI ++ DYL AVKG QG L AF++ F LNN + + Y E+S GR E Sbjct: 11 SVRGNVTQKILEKAVDYLLAVKGNQGSLASAFDDYFDFSMLNNDDIEIYTTKEQSRGRHE 70 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R V L D + EW GLK + VS S E +E ++ VRYYISS L AE+ Sbjct: 71 SRAAFVSHDLSVLGDISDEWPGLKSMAFVVSMNS-EKEVAEEADIYVRYYISSKQLNAEE 129 Query: 295 FATAIRNH 302 TA R H Sbjct: 130 LLTASRLH 137 >UniRef50_Q2RP40 ISMca6, transposase, OrfB n=2 Tax=Alphaproteobacteria RepID=Q2RP40_RHORT Length = 152 Score = 95.7 bits (236), Expect = 2e-18, Method: Composition-based stats. Identities = 38/137 (27%), Positives = 51/137 (37%), Gaps = 8/137 (5%) Query: 220 HDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQK---KE 276 + HGR+E R V DV L W GL V+ + + K Sbjct: 2 SSAETTDRGRHGRQEHRWVEVFDVSGRLGP---TWDGLIAAVARVTRLTWHKDTKSGLWH 58 Query: 277 PEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGI 336 Y +L A TAIR HW VE + H+ DV ED +IR F+ + Sbjct: 59 KTQETALYACQINLPAAVAGTAIRQHWGVEKRSHYVRDVTFFEDQSRIRTK--PGHFARL 116 Query: 337 RHIAINILTNDKVFKAG 353 R A+NIL + Sbjct: 117 RSFALNILRANGTNNIS 133 >UniRef50_Q1Q750 Hypothetcal protein n=2 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1Q750_9BACT Length = 129 Score = 93.7 bits (231), Expect = 1e-17, Method: Composition-based stats. Identities = 20/106 (18%), Positives = 40/106 (37%), Gaps = 5/106 (4%) Query: 250 FTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVE 306 + +K++ R + + + Y I+S + + R HW +E Sbjct: 19 ISACRSWVKQVFCI--HRIFTKVKTGKKTEEIVYGITSLTQQKASPKTILKFSRGHWSIE 76 Query: 307 NKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKA 352 N LH+ D ED +IR NA + ++++ + + V Sbjct: 77 NGLHYVRDTAFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPNI 122 >UniRef50_A8U3K1 ISMca6, transposase, OrfA n=1 Tax=alpha proteobacterium BAL199 RepID=A8U3K1_9PROT Length = 134 Score = 92.6 bits (228), Expect = 2e-17, Method: Composition-based stats. Identities = 30/121 (24%), Positives = 49/121 (40%), Gaps = 6/121 (4%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L+ S I D R+ + L+ +LL T+ A+++GA + ++ F THLD L D Sbjct: 3 STLLSLFSQISDPRRDQGKIYPLAPVLLFTVLAMLAGARSYRQVQAFIRTHLDRLNDGFD 62 Query: 64 FE-NGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDD-----KDVIAIDGKTLRHSY 117 P + T+ ++ I + F + IAIDGKT Sbjct: 63 LSLRRAPAYSTVRFILRGIDAEEMERAFRDHALGLADGPAEGAAIPGAIAIDGKTWCCHV 122 Query: 118 D 118 + Sbjct: 123 N 123 >UniRef50_A9D750 Putative transposase n=6 Tax=Shewanella benthica KT99 RepID=A9D750_9GAMM Length = 177 Score = 91.4 bits (225), Expect = 5e-17, Method: Composition-based stats. Identities = 40/120 (33%), Positives = 60/120 (50%), Gaps = 4/120 (3%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H I D R +H L +I+LL I AV+SG+EGWE IE+FG LD+L Q+ F+ Sbjct: 7 FLKHFDSIADPRIERCKKHNLLEIILLAISAVMSGSEGWEGIENFGHLKLDWLLQHRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKDVIAIDG--KTLRHSYDKSR 121 GIP HDTIARV+ + + + + D + + G + H + Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREG 126 Score = 57.5 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 21/79 (26%), Positives = 33/79 (41%), Gaps = 3/79 (3%) Query: 178 KDIAEKIQKQGGDYLFAVKGTQGRLN---KAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 K+IA+ I KQ DY+ A+KG L +A+ K + D + + HGR E Sbjct: 87 KEIAKLIVKQKADYILALKGHHSGLQGELEAWWHKCQREGFTADNFDEHTTIDSGHGRIE 146 Query: 235 IRLHIVCDVPDELIDFTFE 253 R V ++ + Sbjct: 147 TRRCQQVLVNKSWLNNKYR 165 >UniRef50_B6AM49 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM49_9BACT Length = 102 Score = 91.0 bits (224), Expect = 7e-17, Method: Composition-based stats. Identities = 33/84 (39%), Positives = 49/84 (58%), Gaps = 1/84 (1%) Query: 124 GAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAE- 182 A+H++SAF + +V+ Q+ EKSNEI A ELL LDI G +T DAM Q++ A Sbjct: 7 KAVHLLSAFFHLEGVVLSQLLVGEKSNEIPAFRELLGPLDIAGLTVTADAMHTQREHARF 66 Query: 183 KIQKQGGDYLFAVKGTQGRLNKAF 206 ++ + D++ VK Q L +A Sbjct: 67 AVEDKRADFVMTVKDNQPELREAL 90 >UniRef50_Q5P2A0 Putative transposase (IS4,) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P2A0_AZOSE Length = 92 Score = 90.3 bits (222), Expect = 1e-16, Method: Composition-based stats. Identities = 26/75 (34%), Positives = 43/75 (57%) Query: 299 IRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKM 358 +R ++LHW LDV N+D ++RRG AA F +RHI +N+L ++ KA ++ K Sbjct: 15 VRLPRPTRHQLHWSLDVQFNDDQSRVRRGYAANNFVVLRHIVLNLLRHNTTRKASIKSKR 74 Query: 359 RKAAMDRNYLASVLA 373 A M+ ++ +L Sbjct: 75 LLACMEDDFREELLG 89 >UniRef50_A9DGH1 Putative uncharacterized protein n=9 Tax=Shewanella benthica KT99 RepID=A9DGH1_9GAMM Length = 129 Score = 89.9 bits (221), Expect = 2e-16, Method: Composition-based stats. Identities = 41/108 (37%), Positives = 60/108 (55%), Gaps = 2/108 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 ++H + D R +H L DI+LL I AV+SG+EGWEDIE+FG LD+L+QY F+ Sbjct: 7 FLKHFDLTADPRIERCKKHNLLDIVLLAISAVMSGSEGWEDIENFGHLKLDWLRQYRPFK 66 Query: 66 NGIPVHDTIARVVSCI--SPAKFHECFINWMRDCHSSDDKDVIAIDGK 111 GIP HDTIARV+ + + + + D + + G+ Sbjct: 67 AGIPRHDTIARVICRLKADEKEIAKLIVKQKADYILALKGHHSGLQGE 114 >UniRef50_UPI00016C56B7 transposase, IS4 n=4 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C56B7 Length = 116 Score = 89.5 bits (220), Expect = 2e-16, Method: Composition-based stats. Identities = 23/109 (21%), Positives = 49/109 (44%), Gaps = 5/109 (4%) Query: 268 SIIAEQKKEPEMTVRYYISSADL---TAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + + V + I+S A +R HW +EN+LH+ DV + ED C++ Sbjct: 8 TRERTVRGQTTVEVHFGITSLSAEKADAATLLNHVRTHWRIENELHYVRDVTLGEDVCRV 67 Query: 325 RRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 R G+A ++ + +R+ +++ K + + MD ++ Sbjct: 68 RMGHAPQVLAALRNAVVHLWREVKAVSCPEAIERLQ--MDPAMAKGLIG 114 >UniRef50_B6AM50 Transposase n=1 Tax=Leptospirillum sp. Group II '5-way CG' RepID=B6AM50_9BACT Length = 135 Score = 89.1 bits (219), Expect = 3e-16, Method: Composition-based stats. Identities = 31/118 (26%), Positives = 48/118 (40%), Gaps = 9/118 (7%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L EH++ +PD R + H L IL + + A+ SGAE + + ++ T L Q Sbjct: 15 GLWEHLASLPDPRSSQGRRHSLISILKIIMAAIFSGAEHAKGVGEWARTLSQELLQRLGC 74 Query: 65 ENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 + P T+ RV+ I NW+ +A+DGKTL Sbjct: 75 QESPSRQCFVPPSWTTLHRVIRTIGDLALERALRNWILSL--GLSPAALAVDGKTLAG 130 >UniRef50_B0JTQ4 Transposase n=1 Tax=Microcystis aeruginosa NIES-843 RepID=B0JTQ4_MICAN Length = 137 Score = 88.0 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 26/85 (30%), Positives = 43/85 (50%) Query: 7 MEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFEN 66 ++H + D R L I+ + I AV++GA+G+ IE +G+ +L+ + D Sbjct: 28 LKHFQHLEDPRADRGRNPSLVSIITIAILAVLTGADGFAAIEVYGQAKQSWLETFLDLPK 87 Query: 67 GIPVHDTIARVVSCISPAKFHECFI 91 GIP HDT RV+ + P + F Sbjct: 88 GIPSHDTFGRVLRILEPKQLQSGFR 112 >UniRef50_Q7NJ26 Gll2006 protein n=1 Tax=Gloeobacter violaceus RepID=Q7NJ26_GLOVI Length = 125 Score = 88.0 bits (216), Expect = 5e-16, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 46/106 (43%), Gaps = 1/106 (0%) Query: 261 CVAVSFR-SIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 S R E + + +Y+SS + +A + IR HW VEN++H+ DV E Sbjct: 12 GRTRSIRLERYRELRGIVTVKTHWYLSSIEASASELGRRIRGHWGVENQVHYPKDVTFGE 71 Query: 320 DDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDR 365 D +IR +++S R A+N+ + + ++ + Sbjct: 72 DRSRIRTLPLVQVWSVARSFALNLYRSLLMANRAQAQRRCMFGLST 117 >UniRef50_A4BTE5 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BTE5_9GAMM Length = 161 Score = 87.2 bits (214), Expect = 1e-15, Method: Composition-based stats. Identities = 34/131 (25%), Positives = 54/131 (41%), Gaps = 2/131 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++ IPD+R+ + L+ +LL +I AV+SGA + I+ F + H + L Sbjct: 3 LKAYLPAIPDHRRAQGRMYDLTHLLLFSILAVVSGATSYRKIQRFMDAHRERLNALCQLH 62 Query: 66 N-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDV-IAIDGKTLRHSYDKSRRR 123 PVH +I + + F + IA+DGKTLR + + R Sbjct: 63 WKRAPVHTSIRYALQGLDAKAGELAFHRHASGLDGEGAQHASIAMDGKTLRAAVSITSRT 122 Query: 124 GAIHVISAFST 134 SA Sbjct: 123 ARPLRYSAHWP 133 >UniRef50_Q8RLV5 Putative transposase n=1 Tax=Xenorhabdus nematophila RepID=Q8RLV5_XENNE Length = 150 Score = 85.3 bits (209), Expect = 4e-15, Method: Composition-based stats. Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 5/128 (3%) Query: 161 MLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEH 220 M +KG ++T DAMGCQ+ IA+++++ G D + ++KG QG+ A F ++ + Sbjct: 1 MFSLKGHLVTLDAMGCQRTIAQQLRESGADDILSLKGNQGKTFSAAVTYFQQQQATQKPY 60 Query: 221 --DSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPE 278 + E SHGR R V + E + W ++ L V R A + Sbjct: 61 LKPDHDEFEDSHGRTVRRRGWVLPLTPE-TKHSGSWPDIQALLVTEKIRQ--AHYSETVT 117 Query: 279 MTVRYYIS 286 RYY+S Sbjct: 118 SDFRYYLS 125 >UniRef50_UPI00016C4671 hypothetical protein GobsU_03374 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C4671 Length = 91 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 19/86 (22%), Positives = 42/86 (48%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 + ++ + + + D R T +H+ DI+++ +C V+ G +G I + ++L+ Sbjct: 6 VAVESIGSYFGCMTDPRYTRNRKHRFVDIVVIAVCGVVCGCDGPTAIRRWAMVRAEWLQG 65 Query: 61 YGDFENGIPVHDTIARVVSCISPAKF 86 + + NG+P D I + + P F Sbjct: 66 FLELPNGLPSRDCIRNWLMALQPDAF 91 >UniRef50_Q2JBB1 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JBB1_FRASC Length = 173 Score = 84.9 bits (208), Expect = 4e-15, Method: Composition-based stats. Identities = 25/106 (23%), Positives = 45/106 (42%), Gaps = 5/106 (4%) Query: 274 KKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + ++S + A + HW +EN+LHW DV +ED + R GNA Sbjct: 69 GGPATAETVHAVTSLPTHHASPRLLAELAQAHWAIENRLHWVRDVTYDEDRHRARTGNAP 128 Query: 331 ELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAGSG 376 ++ + +R++AI IL + + +R A + +G Sbjct: 129 QVMTSLRNLAITILRLTGAKN--IAKALRHHARHPERPLETIKKAG 172 >UniRef50_B9YN77 H repeat-associated protein-like protein n=1 Tax='Nostoc azollae' 0708 RepID=B9YN77_ANAAZ Length = 77 Score = 83.7 bits (205), Expect = 9e-15, Method: Composition-based stats. Identities = 26/61 (42%), Positives = 43/61 (70%) Query: 295 FATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGL 354 A ++R+HW +EN LHW LDV + +DDC+IR+ NA + F+ +R IA+++L + K G+ Sbjct: 1 MANSVRSHWAIENSLHWVLDVALKQDDCRIRKDNAQQNFAVMRQIAVDLLGKENPVKRGI 60 Query: 355 R 355 + Sbjct: 61 K 61 >UniRef50_A1WFT6 Putative uncharacterized protein n=3 Tax=Verminephrobacter eiseniae EF01-2 RepID=A1WFT6_VEREI Length = 92 Score = 83.3 bits (204), Expect = 1e-14, Method: Composition-based stats. Identities = 34/75 (45%), Positives = 52/75 (69%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 +++++E + + D R + +H L DIL+L +CAV+SGA+GW+DIED+G +L++Y Sbjct: 7 IEEIIEQLRQLKDPRVAGRTDHNLLDILVLALCAVMSGAQGWDDIEDWGHAREGWLRRYL 66 Query: 63 DFENGIPVHDTIARV 77 NGIP HDTI RV Sbjct: 67 KLRNGIPGHDTIRRV 81 >UniRef50_A3Z4H5 Putative uncharacterized protein n=1 Tax=Synechococcus sp. RS9917 RepID=A3Z4H5_9SYNE Length = 177 Score = 83.0 bits (203), Expect = 2e-14, Method: Composition-based stats. Identities = 32/152 (21%), Positives = 58/152 (38%), Gaps = 12/152 (7%) Query: 197 GTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKG 256 G Q L + ++ K + E HGR+ + + W G Sbjct: 8 GDQKTLYRQIADQLLGKRHIPLMATDH---EIGHGRD---ILWTLRAKEAPQHIKANWHG 61 Query: 257 LKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVV 316 + ++ + ++P +I+S T + +R W VE+ HW D Sbjct: 62 TSWIAEVIA----TGTRDRKPFKATHRFITSLRTTPDALLRLVRERWSVESW-HWIRDTQ 116 Query: 317 MNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 ++EDD + R GN A + + +R A+N+L Sbjct: 117 LHEDDHRYR-GNGAGVMAALRTAAMNLLRLTG 147 >UniRef50_Q2RW75 Putative uncharacterized protein n=1 Tax=Rhodospirillum rubrum ATCC 11170 RepID=Q2RW75_RHORT Length = 111 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 28/77 (36%), Positives = 51/77 (66%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + +++H S + D RQ+W+V + L +I LL +CA +SG E + +I +G+ L+FL++ Sbjct: 17 MASRSMLDHFSALKDPRQSWRVVYPLPEIPLLVLCATLSGMEDFVEIRLWGDLRLEFLRR 76 Query: 61 YGDFENGIPVHDTIARV 77 + +E G+P HDT+ + Sbjct: 77 FLPYERGLPAHDTLKGL 93 >UniRef50_B5EK19 Transposase, IS4 n=1 Tax=Acidithiobacillus ferrooxidans ATCC 53993 RepID=B5EK19_ACIF5 Length = 104 Score = 82.6 bits (202), Expect = 2e-14, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 36/93 (38%), Gaps = 3/93 (3%) Query: 273 QKKEPEMTVRYYISSADLT---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + + ++S E R HW +EN+ H D +ED +IR N Sbjct: 2 KDGTLREDCAFGLTSLTKDRTTPENLLGIARGHWEIENRNHHVRDTTYHEDLSQIRTENG 61 Query: 330 AELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + +R +A++IL V + A+ Sbjct: 62 PHMMATLRGLAMSILRLIGVKNIAQAGRDFAAS 94 >UniRef50_C1XTV7 Putative uncharacterized protein n=2 Tax=Meiothermus silvanus DSM 9946 RepID=C1XTV7_9DEIN Length = 118 Score = 81.4 bits (199), Expect = 5e-14, Method: Composition-based stats. Identities = 31/122 (25%), Positives = 55/122 (45%), Gaps = 7/122 (5%) Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRL 313 W+G + R ++ + E Y ++S A++ R HW VEN+LH + Sbjct: 4 WRGSRMALRMR--RRVVRKNSGELREETAYALTSLQAPAKRLYALWRGHWEVENRLHHKR 61 Query: 314 DVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 D V+ ED + R+G A +R + +N+L + + R +RK + D L ++ Sbjct: 62 DTVLGEDASRSRKGAAG--LMYLRDVILNLLHL---KRWPVLRSVRKFSADPKVLLRLIR 116 Query: 374 GS 375 G Sbjct: 117 GL 118 >UniRef50_C7GEK3 Transposase, IS4 family protein n=4 Tax=Roseburia RepID=C7GEK3_9FIRM Length = 437 Score = 81.0 bits (198), Expect = 7e-14, Method: Composition-based stats. Identities = 40/288 (13%), Positives = 93/288 (32%), Gaps = 26/288 (9%) Query: 43 GWEDIEDF-----GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDC 97 +E++ F G+ D L +Y +F+N P + + + + I P F F + + Sbjct: 40 SFEEVMKFMLTMEGKALRDELLEYFEFDNTTPSNSSFNQRRAQILPEAFEFLFQEFTKSF 99 Query: 98 HSS---DDKDVIAIDGKTLRHSYDK------------SRRRGAIHVISAFS-TMHSLVIG 141 + + +IA DG L +++ + +H+ + + Sbjct: 100 TDNVTYNGLRLIACDGSDLCIAHNPQDETTYFQTLPDRKGYNLLHLNAFYDLCSRQYTDA 159 Query: 142 QIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGR 201 I+ +NE A+ E+++ + I D +I ++ +G YL VK Sbjct: 160 IIQPSRLANERRAMCEMIDRYNDTSAIFIADRGYENYNIFAHVEHKGMYYLIRVKDITSN 219 Query: 202 LNKAFEEKFPLK-ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKL 260 + P E + + + + + + + + V D + Sbjct: 220 GITSKLTMLPESGEFDEWVNVTLTKKQTNEVKANPKKYRVIDKKTPFDYLDLHFNNF--- 276 Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + R I + + + +++ W +E Sbjct: 277 -YEMKMRVIRFPIPQGSYECIITNLPQDKFNSDEIKRLYAKRWGIETS 323 >UniRef50_C4RJR9 Transposase n=2 Tax=Micromonospora RepID=C4RJR9_9ACTO Length = 410 Score = 80.6 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 31/141 (21%), Positives = 50/141 (35%), Gaps = 11/141 (7%) Query: 42 EGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS-- 99 + + G + P I R++ I P W+ Sbjct: 221 RATSALIAWVLARPTVAVLLGIDADRRPSEAMIRRLLQAIDPDLLTTAIGIWLAARIPAP 280 Query: 100 -SDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPE- 157 + IA+DGKTLR S + A HV++A +V+ D K+NEIT Sbjct: 281 APGSRRAIAVDGKTLRGSRTRDSA--ARHVLAAADQHTGIVLASTDVDTKTNEITRFTAS 338 Query: 158 -----LLNMLDIKGKIITTDA 173 LL+ I+ +++ A Sbjct: 339 GSHADLLSSRCIRSGVVSPAA 359 >UniRef50_Q3C2E6 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E6_9ACTO Length = 128 Score = 80.6 bits (197), Expect = 9e-14, Method: Composition-based stats. Identities = 29/129 (22%), Positives = 46/129 (35%), Gaps = 13/129 (10%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + L E ++ + D R+ H +LL+ AV++GA + I ++ + Sbjct: 1 MGPLAERLATLADPRRRRGKRHPFVAVLLIACSAVVTGARSFVAIHEWATDSPQDVLARL 60 Query: 63 DFENG-------IPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRH 115 P TI RV+ P + H D +AIDGK+ R Sbjct: 61 GARTATALAVRIPPSGVTIRRVIKDTCPGGLADLLG------HDPAGTDTLAIDGKSARG 114 Query: 116 SYDKSRRRG 124 S S R Sbjct: 115 SRLGSTRPP 123 >UniRef50_A0P0P8 ISMca6, transposase, OrfA n=1 Tax=Labrenzia aggregata IAM 12614 RepID=A0P0P8_9RHOB Length = 139 Score = 80.3 bits (196), Expect = 1e-13, Method: Composition-based stats. Identities = 36/128 (28%), Positives = 51/128 (39%), Gaps = 13/128 (10%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFINWMRDC----HSSDDKDVIAIDGKTLRHSYDKSR 121 PV+ ++ ++ I P F + IAIDGKTLR S+D Sbjct: 9 RRAPVYTSVRGILRQIDPDALGTAFRRHAEGLDRTCAPAGPSRFIAIDGKTLRQSFDAFS 68 Query: 122 RRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELL---------NMLDIKGKIITTD 172 A +V+SAF+ H +++ DEKSNEI A L+ I + D Sbjct: 69 DTKAAYVLSAFAVDHQIILTHEVVDEKSNEILAAQALIVATALWKSREETSIYASSVMLD 128 Query: 173 AMGCQKDI 180 AM I Sbjct: 129 AMTFAPAI 136 >UniRef50_A7C5S5 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7C5S5_9GAMM Length = 108 Score = 79.9 bits (195), Expect = 1e-13, Method: Composition-based stats. Identities = 28/96 (29%), Positives = 37/96 (38%), Gaps = 4/96 (4%) Query: 172 DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPL---KELNNPEHDSYAISEK 228 D +GCQK IA+ I +Q DYL AVK Q L++A F D K Sbjct: 8 DGLGCQKKIAKTIVEQEADYLLAVKDNQPTLHQAIRNYFEEANKARFAGYNIDYDEKINK 67 Query: 229 SHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAV 264 GR E R V + W L+ + + Sbjct: 68 GPGRLEQRRCWV-GYEIPDTINSQNWAKLETIVMVE 102 >UniRef50_B7A7V9 Putative uncharacterized protein n=3 Tax=Thermus aquaticus Y51MC23 RepID=B7A7V9_THEAQ Length = 161 Score = 79.5 bits (194), Expect = 2e-13, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 55/123 (44%), Gaps = 7/123 (5%) Query: 233 EEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---D 289 E+ + V P L + + G ++ R ++ + E TV Y ++S Sbjct: 20 GEVWTYRVWASPY-LPEEMRAFPGCGQVVRME--REVVRKGTGEVRRTVSYALTSLGPEV 76 Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 A + + + W VEN+ W D +++ED C++R G A++ + +R +++L V Sbjct: 77 ADARRLGELLLSRWEVENRSFWVRDFLLHEDACQVR-GVGAQVLAALRAFLVSLLHRQGV 135 Query: 350 FKA 352 + Sbjct: 136 REK 138 >UniRef50_A7BZU2 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU2_9GAMM Length = 289 Score = 78.3 bits (191), Expect = 4e-13, Method: Composition-based stats. Identities = 50/198 (25%), Positives = 75/198 (37%), Gaps = 37/198 (18%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFG--ETHLDFLK 59 +LKKL+E S IPD R+ V+H+L+ +LL + + + + L L+ Sbjct: 79 QLKKLLEDFSKIPDPRRAKSVKHQLTVVLLYGLLSCLFQMASRREANRDMSRPAFLQALQ 138 Query: 60 QYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD--------DKDVIAIDGK 111 +P DT+ARV+ I P K E FI +R IAIDG Sbjct: 139 GLFPELETLPHGDTLARVLERIEPQKLEESFIRLLRRYIRHKKFKRHLINKCYPIAIDGT 198 Query: 112 -------------TLRH--SYDKSRRRGAIHVISA-FSTMHSLV-------IGQIKTDEK 148 RH + D + + I+V+ A F + L + + D K Sbjct: 199 QKLVRDGELGEEWLERHIKTKDGEKVQQYIYVLEANFVFKNGLTIPIMSEFLSYSEDDSK 258 Query: 149 S----NEITAIPELLNML 162 EI A L + L Sbjct: 259 EVKQDCEIKAFKRLSHRL 276 >UniRef50_UPI00016C3A7C hypothetical protein GobsU_12130 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7C Length = 118 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 32/108 (29%), Positives = 49/108 (45%), Gaps = 4/108 (3%) Query: 29 ILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG-IPVHDTIARVVSCISPAKFH 87 +L L + AV++G E I FG L F+NG +P +TIA ++ + Sbjct: 3 LLTLCLVAVMAGHTTPEAISQFGRLRPKRLGHALGFQNGNMPCANTIAGLLRKLDADHLD 62 Query: 88 ECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTM 135 W+ D H D D IA+DGK L S D H+++A++ Sbjct: 63 RIIGAWLGDRHP-DGWDHIALDGKRLCGSRD--GAVPGTHLLAAYAPQ 107 >UniRef50_A4BVU1 ISMca6, transposase, OrfA n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVU1_9GAMM Length = 117 Score = 75.6 bits (184), Expect = 3e-12, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 45/112 (40%), Gaps = 6/112 (5%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFE 65 L ++S IPD+R+ + L+ +LL +I A++SGA + I+ F +TH + L Sbjct: 3 LKAYLSAIPDHRRAQGRMYDLTHLLLFSILAMVSGATSYRKIQRFMDTHRERLNALCQLH 62 Query: 66 N-GIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD-----DKDVIAIDGK 111 P H +I + + F D VI + K Sbjct: 63 RKRAPAHTSIRYALQGLDAKAVELAFPRHASGLDGEDHNRFFPSTVIDAEWK 114 >UniRef50_Q0RW01 Putative uncharacterized protein n=1 Tax=Rhodococcus jostii RHA1 RepID=Q0RW01_RHOSR Length = 130 Score = 75.3 bits (183), Expect = 3e-12, Method: Composition-based stats. Identities = 15/82 (18%), Positives = 27/82 (32%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 +PD R V H+ S IL + A +GA + I ++ +P Sbjct: 49 DEVPDPRHRRGVRHRFSVILAPALSATCAGARSFIAIAEWAHDAPAEALGGLGVSAVVPS 108 Query: 71 HDTIARVVSCISPAKFHECFIN 92 T R ++ + + Sbjct: 109 ESTSRRFLAGVDATALDQVLGM 130 >UniRef50_B0QXN9 Transposase IS4 family protein (Fragment) n=3 Tax=Haemophilus parasuis 29755 RepID=B0QXN9_HAEPR Length = 77 Score = 74.5 bits (181), Expect = 5e-12, Method: Composition-based stats. Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 4/81 (4%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M + + E++S D R + +H DI+ L + AVISGA W +I+ FGE HLD+L++ Sbjct: 1 MSVFRFFENLS---DPR-AYNQKHHFLDIVFLVVSAVISGANSWTEIKLFGELHLDWLRK 56 Query: 61 YGDFENGIPVHDTIARVVSCI 81 Y FE GIPV DTIARV+ I Sbjct: 57 YRPFECGIPVDDTIARVIKRI 77 >UniRef50_B2IT45 Putative uncharacterized protein n=5 Tax=Cyanobacteria RepID=B2IT45_NOSP7 Length = 435 Score = 74.1 bits (180), Expect = 7e-12, Method: Composition-based stats. Identities = 47/386 (12%), Positives = 109/386 (28%), Gaps = 58/386 (15%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIED-FGETHLDFLKQY 61 ++ + +PD R +++SD L + + + + + Q Sbjct: 11 VQYFQSILKDLPDKRTGKNKRYQMSDAALSAFSIFFTQSPSFLAHQRSMAHSKGHNNAQS 70 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECF---------INWMRDCHSSDDKDVIAIDGKT 112 + IP + I ++ I P F + S + +IA+DG Sbjct: 71 LFGVHQIPSDNHIRDLLDEIEPTVVFPVFTKIFKALENGKHLSKFRSFKNNLLIALDGTE 130 Query: 113 -------------LRHSYDKSRRRGAIHVISAFSTMHS---------LVIGQIKTDEKSN 150 R + + + V + V+ Q ++ Sbjct: 131 YFCSNEIHCEHCSSRTFKNGTTQYFHTVVTPVIVCPSNSQVIPLIPEFVVPQDGYQKQDC 190 Query: 151 EITAIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAF 206 E A + + G I D + C + + E + ++ +++ + + + Sbjct: 191 ENAAAKRWIQKYAKQYASLGITILGDDLYCHQPLCELLLQEKLNFILVCRSKSHKTLYEW 250 Query: 207 EEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSF 266 E PL + K + R + + W L Sbjct: 251 LEGMPLDTFSVKH-----WKGKVYEIYTYRYVNQIPLRNSEDALLVNWCELA-------- 297 Query: 267 RSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLH-------WRLDVVMNE 319 ++ + T D+ E + R+ W +EN+ + + L+ Sbjct: 298 -ITRSDGTIIYKNTFATNHRITDINVEAIVSDGRSRWKIENENNNTLKTKGYNLEHNFGH 356 Query: 320 DDCKIRRGNAAEL-FSGIRHIAINIL 344 + A + + H ++I+ Sbjct: 357 GKTHLSSLLATFNILAFLFHTLLDII 382 >UniRef50_A4XCB4 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4XCB4_SALTO Length = 117 Score = 74.1 bits (180), Expect = 8e-12, Method: Composition-based stats. Identities = 23/106 (21%), Positives = 46/106 (43%), Gaps = 3/106 (2%) Query: 26 LSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 ++ +L +CAV++GA + D+ E F + +PV T+ R++ + Sbjct: 1 MASVLADAVCAVMAGASTFAAFGDWVEDLDAPAWSRLGFTDRVPVLTTLWRLLVRVDAET 60 Query: 86 FHECFINWMRDC---HSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 + +W+ + VIA+DGK +R + R A+ + Sbjct: 61 LTAVWADWLCSRLPVAPPPVRRVIAVDGKVVRGAVLTEGRVPALWM 106 >UniRef50_UPI00016C544B hypothetical protein GobsU_11590 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C544B Length = 103 Score = 73.7 bits (179), Expect = 9e-12, Method: Composition-based stats. Identities = 28/109 (25%), Positives = 42/109 (38%), Gaps = 11/109 (10%) Query: 227 EKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYIS 286 + HGR E R L+ W GLK R++ K + V + I+ Sbjct: 2 DPGHGRIETRTVRATP----LLTCHDRWTGLKHGFRITRTRTV----KGVTTVEVVHGIT 53 Query: 287 SAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 S A +R+HW +EN+ H DV + ED+ + R A Sbjct: 54 SRPVERADARALLGLVRSHWRIENQRHDVRDVTLREDEPRCRAAGAGRA 102 >UniRef50_A8LD44 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8LD44_FRASN Length = 261 Score = 73.3 bits (178), Expect = 1e-11, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 56/158 (35%), Gaps = 10/158 (6%) Query: 179 DIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYA------ISEKSHGR 232 ++A ++ + +G Q L +A + L+ H A + + G Sbjct: 38 ELAAQVPDRISQPRLVTEGDQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLTAGS 97 Query: 233 EEIRLHIVCDVPDELI-DFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLT 291 + R VP L + L + ++ + E K+ Y I + Sbjct: 98 RQTRALKAVTVPAGLGFPHAAQAIQLTRTSRPINKNTKKTEGKRRQRRETVYAICTLPAH 157 Query: 292 ---AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRR 326 + AT IR HW +E +L W DV + ED + R Sbjct: 158 DALPAELATWIRGHWSIEVRLRWVRDVTLGEDLHQART 195 Score = 44.4 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 23/38 (60%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVIS 39 + L+E ++ +PD R+ V H + +L + +CA+++ Sbjct: 57 DQMALLEALAQVPDLRRRRGVRHPFAALLAIAVCAMLT 94 >UniRef50_A0L378 Putative uncharacterized protein n=2 Tax=Shewanella RepID=A0L378_SHESA Length = 93 Score = 71.4 bits (173), Expect = 5e-11, Method: Composition-based stats. Identities = 29/71 (40%), Positives = 43/71 (60%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQ 60 M L+ H + I D RQ+ KV + L D+L +T+C VI+GAEGW +I D+ H ++ K+ Sbjct: 12 MYLEAFTSHFNSIADSRQSAKVTYPLHDVLFVTLCGVIAGAEGWSEIHDYAVGHHNWFKE 71 Query: 61 YGDFENGIPVH 71 G G+PV Sbjct: 72 KGILTEGVPVR 82 >UniRef50_B9TKB9 Putative uncharacterized protein n=1 Tax=Ricinus communis RepID=B9TKB9_RICCO Length = 107 Score = 70.2 bits (170), Expect = 1e-10, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 33/100 (33%), Gaps = 1/100 (1%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFL-KQYGDF 64 + S + D R+ + L +L + +++SG+ ++ F E L L + +G Sbjct: 8 FGDVFSELRDVRRAQGKRYALEPLLCAIVMSILSGSASLRKMQVFIEEQLPNLNRLFGTS 67 Query: 65 ENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD 104 P I + + + F S Sbjct: 68 WRKAPCWVAIREFLLGLDEQELERAFREHANRQVSPPPGR 107 >UniRef50_A5GAF0 Putative uncharacterized protein n=6 Tax=Deltaproteobacteria RepID=A5GAF0_GEOUR Length = 439 Score = 69.9 bits (169), Expect = 2e-10, Method: Composition-based stats. Identities = 48/378 (12%), Positives = 99/378 (26%), Gaps = 53/378 (14%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 +L L + IPD R K+ L+D+L+ ++ Q Sbjct: 13 QLGVLRCCLEHIPDQRDGAKI--SLADVLMSGYAMFDLKDPSLLAFDE-RRCRDAANLQR 69 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIA---------IDGKT 112 + + V+ + PA F + +A +DG Sbjct: 70 IYGIGKVACDTQLRTVIDPVDPAGLRPGFKTIVATLQRGKALQQLAYYEGYYLLSLDGTG 129 Query: 113 LRHSYDKSRRRG--------------AIHVISAFSTMHSLV--IG------QIKTDEKSN 150 S + S + + +V + Q + Sbjct: 130 SFGSENLSSASCLVKNKSNGKKLYYQQVLGAALVHPDSRVVIPLAPEMIIPQDGATKNDC 189 Query: 151 EITAIPELL----NMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK-GTQGRLNKA 205 E A L I+ D + +Q+ ++ K G L + Sbjct: 190 ERNASKRFLPNFREDFPRLPVIVVEDGLSSNGPHIRDLQQHNMRFILGAKPGDHPLLFEN 249 Query: 206 FEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS 265 + ++A + + + + D P LK + Sbjct: 250 LTDAIKK-----KTATTFAQIDPKNPQIMHSYCFLNDTPLN-----QANPDLK--VNFLV 297 Query: 266 FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLD--VVMNEDDCK 323 + A+ K + + + A R+ W +EN+ L E + Sbjct: 298 YEEHNAKTGKTQRFSWVTDLPITEENAYILMRGGRSRWKIENETFNTLKNQGYNLEHNYG 357 Query: 324 IRRGNAAELFSGIRHIAI 341 + + + +E F + +A Sbjct: 358 LGKEHLSENFVMLMMLAF 375 >UniRef50_B8IVV4 Transposase, IS4 family protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8IVV4_METNO Length = 123 Score = 69.5 bits (168), Expect = 2e-10, Method: Composition-based stats. Identities = 25/116 (21%), Positives = 43/116 (37%), Gaps = 2/116 (1%) Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVEN 307 + W GL + + R VR+ + S+ +E A AIR H + Sbjct: 1 MATLRTWPGLTTVLATETLR--GGNGTDSVPAQVRHSLGSSTAPSEVLAQAIRRHGALAT 58 Query: 308 KLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 W L+V E+ ++R AA + +R +A++ D A + R Sbjct: 59 GEPWVLEVSFGEERSRVRERCAARHLALLRRVALDRRRADASLTASRPAQDRGLGR 114 >UniRef50_C4RIX6 Transposase n=1 Tax=Micromonospora sp. ATCC 39149 RepID=C4RIX6_9ACTO Length = 90 Score = 69.1 bits (167), Expect = 2e-10, Method: Composition-based stats. Identities = 16/63 (25%), Positives = 24/63 (38%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 R WH+EN+LHW DV E + R G + + +R+ AI + Sbjct: 11 AQPADLQQWARLEWHIENRLHWVRDVTFGEGTHRARTGTGPAVAAVLRNTAIGFHRGNGE 70 Query: 350 FKA 352 Sbjct: 71 TNI 73 >UniRef50_B6C2C4 Putative uncharacterized protein n=1 Tax=Nitrosococcus oceani AFC27 RepID=B6C2C4_9GAMM Length = 77 Score = 67.2 bits (162), Expect = 1e-09, Method: Composition-based stats. Identities = 19/57 (33%), Positives = 31/57 (54%) Query: 316 VMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVL 372 ED+C++ A F+ +R IAI++L D+ K LR + RK A D +Y+ + Sbjct: 21 SFREDECRVHDPMAGGNFALLRKIAISLLVRDRSNKTSLRGRCRKVAWDNDYMRQLF 77 >UniRef50_Q1WLF6 Hypothetical transposase n=1 Tax=Sinorhizobium meliloti RepID=Q1WLF6_RHIME Length = 105 Score = 66.4 bits (160), Expect = 1e-09, Method: Composition-based stats. Identities = 19/74 (25%), Positives = 29/74 (39%), Gaps = 1/74 (1%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + +PD R H L+ IL + I A++ GAE D+ DFG +LK Sbjct: 1 MSRFAACFEDLPDPR-GRNARHPLTSILFIAIAAIVCGAESCTDMADFGVAKKKWLKTIV 59 Query: 63 DFENGIPVHDTIAR 76 I + Sbjct: 60 PLPYASRCWRDIRK 73 >UniRef50_A4BQC4 Putative uncharacterized protein n=2 Tax=Nitrococcus mobilis Nb-231 RepID=A4BQC4_9GAMM Length = 96 Score = 66.0 bits (159), Expect = 2e-09, Method: Composition-based stats. Identities = 21/89 (23%), Positives = 37/89 (41%), Gaps = 1/89 (1%) Query: 290 LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKV 349 +T ++ R HW + + LH+ D NED +IR G+ + + AI +L + Sbjct: 1 MTPQQVLAINRGHWSIAS-LHYISDWNYNEDRGQIRTGHGPANVTRLCRFAIGVLKHFPK 59 Query: 350 FKAGLRRKMRKAAMDRNYLASVLAGSGLS 378 + MR+ A + L + S Sbjct: 60 PGQYIPEMMRQLARRPRQVLDYLRLTAHS 88 >UniRef50_A8L7Y6 Putative uncharacterized protein n=1 Tax=Frankia sp. EAN1pec RepID=A8L7Y6_FRASN Length = 209 Score = 64.1 bits (154), Expect = 8e-09, Method: Composition-based stats. Identities = 16/64 (25%), Positives = 29/64 (45%) Query: 289 DLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDK 348 +TA T +R +W +EN++H+ D ED GN + R++AI ++ + Sbjct: 88 SVTAAYLHTHVRGNWGIENEVHYTRDAAWREDANPTYTGNTNHALASFRNLAIGVIGLNG 147 Query: 349 VFKA 352 Sbjct: 148 TRNI 151 Score = 44.4 bits (103), Expect = 0.007, Method: Composition-based stats. Identities = 14/86 (16%), Positives = 27/86 (31%), Gaps = 1/86 (1%) Query: 53 THLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKT 112 L + P T+ + I F W+ + + +AIDGK Sbjct: 20 ARLGAPLDHFRRNTRAPSKKTLRAPLKKIDVDALDATFGAWLCAQI-ARGRVALAIDGKV 78 Query: 113 LRHSYDKSRRRGAIHVISAFSTMHSL 138 LR ++ A ++ + + Sbjct: 79 LRGAWSGDESVTAAYLHTHVRGNWGI 104 >UniRef50_UPI00016C435B hypothetical protein GobsU_40172 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C435B Length = 133 Score = 62.9 bits (151), Expect = 2e-08, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 41/137 (29%), Gaps = 18/137 (13%) Query: 192 LFAVKGTQGRLNKAFEEKFPLKELN-----------NPEHDSYAISEKSHGREEIRLHIV 240 + K Q L E ++ P+ + G R+ Sbjct: 1 MLTAKDNQPGLVADIEAGLGFEDAARGLAAATSPLTGPDARATGAPGHVGGPGHGRIETR 60 Query: 241 CDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSAD---LTAEKFAT 297 L+ W GLK R++ K + V + I+S A Sbjct: 61 TVRATPLLTCHDRWTGLKHGSRITRARTV----KGVTTVEVLHGITSLTVERADARALLG 116 Query: 298 AIRNHWHVENKLHWRLD 314 +R+HW +EN+ H D Sbjct: 117 LVRSHWRIENQRHDVRD 133 >UniRef50_Q2JGX0 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2JGX0_FRASC Length = 222 Score = 62.5 bits (150), Expect = 2e-08, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 40/109 (36%), Gaps = 6/109 (5%) Query: 66 NGIPVHDTIARVVSCISPAKFHECFIN-WMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRG 124 G P + + + P + + V+ +DG T+R Sbjct: 31 PGTPAPGGVGKSCRSLDPGSLAALDAAPHRPTWRAGRVRRVLTVDGTTMR----PQHGSR 86 Query: 125 AIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNML-DIKGKIITTD 172 +H+ + +++ Q+ DEK+NE + L + D+ G +IT Sbjct: 87 HVHLPEGLAHACGVLLTQVDVDEKTNENPFVLRGLGQIPDLTGVLITAF 135 >UniRef50_UPI00016C3536 hypothetical protein GobsU_07352 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3536 Length = 130 Score = 61.8 bits (148), Expect = 5e-08, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 7/71 (9%) Query: 252 FEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAE---KFATAIRNHWHVENK 308 +WKGLK+ R++ + V + I+S + +R+HW +EN+ Sbjct: 9 QDWKGLKQGFQITRERTV----NGVTTVEVVHGITSLSADRANAGALLSLLRDHWRIENQ 64 Query: 309 LHWRLDVVMNE 319 LH+ DV + E Sbjct: 65 LHYVPDVTLGE 75 >UniRef50_A8MIZ4 Putative uncharacterized protein n=1 Tax=Alkaliphilus oremlandii OhILAs RepID=A8MIZ4_ALKOO Length = 218 Score = 59.8 bits (143), Expect = 1e-07, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 60/189 (31%), Gaps = 33/189 (17%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 + + E I+ + D R V+ +S I + + + + + + E K+ Sbjct: 16 VYDIGEKINTLKDKRVKSPVK--VSTISFVVLFGFMLQIRSFNRLNHWIE--KGKFKKVV 71 Query: 63 DFENGIPVHDTIARVVSCISPAKFHEC--------FINWMRDCHSSDDKDVIAIDGKTLR 114 + +P D++ R ++ N + + D V AIDG L Sbjct: 72 PKKTKMPCIDSVRRFLADFDLHGLKNMHSHIVKTSIKNKVFRSGTVDGLKVAAIDGVELF 131 Query: 115 HSYDKSRRRG--------------AIHVISAFSTMHSLVIGQIKTDEKSN-------EIT 153 S K + S + L++GQ + K + E+T Sbjct: 132 ESTKKCCNNCLTRVHKDEITHYFHRSVICSTVGSDPHLILGQEMLEPKRDGSNKDEGEVT 191 Query: 154 AIPELLNML 162 L+ L Sbjct: 192 GGKRLIKKL 200 >UniRef50_A7C035 Transposase n=5 Tax=Bacteria RepID=A7C035_9GAMM Length = 437 Score = 59.5 bits (142), Expect = 2e-07, Method: Composition-based stats. Identities = 45/357 (12%), Positives = 97/357 (27%), Gaps = 63/357 (17%) Query: 1 MELKKLME----HISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLD 56 + + L+ + IP K LSD L+ + ++ + Sbjct: 9 LSMPGLLSEIKNYFEKIPSPVVKQKDSISLSDCLMSGLAIFSLKYPSLLQFDN--DKRTP 66 Query: 57 FLKQYGDFENGI---PVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKD--------- 104 ++ I P + + + ++ + +R + Sbjct: 67 VVEHNLKSLYKIGIIPSDTYMRERLDELPTSELRGAYTTLIRQAQRGKVLEKFTYYNDYY 126 Query: 105 VIAIDGKTLRHSYD--------------KSRRRGAIHVISAFSTMHSLVI--------GQ 142 ++++DG S+D K + I+ H V+ Q Sbjct: 127 LVSMDGTGYFSSHDIHCDQCCEKHHRNGKITYHHQMLGIALVHPNHHHVLPLAPEPIIKQ 186 Query: 143 IKTDEKSNEITAIPELLNML----DIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVK-G 197 ++ E A LL L IIT D + + ++ Y+ K Sbjct: 187 DGVEKNDCERNAGKRLLTQLRKEYPKMKMIITEDGLASNGPHIKLLKSLNMSYILGAKPK 246 Query: 198 TQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGL 257 L + N+ + Y + + R + + D + Sbjct: 247 DHTYLFDRIK--------NSSQTKFYQTQDDDGTIHKYRYVNQVPLNESHFDLNVNFLIY 298 Query: 258 KKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLD 314 +++ + + K + I ++ T E R W +EN+ L Sbjct: 299 QEI----------SPKGKVTNFSWVTDILLSEQTLEIVMKGGRARWRIENETFNTLK 345 >UniRef50_C0BCH0 Putative uncharacterized protein n=1 Tax=Coprococcus comes ATCC 27758 RepID=C0BCH0_9FIRM Length = 435 Score = 58.7 bits (140), Expect = 3e-07, Method: Composition-based stats. Identities = 42/276 (15%), Positives = 86/276 (31%), Gaps = 25/276 (9%) Query: 51 GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS----DDKDVI 106 G T L + DF+ P + + I P F F + + + + ++ Sbjct: 54 GCTLNKELLDFFDFDVNAPTVSAYTQQRAKILPEAFEYLFHAFTEENAQTKNLYEGYQLL 113 Query: 107 AIDG------------KTLRHSYDKSRRRGAIHVISAFSTMHS-LVIGQIKTDEKSNEIT 153 A DG +TL S +H+ + + ++ + ++T E Sbjct: 114 ACDGSNLTIAPNLNDPETLWKSNQLGATGNHLHLNALYDVLNRTYIDALVQTASTYQEHR 173 Query: 154 AIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 A +++ + + I+ D +I ++G +L +K L Sbjct: 174 ACIQMIERVTLDKVILIADRGYENYNIMSHAIEKGWKFLIRIKDVHSN---GIASGLELP 230 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLC-VAVSFRSIIAE 272 + + D I R + + TF++ + +SFR + Sbjct: 231 QTAVFDMDINLILT----RNQTKSKKQAGYKFMPTVQTFDYLPIGSKEDYPISFRIARFK 286 Query: 273 QKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV + +AEK W +E Sbjct: 287 IADDSYETVITNLDRFCFSAEKLKELYHLRWGIETS 322 >UniRef50_A7BZD0 Transposase n=1 Tax=Beggiatoa sp. PS RepID=A7BZD0_9GAMM Length = 79 Score = 57.9 bits (138), Expect = 6e-07, Method: Composition-based stats. Identities = 27/48 (56%), Positives = 40/48 (83%) Query: 106 IAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEIT 153 ++ DGKTLR S+D+S + AIH++SA+++ +SLV+GQ+KTDEKSNE Sbjct: 26 LSQDGKTLRRSHDRSSDKKAIHIVSAWASANSLVLGQVKTDEKSNEHK 73 >UniRef50_A3Z283 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3Z283_9SYNE Length = 156 Score = 57.5 bits (137), Expect = 8e-07, Method: Composition-based stats. Identities = 24/96 (25%), Positives = 43/96 (44%), Gaps = 4/96 (4%) Query: 84 AKFHECFINWM-RDCHSSDDKDVIAIDGKTLRHSYD--KSRRRGAIHVISAFSTMHSLVI 140 F + WM + +D D + DGKTLR S D I +S +S + I Sbjct: 2 EAFEALLLQWMSQQPALADGVDTLVCDGKTLRGSIDQKPGAAASFIAQVSLYSQPLGVAI 61 Query: 141 GQ-IKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 Q ++S+E ++ LL+ +++ ++ D +G Sbjct: 62 AQTTYATDESSETASLLWLLSGIELTDMLVQADEVG 97 >UniRef50_B8FXU5 Transposase IS4 family protein n=3 Tax=Firmicutes RepID=B8FXU5_DESHD Length = 381 Score = 57.1 bits (136), Expect = 9e-07, Method: Composition-based stats. Identities = 42/278 (15%), Positives = 91/278 (32%), Gaps = 25/278 (8%) Query: 51 GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMR---DCHSSDDKDVIA 107 G + L + + + + I P F + R +S D ++A Sbjct: 3 GNSLSKELYDWLGYSSETATASAFVQQRDKIRPEALKLLFHEFTRLTVSENSLQDYRLLA 62 Query: 108 IDGKTLR------------HSYDKSRRRGAIHVISAFSTMHSL-VIGQIKTDEKSNEITA 154 +DG LR + + S+ +H+ + + M + V +++ + NE A Sbjct: 63 VDGSDLRLPSNSKDGFSSIRNSEDSKNYNLVHLDAMYDLMGKVYVDASVQSKKGMNEHKA 122 Query: 155 IPELLNMLDIKGKIITT-DAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLK 213 + +++ +I G +I D + Q++ Y+ K + G + L Sbjct: 123 LVSMVDQSEINGNVIAIMDRGYESFNNIAHFQEKSWYYIIRAKESYG-----IISRLSLP 177 Query: 214 ELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLK---KLCVAVSFRSII 270 + + + + +E + L I + +K + FR++ Sbjct: 178 DYPEYDEEIMLTLTRRQTKETLPLLKAYPHRYRWIQPHTTFDFIKPKDSKFYDLHFRAVR 237 Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 TV +++ D EK W +E Sbjct: 238 FAIADGVYETVYTNLNAEDFPPEKLKQLYNLRWGIETS 275 >UniRef50_A8FXM7 Putative transposase n=3 Tax=Gammaproteobacteria RepID=A8FXM7_SHESH Length = 57 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 22/46 (47%), Positives = 30/46 (65%), Gaps = 1/46 (2%) Query: 274 KKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNE 319 + + RYYISS +LTAE+ A + HW +E+ +HW LDV MNE Sbjct: 10 GNKLVLEYRYYISSKELTAEQAANTVSEHWGIES-MHWVLDVSMNE 54 >UniRef50_B8ITI9 Putative uncharacterized protein n=1 Tax=Methylobacterium nodulans ORS 2060 RepID=B8ITI9_METNO Length = 74 Score = 56.8 bits (135), Expect = 1e-06, Method: Composition-based stats. Identities = 18/64 (28%), Positives = 34/64 (53%), Gaps = 1/64 (1%) Query: 3 LKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYG 62 ++ L + D R+T +H+L IL++ +CAVI+ AE +DI +G + +L+ + Sbjct: 2 IEGLAVCFPGLEDLRETSGCDHRLIAILVIAVCAVIACAES-KDIGLYGRSKQAWLQTFL 60 Query: 63 DFEN 66 Sbjct: 61 PLPC 64 >UniRef50_UPI0001746B1F ISPg2, transposase n=1 Tax=Verrucomicrobium spinosum DSM 4136 RepID=UPI0001746B1F Length = 84 Score = 56.4 bits (134), Expect = 2e-06, Method: Composition-based stats. Identities = 20/56 (35%), Positives = 29/56 (51%) Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKE 214 L+M D+ + DA+G Q IAE+I + G DY+ A+K Q +A F E Sbjct: 17 LDMEDLAQSQLVIDAVGTQGPIAEQIIEAGADYVLALKANQPSALQAVSAHFKEAE 72 >UniRef50_Q2J572 Putative uncharacterized protein n=1 Tax=Frankia sp. CcI3 RepID=Q2J572_FRASC Length = 148 Score = 55.2 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 15/63 (23%), Positives = 20/63 (31%), Gaps = 1/63 (1%) Query: 8 EHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENG 67 E IPD R V H+L +L L AV+ G G + + Sbjct: 70 ECFEQIPDPRDPRGVRHRLPVVLSLC-AAVLCGESGLAGVAAWVAAEGPGDPTGEGCRWP 128 Query: 68 IPV 70 P Sbjct: 129 RPG 131 >UniRef50_B1C560 Putative uncharacterized protein n=3 Tax=Clostridium spiroforme DSM 1552 RepID=B1C560_9FIRM Length = 399 Score = 55.2 bits (131), Expect = 3e-06, Method: Composition-based stats. Identities = 45/274 (16%), Positives = 86/274 (31%), Gaps = 29/274 (10%) Query: 56 DFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSD---DKDVIAIDGKT 112 D L ++ DF P + S I P F F + + ++AIDG Sbjct: 21 DELLKFNDFSITTPSASAFVQARSKIKPEAFRTLFDGFNKKTFKKKLYHGYRLLAIDGSE 80 Query: 113 LRHSYDKSRRRGAIHV-------ISAFSTMHSLVIGQIKTDE-------KSNEITAIPEL 158 L + SA+ S + + D+ K +E A +L Sbjct: 81 LPIDNTIFDDETTVLRHGTLAKTFSAYHLNASYDLMERTYDDIIIQGEAKRDEHGAFCQL 140 Query: 159 LNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKG--TQGRLNKAFEEKFPLKELN 216 ++ D + I D + E + G YL V+ +Q + K+ FP E + Sbjct: 141 VDRYDGQKAIFIADRGYESYNGFEHVVHSGHKYLIRVRDIESQSSITKSL-GPFPDGEFD 199 Query: 217 NPEHDSYAISEKSHGREEIRLHIVCDVPDELID--FTFEWKGLKKLCVAVSFRSIIAEQK 274 + ++ ++ C + + F++ + + R + + Sbjct: 200 VDVSRMLTL-------KQTKMIKACPDVYKFVPKNMRFDFMNKQNPWYEFNCRVVRLKIT 252 Query: 275 KEPEMTVRYYISSADLTAEKFATAIRNHWHVENK 308 + TV +S + + E W E Sbjct: 253 ENTYETVITNLSRNEFSMEDICEIYNMRWGEETS 286 >UniRef50_B5HUT6 Putative uncharacterized protein n=1 Tax=Streptomyces sviceus ATCC 29083 RepID=B5HUT6_9ACTO Length = 130 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 19/84 (22%), Positives = 32/84 (38%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L +S IPD R+ + L +L L + AV+ GA I F L++ Sbjct: 45 SLAGTLSRIPDPRRVRGRRYHLGSLLALCLVAVLGGARSLATIARFAADTNSSLREQLGL 104 Query: 65 ENGIPVHDTIARVVSCISPAKFHE 88 + P T+ + + + E Sbjct: 105 ASSTPNASTLGGLRANLKDEWVRE 128 >UniRef50_UPI00016C36C2 transposase for IS2404 n=1 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C36C2 Length = 109 Score = 55.2 bits (131), Expect = 4e-06, Method: Composition-based stats. Identities = 17/60 (28%), Positives = 25/60 (41%), Gaps = 3/60 (5%) Query: 268 SIIAEQKKEPEMTVRYYISSAD---LTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKI 324 + + + V Y I+S A R HW +EN LH+ DV + ED C + Sbjct: 5 ERRRKANGKATVEVVYGITSLSRLAADAAALLGYSRRHWGIENGLHYTRDVTLGEDRCPV 64 >UniRef50_A1VX04 Putative uncharacterized protein n=1 Tax=Polaromonas naphthalenivorans CJ2 RepID=A1VX04_POLNA Length = 100 Score = 52.1 bits (123), Expect = 3e-05, Method: Composition-based stats. Identities = 16/66 (24%), Positives = 38/66 (57%), Gaps = 4/66 (6%) Query: 6 LMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDI----EDFGETHLDFLKQY 61 L++ SI+PD R + L +++++T+ AV+ GA+ W D+ + +G++ + +++ Sbjct: 2 LIQAFSILPDPRTGPAQRYDLREMIVMTLSAVLCGADNWVDVPVGSKKYGDSCMQVVREK 61 Query: 62 GDFENG 67 +G Sbjct: 62 CCLTSG 67 >UniRef50_A4X2F9 Putative uncharacterized protein n=1 Tax=Salinispora tropica CNB-440 RepID=A4X2F9_SALTO Length = 143 Score = 51.0 bits (120), Expect = 7e-05, Method: Composition-based stats. Identities = 14/64 (21%), Positives = 23/64 (35%) Query: 4 KKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGD 63 L + +PD V H+L+ +L+ ICAV + I ++ G Sbjct: 13 AGLPAALLDLPDPLCRLGVLHRLTVVLIAAICAVAVSNRSYTAIAEWFPDVPAATGARGG 72 Query: 64 FENG 67 G Sbjct: 73 HRPG 76 >UniRef50_C7GHC1 Transposase, IS4 family (Fragment) n=6 Tax=Roseburia intestinalis L1-82 RepID=C7GHC1_9FIRM Length = 232 Score = 51.0 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 57/201 (28%), Gaps = 27/201 (13%) Query: 137 SLVIGQIK------TDEKSNEITAIPELLNMLDIK----GKIITTDAMGCQKDIAEKIQK 186 +++GQ + + E+T L+ L + +I DA+ +++ Sbjct: 8 HVILGQEMLKPRDGSGKDEGELTGGKRLIERLKKRHGHFADVIVADALYLNAPFINTLKE 67 Query: 187 QGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDE 246 G + + +K + + + E F E + + K + E+ + Sbjct: 68 NGLEGVIRLKDERRMIFQDAERLFKQDE--GKKASFW----KGKKKIEVWDLSGFKMEGC 121 Query: 247 LIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVE 306 V + E KE E + + + W +E Sbjct: 122 PYK-----------LRVVRYHEQWEENGKETERFMWLVTTLEAADYRVLWEMMHRRWDIE 170 Query: 307 NKLHWRLDVVMNEDDCKIRRG 327 +L + C R Sbjct: 171 ENGFHQLKTYYHAKHCYCRDA 191 >UniRef50_B0G346 Putative uncharacterized protein n=1 Tax=Dorea formicigenerans ATCC 27755 RepID=B0G346_9FIRM Length = 443 Score = 51.0 bits (120), Expect = 8e-05, Method: Composition-based stats. Identities = 39/313 (12%), Positives = 92/313 (29%), Gaps = 42/313 (13%) Query: 19 TWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVV 78 + + +++ + + ++ + +GIP + + Sbjct: 35 SRNRKLPFEEVIRFLLPLQGQCMD-----QELFRHFSKKPLFFSTDYSGIPHSSAMIQAR 89 Query: 79 SCISPAKFHECFINWMRDCHS---SDDKDVIAIDGK------------TLRHSYDKSRRR 123 +S + F ++ C ++AIDG R + S+ R Sbjct: 90 QKLSDSAMPALFHSFTETCKKGALFQGYQLLAIDGSQFSVPENLKEPLCWRKIPNISKGR 149 Query: 124 GAIHVISAFSTMHSL---VIGQIKTDEKSNEITAIPELLNMLDIK-GKIITTDAMGCQKD 179 IH+ + + + V+ Q + NE A+ ++++ I D + Sbjct: 150 NVIHLNAMYHLQSGIFEDVVFQPICE--CNEHKALAQMVDRRSSAFPAIFMADRGYESYN 207 Query: 180 IAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSH-----GREE 234 I+++G Y+ + + + P E + E + Y + S R+ Sbjct: 208 TFAHIEQKGDKYVVRGRESGTGICSGL--NLPDTEEYDIEKELYICKKHSKKVKTNPRKY 265 Query: 235 IRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEK 294 R+ D L R + + + + +S +A+ Sbjct: 266 KRIRSDATFDFFTDDCEEYRLNL---------RIVKIKLSETTTEVLFTNLSKEKFSADD 316 Query: 295 FATAIRNHWHVEN 307 W +E Sbjct: 317 LKRLYHMRWGIET 329 >UniRef50_Q0TLA0 H repeat-associated protein of Rhs element n=4 Tax=Escherichia RepID=Q0TLA0_ECOL5 Length = 59 Score = 50.6 bits (119), Expect = 1e-04, Method: Composition-based stats. Identities = 38/59 (64%), Positives = 39/59 (66%) Query: 1 MELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLK 59 MELKKLMEHISIIPDYRQ WKVEHKL DIL + FGETHLDFLK Sbjct: 1 MELKKLMEHISIIPDYRQAWKVEHKLPDILSVNYLRRYFWCIMLGRYRGFGETHLDFLK 59 >UniRef50_Q7MY60 Similarities with the N-terminal region of H repeat-associated protein homolog n=1 Tax=Photorhabdus luminescens subsp. laumondii RepID=Q7MY60_PHOLL Length = 83 Score = 50.2 bits (118), Expect = 1e-04, Method: Composition-based stats. Identities = 31/59 (52%), Positives = 34/59 (57%) Query: 58 LKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHS 116 LKQYG FE GI HDTI +VSCIS F + FI WM C A DGKT+R S Sbjct: 12 LKQYGKFEQGITTHDTIVHMVSCISTKLFQKYFIKWMNICRELMKSSGSATDGKTVRRS 70 >UniRef50_B0JNZ6 Transposase n=20 Tax=Cyanobacteria RepID=B0JNZ6_MICAN Length = 382 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 50/320 (15%), Positives = 89/320 (27%), Gaps = 59/320 (18%) Query: 53 THLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVI------ 106 + IP + I ++ P F ++ + VI Sbjct: 4 SKGKDNASSLFGIKKIPGDNQIRNLL---DPIPAATIFGSFQQVYQWLKKPGVIKKFFYL 60 Query: 107 ------AIDGKTLRHSYD-----------KSRRRGAIHVIS---AFSTMHSLVIG----- 141 A+DG S ++ H S VI Sbjct: 61 DEEILIALDGTEYFSSKKISCPHCNCRNPRNGTTTYFHGCVTPIVVSPEQKQVINLEPEF 120 Query: 142 ---QIKTDEKSNEITAIPELLNMLDIK--GKIITT--DAMGCQKDIAEKIQKQGGDYLFA 194 Q ++ E A+ L+ K G +T D + ++ I E KQG +++F Sbjct: 121 IKKQDGQQKQDCENAAVKRWLDKNHQKKYGYPVTLLGDDLYSRQPICELALKQGYNFIFV 180 Query: 195 V-KGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFE 253 + + L + E E+ E + R V VP ++ + E Sbjct: 181 CLETSHKTLYEWREFLEKSGEVKTVEKKQW----DGRKNLIYRYRYVSRVPLREVESSLE 236 Query: 254 WKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKL---- 309 + + + II + + EK A R+ W VEN+ Sbjct: 237 VNWCEVTVINEKTQKIIYQNNWITNHQ------ITENNVEKIVKAGRSRWKVENEGNNVL 290 Query: 310 --H-WRLDVVMNEDDCKIRR 326 H + L+ + Sbjct: 291 KNHGYNLEHNFGHGQSHLCE 310 >UniRef50_Q3C2E7 Transposase n=1 Tax=Streptomyces sp. TP-A0584 RepID=Q3C2E7_9ACTO Length = 72 Score = 49.4 bits (116), Expect = 2e-04, Method: Composition-based stats. Identities = 14/45 (31%), Positives = 25/45 (55%) Query: 134 TMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQK 178 T + + Q++ E +NEIT LL+ D++ +T DA+ Q+ Sbjct: 2 TGTGMTVTQLRVPENTNEITCFAALLDPYDLREVTVTGDALHTQR 46 >UniRef50_A3ZZ13 Putative uncharacterized protein n=1 Tax=Blastopirellula marina DSM 3645 RepID=A3ZZ13_9PLAN Length = 75 Score = 48.7 bits (114), Expect = 4e-04, Method: Composition-based stats. Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 1/55 (1%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLD 56 EL++L ++ + D R HKL +++L+ +CAVI+GA+G IE + L Sbjct: 19 ELRELAKYFQSLDDSRSHVNQRHKLVNVVLIAMCAVIAGADGPTAIE-WLAGRLQ 72 >UniRef50_UPI00016C3A7B hypothetical protein GobsU_22362 n=5 Tax=Gemmata obscuriglobus UQM 2246 RepID=UPI00016C3A7B Length = 481 Score = 48.3 bits (113), Expect = 4e-04, Method: Composition-based stats. Identities = 42/281 (14%), Positives = 87/281 (30%), Gaps = 55/281 (19%) Query: 51 GETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIA--- 107 G L+++ ++G H+ + I P E F+ + D ++ +V+A Sbjct: 85 GSGRQAHLRRHRQPDDG--CHEAFYGKLRRI-PRGLSEAFLRDVTDRFTALFPEVVAHRL 141 Query: 108 -----------IDGKTLRHSYDKSRRRGAI-------HVISAFSTMHSLVIG-QIKTDEK 148 +DGK+L+ + ++ A+ LV+ D + Sbjct: 142 PTSFDRLEVLILDGKSLKKVAKRLVDTRGTPGKLLGGKLLVAYRPRDGLVLDMAADLDGE 201 Query: 149 SNEITAIPELLNMLDIKG---KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKA 205 +NE IP+L+ + +G K++ D + C + K G ++ + Sbjct: 202 TNEAKLIPDLMPRVHARGGPAKLVVGDRLFCASKHFAEFTKDNGHFVV----------RY 251 Query: 206 FEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVS 265 + P + S+++ V + + Sbjct: 252 ARTLSFEPDPKRPAVTTADPSQRA----------VVEEWGWAGKPKDK-------LRRYV 294 Query: 266 FRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVE 306 R +A E + + SA A R W +E Sbjct: 295 RRITVARPVGEAITILTDLLDSAPYPATDLLDLYRIRWTIE 335 >UniRef50_C7G6U9 Putative uncharacterized protein (Fragment) n=7 Tax=Clostridiales RepID=C7G6U9_9FIRM Length = 212 Score = 47.9 bits (112), Expect = 6e-04, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 65/214 (30%), Gaps = 36/214 (16%) Query: 2 ELKKLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 + K+ + I + D R+ + L +I++ + ++ E + I F + Sbjct: 3 SVYKIPQKIKCLTDERKRKSI--PLFNIVMPVLLFLMLQYESFHTI--FSAPESMSKRLK 58 Query: 62 GDFENGIPVHDTIARVVSCISPAKFHECF--------INWMRDCHSSDDKDVIAIDGKTL 113 IP D + ++S I+P + N + + V +DG L Sbjct: 59 NCISGRIPKVDAVRDLLSRINPDEIRSIHEEMIDIIKRNRVFREGTIGGYVVAGLDGVEL 118 Query: 114 RHSYDKSRRRG--------------AIHVISAFSTMHSLVIGQIK------TDEKSNEIT 153 S KS V +++GQ + + E+T Sbjct: 119 FSSTKKSCPNCLSRKKHTGETEYFYRSVVCMIIGKSPHVILGQEMLKPRDGSGKDEGELT 178 Query: 154 AIPELLNMLDIK----GKIITTDAMGCQKDIAEK 183 L+ L + +I DA+ Sbjct: 179 GGKRLIERLKKRHGHFADVIVADALYLNAPFINT 212 >UniRef50_A8LF21 Transposase IS4 family protein n=5 Tax=Frankia RepID=A8LF21_FRASN Length = 420 Score = 47.5 bits (111), Expect = 7e-04, Method: Composition-based stats. Identities = 38/315 (12%), Positives = 88/315 (27%), Gaps = 43/315 (13%) Query: 18 QTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARV 77 + K ++ T+ + + ++++ L +L + + +P I++ Sbjct: 47 EQRKRLLPARVVVYFTMAMCLFFDDDYDEVMRRLVGTLRWLGSWKG-DWKVPSTGAISQA 105 Query: 78 VSCISPAKFHECFINWMRDCHSSD-------DKDVIAIDGKTL--------------RHS 116 + + P F + ++A+DG L Sbjct: 106 RTRLGPEPLKLLFERVAVPVAGLGTKGAWLGSRRLVAVDGVHLDTADTPENADAFGRFSH 165 Query: 117 YDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGC 176 K+ +HV++ V S+E + L + + G ++T D Sbjct: 166 GPKTAAFPQVHVVALAECGTHAVFAAAIGAYTSDERSLAATLFDACE-PGMLLTADRNFY 224 Query: 177 QKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEIR 236 + ++ G D L+ V L P + D R Sbjct: 225 GYGLWQQALATGADLLWRVNAN---LTLPVIRALPDGSYLSLLIDPKIPV--------AR 273 Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISS----ADLTA 292 + + + + ++ + I++ D+ A Sbjct: 274 RGQLIADAR-----AGHAPPTESALPVRVIEYSVPDHEENGTSELICLITNILDPTDVAA 328 Query: 293 EKFATAIRNHWHVEN 307 + ATA W +E+ Sbjct: 329 IELATAYHERWEIES 343 >UniRef50_A7BWL6 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BWL6_9GAMM Length = 94 Score = 47.5 bits (111), Expect = 0.001, Method: Composition-based stats. Identities = 20/55 (36%), Positives = 33/55 (60%), Gaps = 1/55 (1%) Query: 8 EHISIIPDYRQTW-KVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 EH +PD R+ + HK DIL++ ICA+I GA+ W + +FG+ D+ + + Sbjct: 40 EHFKSLPDPRRRTMNLRHKFIDILIIAICAIICGADSWVAVAEFGKAKEDWFRVF 94 >UniRef50_B0R8M6 Transposase (ISH5) n=9 Tax=Halobacteriaceae RepID=B0R8M6_HALS3 Length = 449 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 44/355 (12%), Positives = 106/355 (29%), Gaps = 33/355 (9%) Query: 19 TWKVEHKLSDILLLTICAVISGAEG--------WEDIEDFGETHLDFLKQYGDFENGIPV 70 + + + + +G++ + ++ D + + + + Sbjct: 36 ERERKFDIVALFYTLSFGFAAGSDRSLQAFLERYVEMADCDDLSYAAFHDWFEPGFVALL 95 Query: 71 HDTIARVVSCISP--AKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHV 128 + + + + A + + + D + + + + +H+ Sbjct: 96 REILDDAIENLDTGRADLSGRLERFRDVLIADATIVSLYQDAADVYAATGEDQAELKLHL 155 Query: 129 ISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQG 188 I + ST TD ++E + +P + +I D + ++I + G Sbjct: 156 IESLSTGLPTRF--RTTDGTTHERSQLP---TGEWVADALILLDLGFYDFWLFDRIDQNG 210 Query: 189 GDYLFAVKGTQGRLNKAFEEKFPLKELNNP-EHDSYAISEKSHGREEIRLHIVCDVPDEL 247 G ++ VK + EE + + P E +S R+EI + I + Sbjct: 211 GWFVSRVKDNAN--FEIVEELRTWRGNSIPLEGESLQAVLDDLQRQEIDVRITLSFERK- 267 Query: 248 IDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSA---DLTAEKFATAIRNHWH 304 + + + + + E Y+++ D +A A R W Sbjct: 268 ----------RGSGASATRTFRLVGLRNEETEEYHLYLTNLGNDDYSAPDIAQLYRARWE 317 Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMR 359 VE L L D+ E + I++ + L + R Sbjct: 318 VE-LLFKELKSRFGLDEINTTDAYIIEALIIMAAISLMMSRVIVDELRSLEARQR 371 >UniRef50_A6X872 Transposase IS4 family protein n=13 Tax=Proteobacteria RepID=A6X872_OCHA4 Length = 330 Score = 47.1 bits (110), Expect = 0.001, Method: Composition-based stats. Identities = 33/229 (14%), Positives = 77/229 (33%), Gaps = 28/229 (12%) Query: 17 RQTWK--VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTI 74 R+T + I IC + + E + L + + E +P H T Sbjct: 53 RKTRGGQCRYSDLAIETTLICGKV-----FNQPLRQTEGLMASLLRLLNVELPVPDHTTF 107 Query: 75 ARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLR---------HSYDKSRRRGA 125 +R + + + C +D+ + +D ++ + R Sbjct: 108 SRRCANLVVSSLTRCTRR-----DGTDEPLHVIVDSTGMKIYEAGQWLEEKHGAKSARKW 162 Query: 126 IHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQ 185 + + A + VI + TD+ +++++ +P+LL+M+D D + ++ Sbjct: 163 LKLHLAIDADSNQVIAETLTDQNTSDLSQVPDLLDMIDRPIACFMADGAYDSDQTYQALR 222 Query: 186 KQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREE 234 + R+ E + + + D ++ + GR E Sbjct: 223 SHSPGVSIII---PPRIRDLQEASYGPPD----QRDWHSRTNAQRGRME 264 >UniRef50_Q5P672 Small of transposase gene (Fragment) n=1 Tax=Aromatoleum aromaticum EbN1 RepID=Q5P672_AZOSE Length = 47 Score = 46.7 bits (109), Expect = 0.001, Method: Composition-based stats. Identities = 15/31 (48%), Positives = 18/31 (58%) Query: 302 HWHVENKLHWRLDVVMNEDDCKIRRGNAAEL 332 HW VEN LHW L+V NED ++R A Sbjct: 1 HWGVENWLHWCLNVQFNEDRSRVRSAYAVNN 31 >UniRef50_Q0AI67 Putative uncharacterized protein n=1 Tax=Nitrosomonas eutropha C91 RepID=Q0AI67_NITEC Length = 94 Score = 46.4 bits (108), Expect = 0.002, Method: Composition-based stats. Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 11/61 (18%) Query: 5 KLMEHISIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 +L + I D RQ K H L +L++TI +I + LD+L+QY Sbjct: 34 RLADVFVSITDPRQ-RKSRHDLVKVLVITI----------NEILAWANEKLDWLRQYLKL 82 Query: 65 E 65 Sbjct: 83 T 83 >UniRef50_C9L6S8 Transposase n=1 Tax=Blautia hansenii DSM 20583 RepID=C9L6S8_RUMHA Length = 107 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 15/53 (28%), Positives = 25/53 (47%) Query: 47 IEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHS 99 + F + + ++ D + G P DT+ RV + I P KF E F +W+ Sbjct: 1 MAFFMKLQEPYFEKILDLKYGTPSADTLLRVFAIIEPEKFMEMFYHWILFLMQ 53 >UniRef50_Q745Z8 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z8_THET2 Length = 77 Score = 46.0 bits (107), Expect = 0.002, Method: Composition-based stats. Identities = 17/59 (28%), Positives = 35/59 (59%), Gaps = 1/59 (1%) Query: 305 VENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAM 363 +EN+ W DV++ E+ C++R G A++ + +R +++L V + R++ KAA+ Sbjct: 1 MENRSFWVRDVLLYEEACQVR-GVGAQVLAALRAFLVSLLHRRGVREKVTRQRTLKAAL 58 >UniRef50_A1RCW9 IS1380 family transposase n=2 Tax=Bacteria RepID=A1RCW9_ARTAT Length = 436 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 34/207 (16%), Positives = 68/207 (32%), Gaps = 19/207 (9%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDFENGIPV 70 I+PD R +V+H L +L I A+ +G E D + G H L+ + + Sbjct: 49 KIVPDRRDPGRVQHGLQTLLAQRIYALAAGYEDLNDHD--GLRHDYALQTAVNRLQPLAG 106 Query: 71 HDTIARVVSCISPAKFHECFI----NWMRDCHSSDDKDVIAIDGKTLRHSYDKSRR---- 122 T+ R+ + +++ + + V+ D + D+ R Sbjct: 107 KSTLGRLEQQADRETVVQAHRLLWEHFIAQHDQAPAEIVLDFDATDVPVHGDQEGRFFHG 166 Query: 123 ---RGAIHVISAFSTMHSLVIGQIKTDEKSNEIT-AIPELLNMLDIKGK-----IITTDA 173 + F H LV ++ + AI LL + + D Sbjct: 167 YYDHYCFLPLYVFCGRHLLVSYLRPSNIDGARHSWAILALLVKFIRRFWPETRIVFRGDG 226 Query: 174 MGCQKDIAEKIQKQGGDYLFAVKGTQG 200 C+ + + ++ DY+ + Sbjct: 227 GFCRHRMLDWCDRKQVDYVVGLARNTR 253 >UniRef50_A3YV03 Putative uncharacterized protein n=1 Tax=Synechococcus sp. WH 5701 RepID=A3YV03_9SYNE Length = 113 Score = 45.6 bits (106), Expect = 0.003, Method: Composition-based stats. Identities = 13/76 (17%), Positives = 30/76 (39%), Gaps = 2/76 (2%) Query: 271 AEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAA 330 + +P +++S T + +R+ W +EN H+ + ++E + N A Sbjct: 7 GTRGCKPFKATHLFLTSLSSTPKTLLQLVRDRWSIENW-HFFRNTQLHESAH-GYQDNGA 64 Query: 331 ELFSGIRHIAINILTN 346 + + N+L Sbjct: 65 CAMTTQKTGTQNLLRL 80 >UniRef50_C0Q104 Putative uncharacterized protein n=3 Tax=Salmonella enterica RepID=C0Q104_SALPC Length = 177 Score = 45.2 bits (105), Expect = 0.004, Method: Composition-based stats. Identities = 24/50 (48%), Positives = 27/50 (54%), Gaps = 13/50 (26%) Query: 309 LHWRLDVVMNEDDCKIRRGNAAELF-------------SGIRHIAINILT 345 +HWRLDV MNEDDC+IRRGN F +R I INIL Sbjct: 1 MHWRLDVAMNEDDCRIRRGNVKSFFEIIKSGEYEIWGCEIMRWIRINILK 50 >UniRef50_Q745Z7 Putative uncharacterized protein n=1 Tax=Thermus thermophilus HB27 RepID=Q745Z7_THET2 Length = 112 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 18/111 (16%), Positives = 36/111 (32%), Gaps = 7/111 (6%) Query: 41 AEGWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSS 100 + +E F + L G ++ + P K E + + Sbjct: 1 MDSLRGVERFARANPHLLPHLGLRNPPGHTLL--PLLLHRLDPKKLQEALHQVFPE---A 55 Query: 101 DDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNE 151 D V+ +DGK LR S + + ++ + + Q + + K E Sbjct: 56 DLGGVLVVDGKHLRGS--GKGKSPQVRLVEVLALHLKTTLAQARVEGKVVE 104 >UniRef50_Q1PUW9 Hypothetcal protein n=1 Tax=Candidatus Kuenenia stuttgartiensis RepID=Q1PUW9_9BACT Length = 61 Score = 44.8 bits (104), Expect = 0.006, Method: Composition-based stats. Identities = 11/61 (18%), Positives = 23/61 (37%), Gaps = 2/61 (3%) Query: 312 RLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASV 371 D ED +IR NA + ++++ + + V + + +R A + Sbjct: 1 MRDTSFREDHSQIRTQNAPRAMASLKNLVVGLFHFLNVPN--IAKTLRNFAARPFLALQM 58 Query: 372 L 372 L Sbjct: 59 L 59 >UniRef50_A7B831 Putative uncharacterized protein n=5 Tax=Clostridiales RepID=A7B831_RUMGN Length = 366 Score = 43.7 bits (101), Expect = 0.012, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 56/182 (30%), Gaps = 17/182 (9%) Query: 53 THLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINW---MRDCHSSDDKDVIAID 109 + L Y F P + + + F F + D ++A D Sbjct: 64 SLKKELLDYFQFSVDTPSASAFCQQRNKLLLEAFQFLFYEFNSCFSFEKKYKDYQLLACD 123 Query: 110 GKTLRHSYDK------------SRRRGAIHVISAFS-TMHSLVIGQIKTDEKSNEITAIP 156 G L + + R IH+ + F + I+ NE A+ Sbjct: 124 GSDLNIARNPNDAGTYFQSQPTDRGFNQIHLNALFDLCEKRYIDLVIQPARLENESLAMT 183 Query: 157 ELLNMLDI-KGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKEL 215 ++++ K I D +I +Q++G YL VK G + E Sbjct: 184 QMIDRYKGEKKTIFIADRGYETYNIFAHVQEKGMYYLIRVKDGGGGSMTGSFDLPDENEF 243 Query: 216 NN 217 ++ Sbjct: 244 DH 245 >UniRef50_A4BVT6 Putative uncharacterized protein n=1 Tax=Nitrococcus mobilis Nb-231 RepID=A4BVT6_9GAMM Length = 120 Score = 42.9 bits (99), Expect = 0.017, Method: Composition-based stats. Identities = 11/94 (11%), Positives = 27/94 (28%), Gaps = 4/94 (4%) Query: 3 LKKLMEHISIIPDYRQTWK-VEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQY 61 L+ + + D R + L+D L + + +D + + Sbjct: 14 LRTVRACFEALDDPRSRPNSTRYTLADALSSALAMFLLKYPSLLQFDDSARAADEVTRHN 73 Query: 62 GDFENGI---PVHDTIARVVSCISPAKFHECFIN 92 G+ P + ++ + P+ F Sbjct: 74 LGTLYGVEQVPCDTQMRAILDPLKPSTLRGAFRA 107 >UniRef50_A7JYJ5 Putative uncharacterized protein n=1 Tax=Vibrio sp. Ex25 RepID=A7JYJ5_VIBSE Length = 47 Score = 42.9 bits (99), Expect = 0.019, Method: Composition-based stats. Identities = 13/48 (27%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 315 VVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAA 362 + + EDD + R AE S IR +N++ + K L + ++A Sbjct: 1 MNLKEDDLRNRVAGGAENVSVIRRFTLNLVRL-QSKKYSLGAEAKQAG 47 >UniRef50_A7BZU0 Putative uncharacterized protein n=1 Tax=Beggiatoa sp. PS RepID=A7BZU0_9GAMM Length = 201 Score = 42.1 bits (97), Expect = 0.033, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 9/128 (7%) Query: 237 LHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRY-YISSADLT---- 291 + V + + + + ++ + ++ + R+ +ISS L Sbjct: 38 FYWVNGIHYAYGNNKGIL--ISVVVCEEKWQEVDPNTGEKLDKNSRHVWISSQFLNKHDV 95 Query: 292 AEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHI--AINILTNDKV 349 E+ R+ W +EN ++ + A + + + + A+N L + Sbjct: 96 HERCNLGARSRWGIENSINTEKRRGYCYEHPFSYDFTAMQNYHYLMRMAHALNALALNTK 155 Query: 350 FKAGLRRK 357 A RK Sbjct: 156 LGAKFVRK 163 >UniRef50_A1TX01 Transposase, IS4 family protein n=5 Tax=Marinobacter aquaeolei VT8 RepID=A1TX01_MARAV Length = 433 Score = 42.1 bits (97), Expect = 0.037, Method: Composition-based stats. Identities = 32/217 (14%), Positives = 69/217 (31%), Gaps = 20/217 (9%) Query: 6 LMEHISII-PDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGETHLDFLKQYGDF 64 L + ++ + D R V HKL ++ + V +G E D E L+ Sbjct: 43 LTQRLATVLDDTRNPVLVRHKLQTMIRQRVFGVAAGYEDLNDHETLRAD--QALQTATGE 100 Query: 65 ENGIPVHDTIARVVSCISPAKF----HECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKS 120 E + T+ R+ + + +++ + + V+ DG + D+ Sbjct: 101 EAILAGKSTLCRMEQRVDRQAVVKAHELLWHHFIEQHETPPKEIVLDFDGTDVPVHGDQP 160 Query: 121 RR------RGAIHVISAFSTMHSLVIGQIKTDEKSNEIT--AIPELLNML-----DIKGK 167 + + L++ ++T +S+ AI LL Sbjct: 161 GKFFNAYYDHHCYFPLYVFCGRHLLVSYLRTSNRSDSRHSWAILALLVKFIRQYWPDTRI 220 Query: 168 IITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNK 204 + D+ + + + DYL + L + Sbjct: 221 VFRGDSGFYRPRLLSWCDRNNVDYLVGISKNSRLLKE 257 >UniRef50_A1SU90 Putative uncharacterized protein n=2 Tax=Gammaproteobacteria RepID=A1SU90_PSYIN Length = 52 Score = 41.7 bits (96), Expect = 0.041, Method: Composition-based stats. Identities = 7/47 (14%), Positives = 20/47 (42%), Gaps = 1/47 (2%) Query: 327 GNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLA 373 + A + +R +N++ + K + K++ + D + +L Sbjct: 3 EDGARNLATVRRSLLNLIKEH-LLKDSVVGKIQLSCWDTEFSLEILF 48 >UniRef50_A5F3X4 Rfbqrso22-2 n=4 Tax=Bacteria RepID=A5F3X4_VIBC3 Length = 41 Score = 41.7 bits (96), Expect = 0.043, Method: Composition-based stats. Identities = 16/30 (53%), Positives = 24/30 (80%) Query: 128 VISAFSTMHSLVIGQIKTDEKSNEITAIPE 157 +++A +T + + IGQ+K D KSNEITAIP+ Sbjct: 1 MVNALATANGMSIGQLKVDSKSNEITAIPK 30 >UniRef50_A5FU21 Transposase, IS4 family protein n=11 Tax=Alphaproteobacteria RepID=A5FU21_ACICJ Length = 448 Score = 41.0 bits (94), Expect = 0.067, Method: Composition-based stats. Identities = 54/386 (13%), Positives = 112/386 (29%), Gaps = 58/386 (15%) Query: 11 SIIPDYRQTWKVEHKLSDILLLTICAVISGAEGWEDIEDFGET-HLDFLKQYGDFENGIP 69 + I D R +V+H L +I+ + + +G E D + + + Sbjct: 55 ACIDDPRTPERVQHGLDEIIRFRMLMIAAGYEDGNDADRLRNDPMFKLAMERLPEAGDLC 114 Query: 70 VHDTIARVVSCISPAKFHE----CFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGA 125 TI+R + P ++ + ++ V+ ID ++D + Sbjct: 115 SQATISRTENLPGPRALLRMGLAMVEHYCASFRTIPNRVVLDID-----DTFDAAHGAQQ 169 Query: 126 IHVISAFSTM-----------HSLVIGQI---KTDEKSNEI-TAIPELLNML----DIKG 166 + + +A ++ + K ++I + L++ + Sbjct: 170 LCLFNAHHDEYGFQPIVVFDGDGRMLAAVLRPACRPKGSQIVKWLRRLIDAIRSHWPRTA 229 Query: 167 KIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEK-----FPLKELNNPEHD 221 ++ D+ C ++ + + DY+F V L K ++ + Sbjct: 230 IMLRGDSHYCTPEVLRFCRARRLDYIFGV-APTTTLRKHVIALEASTTARAQQAPGEKIR 288 Query: 222 SYAISEKS---HGREEIRLHIVCDVPDELIDFTFEWKGLKK---------LCVAVSFRSI 269 + R E R+ + +D F LK + A Sbjct: 289 RFKEFNDGAASWDRVE-RIIARVEAGPMGVDTRFIVTSLKAGSPRTLYQEIYCARGQAEN 347 Query: 270 IAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNA 329 + K R S A + I +W L W L +M R Sbjct: 348 HIKAWKTHLAADRTSCSRASANQMRLFLHIGAYW-----LMWSLRSLM-----PRRSRWR 397 Query: 330 AELFSGIRHIAINILTNDKVFKAGLR 355 F +R I + + K +R Sbjct: 398 GIQFDTLRLRLIKLAVRLETLKRSIR 423 >UniRef50_UPI0001905F7C InsL n=9 Tax=Rhizobium etli RepID=UPI0001905F7C Length = 367 Score = 41.0 bits (94), Expect = 0.073, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 87/286 (30%), Gaps = 23/286 (8%) Query: 27 SDILLLTICAVISGAEGWEDIEDFGETH-LDFLKQYGDFENGIPVHDTIARVVSCISPAK 85 +D+L L V+ G + + + L + + D + +VS + + Sbjct: 43 ADLLRLCFAYVLGGF-SLRTLAAWADQRGLASMSDVAMLKRLKASADWVGYLVSELLAER 101 Query: 86 FHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIKT 145 E F D ++A+D + ++ + L + ++ Sbjct: 102 CPEAFA------GVHSDLRLMAVDATV----VAPPGPKRDYWMVHTVFDLSRLKLSSVEV 151 Query: 146 DEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGTQGRLNKA 205 ++ E + + G++ D + + K G D+L RL Sbjct: 152 TDRR-EAERLSRGVK----AGELRIADRAHAKATDLAAVVKAGADFLVRAPSNYPRLLDG 206 Query: 206 -----FEEKFPLKELNNPEHDSYAISEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKL 260 + + D + + E+ V +P + ++L Sbjct: 207 DGQLLERLALCREAGDKGVLDRSVRIQDGKSKVEV-AARVVILPLPPEAAAKARRAARRL 265 Query: 261 CVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVE 306 ++ A + + + +++ D E+ A+ R W +E Sbjct: 266 AAKARYKPSEAGIEMAGYLVLLTSLNADDWPPERLASTYRLRWQIE 311 >UniRef50_UPI0001B4D726 transposase IS4 family protein n=1 Tax=Streptomyces hygroscopicus ATCC 53653 RepID=UPI0001B4D726 Length = 464 Score = 40.6 bits (93), Expect = 0.087, Method: Composition-based stats. Identities = 30/258 (11%), Positives = 75/258 (29%), Gaps = 26/258 (10%) Query: 116 SYDKSRRRGAIHVISAFSTMHSLVIGQIKTDEKSNEITAIPELLNMLDIKGKIITTDAMG 175 + + + + + ++G + E+ LL +L G ++ D Sbjct: 138 ATSRPAGYPQVRLTALVECGTRALMGAVFGPMHDKELPQARRLLPVL-RPGILLLADRGY 196 Query: 176 CQKDIAEKIQKQGGDYLFAVKGTQGRLNKAFEEKFPLKELNNPEHDSYAISEKSHGREEI 235 + G D L+ V+ + + P + D +S R Sbjct: 197 DGYEAIRDAASTGADLLWRVQSGR---LLPVIQPLPDGSHLSQILDR-----RSGDRLAA 248 Query: 236 RLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTA--- 292 P + + ++ + + I++ A Sbjct: 249 WQRRKRPTPPPALTAMA--------VRVIRYQVTVTTADGRQHSSTVRLITTLLDPARHP 300 Query: 293 -EKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFK 351 + A W +E ++ L V + ++ R + + G+ +LT ++ + Sbjct: 301 AAELAELYHQRWEIETA-YYGLKVTLR-GSDRVLRSHTVQ---GVEQEIYALLTVFQLTR 355 Query: 352 AGLRRKMRKAAMDRNYLA 369 + A +D + L+ Sbjct: 356 TAIHNTAHIAGLDPDRLS 373 Database: uniref50.fasta Posted date: Mar 8, 2010 10:38 AM Number of letters in database: 1,040,396,356 Number of sequences in database: 3,077,464 Lambda K H 0.309 0.121 0.306 Lambda K H 0.267 0.0371 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,909,517,480 Number of Sequences: 3077464 Number of extensions: 67416074 Number of successful extensions: 212552 Number of sequences better than 1.0e-01: 250 Number of HSP's better than 0.1 without gapping: 558 Number of HSP's successfully gapped in prelim test: 115 Number of HSP's that attempted gapping in prelim test: 210667 Number of HSP's gapped (non-prelim): 713 length of query: 378 length of database: 1,040,396,356 effective HSP length: 130 effective length of query: 248 effective length of database: 640,326,036 effective search space: 158800856928 effective search space used: 158800856928 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.3 bits) S2: 93 (40.6 bits)